[2022-07-09 00:00:04,889][25689] Initializing learners... [2022-07-09 00:00:04,890][25689] Initializing the learner 0 for policy 0 [2022-07-09 00:00:04,910][25974] WARNING! It is recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks. I.e. set --kl_loss_coeff=1.0 [2022-07-09 00:00:07,914][25974] LEARNER pid 25974 parent 25689 [2022-07-09 00:00:07,915][25974] Set environment var CUDA_VISIBLE_DEVICES to '0' for learner process 0 [2022-07-09 00:00:07,924][25974] Visible devices: 1 [2022-07-09 00:00:07,927][25974] Setting fixed seed 0 [2022-07-09 00:00:07,928][25974] Waiting for the learner to initialize... [2022-07-09 00:00:10,292][25974] No checkpoints found [2022-07-09 00:00:10,292][25974] Did not load from checkpoint, starting from scratch! [2022-07-09 00:00:10,293][25974] Broadcast model weights for model version 0 [2022-07-09 00:00:10,296][25974] Learner 0 initialized [2022-07-09 00:00:10,296][25689] Initializing policy workers... [2022-07-09 00:00:10,298][25689] Initializing policy worker 0 for policy 0 [2022-07-09 00:00:10,303][25689] Initializing actors... [2022-07-09 00:00:10,313][26022] Set environment var CUDA_VISIBLE_DEVICES to '0' for inference process 0 [2022-07-09 00:00:10,320][26023] Initializing vector env runner 0... [2022-07-09 00:00:10,322][26023] ACTOR worker 0 pid 26023 parent 25689 [2022-07-09 00:00:10,322][26023] Initializing envs for env runner 0... [2022-07-09 00:00:10,324][26022] Visible devices: 1 [2022-07-09 00:00:10,325][26022] Initializing model on the policy worker 0-0... [2022-07-09 00:00:10,325][26022] POLICY worker 0-0 pid 26022 parent 25689 [2022-07-09 00:00:10,328][26024] Initializing vector env runner 1... [2022-07-09 00:00:10,330][26024] ACTOR worker 1 pid 26024 parent 25689 [2022-07-09 00:00:10,331][26024] Initializing envs for env runner 1... [2022-07-09 00:00:10,356][26025] Initializing vector env runner 2... [2022-07-09 00:00:10,358][26025] ACTOR worker 2 pid 26025 parent 25689 [2022-07-09 00:00:10,359][26025] Initializing envs for env runner 2... [2022-07-09 00:00:10,363][26023] Worker 0 uses CPU cores [0] [2022-07-09 00:00:10,363][26040] Initializing vector env runner 6... [2022-07-09 00:00:10,365][26040] ACTOR worker 6 pid 26040 parent 25689 [2022-07-09 00:00:10,366][26040] Initializing envs for env runner 6... [2022-07-09 00:00:10,368][26036] Initializing vector env runner 4... [2022-07-09 00:00:10,370][26036] ACTOR worker 4 pid 26036 parent 25689 [2022-07-09 00:00:10,372][26038] Initializing vector env runner 5... [2022-07-09 00:00:10,375][26038] ACTOR worker 5 pid 26038 parent 25689 [2022-07-09 00:00:10,376][26038] Initializing envs for env runner 5... [2022-07-09 00:00:10,376][26024] Worker 1 uses CPU cores [1] [2022-07-09 00:00:10,379][26042] Initializing vector env runner 8... [2022-07-09 00:00:10,381][26042] ACTOR worker 8 pid 26042 parent 25689 [2022-07-09 00:00:10,384][26027] Initializing vector env runner 3... [2022-07-09 00:00:10,386][26027] ACTOR worker 3 pid 26027 parent 25689 [2022-07-09 00:00:10,387][26027] Initializing envs for env runner 3... [2022-07-09 00:00:10,392][26041] Initializing vector env runner 7... [2022-07-09 00:00:10,395][26041] ACTOR worker 7 pid 26041 parent 25689 [2022-07-09 00:00:10,400][26044] Initializing vector env runner 9... [2022-07-09 00:00:10,402][26044] ACTOR worker 9 pid 26044 parent 25689 [2022-07-09 00:00:10,404][26061] Initializing vector env runner 11... [2022-07-09 00:00:10,397][26052] Initializing vector env runner 10... [2022-07-09 00:00:10,407][26052] ACTOR worker 10 pid 26052 parent 25689 [2022-07-09 00:00:10,408][26052] Initializing envs for env runner 10... [2022-07-09 00:00:10,411][26036] Initializing envs for env runner 4... [2022-07-09 00:00:10,415][26044] Initializing envs for env runner 9... [2022-07-09 00:00:10,406][26061] ACTOR worker 11 pid 26061 parent 25689 [2022-07-09 00:00:10,420][26061] Initializing envs for env runner 11... [2022-07-09 00:00:10,423][26041] Initializing envs for env runner 7... [2022-07-09 00:00:10,382][26042] Initializing envs for env runner 8... [2022-07-09 00:00:10,440][26063] Initializing vector env runner 12... [2022-07-09 00:00:10,442][26063] ACTOR worker 12 pid 26063 parent 25689 [2022-07-09 00:00:10,472][26063] Initializing envs for env runner 12... [2022-07-09 00:00:10,444][26064] Initializing vector env runner 13... [2022-07-09 00:00:10,475][26064] ACTOR worker 13 pid 26064 parent 25689 [2022-07-09 00:00:10,476][26064] Initializing envs for env runner 13... [2022-07-09 00:00:10,448][26066] Initializing vector env runner 15... [2022-07-09 00:00:10,450][26025] Worker 2 uses CPU cores [2] [2022-07-09 00:00:10,458][26040] Worker 6 uses CPU cores [6] [2022-07-09 00:00:10,479][26066] ACTOR worker 15 pid 26066 parent 25689 [2022-07-09 00:00:10,480][26066] Initializing envs for env runner 15... [2022-07-09 00:00:10,460][26069] Initializing vector env runner 18... [2022-07-09 00:00:10,485][26069] ACTOR worker 18 pid 26069 parent 25689 [2022-07-09 00:00:10,486][26069] Initializing envs for env runner 18... [2022-07-09 00:00:10,484][26067] Initializing vector env runner 16... [2022-07-09 00:00:10,488][26068] Initializing vector env runner 17... [2022-07-09 00:00:10,490][26068] ACTOR worker 17 pid 26068 parent 25689 [2022-07-09 00:00:10,428][26065] Initializing vector env runner 14... [2022-07-09 00:00:10,491][26065] ACTOR worker 14 pid 26065 parent 25689 [2022-07-09 00:00:10,492][26065] Initializing envs for env runner 14... [2022-07-09 00:00:10,510][26086] Initializing vector env runner 21... [2022-07-09 00:00:10,512][26086] ACTOR worker 21 pid 26086 parent 25689 [2022-07-09 00:00:10,513][26086] Initializing envs for env runner 21... [2022-07-09 00:00:10,516][26085] Initializing vector env runner 20... [2022-07-09 00:00:10,518][26085] ACTOR worker 20 pid 26085 parent 25689 [2022-07-09 00:00:10,486][26067] ACTOR worker 16 pid 26067 parent 25689 [2022-07-09 00:00:10,524][26067] Initializing envs for env runner 16... [2022-07-09 00:00:10,528][26077] Initializing vector env runner 19... [2022-07-09 00:00:10,530][26077] ACTOR worker 19 pid 26077 parent 25689 [2022-07-09 00:00:10,531][26068] Initializing envs for env runner 17... [2022-07-09 00:00:10,539][26085] Initializing envs for env runner 20... [2022-07-09 00:00:10,544][26094] Initializing vector env runner 22... [2022-07-09 00:00:10,546][26094] ACTOR worker 22 pid 26094 parent 25689 [2022-07-09 00:00:10,555][26077] Initializing envs for env runner 19... [2022-07-09 00:00:10,553][26044] Worker 9 uses CPU cores [1] [2022-07-09 00:00:10,563][26052] Worker 10 uses CPU cores [2] [2022-07-09 00:00:10,568][26027] Worker 3 uses CPU cores [3] [2022-07-09 00:00:10,573][26036] Worker 4 uses CPU cores [4] [2022-07-09 00:00:10,576][26100] Initializing vector env runner 23... [2022-07-09 00:00:10,578][26100] ACTOR worker 23 pid 26100 parent 25689 [2022-07-09 00:00:10,579][26094] Initializing envs for env runner 22... [2022-07-09 00:00:10,597][26038] Worker 5 uses CPU cores [5] [2022-07-09 00:00:10,602][26042] Worker 8 uses CPU cores [0] [2022-07-09 00:00:10,598][26100] Initializing envs for env runner 23... [2022-07-09 00:00:10,622][26065] Worker 14 uses CPU cores [6] [2022-07-09 00:00:10,608][26122] Initializing vector env runner 24... [2022-07-09 00:00:10,681][26122] ACTOR worker 24 pid 26122 parent 25689 [2022-07-09 00:00:10,626][26061] Worker 11 uses CPU cores [3] [2022-07-09 00:00:10,650][26069] Worker 18 uses CPU cores [2] [2022-07-09 00:00:10,683][26122] Initializing envs for env runner 24... [2022-07-09 00:00:10,668][26140] Initializing vector env runner 25... [2022-07-09 00:00:10,665][26158] Initializing vector env runner 26... [2022-07-09 00:00:10,691][26140] ACTOR worker 25 pid 26140 parent 25689 [2022-07-09 00:00:10,622][26041] Worker 7 uses CPU cores [7] [2022-07-09 00:00:10,691][26158] ACTOR worker 26 pid 26158 parent 25689 [2022-07-09 00:00:10,692][26140] Initializing envs for env runner 25... [2022-07-09 00:00:10,695][26158] Initializing envs for env runner 26... [2022-07-09 00:00:10,715][26064] Worker 13 uses CPU cores [5] [2022-07-09 00:00:10,723][26066] Worker 15 uses CPU cores [7] [2022-07-09 00:00:10,731][26063] Worker 12 uses CPU cores [4] [2022-07-09 00:00:10,757][26197] Initializing vector env runner 27... [2022-07-09 00:00:10,759][26197] ACTOR worker 27 pid 26197 parent 25689 [2022-07-09 00:00:10,760][26197] Initializing envs for env runner 27... [2022-07-09 00:00:10,763][26067] Worker 16 uses CPU cores [0] [2022-07-09 00:00:10,801][26077] Worker 19 uses CPU cores [3] [2022-07-09 00:00:10,792][26212] Initializing vector env runner 28... [2022-07-09 00:00:10,801][26085] Worker 20 uses CPU cores [4] [2022-07-09 00:00:10,802][26068] Worker 17 uses CPU cores [1] [2022-07-09 00:00:10,809][26212] ACTOR worker 28 pid 26212 parent 25689 [2022-07-09 00:00:10,811][26212] Initializing envs for env runner 28... [2022-07-09 00:00:10,813][26086] Worker 21 uses CPU cores [5] [2022-07-09 00:00:10,820][26214] Initializing vector env runner 29... [2022-07-09 00:00:10,825][26094] Worker 22 uses CPU cores [6] [2022-07-09 00:00:10,822][26214] ACTOR worker 29 pid 26214 parent 25689 [2022-07-09 00:00:10,860][26214] Initializing envs for env runner 29... [2022-07-09 00:00:10,862][26122] Worker 24 uses CPU cores [0] [2022-07-09 00:00:10,892][26251] Initializing vector env runner 30... [2022-07-09 00:00:10,894][26251] ACTOR worker 30 pid 26251 parent 25689 [2022-07-09 00:00:10,898][26100] Worker 23 uses CPU cores [7] [2022-07-09 00:00:10,923][26251] Initializing envs for env runner 30... [2022-07-09 00:00:10,934][26260] Initializing vector env runner 31... [2022-07-09 00:00:10,939][26260] ACTOR worker 31 pid 26260 parent 25689 [2022-07-09 00:00:10,944][26260] Initializing envs for env runner 31... [2022-07-09 00:00:10,944][26140] Worker 25 uses CPU cores [1] [2022-07-09 00:00:10,977][26158] Worker 26 uses CPU cores [2] [2022-07-09 00:00:10,979][26283] Initializing vector env runner 32... [2022-07-09 00:00:10,981][26283] ACTOR worker 32 pid 26283 parent 25689 [2022-07-09 00:00:10,982][26283] Initializing envs for env runner 32... [2022-07-09 00:00:10,982][26214] Worker 29 uses CPU cores [5] [2022-07-09 00:00:11,016][26197] Worker 27 uses CPU cores [3] [2022-07-09 00:00:11,012][26286] Initializing vector env runner 33... [2022-07-09 00:00:11,018][26212] Worker 28 uses CPU cores [4] [2022-07-09 00:00:11,055][26286] ACTOR worker 33 pid 26286 parent 25689 [2022-07-09 00:00:11,056][26286] Initializing envs for env runner 33... [2022-07-09 00:00:11,052][26251] Worker 30 uses CPU cores [6] [2022-07-09 00:00:11,068][26317] Initializing vector env runner 35... [2022-07-09 00:00:11,144][26317] ACTOR worker 35 pid 26317 parent 25689 [2022-07-09 00:00:11,146][26317] Initializing envs for env runner 35... [2022-07-09 00:00:11,148][26286] Worker 33 uses CPU cores [2, 3] [2022-07-09 00:00:11,074][26309] Initializing vector env runner 34... [2022-07-09 00:00:11,151][26309] ACTOR worker 34 pid 26309 parent 25689 [2022-07-09 00:00:11,152][26309] Initializing envs for env runner 34... [2022-07-09 00:00:11,161][26283] Worker 32 uses CPU cores [0, 1] [2022-07-09 00:00:11,126][26260] Worker 31 uses CPU cores [7] [2022-07-09 00:00:11,241][26309] Worker 34 uses CPU cores [4, 5] [2022-07-09 00:00:11,256][26317] Worker 35 uses CPU cores [6, 7] [2022-07-09 00:00:17,206][26022] Initialized model on the policy worker 0-0! [2022-07-09 00:00:17,206][26022] Min num requests: 12 [2022-07-09 00:00:34,077][26251] Decorrelating experience for 128 frames... [2022-07-09 00:00:34,091][26065] Decorrelating experience for 256 frames... [2022-07-09 00:00:34,123][26040] Decorrelating experience for 384 frames... [2022-07-09 00:00:34,173][26212] Decorrelating experience for 0 frames... [2022-07-09 00:00:34,179][25689] Progress for 36 workers: 1/144 envs initialized... [2022-07-09 00:00:34,181][26212] Decorrelating experience for 384 frames... [2022-07-09 00:00:34,192][26036] Decorrelating experience for 256 frames... [2022-07-09 00:00:34,209][26094] Decorrelating experience for 0 frames... [2022-07-09 00:00:34,209][25689] Progress for 36 workers: 2/144 envs initialized... [2022-07-09 00:00:34,210][26094] Decorrelating experience for 128 frames... [2022-07-09 00:00:34,269][26063] Decorrelating experience for 384 frames... [2022-07-09 00:00:34,356][26085] Decorrelating experience for 0 frames... [2022-07-09 00:00:34,357][25689] Progress for 36 workers: 3/144 envs initialized... [2022-07-09 00:00:34,359][26085] Decorrelating experience for 128 frames... [2022-07-09 00:00:34,468][26064] Decorrelating experience for 512 frames... [2022-07-09 00:00:34,476][26038] Decorrelating experience for 384 frames... [2022-07-09 00:00:34,481][26086] Decorrelating experience for 256 frames... [2022-07-09 00:00:34,648][26214] Decorrelating experience for 256 frames... [2022-07-09 00:00:35,260][26068] Decorrelating experience for 256 frames... [2022-07-09 00:00:35,272][26044] Decorrelating experience for 384 frames... [2022-07-09 00:00:35,277][26023] Decorrelating experience for 0 frames... [2022-07-09 00:00:35,280][25689] Progress for 36 workers: 4/144 envs initialized... [2022-07-09 00:00:35,286][26023] Decorrelating experience for 512 frames... [2022-07-09 00:00:35,299][26024] Decorrelating experience for 384 frames... [2022-07-09 00:00:35,337][26122] Decorrelating experience for 256 frames... [2022-07-09 00:00:35,340][26042] Decorrelating experience for 0 frames... [2022-07-09 00:00:35,352][26140] Decorrelating experience for 384 frames... [2022-07-09 00:00:35,353][25689] Progress for 36 workers: 5/144 envs initialized... [2022-07-09 00:00:35,355][26042] Decorrelating experience for 512 frames... [2022-07-09 00:00:35,407][26067] Decorrelating experience for 0 frames... [2022-07-09 00:00:35,408][25689] Progress for 36 workers: 6/144 envs initialized... [2022-07-09 00:00:35,410][26067] Decorrelating experience for 512 frames... [2022-07-09 00:00:36,128][26309] Decorrelating experience for 256 frames... [2022-07-09 00:00:36,224][26100] Decorrelating experience for 256 frames... [2022-07-09 00:00:36,246][26066] Decorrelating experience for 128 frames... [2022-07-09 00:00:36,281][26061] Decorrelating experience for 512 frames... [2022-07-09 00:00:36,305][26027] Decorrelating experience for 384 frames... [2022-07-09 00:00:36,326][26041] Decorrelating experience for 256 frames... [2022-07-09 00:00:36,370][26260] Decorrelating experience for 512 frames... [2022-07-09 00:00:36,454][26077] Decorrelating experience for 512 frames... [2022-07-09 00:00:36,484][26197] Decorrelating experience for 128 frames... [2022-07-09 00:00:36,935][26158] Decorrelating experience for 128 frames... [2022-07-09 00:00:36,934][26025] Decorrelating experience for 128 frames... [2022-07-09 00:00:36,942][26069] Decorrelating experience for 256 frames... [2022-07-09 00:00:36,968][26052] Decorrelating experience for 128 frames... [2022-07-09 00:00:36,994][26283] Decorrelating experience for 384 frames... [2022-07-09 00:00:37,964][26286] Decorrelating experience for 256 frames... [2022-07-09 00:00:38,511][26317] Decorrelating experience for 384 frames... [2022-07-09 00:01:47,551][25689] Progress for 36 workers: 7/144 envs initialized... [2022-07-09 00:01:47,552][26085] Decorrelating experience for 640 frames... [2022-07-09 00:01:47,847][25689] Progress for 36 workers: 8/144 envs initialized... [2022-07-09 00:01:47,848][26036] Decorrelating experience for 384 frames... [2022-07-09 00:01:48,475][25689] Progress for 36 workers: 9/144 envs initialized... [2022-07-09 00:01:48,476][26212] Decorrelating experience for 512 frames... [2022-07-09 00:01:48,706][25689] Progress for 36 workers: 10/144 envs initialized... [2022-07-09 00:01:48,711][26063] Decorrelating experience for 640 frames... [2022-07-09 00:01:49,243][25689] Progress for 36 workers: 11/144 envs initialized... [2022-07-09 00:01:49,244][26068] Decorrelating experience for 640 frames... [2022-07-09 00:01:49,747][25689] Progress for 36 workers: 12/144 envs initialized... [2022-07-09 00:01:49,749][26036] Decorrelating experience for 768 frames... [2022-07-09 00:01:49,843][25689] Progress for 36 workers: 13/144 envs initialized... [2022-07-09 00:01:49,844][26024] Decorrelating experience for 384 frames... [2022-07-09 00:01:49,867][25689] Progress for 36 workers: 14/144 envs initialized... [2022-07-09 00:01:49,868][26044] Decorrelating experience for 256 frames... [2022-07-09 00:01:50,139][25689] Progress for 36 workers: 15/144 envs initialized... [2022-07-09 00:01:50,140][26140] Decorrelating experience for 384 frames... [2022-07-09 00:01:50,699][25689] Progress for 36 workers: 16/144 envs initialized... [2022-07-09 00:01:50,700][26085] Decorrelating experience for 768 frames... [2022-07-09 00:01:50,959][25689] Progress for 36 workers: 17/144 envs initialized... [2022-07-09 00:01:50,967][26212] Decorrelating experience for 512 frames... [2022-07-09 00:01:51,087][25689] Progress for 36 workers: 18/144 envs initialized... [2022-07-09 00:01:51,088][26044] Decorrelating experience for 640 frames... [2022-07-09 00:01:51,648][25689] Progress for 36 workers: 19/144 envs initialized... [2022-07-09 00:01:51,650][26024] Decorrelating experience for 384 frames... [2022-07-09 00:01:51,808][25689] Progress for 36 workers: 20/144 envs initialized... [2022-07-09 00:01:51,815][26063] Decorrelating experience for 640 frames... [2022-07-09 00:01:51,963][25689] Progress for 36 workers: 21/144 envs initialized... [2022-07-09 00:01:51,964][26140] Decorrelating experience for 512 frames... [2022-07-09 00:01:52,305][25689] Progress for 36 workers: 22/144 envs initialized... [2022-07-09 00:01:52,307][26068] Decorrelating experience for 640 frames... [2022-07-09 00:01:52,335][25689] Progress for 36 workers: 23/144 envs initialized... [2022-07-09 00:01:52,336][26066] Decorrelating experience for 128 frames... [2022-07-09 00:01:52,780][25689] Progress for 36 workers: 24/144 envs initialized... [2022-07-09 00:01:52,797][26025] Decorrelating experience for 128 frames... [2022-07-09 00:01:52,971][25689] Progress for 36 workers: 25/144 envs initialized... [2022-07-09 00:01:52,979][25689] Progress for 36 workers: 26/144 envs initialized... [2022-07-09 00:01:52,995][26052] Decorrelating experience for 128 frames... [2022-07-09 00:01:52,997][26158] Decorrelating experience for 512 frames... [2022-07-09 00:01:53,094][25689] Progress for 36 workers: 27/144 envs initialized... [2022-07-09 00:01:53,107][26066] Decorrelating experience for 384 frames... [2022-07-09 00:01:53,141][25689] Progress for 36 workers: 28/144 envs initialized... [2022-07-09 00:01:53,142][26100] Decorrelating experience for 128 frames... [2022-07-09 00:01:53,325][25689] Progress for 36 workers: 29/144 envs initialized... [2022-07-09 00:01:53,325][26041] Decorrelating experience for 256 frames... [2022-07-09 00:01:53,443][25689] Progress for 36 workers: 30/144 envs initialized... [2022-07-09 00:01:53,443][26212] Finished reset for worker 28 [2022-07-09 00:01:53,454][25689] Progress for 36 workers: 31/144 envs initialized... [2022-07-09 00:01:53,464][25689] Progress for 36 workers: 32/144 envs initialized... [2022-07-09 00:01:53,466][26036] Decorrelating experience for 896 frames... [2022-07-09 00:01:53,467][26024] Decorrelating experience for 640 frames... [2022-07-09 00:01:53,585][25689] Progress for 36 workers: 33/144 envs initialized... [2022-07-09 00:01:53,604][26025] Decorrelating experience for 512 frames... [2022-07-09 00:01:53,614][25689] Progress for 36 workers: 34/144 envs initialized... [2022-07-09 00:01:53,619][26069] Decorrelating experience for 384 frames... [2022-07-09 00:01:53,733][25689] Progress for 36 workers: 35/144 envs initialized... [2022-07-09 00:01:53,747][26251] Decorrelating experience for 512 frames... [2022-07-09 00:01:53,785][25689] Progress for 36 workers: 36/144 envs initialized... [2022-07-09 00:01:53,795][26052] Decorrelating experience for 768 frames... [2022-07-09 00:01:53,939][25689] Progress for 36 workers: 37/144 envs initialized... [2022-07-09 00:01:53,955][26100] Decorrelating experience for 256 frames... [2022-07-09 00:01:54,102][25689] Progress for 36 workers: 38/144 envs initialized... [2022-07-09 00:01:54,102][26044] Decorrelating experience for 512 frames... [2022-07-09 00:01:54,132][25689] Progress for 36 workers: 39/144 envs initialized... [2022-07-09 00:01:54,134][26094] Decorrelating experience for 512 frames... [2022-07-09 00:01:54,246][25689] Progress for 36 workers: 40/144 envs initialized... [2022-07-09 00:01:54,247][26085] Finished reset for worker 20 [2022-07-09 00:01:54,346][25689] Progress for 36 workers: 41/144 envs initialized... [2022-07-09 00:01:54,347][26065] Decorrelating experience for 256 frames... [2022-07-09 00:01:54,374][25689] Progress for 36 workers: 42/144 envs initialized... [2022-07-09 00:01:54,387][26140] Decorrelating experience for 512 frames... [2022-07-09 00:01:54,432][25689] Progress for 36 workers: 43/144 envs initialized... [2022-07-09 00:01:54,433][26063] Decorrelating experience for 896 frames... [2022-07-09 00:01:54,891][25689] Progress for 36 workers: 44/144 envs initialized... [2022-07-09 00:01:54,891][26260] Decorrelating experience for 256 frames... [2022-07-09 00:01:54,904][25689] Progress for 36 workers: 45/144 envs initialized... [2022-07-09 00:01:54,905][26041] Decorrelating experience for 768 frames... [2022-07-09 00:01:55,178][25689] Progress for 36 workers: 46/144 envs initialized... [2022-07-09 00:01:55,189][26040] Decorrelating experience for 384 frames... [2022-07-09 00:01:55,313][25689] Progress for 36 workers: 47/144 envs initialized... [2022-07-09 00:01:55,319][26068] Decorrelating experience for 512 frames... [2022-07-09 00:01:55,627][25689] Progress for 36 workers: 48/144 envs initialized... [2022-07-09 00:01:55,627][26066] Decorrelating experience for 384 frames... [2022-07-09 00:01:55,661][25689] Progress for 36 workers: 49/144 envs initialized... [2022-07-09 00:01:55,661][26197] Decorrelating experience for 128 frames... [2022-07-09 00:01:55,674][25689] Progress for 36 workers: 50/144 envs initialized... [2022-07-09 00:01:55,679][26100] Decorrelating experience for 896 frames... [2022-07-09 00:01:55,888][25689] Progress for 36 workers: 51/144 envs initialized... [2022-07-09 00:01:55,889][26036] Finished reset for worker 4 [2022-07-09 00:01:55,895][25689] Progress for 36 workers: 52/144 envs initialized... [2022-07-09 00:01:55,896][26065] Decorrelating experience for 768 frames... [2022-07-09 00:01:56,090][25689] Progress for 36 workers: 53/144 envs initialized... [2022-07-09 00:01:56,091][26069] Decorrelating experience for 640 frames... [2022-07-09 00:01:56,182][25689] Progress for 36 workers: 54/144 envs initialized... [2022-07-09 00:01:56,203][26158] Decorrelating experience for 640 frames... [2022-07-09 00:01:56,240][25689] Progress for 36 workers: 55/144 envs initialized... [2022-07-09 00:01:56,241][26063] Finished reset for worker 12 [2022-07-09 00:01:56,451][25689] Progress for 36 workers: 56/144 envs initialized... [2022-07-09 00:01:56,452][26260] Decorrelating experience for 512 frames... [2022-07-09 00:01:56,455][25689] Progress for 36 workers: 57/144 envs initialized... [2022-07-09 00:01:56,456][26197] Decorrelating experience for 256 frames... [2022-07-09 00:01:56,659][25689] Progress for 36 workers: 58/144 envs initialized... [2022-07-09 00:01:56,660][26025] Decorrelating experience for 640 frames... [2022-07-09 00:01:56,669][25689] Progress for 36 workers: 59/144 envs initialized... [2022-07-09 00:01:56,670][26251] Decorrelating experience for 512 frames... [2022-07-09 00:01:56,676][25689] Progress for 36 workers: 60/144 envs initialized... [2022-07-09 00:01:56,676][26024] Finished reset for worker 1 [2022-07-09 00:01:56,769][25689] Progress for 36 workers: 61/144 envs initialized... [2022-07-09 00:01:56,769][26044] Finished reset for worker 9 [2022-07-09 00:01:56,943][25689] Progress for 36 workers: 62/144 envs initialized... [2022-07-09 00:01:56,944][26140] Finished reset for worker 25 [2022-07-09 00:01:56,976][25689] Progress for 36 workers: 63/144 envs initialized... [2022-07-09 00:01:56,977][26027] Decorrelating experience for 128 frames... [2022-07-09 00:01:57,172][25689] Progress for 36 workers: 64/144 envs initialized... [2022-07-09 00:01:57,173][26094] Decorrelating experience for 384 frames... [2022-07-09 00:01:57,429][25689] Progress for 36 workers: 65/144 envs initialized... [2022-07-09 00:01:57,429][25689] Progress for 36 workers: 66/144 envs initialized... [2022-07-09 00:01:57,429][26066] Finished reset for worker 15 [2022-07-09 00:01:57,430][26068] Finished reset for worker 17 [2022-07-09 00:01:57,453][25689] Progress for 36 workers: 67/144 envs initialized... [2022-07-09 00:01:57,454][26040] Decorrelating experience for 768 frames... [2022-07-09 00:01:57,726][25689] Progress for 36 workers: 68/144 envs initialized... [2022-07-09 00:01:57,742][25689] Progress for 36 workers: 69/144 envs initialized... [2022-07-09 00:01:57,726][26027] Decorrelating experience for 384 frames... [2022-07-09 00:01:57,742][26061] Decorrelating experience for 256 frames... [2022-07-09 00:01:57,934][25689] Progress for 36 workers: 70/144 envs initialized... [2022-07-09 00:01:57,950][25689] Progress for 36 workers: 71/144 envs initialized... [2022-07-09 00:01:57,934][26077] Decorrelating experience for 512 frames... [2022-07-09 00:01:57,967][26197] Decorrelating experience for 640 frames... [2022-07-09 00:01:57,979][25689] Progress for 36 workers: 72/144 envs initialized... [2022-07-09 00:01:57,980][26052] Decorrelating experience for 512 frames... [2022-07-09 00:01:58,457][25689] Progress for 36 workers: 73/144 envs initialized... [2022-07-09 00:01:58,465][26041] Decorrelating experience for 384 frames... [2022-07-09 00:01:58,506][25689] Progress for 36 workers: 74/144 envs initialized... [2022-07-09 00:01:58,515][26260] Decorrelating experience for 768 frames... [2022-07-09 00:01:58,961][25689] Progress for 36 workers: 75/144 envs initialized... [2022-07-09 00:01:58,962][26283] Decorrelating experience for 640 frames... [2022-07-09 00:01:59,013][25689] Progress for 36 workers: 76/144 envs initialized... [2022-07-09 00:01:59,013][26069] Decorrelating experience for 896 frames... [2022-07-09 00:01:59,105][25689] Progress for 36 workers: 77/144 envs initialized... [2022-07-09 00:01:59,106][26158] Decorrelating experience for 896 frames... [2022-07-09 00:01:59,224][25689] Progress for 36 workers: 78/144 envs initialized... [2022-07-09 00:01:59,224][26100] Finished reset for worker 23 [2022-07-09 00:01:59,294][25689] Progress for 36 workers: 79/144 envs initialized... [2022-07-09 00:01:59,315][26061] Decorrelating experience for 640 frames... [2022-07-09 00:01:59,383][25689] Progress for 36 workers: 80/144 envs initialized... [2022-07-09 00:01:59,383][26086] Decorrelating experience for 128 frames... [2022-07-09 00:01:59,425][25689] Progress for 36 workers: 81/144 envs initialized... [2022-07-09 00:01:59,426][26094] Finished reset for worker 22 [2022-07-09 00:01:59,555][26025] Finished reset for worker 2 [2022-07-09 00:01:59,555][25689] Progress for 36 workers: 82/144 envs initialized... [2022-07-09 00:01:59,610][25689] Progress for 36 workers: 83/144 envs initialized... [2022-07-09 00:01:59,612][26251] Decorrelating experience for 512 frames... [2022-07-09 00:01:59,619][25689] Progress for 36 workers: 84/144 envs initialized... [2022-07-09 00:01:59,619][26041] Finished reset for worker 7 [2022-07-09 00:01:59,688][25689] Progress for 36 workers: 85/144 envs initialized... [2022-07-09 00:01:59,689][26283] Decorrelating experience for 384 frames... [2022-07-09 00:01:59,919][25689] Progress for 36 workers: 86/144 envs initialized... [2022-07-09 00:01:59,920][26214] Decorrelating experience for 384 frames... [2022-07-09 00:01:59,985][25689] Progress for 36 workers: 87/144 envs initialized... [2022-07-09 00:01:59,986][26027] Decorrelating experience for 384 frames... [2022-07-09 00:02:00,067][25689] Progress for 36 workers: 88/144 envs initialized... [2022-07-09 00:02:00,067][26260] Finished reset for worker 31 [2022-07-09 00:02:00,119][25689] Progress for 36 workers: 89/144 envs initialized... [2022-07-09 00:02:00,120][26283] Decorrelating experience for 384 frames... [2022-07-09 00:02:00,129][25689] Progress for 36 workers: 90/144 envs initialized... [2022-07-09 00:02:00,129][26038] Decorrelating experience for 384 frames... [2022-07-09 00:02:00,134][25689] Progress for 36 workers: 91/144 envs initialized... [2022-07-09 00:02:00,151][26086] Decorrelating experience for 256 frames... [2022-07-09 00:02:00,169][25689] Progress for 36 workers: 92/144 envs initialized... [2022-07-09 00:02:00,170][26065] Decorrelating experience for 640 frames... [2022-07-09 00:02:00,172][25689] Progress for 36 workers: 93/144 envs initialized... [2022-07-09 00:02:00,172][26052] Finished reset for worker 10 [2022-07-09 00:02:00,569][25689] Progress for 36 workers: 94/144 envs initialized... [2022-07-09 00:02:00,570][26283] Finished reset for worker 32 [2022-07-09 00:02:00,870][25689] Progress for 36 workers: 95/144 envs initialized... [2022-07-09 00:02:00,883][26122] Decorrelating experience for 256 frames... [2022-07-09 00:02:00,892][25689] Progress for 36 workers: 96/144 envs initialized... [2022-07-09 00:02:00,892][26064] Decorrelating experience for 256 frames... [2022-07-09 00:02:00,947][25689] Progress for 36 workers: 97/144 envs initialized... [2022-07-09 00:02:00,947][26077] Decorrelating experience for 512 frames... [2022-07-09 00:02:01,069][25689] Progress for 36 workers: 98/144 envs initialized... [2022-07-09 00:02:01,069][26040] Decorrelating experience for 512 frames... [2022-07-09 00:02:01,319][25689] Progress for 36 workers: 99/144 envs initialized... [2022-07-09 00:02:01,335][26086] Decorrelating experience for 896 frames... [2022-07-09 00:02:01,664][26197] Finished reset for worker 27 [2022-07-09 00:02:01,664][25689] Progress for 36 workers: 100/144 envs initialized... [2022-07-09 00:02:01,878][25689] Progress for 36 workers: 101/144 envs initialized... [2022-07-09 00:02:01,878][26317] Decorrelating experience for 512 frames... [2022-07-09 00:02:01,927][25689] Progress for 36 workers: 102/144 envs initialized... [2022-07-09 00:02:01,927][26251] Finished reset for worker 30 [2022-07-09 00:02:01,968][25689] Progress for 36 workers: 103/144 envs initialized... [2022-07-09 00:02:01,968][26214] Decorrelating experience for 640 frames... [2022-07-09 00:02:02,081][25689] Progress for 36 workers: 104/144 envs initialized... [2022-07-09 00:02:02,081][26038] Decorrelating experience for 768 frames... [2022-07-09 00:02:02,128][26027] Finished reset for worker 3 [2022-07-09 00:02:02,128][25689] Progress for 36 workers: 105/144 envs initialized... [2022-07-09 00:02:02,131][25689] Progress for 36 workers: 106/144 envs initialized... [2022-07-09 00:02:02,131][26069] Finished reset for worker 18 [2022-07-09 00:02:02,146][25689] Progress for 36 workers: 107/144 envs initialized... [2022-07-09 00:02:02,146][26158] Finished reset for worker 26 [2022-07-09 00:02:02,194][25689] Progress for 36 workers: 108/144 envs initialized... [2022-07-09 00:02:02,196][26042] Decorrelating experience for 256 frames... [2022-07-09 00:02:02,196][25689] Progress for 36 workers: 109/144 envs initialized... [2022-07-09 00:02:02,207][26023] Decorrelating experience for 640 frames... [2022-07-09 00:02:02,256][25689] Progress for 36 workers: 110/144 envs initialized... [2022-07-09 00:02:02,257][26064] Decorrelating experience for 256 frames... [2022-07-09 00:02:02,425][25689] Progress for 36 workers: 111/144 envs initialized... [2022-07-09 00:02:02,426][26122] Decorrelating experience for 512 frames... [2022-07-09 00:02:02,451][25689] Progress for 36 workers: 112/144 envs initialized... [2022-07-09 00:02:02,452][26317] Decorrelating experience for 256 frames... [2022-07-09 00:02:02,475][25689] Progress for 36 workers: 113/144 envs initialized... [2022-07-09 00:02:02,475][26061] Decorrelating experience for 896 frames... [2022-07-09 00:02:02,501][26065] Finished reset for worker 14 [2022-07-09 00:02:02,501][25689] Progress for 36 workers: 114/144 envs initialized... [2022-07-09 00:02:02,541][25689] Progress for 36 workers: 115/144 envs initialized... [2022-07-09 00:02:02,542][26067] Decorrelating experience for 512 frames... [2022-07-09 00:02:02,644][25689] Progress for 36 workers: 116/144 envs initialized... [2022-07-09 00:02:02,644][26040] Finished reset for worker 6 [2022-07-09 00:02:02,696][25689] Progress for 36 workers: 117/144 envs initialized... [2022-07-09 00:02:02,696][26286] Decorrelating experience for 512 frames... [2022-07-09 00:02:02,736][25689] Progress for 36 workers: 118/144 envs initialized... [2022-07-09 00:02:02,737][26317] Decorrelating experience for 896 frames... [2022-07-09 00:02:02,766][25689] Progress for 36 workers: 119/144 envs initialized... [2022-07-09 00:02:02,783][26309] Decorrelating experience for 384 frames... [2022-07-09 00:02:02,827][25689] Progress for 36 workers: 120/144 envs initialized... [2022-07-09 00:02:02,827][26077] Decorrelating experience for 896 frames... [2022-07-09 00:02:03,067][25689] Progress for 36 workers: 121/144 envs initialized... [2022-07-09 00:02:03,067][26286] Decorrelating experience for 768 frames... [2022-07-09 00:02:03,190][25689] Progress for 36 workers: 122/144 envs initialized... [2022-07-09 00:02:03,190][26064] Decorrelating experience for 640 frames... [2022-07-09 00:02:03,364][25689] Progress for 36 workers: 123/144 envs initialized... [2022-07-09 00:02:03,364][26042] Decorrelating experience for 768 frames... [2022-07-09 00:02:03,402][25689] Progress for 36 workers: 124/144 envs initialized... [2022-07-09 00:02:03,402][26309] Decorrelating experience for 256 frames... [2022-07-09 00:02:03,686][25689] Progress for 36 workers: 125/144 envs initialized... [2022-07-09 00:02:03,686][26309] Decorrelating experience for 640 frames... [2022-07-09 00:02:03,741][25689] Progress for 36 workers: 126/144 envs initialized... [2022-07-09 00:02:03,741][26317] Finished reset for worker 35 [2022-07-09 00:02:03,952][25689] Progress for 36 workers: 127/144 envs initialized... [2022-07-09 00:02:03,953][26286] Decorrelating experience for 768 frames... [2022-07-09 00:02:04,057][25689] Progress for 36 workers: 128/144 envs initialized... [2022-07-09 00:02:04,058][26214] Decorrelating experience for 896 frames... [2022-07-09 00:02:04,215][25689] Progress for 36 workers: 129/144 envs initialized... [2022-07-09 00:02:04,215][26061] Finished reset for worker 11 [2022-07-09 00:02:04,327][26077] Finished reset for worker 19 [2022-07-09 00:02:04,327][25689] Progress for 36 workers: 130/144 envs initialized... [2022-07-09 00:02:04,407][25689] Progress for 36 workers: 131/144 envs initialized... [2022-07-09 00:02:04,407][26309] Finished reset for worker 34 [2022-07-09 00:02:04,540][25689] Progress for 36 workers: 132/144 envs initialized... [2022-07-09 00:02:04,541][26038] Decorrelating experience for 384 frames... [2022-07-09 00:02:04,559][25689] Progress for 36 workers: 133/144 envs initialized... [2022-07-09 00:02:04,559][26086] Finished reset for worker 21 [2022-07-09 00:02:04,635][25689] Progress for 36 workers: 134/144 envs initialized... [2022-07-09 00:02:04,636][26122] Decorrelating experience for 512 frames... [2022-07-09 00:02:04,675][25689] Progress for 36 workers: 135/144 envs initialized... [2022-07-09 00:02:04,674][26286] Finished reset for worker 33 [2022-07-09 00:02:04,697][25689] Progress for 36 workers: 136/144 envs initialized... [2022-07-09 00:02:04,698][26067] Decorrelating experience for 896 frames... [2022-07-09 00:02:04,865][25689] Progress for 36 workers: 137/144 envs initialized... [2022-07-09 00:02:04,865][26023] Decorrelating experience for 384 frames... [2022-07-09 00:02:04,979][25689] Progress for 36 workers: 138/144 envs initialized... [2022-07-09 00:02:04,979][26064] Finished reset for worker 13 [2022-07-09 00:02:05,256][26038] Finished reset for worker 5 [2022-07-09 00:02:05,256][25689] Progress for 36 workers: 139/144 envs initialized... [2022-07-09 00:02:05,513][26214] Finished reset for worker 29 [2022-07-09 00:02:05,513][25689] Progress for 36 workers: 140/144 envs initialized... [2022-07-09 00:02:05,986][26023] Finished reset for worker 0 [2022-07-09 00:02:05,986][25689] Progress for 36 workers: 141/144 envs initialized... [2022-07-09 00:02:05,986][25689] Progress for 36 workers: 142/144 envs initialized... [2022-07-09 00:02:05,986][26042] Finished reset for worker 8 [2022-07-09 00:02:06,030][25689] Progress for 36 workers: 143/144 envs initialized... [2022-07-09 00:02:06,030][26122] Finished reset for worker 24 [2022-07-09 00:02:06,347][25689] Progress for 36 workers: 144/144 envs initialized... [2022-07-09 00:02:06,347][26067] Finished reset for worker 16 [2022-07-09 00:02:06,348][25689] Waiting for policy worker 0-0 to finish initialization... [2022-07-09 00:02:06,349][26022] Policy worker 0-0 initialized [2022-07-09 00:02:06,349][25689] Policy worker 0-0 initialized! [2022-07-09 00:02:06,350][25689] Collecting experience... [2022-07-09 00:02:09,303][26040] Worker 6, sleep for 1.667 sec to decorrelate experience collection [2022-07-09 00:02:09,383][26094] Worker 22, sleep for 6.111 sec to decorrelate experience collection [2022-07-09 00:02:09,387][26041] Worker 7, sleep for 1.944 sec to decorrelate experience collection [2022-07-09 00:02:09,395][26065] Worker 14, sleep for 3.889 sec to decorrelate experience collection [2022-07-09 00:02:09,400][26086] Worker 21, sleep for 5.833 sec to decorrelate experience collection [2022-07-09 00:02:09,408][26214] Worker 29, sleep for 8.056 sec to decorrelate experience collection [2022-07-09 00:02:09,410][25974] Allocating new CPU tensor batch (could not get from the pool) [2022-07-09 00:02:09,419][25974] Waiting for the first batch to be processed [2022-07-09 00:02:09,426][26038] Worker 5, sleep for 1.389 sec to decorrelate experience collection [2022-07-09 00:02:09,446][26025] Worker 2, sleep for 0.556 sec to decorrelate experience collection [2022-07-09 00:02:09,457][26064] Worker 13, sleep for 3.611 sec to decorrelate experience collection [2022-07-09 00:02:09,465][26251] Worker 30, sleep for 8.333 sec to decorrelate experience collection [2022-07-09 00:02:09,483][26158] Worker 26, sleep for 7.222 sec to decorrelate experience collection [2022-07-09 00:02:09,482][26260] Worker 31, sleep for 8.611 sec to decorrelate experience collection [2022-07-09 00:02:09,504][26066] Worker 15, sleep for 4.167 sec to decorrelate experience collection [2022-07-09 00:02:09,512][26069] Worker 18, sleep for 5.000 sec to decorrelate experience collection [2022-07-09 00:02:09,527][26100] Worker 23, sleep for 6.389 sec to decorrelate experience collection [2022-07-09 00:02:09,557][26286] Worker 33, sleep for 9.167 sec to decorrelate experience collection [2022-07-09 00:02:09,571][26317] Worker 35, sleep for 9.722 sec to decorrelate experience collection [2022-07-09 00:02:09,577][26122] Worker 24, sleep for 6.667 sec to decorrelate experience collection [2022-07-09 00:02:09,624][26309] Worker 34, sleep for 9.444 sec to decorrelate experience collection [2022-07-09 00:02:09,642][26052] Worker 10, sleep for 2.778 sec to decorrelate experience collection [2022-07-09 00:02:09,663][26197] Worker 27, sleep for 7.500 sec to decorrelate experience collection [2022-07-09 00:02:09,665][26068] Worker 17, sleep for 4.722 sec to decorrelate experience collection [2022-07-09 00:02:09,665][26036] Worker 4, sleep for 1.111 sec to decorrelate experience collection [2022-07-09 00:02:09,676][26044] Worker 9, sleep for 2.500 sec to decorrelate experience collection [2022-07-09 00:02:09,691][26061] Worker 11, sleep for 3.056 sec to decorrelate experience collection [2022-07-09 00:02:09,699][26140] Worker 25, sleep for 6.944 sec to decorrelate experience collection [2022-07-09 00:02:09,704][26023] Worker 0, sleep for 0.000 sec to decorrelate experience collection [2022-07-09 00:02:09,704][26023] Worker 0 awakens! [2022-07-09 00:02:09,726][26085] Worker 20, sleep for 5.556 sec to decorrelate experience collection [2022-07-09 00:02:09,718][26063] Worker 12, sleep for 3.333 sec to decorrelate experience collection [2022-07-09 00:02:09,735][26027] Worker 3, sleep for 0.833 sec to decorrelate experience collection [2022-07-09 00:02:09,745][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:02:09,750][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000000001_1024.pth [2022-07-09 00:02:09,751][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000000001_1024.pth.milestone [2022-07-09 00:02:09,755][26024] Worker 1, sleep for 0.278 sec to decorrelate experience collection [2022-07-09 00:02:09,760][26283] Worker 32, sleep for 8.889 sec to decorrelate experience collection [2022-07-09 00:02:09,783][26022] Updated weights on worker 0-0, policy_version 1 (0.00056) [2022-07-09 00:02:09,788][26077] Worker 19, sleep for 5.278 sec to decorrelate experience collection [2022-07-09 00:02:09,796][26042] Worker 8, sleep for 2.222 sec to decorrelate experience collection [2022-07-09 00:02:09,855][26067] Worker 16, sleep for 4.444 sec to decorrelate experience collection [2022-07-09 00:02:09,936][26212] Worker 28, sleep for 7.778 sec to decorrelate experience collection [2022-07-09 00:02:10,012][26025] Worker 2 awakens! [2022-07-09 00:02:10,035][26024] Worker 1 awakens! [2022-07-09 00:02:10,571][26027] Worker 3 awakens! [2022-07-09 00:02:10,781][26036] Worker 4 awakens! [2022-07-09 00:02:10,827][26038] Worker 5 awakens! [2022-07-09 00:02:10,970][26040] Worker 6 awakens! [2022-07-09 00:02:11,339][26041] Worker 7 awakens! [2022-07-09 00:02:12,021][26042] Worker 8 awakens! [2022-07-09 00:02:12,188][26044] Worker 9 awakens! [2022-07-09 00:02:12,427][26052] Worker 10 awakens! [2022-07-09 00:02:12,763][26061] Worker 11 awakens! [2022-07-09 00:02:12,998][26022] Updated weights on worker 0-0, policy_version 18 (0.00087) [2022-07-09 00:02:13,079][26063] Worker 12 awakens! [2022-07-09 00:02:13,095][26064] Worker 13 awakens! [2022-07-09 00:02:13,306][26065] Worker 14 awakens! [2022-07-09 00:02:13,691][26066] Worker 15 awakens! [2022-07-09 00:02:14,331][26067] Worker 16 awakens! [2022-07-09 00:02:14,413][26068] Worker 17 awakens! [2022-07-09 00:02:14,543][26069] Worker 18 awakens! [2022-07-09 00:02:14,973][26022] Updated weights on worker 0-0, policy_version 28 (0.00103) [2022-07-09 00:02:15,091][26077] Worker 19 awakens! [2022-07-09 00:02:15,264][26086] Worker 21 awakens! [2022-07-09 00:02:15,307][26085] Worker 20 awakens! [2022-07-09 00:02:15,500][26094] Worker 22 awakens! [2022-07-09 00:02:15,949][26100] Worker 23 awakens! [2022-07-09 00:02:16,276][26122] Worker 24 awakens! [2022-07-09 00:02:16,588][25689] Fps is (10 sec: 5061.0, 60 sec: 5061.0, 300 sec: 5061.0). Total num frames: 36864. Throughput: 0: nan. Samples: 35964. Policy #0 lag: (min: 0.0, avg: 3.4, max: 8.0) [2022-07-09 00:02:16,683][26140] Worker 25 awakens! [2022-07-09 00:02:16,742][26158] Worker 26 awakens! [2022-07-09 00:02:16,841][26022] Updated weights on worker 0-0, policy_version 38 (0.00090) [2022-07-09 00:02:17,199][26197] Worker 27 awakens! [2022-07-09 00:02:17,507][26214] Worker 29 awakens! [2022-07-09 00:02:17,751][26212] Worker 28 awakens! [2022-07-09 00:02:17,843][26251] Worker 30 awakens! [2022-07-09 00:02:18,147][26260] Worker 31 awakens! [2022-07-09 00:02:18,666][26022] Updated weights on worker 0-0, policy_version 48 (0.00090) [2022-07-09 00:02:18,691][26283] Worker 32 awakens! [2022-07-09 00:02:18,775][26286] Worker 33 awakens! [2022-07-09 00:02:19,115][26309] Worker 34 awakens! [2022-07-09 00:02:19,343][26317] Worker 35 awakens! [2022-07-09 00:02:20,706][26022] Updated weights on worker 0-0, policy_version 58 (0.00090) [2022-07-09 00:02:21,593][25689] Fps is (10 sec: 5291.4, 60 sec: 5291.4, 300 sec: 5291.4). Total num frames: 64512. Throughput: 0: 6715.8. Samples: 69576. Policy #0 lag: (min: 0.0, avg: 17.2, max: 54.0) [2022-07-09 00:02:21,594][25689] Avg episode reward: [(0, '-57.847')] [2022-07-09 00:02:22,373][26022] Updated weights on worker 0-0, policy_version 68 (0.00083) [2022-07-09 00:02:24,272][26022] Updated weights on worker 0-0, policy_version 78 (0.00091) [2022-07-09 00:02:25,903][26022] Updated weights on worker 0-0, policy_version 88 (0.00096) [2022-07-09 00:02:26,611][25689] Fps is (10 sec: 5516.9, 60 sec: 5364.0, 300 sec: 5364.0). Total num frames: 92160. Throughput: 0: 5016.3. Samples: 86242. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:02:26,612][25689] Avg episode reward: [(0, '-62.953')] [2022-07-09 00:02:27,994][26022] Updated weights on worker 0-0, policy_version 98 (0.00083) [2022-07-09 00:02:29,575][26022] Updated weights on worker 0-0, policy_version 108 (0.00096) [2022-07-09 00:02:31,627][25689] Fps is (10 sec: 5511.2, 60 sec: 5401.1, 300 sec: 5401.1). Total num frames: 119808. Throughput: 0: 5542.4. Samples: 119312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:02:31,631][25689] Avg episode reward: [(0, '-63.427')] [2022-07-09 00:02:31,788][26022] Updated weights on worker 0-0, policy_version 118 (0.00095) [2022-07-09 00:02:33,503][26022] Updated weights on worker 0-0, policy_version 128 (0.00100) [2022-07-09 00:02:35,479][26022] Updated weights on worker 0-0, policy_version 138 (0.00090) [2022-07-09 00:02:36,635][25689] Fps is (10 sec: 5516.4, 60 sec: 5424.9, 300 sec: 5424.9). Total num frames: 147456. Throughput: 0: 5797.8. Samples: 152192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 00:02:36,636][25689] Avg episode reward: [(0, '-66.720')] [2022-07-09 00:02:37,178][26022] Updated weights on worker 0-0, policy_version 148 (0.00086) [2022-07-09 00:02:39,391][26022] Updated weights on worker 0-0, policy_version 158 (0.00083) [2022-07-09 00:02:41,088][26022] Updated weights on worker 0-0, policy_version 168 (0.00097) [2022-07-09 00:02:41,639][25689] Fps is (10 sec: 5522.8, 60 sec: 5441.6, 300 sec: 5441.6). Total num frames: 175104. Throughput: 0: 4924.1. Samples: 168282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:02:41,639][25689] Avg episode reward: [(0, '-69.724')] [2022-07-09 00:02:42,993][26022] Updated weights on worker 0-0, policy_version 178 (0.00113) [2022-07-09 00:02:44,558][26022] Updated weights on worker 0-0, policy_version 188 (0.00085) [2022-07-09 00:02:46,656][25689] Fps is (10 sec: 5313.5, 60 sec: 5393.1, 300 sec: 5393.1). Total num frames: 200704. Throughput: 0: 5734.2. Samples: 201184. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:02:46,657][25689] Avg episode reward: [(0, '-75.056')] [2022-07-09 00:02:46,840][26022] Updated weights on worker 0-0, policy_version 198 (0.00090) [2022-07-09 00:02:48,498][26022] Updated weights on worker 0-0, policy_version 208 (0.00093) [2022-07-09 00:02:50,615][26022] Updated weights on worker 0-0, policy_version 218 (0.00090) [2022-07-09 00:02:51,681][25689] Fps is (10 sec: 5506.4, 60 sec: 5457.8, 300 sec: 5457.8). Total num frames: 230400. Throughput: 0: 5712.7. Samples: 233876. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:02:51,682][25689] Avg episode reward: [(0, '-77.423')] [2022-07-09 00:02:52,246][26022] Updated weights on worker 0-0, policy_version 228 (0.00084) [2022-07-09 00:02:54,249][26022] Updated weights on worker 0-0, policy_version 238 (0.00080) [2022-07-09 00:02:55,940][26022] Updated weights on worker 0-0, policy_version 248 (0.00091) [2022-07-09 00:02:56,704][25689] Fps is (10 sec: 5503.3, 60 sec: 5417.6, 300 sec: 5417.6). Total num frames: 256000. Throughput: 0: 4898.1. Samples: 250492. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2022-07-09 00:02:56,705][25689] Avg episode reward: [(0, '-81.660')] [2022-07-09 00:02:58,112][26022] Updated weights on worker 0-0, policy_version 258 (0.00098) [2022-07-09 00:02:59,526][26022] Updated weights on worker 0-0, policy_version 268 (0.00058) [2022-07-09 00:03:01,699][26022] Updated weights on worker 0-0, policy_version 278 (0.00083) [2022-07-09 00:03:01,735][25689] Fps is (10 sec: 5398.3, 60 sec: 5445.9, 300 sec: 5445.9). Total num frames: 284672. Throughput: 0: 5729.5. Samples: 283420. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 00:03:01,735][25689] Avg episode reward: [(0, '-84.743')] [2022-07-09 00:03:03,735][26022] Updated weights on worker 0-0, policy_version 288 (0.00086) [2022-07-09 00:03:05,636][26022] Updated weights on worker 0-0, policy_version 298 (0.00095) [2022-07-09 00:03:06,750][25689] Fps is (10 sec: 5402.6, 60 sec: 5414.9, 300 sec: 5414.9). Total num frames: 310272. Throughput: 0: 5642.0. Samples: 314550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:03:06,750][25689] Avg episode reward: [(0, '-87.796')] [2022-07-09 00:03:07,457][26022] Updated weights on worker 0-0, policy_version 308 (0.00083) [2022-07-09 00:03:09,623][26022] Updated weights on worker 0-0, policy_version 318 (0.00079) [2022-07-09 00:03:11,218][26022] Updated weights on worker 0-0, policy_version 328 (0.00086) [2022-07-09 00:03:11,766][25689] Fps is (10 sec: 5410.3, 60 sec: 5439.9, 300 sec: 5439.9). Total num frames: 338944. Throughput: 0: 4823.9. Samples: 330764. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 00:03:11,766][25689] Avg episode reward: [(0, '-92.200')] [2022-07-09 00:03:13,263][26022] Updated weights on worker 0-0, policy_version 338 (0.00086) [2022-07-09 00:03:14,811][26022] Updated weights on worker 0-0, policy_version 348 (0.00086) [2022-07-09 00:03:16,789][25689] Fps is (10 sec: 5405.9, 60 sec: 5443.1, 300 sec: 5413.5). Total num frames: 364544. Throughput: 0: 5652.0. Samples: 364012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:03:16,790][25689] Avg episode reward: [(0, '-96.258')] [2022-07-09 00:03:17,063][26022] Updated weights on worker 0-0, policy_version 358 (0.00093) [2022-07-09 00:03:18,754][26022] Updated weights on worker 0-0, policy_version 368 (0.00083) [2022-07-09 00:03:20,626][26022] Updated weights on worker 0-0, policy_version 378 (0.00085) [2022-07-09 00:03:21,815][25689] Fps is (10 sec: 5298.8, 60 sec: 5441.2, 300 sec: 5419.8). Total num frames: 392192. Throughput: 0: 5650.1. Samples: 396876. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 00:03:21,816][25689] Avg episode reward: [(0, '-99.797')] [2022-07-09 00:03:22,534][26022] Updated weights on worker 0-0, policy_version 388 (0.00084) [2022-07-09 00:03:24,299][26022] Updated weights on worker 0-0, policy_version 398 (0.00083) [2022-07-09 00:03:26,382][26022] Updated weights on worker 0-0, policy_version 408 (0.00088) [2022-07-09 00:03:26,822][25689] Fps is (10 sec: 5613.7, 60 sec: 5459.2, 300 sec: 5440.2). Total num frames: 420864. Throughput: 0: 5729.8. Samples: 429560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 00:03:26,822][25689] Avg episode reward: [(0, '-102.753')] [2022-07-09 00:03:28,128][26022] Updated weights on worker 0-0, policy_version 418 (0.00082) [2022-07-09 00:03:30,140][26022] Updated weights on worker 0-0, policy_version 428 (0.00093) [2022-07-09 00:03:31,801][26022] Updated weights on worker 0-0, policy_version 438 (0.00092) [2022-07-09 00:03:31,851][25689] Fps is (10 sec: 5612.1, 60 sec: 5458.0, 300 sec: 5443.8). Total num frames: 448512. Throughput: 0: 5742.2. Samples: 446094. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:03:31,851][25689] Avg episode reward: [(0, '-103.345')] [2022-07-09 00:03:33,895][26022] Updated weights on worker 0-0, policy_version 448 (0.00095) [2022-07-09 00:03:35,548][26022] Updated weights on worker 0-0, policy_version 458 (0.00087) [2022-07-09 00:03:36,860][25689] Fps is (10 sec: 5304.5, 60 sec: 5423.9, 300 sec: 5424.2). Total num frames: 474112. Throughput: 0: 5744.6. Samples: 479312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 17.0) [2022-07-09 00:03:36,861][25689] Avg episode reward: [(0, '-103.838')] [2022-07-09 00:03:37,506][26022] Updated weights on worker 0-0, policy_version 468 (0.00089) [2022-07-09 00:03:39,275][26022] Updated weights on worker 0-0, policy_version 478 (0.00096) [2022-07-09 00:03:41,247][26022] Updated weights on worker 0-0, policy_version 488 (0.00089) [2022-07-09 00:03:41,931][25689] Fps is (10 sec: 5282.0, 60 sec: 5417.9, 300 sec: 5425.8). Total num frames: 501760. Throughput: 0: 5720.6. Samples: 511954. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 00:03:41,932][25689] Avg episode reward: [(0, '-103.205')] [2022-07-09 00:03:42,932][26022] Updated weights on worker 0-0, policy_version 498 (0.00089) [2022-07-09 00:03:45,111][26022] Updated weights on worker 0-0, policy_version 508 (0.00100) [2022-07-09 00:03:46,951][25689] Fps is (10 sec: 5479.9, 60 sec: 5451.7, 300 sec: 5430.1). Total num frames: 529408. Throughput: 0: 4913.4. Samples: 528462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 00:03:46,951][25689] Avg episode reward: [(0, '-103.123')] [2022-07-09 00:03:46,999][26022] Updated weights on worker 0-0, policy_version 518 (0.00084) [2022-07-09 00:03:48,967][26022] Updated weights on worker 0-0, policy_version 528 (0.00093) [2022-07-09 00:03:50,424][26022] Updated weights on worker 0-0, policy_version 538 (0.00079) [2022-07-09 00:03:51,971][25689] Fps is (10 sec: 5507.8, 60 sec: 5418.1, 300 sec: 5434.0). Total num frames: 557056. Throughput: 0: 5722.6. Samples: 561236. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 00:03:51,972][25689] Avg episode reward: [(0, '-103.296')] [2022-07-09 00:03:52,731][26022] Updated weights on worker 0-0, policy_version 548 (0.00087) [2022-07-09 00:03:54,362][26022] Updated weights on worker 0-0, policy_version 558 (0.00087) [2022-07-09 00:03:56,407][26022] Updated weights on worker 0-0, policy_version 568 (0.00095) [2022-07-09 00:03:56,979][25689] Fps is (10 sec: 5514.1, 60 sec: 5453.4, 300 sec: 5438.1). Total num frames: 584704. Throughput: 0: 5697.2. Samples: 593932. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:03:56,979][25689] Avg episode reward: [(0, '-103.647')] [2022-07-09 00:03:58,166][26022] Updated weights on worker 0-0, policy_version 578 (0.00093) [2022-07-09 00:04:00,140][26022] Updated weights on worker 0-0, policy_version 588 (0.00089) [2022-07-09 00:04:02,018][25689] Fps is (10 sec: 5299.8, 60 sec: 5401.7, 300 sec: 5421.8). Total num frames: 610304. Throughput: 0: 4892.7. Samples: 610232. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 00:04:02,019][25689] Avg episode reward: [(0, '-103.701')] [2022-07-09 00:04:02,600][26022] Updated weights on worker 0-0, policy_version 598 (0.00091) [2022-07-09 00:04:04,395][26022] Updated weights on worker 0-0, policy_version 608 (0.00092) [2022-07-09 00:04:06,286][26022] Updated weights on worker 0-0, policy_version 618 (0.00102) [2022-07-09 00:04:07,025][25689] Fps is (10 sec: 5198.5, 60 sec: 5419.4, 300 sec: 5417.2). Total num frames: 636928. Throughput: 0: 5537.5. Samples: 639622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 00:04:07,025][25689] Avg episode reward: [(0, '-102.960')] [2022-07-09 00:04:08,391][26022] Updated weights on worker 0-0, policy_version 628 (0.00094) [2022-07-09 00:04:09,883][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:04:09,895][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000000637_652288.pth [2022-07-09 00:04:09,963][26022] Updated weights on worker 0-0, policy_version 638 (0.00084) [2022-07-09 00:04:12,038][25689] Fps is (10 sec: 5212.3, 60 sec: 5368.8, 300 sec: 5404.3). Total num frames: 662528. Throughput: 0: 5532.3. Samples: 672250. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 00:04:12,038][25689] Avg episode reward: [(0, '-103.502')] [2022-07-09 00:04:12,081][26022] Updated weights on worker 0-0, policy_version 648 (0.00087) [2022-07-09 00:04:13,842][26022] Updated weights on worker 0-0, policy_version 658 (0.00090) [2022-07-09 00:04:15,871][26022] Updated weights on worker 0-0, policy_version 668 (0.00081) [2022-07-09 00:04:17,058][25689] Fps is (10 sec: 5409.4, 60 sec: 5420.0, 300 sec: 5416.6). Total num frames: 691200. Throughput: 0: 4714.4. Samples: 688592. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:04:17,059][25689] Avg episode reward: [(0, '-104.037')] [2022-07-09 00:04:17,547][26022] Updated weights on worker 0-0, policy_version 678 (0.00088) [2022-07-09 00:04:19,522][26022] Updated weights on worker 0-0, policy_version 688 (0.00106) [2022-07-09 00:04:21,107][26022] Updated weights on worker 0-0, policy_version 698 (0.00089) [2022-07-09 00:04:22,083][25689] Fps is (10 sec: 5504.8, 60 sec: 5403.1, 300 sec: 5412.1). Total num frames: 717824. Throughput: 0: 5548.7. Samples: 721564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:04:22,083][25689] Avg episode reward: [(0, '-104.095')] [2022-07-09 00:04:23,258][26022] Updated weights on worker 0-0, policy_version 708 (0.00088) [2022-07-09 00:04:25,048][26022] Updated weights on worker 0-0, policy_version 718 (0.00092) [2022-07-09 00:04:27,013][26022] Updated weights on worker 0-0, policy_version 728 (0.00087) [2022-07-09 00:04:27,094][25689] Fps is (10 sec: 5407.7, 60 sec: 5385.7, 300 sec: 5416.0). Total num frames: 745472. Throughput: 0: 5711.7. Samples: 754250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 00:04:27,095][25689] Avg episode reward: [(0, '-103.617')] [2022-07-09 00:04:28,814][26022] Updated weights on worker 0-0, policy_version 738 (0.00101) [2022-07-09 00:04:30,855][26022] Updated weights on worker 0-0, policy_version 748 (0.00606) [2022-07-09 00:04:32,122][25689] Fps is (10 sec: 5507.9, 60 sec: 5385.8, 300 sec: 5418.9). Total num frames: 773120. Throughput: 0: 4876.1. Samples: 770184. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 00:04:32,123][25689] Avg episode reward: [(0, '-103.121')] [2022-07-09 00:04:32,665][26022] Updated weights on worker 0-0, policy_version 758 (0.00100) [2022-07-09 00:04:34,671][26022] Updated weights on worker 0-0, policy_version 768 (0.00092) [2022-07-09 00:04:36,548][26022] Updated weights on worker 0-0, policy_version 778 (0.00094) [2022-07-09 00:04:37,132][25689] Fps is (10 sec: 5304.4, 60 sec: 5385.7, 300 sec: 5408.3). Total num frames: 798720. Throughput: 0: 5680.7. Samples: 802630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:04:37,133][25689] Avg episode reward: [(0, '-102.955')] [2022-07-09 00:04:38,464][26022] Updated weights on worker 0-0, policy_version 788 (0.00087) [2022-07-09 00:04:40,394][26022] Updated weights on worker 0-0, policy_version 798 (0.00096) [2022-07-09 00:04:42,187][25689] Fps is (10 sec: 5290.6, 60 sec: 5387.2, 300 sec: 5410.3). Total num frames: 826368. Throughput: 0: 5652.5. Samples: 835202. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 00:04:42,187][25689] Avg episode reward: [(0, '-103.351')] [2022-07-09 00:04:42,191][26022] Updated weights on worker 0-0, policy_version 808 (0.00099) [2022-07-09 00:04:44,009][26022] Updated weights on worker 0-0, policy_version 818 (0.00087) [2022-07-09 00:04:45,894][26022] Updated weights on worker 0-0, policy_version 828 (0.00086) [2022-07-09 00:04:47,195][25689] Fps is (10 sec: 5597.0, 60 sec: 5405.2, 300 sec: 5420.5). Total num frames: 855040. Throughput: 0: 4856.0. Samples: 851862. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:04:47,196][25689] Avg episode reward: [(0, '-102.661')] [2022-07-09 00:04:47,684][26022] Updated weights on worker 0-0, policy_version 838 (0.00102) [2022-07-09 00:04:49,777][26022] Updated weights on worker 0-0, policy_version 848 (0.00088) [2022-07-09 00:04:51,567][26022] Updated weights on worker 0-0, policy_version 858 (0.00087) [2022-07-09 00:04:52,213][25689] Fps is (10 sec: 5617.0, 60 sec: 5405.4, 300 sec: 5423.2). Total num frames: 882688. Throughput: 0: 5711.8. Samples: 884942. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:04:52,214][25689] Avg episode reward: [(0, '-102.173')] [2022-07-09 00:04:53,314][26022] Updated weights on worker 0-0, policy_version 868 (0.00079) [2022-07-09 00:04:55,180][26022] Updated weights on worker 0-0, policy_version 878 (0.00087) [2022-07-09 00:04:57,228][26022] Updated weights on worker 0-0, policy_version 888 (0.00088) [2022-07-09 00:04:57,231][25689] Fps is (10 sec: 5305.9, 60 sec: 5370.5, 300 sec: 5413.5). Total num frames: 908288. Throughput: 0: 5742.7. Samples: 918048. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 00:04:57,231][25689] Avg episode reward: [(0, '-101.918')] [2022-07-09 00:04:58,875][26022] Updated weights on worker 0-0, policy_version 898 (0.00094) [2022-07-09 00:05:01,100][26022] Updated weights on worker 0-0, policy_version 908 (0.00089) [2022-07-09 00:05:02,267][25689] Fps is (10 sec: 5194.5, 60 sec: 5387.8, 300 sec: 5409.8). Total num frames: 934912. Throughput: 0: 4947.2. Samples: 934544. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:05:02,268][25689] Avg episode reward: [(0, '-102.224')] [2022-07-09 00:05:02,917][26022] Updated weights on worker 0-0, policy_version 918 (0.00093) [2022-07-09 00:05:05,187][26022] Updated weights on worker 0-0, policy_version 928 (0.00079) [2022-07-09 00:05:06,616][26022] Updated weights on worker 0-0, policy_version 938 (0.00058) [2022-07-09 00:05:07,275][25689] Fps is (10 sec: 5505.5, 60 sec: 5421.7, 300 sec: 5418.8). Total num frames: 963584. Throughput: 0: 5662.4. Samples: 965560. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 00:05:07,275][25689] Avg episode reward: [(0, '-102.095')] [2022-07-09 00:05:08,784][26022] Updated weights on worker 0-0, policy_version 948 (0.00084) [2022-07-09 00:05:10,360][26022] Updated weights on worker 0-0, policy_version 958 (0.00096) [2022-07-09 00:05:12,306][25689] Fps is (10 sec: 5508.2, 60 sec: 5437.0, 300 sec: 5415.2). Total num frames: 990208. Throughput: 0: 5650.1. Samples: 998468. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:05:12,307][25689] Avg episode reward: [(0, '-101.377')] [2022-07-09 00:05:12,364][26022] Updated weights on worker 0-0, policy_version 968 (0.00089) [2022-07-09 00:05:14,227][26022] Updated weights on worker 0-0, policy_version 978 (0.00081) [2022-07-09 00:05:16,073][26022] Updated weights on worker 0-0, policy_version 988 (0.00096) [2022-07-09 00:05:17,320][25689] Fps is (10 sec: 5402.6, 60 sec: 5420.6, 300 sec: 5417.9). Total num frames: 1017856. Throughput: 0: 4832.8. Samples: 1015136. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:05:17,320][25689] Avg episode reward: [(0, '-102.703')] [2022-07-09 00:05:17,711][26022] Updated weights on worker 0-0, policy_version 998 (0.00090) [2022-07-09 00:05:19,773][26022] Updated weights on worker 0-0, policy_version 1008 (0.00088) [2022-07-09 00:05:21,453][26022] Updated weights on worker 0-0, policy_version 1018 (0.00085) [2022-07-09 00:05:22,375][25689] Fps is (10 sec: 5593.6, 60 sec: 5451.9, 300 sec: 5424.6). Total num frames: 1046528. Throughput: 0: 5670.1. Samples: 1048556. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 00:05:22,375][25689] Avg episode reward: [(0, '-102.919')] [2022-07-09 00:05:23,410][26022] Updated weights on worker 0-0, policy_version 1028 (0.00087) [2022-07-09 00:05:25,174][26022] Updated weights on worker 0-0, policy_version 1038 (0.00086) [2022-07-09 00:05:27,104][26022] Updated weights on worker 0-0, policy_version 1048 (0.00097) [2022-07-09 00:05:27,380][25689] Fps is (10 sec: 5496.6, 60 sec: 5435.4, 300 sec: 5421.9). Total num frames: 1073152. Throughput: 0: 5784.1. Samples: 1081852. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 00:05:27,381][25689] Avg episode reward: [(0, '-102.728')] [2022-07-09 00:05:28,894][26022] Updated weights on worker 0-0, policy_version 1058 (0.00091) [2022-07-09 00:05:30,897][26022] Updated weights on worker 0-0, policy_version 1068 (0.00628) [2022-07-09 00:05:32,403][25689] Fps is (10 sec: 5514.2, 60 sec: 5452.9, 300 sec: 5429.1). Total num frames: 1101824. Throughput: 0: 4968.8. Samples: 1098324. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 00:05:32,403][25689] Avg episode reward: [(0, '-102.531')] [2022-07-09 00:05:32,790][26022] Updated weights on worker 0-0, policy_version 1078 (0.00095) [2022-07-09 00:05:34,577][26022] Updated weights on worker 0-0, policy_version 1088 (0.00084) [2022-07-09 00:05:36,396][26022] Updated weights on worker 0-0, policy_version 1098 (0.00089) [2022-07-09 00:05:37,428][25689] Fps is (10 sec: 5605.2, 60 sec: 5485.5, 300 sec: 5430.9). Total num frames: 1129472. Throughput: 0: 5793.3. Samples: 1131628. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:05:37,428][25689] Avg episode reward: [(0, '-102.279')] [2022-07-09 00:05:38,334][26022] Updated weights on worker 0-0, policy_version 1108 (0.00089) [2022-07-09 00:05:40,193][26022] Updated weights on worker 0-0, policy_version 1118 (0.00086) [2022-07-09 00:05:42,039][26022] Updated weights on worker 0-0, policy_version 1128 (0.00089) [2022-07-09 00:05:42,486][25689] Fps is (10 sec: 5483.8, 60 sec: 5485.2, 300 sec: 5431.7). Total num frames: 1157120. Throughput: 0: 5755.3. Samples: 1164304. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:05:42,487][25689] Avg episode reward: [(0, '-101.905')] [2022-07-09 00:05:43,901][26022] Updated weights on worker 0-0, policy_version 1138 (0.00092) [2022-07-09 00:05:45,755][26022] Updated weights on worker 0-0, policy_version 1148 (0.00087) [2022-07-09 00:05:47,497][25689] Fps is (10 sec: 5491.8, 60 sec: 5468.0, 300 sec: 5433.7). Total num frames: 1184768. Throughput: 0: 4923.4. Samples: 1180894. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 00:05:47,497][25689] Avg episode reward: [(0, '-99.936')] [2022-07-09 00:05:47,586][26022] Updated weights on worker 0-0, policy_version 1158 (0.00096) [2022-07-09 00:05:49,509][26022] Updated weights on worker 0-0, policy_version 1168 (0.00082) [2022-07-09 00:05:51,284][26022] Updated weights on worker 0-0, policy_version 1178 (0.00087) [2022-07-09 00:05:52,514][25689] Fps is (10 sec: 5514.4, 60 sec: 5468.1, 300 sec: 5435.5). Total num frames: 1212416. Throughput: 0: 5755.1. Samples: 1214066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:05:52,515][25689] Avg episode reward: [(0, '-99.983')] [2022-07-09 00:05:53,383][26022] Updated weights on worker 0-0, policy_version 1188 (0.00086) [2022-07-09 00:05:55,003][26022] Updated weights on worker 0-0, policy_version 1198 (0.00086) [2022-07-09 00:05:57,143][26022] Updated weights on worker 0-0, policy_version 1208 (0.00081) [2022-07-09 00:05:57,531][25689] Fps is (10 sec: 5612.9, 60 sec: 5519.1, 300 sec: 5441.7). Total num frames: 1241088. Throughput: 0: 5732.2. Samples: 1246862. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:05:57,531][25689] Avg episode reward: [(0, '-99.426')] [2022-07-09 00:05:58,744][26022] Updated weights on worker 0-0, policy_version 1218 (0.00086) [2022-07-09 00:06:00,612][26022] Updated weights on worker 0-0, policy_version 1228 (0.00088) [2022-07-09 00:06:02,596][25689] Fps is (10 sec: 5179.7, 60 sec: 5465.5, 300 sec: 5424.3). Total num frames: 1264640. Throughput: 0: 4925.7. Samples: 1263360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:06:02,597][25689] Avg episode reward: [(0, '-99.580')] [2022-07-09 00:06:03,033][26022] Updated weights on worker 0-0, policy_version 1238 (0.00081) [2022-07-09 00:06:04,756][26022] Updated weights on worker 0-0, policy_version 1248 (0.00087) [2022-07-09 00:06:06,825][26022] Updated weights on worker 0-0, policy_version 1258 (0.00087) [2022-07-09 00:06:07,598][25689] Fps is (10 sec: 5187.5, 60 sec: 5466.0, 300 sec: 5430.8). Total num frames: 1293312. Throughput: 0: 5623.3. Samples: 1293930. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:06:07,599][25689] Avg episode reward: [(0, '-99.822')] [2022-07-09 00:06:08,752][26022] Updated weights on worker 0-0, policy_version 1268 (0.00087) [2022-07-09 00:06:10,233][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:06:10,256][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000001275_1305600.pth [2022-07-09 00:06:10,512][26022] Updated weights on worker 0-0, policy_version 1278 (0.00087) [2022-07-09 00:06:12,394][26022] Updated weights on worker 0-0, policy_version 1288 (0.00108) [2022-07-09 00:06:12,611][25689] Fps is (10 sec: 5419.4, 60 sec: 5450.7, 300 sec: 5424.1). Total num frames: 1318912. Throughput: 0: 5587.1. Samples: 1326350. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:06:12,611][25689] Avg episode reward: [(0, '-100.831')] [2022-07-09 00:06:14,257][26022] Updated weights on worker 0-0, policy_version 1298 (0.00089) [2022-07-09 00:06:16,320][26022] Updated weights on worker 0-0, policy_version 1308 (0.00090) [2022-07-09 00:06:17,614][25689] Fps is (10 sec: 5214.2, 60 sec: 5434.7, 300 sec: 5422.0). Total num frames: 1345536. Throughput: 0: 4769.3. Samples: 1342644. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-09 00:06:17,616][25689] Avg episode reward: [(0, '-101.529')] [2022-07-09 00:06:18,182][26022] Updated weights on worker 0-0, policy_version 1318 (0.00079) [2022-07-09 00:06:20,126][26022] Updated weights on worker 0-0, policy_version 1328 (0.00084) [2022-07-09 00:06:21,863][26022] Updated weights on worker 0-0, policy_version 1338 (0.00089) [2022-07-09 00:06:22,763][25689] Fps is (10 sec: 5446.8, 60 sec: 5426.2, 300 sec: 5425.0). Total num frames: 1374208. Throughput: 0: 5537.7. Samples: 1375036. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:06:22,763][25689] Avg episode reward: [(0, '-101.075')] [2022-07-09 00:06:23,940][26022] Updated weights on worker 0-0, policy_version 1348 (0.00090) [2022-07-09 00:06:25,772][26022] Updated weights on worker 0-0, policy_version 1358 (0.00104) [2022-07-09 00:06:27,639][26022] Updated weights on worker 0-0, policy_version 1368 (0.00065) [2022-07-09 00:06:27,766][25689] Fps is (10 sec: 5446.9, 60 sec: 5426.5, 300 sec: 5423.0). Total num frames: 1400832. Throughput: 0: 5632.9. Samples: 1407532. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:06:27,767][25689] Avg episode reward: [(0, '-100.321')] [2022-07-09 00:06:29,532][26022] Updated weights on worker 0-0, policy_version 1378 (0.00091) [2022-07-09 00:06:31,745][26022] Updated weights on worker 0-0, policy_version 1388 (0.00088) [2022-07-09 00:06:32,867][25689] Fps is (10 sec: 5371.2, 60 sec: 5402.5, 300 sec: 5422.9). Total num frames: 1428480. Throughput: 0: 4814.0. Samples: 1423864. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:06:32,868][25689] Avg episode reward: [(0, '-100.915')] [2022-07-09 00:06:33,091][26022] Updated weights on worker 0-0, policy_version 1398 (0.00093) [2022-07-09 00:06:35,232][26022] Updated weights on worker 0-0, policy_version 1408 (0.00084) [2022-07-09 00:06:36,673][26022] Updated weights on worker 0-0, policy_version 1418 (0.00089) [2022-07-09 00:06:37,887][25689] Fps is (10 sec: 5463.6, 60 sec: 5403.0, 300 sec: 5424.6). Total num frames: 1456128. Throughput: 0: 5622.8. Samples: 1456634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:06:37,887][25689] Avg episode reward: [(0, '-100.417')] [2022-07-09 00:06:39,044][26022] Updated weights on worker 0-0, policy_version 1428 (0.00083) [2022-07-09 00:06:40,744][26022] Updated weights on worker 0-0, policy_version 1438 (0.00090) [2022-07-09 00:06:42,672][26022] Updated weights on worker 0-0, policy_version 1448 (0.00082) [2022-07-09 00:06:42,963][25689] Fps is (10 sec: 5477.2, 60 sec: 5401.4, 300 sec: 5425.0). Total num frames: 1483776. Throughput: 0: 5649.2. Samples: 1489150. Policy #0 lag: (min: 0.0, avg: 7.5, max: 21.0) [2022-07-09 00:06:42,963][25689] Avg episode reward: [(0, '-99.863')] [2022-07-09 00:06:44,523][26022] Updated weights on worker 0-0, policy_version 1458 (0.00095) [2022-07-09 00:06:46,387][26022] Updated weights on worker 0-0, policy_version 1468 (0.00088) [2022-07-09 00:06:47,965][25689] Fps is (10 sec: 5486.5, 60 sec: 5402.1, 300 sec: 5426.8). Total num frames: 1511424. Throughput: 0: 4852.1. Samples: 1505542. Policy #0 lag: (min: 0.0, avg: 7.5, max: 21.0) [2022-07-09 00:06:47,966][25689] Avg episode reward: [(0, '-98.998')] [2022-07-09 00:06:48,305][26022] Updated weights on worker 0-0, policy_version 1478 (0.00086) [2022-07-09 00:06:50,243][26022] Updated weights on worker 0-0, policy_version 1488 (0.00088) [2022-07-09 00:06:52,190][26022] Updated weights on worker 0-0, policy_version 1498 (0.00091) [2022-07-09 00:06:52,982][25689] Fps is (10 sec: 5519.1, 60 sec: 5402.1, 300 sec: 5428.3). Total num frames: 1539072. Throughput: 0: 5694.6. Samples: 1538410. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 00:06:52,982][25689] Avg episode reward: [(0, '-99.107')] [2022-07-09 00:06:53,904][26022] Updated weights on worker 0-0, policy_version 1508 (0.00128) [2022-07-09 00:06:55,645][26022] Updated weights on worker 0-0, policy_version 1518 (0.00087) [2022-07-09 00:06:57,841][26022] Updated weights on worker 0-0, policy_version 1528 (0.00087) [2022-07-09 00:06:57,985][25689] Fps is (10 sec: 5314.0, 60 sec: 5352.5, 300 sec: 5422.9). Total num frames: 1564672. Throughput: 0: 5699.8. Samples: 1571194. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 00:06:57,986][25689] Avg episode reward: [(0, '-99.622')] [2022-07-09 00:06:59,565][26022] Updated weights on worker 0-0, policy_version 1538 (0.00086) [2022-07-09 00:07:01,921][26022] Updated weights on worker 0-0, policy_version 1548 (0.00103) [2022-07-09 00:07:03,068][25689] Fps is (10 sec: 5177.8, 60 sec: 5401.8, 300 sec: 5419.7). Total num frames: 1591296. Throughput: 0: 4894.2. Samples: 1587548. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 00:07:03,068][25689] Avg episode reward: [(0, '-100.717')] [2022-07-09 00:07:03,557][26022] Updated weights on worker 0-0, policy_version 1558 (0.00093) [2022-07-09 00:07:05,615][26022] Updated weights on worker 0-0, policy_version 1568 (0.00079) [2022-07-09 00:07:07,358][26022] Updated weights on worker 0-0, policy_version 1578 (0.00085) [2022-07-09 00:07:08,087][25689] Fps is (10 sec: 5271.1, 60 sec: 5366.4, 300 sec: 5417.7). Total num frames: 1617920. Throughput: 0: 5602.4. Samples: 1618274. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 00:07:08,088][25689] Avg episode reward: [(0, '-99.825')] [2022-07-09 00:07:09,237][26022] Updated weights on worker 0-0, policy_version 1588 (0.01106) [2022-07-09 00:07:11,286][26022] Updated weights on worker 0-0, policy_version 1598 (0.00087) [2022-07-09 00:07:13,156][25689] Fps is (10 sec: 5379.8, 60 sec: 5395.2, 300 sec: 5424.4). Total num frames: 1645568. Throughput: 0: 5608.2. Samples: 1651552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 00:07:13,156][25689] Avg episode reward: [(0, '-99.267')] [2022-07-09 00:07:13,165][26022] Updated weights on worker 0-0, policy_version 1608 (0.00081) [2022-07-09 00:07:14,880][26022] Updated weights on worker 0-0, policy_version 1618 (0.00085) [2022-07-09 00:07:16,952][26022] Updated weights on worker 0-0, policy_version 1628 (0.00091) [2022-07-09 00:07:18,215][25689] Fps is (10 sec: 5662.3, 60 sec: 5441.0, 300 sec: 5430.3). Total num frames: 1675264. Throughput: 0: 4770.4. Samples: 1667694. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 00:07:18,215][25689] Avg episode reward: [(0, '-99.495')] [2022-07-09 00:07:18,651][26022] Updated weights on worker 0-0, policy_version 1638 (0.00091) [2022-07-09 00:07:20,672][26022] Updated weights on worker 0-0, policy_version 1648 (0.00090) [2022-07-09 00:07:22,722][26022] Updated weights on worker 0-0, policy_version 1658 (0.00084) [2022-07-09 00:07:23,259][25689] Fps is (10 sec: 5473.4, 60 sec: 5399.6, 300 sec: 5423.0). Total num frames: 1700864. Throughput: 0: 5580.5. Samples: 1700224. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 00:07:23,259][25689] Avg episode reward: [(0, '-99.553')] [2022-07-09 00:07:24,368][26022] Updated weights on worker 0-0, policy_version 1668 (0.00088) [2022-07-09 00:07:26,399][26022] Updated weights on worker 0-0, policy_version 1678 (0.00089) [2022-07-09 00:07:28,160][26022] Updated weights on worker 0-0, policy_version 1688 (0.00089) [2022-07-09 00:07:28,291][25689] Fps is (10 sec: 5284.3, 60 sec: 5413.9, 300 sec: 5422.6). Total num frames: 1728512. Throughput: 0: 5672.0. Samples: 1732870. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:07:28,292][25689] Avg episode reward: [(0, '-100.440')] [2022-07-09 00:07:30,030][26022] Updated weights on worker 0-0, policy_version 1698 (0.00088) [2022-07-09 00:07:32,052][26022] Updated weights on worker 0-0, policy_version 1708 (0.00087) [2022-07-09 00:07:33,328][25689] Fps is (10 sec: 5593.3, 60 sec: 5436.6, 300 sec: 5425.6). Total num frames: 1757184. Throughput: 0: 5645.3. Samples: 1765428. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:07:33,328][25689] Avg episode reward: [(0, '-98.888')] [2022-07-09 00:07:33,675][26022] Updated weights on worker 0-0, policy_version 1718 (0.00082) [2022-07-09 00:07:35,746][26022] Updated weights on worker 0-0, policy_version 1728 (0.00095) [2022-07-09 00:07:37,615][26022] Updated weights on worker 0-0, policy_version 1738 (0.00084) [2022-07-09 00:07:38,346][25689] Fps is (10 sec: 5397.7, 60 sec: 5402.9, 300 sec: 5418.4). Total num frames: 1782784. Throughput: 0: 5671.5. Samples: 1781868. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:07:38,346][25689] Avg episode reward: [(0, '-99.349')] [2022-07-09 00:07:39,273][26022] Updated weights on worker 0-0, policy_version 1748 (0.00085) [2022-07-09 00:07:41,574][26022] Updated weights on worker 0-0, policy_version 1758 (0.00095) [2022-07-09 00:07:43,103][26022] Updated weights on worker 0-0, policy_version 1768 (0.00103) [2022-07-09 00:07:43,418][25689] Fps is (10 sec: 5378.5, 60 sec: 5420.2, 300 sec: 5427.8). Total num frames: 1811456. Throughput: 0: 5677.7. Samples: 1814684. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:07:43,419][25689] Avg episode reward: [(0, '-99.349')] [2022-07-09 00:07:45,171][26022] Updated weights on worker 0-0, policy_version 1778 (0.00087) [2022-07-09 00:07:47,092][26022] Updated weights on worker 0-0, policy_version 1788 (0.00076) [2022-07-09 00:07:48,480][25689] Fps is (10 sec: 5557.4, 60 sec: 5414.9, 300 sec: 5420.2). Total num frames: 1839104. Throughput: 0: 5672.1. Samples: 1847382. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:07:48,481][25689] Avg episode reward: [(0, '-99.606')] [2022-07-09 00:07:48,720][26022] Updated weights on worker 0-0, policy_version 1798 (0.00085) [2022-07-09 00:07:50,876][26022] Updated weights on worker 0-0, policy_version 1808 (0.00091) [2022-07-09 00:07:52,522][26022] Updated weights on worker 0-0, policy_version 1818 (0.00085) [2022-07-09 00:07:53,503][25689] Fps is (10 sec: 5280.0, 60 sec: 5380.4, 300 sec: 5420.2). Total num frames: 1864704. Throughput: 0: 4880.8. Samples: 1863900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 00:07:53,504][25689] Avg episode reward: [(0, '-98.684')] [2022-07-09 00:07:54,446][26022] Updated weights on worker 0-0, policy_version 1828 (0.00091) [2022-07-09 00:07:56,568][26022] Updated weights on worker 0-0, policy_version 1838 (0.00050) [2022-07-09 00:07:58,235][26022] Updated weights on worker 0-0, policy_version 1848 (0.00086) [2022-07-09 00:07:58,544][25689] Fps is (10 sec: 5494.2, 60 sec: 5444.7, 300 sec: 5423.4). Total num frames: 1894400. Throughput: 0: 5682.0. Samples: 1896636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 00:07:58,545][25689] Avg episode reward: [(0, '-98.471')] [2022-07-09 00:08:00,183][26022] Updated weights on worker 0-0, policy_version 1858 (0.00094) [2022-07-09 00:08:01,925][26022] Updated weights on worker 0-0, policy_version 1868 (0.00097) [2022-07-09 00:08:03,603][25689] Fps is (10 sec: 5373.6, 60 sec: 5413.0, 300 sec: 5419.2). Total num frames: 1918976. Throughput: 0: 5606.6. Samples: 1927852. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 00:08:03,603][25689] Avg episode reward: [(0, '-99.001')] [2022-07-09 00:08:04,157][26022] Updated weights on worker 0-0, policy_version 1878 (0.00087) [2022-07-09 00:08:06,010][26022] Updated weights on worker 0-0, policy_version 1888 (0.00095) [2022-07-09 00:08:07,957][26022] Updated weights on worker 0-0, policy_version 1898 (0.00090) [2022-07-09 00:08:08,626][25689] Fps is (10 sec: 5180.1, 60 sec: 5429.6, 300 sec: 5415.6). Total num frames: 1946624. Throughput: 0: 4809.0. Samples: 1944264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:08:08,627][25689] Avg episode reward: [(0, '-99.742')] [2022-07-09 00:08:09,739][26022] Updated weights on worker 0-0, policy_version 1908 (0.00088) [2022-07-09 00:08:10,309][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:08:10,318][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000001912_1957888.pth [2022-07-09 00:08:10,320][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000000001_1024.pth [2022-07-09 00:08:11,790][26022] Updated weights on worker 0-0, policy_version 1918 (0.00093) [2022-07-09 00:08:13,491][26022] Updated weights on worker 0-0, policy_version 1928 (0.00097) [2022-07-09 00:08:13,646][25689] Fps is (10 sec: 5607.7, 60 sec: 5450.9, 300 sec: 5426.0). Total num frames: 1975296. Throughput: 0: 5622.1. Samples: 1977146. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:08:13,647][25689] Avg episode reward: [(0, '-100.399')] [2022-07-09 00:08:15,517][26022] Updated weights on worker 0-0, policy_version 1938 (0.00085) [2022-07-09 00:08:17,232][26022] Updated weights on worker 0-0, policy_version 1948 (0.00090) [2022-07-09 00:08:18,669][25689] Fps is (10 sec: 5505.8, 60 sec: 5403.3, 300 sec: 5422.6). Total num frames: 2001920. Throughput: 0: 5628.9. Samples: 2009918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:08:18,670][25689] Avg episode reward: [(0, '-100.774')] [2022-07-09 00:08:19,121][26022] Updated weights on worker 0-0, policy_version 1958 (0.00075) [2022-07-09 00:08:21,118][26022] Updated weights on worker 0-0, policy_version 1968 (0.00086) [2022-07-09 00:08:23,043][26022] Updated weights on worker 0-0, policy_version 1978 (0.00086) [2022-07-09 00:08:23,745][25689] Fps is (10 sec: 5373.9, 60 sec: 5434.3, 300 sec: 5417.9). Total num frames: 2029568. Throughput: 0: 4885.2. Samples: 2026252. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 00:08:23,746][25689] Avg episode reward: [(0, '-101.886')] [2022-07-09 00:08:24,802][26022] Updated weights on worker 0-0, policy_version 1988 (0.00086) [2022-07-09 00:08:26,865][26022] Updated weights on worker 0-0, policy_version 1998 (0.00089) [2022-07-09 00:08:28,402][26022] Updated weights on worker 0-0, policy_version 2008 (0.00082) [2022-07-09 00:08:28,815][25689] Fps is (10 sec: 5349.2, 60 sec: 5414.1, 300 sec: 5413.7). Total num frames: 2056192. Throughput: 0: 5674.1. Samples: 2058818. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 00:08:28,816][25689] Avg episode reward: [(0, '-101.568')] [2022-07-09 00:08:30,563][26022] Updated weights on worker 0-0, policy_version 2018 (0.00086) [2022-07-09 00:08:32,292][26022] Updated weights on worker 0-0, policy_version 2028 (0.00086) [2022-07-09 00:08:33,833][25689] Fps is (10 sec: 5481.5, 60 sec: 5415.7, 300 sec: 5423.9). Total num frames: 2084864. Throughput: 0: 5675.0. Samples: 2091708. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 00:08:33,834][25689] Avg episode reward: [(0, '-101.838')] [2022-07-09 00:08:34,187][26022] Updated weights on worker 0-0, policy_version 2038 (0.00092) [2022-07-09 00:08:36,041][26022] Updated weights on worker 0-0, policy_version 2048 (0.00090) [2022-07-09 00:08:38,103][26022] Updated weights on worker 0-0, policy_version 2058 (0.00091) [2022-07-09 00:08:38,934][25689] Fps is (10 sec: 5566.0, 60 sec: 5442.2, 300 sec: 5423.4). Total num frames: 2112512. Throughput: 0: 4853.1. Samples: 2108262. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:08:38,934][25689] Avg episode reward: [(0, '-100.844')] [2022-07-09 00:08:39,649][26022] Updated weights on worker 0-0, policy_version 2068 (0.00077) [2022-07-09 00:08:41,765][26022] Updated weights on worker 0-0, policy_version 2078 (0.00091) [2022-07-09 00:08:43,381][26022] Updated weights on worker 0-0, policy_version 2088 (0.00091) [2022-07-09 00:08:44,011][25689] Fps is (10 sec: 5433.3, 60 sec: 5424.8, 300 sec: 5422.3). Total num frames: 2140160. Throughput: 0: 5684.9. Samples: 2141458. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:08:44,011][25689] Avg episode reward: [(0, '-99.983')] [2022-07-09 00:08:45,483][26022] Updated weights on worker 0-0, policy_version 2098 (0.00086) [2022-07-09 00:08:47,221][26022] Updated weights on worker 0-0, policy_version 2108 (0.00092) [2022-07-09 00:08:49,080][25689] Fps is (10 sec: 5449.8, 60 sec: 5424.2, 300 sec: 5421.4). Total num frames: 2167808. Throughput: 0: 5689.9. Samples: 2174124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:08:49,081][25689] Avg episode reward: [(0, '-100.045')] [2022-07-09 00:08:49,285][26022] Updated weights on worker 0-0, policy_version 2118 (0.00084) [2022-07-09 00:08:51,044][26022] Updated weights on worker 0-0, policy_version 2128 (0.00097) [2022-07-09 00:08:53,215][26022] Updated weights on worker 0-0, policy_version 2138 (0.00083) [2022-07-09 00:08:54,158][25689] Fps is (10 sec: 5449.4, 60 sec: 5453.0, 300 sec: 5420.1). Total num frames: 2195456. Throughput: 0: 4836.7. Samples: 2190012. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 00:08:54,158][25689] Avg episode reward: [(0, '-100.129')] [2022-07-09 00:08:54,759][26022] Updated weights on worker 0-0, policy_version 2148 (0.00895) [2022-07-09 00:08:56,810][26022] Updated weights on worker 0-0, policy_version 2158 (0.00109) [2022-07-09 00:08:58,446][26022] Updated weights on worker 0-0, policy_version 2168 (0.00089) [2022-07-09 00:08:59,197][25689] Fps is (10 sec: 5263.3, 60 sec: 5385.7, 300 sec: 5420.1). Total num frames: 2221056. Throughput: 0: 5647.4. Samples: 2222696. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 00:08:59,197][25689] Avg episode reward: [(0, '-100.113')] [2022-07-09 00:09:00,592][26022] Updated weights on worker 0-0, policy_version 2178 (0.00085) [2022-07-09 00:09:02,638][26022] Updated weights on worker 0-0, policy_version 2188 (0.00080) [2022-07-09 00:09:04,285][25689] Fps is (10 sec: 5157.0, 60 sec: 5416.9, 300 sec: 5418.7). Total num frames: 2247680. Throughput: 0: 5532.0. Samples: 2253614. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 00:09:04,285][25689] Avg episode reward: [(0, '-100.620')] [2022-07-09 00:09:04,657][26022] Updated weights on worker 0-0, policy_version 2198 (0.00078) [2022-07-09 00:09:06,471][26022] Updated weights on worker 0-0, policy_version 2208 (0.00082) [2022-07-09 00:09:08,455][26022] Updated weights on worker 0-0, policy_version 2218 (0.00073) [2022-07-09 00:09:09,304][25689] Fps is (10 sec: 5471.2, 60 sec: 5434.1, 300 sec: 5428.9). Total num frames: 2276352. Throughput: 0: 4750.5. Samples: 2270196. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:09:09,304][25689] Avg episode reward: [(0, '-101.301')] [2022-07-09 00:09:10,035][26022] Updated weights on worker 0-0, policy_version 2228 (0.00083) [2022-07-09 00:09:12,222][26022] Updated weights on worker 0-0, policy_version 2238 (0.00095) [2022-07-09 00:09:14,055][26022] Updated weights on worker 0-0, policy_version 2248 (0.00088) [2022-07-09 00:09:14,310][25689] Fps is (10 sec: 5515.5, 60 sec: 5401.6, 300 sec: 5422.2). Total num frames: 2302976. Throughput: 0: 5594.7. Samples: 2302760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:09:14,315][25689] Avg episode reward: [(0, '-101.482')] [2022-07-09 00:09:15,902][26022] Updated weights on worker 0-0, policy_version 2258 (0.00090) [2022-07-09 00:09:17,894][26022] Updated weights on worker 0-0, policy_version 2268 (0.00090) [2022-07-09 00:09:19,323][25689] Fps is (10 sec: 5314.2, 60 sec: 5402.4, 300 sec: 5422.5). Total num frames: 2329600. Throughput: 0: 5600.0. Samples: 2335406. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 00:09:19,324][25689] Avg episode reward: [(0, '-101.421')] [2022-07-09 00:09:19,708][26022] Updated weights on worker 0-0, policy_version 2278 (0.00084) [2022-07-09 00:09:21,570][26022] Updated weights on worker 0-0, policy_version 2288 (0.00092) [2022-07-09 00:09:23,272][26022] Updated weights on worker 0-0, policy_version 2298 (0.00081) [2022-07-09 00:09:24,379][25689] Fps is (10 sec: 5492.0, 60 sec: 5421.2, 300 sec: 5425.1). Total num frames: 2358272. Throughput: 0: 4889.1. Samples: 2351856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 00:09:24,379][25689] Avg episode reward: [(0, '-100.067')] [2022-07-09 00:09:25,311][26022] Updated weights on worker 0-0, policy_version 2308 (0.00086) [2022-07-09 00:09:27,201][26022] Updated weights on worker 0-0, policy_version 2318 (0.00093) [2022-07-09 00:09:29,094][26022] Updated weights on worker 0-0, policy_version 2328 (0.00102) [2022-07-09 00:09:29,387][25689] Fps is (10 sec: 5596.3, 60 sec: 5443.5, 300 sec: 5425.5). Total num frames: 2385920. Throughput: 0: 5697.7. Samples: 2384628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 00:09:29,388][25689] Avg episode reward: [(0, '-100.475')] [2022-07-09 00:09:30,946][26022] Updated weights on worker 0-0, policy_version 2338 (0.00090) [2022-07-09 00:09:32,796][26022] Updated weights on worker 0-0, policy_version 2348 (0.00089) [2022-07-09 00:09:34,390][25689] Fps is (10 sec: 5421.0, 60 sec: 5411.1, 300 sec: 5429.0). Total num frames: 2412544. Throughput: 0: 5709.7. Samples: 2417412. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 00:09:34,391][25689] Avg episode reward: [(0, '-99.460')] [2022-07-09 00:09:34,788][26022] Updated weights on worker 0-0, policy_version 2358 (0.00087) [2022-07-09 00:09:36,659][26022] Updated weights on worker 0-0, policy_version 2368 (0.00089) [2022-07-09 00:09:38,550][26022] Updated weights on worker 0-0, policy_version 2378 (0.00095) [2022-07-09 00:09:39,396][25689] Fps is (10 sec: 5422.7, 60 sec: 5419.6, 300 sec: 5429.9). Total num frames: 2440192. Throughput: 0: 4894.9. Samples: 2433658. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 00:09:39,396][25689] Avg episode reward: [(0, '-100.222')] [2022-07-09 00:09:40,278][26022] Updated weights on worker 0-0, policy_version 2388 (0.00097) [2022-07-09 00:09:42,537][26022] Updated weights on worker 0-0, policy_version 2398 (0.00089) [2022-07-09 00:09:44,082][26022] Updated weights on worker 0-0, policy_version 2408 (0.00070) [2022-07-09 00:09:44,448][25689] Fps is (10 sec: 5497.9, 60 sec: 5421.8, 300 sec: 5425.7). Total num frames: 2467840. Throughput: 0: 5696.6. Samples: 2466182. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 00:09:44,448][25689] Avg episode reward: [(0, '-100.428')] [2022-07-09 00:09:46,301][26022] Updated weights on worker 0-0, policy_version 2418 (0.00091) [2022-07-09 00:09:48,024][26022] Updated weights on worker 0-0, policy_version 2428 (0.00094) [2022-07-09 00:09:49,464][25689] Fps is (10 sec: 5288.9, 60 sec: 5392.7, 300 sec: 5418.8). Total num frames: 2493440. Throughput: 0: 5655.2. Samples: 2498164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:09:49,464][25689] Avg episode reward: [(0, '-100.746')] [2022-07-09 00:09:50,111][26022] Updated weights on worker 0-0, policy_version 2438 (0.00089) [2022-07-09 00:09:51,852][26022] Updated weights on worker 0-0, policy_version 2448 (0.00104) [2022-07-09 00:09:53,856][26022] Updated weights on worker 0-0, policy_version 2458 (0.00096) [2022-07-09 00:09:54,469][25689] Fps is (10 sec: 5313.7, 60 sec: 5399.2, 300 sec: 5426.0). Total num frames: 2521088. Throughput: 0: 4839.8. Samples: 2514588. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 00:09:54,469][25689] Avg episode reward: [(0, '-100.621')] [2022-07-09 00:09:55,487][26022] Updated weights on worker 0-0, policy_version 2468 (0.00091) [2022-07-09 00:09:57,531][26022] Updated weights on worker 0-0, policy_version 2478 (0.00092) [2022-07-09 00:09:59,408][26022] Updated weights on worker 0-0, policy_version 2488 (0.00103) [2022-07-09 00:09:59,499][25689] Fps is (10 sec: 5408.2, 60 sec: 5417.0, 300 sec: 5426.1). Total num frames: 2547712. Throughput: 0: 5653.7. Samples: 2547314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 00:09:59,499][25689] Avg episode reward: [(0, '-100.084')] [2022-07-09 00:10:01,237][26022] Updated weights on worker 0-0, policy_version 2498 (0.00085) [2022-07-09 00:10:03,647][26022] Updated weights on worker 0-0, policy_version 2508 (0.00091) [2022-07-09 00:10:04,579][25689] Fps is (10 sec: 5266.7, 60 sec: 5417.7, 300 sec: 5417.9). Total num frames: 2574336. Throughput: 0: 5558.3. Samples: 2578076. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 00:10:04,579][25689] Avg episode reward: [(0, '-100.505')] [2022-07-09 00:10:05,446][26022] Updated weights on worker 0-0, policy_version 2518 (0.00091) [2022-07-09 00:10:07,385][26022] Updated weights on worker 0-0, policy_version 2528 (0.00089) [2022-07-09 00:10:09,093][26022] Updated weights on worker 0-0, policy_version 2538 (0.00091) [2022-07-09 00:10:09,621][25689] Fps is (10 sec: 5260.3, 60 sec: 5381.6, 300 sec: 5417.7). Total num frames: 2600960. Throughput: 0: 5575.3. Samples: 2610550. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 00:10:09,622][25689] Avg episode reward: [(0, '-99.839')] [2022-07-09 00:10:10,553][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:10:10,566][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000002545_2606080.pth [2022-07-09 00:10:10,567][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000000637_652288.pth [2022-07-09 00:10:11,102][26022] Updated weights on worker 0-0, policy_version 2548 (0.00083) [2022-07-09 00:10:12,996][26022] Updated weights on worker 0-0, policy_version 2558 (0.00093) [2022-07-09 00:10:14,659][25689] Fps is (10 sec: 5384.1, 60 sec: 5395.8, 300 sec: 5417.2). Total num frames: 2628608. Throughput: 0: 5550.9. Samples: 2626662. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:10:14,660][25689] Avg episode reward: [(0, '-99.505')] [2022-07-09 00:10:14,916][26022] Updated weights on worker 0-0, policy_version 2568 (0.00085) [2022-07-09 00:10:16,824][26022] Updated weights on worker 0-0, policy_version 2578 (0.00095) [2022-07-09 00:10:18,715][26022] Updated weights on worker 0-0, policy_version 2588 (0.00084) [2022-07-09 00:10:19,716][25689] Fps is (10 sec: 5376.2, 60 sec: 5391.9, 300 sec: 5410.3). Total num frames: 2655232. Throughput: 0: 5522.3. Samples: 2658962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:10:19,717][25689] Avg episode reward: [(0, '-99.593')] [2022-07-09 00:10:20,615][26022] Updated weights on worker 0-0, policy_version 2598 (0.00084) [2022-07-09 00:10:22,342][26022] Updated weights on worker 0-0, policy_version 2608 (0.00095) [2022-07-09 00:10:24,344][26022] Updated weights on worker 0-0, policy_version 2618 (0.00104) [2022-07-09 00:10:24,777][25689] Fps is (10 sec: 5262.5, 60 sec: 5357.5, 300 sec: 5409.3). Total num frames: 2681856. Throughput: 0: 5634.7. Samples: 2691888. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:10:24,778][25689] Avg episode reward: [(0, '-100.321')] [2022-07-09 00:10:26,112][26022] Updated weights on worker 0-0, policy_version 2628 (0.00090) [2022-07-09 00:10:28,268][26022] Updated weights on worker 0-0, policy_version 2638 (0.00084) [2022-07-09 00:10:29,877][25689] Fps is (10 sec: 5341.2, 60 sec: 5349.4, 300 sec: 5404.4). Total num frames: 2709504. Throughput: 0: 4819.6. Samples: 2708176. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 00:10:29,878][25689] Avg episode reward: [(0, '-100.546')] [2022-07-09 00:10:30,028][26022] Updated weights on worker 0-0, policy_version 2648 (0.00086) [2022-07-09 00:10:32,002][26022] Updated weights on worker 0-0, policy_version 2658 (0.00093) [2022-07-09 00:10:33,860][26022] Updated weights on worker 0-0, policy_version 2668 (0.00089) [2022-07-09 00:10:34,939][25689] Fps is (10 sec: 5441.7, 60 sec: 5361.1, 300 sec: 5403.8). Total num frames: 2737152. Throughput: 0: 5607.3. Samples: 2740376. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:10:34,939][25689] Avg episode reward: [(0, '-100.851')] [2022-07-09 00:10:35,707][26022] Updated weights on worker 0-0, policy_version 2678 (0.00092) [2022-07-09 00:10:37,648][26022] Updated weights on worker 0-0, policy_version 2688 (0.00097) [2022-07-09 00:10:39,538][26022] Updated weights on worker 0-0, policy_version 2698 (0.00080) [2022-07-09 00:10:39,976][25689] Fps is (10 sec: 5475.3, 60 sec: 5358.3, 300 sec: 5404.1). Total num frames: 2764800. Throughput: 0: 5627.7. Samples: 2772980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:10:39,977][25689] Avg episode reward: [(0, '-100.192')] [2022-07-09 00:10:41,488][26022] Updated weights on worker 0-0, policy_version 2708 (0.00084) [2022-07-09 00:10:43,191][26022] Updated weights on worker 0-0, policy_version 2718 (0.00090) [2022-07-09 00:10:45,023][26022] Updated weights on worker 0-0, policy_version 2728 (0.01023) [2022-07-09 00:10:45,041][25689] Fps is (10 sec: 5574.9, 60 sec: 5374.0, 300 sec: 5406.6). Total num frames: 2793472. Throughput: 0: 4812.3. Samples: 2789404. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:10:45,042][25689] Avg episode reward: [(0, '-100.227')] [2022-07-09 00:10:46,950][26022] Updated weights on worker 0-0, policy_version 2738 (0.00091) [2022-07-09 00:10:48,901][26022] Updated weights on worker 0-0, policy_version 2748 (0.00093) [2022-07-09 00:10:50,057][25689] Fps is (10 sec: 5485.2, 60 sec: 5390.9, 300 sec: 5403.2). Total num frames: 2820096. Throughput: 0: 5658.4. Samples: 2822364. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:10:50,058][25689] Avg episode reward: [(0, '-99.954')] [2022-07-09 00:10:50,757][26022] Updated weights on worker 0-0, policy_version 2758 (0.00089) [2022-07-09 00:10:52,578][26022] Updated weights on worker 0-0, policy_version 2768 (0.00088) [2022-07-09 00:10:54,477][26022] Updated weights on worker 0-0, policy_version 2778 (0.00086) [2022-07-09 00:10:55,079][25689] Fps is (10 sec: 5304.6, 60 sec: 5372.5, 300 sec: 5396.2). Total num frames: 2846720. Throughput: 0: 5707.9. Samples: 2855336. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:10:55,080][25689] Avg episode reward: [(0, '-99.298')] [2022-07-09 00:10:56,351][26022] Updated weights on worker 0-0, policy_version 2788 (0.00086) [2022-07-09 00:10:58,193][26022] Updated weights on worker 0-0, policy_version 2798 (0.00084) [2022-07-09 00:11:00,011][26022] Updated weights on worker 0-0, policy_version 2808 (0.00088) [2022-07-09 00:11:00,089][25689] Fps is (10 sec: 5614.2, 60 sec: 5425.0, 300 sec: 5417.9). Total num frames: 2876416. Throughput: 0: 4928.1. Samples: 2872096. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 00:11:00,090][25689] Avg episode reward: [(0, '-99.602')] [2022-07-09 00:11:02,320][26022] Updated weights on worker 0-0, policy_version 2818 (0.00088) [2022-07-09 00:11:03,949][26022] Updated weights on worker 0-0, policy_version 2828 (0.00092) [2022-07-09 00:11:05,128][25689] Fps is (10 sec: 5401.0, 60 sec: 5394.9, 300 sec: 5403.4). Total num frames: 2900992. Throughput: 0: 5658.6. Samples: 2903066. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 00:11:05,128][25689] Avg episode reward: [(0, '-99.932')] [2022-07-09 00:11:05,984][26022] Updated weights on worker 0-0, policy_version 2838 (0.00099) [2022-07-09 00:11:07,845][26022] Updated weights on worker 0-0, policy_version 2848 (0.00086) [2022-07-09 00:11:09,752][26022] Updated weights on worker 0-0, policy_version 2858 (0.00091) [2022-07-09 00:11:10,131][25689] Fps is (10 sec: 5200.7, 60 sec: 5415.4, 300 sec: 5410.5). Total num frames: 2928640. Throughput: 0: 5666.1. Samples: 2936102. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:11:10,131][25689] Avg episode reward: [(0, '-99.889')] [2022-07-09 00:11:11,355][26022] Updated weights on worker 0-0, policy_version 2868 (0.00086) [2022-07-09 00:11:13,314][26022] Updated weights on worker 0-0, policy_version 2878 (0.00086) [2022-07-09 00:11:15,150][25689] Fps is (10 sec: 5517.3, 60 sec: 5417.0, 300 sec: 5413.6). Total num frames: 2956288. Throughput: 0: 4860.5. Samples: 2952890. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:11:15,151][25689] Avg episode reward: [(0, '-100.276')] [2022-07-09 00:11:15,356][26022] Updated weights on worker 0-0, policy_version 2888 (0.00084) [2022-07-09 00:11:17,151][26022] Updated weights on worker 0-0, policy_version 2898 (0.00082) [2022-07-09 00:11:19,150][26022] Updated weights on worker 0-0, policy_version 2908 (0.00099) [2022-07-09 00:11:20,172][25689] Fps is (10 sec: 5302.9, 60 sec: 5403.2, 300 sec: 5405.6). Total num frames: 2981888. Throughput: 0: 5659.2. Samples: 2985750. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:11:20,172][25689] Avg episode reward: [(0, '-98.033')] [2022-07-09 00:11:20,786][26022] Updated weights on worker 0-0, policy_version 2918 (0.00083) [2022-07-09 00:11:22,876][26022] Updated weights on worker 0-0, policy_version 2928 (0.00096) [2022-07-09 00:11:24,694][26022] Updated weights on worker 0-0, policy_version 2938 (0.00089) [2022-07-09 00:11:25,294][25689] Fps is (10 sec: 5350.4, 60 sec: 5431.7, 300 sec: 5410.3). Total num frames: 3010560. Throughput: 0: 5689.5. Samples: 3017800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 00:11:25,294][25689] Avg episode reward: [(0, '-98.643')] [2022-07-09 00:11:26,819][26022] Updated weights on worker 0-0, policy_version 2948 (0.00090) [2022-07-09 00:11:28,526][26022] Updated weights on worker 0-0, policy_version 2958 (0.00088) [2022-07-09 00:11:30,366][25689] Fps is (10 sec: 5524.7, 60 sec: 5434.1, 300 sec: 5410.9). Total num frames: 3038208. Throughput: 0: 4829.1. Samples: 3033820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 00:11:30,367][25689] Avg episode reward: [(0, '-99.062')] [2022-07-09 00:11:30,491][26022] Updated weights on worker 0-0, policy_version 2968 (0.00084) [2022-07-09 00:11:32,303][26022] Updated weights on worker 0-0, policy_version 2978 (0.00085) [2022-07-09 00:11:34,262][26022] Updated weights on worker 0-0, policy_version 2988 (0.00090) [2022-07-09 00:11:35,404][25689] Fps is (10 sec: 5469.2, 60 sec: 5436.2, 300 sec: 5410.5). Total num frames: 3065856. Throughput: 0: 5609.1. Samples: 3066496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:11:35,405][25689] Avg episode reward: [(0, '-98.696')] [2022-07-09 00:11:36,183][26022] Updated weights on worker 0-0, policy_version 2998 (0.00085) [2022-07-09 00:11:37,979][26022] Updated weights on worker 0-0, policy_version 3008 (0.00084) [2022-07-09 00:11:39,980][26022] Updated weights on worker 0-0, policy_version 3018 (0.00105) [2022-07-09 00:11:40,442][25689] Fps is (10 sec: 5487.8, 60 sec: 5436.2, 300 sec: 5411.2). Total num frames: 3093504. Throughput: 0: 5608.7. Samples: 3099440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:11:40,444][25689] Avg episode reward: [(0, '-100.082')] [2022-07-09 00:11:41,813][26022] Updated weights on worker 0-0, policy_version 3028 (0.00087) [2022-07-09 00:11:43,748][26022] Updated weights on worker 0-0, policy_version 3038 (0.00091) [2022-07-09 00:11:45,386][26022] Updated weights on worker 0-0, policy_version 3048 (0.00092) [2022-07-09 00:11:45,541][25689] Fps is (10 sec: 5555.9, 60 sec: 5433.1, 300 sec: 5412.9). Total num frames: 3122176. Throughput: 0: 4835.4. Samples: 3115704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 00:11:45,542][25689] Avg episode reward: [(0, '-100.967')] [2022-07-09 00:11:47,566][26022] Updated weights on worker 0-0, policy_version 3058 (0.00090) [2022-07-09 00:11:49,179][26022] Updated weights on worker 0-0, policy_version 3068 (0.00118) [2022-07-09 00:11:50,566][25689] Fps is (10 sec: 5360.8, 60 sec: 5415.4, 300 sec: 5405.9). Total num frames: 3147776. Throughput: 0: 5683.2. Samples: 3148622. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 00:11:50,567][25689] Avg episode reward: [(0, '-100.897')] [2022-07-09 00:11:51,438][26022] Updated weights on worker 0-0, policy_version 3078 (0.00085) [2022-07-09 00:11:52,921][26022] Updated weights on worker 0-0, policy_version 3088 (0.00076) [2022-07-09 00:11:54,964][26022] Updated weights on worker 0-0, policy_version 3098 (0.00093) [2022-07-09 00:11:55,613][25689] Fps is (10 sec: 5388.4, 60 sec: 5447.0, 300 sec: 5415.4). Total num frames: 3176448. Throughput: 0: 5700.1. Samples: 3181688. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 00:11:55,613][25689] Avg episode reward: [(0, '-100.946')] [2022-07-09 00:11:56,726][26022] Updated weights on worker 0-0, policy_version 3108 (0.00083) [2022-07-09 00:11:58,738][26022] Updated weights on worker 0-0, policy_version 3118 (0.00083) [2022-07-09 00:12:00,368][26022] Updated weights on worker 0-0, policy_version 3128 (0.00099) [2022-07-09 00:12:00,645][25689] Fps is (10 sec: 5587.9, 60 sec: 5411.2, 300 sec: 5419.8). Total num frames: 3204096. Throughput: 0: 4888.4. Samples: 3198200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 00:12:00,646][25689] Avg episode reward: [(0, '-100.605')] [2022-07-09 00:12:02,769][26022] Updated weights on worker 0-0, policy_version 3138 (0.00086) [2022-07-09 00:12:04,446][26022] Updated weights on worker 0-0, policy_version 3148 (0.00087) [2022-07-09 00:12:05,731][25689] Fps is (10 sec: 5161.4, 60 sec: 5407.0, 300 sec: 5411.7). Total num frames: 3228672. Throughput: 0: 5613.7. Samples: 3229046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 00:12:05,732][25689] Avg episode reward: [(0, '-100.067')] [2022-07-09 00:12:06,558][26022] Updated weights on worker 0-0, policy_version 3158 (0.00103) [2022-07-09 00:12:08,219][26022] Updated weights on worker 0-0, policy_version 3168 (0.00086) [2022-07-09 00:12:10,269][26022] Updated weights on worker 0-0, policy_version 3178 (0.00087) [2022-07-09 00:12:10,590][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:12:10,599][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000003181_3257344.pth [2022-07-09 00:12:10,600][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000001275_1305600.pth [2022-07-09 00:12:10,801][25689] Fps is (10 sec: 5343.7, 60 sec: 5434.7, 300 sec: 5418.5). Total num frames: 3258368. Throughput: 0: 5608.2. Samples: 3262106. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:12:10,802][25689] Avg episode reward: [(0, '-99.860')] [2022-07-09 00:12:11,898][26022] Updated weights on worker 0-0, policy_version 3188 (0.00087) [2022-07-09 00:12:13,836][26022] Updated weights on worker 0-0, policy_version 3198 (0.00088) [2022-07-09 00:12:15,758][26022] Updated weights on worker 0-0, policy_version 3208 (0.00093) [2022-07-09 00:12:15,855][25689] Fps is (10 sec: 5563.1, 60 sec: 5414.8, 300 sec: 5408.3). Total num frames: 3284992. Throughput: 0: 4788.5. Samples: 3278618. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:12:15,856][25689] Avg episode reward: [(0, '-99.003')] [2022-07-09 00:12:17,614][26022] Updated weights on worker 0-0, policy_version 3218 (0.00091) [2022-07-09 00:12:19,595][26022] Updated weights on worker 0-0, policy_version 3228 (0.00080) [2022-07-09 00:12:20,882][25689] Fps is (10 sec: 5485.6, 60 sec: 5464.9, 300 sec: 5418.9). Total num frames: 3313664. Throughput: 0: 5599.1. Samples: 3311506. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 00:12:20,882][25689] Avg episode reward: [(0, '-98.927')] [2022-07-09 00:12:21,417][26022] Updated weights on worker 0-0, policy_version 3238 (0.00097) [2022-07-09 00:12:23,123][26022] Updated weights on worker 0-0, policy_version 3248 (0.00087) [2022-07-09 00:12:25,391][26022] Updated weights on worker 0-0, policy_version 3258 (0.00087) [2022-07-09 00:12:25,955][25689] Fps is (10 sec: 5474.7, 60 sec: 5435.5, 300 sec: 5414.7). Total num frames: 3340288. Throughput: 0: 5701.9. Samples: 3344362. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 00:12:25,956][25689] Avg episode reward: [(0, '-99.201')] [2022-07-09 00:12:26,999][26022] Updated weights on worker 0-0, policy_version 3268 (0.00096) [2022-07-09 00:12:28,830][26022] Updated weights on worker 0-0, policy_version 3278 (0.00095) [2022-07-09 00:12:30,679][26022] Updated weights on worker 0-0, policy_version 3288 (0.00092) [2022-07-09 00:12:30,968][25689] Fps is (10 sec: 5279.3, 60 sec: 5424.0, 300 sec: 5408.3). Total num frames: 3366912. Throughput: 0: 5711.7. Samples: 3377290. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 00:12:30,969][25689] Avg episode reward: [(0, '-99.021')] [2022-07-09 00:12:32,680][26022] Updated weights on worker 0-0, policy_version 3298 (0.00087) [2022-07-09 00:12:34,345][26022] Updated weights on worker 0-0, policy_version 3308 (0.00097) [2022-07-09 00:12:35,976][25689] Fps is (10 sec: 5518.1, 60 sec: 5443.6, 300 sec: 5418.8). Total num frames: 3395584. Throughput: 0: 5731.9. Samples: 3393950. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:12:35,977][25689] Avg episode reward: [(0, '-99.791')] [2022-07-09 00:12:36,386][26022] Updated weights on worker 0-0, policy_version 3318 (0.00083) [2022-07-09 00:12:38,055][26022] Updated weights on worker 0-0, policy_version 3328 (0.00085) [2022-07-09 00:12:40,244][26022] Updated weights on worker 0-0, policy_version 3338 (0.01140) [2022-07-09 00:12:40,988][25689] Fps is (10 sec: 5723.0, 60 sec: 5462.9, 300 sec: 5419.9). Total num frames: 3424256. Throughput: 0: 5754.1. Samples: 3427200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:12:40,988][25689] Avg episode reward: [(0, '-99.983')] [2022-07-09 00:12:41,752][26022] Updated weights on worker 0-0, policy_version 3348 (0.00085) [2022-07-09 00:12:43,882][26022] Updated weights on worker 0-0, policy_version 3358 (0.00092) [2022-07-09 00:12:45,661][26022] Updated weights on worker 0-0, policy_version 3368 (0.00093) [2022-07-09 00:12:46,051][25689] Fps is (10 sec: 5387.1, 60 sec: 5415.3, 300 sec: 5413.0). Total num frames: 3449856. Throughput: 0: 5751.2. Samples: 3459934. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 00:12:46,051][25689] Avg episode reward: [(0, '-100.484')] [2022-07-09 00:12:47,476][26022] Updated weights on worker 0-0, policy_version 3378 (0.00086) [2022-07-09 00:12:49,462][26022] Updated weights on worker 0-0, policy_version 3388 (0.00088) [2022-07-09 00:12:51,086][25689] Fps is (10 sec: 5374.6, 60 sec: 5465.2, 300 sec: 5423.1). Total num frames: 3478528. Throughput: 0: 4930.9. Samples: 3476490. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 00:12:51,086][25689] Avg episode reward: [(0, '-101.671')] [2022-07-09 00:12:51,200][26022] Updated weights on worker 0-0, policy_version 3398 (0.00084) [2022-07-09 00:12:53,103][26022] Updated weights on worker 0-0, policy_version 3408 (0.00087) [2022-07-09 00:12:54,903][26022] Updated weights on worker 0-0, policy_version 3418 (0.00082) [2022-07-09 00:12:56,097][25689] Fps is (10 sec: 5605.9, 60 sec: 5451.5, 300 sec: 5416.8). Total num frames: 3506176. Throughput: 0: 5737.9. Samples: 3509402. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 00:12:56,098][25689] Avg episode reward: [(0, '-102.506')] [2022-07-09 00:12:56,889][26022] Updated weights on worker 0-0, policy_version 3428 (0.00090) [2022-07-09 00:12:58,771][26022] Updated weights on worker 0-0, policy_version 3438 (0.00088) [2022-07-09 00:13:00,543][26022] Updated weights on worker 0-0, policy_version 3448 (0.00085) [2022-07-09 00:13:01,103][25689] Fps is (10 sec: 5520.3, 60 sec: 5453.9, 300 sec: 5428.1). Total num frames: 3533824. Throughput: 0: 5713.2. Samples: 3542120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 00:13:01,103][25689] Avg episode reward: [(0, '-101.645')] [2022-07-09 00:13:03,100][26022] Updated weights on worker 0-0, policy_version 3458 (0.00099) [2022-07-09 00:13:04,723][26022] Updated weights on worker 0-0, policy_version 3468 (0.00087) [2022-07-09 00:13:06,175][25689] Fps is (10 sec: 5080.6, 60 sec: 5438.2, 300 sec: 5413.4). Total num frames: 3557376. Throughput: 0: 4798.9. Samples: 3556508. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 00:13:06,175][25689] Avg episode reward: [(0, '-101.466')] [2022-07-09 00:13:06,659][26022] Updated weights on worker 0-0, policy_version 3478 (0.00089) [2022-07-09 00:13:08,556][26022] Updated weights on worker 0-0, policy_version 3488 (0.00092) [2022-07-09 00:13:10,496][26022] Updated weights on worker 0-0, policy_version 3498 (0.00084) [2022-07-09 00:13:11,191][25689] Fps is (10 sec: 5075.3, 60 sec: 5409.2, 300 sec: 5410.0). Total num frames: 3585024. Throughput: 0: 5573.8. Samples: 3588552. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:13:11,191][25689] Avg episode reward: [(0, '-101.877')] [2022-07-09 00:13:12,193][26022] Updated weights on worker 0-0, policy_version 3508 (0.00094) [2022-07-09 00:13:14,359][26022] Updated weights on worker 0-0, policy_version 3518 (0.00087) [2022-07-09 00:13:16,008][26022] Updated weights on worker 0-0, policy_version 3528 (0.00090) [2022-07-09 00:13:16,219][25689] Fps is (10 sec: 5505.3, 60 sec: 5428.4, 300 sec: 5413.4). Total num frames: 3612672. Throughput: 0: 5565.5. Samples: 3621390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:13:16,220][25689] Avg episode reward: [(0, '-101.293')] [2022-07-09 00:13:17,930][26022] Updated weights on worker 0-0, policy_version 3538 (0.00101) [2022-07-09 00:13:19,771][26022] Updated weights on worker 0-0, policy_version 3548 (0.00086) [2022-07-09 00:13:21,245][25689] Fps is (10 sec: 5499.9, 60 sec: 5411.5, 300 sec: 5414.3). Total num frames: 3640320. Throughput: 0: 4747.2. Samples: 3637738. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:13:21,245][25689] Avg episode reward: [(0, '-101.114')] [2022-07-09 00:13:21,592][26022] Updated weights on worker 0-0, policy_version 3558 (0.00082) [2022-07-09 00:13:23,792][26022] Updated weights on worker 0-0, policy_version 3568 (0.00089) [2022-07-09 00:13:25,440][26022] Updated weights on worker 0-0, policy_version 3578 (0.00092) [2022-07-09 00:13:26,350][25689] Fps is (10 sec: 5356.8, 60 sec: 5408.7, 300 sec: 5413.6). Total num frames: 3666944. Throughput: 0: 5661.4. Samples: 3670730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:13:26,351][25689] Avg episode reward: [(0, '-100.507')] [2022-07-09 00:13:27,270][26022] Updated weights on worker 0-0, policy_version 3588 (0.00087) [2022-07-09 00:13:29,396][26022] Updated weights on worker 0-0, policy_version 3598 (0.00089) [2022-07-09 00:13:31,015][26022] Updated weights on worker 0-0, policy_version 3608 (0.00097) [2022-07-09 00:13:31,381][25689] Fps is (10 sec: 5455.5, 60 sec: 5441.0, 300 sec: 5413.4). Total num frames: 3695616. Throughput: 0: 5675.7. Samples: 3703144. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:13:31,381][25689] Avg episode reward: [(0, '-100.396')] [2022-07-09 00:13:33,117][26022] Updated weights on worker 0-0, policy_version 3618 (0.00086) [2022-07-09 00:13:34,756][26022] Updated weights on worker 0-0, policy_version 3628 (0.00092) [2022-07-09 00:13:36,395][25689] Fps is (10 sec: 5504.9, 60 sec: 5406.5, 300 sec: 5411.5). Total num frames: 3722240. Throughput: 0: 4866.3. Samples: 3719574. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:13:36,396][25689] Avg episode reward: [(0, '-100.716')] [2022-07-09 00:13:36,772][26022] Updated weights on worker 0-0, policy_version 3638 (0.00085) [2022-07-09 00:13:38,506][26022] Updated weights on worker 0-0, policy_version 3648 (0.00098) [2022-07-09 00:13:40,584][26022] Updated weights on worker 0-0, policy_version 3658 (0.00082) [2022-07-09 00:13:41,407][25689] Fps is (10 sec: 5412.8, 60 sec: 5389.5, 300 sec: 5412.7). Total num frames: 3749888. Throughput: 0: 5686.6. Samples: 3752394. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 00:13:41,408][25689] Avg episode reward: [(0, '-100.211')] [2022-07-09 00:13:42,456][26022] Updated weights on worker 0-0, policy_version 3668 (0.00088) [2022-07-09 00:13:44,376][26022] Updated weights on worker 0-0, policy_version 3678 (0.00089) [2022-07-09 00:13:46,237][26022] Updated weights on worker 0-0, policy_version 3688 (0.00085) [2022-07-09 00:13:46,459][25689] Fps is (10 sec: 5494.6, 60 sec: 5424.4, 300 sec: 5413.1). Total num frames: 3777536. Throughput: 0: 5706.0. Samples: 3785470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 00:13:46,460][25689] Avg episode reward: [(0, '-100.488')] [2022-07-09 00:13:48,063][26022] Updated weights on worker 0-0, policy_version 3698 (0.00080) [2022-07-09 00:13:49,848][26022] Updated weights on worker 0-0, policy_version 3708 (0.00090) [2022-07-09 00:13:51,483][25689] Fps is (10 sec: 5488.2, 60 sec: 5408.5, 300 sec: 5414.0). Total num frames: 3805184. Throughput: 0: 4920.1. Samples: 3802048. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 00:13:51,483][25689] Avg episode reward: [(0, '-100.836')] [2022-07-09 00:13:51,805][26022] Updated weights on worker 0-0, policy_version 3718 (0.00096) [2022-07-09 00:13:53,716][26022] Updated weights on worker 0-0, policy_version 3728 (0.00092) [2022-07-09 00:13:55,454][26022] Updated weights on worker 0-0, policy_version 3738 (0.00099) [2022-07-09 00:13:56,500][25689] Fps is (10 sec: 5507.3, 60 sec: 5408.0, 300 sec: 5421.3). Total num frames: 3832832. Throughput: 0: 5732.5. Samples: 3834822. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 00:13:56,500][25689] Avg episode reward: [(0, '-100.727')] [2022-07-09 00:13:57,334][26022] Updated weights on worker 0-0, policy_version 3748 (0.00090) [2022-07-09 00:13:59,435][26022] Updated weights on worker 0-0, policy_version 3758 (0.00090) [2022-07-09 00:14:00,997][26022] Updated weights on worker 0-0, policy_version 3768 (0.00092) [2022-07-09 00:14:01,509][25689] Fps is (10 sec: 5412.7, 60 sec: 5390.7, 300 sec: 5422.7). Total num frames: 3859456. Throughput: 0: 5738.4. Samples: 3867748. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:14:01,510][25689] Avg episode reward: [(0, '-100.836')] [2022-07-09 00:14:03,427][26022] Updated weights on worker 0-0, policy_version 3778 (0.00094) [2022-07-09 00:14:05,199][26022] Updated weights on worker 0-0, policy_version 3788 (0.00087) [2022-07-09 00:14:06,604][25689] Fps is (10 sec: 5269.6, 60 sec: 5439.5, 300 sec: 5414.5). Total num frames: 3886080. Throughput: 0: 4806.5. Samples: 3882298. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:14:06,605][25689] Avg episode reward: [(0, '-100.329')] [2022-07-09 00:14:07,079][26022] Updated weights on worker 0-0, policy_version 3798 (0.00084) [2022-07-09 00:14:08,804][26022] Updated weights on worker 0-0, policy_version 3808 (0.00088) [2022-07-09 00:14:10,682][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:14:10,700][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000003817_3908608.pth [2022-07-09 00:14:10,701][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000001912_1957888.pth [2022-07-09 00:14:10,800][26022] Updated weights on worker 0-0, policy_version 3818 (0.00091) [2022-07-09 00:14:11,667][25689] Fps is (10 sec: 5343.1, 60 sec: 5435.3, 300 sec: 5416.9). Total num frames: 3913728. Throughput: 0: 5605.6. Samples: 3915192. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:14:11,667][25689] Avg episode reward: [(0, '-100.101')] [2022-07-09 00:14:12,669][26022] Updated weights on worker 0-0, policy_version 3828 (0.00092) [2022-07-09 00:14:14,547][26022] Updated weights on worker 0-0, policy_version 3838 (0.00085) [2022-07-09 00:14:16,302][26022] Updated weights on worker 0-0, policy_version 3848 (0.00088) [2022-07-09 00:14:16,765][25689] Fps is (10 sec: 5442.0, 60 sec: 5429.0, 300 sec: 5418.8). Total num frames: 3941376. Throughput: 0: 5581.7. Samples: 3947938. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:14:16,766][25689] Avg episode reward: [(0, '-100.334')] [2022-07-09 00:14:18,208][26022] Updated weights on worker 0-0, policy_version 3858 (0.00086) [2022-07-09 00:14:20,221][26022] Updated weights on worker 0-0, policy_version 3868 (0.00086) [2022-07-09 00:14:21,836][25689] Fps is (10 sec: 5538.1, 60 sec: 5441.8, 300 sec: 5418.5). Total num frames: 3970048. Throughput: 0: 4762.9. Samples: 3964562. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:14:21,837][25689] Avg episode reward: [(0, '-100.397')] [2022-07-09 00:14:22,095][26022] Updated weights on worker 0-0, policy_version 3878 (0.00091) [2022-07-09 00:14:23,786][26022] Updated weights on worker 0-0, policy_version 3888 (0.00088) [2022-07-09 00:14:25,919][26022] Updated weights on worker 0-0, policy_version 3898 (0.00087) [2022-07-09 00:14:26,940][25689] Fps is (10 sec: 5535.1, 60 sec: 5458.9, 300 sec: 5416.8). Total num frames: 3997696. Throughput: 0: 5667.5. Samples: 3997552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:14:26,941][25689] Avg episode reward: [(0, '-100.783')] [2022-07-09 00:14:27,522][26022] Updated weights on worker 0-0, policy_version 3908 (0.00088) [2022-07-09 00:14:29,525][26022] Updated weights on worker 0-0, policy_version 3918 (0.00103) [2022-07-09 00:14:31,186][26022] Updated weights on worker 0-0, policy_version 3928 (0.00082) [2022-07-09 00:14:32,033][25689] Fps is (10 sec: 5322.7, 60 sec: 5419.5, 300 sec: 5415.1). Total num frames: 4024320. Throughput: 0: 5652.9. Samples: 4030318. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:14:32,033][25689] Avg episode reward: [(0, '-101.276')] [2022-07-09 00:14:33,279][26022] Updated weights on worker 0-0, policy_version 3938 (0.00084) [2022-07-09 00:14:35,137][26022] Updated weights on worker 0-0, policy_version 3948 (0.00087) [2022-07-09 00:14:36,975][26022] Updated weights on worker 0-0, policy_version 3958 (0.00098) [2022-07-09 00:14:37,074][25689] Fps is (10 sec: 5456.5, 60 sec: 5450.9, 300 sec: 5417.9). Total num frames: 4052992. Throughput: 0: 4878.3. Samples: 4047012. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:14:37,075][25689] Avg episode reward: [(0, '-101.626')] [2022-07-09 00:14:38,834][26022] Updated weights on worker 0-0, policy_version 3968 (0.00088) [2022-07-09 00:14:40,730][26022] Updated weights on worker 0-0, policy_version 3978 (0.00096) [2022-07-09 00:14:42,160][25689] Fps is (10 sec: 5662.2, 60 sec: 5461.1, 300 sec: 5420.8). Total num frames: 4081664. Throughput: 0: 5664.6. Samples: 4079688. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:14:42,161][25689] Avg episode reward: [(0, '-102.368')] [2022-07-09 00:14:42,386][26022] Updated weights on worker 0-0, policy_version 3988 (0.00092) [2022-07-09 00:14:44,587][26022] Updated weights on worker 0-0, policy_version 3998 (0.00085) [2022-07-09 00:14:46,297][26022] Updated weights on worker 0-0, policy_version 4008 (0.00084) [2022-07-09 00:14:47,310][25689] Fps is (10 sec: 5402.1, 60 sec: 5435.4, 300 sec: 5421.7). Total num frames: 4108288. Throughput: 0: 5646.5. Samples: 4112572. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 00:14:47,311][25689] Avg episode reward: [(0, '-102.727')] [2022-07-09 00:14:48,130][26022] Updated weights on worker 0-0, policy_version 4018 (0.00093) [2022-07-09 00:14:49,950][26022] Updated weights on worker 0-0, policy_version 4028 (0.00083) [2022-07-09 00:14:51,771][26022] Updated weights on worker 0-0, policy_version 4038 (0.00093) [2022-07-09 00:14:52,337][25689] Fps is (10 sec: 5534.1, 60 sec: 5468.8, 300 sec: 5428.2). Total num frames: 4137984. Throughput: 0: 5698.9. Samples: 4146030. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 00:14:52,337][25689] Avg episode reward: [(0, '-102.611')] [2022-07-09 00:14:53,770][26022] Updated weights on worker 0-0, policy_version 4048 (0.00090) [2022-07-09 00:14:55,523][26022] Updated weights on worker 0-0, policy_version 4058 (0.00093) [2022-07-09 00:14:57,343][25689] Fps is (10 sec: 5613.8, 60 sec: 5453.0, 300 sec: 5428.7). Total num frames: 4164608. Throughput: 0: 5696.1. Samples: 4162462. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 00:14:57,343][25689] Avg episode reward: [(0, '-102.121')] [2022-07-09 00:14:57,361][26022] Updated weights on worker 0-0, policy_version 4068 (0.00089) [2022-07-09 00:14:59,304][26022] Updated weights on worker 0-0, policy_version 4078 (0.00089) [2022-07-09 00:15:01,117][26022] Updated weights on worker 0-0, policy_version 4088 (0.00092) [2022-07-09 00:15:02,407][25689] Fps is (10 sec: 5186.2, 60 sec: 5431.3, 300 sec: 5425.5). Total num frames: 4190208. Throughput: 0: 5727.3. Samples: 4195646. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 00:15:02,407][25689] Avg episode reward: [(0, '-101.581')] [2022-07-09 00:15:03,393][26022] Updated weights on worker 0-0, policy_version 4098 (0.00095) [2022-07-09 00:15:05,141][26022] Updated weights on worker 0-0, policy_version 4108 (0.00082) [2022-07-09 00:15:07,215][26022] Updated weights on worker 0-0, policy_version 4118 (0.00083) [2022-07-09 00:15:07,458][25689] Fps is (10 sec: 5263.9, 60 sec: 5452.0, 300 sec: 5428.8). Total num frames: 4217856. Throughput: 0: 5662.5. Samples: 4226660. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:15:07,459][25689] Avg episode reward: [(0, '-101.090')] [2022-07-09 00:15:08,951][26022] Updated weights on worker 0-0, policy_version 4128 (0.00086) [2022-07-09 00:15:10,761][26022] Updated weights on worker 0-0, policy_version 4138 (0.00087) [2022-07-09 00:15:12,499][25689] Fps is (10 sec: 5580.4, 60 sec: 5470.8, 300 sec: 5432.2). Total num frames: 4246528. Throughput: 0: 4833.0. Samples: 4243474. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:15:12,500][25689] Avg episode reward: [(0, '-100.833')] [2022-07-09 00:15:12,545][26022] Updated weights on worker 0-0, policy_version 4148 (0.00087) [2022-07-09 00:15:14,484][26022] Updated weights on worker 0-0, policy_version 4158 (0.00093) [2022-07-09 00:15:16,508][26022] Updated weights on worker 0-0, policy_version 4168 (0.00086) [2022-07-09 00:15:17,567][25689] Fps is (10 sec: 5571.6, 60 sec: 5473.5, 300 sec: 5435.4). Total num frames: 4274176. Throughput: 0: 5661.8. Samples: 4276968. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 00:15:17,568][25689] Avg episode reward: [(0, '-100.161')] [2022-07-09 00:15:18,060][26022] Updated weights on worker 0-0, policy_version 4178 (0.00084) [2022-07-09 00:15:20,224][26022] Updated weights on worker 0-0, policy_version 4188 (0.00093) [2022-07-09 00:15:21,687][26022] Updated weights on worker 0-0, policy_version 4198 (0.00097) [2022-07-09 00:15:22,631][25689] Fps is (10 sec: 5659.7, 60 sec: 5491.0, 300 sec: 5445.7). Total num frames: 4303872. Throughput: 0: 5676.9. Samples: 4310458. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 00:15:22,633][25689] Avg episode reward: [(0, '-100.519')] [2022-07-09 00:15:23,916][26022] Updated weights on worker 0-0, policy_version 4208 (0.00087) [2022-07-09 00:15:25,393][26022] Updated weights on worker 0-0, policy_version 4218 (0.00085) [2022-07-09 00:15:27,670][26022] Updated weights on worker 0-0, policy_version 4228 (0.00089) [2022-07-09 00:15:27,700][25689] Fps is (10 sec: 5456.7, 60 sec: 5460.4, 300 sec: 5439.4). Total num frames: 4329472. Throughput: 0: 4946.6. Samples: 4326788. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2022-07-09 00:15:27,701][25689] Avg episode reward: [(0, '-100.579')] [2022-07-09 00:15:29,225][26022] Updated weights on worker 0-0, policy_version 4238 (0.00090) [2022-07-09 00:15:31,201][26022] Updated weights on worker 0-0, policy_version 4248 (0.00086) [2022-07-09 00:15:32,709][25689] Fps is (10 sec: 5486.7, 60 sec: 5518.5, 300 sec: 5447.2). Total num frames: 4359168. Throughput: 0: 5746.8. Samples: 4359618. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2022-07-09 00:15:32,710][25689] Avg episode reward: [(0, '-101.648')] [2022-07-09 00:15:33,135][26022] Updated weights on worker 0-0, policy_version 4258 (0.00088) [2022-07-09 00:15:34,815][26022] Updated weights on worker 0-0, policy_version 4268 (0.00085) [2022-07-09 00:15:36,630][26022] Updated weights on worker 0-0, policy_version 4278 (0.00086) [2022-07-09 00:15:37,742][25689] Fps is (10 sec: 5710.8, 60 sec: 5502.5, 300 sec: 5447.3). Total num frames: 4386816. Throughput: 0: 5766.5. Samples: 4393306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:15:37,742][25689] Avg episode reward: [(0, '-101.547')] [2022-07-09 00:15:38,596][26022] Updated weights on worker 0-0, policy_version 4288 (0.00091) [2022-07-09 00:15:40,339][26022] Updated weights on worker 0-0, policy_version 4298 (0.00097) [2022-07-09 00:15:42,332][26022] Updated weights on worker 0-0, policy_version 4308 (0.00084) [2022-07-09 00:15:42,766][25689] Fps is (10 sec: 5396.8, 60 sec: 5474.4, 300 sec: 5441.2). Total num frames: 4413440. Throughput: 0: 4936.0. Samples: 4409842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:15:42,766][25689] Avg episode reward: [(0, '-101.402')] [2022-07-09 00:15:43,886][26022] Updated weights on worker 0-0, policy_version 4318 (0.00088) [2022-07-09 00:15:45,950][26022] Updated weights on worker 0-0, policy_version 4328 (0.00054) [2022-07-09 00:15:47,627][26022] Updated weights on worker 0-0, policy_version 4338 (0.00091) [2022-07-09 00:15:47,893][25689] Fps is (10 sec: 5447.2, 60 sec: 5510.2, 300 sec: 5446.0). Total num frames: 4442112. Throughput: 0: 5773.3. Samples: 4443366. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 00:15:47,895][25689] Avg episode reward: [(0, '-101.077')] [2022-07-09 00:15:49,525][26022] Updated weights on worker 0-0, policy_version 4348 (0.00083) [2022-07-09 00:15:51,435][26022] Updated weights on worker 0-0, policy_version 4358 (0.00092) [2022-07-09 00:15:52,937][25689] Fps is (10 sec: 5637.7, 60 sec: 5491.7, 300 sec: 5452.5). Total num frames: 4470784. Throughput: 0: 5785.1. Samples: 4476638. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 00:15:52,938][25689] Avg episode reward: [(0, '-100.474')] [2022-07-09 00:15:53,296][26022] Updated weights on worker 0-0, policy_version 4368 (0.00083) [2022-07-09 00:15:55,095][26022] Updated weights on worker 0-0, policy_version 4378 (0.00087) [2022-07-09 00:15:57,231][26022] Updated weights on worker 0-0, policy_version 4388 (0.00087) [2022-07-09 00:15:57,959][25689] Fps is (10 sec: 5493.2, 60 sec: 5490.3, 300 sec: 5441.9). Total num frames: 4497408. Throughput: 0: 4917.5. Samples: 4492726. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:15:57,960][25689] Avg episode reward: [(0, '-100.386')] [2022-07-09 00:15:58,852][26022] Updated weights on worker 0-0, policy_version 4398 (0.00083) [2022-07-09 00:16:00,921][26022] Updated weights on worker 0-0, policy_version 4408 (0.00088) [2022-07-09 00:16:02,870][26022] Updated weights on worker 0-0, policy_version 4418 (0.00091) [2022-07-09 00:16:03,035][25689] Fps is (10 sec: 5273.2, 60 sec: 5506.1, 300 sec: 5448.1). Total num frames: 4524032. Throughput: 0: 5724.1. Samples: 4525866. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 00:16:03,036][25689] Avg episode reward: [(0, '-98.490')] [2022-07-09 00:16:05,080][26022] Updated weights on worker 0-0, policy_version 4428 (0.00085) [2022-07-09 00:16:06,612][26022] Updated weights on worker 0-0, policy_version 4438 (0.00089) [2022-07-09 00:16:08,099][25689] Fps is (10 sec: 5352.5, 60 sec: 5505.0, 300 sec: 5447.0). Total num frames: 4551680. Throughput: 0: 5607.5. Samples: 4556670. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:16:08,101][25689] Avg episode reward: [(0, '-98.640')] [2022-07-09 00:16:08,810][26022] Updated weights on worker 0-0, policy_version 4448 (0.00081) [2022-07-09 00:16:10,381][26022] Updated weights on worker 0-0, policy_version 4458 (0.00087) [2022-07-09 00:16:10,848][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:16:10,858][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000004460_4567040.pth [2022-07-09 00:16:10,858][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000002545_2606080.pth [2022-07-09 00:16:12,571][26022] Updated weights on worker 0-0, policy_version 4468 (0.00082) [2022-07-09 00:16:13,137][25689] Fps is (10 sec: 5474.0, 60 sec: 5488.4, 300 sec: 5446.7). Total num frames: 4579328. Throughput: 0: 4773.7. Samples: 4573064. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:16:13,138][25689] Avg episode reward: [(0, '-98.162')] [2022-07-09 00:16:14,130][26022] Updated weights on worker 0-0, policy_version 4478 (0.00088) [2022-07-09 00:16:16,258][26022] Updated weights on worker 0-0, policy_version 4488 (0.00087) [2022-07-09 00:16:17,878][26022] Updated weights on worker 0-0, policy_version 4498 (0.00093) [2022-07-09 00:16:18,208][25689] Fps is (10 sec: 5470.0, 60 sec: 5488.0, 300 sec: 5452.7). Total num frames: 4606976. Throughput: 0: 5602.9. Samples: 4606176. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:16:18,216][25689] Avg episode reward: [(0, '-97.677')] [2022-07-09 00:16:19,919][26022] Updated weights on worker 0-0, policy_version 4508 (0.00089) [2022-07-09 00:16:21,619][26022] Updated weights on worker 0-0, policy_version 4518 (0.00094) [2022-07-09 00:16:23,267][25689] Fps is (10 sec: 5357.1, 60 sec: 5437.8, 300 sec: 5446.9). Total num frames: 4633600. Throughput: 0: 5624.3. Samples: 4639660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:16:23,268][25689] Avg episode reward: [(0, '-97.135')] [2022-07-09 00:16:23,562][26022] Updated weights on worker 0-0, policy_version 4528 (0.00085) [2022-07-09 00:16:25,207][26022] Updated weights on worker 0-0, policy_version 4538 (0.00086) [2022-07-09 00:16:27,395][26022] Updated weights on worker 0-0, policy_version 4548 (0.00096) [2022-07-09 00:16:28,306][25689] Fps is (10 sec: 5577.1, 60 sec: 5508.1, 300 sec: 5454.4). Total num frames: 4663296. Throughput: 0: 4916.1. Samples: 4656012. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 00:16:28,307][25689] Avg episode reward: [(0, '-97.280')] [2022-07-09 00:16:29,117][26022] Updated weights on worker 0-0, policy_version 4558 (0.00086) [2022-07-09 00:16:30,972][26022] Updated weights on worker 0-0, policy_version 4568 (0.00089) [2022-07-09 00:16:32,759][26022] Updated weights on worker 0-0, policy_version 4578 (0.00086) [2022-07-09 00:16:33,323][25689] Fps is (10 sec: 5601.1, 60 sec: 5456.8, 300 sec: 5451.4). Total num frames: 4689920. Throughput: 0: 5735.8. Samples: 4688848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 00:16:33,323][25689] Avg episode reward: [(0, '-97.076')] [2022-07-09 00:16:34,727][26022] Updated weights on worker 0-0, policy_version 4588 (0.00089) [2022-07-09 00:16:36,531][26022] Updated weights on worker 0-0, policy_version 4598 (0.00086) [2022-07-09 00:16:38,353][25689] Fps is (10 sec: 5401.9, 60 sec: 5456.9, 300 sec: 5451.5). Total num frames: 4717568. Throughput: 0: 5753.0. Samples: 4722072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:16:38,353][25689] Avg episode reward: [(0, '-96.689')] [2022-07-09 00:16:38,467][26022] Updated weights on worker 0-0, policy_version 4608 (0.00087) [2022-07-09 00:16:40,143][26022] Updated weights on worker 0-0, policy_version 4618 (0.00086) [2022-07-09 00:16:42,053][26022] Updated weights on worker 0-0, policy_version 4628 (0.00083) [2022-07-09 00:16:43,365][25689] Fps is (10 sec: 5404.1, 60 sec: 5458.0, 300 sec: 5446.2). Total num frames: 4744192. Throughput: 0: 5752.0. Samples: 4755264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:16:43,366][25689] Avg episode reward: [(0, '-96.613')] [2022-07-09 00:16:43,938][26022] Updated weights on worker 0-0, policy_version 4638 (0.00084) [2022-07-09 00:16:45,880][26022] Updated weights on worker 0-0, policy_version 4648 (0.00087) [2022-07-09 00:16:47,641][26022] Updated weights on worker 0-0, policy_version 4658 (0.00093) [2022-07-09 00:16:48,507][25689] Fps is (10 sec: 5748.2, 60 sec: 5507.4, 300 sec: 5464.7). Total num frames: 4775936. Throughput: 0: 5733.1. Samples: 4771826. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:16:48,507][25689] Avg episode reward: [(0, '-97.520')] [2022-07-09 00:16:49,658][26022] Updated weights on worker 0-0, policy_version 4668 (0.00087) [2022-07-09 00:16:51,323][26022] Updated weights on worker 0-0, policy_version 4678 (0.00101) [2022-07-09 00:16:53,347][26022] Updated weights on worker 0-0, policy_version 4688 (0.00080) [2022-07-09 00:16:53,542][25689] Fps is (10 sec: 5634.7, 60 sec: 5457.5, 300 sec: 5454.6). Total num frames: 4801536. Throughput: 0: 5740.0. Samples: 4804910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:16:53,543][25689] Avg episode reward: [(0, '-97.612')] [2022-07-09 00:16:54,932][26022] Updated weights on worker 0-0, policy_version 4698 (0.00096) [2022-07-09 00:16:56,983][26022] Updated weights on worker 0-0, policy_version 4708 (0.00088) [2022-07-09 00:16:58,575][25689] Fps is (10 sec: 5289.0, 60 sec: 5473.4, 300 sec: 5454.6). Total num frames: 4829184. Throughput: 0: 5727.2. Samples: 4837886. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 00:16:58,576][25689] Avg episode reward: [(0, '-97.619')] [2022-07-09 00:16:58,799][26022] Updated weights on worker 0-0, policy_version 4718 (0.00087) [2022-07-09 00:17:00,720][26022] Updated weights on worker 0-0, policy_version 4728 (0.00084) [2022-07-09 00:17:03,051][26022] Updated weights on worker 0-0, policy_version 4738 (0.00085) [2022-07-09 00:17:03,591][25689] Fps is (10 sec: 5299.0, 60 sec: 5461.9, 300 sec: 5459.3). Total num frames: 4854784. Throughput: 0: 4914.3. Samples: 4854658. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 00:17:03,593][25689] Avg episode reward: [(0, '-98.520')] [2022-07-09 00:17:04,730][26022] Updated weights on worker 0-0, policy_version 4748 (0.00081) [2022-07-09 00:17:06,615][26022] Updated weights on worker 0-0, policy_version 4758 (0.00081) [2022-07-09 00:17:08,672][25689] Fps is (10 sec: 5171.8, 60 sec: 5443.4, 300 sec: 5448.8). Total num frames: 4881408. Throughput: 0: 5630.5. Samples: 4885368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:17:08,675][25689] Avg episode reward: [(0, '-98.641')] [2022-07-09 00:17:08,767][26022] Updated weights on worker 0-0, policy_version 4768 (0.00085) [2022-07-09 00:17:10,319][26022] Updated weights on worker 0-0, policy_version 4778 (0.00095) [2022-07-09 00:17:12,357][26022] Updated weights on worker 0-0, policy_version 4788 (0.00085) [2022-07-09 00:17:13,687][25689] Fps is (10 sec: 5578.2, 60 sec: 5479.3, 300 sec: 5459.8). Total num frames: 4911104. Throughput: 0: 5627.8. Samples: 4918282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:17:13,689][25689] Avg episode reward: [(0, '-98.073')] [2022-07-09 00:17:14,099][26022] Updated weights on worker 0-0, policy_version 4798 (0.00083) [2022-07-09 00:17:16,055][26022] Updated weights on worker 0-0, policy_version 4808 (0.00102) [2022-07-09 00:17:18,032][26022] Updated weights on worker 0-0, policy_version 4818 (0.00083) [2022-07-09 00:17:18,721][25689] Fps is (10 sec: 5502.7, 60 sec: 5448.8, 300 sec: 5449.4). Total num frames: 4936704. Throughput: 0: 4818.9. Samples: 4934970. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:17:18,722][25689] Avg episode reward: [(0, '-97.545')] [2022-07-09 00:17:19,591][26022] Updated weights on worker 0-0, policy_version 4828 (0.00083) [2022-07-09 00:17:21,740][26022] Updated weights on worker 0-0, policy_version 4838 (0.00050) [2022-07-09 00:17:23,299][26022] Updated weights on worker 0-0, policy_version 4848 (0.00089) [2022-07-09 00:17:23,740][25689] Fps is (10 sec: 5500.6, 60 sec: 5503.3, 300 sec: 5460.7). Total num frames: 4966400. Throughput: 0: 5634.4. Samples: 4968186. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:17:23,741][25689] Avg episode reward: [(0, '-96.879')] [2022-07-09 00:17:25,375][26022] Updated weights on worker 0-0, policy_version 4858 (0.00091) [2022-07-09 00:17:27,164][26022] Updated weights on worker 0-0, policy_version 4868 (0.00086) [2022-07-09 00:17:28,840][25689] Fps is (10 sec: 5565.9, 60 sec: 5447.0, 300 sec: 5459.1). Total num frames: 4993024. Throughput: 0: 5728.9. Samples: 5000906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:17:28,841][25689] Avg episode reward: [(0, '-97.461')] [2022-07-09 00:17:29,142][26022] Updated weights on worker 0-0, policy_version 4878 (0.00087) [2022-07-09 00:17:30,926][26022] Updated weights on worker 0-0, policy_version 4888 (0.00090) [2022-07-09 00:17:32,877][26022] Updated weights on worker 0-0, policy_version 4898 (0.00092) [2022-07-09 00:17:33,857][25689] Fps is (10 sec: 5465.7, 60 sec: 5480.8, 300 sec: 5458.9). Total num frames: 5021696. Throughput: 0: 4910.4. Samples: 5017324. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:17:33,859][25689] Avg episode reward: [(0, '-97.417')] [2022-07-09 00:17:34,688][26022] Updated weights on worker 0-0, policy_version 4908 (0.00094) [2022-07-09 00:17:36,514][26022] Updated weights on worker 0-0, policy_version 4918 (0.00081) [2022-07-09 00:17:38,315][26022] Updated weights on worker 0-0, policy_version 4928 (0.00079) [2022-07-09 00:17:38,883][25689] Fps is (10 sec: 5506.1, 60 sec: 5464.3, 300 sec: 5451.8). Total num frames: 5048320. Throughput: 0: 5740.7. Samples: 5050710. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:17:38,883][25689] Avg episode reward: [(0, '-96.610')] [2022-07-09 00:17:40,117][26022] Updated weights on worker 0-0, policy_version 4938 (0.00088) [2022-07-09 00:17:42,043][26022] Updated weights on worker 0-0, policy_version 4948 (0.00095) [2022-07-09 00:17:43,885][25689] Fps is (10 sec: 5309.8, 60 sec: 5465.2, 300 sec: 5456.3). Total num frames: 5074944. Throughput: 0: 5741.5. Samples: 5083850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:17:43,886][25689] Avg episode reward: [(0, '-96.076')] [2022-07-09 00:17:44,095][26022] Updated weights on worker 0-0, policy_version 4958 (0.00067) [2022-07-09 00:17:45,617][26022] Updated weights on worker 0-0, policy_version 4968 (0.00095) [2022-07-09 00:17:47,688][26022] Updated weights on worker 0-0, policy_version 4978 (0.00086) [2022-07-09 00:17:48,958][25689] Fps is (10 sec: 5691.8, 60 sec: 5454.5, 300 sec: 5462.5). Total num frames: 5105664. Throughput: 0: 4948.9. Samples: 5100466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:17:48,959][25689] Avg episode reward: [(0, '-95.761')] [2022-07-09 00:17:49,484][26022] Updated weights on worker 0-0, policy_version 4988 (0.00083) [2022-07-09 00:17:51,308][26022] Updated weights on worker 0-0, policy_version 4998 (0.00088) [2022-07-09 00:17:53,102][26022] Updated weights on worker 0-0, policy_version 5008 (0.00093) [2022-07-09 00:17:53,987][25689] Fps is (10 sec: 5676.5, 60 sec: 5472.0, 300 sec: 5458.8). Total num frames: 5132288. Throughput: 0: 5785.9. Samples: 5133796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:17:53,988][25689] Avg episode reward: [(0, '-95.814')] [2022-07-09 00:17:54,929][26022] Updated weights on worker 0-0, policy_version 5018 (0.00092) [2022-07-09 00:17:56,772][26022] Updated weights on worker 0-0, policy_version 5028 (0.00086) [2022-07-09 00:17:58,914][26022] Updated weights on worker 0-0, policy_version 5038 (0.00086) [2022-07-09 00:17:59,014][25689] Fps is (10 sec: 5396.9, 60 sec: 5472.5, 300 sec: 5458.4). Total num frames: 5159936. Throughput: 0: 5772.2. Samples: 5166910. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:17:59,014][25689] Avg episode reward: [(0, '-94.790')] [2022-07-09 00:18:00,591][26022] Updated weights on worker 0-0, policy_version 5048 (0.00087) [2022-07-09 00:18:02,867][26022] Updated weights on worker 0-0, policy_version 5058 (0.00089) [2022-07-09 00:18:04,108][25689] Fps is (10 sec: 5362.4, 60 sec: 5482.4, 300 sec: 5468.3). Total num frames: 5186560. Throughput: 0: 4916.5. Samples: 5183276. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:18:04,110][25689] Avg episode reward: [(0, '-95.546')] [2022-07-09 00:18:04,602][26022] Updated weights on worker 0-0, policy_version 5068 (0.00356) [2022-07-09 00:18:06,489][26022] Updated weights on worker 0-0, policy_version 5078 (0.00083) [2022-07-09 00:18:08,466][26022] Updated weights on worker 0-0, policy_version 5088 (0.00098) [2022-07-09 00:18:09,163][25689] Fps is (10 sec: 5246.6, 60 sec: 5484.8, 300 sec: 5464.1). Total num frames: 5213184. Throughput: 0: 5650.2. Samples: 5214630. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:18:09,164][25689] Avg episode reward: [(0, '-95.491')] [2022-07-09 00:18:10,367][26022] Updated weights on worker 0-0, policy_version 5098 (0.00090) [2022-07-09 00:18:10,860][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:18:10,879][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000005101_5223424.pth [2022-07-09 00:18:10,880][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000003181_3257344.pth [2022-07-09 00:18:11,796][26022] Updated weights on worker 0-0, policy_version 5108 (0.00092) [2022-07-09 00:18:14,041][26022] Updated weights on worker 0-0, policy_version 5118 (0.00086) [2022-07-09 00:18:14,177][25689] Fps is (10 sec: 5390.0, 60 sec: 5451.0, 300 sec: 5464.4). Total num frames: 5240832. Throughput: 0: 5664.2. Samples: 5248156. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:18:14,179][25689] Avg episode reward: [(0, '-95.755')] [2022-07-09 00:18:15,557][26022] Updated weights on worker 0-0, policy_version 5128 (0.00085) [2022-07-09 00:18:17,559][26022] Updated weights on worker 0-0, policy_version 5138 (0.00080) [2022-07-09 00:18:19,199][25689] Fps is (10 sec: 5611.9, 60 sec: 5502.9, 300 sec: 5467.9). Total num frames: 5269504. Throughput: 0: 4845.0. Samples: 5264706. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:18:19,199][25689] Avg episode reward: [(0, '-96.358')] [2022-07-09 00:18:19,413][26022] Updated weights on worker 0-0, policy_version 5148 (0.00084) [2022-07-09 00:18:21,250][26022] Updated weights on worker 0-0, policy_version 5158 (0.00092) [2022-07-09 00:18:23,250][26022] Updated weights on worker 0-0, policy_version 5168 (0.00089) [2022-07-09 00:18:24,234][25689] Fps is (10 sec: 5600.1, 60 sec: 5467.5, 300 sec: 5472.6). Total num frames: 5297152. Throughput: 0: 5709.8. Samples: 5298192. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 00:18:24,235][25689] Avg episode reward: [(0, '-96.202')] [2022-07-09 00:18:25,007][26022] Updated weights on worker 0-0, policy_version 5178 (0.00088) [2022-07-09 00:18:26,891][26022] Updated weights on worker 0-0, policy_version 5188 (0.00099) [2022-07-09 00:18:29,072][26022] Updated weights on worker 0-0, policy_version 5198 (0.00083) [2022-07-09 00:18:29,339][25689] Fps is (10 sec: 5352.0, 60 sec: 5467.1, 300 sec: 5464.4). Total num frames: 5323776. Throughput: 0: 5746.6. Samples: 5330574. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 00:18:29,340][25689] Avg episode reward: [(0, '-97.039')] [2022-07-09 00:18:30,606][26022] Updated weights on worker 0-0, policy_version 5208 (0.00089) [2022-07-09 00:18:32,820][26022] Updated weights on worker 0-0, policy_version 5218 (0.00087) [2022-07-09 00:18:34,345][25689] Fps is (10 sec: 5469.0, 60 sec: 5468.1, 300 sec: 5471.4). Total num frames: 5352448. Throughput: 0: 4902.3. Samples: 5347024. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 00:18:34,345][25689] Avg episode reward: [(0, '-97.113')] [2022-07-09 00:18:34,423][26022] Updated weights on worker 0-0, policy_version 5228 (0.00087) [2022-07-09 00:18:36,442][26022] Updated weights on worker 0-0, policy_version 5238 (0.00087) [2022-07-09 00:18:38,171][26022] Updated weights on worker 0-0, policy_version 5248 (0.00621) [2022-07-09 00:18:39,375][25689] Fps is (10 sec: 5509.9, 60 sec: 5467.7, 300 sec: 5467.7). Total num frames: 5379072. Throughput: 0: 5708.6. Samples: 5379884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 00:18:39,375][25689] Avg episode reward: [(0, '-96.986')] [2022-07-09 00:18:39,844][26022] Updated weights on worker 0-0, policy_version 5258 (0.00096) [2022-07-09 00:18:41,810][26022] Updated weights on worker 0-0, policy_version 5268 (0.00100) [2022-07-09 00:18:43,615][26022] Updated weights on worker 0-0, policy_version 5278 (0.00087) [2022-07-09 00:18:44,394][25689] Fps is (10 sec: 5400.5, 60 sec: 5483.1, 300 sec: 5468.2). Total num frames: 5406720. Throughput: 0: 5709.3. Samples: 5413292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 00:18:44,396][25689] Avg episode reward: [(0, '-96.728')] [2022-07-09 00:18:45,363][26022] Updated weights on worker 0-0, policy_version 5288 (0.00085) [2022-07-09 00:18:47,590][26022] Updated weights on worker 0-0, policy_version 5298 (0.00051) [2022-07-09 00:18:49,248][26022] Updated weights on worker 0-0, policy_version 5308 (0.00084) [2022-07-09 00:18:49,433][25689] Fps is (10 sec: 5701.6, 60 sec: 5469.2, 300 sec: 5474.9). Total num frames: 5436416. Throughput: 0: 4932.3. Samples: 5429680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:18:49,434][25689] Avg episode reward: [(0, '-96.514')] [2022-07-09 00:18:51,156][26022] Updated weights on worker 0-0, policy_version 5318 (0.00087) [2022-07-09 00:18:53,051][26022] Updated weights on worker 0-0, policy_version 5328 (0.00096) [2022-07-09 00:18:54,467][25689] Fps is (10 sec: 5489.8, 60 sec: 5451.9, 300 sec: 5467.7). Total num frames: 5462016. Throughput: 0: 5746.9. Samples: 5462662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:18:54,472][25689] Avg episode reward: [(0, '-96.440')] [2022-07-09 00:18:54,907][26022] Updated weights on worker 0-0, policy_version 5338 (0.00085) [2022-07-09 00:18:56,768][26022] Updated weights on worker 0-0, policy_version 5348 (0.00093) [2022-07-09 00:18:58,691][26022] Updated weights on worker 0-0, policy_version 5358 (0.00091) [2022-07-09 00:18:59,504][25689] Fps is (10 sec: 5388.7, 60 sec: 5467.9, 300 sec: 5474.0). Total num frames: 5490688. Throughput: 0: 5743.6. Samples: 5495498. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 00:18:59,504][25689] Avg episode reward: [(0, '-96.009')] [2022-07-09 00:19:00,390][26022] Updated weights on worker 0-0, policy_version 5368 (0.00082) [2022-07-09 00:19:02,680][26022] Updated weights on worker 0-0, policy_version 5378 (0.00083) [2022-07-09 00:19:04,514][25689] Fps is (10 sec: 5401.9, 60 sec: 5458.6, 300 sec: 5472.2). Total num frames: 5516288. Throughput: 0: 4860.4. Samples: 5511080. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 00:19:04,514][25689] Avg episode reward: [(0, '-95.000')] [2022-07-09 00:19:04,543][26022] Updated weights on worker 0-0, policy_version 5388 (0.00095) [2022-07-09 00:19:06,476][26022] Updated weights on worker 0-0, policy_version 5398 (0.00081) [2022-07-09 00:19:08,462][26022] Updated weights on worker 0-0, policy_version 5408 (0.00523) [2022-07-09 00:19:09,564][25689] Fps is (10 sec: 5394.7, 60 sec: 5492.9, 300 sec: 5475.8). Total num frames: 5544960. Throughput: 0: 5659.9. Samples: 5543624. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:19:09,565][25689] Avg episode reward: [(0, '-95.377')] [2022-07-09 00:19:09,927][26022] Updated weights on worker 0-0, policy_version 5418 (0.00080) [2022-07-09 00:19:12,026][26022] Updated weights on worker 0-0, policy_version 5428 (0.00097) [2022-07-09 00:19:13,907][26022] Updated weights on worker 0-0, policy_version 5438 (0.00090) [2022-07-09 00:19:14,644][25689] Fps is (10 sec: 5559.3, 60 sec: 5486.9, 300 sec: 5476.2). Total num frames: 5572608. Throughput: 0: 5671.0. Samples: 5577090. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:19:14,645][25689] Avg episode reward: [(0, '-95.572')] [2022-07-09 00:19:15,646][26022] Updated weights on worker 0-0, policy_version 5448 (0.00088) [2022-07-09 00:19:17,793][26022] Updated weights on worker 0-0, policy_version 5458 (0.00094) [2022-07-09 00:19:19,046][26022] Updated weights on worker 0-0, policy_version 5468 (0.00101) [2022-07-09 00:19:19,664][25689] Fps is (10 sec: 5576.2, 60 sec: 5487.0, 300 sec: 5477.1). Total num frames: 5601280. Throughput: 0: 4878.8. Samples: 5593856. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:19:19,665][25689] Avg episode reward: [(0, '-95.085')] [2022-07-09 00:19:21,347][26022] Updated weights on worker 0-0, policy_version 5478 (0.00085) [2022-07-09 00:19:22,910][26022] Updated weights on worker 0-0, policy_version 5488 (0.00084) [2022-07-09 00:19:24,716][25689] Fps is (10 sec: 5490.1, 60 sec: 5468.6, 300 sec: 5474.6). Total num frames: 5627904. Throughput: 0: 5744.3. Samples: 5627130. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:19:24,717][25689] Avg episode reward: [(0, '-95.422')] [2022-07-09 00:19:25,005][26022] Updated weights on worker 0-0, policy_version 5498 (0.00106) [2022-07-09 00:19:26,800][26022] Updated weights on worker 0-0, policy_version 5508 (0.00095) [2022-07-09 00:19:28,811][26022] Updated weights on worker 0-0, policy_version 5518 (0.00090) [2022-07-09 00:19:29,787][25689] Fps is (10 sec: 5564.0, 60 sec: 5522.5, 300 sec: 5485.3). Total num frames: 5657600. Throughput: 0: 5760.2. Samples: 5660110. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:19:29,787][25689] Avg episode reward: [(0, '-95.484')] [2022-07-09 00:19:30,596][26022] Updated weights on worker 0-0, policy_version 5528 (0.00093) [2022-07-09 00:19:32,480][26022] Updated weights on worker 0-0, policy_version 5538 (0.00099) [2022-07-09 00:19:33,987][26022] Updated weights on worker 0-0, policy_version 5548 (0.00093) [2022-07-09 00:19:34,847][25689] Fps is (10 sec: 5660.3, 60 sec: 5500.6, 300 sec: 5481.5). Total num frames: 5685248. Throughput: 0: 5753.1. Samples: 5693320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 00:19:34,848][25689] Avg episode reward: [(0, '-95.055')] [2022-07-09 00:19:36,201][26022] Updated weights on worker 0-0, policy_version 5558 (0.00086) [2022-07-09 00:19:37,752][26022] Updated weights on worker 0-0, policy_version 5568 (0.00088) [2022-07-09 00:19:39,891][25689] Fps is (10 sec: 5269.7, 60 sec: 5482.4, 300 sec: 5472.0). Total num frames: 5710848. Throughput: 0: 5736.1. Samples: 5709880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 00:19:39,892][25689] Avg episode reward: [(0, '-94.640')] [2022-07-09 00:19:39,963][26022] Updated weights on worker 0-0, policy_version 5578 (0.00093) [2022-07-09 00:19:41,479][26022] Updated weights on worker 0-0, policy_version 5588 (0.00083) [2022-07-09 00:19:43,445][26022] Updated weights on worker 0-0, policy_version 5598 (0.00095) [2022-07-09 00:19:44,914][25689] Fps is (10 sec: 5492.7, 60 sec: 5515.9, 300 sec: 5484.7). Total num frames: 5740544. Throughput: 0: 5748.0. Samples: 5743228. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 00:19:44,915][25689] Avg episode reward: [(0, '-94.954')] [2022-07-09 00:19:45,376][26022] Updated weights on worker 0-0, policy_version 5608 (0.00090) [2022-07-09 00:19:47,055][26022] Updated weights on worker 0-0, policy_version 5618 (0.00084) [2022-07-09 00:19:49,038][26022] Updated weights on worker 0-0, policy_version 5628 (0.00091) [2022-07-09 00:19:49,977][25689] Fps is (10 sec: 5787.2, 60 sec: 5496.8, 300 sec: 5480.6). Total num frames: 5769216. Throughput: 0: 5767.2. Samples: 5776552. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 00:19:49,977][25689] Avg episode reward: [(0, '-94.555')] [2022-07-09 00:19:50,919][26022] Updated weights on worker 0-0, policy_version 5638 (0.00098) [2022-07-09 00:19:52,566][26022] Updated weights on worker 0-0, policy_version 5648 (0.00086) [2022-07-09 00:19:54,580][26022] Updated weights on worker 0-0, policy_version 5658 (0.00088) [2022-07-09 00:19:54,988][25689] Fps is (10 sec: 5590.8, 60 sec: 5532.7, 300 sec: 5483.9). Total num frames: 5796864. Throughput: 0: 4964.8. Samples: 5793316. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:19:54,989][25689] Avg episode reward: [(0, '-93.742')] [2022-07-09 00:19:56,252][26022] Updated weights on worker 0-0, policy_version 5668 (0.00096) [2022-07-09 00:19:58,134][26022] Updated weights on worker 0-0, policy_version 5678 (0.00098) [2022-07-09 00:20:00,007][25689] Fps is (10 sec: 5411.1, 60 sec: 5500.6, 300 sec: 5488.2). Total num frames: 5823488. Throughput: 0: 5801.3. Samples: 5826576. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:20:00,007][25689] Avg episode reward: [(0, '-92.545')] [2022-07-09 00:20:00,282][26022] Updated weights on worker 0-0, policy_version 5688 (0.00085) [2022-07-09 00:20:01,817][26022] Updated weights on worker 0-0, policy_version 5698 (0.00087) [2022-07-09 00:20:04,321][26022] Updated weights on worker 0-0, policy_version 5708 (0.00082) [2022-07-09 00:20:05,106][25689] Fps is (10 sec: 5161.6, 60 sec: 5492.4, 300 sec: 5480.4). Total num frames: 5849088. Throughput: 0: 5665.5. Samples: 5857624. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:20:05,106][25689] Avg episode reward: [(0, '-92.044')] [2022-07-09 00:20:05,902][26022] Updated weights on worker 0-0, policy_version 5718 (0.00086) [2022-07-09 00:20:07,806][26022] Updated weights on worker 0-0, policy_version 5728 (0.00087) [2022-07-09 00:20:09,963][26022] Updated weights on worker 0-0, policy_version 5738 (0.00090) [2022-07-09 00:20:10,216][25689] Fps is (10 sec: 5215.6, 60 sec: 5470.2, 300 sec: 5475.7). Total num frames: 5876736. Throughput: 0: 4822.5. Samples: 5874156. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 00:20:10,217][25689] Avg episode reward: [(0, '-92.442')] [2022-07-09 00:20:11,136][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:20:11,146][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000005747_5884928.pth [2022-07-09 00:20:11,146][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000003817_3908608.pth [2022-07-09 00:20:11,479][26022] Updated weights on worker 0-0, policy_version 5748 (0.00087) [2022-07-09 00:20:13,682][26022] Updated weights on worker 0-0, policy_version 5758 (0.00089) [2022-07-09 00:20:15,255][25689] Fps is (10 sec: 5649.9, 60 sec: 5507.6, 300 sec: 5483.1). Total num frames: 5906432. Throughput: 0: 5622.8. Samples: 5907276. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 00:20:15,256][25689] Avg episode reward: [(0, '-91.600')] [2022-07-09 00:20:15,258][26022] Updated weights on worker 0-0, policy_version 5768 (0.00084) [2022-07-09 00:20:17,167][26022] Updated weights on worker 0-0, policy_version 5778 (0.00079) [2022-07-09 00:20:19,051][26022] Updated weights on worker 0-0, policy_version 5788 (0.00096) [2022-07-09 00:20:20,292][25689] Fps is (10 sec: 5589.5, 60 sec: 5472.3, 300 sec: 5473.3). Total num frames: 5933056. Throughput: 0: 5612.4. Samples: 5940426. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:20:20,292][25689] Avg episode reward: [(0, '-92.306')] [2022-07-09 00:20:20,973][26022] Updated weights on worker 0-0, policy_version 5798 (0.00091) [2022-07-09 00:20:22,599][26022] Updated weights on worker 0-0, policy_version 5808 (0.00104) [2022-07-09 00:20:24,651][26022] Updated weights on worker 0-0, policy_version 5818 (0.00089) [2022-07-09 00:20:25,387][25689] Fps is (10 sec: 5457.8, 60 sec: 5502.2, 300 sec: 5483.1). Total num frames: 5961728. Throughput: 0: 4908.5. Samples: 5957174. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:20:25,388][25689] Avg episode reward: [(0, '-92.467')] [2022-07-09 00:20:26,239][26022] Updated weights on worker 0-0, policy_version 5828 (0.00083) [2022-07-09 00:20:28,405][26022] Updated weights on worker 0-0, policy_version 5838 (0.00096) [2022-07-09 00:20:30,110][26022] Updated weights on worker 0-0, policy_version 5848 (0.00091) [2022-07-09 00:20:30,452][25689] Fps is (10 sec: 5543.2, 60 sec: 5468.9, 300 sec: 5475.2). Total num frames: 5989376. Throughput: 0: 5732.8. Samples: 5990166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:20:30,453][25689] Avg episode reward: [(0, '-93.444')] [2022-07-09 00:20:32,106][26022] Updated weights on worker 0-0, policy_version 5858 (0.00090) [2022-07-09 00:20:34,015][26022] Updated weights on worker 0-0, policy_version 5868 (0.00097) [2022-07-09 00:20:35,481][25689] Fps is (10 sec: 5477.9, 60 sec: 5471.8, 300 sec: 5475.3). Total num frames: 6017024. Throughput: 0: 5729.5. Samples: 6023160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:20:35,482][25689] Avg episode reward: [(0, '-93.579')] [2022-07-09 00:20:35,917][26022] Updated weights on worker 0-0, policy_version 5878 (0.00085) [2022-07-09 00:20:37,610][26022] Updated weights on worker 0-0, policy_version 5888 (0.00088) [2022-07-09 00:20:39,510][26022] Updated weights on worker 0-0, policy_version 5898 (0.00089) [2022-07-09 00:20:40,503][25689] Fps is (10 sec: 5501.9, 60 sec: 5507.6, 300 sec: 5478.8). Total num frames: 6044672. Throughput: 0: 4913.2. Samples: 6039724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:20:40,504][25689] Avg episode reward: [(0, '-93.946')] [2022-07-09 00:20:41,157][26022] Updated weights on worker 0-0, policy_version 5908 (0.00088) [2022-07-09 00:20:43,435][26022] Updated weights on worker 0-0, policy_version 5918 (0.00085) [2022-07-09 00:20:44,954][26022] Updated weights on worker 0-0, policy_version 5928 (0.00093) [2022-07-09 00:20:45,519][25689] Fps is (10 sec: 5611.0, 60 sec: 5491.3, 300 sec: 5480.8). Total num frames: 6073344. Throughput: 0: 5756.1. Samples: 6073054. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:20:45,519][25689] Avg episode reward: [(0, '-95.000')] [2022-07-09 00:20:46,943][26022] Updated weights on worker 0-0, policy_version 5938 (0.00085) [2022-07-09 00:20:48,768][26022] Updated weights on worker 0-0, policy_version 5948 (0.00087) [2022-07-09 00:20:50,447][26022] Updated weights on worker 0-0, policy_version 5958 (0.00098) [2022-07-09 00:20:50,641][25689] Fps is (10 sec: 5555.0, 60 sec: 5469.0, 300 sec: 5475.9). Total num frames: 6100992. Throughput: 0: 5749.5. Samples: 6106244. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:20:50,642][25689] Avg episode reward: [(0, '-93.635')] [2022-07-09 00:20:52,507][26022] Updated weights on worker 0-0, policy_version 5968 (0.00094) [2022-07-09 00:20:54,143][26022] Updated weights on worker 0-0, policy_version 5978 (0.00108) [2022-07-09 00:20:55,664][25689] Fps is (10 sec: 5450.6, 60 sec: 5468.0, 300 sec: 5479.4). Total num frames: 6128640. Throughput: 0: 4940.9. Samples: 6122878. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 00:20:55,664][25689] Avg episode reward: [(0, '-94.217')] [2022-07-09 00:20:56,016][26022] Updated weights on worker 0-0, policy_version 5988 (0.00095) [2022-07-09 00:20:58,175][26022] Updated weights on worker 0-0, policy_version 5998 (0.00088) [2022-07-09 00:20:59,776][26022] Updated weights on worker 0-0, policy_version 6008 (0.00083) [2022-07-09 00:21:00,668][25689] Fps is (10 sec: 5617.3, 60 sec: 5503.1, 300 sec: 5487.6). Total num frames: 6157312. Throughput: 0: 5777.4. Samples: 6156224. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 00:21:00,668][25689] Avg episode reward: [(0, '-94.014')] [2022-07-09 00:21:01,772][26022] Updated weights on worker 0-0, policy_version 6018 (0.00086) [2022-07-09 00:21:03,934][26022] Updated weights on worker 0-0, policy_version 6028 (0.00092) [2022-07-09 00:21:05,698][25689] Fps is (10 sec: 5306.4, 60 sec: 5492.4, 300 sec: 5477.9). Total num frames: 6181888. Throughput: 0: 5653.1. Samples: 6187132. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 00:21:05,699][25689] Avg episode reward: [(0, '-92.444')] [2022-07-09 00:21:05,878][26022] Updated weights on worker 0-0, policy_version 6038 (0.00088) [2022-07-09 00:21:07,785][26022] Updated weights on worker 0-0, policy_version 6048 (0.00093) [2022-07-09 00:21:09,395][26022] Updated weights on worker 0-0, policy_version 6058 (0.00084) [2022-07-09 00:21:10,779][25689] Fps is (10 sec: 5164.9, 60 sec: 5495.1, 300 sec: 5477.1). Total num frames: 6209536. Throughput: 0: 4827.6. Samples: 6203460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:21:10,779][25689] Avg episode reward: [(0, '-92.858')] [2022-07-09 00:21:11,623][26022] Updated weights on worker 0-0, policy_version 6068 (0.00086) [2022-07-09 00:21:13,211][26022] Updated weights on worker 0-0, policy_version 6078 (0.00087) [2022-07-09 00:21:15,240][26022] Updated weights on worker 0-0, policy_version 6088 (0.00087) [2022-07-09 00:21:15,822][25689] Fps is (10 sec: 5462.3, 60 sec: 5461.0, 300 sec: 5477.6). Total num frames: 6237184. Throughput: 0: 5635.0. Samples: 6236468. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:21:15,822][25689] Avg episode reward: [(0, '-92.775')] [2022-07-09 00:21:16,938][26022] Updated weights on worker 0-0, policy_version 6098 (0.00083) [2022-07-09 00:21:18,857][26022] Updated weights on worker 0-0, policy_version 6108 (0.00083) [2022-07-09 00:21:20,595][26022] Updated weights on worker 0-0, policy_version 6118 (0.00088) [2022-07-09 00:21:20,845][25689] Fps is (10 sec: 5493.4, 60 sec: 5479.1, 300 sec: 5481.7). Total num frames: 6264832. Throughput: 0: 5606.2. Samples: 6269342. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:21:20,845][25689] Avg episode reward: [(0, '-92.756')] [2022-07-09 00:21:22,583][26022] Updated weights on worker 0-0, policy_version 6128 (0.00086) [2022-07-09 00:21:24,268][26022] Updated weights on worker 0-0, policy_version 6138 (0.00095) [2022-07-09 00:21:25,868][25689] Fps is (10 sec: 5504.1, 60 sec: 5468.7, 300 sec: 5475.1). Total num frames: 6292480. Throughput: 0: 4902.7. Samples: 6286018. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:21:25,868][25689] Avg episode reward: [(0, '-93.359')] [2022-07-09 00:21:26,296][26022] Updated weights on worker 0-0, policy_version 6148 (0.00095) [2022-07-09 00:21:28,176][26022] Updated weights on worker 0-0, policy_version 6158 (0.00091) [2022-07-09 00:21:30,238][26022] Updated weights on worker 0-0, policy_version 6168 (0.00090) [2022-07-09 00:21:30,936][25689] Fps is (10 sec: 5479.8, 60 sec: 5468.5, 300 sec: 5477.6). Total num frames: 6320128. Throughput: 0: 5728.7. Samples: 6318932. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 00:21:30,936][25689] Avg episode reward: [(0, '-93.924')] [2022-07-09 00:21:31,996][26022] Updated weights on worker 0-0, policy_version 6178 (0.00086) [2022-07-09 00:21:33,787][26022] Updated weights on worker 0-0, policy_version 6188 (0.00086) [2022-07-09 00:21:35,789][26022] Updated weights on worker 0-0, policy_version 6198 (0.00088) [2022-07-09 00:21:35,979][25689] Fps is (10 sec: 5469.0, 60 sec: 5467.2, 300 sec: 5477.4). Total num frames: 6347776. Throughput: 0: 5713.7. Samples: 6351640. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 00:21:35,979][25689] Avg episode reward: [(0, '-95.377')] [2022-07-09 00:21:37,621][26022] Updated weights on worker 0-0, policy_version 6208 (0.00085) [2022-07-09 00:21:39,434][26022] Updated weights on worker 0-0, policy_version 6218 (0.01343) [2022-07-09 00:21:40,994][25689] Fps is (10 sec: 5395.7, 60 sec: 5450.8, 300 sec: 5477.3). Total num frames: 6374400. Throughput: 0: 5718.2. Samples: 6384558. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 00:21:40,994][25689] Avg episode reward: [(0, '-94.820')] [2022-07-09 00:21:41,307][26022] Updated weights on worker 0-0, policy_version 6228 (0.00094) [2022-07-09 00:21:43,209][26022] Updated weights on worker 0-0, policy_version 6238 (0.00089) [2022-07-09 00:21:44,974][26022] Updated weights on worker 0-0, policy_version 6248 (0.00098) [2022-07-09 00:21:46,035][25689] Fps is (10 sec: 5498.8, 60 sec: 5448.6, 300 sec: 5468.9). Total num frames: 6403072. Throughput: 0: 5697.2. Samples: 6400912. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 00:21:46,035][25689] Avg episode reward: [(0, '-94.230')] [2022-07-09 00:21:47,191][26022] Updated weights on worker 0-0, policy_version 6258 (0.00086) [2022-07-09 00:21:48,515][26022] Updated weights on worker 0-0, policy_version 6268 (0.00097) [2022-07-09 00:21:50,960][26022] Updated weights on worker 0-0, policy_version 6278 (0.00093) [2022-07-09 00:21:51,177][25689] Fps is (10 sec: 5530.8, 60 sec: 5446.8, 300 sec: 5473.8). Total num frames: 6430720. Throughput: 0: 5685.9. Samples: 6434022. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 00:21:51,177][25689] Avg episode reward: [(0, '-93.841')] [2022-07-09 00:21:52,223][26022] Updated weights on worker 0-0, policy_version 6288 (0.00087) [2022-07-09 00:21:54,334][26022] Updated weights on worker 0-0, policy_version 6298 (0.00091) [2022-07-09 00:21:56,192][25689] Fps is (10 sec: 5343.2, 60 sec: 5430.6, 300 sec: 5470.7). Total num frames: 6457344. Throughput: 0: 5726.5. Samples: 6467390. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:21:56,192][25689] Avg episode reward: [(0, '-92.136')] [2022-07-09 00:21:56,273][26022] Updated weights on worker 0-0, policy_version 6308 (0.00090) [2022-07-09 00:21:57,935][26022] Updated weights on worker 0-0, policy_version 6318 (0.00080) [2022-07-09 00:21:59,940][26022] Updated weights on worker 0-0, policy_version 6328 (0.00081) [2022-07-09 00:22:01,210][25689] Fps is (10 sec: 5715.8, 60 sec: 5463.2, 300 sec: 5487.8). Total num frames: 6488064. Throughput: 0: 4930.0. Samples: 6484220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:22:01,210][25689] Avg episode reward: [(0, '-91.762')] [2022-07-09 00:22:01,904][26022] Updated weights on worker 0-0, policy_version 6338 (0.00081) [2022-07-09 00:22:03,980][26022] Updated weights on worker 0-0, policy_version 6348 (0.00090) [2022-07-09 00:22:05,882][26022] Updated weights on worker 0-0, policy_version 6358 (0.00089) [2022-07-09 00:22:06,248][25689] Fps is (10 sec: 5498.9, 60 sec: 5462.5, 300 sec: 5481.7). Total num frames: 6512640. Throughput: 0: 5681.7. Samples: 6515756. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:22:06,248][25689] Avg episode reward: [(0, '-91.964')] [2022-07-09 00:22:07,558][26022] Updated weights on worker 0-0, policy_version 6368 (0.00082) [2022-07-09 00:22:09,626][26022] Updated weights on worker 0-0, policy_version 6378 (0.00083) [2022-07-09 00:22:11,305][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:22:11,324][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000006388_6541312.pth [2022-07-09 00:22:11,324][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000004460_4567040.pth [2022-07-09 00:22:11,325][25689] Fps is (10 sec: 5263.9, 60 sec: 5479.7, 300 sec: 5477.1). Total num frames: 6541312. Throughput: 0: 5715.4. Samples: 6549178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:22:11,326][25689] Avg episode reward: [(0, '-90.489')] [2022-07-09 00:22:11,326][26022] Updated weights on worker 0-0, policy_version 6388 (0.00087) [2022-07-09 00:22:13,252][26022] Updated weights on worker 0-0, policy_version 6398 (0.00084) [2022-07-09 00:22:15,139][26022] Updated weights on worker 0-0, policy_version 6408 (0.00084) [2022-07-09 00:22:16,387][25689] Fps is (10 sec: 5554.6, 60 sec: 5478.0, 300 sec: 5483.5). Total num frames: 6568960. Throughput: 0: 4864.8. Samples: 6565636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:22:16,388][25689] Avg episode reward: [(0, '-88.822')] [2022-07-09 00:22:16,966][26022] Updated weights on worker 0-0, policy_version 6418 (0.00088) [2022-07-09 00:22:18,678][26022] Updated weights on worker 0-0, policy_version 6428 (0.00089) [2022-07-09 00:22:20,610][26022] Updated weights on worker 0-0, policy_version 6438 (0.00091) [2022-07-09 00:22:21,403][25689] Fps is (10 sec: 5487.2, 60 sec: 5478.7, 300 sec: 5476.7). Total num frames: 6596608. Throughput: 0: 5695.0. Samples: 6599220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 00:22:21,403][25689] Avg episode reward: [(0, '-87.546')] [2022-07-09 00:22:22,435][26022] Updated weights on worker 0-0, policy_version 6448 (0.00081) [2022-07-09 00:22:24,171][26022] Updated weights on worker 0-0, policy_version 6458 (0.00090) [2022-07-09 00:22:26,362][26022] Updated weights on worker 0-0, policy_version 6468 (0.00087) [2022-07-09 00:22:26,467][25689] Fps is (10 sec: 5485.8, 60 sec: 5474.9, 300 sec: 5480.8). Total num frames: 6624256. Throughput: 0: 5755.5. Samples: 6632128. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 00:22:26,467][25689] Avg episode reward: [(0, '-87.959')] [2022-07-09 00:22:27,971][26022] Updated weights on worker 0-0, policy_version 6478 (0.00096) [2022-07-09 00:22:30,227][26022] Updated weights on worker 0-0, policy_version 6488 (0.00438) [2022-07-09 00:22:31,523][25689] Fps is (10 sec: 5564.8, 60 sec: 5492.9, 300 sec: 5480.1). Total num frames: 6652928. Throughput: 0: 4915.0. Samples: 6648452. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 00:22:31,524][25689] Avg episode reward: [(0, '-86.843')] [2022-07-09 00:22:31,575][26022] Updated weights on worker 0-0, policy_version 6498 (0.00089) [2022-07-09 00:22:33,756][26022] Updated weights on worker 0-0, policy_version 6508 (0.00092) [2022-07-09 00:22:35,413][26022] Updated weights on worker 0-0, policy_version 6518 (0.00088) [2022-07-09 00:22:36,530][25689] Fps is (10 sec: 5494.7, 60 sec: 5479.2, 300 sec: 5480.4). Total num frames: 6679552. Throughput: 0: 5739.5. Samples: 6681248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 00:22:36,531][25689] Avg episode reward: [(0, '-87.384')] [2022-07-09 00:22:37,479][26022] Updated weights on worker 0-0, policy_version 6528 (0.00082) [2022-07-09 00:22:39,245][26022] Updated weights on worker 0-0, policy_version 6538 (0.00083) [2022-07-09 00:22:41,034][26022] Updated weights on worker 0-0, policy_version 6548 (0.00089) [2022-07-09 00:22:41,550][25689] Fps is (10 sec: 5412.9, 60 sec: 5495.8, 300 sec: 5483.5). Total num frames: 6707200. Throughput: 0: 5738.1. Samples: 6714826. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 00:22:41,550][25689] Avg episode reward: [(0, '-87.093')] [2022-07-09 00:22:42,905][26022] Updated weights on worker 0-0, policy_version 6558 (0.00090) [2022-07-09 00:22:44,703][26022] Updated weights on worker 0-0, policy_version 6568 (0.00091) [2022-07-09 00:22:46,517][26022] Updated weights on worker 0-0, policy_version 6578 (0.00088) [2022-07-09 00:22:46,580][25689] Fps is (10 sec: 5604.3, 60 sec: 5496.7, 300 sec: 5477.4). Total num frames: 6735872. Throughput: 0: 4945.1. Samples: 6731588. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 00:22:46,580][25689] Avg episode reward: [(0, '-87.819')] [2022-07-09 00:22:48,466][26022] Updated weights on worker 0-0, policy_version 6588 (0.00093) [2022-07-09 00:22:50,219][26022] Updated weights on worker 0-0, policy_version 6598 (0.00084) [2022-07-09 00:22:51,620][25689] Fps is (10 sec: 5592.6, 60 sec: 5506.0, 300 sec: 5480.7). Total num frames: 6763520. Throughput: 0: 5787.3. Samples: 6764760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 00:22:51,621][25689] Avg episode reward: [(0, '-88.666')] [2022-07-09 00:22:52,134][26022] Updated weights on worker 0-0, policy_version 6608 (0.00081) [2022-07-09 00:22:53,944][26022] Updated weights on worker 0-0, policy_version 6618 (0.00089) [2022-07-09 00:22:55,795][26022] Updated weights on worker 0-0, policy_version 6628 (0.00084) [2022-07-09 00:22:56,644][25689] Fps is (10 sec: 5494.5, 60 sec: 5522.2, 300 sec: 5480.7). Total num frames: 6791168. Throughput: 0: 5812.0. Samples: 6798146. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 00:22:56,644][25689] Avg episode reward: [(0, '-87.213')] [2022-07-09 00:22:57,771][26022] Updated weights on worker 0-0, policy_version 6638 (0.00095) [2022-07-09 00:22:59,438][26022] Updated weights on worker 0-0, policy_version 6648 (0.00086) [2022-07-09 00:23:01,491][26022] Updated weights on worker 0-0, policy_version 6658 (0.00089) [2022-07-09 00:23:01,650][25689] Fps is (10 sec: 5411.4, 60 sec: 5455.5, 300 sec: 5482.4). Total num frames: 6817792. Throughput: 0: 4973.6. Samples: 6814794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:23:01,650][25689] Avg episode reward: [(0, '-87.765')] [2022-07-09 00:23:03,498][26022] Updated weights on worker 0-0, policy_version 6668 (0.00083) [2022-07-09 00:23:05,495][26022] Updated weights on worker 0-0, policy_version 6678 (0.00091) [2022-07-09 00:23:06,667][25689] Fps is (10 sec: 5312.4, 60 sec: 5491.2, 300 sec: 5483.1). Total num frames: 6844416. Throughput: 0: 5687.5. Samples: 6845834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:23:06,668][25689] Avg episode reward: [(0, '-87.679')] [2022-07-09 00:23:07,568][26022] Updated weights on worker 0-0, policy_version 6688 (0.00088) [2022-07-09 00:23:09,009][26022] Updated weights on worker 0-0, policy_version 6698 (0.00087) [2022-07-09 00:23:11,117][26022] Updated weights on worker 0-0, policy_version 6708 (0.00093) [2022-07-09 00:23:11,726][25689] Fps is (10 sec: 5386.2, 60 sec: 5476.0, 300 sec: 5482.2). Total num frames: 6872064. Throughput: 0: 5683.9. Samples: 6879036. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:23:11,726][25689] Avg episode reward: [(0, '-86.851')] [2022-07-09 00:23:12,658][26022] Updated weights on worker 0-0, policy_version 6718 (0.00088) [2022-07-09 00:23:14,674][26022] Updated weights on worker 0-0, policy_version 6728 (0.00083) [2022-07-09 00:23:16,460][26022] Updated weights on worker 0-0, policy_version 6738 (0.00083) [2022-07-09 00:23:16,746][25689] Fps is (10 sec: 5486.1, 60 sec: 5479.7, 300 sec: 5478.8). Total num frames: 6899712. Throughput: 0: 4863.8. Samples: 6895920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:23:16,747][25689] Avg episode reward: [(0, '-86.898')] [2022-07-09 00:23:18,251][26022] Updated weights on worker 0-0, policy_version 6748 (0.00083) [2022-07-09 00:23:20,179][26022] Updated weights on worker 0-0, policy_version 6758 (0.00091) [2022-07-09 00:23:21,771][25689] Fps is (10 sec: 5606.6, 60 sec: 5495.9, 300 sec: 5482.5). Total num frames: 6928384. Throughput: 0: 5698.2. Samples: 6929450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:23:21,771][25689] Avg episode reward: [(0, '-87.624')] [2022-07-09 00:23:22,134][26022] Updated weights on worker 0-0, policy_version 6768 (0.00088) [2022-07-09 00:23:23,731][26022] Updated weights on worker 0-0, policy_version 6778 (0.00079) [2022-07-09 00:23:25,803][26022] Updated weights on worker 0-0, policy_version 6788 (0.00080) [2022-07-09 00:23:26,795][25689] Fps is (10 sec: 5706.3, 60 sec: 5516.5, 300 sec: 5490.8). Total num frames: 6957056. Throughput: 0: 5801.3. Samples: 6962606. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:23:26,796][25689] Avg episode reward: [(0, '-87.583')] [2022-07-09 00:23:27,688][26022] Updated weights on worker 0-0, policy_version 6798 (0.00088) [2022-07-09 00:23:29,606][26022] Updated weights on worker 0-0, policy_version 6809 (0.00090) [2022-07-09 00:23:31,613][26022] Updated weights on worker 0-0, policy_version 6819 (0.00090) [2022-07-09 00:23:31,888][25689] Fps is (10 sec: 5465.6, 60 sec: 5479.2, 300 sec: 5482.3). Total num frames: 6983680. Throughput: 0: 4961.1. Samples: 6979064. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:23:31,889][25689] Avg episode reward: [(0, '-87.939')] [2022-07-09 00:23:33,290][26022] Updated weights on worker 0-0, policy_version 6829 (0.00092) [2022-07-09 00:23:35,255][26022] Updated weights on worker 0-0, policy_version 6839 (0.00098) [2022-07-09 00:23:36,919][25689] Fps is (10 sec: 5361.2, 60 sec: 5494.0, 300 sec: 5485.8). Total num frames: 7011328. Throughput: 0: 5760.7. Samples: 7012128. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 00:23:36,919][25689] Avg episode reward: [(0, '-89.298')] [2022-07-09 00:23:37,322][26022] Updated weights on worker 0-0, policy_version 6849 (0.01019) [2022-07-09 00:23:38,940][26022] Updated weights on worker 0-0, policy_version 6859 (0.00083) [2022-07-09 00:23:40,839][26022] Updated weights on worker 0-0, policy_version 6869 (0.00086) [2022-07-09 00:23:41,939][25689] Fps is (10 sec: 5501.7, 60 sec: 5494.0, 300 sec: 5485.8). Total num frames: 7038976. Throughput: 0: 5744.9. Samples: 7045312. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 00:23:41,939][25689] Avg episode reward: [(0, '-90.503')] [2022-07-09 00:23:42,687][26022] Updated weights on worker 0-0, policy_version 6879 (0.00095) [2022-07-09 00:23:44,531][26022] Updated weights on worker 0-0, policy_version 6889 (0.00095) [2022-07-09 00:23:46,368][26022] Updated weights on worker 0-0, policy_version 6899 (0.00095) [2022-07-09 00:23:46,945][25689] Fps is (10 sec: 5515.2, 60 sec: 5479.2, 300 sec: 5479.5). Total num frames: 7066624. Throughput: 0: 4926.3. Samples: 7061868. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 00:23:46,945][25689] Avg episode reward: [(0, '-90.078')] [2022-07-09 00:23:48,151][26022] Updated weights on worker 0-0, policy_version 6909 (0.00095) [2022-07-09 00:23:50,207][26022] Updated weights on worker 0-0, policy_version 6919 (0.00086) [2022-07-09 00:23:52,008][25689] Fps is (10 sec: 5491.4, 60 sec: 5477.1, 300 sec: 5485.8). Total num frames: 7094272. Throughput: 0: 5757.3. Samples: 7094902. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:23:52,010][25689] Avg episode reward: [(0, '-89.836')] [2022-07-09 00:23:52,290][26022] Updated weights on worker 0-0, policy_version 6929 (0.00087) [2022-07-09 00:23:53,810][26022] Updated weights on worker 0-0, policy_version 6939 (0.00094) [2022-07-09 00:23:56,125][26022] Updated weights on worker 0-0, policy_version 6949 (0.00088) [2022-07-09 00:23:57,083][25689] Fps is (10 sec: 5656.4, 60 sec: 5506.4, 300 sec: 5488.6). Total num frames: 7123968. Throughput: 0: 5725.7. Samples: 7127582. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:23:57,083][25689] Avg episode reward: [(0, '-89.636')] [2022-07-09 00:23:57,536][26022] Updated weights on worker 0-0, policy_version 6959 (0.00090) [2022-07-09 00:23:59,487][26022] Updated weights on worker 0-0, policy_version 6969 (0.00082) [2022-07-09 00:24:01,647][26022] Updated weights on worker 0-0, policy_version 6979 (0.00095) [2022-07-09 00:24:02,101][25689] Fps is (10 sec: 5377.5, 60 sec: 5471.4, 300 sec: 5485.0). Total num frames: 7148544. Throughput: 0: 4912.3. Samples: 7144354. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:24:02,101][25689] Avg episode reward: [(0, '-89.475')] [2022-07-09 00:24:03,561][26022] Updated weights on worker 0-0, policy_version 6989 (0.00090) [2022-07-09 00:24:05,674][26022] Updated weights on worker 0-0, policy_version 6999 (0.00092) [2022-07-09 00:24:07,108][25689] Fps is (10 sec: 5107.2, 60 sec: 5472.3, 300 sec: 5478.9). Total num frames: 7175168. Throughput: 0: 5621.4. Samples: 7175212. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 00:24:07,108][25689] Avg episode reward: [(0, '-89.249')] [2022-07-09 00:24:07,346][26022] Updated weights on worker 0-0, policy_version 7009 (0.00088) [2022-07-09 00:24:09,378][26022] Updated weights on worker 0-0, policy_version 7019 (0.00090) [2022-07-09 00:24:11,023][26022] Updated weights on worker 0-0, policy_version 7029 (0.00090) [2022-07-09 00:24:11,345][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:24:11,354][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000007030_7198720.pth [2022-07-09 00:24:11,362][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000005101_5223424.pth [2022-07-09 00:24:12,217][25689] Fps is (10 sec: 5365.0, 60 sec: 5467.8, 300 sec: 5478.4). Total num frames: 7202816. Throughput: 0: 5604.4. Samples: 7208158. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 00:24:12,217][25689] Avg episode reward: [(0, '-88.480')] [2022-07-09 00:24:12,964][26022] Updated weights on worker 0-0, policy_version 7039 (0.00084) [2022-07-09 00:24:14,782][26022] Updated weights on worker 0-0, policy_version 7049 (0.00088) [2022-07-09 00:24:16,715][26022] Updated weights on worker 0-0, policy_version 7059 (0.00497) [2022-07-09 00:24:17,273][25689] Fps is (10 sec: 5641.5, 60 sec: 5498.4, 300 sec: 5481.1). Total num frames: 7232512. Throughput: 0: 4826.4. Samples: 7225026. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:24:17,273][25689] Avg episode reward: [(0, '-87.841')] [2022-07-09 00:24:18,435][26022] Updated weights on worker 0-0, policy_version 7069 (0.00091) [2022-07-09 00:24:20,307][26022] Updated weights on worker 0-0, policy_version 7079 (0.00091) [2022-07-09 00:24:22,080][26022] Updated weights on worker 0-0, policy_version 7089 (0.00092) [2022-07-09 00:24:22,282][25689] Fps is (10 sec: 5595.2, 60 sec: 5466.0, 300 sec: 5481.9). Total num frames: 7259136. Throughput: 0: 5648.5. Samples: 7258350. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:24:22,283][25689] Avg episode reward: [(0, '-88.482')] [2022-07-09 00:24:23,944][26022] Updated weights on worker 0-0, policy_version 7099 (0.00083) [2022-07-09 00:24:26,032][26022] Updated weights on worker 0-0, policy_version 7109 (0.00092) [2022-07-09 00:24:27,369][25689] Fps is (10 sec: 5476.6, 60 sec: 5460.3, 300 sec: 5478.2). Total num frames: 7287808. Throughput: 0: 5747.9. Samples: 7291674. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:24:27,370][25689] Avg episode reward: [(0, '-88.468')] [2022-07-09 00:24:27,750][26022] Updated weights on worker 0-0, policy_version 7119 (0.00087) [2022-07-09 00:24:29,677][26022] Updated weights on worker 0-0, policy_version 7129 (0.00093) [2022-07-09 00:24:31,292][26022] Updated weights on worker 0-0, policy_version 7139 (0.00096) [2022-07-09 00:24:32,479][25689] Fps is (10 sec: 5523.5, 60 sec: 5475.7, 300 sec: 5477.3). Total num frames: 7315456. Throughput: 0: 5770.7. Samples: 7325084. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:24:32,479][25689] Avg episode reward: [(0, '-89.293')] [2022-07-09 00:24:33,283][26022] Updated weights on worker 0-0, policy_version 7149 (0.00092) [2022-07-09 00:24:34,881][26022] Updated weights on worker 0-0, policy_version 7159 (0.00088) [2022-07-09 00:24:37,186][26022] Updated weights on worker 0-0, policy_version 7169 (0.00090) [2022-07-09 00:24:37,508][25689] Fps is (10 sec: 5454.1, 60 sec: 5475.8, 300 sec: 5484.4). Total num frames: 7343104. Throughput: 0: 5768.2. Samples: 7341746. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:24:37,510][25689] Avg episode reward: [(0, '-88.041')] [2022-07-09 00:24:38,580][26022] Updated weights on worker 0-0, policy_version 7179 (0.00088) [2022-07-09 00:24:40,856][26022] Updated weights on worker 0-0, policy_version 7189 (0.00102) [2022-07-09 00:24:42,410][26022] Updated weights on worker 0-0, policy_version 7199 (0.00100) [2022-07-09 00:24:42,530][25689] Fps is (10 sec: 5705.1, 60 sec: 5509.4, 300 sec: 5484.4). Total num frames: 7372800. Throughput: 0: 5753.3. Samples: 7374842. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:24:42,531][25689] Avg episode reward: [(0, '-88.007')] [2022-07-09 00:24:44,505][26022] Updated weights on worker 0-0, policy_version 7209 (0.00086) [2022-07-09 00:24:45,908][26022] Updated weights on worker 0-0, policy_version 7219 (0.00084) [2022-07-09 00:24:47,629][25689] Fps is (10 sec: 5463.6, 60 sec: 5467.3, 300 sec: 5473.5). Total num frames: 7398400. Throughput: 0: 5747.6. Samples: 7408116. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:24:47,630][25689] Avg episode reward: [(0, '-88.335')] [2022-07-09 00:24:48,199][26022] Updated weights on worker 0-0, policy_version 7229 (0.00088) [2022-07-09 00:24:49,911][26022] Updated weights on worker 0-0, policy_version 7239 (0.00087) [2022-07-09 00:24:51,665][26022] Updated weights on worker 0-0, policy_version 7249 (0.00098) [2022-07-09 00:24:52,728][25689] Fps is (10 sec: 5522.7, 60 sec: 5514.6, 300 sec: 5482.2). Total num frames: 7429120. Throughput: 0: 4925.8. Samples: 7424828. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:24:52,728][25689] Avg episode reward: [(0, '-89.915')] [2022-07-09 00:24:53,712][26022] Updated weights on worker 0-0, policy_version 7259 (0.00084) [2022-07-09 00:24:55,479][26022] Updated weights on worker 0-0, policy_version 7269 (0.00088) [2022-07-09 00:24:57,368][26022] Updated weights on worker 0-0, policy_version 7279 (0.00089) [2022-07-09 00:24:57,797][25689] Fps is (10 sec: 5639.3, 60 sec: 5464.5, 300 sec: 5481.2). Total num frames: 7455744. Throughput: 0: 5731.5. Samples: 7458036. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 00:24:57,798][25689] Avg episode reward: [(0, '-90.896')] [2022-07-09 00:24:59,110][26022] Updated weights on worker 0-0, policy_version 7289 (0.00085) [2022-07-09 00:25:00,951][26022] Updated weights on worker 0-0, policy_version 7299 (0.00082) [2022-07-09 00:25:02,804][25689] Fps is (10 sec: 5284.8, 60 sec: 5499.3, 300 sec: 5486.4). Total num frames: 7482368. Throughput: 0: 5680.2. Samples: 7490000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 00:25:02,804][25689] Avg episode reward: [(0, '-90.578')] [2022-07-09 00:25:03,318][26022] Updated weights on worker 0-0, policy_version 7309 (0.00082) [2022-07-09 00:25:05,004][26022] Updated weights on worker 0-0, policy_version 7319 (0.00092) [2022-07-09 00:25:06,993][26022] Updated weights on worker 0-0, policy_version 7329 (0.00085) [2022-07-09 00:25:07,821][25689] Fps is (10 sec: 5414.4, 60 sec: 5515.2, 300 sec: 5488.1). Total num frames: 7510016. Throughput: 0: 4861.5. Samples: 7506282. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 00:25:07,821][25689] Avg episode reward: [(0, '-90.856')] [2022-07-09 00:25:08,621][26022] Updated weights on worker 0-0, policy_version 7339 (0.00101) [2022-07-09 00:25:10,652][26022] Updated weights on worker 0-0, policy_version 7349 (0.00086) [2022-07-09 00:25:12,451][26022] Updated weights on worker 0-0, policy_version 7359 (0.00083) [2022-07-09 00:25:12,936][25689] Fps is (10 sec: 5457.2, 60 sec: 5514.7, 300 sec: 5479.8). Total num frames: 7537664. Throughput: 0: 5671.3. Samples: 7539436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 23.0) [2022-07-09 00:25:12,937][25689] Avg episode reward: [(0, '-92.043')] [2022-07-09 00:25:14,376][26022] Updated weights on worker 0-0, policy_version 7369 (0.00084) [2022-07-09 00:25:16,007][26022] Updated weights on worker 0-0, policy_version 7379 (0.00084) [2022-07-09 00:25:17,974][25689] Fps is (10 sec: 5446.3, 60 sec: 5482.6, 300 sec: 5483.2). Total num frames: 7565312. Throughput: 0: 5685.8. Samples: 7572756. Policy #0 lag: (min: 0.0, avg: 8.8, max: 23.0) [2022-07-09 00:25:17,976][25689] Avg episode reward: [(0, '-91.742')] [2022-07-09 00:25:18,108][26022] Updated weights on worker 0-0, policy_version 7389 (0.00090) [2022-07-09 00:25:19,742][26022] Updated weights on worker 0-0, policy_version 7399 (0.00083) [2022-07-09 00:25:21,832][26022] Updated weights on worker 0-0, policy_version 7409 (0.00091) [2022-07-09 00:25:22,981][25689] Fps is (10 sec: 5606.7, 60 sec: 5516.5, 300 sec: 5484.8). Total num frames: 7593984. Throughput: 0: 4923.7. Samples: 7589348. Policy #0 lag: (min: 0.0, avg: 8.8, max: 23.0) [2022-07-09 00:25:22,983][25689] Avg episode reward: [(0, '-91.343')] [2022-07-09 00:25:23,453][26022] Updated weights on worker 0-0, policy_version 7419 (0.00090) [2022-07-09 00:25:25,432][26022] Updated weights on worker 0-0, policy_version 7429 (0.00090) [2022-07-09 00:25:27,281][26022] Updated weights on worker 0-0, policy_version 7439 (0.00086) [2022-07-09 00:25:27,991][25689] Fps is (10 sec: 5520.3, 60 sec: 5489.8, 300 sec: 5482.4). Total num frames: 7620608. Throughput: 0: 5760.0. Samples: 7622460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-09 00:25:27,992][25689] Avg episode reward: [(0, '-90.894')] [2022-07-09 00:25:29,031][26022] Updated weights on worker 0-0, policy_version 7449 (0.01083) [2022-07-09 00:25:31,050][26022] Updated weights on worker 0-0, policy_version 7459 (0.00099) [2022-07-09 00:25:32,659][26022] Updated weights on worker 0-0, policy_version 7469 (0.00084) [2022-07-09 00:25:33,061][25689] Fps is (10 sec: 5587.6, 60 sec: 5527.2, 300 sec: 5488.5). Total num frames: 7650304. Throughput: 0: 5786.8. Samples: 7655892. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-09 00:25:33,063][25689] Avg episode reward: [(0, '-91.256')] [2022-07-09 00:25:34,710][26022] Updated weights on worker 0-0, policy_version 7479 (0.00084) [2022-07-09 00:25:36,476][26022] Updated weights on worker 0-0, policy_version 7489 (0.00085) [2022-07-09 00:25:38,064][25689] Fps is (10 sec: 5590.8, 60 sec: 5512.6, 300 sec: 5485.4). Total num frames: 7676928. Throughput: 0: 4978.1. Samples: 7672768. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 00:25:38,066][25689] Avg episode reward: [(0, '-90.771')] [2022-07-09 00:25:38,544][26022] Updated weights on worker 0-0, policy_version 7499 (0.00094) [2022-07-09 00:25:40,305][26022] Updated weights on worker 0-0, policy_version 7509 (0.00088) [2022-07-09 00:25:42,159][26022] Updated weights on worker 0-0, policy_version 7519 (0.00084) [2022-07-09 00:25:43,076][25689] Fps is (10 sec: 5419.1, 60 sec: 5479.8, 300 sec: 5482.1). Total num frames: 7704576. Throughput: 0: 5791.7. Samples: 7705730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 00:25:43,078][25689] Avg episode reward: [(0, '-90.311')] [2022-07-09 00:25:43,860][26022] Updated weights on worker 0-0, policy_version 7529 (0.00092) [2022-07-09 00:25:45,884][26022] Updated weights on worker 0-0, policy_version 7539 (0.00095) [2022-07-09 00:25:47,611][26022] Updated weights on worker 0-0, policy_version 7549 (0.00092) [2022-07-09 00:25:48,089][25689] Fps is (10 sec: 5515.9, 60 sec: 5521.3, 300 sec: 5484.1). Total num frames: 7732224. Throughput: 0: 5815.6. Samples: 7739344. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 00:25:48,090][25689] Avg episode reward: [(0, '-89.477')] [2022-07-09 00:25:49,479][26022] Updated weights on worker 0-0, policy_version 7559 (0.00080) [2022-07-09 00:25:51,351][26022] Updated weights on worker 0-0, policy_version 7569 (0.00083) [2022-07-09 00:25:53,056][26022] Updated weights on worker 0-0, policy_version 7579 (0.00085) [2022-07-09 00:25:53,151][25689] Fps is (10 sec: 5590.0, 60 sec: 5490.9, 300 sec: 5486.8). Total num frames: 7760896. Throughput: 0: 4977.0. Samples: 7755880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:25:53,153][25689] Avg episode reward: [(0, '-89.148')] [2022-07-09 00:25:55,117][26022] Updated weights on worker 0-0, policy_version 7589 (0.00084) [2022-07-09 00:25:56,688][26022] Updated weights on worker 0-0, policy_version 7599 (0.00086) [2022-07-09 00:25:58,203][25689] Fps is (10 sec: 5467.3, 60 sec: 5492.5, 300 sec: 5479.0). Total num frames: 7787520. Throughput: 0: 5783.7. Samples: 7789244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:25:58,204][25689] Avg episode reward: [(0, '-88.150')] [2022-07-09 00:25:58,816][26022] Updated weights on worker 0-0, policy_version 7609 (0.00090) [2022-07-09 00:26:00,393][26022] Updated weights on worker 0-0, policy_version 7619 (0.00085) [2022-07-09 00:26:02,793][26022] Updated weights on worker 0-0, policy_version 7629 (0.00085) [2022-07-09 00:26:03,215][25689] Fps is (10 sec: 5290.9, 60 sec: 5491.9, 300 sec: 5486.3). Total num frames: 7814144. Throughput: 0: 5695.6. Samples: 7820436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:26:03,216][25689] Avg episode reward: [(0, '-87.350')] [2022-07-09 00:26:04,435][26022] Updated weights on worker 0-0, policy_version 7639 (0.00079) [2022-07-09 00:26:06,450][26022] Updated weights on worker 0-0, policy_version 7649 (0.00090) [2022-07-09 00:26:08,225][25689] Fps is (10 sec: 5415.5, 60 sec: 5492.6, 300 sec: 5487.6). Total num frames: 7841792. Throughput: 0: 4847.4. Samples: 7836948. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:26:08,225][25689] Avg episode reward: [(0, '-87.212')] [2022-07-09 00:26:08,339][26022] Updated weights on worker 0-0, policy_version 7659 (0.00560) [2022-07-09 00:26:10,355][26022] Updated weights on worker 0-0, policy_version 7669 (0.00091) [2022-07-09 00:26:11,484][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:26:11,500][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000007676_7860224.pth [2022-07-09 00:26:11,501][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000005747_5884928.pth [2022-07-09 00:26:12,158][26022] Updated weights on worker 0-0, policy_version 7679 (0.00090) [2022-07-09 00:26:13,280][25689] Fps is (10 sec: 5392.2, 60 sec: 5481.1, 300 sec: 5483.9). Total num frames: 7868416. Throughput: 0: 5649.1. Samples: 7869592. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:26:13,281][25689] Avg episode reward: [(0, '-88.395')] [2022-07-09 00:26:14,010][26022] Updated weights on worker 0-0, policy_version 7689 (0.00092) [2022-07-09 00:26:15,799][26022] Updated weights on worker 0-0, policy_version 7699 (0.00095) [2022-07-09 00:26:17,619][26022] Updated weights on worker 0-0, policy_version 7709 (0.00091) [2022-07-09 00:26:18,322][25689] Fps is (10 sec: 5476.3, 60 sec: 5497.7, 300 sec: 5487.0). Total num frames: 7897088. Throughput: 0: 5637.2. Samples: 7902658. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 00:26:18,323][25689] Avg episode reward: [(0, '-88.912')] [2022-07-09 00:26:19,668][26022] Updated weights on worker 0-0, policy_version 7719 (0.00086) [2022-07-09 00:26:21,474][26022] Updated weights on worker 0-0, policy_version 7729 (0.00080) [2022-07-09 00:26:23,286][26022] Updated weights on worker 0-0, policy_version 7739 (0.00089) [2022-07-09 00:26:23,357][25689] Fps is (10 sec: 5589.3, 60 sec: 5478.3, 300 sec: 5486.8). Total num frames: 7924736. Throughput: 0: 4892.4. Samples: 7918976. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 00:26:23,357][25689] Avg episode reward: [(0, '-90.244')] [2022-07-09 00:26:25,307][26022] Updated weights on worker 0-0, policy_version 7749 (0.00086) [2022-07-09 00:26:27,104][26022] Updated weights on worker 0-0, policy_version 7759 (0.00097) [2022-07-09 00:26:28,386][25689] Fps is (10 sec: 5392.9, 60 sec: 5476.5, 300 sec: 5484.1). Total num frames: 7951360. Throughput: 0: 5695.4. Samples: 7951772. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 00:26:28,386][25689] Avg episode reward: [(0, '-89.968')] [2022-07-09 00:26:29,092][26022] Updated weights on worker 0-0, policy_version 7769 (0.00094) [2022-07-09 00:26:30,788][26022] Updated weights on worker 0-0, policy_version 7779 (0.00094) [2022-07-09 00:26:32,631][26022] Updated weights on worker 0-0, policy_version 7789 (0.00090) [2022-07-09 00:26:33,466][25689] Fps is (10 sec: 5368.6, 60 sec: 5441.7, 300 sec: 5483.4). Total num frames: 7979008. Throughput: 0: 5694.0. Samples: 7984528. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:26:33,466][25689] Avg episode reward: [(0, '-90.105')] [2022-07-09 00:26:34,667][26022] Updated weights on worker 0-0, policy_version 7799 (0.00089) [2022-07-09 00:26:36,521][26022] Updated weights on worker 0-0, policy_version 7809 (0.00094) [2022-07-09 00:26:38,530][25689] Fps is (10 sec: 5349.8, 60 sec: 5436.2, 300 sec: 5482.5). Total num frames: 8005632. Throughput: 0: 4879.9. Samples: 8001272. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:26:38,531][25689] Avg episode reward: [(0, '-89.789')] [2022-07-09 00:26:38,695][26022] Updated weights on worker 0-0, policy_version 7819 (0.00077) [2022-07-09 00:26:40,879][26022] Updated weights on worker 0-0, policy_version 7829 (0.00083) [2022-07-09 00:26:42,761][26022] Updated weights on worker 0-0, policy_version 7839 (0.00604) [2022-07-09 00:26:43,550][25689] Fps is (10 sec: 5077.4, 60 sec: 5384.7, 300 sec: 5469.1). Total num frames: 8030208. Throughput: 0: 5493.8. Samples: 8029912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 00:26:43,550][25689] Avg episode reward: [(0, '-89.024')] [2022-07-09 00:26:45,102][26022] Updated weights on worker 0-0, policy_version 7849 (0.00094) [2022-07-09 00:26:47,107][26022] Updated weights on worker 0-0, policy_version 7859 (0.00081) [2022-07-09 00:26:48,579][25689] Fps is (10 sec: 4687.7, 60 sec: 5298.7, 300 sec: 5454.0). Total num frames: 8052736. Throughput: 0: 5316.0. Samples: 8059118. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 00:26:48,581][25689] Avg episode reward: [(0, '-88.200')] [2022-07-09 00:26:49,340][26022] Updated weights on worker 0-0, policy_version 7869 (0.00075) [2022-07-09 00:26:51,784][26022] Updated weights on worker 0-0, policy_version 7879 (0.00105) [2022-07-09 00:26:53,663][25689] Fps is (10 sec: 4252.4, 60 sec: 5161.3, 300 sec: 5432.0). Total num frames: 8073216. Throughput: 0: 4343.3. Samples: 8072254. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 00:26:53,664][25689] Avg episode reward: [(0, '-87.567')] [2022-07-09 00:26:54,331][26022] Updated weights on worker 0-0, policy_version 7889 (0.00099) [2022-07-09 00:26:56,992][26022] Updated weights on worker 0-0, policy_version 7899 (0.00095) [2022-07-09 00:26:58,716][25689] Fps is (10 sec: 4242.4, 60 sec: 5093.5, 300 sec: 5403.8). Total num frames: 8095744. Throughput: 0: 4722.0. Samples: 8096590. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 00:26:58,717][25689] Avg episode reward: [(0, '-87.689')] [2022-07-09 00:26:59,632][26022] Updated weights on worker 0-0, policy_version 7909 (0.00089) [2022-07-09 00:27:02,566][26022] Updated weights on worker 0-0, policy_version 7919 (0.00090) [2022-07-09 00:27:03,719][25689] Fps is (10 sec: 4073.2, 60 sec: 4958.9, 300 sec: 5383.8). Total num frames: 8114176. Throughput: 0: 4437.1. Samples: 8119408. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:27:03,720][25689] Avg episode reward: [(0, '-88.440')] [2022-07-09 00:27:04,472][26022] Updated weights on worker 0-0, policy_version 7929 (0.00089) [2022-07-09 00:27:06,433][26022] Updated weights on worker 0-0, policy_version 7939 (0.00087) [2022-07-09 00:27:08,457][26022] Updated weights on worker 0-0, policy_version 7949 (0.00090) [2022-07-09 00:27:08,762][25689] Fps is (10 sec: 4586.8, 60 sec: 4956.1, 300 sec: 5381.0). Total num frames: 8141824. Throughput: 0: 4558.4. Samples: 8151124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:27:08,763][25689] Avg episode reward: [(0, '-89.886')] [2022-07-09 00:27:10,283][26022] Updated weights on worker 0-0, policy_version 7959 (0.00086) [2022-07-09 00:27:12,306][26022] Updated weights on worker 0-0, policy_version 7970 (0.00090) [2022-07-09 00:27:13,830][25689] Fps is (10 sec: 5367.6, 60 sec: 4955.1, 300 sec: 5377.5). Total num frames: 8168448. Throughput: 0: 4715.2. Samples: 8167346. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 00:27:13,831][25689] Avg episode reward: [(0, '-88.710')] [2022-07-09 00:27:14,339][26022] Updated weights on worker 0-0, policy_version 7980 (0.00087) [2022-07-09 00:27:15,936][26022] Updated weights on worker 0-0, policy_version 7990 (0.00091) [2022-07-09 00:27:18,075][26022] Updated weights on worker 0-0, policy_version 8000 (0.00118) [2022-07-09 00:27:18,900][25689] Fps is (10 sec: 5555.3, 60 sec: 4969.7, 300 sec: 5383.4). Total num frames: 8198144. Throughput: 0: 5143.2. Samples: 8200402. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 00:27:18,901][25689] Avg episode reward: [(0, '-87.390')] [2022-07-09 00:27:19,658][26022] Updated weights on worker 0-0, policy_version 8010 (0.00078) [2022-07-09 00:27:21,776][26022] Updated weights on worker 0-0, policy_version 8020 (0.00089) [2022-07-09 00:27:23,532][26022] Updated weights on worker 0-0, policy_version 8030 (0.00081) [2022-07-09 00:27:23,986][25689] Fps is (10 sec: 5545.6, 60 sec: 4948.6, 300 sec: 5379.5). Total num frames: 8224768. Throughput: 0: 5629.3. Samples: 8233496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 00:27:23,987][25689] Avg episode reward: [(0, '-85.789')] [2022-07-09 00:27:25,491][26022] Updated weights on worker 0-0, policy_version 8040 (0.00088) [2022-07-09 00:27:27,379][26022] Updated weights on worker 0-0, policy_version 8050 (0.00093) [2022-07-09 00:27:29,073][25689] Fps is (10 sec: 5335.3, 60 sec: 4960.8, 300 sec: 5375.5). Total num frames: 8252416. Throughput: 0: 4867.0. Samples: 8249974. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 00:27:29,073][25689] Avg episode reward: [(0, '-85.231')] [2022-07-09 00:27:29,218][26022] Updated weights on worker 0-0, policy_version 8060 (0.00082) [2022-07-09 00:27:31,044][26022] Updated weights on worker 0-0, policy_version 8070 (0.00092) [2022-07-09 00:27:32,965][26022] Updated weights on worker 0-0, policy_version 8080 (0.00082) [2022-07-09 00:27:34,138][25689] Fps is (10 sec: 5446.7, 60 sec: 4962.0, 300 sec: 5377.9). Total num frames: 8280064. Throughput: 0: 5690.3. Samples: 8282904. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:27:34,139][25689] Avg episode reward: [(0, '-84.508')] [2022-07-09 00:27:34,639][26022] Updated weights on worker 0-0, policy_version 8090 (0.00092) [2022-07-09 00:27:36,575][26022] Updated weights on worker 0-0, policy_version 8100 (0.00083) [2022-07-09 00:27:38,420][26022] Updated weights on worker 0-0, policy_version 8110 (0.00093) [2022-07-09 00:27:39,195][25689] Fps is (10 sec: 5463.0, 60 sec: 4979.5, 300 sec: 5377.3). Total num frames: 8307712. Throughput: 0: 5680.6. Samples: 8315686. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:27:39,195][25689] Avg episode reward: [(0, '-82.971')] [2022-07-09 00:27:40,371][26022] Updated weights on worker 0-0, policy_version 8120 (0.00089) [2022-07-09 00:27:42,203][26022] Updated weights on worker 0-0, policy_version 8130 (0.00086) [2022-07-09 00:27:44,163][26022] Updated weights on worker 0-0, policy_version 8140 (0.00087) [2022-07-09 00:27:44,221][25689] Fps is (10 sec: 5484.5, 60 sec: 5029.6, 300 sec: 5373.9). Total num frames: 8335360. Throughput: 0: 4870.6. Samples: 8332050. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:27:44,221][25689] Avg episode reward: [(0, '-83.021')] [2022-07-09 00:27:45,982][26022] Updated weights on worker 0-0, policy_version 8150 (0.00077) [2022-07-09 00:27:47,860][26022] Updated weights on worker 0-0, policy_version 8160 (0.00092) [2022-07-09 00:27:49,258][25689] Fps is (10 sec: 5494.6, 60 sec: 5113.3, 300 sec: 5373.9). Total num frames: 8363008. Throughput: 0: 5693.7. Samples: 8364906. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:27:49,259][25689] Avg episode reward: [(0, '-82.820')] [2022-07-09 00:27:49,772][26022] Updated weights on worker 0-0, policy_version 8170 (0.00118) [2022-07-09 00:27:51,681][26022] Updated weights on worker 0-0, policy_version 8180 (0.00086) [2022-07-09 00:27:53,680][26022] Updated weights on worker 0-0, policy_version 8190 (0.00092) [2022-07-09 00:27:54,364][25689] Fps is (10 sec: 5451.3, 60 sec: 5229.6, 300 sec: 5372.4). Total num frames: 8390656. Throughput: 0: 5673.6. Samples: 8397658. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:27:54,365][25689] Avg episode reward: [(0, '-83.040')] [2022-07-09 00:27:55,344][26022] Updated weights on worker 0-0, policy_version 8200 (0.00087) [2022-07-09 00:27:57,373][26022] Updated weights on worker 0-0, policy_version 8210 (0.00089) [2022-07-09 00:27:59,039][26022] Updated weights on worker 0-0, policy_version 8220 (0.00091) [2022-07-09 00:27:59,447][25689] Fps is (10 sec: 5427.1, 60 sec: 5311.3, 300 sec: 5374.5). Total num frames: 8418304. Throughput: 0: 4856.3. Samples: 8414038. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:27:59,448][25689] Avg episode reward: [(0, '-83.860')] [2022-07-09 00:28:01,094][26022] Updated weights on worker 0-0, policy_version 8230 (0.00095) [2022-07-09 00:28:03,215][26022] Updated weights on worker 0-0, policy_version 8240 (0.00084) [2022-07-09 00:28:04,544][25689] Fps is (10 sec: 5230.8, 60 sec: 5421.0, 300 sec: 5369.6). Total num frames: 8443904. Throughput: 0: 5548.2. Samples: 8444810. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:28:04,545][25689] Avg episode reward: [(0, '-83.485')] [2022-07-09 00:28:05,254][26022] Updated weights on worker 0-0, policy_version 8250 (0.01130) [2022-07-09 00:28:07,209][26022] Updated weights on worker 0-0, policy_version 8260 (0.00087) [2022-07-09 00:28:08,908][26022] Updated weights on worker 0-0, policy_version 8270 (0.00086) [2022-07-09 00:28:09,592][25689] Fps is (10 sec: 5248.6, 60 sec: 5420.5, 300 sec: 5369.8). Total num frames: 8471552. Throughput: 0: 5531.4. Samples: 8477382. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:28:09,593][25689] Avg episode reward: [(0, '-84.014')] [2022-07-09 00:28:11,000][26022] Updated weights on worker 0-0, policy_version 8280 (0.00089) [2022-07-09 00:28:11,675][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:28:11,688][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000008284_8482816.pth [2022-07-09 00:28:11,689][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000006388_6541312.pth [2022-07-09 00:28:12,826][26022] Updated weights on worker 0-0, policy_version 8290 (0.00092) [2022-07-09 00:28:14,704][25689] Fps is (10 sec: 5341.8, 60 sec: 5416.6, 300 sec: 5364.7). Total num frames: 8498176. Throughput: 0: 4694.8. Samples: 8493148. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 00:28:14,705][25689] Avg episode reward: [(0, '-85.008')] [2022-07-09 00:28:14,755][26022] Updated weights on worker 0-0, policy_version 8300 (0.00094) [2022-07-09 00:28:16,649][26022] Updated weights on worker 0-0, policy_version 8310 (0.00084) [2022-07-09 00:28:18,380][26022] Updated weights on worker 0-0, policy_version 8320 (0.00087) [2022-07-09 00:28:19,719][25689] Fps is (10 sec: 5258.4, 60 sec: 5371.0, 300 sec: 5358.0). Total num frames: 8524800. Throughput: 0: 5525.4. Samples: 8526046. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 00:28:19,719][25689] Avg episode reward: [(0, '-84.862')] [2022-07-09 00:28:20,462][26022] Updated weights on worker 0-0, policy_version 8330 (0.00084) [2022-07-09 00:28:22,563][26022] Updated weights on worker 0-0, policy_version 8340 (0.00090) [2022-07-09 00:28:24,336][26022] Updated weights on worker 0-0, policy_version 8350 (0.00088) [2022-07-09 00:28:24,766][25689] Fps is (10 sec: 5393.7, 60 sec: 5391.2, 300 sec: 5354.2). Total num frames: 8552448. Throughput: 0: 5555.3. Samples: 8557150. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 00:28:24,767][25689] Avg episode reward: [(0, '-85.325')] [2022-07-09 00:28:26,495][26022] Updated weights on worker 0-0, policy_version 8360 (0.00081) [2022-07-09 00:28:28,007][26022] Updated weights on worker 0-0, policy_version 8370 (0.00085) [2022-07-09 00:28:29,789][25689] Fps is (10 sec: 5185.8, 60 sec: 5346.3, 300 sec: 5348.6). Total num frames: 8577024. Throughput: 0: 4750.9. Samples: 8573336. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:28:29,790][25689] Avg episode reward: [(0, '-84.792')] [2022-07-09 00:28:30,196][26022] Updated weights on worker 0-0, policy_version 8380 (0.00092) [2022-07-09 00:28:31,892][26022] Updated weights on worker 0-0, policy_version 8390 (0.00094) [2022-07-09 00:28:33,926][26022] Updated weights on worker 0-0, policy_version 8400 (0.00093) [2022-07-09 00:28:34,917][25689] Fps is (10 sec: 5447.4, 60 sec: 5391.4, 300 sec: 5357.1). Total num frames: 8607744. Throughput: 0: 5578.1. Samples: 8605898. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:28:34,918][25689] Avg episode reward: [(0, '-85.284')] [2022-07-09 00:28:35,784][26022] Updated weights on worker 0-0, policy_version 8410 (0.00089) [2022-07-09 00:28:37,759][26022] Updated weights on worker 0-0, policy_version 8420 (0.00086) [2022-07-09 00:28:39,364][26022] Updated weights on worker 0-0, policy_version 8430 (0.00107) [2022-07-09 00:28:39,926][25689] Fps is (10 sec: 5556.3, 60 sec: 5361.9, 300 sec: 5350.5). Total num frames: 8633344. Throughput: 0: 5563.8. Samples: 8638472. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:28:39,926][25689] Avg episode reward: [(0, '-85.834')] [2022-07-09 00:28:41,465][26022] Updated weights on worker 0-0, policy_version 8440 (0.00084) [2022-07-09 00:28:43,323][26022] Updated weights on worker 0-0, policy_version 8450 (0.00054) [2022-07-09 00:28:44,932][25689] Fps is (10 sec: 5317.1, 60 sec: 5363.6, 300 sec: 5350.5). Total num frames: 8660992. Throughput: 0: 4834.6. Samples: 8654642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:28:44,932][25689] Avg episode reward: [(0, '-85.858')] [2022-07-09 00:28:45,210][26022] Updated weights on worker 0-0, policy_version 8460 (0.00056) [2022-07-09 00:28:47,167][26022] Updated weights on worker 0-0, policy_version 8470 (0.00085) [2022-07-09 00:28:48,922][26022] Updated weights on worker 0-0, policy_version 8480 (0.00093) [2022-07-09 00:28:49,971][25689] Fps is (10 sec: 5402.8, 60 sec: 5346.6, 300 sec: 5347.5). Total num frames: 8687616. Throughput: 0: 5630.9. Samples: 8686976. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:28:49,971][25689] Avg episode reward: [(0, '-86.619')] [2022-07-09 00:28:50,936][26022] Updated weights on worker 0-0, policy_version 8490 (0.00096) [2022-07-09 00:28:53,037][26022] Updated weights on worker 0-0, policy_version 8500 (0.00093) [2022-07-09 00:28:54,719][26022] Updated weights on worker 0-0, policy_version 8510 (0.00082) [2022-07-09 00:28:55,035][25689] Fps is (10 sec: 5473.0, 60 sec: 5367.2, 300 sec: 5344.2). Total num frames: 8716288. Throughput: 0: 5645.7. Samples: 8719478. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:28:55,036][25689] Avg episode reward: [(0, '-86.763')] [2022-07-09 00:28:56,662][26022] Updated weights on worker 0-0, policy_version 8520 (0.00088) [2022-07-09 00:28:58,499][26022] Updated weights on worker 0-0, policy_version 8530 (0.00084) [2022-07-09 00:29:00,107][25689] Fps is (10 sec: 5455.4, 60 sec: 5351.3, 300 sec: 5350.1). Total num frames: 8742912. Throughput: 0: 5636.7. Samples: 8752228. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 00:29:00,107][25689] Avg episode reward: [(0, '-86.512')] [2022-07-09 00:29:00,314][26022] Updated weights on worker 0-0, policy_version 8540 (0.00088) [2022-07-09 00:29:02,732][26022] Updated weights on worker 0-0, policy_version 8550 (0.00091) [2022-07-09 00:29:04,445][26022] Updated weights on worker 0-0, policy_version 8560 (0.00094) [2022-07-09 00:29:05,117][25689] Fps is (10 sec: 4976.8, 60 sec: 5325.2, 300 sec: 5339.8). Total num frames: 8766464. Throughput: 0: 5549.9. Samples: 8766668. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 00:29:05,118][25689] Avg episode reward: [(0, '-86.618')] [2022-07-09 00:29:07,980][26022] Updated weights on worker 0-0, policy_version 8571 (0.00094) [2022-07-09 00:29:10,141][25689] Fps is (10 sec: 4184.0, 60 sec: 5175.0, 300 sec: 5310.3). Total num frames: 8784896. Throughput: 0: 5093.8. Samples: 8789722. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 00:29:10,142][25689] Avg episode reward: [(0, '-86.576')] [2022-07-09 00:29:10,900][26022] Updated weights on worker 0-0, policy_version 8581 (0.00090) [2022-07-09 00:29:14,169][26022] Updated weights on worker 0-0, policy_version 8591 (0.00095) [2022-07-09 00:29:15,215][25689] Fps is (10 sec: 3549.3, 60 sec: 5026.0, 300 sec: 5268.8). Total num frames: 8802304. Throughput: 0: 4529.8. Samples: 8810888. Policy #0 lag: (min: 0.0, avg: 9.6, max: 24.0) [2022-07-09 00:29:15,215][25689] Avg episode reward: [(0, '-87.183')] [2022-07-09 00:29:16,231][26022] Updated weights on worker 0-0, policy_version 8601 (0.00613) [2022-07-09 00:29:18,131][26022] Updated weights on worker 0-0, policy_version 8611 (0.00057) [2022-07-09 00:29:19,632][26022] Updated weights on worker 0-0, policy_version 8621 (0.00091) [2022-07-09 00:29:20,229][25689] Fps is (10 sec: 4568.1, 60 sec: 5059.9, 300 sec: 5275.6). Total num frames: 8830976. Throughput: 0: 3748.6. Samples: 8827656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 24.0) [2022-07-09 00:29:20,229][25689] Avg episode reward: [(0, '-87.160')] [2022-07-09 00:29:22,044][26022] Updated weights on worker 0-0, policy_version 8631 (0.00089) [2022-07-09 00:29:23,291][26022] Updated weights on worker 0-0, policy_version 8641 (0.00098) [2022-07-09 00:29:25,251][25689] Fps is (10 sec: 5305.7, 60 sec: 5011.3, 300 sec: 5263.0). Total num frames: 8855552. Throughput: 0: 4656.9. Samples: 8860428. Policy #0 lag: (min: 0.0, avg: 9.6, max: 24.0) [2022-07-09 00:29:25,251][25689] Avg episode reward: [(0, '-86.522')] [2022-07-09 00:29:25,699][26022] Updated weights on worker 0-0, policy_version 8651 (0.00087) [2022-07-09 00:29:27,318][26022] Updated weights on worker 0-0, policy_version 8661 (0.00091) [2022-07-09 00:29:29,345][26022] Updated weights on worker 0-0, policy_version 8671 (0.00085) [2022-07-09 00:29:30,263][25689] Fps is (10 sec: 5306.7, 60 sec: 5079.9, 300 sec: 5268.1). Total num frames: 8884224. Throughput: 0: 5127.0. Samples: 8892884. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:29:30,263][25689] Avg episode reward: [(0, '-86.913')] [2022-07-09 00:29:31,122][26022] Updated weights on worker 0-0, policy_version 8681 (0.00089) [2022-07-09 00:29:33,091][26022] Updated weights on worker 0-0, policy_version 8691 (0.00083) [2022-07-09 00:29:34,855][26022] Updated weights on worker 0-0, policy_version 8701 (0.00096) [2022-07-09 00:29:35,315][25689] Fps is (10 sec: 5595.8, 60 sec: 5035.4, 300 sec: 5267.7). Total num frames: 8911872. Throughput: 0: 4900.5. Samples: 8909392. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:29:35,316][25689] Avg episode reward: [(0, '-87.293')] [2022-07-09 00:29:36,601][26022] Updated weights on worker 0-0, policy_version 8711 (0.00091) [2022-07-09 00:29:38,763][26022] Updated weights on worker 0-0, policy_version 8721 (0.00502) [2022-07-09 00:29:40,368][25689] Fps is (10 sec: 5573.6, 60 sec: 5082.6, 300 sec: 5263.8). Total num frames: 8940544. Throughput: 0: 5698.2. Samples: 8942412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:29:40,368][25689] Avg episode reward: [(0, '-85.095')] [2022-07-09 00:29:40,369][26022] Updated weights on worker 0-0, policy_version 8731 (0.00096) [2022-07-09 00:29:42,352][26022] Updated weights on worker 0-0, policy_version 8741 (0.00086) [2022-07-09 00:29:44,285][26022] Updated weights on worker 0-0, policy_version 8751 (0.00054) [2022-07-09 00:29:45,369][25689] Fps is (10 sec: 5499.9, 60 sec: 5066.0, 300 sec: 5268.9). Total num frames: 8967168. Throughput: 0: 5695.5. Samples: 8975014. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 00:29:45,370][25689] Avg episode reward: [(0, '-84.711')] [2022-07-09 00:29:46,119][26022] Updated weights on worker 0-0, policy_version 8761 (0.00063) [2022-07-09 00:29:48,027][26022] Updated weights on worker 0-0, policy_version 8771 (0.00094) [2022-07-09 00:29:49,836][26022] Updated weights on worker 0-0, policy_version 8781 (0.00084) [2022-07-09 00:29:50,433][25689] Fps is (10 sec: 5290.3, 60 sec: 5063.9, 300 sec: 5255.8). Total num frames: 8993792. Throughput: 0: 4883.3. Samples: 8991380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 00:29:50,434][25689] Avg episode reward: [(0, '-84.872')] [2022-07-09 00:29:51,551][26022] Updated weights on worker 0-0, policy_version 8791 (0.00079) [2022-07-09 00:29:53,678][26022] Updated weights on worker 0-0, policy_version 8801 (0.00086) [2022-07-09 00:29:55,220][26022] Updated weights on worker 0-0, policy_version 8811 (0.00086) [2022-07-09 00:29:55,500][25689] Fps is (10 sec: 5559.1, 60 sec: 5080.6, 300 sec: 5266.1). Total num frames: 9023488. Throughput: 0: 5712.6. Samples: 9024700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 00:29:55,501][25689] Avg episode reward: [(0, '-84.474')] [2022-07-09 00:29:57,322][26022] Updated weights on worker 0-0, policy_version 8821 (0.00087) [2022-07-09 00:29:59,131][26022] Updated weights on worker 0-0, policy_version 8831 (0.00083) [2022-07-09 00:30:00,520][25689] Fps is (10 sec: 5583.1, 60 sec: 5085.0, 300 sec: 5265.9). Total num frames: 9050112. Throughput: 0: 5726.4. Samples: 9057814. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 00:30:00,521][25689] Avg episode reward: [(0, '-85.891')] [2022-07-09 00:30:00,863][26022] Updated weights on worker 0-0, policy_version 8841 (0.00087) [2022-07-09 00:30:03,296][26022] Updated weights on worker 0-0, policy_version 8851 (0.00087) [2022-07-09 00:30:05,051][26022] Updated weights on worker 0-0, policy_version 8861 (0.00095) [2022-07-09 00:30:05,541][25689] Fps is (10 sec: 5201.0, 60 sec: 5117.9, 300 sec: 5258.9). Total num frames: 9075712. Throughput: 0: 4816.4. Samples: 9072172. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 00:30:05,542][25689] Avg episode reward: [(0, '-85.758')] [2022-07-09 00:30:07,018][26022] Updated weights on worker 0-0, policy_version 8871 (0.00088) [2022-07-09 00:30:08,852][26022] Updated weights on worker 0-0, policy_version 8881 (0.00091) [2022-07-09 00:30:10,568][25689] Fps is (10 sec: 5299.3, 60 sec: 5270.2, 300 sec: 5260.5). Total num frames: 9103360. Throughput: 0: 5642.9. Samples: 9105002. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 00:30:10,569][25689] Avg episode reward: [(0, '-85.603')] [2022-07-09 00:30:10,597][26022] Updated weights on worker 0-0, policy_version 8891 (0.00086) [2022-07-09 00:30:11,752][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:30:11,774][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000008896_9109504.pth [2022-07-09 00:30:11,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000007030_7198720.pth [2022-07-09 00:30:12,749][26022] Updated weights on worker 0-0, policy_version 8901 (0.00086) [2022-07-09 00:30:14,444][26022] Updated weights on worker 0-0, policy_version 8911 (0.00613) [2022-07-09 00:30:15,688][25689] Fps is (10 sec: 5349.0, 60 sec: 5418.6, 300 sec: 5255.6). Total num frames: 9129984. Throughput: 0: 5581.1. Samples: 9137368. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 00:30:15,688][25689] Avg episode reward: [(0, '-86.755')] [2022-07-09 00:30:16,403][26022] Updated weights on worker 0-0, policy_version 8921 (0.00078) [2022-07-09 00:30:18,198][26022] Updated weights on worker 0-0, policy_version 8931 (0.00091) [2022-07-09 00:30:19,972][26022] Updated weights on worker 0-0, policy_version 8941 (0.00093) [2022-07-09 00:30:20,774][25689] Fps is (10 sec: 5317.9, 60 sec: 5395.2, 300 sec: 5250.8). Total num frames: 9157632. Throughput: 0: 4743.0. Samples: 9153876. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 00:30:20,775][25689] Avg episode reward: [(0, '-87.871')] [2022-07-09 00:30:22,011][26022] Updated weights on worker 0-0, policy_version 8951 (0.00087) [2022-07-09 00:30:23,874][26022] Updated weights on worker 0-0, policy_version 8961 (0.00084) [2022-07-09 00:30:25,569][26022] Updated weights on worker 0-0, policy_version 8971 (0.00088) [2022-07-09 00:30:25,837][25689] Fps is (10 sec: 5649.8, 60 sec: 5476.0, 300 sec: 5260.2). Total num frames: 9187328. Throughput: 0: 5649.1. Samples: 9186826. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 00:30:25,838][25689] Avg episode reward: [(0, '-86.513')] [2022-07-09 00:30:27,712][26022] Updated weights on worker 0-0, policy_version 8981 (0.00089) [2022-07-09 00:30:29,606][26022] Updated weights on worker 0-0, policy_version 8991 (0.00086) [2022-07-09 00:30:30,883][25689] Fps is (10 sec: 5368.9, 60 sec: 5405.5, 300 sec: 5243.4). Total num frames: 9211904. Throughput: 0: 5635.4. Samples: 9219480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 00:30:30,883][25689] Avg episode reward: [(0, '-86.268')] [2022-07-09 00:30:31,428][26022] Updated weights on worker 0-0, policy_version 9001 (0.00094) [2022-07-09 00:30:33,461][26022] Updated weights on worker 0-0, policy_version 9011 (0.00681) [2022-07-09 00:30:35,079][26022] Updated weights on worker 0-0, policy_version 9021 (0.00887) [2022-07-09 00:30:35,990][25689] Fps is (10 sec: 5345.9, 60 sec: 5434.4, 300 sec: 5251.9). Total num frames: 9241600. Throughput: 0: 4856.2. Samples: 9235958. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 00:30:35,992][25689] Avg episode reward: [(0, '-86.912')] [2022-07-09 00:30:37,160][26022] Updated weights on worker 0-0, policy_version 9031 (0.00075) [2022-07-09 00:30:38,877][26022] Updated weights on worker 0-0, policy_version 9041 (0.00087) [2022-07-09 00:30:40,952][26022] Updated weights on worker 0-0, policy_version 9051 (0.00736) [2022-07-09 00:30:41,009][25689] Fps is (10 sec: 5562.0, 60 sec: 5403.6, 300 sec: 5248.3). Total num frames: 9268224. Throughput: 0: 5683.5. Samples: 9268878. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 00:30:41,009][25689] Avg episode reward: [(0, '-86.109')] [2022-07-09 00:30:42,685][26022] Updated weights on worker 0-0, policy_version 9061 (0.00083) [2022-07-09 00:30:44,855][26022] Updated weights on worker 0-0, policy_version 9071 (0.00394) [2022-07-09 00:30:46,072][25689] Fps is (10 sec: 5383.0, 60 sec: 5415.0, 300 sec: 5247.5). Total num frames: 9295872. Throughput: 0: 5615.4. Samples: 9300448. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-09 00:30:46,075][25689] Avg episode reward: [(0, '-86.509')] [2022-07-09 00:30:46,525][26022] Updated weights on worker 0-0, policy_version 9081 (0.00087) [2022-07-09 00:30:48,661][26022] Updated weights on worker 0-0, policy_version 9091 (0.00098) [2022-07-09 00:30:50,359][26022] Updated weights on worker 0-0, policy_version 9101 (0.00095) [2022-07-09 00:30:51,140][25689] Fps is (10 sec: 5356.9, 60 sec: 5414.6, 300 sec: 5240.5). Total num frames: 9322496. Throughput: 0: 4804.9. Samples: 9316814. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-09 00:30:51,142][25689] Avg episode reward: [(0, '-86.337')] [2022-07-09 00:30:52,694][26022] Updated weights on worker 0-0, policy_version 9111 (0.00090) [2022-07-09 00:30:54,455][26022] Updated weights on worker 0-0, policy_version 9121 (0.00098) [2022-07-09 00:30:56,246][25689] Fps is (10 sec: 5032.1, 60 sec: 5326.8, 300 sec: 5232.6). Total num frames: 9347072. Throughput: 0: 5477.4. Samples: 9346912. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-09 00:30:56,247][25689] Avg episode reward: [(0, '-85.894')] [2022-07-09 00:30:56,650][26022] Updated weights on worker 0-0, policy_version 9131 (0.00088) [2022-07-09 00:30:58,379][26022] Updated weights on worker 0-0, policy_version 9141 (0.00085) [2022-07-09 00:31:00,659][26022] Updated weights on worker 0-0, policy_version 9151 (0.00087) [2022-07-09 00:31:01,281][25689] Fps is (10 sec: 4947.7, 60 sec: 5308.7, 300 sec: 5228.8). Total num frames: 9372672. Throughput: 0: 5416.2. Samples: 9378678. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-09 00:31:01,282][25689] Avg episode reward: [(0, '-86.148')] [2022-07-09 00:31:02,573][26022] Updated weights on worker 0-0, policy_version 9161 (0.00082) [2022-07-09 00:31:04,937][26022] Updated weights on worker 0-0, policy_version 9171 (0.00085) [2022-07-09 00:31:06,332][25689] Fps is (10 sec: 5178.1, 60 sec: 5322.9, 300 sec: 5224.6). Total num frames: 9399296. Throughput: 0: 5344.7. Samples: 9408732. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 00:31:06,332][25689] Avg episode reward: [(0, '-84.686')] [2022-07-09 00:31:06,456][26022] Updated weights on worker 0-0, policy_version 9181 (0.00087) [2022-07-09 00:31:08,728][26022] Updated weights on worker 0-0, policy_version 9191 (0.00084) [2022-07-09 00:31:10,362][26022] Updated weights on worker 0-0, policy_version 9201 (0.00083) [2022-07-09 00:31:11,352][25689] Fps is (10 sec: 5287.4, 60 sec: 5306.7, 300 sec: 5225.3). Total num frames: 9425920. Throughput: 0: 5314.2. Samples: 9424224. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 00:31:11,352][25689] Avg episode reward: [(0, '-84.615')] [2022-07-09 00:31:12,631][26022] Updated weights on worker 0-0, policy_version 9211 (0.00088) [2022-07-09 00:31:14,250][26022] Updated weights on worker 0-0, policy_version 9221 (0.00087) [2022-07-09 00:31:16,396][25689] Fps is (10 sec: 5189.3, 60 sec: 5296.4, 300 sec: 5214.9). Total num frames: 9451520. Throughput: 0: 5431.9. Samples: 9456362. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 00:31:16,396][25689] Avg episode reward: [(0, '-82.970')] [2022-07-09 00:31:16,443][26022] Updated weights on worker 0-0, policy_version 9231 (0.00090) [2022-07-09 00:31:18,057][26022] Updated weights on worker 0-0, policy_version 9241 (0.00102) [2022-07-09 00:31:20,255][26022] Updated weights on worker 0-0, policy_version 9251 (0.00091) [2022-07-09 00:31:21,421][25689] Fps is (10 sec: 5186.5, 60 sec: 5284.9, 300 sec: 5211.7). Total num frames: 9478144. Throughput: 0: 5444.7. Samples: 9488334. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 00:31:21,422][25689] Avg episode reward: [(0, '-82.445')] [2022-07-09 00:31:21,951][26022] Updated weights on worker 0-0, policy_version 9261 (0.00085) [2022-07-09 00:31:24,063][26022] Updated weights on worker 0-0, policy_version 9271 (0.00051) [2022-07-09 00:31:26,020][26022] Updated weights on worker 0-0, policy_version 9281 (0.00098) [2022-07-09 00:31:26,423][25689] Fps is (10 sec: 5412.6, 60 sec: 5256.5, 300 sec: 5215.6). Total num frames: 9505792. Throughput: 0: 4735.2. Samples: 9503866. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 00:31:26,423][25689] Avg episode reward: [(0, '-82.408')] [2022-07-09 00:31:27,872][26022] Updated weights on worker 0-0, policy_version 9291 (0.00084) [2022-07-09 00:31:29,818][26022] Updated weights on worker 0-0, policy_version 9301 (0.00091) [2022-07-09 00:31:31,443][25689] Fps is (10 sec: 5415.4, 60 sec: 5292.5, 300 sec: 5213.2). Total num frames: 9532416. Throughput: 0: 5578.1. Samples: 9536294. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 00:31:31,443][25689] Avg episode reward: [(0, '-82.018')] [2022-07-09 00:31:31,775][26022] Updated weights on worker 0-0, policy_version 9311 (0.00090) [2022-07-09 00:31:33,423][26022] Updated weights on worker 0-0, policy_version 9321 (0.00086) [2022-07-09 00:31:35,488][26022] Updated weights on worker 0-0, policy_version 9331 (0.00098) [2022-07-09 00:31:36,508][25689] Fps is (10 sec: 5381.0, 60 sec: 5262.2, 300 sec: 5216.6). Total num frames: 9560064. Throughput: 0: 5598.2. Samples: 9568958. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:31:36,509][25689] Avg episode reward: [(0, '-81.717')] [2022-07-09 00:31:37,313][26022] Updated weights on worker 0-0, policy_version 9341 (0.00083) [2022-07-09 00:31:39,049][26022] Updated weights on worker 0-0, policy_version 9351 (0.00088) [2022-07-09 00:31:41,379][26022] Updated weights on worker 0-0, policy_version 9361 (0.00092) [2022-07-09 00:31:41,510][25689] Fps is (10 sec: 5391.1, 60 sec: 5263.8, 300 sec: 5223.8). Total num frames: 9586688. Throughput: 0: 4831.7. Samples: 9585396. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:31:41,510][25689] Avg episode reward: [(0, '-82.010')] [2022-07-09 00:31:43,720][26022] Updated weights on worker 0-0, policy_version 9371 (0.00086) [2022-07-09 00:31:45,743][26022] Updated weights on worker 0-0, policy_version 9381 (0.00084) [2022-07-09 00:31:46,520][25689] Fps is (10 sec: 4909.6, 60 sec: 5183.7, 300 sec: 5224.1). Total num frames: 9609216. Throughput: 0: 5417.9. Samples: 9612750. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:31:46,520][25689] Avg episode reward: [(0, '-82.241')] [2022-07-09 00:31:47,851][26022] Updated weights on worker 0-0, policy_version 9391 (0.00082) [2022-07-09 00:31:49,565][26022] Updated weights on worker 0-0, policy_version 9401 (0.00091) [2022-07-09 00:31:51,547][25689] Fps is (10 sec: 4896.9, 60 sec: 5187.2, 300 sec: 5245.8). Total num frames: 9635840. Throughput: 0: 5410.8. Samples: 9645074. Policy #0 lag: (min: 0.0, avg: 7.4, max: 20.0) [2022-07-09 00:31:51,548][25689] Avg episode reward: [(0, '-83.174')] [2022-07-09 00:31:51,627][26022] Updated weights on worker 0-0, policy_version 9411 (0.00086) [2022-07-09 00:31:53,429][26022] Updated weights on worker 0-0, policy_version 9421 (0.00089) [2022-07-09 00:31:55,397][26022] Updated weights on worker 0-0, policy_version 9431 (0.00086) [2022-07-09 00:31:56,589][25689] Fps is (10 sec: 5389.8, 60 sec: 5243.6, 300 sec: 5263.1). Total num frames: 9663488. Throughput: 0: 4602.5. Samples: 9661380. Policy #0 lag: (min: 0.0, avg: 7.4, max: 20.0) [2022-07-09 00:31:56,590][25689] Avg episode reward: [(0, '-82.745')] [2022-07-09 00:31:57,216][26022] Updated weights on worker 0-0, policy_version 9441 (0.00081) [2022-07-09 00:31:59,316][26022] Updated weights on worker 0-0, policy_version 9451 (0.00081) [2022-07-09 00:32:00,954][26022] Updated weights on worker 0-0, policy_version 9461 (0.00095) [2022-07-09 00:32:01,609][25689] Fps is (10 sec: 5393.9, 60 sec: 5261.9, 300 sec: 5290.3). Total num frames: 9690112. Throughput: 0: 5408.2. Samples: 9694096. Policy #0 lag: (min: 0.0, avg: 7.4, max: 20.0) [2022-07-09 00:32:01,609][25689] Avg episode reward: [(0, '-84.723')] [2022-07-09 00:32:03,517][26022] Updated weights on worker 0-0, policy_version 9471 (0.00092) [2022-07-09 00:32:05,076][26022] Updated weights on worker 0-0, policy_version 9481 (0.00092) [2022-07-09 00:32:06,639][25689] Fps is (10 sec: 5196.7, 60 sec: 5246.8, 300 sec: 5283.7). Total num frames: 9715712. Throughput: 0: 5560.7. Samples: 9724624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:32:06,639][25689] Avg episode reward: [(0, '-85.422')] [2022-07-09 00:32:07,261][26022] Updated weights on worker 0-0, policy_version 9491 (0.00094) [2022-07-09 00:32:08,803][26022] Updated weights on worker 0-0, policy_version 9501 (0.00095) [2022-07-09 00:32:10,924][26022] Updated weights on worker 0-0, policy_version 9511 (0.00093) [2022-07-09 00:32:11,668][25689] Fps is (10 sec: 5293.4, 60 sec: 5262.9, 300 sec: 5287.8). Total num frames: 9743360. Throughput: 0: 4763.3. Samples: 9740916. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:32:11,669][25689] Avg episode reward: [(0, '-85.616')] [2022-07-09 00:32:11,990][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:32:12,003][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000009517_9745408.pth [2022-07-09 00:32:12,008][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000007676_7860224.pth [2022-07-09 00:32:12,583][26022] Updated weights on worker 0-0, policy_version 9521 (0.00091) [2022-07-09 00:32:14,600][26022] Updated weights on worker 0-0, policy_version 9531 (0.00083) [2022-07-09 00:32:16,386][26022] Updated weights on worker 0-0, policy_version 9541 (0.00924) [2022-07-09 00:32:16,770][25689] Fps is (10 sec: 5457.7, 60 sec: 5291.7, 300 sec: 5280.4). Total num frames: 9771008. Throughput: 0: 5566.7. Samples: 9773722. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:32:16,771][25689] Avg episode reward: [(0, '-85.672')] [2022-07-09 00:32:18,403][26022] Updated weights on worker 0-0, policy_version 9551 (0.00090) [2022-07-09 00:32:20,136][26022] Updated weights on worker 0-0, policy_version 9561 (0.00093) [2022-07-09 00:32:21,821][25689] Fps is (10 sec: 5345.2, 60 sec: 5289.5, 300 sec: 5281.0). Total num frames: 9797632. Throughput: 0: 5549.4. Samples: 9806264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:32:21,822][25689] Avg episode reward: [(0, '-85.876')] [2022-07-09 00:32:22,224][26022] Updated weights on worker 0-0, policy_version 9571 (0.00092) [2022-07-09 00:32:23,950][26022] Updated weights on worker 0-0, policy_version 9581 (0.00091) [2022-07-09 00:32:26,011][26022] Updated weights on worker 0-0, policy_version 9591 (0.00094) [2022-07-09 00:32:26,829][25689] Fps is (10 sec: 5497.6, 60 sec: 5306.0, 300 sec: 5285.8). Total num frames: 9826304. Throughput: 0: 4863.0. Samples: 9822808. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:32:26,829][25689] Avg episode reward: [(0, '-85.896')] [2022-07-09 00:32:27,698][26022] Updated weights on worker 0-0, policy_version 9601 (0.00102) [2022-07-09 00:32:29,622][26022] Updated weights on worker 0-0, policy_version 9611 (0.00092) [2022-07-09 00:32:31,456][26022] Updated weights on worker 0-0, policy_version 9621 (0.00099) [2022-07-09 00:32:31,892][25689] Fps is (10 sec: 5592.8, 60 sec: 5319.1, 300 sec: 5285.9). Total num frames: 9853952. Throughput: 0: 5677.5. Samples: 9855736. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:32:31,893][25689] Avg episode reward: [(0, '-86.109')] [2022-07-09 00:32:33,638][26022] Updated weights on worker 0-0, policy_version 9631 (0.00087) [2022-07-09 00:32:35,206][26022] Updated weights on worker 0-0, policy_version 9641 (0.00100) [2022-07-09 00:32:37,000][25689] Fps is (10 sec: 5335.9, 60 sec: 5298.5, 300 sec: 5281.5). Total num frames: 9880576. Throughput: 0: 5691.1. Samples: 9888850. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:32:37,001][25689] Avg episode reward: [(0, '-85.805')] [2022-07-09 00:32:37,341][26022] Updated weights on worker 0-0, policy_version 9651 (0.00087) [2022-07-09 00:32:38,596][26022] Updated weights on worker 0-0, policy_version 9661 (0.00082) [2022-07-09 00:32:40,962][26022] Updated weights on worker 0-0, policy_version 9671 (0.00080) [2022-07-09 00:32:42,036][25689] Fps is (10 sec: 5653.0, 60 sec: 5363.1, 300 sec: 5291.7). Total num frames: 9911296. Throughput: 0: 4900.5. Samples: 9905324. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:32:42,037][25689] Avg episode reward: [(0, '-84.775')] [2022-07-09 00:32:42,513][26022] Updated weights on worker 0-0, policy_version 9681 (0.00085) [2022-07-09 00:32:44,448][26022] Updated weights on worker 0-0, policy_version 9691 (0.00089) [2022-07-09 00:32:46,394][26022] Updated weights on worker 0-0, policy_version 9701 (0.00095) [2022-07-09 00:32:47,040][25689] Fps is (10 sec: 5609.8, 60 sec: 5414.4, 300 sec: 5285.4). Total num frames: 9936896. Throughput: 0: 5723.0. Samples: 9938474. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:32:47,040][25689] Avg episode reward: [(0, '-83.941')] [2022-07-09 00:32:48,230][26022] Updated weights on worker 0-0, policy_version 9711 (0.00084) [2022-07-09 00:32:49,981][26022] Updated weights on worker 0-0, policy_version 9721 (0.00081) [2022-07-09 00:32:52,048][25689] Fps is (10 sec: 5215.7, 60 sec: 5416.1, 300 sec: 5283.7). Total num frames: 9963520. Throughput: 0: 5742.1. Samples: 9971478. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:32:52,049][25689] Avg episode reward: [(0, '-82.863')] [2022-07-09 00:32:52,214][26022] Updated weights on worker 0-0, policy_version 9731 (0.00102) [2022-07-09 00:32:53,607][26022] Updated weights on worker 0-0, policy_version 9741 (0.00086) [2022-07-09 00:32:55,947][26022] Updated weights on worker 0-0, policy_version 9751 (0.00082) [2022-07-09 00:32:57,169][25689] Fps is (10 sec: 5559.9, 60 sec: 5442.9, 300 sec: 5289.9). Total num frames: 9993216. Throughput: 0: 4907.6. Samples: 9987832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:32:57,170][25689] Avg episode reward: [(0, '-81.836')] [2022-07-09 00:32:57,390][26022] Updated weights on worker 0-0, policy_version 9761 (0.00094) [2022-07-09 00:32:59,383][26022] Updated weights on worker 0-0, policy_version 9771 (0.00091) [2022-07-09 00:33:01,116][26022] Updated weights on worker 0-0, policy_version 9781 (0.00089) [2022-07-09 00:33:02,244][25689] Fps is (10 sec: 5423.5, 60 sec: 5421.0, 300 sec: 5290.3). Total num frames: 10018816. Throughput: 0: 5740.4. Samples: 10021330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:33:02,245][25689] Avg episode reward: [(0, '-81.541')] [2022-07-09 00:33:03,339][26022] Updated weights on worker 0-0, policy_version 9791 (0.00086) [2022-07-09 00:33:05,303][26022] Updated weights on worker 0-0, policy_version 9801 (0.00252) [2022-07-09 00:33:07,220][26022] Updated weights on worker 0-0, policy_version 9811 (0.00081) [2022-07-09 00:33:07,273][25689] Fps is (10 sec: 5270.1, 60 sec: 5454.9, 300 sec: 5290.6). Total num frames: 10046464. Throughput: 0: 5630.8. Samples: 10052404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:33:07,274][25689] Avg episode reward: [(0, '-81.118')] [2022-07-09 00:33:08,915][26022] Updated weights on worker 0-0, policy_version 9821 (0.00083) [2022-07-09 00:33:10,863][26022] Updated weights on worker 0-0, policy_version 9831 (0.00090) [2022-07-09 00:33:12,296][25689] Fps is (10 sec: 5501.3, 60 sec: 5455.5, 300 sec: 5295.6). Total num frames: 10074112. Throughput: 0: 4816.9. Samples: 10069006. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 00:33:12,296][25689] Avg episode reward: [(0, '-82.145')] [2022-07-09 00:33:12,714][26022] Updated weights on worker 0-0, policy_version 9841 (0.00082) [2022-07-09 00:33:14,526][26022] Updated weights on worker 0-0, policy_version 9851 (0.00093) [2022-07-09 00:33:16,438][26022] Updated weights on worker 0-0, policy_version 9861 (0.00091) [2022-07-09 00:33:17,400][25689] Fps is (10 sec: 5561.5, 60 sec: 5472.2, 300 sec: 5300.9). Total num frames: 10102784. Throughput: 0: 5669.8. Samples: 10102536. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 00:33:17,400][25689] Avg episode reward: [(0, '-83.533')] [2022-07-09 00:33:18,239][26022] Updated weights on worker 0-0, policy_version 9871 (0.00092) [2022-07-09 00:33:20,111][26022] Updated weights on worker 0-0, policy_version 9881 (0.00097) [2022-07-09 00:33:22,128][26022] Updated weights on worker 0-0, policy_version 9891 (0.00091) [2022-07-09 00:33:22,425][25689] Fps is (10 sec: 5358.1, 60 sec: 5457.7, 300 sec: 5294.4). Total num frames: 10128384. Throughput: 0: 5635.0. Samples: 10135048. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 00:33:22,425][25689] Avg episode reward: [(0, '-84.821')] [2022-07-09 00:33:23,865][26022] Updated weights on worker 0-0, policy_version 9901 (0.00086) [2022-07-09 00:33:26,052][26022] Updated weights on worker 0-0, policy_version 9911 (0.00097) [2022-07-09 00:33:27,439][25689] Fps is (10 sec: 5508.2, 60 sec: 5474.0, 300 sec: 5311.8). Total num frames: 10158080. Throughput: 0: 4922.7. Samples: 10151674. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 00:33:27,439][25689] Avg episode reward: [(0, '-85.610')] [2022-07-09 00:33:27,575][26022] Updated weights on worker 0-0, policy_version 9921 (0.00088) [2022-07-09 00:33:29,717][26022] Updated weights on worker 0-0, policy_version 9931 (0.00084) [2022-07-09 00:33:31,288][26022] Updated weights on worker 0-0, policy_version 9941 (0.00092) [2022-07-09 00:33:32,441][25689] Fps is (10 sec: 5520.7, 60 sec: 5445.6, 300 sec: 5296.8). Total num frames: 10183680. Throughput: 0: 5713.5. Samples: 10184106. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 00:33:32,441][25689] Avg episode reward: [(0, '-85.458')] [2022-07-09 00:33:33,419][26022] Updated weights on worker 0-0, policy_version 9951 (0.00095) [2022-07-09 00:33:34,957][26022] Updated weights on worker 0-0, policy_version 9961 (0.00081) [2022-07-09 00:33:36,994][26022] Updated weights on worker 0-0, policy_version 9971 (0.00084) [2022-07-09 00:33:37,544][25689] Fps is (10 sec: 5370.6, 60 sec: 5479.9, 300 sec: 5305.5). Total num frames: 10212352. Throughput: 0: 5683.3. Samples: 10217022. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 00:33:37,545][25689] Avg episode reward: [(0, '-84.280')] [2022-07-09 00:33:38,947][26022] Updated weights on worker 0-0, policy_version 9981 (0.00088) [2022-07-09 00:33:40,760][26022] Updated weights on worker 0-0, policy_version 9991 (0.00100) [2022-07-09 00:33:42,608][25689] Fps is (10 sec: 5539.6, 60 sec: 5426.7, 300 sec: 5304.5). Total num frames: 10240000. Throughput: 0: 4878.4. Samples: 10233508. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 00:33:42,608][25689] Avg episode reward: [(0, '-82.743')] [2022-07-09 00:33:42,770][26022] Updated weights on worker 0-0, policy_version 10001 (0.00089) [2022-07-09 00:33:44,496][26022] Updated weights on worker 0-0, policy_version 10011 (0.00079) [2022-07-09 00:33:46,562][26022] Updated weights on worker 0-0, policy_version 10021 (0.00809) [2022-07-09 00:33:47,699][25689] Fps is (10 sec: 5445.1, 60 sec: 5452.6, 300 sec: 5307.0). Total num frames: 10267648. Throughput: 0: 5674.5. Samples: 10266642. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:33:47,700][25689] Avg episode reward: [(0, '-81.988')] [2022-07-09 00:33:48,287][26022] Updated weights on worker 0-0, policy_version 10031 (0.00089) [2022-07-09 00:33:50,170][26022] Updated weights on worker 0-0, policy_version 10041 (0.00093) [2022-07-09 00:33:52,154][26022] Updated weights on worker 0-0, policy_version 10051 (0.00086) [2022-07-09 00:33:52,715][25689] Fps is (10 sec: 5369.8, 60 sec: 5452.0, 300 sec: 5300.9). Total num frames: 10294272. Throughput: 0: 5660.6. Samples: 10298868. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:33:52,715][25689] Avg episode reward: [(0, '-80.317')] [2022-07-09 00:33:53,888][26022] Updated weights on worker 0-0, policy_version 10061 (0.00091) [2022-07-09 00:33:55,970][26022] Updated weights on worker 0-0, policy_version 10071 (0.00098) [2022-07-09 00:33:57,778][25689] Fps is (10 sec: 5384.9, 60 sec: 5423.4, 300 sec: 5304.5). Total num frames: 10321920. Throughput: 0: 5646.1. Samples: 10331264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 00:33:57,778][25689] Avg episode reward: [(0, '-79.375')] [2022-07-09 00:33:57,857][26022] Updated weights on worker 0-0, policy_version 10081 (0.00086) [2022-07-09 00:33:59,632][26022] Updated weights on worker 0-0, policy_version 10091 (0.00085) [2022-07-09 00:34:01,777][26022] Updated weights on worker 0-0, policy_version 10101 (0.00096) [2022-07-09 00:34:02,790][25689] Fps is (10 sec: 5183.3, 60 sec: 5412.1, 300 sec: 5307.9). Total num frames: 10346496. Throughput: 0: 5652.8. Samples: 10347594. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:34:02,791][25689] Avg episode reward: [(0, '-78.391')] [2022-07-09 00:34:03,608][26022] Updated weights on worker 0-0, policy_version 10111 (0.00089) [2022-07-09 00:34:05,824][26022] Updated weights on worker 0-0, policy_version 10121 (0.00085) [2022-07-09 00:34:07,339][26022] Updated weights on worker 0-0, policy_version 10131 (0.00082) [2022-07-09 00:34:07,853][25689] Fps is (10 sec: 5284.9, 60 sec: 5425.9, 300 sec: 5341.6). Total num frames: 10375168. Throughput: 0: 5552.6. Samples: 10378550. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:34:07,854][25689] Avg episode reward: [(0, '-79.111')] [2022-07-09 00:34:09,486][26022] Updated weights on worker 0-0, policy_version 10141 (0.00090) [2022-07-09 00:34:11,354][26022] Updated weights on worker 0-0, policy_version 10151 (0.00085) [2022-07-09 00:34:12,139][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:34:12,151][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000010155_10398720.pth [2022-07-09 00:34:12,152][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000008284_8482816.pth [2022-07-09 00:34:12,878][25689] Fps is (10 sec: 5582.9, 60 sec: 5425.7, 300 sec: 5376.9). Total num frames: 10402816. Throughput: 0: 5582.9. Samples: 10411438. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:34:12,879][25689] Avg episode reward: [(0, '-77.867')] [2022-07-09 00:34:13,142][26022] Updated weights on worker 0-0, policy_version 10161 (0.00103) [2022-07-09 00:34:15,147][26022] Updated weights on worker 0-0, policy_version 10171 (0.00081) [2022-07-09 00:34:16,828][26022] Updated weights on worker 0-0, policy_version 10181 (0.00095) [2022-07-09 00:34:17,935][25689] Fps is (10 sec: 5383.4, 60 sec: 5396.2, 300 sec: 5369.3). Total num frames: 10429440. Throughput: 0: 4789.8. Samples: 10427812. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:34:17,935][25689] Avg episode reward: [(0, '-78.597')] [2022-07-09 00:34:18,971][26022] Updated weights on worker 0-0, policy_version 10191 (0.00082) [2022-07-09 00:34:20,860][26022] Updated weights on worker 0-0, policy_version 10201 (0.00081) [2022-07-09 00:34:22,606][26022] Updated weights on worker 0-0, policy_version 10211 (0.00088) [2022-07-09 00:34:22,940][25689] Fps is (10 sec: 5495.4, 60 sec: 5448.7, 300 sec: 5383.3). Total num frames: 10458112. Throughput: 0: 5607.0. Samples: 10460576. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:34:22,941][25689] Avg episode reward: [(0, '-78.991')] [2022-07-09 00:34:24,447][26022] Updated weights on worker 0-0, policy_version 10221 (0.00083) [2022-07-09 00:34:26,283][26022] Updated weights on worker 0-0, policy_version 10231 (0.00090) [2022-07-09 00:34:27,949][25689] Fps is (10 sec: 5521.5, 60 sec: 5398.3, 300 sec: 5376.5). Total num frames: 10484736. Throughput: 0: 5741.5. Samples: 10493932. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:34:27,950][25689] Avg episode reward: [(0, '-78.702')] [2022-07-09 00:34:28,200][26022] Updated weights on worker 0-0, policy_version 10241 (0.00082) [2022-07-09 00:34:29,972][26022] Updated weights on worker 0-0, policy_version 10251 (0.00096) [2022-07-09 00:34:31,908][26022] Updated weights on worker 0-0, policy_version 10261 (0.00082) [2022-07-09 00:34:32,980][25689] Fps is (10 sec: 5405.4, 60 sec: 5429.6, 300 sec: 5376.9). Total num frames: 10512384. Throughput: 0: 4916.7. Samples: 10510276. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 00:34:32,981][25689] Avg episode reward: [(0, '-77.120')] [2022-07-09 00:34:33,635][26022] Updated weights on worker 0-0, policy_version 10271 (0.00091) [2022-07-09 00:34:35,719][26022] Updated weights on worker 0-0, policy_version 10281 (0.00086) [2022-07-09 00:34:37,457][26022] Updated weights on worker 0-0, policy_version 10291 (0.00087) [2022-07-09 00:34:38,076][25689] Fps is (10 sec: 5662.8, 60 sec: 5447.2, 300 sec: 5379.6). Total num frames: 10542080. Throughput: 0: 5731.0. Samples: 10543240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:34:38,076][25689] Avg episode reward: [(0, '-76.096')] [2022-07-09 00:34:39,277][26022] Updated weights on worker 0-0, policy_version 10301 (0.00091) [2022-07-09 00:34:41,127][26022] Updated weights on worker 0-0, policy_version 10311 (0.00086) [2022-07-09 00:34:42,991][26022] Updated weights on worker 0-0, policy_version 10321 (0.00094) [2022-07-09 00:34:43,090][25689] Fps is (10 sec: 5570.8, 60 sec: 5434.7, 300 sec: 5379.3). Total num frames: 10568704. Throughput: 0: 5739.1. Samples: 10576218. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:34:43,091][25689] Avg episode reward: [(0, '-76.218')] [2022-07-09 00:34:45,082][26022] Updated weights on worker 0-0, policy_version 10331 (0.00083) [2022-07-09 00:34:46,905][26022] Updated weights on worker 0-0, policy_version 10341 (0.00086) [2022-07-09 00:34:48,097][25689] Fps is (10 sec: 5313.4, 60 sec: 5425.3, 300 sec: 5380.3). Total num frames: 10595328. Throughput: 0: 4895.1. Samples: 10592558. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:34:48,098][25689] Avg episode reward: [(0, '-76.058')] [2022-07-09 00:34:48,544][26022] Updated weights on worker 0-0, policy_version 10351 (0.00089) [2022-07-09 00:34:50,835][26022] Updated weights on worker 0-0, policy_version 10361 (0.00095) [2022-07-09 00:34:52,331][26022] Updated weights on worker 0-0, policy_version 10371 (0.00104) [2022-07-09 00:34:53,114][25689] Fps is (10 sec: 5414.4, 60 sec: 5442.2, 300 sec: 5374.4). Total num frames: 10622976. Throughput: 0: 5703.1. Samples: 10625098. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:34:53,114][25689] Avg episode reward: [(0, '-76.631')] [2022-07-09 00:34:54,443][26022] Updated weights on worker 0-0, policy_version 10381 (0.00082) [2022-07-09 00:34:55,973][26022] Updated weights on worker 0-0, policy_version 10391 (0.00093) [2022-07-09 00:34:58,199][25689] Fps is (10 sec: 5474.0, 60 sec: 5440.2, 300 sec: 5376.6). Total num frames: 10650624. Throughput: 0: 5711.6. Samples: 10658174. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:34:58,199][25689] Avg episode reward: [(0, '-77.277')] [2022-07-09 00:34:58,206][26022] Updated weights on worker 0-0, policy_version 10401 (0.00085) [2022-07-09 00:35:00,127][26022] Updated weights on worker 0-0, policy_version 10411 (0.00092) [2022-07-09 00:35:02,126][26022] Updated weights on worker 0-0, policy_version 10421 (0.00083) [2022-07-09 00:35:03,221][25689] Fps is (10 sec: 5268.2, 60 sec: 5456.3, 300 sec: 5376.6). Total num frames: 10676224. Throughput: 0: 4888.6. Samples: 10674630. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:35:03,222][25689] Avg episode reward: [(0, '-78.419')] [2022-07-09 00:35:04,084][26022] Updated weights on worker 0-0, policy_version 10431 (0.00081) [2022-07-09 00:35:05,889][26022] Updated weights on worker 0-0, policy_version 10441 (0.00092) [2022-07-09 00:35:07,753][26022] Updated weights on worker 0-0, policy_version 10451 (0.00094) [2022-07-09 00:35:08,263][25689] Fps is (10 sec: 5291.0, 60 sec: 5441.3, 300 sec: 5376.4). Total num frames: 10703872. Throughput: 0: 5596.3. Samples: 10705410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:35:08,263][25689] Avg episode reward: [(0, '-79.090')] [2022-07-09 00:35:09,715][26022] Updated weights on worker 0-0, policy_version 10461 (0.00084) [2022-07-09 00:35:11,484][26022] Updated weights on worker 0-0, policy_version 10471 (0.00090) [2022-07-09 00:35:13,275][25689] Fps is (10 sec: 5500.0, 60 sec: 5442.4, 300 sec: 5381.7). Total num frames: 10731520. Throughput: 0: 5623.3. Samples: 10738472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:35:13,275][25689] Avg episode reward: [(0, '-79.724')] [2022-07-09 00:35:13,365][26022] Updated weights on worker 0-0, policy_version 10481 (0.00087) [2022-07-09 00:35:15,232][26022] Updated weights on worker 0-0, policy_version 10491 (0.00083) [2022-07-09 00:35:17,137][26022] Updated weights on worker 0-0, policy_version 10501 (0.00092) [2022-07-09 00:35:18,353][25689] Fps is (10 sec: 5378.5, 60 sec: 5440.5, 300 sec: 5378.4). Total num frames: 10758144. Throughput: 0: 4804.4. Samples: 10755004. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:35:18,354][25689] Avg episode reward: [(0, '-78.899')] [2022-07-09 00:35:19,072][26022] Updated weights on worker 0-0, policy_version 10511 (0.00089) [2022-07-09 00:35:21,066][26022] Updated weights on worker 0-0, policy_version 10521 (0.00087) [2022-07-09 00:35:22,782][26022] Updated weights on worker 0-0, policy_version 10531 (0.00086) [2022-07-09 00:35:23,405][25689] Fps is (10 sec: 5559.6, 60 sec: 5453.2, 300 sec: 5378.6). Total num frames: 10787840. Throughput: 0: 5602.7. Samples: 10787714. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:35:23,406][25689] Avg episode reward: [(0, '-78.674')] [2022-07-09 00:35:24,733][26022] Updated weights on worker 0-0, policy_version 10541 (0.00514) [2022-07-09 00:35:26,429][26022] Updated weights on worker 0-0, policy_version 10551 (0.00089) [2022-07-09 00:35:28,418][25689] Fps is (10 sec: 5493.8, 60 sec: 5435.9, 300 sec: 5382.7). Total num frames: 10813440. Throughput: 0: 5719.5. Samples: 10820690. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:35:28,419][25689] Avg episode reward: [(0, '-77.877')] [2022-07-09 00:35:28,611][26022] Updated weights on worker 0-0, policy_version 10561 (0.00088) [2022-07-09 00:35:30,361][26022] Updated weights on worker 0-0, policy_version 10571 (0.00081) [2022-07-09 00:35:32,266][26022] Updated weights on worker 0-0, policy_version 10581 (0.00078) [2022-07-09 00:35:33,435][25689] Fps is (10 sec: 5309.2, 60 sec: 5437.2, 300 sec: 5377.4). Total num frames: 10841088. Throughput: 0: 4887.3. Samples: 10836998. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:35:33,442][25689] Avg episode reward: [(0, '-77.192')] [2022-07-09 00:35:33,984][26022] Updated weights on worker 0-0, policy_version 10591 (0.00094) [2022-07-09 00:35:35,918][26022] Updated weights on worker 0-0, policy_version 10601 (0.01117) [2022-07-09 00:35:37,764][26022] Updated weights on worker 0-0, policy_version 10611 (0.00092) [2022-07-09 00:35:38,527][25689] Fps is (10 sec: 5470.4, 60 sec: 5403.7, 300 sec: 5379.5). Total num frames: 10868736. Throughput: 0: 5679.0. Samples: 10869568. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:35:38,527][25689] Avg episode reward: [(0, '-78.606')] [2022-07-09 00:35:39,801][26022] Updated weights on worker 0-0, policy_version 10621 (0.00090) [2022-07-09 00:35:41,435][26022] Updated weights on worker 0-0, policy_version 10631 (0.00083) [2022-07-09 00:35:43,246][26022] Updated weights on worker 0-0, policy_version 10641 (0.00082) [2022-07-09 00:35:43,567][25689] Fps is (10 sec: 5457.5, 60 sec: 5418.3, 300 sec: 5380.0). Total num frames: 10896384. Throughput: 0: 5695.3. Samples: 10902540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 00:35:43,567][25689] Avg episode reward: [(0, '-77.045')] [2022-07-09 00:35:45,090][26022] Updated weights on worker 0-0, policy_version 10651 (0.00101) [2022-07-09 00:35:47,227][26022] Updated weights on worker 0-0, policy_version 10661 (0.00084) [2022-07-09 00:35:48,603][25689] Fps is (10 sec: 5487.9, 60 sec: 5432.6, 300 sec: 5384.0). Total num frames: 10924032. Throughput: 0: 4872.0. Samples: 10919028. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 00:35:48,603][25689] Avg episode reward: [(0, '-76.445')] [2022-07-09 00:35:49,018][26022] Updated weights on worker 0-0, policy_version 10671 (0.00087) [2022-07-09 00:35:51,143][26022] Updated weights on worker 0-0, policy_version 10681 (0.00090) [2022-07-09 00:35:52,662][26022] Updated weights on worker 0-0, policy_version 10691 (0.00087) [2022-07-09 00:35:53,622][25689] Fps is (10 sec: 5499.0, 60 sec: 5432.3, 300 sec: 5395.9). Total num frames: 10951680. Throughput: 0: 5675.3. Samples: 10951570. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 00:35:53,623][25689] Avg episode reward: [(0, '-75.801')] [2022-07-09 00:35:54,818][26022] Updated weights on worker 0-0, policy_version 10701 (0.00084) [2022-07-09 00:35:56,443][26022] Updated weights on worker 0-0, policy_version 10711 (0.00086) [2022-07-09 00:35:58,447][26022] Updated weights on worker 0-0, policy_version 10721 (0.00089) [2022-07-09 00:35:58,741][25689] Fps is (10 sec: 5454.2, 60 sec: 5429.3, 300 sec: 5401.3). Total num frames: 10979328. Throughput: 0: 5700.7. Samples: 10984804. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 00:35:58,742][25689] Avg episode reward: [(0, '-75.649')] [2022-07-09 00:36:00,291][26022] Updated weights on worker 0-0, policy_version 10731 (0.00082) [2022-07-09 00:36:02,550][26022] Updated weights on worker 0-0, policy_version 10741 (0.00102) [2022-07-09 00:36:03,763][25689] Fps is (10 sec: 5149.9, 60 sec: 5412.4, 300 sec: 5394.9). Total num frames: 11003904. Throughput: 0: 4877.0. Samples: 11001038. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 00:36:03,764][25689] Avg episode reward: [(0, '-74.892')] [2022-07-09 00:36:04,402][26022] Updated weights on worker 0-0, policy_version 10751 (0.00090) [2022-07-09 00:36:06,246][26022] Updated weights on worker 0-0, policy_version 10761 (0.00085) [2022-07-09 00:36:08,286][26022] Updated weights on worker 0-0, policy_version 10771 (0.00081) [2022-07-09 00:36:08,773][25689] Fps is (10 sec: 5409.7, 60 sec: 5449.1, 300 sec: 5405.4). Total num frames: 11033600. Throughput: 0: 5595.9. Samples: 11031900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 00:36:08,774][25689] Avg episode reward: [(0, '-75.308')] [2022-07-09 00:36:10,179][26022] Updated weights on worker 0-0, policy_version 10781 (0.00354) [2022-07-09 00:36:11,903][26022] Updated weights on worker 0-0, policy_version 10791 (0.00085) [2022-07-09 00:36:12,154][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:36:12,172][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000010793_11052032.pth [2022-07-09 00:36:12,173][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000008896_9109504.pth [2022-07-09 00:36:13,727][26022] Updated weights on worker 0-0, policy_version 10801 (0.00094) [2022-07-09 00:36:13,824][25689] Fps is (10 sec: 5597.6, 60 sec: 5428.7, 300 sec: 5408.7). Total num frames: 11060224. Throughput: 0: 5592.3. Samples: 11064546. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 00:36:13,825][25689] Avg episode reward: [(0, '-75.129')] [2022-07-09 00:36:15,653][26022] Updated weights on worker 0-0, policy_version 10811 (0.00087) [2022-07-09 00:36:17,641][26022] Updated weights on worker 0-0, policy_version 10821 (0.00092) [2022-07-09 00:36:18,910][25689] Fps is (10 sec: 5354.0, 60 sec: 5444.9, 300 sec: 5411.0). Total num frames: 11087872. Throughput: 0: 5595.3. Samples: 11097656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:36:18,911][25689] Avg episode reward: [(0, '-74.120')] [2022-07-09 00:36:19,510][26022] Updated weights on worker 0-0, policy_version 10831 (0.00087) [2022-07-09 00:36:21,115][26022] Updated weights on worker 0-0, policy_version 10841 (0.00085) [2022-07-09 00:36:23,103][26022] Updated weights on worker 0-0, policy_version 10851 (0.00093) [2022-07-09 00:36:23,924][25689] Fps is (10 sec: 5474.8, 60 sec: 5414.5, 300 sec: 5410.8). Total num frames: 11115520. Throughput: 0: 5603.4. Samples: 11114010. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:36:23,925][25689] Avg episode reward: [(0, '-73.624')] [2022-07-09 00:36:25,067][26022] Updated weights on worker 0-0, policy_version 10861 (0.00083) [2022-07-09 00:36:26,936][26022] Updated weights on worker 0-0, policy_version 10871 (0.00084) [2022-07-09 00:36:28,786][26022] Updated weights on worker 0-0, policy_version 10881 (0.00096) [2022-07-09 00:36:29,018][25689] Fps is (10 sec: 5369.5, 60 sec: 5424.2, 300 sec: 5409.5). Total num frames: 11142144. Throughput: 0: 5695.6. Samples: 11147202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:36:29,018][25689] Avg episode reward: [(0, '-74.248')] [2022-07-09 00:36:30,660][26022] Updated weights on worker 0-0, policy_version 10891 (0.00090) [2022-07-09 00:36:32,318][26022] Updated weights on worker 0-0, policy_version 10901 (0.00087) [2022-07-09 00:36:34,041][25689] Fps is (10 sec: 5465.8, 60 sec: 5440.5, 300 sec: 5413.7). Total num frames: 11170816. Throughput: 0: 5712.8. Samples: 11180040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 00:36:34,042][25689] Avg episode reward: [(0, '-73.857')] [2022-07-09 00:36:34,302][26022] Updated weights on worker 0-0, policy_version 10911 (0.00087) [2022-07-09 00:36:36,003][26022] Updated weights on worker 0-0, policy_version 10921 (0.00079) [2022-07-09 00:36:38,039][26022] Updated weights on worker 0-0, policy_version 10931 (0.00089) [2022-07-09 00:36:39,093][25689] Fps is (10 sec: 5590.0, 60 sec: 5444.1, 300 sec: 5416.2). Total num frames: 11198464. Throughput: 0: 4895.9. Samples: 11196468. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 00:36:39,093][25689] Avg episode reward: [(0, '-74.539')] [2022-07-09 00:36:39,877][26022] Updated weights on worker 0-0, policy_version 10941 (0.00088) [2022-07-09 00:36:41,631][26022] Updated weights on worker 0-0, policy_version 10951 (0.00097) [2022-07-09 00:36:43,636][26022] Updated weights on worker 0-0, policy_version 10961 (0.00089) [2022-07-09 00:36:44,126][25689] Fps is (10 sec: 5584.5, 60 sec: 5461.6, 300 sec: 5436.4). Total num frames: 11227136. Throughput: 0: 5729.5. Samples: 11229756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 00:36:44,127][25689] Avg episode reward: [(0, '-75.006')] [2022-07-09 00:36:45,663][26022] Updated weights on worker 0-0, policy_version 10971 (0.00093) [2022-07-09 00:36:47,311][26022] Updated weights on worker 0-0, policy_version 10981 (0.00090) [2022-07-09 00:36:49,137][25689] Fps is (10 sec: 5403.3, 60 sec: 5430.1, 300 sec: 5433.3). Total num frames: 11252736. Throughput: 0: 5751.3. Samples: 11262914. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 00:36:49,137][25689] Avg episode reward: [(0, '-75.961')] [2022-07-09 00:36:49,369][26022] Updated weights on worker 0-0, policy_version 10991 (0.00078) [2022-07-09 00:36:51,013][26022] Updated weights on worker 0-0, policy_version 11001 (0.00088) [2022-07-09 00:36:52,960][26022] Updated weights on worker 0-0, policy_version 11011 (0.00084) [2022-07-09 00:36:54,143][25689] Fps is (10 sec: 5418.0, 60 sec: 5448.2, 300 sec: 5437.4). Total num frames: 11281408. Throughput: 0: 4944.6. Samples: 11279436. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:36:54,144][25689] Avg episode reward: [(0, '-75.881')] [2022-07-09 00:36:54,610][26022] Updated weights on worker 0-0, policy_version 11021 (0.00088) [2022-07-09 00:36:56,797][26022] Updated weights on worker 0-0, policy_version 11031 (0.00089) [2022-07-09 00:36:58,613][26022] Updated weights on worker 0-0, policy_version 11041 (0.00084) [2022-07-09 00:36:59,204][25689] Fps is (10 sec: 5594.6, 60 sec: 5453.4, 300 sec: 5440.1). Total num frames: 11309056. Throughput: 0: 5767.4. Samples: 11312456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:36:59,204][25689] Avg episode reward: [(0, '-76.317')] [2022-07-09 00:37:00,386][26022] Updated weights on worker 0-0, policy_version 11051 (0.00088) [2022-07-09 00:37:02,622][26022] Updated weights on worker 0-0, policy_version 11061 (0.00091) [2022-07-09 00:37:04,237][25689] Fps is (10 sec: 5275.1, 60 sec: 5469.3, 300 sec: 5440.0). Total num frames: 11334656. Throughput: 0: 5653.6. Samples: 11343456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:37:04,240][25689] Avg episode reward: [(0, '-76.898')] [2022-07-09 00:37:04,523][26022] Updated weights on worker 0-0, policy_version 11071 (0.00093) [2022-07-09 00:37:06,238][26022] Updated weights on worker 0-0, policy_version 11081 (0.00096) [2022-07-09 00:37:08,388][26022] Updated weights on worker 0-0, policy_version 11091 (0.00096) [2022-07-09 00:37:09,253][25689] Fps is (10 sec: 5196.6, 60 sec: 5418.0, 300 sec: 5436.8). Total num frames: 11361280. Throughput: 0: 4821.5. Samples: 11359906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:37:09,255][25689] Avg episode reward: [(0, '-77.066')] [2022-07-09 00:37:09,896][26022] Updated weights on worker 0-0, policy_version 11101 (0.00089) [2022-07-09 00:37:12,161][26022] Updated weights on worker 0-0, policy_version 11111 (0.00097) [2022-07-09 00:37:13,695][26022] Updated weights on worker 0-0, policy_version 11121 (0.00084) [2022-07-09 00:37:14,354][25689] Fps is (10 sec: 5465.7, 60 sec: 5447.4, 300 sec: 5440.3). Total num frames: 11389952. Throughput: 0: 5611.5. Samples: 11392850. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:37:14,356][25689] Avg episode reward: [(0, '-77.510')] [2022-07-09 00:37:15,748][26022] Updated weights on worker 0-0, policy_version 11131 (0.00105) [2022-07-09 00:37:17,511][26022] Updated weights on worker 0-0, policy_version 11141 (0.00094) [2022-07-09 00:37:19,408][25689] Fps is (10 sec: 5546.2, 60 sec: 5450.2, 300 sec: 5443.7). Total num frames: 11417600. Throughput: 0: 5588.9. Samples: 11425376. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:37:19,409][25689] Avg episode reward: [(0, '-76.878')] [2022-07-09 00:37:19,577][26022] Updated weights on worker 0-0, policy_version 11151 (0.00086) [2022-07-09 00:37:21,379][26022] Updated weights on worker 0-0, policy_version 11161 (0.00084) [2022-07-09 00:37:23,171][26022] Updated weights on worker 0-0, policy_version 11171 (0.00088) [2022-07-09 00:37:24,492][25689] Fps is (10 sec: 5353.7, 60 sec: 5427.1, 300 sec: 5435.4). Total num frames: 11444224. Throughput: 0: 4847.0. Samples: 11441628. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:37:24,492][25689] Avg episode reward: [(0, '-76.054')] [2022-07-09 00:37:25,187][26022] Updated weights on worker 0-0, policy_version 11181 (0.00087) [2022-07-09 00:37:26,963][26022] Updated weights on worker 0-0, policy_version 11191 (0.00091) [2022-07-09 00:37:29,013][26022] Updated weights on worker 0-0, policy_version 11201 (0.00090) [2022-07-09 00:37:29,497][25689] Fps is (10 sec: 5481.2, 60 sec: 5468.9, 300 sec: 5439.9). Total num frames: 11472896. Throughput: 0: 5671.3. Samples: 11474712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:29,497][25689] Avg episode reward: [(0, '-75.146')] [2022-07-09 00:37:30,696][26022] Updated weights on worker 0-0, policy_version 11211 (0.00089) [2022-07-09 00:37:32,766][26022] Updated weights on worker 0-0, policy_version 11221 (0.00080) [2022-07-09 00:37:34,347][26022] Updated weights on worker 0-0, policy_version 11231 (0.00084) [2022-07-09 00:37:34,589][25689] Fps is (10 sec: 5679.0, 60 sec: 5462.6, 300 sec: 5447.1). Total num frames: 11501568. Throughput: 0: 5666.2. Samples: 11507504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:34,590][25689] Avg episode reward: [(0, '-74.515')] [2022-07-09 00:37:36,491][26022] Updated weights on worker 0-0, policy_version 11241 (0.00095) [2022-07-09 00:37:38,194][26022] Updated weights on worker 0-0, policy_version 11251 (0.00111) [2022-07-09 00:37:39,671][25689] Fps is (10 sec: 5434.9, 60 sec: 5443.0, 300 sec: 5432.5). Total num frames: 11528192. Throughput: 0: 4873.4. Samples: 11524124. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:39,671][25689] Avg episode reward: [(0, '-74.715')] [2022-07-09 00:37:40,213][26022] Updated weights on worker 0-0, policy_version 11261 (0.00095) [2022-07-09 00:37:42,039][26022] Updated weights on worker 0-0, policy_version 11271 (0.00359) [2022-07-09 00:37:44,011][26022] Updated weights on worker 0-0, policy_version 11281 (0.00092) [2022-07-09 00:37:44,673][25689] Fps is (10 sec: 5382.3, 60 sec: 5429.0, 300 sec: 5439.4). Total num frames: 11555840. Throughput: 0: 5700.0. Samples: 11556660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:44,673][25689] Avg episode reward: [(0, '-74.802')] [2022-07-09 00:37:45,730][26022] Updated weights on worker 0-0, policy_version 11291 (0.00098) [2022-07-09 00:37:47,641][26022] Updated weights on worker 0-0, policy_version 11301 (0.00088) [2022-07-09 00:37:49,553][26022] Updated weights on worker 0-0, policy_version 11311 (0.00086) [2022-07-09 00:37:49,717][25689] Fps is (10 sec: 5503.9, 60 sec: 5459.7, 300 sec: 5442.2). Total num frames: 11583488. Throughput: 0: 5675.6. Samples: 11589478. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:49,718][25689] Avg episode reward: [(0, '-74.867')] [2022-07-09 00:37:51,427][26022] Updated weights on worker 0-0, policy_version 11321 (0.00091) [2022-07-09 00:37:53,367][26022] Updated weights on worker 0-0, policy_version 11331 (0.00090) [2022-07-09 00:37:54,787][25689] Fps is (10 sec: 5467.2, 60 sec: 5437.2, 300 sec: 5436.2). Total num frames: 11611136. Throughput: 0: 4879.3. Samples: 11606054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:54,787][25689] Avg episode reward: [(0, '-74.926')] [2022-07-09 00:37:55,239][26022] Updated weights on worker 0-0, policy_version 11341 (0.00093) [2022-07-09 00:37:57,195][26022] Updated weights on worker 0-0, policy_version 11351 (0.00086) [2022-07-09 00:37:58,842][26022] Updated weights on worker 0-0, policy_version 11361 (0.00078) [2022-07-09 00:37:59,829][25689] Fps is (10 sec: 5367.6, 60 sec: 5421.9, 300 sec: 5440.3). Total num frames: 11637760. Throughput: 0: 5682.0. Samples: 11638660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:37:59,829][25689] Avg episode reward: [(0, '-74.381')] [2022-07-09 00:38:00,932][26022] Updated weights on worker 0-0, policy_version 11371 (0.00093) [2022-07-09 00:38:02,943][26022] Updated weights on worker 0-0, policy_version 11381 (0.00079) [2022-07-09 00:38:04,846][25689] Fps is (10 sec: 5089.8, 60 sec: 5406.5, 300 sec: 5430.2). Total num frames: 11662336. Throughput: 0: 5596.6. Samples: 11669562. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:38:04,847][25689] Avg episode reward: [(0, '-74.275')] [2022-07-09 00:38:04,987][26022] Updated weights on worker 0-0, policy_version 11391 (0.00090) [2022-07-09 00:38:06,762][26022] Updated weights on worker 0-0, policy_version 11401 (0.00088) [2022-07-09 00:38:08,779][26022] Updated weights on worker 0-0, policy_version 11411 (0.00087) [2022-07-09 00:38:09,861][25689] Fps is (10 sec: 5307.5, 60 sec: 5440.4, 300 sec: 5433.7). Total num frames: 11691008. Throughput: 0: 4791.6. Samples: 11685998. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:38:09,863][25689] Avg episode reward: [(0, '-74.923')] [2022-07-09 00:38:10,590][26022] Updated weights on worker 0-0, policy_version 11421 (0.00084) [2022-07-09 00:38:12,200][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:38:12,210][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000011429_11703296.pth [2022-07-09 00:38:12,210][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000009517_9745408.pth [2022-07-09 00:38:12,389][26022] Updated weights on worker 0-0, policy_version 11431 (0.00054) [2022-07-09 00:38:14,230][26022] Updated weights on worker 0-0, policy_version 11441 (0.00081) [2022-07-09 00:38:14,889][25689] Fps is (10 sec: 5506.1, 60 sec: 5413.1, 300 sec: 5428.3). Total num frames: 11717632. Throughput: 0: 5617.5. Samples: 11718978. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:38:14,891][25689] Avg episode reward: [(0, '-74.022')] [2022-07-09 00:38:16,015][26022] Updated weights on worker 0-0, policy_version 11451 (0.00089) [2022-07-09 00:38:18,188][26022] Updated weights on worker 0-0, policy_version 11461 (0.00093) [2022-07-09 00:38:19,738][26022] Updated weights on worker 0-0, policy_version 11471 (0.00088) [2022-07-09 00:38:19,937][25689] Fps is (10 sec: 5691.5, 60 sec: 5464.4, 300 sec: 5445.0). Total num frames: 11748352. Throughput: 0: 5641.8. Samples: 11752106. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:38:19,937][25689] Avg episode reward: [(0, '-73.952')] [2022-07-09 00:38:21,877][26022] Updated weights on worker 0-0, policy_version 11481 (0.00090) [2022-07-09 00:38:23,509][26022] Updated weights on worker 0-0, policy_version 11491 (0.00091) [2022-07-09 00:38:24,949][25689] Fps is (10 sec: 5394.6, 60 sec: 5420.0, 300 sec: 5424.4). Total num frames: 11771904. Throughput: 0: 4925.8. Samples: 11768586. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:38:24,950][25689] Avg episode reward: [(0, '-73.285')] [2022-07-09 00:38:25,550][26022] Updated weights on worker 0-0, policy_version 11501 (0.00085) [2022-07-09 00:38:27,367][26022] Updated weights on worker 0-0, policy_version 11511 (0.00090) [2022-07-09 00:38:29,118][26022] Updated weights on worker 0-0, policy_version 11521 (0.00095) [2022-07-09 00:38:29,967][25689] Fps is (10 sec: 5206.7, 60 sec: 5418.9, 300 sec: 5434.5). Total num frames: 11800576. Throughput: 0: 5707.8. Samples: 11800758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:38:29,967][25689] Avg episode reward: [(0, '-73.422')] [2022-07-09 00:38:31,187][26022] Updated weights on worker 0-0, policy_version 11531 (0.00095) [2022-07-09 00:38:33,028][26022] Updated weights on worker 0-0, policy_version 11541 (0.00085) [2022-07-09 00:38:34,816][26022] Updated weights on worker 0-0, policy_version 11551 (0.00085) [2022-07-09 00:38:34,995][25689] Fps is (10 sec: 5606.5, 60 sec: 5407.7, 300 sec: 5432.4). Total num frames: 11828224. Throughput: 0: 5713.5. Samples: 11833854. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:38:34,995][25689] Avg episode reward: [(0, '-74.008')] [2022-07-09 00:38:36,772][26022] Updated weights on worker 0-0, policy_version 11561 (0.00088) [2022-07-09 00:38:38,638][26022] Updated weights on worker 0-0, policy_version 11571 (0.00092) [2022-07-09 00:38:40,037][25689] Fps is (10 sec: 5490.9, 60 sec: 5428.2, 300 sec: 5432.8). Total num frames: 11855872. Throughput: 0: 4882.2. Samples: 11850242. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:38:40,038][25689] Avg episode reward: [(0, '-73.635')] [2022-07-09 00:38:40,622][26022] Updated weights on worker 0-0, policy_version 11581 (0.00087) [2022-07-09 00:38:42,104][26022] Updated weights on worker 0-0, policy_version 11591 (0.00092) [2022-07-09 00:38:44,227][26022] Updated weights on worker 0-0, policy_version 11601 (0.00078) [2022-07-09 00:38:45,056][25689] Fps is (10 sec: 5597.6, 60 sec: 5443.6, 300 sec: 5437.6). Total num frames: 11884544. Throughput: 0: 5727.1. Samples: 11883740. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:38:45,057][25689] Avg episode reward: [(0, '-74.029')] [2022-07-09 00:38:45,981][26022] Updated weights on worker 0-0, policy_version 11611 (0.00088) [2022-07-09 00:38:47,838][26022] Updated weights on worker 0-0, policy_version 11621 (0.00084) [2022-07-09 00:38:49,738][26022] Updated weights on worker 0-0, policy_version 11631 (0.00086) [2022-07-09 00:38:50,107][25689] Fps is (10 sec: 5593.1, 60 sec: 5443.1, 300 sec: 5440.4). Total num frames: 11912192. Throughput: 0: 5770.9. Samples: 11916984. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:38:50,108][25689] Avg episode reward: [(0, '-74.430')] [2022-07-09 00:38:51,492][26022] Updated weights on worker 0-0, policy_version 11641 (0.00085) [2022-07-09 00:38:53,671][26022] Updated weights on worker 0-0, policy_version 11651 (0.00086) [2022-07-09 00:38:55,143][25689] Fps is (10 sec: 5380.7, 60 sec: 5429.1, 300 sec: 5437.4). Total num frames: 11938816. Throughput: 0: 4918.0. Samples: 11932940. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 00:38:55,143][25689] Avg episode reward: [(0, '-74.254')] [2022-07-09 00:38:55,524][26022] Updated weights on worker 0-0, policy_version 11661 (0.00083) [2022-07-09 00:38:57,347][26022] Updated weights on worker 0-0, policy_version 11671 (0.00089) [2022-07-09 00:38:59,240][26022] Updated weights on worker 0-0, policy_version 11681 (0.00092) [2022-07-09 00:39:00,190][25689] Fps is (10 sec: 5281.2, 60 sec: 5428.7, 300 sec: 5443.7). Total num frames: 11965440. Throughput: 0: 5730.1. Samples: 11965716. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:39:00,195][25689] Avg episode reward: [(0, '-74.730')] [2022-07-09 00:39:01,068][26022] Updated weights on worker 0-0, policy_version 11691 (0.00089) [2022-07-09 00:39:03,306][26022] Updated weights on worker 0-0, policy_version 11701 (0.00097) [2022-07-09 00:39:05,215][25689] Fps is (10 sec: 5184.9, 60 sec: 5444.9, 300 sec: 5434.0). Total num frames: 11991040. Throughput: 0: 5603.6. Samples: 11996702. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:39:05,216][25689] Avg episode reward: [(0, '-74.638')] [2022-07-09 00:39:05,240][26022] Updated weights on worker 0-0, policy_version 11711 (0.00092) [2022-07-09 00:39:07,015][26022] Updated weights on worker 0-0, policy_version 11721 (0.00086) [2022-07-09 00:39:08,991][26022] Updated weights on worker 0-0, policy_version 11731 (0.00091) [2022-07-09 00:39:10,235][25689] Fps is (10 sec: 5402.4, 60 sec: 5444.5, 300 sec: 5437.6). Total num frames: 12019712. Throughput: 0: 4781.4. Samples: 12013226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:39:10,237][25689] Avg episode reward: [(0, '-73.694')] [2022-07-09 00:39:10,762][26022] Updated weights on worker 0-0, policy_version 11741 (0.00096) [2022-07-09 00:39:12,688][26022] Updated weights on worker 0-0, policy_version 11751 (0.00098) [2022-07-09 00:39:14,389][26022] Updated weights on worker 0-0, policy_version 11761 (0.00086) [2022-07-09 00:39:15,245][25689] Fps is (10 sec: 5717.0, 60 sec: 5480.0, 300 sec: 5445.3). Total num frames: 12048384. Throughput: 0: 5639.4. Samples: 12046308. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 00:39:15,247][25689] Avg episode reward: [(0, '-73.508')] [2022-07-09 00:39:16,291][26022] Updated weights on worker 0-0, policy_version 11771 (0.00088) [2022-07-09 00:39:18,102][26022] Updated weights on worker 0-0, policy_version 11781 (0.00085) [2022-07-09 00:39:20,104][26022] Updated weights on worker 0-0, policy_version 11791 (0.00083) [2022-07-09 00:39:20,298][25689] Fps is (10 sec: 5393.6, 60 sec: 5394.7, 300 sec: 5434.1). Total num frames: 12073984. Throughput: 0: 5658.1. Samples: 12079490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:39:20,298][25689] Avg episode reward: [(0, '-73.621')] [2022-07-09 00:39:21,904][26022] Updated weights on worker 0-0, policy_version 11801 (0.00093) [2022-07-09 00:39:23,853][26022] Updated weights on worker 0-0, policy_version 11811 (0.00467) [2022-07-09 00:39:25,314][25689] Fps is (10 sec: 5491.8, 60 sec: 5496.2, 300 sec: 5444.3). Total num frames: 12103680. Throughput: 0: 4947.2. Samples: 12096138. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:39:25,315][25689] Avg episode reward: [(0, '-73.780')] [2022-07-09 00:39:25,431][26022] Updated weights on worker 0-0, policy_version 11821 (0.00091) [2022-07-09 00:39:27,621][26022] Updated weights on worker 0-0, policy_version 11831 (0.00090) [2022-07-09 00:39:29,256][26022] Updated weights on worker 0-0, policy_version 11841 (0.00084) [2022-07-09 00:39:30,331][25689] Fps is (10 sec: 5613.5, 60 sec: 5462.3, 300 sec: 5441.1). Total num frames: 12130304. Throughput: 0: 5773.1. Samples: 12129238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:39:30,331][25689] Avg episode reward: [(0, '-73.374')] [2022-07-09 00:39:31,243][26022] Updated weights on worker 0-0, policy_version 11851 (0.00085) [2022-07-09 00:39:33,055][26022] Updated weights on worker 0-0, policy_version 11861 (0.00092) [2022-07-09 00:39:34,761][26022] Updated weights on worker 0-0, policy_version 11871 (0.00087) [2022-07-09 00:39:35,374][25689] Fps is (10 sec: 5395.2, 60 sec: 5461.0, 300 sec: 5435.2). Total num frames: 12157952. Throughput: 0: 5765.6. Samples: 12162360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:39:35,374][25689] Avg episode reward: [(0, '-73.460')] [2022-07-09 00:39:36,567][26022] Updated weights on worker 0-0, policy_version 11881 (0.00080) [2022-07-09 00:39:38,787][26022] Updated weights on worker 0-0, policy_version 11891 (0.00086) [2022-07-09 00:39:40,257][26022] Updated weights on worker 0-0, policy_version 11901 (0.00093) [2022-07-09 00:39:40,452][25689] Fps is (10 sec: 5665.6, 60 sec: 5491.6, 300 sec: 5444.4). Total num frames: 12187648. Throughput: 0: 5751.0. Samples: 12195398. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:39:40,453][25689] Avg episode reward: [(0, '-73.011')] [2022-07-09 00:39:42,466][26022] Updated weights on worker 0-0, policy_version 11911 (0.00095) [2022-07-09 00:39:43,728][26022] Updated weights on worker 0-0, policy_version 11921 (0.00092) [2022-07-09 00:39:45,461][25689] Fps is (10 sec: 5481.7, 60 sec: 5441.7, 300 sec: 5440.9). Total num frames: 12213248. Throughput: 0: 5758.3. Samples: 12212148. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:39:45,461][25689] Avg episode reward: [(0, '-73.137')] [2022-07-09 00:39:46,041][26022] Updated weights on worker 0-0, policy_version 11931 (0.00081) [2022-07-09 00:39:47,899][26022] Updated weights on worker 0-0, policy_version 11941 (0.00089) [2022-07-09 00:39:49,602][26022] Updated weights on worker 0-0, policy_version 11951 (0.00090) [2022-07-09 00:39:50,475][25689] Fps is (10 sec: 5415.0, 60 sec: 5462.0, 300 sec: 5444.4). Total num frames: 12241920. Throughput: 0: 5753.6. Samples: 12245138. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:39:50,475][25689] Avg episode reward: [(0, '-72.848')] [2022-07-09 00:39:51,698][26022] Updated weights on worker 0-0, policy_version 11961 (0.00083) [2022-07-09 00:39:53,338][26022] Updated weights on worker 0-0, policy_version 11971 (0.00092) [2022-07-09 00:39:55,438][26022] Updated weights on worker 0-0, policy_version 11981 (0.00087) [2022-07-09 00:39:55,480][25689] Fps is (10 sec: 5519.0, 60 sec: 5464.7, 300 sec: 5442.4). Total num frames: 12268544. Throughput: 0: 5762.4. Samples: 12278222. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:39:55,482][25689] Avg episode reward: [(0, '-72.533')] [2022-07-09 00:39:57,127][26022] Updated weights on worker 0-0, policy_version 11991 (0.00100) [2022-07-09 00:39:58,992][26022] Updated weights on worker 0-0, policy_version 12001 (0.00087) [2022-07-09 00:40:00,548][25689] Fps is (10 sec: 5387.6, 60 sec: 5479.8, 300 sec: 5448.4). Total num frames: 12296192. Throughput: 0: 4944.5. Samples: 12294762. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:40:00,549][25689] Avg episode reward: [(0, '-72.567')] [2022-07-09 00:40:00,938][26022] Updated weights on worker 0-0, policy_version 12011 (0.00084) [2022-07-09 00:40:03,208][26022] Updated weights on worker 0-0, policy_version 12021 (0.00088) [2022-07-09 00:40:04,943][26022] Updated weights on worker 0-0, policy_version 12031 (0.00424) [2022-07-09 00:40:05,606][25689] Fps is (10 sec: 5359.9, 60 sec: 5493.8, 300 sec: 5444.7). Total num frames: 12322816. Throughput: 0: 5640.9. Samples: 12325782. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:40:05,606][25689] Avg episode reward: [(0, '-72.502')] [2022-07-09 00:40:06,781][26022] Updated weights on worker 0-0, policy_version 12041 (0.00084) [2022-07-09 00:40:08,768][26022] Updated weights on worker 0-0, policy_version 12051 (0.00087) [2022-07-09 00:40:10,598][26022] Updated weights on worker 0-0, policy_version 12061 (0.00084) [2022-07-09 00:40:10,633][25689] Fps is (10 sec: 5381.3, 60 sec: 5476.2, 300 sec: 5444.4). Total num frames: 12350464. Throughput: 0: 5650.1. Samples: 12359036. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:40:10,634][25689] Avg episode reward: [(0, '-72.327')] [2022-07-09 00:40:12,476][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:40:12,490][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000012071_12360704.pth [2022-07-09 00:40:12,491][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000010155_10398720.pth [2022-07-09 00:40:12,497][26022] Updated weights on worker 0-0, policy_version 12071 (0.00095) [2022-07-09 00:40:14,193][26022] Updated weights on worker 0-0, policy_version 12081 (0.00092) [2022-07-09 00:40:15,635][25689] Fps is (10 sec: 5411.2, 60 sec: 5443.0, 300 sec: 5445.8). Total num frames: 12377088. Throughput: 0: 4841.7. Samples: 12375808. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:40:15,636][25689] Avg episode reward: [(0, '-71.743')] [2022-07-09 00:40:16,231][26022] Updated weights on worker 0-0, policy_version 12091 (0.00082) [2022-07-09 00:40:17,951][26022] Updated weights on worker 0-0, policy_version 12101 (0.00099) [2022-07-09 00:40:19,727][26022] Updated weights on worker 0-0, policy_version 12111 (0.00093) [2022-07-09 00:40:20,732][25689] Fps is (10 sec: 5475.4, 60 sec: 5489.8, 300 sec: 5441.5). Total num frames: 12405760. Throughput: 0: 5665.3. Samples: 12409112. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:40:20,734][25689] Avg episode reward: [(0, '-71.356')] [2022-07-09 00:40:21,768][26022] Updated weights on worker 0-0, policy_version 12121 (0.00085) [2022-07-09 00:40:23,507][26022] Updated weights on worker 0-0, policy_version 12131 (0.00086) [2022-07-09 00:40:25,251][26022] Updated weights on worker 0-0, policy_version 12141 (0.00086) [2022-07-09 00:40:25,756][25689] Fps is (10 sec: 5564.8, 60 sec: 5455.3, 300 sec: 5448.2). Total num frames: 12433408. Throughput: 0: 5779.9. Samples: 12442248. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:40:25,758][25689] Avg episode reward: [(0, '-70.176')] [2022-07-09 00:40:27,315][26022] Updated weights on worker 0-0, policy_version 12151 (0.00090) [2022-07-09 00:40:29,196][26022] Updated weights on worker 0-0, policy_version 12161 (0.00093) [2022-07-09 00:40:30,765][25689] Fps is (10 sec: 5613.9, 60 sec: 5489.9, 300 sec: 5451.8). Total num frames: 12462080. Throughput: 0: 4947.6. Samples: 12458638. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 00:40:30,765][25689] Avg episode reward: [(0, '-70.425')] [2022-07-09 00:40:30,983][26022] Updated weights on worker 0-0, policy_version 12171 (0.00095) [2022-07-09 00:40:33,035][26022] Updated weights on worker 0-0, policy_version 12181 (0.00087) [2022-07-09 00:40:34,625][26022] Updated weights on worker 0-0, policy_version 12191 (0.00089) [2022-07-09 00:40:35,812][25689] Fps is (10 sec: 5600.7, 60 sec: 5489.5, 300 sec: 5452.6). Total num frames: 12489728. Throughput: 0: 5742.4. Samples: 12491670. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:40:35,814][25689] Avg episode reward: [(0, '-70.272')] [2022-07-09 00:40:36,636][26022] Updated weights on worker 0-0, policy_version 12201 (0.00084) [2022-07-09 00:40:38,236][26022] Updated weights on worker 0-0, policy_version 12211 (0.00097) [2022-07-09 00:40:40,349][26022] Updated weights on worker 0-0, policy_version 12221 (0.00089) [2022-07-09 00:40:40,894][25689] Fps is (10 sec: 5458.8, 60 sec: 5455.3, 300 sec: 5451.9). Total num frames: 12517376. Throughput: 0: 5750.9. Samples: 12525062. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:40:40,895][25689] Avg episode reward: [(0, '-70.395')] [2022-07-09 00:40:41,889][26022] Updated weights on worker 0-0, policy_version 12231 (0.00092) [2022-07-09 00:40:44,099][26022] Updated weights on worker 0-0, policy_version 12242 (0.00089) [2022-07-09 00:40:45,949][25689] Fps is (10 sec: 5556.0, 60 sec: 5501.9, 300 sec: 5455.0). Total num frames: 12546048. Throughput: 0: 4937.5. Samples: 12541950. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:40:45,949][25689] Avg episode reward: [(0, '-70.522')] [2022-07-09 00:40:45,952][26022] Updated weights on worker 0-0, policy_version 12252 (0.00086) [2022-07-09 00:40:47,777][26022] Updated weights on worker 0-0, policy_version 12262 (0.00188) [2022-07-09 00:40:49,724][26022] Updated weights on worker 0-0, policy_version 12272 (0.00083) [2022-07-09 00:40:51,015][25689] Fps is (10 sec: 5666.1, 60 sec: 5497.2, 300 sec: 5457.6). Total num frames: 12574720. Throughput: 0: 5756.2. Samples: 12575202. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:40:51,015][25689] Avg episode reward: [(0, '-70.036')] [2022-07-09 00:40:51,619][26022] Updated weights on worker 0-0, policy_version 12282 (0.00086) [2022-07-09 00:40:53,250][26022] Updated weights on worker 0-0, policy_version 12292 (0.00087) [2022-07-09 00:40:55,225][26022] Updated weights on worker 0-0, policy_version 12302 (0.00089) [2022-07-09 00:40:56,037][25689] Fps is (10 sec: 5481.3, 60 sec: 5495.7, 300 sec: 5455.9). Total num frames: 12601344. Throughput: 0: 5782.0. Samples: 12608610. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:40:56,037][25689] Avg episode reward: [(0, '-70.345')] [2022-07-09 00:40:56,939][26022] Updated weights on worker 0-0, policy_version 12312 (0.00083) [2022-07-09 00:40:58,994][26022] Updated weights on worker 0-0, policy_version 12322 (0.00088) [2022-07-09 00:41:00,831][26022] Updated weights on worker 0-0, policy_version 12332 (0.01263) [2022-07-09 00:41:01,112][25689] Fps is (10 sec: 5476.2, 60 sec: 5511.9, 300 sec: 5468.7). Total num frames: 12630016. Throughput: 0: 4948.0. Samples: 12625102. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:41:01,113][25689] Avg episode reward: [(0, '-71.169')] [2022-07-09 00:41:03,028][26022] Updated weights on worker 0-0, policy_version 12342 (0.00092) [2022-07-09 00:41:04,874][26022] Updated weights on worker 0-0, policy_version 12352 (0.00085) [2022-07-09 00:41:06,178][25689] Fps is (10 sec: 5351.7, 60 sec: 5494.3, 300 sec: 5453.9). Total num frames: 12655616. Throughput: 0: 5659.1. Samples: 12656430. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:41:06,178][25689] Avg episode reward: [(0, '-70.901')] [2022-07-09 00:41:06,781][26022] Updated weights on worker 0-0, policy_version 12362 (0.00086) [2022-07-09 00:41:08,534][26022] Updated weights on worker 0-0, policy_version 12372 (0.00087) [2022-07-09 00:41:10,798][26022] Updated weights on worker 0-0, policy_version 12382 (0.00090) [2022-07-09 00:41:11,220][25689] Fps is (10 sec: 5065.8, 60 sec: 5459.2, 300 sec: 5450.6). Total num frames: 12681216. Throughput: 0: 5642.5. Samples: 12689206. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 00:41:11,220][25689] Avg episode reward: [(0, '-70.524')] [2022-07-09 00:41:12,260][26022] Updated weights on worker 0-0, policy_version 12392 (0.00087) [2022-07-09 00:41:14,457][26022] Updated weights on worker 0-0, policy_version 12402 (0.00086) [2022-07-09 00:41:15,966][26022] Updated weights on worker 0-0, policy_version 12412 (0.00078) [2022-07-09 00:41:16,277][25689] Fps is (10 sec: 5577.0, 60 sec: 5521.8, 300 sec: 5461.5). Total num frames: 12711936. Throughput: 0: 4801.9. Samples: 12705800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:41:16,277][25689] Avg episode reward: [(0, '-71.112')] [2022-07-09 00:41:18,181][26022] Updated weights on worker 0-0, policy_version 12422 (0.00090) [2022-07-09 00:41:19,562][26022] Updated weights on worker 0-0, policy_version 12432 (0.00095) [2022-07-09 00:41:21,417][25689] Fps is (10 sec: 5522.9, 60 sec: 5467.2, 300 sec: 5452.3). Total num frames: 12737536. Throughput: 0: 5587.1. Samples: 12738546. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:41:21,418][25689] Avg episode reward: [(0, '-71.688')] [2022-07-09 00:41:21,824][26022] Updated weights on worker 0-0, policy_version 12442 (0.00088) [2022-07-09 00:41:23,686][26022] Updated weights on worker 0-0, policy_version 12452 (0.00091) [2022-07-09 00:41:25,481][26022] Updated weights on worker 0-0, policy_version 12462 (0.00078) [2022-07-09 00:41:26,424][25689] Fps is (10 sec: 5348.6, 60 sec: 5485.6, 300 sec: 5460.8). Total num frames: 12766208. Throughput: 0: 5718.1. Samples: 12772198. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:41:26,425][25689] Avg episode reward: [(0, '-71.810')] [2022-07-09 00:41:27,308][26022] Updated weights on worker 0-0, policy_version 12472 (0.00088) [2022-07-09 00:41:29,174][26022] Updated weights on worker 0-0, policy_version 12482 (0.00093) [2022-07-09 00:41:30,939][26022] Updated weights on worker 0-0, policy_version 12492 (0.00088) [2022-07-09 00:41:31,440][25689] Fps is (10 sec: 5619.1, 60 sec: 5468.0, 300 sec: 5457.5). Total num frames: 12793856. Throughput: 0: 4916.4. Samples: 12788618. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:41:31,442][25689] Avg episode reward: [(0, '-70.459')] [2022-07-09 00:41:32,978][26022] Updated weights on worker 0-0, policy_version 12502 (0.00090) [2022-07-09 00:41:34,553][26022] Updated weights on worker 0-0, policy_version 12512 (0.00097) [2022-07-09 00:41:36,442][25689] Fps is (10 sec: 5417.2, 60 sec: 5455.2, 300 sec: 5454.9). Total num frames: 12820480. Throughput: 0: 5738.2. Samples: 12821514. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:41:36,443][25689] Avg episode reward: [(0, '-71.290')] [2022-07-09 00:41:36,615][26022] Updated weights on worker 0-0, policy_version 12522 (0.00084) [2022-07-09 00:41:38,297][26022] Updated weights on worker 0-0, policy_version 12532 (0.00094) [2022-07-09 00:41:40,365][26022] Updated weights on worker 0-0, policy_version 12542 (0.00089) [2022-07-09 00:41:41,503][25689] Fps is (10 sec: 5495.0, 60 sec: 5474.1, 300 sec: 5454.4). Total num frames: 12849152. Throughput: 0: 5773.7. Samples: 12854516. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:41:41,504][25689] Avg episode reward: [(0, '-71.845')] [2022-07-09 00:41:42,413][26022] Updated weights on worker 0-0, policy_version 12552 (0.00082) [2022-07-09 00:41:43,867][26022] Updated weights on worker 0-0, policy_version 12562 (0.00091) [2022-07-09 00:41:46,071][26022] Updated weights on worker 0-0, policy_version 12572 (0.00091) [2022-07-09 00:41:46,512][25689] Fps is (10 sec: 5593.1, 60 sec: 5461.3, 300 sec: 5461.3). Total num frames: 12876800. Throughput: 0: 4929.3. Samples: 12871218. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:41:46,512][25689] Avg episode reward: [(0, '-71.710')] [2022-07-09 00:41:47,759][26022] Updated weights on worker 0-0, policy_version 12582 (0.00085) [2022-07-09 00:41:49,650][26022] Updated weights on worker 0-0, policy_version 12592 (0.00091) [2022-07-09 00:41:51,362][26022] Updated weights on worker 0-0, policy_version 12602 (0.00089) [2022-07-09 00:41:51,570][25689] Fps is (10 sec: 5594.3, 60 sec: 5462.0, 300 sec: 5460.4). Total num frames: 12905472. Throughput: 0: 5766.3. Samples: 12904696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 00:41:51,571][25689] Avg episode reward: [(0, '-71.229')] [2022-07-09 00:41:53,329][26022] Updated weights on worker 0-0, policy_version 12612 (0.00089) [2022-07-09 00:41:54,964][26022] Updated weights on worker 0-0, policy_version 12622 (0.00090) [2022-07-09 00:41:56,577][25689] Fps is (10 sec: 5595.5, 60 sec: 5480.3, 300 sec: 5461.4). Total num frames: 12933120. Throughput: 0: 5799.8. Samples: 12938292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:41:56,577][25689] Avg episode reward: [(0, '-71.093')] [2022-07-09 00:41:56,867][26022] Updated weights on worker 0-0, policy_version 12632 (0.00088) [2022-07-09 00:41:58,604][26022] Updated weights on worker 0-0, policy_version 12642 (0.00084) [2022-07-09 00:42:00,638][26022] Updated weights on worker 0-0, policy_version 12652 (0.00090) [2022-07-09 00:42:01,655][25689] Fps is (10 sec: 5381.7, 60 sec: 5446.2, 300 sec: 5464.0). Total num frames: 12959744. Throughput: 0: 4984.4. Samples: 12954962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:42:01,656][25689] Avg episode reward: [(0, '-70.818')] [2022-07-09 00:42:02,820][26022] Updated weights on worker 0-0, policy_version 12662 (0.00093) [2022-07-09 00:42:04,675][26022] Updated weights on worker 0-0, policy_version 12672 (0.00089) [2022-07-09 00:42:06,694][25689] Fps is (10 sec: 5162.2, 60 sec: 5448.6, 300 sec: 5460.1). Total num frames: 12985344. Throughput: 0: 5668.7. Samples: 12985624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:42:06,694][25689] Avg episode reward: [(0, '-70.180')] [2022-07-09 00:42:06,743][26022] Updated weights on worker 0-0, policy_version 12682 (0.00089) [2022-07-09 00:42:08,409][26022] Updated weights on worker 0-0, policy_version 12692 (0.00090) [2022-07-09 00:42:10,309][26022] Updated weights on worker 0-0, policy_version 12702 (0.00085) [2022-07-09 00:42:11,731][25689] Fps is (10 sec: 5487.8, 60 sec: 5516.7, 300 sec: 5464.7). Total num frames: 13015040. Throughput: 0: 5655.3. Samples: 13018712. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 00:42:11,732][25689] Avg episode reward: [(0, '-70.144')] [2022-07-09 00:42:12,181][26022] Updated weights on worker 0-0, policy_version 12712 (0.00090) [2022-07-09 00:42:12,834][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:42:12,850][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000012715_13020160.pth [2022-07-09 00:42:12,850][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000010793_11052032.pth [2022-07-09 00:42:13,983][26022] Updated weights on worker 0-0, policy_version 12722 (0.00087) [2022-07-09 00:42:15,933][26022] Updated weights on worker 0-0, policy_version 12732 (0.00098) [2022-07-09 00:42:16,733][25689] Fps is (10 sec: 5609.6, 60 sec: 5453.9, 300 sec: 5462.2). Total num frames: 13041664. Throughput: 0: 4809.1. Samples: 13035226. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:42:16,734][25689] Avg episode reward: [(0, '-70.804')] [2022-07-09 00:42:17,787][26022] Updated weights on worker 0-0, policy_version 12742 (0.00087) [2022-07-09 00:42:19,710][26022] Updated weights on worker 0-0, policy_version 12752 (0.00093) [2022-07-09 00:42:21,600][26022] Updated weights on worker 0-0, policy_version 12762 (0.00082) [2022-07-09 00:42:21,820][25689] Fps is (10 sec: 5379.4, 60 sec: 5492.7, 300 sec: 5465.6). Total num frames: 13069312. Throughput: 0: 5615.0. Samples: 13068190. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:42:21,821][25689] Avg episode reward: [(0, '-70.404')] [2022-07-09 00:42:23,472][26022] Updated weights on worker 0-0, policy_version 12772 (0.00088) [2022-07-09 00:42:25,189][26022] Updated weights on worker 0-0, policy_version 12782 (0.00094) [2022-07-09 00:42:26,828][25689] Fps is (10 sec: 5477.6, 60 sec: 5475.6, 300 sec: 5462.1). Total num frames: 13096960. Throughput: 0: 5734.9. Samples: 13101096. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:42:26,829][25689] Avg episode reward: [(0, '-70.356')] [2022-07-09 00:42:27,134][26022] Updated weights on worker 0-0, policy_version 12792 (0.00092) [2022-07-09 00:42:28,999][26022] Updated weights on worker 0-0, policy_version 12802 (0.00095) [2022-07-09 00:42:30,984][26022] Updated weights on worker 0-0, policy_version 12812 (0.00086) [2022-07-09 00:42:31,843][25689] Fps is (10 sec: 5414.8, 60 sec: 5458.8, 300 sec: 5456.7). Total num frames: 13123584. Throughput: 0: 4911.2. Samples: 13117486. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 00:42:31,844][25689] Avg episode reward: [(0, '-71.144')] [2022-07-09 00:42:32,789][26022] Updated weights on worker 0-0, policy_version 12822 (0.00091) [2022-07-09 00:42:34,591][26022] Updated weights on worker 0-0, policy_version 12832 (0.00087) [2022-07-09 00:42:36,528][26022] Updated weights on worker 0-0, policy_version 12842 (0.00092) [2022-07-09 00:42:36,915][25689] Fps is (10 sec: 5482.3, 60 sec: 5486.4, 300 sec: 5463.7). Total num frames: 13152256. Throughput: 0: 5703.3. Samples: 13150326. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 00:42:36,916][25689] Avg episode reward: [(0, '-72.384')] [2022-07-09 00:42:38,332][26022] Updated weights on worker 0-0, policy_version 12852 (0.00088) [2022-07-09 00:42:40,350][26022] Updated weights on worker 0-0, policy_version 12862 (0.00093) [2022-07-09 00:42:41,974][25689] Fps is (10 sec: 5559.4, 60 sec: 5469.6, 300 sec: 5462.7). Total num frames: 13179904. Throughput: 0: 5694.6. Samples: 13182956. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 00:42:41,974][25689] Avg episode reward: [(0, '-71.979')] [2022-07-09 00:42:42,374][26022] Updated weights on worker 0-0, policy_version 12872 (0.00085) [2022-07-09 00:42:43,966][26022] Updated weights on worker 0-0, policy_version 12882 (0.00097) [2022-07-09 00:42:45,856][26022] Updated weights on worker 0-0, policy_version 12892 (0.00100) [2022-07-09 00:42:46,977][25689] Fps is (10 sec: 5393.5, 60 sec: 5453.2, 300 sec: 5460.0). Total num frames: 13206528. Throughput: 0: 5705.0. Samples: 13216044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 00:42:46,978][25689] Avg episode reward: [(0, '-70.665')] [2022-07-09 00:42:47,709][26022] Updated weights on worker 0-0, policy_version 12902 (0.00084) [2022-07-09 00:42:49,516][26022] Updated weights on worker 0-0, policy_version 12912 (0.00088) [2022-07-09 00:42:51,582][26022] Updated weights on worker 0-0, policy_version 12922 (0.00086) [2022-07-09 00:42:51,989][25689] Fps is (10 sec: 5419.0, 60 sec: 5440.5, 300 sec: 5461.1). Total num frames: 13234176. Throughput: 0: 5712.1. Samples: 13232560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:42:51,989][25689] Avg episode reward: [(0, '-71.800')] [2022-07-09 00:42:53,307][26022] Updated weights on worker 0-0, policy_version 12932 (0.00085) [2022-07-09 00:42:55,261][26022] Updated weights on worker 0-0, policy_version 12942 (0.00087) [2022-07-09 00:42:57,035][25689] Fps is (10 sec: 5497.5, 60 sec: 5436.9, 300 sec: 5464.4). Total num frames: 13261824. Throughput: 0: 5722.1. Samples: 13265460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:42:57,036][25689] Avg episode reward: [(0, '-70.590')] [2022-07-09 00:42:57,188][26022] Updated weights on worker 0-0, policy_version 12952 (0.00089) [2022-07-09 00:42:58,947][26022] Updated weights on worker 0-0, policy_version 12962 (0.00099) [2022-07-09 00:43:00,671][26022] Updated weights on worker 0-0, policy_version 12972 (0.00091) [2022-07-09 00:43:02,091][25689] Fps is (10 sec: 5473.7, 60 sec: 5455.8, 300 sec: 5474.1). Total num frames: 13289472. Throughput: 0: 5742.7. Samples: 13298484. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:43:02,091][25689] Avg episode reward: [(0, '-69.424')] [2022-07-09 00:43:03,078][26022] Updated weights on worker 0-0, policy_version 12982 (0.00090) [2022-07-09 00:43:04,963][26022] Updated weights on worker 0-0, policy_version 12992 (0.00087) [2022-07-09 00:43:06,835][26022] Updated weights on worker 0-0, policy_version 13002 (0.00084) [2022-07-09 00:43:07,119][25689] Fps is (10 sec: 5382.3, 60 sec: 5473.7, 300 sec: 5466.9). Total num frames: 13316096. Throughput: 0: 4822.1. Samples: 13313170. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 00:43:07,119][25689] Avg episode reward: [(0, '-70.013')] [2022-07-09 00:43:08,582][26022] Updated weights on worker 0-0, policy_version 13012 (0.00083) [2022-07-09 00:43:10,318][26022] Updated weights on worker 0-0, policy_version 13022 (0.00086) [2022-07-09 00:43:12,126][25689] Fps is (10 sec: 5408.3, 60 sec: 5442.6, 300 sec: 5470.8). Total num frames: 13343744. Throughput: 0: 5652.4. Samples: 13346384. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:43:12,126][25689] Avg episode reward: [(0, '-70.603')] [2022-07-09 00:43:12,470][26022] Updated weights on worker 0-0, policy_version 13032 (0.00090) [2022-07-09 00:43:13,990][26022] Updated weights on worker 0-0, policy_version 13042 (0.00096) [2022-07-09 00:43:16,089][26022] Updated weights on worker 0-0, policy_version 13052 (0.00090) [2022-07-09 00:43:17,143][25689] Fps is (10 sec: 5414.2, 60 sec: 5441.3, 300 sec: 5457.5). Total num frames: 13370368. Throughput: 0: 5680.7. Samples: 13379684. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:43:17,149][25689] Avg episode reward: [(0, '-70.983')] [2022-07-09 00:43:17,640][26022] Updated weights on worker 0-0, policy_version 13062 (0.00097) [2022-07-09 00:43:19,650][26022] Updated weights on worker 0-0, policy_version 13072 (0.00092) [2022-07-09 00:43:21,548][26022] Updated weights on worker 0-0, policy_version 13082 (0.00091) [2022-07-09 00:43:22,252][25689] Fps is (10 sec: 5561.9, 60 sec: 5473.1, 300 sec: 5476.4). Total num frames: 13400064. Throughput: 0: 4853.9. Samples: 13396344. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:43:22,254][25689] Avg episode reward: [(0, '-70.305')] [2022-07-09 00:43:23,491][26022] Updated weights on worker 0-0, policy_version 13092 (0.00087) [2022-07-09 00:43:25,296][26022] Updated weights on worker 0-0, policy_version 13102 (0.00094) [2022-07-09 00:43:27,255][26022] Updated weights on worker 0-0, policy_version 13112 (0.00086) [2022-07-09 00:43:27,282][25689] Fps is (10 sec: 5554.8, 60 sec: 5454.2, 300 sec: 5469.3). Total num frames: 13426688. Throughput: 0: 5762.0. Samples: 13429350. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 00:43:27,282][25689] Avg episode reward: [(0, '-70.401')] [2022-07-09 00:43:28,896][26022] Updated weights on worker 0-0, policy_version 13122 (0.00094) [2022-07-09 00:43:30,979][26022] Updated weights on worker 0-0, policy_version 13132 (0.00096) [2022-07-09 00:43:32,321][25689] Fps is (10 sec: 5288.1, 60 sec: 5452.0, 300 sec: 5465.7). Total num frames: 13453312. Throughput: 0: 5698.8. Samples: 13461474. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:43:32,322][25689] Avg episode reward: [(0, '-71.061')] [2022-07-09 00:43:32,887][26022] Updated weights on worker 0-0, policy_version 13142 (0.00091) [2022-07-09 00:43:34,823][26022] Updated weights on worker 0-0, policy_version 13152 (0.00095) [2022-07-09 00:43:36,522][26022] Updated weights on worker 0-0, policy_version 13162 (0.00090) [2022-07-09 00:43:37,341][25689] Fps is (10 sec: 5395.1, 60 sec: 5439.7, 300 sec: 5466.1). Total num frames: 13480960. Throughput: 0: 4858.5. Samples: 13477818. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:43:37,343][25689] Avg episode reward: [(0, '-69.983')] [2022-07-09 00:43:38,612][26022] Updated weights on worker 0-0, policy_version 13172 (0.00097) [2022-07-09 00:43:40,380][26022] Updated weights on worker 0-0, policy_version 13182 (0.00087) [2022-07-09 00:43:42,270][26022] Updated weights on worker 0-0, policy_version 13192 (0.00091) [2022-07-09 00:43:42,443][25689] Fps is (10 sec: 5564.4, 60 sec: 5452.8, 300 sec: 5464.6). Total num frames: 13509632. Throughput: 0: 5651.5. Samples: 13510452. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:43:42,443][25689] Avg episode reward: [(0, '-69.475')] [2022-07-09 00:43:44,135][26022] Updated weights on worker 0-0, policy_version 13202 (0.00083) [2022-07-09 00:43:46,099][26022] Updated weights on worker 0-0, policy_version 13212 (0.00083) [2022-07-09 00:43:47,459][25689] Fps is (10 sec: 5465.2, 60 sec: 5451.7, 300 sec: 5461.8). Total num frames: 13536256. Throughput: 0: 5661.3. Samples: 13543580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:43:47,460][25689] Avg episode reward: [(0, '-69.603')] [2022-07-09 00:43:47,831][26022] Updated weights on worker 0-0, policy_version 13222 (0.00090) [2022-07-09 00:43:49,791][26022] Updated weights on worker 0-0, policy_version 13232 (0.00091) [2022-07-09 00:43:51,531][26022] Updated weights on worker 0-0, policy_version 13242 (0.00093) [2022-07-09 00:43:52,469][25689] Fps is (10 sec: 5515.4, 60 sec: 5468.8, 300 sec: 5469.1). Total num frames: 13564928. Throughput: 0: 4899.2. Samples: 13560178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:43:52,469][25689] Avg episode reward: [(0, '-69.340')] [2022-07-09 00:43:53,515][26022] Updated weights on worker 0-0, policy_version 13252 (0.00085) [2022-07-09 00:43:55,288][26022] Updated weights on worker 0-0, policy_version 13262 (0.00086) [2022-07-09 00:43:57,128][26022] Updated weights on worker 0-0, policy_version 13272 (0.00094) [2022-07-09 00:43:57,503][25689] Fps is (10 sec: 5505.4, 60 sec: 5453.0, 300 sec: 5469.3). Total num frames: 13591552. Throughput: 0: 5720.2. Samples: 13593148. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:43:57,504][25689] Avg episode reward: [(0, '-68.236')] [2022-07-09 00:43:58,885][26022] Updated weights on worker 0-0, policy_version 13282 (0.00082) [2022-07-09 00:44:00,791][26022] Updated weights on worker 0-0, policy_version 13292 (0.00093) [2022-07-09 00:44:02,636][25689] Fps is (10 sec: 5136.3, 60 sec: 5412.1, 300 sec: 5467.4). Total num frames: 13617152. Throughput: 0: 5680.3. Samples: 13625156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:44:02,637][25689] Avg episode reward: [(0, '-68.190')] [2022-07-09 00:44:03,347][26022] Updated weights on worker 0-0, policy_version 13302 (0.00088) [2022-07-09 00:44:05,027][26022] Updated weights on worker 0-0, policy_version 13312 (0.00095) [2022-07-09 00:44:06,849][26022] Updated weights on worker 0-0, policy_version 13322 (0.00081) [2022-07-09 00:44:07,638][25689] Fps is (10 sec: 5355.2, 60 sec: 5448.4, 300 sec: 5467.7). Total num frames: 13645824. Throughput: 0: 4797.2. Samples: 13640380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:44:07,638][25689] Avg episode reward: [(0, '-67.828')] [2022-07-09 00:44:08,679][26022] Updated weights on worker 0-0, policy_version 13332 (0.00088) [2022-07-09 00:44:10,620][26022] Updated weights on worker 0-0, policy_version 13342 (0.00084) [2022-07-09 00:44:12,624][26022] Updated weights on worker 0-0, policy_version 13352 (0.00091) [2022-07-09 00:44:12,699][25689] Fps is (10 sec: 5495.2, 60 sec: 5426.6, 300 sec: 5459.9). Total num frames: 13672448. Throughput: 0: 5603.8. Samples: 13673544. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 00:44:12,700][25689] Avg episode reward: [(0, '-67.668')] [2022-07-09 00:44:13,086][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:44:13,103][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000013354_13674496.pth [2022-07-09 00:44:13,104][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000011429_11703296.pth [2022-07-09 00:44:14,303][26022] Updated weights on worker 0-0, policy_version 13362 (0.00089) [2022-07-09 00:44:16,406][26022] Updated weights on worker 0-0, policy_version 13372 (0.00091) [2022-07-09 00:44:17,715][25689] Fps is (10 sec: 5487.3, 60 sec: 5460.5, 300 sec: 5470.9). Total num frames: 13701120. Throughput: 0: 5604.8. Samples: 13706430. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 00:44:17,715][25689] Avg episode reward: [(0, '-67.957')] [2022-07-09 00:44:18,292][26022] Updated weights on worker 0-0, policy_version 13382 (0.00089) [2022-07-09 00:44:20,075][26022] Updated weights on worker 0-0, policy_version 13392 (0.00097) [2022-07-09 00:44:21,767][26022] Updated weights on worker 0-0, policy_version 13402 (0.00087) [2022-07-09 00:44:22,781][25689] Fps is (10 sec: 5585.9, 60 sec: 5430.5, 300 sec: 5463.1). Total num frames: 13728768. Throughput: 0: 4856.8. Samples: 13722996. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 00:44:22,783][25689] Avg episode reward: [(0, '-68.631')] [2022-07-09 00:44:23,676][26022] Updated weights on worker 0-0, policy_version 13412 (0.00091) [2022-07-09 00:44:25,630][26022] Updated weights on worker 0-0, policy_version 13422 (0.00099) [2022-07-09 00:44:27,575][26022] Updated weights on worker 0-0, policy_version 13432 (0.00091) [2022-07-09 00:44:27,792][25689] Fps is (10 sec: 5487.2, 60 sec: 5449.1, 300 sec: 5466.6). Total num frames: 13756416. Throughput: 0: 5741.4. Samples: 13756094. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 00:44:27,792][25689] Avg episode reward: [(0, '-68.521')] [2022-07-09 00:44:29,313][26022] Updated weights on worker 0-0, policy_version 13442 (0.00093) [2022-07-09 00:44:31,282][26022] Updated weights on worker 0-0, policy_version 13452 (0.00082) [2022-07-09 00:44:32,879][25689] Fps is (10 sec: 5374.7, 60 sec: 5444.9, 300 sec: 5462.4). Total num frames: 13783040. Throughput: 0: 5716.4. Samples: 13788902. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 00:44:32,882][25689] Avg episode reward: [(0, '-69.304')] [2022-07-09 00:44:33,009][26022] Updated weights on worker 0-0, policy_version 13462 (0.00087) [2022-07-09 00:44:35,068][26022] Updated weights on worker 0-0, policy_version 13472 (0.00089) [2022-07-09 00:44:36,778][26022] Updated weights on worker 0-0, policy_version 13482 (0.00085) [2022-07-09 00:44:37,972][25689] Fps is (10 sec: 5431.8, 60 sec: 5455.2, 300 sec: 5458.7). Total num frames: 13811712. Throughput: 0: 4882.3. Samples: 13805332. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:44:37,973][25689] Avg episode reward: [(0, '-68.877')] [2022-07-09 00:44:38,688][26022] Updated weights on worker 0-0, policy_version 13492 (0.00088) [2022-07-09 00:44:40,768][26022] Updated weights on worker 0-0, policy_version 13502 (0.00091) [2022-07-09 00:44:42,257][26022] Updated weights on worker 0-0, policy_version 13512 (0.00087) [2022-07-09 00:44:43,079][25689] Fps is (10 sec: 5521.8, 60 sec: 5437.8, 300 sec: 5463.8). Total num frames: 13839360. Throughput: 0: 5688.4. Samples: 13838456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:44:43,079][25689] Avg episode reward: [(0, '-69.173')] [2022-07-09 00:44:44,331][26022] Updated weights on worker 0-0, policy_version 13522 (0.00091) [2022-07-09 00:44:46,171][26022] Updated weights on worker 0-0, policy_version 13532 (0.00060) [2022-07-09 00:44:47,937][26022] Updated weights on worker 0-0, policy_version 13542 (0.00087) [2022-07-09 00:44:48,092][25689] Fps is (10 sec: 5463.8, 60 sec: 5455.0, 300 sec: 5460.3). Total num frames: 13867008. Throughput: 0: 5689.1. Samples: 13871584. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:44:48,093][25689] Avg episode reward: [(0, '-68.976')] [2022-07-09 00:44:49,807][26022] Updated weights on worker 0-0, policy_version 13552 (0.00082) [2022-07-09 00:44:51,663][26022] Updated weights on worker 0-0, policy_version 13562 (0.00085) [2022-07-09 00:44:53,095][25689] Fps is (10 sec: 5623.0, 60 sec: 5455.6, 300 sec: 5467.3). Total num frames: 13895680. Throughput: 0: 5731.5. Samples: 13904768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 00:44:53,095][25689] Avg episode reward: [(0, '-69.059')] [2022-07-09 00:44:53,486][26022] Updated weights on worker 0-0, policy_version 13572 (0.00087) [2022-07-09 00:44:55,457][26022] Updated weights on worker 0-0, policy_version 13582 (0.00098) [2022-07-09 00:44:57,314][26022] Updated weights on worker 0-0, policy_version 13592 (0.00105) [2022-07-09 00:44:58,120][25689] Fps is (10 sec: 5514.1, 60 sec: 5456.4, 300 sec: 5464.6). Total num frames: 13922304. Throughput: 0: 5754.8. Samples: 13921284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:44:58,122][25689] Avg episode reward: [(0, '-69.602')] [2022-07-09 00:44:59,088][26022] Updated weights on worker 0-0, policy_version 13602 (0.00092) [2022-07-09 00:45:01,067][26022] Updated weights on worker 0-0, policy_version 13612 (0.00093) [2022-07-09 00:45:03,255][25689] Fps is (10 sec: 5140.1, 60 sec: 5456.3, 300 sec: 5459.7). Total num frames: 13947904. Throughput: 0: 5654.3. Samples: 13952538. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:45:03,256][25689] Avg episode reward: [(0, '-69.147')] [2022-07-09 00:45:03,279][26022] Updated weights on worker 0-0, policy_version 13622 (0.00079) [2022-07-09 00:45:05,135][26022] Updated weights on worker 0-0, policy_version 13632 (0.00655) [2022-07-09 00:45:06,978][26022] Updated weights on worker 0-0, policy_version 13642 (0.00094) [2022-07-09 00:45:08,311][25689] Fps is (10 sec: 5225.3, 60 sec: 5434.5, 300 sec: 5459.2). Total num frames: 13975552. Throughput: 0: 5596.3. Samples: 13984734. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:45:08,311][25689] Avg episode reward: [(0, '-68.977')] [2022-07-09 00:45:08,781][26022] Updated weights on worker 0-0, policy_version 13652 (0.00087) [2022-07-09 00:45:10,879][26022] Updated weights on worker 0-0, policy_version 13662 (0.00085) [2022-07-09 00:45:12,628][26022] Updated weights on worker 0-0, policy_version 13672 (0.00087) [2022-07-09 00:45:13,314][25689] Fps is (10 sec: 5497.2, 60 sec: 5456.6, 300 sec: 5462.6). Total num frames: 14003200. Throughput: 0: 4763.7. Samples: 14001086. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:45:13,314][25689] Avg episode reward: [(0, '-69.221')] [2022-07-09 00:45:14,426][26022] Updated weights on worker 0-0, policy_version 13682 (0.00082) [2022-07-09 00:45:16,318][26022] Updated weights on worker 0-0, policy_version 13692 (0.00089) [2022-07-09 00:45:18,240][26022] Updated weights on worker 0-0, policy_version 13702 (0.00094) [2022-07-09 00:45:18,355][25689] Fps is (10 sec: 5505.4, 60 sec: 5437.5, 300 sec: 5460.2). Total num frames: 14030848. Throughput: 0: 5601.1. Samples: 14034618. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:45:18,355][25689] Avg episode reward: [(0, '-69.016')] [2022-07-09 00:45:20,141][26022] Updated weights on worker 0-0, policy_version 13712 (0.00094) [2022-07-09 00:45:21,848][26022] Updated weights on worker 0-0, policy_version 13722 (0.00090) [2022-07-09 00:45:23,425][25689] Fps is (10 sec: 5570.3, 60 sec: 5454.1, 300 sec: 5462.8). Total num frames: 14059520. Throughput: 0: 5707.8. Samples: 14067662. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:45:23,425][25689] Avg episode reward: [(0, '-69.478')] [2022-07-09 00:45:23,675][26022] Updated weights on worker 0-0, policy_version 13732 (0.00093) [2022-07-09 00:45:25,766][26022] Updated weights on worker 0-0, policy_version 13742 (0.00091) [2022-07-09 00:45:27,392][26022] Updated weights on worker 0-0, policy_version 13752 (0.00082) [2022-07-09 00:45:28,438][25689] Fps is (10 sec: 5484.1, 60 sec: 5436.9, 300 sec: 5455.9). Total num frames: 14086144. Throughput: 0: 4936.5. Samples: 14084090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:45:28,438][25689] Avg episode reward: [(0, '-69.894')] [2022-07-09 00:45:29,520][26022] Updated weights on worker 0-0, policy_version 13762 (0.00087) [2022-07-09 00:45:31,053][26022] Updated weights on worker 0-0, policy_version 13772 (0.00087) [2022-07-09 00:45:33,363][26022] Updated weights on worker 0-0, policy_version 13782 (0.00091) [2022-07-09 00:45:33,460][25689] Fps is (10 sec: 5408.1, 60 sec: 5459.7, 300 sec: 5456.3). Total num frames: 14113792. Throughput: 0: 5756.1. Samples: 14117050. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:45:33,460][25689] Avg episode reward: [(0, '-69.818')] [2022-07-09 00:45:34,961][26022] Updated weights on worker 0-0, policy_version 13792 (0.00089) [2022-07-09 00:45:36,900][26022] Updated weights on worker 0-0, policy_version 13802 (0.00104) [2022-07-09 00:45:38,463][25689] Fps is (10 sec: 5515.4, 60 sec: 5450.8, 300 sec: 5457.8). Total num frames: 14141440. Throughput: 0: 5724.4. Samples: 14149728. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:45:38,464][25689] Avg episode reward: [(0, '-69.790')] [2022-07-09 00:45:38,760][26022] Updated weights on worker 0-0, policy_version 13812 (0.00093) [2022-07-09 00:45:40,595][26022] Updated weights on worker 0-0, policy_version 13822 (0.00090) [2022-07-09 00:45:42,679][26022] Updated weights on worker 0-0, policy_version 13832 (0.00076) [2022-07-09 00:45:43,508][25689] Fps is (10 sec: 5605.0, 60 sec: 5473.4, 300 sec: 5457.9). Total num frames: 14170112. Throughput: 0: 4907.2. Samples: 14166216. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:45:43,509][25689] Avg episode reward: [(0, '-69.367')] [2022-07-09 00:45:44,356][26022] Updated weights on worker 0-0, policy_version 13842 (0.00083) [2022-07-09 00:45:46,214][26022] Updated weights on worker 0-0, policy_version 13852 (0.00080) [2022-07-09 00:45:47,963][26022] Updated weights on worker 0-0, policy_version 13862 (0.00084) [2022-07-09 00:45:48,599][25689] Fps is (10 sec: 5455.7, 60 sec: 5449.5, 300 sec: 5450.6). Total num frames: 14196736. Throughput: 0: 5725.8. Samples: 14199530. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:45:48,599][25689] Avg episode reward: [(0, '-69.690')] [2022-07-09 00:45:50,025][26022] Updated weights on worker 0-0, policy_version 13872 (0.00084) [2022-07-09 00:45:51,669][26022] Updated weights on worker 0-0, policy_version 13882 (0.00083) [2022-07-09 00:45:53,619][25689] Fps is (10 sec: 5368.0, 60 sec: 5431.0, 300 sec: 5454.1). Total num frames: 14224384. Throughput: 0: 5721.8. Samples: 14232394. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 00:45:53,620][25689] Avg episode reward: [(0, '-69.410')] [2022-07-09 00:45:53,687][26022] Updated weights on worker 0-0, policy_version 13892 (0.00087) [2022-07-09 00:45:55,402][26022] Updated weights on worker 0-0, policy_version 13902 (0.00089) [2022-07-09 00:45:57,175][26022] Updated weights on worker 0-0, policy_version 13912 (0.00097) [2022-07-09 00:45:58,656][25689] Fps is (10 sec: 5498.3, 60 sec: 5446.9, 300 sec: 5451.4). Total num frames: 14252032. Throughput: 0: 4923.1. Samples: 14249136. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:45:58,656][25689] Avg episode reward: [(0, '-68.895')] [2022-07-09 00:45:59,019][26022] Updated weights on worker 0-0, policy_version 13922 (0.00106) [2022-07-09 00:46:00,903][26022] Updated weights on worker 0-0, policy_version 13932 (0.00086) [2022-07-09 00:46:03,119][26022] Updated weights on worker 0-0, policy_version 13942 (0.00086) [2022-07-09 00:46:03,725][25689] Fps is (10 sec: 5370.2, 60 sec: 5469.7, 300 sec: 5454.7). Total num frames: 14278656. Throughput: 0: 5638.8. Samples: 14280212. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:46:03,725][25689] Avg episode reward: [(0, '-68.660')] [2022-07-09 00:46:05,324][26022] Updated weights on worker 0-0, policy_version 13952 (0.00066) [2022-07-09 00:46:06,960][26022] Updated weights on worker 0-0, policy_version 13962 (0.00086) [2022-07-09 00:46:08,779][25689] Fps is (10 sec: 5260.1, 60 sec: 5452.9, 300 sec: 5458.0). Total num frames: 14305280. Throughput: 0: 5629.3. Samples: 14313128. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:46:08,779][25689] Avg episode reward: [(0, '-68.679')] [2022-07-09 00:46:09,037][26022] Updated weights on worker 0-0, policy_version 13972 (0.00095) [2022-07-09 00:46:10,548][26022] Updated weights on worker 0-0, policy_version 13982 (0.00090) [2022-07-09 00:46:12,727][26022] Updated weights on worker 0-0, policy_version 13992 (0.00090) [2022-07-09 00:46:13,177][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:46:13,188][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000013995_14330880.pth [2022-07-09 00:46:13,199][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000012071_12360704.pth [2022-07-09 00:46:13,869][25689] Fps is (10 sec: 5552.1, 60 sec: 5478.9, 300 sec: 5453.9). Total num frames: 14334976. Throughput: 0: 4812.4. Samples: 14329846. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:46:13,870][25689] Avg episode reward: [(0, '-67.843')] [2022-07-09 00:46:14,451][26022] Updated weights on worker 0-0, policy_version 14002 (0.00085) [2022-07-09 00:46:16,451][26022] Updated weights on worker 0-0, policy_version 14012 (0.00092) [2022-07-09 00:46:17,965][26022] Updated weights on worker 0-0, policy_version 14022 (0.00095) [2022-07-09 00:46:18,877][25689] Fps is (10 sec: 5577.5, 60 sec: 5465.0, 300 sec: 5459.8). Total num frames: 14361600. Throughput: 0: 5630.6. Samples: 14362992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 00:46:18,877][25689] Avg episode reward: [(0, '-68.441')] [2022-07-09 00:46:19,999][26022] Updated weights on worker 0-0, policy_version 14032 (0.00088) [2022-07-09 00:46:21,928][26022] Updated weights on worker 0-0, policy_version 14042 (0.00089) [2022-07-09 00:46:23,654][26022] Updated weights on worker 0-0, policy_version 14052 (0.00093) [2022-07-09 00:46:23,929][25689] Fps is (10 sec: 5496.4, 60 sec: 5466.6, 300 sec: 5458.9). Total num frames: 14390272. Throughput: 0: 5753.4. Samples: 14396456. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 00:46:23,930][25689] Avg episode reward: [(0, '-69.059')] [2022-07-09 00:46:25,577][26022] Updated weights on worker 0-0, policy_version 14062 (0.00086) [2022-07-09 00:46:27,358][26022] Updated weights on worker 0-0, policy_version 14072 (0.00085) [2022-07-09 00:46:28,942][25689] Fps is (10 sec: 5595.5, 60 sec: 5483.5, 300 sec: 5459.0). Total num frames: 14417920. Throughput: 0: 4959.0. Samples: 14413116. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 00:46:28,942][25689] Avg episode reward: [(0, '-68.850')] [2022-07-09 00:46:29,569][26022] Updated weights on worker 0-0, policy_version 14082 (0.00096) [2022-07-09 00:46:31,048][26022] Updated weights on worker 0-0, policy_version 14092 (0.00610) [2022-07-09 00:46:33,208][26022] Updated weights on worker 0-0, policy_version 14102 (0.00629) [2022-07-09 00:46:33,967][25689] Fps is (10 sec: 5610.9, 60 sec: 5500.2, 300 sec: 5465.5). Total num frames: 14446592. Throughput: 0: 5801.7. Samples: 14446448. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 00:46:33,967][25689] Avg episode reward: [(0, '-69.038')] [2022-07-09 00:46:34,735][26022] Updated weights on worker 0-0, policy_version 14112 (0.00099) [2022-07-09 00:46:36,720][26022] Updated weights on worker 0-0, policy_version 14122 (0.00094) [2022-07-09 00:46:38,672][26022] Updated weights on worker 0-0, policy_version 14132 (0.00091) [2022-07-09 00:46:38,987][25689] Fps is (10 sec: 5402.9, 60 sec: 5464.9, 300 sec: 5455.9). Total num frames: 14472192. Throughput: 0: 5767.6. Samples: 14478978. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 00:46:38,987][25689] Avg episode reward: [(0, '-69.457')] [2022-07-09 00:46:40,446][26022] Updated weights on worker 0-0, policy_version 14142 (0.00090) [2022-07-09 00:46:42,427][26022] Updated weights on worker 0-0, policy_version 14152 (0.00086) [2022-07-09 00:46:44,061][25689] Fps is (10 sec: 5376.7, 60 sec: 5462.2, 300 sec: 5458.1). Total num frames: 14500864. Throughput: 0: 4914.5. Samples: 14495392. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 00:46:44,061][25689] Avg episode reward: [(0, '-69.211')] [2022-07-09 00:46:44,101][26022] Updated weights on worker 0-0, policy_version 14162 (0.00079) [2022-07-09 00:46:46,256][26022] Updated weights on worker 0-0, policy_version 14172 (0.00089) [2022-07-09 00:46:47,832][26022] Updated weights on worker 0-0, policy_version 14182 (0.00093) [2022-07-09 00:46:49,091][25689] Fps is (10 sec: 5573.9, 60 sec: 5484.6, 300 sec: 5455.2). Total num frames: 14528512. Throughput: 0: 5734.6. Samples: 14528664. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 00:46:49,093][25689] Avg episode reward: [(0, '-69.604')] [2022-07-09 00:46:49,643][26022] Updated weights on worker 0-0, policy_version 14192 (0.00081) [2022-07-09 00:46:51,664][26022] Updated weights on worker 0-0, policy_version 14202 (0.00085) [2022-07-09 00:46:53,394][26022] Updated weights on worker 0-0, policy_version 14212 (0.00056) [2022-07-09 00:46:54,157][25689] Fps is (10 sec: 5375.4, 60 sec: 5463.5, 300 sec: 5450.7). Total num frames: 14555136. Throughput: 0: 5719.3. Samples: 14561922. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 00:46:54,158][25689] Avg episode reward: [(0, '-69.595')] [2022-07-09 00:46:55,383][26022] Updated weights on worker 0-0, policy_version 14222 (0.00077) [2022-07-09 00:46:57,166][26022] Updated weights on worker 0-0, policy_version 14232 (0.00087) [2022-07-09 00:46:58,955][26022] Updated weights on worker 0-0, policy_version 14242 (0.00079) [2022-07-09 00:46:59,179][25689] Fps is (10 sec: 5582.9, 60 sec: 5498.7, 300 sec: 5462.0). Total num frames: 14584832. Throughput: 0: 4927.4. Samples: 14578474. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 00:46:59,179][25689] Avg episode reward: [(0, '-69.869')] [2022-07-09 00:47:00,994][26022] Updated weights on worker 0-0, policy_version 14252 (0.00082) [2022-07-09 00:47:03,087][26022] Updated weights on worker 0-0, policy_version 14262 (0.00089) [2022-07-09 00:47:04,269][25689] Fps is (10 sec: 5468.5, 60 sec: 5479.9, 300 sec: 5461.1). Total num frames: 14610432. Throughput: 0: 5646.1. Samples: 14609490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:47:04,269][25689] Avg episode reward: [(0, '-69.986')] [2022-07-09 00:47:05,137][26022] Updated weights on worker 0-0, policy_version 14272 (0.00098) [2022-07-09 00:47:06,995][26022] Updated weights on worker 0-0, policy_version 14282 (0.00093) [2022-07-09 00:47:08,670][26022] Updated weights on worker 0-0, policy_version 14292 (0.00093) [2022-07-09 00:47:09,341][25689] Fps is (10 sec: 5340.4, 60 sec: 5512.0, 300 sec: 5457.0). Total num frames: 14639104. Throughput: 0: 5623.7. Samples: 14642548. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:47:09,342][25689] Avg episode reward: [(0, '-69.604')] [2022-07-09 00:47:10,540][26022] Updated weights on worker 0-0, policy_version 14302 (0.00088) [2022-07-09 00:47:12,285][26022] Updated weights on worker 0-0, policy_version 14312 (0.00089) [2022-07-09 00:47:14,398][25689] Fps is (10 sec: 5358.2, 60 sec: 5447.4, 300 sec: 5452.6). Total num frames: 14664704. Throughput: 0: 4809.0. Samples: 14659258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:47:14,398][25689] Avg episode reward: [(0, '-70.511')] [2022-07-09 00:47:14,408][26022] Updated weights on worker 0-0, policy_version 14322 (0.00086) [2022-07-09 00:47:16,144][26022] Updated weights on worker 0-0, policy_version 14332 (0.00088) [2022-07-09 00:47:17,977][26022] Updated weights on worker 0-0, policy_version 14342 (0.00083) [2022-07-09 00:47:19,401][25689] Fps is (10 sec: 5496.8, 60 sec: 5498.6, 300 sec: 5461.0). Total num frames: 14694400. Throughput: 0: 5643.1. Samples: 14692590. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:47:19,402][25689] Avg episode reward: [(0, '-69.925')] [2022-07-09 00:47:19,756][26022] Updated weights on worker 0-0, policy_version 14352 (0.00048) [2022-07-09 00:47:21,707][26022] Updated weights on worker 0-0, policy_version 14362 (0.00053) [2022-07-09 00:47:23,595][26022] Updated weights on worker 0-0, policy_version 14372 (0.00085) [2022-07-09 00:47:24,467][25689] Fps is (10 sec: 5593.3, 60 sec: 5463.6, 300 sec: 5456.5). Total num frames: 14721024. Throughput: 0: 5759.1. Samples: 14725812. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:47:24,467][25689] Avg episode reward: [(0, '-69.901')] [2022-07-09 00:47:25,377][26022] Updated weights on worker 0-0, policy_version 14382 (0.00088) [2022-07-09 00:47:27,392][26022] Updated weights on worker 0-0, policy_version 14392 (0.00095) [2022-07-09 00:47:29,195][26022] Updated weights on worker 0-0, policy_version 14402 (0.00087) [2022-07-09 00:47:29,484][25689] Fps is (10 sec: 5484.1, 60 sec: 5480.1, 300 sec: 5463.3). Total num frames: 14749696. Throughput: 0: 5750.5. Samples: 14758378. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:47:29,485][25689] Avg episode reward: [(0, '-70.596')] [2022-07-09 00:47:31,093][26022] Updated weights on worker 0-0, policy_version 14412 (0.00098) [2022-07-09 00:47:32,924][26022] Updated weights on worker 0-0, policy_version 14422 (0.00092) [2022-07-09 00:47:34,506][25689] Fps is (10 sec: 5507.9, 60 sec: 5446.5, 300 sec: 5457.4). Total num frames: 14776320. Throughput: 0: 5738.0. Samples: 14774640. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:47:34,507][25689] Avg episode reward: [(0, '-71.304')] [2022-07-09 00:47:34,822][26022] Updated weights on worker 0-0, policy_version 14432 (0.00099) [2022-07-09 00:47:36,752][26022] Updated weights on worker 0-0, policy_version 14442 (0.00088) [2022-07-09 00:47:38,798][26022] Updated weights on worker 0-0, policy_version 14452 (0.00089) [2022-07-09 00:47:39,518][25689] Fps is (10 sec: 5306.7, 60 sec: 5464.1, 300 sec: 5454.8). Total num frames: 14802944. Throughput: 0: 5697.5. Samples: 14807208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:47:39,519][25689] Avg episode reward: [(0, '-70.598')] [2022-07-09 00:47:40,449][26022] Updated weights on worker 0-0, policy_version 14462 (0.00083) [2022-07-09 00:47:42,494][26022] Updated weights on worker 0-0, policy_version 14472 (0.00081) [2022-07-09 00:47:44,167][26022] Updated weights on worker 0-0, policy_version 14482 (0.00087) [2022-07-09 00:47:44,648][25689] Fps is (10 sec: 5351.2, 60 sec: 5442.2, 300 sec: 5455.9). Total num frames: 14830592. Throughput: 0: 5670.5. Samples: 14840250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 00:47:44,649][25689] Avg episode reward: [(0, '-71.352')] [2022-07-09 00:47:46,059][26022] Updated weights on worker 0-0, policy_version 14492 (0.00086) [2022-07-09 00:47:48,179][26022] Updated weights on worker 0-0, policy_version 14502 (0.00093) [2022-07-09 00:47:49,691][25689] Fps is (10 sec: 5536.2, 60 sec: 5457.9, 300 sec: 5458.8). Total num frames: 14859264. Throughput: 0: 4870.7. Samples: 14856802. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:47:49,692][25689] Avg episode reward: [(0, '-71.515')] [2022-07-09 00:47:49,779][26022] Updated weights on worker 0-0, policy_version 14512 (0.00085) [2022-07-09 00:47:51,934][26022] Updated weights on worker 0-0, policy_version 14522 (0.00087) [2022-07-09 00:47:53,595][26022] Updated weights on worker 0-0, policy_version 14532 (0.00081) [2022-07-09 00:47:54,747][25689] Fps is (10 sec: 5475.6, 60 sec: 5458.9, 300 sec: 5455.2). Total num frames: 14885888. Throughput: 0: 5691.9. Samples: 14889848. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:47:54,747][25689] Avg episode reward: [(0, '-71.625')] [2022-07-09 00:47:55,487][26022] Updated weights on worker 0-0, policy_version 14542 (0.00091) [2022-07-09 00:47:57,355][26022] Updated weights on worker 0-0, policy_version 14552 (0.00090) [2022-07-09 00:47:59,149][26022] Updated weights on worker 0-0, policy_version 14562 (0.00087) [2022-07-09 00:47:59,760][25689] Fps is (10 sec: 5491.9, 60 sec: 5442.7, 300 sec: 5459.4). Total num frames: 14914560. Throughput: 0: 5718.8. Samples: 14922966. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:47:59,762][25689] Avg episode reward: [(0, '-71.116')] [2022-07-09 00:48:01,015][26022] Updated weights on worker 0-0, policy_version 14572 (0.00112) [2022-07-09 00:48:03,254][26022] Updated weights on worker 0-0, policy_version 14582 (0.00085) [2022-07-09 00:48:04,876][25689] Fps is (10 sec: 5358.2, 60 sec: 5440.5, 300 sec: 5454.3). Total num frames: 14940160. Throughput: 0: 4813.0. Samples: 14937604. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 00:48:04,876][25689] Avg episode reward: [(0, '-71.055')] [2022-07-09 00:48:05,018][26022] Updated weights on worker 0-0, policy_version 14592 (0.00085) [2022-07-09 00:48:06,953][26022] Updated weights on worker 0-0, policy_version 14602 (0.00075) [2022-07-09 00:48:08,992][26022] Updated weights on worker 0-0, policy_version 14612 (0.00089) [2022-07-09 00:48:09,894][25689] Fps is (10 sec: 5153.3, 60 sec: 5411.5, 300 sec: 5450.7). Total num frames: 14966784. Throughput: 0: 5615.4. Samples: 14970248. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 00:48:09,896][25689] Avg episode reward: [(0, '-70.055')] [2022-07-09 00:48:10,573][26022] Updated weights on worker 0-0, policy_version 14622 (0.00088) [2022-07-09 00:48:12,652][26022] Updated weights on worker 0-0, policy_version 14632 (0.00088) [2022-07-09 00:48:13,285][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:48:13,300][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000014636_14987264.pth [2022-07-09 00:48:13,301][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000012715_13020160.pth [2022-07-09 00:48:14,388][26022] Updated weights on worker 0-0, policy_version 14642 (0.00089) [2022-07-09 00:48:14,934][25689] Fps is (10 sec: 5497.5, 60 sec: 5463.7, 300 sec: 5457.1). Total num frames: 14995456. Throughput: 0: 5618.5. Samples: 15003270. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 00:48:14,934][25689] Avg episode reward: [(0, '-70.580')] [2022-07-09 00:48:16,386][26022] Updated weights on worker 0-0, policy_version 14652 (0.00083) [2022-07-09 00:48:18,237][26022] Updated weights on worker 0-0, policy_version 14662 (0.00087) [2022-07-09 00:48:19,994][25689] Fps is (10 sec: 5576.6, 60 sec: 5424.8, 300 sec: 5451.2). Total num frames: 15023104. Throughput: 0: 4790.8. Samples: 15019900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 00:48:19,994][25689] Avg episode reward: [(0, '-69.942')] [2022-07-09 00:48:20,042][26022] Updated weights on worker 0-0, policy_version 14672 (0.00081) [2022-07-09 00:48:21,919][26022] Updated weights on worker 0-0, policy_version 14682 (0.00088) [2022-07-09 00:48:23,663][26022] Updated weights on worker 0-0, policy_version 14692 (0.00093) [2022-07-09 00:48:25,115][25689] Fps is (10 sec: 5532.0, 60 sec: 5453.6, 300 sec: 5456.4). Total num frames: 15051776. Throughput: 0: 5719.8. Samples: 15053368. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 00:48:25,115][25689] Avg episode reward: [(0, '-70.055')] [2022-07-09 00:48:25,525][26022] Updated weights on worker 0-0, policy_version 14702 (0.00091) [2022-07-09 00:48:27,650][26022] Updated weights on worker 0-0, policy_version 14712 (0.00084) [2022-07-09 00:48:29,395][26022] Updated weights on worker 0-0, policy_version 14722 (0.00093) [2022-07-09 00:48:30,128][25689] Fps is (10 sec: 5557.5, 60 sec: 5437.2, 300 sec: 5460.3). Total num frames: 15079424. Throughput: 0: 5728.0. Samples: 15086144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:48:30,128][25689] Avg episode reward: [(0, '-69.484')] [2022-07-09 00:48:31,186][26022] Updated weights on worker 0-0, policy_version 14732 (0.00093) [2022-07-09 00:48:33,011][26022] Updated weights on worker 0-0, policy_version 14742 (0.00099) [2022-07-09 00:48:34,916][26022] Updated weights on worker 0-0, policy_version 14752 (0.00095) [2022-07-09 00:48:35,144][25689] Fps is (10 sec: 5513.2, 60 sec: 5454.5, 300 sec: 5460.4). Total num frames: 15107072. Throughput: 0: 4915.8. Samples: 15102622. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:48:35,145][25689] Avg episode reward: [(0, '-69.372')] [2022-07-09 00:48:36,867][26022] Updated weights on worker 0-0, policy_version 14762 (0.00084) [2022-07-09 00:48:38,573][26022] Updated weights on worker 0-0, policy_version 14772 (0.00085) [2022-07-09 00:48:40,178][25689] Fps is (10 sec: 5502.0, 60 sec: 5469.5, 300 sec: 5458.2). Total num frames: 15134720. Throughput: 0: 5712.0. Samples: 15135192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:48:40,179][25689] Avg episode reward: [(0, '-69.412')] [2022-07-09 00:48:40,603][26022] Updated weights on worker 0-0, policy_version 14782 (0.00090) [2022-07-09 00:48:42,430][26022] Updated weights on worker 0-0, policy_version 14792 (0.00086) [2022-07-09 00:48:44,299][26022] Updated weights on worker 0-0, policy_version 14802 (0.00088) [2022-07-09 00:48:45,243][25689] Fps is (10 sec: 5475.8, 60 sec: 5475.3, 300 sec: 5460.7). Total num frames: 15162368. Throughput: 0: 5702.4. Samples: 15168146. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:48:45,243][25689] Avg episode reward: [(0, '-69.067')] [2022-07-09 00:48:46,306][26022] Updated weights on worker 0-0, policy_version 14812 (0.00089) [2022-07-09 00:48:47,866][26022] Updated weights on worker 0-0, policy_version 14822 (0.00085) [2022-07-09 00:48:49,921][26022] Updated weights on worker 0-0, policy_version 14832 (0.00086) [2022-07-09 00:48:50,259][25689] Fps is (10 sec: 5383.6, 60 sec: 5444.0, 300 sec: 5453.7). Total num frames: 15188992. Throughput: 0: 4902.6. Samples: 15184840. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 00:48:50,259][25689] Avg episode reward: [(0, '-68.661')] [2022-07-09 00:48:51,482][26022] Updated weights on worker 0-0, policy_version 14842 (0.00081) [2022-07-09 00:48:53,646][26022] Updated weights on worker 0-0, policy_version 14852 (0.00090) [2022-07-09 00:48:55,265][25689] Fps is (10 sec: 5517.2, 60 sec: 5482.2, 300 sec: 5461.1). Total num frames: 15217664. Throughput: 0: 5733.1. Samples: 15217976. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:48:55,266][25689] Avg episode reward: [(0, '-67.937')] [2022-07-09 00:48:55,336][26022] Updated weights on worker 0-0, policy_version 14862 (0.00086) [2022-07-09 00:48:57,340][26022] Updated weights on worker 0-0, policy_version 14872 (0.00086) [2022-07-09 00:48:59,240][26022] Updated weights on worker 0-0, policy_version 14882 (0.00087) [2022-07-09 00:49:00,269][25689] Fps is (10 sec: 5626.0, 60 sec: 5466.1, 300 sec: 5470.4). Total num frames: 15245312. Throughput: 0: 5788.7. Samples: 15251498. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:49:00,270][25689] Avg episode reward: [(0, '-68.982')] [2022-07-09 00:49:00,991][26022] Updated weights on worker 0-0, policy_version 14892 (0.00421) [2022-07-09 00:49:03,435][26022] Updated weights on worker 0-0, policy_version 14902 (0.00086) [2022-07-09 00:49:04,974][26022] Updated weights on worker 0-0, policy_version 14912 (0.00051) [2022-07-09 00:49:05,403][25689] Fps is (10 sec: 5151.4, 60 sec: 5447.6, 300 sec: 5454.2). Total num frames: 15269888. Throughput: 0: 4841.6. Samples: 15265754. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:49:05,404][25689] Avg episode reward: [(0, '-68.972')] [2022-07-09 00:49:07,099][26022] Updated weights on worker 0-0, policy_version 14922 (0.00089) [2022-07-09 00:49:08,842][26022] Updated weights on worker 0-0, policy_version 14932 (0.00084) [2022-07-09 00:49:10,493][25689] Fps is (10 sec: 5208.2, 60 sec: 5474.9, 300 sec: 5460.5). Total num frames: 15298560. Throughput: 0: 5617.3. Samples: 15298504. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 00:49:10,494][25689] Avg episode reward: [(0, '-69.070')] [2022-07-09 00:49:10,683][26022] Updated weights on worker 0-0, policy_version 14942 (0.00083) [2022-07-09 00:49:12,746][26022] Updated weights on worker 0-0, policy_version 14952 (0.00086) [2022-07-09 00:49:14,599][26022] Updated weights on worker 0-0, policy_version 14962 (0.00090) [2022-07-09 00:49:15,545][25689] Fps is (10 sec: 5553.2, 60 sec: 5457.0, 300 sec: 5456.4). Total num frames: 15326208. Throughput: 0: 5605.9. Samples: 15331662. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:49:15,545][25689] Avg episode reward: [(0, '-69.172')] [2022-07-09 00:49:16,348][26022] Updated weights on worker 0-0, policy_version 14972 (0.00090) [2022-07-09 00:49:18,208][26022] Updated weights on worker 0-0, policy_version 14982 (0.00085) [2022-07-09 00:49:19,844][26022] Updated weights on worker 0-0, policy_version 14992 (0.00089) [2022-07-09 00:49:20,586][25689] Fps is (10 sec: 5478.9, 60 sec: 5458.7, 300 sec: 5456.9). Total num frames: 15353856. Throughput: 0: 4753.6. Samples: 15348074. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:49:20,586][25689] Avg episode reward: [(0, '-68.762')] [2022-07-09 00:49:22,144][26022] Updated weights on worker 0-0, policy_version 15002 (0.00092) [2022-07-09 00:49:23,799][26022] Updated weights on worker 0-0, policy_version 15012 (0.00053) [2022-07-09 00:49:25,673][25689] Fps is (10 sec: 5560.5, 60 sec: 5461.7, 300 sec: 5458.9). Total num frames: 15382528. Throughput: 0: 5705.5. Samples: 15381408. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:49:25,674][25689] Avg episode reward: [(0, '-68.986')] [2022-07-09 00:49:25,684][26022] Updated weights on worker 0-0, policy_version 15022 (0.00089) [2022-07-09 00:49:27,419][26022] Updated weights on worker 0-0, policy_version 15032 (0.00099) [2022-07-09 00:49:29,165][26022] Updated weights on worker 0-0, policy_version 15042 (0.00085) [2022-07-09 00:49:30,750][25689] Fps is (10 sec: 5540.8, 60 sec: 5455.9, 300 sec: 5462.5). Total num frames: 15410176. Throughput: 0: 5727.0. Samples: 15414516. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:49:30,751][25689] Avg episode reward: [(0, '-68.793')] [2022-07-09 00:49:31,383][26022] Updated weights on worker 0-0, policy_version 15052 (0.00085) [2022-07-09 00:49:32,924][26022] Updated weights on worker 0-0, policy_version 15062 (0.00083) [2022-07-09 00:49:35,046][26022] Updated weights on worker 0-0, policy_version 15072 (0.00093) [2022-07-09 00:49:35,770][25689] Fps is (10 sec: 5578.3, 60 sec: 5472.6, 300 sec: 5463.9). Total num frames: 15438848. Throughput: 0: 5731.0. Samples: 15447570. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 00:49:35,770][25689] Avg episode reward: [(0, '-68.647')] [2022-07-09 00:49:36,679][26022] Updated weights on worker 0-0, policy_version 15082 (0.00089) [2022-07-09 00:49:38,688][26022] Updated weights on worker 0-0, policy_version 15092 (0.00083) [2022-07-09 00:49:40,570][26022] Updated weights on worker 0-0, policy_version 15102 (0.00086) [2022-07-09 00:49:40,783][25689] Fps is (10 sec: 5511.8, 60 sec: 5457.5, 300 sec: 5462.2). Total num frames: 15465472. Throughput: 0: 5737.6. Samples: 15463956. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 00:49:40,784][25689] Avg episode reward: [(0, '-68.886')] [2022-07-09 00:49:42,385][26022] Updated weights on worker 0-0, policy_version 15112 (0.00087) [2022-07-09 00:49:44,143][26022] Updated weights on worker 0-0, policy_version 15122 (0.00094) [2022-07-09 00:49:45,894][25689] Fps is (10 sec: 5461.5, 60 sec: 5470.2, 300 sec: 5463.8). Total num frames: 15494144. Throughput: 0: 5716.5. Samples: 15497000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 00:49:45,895][25689] Avg episode reward: [(0, '-67.758')] [2022-07-09 00:49:46,183][26022] Updated weights on worker 0-0, policy_version 15132 (0.00089) [2022-07-09 00:49:47,918][26022] Updated weights on worker 0-0, policy_version 15142 (0.01153) [2022-07-09 00:49:49,825][26022] Updated weights on worker 0-0, policy_version 15152 (0.00084) [2022-07-09 00:49:50,927][25689] Fps is (10 sec: 5552.0, 60 sec: 5485.6, 300 sec: 5459.8). Total num frames: 15521792. Throughput: 0: 5737.8. Samples: 15530284. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 00:49:50,927][25689] Avg episode reward: [(0, '-67.673')] [2022-07-09 00:49:51,694][26022] Updated weights on worker 0-0, policy_version 15162 (0.00088) [2022-07-09 00:49:53,456][26022] Updated weights on worker 0-0, policy_version 15172 (0.00088) [2022-07-09 00:49:55,600][26022] Updated weights on worker 0-0, policy_version 15182 (0.00088) [2022-07-09 00:49:55,991][25689] Fps is (10 sec: 5476.7, 60 sec: 5463.5, 300 sec: 5462.6). Total num frames: 15549440. Throughput: 0: 4913.7. Samples: 15546930. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 00:49:55,992][25689] Avg episode reward: [(0, '-67.527')] [2022-07-09 00:49:57,355][26022] Updated weights on worker 0-0, policy_version 15192 (0.00352) [2022-07-09 00:49:59,111][26022] Updated weights on worker 0-0, policy_version 15202 (0.00774) [2022-07-09 00:50:00,891][26022] Updated weights on worker 0-0, policy_version 15212 (0.00080) [2022-07-09 00:50:01,089][25689] Fps is (10 sec: 5441.2, 60 sec: 5455.1, 300 sec: 5470.1). Total num frames: 15577088. Throughput: 0: 5708.2. Samples: 15579870. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:50:01,090][25689] Avg episode reward: [(0, '-66.977')] [2022-07-09 00:50:03,083][26022] Updated weights on worker 0-0, policy_version 15222 (0.00491) [2022-07-09 00:50:05,143][26022] Updated weights on worker 0-0, policy_version 15232 (0.00103) [2022-07-09 00:50:06,204][25689] Fps is (10 sec: 5314.0, 60 sec: 5490.4, 300 sec: 5465.6). Total num frames: 15603712. Throughput: 0: 5599.0. Samples: 15610714. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:50:06,204][25689] Avg episode reward: [(0, '-67.000')] [2022-07-09 00:50:06,911][26022] Updated weights on worker 0-0, policy_version 15242 (0.00084) [2022-07-09 00:50:08,814][26022] Updated weights on worker 0-0, policy_version 15252 (0.00089) [2022-07-09 00:50:10,821][26022] Updated weights on worker 0-0, policy_version 15262 (0.00084) [2022-07-09 00:50:11,219][25689] Fps is (10 sec: 5155.6, 60 sec: 5446.7, 300 sec: 5458.5). Total num frames: 15629312. Throughput: 0: 4762.4. Samples: 15626922. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:50:11,219][25689] Avg episode reward: [(0, '-67.744')] [2022-07-09 00:50:12,671][26022] Updated weights on worker 0-0, policy_version 15272 (0.00082) [2022-07-09 00:50:13,549][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:50:13,560][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000015277_15643648.pth [2022-07-09 00:50:13,561][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000013354_13674496.pth [2022-07-09 00:50:14,390][26022] Updated weights on worker 0-0, policy_version 15282 (0.00082) [2022-07-09 00:50:16,300][25689] Fps is (10 sec: 5375.5, 60 sec: 5460.9, 300 sec: 5461.2). Total num frames: 15657984. Throughput: 0: 5561.6. Samples: 15659882. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:50:16,302][25689] Avg episode reward: [(0, '-67.546')] [2022-07-09 00:50:16,319][26022] Updated weights on worker 0-0, policy_version 15292 (0.00087) [2022-07-09 00:50:18,154][26022] Updated weights on worker 0-0, policy_version 15302 (0.00091) [2022-07-09 00:50:19,945][26022] Updated weights on worker 0-0, policy_version 15312 (0.00090) [2022-07-09 00:50:21,382][25689] Fps is (10 sec: 5642.2, 60 sec: 5474.0, 300 sec: 5461.0). Total num frames: 15686656. Throughput: 0: 5593.0. Samples: 15693368. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 00:50:21,384][25689] Avg episode reward: [(0, '-67.260')] [2022-07-09 00:50:22,004][26022] Updated weights on worker 0-0, policy_version 15322 (0.00086) [2022-07-09 00:50:23,511][26022] Updated weights on worker 0-0, policy_version 15332 (0.00081) [2022-07-09 00:50:25,739][26022] Updated weights on worker 0-0, policy_version 15342 (0.00087) [2022-07-09 00:50:26,490][25689] Fps is (10 sec: 5627.5, 60 sec: 5472.2, 300 sec: 5466.1). Total num frames: 15715328. Throughput: 0: 4899.0. Samples: 15710096. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 00:50:26,491][25689] Avg episode reward: [(0, '-67.661')] [2022-07-09 00:50:27,419][26022] Updated weights on worker 0-0, policy_version 15352 (0.00089) [2022-07-09 00:50:29,295][26022] Updated weights on worker 0-0, policy_version 15362 (0.00089) [2022-07-09 00:50:31,096][26022] Updated weights on worker 0-0, policy_version 15372 (0.00105) [2022-07-09 00:50:31,500][25689] Fps is (10 sec: 5465.4, 60 sec: 5461.4, 300 sec: 5462.9). Total num frames: 15741952. Throughput: 0: 5744.0. Samples: 15743416. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 00:50:31,500][25689] Avg episode reward: [(0, '-67.581')] [2022-07-09 00:50:32,775][26022] Updated weights on worker 0-0, policy_version 15382 (0.00086) [2022-07-09 00:50:34,958][26022] Updated weights on worker 0-0, policy_version 15392 (0.00083) [2022-07-09 00:50:36,502][25689] Fps is (10 sec: 5523.1, 60 sec: 5462.9, 300 sec: 5466.4). Total num frames: 15770624. Throughput: 0: 5786.2. Samples: 15776774. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 00:50:36,502][25689] Avg episode reward: [(0, '-66.660')] [2022-07-09 00:50:36,507][26022] Updated weights on worker 0-0, policy_version 15402 (0.00092) [2022-07-09 00:50:38,463][26022] Updated weights on worker 0-0, policy_version 15412 (0.00087) [2022-07-09 00:50:40,331][26022] Updated weights on worker 0-0, policy_version 15422 (0.00085) [2022-07-09 00:50:41,519][25689] Fps is (10 sec: 5519.0, 60 sec: 5462.6, 300 sec: 5460.0). Total num frames: 15797248. Throughput: 0: 4971.8. Samples: 15793484. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 00:50:41,519][25689] Avg episode reward: [(0, '-66.562')] [2022-07-09 00:50:42,008][26022] Updated weights on worker 0-0, policy_version 15432 (0.00089) [2022-07-09 00:50:44,086][26022] Updated weights on worker 0-0, policy_version 15442 (0.00084) [2022-07-09 00:50:45,915][26022] Updated weights on worker 0-0, policy_version 15452 (0.00082) [2022-07-09 00:50:46,560][25689] Fps is (10 sec: 5395.6, 60 sec: 5452.0, 300 sec: 5464.3). Total num frames: 15824896. Throughput: 0: 5783.4. Samples: 15826170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:50:46,561][25689] Avg episode reward: [(0, '-66.220')] [2022-07-09 00:50:47,835][26022] Updated weights on worker 0-0, policy_version 15462 (0.00084) [2022-07-09 00:50:49,563][26022] Updated weights on worker 0-0, policy_version 15472 (0.00092) [2022-07-09 00:50:51,528][26022] Updated weights on worker 0-0, policy_version 15482 (0.00088) [2022-07-09 00:50:51,580][25689] Fps is (10 sec: 5597.7, 60 sec: 5470.0, 300 sec: 5467.8). Total num frames: 15853568. Throughput: 0: 5775.1. Samples: 15859382. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:50:51,581][25689] Avg episode reward: [(0, '-66.083')] [2022-07-09 00:50:53,479][26022] Updated weights on worker 0-0, policy_version 15492 (0.00081) [2022-07-09 00:50:55,098][26022] Updated weights on worker 0-0, policy_version 15502 (0.00092) [2022-07-09 00:50:56,610][25689] Fps is (10 sec: 5604.2, 60 sec: 5473.1, 300 sec: 5467.9). Total num frames: 15881216. Throughput: 0: 4926.2. Samples: 15875832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:50:56,611][25689] Avg episode reward: [(0, '-65.713')] [2022-07-09 00:50:57,191][26022] Updated weights on worker 0-0, policy_version 15512 (0.00086) [2022-07-09 00:50:59,046][26022] Updated weights on worker 0-0, policy_version 15522 (0.00102) [2022-07-09 00:51:00,617][26022] Updated weights on worker 0-0, policy_version 15532 (0.00087) [2022-07-09 00:51:01,654][25689] Fps is (10 sec: 5387.3, 60 sec: 5461.1, 300 sec: 5468.4). Total num frames: 15907840. Throughput: 0: 5731.7. Samples: 15908894. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:51:01,655][25689] Avg episode reward: [(0, '-65.715')] [2022-07-09 00:51:03,340][26022] Updated weights on worker 0-0, policy_version 15542 (0.00094) [2022-07-09 00:51:04,806][26022] Updated weights on worker 0-0, policy_version 15552 (0.00093) [2022-07-09 00:51:06,797][25689] Fps is (10 sec: 5227.0, 60 sec: 5458.6, 300 sec: 5466.7). Total num frames: 15934464. Throughput: 0: 5627.4. Samples: 15940050. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 00:51:06,798][25689] Avg episode reward: [(0, '-66.251')] [2022-07-09 00:51:07,050][26022] Updated weights on worker 0-0, policy_version 15562 (0.00112) [2022-07-09 00:51:08,675][26022] Updated weights on worker 0-0, policy_version 15572 (0.00091) [2022-07-09 00:51:10,727][26022] Updated weights on worker 0-0, policy_version 15582 (0.00086) [2022-07-09 00:51:11,829][25689] Fps is (10 sec: 5434.8, 60 sec: 5507.7, 300 sec: 5464.4). Total num frames: 15963136. Throughput: 0: 4796.8. Samples: 15956510. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 00:51:11,829][25689] Avg episode reward: [(0, '-66.465')] [2022-07-09 00:51:12,434][26022] Updated weights on worker 0-0, policy_version 15592 (0.00092) [2022-07-09 00:51:14,425][26022] Updated weights on worker 0-0, policy_version 15602 (0.00090) [2022-07-09 00:51:16,157][26022] Updated weights on worker 0-0, policy_version 15612 (0.00090) [2022-07-09 00:51:16,848][25689] Fps is (10 sec: 5501.7, 60 sec: 5479.6, 300 sec: 5464.2). Total num frames: 15989760. Throughput: 0: 5586.3. Samples: 15988886. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 00:51:16,848][25689] Avg episode reward: [(0, '-67.121')] [2022-07-09 00:51:18,100][26022] Updated weights on worker 0-0, policy_version 15622 (0.00094) [2022-07-09 00:51:19,842][26022] Updated weights on worker 0-0, policy_version 15632 (0.00079) [2022-07-09 00:51:21,851][25689] Fps is (10 sec: 5313.0, 60 sec: 5452.9, 300 sec: 5458.2). Total num frames: 16016384. Throughput: 0: 5622.5. Samples: 16022450. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 00:51:21,851][25689] Avg episode reward: [(0, '-66.424')] [2022-07-09 00:51:21,951][26022] Updated weights on worker 0-0, policy_version 15642 (0.00086) [2022-07-09 00:51:23,427][26022] Updated weights on worker 0-0, policy_version 15652 (0.00316) [2022-07-09 00:51:25,568][26022] Updated weights on worker 0-0, policy_version 15662 (0.00085) [2022-07-09 00:51:26,955][25689] Fps is (10 sec: 5572.3, 60 sec: 5470.1, 300 sec: 5463.4). Total num frames: 16046080. Throughput: 0: 5719.4. Samples: 16055340. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 00:51:26,956][25689] Avg episode reward: [(0, '-66.769')] [2022-07-09 00:51:27,319][26022] Updated weights on worker 0-0, policy_version 15672 (0.00093) [2022-07-09 00:51:29,281][26022] Updated weights on worker 0-0, policy_version 15682 (0.00090) [2022-07-09 00:51:31,175][26022] Updated weights on worker 0-0, policy_version 15692 (0.00082) [2022-07-09 00:51:31,996][25689] Fps is (10 sec: 5551.2, 60 sec: 5467.3, 300 sec: 5456.2). Total num frames: 16072704. Throughput: 0: 5710.8. Samples: 16071684. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:51:31,997][25689] Avg episode reward: [(0, '-66.155')] [2022-07-09 00:51:32,904][26022] Updated weights on worker 0-0, policy_version 15702 (0.00088) [2022-07-09 00:51:34,985][26022] Updated weights on worker 0-0, policy_version 15712 (0.00095) [2022-07-09 00:51:36,639][26022] Updated weights on worker 0-0, policy_version 15722 (0.00083) [2022-07-09 00:51:37,060][25689] Fps is (10 sec: 5472.3, 60 sec: 5461.7, 300 sec: 5465.7). Total num frames: 16101376. Throughput: 0: 5739.8. Samples: 16104898. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:51:37,060][25689] Avg episode reward: [(0, '-66.241')] [2022-07-09 00:51:38,603][26022] Updated weights on worker 0-0, policy_version 15732 (0.00091) [2022-07-09 00:51:40,256][26022] Updated weights on worker 0-0, policy_version 15742 (0.00090) [2022-07-09 00:51:42,064][25689] Fps is (10 sec: 5594.4, 60 sec: 5479.8, 300 sec: 5463.6). Total num frames: 16129024. Throughput: 0: 5713.4. Samples: 16137934. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:51:42,064][25689] Avg episode reward: [(0, '-65.508')] [2022-07-09 00:51:42,183][26022] Updated weights on worker 0-0, policy_version 15752 (0.00081) [2022-07-09 00:51:44,052][26022] Updated weights on worker 0-0, policy_version 15762 (0.00102) [2022-07-09 00:51:46,021][26022] Updated weights on worker 0-0, policy_version 15772 (0.00092) [2022-07-09 00:51:47,157][25689] Fps is (10 sec: 5476.5, 60 sec: 5475.2, 300 sec: 5462.4). Total num frames: 16156672. Throughput: 0: 4912.6. Samples: 16154582. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:51:47,157][25689] Avg episode reward: [(0, '-65.116')] [2022-07-09 00:51:47,799][26022] Updated weights on worker 0-0, policy_version 15782 (0.00086) [2022-07-09 00:51:49,598][26022] Updated weights on worker 0-0, policy_version 15792 (0.00089) [2022-07-09 00:51:51,533][26022] Updated weights on worker 0-0, policy_version 15802 (0.00088) [2022-07-09 00:51:52,198][25689] Fps is (10 sec: 5456.1, 60 sec: 5456.3, 300 sec: 5466.3). Total num frames: 16184320. Throughput: 0: 5753.3. Samples: 16187912. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 00:51:52,202][25689] Avg episode reward: [(0, '-65.450')] [2022-07-09 00:51:53,256][26022] Updated weights on worker 0-0, policy_version 15812 (0.00085) [2022-07-09 00:51:55,297][26022] Updated weights on worker 0-0, policy_version 15822 (0.00087) [2022-07-09 00:51:56,811][26022] Updated weights on worker 0-0, policy_version 15832 (0.00088) [2022-07-09 00:51:57,231][25689] Fps is (10 sec: 5590.6, 60 sec: 5473.0, 300 sec: 5462.7). Total num frames: 16212992. Throughput: 0: 5757.5. Samples: 16221032. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:51:57,231][25689] Avg episode reward: [(0, '-65.857')] [2022-07-09 00:51:58,998][26022] Updated weights on worker 0-0, policy_version 15842 (0.00090) [2022-07-09 00:52:00,821][26022] Updated weights on worker 0-0, policy_version 15852 (0.00088) [2022-07-09 00:52:02,235][25689] Fps is (10 sec: 5305.5, 60 sec: 5442.8, 300 sec: 5460.8). Total num frames: 16237568. Throughput: 0: 4935.8. Samples: 16237494. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:52:02,235][25689] Avg episode reward: [(0, '-66.549')] [2022-07-09 00:52:02,982][26022] Updated weights on worker 0-0, policy_version 15862 (0.00088) [2022-07-09 00:52:05,151][26022] Updated weights on worker 0-0, policy_version 15872 (0.00083) [2022-07-09 00:52:06,670][26022] Updated weights on worker 0-0, policy_version 15882 (0.00085) [2022-07-09 00:52:07,301][25689] Fps is (10 sec: 5185.9, 60 sec: 5466.6, 300 sec: 5457.5). Total num frames: 16265216. Throughput: 0: 5660.2. Samples: 16268602. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:52:07,301][25689] Avg episode reward: [(0, '-66.644')] [2022-07-09 00:52:08,600][26022] Updated weights on worker 0-0, policy_version 15892 (0.00087) [2022-07-09 00:52:10,499][26022] Updated weights on worker 0-0, policy_version 15902 (0.00086) [2022-07-09 00:52:12,231][26022] Updated weights on worker 0-0, policy_version 15912 (0.00087) [2022-07-09 00:52:12,310][25689] Fps is (10 sec: 5590.1, 60 sec: 5468.7, 300 sec: 5468.7). Total num frames: 16293888. Throughput: 0: 5653.9. Samples: 16301620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 00:52:12,310][25689] Avg episode reward: [(0, '-67.457')] [2022-07-09 00:52:13,753][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:52:13,766][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000015919_16301056.pth [2022-07-09 00:52:13,767][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000013995_14330880.pth [2022-07-09 00:52:14,313][26022] Updated weights on worker 0-0, policy_version 15922 (0.00086) [2022-07-09 00:52:15,983][26022] Updated weights on worker 0-0, policy_version 15932 (0.00098) [2022-07-09 00:52:17,339][25689] Fps is (10 sec: 5508.7, 60 sec: 5467.8, 300 sec: 5457.9). Total num frames: 16320512. Throughput: 0: 4819.1. Samples: 16317936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:52:17,339][25689] Avg episode reward: [(0, '-68.324')] [2022-07-09 00:52:17,737][26022] Updated weights on worker 0-0, policy_version 15942 (0.00084) [2022-07-09 00:52:19,883][26022] Updated weights on worker 0-0, policy_version 15952 (0.00087) [2022-07-09 00:52:21,551][26022] Updated weights on worker 0-0, policy_version 15962 (0.00986) [2022-07-09 00:52:22,367][25689] Fps is (10 sec: 5294.3, 60 sec: 5465.5, 300 sec: 5458.6). Total num frames: 16347136. Throughput: 0: 5644.0. Samples: 16351122. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:52:22,369][25689] Avg episode reward: [(0, '-68.564')] [2022-07-09 00:52:23,517][26022] Updated weights on worker 0-0, policy_version 15972 (0.00095) [2022-07-09 00:52:25,303][26022] Updated weights on worker 0-0, policy_version 15982 (0.00087) [2022-07-09 00:52:27,186][26022] Updated weights on worker 0-0, policy_version 15992 (0.00098) [2022-07-09 00:52:27,421][25689] Fps is (10 sec: 5789.4, 60 sec: 5504.0, 300 sec: 5468.2). Total num frames: 16378880. Throughput: 0: 5756.3. Samples: 16384418. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:52:27,421][25689] Avg episode reward: [(0, '-68.221')] [2022-07-09 00:52:29,151][26022] Updated weights on worker 0-0, policy_version 16002 (0.00617) [2022-07-09 00:52:30,858][26022] Updated weights on worker 0-0, policy_version 16012 (0.00084) [2022-07-09 00:52:32,425][25689] Fps is (10 sec: 5599.7, 60 sec: 5473.4, 300 sec: 5461.7). Total num frames: 16403456. Throughput: 0: 4937.9. Samples: 16400948. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:52:32,425][25689] Avg episode reward: [(0, '-67.789')] [2022-07-09 00:52:33,080][26022] Updated weights on worker 0-0, policy_version 16022 (0.00084) [2022-07-09 00:52:34,495][26022] Updated weights on worker 0-0, policy_version 16032 (0.00089) [2022-07-09 00:52:36,481][26022] Updated weights on worker 0-0, policy_version 16042 (0.00087) [2022-07-09 00:52:37,465][25689] Fps is (10 sec: 5199.2, 60 sec: 5458.6, 300 sec: 5464.6). Total num frames: 16431104. Throughput: 0: 5780.2. Samples: 16434272. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 00:52:37,466][25689] Avg episode reward: [(0, '-67.189')] [2022-07-09 00:52:38,138][26022] Updated weights on worker 0-0, policy_version 16052 (0.00085) [2022-07-09 00:52:40,286][26022] Updated weights on worker 0-0, policy_version 16062 (0.00094) [2022-07-09 00:52:42,038][26022] Updated weights on worker 0-0, policy_version 16072 (0.00087) [2022-07-09 00:52:42,499][25689] Fps is (10 sec: 5590.8, 60 sec: 5472.8, 300 sec: 5469.8). Total num frames: 16459776. Throughput: 0: 5770.1. Samples: 16467282. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:52:42,499][25689] Avg episode reward: [(0, '-67.773')] [2022-07-09 00:52:43,906][26022] Updated weights on worker 0-0, policy_version 16082 (0.00623) [2022-07-09 00:52:45,787][26022] Updated weights on worker 0-0, policy_version 16092 (0.00095) [2022-07-09 00:52:47,604][25689] Fps is (10 sec: 5454.0, 60 sec: 5454.8, 300 sec: 5461.8). Total num frames: 16486400. Throughput: 0: 4904.6. Samples: 16483410. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:52:47,605][25689] Avg episode reward: [(0, '-67.248')] [2022-07-09 00:52:47,916][26022] Updated weights on worker 0-0, policy_version 16102 (0.00092) [2022-07-09 00:52:49,431][26022] Updated weights on worker 0-0, policy_version 16112 (0.00091) [2022-07-09 00:52:51,520][26022] Updated weights on worker 0-0, policy_version 16122 (0.00091) [2022-07-09 00:52:52,635][25689] Fps is (10 sec: 5556.3, 60 sec: 5489.7, 300 sec: 5472.6). Total num frames: 16516096. Throughput: 0: 5722.0. Samples: 16516588. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:52:52,635][25689] Avg episode reward: [(0, '-67.556')] [2022-07-09 00:52:53,370][26022] Updated weights on worker 0-0, policy_version 16132 (0.00091) [2022-07-09 00:52:55,103][26022] Updated weights on worker 0-0, policy_version 16142 (0.00082) [2022-07-09 00:52:57,179][26022] Updated weights on worker 0-0, policy_version 16152 (0.00092) [2022-07-09 00:52:57,662][25689] Fps is (10 sec: 5599.7, 60 sec: 5456.3, 300 sec: 5465.4). Total num frames: 16542720. Throughput: 0: 5717.9. Samples: 16549752. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:52:57,662][25689] Avg episode reward: [(0, '-67.983')] [2022-07-09 00:52:58,881][26022] Updated weights on worker 0-0, policy_version 16162 (0.00082) [2022-07-09 00:53:00,842][26022] Updated weights on worker 0-0, policy_version 16172 (0.00087) [2022-07-09 00:53:02,715][25689] Fps is (10 sec: 5282.3, 60 sec: 5485.7, 300 sec: 5470.0). Total num frames: 16569344. Throughput: 0: 4898.3. Samples: 16566310. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:53:02,716][25689] Avg episode reward: [(0, '-67.712')] [2022-07-09 00:53:02,909][26022] Updated weights on worker 0-0, policy_version 16182 (0.00094) [2022-07-09 00:53:04,848][26022] Updated weights on worker 0-0, policy_version 16192 (0.00086) [2022-07-09 00:53:06,555][26022] Updated weights on worker 0-0, policy_version 16202 (0.00089) [2022-07-09 00:53:07,789][25689] Fps is (10 sec: 5257.8, 60 sec: 5468.0, 300 sec: 5469.0). Total num frames: 16595968. Throughput: 0: 5653.3. Samples: 16597522. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 00:53:07,790][25689] Avg episode reward: [(0, '-67.465')] [2022-07-09 00:53:08,559][26022] Updated weights on worker 0-0, policy_version 16212 (0.00094) [2022-07-09 00:53:10,391][26022] Updated weights on worker 0-0, policy_version 16222 (0.00089) [2022-07-09 00:53:12,340][26022] Updated weights on worker 0-0, policy_version 16232 (0.00095) [2022-07-09 00:53:12,841][25689] Fps is (10 sec: 5359.8, 60 sec: 5447.2, 300 sec: 5465.3). Total num frames: 16623616. Throughput: 0: 5648.0. Samples: 16630712. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 00:53:12,842][25689] Avg episode reward: [(0, '-66.734')] [2022-07-09 00:53:14,284][26022] Updated weights on worker 0-0, policy_version 16242 (0.00089) [2022-07-09 00:53:15,939][26022] Updated weights on worker 0-0, policy_version 16252 (0.00087) [2022-07-09 00:53:17,876][25689] Fps is (10 sec: 5481.9, 60 sec: 5463.6, 300 sec: 5465.8). Total num frames: 16651264. Throughput: 0: 4809.2. Samples: 16646968. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 00:53:17,877][25689] Avg episode reward: [(0, '-67.163')] [2022-07-09 00:53:17,975][26022] Updated weights on worker 0-0, policy_version 16262 (0.00105) [2022-07-09 00:53:19,614][26022] Updated weights on worker 0-0, policy_version 16272 (0.00092) [2022-07-09 00:53:21,424][26022] Updated weights on worker 0-0, policy_version 16282 (0.00087) [2022-07-09 00:53:22,907][25689] Fps is (10 sec: 5697.0, 60 sec: 5514.2, 300 sec: 5470.9). Total num frames: 16680960. Throughput: 0: 5644.6. Samples: 16680282. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 00:53:22,907][25689] Avg episode reward: [(0, '-66.606')] [2022-07-09 00:53:23,435][26022] Updated weights on worker 0-0, policy_version 16292 (0.00568) [2022-07-09 00:53:25,161][26022] Updated weights on worker 0-0, policy_version 16302 (0.00088) [2022-07-09 00:53:27,224][26022] Updated weights on worker 0-0, policy_version 16312 (0.00089) [2022-07-09 00:53:27,971][25689] Fps is (10 sec: 5579.2, 60 sec: 5428.6, 300 sec: 5466.5). Total num frames: 16707584. Throughput: 0: 5765.7. Samples: 16713884. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 00:53:27,971][25689] Avg episode reward: [(0, '-66.943')] [2022-07-09 00:53:28,950][26022] Updated weights on worker 0-0, policy_version 16322 (0.00085) [2022-07-09 00:53:30,751][26022] Updated weights on worker 0-0, policy_version 16332 (0.00092) [2022-07-09 00:53:32,507][26022] Updated weights on worker 0-0, policy_version 16342 (0.00086) [2022-07-09 00:53:32,989][25689] Fps is (10 sec: 5586.1, 60 sec: 5511.9, 300 sec: 5473.4). Total num frames: 16737280. Throughput: 0: 4953.8. Samples: 16730520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:53:32,989][25689] Avg episode reward: [(0, '-67.192')] [2022-07-09 00:53:34,552][26022] Updated weights on worker 0-0, policy_version 16352 (0.00090) [2022-07-09 00:53:36,242][26022] Updated weights on worker 0-0, policy_version 16362 (0.00092) [2022-07-09 00:53:38,005][25689] Fps is (10 sec: 5510.9, 60 sec: 5480.3, 300 sec: 5466.8). Total num frames: 16762880. Throughput: 0: 5808.4. Samples: 16763884. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:53:38,006][25689] Avg episode reward: [(0, '-68.042')] [2022-07-09 00:53:38,182][26022] Updated weights on worker 0-0, policy_version 16372 (0.00095) [2022-07-09 00:53:39,812][26022] Updated weights on worker 0-0, policy_version 16382 (0.00100) [2022-07-09 00:53:41,992][26022] Updated weights on worker 0-0, policy_version 16392 (0.00091) [2022-07-09 00:53:43,010][25689] Fps is (10 sec: 5415.5, 60 sec: 5482.8, 300 sec: 5471.3). Total num frames: 16791552. Throughput: 0: 5816.5. Samples: 16797216. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:53:43,012][25689] Avg episode reward: [(0, '-68.465')] [2022-07-09 00:53:43,486][26022] Updated weights on worker 0-0, policy_version 16402 (0.00083) [2022-07-09 00:53:45,820][26022] Updated weights on worker 0-0, policy_version 16412 (0.00088) [2022-07-09 00:53:47,189][26022] Updated weights on worker 0-0, policy_version 16422 (0.00089) [2022-07-09 00:53:48,067][25689] Fps is (10 sec: 5597.5, 60 sec: 5504.2, 300 sec: 5474.0). Total num frames: 16819200. Throughput: 0: 4952.4. Samples: 16813406. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 00:53:48,067][25689] Avg episode reward: [(0, '-68.962')] [2022-07-09 00:53:49,463][26022] Updated weights on worker 0-0, policy_version 16432 (0.00466) [2022-07-09 00:53:50,891][26022] Updated weights on worker 0-0, policy_version 16442 (0.00096) [2022-07-09 00:53:53,101][26022] Updated weights on worker 0-0, policy_version 16452 (0.00081) [2022-07-09 00:53:53,102][25689] Fps is (10 sec: 5377.9, 60 sec: 5453.0, 300 sec: 5466.6). Total num frames: 16845824. Throughput: 0: 5757.2. Samples: 16846316. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:53:53,103][25689] Avg episode reward: [(0, '-69.484')] [2022-07-09 00:53:54,931][26022] Updated weights on worker 0-0, policy_version 16462 (0.00089) [2022-07-09 00:53:56,701][26022] Updated weights on worker 0-0, policy_version 16472 (0.00088) [2022-07-09 00:53:58,116][25689] Fps is (10 sec: 5400.5, 60 sec: 5471.1, 300 sec: 5466.4). Total num frames: 16873472. Throughput: 0: 5760.3. Samples: 16879730. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:53:58,116][25689] Avg episode reward: [(0, '-70.086')] [2022-07-09 00:53:58,601][26022] Updated weights on worker 0-0, policy_version 16482 (0.00084) [2022-07-09 00:54:00,395][26022] Updated weights on worker 0-0, policy_version 16492 (0.00088) [2022-07-09 00:54:02,487][26022] Updated weights on worker 0-0, policy_version 16502 (0.00101) [2022-07-09 00:54:03,118][25689] Fps is (10 sec: 5418.4, 60 sec: 5475.7, 300 sec: 5475.7). Total num frames: 16900096. Throughput: 0: 4938.6. Samples: 16896520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:54:03,119][25689] Avg episode reward: [(0, '-70.014')] [2022-07-09 00:54:04,629][26022] Updated weights on worker 0-0, policy_version 16512 (0.00092) [2022-07-09 00:54:06,337][26022] Updated weights on worker 0-0, policy_version 16522 (0.00083) [2022-07-09 00:54:08,162][26022] Updated weights on worker 0-0, policy_version 16532 (0.00087) [2022-07-09 00:54:08,191][25689] Fps is (10 sec: 5488.6, 60 sec: 5509.8, 300 sec: 5476.1). Total num frames: 16928768. Throughput: 0: 5678.7. Samples: 16927686. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:54:08,191][25689] Avg episode reward: [(0, '-69.899')] [2022-07-09 00:54:10,222][26022] Updated weights on worker 0-0, policy_version 16542 (0.00094) [2022-07-09 00:54:11,938][26022] Updated weights on worker 0-0, policy_version 16552 (0.00085) [2022-07-09 00:54:13,231][25689] Fps is (10 sec: 5468.2, 60 sec: 5493.9, 300 sec: 5472.8). Total num frames: 16955392. Throughput: 0: 5695.9. Samples: 16960966. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 00:54:13,231][25689] Avg episode reward: [(0, '-69.830')] [2022-07-09 00:54:13,883][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:54:13,894][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000016562_16959488.pth [2022-07-09 00:54:13,895][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000014636_14987264.pth [2022-07-09 00:54:13,906][26022] Updated weights on worker 0-0, policy_version 16562 (0.00095) [2022-07-09 00:54:15,617][26022] Updated weights on worker 0-0, policy_version 16572 (0.00091) [2022-07-09 00:54:17,394][26022] Updated weights on worker 0-0, policy_version 16582 (0.00088) [2022-07-09 00:54:18,234][25689] Fps is (10 sec: 5506.0, 60 sec: 5513.8, 300 sec: 5477.0). Total num frames: 16984064. Throughput: 0: 4852.0. Samples: 16977340. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:54:18,234][25689] Avg episode reward: [(0, '-69.405')] [2022-07-09 00:54:19,442][26022] Updated weights on worker 0-0, policy_version 16592 (0.00090) [2022-07-09 00:54:21,006][26022] Updated weights on worker 0-0, policy_version 16602 (0.00092) [2022-07-09 00:54:23,092][26022] Updated weights on worker 0-0, policy_version 16612 (0.00088) [2022-07-09 00:54:23,258][25689] Fps is (10 sec: 5616.5, 60 sec: 5480.4, 300 sec: 5474.7). Total num frames: 17011712. Throughput: 0: 5670.8. Samples: 17010730. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:54:23,259][25689] Avg episode reward: [(0, '-69.428')] [2022-07-09 00:54:25,137][26022] Updated weights on worker 0-0, policy_version 16622 (0.00084) [2022-07-09 00:54:26,645][26022] Updated weights on worker 0-0, policy_version 16632 (0.00090) [2022-07-09 00:54:28,383][25689] Fps is (10 sec: 5448.4, 60 sec: 5491.9, 300 sec: 5473.8). Total num frames: 17039360. Throughput: 0: 5759.2. Samples: 17043974. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:54:28,385][25689] Avg episode reward: [(0, '-68.893')] [2022-07-09 00:54:28,788][26022] Updated weights on worker 0-0, policy_version 16642 (0.00076) [2022-07-09 00:54:30,499][26022] Updated weights on worker 0-0, policy_version 16652 (0.00083) [2022-07-09 00:54:32,239][26022] Updated weights on worker 0-0, policy_version 16662 (0.00091) [2022-07-09 00:54:33,402][25689] Fps is (10 sec: 5451.1, 60 sec: 5457.9, 300 sec: 5470.4). Total num frames: 17067008. Throughput: 0: 4949.3. Samples: 17060802. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:54:33,404][25689] Avg episode reward: [(0, '-68.296')] [2022-07-09 00:54:34,160][26022] Updated weights on worker 0-0, policy_version 16672 (0.00091) [2022-07-09 00:54:35,898][26022] Updated weights on worker 0-0, policy_version 16682 (0.00085) [2022-07-09 00:54:37,857][26022] Updated weights on worker 0-0, policy_version 16692 (0.00088) [2022-07-09 00:54:38,420][25689] Fps is (10 sec: 5611.0, 60 sec: 5508.6, 300 sec: 5477.2). Total num frames: 17095680. Throughput: 0: 5777.7. Samples: 17093972. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 00:54:38,422][25689] Avg episode reward: [(0, '-69.189')] [2022-07-09 00:54:39,798][26022] Updated weights on worker 0-0, policy_version 16702 (0.00108) [2022-07-09 00:54:41,322][26022] Updated weights on worker 0-0, policy_version 16712 (0.00091) [2022-07-09 00:54:43,427][25689] Fps is (10 sec: 5516.0, 60 sec: 5474.5, 300 sec: 5472.2). Total num frames: 17122304. Throughput: 0: 5772.0. Samples: 17127144. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-09 00:54:43,429][25689] Avg episode reward: [(0, '-68.670')] [2022-07-09 00:54:43,474][26022] Updated weights on worker 0-0, policy_version 16722 (0.00087) [2022-07-09 00:54:45,257][26022] Updated weights on worker 0-0, policy_version 16732 (0.00098) [2022-07-09 00:54:47,050][26022] Updated weights on worker 0-0, policy_version 16742 (0.00089) [2022-07-09 00:54:48,558][25689] Fps is (10 sec: 5454.1, 60 sec: 5484.6, 300 sec: 5473.8). Total num frames: 17150976. Throughput: 0: 5765.3. Samples: 17160294. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-09 00:54:48,559][25689] Avg episode reward: [(0, '-68.963')] [2022-07-09 00:54:49,033][26022] Updated weights on worker 0-0, policy_version 16752 (0.00092) [2022-07-09 00:54:50,846][26022] Updated weights on worker 0-0, policy_version 16762 (0.00089) [2022-07-09 00:54:52,764][26022] Updated weights on worker 0-0, policy_version 16772 (0.00087) [2022-07-09 00:54:53,605][25689] Fps is (10 sec: 5634.4, 60 sec: 5517.5, 300 sec: 5477.6). Total num frames: 17179648. Throughput: 0: 5749.8. Samples: 17176960. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-09 00:54:53,605][25689] Avg episode reward: [(0, '-68.613')] [2022-07-09 00:54:54,603][26022] Updated weights on worker 0-0, policy_version 16782 (0.00082) [2022-07-09 00:54:56,335][26022] Updated weights on worker 0-0, policy_version 16792 (0.00096) [2022-07-09 00:54:58,407][26022] Updated weights on worker 0-0, policy_version 16802 (0.00097) [2022-07-09 00:54:58,645][25689] Fps is (10 sec: 5482.2, 60 sec: 5498.2, 300 sec: 5475.2). Total num frames: 17206272. Throughput: 0: 5749.9. Samples: 17210262. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-09 00:54:58,645][25689] Avg episode reward: [(0, '-69.243')] [2022-07-09 00:55:00,027][26022] Updated weights on worker 0-0, policy_version 16812 (0.00052) [2022-07-09 00:55:02,379][26022] Updated weights on worker 0-0, policy_version 16822 (0.00093) [2022-07-09 00:55:03,648][25689] Fps is (10 sec: 5199.7, 60 sec: 5481.2, 300 sec: 5473.8). Total num frames: 17231872. Throughput: 0: 5637.5. Samples: 17241140. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-09 00:55:03,649][25689] Avg episode reward: [(0, '-69.856')] [2022-07-09 00:55:04,122][26022] Updated weights on worker 0-0, policy_version 16832 (0.00089) [2022-07-09 00:55:06,063][26022] Updated weights on worker 0-0, policy_version 16842 (0.00087) [2022-07-09 00:55:08,000][26022] Updated weights on worker 0-0, policy_version 16852 (0.00091) [2022-07-09 00:55:08,754][25689] Fps is (10 sec: 5267.3, 60 sec: 5461.3, 300 sec: 5479.0). Total num frames: 17259520. Throughput: 0: 4825.2. Samples: 17257742. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:55:08,755][25689] Avg episode reward: [(0, '-69.331')] [2022-07-09 00:55:09,798][26022] Updated weights on worker 0-0, policy_version 16862 (0.00090) [2022-07-09 00:55:11,655][26022] Updated weights on worker 0-0, policy_version 16872 (0.00086) [2022-07-09 00:55:13,279][26022] Updated weights on worker 0-0, policy_version 16882 (0.00087) [2022-07-09 00:55:13,759][25689] Fps is (10 sec: 5570.3, 60 sec: 5498.3, 300 sec: 5480.4). Total num frames: 17288192. Throughput: 0: 5659.8. Samples: 17291030. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:55:13,759][25689] Avg episode reward: [(0, '-69.067')] [2022-07-09 00:55:15,522][26022] Updated weights on worker 0-0, policy_version 16892 (0.00087) [2022-07-09 00:55:17,092][26022] Updated weights on worker 0-0, policy_version 16902 (0.00092) [2022-07-09 00:55:18,782][25689] Fps is (10 sec: 5514.3, 60 sec: 5462.6, 300 sec: 5474.7). Total num frames: 17314816. Throughput: 0: 5654.8. Samples: 17324132. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:55:18,782][25689] Avg episode reward: [(0, '-68.971')] [2022-07-09 00:55:19,113][26022] Updated weights on worker 0-0, policy_version 16912 (0.00089) [2022-07-09 00:55:20,839][26022] Updated weights on worker 0-0, policy_version 16922 (0.00092) [2022-07-09 00:55:22,732][26022] Updated weights on worker 0-0, policy_version 16932 (0.00084) [2022-07-09 00:55:23,807][25689] Fps is (10 sec: 5605.1, 60 sec: 5496.4, 300 sec: 5479.6). Total num frames: 17344512. Throughput: 0: 4946.0. Samples: 17340846. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:55:23,807][25689] Avg episode reward: [(0, '-68.488')] [2022-07-09 00:55:24,675][26022] Updated weights on worker 0-0, policy_version 16942 (0.00095) [2022-07-09 00:55:26,543][26022] Updated weights on worker 0-0, policy_version 16952 (0.00089) [2022-07-09 00:55:28,495][26022] Updated weights on worker 0-0, policy_version 16962 (0.00082) [2022-07-09 00:55:28,902][25689] Fps is (10 sec: 5564.8, 60 sec: 5482.1, 300 sec: 5478.0). Total num frames: 17371136. Throughput: 0: 5764.2. Samples: 17373882. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 00:55:28,906][25689] Avg episode reward: [(0, '-68.302')] [2022-07-09 00:55:30,027][26022] Updated weights on worker 0-0, policy_version 16972 (0.00098) [2022-07-09 00:55:32,023][26022] Updated weights on worker 0-0, policy_version 16982 (0.00098) [2022-07-09 00:55:33,943][25689] Fps is (10 sec: 5455.1, 60 sec: 5497.1, 300 sec: 5477.3). Total num frames: 17399808. Throughput: 0: 5753.7. Samples: 17407166. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:55:33,945][25689] Avg episode reward: [(0, '-67.743')] [2022-07-09 00:55:33,947][26022] Updated weights on worker 0-0, policy_version 16992 (0.00082) [2022-07-09 00:55:35,852][26022] Updated weights on worker 0-0, policy_version 17002 (0.00091) [2022-07-09 00:55:37,546][26022] Updated weights on worker 0-0, policy_version 17012 (0.00084) [2022-07-09 00:55:38,992][25689] Fps is (10 sec: 5582.0, 60 sec: 5477.4, 300 sec: 5480.2). Total num frames: 17427456. Throughput: 0: 4939.5. Samples: 17423962. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:55:38,994][25689] Avg episode reward: [(0, '-67.964')] [2022-07-09 00:55:39,522][26022] Updated weights on worker 0-0, policy_version 17022 (0.00096) [2022-07-09 00:55:41,231][26022] Updated weights on worker 0-0, policy_version 17032 (0.00104) [2022-07-09 00:55:43,045][26022] Updated weights on worker 0-0, policy_version 17042 (0.00089) [2022-07-09 00:55:44,002][25689] Fps is (10 sec: 5599.1, 60 sec: 5510.9, 300 sec: 5484.2). Total num frames: 17456128. Throughput: 0: 5780.0. Samples: 17457574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:55:44,003][25689] Avg episode reward: [(0, '-68.288')] [2022-07-09 00:55:44,895][26022] Updated weights on worker 0-0, policy_version 17052 (0.00088) [2022-07-09 00:55:46,655][26022] Updated weights on worker 0-0, policy_version 17062 (0.00088) [2022-07-09 00:55:48,771][26022] Updated weights on worker 0-0, policy_version 17072 (0.00086) [2022-07-09 00:55:49,051][25689] Fps is (10 sec: 5497.4, 60 sec: 5484.6, 300 sec: 5476.8). Total num frames: 17482752. Throughput: 0: 5809.5. Samples: 17490932. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:55:49,051][25689] Avg episode reward: [(0, '-68.026')] [2022-07-09 00:55:50,351][26022] Updated weights on worker 0-0, policy_version 17082 (0.00106) [2022-07-09 00:55:52,479][26022] Updated weights on worker 0-0, policy_version 17092 (0.00089) [2022-07-09 00:55:53,907][26022] Updated weights on worker 0-0, policy_version 17102 (0.00089) [2022-07-09 00:55:54,063][25689] Fps is (10 sec: 5598.2, 60 sec: 5504.6, 300 sec: 5484.0). Total num frames: 17512448. Throughput: 0: 4989.5. Samples: 17507550. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:55:54,063][25689] Avg episode reward: [(0, '-67.757')] [2022-07-09 00:55:56,075][26022] Updated weights on worker 0-0, policy_version 17112 (0.00094) [2022-07-09 00:55:57,796][26022] Updated weights on worker 0-0, policy_version 17122 (0.00081) [2022-07-09 00:55:59,071][25689] Fps is (10 sec: 5620.7, 60 sec: 5507.6, 300 sec: 5484.7). Total num frames: 17539072. Throughput: 0: 5839.7. Samples: 17541216. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:55:59,072][25689] Avg episode reward: [(0, '-66.107')] [2022-07-09 00:55:59,572][26022] Updated weights on worker 0-0, policy_version 17132 (0.00089) [2022-07-09 00:56:01,301][26022] Updated weights on worker 0-0, policy_version 17142 (0.00089) [2022-07-09 00:56:03,672][26022] Updated weights on worker 0-0, policy_version 17152 (0.00091) [2022-07-09 00:56:04,080][25689] Fps is (10 sec: 5315.5, 60 sec: 5524.0, 300 sec: 5487.1). Total num frames: 17565696. Throughput: 0: 5737.4. Samples: 17572768. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:56:04,081][25689] Avg episode reward: [(0, '-65.991')] [2022-07-09 00:56:05,509][26022] Updated weights on worker 0-0, policy_version 17162 (0.00079) [2022-07-09 00:56:07,413][26022] Updated weights on worker 0-0, policy_version 17172 (0.00088) [2022-07-09 00:56:08,996][26022] Updated weights on worker 0-0, policy_version 17182 (0.00088) [2022-07-09 00:56:09,140][25689] Fps is (10 sec: 5491.6, 60 sec: 5545.1, 300 sec: 5486.6). Total num frames: 17594368. Throughput: 0: 4895.0. Samples: 17589270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:56:09,140][25689] Avg episode reward: [(0, '-65.285')] [2022-07-09 00:56:11,166][26022] Updated weights on worker 0-0, policy_version 17192 (0.00094) [2022-07-09 00:56:12,816][26022] Updated weights on worker 0-0, policy_version 17202 (0.00087) [2022-07-09 00:56:14,063][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:56:14,073][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000017208_17620992.pth [2022-07-09 00:56:14,073][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000015277_15643648.pth [2022-07-09 00:56:14,165][25689] Fps is (10 sec: 5483.2, 60 sec: 5509.4, 300 sec: 5486.5). Total num frames: 17620992. Throughput: 0: 5721.5. Samples: 17622564. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:56:14,165][25689] Avg episode reward: [(0, '-65.471')] [2022-07-09 00:56:14,795][26022] Updated weights on worker 0-0, policy_version 17212 (0.00100) [2022-07-09 00:56:16,615][26022] Updated weights on worker 0-0, policy_version 17222 (0.00087) [2022-07-09 00:56:18,461][26022] Updated weights on worker 0-0, policy_version 17232 (0.00085) [2022-07-09 00:56:19,197][25689] Fps is (10 sec: 5396.4, 60 sec: 5525.5, 300 sec: 5489.4). Total num frames: 17648640. Throughput: 0: 5673.5. Samples: 17655402. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 00:56:19,198][25689] Avg episode reward: [(0, '-65.916')] [2022-07-09 00:56:20,458][26022] Updated weights on worker 0-0, policy_version 17242 (0.00081) [2022-07-09 00:56:22,135][26022] Updated weights on worker 0-0, policy_version 17252 (0.00087) [2022-07-09 00:56:24,083][26022] Updated weights on worker 0-0, policy_version 17262 (0.00086) [2022-07-09 00:56:24,229][25689] Fps is (10 sec: 5494.4, 60 sec: 5491.0, 300 sec: 5483.9). Total num frames: 17676288. Throughput: 0: 4927.8. Samples: 17672056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:56:24,229][25689] Avg episode reward: [(0, '-66.116')] [2022-07-09 00:56:25,947][26022] Updated weights on worker 0-0, policy_version 17272 (0.00086) [2022-07-09 00:56:27,652][26022] Updated weights on worker 0-0, policy_version 17282 (0.00099) [2022-07-09 00:56:29,283][25689] Fps is (10 sec: 5583.8, 60 sec: 5528.6, 300 sec: 5490.5). Total num frames: 17704960. Throughput: 0: 5765.0. Samples: 17705396. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:56:29,285][25689] Avg episode reward: [(0, '-66.472')] [2022-07-09 00:56:29,727][26022] Updated weights on worker 0-0, policy_version 17292 (0.00098) [2022-07-09 00:56:31,337][26022] Updated weights on worker 0-0, policy_version 17302 (0.00084) [2022-07-09 00:56:33,462][26022] Updated weights on worker 0-0, policy_version 17312 (0.00095) [2022-07-09 00:56:34,291][25689] Fps is (10 sec: 5699.2, 60 sec: 5531.7, 300 sec: 5491.5). Total num frames: 17733632. Throughput: 0: 5775.9. Samples: 17738810. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:56:34,292][25689] Avg episode reward: [(0, '-66.058')] [2022-07-09 00:56:34,888][26022] Updated weights on worker 0-0, policy_version 17322 (0.00086) [2022-07-09 00:56:37,016][26022] Updated weights on worker 0-0, policy_version 17332 (0.00115) [2022-07-09 00:56:38,766][26022] Updated weights on worker 0-0, policy_version 17342 (0.00085) [2022-07-09 00:56:39,324][25689] Fps is (10 sec: 5405.2, 60 sec: 5499.2, 300 sec: 5484.1). Total num frames: 17759232. Throughput: 0: 4981.7. Samples: 17755668. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 00:56:39,326][25689] Avg episode reward: [(0, '-65.691')] [2022-07-09 00:56:40,591][26022] Updated weights on worker 0-0, policy_version 17352 (0.00088) [2022-07-09 00:56:42,472][26022] Updated weights on worker 0-0, policy_version 17362 (0.00091) [2022-07-09 00:56:44,319][26022] Updated weights on worker 0-0, policy_version 17372 (0.00094) [2022-07-09 00:56:44,336][25689] Fps is (10 sec: 5504.6, 60 sec: 5516.0, 300 sec: 5492.5). Total num frames: 17788928. Throughput: 0: 5813.1. Samples: 17788942. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 00:56:44,337][25689] Avg episode reward: [(0, '-65.662')] [2022-07-09 00:56:46,183][26022] Updated weights on worker 0-0, policy_version 17382 (0.00087) [2022-07-09 00:56:48,071][26022] Updated weights on worker 0-0, policy_version 17392 (0.00096) [2022-07-09 00:56:49,404][25689] Fps is (10 sec: 5688.8, 60 sec: 5531.1, 300 sec: 5492.0). Total num frames: 17816576. Throughput: 0: 5784.5. Samples: 17821786. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 00:56:49,405][25689] Avg episode reward: [(0, '-66.227')] [2022-07-09 00:56:50,010][26022] Updated weights on worker 0-0, policy_version 17402 (0.00094) [2022-07-09 00:56:51,666][26022] Updated weights on worker 0-0, policy_version 17412 (0.00096) [2022-07-09 00:56:53,426][26022] Updated weights on worker 0-0, policy_version 17422 (0.00086) [2022-07-09 00:56:54,425][25689] Fps is (10 sec: 5379.2, 60 sec: 5479.4, 300 sec: 5485.3). Total num frames: 17843200. Throughput: 0: 4961.7. Samples: 17838712. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 00:56:54,427][25689] Avg episode reward: [(0, '-67.343')] [2022-07-09 00:56:55,491][26022] Updated weights on worker 0-0, policy_version 17432 (0.00089) [2022-07-09 00:56:57,090][26022] Updated weights on worker 0-0, policy_version 17442 (0.00100) [2022-07-09 00:56:59,147][26022] Updated weights on worker 0-0, policy_version 17452 (0.00083) [2022-07-09 00:56:59,438][25689] Fps is (10 sec: 5613.3, 60 sec: 5529.9, 300 sec: 5502.4). Total num frames: 17872896. Throughput: 0: 5802.0. Samples: 17872368. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 00:56:59,439][25689] Avg episode reward: [(0, '-67.787')] [2022-07-09 00:57:00,827][26022] Updated weights on worker 0-0, policy_version 17462 (0.00086) [2022-07-09 00:57:03,223][26022] Updated weights on worker 0-0, policy_version 17472 (0.00095) [2022-07-09 00:57:04,459][25689] Fps is (10 sec: 5409.0, 60 sec: 5494.9, 300 sec: 5492.9). Total num frames: 17897472. Throughput: 0: 5687.7. Samples: 17903396. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 00:57:04,460][25689] Avg episode reward: [(0, '-67.774')] [2022-07-09 00:57:04,985][26022] Updated weights on worker 0-0, policy_version 17482 (0.00104) [2022-07-09 00:57:06,913][26022] Updated weights on worker 0-0, policy_version 17492 (0.00088) [2022-07-09 00:57:08,799][26022] Updated weights on worker 0-0, policy_version 17502 (0.00086) [2022-07-09 00:57:09,527][25689] Fps is (10 sec: 5277.4, 60 sec: 5494.1, 300 sec: 5491.8). Total num frames: 17926144. Throughput: 0: 4876.3. Samples: 17919912. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:57:09,528][25689] Avg episode reward: [(0, '-69.009')] [2022-07-09 00:57:10,732][26022] Updated weights on worker 0-0, policy_version 17512 (0.00218) [2022-07-09 00:57:12,564][26022] Updated weights on worker 0-0, policy_version 17522 (0.00088) [2022-07-09 00:57:14,283][26022] Updated weights on worker 0-0, policy_version 17532 (0.00092) [2022-07-09 00:57:14,555][25689] Fps is (10 sec: 5477.2, 60 sec: 5493.9, 300 sec: 5491.8). Total num frames: 17952768. Throughput: 0: 5680.2. Samples: 17953050. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:57:14,555][25689] Avg episode reward: [(0, '-68.728')] [2022-07-09 00:57:16,085][26022] Updated weights on worker 0-0, policy_version 17542 (0.00084) [2022-07-09 00:57:18,117][26022] Updated weights on worker 0-0, policy_version 17552 (0.00087) [2022-07-09 00:57:19,567][25689] Fps is (10 sec: 5508.0, 60 sec: 5512.7, 300 sec: 5499.0). Total num frames: 17981440. Throughput: 0: 5652.8. Samples: 17986154. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:57:19,567][25689] Avg episode reward: [(0, '-68.734')] [2022-07-09 00:57:19,925][26022] Updated weights on worker 0-0, policy_version 17562 (0.00089) [2022-07-09 00:57:21,660][26022] Updated weights on worker 0-0, policy_version 17572 (0.00095) [2022-07-09 00:57:23,553][26022] Updated weights on worker 0-0, policy_version 17582 (0.00088) [2022-07-09 00:57:24,575][25689] Fps is (10 sec: 5722.8, 60 sec: 5531.8, 300 sec: 5489.5). Total num frames: 18010112. Throughput: 0: 4946.0. Samples: 18002890. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:57:24,575][25689] Avg episode reward: [(0, '-68.047')] [2022-07-09 00:57:25,583][26022] Updated weights on worker 0-0, policy_version 17592 (0.00084) [2022-07-09 00:57:27,339][26022] Updated weights on worker 0-0, policy_version 17602 (0.00090) [2022-07-09 00:57:29,102][26022] Updated weights on worker 0-0, policy_version 17612 (0.00085) [2022-07-09 00:57:29,646][25689] Fps is (10 sec: 5587.8, 60 sec: 5513.4, 300 sec: 5498.6). Total num frames: 18037760. Throughput: 0: 5779.9. Samples: 18036194. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:57:29,647][25689] Avg episode reward: [(0, '-68.971')] [2022-07-09 00:57:31,005][26022] Updated weights on worker 0-0, policy_version 17622 (0.00085) [2022-07-09 00:57:32,920][26022] Updated weights on worker 0-0, policy_version 17632 (0.00090) [2022-07-09 00:57:34,731][25689] Fps is (10 sec: 5343.8, 60 sec: 5472.4, 300 sec: 5494.3). Total num frames: 18064384. Throughput: 0: 5767.5. Samples: 18069416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 00:57:34,731][25689] Avg episode reward: [(0, '-67.948')] [2022-07-09 00:57:34,737][26022] Updated weights on worker 0-0, policy_version 17642 (0.00049) [2022-07-09 00:57:36,553][26022] Updated weights on worker 0-0, policy_version 17652 (0.00088) [2022-07-09 00:57:38,479][26022] Updated weights on worker 0-0, policy_version 17662 (0.00096) [2022-07-09 00:57:39,778][25689] Fps is (10 sec: 5457.2, 60 sec: 5521.9, 300 sec: 5494.1). Total num frames: 18093056. Throughput: 0: 5774.9. Samples: 18102874. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:57:39,780][25689] Avg episode reward: [(0, '-68.447')] [2022-07-09 00:57:40,099][26022] Updated weights on worker 0-0, policy_version 17672 (0.00096) [2022-07-09 00:57:42,132][26022] Updated weights on worker 0-0, policy_version 17682 (0.00097) [2022-07-09 00:57:43,886][26022] Updated weights on worker 0-0, policy_version 17692 (0.00084) [2022-07-09 00:57:44,835][25689] Fps is (10 sec: 5675.3, 60 sec: 5500.9, 300 sec: 5501.9). Total num frames: 18121728. Throughput: 0: 5768.3. Samples: 18119756. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:57:44,836][25689] Avg episode reward: [(0, '-68.171')] [2022-07-09 00:57:45,722][26022] Updated weights on worker 0-0, policy_version 17702 (0.00088) [2022-07-09 00:57:47,633][26022] Updated weights on worker 0-0, policy_version 17712 (0.00096) [2022-07-09 00:57:49,462][26022] Updated weights on worker 0-0, policy_version 17722 (0.00092) [2022-07-09 00:57:49,903][25689] Fps is (10 sec: 5562.8, 60 sec: 5501.0, 300 sec: 5494.3). Total num frames: 18149376. Throughput: 0: 5766.0. Samples: 18152996. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:57:49,904][25689] Avg episode reward: [(0, '-68.509')] [2022-07-09 00:57:51,209][26022] Updated weights on worker 0-0, policy_version 17732 (0.00091) [2022-07-09 00:57:53,172][26022] Updated weights on worker 0-0, policy_version 17742 (0.00086) [2022-07-09 00:57:54,821][26022] Updated weights on worker 0-0, policy_version 17752 (0.00085) [2022-07-09 00:57:54,946][25689] Fps is (10 sec: 5671.7, 60 sec: 5549.7, 300 sec: 5504.3). Total num frames: 18179072. Throughput: 0: 5787.1. Samples: 18186400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:57:54,947][25689] Avg episode reward: [(0, '-68.479')] [2022-07-09 00:57:56,823][26022] Updated weights on worker 0-0, policy_version 17762 (0.00086) [2022-07-09 00:57:58,612][26022] Updated weights on worker 0-0, policy_version 17772 (0.00090) [2022-07-09 00:57:59,970][25689] Fps is (10 sec: 5492.6, 60 sec: 5480.9, 300 sec: 5501.4). Total num frames: 18204672. Throughput: 0: 4953.9. Samples: 18202902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 00:57:59,972][25689] Avg episode reward: [(0, '-68.521')] [2022-07-09 00:58:00,579][26022] Updated weights on worker 0-0, policy_version 17782 (0.00083) [2022-07-09 00:58:02,883][26022] Updated weights on worker 0-0, policy_version 17792 (0.00081) [2022-07-09 00:58:04,475][26022] Updated weights on worker 0-0, policy_version 17802 (0.00092) [2022-07-09 00:58:04,973][25689] Fps is (10 sec: 5208.0, 60 sec: 5516.4, 300 sec: 5502.7). Total num frames: 18231296. Throughput: 0: 5672.5. Samples: 18233990. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:58:04,976][25689] Avg episode reward: [(0, '-68.358')] [2022-07-09 00:58:06,584][26022] Updated weights on worker 0-0, policy_version 17812 (0.00093) [2022-07-09 00:58:08,192][26022] Updated weights on worker 0-0, policy_version 17822 (0.00092) [2022-07-09 00:58:10,034][25689] Fps is (10 sec: 5392.6, 60 sec: 5500.2, 300 sec: 5502.6). Total num frames: 18258944. Throughput: 0: 5680.6. Samples: 18267356. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:58:10,035][25689] Avg episode reward: [(0, '-68.890')] [2022-07-09 00:58:10,120][26022] Updated weights on worker 0-0, policy_version 17832 (0.00095) [2022-07-09 00:58:11,997][26022] Updated weights on worker 0-0, policy_version 17842 (0.00092) [2022-07-09 00:58:13,876][26022] Updated weights on worker 0-0, policy_version 17852 (0.00093) [2022-07-09 00:58:14,120][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 00:58:14,134][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000017854_18282496.pth [2022-07-09 00:58:14,134][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000015919_16301056.pth [2022-07-09 00:58:15,055][25689] Fps is (10 sec: 5484.6, 60 sec: 5517.7, 300 sec: 5502.8). Total num frames: 18286592. Throughput: 0: 4852.7. Samples: 18283986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:58:15,056][25689] Avg episode reward: [(0, '-69.431')] [2022-07-09 00:58:15,762][26022] Updated weights on worker 0-0, policy_version 17862 (0.00092) [2022-07-09 00:58:17,630][26022] Updated weights on worker 0-0, policy_version 17872 (0.00086) [2022-07-09 00:58:19,356][26022] Updated weights on worker 0-0, policy_version 17882 (0.00090) [2022-07-09 00:58:20,070][25689] Fps is (10 sec: 5510.2, 60 sec: 5500.6, 300 sec: 5496.2). Total num frames: 18314240. Throughput: 0: 5682.2. Samples: 18317110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:58:20,070][25689] Avg episode reward: [(0, '-68.741')] [2022-07-09 00:58:21,507][26022] Updated weights on worker 0-0, policy_version 17892 (0.00091) [2022-07-09 00:58:23,333][26022] Updated weights on worker 0-0, policy_version 17902 (0.00081) [2022-07-09 00:58:24,975][26022] Updated weights on worker 0-0, policy_version 17912 (0.00081) [2022-07-09 00:58:25,107][25689] Fps is (10 sec: 5501.4, 60 sec: 5481.0, 300 sec: 5500.2). Total num frames: 18341888. Throughput: 0: 5768.1. Samples: 18350120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 00:58:25,107][25689] Avg episode reward: [(0, '-69.170')] [2022-07-09 00:58:26,938][26022] Updated weights on worker 0-0, policy_version 17922 (0.00083) [2022-07-09 00:58:28,686][26022] Updated weights on worker 0-0, policy_version 17932 (0.00085) [2022-07-09 00:58:30,161][25689] Fps is (10 sec: 5479.5, 60 sec: 5482.5, 300 sec: 5492.6). Total num frames: 18369536. Throughput: 0: 4929.3. Samples: 18366566. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 00:58:30,162][25689] Avg episode reward: [(0, '-68.842')] [2022-07-09 00:58:30,548][26022] Updated weights on worker 0-0, policy_version 17942 (0.00089) [2022-07-09 00:58:32,563][26022] Updated weights on worker 0-0, policy_version 17952 (0.00085) [2022-07-09 00:58:34,235][26022] Updated weights on worker 0-0, policy_version 17962 (0.00083) [2022-07-09 00:58:35,164][25689] Fps is (10 sec: 5498.3, 60 sec: 5507.0, 300 sec: 5499.8). Total num frames: 18397184. Throughput: 0: 5752.1. Samples: 18399650. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 00:58:35,164][25689] Avg episode reward: [(0, '-68.014')] [2022-07-09 00:58:36,332][26022] Updated weights on worker 0-0, policy_version 17972 (0.00088) [2022-07-09 00:58:37,939][26022] Updated weights on worker 0-0, policy_version 17982 (0.00086) [2022-07-09 00:58:39,972][26022] Updated weights on worker 0-0, policy_version 17992 (0.00087) [2022-07-09 00:58:40,167][25689] Fps is (10 sec: 5424.0, 60 sec: 5477.1, 300 sec: 5492.9). Total num frames: 18423808. Throughput: 0: 5757.5. Samples: 18432822. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 00:58:40,168][25689] Avg episode reward: [(0, '-67.845')] [2022-07-09 00:58:41,759][26022] Updated weights on worker 0-0, policy_version 18002 (0.00083) [2022-07-09 00:58:43,662][26022] Updated weights on worker 0-0, policy_version 18012 (0.00092) [2022-07-09 00:58:45,170][25689] Fps is (10 sec: 5526.2, 60 sec: 5482.0, 300 sec: 5497.3). Total num frames: 18452480. Throughput: 0: 4939.7. Samples: 18449226. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 00:58:45,170][25689] Avg episode reward: [(0, '-67.699')] [2022-07-09 00:58:45,551][26022] Updated weights on worker 0-0, policy_version 18022 (0.00090) [2022-07-09 00:58:47,422][26022] Updated weights on worker 0-0, policy_version 18032 (0.00090) [2022-07-09 00:58:49,213][26022] Updated weights on worker 0-0, policy_version 18042 (0.00090) [2022-07-09 00:58:50,322][25689] Fps is (10 sec: 5546.2, 60 sec: 5474.3, 300 sec: 5498.6). Total num frames: 18480128. Throughput: 0: 5743.2. Samples: 18482352. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 00:58:50,322][25689] Avg episode reward: [(0, '-67.431')] [2022-07-09 00:58:51,187][26022] Updated weights on worker 0-0, policy_version 18052 (0.00084) [2022-07-09 00:58:53,085][26022] Updated weights on worker 0-0, policy_version 18062 (0.00085) [2022-07-09 00:58:54,971][26022] Updated weights on worker 0-0, policy_version 18072 (0.00101) [2022-07-09 00:58:55,411][25689] Fps is (10 sec: 5399.3, 60 sec: 5436.2, 300 sec: 5497.2). Total num frames: 18507776. Throughput: 0: 5697.2. Samples: 18515004. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:58:55,412][25689] Avg episode reward: [(0, '-67.192')] [2022-07-09 00:58:56,646][26022] Updated weights on worker 0-0, policy_version 18082 (0.00094) [2022-07-09 00:58:58,591][26022] Updated weights on worker 0-0, policy_version 18092 (0.00095) [2022-07-09 00:59:00,450][25689] Fps is (10 sec: 5459.5, 60 sec: 5468.8, 300 sec: 5500.0). Total num frames: 18535424. Throughput: 0: 4857.2. Samples: 18531342. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:59:00,452][25689] Avg episode reward: [(0, '-67.085')] [2022-07-09 00:59:00,701][26022] Updated weights on worker 0-0, policy_version 18102 (0.00090) [2022-07-09 00:59:02,763][26022] Updated weights on worker 0-0, policy_version 18112 (0.00091) [2022-07-09 00:59:04,792][26022] Updated weights on worker 0-0, policy_version 18122 (0.00086) [2022-07-09 00:59:05,474][25689] Fps is (10 sec: 5291.6, 60 sec: 5450.0, 300 sec: 5490.6). Total num frames: 18561024. Throughput: 0: 5553.3. Samples: 18561980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:59:05,474][25689] Avg episode reward: [(0, '-66.265')] [2022-07-09 00:59:06,515][26022] Updated weights on worker 0-0, policy_version 18132 (0.00087) [2022-07-09 00:59:08,395][26022] Updated weights on worker 0-0, policy_version 18142 (0.00096) [2022-07-09 00:59:10,225][26022] Updated weights on worker 0-0, policy_version 18152 (0.00878) [2022-07-09 00:59:10,574][25689] Fps is (10 sec: 5361.0, 60 sec: 5463.4, 300 sec: 5496.4). Total num frames: 18589696. Throughput: 0: 5575.7. Samples: 18595272. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:59:10,574][25689] Avg episode reward: [(0, '-66.228')] [2022-07-09 00:59:12,003][26022] Updated weights on worker 0-0, policy_version 18162 (0.00086) [2022-07-09 00:59:13,927][26022] Updated weights on worker 0-0, policy_version 18172 (0.00088) [2022-07-09 00:59:15,647][25689] Fps is (10 sec: 5435.4, 60 sec: 5441.8, 300 sec: 5488.2). Total num frames: 18616320. Throughput: 0: 4785.3. Samples: 18611840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 00:59:15,648][25689] Avg episode reward: [(0, '-65.367')] [2022-07-09 00:59:15,945][26022] Updated weights on worker 0-0, policy_version 18182 (0.00091) [2022-07-09 00:59:17,395][26022] Updated weights on worker 0-0, policy_version 18192 (0.00078) [2022-07-09 00:59:19,564][26022] Updated weights on worker 0-0, policy_version 18202 (0.00091) [2022-07-09 00:59:20,711][25689] Fps is (10 sec: 5454.6, 60 sec: 5454.2, 300 sec: 5490.9). Total num frames: 18644992. Throughput: 0: 5610.8. Samples: 18645022. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:59:20,712][25689] Avg episode reward: [(0, '-65.246')] [2022-07-09 00:59:21,549][26022] Updated weights on worker 0-0, policy_version 18212 (0.00083) [2022-07-09 00:59:23,341][26022] Updated weights on worker 0-0, policy_version 18222 (0.00088) [2022-07-09 00:59:25,283][26022] Updated weights on worker 0-0, policy_version 18232 (0.00101) [2022-07-09 00:59:25,723][25689] Fps is (10 sec: 5488.0, 60 sec: 5439.6, 300 sec: 5489.5). Total num frames: 18671616. Throughput: 0: 5721.8. Samples: 18677840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:59:25,723][25689] Avg episode reward: [(0, '-65.563')] [2022-07-09 00:59:27,003][26022] Updated weights on worker 0-0, policy_version 18242 (0.00088) [2022-07-09 00:59:29,032][26022] Updated weights on worker 0-0, policy_version 18252 (0.00087) [2022-07-09 00:59:30,838][25689] Fps is (10 sec: 5359.0, 60 sec: 5434.1, 300 sec: 5487.8). Total num frames: 18699264. Throughput: 0: 4878.8. Samples: 18694140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:59:30,839][25689] Avg episode reward: [(0, '-65.616')] [2022-07-09 00:59:31,018][26022] Updated weights on worker 0-0, policy_version 18262 (0.00092) [2022-07-09 00:59:32,607][26022] Updated weights on worker 0-0, policy_version 18272 (0.00084) [2022-07-09 00:59:34,451][26022] Updated weights on worker 0-0, policy_version 18282 (0.00080) [2022-07-09 00:59:35,871][25689] Fps is (10 sec: 5549.5, 60 sec: 5448.3, 300 sec: 5487.5). Total num frames: 18727936. Throughput: 0: 5712.6. Samples: 18727374. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:59:35,872][25689] Avg episode reward: [(0, '-64.920')] [2022-07-09 00:59:36,484][26022] Updated weights on worker 0-0, policy_version 18292 (0.00086) [2022-07-09 00:59:38,157][26022] Updated weights on worker 0-0, policy_version 18302 (0.00089) [2022-07-09 00:59:40,243][26022] Updated weights on worker 0-0, policy_version 18312 (0.00089) [2022-07-09 00:59:40,891][25689] Fps is (10 sec: 5704.5, 60 sec: 5480.6, 300 sec: 5494.1). Total num frames: 18756608. Throughput: 0: 5744.4. Samples: 18760942. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 00:59:40,891][25689] Avg episode reward: [(0, '-64.940')] [2022-07-09 00:59:41,644][26022] Updated weights on worker 0-0, policy_version 18322 (0.00093) [2022-07-09 00:59:43,815][26022] Updated weights on worker 0-0, policy_version 18332 (0.00084) [2022-07-09 00:59:45,580][26022] Updated weights on worker 0-0, policy_version 18342 (0.00095) [2022-07-09 00:59:45,899][25689] Fps is (10 sec: 5514.6, 60 sec: 5446.4, 300 sec: 5489.5). Total num frames: 18783232. Throughput: 0: 4945.6. Samples: 18777622. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2022-07-09 00:59:45,899][25689] Avg episode reward: [(0, '-65.511')] [2022-07-09 00:59:47,368][26022] Updated weights on worker 0-0, policy_version 18352 (0.00089) [2022-07-09 00:59:49,308][26022] Updated weights on worker 0-0, policy_version 18362 (0.00110) [2022-07-09 00:59:50,950][25689] Fps is (10 sec: 5395.1, 60 sec: 5455.4, 300 sec: 5486.0). Total num frames: 18810880. Throughput: 0: 5810.7. Samples: 18811004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2022-07-09 00:59:50,951][25689] Avg episode reward: [(0, '-65.350')] [2022-07-09 00:59:51,147][26022] Updated weights on worker 0-0, policy_version 18372 (0.00426) [2022-07-09 00:59:52,915][26022] Updated weights on worker 0-0, policy_version 18382 (0.00087) [2022-07-09 00:59:54,724][26022] Updated weights on worker 0-0, policy_version 18392 (0.00086) [2022-07-09 00:59:55,956][25689] Fps is (10 sec: 5599.7, 60 sec: 5479.8, 300 sec: 5493.5). Total num frames: 18839552. Throughput: 0: 5824.5. Samples: 18844358. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2022-07-09 00:59:55,957][25689] Avg episode reward: [(0, '-65.058')] [2022-07-09 00:59:56,509][26022] Updated weights on worker 0-0, policy_version 18402 (0.00090) [2022-07-09 00:59:58,413][26022] Updated weights on worker 0-0, policy_version 18412 (0.00085) [2022-07-09 01:00:00,265][26022] Updated weights on worker 0-0, policy_version 18422 (0.00094) [2022-07-09 01:00:00,974][25689] Fps is (10 sec: 5720.9, 60 sec: 5498.7, 300 sec: 5503.6). Total num frames: 18868224. Throughput: 0: 4994.7. Samples: 18861252. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2022-07-09 01:00:00,974][25689] Avg episode reward: [(0, '-65.884')] [2022-07-09 01:00:02,576][26022] Updated weights on worker 0-0, policy_version 18432 (0.00085) [2022-07-09 01:00:04,268][26022] Updated weights on worker 0-0, policy_version 18442 (0.00629) [2022-07-09 01:00:05,982][25689] Fps is (10 sec: 5311.1, 60 sec: 5483.1, 300 sec: 5495.0). Total num frames: 18892800. Throughput: 0: 5720.5. Samples: 18892510. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2022-07-09 01:00:05,984][25689] Avg episode reward: [(0, '-66.639')] [2022-07-09 01:00:06,332][26022] Updated weights on worker 0-0, policy_version 18452 (0.00086) [2022-07-09 01:00:07,951][26022] Updated weights on worker 0-0, policy_version 18462 (0.00088) [2022-07-09 01:00:09,980][26022] Updated weights on worker 0-0, policy_version 18472 (0.00095) [2022-07-09 01:00:11,015][25689] Fps is (10 sec: 5302.9, 60 sec: 5489.2, 300 sec: 5494.5). Total num frames: 18921472. Throughput: 0: 5723.2. Samples: 18925840. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 01:00:11,016][25689] Avg episode reward: [(0, '-66.313')] [2022-07-09 01:00:11,601][26022] Updated weights on worker 0-0, policy_version 18482 (0.00083) [2022-07-09 01:00:13,742][26022] Updated weights on worker 0-0, policy_version 18492 (0.00094) [2022-07-09 01:00:14,297][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:00:14,305][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000018496_18939904.pth [2022-07-09 01:00:14,306][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000016562_16959488.pth [2022-07-09 01:00:15,311][26022] Updated weights on worker 0-0, policy_version 18502 (0.00087) [2022-07-09 01:00:16,032][25689] Fps is (10 sec: 5604.0, 60 sec: 5511.3, 300 sec: 5498.1). Total num frames: 18949120. Throughput: 0: 4892.0. Samples: 18942568. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 01:00:16,033][25689] Avg episode reward: [(0, '-65.786')] [2022-07-09 01:00:17,394][26022] Updated weights on worker 0-0, policy_version 18512 (0.00086) [2022-07-09 01:00:19,140][26022] Updated weights on worker 0-0, policy_version 18522 (0.00084) [2022-07-09 01:00:21,067][25689] Fps is (10 sec: 5399.4, 60 sec: 5480.1, 300 sec: 5487.6). Total num frames: 18975744. Throughput: 0: 5703.5. Samples: 18975852. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 01:00:21,067][25689] Avg episode reward: [(0, '-65.615')] [2022-07-09 01:00:21,099][26022] Updated weights on worker 0-0, policy_version 18532 (0.00099) [2022-07-09 01:00:22,713][26022] Updated weights on worker 0-0, policy_version 18542 (0.00092) [2022-07-09 01:00:24,714][26022] Updated weights on worker 0-0, policy_version 18552 (0.00088) [2022-07-09 01:00:26,071][25689] Fps is (10 sec: 5406.3, 60 sec: 5497.7, 300 sec: 5492.7). Total num frames: 19003392. Throughput: 0: 5772.5. Samples: 19008472. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 01:00:26,072][25689] Avg episode reward: [(0, '-66.032')] [2022-07-09 01:00:26,509][26022] Updated weights on worker 0-0, policy_version 18562 (0.00090) [2022-07-09 01:00:28,514][26022] Updated weights on worker 0-0, policy_version 18572 (0.00081) [2022-07-09 01:00:30,210][26022] Updated weights on worker 0-0, policy_version 18582 (0.00091) [2022-07-09 01:00:31,140][25689] Fps is (10 sec: 5591.0, 60 sec: 5518.9, 300 sec: 5492.2). Total num frames: 19032064. Throughput: 0: 4927.0. Samples: 19024994. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 01:00:31,142][25689] Avg episode reward: [(0, '-66.180')] [2022-07-09 01:00:32,326][26022] Updated weights on worker 0-0, policy_version 18592 (0.00086) [2022-07-09 01:00:33,803][26022] Updated weights on worker 0-0, policy_version 18602 (0.00086) [2022-07-09 01:00:35,996][26022] Updated weights on worker 0-0, policy_version 18612 (0.00084) [2022-07-09 01:00:36,154][25689] Fps is (10 sec: 5585.8, 60 sec: 5503.7, 300 sec: 5492.8). Total num frames: 19059712. Throughput: 0: 5749.2. Samples: 19058250. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 01:00:36,155][25689] Avg episode reward: [(0, '-65.545')] [2022-07-09 01:00:37,579][26022] Updated weights on worker 0-0, policy_version 18622 (0.00080) [2022-07-09 01:00:39,652][26022] Updated weights on worker 0-0, policy_version 18632 (0.01141) [2022-07-09 01:00:41,165][25689] Fps is (10 sec: 5618.0, 60 sec: 5504.4, 300 sec: 5492.8). Total num frames: 19088384. Throughput: 0: 5770.9. Samples: 19091836. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 01:00:41,166][25689] Avg episode reward: [(0, '-65.650')] [2022-07-09 01:00:41,188][26022] Updated weights on worker 0-0, policy_version 18642 (0.00088) [2022-07-09 01:00:43,388][26022] Updated weights on worker 0-0, policy_version 18652 (0.00095) [2022-07-09 01:00:45,058][26022] Updated weights on worker 0-0, policy_version 18662 (0.00087) [2022-07-09 01:00:46,181][25689] Fps is (10 sec: 5514.4, 60 sec: 5503.7, 300 sec: 5493.4). Total num frames: 19115008. Throughput: 0: 4975.6. Samples: 19108534. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 01:00:46,183][25689] Avg episode reward: [(0, '-65.795')] [2022-07-09 01:00:46,975][26022] Updated weights on worker 0-0, policy_version 18672 (0.00083) [2022-07-09 01:00:48,639][26022] Updated weights on worker 0-0, policy_version 18682 (0.00096) [2022-07-09 01:00:50,839][26022] Updated weights on worker 0-0, policy_version 18692 (0.00095) [2022-07-09 01:00:51,277][25689] Fps is (10 sec: 5265.8, 60 sec: 5482.7, 300 sec: 5481.5). Total num frames: 19141632. Throughput: 0: 5806.3. Samples: 19141916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 01:00:51,278][25689] Avg episode reward: [(0, '-65.079')] [2022-07-09 01:00:52,223][26022] Updated weights on worker 0-0, policy_version 18702 (0.00084) [2022-07-09 01:00:54,603][26022] Updated weights on worker 0-0, policy_version 18712 (0.00093) [2022-07-09 01:00:55,800][26022] Updated weights on worker 0-0, policy_version 18722 (0.00088) [2022-07-09 01:00:56,304][25689] Fps is (10 sec: 5665.0, 60 sec: 5514.7, 300 sec: 5494.9). Total num frames: 19172352. Throughput: 0: 5795.0. Samples: 19175020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 01:00:56,305][25689] Avg episode reward: [(0, '-64.855')] [2022-07-09 01:00:58,232][26022] Updated weights on worker 0-0, policy_version 18732 (0.00085) [2022-07-09 01:00:59,690][26022] Updated weights on worker 0-0, policy_version 18742 (0.00092) [2022-07-09 01:01:01,325][25689] Fps is (10 sec: 5707.0, 60 sec: 5480.5, 300 sec: 5494.7). Total num frames: 19198976. Throughput: 0: 5786.6. Samples: 19208494. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 01:01:01,326][25689] Avg episode reward: [(0, '-65.505')] [2022-07-09 01:01:02,223][26022] Updated weights on worker 0-0, policy_version 18752 (0.00626) [2022-07-09 01:01:03,676][26022] Updated weights on worker 0-0, policy_version 18762 (0.00104) [2022-07-09 01:01:05,852][26022] Updated weights on worker 0-0, policy_version 18772 (0.00095) [2022-07-09 01:01:06,344][25689] Fps is (10 sec: 5201.6, 60 sec: 5496.5, 300 sec: 5485.2). Total num frames: 19224576. Throughput: 0: 5672.1. Samples: 19222898. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:01:06,346][25689] Avg episode reward: [(0, '-65.048')] [2022-07-09 01:01:07,652][26022] Updated weights on worker 0-0, policy_version 18782 (0.00091) [2022-07-09 01:01:09,447][26022] Updated weights on worker 0-0, policy_version 18792 (0.00080) [2022-07-09 01:01:11,096][26022] Updated weights on worker 0-0, policy_version 18802 (0.00077) [2022-07-09 01:01:11,467][25689] Fps is (10 sec: 5452.3, 60 sec: 5505.2, 300 sec: 5493.7). Total num frames: 19254272. Throughput: 0: 5680.1. Samples: 19256596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:01:11,467][25689] Avg episode reward: [(0, '-64.927')] [2022-07-09 01:01:13,539][26022] Updated weights on worker 0-0, policy_version 18812 (0.00091) [2022-07-09 01:01:14,880][26022] Updated weights on worker 0-0, policy_version 18822 (0.00092) [2022-07-09 01:01:16,507][25689] Fps is (10 sec: 5440.9, 60 sec: 5469.2, 300 sec: 5486.6). Total num frames: 19279872. Throughput: 0: 5688.7. Samples: 19289950. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:01:16,507][25689] Avg episode reward: [(0, '-64.919')] [2022-07-09 01:01:17,107][26022] Updated weights on worker 0-0, policy_version 18832 (0.00089) [2022-07-09 01:01:18,473][26022] Updated weights on worker 0-0, policy_version 18842 (0.00085) [2022-07-09 01:01:20,741][26022] Updated weights on worker 0-0, policy_version 18852 (0.00092) [2022-07-09 01:01:21,598][25689] Fps is (10 sec: 5559.0, 60 sec: 5531.8, 300 sec: 5495.9). Total num frames: 19310592. Throughput: 0: 4836.1. Samples: 19306536. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:01:21,599][25689] Avg episode reward: [(0, '-64.132')] [2022-07-09 01:01:22,290][26022] Updated weights on worker 0-0, policy_version 18862 (0.00085) [2022-07-09 01:01:24,264][26022] Updated weights on worker 0-0, policy_version 18872 (0.00092) [2022-07-09 01:01:25,925][26022] Updated weights on worker 0-0, policy_version 18882 (0.00085) [2022-07-09 01:01:26,620][25689] Fps is (10 sec: 5670.4, 60 sec: 5513.3, 300 sec: 5489.6). Total num frames: 19337216. Throughput: 0: 5771.5. Samples: 19339922. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:01:26,621][25689] Avg episode reward: [(0, '-64.032')] [2022-07-09 01:01:28,047][26022] Updated weights on worker 0-0, policy_version 18892 (0.00088) [2022-07-09 01:01:29,629][26022] Updated weights on worker 0-0, policy_version 18902 (0.00093) [2022-07-09 01:01:31,689][25689] Fps is (10 sec: 5378.4, 60 sec: 5496.4, 300 sec: 5485.0). Total num frames: 19364864. Throughput: 0: 5730.4. Samples: 19372478. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 01:01:31,690][25689] Avg episode reward: [(0, '-63.567')] [2022-07-09 01:01:31,865][26022] Updated weights on worker 0-0, policy_version 18912 (0.00084) [2022-07-09 01:01:33,680][26022] Updated weights on worker 0-0, policy_version 18922 (0.00088) [2022-07-09 01:01:35,455][26022] Updated weights on worker 0-0, policy_version 18932 (0.00086) [2022-07-09 01:01:36,691][25689] Fps is (10 sec: 5490.7, 60 sec: 5497.5, 300 sec: 5492.5). Total num frames: 19392512. Throughput: 0: 4919.2. Samples: 19389238. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 01:01:36,691][25689] Avg episode reward: [(0, '-64.224')] [2022-07-09 01:01:37,395][26022] Updated weights on worker 0-0, policy_version 18942 (0.00083) [2022-07-09 01:01:39,189][26022] Updated weights on worker 0-0, policy_version 18952 (0.00105) [2022-07-09 01:01:40,984][26022] Updated weights on worker 0-0, policy_version 18962 (0.00092) [2022-07-09 01:01:41,692][25689] Fps is (10 sec: 5630.0, 60 sec: 5498.3, 300 sec: 5489.2). Total num frames: 19421184. Throughput: 0: 5768.3. Samples: 19422446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 01:01:41,693][25689] Avg episode reward: [(0, '-64.459')] [2022-07-09 01:01:43,047][26022] Updated weights on worker 0-0, policy_version 18972 (0.00093) [2022-07-09 01:01:44,478][26022] Updated weights on worker 0-0, policy_version 18982 (0.00086) [2022-07-09 01:01:46,664][26022] Updated weights on worker 0-0, policy_version 18992 (0.00096) [2022-07-09 01:01:46,724][25689] Fps is (10 sec: 5511.5, 60 sec: 5497.0, 300 sec: 5486.5). Total num frames: 19447808. Throughput: 0: 5755.8. Samples: 19455636. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 01:01:46,724][25689] Avg episode reward: [(0, '-65.554')] [2022-07-09 01:01:48,323][26022] Updated weights on worker 0-0, policy_version 19002 (0.00083) [2022-07-09 01:01:50,313][26022] Updated weights on worker 0-0, policy_version 19012 (0.00114) [2022-07-09 01:01:51,774][25689] Fps is (10 sec: 5484.7, 60 sec: 5534.9, 300 sec: 5492.8). Total num frames: 19476480. Throughput: 0: 4954.8. Samples: 19471996. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 01:01:51,775][25689] Avg episode reward: [(0, '-66.357')] [2022-07-09 01:01:52,374][26022] Updated weights on worker 0-0, policy_version 19022 (0.00092) [2022-07-09 01:01:53,856][26022] Updated weights on worker 0-0, policy_version 19032 (0.00099) [2022-07-09 01:01:56,096][26022] Updated weights on worker 0-0, policy_version 19042 (0.00095) [2022-07-09 01:01:56,778][25689] Fps is (10 sec: 5499.8, 60 sec: 5469.3, 300 sec: 5482.6). Total num frames: 19503104. Throughput: 0: 5747.4. Samples: 19504688. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 01:01:56,778][25689] Avg episode reward: [(0, '-66.504')] [2022-07-09 01:01:57,614][26022] Updated weights on worker 0-0, policy_version 19052 (0.00082) [2022-07-09 01:01:59,710][26022] Updated weights on worker 0-0, policy_version 19062 (0.00090) [2022-07-09 01:02:01,782][25689] Fps is (10 sec: 5218.2, 60 sec: 5453.9, 300 sec: 5486.4). Total num frames: 19528704. Throughput: 0: 5719.0. Samples: 19537340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 01:02:01,783][25689] Avg episode reward: [(0, '-66.752')] [2022-07-09 01:02:02,002][26022] Updated weights on worker 0-0, policy_version 19072 (0.00089) [2022-07-09 01:02:03,676][26022] Updated weights on worker 0-0, policy_version 19082 (0.00087) [2022-07-09 01:02:05,642][26022] Updated weights on worker 0-0, policy_version 19092 (0.00099) [2022-07-09 01:02:06,783][25689] Fps is (10 sec: 5219.5, 60 sec: 5472.4, 300 sec: 5480.7). Total num frames: 19555328. Throughput: 0: 4847.6. Samples: 19552878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 01:02:06,784][25689] Avg episode reward: [(0, '-66.122')] [2022-07-09 01:02:07,350][26022] Updated weights on worker 0-0, policy_version 19102 (0.00102) [2022-07-09 01:02:09,381][26022] Updated weights on worker 0-0, policy_version 19112 (0.00085) [2022-07-09 01:02:11,110][26022] Updated weights on worker 0-0, policy_version 19122 (0.00085) [2022-07-09 01:02:11,817][25689] Fps is (10 sec: 5612.5, 60 sec: 5480.5, 300 sec: 5491.0). Total num frames: 19585024. Throughput: 0: 5701.3. Samples: 19586266. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 01:02:11,819][25689] Avg episode reward: [(0, '-65.608')] [2022-07-09 01:02:13,000][26022] Updated weights on worker 0-0, policy_version 19132 (0.00090) [2022-07-09 01:02:14,417][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:02:14,437][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000019140_19599360.pth [2022-07-09 01:02:14,437][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000017208_17620992.pth [2022-07-09 01:02:14,640][26022] Updated weights on worker 0-0, policy_version 19142 (0.00089) [2022-07-09 01:02:16,699][26022] Updated weights on worker 0-0, policy_version 19152 (0.00087) [2022-07-09 01:02:16,834][25689] Fps is (10 sec: 5604.0, 60 sec: 5499.7, 300 sec: 5484.0). Total num frames: 19611648. Throughput: 0: 5723.4. Samples: 19619474. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 01:02:16,834][25689] Avg episode reward: [(0, '-65.313')] [2022-07-09 01:02:18,310][26022] Updated weights on worker 0-0, policy_version 19162 (0.00094) [2022-07-09 01:02:20,476][26022] Updated weights on worker 0-0, policy_version 19172 (0.00082) [2022-07-09 01:02:21,854][25689] Fps is (10 sec: 5509.5, 60 sec: 5472.2, 300 sec: 5483.8). Total num frames: 19640320. Throughput: 0: 4926.1. Samples: 19636212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 01:02:21,855][25689] Avg episode reward: [(0, '-64.800')] [2022-07-09 01:02:22,189][26022] Updated weights on worker 0-0, policy_version 19182 (0.00087) [2022-07-09 01:02:24,091][26022] Updated weights on worker 0-0, policy_version 19192 (0.00095) [2022-07-09 01:02:25,991][26022] Updated weights on worker 0-0, policy_version 19202 (0.00098) [2022-07-09 01:02:26,878][25689] Fps is (10 sec: 5505.3, 60 sec: 5472.0, 300 sec: 5481.2). Total num frames: 19666944. Throughput: 0: 5795.7. Samples: 19669338. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 01:02:26,878][25689] Avg episode reward: [(0, '-65.777')] [2022-07-09 01:02:27,928][26022] Updated weights on worker 0-0, policy_version 19212 (0.00091) [2022-07-09 01:02:29,537][26022] Updated weights on worker 0-0, policy_version 19222 (0.00102) [2022-07-09 01:02:31,872][26022] Updated weights on worker 0-0, policy_version 19232 (0.00086) [2022-07-09 01:02:31,978][25689] Fps is (10 sec: 5360.6, 60 sec: 5469.1, 300 sec: 5484.4). Total num frames: 19694592. Throughput: 0: 5741.1. Samples: 19702010. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 01:02:31,978][25689] Avg episode reward: [(0, '-66.184')] [2022-07-09 01:02:33,277][26022] Updated weights on worker 0-0, policy_version 19242 (0.00091) [2022-07-09 01:02:35,409][26022] Updated weights on worker 0-0, policy_version 19252 (0.00087) [2022-07-09 01:02:36,999][25689] Fps is (10 sec: 5564.2, 60 sec: 5484.3, 300 sec: 5484.8). Total num frames: 19723264. Throughput: 0: 4918.2. Samples: 19718654. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 01:02:37,000][25689] Avg episode reward: [(0, '-66.001')] [2022-07-09 01:02:37,131][26022] Updated weights on worker 0-0, policy_version 19262 (0.00084) [2022-07-09 01:02:38,842][26022] Updated weights on worker 0-0, policy_version 19272 (0.00088) [2022-07-09 01:02:40,965][26022] Updated weights on worker 0-0, policy_version 19282 (0.00084) [2022-07-09 01:02:42,032][25689] Fps is (10 sec: 5703.8, 60 sec: 5481.6, 300 sec: 5485.3). Total num frames: 19751936. Throughput: 0: 5732.3. Samples: 19751878. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 01:02:42,032][25689] Avg episode reward: [(0, '-65.923')] [2022-07-09 01:02:42,603][26022] Updated weights on worker 0-0, policy_version 19292 (0.00082) [2022-07-09 01:02:44,461][26022] Updated weights on worker 0-0, policy_version 19302 (0.00089) [2022-07-09 01:02:46,591][26022] Updated weights on worker 0-0, policy_version 19312 (0.00089) [2022-07-09 01:02:47,040][25689] Fps is (10 sec: 5405.3, 60 sec: 5466.6, 300 sec: 5479.5). Total num frames: 19777536. Throughput: 0: 5747.8. Samples: 19785228. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 01:02:47,041][25689] Avg episode reward: [(0, '-65.242')] [2022-07-09 01:02:48,079][26022] Updated weights on worker 0-0, policy_version 19322 (0.00088) [2022-07-09 01:02:50,318][26022] Updated weights on worker 0-0, policy_version 19332 (0.00090) [2022-07-09 01:02:51,856][26022] Updated weights on worker 0-0, policy_version 19342 (0.00100) [2022-07-09 01:02:52,116][25689] Fps is (10 sec: 5483.4, 60 sec: 5481.3, 300 sec: 5478.9). Total num frames: 19807232. Throughput: 0: 4954.8. Samples: 19801790. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 01:02:52,117][25689] Avg episode reward: [(0, '-65.024')] [2022-07-09 01:02:53,888][26022] Updated weights on worker 0-0, policy_version 19352 (0.00103) [2022-07-09 01:02:55,579][26022] Updated weights on worker 0-0, policy_version 19362 (0.00101) [2022-07-09 01:02:57,136][25689] Fps is (10 sec: 5578.6, 60 sec: 5479.8, 300 sec: 5482.4). Total num frames: 19833856. Throughput: 0: 5773.0. Samples: 19834900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 01:02:57,136][25689] Avg episode reward: [(0, '-64.979')] [2022-07-09 01:02:57,563][26022] Updated weights on worker 0-0, policy_version 19372 (0.00083) [2022-07-09 01:02:59,438][26022] Updated weights on worker 0-0, policy_version 19382 (0.00089) [2022-07-09 01:03:01,341][26022] Updated weights on worker 0-0, policy_version 19392 (0.00095) [2022-07-09 01:03:02,204][25689] Fps is (10 sec: 5278.3, 60 sec: 5491.0, 300 sec: 5481.2). Total num frames: 19860480. Throughput: 0: 5753.4. Samples: 19867938. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 01:03:02,204][25689] Avg episode reward: [(0, '-64.172')] [2022-07-09 01:03:03,336][26022] Updated weights on worker 0-0, policy_version 19402 (0.00091) [2022-07-09 01:03:05,604][26022] Updated weights on worker 0-0, policy_version 19412 (0.00068) [2022-07-09 01:03:06,996][26022] Updated weights on worker 0-0, policy_version 19422 (0.00088) [2022-07-09 01:03:07,231][25689] Fps is (10 sec: 5376.2, 60 sec: 5505.6, 300 sec: 5481.8). Total num frames: 19888128. Throughput: 0: 4812.0. Samples: 19882386. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 01:03:07,231][25689] Avg episode reward: [(0, '-64.454')] [2022-07-09 01:03:09,170][26022] Updated weights on worker 0-0, policy_version 19432 (0.00098) [2022-07-09 01:03:11,003][26022] Updated weights on worker 0-0, policy_version 19442 (0.00094) [2022-07-09 01:03:12,321][25689] Fps is (10 sec: 5465.4, 60 sec: 5466.6, 300 sec: 5480.6). Total num frames: 19915776. Throughput: 0: 5604.1. Samples: 19915024. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 01:03:12,322][25689] Avg episode reward: [(0, '-65.381')] [2022-07-09 01:03:12,922][26022] Updated weights on worker 0-0, policy_version 19452 (0.00093) [2022-07-09 01:03:14,755][26022] Updated weights on worker 0-0, policy_version 19462 (0.00086) [2022-07-09 01:03:16,772][26022] Updated weights on worker 0-0, policy_version 19472 (0.00088) [2022-07-09 01:03:17,377][25689] Fps is (10 sec: 5349.0, 60 sec: 5463.1, 300 sec: 5476.4). Total num frames: 19942400. Throughput: 0: 5603.7. Samples: 19948324. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 01:03:17,377][25689] Avg episode reward: [(0, '-65.520')] [2022-07-09 01:03:18,376][26022] Updated weights on worker 0-0, policy_version 19482 (0.00102) [2022-07-09 01:03:20,560][26022] Updated weights on worker 0-0, policy_version 19492 (0.00097) [2022-07-09 01:03:22,108][26022] Updated weights on worker 0-0, policy_version 19502 (0.00096) [2022-07-09 01:03:22,400][25689] Fps is (10 sec: 5384.8, 60 sec: 5445.9, 300 sec: 5476.6). Total num frames: 19970048. Throughput: 0: 4798.8. Samples: 19964856. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 01:03:22,401][25689] Avg episode reward: [(0, '-65.202')] [2022-07-09 01:03:24,078][26022] Updated weights on worker 0-0, policy_version 19512 (0.01309) [2022-07-09 01:03:25,897][26022] Updated weights on worker 0-0, policy_version 19522 (0.00094) [2022-07-09 01:03:27,447][25689] Fps is (10 sec: 5592.7, 60 sec: 5477.6, 300 sec: 5480.2). Total num frames: 19998720. Throughput: 0: 5729.1. Samples: 19998208. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 01:03:27,448][25689] Avg episode reward: [(0, '-64.893')] [2022-07-09 01:03:27,754][26022] Updated weights on worker 0-0, policy_version 19532 (0.00094) [2022-07-09 01:03:29,579][26022] Updated weights on worker 0-0, policy_version 19542 (0.00091) [2022-07-09 01:03:31,499][26022] Updated weights on worker 0-0, policy_version 19552 (0.00091) [2022-07-09 01:03:32,491][25689] Fps is (10 sec: 5682.7, 60 sec: 5499.6, 300 sec: 5482.9). Total num frames: 20027392. Throughput: 0: 5751.1. Samples: 20031022. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 01:03:32,492][25689] Avg episode reward: [(0, '-64.313')] [2022-07-09 01:03:33,418][26022] Updated weights on worker 0-0, policy_version 19562 (0.00090) [2022-07-09 01:03:35,281][26022] Updated weights on worker 0-0, policy_version 19572 (0.00099) [2022-07-09 01:03:37,104][26022] Updated weights on worker 0-0, policy_version 19582 (0.00093) [2022-07-09 01:03:37,505][25689] Fps is (10 sec: 5497.9, 60 sec: 5466.5, 300 sec: 5482.7). Total num frames: 20054016. Throughput: 0: 4925.2. Samples: 20047458. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 01:03:37,505][25689] Avg episode reward: [(0, '-64.474')] [2022-07-09 01:03:38,950][26022] Updated weights on worker 0-0, policy_version 19592 (0.00080) [2022-07-09 01:03:40,721][26022] Updated weights on worker 0-0, policy_version 19602 (0.00091) [2022-07-09 01:03:42,494][26022] Updated weights on worker 0-0, policy_version 19612 (0.00095) [2022-07-09 01:03:42,547][25689] Fps is (10 sec: 5499.0, 60 sec: 5465.6, 300 sec: 5482.0). Total num frames: 20082688. Throughput: 0: 5753.3. Samples: 20080766. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 01:03:42,547][25689] Avg episode reward: [(0, '-65.294')] [2022-07-09 01:03:44,323][26022] Updated weights on worker 0-0, policy_version 19622 (0.00091) [2022-07-09 01:03:46,129][26022] Updated weights on worker 0-0, policy_version 19632 (0.00087) [2022-07-09 01:03:47,551][25689] Fps is (10 sec: 5606.3, 60 sec: 5499.9, 300 sec: 5484.7). Total num frames: 20110336. Throughput: 0: 5772.4. Samples: 20114254. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-09 01:03:47,551][25689] Avg episode reward: [(0, '-65.655')] [2022-07-09 01:03:48,215][26022] Updated weights on worker 0-0, policy_version 19642 (0.00088) [2022-07-09 01:03:49,979][26022] Updated weights on worker 0-0, policy_version 19652 (0.00086) [2022-07-09 01:03:51,784][26022] Updated weights on worker 0-0, policy_version 19662 (0.00081) [2022-07-09 01:03:52,677][25689] Fps is (10 sec: 5458.6, 60 sec: 5461.5, 300 sec: 5484.0). Total num frames: 20137984. Throughput: 0: 5764.6. Samples: 20147384. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 01:03:52,677][25689] Avg episode reward: [(0, '-66.714')] [2022-07-09 01:03:53,786][26022] Updated weights on worker 0-0, policy_version 19672 (0.00086) [2022-07-09 01:03:55,226][26022] Updated weights on worker 0-0, policy_version 19682 (0.00085) [2022-07-09 01:03:57,439][26022] Updated weights on worker 0-0, policy_version 19692 (0.00084) [2022-07-09 01:03:57,726][25689] Fps is (10 sec: 5535.0, 60 sec: 5492.6, 300 sec: 5487.3). Total num frames: 20166656. Throughput: 0: 5773.1. Samples: 20164196. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 01:03:57,726][25689] Avg episode reward: [(0, '-66.946')] [2022-07-09 01:03:59,327][26022] Updated weights on worker 0-0, policy_version 19702 (0.00351) [2022-07-09 01:04:00,878][26022] Updated weights on worker 0-0, policy_version 19712 (0.00085) [2022-07-09 01:04:02,778][25689] Fps is (10 sec: 5373.0, 60 sec: 5477.2, 300 sec: 5486.7). Total num frames: 20192256. Throughput: 0: 5772.2. Samples: 20197544. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 01:04:02,778][25689] Avg episode reward: [(0, '-66.742')] [2022-07-09 01:04:03,411][26022] Updated weights on worker 0-0, policy_version 19722 (0.00087) [2022-07-09 01:04:04,961][26022] Updated weights on worker 0-0, policy_version 19732 (0.00088) [2022-07-09 01:04:07,048][26022] Updated weights on worker 0-0, policy_version 19742 (0.00090) [2022-07-09 01:04:07,787][25689] Fps is (10 sec: 5394.4, 60 sec: 5495.7, 300 sec: 5488.4). Total num frames: 20220928. Throughput: 0: 5664.5. Samples: 20228880. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 01:04:07,787][25689] Avg episode reward: [(0, '-66.051')] [2022-07-09 01:04:08,855][26022] Updated weights on worker 0-0, policy_version 19752 (0.00072) [2022-07-09 01:04:10,523][26022] Updated weights on worker 0-0, policy_version 19762 (0.00083) [2022-07-09 01:04:12,523][26022] Updated weights on worker 0-0, policy_version 19772 (0.00084) [2022-07-09 01:04:12,837][25689] Fps is (10 sec: 5599.1, 60 sec: 5499.4, 300 sec: 5492.3). Total num frames: 20248576. Throughput: 0: 4871.8. Samples: 20245598. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 01:04:12,837][25689] Avg episode reward: [(0, '-65.012')] [2022-07-09 01:04:14,184][26022] Updated weights on worker 0-0, policy_version 19782 (0.00089) [2022-07-09 01:04:14,535][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:04:14,555][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000019783_20257792.pth [2022-07-09 01:04:14,567][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000017854_18282496.pth [2022-07-09 01:04:16,152][26022] Updated weights on worker 0-0, policy_version 19792 (0.00090) [2022-07-09 01:04:17,846][25689] Fps is (10 sec: 5395.2, 60 sec: 5503.6, 300 sec: 5486.4). Total num frames: 20275200. Throughput: 0: 5715.0. Samples: 20279182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:04:17,847][25689] Avg episode reward: [(0, '-64.356')] [2022-07-09 01:04:18,136][26022] Updated weights on worker 0-0, policy_version 19802 (0.00086) [2022-07-09 01:04:19,883][26022] Updated weights on worker 0-0, policy_version 19812 (0.00091) [2022-07-09 01:04:21,587][26022] Updated weights on worker 0-0, policy_version 19822 (0.00096) [2022-07-09 01:04:22,869][25689] Fps is (10 sec: 5511.6, 60 sec: 5520.6, 300 sec: 5493.1). Total num frames: 20303872. Throughput: 0: 5729.3. Samples: 20312654. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:04:22,871][25689] Avg episode reward: [(0, '-64.247')] [2022-07-09 01:04:23,485][26022] Updated weights on worker 0-0, policy_version 19832 (0.00084) [2022-07-09 01:04:25,220][26022] Updated weights on worker 0-0, policy_version 19842 (0.00084) [2022-07-09 01:04:27,244][26022] Updated weights on worker 0-0, policy_version 19852 (0.00106) [2022-07-09 01:04:27,877][25689] Fps is (10 sec: 5614.6, 60 sec: 5507.2, 300 sec: 5495.1). Total num frames: 20331520. Throughput: 0: 5003.6. Samples: 20329404. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:04:27,879][25689] Avg episode reward: [(0, '-63.350')] [2022-07-09 01:04:29,014][26022] Updated weights on worker 0-0, policy_version 19862 (0.00089) [2022-07-09 01:04:31,018][26022] Updated weights on worker 0-0, policy_version 19872 (0.00087) [2022-07-09 01:04:32,997][25689] Fps is (10 sec: 5459.8, 60 sec: 5483.4, 300 sec: 5490.0). Total num frames: 20359168. Throughput: 0: 5771.6. Samples: 20361956. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:04:32,999][25689] Avg episode reward: [(0, '-64.168')] [2022-07-09 01:04:33,001][26022] Updated weights on worker 0-0, policy_version 19882 (0.00098) [2022-07-09 01:04:34,588][26022] Updated weights on worker 0-0, policy_version 19892 (0.00114) [2022-07-09 01:04:36,524][26022] Updated weights on worker 0-0, policy_version 19902 (0.00091) [2022-07-09 01:04:38,030][25689] Fps is (10 sec: 5446.4, 60 sec: 5498.6, 300 sec: 5486.3). Total num frames: 20386816. Throughput: 0: 5759.4. Samples: 20395428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:04:38,032][25689] Avg episode reward: [(0, '-64.113')] [2022-07-09 01:04:38,558][26022] Updated weights on worker 0-0, policy_version 19912 (0.00086) [2022-07-09 01:04:40,062][26022] Updated weights on worker 0-0, policy_version 19922 (0.00090) [2022-07-09 01:04:42,198][26022] Updated weights on worker 0-0, policy_version 19932 (0.00089) [2022-07-09 01:04:43,068][25689] Fps is (10 sec: 5795.6, 60 sec: 5532.7, 300 sec: 5499.6). Total num frames: 20417536. Throughput: 0: 4920.9. Samples: 20412052. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:04:43,069][25689] Avg episode reward: [(0, '-64.148')] [2022-07-09 01:04:43,921][26022] Updated weights on worker 0-0, policy_version 19942 (0.00092) [2022-07-09 01:04:45,567][26022] Updated weights on worker 0-0, policy_version 19952 (0.00096) [2022-07-09 01:04:47,760][26022] Updated weights on worker 0-0, policy_version 19962 (0.00113) [2022-07-09 01:04:48,154][25689] Fps is (10 sec: 5563.0, 60 sec: 5491.5, 300 sec: 5492.0). Total num frames: 20443136. Throughput: 0: 5738.6. Samples: 20445764. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:04:48,154][25689] Avg episode reward: [(0, '-64.747')] [2022-07-09 01:04:49,411][26022] Updated weights on worker 0-0, policy_version 19972 (0.00095) [2022-07-09 01:04:51,120][26022] Updated weights on worker 0-0, policy_version 19982 (0.00095) [2022-07-09 01:04:53,040][26022] Updated weights on worker 0-0, policy_version 19992 (0.00086) [2022-07-09 01:04:53,274][25689] Fps is (10 sec: 5317.7, 60 sec: 5508.9, 300 sec: 5489.9). Total num frames: 20471808. Throughput: 0: 5783.2. Samples: 20479222. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:04:53,275][25689] Avg episode reward: [(0, '-65.013')] [2022-07-09 01:04:54,763][26022] Updated weights on worker 0-0, policy_version 20002 (0.00085) [2022-07-09 01:04:56,940][26022] Updated weights on worker 0-0, policy_version 20012 (0.00092) [2022-07-09 01:04:58,315][25689] Fps is (10 sec: 5542.7, 60 sec: 5492.7, 300 sec: 5486.1). Total num frames: 20499456. Throughput: 0: 4954.5. Samples: 20495932. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:04:58,316][25689] Avg episode reward: [(0, '-64.865')] [2022-07-09 01:04:58,707][26022] Updated weights on worker 0-0, policy_version 20022 (0.00081) [2022-07-09 01:05:00,461][26022] Updated weights on worker 0-0, policy_version 20032 (0.00090) [2022-07-09 01:05:02,823][26022] Updated weights on worker 0-0, policy_version 20042 (0.00083) [2022-07-09 01:05:03,385][25689] Fps is (10 sec: 5367.4, 60 sec: 5507.9, 300 sec: 5491.8). Total num frames: 20526080. Throughput: 0: 5682.5. Samples: 20527506. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:05:03,386][25689] Avg episode reward: [(0, '-64.457')] [2022-07-09 01:05:04,442][26022] Updated weights on worker 0-0, policy_version 20052 (0.00081) [2022-07-09 01:05:06,452][26022] Updated weights on worker 0-0, policy_version 20062 (0.00085) [2022-07-09 01:05:08,291][26022] Updated weights on worker 0-0, policy_version 20072 (0.00093) [2022-07-09 01:05:08,485][25689] Fps is (10 sec: 5336.4, 60 sec: 5482.8, 300 sec: 5487.1). Total num frames: 20553728. Throughput: 0: 5623.5. Samples: 20560098. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:05:08,486][25689] Avg episode reward: [(0, '-64.218')] [2022-07-09 01:05:10,093][26022] Updated weights on worker 0-0, policy_version 20082 (0.00087) [2022-07-09 01:05:11,952][26022] Updated weights on worker 0-0, policy_version 20092 (0.00086) [2022-07-09 01:05:13,569][25689] Fps is (10 sec: 5631.1, 60 sec: 5513.5, 300 sec: 5492.8). Total num frames: 20583424. Throughput: 0: 4803.8. Samples: 20576714. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:05:13,569][25689] Avg episode reward: [(0, '-64.934')] [2022-07-09 01:05:13,812][26022] Updated weights on worker 0-0, policy_version 20102 (0.00090) [2022-07-09 01:05:15,545][26022] Updated weights on worker 0-0, policy_version 20112 (0.00085) [2022-07-09 01:05:17,552][26022] Updated weights on worker 0-0, policy_version 20122 (0.00094) [2022-07-09 01:05:18,598][25689] Fps is (10 sec: 5670.5, 60 sec: 5528.6, 300 sec: 5496.3). Total num frames: 20611072. Throughput: 0: 5639.7. Samples: 20610322. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:05:18,598][25689] Avg episode reward: [(0, '-65.014')] [2022-07-09 01:05:19,100][26022] Updated weights on worker 0-0, policy_version 20132 (0.00087) [2022-07-09 01:05:21,295][26022] Updated weights on worker 0-0, policy_version 20142 (0.00092) [2022-07-09 01:05:23,119][26022] Updated weights on worker 0-0, policy_version 20152 (0.00938) [2022-07-09 01:05:23,636][25689] Fps is (10 sec: 5391.2, 60 sec: 5493.6, 300 sec: 5492.3). Total num frames: 20637696. Throughput: 0: 5737.3. Samples: 20643688. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:05:23,637][25689] Avg episode reward: [(0, '-64.078')] [2022-07-09 01:05:24,703][26022] Updated weights on worker 0-0, policy_version 20162 (0.00098) [2022-07-09 01:05:26,755][26022] Updated weights on worker 0-0, policy_version 20172 (0.00096) [2022-07-09 01:05:28,572][26022] Updated weights on worker 0-0, policy_version 20182 (0.00089) [2022-07-09 01:05:28,647][25689] Fps is (10 sec: 5502.8, 60 sec: 5510.1, 300 sec: 5493.3). Total num frames: 20666368. Throughput: 0: 4976.7. Samples: 20660436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:05:28,647][25689] Avg episode reward: [(0, '-64.053')] [2022-07-09 01:05:30,470][26022] Updated weights on worker 0-0, policy_version 20192 (0.00100) [2022-07-09 01:05:32,263][26022] Updated weights on worker 0-0, policy_version 20202 (0.00094) [2022-07-09 01:05:33,716][25689] Fps is (10 sec: 5587.1, 60 sec: 5514.7, 300 sec: 5492.3). Total num frames: 20694016. Throughput: 0: 5785.3. Samples: 20693270. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:05:33,717][25689] Avg episode reward: [(0, '-64.772')] [2022-07-09 01:05:34,025][26022] Updated weights on worker 0-0, policy_version 20212 (0.00089) [2022-07-09 01:05:36,034][26022] Updated weights on worker 0-0, policy_version 20222 (0.00088) [2022-07-09 01:05:37,712][26022] Updated weights on worker 0-0, policy_version 20232 (0.00087) [2022-07-09 01:05:38,736][25689] Fps is (10 sec: 5480.9, 60 sec: 5515.9, 300 sec: 5488.7). Total num frames: 20721664. Throughput: 0: 5778.1. Samples: 20726680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:05:38,736][25689] Avg episode reward: [(0, '-65.414')] [2022-07-09 01:05:39,591][26022] Updated weights on worker 0-0, policy_version 20242 (0.00098) [2022-07-09 01:05:41,466][26022] Updated weights on worker 0-0, policy_version 20252 (0.00082) [2022-07-09 01:05:43,069][26022] Updated weights on worker 0-0, policy_version 20262 (0.00088) [2022-07-09 01:05:43,784][25689] Fps is (10 sec: 5492.3, 60 sec: 5464.4, 300 sec: 5491.6). Total num frames: 20749312. Throughput: 0: 4963.5. Samples: 20743694. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:05:43,785][25689] Avg episode reward: [(0, '-65.335')] [2022-07-09 01:05:45,200][26022] Updated weights on worker 0-0, policy_version 20272 (0.00081) [2022-07-09 01:05:46,939][26022] Updated weights on worker 0-0, policy_version 20282 (0.00095) [2022-07-09 01:05:48,795][25689] Fps is (10 sec: 5700.7, 60 sec: 5538.7, 300 sec: 5503.5). Total num frames: 20779008. Throughput: 0: 5780.7. Samples: 20776906. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:05:48,795][26022] Updated weights on worker 0-0, policy_version 20292 (0.00084) [2022-07-09 01:05:48,796][25689] Avg episode reward: [(0, '-65.047')] [2022-07-09 01:05:50,720][26022] Updated weights on worker 0-0, policy_version 20302 (0.00105) [2022-07-09 01:05:52,632][26022] Updated weights on worker 0-0, policy_version 20312 (0.00085) [2022-07-09 01:05:53,922][25689] Fps is (10 sec: 5656.2, 60 sec: 5521.2, 300 sec: 5491.3). Total num frames: 20806656. Throughput: 0: 5776.3. Samples: 20809988. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:05:53,923][25689] Avg episode reward: [(0, '-65.107')] [2022-07-09 01:05:54,404][26022] Updated weights on worker 0-0, policy_version 20322 (0.00086) [2022-07-09 01:05:56,250][26022] Updated weights on worker 0-0, policy_version 20332 (0.00091) [2022-07-09 01:05:58,099][26022] Updated weights on worker 0-0, policy_version 20342 (0.00086) [2022-07-09 01:05:58,983][25689] Fps is (10 sec: 5327.1, 60 sec: 5502.5, 300 sec: 5490.6). Total num frames: 20833280. Throughput: 0: 5754.5. Samples: 20843192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:05:58,983][25689] Avg episode reward: [(0, '-64.678')] [2022-07-09 01:06:00,117][26022] Updated weights on worker 0-0, policy_version 20352 (0.00085) [2022-07-09 01:06:02,199][26022] Updated weights on worker 0-0, policy_version 20362 (0.00092) [2022-07-09 01:06:04,058][25689] Fps is (10 sec: 5253.5, 60 sec: 5502.1, 300 sec: 5492.9). Total num frames: 20859904. Throughput: 0: 5634.8. Samples: 20857934. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:06:04,059][25689] Avg episode reward: [(0, '-64.496')] [2022-07-09 01:06:04,138][26022] Updated weights on worker 0-0, policy_version 20372 (0.00079) [2022-07-09 01:06:05,818][26022] Updated weights on worker 0-0, policy_version 20382 (0.00098) [2022-07-09 01:06:07,837][26022] Updated weights on worker 0-0, policy_version 20392 (0.00081) [2022-07-09 01:06:09,064][25689] Fps is (10 sec: 5484.8, 60 sec: 5527.5, 300 sec: 5491.7). Total num frames: 20888576. Throughput: 0: 5646.5. Samples: 20891358. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:06:09,065][25689] Avg episode reward: [(0, '-64.290')] [2022-07-09 01:06:09,632][26022] Updated weights on worker 0-0, policy_version 20402 (0.00091) [2022-07-09 01:06:11,479][26022] Updated weights on worker 0-0, policy_version 20412 (0.00088) [2022-07-09 01:06:13,301][26022] Updated weights on worker 0-0, policy_version 20422 (0.00094) [2022-07-09 01:06:14,131][25689] Fps is (10 sec: 5591.4, 60 sec: 5495.2, 300 sec: 5498.1). Total num frames: 20916224. Throughput: 0: 5690.9. Samples: 20924992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:06:14,131][25689] Avg episode reward: [(0, '-64.198')] [2022-07-09 01:06:14,749][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:06:14,764][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000020429_20919296.pth [2022-07-09 01:06:14,765][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000018496_18939904.pth [2022-07-09 01:06:15,128][26022] Updated weights on worker 0-0, policy_version 20432 (0.00090) [2022-07-09 01:06:17,100][26022] Updated weights on worker 0-0, policy_version 20442 (0.00091) [2022-07-09 01:06:18,704][26022] Updated weights on worker 0-0, policy_version 20452 (0.00089) [2022-07-09 01:06:19,151][25689] Fps is (10 sec: 5685.3, 60 sec: 5529.9, 300 sec: 5495.9). Total num frames: 20945920. Throughput: 0: 4886.4. Samples: 20941742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:06:19,151][25689] Avg episode reward: [(0, '-63.914')] [2022-07-09 01:06:20,714][26022] Updated weights on worker 0-0, policy_version 20462 (0.00087) [2022-07-09 01:06:22,463][26022] Updated weights on worker 0-0, policy_version 20472 (0.00078) [2022-07-09 01:06:24,183][25689] Fps is (10 sec: 5501.0, 60 sec: 5513.5, 300 sec: 5492.3). Total num frames: 20971520. Throughput: 0: 5819.6. Samples: 20975052. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:06:24,183][25689] Avg episode reward: [(0, '-65.056')] [2022-07-09 01:06:24,296][26022] Updated weights on worker 0-0, policy_version 20482 (0.00114) [2022-07-09 01:06:26,192][26022] Updated weights on worker 0-0, policy_version 20492 (0.00093) [2022-07-09 01:06:27,964][26022] Updated weights on worker 0-0, policy_version 20502 (0.00084) [2022-07-09 01:06:29,186][25689] Fps is (10 sec: 5306.0, 60 sec: 5497.2, 300 sec: 5493.5). Total num frames: 20999168. Throughput: 0: 5809.2. Samples: 21008250. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:06:29,187][25689] Avg episode reward: [(0, '-65.086')] [2022-07-09 01:06:29,877][26022] Updated weights on worker 0-0, policy_version 20512 (0.00083) [2022-07-09 01:06:31,941][26022] Updated weights on worker 0-0, policy_version 20522 (0.00089) [2022-07-09 01:06:33,615][26022] Updated weights on worker 0-0, policy_version 20532 (0.00090) [2022-07-09 01:06:34,281][25689] Fps is (10 sec: 5678.5, 60 sec: 5528.7, 300 sec: 5498.7). Total num frames: 21028864. Throughput: 0: 4949.1. Samples: 21024718. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:06:34,282][25689] Avg episode reward: [(0, '-64.398')] [2022-07-09 01:06:35,565][26022] Updated weights on worker 0-0, policy_version 20542 (0.00087) [2022-07-09 01:06:37,247][26022] Updated weights on worker 0-0, policy_version 20552 (0.00096) [2022-07-09 01:06:39,053][26022] Updated weights on worker 0-0, policy_version 20562 (0.00085) [2022-07-09 01:06:39,318][25689] Fps is (10 sec: 5659.9, 60 sec: 5527.2, 300 sec: 5494.6). Total num frames: 21056512. Throughput: 0: 5774.8. Samples: 21058202. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 01:06:39,318][25689] Avg episode reward: [(0, '-64.744')] [2022-07-09 01:06:40,822][26022] Updated weights on worker 0-0, policy_version 20572 (0.00094) [2022-07-09 01:06:42,702][26022] Updated weights on worker 0-0, policy_version 20582 (0.00618) [2022-07-09 01:06:44,341][25689] Fps is (10 sec: 5496.8, 60 sec: 5529.5, 300 sec: 5498.2). Total num frames: 21084160. Throughput: 0: 5801.8. Samples: 21092006. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 01:06:44,342][25689] Avg episode reward: [(0, '-64.031')] [2022-07-09 01:06:44,544][26022] Updated weights on worker 0-0, policy_version 20592 (0.00093) [2022-07-09 01:06:46,424][26022] Updated weights on worker 0-0, policy_version 20602 (0.00098) [2022-07-09 01:06:48,202][26022] Updated weights on worker 0-0, policy_version 20612 (0.00096) [2022-07-09 01:06:49,371][25689] Fps is (10 sec: 5500.4, 60 sec: 5493.9, 300 sec: 5495.1). Total num frames: 21111808. Throughput: 0: 4972.8. Samples: 21108624. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 01:06:49,371][25689] Avg episode reward: [(0, '-64.330')] [2022-07-09 01:06:50,334][26022] Updated weights on worker 0-0, policy_version 20622 (0.00098) [2022-07-09 01:06:52,025][26022] Updated weights on worker 0-0, policy_version 20632 (0.00089) [2022-07-09 01:06:53,921][26022] Updated weights on worker 0-0, policy_version 20642 (0.00086) [2022-07-09 01:06:54,503][25689] Fps is (10 sec: 5642.8, 60 sec: 5527.3, 300 sec: 5503.1). Total num frames: 21141504. Throughput: 0: 5791.8. Samples: 21141840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 01:06:54,504][25689] Avg episode reward: [(0, '-64.511')] [2022-07-09 01:06:55,715][26022] Updated weights on worker 0-0, policy_version 20652 (0.00091) [2022-07-09 01:06:57,605][26022] Updated weights on worker 0-0, policy_version 20662 (0.00089) [2022-07-09 01:06:59,576][25689] Fps is (10 sec: 5519.0, 60 sec: 5526.2, 300 sec: 5505.2). Total num frames: 21168128. Throughput: 0: 5769.7. Samples: 21175084. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 01:06:59,576][25689] Avg episode reward: [(0, '-64.957')] [2022-07-09 01:06:59,582][26022] Updated weights on worker 0-0, policy_version 20672 (0.00083) [2022-07-09 01:07:01,352][26022] Updated weights on worker 0-0, policy_version 20682 (0.00093) [2022-07-09 01:07:03,524][26022] Updated weights on worker 0-0, policy_version 20692 (0.00088) [2022-07-09 01:07:04,597][25689] Fps is (10 sec: 5072.6, 60 sec: 5497.3, 300 sec: 5498.0). Total num frames: 21192704. Throughput: 0: 4909.4. Samples: 21191450. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 01:07:04,597][25689] Avg episode reward: [(0, '-64.912')] [2022-07-09 01:07:05,422][26022] Updated weights on worker 0-0, policy_version 20702 (0.00085) [2022-07-09 01:07:07,199][26022] Updated weights on worker 0-0, policy_version 20712 (0.00089) [2022-07-09 01:07:09,273][26022] Updated weights on worker 0-0, policy_version 20722 (0.00090) [2022-07-09 01:07:09,625][25689] Fps is (10 sec: 5400.4, 60 sec: 5512.2, 300 sec: 5498.1). Total num frames: 21222400. Throughput: 0: 5616.9. Samples: 21222392. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:07:09,626][25689] Avg episode reward: [(0, '-65.275')] [2022-07-09 01:07:11,070][26022] Updated weights on worker 0-0, policy_version 20732 (0.00091) [2022-07-09 01:07:12,787][26022] Updated weights on worker 0-0, policy_version 20742 (0.00049) [2022-07-09 01:07:14,675][25689] Fps is (10 sec: 5588.0, 60 sec: 5496.8, 300 sec: 5497.5). Total num frames: 21249024. Throughput: 0: 5644.3. Samples: 21255698. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:07:14,676][25689] Avg episode reward: [(0, '-65.532')] [2022-07-09 01:07:14,835][26022] Updated weights on worker 0-0, policy_version 20752 (0.00088) [2022-07-09 01:07:16,367][26022] Updated weights on worker 0-0, policy_version 20762 (0.00096) [2022-07-09 01:07:18,566][26022] Updated weights on worker 0-0, policy_version 20772 (0.00098) [2022-07-09 01:07:19,694][25689] Fps is (10 sec: 5492.0, 60 sec: 5480.0, 300 sec: 5497.5). Total num frames: 21277696. Throughput: 0: 4835.2. Samples: 21272358. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:07:19,694][25689] Avg episode reward: [(0, '-65.610')] [2022-07-09 01:07:20,094][26022] Updated weights on worker 0-0, policy_version 20782 (0.00088) [2022-07-09 01:07:22,069][26022] Updated weights on worker 0-0, policy_version 20792 (0.00095) [2022-07-09 01:07:23,949][26022] Updated weights on worker 0-0, policy_version 20802 (0.00089) [2022-07-09 01:07:24,703][25689] Fps is (10 sec: 5514.5, 60 sec: 5499.0, 300 sec: 5497.8). Total num frames: 21304320. Throughput: 0: 5666.1. Samples: 21305372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:07:24,703][25689] Avg episode reward: [(0, '-65.492')] [2022-07-09 01:07:25,871][26022] Updated weights on worker 0-0, policy_version 20812 (0.00094) [2022-07-09 01:07:27,682][26022] Updated weights on worker 0-0, policy_version 20822 (0.00088) [2022-07-09 01:07:29,506][26022] Updated weights on worker 0-0, policy_version 20832 (0.00085) [2022-07-09 01:07:29,727][25689] Fps is (10 sec: 5409.3, 60 sec: 5497.1, 300 sec: 5499.2). Total num frames: 21331968. Throughput: 0: 5768.9. Samples: 21338356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:07:29,727][25689] Avg episode reward: [(0, '-64.715')] [2022-07-09 01:07:31,366][26022] Updated weights on worker 0-0, policy_version 20842 (0.00097) [2022-07-09 01:07:33,288][26022] Updated weights on worker 0-0, policy_version 20852 (0.00090) [2022-07-09 01:07:34,825][25689] Fps is (10 sec: 5665.3, 60 sec: 5496.9, 300 sec: 5501.2). Total num frames: 21361664. Throughput: 0: 4920.8. Samples: 21354848. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:07:34,825][25689] Avg episode reward: [(0, '-64.968')] [2022-07-09 01:07:35,019][26022] Updated weights on worker 0-0, policy_version 20862 (0.00093) [2022-07-09 01:07:36,971][26022] Updated weights on worker 0-0, policy_version 20872 (0.00091) [2022-07-09 01:07:38,903][26022] Updated weights on worker 0-0, policy_version 20882 (0.00100) [2022-07-09 01:07:39,872][25689] Fps is (10 sec: 5652.5, 60 sec: 5495.9, 300 sec: 5497.5). Total num frames: 21389312. Throughput: 0: 5741.1. Samples: 21388202. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 01:07:39,872][25689] Avg episode reward: [(0, '-65.227')] [2022-07-09 01:07:40,758][26022] Updated weights on worker 0-0, policy_version 20892 (0.00092) [2022-07-09 01:07:42,519][26022] Updated weights on worker 0-0, policy_version 20902 (0.00087) [2022-07-09 01:07:44,549][26022] Updated weights on worker 0-0, policy_version 20912 (0.00067) [2022-07-09 01:07:44,913][25689] Fps is (10 sec: 5379.4, 60 sec: 5477.3, 300 sec: 5500.3). Total num frames: 21415936. Throughput: 0: 5741.5. Samples: 21421412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 01:07:44,923][25689] Avg episode reward: [(0, '-65.102')] [2022-07-09 01:07:46,184][26022] Updated weights on worker 0-0, policy_version 20922 (0.00095) [2022-07-09 01:07:48,242][26022] Updated weights on worker 0-0, policy_version 20932 (0.00731) [2022-07-09 01:07:49,763][26022] Updated weights on worker 0-0, policy_version 20942 (0.00105) [2022-07-09 01:07:49,927][25689] Fps is (10 sec: 5601.2, 60 sec: 5512.7, 300 sec: 5501.5). Total num frames: 21445632. Throughput: 0: 4939.5. Samples: 21438134. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 01:07:49,927][25689] Avg episode reward: [(0, '-65.032')] [2022-07-09 01:07:51,878][26022] Updated weights on worker 0-0, policy_version 20952 (0.00084) [2022-07-09 01:07:53,447][26022] Updated weights on worker 0-0, policy_version 20962 (0.00084) [2022-07-09 01:07:55,047][25689] Fps is (10 sec: 5558.0, 60 sec: 5463.0, 300 sec: 5499.6). Total num frames: 21472256. Throughput: 0: 5785.2. Samples: 21471836. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 01:07:55,047][25689] Avg episode reward: [(0, '-64.680')] [2022-07-09 01:07:55,443][26022] Updated weights on worker 0-0, policy_version 20972 (0.00090) [2022-07-09 01:07:57,231][26022] Updated weights on worker 0-0, policy_version 20982 (0.00085) [2022-07-09 01:07:59,010][26022] Updated weights on worker 0-0, policy_version 20992 (0.00092) [2022-07-09 01:08:00,096][25689] Fps is (10 sec: 5235.9, 60 sec: 5465.1, 300 sec: 5500.0). Total num frames: 21498880. Throughput: 0: 5777.1. Samples: 21505042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 01:08:00,107][25689] Avg episode reward: [(0, '-65.579')] [2022-07-09 01:08:00,879][26022] Updated weights on worker 0-0, policy_version 21002 (0.00111) [2022-07-09 01:08:03,383][26022] Updated weights on worker 0-0, policy_version 21012 (0.00082) [2022-07-09 01:08:05,047][26022] Updated weights on worker 0-0, policy_version 21022 (0.00102) [2022-07-09 01:08:05,144][25689] Fps is (10 sec: 5374.8, 60 sec: 5513.4, 300 sec: 5499.6). Total num frames: 21526528. Throughput: 0: 4896.8. Samples: 21520478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 01:08:05,145][25689] Avg episode reward: [(0, '-65.632')] [2022-07-09 01:08:06,964][26022] Updated weights on worker 0-0, policy_version 21032 (0.00055) [2022-07-09 01:08:08,732][26022] Updated weights on worker 0-0, policy_version 21042 (0.00086) [2022-07-09 01:08:10,170][25689] Fps is (10 sec: 5590.7, 60 sec: 5496.7, 300 sec: 5504.2). Total num frames: 21555200. Throughput: 0: 5673.4. Samples: 21552984. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 01:08:10,171][25689] Avg episode reward: [(0, '-65.100')] [2022-07-09 01:08:10,708][26022] Updated weights on worker 0-0, policy_version 21052 (0.00094) [2022-07-09 01:08:12,494][26022] Updated weights on worker 0-0, policy_version 21062 (0.00085) [2022-07-09 01:08:14,459][26022] Updated weights on worker 0-0, policy_version 21072 (0.00086) [2022-07-09 01:08:14,818][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:08:14,830][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000021074_21579776.pth [2022-07-09 01:08:14,831][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000019140_19599360.pth [2022-07-09 01:08:15,226][25689] Fps is (10 sec: 5484.9, 60 sec: 5496.2, 300 sec: 5504.2). Total num frames: 21581824. Throughput: 0: 5659.5. Samples: 21586040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 01:08:15,226][25689] Avg episode reward: [(0, '-65.590')] [2022-07-09 01:08:16,079][26022] Updated weights on worker 0-0, policy_version 21082 (0.00079) [2022-07-09 01:08:18,135][26022] Updated weights on worker 0-0, policy_version 21092 (0.00088) [2022-07-09 01:08:19,903][26022] Updated weights on worker 0-0, policy_version 21102 (0.00091) [2022-07-09 01:08:20,263][25689] Fps is (10 sec: 5377.5, 60 sec: 5477.7, 300 sec: 5504.0). Total num frames: 21609472. Throughput: 0: 5653.7. Samples: 21619056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 01:08:20,263][25689] Avg episode reward: [(0, '-66.382')] [2022-07-09 01:08:21,857][26022] Updated weights on worker 0-0, policy_version 21112 (0.00090) [2022-07-09 01:08:23,711][26022] Updated weights on worker 0-0, policy_version 21122 (0.00088) [2022-07-09 01:08:25,271][25689] Fps is (10 sec: 5504.9, 60 sec: 5494.6, 300 sec: 5501.2). Total num frames: 21637120. Throughput: 0: 5722.1. Samples: 21635644. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 01:08:25,271][25689] Avg episode reward: [(0, '-65.701')] [2022-07-09 01:08:25,456][26022] Updated weights on worker 0-0, policy_version 21132 (0.00086) [2022-07-09 01:08:27,506][26022] Updated weights on worker 0-0, policy_version 21142 (0.00090) [2022-07-09 01:08:29,225][26022] Updated weights on worker 0-0, policy_version 21152 (0.00051) [2022-07-09 01:08:30,307][25689] Fps is (10 sec: 5505.1, 60 sec: 5493.5, 300 sec: 5497.9). Total num frames: 21664768. Throughput: 0: 5741.4. Samples: 21668598. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 01:08:30,308][25689] Avg episode reward: [(0, '-65.823')] [2022-07-09 01:08:31,241][26022] Updated weights on worker 0-0, policy_version 21162 (0.00084) [2022-07-09 01:08:32,930][26022] Updated weights on worker 0-0, policy_version 21172 (0.00101) [2022-07-09 01:08:34,940][26022] Updated weights on worker 0-0, policy_version 21182 (0.00091) [2022-07-09 01:08:35,360][25689] Fps is (10 sec: 5582.4, 60 sec: 5480.7, 300 sec: 5504.1). Total num frames: 21693440. Throughput: 0: 5745.3. Samples: 21701716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 01:08:35,360][25689] Avg episode reward: [(0, '-65.746')] [2022-07-09 01:08:36,698][26022] Updated weights on worker 0-0, policy_version 21192 (0.00089) [2022-07-09 01:08:38,498][26022] Updated weights on worker 0-0, policy_version 21202 (0.00083) [2022-07-09 01:08:40,357][26022] Updated weights on worker 0-0, policy_version 21212 (0.00088) [2022-07-09 01:08:40,365][25689] Fps is (10 sec: 5599.3, 60 sec: 5484.5, 300 sec: 5501.3). Total num frames: 21721088. Throughput: 0: 4950.0. Samples: 21718564. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 01:08:40,366][25689] Avg episode reward: [(0, '-65.578')] [2022-07-09 01:08:42,103][26022] Updated weights on worker 0-0, policy_version 21222 (0.00083) [2022-07-09 01:08:44,216][26022] Updated weights on worker 0-0, policy_version 21232 (0.00089) [2022-07-09 01:08:45,375][25689] Fps is (10 sec: 5521.0, 60 sec: 5504.3, 300 sec: 5501.2). Total num frames: 21748736. Throughput: 0: 5783.3. Samples: 21751914. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 01:08:45,377][25689] Avg episode reward: [(0, '-64.312')] [2022-07-09 01:08:46,038][26022] Updated weights on worker 0-0, policy_version 21242 (0.00089) [2022-07-09 01:08:47,746][26022] Updated weights on worker 0-0, policy_version 21252 (0.00087) [2022-07-09 01:08:49,562][26022] Updated weights on worker 0-0, policy_version 21262 (0.00104) [2022-07-09 01:08:50,390][25689] Fps is (10 sec: 5311.6, 60 sec: 5436.4, 300 sec: 5496.4). Total num frames: 21774336. Throughput: 0: 5806.6. Samples: 21785212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 01:08:50,391][25689] Avg episode reward: [(0, '-63.882')] [2022-07-09 01:08:51,487][26022] Updated weights on worker 0-0, policy_version 21272 (0.00088) [2022-07-09 01:08:53,519][26022] Updated weights on worker 0-0, policy_version 21282 (0.00081) [2022-07-09 01:08:55,372][26022] Updated weights on worker 0-0, policy_version 21292 (0.00095) [2022-07-09 01:08:55,465][25689] Fps is (10 sec: 5378.8, 60 sec: 5474.3, 300 sec: 5495.9). Total num frames: 21803008. Throughput: 0: 4958.1. Samples: 21801402. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 01:08:55,466][25689] Avg episode reward: [(0, '-63.718')] [2022-07-09 01:08:57,015][26022] Updated weights on worker 0-0, policy_version 21302 (0.00090) [2022-07-09 01:08:59,121][26022] Updated weights on worker 0-0, policy_version 21312 (0.00280) [2022-07-09 01:09:00,533][25689] Fps is (10 sec: 5754.4, 60 sec: 5523.5, 300 sec: 5509.4). Total num frames: 21832704. Throughput: 0: 5767.8. Samples: 21834888. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 01:09:00,534][25689] Avg episode reward: [(0, '-63.642')] [2022-07-09 01:09:00,679][26022] Updated weights on worker 0-0, policy_version 21322 (0.00081) [2022-07-09 01:09:03,307][26022] Updated weights on worker 0-0, policy_version 21332 (0.00090) [2022-07-09 01:09:04,655][26022] Updated weights on worker 0-0, policy_version 21342 (0.00088) [2022-07-09 01:09:05,603][25689] Fps is (10 sec: 5353.3, 60 sec: 5470.7, 300 sec: 5494.5). Total num frames: 21857280. Throughput: 0: 5646.0. Samples: 21866122. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 01:09:05,605][25689] Avg episode reward: [(0, '-63.954')] [2022-07-09 01:09:06,830][26022] Updated weights on worker 0-0, policy_version 21352 (0.00087) [2022-07-09 01:09:08,459][26022] Updated weights on worker 0-0, policy_version 21362 (0.00089) [2022-07-09 01:09:10,452][26022] Updated weights on worker 0-0, policy_version 21372 (0.00091) [2022-07-09 01:09:10,639][25689] Fps is (10 sec: 5269.3, 60 sec: 5469.8, 300 sec: 5498.2). Total num frames: 21885952. Throughput: 0: 4818.5. Samples: 21882778. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 01:09:10,640][25689] Avg episode reward: [(0, '-63.064')] [2022-07-09 01:09:12,294][26022] Updated weights on worker 0-0, policy_version 21382 (0.00089) [2022-07-09 01:09:14,125][26022] Updated weights on worker 0-0, policy_version 21392 (0.00122) [2022-07-09 01:09:15,685][25689] Fps is (10 sec: 5586.6, 60 sec: 5487.6, 300 sec: 5501.0). Total num frames: 21913600. Throughput: 0: 5633.9. Samples: 21915316. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 01:09:15,685][25689] Avg episode reward: [(0, '-63.383')] [2022-07-09 01:09:16,008][26022] Updated weights on worker 0-0, policy_version 21402 (0.00095) [2022-07-09 01:09:17,977][26022] Updated weights on worker 0-0, policy_version 21412 (0.00089) [2022-07-09 01:09:19,735][26022] Updated weights on worker 0-0, policy_version 21422 (0.00088) [2022-07-09 01:09:20,723][25689] Fps is (10 sec: 5381.9, 60 sec: 5470.6, 300 sec: 5493.8). Total num frames: 21940224. Throughput: 0: 5602.3. Samples: 21947996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 01:09:20,723][25689] Avg episode reward: [(0, '-64.422')] [2022-07-09 01:09:21,719][26022] Updated weights on worker 0-0, policy_version 21432 (0.00094) [2022-07-09 01:09:23,534][26022] Updated weights on worker 0-0, policy_version 21442 (0.00088) [2022-07-09 01:09:25,652][26022] Updated weights on worker 0-0, policy_version 21452 (0.00089) [2022-07-09 01:09:25,805][25689] Fps is (10 sec: 5362.5, 60 sec: 5463.8, 300 sec: 5492.4). Total num frames: 21967872. Throughput: 0: 4867.2. Samples: 21964450. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 01:09:25,806][25689] Avg episode reward: [(0, '-64.295')] [2022-07-09 01:09:27,464][26022] Updated weights on worker 0-0, policy_version 21462 (0.00093) [2022-07-09 01:09:29,217][26022] Updated weights on worker 0-0, policy_version 21472 (0.00053) [2022-07-09 01:09:30,892][25689] Fps is (10 sec: 5437.5, 60 sec: 5459.3, 300 sec: 5493.0). Total num frames: 21995520. Throughput: 0: 5646.0. Samples: 21997130. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 01:09:30,893][25689] Avg episode reward: [(0, '-64.274')] [2022-07-09 01:09:31,072][26022] Updated weights on worker 0-0, policy_version 21482 (0.00082) [2022-07-09 01:09:32,908][26022] Updated weights on worker 0-0, policy_version 21492 (0.00083) [2022-07-09 01:09:34,987][26022] Updated weights on worker 0-0, policy_version 21502 (0.00088) [2022-07-09 01:09:35,964][25689] Fps is (10 sec: 5443.1, 60 sec: 5440.6, 300 sec: 5492.3). Total num frames: 22023168. Throughput: 0: 5659.5. Samples: 22030088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:09:35,965][25689] Avg episode reward: [(0, '-65.268')] [2022-07-09 01:09:36,625][26022] Updated weights on worker 0-0, policy_version 21512 (0.00086) [2022-07-09 01:09:38,672][26022] Updated weights on worker 0-0, policy_version 21522 (0.00086) [2022-07-09 01:09:40,304][26022] Updated weights on worker 0-0, policy_version 21532 (0.00088) [2022-07-09 01:09:41,018][25689] Fps is (10 sec: 5461.0, 60 sec: 5436.3, 300 sec: 5481.7). Total num frames: 22050816. Throughput: 0: 4873.9. Samples: 22046912. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:09:41,019][25689] Avg episode reward: [(0, '-66.095')] [2022-07-09 01:09:42,339][26022] Updated weights on worker 0-0, policy_version 21542 (0.00091) [2022-07-09 01:09:44,093][26022] Updated weights on worker 0-0, policy_version 21552 (0.00103) [2022-07-09 01:09:45,916][26022] Updated weights on worker 0-0, policy_version 21562 (0.00088) [2022-07-09 01:09:46,056][25689] Fps is (10 sec: 5682.1, 60 sec: 5467.5, 300 sec: 5496.3). Total num frames: 22080512. Throughput: 0: 5724.1. Samples: 22080368. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:09:46,057][25689] Avg episode reward: [(0, '-66.668')] [2022-07-09 01:09:47,906][26022] Updated weights on worker 0-0, policy_version 21572 (0.00084) [2022-07-09 01:09:49,587][26022] Updated weights on worker 0-0, policy_version 21582 (0.00114) [2022-07-09 01:09:51,140][25689] Fps is (10 sec: 5564.3, 60 sec: 5478.2, 300 sec: 5490.2). Total num frames: 22107136. Throughput: 0: 5764.8. Samples: 22113852. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:09:51,141][25689] Avg episode reward: [(0, '-65.553')] [2022-07-09 01:09:51,625][26022] Updated weights on worker 0-0, policy_version 21592 (0.00091) [2022-07-09 01:09:53,215][26022] Updated weights on worker 0-0, policy_version 21602 (0.00107) [2022-07-09 01:09:55,156][26022] Updated weights on worker 0-0, policy_version 21612 (0.00092) [2022-07-09 01:09:56,194][25689] Fps is (10 sec: 5555.4, 60 sec: 5497.0, 300 sec: 5496.8). Total num frames: 22136832. Throughput: 0: 4970.6. Samples: 22130642. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:09:56,195][25689] Avg episode reward: [(0, '-65.668')] [2022-07-09 01:09:57,011][26022] Updated weights on worker 0-0, policy_version 21622 (0.00089) [2022-07-09 01:09:58,631][26022] Updated weights on worker 0-0, policy_version 21632 (0.00087) [2022-07-09 01:10:00,780][26022] Updated weights on worker 0-0, policy_version 21642 (0.00085) [2022-07-09 01:10:01,201][25689] Fps is (10 sec: 5699.3, 60 sec: 5468.7, 300 sec: 5501.4). Total num frames: 22164480. Throughput: 0: 5800.7. Samples: 22163988. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:10:01,202][25689] Avg episode reward: [(0, '-64.055')] [2022-07-09 01:10:02,812][26022] Updated weights on worker 0-0, policy_version 21652 (0.00094) [2022-07-09 01:10:04,627][26022] Updated weights on worker 0-0, policy_version 21662 (0.00092) [2022-07-09 01:10:06,213][25689] Fps is (10 sec: 5416.7, 60 sec: 5507.7, 300 sec: 5499.6). Total num frames: 22191104. Throughput: 0: 5706.5. Samples: 22195394. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 01:10:06,214][25689] Avg episode reward: [(0, '-63.772')] [2022-07-09 01:10:06,309][26022] Updated weights on worker 0-0, policy_version 21672 (0.00096) [2022-07-09 01:10:08,379][26022] Updated weights on worker 0-0, policy_version 21682 (0.00095) [2022-07-09 01:10:10,204][26022] Updated weights on worker 0-0, policy_version 21692 (0.00092) [2022-07-09 01:10:11,225][25689] Fps is (10 sec: 5312.0, 60 sec: 5476.1, 300 sec: 5490.6). Total num frames: 22217728. Throughput: 0: 4895.6. Samples: 22212184. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 01:10:11,226][25689] Avg episode reward: [(0, '-63.568')] [2022-07-09 01:10:11,998][26022] Updated weights on worker 0-0, policy_version 21702 (0.00088) [2022-07-09 01:10:13,774][26022] Updated weights on worker 0-0, policy_version 21712 (0.00089) [2022-07-09 01:10:14,941][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:10:14,954][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000021718_22239232.pth [2022-07-09 01:10:14,955][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000019783_20257792.pth [2022-07-09 01:10:15,608][26022] Updated weights on worker 0-0, policy_version 21722 (0.00091) [2022-07-09 01:10:16,304][25689] Fps is (10 sec: 5479.7, 60 sec: 5490.0, 300 sec: 5493.1). Total num frames: 22246400. Throughput: 0: 5718.3. Samples: 22245638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 01:10:16,305][25689] Avg episode reward: [(0, '-64.170')] [2022-07-09 01:10:17,360][26022] Updated weights on worker 0-0, policy_version 21732 (0.00093) [2022-07-09 01:10:19,373][26022] Updated weights on worker 0-0, policy_version 21742 (0.00088) [2022-07-09 01:10:21,260][26022] Updated weights on worker 0-0, policy_version 21752 (0.00090) [2022-07-09 01:10:21,323][25689] Fps is (10 sec: 5679.1, 60 sec: 5525.6, 300 sec: 5500.3). Total num frames: 22275072. Throughput: 0: 5749.6. Samples: 22279678. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 01:10:21,323][25689] Avg episode reward: [(0, '-63.535')] [2022-07-09 01:10:22,932][26022] Updated weights on worker 0-0, policy_version 21762 (0.00089) [2022-07-09 01:10:24,796][26022] Updated weights on worker 0-0, policy_version 21772 (0.00093) [2022-07-09 01:10:26,375][25689] Fps is (10 sec: 5694.0, 60 sec: 5545.2, 300 sec: 5499.6). Total num frames: 22303744. Throughput: 0: 5008.4. Samples: 22296374. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 01:10:26,376][25689] Avg episode reward: [(0, '-63.960')] [2022-07-09 01:10:26,565][26022] Updated weights on worker 0-0, policy_version 21782 (0.00088) [2022-07-09 01:10:28,419][26022] Updated weights on worker 0-0, policy_version 21792 (0.00088) [2022-07-09 01:10:30,240][26022] Updated weights on worker 0-0, policy_version 21802 (0.00094) [2022-07-09 01:10:31,416][25689] Fps is (10 sec: 5579.8, 60 sec: 5549.4, 300 sec: 5500.1). Total num frames: 22331392. Throughput: 0: 5841.6. Samples: 22330132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 01:10:31,417][25689] Avg episode reward: [(0, '-64.437')] [2022-07-09 01:10:32,128][26022] Updated weights on worker 0-0, policy_version 21812 (0.00097) [2022-07-09 01:10:34,025][26022] Updated weights on worker 0-0, policy_version 21822 (0.00084) [2022-07-09 01:10:35,947][26022] Updated weights on worker 0-0, policy_version 21832 (0.00098) [2022-07-09 01:10:36,497][25689] Fps is (10 sec: 5463.1, 60 sec: 5548.6, 300 sec: 5499.0). Total num frames: 22359040. Throughput: 0: 5821.1. Samples: 22363182. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:10:36,498][25689] Avg episode reward: [(0, '-63.967')] [2022-07-09 01:10:37,481][26022] Updated weights on worker 0-0, policy_version 21842 (0.00083) [2022-07-09 01:10:39,467][26022] Updated weights on worker 0-0, policy_version 21852 (0.00086) [2022-07-09 01:10:41,383][26022] Updated weights on worker 0-0, policy_version 21862 (0.00085) [2022-07-09 01:10:41,588][25689] Fps is (10 sec: 5436.2, 60 sec: 5545.2, 300 sec: 5498.2). Total num frames: 22386688. Throughput: 0: 5775.6. Samples: 22396722. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:10:41,589][25689] Avg episode reward: [(0, '-63.805')] [2022-07-09 01:10:43,220][26022] Updated weights on worker 0-0, policy_version 21872 (0.00087) [2022-07-09 01:10:45,034][26022] Updated weights on worker 0-0, policy_version 21882 (0.00095) [2022-07-09 01:10:46,666][25689] Fps is (10 sec: 5639.1, 60 sec: 5541.5, 300 sec: 5496.9). Total num frames: 22416384. Throughput: 0: 5769.3. Samples: 22413438. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:10:46,667][25689] Avg episode reward: [(0, '-63.358')] [2022-07-09 01:10:46,810][26022] Updated weights on worker 0-0, policy_version 21892 (0.00088) [2022-07-09 01:10:48,675][26022] Updated weights on worker 0-0, policy_version 21902 (0.00092) [2022-07-09 01:10:50,452][26022] Updated weights on worker 0-0, policy_version 21912 (0.00096) [2022-07-09 01:10:51,679][25689] Fps is (10 sec: 5784.3, 60 sec: 5581.9, 300 sec: 5502.5). Total num frames: 22445056. Throughput: 0: 5754.5. Samples: 22446732. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:10:51,679][25689] Avg episode reward: [(0, '-63.721')] [2022-07-09 01:10:52,470][26022] Updated weights on worker 0-0, policy_version 21922 (0.00102) [2022-07-09 01:10:54,256][26022] Updated weights on worker 0-0, policy_version 21932 (0.00089) [2022-07-09 01:10:56,131][26022] Updated weights on worker 0-0, policy_version 21942 (0.00098) [2022-07-09 01:10:56,784][25689] Fps is (10 sec: 5465.2, 60 sec: 5526.5, 300 sec: 5501.7). Total num frames: 22471680. Throughput: 0: 5776.0. Samples: 22480360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:10:56,784][25689] Avg episode reward: [(0, '-63.443')] [2022-07-09 01:10:57,694][26022] Updated weights on worker 0-0, policy_version 21952 (0.00088) [2022-07-09 01:10:59,564][26022] Updated weights on worker 0-0, policy_version 21962 (0.00090) [2022-07-09 01:11:01,603][26022] Updated weights on worker 0-0, policy_version 21972 (0.00083) [2022-07-09 01:11:01,796][25689] Fps is (10 sec: 5364.2, 60 sec: 5526.0, 300 sec: 5506.3). Total num frames: 22499328. Throughput: 0: 4976.5. Samples: 22497288. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:11:01,797][25689] Avg episode reward: [(0, '-63.745')] [2022-07-09 01:11:03,955][26022] Updated weights on worker 0-0, policy_version 21982 (0.00092) [2022-07-09 01:11:05,647][26022] Updated weights on worker 0-0, policy_version 21992 (0.00084) [2022-07-09 01:11:06,878][25689] Fps is (10 sec: 5478.0, 60 sec: 5536.5, 300 sec: 5501.4). Total num frames: 22526976. Throughput: 0: 5708.0. Samples: 22528808. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:11:06,879][25689] Avg episode reward: [(0, '-63.991')] [2022-07-09 01:11:07,374][26022] Updated weights on worker 0-0, policy_version 22002 (0.00086) [2022-07-09 01:11:09,407][26022] Updated weights on worker 0-0, policy_version 22012 (0.00093) [2022-07-09 01:11:11,094][26022] Updated weights on worker 0-0, policy_version 22022 (0.00083) [2022-07-09 01:11:11,899][25689] Fps is (10 sec: 5270.5, 60 sec: 5518.8, 300 sec: 5495.4). Total num frames: 22552576. Throughput: 0: 5686.9. Samples: 22561724. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:11:11,908][25689] Avg episode reward: [(0, '-64.566')] [2022-07-09 01:11:12,971][26022] Updated weights on worker 0-0, policy_version 22032 (0.00083) [2022-07-09 01:11:14,937][26022] Updated weights on worker 0-0, policy_version 22042 (0.00086) [2022-07-09 01:11:16,679][26022] Updated weights on worker 0-0, policy_version 22052 (0.00092) [2022-07-09 01:11:17,026][25689] Fps is (10 sec: 5550.2, 60 sec: 5548.3, 300 sec: 5496.9). Total num frames: 22583296. Throughput: 0: 4844.3. Samples: 22578416. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:11:17,026][25689] Avg episode reward: [(0, '-63.880')] [2022-07-09 01:11:18,624][26022] Updated weights on worker 0-0, policy_version 22062 (0.00083) [2022-07-09 01:11:20,524][26022] Updated weights on worker 0-0, policy_version 22072 (0.00095) [2022-07-09 01:11:22,032][25689] Fps is (10 sec: 5861.3, 60 sec: 5549.3, 300 sec: 5507.6). Total num frames: 22611968. Throughput: 0: 5675.3. Samples: 22612134. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:11:22,034][25689] Avg episode reward: [(0, '-63.973')] [2022-07-09 01:11:22,034][26022] Updated weights on worker 0-0, policy_version 22082 (0.00092) [2022-07-09 01:11:24,171][26022] Updated weights on worker 0-0, policy_version 22092 (0.00080) [2022-07-09 01:11:25,481][26022] Updated weights on worker 0-0, policy_version 22102 (0.00069) [2022-07-09 01:11:27,047][25689] Fps is (10 sec: 5313.1, 60 sec: 5485.2, 300 sec: 5497.1). Total num frames: 22636544. Throughput: 0: 5783.0. Samples: 22645446. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:11:27,049][25689] Avg episode reward: [(0, '-63.321')] [2022-07-09 01:11:27,696][26022] Updated weights on worker 0-0, policy_version 22112 (0.00091) [2022-07-09 01:11:29,360][26022] Updated weights on worker 0-0, policy_version 22122 (0.00091) [2022-07-09 01:11:31,409][26022] Updated weights on worker 0-0, policy_version 22132 (0.00089) [2022-07-09 01:11:32,083][25689] Fps is (10 sec: 5501.5, 60 sec: 5536.3, 300 sec: 5501.6). Total num frames: 22667264. Throughput: 0: 4973.7. Samples: 22662114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 01:11:32,085][25689] Avg episode reward: [(0, '-63.799')] [2022-07-09 01:11:33,190][26022] Updated weights on worker 0-0, policy_version 22142 (0.00093) [2022-07-09 01:11:35,068][26022] Updated weights on worker 0-0, policy_version 22152 (0.00088) [2022-07-09 01:11:36,969][26022] Updated weights on worker 0-0, policy_version 22162 (0.00085) [2022-07-09 01:11:37,136][25689] Fps is (10 sec: 5785.7, 60 sec: 5538.9, 300 sec: 5501.3). Total num frames: 22694912. Throughput: 0: 5820.0. Samples: 22695458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:11:37,136][25689] Avg episode reward: [(0, '-63.963')] [2022-07-09 01:11:38,816][26022] Updated weights on worker 0-0, policy_version 22172 (0.00100) [2022-07-09 01:11:40,807][26022] Updated weights on worker 0-0, policy_version 22182 (0.00092) [2022-07-09 01:11:42,154][25689] Fps is (10 sec: 5491.0, 60 sec: 5545.6, 300 sec: 5501.4). Total num frames: 22722560. Throughput: 0: 5779.9. Samples: 22728434. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:11:42,154][25689] Avg episode reward: [(0, '-64.495')] [2022-07-09 01:11:42,518][26022] Updated weights on worker 0-0, policy_version 22192 (0.00083) [2022-07-09 01:11:44,498][26022] Updated weights on worker 0-0, policy_version 22202 (0.00088) [2022-07-09 01:11:46,063][26022] Updated weights on worker 0-0, policy_version 22212 (0.00089) [2022-07-09 01:11:47,223][25689] Fps is (10 sec: 5481.6, 60 sec: 5512.6, 300 sec: 5500.7). Total num frames: 22750208. Throughput: 0: 4953.1. Samples: 22745378. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:11:47,224][25689] Avg episode reward: [(0, '-64.661')] [2022-07-09 01:11:48,003][26022] Updated weights on worker 0-0, policy_version 22222 (0.00082) [2022-07-09 01:11:49,854][26022] Updated weights on worker 0-0, policy_version 22232 (0.00097) [2022-07-09 01:11:51,917][26022] Updated weights on worker 0-0, policy_version 22242 (0.00087) [2022-07-09 01:11:52,249][25689] Fps is (10 sec: 5477.1, 60 sec: 5494.4, 300 sec: 5495.8). Total num frames: 22777856. Throughput: 0: 5764.7. Samples: 22778366. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:11:52,250][25689] Avg episode reward: [(0, '-64.990')] [2022-07-09 01:11:53,801][26022] Updated weights on worker 0-0, policy_version 22252 (0.00077) [2022-07-09 01:11:55,607][26022] Updated weights on worker 0-0, policy_version 22262 (0.00094) [2022-07-09 01:11:57,307][25689] Fps is (10 sec: 5382.0, 60 sec: 5498.7, 300 sec: 5496.1). Total num frames: 22804480. Throughput: 0: 5691.2. Samples: 22810258. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:11:57,308][25689] Avg episode reward: [(0, '-65.464')] [2022-07-09 01:11:57,525][26022] Updated weights on worker 0-0, policy_version 22272 (0.00088) [2022-07-09 01:11:59,576][26022] Updated weights on worker 0-0, policy_version 22282 (0.00085) [2022-07-09 01:12:01,473][26022] Updated weights on worker 0-0, policy_version 22292 (0.00095) [2022-07-09 01:12:02,337][25689] Fps is (10 sec: 5177.0, 60 sec: 5463.3, 300 sec: 5499.3). Total num frames: 22830080. Throughput: 0: 4853.8. Samples: 22826400. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:12:02,338][25689] Avg episode reward: [(0, '-66.290')] [2022-07-09 01:12:03,789][26022] Updated weights on worker 0-0, policy_version 22302 (0.00088) [2022-07-09 01:12:05,555][26022] Updated weights on worker 0-0, policy_version 22312 (0.00088) [2022-07-09 01:12:07,393][25689] Fps is (10 sec: 5076.4, 60 sec: 5431.8, 300 sec: 5485.1). Total num frames: 22855680. Throughput: 0: 5519.0. Samples: 22856696. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 01:12:07,394][25689] Avg episode reward: [(0, '-65.881')] [2022-07-09 01:12:07,720][26022] Updated weights on worker 0-0, policy_version 22322 (0.00092) [2022-07-09 01:12:09,448][26022] Updated weights on worker 0-0, policy_version 22332 (0.00091) [2022-07-09 01:12:11,556][26022] Updated weights on worker 0-0, policy_version 22342 (0.00085) [2022-07-09 01:12:12,406][25689] Fps is (10 sec: 5288.5, 60 sec: 5466.4, 300 sec: 5489.2). Total num frames: 22883328. Throughput: 0: 5434.0. Samples: 22887896. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:12:12,406][25689] Avg episode reward: [(0, '-66.043')] [2022-07-09 01:12:13,402][26022] Updated weights on worker 0-0, policy_version 22352 (0.00090) [2022-07-09 01:12:15,073][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:12:15,097][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000022361_22897664.pth [2022-07-09 01:12:15,097][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000020429_20919296.pth [2022-07-09 01:12:15,198][26022] Updated weights on worker 0-0, policy_version 22362 (0.00084) [2022-07-09 01:12:17,159][26022] Updated weights on worker 0-0, policy_version 22372 (0.00086) [2022-07-09 01:12:17,455][25689] Fps is (10 sec: 5393.6, 60 sec: 5405.5, 300 sec: 5481.8). Total num frames: 22909952. Throughput: 0: 4681.4. Samples: 22904580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:12:17,456][25689] Avg episode reward: [(0, '-66.074')] [2022-07-09 01:12:18,804][26022] Updated weights on worker 0-0, policy_version 22382 (0.00089) [2022-07-09 01:12:20,848][26022] Updated weights on worker 0-0, policy_version 22392 (0.00092) [2022-07-09 01:12:22,457][25689] Fps is (10 sec: 5501.4, 60 sec: 5406.0, 300 sec: 5488.8). Total num frames: 22938624. Throughput: 0: 5561.5. Samples: 22938296. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:12:22,458][25689] Avg episode reward: [(0, '-66.326')] [2022-07-09 01:12:22,527][26022] Updated weights on worker 0-0, policy_version 22402 (0.00088) [2022-07-09 01:12:24,484][26022] Updated weights on worker 0-0, policy_version 22412 (0.00081) [2022-07-09 01:12:26,383][26022] Updated weights on worker 0-0, policy_version 22422 (0.00089) [2022-07-09 01:12:27,511][25689] Fps is (10 sec: 5601.0, 60 sec: 5453.4, 300 sec: 5488.2). Total num frames: 22966272. Throughput: 0: 5710.5. Samples: 22971578. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:12:27,511][25689] Avg episode reward: [(0, '-66.279')] [2022-07-09 01:12:28,087][26022] Updated weights on worker 0-0, policy_version 22432 (0.00085) [2022-07-09 01:12:30,126][26022] Updated weights on worker 0-0, policy_version 22442 (0.00083) [2022-07-09 01:12:31,632][26022] Updated weights on worker 0-0, policy_version 22452 (0.00116) [2022-07-09 01:12:32,519][25689] Fps is (10 sec: 5597.3, 60 sec: 5422.0, 300 sec: 5486.4). Total num frames: 22994944. Throughput: 0: 4990.9. Samples: 22988278. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:12:32,526][25689] Avg episode reward: [(0, '-65.621')] [2022-07-09 01:12:33,771][26022] Updated weights on worker 0-0, policy_version 22462 (0.00089) [2022-07-09 01:12:35,332][26022] Updated weights on worker 0-0, policy_version 22472 (0.00089) [2022-07-09 01:12:37,457][26022] Updated weights on worker 0-0, policy_version 22482 (0.00082) [2022-07-09 01:12:37,587][25689] Fps is (10 sec: 5487.8, 60 sec: 5403.6, 300 sec: 5482.6). Total num frames: 23021568. Throughput: 0: 5807.3. Samples: 23021490. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:12:37,587][25689] Avg episode reward: [(0, '-65.891')] [2022-07-09 01:12:39,150][26022] Updated weights on worker 0-0, policy_version 22492 (0.00089) [2022-07-09 01:12:41,178][26022] Updated weights on worker 0-0, policy_version 22502 (0.00084) [2022-07-09 01:12:42,588][25689] Fps is (10 sec: 5491.7, 60 sec: 5422.1, 300 sec: 5490.2). Total num frames: 23050240. Throughput: 0: 5771.1. Samples: 23054474. Policy #0 lag: (min: 0.0, avg: 9.2, max: 18.0) [2022-07-09 01:12:42,589][25689] Avg episode reward: [(0, '-65.883')] [2022-07-09 01:12:42,824][26022] Updated weights on worker 0-0, policy_version 22512 (0.00093) [2022-07-09 01:12:44,763][26022] Updated weights on worker 0-0, policy_version 22522 (0.00088) [2022-07-09 01:12:46,723][26022] Updated weights on worker 0-0, policy_version 22532 (0.00084) [2022-07-09 01:12:47,634][25689] Fps is (10 sec: 5605.5, 60 sec: 5424.2, 300 sec: 5482.8). Total num frames: 23077888. Throughput: 0: 4956.7. Samples: 23071324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 18.0) [2022-07-09 01:12:47,635][25689] Avg episode reward: [(0, '-65.367')] [2022-07-09 01:12:48,430][26022] Updated weights on worker 0-0, policy_version 22542 (0.00089) [2022-07-09 01:12:50,304][26022] Updated weights on worker 0-0, policy_version 22552 (0.00090) [2022-07-09 01:12:52,108][26022] Updated weights on worker 0-0, policy_version 22562 (0.00092) [2022-07-09 01:12:52,659][25689] Fps is (10 sec: 5592.2, 60 sec: 5441.2, 300 sec: 5491.4). Total num frames: 23106560. Throughput: 0: 5804.1. Samples: 23105174. Policy #0 lag: (min: 0.0, avg: 9.2, max: 18.0) [2022-07-09 01:12:52,660][25689] Avg episode reward: [(0, '-64.827')] [2022-07-09 01:12:53,739][26022] Updated weights on worker 0-0, policy_version 22572 (0.00092) [2022-07-09 01:12:55,797][26022] Updated weights on worker 0-0, policy_version 22582 (0.00088) [2022-07-09 01:12:57,559][26022] Updated weights on worker 0-0, policy_version 22592 (0.00090) [2022-07-09 01:12:57,783][25689] Fps is (10 sec: 5650.4, 60 sec: 5469.2, 300 sec: 5496.9). Total num frames: 23135232. Throughput: 0: 5799.0. Samples: 23138606. Policy #0 lag: (min: 0.0, avg: 9.2, max: 18.0) [2022-07-09 01:12:57,783][25689] Avg episode reward: [(0, '-64.479')] [2022-07-09 01:12:59,505][26022] Updated weights on worker 0-0, policy_version 22602 (0.00083) [2022-07-09 01:13:01,382][26022] Updated weights on worker 0-0, policy_version 22612 (0.00087) [2022-07-09 01:13:02,822][25689] Fps is (10 sec: 5340.2, 60 sec: 5468.3, 300 sec: 5490.2). Total num frames: 23160832. Throughput: 0: 5711.5. Samples: 23170042. Policy #0 lag: (min: 0.0, avg: 9.2, max: 18.0) [2022-07-09 01:13:02,823][25689] Avg episode reward: [(0, '-64.650')] [2022-07-09 01:13:03,494][26022] Updated weights on worker 0-0, policy_version 22622 (0.00091) [2022-07-09 01:13:05,386][26022] Updated weights on worker 0-0, policy_version 22632 (0.00091) [2022-07-09 01:13:07,288][26022] Updated weights on worker 0-0, policy_version 22642 (0.00089) [2022-07-09 01:13:07,832][25689] Fps is (10 sec: 5298.6, 60 sec: 5506.4, 300 sec: 5487.0). Total num frames: 23188480. Throughput: 0: 5717.2. Samples: 23186802. Policy #0 lag: (min: 0.0, avg: 9.2, max: 18.0) [2022-07-09 01:13:07,833][25689] Avg episode reward: [(0, '-64.332')] [2022-07-09 01:13:09,048][26022] Updated weights on worker 0-0, policy_version 22652 (0.00095) [2022-07-09 01:13:10,905][26022] Updated weights on worker 0-0, policy_version 22662 (0.00095) [2022-07-09 01:13:12,702][26022] Updated weights on worker 0-0, policy_version 22672 (0.00095) [2022-07-09 01:13:12,867][25689] Fps is (10 sec: 5504.7, 60 sec: 5504.3, 300 sec: 5490.8). Total num frames: 23216128. Throughput: 0: 5688.9. Samples: 23220136. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 01:13:12,868][25689] Avg episode reward: [(0, '-64.401')] [2022-07-09 01:13:14,533][26022] Updated weights on worker 0-0, policy_version 22682 (0.00089) [2022-07-09 01:13:16,204][26022] Updated weights on worker 0-0, policy_version 22692 (0.00087) [2022-07-09 01:13:17,980][25689] Fps is (10 sec: 5650.8, 60 sec: 5549.3, 300 sec: 5496.3). Total num frames: 23245824. Throughput: 0: 5696.2. Samples: 23253654. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 01:13:17,981][25689] Avg episode reward: [(0, '-65.491')] [2022-07-09 01:13:18,127][26022] Updated weights on worker 0-0, policy_version 22702 (0.00088) [2022-07-09 01:13:19,994][26022] Updated weights on worker 0-0, policy_version 22712 (0.00091) [2022-07-09 01:13:22,102][26022] Updated weights on worker 0-0, policy_version 22722 (0.00085) [2022-07-09 01:13:23,044][25689] Fps is (10 sec: 5735.7, 60 sec: 5543.7, 300 sec: 5498.7). Total num frames: 23274496. Throughput: 0: 4961.2. Samples: 23270364. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 01:13:23,045][25689] Avg episode reward: [(0, '-65.042')] [2022-07-09 01:13:23,527][26022] Updated weights on worker 0-0, policy_version 22732 (0.00085) [2022-07-09 01:13:25,832][26022] Updated weights on worker 0-0, policy_version 22742 (0.00103) [2022-07-09 01:13:27,168][26022] Updated weights on worker 0-0, policy_version 22752 (0.00089) [2022-07-09 01:13:28,050][25689] Fps is (10 sec: 5389.7, 60 sec: 5514.2, 300 sec: 5492.4). Total num frames: 23300096. Throughput: 0: 5767.5. Samples: 23303406. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 01:13:28,052][25689] Avg episode reward: [(0, '-64.636')] [2022-07-09 01:13:29,434][26022] Updated weights on worker 0-0, policy_version 22762 (0.00090) [2022-07-09 01:13:31,146][26022] Updated weights on worker 0-0, policy_version 22772 (0.00088) [2022-07-09 01:13:33,014][26022] Updated weights on worker 0-0, policy_version 22782 (0.00098) [2022-07-09 01:13:33,111][25689] Fps is (10 sec: 5390.7, 60 sec: 5509.3, 300 sec: 5492.3). Total num frames: 23328768. Throughput: 0: 5747.2. Samples: 23336482. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 01:13:33,113][25689] Avg episode reward: [(0, '-64.881')] [2022-07-09 01:13:34,908][26022] Updated weights on worker 0-0, policy_version 22792 (0.00094) [2022-07-09 01:13:36,849][26022] Updated weights on worker 0-0, policy_version 22802 (0.00082) [2022-07-09 01:13:38,216][25689] Fps is (10 sec: 5741.5, 60 sec: 5556.7, 300 sec: 5497.3). Total num frames: 23358464. Throughput: 0: 4922.0. Samples: 23353254. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 01:13:38,216][25689] Avg episode reward: [(0, '-65.684')] [2022-07-09 01:13:38,358][26022] Updated weights on worker 0-0, policy_version 22812 (0.00094) [2022-07-09 01:13:40,600][26022] Updated weights on worker 0-0, policy_version 22822 (0.00089) [2022-07-09 01:13:42,229][26022] Updated weights on worker 0-0, policy_version 22832 (0.00091) [2022-07-09 01:13:43,219][25689] Fps is (10 sec: 5572.2, 60 sec: 5522.8, 300 sec: 5494.0). Total num frames: 23385088. Throughput: 0: 5746.5. Samples: 23386300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:13:43,219][25689] Avg episode reward: [(0, '-65.823')] [2022-07-09 01:13:44,228][26022] Updated weights on worker 0-0, policy_version 22842 (0.00085) [2022-07-09 01:13:45,988][26022] Updated weights on worker 0-0, policy_version 22852 (0.00094) [2022-07-09 01:13:47,745][26022] Updated weights on worker 0-0, policy_version 22862 (0.00090) [2022-07-09 01:13:48,262][25689] Fps is (10 sec: 5504.3, 60 sec: 5539.9, 300 sec: 5503.8). Total num frames: 23413760. Throughput: 0: 5764.1. Samples: 23419908. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:13:48,262][25689] Avg episode reward: [(0, '-64.900')] [2022-07-09 01:13:49,723][26022] Updated weights on worker 0-0, policy_version 22872 (0.00771) [2022-07-09 01:13:51,444][26022] Updated weights on worker 0-0, policy_version 22882 (0.00089) [2022-07-09 01:13:53,311][25689] Fps is (10 sec: 5479.1, 60 sec: 5504.0, 300 sec: 5497.4). Total num frames: 23440384. Throughput: 0: 4957.0. Samples: 23436612. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:13:53,311][25689] Avg episode reward: [(0, '-64.575')] [2022-07-09 01:13:53,398][26022] Updated weights on worker 0-0, policy_version 22892 (0.00092) [2022-07-09 01:13:55,299][26022] Updated weights on worker 0-0, policy_version 22902 (0.01148) [2022-07-09 01:13:57,131][26022] Updated weights on worker 0-0, policy_version 22912 (0.00089) [2022-07-09 01:13:58,428][25689] Fps is (10 sec: 5539.9, 60 sec: 5521.4, 300 sec: 5496.5). Total num frames: 23470080. Throughput: 0: 5755.0. Samples: 23469574. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:13:58,430][25689] Avg episode reward: [(0, '-65.529')] [2022-07-09 01:13:59,013][26022] Updated weights on worker 0-0, policy_version 22922 (0.00088) [2022-07-09 01:14:00,654][26022] Updated weights on worker 0-0, policy_version 22932 (0.00093) [2022-07-09 01:14:02,883][26022] Updated weights on worker 0-0, policy_version 22942 (0.00086) [2022-07-09 01:14:03,482][25689] Fps is (10 sec: 5336.0, 60 sec: 5503.2, 300 sec: 5496.8). Total num frames: 23494656. Throughput: 0: 5676.4. Samples: 23501322. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:14:03,484][25689] Avg episode reward: [(0, '-65.197')] [2022-07-09 01:14:04,698][26022] Updated weights on worker 0-0, policy_version 22952 (0.00088) [2022-07-09 01:14:06,751][26022] Updated weights on worker 0-0, policy_version 22962 (0.00091) [2022-07-09 01:14:08,404][26022] Updated weights on worker 0-0, policy_version 22972 (0.00093) [2022-07-09 01:14:08,501][25689] Fps is (10 sec: 5286.1, 60 sec: 5519.3, 300 sec: 5497.1). Total num frames: 23523328. Throughput: 0: 4851.7. Samples: 23518104. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:14:08,502][25689] Avg episode reward: [(0, '-65.400')] [2022-07-09 01:14:10,404][26022] Updated weights on worker 0-0, policy_version 22982 (0.00102) [2022-07-09 01:14:12,007][26022] Updated weights on worker 0-0, policy_version 22992 (0.00096) [2022-07-09 01:14:13,515][25689] Fps is (10 sec: 5613.4, 60 sec: 5521.2, 300 sec: 5497.7). Total num frames: 23550976. Throughput: 0: 5689.8. Samples: 23551568. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:13,517][25689] Avg episode reward: [(0, '-64.910')] [2022-07-09 01:14:14,118][26022] Updated weights on worker 0-0, policy_version 23002 (0.00087) [2022-07-09 01:14:15,239][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:14:15,255][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000023009_23561216.pth [2022-07-09 01:14:15,259][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000021074_21579776.pth [2022-07-09 01:14:15,737][26022] Updated weights on worker 0-0, policy_version 23012 (0.00088) [2022-07-09 01:14:17,715][26022] Updated weights on worker 0-0, policy_version 23022 (0.00368) [2022-07-09 01:14:18,579][25689] Fps is (10 sec: 5588.7, 60 sec: 5508.8, 300 sec: 5504.1). Total num frames: 23579648. Throughput: 0: 5718.9. Samples: 23584814. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:18,579][25689] Avg episode reward: [(0, '-65.363')] [2022-07-09 01:14:19,397][26022] Updated weights on worker 0-0, policy_version 23032 (0.00090) [2022-07-09 01:14:21,395][26022] Updated weights on worker 0-0, policy_version 23042 (0.00084) [2022-07-09 01:14:23,062][26022] Updated weights on worker 0-0, policy_version 23052 (0.00090) [2022-07-09 01:14:23,659][25689] Fps is (10 sec: 5552.1, 60 sec: 5490.4, 300 sec: 5504.1). Total num frames: 23607296. Throughput: 0: 4979.3. Samples: 23601790. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:23,659][25689] Avg episode reward: [(0, '-66.686')] [2022-07-09 01:14:25,042][26022] Updated weights on worker 0-0, policy_version 23062 (0.00098) [2022-07-09 01:14:26,736][26022] Updated weights on worker 0-0, policy_version 23072 (0.00088) [2022-07-09 01:14:28,668][25689] Fps is (10 sec: 5480.7, 60 sec: 5523.9, 300 sec: 5505.6). Total num frames: 23634944. Throughput: 0: 5813.4. Samples: 23635340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:28,670][25689] Avg episode reward: [(0, '-66.296')] [2022-07-09 01:14:28,903][26022] Updated weights on worker 0-0, policy_version 23082 (0.00095) [2022-07-09 01:14:30,387][26022] Updated weights on worker 0-0, policy_version 23092 (0.00092) [2022-07-09 01:14:32,424][26022] Updated weights on worker 0-0, policy_version 23102 (0.00095) [2022-07-09 01:14:33,697][25689] Fps is (10 sec: 5508.4, 60 sec: 5509.9, 300 sec: 5506.4). Total num frames: 23662592. Throughput: 0: 5798.2. Samples: 23668590. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:33,698][25689] Avg episode reward: [(0, '-67.259')] [2022-07-09 01:14:34,151][26022] Updated weights on worker 0-0, policy_version 23112 (0.00089) [2022-07-09 01:14:36,038][26022] Updated weights on worker 0-0, policy_version 23122 (0.00083) [2022-07-09 01:14:37,880][26022] Updated weights on worker 0-0, policy_version 23132 (0.00093) [2022-07-09 01:14:38,831][25689] Fps is (10 sec: 5541.7, 60 sec: 5490.4, 300 sec: 5508.3). Total num frames: 23691264. Throughput: 0: 4960.4. Samples: 23685272. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:38,831][25689] Avg episode reward: [(0, '-66.218')] [2022-07-09 01:14:39,659][26022] Updated weights on worker 0-0, policy_version 23142 (0.00083) [2022-07-09 01:14:41,758][26022] Updated weights on worker 0-0, policy_version 23153 (0.00092) [2022-07-09 01:14:43,603][26022] Updated weights on worker 0-0, policy_version 23163 (0.00080) [2022-07-09 01:14:43,840][25689] Fps is (10 sec: 5552.8, 60 sec: 5506.7, 300 sec: 5502.0). Total num frames: 23718912. Throughput: 0: 5779.4. Samples: 23718426. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 01:14:43,843][25689] Avg episode reward: [(0, '-66.663')] [2022-07-09 01:14:45,411][26022] Updated weights on worker 0-0, policy_version 23173 (0.00104) [2022-07-09 01:14:47,603][26022] Updated weights on worker 0-0, policy_version 23183 (0.00085) [2022-07-09 01:14:48,847][25689] Fps is (10 sec: 5725.2, 60 sec: 5526.9, 300 sec: 5513.7). Total num frames: 23748608. Throughput: 0: 5781.1. Samples: 23751996. Policy #0 lag: (min: 0.0, avg: 6.8, max: 18.0) [2022-07-09 01:14:48,848][25689] Avg episode reward: [(0, '-65.867')] [2022-07-09 01:14:49,067][26022] Updated weights on worker 0-0, policy_version 23193 (0.00084) [2022-07-09 01:14:51,023][26022] Updated weights on worker 0-0, policy_version 23203 (0.00081) [2022-07-09 01:14:52,726][26022] Updated weights on worker 0-0, policy_version 23213 (0.00090) [2022-07-09 01:14:53,858][25689] Fps is (10 sec: 5622.0, 60 sec: 5530.4, 300 sec: 5504.2). Total num frames: 23775232. Throughput: 0: 4975.2. Samples: 23768892. Policy #0 lag: (min: 0.0, avg: 6.8, max: 18.0) [2022-07-09 01:14:53,858][25689] Avg episode reward: [(0, '-66.116')] [2022-07-09 01:14:54,791][26022] Updated weights on worker 0-0, policy_version 23223 (0.00086) [2022-07-09 01:14:56,578][26022] Updated weights on worker 0-0, policy_version 23233 (0.00092) [2022-07-09 01:14:58,427][26022] Updated weights on worker 0-0, policy_version 23243 (0.00085) [2022-07-09 01:14:58,935][25689] Fps is (10 sec: 5481.4, 60 sec: 5517.1, 300 sec: 5506.3). Total num frames: 23803904. Throughput: 0: 5814.6. Samples: 23802168. Policy #0 lag: (min: 0.0, avg: 6.8, max: 18.0) [2022-07-09 01:14:58,935][25689] Avg episode reward: [(0, '-65.868')] [2022-07-09 01:15:00,108][26022] Updated weights on worker 0-0, policy_version 23253 (0.00616) [2022-07-09 01:15:02,513][26022] Updated weights on worker 0-0, policy_version 23263 (0.00098) [2022-07-09 01:15:03,972][25689] Fps is (10 sec: 5365.9, 60 sec: 5535.6, 300 sec: 5502.4). Total num frames: 23829504. Throughput: 0: 5726.5. Samples: 23833712. Policy #0 lag: (min: 0.0, avg: 6.8, max: 18.0) [2022-07-09 01:15:03,973][25689] Avg episode reward: [(0, '-65.060')] [2022-07-09 01:15:04,138][26022] Updated weights on worker 0-0, policy_version 23273 (0.00084) [2022-07-09 01:15:06,083][26022] Updated weights on worker 0-0, policy_version 23283 (0.00611) [2022-07-09 01:15:07,911][26022] Updated weights on worker 0-0, policy_version 23293 (0.00098) [2022-07-09 01:15:08,980][25689] Fps is (10 sec: 5300.7, 60 sec: 5519.7, 300 sec: 5506.0). Total num frames: 23857152. Throughput: 0: 5731.2. Samples: 23867384. Policy #0 lag: (min: 0.0, avg: 6.8, max: 18.0) [2022-07-09 01:15:08,981][25689] Avg episode reward: [(0, '-65.032')] [2022-07-09 01:15:09,793][26022] Updated weights on worker 0-0, policy_version 23303 (0.00097) [2022-07-09 01:15:11,664][26022] Updated weights on worker 0-0, policy_version 23313 (0.00093) [2022-07-09 01:15:13,427][26022] Updated weights on worker 0-0, policy_version 23323 (0.00085) [2022-07-09 01:15:13,990][25689] Fps is (10 sec: 5519.8, 60 sec: 5520.0, 300 sec: 5503.8). Total num frames: 23884800. Throughput: 0: 5700.8. Samples: 23883660. Policy #0 lag: (min: 0.0, avg: 6.8, max: 18.0) [2022-07-09 01:15:13,990][25689] Avg episode reward: [(0, '-64.643')] [2022-07-09 01:15:15,239][26022] Updated weights on worker 0-0, policy_version 23333 (0.00080) [2022-07-09 01:15:17,174][26022] Updated weights on worker 0-0, policy_version 23343 (0.00091) [2022-07-09 01:15:18,970][26022] Updated weights on worker 0-0, policy_version 23353 (0.00089) [2022-07-09 01:15:19,042][25689] Fps is (10 sec: 5699.3, 60 sec: 5538.0, 300 sec: 5506.6). Total num frames: 23914496. Throughput: 0: 5722.4. Samples: 23917230. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:15:19,044][25689] Avg episode reward: [(0, '-64.982')] [2022-07-09 01:15:20,843][26022] Updated weights on worker 0-0, policy_version 23363 (0.00084) [2022-07-09 01:15:22,863][26022] Updated weights on worker 0-0, policy_version 23373 (0.00095) [2022-07-09 01:15:24,098][25689] Fps is (10 sec: 5571.8, 60 sec: 5523.3, 300 sec: 5499.7). Total num frames: 23941120. Throughput: 0: 5806.2. Samples: 23950566. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:15:24,098][25689] Avg episode reward: [(0, '-64.939')] [2022-07-09 01:15:24,439][26022] Updated weights on worker 0-0, policy_version 23383 (0.00083) [2022-07-09 01:15:26,565][26022] Updated weights on worker 0-0, policy_version 23393 (0.00092) [2022-07-09 01:15:28,016][26022] Updated weights on worker 0-0, policy_version 23403 (0.00084) [2022-07-09 01:15:29,118][25689] Fps is (10 sec: 5385.9, 60 sec: 5522.2, 300 sec: 5500.0). Total num frames: 23968768. Throughput: 0: 4945.5. Samples: 23966978. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:15:29,119][25689] Avg episode reward: [(0, '-65.018')] [2022-07-09 01:15:30,054][26022] Updated weights on worker 0-0, policy_version 23413 (0.00084) [2022-07-09 01:15:32,103][26022] Updated weights on worker 0-0, policy_version 23423 (0.00081) [2022-07-09 01:15:33,567][26022] Updated weights on worker 0-0, policy_version 23433 (0.00092) [2022-07-09 01:15:34,121][25689] Fps is (10 sec: 5619.1, 60 sec: 5541.7, 300 sec: 5504.9). Total num frames: 23997440. Throughput: 0: 5801.3. Samples: 24000446. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:15:34,121][25689] Avg episode reward: [(0, '-65.801')] [2022-07-09 01:15:35,707][26022] Updated weights on worker 0-0, policy_version 23443 (0.00088) [2022-07-09 01:15:37,320][26022] Updated weights on worker 0-0, policy_version 23453 (0.00586) [2022-07-09 01:15:39,202][25689] Fps is (10 sec: 5585.2, 60 sec: 5529.5, 300 sec: 5505.1). Total num frames: 24025088. Throughput: 0: 5791.5. Samples: 24033990. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:15:39,203][25689] Avg episode reward: [(0, '-65.161')] [2022-07-09 01:15:39,260][26022] Updated weights on worker 0-0, policy_version 23463 (0.00091) [2022-07-09 01:15:41,018][26022] Updated weights on worker 0-0, policy_version 23473 (0.00091) [2022-07-09 01:15:42,912][26022] Updated weights on worker 0-0, policy_version 23483 (0.00086) [2022-07-09 01:15:44,230][25689] Fps is (10 sec: 5571.0, 60 sec: 5544.7, 300 sec: 5502.6). Total num frames: 24053760. Throughput: 0: 4978.6. Samples: 24050798. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-09 01:15:44,231][25689] Avg episode reward: [(0, '-65.374')] [2022-07-09 01:15:44,800][26022] Updated weights on worker 0-0, policy_version 23493 (0.00085) [2022-07-09 01:15:46,702][26022] Updated weights on worker 0-0, policy_version 23503 (0.00051) [2022-07-09 01:15:48,278][26022] Updated weights on worker 0-0, policy_version 23513 (0.00088) [2022-07-09 01:15:49,253][25689] Fps is (10 sec: 5705.7, 60 sec: 5526.3, 300 sec: 5502.4). Total num frames: 24082432. Throughput: 0: 5831.7. Samples: 24084394. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:15:49,253][25689] Avg episode reward: [(0, '-65.158')] [2022-07-09 01:15:50,360][26022] Updated weights on worker 0-0, policy_version 23523 (0.00573) [2022-07-09 01:15:51,947][26022] Updated weights on worker 0-0, policy_version 23533 (0.00092) [2022-07-09 01:15:53,979][26022] Updated weights on worker 0-0, policy_version 23543 (0.00085) [2022-07-09 01:15:54,256][25689] Fps is (10 sec: 5515.4, 60 sec: 5527.0, 300 sec: 5504.3). Total num frames: 24109056. Throughput: 0: 5848.1. Samples: 24118200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:15:54,257][25689] Avg episode reward: [(0, '-65.398')] [2022-07-09 01:15:55,667][26022] Updated weights on worker 0-0, policy_version 23553 (0.00100) [2022-07-09 01:15:57,511][26022] Updated weights on worker 0-0, policy_version 23563 (0.00086) [2022-07-09 01:15:59,391][25689] Fps is (10 sec: 5555.4, 60 sec: 5538.7, 300 sec: 5508.9). Total num frames: 24138752. Throughput: 0: 4996.5. Samples: 24134860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:15:59,392][25689] Avg episode reward: [(0, '-65.478')] [2022-07-09 01:15:59,394][26022] Updated weights on worker 0-0, policy_version 23573 (0.00094) [2022-07-09 01:16:01,280][26022] Updated weights on worker 0-0, policy_version 23583 (0.00087) [2022-07-09 01:16:03,391][26022] Updated weights on worker 0-0, policy_version 23593 (0.00082) [2022-07-09 01:16:04,398][25689] Fps is (10 sec: 5351.2, 60 sec: 5524.5, 300 sec: 5500.0). Total num frames: 24163328. Throughput: 0: 5740.0. Samples: 24166562. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:16:04,399][25689] Avg episode reward: [(0, '-65.288')] [2022-07-09 01:16:05,248][26022] Updated weights on worker 0-0, policy_version 23603 (0.00091) [2022-07-09 01:16:07,115][26022] Updated weights on worker 0-0, policy_version 23613 (0.00092) [2022-07-09 01:16:08,880][26022] Updated weights on worker 0-0, policy_version 23623 (0.00082) [2022-07-09 01:16:09,454][25689] Fps is (10 sec: 5291.5, 60 sec: 5537.1, 300 sec: 5509.7). Total num frames: 24192000. Throughput: 0: 5718.9. Samples: 24199920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:16:09,454][25689] Avg episode reward: [(0, '-65.398')] [2022-07-09 01:16:10,725][26022] Updated weights on worker 0-0, policy_version 23633 (0.01072) [2022-07-09 01:16:12,762][26022] Updated weights on worker 0-0, policy_version 23643 (0.00095) [2022-07-09 01:16:14,460][25689] Fps is (10 sec: 5699.0, 60 sec: 5554.3, 300 sec: 5505.0). Total num frames: 24220672. Throughput: 0: 4878.2. Samples: 24216760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:16:14,461][25689] Avg episode reward: [(0, '-65.300')] [2022-07-09 01:16:14,463][26022] Updated weights on worker 0-0, policy_version 23653 (0.00087) [2022-07-09 01:16:15,302][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:16:15,318][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000023657_24224768.pth [2022-07-09 01:16:15,318][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000021718_22239232.pth [2022-07-09 01:16:16,432][26022] Updated weights on worker 0-0, policy_version 23663 (0.00087) [2022-07-09 01:16:18,126][26022] Updated weights on worker 0-0, policy_version 23673 (0.00086) [2022-07-09 01:16:19,501][25689] Fps is (10 sec: 5503.6, 60 sec: 5504.6, 300 sec: 5497.5). Total num frames: 24247296. Throughput: 0: 5722.0. Samples: 24249930. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-09 01:16:19,501][25689] Avg episode reward: [(0, '-65.123')] [2022-07-09 01:16:20,316][26022] Updated weights on worker 0-0, policy_version 23683 (0.00096) [2022-07-09 01:16:22,007][26022] Updated weights on worker 0-0, policy_version 23693 (0.00231) [2022-07-09 01:16:23,850][26022] Updated weights on worker 0-0, policy_version 23703 (0.00093) [2022-07-09 01:16:24,513][25689] Fps is (10 sec: 5500.4, 60 sec: 5542.4, 300 sec: 5511.3). Total num frames: 24275968. Throughput: 0: 5807.9. Samples: 24283390. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 01:16:24,515][25689] Avg episode reward: [(0, '-64.692')] [2022-07-09 01:16:25,482][26022] Updated weights on worker 0-0, policy_version 23713 (0.00086) [2022-07-09 01:16:27,515][26022] Updated weights on worker 0-0, policy_version 23723 (0.00090) [2022-07-09 01:16:29,227][26022] Updated weights on worker 0-0, policy_version 23733 (0.00090) [2022-07-09 01:16:29,524][25689] Fps is (10 sec: 5618.8, 60 sec: 5543.3, 300 sec: 5501.4). Total num frames: 24303616. Throughput: 0: 4975.8. Samples: 24299788. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 01:16:29,526][25689] Avg episode reward: [(0, '-64.355')] [2022-07-09 01:16:31,255][26022] Updated weights on worker 0-0, policy_version 23743 (0.00089) [2022-07-09 01:16:32,998][26022] Updated weights on worker 0-0, policy_version 23753 (0.00098) [2022-07-09 01:16:34,571][25689] Fps is (10 sec: 5498.1, 60 sec: 5522.3, 300 sec: 5501.5). Total num frames: 24331264. Throughput: 0: 5788.6. Samples: 24333172. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 01:16:34,572][25689] Avg episode reward: [(0, '-64.835')] [2022-07-09 01:16:34,863][26022] Updated weights on worker 0-0, policy_version 23763 (0.00088) [2022-07-09 01:16:36,699][26022] Updated weights on worker 0-0, policy_version 23773 (0.00085) [2022-07-09 01:16:38,772][26022] Updated weights on worker 0-0, policy_version 23783 (0.00097) [2022-07-09 01:16:39,701][25689] Fps is (10 sec: 5534.3, 60 sec: 5534.8, 300 sec: 5502.9). Total num frames: 24359936. Throughput: 0: 5757.7. Samples: 24366236. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 01:16:39,703][25689] Avg episode reward: [(0, '-64.630')] [2022-07-09 01:16:40,308][26022] Updated weights on worker 0-0, policy_version 23793 (0.00089) [2022-07-09 01:16:42,296][26022] Updated weights on worker 0-0, policy_version 23803 (0.00101) [2022-07-09 01:16:43,943][26022] Updated weights on worker 0-0, policy_version 23813 (0.00052) [2022-07-09 01:16:44,704][25689] Fps is (10 sec: 5558.1, 60 sec: 5520.2, 300 sec: 5504.1). Total num frames: 24387584. Throughput: 0: 4926.5. Samples: 24382858. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 01:16:44,705][25689] Avg episode reward: [(0, '-65.369')] [2022-07-09 01:16:46,115][26022] Updated weights on worker 0-0, policy_version 23823 (0.00089) [2022-07-09 01:16:47,752][26022] Updated weights on worker 0-0, policy_version 23833 (0.00086) [2022-07-09 01:16:49,744][25689] Fps is (10 sec: 5505.8, 60 sec: 5501.6, 300 sec: 5503.9). Total num frames: 24415232. Throughput: 0: 5764.9. Samples: 24416354. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 01:16:49,746][25689] Avg episode reward: [(0, '-65.195')] [2022-07-09 01:16:49,752][26022] Updated weights on worker 0-0, policy_version 23843 (0.00090) [2022-07-09 01:16:51,454][26022] Updated weights on worker 0-0, policy_version 23853 (0.00090) [2022-07-09 01:16:53,319][26022] Updated weights on worker 0-0, policy_version 23863 (0.00095) [2022-07-09 01:16:54,757][25689] Fps is (10 sec: 5601.9, 60 sec: 5534.6, 300 sec: 5511.6). Total num frames: 24443904. Throughput: 0: 5792.1. Samples: 24450096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:16:54,759][25689] Avg episode reward: [(0, '-65.955')] [2022-07-09 01:16:55,108][26022] Updated weights on worker 0-0, policy_version 23873 (0.00084) [2022-07-09 01:16:56,991][26022] Updated weights on worker 0-0, policy_version 23883 (0.00084) [2022-07-09 01:16:58,832][26022] Updated weights on worker 0-0, policy_version 23893 (0.00059) [2022-07-09 01:16:59,842][25689] Fps is (10 sec: 5577.4, 60 sec: 5505.3, 300 sec: 5517.5). Total num frames: 24471552. Throughput: 0: 4986.3. Samples: 24466666. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:16:59,842][25689] Avg episode reward: [(0, '-65.910')] [2022-07-09 01:17:00,794][26022] Updated weights on worker 0-0, policy_version 23903 (0.00095) [2022-07-09 01:17:02,863][26022] Updated weights on worker 0-0, policy_version 23913 (0.00096) [2022-07-09 01:17:04,726][26022] Updated weights on worker 0-0, policy_version 23923 (0.00057) [2022-07-09 01:17:04,846][25689] Fps is (10 sec: 5278.2, 60 sec: 5522.5, 300 sec: 5518.4). Total num frames: 24497152. Throughput: 0: 5712.8. Samples: 24497926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:17:04,848][25689] Avg episode reward: [(0, '-66.017')] [2022-07-09 01:17:06,522][26022] Updated weights on worker 0-0, policy_version 23933 (0.00086) [2022-07-09 01:17:08,564][26022] Updated weights on worker 0-0, policy_version 23943 (0.00092) [2022-07-09 01:17:09,913][25689] Fps is (10 sec: 5388.9, 60 sec: 5521.5, 300 sec: 5520.9). Total num frames: 24525824. Throughput: 0: 5704.5. Samples: 24531408. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:17:09,915][25689] Avg episode reward: [(0, '-65.622')] [2022-07-09 01:17:10,272][26022] Updated weights on worker 0-0, policy_version 23953 (0.00085) [2022-07-09 01:17:12,224][26022] Updated weights on worker 0-0, policy_version 23963 (0.00091) [2022-07-09 01:17:13,753][26022] Updated weights on worker 0-0, policy_version 23973 (0.00090) [2022-07-09 01:17:14,964][25689] Fps is (10 sec: 5565.9, 60 sec: 5500.4, 300 sec: 5524.3). Total num frames: 24553472. Throughput: 0: 4853.8. Samples: 24548178. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:17:14,965][25689] Avg episode reward: [(0, '-65.183')] [2022-07-09 01:17:15,890][26022] Updated weights on worker 0-0, policy_version 23983 (0.00093) [2022-07-09 01:17:17,518][26022] Updated weights on worker 0-0, policy_version 23993 (0.00090) [2022-07-09 01:17:19,419][26022] Updated weights on worker 0-0, policy_version 24003 (0.00081) [2022-07-09 01:17:20,023][25689] Fps is (10 sec: 5570.6, 60 sec: 5532.6, 300 sec: 5523.2). Total num frames: 24582144. Throughput: 0: 5692.1. Samples: 24581540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:17:20,024][25689] Avg episode reward: [(0, '-65.382')] [2022-07-09 01:17:21,252][26022] Updated weights on worker 0-0, policy_version 24013 (0.00093) [2022-07-09 01:17:23,011][26022] Updated weights on worker 0-0, policy_version 24023 (0.00086) [2022-07-09 01:17:24,827][26022] Updated weights on worker 0-0, policy_version 24033 (0.00091) [2022-07-09 01:17:25,041][25689] Fps is (10 sec: 5690.9, 60 sec: 5532.2, 300 sec: 5527.3). Total num frames: 24610816. Throughput: 0: 5812.8. Samples: 24615316. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:17:25,041][25689] Avg episode reward: [(0, '-65.065')] [2022-07-09 01:17:26,789][26022] Updated weights on worker 0-0, policy_version 24043 (0.00086) [2022-07-09 01:17:28,611][26022] Updated weights on worker 0-0, policy_version 24053 (0.00088) [2022-07-09 01:17:30,095][25689] Fps is (10 sec: 5490.4, 60 sec: 5511.3, 300 sec: 5519.6). Total num frames: 24637440. Throughput: 0: 4987.9. Samples: 24632072. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 01:17:30,095][25689] Avg episode reward: [(0, '-65.189')] [2022-07-09 01:17:30,659][26022] Updated weights on worker 0-0, policy_version 24063 (0.00085) [2022-07-09 01:17:32,135][26022] Updated weights on worker 0-0, policy_version 24073 (0.00433) [2022-07-09 01:17:34,223][26022] Updated weights on worker 0-0, policy_version 24083 (0.00086) [2022-07-09 01:17:35,100][25689] Fps is (10 sec: 5497.3, 60 sec: 5532.0, 300 sec: 5527.6). Total num frames: 24666112. Throughput: 0: 5819.6. Samples: 24665356. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 01:17:35,103][25689] Avg episode reward: [(0, '-64.634')] [2022-07-09 01:17:36,076][26022] Updated weights on worker 0-0, policy_version 24093 (0.00091) [2022-07-09 01:17:37,831][26022] Updated weights on worker 0-0, policy_version 24103 (0.00081) [2022-07-09 01:17:39,636][26022] Updated weights on worker 0-0, policy_version 24113 (0.00086) [2022-07-09 01:17:40,160][25689] Fps is (10 sec: 5697.0, 60 sec: 5538.4, 300 sec: 5526.5). Total num frames: 24694784. Throughput: 0: 5827.2. Samples: 24698882. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 01:17:40,161][25689] Avg episode reward: [(0, '-65.115')] [2022-07-09 01:17:41,508][26022] Updated weights on worker 0-0, policy_version 24123 (0.00090) [2022-07-09 01:17:43,122][26022] Updated weights on worker 0-0, policy_version 24133 (0.00085) [2022-07-09 01:17:45,192][25689] Fps is (10 sec: 5479.2, 60 sec: 5518.9, 300 sec: 5523.4). Total num frames: 24721408. Throughput: 0: 4985.4. Samples: 24715776. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 01:17:45,192][25689] Avg episode reward: [(0, '-65.108')] [2022-07-09 01:17:45,381][26022] Updated weights on worker 0-0, policy_version 24143 (0.00088) [2022-07-09 01:17:46,790][26022] Updated weights on worker 0-0, policy_version 24153 (0.00102) [2022-07-09 01:17:48,816][26022] Updated weights on worker 0-0, policy_version 24163 (0.00088) [2022-07-09 01:17:50,212][25689] Fps is (10 sec: 5501.3, 60 sec: 5537.6, 300 sec: 5523.5). Total num frames: 24750080. Throughput: 0: 5815.3. Samples: 24749060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 01:17:50,213][25689] Avg episode reward: [(0, '-65.277')] [2022-07-09 01:17:50,618][26022] Updated weights on worker 0-0, policy_version 24173 (0.00084) [2022-07-09 01:17:52,388][26022] Updated weights on worker 0-0, policy_version 24183 (0.00087) [2022-07-09 01:17:54,360][26022] Updated weights on worker 0-0, policy_version 24193 (0.00095) [2022-07-09 01:17:55,214][25689] Fps is (10 sec: 5721.7, 60 sec: 5538.7, 300 sec: 5525.7). Total num frames: 24778752. Throughput: 0: 5838.6. Samples: 24782796. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 01:17:55,214][25689] Avg episode reward: [(0, '-65.526')] [2022-07-09 01:17:56,362][26022] Updated weights on worker 0-0, policy_version 24203 (0.00087) [2022-07-09 01:17:57,867][26022] Updated weights on worker 0-0, policy_version 24213 (0.00087) [2022-07-09 01:17:59,953][26022] Updated weights on worker 0-0, policy_version 24223 (0.00090) [2022-07-09 01:18:00,269][25689] Fps is (10 sec: 5599.9, 60 sec: 5541.3, 300 sec: 5532.3). Total num frames: 24806400. Throughput: 0: 5006.1. Samples: 24799548. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:00,270][25689] Avg episode reward: [(0, '-66.223')] [2022-07-09 01:18:01,919][26022] Updated weights on worker 0-0, policy_version 24233 (0.00082) [2022-07-09 01:18:03,970][26022] Updated weights on worker 0-0, policy_version 24243 (0.00093) [2022-07-09 01:18:05,297][25689] Fps is (10 sec: 5281.0, 60 sec: 5539.2, 300 sec: 5525.1). Total num frames: 24832000. Throughput: 0: 5728.9. Samples: 24830956. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:05,297][25689] Avg episode reward: [(0, '-66.769')] [2022-07-09 01:18:05,819][26022] Updated weights on worker 0-0, policy_version 24253 (0.00090) [2022-07-09 01:18:07,502][26022] Updated weights on worker 0-0, policy_version 24263 (0.00081) [2022-07-09 01:18:09,386][26022] Updated weights on worker 0-0, policy_version 24273 (0.00088) [2022-07-09 01:18:10,304][25689] Fps is (10 sec: 5408.4, 60 sec: 5544.7, 300 sec: 5529.1). Total num frames: 24860672. Throughput: 0: 5745.5. Samples: 24864500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:10,304][25689] Avg episode reward: [(0, '-67.049')] [2022-07-09 01:18:11,204][26022] Updated weights on worker 0-0, policy_version 24283 (0.00092) [2022-07-09 01:18:12,934][26022] Updated weights on worker 0-0, policy_version 24293 (0.00099) [2022-07-09 01:18:15,215][26022] Updated weights on worker 0-0, policy_version 24303 (0.00424) [2022-07-09 01:18:15,327][25689] Fps is (10 sec: 5512.9, 60 sec: 5530.3, 300 sec: 5520.4). Total num frames: 24887296. Throughput: 0: 4886.6. Samples: 24881080. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:15,329][25689] Avg episode reward: [(0, '-67.310')] [2022-07-09 01:18:15,411][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:18:15,423][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000024305_24888320.pth [2022-07-09 01:18:15,424][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000022361_22897664.pth [2022-07-09 01:18:16,792][26022] Updated weights on worker 0-0, policy_version 24313 (0.00092) [2022-07-09 01:18:18,680][26022] Updated weights on worker 0-0, policy_version 24323 (0.00090) [2022-07-09 01:18:20,432][25689] Fps is (10 sec: 5459.7, 60 sec: 5526.1, 300 sec: 5519.6). Total num frames: 24915968. Throughput: 0: 5688.8. Samples: 24914250. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:20,433][25689] Avg episode reward: [(0, '-66.384')] [2022-07-09 01:18:20,527][26022] Updated weights on worker 0-0, policy_version 24333 (0.00085) [2022-07-09 01:18:22,328][26022] Updated weights on worker 0-0, policy_version 24343 (0.00090) [2022-07-09 01:18:24,206][26022] Updated weights on worker 0-0, policy_version 24353 (0.00086) [2022-07-09 01:18:25,495][25689] Fps is (10 sec: 5539.1, 60 sec: 5505.0, 300 sec: 5525.5). Total num frames: 24943616. Throughput: 0: 5787.4. Samples: 24947850. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:25,497][25689] Avg episode reward: [(0, '-66.353')] [2022-07-09 01:18:26,013][26022] Updated weights on worker 0-0, policy_version 24363 (0.00097) [2022-07-09 01:18:27,897][26022] Updated weights on worker 0-0, policy_version 24373 (0.00089) [2022-07-09 01:18:29,867][26022] Updated weights on worker 0-0, policy_version 24383 (0.00096) [2022-07-09 01:18:30,514][25689] Fps is (10 sec: 5586.4, 60 sec: 5542.1, 300 sec: 5526.2). Total num frames: 24972288. Throughput: 0: 5761.8. Samples: 24980944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 01:18:30,514][25689] Avg episode reward: [(0, '-65.967')] [2022-07-09 01:18:31,537][26022] Updated weights on worker 0-0, policy_version 24393 (0.00083) [2022-07-09 01:18:33,603][26022] Updated weights on worker 0-0, policy_version 24403 (0.00106) [2022-07-09 01:18:35,310][26022] Updated weights on worker 0-0, policy_version 24413 (0.00094) [2022-07-09 01:18:35,523][25689] Fps is (10 sec: 5615.9, 60 sec: 5524.7, 300 sec: 5521.1). Total num frames: 24999936. Throughput: 0: 5760.2. Samples: 24997414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:18:35,524][25689] Avg episode reward: [(0, '-66.377')] [2022-07-09 01:18:37,193][26022] Updated weights on worker 0-0, policy_version 24423 (0.00087) [2022-07-09 01:18:39,016][26022] Updated weights on worker 0-0, policy_version 24433 (0.00082) [2022-07-09 01:18:40,563][25689] Fps is (10 sec: 5400.7, 60 sec: 5492.8, 300 sec: 5520.4). Total num frames: 25026560. Throughput: 0: 5791.1. Samples: 25030830. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:18:40,563][25689] Avg episode reward: [(0, '-65.959')] [2022-07-09 01:18:40,916][26022] Updated weights on worker 0-0, policy_version 24443 (0.00096) [2022-07-09 01:18:42,679][26022] Updated weights on worker 0-0, policy_version 24453 (0.00086) [2022-07-09 01:18:44,559][26022] Updated weights on worker 0-0, policy_version 24463 (0.00084) [2022-07-09 01:18:45,565][25689] Fps is (10 sec: 5608.7, 60 sec: 5546.3, 300 sec: 5524.7). Total num frames: 25056256. Throughput: 0: 5793.5. Samples: 25064126. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:18:45,565][25689] Avg episode reward: [(0, '-65.444')] [2022-07-09 01:18:46,375][26022] Updated weights on worker 0-0, policy_version 24473 (0.00086) [2022-07-09 01:18:48,495][26022] Updated weights on worker 0-0, policy_version 24483 (0.00089) [2022-07-09 01:18:50,018][26022] Updated weights on worker 0-0, policy_version 24493 (0.00081) [2022-07-09 01:18:50,590][25689] Fps is (10 sec: 5821.1, 60 sec: 5545.9, 300 sec: 5532.0). Total num frames: 25084928. Throughput: 0: 4979.4. Samples: 25080910. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:18:50,590][25689] Avg episode reward: [(0, '-66.150')] [2022-07-09 01:18:52,040][26022] Updated weights on worker 0-0, policy_version 24503 (0.00084) [2022-07-09 01:18:53,727][26022] Updated weights on worker 0-0, policy_version 24513 (0.00097) [2022-07-09 01:18:55,605][25689] Fps is (10 sec: 5405.4, 60 sec: 5493.8, 300 sec: 5520.1). Total num frames: 25110528. Throughput: 0: 5816.5. Samples: 25114220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:18:55,606][25689] Avg episode reward: [(0, '-65.346')] [2022-07-09 01:18:55,773][26022] Updated weights on worker 0-0, policy_version 24523 (0.00094) [2022-07-09 01:18:57,336][26022] Updated weights on worker 0-0, policy_version 24533 (0.00089) [2022-07-09 01:18:59,387][26022] Updated weights on worker 0-0, policy_version 24543 (0.00088) [2022-07-09 01:19:00,671][25689] Fps is (10 sec: 5383.6, 60 sec: 5509.8, 300 sec: 5533.7). Total num frames: 25139200. Throughput: 0: 5794.3. Samples: 25147342. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:19:00,671][25689] Avg episode reward: [(0, '-65.416')] [2022-07-09 01:19:01,327][26022] Updated weights on worker 0-0, policy_version 24553 (0.00086) [2022-07-09 01:19:03,343][26022] Updated weights on worker 0-0, policy_version 24563 (0.00090) [2022-07-09 01:19:05,319][26022] Updated weights on worker 0-0, policy_version 24573 (0.00084) [2022-07-09 01:19:05,678][25689] Fps is (10 sec: 5286.3, 60 sec: 5494.7, 300 sec: 5520.1). Total num frames: 25163776. Throughput: 0: 4852.1. Samples: 25161718. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 01:19:05,679][25689] Avg episode reward: [(0, '-65.020')] [2022-07-09 01:19:07,030][26022] Updated weights on worker 0-0, policy_version 24583 (0.00113) [2022-07-09 01:19:09,085][26022] Updated weights on worker 0-0, policy_version 24593 (0.00370) [2022-07-09 01:19:10,706][25689] Fps is (10 sec: 5305.9, 60 sec: 5492.8, 300 sec: 5523.3). Total num frames: 25192448. Throughput: 0: 5677.2. Samples: 25195116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:19:10,707][25689] Avg episode reward: [(0, '-65.556')] [2022-07-09 01:19:10,768][26022] Updated weights on worker 0-0, policy_version 24603 (0.00094) [2022-07-09 01:19:12,744][26022] Updated weights on worker 0-0, policy_version 24613 (0.00088) [2022-07-09 01:19:14,347][26022] Updated weights on worker 0-0, policy_version 24623 (0.00085) [2022-07-09 01:19:15,751][25689] Fps is (10 sec: 5591.4, 60 sec: 5507.8, 300 sec: 5520.2). Total num frames: 25220096. Throughput: 0: 5669.1. Samples: 25228426. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:19:15,751][25689] Avg episode reward: [(0, '-66.391')] [2022-07-09 01:19:16,476][26022] Updated weights on worker 0-0, policy_version 24633 (0.00050) [2022-07-09 01:19:18,245][26022] Updated weights on worker 0-0, policy_version 24643 (0.00095) [2022-07-09 01:19:20,087][26022] Updated weights on worker 0-0, policy_version 24653 (0.00082) [2022-07-09 01:19:20,886][25689] Fps is (10 sec: 5532.8, 60 sec: 5505.0, 300 sec: 5522.6). Total num frames: 25248768. Throughput: 0: 4836.8. Samples: 25245118. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:19:20,886][25689] Avg episode reward: [(0, '-65.670')] [2022-07-09 01:19:21,896][26022] Updated weights on worker 0-0, policy_version 24663 (0.00089) [2022-07-09 01:19:23,715][26022] Updated weights on worker 0-0, policy_version 24673 (0.00083) [2022-07-09 01:19:25,561][26022] Updated weights on worker 0-0, policy_version 24683 (0.00093) [2022-07-09 01:19:25,926][25689] Fps is (10 sec: 5535.0, 60 sec: 5507.1, 300 sec: 5522.0). Total num frames: 25276416. Throughput: 0: 5779.3. Samples: 25278736. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:19:25,927][25689] Avg episode reward: [(0, '-65.061')] [2022-07-09 01:19:27,437][26022] Updated weights on worker 0-0, policy_version 24693 (0.00090) [2022-07-09 01:19:29,295][26022] Updated weights on worker 0-0, policy_version 24703 (0.00099) [2022-07-09 01:19:30,981][25689] Fps is (10 sec: 5477.6, 60 sec: 5486.9, 300 sec: 5521.6). Total num frames: 25304064. Throughput: 0: 5751.9. Samples: 25311732. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:19:30,981][25689] Avg episode reward: [(0, '-64.909')] [2022-07-09 01:19:31,208][26022] Updated weights on worker 0-0, policy_version 24713 (0.00085) [2022-07-09 01:19:32,951][26022] Updated weights on worker 0-0, policy_version 24723 (0.00101) [2022-07-09 01:19:34,698][26022] Updated weights on worker 0-0, policy_version 24733 (0.00096) [2022-07-09 01:19:35,986][25689] Fps is (10 sec: 5598.6, 60 sec: 5504.2, 300 sec: 5524.0). Total num frames: 25332736. Throughput: 0: 4930.2. Samples: 25328188. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:19:35,986][25689] Avg episode reward: [(0, '-65.141')] [2022-07-09 01:19:36,750][26022] Updated weights on worker 0-0, policy_version 24743 (0.00087) [2022-07-09 01:19:38,557][26022] Updated weights on worker 0-0, policy_version 24753 (0.00093) [2022-07-09 01:19:40,499][26022] Updated weights on worker 0-0, policy_version 24763 (0.00085) [2022-07-09 01:19:41,061][25689] Fps is (10 sec: 5485.3, 60 sec: 5501.0, 300 sec: 5519.3). Total num frames: 25359360. Throughput: 0: 5771.0. Samples: 25361552. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:19:41,062][25689] Avg episode reward: [(0, '-65.222')] [2022-07-09 01:19:42,087][26022] Updated weights on worker 0-0, policy_version 24773 (0.00088) [2022-07-09 01:19:44,223][26022] Updated weights on worker 0-0, policy_version 24783 (0.00089) [2022-07-09 01:19:45,730][26022] Updated weights on worker 0-0, policy_version 24793 (0.00110) [2022-07-09 01:19:46,114][25689] Fps is (10 sec: 5560.7, 60 sec: 5496.4, 300 sec: 5518.4). Total num frames: 25389056. Throughput: 0: 5763.7. Samples: 25395094. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:19:46,115][25689] Avg episode reward: [(0, '-65.434')] [2022-07-09 01:19:48,036][26022] Updated weights on worker 0-0, policy_version 24803 (0.00087) [2022-07-09 01:19:49,425][26022] Updated weights on worker 0-0, policy_version 24813 (0.00081) [2022-07-09 01:19:51,131][25689] Fps is (10 sec: 5491.4, 60 sec: 5446.3, 300 sec: 5514.9). Total num frames: 25414656. Throughput: 0: 4957.6. Samples: 25411630. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:19:51,131][25689] Avg episode reward: [(0, '-66.210')] [2022-07-09 01:19:51,591][26022] Updated weights on worker 0-0, policy_version 24823 (0.00090) [2022-07-09 01:19:53,227][26022] Updated weights on worker 0-0, policy_version 24833 (0.00093) [2022-07-09 01:19:55,150][26022] Updated weights on worker 0-0, policy_version 24843 (0.00093) [2022-07-09 01:19:56,152][25689] Fps is (10 sec: 5508.7, 60 sec: 5513.5, 300 sec: 5519.4). Total num frames: 25444352. Throughput: 0: 5800.2. Samples: 25445156. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:19:56,152][25689] Avg episode reward: [(0, '-66.574')] [2022-07-09 01:19:56,948][26022] Updated weights on worker 0-0, policy_version 24853 (0.00085) [2022-07-09 01:19:58,924][26022] Updated weights on worker 0-0, policy_version 24863 (0.00089) [2022-07-09 01:20:00,662][26022] Updated weights on worker 0-0, policy_version 24873 (0.00088) [2022-07-09 01:20:01,240][25689] Fps is (10 sec: 5672.7, 60 sec: 5494.6, 300 sec: 5525.3). Total num frames: 25472000. Throughput: 0: 5808.2. Samples: 25478752. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:20:01,240][25689] Avg episode reward: [(0, '-66.909')] [2022-07-09 01:20:02,937][26022] Updated weights on worker 0-0, policy_version 24883 (0.00087) [2022-07-09 01:20:04,603][26022] Updated weights on worker 0-0, policy_version 24893 (0.00084) [2022-07-09 01:20:06,291][25689] Fps is (10 sec: 5251.8, 60 sec: 5507.5, 300 sec: 5517.6). Total num frames: 25497600. Throughput: 0: 4867.3. Samples: 25493300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:20:06,293][25689] Avg episode reward: [(0, '-67.057')] [2022-07-09 01:20:06,692][26022] Updated weights on worker 0-0, policy_version 24903 (0.00091) [2022-07-09 01:20:08,436][26022] Updated weights on worker 0-0, policy_version 24913 (0.00084) [2022-07-09 01:20:10,235][26022] Updated weights on worker 0-0, policy_version 24923 (0.00083) [2022-07-09 01:20:11,335][25689] Fps is (10 sec: 5477.5, 60 sec: 5522.9, 300 sec: 5523.9). Total num frames: 25527296. Throughput: 0: 5700.1. Samples: 25526794. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:20:11,335][25689] Avg episode reward: [(0, '-66.449')] [2022-07-09 01:20:12,067][26022] Updated weights on worker 0-0, policy_version 24933 (0.00101) [2022-07-09 01:20:13,868][26022] Updated weights on worker 0-0, policy_version 24943 (0.00089) [2022-07-09 01:20:15,429][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:20:15,439][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000024951_25549824.pth [2022-07-09 01:20:15,451][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000023009_23561216.pth [2022-07-09 01:20:15,796][26022] Updated weights on worker 0-0, policy_version 24953 (0.00087) [2022-07-09 01:20:16,380][25689] Fps is (10 sec: 5683.9, 60 sec: 5522.9, 300 sec: 5517.1). Total num frames: 25554944. Throughput: 0: 5689.5. Samples: 25560242. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:16,380][25689] Avg episode reward: [(0, '-66.721')] [2022-07-09 01:20:17,744][26022] Updated weights on worker 0-0, policy_version 24963 (0.00089) [2022-07-09 01:20:19,521][26022] Updated weights on worker 0-0, policy_version 24973 (0.00092) [2022-07-09 01:20:21,420][26022] Updated weights on worker 0-0, policy_version 24983 (0.00093) [2022-07-09 01:20:21,475][25689] Fps is (10 sec: 5453.3, 60 sec: 5509.6, 300 sec: 5519.8). Total num frames: 25582592. Throughput: 0: 4805.4. Samples: 25575990. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:21,475][25689] Avg episode reward: [(0, '-66.407')] [2022-07-09 01:20:23,411][26022] Updated weights on worker 0-0, policy_version 24993 (0.00107) [2022-07-09 01:20:24,829][26022] Updated weights on worker 0-0, policy_version 25003 (0.00087) [2022-07-09 01:20:26,494][25689] Fps is (10 sec: 5365.9, 60 sec: 5494.7, 300 sec: 5516.4). Total num frames: 25609216. Throughput: 0: 5755.5. Samples: 25609580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:26,496][25689] Avg episode reward: [(0, '-66.931')] [2022-07-09 01:20:27,127][26022] Updated weights on worker 0-0, policy_version 25013 (0.00086) [2022-07-09 01:20:28,741][26022] Updated weights on worker 0-0, policy_version 25023 (0.00095) [2022-07-09 01:20:30,536][26022] Updated weights on worker 0-0, policy_version 25033 (0.00088) [2022-07-09 01:20:31,517][25689] Fps is (10 sec: 5608.4, 60 sec: 5531.4, 300 sec: 5519.5). Total num frames: 25638912. Throughput: 0: 5758.7. Samples: 25643016. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:31,517][25689] Avg episode reward: [(0, '-67.081')] [2022-07-09 01:20:32,479][26022] Updated weights on worker 0-0, policy_version 25043 (0.00088) [2022-07-09 01:20:34,252][26022] Updated weights on worker 0-0, policy_version 25053 (0.00094) [2022-07-09 01:20:36,266][26022] Updated weights on worker 0-0, policy_version 25063 (0.00090) [2022-07-09 01:20:36,556][25689] Fps is (10 sec: 5699.1, 60 sec: 5511.4, 300 sec: 5520.3). Total num frames: 25666560. Throughput: 0: 4938.5. Samples: 25659882. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:36,556][25689] Avg episode reward: [(0, '-67.934')] [2022-07-09 01:20:37,898][26022] Updated weights on worker 0-0, policy_version 25073 (0.00081) [2022-07-09 01:20:39,859][26022] Updated weights on worker 0-0, policy_version 25083 (0.00096) [2022-07-09 01:20:41,603][25689] Fps is (10 sec: 5482.5, 60 sec: 5530.9, 300 sec: 5516.5). Total num frames: 25694208. Throughput: 0: 5818.6. Samples: 25693106. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:41,603][25689] Avg episode reward: [(0, '-67.815')] [2022-07-09 01:20:41,682][26022] Updated weights on worker 0-0, policy_version 25093 (0.00084) [2022-07-09 01:20:43,503][26022] Updated weights on worker 0-0, policy_version 25103 (0.00387) [2022-07-09 01:20:45,339][26022] Updated weights on worker 0-0, policy_version 25113 (0.00088) [2022-07-09 01:20:46,646][25689] Fps is (10 sec: 5581.7, 60 sec: 5514.8, 300 sec: 5516.1). Total num frames: 25722880. Throughput: 0: 5809.3. Samples: 25726648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 01:20:46,646][25689] Avg episode reward: [(0, '-67.680')] [2022-07-09 01:20:47,144][26022] Updated weights on worker 0-0, policy_version 25123 (0.00085) [2022-07-09 01:20:48,943][26022] Updated weights on worker 0-0, policy_version 25133 (0.00100) [2022-07-09 01:20:50,824][26022] Updated weights on worker 0-0, policy_version 25143 (0.00095) [2022-07-09 01:20:51,650][25689] Fps is (10 sec: 5503.8, 60 sec: 5533.0, 300 sec: 5516.1). Total num frames: 25749504. Throughput: 0: 5803.5. Samples: 25759856. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 01:20:51,650][25689] Avg episode reward: [(0, '-67.611')] [2022-07-09 01:20:52,667][26022] Updated weights on worker 0-0, policy_version 25153 (0.00859) [2022-07-09 01:20:54,524][26022] Updated weights on worker 0-0, policy_version 25163 (0.00092) [2022-07-09 01:20:56,321][26022] Updated weights on worker 0-0, policy_version 25173 (0.00090) [2022-07-09 01:20:56,721][25689] Fps is (10 sec: 5590.0, 60 sec: 5528.4, 300 sec: 5517.3). Total num frames: 25779200. Throughput: 0: 5791.5. Samples: 25776668. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 01:20:56,721][25689] Avg episode reward: [(0, '-67.888')] [2022-07-09 01:20:58,248][26022] Updated weights on worker 0-0, policy_version 25183 (0.00087) [2022-07-09 01:21:00,070][26022] Updated weights on worker 0-0, policy_version 25193 (0.00084) [2022-07-09 01:21:01,852][25689] Fps is (10 sec: 5620.9, 60 sec: 5524.4, 300 sec: 5525.3). Total num frames: 25806848. Throughput: 0: 5791.7. Samples: 25810382. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 01:21:01,852][25689] Avg episode reward: [(0, '-67.397')] [2022-07-09 01:21:02,202][26022] Updated weights on worker 0-0, policy_version 25203 (0.00089) [2022-07-09 01:21:04,070][26022] Updated weights on worker 0-0, policy_version 25213 (0.00085) [2022-07-09 01:21:05,998][26022] Updated weights on worker 0-0, policy_version 25223 (0.00091) [2022-07-09 01:21:06,888][25689] Fps is (10 sec: 5337.8, 60 sec: 5542.7, 300 sec: 5518.8). Total num frames: 25833472. Throughput: 0: 5677.8. Samples: 25841580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 01:21:06,889][25689] Avg episode reward: [(0, '-67.812')] [2022-07-09 01:21:07,831][26022] Updated weights on worker 0-0, policy_version 25233 (0.00102) [2022-07-09 01:21:09,718][26022] Updated weights on worker 0-0, policy_version 25243 (0.00088) [2022-07-09 01:21:11,359][26022] Updated weights on worker 0-0, policy_version 25253 (0.00096) [2022-07-09 01:21:11,916][25689] Fps is (10 sec: 5392.7, 60 sec: 5510.4, 300 sec: 5514.9). Total num frames: 25861120. Throughput: 0: 4860.9. Samples: 25858366. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 01:21:11,916][25689] Avg episode reward: [(0, '-67.758')] [2022-07-09 01:21:13,335][26022] Updated weights on worker 0-0, policy_version 25263 (0.00092) [2022-07-09 01:21:15,119][26022] Updated weights on worker 0-0, policy_version 25273 (0.00093) [2022-07-09 01:21:16,819][26022] Updated weights on worker 0-0, policy_version 25283 (0.00089) [2022-07-09 01:21:17,014][25689] Fps is (10 sec: 5562.3, 60 sec: 5522.4, 300 sec: 5520.8). Total num frames: 25889792. Throughput: 0: 5693.9. Samples: 25892212. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 01:21:17,014][25689] Avg episode reward: [(0, '-68.376')] [2022-07-09 01:21:18,737][26022] Updated weights on worker 0-0, policy_version 25293 (0.00087) [2022-07-09 01:21:20,551][26022] Updated weights on worker 0-0, policy_version 25303 (0.00080) [2022-07-09 01:21:22,114][25689] Fps is (10 sec: 5522.3, 60 sec: 5522.0, 300 sec: 5515.7). Total num frames: 25917440. Throughput: 0: 5676.0. Samples: 25925392. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:22,115][25689] Avg episode reward: [(0, '-67.724')] [2022-07-09 01:21:22,412][26022] Updated weights on worker 0-0, policy_version 25313 (0.00088) [2022-07-09 01:21:24,386][26022] Updated weights on worker 0-0, policy_version 25323 (0.00102) [2022-07-09 01:21:26,024][26022] Updated weights on worker 0-0, policy_version 25333 (0.00093) [2022-07-09 01:21:27,137][25689] Fps is (10 sec: 5563.2, 60 sec: 5555.3, 300 sec: 5518.9). Total num frames: 25946112. Throughput: 0: 4977.2. Samples: 25942364. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:27,146][25689] Avg episode reward: [(0, '-68.049')] [2022-07-09 01:21:28,050][26022] Updated weights on worker 0-0, policy_version 25343 (0.00095) [2022-07-09 01:21:29,923][26022] Updated weights on worker 0-0, policy_version 25353 (0.00652) [2022-07-09 01:21:31,698][26022] Updated weights on worker 0-0, policy_version 25363 (0.00086) [2022-07-09 01:21:32,151][25689] Fps is (10 sec: 5713.5, 60 sec: 5539.3, 300 sec: 5522.9). Total num frames: 25974784. Throughput: 0: 5809.5. Samples: 25975922. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:32,152][25689] Avg episode reward: [(0, '-68.148')] [2022-07-09 01:21:33,633][26022] Updated weights on worker 0-0, policy_version 25373 (0.00082) [2022-07-09 01:21:35,214][26022] Updated weights on worker 0-0, policy_version 25383 (0.00089) [2022-07-09 01:21:37,239][25689] Fps is (10 sec: 5372.4, 60 sec: 5501.1, 300 sec: 5513.4). Total num frames: 26000384. Throughput: 0: 5799.1. Samples: 26009504. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:37,240][25689] Avg episode reward: [(0, '-69.031')] [2022-07-09 01:21:37,367][26022] Updated weights on worker 0-0, policy_version 25393 (0.00085) [2022-07-09 01:21:38,832][26022] Updated weights on worker 0-0, policy_version 25403 (0.00090) [2022-07-09 01:21:40,804][26022] Updated weights on worker 0-0, policy_version 25413 (0.00091) [2022-07-09 01:21:42,279][25689] Fps is (10 sec: 5459.7, 60 sec: 5535.5, 300 sec: 5519.6). Total num frames: 26030080. Throughput: 0: 5017.3. Samples: 26026564. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:42,279][25689] Avg episode reward: [(0, '-68.812')] [2022-07-09 01:21:42,686][26022] Updated weights on worker 0-0, policy_version 25423 (0.00094) [2022-07-09 01:21:44,405][26022] Updated weights on worker 0-0, policy_version 25433 (0.00087) [2022-07-09 01:21:46,360][26022] Updated weights on worker 0-0, policy_version 25443 (0.00090) [2022-07-09 01:21:47,302][25689] Fps is (10 sec: 5902.2, 60 sec: 5554.2, 300 sec: 5526.8). Total num frames: 26059776. Throughput: 0: 5837.9. Samples: 26060084. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:47,304][25689] Avg episode reward: [(0, '-68.788')] [2022-07-09 01:21:48,288][26022] Updated weights on worker 0-0, policy_version 25453 (0.00093) [2022-07-09 01:21:50,015][26022] Updated weights on worker 0-0, policy_version 25463 (0.00086) [2022-07-09 01:21:52,075][26022] Updated weights on worker 0-0, policy_version 25473 (0.00087) [2022-07-09 01:21:52,371][25689] Fps is (10 sec: 5580.6, 60 sec: 5548.2, 300 sec: 5518.9). Total num frames: 26086400. Throughput: 0: 5803.8. Samples: 26093276. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:21:52,372][25689] Avg episode reward: [(0, '-68.351')] [2022-07-09 01:21:53,495][26022] Updated weights on worker 0-0, policy_version 25483 (0.00089) [2022-07-09 01:21:55,582][26022] Updated weights on worker 0-0, policy_version 25493 (0.00086) [2022-07-09 01:21:57,382][25689] Fps is (10 sec: 5485.5, 60 sec: 5536.8, 300 sec: 5523.7). Total num frames: 26115072. Throughput: 0: 4989.0. Samples: 26109998. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:21:57,383][25689] Avg episode reward: [(0, '-67.812')] [2022-07-09 01:21:57,385][26022] Updated weights on worker 0-0, policy_version 25503 (0.00094) [2022-07-09 01:21:59,337][26022] Updated weights on worker 0-0, policy_version 25513 (0.00087) [2022-07-09 01:22:01,100][26022] Updated weights on worker 0-0, policy_version 25523 (0.00080) [2022-07-09 01:22:02,437][25689] Fps is (10 sec: 5290.1, 60 sec: 5493.1, 300 sec: 5519.3). Total num frames: 26139648. Throughput: 0: 5791.6. Samples: 26143310. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:22:02,437][25689] Avg episode reward: [(0, '-67.260')] [2022-07-09 01:22:03,122][26022] Updated weights on worker 0-0, policy_version 25533 (0.00082) [2022-07-09 01:22:05,184][26022] Updated weights on worker 0-0, policy_version 25543 (0.00088) [2022-07-09 01:22:06,912][26022] Updated weights on worker 0-0, policy_version 25553 (0.00082) [2022-07-09 01:22:07,443][25689] Fps is (10 sec: 5292.8, 60 sec: 5529.7, 300 sec: 5520.4). Total num frames: 26168320. Throughput: 0: 5705.7. Samples: 26175000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:22:07,443][25689] Avg episode reward: [(0, '-66.322')] [2022-07-09 01:22:08,840][26022] Updated weights on worker 0-0, policy_version 25563 (0.00998) [2022-07-09 01:22:10,525][26022] Updated weights on worker 0-0, policy_version 25573 (0.00085) [2022-07-09 01:22:12,456][25689] Fps is (10 sec: 5723.1, 60 sec: 5547.9, 300 sec: 5524.6). Total num frames: 26196992. Throughput: 0: 4900.3. Samples: 26191700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:22:12,457][25689] Avg episode reward: [(0, '-66.230')] [2022-07-09 01:22:12,458][26022] Updated weights on worker 0-0, policy_version 25583 (0.00087) [2022-07-09 01:22:14,297][26022] Updated weights on worker 0-0, policy_version 25593 (0.00089) [2022-07-09 01:22:15,570][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:22:15,582][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000025600_26214400.pth [2022-07-09 01:22:15,582][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000023657_24224768.pth [2022-07-09 01:22:16,121][26022] Updated weights on worker 0-0, policy_version 25603 (0.00104) [2022-07-09 01:22:17,525][25689] Fps is (10 sec: 5585.9, 60 sec: 5533.6, 300 sec: 5521.0). Total num frames: 26224640. Throughput: 0: 5746.5. Samples: 26225750. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:22:17,526][25689] Avg episode reward: [(0, '-66.597')] [2022-07-09 01:22:18,098][26022] Updated weights on worker 0-0, policy_version 25613 (0.00089) [2022-07-09 01:22:19,767][26022] Updated weights on worker 0-0, policy_version 25623 (0.00091) [2022-07-09 01:22:21,738][26022] Updated weights on worker 0-0, policy_version 25633 (0.00083) [2022-07-09 01:22:22,598][25689] Fps is (10 sec: 5553.1, 60 sec: 5553.0, 300 sec: 5519.9). Total num frames: 26253312. Throughput: 0: 5739.3. Samples: 26259024. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:22:22,599][25689] Avg episode reward: [(0, '-66.650')] [2022-07-09 01:22:23,348][26022] Updated weights on worker 0-0, policy_version 25643 (0.00092) [2022-07-09 01:22:25,385][26022] Updated weights on worker 0-0, policy_version 25653 (0.00089) [2022-07-09 01:22:27,062][26022] Updated weights on worker 0-0, policy_version 25663 (0.00090) [2022-07-09 01:22:27,621][25689] Fps is (10 sec: 5679.8, 60 sec: 5553.1, 300 sec: 5527.4). Total num frames: 26281984. Throughput: 0: 5005.4. Samples: 26276002. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:22:27,622][25689] Avg episode reward: [(0, '-66.544')] [2022-07-09 01:22:29,028][26022] Updated weights on worker 0-0, policy_version 25673 (0.00087) [2022-07-09 01:22:30,672][26022] Updated weights on worker 0-0, policy_version 25683 (0.00093) [2022-07-09 01:22:32,643][25689] Fps is (10 sec: 5505.0, 60 sec: 5518.5, 300 sec: 5520.2). Total num frames: 26308608. Throughput: 0: 5832.2. Samples: 26309434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:22:32,644][25689] Avg episode reward: [(0, '-66.435')] [2022-07-09 01:22:32,647][26022] Updated weights on worker 0-0, policy_version 25693 (0.00090) [2022-07-09 01:22:34,621][26022] Updated weights on worker 0-0, policy_version 25703 (0.00095) [2022-07-09 01:22:36,248][26022] Updated weights on worker 0-0, policy_version 25713 (0.00081) [2022-07-09 01:22:37,657][25689] Fps is (10 sec: 5407.7, 60 sec: 5559.1, 300 sec: 5517.6). Total num frames: 26336256. Throughput: 0: 5798.6. Samples: 26342488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:22:37,658][25689] Avg episode reward: [(0, '-66.253')] [2022-07-09 01:22:38,239][26022] Updated weights on worker 0-0, policy_version 25723 (0.00090) [2022-07-09 01:22:40,013][26022] Updated weights on worker 0-0, policy_version 25733 (0.00085) [2022-07-09 01:22:41,841][26022] Updated weights on worker 0-0, policy_version 25743 (0.00096) [2022-07-09 01:22:42,784][25689] Fps is (10 sec: 5553.5, 60 sec: 5534.2, 300 sec: 5522.7). Total num frames: 26364928. Throughput: 0: 4969.7. Samples: 26359342. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:22:42,785][25689] Avg episode reward: [(0, '-65.951')] [2022-07-09 01:22:43,592][26022] Updated weights on worker 0-0, policy_version 25753 (0.00108) [2022-07-09 01:22:45,523][26022] Updated weights on worker 0-0, policy_version 25763 (0.00087) [2022-07-09 01:22:47,455][26022] Updated weights on worker 0-0, policy_version 25773 (0.00090) [2022-07-09 01:22:47,803][25689] Fps is (10 sec: 5652.2, 60 sec: 5517.7, 300 sec: 5522.7). Total num frames: 26393600. Throughput: 0: 5794.3. Samples: 26392940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:22:47,803][25689] Avg episode reward: [(0, '-65.794')] [2022-07-09 01:22:49,216][26022] Updated weights on worker 0-0, policy_version 25783 (0.00091) [2022-07-09 01:22:51,040][26022] Updated weights on worker 0-0, policy_version 25793 (0.00088) [2022-07-09 01:22:52,874][25689] Fps is (10 sec: 5581.7, 60 sec: 5534.4, 300 sec: 5518.0). Total num frames: 26421248. Throughput: 0: 5788.6. Samples: 26426546. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:22:52,875][25689] Avg episode reward: [(0, '-65.346')] [2022-07-09 01:22:52,918][26022] Updated weights on worker 0-0, policy_version 25803 (0.00100) [2022-07-09 01:22:54,566][26022] Updated weights on worker 0-0, policy_version 25813 (0.00088) [2022-07-09 01:22:56,487][26022] Updated weights on worker 0-0, policy_version 25823 (0.00090) [2022-07-09 01:22:57,942][25689] Fps is (10 sec: 5655.7, 60 sec: 5546.1, 300 sec: 5524.7). Total num frames: 26450944. Throughput: 0: 5802.1. Samples: 26460182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:22:57,943][25689] Avg episode reward: [(0, '-65.526')] [2022-07-09 01:22:58,400][26022] Updated weights on worker 0-0, policy_version 25833 (0.00094) [2022-07-09 01:23:00,170][26022] Updated weights on worker 0-0, policy_version 25843 (0.00090) [2022-07-09 01:23:02,458][26022] Updated weights on worker 0-0, policy_version 25853 (0.00086) [2022-07-09 01:23:03,000][25689] Fps is (10 sec: 5461.1, 60 sec: 5562.7, 300 sec: 5524.1). Total num frames: 26476544. Throughput: 0: 5809.6. Samples: 26476786. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 01:23:03,000][25689] Avg episode reward: [(0, '-64.506')] [2022-07-09 01:23:04,120][26022] Updated weights on worker 0-0, policy_version 25863 (0.00083) [2022-07-09 01:23:06,166][26022] Updated weights on worker 0-0, policy_version 25873 (0.00086) [2022-07-09 01:23:07,955][26022] Updated weights on worker 0-0, policy_version 25883 (0.00088) [2022-07-09 01:23:08,017][25689] Fps is (10 sec: 5386.9, 60 sec: 5561.7, 300 sec: 5523.9). Total num frames: 26505216. Throughput: 0: 5696.5. Samples: 26508088. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:08,017][25689] Avg episode reward: [(0, '-64.312')] [2022-07-09 01:23:09,944][26022] Updated weights on worker 0-0, policy_version 25893 (0.00090) [2022-07-09 01:23:11,665][26022] Updated weights on worker 0-0, policy_version 25903 (0.00053) [2022-07-09 01:23:13,055][25689] Fps is (10 sec: 5397.4, 60 sec: 5508.8, 300 sec: 5520.2). Total num frames: 26530816. Throughput: 0: 5691.6. Samples: 26541406. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:13,056][25689] Avg episode reward: [(0, '-63.980')] [2022-07-09 01:23:13,471][26022] Updated weights on worker 0-0, policy_version 25913 (0.00097) [2022-07-09 01:23:15,243][26022] Updated weights on worker 0-0, policy_version 25923 (0.00092) [2022-07-09 01:23:17,169][26022] Updated weights on worker 0-0, policy_version 25933 (0.00082) [2022-07-09 01:23:18,090][25689] Fps is (10 sec: 5387.5, 60 sec: 5528.7, 300 sec: 5521.5). Total num frames: 26559488. Throughput: 0: 4852.3. Samples: 26557948. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:18,091][25689] Avg episode reward: [(0, '-63.789')] [2022-07-09 01:23:19,063][26022] Updated weights on worker 0-0, policy_version 25943 (0.00085) [2022-07-09 01:23:20,892][26022] Updated weights on worker 0-0, policy_version 25953 (0.00088) [2022-07-09 01:23:22,783][26022] Updated weights on worker 0-0, policy_version 25963 (0.00082) [2022-07-09 01:23:23,119][25689] Fps is (10 sec: 5596.4, 60 sec: 5515.9, 300 sec: 5522.1). Total num frames: 26587136. Throughput: 0: 5703.7. Samples: 26591538. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:23,119][25689] Avg episode reward: [(0, '-63.557')] [2022-07-09 01:23:24,725][26022] Updated weights on worker 0-0, policy_version 25973 (0.00102) [2022-07-09 01:23:26,389][26022] Updated weights on worker 0-0, policy_version 25983 (0.00093) [2022-07-09 01:23:28,122][25689] Fps is (10 sec: 5716.2, 60 sec: 5534.6, 300 sec: 5525.9). Total num frames: 26616832. Throughput: 0: 5796.1. Samples: 26624622. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:28,123][25689] Avg episode reward: [(0, '-63.796')] [2022-07-09 01:23:28,130][26022] Updated weights on worker 0-0, policy_version 25993 (0.00086) [2022-07-09 01:23:30,186][26022] Updated weights on worker 0-0, policy_version 26003 (0.00089) [2022-07-09 01:23:31,895][26022] Updated weights on worker 0-0, policy_version 26013 (0.00090) [2022-07-09 01:23:33,186][25689] Fps is (10 sec: 5594.5, 60 sec: 5530.8, 300 sec: 5521.4). Total num frames: 26643456. Throughput: 0: 4965.0. Samples: 26641356. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:33,187][25689] Avg episode reward: [(0, '-63.570')] [2022-07-09 01:23:33,897][26022] Updated weights on worker 0-0, policy_version 26023 (0.00086) [2022-07-09 01:23:35,538][26022] Updated weights on worker 0-0, policy_version 26033 (0.00086) [2022-07-09 01:23:37,542][26022] Updated weights on worker 0-0, policy_version 26043 (0.00090) [2022-07-09 01:23:38,212][25689] Fps is (10 sec: 5480.5, 60 sec: 5546.6, 300 sec: 5528.5). Total num frames: 26672128. Throughput: 0: 5813.9. Samples: 26674932. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:23:38,212][25689] Avg episode reward: [(0, '-64.407')] [2022-07-09 01:23:39,372][26022] Updated weights on worker 0-0, policy_version 26053 (0.00094) [2022-07-09 01:23:41,187][26022] Updated weights on worker 0-0, policy_version 26063 (0.00054) [2022-07-09 01:23:42,854][26022] Updated weights on worker 0-0, policy_version 26073 (0.00092) [2022-07-09 01:23:43,310][25689] Fps is (10 sec: 5563.1, 60 sec: 5532.4, 300 sec: 5519.9). Total num frames: 26699776. Throughput: 0: 5782.2. Samples: 26708286. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:23:43,310][25689] Avg episode reward: [(0, '-65.013')] [2022-07-09 01:23:44,943][26022] Updated weights on worker 0-0, policy_version 26083 (0.00101) [2022-07-09 01:23:46,780][26022] Updated weights on worker 0-0, policy_version 26093 (0.00087) [2022-07-09 01:23:48,327][25689] Fps is (10 sec: 5568.1, 60 sec: 5532.5, 300 sec: 5520.0). Total num frames: 26728448. Throughput: 0: 4959.9. Samples: 26724834. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:23:48,328][25689] Avg episode reward: [(0, '-64.602')] [2022-07-09 01:23:48,419][26022] Updated weights on worker 0-0, policy_version 26103 (0.00097) [2022-07-09 01:23:50,529][26022] Updated weights on worker 0-0, policy_version 26113 (0.00094) [2022-07-09 01:23:52,285][26022] Updated weights on worker 0-0, policy_version 26123 (0.00088) [2022-07-09 01:23:53,355][25689] Fps is (10 sec: 5504.8, 60 sec: 5519.5, 300 sec: 5523.2). Total num frames: 26755072. Throughput: 0: 5770.6. Samples: 26757742. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:23:53,356][25689] Avg episode reward: [(0, '-65.794')] [2022-07-09 01:23:54,316][26022] Updated weights on worker 0-0, policy_version 26133 (0.00149) [2022-07-09 01:23:55,848][26022] Updated weights on worker 0-0, policy_version 26143 (0.00089) [2022-07-09 01:23:57,747][26022] Updated weights on worker 0-0, policy_version 26153 (0.00080) [2022-07-09 01:23:58,372][25689] Fps is (10 sec: 5504.9, 60 sec: 5507.2, 300 sec: 5524.1). Total num frames: 26783744. Throughput: 0: 5769.6. Samples: 26791246. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:23:58,372][25689] Avg episode reward: [(0, '-65.111')] [2022-07-09 01:23:59,783][26022] Updated weights on worker 0-0, policy_version 26163 (0.00109) [2022-07-09 01:24:01,936][26022] Updated weights on worker 0-0, policy_version 26173 (0.00095) [2022-07-09 01:24:03,434][25689] Fps is (10 sec: 5384.7, 60 sec: 5506.8, 300 sec: 5526.6). Total num frames: 26809344. Throughput: 0: 4945.0. Samples: 26807800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:24:03,436][25689] Avg episode reward: [(0, '-65.277')] [2022-07-09 01:24:03,671][26022] Updated weights on worker 0-0, policy_version 26183 (0.00086) [2022-07-09 01:24:05,740][26022] Updated weights on worker 0-0, policy_version 26193 (0.00091) [2022-07-09 01:24:07,069][26022] Updated weights on worker 0-0, policy_version 26203 (0.00093) [2022-07-09 01:24:08,507][25689] Fps is (10 sec: 5253.9, 60 sec: 5484.8, 300 sec: 5522.3). Total num frames: 26836992. Throughput: 0: 5686.8. Samples: 26839592. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:24:08,507][25689] Avg episode reward: [(0, '-64.635')] [2022-07-09 01:24:09,252][26022] Updated weights on worker 0-0, policy_version 26213 (0.00089) [2022-07-09 01:24:10,738][26022] Updated weights on worker 0-0, policy_version 26223 (0.00092) [2022-07-09 01:24:12,908][26022] Updated weights on worker 0-0, policy_version 26233 (0.00085) [2022-07-09 01:24:13,571][25689] Fps is (10 sec: 5757.8, 60 sec: 5567.1, 300 sec: 5532.3). Total num frames: 26867712. Throughput: 0: 5720.7. Samples: 26873392. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 01:24:13,573][25689] Avg episode reward: [(0, '-64.527')] [2022-07-09 01:24:14,820][26022] Updated weights on worker 0-0, policy_version 26243 (0.00096) [2022-07-09 01:24:15,663][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:24:15,672][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000026248_26877952.pth [2022-07-09 01:24:15,673][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000024305_24888320.pth [2022-07-09 01:24:16,395][26022] Updated weights on worker 0-0, policy_version 26253 (0.00085) [2022-07-09 01:24:18,383][26022] Updated weights on worker 0-0, policy_version 26263 (0.00103) [2022-07-09 01:24:18,603][25689] Fps is (10 sec: 5679.4, 60 sec: 5533.5, 300 sec: 5527.3). Total num frames: 26894336. Throughput: 0: 4903.5. Samples: 26890452. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:18,604][25689] Avg episode reward: [(0, '-64.885')] [2022-07-09 01:24:20,002][26022] Updated weights on worker 0-0, policy_version 26273 (0.00093) [2022-07-09 01:24:21,891][26022] Updated weights on worker 0-0, policy_version 26283 (0.00090) [2022-07-09 01:24:23,668][25689] Fps is (10 sec: 5476.2, 60 sec: 5547.1, 300 sec: 5530.3). Total num frames: 26923008. Throughput: 0: 5739.0. Samples: 26923926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:23,669][25689] Avg episode reward: [(0, '-64.394')] [2022-07-09 01:24:23,868][26022] Updated weights on worker 0-0, policy_version 26293 (0.00090) [2022-07-09 01:24:25,720][26022] Updated weights on worker 0-0, policy_version 26303 (0.00095) [2022-07-09 01:24:27,561][26022] Updated weights on worker 0-0, policy_version 26313 (0.00097) [2022-07-09 01:24:28,767][25689] Fps is (10 sec: 5541.5, 60 sec: 5504.6, 300 sec: 5529.4). Total num frames: 26950656. Throughput: 0: 5801.5. Samples: 26957132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:28,767][25689] Avg episode reward: [(0, '-65.508')] [2022-07-09 01:24:29,472][26022] Updated weights on worker 0-0, policy_version 26323 (0.00094) [2022-07-09 01:24:31,116][26022] Updated weights on worker 0-0, policy_version 26333 (0.00089) [2022-07-09 01:24:33,077][26022] Updated weights on worker 0-0, policy_version 26343 (0.00082) [2022-07-09 01:24:33,785][25689] Fps is (10 sec: 5668.0, 60 sec: 5559.4, 300 sec: 5532.6). Total num frames: 26980352. Throughput: 0: 4965.6. Samples: 26973770. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:33,786][25689] Avg episode reward: [(0, '-65.440')] [2022-07-09 01:24:35,159][26022] Updated weights on worker 0-0, policy_version 26353 (0.00105) [2022-07-09 01:24:36,790][26022] Updated weights on worker 0-0, policy_version 26363 (0.00096) [2022-07-09 01:24:38,808][25689] Fps is (10 sec: 5405.0, 60 sec: 5492.1, 300 sec: 5526.7). Total num frames: 27004928. Throughput: 0: 5771.2. Samples: 27007056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:38,808][25689] Avg episode reward: [(0, '-65.257')] [2022-07-09 01:24:38,819][26022] Updated weights on worker 0-0, policy_version 26373 (0.00087) [2022-07-09 01:24:40,360][26022] Updated weights on worker 0-0, policy_version 26383 (0.00123) [2022-07-09 01:24:42,316][26022] Updated weights on worker 0-0, policy_version 26393 (0.00089) [2022-07-09 01:24:43,854][25689] Fps is (10 sec: 5492.1, 60 sec: 5547.6, 300 sec: 5530.3). Total num frames: 27035648. Throughput: 0: 5786.7. Samples: 27040732. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:43,854][25689] Avg episode reward: [(0, '-64.709')] [2022-07-09 01:24:44,019][26022] Updated weights on worker 0-0, policy_version 26403 (0.00085) [2022-07-09 01:24:45,930][26022] Updated weights on worker 0-0, policy_version 26413 (0.00090) [2022-07-09 01:24:47,801][26022] Updated weights on worker 0-0, policy_version 26423 (0.00091) [2022-07-09 01:24:48,860][25689] Fps is (10 sec: 5806.7, 60 sec: 5531.7, 300 sec: 5537.4). Total num frames: 27063296. Throughput: 0: 5005.2. Samples: 27057702. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:24:48,860][25689] Avg episode reward: [(0, '-65.049')] [2022-07-09 01:24:49,559][26022] Updated weights on worker 0-0, policy_version 26433 (0.00090) [2022-07-09 01:24:51,396][26022] Updated weights on worker 0-0, policy_version 26443 (0.00051) [2022-07-09 01:24:53,248][26022] Updated weights on worker 0-0, policy_version 26453 (0.00084) [2022-07-09 01:24:53,899][25689] Fps is (10 sec: 5504.8, 60 sec: 5547.6, 300 sec: 5530.2). Total num frames: 27090944. Throughput: 0: 5835.2. Samples: 27091136. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:24:53,899][25689] Avg episode reward: [(0, '-64.869')] [2022-07-09 01:24:55,189][26022] Updated weights on worker 0-0, policy_version 26463 (0.00083) [2022-07-09 01:24:56,969][26022] Updated weights on worker 0-0, policy_version 26473 (0.00097) [2022-07-09 01:24:58,666][26022] Updated weights on worker 0-0, policy_version 26483 (0.00083) [2022-07-09 01:24:58,916][25689] Fps is (10 sec: 5600.7, 60 sec: 5547.6, 300 sec: 5534.9). Total num frames: 27119616. Throughput: 0: 5843.9. Samples: 27124564. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:24:58,916][25689] Avg episode reward: [(0, '-63.994')] [2022-07-09 01:25:00,666][26022] Updated weights on worker 0-0, policy_version 26493 (0.00084) [2022-07-09 01:25:02,737][26022] Updated weights on worker 0-0, policy_version 26503 (0.00100) [2022-07-09 01:25:03,975][25689] Fps is (10 sec: 5284.6, 60 sec: 5530.9, 300 sec: 5531.3). Total num frames: 27144192. Throughput: 0: 5004.1. Samples: 27141420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:25:03,975][25689] Avg episode reward: [(0, '-63.802')] [2022-07-09 01:25:04,691][26022] Updated weights on worker 0-0, policy_version 26513 (0.00093) [2022-07-09 01:25:06,619][26022] Updated weights on worker 0-0, policy_version 26523 (0.00093) [2022-07-09 01:25:08,328][26022] Updated weights on worker 0-0, policy_version 26533 (0.00097) [2022-07-09 01:25:08,977][25689] Fps is (10 sec: 5496.0, 60 sec: 5588.2, 300 sec: 5535.6). Total num frames: 27174912. Throughput: 0: 5719.3. Samples: 27172758. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:25:08,977][25689] Avg episode reward: [(0, '-64.297')] [2022-07-09 01:25:10,274][26022] Updated weights on worker 0-0, policy_version 26543 (0.00089) [2022-07-09 01:25:11,851][26022] Updated weights on worker 0-0, policy_version 26553 (0.00085) [2022-07-09 01:25:13,941][26022] Updated weights on worker 0-0, policy_version 26563 (0.00088) [2022-07-09 01:25:13,991][25689] Fps is (10 sec: 5623.1, 60 sec: 5508.1, 300 sec: 5529.3). Total num frames: 27200512. Throughput: 0: 5753.7. Samples: 27206738. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:25:13,991][25689] Avg episode reward: [(0, '-64.469')] [2022-07-09 01:25:15,499][26022] Updated weights on worker 0-0, policy_version 26573 (0.00088) [2022-07-09 01:25:17,499][26022] Updated weights on worker 0-0, policy_version 26583 (0.00090) [2022-07-09 01:25:19,015][25689] Fps is (10 sec: 5508.8, 60 sec: 5559.8, 300 sec: 5537.5). Total num frames: 27230208. Throughput: 0: 4925.5. Samples: 27223562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:25:19,016][25689] Avg episode reward: [(0, '-64.042')] [2022-07-09 01:25:19,147][26022] Updated weights on worker 0-0, policy_version 26593 (0.00086) [2022-07-09 01:25:20,913][26022] Updated weights on worker 0-0, policy_version 26603 (0.00087) [2022-07-09 01:25:22,778][26022] Updated weights on worker 0-0, policy_version 26613 (0.00086) [2022-07-09 01:25:24,066][25689] Fps is (10 sec: 5894.6, 60 sec: 5577.9, 300 sec: 5547.2). Total num frames: 27259904. Throughput: 0: 5779.4. Samples: 27257536. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 01:25:24,071][25689] Avg episode reward: [(0, '-64.245')] [2022-07-09 01:25:24,598][26022] Updated weights on worker 0-0, policy_version 26623 (0.00102) [2022-07-09 01:25:26,604][26022] Updated weights on worker 0-0, policy_version 26633 (0.00084) [2022-07-09 01:25:28,221][26022] Updated weights on worker 0-0, policy_version 26643 (0.00084) [2022-07-09 01:25:29,074][25689] Fps is (10 sec: 5497.1, 60 sec: 5552.4, 300 sec: 5533.7). Total num frames: 27285504. Throughput: 0: 5880.1. Samples: 27290930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:29,074][25689] Avg episode reward: [(0, '-64.402')] [2022-07-09 01:25:30,102][26022] Updated weights on worker 0-0, policy_version 26653 (0.00085) [2022-07-09 01:25:32,011][26022] Updated weights on worker 0-0, policy_version 26663 (0.00084) [2022-07-09 01:25:33,758][26022] Updated weights on worker 0-0, policy_version 26673 (0.00087) [2022-07-09 01:25:34,085][25689] Fps is (10 sec: 5417.2, 60 sec: 5536.1, 300 sec: 5537.7). Total num frames: 27314176. Throughput: 0: 5027.9. Samples: 27307770. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:34,085][25689] Avg episode reward: [(0, '-64.160')] [2022-07-09 01:25:35,714][26022] Updated weights on worker 0-0, policy_version 26683 (0.00087) [2022-07-09 01:25:37,279][26022] Updated weights on worker 0-0, policy_version 26693 (0.00093) [2022-07-09 01:25:39,104][25689] Fps is (10 sec: 5615.1, 60 sec: 5587.4, 300 sec: 5538.2). Total num frames: 27341824. Throughput: 0: 5876.9. Samples: 27341624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:39,104][25689] Avg episode reward: [(0, '-64.378')] [2022-07-09 01:25:39,544][26022] Updated weights on worker 0-0, policy_version 26703 (0.00090) [2022-07-09 01:25:41,148][26022] Updated weights on worker 0-0, policy_version 26713 (0.00086) [2022-07-09 01:25:43,035][26022] Updated weights on worker 0-0, policy_version 26723 (0.00086) [2022-07-09 01:25:44,214][25689] Fps is (10 sec: 5661.0, 60 sec: 5564.4, 300 sec: 5540.4). Total num frames: 27371520. Throughput: 0: 5839.8. Samples: 27375198. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:44,215][25689] Avg episode reward: [(0, '-64.158')] [2022-07-09 01:25:44,816][26022] Updated weights on worker 0-0, policy_version 26733 (0.00082) [2022-07-09 01:25:46,573][26022] Updated weights on worker 0-0, policy_version 26743 (0.00089) [2022-07-09 01:25:48,524][26022] Updated weights on worker 0-0, policy_version 26753 (0.00093) [2022-07-09 01:25:49,237][25689] Fps is (10 sec: 5759.9, 60 sec: 5579.8, 300 sec: 5546.9). Total num frames: 27400192. Throughput: 0: 5864.3. Samples: 27409176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:49,238][25689] Avg episode reward: [(0, '-64.296')] [2022-07-09 01:25:50,128][26022] Updated weights on worker 0-0, policy_version 26763 (0.00096) [2022-07-09 01:25:52,038][26022] Updated weights on worker 0-0, policy_version 26773 (0.01201) [2022-07-09 01:25:53,795][26022] Updated weights on worker 0-0, policy_version 26783 (0.00087) [2022-07-09 01:25:54,270][25689] Fps is (10 sec: 5498.9, 60 sec: 5563.4, 300 sec: 5537.3). Total num frames: 27426816. Throughput: 0: 5852.7. Samples: 27425910. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:54,271][25689] Avg episode reward: [(0, '-64.239')] [2022-07-09 01:25:55,764][26022] Updated weights on worker 0-0, policy_version 26793 (0.00087) [2022-07-09 01:25:57,674][26022] Updated weights on worker 0-0, policy_version 26803 (0.00084) [2022-07-09 01:25:59,303][25689] Fps is (10 sec: 5595.1, 60 sec: 5578.9, 300 sec: 5546.0). Total num frames: 27456512. Throughput: 0: 5841.0. Samples: 27459608. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 01:25:59,304][25689] Avg episode reward: [(0, '-64.228')] [2022-07-09 01:25:59,311][26022] Updated weights on worker 0-0, policy_version 26813 (0.00340) [2022-07-09 01:26:01,380][26022] Updated weights on worker 0-0, policy_version 26823 (0.00090) [2022-07-09 01:26:03,390][26022] Updated weights on worker 0-0, policy_version 26833 (0.00088) [2022-07-09 01:26:04,399][25689] Fps is (10 sec: 5459.6, 60 sec: 5592.5, 300 sec: 5541.5). Total num frames: 27482112. Throughput: 0: 5733.1. Samples: 27490916. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:04,399][25689] Avg episode reward: [(0, '-64.053')] [2022-07-09 01:26:05,405][26022] Updated weights on worker 0-0, policy_version 26843 (0.00090) [2022-07-09 01:26:07,251][26022] Updated weights on worker 0-0, policy_version 26853 (0.00088) [2022-07-09 01:26:09,046][26022] Updated weights on worker 0-0, policy_version 26863 (0.00084) [2022-07-09 01:26:09,430][25689] Fps is (10 sec: 5258.2, 60 sec: 5539.0, 300 sec: 5541.4). Total num frames: 27509760. Throughput: 0: 4870.9. Samples: 27507532. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:09,430][25689] Avg episode reward: [(0, '-63.468')] [2022-07-09 01:26:10,820][26022] Updated weights on worker 0-0, policy_version 26873 (0.00092) [2022-07-09 01:26:12,780][26022] Updated weights on worker 0-0, policy_version 26883 (0.00115) [2022-07-09 01:26:14,451][25689] Fps is (10 sec: 5500.9, 60 sec: 5572.2, 300 sec: 5539.4). Total num frames: 27537408. Throughput: 0: 5712.9. Samples: 27541198. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:14,451][25689] Avg episode reward: [(0, '-63.424')] [2022-07-09 01:26:14,459][26022] Updated weights on worker 0-0, policy_version 26893 (0.00081) [2022-07-09 01:26:15,735][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:26:15,743][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000026900_27545600.pth [2022-07-09 01:26:15,746][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000024951_25549824.pth [2022-07-09 01:26:16,291][26022] Updated weights on worker 0-0, policy_version 26903 (0.00086) [2022-07-09 01:26:18,103][26022] Updated weights on worker 0-0, policy_version 26913 (0.00090) [2022-07-09 01:26:19,455][25689] Fps is (10 sec: 5516.0, 60 sec: 5540.2, 300 sec: 5541.2). Total num frames: 27565056. Throughput: 0: 5742.0. Samples: 27575316. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:19,455][25689] Avg episode reward: [(0, '-63.210')] [2022-07-09 01:26:19,851][26022] Updated weights on worker 0-0, policy_version 26923 (0.00897) [2022-07-09 01:26:21,662][26022] Updated weights on worker 0-0, policy_version 26933 (0.00091) [2022-07-09 01:26:23,528][26022] Updated weights on worker 0-0, policy_version 26943 (0.00092) [2022-07-09 01:26:24,576][25689] Fps is (10 sec: 5663.5, 60 sec: 5533.8, 300 sec: 5542.8). Total num frames: 27594752. Throughput: 0: 5023.2. Samples: 27592270. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:24,576][25689] Avg episode reward: [(0, '-63.329')] [2022-07-09 01:26:25,311][26022] Updated weights on worker 0-0, policy_version 26953 (0.00088) [2022-07-09 01:26:27,265][26022] Updated weights on worker 0-0, policy_version 26963 (0.00087) [2022-07-09 01:26:28,945][26022] Updated weights on worker 0-0, policy_version 26973 (0.00090) [2022-07-09 01:26:29,590][25689] Fps is (10 sec: 5758.9, 60 sec: 5584.0, 300 sec: 5542.8). Total num frames: 27623424. Throughput: 0: 5863.6. Samples: 27625742. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:29,590][25689] Avg episode reward: [(0, '-63.102')] [2022-07-09 01:26:31,022][26022] Updated weights on worker 0-0, policy_version 26983 (0.00085) [2022-07-09 01:26:32,482][26022] Updated weights on worker 0-0, policy_version 26993 (0.00086) [2022-07-09 01:26:34,587][26022] Updated weights on worker 0-0, policy_version 27003 (0.00091) [2022-07-09 01:26:34,674][25689] Fps is (10 sec: 5577.2, 60 sec: 5560.3, 300 sec: 5549.7). Total num frames: 27651072. Throughput: 0: 5857.7. Samples: 27659660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:34,675][25689] Avg episode reward: [(0, '-63.909')] [2022-07-09 01:26:36,231][26022] Updated weights on worker 0-0, policy_version 27013 (0.00091) [2022-07-09 01:26:38,084][26022] Updated weights on worker 0-0, policy_version 27023 (0.00081) [2022-07-09 01:26:39,704][25689] Fps is (10 sec: 5568.2, 60 sec: 5576.2, 300 sec: 5546.5). Total num frames: 27679744. Throughput: 0: 4996.2. Samples: 27676488. Policy #0 lag: (min: 0.0, avg: 10.6, max: 25.0) [2022-07-09 01:26:39,705][25689] Avg episode reward: [(0, '-63.220')] [2022-07-09 01:26:39,875][26022] Updated weights on worker 0-0, policy_version 27033 (0.00097) [2022-07-09 01:26:41,807][26022] Updated weights on worker 0-0, policy_version 27043 (0.00090) [2022-07-09 01:26:43,483][26022] Updated weights on worker 0-0, policy_version 27053 (0.00088) [2022-07-09 01:26:44,767][25689] Fps is (10 sec: 5681.7, 60 sec: 5563.7, 300 sec: 5542.3). Total num frames: 27708416. Throughput: 0: 5845.3. Samples: 27710292. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:26:44,767][25689] Avg episode reward: [(0, '-63.486')] [2022-07-09 01:26:45,508][26022] Updated weights on worker 0-0, policy_version 27063 (0.00089) [2022-07-09 01:26:47,147][26022] Updated weights on worker 0-0, policy_version 27073 (0.00084) [2022-07-09 01:26:49,204][26022] Updated weights on worker 0-0, policy_version 27083 (0.00086) [2022-07-09 01:26:49,812][25689] Fps is (10 sec: 5673.3, 60 sec: 5561.7, 300 sec: 5549.6). Total num frames: 27737088. Throughput: 0: 5861.6. Samples: 27744278. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:26:49,813][25689] Avg episode reward: [(0, '-63.503')] [2022-07-09 01:26:50,810][26022] Updated weights on worker 0-0, policy_version 27094 (0.00088) [2022-07-09 01:26:52,911][26022] Updated weights on worker 0-0, policy_version 27104 (0.00084) [2022-07-09 01:26:54,447][26022] Updated weights on worker 0-0, policy_version 27114 (0.00057) [2022-07-09 01:26:54,891][25689] Fps is (10 sec: 5765.4, 60 sec: 5608.1, 300 sec: 5551.8). Total num frames: 27766784. Throughput: 0: 5025.9. Samples: 27761266. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:26:54,891][25689] Avg episode reward: [(0, '-63.259')] [2022-07-09 01:26:56,666][26022] Updated weights on worker 0-0, policy_version 27124 (0.00088) [2022-07-09 01:26:58,063][26022] Updated weights on worker 0-0, policy_version 27134 (0.00085) [2022-07-09 01:26:59,973][25689] Fps is (10 sec: 5542.7, 60 sec: 5552.9, 300 sec: 5558.2). Total num frames: 27793408. Throughput: 0: 5842.2. Samples: 27794902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:26:59,974][25689] Avg episode reward: [(0, '-62.577')] [2022-07-09 01:27:00,297][26022] Updated weights on worker 0-0, policy_version 27144 (0.00999) [2022-07-09 01:27:02,242][26022] Updated weights on worker 0-0, policy_version 27154 (0.00089) [2022-07-09 01:27:04,314][26022] Updated weights on worker 0-0, policy_version 27164 (0.00799) [2022-07-09 01:27:05,022][25689] Fps is (10 sec: 5256.1, 60 sec: 5574.1, 300 sec: 5550.5). Total num frames: 27820032. Throughput: 0: 5726.3. Samples: 27826276. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:27:05,022][25689] Avg episode reward: [(0, '-62.526')] [2022-07-09 01:27:05,780][26022] Updated weights on worker 0-0, policy_version 27174 (0.00105) [2022-07-09 01:27:07,921][26022] Updated weights on worker 0-0, policy_version 27184 (0.00092) [2022-07-09 01:27:09,600][26022] Updated weights on worker 0-0, policy_version 27194 (0.00083) [2022-07-09 01:27:10,081][25689] Fps is (10 sec: 5470.7, 60 sec: 5588.4, 300 sec: 5549.6). Total num frames: 27848704. Throughput: 0: 4869.6. Samples: 27842978. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:27:10,081][25689] Avg episode reward: [(0, '-62.505')] [2022-07-09 01:27:11,534][26022] Updated weights on worker 0-0, policy_version 27204 (0.00084) [2022-07-09 01:27:13,228][26022] Updated weights on worker 0-0, policy_version 27214 (0.00086) [2022-07-09 01:27:15,101][25689] Fps is (10 sec: 5587.4, 60 sec: 5588.5, 300 sec: 5550.6). Total num frames: 27876352. Throughput: 0: 5718.2. Samples: 27876834. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:27:15,102][25689] Avg episode reward: [(0, '-62.273')] [2022-07-09 01:27:15,192][26022] Updated weights on worker 0-0, policy_version 27224 (0.00096) [2022-07-09 01:27:16,814][26022] Updated weights on worker 0-0, policy_version 27234 (0.00085) [2022-07-09 01:27:18,908][26022] Updated weights on worker 0-0, policy_version 27244 (0.00093) [2022-07-09 01:27:20,173][25689] Fps is (10 sec: 5682.2, 60 sec: 5616.0, 300 sec: 5554.0). Total num frames: 27906048. Throughput: 0: 5726.7. Samples: 27910580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:20,174][25689] Avg episode reward: [(0, '-62.201')] [2022-07-09 01:27:20,387][26022] Updated weights on worker 0-0, policy_version 27254 (0.00079) [2022-07-09 01:27:22,587][26022] Updated weights on worker 0-0, policy_version 27264 (0.00095) [2022-07-09 01:27:24,005][26022] Updated weights on worker 0-0, policy_version 27274 (0.00081) [2022-07-09 01:27:25,282][25689] Fps is (10 sec: 5531.9, 60 sec: 5566.5, 300 sec: 5545.5). Total num frames: 27932672. Throughput: 0: 5001.2. Samples: 27927604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:25,284][25689] Avg episode reward: [(0, '-62.740')] [2022-07-09 01:27:26,096][26022] Updated weights on worker 0-0, policy_version 27284 (0.00099) [2022-07-09 01:27:28,007][26022] Updated weights on worker 0-0, policy_version 27294 (0.00085) [2022-07-09 01:27:29,857][26022] Updated weights on worker 0-0, policy_version 27304 (0.00093) [2022-07-09 01:27:30,339][25689] Fps is (10 sec: 5439.0, 60 sec: 5562.5, 300 sec: 5551.8). Total num frames: 27961344. Throughput: 0: 5810.5. Samples: 27960690. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:30,341][25689] Avg episode reward: [(0, '-62.528')] [2022-07-09 01:27:31,706][26022] Updated weights on worker 0-0, policy_version 27314 (0.00094) [2022-07-09 01:27:33,657][26022] Updated weights on worker 0-0, policy_version 27324 (0.00090) [2022-07-09 01:27:35,357][25689] Fps is (10 sec: 5590.2, 60 sec: 5568.6, 300 sec: 5551.7). Total num frames: 27988992. Throughput: 0: 5790.0. Samples: 27994114. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:35,363][25689] Avg episode reward: [(0, '-62.641')] [2022-07-09 01:27:35,371][26022] Updated weights on worker 0-0, policy_version 27334 (0.00083) [2022-07-09 01:27:37,271][26022] Updated weights on worker 0-0, policy_version 27344 (0.00080) [2022-07-09 01:27:38,850][26022] Updated weights on worker 0-0, policy_version 27354 (0.00090) [2022-07-09 01:27:40,368][25689] Fps is (10 sec: 5514.0, 60 sec: 5553.5, 300 sec: 5550.4). Total num frames: 28016640. Throughput: 0: 4972.4. Samples: 28010998. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:40,369][25689] Avg episode reward: [(0, '-62.838')] [2022-07-09 01:27:40,766][26022] Updated weights on worker 0-0, policy_version 27364 (0.00093) [2022-07-09 01:27:42,710][26022] Updated weights on worker 0-0, policy_version 27374 (0.00106) [2022-07-09 01:27:44,573][26022] Updated weights on worker 0-0, policy_version 27384 (0.00083) [2022-07-09 01:27:45,412][25689] Fps is (10 sec: 5702.9, 60 sec: 5572.1, 300 sec: 5553.4). Total num frames: 28046336. Throughput: 0: 5813.6. Samples: 28044632. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:45,413][25689] Avg episode reward: [(0, '-63.245')] [2022-07-09 01:27:46,251][26022] Updated weights on worker 0-0, policy_version 27394 (0.00090) [2022-07-09 01:27:48,011][26022] Updated weights on worker 0-0, policy_version 27404 (0.00087) [2022-07-09 01:27:49,829][26022] Updated weights on worker 0-0, policy_version 27414 (0.00093) [2022-07-09 01:27:50,478][25689] Fps is (10 sec: 5773.2, 60 sec: 5570.2, 300 sec: 5556.9). Total num frames: 28075008. Throughput: 0: 5855.2. Samples: 28078604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 01:27:50,478][25689] Avg episode reward: [(0, '-63.157')] [2022-07-09 01:27:51,790][26022] Updated weights on worker 0-0, policy_version 27424 (0.00087) [2022-07-09 01:27:53,303][26022] Updated weights on worker 0-0, policy_version 27434 (0.00085) [2022-07-09 01:27:55,486][25689] Fps is (10 sec: 5489.0, 60 sec: 5526.0, 300 sec: 5547.7). Total num frames: 28101632. Throughput: 0: 5038.6. Samples: 28095538. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:27:55,487][25689] Avg episode reward: [(0, '-63.506')] [2022-07-09 01:27:55,592][26022] Updated weights on worker 0-0, policy_version 27444 (0.00084) [2022-07-09 01:27:57,090][26022] Updated weights on worker 0-0, policy_version 27454 (0.00096) [2022-07-09 01:27:59,291][26022] Updated weights on worker 0-0, policy_version 27464 (0.00088) [2022-07-09 01:28:00,508][25689] Fps is (10 sec: 5615.1, 60 sec: 5582.2, 300 sec: 5562.1). Total num frames: 28131328. Throughput: 0: 5865.5. Samples: 28129132. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:28:00,509][25689] Avg episode reward: [(0, '-63.608')] [2022-07-09 01:28:00,830][26022] Updated weights on worker 0-0, policy_version 27474 (0.00090) [2022-07-09 01:28:03,338][26022] Updated weights on worker 0-0, policy_version 27484 (0.00091) [2022-07-09 01:28:04,956][26022] Updated weights on worker 0-0, policy_version 27494 (0.00087) [2022-07-09 01:28:05,615][25689] Fps is (10 sec: 5358.1, 60 sec: 5543.0, 300 sec: 5546.7). Total num frames: 28155904. Throughput: 0: 5733.3. Samples: 28160462. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:28:05,616][25689] Avg episode reward: [(0, '-63.183')] [2022-07-09 01:28:06,922][26022] Updated weights on worker 0-0, policy_version 27504 (0.00088) [2022-07-09 01:28:08,772][26022] Updated weights on worker 0-0, policy_version 27514 (0.00090) [2022-07-09 01:28:10,623][25689] Fps is (10 sec: 5163.3, 60 sec: 5530.9, 300 sec: 5554.2). Total num frames: 28183552. Throughput: 0: 5725.8. Samples: 28193948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:28:10,623][25689] Avg episode reward: [(0, '-63.088')] [2022-07-09 01:28:10,647][26022] Updated weights on worker 0-0, policy_version 27524 (0.00081) [2022-07-09 01:28:12,354][26022] Updated weights on worker 0-0, policy_version 27534 (0.00086) [2022-07-09 01:28:14,210][26022] Updated weights on worker 0-0, policy_version 27544 (0.00094) [2022-07-09 01:28:15,683][25689] Fps is (10 sec: 5695.8, 60 sec: 5561.0, 300 sec: 5557.1). Total num frames: 28213248. Throughput: 0: 5709.0. Samples: 28210842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:28:15,684][25689] Avg episode reward: [(0, '-63.365')] [2022-07-09 01:28:15,864][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:28:15,874][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000027554_28215296.pth [2022-07-09 01:28:15,879][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000025600_26214400.pth [2022-07-09 01:28:15,890][26022] Updated weights on worker 0-0, policy_version 27554 (0.00102) [2022-07-09 01:28:17,957][26022] Updated weights on worker 0-0, policy_version 27564 (0.00092) [2022-07-09 01:28:19,419][26022] Updated weights on worker 0-0, policy_version 27574 (0.00096) [2022-07-09 01:28:20,693][25689] Fps is (10 sec: 5796.0, 60 sec: 5549.8, 300 sec: 5560.9). Total num frames: 28241920. Throughput: 0: 5719.0. Samples: 28244568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:28:20,693][25689] Avg episode reward: [(0, '-62.875')] [2022-07-09 01:28:21,607][26022] Updated weights on worker 0-0, policy_version 27584 (0.00091) [2022-07-09 01:28:23,272][26022] Updated weights on worker 0-0, policy_version 27594 (0.00101) [2022-07-09 01:28:25,096][26022] Updated weights on worker 0-0, policy_version 27604 (0.00095) [2022-07-09 01:28:25,798][25689] Fps is (10 sec: 5567.9, 60 sec: 5567.1, 300 sec: 5552.1). Total num frames: 28269568. Throughput: 0: 5833.9. Samples: 28278206. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:28:25,799][25689] Avg episode reward: [(0, '-62.878')] [2022-07-09 01:28:27,162][26022] Updated weights on worker 0-0, policy_version 27614 (0.00097) [2022-07-09 01:28:28,925][26022] Updated weights on worker 0-0, policy_version 27624 (0.00093) [2022-07-09 01:28:30,768][26022] Updated weights on worker 0-0, policy_version 27634 (0.00093) [2022-07-09 01:28:30,844][25689] Fps is (10 sec: 5447.0, 60 sec: 5551.1, 300 sec: 5555.9). Total num frames: 28297216. Throughput: 0: 4983.8. Samples: 28294730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:28:30,845][25689] Avg episode reward: [(0, '-62.468')] [2022-07-09 01:28:32,633][26022] Updated weights on worker 0-0, policy_version 27644 (0.00085) [2022-07-09 01:28:34,344][26022] Updated weights on worker 0-0, policy_version 27654 (0.00094) [2022-07-09 01:28:35,894][25689] Fps is (10 sec: 5477.1, 60 sec: 5548.2, 300 sec: 5552.0). Total num frames: 28324864. Throughput: 0: 5811.3. Samples: 28328294. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:28:35,896][25689] Avg episode reward: [(0, '-62.817')] [2022-07-09 01:28:36,254][26022] Updated weights on worker 0-0, policy_version 27664 (0.00084) [2022-07-09 01:28:37,808][26022] Updated weights on worker 0-0, policy_version 27674 (0.00083) [2022-07-09 01:28:39,711][26022] Updated weights on worker 0-0, policy_version 27684 (0.00060) [2022-07-09 01:28:40,967][25689] Fps is (10 sec: 5665.2, 60 sec: 5576.3, 300 sec: 5559.4). Total num frames: 28354560. Throughput: 0: 5787.2. Samples: 28361896. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:28:40,967][25689] Avg episode reward: [(0, '-63.141')] [2022-07-09 01:28:41,694][26022] Updated weights on worker 0-0, policy_version 27694 (0.00089) [2022-07-09 01:28:43,512][26022] Updated weights on worker 0-0, policy_version 27704 (0.00093) [2022-07-09 01:28:45,193][26022] Updated weights on worker 0-0, policy_version 27714 (0.00090) [2022-07-09 01:28:46,017][25689] Fps is (10 sec: 5563.4, 60 sec: 5525.1, 300 sec: 5551.9). Total num frames: 28381184. Throughput: 0: 4973.7. Samples: 28378770. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:28:46,018][25689] Avg episode reward: [(0, '-62.343')] [2022-07-09 01:28:46,949][26022] Updated weights on worker 0-0, policy_version 27724 (0.00089) [2022-07-09 01:28:49,172][26022] Updated weights on worker 0-0, policy_version 27734 (0.00088) [2022-07-09 01:28:50,817][26022] Updated weights on worker 0-0, policy_version 27744 (0.00095) [2022-07-09 01:28:51,078][25689] Fps is (10 sec: 5570.2, 60 sec: 5542.5, 300 sec: 5561.6). Total num frames: 28410880. Throughput: 0: 5825.4. Samples: 28412596. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:28:51,078][25689] Avg episode reward: [(0, '-62.557')] [2022-07-09 01:28:52,913][26022] Updated weights on worker 0-0, policy_version 27754 (0.00097) [2022-07-09 01:28:54,519][26022] Updated weights on worker 0-0, policy_version 27764 (0.00095) [2022-07-09 01:28:56,109][25689] Fps is (10 sec: 5580.8, 60 sec: 5540.4, 300 sec: 5554.4). Total num frames: 28437504. Throughput: 0: 5794.5. Samples: 28445430. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:28:56,110][25689] Avg episode reward: [(0, '-62.229')] [2022-07-09 01:28:56,329][26022] Updated weights on worker 0-0, policy_version 27774 (0.00094) [2022-07-09 01:28:58,376][26022] Updated weights on worker 0-0, policy_version 27784 (0.00088) [2022-07-09 01:29:00,052][26022] Updated weights on worker 0-0, policy_version 27794 (0.00088) [2022-07-09 01:29:01,147][25689] Fps is (10 sec: 5389.9, 60 sec: 5505.1, 300 sec: 5561.8). Total num frames: 28465152. Throughput: 0: 4958.4. Samples: 28461956. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:29:01,147][25689] Avg episode reward: [(0, '-62.264')] [2022-07-09 01:29:02,516][26022] Updated weights on worker 0-0, policy_version 27804 (0.00080) [2022-07-09 01:29:04,027][26022] Updated weights on worker 0-0, policy_version 27814 (0.01303) [2022-07-09 01:29:06,129][26022] Updated weights on worker 0-0, policy_version 27824 (0.00084) [2022-07-09 01:29:06,263][25689] Fps is (10 sec: 5344.8, 60 sec: 5538.1, 300 sec: 5557.5). Total num frames: 28491776. Throughput: 0: 5637.9. Samples: 28492914. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:29:06,264][25689] Avg episode reward: [(0, '-62.108')] [2022-07-09 01:29:07,860][26022] Updated weights on worker 0-0, policy_version 27834 (0.00097) [2022-07-09 01:29:09,682][26022] Updated weights on worker 0-0, policy_version 27844 (0.00088) [2022-07-09 01:29:11,266][25689] Fps is (10 sec: 5565.5, 60 sec: 5572.2, 300 sec: 5555.2). Total num frames: 28521472. Throughput: 0: 5650.6. Samples: 28526674. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:11,267][25689] Avg episode reward: [(0, '-61.007')] [2022-07-09 01:29:11,440][26022] Updated weights on worker 0-0, policy_version 27854 (0.00096) [2022-07-09 01:29:13,338][26022] Updated weights on worker 0-0, policy_version 27864 (0.00088) [2022-07-09 01:29:15,091][26022] Updated weights on worker 0-0, policy_version 27874 (0.00093) [2022-07-09 01:29:16,336][25689] Fps is (10 sec: 5591.2, 60 sec: 5520.7, 300 sec: 5554.5). Total num frames: 28548096. Throughput: 0: 4851.4. Samples: 28543560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:16,337][25689] Avg episode reward: [(0, '-61.402')] [2022-07-09 01:29:17,083][26022] Updated weights on worker 0-0, policy_version 27884 (0.00091) [2022-07-09 01:29:18,562][26022] Updated weights on worker 0-0, policy_version 27894 (0.00081) [2022-07-09 01:29:20,729][26022] Updated weights on worker 0-0, policy_version 27904 (0.00084) [2022-07-09 01:29:21,383][25689] Fps is (10 sec: 5668.3, 60 sec: 5551.1, 300 sec: 5561.7). Total num frames: 28578816. Throughput: 0: 5715.9. Samples: 28577622. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:21,383][25689] Avg episode reward: [(0, '-61.095')] [2022-07-09 01:29:22,296][26022] Updated weights on worker 0-0, policy_version 27914 (0.00086) [2022-07-09 01:29:24,147][26022] Updated weights on worker 0-0, policy_version 27924 (0.00088) [2022-07-09 01:29:26,123][26022] Updated weights on worker 0-0, policy_version 27934 (0.00098) [2022-07-09 01:29:26,511][25689] Fps is (10 sec: 5636.1, 60 sec: 5532.2, 300 sec: 5557.7). Total num frames: 28605440. Throughput: 0: 5866.3. Samples: 28611692. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:26,511][25689] Avg episode reward: [(0, '-61.538')] [2022-07-09 01:29:27,806][26022] Updated weights on worker 0-0, policy_version 27944 (0.00092) [2022-07-09 01:29:29,922][26022] Updated weights on worker 0-0, policy_version 27954 (0.00084) [2022-07-09 01:29:31,368][26022] Updated weights on worker 0-0, policy_version 27964 (0.00086) [2022-07-09 01:29:31,516][25689] Fps is (10 sec: 5557.9, 60 sec: 5569.7, 300 sec: 5558.0). Total num frames: 28635136. Throughput: 0: 5024.5. Samples: 28628420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:31,516][25689] Avg episode reward: [(0, '-62.461')] [2022-07-09 01:29:33,492][26022] Updated weights on worker 0-0, policy_version 27974 (0.00082) [2022-07-09 01:29:35,043][26022] Updated weights on worker 0-0, policy_version 27984 (0.00086) [2022-07-09 01:29:36,520][25689] Fps is (10 sec: 5729.3, 60 sec: 5573.9, 300 sec: 5568.7). Total num frames: 28662784. Throughput: 0: 5869.1. Samples: 28662018. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:36,521][25689] Avg episode reward: [(0, '-62.065')] [2022-07-09 01:29:37,142][26022] Updated weights on worker 0-0, policy_version 27994 (0.00083) [2022-07-09 01:29:38,630][26022] Updated weights on worker 0-0, policy_version 28004 (0.00090) [2022-07-09 01:29:40,795][26022] Updated weights on worker 0-0, policy_version 28014 (0.00084) [2022-07-09 01:29:41,533][25689] Fps is (10 sec: 5622.5, 60 sec: 5562.4, 300 sec: 5562.4). Total num frames: 28691456. Throughput: 0: 5869.6. Samples: 28695896. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 01:29:41,534][25689] Avg episode reward: [(0, '-63.288')] [2022-07-09 01:29:42,484][26022] Updated weights on worker 0-0, policy_version 28024 (0.00088) [2022-07-09 01:29:44,293][26022] Updated weights on worker 0-0, policy_version 28034 (0.00083) [2022-07-09 01:29:46,282][26022] Updated weights on worker 0-0, policy_version 28044 (0.00081) [2022-07-09 01:29:46,637][25689] Fps is (10 sec: 5465.7, 60 sec: 5557.6, 300 sec: 5557.1). Total num frames: 28718080. Throughput: 0: 5022.9. Samples: 28712782. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:29:46,639][25689] Avg episode reward: [(0, '-64.031')] [2022-07-09 01:29:48,003][26022] Updated weights on worker 0-0, policy_version 28054 (0.00097) [2022-07-09 01:29:49,719][26022] Updated weights on worker 0-0, policy_version 28064 (0.00091) [2022-07-09 01:29:51,539][26022] Updated weights on worker 0-0, policy_version 28074 (0.00088) [2022-07-09 01:29:51,737][25689] Fps is (10 sec: 5519.3, 60 sec: 5553.9, 300 sec: 5562.9). Total num frames: 28747776. Throughput: 0: 5826.3. Samples: 28746236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:29:51,738][25689] Avg episode reward: [(0, '-63.721')] [2022-07-09 01:29:53,311][26022] Updated weights on worker 0-0, policy_version 28084 (0.00083) [2022-07-09 01:29:55,175][26022] Updated weights on worker 0-0, policy_version 28094 (0.00081) [2022-07-09 01:29:56,835][25689] Fps is (10 sec: 5723.3, 60 sec: 5581.5, 300 sec: 5561.3). Total num frames: 28776448. Throughput: 0: 5814.5. Samples: 28780142. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:29:56,836][25689] Avg episode reward: [(0, '-64.644')] [2022-07-09 01:29:57,024][26022] Updated weights on worker 0-0, policy_version 28104 (0.00087) [2022-07-09 01:29:58,987][26022] Updated weights on worker 0-0, policy_version 28114 (0.00098) [2022-07-09 01:30:00,587][26022] Updated weights on worker 0-0, policy_version 28124 (0.00080) [2022-07-09 01:30:01,856][25689] Fps is (10 sec: 5565.8, 60 sec: 5583.1, 300 sec: 5572.4). Total num frames: 28804096. Throughput: 0: 5784.2. Samples: 28813450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:30:01,857][25689] Avg episode reward: [(0, '-64.329')] [2022-07-09 01:30:02,967][26022] Updated weights on worker 0-0, policy_version 28134 (0.00090) [2022-07-09 01:30:04,813][26022] Updated weights on worker 0-0, policy_version 28144 (0.00093) [2022-07-09 01:30:06,694][26022] Updated weights on worker 0-0, policy_version 28154 (0.00086) [2022-07-09 01:30:06,888][25689] Fps is (10 sec: 5296.8, 60 sec: 5574.0, 300 sec: 5554.6). Total num frames: 28829696. Throughput: 0: 5694.0. Samples: 28828094. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:30:06,889][25689] Avg episode reward: [(0, '-64.472')] [2022-07-09 01:30:08,411][26022] Updated weights on worker 0-0, policy_version 28164 (0.00086) [2022-07-09 01:30:10,400][26022] Updated weights on worker 0-0, policy_version 28174 (0.00458) [2022-07-09 01:30:11,959][25689] Fps is (10 sec: 5372.0, 60 sec: 5550.8, 300 sec: 5563.9). Total num frames: 28858368. Throughput: 0: 5705.4. Samples: 28861608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:30:11,963][25689] Avg episode reward: [(0, '-63.418')] [2022-07-09 01:30:12,304][26022] Updated weights on worker 0-0, policy_version 28184 (0.00116) [2022-07-09 01:30:13,966][26022] Updated weights on worker 0-0, policy_version 28194 (0.00083) [2022-07-09 01:30:15,877][26022] Updated weights on worker 0-0, policy_version 28204 (0.00807) [2022-07-09 01:30:15,969][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:30:15,989][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000028205_28881920.pth [2022-07-09 01:30:16,002][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000026248_26877952.pth [2022-07-09 01:30:17,023][25689] Fps is (10 sec: 5759.1, 60 sec: 5602.0, 300 sec: 5563.1). Total num frames: 28888064. Throughput: 0: 5700.0. Samples: 28895212. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:30:17,031][25689] Avg episode reward: [(0, '-63.192')] [2022-07-09 01:30:17,819][26022] Updated weights on worker 0-0, policy_version 28214 (0.00085) [2022-07-09 01:30:19,537][26022] Updated weights on worker 0-0, policy_version 28224 (0.00089) [2022-07-09 01:30:21,425][26022] Updated weights on worker 0-0, policy_version 28234 (0.00091) [2022-07-09 01:30:22,042][25689] Fps is (10 sec: 5484.0, 60 sec: 5520.1, 300 sec: 5550.0). Total num frames: 28913664. Throughput: 0: 4887.9. Samples: 28912116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 17.0) [2022-07-09 01:30:22,044][25689] Avg episode reward: [(0, '-62.699')] [2022-07-09 01:30:23,004][26022] Updated weights on worker 0-0, policy_version 28244 (0.00082) [2022-07-09 01:30:25,127][26022] Updated weights on worker 0-0, policy_version 28254 (0.00087) [2022-07-09 01:30:26,631][26022] Updated weights on worker 0-0, policy_version 28264 (0.00084) [2022-07-09 01:30:27,103][25689] Fps is (10 sec: 5383.9, 60 sec: 5560.0, 300 sec: 5559.3). Total num frames: 28942336. Throughput: 0: 5806.9. Samples: 28945484. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:27,105][25689] Avg episode reward: [(0, '-63.338')] [2022-07-09 01:30:28,536][26022] Updated weights on worker 0-0, policy_version 28274 (0.00089) [2022-07-09 01:30:30,807][26022] Updated weights on worker 0-0, policy_version 28284 (0.00088) [2022-07-09 01:30:31,954][26022] Updated weights on worker 0-0, policy_version 28294 (0.00081) [2022-07-09 01:30:32,131][25689] Fps is (10 sec: 5887.0, 60 sec: 5574.9, 300 sec: 5565.9). Total num frames: 28973056. Throughput: 0: 5822.7. Samples: 28979062. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:32,132][25689] Avg episode reward: [(0, '-63.259')] [2022-07-09 01:30:34,381][26022] Updated weights on worker 0-0, policy_version 28304 (0.00089) [2022-07-09 01:30:35,980][26022] Updated weights on worker 0-0, policy_version 28314 (0.00091) [2022-07-09 01:30:37,135][25689] Fps is (10 sec: 5614.3, 60 sec: 5541.1, 300 sec: 5559.3). Total num frames: 28998656. Throughput: 0: 5003.2. Samples: 28995836. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:37,139][25689] Avg episode reward: [(0, '-63.501')] [2022-07-09 01:30:37,851][26022] Updated weights on worker 0-0, policy_version 28324 (0.00093) [2022-07-09 01:30:39,737][26022] Updated weights on worker 0-0, policy_version 28334 (0.00083) [2022-07-09 01:30:41,559][26022] Updated weights on worker 0-0, policy_version 28344 (0.00090) [2022-07-09 01:30:42,236][25689] Fps is (10 sec: 5370.7, 60 sec: 5533.0, 300 sec: 5556.0). Total num frames: 29027328. Throughput: 0: 5833.7. Samples: 29029922. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:42,236][25689] Avg episode reward: [(0, '-64.011')] [2022-07-09 01:30:43,171][26022] Updated weights on worker 0-0, policy_version 28354 (0.00090) [2022-07-09 01:30:45,342][26022] Updated weights on worker 0-0, policy_version 28364 (0.00084) [2022-07-09 01:30:46,636][26022] Updated weights on worker 0-0, policy_version 28374 (0.00087) [2022-07-09 01:30:47,339][25689] Fps is (10 sec: 5720.2, 60 sec: 5583.7, 300 sec: 5557.9). Total num frames: 29057024. Throughput: 0: 5833.0. Samples: 29063516. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:47,339][25689] Avg episode reward: [(0, '-64.201')] [2022-07-09 01:30:48,864][26022] Updated weights on worker 0-0, policy_version 28384 (0.00093) [2022-07-09 01:30:50,555][26022] Updated weights on worker 0-0, policy_version 28394 (0.00091) [2022-07-09 01:30:52,350][25689] Fps is (10 sec: 5669.5, 60 sec: 5558.1, 300 sec: 5561.8). Total num frames: 29084672. Throughput: 0: 5011.9. Samples: 29080402. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:52,351][25689] Avg episode reward: [(0, '-64.097')] [2022-07-09 01:30:52,381][26022] Updated weights on worker 0-0, policy_version 28404 (0.00091) [2022-07-09 01:30:54,347][26022] Updated weights on worker 0-0, policy_version 28414 (0.00080) [2022-07-09 01:30:56,079][26022] Updated weights on worker 0-0, policy_version 28424 (0.00095) [2022-07-09 01:30:57,393][25689] Fps is (10 sec: 5499.5, 60 sec: 5546.2, 300 sec: 5554.7). Total num frames: 29112320. Throughput: 0: 5837.3. Samples: 29114092. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 01:30:57,394][25689] Avg episode reward: [(0, '-63.652')] [2022-07-09 01:30:57,899][26022] Updated weights on worker 0-0, policy_version 28434 (0.00087) [2022-07-09 01:30:59,815][26022] Updated weights on worker 0-0, policy_version 28444 (0.00277) [2022-07-09 01:31:01,565][26022] Updated weights on worker 0-0, policy_version 28454 (0.00095) [2022-07-09 01:31:02,427][25689] Fps is (10 sec: 5385.9, 60 sec: 5528.2, 300 sec: 5559.3). Total num frames: 29138944. Throughput: 0: 5807.2. Samples: 29147176. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:02,428][25689] Avg episode reward: [(0, '-63.020')] [2022-07-09 01:31:03,752][26022] Updated weights on worker 0-0, policy_version 28464 (0.00078) [2022-07-09 01:31:05,774][26022] Updated weights on worker 0-0, policy_version 28474 (0.00085) [2022-07-09 01:31:07,187][26022] Updated weights on worker 0-0, policy_version 28484 (0.00086) [2022-07-09 01:31:07,507][25689] Fps is (10 sec: 5569.0, 60 sec: 5591.4, 300 sec: 5565.3). Total num frames: 29168640. Throughput: 0: 4897.4. Samples: 29162286. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:07,507][25689] Avg episode reward: [(0, '-62.771')] [2022-07-09 01:31:09,432][26022] Updated weights on worker 0-0, policy_version 28494 (0.00194) [2022-07-09 01:31:11,076][26022] Updated weights on worker 0-0, policy_version 28504 (0.00086) [2022-07-09 01:31:12,535][25689] Fps is (10 sec: 5672.9, 60 sec: 5578.4, 300 sec: 5565.1). Total num frames: 29196288. Throughput: 0: 5743.7. Samples: 29196338. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:12,539][25689] Avg episode reward: [(0, '-62.823')] [2022-07-09 01:31:12,841][26022] Updated weights on worker 0-0, policy_version 28514 (0.00085) [2022-07-09 01:31:14,712][26022] Updated weights on worker 0-0, policy_version 28524 (0.00092) [2022-07-09 01:31:16,259][26022] Updated weights on worker 0-0, policy_version 28534 (0.00090) [2022-07-09 01:31:17,546][25689] Fps is (10 sec: 5508.2, 60 sec: 5549.5, 300 sec: 5565.0). Total num frames: 29223936. Throughput: 0: 5768.7. Samples: 29230344. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:17,546][25689] Avg episode reward: [(0, '-62.431')] [2022-07-09 01:31:18,363][26022] Updated weights on worker 0-0, policy_version 28544 (0.00090) [2022-07-09 01:31:19,968][26022] Updated weights on worker 0-0, policy_version 28554 (0.00087) [2022-07-09 01:31:21,854][26022] Updated weights on worker 0-0, policy_version 28564 (0.00100) [2022-07-09 01:31:22,574][25689] Fps is (10 sec: 5508.2, 60 sec: 5582.5, 300 sec: 5559.9). Total num frames: 29251584. Throughput: 0: 4966.5. Samples: 29247238. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:22,575][25689] Avg episode reward: [(0, '-62.744')] [2022-07-09 01:31:23,936][26022] Updated weights on worker 0-0, policy_version 28574 (0.00082) [2022-07-09 01:31:25,667][26022] Updated weights on worker 0-0, policy_version 28584 (0.00085) [2022-07-09 01:31:27,475][26022] Updated weights on worker 0-0, policy_version 28594 (0.00094) [2022-07-09 01:31:27,634][25689] Fps is (10 sec: 5684.1, 60 sec: 5599.5, 300 sec: 5562.4). Total num frames: 29281280. Throughput: 0: 5879.2. Samples: 29280620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:27,635][25689] Avg episode reward: [(0, '-63.015')] [2022-07-09 01:31:29,345][26022] Updated weights on worker 0-0, policy_version 28604 (0.00082) [2022-07-09 01:31:31,019][26022] Updated weights on worker 0-0, policy_version 28614 (0.00087) [2022-07-09 01:31:32,639][25689] Fps is (10 sec: 5596.0, 60 sec: 5533.9, 300 sec: 5560.5). Total num frames: 29307904. Throughput: 0: 5859.4. Samples: 29314130. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:32,639][25689] Avg episode reward: [(0, '-63.264')] [2022-07-09 01:31:33,290][26022] Updated weights on worker 0-0, policy_version 28624 (0.00615) [2022-07-09 01:31:34,770][26022] Updated weights on worker 0-0, policy_version 28634 (0.00085) [2022-07-09 01:31:36,941][26022] Updated weights on worker 0-0, policy_version 28644 (0.00086) [2022-07-09 01:31:37,643][25689] Fps is (10 sec: 5627.2, 60 sec: 5601.7, 300 sec: 5564.4). Total num frames: 29337600. Throughput: 0: 4990.8. Samples: 29330644. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:31:37,643][25689] Avg episode reward: [(0, '-63.588')] [2022-07-09 01:31:38,589][26022] Updated weights on worker 0-0, policy_version 28654 (0.00091) [2022-07-09 01:31:40,436][26022] Updated weights on worker 0-0, policy_version 28664 (0.00085) [2022-07-09 01:31:42,237][26022] Updated weights on worker 0-0, policy_version 28674 (0.00079) [2022-07-09 01:31:42,666][25689] Fps is (10 sec: 5514.7, 60 sec: 5558.1, 300 sec: 5554.9). Total num frames: 29363200. Throughput: 0: 5819.4. Samples: 29364158. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:31:42,666][25689] Avg episode reward: [(0, '-63.089')] [2022-07-09 01:31:43,922][26022] Updated weights on worker 0-0, policy_version 28684 (0.00089) [2022-07-09 01:31:46,016][26022] Updated weights on worker 0-0, policy_version 28694 (0.00088) [2022-07-09 01:31:47,732][25689] Fps is (10 sec: 5480.6, 60 sec: 5561.4, 300 sec: 5557.9). Total num frames: 29392896. Throughput: 0: 5818.8. Samples: 29397566. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:31:47,733][25689] Avg episode reward: [(0, '-62.405')] [2022-07-09 01:31:47,735][26022] Updated weights on worker 0-0, policy_version 28704 (0.00096) [2022-07-09 01:31:49,723][26022] Updated weights on worker 0-0, policy_version 28714 (0.00088) [2022-07-09 01:31:51,330][26022] Updated weights on worker 0-0, policy_version 28724 (0.00972) [2022-07-09 01:31:52,760][25689] Fps is (10 sec: 5478.1, 60 sec: 5526.0, 300 sec: 5545.1). Total num frames: 29418496. Throughput: 0: 4977.5. Samples: 29414282. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:31:52,761][25689] Avg episode reward: [(0, '-61.526')] [2022-07-09 01:31:53,191][26022] Updated weights on worker 0-0, policy_version 28734 (0.00090) [2022-07-09 01:31:55,026][26022] Updated weights on worker 0-0, policy_version 28744 (0.00087) [2022-07-09 01:31:56,791][26022] Updated weights on worker 0-0, policy_version 28754 (0.00089) [2022-07-09 01:31:57,795][25689] Fps is (10 sec: 5596.9, 60 sec: 5577.6, 300 sec: 5559.7). Total num frames: 29449216. Throughput: 0: 5829.0. Samples: 29448112. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:31:57,796][25689] Avg episode reward: [(0, '-60.948')] [2022-07-09 01:31:58,819][26022] Updated weights on worker 0-0, policy_version 28764 (0.00087) [2022-07-09 01:32:00,475][26022] Updated weights on worker 0-0, policy_version 28774 (0.00092) [2022-07-09 01:32:02,716][26022] Updated weights on worker 0-0, policy_version 28784 (0.00098) [2022-07-09 01:32:02,800][25689] Fps is (10 sec: 5711.3, 60 sec: 5580.2, 300 sec: 5560.5). Total num frames: 29475840. Throughput: 0: 5769.3. Samples: 29480320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:32:02,802][25689] Avg episode reward: [(0, '-61.024')] [2022-07-09 01:32:04,680][26022] Updated weights on worker 0-0, policy_version 28794 (0.00086) [2022-07-09 01:32:06,384][26022] Updated weights on worker 0-0, policy_version 28804 (0.00094) [2022-07-09 01:32:07,897][25689] Fps is (10 sec: 5169.4, 60 sec: 5510.8, 300 sec: 5549.5). Total num frames: 29501440. Throughput: 0: 4897.6. Samples: 29496328. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:32:07,898][25689] Avg episode reward: [(0, '-61.377')] [2022-07-09 01:32:08,322][26022] Updated weights on worker 0-0, policy_version 28814 (0.00055) [2022-07-09 01:32:09,864][26022] Updated weights on worker 0-0, policy_version 28824 (0.00087) [2022-07-09 01:32:12,044][26022] Updated weights on worker 0-0, policy_version 28834 (0.00090) [2022-07-09 01:32:12,901][25689] Fps is (10 sec: 5575.5, 60 sec: 5563.9, 300 sec: 5560.1). Total num frames: 29532160. Throughput: 0: 5751.3. Samples: 29530124. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-09 01:32:12,902][25689] Avg episode reward: [(0, '-61.601')] [2022-07-09 01:32:13,686][26022] Updated weights on worker 0-0, policy_version 28844 (0.00088) [2022-07-09 01:32:15,587][26022] Updated weights on worker 0-0, policy_version 28854 (0.00083) [2022-07-09 01:32:16,042][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:32:16,054][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000028857_29549568.pth [2022-07-09 01:32:16,055][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000026900_27545600.pth [2022-07-09 01:32:17,341][26022] Updated weights on worker 0-0, policy_version 28864 (0.00086) [2022-07-09 01:32:17,920][25689] Fps is (10 sec: 5824.0, 60 sec: 5563.2, 300 sec: 5554.2). Total num frames: 29559808. Throughput: 0: 5767.5. Samples: 29564180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:17,920][25689] Avg episode reward: [(0, '-61.378')] [2022-07-09 01:32:19,189][26022] Updated weights on worker 0-0, policy_version 28874 (0.00091) [2022-07-09 01:32:21,044][26022] Updated weights on worker 0-0, policy_version 28884 (0.00108) [2022-07-09 01:32:22,674][26022] Updated weights on worker 0-0, policy_version 28894 (0.00093) [2022-07-09 01:32:22,939][25689] Fps is (10 sec: 5509.0, 60 sec: 5564.1, 300 sec: 5559.4). Total num frames: 29587456. Throughput: 0: 5003.9. Samples: 29581092. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:22,939][25689] Avg episode reward: [(0, '-61.411')] [2022-07-09 01:32:24,472][26022] Updated weights on worker 0-0, policy_version 28904 (0.00091) [2022-07-09 01:32:26,647][26022] Updated weights on worker 0-0, policy_version 28914 (0.00089) [2022-07-09 01:32:27,993][25689] Fps is (10 sec: 5590.7, 60 sec: 5547.6, 300 sec: 5559.4). Total num frames: 29616128. Throughput: 0: 5887.4. Samples: 29614640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:27,994][25689] Avg episode reward: [(0, '-61.973')] [2022-07-09 01:32:28,295][26022] Updated weights on worker 0-0, policy_version 28924 (0.00093) [2022-07-09 01:32:30,287][26022] Updated weights on worker 0-0, policy_version 28934 (0.00090) [2022-07-09 01:32:31,895][26022] Updated weights on worker 0-0, policy_version 28944 (0.00089) [2022-07-09 01:32:32,995][25689] Fps is (10 sec: 5702.3, 60 sec: 5581.8, 300 sec: 5563.2). Total num frames: 29644800. Throughput: 0: 5880.6. Samples: 29648286. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:32,995][25689] Avg episode reward: [(0, '-61.259')] [2022-07-09 01:32:33,647][26022] Updated weights on worker 0-0, policy_version 28954 (0.00087) [2022-07-09 01:32:35,832][26022] Updated weights on worker 0-0, policy_version 28964 (0.00091) [2022-07-09 01:32:37,439][26022] Updated weights on worker 0-0, policy_version 28974 (0.00079) [2022-07-09 01:32:38,018][25689] Fps is (10 sec: 5516.3, 60 sec: 5529.2, 300 sec: 5559.5). Total num frames: 29671424. Throughput: 0: 5016.2. Samples: 29664996. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:38,018][25689] Avg episode reward: [(0, '-61.013')] [2022-07-09 01:32:39,362][26022] Updated weights on worker 0-0, policy_version 28984 (0.00090) [2022-07-09 01:32:41,115][26022] Updated weights on worker 0-0, policy_version 28994 (0.00086) [2022-07-09 01:32:42,883][26022] Updated weights on worker 0-0, policy_version 29004 (0.00091) [2022-07-09 01:32:43,021][25689] Fps is (10 sec: 5617.5, 60 sec: 5598.9, 300 sec: 5560.3). Total num frames: 29701120. Throughput: 0: 5845.8. Samples: 29698488. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:43,021][25689] Avg episode reward: [(0, '-61.862')] [2022-07-09 01:32:44,995][26022] Updated weights on worker 0-0, policy_version 29014 (0.00087) [2022-07-09 01:32:46,483][26022] Updated weights on worker 0-0, policy_version 29024 (0.00098) [2022-07-09 01:32:48,075][25689] Fps is (10 sec: 5701.8, 60 sec: 5566.1, 300 sec: 5557.0). Total num frames: 29728768. Throughput: 0: 5850.6. Samples: 29732128. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:48,075][25689] Avg episode reward: [(0, '-62.690')] [2022-07-09 01:32:48,434][26022] Updated weights on worker 0-0, policy_version 29034 (0.00088) [2022-07-09 01:32:50,146][26022] Updated weights on worker 0-0, policy_version 29044 (0.00088) [2022-07-09 01:32:52,211][26022] Updated weights on worker 0-0, policy_version 29054 (0.00090) [2022-07-09 01:32:53,096][25689] Fps is (10 sec: 5488.1, 60 sec: 5600.6, 300 sec: 5560.2). Total num frames: 29756416. Throughput: 0: 5855.8. Samples: 29765996. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 01:32:53,097][25689] Avg episode reward: [(0, '-63.116')] [2022-07-09 01:32:53,747][26022] Updated weights on worker 0-0, policy_version 29064 (0.00085) [2022-07-09 01:32:55,786][26022] Updated weights on worker 0-0, policy_version 29074 (0.00087) [2022-07-09 01:32:57,507][26022] Updated weights on worker 0-0, policy_version 29084 (0.00082) [2022-07-09 01:32:58,120][25689] Fps is (10 sec: 5606.8, 60 sec: 5567.7, 300 sec: 5556.8). Total num frames: 29785088. Throughput: 0: 5868.0. Samples: 29782954. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:32:58,120][25689] Avg episode reward: [(0, '-63.453')] [2022-07-09 01:32:59,328][26022] Updated weights on worker 0-0, policy_version 29094 (0.00084) [2022-07-09 01:33:01,141][26022] Updated weights on worker 0-0, policy_version 29104 (0.00087) [2022-07-09 01:33:03,128][25689] Fps is (10 sec: 5409.9, 60 sec: 5550.4, 300 sec: 5562.1). Total num frames: 29810688. Throughput: 0: 5770.2. Samples: 29814514. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:03,129][25689] Avg episode reward: [(0, '-63.577')] [2022-07-09 01:33:03,383][26022] Updated weights on worker 0-0, policy_version 29114 (0.00101) [2022-07-09 01:33:05,198][26022] Updated weights on worker 0-0, policy_version 29124 (0.00085) [2022-07-09 01:33:07,107][26022] Updated weights on worker 0-0, policy_version 29134 (0.00089) [2022-07-09 01:33:08,219][25689] Fps is (10 sec: 5475.4, 60 sec: 5619.0, 300 sec: 5567.4). Total num frames: 29840384. Throughput: 0: 5750.9. Samples: 29847974. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:08,220][25689] Avg episode reward: [(0, '-64.104')] [2022-07-09 01:33:08,910][26022] Updated weights on worker 0-0, policy_version 29144 (0.00082) [2022-07-09 01:33:10,846][26022] Updated weights on worker 0-0, policy_version 29154 (0.00105) [2022-07-09 01:33:12,472][26022] Updated weights on worker 0-0, policy_version 29164 (0.00091) [2022-07-09 01:33:13,243][25689] Fps is (10 sec: 5568.4, 60 sec: 5549.2, 300 sec: 5557.7). Total num frames: 29867008. Throughput: 0: 4906.8. Samples: 29864850. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:13,244][25689] Avg episode reward: [(0, '-62.905')] [2022-07-09 01:33:14,538][26022] Updated weights on worker 0-0, policy_version 29174 (0.00084) [2022-07-09 01:33:15,917][26022] Updated weights on worker 0-0, policy_version 29184 (0.00089) [2022-07-09 01:33:18,185][26022] Updated weights on worker 0-0, policy_version 29194 (0.00086) [2022-07-09 01:33:18,258][25689] Fps is (10 sec: 5406.0, 60 sec: 5549.5, 300 sec: 5554.2). Total num frames: 29894656. Throughput: 0: 5749.4. Samples: 29898736. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:18,258][25689] Avg episode reward: [(0, '-62.977')] [2022-07-09 01:33:19,531][26022] Updated weights on worker 0-0, policy_version 29204 (0.00089) [2022-07-09 01:33:21,896][26022] Updated weights on worker 0-0, policy_version 29214 (0.00087) [2022-07-09 01:33:23,281][26022] Updated weights on worker 0-0, policy_version 29224 (0.00082) [2022-07-09 01:33:23,281][25689] Fps is (10 sec: 5712.3, 60 sec: 5583.0, 300 sec: 5562.6). Total num frames: 29924352. Throughput: 0: 5856.9. Samples: 29932548. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:23,283][25689] Avg episode reward: [(0, '-62.827')] [2022-07-09 01:33:25,385][26022] Updated weights on worker 0-0, policy_version 29234 (0.00093) [2022-07-09 01:33:27,149][26022] Updated weights on worker 0-0, policy_version 29244 (0.00089) [2022-07-09 01:33:28,403][25689] Fps is (10 sec: 5652.5, 60 sec: 5559.9, 300 sec: 5561.2). Total num frames: 29952000. Throughput: 0: 5008.3. Samples: 29949062. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:28,403][25689] Avg episode reward: [(0, '-63.572')] [2022-07-09 01:33:29,136][26022] Updated weights on worker 0-0, policy_version 29254 (0.00088) [2022-07-09 01:33:30,853][26022] Updated weights on worker 0-0, policy_version 29264 (0.00087) [2022-07-09 01:33:33,035][26022] Updated weights on worker 0-0, policy_version 29274 (0.00084) [2022-07-09 01:33:33,404][25689] Fps is (10 sec: 5361.6, 60 sec: 5526.1, 300 sec: 5558.7). Total num frames: 29978624. Throughput: 0: 5821.0. Samples: 29982208. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 01:33:33,404][25689] Avg episode reward: [(0, '-62.785')] [2022-07-09 01:33:34,607][26022] Updated weights on worker 0-0, policy_version 29284 (0.00083) [2022-07-09 01:33:36,779][26022] Updated weights on worker 0-0, policy_version 29294 (0.00086) [2022-07-09 01:33:38,215][26022] Updated weights on worker 0-0, policy_version 29304 (0.00083) [2022-07-09 01:33:38,445][25689] Fps is (10 sec: 5710.0, 60 sec: 5592.1, 300 sec: 5562.7). Total num frames: 30009344. Throughput: 0: 5764.3. Samples: 30015104. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:33:38,446][25689] Avg episode reward: [(0, '-62.947')] [2022-07-09 01:33:40,239][26022] Updated weights on worker 0-0, policy_version 29314 (0.00079) [2022-07-09 01:33:41,896][26022] Updated weights on worker 0-0, policy_version 29324 (0.00091) [2022-07-09 01:33:43,483][25689] Fps is (10 sec: 5587.5, 60 sec: 5521.2, 300 sec: 5559.5). Total num frames: 30034944. Throughput: 0: 4930.7. Samples: 30032158. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:33:43,484][25689] Avg episode reward: [(0, '-62.907')] [2022-07-09 01:33:43,807][26022] Updated weights on worker 0-0, policy_version 29334 (0.00084) [2022-07-09 01:33:45,766][26022] Updated weights on worker 0-0, policy_version 29344 (0.00088) [2022-07-09 01:33:47,321][26022] Updated weights on worker 0-0, policy_version 29354 (0.00088) [2022-07-09 01:33:48,532][25689] Fps is (10 sec: 5279.1, 60 sec: 5521.6, 300 sec: 5552.8). Total num frames: 30062592. Throughput: 0: 5810.8. Samples: 30066030. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:33:48,533][25689] Avg episode reward: [(0, '-62.618')] [2022-07-09 01:33:49,334][26022] Updated weights on worker 0-0, policy_version 29364 (0.00082) [2022-07-09 01:33:51,316][26022] Updated weights on worker 0-0, policy_version 29374 (0.00096) [2022-07-09 01:33:52,946][26022] Updated weights on worker 0-0, policy_version 29384 (0.00084) [2022-07-09 01:33:53,571][25689] Fps is (10 sec: 5684.8, 60 sec: 5554.0, 300 sec: 5563.0). Total num frames: 30092288. Throughput: 0: 5829.7. Samples: 30099776. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:33:53,571][25689] Avg episode reward: [(0, '-62.417')] [2022-07-09 01:33:54,894][26022] Updated weights on worker 0-0, policy_version 29394 (0.00089) [2022-07-09 01:33:56,605][26022] Updated weights on worker 0-0, policy_version 29404 (0.00088) [2022-07-09 01:33:58,334][26022] Updated weights on worker 0-0, policy_version 29414 (0.00083) [2022-07-09 01:33:58,583][25689] Fps is (10 sec: 5807.1, 60 sec: 5554.9, 300 sec: 5566.9). Total num frames: 30120960. Throughput: 0: 5047.6. Samples: 30116752. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:33:58,584][25689] Avg episode reward: [(0, '-62.089')] [2022-07-09 01:34:00,317][26022] Updated weights on worker 0-0, policy_version 29424 (0.01198) [2022-07-09 01:34:02,215][26022] Updated weights on worker 0-0, policy_version 29434 (0.00090) [2022-07-09 01:34:03,607][25689] Fps is (10 sec: 5203.7, 60 sec: 5519.7, 300 sec: 5558.3). Total num frames: 30144512. Throughput: 0: 5778.0. Samples: 30148430. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:34:03,607][25689] Avg episode reward: [(0, '-61.992')] [2022-07-09 01:34:04,443][26022] Updated weights on worker 0-0, policy_version 29444 (0.00084) [2022-07-09 01:34:05,923][26022] Updated weights on worker 0-0, policy_version 29454 (0.00093) [2022-07-09 01:34:08,051][26022] Updated weights on worker 0-0, policy_version 29464 (0.00091) [2022-07-09 01:34:08,649][25689] Fps is (10 sec: 5493.7, 60 sec: 5558.0, 300 sec: 5564.5). Total num frames: 30176256. Throughput: 0: 5758.7. Samples: 30181876. Policy #0 lag: (min: 0.0, avg: 7.0, max: 18.0) [2022-07-09 01:34:08,651][25689] Avg episode reward: [(0, '-61.789')] [2022-07-09 01:34:09,763][26022] Updated weights on worker 0-0, policy_version 29474 (0.00086) [2022-07-09 01:34:11,595][26022] Updated weights on worker 0-0, policy_version 29484 (0.00083) [2022-07-09 01:34:13,589][26022] Updated weights on worker 0-0, policy_version 29494 (0.00079) [2022-07-09 01:34:13,707][25689] Fps is (10 sec: 5677.7, 60 sec: 5538.0, 300 sec: 5561.3). Total num frames: 30201856. Throughput: 0: 4901.0. Samples: 30198462. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:13,707][25689] Avg episode reward: [(0, '-62.107')] [2022-07-09 01:34:15,337][26022] Updated weights on worker 0-0, policy_version 29504 (0.00081) [2022-07-09 01:34:16,411][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:34:16,430][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000029509_30217216.pth [2022-07-09 01:34:16,431][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000027554_28215296.pth [2022-07-09 01:34:17,015][26022] Updated weights on worker 0-0, policy_version 29514 (0.00104) [2022-07-09 01:34:18,710][25689] Fps is (10 sec: 5394.2, 60 sec: 5556.0, 300 sec: 5555.2). Total num frames: 30230528. Throughput: 0: 5723.8. Samples: 30231952. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:18,712][25689] Avg episode reward: [(0, '-62.683')] [2022-07-09 01:34:19,003][26022] Updated weights on worker 0-0, policy_version 29524 (0.00085) [2022-07-09 01:34:20,811][26022] Updated weights on worker 0-0, policy_version 29534 (0.00085) [2022-07-09 01:34:22,701][26022] Updated weights on worker 0-0, policy_version 29544 (0.00086) [2022-07-09 01:34:23,745][25689] Fps is (10 sec: 5610.5, 60 sec: 5521.1, 300 sec: 5560.4). Total num frames: 30258176. Throughput: 0: 5829.1. Samples: 30265818. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:23,746][25689] Avg episode reward: [(0, '-62.530')] [2022-07-09 01:34:24,576][26022] Updated weights on worker 0-0, policy_version 29554 (0.00089) [2022-07-09 01:34:26,189][26022] Updated weights on worker 0-0, policy_version 29564 (0.00088) [2022-07-09 01:34:28,129][26022] Updated weights on worker 0-0, policy_version 29574 (0.00101) [2022-07-09 01:34:28,830][25689] Fps is (10 sec: 5666.2, 60 sec: 5558.3, 300 sec: 5558.9). Total num frames: 30287872. Throughput: 0: 4987.1. Samples: 30282524. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:28,832][25689] Avg episode reward: [(0, '-62.698')] [2022-07-09 01:34:30,034][26022] Updated weights on worker 0-0, policy_version 29584 (0.00087) [2022-07-09 01:34:31,908][26022] Updated weights on worker 0-0, policy_version 29594 (0.00087) [2022-07-09 01:34:33,680][26022] Updated weights on worker 0-0, policy_version 29604 (0.00085) [2022-07-09 01:34:33,914][25689] Fps is (10 sec: 5639.0, 60 sec: 5567.6, 300 sec: 5557.4). Total num frames: 30315520. Throughput: 0: 5828.6. Samples: 30316244. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:33,916][25689] Avg episode reward: [(0, '-63.606')] [2022-07-09 01:34:35,477][26022] Updated weights on worker 0-0, policy_version 29614 (0.00085) [2022-07-09 01:34:37,203][26022] Updated weights on worker 0-0, policy_version 29624 (0.00086) [2022-07-09 01:34:38,938][25689] Fps is (10 sec: 5470.5, 60 sec: 5518.4, 300 sec: 5553.7). Total num frames: 30343168. Throughput: 0: 5807.0. Samples: 30349418. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:38,939][25689] Avg episode reward: [(0, '-63.482')] [2022-07-09 01:34:39,340][26022] Updated weights on worker 0-0, policy_version 29634 (0.00084) [2022-07-09 01:34:40,902][26022] Updated weights on worker 0-0, policy_version 29644 (0.00083) [2022-07-09 01:34:42,827][26022] Updated weights on worker 0-0, policy_version 29654 (0.00841) [2022-07-09 01:34:43,947][25689] Fps is (10 sec: 5715.5, 60 sec: 5588.8, 300 sec: 5565.8). Total num frames: 30372864. Throughput: 0: 4977.2. Samples: 30366368. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:43,949][25689] Avg episode reward: [(0, '-62.097')] [2022-07-09 01:34:44,614][26022] Updated weights on worker 0-0, policy_version 29664 (0.00089) [2022-07-09 01:34:46,391][26022] Updated weights on worker 0-0, policy_version 29674 (0.00089) [2022-07-09 01:34:48,211][26022] Updated weights on worker 0-0, policy_version 29684 (0.00087) [2022-07-09 01:34:49,018][25689] Fps is (10 sec: 5688.8, 60 sec: 5586.7, 300 sec: 5559.5). Total num frames: 30400512. Throughput: 0: 5836.7. Samples: 30400356. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:34:49,019][25689] Avg episode reward: [(0, '-62.446')] [2022-07-09 01:34:50,196][26022] Updated weights on worker 0-0, policy_version 29694 (0.00082) [2022-07-09 01:34:51,881][26022] Updated weights on worker 0-0, policy_version 29704 (0.00083) [2022-07-09 01:34:53,643][26022] Updated weights on worker 0-0, policy_version 29714 (0.00085) [2022-07-09 01:34:54,032][25689] Fps is (10 sec: 5584.8, 60 sec: 5572.1, 300 sec: 5561.1). Total num frames: 30429184. Throughput: 0: 5882.6. Samples: 30434588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:34:54,032][25689] Avg episode reward: [(0, '-62.702')] [2022-07-09 01:34:55,418][26022] Updated weights on worker 0-0, policy_version 29724 (0.00090) [2022-07-09 01:34:57,347][26022] Updated weights on worker 0-0, policy_version 29734 (0.00528) [2022-07-09 01:34:59,001][26022] Updated weights on worker 0-0, policy_version 29744 (0.00083) [2022-07-09 01:34:59,044][25689] Fps is (10 sec: 5719.5, 60 sec: 5572.1, 300 sec: 5564.7). Total num frames: 30457856. Throughput: 0: 5091.4. Samples: 30451786. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:34:59,045][25689] Avg episode reward: [(0, '-63.329')] [2022-07-09 01:35:00,742][26022] Updated weights on worker 0-0, policy_version 29754 (0.00090) [2022-07-09 01:35:02,928][26022] Updated weights on worker 0-0, policy_version 29764 (0.00086) [2022-07-09 01:35:04,049][25689] Fps is (10 sec: 5417.7, 60 sec: 5607.7, 300 sec: 5565.2). Total num frames: 30483456. Throughput: 0: 5863.1. Samples: 30484228. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:35:04,050][25689] Avg episode reward: [(0, '-62.613')] [2022-07-09 01:35:04,751][26022] Updated weights on worker 0-0, policy_version 29774 (0.00083) [2022-07-09 01:35:06,639][26022] Updated weights on worker 0-0, policy_version 29784 (0.00085) [2022-07-09 01:35:08,685][26022] Updated weights on worker 0-0, policy_version 29794 (0.00083) [2022-07-09 01:35:09,153][25689] Fps is (10 sec: 5369.1, 60 sec: 5551.2, 300 sec: 5564.6). Total num frames: 30512128. Throughput: 0: 5804.0. Samples: 30517216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:35:09,153][25689] Avg episode reward: [(0, '-62.420')] [2022-07-09 01:35:10,277][26022] Updated weights on worker 0-0, policy_version 29804 (0.00092) [2022-07-09 01:35:12,216][26022] Updated weights on worker 0-0, policy_version 29814 (0.00089) [2022-07-09 01:35:13,877][26022] Updated weights on worker 0-0, policy_version 29824 (0.00048) [2022-07-09 01:35:14,242][25689] Fps is (10 sec: 5625.9, 60 sec: 5599.1, 300 sec: 5560.7). Total num frames: 30540800. Throughput: 0: 4932.3. Samples: 30534272. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:35:14,242][25689] Avg episode reward: [(0, '-62.698')] [2022-07-09 01:35:15,863][26022] Updated weights on worker 0-0, policy_version 29834 (0.00091) [2022-07-09 01:35:17,643][26022] Updated weights on worker 0-0, policy_version 29844 (0.00090) [2022-07-09 01:35:19,252][25689] Fps is (10 sec: 5677.9, 60 sec: 5598.5, 300 sec: 5571.2). Total num frames: 30569472. Throughput: 0: 5755.6. Samples: 30568094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:35:19,253][25689] Avg episode reward: [(0, '-62.168')] [2022-07-09 01:35:19,294][26022] Updated weights on worker 0-0, policy_version 29854 (0.00088) [2022-07-09 01:35:21,350][26022] Updated weights on worker 0-0, policy_version 29864 (0.00094) [2022-07-09 01:35:23,062][26022] Updated weights on worker 0-0, policy_version 29874 (0.00086) [2022-07-09 01:35:24,272][25689] Fps is (10 sec: 5615.1, 60 sec: 5599.9, 300 sec: 5568.5). Total num frames: 30597120. Throughput: 0: 5827.2. Samples: 30602070. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:35:24,273][25689] Avg episode reward: [(0, '-61.754')] [2022-07-09 01:35:24,969][26022] Updated weights on worker 0-0, policy_version 29884 (0.00078) [2022-07-09 01:35:26,695][26022] Updated weights on worker 0-0, policy_version 29894 (0.00089) [2022-07-09 01:35:28,482][26022] Updated weights on worker 0-0, policy_version 29904 (0.00088) [2022-07-09 01:35:29,313][25689] Fps is (10 sec: 5597.7, 60 sec: 5587.0, 300 sec: 5561.3). Total num frames: 30625792. Throughput: 0: 5037.7. Samples: 30618780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:35:29,314][25689] Avg episode reward: [(0, '-61.357')] [2022-07-09 01:35:30,498][26022] Updated weights on worker 0-0, policy_version 29914 (0.00072) [2022-07-09 01:35:32,106][26022] Updated weights on worker 0-0, policy_version 29924 (0.00102) [2022-07-09 01:35:34,076][26022] Updated weights on worker 0-0, policy_version 29934 (0.00092) [2022-07-09 01:35:34,359][25689] Fps is (10 sec: 5583.7, 60 sec: 5590.6, 300 sec: 5567.5). Total num frames: 30653440. Throughput: 0: 5883.2. Samples: 30652622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:35:34,359][25689] Avg episode reward: [(0, '-61.725')] [2022-07-09 01:35:35,856][26022] Updated weights on worker 0-0, policy_version 29944 (0.00092) [2022-07-09 01:35:37,742][26022] Updated weights on worker 0-0, policy_version 29954 (0.00098) [2022-07-09 01:35:39,375][25689] Fps is (10 sec: 5597.5, 60 sec: 5608.2, 300 sec: 5569.1). Total num frames: 30682112. Throughput: 0: 5856.2. Samples: 30685938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:35:39,375][25689] Avg episode reward: [(0, '-62.184')] [2022-07-09 01:35:39,571][26022] Updated weights on worker 0-0, policy_version 29964 (0.00091) [2022-07-09 01:35:41,320][26022] Updated weights on worker 0-0, policy_version 29974 (0.00087) [2022-07-09 01:35:43,114][26022] Updated weights on worker 0-0, policy_version 29984 (0.00086) [2022-07-09 01:35:44,395][25689] Fps is (10 sec: 5611.8, 60 sec: 5573.4, 300 sec: 5563.7). Total num frames: 30709760. Throughput: 0: 5011.7. Samples: 30702918. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:35:44,395][25689] Avg episode reward: [(0, '-62.543')] [2022-07-09 01:35:44,950][26022] Updated weights on worker 0-0, policy_version 29994 (0.00093) [2022-07-09 01:35:46,971][26022] Updated weights on worker 0-0, policy_version 30004 (0.00104) [2022-07-09 01:35:48,556][26022] Updated weights on worker 0-0, policy_version 30014 (0.00089) [2022-07-09 01:35:49,468][25689] Fps is (10 sec: 5580.3, 60 sec: 5590.1, 300 sec: 5566.0). Total num frames: 30738432. Throughput: 0: 5856.8. Samples: 30736820. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:35:49,468][25689] Avg episode reward: [(0, '-63.296')] [2022-07-09 01:35:50,591][26022] Updated weights on worker 0-0, policy_version 30024 (0.00089) [2022-07-09 01:35:52,174][26022] Updated weights on worker 0-0, policy_version 30034 (0.00100) [2022-07-09 01:35:53,987][26022] Updated weights on worker 0-0, policy_version 30044 (0.00087) [2022-07-09 01:35:54,505][25689] Fps is (10 sec: 5671.7, 60 sec: 5587.9, 300 sec: 5569.6). Total num frames: 30767104. Throughput: 0: 5852.0. Samples: 30770520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:35:54,506][25689] Avg episode reward: [(0, '-62.400')] [2022-07-09 01:35:56,072][26022] Updated weights on worker 0-0, policy_version 30054 (0.00089) [2022-07-09 01:35:57,646][26022] Updated weights on worker 0-0, policy_version 30064 (0.00080) [2022-07-09 01:35:59,419][26022] Updated weights on worker 0-0, policy_version 30074 (0.00089) [2022-07-09 01:35:59,544][25689] Fps is (10 sec: 5690.8, 60 sec: 5585.5, 300 sec: 5576.4). Total num frames: 30795776. Throughput: 0: 5888.2. Samples: 30804698. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:35:59,545][25689] Avg episode reward: [(0, '-61.785')] [2022-07-09 01:36:01,487][26022] Updated weights on worker 0-0, policy_version 30084 (0.00087) [2022-07-09 01:36:03,476][26022] Updated weights on worker 0-0, policy_version 30094 (0.00089) [2022-07-09 01:36:04,565][25689] Fps is (10 sec: 5496.8, 60 sec: 5600.9, 300 sec: 5567.1). Total num frames: 30822400. Throughput: 0: 5782.4. Samples: 30819550. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:36:04,565][25689] Avg episode reward: [(0, '-61.470')] [2022-07-09 01:36:05,452][26022] Updated weights on worker 0-0, policy_version 30104 (0.00087) [2022-07-09 01:36:07,189][26022] Updated weights on worker 0-0, policy_version 30114 (0.00093) [2022-07-09 01:36:09,031][26022] Updated weights on worker 0-0, policy_version 30124 (0.00087) [2022-07-09 01:36:09,611][25689] Fps is (10 sec: 5391.2, 60 sec: 5589.3, 300 sec: 5566.8). Total num frames: 30850048. Throughput: 0: 5786.9. Samples: 30853388. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 01:36:09,611][25689] Avg episode reward: [(0, '-61.053')] [2022-07-09 01:36:10,874][26022] Updated weights on worker 0-0, policy_version 30134 (0.00083) [2022-07-09 01:36:12,707][26022] Updated weights on worker 0-0, policy_version 30144 (0.00100) [2022-07-09 01:36:14,383][26022] Updated weights on worker 0-0, policy_version 30154 (0.00107) [2022-07-09 01:36:14,613][25689] Fps is (10 sec: 5604.9, 60 sec: 5597.4, 300 sec: 5570.4). Total num frames: 30878720. Throughput: 0: 5810.3. Samples: 30887354. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:14,614][25689] Avg episode reward: [(0, '-61.038')] [2022-07-09 01:36:16,361][26022] Updated weights on worker 0-0, policy_version 30164 (0.00089) [2022-07-09 01:36:16,513][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:36:16,533][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000030165_30888960.pth [2022-07-09 01:36:16,534][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000028205_28881920.pth [2022-07-09 01:36:17,898][26022] Updated weights on worker 0-0, policy_version 30174 (0.00088) [2022-07-09 01:36:19,623][25689] Fps is (10 sec: 5625.4, 60 sec: 5580.5, 300 sec: 5570.8). Total num frames: 30906368. Throughput: 0: 4958.2. Samples: 30904250. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:19,623][25689] Avg episode reward: [(0, '-60.979')] [2022-07-09 01:36:19,986][26022] Updated weights on worker 0-0, policy_version 30184 (0.00085) [2022-07-09 01:36:21,594][26022] Updated weights on worker 0-0, policy_version 30194 (0.00391) [2022-07-09 01:36:23,449][26022] Updated weights on worker 0-0, policy_version 30204 (0.00087) [2022-07-09 01:36:24,655][25689] Fps is (10 sec: 5608.4, 60 sec: 5596.2, 300 sec: 5567.8). Total num frames: 30935040. Throughput: 0: 5919.3. Samples: 30938472. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:24,656][25689] Avg episode reward: [(0, '-61.240')] [2022-07-09 01:36:25,288][26022] Updated weights on worker 0-0, policy_version 30214 (0.00099) [2022-07-09 01:36:27,128][26022] Updated weights on worker 0-0, policy_version 30224 (0.00086) [2022-07-09 01:36:28,963][26022] Updated weights on worker 0-0, policy_version 30234 (0.00098) [2022-07-09 01:36:29,789][25689] Fps is (10 sec: 5741.5, 60 sec: 5604.7, 300 sec: 5575.7). Total num frames: 30964736. Throughput: 0: 5885.6. Samples: 30972146. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:29,789][25689] Avg episode reward: [(0, '-61.635')] [2022-07-09 01:36:30,788][26022] Updated weights on worker 0-0, policy_version 30244 (0.00078) [2022-07-09 01:36:32,552][26022] Updated weights on worker 0-0, policy_version 30254 (0.00093) [2022-07-09 01:36:34,425][26022] Updated weights on worker 0-0, policy_version 30264 (0.00086) [2022-07-09 01:36:34,819][25689] Fps is (10 sec: 5642.2, 60 sec: 5606.1, 300 sec: 5568.4). Total num frames: 30992384. Throughput: 0: 5042.0. Samples: 30989230. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:34,820][25689] Avg episode reward: [(0, '-62.469')] [2022-07-09 01:36:35,995][26022] Updated weights on worker 0-0, policy_version 30274 (0.00095) [2022-07-09 01:36:37,942][26022] Updated weights on worker 0-0, policy_version 30284 (0.00087) [2022-07-09 01:36:39,827][25689] Fps is (10 sec: 5508.6, 60 sec: 5589.9, 300 sec: 5575.5). Total num frames: 31020032. Throughput: 0: 5872.8. Samples: 31022904. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:39,827][25689] Avg episode reward: [(0, '-62.316')] [2022-07-09 01:36:39,933][26022] Updated weights on worker 0-0, policy_version 30294 (0.00087) [2022-07-09 01:36:41,684][26022] Updated weights on worker 0-0, policy_version 30304 (0.00089) [2022-07-09 01:36:43,426][26022] Updated weights on worker 0-0, policy_version 30314 (0.00109) [2022-07-09 01:36:44,854][25689] Fps is (10 sec: 5510.1, 60 sec: 5589.2, 300 sec: 5569.4). Total num frames: 31047680. Throughput: 0: 5846.0. Samples: 31056554. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:44,855][25689] Avg episode reward: [(0, '-62.492')] [2022-07-09 01:36:45,209][26022] Updated weights on worker 0-0, policy_version 30324 (0.00084) [2022-07-09 01:36:47,317][26022] Updated weights on worker 0-0, policy_version 30334 (0.00093) [2022-07-09 01:36:49,090][26022] Updated weights on worker 0-0, policy_version 30344 (0.00096) [2022-07-09 01:36:49,939][25689] Fps is (10 sec: 5670.8, 60 sec: 5605.1, 300 sec: 5582.1). Total num frames: 31077376. Throughput: 0: 5016.9. Samples: 31073238. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:36:49,939][25689] Avg episode reward: [(0, '-62.749')] [2022-07-09 01:36:50,892][26022] Updated weights on worker 0-0, policy_version 30354 (0.00084) [2022-07-09 01:36:52,517][26022] Updated weights on worker 0-0, policy_version 30364 (0.00086) [2022-07-09 01:36:54,375][26022] Updated weights on worker 0-0, policy_version 30374 (0.00087) [2022-07-09 01:36:55,033][25689] Fps is (10 sec: 5734.1, 60 sec: 5599.8, 300 sec: 5574.1). Total num frames: 31106048. Throughput: 0: 5845.4. Samples: 31107392. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:36:55,034][25689] Avg episode reward: [(0, '-62.863')] [2022-07-09 01:36:56,191][26022] Updated weights on worker 0-0, policy_version 30384 (0.00094) [2022-07-09 01:36:58,028][26022] Updated weights on worker 0-0, policy_version 30394 (0.00091) [2022-07-09 01:36:59,926][26022] Updated weights on worker 0-0, policy_version 30404 (0.00558) [2022-07-09 01:37:00,038][25689] Fps is (10 sec: 5678.0, 60 sec: 5602.9, 300 sec: 5581.0). Total num frames: 31134720. Throughput: 0: 5866.2. Samples: 31141468. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:00,040][25689] Avg episode reward: [(0, '-62.792')] [2022-07-09 01:37:01,462][26022] Updated weights on worker 0-0, policy_version 30414 (0.00081) [2022-07-09 01:37:03,971][26022] Updated weights on worker 0-0, policy_version 30424 (0.00075) [2022-07-09 01:37:05,121][25689] Fps is (10 sec: 5481.8, 60 sec: 5597.2, 300 sec: 5584.7). Total num frames: 31161344. Throughput: 0: 4918.5. Samples: 31156234. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:05,121][25689] Avg episode reward: [(0, '-62.591')] [2022-07-09 01:37:05,579][26022] Updated weights on worker 0-0, policy_version 30434 (0.00091) [2022-07-09 01:37:07,563][26022] Updated weights on worker 0-0, policy_version 30444 (0.00617) [2022-07-09 01:37:09,375][26022] Updated weights on worker 0-0, policy_version 30454 (0.00087) [2022-07-09 01:37:10,225][25689] Fps is (10 sec: 5327.6, 60 sec: 5591.8, 300 sec: 5572.5). Total num frames: 31188992. Throughput: 0: 5760.0. Samples: 31190088. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:10,226][25689] Avg episode reward: [(0, '-62.906')] [2022-07-09 01:37:11,234][26022] Updated weights on worker 0-0, policy_version 30464 (0.00082) [2022-07-09 01:37:13,112][26022] Updated weights on worker 0-0, policy_version 30474 (0.00086) [2022-07-09 01:37:14,942][26022] Updated weights on worker 0-0, policy_version 30484 (0.00084) [2022-07-09 01:37:15,246][25689] Fps is (10 sec: 5562.4, 60 sec: 5590.1, 300 sec: 5575.9). Total num frames: 31217664. Throughput: 0: 5756.8. Samples: 31223750. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:15,246][25689] Avg episode reward: [(0, '-63.039')] [2022-07-09 01:37:16,472][26022] Updated weights on worker 0-0, policy_version 30494 (0.00083) [2022-07-09 01:37:18,571][26022] Updated weights on worker 0-0, policy_version 30504 (0.00085) [2022-07-09 01:37:20,271][25689] Fps is (10 sec: 5606.5, 60 sec: 5588.7, 300 sec: 5575.8). Total num frames: 31245312. Throughput: 0: 4907.1. Samples: 31240750. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:20,271][25689] Avg episode reward: [(0, '-61.819')] [2022-07-09 01:37:20,340][26022] Updated weights on worker 0-0, policy_version 30514 (0.00624) [2022-07-09 01:37:22,128][26022] Updated weights on worker 0-0, policy_version 30524 (0.00087) [2022-07-09 01:37:24,016][26022] Updated weights on worker 0-0, policy_version 30534 (0.00082) [2022-07-09 01:37:25,295][25689] Fps is (10 sec: 5706.3, 60 sec: 5606.4, 300 sec: 5579.8). Total num frames: 31275008. Throughput: 0: 5856.8. Samples: 31274390. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:25,295][25689] Avg episode reward: [(0, '-61.801')] [2022-07-09 01:37:25,794][26022] Updated weights on worker 0-0, policy_version 30544 (0.00086) [2022-07-09 01:37:27,656][26022] Updated weights on worker 0-0, policy_version 30554 (0.00085) [2022-07-09 01:37:29,567][26022] Updated weights on worker 0-0, policy_version 30564 (0.00438) [2022-07-09 01:37:30,341][25689] Fps is (10 sec: 5694.1, 60 sec: 5580.6, 300 sec: 5575.5). Total num frames: 31302656. Throughput: 0: 5853.1. Samples: 31307830. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 01:37:30,342][25689] Avg episode reward: [(0, '-61.894')] [2022-07-09 01:37:31,230][26022] Updated weights on worker 0-0, policy_version 30574 (0.00089) [2022-07-09 01:37:33,138][26022] Updated weights on worker 0-0, policy_version 30584 (0.00090) [2022-07-09 01:37:34,967][26022] Updated weights on worker 0-0, policy_version 30594 (0.00093) [2022-07-09 01:37:35,393][25689] Fps is (10 sec: 5475.9, 60 sec: 5578.6, 300 sec: 5578.4). Total num frames: 31330304. Throughput: 0: 5013.3. Samples: 31324754. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:37:35,393][25689] Avg episode reward: [(0, '-62.708')] [2022-07-09 01:37:36,796][26022] Updated weights on worker 0-0, policy_version 30604 (0.00091) [2022-07-09 01:37:38,570][26022] Updated weights on worker 0-0, policy_version 30614 (0.00085) [2022-07-09 01:37:40,435][25689] Fps is (10 sec: 5478.5, 60 sec: 5575.5, 300 sec: 5570.8). Total num frames: 31357952. Throughput: 0: 5849.1. Samples: 31358692. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:37:40,435][25689] Avg episode reward: [(0, '-62.192')] [2022-07-09 01:37:40,451][26022] Updated weights on worker 0-0, policy_version 30624 (0.00087) [2022-07-09 01:37:42,096][26022] Updated weights on worker 0-0, policy_version 30634 (0.00093) [2022-07-09 01:37:44,009][26022] Updated weights on worker 0-0, policy_version 30644 (0.00067) [2022-07-09 01:37:45,451][25689] Fps is (10 sec: 5701.4, 60 sec: 5610.4, 300 sec: 5578.4). Total num frames: 31387648. Throughput: 0: 5874.9. Samples: 31392804. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:37:45,451][25689] Avg episode reward: [(0, '-62.345')] [2022-07-09 01:37:45,784][26022] Updated weights on worker 0-0, policy_version 30654 (0.00087) [2022-07-09 01:37:47,602][26022] Updated weights on worker 0-0, policy_version 30664 (0.00089) [2022-07-09 01:37:49,352][26022] Updated weights on worker 0-0, policy_version 30674 (0.00110) [2022-07-09 01:37:50,576][25689] Fps is (10 sec: 5755.5, 60 sec: 5589.7, 300 sec: 5579.9). Total num frames: 31416320. Throughput: 0: 5030.5. Samples: 31409620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:37:50,576][25689] Avg episode reward: [(0, '-63.050')] [2022-07-09 01:37:51,359][26022] Updated weights on worker 0-0, policy_version 30684 (0.00086) [2022-07-09 01:37:52,939][26022] Updated weights on worker 0-0, policy_version 30694 (0.00084) [2022-07-09 01:37:54,746][26022] Updated weights on worker 0-0, policy_version 30704 (0.00082) [2022-07-09 01:37:55,636][25689] Fps is (10 sec: 5630.2, 60 sec: 5592.9, 300 sec: 5579.2). Total num frames: 31444992. Throughput: 0: 5891.6. Samples: 31444016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:37:55,636][25689] Avg episode reward: [(0, '-62.691')] [2022-07-09 01:37:56,632][26022] Updated weights on worker 0-0, policy_version 30714 (0.00082) [2022-07-09 01:37:58,419][26022] Updated weights on worker 0-0, policy_version 30724 (0.00082) [2022-07-09 01:38:00,211][26022] Updated weights on worker 0-0, policy_version 30734 (0.00098) [2022-07-09 01:38:00,663][25689] Fps is (10 sec: 5684.7, 60 sec: 5590.8, 300 sec: 5589.2). Total num frames: 31473664. Throughput: 0: 5904.9. Samples: 31478140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:38:00,664][25689] Avg episode reward: [(0, '-60.868')] [2022-07-09 01:38:02,451][26022] Updated weights on worker 0-0, policy_version 30744 (0.00104) [2022-07-09 01:38:04,119][26022] Updated weights on worker 0-0, policy_version 30754 (0.00089) [2022-07-09 01:38:05,684][25689] Fps is (10 sec: 5401.4, 60 sec: 5579.6, 300 sec: 5576.7). Total num frames: 31499264. Throughput: 0: 4939.0. Samples: 31492734. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:38:05,684][25689] Avg episode reward: [(0, '-59.922')] [2022-07-09 01:38:06,143][26022] Updated weights on worker 0-0, policy_version 30764 (0.00080) [2022-07-09 01:38:07,800][26022] Updated weights on worker 0-0, policy_version 30774 (0.00080) [2022-07-09 01:38:09,675][26022] Updated weights on worker 0-0, policy_version 30784 (0.00081) [2022-07-09 01:38:10,770][25689] Fps is (10 sec: 5572.7, 60 sec: 5632.1, 300 sec: 5589.3). Total num frames: 31529984. Throughput: 0: 5795.5. Samples: 31526654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 01:38:10,771][25689] Avg episode reward: [(0, '-60.563')] [2022-07-09 01:38:11,449][26022] Updated weights on worker 0-0, policy_version 30794 (0.00103) [2022-07-09 01:38:13,270][26022] Updated weights on worker 0-0, policy_version 30804 (0.00083) [2022-07-09 01:38:15,073][26022] Updated weights on worker 0-0, policy_version 30814 (0.00089) [2022-07-09 01:38:15,813][25689] Fps is (10 sec: 5661.2, 60 sec: 5596.2, 300 sec: 5585.4). Total num frames: 31556608. Throughput: 0: 5786.4. Samples: 31560768. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:15,814][25689] Avg episode reward: [(0, '-60.370')] [2022-07-09 01:38:16,544][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:38:16,561][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000030822_31561728.pth [2022-07-09 01:38:16,562][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000028857_29549568.pth [2022-07-09 01:38:16,774][26022] Updated weights on worker 0-0, policy_version 30824 (0.00081) [2022-07-09 01:38:18,867][26022] Updated weights on worker 0-0, policy_version 30834 (0.00086) [2022-07-09 01:38:20,343][26022] Updated weights on worker 0-0, policy_version 30844 (0.00082) [2022-07-09 01:38:20,899][25689] Fps is (10 sec: 5560.1, 60 sec: 5624.3, 300 sec: 5584.2). Total num frames: 31586304. Throughput: 0: 5776.0. Samples: 31595022. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:20,900][25689] Avg episode reward: [(0, '-60.705')] [2022-07-09 01:38:22,319][26022] Updated weights on worker 0-0, policy_version 30854 (0.00089) [2022-07-09 01:38:23,959][26022] Updated weights on worker 0-0, policy_version 30864 (0.00089) [2022-07-09 01:38:25,902][25689] Fps is (10 sec: 5683.9, 60 sec: 5592.5, 300 sec: 5586.4). Total num frames: 31613952. Throughput: 0: 5899.5. Samples: 31612010. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:25,902][25689] Avg episode reward: [(0, '-60.708')] [2022-07-09 01:38:25,916][26022] Updated weights on worker 0-0, policy_version 30874 (0.00092) [2022-07-09 01:38:27,643][26022] Updated weights on worker 0-0, policy_version 30884 (0.00065) [2022-07-09 01:38:29,647][26022] Updated weights on worker 0-0, policy_version 30894 (0.00083) [2022-07-09 01:38:31,032][25689] Fps is (10 sec: 5558.4, 60 sec: 5601.7, 300 sec: 5590.9). Total num frames: 31642624. Throughput: 0: 5879.4. Samples: 31645780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:31,032][25689] Avg episode reward: [(0, '-61.349')] [2022-07-09 01:38:31,380][26022] Updated weights on worker 0-0, policy_version 30904 (0.00094) [2022-07-09 01:38:33,239][26022] Updated weights on worker 0-0, policy_version 30914 (0.00088) [2022-07-09 01:38:35,036][26022] Updated weights on worker 0-0, policy_version 30924 (0.00087) [2022-07-09 01:38:36,040][25689] Fps is (10 sec: 5757.2, 60 sec: 5639.5, 300 sec: 5588.1). Total num frames: 31672320. Throughput: 0: 5870.2. Samples: 31679504. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:36,040][25689] Avg episode reward: [(0, '-61.749')] [2022-07-09 01:38:37,034][26022] Updated weights on worker 0-0, policy_version 30934 (0.00100) [2022-07-09 01:38:38,670][26022] Updated weights on worker 0-0, policy_version 30944 (0.00091) [2022-07-09 01:38:40,735][26022] Updated weights on worker 0-0, policy_version 30954 (0.00095) [2022-07-09 01:38:41,103][25689] Fps is (10 sec: 5592.2, 60 sec: 5620.6, 300 sec: 5591.0). Total num frames: 31698944. Throughput: 0: 5005.1. Samples: 31696142. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:41,103][25689] Avg episode reward: [(0, '-61.772')] [2022-07-09 01:38:42,303][26022] Updated weights on worker 0-0, policy_version 30964 (0.00082) [2022-07-09 01:38:44,282][26022] Updated weights on worker 0-0, policy_version 30974 (0.00083) [2022-07-09 01:38:45,812][26022] Updated weights on worker 0-0, policy_version 30984 (0.00087) [2022-07-09 01:38:46,179][25689] Fps is (10 sec: 5554.8, 60 sec: 5615.1, 300 sec: 5597.4). Total num frames: 31728640. Throughput: 0: 5816.2. Samples: 31729946. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:46,179][25689] Avg episode reward: [(0, '-61.450')] [2022-07-09 01:38:47,869][26022] Updated weights on worker 0-0, policy_version 30994 (0.00086) [2022-07-09 01:38:49,607][26022] Updated weights on worker 0-0, policy_version 31004 (0.00085) [2022-07-09 01:38:51,283][25689] Fps is (10 sec: 5632.9, 60 sec: 5600.2, 300 sec: 5589.3). Total num frames: 31756288. Throughput: 0: 5819.4. Samples: 31763630. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:38:51,283][25689] Avg episode reward: [(0, '-61.769')] [2022-07-09 01:38:51,595][26022] Updated weights on worker 0-0, policy_version 31014 (0.00087) [2022-07-09 01:38:53,331][26022] Updated weights on worker 0-0, policy_version 31024 (0.00094) [2022-07-09 01:38:55,226][26022] Updated weights on worker 0-0, policy_version 31034 (0.00086) [2022-07-09 01:38:56,368][25689] Fps is (10 sec: 5527.6, 60 sec: 5597.9, 300 sec: 5587.9). Total num frames: 31784960. Throughput: 0: 4966.6. Samples: 31780464. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:38:56,368][25689] Avg episode reward: [(0, '-61.338')] [2022-07-09 01:38:56,937][26022] Updated weights on worker 0-0, policy_version 31044 (0.00256) [2022-07-09 01:38:58,818][26022] Updated weights on worker 0-0, policy_version 31054 (0.00080) [2022-07-09 01:39:00,545][26022] Updated weights on worker 0-0, policy_version 31064 (0.00087) [2022-07-09 01:39:01,381][25689] Fps is (10 sec: 5678.4, 60 sec: 5599.1, 300 sec: 5605.3). Total num frames: 31813632. Throughput: 0: 5830.6. Samples: 31814380. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:01,382][25689] Avg episode reward: [(0, '-61.435')] [2022-07-09 01:39:02,866][26022] Updated weights on worker 0-0, policy_version 31074 (0.00092) [2022-07-09 01:39:04,529][26022] Updated weights on worker 0-0, policy_version 31084 (0.00093) [2022-07-09 01:39:06,391][25689] Fps is (10 sec: 5414.5, 60 sec: 5600.1, 300 sec: 5585.3). Total num frames: 31839232. Throughput: 0: 5734.1. Samples: 31845846. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:06,392][25689] Avg episode reward: [(0, '-61.610')] [2022-07-09 01:39:06,581][26022] Updated weights on worker 0-0, policy_version 31094 (0.00082) [2022-07-09 01:39:08,285][26022] Updated weights on worker 0-0, policy_version 31104 (0.00082) [2022-07-09 01:39:10,134][26022] Updated weights on worker 0-0, policy_version 31114 (0.00086) [2022-07-09 01:39:11,471][25689] Fps is (10 sec: 5277.4, 60 sec: 5550.1, 300 sec: 5591.7). Total num frames: 31866880. Throughput: 0: 4891.9. Samples: 31862390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:11,473][25689] Avg episode reward: [(0, '-60.896')] [2022-07-09 01:39:11,960][26022] Updated weights on worker 0-0, policy_version 31124 (0.00094) [2022-07-09 01:39:13,795][26022] Updated weights on worker 0-0, policy_version 31134 (0.00087) [2022-07-09 01:39:15,865][26022] Updated weights on worker 0-0, policy_version 31144 (0.00081) [2022-07-09 01:39:16,507][25689] Fps is (10 sec: 5466.1, 60 sec: 5567.6, 300 sec: 5587.7). Total num frames: 31894528. Throughput: 0: 5718.6. Samples: 31895634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:16,509][25689] Avg episode reward: [(0, '-61.457')] [2022-07-09 01:39:17,528][26022] Updated weights on worker 0-0, policy_version 31154 (0.00095) [2022-07-09 01:39:19,522][26022] Updated weights on worker 0-0, policy_version 31164 (0.00086) [2022-07-09 01:39:21,226][26022] Updated weights on worker 0-0, policy_version 31174 (0.00090) [2022-07-09 01:39:21,523][25689] Fps is (10 sec: 5603.1, 60 sec: 5557.2, 300 sec: 5591.5). Total num frames: 31923200. Throughput: 0: 5691.7. Samples: 31929018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:21,525][25689] Avg episode reward: [(0, '-61.633')] [2022-07-09 01:39:23,102][26022] Updated weights on worker 0-0, policy_version 31184 (0.00087) [2022-07-09 01:39:24,912][26022] Updated weights on worker 0-0, policy_version 31194 (0.00091) [2022-07-09 01:39:26,530][25689] Fps is (10 sec: 5619.4, 60 sec: 5556.8, 300 sec: 5586.1). Total num frames: 31950848. Throughput: 0: 4961.7. Samples: 31945768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:26,530][25689] Avg episode reward: [(0, '-61.810')] [2022-07-09 01:39:26,750][26022] Updated weights on worker 0-0, policy_version 31204 (0.00083) [2022-07-09 01:39:28,574][26022] Updated weights on worker 0-0, policy_version 31214 (0.00089) [2022-07-09 01:39:30,556][26022] Updated weights on worker 0-0, policy_version 31224 (0.00098) [2022-07-09 01:39:31,596][25689] Fps is (10 sec: 5489.2, 60 sec: 5545.7, 300 sec: 5586.4). Total num frames: 31978496. Throughput: 0: 5801.1. Samples: 31979138. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:39:31,597][25689] Avg episode reward: [(0, '-60.851')] [2022-07-09 01:39:32,286][26022] Updated weights on worker 0-0, policy_version 31234 (0.00088) [2022-07-09 01:39:34,109][26022] Updated weights on worker 0-0, policy_version 31244 (0.00089) [2022-07-09 01:39:35,911][26022] Updated weights on worker 0-0, policy_version 31254 (0.00083) [2022-07-09 01:39:36,616][25689] Fps is (10 sec: 5584.0, 60 sec: 5527.8, 300 sec: 5589.9). Total num frames: 32007168. Throughput: 0: 5818.6. Samples: 32012636. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:39:36,616][25689] Avg episode reward: [(0, '-60.693')] [2022-07-09 01:39:37,938][26022] Updated weights on worker 0-0, policy_version 31264 (0.00087) [2022-07-09 01:39:39,551][26022] Updated weights on worker 0-0, policy_version 31274 (0.00089) [2022-07-09 01:39:41,604][26022] Updated weights on worker 0-0, policy_version 31284 (0.00091) [2022-07-09 01:39:41,621][25689] Fps is (10 sec: 5618.2, 60 sec: 5549.9, 300 sec: 5583.1). Total num frames: 32034816. Throughput: 0: 4997.4. Samples: 32029456. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:39:41,621][25689] Avg episode reward: [(0, '-59.689')] [2022-07-09 01:39:43,186][26022] Updated weights on worker 0-0, policy_version 31294 (0.00082) [2022-07-09 01:39:45,178][26022] Updated weights on worker 0-0, policy_version 31304 (0.00092) [2022-07-09 01:39:46,622][25689] Fps is (10 sec: 5628.2, 60 sec: 5539.8, 300 sec: 5587.9). Total num frames: 32063488. Throughput: 0: 5861.7. Samples: 32063544. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:39:46,623][25689] Avg episode reward: [(0, '-58.801')] [2022-07-09 01:39:46,902][26022] Updated weights on worker 0-0, policy_version 31314 (0.00087) [2022-07-09 01:39:48,754][26022] Updated weights on worker 0-0, policy_version 31324 (0.00082) [2022-07-09 01:39:50,720][26022] Updated weights on worker 0-0, policy_version 31334 (0.00086) [2022-07-09 01:39:51,731][25689] Fps is (10 sec: 5672.0, 60 sec: 5556.4, 300 sec: 5586.1). Total num frames: 32092160. Throughput: 0: 5880.4. Samples: 32097538. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:39:51,731][25689] Avg episode reward: [(0, '-58.723')] [2022-07-09 01:39:52,392][26022] Updated weights on worker 0-0, policy_version 31344 (0.00089) [2022-07-09 01:39:54,229][26022] Updated weights on worker 0-0, policy_version 31354 (0.00089) [2022-07-09 01:39:56,058][26022] Updated weights on worker 0-0, policy_version 31364 (0.00093) [2022-07-09 01:39:56,784][25689] Fps is (10 sec: 5542.6, 60 sec: 5542.4, 300 sec: 5581.9). Total num frames: 32119808. Throughput: 0: 5056.2. Samples: 32114610. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:39:56,784][25689] Avg episode reward: [(0, '-58.504')] [2022-07-09 01:39:57,849][26022] Updated weights on worker 0-0, policy_version 31374 (0.00083) [2022-07-09 01:39:59,647][26022] Updated weights on worker 0-0, policy_version 31384 (0.00097) [2022-07-09 01:40:01,308][26022] Updated weights on worker 0-0, policy_version 31394 (0.00095) [2022-07-09 01:40:01,839][25689] Fps is (10 sec: 5571.9, 60 sec: 5538.6, 300 sec: 5591.3). Total num frames: 32148480. Throughput: 0: 5875.9. Samples: 32148254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:40:01,839][25689] Avg episode reward: [(0, '-59.712')] [2022-07-09 01:40:03,803][26022] Updated weights on worker 0-0, policy_version 31404 (0.00078) [2022-07-09 01:40:05,462][26022] Updated weights on worker 0-0, policy_version 31414 (0.00085) [2022-07-09 01:40:06,840][25689] Fps is (10 sec: 5498.6, 60 sec: 5556.3, 300 sec: 5586.3). Total num frames: 32175104. Throughput: 0: 5751.2. Samples: 32179820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:40:06,840][25689] Avg episode reward: [(0, '-61.249')] [2022-07-09 01:40:07,416][26022] Updated weights on worker 0-0, policy_version 31424 (0.00094) [2022-07-09 01:40:09,128][26022] Updated weights on worker 0-0, policy_version 31434 (0.00092) [2022-07-09 01:40:10,916][26022] Updated weights on worker 0-0, policy_version 31444 (0.00081) [2022-07-09 01:40:11,929][25689] Fps is (10 sec: 5480.1, 60 sec: 5572.4, 300 sec: 5586.3). Total num frames: 32203776. Throughput: 0: 4905.8. Samples: 32196626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 01:40:11,930][25689] Avg episode reward: [(0, '-61.843')] [2022-07-09 01:40:12,904][26022] Updated weights on worker 0-0, policy_version 31454 (0.00095) [2022-07-09 01:40:14,621][26022] Updated weights on worker 0-0, policy_version 31464 (0.00084) [2022-07-09 01:40:16,395][26022] Updated weights on worker 0-0, policy_version 31474 (0.00081) [2022-07-09 01:40:16,656][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:40:16,666][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000031475_32230400.pth [2022-07-09 01:40:16,666][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000029509_30217216.pth [2022-07-09 01:40:16,972][25689] Fps is (10 sec: 5659.9, 60 sec: 5588.7, 300 sec: 5585.7). Total num frames: 32232448. Throughput: 0: 5719.4. Samples: 32230074. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:16,972][25689] Avg episode reward: [(0, '-61.654')] [2022-07-09 01:40:18,385][26022] Updated weights on worker 0-0, policy_version 31484 (0.00083) [2022-07-09 01:40:20,164][26022] Updated weights on worker 0-0, policy_version 31494 (0.00093) [2022-07-09 01:40:21,977][25689] Fps is (10 sec: 5503.4, 60 sec: 5555.8, 300 sec: 5582.6). Total num frames: 32259072. Throughput: 0: 5739.1. Samples: 32263828. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:21,979][25689] Avg episode reward: [(0, '-62.334')] [2022-07-09 01:40:22,039][26022] Updated weights on worker 0-0, policy_version 31504 (0.00092) [2022-07-09 01:40:23,545][26022] Updated weights on worker 0-0, policy_version 31514 (0.00084) [2022-07-09 01:40:25,726][26022] Updated weights on worker 0-0, policy_version 31524 (0.00095) [2022-07-09 01:40:26,982][25689] Fps is (10 sec: 5523.8, 60 sec: 5572.9, 300 sec: 5583.2). Total num frames: 32287744. Throughput: 0: 5007.1. Samples: 32280674. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:26,983][25689] Avg episode reward: [(0, '-61.845')] [2022-07-09 01:40:27,378][26022] Updated weights on worker 0-0, policy_version 31534 (0.00089) [2022-07-09 01:40:29,260][26022] Updated weights on worker 0-0, policy_version 31544 (0.00086) [2022-07-09 01:40:31,058][26022] Updated weights on worker 0-0, policy_version 31554 (0.00083) [2022-07-09 01:40:32,094][25689] Fps is (10 sec: 5667.7, 60 sec: 5585.6, 300 sec: 5585.4). Total num frames: 32316416. Throughput: 0: 5831.1. Samples: 32314212. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:32,095][25689] Avg episode reward: [(0, '-61.210')] [2022-07-09 01:40:33,111][26022] Updated weights on worker 0-0, policy_version 31564 (0.00083) [2022-07-09 01:40:34,904][26022] Updated weights on worker 0-0, policy_version 31574 (0.00092) [2022-07-09 01:40:36,423][26022] Updated weights on worker 0-0, policy_version 31584 (0.00088) [2022-07-09 01:40:37,163][25689] Fps is (10 sec: 5632.6, 60 sec: 5581.1, 300 sec: 5584.4). Total num frames: 32345088. Throughput: 0: 5838.3. Samples: 32347956. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:37,164][25689] Avg episode reward: [(0, '-60.717')] [2022-07-09 01:40:38,458][26022] Updated weights on worker 0-0, policy_version 31594 (0.00095) [2022-07-09 01:40:40,156][26022] Updated weights on worker 0-0, policy_version 31604 (0.00081) [2022-07-09 01:40:42,125][26022] Updated weights on worker 0-0, policy_version 31614 (0.00084) [2022-07-09 01:40:42,198][25689] Fps is (10 sec: 5574.4, 60 sec: 5578.4, 300 sec: 5584.1). Total num frames: 32372736. Throughput: 0: 5817.6. Samples: 32381466. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:42,198][25689] Avg episode reward: [(0, '-59.793')] [2022-07-09 01:40:43,874][26022] Updated weights on worker 0-0, policy_version 31624 (0.00087) [2022-07-09 01:40:45,789][26022] Updated weights on worker 0-0, policy_version 31634 (0.00105) [2022-07-09 01:40:47,202][25689] Fps is (10 sec: 5609.9, 60 sec: 5578.1, 300 sec: 5585.4). Total num frames: 32401408. Throughput: 0: 5815.8. Samples: 32398270. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:47,203][25689] Avg episode reward: [(0, '-60.535')] [2022-07-09 01:40:47,611][26022] Updated weights on worker 0-0, policy_version 31644 (0.00088) [2022-07-09 01:40:49,424][26022] Updated weights on worker 0-0, policy_version 31654 (0.00093) [2022-07-09 01:40:51,313][26022] Updated weights on worker 0-0, policy_version 31664 (0.00084) [2022-07-09 01:40:52,312][25689] Fps is (10 sec: 5568.1, 60 sec: 5561.0, 300 sec: 5580.6). Total num frames: 32429056. Throughput: 0: 5811.9. Samples: 32431716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:40:52,313][25689] Avg episode reward: [(0, '-59.374')] [2022-07-09 01:40:52,995][26022] Updated weights on worker 0-0, policy_version 31674 (0.00086) [2022-07-09 01:40:54,939][26022] Updated weights on worker 0-0, policy_version 31684 (0.00091) [2022-07-09 01:40:56,732][26022] Updated weights on worker 0-0, policy_version 31694 (0.00089) [2022-07-09 01:40:57,366][25689] Fps is (10 sec: 5742.7, 60 sec: 5611.7, 300 sec: 5587.2). Total num frames: 32459776. Throughput: 0: 5841.3. Samples: 32465970. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:40:57,367][25689] Avg episode reward: [(0, '-60.040')] [2022-07-09 01:40:58,477][26022] Updated weights on worker 0-0, policy_version 31704 (0.00099) [2022-07-09 01:41:00,243][26022] Updated weights on worker 0-0, policy_version 31714 (0.00090) [2022-07-09 01:41:02,421][25689] Fps is (10 sec: 5470.0, 60 sec: 5544.0, 300 sec: 5579.7). Total num frames: 32484352. Throughput: 0: 5028.7. Samples: 32483164. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:02,422][25689] Avg episode reward: [(0, '-59.583')] [2022-07-09 01:41:02,475][26022] Updated weights on worker 0-0, policy_version 31724 (0.00087) [2022-07-09 01:41:04,240][26022] Updated weights on worker 0-0, policy_version 31734 (0.00092) [2022-07-09 01:41:06,102][26022] Updated weights on worker 0-0, policy_version 31744 (0.00101) [2022-07-09 01:41:07,446][25689] Fps is (10 sec: 5282.7, 60 sec: 5575.7, 300 sec: 5583.5). Total num frames: 32513024. Throughput: 0: 5757.8. Samples: 32514828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:07,446][25689] Avg episode reward: [(0, '-60.302')] [2022-07-09 01:41:07,885][26022] Updated weights on worker 0-0, policy_version 31754 (0.00085) [2022-07-09 01:41:09,736][26022] Updated weights on worker 0-0, policy_version 31764 (0.00087) [2022-07-09 01:41:11,797][26022] Updated weights on worker 0-0, policy_version 31774 (0.00109) [2022-07-09 01:41:12,523][25689] Fps is (10 sec: 5676.9, 60 sec: 5576.9, 300 sec: 5582.2). Total num frames: 32541696. Throughput: 0: 5763.8. Samples: 32548204. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:12,523][25689] Avg episode reward: [(0, '-61.608')] [2022-07-09 01:41:13,555][26022] Updated weights on worker 0-0, policy_version 31784 (0.00089) [2022-07-09 01:41:15,270][26022] Updated weights on worker 0-0, policy_version 31794 (0.00086) [2022-07-09 01:41:17,093][26022] Updated weights on worker 0-0, policy_version 31804 (0.00085) [2022-07-09 01:41:17,602][25689] Fps is (10 sec: 5545.4, 60 sec: 5556.5, 300 sec: 5580.8). Total num frames: 32569344. Throughput: 0: 4905.2. Samples: 32565232. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:17,603][25689] Avg episode reward: [(0, '-61.761')] [2022-07-09 01:41:18,983][26022] Updated weights on worker 0-0, policy_version 31814 (0.00090) [2022-07-09 01:41:20,704][26022] Updated weights on worker 0-0, policy_version 31824 (0.00089) [2022-07-09 01:41:22,608][26022] Updated weights on worker 0-0, policy_version 31834 (0.00093) [2022-07-09 01:41:22,628][25689] Fps is (10 sec: 5573.4, 60 sec: 5588.5, 300 sec: 5581.0). Total num frames: 32598016. Throughput: 0: 5730.1. Samples: 32598948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:22,628][25689] Avg episode reward: [(0, '-62.530')] [2022-07-09 01:41:24,302][26022] Updated weights on worker 0-0, policy_version 31844 (0.00093) [2022-07-09 01:41:26,268][26022] Updated weights on worker 0-0, policy_version 31854 (0.00093) [2022-07-09 01:41:27,642][25689] Fps is (10 sec: 5711.6, 60 sec: 5587.6, 300 sec: 5579.8). Total num frames: 32626688. Throughput: 0: 5840.8. Samples: 32632790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:27,642][25689] Avg episode reward: [(0, '-62.852')] [2022-07-09 01:41:27,863][26022] Updated weights on worker 0-0, policy_version 31864 (0.00095) [2022-07-09 01:41:29,766][26022] Updated weights on worker 0-0, policy_version 31874 (0.00101) [2022-07-09 01:41:31,497][26022] Updated weights on worker 0-0, policy_version 31884 (0.00088) [2022-07-09 01:41:32,695][25689] Fps is (10 sec: 5594.4, 60 sec: 5576.2, 300 sec: 5579.3). Total num frames: 32654336. Throughput: 0: 5040.7. Samples: 32649886. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:32,696][25689] Avg episode reward: [(0, '-63.144')] [2022-07-09 01:41:33,481][26022] Updated weights on worker 0-0, policy_version 31894 (0.00085) [2022-07-09 01:41:35,294][26022] Updated weights on worker 0-0, policy_version 31904 (0.00080) [2022-07-09 01:41:36,961][26022] Updated weights on worker 0-0, policy_version 31914 (0.00086) [2022-07-09 01:41:37,707][25689] Fps is (10 sec: 5494.0, 60 sec: 5564.5, 300 sec: 5579.3). Total num frames: 32681984. Throughput: 0: 5900.7. Samples: 32683862. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:41:37,707][25689] Avg episode reward: [(0, '-61.987')] [2022-07-09 01:41:38,859][26022] Updated weights on worker 0-0, policy_version 31924 (0.01278) [2022-07-09 01:41:40,760][26022] Updated weights on worker 0-0, policy_version 31934 (0.00082) [2022-07-09 01:41:42,648][26022] Updated weights on worker 0-0, policy_version 31944 (0.00090) [2022-07-09 01:41:42,721][25689] Fps is (10 sec: 5617.5, 60 sec: 5583.3, 300 sec: 5583.0). Total num frames: 32710656. Throughput: 0: 5901.3. Samples: 32717524. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:41:42,721][25689] Avg episode reward: [(0, '-61.697')] [2022-07-09 01:41:44,343][26022] Updated weights on worker 0-0, policy_version 31954 (0.00129) [2022-07-09 01:41:46,221][26022] Updated weights on worker 0-0, policy_version 31964 (0.00093) [2022-07-09 01:41:47,724][25689] Fps is (10 sec: 5724.5, 60 sec: 5583.5, 300 sec: 5581.0). Total num frames: 32739328. Throughput: 0: 5064.7. Samples: 32734498. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:41:47,724][25689] Avg episode reward: [(0, '-61.589')] [2022-07-09 01:41:48,042][26022] Updated weights on worker 0-0, policy_version 31974 (0.00090) [2022-07-09 01:41:49,891][26022] Updated weights on worker 0-0, policy_version 31984 (0.00081) [2022-07-09 01:41:51,597][26022] Updated weights on worker 0-0, policy_version 31994 (0.00083) [2022-07-09 01:41:52,799][25689] Fps is (10 sec: 5689.9, 60 sec: 5603.6, 300 sec: 5581.4). Total num frames: 32768000. Throughput: 0: 5871.2. Samples: 32767920. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:41:52,799][25689] Avg episode reward: [(0, '-62.895')] [2022-07-09 01:41:53,387][26022] Updated weights on worker 0-0, policy_version 32004 (0.00088) [2022-07-09 01:41:55,412][26022] Updated weights on worker 0-0, policy_version 32014 (0.00090) [2022-07-09 01:41:57,123][26022] Updated weights on worker 0-0, policy_version 32024 (0.00079) [2022-07-09 01:41:57,811][25689] Fps is (10 sec: 5583.4, 60 sec: 5556.7, 300 sec: 5577.8). Total num frames: 32795648. Throughput: 0: 5863.4. Samples: 32801740. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:41:57,811][25689] Avg episode reward: [(0, '-62.896')] [2022-07-09 01:41:59,012][26022] Updated weights on worker 0-0, policy_version 32034 (0.00093) [2022-07-09 01:42:00,612][26022] Updated weights on worker 0-0, policy_version 32044 (0.00090) [2022-07-09 01:42:02,903][25689] Fps is (10 sec: 5371.2, 60 sec: 5587.1, 300 sec: 5577.7). Total num frames: 32822272. Throughput: 0: 5021.1. Samples: 32818862. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:42:02,904][25689] Avg episode reward: [(0, '-63.430')] [2022-07-09 01:42:03,165][26022] Updated weights on worker 0-0, policy_version 32054 (0.00087) [2022-07-09 01:42:04,963][26022] Updated weights on worker 0-0, policy_version 32064 (0.00092) [2022-07-09 01:42:06,567][26022] Updated weights on worker 0-0, policy_version 32074 (0.00086) [2022-07-09 01:42:07,958][25689] Fps is (10 sec: 5449.2, 60 sec: 5584.3, 300 sec: 5582.0). Total num frames: 32850944. Throughput: 0: 5729.5. Samples: 32850432. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:42:07,959][25689] Avg episode reward: [(0, '-63.476')] [2022-07-09 01:42:08,615][26022] Updated weights on worker 0-0, policy_version 32084 (0.00089) [2022-07-09 01:42:10,289][26022] Updated weights on worker 0-0, policy_version 32094 (0.00086) [2022-07-09 01:42:12,267][26022] Updated weights on worker 0-0, policy_version 32104 (0.00090) [2022-07-09 01:42:13,054][25689] Fps is (10 sec: 5649.2, 60 sec: 5582.6, 300 sec: 5580.6). Total num frames: 32879616. Throughput: 0: 5738.5. Samples: 32884154. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:42:13,054][25689] Avg episode reward: [(0, '-64.398')] [2022-07-09 01:42:13,758][26022] Updated weights on worker 0-0, policy_version 32114 (0.00082) [2022-07-09 01:42:15,901][26022] Updated weights on worker 0-0, policy_version 32124 (0.00099) [2022-07-09 01:42:16,902][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:42:16,909][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000032130_32901120.pth [2022-07-09 01:42:16,910][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000030165_30888960.pth [2022-07-09 01:42:17,611][26022] Updated weights on worker 0-0, policy_version 32134 (0.00084) [2022-07-09 01:42:18,080][25689] Fps is (10 sec: 5564.5, 60 sec: 5587.5, 300 sec: 5580.6). Total num frames: 32907264. Throughput: 0: 4907.7. Samples: 32901214. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2022-07-09 01:42:18,080][25689] Avg episode reward: [(0, '-64.149')] [2022-07-09 01:42:19,492][26022] Updated weights on worker 0-0, policy_version 32144 (0.00085) [2022-07-09 01:42:21,316][26022] Updated weights on worker 0-0, policy_version 32154 (0.00087) [2022-07-09 01:42:23,084][25689] Fps is (10 sec: 5513.2, 60 sec: 5572.6, 300 sec: 5574.1). Total num frames: 32934912. Throughput: 0: 5731.4. Samples: 32934524. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:23,085][25689] Avg episode reward: [(0, '-63.352')] [2022-07-09 01:42:23,160][26022] Updated weights on worker 0-0, policy_version 32164 (0.00091) [2022-07-09 01:42:25,012][26022] Updated weights on worker 0-0, policy_version 32174 (0.00086) [2022-07-09 01:42:26,935][26022] Updated weights on worker 0-0, policy_version 32184 (0.00078) [2022-07-09 01:42:28,104][25689] Fps is (10 sec: 5720.6, 60 sec: 5589.0, 300 sec: 5581.5). Total num frames: 32964608. Throughput: 0: 5867.4. Samples: 32968634. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:28,105][25689] Avg episode reward: [(0, '-62.360')] [2022-07-09 01:42:28,452][26022] Updated weights on worker 0-0, policy_version 32194 (0.00094) [2022-07-09 01:42:30,643][26022] Updated weights on worker 0-0, policy_version 32204 (0.00097) [2022-07-09 01:42:32,121][26022] Updated weights on worker 0-0, policy_version 32214 (0.00086) [2022-07-09 01:42:33,220][25689] Fps is (10 sec: 5556.5, 60 sec: 5566.3, 300 sec: 5576.8). Total num frames: 32991232. Throughput: 0: 5020.0. Samples: 32985386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:33,220][25689] Avg episode reward: [(0, '-62.380')] [2022-07-09 01:42:34,111][26022] Updated weights on worker 0-0, policy_version 32224 (0.00084) [2022-07-09 01:42:35,690][26022] Updated weights on worker 0-0, policy_version 32234 (0.00084) [2022-07-09 01:42:37,705][26022] Updated weights on worker 0-0, policy_version 32244 (0.00084) [2022-07-09 01:42:38,282][25689] Fps is (10 sec: 5533.3, 60 sec: 5595.4, 300 sec: 5583.3). Total num frames: 33020928. Throughput: 0: 5832.1. Samples: 33019038. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:38,283][25689] Avg episode reward: [(0, '-62.660')] [2022-07-09 01:42:39,509][26022] Updated weights on worker 0-0, policy_version 32254 (0.00085) [2022-07-09 01:42:41,193][26022] Updated weights on worker 0-0, policy_version 32264 (0.00082) [2022-07-09 01:42:42,973][26022] Updated weights on worker 0-0, policy_version 32274 (0.00083) [2022-07-09 01:42:43,327][25689] Fps is (10 sec: 5875.9, 60 sec: 5609.4, 300 sec: 5582.8). Total num frames: 33050624. Throughput: 0: 5859.2. Samples: 33053136. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:43,328][25689] Avg episode reward: [(0, '-61.408')] [2022-07-09 01:42:44,782][26022] Updated weights on worker 0-0, policy_version 32284 (0.00356) [2022-07-09 01:42:46,525][26022] Updated weights on worker 0-0, policy_version 32294 (0.00595) [2022-07-09 01:42:48,339][25689] Fps is (10 sec: 5600.2, 60 sec: 5574.8, 300 sec: 5578.0). Total num frames: 33077248. Throughput: 0: 5859.0. Samples: 33087192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:48,340][25689] Avg episode reward: [(0, '-61.489')] [2022-07-09 01:42:48,549][26022] Updated weights on worker 0-0, policy_version 32304 (0.00084) [2022-07-09 01:42:50,430][26022] Updated weights on worker 0-0, policy_version 32314 (0.00100) [2022-07-09 01:42:51,901][26022] Updated weights on worker 0-0, policy_version 32324 (0.00089) [2022-07-09 01:42:53,381][25689] Fps is (10 sec: 5499.9, 60 sec: 5577.9, 300 sec: 5578.3). Total num frames: 33105920. Throughput: 0: 5886.9. Samples: 33104076. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:53,382][25689] Avg episode reward: [(0, '-61.737')] [2022-07-09 01:42:54,083][26022] Updated weights on worker 0-0, policy_version 32334 (0.00093) [2022-07-09 01:42:55,758][26022] Updated weights on worker 0-0, policy_version 32344 (0.00082) [2022-07-09 01:42:57,542][26022] Updated weights on worker 0-0, policy_version 32354 (0.00095) [2022-07-09 01:42:58,388][25689] Fps is (10 sec: 5808.4, 60 sec: 5612.1, 300 sec: 5582.2). Total num frames: 33135616. Throughput: 0: 5921.6. Samples: 33138096. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 01:42:58,389][25689] Avg episode reward: [(0, '-62.665')] [2022-07-09 01:42:59,422][26022] Updated weights on worker 0-0, policy_version 32364 (0.00082) [2022-07-09 01:43:01,047][26022] Updated weights on worker 0-0, policy_version 32374 (0.00086) [2022-07-09 01:43:03,411][25689] Fps is (10 sec: 5411.3, 60 sec: 5584.8, 300 sec: 5578.7). Total num frames: 33160192. Throughput: 0: 5818.9. Samples: 33170000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:03,411][25689] Avg episode reward: [(0, '-63.298')] [2022-07-09 01:43:03,546][26022] Updated weights on worker 0-0, policy_version 32384 (0.00091) [2022-07-09 01:43:04,993][26022] Updated weights on worker 0-0, policy_version 32394 (0.00059) [2022-07-09 01:43:07,101][26022] Updated weights on worker 0-0, policy_version 32404 (0.00076) [2022-07-09 01:43:08,425][25689] Fps is (10 sec: 5305.0, 60 sec: 5588.5, 300 sec: 5573.2). Total num frames: 33188864. Throughput: 0: 4965.9. Samples: 33186938. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:08,426][25689] Avg episode reward: [(0, '-62.465')] [2022-07-09 01:43:08,928][26022] Updated weights on worker 0-0, policy_version 32414 (0.00090) [2022-07-09 01:43:10,625][26022] Updated weights on worker 0-0, policy_version 32424 (0.00097) [2022-07-09 01:43:12,535][26022] Updated weights on worker 0-0, policy_version 32434 (0.00087) [2022-07-09 01:43:13,541][25689] Fps is (10 sec: 5660.6, 60 sec: 5586.6, 300 sec: 5578.7). Total num frames: 33217536. Throughput: 0: 5802.3. Samples: 33221048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:13,542][25689] Avg episode reward: [(0, '-63.010')] [2022-07-09 01:43:14,201][26022] Updated weights on worker 0-0, policy_version 32444 (0.00084) [2022-07-09 01:43:15,909][26022] Updated weights on worker 0-0, policy_version 32454 (0.00093) [2022-07-09 01:43:17,910][26022] Updated weights on worker 0-0, policy_version 32464 (0.00092) [2022-07-09 01:43:18,562][25689] Fps is (10 sec: 5657.4, 60 sec: 5604.1, 300 sec: 5576.5). Total num frames: 33246208. Throughput: 0: 5801.7. Samples: 33255136. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:18,562][25689] Avg episode reward: [(0, '-63.010')] [2022-07-09 01:43:19,612][26022] Updated weights on worker 0-0, policy_version 32474 (0.00093) [2022-07-09 01:43:21,588][26022] Updated weights on worker 0-0, policy_version 32484 (0.00084) [2022-07-09 01:43:23,255][26022] Updated weights on worker 0-0, policy_version 32494 (0.00084) [2022-07-09 01:43:23,605][25689] Fps is (10 sec: 5799.8, 60 sec: 5634.3, 300 sec: 5582.6). Total num frames: 33275904. Throughput: 0: 5044.6. Samples: 33271872. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:23,605][25689] Avg episode reward: [(0, '-62.346')] [2022-07-09 01:43:25,232][26022] Updated weights on worker 0-0, policy_version 32504 (0.00086) [2022-07-09 01:43:26,957][26022] Updated weights on worker 0-0, policy_version 32514 (0.00080) [2022-07-09 01:43:28,615][25689] Fps is (10 sec: 5602.0, 60 sec: 5584.4, 300 sec: 5577.9). Total num frames: 33302528. Throughput: 0: 5886.7. Samples: 33305790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:28,616][25689] Avg episode reward: [(0, '-62.201')] [2022-07-09 01:43:28,890][26022] Updated weights on worker 0-0, policy_version 32524 (0.00095) [2022-07-09 01:43:30,567][26022] Updated weights on worker 0-0, policy_version 32534 (0.00097) [2022-07-09 01:43:32,606][26022] Updated weights on worker 0-0, policy_version 32544 (0.00095) [2022-07-09 01:43:33,736][25689] Fps is (10 sec: 5660.2, 60 sec: 5651.6, 300 sec: 5579.3). Total num frames: 33333248. Throughput: 0: 5883.7. Samples: 33339868. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:33,736][25689] Avg episode reward: [(0, '-62.361')] [2022-07-09 01:43:34,268][26022] Updated weights on worker 0-0, policy_version 32554 (0.00083) [2022-07-09 01:43:36,063][26022] Updated weights on worker 0-0, policy_version 32564 (0.00085) [2022-07-09 01:43:37,818][26022] Updated weights on worker 0-0, policy_version 32574 (0.00079) [2022-07-09 01:43:38,760][25689] Fps is (10 sec: 5753.5, 60 sec: 5621.4, 300 sec: 5583.4). Total num frames: 33360896. Throughput: 0: 5037.7. Samples: 33356890. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:38,761][25689] Avg episode reward: [(0, '-61.977')] [2022-07-09 01:43:39,725][26022] Updated weights on worker 0-0, policy_version 32584 (0.00080) [2022-07-09 01:43:41,420][26022] Updated weights on worker 0-0, policy_version 32594 (0.00090) [2022-07-09 01:43:43,347][26022] Updated weights on worker 0-0, policy_version 32604 (0.00092) [2022-07-09 01:43:43,774][25689] Fps is (10 sec: 5508.9, 60 sec: 5590.4, 300 sec: 5577.7). Total num frames: 33388544. Throughput: 0: 5898.3. Samples: 33390834. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 01:43:43,774][25689] Avg episode reward: [(0, '-62.086')] [2022-07-09 01:43:45,012][26022] Updated weights on worker 0-0, policy_version 32614 (0.00096) [2022-07-09 01:43:47,114][26022] Updated weights on worker 0-0, policy_version 32624 (0.00092) [2022-07-09 01:43:48,776][26022] Updated weights on worker 0-0, policy_version 32634 (0.00093) [2022-07-09 01:43:48,850][25689] Fps is (10 sec: 5683.4, 60 sec: 5635.2, 300 sec: 5585.1). Total num frames: 33418240. Throughput: 0: 5873.7. Samples: 33424642. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:43:48,850][25689] Avg episode reward: [(0, '-61.852')] [2022-07-09 01:43:50,704][26022] Updated weights on worker 0-0, policy_version 32644 (0.00105) [2022-07-09 01:43:52,570][26022] Updated weights on worker 0-0, policy_version 32654 (0.00097) [2022-07-09 01:43:53,977][25689] Fps is (10 sec: 5620.3, 60 sec: 5610.5, 300 sec: 5580.9). Total num frames: 33445888. Throughput: 0: 4978.8. Samples: 33440642. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:43:53,977][25689] Avg episode reward: [(0, '-61.918')] [2022-07-09 01:43:54,439][26022] Updated weights on worker 0-0, policy_version 32664 (0.00086) [2022-07-09 01:43:56,215][26022] Updated weights on worker 0-0, policy_version 32674 (0.00089) [2022-07-09 01:43:58,059][26022] Updated weights on worker 0-0, policy_version 32684 (0.00086) [2022-07-09 01:43:59,028][25689] Fps is (10 sec: 5432.9, 60 sec: 5572.6, 300 sec: 5576.8). Total num frames: 33473536. Throughput: 0: 5791.6. Samples: 33474274. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:43:59,029][25689] Avg episode reward: [(0, '-61.861')] [2022-07-09 01:43:59,662][26022] Updated weights on worker 0-0, policy_version 32694 (0.00091) [2022-07-09 01:44:01,887][26022] Updated weights on worker 0-0, policy_version 32704 (0.00491) [2022-07-09 01:44:03,611][26022] Updated weights on worker 0-0, policy_version 32714 (0.00087) [2022-07-09 01:44:04,127][25689] Fps is (10 sec: 5346.7, 60 sec: 5599.3, 300 sec: 5578.5). Total num frames: 33500160. Throughput: 0: 5649.2. Samples: 33505816. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:44:04,128][25689] Avg episode reward: [(0, '-62.057')] [2022-07-09 01:44:05,803][26022] Updated weights on worker 0-0, policy_version 32724 (0.00087) [2022-07-09 01:44:07,288][26022] Updated weights on worker 0-0, policy_version 32734 (0.00087) [2022-07-09 01:44:09,142][25689] Fps is (10 sec: 5365.9, 60 sec: 5582.4, 300 sec: 5579.7). Total num frames: 33527808. Throughput: 0: 4830.1. Samples: 33522656. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:44:09,143][25689] Avg episode reward: [(0, '-60.918')] [2022-07-09 01:44:09,344][26022] Updated weights on worker 0-0, policy_version 32744 (0.00089) [2022-07-09 01:44:11,347][26022] Updated weights on worker 0-0, policy_version 32754 (0.00085) [2022-07-09 01:44:12,869][26022] Updated weights on worker 0-0, policy_version 32764 (0.00085) [2022-07-09 01:44:14,203][25689] Fps is (10 sec: 5691.3, 60 sec: 5604.3, 300 sec: 5586.2). Total num frames: 33557504. Throughput: 0: 5732.9. Samples: 33556598. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:44:14,203][25689] Avg episode reward: [(0, '-60.709')] [2022-07-09 01:44:14,796][26022] Updated weights on worker 0-0, policy_version 32774 (0.00080) [2022-07-09 01:44:16,470][26022] Updated weights on worker 0-0, policy_version 32784 (0.00099) [2022-07-09 01:44:17,050][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:44:17,059][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000032787_33573888.pth [2022-07-09 01:44:17,059][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000030822_31561728.pth [2022-07-09 01:44:18,471][26022] Updated weights on worker 0-0, policy_version 32794 (0.00091) [2022-07-09 01:44:19,227][25689] Fps is (10 sec: 5686.2, 60 sec: 5587.1, 300 sec: 5582.6). Total num frames: 33585152. Throughput: 0: 5760.7. Samples: 33590634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:44:19,227][25689] Avg episode reward: [(0, '-60.582')] [2022-07-09 01:44:20,279][26022] Updated weights on worker 0-0, policy_version 32804 (0.00088) [2022-07-09 01:44:22,185][26022] Updated weights on worker 0-0, policy_version 32814 (0.00086) [2022-07-09 01:44:23,727][26022] Updated weights on worker 0-0, policy_version 32824 (0.00084) [2022-07-09 01:44:24,254][25689] Fps is (10 sec: 5501.6, 60 sec: 5554.9, 300 sec: 5582.2). Total num frames: 33612800. Throughput: 0: 5030.7. Samples: 33607068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 01:44:24,255][25689] Avg episode reward: [(0, '-60.254')] [2022-07-09 01:44:25,845][26022] Updated weights on worker 0-0, policy_version 32834 (0.00087) [2022-07-09 01:44:27,663][26022] Updated weights on worker 0-0, policy_version 32844 (0.00086) [2022-07-09 01:44:29,263][25689] Fps is (10 sec: 5509.4, 60 sec: 5571.8, 300 sec: 5583.3). Total num frames: 33640448. Throughput: 0: 5883.9. Samples: 33641048. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:29,264][25689] Avg episode reward: [(0, '-60.070')] [2022-07-09 01:44:29,407][26022] Updated weights on worker 0-0, policy_version 32854 (0.00083) [2022-07-09 01:44:31,321][26022] Updated weights on worker 0-0, policy_version 32864 (0.00057) [2022-07-09 01:44:33,080][26022] Updated weights on worker 0-0, policy_version 32874 (0.00087) [2022-07-09 01:44:34,320][25689] Fps is (10 sec: 5696.5, 60 sec: 5560.8, 300 sec: 5586.0). Total num frames: 33670144. Throughput: 0: 5875.4. Samples: 33674796. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:34,321][25689] Avg episode reward: [(0, '-61.099')] [2022-07-09 01:44:34,939][26022] Updated weights on worker 0-0, policy_version 32884 (0.00093) [2022-07-09 01:44:36,858][26022] Updated weights on worker 0-0, policy_version 32894 (0.00083) [2022-07-09 01:44:38,476][26022] Updated weights on worker 0-0, policy_version 32904 (0.00083) [2022-07-09 01:44:39,419][25689] Fps is (10 sec: 5646.7, 60 sec: 5554.0, 300 sec: 5584.2). Total num frames: 33697792. Throughput: 0: 4999.6. Samples: 33691586. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:39,419][25689] Avg episode reward: [(0, '-61.270')] [2022-07-09 01:44:40,405][26022] Updated weights on worker 0-0, policy_version 32914 (0.00085) [2022-07-09 01:44:42,207][26022] Updated weights on worker 0-0, policy_version 32924 (0.00090) [2022-07-09 01:44:44,075][26022] Updated weights on worker 0-0, policy_version 32934 (0.00092) [2022-07-09 01:44:44,448][25689] Fps is (10 sec: 5662.2, 60 sec: 5586.3, 300 sec: 5587.2). Total num frames: 33727488. Throughput: 0: 5861.3. Samples: 33725432. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:44,448][25689] Avg episode reward: [(0, '-61.482')] [2022-07-09 01:44:45,870][26022] Updated weights on worker 0-0, policy_version 32944 (0.00101) [2022-07-09 01:44:47,635][26022] Updated weights on worker 0-0, policy_version 32954 (0.00093) [2022-07-09 01:44:49,318][26022] Updated weights on worker 0-0, policy_version 32964 (0.00090) [2022-07-09 01:44:49,451][25689] Fps is (10 sec: 5716.2, 60 sec: 5559.3, 300 sec: 5585.7). Total num frames: 33755136. Throughput: 0: 5858.3. Samples: 33759310. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:49,451][25689] Avg episode reward: [(0, '-61.035')] [2022-07-09 01:44:51,217][26022] Updated weights on worker 0-0, policy_version 32974 (0.00084) [2022-07-09 01:44:53,342][26022] Updated weights on worker 0-0, policy_version 32984 (0.00087) [2022-07-09 01:44:54,554][25689] Fps is (10 sec: 5471.4, 60 sec: 5561.4, 300 sec: 5584.8). Total num frames: 33782784. Throughput: 0: 5841.2. Samples: 33792986. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:54,555][25689] Avg episode reward: [(0, '-60.910')] [2022-07-09 01:44:54,952][26022] Updated weights on worker 0-0, policy_version 32994 (0.00084) [2022-07-09 01:44:56,807][26022] Updated weights on worker 0-0, policy_version 33004 (0.00093) [2022-07-09 01:44:58,605][26022] Updated weights on worker 0-0, policy_version 33014 (0.00085) [2022-07-09 01:44:59,575][25689] Fps is (10 sec: 5562.6, 60 sec: 5581.1, 300 sec: 5585.4). Total num frames: 33811456. Throughput: 0: 5873.5. Samples: 33809976. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:44:59,576][25689] Avg episode reward: [(0, '-60.373')] [2022-07-09 01:45:00,278][26022] Updated weights on worker 0-0, policy_version 33024 (0.00084) [2022-07-09 01:45:02,620][26022] Updated weights on worker 0-0, policy_version 33034 (0.00090) [2022-07-09 01:45:04,271][26022] Updated weights on worker 0-0, policy_version 33044 (0.00090) [2022-07-09 01:45:04,577][25689] Fps is (10 sec: 5517.0, 60 sec: 5590.1, 300 sec: 5585.4). Total num frames: 33838080. Throughput: 0: 5787.5. Samples: 33841930. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:45:04,578][25689] Avg episode reward: [(0, '-60.406')] [2022-07-09 01:45:06,275][26022] Updated weights on worker 0-0, policy_version 33054 (0.00083) [2022-07-09 01:45:07,965][26022] Updated weights on worker 0-0, policy_version 33064 (0.00478) [2022-07-09 01:45:09,665][25689] Fps is (10 sec: 5480.2, 60 sec: 5600.2, 300 sec: 5585.4). Total num frames: 33866752. Throughput: 0: 5775.6. Samples: 33876062. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 01:45:09,667][25689] Avg episode reward: [(0, '-60.068')] [2022-07-09 01:45:09,843][26022] Updated weights on worker 0-0, policy_version 33074 (0.00088) [2022-07-09 01:45:11,460][26022] Updated weights on worker 0-0, policy_version 33084 (0.00086) [2022-07-09 01:45:13,426][26022] Updated weights on worker 0-0, policy_version 33094 (0.00082) [2022-07-09 01:45:14,774][25689] Fps is (10 sec: 5623.9, 60 sec: 5578.9, 300 sec: 5584.2). Total num frames: 33895424. Throughput: 0: 4939.7. Samples: 33892864. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:14,774][25689] Avg episode reward: [(0, '-60.676')] [2022-07-09 01:45:15,169][26022] Updated weights on worker 0-0, policy_version 33104 (0.00092) [2022-07-09 01:45:17,098][26022] Updated weights on worker 0-0, policy_version 33114 (0.00089) [2022-07-09 01:45:18,815][26022] Updated weights on worker 0-0, policy_version 33124 (0.00090) [2022-07-09 01:45:19,793][25689] Fps is (10 sec: 5560.9, 60 sec: 5579.3, 300 sec: 5587.3). Total num frames: 33923072. Throughput: 0: 5783.0. Samples: 33926896. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:19,794][25689] Avg episode reward: [(0, '-61.512')] [2022-07-09 01:45:20,648][26022] Updated weights on worker 0-0, policy_version 33134 (0.00095) [2022-07-09 01:45:22,515][26022] Updated weights on worker 0-0, policy_version 33144 (0.00081) [2022-07-09 01:45:24,320][26022] Updated weights on worker 0-0, policy_version 33154 (0.00086) [2022-07-09 01:45:24,886][25689] Fps is (10 sec: 5569.4, 60 sec: 5590.2, 300 sec: 5585.7). Total num frames: 33951744. Throughput: 0: 5829.1. Samples: 33960312. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:24,887][25689] Avg episode reward: [(0, '-61.924')] [2022-07-09 01:45:26,044][26022] Updated weights on worker 0-0, policy_version 33164 (0.00094) [2022-07-09 01:45:27,998][26022] Updated weights on worker 0-0, policy_version 33174 (0.00097) [2022-07-09 01:45:29,789][26022] Updated weights on worker 0-0, policy_version 33184 (0.00091) [2022-07-09 01:45:29,903][25689] Fps is (10 sec: 5773.5, 60 sec: 5623.3, 300 sec: 5590.9). Total num frames: 33981440. Throughput: 0: 5005.8. Samples: 33977366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:29,903][25689] Avg episode reward: [(0, '-61.648')] [2022-07-09 01:45:31,800][26022] Updated weights on worker 0-0, policy_version 33194 (0.00081) [2022-07-09 01:45:33,318][26022] Updated weights on worker 0-0, policy_version 33204 (0.00090) [2022-07-09 01:45:35,053][25689] Fps is (10 sec: 5640.3, 60 sec: 5580.9, 300 sec: 5586.0). Total num frames: 34009088. Throughput: 0: 5840.9. Samples: 34011314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:35,053][25689] Avg episode reward: [(0, '-61.088')] [2022-07-09 01:45:35,369][26022] Updated weights on worker 0-0, policy_version 33214 (0.00088) [2022-07-09 01:45:37,025][26022] Updated weights on worker 0-0, policy_version 33224 (0.00084) [2022-07-09 01:45:38,913][26022] Updated weights on worker 0-0, policy_version 33234 (0.00088) [2022-07-09 01:45:40,117][25689] Fps is (10 sec: 5514.1, 60 sec: 5601.0, 300 sec: 5588.9). Total num frames: 34037760. Throughput: 0: 5814.2. Samples: 34045060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:40,117][25689] Avg episode reward: [(0, '-61.113')] [2022-07-09 01:45:40,624][26022] Updated weights on worker 0-0, policy_version 33244 (0.00099) [2022-07-09 01:45:42,533][26022] Updated weights on worker 0-0, policy_version 33254 (0.00087) [2022-07-09 01:45:44,342][26022] Updated weights on worker 0-0, policy_version 33264 (0.00082) [2022-07-09 01:45:45,147][25689] Fps is (10 sec: 5782.2, 60 sec: 5600.8, 300 sec: 5591.8). Total num frames: 34067456. Throughput: 0: 5022.5. Samples: 34062074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:45,148][25689] Avg episode reward: [(0, '-60.187')] [2022-07-09 01:45:46,254][26022] Updated weights on worker 0-0, policy_version 33274 (0.00086) [2022-07-09 01:45:47,759][26022] Updated weights on worker 0-0, policy_version 33284 (0.00083) [2022-07-09 01:45:49,911][26022] Updated weights on worker 0-0, policy_version 33294 (0.00094) [2022-07-09 01:45:50,197][25689] Fps is (10 sec: 5688.4, 60 sec: 5596.5, 300 sec: 5592.9). Total num frames: 34095104. Throughput: 0: 5848.3. Samples: 34096054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 01:45:50,198][25689] Avg episode reward: [(0, '-59.204')] [2022-07-09 01:45:51,565][26022] Updated weights on worker 0-0, policy_version 33304 (0.00093) [2022-07-09 01:45:53,578][26022] Updated weights on worker 0-0, policy_version 33314 (0.00105) [2022-07-09 01:45:55,287][25689] Fps is (10 sec: 5453.3, 60 sec: 5597.8, 300 sec: 5581.9). Total num frames: 34122752. Throughput: 0: 5832.1. Samples: 34129322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:45:55,288][25689] Avg episode reward: [(0, '-58.966')] [2022-07-09 01:45:55,384][26022] Updated weights on worker 0-0, policy_version 33324 (0.00086) [2022-07-09 01:45:57,050][26022] Updated weights on worker 0-0, policy_version 33334 (0.00099) [2022-07-09 01:45:58,943][26022] Updated weights on worker 0-0, policy_version 33344 (0.00093) [2022-07-09 01:46:00,295][25689] Fps is (10 sec: 5679.2, 60 sec: 5615.9, 300 sec: 5600.0). Total num frames: 34152448. Throughput: 0: 5018.8. Samples: 34146330. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:00,295][25689] Avg episode reward: [(0, '-59.458')] [2022-07-09 01:46:00,663][26022] Updated weights on worker 0-0, policy_version 33354 (0.00092) [2022-07-09 01:46:02,842][26022] Updated weights on worker 0-0, policy_version 33364 (0.00089) [2022-07-09 01:46:04,774][26022] Updated weights on worker 0-0, policy_version 33374 (0.00083) [2022-07-09 01:46:05,331][25689] Fps is (10 sec: 5403.8, 60 sec: 5579.0, 300 sec: 5586.1). Total num frames: 34177024. Throughput: 0: 5755.6. Samples: 34178240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:05,333][25689] Avg episode reward: [(0, '-58.801')] [2022-07-09 01:46:06,579][26022] Updated weights on worker 0-0, policy_version 33384 (0.00646) [2022-07-09 01:46:08,351][26022] Updated weights on worker 0-0, policy_version 33394 (0.00097) [2022-07-09 01:46:10,216][26022] Updated weights on worker 0-0, policy_version 33404 (0.00086) [2022-07-09 01:46:10,377][25689] Fps is (10 sec: 5281.2, 60 sec: 5582.8, 300 sec: 5586.6). Total num frames: 34205696. Throughput: 0: 5740.4. Samples: 34211894. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:10,380][25689] Avg episode reward: [(0, '-58.263')] [2022-07-09 01:46:12,078][26022] Updated weights on worker 0-0, policy_version 33414 (0.00093) [2022-07-09 01:46:13,836][26022] Updated weights on worker 0-0, policy_version 33424 (0.00086) [2022-07-09 01:46:15,439][25689] Fps is (10 sec: 5672.9, 60 sec: 5587.1, 300 sec: 5590.4). Total num frames: 34234368. Throughput: 0: 4941.2. Samples: 34228896. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:15,440][25689] Avg episode reward: [(0, '-58.127')] [2022-07-09 01:46:15,798][26022] Updated weights on worker 0-0, policy_version 33434 (0.00083) [2022-07-09 01:46:17,062][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:46:17,074][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000033441_34243584.pth [2022-07-09 01:46:17,074][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000031475_32230400.pth [2022-07-09 01:46:17,419][26022] Updated weights on worker 0-0, policy_version 33444 (0.00081) [2022-07-09 01:46:19,376][26022] Updated weights on worker 0-0, policy_version 33454 (0.00085) [2022-07-09 01:46:20,507][25689] Fps is (10 sec: 5661.1, 60 sec: 5599.5, 300 sec: 5589.6). Total num frames: 34263040. Throughput: 0: 5767.3. Samples: 34262900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:20,509][25689] Avg episode reward: [(0, '-58.597')] [2022-07-09 01:46:21,035][26022] Updated weights on worker 0-0, policy_version 33464 (0.00088) [2022-07-09 01:46:23,025][26022] Updated weights on worker 0-0, policy_version 33474 (0.00084) [2022-07-09 01:46:24,731][26022] Updated weights on worker 0-0, policy_version 33484 (0.00085) [2022-07-09 01:46:25,568][25689] Fps is (10 sec: 5661.4, 60 sec: 5602.5, 300 sec: 5588.7). Total num frames: 34291712. Throughput: 0: 5852.3. Samples: 34296674. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:25,568][25689] Avg episode reward: [(0, '-59.965')] [2022-07-09 01:46:26,642][26022] Updated weights on worker 0-0, policy_version 33494 (0.00081) [2022-07-09 01:46:28,293][26022] Updated weights on worker 0-0, policy_version 33504 (0.00083) [2022-07-09 01:46:30,322][26022] Updated weights on worker 0-0, policy_version 33514 (0.00084) [2022-07-09 01:46:30,628][25689] Fps is (10 sec: 5665.6, 60 sec: 5581.6, 300 sec: 5592.0). Total num frames: 34320384. Throughput: 0: 5030.5. Samples: 34313766. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:30,629][25689] Avg episode reward: [(0, '-60.094')] [2022-07-09 01:46:31,987][26022] Updated weights on worker 0-0, policy_version 33524 (0.00094) [2022-07-09 01:46:33,871][26022] Updated weights on worker 0-0, policy_version 33534 (0.00088) [2022-07-09 01:46:35,631][26022] Updated weights on worker 0-0, policy_version 33544 (0.00867) [2022-07-09 01:46:35,717][25689] Fps is (10 sec: 5650.1, 60 sec: 5604.1, 300 sec: 5594.0). Total num frames: 34349056. Throughput: 0: 5858.5. Samples: 34347696. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 01:46:35,718][25689] Avg episode reward: [(0, '-59.979')] [2022-07-09 01:46:37,571][26022] Updated weights on worker 0-0, policy_version 33554 (0.00086) [2022-07-09 01:46:39,279][26022] Updated weights on worker 0-0, policy_version 33564 (0.00090) [2022-07-09 01:46:40,784][25689] Fps is (10 sec: 5646.1, 60 sec: 5603.8, 300 sec: 5593.0). Total num frames: 34377728. Throughput: 0: 5850.7. Samples: 34381538. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:46:40,785][25689] Avg episode reward: [(0, '-60.978')] [2022-07-09 01:46:41,159][26022] Updated weights on worker 0-0, policy_version 33574 (0.00080) [2022-07-09 01:46:42,876][26022] Updated weights on worker 0-0, policy_version 33584 (0.00095) [2022-07-09 01:46:44,751][26022] Updated weights on worker 0-0, policy_version 33594 (0.00090) [2022-07-09 01:46:45,791][25689] Fps is (10 sec: 5590.7, 60 sec: 5572.2, 300 sec: 5589.5). Total num frames: 34405376. Throughput: 0: 5884.3. Samples: 34415674. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:46:45,792][25689] Avg episode reward: [(0, '-61.571')] [2022-07-09 01:46:46,462][26022] Updated weights on worker 0-0, policy_version 33604 (0.00083) [2022-07-09 01:46:48,227][26022] Updated weights on worker 0-0, policy_version 33614 (0.00085) [2022-07-09 01:46:50,145][26022] Updated weights on worker 0-0, policy_version 33624 (0.00082) [2022-07-09 01:46:50,820][25689] Fps is (10 sec: 5816.1, 60 sec: 5624.8, 300 sec: 5597.2). Total num frames: 34436096. Throughput: 0: 5900.5. Samples: 34432908. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:46:50,821][25689] Avg episode reward: [(0, '-61.858')] [2022-07-09 01:46:51,868][26022] Updated weights on worker 0-0, policy_version 33634 (0.00089) [2022-07-09 01:46:53,811][26022] Updated weights on worker 0-0, policy_version 33644 (0.00089) [2022-07-09 01:46:55,506][26022] Updated weights on worker 0-0, policy_version 33654 (0.00089) [2022-07-09 01:46:55,899][25689] Fps is (10 sec: 5571.7, 60 sec: 5592.0, 300 sec: 5589.1). Total num frames: 34461696. Throughput: 0: 5911.3. Samples: 34467000. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:46:55,900][25689] Avg episode reward: [(0, '-60.923')] [2022-07-09 01:46:57,355][26022] Updated weights on worker 0-0, policy_version 33664 (0.00086) [2022-07-09 01:46:59,259][26022] Updated weights on worker 0-0, policy_version 33674 (0.00091) [2022-07-09 01:47:00,842][26022] Updated weights on worker 0-0, policy_version 33684 (0.00086) [2022-07-09 01:47:00,941][25689] Fps is (10 sec: 5564.8, 60 sec: 5605.8, 300 sec: 5603.8). Total num frames: 34492416. Throughput: 0: 5937.4. Samples: 34501214. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:47:00,941][25689] Avg episode reward: [(0, '-61.205')] [2022-07-09 01:47:03,139][26022] Updated weights on worker 0-0, policy_version 33694 (0.00087) [2022-07-09 01:47:05,100][26022] Updated weights on worker 0-0, policy_version 33704 (0.00094) [2022-07-09 01:47:05,966][25689] Fps is (10 sec: 5696.7, 60 sec: 5640.6, 300 sec: 5597.5). Total num frames: 34519040. Throughput: 0: 4977.9. Samples: 34516100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:47:05,966][25689] Avg episode reward: [(0, '-61.223')] [2022-07-09 01:47:06,572][26022] Updated weights on worker 0-0, policy_version 33714 (0.00096) [2022-07-09 01:47:08,790][26022] Updated weights on worker 0-0, policy_version 33724 (0.00094) [2022-07-09 01:47:10,213][26022] Updated weights on worker 0-0, policy_version 33734 (0.00097) [2022-07-09 01:47:10,993][25689] Fps is (10 sec: 5399.2, 60 sec: 5625.5, 300 sec: 5595.4). Total num frames: 34546688. Throughput: 0: 5803.4. Samples: 34549978. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:47:10,993][25689] Avg episode reward: [(0, '-60.920')] [2022-07-09 01:47:12,300][26022] Updated weights on worker 0-0, policy_version 33744 (0.00064) [2022-07-09 01:47:13,895][26022] Updated weights on worker 0-0, policy_version 33754 (0.00085) [2022-07-09 01:47:15,861][26022] Updated weights on worker 0-0, policy_version 33764 (0.00085) [2022-07-09 01:47:16,070][25689] Fps is (10 sec: 5675.1, 60 sec: 5640.9, 300 sec: 5601.3). Total num frames: 34576384. Throughput: 0: 5797.2. Samples: 34583932. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:47:16,070][25689] Avg episode reward: [(0, '-60.213')] [2022-07-09 01:47:17,695][26022] Updated weights on worker 0-0, policy_version 33774 (0.00089) [2022-07-09 01:47:19,569][26022] Updated weights on worker 0-0, policy_version 33784 (0.00096) [2022-07-09 01:47:21,107][25689] Fps is (10 sec: 5568.5, 60 sec: 5610.0, 300 sec: 5597.2). Total num frames: 34603008. Throughput: 0: 4930.2. Samples: 34600636. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 01:47:21,107][25689] Avg episode reward: [(0, '-59.979')] [2022-07-09 01:47:21,260][26022] Updated weights on worker 0-0, policy_version 33794 (0.00090) [2022-07-09 01:47:23,404][26022] Updated weights on worker 0-0, policy_version 33804 (0.00085) [2022-07-09 01:47:24,962][26022] Updated weights on worker 0-0, policy_version 33814 (0.00088) [2022-07-09 01:47:26,121][25689] Fps is (10 sec: 5298.0, 60 sec: 5580.6, 300 sec: 5587.0). Total num frames: 34629632. Throughput: 0: 5844.8. Samples: 34633902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:26,123][25689] Avg episode reward: [(0, '-59.768')] [2022-07-09 01:47:26,949][26022] Updated weights on worker 0-0, policy_version 33824 (0.00053) [2022-07-09 01:47:28,962][26022] Updated weights on worker 0-0, policy_version 33834 (0.00086) [2022-07-09 01:47:30,503][26022] Updated weights on worker 0-0, policy_version 33844 (0.00095) [2022-07-09 01:47:31,128][25689] Fps is (10 sec: 5722.3, 60 sec: 5619.3, 300 sec: 5602.8). Total num frames: 34660352. Throughput: 0: 5845.8. Samples: 34667684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:31,128][25689] Avg episode reward: [(0, '-59.875')] [2022-07-09 01:47:32,443][26022] Updated weights on worker 0-0, policy_version 33854 (0.00087) [2022-07-09 01:47:33,952][26022] Updated weights on worker 0-0, policy_version 33864 (0.00090) [2022-07-09 01:47:36,057][26022] Updated weights on worker 0-0, policy_version 33874 (0.00089) [2022-07-09 01:47:36,218][25689] Fps is (10 sec: 5780.7, 60 sec: 5602.3, 300 sec: 5595.4). Total num frames: 34688000. Throughput: 0: 4998.0. Samples: 34684630. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:36,218][25689] Avg episode reward: [(0, '-59.614')] [2022-07-09 01:47:37,807][26022] Updated weights on worker 0-0, policy_version 33884 (0.00084) [2022-07-09 01:47:39,632][26022] Updated weights on worker 0-0, policy_version 33894 (0.00979) [2022-07-09 01:47:41,240][25689] Fps is (10 sec: 5468.2, 60 sec: 5589.6, 300 sec: 5589.0). Total num frames: 34715648. Throughput: 0: 5855.7. Samples: 34718530. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:41,240][25689] Avg episode reward: [(0, '-59.834')] [2022-07-09 01:47:41,483][26022] Updated weights on worker 0-0, policy_version 33904 (0.00087) [2022-07-09 01:47:43,311][26022] Updated weights on worker 0-0, policy_version 33914 (0.00092) [2022-07-09 01:47:44,907][26022] Updated weights on worker 0-0, policy_version 33924 (0.00090) [2022-07-09 01:47:46,279][25689] Fps is (10 sec: 5597.8, 60 sec: 5603.5, 300 sec: 5595.3). Total num frames: 34744320. Throughput: 0: 5891.0. Samples: 34752654. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:46,280][25689] Avg episode reward: [(0, '-61.830')] [2022-07-09 01:47:46,828][26022] Updated weights on worker 0-0, policy_version 33934 (0.00101) [2022-07-09 01:47:48,618][26022] Updated weights on worker 0-0, policy_version 33944 (0.00091) [2022-07-09 01:47:50,370][26022] Updated weights on worker 0-0, policy_version 33954 (0.00082) [2022-07-09 01:47:51,306][25689] Fps is (10 sec: 5798.4, 60 sec: 5586.7, 300 sec: 5599.0). Total num frames: 34774016. Throughput: 0: 5061.5. Samples: 34769816. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:51,307][25689] Avg episode reward: [(0, '-61.058')] [2022-07-09 01:47:52,301][26022] Updated weights on worker 0-0, policy_version 33964 (0.00086) [2022-07-09 01:47:53,900][26022] Updated weights on worker 0-0, policy_version 33974 (0.00082) [2022-07-09 01:47:55,833][26022] Updated weights on worker 0-0, policy_version 33984 (0.00087) [2022-07-09 01:47:56,392][25689] Fps is (10 sec: 5670.4, 60 sec: 5620.0, 300 sec: 5590.7). Total num frames: 34801664. Throughput: 0: 5893.3. Samples: 34803522. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:47:56,392][25689] Avg episode reward: [(0, '-62.629')] [2022-07-09 01:47:57,812][26022] Updated weights on worker 0-0, policy_version 33994 (0.00090) [2022-07-09 01:47:59,523][26022] Updated weights on worker 0-0, policy_version 34004 (0.00092) [2022-07-09 01:48:01,468][25689] Fps is (10 sec: 5441.3, 60 sec: 5566.0, 300 sec: 5600.0). Total num frames: 34829312. Throughput: 0: 5889.4. Samples: 34837664. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:48:01,469][25689] Avg episode reward: [(0, '-62.738')] [2022-07-09 01:48:01,563][26022] Updated weights on worker 0-0, policy_version 34014 (0.00108) [2022-07-09 01:48:03,555][26022] Updated weights on worker 0-0, policy_version 34024 (0.00094) [2022-07-09 01:48:05,154][26022] Updated weights on worker 0-0, policy_version 34034 (0.00089) [2022-07-09 01:48:06,503][25689] Fps is (10 sec: 5570.2, 60 sec: 5598.9, 300 sec: 5599.6). Total num frames: 34857984. Throughput: 0: 4941.1. Samples: 34852582. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:06,503][25689] Avg episode reward: [(0, '-62.485')] [2022-07-09 01:48:07,262][26022] Updated weights on worker 0-0, policy_version 34044 (0.00090) [2022-07-09 01:48:08,905][26022] Updated weights on worker 0-0, policy_version 34054 (0.00079) [2022-07-09 01:48:10,809][26022] Updated weights on worker 0-0, policy_version 34064 (0.00090) [2022-07-09 01:48:11,524][25689] Fps is (10 sec: 5499.3, 60 sec: 5582.6, 300 sec: 5594.5). Total num frames: 34884608. Throughput: 0: 5764.8. Samples: 34886364. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:11,524][25689] Avg episode reward: [(0, '-60.896')] [2022-07-09 01:48:12,429][26022] Updated weights on worker 0-0, policy_version 34074 (0.00678) [2022-07-09 01:48:14,595][26022] Updated weights on worker 0-0, policy_version 34084 (0.00082) [2022-07-09 01:48:16,166][26022] Updated weights on worker 0-0, policy_version 34094 (0.00093) [2022-07-09 01:48:16,598][25689] Fps is (10 sec: 5578.7, 60 sec: 5582.8, 300 sec: 5596.9). Total num frames: 34914304. Throughput: 0: 5781.9. Samples: 34920354. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:16,599][25689] Avg episode reward: [(0, '-61.646')] [2022-07-09 01:48:17,108][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:48:17,125][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000034100_34918400.pth [2022-07-09 01:48:17,126][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000032130_32901120.pth [2022-07-09 01:48:18,064][26022] Updated weights on worker 0-0, policy_version 34104 (0.00085) [2022-07-09 01:48:19,845][26022] Updated weights on worker 0-0, policy_version 34114 (0.00094) [2022-07-09 01:48:21,469][26022] Updated weights on worker 0-0, policy_version 34124 (0.00089) [2022-07-09 01:48:21,607][25689] Fps is (10 sec: 5788.6, 60 sec: 5619.3, 300 sec: 5594.2). Total num frames: 34942976. Throughput: 0: 4950.3. Samples: 34937354. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:21,609][25689] Avg episode reward: [(0, '-61.191')] [2022-07-09 01:48:23,477][26022] Updated weights on worker 0-0, policy_version 34134 (0.00091) [2022-07-09 01:48:25,074][26022] Updated weights on worker 0-0, policy_version 34144 (0.00087) [2022-07-09 01:48:26,629][25689] Fps is (10 sec: 5512.7, 60 sec: 5618.5, 300 sec: 5593.9). Total num frames: 34969600. Throughput: 0: 5901.5. Samples: 34971356. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:26,629][25689] Avg episode reward: [(0, '-60.685')] [2022-07-09 01:48:27,205][26022] Updated weights on worker 0-0, policy_version 34154 (0.00088) [2022-07-09 01:48:28,914][26022] Updated weights on worker 0-0, policy_version 34164 (0.00086) [2022-07-09 01:48:30,764][26022] Updated weights on worker 0-0, policy_version 34174 (0.00087) [2022-07-09 01:48:31,655][25689] Fps is (10 sec: 5605.2, 60 sec: 5599.9, 300 sec: 5592.3). Total num frames: 34999296. Throughput: 0: 5898.3. Samples: 35005102. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:31,659][25689] Avg episode reward: [(0, '-58.958')] [2022-07-09 01:48:32,453][26022] Updated weights on worker 0-0, policy_version 34184 (0.00088) [2022-07-09 01:48:34,360][26022] Updated weights on worker 0-0, policy_version 34194 (0.00057) [2022-07-09 01:48:36,024][26022] Updated weights on worker 0-0, policy_version 34204 (0.00094) [2022-07-09 01:48:36,750][25689] Fps is (10 sec: 5665.8, 60 sec: 5599.4, 300 sec: 5591.0). Total num frames: 35026944. Throughput: 0: 5047.3. Samples: 35022066. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:36,751][25689] Avg episode reward: [(0, '-58.854')] [2022-07-09 01:48:38,193][26022] Updated weights on worker 0-0, policy_version 34214 (0.00093) [2022-07-09 01:48:39,696][26022] Updated weights on worker 0-0, policy_version 34224 (0.00087) [2022-07-09 01:48:41,779][25689] Fps is (10 sec: 5461.5, 60 sec: 5598.8, 300 sec: 5590.7). Total num frames: 35054592. Throughput: 0: 5869.0. Samples: 35055746. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:41,780][25689] Avg episode reward: [(0, '-58.880')] [2022-07-09 01:48:41,799][26022] Updated weights on worker 0-0, policy_version 34234 (0.00082) [2022-07-09 01:48:43,187][26022] Updated weights on worker 0-0, policy_version 34244 (0.00085) [2022-07-09 01:48:45,381][26022] Updated weights on worker 0-0, policy_version 34254 (0.00083) [2022-07-09 01:48:46,801][25689] Fps is (10 sec: 5807.2, 60 sec: 5634.2, 300 sec: 5595.1). Total num frames: 35085312. Throughput: 0: 5857.0. Samples: 35089504. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 01:48:46,803][25689] Avg episode reward: [(0, '-58.820')] [2022-07-09 01:48:46,913][26022] Updated weights on worker 0-0, policy_version 34264 (0.00423) [2022-07-09 01:48:48,954][26022] Updated weights on worker 0-0, policy_version 34274 (0.00084) [2022-07-09 01:48:50,872][26022] Updated weights on worker 0-0, policy_version 34284 (0.00099) [2022-07-09 01:48:51,832][25689] Fps is (10 sec: 5602.4, 60 sec: 5566.2, 300 sec: 5590.0). Total num frames: 35110912. Throughput: 0: 5016.3. Samples: 35106318. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:48:51,835][25689] Avg episode reward: [(0, '-57.851')] [2022-07-09 01:48:52,628][26022] Updated weights on worker 0-0, policy_version 34294 (0.00087) [2022-07-09 01:48:54,550][26022] Updated weights on worker 0-0, policy_version 34304 (0.00100) [2022-07-09 01:48:56,448][26022] Updated weights on worker 0-0, policy_version 34314 (0.00096) [2022-07-09 01:48:56,896][25689] Fps is (10 sec: 5376.2, 60 sec: 5585.1, 300 sec: 5593.2). Total num frames: 35139584. Throughput: 0: 5847.1. Samples: 35139860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:48:56,897][25689] Avg episode reward: [(0, '-57.475')] [2022-07-09 01:48:57,974][26022] Updated weights on worker 0-0, policy_version 34324 (0.00081) [2022-07-09 01:49:00,210][26022] Updated weights on worker 0-0, policy_version 34334 (0.00086) [2022-07-09 01:49:01,612][26022] Updated weights on worker 0-0, policy_version 34344 (0.00081) [2022-07-09 01:49:01,947][25689] Fps is (10 sec: 5770.6, 60 sec: 5621.3, 300 sec: 5604.5). Total num frames: 35169280. Throughput: 0: 5856.6. Samples: 35173858. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:01,947][25689] Avg episode reward: [(0, '-57.807')] [2022-07-09 01:49:04,046][26022] Updated weights on worker 0-0, policy_version 34354 (0.00082) [2022-07-09 01:49:05,499][26022] Updated weights on worker 0-0, policy_version 34364 (0.00085) [2022-07-09 01:49:06,951][25689] Fps is (10 sec: 5499.2, 60 sec: 5573.3, 300 sec: 5597.8). Total num frames: 35194880. Throughput: 0: 4936.9. Samples: 35188982. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:06,952][25689] Avg episode reward: [(0, '-57.809')] [2022-07-09 01:49:07,475][26022] Updated weights on worker 0-0, policy_version 34374 (0.00087) [2022-07-09 01:49:09,155][26022] Updated weights on worker 0-0, policy_version 34384 (0.00089) [2022-07-09 01:49:11,329][26022] Updated weights on worker 0-0, policy_version 34394 (0.00093) [2022-07-09 01:49:11,975][25689] Fps is (10 sec: 5514.2, 60 sec: 5623.9, 300 sec: 5598.5). Total num frames: 35224576. Throughput: 0: 5795.2. Samples: 35223048. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:11,975][25689] Avg episode reward: [(0, '-57.312')] [2022-07-09 01:49:12,844][26022] Updated weights on worker 0-0, policy_version 34404 (0.00085) [2022-07-09 01:49:14,887][26022] Updated weights on worker 0-0, policy_version 34414 (0.00080) [2022-07-09 01:49:16,264][26022] Updated weights on worker 0-0, policy_version 34424 (0.00049) [2022-07-09 01:49:17,039][25689] Fps is (10 sec: 5684.6, 60 sec: 5591.0, 300 sec: 5597.7). Total num frames: 35252224. Throughput: 0: 5811.6. Samples: 35256922. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:17,039][25689] Avg episode reward: [(0, '-57.086')] [2022-07-09 01:49:18,598][26022] Updated weights on worker 0-0, policy_version 34434 (0.00094) [2022-07-09 01:49:20,192][26022] Updated weights on worker 0-0, policy_version 34444 (0.00086) [2022-07-09 01:49:22,048][26022] Updated weights on worker 0-0, policy_version 34454 (0.00091) [2022-07-09 01:49:22,048][25689] Fps is (10 sec: 5489.3, 60 sec: 5573.9, 300 sec: 5598.1). Total num frames: 35279872. Throughput: 0: 4969.5. Samples: 35273752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:22,049][25689] Avg episode reward: [(0, '-57.658')] [2022-07-09 01:49:23,732][26022] Updated weights on worker 0-0, policy_version 34464 (0.00087) [2022-07-09 01:49:25,763][26022] Updated weights on worker 0-0, policy_version 34474 (0.00087) [2022-07-09 01:49:27,065][25689] Fps is (10 sec: 5617.5, 60 sec: 5608.4, 300 sec: 5601.4). Total num frames: 35308544. Throughput: 0: 5885.1. Samples: 35307352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:27,065][25689] Avg episode reward: [(0, '-59.227')] [2022-07-09 01:49:27,525][26022] Updated weights on worker 0-0, policy_version 34484 (0.00082) [2022-07-09 01:49:29,489][26022] Updated weights on worker 0-0, policy_version 34494 (0.00085) [2022-07-09 01:49:30,908][26022] Updated weights on worker 0-0, policy_version 34504 (0.00097) [2022-07-09 01:49:32,067][25689] Fps is (10 sec: 5723.7, 60 sec: 5593.6, 300 sec: 5599.0). Total num frames: 35337216. Throughput: 0: 5880.5. Samples: 35341200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 01:49:32,067][25689] Avg episode reward: [(0, '-59.130')] [2022-07-09 01:49:33,154][26022] Updated weights on worker 0-0, policy_version 34514 (0.00089) [2022-07-09 01:49:34,606][26022] Updated weights on worker 0-0, policy_version 34524 (0.00090) [2022-07-09 01:49:36,714][26022] Updated weights on worker 0-0, policy_version 34534 (0.00085) [2022-07-09 01:49:37,138][25689] Fps is (10 sec: 5692.8, 60 sec: 5612.8, 300 sec: 5602.9). Total num frames: 35365888. Throughput: 0: 5036.0. Samples: 35358140. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:49:37,140][25689] Avg episode reward: [(0, '-59.427')] [2022-07-09 01:49:38,298][26022] Updated weights on worker 0-0, policy_version 34544 (0.00087) [2022-07-09 01:49:40,331][26022] Updated weights on worker 0-0, policy_version 34554 (0.00094) [2022-07-09 01:49:42,160][25689] Fps is (10 sec: 5478.4, 60 sec: 5596.5, 300 sec: 5592.7). Total num frames: 35392512. Throughput: 0: 5859.4. Samples: 35391598. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:49:42,161][25689] Avg episode reward: [(0, '-59.465')] [2022-07-09 01:49:42,209][26022] Updated weights on worker 0-0, policy_version 34564 (0.00091) [2022-07-09 01:49:43,954][26022] Updated weights on worker 0-0, policy_version 34574 (0.00088) [2022-07-09 01:49:45,708][26022] Updated weights on worker 0-0, policy_version 34584 (0.00086) [2022-07-09 01:49:47,181][25689] Fps is (10 sec: 5403.9, 60 sec: 5545.7, 300 sec: 5592.4). Total num frames: 35420160. Throughput: 0: 5866.3. Samples: 35425360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:49:47,183][25689] Avg episode reward: [(0, '-59.561')] [2022-07-09 01:49:47,744][26022] Updated weights on worker 0-0, policy_version 34594 (0.00090) [2022-07-09 01:49:49,202][26022] Updated weights on worker 0-0, policy_version 34604 (0.00097) [2022-07-09 01:49:51,304][26022] Updated weights on worker 0-0, policy_version 34614 (0.00088) [2022-07-09 01:49:52,200][25689] Fps is (10 sec: 5813.6, 60 sec: 5631.6, 300 sec: 5604.3). Total num frames: 35450880. Throughput: 0: 5862.3. Samples: 35459228. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:49:52,201][25689] Avg episode reward: [(0, '-61.164')] [2022-07-09 01:49:52,875][26022] Updated weights on worker 0-0, policy_version 34624 (0.00083) [2022-07-09 01:49:54,985][26022] Updated weights on worker 0-0, policy_version 34634 (0.00096) [2022-07-09 01:49:56,815][26022] Updated weights on worker 0-0, policy_version 34644 (0.00087) [2022-07-09 01:49:57,277][25689] Fps is (10 sec: 5781.3, 60 sec: 5613.4, 300 sec: 5599.8). Total num frames: 35478528. Throughput: 0: 5853.1. Samples: 35476016. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:49:57,277][25689] Avg episode reward: [(0, '-60.124')] [2022-07-09 01:49:58,622][26022] Updated weights on worker 0-0, policy_version 34654 (0.00095) [2022-07-09 01:50:00,411][26022] Updated weights on worker 0-0, policy_version 34664 (0.00082) [2022-07-09 01:50:02,357][25689] Fps is (10 sec: 5141.9, 60 sec: 5526.0, 300 sec: 5591.5). Total num frames: 35503104. Throughput: 0: 5833.6. Samples: 35509416. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:50:02,357][25689] Avg episode reward: [(0, '-59.667')] [2022-07-09 01:50:02,645][26022] Updated weights on worker 0-0, policy_version 34674 (0.00096) [2022-07-09 01:50:04,467][26022] Updated weights on worker 0-0, policy_version 34684 (0.00083) [2022-07-09 01:50:06,328][26022] Updated weights on worker 0-0, policy_version 34694 (0.00084) [2022-07-09 01:50:07,361][25689] Fps is (10 sec: 5179.0, 60 sec: 5559.9, 300 sec: 5589.6). Total num frames: 35530752. Throughput: 0: 5725.0. Samples: 35540890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:50:07,361][25689] Avg episode reward: [(0, '-59.238')] [2022-07-09 01:50:08,002][26022] Updated weights on worker 0-0, policy_version 34704 (0.00083) [2022-07-09 01:50:10,218][26022] Updated weights on worker 0-0, policy_version 34715 (0.00087) [2022-07-09 01:50:11,727][26022] Updated weights on worker 0-0, policy_version 34725 (0.00055) [2022-07-09 01:50:12,378][25689] Fps is (10 sec: 5722.2, 60 sec: 5560.5, 300 sec: 5594.8). Total num frames: 35560448. Throughput: 0: 4883.0. Samples: 35557758. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:50:12,379][25689] Avg episode reward: [(0, '-59.193')] [2022-07-09 01:50:13,819][26022] Updated weights on worker 0-0, policy_version 34735 (0.00086) [2022-07-09 01:50:15,356][26022] Updated weights on worker 0-0, policy_version 34745 (0.00091) [2022-07-09 01:50:17,293][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:50:17,314][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000034753_35587072.pth [2022-07-09 01:50:17,319][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000032787_33573888.pth [2022-07-09 01:50:17,505][25689] Fps is (10 sec: 5652.5, 60 sec: 5554.7, 300 sec: 5592.7). Total num frames: 35588096. Throughput: 0: 5728.3. Samples: 35591892. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2022-07-09 01:50:17,506][25689] Avg episode reward: [(0, '-57.925')] [2022-07-09 01:50:17,539][26022] Updated weights on worker 0-0, policy_version 34755 (0.00083) [2022-07-09 01:50:19,000][26022] Updated weights on worker 0-0, policy_version 34765 (0.00049) [2022-07-09 01:50:21,146][26022] Updated weights on worker 0-0, policy_version 34775 (0.00086) [2022-07-09 01:50:22,507][25689] Fps is (10 sec: 5762.8, 60 sec: 5606.2, 300 sec: 5601.3). Total num frames: 35618816. Throughput: 0: 5794.9. Samples: 35626182. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:22,507][25689] Avg episode reward: [(0, '-58.374')] [2022-07-09 01:50:22,631][26022] Updated weights on worker 0-0, policy_version 34785 (0.00092) [2022-07-09 01:50:24,579][26022] Updated weights on worker 0-0, policy_version 34795 (0.00052) [2022-07-09 01:50:26,393][26022] Updated weights on worker 0-0, policy_version 34805 (0.00079) [2022-07-09 01:50:27,530][25689] Fps is (10 sec: 5720.2, 60 sec: 5571.7, 300 sec: 5590.9). Total num frames: 35645440. Throughput: 0: 5076.0. Samples: 35643272. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:27,531][25689] Avg episode reward: [(0, '-58.402')] [2022-07-09 01:50:28,225][26022] Updated weights on worker 0-0, policy_version 34815 (0.00086) [2022-07-09 01:50:30,085][26022] Updated weights on worker 0-0, policy_version 34825 (0.00092) [2022-07-09 01:50:31,880][26022] Updated weights on worker 0-0, policy_version 34835 (0.00357) [2022-07-09 01:50:32,549][25689] Fps is (10 sec: 5710.2, 60 sec: 5604.0, 300 sec: 5603.7). Total num frames: 35676160. Throughput: 0: 5916.9. Samples: 35677108. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:32,550][25689] Avg episode reward: [(0, '-58.114')] [2022-07-09 01:50:33,537][26022] Updated weights on worker 0-0, policy_version 34845 (0.00095) [2022-07-09 01:50:35,580][26022] Updated weights on worker 0-0, policy_version 34855 (0.00091) [2022-07-09 01:50:37,074][26022] Updated weights on worker 0-0, policy_version 34865 (0.00523) [2022-07-09 01:50:37,590][25689] Fps is (10 sec: 5802.2, 60 sec: 5589.9, 300 sec: 5600.7). Total num frames: 35703808. Throughput: 0: 5920.5. Samples: 35710802. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:37,590][25689] Avg episode reward: [(0, '-59.456')] [2022-07-09 01:50:39,337][26022] Updated weights on worker 0-0, policy_version 34875 (0.00084) [2022-07-09 01:50:40,659][26022] Updated weights on worker 0-0, policy_version 34885 (0.00089) [2022-07-09 01:50:42,616][25689] Fps is (10 sec: 5289.4, 60 sec: 5572.6, 300 sec: 5587.0). Total num frames: 35729408. Throughput: 0: 5039.1. Samples: 35727516. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:42,617][25689] Avg episode reward: [(0, '-59.689')] [2022-07-09 01:50:42,863][26022] Updated weights on worker 0-0, policy_version 34895 (0.00085) [2022-07-09 01:50:44,506][26022] Updated weights on worker 0-0, policy_version 34905 (0.00084) [2022-07-09 01:50:46,484][26022] Updated weights on worker 0-0, policy_version 34915 (0.00092) [2022-07-09 01:50:47,639][25689] Fps is (10 sec: 5706.5, 60 sec: 5640.2, 300 sec: 5601.3). Total num frames: 35761152. Throughput: 0: 5887.2. Samples: 35761654. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:47,639][25689] Avg episode reward: [(0, '-59.799')] [2022-07-09 01:50:48,166][26022] Updated weights on worker 0-0, policy_version 34925 (0.00089) [2022-07-09 01:50:50,284][26022] Updated weights on worker 0-0, policy_version 34935 (0.00085) [2022-07-09 01:50:51,739][26022] Updated weights on worker 0-0, policy_version 34945 (0.00086) [2022-07-09 01:50:52,650][25689] Fps is (10 sec: 5816.8, 60 sec: 5573.1, 300 sec: 5599.3). Total num frames: 35787776. Throughput: 0: 5887.2. Samples: 35795448. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:52,651][25689] Avg episode reward: [(0, '-60.034')] [2022-07-09 01:50:53,898][26022] Updated weights on worker 0-0, policy_version 34955 (0.00088) [2022-07-09 01:50:55,360][26022] Updated weights on worker 0-0, policy_version 34965 (0.00088) [2022-07-09 01:50:57,455][26022] Updated weights on worker 0-0, policy_version 34975 (0.00087) [2022-07-09 01:50:57,698][25689] Fps is (10 sec: 5497.1, 60 sec: 5592.8, 300 sec: 5595.1). Total num frames: 35816448. Throughput: 0: 5049.4. Samples: 35812336. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:50:57,700][25689] Avg episode reward: [(0, '-60.007')] [2022-07-09 01:50:58,993][26022] Updated weights on worker 0-0, policy_version 34985 (0.00088) [2022-07-09 01:51:01,180][26022] Updated weights on worker 0-0, policy_version 34995 (0.00081) [2022-07-09 01:51:02,705][25689] Fps is (10 sec: 5499.7, 60 sec: 5633.5, 300 sec: 5602.5). Total num frames: 35843072. Throughput: 0: 5908.7. Samples: 35846214. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 01:51:02,706][25689] Avg episode reward: [(0, '-59.635')] [2022-07-09 01:51:02,960][26022] Updated weights on worker 0-0, policy_version 35005 (0.00091) [2022-07-09 01:51:04,889][26022] Updated weights on worker 0-0, policy_version 35015 (0.00093) [2022-07-09 01:51:06,871][26022] Updated weights on worker 0-0, policy_version 35025 (0.00096) [2022-07-09 01:51:07,729][25689] Fps is (10 sec: 5308.0, 60 sec: 5614.6, 300 sec: 5596.1). Total num frames: 35869696. Throughput: 0: 5796.0. Samples: 35878100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:07,730][25689] Avg episode reward: [(0, '-59.148')] [2022-07-09 01:51:08,681][26022] Updated weights on worker 0-0, policy_version 35035 (0.00084) [2022-07-09 01:51:10,412][26022] Updated weights on worker 0-0, policy_version 35045 (0.00085) [2022-07-09 01:51:12,261][26022] Updated weights on worker 0-0, policy_version 35055 (0.00089) [2022-07-09 01:51:12,755][25689] Fps is (10 sec: 5501.9, 60 sec: 5596.8, 300 sec: 5596.7). Total num frames: 35898368. Throughput: 0: 4939.5. Samples: 35894756. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:12,756][25689] Avg episode reward: [(0, '-59.018')] [2022-07-09 01:51:13,987][26022] Updated weights on worker 0-0, policy_version 35065 (0.00089) [2022-07-09 01:51:15,814][26022] Updated weights on worker 0-0, policy_version 35075 (0.00086) [2022-07-09 01:51:17,718][26022] Updated weights on worker 0-0, policy_version 35085 (0.00090) [2022-07-09 01:51:17,824][25689] Fps is (10 sec: 5782.2, 60 sec: 5636.2, 300 sec: 5600.2). Total num frames: 35928064. Throughput: 0: 5787.4. Samples: 35928814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:17,824][25689] Avg episode reward: [(0, '-60.201')] [2022-07-09 01:51:19,452][26022] Updated weights on worker 0-0, policy_version 35095 (0.00082) [2022-07-09 01:51:21,295][26022] Updated weights on worker 0-0, policy_version 35105 (0.00085) [2022-07-09 01:51:22,864][25689] Fps is (10 sec: 5773.9, 60 sec: 5598.6, 300 sec: 5600.6). Total num frames: 35956736. Throughput: 0: 5808.7. Samples: 35963314. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:22,866][25689] Avg episode reward: [(0, '-59.819')] [2022-07-09 01:51:22,889][26022] Updated weights on worker 0-0, policy_version 35115 (0.00102) [2022-07-09 01:51:24,883][26022] Updated weights on worker 0-0, policy_version 35125 (0.00083) [2022-07-09 01:51:26,859][26022] Updated weights on worker 0-0, policy_version 35135 (0.00080) [2022-07-09 01:51:27,924][25689] Fps is (10 sec: 5677.5, 60 sec: 5629.2, 300 sec: 5600.6). Total num frames: 35985408. Throughput: 0: 5067.0. Samples: 35980426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:27,925][25689] Avg episode reward: [(0, '-59.741')] [2022-07-09 01:51:28,466][26022] Updated weights on worker 0-0, policy_version 35145 (0.00082) [2022-07-09 01:51:30,284][26022] Updated weights on worker 0-0, policy_version 35155 (0.00080) [2022-07-09 01:51:32,084][26022] Updated weights on worker 0-0, policy_version 35165 (0.00095) [2022-07-09 01:51:32,949][25689] Fps is (10 sec: 5584.8, 60 sec: 5577.8, 300 sec: 5598.3). Total num frames: 36013056. Throughput: 0: 5917.2. Samples: 36014244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:32,949][25689] Avg episode reward: [(0, '-59.293')] [2022-07-09 01:51:33,815][26022] Updated weights on worker 0-0, policy_version 35175 (0.00092) [2022-07-09 01:51:35,842][26022] Updated weights on worker 0-0, policy_version 35185 (0.00089) [2022-07-09 01:51:37,439][26022] Updated weights on worker 0-0, policy_version 35195 (0.00091) [2022-07-09 01:51:37,995][25689] Fps is (10 sec: 5694.3, 60 sec: 5611.2, 300 sec: 5602.2). Total num frames: 36042752. Throughput: 0: 5919.4. Samples: 36048212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:37,995][25689] Avg episode reward: [(0, '-59.645')] [2022-07-09 01:51:39,360][26022] Updated weights on worker 0-0, policy_version 35205 (0.00086) [2022-07-09 01:51:40,837][26022] Updated weights on worker 0-0, policy_version 35215 (0.00083) [2022-07-09 01:51:42,999][25689] Fps is (10 sec: 5603.8, 60 sec: 5630.2, 300 sec: 5598.8). Total num frames: 36069376. Throughput: 0: 5069.6. Samples: 36065390. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:43,001][25689] Avg episode reward: [(0, '-59.401')] [2022-07-09 01:51:43,007][26022] Updated weights on worker 0-0, policy_version 35225 (0.00082) [2022-07-09 01:51:44,710][26022] Updated weights on worker 0-0, policy_version 35235 (0.00084) [2022-07-09 01:51:46,443][26022] Updated weights on worker 0-0, policy_version 35245 (0.00081) [2022-07-09 01:51:48,027][25689] Fps is (10 sec: 5613.8, 60 sec: 5595.8, 300 sec: 5595.3). Total num frames: 36099072. Throughput: 0: 5931.8. Samples: 36099672. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 01:51:48,027][25689] Avg episode reward: [(0, '-59.622')] [2022-07-09 01:51:48,215][26022] Updated weights on worker 0-0, policy_version 35255 (0.00086) [2022-07-09 01:51:49,947][26022] Updated weights on worker 0-0, policy_version 35265 (0.00088) [2022-07-09 01:51:51,943][26022] Updated weights on worker 0-0, policy_version 35275 (0.00100) [2022-07-09 01:51:53,043][25689] Fps is (10 sec: 5913.3, 60 sec: 5646.3, 300 sec: 5610.3). Total num frames: 36128768. Throughput: 0: 5960.6. Samples: 36134018. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:51:53,043][25689] Avg episode reward: [(0, '-59.462')] [2022-07-09 01:51:53,593][26022] Updated weights on worker 0-0, policy_version 35285 (0.00083) [2022-07-09 01:51:55,492][26022] Updated weights on worker 0-0, policy_version 35295 (0.00086) [2022-07-09 01:51:57,279][26022] Updated weights on worker 0-0, policy_version 35305 (0.00088) [2022-07-09 01:51:58,121][25689] Fps is (10 sec: 5681.0, 60 sec: 5626.5, 300 sec: 5599.3). Total num frames: 36156416. Throughput: 0: 5109.0. Samples: 36151038. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:51:58,121][25689] Avg episode reward: [(0, '-59.825')] [2022-07-09 01:51:58,980][26022] Updated weights on worker 0-0, policy_version 35315 (0.00087) [2022-07-09 01:52:00,941][26022] Updated weights on worker 0-0, policy_version 35325 (0.00084) [2022-07-09 01:52:02,891][26022] Updated weights on worker 0-0, policy_version 35335 (0.00088) [2022-07-09 01:52:03,176][25689] Fps is (10 sec: 5456.7, 60 sec: 5638.9, 300 sec: 5602.2). Total num frames: 36184064. Throughput: 0: 5899.7. Samples: 36184432. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:03,178][25689] Avg episode reward: [(0, '-60.561')] [2022-07-09 01:52:04,844][26022] Updated weights on worker 0-0, policy_version 35345 (0.00089) [2022-07-09 01:52:06,539][26022] Updated weights on worker 0-0, policy_version 35355 (0.00089) [2022-07-09 01:52:08,199][25689] Fps is (10 sec: 5486.8, 60 sec: 5656.0, 300 sec: 5602.3). Total num frames: 36211712. Throughput: 0: 5825.2. Samples: 36217178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:08,199][25689] Avg episode reward: [(0, '-60.566')] [2022-07-09 01:52:08,469][26022] Updated weights on worker 0-0, policy_version 35365 (0.00088) [2022-07-09 01:52:10,259][26022] Updated weights on worker 0-0, policy_version 35375 (0.00081) [2022-07-09 01:52:12,159][26022] Updated weights on worker 0-0, policy_version 35385 (0.00081) [2022-07-09 01:52:13,226][25689] Fps is (10 sec: 5604.2, 60 sec: 5655.9, 300 sec: 5599.7). Total num frames: 36240384. Throughput: 0: 4955.6. Samples: 36234038. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:13,227][25689] Avg episode reward: [(0, '-61.313')] [2022-07-09 01:52:13,914][26022] Updated weights on worker 0-0, policy_version 35395 (0.00085) [2022-07-09 01:52:15,665][26022] Updated weights on worker 0-0, policy_version 35405 (0.00092) [2022-07-09 01:52:17,455][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:52:17,469][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000035414_36263936.pth [2022-07-09 01:52:17,469][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000033441_34243584.pth [2022-07-09 01:52:17,771][26022] Updated weights on worker 0-0, policy_version 35415 (0.00084) [2022-07-09 01:52:18,273][25689] Fps is (10 sec: 5590.6, 60 sec: 5624.0, 300 sec: 5603.0). Total num frames: 36268032. Throughput: 0: 5814.8. Samples: 36268218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:18,273][25689] Avg episode reward: [(0, '-60.821')] [2022-07-09 01:52:19,172][26022] Updated weights on worker 0-0, policy_version 35425 (0.00084) [2022-07-09 01:52:21,192][26022] Updated weights on worker 0-0, policy_version 35435 (0.00088) [2022-07-09 01:52:22,853][26022] Updated weights on worker 0-0, policy_version 35445 (0.00093) [2022-07-09 01:52:23,276][25689] Fps is (10 sec: 5604.1, 60 sec: 5627.5, 300 sec: 5610.1). Total num frames: 36296704. Throughput: 0: 5865.9. Samples: 36302334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:23,276][25689] Avg episode reward: [(0, '-61.203')] [2022-07-09 01:52:24,683][26022] Updated weights on worker 0-0, policy_version 35455 (0.00088) [2022-07-09 01:52:26,691][26022] Updated weights on worker 0-0, policy_version 35465 (0.00082) [2022-07-09 01:52:28,230][26022] Updated weights on worker 0-0, policy_version 35475 (0.00098) [2022-07-09 01:52:28,279][25689] Fps is (10 sec: 5833.4, 60 sec: 5649.8, 300 sec: 5606.7). Total num frames: 36326400. Throughput: 0: 5088.8. Samples: 36319360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:28,279][25689] Avg episode reward: [(0, '-60.455')] [2022-07-09 01:52:30,307][26022] Updated weights on worker 0-0, policy_version 35485 (0.00079) [2022-07-09 01:52:31,821][26022] Updated weights on worker 0-0, policy_version 35495 (0.00082) [2022-07-09 01:52:33,297][25689] Fps is (10 sec: 5620.2, 60 sec: 5633.4, 300 sec: 5604.6). Total num frames: 36353024. Throughput: 0: 5930.5. Samples: 36353068. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 01:52:33,297][25689] Avg episode reward: [(0, '-59.954')] [2022-07-09 01:52:33,876][26022] Updated weights on worker 0-0, policy_version 35505 (0.00088) [2022-07-09 01:52:35,814][26022] Updated weights on worker 0-0, policy_version 35515 (0.00085) [2022-07-09 01:52:37,473][26022] Updated weights on worker 0-0, policy_version 35525 (0.00083) [2022-07-09 01:52:38,342][25689] Fps is (10 sec: 5494.7, 60 sec: 5616.5, 300 sec: 5607.7). Total num frames: 36381696. Throughput: 0: 5916.0. Samples: 36386948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:52:38,343][25689] Avg episode reward: [(0, '-59.792')] [2022-07-09 01:52:39,377][26022] Updated weights on worker 0-0, policy_version 35535 (0.00091) [2022-07-09 01:52:41,183][26022] Updated weights on worker 0-0, policy_version 35545 (0.00082) [2022-07-09 01:52:42,885][26022] Updated weights on worker 0-0, policy_version 35555 (0.00093) [2022-07-09 01:52:43,346][25689] Fps is (10 sec: 5808.4, 60 sec: 5667.5, 300 sec: 5611.8). Total num frames: 36411392. Throughput: 0: 5050.3. Samples: 36403692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:52:43,346][25689] Avg episode reward: [(0, '-59.797')] [2022-07-09 01:52:44,756][26022] Updated weights on worker 0-0, policy_version 35565 (0.00087) [2022-07-09 01:52:46,535][26022] Updated weights on worker 0-0, policy_version 35575 (0.00326) [2022-07-09 01:52:48,225][26022] Updated weights on worker 0-0, policy_version 35585 (0.00085) [2022-07-09 01:52:48,353][25689] Fps is (10 sec: 5728.0, 60 sec: 5635.5, 300 sec: 5605.3). Total num frames: 36439040. Throughput: 0: 5909.6. Samples: 36437992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:52:48,354][25689] Avg episode reward: [(0, '-58.482')] [2022-07-09 01:52:50,161][26022] Updated weights on worker 0-0, policy_version 35595 (0.00090) [2022-07-09 01:52:51,841][26022] Updated weights on worker 0-0, policy_version 35605 (0.00087) [2022-07-09 01:52:53,366][25689] Fps is (10 sec: 5518.4, 60 sec: 5601.8, 300 sec: 5606.6). Total num frames: 36466688. Throughput: 0: 5944.5. Samples: 36472368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:52:53,367][25689] Avg episode reward: [(0, '-57.903')] [2022-07-09 01:52:53,592][26022] Updated weights on worker 0-0, policy_version 35615 (0.00087) [2022-07-09 01:52:55,486][26022] Updated weights on worker 0-0, policy_version 35625 (0.00085) [2022-07-09 01:52:57,186][26022] Updated weights on worker 0-0, policy_version 35635 (0.00090) [2022-07-09 01:52:58,430][25689] Fps is (10 sec: 5589.2, 60 sec: 5620.1, 300 sec: 5610.3). Total num frames: 36495360. Throughput: 0: 5105.2. Samples: 36489500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:52:58,431][25689] Avg episode reward: [(0, '-58.273')] [2022-07-09 01:52:59,038][26022] Updated weights on worker 0-0, policy_version 35645 (0.00374) [2022-07-09 01:53:00,985][26022] Updated weights on worker 0-0, policy_version 35655 (0.00093) [2022-07-09 01:53:03,025][26022] Updated weights on worker 0-0, policy_version 35665 (0.00084) [2022-07-09 01:53:03,493][25689] Fps is (10 sec: 5662.7, 60 sec: 5636.4, 300 sec: 5609.8). Total num frames: 36524032. Throughput: 0: 5926.3. Samples: 36523088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:53:03,493][25689] Avg episode reward: [(0, '-58.105')] [2022-07-09 01:53:05,123][26022] Updated weights on worker 0-0, policy_version 35675 (0.00094) [2022-07-09 01:53:06,704][26022] Updated weights on worker 0-0, policy_version 35685 (0.00083) [2022-07-09 01:53:08,447][26022] Updated weights on worker 0-0, policy_version 35695 (0.00084) [2022-07-09 01:53:08,544][25689] Fps is (10 sec: 5568.7, 60 sec: 5633.8, 300 sec: 5612.7). Total num frames: 36551680. Throughput: 0: 5803.0. Samples: 36555156. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:53:08,544][25689] Avg episode reward: [(0, '-57.693')] [2022-07-09 01:53:10,331][26022] Updated weights on worker 0-0, policy_version 35705 (0.00091) [2022-07-09 01:53:12,034][26022] Updated weights on worker 0-0, policy_version 35715 (0.00619) [2022-07-09 01:53:13,608][25689] Fps is (10 sec: 5466.3, 60 sec: 5613.3, 300 sec: 5605.9). Total num frames: 36579328. Throughput: 0: 5772.2. Samples: 36589212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:53:13,609][25689] Avg episode reward: [(0, '-57.422')] [2022-07-09 01:53:13,980][26022] Updated weights on worker 0-0, policy_version 35725 (0.00089) [2022-07-09 01:53:15,695][26022] Updated weights on worker 0-0, policy_version 35735 (0.00084) [2022-07-09 01:53:17,511][26022] Updated weights on worker 0-0, policy_version 35745 (0.00081) [2022-07-09 01:53:18,675][25689] Fps is (10 sec: 5559.2, 60 sec: 5628.4, 300 sec: 5604.9). Total num frames: 36608000. Throughput: 0: 5769.7. Samples: 36606306. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:53:18,675][25689] Avg episode reward: [(0, '-58.753')] [2022-07-09 01:53:19,451][26022] Updated weights on worker 0-0, policy_version 35755 (0.00093) [2022-07-09 01:53:21,163][26022] Updated weights on worker 0-0, policy_version 35765 (0.00088) [2022-07-09 01:53:23,166][26022] Updated weights on worker 0-0, policy_version 35775 (0.00088) [2022-07-09 01:53:23,688][25689] Fps is (10 sec: 5689.3, 60 sec: 5627.5, 300 sec: 5611.9). Total num frames: 36636672. Throughput: 0: 5790.8. Samples: 36640034. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 01:53:23,689][25689] Avg episode reward: [(0, '-59.223')] [2022-07-09 01:53:24,814][26022] Updated weights on worker 0-0, policy_version 35785 (0.00083) [2022-07-09 01:53:26,657][26022] Updated weights on worker 0-0, policy_version 35795 (0.00092) [2022-07-09 01:53:28,530][26022] Updated weights on worker 0-0, policy_version 35805 (0.00110) [2022-07-09 01:53:28,696][25689] Fps is (10 sec: 5722.4, 60 sec: 5610.1, 300 sec: 5608.8). Total num frames: 36665344. Throughput: 0: 5877.2. Samples: 36673592. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:28,696][25689] Avg episode reward: [(0, '-59.016')] [2022-07-09 01:53:30,566][26022] Updated weights on worker 0-0, policy_version 35815 (0.00090) [2022-07-09 01:53:32,290][26022] Updated weights on worker 0-0, policy_version 35825 (0.00086) [2022-07-09 01:53:33,699][25689] Fps is (10 sec: 5523.5, 60 sec: 5611.5, 300 sec: 5607.1). Total num frames: 36691968. Throughput: 0: 5044.6. Samples: 36690560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:33,699][25689] Avg episode reward: [(0, '-59.362')] [2022-07-09 01:53:34,086][26022] Updated weights on worker 0-0, policy_version 35835 (0.00086) [2022-07-09 01:53:35,696][26022] Updated weights on worker 0-0, policy_version 35845 (0.00087) [2022-07-09 01:53:37,498][26022] Updated weights on worker 0-0, policy_version 35855 (0.00085) [2022-07-09 01:53:38,740][25689] Fps is (10 sec: 5606.9, 60 sec: 5628.8, 300 sec: 5613.8). Total num frames: 36721664. Throughput: 0: 5890.3. Samples: 36724498. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:38,741][25689] Avg episode reward: [(0, '-58.926')] [2022-07-09 01:53:39,575][26022] Updated weights on worker 0-0, policy_version 35865 (0.00080) [2022-07-09 01:53:41,132][26022] Updated weights on worker 0-0, policy_version 35875 (0.00090) [2022-07-09 01:53:43,102][26022] Updated weights on worker 0-0, policy_version 35885 (0.01037) [2022-07-09 01:53:43,751][25689] Fps is (10 sec: 5806.7, 60 sec: 5611.2, 300 sec: 5607.1). Total num frames: 36750336. Throughput: 0: 5920.0. Samples: 36758806. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:43,751][25689] Avg episode reward: [(0, '-58.780')] [2022-07-09 01:53:44,872][26022] Updated weights on worker 0-0, policy_version 35895 (0.00086) [2022-07-09 01:53:46,572][26022] Updated weights on worker 0-0, policy_version 35905 (0.00086) [2022-07-09 01:53:48,440][26022] Updated weights on worker 0-0, policy_version 35915 (0.00087) [2022-07-09 01:53:48,755][25689] Fps is (10 sec: 5623.9, 60 sec: 5611.5, 300 sec: 5614.5). Total num frames: 36777984. Throughput: 0: 5102.8. Samples: 36775950. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:48,755][25689] Avg episode reward: [(0, '-58.692')] [2022-07-09 01:53:50,222][26022] Updated weights on worker 0-0, policy_version 35925 (0.00087) [2022-07-09 01:53:52,223][26022] Updated weights on worker 0-0, policy_version 35935 (0.00089) [2022-07-09 01:53:53,792][25689] Fps is (10 sec: 5608.9, 60 sec: 5626.2, 300 sec: 5615.0). Total num frames: 36806656. Throughput: 0: 5944.5. Samples: 36810002. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:53,793][25689] Avg episode reward: [(0, '-58.140')] [2022-07-09 01:53:53,870][26022] Updated weights on worker 0-0, policy_version 35945 (0.00084) [2022-07-09 01:53:55,582][26022] Updated weights on worker 0-0, policy_version 35955 (0.00085) [2022-07-09 01:53:57,432][26022] Updated weights on worker 0-0, policy_version 35965 (0.00090) [2022-07-09 01:53:58,852][25689] Fps is (10 sec: 5780.4, 60 sec: 5643.5, 300 sec: 5614.8). Total num frames: 36836352. Throughput: 0: 5931.8. Samples: 36843798. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:53:58,853][25689] Avg episode reward: [(0, '-58.885')] [2022-07-09 01:53:59,309][26022] Updated weights on worker 0-0, policy_version 35975 (0.00087) [2022-07-09 01:54:01,008][26022] Updated weights on worker 0-0, policy_version 35985 (0.00086) [2022-07-09 01:54:03,347][26022] Updated weights on worker 0-0, policy_version 35995 (0.00090) [2022-07-09 01:54:03,860][25689] Fps is (10 sec: 5390.2, 60 sec: 5580.7, 300 sec: 5611.3). Total num frames: 36860928. Throughput: 0: 5072.9. Samples: 36860822. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:54:03,863][25689] Avg episode reward: [(0, '-59.284')] [2022-07-09 01:54:05,185][26022] Updated weights on worker 0-0, policy_version 36005 (0.00088) [2022-07-09 01:54:06,919][26022] Updated weights on worker 0-0, policy_version 36015 (0.00089) [2022-07-09 01:54:08,592][26022] Updated weights on worker 0-0, policy_version 36025 (0.00111) [2022-07-09 01:54:08,907][25689] Fps is (10 sec: 5295.7, 60 sec: 5598.1, 300 sec: 5607.4). Total num frames: 36889600. Throughput: 0: 5783.0. Samples: 36892492. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 01:54:08,909][25689] Avg episode reward: [(0, '-59.657')] [2022-07-09 01:54:11,004][26022] Updated weights on worker 0-0, policy_version 36035 (0.00093) [2022-07-09 01:54:12,441][26022] Updated weights on worker 0-0, policy_version 36045 (0.00090) [2022-07-09 01:54:13,947][25689] Fps is (10 sec: 5482.3, 60 sec: 5583.5, 300 sec: 5604.4). Total num frames: 36916224. Throughput: 0: 5761.4. Samples: 36926124. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:13,948][25689] Avg episode reward: [(0, '-59.492')] [2022-07-09 01:54:14,433][26022] Updated weights on worker 0-0, policy_version 36055 (0.00092) [2022-07-09 01:54:15,823][26022] Updated weights on worker 0-0, policy_version 36065 (0.00274) [2022-07-09 01:54:17,482][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:54:17,494][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000036072_36937728.pth [2022-07-09 01:54:17,494][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000034100_34918400.pth [2022-07-09 01:54:17,971][26022] Updated weights on worker 0-0, policy_version 36075 (0.00093) [2022-07-09 01:54:18,989][25689] Fps is (10 sec: 5789.6, 60 sec: 5636.6, 300 sec: 5617.6). Total num frames: 36947968. Throughput: 0: 4924.9. Samples: 36942968. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:18,991][25689] Avg episode reward: [(0, '-59.104')] [2022-07-09 01:54:19,813][26022] Updated weights on worker 0-0, policy_version 36085 (0.00090) [2022-07-09 01:54:21,631][26022] Updated weights on worker 0-0, policy_version 36095 (0.00089) [2022-07-09 01:54:23,323][26022] Updated weights on worker 0-0, policy_version 36105 (0.00085) [2022-07-09 01:54:24,007][25689] Fps is (10 sec: 5700.2, 60 sec: 5585.2, 300 sec: 5607.2). Total num frames: 36973568. Throughput: 0: 5761.7. Samples: 36976900. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:24,007][25689] Avg episode reward: [(0, '-57.745')] [2022-07-09 01:54:25,219][26022] Updated weights on worker 0-0, policy_version 36115 (0.00093) [2022-07-09 01:54:26,821][26022] Updated weights on worker 0-0, policy_version 36125 (0.00108) [2022-07-09 01:54:28,944][26022] Updated weights on worker 0-0, policy_version 36135 (0.00088) [2022-07-09 01:54:29,039][25689] Fps is (10 sec: 5400.3, 60 sec: 5583.0, 300 sec: 5606.7). Total num frames: 37002240. Throughput: 0: 5877.9. Samples: 37010822. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:29,039][25689] Avg episode reward: [(0, '-57.671')] [2022-07-09 01:54:30,630][26022] Updated weights on worker 0-0, policy_version 36145 (0.00095) [2022-07-09 01:54:32,426][26022] Updated weights on worker 0-0, policy_version 36155 (0.00088) [2022-07-09 01:54:34,077][25689] Fps is (10 sec: 5694.2, 60 sec: 5613.7, 300 sec: 5607.3). Total num frames: 37030912. Throughput: 0: 5049.8. Samples: 37027780. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:34,079][25689] Avg episode reward: [(0, '-57.384')] [2022-07-09 01:54:34,202][26022] Updated weights on worker 0-0, policy_version 36165 (0.00082) [2022-07-09 01:54:35,919][26022] Updated weights on worker 0-0, policy_version 36175 (0.00078) [2022-07-09 01:54:37,678][26022] Updated weights on worker 0-0, policy_version 36185 (0.01403) [2022-07-09 01:54:39,129][25689] Fps is (10 sec: 5682.8, 60 sec: 5595.7, 300 sec: 5613.6). Total num frames: 37059584. Throughput: 0: 5906.4. Samples: 37061926. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:39,130][25689] Avg episode reward: [(0, '-56.913')] [2022-07-09 01:54:39,791][26022] Updated weights on worker 0-0, policy_version 36195 (0.00089) [2022-07-09 01:54:41,212][26022] Updated weights on worker 0-0, policy_version 36205 (0.00084) [2022-07-09 01:54:43,290][26022] Updated weights on worker 0-0, policy_version 36215 (0.00094) [2022-07-09 01:54:44,159][25689] Fps is (10 sec: 5789.6, 60 sec: 5610.9, 300 sec: 5620.4). Total num frames: 37089280. Throughput: 0: 5910.4. Samples: 37096006. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:44,159][25689] Avg episode reward: [(0, '-58.554')] [2022-07-09 01:54:45,053][26022] Updated weights on worker 0-0, policy_version 36225 (0.00087) [2022-07-09 01:54:46,808][26022] Updated weights on worker 0-0, policy_version 36235 (0.00091) [2022-07-09 01:54:48,663][26022] Updated weights on worker 0-0, policy_version 36245 (0.00088) [2022-07-09 01:54:49,176][25689] Fps is (10 sec: 5707.8, 60 sec: 5609.7, 300 sec: 5610.0). Total num frames: 37116928. Throughput: 0: 5073.1. Samples: 37112980. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:49,177][25689] Avg episode reward: [(0, '-58.884')] [2022-07-09 01:54:50,349][26022] Updated weights on worker 0-0, policy_version 36255 (0.00087) [2022-07-09 01:54:52,281][26022] Updated weights on worker 0-0, policy_version 36265 (0.00101) [2022-07-09 01:54:54,030][26022] Updated weights on worker 0-0, policy_version 36275 (0.00093) [2022-07-09 01:54:54,185][25689] Fps is (10 sec: 5616.9, 60 sec: 5612.2, 300 sec: 5614.8). Total num frames: 37145600. Throughput: 0: 5933.4. Samples: 37147090. Policy #0 lag: (min: 0.0, avg: 6.6, max: 19.0) [2022-07-09 01:54:54,186][25689] Avg episode reward: [(0, '-59.538')] [2022-07-09 01:54:55,937][26022] Updated weights on worker 0-0, policy_version 36285 (0.00091) [2022-07-09 01:54:57,852][26022] Updated weights on worker 0-0, policy_version 36295 (0.00086) [2022-07-09 01:54:59,276][25689] Fps is (10 sec: 5677.6, 60 sec: 5592.5, 300 sec: 5628.4). Total num frames: 37174272. Throughput: 0: 5915.1. Samples: 37181094. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:54:59,277][25689] Avg episode reward: [(0, '-60.622')] [2022-07-09 01:54:59,477][26022] Updated weights on worker 0-0, policy_version 36305 (0.00088) [2022-07-09 01:55:01,401][26022] Updated weights on worker 0-0, policy_version 36315 (0.00066) [2022-07-09 01:55:03,395][26022] Updated weights on worker 0-0, policy_version 36325 (0.00093) [2022-07-09 01:55:04,347][25689] Fps is (10 sec: 5441.8, 60 sec: 5620.6, 300 sec: 5623.6). Total num frames: 37200896. Throughput: 0: 5039.3. Samples: 37197740. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:04,349][25689] Avg episode reward: [(0, '-60.401')] [2022-07-09 01:55:05,584][26022] Updated weights on worker 0-0, policy_version 36335 (0.00091) [2022-07-09 01:55:07,199][26022] Updated weights on worker 0-0, policy_version 36345 (0.00089) [2022-07-09 01:55:08,891][26022] Updated weights on worker 0-0, policy_version 36355 (0.00078) [2022-07-09 01:55:09,359][25689] Fps is (10 sec: 5483.9, 60 sec: 5623.8, 300 sec: 5620.3). Total num frames: 37229568. Throughput: 0: 5805.8. Samples: 37230160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:09,361][25689] Avg episode reward: [(0, '-59.652')] [2022-07-09 01:55:10,713][26022] Updated weights on worker 0-0, policy_version 36365 (0.00089) [2022-07-09 01:55:12,459][26022] Updated weights on worker 0-0, policy_version 36375 (0.00086) [2022-07-09 01:55:14,308][26022] Updated weights on worker 0-0, policy_version 36385 (0.00083) [2022-07-09 01:55:14,402][25689] Fps is (10 sec: 5702.9, 60 sec: 5657.3, 300 sec: 5625.3). Total num frames: 37258240. Throughput: 0: 5810.0. Samples: 37264548. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:14,404][25689] Avg episode reward: [(0, '-58.910')] [2022-07-09 01:55:16,111][26022] Updated weights on worker 0-0, policy_version 36395 (0.00081) [2022-07-09 01:55:17,899][26022] Updated weights on worker 0-0, policy_version 36405 (0.00083) [2022-07-09 01:55:19,538][25689] Fps is (10 sec: 5734.4, 60 sec: 5614.7, 300 sec: 5619.3). Total num frames: 37287936. Throughput: 0: 4961.5. Samples: 37281624. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:19,538][25689] Avg episode reward: [(0, '-59.399')] [2022-07-09 01:55:19,669][26022] Updated weights on worker 0-0, policy_version 36415 (0.00093) [2022-07-09 01:55:21,403][26022] Updated weights on worker 0-0, policy_version 36425 (0.00077) [2022-07-09 01:55:23,144][26022] Updated weights on worker 0-0, policy_version 36435 (0.00101) [2022-07-09 01:55:24,562][25689] Fps is (10 sec: 5644.4, 60 sec: 5648.0, 300 sec: 5622.8). Total num frames: 37315584. Throughput: 0: 5846.6. Samples: 37315928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:24,562][25689] Avg episode reward: [(0, '-59.204')] [2022-07-09 01:55:25,003][26022] Updated weights on worker 0-0, policy_version 36445 (0.00082) [2022-07-09 01:55:26,773][26022] Updated weights on worker 0-0, policy_version 36455 (0.00088) [2022-07-09 01:55:28,613][26022] Updated weights on worker 0-0, policy_version 36465 (0.00087) [2022-07-09 01:55:29,590][25689] Fps is (10 sec: 5704.9, 60 sec: 5665.3, 300 sec: 5619.2). Total num frames: 37345280. Throughput: 0: 5917.9. Samples: 37349880. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:29,591][25689] Avg episode reward: [(0, '-58.898')] [2022-07-09 01:55:30,449][26022] Updated weights on worker 0-0, policy_version 36475 (0.00086) [2022-07-09 01:55:32,379][26022] Updated weights on worker 0-0, policy_version 36485 (0.00081) [2022-07-09 01:55:34,105][26022] Updated weights on worker 0-0, policy_version 36495 (0.00082) [2022-07-09 01:55:34,600][25689] Fps is (10 sec: 5712.7, 60 sec: 5651.1, 300 sec: 5619.7). Total num frames: 37372928. Throughput: 0: 5058.5. Samples: 37366716. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:34,600][25689] Avg episode reward: [(0, '-59.663')] [2022-07-09 01:55:36,051][26022] Updated weights on worker 0-0, policy_version 36505 (0.00095) [2022-07-09 01:55:37,780][26022] Updated weights on worker 0-0, policy_version 36515 (0.00075) [2022-07-09 01:55:39,669][25689] Fps is (10 sec: 5587.5, 60 sec: 5649.4, 300 sec: 5629.3). Total num frames: 37401600. Throughput: 0: 5908.5. Samples: 37400570. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:39,670][25689] Avg episode reward: [(0, '-60.569')] [2022-07-09 01:55:39,681][26022] Updated weights on worker 0-0, policy_version 36525 (0.00093) [2022-07-09 01:55:41,427][26022] Updated weights on worker 0-0, policy_version 36535 (0.00091) [2022-07-09 01:55:43,198][26022] Updated weights on worker 0-0, policy_version 36545 (0.00083) [2022-07-09 01:55:44,678][25689] Fps is (10 sec: 5689.7, 60 sec: 5634.4, 300 sec: 5619.2). Total num frames: 37430272. Throughput: 0: 5896.4. Samples: 37434544. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 01:55:44,680][25689] Avg episode reward: [(0, '-60.799')] [2022-07-09 01:55:44,999][26022] Updated weights on worker 0-0, policy_version 36555 (0.00093) [2022-07-09 01:55:46,716][26022] Updated weights on worker 0-0, policy_version 36565 (0.00085) [2022-07-09 01:55:48,701][26022] Updated weights on worker 0-0, policy_version 36575 (0.00087) [2022-07-09 01:55:49,691][25689] Fps is (10 sec: 5722.3, 60 sec: 5651.8, 300 sec: 5626.1). Total num frames: 37458944. Throughput: 0: 5061.9. Samples: 37451628. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:55:49,691][25689] Avg episode reward: [(0, '-59.926')] [2022-07-09 01:55:50,521][26022] Updated weights on worker 0-0, policy_version 36585 (0.00089) [2022-07-09 01:55:52,285][26022] Updated weights on worker 0-0, policy_version 36595 (0.00611) [2022-07-09 01:55:54,204][26022] Updated weights on worker 0-0, policy_version 36605 (0.00087) [2022-07-09 01:55:54,705][25689] Fps is (10 sec: 5718.8, 60 sec: 5651.3, 300 sec: 5626.7). Total num frames: 37487616. Throughput: 0: 5909.2. Samples: 37485526. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:55:54,706][25689] Avg episode reward: [(0, '-59.570')] [2022-07-09 01:55:55,915][26022] Updated weights on worker 0-0, policy_version 36615 (0.00089) [2022-07-09 01:55:57,753][26022] Updated weights on worker 0-0, policy_version 36625 (0.00093) [2022-07-09 01:55:59,676][26022] Updated weights on worker 0-0, policy_version 36635 (0.00108) [2022-07-09 01:55:59,766][25689] Fps is (10 sec: 5487.9, 60 sec: 5620.2, 300 sec: 5625.7). Total num frames: 37514240. Throughput: 0: 5918.6. Samples: 37519518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:55:59,767][25689] Avg episode reward: [(0, '-59.831')] [2022-07-09 01:56:01,304][26022] Updated weights on worker 0-0, policy_version 36645 (0.00085) [2022-07-09 01:56:03,756][26022] Updated weights on worker 0-0, policy_version 36655 (0.00083) [2022-07-09 01:56:04,798][25689] Fps is (10 sec: 5377.4, 60 sec: 5640.8, 300 sec: 5629.0). Total num frames: 37541888. Throughput: 0: 4999.9. Samples: 37535142. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:56:04,798][25689] Avg episode reward: [(0, '-58.822')] [2022-07-09 01:56:05,165][26022] Updated weights on worker 0-0, policy_version 36665 (0.00092) [2022-07-09 01:56:07,048][26022] Updated weights on worker 0-0, policy_version 36675 (0.00081) [2022-07-09 01:56:08,755][26022] Updated weights on worker 0-0, policy_version 36685 (0.00109) [2022-07-09 01:56:09,835][25689] Fps is (10 sec: 5492.1, 60 sec: 5621.6, 300 sec: 5625.3). Total num frames: 37569536. Throughput: 0: 5822.9. Samples: 37568926. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:56:09,835][25689] Avg episode reward: [(0, '-59.267')] [2022-07-09 01:56:10,807][26022] Updated weights on worker 0-0, policy_version 36695 (0.00084) [2022-07-09 01:56:12,463][26022] Updated weights on worker 0-0, policy_version 36705 (0.00087) [2022-07-09 01:56:14,381][26022] Updated weights on worker 0-0, policy_version 36715 (0.00389) [2022-07-09 01:56:14,838][25689] Fps is (10 sec: 5609.4, 60 sec: 5625.2, 300 sec: 5623.1). Total num frames: 37598208. Throughput: 0: 5842.7. Samples: 37603156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:56:14,839][25689] Avg episode reward: [(0, '-59.758')] [2022-07-09 01:56:16,087][26022] Updated weights on worker 0-0, policy_version 36725 (0.00083) [2022-07-09 01:56:17,623][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:56:17,634][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000036733_37614592.pth [2022-07-09 01:56:17,634][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000034753_35587072.pth [2022-07-09 01:56:17,977][26022] Updated weights on worker 0-0, policy_version 36735 (0.00088) [2022-07-09 01:56:19,425][26022] Updated weights on worker 0-0, policy_version 36745 (0.00088) [2022-07-09 01:56:19,893][25689] Fps is (10 sec: 5802.7, 60 sec: 5632.8, 300 sec: 5626.3). Total num frames: 37627904. Throughput: 0: 5858.2. Samples: 37637426. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:56:19,894][25689] Avg episode reward: [(0, '-60.628')] [2022-07-09 01:56:21,651][26022] Updated weights on worker 0-0, policy_version 36755 (0.00095) [2022-07-09 01:56:23,083][26022] Updated weights on worker 0-0, policy_version 36765 (0.00088) [2022-07-09 01:56:24,912][25689] Fps is (10 sec: 5692.1, 60 sec: 5633.2, 300 sec: 5623.6). Total num frames: 37655552. Throughput: 0: 5936.9. Samples: 37654560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:56:24,913][25689] Avg episode reward: [(0, '-60.019')] [2022-07-09 01:56:25,181][26022] Updated weights on worker 0-0, policy_version 36775 (0.00088) [2022-07-09 01:56:26,749][26022] Updated weights on worker 0-0, policy_version 36785 (0.00089) [2022-07-09 01:56:28,622][26022] Updated weights on worker 0-0, policy_version 36795 (0.00092) [2022-07-09 01:56:29,929][25689] Fps is (10 sec: 5713.9, 60 sec: 5634.2, 300 sec: 5630.6). Total num frames: 37685248. Throughput: 0: 5971.2. Samples: 37688914. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 01:56:29,930][25689] Avg episode reward: [(0, '-59.721')] [2022-07-09 01:56:30,503][26022] Updated weights on worker 0-0, policy_version 36805 (0.00097) [2022-07-09 01:56:32,119][26022] Updated weights on worker 0-0, policy_version 36815 (0.00088) [2022-07-09 01:56:34,077][26022] Updated weights on worker 0-0, policy_version 36825 (0.00084) [2022-07-09 01:56:34,978][25689] Fps is (10 sec: 5798.8, 60 sec: 5647.6, 300 sec: 5627.1). Total num frames: 37713920. Throughput: 0: 5938.5. Samples: 37722756. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:56:34,979][25689] Avg episode reward: [(0, '-59.407')] [2022-07-09 01:56:36,000][26022] Updated weights on worker 0-0, policy_version 36835 (0.00090) [2022-07-09 01:56:37,586][26022] Updated weights on worker 0-0, policy_version 36845 (0.00096) [2022-07-09 01:56:39,581][26022] Updated weights on worker 0-0, policy_version 36855 (0.00090) [2022-07-09 01:56:40,023][25689] Fps is (10 sec: 5478.3, 60 sec: 5616.0, 300 sec: 5626.4). Total num frames: 37740544. Throughput: 0: 5068.1. Samples: 37739444. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:56:40,023][25689] Avg episode reward: [(0, '-59.659')] [2022-07-09 01:56:41,200][26022] Updated weights on worker 0-0, policy_version 36865 (0.00081) [2022-07-09 01:56:43,119][26022] Updated weights on worker 0-0, policy_version 36875 (0.00095) [2022-07-09 01:56:44,747][26022] Updated weights on worker 0-0, policy_version 36885 (0.00088) [2022-07-09 01:56:45,035][25689] Fps is (10 sec: 5701.9, 60 sec: 5649.6, 300 sec: 5630.1). Total num frames: 37771264. Throughput: 0: 5923.8. Samples: 37773762. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:56:45,036][25689] Avg episode reward: [(0, '-59.936')] [2022-07-09 01:56:46,906][26022] Updated weights on worker 0-0, policy_version 36895 (0.00086) [2022-07-09 01:56:48,628][26022] Updated weights on worker 0-0, policy_version 36905 (0.00096) [2022-07-09 01:56:50,054][25689] Fps is (10 sec: 5818.5, 60 sec: 5632.0, 300 sec: 5623.2). Total num frames: 37798912. Throughput: 0: 5922.4. Samples: 37808102. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:56:50,056][25689] Avg episode reward: [(0, '-59.565')] [2022-07-09 01:56:50,207][26022] Updated weights on worker 0-0, policy_version 36915 (0.00085) [2022-07-09 01:56:52,269][26022] Updated weights on worker 0-0, policy_version 36925 (0.00079) [2022-07-09 01:56:53,799][26022] Updated weights on worker 0-0, policy_version 36935 (0.00087) [2022-07-09 01:56:55,076][25689] Fps is (10 sec: 5507.3, 60 sec: 5614.4, 300 sec: 5624.2). Total num frames: 37826560. Throughput: 0: 5093.6. Samples: 37825124. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:56:55,077][25689] Avg episode reward: [(0, '-60.394')] [2022-07-09 01:56:55,871][26022] Updated weights on worker 0-0, policy_version 36945 (0.00083) [2022-07-09 01:56:57,466][26022] Updated weights on worker 0-0, policy_version 36955 (0.00087) [2022-07-09 01:56:59,394][26022] Updated weights on worker 0-0, policy_version 36965 (0.00089) [2022-07-09 01:57:00,166][25689] Fps is (10 sec: 5772.1, 60 sec: 5679.5, 300 sec: 5633.9). Total num frames: 37857280. Throughput: 0: 5947.1. Samples: 37859240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:57:00,167][25689] Avg episode reward: [(0, '-60.824')] [2022-07-09 01:57:01,190][26022] Updated weights on worker 0-0, policy_version 36975 (0.00087) [2022-07-09 01:57:03,367][26022] Updated weights on worker 0-0, policy_version 36985 (0.00083) [2022-07-09 01:57:05,172][25689] Fps is (10 sec: 5477.0, 60 sec: 5631.1, 300 sec: 5623.9). Total num frames: 37881856. Throughput: 0: 5834.2. Samples: 37891242. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:57:05,174][25689] Avg episode reward: [(0, '-60.985')] [2022-07-09 01:57:05,183][26022] Updated weights on worker 0-0, policy_version 36995 (0.00087) [2022-07-09 01:57:07,128][26022] Updated weights on worker 0-0, policy_version 37005 (0.00389) [2022-07-09 01:57:08,898][26022] Updated weights on worker 0-0, policy_version 37015 (0.00089) [2022-07-09 01:57:10,193][25689] Fps is (10 sec: 5310.5, 60 sec: 5649.4, 300 sec: 5624.0). Total num frames: 37910528. Throughput: 0: 4955.7. Samples: 37907906. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:57:10,194][25689] Avg episode reward: [(0, '-60.938')] [2022-07-09 01:57:10,731][26022] Updated weights on worker 0-0, policy_version 37025 (0.00081) [2022-07-09 01:57:12,343][26022] Updated weights on worker 0-0, policy_version 37035 (0.00080) [2022-07-09 01:57:14,212][26022] Updated weights on worker 0-0, policy_version 37045 (0.00087) [2022-07-09 01:57:15,230][25689] Fps is (10 sec: 5803.1, 60 sec: 5663.3, 300 sec: 5631.1). Total num frames: 37940224. Throughput: 0: 5799.3. Samples: 37942004. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 01:57:15,230][25689] Avg episode reward: [(0, '-59.853')] [2022-07-09 01:57:16,087][26022] Updated weights on worker 0-0, policy_version 37055 (0.00084) [2022-07-09 01:57:17,965][26022] Updated weights on worker 0-0, policy_version 37065 (0.00089) [2022-07-09 01:57:19,713][26022] Updated weights on worker 0-0, policy_version 37075 (0.00080) [2022-07-09 01:57:20,342][25689] Fps is (10 sec: 5650.6, 60 sec: 5624.1, 300 sec: 5625.6). Total num frames: 37967872. Throughput: 0: 5792.6. Samples: 37976108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:20,342][25689] Avg episode reward: [(0, '-60.300')] [2022-07-09 01:57:21,433][26022] Updated weights on worker 0-0, policy_version 37085 (0.00085) [2022-07-09 01:57:23,215][26022] Updated weights on worker 0-0, policy_version 37095 (0.00084) [2022-07-09 01:57:24,919][26022] Updated weights on worker 0-0, policy_version 37105 (0.00095) [2022-07-09 01:57:25,427][25689] Fps is (10 sec: 5623.5, 60 sec: 5651.8, 300 sec: 5624.0). Total num frames: 37997568. Throughput: 0: 5043.4. Samples: 37993398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:25,427][25689] Avg episode reward: [(0, '-59.370')] [2022-07-09 01:57:26,788][26022] Updated weights on worker 0-0, policy_version 37115 (0.00089) [2022-07-09 01:57:28,542][26022] Updated weights on worker 0-0, policy_version 37125 (0.00099) [2022-07-09 01:57:30,465][25689] Fps is (10 sec: 5664.2, 60 sec: 5615.9, 300 sec: 5627.1). Total num frames: 38025216. Throughput: 0: 5901.2. Samples: 38027536. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:30,466][25689] Avg episode reward: [(0, '-59.691')] [2022-07-09 01:57:30,483][26022] Updated weights on worker 0-0, policy_version 37135 (0.00085) [2022-07-09 01:57:32,131][26022] Updated weights on worker 0-0, policy_version 37145 (0.00086) [2022-07-09 01:57:33,978][26022] Updated weights on worker 0-0, policy_version 37155 (0.00082) [2022-07-09 01:57:35,482][25689] Fps is (10 sec: 5703.2, 60 sec: 5635.9, 300 sec: 5631.1). Total num frames: 38054912. Throughput: 0: 5914.3. Samples: 38061780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:35,482][25689] Avg episode reward: [(0, '-59.006')] [2022-07-09 01:57:35,877][26022] Updated weights on worker 0-0, policy_version 37165 (0.00098) [2022-07-09 01:57:37,652][26022] Updated weights on worker 0-0, policy_version 37175 (0.00093) [2022-07-09 01:57:39,660][26022] Updated weights on worker 0-0, policy_version 37185 (0.00090) [2022-07-09 01:57:40,526][25689] Fps is (10 sec: 5699.9, 60 sec: 5652.9, 300 sec: 5623.4). Total num frames: 38082560. Throughput: 0: 5071.3. Samples: 38078468. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:40,526][25689] Avg episode reward: [(0, '-59.869')] [2022-07-09 01:57:41,148][26022] Updated weights on worker 0-0, policy_version 37195 (0.00082) [2022-07-09 01:57:43,357][26022] Updated weights on worker 0-0, policy_version 37205 (0.00087) [2022-07-09 01:57:44,709][26022] Updated weights on worker 0-0, policy_version 37215 (0.00087) [2022-07-09 01:57:45,541][25689] Fps is (10 sec: 5598.7, 60 sec: 5618.7, 300 sec: 5626.7). Total num frames: 38111232. Throughput: 0: 5938.4. Samples: 38112844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:45,542][25689] Avg episode reward: [(0, '-59.982')] [2022-07-09 01:57:46,860][26022] Updated weights on worker 0-0, policy_version 37225 (0.00088) [2022-07-09 01:57:48,375][26022] Updated weights on worker 0-0, policy_version 37235 (0.00074) [2022-07-09 01:57:50,287][26022] Updated weights on worker 0-0, policy_version 37245 (0.00085) [2022-07-09 01:57:50,557][25689] Fps is (10 sec: 5716.6, 60 sec: 5636.0, 300 sec: 5630.1). Total num frames: 38139904. Throughput: 0: 5942.3. Samples: 38146926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:50,558][25689] Avg episode reward: [(0, '-59.765')] [2022-07-09 01:57:52,095][26022] Updated weights on worker 0-0, policy_version 37255 (0.00091) [2022-07-09 01:57:53,896][26022] Updated weights on worker 0-0, policy_version 37265 (0.00101) [2022-07-09 01:57:55,568][25689] Fps is (10 sec: 5719.0, 60 sec: 5653.9, 300 sec: 5631.1). Total num frames: 38168576. Throughput: 0: 5086.5. Samples: 38163950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:57:55,568][25689] Avg episode reward: [(0, '-59.618')] [2022-07-09 01:57:55,913][26022] Updated weights on worker 0-0, policy_version 37275 (0.00087) [2022-07-09 01:57:57,386][26022] Updated weights on worker 0-0, policy_version 37285 (0.00083) [2022-07-09 01:57:59,410][26022] Updated weights on worker 0-0, policy_version 37295 (0.00089) [2022-07-09 01:58:00,602][25689] Fps is (10 sec: 5810.7, 60 sec: 5642.2, 300 sec: 5635.1). Total num frames: 38198272. Throughput: 0: 5962.2. Samples: 38198166. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:58:00,604][25689] Avg episode reward: [(0, '-59.465')] [2022-07-09 01:58:00,957][26022] Updated weights on worker 0-0, policy_version 37305 (0.00080) [2022-07-09 01:58:03,313][26022] Updated weights on worker 0-0, policy_version 37315 (0.00082) [2022-07-09 01:58:04,939][26022] Updated weights on worker 0-0, policy_version 37325 (0.00086) [2022-07-09 01:58:05,627][25689] Fps is (10 sec: 5497.3, 60 sec: 5657.4, 300 sec: 5628.7). Total num frames: 38223872. Throughput: 0: 5871.8. Samples: 38230782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 01:58:05,629][25689] Avg episode reward: [(0, '-59.432')] [2022-07-09 01:58:06,760][26022] Updated weights on worker 0-0, policy_version 37335 (0.00093) [2022-07-09 01:58:08,662][26022] Updated weights on worker 0-0, policy_version 37345 (0.00086) [2022-07-09 01:58:10,515][26022] Updated weights on worker 0-0, policy_version 37355 (0.00107) [2022-07-09 01:58:10,635][25689] Fps is (10 sec: 5307.5, 60 sec: 5641.7, 300 sec: 5629.8). Total num frames: 38251520. Throughput: 0: 5016.6. Samples: 38247648. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:10,636][25689] Avg episode reward: [(0, '-59.553')] [2022-07-09 01:58:12,157][26022] Updated weights on worker 0-0, policy_version 37365 (0.00078) [2022-07-09 01:58:14,116][26022] Updated weights on worker 0-0, policy_version 37375 (0.00081) [2022-07-09 01:58:15,651][25689] Fps is (10 sec: 5720.7, 60 sec: 5643.6, 300 sec: 5634.2). Total num frames: 38281216. Throughput: 0: 5865.1. Samples: 38281738. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:15,651][25689] Avg episode reward: [(0, '-59.635')] [2022-07-09 01:58:15,825][26022] Updated weights on worker 0-0, policy_version 37385 (0.00092) [2022-07-09 01:58:17,640][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 01:58:17,654][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000037394_38291456.pth [2022-07-09 01:58:17,654][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000035414_36263936.pth [2022-07-09 01:58:17,775][26022] Updated weights on worker 0-0, policy_version 37395 (0.00085) [2022-07-09 01:58:19,367][26022] Updated weights on worker 0-0, policy_version 37405 (0.00088) [2022-07-09 01:58:20,747][25689] Fps is (10 sec: 5670.6, 60 sec: 5645.0, 300 sec: 5629.1). Total num frames: 38308864. Throughput: 0: 5843.6. Samples: 38315886. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:20,748][25689] Avg episode reward: [(0, '-59.266')] [2022-07-09 01:58:21,254][26022] Updated weights on worker 0-0, policy_version 37415 (0.00084) [2022-07-09 01:58:22,914][26022] Updated weights on worker 0-0, policy_version 37425 (0.00085) [2022-07-09 01:58:24,803][26022] Updated weights on worker 0-0, policy_version 37435 (0.00088) [2022-07-09 01:58:25,755][25689] Fps is (10 sec: 5776.7, 60 sec: 5669.3, 300 sec: 5636.1). Total num frames: 38339584. Throughput: 0: 5087.1. Samples: 38333174. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:25,755][25689] Avg episode reward: [(0, '-59.406')] [2022-07-09 01:58:26,699][26022] Updated weights on worker 0-0, policy_version 37445 (0.00083) [2022-07-09 01:58:28,565][26022] Updated weights on worker 0-0, policy_version 37455 (0.00089) [2022-07-09 01:58:30,298][26022] Updated weights on worker 0-0, policy_version 37465 (0.00088) [2022-07-09 01:58:30,773][25689] Fps is (10 sec: 5719.8, 60 sec: 5654.2, 300 sec: 5635.8). Total num frames: 38366208. Throughput: 0: 5926.3. Samples: 38366992. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:30,773][25689] Avg episode reward: [(0, '-60.667')] [2022-07-09 01:58:32,031][26022] Updated weights on worker 0-0, policy_version 37475 (0.00083) [2022-07-09 01:58:34,074][26022] Updated weights on worker 0-0, policy_version 37485 (0.00087) [2022-07-09 01:58:35,773][26022] Updated weights on worker 0-0, policy_version 37495 (0.00083) [2022-07-09 01:58:35,814][25689] Fps is (10 sec: 5497.0, 60 sec: 5634.9, 300 sec: 5632.3). Total num frames: 38394880. Throughput: 0: 5921.6. Samples: 38401136. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:35,814][25689] Avg episode reward: [(0, '-59.842')] [2022-07-09 01:58:37,519][26022] Updated weights on worker 0-0, policy_version 37505 (0.00097) [2022-07-09 01:58:39,484][26022] Updated weights on worker 0-0, policy_version 37515 (0.00090) [2022-07-09 01:58:40,889][25689] Fps is (10 sec: 5770.0, 60 sec: 5666.0, 300 sec: 5634.6). Total num frames: 38424576. Throughput: 0: 5063.0. Samples: 38417862. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:40,889][25689] Avg episode reward: [(0, '-60.056')] [2022-07-09 01:58:40,890][26022] Updated weights on worker 0-0, policy_version 37525 (0.00095) [2022-07-09 01:58:43,071][26022] Updated weights on worker 0-0, policy_version 37535 (0.00083) [2022-07-09 01:58:44,830][26022] Updated weights on worker 0-0, policy_version 37545 (0.00089) [2022-07-09 01:58:45,914][25689] Fps is (10 sec: 5576.1, 60 sec: 5631.1, 300 sec: 5630.7). Total num frames: 38451200. Throughput: 0: 5900.2. Samples: 38452120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:45,915][25689] Avg episode reward: [(0, '-60.553')] [2022-07-09 01:58:46,491][26022] Updated weights on worker 0-0, policy_version 37555 (0.00081) [2022-07-09 01:58:48,423][26022] Updated weights on worker 0-0, policy_version 37565 (0.00084) [2022-07-09 01:58:50,084][26022] Updated weights on worker 0-0, policy_version 37575 (0.00089) [2022-07-09 01:58:50,939][25689] Fps is (10 sec: 5502.0, 60 sec: 5630.3, 300 sec: 5630.9). Total num frames: 38479872. Throughput: 0: 5918.8. Samples: 38486350. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:50,939][25689] Avg episode reward: [(0, '-61.023')] [2022-07-09 01:58:51,987][26022] Updated weights on worker 0-0, policy_version 37585 (0.00084) [2022-07-09 01:58:53,952][26022] Updated weights on worker 0-0, policy_version 37595 (0.00086) [2022-07-09 01:58:55,491][26022] Updated weights on worker 0-0, policy_version 37605 (0.00080) [2022-07-09 01:58:55,967][25689] Fps is (10 sec: 5704.4, 60 sec: 5628.7, 300 sec: 5628.1). Total num frames: 38508544. Throughput: 0: 5055.2. Samples: 38503012. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 01:58:55,967][25689] Avg episode reward: [(0, '-61.113')] [2022-07-09 01:58:57,584][26022] Updated weights on worker 0-0, policy_version 37615 (0.00050) [2022-07-09 01:58:59,276][26022] Updated weights on worker 0-0, policy_version 37625 (0.00090) [2022-07-09 01:59:01,039][25689] Fps is (10 sec: 5677.1, 60 sec: 5608.2, 300 sec: 5640.7). Total num frames: 38537216. Throughput: 0: 5922.1. Samples: 38537198. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:01,040][25689] Avg episode reward: [(0, '-60.442')] [2022-07-09 01:59:01,128][26022] Updated weights on worker 0-0, policy_version 37635 (0.00086) [2022-07-09 01:59:03,134][26022] Updated weights on worker 0-0, policy_version 37645 (0.00082) [2022-07-09 01:59:05,076][26022] Updated weights on worker 0-0, policy_version 37655 (0.00096) [2022-07-09 01:59:06,071][25689] Fps is (10 sec: 5472.7, 60 sec: 5624.5, 300 sec: 5634.1). Total num frames: 38563840. Throughput: 0: 5808.8. Samples: 38569206. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:06,071][25689] Avg episode reward: [(0, '-61.340')] [2022-07-09 01:59:06,783][26022] Updated weights on worker 0-0, policy_version 37665 (0.00089) [2022-07-09 01:59:08,822][26022] Updated weights on worker 0-0, policy_version 37675 (0.00084) [2022-07-09 01:59:10,358][26022] Updated weights on worker 0-0, policy_version 37685 (0.00094) [2022-07-09 01:59:11,080][25689] Fps is (10 sec: 5507.5, 60 sec: 5641.4, 300 sec: 5641.5). Total num frames: 38592512. Throughput: 0: 4944.4. Samples: 38585934. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:11,081][25689] Avg episode reward: [(0, '-60.534')] [2022-07-09 01:59:12,383][26022] Updated weights on worker 0-0, policy_version 37695 (0.00086) [2022-07-09 01:59:13,963][26022] Updated weights on worker 0-0, policy_version 37705 (0.00093) [2022-07-09 01:59:15,948][26022] Updated weights on worker 0-0, policy_version 37715 (0.00086) [2022-07-09 01:59:16,087][25689] Fps is (10 sec: 5622.7, 60 sec: 5608.3, 300 sec: 5628.4). Total num frames: 38620160. Throughput: 0: 5823.6. Samples: 38620186. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:16,089][25689] Avg episode reward: [(0, '-60.553')] [2022-07-09 01:59:17,576][26022] Updated weights on worker 0-0, policy_version 37725 (0.00088) [2022-07-09 01:59:19,595][26022] Updated weights on worker 0-0, policy_version 37735 (0.00091) [2022-07-09 01:59:21,187][25689] Fps is (10 sec: 5673.7, 60 sec: 5641.9, 300 sec: 5640.7). Total num frames: 38649856. Throughput: 0: 5807.8. Samples: 38654208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:21,187][25689] Avg episode reward: [(0, '-58.969')] [2022-07-09 01:59:21,376][26022] Updated weights on worker 0-0, policy_version 37745 (0.00088) [2022-07-09 01:59:23,107][26022] Updated weights on worker 0-0, policy_version 37755 (0.00088) [2022-07-09 01:59:24,896][26022] Updated weights on worker 0-0, policy_version 37765 (0.00092) [2022-07-09 01:59:26,212][25689] Fps is (10 sec: 5562.8, 60 sec: 5572.5, 300 sec: 5633.9). Total num frames: 38676480. Throughput: 0: 5069.3. Samples: 38671306. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:26,212][25689] Avg episode reward: [(0, '-60.498')] [2022-07-09 01:59:26,773][26022] Updated weights on worker 0-0, policy_version 37775 (0.00087) [2022-07-09 01:59:28,659][26022] Updated weights on worker 0-0, policy_version 37785 (0.00098) [2022-07-09 01:59:30,499][26022] Updated weights on worker 0-0, policy_version 37795 (0.00087) [2022-07-09 01:59:31,251][25689] Fps is (10 sec: 5697.9, 60 sec: 5638.3, 300 sec: 5640.8). Total num frames: 38707200. Throughput: 0: 5907.2. Samples: 38705088. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:31,251][25689] Avg episode reward: [(0, '-60.154')] [2022-07-09 01:59:32,115][26022] Updated weights on worker 0-0, policy_version 37805 (0.00084) [2022-07-09 01:59:34,057][26022] Updated weights on worker 0-0, policy_version 37815 (0.00080) [2022-07-09 01:59:35,823][26022] Updated weights on worker 0-0, policy_version 37825 (0.00089) [2022-07-09 01:59:36,319][25689] Fps is (10 sec: 5673.4, 60 sec: 5601.9, 300 sec: 5633.6). Total num frames: 38733824. Throughput: 0: 5883.0. Samples: 38739210. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:36,320][25689] Avg episode reward: [(0, '-59.792')] [2022-07-09 01:59:37,634][26022] Updated weights on worker 0-0, policy_version 37835 (0.00086) [2022-07-09 01:59:39,616][26022] Updated weights on worker 0-0, policy_version 37845 (0.00087) [2022-07-09 01:59:41,354][25689] Fps is (10 sec: 5574.2, 60 sec: 5605.6, 300 sec: 5633.5). Total num frames: 38763520. Throughput: 0: 5876.7. Samples: 38772728. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 01:59:41,355][25689] Avg episode reward: [(0, '-59.903')] [2022-07-09 01:59:41,357][26022] Updated weights on worker 0-0, policy_version 37855 (0.00082) [2022-07-09 01:59:43,235][26022] Updated weights on worker 0-0, policy_version 37865 (0.00083) [2022-07-09 01:59:44,879][26022] Updated weights on worker 0-0, policy_version 37875 (0.00088) [2022-07-09 01:59:46,367][25689] Fps is (10 sec: 5808.6, 60 sec: 5640.6, 300 sec: 5637.0). Total num frames: 38792192. Throughput: 0: 5898.9. Samples: 38790204. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:59:46,368][25689] Avg episode reward: [(0, '-59.568')] [2022-07-09 01:59:46,818][26022] Updated weights on worker 0-0, policy_version 37885 (0.00086) [2022-07-09 01:59:48,411][26022] Updated weights on worker 0-0, policy_version 37895 (0.00086) [2022-07-09 01:59:50,256][26022] Updated weights on worker 0-0, policy_version 37905 (0.00090) [2022-07-09 01:59:51,416][25689] Fps is (10 sec: 5801.0, 60 sec: 5655.3, 300 sec: 5639.7). Total num frames: 38821888. Throughput: 0: 5926.9. Samples: 38824606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:59:51,416][25689] Avg episode reward: [(0, '-59.611')] [2022-07-09 01:59:51,995][26022] Updated weights on worker 0-0, policy_version 37915 (0.00099) [2022-07-09 01:59:53,836][26022] Updated weights on worker 0-0, policy_version 37925 (0.00054) [2022-07-09 01:59:55,767][26022] Updated weights on worker 0-0, policy_version 37935 (0.00966) [2022-07-09 01:59:56,430][25689] Fps is (10 sec: 5699.0, 60 sec: 5639.7, 300 sec: 5637.7). Total num frames: 38849536. Throughput: 0: 5939.4. Samples: 38858654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 01:59:56,430][25689] Avg episode reward: [(0, '-59.369')] [2022-07-09 01:59:57,499][26022] Updated weights on worker 0-0, policy_version 37945 (0.00082) [2022-07-09 01:59:59,350][26022] Updated weights on worker 0-0, policy_version 37955 (0.00092) [2022-07-09 02:00:01,112][26022] Updated weights on worker 0-0, policy_version 37965 (0.00089) [2022-07-09 02:00:01,567][25689] Fps is (10 sec: 5548.4, 60 sec: 5633.7, 300 sec: 5643.3). Total num frames: 38878208. Throughput: 0: 5088.8. Samples: 38875584. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:01,567][25689] Avg episode reward: [(0, '-59.925')] [2022-07-09 02:00:03,342][26022] Updated weights on worker 0-0, policy_version 37975 (0.00085) [2022-07-09 02:00:05,067][26022] Updated weights on worker 0-0, policy_version 37985 (0.00090) [2022-07-09 02:00:06,578][25689] Fps is (10 sec: 5347.6, 60 sec: 5618.6, 300 sec: 5633.0). Total num frames: 38903808. Throughput: 0: 5789.5. Samples: 38907216. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:06,579][25689] Avg episode reward: [(0, '-60.492')] [2022-07-09 02:00:06,910][26022] Updated weights on worker 0-0, policy_version 37995 (0.00082) [2022-07-09 02:00:08,827][26022] Updated weights on worker 0-0, policy_version 38005 (0.00096) [2022-07-09 02:00:10,632][26022] Updated weights on worker 0-0, policy_version 38015 (0.00085) [2022-07-09 02:00:11,585][25689] Fps is (10 sec: 5519.4, 60 sec: 5635.7, 300 sec: 5637.1). Total num frames: 38933504. Throughput: 0: 5768.3. Samples: 38940948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:11,586][25689] Avg episode reward: [(0, '-60.316')] [2022-07-09 02:00:12,332][26022] Updated weights on worker 0-0, policy_version 38025 (0.00089) [2022-07-09 02:00:14,295][26022] Updated weights on worker 0-0, policy_version 38035 (0.00088) [2022-07-09 02:00:15,914][26022] Updated weights on worker 0-0, policy_version 38045 (0.00086) [2022-07-09 02:00:16,660][25689] Fps is (10 sec: 5586.4, 60 sec: 5612.6, 300 sec: 5628.0). Total num frames: 38960128. Throughput: 0: 4909.8. Samples: 38957982. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:16,661][25689] Avg episode reward: [(0, '-61.193')] [2022-07-09 02:00:17,675][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:00:17,691][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000038053_38966272.pth [2022-07-09 02:00:17,692][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000036072_36937728.pth [2022-07-09 02:00:17,899][26022] Updated weights on worker 0-0, policy_version 38055 (0.00084) [2022-07-09 02:00:19,747][26022] Updated weights on worker 0-0, policy_version 38065 (0.00081) [2022-07-09 02:00:21,387][26022] Updated weights on worker 0-0, policy_version 38075 (0.00091) [2022-07-09 02:00:21,778][25689] Fps is (10 sec: 5726.2, 60 sec: 5644.6, 300 sec: 5640.0). Total num frames: 38991872. Throughput: 0: 5760.3. Samples: 38992008. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:21,779][25689] Avg episode reward: [(0, '-60.890')] [2022-07-09 02:00:23,187][26022] Updated weights on worker 0-0, policy_version 38085 (0.00082) [2022-07-09 02:00:24,969][26022] Updated weights on worker 0-0, policy_version 38095 (0.00088) [2022-07-09 02:00:26,820][25689] Fps is (10 sec: 5745.1, 60 sec: 5643.1, 300 sec: 5629.4). Total num frames: 39018496. Throughput: 0: 5889.7. Samples: 39026432. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:26,821][25689] Avg episode reward: [(0, '-60.556')] [2022-07-09 02:00:26,832][26022] Updated weights on worker 0-0, policy_version 38105 (0.00085) [2022-07-09 02:00:28,609][26022] Updated weights on worker 0-0, policy_version 38115 (0.00083) [2022-07-09 02:00:30,384][26022] Updated weights on worker 0-0, policy_version 38125 (0.00092) [2022-07-09 02:00:31,896][25689] Fps is (10 sec: 5465.1, 60 sec: 5605.8, 300 sec: 5631.5). Total num frames: 39047168. Throughput: 0: 5043.0. Samples: 39043378. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:00:31,897][25689] Avg episode reward: [(0, '-60.183')] [2022-07-09 02:00:32,174][26022] Updated weights on worker 0-0, policy_version 38135 (0.00094) [2022-07-09 02:00:33,958][26022] Updated weights on worker 0-0, policy_version 38145 (0.00089) [2022-07-09 02:00:35,775][26022] Updated weights on worker 0-0, policy_version 38155 (0.00087) [2022-07-09 02:00:36,947][25689] Fps is (10 sec: 5763.3, 60 sec: 5658.1, 300 sec: 5635.3). Total num frames: 39076864. Throughput: 0: 5888.2. Samples: 39077436. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:00:36,948][25689] Avg episode reward: [(0, '-59.856')] [2022-07-09 02:00:37,619][26022] Updated weights on worker 0-0, policy_version 38165 (0.00086) [2022-07-09 02:00:39,479][26022] Updated weights on worker 0-0, policy_version 38175 (0.00090) [2022-07-09 02:00:41,331][26022] Updated weights on worker 0-0, policy_version 38185 (0.00093) [2022-07-09 02:00:42,074][25689] Fps is (10 sec: 5734.7, 60 sec: 5632.7, 300 sec: 5633.1). Total num frames: 39105536. Throughput: 0: 5867.4. Samples: 39111092. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:00:42,075][25689] Avg episode reward: [(0, '-60.919')] [2022-07-09 02:00:43,240][26022] Updated weights on worker 0-0, policy_version 38195 (0.00087) [2022-07-09 02:00:44,978][26022] Updated weights on worker 0-0, policy_version 38205 (0.00090) [2022-07-09 02:00:46,714][26022] Updated weights on worker 0-0, policy_version 38215 (0.00088) [2022-07-09 02:00:47,076][25689] Fps is (10 sec: 5560.5, 60 sec: 5616.9, 300 sec: 5629.9). Total num frames: 39133184. Throughput: 0: 5035.3. Samples: 39128428. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:00:47,076][25689] Avg episode reward: [(0, '-61.498')] [2022-07-09 02:00:48,634][26022] Updated weights on worker 0-0, policy_version 38225 (0.00085) [2022-07-09 02:00:50,126][26022] Updated weights on worker 0-0, policy_version 38235 (0.00085) [2022-07-09 02:00:52,132][25689] Fps is (10 sec: 5599.5, 60 sec: 5599.3, 300 sec: 5629.1). Total num frames: 39161856. Throughput: 0: 5890.1. Samples: 39162570. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:00:52,133][25689] Avg episode reward: [(0, '-60.881')] [2022-07-09 02:00:52,191][26022] Updated weights on worker 0-0, policy_version 38245 (0.00099) [2022-07-09 02:00:53,931][26022] Updated weights on worker 0-0, policy_version 38255 (0.00089) [2022-07-09 02:00:55,859][26022] Updated weights on worker 0-0, policy_version 38265 (0.00089) [2022-07-09 02:00:57,175][25689] Fps is (10 sec: 5779.2, 60 sec: 5630.3, 300 sec: 5639.8). Total num frames: 39191552. Throughput: 0: 5887.6. Samples: 39196532. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:00:57,176][25689] Avg episode reward: [(0, '-61.248')] [2022-07-09 02:00:57,583][26022] Updated weights on worker 0-0, policy_version 38275 (0.00087) [2022-07-09 02:00:59,265][26022] Updated weights on worker 0-0, policy_version 38285 (0.00073) [2022-07-09 02:01:01,169][26022] Updated weights on worker 0-0, policy_version 38295 (0.00082) [2022-07-09 02:01:02,212][25689] Fps is (10 sec: 5485.9, 60 sec: 5589.0, 300 sec: 5632.8). Total num frames: 39217152. Throughput: 0: 5093.4. Samples: 39213662. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:01:02,212][25689] Avg episode reward: [(0, '-61.479')] [2022-07-09 02:01:03,295][26022] Updated weights on worker 0-0, policy_version 38305 (0.00110) [2022-07-09 02:01:05,268][26022] Updated weights on worker 0-0, policy_version 38315 (0.00085) [2022-07-09 02:01:07,020][26022] Updated weights on worker 0-0, policy_version 38325 (0.00085) [2022-07-09 02:01:07,224][25689] Fps is (10 sec: 5400.7, 60 sec: 5639.6, 300 sec: 5636.7). Total num frames: 39245824. Throughput: 0: 5822.0. Samples: 39245734. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:01:07,225][25689] Avg episode reward: [(0, '-60.659')] [2022-07-09 02:01:08,607][26022] Updated weights on worker 0-0, policy_version 38335 (0.00081) [2022-07-09 02:01:10,372][26022] Updated weights on worker 0-0, policy_version 38345 (0.00083) [2022-07-09 02:01:12,262][25689] Fps is (10 sec: 5807.6, 60 sec: 5636.7, 300 sec: 5639.5). Total num frames: 39275520. Throughput: 0: 5836.3. Samples: 39280054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:01:12,263][25689] Avg episode reward: [(0, '-59.886')] [2022-07-09 02:01:12,268][26022] Updated weights on worker 0-0, policy_version 38355 (0.00080) [2022-07-09 02:01:14,223][26022] Updated weights on worker 0-0, policy_version 38365 (0.00088) [2022-07-09 02:01:15,984][26022] Updated weights on worker 0-0, policy_version 38375 (0.00091) [2022-07-09 02:01:17,281][25689] Fps is (10 sec: 5702.3, 60 sec: 5658.8, 300 sec: 5633.3). Total num frames: 39303168. Throughput: 0: 5002.3. Samples: 39297106. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:01:17,281][25689] Avg episode reward: [(0, '-59.811')] [2022-07-09 02:01:17,724][26022] Updated weights on worker 0-0, policy_version 38385 (0.00083) [2022-07-09 02:01:19,627][26022] Updated weights on worker 0-0, policy_version 38395 (0.00083) [2022-07-09 02:01:21,193][26022] Updated weights on worker 0-0, policy_version 38405 (0.00087) [2022-07-09 02:01:22,426][25689] Fps is (10 sec: 5541.3, 60 sec: 5605.6, 300 sec: 5634.3). Total num frames: 39331840. Throughput: 0: 5814.5. Samples: 39331196. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 02:01:22,426][25689] Avg episode reward: [(0, '-60.335')] [2022-07-09 02:01:23,403][26022] Updated weights on worker 0-0, policy_version 38415 (0.00089) [2022-07-09 02:01:24,802][26022] Updated weights on worker 0-0, policy_version 38425 (0.00086) [2022-07-09 02:01:26,955][26022] Updated weights on worker 0-0, policy_version 38435 (0.00088) [2022-07-09 02:01:27,446][25689] Fps is (10 sec: 5842.4, 60 sec: 5675.1, 300 sec: 5637.7). Total num frames: 39362560. Throughput: 0: 5908.5. Samples: 39365216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:27,447][25689] Avg episode reward: [(0, '-60.262')] [2022-07-09 02:01:28,596][26022] Updated weights on worker 0-0, policy_version 38445 (0.00085) [2022-07-09 02:01:30,479][26022] Updated weights on worker 0-0, policy_version 38455 (0.00085) [2022-07-09 02:01:32,247][26022] Updated weights on worker 0-0, policy_version 38465 (0.00081) [2022-07-09 02:01:32,526][25689] Fps is (10 sec: 5677.4, 60 sec: 5641.0, 300 sec: 5630.2). Total num frames: 39389184. Throughput: 0: 5882.2. Samples: 39399252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:32,527][25689] Avg episode reward: [(0, '-61.925')] [2022-07-09 02:01:34,137][26022] Updated weights on worker 0-0, policy_version 38475 (0.00090) [2022-07-09 02:01:35,731][26022] Updated weights on worker 0-0, policy_version 38485 (0.00077) [2022-07-09 02:01:37,572][25689] Fps is (10 sec: 5461.0, 60 sec: 5624.6, 300 sec: 5637.1). Total num frames: 39417856. Throughput: 0: 5892.4. Samples: 39416670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:37,573][25689] Avg episode reward: [(0, '-61.805')] [2022-07-09 02:01:37,623][26022] Updated weights on worker 0-0, policy_version 38495 (0.00089) [2022-07-09 02:01:39,231][26022] Updated weights on worker 0-0, policy_version 38505 (0.00082) [2022-07-09 02:01:41,048][26022] Updated weights on worker 0-0, policy_version 38515 (0.00084) [2022-07-09 02:01:42,613][25689] Fps is (10 sec: 5786.8, 60 sec: 5649.6, 300 sec: 5633.1). Total num frames: 39447552. Throughput: 0: 5941.6. Samples: 39451138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:42,613][25689] Avg episode reward: [(0, '-62.521')] [2022-07-09 02:01:42,901][26022] Updated weights on worker 0-0, policy_version 38525 (0.00082) [2022-07-09 02:01:44,534][26022] Updated weights on worker 0-0, policy_version 38535 (0.00084) [2022-07-09 02:01:46,539][26022] Updated weights on worker 0-0, policy_version 38545 (0.00084) [2022-07-09 02:01:47,649][25689] Fps is (10 sec: 5893.6, 60 sec: 5680.1, 300 sec: 5639.7). Total num frames: 39477248. Throughput: 0: 5958.3. Samples: 39485592. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:47,651][25689] Avg episode reward: [(0, '-62.159')] [2022-07-09 02:01:48,227][26022] Updated weights on worker 0-0, policy_version 38555 (0.00082) [2022-07-09 02:01:50,087][26022] Updated weights on worker 0-0, policy_version 38565 (0.00080) [2022-07-09 02:01:52,002][26022] Updated weights on worker 0-0, policy_version 38575 (0.00084) [2022-07-09 02:01:52,683][25689] Fps is (10 sec: 5592.8, 60 sec: 5648.4, 300 sec: 5636.0). Total num frames: 39503872. Throughput: 0: 5126.8. Samples: 39502590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:52,683][25689] Avg episode reward: [(0, '-61.068')] [2022-07-09 02:01:53,736][26022] Updated weights on worker 0-0, policy_version 38585 (0.00081) [2022-07-09 02:01:55,602][26022] Updated weights on worker 0-0, policy_version 38595 (0.00081) [2022-07-09 02:01:57,323][26022] Updated weights on worker 0-0, policy_version 38605 (0.00088) [2022-07-09 02:01:57,699][25689] Fps is (10 sec: 5604.4, 60 sec: 5651.0, 300 sec: 5634.0). Total num frames: 39533568. Throughput: 0: 5983.5. Samples: 39537098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:01:57,700][25689] Avg episode reward: [(0, '-61.210')] [2022-07-09 02:01:59,131][26022] Updated weights on worker 0-0, policy_version 38615 (0.00081) [2022-07-09 02:02:00,880][26022] Updated weights on worker 0-0, policy_version 38625 (0.00084) [2022-07-09 02:02:02,765][25689] Fps is (10 sec: 5586.2, 60 sec: 5665.1, 300 sec: 5639.7). Total num frames: 39560192. Throughput: 0: 5855.5. Samples: 39569138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:02:02,765][25689] Avg episode reward: [(0, '-60.758')] [2022-07-09 02:02:03,146][26022] Updated weights on worker 0-0, policy_version 38635 (0.00086) [2022-07-09 02:02:04,729][26022] Updated weights on worker 0-0, policy_version 38645 (0.00092) [2022-07-09 02:02:06,629][26022] Updated weights on worker 0-0, policy_version 38655 (0.00112) [2022-07-09 02:02:07,820][25689] Fps is (10 sec: 5564.5, 60 sec: 5678.0, 300 sec: 5642.5). Total num frames: 39589888. Throughput: 0: 4989.9. Samples: 39586240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:02:07,820][25689] Avg episode reward: [(0, '-60.089')] [2022-07-09 02:02:08,318][26022] Updated weights on worker 0-0, policy_version 38665 (0.00091) [2022-07-09 02:02:10,265][26022] Updated weights on worker 0-0, policy_version 38675 (0.00087) [2022-07-09 02:02:11,779][26022] Updated weights on worker 0-0, policy_version 38685 (0.00093) [2022-07-09 02:02:12,827][25689] Fps is (10 sec: 5596.9, 60 sec: 5630.2, 300 sec: 5632.7). Total num frames: 39616512. Throughput: 0: 5859.6. Samples: 39620630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:02:12,828][25689] Avg episode reward: [(0, '-60.103')] [2022-07-09 02:02:13,911][26022] Updated weights on worker 0-0, policy_version 38695 (0.00092) [2022-07-09 02:02:15,594][26022] Updated weights on worker 0-0, policy_version 38705 (0.00087) [2022-07-09 02:02:17,536][26022] Updated weights on worker 0-0, policy_version 38715 (0.00092) [2022-07-09 02:02:17,839][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:02:17,844][25689] Fps is (10 sec: 5516.2, 60 sec: 5647.2, 300 sec: 5638.0). Total num frames: 39645184. Throughput: 0: 5826.6. Samples: 39654478. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:17,845][25689] Avg episode reward: [(0, '-60.931')] [2022-07-09 02:02:17,849][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000038717_39646208.pth [2022-07-09 02:02:17,850][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000036733_37614592.pth [2022-07-09 02:02:19,283][26022] Updated weights on worker 0-0, policy_version 38725 (0.00098) [2022-07-09 02:02:21,115][26022] Updated weights on worker 0-0, policy_version 38735 (0.00086) [2022-07-09 02:02:22,838][26022] Updated weights on worker 0-0, policy_version 38745 (0.00085) [2022-07-09 02:02:22,915][25689] Fps is (10 sec: 5786.3, 60 sec: 5671.1, 300 sec: 5638.3). Total num frames: 39674880. Throughput: 0: 5062.7. Samples: 39671152. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:22,915][25689] Avg episode reward: [(0, '-61.611')] [2022-07-09 02:02:24,967][26022] Updated weights on worker 0-0, policy_version 38755 (0.00096) [2022-07-09 02:02:26,318][26022] Updated weights on worker 0-0, policy_version 38765 (0.00085) [2022-07-09 02:02:27,917][25689] Fps is (10 sec: 5591.6, 60 sec: 5605.1, 300 sec: 5635.5). Total num frames: 39701504. Throughput: 0: 5923.9. Samples: 39705290. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:27,917][25689] Avg episode reward: [(0, '-61.444')] [2022-07-09 02:02:28,662][26022] Updated weights on worker 0-0, policy_version 38775 (0.00084) [2022-07-09 02:02:29,980][26022] Updated weights on worker 0-0, policy_version 38785 (0.00087) [2022-07-09 02:02:31,988][26022] Updated weights on worker 0-0, policy_version 38795 (0.00092) [2022-07-09 02:02:32,925][25689] Fps is (10 sec: 5728.2, 60 sec: 5679.6, 300 sec: 5639.1). Total num frames: 39732224. Throughput: 0: 5912.0. Samples: 39739448. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:32,926][25689] Avg episode reward: [(0, '-61.251')] [2022-07-09 02:02:33,730][26022] Updated weights on worker 0-0, policy_version 38805 (0.00085) [2022-07-09 02:02:35,393][26022] Updated weights on worker 0-0, policy_version 38815 (0.00080) [2022-07-09 02:02:37,327][26022] Updated weights on worker 0-0, policy_version 38825 (0.00096) [2022-07-09 02:02:37,939][25689] Fps is (10 sec: 5823.8, 60 sec: 5665.6, 300 sec: 5639.7). Total num frames: 39759872. Throughput: 0: 5085.7. Samples: 39756674. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:37,939][25689] Avg episode reward: [(0, '-61.299')] [2022-07-09 02:02:39,299][26022] Updated weights on worker 0-0, policy_version 38835 (0.00086) [2022-07-09 02:02:40,860][26022] Updated weights on worker 0-0, policy_version 38845 (0.00084) [2022-07-09 02:02:43,010][25689] Fps is (10 sec: 5381.5, 60 sec: 5611.9, 300 sec: 5631.7). Total num frames: 39786496. Throughput: 0: 5946.1. Samples: 39790642. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:43,011][25689] Avg episode reward: [(0, '-61.671')] [2022-07-09 02:02:43,017][26022] Updated weights on worker 0-0, policy_version 38855 (0.00085) [2022-07-09 02:02:44,327][26022] Updated weights on worker 0-0, policy_version 38865 (0.00085) [2022-07-09 02:02:46,303][26022] Updated weights on worker 0-0, policy_version 38875 (0.00091) [2022-07-09 02:02:47,974][26022] Updated weights on worker 0-0, policy_version 38885 (0.00083) [2022-07-09 02:02:48,087][25689] Fps is (10 sec: 5751.8, 60 sec: 5642.1, 300 sec: 5640.9). Total num frames: 39818240. Throughput: 0: 5948.6. Samples: 39825274. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:48,087][25689] Avg episode reward: [(0, '-61.374')] [2022-07-09 02:02:49,862][26022] Updated weights on worker 0-0, policy_version 38895 (0.00089) [2022-07-09 02:02:51,584][26022] Updated weights on worker 0-0, policy_version 38905 (0.00086) [2022-07-09 02:02:53,115][25689] Fps is (10 sec: 5979.1, 60 sec: 5676.5, 300 sec: 5640.6). Total num frames: 39846912. Throughput: 0: 5116.5. Samples: 39842748. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:53,115][25689] Avg episode reward: [(0, '-61.486')] [2022-07-09 02:02:53,543][26022] Updated weights on worker 0-0, policy_version 38915 (0.00088) [2022-07-09 02:02:55,111][26022] Updated weights on worker 0-0, policy_version 38925 (0.00084) [2022-07-09 02:02:57,089][26022] Updated weights on worker 0-0, policy_version 38935 (0.00081) [2022-07-09 02:02:58,129][25689] Fps is (10 sec: 5812.2, 60 sec: 5676.6, 300 sec: 5641.0). Total num frames: 39876608. Throughput: 0: 5974.9. Samples: 39877308. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 02:02:58,129][25689] Avg episode reward: [(0, '-61.659')] [2022-07-09 02:02:58,809][26022] Updated weights on worker 0-0, policy_version 38945 (0.00083) [2022-07-09 02:03:00,543][26022] Updated weights on worker 0-0, policy_version 38955 (0.00085) [2022-07-09 02:03:02,626][26022] Updated weights on worker 0-0, policy_version 38965 (0.00090) [2022-07-09 02:03:03,181][25689] Fps is (10 sec: 5493.4, 60 sec: 5661.0, 300 sec: 5640.4). Total num frames: 39902208. Throughput: 0: 5900.9. Samples: 39909666. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:03,181][25689] Avg episode reward: [(0, '-61.963')] [2022-07-09 02:03:04,463][26022] Updated weights on worker 0-0, policy_version 38975 (0.00092) [2022-07-09 02:03:06,383][26022] Updated weights on worker 0-0, policy_version 38985 (0.00087) [2022-07-09 02:03:08,031][26022] Updated weights on worker 0-0, policy_version 38995 (0.00094) [2022-07-09 02:03:08,209][25689] Fps is (10 sec: 5485.7, 60 sec: 5663.5, 300 sec: 5646.9). Total num frames: 39931904. Throughput: 0: 5055.2. Samples: 39926996. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:08,209][25689] Avg episode reward: [(0, '-61.973')] [2022-07-09 02:03:09,759][26022] Updated weights on worker 0-0, policy_version 39005 (0.00062) [2022-07-09 02:03:11,496][26022] Updated weights on worker 0-0, policy_version 39015 (0.00081) [2022-07-09 02:03:13,268][25689] Fps is (10 sec: 5888.0, 60 sec: 5709.6, 300 sec: 5646.1). Total num frames: 39961600. Throughput: 0: 5884.1. Samples: 39961330. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:13,268][25689] Avg episode reward: [(0, '-61.294')] [2022-07-09 02:03:13,275][26022] Updated weights on worker 0-0, policy_version 39025 (0.00085) [2022-07-09 02:03:15,163][26022] Updated weights on worker 0-0, policy_version 39035 (0.00094) [2022-07-09 02:03:17,075][26022] Updated weights on worker 0-0, policy_version 39045 (0.00084) [2022-07-09 02:03:18,363][25689] Fps is (10 sec: 5748.2, 60 sec: 5702.1, 300 sec: 5649.6). Total num frames: 39990272. Throughput: 0: 5851.6. Samples: 39995710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:18,364][25689] Avg episode reward: [(0, '-60.755')] [2022-07-09 02:03:18,854][26022] Updated weights on worker 0-0, policy_version 39055 (0.00089) [2022-07-09 02:03:20,675][26022] Updated weights on worker 0-0, policy_version 39065 (0.00091) [2022-07-09 02:03:22,436][26022] Updated weights on worker 0-0, policy_version 39075 (0.00086) [2022-07-09 02:03:23,460][25689] Fps is (10 sec: 5525.8, 60 sec: 5665.8, 300 sec: 5637.6). Total num frames: 40017920. Throughput: 0: 5077.0. Samples: 40012624. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:23,460][25689] Avg episode reward: [(0, '-60.343')] [2022-07-09 02:03:24,123][26022] Updated weights on worker 0-0, policy_version 39085 (0.00082) [2022-07-09 02:03:25,932][26022] Updated weights on worker 0-0, policy_version 39095 (0.00106) [2022-07-09 02:03:27,846][26022] Updated weights on worker 0-0, policy_version 39105 (0.00084) [2022-07-09 02:03:28,526][25689] Fps is (10 sec: 5541.7, 60 sec: 5693.6, 300 sec: 5643.5). Total num frames: 40046592. Throughput: 0: 5898.0. Samples: 40046826. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:28,527][25689] Avg episode reward: [(0, '-59.767')] [2022-07-09 02:03:29,554][26022] Updated weights on worker 0-0, policy_version 39115 (0.00080) [2022-07-09 02:03:31,345][26022] Updated weights on worker 0-0, policy_version 39125 (0.00082) [2022-07-09 02:03:33,270][26022] Updated weights on worker 0-0, policy_version 39135 (0.00092) [2022-07-09 02:03:33,533][25689] Fps is (10 sec: 5692.9, 60 sec: 5660.0, 300 sec: 5644.2). Total num frames: 40075264. Throughput: 0: 5904.4. Samples: 40080984. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:33,533][25689] Avg episode reward: [(0, '-60.457')] [2022-07-09 02:03:35,071][26022] Updated weights on worker 0-0, policy_version 39145 (0.00097) [2022-07-09 02:03:36,819][26022] Updated weights on worker 0-0, policy_version 39155 (0.00087) [2022-07-09 02:03:38,540][25689] Fps is (10 sec: 5726.4, 60 sec: 5677.5, 300 sec: 5642.0). Total num frames: 40103936. Throughput: 0: 5073.6. Samples: 40098080. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:38,540][25689] Avg episode reward: [(0, '-60.699')] [2022-07-09 02:03:38,583][26022] Updated weights on worker 0-0, policy_version 39165 (0.00083) [2022-07-09 02:03:40,516][26022] Updated weights on worker 0-0, policy_version 39175 (0.00089) [2022-07-09 02:03:42,241][26022] Updated weights on worker 0-0, policy_version 39185 (0.00085) [2022-07-09 02:03:43,627][25689] Fps is (10 sec: 5680.8, 60 sec: 5709.8, 300 sec: 5647.7). Total num frames: 40132608. Throughput: 0: 5916.4. Samples: 40131942. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:43,628][25689] Avg episode reward: [(0, '-61.156')] [2022-07-09 02:03:44,095][26022] Updated weights on worker 0-0, policy_version 39195 (0.00086) [2022-07-09 02:03:45,917][26022] Updated weights on worker 0-0, policy_version 39205 (0.00085) [2022-07-09 02:03:47,754][26022] Updated weights on worker 0-0, policy_version 39215 (0.00093) [2022-07-09 02:03:48,705][25689] Fps is (10 sec: 5641.2, 60 sec: 5659.0, 300 sec: 5646.7). Total num frames: 40161280. Throughput: 0: 5892.5. Samples: 40165732. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 02:03:48,706][25689] Avg episode reward: [(0, '-61.472')] [2022-07-09 02:03:49,574][26022] Updated weights on worker 0-0, policy_version 39225 (0.00083) [2022-07-09 02:03:51,666][26022] Updated weights on worker 0-0, policy_version 39235 (0.00089) [2022-07-09 02:03:53,129][26022] Updated weights on worker 0-0, policy_version 39245 (0.00087) [2022-07-09 02:03:53,707][25689] Fps is (10 sec: 5688.9, 60 sec: 5661.4, 300 sec: 5647.2). Total num frames: 40189952. Throughput: 0: 5048.8. Samples: 40182840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:03:53,708][25689] Avg episode reward: [(0, '-62.716')] [2022-07-09 02:03:55,163][26022] Updated weights on worker 0-0, policy_version 39255 (0.01310) [2022-07-09 02:03:56,583][26022] Updated weights on worker 0-0, policy_version 39265 (0.00087) [2022-07-09 02:03:58,649][26022] Updated weights on worker 0-0, policy_version 39275 (0.00086) [2022-07-09 02:03:58,744][25689] Fps is (10 sec: 5712.4, 60 sec: 5642.5, 300 sec: 5647.9). Total num frames: 40218624. Throughput: 0: 5886.9. Samples: 40217018. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:03:58,744][25689] Avg episode reward: [(0, '-62.234')] [2022-07-09 02:04:00,290][26022] Updated weights on worker 0-0, policy_version 39285 (0.00081) [2022-07-09 02:04:02,512][26022] Updated weights on worker 0-0, policy_version 39295 (0.00051) [2022-07-09 02:04:03,781][25689] Fps is (10 sec: 5488.8, 60 sec: 5660.7, 300 sec: 5647.8). Total num frames: 40245248. Throughput: 0: 5817.4. Samples: 40249188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:03,782][25689] Avg episode reward: [(0, '-61.930')] [2022-07-09 02:04:04,406][26022] Updated weights on worker 0-0, policy_version 39305 (0.00095) [2022-07-09 02:04:06,083][26022] Updated weights on worker 0-0, policy_version 39315 (0.00086) [2022-07-09 02:04:08,063][26022] Updated weights on worker 0-0, policy_version 39325 (0.00529) [2022-07-09 02:04:08,789][25689] Fps is (10 sec: 5402.7, 60 sec: 5628.8, 300 sec: 5644.4). Total num frames: 40272896. Throughput: 0: 5008.2. Samples: 40266316. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:08,790][25689] Avg episode reward: [(0, '-62.242')] [2022-07-09 02:04:09,616][26022] Updated weights on worker 0-0, policy_version 39335 (0.00090) [2022-07-09 02:04:11,390][26022] Updated weights on worker 0-0, policy_version 39345 (0.00086) [2022-07-09 02:04:13,080][26022] Updated weights on worker 0-0, policy_version 39355 (0.00086) [2022-07-09 02:04:13,812][25689] Fps is (10 sec: 5614.7, 60 sec: 5615.2, 300 sec: 5647.5). Total num frames: 40301568. Throughput: 0: 5874.5. Samples: 40300946. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:13,813][25689] Avg episode reward: [(0, '-62.337')] [2022-07-09 02:04:14,951][26022] Updated weights on worker 0-0, policy_version 39365 (0.00094) [2022-07-09 02:04:16,970][26022] Updated weights on worker 0-0, policy_version 39375 (0.00090) [2022-07-09 02:04:17,929][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:04:17,940][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000039381_40326144.pth [2022-07-09 02:04:17,941][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000037394_38291456.pth [2022-07-09 02:04:18,568][26022] Updated weights on worker 0-0, policy_version 39385 (0.00086) [2022-07-09 02:04:18,831][25689] Fps is (10 sec: 5710.4, 60 sec: 5622.3, 300 sec: 5645.6). Total num frames: 40330240. Throughput: 0: 5862.1. Samples: 40334772. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:18,833][25689] Avg episode reward: [(0, '-61.679')] [2022-07-09 02:04:20,446][26022] Updated weights on worker 0-0, policy_version 39395 (0.00083) [2022-07-09 02:04:22,498][26022] Updated weights on worker 0-0, policy_version 39405 (0.00083) [2022-07-09 02:04:23,952][25689] Fps is (10 sec: 5857.0, 60 sec: 5670.8, 300 sec: 5657.5). Total num frames: 40360960. Throughput: 0: 5950.5. Samples: 40369216. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:23,953][25689] Avg episode reward: [(0, '-61.326')] [2022-07-09 02:04:23,960][26022] Updated weights on worker 0-0, policy_version 39415 (0.00082) [2022-07-09 02:04:25,910][26022] Updated weights on worker 0-0, policy_version 39425 (0.00093) [2022-07-09 02:04:27,835][26022] Updated weights on worker 0-0, policy_version 39435 (0.00086) [2022-07-09 02:04:28,983][25689] Fps is (10 sec: 5648.4, 60 sec: 5640.2, 300 sec: 5643.9). Total num frames: 40387584. Throughput: 0: 5934.5. Samples: 40386158. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:28,984][25689] Avg episode reward: [(0, '-61.358')] [2022-07-09 02:04:29,468][26022] Updated weights on worker 0-0, policy_version 39445 (0.00087) [2022-07-09 02:04:31,444][26022] Updated weights on worker 0-0, policy_version 39455 (0.00090) [2022-07-09 02:04:33,050][26022] Updated weights on worker 0-0, policy_version 39465 (0.00087) [2022-07-09 02:04:34,070][25689] Fps is (10 sec: 5465.5, 60 sec: 5632.8, 300 sec: 5650.5). Total num frames: 40416256. Throughput: 0: 5892.7. Samples: 40420318. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:34,070][25689] Avg episode reward: [(0, '-61.398')] [2022-07-09 02:04:35,135][26022] Updated weights on worker 0-0, policy_version 39475 (0.00087) [2022-07-09 02:04:36,641][26022] Updated weights on worker 0-0, policy_version 39485 (0.00085) [2022-07-09 02:04:38,486][26022] Updated weights on worker 0-0, policy_version 39495 (0.00089) [2022-07-09 02:04:39,092][25689] Fps is (10 sec: 5774.0, 60 sec: 5648.3, 300 sec: 5650.7). Total num frames: 40445952. Throughput: 0: 5906.3. Samples: 40454438. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 02:04:39,092][25689] Avg episode reward: [(0, '-60.774')] [2022-07-09 02:04:40,410][26022] Updated weights on worker 0-0, policy_version 39505 (0.00108) [2022-07-09 02:04:41,988][26022] Updated weights on worker 0-0, policy_version 39515 (0.00100) [2022-07-09 02:04:44,035][26022] Updated weights on worker 0-0, policy_version 39525 (0.00082) [2022-07-09 02:04:44,144][25689] Fps is (10 sec: 5692.1, 60 sec: 5634.6, 300 sec: 5646.5). Total num frames: 40473600. Throughput: 0: 5062.1. Samples: 40471428. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:04:44,145][25689] Avg episode reward: [(0, '-60.322')] [2022-07-09 02:04:45,875][26022] Updated weights on worker 0-0, policy_version 39535 (0.00090) [2022-07-09 02:04:47,631][26022] Updated weights on worker 0-0, policy_version 39545 (0.00086) [2022-07-09 02:04:49,165][25689] Fps is (10 sec: 5693.0, 60 sec: 5657.0, 300 sec: 5647.1). Total num frames: 40503296. Throughput: 0: 5891.1. Samples: 40505048. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:04:49,165][25689] Avg episode reward: [(0, '-60.599')] [2022-07-09 02:04:49,283][26022] Updated weights on worker 0-0, policy_version 39555 (0.00089) [2022-07-09 02:04:51,201][26022] Updated weights on worker 0-0, policy_version 39565 (0.00082) [2022-07-09 02:04:53,095][26022] Updated weights on worker 0-0, policy_version 39575 (0.00094) [2022-07-09 02:04:54,206][25689] Fps is (10 sec: 5699.2, 60 sec: 5636.3, 300 sec: 5646.5). Total num frames: 40530944. Throughput: 0: 5905.4. Samples: 40539230. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:04:54,207][25689] Avg episode reward: [(0, '-59.895')] [2022-07-09 02:04:54,931][26022] Updated weights on worker 0-0, policy_version 39585 (0.00093) [2022-07-09 02:04:56,788][26022] Updated weights on worker 0-0, policy_version 39595 (0.00085) [2022-07-09 02:04:58,513][26022] Updated weights on worker 0-0, policy_version 39605 (0.00089) [2022-07-09 02:04:59,262][25689] Fps is (10 sec: 5577.8, 60 sec: 5634.6, 300 sec: 5648.1). Total num frames: 40559616. Throughput: 0: 5043.1. Samples: 40556156. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:04:59,263][25689] Avg episode reward: [(0, '-59.537')] [2022-07-09 02:05:00,260][26022] Updated weights on worker 0-0, policy_version 39615 (0.00092) [2022-07-09 02:05:02,426][26022] Updated weights on worker 0-0, policy_version 39625 (0.00088) [2022-07-09 02:05:04,326][25689] Fps is (10 sec: 5363.0, 60 sec: 5615.2, 300 sec: 5647.1). Total num frames: 40585216. Throughput: 0: 5772.5. Samples: 40587924. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:04,326][25689] Avg episode reward: [(0, '-58.944')] [2022-07-09 02:05:04,427][26022] Updated weights on worker 0-0, policy_version 39635 (0.00092) [2022-07-09 02:05:06,135][26022] Updated weights on worker 0-0, policy_version 39645 (0.00082) [2022-07-09 02:05:07,926][26022] Updated weights on worker 0-0, policy_version 39655 (0.00087) [2022-07-09 02:05:09,368][25689] Fps is (10 sec: 5370.0, 60 sec: 5628.9, 300 sec: 5643.0). Total num frames: 40613888. Throughput: 0: 5785.5. Samples: 40621936. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:09,369][25689] Avg episode reward: [(0, '-59.458')] [2022-07-09 02:05:09,674][26022] Updated weights on worker 0-0, policy_version 39665 (0.00094) [2022-07-09 02:05:11,379][26022] Updated weights on worker 0-0, policy_version 39675 (0.00094) [2022-07-09 02:05:13,384][26022] Updated weights on worker 0-0, policy_version 39685 (0.00091) [2022-07-09 02:05:14,371][25689] Fps is (10 sec: 5810.6, 60 sec: 5647.7, 300 sec: 5654.7). Total num frames: 40643584. Throughput: 0: 4938.1. Samples: 40638806. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:14,371][25689] Avg episode reward: [(0, '-59.753')] [2022-07-09 02:05:15,045][26022] Updated weights on worker 0-0, policy_version 39695 (0.00083) [2022-07-09 02:05:16,990][26022] Updated weights on worker 0-0, policy_version 39705 (0.00091) [2022-07-09 02:05:18,808][26022] Updated weights on worker 0-0, policy_version 39715 (0.00081) [2022-07-09 02:05:19,380][25689] Fps is (10 sec: 5727.4, 60 sec: 5631.6, 300 sec: 5643.0). Total num frames: 40671232. Throughput: 0: 5789.2. Samples: 40672626. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:19,381][25689] Avg episode reward: [(0, '-59.979')] [2022-07-09 02:05:20,563][26022] Updated weights on worker 0-0, policy_version 39725 (0.00098) [2022-07-09 02:05:22,211][26022] Updated weights on worker 0-0, policy_version 39735 (0.00088) [2022-07-09 02:05:24,185][26022] Updated weights on worker 0-0, policy_version 39745 (0.00090) [2022-07-09 02:05:24,426][25689] Fps is (10 sec: 5601.1, 60 sec: 5604.9, 300 sec: 5649.8). Total num frames: 40699904. Throughput: 0: 5928.7. Samples: 40707092. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:24,426][25689] Avg episode reward: [(0, '-60.269')] [2022-07-09 02:05:26,019][26022] Updated weights on worker 0-0, policy_version 39755 (0.00083) [2022-07-09 02:05:27,708][26022] Updated weights on worker 0-0, policy_version 39765 (0.00086) [2022-07-09 02:05:29,510][25689] Fps is (10 sec: 5559.9, 60 sec: 5616.8, 300 sec: 5646.2). Total num frames: 40727552. Throughput: 0: 5072.1. Samples: 40724098. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:29,511][25689] Avg episode reward: [(0, '-60.199')] [2022-07-09 02:05:29,676][26022] Updated weights on worker 0-0, policy_version 39775 (0.00092) [2022-07-09 02:05:31,036][26022] Updated weights on worker 0-0, policy_version 39785 (0.00084) [2022-07-09 02:05:33,198][26022] Updated weights on worker 0-0, policy_version 39795 (0.00089) [2022-07-09 02:05:34,568][25689] Fps is (10 sec: 5754.8, 60 sec: 5653.3, 300 sec: 5649.5). Total num frames: 40758272. Throughput: 0: 5917.7. Samples: 40758330. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 02:05:34,568][25689] Avg episode reward: [(0, '-59.688')] [2022-07-09 02:05:35,050][26022] Updated weights on worker 0-0, policy_version 39805 (0.00092) [2022-07-09 02:05:36,796][26022] Updated weights on worker 0-0, policy_version 39815 (0.00089) [2022-07-09 02:05:38,601][26022] Updated weights on worker 0-0, policy_version 39825 (0.00103) [2022-07-09 02:05:39,662][25689] Fps is (10 sec: 5749.2, 60 sec: 5612.8, 300 sec: 5646.7). Total num frames: 40785920. Throughput: 0: 5899.9. Samples: 40792288. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:05:39,663][25689] Avg episode reward: [(0, '-60.183')] [2022-07-09 02:05:40,442][26022] Updated weights on worker 0-0, policy_version 39835 (0.00248) [2022-07-09 02:05:42,242][26022] Updated weights on worker 0-0, policy_version 39845 (0.00082) [2022-07-09 02:05:43,971][26022] Updated weights on worker 0-0, policy_version 39855 (0.00088) [2022-07-09 02:05:44,740][25689] Fps is (10 sec: 5637.7, 60 sec: 5644.2, 300 sec: 5652.1). Total num frames: 40815616. Throughput: 0: 5024.8. Samples: 40809166. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:05:44,740][25689] Avg episode reward: [(0, '-59.718')] [2022-07-09 02:05:45,804][26022] Updated weights on worker 0-0, policy_version 39865 (0.00097) [2022-07-09 02:05:47,613][26022] Updated weights on worker 0-0, policy_version 39875 (0.00086) [2022-07-09 02:05:49,447][26022] Updated weights on worker 0-0, policy_version 39885 (0.00061) [2022-07-09 02:05:49,824][25689] Fps is (10 sec: 5642.9, 60 sec: 5604.5, 300 sec: 5648.1). Total num frames: 40843264. Throughput: 0: 5856.6. Samples: 40843072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:05:49,825][25689] Avg episode reward: [(0, '-59.781')] [2022-07-09 02:05:51,301][26022] Updated weights on worker 0-0, policy_version 39895 (0.00085) [2022-07-09 02:05:52,938][26022] Updated weights on worker 0-0, policy_version 39905 (0.00089) [2022-07-09 02:05:54,796][26022] Updated weights on worker 0-0, policy_version 39915 (0.00080) [2022-07-09 02:05:54,891][25689] Fps is (10 sec: 5648.7, 60 sec: 5635.9, 300 sec: 5647.7). Total num frames: 40872960. Throughput: 0: 5857.3. Samples: 40877370. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:05:54,892][25689] Avg episode reward: [(0, '-59.649')] [2022-07-09 02:05:56,653][26022] Updated weights on worker 0-0, policy_version 39925 (0.00087) [2022-07-09 02:05:58,346][26022] Updated weights on worker 0-0, policy_version 39935 (0.00089) [2022-07-09 02:05:59,962][25689] Fps is (10 sec: 5656.5, 60 sec: 5617.6, 300 sec: 5653.9). Total num frames: 40900608. Throughput: 0: 5025.9. Samples: 40894312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:05:59,963][25689] Avg episode reward: [(0, '-60.600')] [2022-07-09 02:06:00,199][26022] Updated weights on worker 0-0, policy_version 39945 (0.00080) [2022-07-09 02:06:02,391][26022] Updated weights on worker 0-0, policy_version 39955 (0.00082) [2022-07-09 02:06:04,255][26022] Updated weights on worker 0-0, policy_version 39965 (0.00089) [2022-07-09 02:06:05,030][25689] Fps is (10 sec: 5655.8, 60 sec: 5684.7, 300 sec: 5656.3). Total num frames: 40930304. Throughput: 0: 5797.2. Samples: 40926796. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:06:05,031][25689] Avg episode reward: [(0, '-60.382')] [2022-07-09 02:06:05,955][26022] Updated weights on worker 0-0, policy_version 39975 (0.00086) [2022-07-09 02:06:07,857][26022] Updated weights on worker 0-0, policy_version 39985 (0.00088) [2022-07-09 02:06:09,708][26022] Updated weights on worker 0-0, policy_version 39995 (0.00080) [2022-07-09 02:06:10,033][25689] Fps is (10 sec: 5490.9, 60 sec: 5637.8, 300 sec: 5643.2). Total num frames: 40955904. Throughput: 0: 5840.3. Samples: 40961094. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:06:10,033][25689] Avg episode reward: [(0, '-60.673')] [2022-07-09 02:06:11,215][26022] Updated weights on worker 0-0, policy_version 40005 (0.00082) [2022-07-09 02:06:13,235][26022] Updated weights on worker 0-0, policy_version 40015 (0.00079) [2022-07-09 02:06:14,695][26022] Updated weights on worker 0-0, policy_version 40025 (0.00091) [2022-07-09 02:06:15,060][25689] Fps is (10 sec: 5615.2, 60 sec: 5652.3, 300 sec: 5653.4). Total num frames: 40986624. Throughput: 0: 5866.6. Samples: 40995694. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:06:15,061][25689] Avg episode reward: [(0, '-60.821')] [2022-07-09 02:06:16,881][26022] Updated weights on worker 0-0, policy_version 40035 (0.00093) [2022-07-09 02:06:18,085][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:06:18,106][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000040043_41004032.pth [2022-07-09 02:06:18,107][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000038053_38966272.pth [2022-07-09 02:06:18,504][26022] Updated weights on worker 0-0, policy_version 40045 (0.00080) [2022-07-09 02:06:20,114][25689] Fps is (10 sec: 5789.8, 60 sec: 5648.3, 300 sec: 5651.7). Total num frames: 41014272. Throughput: 0: 5873.1. Samples: 41012664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:06:20,116][25689] Avg episode reward: [(0, '-59.941')] [2022-07-09 02:06:20,291][26022] Updated weights on worker 0-0, policy_version 40055 (0.00082) [2022-07-09 02:06:22,039][26022] Updated weights on worker 0-0, policy_version 40065 (0.00074) [2022-07-09 02:06:23,901][26022] Updated weights on worker 0-0, policy_version 40075 (0.00082) [2022-07-09 02:06:25,239][25689] Fps is (10 sec: 5633.9, 60 sec: 5657.7, 300 sec: 5646.2). Total num frames: 41043968. Throughput: 0: 5963.2. Samples: 41047302. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 02:06:25,239][25689] Avg episode reward: [(0, '-58.743')] [2022-07-09 02:06:25,635][26022] Updated weights on worker 0-0, policy_version 40085 (0.00093) [2022-07-09 02:06:27,355][26022] Updated weights on worker 0-0, policy_version 40095 (0.00083) [2022-07-09 02:06:29,114][26022] Updated weights on worker 0-0, policy_version 40105 (0.00094) [2022-07-09 02:06:30,304][25689] Fps is (10 sec: 5828.3, 60 sec: 5693.2, 300 sec: 5656.8). Total num frames: 41073664. Throughput: 0: 5948.4. Samples: 41081678. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:06:30,305][25689] Avg episode reward: [(0, '-58.808')] [2022-07-09 02:06:31,085][26022] Updated weights on worker 0-0, policy_version 40115 (0.00087) [2022-07-09 02:06:32,780][26022] Updated weights on worker 0-0, policy_version 40125 (0.00086) [2022-07-09 02:06:34,573][26022] Updated weights on worker 0-0, policy_version 40135 (0.00085) [2022-07-09 02:06:35,335][25689] Fps is (10 sec: 5679.8, 60 sec: 5645.2, 300 sec: 5653.7). Total num frames: 41101312. Throughput: 0: 5085.9. Samples: 41098808. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:06:35,335][25689] Avg episode reward: [(0, '-58.345')] [2022-07-09 02:06:36,319][26022] Updated weights on worker 0-0, policy_version 40145 (0.00085) [2022-07-09 02:06:38,231][26022] Updated weights on worker 0-0, policy_version 40155 (0.00084) [2022-07-09 02:06:40,019][26022] Updated weights on worker 0-0, policy_version 40165 (0.00083) [2022-07-09 02:06:40,351][25689] Fps is (10 sec: 5707.8, 60 sec: 5686.2, 300 sec: 5654.2). Total num frames: 41131008. Throughput: 0: 5941.9. Samples: 41132910. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:06:40,351][25689] Avg episode reward: [(0, '-58.501')] [2022-07-09 02:06:41,840][26022] Updated weights on worker 0-0, policy_version 40175 (0.00087) [2022-07-09 02:06:43,475][26022] Updated weights on worker 0-0, policy_version 40185 (0.00084) [2022-07-09 02:06:45,465][25689] Fps is (10 sec: 5660.5, 60 sec: 5649.0, 300 sec: 5645.8). Total num frames: 41158656. Throughput: 0: 5944.0. Samples: 41167528. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:06:45,466][25689] Avg episode reward: [(0, '-59.345')] [2022-07-09 02:06:45,472][26022] Updated weights on worker 0-0, policy_version 40195 (0.00081) [2022-07-09 02:06:46,942][26022] Updated weights on worker 0-0, policy_version 40205 (0.00093) [2022-07-09 02:06:49,040][26022] Updated weights on worker 0-0, policy_version 40215 (0.01178) [2022-07-09 02:06:50,524][25689] Fps is (10 sec: 5737.7, 60 sec: 5702.1, 300 sec: 5659.1). Total num frames: 41189376. Throughput: 0: 5090.4. Samples: 41184602. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:06:50,524][25689] Avg episode reward: [(0, '-59.689')] [2022-07-09 02:06:50,592][26022] Updated weights on worker 0-0, policy_version 40225 (0.00092) [2022-07-09 02:06:52,706][26022] Updated weights on worker 0-0, policy_version 40235 (0.00088) [2022-07-09 02:06:54,066][26022] Updated weights on worker 0-0, policy_version 40245 (0.00085) [2022-07-09 02:06:55,528][25689] Fps is (10 sec: 5800.6, 60 sec: 5674.2, 300 sec: 5652.4). Total num frames: 41217024. Throughput: 0: 5961.2. Samples: 41219182. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:06:55,528][25689] Avg episode reward: [(0, '-59.269')] [2022-07-09 02:06:56,219][26022] Updated weights on worker 0-0, policy_version 40255 (0.00089) [2022-07-09 02:06:57,660][26022] Updated weights on worker 0-0, policy_version 40265 (0.00092) [2022-07-09 02:06:59,718][26022] Updated weights on worker 0-0, policy_version 40275 (0.00087) [2022-07-09 02:07:00,554][25689] Fps is (10 sec: 5716.7, 60 sec: 5712.1, 300 sec: 5663.5). Total num frames: 41246720. Throughput: 0: 5979.6. Samples: 41253720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:07:00,555][25689] Avg episode reward: [(0, '-59.121')] [2022-07-09 02:07:01,324][26022] Updated weights on worker 0-0, policy_version 40285 (0.00084) [2022-07-09 02:07:03,603][26022] Updated weights on worker 0-0, policy_version 40295 (0.00087) [2022-07-09 02:07:05,335][26022] Updated weights on worker 0-0, policy_version 40305 (0.00079) [2022-07-09 02:07:05,670][25689] Fps is (10 sec: 5552.8, 60 sec: 5657.0, 300 sec: 5652.0). Total num frames: 41273344. Throughput: 0: 5014.2. Samples: 41268840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:07:05,671][25689] Avg episode reward: [(0, '-59.047')] [2022-07-09 02:07:07,167][26022] Updated weights on worker 0-0, policy_version 40315 (0.00088) [2022-07-09 02:07:08,978][26022] Updated weights on worker 0-0, policy_version 40325 (0.00087) [2022-07-09 02:07:10,698][25689] Fps is (10 sec: 5451.4, 60 sec: 5705.3, 300 sec: 5658.5). Total num frames: 41302016. Throughput: 0: 5860.8. Samples: 41302840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:07:10,699][25689] Avg episode reward: [(0, '-58.069')] [2022-07-09 02:07:10,809][26022] Updated weights on worker 0-0, policy_version 40335 (0.00082) [2022-07-09 02:07:12,492][26022] Updated weights on worker 0-0, policy_version 40345 (0.00092) [2022-07-09 02:07:14,407][26022] Updated weights on worker 0-0, policy_version 40355 (0.00087) [2022-07-09 02:07:15,705][25689] Fps is (10 sec: 5714.6, 60 sec: 5673.5, 300 sec: 5658.7). Total num frames: 41330688. Throughput: 0: 5849.4. Samples: 41337208. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 02:07:15,706][25689] Avg episode reward: [(0, '-57.832')] [2022-07-09 02:07:16,116][26022] Updated weights on worker 0-0, policy_version 40365 (0.00089) [2022-07-09 02:07:17,970][26022] Updated weights on worker 0-0, policy_version 40375 (0.01173) [2022-07-09 02:07:19,921][26022] Updated weights on worker 0-0, policy_version 40385 (0.00093) [2022-07-09 02:07:20,722][25689] Fps is (10 sec: 5618.2, 60 sec: 5676.8, 300 sec: 5652.8). Total num frames: 41358336. Throughput: 0: 4981.4. Samples: 41354184. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:20,724][25689] Avg episode reward: [(0, '-58.719')] [2022-07-09 02:07:21,462][26022] Updated weights on worker 0-0, policy_version 40395 (0.00092) [2022-07-09 02:07:23,585][26022] Updated weights on worker 0-0, policy_version 40405 (0.00089) [2022-07-09 02:07:25,339][26022] Updated weights on worker 0-0, policy_version 40415 (0.00088) [2022-07-09 02:07:25,772][25689] Fps is (10 sec: 5594.7, 60 sec: 5667.0, 300 sec: 5658.8). Total num frames: 41387008. Throughput: 0: 5928.1. Samples: 41388004. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:25,774][25689] Avg episode reward: [(0, '-58.900')] [2022-07-09 02:07:27,080][26022] Updated weights on worker 0-0, policy_version 40425 (0.00088) [2022-07-09 02:07:28,922][26022] Updated weights on worker 0-0, policy_version 40435 (0.00082) [2022-07-09 02:07:30,639][26022] Updated weights on worker 0-0, policy_version 40445 (0.00091) [2022-07-09 02:07:30,794][25689] Fps is (10 sec: 5693.4, 60 sec: 5654.1, 300 sec: 5651.7). Total num frames: 41415680. Throughput: 0: 5930.8. Samples: 41422028. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:30,796][25689] Avg episode reward: [(0, '-59.544')] [2022-07-09 02:07:32,499][26022] Updated weights on worker 0-0, policy_version 40455 (0.00089) [2022-07-09 02:07:34,465][26022] Updated weights on worker 0-0, policy_version 40465 (0.00087) [2022-07-09 02:07:35,809][25689] Fps is (10 sec: 5713.2, 60 sec: 5672.5, 300 sec: 5655.1). Total num frames: 41444352. Throughput: 0: 5068.9. Samples: 41439112. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:35,811][25689] Avg episode reward: [(0, '-60.805')] [2022-07-09 02:07:36,226][26022] Updated weights on worker 0-0, policy_version 40475 (0.00045) [2022-07-09 02:07:38,090][26022] Updated weights on worker 0-0, policy_version 40485 (0.00096) [2022-07-09 02:07:39,862][26022] Updated weights on worker 0-0, policy_version 40495 (0.00090) [2022-07-09 02:07:40,848][25689] Fps is (10 sec: 5704.0, 60 sec: 5653.4, 300 sec: 5662.6). Total num frames: 41473024. Throughput: 0: 5921.1. Samples: 41473348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:40,848][25689] Avg episode reward: [(0, '-60.663')] [2022-07-09 02:07:41,408][26022] Updated weights on worker 0-0, policy_version 40505 (0.00086) [2022-07-09 02:07:43,576][26022] Updated weights on worker 0-0, policy_version 40515 (0.00094) [2022-07-09 02:07:44,957][26022] Updated weights on worker 0-0, policy_version 40525 (0.00068) [2022-07-09 02:07:45,977][25689] Fps is (10 sec: 5639.5, 60 sec: 5669.0, 300 sec: 5651.3). Total num frames: 41501696. Throughput: 0: 5923.7. Samples: 41507694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:45,979][25689] Avg episode reward: [(0, '-60.054')] [2022-07-09 02:07:46,995][26022] Updated weights on worker 0-0, policy_version 40535 (0.00093) [2022-07-09 02:07:48,604][26022] Updated weights on worker 0-0, policy_version 40545 (0.00097) [2022-07-09 02:07:50,658][26022] Updated weights on worker 0-0, policy_version 40555 (0.00087) [2022-07-09 02:07:51,029][25689] Fps is (10 sec: 5632.4, 60 sec: 5635.7, 300 sec: 5650.8). Total num frames: 41530368. Throughput: 0: 5073.8. Samples: 41524694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:51,030][25689] Avg episode reward: [(0, '-59.365')] [2022-07-09 02:07:52,319][26022] Updated weights on worker 0-0, policy_version 40565 (0.00082) [2022-07-09 02:07:54,067][26022] Updated weights on worker 0-0, policy_version 40575 (0.00088) [2022-07-09 02:07:55,797][26022] Updated weights on worker 0-0, policy_version 40585 (0.00088) [2022-07-09 02:07:56,093][25689] Fps is (10 sec: 5668.5, 60 sec: 5647.0, 300 sec: 5646.4). Total num frames: 41559040. Throughput: 0: 5905.8. Samples: 41558908. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:07:56,094][25689] Avg episode reward: [(0, '-59.833')] [2022-07-09 02:07:57,810][26022] Updated weights on worker 0-0, policy_version 40595 (0.00084) [2022-07-09 02:07:59,371][26022] Updated weights on worker 0-0, policy_version 40605 (0.00085) [2022-07-09 02:08:01,134][25689] Fps is (10 sec: 5776.1, 60 sec: 5645.7, 300 sec: 5660.4). Total num frames: 41588736. Throughput: 0: 5923.6. Samples: 41593516. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:08:01,134][25689] Avg episode reward: [(0, '-60.175')] [2022-07-09 02:08:01,203][26022] Updated weights on worker 0-0, policy_version 40615 (0.00092) [2022-07-09 02:08:03,327][26022] Updated weights on worker 0-0, policy_version 40625 (0.00091) [2022-07-09 02:08:05,195][26022] Updated weights on worker 0-0, policy_version 40635 (0.00092) [2022-07-09 02:08:06,171][25689] Fps is (10 sec: 5487.2, 60 sec: 5636.2, 300 sec: 5646.5). Total num frames: 41614336. Throughput: 0: 4994.2. Samples: 41608544. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 02:08:06,171][25689] Avg episode reward: [(0, '-59.555')] [2022-07-09 02:08:07,071][26022] Updated weights on worker 0-0, policy_version 40645 (0.00096) [2022-07-09 02:08:08,874][26022] Updated weights on worker 0-0, policy_version 40655 (0.00085) [2022-07-09 02:08:10,340][26022] Updated weights on worker 0-0, policy_version 40665 (0.00077) [2022-07-09 02:08:11,176][25689] Fps is (10 sec: 5608.5, 60 sec: 5672.2, 300 sec: 5650.9). Total num frames: 41645056. Throughput: 0: 5872.9. Samples: 41643016. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:11,176][25689] Avg episode reward: [(0, '-59.748')] [2022-07-09 02:08:12,449][26022] Updated weights on worker 0-0, policy_version 40675 (0.00083) [2022-07-09 02:08:13,974][26022] Updated weights on worker 0-0, policy_version 40685 (0.00086) [2022-07-09 02:08:16,029][26022] Updated weights on worker 0-0, policy_version 40695 (0.00086) [2022-07-09 02:08:16,188][25689] Fps is (10 sec: 5826.9, 60 sec: 5654.8, 300 sec: 5649.1). Total num frames: 41672704. Throughput: 0: 5898.3. Samples: 41677430. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:16,188][25689] Avg episode reward: [(0, '-60.045')] [2022-07-09 02:08:17,787][26022] Updated weights on worker 0-0, policy_version 40705 (0.00087) [2022-07-09 02:08:18,228][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:08:18,240][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000040708_41684992.pth [2022-07-09 02:08:18,240][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000038717_39646208.pth [2022-07-09 02:08:19,444][26022] Updated weights on worker 0-0, policy_version 40715 (0.00097) [2022-07-09 02:08:21,223][25689] Fps is (10 sec: 5605.7, 60 sec: 5670.1, 300 sec: 5653.7). Total num frames: 41701376. Throughput: 0: 5040.4. Samples: 41694772. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:21,223][25689] Avg episode reward: [(0, '-60.862')] [2022-07-09 02:08:21,366][26022] Updated weights on worker 0-0, policy_version 40725 (0.00092) [2022-07-09 02:08:23,144][26022] Updated weights on worker 0-0, policy_version 40735 (0.00082) [2022-07-09 02:08:24,917][26022] Updated weights on worker 0-0, policy_version 40745 (0.00089) [2022-07-09 02:08:26,313][25689] Fps is (10 sec: 5562.4, 60 sec: 5649.3, 300 sec: 5649.8). Total num frames: 41729024. Throughput: 0: 5954.3. Samples: 41728476. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:26,313][25689] Avg episode reward: [(0, '-61.299')] [2022-07-09 02:08:26,927][26022] Updated weights on worker 0-0, policy_version 40755 (0.00085) [2022-07-09 02:08:28,376][26022] Updated weights on worker 0-0, policy_version 40765 (0.00080) [2022-07-09 02:08:30,498][26022] Updated weights on worker 0-0, policy_version 40775 (0.00089) [2022-07-09 02:08:31,341][25689] Fps is (10 sec: 5667.3, 60 sec: 5665.7, 300 sec: 5652.8). Total num frames: 41758720. Throughput: 0: 5936.8. Samples: 41762734. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:31,343][25689] Avg episode reward: [(0, '-60.368')] [2022-07-09 02:08:32,179][26022] Updated weights on worker 0-0, policy_version 40785 (0.00081) [2022-07-09 02:08:34,005][26022] Updated weights on worker 0-0, policy_version 40795 (0.00082) [2022-07-09 02:08:35,627][26022] Updated weights on worker 0-0, policy_version 40805 (0.00092) [2022-07-09 02:08:36,374][25689] Fps is (10 sec: 5699.6, 60 sec: 5647.1, 300 sec: 5648.9). Total num frames: 41786368. Throughput: 0: 5922.6. Samples: 41796984. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:36,374][25689] Avg episode reward: [(0, '-59.545')] [2022-07-09 02:08:37,589][26022] Updated weights on worker 0-0, policy_version 40815 (0.00091) [2022-07-09 02:08:39,351][26022] Updated weights on worker 0-0, policy_version 40825 (0.00087) [2022-07-09 02:08:41,265][26022] Updated weights on worker 0-0, policy_version 40835 (0.00090) [2022-07-09 02:08:41,381][25689] Fps is (10 sec: 5711.4, 60 sec: 5666.9, 300 sec: 5653.8). Total num frames: 41816064. Throughput: 0: 5917.4. Samples: 41814058. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:41,382][25689] Avg episode reward: [(0, '-59.063')] [2022-07-09 02:08:42,924][26022] Updated weights on worker 0-0, policy_version 40845 (0.00094) [2022-07-09 02:08:44,912][26022] Updated weights on worker 0-0, policy_version 40855 (0.00081) [2022-07-09 02:08:46,364][26022] Updated weights on worker 0-0, policy_version 40865 (0.00083) [2022-07-09 02:08:46,508][25689] Fps is (10 sec: 5860.7, 60 sec: 5684.1, 300 sec: 5656.4). Total num frames: 41845760. Throughput: 0: 5943.4. Samples: 41848502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:46,509][25689] Avg episode reward: [(0, '-58.912')] [2022-07-09 02:08:48,380][26022] Updated weights on worker 0-0, policy_version 40875 (0.00089) [2022-07-09 02:08:49,910][26022] Updated weights on worker 0-0, policy_version 40885 (0.00093) [2022-07-09 02:08:51,535][25689] Fps is (10 sec: 5647.8, 60 sec: 5669.5, 300 sec: 5652.5). Total num frames: 41873408. Throughput: 0: 5930.3. Samples: 41882488. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:51,535][25689] Avg episode reward: [(0, '-58.183')] [2022-07-09 02:08:52,059][26022] Updated weights on worker 0-0, policy_version 40895 (0.00079) [2022-07-09 02:08:53,575][26022] Updated weights on worker 0-0, policy_version 40905 (0.00085) [2022-07-09 02:08:55,611][26022] Updated weights on worker 0-0, policy_version 40915 (0.00095) [2022-07-09 02:08:56,547][25689] Fps is (10 sec: 5712.1, 60 sec: 5691.4, 300 sec: 5656.4). Total num frames: 41903104. Throughput: 0: 5083.4. Samples: 41899532. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:08:56,548][25689] Avg episode reward: [(0, '-57.840')] [2022-07-09 02:08:57,320][26022] Updated weights on worker 0-0, policy_version 40925 (0.00086) [2022-07-09 02:08:59,186][26022] Updated weights on worker 0-0, policy_version 40935 (0.00087) [2022-07-09 02:09:00,815][26022] Updated weights on worker 0-0, policy_version 40945 (0.00086) [2022-07-09 02:09:01,573][25689] Fps is (10 sec: 5712.3, 60 sec: 5658.8, 300 sec: 5660.0). Total num frames: 41930752. Throughput: 0: 5948.4. Samples: 41934168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 02:09:01,574][25689] Avg episode reward: [(0, '-58.345')] [2022-07-09 02:09:02,978][26022] Updated weights on worker 0-0, policy_version 40955 (0.00087) [2022-07-09 02:09:04,881][26022] Updated weights on worker 0-0, policy_version 40965 (0.00084) [2022-07-09 02:09:06,701][25689] Fps is (10 sec: 5344.7, 60 sec: 5667.2, 300 sec: 5654.3). Total num frames: 41957376. Throughput: 0: 5831.7. Samples: 41966264. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:06,702][25689] Avg episode reward: [(0, '-58.576')] [2022-07-09 02:09:06,739][26022] Updated weights on worker 0-0, policy_version 40975 (0.00082) [2022-07-09 02:09:08,604][26022] Updated weights on worker 0-0, policy_version 40985 (0.00088) [2022-07-09 02:09:10,147][26022] Updated weights on worker 0-0, policy_version 40995 (0.00089) [2022-07-09 02:09:11,711][25689] Fps is (10 sec: 5454.5, 60 sec: 5633.0, 300 sec: 5654.5). Total num frames: 41986048. Throughput: 0: 4992.4. Samples: 41983216. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:11,711][25689] Avg episode reward: [(0, '-58.912')] [2022-07-09 02:09:12,067][26022] Updated weights on worker 0-0, policy_version 41005 (0.00087) [2022-07-09 02:09:13,692][26022] Updated weights on worker 0-0, policy_version 41015 (0.00096) [2022-07-09 02:09:15,626][26022] Updated weights on worker 0-0, policy_version 41025 (0.00080) [2022-07-09 02:09:16,774][25689] Fps is (10 sec: 5896.5, 60 sec: 5678.9, 300 sec: 5660.6). Total num frames: 42016768. Throughput: 0: 5836.9. Samples: 42017594. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:16,774][25689] Avg episode reward: [(0, '-59.281')] [2022-07-09 02:09:17,597][26022] Updated weights on worker 0-0, policy_version 41035 (0.00088) [2022-07-09 02:09:19,225][26022] Updated weights on worker 0-0, policy_version 41045 (0.00089) [2022-07-09 02:09:21,239][26022] Updated weights on worker 0-0, policy_version 41055 (0.00082) [2022-07-09 02:09:21,782][25689] Fps is (10 sec: 5795.4, 60 sec: 5664.5, 300 sec: 5652.4). Total num frames: 42044416. Throughput: 0: 5811.5. Samples: 42051612. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:21,783][25689] Avg episode reward: [(0, '-59.152')] [2022-07-09 02:09:22,946][26022] Updated weights on worker 0-0, policy_version 41065 (0.00086) [2022-07-09 02:09:24,724][26022] Updated weights on worker 0-0, policy_version 41075 (0.00081) [2022-07-09 02:09:26,679][26022] Updated weights on worker 0-0, policy_version 41085 (0.00087) [2022-07-09 02:09:26,895][25689] Fps is (10 sec: 5564.6, 60 sec: 5679.3, 300 sec: 5657.7). Total num frames: 42073088. Throughput: 0: 5070.9. Samples: 42068664. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:26,895][25689] Avg episode reward: [(0, '-59.483')] [2022-07-09 02:09:28,251][26022] Updated weights on worker 0-0, policy_version 41095 (0.00093) [2022-07-09 02:09:30,182][26022] Updated weights on worker 0-0, policy_version 41105 (0.00080) [2022-07-09 02:09:31,901][25689] Fps is (10 sec: 5565.9, 60 sec: 5647.6, 300 sec: 5655.8). Total num frames: 42100736. Throughput: 0: 5924.8. Samples: 42102840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:31,902][25689] Avg episode reward: [(0, '-59.851')] [2022-07-09 02:09:31,910][26022] Updated weights on worker 0-0, policy_version 41115 (0.00098) [2022-07-09 02:09:33,639][26022] Updated weights on worker 0-0, policy_version 41125 (0.00087) [2022-07-09 02:09:35,718][26022] Updated weights on worker 0-0, policy_version 41135 (0.00089) [2022-07-09 02:09:36,911][25689] Fps is (10 sec: 5827.4, 60 sec: 5700.4, 300 sec: 5659.5). Total num frames: 42131456. Throughput: 0: 5939.4. Samples: 42137198. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:36,911][25689] Avg episode reward: [(0, '-60.430')] [2022-07-09 02:09:37,242][26022] Updated weights on worker 0-0, policy_version 41145 (0.00086) [2022-07-09 02:09:39,149][26022] Updated weights on worker 0-0, policy_version 41155 (0.00088) [2022-07-09 02:09:41,067][26022] Updated weights on worker 0-0, policy_version 41165 (0.00086) [2022-07-09 02:09:41,935][25689] Fps is (10 sec: 5715.1, 60 sec: 5648.2, 300 sec: 5656.6). Total num frames: 42158080. Throughput: 0: 5076.0. Samples: 42153906. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:41,935][25689] Avg episode reward: [(0, '-60.255')] [2022-07-09 02:09:42,774][26022] Updated weights on worker 0-0, policy_version 41175 (0.00086) [2022-07-09 02:09:44,582][26022] Updated weights on worker 0-0, policy_version 41185 (0.00083) [2022-07-09 02:09:46,277][26022] Updated weights on worker 0-0, policy_version 41195 (0.00084) [2022-07-09 02:09:46,991][25689] Fps is (10 sec: 5485.8, 60 sec: 5637.8, 300 sec: 5652.5). Total num frames: 42186752. Throughput: 0: 5941.6. Samples: 42188066. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:46,991][25689] Avg episode reward: [(0, '-60.468')] [2022-07-09 02:09:48,053][26022] Updated weights on worker 0-0, policy_version 41205 (0.00081) [2022-07-09 02:09:49,988][26022] Updated weights on worker 0-0, policy_version 41215 (0.00092) [2022-07-09 02:09:51,859][26022] Updated weights on worker 0-0, policy_version 41225 (0.00087) [2022-07-09 02:09:52,010][25689] Fps is (10 sec: 5691.5, 60 sec: 5655.4, 300 sec: 5656.3). Total num frames: 42215424. Throughput: 0: 5920.4. Samples: 42221894. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:09:52,011][25689] Avg episode reward: [(0, '-60.527')] [2022-07-09 02:09:53,719][26022] Updated weights on worker 0-0, policy_version 41235 (0.00087) [2022-07-09 02:09:55,451][26022] Updated weights on worker 0-0, policy_version 41245 (0.00085) [2022-07-09 02:09:57,022][25689] Fps is (10 sec: 5716.6, 60 sec: 5638.5, 300 sec: 5657.2). Total num frames: 42244096. Throughput: 0: 5062.2. Samples: 42239004. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:09:57,022][25689] Avg episode reward: [(0, '-60.710')] [2022-07-09 02:09:57,191][26022] Updated weights on worker 0-0, policy_version 41255 (0.00083) [2022-07-09 02:09:59,140][26022] Updated weights on worker 0-0, policy_version 41265 (0.00100) [2022-07-09 02:10:00,784][26022] Updated weights on worker 0-0, policy_version 41275 (0.00088) [2022-07-09 02:10:02,038][25689] Fps is (10 sec: 5412.1, 60 sec: 5605.6, 300 sec: 5658.1). Total num frames: 42269696. Throughput: 0: 5933.4. Samples: 42273186. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:02,039][25689] Avg episode reward: [(0, '-60.617')] [2022-07-09 02:10:02,950][26022] Updated weights on worker 0-0, policy_version 41285 (0.00082) [2022-07-09 02:10:04,815][26022] Updated weights on worker 0-0, policy_version 41295 (0.00095) [2022-07-09 02:10:06,689][26022] Updated weights on worker 0-0, policy_version 41305 (0.00089) [2022-07-09 02:10:07,160][25689] Fps is (10 sec: 5353.3, 60 sec: 5640.1, 300 sec: 5656.6). Total num frames: 42298368. Throughput: 0: 5782.3. Samples: 42304690. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:07,160][25689] Avg episode reward: [(0, '-60.026')] [2022-07-09 02:10:08,471][26022] Updated weights on worker 0-0, policy_version 41315 (0.00095) [2022-07-09 02:10:10,422][26022] Updated weights on worker 0-0, policy_version 41325 (0.00085) [2022-07-09 02:10:11,928][26022] Updated weights on worker 0-0, policy_version 41335 (0.00089) [2022-07-09 02:10:12,173][25689] Fps is (10 sec: 5759.0, 60 sec: 5656.7, 300 sec: 5656.4). Total num frames: 42328064. Throughput: 0: 4954.2. Samples: 42321786. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:12,174][25689] Avg episode reward: [(0, '-60.015')] [2022-07-09 02:10:13,973][26022] Updated weights on worker 0-0, policy_version 41345 (0.00086) [2022-07-09 02:10:15,546][26022] Updated weights on worker 0-0, policy_version 41355 (0.00090) [2022-07-09 02:10:17,184][25689] Fps is (10 sec: 5618.3, 60 sec: 5593.7, 300 sec: 5652.9). Total num frames: 42354688. Throughput: 0: 5802.9. Samples: 42356004. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:17,185][25689] Avg episode reward: [(0, '-59.662')] [2022-07-09 02:10:17,594][26022] Updated weights on worker 0-0, policy_version 41365 (0.00088) [2022-07-09 02:10:18,320][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:10:18,332][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000041369_42361856.pth [2022-07-09 02:10:18,333][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000039381_40326144.pth [2022-07-09 02:10:19,276][26022] Updated weights on worker 0-0, policy_version 41375 (0.00086) [2022-07-09 02:10:21,251][26022] Updated weights on worker 0-0, policy_version 41385 (0.00092) [2022-07-09 02:10:22,219][25689] Fps is (10 sec: 5504.5, 60 sec: 5608.3, 300 sec: 5653.1). Total num frames: 42383360. Throughput: 0: 5783.1. Samples: 42389892. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:22,219][25689] Avg episode reward: [(0, '-59.934')] [2022-07-09 02:10:22,958][26022] Updated weights on worker 0-0, policy_version 41395 (0.00089) [2022-07-09 02:10:24,671][26022] Updated weights on worker 0-0, policy_version 41405 (0.00092) [2022-07-09 02:10:26,586][26022] Updated weights on worker 0-0, policy_version 41415 (0.00085) [2022-07-09 02:10:27,275][25689] Fps is (10 sec: 5784.2, 60 sec: 5630.4, 300 sec: 5660.5). Total num frames: 42413056. Throughput: 0: 5074.5. Samples: 42406766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:27,277][25689] Avg episode reward: [(0, '-60.018')] [2022-07-09 02:10:28,434][26022] Updated weights on worker 0-0, policy_version 41425 (0.00104) [2022-07-09 02:10:30,085][26022] Updated weights on worker 0-0, policy_version 41435 (0.00089) [2022-07-09 02:10:32,051][26022] Updated weights on worker 0-0, policy_version 41445 (0.00083) [2022-07-09 02:10:32,326][25689] Fps is (10 sec: 5673.5, 60 sec: 5626.3, 300 sec: 5650.3). Total num frames: 42440704. Throughput: 0: 5899.9. Samples: 42440686. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:32,328][25689] Avg episode reward: [(0, '-60.057')] [2022-07-09 02:10:33,738][26022] Updated weights on worker 0-0, policy_version 41455 (0.00086) [2022-07-09 02:10:35,607][26022] Updated weights on worker 0-0, policy_version 41465 (0.00087) [2022-07-09 02:10:37,190][26022] Updated weights on worker 0-0, policy_version 41475 (0.00089) [2022-07-09 02:10:37,339][25689] Fps is (10 sec: 5697.9, 60 sec: 5609.0, 300 sec: 5658.8). Total num frames: 42470400. Throughput: 0: 5913.0. Samples: 42475180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:37,340][25689] Avg episode reward: [(0, '-59.389')] [2022-07-09 02:10:39,271][26022] Updated weights on worker 0-0, policy_version 41485 (0.00080) [2022-07-09 02:10:40,821][26022] Updated weights on worker 0-0, policy_version 41495 (0.00082) [2022-07-09 02:10:42,349][25689] Fps is (10 sec: 5721.1, 60 sec: 5627.3, 300 sec: 5653.2). Total num frames: 42498048. Throughput: 0: 5085.0. Samples: 42492256. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 02:10:42,350][25689] Avg episode reward: [(0, '-60.069')] [2022-07-09 02:10:43,024][26022] Updated weights on worker 0-0, policy_version 41505 (0.00090) [2022-07-09 02:10:44,201][26022] Updated weights on worker 0-0, policy_version 41515 (0.00267) [2022-07-09 02:10:46,490][26022] Updated weights on worker 0-0, policy_version 41525 (0.00109) [2022-07-09 02:10:47,483][25689] Fps is (10 sec: 5754.3, 60 sec: 5653.9, 300 sec: 5662.5). Total num frames: 42528768. Throughput: 0: 5941.1. Samples: 42526820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:10:47,483][25689] Avg episode reward: [(0, '-60.309')] [2022-07-09 02:10:47,950][26022] Updated weights on worker 0-0, policy_version 41535 (0.00085) [2022-07-09 02:10:49,919][26022] Updated weights on worker 0-0, policy_version 41545 (0.00088) [2022-07-09 02:10:51,655][26022] Updated weights on worker 0-0, policy_version 41555 (0.00124) [2022-07-09 02:10:52,484][25689] Fps is (10 sec: 5759.3, 60 sec: 5638.7, 300 sec: 5656.9). Total num frames: 42556416. Throughput: 0: 5983.3. Samples: 42561296. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:10:52,484][25689] Avg episode reward: [(0, '-59.938')] [2022-07-09 02:10:53,476][26022] Updated weights on worker 0-0, policy_version 41565 (0.00096) [2022-07-09 02:10:55,187][26022] Updated weights on worker 0-0, policy_version 41575 (0.00090) [2022-07-09 02:10:57,056][26022] Updated weights on worker 0-0, policy_version 41585 (0.00083) [2022-07-09 02:10:57,510][25689] Fps is (10 sec: 5616.7, 60 sec: 5637.4, 300 sec: 5661.2). Total num frames: 42585088. Throughput: 0: 5113.5. Samples: 42578322. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:10:57,510][25689] Avg episode reward: [(0, '-60.123')] [2022-07-09 02:10:58,907][26022] Updated weights on worker 0-0, policy_version 41595 (0.00085) [2022-07-09 02:11:00,610][26022] Updated weights on worker 0-0, policy_version 41605 (0.00088) [2022-07-09 02:11:02,544][25689] Fps is (10 sec: 5496.6, 60 sec: 5652.6, 300 sec: 5651.5). Total num frames: 42611712. Throughput: 0: 5970.9. Samples: 42612836. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:02,544][25689] Avg episode reward: [(0, '-60.391')] [2022-07-09 02:11:02,664][26022] Updated weights on worker 0-0, policy_version 41615 (0.00084) [2022-07-09 02:11:04,701][26022] Updated weights on worker 0-0, policy_version 41625 (0.00094) [2022-07-09 02:11:06,267][26022] Updated weights on worker 0-0, policy_version 41635 (0.00089) [2022-07-09 02:11:07,667][25689] Fps is (10 sec: 5443.9, 60 sec: 5652.5, 300 sec: 5659.6). Total num frames: 42640384. Throughput: 0: 5861.4. Samples: 42645130. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:07,668][25689] Avg episode reward: [(0, '-60.936')] [2022-07-09 02:11:08,076][26022] Updated weights on worker 0-0, policy_version 41645 (0.00083) [2022-07-09 02:11:09,981][26022] Updated weights on worker 0-0, policy_version 41655 (0.00085) [2022-07-09 02:11:11,805][26022] Updated weights on worker 0-0, policy_version 41665 (0.00087) [2022-07-09 02:11:12,707][25689] Fps is (10 sec: 5844.0, 60 sec: 5666.9, 300 sec: 5659.3). Total num frames: 42671104. Throughput: 0: 5836.4. Samples: 42679324. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:12,707][25689] Avg episode reward: [(0, '-60.278')] [2022-07-09 02:11:13,578][26022] Updated weights on worker 0-0, policy_version 41675 (0.00090) [2022-07-09 02:11:15,235][26022] Updated weights on worker 0-0, policy_version 41685 (0.00080) [2022-07-09 02:11:17,152][26022] Updated weights on worker 0-0, policy_version 41695 (0.00088) [2022-07-09 02:11:17,728][25689] Fps is (10 sec: 5801.6, 60 sec: 5682.9, 300 sec: 5659.9). Total num frames: 42698752. Throughput: 0: 5843.1. Samples: 42696458. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:17,729][25689] Avg episode reward: [(0, '-59.956')] [2022-07-09 02:11:18,913][26022] Updated weights on worker 0-0, policy_version 41705 (0.00089) [2022-07-09 02:11:20,805][26022] Updated weights on worker 0-0, policy_version 41715 (0.00093) [2022-07-09 02:11:22,456][26022] Updated weights on worker 0-0, policy_version 41725 (0.00097) [2022-07-09 02:11:22,747][25689] Fps is (10 sec: 5609.5, 60 sec: 5684.4, 300 sec: 5658.5). Total num frames: 42727424. Throughput: 0: 5822.1. Samples: 42730460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:22,747][25689] Avg episode reward: [(0, '-59.885')] [2022-07-09 02:11:24,429][26022] Updated weights on worker 0-0, policy_version 41735 (0.00083) [2022-07-09 02:11:26,107][26022] Updated weights on worker 0-0, policy_version 41745 (0.00084) [2022-07-09 02:11:27,809][25689] Fps is (10 sec: 5586.6, 60 sec: 5650.0, 300 sec: 5651.7). Total num frames: 42755072. Throughput: 0: 5930.4. Samples: 42764580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:27,810][25689] Avg episode reward: [(0, '-59.495')] [2022-07-09 02:11:28,026][26022] Updated weights on worker 0-0, policy_version 41755 (0.00081) [2022-07-09 02:11:29,902][26022] Updated weights on worker 0-0, policy_version 41765 (0.00087) [2022-07-09 02:11:31,665][26022] Updated weights on worker 0-0, policy_version 41775 (0.00092) [2022-07-09 02:11:32,813][25689] Fps is (10 sec: 5595.2, 60 sec: 5671.4, 300 sec: 5655.7). Total num frames: 42783744. Throughput: 0: 5096.8. Samples: 42781800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:32,813][25689] Avg episode reward: [(0, '-59.321')] [2022-07-09 02:11:33,385][26022] Updated weights on worker 0-0, policy_version 41785 (0.00087) [2022-07-09 02:11:35,106][26022] Updated weights on worker 0-0, policy_version 41795 (0.00085) [2022-07-09 02:11:36,851][26022] Updated weights on worker 0-0, policy_version 41805 (0.00084) [2022-07-09 02:11:37,875][25689] Fps is (10 sec: 5798.9, 60 sec: 5666.8, 300 sec: 5654.8). Total num frames: 42813440. Throughput: 0: 5951.0. Samples: 42816350. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:11:37,875][25689] Avg episode reward: [(0, '-59.122')] [2022-07-09 02:11:38,714][26022] Updated weights on worker 0-0, policy_version 41815 (0.00093) [2022-07-09 02:11:40,279][26022] Updated weights on worker 0-0, policy_version 41825 (0.00079) [2022-07-09 02:11:42,355][26022] Updated weights on worker 0-0, policy_version 41835 (0.00092) [2022-07-09 02:11:42,890][25689] Fps is (10 sec: 5791.8, 60 sec: 5683.2, 300 sec: 5660.1). Total num frames: 42842112. Throughput: 0: 5974.6. Samples: 42850810. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:11:42,891][25689] Avg episode reward: [(0, '-59.566')] [2022-07-09 02:11:43,958][26022] Updated weights on worker 0-0, policy_version 41845 (0.00082) [2022-07-09 02:11:45,797][26022] Updated weights on worker 0-0, policy_version 41855 (0.00084) [2022-07-09 02:11:47,683][26022] Updated weights on worker 0-0, policy_version 41865 (0.00091) [2022-07-09 02:11:47,932][25689] Fps is (10 sec: 5701.6, 60 sec: 5657.9, 300 sec: 5653.5). Total num frames: 42870784. Throughput: 0: 5139.0. Samples: 42867992. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:11:47,933][25689] Avg episode reward: [(0, '-59.799')] [2022-07-09 02:11:49,391][26022] Updated weights on worker 0-0, policy_version 41875 (0.00093) [2022-07-09 02:11:51,223][26022] Updated weights on worker 0-0, policy_version 41885 (0.00085) [2022-07-09 02:11:52,887][26022] Updated weights on worker 0-0, policy_version 41895 (0.00084) [2022-07-09 02:11:52,936][25689] Fps is (10 sec: 5810.0, 60 sec: 5691.5, 300 sec: 5660.4). Total num frames: 42900480. Throughput: 0: 6004.2. Samples: 42902626. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:11:52,937][25689] Avg episode reward: [(0, '-60.257')] [2022-07-09 02:11:54,670][26022] Updated weights on worker 0-0, policy_version 41905 (0.00089) [2022-07-09 02:11:56,502][26022] Updated weights on worker 0-0, policy_version 41915 (0.00089) [2022-07-09 02:11:57,943][25689] Fps is (10 sec: 5728.1, 60 sec: 5676.3, 300 sec: 5653.9). Total num frames: 42928128. Throughput: 0: 6006.9. Samples: 42936900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:11:57,944][25689] Avg episode reward: [(0, '-60.461')] [2022-07-09 02:11:58,273][26022] Updated weights on worker 0-0, policy_version 41925 (0.00085) [2022-07-09 02:12:00,306][26022] Updated weights on worker 0-0, policy_version 41935 (0.00089) [2022-07-09 02:12:02,001][26022] Updated weights on worker 0-0, policy_version 41945 (0.00089) [2022-07-09 02:12:02,946][25689] Fps is (10 sec: 5524.4, 60 sec: 5696.3, 300 sec: 5659.5). Total num frames: 42955776. Throughput: 0: 5139.2. Samples: 42953878. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:02,947][25689] Avg episode reward: [(0, '-60.196')] [2022-07-09 02:12:04,198][26022] Updated weights on worker 0-0, policy_version 41955 (0.00082) [2022-07-09 02:12:06,059][26022] Updated weights on worker 0-0, policy_version 41965 (0.00084) [2022-07-09 02:12:07,638][26022] Updated weights on worker 0-0, policy_version 41975 (0.00089) [2022-07-09 02:12:07,994][25689] Fps is (10 sec: 5501.5, 60 sec: 5686.3, 300 sec: 5655.7). Total num frames: 42983424. Throughput: 0: 5894.6. Samples: 42986250. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:07,995][25689] Avg episode reward: [(0, '-60.266')] [2022-07-09 02:12:09,612][26022] Updated weights on worker 0-0, policy_version 41985 (0.00086) [2022-07-09 02:12:11,345][26022] Updated weights on worker 0-0, policy_version 41995 (0.00088) [2022-07-09 02:12:13,018][25689] Fps is (10 sec: 5591.6, 60 sec: 5653.9, 300 sec: 5655.3). Total num frames: 43012096. Throughput: 0: 5869.6. Samples: 43020498. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:13,018][25689] Avg episode reward: [(0, '-60.510')] [2022-07-09 02:12:13,064][26022] Updated weights on worker 0-0, policy_version 42005 (0.00089) [2022-07-09 02:12:14,820][26022] Updated weights on worker 0-0, policy_version 42015 (0.00080) [2022-07-09 02:12:16,487][26022] Updated weights on worker 0-0, policy_version 42025 (0.00089) [2022-07-09 02:12:18,022][25689] Fps is (10 sec: 5718.4, 60 sec: 5672.5, 300 sec: 5659.0). Total num frames: 43040768. Throughput: 0: 5026.8. Samples: 43037834. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:18,023][25689] Avg episode reward: [(0, '-60.515')] [2022-07-09 02:12:18,497][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:12:18,506][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000042035_43043840.pth [2022-07-09 02:12:18,506][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000040043_41004032.pth [2022-07-09 02:12:18,513][26022] Updated weights on worker 0-0, policy_version 42035 (0.00087) [2022-07-09 02:12:20,300][26022] Updated weights on worker 0-0, policy_version 42045 (0.00100) [2022-07-09 02:12:22,151][26022] Updated weights on worker 0-0, policy_version 42055 (0.00086) [2022-07-09 02:12:23,051][25689] Fps is (10 sec: 5715.6, 60 sec: 5671.5, 300 sec: 5659.4). Total num frames: 43069440. Throughput: 0: 5884.5. Samples: 43072186. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:23,051][25689] Avg episode reward: [(0, '-60.642')] [2022-07-09 02:12:23,811][26022] Updated weights on worker 0-0, policy_version 42065 (0.00083) [2022-07-09 02:12:25,453][26022] Updated weights on worker 0-0, policy_version 42075 (0.00084) [2022-07-09 02:12:27,503][26022] Updated weights on worker 0-0, policy_version 42085 (0.00088) [2022-07-09 02:12:28,152][25689] Fps is (10 sec: 5660.7, 60 sec: 5684.8, 300 sec: 5657.9). Total num frames: 43098112. Throughput: 0: 5955.9. Samples: 43106310. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:28,153][25689] Avg episode reward: [(0, '-61.002')] [2022-07-09 02:12:29,227][26022] Updated weights on worker 0-0, policy_version 42095 (0.00083) [2022-07-09 02:12:31,094][26022] Updated weights on worker 0-0, policy_version 42105 (0.00087) [2022-07-09 02:12:32,927][26022] Updated weights on worker 0-0, policy_version 42115 (0.00086) [2022-07-09 02:12:33,170][25689] Fps is (10 sec: 5666.7, 60 sec: 5683.5, 300 sec: 5657.9). Total num frames: 43126784. Throughput: 0: 5100.2. Samples: 43123278. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 02:12:33,171][25689] Avg episode reward: [(0, '-61.101')] [2022-07-09 02:12:34,605][26022] Updated weights on worker 0-0, policy_version 42125 (0.00085) [2022-07-09 02:12:36,590][26022] Updated weights on worker 0-0, policy_version 42135 (0.00095) [2022-07-09 02:12:38,181][25689] Fps is (10 sec: 5718.3, 60 sec: 5671.3, 300 sec: 5658.4). Total num frames: 43155456. Throughput: 0: 5925.2. Samples: 43157276. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:12:38,181][25689] Avg episode reward: [(0, '-61.386')] [2022-07-09 02:12:38,244][26022] Updated weights on worker 0-0, policy_version 42145 (0.00100) [2022-07-09 02:12:40,129][26022] Updated weights on worker 0-0, policy_version 42155 (0.00088) [2022-07-09 02:12:41,853][26022] Updated weights on worker 0-0, policy_version 42165 (0.00085) [2022-07-09 02:12:43,204][25689] Fps is (10 sec: 5715.3, 60 sec: 5670.6, 300 sec: 5660.4). Total num frames: 43184128. Throughput: 0: 5926.8. Samples: 43191628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:12:43,204][25689] Avg episode reward: [(0, '-60.563')] [2022-07-09 02:12:43,730][26022] Updated weights on worker 0-0, policy_version 42175 (0.00088) [2022-07-09 02:12:45,502][26022] Updated weights on worker 0-0, policy_version 42185 (0.00084) [2022-07-09 02:12:47,291][26022] Updated weights on worker 0-0, policy_version 42195 (0.00093) [2022-07-09 02:12:48,277][25689] Fps is (10 sec: 5679.7, 60 sec: 5667.7, 300 sec: 5660.0). Total num frames: 43212800. Throughput: 0: 5081.6. Samples: 43208574. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:12:48,277][25689] Avg episode reward: [(0, '-61.443')] [2022-07-09 02:12:49,104][26022] Updated weights on worker 0-0, policy_version 42205 (0.00092) [2022-07-09 02:12:50,924][26022] Updated weights on worker 0-0, policy_version 42215 (0.00052) [2022-07-09 02:12:52,636][26022] Updated weights on worker 0-0, policy_version 42225 (0.00084) [2022-07-09 02:12:53,284][25689] Fps is (10 sec: 5688.4, 60 sec: 5650.4, 300 sec: 5661.1). Total num frames: 43241472. Throughput: 0: 5944.3. Samples: 43242842. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:12:53,285][25689] Avg episode reward: [(0, '-61.105')] [2022-07-09 02:12:54,556][26022] Updated weights on worker 0-0, policy_version 42235 (0.00119) [2022-07-09 02:12:56,259][26022] Updated weights on worker 0-0, policy_version 42245 (0.00095) [2022-07-09 02:12:58,289][25689] Fps is (10 sec: 5625.2, 60 sec: 5650.6, 300 sec: 5654.9). Total num frames: 43269120. Throughput: 0: 5933.2. Samples: 43276584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:12:58,290][25689] Avg episode reward: [(0, '-60.467')] [2022-07-09 02:12:58,300][26022] Updated weights on worker 0-0, policy_version 42255 (0.00764) [2022-07-09 02:12:59,974][26022] Updated weights on worker 0-0, policy_version 42265 (0.00088) [2022-07-09 02:13:01,896][26022] Updated weights on worker 0-0, policy_version 42275 (0.00090) [2022-07-09 02:13:03,325][25689] Fps is (10 sec: 5507.0, 60 sec: 5647.5, 300 sec: 5661.8). Total num frames: 43296768. Throughput: 0: 5078.4. Samples: 43293814. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:13:03,326][25689] Avg episode reward: [(0, '-60.428')] [2022-07-09 02:13:03,992][26022] Updated weights on worker 0-0, policy_version 42285 (0.00095) [2022-07-09 02:13:05,747][26022] Updated weights on worker 0-0, policy_version 42295 (0.00096) [2022-07-09 02:13:07,537][26022] Updated weights on worker 0-0, policy_version 42305 (0.00082) [2022-07-09 02:13:08,431][25689] Fps is (10 sec: 5553.3, 60 sec: 5659.1, 300 sec: 5653.0). Total num frames: 43325440. Throughput: 0: 5819.1. Samples: 43325850. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:13:08,431][25689] Avg episode reward: [(0, '-60.014')] [2022-07-09 02:13:09,191][26022] Updated weights on worker 0-0, policy_version 42315 (0.00088) [2022-07-09 02:13:11,188][26022] Updated weights on worker 0-0, policy_version 42325 (0.00085) [2022-07-09 02:13:12,787][26022] Updated weights on worker 0-0, policy_version 42335 (0.00085) [2022-07-09 02:13:13,446][25689] Fps is (10 sec: 5564.8, 60 sec: 5642.9, 300 sec: 5652.9). Total num frames: 43353088. Throughput: 0: 5804.8. Samples: 43359878. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:13:13,447][25689] Avg episode reward: [(0, '-60.282')] [2022-07-09 02:13:14,836][26022] Updated weights on worker 0-0, policy_version 42345 (0.00094) [2022-07-09 02:13:16,594][26022] Updated weights on worker 0-0, policy_version 42355 (0.00101) [2022-07-09 02:13:18,282][26022] Updated weights on worker 0-0, policy_version 42365 (0.00382) [2022-07-09 02:13:18,454][25689] Fps is (10 sec: 5721.3, 60 sec: 5659.6, 300 sec: 5656.9). Total num frames: 43382784. Throughput: 0: 4984.1. Samples: 43377086. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:13:18,454][25689] Avg episode reward: [(0, '-59.090')] [2022-07-09 02:13:20,227][26022] Updated weights on worker 0-0, policy_version 42375 (0.00087) [2022-07-09 02:13:21,917][26022] Updated weights on worker 0-0, policy_version 42385 (0.00105) [2022-07-09 02:13:23,467][25689] Fps is (10 sec: 5722.7, 60 sec: 5644.1, 300 sec: 5658.4). Total num frames: 43410432. Throughput: 0: 5863.2. Samples: 43411906. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-09 02:13:23,467][25689] Avg episode reward: [(0, '-59.396')] [2022-07-09 02:13:23,795][26022] Updated weights on worker 0-0, policy_version 42395 (0.00088) [2022-07-09 02:13:25,518][26022] Updated weights on worker 0-0, policy_version 42405 (0.00086) [2022-07-09 02:13:27,389][26022] Updated weights on worker 0-0, policy_version 42415 (0.00091) [2022-07-09 02:13:28,584][25689] Fps is (10 sec: 5660.3, 60 sec: 5659.5, 300 sec: 5656.7). Total num frames: 43440128. Throughput: 0: 5950.1. Samples: 43445766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:28,585][25689] Avg episode reward: [(0, '-60.697')] [2022-07-09 02:13:29,025][26022] Updated weights on worker 0-0, policy_version 42425 (0.00091) [2022-07-09 02:13:31,005][26022] Updated weights on worker 0-0, policy_version 42435 (0.00628) [2022-07-09 02:13:32,786][26022] Updated weights on worker 0-0, policy_version 42445 (0.00087) [2022-07-09 02:13:33,633][25689] Fps is (10 sec: 5640.7, 60 sec: 5639.7, 300 sec: 5656.4). Total num frames: 43467776. Throughput: 0: 5099.4. Samples: 43462818. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:33,633][25689] Avg episode reward: [(0, '-61.091')] [2022-07-09 02:13:34,380][26022] Updated weights on worker 0-0, policy_version 42455 (0.00084) [2022-07-09 02:13:36,344][26022] Updated weights on worker 0-0, policy_version 42465 (0.00091) [2022-07-09 02:13:37,996][26022] Updated weights on worker 0-0, policy_version 42475 (0.00053) [2022-07-09 02:13:38,648][25689] Fps is (10 sec: 5698.2, 60 sec: 5656.2, 300 sec: 5656.2). Total num frames: 43497472. Throughput: 0: 5948.4. Samples: 43497210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:38,648][25689] Avg episode reward: [(0, '-60.773')] [2022-07-09 02:13:40,124][26022] Updated weights on worker 0-0, policy_version 42485 (0.00085) [2022-07-09 02:13:41,588][26022] Updated weights on worker 0-0, policy_version 42495 (0.00089) [2022-07-09 02:13:43,472][26022] Updated weights on worker 0-0, policy_version 42505 (0.00089) [2022-07-09 02:13:43,658][25689] Fps is (10 sec: 5924.2, 60 sec: 5674.3, 300 sec: 5658.4). Total num frames: 43527168. Throughput: 0: 5915.6. Samples: 43531350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:43,659][25689] Avg episode reward: [(0, '-60.345')] [2022-07-09 02:13:45,324][26022] Updated weights on worker 0-0, policy_version 42515 (0.00108) [2022-07-09 02:13:47,088][26022] Updated weights on worker 0-0, policy_version 42525 (0.00087) [2022-07-09 02:13:48,740][25689] Fps is (10 sec: 5580.4, 60 sec: 5639.6, 300 sec: 5653.9). Total num frames: 43553792. Throughput: 0: 5085.5. Samples: 43548268. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:48,741][25689] Avg episode reward: [(0, '-60.635')] [2022-07-09 02:13:49,015][26022] Updated weights on worker 0-0, policy_version 42535 (0.00085) [2022-07-09 02:13:50,534][26022] Updated weights on worker 0-0, policy_version 42545 (0.00091) [2022-07-09 02:13:52,507][26022] Updated weights on worker 0-0, policy_version 42555 (0.00089) [2022-07-09 02:13:53,779][25689] Fps is (10 sec: 5564.5, 60 sec: 5653.6, 300 sec: 5653.4). Total num frames: 43583488. Throughput: 0: 5910.4. Samples: 43581892. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:53,780][25689] Avg episode reward: [(0, '-60.878')] [2022-07-09 02:13:54,601][26022] Updated weights on worker 0-0, policy_version 42565 (0.00086) [2022-07-09 02:13:56,108][26022] Updated weights on worker 0-0, policy_version 42575 (0.00094) [2022-07-09 02:13:58,147][26022] Updated weights on worker 0-0, policy_version 42585 (0.00092) [2022-07-09 02:13:58,860][25689] Fps is (10 sec: 5666.7, 60 sec: 5646.6, 300 sec: 5652.4). Total num frames: 43611136. Throughput: 0: 5883.3. Samples: 43616122. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:13:58,860][25689] Avg episode reward: [(0, '-60.435')] [2022-07-09 02:13:59,684][26022] Updated weights on worker 0-0, policy_version 42595 (0.00078) [2022-07-09 02:14:01,971][26022] Updated weights on worker 0-0, policy_version 42605 (0.00093) [2022-07-09 02:14:03,699][26022] Updated weights on worker 0-0, policy_version 42615 (0.00088) [2022-07-09 02:14:03,885][25689] Fps is (10 sec: 5370.3, 60 sec: 5630.6, 300 sec: 5654.3). Total num frames: 43637760. Throughput: 0: 5040.2. Samples: 43633300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:14:03,886][25689] Avg episode reward: [(0, '-60.771')] [2022-07-09 02:14:05,625][26022] Updated weights on worker 0-0, policy_version 42625 (0.00090) [2022-07-09 02:14:07,361][26022] Updated weights on worker 0-0, policy_version 42635 (0.00084) [2022-07-09 02:14:09,029][25689] Fps is (10 sec: 5538.5, 60 sec: 5644.0, 300 sec: 5655.3). Total num frames: 43667456. Throughput: 0: 5770.3. Samples: 43665338. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:14:09,029][25689] Avg episode reward: [(0, '-60.409')] [2022-07-09 02:14:09,270][26022] Updated weights on worker 0-0, policy_version 42645 (0.00096) [2022-07-09 02:14:10,850][26022] Updated weights on worker 0-0, policy_version 42655 (0.00085) [2022-07-09 02:14:12,637][26022] Updated weights on worker 0-0, policy_version 42665 (0.00084) [2022-07-09 02:14:14,032][25689] Fps is (10 sec: 5651.6, 60 sec: 5645.2, 300 sec: 5646.1). Total num frames: 43695104. Throughput: 0: 5828.1. Samples: 43699926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:14:14,032][25689] Avg episode reward: [(0, '-60.222')] [2022-07-09 02:14:14,583][26022] Updated weights on worker 0-0, policy_version 42675 (0.00082) [2022-07-09 02:14:16,143][26022] Updated weights on worker 0-0, policy_version 42685 (0.00102) [2022-07-09 02:14:18,100][26022] Updated weights on worker 0-0, policy_version 42695 (0.00102) [2022-07-09 02:14:18,618][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:14:18,637][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000042699_43723776.pth [2022-07-09 02:14:18,638][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000040708_41684992.pth [2022-07-09 02:14:19,050][25689] Fps is (10 sec: 5721.9, 60 sec: 5644.1, 300 sec: 5652.8). Total num frames: 43724800. Throughput: 0: 5852.2. Samples: 43734282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 02:14:19,051][25689] Avg episode reward: [(0, '-59.606')] [2022-07-09 02:14:19,894][26022] Updated weights on worker 0-0, policy_version 42705 (0.00089) [2022-07-09 02:14:21,551][26022] Updated weights on worker 0-0, policy_version 42715 (0.00096) [2022-07-09 02:14:23,583][26022] Updated weights on worker 0-0, policy_version 42725 (0.00086) [2022-07-09 02:14:24,060][25689] Fps is (10 sec: 5718.2, 60 sec: 5644.4, 300 sec: 5651.3). Total num frames: 43752448. Throughput: 0: 5859.4. Samples: 43751512. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:24,061][25689] Avg episode reward: [(0, '-59.162')] [2022-07-09 02:14:25,180][26022] Updated weights on worker 0-0, policy_version 42735 (0.00087) [2022-07-09 02:14:27,206][26022] Updated weights on worker 0-0, policy_version 42745 (0.00083) [2022-07-09 02:14:28,813][26022] Updated weights on worker 0-0, policy_version 42755 (0.00083) [2022-07-09 02:14:29,142][25689] Fps is (10 sec: 5580.8, 60 sec: 5630.9, 300 sec: 5653.3). Total num frames: 43781120. Throughput: 0: 5971.5. Samples: 43785446. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:29,143][25689] Avg episode reward: [(0, '-59.682')] [2022-07-09 02:14:30,647][26022] Updated weights on worker 0-0, policy_version 42765 (0.00088) [2022-07-09 02:14:32,405][26022] Updated weights on worker 0-0, policy_version 42775 (0.00089) [2022-07-09 02:14:34,174][25689] Fps is (10 sec: 5771.0, 60 sec: 5666.2, 300 sec: 5649.4). Total num frames: 43810816. Throughput: 0: 5941.9. Samples: 43819610. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:34,175][25689] Avg episode reward: [(0, '-58.654')] [2022-07-09 02:14:34,348][26022] Updated weights on worker 0-0, policy_version 42785 (0.00091) [2022-07-09 02:14:35,855][26022] Updated weights on worker 0-0, policy_version 42795 (0.00097) [2022-07-09 02:14:37,973][26022] Updated weights on worker 0-0, policy_version 42805 (0.00093) [2022-07-09 02:14:39,199][25689] Fps is (10 sec: 5803.8, 60 sec: 5648.4, 300 sec: 5656.3). Total num frames: 43839488. Throughput: 0: 5094.6. Samples: 43836932. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:39,200][25689] Avg episode reward: [(0, '-58.685')] [2022-07-09 02:14:39,540][26022] Updated weights on worker 0-0, policy_version 42815 (0.00095) [2022-07-09 02:14:41,552][26022] Updated weights on worker 0-0, policy_version 42825 (0.00088) [2022-07-09 02:14:43,270][26022] Updated weights on worker 0-0, policy_version 42835 (0.00094) [2022-07-09 02:14:44,242][25689] Fps is (10 sec: 5695.9, 60 sec: 5628.4, 300 sec: 5656.5). Total num frames: 43868160. Throughput: 0: 5927.1. Samples: 43871130. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:44,242][25689] Avg episode reward: [(0, '-58.863')] [2022-07-09 02:14:45,210][26022] Updated weights on worker 0-0, policy_version 42846 (0.00086) [2022-07-09 02:14:46,928][26022] Updated weights on worker 0-0, policy_version 42856 (0.00087) [2022-07-09 02:14:49,029][26022] Updated weights on worker 0-0, policy_version 42866 (0.00087) [2022-07-09 02:14:49,284][25689] Fps is (10 sec: 5686.3, 60 sec: 5666.0, 300 sec: 5656.1). Total num frames: 43896832. Throughput: 0: 5970.0. Samples: 43905690. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:49,289][25689] Avg episode reward: [(0, '-59.651')] [2022-07-09 02:14:50,419][26022] Updated weights on worker 0-0, policy_version 42876 (0.00086) [2022-07-09 02:14:52,405][26022] Updated weights on worker 0-0, policy_version 42886 (0.00083) [2022-07-09 02:14:53,933][26022] Updated weights on worker 0-0, policy_version 42896 (0.00080) [2022-07-09 02:14:54,311][25689] Fps is (10 sec: 5796.9, 60 sec: 5667.2, 300 sec: 5659.3). Total num frames: 43926528. Throughput: 0: 5137.0. Samples: 43923050. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:54,312][25689] Avg episode reward: [(0, '-59.703')] [2022-07-09 02:14:55,915][26022] Updated weights on worker 0-0, policy_version 42906 (0.00101) [2022-07-09 02:14:57,635][26022] Updated weights on worker 0-0, policy_version 42916 (0.00088) [2022-07-09 02:14:59,317][25689] Fps is (10 sec: 5715.2, 60 sec: 5674.1, 300 sec: 5666.3). Total num frames: 43954176. Throughput: 0: 6014.2. Samples: 43957926. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:14:59,319][25689] Avg episode reward: [(0, '-60.862')] [2022-07-09 02:14:59,403][26022] Updated weights on worker 0-0, policy_version 42926 (0.00081) [2022-07-09 02:15:01,322][26022] Updated weights on worker 0-0, policy_version 42936 (0.00091) [2022-07-09 02:15:03,383][26022] Updated weights on worker 0-0, policy_version 42946 (0.00088) [2022-07-09 02:15:04,346][25689] Fps is (10 sec: 5611.9, 60 sec: 5707.6, 300 sec: 5668.1). Total num frames: 43982848. Throughput: 0: 5923.5. Samples: 43990220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:15:04,348][25689] Avg episode reward: [(0, '-61.653')] [2022-07-09 02:15:05,148][26022] Updated weights on worker 0-0, policy_version 42956 (0.00085) [2022-07-09 02:15:06,932][26022] Updated weights on worker 0-0, policy_version 42966 (0.00089) [2022-07-09 02:15:08,853][26022] Updated weights on worker 0-0, policy_version 42976 (0.00086) [2022-07-09 02:15:09,450][25689] Fps is (10 sec: 5659.1, 60 sec: 5694.4, 300 sec: 5662.9). Total num frames: 44011520. Throughput: 0: 5038.1. Samples: 44007292. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:15:09,451][25689] Avg episode reward: [(0, '-62.078')] [2022-07-09 02:15:10,451][26022] Updated weights on worker 0-0, policy_version 42986 (0.00087) [2022-07-09 02:15:12,271][26022] Updated weights on worker 0-0, policy_version 42996 (0.00087) [2022-07-09 02:15:13,967][26022] Updated weights on worker 0-0, policy_version 43006 (0.00087) [2022-07-09 02:15:14,487][25689] Fps is (10 sec: 5554.0, 60 sec: 5691.3, 300 sec: 5665.9). Total num frames: 44039168. Throughput: 0: 5896.7. Samples: 44042022. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2022-07-09 02:15:14,487][25689] Avg episode reward: [(0, '-61.872')] [2022-07-09 02:15:15,797][26022] Updated weights on worker 0-0, policy_version 43016 (0.00082) [2022-07-09 02:15:17,723][26022] Updated weights on worker 0-0, policy_version 43026 (0.00090) [2022-07-09 02:15:19,331][26022] Updated weights on worker 0-0, policy_version 43036 (0.00085) [2022-07-09 02:15:19,586][25689] Fps is (10 sec: 5758.5, 60 sec: 5700.6, 300 sec: 5671.5). Total num frames: 44069888. Throughput: 0: 5853.5. Samples: 44076572. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:19,587][25689] Avg episode reward: [(0, '-61.229')] [2022-07-09 02:15:21,143][26022] Updated weights on worker 0-0, policy_version 43046 (0.00088) [2022-07-09 02:15:22,999][26022] Updated weights on worker 0-0, policy_version 43056 (0.00074) [2022-07-09 02:15:24,639][25689] Fps is (10 sec: 5850.4, 60 sec: 5713.5, 300 sec: 5668.2). Total num frames: 44098560. Throughput: 0: 5105.2. Samples: 44093816. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:24,639][25689] Avg episode reward: [(0, '-60.688')] [2022-07-09 02:15:24,678][26022] Updated weights on worker 0-0, policy_version 43066 (0.00086) [2022-07-09 02:15:26,692][26022] Updated weights on worker 0-0, policy_version 43076 (0.00082) [2022-07-09 02:15:28,402][26022] Updated weights on worker 0-0, policy_version 43086 (0.00091) [2022-07-09 02:15:29,713][25689] Fps is (10 sec: 5662.5, 60 sec: 5714.2, 300 sec: 5671.2). Total num frames: 44127232. Throughput: 0: 5938.7. Samples: 44127628. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:29,714][25689] Avg episode reward: [(0, '-59.788')] [2022-07-09 02:15:30,298][26022] Updated weights on worker 0-0, policy_version 43096 (0.00082) [2022-07-09 02:15:32,164][26022] Updated weights on worker 0-0, policy_version 43106 (0.00087) [2022-07-09 02:15:33,750][26022] Updated weights on worker 0-0, policy_version 43116 (0.00095) [2022-07-09 02:15:34,726][25689] Fps is (10 sec: 5583.1, 60 sec: 5682.1, 300 sec: 5664.3). Total num frames: 44154880. Throughput: 0: 5901.8. Samples: 44161470. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:34,727][25689] Avg episode reward: [(0, '-58.713')] [2022-07-09 02:15:35,766][26022] Updated weights on worker 0-0, policy_version 43126 (0.00075) [2022-07-09 02:15:37,650][26022] Updated weights on worker 0-0, policy_version 43136 (0.00116) [2022-07-09 02:15:39,331][26022] Updated weights on worker 0-0, policy_version 43146 (0.00086) [2022-07-09 02:15:39,741][25689] Fps is (10 sec: 5616.5, 60 sec: 5683.1, 300 sec: 5667.6). Total num frames: 44183552. Throughput: 0: 5053.6. Samples: 44178424. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:39,747][25689] Avg episode reward: [(0, '-58.686')] [2022-07-09 02:15:41,242][26022] Updated weights on worker 0-0, policy_version 43156 (0.00086) [2022-07-09 02:15:42,796][26022] Updated weights on worker 0-0, policy_version 43166 (0.00094) [2022-07-09 02:15:44,695][26022] Updated weights on worker 0-0, policy_version 43176 (0.00090) [2022-07-09 02:15:44,759][25689] Fps is (10 sec: 5715.5, 60 sec: 5685.4, 300 sec: 5662.9). Total num frames: 44212224. Throughput: 0: 5909.5. Samples: 44212718. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:44,760][25689] Avg episode reward: [(0, '-58.783')] [2022-07-09 02:15:46,381][26022] Updated weights on worker 0-0, policy_version 43186 (0.00088) [2022-07-09 02:15:48,263][26022] Updated weights on worker 0-0, policy_version 43196 (0.00084) [2022-07-09 02:15:49,871][25689] Fps is (10 sec: 5761.9, 60 sec: 5695.7, 300 sec: 5667.7). Total num frames: 44241920. Throughput: 0: 5926.1. Samples: 44247084. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:49,871][25689] Avg episode reward: [(0, '-59.182')] [2022-07-09 02:15:50,009][26022] Updated weights on worker 0-0, policy_version 43206 (0.00083) [2022-07-09 02:15:52,008][26022] Updated weights on worker 0-0, policy_version 43216 (0.00099) [2022-07-09 02:15:53,539][26022] Updated weights on worker 0-0, policy_version 43226 (0.00092) [2022-07-09 02:15:54,895][25689] Fps is (10 sec: 5556.7, 60 sec: 5645.3, 300 sec: 5660.9). Total num frames: 44268544. Throughput: 0: 5091.7. Samples: 44264162. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:54,895][25689] Avg episode reward: [(0, '-58.844')] [2022-07-09 02:15:55,579][26022] Updated weights on worker 0-0, policy_version 43236 (0.00088) [2022-07-09 02:15:57,185][26022] Updated weights on worker 0-0, policy_version 43246 (0.00083) [2022-07-09 02:15:59,011][26022] Updated weights on worker 0-0, policy_version 43256 (0.00085) [2022-07-09 02:15:59,913][25689] Fps is (10 sec: 5608.0, 60 sec: 5678.0, 300 sec: 5671.5). Total num frames: 44298240. Throughput: 0: 5963.0. Samples: 44298714. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:15:59,914][25689] Avg episode reward: [(0, '-59.658')] [2022-07-09 02:16:00,700][26022] Updated weights on worker 0-0, policy_version 43266 (0.00108) [2022-07-09 02:16:03,081][26022] Updated weights on worker 0-0, policy_version 43276 (0.00093) [2022-07-09 02:16:04,678][26022] Updated weights on worker 0-0, policy_version 43286 (0.00086) [2022-07-09 02:16:04,917][25689] Fps is (10 sec: 5721.7, 60 sec: 5663.5, 300 sec: 5670.3). Total num frames: 44325888. Throughput: 0: 5849.7. Samples: 44330634. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:16:04,917][25689] Avg episode reward: [(0, '-59.878')] [2022-07-09 02:16:06,571][26022] Updated weights on worker 0-0, policy_version 43296 (0.00085) [2022-07-09 02:16:08,322][26022] Updated weights on worker 0-0, policy_version 43306 (0.00087) [2022-07-09 02:16:10,002][25689] Fps is (10 sec: 5481.3, 60 sec: 5648.4, 300 sec: 5659.1). Total num frames: 44353536. Throughput: 0: 4994.3. Samples: 44347624. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 02:16:10,002][25689] Avg episode reward: [(0, '-59.478')] [2022-07-09 02:16:10,231][26022] Updated weights on worker 0-0, policy_version 43316 (0.00086) [2022-07-09 02:16:12,132][26022] Updated weights on worker 0-0, policy_version 43326 (0.00089) [2022-07-09 02:16:13,784][26022] Updated weights on worker 0-0, policy_version 43336 (0.00092) [2022-07-09 02:16:15,031][25689] Fps is (10 sec: 5669.9, 60 sec: 5682.9, 300 sec: 5665.9). Total num frames: 44383232. Throughput: 0: 5849.5. Samples: 44381948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:15,031][25689] Avg episode reward: [(0, '-60.385')] [2022-07-09 02:16:15,593][26022] Updated weights on worker 0-0, policy_version 43346 (0.00048) [2022-07-09 02:16:17,316][26022] Updated weights on worker 0-0, policy_version 43356 (0.00088) [2022-07-09 02:16:18,677][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:16:18,690][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000043364_44404736.pth [2022-07-09 02:16:18,691][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000041369_42361856.pth [2022-07-09 02:16:19,042][26022] Updated weights on worker 0-0, policy_version 43366 (0.00083) [2022-07-09 02:16:20,062][25689] Fps is (10 sec: 5801.8, 60 sec: 5655.4, 300 sec: 5665.6). Total num frames: 44411904. Throughput: 0: 5859.4. Samples: 44416772. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:20,062][25689] Avg episode reward: [(0, '-60.056')] [2022-07-09 02:16:20,927][26022] Updated weights on worker 0-0, policy_version 43376 (0.00090) [2022-07-09 02:16:22,625][26022] Updated weights on worker 0-0, policy_version 43386 (0.00086) [2022-07-09 02:16:24,593][26022] Updated weights on worker 0-0, policy_version 43396 (0.00091) [2022-07-09 02:16:25,071][25689] Fps is (10 sec: 5711.5, 60 sec: 5659.5, 300 sec: 5670.1). Total num frames: 44440576. Throughput: 0: 5123.8. Samples: 44433898. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:25,072][25689] Avg episode reward: [(0, '-59.667')] [2022-07-09 02:16:26,323][26022] Updated weights on worker 0-0, policy_version 43406 (0.00087) [2022-07-09 02:16:28,146][26022] Updated weights on worker 0-0, policy_version 43416 (0.00084) [2022-07-09 02:16:29,724][26022] Updated weights on worker 0-0, policy_version 43426 (0.00295) [2022-07-09 02:16:30,129][25689] Fps is (10 sec: 5594.5, 60 sec: 5644.1, 300 sec: 5665.6). Total num frames: 44468224. Throughput: 0: 5987.9. Samples: 44468144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:30,129][25689] Avg episode reward: [(0, '-60.033')] [2022-07-09 02:16:31,785][26022] Updated weights on worker 0-0, policy_version 43436 (0.00090) [2022-07-09 02:16:33,661][26022] Updated weights on worker 0-0, policy_version 43446 (0.00050) [2022-07-09 02:16:35,141][25689] Fps is (10 sec: 5592.6, 60 sec: 5661.1, 300 sec: 5663.1). Total num frames: 44496896. Throughput: 0: 5971.9. Samples: 44502046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:35,142][25689] Avg episode reward: [(0, '-60.407')] [2022-07-09 02:16:35,344][26022] Updated weights on worker 0-0, policy_version 43456 (0.00082) [2022-07-09 02:16:37,058][26022] Updated weights on worker 0-0, policy_version 43466 (0.00087) [2022-07-09 02:16:38,836][26022] Updated weights on worker 0-0, policy_version 43476 (0.00080) [2022-07-09 02:16:40,171][25689] Fps is (10 sec: 5812.5, 60 sec: 5676.7, 300 sec: 5666.3). Total num frames: 44526592. Throughput: 0: 5115.4. Samples: 44519636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:40,172][25689] Avg episode reward: [(0, '-59.788')] [2022-07-09 02:16:40,712][26022] Updated weights on worker 0-0, policy_version 43486 (0.00091) [2022-07-09 02:16:42,439][26022] Updated weights on worker 0-0, policy_version 43496 (0.00081) [2022-07-09 02:16:44,266][26022] Updated weights on worker 0-0, policy_version 43506 (0.00094) [2022-07-09 02:16:45,204][25689] Fps is (10 sec: 5800.5, 60 sec: 5675.3, 300 sec: 5666.5). Total num frames: 44555264. Throughput: 0: 5967.7. Samples: 44554046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:45,204][25689] Avg episode reward: [(0, '-59.341')] [2022-07-09 02:16:45,930][26022] Updated weights on worker 0-0, policy_version 43516 (0.00093) [2022-07-09 02:16:47,802][26022] Updated weights on worker 0-0, policy_version 43526 (0.00088) [2022-07-09 02:16:49,624][26022] Updated weights on worker 0-0, policy_version 43536 (0.00088) [2022-07-09 02:16:50,274][25689] Fps is (10 sec: 5878.4, 60 sec: 5696.1, 300 sec: 5668.7). Total num frames: 44585984. Throughput: 0: 5981.5. Samples: 44588642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:50,276][25689] Avg episode reward: [(0, '-60.156')] [2022-07-09 02:16:51,267][26022] Updated weights on worker 0-0, policy_version 43546 (0.00092) [2022-07-09 02:16:53,093][26022] Updated weights on worker 0-0, policy_version 43556 (0.00080) [2022-07-09 02:16:54,884][26022] Updated weights on worker 0-0, policy_version 43566 (0.00081) [2022-07-09 02:16:55,319][25689] Fps is (10 sec: 5669.0, 60 sec: 5694.1, 300 sec: 5664.5). Total num frames: 44612608. Throughput: 0: 6019.4. Samples: 44623506. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:16:55,321][25689] Avg episode reward: [(0, '-60.590')] [2022-07-09 02:16:56,651][26022] Updated weights on worker 0-0, policy_version 43576 (0.00097) [2022-07-09 02:16:58,631][26022] Updated weights on worker 0-0, policy_version 43586 (0.00095) [2022-07-09 02:17:00,099][26022] Updated weights on worker 0-0, policy_version 43596 (0.00088) [2022-07-09 02:17:00,349][25689] Fps is (10 sec: 5692.0, 60 sec: 5710.1, 300 sec: 5674.3). Total num frames: 44643328. Throughput: 0: 5995.0. Samples: 44640602. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:17:00,350][25689] Avg episode reward: [(0, '-60.482')] [2022-07-09 02:17:02,479][26022] Updated weights on worker 0-0, policy_version 43606 (0.00088) [2022-07-09 02:17:04,272][26022] Updated weights on worker 0-0, policy_version 43616 (0.00084) [2022-07-09 02:17:05,408][25689] Fps is (10 sec: 5582.7, 60 sec: 5671.0, 300 sec: 5667.2). Total num frames: 44668928. Throughput: 0: 5864.2. Samples: 44672526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:17:05,413][25689] Avg episode reward: [(0, '-61.025')] [2022-07-09 02:17:06,158][26022] Updated weights on worker 0-0, policy_version 43626 (0.00083) [2022-07-09 02:17:07,816][26022] Updated weights on worker 0-0, policy_version 43636 (0.00079) [2022-07-09 02:17:09,677][26022] Updated weights on worker 0-0, policy_version 43646 (0.00089) [2022-07-09 02:17:10,488][25689] Fps is (10 sec: 5352.7, 60 sec: 5688.3, 300 sec: 5666.1). Total num frames: 44697600. Throughput: 0: 5850.9. Samples: 44706910. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:10,488][25689] Avg episode reward: [(0, '-61.527')] [2022-07-09 02:17:11,363][26022] Updated weights on worker 0-0, policy_version 43656 (0.00092) [2022-07-09 02:17:13,075][26022] Updated weights on worker 0-0, policy_version 43666 (0.00081) [2022-07-09 02:17:14,945][26022] Updated weights on worker 0-0, policy_version 43676 (0.00088) [2022-07-09 02:17:15,515][25689] Fps is (10 sec: 5876.1, 60 sec: 5705.4, 300 sec: 5672.6). Total num frames: 44728320. Throughput: 0: 4998.6. Samples: 44724456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:15,515][25689] Avg episode reward: [(0, '-61.176')] [2022-07-09 02:17:16,656][26022] Updated weights on worker 0-0, policy_version 43686 (0.00086) [2022-07-09 02:17:18,391][26022] Updated weights on worker 0-0, policy_version 43696 (0.00081) [2022-07-09 02:17:20,378][26022] Updated weights on worker 0-0, policy_version 43706 (0.00080) [2022-07-09 02:17:20,574][25689] Fps is (10 sec: 5685.3, 60 sec: 5668.9, 300 sec: 5665.1). Total num frames: 44754944. Throughput: 0: 5859.6. Samples: 44759116. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:20,575][25689] Avg episode reward: [(0, '-60.754')] [2022-07-09 02:17:21,963][26022] Updated weights on worker 0-0, policy_version 43716 (0.01365) [2022-07-09 02:17:23,988][26022] Updated weights on worker 0-0, policy_version 43726 (0.00095) [2022-07-09 02:17:25,538][26022] Updated weights on worker 0-0, policy_version 43736 (0.00083) [2022-07-09 02:17:25,655][25689] Fps is (10 sec: 5655.2, 60 sec: 5696.0, 300 sec: 5672.4). Total num frames: 44785664. Throughput: 0: 5964.6. Samples: 44793294. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:25,655][25689] Avg episode reward: [(0, '-60.511')] [2022-07-09 02:17:27,476][26022] Updated weights on worker 0-0, policy_version 43746 (0.00095) [2022-07-09 02:17:29,452][26022] Updated weights on worker 0-0, policy_version 43756 (0.00084) [2022-07-09 02:17:30,714][25689] Fps is (10 sec: 5857.1, 60 sec: 5712.8, 300 sec: 5671.6). Total num frames: 44814336. Throughput: 0: 5092.9. Samples: 44809922. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:30,715][25689] Avg episode reward: [(0, '-60.834')] [2022-07-09 02:17:30,967][26022] Updated weights on worker 0-0, policy_version 43766 (0.00096) [2022-07-09 02:17:33,083][26022] Updated weights on worker 0-0, policy_version 43776 (0.00086) [2022-07-09 02:17:34,687][26022] Updated weights on worker 0-0, policy_version 43786 (0.00086) [2022-07-09 02:17:35,746][25689] Fps is (10 sec: 5580.9, 60 sec: 5694.0, 300 sec: 5667.7). Total num frames: 44841984. Throughput: 0: 5916.3. Samples: 44844152. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:35,747][25689] Avg episode reward: [(0, '-61.158')] [2022-07-09 02:17:36,390][26022] Updated weights on worker 0-0, policy_version 43796 (0.00089) [2022-07-09 02:17:38,395][26022] Updated weights on worker 0-0, policy_version 43806 (0.00095) [2022-07-09 02:17:40,062][26022] Updated weights on worker 0-0, policy_version 43816 (0.00086) [2022-07-09 02:17:40,800][25689] Fps is (10 sec: 5584.3, 60 sec: 5674.9, 300 sec: 5667.2). Total num frames: 44870656. Throughput: 0: 5899.0. Samples: 44878428. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:40,800][25689] Avg episode reward: [(0, '-60.297')] [2022-07-09 02:17:41,917][26022] Updated weights on worker 0-0, policy_version 43826 (0.00087) [2022-07-09 02:17:43,730][26022] Updated weights on worker 0-0, policy_version 43836 (0.00095) [2022-07-09 02:17:45,509][26022] Updated weights on worker 0-0, policy_version 43846 (0.00083) [2022-07-09 02:17:45,874][25689] Fps is (10 sec: 5662.4, 60 sec: 5671.1, 300 sec: 5667.2). Total num frames: 44899328. Throughput: 0: 5058.3. Samples: 44895566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:45,874][25689] Avg episode reward: [(0, '-61.112')] [2022-07-09 02:17:47,275][26022] Updated weights on worker 0-0, policy_version 43856 (0.00082) [2022-07-09 02:17:49,051][26022] Updated weights on worker 0-0, policy_version 43866 (0.00082) [2022-07-09 02:17:50,823][26022] Updated weights on worker 0-0, policy_version 43876 (0.00096) [2022-07-09 02:17:50,921][25689] Fps is (10 sec: 5766.7, 60 sec: 5656.3, 300 sec: 5669.8). Total num frames: 44929024. Throughput: 0: 5943.1. Samples: 44930014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:50,931][25689] Avg episode reward: [(0, '-59.925')] [2022-07-09 02:17:52,631][26022] Updated weights on worker 0-0, policy_version 43886 (0.00088) [2022-07-09 02:17:54,459][26022] Updated weights on worker 0-0, policy_version 43896 (0.00083) [2022-07-09 02:17:55,933][25689] Fps is (10 sec: 5802.4, 60 sec: 5693.2, 300 sec: 5673.1). Total num frames: 44957696. Throughput: 0: 5979.3. Samples: 44964852. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:17:55,935][25689] Avg episode reward: [(0, '-59.011')] [2022-07-09 02:17:56,245][26022] Updated weights on worker 0-0, policy_version 43906 (0.00094) [2022-07-09 02:17:58,079][26022] Updated weights on worker 0-0, policy_version 43916 (0.00433) [2022-07-09 02:17:59,760][26022] Updated weights on worker 0-0, policy_version 43926 (0.00087) [2022-07-09 02:18:00,969][25689] Fps is (10 sec: 5809.3, 60 sec: 5675.7, 300 sec: 5680.0). Total num frames: 44987392. Throughput: 0: 5126.9. Samples: 44981832. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 02:18:00,969][25689] Avg episode reward: [(0, '-59.274')] [2022-07-09 02:18:01,618][26022] Updated weights on worker 0-0, policy_version 43936 (0.00095) [2022-07-09 02:18:03,919][26022] Updated weights on worker 0-0, policy_version 43946 (0.00085) [2022-07-09 02:18:05,604][26022] Updated weights on worker 0-0, policy_version 43956 (0.00090) [2022-07-09 02:18:05,992][25689] Fps is (10 sec: 5497.3, 60 sec: 5679.1, 300 sec: 5671.3). Total num frames: 45012992. Throughput: 0: 5868.8. Samples: 45013634. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:05,994][25689] Avg episode reward: [(0, '-58.989')] [2022-07-09 02:18:07,515][26022] Updated weights on worker 0-0, policy_version 43966 (0.00097) [2022-07-09 02:18:09,136][26022] Updated weights on worker 0-0, policy_version 43976 (0.00087) [2022-07-09 02:18:11,015][26022] Updated weights on worker 0-0, policy_version 43986 (0.00081) [2022-07-09 02:18:11,109][25689] Fps is (10 sec: 5352.4, 60 sec: 5675.7, 300 sec: 5672.8). Total num frames: 45041664. Throughput: 0: 5831.2. Samples: 45047728. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:11,109][25689] Avg episode reward: [(0, '-59.701')] [2022-07-09 02:18:12,840][26022] Updated weights on worker 0-0, policy_version 43996 (0.00807) [2022-07-09 02:18:14,428][26022] Updated weights on worker 0-0, policy_version 44006 (0.00091) [2022-07-09 02:18:16,140][25689] Fps is (10 sec: 5650.5, 60 sec: 5641.5, 300 sec: 5668.9). Total num frames: 45070336. Throughput: 0: 4970.5. Samples: 45065292. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:16,141][25689] Avg episode reward: [(0, '-59.582')] [2022-07-09 02:18:16,403][26022] Updated weights on worker 0-0, policy_version 44016 (0.01051) [2022-07-09 02:18:18,117][26022] Updated weights on worker 0-0, policy_version 44026 (0.00081) [2022-07-09 02:18:18,727][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:18:18,741][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000044030_45086720.pth [2022-07-09 02:18:18,742][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000042035_43043840.pth [2022-07-09 02:18:19,955][26022] Updated weights on worker 0-0, policy_version 44036 (0.00087) [2022-07-09 02:18:21,182][25689] Fps is (10 sec: 5692.7, 60 sec: 5676.9, 300 sec: 5671.8). Total num frames: 45099008. Throughput: 0: 5844.5. Samples: 45099966. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:21,182][25689] Avg episode reward: [(0, '-59.973')] [2022-07-09 02:18:21,768][26022] Updated weights on worker 0-0, policy_version 44046 (0.00084) [2022-07-09 02:18:23,342][26022] Updated weights on worker 0-0, policy_version 44056 (0.00087) [2022-07-09 02:18:25,291][26022] Updated weights on worker 0-0, policy_version 44066 (0.00583) [2022-07-09 02:18:26,192][25689] Fps is (10 sec: 5908.6, 60 sec: 5683.5, 300 sec: 5677.3). Total num frames: 45129728. Throughput: 0: 5980.0. Samples: 45134432. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:26,192][25689] Avg episode reward: [(0, '-61.937')] [2022-07-09 02:18:26,984][26022] Updated weights on worker 0-0, policy_version 44076 (0.00086) [2022-07-09 02:18:28,850][26022] Updated weights on worker 0-0, policy_version 44086 (0.00093) [2022-07-09 02:18:30,903][26022] Updated weights on worker 0-0, policy_version 44096 (0.00094) [2022-07-09 02:18:31,265][25689] Fps is (10 sec: 5686.8, 60 sec: 5648.4, 300 sec: 5673.4). Total num frames: 45156352. Throughput: 0: 5133.4. Samples: 45151202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:31,266][25689] Avg episode reward: [(0, '-61.282')] [2022-07-09 02:18:32,455][26022] Updated weights on worker 0-0, policy_version 44106 (0.00085) [2022-07-09 02:18:34,275][26022] Updated weights on worker 0-0, policy_version 44116 (0.00620) [2022-07-09 02:18:36,019][26022] Updated weights on worker 0-0, policy_version 44126 (0.00083) [2022-07-09 02:18:36,322][25689] Fps is (10 sec: 5559.8, 60 sec: 5679.9, 300 sec: 5672.6). Total num frames: 45186048. Throughput: 0: 5956.0. Samples: 45185494. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:36,322][25689] Avg episode reward: [(0, '-60.654')] [2022-07-09 02:18:37,769][26022] Updated weights on worker 0-0, policy_version 44136 (0.00088) [2022-07-09 02:18:39,516][26022] Updated weights on worker 0-0, policy_version 44146 (0.00086) [2022-07-09 02:18:41,359][25689] Fps is (10 sec: 5782.8, 60 sec: 5681.5, 300 sec: 5668.6). Total num frames: 45214720. Throughput: 0: 5947.0. Samples: 45219958. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:41,359][25689] Avg episode reward: [(0, '-60.989')] [2022-07-09 02:18:41,475][26022] Updated weights on worker 0-0, policy_version 44156 (0.00082) [2022-07-09 02:18:43,064][26022] Updated weights on worker 0-0, policy_version 44166 (0.00090) [2022-07-09 02:18:44,975][26022] Updated weights on worker 0-0, policy_version 44176 (0.00085) [2022-07-09 02:18:46,412][25689] Fps is (10 sec: 5683.2, 60 sec: 5683.4, 300 sec: 5676.1). Total num frames: 45243392. Throughput: 0: 5083.4. Samples: 45237218. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:46,412][25689] Avg episode reward: [(0, '-60.421')] [2022-07-09 02:18:46,796][26022] Updated weights on worker 0-0, policy_version 44186 (0.00097) [2022-07-09 02:18:48,528][26022] Updated weights on worker 0-0, policy_version 44196 (0.00107) [2022-07-09 02:18:50,400][26022] Updated weights on worker 0-0, policy_version 44206 (0.00089) [2022-07-09 02:18:51,537][25689] Fps is (10 sec: 5734.4, 60 sec: 5676.1, 300 sec: 5674.4). Total num frames: 45273088. Throughput: 0: 5943.2. Samples: 45271682. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:51,538][25689] Avg episode reward: [(0, '-59.787')] [2022-07-09 02:18:52,268][26022] Updated weights on worker 0-0, policy_version 44216 (0.00101) [2022-07-09 02:18:54,050][26022] Updated weights on worker 0-0, policy_version 44226 (0.00082) [2022-07-09 02:18:55,772][26022] Updated weights on worker 0-0, policy_version 44236 (0.00087) [2022-07-09 02:18:56,543][25689] Fps is (10 sec: 5761.1, 60 sec: 5676.7, 300 sec: 5679.3). Total num frames: 45301760. Throughput: 0: 5955.4. Samples: 45305922. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-09 02:18:56,545][25689] Avg episode reward: [(0, '-57.843')] [2022-07-09 02:18:57,385][26022] Updated weights on worker 0-0, policy_version 44246 (0.00088) [2022-07-09 02:18:59,279][26022] Updated weights on worker 0-0, policy_version 44256 (0.00098) [2022-07-09 02:19:00,944][26022] Updated weights on worker 0-0, policy_version 44266 (0.00088) [2022-07-09 02:19:01,574][25689] Fps is (10 sec: 5713.3, 60 sec: 5660.2, 300 sec: 5686.0). Total num frames: 45330432. Throughput: 0: 5952.7. Samples: 45340296. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:01,575][25689] Avg episode reward: [(0, '-58.507')] [2022-07-09 02:19:03,226][26022] Updated weights on worker 0-0, policy_version 44276 (0.00086) [2022-07-09 02:19:05,199][26022] Updated weights on worker 0-0, policy_version 44286 (0.00085) [2022-07-09 02:19:06,591][25689] Fps is (10 sec: 5605.4, 60 sec: 5694.6, 300 sec: 5681.6). Total num frames: 45358080. Throughput: 0: 5877.4. Samples: 45355818. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:06,591][25689] Avg episode reward: [(0, '-58.858')] [2022-07-09 02:19:06,770][26022] Updated weights on worker 0-0, policy_version 44296 (0.00086) [2022-07-09 02:19:08,648][26022] Updated weights on worker 0-0, policy_version 44306 (0.00082) [2022-07-09 02:19:10,357][26022] Updated weights on worker 0-0, policy_version 44316 (0.00086) [2022-07-09 02:19:11,694][25689] Fps is (10 sec: 5565.6, 60 sec: 5695.9, 300 sec: 5683.1). Total num frames: 45386752. Throughput: 0: 5880.3. Samples: 45390208. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:11,694][25689] Avg episode reward: [(0, '-58.632')] [2022-07-09 02:19:12,076][26022] Updated weights on worker 0-0, policy_version 44326 (0.00084) [2022-07-09 02:19:13,981][26022] Updated weights on worker 0-0, policy_version 44336 (0.00101) [2022-07-09 02:19:15,668][26022] Updated weights on worker 0-0, policy_version 44346 (0.00626) [2022-07-09 02:19:16,700][25689] Fps is (10 sec: 5672.4, 60 sec: 5698.3, 300 sec: 5679.9). Total num frames: 45415424. Throughput: 0: 5900.5. Samples: 45424856. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:16,700][25689] Avg episode reward: [(0, '-58.627')] [2022-07-09 02:19:17,575][26022] Updated weights on worker 0-0, policy_version 44356 (0.00081) [2022-07-09 02:19:19,127][26022] Updated weights on worker 0-0, policy_version 44366 (0.00090) [2022-07-09 02:19:20,992][26022] Updated weights on worker 0-0, policy_version 44376 (0.00081) [2022-07-09 02:19:21,719][25689] Fps is (10 sec: 5822.1, 60 sec: 5717.3, 300 sec: 5686.6). Total num frames: 45445120. Throughput: 0: 5058.8. Samples: 45442204. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:21,719][25689] Avg episode reward: [(0, '-59.648')] [2022-07-09 02:19:22,628][26022] Updated weights on worker 0-0, policy_version 44386 (0.00082) [2022-07-09 02:19:24,506][26022] Updated weights on worker 0-0, policy_version 44396 (0.00088) [2022-07-09 02:19:26,340][26022] Updated weights on worker 0-0, policy_version 44406 (0.00090) [2022-07-09 02:19:26,783][25689] Fps is (10 sec: 5788.7, 60 sec: 5678.5, 300 sec: 5687.0). Total num frames: 45473792. Throughput: 0: 6007.0. Samples: 45477112. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:26,783][25689] Avg episode reward: [(0, '-58.657')] [2022-07-09 02:19:28,155][26022] Updated weights on worker 0-0, policy_version 44416 (0.00090) [2022-07-09 02:19:29,947][26022] Updated weights on worker 0-0, policy_version 44426 (0.00090) [2022-07-09 02:19:31,703][26022] Updated weights on worker 0-0, policy_version 44436 (0.00088) [2022-07-09 02:19:31,868][25689] Fps is (10 sec: 5650.3, 60 sec: 5711.2, 300 sec: 5682.5). Total num frames: 45502464. Throughput: 0: 6008.4. Samples: 45511422. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:31,868][25689] Avg episode reward: [(0, '-58.936')] [2022-07-09 02:19:33,418][26022] Updated weights on worker 0-0, policy_version 44446 (0.01040) [2022-07-09 02:19:35,364][26022] Updated weights on worker 0-0, policy_version 44456 (0.00095) [2022-07-09 02:19:36,876][25689] Fps is (10 sec: 5580.2, 60 sec: 5681.9, 300 sec: 5679.4). Total num frames: 45530112. Throughput: 0: 5116.9. Samples: 45528094. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:36,876][25689] Avg episode reward: [(0, '-59.098')] [2022-07-09 02:19:37,159][26022] Updated weights on worker 0-0, policy_version 44466 (0.00083) [2022-07-09 02:19:38,928][26022] Updated weights on worker 0-0, policy_version 44476 (0.00107) [2022-07-09 02:19:40,859][26022] Updated weights on worker 0-0, policy_version 44486 (0.00084) [2022-07-09 02:19:41,895][25689] Fps is (10 sec: 5718.7, 60 sec: 5700.5, 300 sec: 5683.3). Total num frames: 45559808. Throughput: 0: 5954.4. Samples: 45562342. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:41,896][25689] Avg episode reward: [(0, '-59.221')] [2022-07-09 02:19:42,607][26022] Updated weights on worker 0-0, policy_version 44496 (0.00103) [2022-07-09 02:19:44,271][26022] Updated weights on worker 0-0, policy_version 44506 (0.00098) [2022-07-09 02:19:46,283][26022] Updated weights on worker 0-0, policy_version 44516 (0.00082) [2022-07-09 02:19:46,898][25689] Fps is (10 sec: 5824.1, 60 sec: 5705.3, 300 sec: 5684.0). Total num frames: 45588480. Throughput: 0: 5940.7. Samples: 45596608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:46,898][25689] Avg episode reward: [(0, '-59.880')] [2022-07-09 02:19:48,038][26022] Updated weights on worker 0-0, policy_version 44526 (0.00081) [2022-07-09 02:19:49,871][26022] Updated weights on worker 0-0, policy_version 44536 (0.00087) [2022-07-09 02:19:51,669][26022] Updated weights on worker 0-0, policy_version 44546 (0.00086) [2022-07-09 02:19:51,982][25689] Fps is (10 sec: 5684.9, 60 sec: 5692.2, 300 sec: 5679.5). Total num frames: 45617152. Throughput: 0: 5085.6. Samples: 45613716. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 02:19:51,983][25689] Avg episode reward: [(0, '-59.412')] [2022-07-09 02:19:53,407][26022] Updated weights on worker 0-0, policy_version 44556 (0.00083) [2022-07-09 02:19:55,167][26022] Updated weights on worker 0-0, policy_version 44566 (0.00087) [2022-07-09 02:19:56,958][26022] Updated weights on worker 0-0, policy_version 44576 (0.00085) [2022-07-09 02:19:57,028][25689] Fps is (10 sec: 5660.5, 60 sec: 5688.4, 300 sec: 5682.2). Total num frames: 45645824. Throughput: 0: 5954.7. Samples: 45648096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:19:57,029][25689] Avg episode reward: [(0, '-60.616')] [2022-07-09 02:19:58,783][26022] Updated weights on worker 0-0, policy_version 44586 (0.00089) [2022-07-09 02:20:00,506][26022] Updated weights on worker 0-0, policy_version 44596 (0.00088) [2022-07-09 02:20:02,074][25689] Fps is (10 sec: 5479.2, 60 sec: 5653.2, 300 sec: 5675.0). Total num frames: 45672448. Throughput: 0: 5867.1. Samples: 45680734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:02,075][25689] Avg episode reward: [(0, '-60.398')] [2022-07-09 02:20:02,816][26022] Updated weights on worker 0-0, policy_version 44606 (0.00086) [2022-07-09 02:20:04,507][26022] Updated weights on worker 0-0, policy_version 44616 (0.00084) [2022-07-09 02:20:06,215][26022] Updated weights on worker 0-0, policy_version 44626 (0.00081) [2022-07-09 02:20:07,079][25689] Fps is (10 sec: 5501.8, 60 sec: 5671.2, 300 sec: 5676.9). Total num frames: 45701120. Throughput: 0: 4982.7. Samples: 45697164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:07,079][25689] Avg episode reward: [(0, '-60.196')] [2022-07-09 02:20:07,960][26022] Updated weights on worker 0-0, policy_version 44636 (0.00089) [2022-07-09 02:20:09,827][26022] Updated weights on worker 0-0, policy_version 44646 (0.00088) [2022-07-09 02:20:11,490][26022] Updated weights on worker 0-0, policy_version 44656 (0.00092) [2022-07-09 02:20:12,201][25689] Fps is (10 sec: 5763.7, 60 sec: 5686.3, 300 sec: 5682.1). Total num frames: 45730816. Throughput: 0: 5842.1. Samples: 45731836. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:12,202][25689] Avg episode reward: [(0, '-60.030')] [2022-07-09 02:20:13,387][26022] Updated weights on worker 0-0, policy_version 44666 (0.00089) [2022-07-09 02:20:15,163][26022] Updated weights on worker 0-0, policy_version 44676 (0.00091) [2022-07-09 02:20:16,867][26022] Updated weights on worker 0-0, policy_version 44686 (0.00090) [2022-07-09 02:20:17,205][25689] Fps is (10 sec: 5764.2, 60 sec: 5686.6, 300 sec: 5677.1). Total num frames: 45759488. Throughput: 0: 5875.7. Samples: 45766648. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:17,205][25689] Avg episode reward: [(0, '-59.562')] [2022-07-09 02:20:18,738][26022] Updated weights on worker 0-0, policy_version 44696 (0.00090) [2022-07-09 02:20:18,834][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:20:18,842][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000044697_45769728.pth [2022-07-09 02:20:18,842][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000042699_43723776.pth [2022-07-09 02:20:20,447][26022] Updated weights on worker 0-0, policy_version 44706 (0.00088) [2022-07-09 02:20:22,230][25689] Fps is (10 sec: 5717.6, 60 sec: 5669.0, 300 sec: 5677.6). Total num frames: 45788160. Throughput: 0: 5127.6. Samples: 45784086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:22,231][25689] Avg episode reward: [(0, '-59.232')] [2022-07-09 02:20:22,277][26022] Updated weights on worker 0-0, policy_version 44716 (0.00099) [2022-07-09 02:20:24,090][26022] Updated weights on worker 0-0, policy_version 44726 (0.00093) [2022-07-09 02:20:25,692][26022] Updated weights on worker 0-0, policy_version 44736 (0.00087) [2022-07-09 02:20:27,267][25689] Fps is (10 sec: 5699.0, 60 sec: 5671.6, 300 sec: 5678.3). Total num frames: 45816832. Throughput: 0: 6026.7. Samples: 45818832. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:27,267][25689] Avg episode reward: [(0, '-59.092')] [2022-07-09 02:20:27,572][26022] Updated weights on worker 0-0, policy_version 44746 (0.00083) [2022-07-09 02:20:29,147][26022] Updated weights on worker 0-0, policy_version 44756 (0.00080) [2022-07-09 02:20:31,159][26022] Updated weights on worker 0-0, policy_version 44766 (0.00085) [2022-07-09 02:20:32,349][25689] Fps is (10 sec: 5869.5, 60 sec: 5705.7, 300 sec: 5687.3). Total num frames: 45847552. Throughput: 0: 6027.9. Samples: 45853286. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:32,350][25689] Avg episode reward: [(0, '-58.788')] [2022-07-09 02:20:33,031][26022] Updated weights on worker 0-0, policy_version 44776 (0.00121) [2022-07-09 02:20:34,770][26022] Updated weights on worker 0-0, policy_version 44786 (0.00104) [2022-07-09 02:20:36,867][26022] Updated weights on worker 0-0, policy_version 44796 (0.00085) [2022-07-09 02:20:37,416][25689] Fps is (10 sec: 5750.7, 60 sec: 5700.1, 300 sec: 5682.9). Total num frames: 45875200. Throughput: 0: 5123.8. Samples: 45870210. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:37,417][25689] Avg episode reward: [(0, '-58.984')] [2022-07-09 02:20:38,255][26022] Updated weights on worker 0-0, policy_version 44806 (0.00085) [2022-07-09 02:20:40,336][26022] Updated weights on worker 0-0, policy_version 44816 (0.00094) [2022-07-09 02:20:42,093][26022] Updated weights on worker 0-0, policy_version 44826 (0.00082) [2022-07-09 02:20:42,430][25689] Fps is (10 sec: 5586.8, 60 sec: 5683.7, 300 sec: 5682.9). Total num frames: 45903872. Throughput: 0: 5942.1. Samples: 45904114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:42,431][25689] Avg episode reward: [(0, '-59.490')] [2022-07-09 02:20:43,778][26022] Updated weights on worker 0-0, policy_version 44836 (0.00090) [2022-07-09 02:20:45,696][26022] Updated weights on worker 0-0, policy_version 44846 (0.00084) [2022-07-09 02:20:47,262][26022] Updated weights on worker 0-0, policy_version 44856 (0.00085) [2022-07-09 02:20:47,479][25689] Fps is (10 sec: 5800.5, 60 sec: 5696.3, 300 sec: 5684.1). Total num frames: 45933568. Throughput: 0: 5912.8. Samples: 45938344. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 02:20:47,480][25689] Avg episode reward: [(0, '-59.524')] [2022-07-09 02:20:49,080][26022] Updated weights on worker 0-0, policy_version 44866 (0.00086) [2022-07-09 02:20:50,915][26022] Updated weights on worker 0-0, policy_version 44876 (0.00083) [2022-07-09 02:20:52,522][25689] Fps is (10 sec: 5682.3, 60 sec: 5683.3, 300 sec: 5687.2). Total num frames: 45961216. Throughput: 0: 5071.2. Samples: 45955586. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:20:52,522][25689] Avg episode reward: [(0, '-59.470')] [2022-07-09 02:20:52,743][26022] Updated weights on worker 0-0, policy_version 44886 (0.00087) [2022-07-09 02:20:54,575][26022] Updated weights on worker 0-0, policy_version 44896 (0.00089) [2022-07-09 02:20:56,328][26022] Updated weights on worker 0-0, policy_version 44906 (0.00088) [2022-07-09 02:20:57,555][25689] Fps is (10 sec: 5691.2, 60 sec: 5701.4, 300 sec: 5686.9). Total num frames: 45990912. Throughput: 0: 5945.0. Samples: 45989934. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:20:57,555][25689] Avg episode reward: [(0, '-59.159')] [2022-07-09 02:20:58,083][26022] Updated weights on worker 0-0, policy_version 44916 (0.00089) [2022-07-09 02:21:00,160][26022] Updated weights on worker 0-0, policy_version 44926 (0.00090) [2022-07-09 02:21:01,511][26022] Updated weights on worker 0-0, policy_version 44936 (0.00096) [2022-07-09 02:21:02,570][25689] Fps is (10 sec: 5604.8, 60 sec: 5704.3, 300 sec: 5683.3). Total num frames: 46017536. Throughput: 0: 5967.8. Samples: 46024306. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:02,571][25689] Avg episode reward: [(0, '-58.692')] [2022-07-09 02:21:03,996][26022] Updated weights on worker 0-0, policy_version 44946 (0.00081) [2022-07-09 02:21:05,518][26022] Updated weights on worker 0-0, policy_version 44956 (0.00079) [2022-07-09 02:21:07,571][26022] Updated weights on worker 0-0, policy_version 44966 (0.00097) [2022-07-09 02:21:07,575][25689] Fps is (10 sec: 5416.3, 60 sec: 5687.4, 300 sec: 5684.8). Total num frames: 46045184. Throughput: 0: 5021.8. Samples: 46039262. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:07,576][25689] Avg episode reward: [(0, '-59.408')] [2022-07-09 02:21:09,259][26022] Updated weights on worker 0-0, policy_version 44976 (0.00086) [2022-07-09 02:21:11,053][26022] Updated weights on worker 0-0, policy_version 44986 (0.00085) [2022-07-09 02:21:12,569][26022] Updated weights on worker 0-0, policy_version 44996 (0.00085) [2022-07-09 02:21:12,655][25689] Fps is (10 sec: 5788.0, 60 sec: 5708.3, 300 sec: 5687.3). Total num frames: 46075904. Throughput: 0: 5842.4. Samples: 46073210. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:12,656][25689] Avg episode reward: [(0, '-59.682')] [2022-07-09 02:21:14,842][26022] Updated weights on worker 0-0, policy_version 45006 (0.00078) [2022-07-09 02:21:16,125][26022] Updated weights on worker 0-0, policy_version 45016 (0.00090) [2022-07-09 02:21:17,659][25689] Fps is (10 sec: 5585.2, 60 sec: 5657.4, 300 sec: 5677.5). Total num frames: 46101504. Throughput: 0: 5869.1. Samples: 46107926. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:17,659][25689] Avg episode reward: [(0, '-60.396')] [2022-07-09 02:21:18,303][26022] Updated weights on worker 0-0, policy_version 45026 (0.00087) [2022-07-09 02:21:19,951][26022] Updated weights on worker 0-0, policy_version 45036 (0.00084) [2022-07-09 02:21:21,857][26022] Updated weights on worker 0-0, policy_version 45046 (0.00085) [2022-07-09 02:21:22,697][25689] Fps is (10 sec: 5608.4, 60 sec: 5690.2, 300 sec: 5683.8). Total num frames: 46132224. Throughput: 0: 5011.9. Samples: 46125178. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:22,698][25689] Avg episode reward: [(0, '-60.562')] [2022-07-09 02:21:23,516][26022] Updated weights on worker 0-0, policy_version 45056 (0.00089) [2022-07-09 02:21:25,340][26022] Updated weights on worker 0-0, policy_version 45066 (0.00082) [2022-07-09 02:21:26,967][26022] Updated weights on worker 0-0, policy_version 45076 (0.00085) [2022-07-09 02:21:27,731][25689] Fps is (10 sec: 5998.5, 60 sec: 5707.3, 300 sec: 5691.1). Total num frames: 46161920. Throughput: 0: 5989.7. Samples: 46159988. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:27,732][25689] Avg episode reward: [(0, '-60.856')] [2022-07-09 02:21:29,138][26022] Updated weights on worker 0-0, policy_version 45086 (0.00104) [2022-07-09 02:21:30,453][26022] Updated weights on worker 0-0, policy_version 45096 (0.00089) [2022-07-09 02:21:32,694][26022] Updated weights on worker 0-0, policy_version 45106 (0.00087) [2022-07-09 02:21:32,829][25689] Fps is (10 sec: 5659.9, 60 sec: 5655.1, 300 sec: 5686.1). Total num frames: 46189568. Throughput: 0: 5994.1. Samples: 46194134. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:32,830][25689] Avg episode reward: [(0, '-60.591')] [2022-07-09 02:21:34,195][26022] Updated weights on worker 0-0, policy_version 45116 (0.00097) [2022-07-09 02:21:36,134][26022] Updated weights on worker 0-0, policy_version 45126 (0.00104) [2022-07-09 02:21:37,849][25689] Fps is (10 sec: 5566.3, 60 sec: 5676.4, 300 sec: 5682.8). Total num frames: 46218240. Throughput: 0: 5119.8. Samples: 46211294. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:37,850][25689] Avg episode reward: [(0, '-60.702')] [2022-07-09 02:21:37,860][26022] Updated weights on worker 0-0, policy_version 45136 (0.00098) [2022-07-09 02:21:39,723][26022] Updated weights on worker 0-0, policy_version 45146 (0.00093) [2022-07-09 02:21:41,540][26022] Updated weights on worker 0-0, policy_version 45156 (0.00084) [2022-07-09 02:21:42,947][25689] Fps is (10 sec: 5667.1, 60 sec: 5668.5, 300 sec: 5681.6). Total num frames: 46246912. Throughput: 0: 5933.8. Samples: 46245340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:42,948][25689] Avg episode reward: [(0, '-60.173')] [2022-07-09 02:21:43,323][26022] Updated weights on worker 0-0, policy_version 45166 (0.00081) [2022-07-09 02:21:45,119][26022] Updated weights on worker 0-0, policy_version 45176 (0.00090) [2022-07-09 02:21:46,911][26022] Updated weights on worker 0-0, policy_version 45186 (0.00084) [2022-07-09 02:21:48,011][25689] Fps is (10 sec: 5643.1, 60 sec: 5650.2, 300 sec: 5674.8). Total num frames: 46275584. Throughput: 0: 5913.5. Samples: 46279914. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:21:48,012][25689] Avg episode reward: [(0, '-59.125')] [2022-07-09 02:21:48,508][26022] Updated weights on worker 0-0, policy_version 45196 (0.00087) [2022-07-09 02:21:50,454][26022] Updated weights on worker 0-0, policy_version 45206 (0.00083) [2022-07-09 02:21:52,063][26022] Updated weights on worker 0-0, policy_version 45216 (0.00084) [2022-07-09 02:21:53,100][25689] Fps is (10 sec: 5849.9, 60 sec: 5696.6, 300 sec: 5687.7). Total num frames: 46306304. Throughput: 0: 5927.7. Samples: 46314296. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:21:53,102][25689] Avg episode reward: [(0, '-59.534')] [2022-07-09 02:21:54,179][26022] Updated weights on worker 0-0, policy_version 45226 (0.00084) [2022-07-09 02:21:55,619][26022] Updated weights on worker 0-0, policy_version 45236 (0.00079) [2022-07-09 02:21:57,647][26022] Updated weights on worker 0-0, policy_version 45246 (0.00087) [2022-07-09 02:21:58,114][25689] Fps is (10 sec: 5777.1, 60 sec: 5664.6, 300 sec: 5677.7). Total num frames: 46333952. Throughput: 0: 5935.7. Samples: 46331582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:21:58,115][25689] Avg episode reward: [(0, '-60.067')] [2022-07-09 02:21:59,363][26022] Updated weights on worker 0-0, policy_version 45256 (0.00081) [2022-07-09 02:22:01,121][26022] Updated weights on worker 0-0, policy_version 45266 (0.00079) [2022-07-09 02:22:03,138][25689] Fps is (10 sec: 5508.8, 60 sec: 5680.7, 300 sec: 5685.3). Total num frames: 46361600. Throughput: 0: 5965.4. Samples: 46365784. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:03,138][25689] Avg episode reward: [(0, '-59.940')] [2022-07-09 02:22:03,234][26022] Updated weights on worker 0-0, policy_version 45276 (0.00086) [2022-07-09 02:22:05,079][26022] Updated weights on worker 0-0, policy_version 45286 (0.00086) [2022-07-09 02:22:06,863][26022] Updated weights on worker 0-0, policy_version 45296 (0.00097) [2022-07-09 02:22:08,198][25689] Fps is (10 sec: 5686.8, 60 sec: 5709.3, 300 sec: 5689.1). Total num frames: 46391296. Throughput: 0: 5869.3. Samples: 46398398. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:08,198][25689] Avg episode reward: [(0, '-59.338')] [2022-07-09 02:22:08,693][26022] Updated weights on worker 0-0, policy_version 45306 (0.00095) [2022-07-09 02:22:10,427][26022] Updated weights on worker 0-0, policy_version 45316 (0.00083) [2022-07-09 02:22:12,281][26022] Updated weights on worker 0-0, policy_version 45326 (0.00084) [2022-07-09 02:22:13,283][25689] Fps is (10 sec: 5753.5, 60 sec: 5675.0, 300 sec: 5681.1). Total num frames: 46419968. Throughput: 0: 5005.8. Samples: 46415326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:13,287][25689] Avg episode reward: [(0, '-59.406')] [2022-07-09 02:22:14,020][26022] Updated weights on worker 0-0, policy_version 45336 (0.00083) [2022-07-09 02:22:15,803][26022] Updated weights on worker 0-0, policy_version 45346 (0.00086) [2022-07-09 02:22:17,747][26022] Updated weights on worker 0-0, policy_version 45356 (0.00083) [2022-07-09 02:22:18,333][25689] Fps is (10 sec: 5758.6, 60 sec: 5738.2, 300 sec: 5691.6). Total num frames: 46449664. Throughput: 0: 5850.6. Samples: 46449878. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:18,334][25689] Avg episode reward: [(0, '-59.922')] [2022-07-09 02:22:18,896][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:22:18,912][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000045363_46451712.pth [2022-07-09 02:22:18,913][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000043364_44404736.pth [2022-07-09 02:22:19,229][26022] Updated weights on worker 0-0, policy_version 45366 (0.00089) [2022-07-09 02:22:21,198][26022] Updated weights on worker 0-0, policy_version 45376 (0.00081) [2022-07-09 02:22:22,909][26022] Updated weights on worker 0-0, policy_version 45386 (0.00085) [2022-07-09 02:22:23,389][25689] Fps is (10 sec: 5674.0, 60 sec: 5685.9, 300 sec: 5681.7). Total num frames: 46477312. Throughput: 0: 5861.1. Samples: 46484478. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:23,390][25689] Avg episode reward: [(0, '-58.419')] [2022-07-09 02:22:24,869][26022] Updated weights on worker 0-0, policy_version 45396 (0.00090) [2022-07-09 02:22:26,414][26022] Updated weights on worker 0-0, policy_version 45406 (0.00081) [2022-07-09 02:22:28,411][25689] Fps is (10 sec: 5486.9, 60 sec: 5653.2, 300 sec: 5679.0). Total num frames: 46504960. Throughput: 0: 5105.8. Samples: 46501604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:28,412][25689] Avg episode reward: [(0, '-58.331')] [2022-07-09 02:22:28,458][26022] Updated weights on worker 0-0, policy_version 45416 (0.00083) [2022-07-09 02:22:30,108][26022] Updated weights on worker 0-0, policy_version 45426 (0.00090) [2022-07-09 02:22:31,997][26022] Updated weights on worker 0-0, policy_version 45436 (0.00082) [2022-07-09 02:22:33,534][25689] Fps is (10 sec: 5753.2, 60 sec: 5701.5, 300 sec: 5687.6). Total num frames: 46535680. Throughput: 0: 5951.1. Samples: 46535846. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:33,535][25689] Avg episode reward: [(0, '-58.936')] [2022-07-09 02:22:33,848][26022] Updated weights on worker 0-0, policy_version 45446 (0.00092) [2022-07-09 02:22:35,443][26022] Updated weights on worker 0-0, policy_version 45456 (0.00085) [2022-07-09 02:22:37,522][26022] Updated weights on worker 0-0, policy_version 45466 (0.00084) [2022-07-09 02:22:38,557][25689] Fps is (10 sec: 5954.5, 60 sec: 5718.1, 300 sec: 5691.6). Total num frames: 46565376. Throughput: 0: 5950.3. Samples: 46570218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:38,559][25689] Avg episode reward: [(0, '-59.918')] [2022-07-09 02:22:39,199][26022] Updated weights on worker 0-0, policy_version 45476 (0.00085) [2022-07-09 02:22:41,074][26022] Updated weights on worker 0-0, policy_version 45486 (0.00089) [2022-07-09 02:22:42,642][26022] Updated weights on worker 0-0, policy_version 45496 (0.00092) [2022-07-09 02:22:43,638][25689] Fps is (10 sec: 5574.1, 60 sec: 5686.0, 300 sec: 5684.6). Total num frames: 46592000. Throughput: 0: 5057.8. Samples: 46586894. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 02:22:43,639][25689] Avg episode reward: [(0, '-59.961')] [2022-07-09 02:22:44,633][26022] Updated weights on worker 0-0, policy_version 45506 (0.00081) [2022-07-09 02:22:46,432][26022] Updated weights on worker 0-0, policy_version 45516 (0.00092) [2022-07-09 02:22:48,099][26022] Updated weights on worker 0-0, policy_version 45526 (0.00094) [2022-07-09 02:22:48,680][25689] Fps is (10 sec: 5563.6, 60 sec: 5704.9, 300 sec: 5684.7). Total num frames: 46621696. Throughput: 0: 5908.7. Samples: 46621368. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:22:48,681][25689] Avg episode reward: [(0, '-60.125')] [2022-07-09 02:22:50,046][26022] Updated weights on worker 0-0, policy_version 45536 (0.00088) [2022-07-09 02:22:51,986][26022] Updated weights on worker 0-0, policy_version 45546 (0.00093) [2022-07-09 02:22:53,547][26022] Updated weights on worker 0-0, policy_version 45556 (0.00085) [2022-07-09 02:22:53,743][25689] Fps is (10 sec: 5674.9, 60 sec: 5656.7, 300 sec: 5680.3). Total num frames: 46649344. Throughput: 0: 5928.1. Samples: 46655646. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:22:53,744][25689] Avg episode reward: [(0, '-60.389')] [2022-07-09 02:22:55,408][26022] Updated weights on worker 0-0, policy_version 45566 (0.00089) [2022-07-09 02:22:57,224][26022] Updated weights on worker 0-0, policy_version 45576 (0.00096) [2022-07-09 02:22:58,747][25689] Fps is (10 sec: 5594.9, 60 sec: 5674.6, 300 sec: 5677.5). Total num frames: 46678016. Throughput: 0: 5079.0. Samples: 46672758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:22:58,747][25689] Avg episode reward: [(0, '-59.419')] [2022-07-09 02:22:58,986][26022] Updated weights on worker 0-0, policy_version 45586 (0.00093) [2022-07-09 02:23:00,738][26022] Updated weights on worker 0-0, policy_version 45596 (0.00093) [2022-07-09 02:23:02,966][26022] Updated weights on worker 0-0, policy_version 45606 (0.00089) [2022-07-09 02:23:03,801][25689] Fps is (10 sec: 5497.8, 60 sec: 5654.8, 300 sec: 5680.3). Total num frames: 46704640. Throughput: 0: 5897.9. Samples: 46705812. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:03,802][25689] Avg episode reward: [(0, '-59.414')] [2022-07-09 02:23:04,560][26022] Updated weights on worker 0-0, policy_version 45616 (0.00077) [2022-07-09 02:23:06,346][26022] Updated weights on worker 0-0, policy_version 45626 (0.00081) [2022-07-09 02:23:08,200][26022] Updated weights on worker 0-0, policy_version 45636 (0.00090) [2022-07-09 02:23:08,838][25689] Fps is (10 sec: 5479.6, 60 sec: 5640.1, 300 sec: 5681.8). Total num frames: 46733312. Throughput: 0: 5861.7. Samples: 46739524. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:08,838][25689] Avg episode reward: [(0, '-59.023')] [2022-07-09 02:23:10,075][26022] Updated weights on worker 0-0, policy_version 45646 (0.00080) [2022-07-09 02:23:11,828][26022] Updated weights on worker 0-0, policy_version 45656 (0.00083) [2022-07-09 02:23:13,617][26022] Updated weights on worker 0-0, policy_version 45666 (0.00088) [2022-07-09 02:23:13,957][25689] Fps is (10 sec: 5848.2, 60 sec: 5670.7, 300 sec: 5687.0). Total num frames: 46764032. Throughput: 0: 4988.0. Samples: 46756470. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:13,957][25689] Avg episode reward: [(0, '-59.057')] [2022-07-09 02:23:15,502][26022] Updated weights on worker 0-0, policy_version 45676 (0.00080) [2022-07-09 02:23:17,189][26022] Updated weights on worker 0-0, policy_version 45686 (0.00086) [2022-07-09 02:23:19,038][25689] Fps is (10 sec: 5722.1, 60 sec: 5634.1, 300 sec: 5682.8). Total num frames: 46791680. Throughput: 0: 5834.3. Samples: 46791146. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:19,039][25689] Avg episode reward: [(0, '-58.364')] [2022-07-09 02:23:19,083][26022] Updated weights on worker 0-0, policy_version 45696 (0.00086) [2022-07-09 02:23:20,719][26022] Updated weights on worker 0-0, policy_version 45706 (0.00084) [2022-07-09 02:23:22,516][26022] Updated weights on worker 0-0, policy_version 45716 (0.00089) [2022-07-09 02:23:24,140][25689] Fps is (10 sec: 5731.7, 60 sec: 5680.4, 300 sec: 5681.1). Total num frames: 46822400. Throughput: 0: 5899.4. Samples: 46825800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:24,140][25689] Avg episode reward: [(0, '-57.784')] [2022-07-09 02:23:24,190][26022] Updated weights on worker 0-0, policy_version 45726 (0.00086) [2022-07-09 02:23:26,189][26022] Updated weights on worker 0-0, policy_version 45736 (0.00082) [2022-07-09 02:23:27,870][26022] Updated weights on worker 0-0, policy_version 45746 (0.00090) [2022-07-09 02:23:29,148][25689] Fps is (10 sec: 5672.0, 60 sec: 5664.8, 300 sec: 5682.3). Total num frames: 46849024. Throughput: 0: 5925.4. Samples: 46859872. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:29,150][25689] Avg episode reward: [(0, '-58.943')] [2022-07-09 02:23:29,598][26022] Updated weights on worker 0-0, policy_version 45756 (0.00081) [2022-07-09 02:23:31,639][26022] Updated weights on worker 0-0, policy_version 45766 (0.00091) [2022-07-09 02:23:33,324][26022] Updated weights on worker 0-0, policy_version 45776 (0.00082) [2022-07-09 02:23:34,182][25689] Fps is (10 sec: 5608.1, 60 sec: 5656.2, 300 sec: 5682.8). Total num frames: 46878720. Throughput: 0: 5952.6. Samples: 46876868. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:34,183][25689] Avg episode reward: [(0, '-60.110')] [2022-07-09 02:23:35,108][26022] Updated weights on worker 0-0, policy_version 45786 (0.00089) [2022-07-09 02:23:36,828][26022] Updated weights on worker 0-0, policy_version 45796 (0.00089) [2022-07-09 02:23:38,549][26022] Updated weights on worker 0-0, policy_version 45806 (0.00077) [2022-07-09 02:23:39,188][25689] Fps is (10 sec: 5915.6, 60 sec: 5657.9, 300 sec: 5686.8). Total num frames: 46908416. Throughput: 0: 5979.7. Samples: 46911636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:39,189][25689] Avg episode reward: [(0, '-60.443')] [2022-07-09 02:23:40,636][26022] Updated weights on worker 0-0, policy_version 45816 (0.00084) [2022-07-09 02:23:42,202][26022] Updated weights on worker 0-0, policy_version 45826 (0.00084) [2022-07-09 02:23:44,052][26022] Updated weights on worker 0-0, policy_version 45836 (0.00544) [2022-07-09 02:23:44,203][25689] Fps is (10 sec: 5825.1, 60 sec: 5697.9, 300 sec: 5687.5). Total num frames: 46937088. Throughput: 0: 5985.9. Samples: 46945894. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:23:44,203][25689] Avg episode reward: [(0, '-59.745')] [2022-07-09 02:23:46,011][26022] Updated weights on worker 0-0, policy_version 45846 (0.00088) [2022-07-09 02:23:47,497][26022] Updated weights on worker 0-0, policy_version 45856 (0.00093) [2022-07-09 02:23:49,235][25689] Fps is (10 sec: 5605.7, 60 sec: 5665.0, 300 sec: 5682.4). Total num frames: 46964736. Throughput: 0: 5136.5. Samples: 46963048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:23:49,235][25689] Avg episode reward: [(0, '-59.735')] [2022-07-09 02:23:49,408][26022] Updated weights on worker 0-0, policy_version 45866 (0.00089) [2022-07-09 02:23:50,893][26022] Updated weights on worker 0-0, policy_version 45876 (0.00094) [2022-07-09 02:23:53,235][26022] Updated weights on worker 0-0, policy_version 45886 (0.00083) [2022-07-09 02:23:54,284][25689] Fps is (10 sec: 5688.2, 60 sec: 5700.1, 300 sec: 5685.1). Total num frames: 46994432. Throughput: 0: 5991.1. Samples: 46997298. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:23:54,284][25689] Avg episode reward: [(0, '-59.590')] [2022-07-09 02:23:54,535][26022] Updated weights on worker 0-0, policy_version 45896 (0.00088) [2022-07-09 02:23:56,692][26022] Updated weights on worker 0-0, policy_version 45906 (0.00094) [2022-07-09 02:23:58,255][26022] Updated weights on worker 0-0, policy_version 45916 (0.00086) [2022-07-09 02:23:59,312][25689] Fps is (10 sec: 5690.8, 60 sec: 5680.9, 300 sec: 5681.7). Total num frames: 47022080. Throughput: 0: 5978.7. Samples: 47031948. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:23:59,312][25689] Avg episode reward: [(0, '-59.273')] [2022-07-09 02:24:00,007][26022] Updated weights on worker 0-0, policy_version 45926 (0.00089) [2022-07-09 02:24:02,302][26022] Updated weights on worker 0-0, policy_version 45936 (0.00085) [2022-07-09 02:24:04,050][26022] Updated weights on worker 0-0, policy_version 45946 (0.00088) [2022-07-09 02:24:04,323][25689] Fps is (10 sec: 5508.2, 60 sec: 5701.9, 300 sec: 5681.8). Total num frames: 47049728. Throughput: 0: 5037.6. Samples: 47047250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:04,323][25689] Avg episode reward: [(0, '-57.852')] [2022-07-09 02:24:06,047][26022] Updated weights on worker 0-0, policy_version 45956 (0.00624) [2022-07-09 02:24:07,792][26022] Updated weights on worker 0-0, policy_version 45966 (0.00092) [2022-07-09 02:24:09,323][26022] Updated weights on worker 0-0, policy_version 45976 (0.00086) [2022-07-09 02:24:09,373][25689] Fps is (10 sec: 5699.1, 60 sec: 5717.5, 300 sec: 5686.2). Total num frames: 47079424. Throughput: 0: 5883.6. Samples: 47081536. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:09,374][25689] Avg episode reward: [(0, '-57.912')] [2022-07-09 02:24:11,483][26022] Updated weights on worker 0-0, policy_version 45986 (0.00080) [2022-07-09 02:24:12,859][26022] Updated weights on worker 0-0, policy_version 45996 (0.00088) [2022-07-09 02:24:14,469][25689] Fps is (10 sec: 5651.6, 60 sec: 5668.9, 300 sec: 5681.1). Total num frames: 47107072. Throughput: 0: 5873.8. Samples: 47115862. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:14,471][25689] Avg episode reward: [(0, '-58.263')] [2022-07-09 02:24:14,792][26022] Updated weights on worker 0-0, policy_version 46006 (0.00094) [2022-07-09 02:24:16,448][26022] Updated weights on worker 0-0, policy_version 46016 (0.00100) [2022-07-09 02:24:18,391][26022] Updated weights on worker 0-0, policy_version 46026 (0.00080) [2022-07-09 02:24:19,029][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:24:19,049][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000046030_47134720.pth [2022-07-09 02:24:19,049][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000044030_45086720.pth [2022-07-09 02:24:19,484][25689] Fps is (10 sec: 5874.5, 60 sec: 5743.0, 300 sec: 5688.0). Total num frames: 47138816. Throughput: 0: 5016.4. Samples: 47133138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:19,484][25689] Avg episode reward: [(0, '-58.994')] [2022-07-09 02:24:20,151][26022] Updated weights on worker 0-0, policy_version 46036 (0.00079) [2022-07-09 02:24:21,948][26022] Updated weights on worker 0-0, policy_version 46046 (0.00079) [2022-07-09 02:24:23,583][26022] Updated weights on worker 0-0, policy_version 46056 (0.00086) [2022-07-09 02:24:24,503][25689] Fps is (10 sec: 5715.2, 60 sec: 5666.0, 300 sec: 5678.6). Total num frames: 47164416. Throughput: 0: 5961.3. Samples: 47167550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:24,503][25689] Avg episode reward: [(0, '-58.915')] [2022-07-09 02:24:25,332][26022] Updated weights on worker 0-0, policy_version 46066 (0.00086) [2022-07-09 02:24:27,272][26022] Updated weights on worker 0-0, policy_version 46076 (0.00084) [2022-07-09 02:24:29,073][26022] Updated weights on worker 0-0, policy_version 46086 (0.00050) [2022-07-09 02:24:29,517][25689] Fps is (10 sec: 5613.5, 60 sec: 5733.3, 300 sec: 5686.8). Total num frames: 47195136. Throughput: 0: 5993.5. Samples: 47202264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:29,517][25689] Avg episode reward: [(0, '-58.757')] [2022-07-09 02:24:30,996][26022] Updated weights on worker 0-0, policy_version 46096 (0.00089) [2022-07-09 02:24:32,498][26022] Updated weights on worker 0-0, policy_version 46106 (0.00080) [2022-07-09 02:24:34,405][26022] Updated weights on worker 0-0, policy_version 46116 (0.00081) [2022-07-09 02:24:34,564][25689] Fps is (10 sec: 5801.2, 60 sec: 5698.2, 300 sec: 5686.1). Total num frames: 47222784. Throughput: 0: 5157.0. Samples: 47219492. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:34,565][25689] Avg episode reward: [(0, '-58.868')] [2022-07-09 02:24:36,052][26022] Updated weights on worker 0-0, policy_version 46126 (0.00086) [2022-07-09 02:24:38,010][26022] Updated weights on worker 0-0, policy_version 46136 (0.00086) [2022-07-09 02:24:39,577][25689] Fps is (10 sec: 5699.9, 60 sec: 5697.5, 300 sec: 5686.2). Total num frames: 47252480. Throughput: 0: 6019.1. Samples: 47254082. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 02:24:39,578][25689] Avg episode reward: [(0, '-58.663')] [2022-07-09 02:24:39,666][26022] Updated weights on worker 0-0, policy_version 46146 (0.00083) [2022-07-09 02:24:41,709][26022] Updated weights on worker 0-0, policy_version 46156 (0.00081) [2022-07-09 02:24:43,178][26022] Updated weights on worker 0-0, policy_version 46166 (0.00086) [2022-07-09 02:24:44,598][25689] Fps is (10 sec: 5817.4, 60 sec: 5696.9, 300 sec: 5685.8). Total num frames: 47281152. Throughput: 0: 6017.9. Samples: 47288478. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:24:44,598][25689] Avg episode reward: [(0, '-59.037')] [2022-07-09 02:24:45,263][26022] Updated weights on worker 0-0, policy_version 46176 (0.00091) [2022-07-09 02:24:46,803][26022] Updated weights on worker 0-0, policy_version 46186 (0.00093) [2022-07-09 02:24:48,673][26022] Updated weights on worker 0-0, policy_version 46196 (0.00085) [2022-07-09 02:24:49,628][25689] Fps is (10 sec: 5705.1, 60 sec: 5714.0, 300 sec: 5686.9). Total num frames: 47309824. Throughput: 0: 5146.9. Samples: 47305778. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:24:49,629][25689] Avg episode reward: [(0, '-57.290')] [2022-07-09 02:24:50,371][26022] Updated weights on worker 0-0, policy_version 46206 (0.00087) [2022-07-09 02:24:52,239][26022] Updated weights on worker 0-0, policy_version 46216 (0.00087) [2022-07-09 02:24:54,079][26022] Updated weights on worker 0-0, policy_version 46226 (0.00089) [2022-07-09 02:24:54,659][25689] Fps is (10 sec: 5699.1, 60 sec: 5698.7, 300 sec: 5687.1). Total num frames: 47338496. Throughput: 0: 6014.6. Samples: 47340356. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:24:54,660][25689] Avg episode reward: [(0, '-57.463')] [2022-07-09 02:24:55,708][26022] Updated weights on worker 0-0, policy_version 46236 (0.00086) [2022-07-09 02:24:57,679][26022] Updated weights on worker 0-0, policy_version 46246 (0.00090) [2022-07-09 02:24:59,348][26022] Updated weights on worker 0-0, policy_version 46256 (0.00090) [2022-07-09 02:24:59,666][25689] Fps is (10 sec: 5713.0, 60 sec: 5717.7, 300 sec: 5694.8). Total num frames: 47367168. Throughput: 0: 5990.3. Samples: 47374420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:24:59,666][25689] Avg episode reward: [(0, '-57.568')] [2022-07-09 02:25:01,268][26022] Updated weights on worker 0-0, policy_version 46266 (0.00089) [2022-07-09 02:25:03,307][26022] Updated weights on worker 0-0, policy_version 46276 (0.00093) [2022-07-09 02:25:04,675][25689] Fps is (10 sec: 5419.0, 60 sec: 5684.0, 300 sec: 5684.4). Total num frames: 47392768. Throughput: 0: 5032.7. Samples: 47389520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:04,675][25689] Avg episode reward: [(0, '-56.801')] [2022-07-09 02:25:05,339][26022] Updated weights on worker 0-0, policy_version 46286 (0.00082) [2022-07-09 02:25:06,878][26022] Updated weights on worker 0-0, policy_version 46296 (0.00087) [2022-07-09 02:25:08,836][26022] Updated weights on worker 0-0, policy_version 46306 (0.00087) [2022-07-09 02:25:09,681][25689] Fps is (10 sec: 5623.4, 60 sec: 5705.1, 300 sec: 5690.0). Total num frames: 47423488. Throughput: 0: 5891.9. Samples: 47423928. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:09,683][25689] Avg episode reward: [(0, '-56.057')] [2022-07-09 02:25:10,387][26022] Updated weights on worker 0-0, policy_version 46316 (0.00084) [2022-07-09 02:25:12,406][26022] Updated weights on worker 0-0, policy_version 46326 (0.00084) [2022-07-09 02:25:14,164][26022] Updated weights on worker 0-0, policy_version 46336 (0.00083) [2022-07-09 02:25:14,782][25689] Fps is (10 sec: 5673.4, 60 sec: 5687.7, 300 sec: 5681.3). Total num frames: 47450112. Throughput: 0: 5857.3. Samples: 47458220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:14,783][25689] Avg episode reward: [(0, '-56.070')] [2022-07-09 02:25:15,714][26022] Updated weights on worker 0-0, policy_version 46346 (0.00093) [2022-07-09 02:25:17,701][26022] Updated weights on worker 0-0, policy_version 46356 (0.00086) [2022-07-09 02:25:19,379][26022] Updated weights on worker 0-0, policy_version 46366 (0.00083) [2022-07-09 02:25:19,799][25689] Fps is (10 sec: 5667.8, 60 sec: 5670.5, 300 sec: 5688.3). Total num frames: 47480832. Throughput: 0: 5036.6. Samples: 47475822. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:19,799][25689] Avg episode reward: [(0, '-56.977')] [2022-07-09 02:25:21,315][26022] Updated weights on worker 0-0, policy_version 46376 (0.00793) [2022-07-09 02:25:23,038][26022] Updated weights on worker 0-0, policy_version 46386 (0.00082) [2022-07-09 02:25:24,723][26022] Updated weights on worker 0-0, policy_version 46396 (0.00089) [2022-07-09 02:25:24,812][25689] Fps is (10 sec: 5921.3, 60 sec: 5721.9, 300 sec: 5688.8). Total num frames: 47509504. Throughput: 0: 6007.5. Samples: 47510498. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:24,813][25689] Avg episode reward: [(0, '-55.980')] [2022-07-09 02:25:26,599][26022] Updated weights on worker 0-0, policy_version 46406 (0.01304) [2022-07-09 02:25:28,388][26022] Updated weights on worker 0-0, policy_version 46416 (0.00094) [2022-07-09 02:25:29,847][25689] Fps is (10 sec: 5706.8, 60 sec: 5686.0, 300 sec: 5682.8). Total num frames: 47538176. Throughput: 0: 5986.8. Samples: 47544656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:29,849][25689] Avg episode reward: [(0, '-55.512')] [2022-07-09 02:25:30,068][26022] Updated weights on worker 0-0, policy_version 46426 (0.00084) [2022-07-09 02:25:32,008][26022] Updated weights on worker 0-0, policy_version 46436 (0.00085) [2022-07-09 02:25:33,748][26022] Updated weights on worker 0-0, policy_version 46446 (0.00085) [2022-07-09 02:25:35,007][25689] Fps is (10 sec: 5625.1, 60 sec: 5692.4, 300 sec: 5684.5). Total num frames: 47566848. Throughput: 0: 5108.7. Samples: 47561540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:35,007][25689] Avg episode reward: [(0, '-56.127')] [2022-07-09 02:25:35,721][26022] Updated weights on worker 0-0, policy_version 46456 (0.00082) [2022-07-09 02:25:37,495][26022] Updated weights on worker 0-0, policy_version 46466 (0.00093) [2022-07-09 02:25:39,210][26022] Updated weights on worker 0-0, policy_version 46476 (0.00088) [2022-07-09 02:25:40,068][25689] Fps is (10 sec: 5610.3, 60 sec: 5670.9, 300 sec: 5683.6). Total num frames: 47595520. Throughput: 0: 5899.4. Samples: 47595400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 25.0) [2022-07-09 02:25:40,071][25689] Avg episode reward: [(0, '-56.492')] [2022-07-09 02:25:41,044][26022] Updated weights on worker 0-0, policy_version 46486 (0.00079) [2022-07-09 02:25:42,903][26022] Updated weights on worker 0-0, policy_version 46496 (0.00095) [2022-07-09 02:25:44,772][26022] Updated weights on worker 0-0, policy_version 46506 (0.00085) [2022-07-09 02:25:45,103][25689] Fps is (10 sec: 5679.7, 60 sec: 5669.6, 300 sec: 5680.4). Total num frames: 47624192. Throughput: 0: 5847.8. Samples: 47629152. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:25:45,105][25689] Avg episode reward: [(0, '-56.648')] [2022-07-09 02:25:46,408][26022] Updated weights on worker 0-0, policy_version 46516 (0.00093) [2022-07-09 02:25:48,268][26022] Updated weights on worker 0-0, policy_version 46526 (0.00087) [2022-07-09 02:25:50,117][25689] Fps is (10 sec: 5604.8, 60 sec: 5654.2, 300 sec: 5681.0). Total num frames: 47651840. Throughput: 0: 5025.1. Samples: 47646514. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:25:50,117][25689] Avg episode reward: [(0, '-56.861')] [2022-07-09 02:25:50,135][26022] Updated weights on worker 0-0, policy_version 46536 (0.00088) [2022-07-09 02:25:51,845][26022] Updated weights on worker 0-0, policy_version 46546 (0.00080) [2022-07-09 02:25:53,658][26022] Updated weights on worker 0-0, policy_version 46556 (0.00092) [2022-07-09 02:25:55,164][25689] Fps is (10 sec: 5801.4, 60 sec: 5686.6, 300 sec: 5684.1). Total num frames: 47682560. Throughput: 0: 5906.7. Samples: 47680604. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:25:55,164][25689] Avg episode reward: [(0, '-57.723')] [2022-07-09 02:25:55,343][26022] Updated weights on worker 0-0, policy_version 46566 (0.00087) [2022-07-09 02:25:57,466][26022] Updated weights on worker 0-0, policy_version 46576 (0.00087) [2022-07-09 02:25:59,013][26022] Updated weights on worker 0-0, policy_version 46586 (0.00083) [2022-07-09 02:26:00,219][25689] Fps is (10 sec: 5676.4, 60 sec: 5648.2, 300 sec: 5683.4). Total num frames: 47709184. Throughput: 0: 5920.6. Samples: 47714704. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:00,219][25689] Avg episode reward: [(0, '-57.663')] [2022-07-09 02:26:00,971][26022] Updated weights on worker 0-0, policy_version 46596 (0.00079) [2022-07-09 02:26:03,117][26022] Updated weights on worker 0-0, policy_version 46606 (0.00082) [2022-07-09 02:26:04,801][26022] Updated weights on worker 0-0, policy_version 46616 (0.00086) [2022-07-09 02:26:05,271][25689] Fps is (10 sec: 5369.8, 60 sec: 5678.0, 300 sec: 5682.5). Total num frames: 47736832. Throughput: 0: 5034.5. Samples: 47730688. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:05,271][25689] Avg episode reward: [(0, '-57.088')] [2022-07-09 02:26:06,551][26022] Updated weights on worker 0-0, policy_version 46626 (0.00083) [2022-07-09 02:26:08,335][26022] Updated weights on worker 0-0, policy_version 46636 (0.00084) [2022-07-09 02:26:10,206][26022] Updated weights on worker 0-0, policy_version 46646 (0.00085) [2022-07-09 02:26:10,307][25689] Fps is (10 sec: 5583.0, 60 sec: 5641.5, 300 sec: 5676.4). Total num frames: 47765504. Throughput: 0: 5837.1. Samples: 47764364. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:10,307][25689] Avg episode reward: [(0, '-56.544')] [2022-07-09 02:26:11,946][26022] Updated weights on worker 0-0, policy_version 46656 (0.00088) [2022-07-09 02:26:13,823][26022] Updated weights on worker 0-0, policy_version 46666 (0.00095) [2022-07-09 02:26:15,414][25689] Fps is (10 sec: 5754.4, 60 sec: 5691.5, 300 sec: 5688.2). Total num frames: 47795200. Throughput: 0: 5837.4. Samples: 47798812. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:15,415][25689] Avg episode reward: [(0, '-57.364')] [2022-07-09 02:26:15,577][26022] Updated weights on worker 0-0, policy_version 46676 (0.00081) [2022-07-09 02:26:17,346][26022] Updated weights on worker 0-0, policy_version 46686 (0.00086) [2022-07-09 02:26:19,067][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:26:19,074][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000046696_47816704.pth [2022-07-09 02:26:19,074][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000044697_45769728.pth [2022-07-09 02:26:19,086][26022] Updated weights on worker 0-0, policy_version 46696 (0.00076) [2022-07-09 02:26:20,415][25689] Fps is (10 sec: 5774.0, 60 sec: 5659.2, 300 sec: 5682.0). Total num frames: 47823872. Throughput: 0: 5875.6. Samples: 47833370. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:20,416][25689] Avg episode reward: [(0, '-56.983')] [2022-07-09 02:26:20,860][26022] Updated weights on worker 0-0, policy_version 46706 (0.00088) [2022-07-09 02:26:22,669][26022] Updated weights on worker 0-0, policy_version 46716 (0.00085) [2022-07-09 02:26:24,408][26022] Updated weights on worker 0-0, policy_version 46726 (0.00083) [2022-07-09 02:26:25,447][25689] Fps is (10 sec: 5817.6, 60 sec: 5674.4, 300 sec: 5682.1). Total num frames: 47853568. Throughput: 0: 5949.6. Samples: 47850728. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:25,448][25689] Avg episode reward: [(0, '-57.058')] [2022-07-09 02:26:26,223][26022] Updated weights on worker 0-0, policy_version 46736 (0.00086) [2022-07-09 02:26:28,131][26022] Updated weights on worker 0-0, policy_version 46746 (0.00092) [2022-07-09 02:26:29,716][26022] Updated weights on worker 0-0, policy_version 46756 (0.00083) [2022-07-09 02:26:30,479][25689] Fps is (10 sec: 5698.1, 60 sec: 5657.7, 300 sec: 5683.3). Total num frames: 47881216. Throughput: 0: 5989.9. Samples: 47885196. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:30,480][25689] Avg episode reward: [(0, '-57.996')] [2022-07-09 02:26:31,619][26022] Updated weights on worker 0-0, policy_version 46766 (0.00083) [2022-07-09 02:26:33,492][26022] Updated weights on worker 0-0, policy_version 46776 (0.00091) [2022-07-09 02:26:35,035][26022] Updated weights on worker 0-0, policy_version 46786 (0.00081) [2022-07-09 02:26:35,550][25689] Fps is (10 sec: 5675.9, 60 sec: 5682.9, 300 sec: 5685.8). Total num frames: 47910912. Throughput: 0: 5998.5. Samples: 47919598. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 02:26:35,551][25689] Avg episode reward: [(0, '-58.326')] [2022-07-09 02:26:36,972][26022] Updated weights on worker 0-0, policy_version 46796 (0.00106) [2022-07-09 02:26:38,638][26022] Updated weights on worker 0-0, policy_version 46806 (0.00110) [2022-07-09 02:26:40,577][25689] Fps is (10 sec: 5780.1, 60 sec: 5686.2, 300 sec: 5687.2). Total num frames: 47939584. Throughput: 0: 5139.7. Samples: 47936996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:26:40,578][25689] Avg episode reward: [(0, '-58.772')] [2022-07-09 02:26:40,596][26022] Updated weights on worker 0-0, policy_version 46816 (0.00093) [2022-07-09 02:26:42,428][26022] Updated weights on worker 0-0, policy_version 46826 (0.00089) [2022-07-09 02:26:44,035][26022] Updated weights on worker 0-0, policy_version 46836 (0.00085) [2022-07-09 02:26:45,611][25689] Fps is (10 sec: 5598.1, 60 sec: 5669.4, 300 sec: 5684.3). Total num frames: 47967232. Throughput: 0: 5986.3. Samples: 47971432. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:26:45,611][25689] Avg episode reward: [(0, '-58.040')] [2022-07-09 02:26:45,825][26022] Updated weights on worker 0-0, policy_version 46846 (0.00084) [2022-07-09 02:26:47,496][26022] Updated weights on worker 0-0, policy_version 46856 (0.00088) [2022-07-09 02:26:49,419][26022] Updated weights on worker 0-0, policy_version 46866 (0.00088) [2022-07-09 02:26:50,639][25689] Fps is (10 sec: 5699.3, 60 sec: 5701.9, 300 sec: 5682.0). Total num frames: 47996928. Throughput: 0: 6012.4. Samples: 48006404. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:26:50,639][25689] Avg episode reward: [(0, '-58.518')] [2022-07-09 02:26:51,151][26022] Updated weights on worker 0-0, policy_version 46876 (0.00088) [2022-07-09 02:26:52,864][26022] Updated weights on worker 0-0, policy_version 46886 (0.00090) [2022-07-09 02:26:54,685][26022] Updated weights on worker 0-0, policy_version 46896 (0.00811) [2022-07-09 02:26:55,777][25689] Fps is (10 sec: 5842.1, 60 sec: 5676.4, 300 sec: 5686.5). Total num frames: 48026624. Throughput: 0: 5150.3. Samples: 48023772. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:26:55,777][25689] Avg episode reward: [(0, '-58.865')] [2022-07-09 02:26:56,519][26022] Updated weights on worker 0-0, policy_version 46906 (0.00089) [2022-07-09 02:26:58,197][26022] Updated weights on worker 0-0, policy_version 46916 (0.00084) [2022-07-09 02:27:00,132][26022] Updated weights on worker 0-0, policy_version 46926 (0.00087) [2022-07-09 02:27:00,781][25689] Fps is (10 sec: 5956.9, 60 sec: 5748.8, 300 sec: 5697.2). Total num frames: 48057344. Throughput: 0: 6016.2. Samples: 48058546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:00,781][25689] Avg episode reward: [(0, '-58.877')] [2022-07-09 02:27:02,148][26022] Updated weights on worker 0-0, policy_version 46936 (0.00083) [2022-07-09 02:27:03,959][26022] Updated weights on worker 0-0, policy_version 46946 (0.00092) [2022-07-09 02:27:05,770][26022] Updated weights on worker 0-0, policy_version 46956 (0.00088) [2022-07-09 02:27:05,831][25689] Fps is (10 sec: 5601.7, 60 sec: 5715.2, 300 sec: 5683.7). Total num frames: 48082944. Throughput: 0: 5916.6. Samples: 48091068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:05,831][25689] Avg episode reward: [(0, '-58.410')] [2022-07-09 02:27:07,596][26022] Updated weights on worker 0-0, policy_version 46966 (0.00083) [2022-07-09 02:27:09,368][26022] Updated weights on worker 0-0, policy_version 46976 (0.00090) [2022-07-09 02:27:10,840][25689] Fps is (10 sec: 5293.6, 60 sec: 5700.8, 300 sec: 5681.7). Total num frames: 48110592. Throughput: 0: 5050.5. Samples: 48108430. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:10,840][25689] Avg episode reward: [(0, '-58.810')] [2022-07-09 02:27:11,078][26022] Updated weights on worker 0-0, policy_version 46986 (0.00090) [2022-07-09 02:27:12,920][26022] Updated weights on worker 0-0, policy_version 46996 (0.00086) [2022-07-09 02:27:14,762][26022] Updated weights on worker 0-0, policy_version 47006 (0.00086) [2022-07-09 02:27:15,896][25689] Fps is (10 sec: 5595.7, 60 sec: 5688.7, 300 sec: 5678.1). Total num frames: 48139264. Throughput: 0: 5883.6. Samples: 48142144. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:15,896][25689] Avg episode reward: [(0, '-58.697')] [2022-07-09 02:27:16,557][26022] Updated weights on worker 0-0, policy_version 47016 (0.00086) [2022-07-09 02:27:18,395][26022] Updated weights on worker 0-0, policy_version 47026 (0.00088) [2022-07-09 02:27:20,024][26022] Updated weights on worker 0-0, policy_version 47036 (0.00086) [2022-07-09 02:27:20,929][25689] Fps is (10 sec: 5785.5, 60 sec: 5702.7, 300 sec: 5685.5). Total num frames: 48168960. Throughput: 0: 5858.5. Samples: 48176580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:20,929][25689] Avg episode reward: [(0, '-58.072')] [2022-07-09 02:27:22,010][26022] Updated weights on worker 0-0, policy_version 47046 (0.00086) [2022-07-09 02:27:23,667][26022] Updated weights on worker 0-0, policy_version 47056 (0.00083) [2022-07-09 02:27:25,458][26022] Updated weights on worker 0-0, policy_version 47066 (0.00084) [2022-07-09 02:27:25,955][25689] Fps is (10 sec: 5802.7, 60 sec: 5686.3, 300 sec: 5688.8). Total num frames: 48197632. Throughput: 0: 5106.3. Samples: 48193826. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:25,955][25689] Avg episode reward: [(0, '-56.598')] [2022-07-09 02:27:27,325][26022] Updated weights on worker 0-0, policy_version 47076 (0.00081) [2022-07-09 02:27:28,882][26022] Updated weights on worker 0-0, policy_version 47086 (0.00082) [2022-07-09 02:27:30,988][25689] Fps is (10 sec: 5599.0, 60 sec: 5686.2, 300 sec: 5680.2). Total num frames: 48225280. Throughput: 0: 5952.4. Samples: 48228356. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:30,988][25689] Avg episode reward: [(0, '-56.159')] [2022-07-09 02:27:31,048][26022] Updated weights on worker 0-0, policy_version 47096 (0.00089) [2022-07-09 02:27:32,708][26022] Updated weights on worker 0-0, policy_version 47106 (0.00084) [2022-07-09 02:27:34,350][26022] Updated weights on worker 0-0, policy_version 47116 (0.00091) [2022-07-09 02:27:36,083][25689] Fps is (10 sec: 5661.8, 60 sec: 5684.0, 300 sec: 5678.8). Total num frames: 48254976. Throughput: 0: 5976.0. Samples: 48262782. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 02:27:36,083][25689] Avg episode reward: [(0, '-56.189')] [2022-07-09 02:27:36,300][26022] Updated weights on worker 0-0, policy_version 47126 (0.00083) [2022-07-09 02:27:37,935][26022] Updated weights on worker 0-0, policy_version 47136 (0.00091) [2022-07-09 02:27:39,955][26022] Updated weights on worker 0-0, policy_version 47146 (0.00099) [2022-07-09 02:27:41,098][25689] Fps is (10 sec: 5975.4, 60 sec: 5718.9, 300 sec: 5693.9). Total num frames: 48285696. Throughput: 0: 5134.1. Samples: 48280134. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:27:41,099][25689] Avg episode reward: [(0, '-57.115')] [2022-07-09 02:27:41,565][26022] Updated weights on worker 0-0, policy_version 47156 (0.00087) [2022-07-09 02:27:43,268][26022] Updated weights on worker 0-0, policy_version 47166 (0.00089) [2022-07-09 02:27:45,282][26022] Updated weights on worker 0-0, policy_version 47176 (0.00505) [2022-07-09 02:27:46,131][25689] Fps is (10 sec: 5910.9, 60 sec: 5735.9, 300 sec: 5690.6). Total num frames: 48314368. Throughput: 0: 5980.1. Samples: 48314482. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:27:46,131][25689] Avg episode reward: [(0, '-57.017')] [2022-07-09 02:27:46,825][26022] Updated weights on worker 0-0, policy_version 47186 (0.00095) [2022-07-09 02:27:48,681][26022] Updated weights on worker 0-0, policy_version 47196 (0.00086) [2022-07-09 02:27:50,586][26022] Updated weights on worker 0-0, policy_version 47206 (0.00085) [2022-07-09 02:27:51,159][25689] Fps is (10 sec: 5496.4, 60 sec: 5685.1, 300 sec: 5687.8). Total num frames: 48340992. Throughput: 0: 5982.0. Samples: 48349020. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:27:51,159][25689] Avg episode reward: [(0, '-57.626')] [2022-07-09 02:27:52,315][26022] Updated weights on worker 0-0, policy_version 47216 (0.00082) [2022-07-09 02:27:54,111][26022] Updated weights on worker 0-0, policy_version 47226 (0.00084) [2022-07-09 02:27:55,715][26022] Updated weights on worker 0-0, policy_version 47236 (0.00089) [2022-07-09 02:27:56,197][25689] Fps is (10 sec: 5798.3, 60 sec: 5728.4, 300 sec: 5697.5). Total num frames: 48372736. Throughput: 0: 5142.8. Samples: 48366226. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:27:56,198][25689] Avg episode reward: [(0, '-57.893')] [2022-07-09 02:27:57,710][26022] Updated weights on worker 0-0, policy_version 47246 (0.00091) [2022-07-09 02:27:59,273][26022] Updated weights on worker 0-0, policy_version 47256 (0.00089) [2022-07-09 02:28:01,203][25689] Fps is (10 sec: 5811.2, 60 sec: 5660.5, 300 sec: 5698.4). Total num frames: 48399360. Throughput: 0: 6023.3. Samples: 48401228. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:01,203][25689] Avg episode reward: [(0, '-58.613')] [2022-07-09 02:28:01,322][26022] Updated weights on worker 0-0, policy_version 47266 (0.00083) [2022-07-09 02:28:03,076][26022] Updated weights on worker 0-0, policy_version 47276 (0.00087) [2022-07-09 02:28:05,083][26022] Updated weights on worker 0-0, policy_version 47286 (0.00087) [2022-07-09 02:28:06,213][25689] Fps is (10 sec: 5418.5, 60 sec: 5698.1, 300 sec: 5695.5). Total num frames: 48427008. Throughput: 0: 5933.4. Samples: 48433640. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:06,214][25689] Avg episode reward: [(0, '-58.818')] [2022-07-09 02:28:06,614][26022] Updated weights on worker 0-0, policy_version 47296 (0.00093) [2022-07-09 02:28:08,707][26022] Updated weights on worker 0-0, policy_version 47306 (0.00081) [2022-07-09 02:28:10,543][26022] Updated weights on worker 0-0, policy_version 47316 (0.00088) [2022-07-09 02:28:11,226][25689] Fps is (10 sec: 5618.7, 60 sec: 5714.7, 300 sec: 5690.6). Total num frames: 48455680. Throughput: 0: 5071.8. Samples: 48450800. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:11,228][25689] Avg episode reward: [(0, '-58.399')] [2022-07-09 02:28:12,059][26022] Updated weights on worker 0-0, policy_version 47326 (0.00087) [2022-07-09 02:28:13,918][26022] Updated weights on worker 0-0, policy_version 47336 (0.00083) [2022-07-09 02:28:15,868][26022] Updated weights on worker 0-0, policy_version 47346 (0.00081) [2022-07-09 02:28:16,291][25689] Fps is (10 sec: 5690.3, 60 sec: 5713.9, 300 sec: 5694.4). Total num frames: 48484352. Throughput: 0: 5934.8. Samples: 48485478. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:16,291][25689] Avg episode reward: [(0, '-58.751')] [2022-07-09 02:28:17,423][26022] Updated weights on worker 0-0, policy_version 47356 (0.00086) [2022-07-09 02:28:19,099][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:28:19,110][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000047364_48500736.pth [2022-07-09 02:28:19,110][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000045363_46451712.pth [2022-07-09 02:28:19,462][26022] Updated weights on worker 0-0, policy_version 47366 (0.00082) [2022-07-09 02:28:20,926][26022] Updated weights on worker 0-0, policy_version 47376 (0.00082) [2022-07-09 02:28:21,298][25689] Fps is (10 sec: 5896.7, 60 sec: 5733.2, 300 sec: 5696.2). Total num frames: 48515072. Throughput: 0: 5905.5. Samples: 48519904. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:21,307][25689] Avg episode reward: [(0, '-60.308')] [2022-07-09 02:28:23,060][26022] Updated weights on worker 0-0, policy_version 47386 (0.00087) [2022-07-09 02:28:24,511][26022] Updated weights on worker 0-0, policy_version 47396 (0.00091) [2022-07-09 02:28:26,320][25689] Fps is (10 sec: 5819.9, 60 sec: 5716.7, 300 sec: 5699.4). Total num frames: 48542720. Throughput: 0: 5149.5. Samples: 48537180. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:26,320][25689] Avg episode reward: [(0, '-59.869')] [2022-07-09 02:28:26,482][26022] Updated weights on worker 0-0, policy_version 47406 (0.00080) [2022-07-09 02:28:28,204][26022] Updated weights on worker 0-0, policy_version 47416 (0.00085) [2022-07-09 02:28:29,774][26022] Updated weights on worker 0-0, policy_version 47426 (0.00081) [2022-07-09 02:28:31,326][25689] Fps is (10 sec: 5616.8, 60 sec: 5736.2, 300 sec: 5696.5). Total num frames: 48571392. Throughput: 0: 6020.1. Samples: 48571800. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:31,326][25689] Avg episode reward: [(0, '-58.919')] [2022-07-09 02:28:31,887][26022] Updated weights on worker 0-0, policy_version 47436 (0.00097) [2022-07-09 02:28:33,562][26022] Updated weights on worker 0-0, policy_version 47446 (0.00091) [2022-07-09 02:28:35,233][26022] Updated weights on worker 0-0, policy_version 47456 (0.00086) [2022-07-09 02:28:36,373][25689] Fps is (10 sec: 5805.8, 60 sec: 5740.7, 300 sec: 5695.7). Total num frames: 48601088. Throughput: 0: 6009.5. Samples: 48606164. Policy #0 lag: (min: 0.0, avg: 7.1, max: 17.0) [2022-07-09 02:28:36,375][25689] Avg episode reward: [(0, '-58.179')] [2022-07-09 02:28:37,254][26022] Updated weights on worker 0-0, policy_version 47466 (0.00092) [2022-07-09 02:28:38,796][26022] Updated weights on worker 0-0, policy_version 47476 (0.00089) [2022-07-09 02:28:40,926][26022] Updated weights on worker 0-0, policy_version 47486 (0.00091) [2022-07-09 02:28:41,394][25689] Fps is (10 sec: 5593.5, 60 sec: 5672.3, 300 sec: 5688.7). Total num frames: 48627712. Throughput: 0: 5138.7. Samples: 48623172. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:28:41,396][25689] Avg episode reward: [(0, '-57.562')] [2022-07-09 02:28:42,435][26022] Updated weights on worker 0-0, policy_version 47496 (0.00094) [2022-07-09 02:28:44,379][26022] Updated weights on worker 0-0, policy_version 47506 (0.00091) [2022-07-09 02:28:45,975][26022] Updated weights on worker 0-0, policy_version 47516 (0.00080) [2022-07-09 02:28:46,407][25689] Fps is (10 sec: 5714.8, 60 sec: 5708.1, 300 sec: 5699.4). Total num frames: 48658432. Throughput: 0: 5996.7. Samples: 48657640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:28:46,409][25689] Avg episode reward: [(0, '-57.716')] [2022-07-09 02:28:47,890][26022] Updated weights on worker 0-0, policy_version 47526 (0.00103) [2022-07-09 02:28:49,531][26022] Updated weights on worker 0-0, policy_version 47536 (0.00086) [2022-07-09 02:28:51,426][25689] Fps is (10 sec: 5818.5, 60 sec: 5726.0, 300 sec: 5693.1). Total num frames: 48686080. Throughput: 0: 5972.4. Samples: 48691848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:28:51,426][25689] Avg episode reward: [(0, '-55.725')] [2022-07-09 02:28:51,585][26022] Updated weights on worker 0-0, policy_version 47546 (0.00092) [2022-07-09 02:28:53,241][26022] Updated weights on worker 0-0, policy_version 47556 (0.00088) [2022-07-09 02:28:55,159][26022] Updated weights on worker 0-0, policy_version 47566 (0.00082) [2022-07-09 02:28:56,473][25689] Fps is (10 sec: 5595.2, 60 sec: 5674.2, 300 sec: 5696.1). Total num frames: 48714752. Throughput: 0: 5099.7. Samples: 48708668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:28:56,474][25689] Avg episode reward: [(0, '-55.870')] [2022-07-09 02:28:56,888][26022] Updated weights on worker 0-0, policy_version 47576 (0.00095) [2022-07-09 02:28:58,674][26022] Updated weights on worker 0-0, policy_version 47586 (0.00104) [2022-07-09 02:29:00,412][26022] Updated weights on worker 0-0, policy_version 47596 (0.00084) [2022-07-09 02:29:01,487][25689] Fps is (10 sec: 5699.8, 60 sec: 5707.4, 300 sec: 5699.5). Total num frames: 48743424. Throughput: 0: 5982.5. Samples: 48743374. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:01,487][25689] Avg episode reward: [(0, '-56.734')] [2022-07-09 02:29:02,727][26022] Updated weights on worker 0-0, policy_version 47606 (0.00083) [2022-07-09 02:29:04,261][26022] Updated weights on worker 0-0, policy_version 47616 (0.00084) [2022-07-09 02:29:06,275][26022] Updated weights on worker 0-0, policy_version 47626 (0.00080) [2022-07-09 02:29:06,512][25689] Fps is (10 sec: 5406.0, 60 sec: 5672.0, 300 sec: 5686.2). Total num frames: 48769024. Throughput: 0: 5872.1. Samples: 48775698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:06,513][25689] Avg episode reward: [(0, '-57.114')] [2022-07-09 02:29:07,779][26022] Updated weights on worker 0-0, policy_version 47636 (0.00087) [2022-07-09 02:29:09,855][26022] Updated weights on worker 0-0, policy_version 47646 (0.00105) [2022-07-09 02:29:11,420][26022] Updated weights on worker 0-0, policy_version 47656 (0.00089) [2022-07-09 02:29:11,517][25689] Fps is (10 sec: 5615.2, 60 sec: 5706.8, 300 sec: 5698.3). Total num frames: 48799744. Throughput: 0: 5035.0. Samples: 48793004. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:11,517][25689] Avg episode reward: [(0, '-56.985')] [2022-07-09 02:29:13,449][26022] Updated weights on worker 0-0, policy_version 47666 (0.00792) [2022-07-09 02:29:14,961][26022] Updated weights on worker 0-0, policy_version 47676 (0.00087) [2022-07-09 02:29:16,563][25689] Fps is (10 sec: 5807.6, 60 sec: 5691.5, 300 sec: 5683.9). Total num frames: 48827392. Throughput: 0: 5913.3. Samples: 48827464. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:16,563][25689] Avg episode reward: [(0, '-57.583')] [2022-07-09 02:29:17,071][26022] Updated weights on worker 0-0, policy_version 47686 (0.00092) [2022-07-09 02:29:18,461][26022] Updated weights on worker 0-0, policy_version 47696 (0.00087) [2022-07-09 02:29:20,531][26022] Updated weights on worker 0-0, policy_version 47706 (0.00086) [2022-07-09 02:29:21,588][25689] Fps is (10 sec: 5693.9, 60 sec: 5672.9, 300 sec: 5697.6). Total num frames: 48857088. Throughput: 0: 5901.1. Samples: 48861992. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:21,589][25689] Avg episode reward: [(0, '-58.374')] [2022-07-09 02:29:22,138][26022] Updated weights on worker 0-0, policy_version 47716 (0.00087) [2022-07-09 02:29:23,962][26022] Updated weights on worker 0-0, policy_version 47726 (0.00084) [2022-07-09 02:29:25,935][26022] Updated weights on worker 0-0, policy_version 47736 (0.00095) [2022-07-09 02:29:26,595][25689] Fps is (10 sec: 5919.9, 60 sec: 5708.2, 300 sec: 5694.3). Total num frames: 48886784. Throughput: 0: 5166.5. Samples: 48879456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:26,596][25689] Avg episode reward: [(0, '-58.334')] [2022-07-09 02:29:27,553][26022] Updated weights on worker 0-0, policy_version 47746 (0.00089) [2022-07-09 02:29:29,348][26022] Updated weights on worker 0-0, policy_version 47756 (0.00083) [2022-07-09 02:29:31,291][26022] Updated weights on worker 0-0, policy_version 47766 (0.00086) [2022-07-09 02:29:31,620][25689] Fps is (10 sec: 5716.1, 60 sec: 5689.4, 300 sec: 5694.7). Total num frames: 48914432. Throughput: 0: 6003.6. Samples: 48913694. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:31,620][25689] Avg episode reward: [(0, '-58.365')] [2022-07-09 02:29:32,939][26022] Updated weights on worker 0-0, policy_version 47776 (0.00090) [2022-07-09 02:29:34,823][26022] Updated weights on worker 0-0, policy_version 47786 (0.00092) [2022-07-09 02:29:36,483][26022] Updated weights on worker 0-0, policy_version 47796 (0.00098) [2022-07-09 02:29:36,675][25689] Fps is (10 sec: 5587.6, 60 sec: 5671.8, 300 sec: 5690.5). Total num frames: 48943104. Throughput: 0: 6001.7. Samples: 48948170. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-09 02:29:36,675][25689] Avg episode reward: [(0, '-58.574')] [2022-07-09 02:29:38,423][26022] Updated weights on worker 0-0, policy_version 47806 (0.00095) [2022-07-09 02:29:39,938][26022] Updated weights on worker 0-0, policy_version 47816 (0.00088) [2022-07-09 02:29:41,725][25689] Fps is (10 sec: 5775.8, 60 sec: 5719.9, 300 sec: 5693.3). Total num frames: 48972800. Throughput: 0: 5135.1. Samples: 48965400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:29:41,726][25689] Avg episode reward: [(0, '-57.903')] [2022-07-09 02:29:41,975][26022] Updated weights on worker 0-0, policy_version 47826 (0.00082) [2022-07-09 02:29:43,296][26022] Updated weights on worker 0-0, policy_version 47836 (0.00091) [2022-07-09 02:29:45,322][26022] Updated weights on worker 0-0, policy_version 47846 (0.00087) [2022-07-09 02:29:46,727][25689] Fps is (10 sec: 5908.2, 60 sec: 5704.0, 300 sec: 5697.3). Total num frames: 49002496. Throughput: 0: 6018.9. Samples: 49000628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:29:46,728][25689] Avg episode reward: [(0, '-57.847')] [2022-07-09 02:29:47,059][26022] Updated weights on worker 0-0, policy_version 47856 (0.00092) [2022-07-09 02:29:48,726][26022] Updated weights on worker 0-0, policy_version 47866 (0.00088) [2022-07-09 02:29:50,745][26022] Updated weights on worker 0-0, policy_version 47876 (0.00084) [2022-07-09 02:29:51,730][25689] Fps is (10 sec: 5936.2, 60 sec: 5739.4, 300 sec: 5701.3). Total num frames: 49032192. Throughput: 0: 6048.8. Samples: 49035340. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:29:51,731][25689] Avg episode reward: [(0, '-57.557')] [2022-07-09 02:29:52,402][26022] Updated weights on worker 0-0, policy_version 47886 (0.00084) [2022-07-09 02:29:54,182][26022] Updated weights on worker 0-0, policy_version 47896 (0.00086) [2022-07-09 02:29:56,116][26022] Updated weights on worker 0-0, policy_version 47906 (0.00093) [2022-07-09 02:29:56,822][25689] Fps is (10 sec: 5579.0, 60 sec: 5701.2, 300 sec: 5692.8). Total num frames: 49058816. Throughput: 0: 6043.5. Samples: 49069932. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:29:56,823][25689] Avg episode reward: [(0, '-57.566')] [2022-07-09 02:29:57,597][26022] Updated weights on worker 0-0, policy_version 47916 (0.00090) [2022-07-09 02:29:59,876][26022] Updated weights on worker 0-0, policy_version 47926 (0.00084) [2022-07-09 02:30:01,183][26022] Updated weights on worker 0-0, policy_version 47936 (0.00084) [2022-07-09 02:30:01,891][25689] Fps is (10 sec: 5643.8, 60 sec: 5729.9, 300 sec: 5708.9). Total num frames: 49089536. Throughput: 0: 6042.1. Samples: 49087244. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:01,891][25689] Avg episode reward: [(0, '-57.766')] [2022-07-09 02:30:03,572][26022] Updated weights on worker 0-0, policy_version 47946 (0.00080) [2022-07-09 02:30:05,115][26022] Updated weights on worker 0-0, policy_version 47956 (0.00067) [2022-07-09 02:30:06,903][25689] Fps is (10 sec: 5587.0, 60 sec: 5731.3, 300 sec: 5691.5). Total num frames: 49115136. Throughput: 0: 5896.1. Samples: 49119584. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:06,903][25689] Avg episode reward: [(0, '-58.277')] [2022-07-09 02:30:07,115][26022] Updated weights on worker 0-0, policy_version 47966 (0.00100) [2022-07-09 02:30:08,740][26022] Updated weights on worker 0-0, policy_version 47976 (0.00088) [2022-07-09 02:30:10,704][26022] Updated weights on worker 0-0, policy_version 47986 (0.00089) [2022-07-09 02:30:11,912][25689] Fps is (10 sec: 5722.3, 60 sec: 5747.7, 300 sec: 5710.5). Total num frames: 49146880. Throughput: 0: 5892.2. Samples: 49154256. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:11,913][25689] Avg episode reward: [(0, '-58.981')] [2022-07-09 02:30:12,214][26022] Updated weights on worker 0-0, policy_version 47996 (0.00087) [2022-07-09 02:30:14,150][26022] Updated weights on worker 0-0, policy_version 48006 (0.00091) [2022-07-09 02:30:15,911][26022] Updated weights on worker 0-0, policy_version 48016 (0.00090) [2022-07-09 02:30:16,998][25689] Fps is (10 sec: 5883.3, 60 sec: 5744.0, 300 sec: 5698.9). Total num frames: 49174528. Throughput: 0: 5048.5. Samples: 49171788. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:16,998][25689] Avg episode reward: [(0, '-59.620')] [2022-07-09 02:30:17,573][26022] Updated weights on worker 0-0, policy_version 48026 (0.00087) [2022-07-09 02:30:19,132][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:30:19,152][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000048034_49186816.pth [2022-07-09 02:30:19,153][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000046030_47134720.pth [2022-07-09 02:30:19,402][26022] Updated weights on worker 0-0, policy_version 48036 (0.00088) [2022-07-09 02:30:21,466][26022] Updated weights on worker 0-0, policy_version 48046 (0.00081) [2022-07-09 02:30:22,026][25689] Fps is (10 sec: 5467.2, 60 sec: 5709.7, 300 sec: 5695.1). Total num frames: 49202176. Throughput: 0: 5909.0. Samples: 49206224. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:22,027][25689] Avg episode reward: [(0, '-59.957')] [2022-07-09 02:30:22,943][26022] Updated weights on worker 0-0, policy_version 48056 (0.00090) [2022-07-09 02:30:24,980][26022] Updated weights on worker 0-0, policy_version 48066 (0.00089) [2022-07-09 02:30:26,498][26022] Updated weights on worker 0-0, policy_version 48076 (0.00085) [2022-07-09 02:30:27,063][25689] Fps is (10 sec: 5697.2, 60 sec: 5707.0, 300 sec: 5698.5). Total num frames: 49231872. Throughput: 0: 6005.1. Samples: 49240648. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:27,063][25689] Avg episode reward: [(0, '-59.696')] [2022-07-09 02:30:28,569][26022] Updated weights on worker 0-0, policy_version 48086 (0.00097) [2022-07-09 02:30:30,270][26022] Updated weights on worker 0-0, policy_version 48096 (0.00092) [2022-07-09 02:30:31,957][26022] Updated weights on worker 0-0, policy_version 48106 (0.00086) [2022-07-09 02:30:32,083][25689] Fps is (10 sec: 5804.1, 60 sec: 5724.4, 300 sec: 5701.2). Total num frames: 49260544. Throughput: 0: 5137.8. Samples: 49257888. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:32,083][25689] Avg episode reward: [(0, '-59.104')] [2022-07-09 02:30:33,887][26022] Updated weights on worker 0-0, policy_version 48116 (0.00080) [2022-07-09 02:30:35,668][26022] Updated weights on worker 0-0, policy_version 48126 (0.00093) [2022-07-09 02:30:37,160][25689] Fps is (10 sec: 5679.2, 60 sec: 5722.2, 300 sec: 5700.9). Total num frames: 49289216. Throughput: 0: 5969.8. Samples: 49292154. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 02:30:37,161][25689] Avg episode reward: [(0, '-60.159')] [2022-07-09 02:30:37,307][26022] Updated weights on worker 0-0, policy_version 48136 (0.00084) [2022-07-09 02:30:39,107][26022] Updated weights on worker 0-0, policy_version 48146 (0.00088) [2022-07-09 02:30:40,739][26022] Updated weights on worker 0-0, policy_version 48156 (0.00108) [2022-07-09 02:30:42,162][25689] Fps is (10 sec: 5689.1, 60 sec: 5709.9, 300 sec: 5701.5). Total num frames: 49317888. Throughput: 0: 5982.3. Samples: 49326684. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:30:42,163][25689] Avg episode reward: [(0, '-59.916')] [2022-07-09 02:30:42,922][26022] Updated weights on worker 0-0, policy_version 48166 (0.00082) [2022-07-09 02:30:44,196][26022] Updated weights on worker 0-0, policy_version 48176 (0.00078) [2022-07-09 02:30:46,422][26022] Updated weights on worker 0-0, policy_version 48186 (0.00085) [2022-07-09 02:30:47,178][25689] Fps is (10 sec: 5724.2, 60 sec: 5691.6, 300 sec: 5705.0). Total num frames: 49346560. Throughput: 0: 5144.7. Samples: 49344134. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:30:47,178][25689] Avg episode reward: [(0, '-59.632')] [2022-07-09 02:30:47,699][26022] Updated weights on worker 0-0, policy_version 48196 (0.00085) [2022-07-09 02:30:49,958][26022] Updated weights on worker 0-0, policy_version 48206 (0.00102) [2022-07-09 02:30:51,496][26022] Updated weights on worker 0-0, policy_version 48216 (0.00085) [2022-07-09 02:30:52,182][25689] Fps is (10 sec: 5825.1, 60 sec: 5691.5, 300 sec: 5702.3). Total num frames: 49376256. Throughput: 0: 5991.5. Samples: 49378316. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:30:52,183][25689] Avg episode reward: [(0, '-59.281')] [2022-07-09 02:30:53,512][26022] Updated weights on worker 0-0, policy_version 48226 (0.00080) [2022-07-09 02:30:55,182][26022] Updated weights on worker 0-0, policy_version 48236 (0.00086) [2022-07-09 02:30:57,238][26022] Updated weights on worker 0-0, policy_version 48246 (0.00089) [2022-07-09 02:30:57,266][25689] Fps is (10 sec: 5684.1, 60 sec: 5709.2, 300 sec: 5705.2). Total num frames: 49403904. Throughput: 0: 6003.8. Samples: 49412868. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:30:57,267][25689] Avg episode reward: [(0, '-59.071')] [2022-07-09 02:30:58,744][26022] Updated weights on worker 0-0, policy_version 48256 (0.00091) [2022-07-09 02:31:00,815][26022] Updated weights on worker 0-0, policy_version 48266 (0.00096) [2022-07-09 02:31:02,291][25689] Fps is (10 sec: 5571.7, 60 sec: 5679.5, 300 sec: 5709.2). Total num frames: 49432576. Throughput: 0: 5123.0. Samples: 49429802. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:02,291][25689] Avg episode reward: [(0, '-59.056')] [2022-07-09 02:31:02,572][26022] Updated weights on worker 0-0, policy_version 48276 (0.00088) [2022-07-09 02:31:04,664][26022] Updated weights on worker 0-0, policy_version 48286 (0.00088) [2022-07-09 02:31:06,303][26022] Updated weights on worker 0-0, policy_version 48296 (0.00082) [2022-07-09 02:31:07,312][25689] Fps is (10 sec: 5708.6, 60 sec: 5729.5, 300 sec: 5709.5). Total num frames: 49461248. Throughput: 0: 5847.1. Samples: 49461858. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:07,313][25689] Avg episode reward: [(0, '-58.036')] [2022-07-09 02:31:08,341][26022] Updated weights on worker 0-0, policy_version 48306 (0.00095) [2022-07-09 02:31:09,957][26022] Updated weights on worker 0-0, policy_version 48316 (0.00051) [2022-07-09 02:31:11,835][26022] Updated weights on worker 0-0, policy_version 48326 (0.00094) [2022-07-09 02:31:12,324][25689] Fps is (10 sec: 5409.1, 60 sec: 5627.5, 300 sec: 5697.5). Total num frames: 49486848. Throughput: 0: 5844.3. Samples: 49496030. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:12,325][25689] Avg episode reward: [(0, '-58.568')] [2022-07-09 02:31:13,334][26022] Updated weights on worker 0-0, policy_version 48336 (0.00088) [2022-07-09 02:31:15,429][26022] Updated weights on worker 0-0, policy_version 48346 (0.00081) [2022-07-09 02:31:17,039][26022] Updated weights on worker 0-0, policy_version 48356 (0.00087) [2022-07-09 02:31:17,371][25689] Fps is (10 sec: 5598.6, 60 sec: 5681.9, 300 sec: 5703.5). Total num frames: 49517568. Throughput: 0: 5008.4. Samples: 49513562. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:17,373][25689] Avg episode reward: [(0, '-58.024')] [2022-07-09 02:31:18,953][26022] Updated weights on worker 0-0, policy_version 48366 (0.00103) [2022-07-09 02:31:20,625][26022] Updated weights on worker 0-0, policy_version 48376 (0.00084) [2022-07-09 02:31:22,291][26022] Updated weights on worker 0-0, policy_version 48386 (0.00079) [2022-07-09 02:31:22,387][25689] Fps is (10 sec: 6004.1, 60 sec: 5717.1, 300 sec: 5703.8). Total num frames: 49547264. Throughput: 0: 5893.1. Samples: 49548230. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:22,387][25689] Avg episode reward: [(0, '-57.369')] [2022-07-09 02:31:24,274][26022] Updated weights on worker 0-0, policy_version 48396 (0.00083) [2022-07-09 02:31:25,858][26022] Updated weights on worker 0-0, policy_version 48406 (0.00088) [2022-07-09 02:31:27,407][25689] Fps is (10 sec: 5816.3, 60 sec: 5701.7, 300 sec: 5707.5). Total num frames: 49575936. Throughput: 0: 6013.9. Samples: 49582706. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:27,408][25689] Avg episode reward: [(0, '-56.566')] [2022-07-09 02:31:27,864][26022] Updated weights on worker 0-0, policy_version 48416 (0.00094) [2022-07-09 02:31:29,575][26022] Updated weights on worker 0-0, policy_version 48426 (0.00083) [2022-07-09 02:31:31,248][26022] Updated weights on worker 0-0, policy_version 48436 (0.00087) [2022-07-09 02:31:32,422][25689] Fps is (10 sec: 5714.0, 60 sec: 5702.1, 300 sec: 5705.1). Total num frames: 49604608. Throughput: 0: 5169.9. Samples: 49599936. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:32,423][25689] Avg episode reward: [(0, '-55.744')] [2022-07-09 02:31:33,002][26022] Updated weights on worker 0-0, policy_version 48446 (0.00089) [2022-07-09 02:31:34,974][26022] Updated weights on worker 0-0, policy_version 48456 (0.00084) [2022-07-09 02:31:36,711][26022] Updated weights on worker 0-0, policy_version 48466 (0.00082) [2022-07-09 02:31:37,467][25689] Fps is (10 sec: 5802.0, 60 sec: 5722.2, 300 sec: 5708.3). Total num frames: 49634304. Throughput: 0: 6040.8. Samples: 49634954. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-09 02:31:37,467][25689] Avg episode reward: [(0, '-56.643')] [2022-07-09 02:31:38,421][26022] Updated weights on worker 0-0, policy_version 48476 (0.00090) [2022-07-09 02:31:40,136][26022] Updated weights on worker 0-0, policy_version 48486 (0.00095) [2022-07-09 02:31:41,936][26022] Updated weights on worker 0-0, policy_version 48496 (0.00081) [2022-07-09 02:31:42,481][25689] Fps is (10 sec: 5701.2, 60 sec: 5704.1, 300 sec: 5708.6). Total num frames: 49661952. Throughput: 0: 6037.0. Samples: 49669538. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:31:42,483][25689] Avg episode reward: [(0, '-56.637')] [2022-07-09 02:31:43,746][26022] Updated weights on worker 0-0, policy_version 48506 (0.00083) [2022-07-09 02:31:45,392][26022] Updated weights on worker 0-0, policy_version 48516 (0.00084) [2022-07-09 02:31:47,183][26022] Updated weights on worker 0-0, policy_version 48526 (0.00082) [2022-07-09 02:31:47,491][25689] Fps is (10 sec: 5822.4, 60 sec: 5738.5, 300 sec: 5712.4). Total num frames: 49692672. Throughput: 0: 5188.9. Samples: 49686924. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:31:47,493][25689] Avg episode reward: [(0, '-58.078')] [2022-07-09 02:31:49,312][26022] Updated weights on worker 0-0, policy_version 48536 (0.00095) [2022-07-09 02:31:50,679][26022] Updated weights on worker 0-0, policy_version 48546 (0.00096) [2022-07-09 02:31:52,512][25689] Fps is (10 sec: 5614.4, 60 sec: 5669.1, 300 sec: 5700.9). Total num frames: 49718272. Throughput: 0: 6027.7. Samples: 49721030. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:31:52,512][25689] Avg episode reward: [(0, '-58.615')] [2022-07-09 02:31:52,796][26022] Updated weights on worker 0-0, policy_version 48556 (0.00081) [2022-07-09 02:31:54,234][26022] Updated weights on worker 0-0, policy_version 48566 (0.00079) [2022-07-09 02:31:56,190][26022] Updated weights on worker 0-0, policy_version 48576 (0.00092) [2022-07-09 02:31:57,581][25689] Fps is (10 sec: 5582.2, 60 sec: 5721.5, 300 sec: 5699.6). Total num frames: 49748992. Throughput: 0: 5991.8. Samples: 49755474. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:31:57,581][25689] Avg episode reward: [(0, '-59.130')] [2022-07-09 02:31:58,375][26022] Updated weights on worker 0-0, policy_version 48586 (0.00083) [2022-07-09 02:31:59,570][26022] Updated weights on worker 0-0, policy_version 48596 (0.00094) [2022-07-09 02:32:01,792][26022] Updated weights on worker 0-0, policy_version 48606 (0.00096) [2022-07-09 02:32:02,584][25689] Fps is (10 sec: 5693.5, 60 sec: 5689.5, 300 sec: 5704.0). Total num frames: 49775616. Throughput: 0: 5141.7. Samples: 49772902. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:02,584][25689] Avg episode reward: [(0, '-58.355')] [2022-07-09 02:32:03,583][26022] Updated weights on worker 0-0, policy_version 48616 (0.00087) [2022-07-09 02:32:05,496][26022] Updated weights on worker 0-0, policy_version 48626 (0.00087) [2022-07-09 02:32:07,510][26022] Updated weights on worker 0-0, policy_version 48636 (0.00096) [2022-07-09 02:32:07,587][25689] Fps is (10 sec: 5423.9, 60 sec: 5674.3, 300 sec: 5704.1). Total num frames: 49803264. Throughput: 0: 5882.8. Samples: 49805140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:07,587][25689] Avg episode reward: [(0, '-58.209')] [2022-07-09 02:32:09,129][26022] Updated weights on worker 0-0, policy_version 48646 (0.00087) [2022-07-09 02:32:10,917][26022] Updated weights on worker 0-0, policy_version 48656 (0.00094) [2022-07-09 02:32:12,614][25689] Fps is (10 sec: 5614.8, 60 sec: 5723.8, 300 sec: 5704.6). Total num frames: 49831936. Throughput: 0: 5887.2. Samples: 49839378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:12,615][25689] Avg episode reward: [(0, '-57.699')] [2022-07-09 02:32:12,843][26022] Updated weights on worker 0-0, policy_version 48666 (0.00084) [2022-07-09 02:32:14,435][26022] Updated weights on worker 0-0, policy_version 48676 (0.00089) [2022-07-09 02:32:16,622][26022] Updated weights on worker 0-0, policy_version 48686 (0.00090) [2022-07-09 02:32:17,659][25689] Fps is (10 sec: 5794.8, 60 sec: 5707.0, 300 sec: 5704.4). Total num frames: 49861632. Throughput: 0: 5002.0. Samples: 49855906. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:17,660][25689] Avg episode reward: [(0, '-56.805')] [2022-07-09 02:32:18,083][26022] Updated weights on worker 0-0, policy_version 48696 (0.00094) [2022-07-09 02:32:19,187][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:32:19,201][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000048701_49869824.pth [2022-07-09 02:32:19,201][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000046696_47816704.pth [2022-07-09 02:32:20,011][26022] Updated weights on worker 0-0, policy_version 48706 (0.00088) [2022-07-09 02:32:21,975][26022] Updated weights on worker 0-0, policy_version 48716 (0.00089) [2022-07-09 02:32:22,663][25689] Fps is (10 sec: 5706.9, 60 sec: 5674.2, 300 sec: 5701.4). Total num frames: 49889280. Throughput: 0: 5840.6. Samples: 49890174. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:22,663][25689] Avg episode reward: [(0, '-55.675')] [2022-07-09 02:32:23,606][26022] Updated weights on worker 0-0, policy_version 48726 (0.00086) [2022-07-09 02:32:25,355][26022] Updated weights on worker 0-0, policy_version 48736 (0.00087) [2022-07-09 02:32:27,221][26022] Updated weights on worker 0-0, policy_version 48746 (0.00091) [2022-07-09 02:32:27,665][25689] Fps is (10 sec: 5628.9, 60 sec: 5675.9, 300 sec: 5705.4). Total num frames: 49917952. Throughput: 0: 5929.7. Samples: 49924196. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:27,667][25689] Avg episode reward: [(0, '-55.426')] [2022-07-09 02:32:28,894][26022] Updated weights on worker 0-0, policy_version 48756 (0.00082) [2022-07-09 02:32:30,801][26022] Updated weights on worker 0-0, policy_version 48766 (0.00088) [2022-07-09 02:32:32,678][25689] Fps is (10 sec: 5623.4, 60 sec: 5659.1, 300 sec: 5700.1). Total num frames: 49945600. Throughput: 0: 5083.8. Samples: 49941378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:32,679][25689] Avg episode reward: [(0, '-56.234')] [2022-07-09 02:32:32,697][26022] Updated weights on worker 0-0, policy_version 48776 (0.00086) [2022-07-09 02:32:34,204][26022] Updated weights on worker 0-0, policy_version 48786 (0.00085) [2022-07-09 02:32:36,344][26022] Updated weights on worker 0-0, policy_version 48796 (0.00087) [2022-07-09 02:32:37,687][26022] Updated weights on worker 0-0, policy_version 48806 (0.00087) [2022-07-09 02:32:37,766][25689] Fps is (10 sec: 5879.5, 60 sec: 5688.9, 300 sec: 5702.2). Total num frames: 49977344. Throughput: 0: 5956.4. Samples: 49975672. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:32:37,767][25689] Avg episode reward: [(0, '-56.630')] [2022-07-09 02:32:39,810][26022] Updated weights on worker 0-0, policy_version 48816 (0.00097) [2022-07-09 02:32:41,606][26022] Updated weights on worker 0-0, policy_version 48826 (0.00084) [2022-07-09 02:32:42,769][25689] Fps is (10 sec: 5784.4, 60 sec: 5673.0, 300 sec: 5695.8). Total num frames: 50003968. Throughput: 0: 5965.3. Samples: 50010112. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:32:42,769][25689] Avg episode reward: [(0, '-57.373')] [2022-07-09 02:32:43,278][26022] Updated weights on worker 0-0, policy_version 48836 (0.00087) [2022-07-09 02:32:45,163][26022] Updated weights on worker 0-0, policy_version 48846 (0.00092) [2022-07-09 02:32:46,810][26022] Updated weights on worker 0-0, policy_version 48856 (0.00077) [2022-07-09 02:32:47,790][25689] Fps is (10 sec: 5414.1, 60 sec: 5621.1, 300 sec: 5699.4). Total num frames: 50031616. Throughput: 0: 5113.0. Samples: 50027098. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:32:47,791][25689] Avg episode reward: [(0, '-58.208')] [2022-07-09 02:32:48,643][26022] Updated weights on worker 0-0, policy_version 48866 (0.00094) [2022-07-09 02:32:50,517][26022] Updated weights on worker 0-0, policy_version 48876 (0.00078) [2022-07-09 02:32:52,224][26022] Updated weights on worker 0-0, policy_version 48886 (0.00088) [2022-07-09 02:32:52,796][25689] Fps is (10 sec: 5820.9, 60 sec: 5707.4, 300 sec: 5696.6). Total num frames: 50062336. Throughput: 0: 5989.6. Samples: 50061876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:32:52,796][25689] Avg episode reward: [(0, '-58.705')] [2022-07-09 02:32:54,072][26022] Updated weights on worker 0-0, policy_version 48896 (0.00087) [2022-07-09 02:32:55,743][26022] Updated weights on worker 0-0, policy_version 48906 (0.00088) [2022-07-09 02:32:57,723][26022] Updated weights on worker 0-0, policy_version 48916 (0.00088) [2022-07-09 02:32:57,922][25689] Fps is (10 sec: 5862.3, 60 sec: 5668.1, 300 sec: 5701.2). Total num frames: 50091008. Throughput: 0: 5990.3. Samples: 50096408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:32:57,923][25689] Avg episode reward: [(0, '-59.793')] [2022-07-09 02:32:59,275][26022] Updated weights on worker 0-0, policy_version 48926 (0.00089) [2022-07-09 02:33:01,586][26022] Updated weights on worker 0-0, policy_version 48936 (0.00090) [2022-07-09 02:33:02,982][25689] Fps is (10 sec: 5629.7, 60 sec: 5696.6, 300 sec: 5703.7). Total num frames: 50119680. Throughput: 0: 5129.3. Samples: 50113792. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:02,984][25689] Avg episode reward: [(0, '-59.683')] [2022-07-09 02:33:03,071][26022] Updated weights on worker 0-0, policy_version 48946 (0.00083) [2022-07-09 02:33:05,233][26022] Updated weights on worker 0-0, policy_version 48956 (0.00088) [2022-07-09 02:33:06,697][26022] Updated weights on worker 0-0, policy_version 48966 (0.00086) [2022-07-09 02:33:08,010][25689] Fps is (10 sec: 5481.4, 60 sec: 5677.3, 300 sec: 5696.5). Total num frames: 50146304. Throughput: 0: 5899.9. Samples: 50146388. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:08,010][25689] Avg episode reward: [(0, '-60.269')] [2022-07-09 02:33:08,776][26022] Updated weights on worker 0-0, policy_version 48976 (0.00091) [2022-07-09 02:33:10,363][26022] Updated weights on worker 0-0, policy_version 48986 (0.00090) [2022-07-09 02:33:12,405][26022] Updated weights on worker 0-0, policy_version 48996 (0.00087) [2022-07-09 02:33:13,074][25689] Fps is (10 sec: 5580.3, 60 sec: 5690.8, 300 sec: 5699.9). Total num frames: 50176000. Throughput: 0: 5866.7. Samples: 50180844. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:13,075][25689] Avg episode reward: [(0, '-59.487')] [2022-07-09 02:33:13,954][26022] Updated weights on worker 0-0, policy_version 49006 (0.00083) [2022-07-09 02:33:15,826][26022] Updated weights on worker 0-0, policy_version 49016 (0.00092) [2022-07-09 02:33:17,413][26022] Updated weights on worker 0-0, policy_version 49026 (0.00086) [2022-07-09 02:33:18,139][25689] Fps is (10 sec: 5863.4, 60 sec: 5688.9, 300 sec: 5695.4). Total num frames: 50205696. Throughput: 0: 5027.8. Samples: 50198056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:18,139][25689] Avg episode reward: [(0, '-58.908')] [2022-07-09 02:33:19,351][26022] Updated weights on worker 0-0, policy_version 49036 (0.00081) [2022-07-09 02:33:21,036][26022] Updated weights on worker 0-0, policy_version 49046 (0.00094) [2022-07-09 02:33:22,907][26022] Updated weights on worker 0-0, policy_version 49056 (0.00095) [2022-07-09 02:33:23,224][25689] Fps is (10 sec: 5750.7, 60 sec: 5698.1, 300 sec: 5697.6). Total num frames: 50234368. Throughput: 0: 5872.4. Samples: 50232664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:23,225][25689] Avg episode reward: [(0, '-58.477')] [2022-07-09 02:33:24,687][26022] Updated weights on worker 0-0, policy_version 49066 (0.00088) [2022-07-09 02:33:26,587][26022] Updated weights on worker 0-0, policy_version 49076 (0.00089) [2022-07-09 02:33:28,255][25689] Fps is (10 sec: 5668.4, 60 sec: 5695.4, 300 sec: 5697.1). Total num frames: 50263040. Throughput: 0: 5953.2. Samples: 50266916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:28,256][25689] Avg episode reward: [(0, '-58.274')] [2022-07-09 02:33:28,310][26022] Updated weights on worker 0-0, policy_version 49086 (0.00094) [2022-07-09 02:33:30,216][26022] Updated weights on worker 0-0, policy_version 49096 (0.00089) [2022-07-09 02:33:31,862][26022] Updated weights on worker 0-0, policy_version 49106 (0.00087) [2022-07-09 02:33:33,285][25689] Fps is (10 sec: 5699.9, 60 sec: 5710.8, 300 sec: 5694.0). Total num frames: 50291712. Throughput: 0: 5958.5. Samples: 50301268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:33,285][25689] Avg episode reward: [(0, '-57.012')] [2022-07-09 02:33:33,689][26022] Updated weights on worker 0-0, policy_version 49116 (0.00087) [2022-07-09 02:33:35,431][26022] Updated weights on worker 0-0, policy_version 49126 (0.00087) [2022-07-09 02:33:37,199][26022] Updated weights on worker 0-0, policy_version 49136 (0.00088) [2022-07-09 02:33:38,365][25689] Fps is (10 sec: 5570.7, 60 sec: 5643.9, 300 sec: 5696.3). Total num frames: 50319360. Throughput: 0: 5953.6. Samples: 50318478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:38,366][25689] Avg episode reward: [(0, '-57.967')] [2022-07-09 02:33:39,139][26022] Updated weights on worker 0-0, policy_version 49146 (0.00086) [2022-07-09 02:33:40,816][26022] Updated weights on worker 0-0, policy_version 49156 (0.00099) [2022-07-09 02:33:42,783][26022] Updated weights on worker 0-0, policy_version 49166 (0.00090) [2022-07-09 02:33:43,372][25689] Fps is (10 sec: 5786.2, 60 sec: 5711.1, 300 sec: 5696.5). Total num frames: 50350080. Throughput: 0: 5946.1. Samples: 50352466. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:33:43,373][25689] Avg episode reward: [(0, '-58.925')] [2022-07-09 02:33:44,458][26022] Updated weights on worker 0-0, policy_version 49176 (0.00084) [2022-07-09 02:33:46,269][26022] Updated weights on worker 0-0, policy_version 49186 (0.00087) [2022-07-09 02:33:48,036][26022] Updated weights on worker 0-0, policy_version 49196 (0.00085) [2022-07-09 02:33:48,402][25689] Fps is (10 sec: 5815.4, 60 sec: 5710.3, 300 sec: 5696.2). Total num frames: 50377728. Throughput: 0: 5981.0. Samples: 50387414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:33:48,403][25689] Avg episode reward: [(0, '-58.293')] [2022-07-09 02:33:49,795][26022] Updated weights on worker 0-0, policy_version 49206 (0.00085) [2022-07-09 02:33:51,664][26022] Updated weights on worker 0-0, policy_version 49216 (0.00091) [2022-07-09 02:33:53,424][25689] Fps is (10 sec: 5602.8, 60 sec: 5675.0, 300 sec: 5696.7). Total num frames: 50406400. Throughput: 0: 5112.3. Samples: 50404228. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:33:53,425][25689] Avg episode reward: [(0, '-58.480')] [2022-07-09 02:33:53,434][26022] Updated weights on worker 0-0, policy_version 49226 (0.00088) [2022-07-09 02:33:55,083][26022] Updated weights on worker 0-0, policy_version 49236 (0.00081) [2022-07-09 02:33:57,320][26022] Updated weights on worker 0-0, policy_version 49246 (0.00094) [2022-07-09 02:33:58,553][25689] Fps is (10 sec: 5851.2, 60 sec: 5708.5, 300 sec: 5701.4). Total num frames: 50437120. Throughput: 0: 5944.8. Samples: 50438486. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:33:58,553][25689] Avg episode reward: [(0, '-57.270')] [2022-07-09 02:33:58,618][26022] Updated weights on worker 0-0, policy_version 49256 (0.00094) [2022-07-09 02:34:00,709][26022] Updated weights on worker 0-0, policy_version 49266 (0.00092) [2022-07-09 02:34:02,740][26022] Updated weights on worker 0-0, policy_version 49276 (0.00090) [2022-07-09 02:34:03,570][25689] Fps is (10 sec: 5551.2, 60 sec: 5661.8, 300 sec: 5701.6). Total num frames: 50462720. Throughput: 0: 5862.2. Samples: 50470870. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:03,570][25689] Avg episode reward: [(0, '-57.622')] [2022-07-09 02:34:04,509][26022] Updated weights on worker 0-0, policy_version 49286 (0.00091) [2022-07-09 02:34:06,467][26022] Updated weights on worker 0-0, policy_version 49296 (0.00091) [2022-07-09 02:34:07,940][26022] Updated weights on worker 0-0, policy_version 49306 (0.00088) [2022-07-09 02:34:08,572][25689] Fps is (10 sec: 5518.7, 60 sec: 5714.9, 300 sec: 5698.2). Total num frames: 50492416. Throughput: 0: 4995.1. Samples: 50488166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:08,575][25689] Avg episode reward: [(0, '-56.659')] [2022-07-09 02:34:10,051][26022] Updated weights on worker 0-0, policy_version 49316 (0.00090) [2022-07-09 02:34:11,479][26022] Updated weights on worker 0-0, policy_version 49326 (0.00090) [2022-07-09 02:34:13,602][25689] Fps is (10 sec: 5716.1, 60 sec: 5684.4, 300 sec: 5698.5). Total num frames: 50520064. Throughput: 0: 5869.7. Samples: 50522664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:13,603][25689] Avg episode reward: [(0, '-56.072')] [2022-07-09 02:34:13,604][26022] Updated weights on worker 0-0, policy_version 49336 (0.00082) [2022-07-09 02:34:15,172][26022] Updated weights on worker 0-0, policy_version 49346 (0.00093) [2022-07-09 02:34:17,209][26022] Updated weights on worker 0-0, policy_version 49356 (0.00096) [2022-07-09 02:34:18,750][25689] Fps is (10 sec: 5634.1, 60 sec: 5676.6, 300 sec: 5696.1). Total num frames: 50549760. Throughput: 0: 5865.8. Samples: 50556962. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:18,751][25689] Avg episode reward: [(0, '-56.332')] [2022-07-09 02:34:18,833][26022] Updated weights on worker 0-0, policy_version 49366 (0.00080) [2022-07-09 02:34:19,236][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:34:19,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000049368_50552832.pth [2022-07-09 02:34:19,254][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000047364_48500736.pth [2022-07-09 02:34:20,792][26022] Updated weights on worker 0-0, policy_version 49376 (0.00090) [2022-07-09 02:34:22,331][26022] Updated weights on worker 0-0, policy_version 49386 (0.00091) [2022-07-09 02:34:23,757][25689] Fps is (10 sec: 5747.9, 60 sec: 5684.0, 300 sec: 5692.7). Total num frames: 50578432. Throughput: 0: 5124.5. Samples: 50574316. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:23,757][25689] Avg episode reward: [(0, '-56.273')] [2022-07-09 02:34:24,238][26022] Updated weights on worker 0-0, policy_version 49396 (0.00119) [2022-07-09 02:34:26,229][26022] Updated weights on worker 0-0, policy_version 49406 (0.00080) [2022-07-09 02:34:27,761][26022] Updated weights on worker 0-0, policy_version 49416 (0.00081) [2022-07-09 02:34:28,780][25689] Fps is (10 sec: 5717.2, 60 sec: 5684.7, 300 sec: 5696.2). Total num frames: 50607104. Throughput: 0: 5952.0. Samples: 50608444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:28,781][25689] Avg episode reward: [(0, '-57.581')] [2022-07-09 02:34:29,580][26022] Updated weights on worker 0-0, policy_version 49426 (0.00094) [2022-07-09 02:34:31,395][26022] Updated weights on worker 0-0, policy_version 49436 (0.00094) [2022-07-09 02:34:33,282][26022] Updated weights on worker 0-0, policy_version 49446 (0.00084) [2022-07-09 02:34:33,802][25689] Fps is (10 sec: 5810.6, 60 sec: 5702.3, 300 sec: 5700.2). Total num frames: 50636800. Throughput: 0: 5952.3. Samples: 50642900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:33,802][25689] Avg episode reward: [(0, '-57.620')] [2022-07-09 02:34:35,099][26022] Updated weights on worker 0-0, policy_version 49456 (0.00087) [2022-07-09 02:34:36,840][26022] Updated weights on worker 0-0, policy_version 49466 (0.00088) [2022-07-09 02:34:38,505][26022] Updated weights on worker 0-0, policy_version 49476 (0.00095) [2022-07-09 02:34:38,902][25689] Fps is (10 sec: 5766.8, 60 sec: 5717.4, 300 sec: 5695.8). Total num frames: 50665472. Throughput: 0: 5121.2. Samples: 50660162. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:38,902][25689] Avg episode reward: [(0, '-58.448')] [2022-07-09 02:34:40,369][26022] Updated weights on worker 0-0, policy_version 49486 (0.00097) [2022-07-09 02:34:42,029][26022] Updated weights on worker 0-0, policy_version 49496 (0.00086) [2022-07-09 02:34:43,881][26022] Updated weights on worker 0-0, policy_version 49506 (0.00081) [2022-07-09 02:34:43,916][25689] Fps is (10 sec: 5669.5, 60 sec: 5682.9, 300 sec: 5692.2). Total num frames: 50694144. Throughput: 0: 5978.3. Samples: 50694836. Policy #0 lag: (min: 0.0, avg: 9.5, max: 18.0) [2022-07-09 02:34:43,917][25689] Avg episode reward: [(0, '-58.404')] [2022-07-09 02:34:45,640][26022] Updated weights on worker 0-0, policy_version 49516 (0.00087) [2022-07-09 02:34:47,384][26022] Updated weights on worker 0-0, policy_version 49526 (0.00086) [2022-07-09 02:34:48,928][25689] Fps is (10 sec: 5821.4, 60 sec: 5718.4, 300 sec: 5692.0). Total num frames: 50723840. Throughput: 0: 6008.1. Samples: 50729494. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:34:48,929][25689] Avg episode reward: [(0, '-57.967')] [2022-07-09 02:34:49,034][26022] Updated weights on worker 0-0, policy_version 49536 (0.00085) [2022-07-09 02:34:50,834][26022] Updated weights on worker 0-0, policy_version 49546 (0.00086) [2022-07-09 02:34:52,667][26022] Updated weights on worker 0-0, policy_version 49556 (0.00090) [2022-07-09 02:34:53,959][25689] Fps is (10 sec: 5811.8, 60 sec: 5717.6, 300 sec: 5700.1). Total num frames: 50752512. Throughput: 0: 5164.1. Samples: 50746994. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:34:53,960][25689] Avg episode reward: [(0, '-57.185')] [2022-07-09 02:34:54,523][26022] Updated weights on worker 0-0, policy_version 49566 (0.00089) [2022-07-09 02:34:56,424][26022] Updated weights on worker 0-0, policy_version 49576 (0.00087) [2022-07-09 02:34:57,934][26022] Updated weights on worker 0-0, policy_version 49586 (0.00088) [2022-07-09 02:34:59,087][25689] Fps is (10 sec: 5745.6, 60 sec: 5700.7, 300 sec: 5695.5). Total num frames: 50782208. Throughput: 0: 5992.4. Samples: 50781118. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:34:59,087][25689] Avg episode reward: [(0, '-56.498')] [2022-07-09 02:35:00,147][26022] Updated weights on worker 0-0, policy_version 49596 (0.00083) [2022-07-09 02:35:01,486][26022] Updated weights on worker 0-0, policy_version 49606 (0.00093) [2022-07-09 02:35:03,954][26022] Updated weights on worker 0-0, policy_version 49616 (0.00090) [2022-07-09 02:35:04,092][25689] Fps is (10 sec: 5356.0, 60 sec: 5684.9, 300 sec: 5692.2). Total num frames: 50806784. Throughput: 0: 5879.4. Samples: 50813458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:04,094][25689] Avg episode reward: [(0, '-57.205')] [2022-07-09 02:35:05,495][26022] Updated weights on worker 0-0, policy_version 49626 (0.00087) [2022-07-09 02:35:07,440][26022] Updated weights on worker 0-0, policy_version 49636 (0.00087) [2022-07-09 02:35:09,070][26022] Updated weights on worker 0-0, policy_version 49646 (0.00084) [2022-07-09 02:35:09,167][25689] Fps is (10 sec: 5485.4, 60 sec: 5695.0, 300 sec: 5687.4). Total num frames: 50837504. Throughput: 0: 5001.9. Samples: 50830728. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:09,169][25689] Avg episode reward: [(0, '-56.907')] [2022-07-09 02:35:10,965][26022] Updated weights on worker 0-0, policy_version 49656 (0.00092) [2022-07-09 02:35:12,875][26022] Updated weights on worker 0-0, policy_version 49666 (0.00089) [2022-07-09 02:35:14,238][25689] Fps is (10 sec: 5853.8, 60 sec: 5708.0, 300 sec: 5691.2). Total num frames: 50866176. Throughput: 0: 5841.9. Samples: 50865460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:14,238][25689] Avg episode reward: [(0, '-56.503')] [2022-07-09 02:35:14,588][26022] Updated weights on worker 0-0, policy_version 49676 (0.00087) [2022-07-09 02:35:16,272][26022] Updated weights on worker 0-0, policy_version 49686 (0.00085) [2022-07-09 02:35:17,920][26022] Updated weights on worker 0-0, policy_version 49696 (0.00087) [2022-07-09 02:35:19,267][25689] Fps is (10 sec: 5677.7, 60 sec: 5702.3, 300 sec: 5694.6). Total num frames: 50894848. Throughput: 0: 5906.8. Samples: 50900320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:19,269][25689] Avg episode reward: [(0, '-56.779')] [2022-07-09 02:35:19,874][26022] Updated weights on worker 0-0, policy_version 49706 (0.00084) [2022-07-09 02:35:21,471][26022] Updated weights on worker 0-0, policy_version 49716 (0.00105) [2022-07-09 02:35:23,331][26022] Updated weights on worker 0-0, policy_version 49726 (0.00086) [2022-07-09 02:35:24,311][25689] Fps is (10 sec: 5794.6, 60 sec: 5715.8, 300 sec: 5694.5). Total num frames: 50924544. Throughput: 0: 5157.4. Samples: 50917734. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:24,312][25689] Avg episode reward: [(0, '-57.780')] [2022-07-09 02:35:25,235][26022] Updated weights on worker 0-0, policy_version 49736 (0.00090) [2022-07-09 02:35:26,971][26022] Updated weights on worker 0-0, policy_version 49746 (0.00083) [2022-07-09 02:35:28,817][26022] Updated weights on worker 0-0, policy_version 49756 (0.00093) [2022-07-09 02:35:29,317][25689] Fps is (10 sec: 5807.7, 60 sec: 5717.3, 300 sec: 5694.7). Total num frames: 50953216. Throughput: 0: 6025.7. Samples: 50952146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:29,318][25689] Avg episode reward: [(0, '-59.048')] [2022-07-09 02:35:30,551][26022] Updated weights on worker 0-0, policy_version 49766 (0.00080) [2022-07-09 02:35:32,419][26022] Updated weights on worker 0-0, policy_version 49776 (0.00085) [2022-07-09 02:35:34,047][26022] Updated weights on worker 0-0, policy_version 49786 (0.00089) [2022-07-09 02:35:34,358][25689] Fps is (10 sec: 5707.2, 60 sec: 5698.6, 300 sec: 5695.4). Total num frames: 50981888. Throughput: 0: 6030.4. Samples: 50986794. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:34,359][25689] Avg episode reward: [(0, '-58.766')] [2022-07-09 02:35:35,776][26022] Updated weights on worker 0-0, policy_version 49796 (0.00085) [2022-07-09 02:35:37,704][26022] Updated weights on worker 0-0, policy_version 49806 (0.00088) [2022-07-09 02:35:39,230][26022] Updated weights on worker 0-0, policy_version 49816 (0.00090) [2022-07-09 02:35:39,413][25689] Fps is (10 sec: 5781.3, 60 sec: 5719.8, 300 sec: 5697.9). Total num frames: 51011584. Throughput: 0: 5150.2. Samples: 51004076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:39,414][25689] Avg episode reward: [(0, '-58.870')] [2022-07-09 02:35:41,351][26022] Updated weights on worker 0-0, policy_version 49826 (0.00515) [2022-07-09 02:35:42,999][26022] Updated weights on worker 0-0, policy_version 49836 (0.00090) [2022-07-09 02:35:44,417][25689] Fps is (10 sec: 5700.7, 60 sec: 5703.8, 300 sec: 5694.6). Total num frames: 51039232. Throughput: 0: 6006.9. Samples: 51038514. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 02:35:44,418][25689] Avg episode reward: [(0, '-59.011')] [2022-07-09 02:35:44,761][26022] Updated weights on worker 0-0, policy_version 49846 (0.00085) [2022-07-09 02:35:46,568][26022] Updated weights on worker 0-0, policy_version 49856 (0.00089) [2022-07-09 02:35:48,367][26022] Updated weights on worker 0-0, policy_version 49866 (0.00084) [2022-07-09 02:35:49,423][25689] Fps is (10 sec: 5729.1, 60 sec: 5704.4, 300 sec: 5694.6). Total num frames: 51068928. Throughput: 0: 6016.0. Samples: 51073100. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:35:49,423][25689] Avg episode reward: [(0, '-58.715')] [2022-07-09 02:35:50,152][26022] Updated weights on worker 0-0, policy_version 49876 (0.00615) [2022-07-09 02:35:51,907][26022] Updated weights on worker 0-0, policy_version 49886 (0.00083) [2022-07-09 02:35:53,621][26022] Updated weights on worker 0-0, policy_version 49896 (0.00089) [2022-07-09 02:35:54,433][25689] Fps is (10 sec: 5827.4, 60 sec: 5706.3, 300 sec: 5699.5). Total num frames: 51097600. Throughput: 0: 5162.7. Samples: 51090436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:35:54,434][25689] Avg episode reward: [(0, '-58.085')] [2022-07-09 02:35:55,656][26022] Updated weights on worker 0-0, policy_version 49906 (0.00099) [2022-07-09 02:35:57,343][26022] Updated weights on worker 0-0, policy_version 49916 (0.00085) [2022-07-09 02:35:59,033][26022] Updated weights on worker 0-0, policy_version 49926 (0.00090) [2022-07-09 02:35:59,468][25689] Fps is (10 sec: 5810.6, 60 sec: 5715.2, 300 sec: 5702.7). Total num frames: 51127296. Throughput: 0: 5996.1. Samples: 51124324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:35:59,468][25689] Avg episode reward: [(0, '-57.775')] [2022-07-09 02:36:00,868][26022] Updated weights on worker 0-0, policy_version 49936 (0.00092) [2022-07-09 02:36:03,122][26022] Updated weights on worker 0-0, policy_version 49946 (0.00084) [2022-07-09 02:36:04,484][25689] Fps is (10 sec: 5502.0, 60 sec: 5731.1, 300 sec: 5692.5). Total num frames: 51152896. Throughput: 0: 5891.3. Samples: 51156730. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:04,484][25689] Avg episode reward: [(0, '-59.051')] [2022-07-09 02:36:05,020][26022] Updated weights on worker 0-0, policy_version 49956 (0.00099) [2022-07-09 02:36:06,767][26022] Updated weights on worker 0-0, policy_version 49966 (0.00084) [2022-07-09 02:36:08,380][26022] Updated weights on worker 0-0, policy_version 49976 (0.00096) [2022-07-09 02:36:09,495][25689] Fps is (10 sec: 5208.1, 60 sec: 5669.3, 300 sec: 5696.0). Total num frames: 51179520. Throughput: 0: 5027.3. Samples: 51174010. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:09,496][25689] Avg episode reward: [(0, '-58.926')] [2022-07-09 02:36:10,145][26022] Updated weights on worker 0-0, policy_version 49986 (0.00082) [2022-07-09 02:36:11,976][26022] Updated weights on worker 0-0, policy_version 49996 (0.00081) [2022-07-09 02:36:13,760][26022] Updated weights on worker 0-0, policy_version 50006 (0.00081) [2022-07-09 02:36:14,504][25689] Fps is (10 sec: 5620.3, 60 sec: 5692.0, 300 sec: 5693.2). Total num frames: 51209216. Throughput: 0: 5885.6. Samples: 51208566. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:14,505][25689] Avg episode reward: [(0, '-58.864')] [2022-07-09 02:36:15,537][26022] Updated weights on worker 0-0, policy_version 50016 (0.00088) [2022-07-09 02:36:17,449][26022] Updated weights on worker 0-0, policy_version 50026 (0.00086) [2022-07-09 02:36:18,987][26022] Updated weights on worker 0-0, policy_version 50036 (0.00088) [2022-07-09 02:36:19,347][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:36:19,355][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000050038_51238912.pth [2022-07-09 02:36:19,356][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000048034_49186816.pth [2022-07-09 02:36:19,567][25689] Fps is (10 sec: 5998.7, 60 sec: 5722.9, 300 sec: 5695.8). Total num frames: 51239936. Throughput: 0: 5917.0. Samples: 51243250. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:19,567][25689] Avg episode reward: [(0, '-59.295')] [2022-07-09 02:36:21,107][26022] Updated weights on worker 0-0, policy_version 50046 (0.00082) [2022-07-09 02:36:22,500][26022] Updated weights on worker 0-0, policy_version 50056 (0.00081) [2022-07-09 02:36:24,546][26022] Updated weights on worker 0-0, policy_version 50066 (0.00087) [2022-07-09 02:36:24,574][25689] Fps is (10 sec: 5898.1, 60 sec: 5709.3, 300 sec: 5696.0). Total num frames: 51268608. Throughput: 0: 5187.2. Samples: 51260944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:24,575][25689] Avg episode reward: [(0, '-59.228')] [2022-07-09 02:36:26,105][26022] Updated weights on worker 0-0, policy_version 50076 (0.00093) [2022-07-09 02:36:27,971][26022] Updated weights on worker 0-0, policy_version 50086 (0.00080) [2022-07-09 02:36:29,579][25689] Fps is (10 sec: 5727.5, 60 sec: 5709.5, 300 sec: 5696.2). Total num frames: 51297280. Throughput: 0: 6050.7. Samples: 51295530. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:29,581][25689] Avg episode reward: [(0, '-58.249')] [2022-07-09 02:36:29,768][26022] Updated weights on worker 0-0, policy_version 50096 (0.00087) [2022-07-09 02:36:31,375][26022] Updated weights on worker 0-0, policy_version 50106 (0.00091) [2022-07-09 02:36:33,175][26022] Updated weights on worker 0-0, policy_version 50116 (0.00090) [2022-07-09 02:36:34,588][25689] Fps is (10 sec: 5726.3, 60 sec: 5712.5, 300 sec: 5693.5). Total num frames: 51325952. Throughput: 0: 6067.3. Samples: 51330422. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:34,590][25689] Avg episode reward: [(0, '-58.420')] [2022-07-09 02:36:35,058][26022] Updated weights on worker 0-0, policy_version 50126 (0.00089) [2022-07-09 02:36:36,637][26022] Updated weights on worker 0-0, policy_version 50136 (0.00094) [2022-07-09 02:36:38,637][26022] Updated weights on worker 0-0, policy_version 50146 (0.00084) [2022-07-09 02:36:39,637][25689] Fps is (10 sec: 5803.2, 60 sec: 5713.2, 300 sec: 5699.7). Total num frames: 51355648. Throughput: 0: 5203.3. Samples: 51347680. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:39,637][25689] Avg episode reward: [(0, '-58.248')] [2022-07-09 02:36:40,193][26022] Updated weights on worker 0-0, policy_version 50156 (0.00090) [2022-07-09 02:36:42,088][26022] Updated weights on worker 0-0, policy_version 50166 (0.00111) [2022-07-09 02:36:43,990][26022] Updated weights on worker 0-0, policy_version 50176 (0.00085) [2022-07-09 02:36:44,664][25689] Fps is (10 sec: 5691.3, 60 sec: 5710.9, 300 sec: 5689.0). Total num frames: 51383296. Throughput: 0: 6037.2. Samples: 51382230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:44,666][25689] Avg episode reward: [(0, '-58.535')] [2022-07-09 02:36:45,648][26022] Updated weights on worker 0-0, policy_version 50186 (0.00079) [2022-07-09 02:36:47,564][26022] Updated weights on worker 0-0, policy_version 50196 (0.00088) [2022-07-09 02:36:49,360][26022] Updated weights on worker 0-0, policy_version 50206 (0.00087) [2022-07-09 02:36:49,667][25689] Fps is (10 sec: 5717.2, 60 sec: 5711.2, 300 sec: 5703.2). Total num frames: 51412992. Throughput: 0: 6044.8. Samples: 51416958. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:36:49,667][25689] Avg episode reward: [(0, '-57.816')] [2022-07-09 02:36:50,983][26022] Updated weights on worker 0-0, policy_version 50216 (0.00450) [2022-07-09 02:36:52,763][26022] Updated weights on worker 0-0, policy_version 50226 (0.00083) [2022-07-09 02:36:54,540][26022] Updated weights on worker 0-0, policy_version 50236 (0.00090) [2022-07-09 02:36:54,686][25689] Fps is (10 sec: 5824.0, 60 sec: 5710.4, 300 sec: 5697.2). Total num frames: 51441664. Throughput: 0: 5177.7. Samples: 51434482. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:36:54,688][25689] Avg episode reward: [(0, '-57.651')] [2022-07-09 02:36:56,405][26022] Updated weights on worker 0-0, policy_version 50246 (0.00092) [2022-07-09 02:36:58,224][26022] Updated weights on worker 0-0, policy_version 50256 (0.00055) [2022-07-09 02:36:59,724][26022] Updated weights on worker 0-0, policy_version 50266 (0.00081) [2022-07-09 02:36:59,823][25689] Fps is (10 sec: 5847.5, 60 sec: 5717.6, 300 sec: 5708.4). Total num frames: 51472384. Throughput: 0: 6008.3. Samples: 51468968. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:36:59,824][25689] Avg episode reward: [(0, '-57.056')] [2022-07-09 02:37:01,772][26022] Updated weights on worker 0-0, policy_version 50276 (0.00100) [2022-07-09 02:37:03,896][26022] Updated weights on worker 0-0, policy_version 50286 (0.00096) [2022-07-09 02:37:04,874][25689] Fps is (10 sec: 5528.2, 60 sec: 5714.3, 300 sec: 5700.6). Total num frames: 51497984. Throughput: 0: 5910.0. Samples: 51501670. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:04,876][25689] Avg episode reward: [(0, '-56.765')] [2022-07-09 02:37:05,515][26022] Updated weights on worker 0-0, policy_version 50296 (0.00086) [2022-07-09 02:37:07,487][26022] Updated weights on worker 0-0, policy_version 50306 (0.00092) [2022-07-09 02:37:09,082][26022] Updated weights on worker 0-0, policy_version 50316 (0.00081) [2022-07-09 02:37:09,907][25689] Fps is (10 sec: 5484.0, 60 sec: 5763.2, 300 sec: 5704.0). Total num frames: 51527680. Throughput: 0: 5905.8. Samples: 51536492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:09,907][25689] Avg episode reward: [(0, '-56.485')] [2022-07-09 02:37:10,931][26022] Updated weights on worker 0-0, policy_version 50326 (0.00095) [2022-07-09 02:37:12,631][26022] Updated weights on worker 0-0, policy_version 50336 (0.00056) [2022-07-09 02:37:14,453][26022] Updated weights on worker 0-0, policy_version 50346 (0.00091) [2022-07-09 02:37:14,911][25689] Fps is (10 sec: 5917.2, 60 sec: 5763.6, 300 sec: 5704.7). Total num frames: 51557376. Throughput: 0: 5900.9. Samples: 51553828. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:14,912][25689] Avg episode reward: [(0, '-55.773')] [2022-07-09 02:37:16,301][26022] Updated weights on worker 0-0, policy_version 50356 (0.00088) [2022-07-09 02:37:18,051][26022] Updated weights on worker 0-0, policy_version 50366 (0.00084) [2022-07-09 02:37:19,788][26022] Updated weights on worker 0-0, policy_version 50376 (0.00098) [2022-07-09 02:37:20,024][25689] Fps is (10 sec: 5667.6, 60 sec: 5707.9, 300 sec: 5702.6). Total num frames: 51585024. Throughput: 0: 5892.0. Samples: 51587992. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:20,026][25689] Avg episode reward: [(0, '-55.683')] [2022-07-09 02:37:21,657][26022] Updated weights on worker 0-0, policy_version 50386 (0.00094) [2022-07-09 02:37:23,482][26022] Updated weights on worker 0-0, policy_version 50396 (0.00082) [2022-07-09 02:37:25,026][26022] Updated weights on worker 0-0, policy_version 50406 (0.00081) [2022-07-09 02:37:25,035][25689] Fps is (10 sec: 5765.5, 60 sec: 5741.5, 300 sec: 5709.4). Total num frames: 51615744. Throughput: 0: 6011.8. Samples: 51622872. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:25,035][25689] Avg episode reward: [(0, '-56.041')] [2022-07-09 02:37:27,043][26022] Updated weights on worker 0-0, policy_version 50416 (0.00099) [2022-07-09 02:37:28,687][26022] Updated weights on worker 0-0, policy_version 50426 (0.00932) [2022-07-09 02:37:30,038][25689] Fps is (10 sec: 5726.3, 60 sec: 5707.7, 300 sec: 5706.1). Total num frames: 51642368. Throughput: 0: 5135.1. Samples: 51639872. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:30,040][25689] Avg episode reward: [(0, '-56.251')] [2022-07-09 02:37:30,596][26022] Updated weights on worker 0-0, policy_version 50436 (0.00084) [2022-07-09 02:37:32,275][26022] Updated weights on worker 0-0, policy_version 50446 (0.00077) [2022-07-09 02:37:34,134][26022] Updated weights on worker 0-0, policy_version 50456 (0.00087) [2022-07-09 02:37:35,074][25689] Fps is (10 sec: 5711.8, 60 sec: 5739.1, 300 sec: 5703.7). Total num frames: 51673088. Throughput: 0: 5992.5. Samples: 51674656. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:35,076][25689] Avg episode reward: [(0, '-56.133')] [2022-07-09 02:37:35,729][26022] Updated weights on worker 0-0, policy_version 50466 (0.00083) [2022-07-09 02:37:37,738][26022] Updated weights on worker 0-0, policy_version 50476 (0.00089) [2022-07-09 02:37:39,351][26022] Updated weights on worker 0-0, policy_version 50486 (0.00088) [2022-07-09 02:37:40,195][25689] Fps is (10 sec: 5747.0, 60 sec: 5698.4, 300 sec: 5704.8). Total num frames: 51700736. Throughput: 0: 6011.9. Samples: 51709256. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:40,195][25689] Avg episode reward: [(0, '-56.154')] [2022-07-09 02:37:41,235][26022] Updated weights on worker 0-0, policy_version 50496 (0.00082) [2022-07-09 02:37:42,968][26022] Updated weights on worker 0-0, policy_version 50506 (0.00092) [2022-07-09 02:37:44,712][26022] Updated weights on worker 0-0, policy_version 50516 (0.00093) [2022-07-09 02:37:45,225][25689] Fps is (10 sec: 5750.2, 60 sec: 5748.9, 300 sec: 5715.0). Total num frames: 51731456. Throughput: 0: 5129.6. Samples: 51726440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:45,226][25689] Avg episode reward: [(0, '-56.305')] [2022-07-09 02:37:46,520][26022] Updated weights on worker 0-0, policy_version 50526 (0.00083) [2022-07-09 02:37:48,207][26022] Updated weights on worker 0-0, policy_version 50536 (0.00098) [2022-07-09 02:37:50,131][26022] Updated weights on worker 0-0, policy_version 50546 (0.00081) [2022-07-09 02:37:50,236][25689] Fps is (10 sec: 5914.6, 60 sec: 5731.2, 300 sec: 5708.0). Total num frames: 51760128. Throughput: 0: 5996.8. Samples: 51760996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 02:37:50,238][25689] Avg episode reward: [(0, '-56.885')] [2022-07-09 02:37:52,059][26022] Updated weights on worker 0-0, policy_version 50556 (0.00087) [2022-07-09 02:37:53,452][26022] Updated weights on worker 0-0, policy_version 50566 (0.00087) [2022-07-09 02:37:55,323][25689] Fps is (10 sec: 5678.7, 60 sec: 5724.8, 300 sec: 5708.8). Total num frames: 51788800. Throughput: 0: 5978.5. Samples: 51795712. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:37:55,324][25689] Avg episode reward: [(0, '-56.291')] [2022-07-09 02:37:55,574][26022] Updated weights on worker 0-0, policy_version 50576 (0.00089) [2022-07-09 02:37:57,091][26022] Updated weights on worker 0-0, policy_version 50586 (0.00906) [2022-07-09 02:37:59,126][26022] Updated weights on worker 0-0, policy_version 50596 (0.00083) [2022-07-09 02:38:00,400][25689] Fps is (10 sec: 5642.3, 60 sec: 5696.7, 300 sec: 5708.4). Total num frames: 51817472. Throughput: 0: 5112.3. Samples: 51812550. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:00,400][25689] Avg episode reward: [(0, '-57.348')] [2022-07-09 02:38:00,682][26022] Updated weights on worker 0-0, policy_version 50606 (0.00092) [2022-07-09 02:38:02,901][26022] Updated weights on worker 0-0, policy_version 50616 (0.00091) [2022-07-09 02:38:04,607][26022] Updated weights on worker 0-0, policy_version 50626 (0.00088) [2022-07-09 02:38:05,493][25689] Fps is (10 sec: 5437.3, 60 sec: 5709.6, 300 sec: 5707.2). Total num frames: 51844096. Throughput: 0: 5860.0. Samples: 51845210. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:05,493][25689] Avg episode reward: [(0, '-57.807')] [2022-07-09 02:38:06,372][26022] Updated weights on worker 0-0, policy_version 50636 (0.00087) [2022-07-09 02:38:08,326][26022] Updated weights on worker 0-0, policy_version 50646 (0.00090) [2022-07-09 02:38:10,003][26022] Updated weights on worker 0-0, policy_version 50656 (0.00090) [2022-07-09 02:38:10,539][25689] Fps is (10 sec: 5453.5, 60 sec: 5691.5, 300 sec: 5704.1). Total num frames: 51872768. Throughput: 0: 5855.8. Samples: 51879884. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:10,540][25689] Avg episode reward: [(0, '-57.117')] [2022-07-09 02:38:11,635][26022] Updated weights on worker 0-0, policy_version 50666 (0.00087) [2022-07-09 02:38:13,538][26022] Updated weights on worker 0-0, policy_version 50676 (0.00085) [2022-07-09 02:38:15,144][26022] Updated weights on worker 0-0, policy_version 50686 (0.00078) [2022-07-09 02:38:15,618][25689] Fps is (10 sec: 5967.0, 60 sec: 5718.3, 300 sec: 5710.7). Total num frames: 51904512. Throughput: 0: 5006.5. Samples: 51897318. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:15,618][25689] Avg episode reward: [(0, '-57.200')] [2022-07-09 02:38:17,147][26022] Updated weights on worker 0-0, policy_version 50696 (0.00087) [2022-07-09 02:38:18,679][26022] Updated weights on worker 0-0, policy_version 50706 (0.00089) [2022-07-09 02:38:19,429][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:38:19,442][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000050709_51926016.pth [2022-07-09 02:38:19,442][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000048701_49869824.pth [2022-07-09 02:38:20,689][25689] Fps is (10 sec: 5952.1, 60 sec: 5739.0, 300 sec: 5711.0). Total num frames: 51933184. Throughput: 0: 5893.0. Samples: 51932118. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:20,690][25689] Avg episode reward: [(0, '-57.853')] [2022-07-09 02:38:20,692][26022] Updated weights on worker 0-0, policy_version 50716 (0.00087) [2022-07-09 02:38:22,407][26022] Updated weights on worker 0-0, policy_version 50726 (0.00089) [2022-07-09 02:38:24,112][26022] Updated weights on worker 0-0, policy_version 50736 (0.00090) [2022-07-09 02:38:25,721][25689] Fps is (10 sec: 5675.8, 60 sec: 5703.3, 300 sec: 5711.0). Total num frames: 51961856. Throughput: 0: 6014.9. Samples: 51966882. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:25,721][25689] Avg episode reward: [(0, '-58.075')] [2022-07-09 02:38:25,937][26022] Updated weights on worker 0-0, policy_version 50746 (0.00081) [2022-07-09 02:38:27,699][26022] Updated weights on worker 0-0, policy_version 50756 (0.00095) [2022-07-09 02:38:29,472][26022] Updated weights on worker 0-0, policy_version 50766 (0.00091) [2022-07-09 02:38:30,729][25689] Fps is (10 sec: 5711.7, 60 sec: 5736.6, 300 sec: 5711.4). Total num frames: 51990528. Throughput: 0: 5156.5. Samples: 51983994. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:30,730][25689] Avg episode reward: [(0, '-57.782')] [2022-07-09 02:38:31,492][26022] Updated weights on worker 0-0, policy_version 50776 (0.00092) [2022-07-09 02:38:33,030][26022] Updated weights on worker 0-0, policy_version 50786 (0.00100) [2022-07-09 02:38:34,919][26022] Updated weights on worker 0-0, policy_version 50796 (0.00083) [2022-07-09 02:38:35,786][25689] Fps is (10 sec: 5799.0, 60 sec: 5717.8, 300 sec: 5718.7). Total num frames: 52020224. Throughput: 0: 6006.6. Samples: 52018462. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:35,787][25689] Avg episode reward: [(0, '-57.094')] [2022-07-09 02:38:36,612][26022] Updated weights on worker 0-0, policy_version 50806 (0.00081) [2022-07-09 02:38:38,501][26022] Updated weights on worker 0-0, policy_version 50816 (0.00088) [2022-07-09 02:38:40,256][26022] Updated weights on worker 0-0, policy_version 50826 (0.00088) [2022-07-09 02:38:40,885][25689] Fps is (10 sec: 5747.3, 60 sec: 5736.7, 300 sec: 5710.1). Total num frames: 52048896. Throughput: 0: 5972.9. Samples: 52052744. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:40,886][25689] Avg episode reward: [(0, '-58.212')] [2022-07-09 02:38:42,075][26022] Updated weights on worker 0-0, policy_version 50836 (0.00085) [2022-07-09 02:38:43,747][26022] Updated weights on worker 0-0, policy_version 50846 (0.00080) [2022-07-09 02:38:45,713][26022] Updated weights on worker 0-0, policy_version 50856 (0.00095) [2022-07-09 02:38:45,903][25689] Fps is (10 sec: 5668.4, 60 sec: 5704.1, 300 sec: 5713.7). Total num frames: 52077568. Throughput: 0: 5963.1. Samples: 52087226. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:45,903][25689] Avg episode reward: [(0, '-59.004')] [2022-07-09 02:38:47,519][26022] Updated weights on worker 0-0, policy_version 50866 (0.00086) [2022-07-09 02:38:49,157][26022] Updated weights on worker 0-0, policy_version 50876 (0.00084) [2022-07-09 02:38:50,905][25689] Fps is (10 sec: 5722.7, 60 sec: 5704.9, 300 sec: 5714.1). Total num frames: 52106240. Throughput: 0: 5977.7. Samples: 52104600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:50,906][25689] Avg episode reward: [(0, '-59.209')] [2022-07-09 02:38:51,146][26022] Updated weights on worker 0-0, policy_version 50886 (0.00094) [2022-07-09 02:38:52,675][26022] Updated weights on worker 0-0, policy_version 50896 (0.00086) [2022-07-09 02:38:54,599][26022] Updated weights on worker 0-0, policy_version 50906 (0.00087) [2022-07-09 02:38:55,911][25689] Fps is (10 sec: 5729.5, 60 sec: 5712.5, 300 sec: 5709.6). Total num frames: 52134912. Throughput: 0: 5978.8. Samples: 52138784. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-09 02:38:55,912][25689] Avg episode reward: [(0, '-58.919')] [2022-07-09 02:38:56,390][26022] Updated weights on worker 0-0, policy_version 50916 (0.00092) [2022-07-09 02:38:58,190][26022] Updated weights on worker 0-0, policy_version 50926 (0.00089) [2022-07-09 02:38:59,995][26022] Updated weights on worker 0-0, policy_version 50936 (0.00088) [2022-07-09 02:39:01,022][25689] Fps is (10 sec: 5668.0, 60 sec: 5709.3, 300 sec: 5718.1). Total num frames: 52163584. Throughput: 0: 5972.9. Samples: 52173024. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:01,023][25689] Avg episode reward: [(0, '-58.412')] [2022-07-09 02:39:01,975][26022] Updated weights on worker 0-0, policy_version 50946 (0.00091) [2022-07-09 02:39:04,072][26022] Updated weights on worker 0-0, policy_version 50956 (0.00088) [2022-07-09 02:39:05,604][26022] Updated weights on worker 0-0, policy_version 50966 (0.00090) [2022-07-09 02:39:06,085][25689] Fps is (10 sec: 5535.8, 60 sec: 5729.1, 300 sec: 5710.1). Total num frames: 52191232. Throughput: 0: 5000.5. Samples: 52188148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:06,085][25689] Avg episode reward: [(0, '-58.859')] [2022-07-09 02:39:07,699][26022] Updated weights on worker 0-0, policy_version 50976 (0.00087) [2022-07-09 02:39:09,150][26022] Updated weights on worker 0-0, policy_version 50986 (0.00091) [2022-07-09 02:39:11,089][25689] Fps is (10 sec: 5594.6, 60 sec: 5733.0, 300 sec: 5714.0). Total num frames: 52219904. Throughput: 0: 5847.5. Samples: 52222626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:11,090][25689] Avg episode reward: [(0, '-58.441')] [2022-07-09 02:39:11,093][26022] Updated weights on worker 0-0, policy_version 50996 (0.00087) [2022-07-09 02:39:12,742][26022] Updated weights on worker 0-0, policy_version 51006 (0.00094) [2022-07-09 02:39:14,706][26022] Updated weights on worker 0-0, policy_version 51016 (0.00055) [2022-07-09 02:39:16,101][25689] Fps is (10 sec: 5827.4, 60 sec: 5705.5, 300 sec: 5716.6). Total num frames: 52249600. Throughput: 0: 5861.7. Samples: 52257130. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:16,101][25689] Avg episode reward: [(0, '-57.916')] [2022-07-09 02:39:16,319][26022] Updated weights on worker 0-0, policy_version 51026 (0.00087) [2022-07-09 02:39:18,236][26022] Updated weights on worker 0-0, policy_version 51036 (0.00084) [2022-07-09 02:39:19,970][26022] Updated weights on worker 0-0, policy_version 51046 (0.00085) [2022-07-09 02:39:21,185][25689] Fps is (10 sec: 5679.9, 60 sec: 5687.4, 300 sec: 5711.7). Total num frames: 52277248. Throughput: 0: 5028.3. Samples: 52274408. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:21,187][25689] Avg episode reward: [(0, '-57.839')] [2022-07-09 02:39:21,827][26022] Updated weights on worker 0-0, policy_version 51056 (0.00086) [2022-07-09 02:39:23,574][26022] Updated weights on worker 0-0, policy_version 51066 (0.00086) [2022-07-09 02:39:25,354][26022] Updated weights on worker 0-0, policy_version 51076 (0.00083) [2022-07-09 02:39:26,206][25689] Fps is (10 sec: 5472.3, 60 sec: 5671.5, 300 sec: 5708.3). Total num frames: 52304896. Throughput: 0: 6004.5. Samples: 52308962. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:26,207][25689] Avg episode reward: [(0, '-58.699')] [2022-07-09 02:39:27,178][26022] Updated weights on worker 0-0, policy_version 51086 (0.00086) [2022-07-09 02:39:29,028][26022] Updated weights on worker 0-0, policy_version 51096 (0.00097) [2022-07-09 02:39:30,693][26022] Updated weights on worker 0-0, policy_version 51106 (0.00088) [2022-07-09 02:39:31,219][25689] Fps is (10 sec: 5817.2, 60 sec: 5704.9, 300 sec: 5711.9). Total num frames: 52335616. Throughput: 0: 5998.3. Samples: 52343370. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:31,219][25689] Avg episode reward: [(0, '-59.233')] [2022-07-09 02:39:32,587][26022] Updated weights on worker 0-0, policy_version 51116 (0.00088) [2022-07-09 02:39:34,272][26022] Updated weights on worker 0-0, policy_version 51126 (0.00088) [2022-07-09 02:39:35,975][26022] Updated weights on worker 0-0, policy_version 51136 (0.00081) [2022-07-09 02:39:36,222][25689] Fps is (10 sec: 5827.0, 60 sec: 5676.1, 300 sec: 5710.3). Total num frames: 52363264. Throughput: 0: 5153.3. Samples: 52360822. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:36,224][25689] Avg episode reward: [(0, '-59.556')] [2022-07-09 02:39:37,845][26022] Updated weights on worker 0-0, policy_version 51146 (0.00089) [2022-07-09 02:39:39,663][26022] Updated weights on worker 0-0, policy_version 51156 (0.00085) [2022-07-09 02:39:41,360][25689] Fps is (10 sec: 5654.5, 60 sec: 5689.3, 300 sec: 5711.4). Total num frames: 52392960. Throughput: 0: 5993.6. Samples: 52395328. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:41,362][25689] Avg episode reward: [(0, '-58.974')] [2022-07-09 02:39:41,431][26022] Updated weights on worker 0-0, policy_version 51166 (0.00082) [2022-07-09 02:39:43,216][26022] Updated weights on worker 0-0, policy_version 51176 (0.00050) [2022-07-09 02:39:44,842][26022] Updated weights on worker 0-0, policy_version 51186 (0.00079) [2022-07-09 02:39:46,364][25689] Fps is (10 sec: 5856.0, 60 sec: 5707.5, 300 sec: 5711.5). Total num frames: 52422656. Throughput: 0: 6000.2. Samples: 52429920. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:46,366][25689] Avg episode reward: [(0, '-58.915')] [2022-07-09 02:39:46,827][26022] Updated weights on worker 0-0, policy_version 51196 (0.00084) [2022-07-09 02:39:48,325][26022] Updated weights on worker 0-0, policy_version 51206 (0.00089) [2022-07-09 02:39:50,299][26022] Updated weights on worker 0-0, policy_version 51216 (0.00083) [2022-07-09 02:39:51,409][25689] Fps is (10 sec: 5808.6, 60 sec: 5703.6, 300 sec: 5711.3). Total num frames: 52451328. Throughput: 0: 5143.1. Samples: 52447208. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:51,411][25689] Avg episode reward: [(0, '-59.363')] [2022-07-09 02:39:51,946][26022] Updated weights on worker 0-0, policy_version 51226 (0.00087) [2022-07-09 02:39:53,969][26022] Updated weights on worker 0-0, policy_version 51236 (0.00083) [2022-07-09 02:39:55,380][26022] Updated weights on worker 0-0, policy_version 51246 (0.00091) [2022-07-09 02:39:56,423][25689] Fps is (10 sec: 5701.1, 60 sec: 5702.8, 300 sec: 5710.0). Total num frames: 52480000. Throughput: 0: 6001.0. Samples: 52482044. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:39:56,424][25689] Avg episode reward: [(0, '-58.665')] [2022-07-09 02:39:57,499][26022] Updated weights on worker 0-0, policy_version 51256 (0.00082) [2022-07-09 02:39:58,868][26022] Updated weights on worker 0-0, policy_version 51266 (0.00089) [2022-07-09 02:40:01,053][26022] Updated weights on worker 0-0, policy_version 51276 (0.00091) [2022-07-09 02:40:01,483][25689] Fps is (10 sec: 5793.9, 60 sec: 5724.6, 300 sec: 5726.2). Total num frames: 52509696. Throughput: 0: 6010.5. Samples: 52516274. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 02:40:01,483][25689] Avg episode reward: [(0, '-58.367')] [2022-07-09 02:40:03,167][26022] Updated weights on worker 0-0, policy_version 51286 (0.00086) [2022-07-09 02:40:04,956][26022] Updated weights on worker 0-0, policy_version 51296 (0.00054) [2022-07-09 02:40:06,495][25689] Fps is (10 sec: 5591.4, 60 sec: 5712.4, 300 sec: 5713.6). Total num frames: 52536320. Throughput: 0: 5040.2. Samples: 52531384. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:06,496][25689] Avg episode reward: [(0, '-58.031')] [2022-07-09 02:40:06,556][26022] Updated weights on worker 0-0, policy_version 51306 (0.00087) [2022-07-09 02:40:08,546][26022] Updated weights on worker 0-0, policy_version 51316 (0.00078) [2022-07-09 02:40:10,103][26022] Updated weights on worker 0-0, policy_version 51326 (0.00094) [2022-07-09 02:40:11,506][25689] Fps is (10 sec: 5516.7, 60 sec: 5711.8, 300 sec: 5714.8). Total num frames: 52564992. Throughput: 0: 5901.1. Samples: 52565806. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:11,508][25689] Avg episode reward: [(0, '-58.199')] [2022-07-09 02:40:12,139][26022] Updated weights on worker 0-0, policy_version 51336 (0.00085) [2022-07-09 02:40:13,677][26022] Updated weights on worker 0-0, policy_version 51346 (0.00096) [2022-07-09 02:40:15,728][26022] Updated weights on worker 0-0, policy_version 51356 (0.00083) [2022-07-09 02:40:16,531][25689] Fps is (10 sec: 5714.2, 60 sec: 5693.6, 300 sec: 5714.8). Total num frames: 52593664. Throughput: 0: 5897.0. Samples: 52600622. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:16,531][25689] Avg episode reward: [(0, '-58.123')] [2022-07-09 02:40:17,332][26022] Updated weights on worker 0-0, policy_version 51366 (0.00089) [2022-07-09 02:40:19,238][26022] Updated weights on worker 0-0, policy_version 51376 (0.00085) [2022-07-09 02:40:19,584][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:40:19,595][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000051378_52611072.pth [2022-07-09 02:40:19,596][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000049368_50552832.pth [2022-07-09 02:40:20,827][26022] Updated weights on worker 0-0, policy_version 51386 (0.00093) [2022-07-09 02:40:21,574][25689] Fps is (10 sec: 5797.4, 60 sec: 5731.4, 300 sec: 5714.8). Total num frames: 52623360. Throughput: 0: 5056.4. Samples: 52617864. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:21,575][25689] Avg episode reward: [(0, '-57.426')] [2022-07-09 02:40:22,612][26022] Updated weights on worker 0-0, policy_version 51396 (0.00081) [2022-07-09 02:40:24,388][26022] Updated weights on worker 0-0, policy_version 51406 (0.00087) [2022-07-09 02:40:26,314][26022] Updated weights on worker 0-0, policy_version 51416 (0.00093) [2022-07-09 02:40:26,579][25689] Fps is (10 sec: 5707.1, 60 sec: 5732.9, 300 sec: 5711.4). Total num frames: 52651008. Throughput: 0: 6053.8. Samples: 52652964. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:26,579][25689] Avg episode reward: [(0, '-58.116')] [2022-07-09 02:40:27,932][26022] Updated weights on worker 0-0, policy_version 51426 (0.00091) [2022-07-09 02:40:29,860][26022] Updated weights on worker 0-0, policy_version 51436 (0.00098) [2022-07-09 02:40:31,471][26022] Updated weights on worker 0-0, policy_version 51446 (0.00084) [2022-07-09 02:40:31,600][25689] Fps is (10 sec: 5719.7, 60 sec: 5715.2, 300 sec: 5715.3). Total num frames: 52680704. Throughput: 0: 6035.5. Samples: 52687082. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:31,600][25689] Avg episode reward: [(0, '-58.320')] [2022-07-09 02:40:33,530][26022] Updated weights on worker 0-0, policy_version 51456 (0.00089) [2022-07-09 02:40:35,031][26022] Updated weights on worker 0-0, policy_version 51466 (0.00087) [2022-07-09 02:40:36,622][25689] Fps is (10 sec: 5709.5, 60 sec: 5713.4, 300 sec: 5709.0). Total num frames: 52708352. Throughput: 0: 5167.2. Samples: 52704438. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:36,623][25689] Avg episode reward: [(0, '-58.932')] [2022-07-09 02:40:36,988][26022] Updated weights on worker 0-0, policy_version 51476 (0.00095) [2022-07-09 02:40:38,584][26022] Updated weights on worker 0-0, policy_version 51486 (0.00091) [2022-07-09 02:40:40,507][26022] Updated weights on worker 0-0, policy_version 51496 (0.00081) [2022-07-09 02:40:41,671][25689] Fps is (10 sec: 5694.1, 60 sec: 5721.9, 300 sec: 5715.0). Total num frames: 52738048. Throughput: 0: 6038.8. Samples: 52739222. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:41,671][25689] Avg episode reward: [(0, '-58.984')] [2022-07-09 02:40:42,227][26022] Updated weights on worker 0-0, policy_version 51506 (0.00084) [2022-07-09 02:40:43,836][26022] Updated weights on worker 0-0, policy_version 51516 (0.00081) [2022-07-09 02:40:45,679][26022] Updated weights on worker 0-0, policy_version 51526 (0.00092) [2022-07-09 02:40:46,694][25689] Fps is (10 sec: 5795.0, 60 sec: 5703.0, 300 sec: 5711.2). Total num frames: 52766720. Throughput: 0: 6022.8. Samples: 52774116. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:46,695][25689] Avg episode reward: [(0, '-58.920')] [2022-07-09 02:40:47,501][26022] Updated weights on worker 0-0, policy_version 51536 (0.00087) [2022-07-09 02:40:49,137][26022] Updated weights on worker 0-0, policy_version 51546 (0.00080) [2022-07-09 02:40:51,079][26022] Updated weights on worker 0-0, policy_version 51556 (0.00086) [2022-07-09 02:40:51,707][25689] Fps is (10 sec: 5815.4, 60 sec: 5723.0, 300 sec: 5714.6). Total num frames: 52796416. Throughput: 0: 5198.7. Samples: 52791614. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:51,708][25689] Avg episode reward: [(0, '-58.532')] [2022-07-09 02:40:52,718][26022] Updated weights on worker 0-0, policy_version 51566 (0.00051) [2022-07-09 02:40:54,615][26022] Updated weights on worker 0-0, policy_version 51576 (0.00083) [2022-07-09 02:40:56,405][26022] Updated weights on worker 0-0, policy_version 51586 (0.00089) [2022-07-09 02:40:56,715][25689] Fps is (10 sec: 5824.6, 60 sec: 5723.6, 300 sec: 5711.7). Total num frames: 52825088. Throughput: 0: 6062.5. Samples: 52826252. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:40:56,716][25689] Avg episode reward: [(0, '-58.115')] [2022-07-09 02:40:58,040][26022] Updated weights on worker 0-0, policy_version 51596 (0.00080) [2022-07-09 02:40:59,922][26022] Updated weights on worker 0-0, policy_version 51606 (0.00093) [2022-07-09 02:41:01,647][26022] Updated weights on worker 0-0, policy_version 51616 (0.00082) [2022-07-09 02:41:01,815][25689] Fps is (10 sec: 5774.6, 60 sec: 5719.8, 300 sec: 5723.9). Total num frames: 52854784. Throughput: 0: 6038.8. Samples: 52860870. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2022-07-09 02:41:01,816][25689] Avg episode reward: [(0, '-57.504')] [2022-07-09 02:41:04,037][26022] Updated weights on worker 0-0, policy_version 51626 (0.00091) [2022-07-09 02:41:05,714][26022] Updated weights on worker 0-0, policy_version 51636 (0.00092) [2022-07-09 02:41:06,835][25689] Fps is (10 sec: 5565.4, 60 sec: 5719.1, 300 sec: 5723.7). Total num frames: 52881408. Throughput: 0: 5032.8. Samples: 52875480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:06,836][25689] Avg episode reward: [(0, '-57.103')] [2022-07-09 02:41:07,795][26022] Updated weights on worker 0-0, policy_version 51646 (0.00086) [2022-07-09 02:41:09,287][26022] Updated weights on worker 0-0, policy_version 51656 (0.00088) [2022-07-09 02:41:11,223][26022] Updated weights on worker 0-0, policy_version 51666 (0.00050) [2022-07-09 02:41:11,853][25689] Fps is (10 sec: 5508.9, 60 sec: 5718.4, 300 sec: 5720.1). Total num frames: 52910080. Throughput: 0: 5861.7. Samples: 52909700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:11,853][25689] Avg episode reward: [(0, '-57.672')] [2022-07-09 02:41:12,801][26022] Updated weights on worker 0-0, policy_version 51676 (0.00088) [2022-07-09 02:41:14,724][26022] Updated weights on worker 0-0, policy_version 51686 (0.00083) [2022-07-09 02:41:16,221][26022] Updated weights on worker 0-0, policy_version 51696 (0.00081) [2022-07-09 02:41:16,856][25689] Fps is (10 sec: 5722.1, 60 sec: 5720.4, 300 sec: 5714.3). Total num frames: 52938752. Throughput: 0: 5871.3. Samples: 52944506. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:16,857][25689] Avg episode reward: [(0, '-57.877')] [2022-07-09 02:41:18,359][26022] Updated weights on worker 0-0, policy_version 51706 (0.00085) [2022-07-09 02:41:19,947][26022] Updated weights on worker 0-0, policy_version 51716 (0.00082) [2022-07-09 02:41:21,922][25689] Fps is (10 sec: 5695.2, 60 sec: 5701.4, 300 sec: 5713.2). Total num frames: 52967424. Throughput: 0: 5000.7. Samples: 52961416. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:21,924][25689] Avg episode reward: [(0, '-58.052')] [2022-07-09 02:41:21,927][26022] Updated weights on worker 0-0, policy_version 51726 (0.00086) [2022-07-09 02:41:23,517][26022] Updated weights on worker 0-0, policy_version 51736 (0.00095) [2022-07-09 02:41:25,500][26022] Updated weights on worker 0-0, policy_version 51746 (0.00085) [2022-07-09 02:41:26,970][25689] Fps is (10 sec: 5669.9, 60 sec: 5714.2, 300 sec: 5712.4). Total num frames: 52996096. Throughput: 0: 5976.1. Samples: 52995810. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:26,972][25689] Avg episode reward: [(0, '-58.332')] [2022-07-09 02:41:27,063][26022] Updated weights on worker 0-0, policy_version 51756 (0.00085) [2022-07-09 02:41:29,226][26022] Updated weights on worker 0-0, policy_version 51766 (0.00123) [2022-07-09 02:41:30,775][26022] Updated weights on worker 0-0, policy_version 51776 (0.00090) [2022-07-09 02:41:31,982][25689] Fps is (10 sec: 5598.3, 60 sec: 5681.2, 300 sec: 5708.9). Total num frames: 53023744. Throughput: 0: 5977.8. Samples: 53030028. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:31,982][25689] Avg episode reward: [(0, '-58.377')] [2022-07-09 02:41:32,786][26022] Updated weights on worker 0-0, policy_version 51786 (0.00090) [2022-07-09 02:41:34,263][26022] Updated weights on worker 0-0, policy_version 51796 (0.00084) [2022-07-09 02:41:36,335][26022] Updated weights on worker 0-0, policy_version 51806 (0.00079) [2022-07-09 02:41:37,073][25689] Fps is (10 sec: 5777.5, 60 sec: 5725.5, 300 sec: 5711.5). Total num frames: 53054464. Throughput: 0: 5077.8. Samples: 53047160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:37,073][25689] Avg episode reward: [(0, '-58.778')] [2022-07-09 02:41:38,164][26022] Updated weights on worker 0-0, policy_version 51816 (0.00084) [2022-07-09 02:41:39,721][26022] Updated weights on worker 0-0, policy_version 51826 (0.00084) [2022-07-09 02:41:41,749][26022] Updated weights on worker 0-0, policy_version 51836 (0.00082) [2022-07-09 02:41:42,125][25689] Fps is (10 sec: 5754.7, 60 sec: 5691.3, 300 sec: 5711.1). Total num frames: 53082112. Throughput: 0: 5948.4. Samples: 53081592. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:42,125][25689] Avg episode reward: [(0, '-58.728')] [2022-07-09 02:41:43,192][26022] Updated weights on worker 0-0, policy_version 51846 (0.00081) [2022-07-09 02:41:45,083][26022] Updated weights on worker 0-0, policy_version 51856 (0.00086) [2022-07-09 02:41:47,111][26022] Updated weights on worker 0-0, policy_version 51866 (0.00093) [2022-07-09 02:41:47,167][25689] Fps is (10 sec: 5579.4, 60 sec: 5689.6, 300 sec: 5706.9). Total num frames: 53110784. Throughput: 0: 5962.6. Samples: 53116236. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:47,168][25689] Avg episode reward: [(0, '-57.727')] [2022-07-09 02:41:48,595][26022] Updated weights on worker 0-0, policy_version 51876 (0.00087) [2022-07-09 02:41:50,566][26022] Updated weights on worker 0-0, policy_version 51886 (0.00091) [2022-07-09 02:41:52,058][26022] Updated weights on worker 0-0, policy_version 51896 (0.00088) [2022-07-09 02:41:52,183][25689] Fps is (10 sec: 5904.9, 60 sec: 5706.3, 300 sec: 5713.8). Total num frames: 53141504. Throughput: 0: 5126.4. Samples: 53133586. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:52,183][25689] Avg episode reward: [(0, '-57.917')] [2022-07-09 02:41:53,919][26022] Updated weights on worker 0-0, policy_version 51906 (0.00094) [2022-07-09 02:41:55,769][26022] Updated weights on worker 0-0, policy_version 51916 (0.00081) [2022-07-09 02:41:57,234][25689] Fps is (10 sec: 5798.3, 60 sec: 5685.3, 300 sec: 5705.1). Total num frames: 53169152. Throughput: 0: 6008.1. Samples: 53168288. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:41:57,234][25689] Avg episode reward: [(0, '-58.063')] [2022-07-09 02:41:57,651][26022] Updated weights on worker 0-0, policy_version 51926 (0.00086) [2022-07-09 02:41:59,224][26022] Updated weights on worker 0-0, policy_version 51936 (0.00074) [2022-07-09 02:42:01,160][26022] Updated weights on worker 0-0, policy_version 51946 (0.00088) [2022-07-09 02:42:02,273][25689] Fps is (10 sec: 5378.6, 60 sec: 5640.2, 300 sec: 5708.8). Total num frames: 53195776. Throughput: 0: 6018.0. Samples: 53202846. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:42:02,274][25689] Avg episode reward: [(0, '-57.627')] [2022-07-09 02:42:02,985][26022] Updated weights on worker 0-0, policy_version 51956 (0.00093) [2022-07-09 02:42:05,216][26022] Updated weights on worker 0-0, policy_version 51966 (0.00079) [2022-07-09 02:42:06,930][26022] Updated weights on worker 0-0, policy_version 51976 (0.00086) [2022-07-09 02:42:07,279][25689] Fps is (10 sec: 5504.5, 60 sec: 5675.3, 300 sec: 5705.9). Total num frames: 53224448. Throughput: 0: 5901.1. Samples: 53234920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 02:42:07,280][25689] Avg episode reward: [(0, '-58.257')] [2022-07-09 02:42:08,772][26022] Updated weights on worker 0-0, policy_version 51986 (0.00088) [2022-07-09 02:42:10,274][26022] Updated weights on worker 0-0, policy_version 51996 (0.00088) [2022-07-09 02:42:12,291][25689] Fps is (10 sec: 5724.3, 60 sec: 5675.9, 300 sec: 5702.3). Total num frames: 53253120. Throughput: 0: 5892.6. Samples: 53252076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:12,291][25689] Avg episode reward: [(0, '-58.768')] [2022-07-09 02:42:12,347][26022] Updated weights on worker 0-0, policy_version 52006 (0.00090) [2022-07-09 02:42:13,841][26022] Updated weights on worker 0-0, policy_version 52016 (0.00082) [2022-07-09 02:42:15,976][26022] Updated weights on worker 0-0, policy_version 52026 (0.00086) [2022-07-09 02:42:17,295][25689] Fps is (10 sec: 5929.6, 60 sec: 5709.7, 300 sec: 5714.7). Total num frames: 53283840. Throughput: 0: 5905.0. Samples: 53286752. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:17,296][25689] Avg episode reward: [(0, '-59.250')] [2022-07-09 02:42:17,396][26022] Updated weights on worker 0-0, policy_version 52036 (0.00090) [2022-07-09 02:42:19,491][26022] Updated weights on worker 0-0, policy_version 52046 (0.00091) [2022-07-09 02:42:19,764][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:42:19,773][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000052048_53297152.pth [2022-07-09 02:42:19,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000050038_51238912.pth [2022-07-09 02:42:20,839][26022] Updated weights on worker 0-0, policy_version 52056 (0.00096) [2022-07-09 02:42:22,392][25689] Fps is (10 sec: 5677.1, 60 sec: 5672.9, 300 sec: 5699.3). Total num frames: 53310464. Throughput: 0: 5889.6. Samples: 53321336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:22,392][25689] Avg episode reward: [(0, '-59.167')] [2022-07-09 02:42:22,882][26022] Updated weights on worker 0-0, policy_version 52066 (0.00086) [2022-07-09 02:42:24,767][26022] Updated weights on worker 0-0, policy_version 52076 (0.00090) [2022-07-09 02:42:26,417][26022] Updated weights on worker 0-0, policy_version 52086 (0.00091) [2022-07-09 02:42:27,395][25689] Fps is (10 sec: 5678.0, 60 sec: 5711.0, 300 sec: 5713.1). Total num frames: 53341184. Throughput: 0: 5168.9. Samples: 53338896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:27,395][25689] Avg episode reward: [(0, '-59.278')] [2022-07-09 02:42:28,378][26022] Updated weights on worker 0-0, policy_version 52096 (0.00088) [2022-07-09 02:42:29,947][26022] Updated weights on worker 0-0, policy_version 52106 (0.00101) [2022-07-09 02:42:31,888][26022] Updated weights on worker 0-0, policy_version 52116 (0.00095) [2022-07-09 02:42:32,407][25689] Fps is (10 sec: 5930.4, 60 sec: 5728.0, 300 sec: 5706.6). Total num frames: 53369856. Throughput: 0: 6033.5. Samples: 53373446. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:32,407][25689] Avg episode reward: [(0, '-59.777')] [2022-07-09 02:42:33,495][26022] Updated weights on worker 0-0, policy_version 52126 (0.00085) [2022-07-09 02:42:35,574][26022] Updated weights on worker 0-0, policy_version 52136 (0.00090) [2022-07-09 02:42:37,157][26022] Updated weights on worker 0-0, policy_version 52146 (0.00082) [2022-07-09 02:42:37,446][25689] Fps is (10 sec: 5705.4, 60 sec: 5699.0, 300 sec: 5711.6). Total num frames: 53398528. Throughput: 0: 5995.8. Samples: 53407570. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:37,446][25689] Avg episode reward: [(0, '-59.233')] [2022-07-09 02:42:39,115][26022] Updated weights on worker 0-0, policy_version 52156 (0.00095) [2022-07-09 02:42:40,716][26022] Updated weights on worker 0-0, policy_version 52166 (0.00086) [2022-07-09 02:42:42,561][25689] Fps is (10 sec: 5748.1, 60 sec: 5726.9, 300 sec: 5706.6). Total num frames: 53428224. Throughput: 0: 5116.4. Samples: 53424530. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:42,561][25689] Avg episode reward: [(0, '-59.227')] [2022-07-09 02:42:42,563][26022] Updated weights on worker 0-0, policy_version 52176 (0.00097) [2022-07-09 02:42:44,432][26022] Updated weights on worker 0-0, policy_version 52186 (0.00087) [2022-07-09 02:42:46,176][26022] Updated weights on worker 0-0, policy_version 52196 (0.00085) [2022-07-09 02:42:47,568][25689] Fps is (10 sec: 5766.1, 60 sec: 5730.2, 300 sec: 5706.6). Total num frames: 53456896. Throughput: 0: 5927.9. Samples: 53458482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:47,569][25689] Avg episode reward: [(0, '-58.787')] [2022-07-09 02:42:48,201][26022] Updated weights on worker 0-0, policy_version 52206 (0.00058) [2022-07-09 02:42:49,709][26022] Updated weights on worker 0-0, policy_version 52216 (0.00088) [2022-07-09 02:42:51,647][26022] Updated weights on worker 0-0, policy_version 52226 (0.00086) [2022-07-09 02:42:52,621][25689] Fps is (10 sec: 5699.9, 60 sec: 5692.8, 300 sec: 5707.3). Total num frames: 53485568. Throughput: 0: 5923.5. Samples: 53493188. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:52,622][25689] Avg episode reward: [(0, '-59.275')] [2022-07-09 02:42:53,313][26022] Updated weights on worker 0-0, policy_version 52236 (0.00084) [2022-07-09 02:42:55,082][26022] Updated weights on worker 0-0, policy_version 52246 (0.00087) [2022-07-09 02:42:56,955][26022] Updated weights on worker 0-0, policy_version 52256 (0.00091) [2022-07-09 02:42:57,625][25689] Fps is (10 sec: 5702.0, 60 sec: 5714.2, 300 sec: 5708.7). Total num frames: 53514240. Throughput: 0: 5110.5. Samples: 53510698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:42:57,626][25689] Avg episode reward: [(0, '-58.746')] [2022-07-09 02:42:58,694][26022] Updated weights on worker 0-0, policy_version 52266 (0.00086) [2022-07-09 02:43:00,379][26022] Updated weights on worker 0-0, policy_version 52276 (0.00082) [2022-07-09 02:43:02,671][25689] Fps is (10 sec: 5400.5, 60 sec: 5696.6, 300 sec: 5706.2). Total num frames: 53539840. Throughput: 0: 6008.4. Samples: 53545358. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:43:02,671][25689] Avg episode reward: [(0, '-57.690')] [2022-07-09 02:43:02,791][26022] Updated weights on worker 0-0, policy_version 52286 (0.00092) [2022-07-09 02:43:04,134][26022] Updated weights on worker 0-0, policy_version 52296 (0.00088) [2022-07-09 02:43:06,424][26022] Updated weights on worker 0-0, policy_version 52306 (0.00096) [2022-07-09 02:43:07,689][25689] Fps is (10 sec: 5596.4, 60 sec: 5729.4, 300 sec: 5713.6). Total num frames: 53570560. Throughput: 0: 5930.0. Samples: 53577796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:43:07,689][25689] Avg episode reward: [(0, '-57.279')] [2022-07-09 02:43:07,804][26022] Updated weights on worker 0-0, policy_version 52316 (0.00089) [2022-07-09 02:43:09,847][26022] Updated weights on worker 0-0, policy_version 52326 (0.00090) [2022-07-09 02:43:11,669][26022] Updated weights on worker 0-0, policy_version 52336 (0.00086) [2022-07-09 02:43:12,708][25689] Fps is (10 sec: 5815.0, 60 sec: 5711.7, 300 sec: 5700.9). Total num frames: 53598208. Throughput: 0: 5072.4. Samples: 53595076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 02:43:12,709][25689] Avg episode reward: [(0, '-57.026')] [2022-07-09 02:43:13,202][26022] Updated weights on worker 0-0, policy_version 52346 (0.00067) [2022-07-09 02:43:15,011][26022] Updated weights on worker 0-0, policy_version 52356 (0.00923) [2022-07-09 02:43:16,762][26022] Updated weights on worker 0-0, policy_version 52366 (0.00089) [2022-07-09 02:43:17,738][25689] Fps is (10 sec: 5604.2, 60 sec: 5675.4, 300 sec: 5701.7). Total num frames: 53626880. Throughput: 0: 5930.2. Samples: 53629974. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:17,740][25689] Avg episode reward: [(0, '-57.422')] [2022-07-09 02:43:18,575][26022] Updated weights on worker 0-0, policy_version 52376 (0.00081) [2022-07-09 02:43:20,343][26022] Updated weights on worker 0-0, policy_version 52386 (0.00079) [2022-07-09 02:43:21,999][26022] Updated weights on worker 0-0, policy_version 52396 (0.00086) [2022-07-09 02:43:22,845][25689] Fps is (10 sec: 5859.1, 60 sec: 5742.2, 300 sec: 5707.2). Total num frames: 53657600. Throughput: 0: 5912.0. Samples: 53664628. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:22,845][25689] Avg episode reward: [(0, '-57.254')] [2022-07-09 02:43:23,937][26022] Updated weights on worker 0-0, policy_version 52406 (0.00087) [2022-07-09 02:43:25,619][26022] Updated weights on worker 0-0, policy_version 52416 (0.00091) [2022-07-09 02:43:27,470][26022] Updated weights on worker 0-0, policy_version 52426 (0.00084) [2022-07-09 02:43:27,871][25689] Fps is (10 sec: 5861.6, 60 sec: 5706.2, 300 sec: 5706.9). Total num frames: 53686272. Throughput: 0: 5169.4. Samples: 53682124. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:27,871][25689] Avg episode reward: [(0, '-58.035')] [2022-07-09 02:43:29,314][26022] Updated weights on worker 0-0, policy_version 52436 (0.00092) [2022-07-09 02:43:30,958][26022] Updated weights on worker 0-0, policy_version 52446 (0.00088) [2022-07-09 02:43:32,776][26022] Updated weights on worker 0-0, policy_version 52456 (0.00097) [2022-07-09 02:43:32,874][25689] Fps is (10 sec: 5718.0, 60 sec: 5707.1, 300 sec: 5704.4). Total num frames: 53714944. Throughput: 0: 6021.8. Samples: 53716506. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:32,875][25689] Avg episode reward: [(0, '-58.396')] [2022-07-09 02:43:34,529][26022] Updated weights on worker 0-0, policy_version 52466 (0.00091) [2022-07-09 02:43:36,367][26022] Updated weights on worker 0-0, policy_version 52476 (0.00080) [2022-07-09 02:43:37,916][25689] Fps is (10 sec: 5708.7, 60 sec: 5706.7, 300 sec: 5705.5). Total num frames: 53743616. Throughput: 0: 6012.0. Samples: 53751280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:37,918][25689] Avg episode reward: [(0, '-58.385')] [2022-07-09 02:43:38,123][26022] Updated weights on worker 0-0, policy_version 52486 (0.00083) [2022-07-09 02:43:39,796][26022] Updated weights on worker 0-0, policy_version 52496 (0.00091) [2022-07-09 02:43:41,611][26022] Updated weights on worker 0-0, policy_version 52506 (0.00081) [2022-07-09 02:43:42,962][25689] Fps is (10 sec: 5785.4, 60 sec: 5713.2, 300 sec: 5708.4). Total num frames: 53773312. Throughput: 0: 5168.5. Samples: 53768606. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:42,964][25689] Avg episode reward: [(0, '-57.610')] [2022-07-09 02:43:43,351][26022] Updated weights on worker 0-0, policy_version 52516 (0.00091) [2022-07-09 02:43:45,171][26022] Updated weights on worker 0-0, policy_version 52526 (0.00089) [2022-07-09 02:43:47,146][26022] Updated weights on worker 0-0, policy_version 52536 (0.00070) [2022-07-09 02:43:47,965][25689] Fps is (10 sec: 5706.6, 60 sec: 5696.8, 300 sec: 5705.0). Total num frames: 53800960. Throughput: 0: 6025.1. Samples: 53803192. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:47,965][25689] Avg episode reward: [(0, '-57.086')] [2022-07-09 02:43:48,506][26022] Updated weights on worker 0-0, policy_version 52546 (0.00087) [2022-07-09 02:43:50,531][26022] Updated weights on worker 0-0, policy_version 52556 (0.00089) [2022-07-09 02:43:52,472][26022] Updated weights on worker 0-0, policy_version 52566 (0.00086) [2022-07-09 02:43:53,008][25689] Fps is (10 sec: 5606.5, 60 sec: 5697.7, 300 sec: 5704.3). Total num frames: 53829632. Throughput: 0: 6002.6. Samples: 53837364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:53,008][25689] Avg episode reward: [(0, '-57.244')] [2022-07-09 02:43:54,163][26022] Updated weights on worker 0-0, policy_version 52576 (0.00084) [2022-07-09 02:43:56,147][26022] Updated weights on worker 0-0, policy_version 52586 (0.00089) [2022-07-09 02:43:57,463][26022] Updated weights on worker 0-0, policy_version 52596 (0.00092) [2022-07-09 02:43:58,031][25689] Fps is (10 sec: 5900.1, 60 sec: 5729.8, 300 sec: 5712.9). Total num frames: 53860352. Throughput: 0: 5141.4. Samples: 53854700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:43:58,031][25689] Avg episode reward: [(0, '-57.893')] [2022-07-09 02:43:59,694][26022] Updated weights on worker 0-0, policy_version 52606 (0.00086) [2022-07-09 02:44:01,356][26022] Updated weights on worker 0-0, policy_version 52616 (0.00098) [2022-07-09 02:44:03,095][25689] Fps is (10 sec: 5684.7, 60 sec: 5745.0, 300 sec: 5709.4). Total num frames: 53886976. Throughput: 0: 5950.9. Samples: 53888414. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:44:03,096][25689] Avg episode reward: [(0, '-57.029')] [2022-07-09 02:44:03,411][26022] Updated weights on worker 0-0, policy_version 52626 (0.00088) [2022-07-09 02:44:05,316][26022] Updated weights on worker 0-0, policy_version 52636 (0.00091) [2022-07-09 02:44:06,987][26022] Updated weights on worker 0-0, policy_version 52646 (0.00091) [2022-07-09 02:44:08,162][25689] Fps is (10 sec: 5357.2, 60 sec: 5689.6, 300 sec: 5704.7). Total num frames: 53914624. Throughput: 0: 5850.7. Samples: 53921358. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:44:08,162][25689] Avg episode reward: [(0, '-57.733')] [2022-07-09 02:44:08,967][26022] Updated weights on worker 0-0, policy_version 52656 (0.00092) [2022-07-09 02:44:10,545][26022] Updated weights on worker 0-0, policy_version 52666 (0.00090) [2022-07-09 02:44:12,316][26022] Updated weights on worker 0-0, policy_version 52676 (0.00082) [2022-07-09 02:44:13,195][25689] Fps is (10 sec: 5779.3, 60 sec: 5739.1, 300 sec: 5707.8). Total num frames: 53945344. Throughput: 0: 5015.1. Samples: 53938604. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:44:13,195][25689] Avg episode reward: [(0, '-57.999')] [2022-07-09 02:44:14,209][26022] Updated weights on worker 0-0, policy_version 52686 (0.00075) [2022-07-09 02:44:16,063][26022] Updated weights on worker 0-0, policy_version 52696 (0.00092) [2022-07-09 02:44:17,461][26022] Updated weights on worker 0-0, policy_version 52706 (0.00085) [2022-07-09 02:44:18,231][25689] Fps is (10 sec: 5898.3, 60 sec: 5738.5, 300 sec: 5712.1). Total num frames: 53974016. Throughput: 0: 5876.8. Samples: 53973412. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 02:44:18,232][25689] Avg episode reward: [(0, '-57.237')] [2022-07-09 02:44:19,628][26022] Updated weights on worker 0-0, policy_version 52716 (0.00082) [2022-07-09 02:44:19,792][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:44:19,807][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000052718_53983232.pth [2022-07-09 02:44:19,807][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000050709_51926016.pth [2022-07-09 02:44:21,155][26022] Updated weights on worker 0-0, policy_version 52726 (0.00081) [2022-07-09 02:44:23,039][26022] Updated weights on worker 0-0, policy_version 52736 (0.00068) [2022-07-09 02:44:23,305][25689] Fps is (10 sec: 5773.3, 60 sec: 5724.7, 300 sec: 5718.0). Total num frames: 54003712. Throughput: 0: 5934.6. Samples: 54008348. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:23,305][25689] Avg episode reward: [(0, '-57.713')] [2022-07-09 02:44:24,969][26022] Updated weights on worker 0-0, policy_version 52746 (0.00085) [2022-07-09 02:44:26,476][26022] Updated weights on worker 0-0, policy_version 52756 (0.00085) [2022-07-09 02:44:28,323][25689] Fps is (10 sec: 5681.8, 60 sec: 5708.4, 300 sec: 5707.6). Total num frames: 54031360. Throughput: 0: 5172.0. Samples: 54025636. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:28,324][25689] Avg episode reward: [(0, '-57.858')] [2022-07-09 02:44:28,409][26022] Updated weights on worker 0-0, policy_version 52766 (0.00086) [2022-07-09 02:44:30,068][26022] Updated weights on worker 0-0, policy_version 52776 (0.00090) [2022-07-09 02:44:31,736][26022] Updated weights on worker 0-0, policy_version 52786 (0.00080) [2022-07-09 02:44:33,348][25689] Fps is (10 sec: 5709.5, 60 sec: 5723.3, 300 sec: 5714.0). Total num frames: 54061056. Throughput: 0: 6035.0. Samples: 54060230. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:33,350][25689] Avg episode reward: [(0, '-58.148')] [2022-07-09 02:44:33,709][26022] Updated weights on worker 0-0, policy_version 52796 (0.00088) [2022-07-09 02:44:35,336][26022] Updated weights on worker 0-0, policy_version 52806 (0.00093) [2022-07-09 02:44:37,192][26022] Updated weights on worker 0-0, policy_version 52816 (0.00097) [2022-07-09 02:44:38,405][25689] Fps is (10 sec: 5789.5, 60 sec: 5721.9, 300 sec: 5712.2). Total num frames: 54089728. Throughput: 0: 6016.0. Samples: 54094780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:38,407][25689] Avg episode reward: [(0, '-57.951')] [2022-07-09 02:44:38,989][26022] Updated weights on worker 0-0, policy_version 52826 (0.00084) [2022-07-09 02:44:40,831][26022] Updated weights on worker 0-0, policy_version 52836 (0.00086) [2022-07-09 02:44:42,515][26022] Updated weights on worker 0-0, policy_version 52846 (0.00094) [2022-07-09 02:44:43,537][25689] Fps is (10 sec: 5628.4, 60 sec: 5696.9, 300 sec: 5706.3). Total num frames: 54118400. Throughput: 0: 5134.9. Samples: 54112236. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:43,537][25689] Avg episode reward: [(0, '-57.621')] [2022-07-09 02:44:44,294][26022] Updated weights on worker 0-0, policy_version 52856 (0.00084) [2022-07-09 02:44:46,063][26022] Updated weights on worker 0-0, policy_version 52866 (0.00086) [2022-07-09 02:44:47,748][26022] Updated weights on worker 0-0, policy_version 52876 (0.00081) [2022-07-09 02:44:48,603][25689] Fps is (10 sec: 5924.7, 60 sec: 5758.5, 300 sec: 5716.2). Total num frames: 54150144. Throughput: 0: 5991.0. Samples: 54147128. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:48,603][25689] Avg episode reward: [(0, '-57.509')] [2022-07-09 02:44:49,734][26022] Updated weights on worker 0-0, policy_version 52886 (0.00088) [2022-07-09 02:44:51,303][26022] Updated weights on worker 0-0, policy_version 52896 (0.00097) [2022-07-09 02:44:53,139][26022] Updated weights on worker 0-0, policy_version 52906 (0.00088) [2022-07-09 02:44:53,635][25689] Fps is (10 sec: 5982.9, 60 sec: 5759.5, 300 sec: 5715.8). Total num frames: 54178816. Throughput: 0: 6002.9. Samples: 54182008. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:53,635][25689] Avg episode reward: [(0, '-57.270')] [2022-07-09 02:44:54,733][26022] Updated weights on worker 0-0, policy_version 52916 (0.00085) [2022-07-09 02:44:56,563][26022] Updated weights on worker 0-0, policy_version 52926 (0.00080) [2022-07-09 02:44:58,433][26022] Updated weights on worker 0-0, policy_version 52936 (0.00094) [2022-07-09 02:44:58,673][25689] Fps is (10 sec: 5694.5, 60 sec: 5724.4, 300 sec: 5712.8). Total num frames: 54207488. Throughput: 0: 6025.3. Samples: 54216896. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:44:58,674][25689] Avg episode reward: [(0, '-56.837')] [2022-07-09 02:45:00,139][26022] Updated weights on worker 0-0, policy_version 52946 (0.00089) [2022-07-09 02:45:02,427][26022] Updated weights on worker 0-0, policy_version 52956 (0.00078) [2022-07-09 02:45:03,815][25689] Fps is (10 sec: 5532.4, 60 sec: 5733.9, 300 sec: 5713.8). Total num frames: 54235136. Throughput: 0: 5927.4. Samples: 54232432. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:45:03,817][25689] Avg episode reward: [(0, '-56.657')] [2022-07-09 02:45:03,936][26022] Updated weights on worker 0-0, policy_version 52966 (0.00088) [2022-07-09 02:45:05,894][26022] Updated weights on worker 0-0, policy_version 52976 (0.01176) [2022-07-09 02:45:07,622][26022] Updated weights on worker 0-0, policy_version 52986 (0.00096) [2022-07-09 02:45:08,847][25689] Fps is (10 sec: 5535.4, 60 sec: 5754.0, 300 sec: 5713.4). Total num frames: 54263808. Throughput: 0: 5902.2. Samples: 54266616. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:45:08,848][25689] Avg episode reward: [(0, '-56.254')] [2022-07-09 02:45:09,498][26022] Updated weights on worker 0-0, policy_version 52996 (0.00093) [2022-07-09 02:45:11,273][26022] Updated weights on worker 0-0, policy_version 53006 (0.00087) [2022-07-09 02:45:12,864][26022] Updated weights on worker 0-0, policy_version 53016 (0.00087) [2022-07-09 02:45:13,873][25689] Fps is (10 sec: 5803.0, 60 sec: 5737.8, 300 sec: 5716.8). Total num frames: 54293504. Throughput: 0: 5885.9. Samples: 54301128. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:45:13,873][25689] Avg episode reward: [(0, '-57.069')] [2022-07-09 02:45:14,851][26022] Updated weights on worker 0-0, policy_version 53026 (0.00093) [2022-07-09 02:45:16,438][26022] Updated weights on worker 0-0, policy_version 53036 (0.00091) [2022-07-09 02:45:18,432][26022] Updated weights on worker 0-0, policy_version 53046 (0.00089) [2022-07-09 02:45:18,881][25689] Fps is (10 sec: 5817.0, 60 sec: 5740.5, 300 sec: 5714.0). Total num frames: 54322176. Throughput: 0: 5030.5. Samples: 54318554. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:45:18,882][25689] Avg episode reward: [(0, '-57.662')] [2022-07-09 02:45:20,079][26022] Updated weights on worker 0-0, policy_version 53056 (0.00087) [2022-07-09 02:45:22,037][26022] Updated weights on worker 0-0, policy_version 53066 (0.00094) [2022-07-09 02:45:23,692][26022] Updated weights on worker 0-0, policy_version 53076 (0.00088) [2022-07-09 02:45:23,964][25689] Fps is (10 sec: 5682.7, 60 sec: 5722.7, 300 sec: 5716.0). Total num frames: 54350848. Throughput: 0: 5986.2. Samples: 54353048. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 02:45:23,964][25689] Avg episode reward: [(0, '-57.951')] [2022-07-09 02:45:25,435][26022] Updated weights on worker 0-0, policy_version 53086 (0.00083) [2022-07-09 02:45:27,211][26022] Updated weights on worker 0-0, policy_version 53096 (0.00093) [2022-07-09 02:45:29,041][25689] Fps is (10 sec: 5644.0, 60 sec: 5734.1, 300 sec: 5711.4). Total num frames: 54379520. Throughput: 0: 5999.9. Samples: 54387780. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:29,042][25689] Avg episode reward: [(0, '-58.178')] [2022-07-09 02:45:29,119][26022] Updated weights on worker 0-0, policy_version 53106 (0.00093) [2022-07-09 02:45:30,788][26022] Updated weights on worker 0-0, policy_version 53116 (0.00089) [2022-07-09 02:45:32,680][26022] Updated weights on worker 0-0, policy_version 53126 (0.00097) [2022-07-09 02:45:34,146][25689] Fps is (10 sec: 5732.6, 60 sec: 5726.5, 300 sec: 5716.7). Total num frames: 54409216. Throughput: 0: 5125.3. Samples: 54405032. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:34,146][25689] Avg episode reward: [(0, '-58.313')] [2022-07-09 02:45:34,330][26022] Updated weights on worker 0-0, policy_version 53136 (0.00095) [2022-07-09 02:45:36,205][26022] Updated weights on worker 0-0, policy_version 53146 (0.00086) [2022-07-09 02:45:37,833][26022] Updated weights on worker 0-0, policy_version 53156 (0.00097) [2022-07-09 02:45:39,152][25689] Fps is (10 sec: 5772.7, 60 sec: 5731.3, 300 sec: 5714.1). Total num frames: 54437888. Throughput: 0: 5991.1. Samples: 54440002. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:39,153][25689] Avg episode reward: [(0, '-57.654')] [2022-07-09 02:45:39,647][26022] Updated weights on worker 0-0, policy_version 53166 (0.00083) [2022-07-09 02:45:41,482][26022] Updated weights on worker 0-0, policy_version 53176 (0.00085) [2022-07-09 02:45:43,230][26022] Updated weights on worker 0-0, policy_version 53186 (0.00090) [2022-07-09 02:45:44,249][25689] Fps is (10 sec: 5777.4, 60 sec: 5751.5, 300 sec: 5716.1). Total num frames: 54467584. Throughput: 0: 5989.0. Samples: 54474532. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:44,249][25689] Avg episode reward: [(0, '-56.228')] [2022-07-09 02:45:44,972][26022] Updated weights on worker 0-0, policy_version 53196 (0.00086) [2022-07-09 02:45:46,737][26022] Updated weights on worker 0-0, policy_version 53206 (0.00086) [2022-07-09 02:45:48,606][26022] Updated weights on worker 0-0, policy_version 53216 (0.00087) [2022-07-09 02:45:49,258][25689] Fps is (10 sec: 5877.3, 60 sec: 5723.1, 300 sec: 5716.2). Total num frames: 54497280. Throughput: 0: 5149.4. Samples: 54491882. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:49,258][25689] Avg episode reward: [(0, '-55.775')] [2022-07-09 02:45:50,344][26022] Updated weights on worker 0-0, policy_version 53226 (0.00100) [2022-07-09 02:45:52,178][26022] Updated weights on worker 0-0, policy_version 53236 (0.00082) [2022-07-09 02:45:53,867][26022] Updated weights on worker 0-0, policy_version 53246 (0.00082) [2022-07-09 02:45:54,273][25689] Fps is (10 sec: 5822.6, 60 sec: 5724.7, 300 sec: 5716.1). Total num frames: 54525952. Throughput: 0: 6018.9. Samples: 54526176. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:54,274][25689] Avg episode reward: [(0, '-55.509')] [2022-07-09 02:45:55,690][26022] Updated weights on worker 0-0, policy_version 53256 (0.00089) [2022-07-09 02:45:57,515][26022] Updated weights on worker 0-0, policy_version 53266 (0.00085) [2022-07-09 02:45:59,230][26022] Updated weights on worker 0-0, policy_version 53276 (0.00087) [2022-07-09 02:45:59,293][25689] Fps is (10 sec: 5714.0, 60 sec: 5726.4, 300 sec: 5714.2). Total num frames: 54554624. Throughput: 0: 5999.3. Samples: 54560834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:45:59,296][25689] Avg episode reward: [(0, '-56.487')] [2022-07-09 02:46:00,868][26022] Updated weights on worker 0-0, policy_version 53286 (0.00082) [2022-07-09 02:46:03,170][26022] Updated weights on worker 0-0, policy_version 53296 (0.00086) [2022-07-09 02:46:04,414][25689] Fps is (10 sec: 5452.9, 60 sec: 5711.5, 300 sec: 5712.2). Total num frames: 54581248. Throughput: 0: 5139.7. Samples: 54578174. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:04,416][25689] Avg episode reward: [(0, '-56.194')] [2022-07-09 02:46:04,890][26022] Updated weights on worker 0-0, policy_version 53306 (0.00093) [2022-07-09 02:46:06,629][26022] Updated weights on worker 0-0, policy_version 53316 (0.00080) [2022-07-09 02:46:08,361][26022] Updated weights on worker 0-0, policy_version 53326 (0.00094) [2022-07-09 02:46:09,422][25689] Fps is (10 sec: 5560.7, 60 sec: 5730.7, 300 sec: 5715.9). Total num frames: 54610944. Throughput: 0: 5899.7. Samples: 54610844. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:09,422][25689] Avg episode reward: [(0, '-56.723')] [2022-07-09 02:46:10,372][26022] Updated weights on worker 0-0, policy_version 53336 (0.00092) [2022-07-09 02:46:11,985][26022] Updated weights on worker 0-0, policy_version 53346 (0.00083) [2022-07-09 02:46:13,980][26022] Updated weights on worker 0-0, policy_version 53356 (0.00087) [2022-07-09 02:46:14,502][25689] Fps is (10 sec: 5785.9, 60 sec: 5708.7, 300 sec: 5714.4). Total num frames: 54639616. Throughput: 0: 5882.0. Samples: 54645160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:14,502][25689] Avg episode reward: [(0, '-57.584')] [2022-07-09 02:46:15,399][26022] Updated weights on worker 0-0, policy_version 53366 (0.00085) [2022-07-09 02:46:17,379][26022] Updated weights on worker 0-0, policy_version 53376 (0.00740) [2022-07-09 02:46:19,201][26022] Updated weights on worker 0-0, policy_version 53386 (0.00080) [2022-07-09 02:46:19,544][25689] Fps is (10 sec: 5665.2, 60 sec: 5705.5, 300 sec: 5714.8). Total num frames: 54668288. Throughput: 0: 5031.7. Samples: 54662728. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:19,545][25689] Avg episode reward: [(0, '-57.050')] [2022-07-09 02:46:19,822][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:46:19,834][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000053389_54670336.pth [2022-07-09 02:46:19,834][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000051378_52611072.pth [2022-07-09 02:46:21,024][26022] Updated weights on worker 0-0, policy_version 53396 (0.00086) [2022-07-09 02:46:22,840][26022] Updated weights on worker 0-0, policy_version 53406 (0.00080) [2022-07-09 02:46:24,453][26022] Updated weights on worker 0-0, policy_version 53416 (0.00079) [2022-07-09 02:46:24,655][25689] Fps is (10 sec: 5748.8, 60 sec: 5719.8, 300 sec: 5717.1). Total num frames: 54697984. Throughput: 0: 5884.5. Samples: 54697284. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:24,655][25689] Avg episode reward: [(0, '-57.189')] [2022-07-09 02:46:26,320][26022] Updated weights on worker 0-0, policy_version 53426 (0.00085) [2022-07-09 02:46:27,995][26022] Updated weights on worker 0-0, policy_version 53436 (0.00090) [2022-07-09 02:46:29,668][25689] Fps is (10 sec: 5765.4, 60 sec: 5725.8, 300 sec: 5720.5). Total num frames: 54726656. Throughput: 0: 5975.7. Samples: 54731828. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:29,668][25689] Avg episode reward: [(0, '-57.761')] [2022-07-09 02:46:29,911][26022] Updated weights on worker 0-0, policy_version 53446 (0.00082) [2022-07-09 02:46:31,681][26022] Updated weights on worker 0-0, policy_version 53456 (0.00082) [2022-07-09 02:46:33,607][26022] Updated weights on worker 0-0, policy_version 53466 (0.00089) [2022-07-09 02:46:34,677][25689] Fps is (10 sec: 5721.4, 60 sec: 5717.9, 300 sec: 5715.2). Total num frames: 54755328. Throughput: 0: 5143.5. Samples: 54748934. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 02:46:34,678][25689] Avg episode reward: [(0, '-57.675')] [2022-07-09 02:46:35,309][26022] Updated weights on worker 0-0, policy_version 53476 (0.00358) [2022-07-09 02:46:37,106][26022] Updated weights on worker 0-0, policy_version 53486 (0.00087) [2022-07-09 02:46:38,732][26022] Updated weights on worker 0-0, policy_version 53496 (0.00086) [2022-07-09 02:46:39,684][25689] Fps is (10 sec: 5622.7, 60 sec: 5701.0, 300 sec: 5716.0). Total num frames: 54782976. Throughput: 0: 5982.2. Samples: 54783212. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:46:39,684][25689] Avg episode reward: [(0, '-57.442')] [2022-07-09 02:46:40,697][26022] Updated weights on worker 0-0, policy_version 53506 (0.00083) [2022-07-09 02:46:42,561][26022] Updated weights on worker 0-0, policy_version 53516 (0.00089) [2022-07-09 02:46:44,310][26022] Updated weights on worker 0-0, policy_version 53526 (0.00089) [2022-07-09 02:46:44,786][25689] Fps is (10 sec: 5673.1, 60 sec: 5700.4, 300 sec: 5718.4). Total num frames: 54812672. Throughput: 0: 5971.1. Samples: 54817488. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:46:44,786][25689] Avg episode reward: [(0, '-57.124')] [2022-07-09 02:46:46,124][26022] Updated weights on worker 0-0, policy_version 53536 (0.00084) [2022-07-09 02:46:47,862][26022] Updated weights on worker 0-0, policy_version 53546 (0.00083) [2022-07-09 02:46:49,515][26022] Updated weights on worker 0-0, policy_version 53556 (0.00084) [2022-07-09 02:46:49,825][25689] Fps is (10 sec: 5957.6, 60 sec: 5714.5, 300 sec: 5717.9). Total num frames: 54843392. Throughput: 0: 5111.0. Samples: 54834852. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:46:49,826][25689] Avg episode reward: [(0, '-57.228')] [2022-07-09 02:46:51,323][26022] Updated weights on worker 0-0, policy_version 53566 (0.00088) [2022-07-09 02:46:53,138][26022] Updated weights on worker 0-0, policy_version 53576 (0.00087) [2022-07-09 02:46:54,771][26022] Updated weights on worker 0-0, policy_version 53586 (0.00083) [2022-07-09 02:46:54,865][25689] Fps is (10 sec: 5892.2, 60 sec: 5712.1, 300 sec: 5721.5). Total num frames: 54872064. Throughput: 0: 5992.3. Samples: 54869906. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:46:54,867][25689] Avg episode reward: [(0, '-56.797')] [2022-07-09 02:46:56,481][26022] Updated weights on worker 0-0, policy_version 53596 (0.00089) [2022-07-09 02:46:58,375][26022] Updated weights on worker 0-0, policy_version 53606 (0.00079) [2022-07-09 02:46:59,898][25689] Fps is (10 sec: 5693.0, 60 sec: 5711.0, 300 sec: 5728.6). Total num frames: 54900736. Throughput: 0: 6028.2. Samples: 54905064. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:46:59,898][25689] Avg episode reward: [(0, '-56.778')] [2022-07-09 02:46:59,980][26022] Updated weights on worker 0-0, policy_version 53616 (0.00371) [2022-07-09 02:47:02,508][26022] Updated weights on worker 0-0, policy_version 53626 (0.00464) [2022-07-09 02:47:03,839][26022] Updated weights on worker 0-0, policy_version 53636 (0.00087) [2022-07-09 02:47:04,998][25689] Fps is (10 sec: 5457.0, 60 sec: 5712.9, 300 sec: 5719.9). Total num frames: 54927360. Throughput: 0: 5939.1. Samples: 54937534. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:05,003][25689] Avg episode reward: [(0, '-56.252')] [2022-07-09 02:47:06,047][26022] Updated weights on worker 0-0, policy_version 53646 (0.00096) [2022-07-09 02:47:07,431][26022] Updated weights on worker 0-0, policy_version 53656 (0.00089) [2022-07-09 02:47:09,424][26022] Updated weights on worker 0-0, policy_version 53666 (0.00080) [2022-07-09 02:47:10,060][25689] Fps is (10 sec: 5743.7, 60 sec: 5741.6, 300 sec: 5729.2). Total num frames: 54959104. Throughput: 0: 5929.2. Samples: 54954828. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:10,060][25689] Avg episode reward: [(0, '-56.155')] [2022-07-09 02:47:11,138][26022] Updated weights on worker 0-0, policy_version 53676 (0.00088) [2022-07-09 02:47:12,769][26022] Updated weights on worker 0-0, policy_version 53686 (0.00088) [2022-07-09 02:47:14,605][26022] Updated weights on worker 0-0, policy_version 53696 (0.00090) [2022-07-09 02:47:15,128][25689] Fps is (10 sec: 5863.0, 60 sec: 5725.8, 300 sec: 5717.7). Total num frames: 54986752. Throughput: 0: 5909.5. Samples: 54989650. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:15,130][25689] Avg episode reward: [(0, '-55.224')] [2022-07-09 02:47:16,461][26022] Updated weights on worker 0-0, policy_version 53706 (0.00086) [2022-07-09 02:47:18,129][26022] Updated weights on worker 0-0, policy_version 53716 (0.00085) [2022-07-09 02:47:19,847][26022] Updated weights on worker 0-0, policy_version 53726 (0.00088) [2022-07-09 02:47:20,192][25689] Fps is (10 sec: 5659.7, 60 sec: 5740.7, 300 sec: 5728.6). Total num frames: 55016448. Throughput: 0: 5895.5. Samples: 55024708. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:20,192][25689] Avg episode reward: [(0, '-55.629')] [2022-07-09 02:47:21,694][26022] Updated weights on worker 0-0, policy_version 53736 (0.00083) [2022-07-09 02:47:23,635][26022] Updated weights on worker 0-0, policy_version 53746 (0.00090) [2022-07-09 02:47:25,249][25689] Fps is (10 sec: 5868.4, 60 sec: 5745.8, 300 sec: 5724.1). Total num frames: 55046144. Throughput: 0: 5156.5. Samples: 55041956. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:25,251][25689] Avg episode reward: [(0, '-55.497')] [2022-07-09 02:47:25,251][26022] Updated weights on worker 0-0, policy_version 53756 (0.00080) [2022-07-09 02:47:26,917][26022] Updated weights on worker 0-0, policy_version 53766 (0.00090) [2022-07-09 02:47:28,775][26022] Updated weights on worker 0-0, policy_version 53776 (0.00093) [2022-07-09 02:47:30,279][25689] Fps is (10 sec: 5786.2, 60 sec: 5744.1, 300 sec: 5723.8). Total num frames: 55074816. Throughput: 0: 6034.8. Samples: 55076848. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:30,281][25689] Avg episode reward: [(0, '-56.427')] [2022-07-09 02:47:30,457][26022] Updated weights on worker 0-0, policy_version 53786 (0.00086) [2022-07-09 02:47:32,391][26022] Updated weights on worker 0-0, policy_version 53796 (0.00081) [2022-07-09 02:47:33,914][26022] Updated weights on worker 0-0, policy_version 53806 (0.00093) [2022-07-09 02:47:35,308][25689] Fps is (10 sec: 5700.8, 60 sec: 5742.3, 300 sec: 5724.0). Total num frames: 55103488. Throughput: 0: 6039.7. Samples: 55111528. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:35,308][25689] Avg episode reward: [(0, '-56.031')] [2022-07-09 02:47:35,815][26022] Updated weights on worker 0-0, policy_version 53816 (0.00085) [2022-07-09 02:47:37,659][26022] Updated weights on worker 0-0, policy_version 53826 (0.00087) [2022-07-09 02:47:39,414][26022] Updated weights on worker 0-0, policy_version 53836 (0.00106) [2022-07-09 02:47:40,312][25689] Fps is (10 sec: 5715.9, 60 sec: 5759.5, 300 sec: 5722.7). Total num frames: 55132160. Throughput: 0: 5168.1. Samples: 55128690. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:47:40,315][25689] Avg episode reward: [(0, '-55.705')] [2022-07-09 02:47:41,310][26022] Updated weights on worker 0-0, policy_version 53846 (0.00098) [2022-07-09 02:47:43,151][26022] Updated weights on worker 0-0, policy_version 53856 (0.00083) [2022-07-09 02:47:44,813][26022] Updated weights on worker 0-0, policy_version 53866 (0.00083) [2022-07-09 02:47:45,390][25689] Fps is (10 sec: 5890.8, 60 sec: 5778.6, 300 sec: 5728.2). Total num frames: 55162880. Throughput: 0: 6019.5. Samples: 55163196. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:47:45,391][25689] Avg episode reward: [(0, '-56.124')] [2022-07-09 02:47:46,557][26022] Updated weights on worker 0-0, policy_version 53876 (0.00102) [2022-07-09 02:47:48,327][26022] Updated weights on worker 0-0, policy_version 53886 (0.00088) [2022-07-09 02:47:50,134][26022] Updated weights on worker 0-0, policy_version 53896 (0.00086) [2022-07-09 02:47:50,418][25689] Fps is (10 sec: 5876.6, 60 sec: 5745.9, 300 sec: 5728.7). Total num frames: 55191552. Throughput: 0: 6015.4. Samples: 55197990. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:47:50,419][25689] Avg episode reward: [(0, '-56.415')] [2022-07-09 02:47:51,862][26022] Updated weights on worker 0-0, policy_version 53906 (0.00086) [2022-07-09 02:47:53,681][26022] Updated weights on worker 0-0, policy_version 53916 (0.00091) [2022-07-09 02:47:55,438][25689] Fps is (10 sec: 5604.9, 60 sec: 5730.9, 300 sec: 5724.9). Total num frames: 55219200. Throughput: 0: 5153.9. Samples: 55215278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:47:55,439][25689] Avg episode reward: [(0, '-57.098')] [2022-07-09 02:47:55,452][26022] Updated weights on worker 0-0, policy_version 53926 (0.00086) [2022-07-09 02:47:57,291][26022] Updated weights on worker 0-0, policy_version 53936 (0.00087) [2022-07-09 02:47:58,968][26022] Updated weights on worker 0-0, policy_version 53946 (0.00108) [2022-07-09 02:48:00,471][25689] Fps is (10 sec: 5602.2, 60 sec: 5730.8, 300 sec: 5735.5). Total num frames: 55247872. Throughput: 0: 6012.3. Samples: 55249896. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:00,473][25689] Avg episode reward: [(0, '-57.302')] [2022-07-09 02:48:00,842][26022] Updated weights on worker 0-0, policy_version 53956 (0.00094) [2022-07-09 02:48:02,778][26022] Updated weights on worker 0-0, policy_version 53966 (0.00083) [2022-07-09 02:48:04,735][26022] Updated weights on worker 0-0, policy_version 53976 (0.00087) [2022-07-09 02:48:05,589][25689] Fps is (10 sec: 5649.3, 60 sec: 5763.0, 300 sec: 5726.7). Total num frames: 55276544. Throughput: 0: 5913.1. Samples: 55282634. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:05,589][25689] Avg episode reward: [(0, '-57.674')] [2022-07-09 02:48:06,321][26022] Updated weights on worker 0-0, policy_version 53986 (0.00082) [2022-07-09 02:48:08,319][26022] Updated weights on worker 0-0, policy_version 53996 (0.00093) [2022-07-09 02:48:09,903][26022] Updated weights on worker 0-0, policy_version 54006 (0.00087) [2022-07-09 02:48:10,606][25689] Fps is (10 sec: 5557.0, 60 sec: 5699.6, 300 sec: 5726.7). Total num frames: 55304192. Throughput: 0: 5061.9. Samples: 55300180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:10,608][25689] Avg episode reward: [(0, '-58.554')] [2022-07-09 02:48:11,924][26022] Updated weights on worker 0-0, policy_version 54016 (0.00093) [2022-07-09 02:48:13,450][26022] Updated weights on worker 0-0, policy_version 54026 (0.00082) [2022-07-09 02:48:15,360][26022] Updated weights on worker 0-0, policy_version 54036 (0.00088) [2022-07-09 02:48:15,622][25689] Fps is (10 sec: 5715.7, 60 sec: 5738.4, 300 sec: 5730.5). Total num frames: 55333888. Throughput: 0: 5907.5. Samples: 55334512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:15,623][25689] Avg episode reward: [(0, '-58.192')] [2022-07-09 02:48:16,956][26022] Updated weights on worker 0-0, policy_version 54046 (0.00091) [2022-07-09 02:48:19,027][26022] Updated weights on worker 0-0, policy_version 54056 (0.00088) [2022-07-09 02:48:19,875][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:48:19,888][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000054061_55358464.pth [2022-07-09 02:48:19,889][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000052048_53297152.pth [2022-07-09 02:48:20,387][26022] Updated weights on worker 0-0, policy_version 54066 (0.00086) [2022-07-09 02:48:20,653][25689] Fps is (10 sec: 5911.8, 60 sec: 5741.5, 300 sec: 5728.5). Total num frames: 55363584. Throughput: 0: 5905.1. Samples: 55369070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:20,655][25689] Avg episode reward: [(0, '-58.195')] [2022-07-09 02:48:22,623][26022] Updated weights on worker 0-0, policy_version 54076 (0.00092) [2022-07-09 02:48:24,068][26022] Updated weights on worker 0-0, policy_version 54086 (0.00087) [2022-07-09 02:48:25,715][25689] Fps is (10 sec: 5681.1, 60 sec: 5707.1, 300 sec: 5724.3). Total num frames: 55391232. Throughput: 0: 5165.1. Samples: 55386590. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:25,716][25689] Avg episode reward: [(0, '-57.394')] [2022-07-09 02:48:26,130][26022] Updated weights on worker 0-0, policy_version 54096 (0.00095) [2022-07-09 02:48:27,746][26022] Updated weights on worker 0-0, policy_version 54106 (0.00095) [2022-07-09 02:48:29,580][26022] Updated weights on worker 0-0, policy_version 54116 (0.00082) [2022-07-09 02:48:30,727][25689] Fps is (10 sec: 5794.0, 60 sec: 5742.8, 300 sec: 5731.0). Total num frames: 55421952. Throughput: 0: 6000.5. Samples: 55420912. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:30,727][25689] Avg episode reward: [(0, '-57.737')] [2022-07-09 02:48:31,267][26022] Updated weights on worker 0-0, policy_version 54126 (0.00081) [2022-07-09 02:48:33,214][26022] Updated weights on worker 0-0, policy_version 54136 (0.00084) [2022-07-09 02:48:34,950][26022] Updated weights on worker 0-0, policy_version 54146 (0.00090) [2022-07-09 02:48:35,753][25689] Fps is (10 sec: 5814.8, 60 sec: 5726.0, 300 sec: 5727.9). Total num frames: 55449600. Throughput: 0: 6028.0. Samples: 55455866. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:35,755][25689] Avg episode reward: [(0, '-57.223')] [2022-07-09 02:48:36,743][26022] Updated weights on worker 0-0, policy_version 54156 (0.00091) [2022-07-09 02:48:38,323][26022] Updated weights on worker 0-0, policy_version 54166 (0.00094) [2022-07-09 02:48:40,198][26022] Updated weights on worker 0-0, policy_version 54176 (0.00081) [2022-07-09 02:48:40,790][25689] Fps is (10 sec: 5698.3, 60 sec: 5739.8, 300 sec: 5728.1). Total num frames: 55479296. Throughput: 0: 5177.2. Samples: 55473324. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:40,792][25689] Avg episode reward: [(0, '-57.540')] [2022-07-09 02:48:41,969][26022] Updated weights on worker 0-0, policy_version 54186 (0.00092) [2022-07-09 02:48:43,778][26022] Updated weights on worker 0-0, policy_version 54196 (0.00083) [2022-07-09 02:48:45,481][26022] Updated weights on worker 0-0, policy_version 54206 (0.00090) [2022-07-09 02:48:45,885][25689] Fps is (10 sec: 5862.1, 60 sec: 5721.3, 300 sec: 5733.2). Total num frames: 55508992. Throughput: 0: 6008.8. Samples: 55507784. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 02:48:45,886][25689] Avg episode reward: [(0, '-56.998')] [2022-07-09 02:48:47,207][26022] Updated weights on worker 0-0, policy_version 54216 (0.00086) [2022-07-09 02:48:49,020][26022] Updated weights on worker 0-0, policy_version 54226 (0.00082) [2022-07-09 02:48:50,863][26022] Updated weights on worker 0-0, policy_version 54236 (0.00088) [2022-07-09 02:48:50,933][25689] Fps is (10 sec: 5754.7, 60 sec: 5719.5, 300 sec: 5733.1). Total num frames: 55537664. Throughput: 0: 6026.5. Samples: 55542684. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:48:50,933][25689] Avg episode reward: [(0, '-57.248')] [2022-07-09 02:48:52,658][26022] Updated weights on worker 0-0, policy_version 54246 (0.00086) [2022-07-09 02:48:54,503][26022] Updated weights on worker 0-0, policy_version 54256 (0.00091) [2022-07-09 02:48:55,941][25689] Fps is (10 sec: 5702.5, 60 sec: 5737.6, 300 sec: 5726.5). Total num frames: 55566336. Throughput: 0: 5139.9. Samples: 55559628. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:48:55,941][25689] Avg episode reward: [(0, '-56.821')] [2022-07-09 02:48:56,187][26022] Updated weights on worker 0-0, policy_version 54266 (0.00096) [2022-07-09 02:48:58,091][26022] Updated weights on worker 0-0, policy_version 54276 (0.00089) [2022-07-09 02:48:59,618][26022] Updated weights on worker 0-0, policy_version 54286 (0.00091) [2022-07-09 02:49:00,962][25689] Fps is (10 sec: 5615.7, 60 sec: 5721.8, 300 sec: 5730.8). Total num frames: 55593984. Throughput: 0: 6003.6. Samples: 55594426. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:00,962][25689] Avg episode reward: [(0, '-56.840')] [2022-07-09 02:49:01,888][26022] Updated weights on worker 0-0, policy_version 54296 (0.00087) [2022-07-09 02:49:03,631][26022] Updated weights on worker 0-0, policy_version 54306 (0.00055) [2022-07-09 02:49:05,523][26022] Updated weights on worker 0-0, policy_version 54316 (0.00084) [2022-07-09 02:49:06,039][25689] Fps is (10 sec: 5577.3, 60 sec: 5725.6, 300 sec: 5734.0). Total num frames: 55622656. Throughput: 0: 5910.2. Samples: 55626898. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:06,039][25689] Avg episode reward: [(0, '-57.856')] [2022-07-09 02:49:07,335][26022] Updated weights on worker 0-0, policy_version 54326 (0.00086) [2022-07-09 02:49:09,044][26022] Updated weights on worker 0-0, policy_version 54336 (0.00074) [2022-07-09 02:49:10,636][26022] Updated weights on worker 0-0, policy_version 54346 (0.00088) [2022-07-09 02:49:11,057][25689] Fps is (10 sec: 5781.8, 60 sec: 5759.4, 300 sec: 5730.9). Total num frames: 55652352. Throughput: 0: 5044.4. Samples: 55644198. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:11,057][25689] Avg episode reward: [(0, '-58.542')] [2022-07-09 02:49:12,716][26022] Updated weights on worker 0-0, policy_version 54356 (0.00083) [2022-07-09 02:49:14,134][26022] Updated weights on worker 0-0, policy_version 54366 (0.00090) [2022-07-09 02:49:16,043][26022] Updated weights on worker 0-0, policy_version 54376 (0.00086) [2022-07-09 02:49:16,068][25689] Fps is (10 sec: 5819.5, 60 sec: 5742.8, 300 sec: 5731.3). Total num frames: 55681024. Throughput: 0: 5933.5. Samples: 55679056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:16,069][25689] Avg episode reward: [(0, '-58.406')] [2022-07-09 02:49:17,663][26022] Updated weights on worker 0-0, policy_version 54386 (0.00088) [2022-07-09 02:49:19,643][26022] Updated weights on worker 0-0, policy_version 54396 (0.00092) [2022-07-09 02:49:21,070][25689] Fps is (10 sec: 5726.8, 60 sec: 5728.7, 300 sec: 5729.3). Total num frames: 55709696. Throughput: 0: 5947.1. Samples: 55714012. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:21,072][25689] Avg episode reward: [(0, '-58.267')] [2022-07-09 02:49:21,377][26022] Updated weights on worker 0-0, policy_version 54406 (0.00099) [2022-07-09 02:49:23,132][26022] Updated weights on worker 0-0, policy_version 54416 (0.00085) [2022-07-09 02:49:24,694][26022] Updated weights on worker 0-0, policy_version 54426 (0.00082) [2022-07-09 02:49:26,112][25689] Fps is (10 sec: 5505.7, 60 sec: 5713.7, 300 sec: 5725.4). Total num frames: 55736320. Throughput: 0: 5200.1. Samples: 55731280. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:26,114][25689] Avg episode reward: [(0, '-58.023')] [2022-07-09 02:49:26,755][26022] Updated weights on worker 0-0, policy_version 54436 (0.00098) [2022-07-09 02:49:28,402][26022] Updated weights on worker 0-0, policy_version 54446 (0.00081) [2022-07-09 02:49:30,434][26022] Updated weights on worker 0-0, policy_version 54456 (0.00088) [2022-07-09 02:49:31,114][25689] Fps is (10 sec: 5913.1, 60 sec: 5748.5, 300 sec: 5736.2). Total num frames: 55769088. Throughput: 0: 6080.8. Samples: 55766164. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:31,115][25689] Avg episode reward: [(0, '-57.262')] [2022-07-09 02:49:32,066][26022] Updated weights on worker 0-0, policy_version 54466 (0.00090) [2022-07-09 02:49:33,727][26022] Updated weights on worker 0-0, policy_version 54476 (0.00089) [2022-07-09 02:49:35,543][26022] Updated weights on worker 0-0, policy_version 54486 (0.00088) [2022-07-09 02:49:36,121][25689] Fps is (10 sec: 6036.1, 60 sec: 5750.4, 300 sec: 5733.7). Total num frames: 55796736. Throughput: 0: 6098.8. Samples: 55801352. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:36,121][25689] Avg episode reward: [(0, '-57.352')] [2022-07-09 02:49:37,327][26022] Updated weights on worker 0-0, policy_version 54496 (0.00085) [2022-07-09 02:49:38,995][26022] Updated weights on worker 0-0, policy_version 54506 (0.00086) [2022-07-09 02:49:40,987][26022] Updated weights on worker 0-0, policy_version 54516 (0.00084) [2022-07-09 02:49:41,126][25689] Fps is (10 sec: 5523.2, 60 sec: 5719.5, 300 sec: 5732.7). Total num frames: 55824384. Throughput: 0: 5217.5. Samples: 55818652. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:41,126][25689] Avg episode reward: [(0, '-55.622')] [2022-07-09 02:49:42,528][26022] Updated weights on worker 0-0, policy_version 54526 (0.00093) [2022-07-09 02:49:44,439][26022] Updated weights on worker 0-0, policy_version 54536 (0.00082) [2022-07-09 02:49:46,024][26022] Updated weights on worker 0-0, policy_version 54546 (0.00091) [2022-07-09 02:49:46,197][25689] Fps is (10 sec: 5792.5, 60 sec: 5738.6, 300 sec: 5729.1). Total num frames: 55855104. Throughput: 0: 6061.6. Samples: 55853032. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:46,198][25689] Avg episode reward: [(0, '-55.969')] [2022-07-09 02:49:48,023][26022] Updated weights on worker 0-0, policy_version 54556 (0.00129) [2022-07-09 02:49:49,861][26022] Updated weights on worker 0-0, policy_version 54566 (0.00085) [2022-07-09 02:49:51,221][25689] Fps is (10 sec: 5883.1, 60 sec: 5740.9, 300 sec: 5729.3). Total num frames: 55883776. Throughput: 0: 6059.4. Samples: 55888000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:51,226][25689] Avg episode reward: [(0, '-55.878')] [2022-07-09 02:49:51,362][26022] Updated weights on worker 0-0, policy_version 54576 (0.00086) [2022-07-09 02:49:53,424][26022] Updated weights on worker 0-0, policy_version 54586 (0.00085) [2022-07-09 02:49:54,859][26022] Updated weights on worker 0-0, policy_version 54596 (0.00095) [2022-07-09 02:49:56,233][25689] Fps is (10 sec: 5714.3, 60 sec: 5740.6, 300 sec: 5729.8). Total num frames: 55912448. Throughput: 0: 6032.5. Samples: 55922676. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 02:49:56,233][25689] Avg episode reward: [(0, '-56.659')] [2022-07-09 02:49:56,966][26022] Updated weights on worker 0-0, policy_version 54606 (0.00086) [2022-07-09 02:49:58,575][26022] Updated weights on worker 0-0, policy_version 54616 (0.00088) [2022-07-09 02:50:00,257][26022] Updated weights on worker 0-0, policy_version 54626 (0.00084) [2022-07-09 02:50:01,255][25689] Fps is (10 sec: 5817.3, 60 sec: 5774.5, 300 sec: 5739.0). Total num frames: 55942144. Throughput: 0: 6024.4. Samples: 55939916. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:01,255][25689] Avg episode reward: [(0, '-55.928')] [2022-07-09 02:50:02,511][26022] Updated weights on worker 0-0, policy_version 54636 (0.00085) [2022-07-09 02:50:04,158][26022] Updated weights on worker 0-0, policy_version 54646 (0.00088) [2022-07-09 02:50:06,220][26022] Updated weights on worker 0-0, policy_version 54656 (0.00083) [2022-07-09 02:50:06,327][25689] Fps is (10 sec: 5478.0, 60 sec: 5724.0, 300 sec: 5727.9). Total num frames: 55967744. Throughput: 0: 5934.4. Samples: 55972488. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:06,327][25689] Avg episode reward: [(0, '-55.486')] [2022-07-09 02:50:07,740][26022] Updated weights on worker 0-0, policy_version 54666 (0.00081) [2022-07-09 02:50:09,650][26022] Updated weights on worker 0-0, policy_version 54676 (0.00097) [2022-07-09 02:50:11,312][26022] Updated weights on worker 0-0, policy_version 54686 (0.00088) [2022-07-09 02:50:11,332][25689] Fps is (10 sec: 5588.8, 60 sec: 5742.2, 300 sec: 5731.7). Total num frames: 55998464. Throughput: 0: 5926.1. Samples: 56007178. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:11,333][25689] Avg episode reward: [(0, '-55.951')] [2022-07-09 02:50:13,228][26022] Updated weights on worker 0-0, policy_version 54696 (0.00087) [2022-07-09 02:50:14,904][26022] Updated weights on worker 0-0, policy_version 54706 (0.00091) [2022-07-09 02:50:16,349][25689] Fps is (10 sec: 5823.8, 60 sec: 5724.7, 300 sec: 5728.1). Total num frames: 56026112. Throughput: 0: 5045.0. Samples: 56024164. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:16,350][25689] Avg episode reward: [(0, '-55.919')] [2022-07-09 02:50:16,770][26022] Updated weights on worker 0-0, policy_version 54716 (0.00088) [2022-07-09 02:50:18,441][26022] Updated weights on worker 0-0, policy_version 54726 (0.00083) [2022-07-09 02:50:19,964][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:50:19,979][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000054734_56047616.pth [2022-07-09 02:50:19,980][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000052718_53983232.pth [2022-07-09 02:50:19,980][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000054734_56047616.pth.milestone [2022-07-09 02:50:20,193][26022] Updated weights on worker 0-0, policy_version 54736 (0.00096) [2022-07-09 02:50:21,390][25689] Fps is (10 sec: 5701.1, 60 sec: 5737.9, 300 sec: 5732.4). Total num frames: 56055808. Throughput: 0: 5912.9. Samples: 56058976. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:21,391][25689] Avg episode reward: [(0, '-54.603')] [2022-07-09 02:50:21,901][26022] Updated weights on worker 0-0, policy_version 54746 (0.00090) [2022-07-09 02:50:23,786][26022] Updated weights on worker 0-0, policy_version 54756 (0.00087) [2022-07-09 02:50:25,662][26022] Updated weights on worker 0-0, policy_version 54766 (0.00088) [2022-07-09 02:50:26,461][25689] Fps is (10 sec: 5772.4, 60 sec: 5769.1, 300 sec: 5732.5). Total num frames: 56084480. Throughput: 0: 6024.0. Samples: 56093774. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:26,461][25689] Avg episode reward: [(0, '-53.587')] [2022-07-09 02:50:27,404][26022] Updated weights on worker 0-0, policy_version 54776 (0.00081) [2022-07-09 02:50:28,953][26022] Updated weights on worker 0-0, policy_version 54786 (0.00090) [2022-07-09 02:50:31,042][26022] Updated weights on worker 0-0, policy_version 54796 (0.00080) [2022-07-09 02:50:31,548][25689] Fps is (10 sec: 5746.2, 60 sec: 5710.2, 300 sec: 5732.8). Total num frames: 56114176. Throughput: 0: 5141.9. Samples: 56111126. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:31,548][25689] Avg episode reward: [(0, '-53.587')] [2022-07-09 02:50:32,671][26022] Updated weights on worker 0-0, policy_version 54806 (0.00082) [2022-07-09 02:50:34,364][26022] Updated weights on worker 0-0, policy_version 54816 (0.00080) [2022-07-09 02:50:35,995][26022] Updated weights on worker 0-0, policy_version 54826 (0.00086) [2022-07-09 02:50:36,560][25689] Fps is (10 sec: 5779.6, 60 sec: 5726.7, 300 sec: 5732.7). Total num frames: 56142848. Throughput: 0: 6016.6. Samples: 56145764. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:36,560][25689] Avg episode reward: [(0, '-53.823')] [2022-07-09 02:50:37,999][26022] Updated weights on worker 0-0, policy_version 54836 (0.00090) [2022-07-09 02:50:39,725][26022] Updated weights on worker 0-0, policy_version 54846 (0.00087) [2022-07-09 02:50:41,608][25689] Fps is (10 sec: 5699.9, 60 sec: 5739.4, 300 sec: 5730.2). Total num frames: 56171520. Throughput: 0: 6009.7. Samples: 56180482. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:41,611][25689] Avg episode reward: [(0, '-54.151')] [2022-07-09 02:50:41,639][26022] Updated weights on worker 0-0, policy_version 54856 (0.00110) [2022-07-09 02:50:43,256][26022] Updated weights on worker 0-0, policy_version 54866 (0.00097) [2022-07-09 02:50:45,075][26022] Updated weights on worker 0-0, policy_version 54876 (0.00074) [2022-07-09 02:50:46,680][25689] Fps is (10 sec: 5868.3, 60 sec: 5739.4, 300 sec: 5732.4). Total num frames: 56202240. Throughput: 0: 5149.7. Samples: 56197900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:46,681][25689] Avg episode reward: [(0, '-54.424')] [2022-07-09 02:50:47,006][26022] Updated weights on worker 0-0, policy_version 54886 (0.00088) [2022-07-09 02:50:48,678][26022] Updated weights on worker 0-0, policy_version 54896 (0.00088) [2022-07-09 02:50:50,367][26022] Updated weights on worker 0-0, policy_version 54906 (0.00093) [2022-07-09 02:50:51,714][25689] Fps is (10 sec: 5877.1, 60 sec: 5738.4, 300 sec: 5732.1). Total num frames: 56230912. Throughput: 0: 6019.2. Samples: 56232510. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:51,715][25689] Avg episode reward: [(0, '-55.347')] [2022-07-09 02:50:52,266][26022] Updated weights on worker 0-0, policy_version 54916 (0.00088) [2022-07-09 02:50:53,960][26022] Updated weights on worker 0-0, policy_version 54926 (0.00091) [2022-07-09 02:50:55,861][26022] Updated weights on worker 0-0, policy_version 54936 (0.00616) [2022-07-09 02:50:56,722][25689] Fps is (10 sec: 5710.9, 60 sec: 5738.8, 300 sec: 5732.3). Total num frames: 56259584. Throughput: 0: 6017.0. Samples: 56267080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:50:56,723][25689] Avg episode reward: [(0, '-55.527')] [2022-07-09 02:50:57,424][26022] Updated weights on worker 0-0, policy_version 54946 (0.00058) [2022-07-09 02:50:59,335][26022] Updated weights on worker 0-0, policy_version 54956 (0.00090) [2022-07-09 02:51:01,011][26022] Updated weights on worker 0-0, policy_version 54966 (0.00094) [2022-07-09 02:51:01,732][25689] Fps is (10 sec: 5724.0, 60 sec: 5723.0, 300 sec: 5741.3). Total num frames: 56288256. Throughput: 0: 5170.2. Samples: 56284526. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 02:51:01,733][25689] Avg episode reward: [(0, '-55.656')] [2022-07-09 02:51:03,228][26022] Updated weights on worker 0-0, policy_version 54976 (0.00091) [2022-07-09 02:51:04,784][26022] Updated weights on worker 0-0, policy_version 54986 (0.00086) [2022-07-09 02:51:06,847][25689] Fps is (10 sec: 5360.1, 60 sec: 5719.0, 300 sec: 5725.5). Total num frames: 56313856. Throughput: 0: 5916.6. Samples: 56317216. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:06,847][25689] Avg episode reward: [(0, '-55.031')] [2022-07-09 02:51:07,038][26022] Updated weights on worker 0-0, policy_version 54996 (0.00093) [2022-07-09 02:51:08,401][26022] Updated weights on worker 0-0, policy_version 55006 (0.00081) [2022-07-09 02:51:10,367][26022] Updated weights on worker 0-0, policy_version 55016 (0.00079) [2022-07-09 02:51:11,877][25689] Fps is (10 sec: 5652.6, 60 sec: 5733.5, 300 sec: 5736.8). Total num frames: 56345600. Throughput: 0: 5928.9. Samples: 56352054. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:11,879][25689] Avg episode reward: [(0, '-55.538')] [2022-07-09 02:51:11,885][26022] Updated weights on worker 0-0, policy_version 55026 (0.00092) [2022-07-09 02:51:13,801][26022] Updated weights on worker 0-0, policy_version 55036 (0.00084) [2022-07-09 02:51:15,515][26022] Updated weights on worker 0-0, policy_version 55046 (0.00083) [2022-07-09 02:51:16,915][25689] Fps is (10 sec: 6000.9, 60 sec: 5748.5, 300 sec: 5736.9). Total num frames: 56374272. Throughput: 0: 5070.9. Samples: 56369476. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:16,915][25689] Avg episode reward: [(0, '-55.269')] [2022-07-09 02:51:17,363][26022] Updated weights on worker 0-0, policy_version 55056 (0.00087) [2022-07-09 02:51:19,031][26022] Updated weights on worker 0-0, policy_version 55066 (0.00090) [2022-07-09 02:51:21,215][26022] Updated weights on worker 0-0, policy_version 55076 (0.00083) [2022-07-09 02:51:21,939][25689] Fps is (10 sec: 5698.9, 60 sec: 5733.1, 300 sec: 5735.1). Total num frames: 56402944. Throughput: 0: 5914.6. Samples: 56404042. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:21,940][25689] Avg episode reward: [(0, '-56.037')] [2022-07-09 02:51:22,577][26022] Updated weights on worker 0-0, policy_version 55086 (0.00088) [2022-07-09 02:51:24,664][26022] Updated weights on worker 0-0, policy_version 55096 (0.00094) [2022-07-09 02:51:26,024][26022] Updated weights on worker 0-0, policy_version 55106 (0.00086) [2022-07-09 02:51:27,051][25689] Fps is (10 sec: 5657.4, 60 sec: 5729.2, 300 sec: 5733.2). Total num frames: 56431616. Throughput: 0: 5999.8. Samples: 56438436. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:27,051][25689] Avg episode reward: [(0, '-56.723')] [2022-07-09 02:51:28,267][26022] Updated weights on worker 0-0, policy_version 55116 (0.00089) [2022-07-09 02:51:29,810][26022] Updated weights on worker 0-0, policy_version 55126 (0.00089) [2022-07-09 02:51:31,698][26022] Updated weights on worker 0-0, policy_version 55136 (0.00091) [2022-07-09 02:51:32,078][25689] Fps is (10 sec: 5857.9, 60 sec: 5751.8, 300 sec: 5739.8). Total num frames: 56462336. Throughput: 0: 5133.6. Samples: 56455758. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:32,079][25689] Avg episode reward: [(0, '-56.481')] [2022-07-09 02:51:33,287][26022] Updated weights on worker 0-0, policy_version 55146 (0.00095) [2022-07-09 02:51:35,162][26022] Updated weights on worker 0-0, policy_version 55156 (0.00090) [2022-07-09 02:51:36,941][26022] Updated weights on worker 0-0, policy_version 55166 (0.00091) [2022-07-09 02:51:37,097][25689] Fps is (10 sec: 5911.8, 60 sec: 5751.2, 300 sec: 5743.0). Total num frames: 56491008. Throughput: 0: 5993.4. Samples: 56490436. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:37,098][25689] Avg episode reward: [(0, '-56.839')] [2022-07-09 02:51:38,714][26022] Updated weights on worker 0-0, policy_version 55176 (0.00081) [2022-07-09 02:51:40,369][26022] Updated weights on worker 0-0, policy_version 55186 (0.00085) [2022-07-09 02:51:42,134][25689] Fps is (10 sec: 5600.8, 60 sec: 5735.4, 300 sec: 5737.3). Total num frames: 56518656. Throughput: 0: 5997.6. Samples: 56525160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:42,135][25689] Avg episode reward: [(0, '-56.370')] [2022-07-09 02:51:42,595][26022] Updated weights on worker 0-0, policy_version 55196 (0.00087) [2022-07-09 02:51:43,929][26022] Updated weights on worker 0-0, policy_version 55206 (0.00090) [2022-07-09 02:51:45,942][26022] Updated weights on worker 0-0, policy_version 55216 (0.00094) [2022-07-09 02:51:47,256][25689] Fps is (10 sec: 5644.6, 60 sec: 5713.7, 300 sec: 5732.3). Total num frames: 56548352. Throughput: 0: 5137.8. Samples: 56542248. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:47,257][25689] Avg episode reward: [(0, '-55.632')] [2022-07-09 02:51:47,639][26022] Updated weights on worker 0-0, policy_version 55226 (0.00088) [2022-07-09 02:51:49,367][26022] Updated weights on worker 0-0, policy_version 55236 (0.00058) [2022-07-09 02:51:51,251][26022] Updated weights on worker 0-0, policy_version 55246 (0.00087) [2022-07-09 02:51:52,296][25689] Fps is (10 sec: 5945.2, 60 sec: 5746.9, 300 sec: 5739.2). Total num frames: 56579072. Throughput: 0: 6005.6. Samples: 56577178. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:52,297][25689] Avg episode reward: [(0, '-55.079')] [2022-07-09 02:51:52,732][26022] Updated weights on worker 0-0, policy_version 55256 (0.00088) [2022-07-09 02:51:54,767][26022] Updated weights on worker 0-0, policy_version 55266 (0.00083) [2022-07-09 02:51:56,372][26022] Updated weights on worker 0-0, policy_version 55276 (0.00092) [2022-07-09 02:51:57,308][25689] Fps is (10 sec: 5704.7, 60 sec: 5712.7, 300 sec: 5732.7). Total num frames: 56605696. Throughput: 0: 6014.7. Samples: 56612000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:51:57,309][25689] Avg episode reward: [(0, '-55.227')] [2022-07-09 02:51:58,151][26022] Updated weights on worker 0-0, policy_version 55286 (0.00095) [2022-07-09 02:52:00,037][26022] Updated weights on worker 0-0, policy_version 55296 (0.00087) [2022-07-09 02:52:01,937][26022] Updated weights on worker 0-0, policy_version 55306 (0.00096) [2022-07-09 02:52:02,333][25689] Fps is (10 sec: 5509.5, 60 sec: 5711.4, 300 sec: 5741.0). Total num frames: 56634368. Throughput: 0: 5167.6. Samples: 56629538. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:52:02,333][25689] Avg episode reward: [(0, '-56.217')] [2022-07-09 02:52:03,727][26022] Updated weights on worker 0-0, policy_version 55316 (0.00085) [2022-07-09 02:52:05,735][26022] Updated weights on worker 0-0, policy_version 55326 (0.00086) [2022-07-09 02:52:07,101][26022] Updated weights on worker 0-0, policy_version 55336 (0.00085) [2022-07-09 02:52:07,453][25689] Fps is (10 sec: 5854.5, 60 sec: 5795.3, 300 sec: 5736.5). Total num frames: 56665088. Throughput: 0: 5946.4. Samples: 56662346. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:52:07,453][25689] Avg episode reward: [(0, '-56.126')] [2022-07-09 02:52:09,290][26022] Updated weights on worker 0-0, policy_version 55346 (0.00086) [2022-07-09 02:52:10,827][26022] Updated weights on worker 0-0, policy_version 55356 (0.00086) [2022-07-09 02:52:12,474][25689] Fps is (10 sec: 5856.5, 60 sec: 5745.5, 300 sec: 5740.8). Total num frames: 56693760. Throughput: 0: 5952.3. Samples: 56697282. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 02:52:12,476][25689] Avg episode reward: [(0, '-55.653')] [2022-07-09 02:52:12,610][26022] Updated weights on worker 0-0, policy_version 55366 (0.00101) [2022-07-09 02:52:14,419][26022] Updated weights on worker 0-0, policy_version 55376 (0.00089) [2022-07-09 02:52:16,151][26022] Updated weights on worker 0-0, policy_version 55386 (0.00086) [2022-07-09 02:52:17,495][25689] Fps is (10 sec: 5608.3, 60 sec: 5730.2, 300 sec: 5734.7). Total num frames: 56721408. Throughput: 0: 5100.5. Samples: 56714964. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:17,496][25689] Avg episode reward: [(0, '-56.285')] [2022-07-09 02:52:18,013][26022] Updated weights on worker 0-0, policy_version 55396 (0.00090) [2022-07-09 02:52:19,803][26022] Updated weights on worker 0-0, policy_version 55406 (0.00088) [2022-07-09 02:52:19,983][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:52:19,997][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000055408_56737792.pth [2022-07-09 02:52:19,998][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000053389_54670336.pth [2022-07-09 02:52:21,464][26022] Updated weights on worker 0-0, policy_version 55416 (0.00089) [2022-07-09 02:52:22,558][25689] Fps is (10 sec: 5686.5, 60 sec: 5743.4, 300 sec: 5734.6). Total num frames: 56751104. Throughput: 0: 5939.3. Samples: 56749664. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:22,560][25689] Avg episode reward: [(0, '-54.897')] [2022-07-09 02:52:23,343][26022] Updated weights on worker 0-0, policy_version 55426 (0.00088) [2022-07-09 02:52:24,968][26022] Updated weights on worker 0-0, policy_version 55436 (0.00097) [2022-07-09 02:52:26,923][26022] Updated weights on worker 0-0, policy_version 55446 (0.00077) [2022-07-09 02:52:27,635][25689] Fps is (10 sec: 6059.3, 60 sec: 5797.4, 300 sec: 5744.1). Total num frames: 56782848. Throughput: 0: 6033.8. Samples: 56784120. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:27,635][25689] Avg episode reward: [(0, '-54.292')] [2022-07-09 02:52:28,623][26022] Updated weights on worker 0-0, policy_version 55456 (0.00086) [2022-07-09 02:52:30,315][26022] Updated weights on worker 0-0, policy_version 55466 (0.00095) [2022-07-09 02:52:32,324][26022] Updated weights on worker 0-0, policy_version 55476 (0.00089) [2022-07-09 02:52:32,686][25689] Fps is (10 sec: 5661.9, 60 sec: 5710.7, 300 sec: 5733.3). Total num frames: 56808448. Throughput: 0: 6002.2. Samples: 56818600. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:32,687][25689] Avg episode reward: [(0, '-54.497')] [2022-07-09 02:52:34,059][26022] Updated weights on worker 0-0, policy_version 55487 (0.00086) [2022-07-09 02:52:36,004][26022] Updated weights on worker 0-0, policy_version 55497 (0.00087) [2022-07-09 02:52:37,707][25689] Fps is (10 sec: 5490.1, 60 sec: 5727.4, 300 sec: 5736.4). Total num frames: 56838144. Throughput: 0: 5986.6. Samples: 56835964. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:37,708][25689] Avg episode reward: [(0, '-54.237')] [2022-07-09 02:52:37,749][26022] Updated weights on worker 0-0, policy_version 55507 (0.00086) [2022-07-09 02:52:39,456][26022] Updated weights on worker 0-0, policy_version 55517 (0.00502) [2022-07-09 02:52:41,276][26022] Updated weights on worker 0-0, policy_version 55527 (0.00096) [2022-07-09 02:52:42,786][25689] Fps is (10 sec: 5779.4, 60 sec: 5740.3, 300 sec: 5729.5). Total num frames: 56866816. Throughput: 0: 5978.0. Samples: 56870584. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:42,786][25689] Avg episode reward: [(0, '-54.231')] [2022-07-09 02:52:43,147][26022] Updated weights on worker 0-0, policy_version 55537 (0.00083) [2022-07-09 02:52:44,783][26022] Updated weights on worker 0-0, policy_version 55547 (0.00085) [2022-07-09 02:52:46,695][26022] Updated weights on worker 0-0, policy_version 55557 (0.00086) [2022-07-09 02:52:47,868][25689] Fps is (10 sec: 5845.0, 60 sec: 5761.0, 300 sec: 5735.4). Total num frames: 56897536. Throughput: 0: 5988.1. Samples: 56905278. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:47,869][25689] Avg episode reward: [(0, '-55.354')] [2022-07-09 02:52:48,423][26022] Updated weights on worker 0-0, policy_version 55567 (0.00088) [2022-07-09 02:52:50,187][26022] Updated weights on worker 0-0, policy_version 55577 (0.00096) [2022-07-09 02:52:51,930][26022] Updated weights on worker 0-0, policy_version 55587 (0.00085) [2022-07-09 02:52:52,875][25689] Fps is (10 sec: 5886.4, 60 sec: 5730.3, 300 sec: 5739.1). Total num frames: 56926208. Throughput: 0: 5151.3. Samples: 56922600. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:52,877][25689] Avg episode reward: [(0, '-56.092')] [2022-07-09 02:52:53,611][26022] Updated weights on worker 0-0, policy_version 55597 (0.00091) [2022-07-09 02:52:55,544][26022] Updated weights on worker 0-0, policy_version 55607 (0.00087) [2022-07-09 02:52:57,281][26022] Updated weights on worker 0-0, policy_version 55617 (0.00083) [2022-07-09 02:52:57,901][25689] Fps is (10 sec: 5715.6, 60 sec: 5762.8, 300 sec: 5739.2). Total num frames: 56954880. Throughput: 0: 6014.0. Samples: 56957412. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:52:57,903][25689] Avg episode reward: [(0, '-56.106')] [2022-07-09 02:52:58,956][26022] Updated weights on worker 0-0, policy_version 55627 (0.00095) [2022-07-09 02:53:00,748][26022] Updated weights on worker 0-0, policy_version 55637 (0.00090) [2022-07-09 02:53:02,732][26022] Updated weights on worker 0-0, policy_version 55647 (0.00079) [2022-07-09 02:53:02,914][25689] Fps is (10 sec: 5610.2, 60 sec: 5747.0, 300 sec: 5737.8). Total num frames: 56982528. Throughput: 0: 5967.4. Samples: 56990698. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:53:02,915][25689] Avg episode reward: [(0, '-55.663')] [2022-07-09 02:53:04,472][26022] Updated weights on worker 0-0, policy_version 55657 (0.00090) [2022-07-09 02:53:06,363][26022] Updated weights on worker 0-0, policy_version 55667 (0.00084) [2022-07-09 02:53:07,913][26022] Updated weights on worker 0-0, policy_version 55677 (0.00085) [2022-07-09 02:53:08,021][25689] Fps is (10 sec: 5767.3, 60 sec: 5748.2, 300 sec: 5746.4). Total num frames: 57013248. Throughput: 0: 5098.5. Samples: 57008030. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:53:08,022][25689] Avg episode reward: [(0, '-56.192')] [2022-07-09 02:53:09,992][26022] Updated weights on worker 0-0, policy_version 55687 (0.00846) [2022-07-09 02:53:11,676][26022] Updated weights on worker 0-0, policy_version 55697 (0.00085) [2022-07-09 02:53:13,059][25689] Fps is (10 sec: 5854.1, 60 sec: 5746.6, 300 sec: 5742.5). Total num frames: 57041920. Throughput: 0: 5961.4. Samples: 57042926. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:53:13,060][25689] Avg episode reward: [(0, '-55.792')] [2022-07-09 02:53:13,237][26022] Updated weights on worker 0-0, policy_version 55707 (0.00085) [2022-07-09 02:53:15,379][26022] Updated weights on worker 0-0, policy_version 55717 (0.00095) [2022-07-09 02:53:16,804][26022] Updated weights on worker 0-0, policy_version 55727 (0.00084) [2022-07-09 02:53:18,117][25689] Fps is (10 sec: 5578.9, 60 sec: 5743.2, 300 sec: 5735.1). Total num frames: 57069568. Throughput: 0: 5950.8. Samples: 57077712. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:53:18,117][25689] Avg episode reward: [(0, '-55.283')] [2022-07-09 02:53:18,972][26022] Updated weights on worker 0-0, policy_version 55737 (0.00092) [2022-07-09 02:53:20,390][26022] Updated weights on worker 0-0, policy_version 55747 (0.00097) [2022-07-09 02:53:22,225][26022] Updated weights on worker 0-0, policy_version 55757 (0.00084) [2022-07-09 02:53:23,135][25689] Fps is (10 sec: 5691.2, 60 sec: 5747.4, 300 sec: 5742.8). Total num frames: 57099264. Throughput: 0: 5159.6. Samples: 57095032. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-09 02:53:23,136][25689] Avg episode reward: [(0, '-55.057')] [2022-07-09 02:53:24,170][26022] Updated weights on worker 0-0, policy_version 55767 (0.00092) [2022-07-09 02:53:25,636][26022] Updated weights on worker 0-0, policy_version 55777 (0.00092) [2022-07-09 02:53:27,562][26022] Updated weights on worker 0-0, policy_version 55787 (0.00079) [2022-07-09 02:53:28,191][25689] Fps is (10 sec: 5895.6, 60 sec: 5715.6, 300 sec: 5738.5). Total num frames: 57128960. Throughput: 0: 6040.7. Samples: 57129866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:28,192][25689] Avg episode reward: [(0, '-54.386')] [2022-07-09 02:53:29,307][26022] Updated weights on worker 0-0, policy_version 55797 (0.00088) [2022-07-09 02:53:31,127][26022] Updated weights on worker 0-0, policy_version 55807 (0.00084) [2022-07-09 02:53:32,960][26022] Updated weights on worker 0-0, policy_version 55817 (0.00098) [2022-07-09 02:53:33,207][25689] Fps is (10 sec: 5795.4, 60 sec: 5769.7, 300 sec: 5742.2). Total num frames: 57157632. Throughput: 0: 6046.8. Samples: 57164754. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:33,209][25689] Avg episode reward: [(0, '-54.737')] [2022-07-09 02:53:34,548][26022] Updated weights on worker 0-0, policy_version 55827 (0.00087) [2022-07-09 02:53:36,374][26022] Updated weights on worker 0-0, policy_version 55837 (0.00084) [2022-07-09 02:53:37,919][26022] Updated weights on worker 0-0, policy_version 55847 (0.00080) [2022-07-09 02:53:38,214][25689] Fps is (10 sec: 5823.2, 60 sec: 5771.0, 300 sec: 5742.8). Total num frames: 57187328. Throughput: 0: 5198.5. Samples: 57182188. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:38,214][25689] Avg episode reward: [(0, '-54.850')] [2022-07-09 02:53:39,825][26022] Updated weights on worker 0-0, policy_version 55857 (0.00090) [2022-07-09 02:53:41,713][26022] Updated weights on worker 0-0, policy_version 55867 (0.00084) [2022-07-09 02:53:43,216][25689] Fps is (10 sec: 5831.5, 60 sec: 5778.3, 300 sec: 5741.1). Total num frames: 57216000. Throughput: 0: 6068.1. Samples: 57216882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:43,216][25689] Avg episode reward: [(0, '-54.859')] [2022-07-09 02:53:43,439][26022] Updated weights on worker 0-0, policy_version 55877 (0.00086) [2022-07-09 02:53:45,342][26022] Updated weights on worker 0-0, policy_version 55887 (0.00088) [2022-07-09 02:53:46,983][26022] Updated weights on worker 0-0, policy_version 55897 (0.00097) [2022-07-09 02:53:48,262][25689] Fps is (10 sec: 5808.9, 60 sec: 5764.8, 300 sec: 5744.6). Total num frames: 57245696. Throughput: 0: 6063.2. Samples: 57251562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:48,263][25689] Avg episode reward: [(0, '-54.826')] [2022-07-09 02:53:49,051][26022] Updated weights on worker 0-0, policy_version 55907 (0.00087) [2022-07-09 02:53:50,457][26022] Updated weights on worker 0-0, policy_version 55917 (0.00084) [2022-07-09 02:53:52,599][26022] Updated weights on worker 0-0, policy_version 55927 (0.00089) [2022-07-09 02:53:53,267][25689] Fps is (10 sec: 5806.9, 60 sec: 5765.0, 300 sec: 5744.6). Total num frames: 57274368. Throughput: 0: 5184.8. Samples: 57268764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:53,268][25689] Avg episode reward: [(0, '-55.255')] [2022-07-09 02:53:54,071][26022] Updated weights on worker 0-0, policy_version 55937 (0.00086) [2022-07-09 02:53:55,976][26022] Updated weights on worker 0-0, policy_version 55947 (0.00091) [2022-07-09 02:53:57,718][26022] Updated weights on worker 0-0, policy_version 55957 (0.00085) [2022-07-09 02:53:58,276][25689] Fps is (10 sec: 5726.6, 60 sec: 5766.7, 300 sec: 5748.3). Total num frames: 57303040. Throughput: 0: 6038.8. Samples: 57303336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:53:58,276][25689] Avg episode reward: [(0, '-56.185')] [2022-07-09 02:53:59,531][26022] Updated weights on worker 0-0, policy_version 55967 (0.00091) [2022-07-09 02:54:01,071][26022] Updated weights on worker 0-0, policy_version 55977 (0.00084) [2022-07-09 02:54:03,282][25689] Fps is (10 sec: 5419.3, 60 sec: 5733.4, 300 sec: 5739.4). Total num frames: 57328640. Throughput: 0: 5949.8. Samples: 57336270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:03,284][25689] Avg episode reward: [(0, '-55.955')] [2022-07-09 02:54:03,516][26022] Updated weights on worker 0-0, policy_version 55987 (0.00084) [2022-07-09 02:54:05,171][26022] Updated weights on worker 0-0, policy_version 55997 (0.00087) [2022-07-09 02:54:07,021][26022] Updated weights on worker 0-0, policy_version 56007 (0.00087) [2022-07-09 02:54:08,356][25689] Fps is (10 sec: 5688.7, 60 sec: 5753.5, 300 sec: 5745.2). Total num frames: 57360384. Throughput: 0: 5079.0. Samples: 57353620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:08,357][25689] Avg episode reward: [(0, '-55.482')] [2022-07-09 02:54:08,575][26022] Updated weights on worker 0-0, policy_version 56017 (0.01297) [2022-07-09 02:54:10,469][26022] Updated weights on worker 0-0, policy_version 56027 (0.00089) [2022-07-09 02:54:12,047][26022] Updated weights on worker 0-0, policy_version 56037 (0.00110) [2022-07-09 02:54:13,394][25689] Fps is (10 sec: 5974.9, 60 sec: 5753.6, 300 sec: 5744.7). Total num frames: 57389056. Throughput: 0: 5958.1. Samples: 57388678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:13,394][25689] Avg episode reward: [(0, '-55.858')] [2022-07-09 02:54:14,106][26022] Updated weights on worker 0-0, policy_version 56047 (0.00082) [2022-07-09 02:54:15,611][26022] Updated weights on worker 0-0, policy_version 56057 (0.00082) [2022-07-09 02:54:17,668][26022] Updated weights on worker 0-0, policy_version 56067 (0.00094) [2022-07-09 02:54:18,415][25689] Fps is (10 sec: 5701.0, 60 sec: 5774.0, 300 sec: 5744.3). Total num frames: 57417728. Throughput: 0: 5969.8. Samples: 57423562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:18,415][25689] Avg episode reward: [(0, '-55.586')] [2022-07-09 02:54:19,078][26022] Updated weights on worker 0-0, policy_version 56077 (0.00094) [2022-07-09 02:54:20,213][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:54:20,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000056082_57427968.pth [2022-07-09 02:54:20,226][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000054061_55358464.pth [2022-07-09 02:54:21,202][26022] Updated weights on worker 0-0, policy_version 56087 (0.00078) [2022-07-09 02:54:22,611][26022] Updated weights on worker 0-0, policy_version 56097 (0.00088) [2022-07-09 02:54:23,425][25689] Fps is (10 sec: 5614.2, 60 sec: 5740.8, 300 sec: 5748.3). Total num frames: 57445376. Throughput: 0: 5193.7. Samples: 57440888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:23,431][25689] Avg episode reward: [(0, '-55.901')] [2022-07-09 02:54:24,470][26022] Updated weights on worker 0-0, policy_version 56107 (0.00083) [2022-07-09 02:54:26,313][26022] Updated weights on worker 0-0, policy_version 56117 (0.00089) [2022-07-09 02:54:28,161][26022] Updated weights on worker 0-0, policy_version 56127 (0.00087) [2022-07-09 02:54:28,552][25689] Fps is (10 sec: 5757.8, 60 sec: 5751.0, 300 sec: 5739.0). Total num frames: 57476096. Throughput: 0: 6030.7. Samples: 57475416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:28,553][25689] Avg episode reward: [(0, '-55.305')] [2022-07-09 02:54:29,994][26022] Updated weights on worker 0-0, policy_version 56137 (0.00095) [2022-07-09 02:54:31,586][26022] Updated weights on worker 0-0, policy_version 56147 (0.00090) [2022-07-09 02:54:33,257][26022] Updated weights on worker 0-0, policy_version 56157 (0.00085) [2022-07-09 02:54:33,630][25689] Fps is (10 sec: 5920.3, 60 sec: 5762.0, 300 sec: 5744.5). Total num frames: 57505792. Throughput: 0: 6014.6. Samples: 57510392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 02:54:33,631][25689] Avg episode reward: [(0, '-55.264')] [2022-07-09 02:54:35,296][26022] Updated weights on worker 0-0, policy_version 56167 (0.00090) [2022-07-09 02:54:36,930][26022] Updated weights on worker 0-0, policy_version 56177 (0.00095) [2022-07-09 02:54:38,684][25689] Fps is (10 sec: 5760.8, 60 sec: 5740.6, 300 sec: 5747.0). Total num frames: 57534464. Throughput: 0: 5136.9. Samples: 57527682. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:54:38,684][25689] Avg episode reward: [(0, '-55.108')] [2022-07-09 02:54:38,715][26022] Updated weights on worker 0-0, policy_version 56187 (0.00088) [2022-07-09 02:54:40,519][26022] Updated weights on worker 0-0, policy_version 56197 (0.01000) [2022-07-09 02:54:42,199][26022] Updated weights on worker 0-0, policy_version 56207 (0.00090) [2022-07-09 02:54:43,712][25689] Fps is (10 sec: 5687.8, 60 sec: 5738.1, 300 sec: 5741.0). Total num frames: 57563136. Throughput: 0: 5991.3. Samples: 57562432. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:54:43,713][25689] Avg episode reward: [(0, '-54.726')] [2022-07-09 02:54:44,083][26022] Updated weights on worker 0-0, policy_version 56217 (0.00082) [2022-07-09 02:54:45,757][26022] Updated weights on worker 0-0, policy_version 56227 (0.00092) [2022-07-09 02:54:47,522][26022] Updated weights on worker 0-0, policy_version 56237 (0.00081) [2022-07-09 02:54:48,779][25689] Fps is (10 sec: 5883.4, 60 sec: 5753.1, 300 sec: 5747.1). Total num frames: 57593856. Throughput: 0: 6020.7. Samples: 57597196. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:54:48,779][25689] Avg episode reward: [(0, '-54.848')] [2022-07-09 02:54:49,358][26022] Updated weights on worker 0-0, policy_version 56247 (0.00088) [2022-07-09 02:54:51,162][26022] Updated weights on worker 0-0, policy_version 56257 (0.00085) [2022-07-09 02:54:52,743][26022] Updated weights on worker 0-0, policy_version 56267 (0.00086) [2022-07-09 02:54:53,781][25689] Fps is (10 sec: 5898.8, 60 sec: 5753.5, 300 sec: 5747.2). Total num frames: 57622528. Throughput: 0: 5176.5. Samples: 57614700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:54:53,782][25689] Avg episode reward: [(0, '-55.239')] [2022-07-09 02:54:54,788][26022] Updated weights on worker 0-0, policy_version 56277 (0.00086) [2022-07-09 02:54:56,363][26022] Updated weights on worker 0-0, policy_version 56287 (0.00084) [2022-07-09 02:54:58,273][26022] Updated weights on worker 0-0, policy_version 56297 (0.00085) [2022-07-09 02:54:58,796][25689] Fps is (10 sec: 5826.5, 60 sec: 5769.7, 300 sec: 5747.4). Total num frames: 57652224. Throughput: 0: 6060.9. Samples: 57649582. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:54:58,798][25689] Avg episode reward: [(0, '-55.753')] [2022-07-09 02:54:59,865][26022] Updated weights on worker 0-0, policy_version 56307 (0.00082) [2022-07-09 02:55:01,599][26022] Updated weights on worker 0-0, policy_version 56317 (0.00091) [2022-07-09 02:55:03,625][26022] Updated weights on worker 0-0, policy_version 56327 (0.00057) [2022-07-09 02:55:03,805][25689] Fps is (10 sec: 5618.5, 60 sec: 5786.4, 300 sec: 5752.0). Total num frames: 57678848. Throughput: 0: 5979.2. Samples: 57682570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:03,805][25689] Avg episode reward: [(0, '-56.677')] [2022-07-09 02:55:05,726][26022] Updated weights on worker 0-0, policy_version 56337 (0.00082) [2022-07-09 02:55:07,244][26022] Updated weights on worker 0-0, policy_version 56347 (0.00083) [2022-07-09 02:55:08,938][25689] Fps is (10 sec: 5553.6, 60 sec: 5747.0, 300 sec: 5746.1). Total num frames: 57708544. Throughput: 0: 5961.9. Samples: 57717382. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:08,938][25689] Avg episode reward: [(0, '-56.794')] [2022-07-09 02:55:09,253][26022] Updated weights on worker 0-0, policy_version 56357 (0.00096) [2022-07-09 02:55:10,782][26022] Updated weights on worker 0-0, policy_version 56367 (0.00094) [2022-07-09 02:55:12,726][26022] Updated weights on worker 0-0, policy_version 56377 (0.00091) [2022-07-09 02:55:13,973][25689] Fps is (10 sec: 5739.9, 60 sec: 5747.1, 300 sec: 5749.2). Total num frames: 57737216. Throughput: 0: 5940.5. Samples: 57734656. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:13,974][25689] Avg episode reward: [(0, '-57.022')] [2022-07-09 02:55:14,371][26022] Updated weights on worker 0-0, policy_version 56387 (0.00087) [2022-07-09 02:55:16,130][26022] Updated weights on worker 0-0, policy_version 56397 (0.00089) [2022-07-09 02:55:17,727][26022] Updated weights on worker 0-0, policy_version 56407 (0.00081) [2022-07-09 02:55:19,063][25689] Fps is (10 sec: 5663.3, 60 sec: 5740.6, 300 sec: 5744.8). Total num frames: 57765888. Throughput: 0: 5931.6. Samples: 57769796. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:19,064][25689] Avg episode reward: [(0, '-57.282')] [2022-07-09 02:55:19,531][26022] Updated weights on worker 0-0, policy_version 56417 (0.00084) [2022-07-09 02:55:21,396][26022] Updated weights on worker 0-0, policy_version 56427 (0.00087) [2022-07-09 02:55:23,089][26022] Updated weights on worker 0-0, policy_version 56437 (0.00079) [2022-07-09 02:55:24,120][25689] Fps is (10 sec: 5853.4, 60 sec: 5786.9, 300 sec: 5752.0). Total num frames: 57796608. Throughput: 0: 6022.7. Samples: 57804924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:24,121][25689] Avg episode reward: [(0, '-56.900')] [2022-07-09 02:55:24,892][26022] Updated weights on worker 0-0, policy_version 56447 (0.00087) [2022-07-09 02:55:26,719][26022] Updated weights on worker 0-0, policy_version 56457 (0.00094) [2022-07-09 02:55:28,425][26022] Updated weights on worker 0-0, policy_version 56467 (0.00095) [2022-07-09 02:55:29,231][25689] Fps is (10 sec: 5841.1, 60 sec: 5754.6, 300 sec: 5748.1). Total num frames: 57825280. Throughput: 0: 5164.0. Samples: 57822178. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:29,232][25689] Avg episode reward: [(0, '-56.916')] [2022-07-09 02:55:30,438][26022] Updated weights on worker 0-0, policy_version 56477 (0.00086) [2022-07-09 02:55:32,027][26022] Updated weights on worker 0-0, policy_version 56487 (0.00081) [2022-07-09 02:55:33,836][26022] Updated weights on worker 0-0, policy_version 56497 (0.00087) [2022-07-09 02:55:34,259][25689] Fps is (10 sec: 5756.6, 60 sec: 5759.3, 300 sec: 5751.2). Total num frames: 57854976. Throughput: 0: 6035.4. Samples: 57857090. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:34,260][25689] Avg episode reward: [(0, '-56.427')] [2022-07-09 02:55:35,513][26022] Updated weights on worker 0-0, policy_version 56507 (0.00080) [2022-07-09 02:55:37,387][26022] Updated weights on worker 0-0, policy_version 56517 (0.00082) [2022-07-09 02:55:38,906][26022] Updated weights on worker 0-0, policy_version 56527 (0.00096) [2022-07-09 02:55:39,300][25689] Fps is (10 sec: 6000.2, 60 sec: 5794.4, 300 sec: 5758.2). Total num frames: 57885696. Throughput: 0: 6053.8. Samples: 57892306. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 02:55:39,300][25689] Avg episode reward: [(0, '-57.030')] [2022-07-09 02:55:40,931][26022] Updated weights on worker 0-0, policy_version 56537 (0.00084) [2022-07-09 02:55:42,388][26022] Updated weights on worker 0-0, policy_version 56547 (0.00086) [2022-07-09 02:55:44,345][25689] Fps is (10 sec: 5787.3, 60 sec: 5775.9, 300 sec: 5748.4). Total num frames: 57913344. Throughput: 0: 5192.2. Samples: 57909936. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:55:44,345][25689] Avg episode reward: [(0, '-57.262')] [2022-07-09 02:55:44,448][26022] Updated weights on worker 0-0, policy_version 56557 (0.00058) [2022-07-09 02:55:45,882][26022] Updated weights on worker 0-0, policy_version 56567 (0.00091) [2022-07-09 02:55:48,019][26022] Updated weights on worker 0-0, policy_version 56577 (0.00094) [2022-07-09 02:55:49,395][25689] Fps is (10 sec: 5782.1, 60 sec: 5777.5, 300 sec: 5755.0). Total num frames: 57944064. Throughput: 0: 6083.2. Samples: 57944838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:55:49,395][25689] Avg episode reward: [(0, '-57.509')] [2022-07-09 02:55:49,463][26022] Updated weights on worker 0-0, policy_version 56587 (0.00088) [2022-07-09 02:55:51,516][26022] Updated weights on worker 0-0, policy_version 56597 (0.00081) [2022-07-09 02:55:53,080][26022] Updated weights on worker 0-0, policy_version 56607 (0.00093) [2022-07-09 02:55:54,452][25689] Fps is (10 sec: 5774.9, 60 sec: 5755.3, 300 sec: 5750.6). Total num frames: 57971712. Throughput: 0: 6066.0. Samples: 57979580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:55:54,453][25689] Avg episode reward: [(0, '-57.659')] [2022-07-09 02:55:54,994][26022] Updated weights on worker 0-0, policy_version 56617 (0.00094) [2022-07-09 02:55:56,574][26022] Updated weights on worker 0-0, policy_version 56627 (0.00095) [2022-07-09 02:55:58,436][26022] Updated weights on worker 0-0, policy_version 56637 (0.00083) [2022-07-09 02:55:59,487][25689] Fps is (10 sec: 5783.1, 60 sec: 5770.3, 300 sec: 5757.0). Total num frames: 58002432. Throughput: 0: 5186.9. Samples: 57997016. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:55:59,488][25689] Avg episode reward: [(0, '-57.718')] [2022-07-09 02:56:00,171][26022] Updated weights on worker 0-0, policy_version 56647 (0.00086) [2022-07-09 02:56:02,439][26022] Updated weights on worker 0-0, policy_version 56657 (0.00088) [2022-07-09 02:56:04,130][26022] Updated weights on worker 0-0, policy_version 56667 (0.00097) [2022-07-09 02:56:04,501][25689] Fps is (10 sec: 5808.7, 60 sec: 5786.7, 300 sec: 5765.8). Total num frames: 58030080. Throughput: 0: 5946.6. Samples: 58029794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:04,502][25689] Avg episode reward: [(0, '-57.914')] [2022-07-09 02:56:06,081][26022] Updated weights on worker 0-0, policy_version 56677 (0.00078) [2022-07-09 02:56:07,483][26022] Updated weights on worker 0-0, policy_version 56687 (0.00084) [2022-07-09 02:56:09,548][25689] Fps is (10 sec: 5394.7, 60 sec: 5744.3, 300 sec: 5748.3). Total num frames: 58056704. Throughput: 0: 5939.0. Samples: 58064526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:09,548][25689] Avg episode reward: [(0, '-57.332')] [2022-07-09 02:56:09,642][26022] Updated weights on worker 0-0, policy_version 56697 (0.00090) [2022-07-09 02:56:10,967][26022] Updated weights on worker 0-0, policy_version 56707 (0.00085) [2022-07-09 02:56:13,247][26022] Updated weights on worker 0-0, policy_version 56717 (0.00089) [2022-07-09 02:56:14,511][26022] Updated weights on worker 0-0, policy_version 56727 (0.00083) [2022-07-09 02:56:14,597][25689] Fps is (10 sec: 5882.4, 60 sec: 5810.5, 300 sec: 5761.8). Total num frames: 58089472. Throughput: 0: 5070.6. Samples: 58081728. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:14,598][25689] Avg episode reward: [(0, '-56.888')] [2022-07-09 02:56:16,742][26022] Updated weights on worker 0-0, policy_version 56737 (0.00087) [2022-07-09 02:56:18,085][26022] Updated weights on worker 0-0, policy_version 56747 (0.00089) [2022-07-09 02:56:19,645][25689] Fps is (10 sec: 5882.2, 60 sec: 5780.8, 300 sec: 5754.5). Total num frames: 58116096. Throughput: 0: 5929.4. Samples: 58116536. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:19,645][25689] Avg episode reward: [(0, '-56.206')] [2022-07-09 02:56:20,292][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:56:20,300][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000056757_58119168.pth [2022-07-09 02:56:20,303][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000054734_56047616.pth [2022-07-09 02:56:20,307][26022] Updated weights on worker 0-0, policy_version 56757 (0.00081) [2022-07-09 02:56:21,704][26022] Updated weights on worker 0-0, policy_version 56767 (0.00099) [2022-07-09 02:56:23,791][26022] Updated weights on worker 0-0, policy_version 56777 (0.00084) [2022-07-09 02:56:24,696][25689] Fps is (10 sec: 5678.2, 60 sec: 5781.3, 300 sec: 5762.6). Total num frames: 58146816. Throughput: 0: 6014.9. Samples: 58151268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:24,697][25689] Avg episode reward: [(0, '-56.369')] [2022-07-09 02:56:25,347][26022] Updated weights on worker 0-0, policy_version 56787 (0.00083) [2022-07-09 02:56:27,370][26022] Updated weights on worker 0-0, policy_version 56797 (0.00084) [2022-07-09 02:56:28,947][26022] Updated weights on worker 0-0, policy_version 56807 (0.00077) [2022-07-09 02:56:29,818][25689] Fps is (10 sec: 5737.3, 60 sec: 5763.3, 300 sec: 5750.4). Total num frames: 58174464. Throughput: 0: 5119.2. Samples: 58168294. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:29,819][25689] Avg episode reward: [(0, '-55.900')] [2022-07-09 02:56:31,007][26022] Updated weights on worker 0-0, policy_version 56817 (0.00088) [2022-07-09 02:56:32,573][26022] Updated weights on worker 0-0, policy_version 56827 (0.00052) [2022-07-09 02:56:34,484][26022] Updated weights on worker 0-0, policy_version 56837 (0.00092) [2022-07-09 02:56:34,838][25689] Fps is (10 sec: 5553.2, 60 sec: 5747.2, 300 sec: 5750.4). Total num frames: 58203136. Throughput: 0: 5986.2. Samples: 58202894. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:34,839][25689] Avg episode reward: [(0, '-56.487')] [2022-07-09 02:56:35,931][26022] Updated weights on worker 0-0, policy_version 56847 (0.00079) [2022-07-09 02:56:38,156][26022] Updated weights on worker 0-0, policy_version 56857 (0.00089) [2022-07-09 02:56:39,375][26022] Updated weights on worker 0-0, policy_version 56867 (0.00089) [2022-07-09 02:56:39,842][25689] Fps is (10 sec: 5924.9, 60 sec: 5750.7, 300 sec: 5761.3). Total num frames: 58233856. Throughput: 0: 5992.5. Samples: 58237570. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:39,843][25689] Avg episode reward: [(0, '-56.681')] [2022-07-09 02:56:41,692][26022] Updated weights on worker 0-0, policy_version 56877 (0.00087) [2022-07-09 02:56:43,055][26022] Updated weights on worker 0-0, policy_version 56887 (0.00087) [2022-07-09 02:56:44,862][25689] Fps is (10 sec: 5720.6, 60 sec: 5736.1, 300 sec: 5753.0). Total num frames: 58260480. Throughput: 0: 5141.9. Samples: 58254960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:44,864][25689] Avg episode reward: [(0, '-56.781')] [2022-07-09 02:56:45,231][26022] Updated weights on worker 0-0, policy_version 56897 (0.00100) [2022-07-09 02:56:46,699][26022] Updated weights on worker 0-0, policy_version 56907 (0.00094) [2022-07-09 02:56:48,766][26022] Updated weights on worker 0-0, policy_version 56917 (0.00086) [2022-07-09 02:56:49,920][25689] Fps is (10 sec: 5689.9, 60 sec: 5735.4, 300 sec: 5752.7). Total num frames: 58291200. Throughput: 0: 6037.6. Samples: 58289664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 02:56:49,921][25689] Avg episode reward: [(0, '-57.287')] [2022-07-09 02:56:50,101][26022] Updated weights on worker 0-0, policy_version 56927 (0.00083) [2022-07-09 02:56:52,360][26022] Updated weights on worker 0-0, policy_version 56937 (0.00093) [2022-07-09 02:56:53,882][26022] Updated weights on worker 0-0, policy_version 56947 (0.00083) [2022-07-09 02:56:54,972][25689] Fps is (10 sec: 5773.7, 60 sec: 5735.9, 300 sec: 5755.3). Total num frames: 58318848. Throughput: 0: 6009.8. Samples: 58323892. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:56:54,973][25689] Avg episode reward: [(0, '-57.714')] [2022-07-09 02:56:55,800][26022] Updated weights on worker 0-0, policy_version 56957 (0.00083) [2022-07-09 02:56:57,463][26022] Updated weights on worker 0-0, policy_version 56967 (0.00092) [2022-07-09 02:56:59,357][26022] Updated weights on worker 0-0, policy_version 56977 (0.00097) [2022-07-09 02:57:00,043][25689] Fps is (10 sec: 5564.0, 60 sec: 5698.8, 300 sec: 5754.4). Total num frames: 58347520. Throughput: 0: 5135.4. Samples: 58341306. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:00,044][25689] Avg episode reward: [(0, '-57.576')] [2022-07-09 02:57:00,936][26022] Updated weights on worker 0-0, policy_version 56987 (0.00081) [2022-07-09 02:57:03,443][26022] Updated weights on worker 0-0, policy_version 56997 (0.00082) [2022-07-09 02:57:04,759][26022] Updated weights on worker 0-0, policy_version 57007 (0.00095) [2022-07-09 02:57:05,101][25689] Fps is (10 sec: 5863.1, 60 sec: 5745.1, 300 sec: 5755.6). Total num frames: 58378240. Throughput: 0: 5884.0. Samples: 58374048. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:05,104][25689] Avg episode reward: [(0, '-57.853')] [2022-07-09 02:57:07,005][26022] Updated weights on worker 0-0, policy_version 57017 (0.00066) [2022-07-09 02:57:08,334][26022] Updated weights on worker 0-0, policy_version 57027 (0.00090) [2022-07-09 02:57:10,177][25689] Fps is (10 sec: 5557.4, 60 sec: 5725.6, 300 sec: 5744.3). Total num frames: 58403840. Throughput: 0: 5868.0. Samples: 58408528. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:10,177][25689] Avg episode reward: [(0, '-57.577')] [2022-07-09 02:57:10,444][26022] Updated weights on worker 0-0, policy_version 57037 (0.00091) [2022-07-09 02:57:11,948][26022] Updated weights on worker 0-0, policy_version 57047 (0.00084) [2022-07-09 02:57:13,913][26022] Updated weights on worker 0-0, policy_version 57057 (0.00092) [2022-07-09 02:57:15,194][25689] Fps is (10 sec: 5478.7, 60 sec: 5677.9, 300 sec: 5751.2). Total num frames: 58433536. Throughput: 0: 5912.5. Samples: 58443458. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:15,195][25689] Avg episode reward: [(0, '-57.253')] [2022-07-09 02:57:15,737][26022] Updated weights on worker 0-0, policy_version 57067 (0.00070) [2022-07-09 02:57:17,439][26022] Updated weights on worker 0-0, policy_version 57077 (0.00102) [2022-07-09 02:57:18,946][26022] Updated weights on worker 0-0, policy_version 57087 (0.00078) [2022-07-09 02:57:20,202][25689] Fps is (10 sec: 5822.2, 60 sec: 5715.5, 300 sec: 5748.8). Total num frames: 58462208. Throughput: 0: 5928.2. Samples: 58460814. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:20,202][25689] Avg episode reward: [(0, '-57.031')] [2022-07-09 02:57:20,928][26022] Updated weights on worker 0-0, policy_version 57097 (0.00085) [2022-07-09 02:57:22,626][26022] Updated weights on worker 0-0, policy_version 57107 (0.00091) [2022-07-09 02:57:24,575][26022] Updated weights on worker 0-0, policy_version 57117 (0.00089) [2022-07-09 02:57:25,204][25689] Fps is (10 sec: 5831.1, 60 sec: 5703.2, 300 sec: 5743.4). Total num frames: 58491904. Throughput: 0: 6038.5. Samples: 58495438. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:25,206][25689] Avg episode reward: [(0, '-57.189')] [2022-07-09 02:57:26,041][26022] Updated weights on worker 0-0, policy_version 57127 (0.00095) [2022-07-09 02:57:28,153][26022] Updated weights on worker 0-0, policy_version 57137 (0.00084) [2022-07-09 02:57:29,922][26022] Updated weights on worker 0-0, policy_version 57147 (0.00086) [2022-07-09 02:57:30,249][25689] Fps is (10 sec: 5809.4, 60 sec: 5727.4, 300 sec: 5753.8). Total num frames: 58520576. Throughput: 0: 6056.7. Samples: 58530098. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:30,249][25689] Avg episode reward: [(0, '-57.192')] [2022-07-09 02:57:31,732][26022] Updated weights on worker 0-0, policy_version 57157 (0.00090) [2022-07-09 02:57:33,291][26022] Updated weights on worker 0-0, policy_version 57167 (0.00090) [2022-07-09 02:57:35,260][26022] Updated weights on worker 0-0, policy_version 57177 (0.00091) [2022-07-09 02:57:35,263][25689] Fps is (10 sec: 5599.1, 60 sec: 5711.0, 300 sec: 5747.1). Total num frames: 58548224. Throughput: 0: 5184.5. Samples: 58547504. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:35,263][25689] Avg episode reward: [(0, '-57.276')] [2022-07-09 02:57:36,842][26022] Updated weights on worker 0-0, policy_version 57187 (0.00088) [2022-07-09 02:57:38,653][26022] Updated weights on worker 0-0, policy_version 57197 (0.00105) [2022-07-09 02:57:40,275][25689] Fps is (10 sec: 5821.9, 60 sec: 5710.3, 300 sec: 5755.3). Total num frames: 58578944. Throughput: 0: 6047.9. Samples: 58582212. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:40,275][25689] Avg episode reward: [(0, '-57.401')] [2022-07-09 02:57:40,407][26022] Updated weights on worker 0-0, policy_version 57207 (0.00087) [2022-07-09 02:57:42,057][26022] Updated weights on worker 0-0, policy_version 57217 (0.00084) [2022-07-09 02:57:43,910][26022] Updated weights on worker 0-0, policy_version 57227 (0.00083) [2022-07-09 02:57:45,298][25689] Fps is (10 sec: 6020.5, 60 sec: 5760.8, 300 sec: 5753.0). Total num frames: 58608640. Throughput: 0: 6065.9. Samples: 58617326. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:45,298][25689] Avg episode reward: [(0, '-57.755')] [2022-07-09 02:57:45,626][26022] Updated weights on worker 0-0, policy_version 57237 (0.00096) [2022-07-09 02:57:47,509][26022] Updated weights on worker 0-0, policy_version 57247 (0.00099) [2022-07-09 02:57:49,175][26022] Updated weights on worker 0-0, policy_version 57257 (0.00086) [2022-07-09 02:57:50,371][25689] Fps is (10 sec: 5679.4, 60 sec: 5708.6, 300 sec: 5748.2). Total num frames: 58636288. Throughput: 0: 5205.8. Samples: 58634852. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:50,372][25689] Avg episode reward: [(0, '-57.385')] [2022-07-09 02:57:50,822][26022] Updated weights on worker 0-0, policy_version 57267 (0.00104) [2022-07-09 02:57:52,744][26022] Updated weights on worker 0-0, policy_version 57277 (0.00091) [2022-07-09 02:57:54,477][26022] Updated weights on worker 0-0, policy_version 57287 (0.01087) [2022-07-09 02:57:55,396][25689] Fps is (10 sec: 5780.4, 60 sec: 5762.0, 300 sec: 5755.2). Total num frames: 58667008. Throughput: 0: 6070.3. Samples: 58669716. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:57:55,396][25689] Avg episode reward: [(0, '-57.378')] [2022-07-09 02:57:56,289][26022] Updated weights on worker 0-0, policy_version 57297 (0.00086) [2022-07-09 02:57:58,003][26022] Updated weights on worker 0-0, policy_version 57307 (0.00091) [2022-07-09 02:57:59,789][26022] Updated weights on worker 0-0, policy_version 57317 (0.00091) [2022-07-09 02:58:00,397][25689] Fps is (10 sec: 5821.8, 60 sec: 5751.6, 300 sec: 5755.4). Total num frames: 58694656. Throughput: 0: 6085.1. Samples: 58704660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:58:00,398][25689] Avg episode reward: [(0, '-56.177')] [2022-07-09 02:58:01,971][26022] Updated weights on worker 0-0, policy_version 57327 (0.00097) [2022-07-09 02:58:03,829][26022] Updated weights on worker 0-0, policy_version 57337 (0.00088) [2022-07-09 02:58:05,492][25689] Fps is (10 sec: 5477.0, 60 sec: 5697.4, 300 sec: 5745.3). Total num frames: 58722304. Throughput: 0: 5069.2. Samples: 58719692. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 02:58:05,492][25689] Avg episode reward: [(0, '-55.742')] [2022-07-09 02:58:05,584][26022] Updated weights on worker 0-0, policy_version 57347 (0.00097) [2022-07-09 02:58:07,311][26022] Updated weights on worker 0-0, policy_version 57357 (0.00090) [2022-07-09 02:58:08,830][26022] Updated weights on worker 0-0, policy_version 57367 (0.00086) [2022-07-09 02:58:10,533][25689] Fps is (10 sec: 5657.7, 60 sec: 5768.4, 300 sec: 5748.7). Total num frames: 58752000. Throughput: 0: 5926.6. Samples: 58754340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:10,533][25689] Avg episode reward: [(0, '-56.517')] [2022-07-09 02:58:10,821][26022] Updated weights on worker 0-0, policy_version 57377 (0.00088) [2022-07-09 02:58:12,668][26022] Updated weights on worker 0-0, policy_version 57387 (0.00082) [2022-07-09 02:58:14,367][26022] Updated weights on worker 0-0, policy_version 57397 (0.00079) [2022-07-09 02:58:15,540][25689] Fps is (10 sec: 5706.9, 60 sec: 5735.5, 300 sec: 5749.7). Total num frames: 58779648. Throughput: 0: 5938.9. Samples: 58789350. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:15,540][25689] Avg episode reward: [(0, '-55.952')] [2022-07-09 02:58:15,957][26022] Updated weights on worker 0-0, policy_version 57407 (0.00094) [2022-07-09 02:58:18,067][26022] Updated weights on worker 0-0, policy_version 57417 (0.00768) [2022-07-09 02:58:19,511][26022] Updated weights on worker 0-0, policy_version 57427 (0.00082) [2022-07-09 02:58:20,372][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 02:58:20,384][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000057431_58809344.pth [2022-07-09 02:58:20,385][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000055408_56737792.pth [2022-07-09 02:58:20,583][25689] Fps is (10 sec: 5705.9, 60 sec: 5749.1, 300 sec: 5749.2). Total num frames: 58809344. Throughput: 0: 5051.8. Samples: 58806634. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:20,583][25689] Avg episode reward: [(0, '-56.164')] [2022-07-09 02:58:21,423][26022] Updated weights on worker 0-0, policy_version 57437 (0.00087) [2022-07-09 02:58:23,154][26022] Updated weights on worker 0-0, policy_version 57447 (0.00088) [2022-07-09 02:58:24,997][26022] Updated weights on worker 0-0, policy_version 57457 (0.00083) [2022-07-09 02:58:25,607][25689] Fps is (10 sec: 5899.4, 60 sec: 5747.0, 300 sec: 5749.8). Total num frames: 58839040. Throughput: 0: 6047.4. Samples: 58841338. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:25,608][25689] Avg episode reward: [(0, '-57.429')] [2022-07-09 02:58:26,749][26022] Updated weights on worker 0-0, policy_version 57467 (0.00082) [2022-07-09 02:58:28,410][26022] Updated weights on worker 0-0, policy_version 57477 (0.00080) [2022-07-09 02:58:30,565][26022] Updated weights on worker 0-0, policy_version 57487 (0.00087) [2022-07-09 02:58:30,709][25689] Fps is (10 sec: 5763.8, 60 sec: 5741.5, 300 sec: 5748.1). Total num frames: 58867712. Throughput: 0: 6031.2. Samples: 58876030. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:30,710][25689] Avg episode reward: [(0, '-56.897')] [2022-07-09 02:58:31,914][26022] Updated weights on worker 0-0, policy_version 57497 (0.00086) [2022-07-09 02:58:33,843][26022] Updated weights on worker 0-0, policy_version 57507 (0.00089) [2022-07-09 02:58:35,395][26022] Updated weights on worker 0-0, policy_version 57517 (0.00089) [2022-07-09 02:58:35,728][25689] Fps is (10 sec: 5969.4, 60 sec: 5808.8, 300 sec: 5754.8). Total num frames: 58899456. Throughput: 0: 5160.2. Samples: 58893526. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:35,729][25689] Avg episode reward: [(0, '-57.527')] [2022-07-09 02:58:37,303][26022] Updated weights on worker 0-0, policy_version 57527 (0.00085) [2022-07-09 02:58:38,941][26022] Updated weights on worker 0-0, policy_version 57537 (0.00753) [2022-07-09 02:58:40,745][25689] Fps is (10 sec: 5918.2, 60 sec: 5757.5, 300 sec: 5751.0). Total num frames: 58927104. Throughput: 0: 6040.2. Samples: 58928418. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:40,745][25689] Avg episode reward: [(0, '-56.819')] [2022-07-09 02:58:40,893][26022] Updated weights on worker 0-0, policy_version 57547 (0.00084) [2022-07-09 02:58:42,699][26022] Updated weights on worker 0-0, policy_version 57557 (0.00081) [2022-07-09 02:58:44,233][26022] Updated weights on worker 0-0, policy_version 57567 (0.00084) [2022-07-09 02:58:45,759][25689] Fps is (10 sec: 5614.4, 60 sec: 5741.4, 300 sec: 5748.2). Total num frames: 58955776. Throughput: 0: 6054.5. Samples: 58963350. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:45,760][25689] Avg episode reward: [(0, '-56.684')] [2022-07-09 02:58:46,183][26022] Updated weights on worker 0-0, policy_version 57577 (0.00101) [2022-07-09 02:58:48,029][26022] Updated weights on worker 0-0, policy_version 57587 (0.00081) [2022-07-09 02:58:49,599][26022] Updated weights on worker 0-0, policy_version 57597 (0.00086) [2022-07-09 02:58:50,870][25689] Fps is (10 sec: 5764.5, 60 sec: 5771.7, 300 sec: 5749.6). Total num frames: 58985472. Throughput: 0: 5192.0. Samples: 58980706. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:50,871][25689] Avg episode reward: [(0, '-56.733')] [2022-07-09 02:58:51,744][26022] Updated weights on worker 0-0, policy_version 57607 (0.00087) [2022-07-09 02:58:53,098][26022] Updated weights on worker 0-0, policy_version 57617 (0.00091) [2022-07-09 02:58:55,146][26022] Updated weights on worker 0-0, policy_version 57627 (0.00082) [2022-07-09 02:58:55,910][25689] Fps is (10 sec: 5951.9, 60 sec: 5770.2, 300 sec: 5755.9). Total num frames: 59016192. Throughput: 0: 6039.4. Samples: 59015414. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:58:55,911][25689] Avg episode reward: [(0, '-57.202')] [2022-07-09 02:58:56,725][26022] Updated weights on worker 0-0, policy_version 57637 (0.00088) [2022-07-09 02:58:58,635][26022] Updated weights on worker 0-0, policy_version 57647 (0.00086) [2022-07-09 02:59:00,309][26022] Updated weights on worker 0-0, policy_version 57657 (0.00093) [2022-07-09 02:59:00,929][25689] Fps is (10 sec: 5803.0, 60 sec: 5768.6, 300 sec: 5762.5). Total num frames: 59043840. Throughput: 0: 6031.9. Samples: 59050164. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:59:00,929][25689] Avg episode reward: [(0, '-56.647')] [2022-07-09 02:59:02,501][26022] Updated weights on worker 0-0, policy_version 57667 (0.00087) [2022-07-09 02:59:04,196][26022] Updated weights on worker 0-0, policy_version 57677 (0.00089) [2022-07-09 02:59:05,976][25689] Fps is (10 sec: 5391.8, 60 sec: 5756.2, 300 sec: 5745.9). Total num frames: 59070464. Throughput: 0: 5051.6. Samples: 59065476. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:59:05,976][25689] Avg episode reward: [(0, '-56.072')] [2022-07-09 02:59:06,120][26022] Updated weights on worker 0-0, policy_version 57687 (0.00095) [2022-07-09 02:59:07,626][26022] Updated weights on worker 0-0, policy_version 57697 (0.00084) [2022-07-09 02:59:09,597][26022] Updated weights on worker 0-0, policy_version 57707 (0.00084) [2022-07-09 02:59:11,044][25689] Fps is (10 sec: 5669.1, 60 sec: 5770.6, 300 sec: 5752.1). Total num frames: 59101184. Throughput: 0: 5935.6. Samples: 59100446. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:59:11,044][25689] Avg episode reward: [(0, '-56.085')] [2022-07-09 02:59:11,278][26022] Updated weights on worker 0-0, policy_version 57717 (0.00078) [2022-07-09 02:59:12,993][26022] Updated weights on worker 0-0, policy_version 57727 (0.00087) [2022-07-09 02:59:14,686][26022] Updated weights on worker 0-0, policy_version 57737 (0.00082) [2022-07-09 02:59:16,047][25689] Fps is (10 sec: 5897.3, 60 sec: 5787.9, 300 sec: 5752.5). Total num frames: 59129856. Throughput: 0: 5973.9. Samples: 59135710. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 02:59:16,049][25689] Avg episode reward: [(0, '-55.175')] [2022-07-09 02:59:16,615][26022] Updated weights on worker 0-0, policy_version 57747 (0.00087) [2022-07-09 02:59:18,233][26022] Updated weights on worker 0-0, policy_version 57757 (0.00100) [2022-07-09 02:59:20,394][26022] Updated weights on worker 0-0, policy_version 57767 (0.00084) [2022-07-09 02:59:21,050][25689] Fps is (10 sec: 5833.2, 60 sec: 5791.7, 300 sec: 5759.5). Total num frames: 59159552. Throughput: 0: 5117.1. Samples: 59153126. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:21,051][25689] Avg episode reward: [(0, '-55.006')] [2022-07-09 02:59:21,485][26022] Updated weights on worker 0-0, policy_version 57777 (0.00283) [2022-07-09 02:59:23,690][26022] Updated weights on worker 0-0, policy_version 57787 (0.00082) [2022-07-09 02:59:25,202][26022] Updated weights on worker 0-0, policy_version 57797 (0.00095) [2022-07-09 02:59:26,080][25689] Fps is (10 sec: 5715.4, 60 sec: 5757.3, 300 sec: 5751.1). Total num frames: 59187200. Throughput: 0: 6111.1. Samples: 59188336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:26,082][25689] Avg episode reward: [(0, '-55.388')] [2022-07-09 02:59:27,129][26022] Updated weights on worker 0-0, policy_version 57807 (0.00088) [2022-07-09 02:59:29,055][26022] Updated weights on worker 0-0, policy_version 57817 (0.00084) [2022-07-09 02:59:30,687][26022] Updated weights on worker 0-0, policy_version 57827 (0.00080) [2022-07-09 02:59:31,198][25689] Fps is (10 sec: 5650.6, 60 sec: 5772.7, 300 sec: 5750.3). Total num frames: 59216896. Throughput: 0: 6050.2. Samples: 59222386. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:31,199][25689] Avg episode reward: [(0, '-55.983')] [2022-07-09 02:59:32,355][26022] Updated weights on worker 0-0, policy_version 57837 (0.00085) [2022-07-09 02:59:34,331][26022] Updated weights on worker 0-0, policy_version 57847 (0.00096) [2022-07-09 02:59:35,727][26022] Updated weights on worker 0-0, policy_version 57857 (0.00081) [2022-07-09 02:59:36,287][25689] Fps is (10 sec: 5818.7, 60 sec: 5732.2, 300 sec: 5753.1). Total num frames: 59246592. Throughput: 0: 5150.7. Samples: 59239962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:36,288][25689] Avg episode reward: [(0, '-56.014')] [2022-07-09 02:59:37,915][26022] Updated weights on worker 0-0, policy_version 57867 (0.00081) [2022-07-09 02:59:39,362][26022] Updated weights on worker 0-0, policy_version 57877 (0.00094) [2022-07-09 02:59:41,068][26022] Updated weights on worker 0-0, policy_version 57887 (0.00089) [2022-07-09 02:59:41,335][25689] Fps is (10 sec: 5859.2, 60 sec: 5763.1, 300 sec: 5756.1). Total num frames: 59276288. Throughput: 0: 6001.9. Samples: 59274874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:41,336][25689] Avg episode reward: [(0, '-56.213')] [2022-07-09 02:59:43,083][26022] Updated weights on worker 0-0, policy_version 57897 (0.00095) [2022-07-09 02:59:44,610][26022] Updated weights on worker 0-0, policy_version 57907 (0.00103) [2022-07-09 02:59:46,375][25689] Fps is (10 sec: 5786.3, 60 sec: 5760.7, 300 sec: 5749.8). Total num frames: 59304960. Throughput: 0: 5994.2. Samples: 59309986. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:46,377][25689] Avg episode reward: [(0, '-56.335')] [2022-07-09 02:59:46,603][26022] Updated weights on worker 0-0, policy_version 57917 (0.00085) [2022-07-09 02:59:48,283][26022] Updated weights on worker 0-0, policy_version 57927 (0.00098) [2022-07-09 02:59:49,960][26022] Updated weights on worker 0-0, policy_version 57937 (0.00089) [2022-07-09 02:59:51,458][25689] Fps is (10 sec: 5867.2, 60 sec: 5780.2, 300 sec: 5755.1). Total num frames: 59335680. Throughput: 0: 5178.9. Samples: 59327306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:51,458][25689] Avg episode reward: [(0, '-57.223')] [2022-07-09 02:59:51,895][26022] Updated weights on worker 0-0, policy_version 57947 (0.00086) [2022-07-09 02:59:53,492][26022] Updated weights on worker 0-0, policy_version 57957 (0.00084) [2022-07-09 02:59:55,366][26022] Updated weights on worker 0-0, policy_version 57967 (0.00115) [2022-07-09 02:59:56,489][25689] Fps is (10 sec: 5872.3, 60 sec: 5747.2, 300 sec: 5751.3). Total num frames: 59364352. Throughput: 0: 6058.9. Samples: 59362360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 02:59:56,489][25689] Avg episode reward: [(0, '-56.133')] [2022-07-09 02:59:57,119][26022] Updated weights on worker 0-0, policy_version 57977 (0.00091) [2022-07-09 02:59:58,698][26022] Updated weights on worker 0-0, policy_version 57987 (0.00081) [2022-07-09 03:00:00,899][26022] Updated weights on worker 0-0, policy_version 57997 (0.00083) [2022-07-09 03:00:01,497][25689] Fps is (10 sec: 5711.7, 60 sec: 5765.1, 300 sec: 5758.2). Total num frames: 59393024. Throughput: 0: 6052.5. Samples: 59396908. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 03:00:01,498][25689] Avg episode reward: [(0, '-56.270')] [2022-07-09 03:00:02,599][26022] Updated weights on worker 0-0, policy_version 58007 (0.00092) [2022-07-09 03:00:04,585][26022] Updated weights on worker 0-0, policy_version 58017 (0.00092) [2022-07-09 03:00:06,252][26022] Updated weights on worker 0-0, policy_version 58027 (0.00084) [2022-07-09 03:00:06,506][25689] Fps is (10 sec: 5520.2, 60 sec: 5768.8, 300 sec: 5750.3). Total num frames: 59419648. Throughput: 0: 5950.6. Samples: 59429778. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 03:00:06,506][25689] Avg episode reward: [(0, '-56.229')] [2022-07-09 03:00:07,902][26022] Updated weights on worker 0-0, policy_version 58037 (0.00585) [2022-07-09 03:00:09,916][26022] Updated weights on worker 0-0, policy_version 58047 (0.00080) [2022-07-09 03:00:11,615][25689] Fps is (10 sec: 5566.7, 60 sec: 5748.0, 300 sec: 5752.3). Total num frames: 59449344. Throughput: 0: 5940.1. Samples: 59447042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 03:00:11,615][25689] Avg episode reward: [(0, '-57.009')] [2022-07-09 03:00:11,648][26022] Updated weights on worker 0-0, policy_version 58057 (0.00083) [2022-07-09 03:00:13,263][26022] Updated weights on worker 0-0, policy_version 58067 (0.00085) [2022-07-09 03:00:15,243][26022] Updated weights on worker 0-0, policy_version 58077 (0.00086) [2022-07-09 03:00:16,618][25689] Fps is (10 sec: 5873.6, 60 sec: 5764.9, 300 sec: 5757.4). Total num frames: 59479040. Throughput: 0: 5949.9. Samples: 59482124. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 03:00:16,620][25689] Avg episode reward: [(0, '-56.193')] [2022-07-09 03:00:16,886][26022] Updated weights on worker 0-0, policy_version 58087 (0.00091) [2022-07-09 03:00:18,699][26022] Updated weights on worker 0-0, policy_version 58097 (0.00082) [2022-07-09 03:00:20,436][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:00:20,446][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000058106_59500544.pth [2022-07-09 03:00:20,446][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000056082_57427968.pth [2022-07-09 03:00:20,486][26022] Updated weights on worker 0-0, policy_version 58107 (0.00087) [2022-07-09 03:00:21,714][25689] Fps is (10 sec: 5982.2, 60 sec: 5772.9, 300 sec: 5756.7). Total num frames: 59509760. Throughput: 0: 5944.2. Samples: 59517080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 03:00:21,715][25689] Avg episode reward: [(0, '-56.816')] [2022-07-09 03:00:22,000][26022] Updated weights on worker 0-0, policy_version 58117 (0.00086) [2022-07-09 03:00:24,112][26022] Updated weights on worker 0-0, policy_version 58127 (0.00091) [2022-07-09 03:00:25,679][26022] Updated weights on worker 0-0, policy_version 58137 (0.00345) [2022-07-09 03:00:26,732][25689] Fps is (10 sec: 5669.3, 60 sec: 5757.2, 300 sec: 5751.6). Total num frames: 59536384. Throughput: 0: 5175.5. Samples: 59534460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 03:00:26,733][25689] Avg episode reward: [(0, '-56.461')] [2022-07-09 03:00:27,600][26022] Updated weights on worker 0-0, policy_version 58147 (0.00080) [2022-07-09 03:00:29,565][26022] Updated weights on worker 0-0, policy_version 58157 (0.00083) [2022-07-09 03:00:31,027][26022] Updated weights on worker 0-0, policy_version 58167 (0.00087) [2022-07-09 03:00:31,803][25689] Fps is (10 sec: 5683.9, 60 sec: 5778.6, 300 sec: 5754.2). Total num frames: 59567104. Throughput: 0: 6018.2. Samples: 59568538. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:00:31,803][25689] Avg episode reward: [(0, '-56.208')] [2022-07-09 03:00:33,036][26022] Updated weights on worker 0-0, policy_version 58177 (0.00086) [2022-07-09 03:00:34,488][26022] Updated weights on worker 0-0, policy_version 58187 (0.00093) [2022-07-09 03:00:36,523][26022] Updated weights on worker 0-0, policy_version 58197 (0.00086) [2022-07-09 03:00:36,849][25689] Fps is (10 sec: 5870.6, 60 sec: 5765.7, 300 sec: 5747.2). Total num frames: 59595776. Throughput: 0: 6007.1. Samples: 59603658. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:00:36,850][25689] Avg episode reward: [(0, '-56.274')] [2022-07-09 03:00:38,083][26022] Updated weights on worker 0-0, policy_version 58207 (0.00084) [2022-07-09 03:00:40,150][26022] Updated weights on worker 0-0, policy_version 58217 (0.00091) [2022-07-09 03:00:41,567][26022] Updated weights on worker 0-0, policy_version 58227 (0.00089) [2022-07-09 03:00:41,919][25689] Fps is (10 sec: 5769.8, 60 sec: 5763.6, 300 sec: 5753.6). Total num frames: 59625472. Throughput: 0: 5156.8. Samples: 59621272. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:00:41,919][25689] Avg episode reward: [(0, '-56.110')] [2022-07-09 03:00:43,527][26022] Updated weights on worker 0-0, policy_version 58237 (0.00087) [2022-07-09 03:00:45,108][26022] Updated weights on worker 0-0, policy_version 58247 (0.00086) [2022-07-09 03:00:46,919][26022] Updated weights on worker 0-0, policy_version 58257 (0.00087) [2022-07-09 03:00:46,994][25689] Fps is (10 sec: 5854.6, 60 sec: 5777.2, 300 sec: 5749.7). Total num frames: 59655168. Throughput: 0: 6021.0. Samples: 59656454. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:00:46,994][25689] Avg episode reward: [(0, '-55.756')] [2022-07-09 03:00:48,678][26022] Updated weights on worker 0-0, policy_version 58267 (0.00089) [2022-07-09 03:00:50,486][26022] Updated weights on worker 0-0, policy_version 58277 (0.00087) [2022-07-09 03:00:52,056][25689] Fps is (10 sec: 5858.7, 60 sec: 5762.2, 300 sec: 5756.5). Total num frames: 59684864. Throughput: 0: 6072.7. Samples: 59691532. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:00:52,057][25689] Avg episode reward: [(0, '-56.415')] [2022-07-09 03:00:52,222][26022] Updated weights on worker 0-0, policy_version 58287 (0.00084) [2022-07-09 03:00:53,923][26022] Updated weights on worker 0-0, policy_version 58297 (0.00086) [2022-07-09 03:00:55,612][26022] Updated weights on worker 0-0, policy_version 58307 (0.00088) [2022-07-09 03:00:57,083][25689] Fps is (10 sec: 5785.1, 60 sec: 5762.7, 300 sec: 5749.8). Total num frames: 59713536. Throughput: 0: 5208.9. Samples: 59709054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:00:57,084][25689] Avg episode reward: [(0, '-57.968')] [2022-07-09 03:00:57,487][26022] Updated weights on worker 0-0, policy_version 58317 (0.00084) [2022-07-09 03:00:59,105][26022] Updated weights on worker 0-0, policy_version 58327 (0.00085) [2022-07-09 03:01:01,168][26022] Updated weights on worker 0-0, policy_version 58337 (0.00093) [2022-07-09 03:01:02,143][25689] Fps is (10 sec: 5684.9, 60 sec: 5757.7, 300 sec: 5752.3). Total num frames: 59742208. Throughput: 0: 6067.0. Samples: 59743974. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:02,145][25689] Avg episode reward: [(0, '-57.406')] [2022-07-09 03:01:03,370][26022] Updated weights on worker 0-0, policy_version 58347 (0.00087) [2022-07-09 03:01:05,067][26022] Updated weights on worker 0-0, policy_version 58357 (0.00084) [2022-07-09 03:01:06,767][26022] Updated weights on worker 0-0, policy_version 58367 (0.00091) [2022-07-09 03:01:07,184][25689] Fps is (10 sec: 5575.8, 60 sec: 5771.6, 300 sec: 5755.9). Total num frames: 59769856. Throughput: 0: 5940.8. Samples: 59776398. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:07,187][25689] Avg episode reward: [(0, '-57.147')] [2022-07-09 03:01:08,409][26022] Updated weights on worker 0-0, policy_version 58377 (0.00087) [2022-07-09 03:01:10,414][26022] Updated weights on worker 0-0, policy_version 58387 (0.00090) [2022-07-09 03:01:12,054][26022] Updated weights on worker 0-0, policy_version 58397 (0.00086) [2022-07-09 03:01:12,217][25689] Fps is (10 sec: 5692.8, 60 sec: 5778.8, 300 sec: 5745.9). Total num frames: 59799552. Throughput: 0: 5073.0. Samples: 59793804. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:12,217][25689] Avg episode reward: [(0, '-57.281')] [2022-07-09 03:01:13,631][26022] Updated weights on worker 0-0, policy_version 58407 (0.00098) [2022-07-09 03:01:15,439][26022] Updated weights on worker 0-0, policy_version 58417 (0.00081) [2022-07-09 03:01:17,137][26022] Updated weights on worker 0-0, policy_version 58427 (0.00085) [2022-07-09 03:01:17,289][25689] Fps is (10 sec: 5978.7, 60 sec: 5789.1, 300 sec: 5759.1). Total num frames: 59830272. Throughput: 0: 5950.0. Samples: 59829280. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:17,290][25689] Avg episode reward: [(0, '-57.055')] [2022-07-09 03:01:19,187][26022] Updated weights on worker 0-0, policy_version 58437 (0.00095) [2022-07-09 03:01:20,587][26022] Updated weights on worker 0-0, policy_version 58447 (0.00078) [2022-07-09 03:01:22,319][25689] Fps is (10 sec: 5676.0, 60 sec: 5727.8, 300 sec: 5745.8). Total num frames: 59856896. Throughput: 0: 5962.4. Samples: 59864272. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:22,320][25689] Avg episode reward: [(0, '-56.344')] [2022-07-09 03:01:22,561][26022] Updated weights on worker 0-0, policy_version 58457 (0.00086) [2022-07-09 03:01:24,118][26022] Updated weights on worker 0-0, policy_version 58467 (0.00081) [2022-07-09 03:01:26,068][26022] Updated weights on worker 0-0, policy_version 58477 (0.00095) [2022-07-09 03:01:27,375][25689] Fps is (10 sec: 5685.6, 60 sec: 5791.9, 300 sec: 5757.4). Total num frames: 59887616. Throughput: 0: 5221.0. Samples: 59881816. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:27,375][25689] Avg episode reward: [(0, '-54.980')] [2022-07-09 03:01:27,839][26022] Updated weights on worker 0-0, policy_version 58487 (0.00087) [2022-07-09 03:01:29,553][26022] Updated weights on worker 0-0, policy_version 58497 (0.00090) [2022-07-09 03:01:31,419][26022] Updated weights on worker 0-0, policy_version 58507 (0.00087) [2022-07-09 03:01:32,448][25689] Fps is (10 sec: 5964.9, 60 sec: 5774.7, 300 sec: 5759.8). Total num frames: 59917312. Throughput: 0: 6052.8. Samples: 59916262. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:32,449][25689] Avg episode reward: [(0, '-55.057')] [2022-07-09 03:01:33,146][26022] Updated weights on worker 0-0, policy_version 58517 (0.00088) [2022-07-09 03:01:34,963][26022] Updated weights on worker 0-0, policy_version 58527 (0.00086) [2022-07-09 03:01:36,619][26022] Updated weights on worker 0-0, policy_version 58537 (0.00087) [2022-07-09 03:01:37,474][25689] Fps is (10 sec: 5779.3, 60 sec: 5776.6, 300 sec: 5752.5). Total num frames: 59945984. Throughput: 0: 6049.1. Samples: 59951382. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 03:01:37,475][25689] Avg episode reward: [(0, '-54.943')] [2022-07-09 03:01:38,435][26022] Updated weights on worker 0-0, policy_version 58547 (0.00081) [2022-07-09 03:01:40,175][26022] Updated weights on worker 0-0, policy_version 58557 (0.00084) [2022-07-09 03:01:41,695][26022] Updated weights on worker 0-0, policy_version 58567 (0.00082) [2022-07-09 03:01:42,499][25689] Fps is (10 sec: 5705.5, 60 sec: 5764.0, 300 sec: 5759.3). Total num frames: 59974656. Throughput: 0: 5175.9. Samples: 59968718. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:01:42,499][25689] Avg episode reward: [(0, '-55.320')] [2022-07-09 03:01:43,659][26022] Updated weights on worker 0-0, policy_version 58577 (0.00092) [2022-07-09 03:01:45,502][26022] Updated weights on worker 0-0, policy_version 58587 (0.00053) [2022-07-09 03:01:47,235][26022] Updated weights on worker 0-0, policy_version 58597 (0.00085) [2022-07-09 03:01:47,503][25689] Fps is (10 sec: 5922.0, 60 sec: 5787.7, 300 sec: 5760.3). Total num frames: 60005376. Throughput: 0: 6046.2. Samples: 60003516. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:01:47,505][25689] Avg episode reward: [(0, '-55.754')] [2022-07-09 03:01:49,091][26022] Updated weights on worker 0-0, policy_version 58607 (0.00088) [2022-07-09 03:01:50,772][26022] Updated weights on worker 0-0, policy_version 58617 (0.00089) [2022-07-09 03:01:52,505][26022] Updated weights on worker 0-0, policy_version 58627 (0.00083) [2022-07-09 03:01:52,639][25689] Fps is (10 sec: 5857.0, 60 sec: 5763.8, 300 sec: 5762.1). Total num frames: 60034048. Throughput: 0: 6048.0. Samples: 60038376. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:01:52,640][25689] Avg episode reward: [(0, '-55.518')] [2022-07-09 03:01:54,269][26022] Updated weights on worker 0-0, policy_version 58637 (0.00089) [2022-07-09 03:01:56,079][26022] Updated weights on worker 0-0, policy_version 58647 (0.00085) [2022-07-09 03:01:57,639][26022] Updated weights on worker 0-0, policy_version 58657 (0.00088) [2022-07-09 03:01:57,704][25689] Fps is (10 sec: 5822.5, 60 sec: 5793.9, 300 sec: 5769.1). Total num frames: 60064768. Throughput: 0: 6040.9. Samples: 60073586. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:01:57,704][25689] Avg episode reward: [(0, '-55.939')] [2022-07-09 03:01:59,630][26022] Updated weights on worker 0-0, policy_version 58667 (0.00089) [2022-07-09 03:02:01,423][26022] Updated weights on worker 0-0, policy_version 58677 (0.00081) [2022-07-09 03:02:02,746][25689] Fps is (10 sec: 5572.5, 60 sec: 5745.0, 300 sec: 5752.3). Total num frames: 60090368. Throughput: 0: 6021.3. Samples: 60090632. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:02,748][25689] Avg episode reward: [(0, '-55.163')] [2022-07-09 03:02:03,507][26022] Updated weights on worker 0-0, policy_version 58687 (0.00082) [2022-07-09 03:02:05,264][26022] Updated weights on worker 0-0, policy_version 58697 (0.00090) [2022-07-09 03:02:07,254][26022] Updated weights on worker 0-0, policy_version 58707 (0.00611) [2022-07-09 03:02:07,801][25689] Fps is (10 sec: 5476.4, 60 sec: 5777.4, 300 sec: 5766.4). Total num frames: 60120064. Throughput: 0: 5887.4. Samples: 60123016. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:07,801][25689] Avg episode reward: [(0, '-55.996')] [2022-07-09 03:02:08,957][26022] Updated weights on worker 0-0, policy_version 58717 (0.00084) [2022-07-09 03:02:10,687][26022] Updated weights on worker 0-0, policy_version 58727 (0.00084) [2022-07-09 03:02:12,450][26022] Updated weights on worker 0-0, policy_version 58737 (0.00088) [2022-07-09 03:02:12,841][25689] Fps is (10 sec: 5781.9, 60 sec: 5759.8, 300 sec: 5762.5). Total num frames: 60148736. Throughput: 0: 5904.6. Samples: 60157658. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:12,841][25689] Avg episode reward: [(0, '-55.539')] [2022-07-09 03:02:14,208][26022] Updated weights on worker 0-0, policy_version 58747 (0.00086) [2022-07-09 03:02:15,953][26022] Updated weights on worker 0-0, policy_version 58757 (0.00082) [2022-07-09 03:02:17,876][25689] Fps is (10 sec: 5691.4, 60 sec: 5729.5, 300 sec: 5762.0). Total num frames: 60177408. Throughput: 0: 5032.4. Samples: 60175102. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:17,877][25689] Avg episode reward: [(0, '-55.545')] [2022-07-09 03:02:17,880][26022] Updated weights on worker 0-0, policy_version 58767 (0.00103) [2022-07-09 03:02:19,430][26022] Updated weights on worker 0-0, policy_version 58777 (0.00083) [2022-07-09 03:02:20,551][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:02:20,567][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000058783_60193792.pth [2022-07-09 03:02:20,567][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000056757_58119168.pth [2022-07-09 03:02:21,292][26022] Updated weights on worker 0-0, policy_version 58787 (0.00085) [2022-07-09 03:02:22,884][25689] Fps is (10 sec: 5913.7, 60 sec: 5799.3, 300 sec: 5765.3). Total num frames: 60208128. Throughput: 0: 5926.8. Samples: 60209986. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:22,884][25689] Avg episode reward: [(0, '-55.415')] [2022-07-09 03:02:22,891][26022] Updated weights on worker 0-0, policy_version 58797 (0.00089) [2022-07-09 03:02:24,625][26022] Updated weights on worker 0-0, policy_version 58807 (0.00087) [2022-07-09 03:02:26,605][26022] Updated weights on worker 0-0, policy_version 58817 (0.00089) [2022-07-09 03:02:27,939][25689] Fps is (10 sec: 5902.4, 60 sec: 5765.6, 300 sec: 5765.1). Total num frames: 60236800. Throughput: 0: 6049.2. Samples: 60244834. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:27,939][25689] Avg episode reward: [(0, '-56.006')] [2022-07-09 03:02:28,215][26022] Updated weights on worker 0-0, policy_version 58827 (0.00095) [2022-07-09 03:02:30,276][26022] Updated weights on worker 0-0, policy_version 58837 (0.00088) [2022-07-09 03:02:31,867][26022] Updated weights on worker 0-0, policy_version 58847 (0.00091) [2022-07-09 03:02:33,016][25689] Fps is (10 sec: 5457.3, 60 sec: 5714.4, 300 sec: 5760.5). Total num frames: 60263424. Throughput: 0: 5156.2. Samples: 60261688. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:33,017][25689] Avg episode reward: [(0, '-54.708')] [2022-07-09 03:02:33,701][26022] Updated weights on worker 0-0, policy_version 58857 (0.00081) [2022-07-09 03:02:35,588][26022] Updated weights on worker 0-0, policy_version 58867 (0.00095) [2022-07-09 03:02:37,079][26022] Updated weights on worker 0-0, policy_version 58877 (0.00092) [2022-07-09 03:02:38,095][25689] Fps is (10 sec: 5645.9, 60 sec: 5743.2, 300 sec: 5759.2). Total num frames: 60294144. Throughput: 0: 6016.9. Samples: 60296758. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:38,096][25689] Avg episode reward: [(0, '-55.157')] [2022-07-09 03:02:39,189][26022] Updated weights on worker 0-0, policy_version 58887 (0.00084) [2022-07-09 03:02:40,734][26022] Updated weights on worker 0-0, policy_version 58897 (0.00080) [2022-07-09 03:02:42,569][26022] Updated weights on worker 0-0, policy_version 58907 (0.00083) [2022-07-09 03:02:43,108][25689] Fps is (10 sec: 5885.3, 60 sec: 5744.3, 300 sec: 5755.9). Total num frames: 60322816. Throughput: 0: 6016.3. Samples: 60331660. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:43,108][25689] Avg episode reward: [(0, '-56.418')] [2022-07-09 03:02:44,255][26022] Updated weights on worker 0-0, policy_version 58917 (0.00082) [2022-07-09 03:02:46,163][26022] Updated weights on worker 0-0, policy_version 58927 (0.00083) [2022-07-09 03:02:47,736][26022] Updated weights on worker 0-0, policy_version 58937 (0.00086) [2022-07-09 03:02:48,136][25689] Fps is (10 sec: 5813.2, 60 sec: 5725.2, 300 sec: 5763.7). Total num frames: 60352512. Throughput: 0: 5158.1. Samples: 60349014. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:48,136][25689] Avg episode reward: [(0, '-56.276')] [2022-07-09 03:02:49,624][26022] Updated weights on worker 0-0, policy_version 58947 (0.00121) [2022-07-09 03:02:51,255][26022] Updated weights on worker 0-0, policy_version 58957 (0.00088) [2022-07-09 03:02:53,175][25689] Fps is (10 sec: 5797.8, 60 sec: 5734.4, 300 sec: 5756.5). Total num frames: 60381184. Throughput: 0: 6050.4. Samples: 60383656. Policy #0 lag: (min: 1.0, avg: 9.6, max: 20.0) [2022-07-09 03:02:53,176][25689] Avg episode reward: [(0, '-56.536')] [2022-07-09 03:02:53,244][26022] Updated weights on worker 0-0, policy_version 58967 (0.00084) [2022-07-09 03:02:54,860][26022] Updated weights on worker 0-0, policy_version 58977 (0.00089) [2022-07-09 03:02:56,654][26022] Updated weights on worker 0-0, policy_version 58987 (0.00091) [2022-07-09 03:02:58,227][25689] Fps is (10 sec: 5784.0, 60 sec: 5718.6, 300 sec: 5762.4). Total num frames: 60410880. Throughput: 0: 6052.0. Samples: 60418596. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:02:58,228][25689] Avg episode reward: [(0, '-57.630')] [2022-07-09 03:02:58,447][26022] Updated weights on worker 0-0, policy_version 58997 (0.00090) [2022-07-09 03:03:00,276][26022] Updated weights on worker 0-0, policy_version 59007 (0.00084) [2022-07-09 03:03:02,354][26022] Updated weights on worker 0-0, policy_version 59017 (0.00089) [2022-07-09 03:03:03,273][25689] Fps is (10 sec: 5678.6, 60 sec: 5752.1, 300 sec: 5763.4). Total num frames: 60438528. Throughput: 0: 5195.2. Samples: 60436424. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:03,274][25689] Avg episode reward: [(0, '-57.239')] [2022-07-09 03:03:04,123][26022] Updated weights on worker 0-0, policy_version 59027 (0.00081) [2022-07-09 03:03:05,807][26022] Updated weights on worker 0-0, policy_version 59037 (0.00079) [2022-07-09 03:03:07,595][26022] Updated weights on worker 0-0, policy_version 59047 (0.00079) [2022-07-09 03:03:08,340][25689] Fps is (10 sec: 5670.3, 60 sec: 5751.0, 300 sec: 5762.9). Total num frames: 60468224. Throughput: 0: 5945.7. Samples: 60469144. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:08,341][25689] Avg episode reward: [(0, '-56.971')] [2022-07-09 03:03:09,326][26022] Updated weights on worker 0-0, policy_version 59057 (0.00089) [2022-07-09 03:03:11,052][26022] Updated weights on worker 0-0, policy_version 59067 (0.00084) [2022-07-09 03:03:13,020][26022] Updated weights on worker 0-0, policy_version 59077 (0.00093) [2022-07-09 03:03:13,459][25689] Fps is (10 sec: 5831.0, 60 sec: 5760.4, 300 sec: 5767.6). Total num frames: 60497920. Throughput: 0: 5934.4. Samples: 60504028. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:13,461][25689] Avg episode reward: [(0, '-57.163')] [2022-07-09 03:03:14,548][26022] Updated weights on worker 0-0, policy_version 59087 (0.00084) [2022-07-09 03:03:16,385][26022] Updated weights on worker 0-0, policy_version 59097 (0.00086) [2022-07-09 03:03:18,105][26022] Updated weights on worker 0-0, policy_version 59107 (0.00089) [2022-07-09 03:03:18,498][25689] Fps is (10 sec: 5746.4, 60 sec: 5760.1, 300 sec: 5764.2). Total num frames: 60526592. Throughput: 0: 5087.6. Samples: 60521724. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:18,498][25689] Avg episode reward: [(0, '-56.989')] [2022-07-09 03:03:19,715][26022] Updated weights on worker 0-0, policy_version 59117 (0.00088) [2022-07-09 03:03:21,554][26022] Updated weights on worker 0-0, policy_version 59127 (0.00084) [2022-07-09 03:03:23,349][26022] Updated weights on worker 0-0, policy_version 59137 (0.00083) [2022-07-09 03:03:23,569][25689] Fps is (10 sec: 5874.7, 60 sec: 5754.0, 300 sec: 5766.8). Total num frames: 60557312. Throughput: 0: 5945.5. Samples: 60557090. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:23,569][25689] Avg episode reward: [(0, '-55.928')] [2022-07-09 03:03:25,037][26022] Updated weights on worker 0-0, policy_version 59147 (0.00092) [2022-07-09 03:03:26,940][26022] Updated weights on worker 0-0, policy_version 59157 (0.00082) [2022-07-09 03:03:28,339][26022] Updated weights on worker 0-0, policy_version 59167 (0.00081) [2022-07-09 03:03:28,664][25689] Fps is (10 sec: 5942.5, 60 sec: 5767.0, 300 sec: 5770.3). Total num frames: 60587008. Throughput: 0: 6054.3. Samples: 60592192. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:28,668][25689] Avg episode reward: [(0, '-55.801')] [2022-07-09 03:03:30,384][26022] Updated weights on worker 0-0, policy_version 59177 (0.00087) [2022-07-09 03:03:32,422][26022] Updated weights on worker 0-0, policy_version 59187 (0.00086) [2022-07-09 03:03:33,719][25689] Fps is (10 sec: 5750.6, 60 sec: 5803.0, 300 sec: 5759.3). Total num frames: 60615680. Throughput: 0: 5203.7. Samples: 60609452. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:33,719][25689] Avg episode reward: [(0, '-56.343')] [2022-07-09 03:03:33,820][26022] Updated weights on worker 0-0, policy_version 59197 (0.00086) [2022-07-09 03:03:35,827][26022] Updated weights on worker 0-0, policy_version 59207 (0.00082) [2022-07-09 03:03:37,307][26022] Updated weights on worker 0-0, policy_version 59217 (0.00083) [2022-07-09 03:03:38,736][25689] Fps is (10 sec: 5795.0, 60 sec: 5792.0, 300 sec: 5766.2). Total num frames: 60645376. Throughput: 0: 6055.4. Samples: 60644278. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:38,737][25689] Avg episode reward: [(0, '-56.265')] [2022-07-09 03:03:39,321][26022] Updated weights on worker 0-0, policy_version 59227 (0.00087) [2022-07-09 03:03:40,903][26022] Updated weights on worker 0-0, policy_version 59237 (0.00086) [2022-07-09 03:03:42,877][26022] Updated weights on worker 0-0, policy_version 59247 (0.00087) [2022-07-09 03:03:43,742][25689] Fps is (10 sec: 6027.6, 60 sec: 5826.4, 300 sec: 5773.2). Total num frames: 60676096. Throughput: 0: 6058.5. Samples: 60679308. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:43,743][25689] Avg episode reward: [(0, '-57.150')] [2022-07-09 03:03:44,383][26022] Updated weights on worker 0-0, policy_version 59257 (0.00088) [2022-07-09 03:03:46,274][26022] Updated weights on worker 0-0, policy_version 59267 (0.00089) [2022-07-09 03:03:47,903][26022] Updated weights on worker 0-0, policy_version 59277 (0.00083) [2022-07-09 03:03:48,808][25689] Fps is (10 sec: 5693.8, 60 sec: 5772.2, 300 sec: 5763.8). Total num frames: 60702720. Throughput: 0: 5195.1. Samples: 60696840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:48,808][25689] Avg episode reward: [(0, '-57.262')] [2022-07-09 03:03:49,700][26022] Updated weights on worker 0-0, policy_version 59287 (0.00087) [2022-07-09 03:03:51,614][26022] Updated weights on worker 0-0, policy_version 59297 (0.00086) [2022-07-09 03:03:53,204][26022] Updated weights on worker 0-0, policy_version 59307 (0.00092) [2022-07-09 03:03:53,882][25689] Fps is (10 sec: 5655.3, 60 sec: 5802.6, 300 sec: 5763.1). Total num frames: 60733440. Throughput: 0: 6039.0. Samples: 60731218. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:53,882][25689] Avg episode reward: [(0, '-57.436')] [2022-07-09 03:03:55,251][26022] Updated weights on worker 0-0, policy_version 59317 (0.00088) [2022-07-09 03:03:56,880][26022] Updated weights on worker 0-0, policy_version 59327 (0.00090) [2022-07-09 03:03:58,463][26022] Updated weights on worker 0-0, policy_version 59337 (0.00092) [2022-07-09 03:03:58,905][25689] Fps is (10 sec: 5983.7, 60 sec: 5805.4, 300 sec: 5769.9). Total num frames: 60763136. Throughput: 0: 6061.7. Samples: 60766532. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:03:58,905][25689] Avg episode reward: [(0, '-56.621')] [2022-07-09 03:04:00,288][26022] Updated weights on worker 0-0, policy_version 59347 (0.00088) [2022-07-09 03:04:01,948][26022] Updated weights on worker 0-0, policy_version 59357 (0.00091) [2022-07-09 03:04:03,933][25689] Fps is (10 sec: 5603.5, 60 sec: 5790.2, 300 sec: 5770.3). Total num frames: 60789760. Throughput: 0: 5194.1. Samples: 60784182. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 03:04:03,933][25689] Avg episode reward: [(0, '-57.161')] [2022-07-09 03:04:04,336][26022] Updated weights on worker 0-0, policy_version 59367 (0.00090) [2022-07-09 03:04:06,008][26022] Updated weights on worker 0-0, policy_version 59377 (0.00086) [2022-07-09 03:04:07,912][26022] Updated weights on worker 0-0, policy_version 59387 (0.00627) [2022-07-09 03:04:08,959][25689] Fps is (10 sec: 5398.0, 60 sec: 5760.3, 300 sec: 5760.8). Total num frames: 60817408. Throughput: 0: 5937.6. Samples: 60816488. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:08,959][25689] Avg episode reward: [(0, '-56.621')] [2022-07-09 03:04:09,540][26022] Updated weights on worker 0-0, policy_version 59397 (0.00082) [2022-07-09 03:04:11,567][26022] Updated weights on worker 0-0, policy_version 59407 (0.00086) [2022-07-09 03:04:13,132][26022] Updated weights on worker 0-0, policy_version 59417 (0.00089) [2022-07-09 03:04:14,009][25689] Fps is (10 sec: 5690.7, 60 sec: 5766.8, 300 sec: 5763.3). Total num frames: 60847104. Throughput: 0: 5983.0. Samples: 60851642. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:14,010][25689] Avg episode reward: [(0, '-57.712')] [2022-07-09 03:04:14,899][26022] Updated weights on worker 0-0, policy_version 59427 (0.00092) [2022-07-09 03:04:16,502][26022] Updated weights on worker 0-0, policy_version 59437 (0.00082) [2022-07-09 03:04:18,326][26022] Updated weights on worker 0-0, policy_version 59447 (0.00092) [2022-07-09 03:04:19,019][25689] Fps is (10 sec: 5903.9, 60 sec: 5786.6, 300 sec: 5763.2). Total num frames: 60876800. Throughput: 0: 5985.7. Samples: 60886928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:19,019][25689] Avg episode reward: [(0, '-56.956')] [2022-07-09 03:04:19,880][26022] Updated weights on worker 0-0, policy_version 59457 (0.00086) [2022-07-09 03:04:20,698][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:04:20,712][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000059461_60888064.pth [2022-07-09 03:04:20,712][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000057431_58809344.pth [2022-07-09 03:04:22,031][26022] Updated weights on worker 0-0, policy_version 59467 (0.00098) [2022-07-09 03:04:23,616][26022] Updated weights on worker 0-0, policy_version 59477 (0.00093) [2022-07-09 03:04:24,050][25689] Fps is (10 sec: 6017.0, 60 sec: 5790.3, 300 sec: 5773.5). Total num frames: 60907520. Throughput: 0: 5990.5. Samples: 60904696. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:24,051][25689] Avg episode reward: [(0, '-57.088')] [2022-07-09 03:04:25,363][26022] Updated weights on worker 0-0, policy_version 59487 (0.00083) [2022-07-09 03:04:27,063][26022] Updated weights on worker 0-0, policy_version 59497 (0.00089) [2022-07-09 03:04:28,831][26022] Updated weights on worker 0-0, policy_version 59507 (0.00091) [2022-07-09 03:04:29,080][25689] Fps is (10 sec: 5903.2, 60 sec: 5779.7, 300 sec: 5771.7). Total num frames: 60936192. Throughput: 0: 6123.9. Samples: 60939706. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:29,080][25689] Avg episode reward: [(0, '-57.232')] [2022-07-09 03:04:30,676][26022] Updated weights on worker 0-0, policy_version 59517 (0.00085) [2022-07-09 03:04:32,365][26022] Updated weights on worker 0-0, policy_version 59527 (0.00093) [2022-07-09 03:04:34,211][25689] Fps is (10 sec: 5643.8, 60 sec: 5772.4, 300 sec: 5767.5). Total num frames: 60964864. Throughput: 0: 6077.4. Samples: 60974414. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:34,211][25689] Avg episode reward: [(0, '-56.482')] [2022-07-09 03:04:34,253][26022] Updated weights on worker 0-0, policy_version 59537 (0.00087) [2022-07-09 03:04:35,996][26022] Updated weights on worker 0-0, policy_version 59547 (0.00091) [2022-07-09 03:04:37,533][26022] Updated weights on worker 0-0, policy_version 59557 (0.00086) [2022-07-09 03:04:39,235][25689] Fps is (10 sec: 5747.4, 60 sec: 5771.7, 300 sec: 5767.9). Total num frames: 60994560. Throughput: 0: 5201.5. Samples: 60992084. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:39,236][25689] Avg episode reward: [(0, '-55.585')] [2022-07-09 03:04:39,474][26022] Updated weights on worker 0-0, policy_version 59567 (0.00086) [2022-07-09 03:04:41,066][26022] Updated weights on worker 0-0, policy_version 59577 (0.00093) [2022-07-09 03:04:43,007][26022] Updated weights on worker 0-0, policy_version 59587 (0.00087) [2022-07-09 03:04:44,254][25689] Fps is (10 sec: 5913.5, 60 sec: 5753.5, 300 sec: 5771.8). Total num frames: 61024256. Throughput: 0: 6040.1. Samples: 61026730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:44,255][25689] Avg episode reward: [(0, '-56.177')] [2022-07-09 03:04:44,745][26022] Updated weights on worker 0-0, policy_version 59597 (0.00084) [2022-07-09 03:04:46,702][26022] Updated weights on worker 0-0, policy_version 59607 (0.00091) [2022-07-09 03:04:48,182][26022] Updated weights on worker 0-0, policy_version 59617 (0.00089) [2022-07-09 03:04:49,278][25689] Fps is (10 sec: 5811.9, 60 sec: 5791.4, 300 sec: 5766.1). Total num frames: 61052928. Throughput: 0: 6021.5. Samples: 61061330. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:49,279][25689] Avg episode reward: [(0, '-56.465')] [2022-07-09 03:04:50,364][26022] Updated weights on worker 0-0, policy_version 59627 (0.00101) [2022-07-09 03:04:51,622][26022] Updated weights on worker 0-0, policy_version 59637 (0.00087) [2022-07-09 03:04:53,626][26022] Updated weights on worker 0-0, policy_version 59647 (0.00233) [2022-07-09 03:04:54,351][25689] Fps is (10 sec: 5679.7, 60 sec: 5757.7, 300 sec: 5765.2). Total num frames: 61081600. Throughput: 0: 5182.6. Samples: 61078790. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:54,351][25689] Avg episode reward: [(0, '-56.581')] [2022-07-09 03:04:55,305][26022] Updated weights on worker 0-0, policy_version 59657 (0.00087) [2022-07-09 03:04:57,204][26022] Updated weights on worker 0-0, policy_version 59667 (0.00092) [2022-07-09 03:04:58,965][26022] Updated weights on worker 0-0, policy_version 59677 (0.00086) [2022-07-09 03:04:59,366][25689] Fps is (10 sec: 5887.3, 60 sec: 5775.3, 300 sec: 5772.0). Total num frames: 61112320. Throughput: 0: 6046.5. Samples: 61113806. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:04:59,368][25689] Avg episode reward: [(0, '-56.827')] [2022-07-09 03:05:00,545][26022] Updated weights on worker 0-0, policy_version 59687 (0.00083) [2022-07-09 03:05:02,684][26022] Updated weights on worker 0-0, policy_version 59697 (0.00082) [2022-07-09 03:05:04,414][25689] Fps is (10 sec: 5596.6, 60 sec: 5756.5, 300 sec: 5767.8). Total num frames: 61137920. Throughput: 0: 5924.8. Samples: 61146172. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:05:04,416][25689] Avg episode reward: [(0, '-56.809')] [2022-07-09 03:05:04,603][26022] Updated weights on worker 0-0, policy_version 59707 (0.00093) [2022-07-09 03:05:06,255][26022] Updated weights on worker 0-0, policy_version 59717 (0.00087) [2022-07-09 03:05:08,277][26022] Updated weights on worker 0-0, policy_version 59727 (0.00087) [2022-07-09 03:05:09,433][25689] Fps is (10 sec: 5390.9, 60 sec: 5774.1, 300 sec: 5766.1). Total num frames: 61166592. Throughput: 0: 5070.1. Samples: 61163520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:05:09,434][25689] Avg episode reward: [(0, '-57.128')] [2022-07-09 03:05:09,807][26022] Updated weights on worker 0-0, policy_version 59737 (0.00090) [2022-07-09 03:05:11,738][26022] Updated weights on worker 0-0, policy_version 59747 (0.00080) [2022-07-09 03:05:13,449][26022] Updated weights on worker 0-0, policy_version 59757 (0.00087) [2022-07-09 03:05:14,475][25689] Fps is (10 sec: 5801.6, 60 sec: 5775.0, 300 sec: 5765.3). Total num frames: 61196288. Throughput: 0: 5939.8. Samples: 61198322. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:05:14,475][25689] Avg episode reward: [(0, '-56.982')] [2022-07-09 03:05:15,227][26022] Updated weights on worker 0-0, policy_version 59767 (0.00087) [2022-07-09 03:05:16,978][26022] Updated weights on worker 0-0, policy_version 59777 (0.00095) [2022-07-09 03:05:18,680][26022] Updated weights on worker 0-0, policy_version 59787 (0.00086) [2022-07-09 03:05:19,484][25689] Fps is (10 sec: 5807.2, 60 sec: 5758.0, 300 sec: 5760.1). Total num frames: 61224960. Throughput: 0: 5921.5. Samples: 61232936. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 03:05:19,486][25689] Avg episode reward: [(0, '-56.516')] [2022-07-09 03:05:20,510][26022] Updated weights on worker 0-0, policy_version 59797 (0.00088) [2022-07-09 03:05:22,299][26022] Updated weights on worker 0-0, policy_version 59807 (0.00089) [2022-07-09 03:05:24,055][26022] Updated weights on worker 0-0, policy_version 59817 (0.00095) [2022-07-09 03:05:24,512][25689] Fps is (10 sec: 5815.3, 60 sec: 5741.4, 300 sec: 5770.3). Total num frames: 61254656. Throughput: 0: 5189.5. Samples: 61250468. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:24,514][25689] Avg episode reward: [(0, '-56.586')] [2022-07-09 03:05:25,779][26022] Updated weights on worker 0-0, policy_version 59827 (0.00094) [2022-07-09 03:05:27,624][26022] Updated weights on worker 0-0, policy_version 59837 (0.00096) [2022-07-09 03:05:29,527][25689] Fps is (10 sec: 5710.2, 60 sec: 5725.9, 300 sec: 5761.0). Total num frames: 61282304. Throughput: 0: 6043.0. Samples: 61284944. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:29,527][25689] Avg episode reward: [(0, '-56.472')] [2022-07-09 03:05:29,614][26022] Updated weights on worker 0-0, policy_version 59847 (0.00088) [2022-07-09 03:05:31,160][26022] Updated weights on worker 0-0, policy_version 59857 (0.00088) [2022-07-09 03:05:33,107][26022] Updated weights on worker 0-0, policy_version 59867 (0.00085) [2022-07-09 03:05:34,578][25689] Fps is (10 sec: 5798.4, 60 sec: 5767.4, 300 sec: 5767.8). Total num frames: 61313024. Throughput: 0: 6018.4. Samples: 61319312. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:34,580][25689] Avg episode reward: [(0, '-56.387')] [2022-07-09 03:05:34,618][26022] Updated weights on worker 0-0, policy_version 59877 (0.00081) [2022-07-09 03:05:36,606][26022] Updated weights on worker 0-0, policy_version 59887 (0.00090) [2022-07-09 03:05:38,330][26022] Updated weights on worker 0-0, policy_version 59897 (0.00096) [2022-07-09 03:05:39,623][25689] Fps is (10 sec: 5882.7, 60 sec: 5748.5, 300 sec: 5764.9). Total num frames: 61341696. Throughput: 0: 5147.2. Samples: 61336592. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:39,624][25689] Avg episode reward: [(0, '-56.868')] [2022-07-09 03:05:40,182][26022] Updated weights on worker 0-0, policy_version 59907 (0.00087) [2022-07-09 03:05:41,944][26022] Updated weights on worker 0-0, policy_version 59917 (0.00095) [2022-07-09 03:05:43,663][26022] Updated weights on worker 0-0, policy_version 59927 (0.00088) [2022-07-09 03:05:44,627][25689] Fps is (10 sec: 5706.2, 60 sec: 5732.9, 300 sec: 5762.8). Total num frames: 61370368. Throughput: 0: 6009.0. Samples: 61371342. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:44,634][25689] Avg episode reward: [(0, '-56.194')] [2022-07-09 03:05:45,416][26022] Updated weights on worker 0-0, policy_version 59937 (0.00084) [2022-07-09 03:05:47,223][26022] Updated weights on worker 0-0, policy_version 59947 (0.00086) [2022-07-09 03:05:48,955][26022] Updated weights on worker 0-0, policy_version 59957 (0.00090) [2022-07-09 03:05:49,651][25689] Fps is (10 sec: 5718.3, 60 sec: 5732.9, 300 sec: 5760.1). Total num frames: 61399040. Throughput: 0: 6023.4. Samples: 61406160. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:49,651][25689] Avg episode reward: [(0, '-57.037')] [2022-07-09 03:05:50,836][26022] Updated weights on worker 0-0, policy_version 59967 (0.00080) [2022-07-09 03:05:52,654][26022] Updated weights on worker 0-0, policy_version 59977 (0.00091) [2022-07-09 03:05:54,333][26022] Updated weights on worker 0-0, policy_version 59987 (0.00090) [2022-07-09 03:05:54,703][25689] Fps is (10 sec: 5792.8, 60 sec: 5751.8, 300 sec: 5763.0). Total num frames: 61428736. Throughput: 0: 5171.8. Samples: 61423396. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:54,704][25689] Avg episode reward: [(0, '-57.710')] [2022-07-09 03:05:56,307][26022] Updated weights on worker 0-0, policy_version 59997 (0.01137) [2022-07-09 03:05:57,929][26022] Updated weights on worker 0-0, policy_version 60007 (0.00089) [2022-07-09 03:05:59,698][26022] Updated weights on worker 0-0, policy_version 60017 (0.00086) [2022-07-09 03:05:59,707][25689] Fps is (10 sec: 5804.3, 60 sec: 5719.0, 300 sec: 5764.1). Total num frames: 61457408. Throughput: 0: 6040.1. Samples: 61457902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:05:59,707][25689] Avg episode reward: [(0, '-56.851')] [2022-07-09 03:06:01,652][26022] Updated weights on worker 0-0, policy_version 60027 (0.00095) [2022-07-09 03:06:03,557][26022] Updated weights on worker 0-0, policy_version 60037 (0.00085) [2022-07-09 03:06:04,709][25689] Fps is (10 sec: 5423.9, 60 sec: 5723.3, 300 sec: 5758.0). Total num frames: 61483008. Throughput: 0: 5906.7. Samples: 61489960. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:04,710][25689] Avg episode reward: [(0, '-56.342')] [2022-07-09 03:06:05,716][26022] Updated weights on worker 0-0, policy_version 60047 (0.00099) [2022-07-09 03:06:07,387][26022] Updated weights on worker 0-0, policy_version 60057 (0.00080) [2022-07-09 03:06:09,151][26022] Updated weights on worker 0-0, policy_version 60067 (0.00091) [2022-07-09 03:06:09,727][25689] Fps is (10 sec: 5416.5, 60 sec: 5723.5, 300 sec: 5754.9). Total num frames: 61511680. Throughput: 0: 5002.3. Samples: 61506586. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:09,727][25689] Avg episode reward: [(0, '-56.346')] [2022-07-09 03:06:10,935][26022] Updated weights on worker 0-0, policy_version 60077 (0.00089) [2022-07-09 03:06:12,845][26022] Updated weights on worker 0-0, policy_version 60087 (0.00090) [2022-07-09 03:06:14,528][26022] Updated weights on worker 0-0, policy_version 60097 (0.00086) [2022-07-09 03:06:14,801][25689] Fps is (10 sec: 5682.1, 60 sec: 5703.3, 300 sec: 5747.9). Total num frames: 61540352. Throughput: 0: 5843.7. Samples: 61540844. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:14,802][25689] Avg episode reward: [(0, '-56.758')] [2022-07-09 03:06:16,328][26022] Updated weights on worker 0-0, policy_version 60107 (0.00083) [2022-07-09 03:06:18,077][26022] Updated weights on worker 0-0, policy_version 60117 (0.00088) [2022-07-09 03:06:19,823][25689] Fps is (10 sec: 5679.9, 60 sec: 5702.3, 300 sec: 5755.0). Total num frames: 61569024. Throughput: 0: 5854.6. Samples: 61575672. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:19,823][25689] Avg episode reward: [(0, '-56.869')] [2022-07-09 03:06:19,841][26022] Updated weights on worker 0-0, policy_version 60127 (0.00092) [2022-07-09 03:06:20,798][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:06:20,816][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000060132_61575168.pth [2022-07-09 03:06:20,816][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000058106_59500544.pth [2022-07-09 03:06:21,675][26022] Updated weights on worker 0-0, policy_version 60137 (0.00087) [2022-07-09 03:06:23,340][26022] Updated weights on worker 0-0, policy_version 60147 (0.00085) [2022-07-09 03:06:24,829][25689] Fps is (10 sec: 5820.5, 60 sec: 5704.2, 300 sec: 5752.5). Total num frames: 61598720. Throughput: 0: 5110.1. Samples: 61592776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:24,830][25689] Avg episode reward: [(0, '-56.801')] [2022-07-09 03:06:25,251][26022] Updated weights on worker 0-0, policy_version 60157 (0.00088) [2022-07-09 03:06:26,794][26022] Updated weights on worker 0-0, policy_version 60167 (0.00086) [2022-07-09 03:06:28,835][26022] Updated weights on worker 0-0, policy_version 60177 (0.00084) [2022-07-09 03:06:29,838][25689] Fps is (10 sec: 5929.9, 60 sec: 5738.7, 300 sec: 5753.7). Total num frames: 61628416. Throughput: 0: 6010.8. Samples: 61627472. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:29,839][25689] Avg episode reward: [(0, '-57.123')] [2022-07-09 03:06:30,554][26022] Updated weights on worker 0-0, policy_version 60187 (0.00082) [2022-07-09 03:06:32,217][26022] Updated weights on worker 0-0, policy_version 60197 (0.00092) [2022-07-09 03:06:34,066][26022] Updated weights on worker 0-0, policy_version 60207 (0.00082) [2022-07-09 03:06:34,931][25689] Fps is (10 sec: 5677.0, 60 sec: 5683.9, 300 sec: 5749.0). Total num frames: 61656064. Throughput: 0: 6034.3. Samples: 61662308. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 03:06:34,931][25689] Avg episode reward: [(0, '-58.659')] [2022-07-09 03:06:35,795][26022] Updated weights on worker 0-0, policy_version 60217 (0.00084) [2022-07-09 03:06:37,672][26022] Updated weights on worker 0-0, policy_version 60227 (0.00083) [2022-07-09 03:06:39,342][26022] Updated weights on worker 0-0, policy_version 60237 (0.00086) [2022-07-09 03:06:39,958][25689] Fps is (10 sec: 5666.6, 60 sec: 5702.5, 300 sec: 5752.4). Total num frames: 61685760. Throughput: 0: 5149.4. Samples: 61679356. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:06:39,958][25689] Avg episode reward: [(0, '-57.915')] [2022-07-09 03:06:41,235][26022] Updated weights on worker 0-0, policy_version 60247 (0.00089) [2022-07-09 03:06:42,844][26022] Updated weights on worker 0-0, policy_version 60257 (0.00085) [2022-07-09 03:06:44,741][26022] Updated weights on worker 0-0, policy_version 60267 (0.00083) [2022-07-09 03:06:44,967][25689] Fps is (10 sec: 5815.8, 60 sec: 5702.1, 300 sec: 5745.4). Total num frames: 61714432. Throughput: 0: 6046.8. Samples: 61714542. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:06:44,967][25689] Avg episode reward: [(0, '-57.736')] [2022-07-09 03:06:46,445][26022] Updated weights on worker 0-0, policy_version 60277 (0.00085) [2022-07-09 03:06:48,227][26022] Updated weights on worker 0-0, policy_version 60287 (0.00090) [2022-07-09 03:06:49,909][26022] Updated weights on worker 0-0, policy_version 60297 (0.00086) [2022-07-09 03:06:49,992][25689] Fps is (10 sec: 5817.1, 60 sec: 5719.0, 300 sec: 5751.0). Total num frames: 61744128. Throughput: 0: 6037.9. Samples: 61749156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:06:49,992][25689] Avg episode reward: [(0, '-58.211')] [2022-07-09 03:06:51,909][26022] Updated weights on worker 0-0, policy_version 60307 (0.00093) [2022-07-09 03:06:53,479][26022] Updated weights on worker 0-0, policy_version 60317 (0.00090) [2022-07-09 03:06:55,026][25689] Fps is (10 sec: 5700.5, 60 sec: 5686.7, 300 sec: 5741.3). Total num frames: 61771776. Throughput: 0: 5182.4. Samples: 61766452. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:06:55,028][25689] Avg episode reward: [(0, '-58.178')] [2022-07-09 03:06:55,328][26022] Updated weights on worker 0-0, policy_version 60327 (0.00088) [2022-07-09 03:06:56,994][26022] Updated weights on worker 0-0, policy_version 60337 (0.00086) [2022-07-09 03:06:58,811][26022] Updated weights on worker 0-0, policy_version 60347 (0.00088) [2022-07-09 03:07:00,051][25689] Fps is (10 sec: 5802.2, 60 sec: 5718.6, 300 sec: 5758.8). Total num frames: 61802496. Throughput: 0: 6073.7. Samples: 61801398. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:00,052][25689] Avg episode reward: [(0, '-57.944')] [2022-07-09 03:07:00,659][26022] Updated weights on worker 0-0, policy_version 60357 (0.01043) [2022-07-09 03:07:02,600][26022] Updated weights on worker 0-0, policy_version 60367 (0.00093) [2022-07-09 03:07:04,515][26022] Updated weights on worker 0-0, policy_version 60377 (0.00085) [2022-07-09 03:07:05,082][25689] Fps is (10 sec: 5702.7, 60 sec: 5733.0, 300 sec: 5749.0). Total num frames: 61829120. Throughput: 0: 5949.3. Samples: 61834212. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:05,082][25689] Avg episode reward: [(0, '-57.597')] [2022-07-09 03:07:06,406][26022] Updated weights on worker 0-0, policy_version 60387 (0.00083) [2022-07-09 03:07:08,184][26022] Updated weights on worker 0-0, policy_version 60397 (0.00086) [2022-07-09 03:07:09,977][26022] Updated weights on worker 0-0, policy_version 60407 (0.00081) [2022-07-09 03:07:10,160][25689] Fps is (10 sec: 5470.0, 60 sec: 5727.1, 300 sec: 5748.2). Total num frames: 61857792. Throughput: 0: 5042.4. Samples: 61850854. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:10,162][25689] Avg episode reward: [(0, '-57.968')] [2022-07-09 03:07:12,030][26022] Updated weights on worker 0-0, policy_version 60417 (0.00103) [2022-07-09 03:07:13,295][26022] Updated weights on worker 0-0, policy_version 60427 (0.00086) [2022-07-09 03:07:15,244][25689] Fps is (10 sec: 5642.7, 60 sec: 5726.3, 300 sec: 5747.3). Total num frames: 61886464. Throughput: 0: 5875.8. Samples: 61885250. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:15,245][25689] Avg episode reward: [(0, '-58.349')] [2022-07-09 03:07:15,431][26022] Updated weights on worker 0-0, policy_version 60437 (0.00085) [2022-07-09 03:07:16,947][26022] Updated weights on worker 0-0, policy_version 60447 (0.00089) [2022-07-09 03:07:18,957][26022] Updated weights on worker 0-0, policy_version 60457 (0.00086) [2022-07-09 03:07:20,331][25689] Fps is (10 sec: 5738.8, 60 sec: 5737.0, 300 sec: 5742.3). Total num frames: 61916160. Throughput: 0: 5836.5. Samples: 61919760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:20,332][25689] Avg episode reward: [(0, '-57.287')] [2022-07-09 03:07:20,695][26022] Updated weights on worker 0-0, policy_version 60467 (0.00085) [2022-07-09 03:07:22,412][26022] Updated weights on worker 0-0, policy_version 60477 (0.00097) [2022-07-09 03:07:24,189][26022] Updated weights on worker 0-0, policy_version 60487 (0.00080) [2022-07-09 03:07:25,337][25689] Fps is (10 sec: 5782.8, 60 sec: 5720.1, 300 sec: 5743.2). Total num frames: 61944832. Throughput: 0: 5938.5. Samples: 61954502. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:25,338][25689] Avg episode reward: [(0, '-57.722')] [2022-07-09 03:07:26,056][26022] Updated weights on worker 0-0, policy_version 60497 (0.00087) [2022-07-09 03:07:27,732][26022] Updated weights on worker 0-0, policy_version 60507 (0.00095) [2022-07-09 03:07:29,458][26022] Updated weights on worker 0-0, policy_version 60517 (0.00085) [2022-07-09 03:07:30,371][25689] Fps is (10 sec: 5711.7, 60 sec: 5700.9, 300 sec: 5751.0). Total num frames: 61973504. Throughput: 0: 5998.1. Samples: 61972078. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:30,371][25689] Avg episode reward: [(0, '-57.491')] [2022-07-09 03:07:31,259][26022] Updated weights on worker 0-0, policy_version 60527 (0.00089) [2022-07-09 03:07:33,066][26022] Updated weights on worker 0-0, policy_version 60537 (0.00090) [2022-07-09 03:07:34,823][26022] Updated weights on worker 0-0, policy_version 60547 (0.00094) [2022-07-09 03:07:35,417][25689] Fps is (10 sec: 5790.7, 60 sec: 5739.1, 300 sec: 5748.2). Total num frames: 62003200. Throughput: 0: 6026.4. Samples: 62006820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:35,418][25689] Avg episode reward: [(0, '-57.274')] [2022-07-09 03:07:36,616][26022] Updated weights on worker 0-0, policy_version 60557 (0.00087) [2022-07-09 03:07:38,378][26022] Updated weights on worker 0-0, policy_version 60567 (0.00088) [2022-07-09 03:07:40,212][26022] Updated weights on worker 0-0, policy_version 60577 (0.00080) [2022-07-09 03:07:40,418][25689] Fps is (10 sec: 5809.3, 60 sec: 5724.6, 300 sec: 5748.4). Total num frames: 62031872. Throughput: 0: 6056.7. Samples: 62041422. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:40,419][25689] Avg episode reward: [(0, '-57.104')] [2022-07-09 03:07:41,955][26022] Updated weights on worker 0-0, policy_version 60587 (0.00102) [2022-07-09 03:07:43,692][26022] Updated weights on worker 0-0, policy_version 60597 (0.00083) [2022-07-09 03:07:45,427][25689] Fps is (10 sec: 5728.9, 60 sec: 5724.6, 300 sec: 5745.3). Total num frames: 62060544. Throughput: 0: 5200.7. Samples: 62058980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:45,427][25689] Avg episode reward: [(0, '-56.767')] [2022-07-09 03:07:45,471][26022] Updated weights on worker 0-0, policy_version 60607 (0.00084) [2022-07-09 03:07:47,101][26022] Updated weights on worker 0-0, policy_version 60617 (0.00083) [2022-07-09 03:07:48,928][26022] Updated weights on worker 0-0, policy_version 60627 (0.00088) [2022-07-09 03:07:50,447][25689] Fps is (10 sec: 5820.0, 60 sec: 5725.1, 300 sec: 5749.1). Total num frames: 62090240. Throughput: 0: 6069.0. Samples: 62093920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 03:07:50,448][25689] Avg episode reward: [(0, '-57.221')] [2022-07-09 03:07:50,621][26022] Updated weights on worker 0-0, policy_version 60637 (0.00085) [2022-07-09 03:07:52,467][26022] Updated weights on worker 0-0, policy_version 60647 (0.00083) [2022-07-09 03:07:53,994][26022] Updated weights on worker 0-0, policy_version 60657 (0.00089) [2022-07-09 03:07:55,484][25689] Fps is (10 sec: 5905.2, 60 sec: 5758.7, 300 sec: 5749.4). Total num frames: 62119936. Throughput: 0: 6111.8. Samples: 62129466. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:07:55,485][25689] Avg episode reward: [(0, '-57.320')] [2022-07-09 03:07:55,919][26022] Updated weights on worker 0-0, policy_version 60667 (0.00092) [2022-07-09 03:07:57,571][26022] Updated weights on worker 0-0, policy_version 60677 (0.00085) [2022-07-09 03:07:59,321][26022] Updated weights on worker 0-0, policy_version 60687 (0.00088) [2022-07-09 03:08:00,503][25689] Fps is (10 sec: 5906.1, 60 sec: 5742.4, 300 sec: 5756.8). Total num frames: 62149632. Throughput: 0: 5264.4. Samples: 62147158. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:00,504][25689] Avg episode reward: [(0, '-57.082')] [2022-07-09 03:08:01,372][26022] Updated weights on worker 0-0, policy_version 60697 (0.00094) [2022-07-09 03:08:03,221][26022] Updated weights on worker 0-0, policy_version 60707 (0.00087) [2022-07-09 03:08:05,140][26022] Updated weights on worker 0-0, policy_version 60717 (0.00086) [2022-07-09 03:08:05,507][25689] Fps is (10 sec: 5721.2, 60 sec: 5761.8, 300 sec: 5751.2). Total num frames: 62177280. Throughput: 0: 6026.7. Samples: 62180000. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:05,508][25689] Avg episode reward: [(0, '-57.429')] [2022-07-09 03:08:06,694][26022] Updated weights on worker 0-0, policy_version 60727 (0.00095) [2022-07-09 03:08:08,571][26022] Updated weights on worker 0-0, policy_version 60737 (0.00098) [2022-07-09 03:08:10,375][26022] Updated weights on worker 0-0, policy_version 60747 (0.00084) [2022-07-09 03:08:10,523][25689] Fps is (10 sec: 5518.9, 60 sec: 5750.9, 300 sec: 5746.3). Total num frames: 62204928. Throughput: 0: 6008.1. Samples: 62214536. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:10,523][25689] Avg episode reward: [(0, '-58.291')] [2022-07-09 03:08:12,023][26022] Updated weights on worker 0-0, policy_version 60757 (0.00103) [2022-07-09 03:08:13,994][26022] Updated weights on worker 0-0, policy_version 60767 (0.00086) [2022-07-09 03:08:15,569][25689] Fps is (10 sec: 5699.1, 60 sec: 5771.4, 300 sec: 5749.6). Total num frames: 62234624. Throughput: 0: 5100.8. Samples: 62231916. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:15,570][25689] Avg episode reward: [(0, '-57.212')] [2022-07-09 03:08:15,760][26022] Updated weights on worker 0-0, policy_version 60777 (0.00096) [2022-07-09 03:08:17,416][26022] Updated weights on worker 0-0, policy_version 60787 (0.00088) [2022-07-09 03:08:19,393][26022] Updated weights on worker 0-0, policy_version 60797 (0.00851) [2022-07-09 03:08:20,577][25689] Fps is (10 sec: 5907.3, 60 sec: 5779.0, 300 sec: 5747.4). Total num frames: 62264320. Throughput: 0: 5969.8. Samples: 62266992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:20,577][25689] Avg episode reward: [(0, '-56.606')] [2022-07-09 03:08:20,816][26022] Updated weights on worker 0-0, policy_version 60807 (0.00088) [2022-07-09 03:08:20,957][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:08:20,969][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000060808_62267392.pth [2022-07-09 03:08:20,970][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000058783_60193792.pth [2022-07-09 03:08:22,836][26022] Updated weights on worker 0-0, policy_version 60817 (0.00617) [2022-07-09 03:08:24,501][26022] Updated weights on worker 0-0, policy_version 60827 (0.00087) [2022-07-09 03:08:25,602][25689] Fps is (10 sec: 5919.9, 60 sec: 5794.2, 300 sec: 5748.7). Total num frames: 62294016. Throughput: 0: 6069.5. Samples: 62301964. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:25,603][25689] Avg episode reward: [(0, '-55.993')] [2022-07-09 03:08:26,140][26022] Updated weights on worker 0-0, policy_version 60837 (0.00093) [2022-07-09 03:08:28,032][26022] Updated weights on worker 0-0, policy_version 60847 (0.00087) [2022-07-09 03:08:29,769][26022] Updated weights on worker 0-0, policy_version 60857 (0.00098) [2022-07-09 03:08:30,633][25689] Fps is (10 sec: 5702.1, 60 sec: 5777.4, 300 sec: 5745.7). Total num frames: 62321664. Throughput: 0: 5208.4. Samples: 62319280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:30,634][25689] Avg episode reward: [(0, '-55.703')] [2022-07-09 03:08:31,584][26022] Updated weights on worker 0-0, policy_version 60867 (0.00091) [2022-07-09 03:08:33,193][26022] Updated weights on worker 0-0, policy_version 60877 (0.00088) [2022-07-09 03:08:35,040][26022] Updated weights on worker 0-0, policy_version 60887 (0.00088) [2022-07-09 03:08:35,682][25689] Fps is (10 sec: 5790.2, 60 sec: 5794.1, 300 sec: 5748.5). Total num frames: 62352384. Throughput: 0: 6081.1. Samples: 62354226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:35,684][25689] Avg episode reward: [(0, '-55.681')] [2022-07-09 03:08:37,043][26022] Updated weights on worker 0-0, policy_version 60897 (0.00094) [2022-07-09 03:08:38,604][26022] Updated weights on worker 0-0, policy_version 60907 (0.00098) [2022-07-09 03:08:40,307][26022] Updated weights on worker 0-0, policy_version 60917 (0.00083) [2022-07-09 03:08:40,699][25689] Fps is (10 sec: 5900.6, 60 sec: 5792.7, 300 sec: 5741.4). Total num frames: 62381056. Throughput: 0: 6073.9. Samples: 62389212. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:40,700][25689] Avg episode reward: [(0, '-56.251')] [2022-07-09 03:08:42,258][26022] Updated weights on worker 0-0, policy_version 60927 (0.00086) [2022-07-09 03:08:43,876][26022] Updated weights on worker 0-0, policy_version 60937 (0.00095) [2022-07-09 03:08:45,719][25689] Fps is (10 sec: 5611.6, 60 sec: 5774.6, 300 sec: 5745.8). Total num frames: 62408704. Throughput: 0: 5203.7. Samples: 62406646. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:45,720][25689] Avg episode reward: [(0, '-56.921')] [2022-07-09 03:08:45,851][26022] Updated weights on worker 0-0, policy_version 60947 (0.00086) [2022-07-09 03:08:47,631][26022] Updated weights on worker 0-0, policy_version 60957 (0.00095) [2022-07-09 03:08:48,991][26022] Updated weights on worker 0-0, policy_version 60967 (0.00087) [2022-07-09 03:08:50,743][25689] Fps is (10 sec: 5709.3, 60 sec: 5774.2, 300 sec: 5743.3). Total num frames: 62438400. Throughput: 0: 6099.3. Samples: 62441932. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:50,743][25689] Avg episode reward: [(0, '-57.348')] [2022-07-09 03:08:51,128][26022] Updated weights on worker 0-0, policy_version 60977 (0.00096) [2022-07-09 03:08:52,575][26022] Updated weights on worker 0-0, policy_version 60987 (0.00083) [2022-07-09 03:08:54,460][26022] Updated weights on worker 0-0, policy_version 60997 (0.00626) [2022-07-09 03:08:55,840][25689] Fps is (10 sec: 5969.1, 60 sec: 5785.4, 300 sec: 5745.3). Total num frames: 62469120. Throughput: 0: 6077.9. Samples: 62476744. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:08:55,841][25689] Avg episode reward: [(0, '-57.298')] [2022-07-09 03:08:56,209][26022] Updated weights on worker 0-0, policy_version 61007 (0.00089) [2022-07-09 03:08:57,769][26022] Updated weights on worker 0-0, policy_version 61017 (0.00086) [2022-07-09 03:08:59,700][26022] Updated weights on worker 0-0, policy_version 61027 (0.00089) [2022-07-09 03:09:00,854][25689] Fps is (10 sec: 5975.1, 60 sec: 5785.9, 300 sec: 5755.9). Total num frames: 62498816. Throughput: 0: 5223.8. Samples: 62494498. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:09:00,854][25689] Avg episode reward: [(0, '-57.126')] [2022-07-09 03:09:01,417][26022] Updated weights on worker 0-0, policy_version 61037 (0.00088) [2022-07-09 03:09:03,375][26022] Updated weights on worker 0-0, policy_version 61047 (0.00088) [2022-07-09 03:09:05,263][26022] Updated weights on worker 0-0, policy_version 61057 (0.00086) [2022-07-09 03:09:05,897][25689] Fps is (10 sec: 5600.3, 60 sec: 5765.2, 300 sec: 5752.2). Total num frames: 62525440. Throughput: 0: 6001.2. Samples: 62527738. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 03:09:05,899][25689] Avg episode reward: [(0, '-56.806')] [2022-07-09 03:09:06,922][26022] Updated weights on worker 0-0, policy_version 61067 (0.00085) [2022-07-09 03:09:08,843][26022] Updated weights on worker 0-0, policy_version 61077 (0.00086) [2022-07-09 03:09:10,633][26022] Updated weights on worker 0-0, policy_version 61087 (0.00088) [2022-07-09 03:09:10,950][25689] Fps is (10 sec: 5578.6, 60 sec: 5795.6, 300 sec: 5752.1). Total num frames: 62555136. Throughput: 0: 5958.8. Samples: 62562342. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:10,951][25689] Avg episode reward: [(0, '-56.552')] [2022-07-09 03:09:12,275][26022] Updated weights on worker 0-0, policy_version 61097 (0.00055) [2022-07-09 03:09:13,930][26022] Updated weights on worker 0-0, policy_version 61107 (0.00094) [2022-07-09 03:09:15,739][26022] Updated weights on worker 0-0, policy_version 61117 (0.00074) [2022-07-09 03:09:16,046][25689] Fps is (10 sec: 5851.9, 60 sec: 5790.8, 300 sec: 5750.4). Total num frames: 62584832. Throughput: 0: 5111.4. Samples: 62580024. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:16,048][25689] Avg episode reward: [(0, '-56.515')] [2022-07-09 03:09:17,486][26022] Updated weights on worker 0-0, policy_version 61127 (0.00093) [2022-07-09 03:09:19,306][26022] Updated weights on worker 0-0, policy_version 61137 (0.00086) [2022-07-09 03:09:20,974][26022] Updated weights on worker 0-0, policy_version 61147 (0.00083) [2022-07-09 03:09:21,074][25689] Fps is (10 sec: 5866.5, 60 sec: 5788.9, 300 sec: 5747.1). Total num frames: 62614528. Throughput: 0: 5983.6. Samples: 62615484. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:21,074][25689] Avg episode reward: [(0, '-56.235')] [2022-07-09 03:09:22,785][26022] Updated weights on worker 0-0, policy_version 61157 (0.00121) [2022-07-09 03:09:24,367][26022] Updated weights on worker 0-0, policy_version 61167 (0.00087) [2022-07-09 03:09:26,081][25689] Fps is (10 sec: 5816.4, 60 sec: 5773.6, 300 sec: 5747.5). Total num frames: 62643200. Throughput: 0: 6091.4. Samples: 62650688. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:26,083][25689] Avg episode reward: [(0, '-56.043')] [2022-07-09 03:09:26,440][26022] Updated weights on worker 0-0, policy_version 61177 (0.00091) [2022-07-09 03:09:27,920][26022] Updated weights on worker 0-0, policy_version 61187 (0.00085) [2022-07-09 03:09:29,960][26022] Updated weights on worker 0-0, policy_version 61197 (0.00087) [2022-07-09 03:09:31,159][25689] Fps is (10 sec: 5888.6, 60 sec: 5819.9, 300 sec: 5755.4). Total num frames: 62673920. Throughput: 0: 5235.7. Samples: 62668154. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:31,160][25689] Avg episode reward: [(0, '-55.521')] [2022-07-09 03:09:31,535][26022] Updated weights on worker 0-0, policy_version 61207 (0.00089) [2022-07-09 03:09:33,406][26022] Updated weights on worker 0-0, policy_version 61217 (0.00088) [2022-07-09 03:09:35,048][26022] Updated weights on worker 0-0, policy_version 61227 (0.00090) [2022-07-09 03:09:36,307][25689] Fps is (10 sec: 5908.2, 60 sec: 5793.6, 300 sec: 5753.0). Total num frames: 62703616. Throughput: 0: 6077.1. Samples: 62703152. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:36,307][25689] Avg episode reward: [(0, '-55.389')] [2022-07-09 03:09:36,794][26022] Updated weights on worker 0-0, policy_version 61237 (0.00095) [2022-07-09 03:09:38,564][26022] Updated weights on worker 0-0, policy_version 61247 (0.00088) [2022-07-09 03:09:40,392][26022] Updated weights on worker 0-0, policy_version 61257 (0.00098) [2022-07-09 03:09:41,330][25689] Fps is (10 sec: 5638.2, 60 sec: 5776.0, 300 sec: 5746.0). Total num frames: 62731264. Throughput: 0: 6045.2. Samples: 62737938. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:41,330][25689] Avg episode reward: [(0, '-55.665')] [2022-07-09 03:09:42,073][26022] Updated weights on worker 0-0, policy_version 61267 (0.00089) [2022-07-09 03:09:44,046][26022] Updated weights on worker 0-0, policy_version 61277 (0.00090) [2022-07-09 03:09:45,498][26022] Updated weights on worker 0-0, policy_version 61287 (0.00087) [2022-07-09 03:09:46,333][25689] Fps is (10 sec: 5923.6, 60 sec: 5845.2, 300 sec: 5756.8). Total num frames: 62763008. Throughput: 0: 5191.1. Samples: 62755818. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:46,333][25689] Avg episode reward: [(0, '-55.736')] [2022-07-09 03:09:47,544][26022] Updated weights on worker 0-0, policy_version 61297 (0.00088) [2022-07-09 03:09:48,955][26022] Updated weights on worker 0-0, policy_version 61307 (0.00101) [2022-07-09 03:09:51,020][26022] Updated weights on worker 0-0, policy_version 61317 (0.00084) [2022-07-09 03:09:51,365][25689] Fps is (10 sec: 6020.5, 60 sec: 5827.5, 300 sec: 5757.6). Total num frames: 62791680. Throughput: 0: 6071.6. Samples: 62790834. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:51,366][25689] Avg episode reward: [(0, '-55.541')] [2022-07-09 03:09:52,528][26022] Updated weights on worker 0-0, policy_version 61327 (0.00088) [2022-07-09 03:09:54,409][26022] Updated weights on worker 0-0, policy_version 61337 (0.00080) [2022-07-09 03:09:56,081][26022] Updated weights on worker 0-0, policy_version 61347 (0.00086) [2022-07-09 03:09:56,431][25689] Fps is (10 sec: 5780.4, 60 sec: 5813.7, 300 sec: 5753.1). Total num frames: 62821376. Throughput: 0: 6108.1. Samples: 62826070. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:09:56,431][25689] Avg episode reward: [(0, '-55.903')] [2022-07-09 03:09:57,773][26022] Updated weights on worker 0-0, policy_version 61357 (0.00081) [2022-07-09 03:09:59,418][26022] Updated weights on worker 0-0, policy_version 61367 (0.00087) [2022-07-09 03:10:01,471][25689] Fps is (10 sec: 5674.5, 60 sec: 5777.4, 300 sec: 5760.2). Total num frames: 62849024. Throughput: 0: 5253.7. Samples: 62843752. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:10:01,471][25689] Avg episode reward: [(0, '-56.834')] [2022-07-09 03:10:01,508][26022] Updated weights on worker 0-0, policy_version 61377 (0.00095) [2022-07-09 03:10:03,419][26022] Updated weights on worker 0-0, policy_version 61387 (0.00083) [2022-07-09 03:10:05,292][26022] Updated weights on worker 0-0, policy_version 61397 (0.00082) [2022-07-09 03:10:06,474][25689] Fps is (10 sec: 5709.4, 60 sec: 5831.9, 300 sec: 5763.9). Total num frames: 62878720. Throughput: 0: 6009.5. Samples: 62876858. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:10:06,476][25689] Avg episode reward: [(0, '-56.553')] [2022-07-09 03:10:06,945][26022] Updated weights on worker 0-0, policy_version 61407 (0.00087) [2022-07-09 03:10:08,657][26022] Updated weights on worker 0-0, policy_version 61417 (0.00090) [2022-07-09 03:10:10,659][26022] Updated weights on worker 0-0, policy_version 61427 (0.00105) [2022-07-09 03:10:11,501][25689] Fps is (10 sec: 5717.4, 60 sec: 5800.6, 300 sec: 5757.3). Total num frames: 62906368. Throughput: 0: 5990.3. Samples: 62911452. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:10:11,502][25689] Avg episode reward: [(0, '-56.193')] [2022-07-09 03:10:12,160][26022] Updated weights on worker 0-0, policy_version 61437 (0.00094) [2022-07-09 03:10:14,163][26022] Updated weights on worker 0-0, policy_version 61447 (0.00078) [2022-07-09 03:10:15,757][26022] Updated weights on worker 0-0, policy_version 61457 (0.00086) [2022-07-09 03:10:16,611][25689] Fps is (10 sec: 5657.2, 60 sec: 5799.3, 300 sec: 5758.8). Total num frames: 62936064. Throughput: 0: 5089.4. Samples: 62928776. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:10:16,613][25689] Avg episode reward: [(0, '-56.433')] [2022-07-09 03:10:17,743][26022] Updated weights on worker 0-0, policy_version 61467 (0.00093) [2022-07-09 03:10:19,252][26022] Updated weights on worker 0-0, policy_version 61477 (0.00087) [2022-07-09 03:10:21,035][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:10:21,050][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000061486_62961664.pth [2022-07-09 03:10:21,050][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000059461_60888064.pth [2022-07-09 03:10:21,222][26022] Updated weights on worker 0-0, policy_version 61487 (0.00078) [2022-07-09 03:10:21,679][25689] Fps is (10 sec: 5835.2, 60 sec: 5795.4, 300 sec: 5758.1). Total num frames: 62965760. Throughput: 0: 5942.3. Samples: 62963834. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 03:10:21,680][25689] Avg episode reward: [(0, '-56.499')] [2022-07-09 03:10:22,790][26022] Updated weights on worker 0-0, policy_version 61497 (0.00089) [2022-07-09 03:10:24,625][26022] Updated weights on worker 0-0, policy_version 61507 (0.00084) [2022-07-09 03:10:26,321][26022] Updated weights on worker 0-0, policy_version 61517 (0.00086) [2022-07-09 03:10:26,699][25689] Fps is (10 sec: 5785.6, 60 sec: 5794.2, 300 sec: 5761.4). Total num frames: 62994432. Throughput: 0: 6046.1. Samples: 62999140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:26,701][25689] Avg episode reward: [(0, '-56.703')] [2022-07-09 03:10:28,079][26022] Updated weights on worker 0-0, policy_version 61527 (0.00081) [2022-07-09 03:10:29,924][26022] Updated weights on worker 0-0, policy_version 61537 (0.00084) [2022-07-09 03:10:31,500][26022] Updated weights on worker 0-0, policy_version 61547 (0.00090) [2022-07-09 03:10:31,721][25689] Fps is (10 sec: 5812.0, 60 sec: 5782.6, 300 sec: 5758.5). Total num frames: 63024128. Throughput: 0: 5203.9. Samples: 63016678. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:31,722][25689] Avg episode reward: [(0, '-56.105')] [2022-07-09 03:10:33,488][26022] Updated weights on worker 0-0, policy_version 61557 (0.00089) [2022-07-09 03:10:35,050][26022] Updated weights on worker 0-0, policy_version 61567 (0.00091) [2022-07-09 03:10:36,771][25689] Fps is (10 sec: 5795.3, 60 sec: 5775.1, 300 sec: 5758.4). Total num frames: 63052800. Throughput: 0: 6096.6. Samples: 63051684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:36,771][25689] Avg episode reward: [(0, '-56.351')] [2022-07-09 03:10:36,892][26022] Updated weights on worker 0-0, policy_version 61577 (0.00091) [2022-07-09 03:10:38,676][26022] Updated weights on worker 0-0, policy_version 61587 (0.00084) [2022-07-09 03:10:40,433][26022] Updated weights on worker 0-0, policy_version 61597 (0.00088) [2022-07-09 03:10:41,806][25689] Fps is (10 sec: 5889.5, 60 sec: 5824.8, 300 sec: 5764.7). Total num frames: 63083520. Throughput: 0: 6114.8. Samples: 63086906. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:41,806][25689] Avg episode reward: [(0, '-57.521')] [2022-07-09 03:10:42,286][26022] Updated weights on worker 0-0, policy_version 61607 (0.00087) [2022-07-09 03:10:44,140][26022] Updated weights on worker 0-0, policy_version 61617 (0.00093) [2022-07-09 03:10:45,541][26022] Updated weights on worker 0-0, policy_version 61627 (0.00081) [2022-07-09 03:10:46,813][25689] Fps is (10 sec: 5812.1, 60 sec: 5756.6, 300 sec: 5761.6). Total num frames: 63111168. Throughput: 0: 6094.9. Samples: 63121732. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:46,814][25689] Avg episode reward: [(0, '-57.613')] [2022-07-09 03:10:47,522][26022] Updated weights on worker 0-0, policy_version 61637 (0.00083) [2022-07-09 03:10:49,020][26022] Updated weights on worker 0-0, policy_version 61647 (0.00088) [2022-07-09 03:10:50,970][26022] Updated weights on worker 0-0, policy_version 61657 (0.00094) [2022-07-09 03:10:51,818][25689] Fps is (10 sec: 5829.4, 60 sec: 5793.0, 300 sec: 5766.0). Total num frames: 63141888. Throughput: 0: 6099.3. Samples: 63139256. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:51,819][25689] Avg episode reward: [(0, '-57.709')] [2022-07-09 03:10:52,676][26022] Updated weights on worker 0-0, policy_version 61667 (0.00086) [2022-07-09 03:10:54,380][26022] Updated weights on worker 0-0, policy_version 61677 (0.00094) [2022-07-09 03:10:56,451][26022] Updated weights on worker 0-0, policy_version 61687 (0.00099) [2022-07-09 03:10:56,949][25689] Fps is (10 sec: 5859.7, 60 sec: 5769.9, 300 sec: 5763.5). Total num frames: 63170560. Throughput: 0: 6062.6. Samples: 63174016. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:10:56,949][25689] Avg episode reward: [(0, '-57.908')] [2022-07-09 03:10:57,856][26022] Updated weights on worker 0-0, policy_version 61697 (0.00092) [2022-07-09 03:10:59,795][26022] Updated weights on worker 0-0, policy_version 61707 (0.00084) [2022-07-09 03:11:01,448][26022] Updated weights on worker 0-0, policy_version 61717 (0.00087) [2022-07-09 03:11:01,991][25689] Fps is (10 sec: 5637.1, 60 sec: 5786.6, 300 sec: 5773.1). Total num frames: 63199232. Throughput: 0: 6008.3. Samples: 63208184. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:01,991][25689] Avg episode reward: [(0, '-57.908')] [2022-07-09 03:11:03,691][26022] Updated weights on worker 0-0, policy_version 61727 (0.00094) [2022-07-09 03:11:05,597][26022] Updated weights on worker 0-0, policy_version 61737 (0.00810) [2022-07-09 03:11:07,000][25689] Fps is (10 sec: 5603.2, 60 sec: 5752.3, 300 sec: 5769.8). Total num frames: 63226880. Throughput: 0: 5074.4. Samples: 63224168. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:07,001][25689] Avg episode reward: [(0, '-57.280')] [2022-07-09 03:11:07,191][26022] Updated weights on worker 0-0, policy_version 61747 (0.00095) [2022-07-09 03:11:08,976][26022] Updated weights on worker 0-0, policy_version 61757 (0.00087) [2022-07-09 03:11:10,737][26022] Updated weights on worker 0-0, policy_version 61767 (0.00087) [2022-07-09 03:11:12,045][25689] Fps is (10 sec: 5703.6, 60 sec: 5784.3, 300 sec: 5773.8). Total num frames: 63256576. Throughput: 0: 5926.6. Samples: 63259130. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:12,045][25689] Avg episode reward: [(0, '-56.118')] [2022-07-09 03:11:12,556][26022] Updated weights on worker 0-0, policy_version 61777 (0.00093) [2022-07-09 03:11:14,392][26022] Updated weights on worker 0-0, policy_version 61787 (0.00088) [2022-07-09 03:11:15,966][26022] Updated weights on worker 0-0, policy_version 61797 (0.00099) [2022-07-09 03:11:17,118][25689] Fps is (10 sec: 5870.3, 60 sec: 5787.9, 300 sec: 5776.2). Total num frames: 63286272. Throughput: 0: 5939.4. Samples: 63293806. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:17,118][25689] Avg episode reward: [(0, '-56.124')] [2022-07-09 03:11:17,861][26022] Updated weights on worker 0-0, policy_version 61807 (0.00109) [2022-07-09 03:11:19,477][26022] Updated weights on worker 0-0, policy_version 61817 (0.00083) [2022-07-09 03:11:21,231][26022] Updated weights on worker 0-0, policy_version 61827 (0.00078) [2022-07-09 03:11:22,153][25689] Fps is (10 sec: 5672.8, 60 sec: 5757.1, 300 sec: 5768.8). Total num frames: 63313920. Throughput: 0: 5122.5. Samples: 63311466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:22,154][25689] Avg episode reward: [(0, '-55.693')] [2022-07-09 03:11:23,045][26022] Updated weights on worker 0-0, policy_version 61837 (0.00086) [2022-07-09 03:11:24,729][26022] Updated weights on worker 0-0, policy_version 61847 (0.00087) [2022-07-09 03:11:26,576][26022] Updated weights on worker 0-0, policy_version 61857 (0.00087) [2022-07-09 03:11:27,207][25689] Fps is (10 sec: 5683.6, 60 sec: 5770.9, 300 sec: 5767.9). Total num frames: 63343616. Throughput: 0: 6059.2. Samples: 63346604. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:27,207][25689] Avg episode reward: [(0, '-55.648')] [2022-07-09 03:11:28,281][26022] Updated weights on worker 0-0, policy_version 61867 (0.00088) [2022-07-09 03:11:30,185][26022] Updated weights on worker 0-0, policy_version 61877 (0.00090) [2022-07-09 03:11:31,790][26022] Updated weights on worker 0-0, policy_version 61887 (0.00045) [2022-07-09 03:11:32,250][25689] Fps is (10 sec: 5983.7, 60 sec: 5785.8, 300 sec: 5779.2). Total num frames: 63374336. Throughput: 0: 6067.3. Samples: 63381722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:32,251][25689] Avg episode reward: [(0, '-55.606')] [2022-07-09 03:11:33,779][26022] Updated weights on worker 0-0, policy_version 61897 (0.00066) [2022-07-09 03:11:35,183][26022] Updated weights on worker 0-0, policy_version 61907 (0.00085) [2022-07-09 03:11:37,305][25689] Fps is (10 sec: 5881.2, 60 sec: 5785.2, 300 sec: 5775.2). Total num frames: 63403008. Throughput: 0: 5216.7. Samples: 63399120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:11:37,306][25689] Avg episode reward: [(0, '-56.916')] [2022-07-09 03:11:37,306][26022] Updated weights on worker 0-0, policy_version 61917 (0.00083) [2022-07-09 03:11:38,854][26022] Updated weights on worker 0-0, policy_version 61927 (0.00086) [2022-07-09 03:11:40,728][26022] Updated weights on worker 0-0, policy_version 61937 (0.00082) [2022-07-09 03:11:42,249][26022] Updated weights on worker 0-0, policy_version 61947 (0.00085) [2022-07-09 03:11:42,309][25689] Fps is (10 sec: 6006.0, 60 sec: 5805.1, 300 sec: 5785.6). Total num frames: 63434752. Throughput: 0: 6089.5. Samples: 63434206. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:11:42,311][25689] Avg episode reward: [(0, '-56.755')] [2022-07-09 03:11:44,271][26022] Updated weights on worker 0-0, policy_version 61957 (0.00096) [2022-07-09 03:11:45,940][26022] Updated weights on worker 0-0, policy_version 61967 (0.00088) [2022-07-09 03:11:47,325][25689] Fps is (10 sec: 5927.5, 60 sec: 5804.3, 300 sec: 5778.9). Total num frames: 63462400. Throughput: 0: 6078.3. Samples: 63468888. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:11:47,326][25689] Avg episode reward: [(0, '-57.570')] [2022-07-09 03:11:47,791][26022] Updated weights on worker 0-0, policy_version 61977 (0.00086) [2022-07-09 03:11:49,435][26022] Updated weights on worker 0-0, policy_version 61987 (0.00084) [2022-07-09 03:11:51,488][26022] Updated weights on worker 0-0, policy_version 61997 (0.00086) [2022-07-09 03:11:52,372][25689] Fps is (10 sec: 5495.4, 60 sec: 5749.6, 300 sec: 5778.7). Total num frames: 63490048. Throughput: 0: 5198.0. Samples: 63486314. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:11:52,372][25689] Avg episode reward: [(0, '-58.393')] [2022-07-09 03:11:52,950][26022] Updated weights on worker 0-0, policy_version 62007 (0.00088) [2022-07-09 03:11:54,877][26022] Updated weights on worker 0-0, policy_version 62017 (0.00050) [2022-07-09 03:11:56,574][26022] Updated weights on worker 0-0, policy_version 62027 (0.00555) [2022-07-09 03:11:57,441][25689] Fps is (10 sec: 5770.1, 60 sec: 5789.3, 300 sec: 5777.9). Total num frames: 63520768. Throughput: 0: 6061.9. Samples: 63521178. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:11:57,441][25689] Avg episode reward: [(0, '-58.751')] [2022-07-09 03:11:58,515][26022] Updated weights on worker 0-0, policy_version 62037 (0.00089) [2022-07-09 03:12:00,067][26022] Updated weights on worker 0-0, policy_version 62047 (0.00415) [2022-07-09 03:12:02,189][26022] Updated weights on worker 0-0, policy_version 62057 (0.00119) [2022-07-09 03:12:02,449][25689] Fps is (10 sec: 5690.1, 60 sec: 5758.6, 300 sec: 5778.3). Total num frames: 63547392. Throughput: 0: 5967.7. Samples: 63554396. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:02,451][25689] Avg episode reward: [(0, '-58.630')] [2022-07-09 03:12:03,882][26022] Updated weights on worker 0-0, policy_version 62067 (0.00081) [2022-07-09 03:12:05,796][26022] Updated weights on worker 0-0, policy_version 62077 (0.00094) [2022-07-09 03:12:07,465][25689] Fps is (10 sec: 5516.4, 60 sec: 5775.0, 300 sec: 5779.5). Total num frames: 63576064. Throughput: 0: 5119.8. Samples: 63571998. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:07,465][25689] Avg episode reward: [(0, '-58.746')] [2022-07-09 03:12:07,479][26022] Updated weights on worker 0-0, policy_version 62087 (0.00088) [2022-07-09 03:12:09,147][26022] Updated weights on worker 0-0, policy_version 62097 (0.00088) [2022-07-09 03:12:11,036][26022] Updated weights on worker 0-0, policy_version 62107 (0.00091) [2022-07-09 03:12:12,490][25689] Fps is (10 sec: 5813.2, 60 sec: 5776.9, 300 sec: 5784.1). Total num frames: 63605760. Throughput: 0: 5987.1. Samples: 63606764. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:12,491][25689] Avg episode reward: [(0, '-58.522')] [2022-07-09 03:12:12,944][26022] Updated weights on worker 0-0, policy_version 62117 (0.00091) [2022-07-09 03:12:14,562][26022] Updated weights on worker 0-0, policy_version 62127 (0.00094) [2022-07-09 03:12:16,398][26022] Updated weights on worker 0-0, policy_version 62137 (0.00082) [2022-07-09 03:12:17,627][25689] Fps is (10 sec: 5844.4, 60 sec: 5770.7, 300 sec: 5783.1). Total num frames: 63635456. Throughput: 0: 5949.9. Samples: 63641284. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:17,627][25689] Avg episode reward: [(0, '-58.149')] [2022-07-09 03:12:18,147][26022] Updated weights on worker 0-0, policy_version 62147 (0.00096) [2022-07-09 03:12:19,845][26022] Updated weights on worker 0-0, policy_version 62157 (0.00086) [2022-07-09 03:12:21,103][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:12:21,115][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000062163_63654912.pth [2022-07-09 03:12:21,115][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000060132_61575168.pth [2022-07-09 03:12:21,650][26022] Updated weights on worker 0-0, policy_version 62167 (0.00095) [2022-07-09 03:12:22,645][25689] Fps is (10 sec: 5848.7, 60 sec: 5806.3, 300 sec: 5786.3). Total num frames: 63665152. Throughput: 0: 5171.2. Samples: 63658830. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:22,645][25689] Avg episode reward: [(0, '-57.107')] [2022-07-09 03:12:23,335][26022] Updated weights on worker 0-0, policy_version 62177 (0.00079) [2022-07-09 03:12:25,269][26022] Updated weights on worker 0-0, policy_version 62187 (0.00091) [2022-07-09 03:12:27,118][26022] Updated weights on worker 0-0, policy_version 62197 (0.01198) [2022-07-09 03:12:27,716][25689] Fps is (10 sec: 5683.6, 60 sec: 5770.7, 300 sec: 5782.2). Total num frames: 63692800. Throughput: 0: 5989.0. Samples: 63693286. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:27,717][25689] Avg episode reward: [(0, '-57.876')] [2022-07-09 03:12:28,698][26022] Updated weights on worker 0-0, policy_version 62207 (0.00085) [2022-07-09 03:12:30,638][26022] Updated weights on worker 0-0, policy_version 62217 (0.00797) [2022-07-09 03:12:32,407][26022] Updated weights on worker 0-0, policy_version 62227 (0.00087) [2022-07-09 03:12:32,719][25689] Fps is (10 sec: 5692.2, 60 sec: 5757.7, 300 sec: 5783.0). Total num frames: 63722496. Throughput: 0: 5989.7. Samples: 63727930. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:32,719][25689] Avg episode reward: [(0, '-57.197')] [2022-07-09 03:12:34,147][26022] Updated weights on worker 0-0, policy_version 62237 (0.00089) [2022-07-09 03:12:35,814][26022] Updated weights on worker 0-0, policy_version 62247 (0.00085) [2022-07-09 03:12:37,521][26022] Updated weights on worker 0-0, policy_version 62257 (0.00083) [2022-07-09 03:12:37,805][25689] Fps is (10 sec: 5785.5, 60 sec: 5754.8, 300 sec: 5781.3). Total num frames: 63751168. Throughput: 0: 5165.6. Samples: 63745516. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:37,805][25689] Avg episode reward: [(0, '-56.910')] [2022-07-09 03:12:39,379][26022] Updated weights on worker 0-0, policy_version 62267 (0.00087) [2022-07-09 03:12:41,195][26022] Updated weights on worker 0-0, policy_version 62277 (0.00082) [2022-07-09 03:12:42,855][25689] Fps is (10 sec: 5758.3, 60 sec: 5716.5, 300 sec: 5784.0). Total num frames: 63780864. Throughput: 0: 6017.8. Samples: 63780454. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:42,856][25689] Avg episode reward: [(0, '-56.352')] [2022-07-09 03:12:42,967][26022] Updated weights on worker 0-0, policy_version 62287 (0.00089) [2022-07-09 03:12:44,598][26022] Updated weights on worker 0-0, policy_version 62297 (0.00084) [2022-07-09 03:12:46,441][26022] Updated weights on worker 0-0, policy_version 62307 (0.00084) [2022-07-09 03:12:47,883][25689] Fps is (10 sec: 5892.9, 60 sec: 5749.2, 300 sec: 5783.8). Total num frames: 63810560. Throughput: 0: 6052.7. Samples: 63815352. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:47,884][25689] Avg episode reward: [(0, '-56.812')] [2022-07-09 03:12:48,106][26022] Updated weights on worker 0-0, policy_version 62317 (0.00091) [2022-07-09 03:12:49,817][26022] Updated weights on worker 0-0, policy_version 62327 (0.00089) [2022-07-09 03:12:51,785][26022] Updated weights on worker 0-0, policy_version 62337 (0.00087) [2022-07-09 03:12:52,886][25689] Fps is (10 sec: 5819.0, 60 sec: 5770.3, 300 sec: 5781.1). Total num frames: 63839232. Throughput: 0: 5192.1. Samples: 63832640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-09 03:12:52,887][25689] Avg episode reward: [(0, '-56.759')] [2022-07-09 03:12:53,267][26022] Updated weights on worker 0-0, policy_version 62347 (0.00083) [2022-07-09 03:12:55,376][26022] Updated weights on worker 0-0, policy_version 62357 (0.00083) [2022-07-09 03:12:56,886][26022] Updated weights on worker 0-0, policy_version 62367 (0.00090) [2022-07-09 03:12:57,929][25689] Fps is (10 sec: 5810.3, 60 sec: 5755.8, 300 sec: 5780.6). Total num frames: 63868928. Throughput: 0: 6063.6. Samples: 63867540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:12:57,929][25689] Avg episode reward: [(0, '-55.055')] [2022-07-09 03:12:58,799][26022] Updated weights on worker 0-0, policy_version 62377 (0.00083) [2022-07-09 03:13:00,472][26022] Updated weights on worker 0-0, policy_version 62387 (0.00084) [2022-07-09 03:13:02,335][26022] Updated weights on worker 0-0, policy_version 62397 (0.00087) [2022-07-09 03:13:02,967][25689] Fps is (10 sec: 5688.1, 60 sec: 5769.9, 300 sec: 5779.9). Total num frames: 63896576. Throughput: 0: 6066.3. Samples: 63902460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:02,968][25689] Avg episode reward: [(0, '-55.423')] [2022-07-09 03:13:04,526][26022] Updated weights on worker 0-0, policy_version 62407 (0.00086) [2022-07-09 03:13:05,941][26022] Updated weights on worker 0-0, policy_version 62417 (0.00091) [2022-07-09 03:13:07,970][26022] Updated weights on worker 0-0, policy_version 62427 (0.00289) [2022-07-09 03:13:08,014][25689] Fps is (10 sec: 5584.3, 60 sec: 5766.9, 300 sec: 5782.7). Total num frames: 63925248. Throughput: 0: 5093.3. Samples: 63917886. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:08,015][25689] Avg episode reward: [(0, '-55.553')] [2022-07-09 03:13:09,518][26022] Updated weights on worker 0-0, policy_version 62437 (0.00086) [2022-07-09 03:13:11,465][26022] Updated weights on worker 0-0, policy_version 62447 (0.00056) [2022-07-09 03:13:13,030][25689] Fps is (10 sec: 5698.8, 60 sec: 5750.9, 300 sec: 5779.9). Total num frames: 63953920. Throughput: 0: 5959.0. Samples: 63952678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:13,031][25689] Avg episode reward: [(0, '-55.153')] [2022-07-09 03:13:13,408][26022] Updated weights on worker 0-0, policy_version 62457 (0.00088) [2022-07-09 03:13:14,884][26022] Updated weights on worker 0-0, policy_version 62467 (0.00089) [2022-07-09 03:13:16,842][26022] Updated weights on worker 0-0, policy_version 62477 (0.00089) [2022-07-09 03:13:18,162][25689] Fps is (10 sec: 5853.2, 60 sec: 5768.3, 300 sec: 5780.9). Total num frames: 63984640. Throughput: 0: 5932.5. Samples: 63987570. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:18,162][25689] Avg episode reward: [(0, '-55.395')] [2022-07-09 03:13:18,534][26022] Updated weights on worker 0-0, policy_version 62487 (0.00083) [2022-07-09 03:13:20,277][26022] Updated weights on worker 0-0, policy_version 62497 (0.00088) [2022-07-09 03:13:22,185][26022] Updated weights on worker 0-0, policy_version 62507 (0.00088) [2022-07-09 03:13:23,186][25689] Fps is (10 sec: 5949.0, 60 sec: 5767.7, 300 sec: 5781.0). Total num frames: 64014336. Throughput: 0: 5952.6. Samples: 64022810. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:23,186][25689] Avg episode reward: [(0, '-55.240')] [2022-07-09 03:13:23,431][26022] Updated weights on worker 0-0, policy_version 62517 (0.00094) [2022-07-09 03:13:25,709][26022] Updated weights on worker 0-0, policy_version 62527 (0.00092) [2022-07-09 03:13:27,122][26022] Updated weights on worker 0-0, policy_version 62537 (0.00084) [2022-07-09 03:13:28,246][25689] Fps is (10 sec: 5686.6, 60 sec: 5768.8, 300 sec: 5780.4). Total num frames: 64041984. Throughput: 0: 6041.7. Samples: 64040116. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:28,246][25689] Avg episode reward: [(0, '-55.299')] [2022-07-09 03:13:29,158][26022] Updated weights on worker 0-0, policy_version 62547 (0.00081) [2022-07-09 03:13:30,796][26022] Updated weights on worker 0-0, policy_version 62557 (0.00083) [2022-07-09 03:13:32,524][26022] Updated weights on worker 0-0, policy_version 62567 (0.00087) [2022-07-09 03:13:33,292][25689] Fps is (10 sec: 5674.0, 60 sec: 5764.7, 300 sec: 5777.0). Total num frames: 64071680. Throughput: 0: 6033.6. Samples: 64074932. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:33,293][25689] Avg episode reward: [(0, '-56.065')] [2022-07-09 03:13:34,275][26022] Updated weights on worker 0-0, policy_version 62577 (0.00078) [2022-07-09 03:13:36,131][26022] Updated weights on worker 0-0, policy_version 62587 (0.00080) [2022-07-09 03:13:37,903][26022] Updated weights on worker 0-0, policy_version 62597 (0.00090) [2022-07-09 03:13:38,406][25689] Fps is (10 sec: 5845.5, 60 sec: 5778.9, 300 sec: 5778.6). Total num frames: 64101376. Throughput: 0: 6045.8. Samples: 64109964. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:38,406][25689] Avg episode reward: [(0, '-56.512')] [2022-07-09 03:13:39,786][26022] Updated weights on worker 0-0, policy_version 62607 (0.00083) [2022-07-09 03:13:41,344][26022] Updated weights on worker 0-0, policy_version 62617 (0.00104) [2022-07-09 03:13:43,319][26022] Updated weights on worker 0-0, policy_version 62627 (0.00083) [2022-07-09 03:13:43,427][25689] Fps is (10 sec: 5860.3, 60 sec: 5781.7, 300 sec: 5785.4). Total num frames: 64131072. Throughput: 0: 5175.6. Samples: 64127570. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:43,427][25689] Avg episode reward: [(0, '-56.579')] [2022-07-09 03:13:44,800][26022] Updated weights on worker 0-0, policy_version 62637 (0.00093) [2022-07-09 03:13:46,894][26022] Updated weights on worker 0-0, policy_version 62647 (0.00055) [2022-07-09 03:13:48,453][25689] Fps is (10 sec: 5911.2, 60 sec: 5781.9, 300 sec: 5785.4). Total num frames: 64160768. Throughput: 0: 6062.5. Samples: 64162624. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:48,455][25689] Avg episode reward: [(0, '-56.953')] [2022-07-09 03:13:48,456][26022] Updated weights on worker 0-0, policy_version 62657 (0.00092) [2022-07-09 03:13:50,178][26022] Updated weights on worker 0-0, policy_version 62667 (0.00087) [2022-07-09 03:13:51,991][26022] Updated weights on worker 0-0, policy_version 62677 (0.00083) [2022-07-09 03:13:53,464][25689] Fps is (10 sec: 5713.1, 60 sec: 5764.2, 300 sec: 5776.7). Total num frames: 64188416. Throughput: 0: 6078.9. Samples: 64197554. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:53,464][25689] Avg episode reward: [(0, '-57.454')] [2022-07-09 03:13:53,757][26022] Updated weights on worker 0-0, policy_version 62687 (0.00087) [2022-07-09 03:13:55,455][26022] Updated weights on worker 0-0, policy_version 62697 (0.00088) [2022-07-09 03:13:57,289][26022] Updated weights on worker 0-0, policy_version 62707 (0.00087) [2022-07-09 03:13:58,568][25689] Fps is (10 sec: 5770.3, 60 sec: 5775.2, 300 sec: 5778.4). Total num frames: 64219136. Throughput: 0: 5207.2. Samples: 64214954. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:13:58,569][25689] Avg episode reward: [(0, '-56.748')] [2022-07-09 03:13:58,803][26022] Updated weights on worker 0-0, policy_version 62717 (0.00083) [2022-07-09 03:14:00,826][26022] Updated weights on worker 0-0, policy_version 62727 (0.00088) [2022-07-09 03:14:02,736][26022] Updated weights on worker 0-0, policy_version 62737 (0.00091) [2022-07-09 03:14:03,635][25689] Fps is (10 sec: 5738.7, 60 sec: 5772.6, 300 sec: 5781.4). Total num frames: 64246784. Throughput: 0: 6027.7. Samples: 64249378. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:14:03,635][25689] Avg episode reward: [(0, '-57.193')] [2022-07-09 03:14:04,673][26022] Updated weights on worker 0-0, policy_version 62747 (0.00087) [2022-07-09 03:14:06,253][26022] Updated weights on worker 0-0, policy_version 62757 (0.00088) [2022-07-09 03:14:08,147][26022] Updated weights on worker 0-0, policy_version 62767 (0.00089) [2022-07-09 03:14:08,670][25689] Fps is (10 sec: 5676.8, 60 sec: 5790.6, 300 sec: 5781.7). Total num frames: 64276480. Throughput: 0: 5970.5. Samples: 64283328. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 03:14:08,671][25689] Avg episode reward: [(0, '-56.703')] [2022-07-09 03:14:09,902][26022] Updated weights on worker 0-0, policy_version 62777 (0.00084) [2022-07-09 03:14:11,747][26022] Updated weights on worker 0-0, policy_version 62787 (0.00085) [2022-07-09 03:14:13,645][26022] Updated weights on worker 0-0, policy_version 62797 (0.00088) [2022-07-09 03:14:13,679][25689] Fps is (10 sec: 5709.4, 60 sec: 5774.4, 300 sec: 5776.6). Total num frames: 64304128. Throughput: 0: 5103.9. Samples: 64300728. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:13,680][25689] Avg episode reward: [(0, '-56.515')] [2022-07-09 03:14:14,952][26022] Updated weights on worker 0-0, policy_version 62807 (0.00091) [2022-07-09 03:14:17,033][26022] Updated weights on worker 0-0, policy_version 62817 (0.00088) [2022-07-09 03:14:18,728][26022] Updated weights on worker 0-0, policy_version 62827 (0.00092) [2022-07-09 03:14:18,761][25689] Fps is (10 sec: 5783.9, 60 sec: 5779.0, 300 sec: 5778.9). Total num frames: 64334848. Throughput: 0: 5983.4. Samples: 64335776. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:18,762][25689] Avg episode reward: [(0, '-55.961')] [2022-07-09 03:14:20,443][26022] Updated weights on worker 0-0, policy_version 62837 (0.00087) [2022-07-09 03:14:21,199][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:14:21,209][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000062842_64350208.pth [2022-07-09 03:14:21,215][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000060808_62267392.pth [2022-07-09 03:14:22,316][26022] Updated weights on worker 0-0, policy_version 62847 (0.00081) [2022-07-09 03:14:23,772][25689] Fps is (10 sec: 5884.0, 60 sec: 5763.4, 300 sec: 5778.9). Total num frames: 64363520. Throughput: 0: 6037.7. Samples: 64370962. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:23,773][25689] Avg episode reward: [(0, '-55.216')] [2022-07-09 03:14:23,871][26022] Updated weights on worker 0-0, policy_version 62857 (0.00091) [2022-07-09 03:14:25,680][26022] Updated weights on worker 0-0, policy_version 62867 (0.00089) [2022-07-09 03:14:27,463][26022] Updated weights on worker 0-0, policy_version 62877 (0.00082) [2022-07-09 03:14:28,799][25689] Fps is (10 sec: 5814.8, 60 sec: 5800.3, 300 sec: 5776.4). Total num frames: 64393216. Throughput: 0: 5210.3. Samples: 64388208. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:28,800][25689] Avg episode reward: [(0, '-55.541')] [2022-07-09 03:14:29,149][26022] Updated weights on worker 0-0, policy_version 62887 (0.00092) [2022-07-09 03:14:31,045][26022] Updated weights on worker 0-0, policy_version 62897 (0.00086) [2022-07-09 03:14:32,706][26022] Updated weights on worker 0-0, policy_version 62907 (0.00085) [2022-07-09 03:14:33,828][25689] Fps is (10 sec: 5906.3, 60 sec: 5802.0, 300 sec: 5778.7). Total num frames: 64422912. Throughput: 0: 6065.1. Samples: 64422936. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:33,829][25689] Avg episode reward: [(0, '-55.651')] [2022-07-09 03:14:34,586][26022] Updated weights on worker 0-0, policy_version 62917 (0.00088) [2022-07-09 03:14:36,329][26022] Updated weights on worker 0-0, policy_version 62927 (0.00084) [2022-07-09 03:14:38,099][26022] Updated weights on worker 0-0, policy_version 62937 (0.00435) [2022-07-09 03:14:38,873][25689] Fps is (10 sec: 5794.0, 60 sec: 5791.7, 300 sec: 5781.7). Total num frames: 64451584. Throughput: 0: 6077.4. Samples: 64458004. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:38,874][25689] Avg episode reward: [(0, '-55.546')] [2022-07-09 03:14:39,650][26022] Updated weights on worker 0-0, policy_version 62947 (0.00072) [2022-07-09 03:14:41,746][26022] Updated weights on worker 0-0, policy_version 62957 (0.00083) [2022-07-09 03:14:43,230][26022] Updated weights on worker 0-0, policy_version 62967 (0.00085) [2022-07-09 03:14:43,883][25689] Fps is (10 sec: 5703.0, 60 sec: 5775.8, 300 sec: 5771.3). Total num frames: 64480256. Throughput: 0: 5185.8. Samples: 64475252. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:43,884][25689] Avg episode reward: [(0, '-55.737')] [2022-07-09 03:14:45,267][26022] Updated weights on worker 0-0, policy_version 62977 (0.00091) [2022-07-09 03:14:47,041][26022] Updated weights on worker 0-0, policy_version 62987 (0.00087) [2022-07-09 03:14:48,595][26022] Updated weights on worker 0-0, policy_version 62997 (0.00082) [2022-07-09 03:14:48,974][25689] Fps is (10 sec: 5880.1, 60 sec: 5786.6, 300 sec: 5777.0). Total num frames: 64510976. Throughput: 0: 6034.5. Samples: 64509950. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:48,974][25689] Avg episode reward: [(0, '-56.232')] [2022-07-09 03:14:50,538][26022] Updated weights on worker 0-0, policy_version 63007 (0.00083) [2022-07-09 03:14:52,074][26022] Updated weights on worker 0-0, policy_version 63017 (0.00088) [2022-07-09 03:14:54,009][25689] Fps is (10 sec: 5865.2, 60 sec: 5801.1, 300 sec: 5774.2). Total num frames: 64539648. Throughput: 0: 6044.9. Samples: 64544930. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:54,010][25689] Avg episode reward: [(0, '-56.704')] [2022-07-09 03:14:54,014][26022] Updated weights on worker 0-0, policy_version 63027 (0.00085) [2022-07-09 03:14:55,835][26022] Updated weights on worker 0-0, policy_version 63037 (0.00086) [2022-07-09 03:14:57,675][26022] Updated weights on worker 0-0, policy_version 63047 (0.00089) [2022-07-09 03:14:59,103][25689] Fps is (10 sec: 5560.3, 60 sec: 5751.5, 300 sec: 5773.1). Total num frames: 64567296. Throughput: 0: 5130.3. Samples: 64561788. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:14:59,103][25689] Avg episode reward: [(0, '-56.063')] [2022-07-09 03:14:59,424][26022] Updated weights on worker 0-0, policy_version 63057 (0.00084) [2022-07-09 03:15:01,122][26022] Updated weights on worker 0-0, policy_version 63067 (0.00097) [2022-07-09 03:15:03,149][26022] Updated weights on worker 0-0, policy_version 63077 (0.00085) [2022-07-09 03:15:04,148][25689] Fps is (10 sec: 5555.2, 60 sec: 5770.4, 300 sec: 5768.9). Total num frames: 64595968. Throughput: 0: 5893.9. Samples: 64594690. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:15:04,148][25689] Avg episode reward: [(0, '-55.866')] [2022-07-09 03:15:05,111][26022] Updated weights on worker 0-0, policy_version 63087 (0.00094) [2022-07-09 03:15:06,806][26022] Updated weights on worker 0-0, policy_version 63097 (0.00102) [2022-07-09 03:15:08,710][26022] Updated weights on worker 0-0, policy_version 63107 (0.00091) [2022-07-09 03:15:09,167][25689] Fps is (10 sec: 5698.0, 60 sec: 5755.0, 300 sec: 5772.5). Total num frames: 64624640. Throughput: 0: 5917.2. Samples: 64629434. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:15:09,167][25689] Avg episode reward: [(0, '-56.590')] [2022-07-09 03:15:10,260][26022] Updated weights on worker 0-0, policy_version 63117 (0.00087) [2022-07-09 03:15:12,114][26022] Updated weights on worker 0-0, policy_version 63127 (0.00083) [2022-07-09 03:15:13,767][26022] Updated weights on worker 0-0, policy_version 63137 (0.00091) [2022-07-09 03:15:14,169][25689] Fps is (10 sec: 5722.1, 60 sec: 5772.5, 300 sec: 5771.1). Total num frames: 64653312. Throughput: 0: 5046.4. Samples: 64646664. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:15:14,170][25689] Avg episode reward: [(0, '-56.602')] [2022-07-09 03:15:15,795][26022] Updated weights on worker 0-0, policy_version 63147 (0.00086) [2022-07-09 03:15:17,523][26022] Updated weights on worker 0-0, policy_version 63157 (0.00086) [2022-07-09 03:15:19,212][25689] Fps is (10 sec: 5708.3, 60 sec: 5742.4, 300 sec: 5768.2). Total num frames: 64681984. Throughput: 0: 5934.1. Samples: 64681120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:15:19,213][25689] Avg episode reward: [(0, '-55.975')] [2022-07-09 03:15:19,228][26022] Updated weights on worker 0-0, policy_version 63167 (0.00087) [2022-07-09 03:15:20,988][26022] Updated weights on worker 0-0, policy_version 63177 (0.00091) [2022-07-09 03:15:22,628][26022] Updated weights on worker 0-0, policy_version 63187 (0.00087) [2022-07-09 03:15:24,273][25689] Fps is (10 sec: 5979.4, 60 sec: 5788.5, 300 sec: 5777.7). Total num frames: 64713728. Throughput: 0: 6042.8. Samples: 64716304. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:15:24,274][25689] Avg episode reward: [(0, '-55.752')] [2022-07-09 03:15:24,278][26022] Updated weights on worker 0-0, policy_version 63197 (0.00086) [2022-07-09 03:15:26,220][26022] Updated weights on worker 0-0, policy_version 63207 (0.00093) [2022-07-09 03:15:27,820][26022] Updated weights on worker 0-0, policy_version 63217 (0.00089) [2022-07-09 03:15:29,304][25689] Fps is (10 sec: 5885.5, 60 sec: 5754.3, 300 sec: 5770.7). Total num frames: 64741376. Throughput: 0: 5182.4. Samples: 64733792. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-09 03:15:29,305][25689] Avg episode reward: [(0, '-55.617')] [2022-07-09 03:15:29,791][26022] Updated weights on worker 0-0, policy_version 63227 (0.00087) [2022-07-09 03:15:31,679][26022] Updated weights on worker 0-0, policy_version 63237 (0.00088) [2022-07-09 03:15:33,352][26022] Updated weights on worker 0-0, policy_version 63247 (0.00083) [2022-07-09 03:15:34,321][25689] Fps is (10 sec: 5605.3, 60 sec: 5738.5, 300 sec: 5771.3). Total num frames: 64770048. Throughput: 0: 6054.4. Samples: 64768670. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:15:34,321][25689] Avg episode reward: [(0, '-55.829')] [2022-07-09 03:15:35,275][26022] Updated weights on worker 0-0, policy_version 63257 (0.00086) [2022-07-09 03:15:37,108][26022] Updated weights on worker 0-0, policy_version 63267 (0.00087) [2022-07-09 03:15:38,702][26022] Updated weights on worker 0-0, policy_version 63277 (0.00090) [2022-07-09 03:15:39,440][25689] Fps is (10 sec: 5758.1, 60 sec: 5748.3, 300 sec: 5766.2). Total num frames: 64799744. Throughput: 0: 6027.6. Samples: 64803046. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:15:39,442][25689] Avg episode reward: [(0, '-55.670')] [2022-07-09 03:15:40,526][26022] Updated weights on worker 0-0, policy_version 63287 (0.00090) [2022-07-09 03:15:42,187][26022] Updated weights on worker 0-0, policy_version 63297 (0.00086) [2022-07-09 03:15:44,184][26022] Updated weights on worker 0-0, policy_version 63307 (0.00083) [2022-07-09 03:15:44,472][25689] Fps is (10 sec: 5649.0, 60 sec: 5729.4, 300 sec: 5765.7). Total num frames: 64827392. Throughput: 0: 6004.2. Samples: 64837582. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:15:44,472][25689] Avg episode reward: [(0, '-55.075')] [2022-07-09 03:15:45,684][26022] Updated weights on worker 0-0, policy_version 63317 (0.00090) [2022-07-09 03:15:47,835][26022] Updated weights on worker 0-0, policy_version 63327 (0.00085) [2022-07-09 03:15:49,284][26022] Updated weights on worker 0-0, policy_version 63337 (0.00090) [2022-07-09 03:15:49,492][25689] Fps is (10 sec: 5806.8, 60 sec: 5736.1, 300 sec: 5765.5). Total num frames: 64858112. Throughput: 0: 5986.6. Samples: 64854650. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:15:49,492][25689] Avg episode reward: [(0, '-55.047')] [2022-07-09 03:15:51,246][26022] Updated weights on worker 0-0, policy_version 63347 (0.00425) [2022-07-09 03:15:52,806][26022] Updated weights on worker 0-0, policy_version 63357 (0.00087) [2022-07-09 03:15:54,495][25689] Fps is (10 sec: 5925.7, 60 sec: 5739.2, 300 sec: 5767.9). Total num frames: 64886784. Throughput: 0: 5990.1. Samples: 64889514. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:15:54,495][25689] Avg episode reward: [(0, '-55.227')] [2022-07-09 03:15:54,653][26022] Updated weights on worker 0-0, policy_version 63367 (0.00086) [2022-07-09 03:15:56,555][26022] Updated weights on worker 0-0, policy_version 63377 (0.01255) [2022-07-09 03:15:58,256][26022] Updated weights on worker 0-0, policy_version 63387 (0.00100) [2022-07-09 03:15:59,567][25689] Fps is (10 sec: 5691.8, 60 sec: 5758.1, 300 sec: 5767.3). Total num frames: 64915456. Throughput: 0: 6019.6. Samples: 64924200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:15:59,567][25689] Avg episode reward: [(0, '-55.253')] [2022-07-09 03:15:59,989][26022] Updated weights on worker 0-0, policy_version 63397 (0.00089) [2022-07-09 03:16:02,174][26022] Updated weights on worker 0-0, policy_version 63407 (0.00087) [2022-07-09 03:16:03,737][26022] Updated weights on worker 0-0, policy_version 63417 (0.00092) [2022-07-09 03:16:04,569][25689] Fps is (10 sec: 5489.1, 60 sec: 5728.4, 300 sec: 5764.1). Total num frames: 64942080. Throughput: 0: 5065.0. Samples: 64939372. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:04,569][25689] Avg episode reward: [(0, '-55.577')] [2022-07-09 03:16:05,774][26022] Updated weights on worker 0-0, policy_version 63427 (0.00093) [2022-07-09 03:16:07,386][26022] Updated weights on worker 0-0, policy_version 63437 (0.00089) [2022-07-09 03:16:09,221][26022] Updated weights on worker 0-0, policy_version 63447 (0.00090) [2022-07-09 03:16:09,611][25689] Fps is (10 sec: 5505.4, 60 sec: 5726.1, 300 sec: 5760.7). Total num frames: 64970752. Throughput: 0: 5944.5. Samples: 64974246. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:09,611][25689] Avg episode reward: [(0, '-55.983')] [2022-07-09 03:16:10,913][26022] Updated weights on worker 0-0, policy_version 63457 (0.00083) [2022-07-09 03:16:12,869][26022] Updated weights on worker 0-0, policy_version 63467 (0.00085) [2022-07-09 03:16:14,615][25689] Fps is (10 sec: 5708.1, 60 sec: 5726.0, 300 sec: 5758.6). Total num frames: 64999424. Throughput: 0: 5951.2. Samples: 65009252. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:14,615][25689] Avg episode reward: [(0, '-56.058')] [2022-07-09 03:16:14,641][26022] Updated weights on worker 0-0, policy_version 63477 (0.00086) [2022-07-09 03:16:16,397][26022] Updated weights on worker 0-0, policy_version 63487 (0.00090) [2022-07-09 03:16:18,098][26022] Updated weights on worker 0-0, policy_version 63497 (0.00095) [2022-07-09 03:16:19,741][25689] Fps is (10 sec: 5761.6, 60 sec: 5735.0, 300 sec: 5763.7). Total num frames: 65029120. Throughput: 0: 5071.9. Samples: 65026524. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:19,742][25689] Avg episode reward: [(0, '-56.357')] [2022-07-09 03:16:20,000][26022] Updated weights on worker 0-0, policy_version 63507 (0.00088) [2022-07-09 03:16:21,262][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:16:21,277][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000063515_65039360.pth [2022-07-09 03:16:21,277][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000061486_62961664.pth [2022-07-09 03:16:21,639][26022] Updated weights on worker 0-0, policy_version 63517 (0.00086) [2022-07-09 03:16:23,464][26022] Updated weights on worker 0-0, policy_version 63527 (0.00083) [2022-07-09 03:16:24,750][25689] Fps is (10 sec: 5960.7, 60 sec: 5723.0, 300 sec: 5768.0). Total num frames: 65059840. Throughput: 0: 6040.7. Samples: 65061284. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:24,751][25689] Avg episode reward: [(0, '-56.011')] [2022-07-09 03:16:25,091][26022] Updated weights on worker 0-0, policy_version 63537 (0.00091) [2022-07-09 03:16:26,970][26022] Updated weights on worker 0-0, policy_version 63547 (0.00089) [2022-07-09 03:16:28,670][26022] Updated weights on worker 0-0, policy_version 63557 (0.00082) [2022-07-09 03:16:29,776][25689] Fps is (10 sec: 5918.5, 60 sec: 5740.4, 300 sec: 5761.4). Total num frames: 65088512. Throughput: 0: 6032.7. Samples: 65095898. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:29,777][25689] Avg episode reward: [(0, '-55.432')] [2022-07-09 03:16:30,755][26022] Updated weights on worker 0-0, policy_version 63567 (0.00085) [2022-07-09 03:16:32,141][26022] Updated weights on worker 0-0, policy_version 63577 (0.00084) [2022-07-09 03:16:34,071][26022] Updated weights on worker 0-0, policy_version 63587 (0.00085) [2022-07-09 03:16:34,782][25689] Fps is (10 sec: 5716.1, 60 sec: 5741.4, 300 sec: 5762.4). Total num frames: 65117184. Throughput: 0: 5160.8. Samples: 65113334. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:34,783][25689] Avg episode reward: [(0, '-54.898')] [2022-07-09 03:16:35,634][26022] Updated weights on worker 0-0, policy_version 63597 (0.00093) [2022-07-09 03:16:37,583][26022] Updated weights on worker 0-0, policy_version 63607 (0.00085) [2022-07-09 03:16:39,226][26022] Updated weights on worker 0-0, policy_version 63617 (0.00088) [2022-07-09 03:16:39,850][25689] Fps is (10 sec: 5692.3, 60 sec: 5729.4, 300 sec: 5750.8). Total num frames: 65145856. Throughput: 0: 6065.2. Samples: 65148488. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:39,851][25689] Avg episode reward: [(0, '-54.306')] [2022-07-09 03:16:40,903][26022] Updated weights on worker 0-0, policy_version 63627 (0.00084) [2022-07-09 03:16:42,899][26022] Updated weights on worker 0-0, policy_version 63637 (0.00098) [2022-07-09 03:16:44,595][26022] Updated weights on worker 0-0, policy_version 63647 (0.00088) [2022-07-09 03:16:44,858][25689] Fps is (10 sec: 5793.2, 60 sec: 5765.6, 300 sec: 5757.9). Total num frames: 65175552. Throughput: 0: 6081.7. Samples: 65183570. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 03:16:44,858][25689] Avg episode reward: [(0, '-54.624')] [2022-07-09 03:16:46,373][26022] Updated weights on worker 0-0, policy_version 63657 (0.00095) [2022-07-09 03:16:48,143][26022] Updated weights on worker 0-0, policy_version 63667 (0.00089) [2022-07-09 03:16:49,805][26022] Updated weights on worker 0-0, policy_version 63677 (0.00092) [2022-07-09 03:16:49,878][25689] Fps is (10 sec: 5922.8, 60 sec: 5748.6, 300 sec: 5765.3). Total num frames: 65205248. Throughput: 0: 5220.1. Samples: 65200830. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:16:49,878][25689] Avg episode reward: [(0, '-54.547')] [2022-07-09 03:16:51,815][26022] Updated weights on worker 0-0, policy_version 63687 (0.00087) [2022-07-09 03:16:53,427][26022] Updated weights on worker 0-0, policy_version 63697 (0.00088) [2022-07-09 03:16:54,883][25689] Fps is (10 sec: 5719.9, 60 sec: 5731.4, 300 sec: 5756.2). Total num frames: 65232896. Throughput: 0: 6065.5. Samples: 65235254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:16:54,885][25689] Avg episode reward: [(0, '-54.857')] [2022-07-09 03:16:55,273][26022] Updated weights on worker 0-0, policy_version 63707 (0.00082) [2022-07-09 03:16:56,943][26022] Updated weights on worker 0-0, policy_version 63717 (0.00086) [2022-07-09 03:16:58,694][26022] Updated weights on worker 0-0, policy_version 63727 (0.00082) [2022-07-09 03:16:59,916][25689] Fps is (10 sec: 5610.7, 60 sec: 5735.2, 300 sec: 5762.6). Total num frames: 65261568. Throughput: 0: 6068.5. Samples: 65270256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:16:59,916][25689] Avg episode reward: [(0, '-54.798')] [2022-07-09 03:17:00,426][26022] Updated weights on worker 0-0, policy_version 63737 (0.00107) [2022-07-09 03:17:02,815][26022] Updated weights on worker 0-0, policy_version 63747 (0.00092) [2022-07-09 03:17:04,329][26022] Updated weights on worker 0-0, policy_version 63757 (0.00953) [2022-07-09 03:17:04,939][25689] Fps is (10 sec: 5702.5, 60 sec: 5767.1, 300 sec: 5762.4). Total num frames: 65290240. Throughput: 0: 5074.2. Samples: 65285466. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:04,941][25689] Avg episode reward: [(0, '-55.557')] [2022-07-09 03:17:06,419][26022] Updated weights on worker 0-0, policy_version 63767 (0.00080) [2022-07-09 03:17:07,835][26022] Updated weights on worker 0-0, policy_version 63777 (0.00087) [2022-07-09 03:17:09,854][26022] Updated weights on worker 0-0, policy_version 63787 (0.00087) [2022-07-09 03:17:09,947][25689] Fps is (10 sec: 5614.7, 60 sec: 5753.4, 300 sec: 5755.9). Total num frames: 65317888. Throughput: 0: 5952.1. Samples: 65320280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:09,947][25689] Avg episode reward: [(0, '-55.144')] [2022-07-09 03:17:11,535][26022] Updated weights on worker 0-0, policy_version 63797 (0.00091) [2022-07-09 03:17:13,540][26022] Updated weights on worker 0-0, policy_version 63807 (0.00093) [2022-07-09 03:17:14,920][26022] Updated weights on worker 0-0, policy_version 63817 (0.00085) [2022-07-09 03:17:14,952][25689] Fps is (10 sec: 5829.3, 60 sec: 5787.2, 300 sec: 5761.9). Total num frames: 65348608. Throughput: 0: 5952.6. Samples: 65354714. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:14,953][25689] Avg episode reward: [(0, '-54.687')] [2022-07-09 03:17:17,026][26022] Updated weights on worker 0-0, policy_version 63827 (0.00081) [2022-07-09 03:17:18,542][26022] Updated weights on worker 0-0, policy_version 63837 (0.00088) [2022-07-09 03:17:20,039][25689] Fps is (10 sec: 5580.3, 60 sec: 5723.0, 300 sec: 5746.8). Total num frames: 65374208. Throughput: 0: 5061.1. Samples: 65372104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:20,040][25689] Avg episode reward: [(0, '-55.719')] [2022-07-09 03:17:20,635][26022] Updated weights on worker 0-0, policy_version 63847 (0.00095) [2022-07-09 03:17:22,027][26022] Updated weights on worker 0-0, policy_version 63857 (0.00092) [2022-07-09 03:17:24,158][26022] Updated weights on worker 0-0, policy_version 63867 (0.00066) [2022-07-09 03:17:25,058][25689] Fps is (10 sec: 5674.1, 60 sec: 5739.1, 300 sec: 5761.6). Total num frames: 65405952. Throughput: 0: 6008.6. Samples: 65406354. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:25,058][25689] Avg episode reward: [(0, '-55.838')] [2022-07-09 03:17:25,743][26022] Updated weights on worker 0-0, policy_version 63877 (0.00097) [2022-07-09 03:17:27,643][26022] Updated weights on worker 0-0, policy_version 63887 (0.00052) [2022-07-09 03:17:29,422][26022] Updated weights on worker 0-0, policy_version 63897 (0.00084) [2022-07-09 03:17:30,095][25689] Fps is (10 sec: 5906.2, 60 sec: 5721.1, 300 sec: 5754.0). Total num frames: 65433600. Throughput: 0: 5980.3. Samples: 65440774. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:30,096][25689] Avg episode reward: [(0, '-56.152')] [2022-07-09 03:17:31,187][26022] Updated weights on worker 0-0, policy_version 63907 (0.00086) [2022-07-09 03:17:32,907][26022] Updated weights on worker 0-0, policy_version 63917 (0.00085) [2022-07-09 03:17:35,011][26022] Updated weights on worker 0-0, policy_version 63927 (0.00098) [2022-07-09 03:17:35,097][25689] Fps is (10 sec: 5507.9, 60 sec: 5704.4, 300 sec: 5752.2). Total num frames: 65461248. Throughput: 0: 5129.5. Samples: 65458052. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:35,098][25689] Avg episode reward: [(0, '-56.028')] [2022-07-09 03:17:36,428][26022] Updated weights on worker 0-0, policy_version 63937 (0.00108) [2022-07-09 03:17:38,515][26022] Updated weights on worker 0-0, policy_version 63947 (0.00084) [2022-07-09 03:17:40,034][26022] Updated weights on worker 0-0, policy_version 63957 (0.00105) [2022-07-09 03:17:40,158][25689] Fps is (10 sec: 5800.6, 60 sec: 5739.1, 300 sec: 5755.4). Total num frames: 65491968. Throughput: 0: 5982.8. Samples: 65492468. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:40,158][25689] Avg episode reward: [(0, '-56.598')] [2022-07-09 03:17:41,920][26022] Updated weights on worker 0-0, policy_version 63967 (0.00086) [2022-07-09 03:17:43,695][26022] Updated weights on worker 0-0, policy_version 63977 (0.00088) [2022-07-09 03:17:45,172][25689] Fps is (10 sec: 5895.3, 60 sec: 5721.5, 300 sec: 5752.3). Total num frames: 65520640. Throughput: 0: 6024.6. Samples: 65527532. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:45,172][25689] Avg episode reward: [(0, '-56.693')] [2022-07-09 03:17:45,419][26022] Updated weights on worker 0-0, policy_version 63987 (0.00085) [2022-07-09 03:17:47,138][26022] Updated weights on worker 0-0, policy_version 63997 (0.00092) [2022-07-09 03:17:49,142][26022] Updated weights on worker 0-0, policy_version 64007 (0.00092) [2022-07-09 03:17:50,208][25689] Fps is (10 sec: 5705.8, 60 sec: 5703.0, 300 sec: 5751.6). Total num frames: 65549312. Throughput: 0: 5178.3. Samples: 65544924. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:50,208][25689] Avg episode reward: [(0, '-55.987')] [2022-07-09 03:17:50,682][26022] Updated weights on worker 0-0, policy_version 64017 (0.00092) [2022-07-09 03:17:52,676][26022] Updated weights on worker 0-0, policy_version 64027 (0.00093) [2022-07-09 03:17:54,303][26022] Updated weights on worker 0-0, policy_version 64037 (0.00081) [2022-07-09 03:17:55,217][25689] Fps is (10 sec: 5912.2, 60 sec: 5753.5, 300 sec: 5755.7). Total num frames: 65580032. Throughput: 0: 6037.6. Samples: 65579528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:17:55,219][25689] Avg episode reward: [(0, '-55.951')] [2022-07-09 03:17:56,115][26022] Updated weights on worker 0-0, policy_version 64047 (0.00091) [2022-07-09 03:17:57,823][26022] Updated weights on worker 0-0, policy_version 64057 (0.00081) [2022-07-09 03:17:59,718][26022] Updated weights on worker 0-0, policy_version 64067 (0.00095) [2022-07-09 03:18:00,300][25689] Fps is (10 sec: 5681.9, 60 sec: 5714.9, 300 sec: 5751.4). Total num frames: 65606656. Throughput: 0: 6051.7. Samples: 65614364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:18:00,302][25689] Avg episode reward: [(0, '-55.910')] [2022-07-09 03:18:01,197][26022] Updated weights on worker 0-0, policy_version 64077 (0.00094) [2022-07-09 03:18:03,750][26022] Updated weights on worker 0-0, policy_version 64087 (0.00102) [2022-07-09 03:18:05,149][26022] Updated weights on worker 0-0, policy_version 64097 (0.00094) [2022-07-09 03:18:05,356][25689] Fps is (10 sec: 5454.3, 60 sec: 5711.8, 300 sec: 5751.2). Total num frames: 65635328. Throughput: 0: 5054.4. Samples: 65629548. Policy #0 lag: (min: 0.0, avg: 9.1, max: 23.0) [2022-07-09 03:18:05,360][25689] Avg episode reward: [(0, '-55.422')] [2022-07-09 03:18:07,077][26022] Updated weights on worker 0-0, policy_version 64107 (0.00092) [2022-07-09 03:18:08,902][26022] Updated weights on worker 0-0, policy_version 64117 (0.00082) [2022-07-09 03:18:10,421][25689] Fps is (10 sec: 5665.7, 60 sec: 5723.2, 300 sec: 5750.3). Total num frames: 65664000. Throughput: 0: 5896.8. Samples: 65664120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:10,422][25689] Avg episode reward: [(0, '-55.178')] [2022-07-09 03:18:10,576][26022] Updated weights on worker 0-0, policy_version 64127 (0.00053) [2022-07-09 03:18:12,379][26022] Updated weights on worker 0-0, policy_version 64137 (0.00079) [2022-07-09 03:18:14,104][26022] Updated weights on worker 0-0, policy_version 64147 (0.00088) [2022-07-09 03:18:15,498][25689] Fps is (10 sec: 5755.1, 60 sec: 5699.6, 300 sec: 5747.9). Total num frames: 65693696. Throughput: 0: 5906.4. Samples: 65699310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:15,498][25689] Avg episode reward: [(0, '-55.442')] [2022-07-09 03:18:15,939][26022] Updated weights on worker 0-0, policy_version 64157 (0.00084) [2022-07-09 03:18:17,789][26022] Updated weights on worker 0-0, policy_version 64167 (0.00085) [2022-07-09 03:18:19,470][26022] Updated weights on worker 0-0, policy_version 64177 (0.00086) [2022-07-09 03:18:20,613][25689] Fps is (10 sec: 5827.6, 60 sec: 5764.6, 300 sec: 5746.1). Total num frames: 65723392. Throughput: 0: 5039.1. Samples: 65716722. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:20,613][25689] Avg episode reward: [(0, '-55.493')] [2022-07-09 03:18:21,332][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:18:21,345][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000064187_65727488.pth [2022-07-09 03:18:21,346][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000062163_63654912.pth [2022-07-09 03:18:21,348][26022] Updated weights on worker 0-0, policy_version 64187 (0.00086) [2022-07-09 03:18:23,039][26022] Updated weights on worker 0-0, policy_version 64197 (0.00095) [2022-07-09 03:18:24,875][26022] Updated weights on worker 0-0, policy_version 64207 (0.00088) [2022-07-09 03:18:25,625][25689] Fps is (10 sec: 5864.6, 60 sec: 5731.5, 300 sec: 5753.9). Total num frames: 65753088. Throughput: 0: 5998.0. Samples: 65751122. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:25,625][25689] Avg episode reward: [(0, '-55.274')] [2022-07-09 03:18:26,576][26022] Updated weights on worker 0-0, policy_version 64217 (0.00087) [2022-07-09 03:18:28,333][26022] Updated weights on worker 0-0, policy_version 64227 (0.00083) [2022-07-09 03:18:30,116][26022] Updated weights on worker 0-0, policy_version 64237 (0.00097) [2022-07-09 03:18:30,643][25689] Fps is (10 sec: 5819.4, 60 sec: 5750.2, 300 sec: 5751.1). Total num frames: 65781760. Throughput: 0: 6008.0. Samples: 65785610. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:30,643][25689] Avg episode reward: [(0, '-55.189')] [2022-07-09 03:18:32,010][26022] Updated weights on worker 0-0, policy_version 64247 (0.00093) [2022-07-09 03:18:33,603][26022] Updated weights on worker 0-0, policy_version 64257 (0.00089) [2022-07-09 03:18:35,357][26022] Updated weights on worker 0-0, policy_version 64267 (0.00090) [2022-07-09 03:18:35,655][25689] Fps is (10 sec: 5717.2, 60 sec: 5766.2, 300 sec: 5749.6). Total num frames: 65810432. Throughput: 0: 5134.4. Samples: 65802806. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:35,655][25689] Avg episode reward: [(0, '-55.537')] [2022-07-09 03:18:37,209][26022] Updated weights on worker 0-0, policy_version 64277 (0.00084) [2022-07-09 03:18:38,998][26022] Updated weights on worker 0-0, policy_version 64287 (0.00086) [2022-07-09 03:18:40,645][26022] Updated weights on worker 0-0, policy_version 64297 (0.00088) [2022-07-09 03:18:40,734][25689] Fps is (10 sec: 5783.7, 60 sec: 5747.4, 300 sec: 5748.4). Total num frames: 65840128. Throughput: 0: 6013.9. Samples: 65837730. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:40,735][25689] Avg episode reward: [(0, '-55.340')] [2022-07-09 03:18:42,616][26022] Updated weights on worker 0-0, policy_version 64307 (0.00085) [2022-07-09 03:18:44,275][26022] Updated weights on worker 0-0, policy_version 64317 (0.00084) [2022-07-09 03:18:45,794][25689] Fps is (10 sec: 5655.5, 60 sec: 5726.2, 300 sec: 5740.9). Total num frames: 65867776. Throughput: 0: 6022.6. Samples: 65872594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:45,796][25689] Avg episode reward: [(0, '-55.353')] [2022-07-09 03:18:46,168][26022] Updated weights on worker 0-0, policy_version 64327 (0.00082) [2022-07-09 03:18:47,691][26022] Updated weights on worker 0-0, policy_version 64337 (0.00090) [2022-07-09 03:18:49,589][26022] Updated weights on worker 0-0, policy_version 64347 (0.00085) [2022-07-09 03:18:50,812][25689] Fps is (10 sec: 5791.7, 60 sec: 5761.7, 300 sec: 5751.1). Total num frames: 65898496. Throughput: 0: 6038.5. Samples: 65907404. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:50,813][25689] Avg episode reward: [(0, '-55.656')] [2022-07-09 03:18:51,388][26022] Updated weights on worker 0-0, policy_version 64357 (0.00088) [2022-07-09 03:18:53,239][26022] Updated weights on worker 0-0, policy_version 64367 (0.00082) [2022-07-09 03:18:54,967][26022] Updated weights on worker 0-0, policy_version 64377 (0.00088) [2022-07-09 03:18:55,821][25689] Fps is (10 sec: 5821.4, 60 sec: 5711.1, 300 sec: 5742.6). Total num frames: 65926144. Throughput: 0: 6036.8. Samples: 65924544. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:18:55,821][25689] Avg episode reward: [(0, '-55.455')] [2022-07-09 03:18:56,835][26022] Updated weights on worker 0-0, policy_version 64387 (0.00091) [2022-07-09 03:18:58,498][26022] Updated weights on worker 0-0, policy_version 64397 (0.00097) [2022-07-09 03:19:00,369][26022] Updated weights on worker 0-0, policy_version 64407 (0.00089) [2022-07-09 03:19:00,897][25689] Fps is (10 sec: 5686.3, 60 sec: 5762.4, 300 sec: 5749.3). Total num frames: 65955840. Throughput: 0: 6018.8. Samples: 65959084. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:19:00,897][25689] Avg episode reward: [(0, '-55.477')] [2022-07-09 03:19:02,303][26022] Updated weights on worker 0-0, policy_version 64417 (0.00105) [2022-07-09 03:19:04,158][26022] Updated weights on worker 0-0, policy_version 64427 (0.00082) [2022-07-09 03:19:05,930][25689] Fps is (10 sec: 5672.4, 60 sec: 5747.6, 300 sec: 5742.5). Total num frames: 65983488. Throughput: 0: 5928.6. Samples: 65991970. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:19:05,930][25689] Avg episode reward: [(0, '-55.108')] [2022-07-09 03:19:05,931][26022] Updated weights on worker 0-0, policy_version 64437 (0.00082) [2022-07-09 03:19:07,763][26022] Updated weights on worker 0-0, policy_version 64447 (0.00084) [2022-07-09 03:19:09,529][26022] Updated weights on worker 0-0, policy_version 64457 (0.00086) [2022-07-09 03:19:10,998][25689] Fps is (10 sec: 5474.0, 60 sec: 5730.5, 300 sec: 5741.3). Total num frames: 66011136. Throughput: 0: 5048.2. Samples: 66009308. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:19:10,999][25689] Avg episode reward: [(0, '-56.342')] [2022-07-09 03:19:11,290][26022] Updated weights on worker 0-0, policy_version 64467 (0.00090) [2022-07-09 03:19:12,911][26022] Updated weights on worker 0-0, policy_version 64477 (0.00087) [2022-07-09 03:19:14,715][26022] Updated weights on worker 0-0, policy_version 64487 (0.00082) [2022-07-09 03:19:16,075][25689] Fps is (10 sec: 5753.6, 60 sec: 5747.4, 300 sec: 5741.4). Total num frames: 66041856. Throughput: 0: 5923.5. Samples: 66044518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:19:16,076][25689] Avg episode reward: [(0, '-56.729')] [2022-07-09 03:19:16,738][26022] Updated weights on worker 0-0, policy_version 64497 (0.00085) [2022-07-09 03:19:18,196][26022] Updated weights on worker 0-0, policy_version 64507 (0.00087) [2022-07-09 03:19:20,076][26022] Updated weights on worker 0-0, policy_version 64517 (0.00084) [2022-07-09 03:19:21,206][25689] Fps is (10 sec: 6019.0, 60 sec: 5762.8, 300 sec: 5746.0). Total num frames: 66072576. Throughput: 0: 5901.2. Samples: 66078932. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:19:21,206][25689] Avg episode reward: [(0, '-56.102')] [2022-07-09 03:19:21,935][26022] Updated weights on worker 0-0, policy_version 64527 (0.00089) [2022-07-09 03:19:23,555][26022] Updated weights on worker 0-0, policy_version 64537 (0.00086) [2022-07-09 03:19:25,554][26022] Updated weights on worker 0-0, policy_version 64547 (0.00102) [2022-07-09 03:19:26,230][25689] Fps is (10 sec: 5747.6, 60 sec: 5727.8, 300 sec: 5739.2). Total num frames: 66100224. Throughput: 0: 5137.9. Samples: 66096268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 03:19:26,230][25689] Avg episode reward: [(0, '-56.262')] [2022-07-09 03:19:27,038][26022] Updated weights on worker 0-0, policy_version 64557 (0.00087) [2022-07-09 03:19:29,161][26022] Updated weights on worker 0-0, policy_version 64567 (0.00095) [2022-07-09 03:19:30,546][26022] Updated weights on worker 0-0, policy_version 64577 (0.00224) [2022-07-09 03:19:31,240][25689] Fps is (10 sec: 5613.0, 60 sec: 5728.6, 300 sec: 5736.1). Total num frames: 66128896. Throughput: 0: 6010.2. Samples: 66130964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:19:31,240][25689] Avg episode reward: [(0, '-56.709')] [2022-07-09 03:19:32,575][26022] Updated weights on worker 0-0, policy_version 64587 (0.00091) [2022-07-09 03:19:34,530][26022] Updated weights on worker 0-0, policy_version 64597 (0.00098) [2022-07-09 03:19:36,058][26022] Updated weights on worker 0-0, policy_version 64607 (0.00087) [2022-07-09 03:19:36,262][25689] Fps is (10 sec: 5818.3, 60 sec: 5744.6, 300 sec: 5740.0). Total num frames: 66158592. Throughput: 0: 5987.8. Samples: 66165394. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:19:36,262][25689] Avg episode reward: [(0, '-56.250')] [2022-07-09 03:19:37,837][26022] Updated weights on worker 0-0, policy_version 64617 (0.00087) [2022-07-09 03:19:39,747][26022] Updated weights on worker 0-0, policy_version 64627 (0.00107) [2022-07-09 03:19:41,328][25689] Fps is (10 sec: 5786.0, 60 sec: 5729.0, 300 sec: 5738.9). Total num frames: 66187264. Throughput: 0: 5149.2. Samples: 66182542. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:19:41,328][25689] Avg episode reward: [(0, '-55.715')] [2022-07-09 03:19:41,455][26022] Updated weights on worker 0-0, policy_version 64637 (0.00095) [2022-07-09 03:19:43,299][26022] Updated weights on worker 0-0, policy_version 64647 (0.00089) [2022-07-09 03:19:45,011][26022] Updated weights on worker 0-0, policy_version 64657 (0.00093) [2022-07-09 03:19:46,367][25689] Fps is (10 sec: 5674.4, 60 sec: 5747.8, 300 sec: 5733.0). Total num frames: 66215936. Throughput: 0: 6010.2. Samples: 66217300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:19:46,375][25689] Avg episode reward: [(0, '-55.992')] [2022-07-09 03:19:46,778][26022] Updated weights on worker 0-0, policy_version 64667 (0.00083) [2022-07-09 03:19:48,633][26022] Updated weights on worker 0-0, policy_version 64677 (0.00087) [2022-07-09 03:19:50,442][26022] Updated weights on worker 0-0, policy_version 64687 (0.00085) [2022-07-09 03:19:51,406][25689] Fps is (10 sec: 5791.1, 60 sec: 5728.9, 300 sec: 5736.4). Total num frames: 66245632. Throughput: 0: 6013.7. Samples: 66252240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:19:51,408][25689] Avg episode reward: [(0, '-56.291')] [2022-07-09 03:19:51,949][26022] Updated weights on worker 0-0, policy_version 64697 (0.00087) [2022-07-09 03:19:54,047][26022] Updated weights on worker 0-0, policy_version 64707 (0.00086) [2022-07-09 03:19:55,444][26022] Updated weights on worker 0-0, policy_version 64717 (0.00093) [2022-07-09 03:19:56,430][25689] Fps is (10 sec: 5800.6, 60 sec: 5744.3, 300 sec: 5741.2). Total num frames: 66274304. Throughput: 0: 5171.1. Samples: 66269686. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:19:56,432][25689] Avg episode reward: [(0, '-56.325')] [2022-07-09 03:19:57,494][26022] Updated weights on worker 0-0, policy_version 64727 (0.00080) [2022-07-09 03:19:59,343][26022] Updated weights on worker 0-0, policy_version 64737 (0.00090) [2022-07-09 03:20:00,954][26022] Updated weights on worker 0-0, policy_version 64747 (0.00088) [2022-07-09 03:20:01,470][25689] Fps is (10 sec: 5901.6, 60 sec: 5764.6, 300 sec: 5748.2). Total num frames: 66305024. Throughput: 0: 6055.8. Samples: 66304522. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:01,471][25689] Avg episode reward: [(0, '-56.737')] [2022-07-09 03:20:03,132][26022] Updated weights on worker 0-0, policy_version 64757 (0.00095) [2022-07-09 03:20:04,759][26022] Updated weights on worker 0-0, policy_version 64767 (0.00085) [2022-07-09 03:20:06,404][26022] Updated weights on worker 0-0, policy_version 64777 (0.00083) [2022-07-09 03:20:06,502][25689] Fps is (10 sec: 5693.3, 60 sec: 5747.9, 300 sec: 5741.0). Total num frames: 66331648. Throughput: 0: 5966.4. Samples: 66337432. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:06,503][25689] Avg episode reward: [(0, '-56.141')] [2022-07-09 03:20:08,410][26022] Updated weights on worker 0-0, policy_version 64787 (0.00101) [2022-07-09 03:20:09,987][26022] Updated weights on worker 0-0, policy_version 64797 (0.00087) [2022-07-09 03:20:11,507][25689] Fps is (10 sec: 5305.1, 60 sec: 5736.9, 300 sec: 5734.1). Total num frames: 66358272. Throughput: 0: 5095.7. Samples: 66354668. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:11,509][25689] Avg episode reward: [(0, '-56.412')] [2022-07-09 03:20:11,828][26022] Updated weights on worker 0-0, policy_version 64807 (0.00089) [2022-07-09 03:20:13,660][26022] Updated weights on worker 0-0, policy_version 64817 (0.00090) [2022-07-09 03:20:15,391][26022] Updated weights on worker 0-0, policy_version 64827 (0.00096) [2022-07-09 03:20:16,537][25689] Fps is (10 sec: 5612.3, 60 sec: 5724.4, 300 sec: 5737.8). Total num frames: 66387968. Throughput: 0: 5940.7. Samples: 66389138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:16,539][25689] Avg episode reward: [(0, '-56.447')] [2022-07-09 03:20:17,468][26022] Updated weights on worker 0-0, policy_version 64837 (0.00087) [2022-07-09 03:20:19,069][26022] Updated weights on worker 0-0, policy_version 64847 (0.00087) [2022-07-09 03:20:20,823][26022] Updated weights on worker 0-0, policy_version 64857 (0.00088) [2022-07-09 03:20:21,587][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:20:21,598][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000064861_66417664.pth [2022-07-09 03:20:21,598][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000062842_64350208.pth [2022-07-09 03:20:21,599][25689] Fps is (10 sec: 5885.2, 60 sec: 5714.0, 300 sec: 5730.9). Total num frames: 66417664. Throughput: 0: 5917.5. Samples: 66423634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:21,600][25689] Avg episode reward: [(0, '-56.357')] [2022-07-09 03:20:22,747][26022] Updated weights on worker 0-0, policy_version 64867 (0.00087) [2022-07-09 03:20:24,286][26022] Updated weights on worker 0-0, policy_version 64877 (0.00085) [2022-07-09 03:20:26,267][26022] Updated weights on worker 0-0, policy_version 64887 (0.00087) [2022-07-09 03:20:26,624][25689] Fps is (10 sec: 5888.1, 60 sec: 5747.8, 300 sec: 5737.9). Total num frames: 66447360. Throughput: 0: 5132.5. Samples: 66440708. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:26,624][25689] Avg episode reward: [(0, '-55.905')] [2022-07-09 03:20:27,942][26022] Updated weights on worker 0-0, policy_version 64897 (0.00090) [2022-07-09 03:20:29,847][26022] Updated weights on worker 0-0, policy_version 64907 (0.00093) [2022-07-09 03:20:31,467][26022] Updated weights on worker 0-0, policy_version 64917 (0.00083) [2022-07-09 03:20:31,724][25689] Fps is (10 sec: 5764.9, 60 sec: 5739.3, 300 sec: 5736.3). Total num frames: 66476032. Throughput: 0: 5965.3. Samples: 66475266. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:31,725][25689] Avg episode reward: [(0, '-55.447')] [2022-07-09 03:20:33,466][26022] Updated weights on worker 0-0, policy_version 64927 (0.00090) [2022-07-09 03:20:34,850][26022] Updated weights on worker 0-0, policy_version 64937 (0.00089) [2022-07-09 03:20:36,747][25689] Fps is (10 sec: 5664.9, 60 sec: 5722.3, 300 sec: 5734.7). Total num frames: 66504704. Throughput: 0: 5983.5. Samples: 66510062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:36,748][25689] Avg episode reward: [(0, '-55.628')] [2022-07-09 03:20:36,879][26022] Updated weights on worker 0-0, policy_version 64947 (0.00086) [2022-07-09 03:20:38,495][26022] Updated weights on worker 0-0, policy_version 64957 (0.00086) [2022-07-09 03:20:40,459][26022] Updated weights on worker 0-0, policy_version 64967 (0.00085) [2022-07-09 03:20:41,862][25689] Fps is (10 sec: 5757.2, 60 sec: 5734.5, 300 sec: 5740.0). Total num frames: 66534400. Throughput: 0: 5118.0. Samples: 66527340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 03:20:41,863][25689] Avg episode reward: [(0, '-55.719')] [2022-07-09 03:20:42,380][26022] Updated weights on worker 0-0, policy_version 64977 (0.00089) [2022-07-09 03:20:43,983][26022] Updated weights on worker 0-0, policy_version 64987 (0.00092) [2022-07-09 03:20:45,691][26022] Updated weights on worker 0-0, policy_version 64997 (0.00083) [2022-07-09 03:20:46,871][25689] Fps is (10 sec: 5866.4, 60 sec: 5754.4, 300 sec: 5736.7). Total num frames: 66564096. Throughput: 0: 5991.9. Samples: 66562024. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:20:46,873][25689] Avg episode reward: [(0, '-55.368')] [2022-07-09 03:20:47,687][26022] Updated weights on worker 0-0, policy_version 65007 (0.00091) [2022-07-09 03:20:49,176][26022] Updated weights on worker 0-0, policy_version 65017 (0.00091) [2022-07-09 03:20:51,234][26022] Updated weights on worker 0-0, policy_version 65027 (0.00105) [2022-07-09 03:20:51,911][25689] Fps is (10 sec: 5706.3, 60 sec: 5720.4, 300 sec: 5732.6). Total num frames: 66591744. Throughput: 0: 6006.1. Samples: 66596514. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:20:51,912][25689] Avg episode reward: [(0, '-55.341')] [2022-07-09 03:20:52,609][26022] Updated weights on worker 0-0, policy_version 65037 (0.00084) [2022-07-09 03:20:54,797][26022] Updated weights on worker 0-0, policy_version 65047 (0.00102) [2022-07-09 03:20:56,299][26022] Updated weights on worker 0-0, policy_version 65057 (0.00089) [2022-07-09 03:20:56,915][25689] Fps is (10 sec: 5709.4, 60 sec: 5739.2, 300 sec: 5737.3). Total num frames: 66621440. Throughput: 0: 5159.6. Samples: 66614120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:20:56,915][25689] Avg episode reward: [(0, '-56.189')] [2022-07-09 03:20:58,252][26022] Updated weights on worker 0-0, policy_version 65067 (0.00086) [2022-07-09 03:20:59,816][26022] Updated weights on worker 0-0, policy_version 65077 (0.00089) [2022-07-09 03:21:02,033][25689] Fps is (10 sec: 5564.4, 60 sec: 5664.2, 300 sec: 5735.1). Total num frames: 66648064. Throughput: 0: 6024.6. Samples: 66648860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:02,033][25689] Avg episode reward: [(0, '-55.378')] [2022-07-09 03:21:02,137][26022] Updated weights on worker 0-0, policy_version 65087 (0.00086) [2022-07-09 03:21:03,535][26022] Updated weights on worker 0-0, policy_version 65097 (0.00090) [2022-07-09 03:21:05,759][26022] Updated weights on worker 0-0, policy_version 65107 (0.00090) [2022-07-09 03:21:07,048][25689] Fps is (10 sec: 5658.8, 60 sec: 5733.4, 300 sec: 5742.5). Total num frames: 66678784. Throughput: 0: 5942.1. Samples: 66681920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:07,049][25689] Avg episode reward: [(0, '-55.753')] [2022-07-09 03:21:07,054][26022] Updated weights on worker 0-0, policy_version 65117 (0.00083) [2022-07-09 03:21:09,101][26022] Updated weights on worker 0-0, policy_version 65127 (0.00935) [2022-07-09 03:21:10,671][26022] Updated weights on worker 0-0, policy_version 65137 (0.00080) [2022-07-09 03:21:12,063][25689] Fps is (10 sec: 5819.4, 60 sec: 5749.4, 300 sec: 5738.8). Total num frames: 66706432. Throughput: 0: 5102.8. Samples: 66699342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:12,063][25689] Avg episode reward: [(0, '-56.294')] [2022-07-09 03:21:12,607][26022] Updated weights on worker 0-0, policy_version 65147 (0.00085) [2022-07-09 03:21:14,551][26022] Updated weights on worker 0-0, policy_version 65157 (0.00087) [2022-07-09 03:21:15,965][26022] Updated weights on worker 0-0, policy_version 65167 (0.00087) [2022-07-09 03:21:17,064][25689] Fps is (10 sec: 5623.0, 60 sec: 5735.2, 300 sec: 5737.8). Total num frames: 66735104. Throughput: 0: 5953.9. Samples: 66734088. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:17,065][25689] Avg episode reward: [(0, '-55.725')] [2022-07-09 03:21:18,009][26022] Updated weights on worker 0-0, policy_version 65177 (0.00095) [2022-07-09 03:21:19,429][26022] Updated weights on worker 0-0, policy_version 65187 (0.00086) [2022-07-09 03:21:21,416][26022] Updated weights on worker 0-0, policy_version 65197 (0.00089) [2022-07-09 03:21:22,187][25689] Fps is (10 sec: 5866.3, 60 sec: 5746.4, 300 sec: 5735.6). Total num frames: 66765824. Throughput: 0: 5973.3. Samples: 66769248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:22,188][25689] Avg episode reward: [(0, '-55.619')] [2022-07-09 03:21:23,020][26022] Updated weights on worker 0-0, policy_version 65207 (0.00088) [2022-07-09 03:21:24,858][26022] Updated weights on worker 0-0, policy_version 65217 (0.00088) [2022-07-09 03:21:26,693][26022] Updated weights on worker 0-0, policy_version 65227 (0.00095) [2022-07-09 03:21:27,279][25689] Fps is (10 sec: 5914.4, 60 sec: 5740.0, 300 sec: 5737.8). Total num frames: 66795520. Throughput: 0: 6041.4. Samples: 66804144. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:27,280][25689] Avg episode reward: [(0, '-55.645')] [2022-07-09 03:21:28,324][26022] Updated weights on worker 0-0, policy_version 65237 (0.00088) [2022-07-09 03:21:30,038][26022] Updated weights on worker 0-0, policy_version 65247 (0.00079) [2022-07-09 03:21:31,982][26022] Updated weights on worker 0-0, policy_version 65257 (0.00081) [2022-07-09 03:21:32,287][25689] Fps is (10 sec: 5778.9, 60 sec: 5748.7, 300 sec: 5737.7). Total num frames: 66824192. Throughput: 0: 6040.8. Samples: 66821514. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:32,288][25689] Avg episode reward: [(0, '-54.648')] [2022-07-09 03:21:33,646][26022] Updated weights on worker 0-0, policy_version 65267 (0.00072) [2022-07-09 03:21:35,551][26022] Updated weights on worker 0-0, policy_version 65277 (0.00099) [2022-07-09 03:21:37,085][26022] Updated weights on worker 0-0, policy_version 65287 (0.00089) [2022-07-09 03:21:37,337][25689] Fps is (10 sec: 5803.1, 60 sec: 5763.0, 300 sec: 5741.5). Total num frames: 66853888. Throughput: 0: 6043.7. Samples: 66856614. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:37,338][25689] Avg episode reward: [(0, '-54.412')] [2022-07-09 03:21:38,970][26022] Updated weights on worker 0-0, policy_version 65297 (0.00097) [2022-07-09 03:21:40,867][26022] Updated weights on worker 0-0, policy_version 65307 (0.00092) [2022-07-09 03:21:42,413][25689] Fps is (10 sec: 5865.7, 60 sec: 5766.8, 300 sec: 5740.2). Total num frames: 66883584. Throughput: 0: 6036.6. Samples: 66891342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:42,413][25689] Avg episode reward: [(0, '-54.913')] [2022-07-09 03:21:42,543][26022] Updated weights on worker 0-0, policy_version 65317 (0.00088) [2022-07-09 03:21:44,334][26022] Updated weights on worker 0-0, policy_version 65327 (0.00083) [2022-07-09 03:21:45,911][26022] Updated weights on worker 0-0, policy_version 65337 (0.00091) [2022-07-09 03:21:47,465][25689] Fps is (10 sec: 5864.3, 60 sec: 5762.6, 300 sec: 5739.6). Total num frames: 66913280. Throughput: 0: 5187.9. Samples: 66908866. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:47,466][25689] Avg episode reward: [(0, '-54.666')] [2022-07-09 03:21:47,817][26022] Updated weights on worker 0-0, policy_version 65347 (0.00085) [2022-07-09 03:21:49,710][26022] Updated weights on worker 0-0, policy_version 65357 (0.00086) [2022-07-09 03:21:51,434][26022] Updated weights on worker 0-0, policy_version 65367 (0.00091) [2022-07-09 03:21:52,512][25689] Fps is (10 sec: 5880.6, 60 sec: 5795.8, 300 sec: 5745.7). Total num frames: 66942976. Throughput: 0: 6069.9. Samples: 66944278. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:52,513][25689] Avg episode reward: [(0, '-54.380')] [2022-07-09 03:21:52,944][26022] Updated weights on worker 0-0, policy_version 65377 (0.00089) [2022-07-09 03:21:54,965][26022] Updated weights on worker 0-0, policy_version 65387 (0.00081) [2022-07-09 03:21:56,455][26022] Updated weights on worker 0-0, policy_version 65397 (0.00085) [2022-07-09 03:21:57,545][25689] Fps is (10 sec: 5790.7, 60 sec: 5776.1, 300 sec: 5745.7). Total num frames: 66971648. Throughput: 0: 6057.4. Samples: 66979018. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:21:57,546][25689] Avg episode reward: [(0, '-54.427')] [2022-07-09 03:21:58,491][26022] Updated weights on worker 0-0, policy_version 65407 (0.00084) [2022-07-09 03:22:00,037][26022] Updated weights on worker 0-0, policy_version 65417 (0.00085) [2022-07-09 03:22:02,340][26022] Updated weights on worker 0-0, policy_version 65427 (0.00086) [2022-07-09 03:22:02,629][25689] Fps is (10 sec: 5566.7, 60 sec: 5796.2, 300 sec: 5741.0). Total num frames: 66999296. Throughput: 0: 5193.3. Samples: 66996334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 03:22:02,630][25689] Avg episode reward: [(0, '-55.159')] [2022-07-09 03:22:04,008][26022] Updated weights on worker 0-0, policy_version 65437 (0.00090) [2022-07-09 03:22:05,798][26022] Updated weights on worker 0-0, policy_version 65447 (0.00092) [2022-07-09 03:22:07,675][25689] Fps is (10 sec: 5559.3, 60 sec: 5759.5, 300 sec: 5743.7). Total num frames: 67027968. Throughput: 0: 5959.5. Samples: 67029310. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:07,676][25689] Avg episode reward: [(0, '-54.947')] [2022-07-09 03:22:07,679][26022] Updated weights on worker 0-0, policy_version 65457 (0.00086) [2022-07-09 03:22:09,451][26022] Updated weights on worker 0-0, policy_version 65467 (0.00625) [2022-07-09 03:22:11,147][26022] Updated weights on worker 0-0, policy_version 65477 (0.00087) [2022-07-09 03:22:12,699][25689] Fps is (10 sec: 5695.0, 60 sec: 5775.6, 300 sec: 5736.5). Total num frames: 67056640. Throughput: 0: 5937.5. Samples: 67064136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:12,699][25689] Avg episode reward: [(0, '-54.815')] [2022-07-09 03:22:12,922][26022] Updated weights on worker 0-0, policy_version 65487 (0.00050) [2022-07-09 03:22:14,622][26022] Updated weights on worker 0-0, policy_version 65497 (0.00094) [2022-07-09 03:22:16,368][26022] Updated weights on worker 0-0, policy_version 65507 (0.00095) [2022-07-09 03:22:17,763][25689] Fps is (10 sec: 5887.8, 60 sec: 5803.4, 300 sec: 5754.2). Total num frames: 67087360. Throughput: 0: 5086.1. Samples: 67081848. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:17,763][25689] Avg episode reward: [(0, '-55.409')] [2022-07-09 03:22:18,090][26022] Updated weights on worker 0-0, policy_version 65517 (0.00091) [2022-07-09 03:22:19,825][26022] Updated weights on worker 0-0, policy_version 65527 (0.00088) [2022-07-09 03:22:21,690][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:22:21,707][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000065537_67109888.pth [2022-07-09 03:22:21,708][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000063515_65039360.pth [2022-07-09 03:22:21,712][26022] Updated weights on worker 0-0, policy_version 65537 (0.00086) [2022-07-09 03:22:22,890][25689] Fps is (10 sec: 5928.0, 60 sec: 5786.0, 300 sec: 5745.2). Total num frames: 67117056. Throughput: 0: 5962.1. Samples: 67117132. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:22,891][25689] Avg episode reward: [(0, '-55.405')] [2022-07-09 03:22:23,341][26022] Updated weights on worker 0-0, policy_version 65547 (0.00087) [2022-07-09 03:22:25,092][26022] Updated weights on worker 0-0, policy_version 65557 (0.00088) [2022-07-09 03:22:26,802][26022] Updated weights on worker 0-0, policy_version 65567 (0.00082) [2022-07-09 03:22:27,955][25689] Fps is (10 sec: 5826.9, 60 sec: 5788.6, 300 sec: 5751.5). Total num frames: 67146752. Throughput: 0: 6047.6. Samples: 67151954. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:27,956][25689] Avg episode reward: [(0, '-55.225')] [2022-07-09 03:22:28,584][26022] Updated weights on worker 0-0, policy_version 65577 (0.00092) [2022-07-09 03:22:30,469][26022] Updated weights on worker 0-0, policy_version 65587 (0.00090) [2022-07-09 03:22:32,027][26022] Updated weights on worker 0-0, policy_version 65597 (0.00086) [2022-07-09 03:22:33,011][25689] Fps is (10 sec: 5767.2, 60 sec: 5784.1, 300 sec: 5753.9). Total num frames: 67175424. Throughput: 0: 5185.8. Samples: 67169472. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:33,011][25689] Avg episode reward: [(0, '-55.535')] [2022-07-09 03:22:33,863][26022] Updated weights on worker 0-0, policy_version 65607 (0.00084) [2022-07-09 03:22:35,606][26022] Updated weights on worker 0-0, policy_version 65617 (0.00090) [2022-07-09 03:22:37,347][26022] Updated weights on worker 0-0, policy_version 65627 (0.00082) [2022-07-09 03:22:38,027][25689] Fps is (10 sec: 5795.2, 60 sec: 5787.3, 300 sec: 5751.3). Total num frames: 67205120. Throughput: 0: 6064.6. Samples: 67204744. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:38,028][25689] Avg episode reward: [(0, '-54.696')] [2022-07-09 03:22:39,128][26022] Updated weights on worker 0-0, policy_version 65637 (0.00083) [2022-07-09 03:22:40,907][26022] Updated weights on worker 0-0, policy_version 65647 (0.00084) [2022-07-09 03:22:42,718][26022] Updated weights on worker 0-0, policy_version 65657 (0.00086) [2022-07-09 03:22:43,107][25689] Fps is (10 sec: 5883.0, 60 sec: 5786.9, 300 sec: 5753.5). Total num frames: 67234816. Throughput: 0: 6042.8. Samples: 67239294. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:43,107][25689] Avg episode reward: [(0, '-55.561')] [2022-07-09 03:22:44,550][26022] Updated weights on worker 0-0, policy_version 65667 (0.00095) [2022-07-09 03:22:46,258][26022] Updated weights on worker 0-0, policy_version 65677 (0.00081) [2022-07-09 03:22:48,116][25689] Fps is (10 sec: 5785.2, 60 sec: 5774.1, 300 sec: 5754.0). Total num frames: 67263488. Throughput: 0: 5197.1. Samples: 67256732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:48,117][25689] Avg episode reward: [(0, '-55.225')] [2022-07-09 03:22:48,128][26022] Updated weights on worker 0-0, policy_version 65687 (0.00094) [2022-07-09 03:22:49,797][26022] Updated weights on worker 0-0, policy_version 65697 (0.00088) [2022-07-09 03:22:51,681][26022] Updated weights on worker 0-0, policy_version 65707 (0.00091) [2022-07-09 03:22:53,150][25689] Fps is (10 sec: 5811.3, 60 sec: 5775.4, 300 sec: 5750.1). Total num frames: 67293184. Throughput: 0: 6047.6. Samples: 67291268. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:53,151][25689] Avg episode reward: [(0, '-55.537')] [2022-07-09 03:22:53,301][26022] Updated weights on worker 0-0, policy_version 65717 (0.00089) [2022-07-09 03:22:55,240][26022] Updated weights on worker 0-0, policy_version 65727 (0.00085) [2022-07-09 03:22:57,063][26022] Updated weights on worker 0-0, policy_version 65737 (0.00086) [2022-07-09 03:22:58,156][25689] Fps is (10 sec: 5609.8, 60 sec: 5744.1, 300 sec: 5751.6). Total num frames: 67319808. Throughput: 0: 6013.0. Samples: 67325780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:22:58,157][25689] Avg episode reward: [(0, '-55.282')] [2022-07-09 03:22:58,868][26022] Updated weights on worker 0-0, policy_version 65747 (0.00087) [2022-07-09 03:23:00,664][26022] Updated weights on worker 0-0, policy_version 65757 (0.00091) [2022-07-09 03:23:02,782][26022] Updated weights on worker 0-0, policy_version 65767 (0.00092) [2022-07-09 03:23:03,239][25689] Fps is (10 sec: 5379.6, 60 sec: 5744.3, 300 sec: 5747.6). Total num frames: 67347456. Throughput: 0: 5151.9. Samples: 67343014. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:23:03,239][25689] Avg episode reward: [(0, '-55.351')] [2022-07-09 03:23:04,490][26022] Updated weights on worker 0-0, policy_version 65777 (0.00117) [2022-07-09 03:23:06,269][26022] Updated weights on worker 0-0, policy_version 65787 (0.00082) [2022-07-09 03:23:07,990][26022] Updated weights on worker 0-0, policy_version 65797 (0.00085) [2022-07-09 03:23:08,287][25689] Fps is (10 sec: 5559.2, 60 sec: 5744.1, 300 sec: 5748.0). Total num frames: 67376128. Throughput: 0: 5897.0. Samples: 67375678. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:23:08,288][25689] Avg episode reward: [(0, '-55.312')] [2022-07-09 03:23:09,831][26022] Updated weights on worker 0-0, policy_version 65807 (0.00086) [2022-07-09 03:23:11,691][26022] Updated weights on worker 0-0, policy_version 65817 (0.00094) [2022-07-09 03:23:13,319][25689] Fps is (10 sec: 5891.8, 60 sec: 5777.1, 300 sec: 5752.3). Total num frames: 67406848. Throughput: 0: 5897.0. Samples: 67410204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:23:13,320][25689] Avg episode reward: [(0, '-54.695')] [2022-07-09 03:23:13,322][26022] Updated weights on worker 0-0, policy_version 65827 (0.00085) [2022-07-09 03:23:15,297][26022] Updated weights on worker 0-0, policy_version 65837 (0.00088) [2022-07-09 03:23:16,893][26022] Updated weights on worker 0-0, policy_version 65847 (0.00053) [2022-07-09 03:23:18,360][25689] Fps is (10 sec: 5896.0, 60 sec: 5745.4, 300 sec: 5750.3). Total num frames: 67435520. Throughput: 0: 5910.1. Samples: 67445190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:23:18,361][25689] Avg episode reward: [(0, '-55.119')] [2022-07-09 03:23:18,659][26022] Updated weights on worker 0-0, policy_version 65857 (0.00093) [2022-07-09 03:23:20,628][26022] Updated weights on worker 0-0, policy_version 65867 (0.00095) [2022-07-09 03:23:22,295][26022] Updated weights on worker 0-0, policy_version 65877 (0.00055) [2022-07-09 03:23:23,403][25689] Fps is (10 sec: 5788.3, 60 sec: 5753.5, 300 sec: 5749.7). Total num frames: 67465216. Throughput: 0: 5924.7. Samples: 67462480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 03:23:23,404][25689] Avg episode reward: [(0, '-55.203')] [2022-07-09 03:23:24,177][26022] Updated weights on worker 0-0, policy_version 65887 (0.00088) [2022-07-09 03:23:25,760][26022] Updated weights on worker 0-0, policy_version 65897 (0.00092) [2022-07-09 03:23:27,447][26022] Updated weights on worker 0-0, policy_version 65907 (0.00082) [2022-07-09 03:23:28,442][25689] Fps is (10 sec: 5789.5, 60 sec: 5739.1, 300 sec: 5749.3). Total num frames: 67493888. Throughput: 0: 6034.4. Samples: 67497302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:28,443][25689] Avg episode reward: [(0, '-55.699')] [2022-07-09 03:23:29,176][26022] Updated weights on worker 0-0, policy_version 65917 (0.00091) [2022-07-09 03:23:31,040][26022] Updated weights on worker 0-0, policy_version 65927 (0.00090) [2022-07-09 03:23:32,832][26022] Updated weights on worker 0-0, policy_version 65937 (0.00053) [2022-07-09 03:23:33,455][25689] Fps is (10 sec: 5806.9, 60 sec: 5760.1, 300 sec: 5752.7). Total num frames: 67523584. Throughput: 0: 6062.5. Samples: 67532274. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:33,455][25689] Avg episode reward: [(0, '-55.181')] [2022-07-09 03:23:34,581][26022] Updated weights on worker 0-0, policy_version 65947 (0.00082) [2022-07-09 03:23:36,214][26022] Updated weights on worker 0-0, policy_version 65957 (0.00083) [2022-07-09 03:23:38,126][26022] Updated weights on worker 0-0, policy_version 65967 (0.00094) [2022-07-09 03:23:38,460][25689] Fps is (10 sec: 5723.8, 60 sec: 5727.2, 300 sec: 5747.2). Total num frames: 67551232. Throughput: 0: 5216.4. Samples: 67550040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:38,461][25689] Avg episode reward: [(0, '-55.342')] [2022-07-09 03:23:39,663][26022] Updated weights on worker 0-0, policy_version 65977 (0.00091) [2022-07-09 03:23:41,548][26022] Updated weights on worker 0-0, policy_version 65987 (0.00090) [2022-07-09 03:23:43,079][26022] Updated weights on worker 0-0, policy_version 65997 (0.00079) [2022-07-09 03:23:43,585][25689] Fps is (10 sec: 5761.7, 60 sec: 5739.8, 300 sec: 5756.3). Total num frames: 67581952. Throughput: 0: 6087.4. Samples: 67585336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:43,585][25689] Avg episode reward: [(0, '-55.711')] [2022-07-09 03:23:44,967][26022] Updated weights on worker 0-0, policy_version 66007 (0.00089) [2022-07-09 03:23:46,710][26022] Updated weights on worker 0-0, policy_version 66017 (0.00100) [2022-07-09 03:23:48,454][26022] Updated weights on worker 0-0, policy_version 66027 (0.00049) [2022-07-09 03:23:48,616][25689] Fps is (10 sec: 6049.7, 60 sec: 5771.6, 300 sec: 5756.1). Total num frames: 67612672. Throughput: 0: 6099.3. Samples: 67620352. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:48,617][25689] Avg episode reward: [(0, '-55.818')] [2022-07-09 03:23:50,274][26022] Updated weights on worker 0-0, policy_version 66037 (0.00092) [2022-07-09 03:23:52,096][26022] Updated weights on worker 0-0, policy_version 66047 (0.00083) [2022-07-09 03:23:53,659][25689] Fps is (10 sec: 5895.3, 60 sec: 5753.9, 300 sec: 5758.8). Total num frames: 67641344. Throughput: 0: 5207.3. Samples: 67637486. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:53,660][25689] Avg episode reward: [(0, '-55.206')] [2022-07-09 03:23:53,765][26022] Updated weights on worker 0-0, policy_version 66057 (0.00093) [2022-07-09 03:23:55,539][26022] Updated weights on worker 0-0, policy_version 66067 (0.00085) [2022-07-09 03:23:57,325][26022] Updated weights on worker 0-0, policy_version 66077 (0.00080) [2022-07-09 03:23:58,686][25689] Fps is (10 sec: 5694.5, 60 sec: 5785.7, 300 sec: 5756.3). Total num frames: 67670016. Throughput: 0: 6056.2. Samples: 67672532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:23:58,687][25689] Avg episode reward: [(0, '-55.140')] [2022-07-09 03:23:59,258][26022] Updated weights on worker 0-0, policy_version 66087 (0.00085) [2022-07-09 03:24:00,963][26022] Updated weights on worker 0-0, policy_version 66097 (0.00084) [2022-07-09 03:24:03,138][26022] Updated weights on worker 0-0, policy_version 66107 (0.00093) [2022-07-09 03:24:03,819][25689] Fps is (10 sec: 5543.6, 60 sec: 5780.9, 300 sec: 5754.4). Total num frames: 67697664. Throughput: 0: 5921.6. Samples: 67705152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:03,819][25689] Avg episode reward: [(0, '-55.344')] [2022-07-09 03:24:04,767][26022] Updated weights on worker 0-0, policy_version 66117 (0.00085) [2022-07-09 03:24:06,457][26022] Updated weights on worker 0-0, policy_version 66127 (0.00085) [2022-07-09 03:24:08,224][26022] Updated weights on worker 0-0, policy_version 66137 (0.00088) [2022-07-09 03:24:08,849][25689] Fps is (10 sec: 5743.5, 60 sec: 5816.5, 300 sec: 5765.5). Total num frames: 67728384. Throughput: 0: 5058.0. Samples: 67722688. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:08,849][25689] Avg episode reward: [(0, '-54.680')] [2022-07-09 03:24:10,240][26022] Updated weights on worker 0-0, policy_version 66147 (0.00618) [2022-07-09 03:24:11,699][26022] Updated weights on worker 0-0, policy_version 66157 (0.00090) [2022-07-09 03:24:13,705][26022] Updated weights on worker 0-0, policy_version 66167 (0.00086) [2022-07-09 03:24:13,852][25689] Fps is (10 sec: 5715.4, 60 sec: 5751.6, 300 sec: 5753.1). Total num frames: 67755008. Throughput: 0: 5959.8. Samples: 67757830. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:13,853][25689] Avg episode reward: [(0, '-54.639')] [2022-07-09 03:24:15,274][26022] Updated weights on worker 0-0, policy_version 66177 (0.00088) [2022-07-09 03:24:17,150][26022] Updated weights on worker 0-0, policy_version 66187 (0.00102) [2022-07-09 03:24:18,864][25689] Fps is (10 sec: 5725.6, 60 sec: 5788.2, 300 sec: 5755.4). Total num frames: 67785728. Throughput: 0: 5949.7. Samples: 67792584. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:18,865][25689] Avg episode reward: [(0, '-54.168')] [2022-07-09 03:24:18,870][26022] Updated weights on worker 0-0, policy_version 66197 (0.00327) [2022-07-09 03:24:20,521][26022] Updated weights on worker 0-0, policy_version 66207 (0.00055) [2022-07-09 03:24:21,767][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:24:21,787][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000066213_67802112.pth [2022-07-09 03:24:21,788][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000064187_65727488.pth [2022-07-09 03:24:22,563][26022] Updated weights on worker 0-0, policy_version 66217 (0.00088) [2022-07-09 03:24:23,930][25689] Fps is (10 sec: 5994.6, 60 sec: 5785.9, 300 sec: 5761.5). Total num frames: 67815424. Throughput: 0: 5222.8. Samples: 67810192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:23,931][25689] Avg episode reward: [(0, '-54.843')] [2022-07-09 03:24:24,068][26022] Updated weights on worker 0-0, policy_version 66227 (0.00083) [2022-07-09 03:24:25,865][26022] Updated weights on worker 0-0, policy_version 66237 (0.00092) [2022-07-09 03:24:27,679][26022] Updated weights on worker 0-0, policy_version 66247 (0.00095) [2022-07-09 03:24:28,973][25689] Fps is (10 sec: 5672.5, 60 sec: 5768.7, 300 sec: 5757.4). Total num frames: 67843072. Throughput: 0: 6082.3. Samples: 67845090. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:28,974][25689] Avg episode reward: [(0, '-56.083')] [2022-07-09 03:24:29,311][26022] Updated weights on worker 0-0, policy_version 66257 (0.00084) [2022-07-09 03:24:31,386][26022] Updated weights on worker 0-0, policy_version 66267 (0.00088) [2022-07-09 03:24:32,777][26022] Updated weights on worker 0-0, policy_version 66277 (0.00085) [2022-07-09 03:24:33,987][25689] Fps is (10 sec: 5701.9, 60 sec: 5768.5, 300 sec: 5757.6). Total num frames: 67872768. Throughput: 0: 6066.9. Samples: 67879990. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:33,988][25689] Avg episode reward: [(0, '-55.703')] [2022-07-09 03:24:34,832][26022] Updated weights on worker 0-0, policy_version 66287 (0.00088) [2022-07-09 03:24:36,516][26022] Updated weights on worker 0-0, policy_version 66297 (0.00089) [2022-07-09 03:24:38,273][26022] Updated weights on worker 0-0, policy_version 66307 (0.00083) [2022-07-09 03:24:39,010][25689] Fps is (10 sec: 5713.3, 60 sec: 5766.9, 300 sec: 5755.0). Total num frames: 67900416. Throughput: 0: 5202.2. Samples: 67897388. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:39,011][25689] Avg episode reward: [(0, '-55.848')] [2022-07-09 03:24:40,026][26022] Updated weights on worker 0-0, policy_version 66317 (0.00086) [2022-07-09 03:24:41,854][26022] Updated weights on worker 0-0, policy_version 66327 (0.00081) [2022-07-09 03:24:43,623][26022] Updated weights on worker 0-0, policy_version 66337 (0.00094) [2022-07-09 03:24:44,065][25689] Fps is (10 sec: 5893.6, 60 sec: 5790.5, 300 sec: 5765.0). Total num frames: 67932160. Throughput: 0: 6070.7. Samples: 67932422. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:44,065][25689] Avg episode reward: [(0, '-55.618')] [2022-07-09 03:24:45,367][26022] Updated weights on worker 0-0, policy_version 66347 (0.00085) [2022-07-09 03:24:47,039][26022] Updated weights on worker 0-0, policy_version 66358 (0.00088) [2022-07-09 03:24:48,846][26022] Updated weights on worker 0-0, policy_version 66368 (0.00081) [2022-07-09 03:24:49,091][25689] Fps is (10 sec: 5992.9, 60 sec: 5757.1, 300 sec: 5761.8). Total num frames: 67960832. Throughput: 0: 6085.5. Samples: 67967520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:24:49,092][25689] Avg episode reward: [(0, '-55.280')] [2022-07-09 03:24:50,743][26022] Updated weights on worker 0-0, policy_version 66378 (0.00097) [2022-07-09 03:24:52,543][26022] Updated weights on worker 0-0, policy_version 66388 (0.00083) [2022-07-09 03:24:54,098][25689] Fps is (10 sec: 5817.3, 60 sec: 5777.5, 300 sec: 5765.6). Total num frames: 67990528. Throughput: 0: 5222.3. Samples: 67985014. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:24:54,100][25689] Avg episode reward: [(0, '-55.376')] [2022-07-09 03:24:54,110][26022] Updated weights on worker 0-0, policy_version 66398 (0.00088) [2022-07-09 03:24:55,938][26022] Updated weights on worker 0-0, policy_version 66408 (0.00087) [2022-07-09 03:24:57,627][26022] Updated weights on worker 0-0, policy_version 66418 (0.00081) [2022-07-09 03:24:59,114][25689] Fps is (10 sec: 6027.8, 60 sec: 5812.4, 300 sec: 5766.0). Total num frames: 68021248. Throughput: 0: 6132.6. Samples: 68020678. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:24:59,115][25689] Avg episode reward: [(0, '-55.560')] [2022-07-09 03:24:59,270][26022] Updated weights on worker 0-0, policy_version 66428 (0.00089) [2022-07-09 03:25:01,199][26022] Updated weights on worker 0-0, policy_version 66438 (0.00100) [2022-07-09 03:25:03,244][26022] Updated weights on worker 0-0, policy_version 66448 (0.00087) [2022-07-09 03:25:04,187][25689] Fps is (10 sec: 5582.6, 60 sec: 5784.3, 300 sec: 5761.8). Total num frames: 68046848. Throughput: 0: 6021.0. Samples: 68053576. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:04,187][25689] Avg episode reward: [(0, '-54.877')] [2022-07-09 03:25:05,107][26022] Updated weights on worker 0-0, policy_version 66458 (0.00081) [2022-07-09 03:25:06,572][26022] Updated weights on worker 0-0, policy_version 66468 (0.00087) [2022-07-09 03:25:08,539][26022] Updated weights on worker 0-0, policy_version 66478 (0.00085) [2022-07-09 03:25:09,194][25689] Fps is (10 sec: 5587.7, 60 sec: 5786.5, 300 sec: 5775.5). Total num frames: 68077568. Throughput: 0: 5145.4. Samples: 68070954. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:09,194][25689] Avg episode reward: [(0, '-55.569')] [2022-07-09 03:25:10,246][26022] Updated weights on worker 0-0, policy_version 66488 (0.00083) [2022-07-09 03:25:11,962][26022] Updated weights on worker 0-0, policy_version 66498 (0.00079) [2022-07-09 03:25:14,068][26022] Updated weights on worker 0-0, policy_version 66508 (0.00087) [2022-07-09 03:25:14,204][25689] Fps is (10 sec: 5826.7, 60 sec: 5802.8, 300 sec: 5769.0). Total num frames: 68105216. Throughput: 0: 6039.5. Samples: 68106442. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:14,204][25689] Avg episode reward: [(0, '-56.514')] [2022-07-09 03:25:15,472][26022] Updated weights on worker 0-0, policy_version 66518 (0.00083) [2022-07-09 03:25:17,383][26022] Updated weights on worker 0-0, policy_version 66528 (0.00085) [2022-07-09 03:25:18,912][26022] Updated weights on worker 0-0, policy_version 66538 (0.00084) [2022-07-09 03:25:19,217][25689] Fps is (10 sec: 5823.1, 60 sec: 5802.7, 300 sec: 5773.4). Total num frames: 68135936. Throughput: 0: 6010.0. Samples: 68141496. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:19,218][25689] Avg episode reward: [(0, '-55.965')] [2022-07-09 03:25:20,830][26022] Updated weights on worker 0-0, policy_version 66548 (0.00093) [2022-07-09 03:25:22,733][26022] Updated weights on worker 0-0, policy_version 66558 (0.00092) [2022-07-09 03:25:24,279][25689] Fps is (10 sec: 5894.9, 60 sec: 5786.1, 300 sec: 5769.3). Total num frames: 68164608. Throughput: 0: 5236.6. Samples: 68158792. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:24,280][25689] Avg episode reward: [(0, '-55.796')] [2022-07-09 03:25:24,468][26022] Updated weights on worker 0-0, policy_version 66568 (0.00085) [2022-07-09 03:25:26,161][26022] Updated weights on worker 0-0, policy_version 66578 (0.00097) [2022-07-09 03:25:27,904][26022] Updated weights on worker 0-0, policy_version 66588 (0.00096) [2022-07-09 03:25:29,309][25689] Fps is (10 sec: 5681.9, 60 sec: 5804.3, 300 sec: 5770.6). Total num frames: 68193280. Throughput: 0: 6105.8. Samples: 68193778. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:29,310][25689] Avg episode reward: [(0, '-56.726')] [2022-07-09 03:25:29,688][26022] Updated weights on worker 0-0, policy_version 66598 (0.00082) [2022-07-09 03:25:31,463][26022] Updated weights on worker 0-0, policy_version 66608 (0.00082) [2022-07-09 03:25:33,156][26022] Updated weights on worker 0-0, policy_version 66618 (0.00094) [2022-07-09 03:25:34,316][25689] Fps is (10 sec: 5815.2, 60 sec: 5805.0, 300 sec: 5774.4). Total num frames: 68222976. Throughput: 0: 6077.2. Samples: 68228668. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:34,318][25689] Avg episode reward: [(0, '-55.844')] [2022-07-09 03:25:34,874][26022] Updated weights on worker 0-0, policy_version 66628 (0.00080) [2022-07-09 03:25:36,608][26022] Updated weights on worker 0-0, policy_version 66638 (0.00087) [2022-07-09 03:25:38,555][26022] Updated weights on worker 0-0, policy_version 66648 (0.00097) [2022-07-09 03:25:39,325][25689] Fps is (10 sec: 5929.8, 60 sec: 5840.3, 300 sec: 5776.4). Total num frames: 68252672. Throughput: 0: 5202.2. Samples: 68246104. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:39,326][25689] Avg episode reward: [(0, '-56.627')] [2022-07-09 03:25:40,433][26022] Updated weights on worker 0-0, policy_version 66658 (0.00085) [2022-07-09 03:25:42,131][26022] Updated weights on worker 0-0, policy_version 66668 (0.00086) [2022-07-09 03:25:43,923][26022] Updated weights on worker 0-0, policy_version 66678 (0.00086) [2022-07-09 03:25:44,375][25689] Fps is (10 sec: 5802.5, 60 sec: 5789.8, 300 sec: 5772.2). Total num frames: 68281344. Throughput: 0: 6082.4. Samples: 68281024. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:44,376][25689] Avg episode reward: [(0, '-56.486')] [2022-07-09 03:25:45,436][26022] Updated weights on worker 0-0, policy_version 66688 (0.00092) [2022-07-09 03:25:47,129][26022] Updated weights on worker 0-0, policy_version 66698 (0.00092) [2022-07-09 03:25:48,967][26022] Updated weights on worker 0-0, policy_version 66708 (0.00093) [2022-07-09 03:25:49,394][25689] Fps is (10 sec: 5797.3, 60 sec: 5807.6, 300 sec: 5779.5). Total num frames: 68311040. Throughput: 0: 6087.2. Samples: 68316034. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:49,394][25689] Avg episode reward: [(0, '-56.788')] [2022-07-09 03:25:50,850][26022] Updated weights on worker 0-0, policy_version 66718 (0.00090) [2022-07-09 03:25:52,509][26022] Updated weights on worker 0-0, policy_version 66728 (0.00087) [2022-07-09 03:25:54,303][26022] Updated weights on worker 0-0, policy_version 66738 (0.00082) [2022-07-09 03:25:54,403][25689] Fps is (10 sec: 5820.8, 60 sec: 5790.4, 300 sec: 5775.9). Total num frames: 68339712. Throughput: 0: 5219.1. Samples: 68333502. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:54,403][25689] Avg episode reward: [(0, '-56.625')] [2022-07-09 03:25:55,981][26022] Updated weights on worker 0-0, policy_version 66748 (0.00086) [2022-07-09 03:25:58,090][26022] Updated weights on worker 0-0, policy_version 66758 (0.00093) [2022-07-09 03:25:59,420][25689] Fps is (10 sec: 5821.3, 60 sec: 5773.2, 300 sec: 5788.2). Total num frames: 68369408. Throughput: 0: 6090.0. Samples: 68368482. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:25:59,422][25689] Avg episode reward: [(0, '-57.391')] [2022-07-09 03:25:59,516][26022] Updated weights on worker 0-0, policy_version 66768 (0.00098) [2022-07-09 03:26:02,088][26022] Updated weights on worker 0-0, policy_version 66778 (0.00097) [2022-07-09 03:26:03,432][26022] Updated weights on worker 0-0, policy_version 66788 (0.00085) [2022-07-09 03:26:04,505][25689] Fps is (10 sec: 5574.9, 60 sec: 5789.0, 300 sec: 5773.1). Total num frames: 68396032. Throughput: 0: 5933.7. Samples: 68400468. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:26:04,506][25689] Avg episode reward: [(0, '-56.664')] [2022-07-09 03:26:05,624][26022] Updated weights on worker 0-0, policy_version 66798 (0.00058) [2022-07-09 03:26:07,164][26022] Updated weights on worker 0-0, policy_version 66808 (0.00082) [2022-07-09 03:26:09,045][26022] Updated weights on worker 0-0, policy_version 66818 (0.00093) [2022-07-09 03:26:09,514][25689] Fps is (10 sec: 5478.4, 60 sec: 5754.9, 300 sec: 5776.7). Total num frames: 68424704. Throughput: 0: 5072.6. Samples: 68418096. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 03:26:09,514][25689] Avg episode reward: [(0, '-56.047')] [2022-07-09 03:26:10,743][26022] Updated weights on worker 0-0, policy_version 66828 (0.00085) [2022-07-09 03:26:12,353][26022] Updated weights on worker 0-0, policy_version 66838 (0.00084) [2022-07-09 03:26:14,266][26022] Updated weights on worker 0-0, policy_version 66848 (0.00091) [2022-07-09 03:26:14,550][25689] Fps is (10 sec: 5810.7, 60 sec: 5786.4, 300 sec: 5779.4). Total num frames: 68454400. Throughput: 0: 5934.3. Samples: 68453062. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:14,550][25689] Avg episode reward: [(0, '-55.306')] [2022-07-09 03:26:16,101][26022] Updated weights on worker 0-0, policy_version 66858 (0.00091) [2022-07-09 03:26:17,723][26022] Updated weights on worker 0-0, policy_version 66868 (0.00084) [2022-07-09 03:26:19,558][25689] Fps is (10 sec: 5709.0, 60 sec: 5735.9, 300 sec: 5771.3). Total num frames: 68482048. Throughput: 0: 5920.9. Samples: 68487718. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:19,559][25689] Avg episode reward: [(0, '-55.535')] [2022-07-09 03:26:19,649][26022] Updated weights on worker 0-0, policy_version 66878 (0.00090) [2022-07-09 03:26:21,214][26022] Updated weights on worker 0-0, policy_version 66888 (0.00087) [2022-07-09 03:26:21,794][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:26:21,811][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000066891_68496384.pth [2022-07-09 03:26:21,811][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000064861_66417664.pth [2022-07-09 03:26:22,976][26022] Updated weights on worker 0-0, policy_version 66898 (0.00087) [2022-07-09 03:26:24,657][25689] Fps is (10 sec: 5572.1, 60 sec: 5732.4, 300 sec: 5767.7). Total num frames: 68510720. Throughput: 0: 5194.0. Samples: 68505140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:24,659][25689] Avg episode reward: [(0, '-55.388')] [2022-07-09 03:26:24,871][26022] Updated weights on worker 0-0, policy_version 66908 (0.00082) [2022-07-09 03:26:26,668][26022] Updated weights on worker 0-0, policy_version 66918 (0.00080) [2022-07-09 03:26:28,488][26022] Updated weights on worker 0-0, policy_version 66928 (0.00089) [2022-07-09 03:26:29,671][25689] Fps is (10 sec: 5873.1, 60 sec: 5767.9, 300 sec: 5774.5). Total num frames: 68541440. Throughput: 0: 6044.1. Samples: 68539928. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:29,672][25689] Avg episode reward: [(0, '-54.842')] [2022-07-09 03:26:30,095][26022] Updated weights on worker 0-0, policy_version 66938 (0.00085) [2022-07-09 03:26:31,887][26022] Updated weights on worker 0-0, policy_version 66948 (0.00091) [2022-07-09 03:26:33,752][26022] Updated weights on worker 0-0, policy_version 66958 (0.00084) [2022-07-09 03:26:34,687][25689] Fps is (10 sec: 5921.8, 60 sec: 5750.1, 300 sec: 5771.7). Total num frames: 68570112. Throughput: 0: 6013.0. Samples: 68574144. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:34,687][25689] Avg episode reward: [(0, '-55.425')] [2022-07-09 03:26:35,443][26022] Updated weights on worker 0-0, policy_version 66968 (0.00082) [2022-07-09 03:26:37,203][26022] Updated weights on worker 0-0, policy_version 66978 (0.00085) [2022-07-09 03:26:39,179][26022] Updated weights on worker 0-0, policy_version 66988 (0.00112) [2022-07-09 03:26:39,697][25689] Fps is (10 sec: 5617.3, 60 sec: 5716.1, 300 sec: 5766.1). Total num frames: 68597760. Throughput: 0: 5144.4. Samples: 68591320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:39,697][25689] Avg episode reward: [(0, '-56.225')] [2022-07-09 03:26:40,932][26022] Updated weights on worker 0-0, policy_version 66998 (0.00091) [2022-07-09 03:26:42,596][26022] Updated weights on worker 0-0, policy_version 67008 (0.00103) [2022-07-09 03:26:44,605][26022] Updated weights on worker 0-0, policy_version 67018 (0.00083) [2022-07-09 03:26:44,762][25689] Fps is (10 sec: 5691.7, 60 sec: 5731.6, 300 sec: 5765.9). Total num frames: 68627456. Throughput: 0: 6012.8. Samples: 68626024. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:44,762][25689] Avg episode reward: [(0, '-56.034')] [2022-07-09 03:26:46,015][26022] Updated weights on worker 0-0, policy_version 67028 (0.00086) [2022-07-09 03:26:48,126][26022] Updated weights on worker 0-0, policy_version 67038 (0.00095) [2022-07-09 03:26:49,662][26022] Updated weights on worker 0-0, policy_version 67048 (0.00093) [2022-07-09 03:26:49,772][25689] Fps is (10 sec: 5996.3, 60 sec: 5749.3, 300 sec: 5770.0). Total num frames: 68658176. Throughput: 0: 6008.7. Samples: 68660714. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:49,773][25689] Avg episode reward: [(0, '-55.631')] [2022-07-09 03:26:51,535][26022] Updated weights on worker 0-0, policy_version 67058 (0.00096) [2022-07-09 03:26:53,253][26022] Updated weights on worker 0-0, policy_version 67068 (0.00087) [2022-07-09 03:26:54,777][25689] Fps is (10 sec: 5827.9, 60 sec: 5732.8, 300 sec: 5767.1). Total num frames: 68685824. Throughput: 0: 5181.1. Samples: 68678234. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:54,778][25689] Avg episode reward: [(0, '-55.681')] [2022-07-09 03:26:55,111][26022] Updated weights on worker 0-0, policy_version 67078 (0.00088) [2022-07-09 03:26:56,807][26022] Updated weights on worker 0-0, policy_version 67088 (0.00085) [2022-07-09 03:26:58,691][26022] Updated weights on worker 0-0, policy_version 67098 (0.00083) [2022-07-09 03:26:59,799][25689] Fps is (10 sec: 5719.5, 60 sec: 5732.4, 300 sec: 5775.2). Total num frames: 68715520. Throughput: 0: 6072.3. Samples: 68713382. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:26:59,799][25689] Avg episode reward: [(0, '-55.767')] [2022-07-09 03:27:00,245][26022] Updated weights on worker 0-0, policy_version 67108 (0.00079) [2022-07-09 03:27:02,530][26022] Updated weights on worker 0-0, policy_version 67118 (0.00084) [2022-07-09 03:27:04,129][26022] Updated weights on worker 0-0, policy_version 67128 (0.00085) [2022-07-09 03:27:04,868][25689] Fps is (10 sec: 5683.0, 60 sec: 5750.9, 300 sec: 5771.4). Total num frames: 68743168. Throughput: 0: 5959.1. Samples: 68745836. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:27:04,868][25689] Avg episode reward: [(0, '-55.817')] [2022-07-09 03:27:06,058][26022] Updated weights on worker 0-0, policy_version 67138 (0.00089) [2022-07-09 03:27:07,768][26022] Updated weights on worker 0-0, policy_version 67148 (0.00091) [2022-07-09 03:27:09,575][26022] Updated weights on worker 0-0, policy_version 67158 (0.00095) [2022-07-09 03:27:09,903][25689] Fps is (10 sec: 5472.6, 60 sec: 5731.4, 300 sec: 5767.7). Total num frames: 68770816. Throughput: 0: 5942.3. Samples: 68780334. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:27:09,903][25689] Avg episode reward: [(0, '-55.957')] [2022-07-09 03:27:11,248][26022] Updated weights on worker 0-0, policy_version 67168 (0.00083) [2022-07-09 03:27:13,359][26022] Updated weights on worker 0-0, policy_version 67178 (0.00819) [2022-07-09 03:27:14,733][26022] Updated weights on worker 0-0, policy_version 67188 (0.00084) [2022-07-09 03:27:14,940][25689] Fps is (10 sec: 5794.9, 60 sec: 5748.3, 300 sec: 5768.2). Total num frames: 68801536. Throughput: 0: 5922.8. Samples: 68797654. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:27:14,940][25689] Avg episode reward: [(0, '-56.324')] [2022-07-09 03:27:16,756][26022] Updated weights on worker 0-0, policy_version 67198 (0.00089) [2022-07-09 03:27:18,350][26022] Updated weights on worker 0-0, policy_version 67208 (0.00092) [2022-07-09 03:27:19,991][25689] Fps is (10 sec: 5785.4, 60 sec: 5744.1, 300 sec: 5762.8). Total num frames: 68829184. Throughput: 0: 5886.7. Samples: 68832252. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:27:19,992][25689] Avg episode reward: [(0, '-56.716')] [2022-07-09 03:27:20,188][26022] Updated weights on worker 0-0, policy_version 67218 (0.00093) [2022-07-09 03:27:21,995][26022] Updated weights on worker 0-0, policy_version 67228 (0.00094) [2022-07-09 03:27:23,674][26022] Updated weights on worker 0-0, policy_version 67238 (0.00086) [2022-07-09 03:27:25,034][25689] Fps is (10 sec: 5579.3, 60 sec: 5749.5, 300 sec: 5759.8). Total num frames: 68857856. Throughput: 0: 6021.1. Samples: 68867262. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:27:25,035][25689] Avg episode reward: [(0, '-57.047')] [2022-07-09 03:27:25,466][26022] Updated weights on worker 0-0, policy_version 67248 (0.00086) [2022-07-09 03:27:27,357][26022] Updated weights on worker 0-0, policy_version 67258 (0.00088) [2022-07-09 03:27:28,920][26022] Updated weights on worker 0-0, policy_version 67268 (0.00088) [2022-07-09 03:27:30,061][25689] Fps is (10 sec: 5796.6, 60 sec: 5731.3, 300 sec: 5763.8). Total num frames: 68887552. Throughput: 0: 5172.7. Samples: 68884606. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 03:27:30,061][25689] Avg episode reward: [(0, '-55.853')] [2022-07-09 03:27:30,795][26022] Updated weights on worker 0-0, policy_version 67278 (0.00084) [2022-07-09 03:27:32,770][26022] Updated weights on worker 0-0, policy_version 67288 (0.00087) [2022-07-09 03:27:34,267][26022] Updated weights on worker 0-0, policy_version 67298 (0.00084) [2022-07-09 03:27:35,097][25689] Fps is (10 sec: 5800.3, 60 sec: 5729.3, 300 sec: 5759.9). Total num frames: 68916224. Throughput: 0: 6032.9. Samples: 68919262. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:27:35,098][25689] Avg episode reward: [(0, '-55.773')] [2022-07-09 03:27:36,175][26022] Updated weights on worker 0-0, policy_version 67308 (0.00089) [2022-07-09 03:27:37,774][26022] Updated weights on worker 0-0, policy_version 67318 (0.00087) [2022-07-09 03:27:39,646][26022] Updated weights on worker 0-0, policy_version 67328 (0.00091) [2022-07-09 03:27:40,115][25689] Fps is (10 sec: 5805.5, 60 sec: 5762.5, 300 sec: 5761.1). Total num frames: 68945920. Throughput: 0: 6069.5. Samples: 68954390. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:27:40,115][25689] Avg episode reward: [(0, '-55.897')] [2022-07-09 03:27:41,467][26022] Updated weights on worker 0-0, policy_version 67338 (0.00085) [2022-07-09 03:27:43,160][26022] Updated weights on worker 0-0, policy_version 67348 (0.00091) [2022-07-09 03:27:45,021][26022] Updated weights on worker 0-0, policy_version 67358 (0.00080) [2022-07-09 03:27:45,248][25689] Fps is (10 sec: 5851.3, 60 sec: 5756.1, 300 sec: 5762.2). Total num frames: 68975616. Throughput: 0: 5168.7. Samples: 68971738. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:27:45,248][25689] Avg episode reward: [(0, '-56.299')] [2022-07-09 03:27:46,808][26022] Updated weights on worker 0-0, policy_version 67368 (0.00399) [2022-07-09 03:27:48,457][26022] Updated weights on worker 0-0, policy_version 67378 (0.00076) [2022-07-09 03:27:50,291][25689] Fps is (10 sec: 5735.8, 60 sec: 5719.1, 300 sec: 5758.6). Total num frames: 69004288. Throughput: 0: 6028.0. Samples: 69006552. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:27:50,291][25689] Avg episode reward: [(0, '-56.357')] [2022-07-09 03:27:50,462][26022] Updated weights on worker 0-0, policy_version 67388 (0.00087) [2022-07-09 03:27:52,063][26022] Updated weights on worker 0-0, policy_version 67398 (0.00085) [2022-07-09 03:27:53,724][26022] Updated weights on worker 0-0, policy_version 67408 (0.00096) [2022-07-09 03:27:55,342][25689] Fps is (10 sec: 5883.7, 60 sec: 5765.5, 300 sec: 5771.5). Total num frames: 69035008. Throughput: 0: 6031.9. Samples: 69041376. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:27:55,342][25689] Avg episode reward: [(0, '-55.883')] [2022-07-09 03:27:55,694][26022] Updated weights on worker 0-0, policy_version 67418 (0.00084) [2022-07-09 03:27:57,230][26022] Updated weights on worker 0-0, policy_version 67428 (0.00095) [2022-07-09 03:27:59,334][26022] Updated weights on worker 0-0, policy_version 67438 (0.00081) [2022-07-09 03:28:00,358][25689] Fps is (10 sec: 5899.3, 60 sec: 5749.0, 300 sec: 5776.2). Total num frames: 69063680. Throughput: 0: 5154.0. Samples: 69058726. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:00,360][25689] Avg episode reward: [(0, '-56.178')] [2022-07-09 03:28:00,834][26022] Updated weights on worker 0-0, policy_version 67448 (0.00088) [2022-07-09 03:28:03,137][26022] Updated weights on worker 0-0, policy_version 67458 (0.00085) [2022-07-09 03:28:04,829][26022] Updated weights on worker 0-0, policy_version 67468 (0.00092) [2022-07-09 03:28:05,446][25689] Fps is (10 sec: 5472.3, 60 sec: 5730.3, 300 sec: 5768.5). Total num frames: 69090304. Throughput: 0: 5906.2. Samples: 69091038. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:05,447][25689] Avg episode reward: [(0, '-54.918')] [2022-07-09 03:28:06,584][26022] Updated weights on worker 0-0, policy_version 67478 (0.00057) [2022-07-09 03:28:08,311][26022] Updated weights on worker 0-0, policy_version 67488 (0.00084) [2022-07-09 03:28:09,929][26022] Updated weights on worker 0-0, policy_version 67498 (0.00093) [2022-07-09 03:28:10,458][25689] Fps is (10 sec: 5475.0, 60 sec: 5749.4, 300 sec: 5762.0). Total num frames: 69118976. Throughput: 0: 5925.7. Samples: 69126058. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:10,459][25689] Avg episode reward: [(0, '-54.569')] [2022-07-09 03:28:11,846][26022] Updated weights on worker 0-0, policy_version 67508 (0.00085) [2022-07-09 03:28:13,817][26022] Updated weights on worker 0-0, policy_version 67518 (0.00084) [2022-07-09 03:28:15,320][26022] Updated weights on worker 0-0, policy_version 67528 (0.00096) [2022-07-09 03:28:15,463][25689] Fps is (10 sec: 5929.5, 60 sec: 5752.5, 300 sec: 5769.6). Total num frames: 69149696. Throughput: 0: 5073.1. Samples: 69143452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:15,463][25689] Avg episode reward: [(0, '-52.975')] [2022-07-09 03:28:17,343][26022] Updated weights on worker 0-0, policy_version 67538 (0.00080) [2022-07-09 03:28:18,835][26022] Updated weights on worker 0-0, policy_version 67548 (0.00086) [2022-07-09 03:28:20,486][25689] Fps is (10 sec: 5820.7, 60 sec: 5755.2, 300 sec: 5763.1). Total num frames: 69177344. Throughput: 0: 5939.4. Samples: 69178272. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:20,486][25689] Avg episode reward: [(0, '-53.184')] [2022-07-09 03:28:20,701][26022] Updated weights on worker 0-0, policy_version 67558 (0.00086) [2022-07-09 03:28:21,966][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:28:21,975][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000067565_69186560.pth [2022-07-09 03:28:21,983][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000065537_67109888.pth [2022-07-09 03:28:22,615][26022] Updated weights on worker 0-0, policy_version 67568 (0.00087) [2022-07-09 03:28:24,204][26022] Updated weights on worker 0-0, policy_version 67578 (0.00085) [2022-07-09 03:28:25,600][25689] Fps is (10 sec: 5656.9, 60 sec: 5765.4, 300 sec: 5765.1). Total num frames: 69207040. Throughput: 0: 6043.4. Samples: 69212832. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:25,600][25689] Avg episode reward: [(0, '-53.829')] [2022-07-09 03:28:26,262][26022] Updated weights on worker 0-0, policy_version 67588 (0.00098) [2022-07-09 03:28:27,674][26022] Updated weights on worker 0-0, policy_version 67598 (0.00086) [2022-07-09 03:28:29,776][26022] Updated weights on worker 0-0, policy_version 67608 (0.00090) [2022-07-09 03:28:30,603][25689] Fps is (10 sec: 5870.5, 60 sec: 5767.6, 300 sec: 5765.3). Total num frames: 69236736. Throughput: 0: 5162.0. Samples: 69230046. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:30,603][25689] Avg episode reward: [(0, '-54.191')] [2022-07-09 03:28:31,477][26022] Updated weights on worker 0-0, policy_version 67618 (0.00087) [2022-07-09 03:28:33,261][26022] Updated weights on worker 0-0, policy_version 67628 (0.00086) [2022-07-09 03:28:34,889][26022] Updated weights on worker 0-0, policy_version 67638 (0.00090) [2022-07-09 03:28:35,626][25689] Fps is (10 sec: 5719.6, 60 sec: 5752.0, 300 sec: 5765.0). Total num frames: 69264384. Throughput: 0: 6004.3. Samples: 69264516. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:35,626][25689] Avg episode reward: [(0, '-55.377')] [2022-07-09 03:28:36,780][26022] Updated weights on worker 0-0, policy_version 67648 (0.00091) [2022-07-09 03:28:38,690][26022] Updated weights on worker 0-0, policy_version 67658 (0.00091) [2022-07-09 03:28:40,311][26022] Updated weights on worker 0-0, policy_version 67668 (0.00087) [2022-07-09 03:28:40,643][25689] Fps is (10 sec: 5609.6, 60 sec: 5735.1, 300 sec: 5760.2). Total num frames: 69293056. Throughput: 0: 5997.5. Samples: 69299164. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:40,643][25689] Avg episode reward: [(0, '-55.590')] [2022-07-09 03:28:41,990][26022] Updated weights on worker 0-0, policy_version 67678 (0.00084) [2022-07-09 03:28:43,770][26022] Updated weights on worker 0-0, policy_version 67688 (0.00090) [2022-07-09 03:28:45,539][26022] Updated weights on worker 0-0, policy_version 67698 (0.00088) [2022-07-09 03:28:45,759][25689] Fps is (10 sec: 5861.3, 60 sec: 5753.6, 300 sec: 5758.5). Total num frames: 69323776. Throughput: 0: 5155.2. Samples: 69316756. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:45,759][25689] Avg episode reward: [(0, '-56.092')] [2022-07-09 03:28:47,585][26022] Updated weights on worker 0-0, policy_version 67708 (0.00083) [2022-07-09 03:28:48,952][26022] Updated weights on worker 0-0, policy_version 67718 (0.00086) [2022-07-09 03:28:50,833][25689] Fps is (10 sec: 5828.6, 60 sec: 5750.7, 300 sec: 5757.9). Total num frames: 69352448. Throughput: 0: 6024.0. Samples: 69351910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:50,833][25689] Avg episode reward: [(0, '-55.747')] [2022-07-09 03:28:50,910][26022] Updated weights on worker 0-0, policy_version 67728 (0.00092) [2022-07-09 03:28:52,578][26022] Updated weights on worker 0-0, policy_version 67738 (0.00087) [2022-07-09 03:28:54,429][26022] Updated weights on worker 0-0, policy_version 67748 (0.00087) [2022-07-09 03:28:55,838][25689] Fps is (10 sec: 5790.8, 60 sec: 5738.1, 300 sec: 5761.8). Total num frames: 69382144. Throughput: 0: 6043.5. Samples: 69386670. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:28:55,839][25689] Avg episode reward: [(0, '-55.479')] [2022-07-09 03:28:56,340][26022] Updated weights on worker 0-0, policy_version 67758 (0.00087) [2022-07-09 03:28:57,743][26022] Updated weights on worker 0-0, policy_version 67768 (0.00087) [2022-07-09 03:28:59,759][26022] Updated weights on worker 0-0, policy_version 67778 (0.00083) [2022-07-09 03:29:00,859][25689] Fps is (10 sec: 6025.4, 60 sec: 5771.5, 300 sec: 5774.3). Total num frames: 69412864. Throughput: 0: 5190.1. Samples: 69404090. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:00,861][25689] Avg episode reward: [(0, '-55.611')] [2022-07-09 03:29:01,225][26022] Updated weights on worker 0-0, policy_version 67788 (0.00091) [2022-07-09 03:29:03,639][26022] Updated weights on worker 0-0, policy_version 67798 (0.00089) [2022-07-09 03:29:05,503][26022] Updated weights on worker 0-0, policy_version 67808 (0.00085) [2022-07-09 03:29:05,909][25689] Fps is (10 sec: 5592.6, 60 sec: 5758.3, 300 sec: 5756.7). Total num frames: 69438464. Throughput: 0: 5954.5. Samples: 69436738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:05,911][25689] Avg episode reward: [(0, '-55.332')] [2022-07-09 03:29:07,047][26022] Updated weights on worker 0-0, policy_version 67818 (0.00090) [2022-07-09 03:29:08,931][26022] Updated weights on worker 0-0, policy_version 67828 (0.00089) [2022-07-09 03:29:10,710][26022] Updated weights on worker 0-0, policy_version 67838 (0.00089) [2022-07-09 03:29:10,914][25689] Fps is (10 sec: 5499.9, 60 sec: 5775.9, 300 sec: 5767.0). Total num frames: 69468160. Throughput: 0: 5960.0. Samples: 69471592. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:10,915][25689] Avg episode reward: [(0, '-55.642')] [2022-07-09 03:29:12,224][26022] Updated weights on worker 0-0, policy_version 67848 (0.00091) [2022-07-09 03:29:14,229][26022] Updated weights on worker 0-0, policy_version 67858 (0.00097) [2022-07-09 03:29:15,619][26022] Updated weights on worker 0-0, policy_version 67868 (0.00090) [2022-07-09 03:29:15,961][25689] Fps is (10 sec: 5908.1, 60 sec: 5754.8, 300 sec: 5762.9). Total num frames: 69497856. Throughput: 0: 5091.0. Samples: 69489118. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:15,962][25689] Avg episode reward: [(0, '-54.865')] [2022-07-09 03:29:17,671][26022] Updated weights on worker 0-0, policy_version 67878 (0.00088) [2022-07-09 03:29:19,187][26022] Updated weights on worker 0-0, policy_version 67888 (0.00087) [2022-07-09 03:29:20,998][25689] Fps is (10 sec: 5686.4, 60 sec: 5753.6, 300 sec: 5756.5). Total num frames: 69525504. Throughput: 0: 5956.1. Samples: 69524036. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:20,998][25689] Avg episode reward: [(0, '-54.668')] [2022-07-09 03:29:21,212][26022] Updated weights on worker 0-0, policy_version 67898 (0.00088) [2022-07-09 03:29:22,973][26022] Updated weights on worker 0-0, policy_version 67908 (0.00086) [2022-07-09 03:29:24,716][26022] Updated weights on worker 0-0, policy_version 67918 (0.00087) [2022-07-09 03:29:26,105][25689] Fps is (10 sec: 5653.0, 60 sec: 5754.2, 300 sec: 5762.2). Total num frames: 69555200. Throughput: 0: 6062.0. Samples: 69559170. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:26,106][25689] Avg episode reward: [(0, '-54.681')] [2022-07-09 03:29:26,551][26022] Updated weights on worker 0-0, policy_version 67928 (0.00084) [2022-07-09 03:29:28,126][26022] Updated weights on worker 0-0, policy_version 67938 (0.00093) [2022-07-09 03:29:29,894][26022] Updated weights on worker 0-0, policy_version 67948 (0.00084) [2022-07-09 03:29:31,121][25689] Fps is (10 sec: 5867.1, 60 sec: 5753.0, 300 sec: 5762.2). Total num frames: 69584896. Throughput: 0: 6063.9. Samples: 69594126. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:31,121][25689] Avg episode reward: [(0, '-53.751')] [2022-07-09 03:29:31,571][26022] Updated weights on worker 0-0, policy_version 67958 (0.00085) [2022-07-09 03:29:33,475][26022] Updated weights on worker 0-0, policy_version 67968 (0.00100) [2022-07-09 03:29:35,495][26022] Updated weights on worker 0-0, policy_version 67978 (0.00054) [2022-07-09 03:29:36,127][25689] Fps is (10 sec: 5824.3, 60 sec: 5771.5, 300 sec: 5765.9). Total num frames: 69613568. Throughput: 0: 6059.5. Samples: 69611310. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:36,127][25689] Avg episode reward: [(0, '-54.371')] [2022-07-09 03:29:37,099][26022] Updated weights on worker 0-0, policy_version 67988 (0.00084) [2022-07-09 03:29:38,839][26022] Updated weights on worker 0-0, policy_version 67998 (0.00083) [2022-07-09 03:29:40,695][26022] Updated weights on worker 0-0, policy_version 68008 (0.00086) [2022-07-09 03:29:41,155][25689] Fps is (10 sec: 5817.0, 60 sec: 5787.4, 300 sec: 5759.5). Total num frames: 69643264. Throughput: 0: 6061.3. Samples: 69646212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:41,155][25689] Avg episode reward: [(0, '-53.589')] [2022-07-09 03:29:42,194][26022] Updated weights on worker 0-0, policy_version 68018 (0.00081) [2022-07-09 03:29:44,187][26022] Updated weights on worker 0-0, policy_version 68028 (0.00097) [2022-07-09 03:29:45,735][26022] Updated weights on worker 0-0, policy_version 68038 (0.00085) [2022-07-09 03:29:46,226][25689] Fps is (10 sec: 5881.0, 60 sec: 5774.8, 300 sec: 5762.1). Total num frames: 69672960. Throughput: 0: 6071.0. Samples: 69681322. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:46,227][25689] Avg episode reward: [(0, '-54.352')] [2022-07-09 03:29:47,578][26022] Updated weights on worker 0-0, policy_version 68048 (0.00089) [2022-07-09 03:29:49,569][26022] Updated weights on worker 0-0, policy_version 68058 (0.00053) [2022-07-09 03:29:51,154][26022] Updated weights on worker 0-0, policy_version 68068 (0.00091) [2022-07-09 03:29:51,246][25689] Fps is (10 sec: 5784.3, 60 sec: 5780.0, 300 sec: 5758.4). Total num frames: 69701632. Throughput: 0: 5202.8. Samples: 69698832. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:51,248][25689] Avg episode reward: [(0, '-54.196')] [2022-07-09 03:29:52,896][26022] Updated weights on worker 0-0, policy_version 68078 (0.00092) [2022-07-09 03:29:54,713][26022] Updated weights on worker 0-0, policy_version 68088 (0.00081) [2022-07-09 03:29:56,261][25689] Fps is (10 sec: 5816.4, 60 sec: 5779.0, 300 sec: 5755.0). Total num frames: 69731328. Throughput: 0: 6096.4. Samples: 69734056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:29:56,263][25689] Avg episode reward: [(0, '-54.102')] [2022-07-09 03:29:56,350][26022] Updated weights on worker 0-0, policy_version 68098 (0.00088) [2022-07-09 03:29:58,364][26022] Updated weights on worker 0-0, policy_version 68108 (0.00086) [2022-07-09 03:29:59,711][26022] Updated weights on worker 0-0, policy_version 68118 (0.00085) [2022-07-09 03:30:01,278][25689] Fps is (10 sec: 5817.9, 60 sec: 5745.5, 300 sec: 5766.4). Total num frames: 69760000. Throughput: 0: 6111.8. Samples: 69769202. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:30:01,280][25689] Avg episode reward: [(0, '-54.779')] [2022-07-09 03:30:02,009][26022] Updated weights on worker 0-0, policy_version 68128 (0.00095) [2022-07-09 03:30:03,638][26022] Updated weights on worker 0-0, policy_version 68138 (0.00083) [2022-07-09 03:30:05,535][26022] Updated weights on worker 0-0, policy_version 68148 (0.00575) [2022-07-09 03:30:06,419][25689] Fps is (10 sec: 5544.4, 60 sec: 5770.7, 300 sec: 5753.5). Total num frames: 69787648. Throughput: 0: 5108.5. Samples: 69784478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:30:06,421][25689] Avg episode reward: [(0, '-54.810')] [2022-07-09 03:30:07,353][26022] Updated weights on worker 0-0, policy_version 68158 (0.00087) [2022-07-09 03:30:09,283][26022] Updated weights on worker 0-0, policy_version 68168 (0.00089) [2022-07-09 03:30:10,914][26022] Updated weights on worker 0-0, policy_version 68178 (0.00086) [2022-07-09 03:30:11,448][25689] Fps is (10 sec: 5739.4, 60 sec: 5785.3, 300 sec: 5763.5). Total num frames: 69818368. Throughput: 0: 5965.7. Samples: 69819352. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:30:11,452][25689] Avg episode reward: [(0, '-55.131')] [2022-07-09 03:30:12,731][26022] Updated weights on worker 0-0, policy_version 68188 (0.00089) [2022-07-09 03:30:14,408][26022] Updated weights on worker 0-0, policy_version 68198 (0.00086) [2022-07-09 03:30:16,151][26022] Updated weights on worker 0-0, policy_version 68208 (0.00084) [2022-07-09 03:30:16,492][25689] Fps is (10 sec: 5895.8, 60 sec: 5768.7, 300 sec: 5756.0). Total num frames: 69847040. Throughput: 0: 5959.4. Samples: 69854624. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:30:16,493][25689] Avg episode reward: [(0, '-55.047')] [2022-07-09 03:30:17,811][26022] Updated weights on worker 0-0, policy_version 68218 (0.00090) [2022-07-09 03:30:19,453][26022] Updated weights on worker 0-0, policy_version 68228 (0.00086) [2022-07-09 03:30:21,234][26022] Updated weights on worker 0-0, policy_version 68238 (0.00086) [2022-07-09 03:30:21,502][25689] Fps is (10 sec: 5805.2, 60 sec: 5805.1, 300 sec: 5760.4). Total num frames: 69876736. Throughput: 0: 5102.9. Samples: 69872404. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:21,503][25689] Avg episode reward: [(0, '-54.848')] [2022-07-09 03:30:21,998][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:30:22,010][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000068242_69879808.pth [2022-07-09 03:30:22,011][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000066213_67802112.pth [2022-07-09 03:30:22,870][26022] Updated weights on worker 0-0, policy_version 68248 (0.00091) [2022-07-09 03:30:24,767][26022] Updated weights on worker 0-0, policy_version 68258 (0.00083) [2022-07-09 03:30:26,452][26022] Updated weights on worker 0-0, policy_version 68268 (0.00087) [2022-07-09 03:30:26,541][25689] Fps is (10 sec: 5910.4, 60 sec: 5811.6, 300 sec: 5763.7). Total num frames: 69906432. Throughput: 0: 6110.4. Samples: 69907434. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:26,542][25689] Avg episode reward: [(0, '-54.790')] [2022-07-09 03:30:28,322][26022] Updated weights on worker 0-0, policy_version 68278 (0.00087) [2022-07-09 03:30:30,120][26022] Updated weights on worker 0-0, policy_version 68288 (0.00086) [2022-07-09 03:30:31,566][25689] Fps is (10 sec: 5698.0, 60 sec: 5776.9, 300 sec: 5756.5). Total num frames: 69934080. Throughput: 0: 6109.5. Samples: 69942264. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:31,566][25689] Avg episode reward: [(0, '-54.808')] [2022-07-09 03:30:31,843][26022] Updated weights on worker 0-0, policy_version 68298 (0.00085) [2022-07-09 03:30:33,667][26022] Updated weights on worker 0-0, policy_version 68308 (0.00099) [2022-07-09 03:30:35,297][26022] Updated weights on worker 0-0, policy_version 68318 (0.00084) [2022-07-09 03:30:36,567][25689] Fps is (10 sec: 5617.7, 60 sec: 5777.4, 300 sec: 5753.2). Total num frames: 69962752. Throughput: 0: 5242.4. Samples: 69959862. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:36,567][25689] Avg episode reward: [(0, '-54.106')] [2022-07-09 03:30:37,092][26022] Updated weights on worker 0-0, policy_version 68328 (0.00111) [2022-07-09 03:30:38,987][26022] Updated weights on worker 0-0, policy_version 68338 (0.00095) [2022-07-09 03:30:40,481][26022] Updated weights on worker 0-0, policy_version 68348 (0.00083) [2022-07-09 03:30:41,585][25689] Fps is (10 sec: 5825.6, 60 sec: 5778.3, 300 sec: 5757.2). Total num frames: 69992448. Throughput: 0: 6089.0. Samples: 69994690. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:41,586][25689] Avg episode reward: [(0, '-54.033')] [2022-07-09 03:30:42,545][26022] Updated weights on worker 0-0, policy_version 68358 (0.00085) [2022-07-09 03:30:44,092][26022] Updated weights on worker 0-0, policy_version 68368 (0.00094) [2022-07-09 03:30:45,860][26022] Updated weights on worker 0-0, policy_version 68378 (0.00089) [2022-07-09 03:30:46,710][25689] Fps is (10 sec: 5956.2, 60 sec: 5790.1, 300 sec: 5758.6). Total num frames: 70023168. Throughput: 0: 6051.7. Samples: 70029490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:46,711][25689] Avg episode reward: [(0, '-54.555')] [2022-07-09 03:30:47,795][26022] Updated weights on worker 0-0, policy_version 68388 (0.00089) [2022-07-09 03:30:49,391][26022] Updated weights on worker 0-0, policy_version 68398 (0.00093) [2022-07-09 03:30:51,255][26022] Updated weights on worker 0-0, policy_version 68408 (0.00082) [2022-07-09 03:30:51,798][25689] Fps is (10 sec: 5915.5, 60 sec: 5800.4, 300 sec: 5760.5). Total num frames: 70052864. Throughput: 0: 5187.2. Samples: 70047214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:51,799][25689] Avg episode reward: [(0, '-54.412')] [2022-07-09 03:30:52,699][26022] Updated weights on worker 0-0, policy_version 68418 (0.00089) [2022-07-09 03:30:54,795][26022] Updated weights on worker 0-0, policy_version 68428 (0.00079) [2022-07-09 03:30:56,340][26022] Updated weights on worker 0-0, policy_version 68438 (0.00087) [2022-07-09 03:30:56,816][25689] Fps is (10 sec: 5775.6, 60 sec: 5783.3, 300 sec: 5757.1). Total num frames: 70081536. Throughput: 0: 6038.2. Samples: 70082132. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:30:56,817][25689] Avg episode reward: [(0, '-54.350')] [2022-07-09 03:30:58,246][26022] Updated weights on worker 0-0, policy_version 68448 (0.00091) [2022-07-09 03:31:00,013][26022] Updated weights on worker 0-0, policy_version 68458 (0.00082) [2022-07-09 03:31:01,883][25689] Fps is (10 sec: 5584.5, 60 sec: 5761.6, 300 sec: 5760.8). Total num frames: 70109184. Throughput: 0: 6020.2. Samples: 70116890. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:01,884][25689] Avg episode reward: [(0, '-54.755')] [2022-07-09 03:31:02,136][26022] Updated weights on worker 0-0, policy_version 68468 (0.00090) [2022-07-09 03:31:03,995][26022] Updated weights on worker 0-0, policy_version 68478 (0.00586) [2022-07-09 03:31:05,682][26022] Updated weights on worker 0-0, policy_version 68488 (0.00055) [2022-07-09 03:31:06,963][25689] Fps is (10 sec: 5651.3, 60 sec: 5801.2, 300 sec: 5762.9). Total num frames: 70138880. Throughput: 0: 5072.2. Samples: 70132218. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:06,964][25689] Avg episode reward: [(0, '-55.547')] [2022-07-09 03:31:07,299][26022] Updated weights on worker 0-0, policy_version 68498 (0.00085) [2022-07-09 03:31:09,431][26022] Updated weights on worker 0-0, policy_version 68508 (0.00085) [2022-07-09 03:31:10,902][26022] Updated weights on worker 0-0, policy_version 68518 (0.00088) [2022-07-09 03:31:12,064][25689] Fps is (10 sec: 5733.5, 60 sec: 5760.6, 300 sec: 5758.2). Total num frames: 70167552. Throughput: 0: 5914.4. Samples: 70167072. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:12,064][25689] Avg episode reward: [(0, '-54.769')] [2022-07-09 03:31:12,849][26022] Updated weights on worker 0-0, policy_version 68528 (0.00087) [2022-07-09 03:31:14,460][26022] Updated weights on worker 0-0, policy_version 68538 (0.00086) [2022-07-09 03:31:16,195][26022] Updated weights on worker 0-0, policy_version 68548 (0.00099) [2022-07-09 03:31:17,090][25689] Fps is (10 sec: 5865.0, 60 sec: 5796.2, 300 sec: 5768.2). Total num frames: 70198272. Throughput: 0: 5934.4. Samples: 70202444. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:17,090][25689] Avg episode reward: [(0, '-54.957')] [2022-07-09 03:31:17,908][26022] Updated weights on worker 0-0, policy_version 68558 (0.00085) [2022-07-09 03:31:19,557][26022] Updated weights on worker 0-0, policy_version 68568 (0.00081) [2022-07-09 03:31:21,472][26022] Updated weights on worker 0-0, policy_version 68578 (0.00064) [2022-07-09 03:31:22,141][25689] Fps is (10 sec: 5995.3, 60 sec: 5792.2, 300 sec: 5772.6). Total num frames: 70227968. Throughput: 0: 5095.9. Samples: 70220120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:22,141][25689] Avg episode reward: [(0, '-54.997')] [2022-07-09 03:31:23,182][26022] Updated weights on worker 0-0, policy_version 68588 (0.00092) [2022-07-09 03:31:25,080][26022] Updated weights on worker 0-0, policy_version 68598 (0.00089) [2022-07-09 03:31:26,721][26022] Updated weights on worker 0-0, policy_version 68608 (0.00079) [2022-07-09 03:31:27,278][25689] Fps is (10 sec: 5728.9, 60 sec: 5766.0, 300 sec: 5763.3). Total num frames: 70256640. Throughput: 0: 6046.6. Samples: 70255052. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:27,278][25689] Avg episode reward: [(0, '-54.399')] [2022-07-09 03:31:28,549][26022] Updated weights on worker 0-0, policy_version 68618 (0.00086) [2022-07-09 03:31:30,382][26022] Updated weights on worker 0-0, policy_version 68628 (0.00086) [2022-07-09 03:31:32,161][26022] Updated weights on worker 0-0, policy_version 68638 (0.00085) [2022-07-09 03:31:32,329][25689] Fps is (10 sec: 5728.8, 60 sec: 5797.2, 300 sec: 5766.1). Total num frames: 70286336. Throughput: 0: 6051.9. Samples: 70289718. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:32,330][25689] Avg episode reward: [(0, '-53.509')] [2022-07-09 03:31:33,678][26022] Updated weights on worker 0-0, policy_version 68648 (0.00089) [2022-07-09 03:31:35,655][26022] Updated weights on worker 0-0, policy_version 68658 (0.00090) [2022-07-09 03:31:37,319][26022] Updated weights on worker 0-0, policy_version 68668 (0.00092) [2022-07-09 03:31:37,419][25689] Fps is (10 sec: 5856.7, 60 sec: 5805.6, 300 sec: 5771.4). Total num frames: 70316032. Throughput: 0: 6015.6. Samples: 70324736. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:37,421][25689] Avg episode reward: [(0, '-53.567')] [2022-07-09 03:31:39,173][26022] Updated weights on worker 0-0, policy_version 68678 (0.00090) [2022-07-09 03:31:40,983][26022] Updated weights on worker 0-0, policy_version 68688 (0.00093) [2022-07-09 03:31:42,484][25689] Fps is (10 sec: 5848.7, 60 sec: 5801.1, 300 sec: 5771.4). Total num frames: 70345728. Throughput: 0: 5983.3. Samples: 70341840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 03:31:42,485][25689] Avg episode reward: [(0, '-53.985')] [2022-07-09 03:31:42,520][26022] Updated weights on worker 0-0, policy_version 68698 (0.00087) [2022-07-09 03:31:44,517][26022] Updated weights on worker 0-0, policy_version 68708 (0.00082) [2022-07-09 03:31:46,161][26022] Updated weights on worker 0-0, policy_version 68718 (0.00093) [2022-07-09 03:31:47,575][25689] Fps is (10 sec: 5646.1, 60 sec: 5753.8, 300 sec: 5759.5). Total num frames: 70373376. Throughput: 0: 5989.3. Samples: 70376620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:31:47,576][25689] Avg episode reward: [(0, '-53.664')] [2022-07-09 03:31:47,997][26022] Updated weights on worker 0-0, policy_version 68728 (0.00550) [2022-07-09 03:31:49,833][26022] Updated weights on worker 0-0, policy_version 68738 (0.00086) [2022-07-09 03:31:51,351][26022] Updated weights on worker 0-0, policy_version 68748 (0.00084) [2022-07-09 03:31:52,632][25689] Fps is (10 sec: 5650.8, 60 sec: 5756.8, 300 sec: 5765.4). Total num frames: 70403072. Throughput: 0: 5999.7. Samples: 70411528. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:31:52,633][25689] Avg episode reward: [(0, '-54.020')] [2022-07-09 03:31:53,319][26022] Updated weights on worker 0-0, policy_version 68758 (0.00086) [2022-07-09 03:31:55,023][26022] Updated weights on worker 0-0, policy_version 68768 (0.00082) [2022-07-09 03:31:56,839][26022] Updated weights on worker 0-0, policy_version 68778 (0.00094) [2022-07-09 03:31:57,657][25689] Fps is (10 sec: 5992.5, 60 sec: 5789.8, 300 sec: 5768.7). Total num frames: 70433792. Throughput: 0: 5158.9. Samples: 70429144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:31:57,658][25689] Avg episode reward: [(0, '-53.689')] [2022-07-09 03:31:58,557][26022] Updated weights on worker 0-0, policy_version 68788 (0.00093) [2022-07-09 03:32:00,235][26022] Updated weights on worker 0-0, policy_version 68798 (0.00097) [2022-07-09 03:32:02,186][26022] Updated weights on worker 0-0, policy_version 68808 (0.00097) [2022-07-09 03:32:02,673][25689] Fps is (10 sec: 5711.0, 60 sec: 5777.8, 300 sec: 5766.3). Total num frames: 70460416. Throughput: 0: 6066.1. Samples: 70464308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:02,673][25689] Avg episode reward: [(0, '-54.432')] [2022-07-09 03:32:04,247][26022] Updated weights on worker 0-0, policy_version 68818 (0.00095) [2022-07-09 03:32:05,893][26022] Updated weights on worker 0-0, policy_version 68828 (0.00083) [2022-07-09 03:32:07,709][25689] Fps is (10 sec: 5501.0, 60 sec: 5765.1, 300 sec: 5769.8). Total num frames: 70489088. Throughput: 0: 5962.6. Samples: 70496670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:07,710][25689] Avg episode reward: [(0, '-54.312')] [2022-07-09 03:32:07,818][26022] Updated weights on worker 0-0, policy_version 68838 (0.00082) [2022-07-09 03:32:09,700][26022] Updated weights on worker 0-0, policy_version 68848 (0.00092) [2022-07-09 03:32:11,130][26022] Updated weights on worker 0-0, policy_version 68858 (0.00093) [2022-07-09 03:32:12,727][25689] Fps is (10 sec: 5805.6, 60 sec: 5789.9, 300 sec: 5766.7). Total num frames: 70518784. Throughput: 0: 5107.9. Samples: 70514170. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:12,727][25689] Avg episode reward: [(0, '-55.044')] [2022-07-09 03:32:13,171][26022] Updated weights on worker 0-0, policy_version 68868 (0.00087) [2022-07-09 03:32:14,521][26022] Updated weights on worker 0-0, policy_version 68878 (0.00098) [2022-07-09 03:32:16,640][26022] Updated weights on worker 0-0, policy_version 68888 (0.00087) [2022-07-09 03:32:17,740][25689] Fps is (10 sec: 5920.7, 60 sec: 5774.2, 300 sec: 5774.3). Total num frames: 70548480. Throughput: 0: 5994.1. Samples: 70549524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:17,741][25689] Avg episode reward: [(0, '-54.732')] [2022-07-09 03:32:18,115][26022] Updated weights on worker 0-0, policy_version 68898 (0.00087) [2022-07-09 03:32:20,063][26022] Updated weights on worker 0-0, policy_version 68908 (0.00086) [2022-07-09 03:32:21,707][26022] Updated weights on worker 0-0, policy_version 68918 (0.00088) [2022-07-09 03:32:22,114][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:32:22,128][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000068920_70574080.pth [2022-07-09 03:32:22,129][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000066891_68496384.pth [2022-07-09 03:32:22,764][25689] Fps is (10 sec: 5815.1, 60 sec: 5759.9, 300 sec: 5774.7). Total num frames: 70577152. Throughput: 0: 5979.7. Samples: 70584446. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:22,764][25689] Avg episode reward: [(0, '-55.169')] [2022-07-09 03:32:23,492][26022] Updated weights on worker 0-0, policy_version 68928 (0.00091) [2022-07-09 03:32:25,266][26022] Updated weights on worker 0-0, policy_version 68938 (0.00088) [2022-07-09 03:32:27,165][26022] Updated weights on worker 0-0, policy_version 68948 (0.00086) [2022-07-09 03:32:27,878][25689] Fps is (10 sec: 5757.3, 60 sec: 5779.0, 300 sec: 5773.0). Total num frames: 70606848. Throughput: 0: 5217.6. Samples: 70601906. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:27,879][25689] Avg episode reward: [(0, '-55.421')] [2022-07-09 03:32:28,859][26022] Updated weights on worker 0-0, policy_version 68958 (0.00101) [2022-07-09 03:32:30,659][26022] Updated weights on worker 0-0, policy_version 68968 (0.00084) [2022-07-09 03:32:32,293][26022] Updated weights on worker 0-0, policy_version 68978 (0.00077) [2022-07-09 03:32:32,927][25689] Fps is (10 sec: 5743.2, 60 sec: 5762.3, 300 sec: 5772.7). Total num frames: 70635520. Throughput: 0: 6061.4. Samples: 70636612. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:32,927][25689] Avg episode reward: [(0, '-55.600')] [2022-07-09 03:32:34,123][26022] Updated weights on worker 0-0, policy_version 68988 (0.00081) [2022-07-09 03:32:36,011][26022] Updated weights on worker 0-0, policy_version 68998 (0.00102) [2022-07-09 03:32:37,603][26022] Updated weights on worker 0-0, policy_version 69008 (0.00087) [2022-07-09 03:32:38,006][25689] Fps is (10 sec: 5864.2, 60 sec: 5780.2, 300 sec: 5775.0). Total num frames: 70666240. Throughput: 0: 6020.6. Samples: 70671536. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:38,007][25689] Avg episode reward: [(0, '-55.315')] [2022-07-09 03:32:39,495][26022] Updated weights on worker 0-0, policy_version 69018 (0.00093) [2022-07-09 03:32:41,073][26022] Updated weights on worker 0-0, policy_version 69028 (0.00090) [2022-07-09 03:32:42,985][26022] Updated weights on worker 0-0, policy_version 69038 (0.00084) [2022-07-09 03:32:43,093][25689] Fps is (10 sec: 5842.2, 60 sec: 5761.3, 300 sec: 5772.4). Total num frames: 70694912. Throughput: 0: 5140.8. Samples: 70688954. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:43,094][25689] Avg episode reward: [(0, '-55.117')] [2022-07-09 03:32:44,553][26022] Updated weights on worker 0-0, policy_version 69048 (0.00091) [2022-07-09 03:32:46,555][26022] Updated weights on worker 0-0, policy_version 69058 (0.00086) [2022-07-09 03:32:48,071][26022] Updated weights on worker 0-0, policy_version 69068 (0.00092) [2022-07-09 03:32:48,215][25689] Fps is (10 sec: 5918.2, 60 sec: 5825.9, 300 sec: 5781.2). Total num frames: 70726656. Throughput: 0: 6000.0. Samples: 70723924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:48,215][25689] Avg episode reward: [(0, '-54.560')] [2022-07-09 03:32:50,060][26022] Updated weights on worker 0-0, policy_version 69078 (0.00083) [2022-07-09 03:32:51,799][26022] Updated weights on worker 0-0, policy_version 69088 (0.00085) [2022-07-09 03:32:53,285][25689] Fps is (10 sec: 5827.3, 60 sec: 5790.8, 300 sec: 5770.5). Total num frames: 70754304. Throughput: 0: 6014.8. Samples: 70759062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:53,286][25689] Avg episode reward: [(0, '-54.827')] [2022-07-09 03:32:53,647][26022] Updated weights on worker 0-0, policy_version 69098 (0.00084) [2022-07-09 03:32:55,143][26022] Updated weights on worker 0-0, policy_version 69108 (0.00687) [2022-07-09 03:32:57,066][26022] Updated weights on worker 0-0, policy_version 69118 (0.00093) [2022-07-09 03:32:58,308][25689] Fps is (10 sec: 5783.1, 60 sec: 5791.1, 300 sec: 5777.3). Total num frames: 70785024. Throughput: 0: 5176.6. Samples: 70776628. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:32:58,308][25689] Avg episode reward: [(0, '-54.034')] [2022-07-09 03:32:58,575][26022] Updated weights on worker 0-0, policy_version 69128 (0.00094) [2022-07-09 03:33:00,622][26022] Updated weights on worker 0-0, policy_version 69138 (0.00089) [2022-07-09 03:33:02,756][26022] Updated weights on worker 0-0, policy_version 69148 (0.00091) [2022-07-09 03:33:03,392][25689] Fps is (10 sec: 5572.3, 60 sec: 5767.7, 300 sec: 5773.9). Total num frames: 70810624. Throughput: 0: 6023.4. Samples: 70811224. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:03,393][25689] Avg episode reward: [(0, '-53.583')] [2022-07-09 03:33:04,536][26022] Updated weights on worker 0-0, policy_version 69158 (0.00088) [2022-07-09 03:33:06,210][26022] Updated weights on worker 0-0, policy_version 69168 (0.00082) [2022-07-09 03:33:08,140][26022] Updated weights on worker 0-0, policy_version 69178 (0.00087) [2022-07-09 03:33:08,451][25689] Fps is (10 sec: 5350.4, 60 sec: 5765.5, 300 sec: 5773.0). Total num frames: 70839296. Throughput: 0: 5901.9. Samples: 70843358. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:08,452][25689] Avg episode reward: [(0, '-53.224')] [2022-07-09 03:33:09,901][26022] Updated weights on worker 0-0, policy_version 69188 (0.00093) [2022-07-09 03:33:11,790][26022] Updated weights on worker 0-0, policy_version 69198 (0.00086) [2022-07-09 03:33:13,476][25689] Fps is (10 sec: 5686.6, 60 sec: 5747.9, 300 sec: 5765.8). Total num frames: 70867968. Throughput: 0: 5879.3. Samples: 70877772. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:13,477][25689] Avg episode reward: [(0, '-53.746')] [2022-07-09 03:33:13,548][26022] Updated weights on worker 0-0, policy_version 69208 (0.00087) [2022-07-09 03:33:15,326][26022] Updated weights on worker 0-0, policy_version 69218 (0.00085) [2022-07-09 03:33:17,195][26022] Updated weights on worker 0-0, policy_version 69228 (0.00090) [2022-07-09 03:33:18,497][25689] Fps is (10 sec: 5810.3, 60 sec: 5747.3, 300 sec: 5772.7). Total num frames: 70897664. Throughput: 0: 5874.7. Samples: 70895234. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:18,497][25689] Avg episode reward: [(0, '-54.490')] [2022-07-09 03:33:18,719][26022] Updated weights on worker 0-0, policy_version 69238 (0.00086) [2022-07-09 03:33:20,624][26022] Updated weights on worker 0-0, policy_version 69248 (0.00086) [2022-07-09 03:33:22,449][26022] Updated weights on worker 0-0, policy_version 69258 (0.00050) [2022-07-09 03:33:23,508][25689] Fps is (10 sec: 5818.3, 60 sec: 5748.5, 300 sec: 5771.2). Total num frames: 70926336. Throughput: 0: 5915.8. Samples: 70930224. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:23,508][25689] Avg episode reward: [(0, '-54.287')] [2022-07-09 03:33:23,976][26022] Updated weights on worker 0-0, policy_version 69268 (0.00084) [2022-07-09 03:33:26,040][26022] Updated weights on worker 0-0, policy_version 69278 (0.00099) [2022-07-09 03:33:27,551][26022] Updated weights on worker 0-0, policy_version 69288 (0.00083) [2022-07-09 03:33:28,547][25689] Fps is (10 sec: 5909.7, 60 sec: 5772.5, 300 sec: 5774.0). Total num frames: 70957056. Throughput: 0: 6054.4. Samples: 70965024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:28,547][25689] Avg episode reward: [(0, '-54.740')] [2022-07-09 03:33:29,571][26022] Updated weights on worker 0-0, policy_version 69298 (0.00089) [2022-07-09 03:33:31,074][26022] Updated weights on worker 0-0, policy_version 69308 (0.00089) [2022-07-09 03:33:32,986][26022] Updated weights on worker 0-0, policy_version 69318 (0.00082) [2022-07-09 03:33:33,564][25689] Fps is (10 sec: 5804.4, 60 sec: 5758.6, 300 sec: 5774.1). Total num frames: 70984704. Throughput: 0: 5206.0. Samples: 70982348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:33,564][25689] Avg episode reward: [(0, '-54.926')] [2022-07-09 03:33:34,659][26022] Updated weights on worker 0-0, policy_version 69328 (0.00086) [2022-07-09 03:33:36,693][26022] Updated weights on worker 0-0, policy_version 69338 (0.00088) [2022-07-09 03:33:38,164][26022] Updated weights on worker 0-0, policy_version 69348 (0.00087) [2022-07-09 03:33:38,566][25689] Fps is (10 sec: 5723.7, 60 sec: 5749.1, 300 sec: 5777.8). Total num frames: 71014400. Throughput: 0: 6075.2. Samples: 71017156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:38,567][25689] Avg episode reward: [(0, '-55.071')] [2022-07-09 03:33:40,052][26022] Updated weights on worker 0-0, policy_version 69358 (0.00085) [2022-07-09 03:33:41,911][26022] Updated weights on worker 0-0, policy_version 69368 (0.00092) [2022-07-09 03:33:43,509][26022] Updated weights on worker 0-0, policy_version 69378 (0.00089) [2022-07-09 03:33:43,580][25689] Fps is (10 sec: 5827.1, 60 sec: 5755.9, 300 sec: 5772.9). Total num frames: 71043072. Throughput: 0: 6068.9. Samples: 71052042. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:43,582][25689] Avg episode reward: [(0, '-54.838')] [2022-07-09 03:33:45,365][26022] Updated weights on worker 0-0, policy_version 69388 (0.00083) [2022-07-09 03:33:47,211][26022] Updated weights on worker 0-0, policy_version 69398 (0.00101) [2022-07-09 03:33:48,668][25689] Fps is (10 sec: 5777.7, 60 sec: 5725.3, 300 sec: 5776.1). Total num frames: 71072768. Throughput: 0: 5186.4. Samples: 71069380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:48,668][25689] Avg episode reward: [(0, '-54.353')] [2022-07-09 03:33:48,741][26022] Updated weights on worker 0-0, policy_version 69408 (0.00094) [2022-07-09 03:33:50,647][26022] Updated weights on worker 0-0, policy_version 69418 (0.00083) [2022-07-09 03:33:52,241][26022] Updated weights on worker 0-0, policy_version 69428 (0.00052) [2022-07-09 03:33:53,684][25689] Fps is (10 sec: 5675.5, 60 sec: 5730.4, 300 sec: 5769.0). Total num frames: 71100416. Throughput: 0: 6073.8. Samples: 71104556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:53,685][25689] Avg episode reward: [(0, '-53.987')] [2022-07-09 03:33:54,142][26022] Updated weights on worker 0-0, policy_version 69438 (0.00089) [2022-07-09 03:33:55,828][26022] Updated weights on worker 0-0, policy_version 69448 (0.00086) [2022-07-09 03:33:57,609][26022] Updated weights on worker 0-0, policy_version 69458 (0.00087) [2022-07-09 03:33:58,706][25689] Fps is (10 sec: 5712.6, 60 sec: 5713.5, 300 sec: 5765.5). Total num frames: 71130112. Throughput: 0: 6083.8. Samples: 71139686. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:33:58,708][25689] Avg episode reward: [(0, '-53.879')] [2022-07-09 03:33:59,427][26022] Updated weights on worker 0-0, policy_version 69468 (0.00079) [2022-07-09 03:34:00,986][26022] Updated weights on worker 0-0, policy_version 69478 (0.00087) [2022-07-09 03:34:03,275][26022] Updated weights on worker 0-0, policy_version 69488 (0.00086) [2022-07-09 03:34:03,751][25689] Fps is (10 sec: 5798.2, 60 sec: 5768.2, 300 sec: 5775.9). Total num frames: 71158784. Throughput: 0: 5212.1. Samples: 71157172. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:03,753][25689] Avg episode reward: [(0, '-54.277')] [2022-07-09 03:34:05,178][26022] Updated weights on worker 0-0, policy_version 69498 (0.00088) [2022-07-09 03:34:06,735][26022] Updated weights on worker 0-0, policy_version 69508 (0.00088) [2022-07-09 03:34:08,507][26022] Updated weights on worker 0-0, policy_version 69518 (0.00090) [2022-07-09 03:34:08,810][25689] Fps is (10 sec: 5675.6, 60 sec: 5768.2, 300 sec: 5771.5). Total num frames: 71187456. Throughput: 0: 5988.4. Samples: 71189996. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:08,810][25689] Avg episode reward: [(0, '-53.673')] [2022-07-09 03:34:10,265][26022] Updated weights on worker 0-0, policy_version 69528 (0.00094) [2022-07-09 03:34:12,100][26022] Updated weights on worker 0-0, policy_version 69538 (0.00090) [2022-07-09 03:34:13,826][25689] Fps is (10 sec: 5691.4, 60 sec: 5769.0, 300 sec: 5768.6). Total num frames: 71216128. Throughput: 0: 5968.9. Samples: 71224782. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:13,827][25689] Avg episode reward: [(0, '-54.160')] [2022-07-09 03:34:13,858][26022] Updated weights on worker 0-0, policy_version 69548 (0.00082) [2022-07-09 03:34:15,656][26022] Updated weights on worker 0-0, policy_version 69558 (0.00090) [2022-07-09 03:34:17,218][26022] Updated weights on worker 0-0, policy_version 69568 (0.00082) [2022-07-09 03:34:18,834][25689] Fps is (10 sec: 5822.8, 60 sec: 5770.3, 300 sec: 5776.1). Total num frames: 71245824. Throughput: 0: 5103.4. Samples: 71242406. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:18,834][25689] Avg episode reward: [(0, '-54.836')] [2022-07-09 03:34:19,244][26022] Updated weights on worker 0-0, policy_version 69578 (0.00087) [2022-07-09 03:34:20,762][26022] Updated weights on worker 0-0, policy_version 69588 (0.00084) [2022-07-09 03:34:22,186][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:34:22,206][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000069595_71265280.pth [2022-07-09 03:34:22,206][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000067565_69186560.pth [2022-07-09 03:34:22,770][26022] Updated weights on worker 0-0, policy_version 69598 (0.00088) [2022-07-09 03:34:23,853][25689] Fps is (10 sec: 5923.5, 60 sec: 5786.5, 300 sec: 5777.8). Total num frames: 71275520. Throughput: 0: 5975.1. Samples: 71277284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:23,853][25689] Avg episode reward: [(0, '-55.333')] [2022-07-09 03:34:24,344][26022] Updated weights on worker 0-0, policy_version 69608 (0.00090) [2022-07-09 03:34:26,268][26022] Updated weights on worker 0-0, policy_version 69618 (0.00088) [2022-07-09 03:34:27,922][26022] Updated weights on worker 0-0, policy_version 69628 (0.00089) [2022-07-09 03:34:28,887][25689] Fps is (10 sec: 5806.1, 60 sec: 5753.0, 300 sec: 5774.0). Total num frames: 71304192. Throughput: 0: 6084.1. Samples: 71312146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:28,887][25689] Avg episode reward: [(0, '-55.116')] [2022-07-09 03:34:29,868][26022] Updated weights on worker 0-0, policy_version 69638 (0.00084) [2022-07-09 03:34:31,368][26022] Updated weights on worker 0-0, policy_version 69648 (0.00086) [2022-07-09 03:34:33,315][26022] Updated weights on worker 0-0, policy_version 69658 (0.00083) [2022-07-09 03:34:33,892][25689] Fps is (10 sec: 5610.1, 60 sec: 5754.1, 300 sec: 5770.6). Total num frames: 71331840. Throughput: 0: 5215.7. Samples: 71329438. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 03:34:33,892][25689] Avg episode reward: [(0, '-54.380')] [2022-07-09 03:34:34,741][26022] Updated weights on worker 0-0, policy_version 69668 (0.00091) [2022-07-09 03:34:36,953][26022] Updated weights on worker 0-0, policy_version 69678 (0.00086) [2022-07-09 03:34:38,633][26022] Updated weights on worker 0-0, policy_version 69688 (0.00092) [2022-07-09 03:34:38,911][25689] Fps is (10 sec: 5720.3, 60 sec: 5752.5, 300 sec: 5770.7). Total num frames: 71361536. Throughput: 0: 6062.6. Samples: 71364128. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:34:38,912][25689] Avg episode reward: [(0, '-54.831')] [2022-07-09 03:34:40,392][26022] Updated weights on worker 0-0, policy_version 69698 (0.00537) [2022-07-09 03:34:42,094][26022] Updated weights on worker 0-0, policy_version 69708 (0.00089) [2022-07-09 03:34:43,911][26022] Updated weights on worker 0-0, policy_version 69718 (0.00084) [2022-07-09 03:34:43,940][25689] Fps is (10 sec: 5910.6, 60 sec: 5768.1, 300 sec: 5771.5). Total num frames: 71391232. Throughput: 0: 6068.0. Samples: 71399176. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:34:43,941][25689] Avg episode reward: [(0, '-54.387')] [2022-07-09 03:34:45,700][26022] Updated weights on worker 0-0, policy_version 69728 (0.00092) [2022-07-09 03:34:47,475][26022] Updated weights on worker 0-0, policy_version 69738 (0.00085) [2022-07-09 03:34:49,025][25689] Fps is (10 sec: 5872.4, 60 sec: 5768.3, 300 sec: 5773.7). Total num frames: 71420928. Throughput: 0: 5190.2. Samples: 71416668. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:34:49,026][25689] Avg episode reward: [(0, '-54.209')] [2022-07-09 03:34:49,125][26022] Updated weights on worker 0-0, policy_version 69748 (0.00080) [2022-07-09 03:34:50,957][26022] Updated weights on worker 0-0, policy_version 69758 (0.00083) [2022-07-09 03:34:52,766][26022] Updated weights on worker 0-0, policy_version 69768 (0.00091) [2022-07-09 03:34:54,039][25689] Fps is (10 sec: 5779.7, 60 sec: 5785.5, 300 sec: 5770.3). Total num frames: 71449600. Throughput: 0: 6049.6. Samples: 71451320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:34:54,040][25689] Avg episode reward: [(0, '-55.241')] [2022-07-09 03:34:54,566][26022] Updated weights on worker 0-0, policy_version 69778 (0.00085) [2022-07-09 03:34:56,182][26022] Updated weights on worker 0-0, policy_version 69788 (0.00084) [2022-07-09 03:34:58,095][26022] Updated weights on worker 0-0, policy_version 69798 (0.00086) [2022-07-09 03:34:59,068][25689] Fps is (10 sec: 5710.0, 60 sec: 5767.9, 300 sec: 5770.1). Total num frames: 71478272. Throughput: 0: 6063.1. Samples: 71486338. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:34:59,068][25689] Avg episode reward: [(0, '-55.045')] [2022-07-09 03:34:59,695][26022] Updated weights on worker 0-0, policy_version 69808 (0.00090) [2022-07-09 03:35:01,899][26022] Updated weights on worker 0-0, policy_version 69818 (0.00108) [2022-07-09 03:35:03,571][26022] Updated weights on worker 0-0, policy_version 69828 (0.00094) [2022-07-09 03:35:04,079][25689] Fps is (10 sec: 5609.8, 60 sec: 5754.1, 300 sec: 5772.6). Total num frames: 71505920. Throughput: 0: 5140.7. Samples: 71502702. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:04,079][25689] Avg episode reward: [(0, '-55.343')] [2022-07-09 03:35:05,465][26022] Updated weights on worker 0-0, policy_version 69838 (0.00095) [2022-07-09 03:35:07,186][26022] Updated weights on worker 0-0, policy_version 69848 (0.00083) [2022-07-09 03:35:08,980][26022] Updated weights on worker 0-0, policy_version 69858 (0.00093) [2022-07-09 03:35:09,169][25689] Fps is (10 sec: 5677.0, 60 sec: 5768.1, 300 sec: 5768.0). Total num frames: 71535616. Throughput: 0: 5959.5. Samples: 71536716. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:09,169][25689] Avg episode reward: [(0, '-56.070')] [2022-07-09 03:35:10,633][26022] Updated weights on worker 0-0, policy_version 69868 (0.00091) [2022-07-09 03:35:12,537][26022] Updated weights on worker 0-0, policy_version 69878 (0.00083) [2022-07-09 03:35:14,179][25689] Fps is (10 sec: 5779.1, 60 sec: 5768.8, 300 sec: 5768.7). Total num frames: 71564288. Throughput: 0: 5960.6. Samples: 71571364. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:14,180][25689] Avg episode reward: [(0, '-55.878')] [2022-07-09 03:35:14,251][26022] Updated weights on worker 0-0, policy_version 69888 (0.00092) [2022-07-09 03:35:15,996][26022] Updated weights on worker 0-0, policy_version 69898 (0.00096) [2022-07-09 03:35:17,854][26022] Updated weights on worker 0-0, policy_version 69908 (0.00088) [2022-07-09 03:35:19,249][25689] Fps is (10 sec: 5790.4, 60 sec: 5762.8, 300 sec: 5767.5). Total num frames: 71593984. Throughput: 0: 5081.5. Samples: 71588888. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:19,250][25689] Avg episode reward: [(0, '-56.005')] [2022-07-09 03:35:19,547][26022] Updated weights on worker 0-0, policy_version 69918 (0.00085) [2022-07-09 03:35:21,578][26022] Updated weights on worker 0-0, policy_version 69928 (0.00084) [2022-07-09 03:35:22,916][26022] Updated weights on worker 0-0, policy_version 69938 (0.00083) [2022-07-09 03:35:24,322][25689] Fps is (10 sec: 5754.0, 60 sec: 5740.6, 300 sec: 5763.4). Total num frames: 71622656. Throughput: 0: 5972.5. Samples: 71623608. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:24,323][25689] Avg episode reward: [(0, '-55.403')] [2022-07-09 03:35:25,007][26022] Updated weights on worker 0-0, policy_version 69948 (0.00088) [2022-07-09 03:35:26,561][26022] Updated weights on worker 0-0, policy_version 69958 (0.00087) [2022-07-09 03:35:28,553][26022] Updated weights on worker 0-0, policy_version 69968 (0.00086) [2022-07-09 03:35:29,357][25689] Fps is (10 sec: 5875.6, 60 sec: 5774.5, 300 sec: 5773.5). Total num frames: 71653376. Throughput: 0: 6020.7. Samples: 71658264. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:29,358][25689] Avg episode reward: [(0, '-56.078')] [2022-07-09 03:35:30,216][26022] Updated weights on worker 0-0, policy_version 69978 (0.00092) [2022-07-09 03:35:31,828][26022] Updated weights on worker 0-0, policy_version 69988 (0.00089) [2022-07-09 03:35:33,763][26022] Updated weights on worker 0-0, policy_version 69998 (0.00084) [2022-07-09 03:35:34,452][25689] Fps is (10 sec: 5762.0, 60 sec: 5765.9, 300 sec: 5768.2). Total num frames: 71681024. Throughput: 0: 5151.7. Samples: 71675808. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:34,453][25689] Avg episode reward: [(0, '-56.620')] [2022-07-09 03:35:35,436][26022] Updated weights on worker 0-0, policy_version 70008 (0.00091) [2022-07-09 03:35:37,302][26022] Updated weights on worker 0-0, policy_version 70018 (0.00114) [2022-07-09 03:35:39,095][26022] Updated weights on worker 0-0, policy_version 70028 (0.00090) [2022-07-09 03:35:39,459][25689] Fps is (10 sec: 5676.8, 60 sec: 5767.1, 300 sec: 5768.5). Total num frames: 71710720. Throughput: 0: 6023.1. Samples: 71710612. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:39,459][25689] Avg episode reward: [(0, '-55.965')] [2022-07-09 03:35:40,627][26022] Updated weights on worker 0-0, policy_version 70038 (0.00087) [2022-07-09 03:35:42,535][26022] Updated weights on worker 0-0, policy_version 70048 (0.00100) [2022-07-09 03:35:44,453][26022] Updated weights on worker 0-0, policy_version 70058 (0.00083) [2022-07-09 03:35:44,504][25689] Fps is (10 sec: 5908.9, 60 sec: 5765.6, 300 sec: 5766.6). Total num frames: 71740416. Throughput: 0: 6037.5. Samples: 71745452. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:44,504][25689] Avg episode reward: [(0, '-55.723')] [2022-07-09 03:35:46,007][26022] Updated weights on worker 0-0, policy_version 70068 (0.00087) [2022-07-09 03:35:47,963][26022] Updated weights on worker 0-0, policy_version 70078 (0.00108) [2022-07-09 03:35:49,555][26022] Updated weights on worker 0-0, policy_version 70088 (0.00084) [2022-07-09 03:35:49,578][25689] Fps is (10 sec: 5869.2, 60 sec: 5766.6, 300 sec: 5766.8). Total num frames: 71770112. Throughput: 0: 5172.0. Samples: 71762846. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:49,578][25689] Avg episode reward: [(0, '-55.626')] [2022-07-09 03:35:51,385][26022] Updated weights on worker 0-0, policy_version 70098 (0.00087) [2022-07-09 03:35:53,106][26022] Updated weights on worker 0-0, policy_version 70108 (0.00085) [2022-07-09 03:35:54,582][25689] Fps is (10 sec: 5791.4, 60 sec: 5767.6, 300 sec: 5767.1). Total num frames: 71798784. Throughput: 0: 6072.4. Samples: 71798044. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:54,582][25689] Avg episode reward: [(0, '-55.013')] [2022-07-09 03:35:54,872][26022] Updated weights on worker 0-0, policy_version 70118 (0.00098) [2022-07-09 03:35:56,560][26022] Updated weights on worker 0-0, policy_version 70128 (0.00087) [2022-07-09 03:35:58,451][26022] Updated weights on worker 0-0, policy_version 70138 (0.00087) [2022-07-09 03:35:59,611][25689] Fps is (10 sec: 5715.4, 60 sec: 5767.5, 300 sec: 5771.3). Total num frames: 71827456. Throughput: 0: 6069.4. Samples: 71832926. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 03:35:59,612][25689] Avg episode reward: [(0, '-54.133')] [2022-07-09 03:36:00,152][26022] Updated weights on worker 0-0, policy_version 70148 (0.00085) [2022-07-09 03:36:01,924][26022] Updated weights on worker 0-0, policy_version 70158 (0.00090) [2022-07-09 03:36:04,003][26022] Updated weights on worker 0-0, policy_version 70168 (0.00086) [2022-07-09 03:36:04,621][25689] Fps is (10 sec: 5406.2, 60 sec: 5733.8, 300 sec: 5758.9). Total num frames: 71853056. Throughput: 0: 5211.0. Samples: 71850282. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:04,621][25689] Avg episode reward: [(0, '-54.166')] [2022-07-09 03:36:05,763][26022] Updated weights on worker 0-0, policy_version 70178 (0.00086) [2022-07-09 03:36:07,776][26022] Updated weights on worker 0-0, policy_version 70188 (0.00086) [2022-07-09 03:36:09,398][26022] Updated weights on worker 0-0, policy_version 70198 (0.00116) [2022-07-09 03:36:09,670][25689] Fps is (10 sec: 5700.8, 60 sec: 5771.5, 300 sec: 5770.2). Total num frames: 71884800. Throughput: 0: 5972.1. Samples: 71882838. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:09,670][25689] Avg episode reward: [(0, '-54.245')] [2022-07-09 03:36:11,063][26022] Updated weights on worker 0-0, policy_version 70208 (0.00091) [2022-07-09 03:36:13,114][26022] Updated weights on worker 0-0, policy_version 70218 (0.00086) [2022-07-09 03:36:14,605][26022] Updated weights on worker 0-0, policy_version 70228 (0.00083) [2022-07-09 03:36:14,769][25689] Fps is (10 sec: 5953.3, 60 sec: 5763.0, 300 sec: 5761.9). Total num frames: 71913472. Throughput: 0: 5908.0. Samples: 71917310. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:14,770][25689] Avg episode reward: [(0, '-54.355')] [2022-07-09 03:36:16,572][26022] Updated weights on worker 0-0, policy_version 70238 (0.00091) [2022-07-09 03:36:18,154][26022] Updated weights on worker 0-0, policy_version 70248 (0.00077) [2022-07-09 03:36:19,783][25689] Fps is (10 sec: 5670.2, 60 sec: 5751.4, 300 sec: 5759.2). Total num frames: 71942144. Throughput: 0: 5052.1. Samples: 71934838. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:19,784][25689] Avg episode reward: [(0, '-54.499')] [2022-07-09 03:36:20,176][26022] Updated weights on worker 0-0, policy_version 70258 (0.00080) [2022-07-09 03:36:21,699][26022] Updated weights on worker 0-0, policy_version 70268 (0.00083) [2022-07-09 03:36:22,257][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:36:22,278][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000070270_71956480.pth [2022-07-09 03:36:22,279][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000068242_69879808.pth [2022-07-09 03:36:23,538][26022] Updated weights on worker 0-0, policy_version 70278 (0.00112) [2022-07-09 03:36:24,789][25689] Fps is (10 sec: 5825.2, 60 sec: 5774.8, 300 sec: 5765.2). Total num frames: 71971840. Throughput: 0: 5926.7. Samples: 71969812. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:24,789][25689] Avg episode reward: [(0, '-54.812')] [2022-07-09 03:36:25,472][26022] Updated weights on worker 0-0, policy_version 70288 (0.00093) [2022-07-09 03:36:27,055][26022] Updated weights on worker 0-0, policy_version 70298 (0.00092) [2022-07-09 03:36:29,099][26022] Updated weights on worker 0-0, policy_version 70308 (0.00095) [2022-07-09 03:36:29,945][25689] Fps is (10 sec: 5642.9, 60 sec: 5712.5, 300 sec: 5756.2). Total num frames: 71999488. Throughput: 0: 5995.1. Samples: 72004390. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:29,946][25689] Avg episode reward: [(0, '-54.964')] [2022-07-09 03:36:30,580][26022] Updated weights on worker 0-0, policy_version 70318 (0.00083) [2022-07-09 03:36:32,653][26022] Updated weights on worker 0-0, policy_version 70328 (0.00093) [2022-07-09 03:36:34,042][26022] Updated weights on worker 0-0, policy_version 70338 (0.00088) [2022-07-09 03:36:34,948][25689] Fps is (10 sec: 5846.3, 60 sec: 5789.0, 300 sec: 5764.8). Total num frames: 72031232. Throughput: 0: 6039.3. Samples: 72039176. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:34,948][25689] Avg episode reward: [(0, '-54.457')] [2022-07-09 03:36:36,067][26022] Updated weights on worker 0-0, policy_version 70348 (0.00093) [2022-07-09 03:36:37,842][26022] Updated weights on worker 0-0, policy_version 70358 (0.00087) [2022-07-09 03:36:39,556][26022] Updated weights on worker 0-0, policy_version 70368 (0.00088) [2022-07-09 03:36:40,008][25689] Fps is (10 sec: 5902.1, 60 sec: 5750.0, 300 sec: 5758.0). Total num frames: 72058880. Throughput: 0: 6016.7. Samples: 72056526. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:40,009][25689] Avg episode reward: [(0, '-54.414')] [2022-07-09 03:36:41,426][26022] Updated weights on worker 0-0, policy_version 70378 (0.00082) [2022-07-09 03:36:43,058][26022] Updated weights on worker 0-0, policy_version 70388 (0.00087) [2022-07-09 03:36:44,917][26022] Updated weights on worker 0-0, policy_version 70398 (0.00088) [2022-07-09 03:36:45,048][25689] Fps is (10 sec: 5779.1, 60 sec: 5767.4, 300 sec: 5769.4). Total num frames: 72089600. Throughput: 0: 5986.1. Samples: 72091082. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:45,048][25689] Avg episode reward: [(0, '-54.259')] [2022-07-09 03:36:46,911][26022] Updated weights on worker 0-0, policy_version 70408 (0.00083) [2022-07-09 03:36:48,194][26022] Updated weights on worker 0-0, policy_version 70418 (0.00089) [2022-07-09 03:36:50,177][25689] Fps is (10 sec: 5639.3, 60 sec: 5711.5, 300 sec: 5757.6). Total num frames: 72116224. Throughput: 0: 6002.1. Samples: 72125822. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:50,177][25689] Avg episode reward: [(0, '-55.138')] [2022-07-09 03:36:50,500][26022] Updated weights on worker 0-0, policy_version 70428 (0.00086) [2022-07-09 03:36:51,864][26022] Updated weights on worker 0-0, policy_version 70438 (0.00103) [2022-07-09 03:36:53,768][26022] Updated weights on worker 0-0, policy_version 70448 (0.00080) [2022-07-09 03:36:55,266][25689] Fps is (10 sec: 5712.0, 60 sec: 5754.1, 300 sec: 5759.8). Total num frames: 72147968. Throughput: 0: 5136.0. Samples: 72143530. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:36:55,267][25689] Avg episode reward: [(0, '-54.625')] [2022-07-09 03:36:55,476][26022] Updated weights on worker 0-0, policy_version 70458 (0.00096) [2022-07-09 03:36:57,181][26022] Updated weights on worker 0-0, policy_version 70468 (0.00086) [2022-07-09 03:36:59,103][26022] Updated weights on worker 0-0, policy_version 70478 (0.00090) [2022-07-09 03:37:00,327][25689] Fps is (10 sec: 5851.3, 60 sec: 5734.2, 300 sec: 5762.4). Total num frames: 72175616. Throughput: 0: 5989.7. Samples: 72178230. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:37:00,329][25689] Avg episode reward: [(0, '-54.706')] [2022-07-09 03:37:00,747][26022] Updated weights on worker 0-0, policy_version 70488 (0.00088) [2022-07-09 03:37:03,094][26022] Updated weights on worker 0-0, policy_version 70498 (0.00084) [2022-07-09 03:37:04,601][26022] Updated weights on worker 0-0, policy_version 70508 (0.00567) [2022-07-09 03:37:05,335][25689] Fps is (10 sec: 5491.8, 60 sec: 5768.2, 300 sec: 5759.5). Total num frames: 72203264. Throughput: 0: 5913.1. Samples: 72211042. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:37:05,335][25689] Avg episode reward: [(0, '-54.799')] [2022-07-09 03:37:06,544][26022] Updated weights on worker 0-0, policy_version 70518 (0.00088) [2022-07-09 03:37:08,228][26022] Updated weights on worker 0-0, policy_version 70528 (0.00088) [2022-07-09 03:37:10,275][26022] Updated weights on worker 0-0, policy_version 70538 (0.00103) [2022-07-09 03:37:10,443][25689] Fps is (10 sec: 5567.3, 60 sec: 5712.0, 300 sec: 5754.3). Total num frames: 72231936. Throughput: 0: 5054.9. Samples: 72228266. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:37:10,445][25689] Avg episode reward: [(0, '-55.156')] [2022-07-09 03:37:11,747][26022] Updated weights on worker 0-0, policy_version 70548 (0.00080) [2022-07-09 03:37:13,603][26022] Updated weights on worker 0-0, policy_version 70558 (0.00085) [2022-07-09 03:37:15,250][26022] Updated weights on worker 0-0, policy_version 70568 (0.00096) [2022-07-09 03:37:15,461][25689] Fps is (10 sec: 5865.0, 60 sec: 5753.4, 300 sec: 5757.7). Total num frames: 72262656. Throughput: 0: 5908.2. Samples: 72262848. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:37:15,467][25689] Avg episode reward: [(0, '-54.979')] [2022-07-09 03:37:17,268][26022] Updated weights on worker 0-0, policy_version 70578 (0.00087) [2022-07-09 03:37:18,805][26022] Updated weights on worker 0-0, policy_version 70588 (0.00086) [2022-07-09 03:37:20,539][25689] Fps is (10 sec: 5883.1, 60 sec: 5747.4, 300 sec: 5756.6). Total num frames: 72291328. Throughput: 0: 5916.4. Samples: 72297810. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:37:20,539][25689] Avg episode reward: [(0, '-54.556')] [2022-07-09 03:37:20,626][26022] Updated weights on worker 0-0, policy_version 70598 (0.00087) [2022-07-09 03:37:22,424][26022] Updated weights on worker 0-0, policy_version 70608 (0.00089) [2022-07-09 03:37:24,235][26022] Updated weights on worker 0-0, policy_version 70618 (0.00107) [2022-07-09 03:37:25,547][25689] Fps is (10 sec: 5685.8, 60 sec: 5730.3, 300 sec: 5755.2). Total num frames: 72320000. Throughput: 0: 5154.0. Samples: 72315214. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 03:37:25,547][25689] Avg episode reward: [(0, '-53.792')] [2022-07-09 03:37:25,984][26022] Updated weights on worker 0-0, policy_version 70628 (0.00090) [2022-07-09 03:37:27,774][26022] Updated weights on worker 0-0, policy_version 70638 (0.00085) [2022-07-09 03:37:29,397][26022] Updated weights on worker 0-0, policy_version 70648 (0.00091) [2022-07-09 03:37:30,658][25689] Fps is (10 sec: 5767.8, 60 sec: 5768.3, 300 sec: 5757.5). Total num frames: 72349696. Throughput: 0: 6009.1. Samples: 72349740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:37:30,659][25689] Avg episode reward: [(0, '-53.391')] [2022-07-09 03:37:31,309][26022] Updated weights on worker 0-0, policy_version 70658 (0.00091) [2022-07-09 03:37:33,279][26022] Updated weights on worker 0-0, policy_version 70668 (0.00094) [2022-07-09 03:37:34,797][26022] Updated weights on worker 0-0, policy_version 70678 (0.00089) [2022-07-09 03:37:35,750][25689] Fps is (10 sec: 5720.3, 60 sec: 5709.2, 300 sec: 5750.3). Total num frames: 72378368. Throughput: 0: 5989.9. Samples: 72384378. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:37:35,752][25689] Avg episode reward: [(0, '-52.956')] [2022-07-09 03:37:36,824][26022] Updated weights on worker 0-0, policy_version 70688 (0.00090) [2022-07-09 03:37:38,115][26022] Updated weights on worker 0-0, policy_version 70698 (0.00080) [2022-07-09 03:37:40,292][26022] Updated weights on worker 0-0, policy_version 70708 (0.00094) [2022-07-09 03:37:40,772][25689] Fps is (10 sec: 5771.4, 60 sec: 5746.6, 300 sec: 5755.0). Total num frames: 72408064. Throughput: 0: 5125.9. Samples: 72401522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:37:40,772][25689] Avg episode reward: [(0, '-52.916')] [2022-07-09 03:37:42,004][26022] Updated weights on worker 0-0, policy_version 70718 (0.00088) [2022-07-09 03:37:43,655][26022] Updated weights on worker 0-0, policy_version 70728 (0.00086) [2022-07-09 03:37:45,632][26022] Updated weights on worker 0-0, policy_version 70738 (0.00101) [2022-07-09 03:37:45,776][25689] Fps is (10 sec: 5821.9, 60 sec: 5716.2, 300 sec: 5747.0). Total num frames: 72436736. Throughput: 0: 5982.0. Samples: 72436226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:37:45,776][25689] Avg episode reward: [(0, '-52.681')] [2022-07-09 03:37:47,226][26022] Updated weights on worker 0-0, policy_version 70748 (0.00089) [2022-07-09 03:37:49,291][26022] Updated weights on worker 0-0, policy_version 70758 (0.00083) [2022-07-09 03:37:50,836][25689] Fps is (10 sec: 5697.6, 60 sec: 5756.5, 300 sec: 5750.6). Total num frames: 72465408. Throughput: 0: 6008.0. Samples: 72470970. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:37:50,837][25689] Avg episode reward: [(0, '-53.527')] [2022-07-09 03:37:50,858][26022] Updated weights on worker 0-0, policy_version 70768 (0.00083) [2022-07-09 03:37:52,548][26022] Updated weights on worker 0-0, policy_version 70778 (0.00051) [2022-07-09 03:37:54,412][26022] Updated weights on worker 0-0, policy_version 70788 (0.00082) [2022-07-09 03:37:55,839][25689] Fps is (10 sec: 5902.0, 60 sec: 5747.8, 300 sec: 5751.0). Total num frames: 72496128. Throughput: 0: 5182.6. Samples: 72488490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:37:55,839][25689] Avg episode reward: [(0, '-54.573')] [2022-07-09 03:37:56,080][26022] Updated weights on worker 0-0, policy_version 70798 (0.00096) [2022-07-09 03:37:57,999][26022] Updated weights on worker 0-0, policy_version 70808 (0.00089) [2022-07-09 03:37:59,610][26022] Updated weights on worker 0-0, policy_version 70818 (0.00098) [2022-07-09 03:38:00,856][25689] Fps is (10 sec: 5825.2, 60 sec: 5751.9, 300 sec: 5759.2). Total num frames: 72523776. Throughput: 0: 6073.7. Samples: 72523510. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:00,857][25689] Avg episode reward: [(0, '-55.163')] [2022-07-09 03:38:01,339][26022] Updated weights on worker 0-0, policy_version 70828 (0.00098) [2022-07-09 03:38:03,592][26022] Updated weights on worker 0-0, policy_version 70838 (0.00090) [2022-07-09 03:38:05,322][26022] Updated weights on worker 0-0, policy_version 70848 (0.00886) [2022-07-09 03:38:05,862][25689] Fps is (10 sec: 5414.8, 60 sec: 5735.2, 300 sec: 5753.3). Total num frames: 72550400. Throughput: 0: 5991.8. Samples: 72556578. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:05,862][25689] Avg episode reward: [(0, '-55.165')] [2022-07-09 03:38:07,091][26022] Updated weights on worker 0-0, policy_version 70858 (0.00089) [2022-07-09 03:38:09,029][26022] Updated weights on worker 0-0, policy_version 70868 (0.00077) [2022-07-09 03:38:10,523][26022] Updated weights on worker 0-0, policy_version 70878 (0.00085) [2022-07-09 03:38:10,989][25689] Fps is (10 sec: 5456.9, 60 sec: 5733.4, 300 sec: 5751.4). Total num frames: 72579072. Throughput: 0: 5090.1. Samples: 72573550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:10,990][25689] Avg episode reward: [(0, '-54.967')] [2022-07-09 03:38:12,598][26022] Updated weights on worker 0-0, policy_version 70888 (0.00090) [2022-07-09 03:38:14,435][26022] Updated weights on worker 0-0, policy_version 70898 (0.00096) [2022-07-09 03:38:15,955][26022] Updated weights on worker 0-0, policy_version 70908 (0.00092) [2022-07-09 03:38:16,061][25689] Fps is (10 sec: 5823.3, 60 sec: 5728.4, 300 sec: 5753.8). Total num frames: 72609792. Throughput: 0: 5923.4. Samples: 72608272. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:16,061][25689] Avg episode reward: [(0, '-54.507')] [2022-07-09 03:38:17,783][26022] Updated weights on worker 0-0, policy_version 70918 (0.00079) [2022-07-09 03:38:19,384][26022] Updated weights on worker 0-0, policy_version 70928 (0.00093) [2022-07-09 03:38:21,130][25689] Fps is (10 sec: 5957.8, 60 sec: 5746.0, 300 sec: 5756.1). Total num frames: 72639488. Throughput: 0: 5886.3. Samples: 72642848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:21,131][25689] Avg episode reward: [(0, '-55.116')] [2022-07-09 03:38:21,272][26022] Updated weights on worker 0-0, policy_version 70938 (0.00092) [2022-07-09 03:38:22,444][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:38:22,460][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000070944_72646656.pth [2022-07-09 03:38:22,461][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000068920_70574080.pth [2022-07-09 03:38:23,102][26022] Updated weights on worker 0-0, policy_version 70948 (0.00096) [2022-07-09 03:38:24,725][26022] Updated weights on worker 0-0, policy_version 70958 (0.00083) [2022-07-09 03:38:26,146][25689] Fps is (10 sec: 5686.1, 60 sec: 5728.4, 300 sec: 5746.3). Total num frames: 72667136. Throughput: 0: 5982.9. Samples: 72677936. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:26,146][25689] Avg episode reward: [(0, '-55.227')] [2022-07-09 03:38:26,534][26022] Updated weights on worker 0-0, policy_version 70968 (0.00088) [2022-07-09 03:38:28,303][26022] Updated weights on worker 0-0, policy_version 70978 (0.00086) [2022-07-09 03:38:30,042][26022] Updated weights on worker 0-0, policy_version 70988 (0.00093) [2022-07-09 03:38:31,262][25689] Fps is (10 sec: 5862.0, 60 sec: 5761.8, 300 sec: 5758.1). Total num frames: 72698880. Throughput: 0: 6020.0. Samples: 72695590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:31,262][25689] Avg episode reward: [(0, '-55.319')] [2022-07-09 03:38:31,814][26022] Updated weights on worker 0-0, policy_version 70998 (0.00089) [2022-07-09 03:38:33,548][26022] Updated weights on worker 0-0, policy_version 71008 (0.00100) [2022-07-09 03:38:35,335][26022] Updated weights on worker 0-0, policy_version 71018 (0.00081) [2022-07-09 03:38:36,288][25689] Fps is (10 sec: 5956.8, 60 sec: 5768.0, 300 sec: 5754.2). Total num frames: 72727552. Throughput: 0: 6058.2. Samples: 72730814. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:36,289][25689] Avg episode reward: [(0, '-55.153')] [2022-07-09 03:38:36,977][26022] Updated weights on worker 0-0, policy_version 71028 (0.00080) [2022-07-09 03:38:38,916][26022] Updated weights on worker 0-0, policy_version 71038 (0.00089) [2022-07-09 03:38:40,598][26022] Updated weights on worker 0-0, policy_version 71048 (0.00086) [2022-07-09 03:38:41,296][25689] Fps is (10 sec: 5816.8, 60 sec: 5769.3, 300 sec: 5757.8). Total num frames: 72757248. Throughput: 0: 6098.7. Samples: 72765836. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:41,297][25689] Avg episode reward: [(0, '-55.463')] [2022-07-09 03:38:42,389][26022] Updated weights on worker 0-0, policy_version 71058 (0.00083) [2022-07-09 03:38:44,159][26022] Updated weights on worker 0-0, policy_version 71068 (0.00096) [2022-07-09 03:38:45,944][26022] Updated weights on worker 0-0, policy_version 71078 (0.00093) [2022-07-09 03:38:46,339][25689] Fps is (10 sec: 5705.6, 60 sec: 5748.7, 300 sec: 5751.7). Total num frames: 72784896. Throughput: 0: 5214.0. Samples: 72783222. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:46,340][25689] Avg episode reward: [(0, '-54.837')] [2022-07-09 03:38:47,643][26022] Updated weights on worker 0-0, policy_version 71088 (0.00086) [2022-07-09 03:38:49,496][26022] Updated weights on worker 0-0, policy_version 71098 (0.00086) [2022-07-09 03:38:51,267][26022] Updated weights on worker 0-0, policy_version 71108 (0.00085) [2022-07-09 03:38:51,444][25689] Fps is (10 sec: 5751.7, 60 sec: 5778.3, 300 sec: 5760.3). Total num frames: 72815616. Throughput: 0: 6046.1. Samples: 72817616. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:38:51,446][25689] Avg episode reward: [(0, '-53.947')] [2022-07-09 03:38:53,359][26022] Updated weights on worker 0-0, policy_version 71118 (0.00726) [2022-07-09 03:38:54,680][26022] Updated weights on worker 0-0, policy_version 71128 (0.00083) [2022-07-09 03:38:56,481][25689] Fps is (10 sec: 5653.9, 60 sec: 5707.4, 300 sec: 5749.7). Total num frames: 72842240. Throughput: 0: 6002.1. Samples: 72852014. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:38:56,482][25689] Avg episode reward: [(0, '-53.254')] [2022-07-09 03:38:56,845][26022] Updated weights on worker 0-0, policy_version 71138 (0.00086) [2022-07-09 03:38:58,241][26022] Updated weights on worker 0-0, policy_version 71148 (0.00093) [2022-07-09 03:39:00,297][26022] Updated weights on worker 0-0, policy_version 71158 (0.00086) [2022-07-09 03:39:01,491][25689] Fps is (10 sec: 5809.8, 60 sec: 5775.7, 300 sec: 5760.7). Total num frames: 72873984. Throughput: 0: 5127.9. Samples: 72869392. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:01,491][25689] Avg episode reward: [(0, '-52.942')] [2022-07-09 03:39:02,151][26022] Updated weights on worker 0-0, policy_version 71168 (0.00087) [2022-07-09 03:39:04,135][26022] Updated weights on worker 0-0, policy_version 71178 (0.00090) [2022-07-09 03:39:05,962][26022] Updated weights on worker 0-0, policy_version 71188 (0.00966) [2022-07-09 03:39:06,582][25689] Fps is (10 sec: 5778.4, 60 sec: 5767.6, 300 sec: 5753.2). Total num frames: 72900608. Throughput: 0: 5875.4. Samples: 72902160. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:06,583][25689] Avg episode reward: [(0, '-52.710')] [2022-07-09 03:39:07,621][26022] Updated weights on worker 0-0, policy_version 71198 (0.00088) [2022-07-09 03:39:09,544][26022] Updated weights on worker 0-0, policy_version 71208 (0.00084) [2022-07-09 03:39:11,279][26022] Updated weights on worker 0-0, policy_version 71218 (0.00088) [2022-07-09 03:39:11,648][25689] Fps is (10 sec: 5544.9, 60 sec: 5790.3, 300 sec: 5755.7). Total num frames: 72930304. Throughput: 0: 5902.6. Samples: 72936870. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:11,648][25689] Avg episode reward: [(0, '-53.015')] [2022-07-09 03:39:12,755][26022] Updated weights on worker 0-0, policy_version 71228 (0.00081) [2022-07-09 03:39:14,687][26022] Updated weights on worker 0-0, policy_version 71238 (0.00090) [2022-07-09 03:39:16,652][25689] Fps is (10 sec: 5592.8, 60 sec: 5729.1, 300 sec: 5745.5). Total num frames: 72956928. Throughput: 0: 5081.2. Samples: 72954508. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:16,653][25689] Avg episode reward: [(0, '-52.837')] [2022-07-09 03:39:16,668][26022] Updated weights on worker 0-0, policy_version 71248 (0.00080) [2022-07-09 03:39:18,239][26022] Updated weights on worker 0-0, policy_version 71258 (0.00088) [2022-07-09 03:39:20,166][26022] Updated weights on worker 0-0, policy_version 71268 (0.00082) [2022-07-09 03:39:21,677][25689] Fps is (10 sec: 5717.9, 60 sec: 5750.3, 300 sec: 5748.8). Total num frames: 72987648. Throughput: 0: 5917.5. Samples: 72988842. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:21,677][25689] Avg episode reward: [(0, '-53.705')] [2022-07-09 03:39:21,771][26022] Updated weights on worker 0-0, policy_version 71278 (0.00084) [2022-07-09 03:39:23,542][26022] Updated weights on worker 0-0, policy_version 71288 (0.00087) [2022-07-09 03:39:25,193][26022] Updated weights on worker 0-0, policy_version 71298 (0.00086) [2022-07-09 03:39:26,681][25689] Fps is (10 sec: 5922.0, 60 sec: 5768.3, 300 sec: 5749.4). Total num frames: 73016320. Throughput: 0: 6050.7. Samples: 73023774. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:26,682][25689] Avg episode reward: [(0, '-54.044')] [2022-07-09 03:39:27,104][26022] Updated weights on worker 0-0, policy_version 71308 (0.00093) [2022-07-09 03:39:28,913][26022] Updated weights on worker 0-0, policy_version 71318 (0.00085) [2022-07-09 03:39:30,533][26022] Updated weights on worker 0-0, policy_version 71328 (0.00087) [2022-07-09 03:39:31,822][25689] Fps is (10 sec: 5652.3, 60 sec: 5715.2, 300 sec: 5750.2). Total num frames: 73044992. Throughput: 0: 5144.0. Samples: 73040646. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:31,823][25689] Avg episode reward: [(0, '-54.796')] [2022-07-09 03:39:32,683][26022] Updated weights on worker 0-0, policy_version 71338 (0.00089) [2022-07-09 03:39:34,294][26022] Updated weights on worker 0-0, policy_version 71348 (0.00085) [2022-07-09 03:39:36,067][26022] Updated weights on worker 0-0, policy_version 71358 (0.00087) [2022-07-09 03:39:36,901][25689] Fps is (10 sec: 5911.5, 60 sec: 5760.9, 300 sec: 5755.9). Total num frames: 73076736. Throughput: 0: 5984.1. Samples: 73075680. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:36,902][25689] Avg episode reward: [(0, '-54.251')] [2022-07-09 03:39:37,612][26022] Updated weights on worker 0-0, policy_version 71368 (0.00093) [2022-07-09 03:39:39,473][26022] Updated weights on worker 0-0, policy_version 71378 (0.00078) [2022-07-09 03:39:41,271][26022] Updated weights on worker 0-0, policy_version 71388 (0.00087) [2022-07-09 03:39:41,989][25689] Fps is (10 sec: 5942.4, 60 sec: 5736.4, 300 sec: 5751.3). Total num frames: 73105408. Throughput: 0: 6015.9. Samples: 73111038. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:41,990][25689] Avg episode reward: [(0, '-54.910')] [2022-07-09 03:39:43,083][26022] Updated weights on worker 0-0, policy_version 71398 (0.00084) [2022-07-09 03:39:44,563][26022] Updated weights on worker 0-0, policy_version 71408 (0.00085) [2022-07-09 03:39:46,482][26022] Updated weights on worker 0-0, policy_version 71418 (0.00083) [2022-07-09 03:39:47,070][25689] Fps is (10 sec: 5740.0, 60 sec: 5766.5, 300 sec: 5751.4). Total num frames: 73135104. Throughput: 0: 5138.8. Samples: 73128548. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:47,071][25689] Avg episode reward: [(0, '-54.263')] [2022-07-09 03:39:48,182][26022] Updated weights on worker 0-0, policy_version 71428 (0.00087) [2022-07-09 03:39:50,005][26022] Updated weights on worker 0-0, policy_version 71438 (0.00769) [2022-07-09 03:39:51,825][26022] Updated weights on worker 0-0, policy_version 71448 (0.00093) [2022-07-09 03:39:52,112][25689] Fps is (10 sec: 5766.1, 60 sec: 5738.8, 300 sec: 5750.9). Total num frames: 73163776. Throughput: 0: 6037.6. Samples: 73163146. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:52,112][25689] Avg episode reward: [(0, '-53.982')] [2022-07-09 03:39:53,472][26022] Updated weights on worker 0-0, policy_version 71458 (0.00089) [2022-07-09 03:39:55,401][26022] Updated weights on worker 0-0, policy_version 71468 (0.00085) [2022-07-09 03:39:57,085][26022] Updated weights on worker 0-0, policy_version 71478 (0.00088) [2022-07-09 03:39:57,184][25689] Fps is (10 sec: 5771.1, 60 sec: 5786.1, 300 sec: 5753.5). Total num frames: 73193472. Throughput: 0: 6030.0. Samples: 73197984. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:39:57,184][25689] Avg episode reward: [(0, '-53.690')] [2022-07-09 03:39:58,983][26022] Updated weights on worker 0-0, policy_version 71488 (0.00086) [2022-07-09 03:40:00,598][26022] Updated weights on worker 0-0, policy_version 71498 (0.00082) [2022-07-09 03:40:02,218][25689] Fps is (10 sec: 5572.9, 60 sec: 5699.5, 300 sec: 5749.6). Total num frames: 73220096. Throughput: 0: 5162.7. Samples: 73215472. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:40:02,218][25689] Avg episode reward: [(0, '-54.204')] [2022-07-09 03:40:02,936][26022] Updated weights on worker 0-0, policy_version 71508 (0.00093) [2022-07-09 03:40:04,383][26022] Updated weights on worker 0-0, policy_version 71518 (0.00088) [2022-07-09 03:40:06,405][26022] Updated weights on worker 0-0, policy_version 71528 (0.00087) [2022-07-09 03:40:07,255][25689] Fps is (10 sec: 5694.1, 60 sec: 5772.1, 300 sec: 5754.1). Total num frames: 73250816. Throughput: 0: 5923.1. Samples: 73248102. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:40:07,255][25689] Avg episode reward: [(0, '-53.918')] [2022-07-09 03:40:08,069][26022] Updated weights on worker 0-0, policy_version 71538 (0.00081) [2022-07-09 03:40:09,800][26022] Updated weights on worker 0-0, policy_version 71548 (0.00094) [2022-07-09 03:40:11,575][26022] Updated weights on worker 0-0, policy_version 71558 (0.00431) [2022-07-09 03:40:12,321][25689] Fps is (10 sec: 5878.7, 60 sec: 5755.2, 300 sec: 5753.0). Total num frames: 73279488. Throughput: 0: 5937.3. Samples: 73283132. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:40:12,321][25689] Avg episode reward: [(0, '-54.371')] [2022-07-09 03:40:13,302][26022] Updated weights on worker 0-0, policy_version 71568 (0.00082) [2022-07-09 03:40:14,956][26022] Updated weights on worker 0-0, policy_version 71578 (0.00087) [2022-07-09 03:40:16,964][26022] Updated weights on worker 0-0, policy_version 71588 (0.00084) [2022-07-09 03:40:17,380][25689] Fps is (10 sec: 5663.4, 60 sec: 5783.7, 300 sec: 5749.7). Total num frames: 73308160. Throughput: 0: 5948.7. Samples: 73318124. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 03:40:17,380][25689] Avg episode reward: [(0, '-54.914')] [2022-07-09 03:40:18,576][26022] Updated weights on worker 0-0, policy_version 71598 (0.00082) [2022-07-09 03:40:20,422][26022] Updated weights on worker 0-0, policy_version 71608 (0.00085) [2022-07-09 03:40:22,244][26022] Updated weights on worker 0-0, policy_version 71618 (0.00092) [2022-07-09 03:40:22,390][25689] Fps is (10 sec: 5694.8, 60 sec: 5751.3, 300 sec: 5751.0). Total num frames: 73336832. Throughput: 0: 5951.7. Samples: 73335532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:22,391][25689] Avg episode reward: [(0, '-55.806')] [2022-07-09 03:40:22,699][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:40:22,710][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000071620_73338880.pth [2022-07-09 03:40:22,710][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000069595_71265280.pth [2022-07-09 03:40:23,730][26022] Updated weights on worker 0-0, policy_version 71628 (0.00100) [2022-07-09 03:40:25,787][26022] Updated weights on worker 0-0, policy_version 71638 (0.00094) [2022-07-09 03:40:27,351][26022] Updated weights on worker 0-0, policy_version 71648 (0.00087) [2022-07-09 03:40:27,413][25689] Fps is (10 sec: 5919.6, 60 sec: 5783.3, 300 sec: 5751.2). Total num frames: 73367552. Throughput: 0: 6051.0. Samples: 73370082. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:27,413][25689] Avg episode reward: [(0, '-55.043')] [2022-07-09 03:40:29,262][26022] Updated weights on worker 0-0, policy_version 71658 (0.00094) [2022-07-09 03:40:31,143][26022] Updated weights on worker 0-0, policy_version 71668 (0.00086) [2022-07-09 03:40:32,491][25689] Fps is (10 sec: 5879.6, 60 sec: 5789.3, 300 sec: 5754.9). Total num frames: 73396224. Throughput: 0: 6041.6. Samples: 73404996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:32,492][25689] Avg episode reward: [(0, '-54.822')] [2022-07-09 03:40:32,675][26022] Updated weights on worker 0-0, policy_version 71678 (0.00097) [2022-07-09 03:40:34,630][26022] Updated weights on worker 0-0, policy_version 71688 (0.00089) [2022-07-09 03:40:36,262][26022] Updated weights on worker 0-0, policy_version 71698 (0.00092) [2022-07-09 03:40:37,495][25689] Fps is (10 sec: 5586.1, 60 sec: 5728.9, 300 sec: 5748.1). Total num frames: 73423872. Throughput: 0: 5183.5. Samples: 73422392. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:37,496][25689] Avg episode reward: [(0, '-54.808')] [2022-07-09 03:40:38,126][26022] Updated weights on worker 0-0, policy_version 71708 (0.00082) [2022-07-09 03:40:39,696][26022] Updated weights on worker 0-0, policy_version 71718 (0.00082) [2022-07-09 03:40:41,629][26022] Updated weights on worker 0-0, policy_version 71728 (0.00085) [2022-07-09 03:40:42,496][25689] Fps is (10 sec: 5731.5, 60 sec: 5754.0, 300 sec: 5749.0). Total num frames: 73453568. Throughput: 0: 6064.9. Samples: 73457474. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:42,497][25689] Avg episode reward: [(0, '-54.262')] [2022-07-09 03:40:43,265][26022] Updated weights on worker 0-0, policy_version 71738 (0.00089) [2022-07-09 03:40:45,071][26022] Updated weights on worker 0-0, policy_version 71748 (0.00081) [2022-07-09 03:40:46,739][26022] Updated weights on worker 0-0, policy_version 71758 (0.00087) [2022-07-09 03:40:47,536][25689] Fps is (10 sec: 6017.2, 60 sec: 5774.9, 300 sec: 5753.1). Total num frames: 73484288. Throughput: 0: 6094.2. Samples: 73492714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:47,536][25689] Avg episode reward: [(0, '-53.544')] [2022-07-09 03:40:48,528][26022] Updated weights on worker 0-0, policy_version 71768 (0.00093) [2022-07-09 03:40:50,244][26022] Updated weights on worker 0-0, policy_version 71778 (0.00102) [2022-07-09 03:40:52,118][26022] Updated weights on worker 0-0, policy_version 71788 (0.00090) [2022-07-09 03:40:52,604][25689] Fps is (10 sec: 5875.8, 60 sec: 5772.3, 300 sec: 5751.8). Total num frames: 73512960. Throughput: 0: 5216.0. Samples: 73509904. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:52,605][25689] Avg episode reward: [(0, '-53.671')] [2022-07-09 03:40:53,853][26022] Updated weights on worker 0-0, policy_version 71798 (0.00085) [2022-07-09 03:40:55,639][26022] Updated weights on worker 0-0, policy_version 71808 (0.00086) [2022-07-09 03:40:57,402][26022] Updated weights on worker 0-0, policy_version 71818 (0.00083) [2022-07-09 03:40:57,662][25689] Fps is (10 sec: 5865.4, 60 sec: 5790.7, 300 sec: 5758.2). Total num frames: 73543680. Throughput: 0: 6083.1. Samples: 73545066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:40:57,662][25689] Avg episode reward: [(0, '-54.873')] [2022-07-09 03:40:59,136][26022] Updated weights on worker 0-0, policy_version 71828 (0.00095) [2022-07-09 03:41:00,814][26022] Updated weights on worker 0-0, policy_version 71838 (0.00088) [2022-07-09 03:41:02,723][25689] Fps is (10 sec: 5667.2, 60 sec: 5788.1, 300 sec: 5760.6). Total num frames: 73570304. Throughput: 0: 5972.1. Samples: 73578268. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:02,723][25689] Avg episode reward: [(0, '-55.170')] [2022-07-09 03:41:03,015][26022] Updated weights on worker 0-0, policy_version 71848 (0.00873) [2022-07-09 03:41:04,963][26022] Updated weights on worker 0-0, policy_version 71858 (0.00084) [2022-07-09 03:41:06,516][26022] Updated weights on worker 0-0, policy_version 71868 (0.00091) [2022-07-09 03:41:07,737][25689] Fps is (10 sec: 5386.7, 60 sec: 5739.5, 300 sec: 5747.5). Total num frames: 73597952. Throughput: 0: 5088.7. Samples: 73595512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:07,739][25689] Avg episode reward: [(0, '-54.857')] [2022-07-09 03:41:08,361][26022] Updated weights on worker 0-0, policy_version 71878 (0.00092) [2022-07-09 03:41:09,958][26022] Updated weights on worker 0-0, policy_version 71888 (0.00084) [2022-07-09 03:41:11,914][26022] Updated weights on worker 0-0, policy_version 71898 (0.00090) [2022-07-09 03:41:12,792][25689] Fps is (10 sec: 5796.9, 60 sec: 5774.4, 300 sec: 5755.3). Total num frames: 73628672. Throughput: 0: 5971.1. Samples: 73630444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:12,792][25689] Avg episode reward: [(0, '-54.878')] [2022-07-09 03:41:13,583][26022] Updated weights on worker 0-0, policy_version 71908 (0.00410) [2022-07-09 03:41:15,378][26022] Updated weights on worker 0-0, policy_version 71918 (0.00094) [2022-07-09 03:41:17,152][26022] Updated weights on worker 0-0, policy_version 71928 (0.00092) [2022-07-09 03:41:17,795][25689] Fps is (10 sec: 6006.6, 60 sec: 5796.7, 300 sec: 5758.9). Total num frames: 73658368. Throughput: 0: 5988.5. Samples: 73665636. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:17,796][25689] Avg episode reward: [(0, '-54.469')] [2022-07-09 03:41:18,919][26022] Updated weights on worker 0-0, policy_version 71938 (0.00084) [2022-07-09 03:41:20,648][26022] Updated weights on worker 0-0, policy_version 71948 (0.00088) [2022-07-09 03:41:22,361][26022] Updated weights on worker 0-0, policy_version 71958 (0.00087) [2022-07-09 03:41:22,811][25689] Fps is (10 sec: 5723.5, 60 sec: 5779.2, 300 sec: 5751.8). Total num frames: 73686016. Throughput: 0: 5210.5. Samples: 73682936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:22,811][25689] Avg episode reward: [(0, '-53.706')] [2022-07-09 03:41:24,231][26022] Updated weights on worker 0-0, policy_version 71968 (0.00084) [2022-07-09 03:41:25,995][26022] Updated weights on worker 0-0, policy_version 71978 (0.00097) [2022-07-09 03:41:27,812][25689] Fps is (10 sec: 5622.3, 60 sec: 5747.3, 300 sec: 5758.3). Total num frames: 73714688. Throughput: 0: 6075.7. Samples: 73717484. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:27,813][25689] Avg episode reward: [(0, '-53.604')] [2022-07-09 03:41:27,838][26022] Updated weights on worker 0-0, policy_version 71988 (0.00084) [2022-07-09 03:41:29,514][26022] Updated weights on worker 0-0, policy_version 71998 (0.00083) [2022-07-09 03:41:31,310][26022] Updated weights on worker 0-0, policy_version 72008 (0.00090) [2022-07-09 03:41:32,939][25689] Fps is (10 sec: 5864.0, 60 sec: 5776.6, 300 sec: 5752.4). Total num frames: 73745408. Throughput: 0: 6049.7. Samples: 73752328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:32,939][25689] Avg episode reward: [(0, '-54.572')] [2022-07-09 03:41:33,006][26022] Updated weights on worker 0-0, policy_version 72018 (0.00090) [2022-07-09 03:41:34,693][26022] Updated weights on worker 0-0, policy_version 72028 (0.00087) [2022-07-09 03:41:36,515][26022] Updated weights on worker 0-0, policy_version 72038 (0.00089) [2022-07-09 03:41:37,959][25689] Fps is (10 sec: 5853.4, 60 sec: 5792.0, 300 sec: 5756.7). Total num frames: 73774080. Throughput: 0: 5177.4. Samples: 73770032. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:37,965][25689] Avg episode reward: [(0, '-54.563')] [2022-07-09 03:41:38,454][26022] Updated weights on worker 0-0, policy_version 72048 (0.00081) [2022-07-09 03:41:40,007][26022] Updated weights on worker 0-0, policy_version 72058 (0.00098) [2022-07-09 03:41:42,041][26022] Updated weights on worker 0-0, policy_version 72068 (0.00089) [2022-07-09 03:41:42,991][25689] Fps is (10 sec: 5704.5, 60 sec: 5772.1, 300 sec: 5749.9). Total num frames: 73802752. Throughput: 0: 6029.4. Samples: 73804614. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:42,992][25689] Avg episode reward: [(0, '-54.124')] [2022-07-09 03:41:43,454][26022] Updated weights on worker 0-0, policy_version 72078 (0.00087) [2022-07-09 03:41:45,504][26022] Updated weights on worker 0-0, policy_version 72088 (0.00085) [2022-07-09 03:41:47,124][26022] Updated weights on worker 0-0, policy_version 72098 (0.00084) [2022-07-09 03:41:48,001][25689] Fps is (10 sec: 5812.0, 60 sec: 5758.0, 300 sec: 5762.5). Total num frames: 73832448. Throughput: 0: 6033.6. Samples: 73839298. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 03:41:48,002][25689] Avg episode reward: [(0, '-54.662')] [2022-07-09 03:41:49,086][26022] Updated weights on worker 0-0, policy_version 72108 (0.00086) [2022-07-09 03:41:50,699][26022] Updated weights on worker 0-0, policy_version 72118 (0.00088) [2022-07-09 03:41:52,643][26022] Updated weights on worker 0-0, policy_version 72128 (0.00089) [2022-07-09 03:41:53,056][25689] Fps is (10 sec: 5799.1, 60 sec: 5759.3, 300 sec: 5752.9). Total num frames: 73861120. Throughput: 0: 5179.2. Samples: 73856520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:41:53,057][25689] Avg episode reward: [(0, '-55.374')] [2022-07-09 03:41:54,182][26022] Updated weights on worker 0-0, policy_version 72138 (0.00080) [2022-07-09 03:41:56,160][26022] Updated weights on worker 0-0, policy_version 72148 (0.00091) [2022-07-09 03:41:57,689][26022] Updated weights on worker 0-0, policy_version 72158 (0.00084) [2022-07-09 03:41:58,067][25689] Fps is (10 sec: 5798.9, 60 sec: 5746.8, 300 sec: 5760.7). Total num frames: 73890816. Throughput: 0: 6045.8. Samples: 73891600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:41:58,067][25689] Avg episode reward: [(0, '-55.073')] [2022-07-09 03:41:59,521][26022] Updated weights on worker 0-0, policy_version 72168 (0.00088) [2022-07-09 03:42:01,264][26022] Updated weights on worker 0-0, policy_version 72178 (0.00087) [2022-07-09 03:42:03,078][25689] Fps is (10 sec: 5619.7, 60 sec: 5751.5, 300 sec: 5757.2). Total num frames: 73917440. Throughput: 0: 5976.8. Samples: 73924670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:03,079][25689] Avg episode reward: [(0, '-56.210')] [2022-07-09 03:42:03,644][26022] Updated weights on worker 0-0, policy_version 72188 (0.00089) [2022-07-09 03:42:05,236][26022] Updated weights on worker 0-0, policy_version 72198 (0.00084) [2022-07-09 03:42:07,003][26022] Updated weights on worker 0-0, policy_version 72208 (0.00089) [2022-07-09 03:42:08,108][25689] Fps is (10 sec: 5609.0, 60 sec: 5784.0, 300 sec: 5762.2). Total num frames: 73947136. Throughput: 0: 5110.7. Samples: 73942056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:08,109][25689] Avg episode reward: [(0, '-56.808')] [2022-07-09 03:42:08,723][26022] Updated weights on worker 0-0, policy_version 72218 (0.00087) [2022-07-09 03:42:10,680][26022] Updated weights on worker 0-0, policy_version 72228 (0.00096) [2022-07-09 03:42:12,354][26022] Updated weights on worker 0-0, policy_version 72238 (0.00094) [2022-07-09 03:42:13,234][25689] Fps is (10 sec: 5747.6, 60 sec: 5743.3, 300 sec: 5753.2). Total num frames: 73975808. Throughput: 0: 5943.5. Samples: 73976442. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:13,234][25689] Avg episode reward: [(0, '-56.525')] [2022-07-09 03:42:14,203][26022] Updated weights on worker 0-0, policy_version 72248 (0.00091) [2022-07-09 03:42:15,921][26022] Updated weights on worker 0-0, policy_version 72258 (0.00092) [2022-07-09 03:42:17,620][26022] Updated weights on worker 0-0, policy_version 72268 (0.00084) [2022-07-09 03:42:18,237][25689] Fps is (10 sec: 5762.2, 60 sec: 5743.3, 300 sec: 5758.1). Total num frames: 74005504. Throughput: 0: 5917.8. Samples: 74010964. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:18,238][25689] Avg episode reward: [(0, '-56.225')] [2022-07-09 03:42:19,564][26022] Updated weights on worker 0-0, policy_version 72278 (0.00087) [2022-07-09 03:42:21,180][26022] Updated weights on worker 0-0, policy_version 72288 (0.00086) [2022-07-09 03:42:22,724][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:42:22,735][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000072295_74030080.pth [2022-07-09 03:42:22,735][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000070270_71956480.pth [2022-07-09 03:42:23,103][26022] Updated weights on worker 0-0, policy_version 72298 (0.00099) [2022-07-09 03:42:23,249][25689] Fps is (10 sec: 5827.8, 60 sec: 5760.6, 300 sec: 5758.0). Total num frames: 74034176. Throughput: 0: 5140.6. Samples: 74028358. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:23,249][25689] Avg episode reward: [(0, '-56.035')] [2022-07-09 03:42:24,997][26022] Updated weights on worker 0-0, policy_version 72308 (0.00083) [2022-07-09 03:42:26,669][26022] Updated weights on worker 0-0, policy_version 72318 (0.00693) [2022-07-09 03:42:28,256][25689] Fps is (10 sec: 5723.5, 60 sec: 5760.0, 300 sec: 5756.6). Total num frames: 74062848. Throughput: 0: 5975.3. Samples: 74062450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:28,257][25689] Avg episode reward: [(0, '-55.698')] [2022-07-09 03:42:28,565][26022] Updated weights on worker 0-0, policy_version 72328 (0.00084) [2022-07-09 03:42:30,063][26022] Updated weights on worker 0-0, policy_version 72338 (0.00090) [2022-07-09 03:42:32,055][26022] Updated weights on worker 0-0, policy_version 72348 (0.00095) [2022-07-09 03:42:33,298][25689] Fps is (10 sec: 5706.1, 60 sec: 5734.2, 300 sec: 5757.6). Total num frames: 74091520. Throughput: 0: 6028.0. Samples: 74097394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:33,300][25689] Avg episode reward: [(0, '-55.624')] [2022-07-09 03:42:33,763][26022] Updated weights on worker 0-0, policy_version 72358 (0.00100) [2022-07-09 03:42:35,489][26022] Updated weights on worker 0-0, policy_version 72368 (0.00091) [2022-07-09 03:42:37,222][26022] Updated weights on worker 0-0, policy_version 72378 (0.00084) [2022-07-09 03:42:38,329][25689] Fps is (10 sec: 5591.5, 60 sec: 5716.2, 300 sec: 5750.5). Total num frames: 74119168. Throughput: 0: 5164.6. Samples: 74114728. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:38,329][25689] Avg episode reward: [(0, '-54.974')] [2022-07-09 03:42:39,066][26022] Updated weights on worker 0-0, policy_version 72388 (0.00084) [2022-07-09 03:42:40,806][26022] Updated weights on worker 0-0, policy_version 72398 (0.00091) [2022-07-09 03:42:42,731][26022] Updated weights on worker 0-0, policy_version 72408 (0.00085) [2022-07-09 03:42:43,341][25689] Fps is (10 sec: 5710.3, 60 sec: 5735.1, 300 sec: 5753.8). Total num frames: 74148864. Throughput: 0: 6027.7. Samples: 74149466. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:43,343][25689] Avg episode reward: [(0, '-54.345')] [2022-07-09 03:42:44,359][26022] Updated weights on worker 0-0, policy_version 72418 (0.00096) [2022-07-09 03:42:46,404][26022] Updated weights on worker 0-0, policy_version 72428 (0.00093) [2022-07-09 03:42:47,841][26022] Updated weights on worker 0-0, policy_version 72438 (0.00084) [2022-07-09 03:42:48,346][25689] Fps is (10 sec: 6031.1, 60 sec: 5752.6, 300 sec: 5761.7). Total num frames: 74179584. Throughput: 0: 6060.0. Samples: 74184194. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:48,347][25689] Avg episode reward: [(0, '-54.003')] [2022-07-09 03:42:49,725][26022] Updated weights on worker 0-0, policy_version 72448 (0.00094) [2022-07-09 03:42:51,276][26022] Updated weights on worker 0-0, policy_version 72458 (0.00086) [2022-07-09 03:42:53,338][26022] Updated weights on worker 0-0, policy_version 72468 (0.00086) [2022-07-09 03:42:53,388][25689] Fps is (10 sec: 5809.2, 60 sec: 5736.8, 300 sec: 5750.7). Total num frames: 74207232. Throughput: 0: 5190.2. Samples: 74201664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:53,389][25689] Avg episode reward: [(0, '-53.052')] [2022-07-09 03:42:54,848][26022] Updated weights on worker 0-0, policy_version 72478 (0.00087) [2022-07-09 03:42:56,830][26022] Updated weights on worker 0-0, policy_version 72488 (0.00086) [2022-07-09 03:42:58,214][26022] Updated weights on worker 0-0, policy_version 72498 (0.00056) [2022-07-09 03:42:58,398][25689] Fps is (10 sec: 5806.6, 60 sec: 5753.8, 300 sec: 5761.1). Total num frames: 74237952. Throughput: 0: 6061.3. Samples: 74236374. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:42:58,398][25689] Avg episode reward: [(0, '-52.072')] [2022-07-09 03:43:00,190][26022] Updated weights on worker 0-0, policy_version 72508 (0.00061) [2022-07-09 03:43:02,439][26022] Updated weights on worker 0-0, policy_version 72518 (0.00089) [2022-07-09 03:43:03,423][25689] Fps is (10 sec: 5612.3, 60 sec: 5735.6, 300 sec: 5757.3). Total num frames: 74263552. Throughput: 0: 5946.3. Samples: 74268884. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:43:03,424][25689] Avg episode reward: [(0, '-52.009')] [2022-07-09 03:43:04,060][26022] Updated weights on worker 0-0, policy_version 72528 (0.00084) [2022-07-09 03:43:06,075][26022] Updated weights on worker 0-0, policy_version 72538 (0.00083) [2022-07-09 03:43:07,755][26022] Updated weights on worker 0-0, policy_version 72548 (0.00056) [2022-07-09 03:43:08,428][25689] Fps is (10 sec: 5411.1, 60 sec: 5721.0, 300 sec: 5759.7). Total num frames: 74292224. Throughput: 0: 5089.8. Samples: 74286406. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:43:08,428][25689] Avg episode reward: [(0, '-52.196')] [2022-07-09 03:43:09,578][26022] Updated weights on worker 0-0, policy_version 72558 (0.00085) [2022-07-09 03:43:11,340][26022] Updated weights on worker 0-0, policy_version 72568 (0.00085) [2022-07-09 03:43:13,063][26022] Updated weights on worker 0-0, policy_version 72578 (0.00086) [2022-07-09 03:43:13,470][25689] Fps is (10 sec: 5809.6, 60 sec: 5745.9, 300 sec: 5756.8). Total num frames: 74321920. Throughput: 0: 5933.5. Samples: 74320820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 03:43:13,471][25689] Avg episode reward: [(0, '-52.844')] [2022-07-09 03:43:14,824][26022] Updated weights on worker 0-0, policy_version 72588 (0.00088) [2022-07-09 03:43:16,674][26022] Updated weights on worker 0-0, policy_version 72598 (0.00079) [2022-07-09 03:43:18,473][25689] Fps is (10 sec: 5708.8, 60 sec: 5712.0, 300 sec: 5751.2). Total num frames: 74349568. Throughput: 0: 5935.7. Samples: 74355530. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:18,473][25689] Avg episode reward: [(0, '-53.152')] [2022-07-09 03:43:18,489][26022] Updated weights on worker 0-0, policy_version 72608 (0.00081) [2022-07-09 03:43:20,280][26022] Updated weights on worker 0-0, policy_version 72618 (0.00084) [2022-07-09 03:43:21,970][26022] Updated weights on worker 0-0, policy_version 72628 (0.00084) [2022-07-09 03:43:23,495][25689] Fps is (10 sec: 5618.2, 60 sec: 5711.0, 300 sec: 5754.5). Total num frames: 74378240. Throughput: 0: 5179.4. Samples: 74372842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:23,495][25689] Avg episode reward: [(0, '-53.884')] [2022-07-09 03:43:23,732][26022] Updated weights on worker 0-0, policy_version 72638 (0.00081) [2022-07-09 03:43:25,488][26022] Updated weights on worker 0-0, policy_version 72648 (0.00091) [2022-07-09 03:43:27,431][26022] Updated weights on worker 0-0, policy_version 72658 (0.00098) [2022-07-09 03:43:28,507][25689] Fps is (10 sec: 5817.0, 60 sec: 5727.6, 300 sec: 5749.7). Total num frames: 74407936. Throughput: 0: 6013.6. Samples: 74407152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:28,507][25689] Avg episode reward: [(0, '-54.830')] [2022-07-09 03:43:29,192][26022] Updated weights on worker 0-0, policy_version 72668 (0.00090) [2022-07-09 03:43:31,016][26022] Updated weights on worker 0-0, policy_version 72678 (0.00094) [2022-07-09 03:43:32,596][26022] Updated weights on worker 0-0, policy_version 72688 (0.00085) [2022-07-09 03:43:33,561][25689] Fps is (10 sec: 5696.5, 60 sec: 5709.4, 300 sec: 5745.7). Total num frames: 74435584. Throughput: 0: 6022.5. Samples: 74441818. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:33,562][25689] Avg episode reward: [(0, '-54.563')] [2022-07-09 03:43:34,472][26022] Updated weights on worker 0-0, policy_version 72698 (0.00082) [2022-07-09 03:43:36,169][26022] Updated weights on worker 0-0, policy_version 72708 (0.00087) [2022-07-09 03:43:38,145][26022] Updated weights on worker 0-0, policy_version 72718 (0.00083) [2022-07-09 03:43:38,587][25689] Fps is (10 sec: 5790.4, 60 sec: 5760.8, 300 sec: 5748.8). Total num frames: 74466304. Throughput: 0: 5160.6. Samples: 74459332. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:38,588][25689] Avg episode reward: [(0, '-54.130')] [2022-07-09 03:43:39,860][26022] Updated weights on worker 0-0, policy_version 72728 (0.00437) [2022-07-09 03:43:41,620][26022] Updated weights on worker 0-0, policy_version 72738 (0.00085) [2022-07-09 03:43:43,220][26022] Updated weights on worker 0-0, policy_version 72748 (0.00092) [2022-07-09 03:43:43,591][25689] Fps is (10 sec: 5921.8, 60 sec: 5744.6, 300 sec: 5753.0). Total num frames: 74494976. Throughput: 0: 6033.4. Samples: 74494090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:43,591][25689] Avg episode reward: [(0, '-53.761')] [2022-07-09 03:43:45,193][26022] Updated weights on worker 0-0, policy_version 72758 (0.00086) [2022-07-09 03:43:46,784][26022] Updated weights on worker 0-0, policy_version 72768 (0.00102) [2022-07-09 03:43:48,594][25689] Fps is (10 sec: 5730.4, 60 sec: 5710.8, 300 sec: 5748.1). Total num frames: 74523648. Throughput: 0: 6056.8. Samples: 74528816. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:48,596][25689] Avg episode reward: [(0, '-54.330')] [2022-07-09 03:43:48,693][26022] Updated weights on worker 0-0, policy_version 72778 (0.00080) [2022-07-09 03:43:50,291][26022] Updated weights on worker 0-0, policy_version 72788 (0.00096) [2022-07-09 03:43:52,354][26022] Updated weights on worker 0-0, policy_version 72798 (0.00086) [2022-07-09 03:43:53,676][25689] Fps is (10 sec: 5685.6, 60 sec: 5724.0, 300 sec: 5754.1). Total num frames: 74552320. Throughput: 0: 5186.3. Samples: 74546142. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:53,678][25689] Avg episode reward: [(0, '-54.539')] [2022-07-09 03:43:54,132][26022] Updated weights on worker 0-0, policy_version 72808 (0.00095) [2022-07-09 03:43:55,789][26022] Updated weights on worker 0-0, policy_version 72818 (0.00108) [2022-07-09 03:43:57,760][26022] Updated weights on worker 0-0, policy_version 72828 (0.00090) [2022-07-09 03:43:58,715][25689] Fps is (10 sec: 5767.1, 60 sec: 5704.3, 300 sec: 5746.6). Total num frames: 74582016. Throughput: 0: 5993.9. Samples: 74579976. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:43:58,715][25689] Avg episode reward: [(0, '-54.338')] [2022-07-09 03:43:59,261][26022] Updated weights on worker 0-0, policy_version 72838 (0.00091) [2022-07-09 03:44:01,729][26022] Updated weights on worker 0-0, policy_version 72848 (0.00080) [2022-07-09 03:44:03,413][26022] Updated weights on worker 0-0, policy_version 72858 (0.00084) [2022-07-09 03:44:03,725][25689] Fps is (10 sec: 5605.0, 60 sec: 5722.7, 300 sec: 5748.2). Total num frames: 74608640. Throughput: 0: 5876.8. Samples: 74612412. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:03,725][25689] Avg episode reward: [(0, '-54.464')] [2022-07-09 03:44:05,208][26022] Updated weights on worker 0-0, policy_version 72868 (0.00092) [2022-07-09 03:44:06,940][26022] Updated weights on worker 0-0, policy_version 72878 (0.00094) [2022-07-09 03:44:08,659][26022] Updated weights on worker 0-0, policy_version 72888 (0.00085) [2022-07-09 03:44:08,730][25689] Fps is (10 sec: 5521.4, 60 sec: 5722.7, 300 sec: 5745.9). Total num frames: 74637312. Throughput: 0: 5016.3. Samples: 74629822. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:08,730][25689] Avg episode reward: [(0, '-54.333')] [2022-07-09 03:44:10,377][26022] Updated weights on worker 0-0, policy_version 72898 (0.00084) [2022-07-09 03:44:12,263][26022] Updated weights on worker 0-0, policy_version 72908 (0.00090) [2022-07-09 03:44:13,784][26022] Updated weights on worker 0-0, policy_version 72918 (0.00084) [2022-07-09 03:44:13,880][25689] Fps is (10 sec: 5848.2, 60 sec: 5729.4, 300 sec: 5756.9). Total num frames: 74668032. Throughput: 0: 5869.4. Samples: 74664726. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:13,881][25689] Avg episode reward: [(0, '-54.220')] [2022-07-09 03:44:15,763][26022] Updated weights on worker 0-0, policy_version 72928 (0.00082) [2022-07-09 03:44:17,416][26022] Updated weights on worker 0-0, policy_version 72938 (0.00085) [2022-07-09 03:44:18,885][25689] Fps is (10 sec: 5747.4, 60 sec: 5729.1, 300 sec: 5746.9). Total num frames: 74695680. Throughput: 0: 5943.1. Samples: 74699850. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:18,886][25689] Avg episode reward: [(0, '-54.140')] [2022-07-09 03:44:19,090][26022] Updated weights on worker 0-0, policy_version 72948 (0.00097) [2022-07-09 03:44:21,067][26022] Updated weights on worker 0-0, policy_version 72958 (0.00080) [2022-07-09 03:44:22,631][26022] Updated weights on worker 0-0, policy_version 72968 (0.00096) [2022-07-09 03:44:22,746][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:44:22,762][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000072969_74720256.pth [2022-07-09 03:44:22,762][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000070944_72646656.pth [2022-07-09 03:44:23,940][25689] Fps is (10 sec: 5700.4, 60 sec: 5743.0, 300 sec: 5749.4). Total num frames: 74725376. Throughput: 0: 6055.2. Samples: 74734820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:23,940][25689] Avg episode reward: [(0, '-54.500')] [2022-07-09 03:44:24,879][26022] Updated weights on worker 0-0, policy_version 72978 (0.00086) [2022-07-09 03:44:26,273][26022] Updated weights on worker 0-0, policy_version 72988 (0.00092) [2022-07-09 03:44:28,279][26022] Updated weights on worker 0-0, policy_version 72998 (0.00100) [2022-07-09 03:44:29,031][25689] Fps is (10 sec: 5853.9, 60 sec: 5735.5, 300 sec: 5753.8). Total num frames: 74755072. Throughput: 0: 5999.9. Samples: 74751628. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:29,031][25689] Avg episode reward: [(0, '-54.137')] [2022-07-09 03:44:29,958][26022] Updated weights on worker 0-0, policy_version 73008 (0.00082) [2022-07-09 03:44:31,768][26022] Updated weights on worker 0-0, policy_version 73018 (0.00090) [2022-07-09 03:44:33,430][26022] Updated weights on worker 0-0, policy_version 73028 (0.00080) [2022-07-09 03:44:34,085][25689] Fps is (10 sec: 5753.3, 60 sec: 5752.4, 300 sec: 5743.9). Total num frames: 74783744. Throughput: 0: 6012.7. Samples: 74786212. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:34,086][25689] Avg episode reward: [(0, '-54.087')] [2022-07-09 03:44:35,284][26022] Updated weights on worker 0-0, policy_version 73038 (0.00091) [2022-07-09 03:44:37,075][26022] Updated weights on worker 0-0, policy_version 73048 (0.00089) [2022-07-09 03:44:38,902][26022] Updated weights on worker 0-0, policy_version 73058 (0.00092) [2022-07-09 03:44:39,132][25689] Fps is (10 sec: 5676.9, 60 sec: 5716.6, 300 sec: 5744.7). Total num frames: 74812416. Throughput: 0: 5992.5. Samples: 74821180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:39,133][25689] Avg episode reward: [(0, '-54.489')] [2022-07-09 03:44:40,523][26022] Updated weights on worker 0-0, policy_version 73068 (0.00084) [2022-07-09 03:44:42,444][26022] Updated weights on worker 0-0, policy_version 73078 (0.00087) [2022-07-09 03:44:43,922][26022] Updated weights on worker 0-0, policy_version 73088 (0.00083) [2022-07-09 03:44:44,175][25689] Fps is (10 sec: 5784.8, 60 sec: 5729.8, 300 sec: 5745.5). Total num frames: 74842112. Throughput: 0: 5132.0. Samples: 74838666. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 03:44:44,175][25689] Avg episode reward: [(0, '-54.437')] [2022-07-09 03:44:45,818][26022] Updated weights on worker 0-0, policy_version 73098 (0.00093) [2022-07-09 03:44:47,570][26022] Updated weights on worker 0-0, policy_version 73108 (0.00094) [2022-07-09 03:44:49,213][25689] Fps is (10 sec: 5789.9, 60 sec: 5726.5, 300 sec: 5745.5). Total num frames: 74870784. Throughput: 0: 6040.5. Samples: 74873540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:44:49,214][25689] Avg episode reward: [(0, '-54.195')] [2022-07-09 03:44:49,440][26022] Updated weights on worker 0-0, policy_version 73118 (0.00088) [2022-07-09 03:44:51,166][26022] Updated weights on worker 0-0, policy_version 73128 (0.00438) [2022-07-09 03:44:52,985][26022] Updated weights on worker 0-0, policy_version 73138 (0.00099) [2022-07-09 03:44:54,277][25689] Fps is (10 sec: 5879.0, 60 sec: 5762.0, 300 sec: 5749.1). Total num frames: 74901504. Throughput: 0: 6056.5. Samples: 74908506. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:44:54,278][25689] Avg episode reward: [(0, '-54.962')] [2022-07-09 03:44:54,605][26022] Updated weights on worker 0-0, policy_version 73148 (0.00089) [2022-07-09 03:44:56,477][26022] Updated weights on worker 0-0, policy_version 73158 (0.00096) [2022-07-09 03:44:58,216][26022] Updated weights on worker 0-0, policy_version 73168 (0.00093) [2022-07-09 03:44:59,282][25689] Fps is (10 sec: 5898.5, 60 sec: 5748.3, 300 sec: 5756.6). Total num frames: 74930176. Throughput: 0: 5189.7. Samples: 74925752. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:44:59,284][25689] Avg episode reward: [(0, '-55.049')] [2022-07-09 03:44:59,953][26022] Updated weights on worker 0-0, policy_version 73178 (0.01076) [2022-07-09 03:45:01,752][26022] Updated weights on worker 0-0, policy_version 73188 (0.00080) [2022-07-09 03:45:03,787][26022] Updated weights on worker 0-0, policy_version 73198 (0.00085) [2022-07-09 03:45:04,299][25689] Fps is (10 sec: 5517.8, 60 sec: 5747.7, 300 sec: 5743.2). Total num frames: 74956800. Throughput: 0: 5956.9. Samples: 74958540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:04,299][25689] Avg episode reward: [(0, '-55.186')] [2022-07-09 03:45:05,568][26022] Updated weights on worker 0-0, policy_version 73208 (0.00084) [2022-07-09 03:45:07,583][26022] Updated weights on worker 0-0, policy_version 73218 (0.00087) [2022-07-09 03:45:09,181][26022] Updated weights on worker 0-0, policy_version 73228 (0.00083) [2022-07-09 03:45:09,307][25689] Fps is (10 sec: 5618.2, 60 sec: 5764.3, 300 sec: 5747.8). Total num frames: 74986496. Throughput: 0: 5957.0. Samples: 74993238. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:09,308][25689] Avg episode reward: [(0, '-55.491')] [2022-07-09 03:45:11,086][26022] Updated weights on worker 0-0, policy_version 73238 (0.00052) [2022-07-09 03:45:12,660][26022] Updated weights on worker 0-0, policy_version 73248 (0.00082) [2022-07-09 03:45:14,405][25689] Fps is (10 sec: 5876.9, 60 sec: 5752.4, 300 sec: 5750.5). Total num frames: 75016192. Throughput: 0: 5074.8. Samples: 75010648. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:14,405][25689] Avg episode reward: [(0, '-55.492')] [2022-07-09 03:45:14,411][26022] Updated weights on worker 0-0, policy_version 73258 (0.00084) [2022-07-09 03:45:16,361][26022] Updated weights on worker 0-0, policy_version 73268 (0.00083) [2022-07-09 03:45:17,804][26022] Updated weights on worker 0-0, policy_version 73278 (0.00087) [2022-07-09 03:45:19,416][25689] Fps is (10 sec: 5672.3, 60 sec: 5751.8, 300 sec: 5747.0). Total num frames: 75043840. Throughput: 0: 5964.7. Samples: 75045846. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:19,417][25689] Avg episode reward: [(0, '-55.238')] [2022-07-09 03:45:19,723][26022] Updated weights on worker 0-0, policy_version 73288 (0.00081) [2022-07-09 03:45:21,357][26022] Updated weights on worker 0-0, policy_version 73298 (0.00088) [2022-07-09 03:45:23,190][26022] Updated weights on worker 0-0, policy_version 73308 (0.00085) [2022-07-09 03:45:24,487][25689] Fps is (10 sec: 5789.0, 60 sec: 5767.1, 300 sec: 5746.1). Total num frames: 75074560. Throughput: 0: 6059.9. Samples: 75080880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:24,488][25689] Avg episode reward: [(0, '-55.427')] [2022-07-09 03:45:25,079][26022] Updated weights on worker 0-0, policy_version 73318 (0.00094) [2022-07-09 03:45:26,783][26022] Updated weights on worker 0-0, policy_version 73328 (0.00076) [2022-07-09 03:45:28,581][26022] Updated weights on worker 0-0, policy_version 73338 (0.00095) [2022-07-09 03:45:29,503][25689] Fps is (10 sec: 5786.7, 60 sec: 5740.5, 300 sec: 5743.8). Total num frames: 75102208. Throughput: 0: 5192.9. Samples: 75098114. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:29,503][25689] Avg episode reward: [(0, '-55.451')] [2022-07-09 03:45:30,351][26022] Updated weights on worker 0-0, policy_version 73348 (0.00087) [2022-07-09 03:45:32,311][26022] Updated weights on worker 0-0, policy_version 73358 (0.00090) [2022-07-09 03:45:33,775][26022] Updated weights on worker 0-0, policy_version 73368 (0.00089) [2022-07-09 03:45:34,600][25689] Fps is (10 sec: 5670.2, 60 sec: 5753.2, 300 sec: 5748.9). Total num frames: 75131904. Throughput: 0: 6041.8. Samples: 75132668. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:34,601][25689] Avg episode reward: [(0, '-55.358')] [2022-07-09 03:45:35,640][26022] Updated weights on worker 0-0, policy_version 73378 (0.00087) [2022-07-09 03:45:37,199][26022] Updated weights on worker 0-0, policy_version 73388 (0.00084) [2022-07-09 03:45:39,266][26022] Updated weights on worker 0-0, policy_version 73398 (0.00093) [2022-07-09 03:45:39,659][25689] Fps is (10 sec: 5848.0, 60 sec: 5769.1, 300 sec: 5747.8). Total num frames: 75161600. Throughput: 0: 6021.1. Samples: 75167728. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:39,659][25689] Avg episode reward: [(0, '-55.150')] [2022-07-09 03:45:41,063][26022] Updated weights on worker 0-0, policy_version 73408 (0.00089) [2022-07-09 03:45:42,636][26022] Updated weights on worker 0-0, policy_version 73418 (0.00084) [2022-07-09 03:45:44,449][26022] Updated weights on worker 0-0, policy_version 73428 (0.00083) [2022-07-09 03:45:44,701][25689] Fps is (10 sec: 5778.7, 60 sec: 5752.2, 300 sec: 5740.9). Total num frames: 75190272. Throughput: 0: 5156.5. Samples: 75185112. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:44,701][25689] Avg episode reward: [(0, '-55.349')] [2022-07-09 03:45:46,405][26022] Updated weights on worker 0-0, policy_version 73438 (0.00084) [2022-07-09 03:45:48,020][26022] Updated weights on worker 0-0, policy_version 73448 (0.00087) [2022-07-09 03:45:49,735][25689] Fps is (10 sec: 5792.8, 60 sec: 5769.6, 300 sec: 5745.0). Total num frames: 75219968. Throughput: 0: 5998.7. Samples: 75219480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:49,735][25689] Avg episode reward: [(0, '-54.760')] [2022-07-09 03:45:49,886][26022] Updated weights on worker 0-0, policy_version 73458 (0.00086) [2022-07-09 03:45:51,595][26022] Updated weights on worker 0-0, policy_version 73468 (0.00091) [2022-07-09 03:45:53,552][26022] Updated weights on worker 0-0, policy_version 73478 (0.00088) [2022-07-09 03:45:54,870][25689] Fps is (10 sec: 5739.5, 60 sec: 5729.0, 300 sec: 5736.6). Total num frames: 75248640. Throughput: 0: 5973.0. Samples: 75253742. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:54,873][25689] Avg episode reward: [(0, '-54.581')] [2022-07-09 03:45:55,213][26022] Updated weights on worker 0-0, policy_version 73488 (0.00597) [2022-07-09 03:45:56,943][26022] Updated weights on worker 0-0, policy_version 73498 (0.00086) [2022-07-09 03:45:58,931][26022] Updated weights on worker 0-0, policy_version 73508 (0.00095) [2022-07-09 03:45:59,892][25689] Fps is (10 sec: 5847.3, 60 sec: 5761.2, 300 sec: 5751.1). Total num frames: 75279360. Throughput: 0: 5107.3. Samples: 75271066. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:45:59,893][25689] Avg episode reward: [(0, '-54.587')] [2022-07-09 03:46:00,636][26022] Updated weights on worker 0-0, policy_version 73518 (0.00097) [2022-07-09 03:46:02,861][26022] Updated weights on worker 0-0, policy_version 73528 (0.00095) [2022-07-09 03:46:04,397][26022] Updated weights on worker 0-0, policy_version 73538 (0.00087) [2022-07-09 03:46:04,913][25689] Fps is (10 sec: 5506.3, 60 sec: 5727.0, 300 sec: 5740.7). Total num frames: 75303936. Throughput: 0: 5873.8. Samples: 75303832. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:46:04,913][25689] Avg episode reward: [(0, '-54.607')] [2022-07-09 03:46:06,392][26022] Updated weights on worker 0-0, policy_version 73548 (0.00091) [2022-07-09 03:46:08,070][26022] Updated weights on worker 0-0, policy_version 73558 (0.00085) [2022-07-09 03:46:09,945][25689] Fps is (10 sec: 5398.8, 60 sec: 5724.7, 300 sec: 5737.7). Total num frames: 75333632. Throughput: 0: 5897.3. Samples: 75338664. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:46:09,946][25689] Avg episode reward: [(0, '-54.392')] [2022-07-09 03:46:09,948][26022] Updated weights on worker 0-0, policy_version 73568 (0.00088) [2022-07-09 03:46:11,507][26022] Updated weights on worker 0-0, policy_version 73578 (0.00538) [2022-07-09 03:46:13,364][26022] Updated weights on worker 0-0, policy_version 73588 (0.00103) [2022-07-09 03:46:15,029][25689] Fps is (10 sec: 5972.3, 60 sec: 5742.9, 300 sec: 5739.5). Total num frames: 75364352. Throughput: 0: 5073.4. Samples: 75356014. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:46:15,029][25689] Avg episode reward: [(0, '-54.951')] [2022-07-09 03:46:15,036][26022] Updated weights on worker 0-0, policy_version 73598 (0.00094) [2022-07-09 03:46:16,877][26022] Updated weights on worker 0-0, policy_version 73608 (0.00080) [2022-07-09 03:46:18,716][26022] Updated weights on worker 0-0, policy_version 73618 (0.00090) [2022-07-09 03:46:20,078][25689] Fps is (10 sec: 5860.7, 60 sec: 5756.2, 300 sec: 5742.3). Total num frames: 75393024. Throughput: 0: 5944.1. Samples: 75391058. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:20,079][25689] Avg episode reward: [(0, '-55.184')] [2022-07-09 03:46:20,279][26022] Updated weights on worker 0-0, policy_version 73628 (0.00083) [2022-07-09 03:46:21,987][26022] Updated weights on worker 0-0, policy_version 73638 (0.00085) [2022-07-09 03:46:22,864][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:46:22,882][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000073642_75409408.pth [2022-07-09 03:46:22,883][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000071620_73338880.pth [2022-07-09 03:46:23,652][26022] Updated weights on worker 0-0, policy_version 73648 (0.00089) [2022-07-09 03:46:25,088][25689] Fps is (10 sec: 5802.6, 60 sec: 5745.2, 300 sec: 5745.6). Total num frames: 75422720. Throughput: 0: 6070.7. Samples: 75426308. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:25,088][25689] Avg episode reward: [(0, '-55.394')] [2022-07-09 03:46:25,593][26022] Updated weights on worker 0-0, policy_version 73658 (0.00095) [2022-07-09 03:46:27,442][26022] Updated weights on worker 0-0, policy_version 73668 (0.00093) [2022-07-09 03:46:28,962][26022] Updated weights on worker 0-0, policy_version 73678 (0.00083) [2022-07-09 03:46:30,089][25689] Fps is (10 sec: 5728.5, 60 sec: 5746.5, 300 sec: 5737.7). Total num frames: 75450368. Throughput: 0: 6071.8. Samples: 75460976. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:30,089][25689] Avg episode reward: [(0, '-54.569')] [2022-07-09 03:46:30,847][26022] Updated weights on worker 0-0, policy_version 73688 (0.00096) [2022-07-09 03:46:32,838][26022] Updated weights on worker 0-0, policy_version 73698 (0.00082) [2022-07-09 03:46:34,276][26022] Updated weights on worker 0-0, policy_version 73708 (0.00093) [2022-07-09 03:46:35,236][25689] Fps is (10 sec: 5751.6, 60 sec: 5758.7, 300 sec: 5742.1). Total num frames: 75481088. Throughput: 0: 6063.0. Samples: 75478530. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:35,236][25689] Avg episode reward: [(0, '-54.951')] [2022-07-09 03:46:36,282][26022] Updated weights on worker 0-0, policy_version 73718 (0.00557) [2022-07-09 03:46:37,883][26022] Updated weights on worker 0-0, policy_version 73728 (0.00095) [2022-07-09 03:46:39,641][26022] Updated weights on worker 0-0, policy_version 73738 (0.00083) [2022-07-09 03:46:40,259][25689] Fps is (10 sec: 5940.8, 60 sec: 5762.1, 300 sec: 5745.8). Total num frames: 75510784. Throughput: 0: 6070.2. Samples: 75513554. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:40,259][25689] Avg episode reward: [(0, '-54.462')] [2022-07-09 03:46:41,415][26022] Updated weights on worker 0-0, policy_version 73748 (0.00092) [2022-07-09 03:46:43,085][26022] Updated weights on worker 0-0, policy_version 73758 (0.00091) [2022-07-09 03:46:45,012][26022] Updated weights on worker 0-0, policy_version 73768 (0.00088) [2022-07-09 03:46:45,262][25689] Fps is (10 sec: 5719.2, 60 sec: 5748.9, 300 sec: 5739.0). Total num frames: 75538432. Throughput: 0: 6061.6. Samples: 75548600. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:45,263][25689] Avg episode reward: [(0, '-54.758')] [2022-07-09 03:46:46,518][26022] Updated weights on worker 0-0, policy_version 73778 (0.00092) [2022-07-09 03:46:48,483][26022] Updated weights on worker 0-0, policy_version 73788 (0.00096) [2022-07-09 03:46:50,080][26022] Updated weights on worker 0-0, policy_version 73798 (0.00084) [2022-07-09 03:46:50,306][25689] Fps is (10 sec: 5809.4, 60 sec: 5764.9, 300 sec: 5746.1). Total num frames: 75569152. Throughput: 0: 5196.0. Samples: 75566024. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:50,307][25689] Avg episode reward: [(0, '-54.914')] [2022-07-09 03:46:51,923][26022] Updated weights on worker 0-0, policy_version 73808 (0.00082) [2022-07-09 03:46:53,681][26022] Updated weights on worker 0-0, policy_version 73818 (0.00088) [2022-07-09 03:46:55,268][26022] Updated weights on worker 0-0, policy_version 73828 (0.00087) [2022-07-09 03:46:55,396][25689] Fps is (10 sec: 6063.2, 60 sec: 5803.1, 300 sec: 5748.0). Total num frames: 75599872. Throughput: 0: 6096.0. Samples: 75601426. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:46:55,398][25689] Avg episode reward: [(0, '-55.453')] [2022-07-09 03:46:57,429][26022] Updated weights on worker 0-0, policy_version 73838 (0.00084) [2022-07-09 03:46:58,974][26022] Updated weights on worker 0-0, policy_version 73848 (0.00080) [2022-07-09 03:47:00,404][25689] Fps is (10 sec: 5881.5, 60 sec: 5770.5, 300 sec: 5755.0). Total num frames: 75628544. Throughput: 0: 6096.9. Samples: 75636378. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:00,405][25689] Avg episode reward: [(0, '-55.127')] [2022-07-09 03:47:00,751][26022] Updated weights on worker 0-0, policy_version 73858 (0.00077) [2022-07-09 03:47:02,719][26022] Updated weights on worker 0-0, policy_version 73868 (0.00090) [2022-07-09 03:47:04,623][26022] Updated weights on worker 0-0, policy_version 73878 (0.00093) [2022-07-09 03:47:05,419][25689] Fps is (10 sec: 5516.6, 60 sec: 5804.9, 300 sec: 5744.9). Total num frames: 75655168. Throughput: 0: 5121.0. Samples: 75651824. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:05,420][25689] Avg episode reward: [(0, '-55.181')] [2022-07-09 03:47:06,271][26022] Updated weights on worker 0-0, policy_version 73888 (0.00098) [2022-07-09 03:47:08,295][26022] Updated weights on worker 0-0, policy_version 73898 (0.00064) [2022-07-09 03:47:09,699][26022] Updated weights on worker 0-0, policy_version 73908 (0.00081) [2022-07-09 03:47:10,436][25689] Fps is (10 sec: 5614.0, 60 sec: 5806.3, 300 sec: 5750.5). Total num frames: 75684864. Throughput: 0: 5993.1. Samples: 75686666. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:10,438][25689] Avg episode reward: [(0, '-54.869')] [2022-07-09 03:47:11,747][26022] Updated weights on worker 0-0, policy_version 73918 (0.00084) [2022-07-09 03:47:13,452][26022] Updated weights on worker 0-0, policy_version 73928 (0.00087) [2022-07-09 03:47:15,211][26022] Updated weights on worker 0-0, policy_version 73938 (0.00086) [2022-07-09 03:47:15,491][25689] Fps is (10 sec: 5795.3, 60 sec: 5775.2, 300 sec: 5746.0). Total num frames: 75713536. Throughput: 0: 5984.6. Samples: 75721688. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:15,493][25689] Avg episode reward: [(0, '-54.674')] [2022-07-09 03:47:16,987][26022] Updated weights on worker 0-0, policy_version 73948 (0.00092) [2022-07-09 03:47:18,702][26022] Updated weights on worker 0-0, policy_version 73958 (0.00089) [2022-07-09 03:47:20,506][25689] Fps is (10 sec: 5796.0, 60 sec: 5795.5, 300 sec: 5749.4). Total num frames: 75743232. Throughput: 0: 5104.7. Samples: 75738994. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:20,507][25689] Avg episode reward: [(0, '-54.551')] [2022-07-09 03:47:20,508][26022] Updated weights on worker 0-0, policy_version 73968 (0.00098) [2022-07-09 03:47:22,302][26022] Updated weights on worker 0-0, policy_version 73978 (0.00572) [2022-07-09 03:47:23,842][26022] Updated weights on worker 0-0, policy_version 73988 (0.00087) [2022-07-09 03:47:25,516][25689] Fps is (10 sec: 5822.4, 60 sec: 5778.5, 300 sec: 5749.4). Total num frames: 75771904. Throughput: 0: 6079.2. Samples: 75773994. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:25,518][25689] Avg episode reward: [(0, '-54.506')] [2022-07-09 03:47:25,873][26022] Updated weights on worker 0-0, policy_version 73998 (0.00084) [2022-07-09 03:47:27,614][26022] Updated weights on worker 0-0, policy_version 74008 (0.00087) [2022-07-09 03:47:29,332][26022] Updated weights on worker 0-0, policy_version 74018 (0.00087) [2022-07-09 03:47:30,579][25689] Fps is (10 sec: 5794.8, 60 sec: 5806.5, 300 sec: 5752.4). Total num frames: 75801600. Throughput: 0: 6061.4. Samples: 75808760. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:30,580][25689] Avg episode reward: [(0, '-54.337')] [2022-07-09 03:47:31,088][26022] Updated weights on worker 0-0, policy_version 74028 (0.00096) [2022-07-09 03:47:32,797][26022] Updated weights on worker 0-0, policy_version 74038 (0.00088) [2022-07-09 03:47:34,609][26022] Updated weights on worker 0-0, policy_version 74048 (0.00087) [2022-07-09 03:47:35,685][25689] Fps is (10 sec: 5739.4, 60 sec: 5776.4, 300 sec: 5754.4). Total num frames: 75830272. Throughput: 0: 5164.9. Samples: 75825990. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:35,686][25689] Avg episode reward: [(0, '-54.851')] [2022-07-09 03:47:36,395][26022] Updated weights on worker 0-0, policy_version 74058 (0.00085) [2022-07-09 03:47:38,091][26022] Updated weights on worker 0-0, policy_version 74068 (0.00084) [2022-07-09 03:47:39,889][26022] Updated weights on worker 0-0, policy_version 74078 (0.00095) [2022-07-09 03:47:40,729][25689] Fps is (10 sec: 5851.4, 60 sec: 5791.4, 300 sec: 5757.2). Total num frames: 75860992. Throughput: 0: 6038.9. Samples: 75861116. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:40,730][25689] Avg episode reward: [(0, '-54.468')] [2022-07-09 03:47:41,703][26022] Updated weights on worker 0-0, policy_version 74088 (0.00094) [2022-07-09 03:47:43,377][26022] Updated weights on worker 0-0, policy_version 74098 (0.00087) [2022-07-09 03:47:45,250][26022] Updated weights on worker 0-0, policy_version 74108 (0.00094) [2022-07-09 03:47:45,775][25689] Fps is (10 sec: 5886.4, 60 sec: 5804.3, 300 sec: 5749.5). Total num frames: 75889664. Throughput: 0: 6033.4. Samples: 75896228. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 03:47:45,776][25689] Avg episode reward: [(0, '-55.312')] [2022-07-09 03:47:46,815][26022] Updated weights on worker 0-0, policy_version 74118 (0.00087) [2022-07-09 03:47:48,767][26022] Updated weights on worker 0-0, policy_version 74128 (0.00094) [2022-07-09 03:47:50,359][26022] Updated weights on worker 0-0, policy_version 74138 (0.00095) [2022-07-09 03:47:50,850][25689] Fps is (10 sec: 5766.9, 60 sec: 5784.3, 300 sec: 5755.8). Total num frames: 75919360. Throughput: 0: 5170.5. Samples: 75913572. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:47:50,850][25689] Avg episode reward: [(0, '-54.867')] [2022-07-09 03:47:52,423][26022] Updated weights on worker 0-0, policy_version 74148 (0.00090) [2022-07-09 03:47:53,912][26022] Updated weights on worker 0-0, policy_version 74158 (0.00095) [2022-07-09 03:47:55,797][26022] Updated weights on worker 0-0, policy_version 74168 (0.00087) [2022-07-09 03:47:55,946][25689] Fps is (10 sec: 5940.1, 60 sec: 5783.8, 300 sec: 5754.1). Total num frames: 75950080. Throughput: 0: 6039.5. Samples: 75948354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:47:55,947][25689] Avg episode reward: [(0, '-54.896')] [2022-07-09 03:47:57,671][26022] Updated weights on worker 0-0, policy_version 74178 (0.00086) [2022-07-09 03:47:59,227][26022] Updated weights on worker 0-0, policy_version 74188 (0.00088) [2022-07-09 03:48:01,007][25689] Fps is (10 sec: 5645.8, 60 sec: 5744.9, 300 sec: 5756.9). Total num frames: 75976704. Throughput: 0: 6013.4. Samples: 75983056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:01,007][25689] Avg episode reward: [(0, '-53.966')] [2022-07-09 03:48:01,612][26022] Updated weights on worker 0-0, policy_version 74198 (0.00100) [2022-07-09 03:48:03,258][26022] Updated weights on worker 0-0, policy_version 74208 (0.00085) [2022-07-09 03:48:05,095][26022] Updated weights on worker 0-0, policy_version 74218 (0.00089) [2022-07-09 03:48:06,052][25689] Fps is (10 sec: 5370.3, 60 sec: 5759.0, 300 sec: 5752.6). Total num frames: 76004352. Throughput: 0: 5028.0. Samples: 75998180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:06,053][25689] Avg episode reward: [(0, '-52.663')] [2022-07-09 03:48:06,710][26022] Updated weights on worker 0-0, policy_version 74228 (0.00093) [2022-07-09 03:48:08,541][26022] Updated weights on worker 0-0, policy_version 74238 (0.00090) [2022-07-09 03:48:10,184][26022] Updated weights on worker 0-0, policy_version 74248 (0.00092) [2022-07-09 03:48:11,114][25689] Fps is (10 sec: 5775.2, 60 sec: 5771.6, 300 sec: 5755.7). Total num frames: 76035072. Throughput: 0: 5900.7. Samples: 76033142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:11,119][25689] Avg episode reward: [(0, '-52.645')] [2022-07-09 03:48:12,090][26022] Updated weights on worker 0-0, policy_version 74258 (0.00095) [2022-07-09 03:48:13,818][26022] Updated weights on worker 0-0, policy_version 74268 (0.00066) [2022-07-09 03:48:15,704][26022] Updated weights on worker 0-0, policy_version 74278 (0.00089) [2022-07-09 03:48:16,206][25689] Fps is (10 sec: 5748.2, 60 sec: 5751.2, 300 sec: 5754.0). Total num frames: 76062720. Throughput: 0: 5904.5. Samples: 76067980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:16,207][25689] Avg episode reward: [(0, '-52.380')] [2022-07-09 03:48:17,180][26022] Updated weights on worker 0-0, policy_version 74288 (0.00085) [2022-07-09 03:48:19,239][26022] Updated weights on worker 0-0, policy_version 74298 (0.00051) [2022-07-09 03:48:20,814][26022] Updated weights on worker 0-0, policy_version 74308 (0.00090) [2022-07-09 03:48:21,244][25689] Fps is (10 sec: 5761.4, 60 sec: 5765.9, 300 sec: 5760.5). Total num frames: 76093440. Throughput: 0: 5057.5. Samples: 76085408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:21,247][25689] Avg episode reward: [(0, '-52.893')] [2022-07-09 03:48:22,708][26022] Updated weights on worker 0-0, policy_version 74318 (0.00089) [2022-07-09 03:48:22,941][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:48:22,954][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000074319_76102656.pth [2022-07-09 03:48:22,959][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000072295_74030080.pth [2022-07-09 03:48:24,487][26022] Updated weights on worker 0-0, policy_version 74328 (0.00089) [2022-07-09 03:48:26,043][26022] Updated weights on worker 0-0, policy_version 74338 (0.00093) [2022-07-09 03:48:26,258][25689] Fps is (10 sec: 5908.4, 60 sec: 5765.5, 300 sec: 5757.1). Total num frames: 76122112. Throughput: 0: 6052.7. Samples: 76120482. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:26,260][25689] Avg episode reward: [(0, '-54.560')] [2022-07-09 03:48:28,139][26022] Updated weights on worker 0-0, policy_version 74348 (0.00094) [2022-07-09 03:48:29,815][26022] Updated weights on worker 0-0, policy_version 74358 (0.00095) [2022-07-09 03:48:31,320][25689] Fps is (10 sec: 5589.8, 60 sec: 5731.9, 300 sec: 5756.9). Total num frames: 76149760. Throughput: 0: 6011.7. Samples: 76154618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:31,320][25689] Avg episode reward: [(0, '-55.066')] [2022-07-09 03:48:31,574][26022] Updated weights on worker 0-0, policy_version 74368 (0.00091) [2022-07-09 03:48:33,247][26022] Updated weights on worker 0-0, policy_version 74378 (0.00113) [2022-07-09 03:48:35,279][26022] Updated weights on worker 0-0, policy_version 74388 (0.00093) [2022-07-09 03:48:36,370][25689] Fps is (10 sec: 5671.2, 60 sec: 5754.1, 300 sec: 5753.0). Total num frames: 76179456. Throughput: 0: 5998.1. Samples: 76188922. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:36,370][25689] Avg episode reward: [(0, '-55.795')] [2022-07-09 03:48:36,939][26022] Updated weights on worker 0-0, policy_version 74398 (0.00086) [2022-07-09 03:48:38,835][26022] Updated weights on worker 0-0, policy_version 74408 (0.00093) [2022-07-09 03:48:40,582][26022] Updated weights on worker 0-0, policy_version 74418 (0.00087) [2022-07-09 03:48:41,373][25689] Fps is (10 sec: 5704.3, 60 sec: 5707.3, 300 sec: 5749.6). Total num frames: 76207104. Throughput: 0: 5997.7. Samples: 76206132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:41,373][25689] Avg episode reward: [(0, '-55.613')] [2022-07-09 03:48:42,206][26022] Updated weights on worker 0-0, policy_version 74428 (0.00085) [2022-07-09 03:48:44,022][26022] Updated weights on worker 0-0, policy_version 74438 (0.00089) [2022-07-09 03:48:45,775][26022] Updated weights on worker 0-0, policy_version 74448 (0.00091) [2022-07-09 03:48:46,392][25689] Fps is (10 sec: 5721.7, 60 sec: 5726.7, 300 sec: 5752.7). Total num frames: 76236800. Throughput: 0: 5977.8. Samples: 76240836. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:46,393][25689] Avg episode reward: [(0, '-56.228')] [2022-07-09 03:48:47,559][26022] Updated weights on worker 0-0, policy_version 74458 (0.00086) [2022-07-09 03:48:49,449][26022] Updated weights on worker 0-0, policy_version 74468 (0.00086) [2022-07-09 03:48:51,111][26022] Updated weights on worker 0-0, policy_version 74478 (0.00091) [2022-07-09 03:48:51,425][25689] Fps is (10 sec: 5908.5, 60 sec: 5730.7, 300 sec: 5757.1). Total num frames: 76266496. Throughput: 0: 6012.5. Samples: 76275498. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:51,425][25689] Avg episode reward: [(0, '-55.471')] [2022-07-09 03:48:52,924][26022] Updated weights on worker 0-0, policy_version 74488 (0.00089) [2022-07-09 03:48:54,701][26022] Updated weights on worker 0-0, policy_version 74498 (0.00618) [2022-07-09 03:48:56,365][26022] Updated weights on worker 0-0, policy_version 74508 (0.00096) [2022-07-09 03:48:56,563][25689] Fps is (10 sec: 5839.3, 60 sec: 5709.8, 300 sec: 5755.2). Total num frames: 76296192. Throughput: 0: 5160.3. Samples: 76293126. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:48:56,563][25689] Avg episode reward: [(0, '-55.468')] [2022-07-09 03:48:58,247][26022] Updated weights on worker 0-0, policy_version 74518 (0.00091) [2022-07-09 03:49:00,098][26022] Updated weights on worker 0-0, policy_version 74528 (0.00086) [2022-07-09 03:49:01,573][25689] Fps is (10 sec: 5751.2, 60 sec: 5748.4, 300 sec: 5762.1). Total num frames: 76324864. Throughput: 0: 6012.8. Samples: 76327596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:49:01,574][25689] Avg episode reward: [(0, '-54.836')] [2022-07-09 03:49:01,741][26022] Updated weights on worker 0-0, policy_version 74538 (0.00084) [2022-07-09 03:49:03,975][26022] Updated weights on worker 0-0, policy_version 74548 (0.00101) [2022-07-09 03:49:05,928][26022] Updated weights on worker 0-0, policy_version 74558 (0.00087) [2022-07-09 03:49:06,597][25689] Fps is (10 sec: 5612.9, 60 sec: 5750.5, 300 sec: 5758.3). Total num frames: 76352512. Throughput: 0: 5920.5. Samples: 76360460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:49:06,597][25689] Avg episode reward: [(0, '-55.262')] [2022-07-09 03:49:07,547][26022] Updated weights on worker 0-0, policy_version 74568 (0.00088) [2022-07-09 03:49:09,268][26022] Updated weights on worker 0-0, policy_version 74578 (0.00089) [2022-07-09 03:49:10,995][26022] Updated weights on worker 0-0, policy_version 74588 (0.00094) [2022-07-09 03:49:11,655][25689] Fps is (10 sec: 5586.4, 60 sec: 5716.9, 300 sec: 5753.2). Total num frames: 76381184. Throughput: 0: 5064.5. Samples: 76377956. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:49:11,656][25689] Avg episode reward: [(0, '-54.710')] [2022-07-09 03:49:12,788][26022] Updated weights on worker 0-0, policy_version 74598 (0.00098) [2022-07-09 03:49:14,683][26022] Updated weights on worker 0-0, policy_version 74608 (0.00089) [2022-07-09 03:49:16,326][26022] Updated weights on worker 0-0, policy_version 74618 (0.00087) [2022-07-09 03:49:16,725][25689] Fps is (10 sec: 5762.7, 60 sec: 5752.9, 300 sec: 5758.8). Total num frames: 76410880. Throughput: 0: 5920.9. Samples: 76412506. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 03:49:16,726][25689] Avg episode reward: [(0, '-54.133')] [2022-07-09 03:49:18,229][26022] Updated weights on worker 0-0, policy_version 74628 (0.00089) [2022-07-09 03:49:19,795][26022] Updated weights on worker 0-0, policy_version 74638 (0.00082) [2022-07-09 03:49:21,750][25689] Fps is (10 sec: 5680.6, 60 sec: 5703.4, 300 sec: 5752.5). Total num frames: 76438528. Throughput: 0: 5933.7. Samples: 76447316. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:21,750][25689] Avg episode reward: [(0, '-54.414')] [2022-07-09 03:49:21,764][26022] Updated weights on worker 0-0, policy_version 74648 (0.00093) [2022-07-09 03:49:23,524][26022] Updated weights on worker 0-0, policy_version 74658 (0.00080) [2022-07-09 03:49:25,224][26022] Updated weights on worker 0-0, policy_version 74668 (0.00091) [2022-07-09 03:49:26,763][25689] Fps is (10 sec: 5712.8, 60 sec: 5720.4, 300 sec: 5754.0). Total num frames: 76468224. Throughput: 0: 5177.7. Samples: 76464876. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:26,764][25689] Avg episode reward: [(0, '-53.931')] [2022-07-09 03:49:26,981][26022] Updated weights on worker 0-0, policy_version 74678 (0.00089) [2022-07-09 03:49:28,700][26022] Updated weights on worker 0-0, policy_version 74688 (0.00087) [2022-07-09 03:49:30,418][26022] Updated weights on worker 0-0, policy_version 74698 (0.00085) [2022-07-09 03:49:31,775][25689] Fps is (10 sec: 5924.4, 60 sec: 5759.0, 300 sec: 5758.3). Total num frames: 76497920. Throughput: 0: 6048.3. Samples: 76499646. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:31,775][25689] Avg episode reward: [(0, '-54.044')] [2022-07-09 03:49:32,254][26022] Updated weights on worker 0-0, policy_version 74708 (0.00090) [2022-07-09 03:49:33,873][26022] Updated weights on worker 0-0, policy_version 74718 (0.00082) [2022-07-09 03:49:35,779][26022] Updated weights on worker 0-0, policy_version 74728 (0.00084) [2022-07-09 03:49:36,886][25689] Fps is (10 sec: 5867.5, 60 sec: 5753.2, 300 sec: 5760.5). Total num frames: 76527616. Throughput: 0: 6076.9. Samples: 76535016. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:36,886][25689] Avg episode reward: [(0, '-54.920')] [2022-07-09 03:49:37,435][26022] Updated weights on worker 0-0, policy_version 74738 (0.01239) [2022-07-09 03:49:39,054][26022] Updated weights on worker 0-0, policy_version 74748 (0.00085) [2022-07-09 03:49:40,956][26022] Updated weights on worker 0-0, policy_version 74758 (0.00095) [2022-07-09 03:49:41,897][25689] Fps is (10 sec: 5867.4, 60 sec: 5786.2, 300 sec: 5761.1). Total num frames: 76557312. Throughput: 0: 5227.7. Samples: 76552640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:41,898][25689] Avg episode reward: [(0, '-55.478')] [2022-07-09 03:49:42,718][26022] Updated weights on worker 0-0, policy_version 74768 (0.00764) [2022-07-09 03:49:44,553][26022] Updated weights on worker 0-0, policy_version 74778 (0.00084) [2022-07-09 03:49:46,231][26022] Updated weights on worker 0-0, policy_version 74788 (0.00105) [2022-07-09 03:49:46,964][25689] Fps is (10 sec: 5892.8, 60 sec: 5781.7, 300 sec: 5764.0). Total num frames: 76587008. Throughput: 0: 6062.9. Samples: 76587352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:46,965][25689] Avg episode reward: [(0, '-55.917')] [2022-07-09 03:49:48,039][26022] Updated weights on worker 0-0, policy_version 74798 (0.00102) [2022-07-09 03:49:49,863][26022] Updated weights on worker 0-0, policy_version 74808 (0.00084) [2022-07-09 03:49:51,556][26022] Updated weights on worker 0-0, policy_version 74818 (0.00080) [2022-07-09 03:49:51,988][25689] Fps is (10 sec: 5885.9, 60 sec: 5782.5, 300 sec: 5761.3). Total num frames: 76616704. Throughput: 0: 6065.1. Samples: 76622240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:51,988][25689] Avg episode reward: [(0, '-56.480')] [2022-07-09 03:49:53,466][26022] Updated weights on worker 0-0, policy_version 74828 (0.00088) [2022-07-09 03:49:55,087][26022] Updated weights on worker 0-0, policy_version 74838 (0.00092) [2022-07-09 03:49:56,724][26022] Updated weights on worker 0-0, policy_version 74848 (0.00091) [2022-07-09 03:49:57,050][25689] Fps is (10 sec: 5787.0, 60 sec: 5772.8, 300 sec: 5760.2). Total num frames: 76645376. Throughput: 0: 5190.7. Samples: 76639686. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:49:57,052][25689] Avg episode reward: [(0, '-56.426')] [2022-07-09 03:49:58,745][26022] Updated weights on worker 0-0, policy_version 74858 (0.00088) [2022-07-09 03:50:00,366][26022] Updated weights on worker 0-0, policy_version 74868 (0.00444) [2022-07-09 03:50:02,074][25689] Fps is (10 sec: 5482.1, 60 sec: 5737.7, 300 sec: 5760.0). Total num frames: 76672000. Throughput: 0: 6025.3. Samples: 76674214. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:02,079][25689] Avg episode reward: [(0, '-56.805')] [2022-07-09 03:50:02,664][26022] Updated weights on worker 0-0, policy_version 74878 (0.00090) [2022-07-09 03:50:04,310][26022] Updated weights on worker 0-0, policy_version 74888 (0.00086) [2022-07-09 03:50:06,163][26022] Updated weights on worker 0-0, policy_version 74898 (0.00083) [2022-07-09 03:50:07,099][25689] Fps is (10 sec: 5604.8, 60 sec: 5771.5, 300 sec: 5759.7). Total num frames: 76701696. Throughput: 0: 5954.8. Samples: 76707250. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:07,099][25689] Avg episode reward: [(0, '-55.018')] [2022-07-09 03:50:07,660][26022] Updated weights on worker 0-0, policy_version 74908 (0.00090) [2022-07-09 03:50:09,549][26022] Updated weights on worker 0-0, policy_version 74918 (0.00086) [2022-07-09 03:50:11,351][26022] Updated weights on worker 0-0, policy_version 74928 (0.00085) [2022-07-09 03:50:12,193][25689] Fps is (10 sec: 5869.5, 60 sec: 5784.9, 300 sec: 5759.8). Total num frames: 76731392. Throughput: 0: 5074.1. Samples: 76724764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:12,195][25689] Avg episode reward: [(0, '-54.842')] [2022-07-09 03:50:13,150][26022] Updated weights on worker 0-0, policy_version 74938 (0.00092) [2022-07-09 03:50:14,663][26022] Updated weights on worker 0-0, policy_version 74948 (0.00087) [2022-07-09 03:50:16,772][26022] Updated weights on worker 0-0, policy_version 74958 (0.00082) [2022-07-09 03:50:17,326][25689] Fps is (10 sec: 5707.0, 60 sec: 5762.0, 300 sec: 5760.9). Total num frames: 76760064. Throughput: 0: 5930.3. Samples: 76759928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:17,327][25689] Avg episode reward: [(0, '-54.526')] [2022-07-09 03:50:17,980][26022] Updated weights on worker 0-0, policy_version 74968 (0.00085) [2022-07-09 03:50:20,268][26022] Updated weights on worker 0-0, policy_version 74978 (0.00051) [2022-07-09 03:50:21,520][26022] Updated weights on worker 0-0, policy_version 74988 (0.00083) [2022-07-09 03:50:22,350][25689] Fps is (10 sec: 5847.3, 60 sec: 5812.8, 300 sec: 5761.8). Total num frames: 76790784. Throughput: 0: 5980.4. Samples: 76795472. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:22,351][25689] Avg episode reward: [(0, '-54.243')] [2022-07-09 03:50:23,035][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:50:23,041][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000074995_76794880.pth [2022-07-09 03:50:23,051][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000072969_74720256.pth [2022-07-09 03:50:23,494][26022] Updated weights on worker 0-0, policy_version 74998 (0.00087) [2022-07-09 03:50:25,163][26022] Updated weights on worker 0-0, policy_version 75008 (0.00093) [2022-07-09 03:50:27,019][26022] Updated weights on worker 0-0, policy_version 75018 (0.00088) [2022-07-09 03:50:27,381][25689] Fps is (10 sec: 5906.7, 60 sec: 5794.2, 300 sec: 5764.9). Total num frames: 76819456. Throughput: 0: 5221.2. Samples: 76813150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:27,382][25689] Avg episode reward: [(0, '-53.203')] [2022-07-09 03:50:28,700][26022] Updated weights on worker 0-0, policy_version 75028 (0.00095) [2022-07-09 03:50:30,584][26022] Updated weights on worker 0-0, policy_version 75038 (0.00096) [2022-07-09 03:50:32,209][26022] Updated weights on worker 0-0, policy_version 75048 (0.00085) [2022-07-09 03:50:32,394][25689] Fps is (10 sec: 5913.4, 60 sec: 5811.0, 300 sec: 5770.0). Total num frames: 76850176. Throughput: 0: 6102.8. Samples: 76848046. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:32,394][25689] Avg episode reward: [(0, '-53.500')] [2022-07-09 03:50:34,285][26022] Updated weights on worker 0-0, policy_version 75058 (0.00092) [2022-07-09 03:50:35,862][26022] Updated weights on worker 0-0, policy_version 75068 (0.00086) [2022-07-09 03:50:37,439][25689] Fps is (10 sec: 5803.4, 60 sec: 5783.5, 300 sec: 5763.4). Total num frames: 76877824. Throughput: 0: 6109.2. Samples: 76882800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:37,439][25689] Avg episode reward: [(0, '-52.747')] [2022-07-09 03:50:37,689][26022] Updated weights on worker 0-0, policy_version 75078 (0.00083) [2022-07-09 03:50:39,315][26022] Updated weights on worker 0-0, policy_version 75088 (0.00094) [2022-07-09 03:50:41,187][26022] Updated weights on worker 0-0, policy_version 75098 (0.00089) [2022-07-09 03:50:42,448][25689] Fps is (10 sec: 5703.3, 60 sec: 5783.7, 300 sec: 5767.5). Total num frames: 76907520. Throughput: 0: 5220.8. Samples: 76900400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:42,449][25689] Avg episode reward: [(0, '-52.328')] [2022-07-09 03:50:42,744][26022] Updated weights on worker 0-0, policy_version 75108 (0.00092) [2022-07-09 03:50:44,574][26022] Updated weights on worker 0-0, policy_version 75118 (0.00090) [2022-07-09 03:50:46,342][26022] Updated weights on worker 0-0, policy_version 75128 (0.00319) [2022-07-09 03:50:47,474][25689] Fps is (10 sec: 5918.6, 60 sec: 5787.7, 300 sec: 5767.6). Total num frames: 76937216. Throughput: 0: 6104.2. Samples: 76935798. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 03:50:47,475][25689] Avg episode reward: [(0, '-52.521')] [2022-07-09 03:50:48,248][26022] Updated weights on worker 0-0, policy_version 75138 (0.00079) [2022-07-09 03:50:49,842][26022] Updated weights on worker 0-0, policy_version 75148 (0.00050) [2022-07-09 03:50:51,670][26022] Updated weights on worker 0-0, policy_version 75158 (0.00091) [2022-07-09 03:50:52,507][25689] Fps is (10 sec: 5802.9, 60 sec: 5769.8, 300 sec: 5769.6). Total num frames: 76965888. Throughput: 0: 6092.0. Samples: 76970574. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:50:52,507][25689] Avg episode reward: [(0, '-52.851')] [2022-07-09 03:50:53,308][26022] Updated weights on worker 0-0, policy_version 75168 (0.00086) [2022-07-09 03:50:55,223][26022] Updated weights on worker 0-0, policy_version 75178 (0.00088) [2022-07-09 03:50:56,873][26022] Updated weights on worker 0-0, policy_version 75188 (0.00084) [2022-07-09 03:50:57,558][25689] Fps is (10 sec: 5787.7, 60 sec: 5787.8, 300 sec: 5765.6). Total num frames: 76995584. Throughput: 0: 5236.1. Samples: 76988148. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:50:57,559][25689] Avg episode reward: [(0, '-52.354')] [2022-07-09 03:50:58,781][26022] Updated weights on worker 0-0, policy_version 75198 (0.00091) [2022-07-09 03:51:00,545][26022] Updated weights on worker 0-0, policy_version 75208 (0.00087) [2022-07-09 03:51:02,518][26022] Updated weights on worker 0-0, policy_version 75218 (0.00093) [2022-07-09 03:51:02,571][25689] Fps is (10 sec: 5697.5, 60 sec: 5805.8, 300 sec: 5776.1). Total num frames: 77023232. Throughput: 0: 6104.8. Samples: 77023248. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:02,572][25689] Avg episode reward: [(0, '-53.254')] [2022-07-09 03:51:04,295][26022] Updated weights on worker 0-0, policy_version 75228 (0.00087) [2022-07-09 03:51:06,074][26022] Updated weights on worker 0-0, policy_version 75238 (0.00082) [2022-07-09 03:51:07,590][25689] Fps is (10 sec: 5614.1, 60 sec: 5789.4, 300 sec: 5772.9). Total num frames: 77051904. Throughput: 0: 5975.8. Samples: 77056010. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:07,591][25689] Avg episode reward: [(0, '-53.241')] [2022-07-09 03:51:07,806][26022] Updated weights on worker 0-0, policy_version 75248 (0.00084) [2022-07-09 03:51:09,686][26022] Updated weights on worker 0-0, policy_version 75258 (0.00086) [2022-07-09 03:51:11,308][26022] Updated weights on worker 0-0, policy_version 75268 (0.00093) [2022-07-09 03:51:12,592][25689] Fps is (10 sec: 5824.8, 60 sec: 5798.2, 300 sec: 5771.0). Total num frames: 77081600. Throughput: 0: 5126.2. Samples: 77073536. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:12,592][25689] Avg episode reward: [(0, '-53.491')] [2022-07-09 03:51:13,179][26022] Updated weights on worker 0-0, policy_version 75278 (0.00542) [2022-07-09 03:51:14,895][26022] Updated weights on worker 0-0, policy_version 75288 (0.00086) [2022-07-09 03:51:16,669][26022] Updated weights on worker 0-0, policy_version 75298 (0.00092) [2022-07-09 03:51:17,648][25689] Fps is (10 sec: 5905.1, 60 sec: 5822.6, 300 sec: 5774.4). Total num frames: 77111296. Throughput: 0: 6004.7. Samples: 77108780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:17,649][25689] Avg episode reward: [(0, '-53.458')] [2022-07-09 03:51:18,275][26022] Updated weights on worker 0-0, policy_version 75308 (0.00096) [2022-07-09 03:51:20,060][26022] Updated weights on worker 0-0, policy_version 75318 (0.00091) [2022-07-09 03:51:21,943][26022] Updated weights on worker 0-0, policy_version 75328 (0.00084) [2022-07-09 03:51:22,655][25689] Fps is (10 sec: 5800.5, 60 sec: 5790.4, 300 sec: 5771.0). Total num frames: 77139968. Throughput: 0: 6002.9. Samples: 77143806. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:22,657][25689] Avg episode reward: [(0, '-54.252')] [2022-07-09 03:51:23,447][26022] Updated weights on worker 0-0, policy_version 75338 (0.00089) [2022-07-09 03:51:25,426][26022] Updated weights on worker 0-0, policy_version 75348 (0.00086) [2022-07-09 03:51:27,258][26022] Updated weights on worker 0-0, policy_version 75358 (0.00083) [2022-07-09 03:51:27,674][25689] Fps is (10 sec: 5821.8, 60 sec: 5808.5, 300 sec: 5777.5). Total num frames: 77169664. Throughput: 0: 6114.2. Samples: 77178804. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:27,674][25689] Avg episode reward: [(0, '-54.975')] [2022-07-09 03:51:28,852][26022] Updated weights on worker 0-0, policy_version 75368 (0.00089) [2022-07-09 03:51:30,672][26022] Updated weights on worker 0-0, policy_version 75378 (0.00085) [2022-07-09 03:51:32,412][26022] Updated weights on worker 0-0, policy_version 75388 (0.00081) [2022-07-09 03:51:32,679][25689] Fps is (10 sec: 5720.6, 60 sec: 5758.3, 300 sec: 5769.9). Total num frames: 77197312. Throughput: 0: 6111.4. Samples: 77196294. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:32,679][25689] Avg episode reward: [(0, '-54.513')] [2022-07-09 03:51:34,131][26022] Updated weights on worker 0-0, policy_version 75398 (0.00111) [2022-07-09 03:51:36,129][26022] Updated weights on worker 0-0, policy_version 75408 (0.00091) [2022-07-09 03:51:37,750][25689] Fps is (10 sec: 5691.1, 60 sec: 5789.7, 300 sec: 5769.0). Total num frames: 77227008. Throughput: 0: 6072.6. Samples: 77230850. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:37,750][25689] Avg episode reward: [(0, '-53.996')] [2022-07-09 03:51:37,935][26022] Updated weights on worker 0-0, policy_version 75418 (0.00114) [2022-07-09 03:51:39,412][26022] Updated weights on worker 0-0, policy_version 75428 (0.00089) [2022-07-09 03:51:41,308][26022] Updated weights on worker 0-0, policy_version 75438 (0.00091) [2022-07-09 03:51:42,765][25689] Fps is (10 sec: 5888.2, 60 sec: 5789.1, 300 sec: 5775.6). Total num frames: 77256704. Throughput: 0: 6076.9. Samples: 77266018. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:42,766][25689] Avg episode reward: [(0, '-54.255')] [2022-07-09 03:51:43,125][26022] Updated weights on worker 0-0, policy_version 75448 (0.00084) [2022-07-09 03:51:44,793][26022] Updated weights on worker 0-0, policy_version 75458 (0.00085) [2022-07-09 03:51:46,672][26022] Updated weights on worker 0-0, policy_version 75468 (0.00098) [2022-07-09 03:51:47,812][25689] Fps is (10 sec: 5902.7, 60 sec: 5787.1, 300 sec: 5772.1). Total num frames: 77286400. Throughput: 0: 5199.6. Samples: 77283514. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:47,812][25689] Avg episode reward: [(0, '-53.831')] [2022-07-09 03:51:48,345][26022] Updated weights on worker 0-0, policy_version 75478 (0.00096) [2022-07-09 03:51:50,112][26022] Updated weights on worker 0-0, policy_version 75488 (0.00095) [2022-07-09 03:51:51,974][26022] Updated weights on worker 0-0, policy_version 75498 (0.00080) [2022-07-09 03:51:52,815][25689] Fps is (10 sec: 5808.1, 60 sec: 5790.0, 300 sec: 5766.9). Total num frames: 77315072. Throughput: 0: 6069.1. Samples: 77318502. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:52,815][25689] Avg episode reward: [(0, '-54.803')] [2022-07-09 03:51:53,504][26022] Updated weights on worker 0-0, policy_version 75508 (0.00082) [2022-07-09 03:51:55,545][26022] Updated weights on worker 0-0, policy_version 75518 (0.00095) [2022-07-09 03:51:57,144][26022] Updated weights on worker 0-0, policy_version 75528 (0.00079) [2022-07-09 03:51:57,882][25689] Fps is (10 sec: 5592.6, 60 sec: 5754.6, 300 sec: 5762.4). Total num frames: 77342720. Throughput: 0: 6080.3. Samples: 77353260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:51:57,884][25689] Avg episode reward: [(0, '-53.900')] [2022-07-09 03:51:58,932][26022] Updated weights on worker 0-0, policy_version 75538 (0.00093) [2022-07-09 03:52:00,731][26022] Updated weights on worker 0-0, policy_version 75548 (0.00087) [2022-07-09 03:52:02,603][26022] Updated weights on worker 0-0, policy_version 75558 (0.00081) [2022-07-09 03:52:02,912][25689] Fps is (10 sec: 5679.2, 60 sec: 5786.9, 300 sec: 5772.4). Total num frames: 77372416. Throughput: 0: 5209.2. Samples: 77370966. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:52:02,912][25689] Avg episode reward: [(0, '-54.297')] [2022-07-09 03:52:04,608][26022] Updated weights on worker 0-0, policy_version 75568 (0.00088) [2022-07-09 03:52:06,194][26022] Updated weights on worker 0-0, policy_version 75578 (0.00081) [2022-07-09 03:52:07,932][25689] Fps is (10 sec: 5705.8, 60 sec: 5769.8, 300 sec: 5765.4). Total num frames: 77400064. Throughput: 0: 5959.9. Samples: 77403430. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:52:07,933][25689] Avg episode reward: [(0, '-54.217')] [2022-07-09 03:52:08,033][26022] Updated weights on worker 0-0, policy_version 75588 (0.00095) [2022-07-09 03:52:10,043][26022] Updated weights on worker 0-0, policy_version 75598 (0.00087) [2022-07-09 03:52:11,505][26022] Updated weights on worker 0-0, policy_version 75608 (0.00088) [2022-07-09 03:52:12,939][25689] Fps is (10 sec: 5617.0, 60 sec: 5752.4, 300 sec: 5766.4). Total num frames: 77428736. Throughput: 0: 5955.6. Samples: 77438352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:52:12,939][25689] Avg episode reward: [(0, '-53.958')] [2022-07-09 03:52:13,463][26022] Updated weights on worker 0-0, policy_version 75618 (0.00090) [2022-07-09 03:52:15,159][26022] Updated weights on worker 0-0, policy_version 75628 (0.00087) [2022-07-09 03:52:17,016][26022] Updated weights on worker 0-0, policy_version 75638 (0.00092) [2022-07-09 03:52:18,077][25689] Fps is (10 sec: 5854.6, 60 sec: 5761.5, 300 sec: 5767.5). Total num frames: 77459456. Throughput: 0: 5079.7. Samples: 77455842. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 03:52:18,079][25689] Avg episode reward: [(0, '-53.710')] [2022-07-09 03:52:18,638][26022] Updated weights on worker 0-0, policy_version 75648 (0.00100) [2022-07-09 03:52:20,621][26022] Updated weights on worker 0-0, policy_version 75658 (0.00089) [2022-07-09 03:52:22,179][26022] Updated weights on worker 0-0, policy_version 75668 (0.00050) [2022-07-09 03:52:23,113][25689] Fps is (10 sec: 5837.3, 60 sec: 5758.6, 300 sec: 5766.9). Total num frames: 77488128. Throughput: 0: 5933.2. Samples: 77490824. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:23,114][25689] Avg episode reward: [(0, '-53.600')] [2022-07-09 03:52:23,119][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:52:23,127][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000075673_77489152.pth [2022-07-09 03:52:23,128][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000073642_75409408.pth [2022-07-09 03:52:23,908][26022] Updated weights on worker 0-0, policy_version 75678 (0.00085) [2022-07-09 03:52:25,678][26022] Updated weights on worker 0-0, policy_version 75688 (0.00090) [2022-07-09 03:52:27,486][26022] Updated weights on worker 0-0, policy_version 75698 (0.00088) [2022-07-09 03:52:28,129][25689] Fps is (10 sec: 5806.5, 60 sec: 5759.0, 300 sec: 5767.8). Total num frames: 77517824. Throughput: 0: 6040.6. Samples: 77525430. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:28,129][25689] Avg episode reward: [(0, '-54.040')] [2022-07-09 03:52:29,315][26022] Updated weights on worker 0-0, policy_version 75708 (0.00096) [2022-07-09 03:52:31,093][26022] Updated weights on worker 0-0, policy_version 75718 (0.00087) [2022-07-09 03:52:32,819][26022] Updated weights on worker 0-0, policy_version 75728 (0.00090) [2022-07-09 03:52:33,141][25689] Fps is (10 sec: 5820.4, 60 sec: 5775.2, 300 sec: 5769.7). Total num frames: 77546496. Throughput: 0: 5172.6. Samples: 77542852. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:33,142][25689] Avg episode reward: [(0, '-53.421')] [2022-07-09 03:52:34,768][26022] Updated weights on worker 0-0, policy_version 75738 (0.00091) [2022-07-09 03:52:36,470][26022] Updated weights on worker 0-0, policy_version 75748 (0.00084) [2022-07-09 03:52:38,122][26022] Updated weights on worker 0-0, policy_version 75758 (0.00081) [2022-07-09 03:52:38,210][25689] Fps is (10 sec: 5789.7, 60 sec: 5775.4, 300 sec: 5765.7). Total num frames: 77576192. Throughput: 0: 6034.0. Samples: 77577328. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:38,211][25689] Avg episode reward: [(0, '-53.972')] [2022-07-09 03:52:39,866][26022] Updated weights on worker 0-0, policy_version 75768 (0.00088) [2022-07-09 03:52:41,611][26022] Updated weights on worker 0-0, policy_version 75778 (0.00085) [2022-07-09 03:52:43,218][25689] Fps is (10 sec: 5894.0, 60 sec: 5776.2, 300 sec: 5769.9). Total num frames: 77605888. Throughput: 0: 6060.9. Samples: 77612678. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:43,219][25689] Avg episode reward: [(0, '-53.927')] [2022-07-09 03:52:43,420][26022] Updated weights on worker 0-0, policy_version 75788 (0.00089) [2022-07-09 03:52:45,162][26022] Updated weights on worker 0-0, policy_version 75798 (0.00083) [2022-07-09 03:52:46,729][26022] Updated weights on worker 0-0, policy_version 75808 (0.00079) [2022-07-09 03:52:48,246][25689] Fps is (10 sec: 5815.8, 60 sec: 5760.9, 300 sec: 5767.4). Total num frames: 77634560. Throughput: 0: 5208.6. Samples: 77630214. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:48,247][25689] Avg episode reward: [(0, '-54.564')] [2022-07-09 03:52:48,894][26022] Updated weights on worker 0-0, policy_version 75818 (0.00087) [2022-07-09 03:52:50,312][26022] Updated weights on worker 0-0, policy_version 75828 (0.00089) [2022-07-09 03:52:52,255][26022] Updated weights on worker 0-0, policy_version 75838 (0.00084) [2022-07-09 03:52:53,277][25689] Fps is (10 sec: 5700.9, 60 sec: 5758.3, 300 sec: 5761.8). Total num frames: 77663232. Throughput: 0: 6069.2. Samples: 77665058. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:53,277][25689] Avg episode reward: [(0, '-53.697')] [2022-07-09 03:52:53,884][26022] Updated weights on worker 0-0, policy_version 75848 (0.00082) [2022-07-09 03:52:55,855][26022] Updated weights on worker 0-0, policy_version 75858 (0.00098) [2022-07-09 03:52:57,475][26022] Updated weights on worker 0-0, policy_version 75868 (0.00092) [2022-07-09 03:52:58,375][25689] Fps is (10 sec: 5965.1, 60 sec: 5823.1, 300 sec: 5778.3). Total num frames: 77694976. Throughput: 0: 6092.8. Samples: 77700186. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:52:58,375][25689] Avg episode reward: [(0, '-54.019')] [2022-07-09 03:52:59,395][26022] Updated weights on worker 0-0, policy_version 75878 (0.00086) [2022-07-09 03:53:00,911][26022] Updated weights on worker 0-0, policy_version 75888 (0.00086) [2022-07-09 03:53:03,375][26022] Updated weights on worker 0-0, policy_version 75898 (0.00088) [2022-07-09 03:53:03,469][25689] Fps is (10 sec: 5525.8, 60 sec: 5732.4, 300 sec: 5767.0). Total num frames: 77719552. Throughput: 0: 5182.0. Samples: 77717620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:03,470][25689] Avg episode reward: [(0, '-53.409')] [2022-07-09 03:53:04,731][26022] Updated weights on worker 0-0, policy_version 75908 (0.00088) [2022-07-09 03:53:06,942][26022] Updated weights on worker 0-0, policy_version 75918 (0.00082) [2022-07-09 03:53:08,340][26022] Updated weights on worker 0-0, policy_version 75928 (0.00094) [2022-07-09 03:53:08,508][25689] Fps is (10 sec: 5456.7, 60 sec: 5781.3, 300 sec: 5767.4). Total num frames: 77750272. Throughput: 0: 5891.1. Samples: 77749578. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:08,509][25689] Avg episode reward: [(0, '-53.454')] [2022-07-09 03:53:10,295][26022] Updated weights on worker 0-0, policy_version 75938 (0.00080) [2022-07-09 03:53:12,110][26022] Updated weights on worker 0-0, policy_version 75948 (0.00089) [2022-07-09 03:53:13,558][25689] Fps is (10 sec: 5886.6, 60 sec: 5777.1, 300 sec: 5771.7). Total num frames: 77778944. Throughput: 0: 5899.4. Samples: 77784708. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:13,559][25689] Avg episode reward: [(0, '-53.280')] [2022-07-09 03:53:13,675][26022] Updated weights on worker 0-0, policy_version 75958 (0.00082) [2022-07-09 03:53:15,458][26022] Updated weights on worker 0-0, policy_version 75968 (0.00083) [2022-07-09 03:53:17,508][26022] Updated weights on worker 0-0, policy_version 75978 (0.00093) [2022-07-09 03:53:18,624][25689] Fps is (10 sec: 5770.2, 60 sec: 5767.2, 300 sec: 5767.7). Total num frames: 77808640. Throughput: 0: 5038.7. Samples: 77802214. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:18,624][25689] Avg episode reward: [(0, '-52.956')] [2022-07-09 03:53:18,897][26022] Updated weights on worker 0-0, policy_version 75988 (0.00090) [2022-07-09 03:53:21,100][26022] Updated weights on worker 0-0, policy_version 75998 (0.00806) [2022-07-09 03:53:22,503][26022] Updated weights on worker 0-0, policy_version 76008 (0.00087) [2022-07-09 03:53:23,722][25689] Fps is (10 sec: 5742.8, 60 sec: 5761.3, 300 sec: 5766.1). Total num frames: 77837312. Throughput: 0: 5886.6. Samples: 77836842. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:23,723][25689] Avg episode reward: [(0, '-53.340')] [2022-07-09 03:53:24,495][26022] Updated weights on worker 0-0, policy_version 76018 (0.00993) [2022-07-09 03:53:26,086][26022] Updated weights on worker 0-0, policy_version 76028 (0.00092) [2022-07-09 03:53:28,094][26022] Updated weights on worker 0-0, policy_version 76038 (0.00052) [2022-07-09 03:53:28,793][25689] Fps is (10 sec: 5739.8, 60 sec: 5756.1, 300 sec: 5772.8). Total num frames: 77867008. Throughput: 0: 6012.9. Samples: 77871546. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:28,793][25689] Avg episode reward: [(0, '-53.283')] [2022-07-09 03:53:29,783][26022] Updated weights on worker 0-0, policy_version 76048 (0.00081) [2022-07-09 03:53:31,581][26022] Updated weights on worker 0-0, policy_version 76058 (0.00086) [2022-07-09 03:53:33,154][26022] Updated weights on worker 0-0, policy_version 76068 (0.00093) [2022-07-09 03:53:33,857][25689] Fps is (10 sec: 5860.1, 60 sec: 5768.0, 300 sec: 5772.5). Total num frames: 77896704. Throughput: 0: 5135.6. Samples: 77888954. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:33,859][25689] Avg episode reward: [(0, '-53.851')] [2022-07-09 03:53:35,117][26022] Updated weights on worker 0-0, policy_version 76078 (0.00090) [2022-07-09 03:53:36,568][26022] Updated weights on worker 0-0, policy_version 76088 (0.00088) [2022-07-09 03:53:38,854][26022] Updated weights on worker 0-0, policy_version 76098 (0.00087) [2022-07-09 03:53:38,973][25689] Fps is (10 sec: 5632.8, 60 sec: 5729.8, 300 sec: 5770.3). Total num frames: 77924352. Throughput: 0: 5970.7. Samples: 77923714. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:38,974][25689] Avg episode reward: [(0, '-53.559')] [2022-07-09 03:53:40,312][26022] Updated weights on worker 0-0, policy_version 76108 (0.00110) [2022-07-09 03:53:42,269][26022] Updated weights on worker 0-0, policy_version 76118 (0.00083) [2022-07-09 03:53:43,882][26022] Updated weights on worker 0-0, policy_version 76128 (0.00088) [2022-07-09 03:53:44,044][25689] Fps is (10 sec: 5830.2, 60 sec: 5757.5, 300 sec: 5776.2). Total num frames: 77956096. Throughput: 0: 5986.1. Samples: 77958490. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:44,044][25689] Avg episode reward: [(0, '-53.615')] [2022-07-09 03:53:45,847][26022] Updated weights on worker 0-0, policy_version 76138 (0.00101) [2022-07-09 03:53:47,401][26022] Updated weights on worker 0-0, policy_version 76148 (0.00084) [2022-07-09 03:53:49,065][25689] Fps is (10 sec: 5884.9, 60 sec: 5741.4, 300 sec: 5769.5). Total num frames: 77983744. Throughput: 0: 6001.1. Samples: 77993204. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:49,066][25689] Avg episode reward: [(0, '-53.326')] [2022-07-09 03:53:49,319][26022] Updated weights on worker 0-0, policy_version 76158 (0.00092) [2022-07-09 03:53:50,766][26022] Updated weights on worker 0-0, policy_version 76168 (0.00088) [2022-07-09 03:53:52,940][26022] Updated weights on worker 0-0, policy_version 76178 (0.00094) [2022-07-09 03:53:54,130][25689] Fps is (10 sec: 5787.0, 60 sec: 5771.8, 300 sec: 5774.4). Total num frames: 78014464. Throughput: 0: 5998.4. Samples: 78010560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 03:53:54,131][25689] Avg episode reward: [(0, '-53.603')] [2022-07-09 03:53:54,549][26022] Updated weights on worker 0-0, policy_version 76188 (0.00095) [2022-07-09 03:53:56,492][26022] Updated weights on worker 0-0, policy_version 76198 (0.00097) [2022-07-09 03:53:58,035][26022] Updated weights on worker 0-0, policy_version 76208 (0.01056) [2022-07-09 03:53:59,226][25689] Fps is (10 sec: 5845.1, 60 sec: 5721.4, 300 sec: 5772.7). Total num frames: 78043136. Throughput: 0: 5997.0. Samples: 78045174. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:53:59,227][25689] Avg episode reward: [(0, '-54.081')] [2022-07-09 03:53:59,997][26022] Updated weights on worker 0-0, policy_version 76218 (0.00083) [2022-07-09 03:54:01,555][26022] Updated weights on worker 0-0, policy_version 76228 (0.00088) [2022-07-09 03:54:03,702][26022] Updated weights on worker 0-0, policy_version 76238 (0.00085) [2022-07-09 03:54:04,298][25689] Fps is (10 sec: 5538.8, 60 sec: 5774.1, 300 sec: 5771.8). Total num frames: 78070784. Throughput: 0: 5900.5. Samples: 78078000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:04,299][25689] Avg episode reward: [(0, '-53.186')] [2022-07-09 03:54:05,447][26022] Updated weights on worker 0-0, policy_version 76248 (0.00082) [2022-07-09 03:54:07,250][26022] Updated weights on worker 0-0, policy_version 76258 (0.00088) [2022-07-09 03:54:08,926][26022] Updated weights on worker 0-0, policy_version 76268 (0.00086) [2022-07-09 03:54:09,311][25689] Fps is (10 sec: 5585.0, 60 sec: 5742.9, 300 sec: 5772.7). Total num frames: 78099456. Throughput: 0: 5042.3. Samples: 78095286. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:09,311][25689] Avg episode reward: [(0, '-53.663')] [2022-07-09 03:54:10,831][26022] Updated weights on worker 0-0, policy_version 76278 (0.00082) [2022-07-09 03:54:12,540][26022] Updated weights on worker 0-0, policy_version 76288 (0.00085) [2022-07-09 03:54:14,356][25689] Fps is (10 sec: 5803.5, 60 sec: 5760.3, 300 sec: 5773.2). Total num frames: 78129152. Throughput: 0: 5909.4. Samples: 78130080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:14,356][25689] Avg episode reward: [(0, '-53.923')] [2022-07-09 03:54:14,367][26022] Updated weights on worker 0-0, policy_version 76298 (0.00054) [2022-07-09 03:54:16,129][26022] Updated weights on worker 0-0, policy_version 76308 (0.00087) [2022-07-09 03:54:17,779][26022] Updated weights on worker 0-0, policy_version 76318 (0.00085) [2022-07-09 03:54:19,419][25689] Fps is (10 sec: 5774.4, 60 sec: 5743.6, 300 sec: 5775.9). Total num frames: 78157824. Throughput: 0: 5942.4. Samples: 78165164. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:19,419][25689] Avg episode reward: [(0, '-54.398')] [2022-07-09 03:54:19,568][26022] Updated weights on worker 0-0, policy_version 76328 (0.00083) [2022-07-09 03:54:21,317][26022] Updated weights on worker 0-0, policy_version 76338 (0.00099) [2022-07-09 03:54:23,135][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:54:23,152][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000076348_78180352.pth [2022-07-09 03:54:23,152][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000074319_76102656.pth [2022-07-09 03:54:23,160][26022] Updated weights on worker 0-0, policy_version 76348 (0.00095) [2022-07-09 03:54:24,432][25689] Fps is (10 sec: 5894.3, 60 sec: 5785.4, 300 sec: 5779.3). Total num frames: 78188544. Throughput: 0: 5211.1. Samples: 78182918. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:24,432][25689] Avg episode reward: [(0, '-54.256')] [2022-07-09 03:54:24,869][26022] Updated weights on worker 0-0, policy_version 76358 (0.00083) [2022-07-09 03:54:26,681][26022] Updated weights on worker 0-0, policy_version 76368 (0.00087) [2022-07-09 03:54:28,330][26022] Updated weights on worker 0-0, policy_version 76378 (0.00083) [2022-07-09 03:54:29,455][25689] Fps is (10 sec: 5815.8, 60 sec: 5756.2, 300 sec: 5772.2). Total num frames: 78216192. Throughput: 0: 6096.7. Samples: 78218100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:29,455][25689] Avg episode reward: [(0, '-53.980')] [2022-07-09 03:54:30,050][26022] Updated weights on worker 0-0, policy_version 76388 (0.00083) [2022-07-09 03:54:31,915][26022] Updated weights on worker 0-0, policy_version 76398 (0.00090) [2022-07-09 03:54:33,627][26022] Updated weights on worker 0-0, policy_version 76408 (0.00089) [2022-07-09 03:54:34,470][25689] Fps is (10 sec: 5814.6, 60 sec: 5777.8, 300 sec: 5777.5). Total num frames: 78246912. Throughput: 0: 6129.4. Samples: 78253370. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:34,471][25689] Avg episode reward: [(0, '-54.315')] [2022-07-09 03:54:35,377][26022] Updated weights on worker 0-0, policy_version 76418 (0.00086) [2022-07-09 03:54:37,167][26022] Updated weights on worker 0-0, policy_version 76428 (0.00091) [2022-07-09 03:54:38,929][26022] Updated weights on worker 0-0, policy_version 76438 (0.00086) [2022-07-09 03:54:39,549][25689] Fps is (10 sec: 5883.5, 60 sec: 5798.2, 300 sec: 5772.7). Total num frames: 78275584. Throughput: 0: 5246.9. Samples: 78270788. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:39,551][25689] Avg episode reward: [(0, '-54.102')] [2022-07-09 03:54:40,663][26022] Updated weights on worker 0-0, policy_version 76448 (0.00082) [2022-07-09 03:54:42,503][26022] Updated weights on worker 0-0, policy_version 76458 (0.00085) [2022-07-09 03:54:44,203][26022] Updated weights on worker 0-0, policy_version 76468 (0.00088) [2022-07-09 03:54:44,577][25689] Fps is (10 sec: 5673.5, 60 sec: 5751.5, 300 sec: 5770.1). Total num frames: 78304256. Throughput: 0: 6058.9. Samples: 78304978. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:44,579][25689] Avg episode reward: [(0, '-53.723')] [2022-07-09 03:54:45,939][26022] Updated weights on worker 0-0, policy_version 76478 (0.00087) [2022-07-09 03:54:47,817][26022] Updated weights on worker 0-0, policy_version 76488 (0.00095) [2022-07-09 03:54:49,479][26022] Updated weights on worker 0-0, policy_version 76498 (0.00086) [2022-07-09 03:54:49,587][25689] Fps is (10 sec: 5815.1, 60 sec: 5786.5, 300 sec: 5770.3). Total num frames: 78333952. Throughput: 0: 6051.2. Samples: 78339922. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:49,588][25689] Avg episode reward: [(0, '-53.585')] [2022-07-09 03:54:51,138][26022] Updated weights on worker 0-0, policy_version 76508 (0.00082) [2022-07-09 03:54:53,068][26022] Updated weights on worker 0-0, policy_version 76518 (0.00093) [2022-07-09 03:54:54,600][25689] Fps is (10 sec: 5926.0, 60 sec: 5774.5, 300 sec: 5774.7). Total num frames: 78363648. Throughput: 0: 5175.1. Samples: 78357542. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:54,600][25689] Avg episode reward: [(0, '-53.435')] [2022-07-09 03:54:54,615][26022] Updated weights on worker 0-0, policy_version 76528 (0.00173) [2022-07-09 03:54:56,735][26022] Updated weights on worker 0-0, policy_version 76538 (0.00089) [2022-07-09 03:54:58,296][26022] Updated weights on worker 0-0, policy_version 76548 (0.00083) [2022-07-09 03:54:59,694][25689] Fps is (10 sec: 5673.8, 60 sec: 5757.8, 300 sec: 5776.8). Total num frames: 78391296. Throughput: 0: 6047.3. Samples: 78392606. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:54:59,695][25689] Avg episode reward: [(0, '-53.255')] [2022-07-09 03:55:00,251][26022] Updated weights on worker 0-0, policy_version 76558 (0.00083) [2022-07-09 03:55:02,242][26022] Updated weights on worker 0-0, policy_version 76568 (0.00100) [2022-07-09 03:55:04,183][26022] Updated weights on worker 0-0, policy_version 76578 (0.00080) [2022-07-09 03:55:04,733][25689] Fps is (10 sec: 5557.7, 60 sec: 5777.8, 300 sec: 5773.1). Total num frames: 78419968. Throughput: 0: 5963.8. Samples: 78425184. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:55:04,736][25689] Avg episode reward: [(0, '-53.916')] [2022-07-09 03:55:05,800][26022] Updated weights on worker 0-0, policy_version 76588 (0.00093) [2022-07-09 03:55:07,555][26022] Updated weights on worker 0-0, policy_version 76598 (0.00102) [2022-07-09 03:55:09,387][26022] Updated weights on worker 0-0, policy_version 76608 (0.00092) [2022-07-09 03:55:09,751][25689] Fps is (10 sec: 5599.5, 60 sec: 5760.3, 300 sec: 5767.7). Total num frames: 78447616. Throughput: 0: 5092.6. Samples: 78442614. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:55:09,753][25689] Avg episode reward: [(0, '-54.251')] [2022-07-09 03:55:11,252][26022] Updated weights on worker 0-0, policy_version 76618 (0.00084) [2022-07-09 03:55:12,746][26022] Updated weights on worker 0-0, policy_version 76628 (0.00091) [2022-07-09 03:55:14,756][25689] Fps is (10 sec: 5619.0, 60 sec: 5747.2, 300 sec: 5770.2). Total num frames: 78476288. Throughput: 0: 5946.5. Samples: 78477404. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:55:14,757][25689] Avg episode reward: [(0, '-54.297')] [2022-07-09 03:55:14,792][26022] Updated weights on worker 0-0, policy_version 76638 (0.00080) [2022-07-09 03:55:16,415][26022] Updated weights on worker 0-0, policy_version 76648 (0.00092) [2022-07-09 03:55:18,220][26022] Updated weights on worker 0-0, policy_version 76658 (0.00089) [2022-07-09 03:55:19,812][25689] Fps is (10 sec: 5801.9, 60 sec: 5764.9, 300 sec: 5766.1). Total num frames: 78505984. Throughput: 0: 5935.6. Samples: 78512020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:55:19,812][25689] Avg episode reward: [(0, '-54.017')] [2022-07-09 03:55:20,054][26022] Updated weights on worker 0-0, policy_version 76668 (0.00092) [2022-07-09 03:55:21,640][26022] Updated weights on worker 0-0, policy_version 76678 (0.00091) [2022-07-09 03:55:23,506][26022] Updated weights on worker 0-0, policy_version 76688 (0.00092) [2022-07-09 03:55:24,842][25689] Fps is (10 sec: 5888.7, 60 sec: 5746.3, 300 sec: 5769.6). Total num frames: 78535680. Throughput: 0: 5182.9. Samples: 78529406. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2022-07-09 03:55:24,843][25689] Avg episode reward: [(0, '-54.097')] [2022-07-09 03:55:25,415][26022] Updated weights on worker 0-0, policy_version 76698 (0.00084) [2022-07-09 03:55:27,075][26022] Updated weights on worker 0-0, policy_version 76708 (0.00081) [2022-07-09 03:55:29,076][26022] Updated weights on worker 0-0, policy_version 76718 (0.00091) [2022-07-09 03:55:29,876][25689] Fps is (10 sec: 5799.6, 60 sec: 5762.2, 300 sec: 5762.3). Total num frames: 78564352. Throughput: 0: 6028.6. Samples: 78563936. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:55:29,876][25689] Avg episode reward: [(0, '-54.283')] [2022-07-09 03:55:30,574][26022] Updated weights on worker 0-0, policy_version 76728 (0.00092) [2022-07-09 03:55:32,521][26022] Updated weights on worker 0-0, policy_version 76738 (0.00088) [2022-07-09 03:55:34,039][26022] Updated weights on worker 0-0, policy_version 76748 (0.00087) [2022-07-09 03:55:34,906][25689] Fps is (10 sec: 5799.9, 60 sec: 5743.9, 300 sec: 5769.5). Total num frames: 78594048. Throughput: 0: 6038.2. Samples: 78599070. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:55:34,906][25689] Avg episode reward: [(0, '-54.038')] [2022-07-09 03:55:35,984][26022] Updated weights on worker 0-0, policy_version 76758 (0.00593) [2022-07-09 03:55:37,559][26022] Updated weights on worker 0-0, policy_version 76768 (0.00084) [2022-07-09 03:55:39,566][26022] Updated weights on worker 0-0, policy_version 76778 (0.00086) [2022-07-09 03:55:40,022][25689] Fps is (10 sec: 5753.0, 60 sec: 5740.4, 300 sec: 5764.0). Total num frames: 78622720. Throughput: 0: 5165.4. Samples: 78616412. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:55:40,022][25689] Avg episode reward: [(0, '-54.125')] [2022-07-09 03:55:41,264][26022] Updated weights on worker 0-0, policy_version 76788 (0.00090) [2022-07-09 03:55:43,075][26022] Updated weights on worker 0-0, policy_version 76798 (0.00087) [2022-07-09 03:55:44,672][26022] Updated weights on worker 0-0, policy_version 76808 (0.00085) [2022-07-09 03:55:45,103][25689] Fps is (10 sec: 5724.0, 60 sec: 5752.3, 300 sec: 5762.9). Total num frames: 78652416. Throughput: 0: 6000.9. Samples: 78650988. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:55:45,104][25689] Avg episode reward: [(0, '-54.694')] [2022-07-09 03:55:46,615][26022] Updated weights on worker 0-0, policy_version 76818 (0.00084) [2022-07-09 03:55:48,282][26022] Updated weights on worker 0-0, policy_version 76828 (0.00092) [2022-07-09 03:55:50,115][25689] Fps is (10 sec: 5782.9, 60 sec: 5735.1, 300 sec: 5763.3). Total num frames: 78681088. Throughput: 0: 6014.5. Samples: 78685664. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:55:50,116][25689] Avg episode reward: [(0, '-54.565')] [2022-07-09 03:55:50,255][26022] Updated weights on worker 0-0, policy_version 76838 (0.00083) [2022-07-09 03:55:51,733][26022] Updated weights on worker 0-0, policy_version 76848 (0.00086) [2022-07-09 03:55:53,735][26022] Updated weights on worker 0-0, policy_version 76858 (0.00084) [2022-07-09 03:55:55,120][25689] Fps is (10 sec: 5826.9, 60 sec: 5735.9, 300 sec: 5764.2). Total num frames: 78710784. Throughput: 0: 5145.6. Samples: 78703084. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:55:55,127][25689] Avg episode reward: [(0, '-54.215')] [2022-07-09 03:55:55,404][26022] Updated weights on worker 0-0, policy_version 76868 (0.00093) [2022-07-09 03:55:57,170][26022] Updated weights on worker 0-0, policy_version 76878 (0.00087) [2022-07-09 03:55:58,980][26022] Updated weights on worker 0-0, policy_version 76888 (0.00087) [2022-07-09 03:56:00,244][25689] Fps is (10 sec: 5863.3, 60 sec: 5766.8, 300 sec: 5768.9). Total num frames: 78740480. Throughput: 0: 6010.6. Samples: 78737962. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:00,246][25689] Avg episode reward: [(0, '-54.427')] [2022-07-09 03:56:00,626][26022] Updated weights on worker 0-0, policy_version 76898 (0.00085) [2022-07-09 03:56:03,005][26022] Updated weights on worker 0-0, policy_version 76908 (0.00337) [2022-07-09 03:56:04,669][26022] Updated weights on worker 0-0, policy_version 76918 (0.00090) [2022-07-09 03:56:05,255][25689] Fps is (10 sec: 5455.9, 60 sec: 5718.8, 300 sec: 5758.7). Total num frames: 78766080. Throughput: 0: 5932.9. Samples: 78770548. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:05,256][25689] Avg episode reward: [(0, '-54.024')] [2022-07-09 03:56:06,376][26022] Updated weights on worker 0-0, policy_version 76928 (0.00086) [2022-07-09 03:56:08,575][26022] Updated weights on worker 0-0, policy_version 76938 (0.00086) [2022-07-09 03:56:09,948][26022] Updated weights on worker 0-0, policy_version 76948 (0.00089) [2022-07-09 03:56:10,268][25689] Fps is (10 sec: 5618.7, 60 sec: 5770.1, 300 sec: 5762.0). Total num frames: 78796800. Throughput: 0: 5913.2. Samples: 78804834. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:10,270][25689] Avg episode reward: [(0, '-53.922')] [2022-07-09 03:56:11,950][26022] Updated weights on worker 0-0, policy_version 76958 (0.00087) [2022-07-09 03:56:13,567][26022] Updated weights on worker 0-0, policy_version 76968 (0.00085) [2022-07-09 03:56:15,266][26022] Updated weights on worker 0-0, policy_version 76978 (0.00082) [2022-07-09 03:56:15,277][25689] Fps is (10 sec: 5926.2, 60 sec: 5769.7, 300 sec: 5759.4). Total num frames: 78825472. Throughput: 0: 5908.1. Samples: 78822174. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:15,277][25689] Avg episode reward: [(0, '-53.723')] [2022-07-09 03:56:17,143][26022] Updated weights on worker 0-0, policy_version 76988 (0.00091) [2022-07-09 03:56:18,837][26022] Updated weights on worker 0-0, policy_version 76998 (0.00085) [2022-07-09 03:56:20,356][25689] Fps is (10 sec: 5684.6, 60 sec: 5750.5, 300 sec: 5758.0). Total num frames: 78854144. Throughput: 0: 5910.1. Samples: 78856822. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:20,356][25689] Avg episode reward: [(0, '-53.916')] [2022-07-09 03:56:20,598][26022] Updated weights on worker 0-0, policy_version 77008 (0.00079) [2022-07-09 03:56:22,419][26022] Updated weights on worker 0-0, policy_version 77018 (0.00100) [2022-07-09 03:56:23,422][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:56:23,433][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000077023_78871552.pth [2022-07-09 03:56:23,434][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000074995_76794880.pth [2022-07-09 03:56:24,092][26022] Updated weights on worker 0-0, policy_version 77028 (0.00087) [2022-07-09 03:56:25,408][25689] Fps is (10 sec: 5660.1, 60 sec: 5731.5, 300 sec: 5754.0). Total num frames: 78882816. Throughput: 0: 6016.5. Samples: 78891800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:25,409][25689] Avg episode reward: [(0, '-53.206')] [2022-07-09 03:56:26,104][26022] Updated weights on worker 0-0, policy_version 77038 (0.00090) [2022-07-09 03:56:27,721][26022] Updated weights on worker 0-0, policy_version 77048 (0.00082) [2022-07-09 03:56:29,454][26022] Updated weights on worker 0-0, policy_version 77058 (0.00059) [2022-07-09 03:56:30,411][25689] Fps is (10 sec: 5804.8, 60 sec: 5751.4, 300 sec: 5760.9). Total num frames: 78912512. Throughput: 0: 5181.6. Samples: 78909206. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:30,414][25689] Avg episode reward: [(0, '-53.128')] [2022-07-09 03:56:31,436][26022] Updated weights on worker 0-0, policy_version 77068 (0.00091) [2022-07-09 03:56:32,850][26022] Updated weights on worker 0-0, policy_version 77078 (0.00093) [2022-07-09 03:56:34,842][26022] Updated weights on worker 0-0, policy_version 77088 (0.00088) [2022-07-09 03:56:35,459][25689] Fps is (10 sec: 5909.4, 60 sec: 5749.7, 300 sec: 5761.3). Total num frames: 78942208. Throughput: 0: 6044.7. Samples: 78944166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:35,461][25689] Avg episode reward: [(0, '-52.597')] [2022-07-09 03:56:36,567][26022] Updated weights on worker 0-0, policy_version 77098 (0.00086) [2022-07-09 03:56:38,263][26022] Updated weights on worker 0-0, policy_version 77108 (0.00098) [2022-07-09 03:56:40,220][26022] Updated weights on worker 0-0, policy_version 77118 (0.00093) [2022-07-09 03:56:40,551][25689] Fps is (10 sec: 5756.0, 60 sec: 5751.9, 300 sec: 5756.4). Total num frames: 78970880. Throughput: 0: 6036.3. Samples: 78978728. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:40,551][25689] Avg episode reward: [(0, '-52.490')] [2022-07-09 03:56:41,961][26022] Updated weights on worker 0-0, policy_version 77128 (0.00083) [2022-07-09 03:56:43,584][26022] Updated weights on worker 0-0, policy_version 77138 (0.00086) [2022-07-09 03:56:45,415][26022] Updated weights on worker 0-0, policy_version 77148 (0.00094) [2022-07-09 03:56:45,569][25689] Fps is (10 sec: 5773.0, 60 sec: 5757.9, 300 sec: 5757.0). Total num frames: 79000576. Throughput: 0: 5191.9. Samples: 78996474. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:45,570][25689] Avg episode reward: [(0, '-52.710')] [2022-07-09 03:56:46,951][26022] Updated weights on worker 0-0, policy_version 77158 (0.00082) [2022-07-09 03:56:49,181][26022] Updated weights on worker 0-0, policy_version 77168 (0.00087) [2022-07-09 03:56:50,519][26022] Updated weights on worker 0-0, policy_version 77178 (0.00094) [2022-07-09 03:56:50,614][25689] Fps is (10 sec: 5901.9, 60 sec: 5771.7, 300 sec: 5759.6). Total num frames: 79030272. Throughput: 0: 6060.8. Samples: 79031656. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:50,615][25689] Avg episode reward: [(0, '-52.615')] [2022-07-09 03:56:52,427][26022] Updated weights on worker 0-0, policy_version 77188 (0.00092) [2022-07-09 03:56:54,154][26022] Updated weights on worker 0-0, policy_version 77198 (0.00082) [2022-07-09 03:56:55,715][25689] Fps is (10 sec: 5752.8, 60 sec: 5745.7, 300 sec: 5762.4). Total num frames: 79058944. Throughput: 0: 6059.3. Samples: 79066906. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:56:55,716][25689] Avg episode reward: [(0, '-52.907')] [2022-07-09 03:56:55,843][26022] Updated weights on worker 0-0, policy_version 77208 (0.00091) [2022-07-09 03:56:57,768][26022] Updated weights on worker 0-0, policy_version 77218 (0.00088) [2022-07-09 03:56:59,634][26022] Updated weights on worker 0-0, policy_version 77228 (0.00086) [2022-07-09 03:57:00,803][25689] Fps is (10 sec: 5728.9, 60 sec: 5749.2, 300 sec: 5761.3). Total num frames: 79088640. Throughput: 0: 5213.5. Samples: 79084310. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 03:57:00,803][25689] Avg episode reward: [(0, '-52.966')] [2022-07-09 03:57:01,120][26022] Updated weights on worker 0-0, policy_version 77238 (0.00091) [2022-07-09 03:57:03,609][26022] Updated weights on worker 0-0, policy_version 77248 (0.00120) [2022-07-09 03:57:05,009][26022] Updated weights on worker 0-0, policy_version 77258 (0.00089) [2022-07-09 03:57:05,847][25689] Fps is (10 sec: 5659.8, 60 sec: 5779.8, 300 sec: 5760.8). Total num frames: 79116288. Throughput: 0: 5937.1. Samples: 79116866. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:05,849][25689] Avg episode reward: [(0, '-52.833')] [2022-07-09 03:57:07,086][26022] Updated weights on worker 0-0, policy_version 77268 (0.00086) [2022-07-09 03:57:08,608][26022] Updated weights on worker 0-0, policy_version 77278 (0.00091) [2022-07-09 03:57:10,591][26022] Updated weights on worker 0-0, policy_version 77288 (0.00081) [2022-07-09 03:57:10,887][25689] Fps is (10 sec: 5483.1, 60 sec: 5726.5, 300 sec: 5756.7). Total num frames: 79143936. Throughput: 0: 5906.3. Samples: 79151396. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:10,888][25689] Avg episode reward: [(0, '-53.154')] [2022-07-09 03:57:12,198][26022] Updated weights on worker 0-0, policy_version 77298 (0.00978) [2022-07-09 03:57:13,991][26022] Updated weights on worker 0-0, policy_version 77308 (0.00095) [2022-07-09 03:57:15,809][26022] Updated weights on worker 0-0, policy_version 77318 (0.00085) [2022-07-09 03:57:15,905][25689] Fps is (10 sec: 5701.5, 60 sec: 5742.6, 300 sec: 5755.6). Total num frames: 79173632. Throughput: 0: 5048.6. Samples: 79168834. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:15,905][25689] Avg episode reward: [(0, '-52.877')] [2022-07-09 03:57:17,563][26022] Updated weights on worker 0-0, policy_version 77328 (0.00083) [2022-07-09 03:57:19,298][26022] Updated weights on worker 0-0, policy_version 77338 (0.00081) [2022-07-09 03:57:20,979][25689] Fps is (10 sec: 5783.8, 60 sec: 5743.0, 300 sec: 5754.9). Total num frames: 79202304. Throughput: 0: 5926.2. Samples: 79203880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:20,980][25689] Avg episode reward: [(0, '-53.989')] [2022-07-09 03:57:21,178][26022] Updated weights on worker 0-0, policy_version 77348 (0.00089) [2022-07-09 03:57:22,812][26022] Updated weights on worker 0-0, policy_version 77358 (0.00099) [2022-07-09 03:57:24,682][26022] Updated weights on worker 0-0, policy_version 77368 (0.00086) [2022-07-09 03:57:26,037][25689] Fps is (10 sec: 5861.6, 60 sec: 5776.2, 300 sec: 5757.5). Total num frames: 79233024. Throughput: 0: 6028.3. Samples: 79238578. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:26,039][25689] Avg episode reward: [(0, '-54.390')] [2022-07-09 03:57:26,361][26022] Updated weights on worker 0-0, policy_version 77378 (0.00095) [2022-07-09 03:57:27,990][26022] Updated weights on worker 0-0, policy_version 77388 (0.00113) [2022-07-09 03:57:30,064][26022] Updated weights on worker 0-0, policy_version 77398 (0.00099) [2022-07-09 03:57:31,079][25689] Fps is (10 sec: 5880.7, 60 sec: 5755.7, 300 sec: 5756.9). Total num frames: 79261696. Throughput: 0: 5170.5. Samples: 79255794. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:31,079][25689] Avg episode reward: [(0, '-54.280')] [2022-07-09 03:57:31,700][26022] Updated weights on worker 0-0, policy_version 77408 (0.00088) [2022-07-09 03:57:33,537][26022] Updated weights on worker 0-0, policy_version 77418 (0.00073) [2022-07-09 03:57:35,163][26022] Updated weights on worker 0-0, policy_version 77428 (0.00090) [2022-07-09 03:57:36,086][25689] Fps is (10 sec: 5706.8, 60 sec: 5742.7, 300 sec: 5754.7). Total num frames: 79290368. Throughput: 0: 6037.8. Samples: 79290682. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:36,087][25689] Avg episode reward: [(0, '-54.675')] [2022-07-09 03:57:36,993][26022] Updated weights on worker 0-0, policy_version 77438 (0.00084) [2022-07-09 03:57:38,852][26022] Updated weights on worker 0-0, policy_version 77448 (0.00084) [2022-07-09 03:57:40,736][26022] Updated weights on worker 0-0, policy_version 77458 (0.00099) [2022-07-09 03:57:41,182][25689] Fps is (10 sec: 5675.9, 60 sec: 5742.3, 300 sec: 5749.5). Total num frames: 79319040. Throughput: 0: 6007.9. Samples: 79325254. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:41,182][25689] Avg episode reward: [(0, '-55.387')] [2022-07-09 03:57:42,362][26022] Updated weights on worker 0-0, policy_version 77468 (0.00090) [2022-07-09 03:57:44,151][26022] Updated weights on worker 0-0, policy_version 77478 (0.00087) [2022-07-09 03:57:45,984][26022] Updated weights on worker 0-0, policy_version 77488 (0.00084) [2022-07-09 03:57:46,194][25689] Fps is (10 sec: 5774.4, 60 sec: 5742.9, 300 sec: 5753.3). Total num frames: 79348736. Throughput: 0: 5155.9. Samples: 79342502. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:46,194][25689] Avg episode reward: [(0, '-55.558')] [2022-07-09 03:57:47,443][26022] Updated weights on worker 0-0, policy_version 77498 (0.00093) [2022-07-09 03:57:49,551][26022] Updated weights on worker 0-0, policy_version 77508 (0.00094) [2022-07-09 03:57:51,034][26022] Updated weights on worker 0-0, policy_version 77518 (0.00088) [2022-07-09 03:57:51,229][25689] Fps is (10 sec: 5911.4, 60 sec: 5743.9, 300 sec: 5756.6). Total num frames: 79378432. Throughput: 0: 6044.9. Samples: 79377600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:51,229][25689] Avg episode reward: [(0, '-54.606')] [2022-07-09 03:57:53,008][26022] Updated weights on worker 0-0, policy_version 77528 (0.00083) [2022-07-09 03:57:54,783][26022] Updated weights on worker 0-0, policy_version 77538 (0.00093) [2022-07-09 03:57:56,236][25689] Fps is (10 sec: 5914.3, 60 sec: 5769.7, 300 sec: 5751.5). Total num frames: 79408128. Throughput: 0: 6045.8. Samples: 79412506. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:57:56,236][25689] Avg episode reward: [(0, '-55.595')] [2022-07-09 03:57:56,335][26022] Updated weights on worker 0-0, policy_version 77548 (0.00090) [2022-07-09 03:57:58,355][26022] Updated weights on worker 0-0, policy_version 77558 (0.00091) [2022-07-09 03:57:59,871][26022] Updated weights on worker 0-0, policy_version 77568 (0.00088) [2022-07-09 03:58:01,330][25689] Fps is (10 sec: 5676.9, 60 sec: 5735.2, 300 sec: 5761.8). Total num frames: 79435776. Throughput: 0: 5200.5. Samples: 79430034. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:01,330][25689] Avg episode reward: [(0, '-56.218')] [2022-07-09 03:58:02,166][26022] Updated weights on worker 0-0, policy_version 77578 (0.00094) [2022-07-09 03:58:03,855][26022] Updated weights on worker 0-0, policy_version 77588 (0.00098) [2022-07-09 03:58:05,672][26022] Updated weights on worker 0-0, policy_version 77598 (0.00337) [2022-07-09 03:58:06,376][25689] Fps is (10 sec: 5453.0, 60 sec: 5735.0, 300 sec: 5751.4). Total num frames: 79463424. Throughput: 0: 5957.8. Samples: 79462746. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:06,377][25689] Avg episode reward: [(0, '-54.893')] [2022-07-09 03:58:07,441][26022] Updated weights on worker 0-0, policy_version 77608 (0.00054) [2022-07-09 03:58:09,186][26022] Updated weights on worker 0-0, policy_version 77618 (0.00089) [2022-07-09 03:58:10,967][26022] Updated weights on worker 0-0, policy_version 77628 (0.00086) [2022-07-09 03:58:11,404][25689] Fps is (10 sec: 5691.9, 60 sec: 5770.0, 300 sec: 5755.2). Total num frames: 79493120. Throughput: 0: 5948.7. Samples: 79497620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:11,405][25689] Avg episode reward: [(0, '-54.734')] [2022-07-09 03:58:12,657][26022] Updated weights on worker 0-0, policy_version 77638 (0.00085) [2022-07-09 03:58:14,553][26022] Updated weights on worker 0-0, policy_version 77648 (0.00087) [2022-07-09 03:58:16,188][26022] Updated weights on worker 0-0, policy_version 77658 (0.00091) [2022-07-09 03:58:16,468][25689] Fps is (10 sec: 5885.2, 60 sec: 5765.6, 300 sec: 5755.3). Total num frames: 79522816. Throughput: 0: 5056.6. Samples: 79514812. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:16,468][25689] Avg episode reward: [(0, '-54.451')] [2022-07-09 03:58:18,238][26022] Updated weights on worker 0-0, policy_version 77668 (0.00087) [2022-07-09 03:58:19,730][26022] Updated weights on worker 0-0, policy_version 77678 (0.00095) [2022-07-09 03:58:21,518][25689] Fps is (10 sec: 5670.0, 60 sec: 5751.0, 300 sec: 5752.8). Total num frames: 79550464. Throughput: 0: 5913.8. Samples: 79549424. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:21,519][25689] Avg episode reward: [(0, '-54.589')] [2022-07-09 03:58:21,585][26022] Updated weights on worker 0-0, policy_version 77688 (0.00087) [2022-07-09 03:58:23,344][26022] Updated weights on worker 0-0, policy_version 77698 (0.00095) [2022-07-09 03:58:23,758][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 03:58:23,770][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000077699_79563776.pth [2022-07-09 03:58:23,770][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000075673_77489152.pth [2022-07-09 03:58:25,092][26022] Updated weights on worker 0-0, policy_version 77708 (0.00083) [2022-07-09 03:58:26,549][25689] Fps is (10 sec: 5688.4, 60 sec: 5736.7, 300 sec: 5753.5). Total num frames: 79580160. Throughput: 0: 5997.1. Samples: 79583724. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:26,549][25689] Avg episode reward: [(0, '-53.569')] [2022-07-09 03:58:27,167][26022] Updated weights on worker 0-0, policy_version 77718 (0.00084) [2022-07-09 03:58:28,827][26022] Updated weights on worker 0-0, policy_version 77728 (0.00088) [2022-07-09 03:58:30,484][26022] Updated weights on worker 0-0, policy_version 77738 (0.00093) [2022-07-09 03:58:31,610][25689] Fps is (10 sec: 5783.9, 60 sec: 5734.8, 300 sec: 5750.2). Total num frames: 79608832. Throughput: 0: 5964.7. Samples: 79618138. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-09 03:58:31,610][25689] Avg episode reward: [(0, '-53.895')] [2022-07-09 03:58:32,337][26022] Updated weights on worker 0-0, policy_version 77748 (0.00086) [2022-07-09 03:58:34,111][26022] Updated weights on worker 0-0, policy_version 77758 (0.00095) [2022-07-09 03:58:36,011][26022] Updated weights on worker 0-0, policy_version 77768 (0.00085) [2022-07-09 03:58:36,678][25689] Fps is (10 sec: 5661.5, 60 sec: 5729.1, 300 sec: 5754.5). Total num frames: 79637504. Throughput: 0: 5971.7. Samples: 79635498. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:58:36,678][25689] Avg episode reward: [(0, '-54.548')] [2022-07-09 03:58:37,695][26022] Updated weights on worker 0-0, policy_version 77778 (0.00083) [2022-07-09 03:58:39,349][26022] Updated weights on worker 0-0, policy_version 77788 (0.00093) [2022-07-09 03:58:41,230][26022] Updated weights on worker 0-0, policy_version 77798 (0.00087) [2022-07-09 03:58:41,731][25689] Fps is (10 sec: 5766.5, 60 sec: 5750.0, 300 sec: 5748.0). Total num frames: 79667200. Throughput: 0: 5973.8. Samples: 79670174. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:58:41,733][25689] Avg episode reward: [(0, '-55.525')] [2022-07-09 03:58:42,930][26022] Updated weights on worker 0-0, policy_version 77808 (0.00089) [2022-07-09 03:58:44,860][26022] Updated weights on worker 0-0, policy_version 77818 (0.00088) [2022-07-09 03:58:46,478][26022] Updated weights on worker 0-0, policy_version 77828 (0.00092) [2022-07-09 03:58:46,736][25689] Fps is (10 sec: 5802.8, 60 sec: 5733.8, 300 sec: 5751.7). Total num frames: 79695872. Throughput: 0: 5996.9. Samples: 79704786. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:58:46,738][25689] Avg episode reward: [(0, '-55.621')] [2022-07-09 03:58:48,391][26022] Updated weights on worker 0-0, policy_version 77838 (0.00097) [2022-07-09 03:58:50,209][26022] Updated weights on worker 0-0, policy_version 77848 (0.00081) [2022-07-09 03:58:51,741][25689] Fps is (10 sec: 5626.8, 60 sec: 5702.8, 300 sec: 5742.6). Total num frames: 79723520. Throughput: 0: 5164.1. Samples: 79722098. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:58:51,741][25689] Avg episode reward: [(0, '-56.066')] [2022-07-09 03:58:52,051][26022] Updated weights on worker 0-0, policy_version 77858 (0.00090) [2022-07-09 03:58:53,713][26022] Updated weights on worker 0-0, policy_version 77868 (0.00089) [2022-07-09 03:58:55,568][26022] Updated weights on worker 0-0, policy_version 77878 (0.00091) [2022-07-09 03:58:56,763][25689] Fps is (10 sec: 5719.1, 60 sec: 5701.4, 300 sec: 5747.5). Total num frames: 79753216. Throughput: 0: 6036.2. Samples: 79756738. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:58:56,763][25689] Avg episode reward: [(0, '-55.960')] [2022-07-09 03:58:57,213][26022] Updated weights on worker 0-0, policy_version 77888 (0.00094) [2022-07-09 03:58:58,938][26022] Updated weights on worker 0-0, policy_version 77898 (0.00094) [2022-07-09 03:59:01,006][26022] Updated weights on worker 0-0, policy_version 77908 (0.00106) [2022-07-09 03:59:01,915][25689] Fps is (10 sec: 5938.4, 60 sec: 5746.6, 300 sec: 5756.2). Total num frames: 79783936. Throughput: 0: 5998.4. Samples: 79791240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:01,915][25689] Avg episode reward: [(0, '-55.123')] [2022-07-09 03:59:03,023][26022] Updated weights on worker 0-0, policy_version 77918 (0.00059) [2022-07-09 03:59:04,795][26022] Updated weights on worker 0-0, policy_version 77928 (0.00085) [2022-07-09 03:59:06,655][26022] Updated weights on worker 0-0, policy_version 77938 (0.00097) [2022-07-09 03:59:06,986][25689] Fps is (10 sec: 5509.4, 60 sec: 5710.5, 300 sec: 5744.8). Total num frames: 79809536. Throughput: 0: 5011.8. Samples: 79806272. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:06,986][25689] Avg episode reward: [(0, '-54.797')] [2022-07-09 03:59:08,295][26022] Updated weights on worker 0-0, policy_version 77948 (0.00090) [2022-07-09 03:59:10,239][26022] Updated weights on worker 0-0, policy_version 77958 (0.00084) [2022-07-09 03:59:11,799][26022] Updated weights on worker 0-0, policy_version 77968 (0.00097) [2022-07-09 03:59:11,999][25689] Fps is (10 sec: 5585.1, 60 sec: 5728.9, 300 sec: 5748.8). Total num frames: 79840256. Throughput: 0: 5874.4. Samples: 79841102. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:11,999][25689] Avg episode reward: [(0, '-54.104')] [2022-07-09 03:59:13,753][26022] Updated weights on worker 0-0, policy_version 77978 (0.00084) [2022-07-09 03:59:15,303][26022] Updated weights on worker 0-0, policy_version 77988 (0.00087) [2022-07-09 03:59:17,016][25689] Fps is (10 sec: 5819.4, 60 sec: 5699.5, 300 sec: 5746.3). Total num frames: 79867904. Throughput: 0: 5866.6. Samples: 79875552. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:17,016][25689] Avg episode reward: [(0, '-54.134')] [2022-07-09 03:59:17,356][26022] Updated weights on worker 0-0, policy_version 77998 (0.00089) [2022-07-09 03:59:18,994][26022] Updated weights on worker 0-0, policy_version 78008 (0.00082) [2022-07-09 03:59:20,747][26022] Updated weights on worker 0-0, policy_version 78018 (0.00084) [2022-07-09 03:59:22,064][25689] Fps is (10 sec: 5696.9, 60 sec: 5733.4, 300 sec: 5742.2). Total num frames: 79897600. Throughput: 0: 5043.6. Samples: 79892868. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:22,067][25689] Avg episode reward: [(0, '-53.167')] [2022-07-09 03:59:22,684][26022] Updated weights on worker 0-0, policy_version 78028 (0.00088) [2022-07-09 03:59:24,285][26022] Updated weights on worker 0-0, policy_version 78038 (0.00083) [2022-07-09 03:59:26,058][26022] Updated weights on worker 0-0, policy_version 78048 (0.00093) [2022-07-09 03:59:27,072][25689] Fps is (10 sec: 5804.3, 60 sec: 5718.7, 300 sec: 5745.9). Total num frames: 79926272. Throughput: 0: 6047.9. Samples: 79927750. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:27,072][25689] Avg episode reward: [(0, '-53.250')] [2022-07-09 03:59:27,947][26022] Updated weights on worker 0-0, policy_version 78058 (0.00094) [2022-07-09 03:59:29,586][26022] Updated weights on worker 0-0, policy_version 78068 (0.00089) [2022-07-09 03:59:31,449][26022] Updated weights on worker 0-0, policy_version 78078 (0.00094) [2022-07-09 03:59:32,075][25689] Fps is (10 sec: 5933.0, 60 sec: 5758.0, 300 sec: 5746.1). Total num frames: 79956992. Throughput: 0: 6029.7. Samples: 79962156. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:32,075][25689] Avg episode reward: [(0, '-53.056')] [2022-07-09 03:59:33,154][26022] Updated weights on worker 0-0, policy_version 78088 (0.00090) [2022-07-09 03:59:34,939][26022] Updated weights on worker 0-0, policy_version 78098 (0.00094) [2022-07-09 03:59:36,659][26022] Updated weights on worker 0-0, policy_version 78108 (0.00084) [2022-07-09 03:59:37,099][25689] Fps is (10 sec: 5820.6, 60 sec: 5745.2, 300 sec: 5743.8). Total num frames: 79984640. Throughput: 0: 5187.7. Samples: 79979740. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:37,100][25689] Avg episode reward: [(0, '-52.615')] [2022-07-09 03:59:38,458][26022] Updated weights on worker 0-0, policy_version 78118 (0.00094) [2022-07-09 03:59:40,190][26022] Updated weights on worker 0-0, policy_version 78128 (0.00093) [2022-07-09 03:59:42,179][25689] Fps is (10 sec: 5574.1, 60 sec: 5725.9, 300 sec: 5742.8). Total num frames: 80013312. Throughput: 0: 6039.6. Samples: 80014350. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:42,179][25689] Avg episode reward: [(0, '-52.610')] [2022-07-09 03:59:42,184][26022] Updated weights on worker 0-0, policy_version 78138 (0.00087) [2022-07-09 03:59:43,595][26022] Updated weights on worker 0-0, policy_version 78148 (0.00081) [2022-07-09 03:59:45,704][26022] Updated weights on worker 0-0, policy_version 78158 (0.00091) [2022-07-09 03:59:47,075][26022] Updated weights on worker 0-0, policy_version 78168 (0.00085) [2022-07-09 03:59:47,203][25689] Fps is (10 sec: 5878.3, 60 sec: 5757.9, 300 sec: 5745.9). Total num frames: 80044032. Throughput: 0: 6036.0. Samples: 80049264. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:47,203][25689] Avg episode reward: [(0, '-53.141')] [2022-07-09 03:59:49,268][26022] Updated weights on worker 0-0, policy_version 78178 (0.00091) [2022-07-09 03:59:50,701][26022] Updated weights on worker 0-0, policy_version 78188 (0.00084) [2022-07-09 03:59:52,215][25689] Fps is (10 sec: 5815.4, 60 sec: 5757.1, 300 sec: 5739.1). Total num frames: 80071680. Throughput: 0: 5184.8. Samples: 80066580. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:52,216][25689] Avg episode reward: [(0, '-53.462')] [2022-07-09 03:59:52,931][26022] Updated weights on worker 0-0, policy_version 78198 (0.00090) [2022-07-09 03:59:54,175][26022] Updated weights on worker 0-0, policy_version 78208 (0.00093) [2022-07-09 03:59:56,440][26022] Updated weights on worker 0-0, policy_version 78218 (0.00080) [2022-07-09 03:59:57,218][25689] Fps is (10 sec: 5828.1, 60 sec: 5775.9, 300 sec: 5751.1). Total num frames: 80102400. Throughput: 0: 6063.6. Samples: 80101732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 03:59:57,218][25689] Avg episode reward: [(0, '-53.556')] [2022-07-09 03:59:57,559][26022] Updated weights on worker 0-0, policy_version 78228 (0.00091) [2022-07-09 03:59:59,727][26022] Updated weights on worker 0-0, policy_version 78238 (0.00890) [2022-07-09 04:00:01,403][26022] Updated weights on worker 0-0, policy_version 78248 (0.00088) [2022-07-09 04:00:02,357][25689] Fps is (10 sec: 5553.2, 60 sec: 5692.4, 300 sec: 5738.9). Total num frames: 80128000. Throughput: 0: 6061.7. Samples: 80136668. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 04:00:02,359][25689] Avg episode reward: [(0, '-53.976')] [2022-07-09 04:00:03,519][26022] Updated weights on worker 0-0, policy_version 78258 (0.00082) [2022-07-09 04:00:05,366][26022] Updated weights on worker 0-0, policy_version 78268 (0.00097) [2022-07-09 04:00:07,241][26022] Updated weights on worker 0-0, policy_version 78278 (0.00081) [2022-07-09 04:00:07,367][25689] Fps is (10 sec: 5347.6, 60 sec: 5749.1, 300 sec: 5742.5). Total num frames: 80156672. Throughput: 0: 5109.1. Samples: 80152282. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-09 04:00:07,367][25689] Avg episode reward: [(0, '-53.943')] [2022-07-09 04:00:08,873][26022] Updated weights on worker 0-0, policy_version 78288 (0.00083) [2022-07-09 04:00:10,786][26022] Updated weights on worker 0-0, policy_version 78298 (0.00082) [2022-07-09 04:00:12,277][26022] Updated weights on worker 0-0, policy_version 78308 (0.00337) [2022-07-09 04:00:12,458][25689] Fps is (10 sec: 5981.4, 60 sec: 5758.6, 300 sec: 5751.2). Total num frames: 80188416. Throughput: 0: 5949.0. Samples: 80187004. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:12,459][25689] Avg episode reward: [(0, '-54.207')] [2022-07-09 04:00:14,369][26022] Updated weights on worker 0-0, policy_version 78318 (0.00223) [2022-07-09 04:00:15,725][26022] Updated weights on worker 0-0, policy_version 78328 (0.00087) [2022-07-09 04:00:17,473][25689] Fps is (10 sec: 5876.7, 60 sec: 5758.7, 300 sec: 5745.1). Total num frames: 80216064. Throughput: 0: 5921.1. Samples: 80221664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:17,474][25689] Avg episode reward: [(0, '-53.805')] [2022-07-09 04:00:17,813][26022] Updated weights on worker 0-0, policy_version 78338 (0.00087) [2022-07-09 04:00:19,608][26022] Updated weights on worker 0-0, policy_version 78348 (0.00087) [2022-07-09 04:00:21,311][26022] Updated weights on worker 0-0, policy_version 78358 (0.00093) [2022-07-09 04:00:22,504][25689] Fps is (10 sec: 5606.2, 60 sec: 5743.5, 300 sec: 5741.6). Total num frames: 80244736. Throughput: 0: 5080.4. Samples: 80239022. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:22,505][25689] Avg episode reward: [(0, '-54.585')] [2022-07-09 04:00:23,044][26022] Updated weights on worker 0-0, policy_version 78368 (0.00096) [2022-07-09 04:00:23,784][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:00:23,797][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000078373_80253952.pth [2022-07-09 04:00:23,798][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000076348_78180352.pth [2022-07-09 04:00:24,877][26022] Updated weights on worker 0-0, policy_version 78378 (0.00086) [2022-07-09 04:00:26,387][26022] Updated weights on worker 0-0, policy_version 78388 (0.00087) [2022-07-09 04:00:27,506][25689] Fps is (10 sec: 5817.4, 60 sec: 5760.9, 300 sec: 5745.6). Total num frames: 80274432. Throughput: 0: 6033.5. Samples: 80273794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:27,507][25689] Avg episode reward: [(0, '-54.465')] [2022-07-09 04:00:28,397][26022] Updated weights on worker 0-0, policy_version 78398 (0.00094) [2022-07-09 04:00:30,148][26022] Updated weights on worker 0-0, policy_version 78408 (0.00090) [2022-07-09 04:00:31,822][26022] Updated weights on worker 0-0, policy_version 78418 (0.00085) [2022-07-09 04:00:32,514][25689] Fps is (10 sec: 5831.2, 60 sec: 5726.7, 300 sec: 5742.6). Total num frames: 80303104. Throughput: 0: 6068.1. Samples: 80308704. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:32,514][25689] Avg episode reward: [(0, '-54.018')] [2022-07-09 04:00:33,688][26022] Updated weights on worker 0-0, policy_version 78428 (0.00093) [2022-07-09 04:00:35,367][26022] Updated weights on worker 0-0, policy_version 78438 (0.00088) [2022-07-09 04:00:37,111][26022] Updated weights on worker 0-0, policy_version 78448 (0.00095) [2022-07-09 04:00:37,524][25689] Fps is (10 sec: 5724.4, 60 sec: 5745.0, 300 sec: 5744.7). Total num frames: 80331776. Throughput: 0: 5214.0. Samples: 80326208. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:37,524][25689] Avg episode reward: [(0, '-53.597')] [2022-07-09 04:00:39,094][26022] Updated weights on worker 0-0, policy_version 78458 (0.00088) [2022-07-09 04:00:40,577][26022] Updated weights on worker 0-0, policy_version 78468 (0.00087) [2022-07-09 04:00:42,542][26022] Updated weights on worker 0-0, policy_version 78478 (0.00087) [2022-07-09 04:00:42,637][25689] Fps is (10 sec: 5765.7, 60 sec: 5758.7, 300 sec: 5744.1). Total num frames: 80361472. Throughput: 0: 6043.6. Samples: 80360698. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:42,638][25689] Avg episode reward: [(0, '-53.305')] [2022-07-09 04:00:44,377][26022] Updated weights on worker 0-0, policy_version 78488 (0.00088) [2022-07-09 04:00:45,838][26022] Updated weights on worker 0-0, policy_version 78498 (0.00089) [2022-07-09 04:00:47,643][25689] Fps is (10 sec: 5667.0, 60 sec: 5709.6, 300 sec: 5740.7). Total num frames: 80389120. Throughput: 0: 6042.5. Samples: 80395468. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:47,644][25689] Avg episode reward: [(0, '-52.898')] [2022-07-09 04:00:48,013][26022] Updated weights on worker 0-0, policy_version 78508 (0.00087) [2022-07-09 04:00:49,417][26022] Updated weights on worker 0-0, policy_version 78518 (0.00087) [2022-07-09 04:00:51,415][26022] Updated weights on worker 0-0, policy_version 78528 (0.00084) [2022-07-09 04:00:52,699][25689] Fps is (10 sec: 5902.4, 60 sec: 5773.2, 300 sec: 5746.6). Total num frames: 80420864. Throughput: 0: 5162.3. Samples: 80412910. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:52,701][25689] Avg episode reward: [(0, '-52.637')] [2022-07-09 04:00:53,141][26022] Updated weights on worker 0-0, policy_version 78538 (0.00081) [2022-07-09 04:00:54,760][26022] Updated weights on worker 0-0, policy_version 78548 (0.00089) [2022-07-09 04:00:57,055][26022] Updated weights on worker 0-0, policy_version 78558 (0.00092) [2022-07-09 04:00:57,725][25689] Fps is (10 sec: 5992.3, 60 sec: 5737.1, 300 sec: 5745.1). Total num frames: 80449536. Throughput: 0: 6024.2. Samples: 80447904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:00:57,727][25689] Avg episode reward: [(0, '-53.088')] [2022-07-09 04:00:58,194][26022] Updated weights on worker 0-0, policy_version 78568 (0.00081) [2022-07-09 04:01:00,365][26022] Updated weights on worker 0-0, policy_version 78578 (0.00092) [2022-07-09 04:01:02,183][26022] Updated weights on worker 0-0, policy_version 78588 (0.00086) [2022-07-09 04:01:02,800][25689] Fps is (10 sec: 5474.7, 60 sec: 5760.2, 300 sec: 5747.3). Total num frames: 80476160. Throughput: 0: 5968.5. Samples: 80481040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:02,802][25689] Avg episode reward: [(0, '-53.890')] [2022-07-09 04:01:04,118][26022] Updated weights on worker 0-0, policy_version 78598 (0.00086) [2022-07-09 04:01:06,017][26022] Updated weights on worker 0-0, policy_version 78608 (0.00081) [2022-07-09 04:01:07,755][26022] Updated weights on worker 0-0, policy_version 78618 (0.00079) [2022-07-09 04:01:07,816][25689] Fps is (10 sec: 5479.9, 60 sec: 5759.6, 300 sec: 5740.4). Total num frames: 80504832. Throughput: 0: 5067.6. Samples: 80497696. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:07,818][25689] Avg episode reward: [(0, '-53.301')] [2022-07-09 04:01:09,375][26022] Updated weights on worker 0-0, policy_version 78628 (0.00084) [2022-07-09 04:01:11,532][26022] Updated weights on worker 0-0, policy_version 78638 (0.00081) [2022-07-09 04:01:12,839][25689] Fps is (10 sec: 5814.0, 60 sec: 5732.1, 300 sec: 5743.5). Total num frames: 80534528. Throughput: 0: 5947.8. Samples: 80532696. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:12,841][25689] Avg episode reward: [(0, '-53.459')] [2022-07-09 04:01:12,910][26022] Updated weights on worker 0-0, policy_version 78648 (0.00095) [2022-07-09 04:01:14,818][26022] Updated weights on worker 0-0, policy_version 78658 (0.00084) [2022-07-09 04:01:16,411][26022] Updated weights on worker 0-0, policy_version 78668 (0.00091) [2022-07-09 04:01:17,844][25689] Fps is (10 sec: 5718.0, 60 sec: 5733.1, 300 sec: 5741.5). Total num frames: 80562176. Throughput: 0: 5954.7. Samples: 80567708. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:17,846][25689] Avg episode reward: [(0, '-53.380')] [2022-07-09 04:01:18,412][26022] Updated weights on worker 0-0, policy_version 78678 (0.00089) [2022-07-09 04:01:20,214][26022] Updated weights on worker 0-0, policy_version 78688 (0.00084) [2022-07-09 04:01:21,827][26022] Updated weights on worker 0-0, policy_version 78698 (0.00082) [2022-07-09 04:01:22,917][25689] Fps is (10 sec: 5791.7, 60 sec: 5763.0, 300 sec: 5748.0). Total num frames: 80592896. Throughput: 0: 5159.3. Samples: 80584830. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:22,917][25689] Avg episode reward: [(0, '-53.228')] [2022-07-09 04:01:23,725][26022] Updated weights on worker 0-0, policy_version 78708 (0.00087) [2022-07-09 04:01:25,423][26022] Updated weights on worker 0-0, policy_version 78718 (0.00088) [2022-07-09 04:01:27,090][26022] Updated weights on worker 0-0, policy_version 78728 (0.00097) [2022-07-09 04:01:27,995][25689] Fps is (10 sec: 5851.0, 60 sec: 5738.9, 300 sec: 5743.1). Total num frames: 80621568. Throughput: 0: 6047.9. Samples: 80619738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:27,996][25689] Avg episode reward: [(0, '-53.061')] [2022-07-09 04:01:28,875][26022] Updated weights on worker 0-0, policy_version 78738 (0.00089) [2022-07-09 04:01:30,678][26022] Updated weights on worker 0-0, policy_version 78748 (0.00086) [2022-07-09 04:01:32,375][26022] Updated weights on worker 0-0, policy_version 78758 (0.00090) [2022-07-09 04:01:33,031][25689] Fps is (10 sec: 5669.9, 60 sec: 5736.2, 300 sec: 5739.9). Total num frames: 80650240. Throughput: 0: 6029.3. Samples: 80654438. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:33,031][25689] Avg episode reward: [(0, '-53.407')] [2022-07-09 04:01:34,210][26022] Updated weights on worker 0-0, policy_version 78768 (0.00090) [2022-07-09 04:01:35,964][26022] Updated weights on worker 0-0, policy_version 78778 (0.00084) [2022-07-09 04:01:37,806][26022] Updated weights on worker 0-0, policy_version 78788 (0.00085) [2022-07-09 04:01:38,033][25689] Fps is (10 sec: 5916.9, 60 sec: 5770.8, 300 sec: 5748.5). Total num frames: 80680960. Throughput: 0: 6014.8. Samples: 80689138. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:38,033][25689] Avg episode reward: [(0, '-54.623')] [2022-07-09 04:01:39,779][26022] Updated weights on worker 0-0, policy_version 78798 (0.00095) [2022-07-09 04:01:41,244][26022] Updated weights on worker 0-0, policy_version 78808 (0.00088) [2022-07-09 04:01:43,113][25689] Fps is (10 sec: 5687.3, 60 sec: 5723.1, 300 sec: 5737.0). Total num frames: 80707584. Throughput: 0: 6022.9. Samples: 80706472. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:01:43,114][25689] Avg episode reward: [(0, '-54.704')] [2022-07-09 04:01:43,205][26022] Updated weights on worker 0-0, policy_version 78818 (0.00094) [2022-07-09 04:01:44,796][26022] Updated weights on worker 0-0, policy_version 78828 (0.00092) [2022-07-09 04:01:46,704][26022] Updated weights on worker 0-0, policy_version 78838 (0.00088) [2022-07-09 04:01:48,125][25689] Fps is (10 sec: 5682.0, 60 sec: 5773.3, 300 sec: 5741.1). Total num frames: 80738304. Throughput: 0: 6036.3. Samples: 80741248. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:01:48,125][25689] Avg episode reward: [(0, '-54.671')] [2022-07-09 04:01:48,495][26022] Updated weights on worker 0-0, policy_version 78848 (0.00084) [2022-07-09 04:01:50,172][26022] Updated weights on worker 0-0, policy_version 78858 (0.00085) [2022-07-09 04:01:52,032][26022] Updated weights on worker 0-0, policy_version 78868 (0.00090) [2022-07-09 04:01:53,155][25689] Fps is (10 sec: 5914.8, 60 sec: 5725.1, 300 sec: 5742.5). Total num frames: 80766976. Throughput: 0: 6038.4. Samples: 80775956. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:01:53,155][25689] Avg episode reward: [(0, '-55.237')] [2022-07-09 04:01:53,761][26022] Updated weights on worker 0-0, policy_version 78878 (0.00082) [2022-07-09 04:01:55,535][26022] Updated weights on worker 0-0, policy_version 78888 (0.00090) [2022-07-09 04:01:57,418][26022] Updated weights on worker 0-0, policy_version 78898 (0.00085) [2022-07-09 04:01:58,165][25689] Fps is (10 sec: 5915.8, 60 sec: 5760.5, 300 sec: 5747.4). Total num frames: 80797696. Throughput: 0: 5178.8. Samples: 80793396. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:01:58,165][25689] Avg episode reward: [(0, '-54.542')] [2022-07-09 04:01:58,972][26022] Updated weights on worker 0-0, policy_version 78908 (0.00083) [2022-07-09 04:02:00,941][26022] Updated weights on worker 0-0, policy_version 78918 (0.00087) [2022-07-09 04:02:02,914][26022] Updated weights on worker 0-0, policy_version 78928 (0.00085) [2022-07-09 04:02:03,273][25689] Fps is (10 sec: 5566.3, 60 sec: 5740.4, 300 sec: 5739.3). Total num frames: 80823296. Throughput: 0: 5925.1. Samples: 80825918. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:03,273][25689] Avg episode reward: [(0, '-54.096')] [2022-07-09 04:02:04,727][26022] Updated weights on worker 0-0, policy_version 78938 (0.00082) [2022-07-09 04:02:06,487][26022] Updated weights on worker 0-0, policy_version 78948 (0.00088) [2022-07-09 04:02:08,282][25689] Fps is (10 sec: 5263.1, 60 sec: 5724.1, 300 sec: 5739.9). Total num frames: 80850944. Throughput: 0: 5945.5. Samples: 80861090. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:08,282][25689] Avg episode reward: [(0, '-53.650')] [2022-07-09 04:02:08,362][26022] Updated weights on worker 0-0, policy_version 78958 (0.00096) [2022-07-09 04:02:10,003][26022] Updated weights on worker 0-0, policy_version 78968 (0.00090) [2022-07-09 04:02:11,794][26022] Updated weights on worker 0-0, policy_version 78978 (0.01206) [2022-07-09 04:02:13,320][25689] Fps is (10 sec: 5911.5, 60 sec: 5756.6, 300 sec: 5746.4). Total num frames: 80882688. Throughput: 0: 5087.3. Samples: 80878540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:13,321][25689] Avg episode reward: [(0, '-53.322')] [2022-07-09 04:02:13,525][26022] Updated weights on worker 0-0, policy_version 78988 (0.00092) [2022-07-09 04:02:15,344][26022] Updated weights on worker 0-0, policy_version 78998 (0.00109) [2022-07-09 04:02:17,028][26022] Updated weights on worker 0-0, policy_version 79008 (0.00085) [2022-07-09 04:02:18,347][25689] Fps is (10 sec: 5900.8, 60 sec: 5754.5, 300 sec: 5743.8). Total num frames: 80910336. Throughput: 0: 5954.8. Samples: 80913578. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:18,347][25689] Avg episode reward: [(0, '-53.206')] [2022-07-09 04:02:19,027][26022] Updated weights on worker 0-0, policy_version 79018 (0.00084) [2022-07-09 04:02:20,495][26022] Updated weights on worker 0-0, policy_version 79028 (0.00093) [2022-07-09 04:02:22,433][26022] Updated weights on worker 0-0, policy_version 79038 (0.00080) [2022-07-09 04:02:23,382][25689] Fps is (10 sec: 5800.8, 60 sec: 5758.1, 300 sec: 5744.3). Total num frames: 80941056. Throughput: 0: 6083.4. Samples: 80948250. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:23,382][25689] Avg episode reward: [(0, '-52.759')] [2022-07-09 04:02:23,810][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:02:23,822][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000079047_80944128.pth [2022-07-09 04:02:23,822][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000077023_78871552.pth [2022-07-09 04:02:23,947][26022] Updated weights on worker 0-0, policy_version 79048 (0.00083) [2022-07-09 04:02:25,968][26022] Updated weights on worker 0-0, policy_version 79058 (0.00086) [2022-07-09 04:02:27,643][26022] Updated weights on worker 0-0, policy_version 79068 (0.00094) [2022-07-09 04:02:28,387][25689] Fps is (10 sec: 5915.6, 60 sec: 5765.0, 300 sec: 5745.0). Total num frames: 80969728. Throughput: 0: 5202.7. Samples: 80965690. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:28,387][25689] Avg episode reward: [(0, '-53.665')] [2022-07-09 04:02:29,530][26022] Updated weights on worker 0-0, policy_version 79078 (0.00090) [2022-07-09 04:02:31,336][26022] Updated weights on worker 0-0, policy_version 79088 (0.00089) [2022-07-09 04:02:32,882][26022] Updated weights on worker 0-0, policy_version 79098 (0.00086) [2022-07-09 04:02:33,391][25689] Fps is (10 sec: 5729.2, 60 sec: 5768.0, 300 sec: 5745.0). Total num frames: 80998400. Throughput: 0: 6081.4. Samples: 81000602. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:33,391][25689] Avg episode reward: [(0, '-53.447')] [2022-07-09 04:02:34,916][26022] Updated weights on worker 0-0, policy_version 79108 (0.00091) [2022-07-09 04:02:36,626][26022] Updated weights on worker 0-0, policy_version 79118 (0.00086) [2022-07-09 04:02:38,407][25689] Fps is (10 sec: 5518.3, 60 sec: 5698.8, 300 sec: 5739.7). Total num frames: 81025024. Throughput: 0: 6012.2. Samples: 81034186. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:38,408][25689] Avg episode reward: [(0, '-53.957')] [2022-07-09 04:02:38,673][26022] Updated weights on worker 0-0, policy_version 79128 (0.01123) [2022-07-09 04:02:40,383][26022] Updated weights on worker 0-0, policy_version 79138 (0.00795) [2022-07-09 04:02:42,099][26022] Updated weights on worker 0-0, policy_version 79148 (0.00088) [2022-07-09 04:02:43,460][25689] Fps is (10 sec: 5593.5, 60 sec: 5752.4, 300 sec: 5738.9). Total num frames: 81054720. Throughput: 0: 5117.7. Samples: 81051004. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:43,460][25689] Avg episode reward: [(0, '-54.425')] [2022-07-09 04:02:43,965][26022] Updated weights on worker 0-0, policy_version 79158 (0.00094) [2022-07-09 04:02:45,692][26022] Updated weights on worker 0-0, policy_version 79168 (0.00095) [2022-07-09 04:02:47,728][26022] Updated weights on worker 0-0, policy_version 79178 (0.00092) [2022-07-09 04:02:48,462][25689] Fps is (10 sec: 5804.8, 60 sec: 5719.3, 300 sec: 5736.1). Total num frames: 81083392. Throughput: 0: 5917.4. Samples: 81084486. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:48,463][25689] Avg episode reward: [(0, '-54.344')] [2022-07-09 04:02:49,520][26022] Updated weights on worker 0-0, policy_version 79188 (0.00095) [2022-07-09 04:02:51,199][26022] Updated weights on worker 0-0, policy_version 79198 (0.00086) [2022-07-09 04:02:52,819][26022] Updated weights on worker 0-0, policy_version 79208 (0.00089) [2022-07-09 04:02:53,480][25689] Fps is (10 sec: 5620.7, 60 sec: 5703.5, 300 sec: 5729.0). Total num frames: 81111040. Throughput: 0: 5909.5. Samples: 81119318. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:53,481][25689] Avg episode reward: [(0, '-54.769')] [2022-07-09 04:02:54,766][26022] Updated weights on worker 0-0, policy_version 79218 (0.00083) [2022-07-09 04:02:56,414][26022] Updated weights on worker 0-0, policy_version 79228 (0.00086) [2022-07-09 04:02:58,284][26022] Updated weights on worker 0-0, policy_version 79238 (0.00083) [2022-07-09 04:02:58,485][25689] Fps is (10 sec: 5721.3, 60 sec: 5686.9, 300 sec: 5737.6). Total num frames: 81140736. Throughput: 0: 5097.8. Samples: 81136540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:02:58,486][25689] Avg episode reward: [(0, '-53.955')] [2022-07-09 04:02:59,935][26022] Updated weights on worker 0-0, policy_version 79248 (0.00089) [2022-07-09 04:03:02,207][26022] Updated weights on worker 0-0, policy_version 79258 (0.00088) [2022-07-09 04:03:03,584][25689] Fps is (10 sec: 5675.4, 60 sec: 5721.8, 300 sec: 5736.6). Total num frames: 81168384. Throughput: 0: 5872.7. Samples: 81169186. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:03:03,584][25689] Avg episode reward: [(0, '-54.050')] [2022-07-09 04:03:03,918][26022] Updated weights on worker 0-0, policy_version 79268 (0.00095) [2022-07-09 04:03:05,736][26022] Updated weights on worker 0-0, policy_version 79278 (0.00086) [2022-07-09 04:03:07,419][26022] Updated weights on worker 0-0, policy_version 79288 (0.00092) [2022-07-09 04:03:08,586][25689] Fps is (10 sec: 5575.6, 60 sec: 5739.4, 300 sec: 5733.6). Total num frames: 81197056. Throughput: 0: 5955.8. Samples: 81204340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:03:08,587][25689] Avg episode reward: [(0, '-53.800')] [2022-07-09 04:03:09,139][26022] Updated weights on worker 0-0, policy_version 79298 (0.00092) [2022-07-09 04:03:10,775][26022] Updated weights on worker 0-0, policy_version 79308 (0.00090) [2022-07-09 04:03:12,614][26022] Updated weights on worker 0-0, policy_version 79318 (0.00087) [2022-07-09 04:03:13,621][25689] Fps is (10 sec: 5815.0, 60 sec: 5705.7, 300 sec: 5734.2). Total num frames: 81226752. Throughput: 0: 5104.8. Samples: 81222136. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:03:13,622][25689] Avg episode reward: [(0, '-54.288')] [2022-07-09 04:03:14,283][26022] Updated weights on worker 0-0, policy_version 79328 (0.00091) [2022-07-09 04:03:16,066][26022] Updated weights on worker 0-0, policy_version 79338 (0.00083) [2022-07-09 04:03:17,711][26022] Updated weights on worker 0-0, policy_version 79348 (0.00086) [2022-07-09 04:03:18,631][25689] Fps is (10 sec: 5913.1, 60 sec: 5741.4, 300 sec: 5741.9). Total num frames: 81256448. Throughput: 0: 5995.2. Samples: 81257314. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:03:18,631][25689] Avg episode reward: [(0, '-54.122')] [2022-07-09 04:03:19,778][26022] Updated weights on worker 0-0, policy_version 79358 (0.00086) [2022-07-09 04:03:21,476][26022] Updated weights on worker 0-0, policy_version 79368 (0.00121) [2022-07-09 04:03:23,133][26022] Updated weights on worker 0-0, policy_version 79378 (0.00083) [2022-07-09 04:03:23,702][25689] Fps is (10 sec: 5790.2, 60 sec: 5704.0, 300 sec: 5737.6). Total num frames: 81285120. Throughput: 0: 6099.2. Samples: 81291890. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-09 04:03:23,702][25689] Avg episode reward: [(0, '-54.348')] [2022-07-09 04:03:24,783][26022] Updated weights on worker 0-0, policy_version 79388 (0.00084) [2022-07-09 04:03:26,639][26022] Updated weights on worker 0-0, policy_version 79398 (0.00089) [2022-07-09 04:03:28,457][26022] Updated weights on worker 0-0, policy_version 79408 (0.00091) [2022-07-09 04:03:28,723][25689] Fps is (10 sec: 5783.5, 60 sec: 5719.5, 300 sec: 5741.9). Total num frames: 81314816. Throughput: 0: 5220.7. Samples: 81309466. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:28,723][25689] Avg episode reward: [(0, '-54.168')] [2022-07-09 04:03:30,219][26022] Updated weights on worker 0-0, policy_version 79418 (0.00083) [2022-07-09 04:03:32,104][26022] Updated weights on worker 0-0, policy_version 79428 (0.00085) [2022-07-09 04:03:33,728][25689] Fps is (10 sec: 5821.4, 60 sec: 5719.3, 300 sec: 5743.1). Total num frames: 81343488. Throughput: 0: 6080.8. Samples: 81344402. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:33,729][25689] Avg episode reward: [(0, '-53.581')] [2022-07-09 04:03:33,866][26022] Updated weights on worker 0-0, policy_version 79438 (0.00081) [2022-07-09 04:03:35,373][26022] Updated weights on worker 0-0, policy_version 79448 (0.00092) [2022-07-09 04:03:37,355][26022] Updated weights on worker 0-0, policy_version 79458 (0.00092) [2022-07-09 04:03:38,733][25689] Fps is (10 sec: 5932.8, 60 sec: 5788.3, 300 sec: 5747.5). Total num frames: 81374208. Throughput: 0: 6065.9. Samples: 81379256. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:38,734][25689] Avg episode reward: [(0, '-53.617')] [2022-07-09 04:03:39,026][26022] Updated weights on worker 0-0, policy_version 79468 (0.00083) [2022-07-09 04:03:40,779][26022] Updated weights on worker 0-0, policy_version 79478 (0.00094) [2022-07-09 04:03:42,735][26022] Updated weights on worker 0-0, policy_version 79488 (0.00089) [2022-07-09 04:03:43,847][25689] Fps is (10 sec: 5869.7, 60 sec: 5765.5, 300 sec: 5745.4). Total num frames: 81402880. Throughput: 0: 5211.4. Samples: 81396872. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:43,847][25689] Avg episode reward: [(0, '-53.058')] [2022-07-09 04:03:44,203][26022] Updated weights on worker 0-0, policy_version 79498 (0.00093) [2022-07-09 04:03:46,261][26022] Updated weights on worker 0-0, policy_version 79508 (0.00088) [2022-07-09 04:03:47,730][26022] Updated weights on worker 0-0, policy_version 79518 (0.00085) [2022-07-09 04:03:48,873][25689] Fps is (10 sec: 5756.4, 60 sec: 5780.2, 300 sec: 5751.8). Total num frames: 81432576. Throughput: 0: 6084.2. Samples: 81432066. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:48,874][25689] Avg episode reward: [(0, '-52.504')] [2022-07-09 04:03:49,781][26022] Updated weights on worker 0-0, policy_version 79528 (0.00087) [2022-07-09 04:03:51,448][26022] Updated weights on worker 0-0, policy_version 79538 (0.00085) [2022-07-09 04:03:53,163][26022] Updated weights on worker 0-0, policy_version 79548 (0.00098) [2022-07-09 04:03:53,876][25689] Fps is (10 sec: 5717.5, 60 sec: 5781.5, 300 sec: 5745.3). Total num frames: 81460224. Throughput: 0: 6051.2. Samples: 81466322. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:53,877][25689] Avg episode reward: [(0, '-53.072')] [2022-07-09 04:03:55,068][26022] Updated weights on worker 0-0, policy_version 79558 (0.00093) [2022-07-09 04:03:56,982][26022] Updated weights on worker 0-0, policy_version 79568 (0.01355) [2022-07-09 04:03:58,596][26022] Updated weights on worker 0-0, policy_version 79578 (0.00086) [2022-07-09 04:03:58,887][25689] Fps is (10 sec: 5624.4, 60 sec: 5764.1, 300 sec: 5741.1). Total num frames: 81488896. Throughput: 0: 5174.0. Samples: 81483528. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:03:58,887][25689] Avg episode reward: [(0, '-53.156')] [2022-07-09 04:04:00,463][26022] Updated weights on worker 0-0, policy_version 79588 (0.00097) [2022-07-09 04:04:02,519][26022] Updated weights on worker 0-0, policy_version 79598 (0.00078) [2022-07-09 04:04:04,018][25689] Fps is (10 sec: 5452.5, 60 sec: 5744.1, 300 sec: 5743.4). Total num frames: 81515520. Throughput: 0: 5941.5. Samples: 81516718. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:04,018][25689] Avg episode reward: [(0, '-52.758')] [2022-07-09 04:04:04,314][26022] Updated weights on worker 0-0, policy_version 79608 (0.00098) [2022-07-09 04:04:06,158][26022] Updated weights on worker 0-0, policy_version 79618 (0.00095) [2022-07-09 04:04:08,005][26022] Updated weights on worker 0-0, policy_version 79628 (0.00086) [2022-07-09 04:04:09,047][25689] Fps is (10 sec: 5442.0, 60 sec: 5741.5, 300 sec: 5736.2). Total num frames: 81544192. Throughput: 0: 5838.1. Samples: 81549846. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:09,048][25689] Avg episode reward: [(0, '-52.798')] [2022-07-09 04:04:09,855][26022] Updated weights on worker 0-0, policy_version 79638 (0.00093) [2022-07-09 04:04:11,667][26022] Updated weights on worker 0-0, policy_version 79648 (0.00086) [2022-07-09 04:04:13,119][26022] Updated weights on worker 0-0, policy_version 79658 (0.00080) [2022-07-09 04:04:14,087][25689] Fps is (10 sec: 5796.6, 60 sec: 5741.0, 300 sec: 5742.7). Total num frames: 81573888. Throughput: 0: 4987.6. Samples: 81567124. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:14,087][25689] Avg episode reward: [(0, '-53.131')] [2022-07-09 04:04:15,204][26022] Updated weights on worker 0-0, policy_version 79668 (0.00081) [2022-07-09 04:04:16,780][26022] Updated weights on worker 0-0, policy_version 79678 (0.00093) [2022-07-09 04:04:18,614][26022] Updated weights on worker 0-0, policy_version 79688 (0.00083) [2022-07-09 04:04:19,105][25689] Fps is (10 sec: 5905.5, 60 sec: 5740.3, 300 sec: 5743.3). Total num frames: 81603584. Throughput: 0: 5850.5. Samples: 81601814. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:19,105][25689] Avg episode reward: [(0, '-54.129')] [2022-07-09 04:04:20,478][26022] Updated weights on worker 0-0, policy_version 79698 (0.00091) [2022-07-09 04:04:22,166][26022] Updated weights on worker 0-0, policy_version 79708 (0.00091) [2022-07-09 04:04:23,904][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:04:23,918][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000079717_81630208.pth [2022-07-09 04:04:23,919][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000077699_79563776.pth [2022-07-09 04:04:24,091][26022] Updated weights on worker 0-0, policy_version 79718 (0.00094) [2022-07-09 04:04:24,158][25689] Fps is (10 sec: 5795.9, 60 sec: 5742.0, 300 sec: 5742.4). Total num frames: 81632256. Throughput: 0: 5933.1. Samples: 81636210. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:24,158][25689] Avg episode reward: [(0, '-53.540')] [2022-07-09 04:04:25,655][26022] Updated weights on worker 0-0, policy_version 79728 (0.00096) [2022-07-09 04:04:27,454][26022] Updated weights on worker 0-0, policy_version 79738 (0.00088) [2022-07-09 04:04:29,183][25689] Fps is (10 sec: 5588.4, 60 sec: 5707.7, 300 sec: 5731.6). Total num frames: 81659904. Throughput: 0: 5144.3. Samples: 81653428. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:29,183][25689] Avg episode reward: [(0, '-54.749')] [2022-07-09 04:04:29,399][26022] Updated weights on worker 0-0, policy_version 79748 (0.00089) [2022-07-09 04:04:30,931][26022] Updated weights on worker 0-0, policy_version 79758 (0.00086) [2022-07-09 04:04:33,103][26022] Updated weights on worker 0-0, policy_version 79768 (0.00091) [2022-07-09 04:04:34,257][25689] Fps is (10 sec: 5779.5, 60 sec: 5735.1, 300 sec: 5741.0). Total num frames: 81690624. Throughput: 0: 6002.0. Samples: 81688182. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:34,257][25689] Avg episode reward: [(0, '-54.700')] [2022-07-09 04:04:34,572][26022] Updated weights on worker 0-0, policy_version 79778 (0.00088) [2022-07-09 04:04:36,418][26022] Updated weights on worker 0-0, policy_version 79788 (0.00090) [2022-07-09 04:04:38,234][26022] Updated weights on worker 0-0, policy_version 79798 (0.00086) [2022-07-09 04:04:39,330][25689] Fps is (10 sec: 5751.9, 60 sec: 5677.9, 300 sec: 5737.7). Total num frames: 81718272. Throughput: 0: 5989.1. Samples: 81722948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:39,331][25689] Avg episode reward: [(0, '-54.482')] [2022-07-09 04:04:39,912][26022] Updated weights on worker 0-0, policy_version 79808 (0.00089) [2022-07-09 04:04:41,786][26022] Updated weights on worker 0-0, policy_version 79818 (0.00088) [2022-07-09 04:04:43,635][26022] Updated weights on worker 0-0, policy_version 79828 (0.00083) [2022-07-09 04:04:44,386][25689] Fps is (10 sec: 5762.3, 60 sec: 5717.1, 300 sec: 5737.1). Total num frames: 81748992. Throughput: 0: 6006.0. Samples: 81757702. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:44,386][25689] Avg episode reward: [(0, '-54.265')] [2022-07-09 04:04:45,282][26022] Updated weights on worker 0-0, policy_version 79838 (0.00088) [2022-07-09 04:04:47,074][26022] Updated weights on worker 0-0, policy_version 79848 (0.00083) [2022-07-09 04:04:48,814][26022] Updated weights on worker 0-0, policy_version 79858 (0.00240) [2022-07-09 04:04:49,394][25689] Fps is (10 sec: 5799.6, 60 sec: 5685.0, 300 sec: 5737.2). Total num frames: 81776640. Throughput: 0: 6009.8. Samples: 81774896. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:49,395][25689] Avg episode reward: [(0, '-54.804')] [2022-07-09 04:04:50,670][26022] Updated weights on worker 0-0, policy_version 79868 (0.00099) [2022-07-09 04:04:52,433][26022] Updated weights on worker 0-0, policy_version 79878 (0.00082) [2022-07-09 04:04:54,223][26022] Updated weights on worker 0-0, policy_version 79888 (0.00094) [2022-07-09 04:04:54,406][25689] Fps is (10 sec: 5723.1, 60 sec: 5718.1, 300 sec: 5733.5). Total num frames: 81806336. Throughput: 0: 6023.9. Samples: 81809556. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:54,406][25689] Avg episode reward: [(0, '-54.441')] [2022-07-09 04:04:55,996][26022] Updated weights on worker 0-0, policy_version 79898 (0.00098) [2022-07-09 04:04:57,854][26022] Updated weights on worker 0-0, policy_version 79908 (0.00082) [2022-07-09 04:04:59,408][26022] Updated weights on worker 0-0, policy_version 79918 (0.00088) [2022-07-09 04:04:59,451][25689] Fps is (10 sec: 5905.9, 60 sec: 5731.7, 300 sec: 5749.1). Total num frames: 81836032. Throughput: 0: 6035.5. Samples: 81844386. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:04:59,451][25689] Avg episode reward: [(0, '-54.551')] [2022-07-09 04:05:01,189][26022] Updated weights on worker 0-0, policy_version 79928 (0.00089) [2022-07-09 04:05:03,420][26022] Updated weights on worker 0-0, policy_version 79938 (0.00085) [2022-07-09 04:05:04,543][25689] Fps is (10 sec: 5555.8, 60 sec: 5735.4, 300 sec: 5740.7). Total num frames: 81862656. Throughput: 0: 5053.0. Samples: 81859554. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:04,543][25689] Avg episode reward: [(0, '-54.251')] [2022-07-09 04:05:05,251][26022] Updated weights on worker 0-0, policy_version 79948 (0.00094) [2022-07-09 04:05:06,926][26022] Updated weights on worker 0-0, policy_version 79958 (0.00087) [2022-07-09 04:05:08,746][26022] Updated weights on worker 0-0, policy_version 79968 (0.00087) [2022-07-09 04:05:09,612][25689] Fps is (10 sec: 5441.9, 60 sec: 5731.7, 300 sec: 5730.7). Total num frames: 81891328. Throughput: 0: 5894.6. Samples: 81894070. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:09,612][25689] Avg episode reward: [(0, '-54.103')] [2022-07-09 04:05:10,588][26022] Updated weights on worker 0-0, policy_version 79978 (0.00085) [2022-07-09 04:05:12,327][26022] Updated weights on worker 0-0, policy_version 79988 (0.00091) [2022-07-09 04:05:13,964][26022] Updated weights on worker 0-0, policy_version 79998 (0.00087) [2022-07-09 04:05:14,643][25689] Fps is (10 sec: 5779.0, 60 sec: 5732.5, 300 sec: 5737.3). Total num frames: 81921024. Throughput: 0: 5881.3. Samples: 81928576. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:14,643][25689] Avg episode reward: [(0, '-53.870')] [2022-07-09 04:05:16,122][26022] Updated weights on worker 0-0, policy_version 80008 (0.00084) [2022-07-09 04:05:17,543][26022] Updated weights on worker 0-0, policy_version 80018 (0.00092) [2022-07-09 04:05:19,505][26022] Updated weights on worker 0-0, policy_version 80028 (0.00086) [2022-07-09 04:05:19,718][25689] Fps is (10 sec: 5775.4, 60 sec: 5710.1, 300 sec: 5736.5). Total num frames: 81949696. Throughput: 0: 5012.9. Samples: 81945984. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:19,719][25689] Avg episode reward: [(0, '-54.108')] [2022-07-09 04:05:21,299][26022] Updated weights on worker 0-0, policy_version 80038 (0.00088) [2022-07-09 04:05:23,054][26022] Updated weights on worker 0-0, policy_version 80048 (0.00085) [2022-07-09 04:05:24,797][25689] Fps is (10 sec: 5647.6, 60 sec: 5707.8, 300 sec: 5731.6). Total num frames: 81978368. Throughput: 0: 5957.3. Samples: 81980212. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:24,797][25689] Avg episode reward: [(0, '-54.145')] [2022-07-09 04:05:24,979][26022] Updated weights on worker 0-0, policy_version 80058 (0.00086) [2022-07-09 04:05:26,626][26022] Updated weights on worker 0-0, policy_version 80068 (0.00086) [2022-07-09 04:05:28,465][26022] Updated weights on worker 0-0, policy_version 80078 (0.00088) [2022-07-09 04:05:29,803][25689] Fps is (10 sec: 5787.7, 60 sec: 5743.3, 300 sec: 5735.0). Total num frames: 82008064. Throughput: 0: 5983.5. Samples: 82014884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:29,804][25689] Avg episode reward: [(0, '-54.241')] [2022-07-09 04:05:29,963][26022] Updated weights on worker 0-0, policy_version 80088 (0.00083) [2022-07-09 04:05:32,071][26022] Updated weights on worker 0-0, policy_version 80098 (0.00091) [2022-07-09 04:05:33,831][26022] Updated weights on worker 0-0, policy_version 80108 (0.00085) [2022-07-09 04:05:34,806][25689] Fps is (10 sec: 5729.1, 60 sec: 5699.3, 300 sec: 5731.7). Total num frames: 82035712. Throughput: 0: 5142.2. Samples: 82032258. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:34,807][25689] Avg episode reward: [(0, '-55.230')] [2022-07-09 04:05:35,455][26022] Updated weights on worker 0-0, policy_version 80118 (0.00085) [2022-07-09 04:05:37,484][26022] Updated weights on worker 0-0, policy_version 80128 (0.00092) [2022-07-09 04:05:38,935][26022] Updated weights on worker 0-0, policy_version 80138 (0.00089) [2022-07-09 04:05:39,841][25689] Fps is (10 sec: 5712.5, 60 sec: 5736.7, 300 sec: 5733.2). Total num frames: 82065408. Throughput: 0: 6006.1. Samples: 82066846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:39,842][25689] Avg episode reward: [(0, '-54.043')] [2022-07-09 04:05:40,953][26022] Updated weights on worker 0-0, policy_version 80148 (0.00091) [2022-07-09 04:05:42,607][26022] Updated weights on worker 0-0, policy_version 80158 (0.00082) [2022-07-09 04:05:44,229][26022] Updated weights on worker 0-0, policy_version 80168 (0.00087) [2022-07-09 04:05:44,899][25689] Fps is (10 sec: 5884.5, 60 sec: 5719.6, 300 sec: 5739.1). Total num frames: 82095104. Throughput: 0: 6046.7. Samples: 82101764. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:44,899][25689] Avg episode reward: [(0, '-52.981')] [2022-07-09 04:05:46,332][26022] Updated weights on worker 0-0, policy_version 80178 (0.00086) [2022-07-09 04:05:47,729][26022] Updated weights on worker 0-0, policy_version 80188 (0.00101) [2022-07-09 04:05:49,842][26022] Updated weights on worker 0-0, policy_version 80198 (0.00095) [2022-07-09 04:05:49,904][25689] Fps is (10 sec: 5800.2, 60 sec: 5736.8, 300 sec: 5729.8). Total num frames: 82123776. Throughput: 0: 5191.2. Samples: 82119234. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:49,907][25689] Avg episode reward: [(0, '-52.374')] [2022-07-09 04:05:51,525][26022] Updated weights on worker 0-0, policy_version 80208 (0.00096) [2022-07-09 04:05:53,238][26022] Updated weights on worker 0-0, policy_version 80218 (0.00082) [2022-07-09 04:05:54,994][25689] Fps is (10 sec: 5680.1, 60 sec: 5712.5, 300 sec: 5728.5). Total num frames: 82152448. Throughput: 0: 6035.2. Samples: 82154100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:05:54,995][25689] Avg episode reward: [(0, '-52.486')] [2022-07-09 04:05:55,086][26022] Updated weights on worker 0-0, policy_version 80228 (0.00088) [2022-07-09 04:05:56,775][26022] Updated weights on worker 0-0, policy_version 80238 (0.00085) [2022-07-09 04:05:58,574][26022] Updated weights on worker 0-0, policy_version 80248 (0.00085) [2022-07-09 04:06:00,054][25689] Fps is (10 sec: 5851.5, 60 sec: 5728.0, 300 sec: 5742.6). Total num frames: 82183168. Throughput: 0: 6049.7. Samples: 82189128. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:00,055][25689] Avg episode reward: [(0, '-52.364')] [2022-07-09 04:06:00,319][26022] Updated weights on worker 0-0, policy_version 80258 (0.00081) [2022-07-09 04:06:02,503][26022] Updated weights on worker 0-0, policy_version 80268 (0.00096) [2022-07-09 04:06:04,260][26022] Updated weights on worker 0-0, policy_version 80278 (0.00090) [2022-07-09 04:06:05,157][25689] Fps is (10 sec: 5642.6, 60 sec: 5727.0, 300 sec: 5734.0). Total num frames: 82209792. Throughput: 0: 5055.1. Samples: 82204172. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:05,159][25689] Avg episode reward: [(0, '-51.390')] [2022-07-09 04:06:06,116][26022] Updated weights on worker 0-0, policy_version 80288 (0.00091) [2022-07-09 04:06:07,650][26022] Updated weights on worker 0-0, policy_version 80298 (0.00087) [2022-07-09 04:06:09,458][26022] Updated weights on worker 0-0, policy_version 80308 (0.00085) [2022-07-09 04:06:10,175][25689] Fps is (10 sec: 5463.8, 60 sec: 5731.8, 300 sec: 5730.7). Total num frames: 82238464. Throughput: 0: 5909.7. Samples: 82239026. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:10,175][25689] Avg episode reward: [(0, '-52.081')] [2022-07-09 04:06:11,207][26022] Updated weights on worker 0-0, policy_version 80318 (0.00085) [2022-07-09 04:06:13,241][26022] Updated weights on worker 0-0, policy_version 80328 (0.00092) [2022-07-09 04:06:14,665][26022] Updated weights on worker 0-0, policy_version 80338 (0.00083) [2022-07-09 04:06:15,180][25689] Fps is (10 sec: 5823.3, 60 sec: 5734.2, 300 sec: 5737.6). Total num frames: 82268160. Throughput: 0: 5935.3. Samples: 82273910. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:15,181][25689] Avg episode reward: [(0, '-52.792')] [2022-07-09 04:06:16,522][26022] Updated weights on worker 0-0, policy_version 80348 (0.00091) [2022-07-09 04:06:18,190][26022] Updated weights on worker 0-0, policy_version 80358 (0.00615) [2022-07-09 04:06:20,089][26022] Updated weights on worker 0-0, policy_version 80368 (0.00093) [2022-07-09 04:06:20,187][25689] Fps is (10 sec: 5829.9, 60 sec: 5740.8, 300 sec: 5732.0). Total num frames: 82296832. Throughput: 0: 5081.4. Samples: 82291428. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:20,187][25689] Avg episode reward: [(0, '-53.266')] [2022-07-09 04:06:21,898][26022] Updated weights on worker 0-0, policy_version 80378 (0.00088) [2022-07-09 04:06:23,624][26022] Updated weights on worker 0-0, policy_version 80388 (0.00087) [2022-07-09 04:06:24,043][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:06:24,053][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000080391_82320384.pth [2022-07-09 04:06:24,054][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000078373_80253952.pth [2022-07-09 04:06:25,286][25689] Fps is (10 sec: 5674.7, 60 sec: 5738.8, 300 sec: 5731.6). Total num frames: 82325504. Throughput: 0: 6051.4. Samples: 82325978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:25,286][25689] Avg episode reward: [(0, '-54.017')] [2022-07-09 04:06:25,453][26022] Updated weights on worker 0-0, policy_version 80398 (0.00082) [2022-07-09 04:06:27,128][26022] Updated weights on worker 0-0, policy_version 80408 (0.00084) [2022-07-09 04:06:29,207][26022] Updated weights on worker 0-0, policy_version 80418 (0.00090) [2022-07-09 04:06:30,290][25689] Fps is (10 sec: 5878.1, 60 sec: 5755.9, 300 sec: 5739.0). Total num frames: 82356224. Throughput: 0: 6040.9. Samples: 82360542. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:30,291][25689] Avg episode reward: [(0, '-54.092')] [2022-07-09 04:06:30,661][26022] Updated weights on worker 0-0, policy_version 80428 (0.00090) [2022-07-09 04:06:32,675][26022] Updated weights on worker 0-0, policy_version 80438 (0.00084) [2022-07-09 04:06:34,313][26022] Updated weights on worker 0-0, policy_version 80448 (0.00094) [2022-07-09 04:06:35,328][25689] Fps is (10 sec: 5812.0, 60 sec: 5752.6, 300 sec: 5728.0). Total num frames: 82383872. Throughput: 0: 5149.6. Samples: 82377664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 04:06:35,329][25689] Avg episode reward: [(0, '-54.273')] [2022-07-09 04:06:36,145][26022] Updated weights on worker 0-0, policy_version 80458 (0.00084) [2022-07-09 04:06:37,964][26022] Updated weights on worker 0-0, policy_version 80468 (0.00092) [2022-07-09 04:06:39,628][26022] Updated weights on worker 0-0, policy_version 80478 (0.00092) [2022-07-09 04:06:40,403][25689] Fps is (10 sec: 5569.2, 60 sec: 5731.9, 300 sec: 5735.0). Total num frames: 82412544. Throughput: 0: 5965.0. Samples: 82412020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:06:40,404][25689] Avg episode reward: [(0, '-54.263')] [2022-07-09 04:06:41,634][26022] Updated weights on worker 0-0, policy_version 80488 (0.00084) [2022-07-09 04:06:43,271][26022] Updated weights on worker 0-0, policy_version 80498 (0.00085) [2022-07-09 04:06:45,173][26022] Updated weights on worker 0-0, policy_version 80508 (0.00085) [2022-07-09 04:06:45,552][25689] Fps is (10 sec: 5709.4, 60 sec: 5723.3, 300 sec: 5728.9). Total num frames: 82442240. Throughput: 0: 5943.4. Samples: 82446426. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:06:45,552][25689] Avg episode reward: [(0, '-54.389')] [2022-07-09 04:06:46,657][26022] Updated weights on worker 0-0, policy_version 80518 (0.00084) [2022-07-09 04:06:48,714][26022] Updated weights on worker 0-0, policy_version 80528 (0.00085) [2022-07-09 04:06:50,328][26022] Updated weights on worker 0-0, policy_version 80538 (0.00093) [2022-07-09 04:06:50,567][25689] Fps is (10 sec: 5742.6, 60 sec: 5722.4, 300 sec: 5729.2). Total num frames: 82470912. Throughput: 0: 5083.0. Samples: 82463608. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:06:50,568][25689] Avg episode reward: [(0, '-54.183')] [2022-07-09 04:06:52,339][26022] Updated weights on worker 0-0, policy_version 80548 (0.00094) [2022-07-09 04:06:54,139][26022] Updated weights on worker 0-0, policy_version 80558 (0.00088) [2022-07-09 04:06:55,599][25689] Fps is (10 sec: 5707.3, 60 sec: 5727.9, 300 sec: 5721.9). Total num frames: 82499584. Throughput: 0: 5923.7. Samples: 82497742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:06:55,600][25689] Avg episode reward: [(0, '-53.272')] [2022-07-09 04:06:55,882][26022] Updated weights on worker 0-0, policy_version 80568 (0.00088) [2022-07-09 04:06:57,557][26022] Updated weights on worker 0-0, policy_version 80578 (0.00088) [2022-07-09 04:06:59,616][26022] Updated weights on worker 0-0, policy_version 80588 (0.00082) [2022-07-09 04:07:00,664][25689] Fps is (10 sec: 5780.7, 60 sec: 5710.5, 300 sec: 5736.5). Total num frames: 82529280. Throughput: 0: 5925.2. Samples: 82532072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:00,665][25689] Avg episode reward: [(0, '-54.430')] [2022-07-09 04:07:01,159][26022] Updated weights on worker 0-0, policy_version 80598 (0.00087) [2022-07-09 04:07:03,559][26022] Updated weights on worker 0-0, policy_version 80608 (0.00091) [2022-07-09 04:07:05,201][26022] Updated weights on worker 0-0, policy_version 80618 (0.00084) [2022-07-09 04:07:05,805][25689] Fps is (10 sec: 5418.1, 60 sec: 5690.0, 300 sec: 5727.1). Total num frames: 82554880. Throughput: 0: 5805.6. Samples: 82564008. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:05,806][25689] Avg episode reward: [(0, '-54.734')] [2022-07-09 04:07:07,012][26022] Updated weights on worker 0-0, policy_version 80628 (0.00092) [2022-07-09 04:07:08,878][26022] Updated weights on worker 0-0, policy_version 80638 (0.00081) [2022-07-09 04:07:10,592][26022] Updated weights on worker 0-0, policy_version 80648 (0.00084) [2022-07-09 04:07:10,809][25689] Fps is (10 sec: 5349.8, 60 sec: 5691.3, 300 sec: 5717.4). Total num frames: 82583552. Throughput: 0: 5800.7. Samples: 82581024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:10,809][25689] Avg episode reward: [(0, '-55.036')] [2022-07-09 04:07:12,453][26022] Updated weights on worker 0-0, policy_version 80658 (0.00090) [2022-07-09 04:07:14,343][26022] Updated weights on worker 0-0, policy_version 80668 (0.00605) [2022-07-09 04:07:15,803][26022] Updated weights on worker 0-0, policy_version 80678 (0.00087) [2022-07-09 04:07:15,900][25689] Fps is (10 sec: 5883.1, 60 sec: 5700.1, 300 sec: 5726.5). Total num frames: 82614272. Throughput: 0: 5799.7. Samples: 82615484. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:15,902][25689] Avg episode reward: [(0, '-55.343')] [2022-07-09 04:07:17,989][26022] Updated weights on worker 0-0, policy_version 80688 (0.00086) [2022-07-09 04:07:19,425][26022] Updated weights on worker 0-0, policy_version 80698 (0.00096) [2022-07-09 04:07:20,963][25689] Fps is (10 sec: 5647.4, 60 sec: 5661.2, 300 sec: 5712.2). Total num frames: 82640896. Throughput: 0: 5805.4. Samples: 82649914. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:20,963][25689] Avg episode reward: [(0, '-55.499')] [2022-07-09 04:07:21,565][26022] Updated weights on worker 0-0, policy_version 80708 (0.00085) [2022-07-09 04:07:22,961][26022] Updated weights on worker 0-0, policy_version 80718 (0.00089) [2022-07-09 04:07:25,156][26022] Updated weights on worker 0-0, policy_version 80728 (0.00081) [2022-07-09 04:07:26,059][25689] Fps is (10 sec: 5645.0, 60 sec: 5695.2, 300 sec: 5717.3). Total num frames: 82671616. Throughput: 0: 5079.2. Samples: 82666880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:26,059][25689] Avg episode reward: [(0, '-54.285')] [2022-07-09 04:07:26,824][26022] Updated weights on worker 0-0, policy_version 80738 (0.00086) [2022-07-09 04:07:28,663][26022] Updated weights on worker 0-0, policy_version 80748 (0.00409) [2022-07-09 04:07:30,367][26022] Updated weights on worker 0-0, policy_version 80758 (0.00093) [2022-07-09 04:07:31,079][25689] Fps is (10 sec: 5871.2, 60 sec: 5660.0, 300 sec: 5717.0). Total num frames: 82700288. Throughput: 0: 5931.7. Samples: 82701258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:31,079][25689] Avg episode reward: [(0, '-54.157')] [2022-07-09 04:07:32,169][26022] Updated weights on worker 0-0, policy_version 80768 (0.00092) [2022-07-09 04:07:33,872][26022] Updated weights on worker 0-0, policy_version 80778 (0.00091) [2022-07-09 04:07:35,868][26022] Updated weights on worker 0-0, policy_version 80788 (0.00095) [2022-07-09 04:07:36,094][25689] Fps is (10 sec: 5714.2, 60 sec: 5679.0, 300 sec: 5723.9). Total num frames: 82728960. Throughput: 0: 5944.9. Samples: 82735534. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:36,095][25689] Avg episode reward: [(0, '-54.296')] [2022-07-09 04:07:37,588][26022] Updated weights on worker 0-0, policy_version 80798 (0.00089) [2022-07-09 04:07:39,412][26022] Updated weights on worker 0-0, policy_version 80808 (0.00092) [2022-07-09 04:07:41,051][26022] Updated weights on worker 0-0, policy_version 80818 (0.00091) [2022-07-09 04:07:41,123][25689] Fps is (10 sec: 5709.3, 60 sec: 5683.3, 300 sec: 5720.9). Total num frames: 82757632. Throughput: 0: 5087.3. Samples: 82752472. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:41,123][25689] Avg episode reward: [(0, '-53.909')] [2022-07-09 04:07:43,018][26022] Updated weights on worker 0-0, policy_version 80828 (0.00093) [2022-07-09 04:07:44,736][26022] Updated weights on worker 0-0, policy_version 80838 (0.00087) [2022-07-09 04:07:46,256][25689] Fps is (10 sec: 5542.0, 60 sec: 5651.0, 300 sec: 5715.0). Total num frames: 82785280. Throughput: 0: 5955.3. Samples: 82787164. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:46,257][25689] Avg episode reward: [(0, '-53.095')] [2022-07-09 04:07:46,679][26022] Updated weights on worker 0-0, policy_version 80848 (0.00084) [2022-07-09 04:07:48,223][26022] Updated weights on worker 0-0, policy_version 80858 (0.00084) [2022-07-09 04:07:50,091][26022] Updated weights on worker 0-0, policy_version 80868 (0.00099) [2022-07-09 04:07:51,277][25689] Fps is (10 sec: 5748.1, 60 sec: 5684.2, 300 sec: 5725.2). Total num frames: 82816000. Throughput: 0: 5959.2. Samples: 82821622. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:51,278][25689] Avg episode reward: [(0, '-53.179')] [2022-07-09 04:07:51,930][26022] Updated weights on worker 0-0, policy_version 80878 (0.00088) [2022-07-09 04:07:53,596][26022] Updated weights on worker 0-0, policy_version 80888 (0.00091) [2022-07-09 04:07:55,571][26022] Updated weights on worker 0-0, policy_version 80898 (0.00085) [2022-07-09 04:07:56,302][25689] Fps is (10 sec: 5708.4, 60 sec: 5651.1, 300 sec: 5714.5). Total num frames: 82842624. Throughput: 0: 5102.5. Samples: 82838644. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:07:56,303][25689] Avg episode reward: [(0, '-53.146')] [2022-07-09 04:07:57,091][26022] Updated weights on worker 0-0, policy_version 80908 (0.00106) [2022-07-09 04:07:58,823][26022] Updated weights on worker 0-0, policy_version 80918 (0.00089) [2022-07-09 04:08:00,695][26022] Updated weights on worker 0-0, policy_version 80928 (0.00877) [2022-07-09 04:08:01,375][25689] Fps is (10 sec: 5577.4, 60 sec: 5650.4, 300 sec: 5721.9). Total num frames: 82872320. Throughput: 0: 5966.7. Samples: 82873310. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:08:01,376][25689] Avg episode reward: [(0, '-53.623')] [2022-07-09 04:08:02,959][26022] Updated weights on worker 0-0, policy_version 80938 (0.00086) [2022-07-09 04:08:04,642][26022] Updated weights on worker 0-0, policy_version 80948 (0.00089) [2022-07-09 04:08:06,443][25689] Fps is (10 sec: 5654.7, 60 sec: 5691.0, 300 sec: 5717.2). Total num frames: 82899968. Throughput: 0: 5879.6. Samples: 82905852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:08:06,443][25689] Avg episode reward: [(0, '-53.680')] [2022-07-09 04:08:06,505][26022] Updated weights on worker 0-0, policy_version 80958 (0.00090) [2022-07-09 04:08:08,168][26022] Updated weights on worker 0-0, policy_version 80968 (0.00092) [2022-07-09 04:08:10,090][26022] Updated weights on worker 0-0, policy_version 80978 (0.00100) [2022-07-09 04:08:11,516][25689] Fps is (10 sec: 5553.4, 60 sec: 5684.4, 300 sec: 5713.0). Total num frames: 82928640. Throughput: 0: 5009.6. Samples: 82923014. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:08:11,518][25689] Avg episode reward: [(0, '-53.561')] [2022-07-09 04:08:11,787][26022] Updated weights on worker 0-0, policy_version 80988 (0.00095) [2022-07-09 04:08:13,476][26022] Updated weights on worker 0-0, policy_version 80998 (0.00086) [2022-07-09 04:08:15,271][26022] Updated weights on worker 0-0, policy_version 81008 (0.00090) [2022-07-09 04:08:16,529][25689] Fps is (10 sec: 5888.7, 60 sec: 5691.9, 300 sec: 5716.4). Total num frames: 82959360. Throughput: 0: 5908.3. Samples: 82958150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 04:08:16,529][25689] Avg episode reward: [(0, '-53.673')] [2022-07-09 04:08:17,090][26022] Updated weights on worker 0-0, policy_version 81018 (0.00092) [2022-07-09 04:08:18,860][26022] Updated weights on worker 0-0, policy_version 81028 (0.00083) [2022-07-09 04:08:20,686][26022] Updated weights on worker 0-0, policy_version 81038 (0.00083) [2022-07-09 04:08:21,536][25689] Fps is (10 sec: 5825.1, 60 sec: 5713.9, 300 sec: 5714.2). Total num frames: 82987008. Throughput: 0: 5935.8. Samples: 82992986. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:21,537][25689] Avg episode reward: [(0, '-53.575')] [2022-07-09 04:08:22,488][26022] Updated weights on worker 0-0, policy_version 81048 (0.00088) [2022-07-09 04:08:24,137][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:08:24,151][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000081058_83003392.pth [2022-07-09 04:08:24,152][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000079047_80944128.pth [2022-07-09 04:08:24,158][26022] Updated weights on worker 0-0, policy_version 81058 (0.00099) [2022-07-09 04:08:26,178][26022] Updated weights on worker 0-0, policy_version 81068 (0.00088) [2022-07-09 04:08:26,587][25689] Fps is (10 sec: 5701.1, 60 sec: 5701.3, 300 sec: 5713.6). Total num frames: 83016704. Throughput: 0: 5172.8. Samples: 83010054. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:26,589][25689] Avg episode reward: [(0, '-53.635')] [2022-07-09 04:08:27,810][26022] Updated weights on worker 0-0, policy_version 81078 (0.00089) [2022-07-09 04:08:29,711][26022] Updated weights on worker 0-0, policy_version 81088 (0.00084) [2022-07-09 04:08:31,464][26022] Updated weights on worker 0-0, policy_version 81098 (0.00097) [2022-07-09 04:08:31,643][25689] Fps is (10 sec: 5775.4, 60 sec: 5697.9, 300 sec: 5712.7). Total num frames: 83045376. Throughput: 0: 6024.0. Samples: 83044256. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:31,645][25689] Avg episode reward: [(0, '-53.440')] [2022-07-09 04:08:33,248][26022] Updated weights on worker 0-0, policy_version 81108 (0.00091) [2022-07-09 04:08:34,903][26022] Updated weights on worker 0-0, policy_version 81118 (0.00083) [2022-07-09 04:08:36,681][25689] Fps is (10 sec: 5681.3, 60 sec: 5695.8, 300 sec: 5705.2). Total num frames: 83074048. Throughput: 0: 6003.8. Samples: 83079138. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:36,682][25689] Avg episode reward: [(0, '-54.035')] [2022-07-09 04:08:36,875][26022] Updated weights on worker 0-0, policy_version 81128 (0.00088) [2022-07-09 04:08:38,302][26022] Updated weights on worker 0-0, policy_version 81138 (0.00088) [2022-07-09 04:08:40,543][26022] Updated weights on worker 0-0, policy_version 81148 (0.00083) [2022-07-09 04:08:41,698][25689] Fps is (10 sec: 5906.3, 60 sec: 5730.6, 300 sec: 5713.9). Total num frames: 83104768. Throughput: 0: 5125.3. Samples: 83096322. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:41,699][25689] Avg episode reward: [(0, '-54.635')] [2022-07-09 04:08:41,810][26022] Updated weights on worker 0-0, policy_version 81158 (0.00099) [2022-07-09 04:08:43,967][26022] Updated weights on worker 0-0, policy_version 81168 (0.00095) [2022-07-09 04:08:45,571][26022] Updated weights on worker 0-0, policy_version 81178 (0.00089) [2022-07-09 04:08:46,737][25689] Fps is (10 sec: 5600.3, 60 sec: 5705.7, 300 sec: 5699.9). Total num frames: 83130368. Throughput: 0: 5984.5. Samples: 83130642. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:46,738][25689] Avg episode reward: [(0, '-54.963')] [2022-07-09 04:08:47,536][26022] Updated weights on worker 0-0, policy_version 81188 (0.00062) [2022-07-09 04:08:49,158][26022] Updated weights on worker 0-0, policy_version 81198 (0.00087) [2022-07-09 04:08:51,131][26022] Updated weights on worker 0-0, policy_version 81208 (0.00093) [2022-07-09 04:08:51,747][25689] Fps is (10 sec: 5604.9, 60 sec: 5706.8, 300 sec: 5710.1). Total num frames: 83161088. Throughput: 0: 6013.1. Samples: 83165142. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:51,747][25689] Avg episode reward: [(0, '-54.651')] [2022-07-09 04:08:52,902][26022] Updated weights on worker 0-0, policy_version 81218 (0.00096) [2022-07-09 04:08:54,437][26022] Updated weights on worker 0-0, policy_version 81228 (0.00094) [2022-07-09 04:08:56,453][26022] Updated weights on worker 0-0, policy_version 81238 (0.00087) [2022-07-09 04:08:56,769][25689] Fps is (10 sec: 5818.1, 60 sec: 5723.9, 300 sec: 5706.4). Total num frames: 83188736. Throughput: 0: 5136.3. Samples: 83182320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:08:56,770][25689] Avg episode reward: [(0, '-54.790')] [2022-07-09 04:08:58,207][26022] Updated weights on worker 0-0, policy_version 81248 (0.00095) [2022-07-09 04:08:59,963][26022] Updated weights on worker 0-0, policy_version 81258 (0.00103) [2022-07-09 04:09:01,778][25689] Fps is (10 sec: 5614.7, 60 sec: 5713.1, 300 sec: 5715.6). Total num frames: 83217408. Throughput: 0: 5999.9. Samples: 83216794. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:01,778][25689] Avg episode reward: [(0, '-54.680')] [2022-07-09 04:09:01,839][26022] Updated weights on worker 0-0, policy_version 81268 (0.00088) [2022-07-09 04:09:03,811][26022] Updated weights on worker 0-0, policy_version 81278 (0.00084) [2022-07-09 04:09:05,700][26022] Updated weights on worker 0-0, policy_version 81288 (0.00090) [2022-07-09 04:09:06,906][25689] Fps is (10 sec: 5656.8, 60 sec: 5724.3, 300 sec: 5713.7). Total num frames: 83246080. Throughput: 0: 5899.2. Samples: 83249624. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:06,907][25689] Avg episode reward: [(0, '-54.423')] [2022-07-09 04:09:07,486][26022] Updated weights on worker 0-0, policy_version 81298 (0.00089) [2022-07-09 04:09:09,127][26022] Updated weights on worker 0-0, policy_version 81308 (0.00079) [2022-07-09 04:09:11,012][26022] Updated weights on worker 0-0, policy_version 81318 (0.00087) [2022-07-09 04:09:11,941][25689] Fps is (10 sec: 5742.8, 60 sec: 5744.9, 300 sec: 5713.8). Total num frames: 83275776. Throughput: 0: 5913.9. Samples: 83284570. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:11,942][25689] Avg episode reward: [(0, '-54.231')] [2022-07-09 04:09:12,671][26022] Updated weights on worker 0-0, policy_version 81328 (0.00094) [2022-07-09 04:09:14,533][26022] Updated weights on worker 0-0, policy_version 81338 (0.00086) [2022-07-09 04:09:16,288][26022] Updated weights on worker 0-0, policy_version 81348 (0.00103) [2022-07-09 04:09:16,955][25689] Fps is (10 sec: 5706.9, 60 sec: 5694.0, 300 sec: 5707.0). Total num frames: 83303424. Throughput: 0: 5929.3. Samples: 83302004. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:16,955][25689] Avg episode reward: [(0, '-54.505')] [2022-07-09 04:09:17,890][26022] Updated weights on worker 0-0, policy_version 81358 (0.00103) [2022-07-09 04:09:19,708][26022] Updated weights on worker 0-0, policy_version 81368 (0.00089) [2022-07-09 04:09:21,359][26022] Updated weights on worker 0-0, policy_version 81378 (0.00091) [2022-07-09 04:09:21,986][25689] Fps is (10 sec: 5811.0, 60 sec: 5742.6, 300 sec: 5714.3). Total num frames: 83334144. Throughput: 0: 5948.3. Samples: 83336998. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:21,986][25689] Avg episode reward: [(0, '-54.519')] [2022-07-09 04:09:23,263][26022] Updated weights on worker 0-0, policy_version 81388 (0.00088) [2022-07-09 04:09:24,898][26022] Updated weights on worker 0-0, policy_version 81398 (0.00087) [2022-07-09 04:09:26,766][26022] Updated weights on worker 0-0, policy_version 81408 (0.00082) [2022-07-09 04:09:27,107][25689] Fps is (10 sec: 5850.2, 60 sec: 5719.0, 300 sec: 5715.9). Total num frames: 83362816. Throughput: 0: 6038.1. Samples: 83371596. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:27,107][25689] Avg episode reward: [(0, '-55.193')] [2022-07-09 04:09:28,735][26022] Updated weights on worker 0-0, policy_version 81418 (0.00094) [2022-07-09 04:09:30,252][26022] Updated weights on worker 0-0, policy_version 81428 (0.00090) [2022-07-09 04:09:32,102][26022] Updated weights on worker 0-0, policy_version 81438 (0.00089) [2022-07-09 04:09:32,151][25689] Fps is (10 sec: 5742.0, 60 sec: 5737.0, 300 sec: 5713.1). Total num frames: 83392512. Throughput: 0: 5168.5. Samples: 83389022. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:32,151][25689] Avg episode reward: [(0, '-55.321')] [2022-07-09 04:09:33,786][26022] Updated weights on worker 0-0, policy_version 81448 (0.00086) [2022-07-09 04:09:35,724][26022] Updated weights on worker 0-0, policy_version 81458 (0.00619) [2022-07-09 04:09:37,158][25689] Fps is (10 sec: 5807.1, 60 sec: 5739.9, 300 sec: 5717.8). Total num frames: 83421184. Throughput: 0: 6036.1. Samples: 83423954. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:37,163][25689] Avg episode reward: [(0, '-55.345')] [2022-07-09 04:09:37,387][26022] Updated weights on worker 0-0, policy_version 81468 (0.00086) [2022-07-09 04:09:39,069][26022] Updated weights on worker 0-0, policy_version 81478 (0.00086) [2022-07-09 04:09:40,693][26022] Updated weights on worker 0-0, policy_version 81488 (0.00085) [2022-07-09 04:09:42,168][25689] Fps is (10 sec: 5724.4, 60 sec: 5706.8, 300 sec: 5711.8). Total num frames: 83449856. Throughput: 0: 6047.9. Samples: 83459060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:42,170][25689] Avg episode reward: [(0, '-54.580')] [2022-07-09 04:09:42,809][26022] Updated weights on worker 0-0, policy_version 81498 (0.00085) [2022-07-09 04:09:44,259][26022] Updated weights on worker 0-0, policy_version 81508 (0.00086) [2022-07-09 04:09:46,154][26022] Updated weights on worker 0-0, policy_version 81518 (0.00090) [2022-07-09 04:09:47,210][25689] Fps is (10 sec: 5806.4, 60 sec: 5774.2, 300 sec: 5718.0). Total num frames: 83479552. Throughput: 0: 5213.8. Samples: 83476412. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:47,211][25689] Avg episode reward: [(0, '-55.058')] [2022-07-09 04:09:47,997][26022] Updated weights on worker 0-0, policy_version 81528 (0.00090) [2022-07-09 04:09:49,761][26022] Updated weights on worker 0-0, policy_version 81538 (0.00096) [2022-07-09 04:09:51,538][26022] Updated weights on worker 0-0, policy_version 81548 (0.00087) [2022-07-09 04:09:52,223][25689] Fps is (10 sec: 5805.2, 60 sec: 5740.1, 300 sec: 5714.5). Total num frames: 83508224. Throughput: 0: 6087.1. Samples: 83511202. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 04:09:52,224][25689] Avg episode reward: [(0, '-54.816')] [2022-07-09 04:09:53,149][26022] Updated weights on worker 0-0, policy_version 81558 (0.00080) [2022-07-09 04:09:54,975][26022] Updated weights on worker 0-0, policy_version 81568 (0.00084) [2022-07-09 04:09:56,880][26022] Updated weights on worker 0-0, policy_version 81578 (0.00091) [2022-07-09 04:09:57,227][25689] Fps is (10 sec: 5827.3, 60 sec: 5775.7, 300 sec: 5715.3). Total num frames: 83537920. Throughput: 0: 6075.6. Samples: 83545884. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:09:57,227][25689] Avg episode reward: [(0, '-54.621')] [2022-07-09 04:09:58,545][26022] Updated weights on worker 0-0, policy_version 81588 (0.00105) [2022-07-09 04:10:00,251][26022] Updated weights on worker 0-0, policy_version 81598 (0.00082) [2022-07-09 04:10:02,240][25689] Fps is (10 sec: 5724.5, 60 sec: 5758.3, 300 sec: 5720.3). Total num frames: 83565568. Throughput: 0: 5191.5. Samples: 83563262. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:02,241][25689] Avg episode reward: [(0, '-53.575')] [2022-07-09 04:10:02,822][26022] Updated weights on worker 0-0, policy_version 81608 (0.00079) [2022-07-09 04:10:04,233][26022] Updated weights on worker 0-0, policy_version 81618 (0.00087) [2022-07-09 04:10:06,207][26022] Updated weights on worker 0-0, policy_version 81628 (0.00085) [2022-07-09 04:10:07,313][25689] Fps is (10 sec: 5583.6, 60 sec: 5763.6, 300 sec: 5720.2). Total num frames: 83594240. Throughput: 0: 5939.4. Samples: 83595812. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:07,314][25689] Avg episode reward: [(0, '-53.851')] [2022-07-09 04:10:07,679][26022] Updated weights on worker 0-0, policy_version 81638 (0.00096) [2022-07-09 04:10:09,809][26022] Updated weights on worker 0-0, policy_version 81648 (0.00092) [2022-07-09 04:10:11,332][26022] Updated weights on worker 0-0, policy_version 81658 (0.00093) [2022-07-09 04:10:12,316][25689] Fps is (10 sec: 5488.0, 60 sec: 5715.8, 300 sec: 5710.4). Total num frames: 83620864. Throughput: 0: 5917.1. Samples: 83630096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:12,316][25689] Avg episode reward: [(0, '-54.024')] [2022-07-09 04:10:13,344][26022] Updated weights on worker 0-0, policy_version 81668 (0.00086) [2022-07-09 04:10:15,155][26022] Updated weights on worker 0-0, policy_version 81678 (0.00092) [2022-07-09 04:10:16,903][26022] Updated weights on worker 0-0, policy_version 81688 (0.00092) [2022-07-09 04:10:17,327][25689] Fps is (10 sec: 5726.8, 60 sec: 5766.9, 300 sec: 5718.5). Total num frames: 83651584. Throughput: 0: 5042.4. Samples: 83647236. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:17,327][25689] Avg episode reward: [(0, '-54.398')] [2022-07-09 04:10:18,684][26022] Updated weights on worker 0-0, policy_version 81698 (0.00093) [2022-07-09 04:10:20,371][26022] Updated weights on worker 0-0, policy_version 81708 (0.00087) [2022-07-09 04:10:22,075][26022] Updated weights on worker 0-0, policy_version 81718 (0.00088) [2022-07-09 04:10:22,346][25689] Fps is (10 sec: 5921.5, 60 sec: 5734.1, 300 sec: 5719.7). Total num frames: 83680256. Throughput: 0: 5902.8. Samples: 83681942. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:22,346][25689] Avg episode reward: [(0, '-53.192')] [2022-07-09 04:10:23,922][26022] Updated weights on worker 0-0, policy_version 81728 (0.00089) [2022-07-09 04:10:24,245][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:10:24,267][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000081730_83691520.pth [2022-07-09 04:10:24,268][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000079717_81630208.pth [2022-07-09 04:10:25,603][26022] Updated weights on worker 0-0, policy_version 81738 (0.00089) [2022-07-09 04:10:27,397][25689] Fps is (10 sec: 5592.8, 60 sec: 5723.8, 300 sec: 5711.9). Total num frames: 83707904. Throughput: 0: 5992.7. Samples: 83716166. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:27,397][25689] Avg episode reward: [(0, '-53.251')] [2022-07-09 04:10:27,579][26022] Updated weights on worker 0-0, policy_version 81748 (0.00472) [2022-07-09 04:10:29,586][26022] Updated weights on worker 0-0, policy_version 81758 (0.00088) [2022-07-09 04:10:31,020][26022] Updated weights on worker 0-0, policy_version 81768 (0.00088) [2022-07-09 04:10:32,413][25689] Fps is (10 sec: 5594.3, 60 sec: 5709.4, 300 sec: 5715.1). Total num frames: 83736576. Throughput: 0: 5134.5. Samples: 83733286. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:32,414][25689] Avg episode reward: [(0, '-53.474')] [2022-07-09 04:10:33,022][26022] Updated weights on worker 0-0, policy_version 81778 (0.00088) [2022-07-09 04:10:34,625][26022] Updated weights on worker 0-0, policy_version 81788 (0.00088) [2022-07-09 04:10:36,591][26022] Updated weights on worker 0-0, policy_version 81798 (0.00084) [2022-07-09 04:10:37,428][25689] Fps is (10 sec: 5818.6, 60 sec: 5725.7, 300 sec: 5715.5). Total num frames: 83766272. Throughput: 0: 5974.9. Samples: 83767340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:37,429][25689] Avg episode reward: [(0, '-53.366')] [2022-07-09 04:10:38,459][26022] Updated weights on worker 0-0, policy_version 81808 (0.00098) [2022-07-09 04:10:40,140][26022] Updated weights on worker 0-0, policy_version 81818 (0.00085) [2022-07-09 04:10:42,119][26022] Updated weights on worker 0-0, policy_version 81828 (0.00095) [2022-07-09 04:10:42,457][25689] Fps is (10 sec: 5811.1, 60 sec: 5723.9, 300 sec: 5712.6). Total num frames: 83794944. Throughput: 0: 5950.4. Samples: 83801614. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:42,458][25689] Avg episode reward: [(0, '-52.378')] [2022-07-09 04:10:43,923][26022] Updated weights on worker 0-0, policy_version 81838 (0.00088) [2022-07-09 04:10:45,669][26022] Updated weights on worker 0-0, policy_version 81848 (0.00089) [2022-07-09 04:10:47,482][26022] Updated weights on worker 0-0, policy_version 81858 (0.00087) [2022-07-09 04:10:47,501][25689] Fps is (10 sec: 5591.0, 60 sec: 5689.7, 300 sec: 5708.4). Total num frames: 83822592. Throughput: 0: 5089.1. Samples: 83818480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:47,502][25689] Avg episode reward: [(0, '-52.252')] [2022-07-09 04:10:49,409][26022] Updated weights on worker 0-0, policy_version 81868 (0.00094) [2022-07-09 04:10:51,037][26022] Updated weights on worker 0-0, policy_version 81878 (0.00083) [2022-07-09 04:10:52,531][25689] Fps is (10 sec: 5387.7, 60 sec: 5654.2, 300 sec: 5702.7). Total num frames: 83849216. Throughput: 0: 5909.6. Samples: 83852172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:52,531][25689] Avg episode reward: [(0, '-52.998')] [2022-07-09 04:10:53,081][26022] Updated weights on worker 0-0, policy_version 81888 (0.00088) [2022-07-09 04:10:54,573][26022] Updated weights on worker 0-0, policy_version 81898 (0.00080) [2022-07-09 04:10:56,702][26022] Updated weights on worker 0-0, policy_version 81908 (0.00389) [2022-07-09 04:10:57,542][25689] Fps is (10 sec: 5711.0, 60 sec: 5670.4, 300 sec: 5703.6). Total num frames: 83879936. Throughput: 0: 5906.9. Samples: 83886154. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:10:57,543][25689] Avg episode reward: [(0, '-53.077')] [2022-07-09 04:10:58,180][26022] Updated weights on worker 0-0, policy_version 81918 (0.00094) [2022-07-09 04:11:00,265][26022] Updated weights on worker 0-0, policy_version 81928 (0.00460) [2022-07-09 04:11:02,498][26022] Updated weights on worker 0-0, policy_version 81938 (0.00089) [2022-07-09 04:11:02,557][25689] Fps is (10 sec: 5514.9, 60 sec: 5619.4, 300 sec: 5698.5). Total num frames: 83904512. Throughput: 0: 5048.3. Samples: 83903084. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:02,558][25689] Avg episode reward: [(0, '-53.823')] [2022-07-09 04:11:04,084][26022] Updated weights on worker 0-0, policy_version 81948 (0.00092) [2022-07-09 04:11:05,915][26022] Updated weights on worker 0-0, policy_version 81958 (0.00088) [2022-07-09 04:11:07,607][25689] Fps is (10 sec: 5290.5, 60 sec: 5621.5, 300 sec: 5697.8). Total num frames: 83933184. Throughput: 0: 5822.3. Samples: 83935542. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:07,608][25689] Avg episode reward: [(0, '-54.210')] [2022-07-09 04:11:07,886][26022] Updated weights on worker 0-0, policy_version 81968 (0.00086) [2022-07-09 04:11:09,405][26022] Updated weights on worker 0-0, policy_version 81978 (0.00086) [2022-07-09 04:11:11,454][26022] Updated weights on worker 0-0, policy_version 81988 (0.00084) [2022-07-09 04:11:12,620][25689] Fps is (10 sec: 5901.9, 60 sec: 5688.5, 300 sec: 5701.1). Total num frames: 83963904. Throughput: 0: 5855.5. Samples: 83969808. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:12,623][25689] Avg episode reward: [(0, '-54.300')] [2022-07-09 04:11:13,076][26022] Updated weights on worker 0-0, policy_version 81998 (0.00078) [2022-07-09 04:11:14,871][26022] Updated weights on worker 0-0, policy_version 82008 (0.00085) [2022-07-09 04:11:16,876][26022] Updated weights on worker 0-0, policy_version 82018 (0.00092) [2022-07-09 04:11:17,639][25689] Fps is (10 sec: 5818.5, 60 sec: 5636.8, 300 sec: 5697.5). Total num frames: 83991552. Throughput: 0: 5030.2. Samples: 83987242. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:17,644][25689] Avg episode reward: [(0, '-54.028')] [2022-07-09 04:11:18,429][26022] Updated weights on worker 0-0, policy_version 82028 (0.00091) [2022-07-09 04:11:20,374][26022] Updated weights on worker 0-0, policy_version 82038 (0.00087) [2022-07-09 04:11:21,996][26022] Updated weights on worker 0-0, policy_version 82048 (0.00084) [2022-07-09 04:11:22,648][25689] Fps is (10 sec: 5616.6, 60 sec: 5637.7, 300 sec: 5699.2). Total num frames: 84020224. Throughput: 0: 5898.8. Samples: 84021594. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:22,648][25689] Avg episode reward: [(0, '-53.310')] [2022-07-09 04:11:23,723][26022] Updated weights on worker 0-0, policy_version 82058 (0.00365) [2022-07-09 04:11:25,743][26022] Updated weights on worker 0-0, policy_version 82068 (0.00089) [2022-07-09 04:11:27,391][26022] Updated weights on worker 0-0, policy_version 82078 (0.01254) [2022-07-09 04:11:27,696][25689] Fps is (10 sec: 5701.9, 60 sec: 5655.0, 300 sec: 5691.5). Total num frames: 84048896. Throughput: 0: 5978.3. Samples: 84055638. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:27,696][25689] Avg episode reward: [(0, '-52.537')] [2022-07-09 04:11:29,252][26022] Updated weights on worker 0-0, policy_version 82088 (0.00086) [2022-07-09 04:11:31,225][26022] Updated weights on worker 0-0, policy_version 82098 (0.00086) [2022-07-09 04:11:32,723][25689] Fps is (10 sec: 5691.8, 60 sec: 5654.0, 300 sec: 5695.1). Total num frames: 84077568. Throughput: 0: 5099.1. Samples: 84072312. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 04:11:32,723][25689] Avg episode reward: [(0, '-51.834')] [2022-07-09 04:11:32,783][26022] Updated weights on worker 0-0, policy_version 82108 (0.00094) [2022-07-09 04:11:34,785][26022] Updated weights on worker 0-0, policy_version 82118 (0.00086) [2022-07-09 04:11:36,454][26022] Updated weights on worker 0-0, policy_version 82128 (0.00092) [2022-07-09 04:11:37,726][25689] Fps is (10 sec: 5614.9, 60 sec: 5621.1, 300 sec: 5693.0). Total num frames: 84105216. Throughput: 0: 5933.1. Samples: 84106424. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:11:37,727][25689] Avg episode reward: [(0, '-51.038')] [2022-07-09 04:11:38,371][26022] Updated weights on worker 0-0, policy_version 82138 (0.00094) [2022-07-09 04:11:40,090][26022] Updated weights on worker 0-0, policy_version 82148 (0.00075) [2022-07-09 04:11:41,863][26022] Updated weights on worker 0-0, policy_version 82158 (0.00089) [2022-07-09 04:11:42,730][25689] Fps is (10 sec: 5525.4, 60 sec: 5606.5, 300 sec: 5688.9). Total num frames: 84132864. Throughput: 0: 5942.5. Samples: 84140934. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:11:42,731][25689] Avg episode reward: [(0, '-52.073')] [2022-07-09 04:11:43,711][26022] Updated weights on worker 0-0, policy_version 82168 (0.00092) [2022-07-09 04:11:45,671][26022] Updated weights on worker 0-0, policy_version 82178 (0.00090) [2022-07-09 04:11:46,998][26022] Updated weights on worker 0-0, policy_version 82188 (0.00089) [2022-07-09 04:11:47,792][25689] Fps is (10 sec: 5900.3, 60 sec: 5672.8, 300 sec: 5698.4). Total num frames: 84164608. Throughput: 0: 5094.1. Samples: 84158012. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:11:47,793][25689] Avg episode reward: [(0, '-51.803')] [2022-07-09 04:11:49,311][26022] Updated weights on worker 0-0, policy_version 82198 (0.00085) [2022-07-09 04:11:50,608][26022] Updated weights on worker 0-0, policy_version 82208 (0.00096) [2022-07-09 04:11:52,613][26022] Updated weights on worker 0-0, policy_version 82218 (0.00087) [2022-07-09 04:11:52,871][25689] Fps is (10 sec: 5857.1, 60 sec: 5685.1, 300 sec: 5694.0). Total num frames: 84192256. Throughput: 0: 5969.1. Samples: 84192576. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:11:52,871][25689] Avg episode reward: [(0, '-51.655')] [2022-07-09 04:11:54,244][26022] Updated weights on worker 0-0, policy_version 82228 (0.00096) [2022-07-09 04:11:56,247][26022] Updated weights on worker 0-0, policy_version 82238 (0.00085) [2022-07-09 04:11:57,884][25689] Fps is (10 sec: 5580.9, 60 sec: 5651.1, 300 sec: 5691.6). Total num frames: 84220928. Throughput: 0: 5986.9. Samples: 84227104. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:11:57,884][25689] Avg episode reward: [(0, '-52.208')] [2022-07-09 04:11:58,015][26022] Updated weights on worker 0-0, policy_version 82248 (0.00086) [2022-07-09 04:11:59,798][26022] Updated weights on worker 0-0, policy_version 82258 (0.00096) [2022-07-09 04:12:01,850][26022] Updated weights on worker 0-0, policy_version 82268 (0.00093) [2022-07-09 04:12:02,925][25689] Fps is (10 sec: 5499.4, 60 sec: 5682.5, 300 sec: 5696.9). Total num frames: 84247552. Throughput: 0: 5116.5. Samples: 84244262. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:02,928][25689] Avg episode reward: [(0, '-52.289')] [2022-07-09 04:12:04,054][26022] Updated weights on worker 0-0, policy_version 82278 (0.00085) [2022-07-09 04:12:05,664][26022] Updated weights on worker 0-0, policy_version 82288 (0.00093) [2022-07-09 04:12:07,518][26022] Updated weights on worker 0-0, policy_version 82298 (0.00088) [2022-07-09 04:12:07,997][25689] Fps is (10 sec: 5568.9, 60 sec: 5697.4, 300 sec: 5699.1). Total num frames: 84277248. Throughput: 0: 5832.0. Samples: 84275848. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:07,999][25689] Avg episode reward: [(0, '-52.528')] [2022-07-09 04:12:09,133][26022] Updated weights on worker 0-0, policy_version 82308 (0.00083) [2022-07-09 04:12:11,011][26022] Updated weights on worker 0-0, policy_version 82318 (0.00089) [2022-07-09 04:12:12,811][26022] Updated weights on worker 0-0, policy_version 82328 (0.00087) [2022-07-09 04:12:13,018][25689] Fps is (10 sec: 5682.0, 60 sec: 5645.8, 300 sec: 5690.1). Total num frames: 84304896. Throughput: 0: 5856.8. Samples: 84310574. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:13,018][25689] Avg episode reward: [(0, '-53.334')] [2022-07-09 04:12:14,419][26022] Updated weights on worker 0-0, policy_version 82338 (0.00091) [2022-07-09 04:12:16,379][26022] Updated weights on worker 0-0, policy_version 82348 (0.00089) [2022-07-09 04:12:18,065][25689] Fps is (10 sec: 5594.3, 60 sec: 5660.1, 300 sec: 5697.3). Total num frames: 84333568. Throughput: 0: 4992.7. Samples: 84327860. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:18,072][25689] Avg episode reward: [(0, '-53.083')] [2022-07-09 04:12:18,079][26022] Updated weights on worker 0-0, policy_version 82358 (0.00086) [2022-07-09 04:12:19,768][26022] Updated weights on worker 0-0, policy_version 82368 (0.00083) [2022-07-09 04:12:21,582][26022] Updated weights on worker 0-0, policy_version 82378 (0.00085) [2022-07-09 04:12:23,111][25689] Fps is (10 sec: 5884.4, 60 sec: 5690.5, 300 sec: 5698.2). Total num frames: 84364288. Throughput: 0: 5861.5. Samples: 84362578. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:23,111][25689] Avg episode reward: [(0, '-53.307')] [2022-07-09 04:12:23,190][26022] Updated weights on worker 0-0, policy_version 82388 (0.00102) [2022-07-09 04:12:24,273][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:12:24,286][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000082394_84371456.pth [2022-07-09 04:12:24,286][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000080391_82320384.pth [2022-07-09 04:12:25,294][26022] Updated weights on worker 0-0, policy_version 82398 (0.00075) [2022-07-09 04:12:26,951][26022] Updated weights on worker 0-0, policy_version 82408 (0.00089) [2022-07-09 04:12:28,206][25689] Fps is (10 sec: 5755.3, 60 sec: 5669.1, 300 sec: 5693.3). Total num frames: 84391936. Throughput: 0: 6005.3. Samples: 84397208. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:28,207][25689] Avg episode reward: [(0, '-53.483')] [2022-07-09 04:12:28,948][26022] Updated weights on worker 0-0, policy_version 82418 (0.00113) [2022-07-09 04:12:30,499][26022] Updated weights on worker 0-0, policy_version 82428 (0.00105) [2022-07-09 04:12:32,340][26022] Updated weights on worker 0-0, policy_version 82438 (0.00086) [2022-07-09 04:12:33,240][25689] Fps is (10 sec: 5661.2, 60 sec: 5685.4, 300 sec: 5696.4). Total num frames: 84421632. Throughput: 0: 5135.0. Samples: 84414416. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:33,240][25689] Avg episode reward: [(0, '-53.296')] [2022-07-09 04:12:34,180][26022] Updated weights on worker 0-0, policy_version 82448 (0.00089) [2022-07-09 04:12:35,943][26022] Updated weights on worker 0-0, policy_version 82458 (0.00085) [2022-07-09 04:12:37,748][26022] Updated weights on worker 0-0, policy_version 82468 (0.00087) [2022-07-09 04:12:38,244][25689] Fps is (10 sec: 5814.7, 60 sec: 5702.3, 300 sec: 5696.9). Total num frames: 84450304. Throughput: 0: 5976.9. Samples: 84448472. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:38,245][25689] Avg episode reward: [(0, '-52.719')] [2022-07-09 04:12:39,427][26022] Updated weights on worker 0-0, policy_version 82478 (0.00097) [2022-07-09 04:12:41,398][26022] Updated weights on worker 0-0, policy_version 82488 (0.00096) [2022-07-09 04:12:43,252][26022] Updated weights on worker 0-0, policy_version 82498 (0.00088) [2022-07-09 04:12:43,293][25689] Fps is (10 sec: 5602.6, 60 sec: 5698.1, 300 sec: 5698.5). Total num frames: 84477952. Throughput: 0: 5954.3. Samples: 84482748. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:43,293][25689] Avg episode reward: [(0, '-52.865')] [2022-07-09 04:12:44,842][26022] Updated weights on worker 0-0, policy_version 82508 (0.00092) [2022-07-09 04:12:46,820][26022] Updated weights on worker 0-0, policy_version 82518 (0.00092) [2022-07-09 04:12:48,358][25689] Fps is (10 sec: 5670.1, 60 sec: 5664.0, 300 sec: 5694.2). Total num frames: 84507648. Throughput: 0: 5098.0. Samples: 84499940. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:48,358][25689] Avg episode reward: [(0, '-52.531')] [2022-07-09 04:12:48,515][26022] Updated weights on worker 0-0, policy_version 82528 (0.00095) [2022-07-09 04:12:50,437][26022] Updated weights on worker 0-0, policy_version 82538 (0.00095) [2022-07-09 04:12:52,050][26022] Updated weights on worker 0-0, policy_version 82548 (0.00073) [2022-07-09 04:12:53,437][25689] Fps is (10 sec: 5753.6, 60 sec: 5680.7, 300 sec: 5700.1). Total num frames: 84536320. Throughput: 0: 5935.1. Samples: 84534288. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:53,438][25689] Avg episode reward: [(0, '-53.119')] [2022-07-09 04:12:53,946][26022] Updated weights on worker 0-0, policy_version 82558 (0.00086) [2022-07-09 04:12:55,522][26022] Updated weights on worker 0-0, policy_version 82568 (0.00089) [2022-07-09 04:12:57,500][26022] Updated weights on worker 0-0, policy_version 82578 (0.00088) [2022-07-09 04:12:58,500][25689] Fps is (10 sec: 5653.9, 60 sec: 5676.1, 300 sec: 5696.8). Total num frames: 84564992. Throughput: 0: 5937.1. Samples: 84568732. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:12:58,500][25689] Avg episode reward: [(0, '-53.063')] [2022-07-09 04:12:59,159][26022] Updated weights on worker 0-0, policy_version 82588 (0.00083) [2022-07-09 04:13:01,070][26022] Updated weights on worker 0-0, policy_version 82598 (0.00093) [2022-07-09 04:13:03,249][26022] Updated weights on worker 0-0, policy_version 82608 (0.00087) [2022-07-09 04:13:03,517][25689] Fps is (10 sec: 5485.6, 60 sec: 5678.4, 300 sec: 5694.4). Total num frames: 84591616. Throughput: 0: 5832.8. Samples: 84600716. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:13:03,518][25689] Avg episode reward: [(0, '-52.782')] [2022-07-09 04:13:05,086][26022] Updated weights on worker 0-0, policy_version 82618 (0.00087) [2022-07-09 04:13:06,843][26022] Updated weights on worker 0-0, policy_version 82628 (0.00105) [2022-07-09 04:13:08,577][25689] Fps is (10 sec: 5589.1, 60 sec: 5679.6, 300 sec: 5698.1). Total num frames: 84621312. Throughput: 0: 5837.7. Samples: 84617974. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:13:08,577][25689] Avg episode reward: [(0, '-52.819')] [2022-07-09 04:13:08,588][26022] Updated weights on worker 0-0, policy_version 82638 (0.00095) [2022-07-09 04:13:10,321][26022] Updated weights on worker 0-0, policy_version 82648 (0.00089) [2022-07-09 04:13:12,133][26022] Updated weights on worker 0-0, policy_version 82658 (0.00082) [2022-07-09 04:13:13,647][25689] Fps is (10 sec: 5762.0, 60 sec: 5691.7, 300 sec: 5690.1). Total num frames: 84649984. Throughput: 0: 5849.8. Samples: 84652512. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:13:13,648][25689] Avg episode reward: [(0, '-53.319')] [2022-07-09 04:13:13,940][26022] Updated weights on worker 0-0, policy_version 82668 (0.00085) [2022-07-09 04:13:15,762][26022] Updated weights on worker 0-0, policy_version 82678 (0.00412) [2022-07-09 04:13:17,335][26022] Updated weights on worker 0-0, policy_version 82688 (0.00086) [2022-07-09 04:13:18,666][25689] Fps is (10 sec: 5683.5, 60 sec: 5694.4, 300 sec: 5693.3). Total num frames: 84678656. Throughput: 0: 5874.7. Samples: 84687202. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:18,668][25689] Avg episode reward: [(0, '-53.049')] [2022-07-09 04:13:19,255][26022] Updated weights on worker 0-0, policy_version 82698 (0.00086) [2022-07-09 04:13:21,024][26022] Updated weights on worker 0-0, policy_version 82708 (0.00105) [2022-07-09 04:13:22,804][26022] Updated weights on worker 0-0, policy_version 82718 (0.00086) [2022-07-09 04:13:23,676][25689] Fps is (10 sec: 5717.8, 60 sec: 5663.9, 300 sec: 5690.6). Total num frames: 84707328. Throughput: 0: 5147.9. Samples: 84704492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:23,678][25689] Avg episode reward: [(0, '-53.415')] [2022-07-09 04:13:24,554][26022] Updated weights on worker 0-0, policy_version 82728 (0.00097) [2022-07-09 04:13:26,455][26022] Updated weights on worker 0-0, policy_version 82738 (0.00100) [2022-07-09 04:13:28,250][26022] Updated weights on worker 0-0, policy_version 82748 (0.00088) [2022-07-09 04:13:28,730][25689] Fps is (10 sec: 5697.9, 60 sec: 5684.7, 300 sec: 5690.7). Total num frames: 84736000. Throughput: 0: 5988.6. Samples: 84738666. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:28,731][25689] Avg episode reward: [(0, '-53.553')] [2022-07-09 04:13:30,018][26022] Updated weights on worker 0-0, policy_version 82758 (0.00093) [2022-07-09 04:13:31,895][26022] Updated weights on worker 0-0, policy_version 82768 (0.00096) [2022-07-09 04:13:33,608][26022] Updated weights on worker 0-0, policy_version 82778 (0.00088) [2022-07-09 04:13:33,808][25689] Fps is (10 sec: 5760.8, 60 sec: 5680.6, 300 sec: 5693.3). Total num frames: 84765696. Throughput: 0: 5951.7. Samples: 84772504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:33,810][25689] Avg episode reward: [(0, '-53.503')] [2022-07-09 04:13:35,340][26022] Updated weights on worker 0-0, policy_version 82788 (0.00103) [2022-07-09 04:13:37,144][26022] Updated weights on worker 0-0, policy_version 82798 (0.00092) [2022-07-09 04:13:38,820][25689] Fps is (10 sec: 5683.6, 60 sec: 5663.0, 300 sec: 5683.1). Total num frames: 84793344. Throughput: 0: 5101.6. Samples: 84790018. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:38,821][25689] Avg episode reward: [(0, '-53.114')] [2022-07-09 04:13:38,968][26022] Updated weights on worker 0-0, policy_version 82808 (0.00090) [2022-07-09 04:13:40,610][26022] Updated weights on worker 0-0, policy_version 82818 (0.00091) [2022-07-09 04:13:42,631][26022] Updated weights on worker 0-0, policy_version 82828 (0.00089) [2022-07-09 04:13:43,827][25689] Fps is (10 sec: 5723.6, 60 sec: 5700.7, 300 sec: 5697.5). Total num frames: 84823040. Throughput: 0: 5951.6. Samples: 84824422. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:43,828][25689] Avg episode reward: [(0, '-53.110')] [2022-07-09 04:13:44,343][26022] Updated weights on worker 0-0, policy_version 82838 (0.00089) [2022-07-09 04:13:46,141][26022] Updated weights on worker 0-0, policy_version 82848 (0.00086) [2022-07-09 04:13:47,992][26022] Updated weights on worker 0-0, policy_version 82858 (0.00085) [2022-07-09 04:13:48,885][25689] Fps is (10 sec: 5900.6, 60 sec: 5701.3, 300 sec: 5693.1). Total num frames: 84852736. Throughput: 0: 5973.0. Samples: 84859050. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:48,886][25689] Avg episode reward: [(0, '-53.315')] [2022-07-09 04:13:49,630][26022] Updated weights on worker 0-0, policy_version 82868 (0.00089) [2022-07-09 04:13:51,591][26022] Updated weights on worker 0-0, policy_version 82878 (0.00083) [2022-07-09 04:13:53,165][26022] Updated weights on worker 0-0, policy_version 82888 (0.00085) [2022-07-09 04:13:53,961][25689] Fps is (10 sec: 5658.4, 60 sec: 5684.7, 300 sec: 5692.1). Total num frames: 84880384. Throughput: 0: 5133.9. Samples: 84875968. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:53,962][25689] Avg episode reward: [(0, '-53.666')] [2022-07-09 04:13:55,116][26022] Updated weights on worker 0-0, policy_version 82898 (0.00090) [2022-07-09 04:13:56,937][26022] Updated weights on worker 0-0, policy_version 82908 (0.00087) [2022-07-09 04:13:58,684][26022] Updated weights on worker 0-0, policy_version 82918 (0.00086) [2022-07-09 04:13:58,978][25689] Fps is (10 sec: 5681.5, 60 sec: 5706.0, 300 sec: 5695.4). Total num frames: 84910080. Throughput: 0: 5983.7. Samples: 84910640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:13:58,979][25689] Avg episode reward: [(0, '-53.856')] [2022-07-09 04:14:00,578][26022] Updated weights on worker 0-0, policy_version 82928 (0.00102) [2022-07-09 04:14:02,571][26022] Updated weights on worker 0-0, policy_version 82938 (0.00095) [2022-07-09 04:14:03,999][25689] Fps is (10 sec: 5407.0, 60 sec: 5671.8, 300 sec: 5683.7). Total num frames: 84934656. Throughput: 0: 5852.7. Samples: 84942480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:03,999][25689] Avg episode reward: [(0, '-53.803')] [2022-07-09 04:14:04,615][26022] Updated weights on worker 0-0, policy_version 82948 (0.00088) [2022-07-09 04:14:06,363][26022] Updated weights on worker 0-0, policy_version 82958 (0.00089) [2022-07-09 04:14:08,054][26022] Updated weights on worker 0-0, policy_version 82968 (0.00090) [2022-07-09 04:14:09,097][25689] Fps is (10 sec: 5363.4, 60 sec: 5668.2, 300 sec: 5682.5). Total num frames: 84964352. Throughput: 0: 4974.0. Samples: 84959586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:09,098][25689] Avg episode reward: [(0, '-53.528')] [2022-07-09 04:14:09,943][26022] Updated weights on worker 0-0, policy_version 82978 (0.00090) [2022-07-09 04:14:11,521][26022] Updated weights on worker 0-0, policy_version 82988 (0.00086) [2022-07-09 04:14:13,472][26022] Updated weights on worker 0-0, policy_version 82998 (0.00082) [2022-07-09 04:14:14,138][25689] Fps is (10 sec: 5857.6, 60 sec: 5687.9, 300 sec: 5688.8). Total num frames: 84994048. Throughput: 0: 5857.0. Samples: 84994142. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:14,139][25689] Avg episode reward: [(0, '-53.265')] [2022-07-09 04:14:15,186][26022] Updated weights on worker 0-0, policy_version 83008 (0.00086) [2022-07-09 04:14:17,039][26022] Updated weights on worker 0-0, policy_version 83018 (0.00093) [2022-07-09 04:14:18,699][26022] Updated weights on worker 0-0, policy_version 83028 (0.00090) [2022-07-09 04:14:19,191][25689] Fps is (10 sec: 5782.8, 60 sec: 5684.7, 300 sec: 5681.5). Total num frames: 85022720. Throughput: 0: 5836.2. Samples: 85028604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:19,191][25689] Avg episode reward: [(0, '-53.978')] [2022-07-09 04:14:20,579][26022] Updated weights on worker 0-0, policy_version 83038 (0.00084) [2022-07-09 04:14:22,403][26022] Updated weights on worker 0-0, policy_version 83048 (0.00086) [2022-07-09 04:14:24,257][25689] Fps is (10 sec: 5566.0, 60 sec: 5662.6, 300 sec: 5679.1). Total num frames: 85050368. Throughput: 0: 5102.0. Samples: 85045836. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:24,257][25689] Avg episode reward: [(0, '-52.850')] [2022-07-09 04:14:24,297][26022] Updated weights on worker 0-0, policy_version 83058 (0.00085) [2022-07-09 04:14:24,302][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:14:24,316][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000083059_85052416.pth [2022-07-09 04:14:24,316][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000081058_83003392.pth [2022-07-09 04:14:26,027][26022] Updated weights on worker 0-0, policy_version 83068 (0.00097) [2022-07-09 04:14:27,842][26022] Updated weights on worker 0-0, policy_version 83078 (0.00084) [2022-07-09 04:14:29,307][25689] Fps is (10 sec: 5769.5, 60 sec: 5696.7, 300 sec: 5682.4). Total num frames: 85081088. Throughput: 0: 5960.5. Samples: 85080048. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:29,308][25689] Avg episode reward: [(0, '-53.441')] [2022-07-09 04:14:29,554][26022] Updated weights on worker 0-0, policy_version 83088 (0.00080) [2022-07-09 04:14:31,455][26022] Updated weights on worker 0-0, policy_version 83098 (0.00091) [2022-07-09 04:14:33,112][26022] Updated weights on worker 0-0, policy_version 83108 (0.00094) [2022-07-09 04:14:34,329][25689] Fps is (10 sec: 5693.4, 60 sec: 5651.3, 300 sec: 5675.3). Total num frames: 85107712. Throughput: 0: 5938.0. Samples: 85114034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:34,330][25689] Avg episode reward: [(0, '-53.662')] [2022-07-09 04:14:34,963][26022] Updated weights on worker 0-0, policy_version 83118 (0.00088) [2022-07-09 04:14:36,713][26022] Updated weights on worker 0-0, policy_version 83128 (0.00087) [2022-07-09 04:14:38,638][26022] Updated weights on worker 0-0, policy_version 83138 (0.00086) [2022-07-09 04:14:39,355][25689] Fps is (10 sec: 5707.3, 60 sec: 5700.6, 300 sec: 5681.9). Total num frames: 85138432. Throughput: 0: 5091.7. Samples: 85131274. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:39,357][25689] Avg episode reward: [(0, '-53.982')] [2022-07-09 04:14:40,257][26022] Updated weights on worker 0-0, policy_version 83148 (0.00087) [2022-07-09 04:14:42,070][26022] Updated weights on worker 0-0, policy_version 83158 (0.00089) [2022-07-09 04:14:43,873][26022] Updated weights on worker 0-0, policy_version 83168 (0.00086) [2022-07-09 04:14:44,385][25689] Fps is (10 sec: 5702.1, 60 sec: 5647.7, 300 sec: 5671.8). Total num frames: 85165056. Throughput: 0: 5966.3. Samples: 85165930. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:44,387][25689] Avg episode reward: [(0, '-53.774')] [2022-07-09 04:14:45,634][26022] Updated weights on worker 0-0, policy_version 83178 (0.00092) [2022-07-09 04:14:47,566][26022] Updated weights on worker 0-0, policy_version 83188 (0.00089) [2022-07-09 04:14:49,133][26022] Updated weights on worker 0-0, policy_version 83198 (0.00085) [2022-07-09 04:14:49,451][25689] Fps is (10 sec: 5679.8, 60 sec: 5663.9, 300 sec: 5677.6). Total num frames: 85195776. Throughput: 0: 5981.8. Samples: 85200542. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:49,451][25689] Avg episode reward: [(0, '-54.511')] [2022-07-09 04:14:51,028][26022] Updated weights on worker 0-0, policy_version 83208 (0.00101) [2022-07-09 04:14:52,825][26022] Updated weights on worker 0-0, policy_version 83218 (0.00089) [2022-07-09 04:14:54,500][25689] Fps is (10 sec: 5972.8, 60 sec: 5700.2, 300 sec: 5676.8). Total num frames: 85225472. Throughput: 0: 5143.5. Samples: 85217788. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 04:14:54,502][25689] Avg episode reward: [(0, '-53.990')] [2022-07-09 04:14:54,511][26022] Updated weights on worker 0-0, policy_version 83228 (0.00094) [2022-07-09 04:14:56,652][26022] Updated weights on worker 0-0, policy_version 83238 (0.00086) [2022-07-09 04:14:58,061][26022] Updated weights on worker 0-0, policy_version 83248 (0.00090) [2022-07-09 04:14:59,539][25689] Fps is (10 sec: 5583.0, 60 sec: 5647.5, 300 sec: 5672.8). Total num frames: 85252096. Throughput: 0: 5988.7. Samples: 85252150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:14:59,539][25689] Avg episode reward: [(0, '-53.590')] [2022-07-09 04:15:00,154][26022] Updated weights on worker 0-0, policy_version 83258 (0.00091) [2022-07-09 04:15:01,875][26022] Updated weights on worker 0-0, policy_version 83268 (0.00095) [2022-07-09 04:15:04,013][26022] Updated weights on worker 0-0, policy_version 83278 (0.00090) [2022-07-09 04:15:04,548][25689] Fps is (10 sec: 5299.4, 60 sec: 5682.3, 300 sec: 5667.2). Total num frames: 85278720. Throughput: 0: 5861.9. Samples: 85284124. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:04,549][25689] Avg episode reward: [(0, '-53.150')] [2022-07-09 04:15:05,912][26022] Updated weights on worker 0-0, policy_version 83288 (0.00088) [2022-07-09 04:15:07,706][26022] Updated weights on worker 0-0, policy_version 83298 (0.00089) [2022-07-09 04:15:09,327][26022] Updated weights on worker 0-0, policy_version 83308 (0.00090) [2022-07-09 04:15:09,652][25689] Fps is (10 sec: 5569.1, 60 sec: 5681.9, 300 sec: 5675.6). Total num frames: 85308416. Throughput: 0: 4966.7. Samples: 85300874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:09,652][25689] Avg episode reward: [(0, '-53.821')] [2022-07-09 04:15:11,163][26022] Updated weights on worker 0-0, policy_version 83318 (0.00093) [2022-07-09 04:15:12,788][26022] Updated weights on worker 0-0, policy_version 83328 (0.00088) [2022-07-09 04:15:14,744][25689] Fps is (10 sec: 5724.5, 60 sec: 5660.1, 300 sec: 5667.1). Total num frames: 85337088. Throughput: 0: 5820.1. Samples: 85335610. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:14,745][25689] Avg episode reward: [(0, '-53.978')] [2022-07-09 04:15:14,850][26022] Updated weights on worker 0-0, policy_version 83338 (0.00086) [2022-07-09 04:15:16,442][26022] Updated weights on worker 0-0, policy_version 83348 (0.00098) [2022-07-09 04:15:18,303][26022] Updated weights on worker 0-0, policy_version 83358 (0.00094) [2022-07-09 04:15:19,780][25689] Fps is (10 sec: 5762.8, 60 sec: 5678.6, 300 sec: 5670.3). Total num frames: 85366784. Throughput: 0: 5820.4. Samples: 85369962. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:19,780][25689] Avg episode reward: [(0, '-53.355')] [2022-07-09 04:15:20,170][26022] Updated weights on worker 0-0, policy_version 83368 (0.00101) [2022-07-09 04:15:21,803][26022] Updated weights on worker 0-0, policy_version 83378 (0.00097) [2022-07-09 04:15:23,778][26022] Updated weights on worker 0-0, policy_version 83388 (0.00089) [2022-07-09 04:15:24,834][25689] Fps is (10 sec: 5886.6, 60 sec: 5713.6, 300 sec: 5677.1). Total num frames: 85396480. Throughput: 0: 5927.5. Samples: 85404366. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:24,834][25689] Avg episode reward: [(0, '-54.248')] [2022-07-09 04:15:25,439][26022] Updated weights on worker 0-0, policy_version 83398 (0.00107) [2022-07-09 04:15:27,335][26022] Updated weights on worker 0-0, policy_version 83408 (0.00083) [2022-07-09 04:15:28,982][26022] Updated weights on worker 0-0, policy_version 83418 (0.00092) [2022-07-09 04:15:29,891][25689] Fps is (10 sec: 5570.2, 60 sec: 5645.4, 300 sec: 5669.4). Total num frames: 85423104. Throughput: 0: 5942.6. Samples: 85421148. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:29,891][25689] Avg episode reward: [(0, '-54.693')] [2022-07-09 04:15:30,971][26022] Updated weights on worker 0-0, policy_version 83428 (0.00088) [2022-07-09 04:15:32,615][26022] Updated weights on worker 0-0, policy_version 83438 (0.00088) [2022-07-09 04:15:34,612][26022] Updated weights on worker 0-0, policy_version 83448 (0.00086) [2022-07-09 04:15:34,940][25689] Fps is (10 sec: 5370.1, 60 sec: 5659.6, 300 sec: 5661.9). Total num frames: 85450752. Throughput: 0: 5933.9. Samples: 85455450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:34,941][25689] Avg episode reward: [(0, '-54.844')] [2022-07-09 04:15:36,342][26022] Updated weights on worker 0-0, policy_version 83458 (0.00096) [2022-07-09 04:15:38,115][26022] Updated weights on worker 0-0, policy_version 83468 (0.00084) [2022-07-09 04:15:39,957][25689] Fps is (10 sec: 5696.8, 60 sec: 5643.6, 300 sec: 5665.6). Total num frames: 85480448. Throughput: 0: 5927.4. Samples: 85489558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:39,957][25689] Avg episode reward: [(0, '-54.297')] [2022-07-09 04:15:39,998][26022] Updated weights on worker 0-0, policy_version 83478 (0.00087) [2022-07-09 04:15:41,742][26022] Updated weights on worker 0-0, policy_version 83488 (0.00083) [2022-07-09 04:15:43,527][26022] Updated weights on worker 0-0, policy_version 83498 (0.00089) [2022-07-09 04:15:45,030][25689] Fps is (10 sec: 5886.4, 60 sec: 5690.3, 300 sec: 5671.9). Total num frames: 85510144. Throughput: 0: 5085.3. Samples: 85507068. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:45,030][25689] Avg episode reward: [(0, '-54.400')] [2022-07-09 04:15:45,257][26022] Updated weights on worker 0-0, policy_version 83508 (0.00086) [2022-07-09 04:15:46,961][26022] Updated weights on worker 0-0, policy_version 83518 (0.00085) [2022-07-09 04:15:49,040][26022] Updated weights on worker 0-0, policy_version 83528 (0.00087) [2022-07-09 04:15:50,117][25689] Fps is (10 sec: 5845.4, 60 sec: 5671.4, 300 sec: 5681.1). Total num frames: 85539840. Throughput: 0: 5946.4. Samples: 85541422. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:50,118][25689] Avg episode reward: [(0, '-54.522')] [2022-07-09 04:15:50,538][26022] Updated weights on worker 0-0, policy_version 83538 (0.00093) [2022-07-09 04:15:52,623][26022] Updated weights on worker 0-0, policy_version 83548 (0.00085) [2022-07-09 04:15:54,043][26022] Updated weights on worker 0-0, policy_version 83558 (0.00085) [2022-07-09 04:15:55,169][25689] Fps is (10 sec: 5655.8, 60 sec: 5637.5, 300 sec: 5670.0). Total num frames: 85567488. Throughput: 0: 5931.0. Samples: 85575426. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:15:55,169][25689] Avg episode reward: [(0, '-54.083')] [2022-07-09 04:15:56,067][26022] Updated weights on worker 0-0, policy_version 83568 (0.00086) [2022-07-09 04:15:57,772][26022] Updated weights on worker 0-0, policy_version 83578 (0.00093) [2022-07-09 04:15:59,707][26022] Updated weights on worker 0-0, policy_version 83588 (0.00086) [2022-07-09 04:16:00,170][25689] Fps is (10 sec: 5704.4, 60 sec: 5691.6, 300 sec: 5687.5). Total num frames: 85597184. Throughput: 0: 5104.4. Samples: 85592734. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:00,170][25689] Avg episode reward: [(0, '-54.541')] [2022-07-09 04:16:01,320][26022] Updated weights on worker 0-0, policy_version 83598 (0.00086) [2022-07-09 04:16:03,662][26022] Updated weights on worker 0-0, policy_version 83608 (0.00093) [2022-07-09 04:16:05,222][25689] Fps is (10 sec: 5602.4, 60 sec: 5687.6, 300 sec: 5680.6). Total num frames: 85623808. Throughput: 0: 5834.1. Samples: 85624868. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:05,222][25689] Avg episode reward: [(0, '-54.473')] [2022-07-09 04:16:05,273][26022] Updated weights on worker 0-0, policy_version 83618 (0.00085) [2022-07-09 04:16:07,228][26022] Updated weights on worker 0-0, policy_version 83628 (0.00091) [2022-07-09 04:16:08,999][26022] Updated weights on worker 0-0, policy_version 83638 (0.00087) [2022-07-09 04:16:10,291][25689] Fps is (10 sec: 5261.3, 60 sec: 5640.2, 300 sec: 5665.8). Total num frames: 85650432. Throughput: 0: 5823.1. Samples: 85658892. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:10,291][25689] Avg episode reward: [(0, '-54.517')] [2022-07-09 04:16:10,813][26022] Updated weights on worker 0-0, policy_version 83648 (0.00086) [2022-07-09 04:16:12,658][26022] Updated weights on worker 0-0, policy_version 83658 (0.00097) [2022-07-09 04:16:14,399][26022] Updated weights on worker 0-0, policy_version 83668 (0.00103) [2022-07-09 04:16:15,308][25689] Fps is (10 sec: 5685.2, 60 sec: 5681.0, 300 sec: 5676.1). Total num frames: 85681152. Throughput: 0: 4998.4. Samples: 85676090. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:15,309][25689] Avg episode reward: [(0, '-54.276')] [2022-07-09 04:16:16,265][26022] Updated weights on worker 0-0, policy_version 83678 (0.00087) [2022-07-09 04:16:18,059][26022] Updated weights on worker 0-0, policy_version 83688 (0.00090) [2022-07-09 04:16:19,702][26022] Updated weights on worker 0-0, policy_version 83698 (0.00090) [2022-07-09 04:16:20,317][25689] Fps is (10 sec: 5923.3, 60 sec: 5666.6, 300 sec: 5676.1). Total num frames: 85709824. Throughput: 0: 5835.9. Samples: 85710314. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:20,318][25689] Avg episode reward: [(0, '-54.418')] [2022-07-09 04:16:21,694][26022] Updated weights on worker 0-0, policy_version 83708 (0.00088) [2022-07-09 04:16:23,503][26022] Updated weights on worker 0-0, policy_version 83718 (0.00085) [2022-07-09 04:16:24,373][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:16:24,386][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000083723_85732352.pth [2022-07-09 04:16:24,386][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000081730_83691520.pth [2022-07-09 04:16:25,299][26022] Updated weights on worker 0-0, policy_version 83728 (0.00092) [2022-07-09 04:16:25,335][25689] Fps is (10 sec: 5718.9, 60 sec: 5653.0, 300 sec: 5676.7). Total num frames: 85738496. Throughput: 0: 5965.4. Samples: 85744854. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:25,336][25689] Avg episode reward: [(0, '-54.236')] [2022-07-09 04:16:26,999][26022] Updated weights on worker 0-0, policy_version 83738 (0.00098) [2022-07-09 04:16:28,710][26022] Updated weights on worker 0-0, policy_version 83748 (0.00090) [2022-07-09 04:16:30,395][25689] Fps is (10 sec: 5690.4, 60 sec: 5686.6, 300 sec: 5676.0). Total num frames: 85767168. Throughput: 0: 5125.7. Samples: 85761940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:30,396][25689] Avg episode reward: [(0, '-54.011')] [2022-07-09 04:16:30,636][26022] Updated weights on worker 0-0, policy_version 83758 (0.00092) [2022-07-09 04:16:32,340][26022] Updated weights on worker 0-0, policy_version 83768 (0.00084) [2022-07-09 04:16:34,171][26022] Updated weights on worker 0-0, policy_version 83778 (0.00086) [2022-07-09 04:16:35,442][25689] Fps is (10 sec: 5572.4, 60 sec: 5686.8, 300 sec: 5675.2). Total num frames: 85794816. Throughput: 0: 5952.5. Samples: 85795938. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:16:35,443][25689] Avg episode reward: [(0, '-53.506')] [2022-07-09 04:16:35,969][26022] Updated weights on worker 0-0, policy_version 83788 (0.00093) [2022-07-09 04:16:37,790][26022] Updated weights on worker 0-0, policy_version 83798 (0.00093) [2022-07-09 04:16:39,784][26022] Updated weights on worker 0-0, policy_version 83808 (0.00089) [2022-07-09 04:16:40,487][25689] Fps is (10 sec: 5580.7, 60 sec: 5667.3, 300 sec: 5677.9). Total num frames: 85823488. Throughput: 0: 5908.3. Samples: 85829480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:16:40,487][25689] Avg episode reward: [(0, '-53.746')] [2022-07-09 04:16:41,426][26022] Updated weights on worker 0-0, policy_version 83818 (0.00098) [2022-07-09 04:16:43,366][26022] Updated weights on worker 0-0, policy_version 83828 (0.00086) [2022-07-09 04:16:45,043][26022] Updated weights on worker 0-0, policy_version 83838 (0.00084) [2022-07-09 04:16:45,562][25689] Fps is (10 sec: 5666.6, 60 sec: 5650.2, 300 sec: 5667.3). Total num frames: 85852160. Throughput: 0: 5025.0. Samples: 85846492. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:16:45,562][25689] Avg episode reward: [(0, '-52.917')] [2022-07-09 04:16:46,911][26022] Updated weights on worker 0-0, policy_version 83848 (0.00095) [2022-07-09 04:16:48,599][26022] Updated weights on worker 0-0, policy_version 83858 (0.00085) [2022-07-09 04:16:50,566][26022] Updated weights on worker 0-0, policy_version 83868 (0.00084) [2022-07-09 04:16:50,662][25689] Fps is (10 sec: 5635.4, 60 sec: 5632.0, 300 sec: 5670.3). Total num frames: 85880832. Throughput: 0: 5861.7. Samples: 85880744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:16:50,663][25689] Avg episode reward: [(0, '-52.931')] [2022-07-09 04:16:52,172][26022] Updated weights on worker 0-0, policy_version 83878 (0.00086) [2022-07-09 04:16:54,136][26022] Updated weights on worker 0-0, policy_version 83888 (0.00086) [2022-07-09 04:16:55,665][25689] Fps is (10 sec: 5878.3, 60 sec: 5687.3, 300 sec: 5677.4). Total num frames: 85911552. Throughput: 0: 5900.7. Samples: 85915268. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:16:55,666][25689] Avg episode reward: [(0, '-52.957')] [2022-07-09 04:16:55,666][26022] Updated weights on worker 0-0, policy_version 83898 (0.00088) [2022-07-09 04:16:57,735][26022] Updated weights on worker 0-0, policy_version 83908 (0.00089) [2022-07-09 04:16:59,323][26022] Updated weights on worker 0-0, policy_version 83918 (0.00531) [2022-07-09 04:17:00,670][25689] Fps is (10 sec: 5832.4, 60 sec: 5653.1, 300 sec: 5681.5). Total num frames: 85939200. Throughput: 0: 5117.3. Samples: 85932764. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:00,670][25689] Avg episode reward: [(0, '-53.309')] [2022-07-09 04:17:01,100][26022] Updated weights on worker 0-0, policy_version 83928 (0.00113) [2022-07-09 04:17:03,172][26022] Updated weights on worker 0-0, policy_version 83938 (0.00088) [2022-07-09 04:17:04,967][26022] Updated weights on worker 0-0, policy_version 83948 (0.00087) [2022-07-09 04:17:05,760][25689] Fps is (10 sec: 5376.1, 60 sec: 5649.5, 300 sec: 5670.9). Total num frames: 85965824. Throughput: 0: 5882.9. Samples: 85965320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:05,761][25689] Avg episode reward: [(0, '-53.549')] [2022-07-09 04:17:06,931][26022] Updated weights on worker 0-0, policy_version 83958 (0.00093) [2022-07-09 04:17:08,741][26022] Updated weights on worker 0-0, policy_version 83968 (0.00084) [2022-07-09 04:17:10,268][26022] Updated weights on worker 0-0, policy_version 83978 (0.00089) [2022-07-09 04:17:10,871][25689] Fps is (10 sec: 5521.1, 60 sec: 5696.4, 300 sec: 5676.0). Total num frames: 85995520. Throughput: 0: 5886.9. Samples: 85999710. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:10,871][25689] Avg episode reward: [(0, '-54.708')] [2022-07-09 04:17:12,127][26022] Updated weights on worker 0-0, policy_version 83988 (0.00091) [2022-07-09 04:17:13,732][26022] Updated weights on worker 0-0, policy_version 83998 (0.00090) [2022-07-09 04:17:15,704][26022] Updated weights on worker 0-0, policy_version 84008 (0.00091) [2022-07-09 04:17:15,920][25689] Fps is (10 sec: 5946.8, 60 sec: 5693.4, 300 sec: 5682.9). Total num frames: 86026240. Throughput: 0: 5899.8. Samples: 86034766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:15,920][25689] Avg episode reward: [(0, '-55.254')] [2022-07-09 04:17:17,272][26022] Updated weights on worker 0-0, policy_version 84018 (0.00082) [2022-07-09 04:17:19,238][26022] Updated weights on worker 0-0, policy_version 84028 (0.00089) [2022-07-09 04:17:20,841][26022] Updated weights on worker 0-0, policy_version 84038 (0.00093) [2022-07-09 04:17:20,923][25689] Fps is (10 sec: 5908.2, 60 sec: 5694.0, 300 sec: 5676.8). Total num frames: 86054912. Throughput: 0: 5898.2. Samples: 86052222. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:20,924][25689] Avg episode reward: [(0, '-55.200')] [2022-07-09 04:17:22,678][26022] Updated weights on worker 0-0, policy_version 84048 (0.00078) [2022-07-09 04:17:24,363][26022] Updated weights on worker 0-0, policy_version 84058 (0.00086) [2022-07-09 04:17:25,969][25689] Fps is (10 sec: 5604.2, 60 sec: 5674.4, 300 sec: 5677.7). Total num frames: 86082560. Throughput: 0: 6025.4. Samples: 86087088. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:25,970][25689] Avg episode reward: [(0, '-54.480')] [2022-07-09 04:17:26,502][26022] Updated weights on worker 0-0, policy_version 84068 (0.00091) [2022-07-09 04:17:27,850][26022] Updated weights on worker 0-0, policy_version 84078 (0.00088) [2022-07-09 04:17:30,152][26022] Updated weights on worker 0-0, policy_version 84088 (0.00096) [2022-07-09 04:17:31,025][25689] Fps is (10 sec: 5676.9, 60 sec: 5691.7, 300 sec: 5677.3). Total num frames: 86112256. Throughput: 0: 6015.4. Samples: 86120944. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:31,025][25689] Avg episode reward: [(0, '-53.511')] [2022-07-09 04:17:31,564][26022] Updated weights on worker 0-0, policy_version 84098 (0.00094) [2022-07-09 04:17:33,703][26022] Updated weights on worker 0-0, policy_version 84108 (0.00093) [2022-07-09 04:17:35,157][26022] Updated weights on worker 0-0, policy_version 84118 (0.00092) [2022-07-09 04:17:36,042][25689] Fps is (10 sec: 5794.7, 60 sec: 5711.4, 300 sec: 5677.1). Total num frames: 86140928. Throughput: 0: 5139.0. Samples: 86138174. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:36,043][25689] Avg episode reward: [(0, '-53.014')] [2022-07-09 04:17:37,321][26022] Updated weights on worker 0-0, policy_version 84128 (0.00083) [2022-07-09 04:17:38,757][26022] Updated weights on worker 0-0, policy_version 84138 (0.00050) [2022-07-09 04:17:40,982][26022] Updated weights on worker 0-0, policy_version 84148 (0.00094) [2022-07-09 04:17:41,064][25689] Fps is (10 sec: 5508.1, 60 sec: 5679.8, 300 sec: 5674.2). Total num frames: 86167552. Throughput: 0: 5959.2. Samples: 86172242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:41,064][25689] Avg episode reward: [(0, '-53.039')] [2022-07-09 04:17:42,399][26022] Updated weights on worker 0-0, policy_version 84158 (0.00090) [2022-07-09 04:17:44,379][26022] Updated weights on worker 0-0, policy_version 84168 (0.00093) [2022-07-09 04:17:46,028][26022] Updated weights on worker 0-0, policy_version 84178 (0.00086) [2022-07-09 04:17:46,081][25689] Fps is (10 sec: 5711.9, 60 sec: 5719.0, 300 sec: 5678.5). Total num frames: 86198272. Throughput: 0: 5932.5. Samples: 86206404. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:46,082][25689] Avg episode reward: [(0, '-52.622')] [2022-07-09 04:17:47,815][26022] Updated weights on worker 0-0, policy_version 84188 (0.00089) [2022-07-09 04:17:49,722][26022] Updated weights on worker 0-0, policy_version 84198 (0.00086) [2022-07-09 04:17:51,182][25689] Fps is (10 sec: 5870.1, 60 sec: 5719.1, 300 sec: 5678.1). Total num frames: 86226944. Throughput: 0: 5094.8. Samples: 86223638. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:51,182][25689] Avg episode reward: [(0, '-52.537')] [2022-07-09 04:17:51,658][26022] Updated weights on worker 0-0, policy_version 84208 (0.00082) [2022-07-09 04:17:53,254][26022] Updated weights on worker 0-0, policy_version 84218 (0.00085) [2022-07-09 04:17:55,340][26022] Updated weights on worker 0-0, policy_version 84228 (0.00083) [2022-07-09 04:17:56,201][25689] Fps is (10 sec: 5464.1, 60 sec: 5649.8, 300 sec: 5672.0). Total num frames: 86253568. Throughput: 0: 5937.3. Samples: 86257866. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:17:56,202][25689] Avg episode reward: [(0, '-53.361')] [2022-07-09 04:17:56,629][26022] Updated weights on worker 0-0, policy_version 84238 (0.00094) [2022-07-09 04:17:59,041][26022] Updated weights on worker 0-0, policy_version 84248 (0.00097) [2022-07-09 04:18:00,203][26022] Updated weights on worker 0-0, policy_version 84258 (0.00089) [2022-07-09 04:18:01,206][25689] Fps is (10 sec: 5720.3, 60 sec: 5700.5, 300 sec: 5686.0). Total num frames: 86284288. Throughput: 0: 5939.7. Samples: 86291882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:18:01,207][25689] Avg episode reward: [(0, '-53.785')] [2022-07-09 04:18:02,916][26022] Updated weights on worker 0-0, policy_version 84268 (0.00090) [2022-07-09 04:18:04,201][26022] Updated weights on worker 0-0, policy_version 84278 (0.00077) [2022-07-09 04:18:06,244][25689] Fps is (10 sec: 5608.2, 60 sec: 5688.6, 300 sec: 5672.7). Total num frames: 86309888. Throughput: 0: 4979.5. Samples: 86306802. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:18:06,244][25689] Avg episode reward: [(0, '-53.696')] [2022-07-09 04:18:06,367][26022] Updated weights on worker 0-0, policy_version 84288 (0.00048) [2022-07-09 04:18:07,987][26022] Updated weights on worker 0-0, policy_version 84298 (0.00107) [2022-07-09 04:18:10,131][26022] Updated weights on worker 0-0, policy_version 84308 (0.00085) [2022-07-09 04:18:11,288][25689] Fps is (10 sec: 5383.2, 60 sec: 5677.9, 300 sec: 5673.2). Total num frames: 86338560. Throughput: 0: 5814.4. Samples: 86340544. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:18:11,288][25689] Avg episode reward: [(0, '-52.977')] [2022-07-09 04:18:11,692][26022] Updated weights on worker 0-0, policy_version 84318 (0.00086) [2022-07-09 04:18:13,595][26022] Updated weights on worker 0-0, policy_version 84328 (0.00090) [2022-07-09 04:18:15,213][26022] Updated weights on worker 0-0, policy_version 84338 (0.00085) [2022-07-09 04:18:16,293][25689] Fps is (10 sec: 5706.1, 60 sec: 5648.0, 300 sec: 5673.5). Total num frames: 86367232. Throughput: 0: 5815.9. Samples: 86374720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 04:18:16,294][25689] Avg episode reward: [(0, '-53.091')] [2022-07-09 04:18:17,122][26022] Updated weights on worker 0-0, policy_version 84348 (0.00082) [2022-07-09 04:18:18,876][26022] Updated weights on worker 0-0, policy_version 84358 (0.00093) [2022-07-09 04:18:20,752][26022] Updated weights on worker 0-0, policy_version 84368 (0.00130) [2022-07-09 04:18:21,299][25689] Fps is (10 sec: 5728.2, 60 sec: 5647.9, 300 sec: 5673.6). Total num frames: 86395904. Throughput: 0: 4981.4. Samples: 86391970. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:21,299][25689] Avg episode reward: [(0, '-53.205')] [2022-07-09 04:18:22,494][26022] Updated weights on worker 0-0, policy_version 84378 (0.00092) [2022-07-09 04:18:24,190][26022] Updated weights on worker 0-0, policy_version 84388 (0.00086) [2022-07-09 04:18:24,697][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:18:24,726][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000084390_86415360.pth [2022-07-09 04:18:24,727][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000082394_84371456.pth [2022-07-09 04:18:26,139][26022] Updated weights on worker 0-0, policy_version 84398 (0.00089) [2022-07-09 04:18:26,306][25689] Fps is (10 sec: 5624.8, 60 sec: 5651.5, 300 sec: 5671.0). Total num frames: 86423552. Throughput: 0: 5970.5. Samples: 86426584. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:26,307][25689] Avg episode reward: [(0, '-52.699')] [2022-07-09 04:18:27,774][26022] Updated weights on worker 0-0, policy_version 84408 (0.00087) [2022-07-09 04:18:29,804][26022] Updated weights on worker 0-0, policy_version 84418 (0.00094) [2022-07-09 04:18:31,371][25689] Fps is (10 sec: 5693.3, 60 sec: 5650.6, 300 sec: 5671.2). Total num frames: 86453248. Throughput: 0: 5978.9. Samples: 86460618. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:31,371][25689] Avg episode reward: [(0, '-53.042')] [2022-07-09 04:18:31,447][26022] Updated weights on worker 0-0, policy_version 84428 (0.00087) [2022-07-09 04:18:33,244][26022] Updated weights on worker 0-0, policy_version 84438 (0.00085) [2022-07-09 04:18:35,234][26022] Updated weights on worker 0-0, policy_version 84448 (0.00087) [2022-07-09 04:18:36,376][25689] Fps is (10 sec: 5796.2, 60 sec: 5651.7, 300 sec: 5674.8). Total num frames: 86481920. Throughput: 0: 5143.0. Samples: 86478006. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:36,377][25689] Avg episode reward: [(0, '-53.184')] [2022-07-09 04:18:36,757][26022] Updated weights on worker 0-0, policy_version 84458 (0.00079) [2022-07-09 04:18:38,705][26022] Updated weights on worker 0-0, policy_version 84468 (0.00085) [2022-07-09 04:18:40,407][26022] Updated weights on worker 0-0, policy_version 84478 (0.00083) [2022-07-09 04:18:41,405][25689] Fps is (10 sec: 5816.8, 60 sec: 5702.0, 300 sec: 5674.4). Total num frames: 86511616. Throughput: 0: 6002.2. Samples: 86512652. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:41,406][25689] Avg episode reward: [(0, '-52.982')] [2022-07-09 04:18:42,103][26022] Updated weights on worker 0-0, policy_version 84488 (0.00092) [2022-07-09 04:18:43,912][26022] Updated weights on worker 0-0, policy_version 84498 (0.00088) [2022-07-09 04:18:45,741][26022] Updated weights on worker 0-0, policy_version 84508 (0.00095) [2022-07-09 04:18:46,409][25689] Fps is (10 sec: 5715.9, 60 sec: 5652.4, 300 sec: 5668.6). Total num frames: 86539264. Throughput: 0: 5983.3. Samples: 86546862. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:46,409][25689] Avg episode reward: [(0, '-52.980')] [2022-07-09 04:18:47,506][26022] Updated weights on worker 0-0, policy_version 84518 (0.00087) [2022-07-09 04:18:49,557][26022] Updated weights on worker 0-0, policy_version 84528 (0.00091) [2022-07-09 04:18:51,108][26022] Updated weights on worker 0-0, policy_version 84538 (0.00096) [2022-07-09 04:18:51,543][25689] Fps is (10 sec: 5656.5, 60 sec: 5666.1, 300 sec: 5674.3). Total num frames: 86568960. Throughput: 0: 5129.5. Samples: 86564090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:51,543][25689] Avg episode reward: [(0, '-52.510')] [2022-07-09 04:18:53,107][26022] Updated weights on worker 0-0, policy_version 84548 (0.00084) [2022-07-09 04:18:54,473][26022] Updated weights on worker 0-0, policy_version 84558 (0.00088) [2022-07-09 04:18:56,535][26022] Updated weights on worker 0-0, policy_version 84568 (0.00087) [2022-07-09 04:18:56,590][25689] Fps is (10 sec: 5733.0, 60 sec: 5697.5, 300 sec: 5670.3). Total num frames: 86597632. Throughput: 0: 5953.5. Samples: 86598346. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:18:56,590][25689] Avg episode reward: [(0, '-52.401')] [2022-07-09 04:18:58,525][26022] Updated weights on worker 0-0, policy_version 84578 (0.00086) [2022-07-09 04:18:59,837][26022] Updated weights on worker 0-0, policy_version 84588 (0.00093) [2022-07-09 04:19:01,610][25689] Fps is (10 sec: 5594.7, 60 sec: 5645.2, 300 sec: 5680.7). Total num frames: 86625280. Throughput: 0: 5951.8. Samples: 86632904. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:01,610][25689] Avg episode reward: [(0, '-52.261')] [2022-07-09 04:19:02,289][26022] Updated weights on worker 0-0, policy_version 84598 (0.00088) [2022-07-09 04:19:03,898][26022] Updated weights on worker 0-0, policy_version 84608 (0.00096) [2022-07-09 04:19:05,836][26022] Updated weights on worker 0-0, policy_version 84618 (0.00086) [2022-07-09 04:19:06,646][25689] Fps is (10 sec: 5600.6, 60 sec: 5696.2, 300 sec: 5678.4). Total num frames: 86653952. Throughput: 0: 4995.2. Samples: 86647950. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:06,648][25689] Avg episode reward: [(0, '-52.488')] [2022-07-09 04:19:07,772][26022] Updated weights on worker 0-0, policy_version 84628 (0.00092) [2022-07-09 04:19:09,514][26022] Updated weights on worker 0-0, policy_version 84638 (0.00651) [2022-07-09 04:19:11,249][26022] Updated weights on worker 0-0, policy_version 84648 (0.00092) [2022-07-09 04:19:11,702][25689] Fps is (10 sec: 5783.3, 60 sec: 5712.0, 300 sec: 5678.1). Total num frames: 86683648. Throughput: 0: 5850.1. Samples: 86682024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:11,704][25689] Avg episode reward: [(0, '-53.215')] [2022-07-09 04:19:13,146][26022] Updated weights on worker 0-0, policy_version 84658 (0.00082) [2022-07-09 04:19:14,590][26022] Updated weights on worker 0-0, policy_version 84668 (0.00092) [2022-07-09 04:19:16,725][25689] Fps is (10 sec: 5485.9, 60 sec: 5659.5, 300 sec: 5668.3). Total num frames: 86709248. Throughput: 0: 5858.6. Samples: 86716312. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:16,727][25689] Avg episode reward: [(0, '-53.075')] [2022-07-09 04:19:16,810][26022] Updated weights on worker 0-0, policy_version 84678 (0.00096) [2022-07-09 04:19:18,285][26022] Updated weights on worker 0-0, policy_version 84688 (0.00087) [2022-07-09 04:19:20,282][26022] Updated weights on worker 0-0, policy_version 84698 (0.00091) [2022-07-09 04:19:21,743][25689] Fps is (10 sec: 5507.3, 60 sec: 5675.3, 300 sec: 5676.2). Total num frames: 86738944. Throughput: 0: 4993.7. Samples: 86733442. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:21,743][25689] Avg episode reward: [(0, '-53.606')] [2022-07-09 04:19:22,232][26022] Updated weights on worker 0-0, policy_version 84708 (0.00092) [2022-07-09 04:19:23,864][26022] Updated weights on worker 0-0, policy_version 84718 (0.00094) [2022-07-09 04:19:25,818][26022] Updated weights on worker 0-0, policy_version 84728 (0.00091) [2022-07-09 04:19:26,757][25689] Fps is (10 sec: 5818.1, 60 sec: 5691.6, 300 sec: 5670.0). Total num frames: 86767616. Throughput: 0: 5953.5. Samples: 86767684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:26,759][25689] Avg episode reward: [(0, '-53.944')] [2022-07-09 04:19:27,524][26022] Updated weights on worker 0-0, policy_version 84738 (0.00100) [2022-07-09 04:19:29,224][26022] Updated weights on worker 0-0, policy_version 84748 (0.00094) [2022-07-09 04:19:31,139][26022] Updated weights on worker 0-0, policy_version 84758 (0.00087) [2022-07-09 04:19:31,858][25689] Fps is (10 sec: 5669.0, 60 sec: 5671.3, 300 sec: 5675.3). Total num frames: 86796288. Throughput: 0: 5941.0. Samples: 86801770. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:31,859][25689] Avg episode reward: [(0, '-53.882')] [2022-07-09 04:19:32,740][26022] Updated weights on worker 0-0, policy_version 84768 (0.00090) [2022-07-09 04:19:34,837][26022] Updated weights on worker 0-0, policy_version 84778 (0.00083) [2022-07-09 04:19:36,432][26022] Updated weights on worker 0-0, policy_version 84788 (0.00078) [2022-07-09 04:19:36,915][25689] Fps is (10 sec: 5645.6, 60 sec: 5666.5, 300 sec: 5667.9). Total num frames: 86824960. Throughput: 0: 5085.8. Samples: 86818992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:36,917][25689] Avg episode reward: [(0, '-53.315')] [2022-07-09 04:19:38,184][26022] Updated weights on worker 0-0, policy_version 84798 (0.00085) [2022-07-09 04:19:39,992][26022] Updated weights on worker 0-0, policy_version 84808 (0.00079) [2022-07-09 04:19:41,839][26022] Updated weights on worker 0-0, policy_version 84818 (0.00082) [2022-07-09 04:19:41,937][25689] Fps is (10 sec: 5689.7, 60 sec: 5650.2, 300 sec: 5674.9). Total num frames: 86853632. Throughput: 0: 5921.5. Samples: 86853020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:41,939][25689] Avg episode reward: [(0, '-53.331')] [2022-07-09 04:19:43,593][26022] Updated weights on worker 0-0, policy_version 84828 (0.00106) [2022-07-09 04:19:45,604][26022] Updated weights on worker 0-0, policy_version 84838 (0.00087) [2022-07-09 04:19:47,037][25689] Fps is (10 sec: 5665.0, 60 sec: 5658.1, 300 sec: 5667.4). Total num frames: 86882304. Throughput: 0: 5898.9. Samples: 86887312. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:47,039][25689] Avg episode reward: [(0, '-54.450')] [2022-07-09 04:19:47,098][26022] Updated weights on worker 0-0, policy_version 84848 (0.00085) [2022-07-09 04:19:49,050][26022] Updated weights on worker 0-0, policy_version 84858 (0.00086) [2022-07-09 04:19:50,740][26022] Updated weights on worker 0-0, policy_version 84868 (0.00094) [2022-07-09 04:19:52,096][25689] Fps is (10 sec: 5745.3, 60 sec: 5665.1, 300 sec: 5667.2). Total num frames: 86912000. Throughput: 0: 5078.6. Samples: 86904550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:52,096][25689] Avg episode reward: [(0, '-54.185')] [2022-07-09 04:19:52,686][26022] Updated weights on worker 0-0, policy_version 84878 (0.00086) [2022-07-09 04:19:54,479][26022] Updated weights on worker 0-0, policy_version 84888 (0.00083) [2022-07-09 04:19:56,135][26022] Updated weights on worker 0-0, policy_version 84898 (0.00098) [2022-07-09 04:19:57,099][25689] Fps is (10 sec: 5800.9, 60 sec: 5669.2, 300 sec: 5674.7). Total num frames: 86940672. Throughput: 0: 5945.0. Samples: 86938988. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:19:57,100][25689] Avg episode reward: [(0, '-54.197')] [2022-07-09 04:19:58,014][26022] Updated weights on worker 0-0, policy_version 84908 (0.00085) [2022-07-09 04:19:59,776][26022] Updated weights on worker 0-0, policy_version 84918 (0.00093) [2022-07-09 04:20:01,515][26022] Updated weights on worker 0-0, policy_version 84928 (0.00098) [2022-07-09 04:20:02,114][25689] Fps is (10 sec: 5621.7, 60 sec: 5669.7, 300 sec: 5678.1). Total num frames: 86968320. Throughput: 0: 5966.3. Samples: 86973406. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 04:20:02,115][25689] Avg episode reward: [(0, '-53.962')] [2022-07-09 04:20:03,762][26022] Updated weights on worker 0-0, policy_version 84938 (0.00089) [2022-07-09 04:20:05,600][26022] Updated weights on worker 0-0, policy_version 84948 (0.00090) [2022-07-09 04:20:07,122][25689] Fps is (10 sec: 5414.9, 60 sec: 5638.4, 300 sec: 5669.6). Total num frames: 86994944. Throughput: 0: 5035.5. Samples: 86988448. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:07,123][25689] Avg episode reward: [(0, '-54.091')] [2022-07-09 04:20:07,242][26022] Updated weights on worker 0-0, policy_version 84958 (0.00092) [2022-07-09 04:20:09,138][26022] Updated weights on worker 0-0, policy_version 84968 (0.00092) [2022-07-09 04:20:10,739][26022] Updated weights on worker 0-0, policy_version 84978 (0.00095) [2022-07-09 04:20:12,199][25689] Fps is (10 sec: 5584.7, 60 sec: 5636.5, 300 sec: 5673.3). Total num frames: 87024640. Throughput: 0: 5885.5. Samples: 87022866. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:12,199][25689] Avg episode reward: [(0, '-53.874')] [2022-07-09 04:20:12,735][26022] Updated weights on worker 0-0, policy_version 84988 (0.00085) [2022-07-09 04:20:14,338][26022] Updated weights on worker 0-0, policy_version 84998 (0.00090) [2022-07-09 04:20:16,262][26022] Updated weights on worker 0-0, policy_version 85008 (0.00086) [2022-07-09 04:20:17,219][25689] Fps is (10 sec: 5881.9, 60 sec: 5704.5, 300 sec: 5673.6). Total num frames: 87054336. Throughput: 0: 5867.4. Samples: 87057042. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:17,220][25689] Avg episode reward: [(0, '-53.570')] [2022-07-09 04:20:18,039][26022] Updated weights on worker 0-0, policy_version 85018 (0.00093) [2022-07-09 04:20:19,792][26022] Updated weights on worker 0-0, policy_version 85028 (0.00090) [2022-07-09 04:20:21,703][26022] Updated weights on worker 0-0, policy_version 85038 (0.00089) [2022-07-09 04:20:22,256][25689] Fps is (10 sec: 5701.5, 60 sec: 5668.8, 300 sec: 5667.0). Total num frames: 87081984. Throughput: 0: 5855.1. Samples: 87091340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:22,257][25689] Avg episode reward: [(0, '-53.688')] [2022-07-09 04:20:23,289][26022] Updated weights on worker 0-0, policy_version 85048 (0.00090) [2022-07-09 04:20:24,846][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:20:24,861][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000085056_87097344.pth [2022-07-09 04:20:24,861][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000083059_85052416.pth [2022-07-09 04:20:25,298][26022] Updated weights on worker 0-0, policy_version 85058 (0.00091) [2022-07-09 04:20:27,095][26022] Updated weights on worker 0-0, policy_version 85068 (0.00086) [2022-07-09 04:20:27,269][25689] Fps is (10 sec: 5604.0, 60 sec: 5669.0, 300 sec: 5674.8). Total num frames: 87110656. Throughput: 0: 5958.0. Samples: 87108486. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:27,270][25689] Avg episode reward: [(0, '-53.464')] [2022-07-09 04:20:28,734][26022] Updated weights on worker 0-0, policy_version 85078 (0.00089) [2022-07-09 04:20:30,777][26022] Updated weights on worker 0-0, policy_version 85088 (0.00088) [2022-07-09 04:20:32,350][25689] Fps is (10 sec: 5782.7, 60 sec: 5687.8, 300 sec: 5681.0). Total num frames: 87140352. Throughput: 0: 5942.2. Samples: 87142608. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:32,354][26022] Updated weights on worker 0-0, policy_version 85098 (0.00085) [2022-07-09 04:20:32,359][25689] Avg episode reward: [(0, '-53.765')] [2022-07-09 04:20:34,322][26022] Updated weights on worker 0-0, policy_version 85108 (0.00089) [2022-07-09 04:20:36,003][26022] Updated weights on worker 0-0, policy_version 85118 (0.00112) [2022-07-09 04:20:37,371][25689] Fps is (10 sec: 5676.5, 60 sec: 5674.1, 300 sec: 5674.1). Total num frames: 87168000. Throughput: 0: 5949.1. Samples: 87176928. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:37,373][25689] Avg episode reward: [(0, '-53.916')] [2022-07-09 04:20:37,716][26022] Updated weights on worker 0-0, policy_version 85128 (0.00612) [2022-07-09 04:20:39,603][26022] Updated weights on worker 0-0, policy_version 85138 (0.00087) [2022-07-09 04:20:41,664][26022] Updated weights on worker 0-0, policy_version 85148 (0.00084) [2022-07-09 04:20:42,402][25689] Fps is (10 sec: 5602.8, 60 sec: 5673.3, 300 sec: 5671.4). Total num frames: 87196672. Throughput: 0: 5107.3. Samples: 87194230. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:42,404][25689] Avg episode reward: [(0, '-54.091')] [2022-07-09 04:20:43,072][26022] Updated weights on worker 0-0, policy_version 85158 (0.00090) [2022-07-09 04:20:45,216][26022] Updated weights on worker 0-0, policy_version 85168 (0.00087) [2022-07-09 04:20:46,416][26022] Updated weights on worker 0-0, policy_version 85178 (0.00086) [2022-07-09 04:20:47,426][25689] Fps is (10 sec: 5703.0, 60 sec: 5680.5, 300 sec: 5669.2). Total num frames: 87225344. Throughput: 0: 5946.8. Samples: 87228356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:47,427][25689] Avg episode reward: [(0, '-53.675')] [2022-07-09 04:20:48,818][26022] Updated weights on worker 0-0, policy_version 85188 (0.00085) [2022-07-09 04:20:50,135][26022] Updated weights on worker 0-0, policy_version 85198 (0.00080) [2022-07-09 04:20:52,252][26022] Updated weights on worker 0-0, policy_version 85208 (0.00093) [2022-07-09 04:20:52,481][25689] Fps is (10 sec: 5791.4, 60 sec: 5680.9, 300 sec: 5676.0). Total num frames: 87255040. Throughput: 0: 5967.8. Samples: 87262744. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:52,483][25689] Avg episode reward: [(0, '-54.423')] [2022-07-09 04:20:53,852][26022] Updated weights on worker 0-0, policy_version 85218 (0.00092) [2022-07-09 04:20:55,726][26022] Updated weights on worker 0-0, policy_version 85228 (0.00082) [2022-07-09 04:20:57,543][25689] Fps is (10 sec: 5668.5, 60 sec: 5658.4, 300 sec: 5668.0). Total num frames: 87282688. Throughput: 0: 5121.9. Samples: 87280244. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:20:57,544][25689] Avg episode reward: [(0, '-55.018')] [2022-07-09 04:20:57,554][26022] Updated weights on worker 0-0, policy_version 85238 (0.00092) [2022-07-09 04:20:59,272][26022] Updated weights on worker 0-0, policy_version 85248 (0.00092) [2022-07-09 04:21:00,952][26022] Updated weights on worker 0-0, policy_version 85258 (0.00087) [2022-07-09 04:21:02,563][25689] Fps is (10 sec: 5383.0, 60 sec: 5641.0, 300 sec: 5668.6). Total num frames: 87309312. Throughput: 0: 5974.8. Samples: 87314684. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:02,563][25689] Avg episode reward: [(0, '-54.446')] [2022-07-09 04:21:03,315][26022] Updated weights on worker 0-0, policy_version 85268 (0.00085) [2022-07-09 04:21:04,698][26022] Updated weights on worker 0-0, policy_version 85278 (0.00092) [2022-07-09 04:21:06,884][26022] Updated weights on worker 0-0, policy_version 85288 (0.00091) [2022-07-09 04:21:07,570][25689] Fps is (10 sec: 5718.8, 60 sec: 5708.8, 300 sec: 5683.5). Total num frames: 87340032. Throughput: 0: 5892.7. Samples: 87347056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:07,570][25689] Avg episode reward: [(0, '-54.629')] [2022-07-09 04:21:08,318][26022] Updated weights on worker 0-0, policy_version 85298 (0.00092) [2022-07-09 04:21:10,352][26022] Updated weights on worker 0-0, policy_version 85308 (0.00089) [2022-07-09 04:21:12,471][26022] Updated weights on worker 0-0, policy_version 85318 (0.01104) [2022-07-09 04:21:12,630][25689] Fps is (10 sec: 5696.1, 60 sec: 5659.6, 300 sec: 5669.0). Total num frames: 87366656. Throughput: 0: 5025.4. Samples: 87364002. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:12,630][25689] Avg episode reward: [(0, '-54.801')] [2022-07-09 04:21:13,988][26022] Updated weights on worker 0-0, policy_version 85328 (0.00092) [2022-07-09 04:21:15,821][26022] Updated weights on worker 0-0, policy_version 85338 (0.00087) [2022-07-09 04:21:17,661][25689] Fps is (10 sec: 5581.0, 60 sec: 5658.6, 300 sec: 5672.0). Total num frames: 87396352. Throughput: 0: 5861.2. Samples: 87398162. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:17,662][25689] Avg episode reward: [(0, '-55.061')] [2022-07-09 04:21:17,668][26022] Updated weights on worker 0-0, policy_version 85348 (0.00091) [2022-07-09 04:21:19,239][26022] Updated weights on worker 0-0, policy_version 85358 (0.00096) [2022-07-09 04:21:21,296][26022] Updated weights on worker 0-0, policy_version 85368 (0.00618) [2022-07-09 04:21:22,684][25689] Fps is (10 sec: 5907.6, 60 sec: 5693.9, 300 sec: 5675.3). Total num frames: 87426048. Throughput: 0: 5874.0. Samples: 87432874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:22,684][25689] Avg episode reward: [(0, '-54.772')] [2022-07-09 04:21:22,823][26022] Updated weights on worker 0-0, policy_version 85378 (0.00085) [2022-07-09 04:21:24,652][26022] Updated weights on worker 0-0, policy_version 85388 (0.00092) [2022-07-09 04:21:26,493][26022] Updated weights on worker 0-0, policy_version 85398 (0.00096) [2022-07-09 04:21:27,729][25689] Fps is (10 sec: 5797.6, 60 sec: 5690.8, 300 sec: 5675.6). Total num frames: 87454720. Throughput: 0: 5107.8. Samples: 87450024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:27,729][25689] Avg episode reward: [(0, '-54.840')] [2022-07-09 04:21:28,268][26022] Updated weights on worker 0-0, policy_version 85408 (0.00085) [2022-07-09 04:21:30,097][26022] Updated weights on worker 0-0, policy_version 85418 (0.00083) [2022-07-09 04:21:32,154][26022] Updated weights on worker 0-0, policy_version 85428 (0.00559) [2022-07-09 04:21:32,803][25689] Fps is (10 sec: 5666.9, 60 sec: 5674.5, 300 sec: 5678.5). Total num frames: 87483392. Throughput: 0: 5957.6. Samples: 87484182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:32,803][25689] Avg episode reward: [(0, '-54.807')] [2022-07-09 04:21:33,523][26022] Updated weights on worker 0-0, policy_version 85438 (0.00089) [2022-07-09 04:21:35,623][26022] Updated weights on worker 0-0, policy_version 85448 (0.00085) [2022-07-09 04:21:36,960][26022] Updated weights on worker 0-0, policy_version 85458 (0.00082) [2022-07-09 04:21:37,816][25689] Fps is (10 sec: 5684.7, 60 sec: 5692.2, 300 sec: 5679.1). Total num frames: 87512064. Throughput: 0: 5986.4. Samples: 87518818. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:37,817][25689] Avg episode reward: [(0, '-54.426')] [2022-07-09 04:21:39,306][26022] Updated weights on worker 0-0, policy_version 85468 (0.00084) [2022-07-09 04:21:40,863][26022] Updated weights on worker 0-0, policy_version 85478 (0.00098) [2022-07-09 04:21:42,687][26022] Updated weights on worker 0-0, policy_version 85488 (0.00087) [2022-07-09 04:21:42,830][25689] Fps is (10 sec: 5718.8, 60 sec: 5693.8, 300 sec: 5680.3). Total num frames: 87540736. Throughput: 0: 5115.8. Samples: 87535940. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 04:21:42,830][25689] Avg episode reward: [(0, '-54.046')] [2022-07-09 04:21:44,383][26022] Updated weights on worker 0-0, policy_version 85498 (0.00078) [2022-07-09 04:21:46,037][26022] Updated weights on worker 0-0, policy_version 85508 (0.00091) [2022-07-09 04:21:47,842][25689] Fps is (10 sec: 5719.8, 60 sec: 5695.0, 300 sec: 5682.0). Total num frames: 87569408. Throughput: 0: 5996.3. Samples: 87570628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:21:47,842][25689] Avg episode reward: [(0, '-53.968')] [2022-07-09 04:21:47,981][26022] Updated weights on worker 0-0, policy_version 85518 (0.00080) [2022-07-09 04:21:49,803][26022] Updated weights on worker 0-0, policy_version 85528 (0.00085) [2022-07-09 04:21:51,630][26022] Updated weights on worker 0-0, policy_version 85538 (0.00090) [2022-07-09 04:21:52,942][25689] Fps is (10 sec: 5771.9, 60 sec: 5690.6, 300 sec: 5676.7). Total num frames: 87599104. Throughput: 0: 5998.4. Samples: 87604988. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:21:52,943][25689] Avg episode reward: [(0, '-53.505')] [2022-07-09 04:21:53,291][26022] Updated weights on worker 0-0, policy_version 85548 (0.00083) [2022-07-09 04:21:55,046][26022] Updated weights on worker 0-0, policy_version 85558 (0.00092) [2022-07-09 04:21:56,919][26022] Updated weights on worker 0-0, policy_version 85568 (0.00090) [2022-07-09 04:21:57,948][25689] Fps is (10 sec: 5674.1, 60 sec: 5695.9, 300 sec: 5676.7). Total num frames: 87626752. Throughput: 0: 5142.8. Samples: 87622352. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:21:57,949][25689] Avg episode reward: [(0, '-53.475')] [2022-07-09 04:21:58,557][26022] Updated weights on worker 0-0, policy_version 85578 (0.00087) [2022-07-09 04:22:00,463][26022] Updated weights on worker 0-0, policy_version 85588 (0.00079) [2022-07-09 04:22:02,492][26022] Updated weights on worker 0-0, policy_version 85598 (0.00083) [2022-07-09 04:22:02,960][25689] Fps is (10 sec: 5520.1, 60 sec: 5713.7, 300 sec: 5681.6). Total num frames: 87654400. Throughput: 0: 6004.1. Samples: 87656800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:02,962][25689] Avg episode reward: [(0, '-53.312')] [2022-07-09 04:22:04,248][26022] Updated weights on worker 0-0, policy_version 85608 (0.00091) [2022-07-09 04:22:06,253][26022] Updated weights on worker 0-0, policy_version 85618 (0.00090) [2022-07-09 04:22:07,960][26022] Updated weights on worker 0-0, policy_version 85628 (0.00082) [2022-07-09 04:22:07,971][25689] Fps is (10 sec: 5618.9, 60 sec: 5679.3, 300 sec: 5680.1). Total num frames: 87683072. Throughput: 0: 5898.5. Samples: 87689362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:07,972][25689] Avg episode reward: [(0, '-54.048')] [2022-07-09 04:22:09,705][26022] Updated weights on worker 0-0, policy_version 85638 (0.00084) [2022-07-09 04:22:11,668][26022] Updated weights on worker 0-0, policy_version 85648 (0.00082) [2022-07-09 04:22:13,037][25689] Fps is (10 sec: 5792.2, 60 sec: 5729.7, 300 sec: 5676.3). Total num frames: 87712768. Throughput: 0: 5050.7. Samples: 87706478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:13,039][25689] Avg episode reward: [(0, '-53.908')] [2022-07-09 04:22:13,296][26022] Updated weights on worker 0-0, policy_version 85658 (0.00101) [2022-07-09 04:22:15,153][26022] Updated weights on worker 0-0, policy_version 85668 (0.00087) [2022-07-09 04:22:16,970][26022] Updated weights on worker 0-0, policy_version 85678 (0.00090) [2022-07-09 04:22:18,105][25689] Fps is (10 sec: 5658.7, 60 sec: 5692.3, 300 sec: 5671.6). Total num frames: 87740416. Throughput: 0: 5884.3. Samples: 87740962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:18,106][25689] Avg episode reward: [(0, '-53.914')] [2022-07-09 04:22:18,598][26022] Updated weights on worker 0-0, policy_version 85688 (0.00093) [2022-07-09 04:22:20,559][26022] Updated weights on worker 0-0, policy_version 85698 (0.00091) [2022-07-09 04:22:22,178][26022] Updated weights on worker 0-0, policy_version 85708 (0.00087) [2022-07-09 04:22:23,122][25689] Fps is (10 sec: 5584.3, 60 sec: 5675.8, 300 sec: 5675.6). Total num frames: 87769088. Throughput: 0: 5883.3. Samples: 87775422. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:23,123][25689] Avg episode reward: [(0, '-54.093')] [2022-07-09 04:22:24,007][26022] Updated weights on worker 0-0, policy_version 85718 (0.00086) [2022-07-09 04:22:25,154][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:22:25,175][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000085724_87781376.pth [2022-07-09 04:22:25,176][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000083723_85732352.pth [2022-07-09 04:22:25,880][26022] Updated weights on worker 0-0, policy_version 85728 (0.00092) [2022-07-09 04:22:27,495][26022] Updated weights on worker 0-0, policy_version 85738 (0.00088) [2022-07-09 04:22:28,178][25689] Fps is (10 sec: 5794.9, 60 sec: 5691.8, 300 sec: 5675.6). Total num frames: 87798784. Throughput: 0: 5115.7. Samples: 87792730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:28,178][25689] Avg episode reward: [(0, '-54.336')] [2022-07-09 04:22:29,580][26022] Updated weights on worker 0-0, policy_version 85748 (0.00088) [2022-07-09 04:22:31,156][26022] Updated weights on worker 0-0, policy_version 85758 (0.00080) [2022-07-09 04:22:32,942][26022] Updated weights on worker 0-0, policy_version 85768 (0.00086) [2022-07-09 04:22:33,223][25689] Fps is (10 sec: 5879.7, 60 sec: 5711.4, 300 sec: 5678.5). Total num frames: 87828480. Throughput: 0: 5977.5. Samples: 87827144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:33,224][25689] Avg episode reward: [(0, '-53.831')] [2022-07-09 04:22:35,143][26022] Updated weights on worker 0-0, policy_version 85778 (0.00085) [2022-07-09 04:22:36,471][26022] Updated weights on worker 0-0, policy_version 85788 (0.00085) [2022-07-09 04:22:38,257][25689] Fps is (10 sec: 5486.2, 60 sec: 5658.7, 300 sec: 5674.9). Total num frames: 87854080. Throughput: 0: 5965.1. Samples: 87861168. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:38,257][25689] Avg episode reward: [(0, '-53.410')] [2022-07-09 04:22:38,629][26022] Updated weights on worker 0-0, policy_version 85798 (0.00089) [2022-07-09 04:22:40,062][26022] Updated weights on worker 0-0, policy_version 85808 (0.00092) [2022-07-09 04:22:42,027][26022] Updated weights on worker 0-0, policy_version 85818 (0.00097) [2022-07-09 04:22:43,332][25689] Fps is (10 sec: 5672.7, 60 sec: 5703.8, 300 sec: 5677.2). Total num frames: 87885824. Throughput: 0: 5093.6. Samples: 87878364. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:43,332][25689] Avg episode reward: [(0, '-53.597')] [2022-07-09 04:22:43,765][26022] Updated weights on worker 0-0, policy_version 85828 (0.00085) [2022-07-09 04:22:45,511][26022] Updated weights on worker 0-0, policy_version 85838 (0.00093) [2022-07-09 04:22:47,427][26022] Updated weights on worker 0-0, policy_version 85848 (0.00090) [2022-07-09 04:22:48,431][25689] Fps is (10 sec: 5837.7, 60 sec: 5678.7, 300 sec: 5673.8). Total num frames: 87913472. Throughput: 0: 5919.1. Samples: 87912610. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:48,431][25689] Avg episode reward: [(0, '-53.778')] [2022-07-09 04:22:49,165][26022] Updated weights on worker 0-0, policy_version 85858 (0.00050) [2022-07-09 04:22:51,132][26022] Updated weights on worker 0-0, policy_version 85868 (0.00096) [2022-07-09 04:22:52,646][26022] Updated weights on worker 0-0, policy_version 85878 (0.00080) [2022-07-09 04:22:53,541][25689] Fps is (10 sec: 5616.9, 60 sec: 5677.8, 300 sec: 5682.4). Total num frames: 87943168. Throughput: 0: 5901.7. Samples: 87947054. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:53,541][25689] Avg episode reward: [(0, '-53.677')] [2022-07-09 04:22:54,454][26022] Updated weights on worker 0-0, policy_version 85888 (0.00083) [2022-07-09 04:22:56,147][26022] Updated weights on worker 0-0, policy_version 85898 (0.00088) [2022-07-09 04:22:58,001][26022] Updated weights on worker 0-0, policy_version 85908 (0.00087) [2022-07-09 04:22:58,598][25689] Fps is (10 sec: 5841.4, 60 sec: 5706.7, 300 sec: 5678.0). Total num frames: 87972864. Throughput: 0: 5934.9. Samples: 87981892. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:22:58,598][25689] Avg episode reward: [(0, '-53.480')] [2022-07-09 04:22:59,863][26022] Updated weights on worker 0-0, policy_version 85918 (0.00094) [2022-07-09 04:23:01,511][26022] Updated weights on worker 0-0, policy_version 85928 (0.00089) [2022-07-09 04:23:03,647][25689] Fps is (10 sec: 5471.6, 60 sec: 5669.4, 300 sec: 5677.7). Total num frames: 87998464. Throughput: 0: 5945.4. Samples: 87999146. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:23:03,648][25689] Avg episode reward: [(0, '-53.400')] [2022-07-09 04:23:03,806][26022] Updated weights on worker 0-0, policy_version 85938 (0.00084) [2022-07-09 04:23:05,520][26022] Updated weights on worker 0-0, policy_version 85948 (0.00089) [2022-07-09 04:23:07,387][26022] Updated weights on worker 0-0, policy_version 85958 (0.00090) [2022-07-09 04:23:08,657][25689] Fps is (10 sec: 5497.1, 60 sec: 5686.5, 300 sec: 5681.8). Total num frames: 88028160. Throughput: 0: 5862.8. Samples: 88031194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:23:08,658][25689] Avg episode reward: [(0, '-53.726')] [2022-07-09 04:23:09,279][26022] Updated weights on worker 0-0, policy_version 85968 (0.00076) [2022-07-09 04:23:10,861][26022] Updated weights on worker 0-0, policy_version 85978 (0.00085) [2022-07-09 04:23:12,839][26022] Updated weights on worker 0-0, policy_version 85988 (0.00086) [2022-07-09 04:23:13,775][25689] Fps is (10 sec: 5864.0, 60 sec: 5681.5, 300 sec: 5683.1). Total num frames: 88057856. Throughput: 0: 5856.4. Samples: 88065554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:23:13,776][25689] Avg episode reward: [(0, '-53.445')] [2022-07-09 04:23:14,392][26022] Updated weights on worker 0-0, policy_version 85998 (0.00094) [2022-07-09 04:23:16,485][26022] Updated weights on worker 0-0, policy_version 86008 (0.00100) [2022-07-09 04:23:18,268][26022] Updated weights on worker 0-0, policy_version 86018 (0.00082) [2022-07-09 04:23:18,809][25689] Fps is (10 sec: 5547.7, 60 sec: 5667.9, 300 sec: 5675.7). Total num frames: 88084480. Throughput: 0: 4974.7. Samples: 88082436. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:23:18,809][25689] Avg episode reward: [(0, '-53.954')] [2022-07-09 04:23:19,957][26022] Updated weights on worker 0-0, policy_version 86028 (0.00090) [2022-07-09 04:23:21,727][26022] Updated weights on worker 0-0, policy_version 86038 (0.00089) [2022-07-09 04:23:23,390][26022] Updated weights on worker 0-0, policy_version 86048 (0.00093) [2022-07-09 04:23:23,848][25689] Fps is (10 sec: 5692.8, 60 sec: 5699.5, 300 sec: 5685.4). Total num frames: 88115200. Throughput: 0: 5845.0. Samples: 88117222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:23:23,849][25689] Avg episode reward: [(0, '-53.744')] [2022-07-09 04:23:25,416][26022] Updated weights on worker 0-0, policy_version 86058 (0.00095) [2022-07-09 04:23:26,977][26022] Updated weights on worker 0-0, policy_version 86068 (0.00084) [2022-07-09 04:23:28,937][25689] Fps is (10 sec: 5763.2, 60 sec: 5662.8, 300 sec: 5678.1). Total num frames: 88142848. Throughput: 0: 5927.7. Samples: 88151406. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 04:23:28,937][25689] Avg episode reward: [(0, '-53.526')] [2022-07-09 04:23:29,020][26022] Updated weights on worker 0-0, policy_version 86078 (0.00086) [2022-07-09 04:23:30,551][26022] Updated weights on worker 0-0, policy_version 86088 (0.00099) [2022-07-09 04:23:32,558][26022] Updated weights on worker 0-0, policy_version 86098 (0.00084) [2022-07-09 04:23:34,005][25689] Fps is (10 sec: 5746.9, 60 sec: 5677.5, 300 sec: 5683.8). Total num frames: 88173568. Throughput: 0: 5092.8. Samples: 88168578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:23:34,005][25689] Avg episode reward: [(0, '-52.939')] [2022-07-09 04:23:34,198][26022] Updated weights on worker 0-0, policy_version 86108 (0.00089) [2022-07-09 04:23:36,080][26022] Updated weights on worker 0-0, policy_version 86118 (0.00084) [2022-07-09 04:23:37,815][26022] Updated weights on worker 0-0, policy_version 86128 (0.00095) [2022-07-09 04:23:39,014][25689] Fps is (10 sec: 5791.9, 60 sec: 5713.5, 300 sec: 5677.2). Total num frames: 88201216. Throughput: 0: 5958.3. Samples: 88202826. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:23:39,015][25689] Avg episode reward: [(0, '-52.770')] [2022-07-09 04:23:39,832][26022] Updated weights on worker 0-0, policy_version 86138 (0.00090) [2022-07-09 04:23:41,127][26022] Updated weights on worker 0-0, policy_version 86148 (0.00126) [2022-07-09 04:23:43,400][26022] Updated weights on worker 0-0, policy_version 86158 (0.00090) [2022-07-09 04:23:44,021][25689] Fps is (10 sec: 5622.9, 60 sec: 5669.3, 300 sec: 5680.6). Total num frames: 88229888. Throughput: 0: 5954.8. Samples: 88237346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:23:44,021][25689] Avg episode reward: [(0, '-53.203')] [2022-07-09 04:23:44,943][26022] Updated weights on worker 0-0, policy_version 86168 (0.00087) [2022-07-09 04:23:46,959][26022] Updated weights on worker 0-0, policy_version 86178 (0.00080) [2022-07-09 04:23:48,587][26022] Updated weights on worker 0-0, policy_version 86188 (0.00091) [2022-07-09 04:23:49,027][25689] Fps is (10 sec: 5727.0, 60 sec: 5694.8, 300 sec: 5679.6). Total num frames: 88258560. Throughput: 0: 5119.7. Samples: 88254264. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:23:49,028][25689] Avg episode reward: [(0, '-53.851')] [2022-07-09 04:23:50,342][26022] Updated weights on worker 0-0, policy_version 86198 (0.00087) [2022-07-09 04:23:52,129][26022] Updated weights on worker 0-0, policy_version 86208 (0.00086) [2022-07-09 04:23:54,019][26022] Updated weights on worker 0-0, policy_version 86218 (0.00083) [2022-07-09 04:23:54,135][25689] Fps is (10 sec: 5669.5, 60 sec: 5678.2, 300 sec: 5678.5). Total num frames: 88287232. Throughput: 0: 5951.7. Samples: 88288390. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:23:54,136][25689] Avg episode reward: [(0, '-54.004')] [2022-07-09 04:23:55,877][26022] Updated weights on worker 0-0, policy_version 86228 (0.00101) [2022-07-09 04:23:57,665][26022] Updated weights on worker 0-0, policy_version 86238 (0.00087) [2022-07-09 04:23:59,152][25689] Fps is (10 sec: 5562.6, 60 sec: 5648.1, 300 sec: 5678.5). Total num frames: 88314880. Throughput: 0: 5949.0. Samples: 88322626. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:23:59,152][25689] Avg episode reward: [(0, '-53.856')] [2022-07-09 04:23:59,443][26022] Updated weights on worker 0-0, policy_version 86248 (0.00099) [2022-07-09 04:24:01,188][26022] Updated weights on worker 0-0, policy_version 86258 (0.00093) [2022-07-09 04:24:03,419][26022] Updated weights on worker 0-0, policy_version 86268 (0.00091) [2022-07-09 04:24:04,162][25689] Fps is (10 sec: 5617.3, 60 sec: 5702.6, 300 sec: 5679.0). Total num frames: 88343552. Throughput: 0: 5079.5. Samples: 88339650. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:04,162][25689] Avg episode reward: [(0, '-54.389')] [2022-07-09 04:24:05,417][26022] Updated weights on worker 0-0, policy_version 86278 (0.00092) [2022-07-09 04:24:06,886][26022] Updated weights on worker 0-0, policy_version 86288 (0.00086) [2022-07-09 04:24:09,129][26022] Updated weights on worker 0-0, policy_version 86298 (0.00048) [2022-07-09 04:24:09,198][25689] Fps is (10 sec: 5402.3, 60 sec: 5632.4, 300 sec: 5665.6). Total num frames: 88369152. Throughput: 0: 5828.3. Samples: 88371826. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:09,198][25689] Avg episode reward: [(0, '-54.091')] [2022-07-09 04:24:10,318][26022] Updated weights on worker 0-0, policy_version 86308 (0.00090) [2022-07-09 04:24:12,688][26022] Updated weights on worker 0-0, policy_version 86318 (0.00091) [2022-07-09 04:24:14,146][26022] Updated weights on worker 0-0, policy_version 86328 (0.00091) [2022-07-09 04:24:14,262][25689] Fps is (10 sec: 5575.9, 60 sec: 5654.4, 300 sec: 5682.1). Total num frames: 88399872. Throughput: 0: 5842.1. Samples: 88405972. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:14,263][25689] Avg episode reward: [(0, '-53.742')] [2022-07-09 04:24:16,315][26022] Updated weights on worker 0-0, policy_version 86338 (0.00096) [2022-07-09 04:24:17,744][26022] Updated weights on worker 0-0, policy_version 86348 (0.00085) [2022-07-09 04:24:19,315][25689] Fps is (10 sec: 5769.5, 60 sec: 5669.6, 300 sec: 5674.5). Total num frames: 88427520. Throughput: 0: 4977.4. Samples: 88422984. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:19,315][25689] Avg episode reward: [(0, '-52.899')] [2022-07-09 04:24:19,784][26022] Updated weights on worker 0-0, policy_version 86358 (0.00087) [2022-07-09 04:24:21,248][26022] Updated weights on worker 0-0, policy_version 86368 (0.00095) [2022-07-09 04:24:23,282][26022] Updated weights on worker 0-0, policy_version 86378 (0.00088) [2022-07-09 04:24:24,331][25689] Fps is (10 sec: 5797.0, 60 sec: 5671.7, 300 sec: 5681.4). Total num frames: 88458240. Throughput: 0: 5841.1. Samples: 88457460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:24,331][25689] Avg episode reward: [(0, '-53.159')] [2022-07-09 04:24:24,982][26022] Updated weights on worker 0-0, policy_version 86388 (0.00087) [2022-07-09 04:24:25,436][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:24:25,450][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000086390_88463360.pth [2022-07-09 04:24:25,450][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000084390_86415360.pth [2022-07-09 04:24:26,718][26022] Updated weights on worker 0-0, policy_version 86398 (0.00091) [2022-07-09 04:24:28,492][26022] Updated weights on worker 0-0, policy_version 86408 (0.00086) [2022-07-09 04:24:29,347][25689] Fps is (10 sec: 5818.2, 60 sec: 5678.5, 300 sec: 5679.5). Total num frames: 88485888. Throughput: 0: 5974.9. Samples: 88492212. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:29,347][25689] Avg episode reward: [(0, '-53.110')] [2022-07-09 04:24:30,311][26022] Updated weights on worker 0-0, policy_version 86418 (0.00093) [2022-07-09 04:24:31,923][26022] Updated weights on worker 0-0, policy_version 86428 (0.00087) [2022-07-09 04:24:33,891][26022] Updated weights on worker 0-0, policy_version 86438 (0.00082) [2022-07-09 04:24:34,473][25689] Fps is (10 sec: 5754.8, 60 sec: 5673.0, 300 sec: 5685.1). Total num frames: 88516608. Throughput: 0: 5121.4. Samples: 88509482. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:34,474][25689] Avg episode reward: [(0, '-53.085')] [2022-07-09 04:24:35,667][26022] Updated weights on worker 0-0, policy_version 86448 (0.00086) [2022-07-09 04:24:37,347][26022] Updated weights on worker 0-0, policy_version 86458 (0.00088) [2022-07-09 04:24:39,314][26022] Updated weights on worker 0-0, policy_version 86468 (0.00091) [2022-07-09 04:24:39,490][25689] Fps is (10 sec: 5855.4, 60 sec: 5689.3, 300 sec: 5685.2). Total num frames: 88545280. Throughput: 0: 5998.6. Samples: 88544006. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:39,490][25689] Avg episode reward: [(0, '-53.331')] [2022-07-09 04:24:40,880][26022] Updated weights on worker 0-0, policy_version 86478 (0.00076) [2022-07-09 04:24:42,642][26022] Updated weights on worker 0-0, policy_version 86488 (0.00088) [2022-07-09 04:24:44,493][25689] Fps is (10 sec: 5518.6, 60 sec: 5655.7, 300 sec: 5680.2). Total num frames: 88571904. Throughput: 0: 6003.4. Samples: 88578504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:44,494][25689] Avg episode reward: [(0, '-52.589')] [2022-07-09 04:24:44,589][26022] Updated weights on worker 0-0, policy_version 86498 (0.00082) [2022-07-09 04:24:46,195][26022] Updated weights on worker 0-0, policy_version 86508 (0.00100) [2022-07-09 04:24:48,396][26022] Updated weights on worker 0-0, policy_version 86518 (0.00081) [2022-07-09 04:24:49,525][25689] Fps is (10 sec: 5612.0, 60 sec: 5670.3, 300 sec: 5680.7). Total num frames: 88601600. Throughput: 0: 5118.6. Samples: 88595498. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:49,527][25689] Avg episode reward: [(0, '-53.021')] [2022-07-09 04:24:49,755][26022] Updated weights on worker 0-0, policy_version 86528 (0.00089) [2022-07-09 04:24:51,877][26022] Updated weights on worker 0-0, policy_version 86538 (0.00091) [2022-07-09 04:24:53,609][26022] Updated weights on worker 0-0, policy_version 86548 (0.00086) [2022-07-09 04:24:54,591][25689] Fps is (10 sec: 5780.2, 60 sec: 5674.2, 300 sec: 5679.5). Total num frames: 88630272. Throughput: 0: 5964.7. Samples: 88629480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:54,592][25689] Avg episode reward: [(0, '-53.410')] [2022-07-09 04:24:55,417][26022] Updated weights on worker 0-0, policy_version 86558 (0.00092) [2022-07-09 04:24:57,091][26022] Updated weights on worker 0-0, policy_version 86568 (0.00471) [2022-07-09 04:24:58,849][26022] Updated weights on worker 0-0, policy_version 86578 (0.00091) [2022-07-09 04:24:59,615][25689] Fps is (10 sec: 5582.0, 60 sec: 5673.6, 300 sec: 5679.3). Total num frames: 88657920. Throughput: 0: 5957.0. Samples: 88663892. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:24:59,615][25689] Avg episode reward: [(0, '-53.179')] [2022-07-09 04:25:00,648][26022] Updated weights on worker 0-0, policy_version 86588 (0.00093) [2022-07-09 04:25:03,028][26022] Updated weights on worker 0-0, policy_version 86598 (0.00093) [2022-07-09 04:25:04,653][25689] Fps is (10 sec: 5495.6, 60 sec: 5653.9, 300 sec: 5682.2). Total num frames: 88685568. Throughput: 0: 5055.9. Samples: 88680430. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:04,654][25689] Avg episode reward: [(0, '-52.900')] [2022-07-09 04:25:04,718][26022] Updated weights on worker 0-0, policy_version 86608 (0.00086) [2022-07-09 04:25:06,737][26022] Updated weights on worker 0-0, policy_version 86618 (0.00087) [2022-07-09 04:25:08,372][26022] Updated weights on worker 0-0, policy_version 86628 (0.00092) [2022-07-09 04:25:09,660][25689] Fps is (10 sec: 5505.0, 60 sec: 5690.6, 300 sec: 5676.6). Total num frames: 88713216. Throughput: 0: 5813.2. Samples: 88712542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:09,660][25689] Avg episode reward: [(0, '-52.222')] [2022-07-09 04:25:10,257][26022] Updated weights on worker 0-0, policy_version 86638 (0.00094) [2022-07-09 04:25:12,063][26022] Updated weights on worker 0-0, policy_version 86648 (0.00086) [2022-07-09 04:25:13,803][26022] Updated weights on worker 0-0, policy_version 86658 (0.00086) [2022-07-09 04:25:14,729][25689] Fps is (10 sec: 5589.5, 60 sec: 5656.2, 300 sec: 5672.3). Total num frames: 88741888. Throughput: 0: 5825.3. Samples: 88746788. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:14,730][25689] Avg episode reward: [(0, '-53.050')] [2022-07-09 04:25:15,610][26022] Updated weights on worker 0-0, policy_version 86668 (0.00100) [2022-07-09 04:25:17,491][26022] Updated weights on worker 0-0, policy_version 86678 (0.00089) [2022-07-09 04:25:19,120][26022] Updated weights on worker 0-0, policy_version 86688 (0.00091) [2022-07-09 04:25:19,744][25689] Fps is (10 sec: 5788.1, 60 sec: 5693.7, 300 sec: 5679.6). Total num frames: 88771584. Throughput: 0: 4966.9. Samples: 88763870. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:19,744][25689] Avg episode reward: [(0, '-52.613')] [2022-07-09 04:25:21,168][26022] Updated weights on worker 0-0, policy_version 86698 (0.00087) [2022-07-09 04:25:22,816][26022] Updated weights on worker 0-0, policy_version 86708 (0.00090) [2022-07-09 04:25:24,561][26022] Updated weights on worker 0-0, policy_version 86718 (0.00094) [2022-07-09 04:25:24,746][25689] Fps is (10 sec: 5827.3, 60 sec: 5661.1, 300 sec: 5679.8). Total num frames: 88800256. Throughput: 0: 5855.4. Samples: 88798080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:24,746][25689] Avg episode reward: [(0, '-52.337')] [2022-07-09 04:25:26,540][26022] Updated weights on worker 0-0, policy_version 86728 (0.00094) [2022-07-09 04:25:28,175][26022] Updated weights on worker 0-0, policy_version 86738 (0.00094) [2022-07-09 04:25:29,757][25689] Fps is (10 sec: 5522.0, 60 sec: 5644.6, 300 sec: 5670.8). Total num frames: 88826880. Throughput: 0: 5970.8. Samples: 88832542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:29,758][25689] Avg episode reward: [(0, '-52.084')] [2022-07-09 04:25:30,096][26022] Updated weights on worker 0-0, policy_version 86748 (0.00087) [2022-07-09 04:25:31,713][26022] Updated weights on worker 0-0, policy_version 86758 (0.00112) [2022-07-09 04:25:33,621][26022] Updated weights on worker 0-0, policy_version 86768 (0.00089) [2022-07-09 04:25:34,874][25689] Fps is (10 sec: 5762.8, 60 sec: 5662.5, 300 sec: 5682.7). Total num frames: 88858624. Throughput: 0: 5948.6. Samples: 88866620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:34,874][25689] Avg episode reward: [(0, '-52.765')] [2022-07-09 04:25:35,436][26022] Updated weights on worker 0-0, policy_version 86778 (0.00092) [2022-07-09 04:25:37,227][26022] Updated weights on worker 0-0, policy_version 86788 (0.00086) [2022-07-09 04:25:39,108][26022] Updated weights on worker 0-0, policy_version 86798 (0.00088) [2022-07-09 04:25:39,914][25689] Fps is (10 sec: 5847.5, 60 sec: 5643.3, 300 sec: 5679.1). Total num frames: 88886272. Throughput: 0: 5942.6. Samples: 88883736. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:39,915][25689] Avg episode reward: [(0, '-53.024')] [2022-07-09 04:25:40,763][26022] Updated weights on worker 0-0, policy_version 86808 (0.00085) [2022-07-09 04:25:42,407][26022] Updated weights on worker 0-0, policy_version 86818 (0.00093) [2022-07-09 04:25:44,447][26022] Updated weights on worker 0-0, policy_version 86828 (0.00091) [2022-07-09 04:25:44,948][25689] Fps is (10 sec: 5590.4, 60 sec: 5674.3, 300 sec: 5678.9). Total num frames: 88914944. Throughput: 0: 5937.9. Samples: 88918042. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:44,949][25689] Avg episode reward: [(0, '-51.926')] [2022-07-09 04:25:46,219][26022] Updated weights on worker 0-0, policy_version 86838 (0.00088) [2022-07-09 04:25:47,984][26022] Updated weights on worker 0-0, policy_version 86848 (0.00090) [2022-07-09 04:25:49,878][26022] Updated weights on worker 0-0, policy_version 86858 (0.00097) [2022-07-09 04:25:49,974][25689] Fps is (10 sec: 5598.5, 60 sec: 5641.0, 300 sec: 5672.6). Total num frames: 88942592. Throughput: 0: 5920.3. Samples: 88952230. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:49,974][25689] Avg episode reward: [(0, '-52.603')] [2022-07-09 04:25:51,559][26022] Updated weights on worker 0-0, policy_version 86868 (0.00097) [2022-07-09 04:25:53,448][26022] Updated weights on worker 0-0, policy_version 86878 (0.00091) [2022-07-09 04:25:54,991][26022] Updated weights on worker 0-0, policy_version 86888 (0.00082) [2022-07-09 04:25:55,085][25689] Fps is (10 sec: 5757.9, 60 sec: 5670.7, 300 sec: 5682.0). Total num frames: 88973312. Throughput: 0: 5087.1. Samples: 88969438. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:25:55,086][25689] Avg episode reward: [(0, '-53.184')] [2022-07-09 04:25:57,002][26022] Updated weights on worker 0-0, policy_version 86898 (0.00091) [2022-07-09 04:25:58,684][26022] Updated weights on worker 0-0, policy_version 86908 (0.00087) [2022-07-09 04:26:00,106][25689] Fps is (10 sec: 5760.6, 60 sec: 5670.9, 300 sec: 5685.4). Total num frames: 89000960. Throughput: 0: 5954.1. Samples: 89003962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:00,107][25689] Avg episode reward: [(0, '-52.567')] [2022-07-09 04:26:00,495][26022] Updated weights on worker 0-0, policy_version 86918 (0.00086) [2022-07-09 04:26:02,568][26022] Updated weights on worker 0-0, policy_version 86928 (0.00092) [2022-07-09 04:26:04,383][26022] Updated weights on worker 0-0, policy_version 86938 (0.00087) [2022-07-09 04:26:05,137][25689] Fps is (10 sec: 5501.3, 60 sec: 5671.6, 300 sec: 5674.6). Total num frames: 89028608. Throughput: 0: 5870.0. Samples: 89036548. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:05,137][25689] Avg episode reward: [(0, '-52.043')] [2022-07-09 04:26:06,339][26022] Updated weights on worker 0-0, policy_version 86948 (0.00087) [2022-07-09 04:26:07,902][26022] Updated weights on worker 0-0, policy_version 86958 (0.00086) [2022-07-09 04:26:09,932][26022] Updated weights on worker 0-0, policy_version 86968 (0.00085) [2022-07-09 04:26:10,151][25689] Fps is (10 sec: 5504.9, 60 sec: 5670.9, 300 sec: 5678.9). Total num frames: 89056256. Throughput: 0: 5026.3. Samples: 89053646. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:10,159][25689] Avg episode reward: [(0, '-53.201')] [2022-07-09 04:26:11,458][26022] Updated weights on worker 0-0, policy_version 86978 (0.00085) [2022-07-09 04:26:13,364][26022] Updated weights on worker 0-0, policy_version 86988 (0.00086) [2022-07-09 04:26:15,112][26022] Updated weights on worker 0-0, policy_version 86998 (0.00087) [2022-07-09 04:26:15,231][25689] Fps is (10 sec: 5681.1, 60 sec: 5686.9, 300 sec: 5678.0). Total num frames: 89085952. Throughput: 0: 5886.3. Samples: 89088020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:15,231][25689] Avg episode reward: [(0, '-52.792')] [2022-07-09 04:26:16,779][26022] Updated weights on worker 0-0, policy_version 87008 (0.00087) [2022-07-09 04:26:18,697][26022] Updated weights on worker 0-0, policy_version 87018 (0.00091) [2022-07-09 04:26:20,247][25689] Fps is (10 sec: 5882.9, 60 sec: 5686.7, 300 sec: 5678.1). Total num frames: 89115648. Throughput: 0: 5868.8. Samples: 89122162. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:20,247][25689] Avg episode reward: [(0, '-52.571')] [2022-07-09 04:26:20,697][26022] Updated weights on worker 0-0, policy_version 87028 (0.00089) [2022-07-09 04:26:22,284][26022] Updated weights on worker 0-0, policy_version 87038 (0.00092) [2022-07-09 04:26:23,945][26022] Updated weights on worker 0-0, policy_version 87048 (0.00093) [2022-07-09 04:26:25,260][25689] Fps is (10 sec: 5615.6, 60 sec: 5651.8, 300 sec: 5671.9). Total num frames: 89142272. Throughput: 0: 5114.6. Samples: 89139470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:25,260][25689] Avg episode reward: [(0, '-52.363')] [2022-07-09 04:26:25,464][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:26:25,481][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000087055_89144320.pth [2022-07-09 04:26:25,482][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000085056_87097344.pth [2022-07-09 04:26:25,726][26022] Updated weights on worker 0-0, policy_version 87058 (0.00086) [2022-07-09 04:26:27,608][26022] Updated weights on worker 0-0, policy_version 87068 (0.00088) [2022-07-09 04:26:29,542][26022] Updated weights on worker 0-0, policy_version 87078 (0.00387) [2022-07-09 04:26:30,267][25689] Fps is (10 sec: 5620.8, 60 sec: 5703.1, 300 sec: 5676.6). Total num frames: 89171968. Throughput: 0: 5968.4. Samples: 89173704. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:30,267][25689] Avg episode reward: [(0, '-53.199')] [2022-07-09 04:26:31,191][26022] Updated weights on worker 0-0, policy_version 87088 (0.00083) [2022-07-09 04:26:33,127][26022] Updated weights on worker 0-0, policy_version 87098 (0.00094) [2022-07-09 04:26:34,871][26022] Updated weights on worker 0-0, policy_version 87108 (0.00081) [2022-07-09 04:26:35,331][25689] Fps is (10 sec: 5795.7, 60 sec: 5657.2, 300 sec: 5675.6). Total num frames: 89200640. Throughput: 0: 5971.5. Samples: 89208048. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:35,331][25689] Avg episode reward: [(0, '-53.090')] [2022-07-09 04:26:36,663][26022] Updated weights on worker 0-0, policy_version 87118 (0.00087) [2022-07-09 04:26:38,477][26022] Updated weights on worker 0-0, policy_version 87128 (0.00095) [2022-07-09 04:26:40,231][26022] Updated weights on worker 0-0, policy_version 87138 (0.00091) [2022-07-09 04:26:40,345][25689] Fps is (10 sec: 5689.8, 60 sec: 5676.6, 300 sec: 5675.6). Total num frames: 89229312. Throughput: 0: 5125.6. Samples: 89225178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:40,346][25689] Avg episode reward: [(0, '-53.435')] [2022-07-09 04:26:41,987][26022] Updated weights on worker 0-0, policy_version 87148 (0.00088) [2022-07-09 04:26:43,886][26022] Updated weights on worker 0-0, policy_version 87158 (0.00087) [2022-07-09 04:26:45,351][25689] Fps is (10 sec: 5825.2, 60 sec: 5696.2, 300 sec: 5679.2). Total num frames: 89259008. Throughput: 0: 5994.5. Samples: 89259904. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:45,352][25689] Avg episode reward: [(0, '-53.635')] [2022-07-09 04:26:45,632][26022] Updated weights on worker 0-0, policy_version 87168 (0.00092) [2022-07-09 04:26:47,284][26022] Updated weights on worker 0-0, policy_version 87178 (0.00085) [2022-07-09 04:26:49,099][26022] Updated weights on worker 0-0, policy_version 87188 (0.00092) [2022-07-09 04:26:50,364][25689] Fps is (10 sec: 5723.3, 60 sec: 5697.3, 300 sec: 5673.9). Total num frames: 89286656. Throughput: 0: 6004.5. Samples: 89294382. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:50,365][25689] Avg episode reward: [(0, '-53.520')] [2022-07-09 04:26:50,896][26022] Updated weights on worker 0-0, policy_version 87198 (0.00091) [2022-07-09 04:26:52,733][26022] Updated weights on worker 0-0, policy_version 87208 (0.00055) [2022-07-09 04:26:54,536][26022] Updated weights on worker 0-0, policy_version 87218 (0.00094) [2022-07-09 04:26:55,487][25689] Fps is (10 sec: 5657.1, 60 sec: 5679.3, 300 sec: 5678.6). Total num frames: 89316352. Throughput: 0: 5124.8. Samples: 89311348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 04:26:55,488][25689] Avg episode reward: [(0, '-53.996')] [2022-07-09 04:26:56,263][26022] Updated weights on worker 0-0, policy_version 87228 (0.00085) [2022-07-09 04:26:58,177][26022] Updated weights on worker 0-0, policy_version 87238 (0.00086) [2022-07-09 04:26:59,655][26022] Updated weights on worker 0-0, policy_version 87248 (0.00094) [2022-07-09 04:27:00,511][25689] Fps is (10 sec: 5651.7, 60 sec: 5679.0, 300 sec: 5678.4). Total num frames: 89344000. Throughput: 0: 5984.0. Samples: 89345850. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:00,511][25689] Avg episode reward: [(0, '-53.381')] [2022-07-09 04:27:02,093][26022] Updated weights on worker 0-0, policy_version 87258 (0.00086) [2022-07-09 04:27:03,803][26022] Updated weights on worker 0-0, policy_version 87268 (0.00088) [2022-07-09 04:27:05,528][25689] Fps is (10 sec: 5507.1, 60 sec: 5680.3, 300 sec: 5674.8). Total num frames: 89371648. Throughput: 0: 5864.3. Samples: 89378232. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:05,529][25689] Avg episode reward: [(0, '-53.419')] [2022-07-09 04:27:05,595][26022] Updated weights on worker 0-0, policy_version 87278 (0.00095) [2022-07-09 04:27:07,419][26022] Updated weights on worker 0-0, policy_version 87288 (0.00089) [2022-07-09 04:27:09,177][26022] Updated weights on worker 0-0, policy_version 87298 (0.00085) [2022-07-09 04:27:10,602][25689] Fps is (10 sec: 5581.0, 60 sec: 5691.6, 300 sec: 5671.2). Total num frames: 89400320. Throughput: 0: 4983.0. Samples: 89395226. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:10,603][25689] Avg episode reward: [(0, '-53.147')] [2022-07-09 04:27:10,904][26022] Updated weights on worker 0-0, policy_version 87308 (0.00093) [2022-07-09 04:27:12,791][26022] Updated weights on worker 0-0, policy_version 87318 (0.00098) [2022-07-09 04:27:14,675][26022] Updated weights on worker 0-0, policy_version 87328 (0.00084) [2022-07-09 04:27:15,680][25689] Fps is (10 sec: 5648.8, 60 sec: 5674.8, 300 sec: 5674.5). Total num frames: 89428992. Throughput: 0: 5839.3. Samples: 89429258. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:15,682][25689] Avg episode reward: [(0, '-53.113')] [2022-07-09 04:27:16,516][26022] Updated weights on worker 0-0, policy_version 87338 (0.00094) [2022-07-09 04:27:18,290][26022] Updated weights on worker 0-0, policy_version 87348 (0.00082) [2022-07-09 04:27:20,014][26022] Updated weights on worker 0-0, policy_version 87358 (0.00086) [2022-07-09 04:27:20,694][25689] Fps is (10 sec: 5682.4, 60 sec: 5658.1, 300 sec: 5674.5). Total num frames: 89457664. Throughput: 0: 5810.0. Samples: 89463112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:20,694][25689] Avg episode reward: [(0, '-52.986')] [2022-07-09 04:27:22,090][26022] Updated weights on worker 0-0, policy_version 87368 (0.00101) [2022-07-09 04:27:23,767][26022] Updated weights on worker 0-0, policy_version 87378 (0.00081) [2022-07-09 04:27:25,520][26022] Updated weights on worker 0-0, policy_version 87388 (0.00095) [2022-07-09 04:27:25,793][25689] Fps is (10 sec: 5670.4, 60 sec: 5683.9, 300 sec: 5670.3). Total num frames: 89486336. Throughput: 0: 5035.0. Samples: 89480264. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:25,793][25689] Avg episode reward: [(0, '-53.482')] [2022-07-09 04:27:27,433][26022] Updated weights on worker 0-0, policy_version 87398 (0.00085) [2022-07-09 04:27:29,173][26022] Updated weights on worker 0-0, policy_version 87408 (0.00088) [2022-07-09 04:27:30,825][25689] Fps is (10 sec: 5558.7, 60 sec: 5647.7, 300 sec: 5663.6). Total num frames: 89513984. Throughput: 0: 5892.3. Samples: 89514388. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:30,826][25689] Avg episode reward: [(0, '-53.270')] [2022-07-09 04:27:31,028][26022] Updated weights on worker 0-0, policy_version 87418 (0.00096) [2022-07-09 04:27:32,543][26022] Updated weights on worker 0-0, policy_version 87428 (0.00093) [2022-07-09 04:27:34,599][26022] Updated weights on worker 0-0, policy_version 87438 (0.00088) [2022-07-09 04:27:35,949][25689] Fps is (10 sec: 5747.0, 60 sec: 5675.9, 300 sec: 5679.1). Total num frames: 89544704. Throughput: 0: 5885.3. Samples: 89548548. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:35,951][25689] Avg episode reward: [(0, '-53.606')] [2022-07-09 04:27:36,305][26022] Updated weights on worker 0-0, policy_version 87448 (0.00080) [2022-07-09 04:27:38,222][26022] Updated weights on worker 0-0, policy_version 87458 (0.00090) [2022-07-09 04:27:39,872][26022] Updated weights on worker 0-0, policy_version 87468 (0.00089) [2022-07-09 04:27:40,991][25689] Fps is (10 sec: 5741.5, 60 sec: 5656.4, 300 sec: 5666.0). Total num frames: 89572352. Throughput: 0: 5056.4. Samples: 89565746. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:40,993][25689] Avg episode reward: [(0, '-53.285')] [2022-07-09 04:27:41,768][26022] Updated weights on worker 0-0, policy_version 87478 (0.00093) [2022-07-09 04:27:43,435][26022] Updated weights on worker 0-0, policy_version 87488 (0.00089) [2022-07-09 04:27:45,202][26022] Updated weights on worker 0-0, policy_version 87498 (0.00101) [2022-07-09 04:27:46,019][25689] Fps is (10 sec: 5694.6, 60 sec: 5654.4, 300 sec: 5674.2). Total num frames: 89602048. Throughput: 0: 5929.6. Samples: 89600196. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:46,019][25689] Avg episode reward: [(0, '-52.852')] [2022-07-09 04:27:46,952][26022] Updated weights on worker 0-0, policy_version 87508 (0.00086) [2022-07-09 04:27:48,767][26022] Updated weights on worker 0-0, policy_version 87518 (0.00093) [2022-07-09 04:27:50,408][26022] Updated weights on worker 0-0, policy_version 87528 (0.00088) [2022-07-09 04:27:51,023][25689] Fps is (10 sec: 5818.6, 60 sec: 5672.2, 300 sec: 5672.8). Total num frames: 89630720. Throughput: 0: 5963.4. Samples: 89634832. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:51,023][25689] Avg episode reward: [(0, '-53.249')] [2022-07-09 04:27:52,458][26022] Updated weights on worker 0-0, policy_version 87538 (0.00093) [2022-07-09 04:27:54,120][26022] Updated weights on worker 0-0, policy_version 87548 (0.00089) [2022-07-09 04:27:55,860][26022] Updated weights on worker 0-0, policy_version 87558 (0.00087) [2022-07-09 04:27:56,072][25689] Fps is (10 sec: 5704.1, 60 sec: 5662.2, 300 sec: 5669.5). Total num frames: 89659392. Throughput: 0: 5994.7. Samples: 89669178. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:27:56,074][25689] Avg episode reward: [(0, '-53.207')] [2022-07-09 04:27:57,888][26022] Updated weights on worker 0-0, policy_version 87568 (0.00095) [2022-07-09 04:27:59,495][26022] Updated weights on worker 0-0, policy_version 87578 (0.00083) [2022-07-09 04:28:01,085][25689] Fps is (10 sec: 5698.8, 60 sec: 5680.0, 300 sec: 5680.5). Total num frames: 89688064. Throughput: 0: 6007.7. Samples: 89686464. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:01,086][25689] Avg episode reward: [(0, '-53.116')] [2022-07-09 04:28:01,347][26022] Updated weights on worker 0-0, policy_version 87588 (0.00086) [2022-07-09 04:28:03,445][26022] Updated weights on worker 0-0, policy_version 87598 (0.00093) [2022-07-09 04:28:05,199][26022] Updated weights on worker 0-0, policy_version 87608 (0.00093) [2022-07-09 04:28:06,106][25689] Fps is (10 sec: 5408.8, 60 sec: 5645.9, 300 sec: 5666.5). Total num frames: 89713664. Throughput: 0: 5913.0. Samples: 89718970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:06,108][25689] Avg episode reward: [(0, '-52.563')] [2022-07-09 04:28:07,124][26022] Updated weights on worker 0-0, policy_version 87618 (0.00085) [2022-07-09 04:28:08,818][26022] Updated weights on worker 0-0, policy_version 87628 (0.00092) [2022-07-09 04:28:10,541][26022] Updated weights on worker 0-0, policy_version 87638 (0.00089) [2022-07-09 04:28:11,111][25689] Fps is (10 sec: 5515.3, 60 sec: 5669.2, 300 sec: 5668.7). Total num frames: 89743360. Throughput: 0: 5897.1. Samples: 89753294. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:11,112][25689] Avg episode reward: [(0, '-52.191')] [2022-07-09 04:28:12,567][26022] Updated weights on worker 0-0, policy_version 87648 (0.00095) [2022-07-09 04:28:14,275][26022] Updated weights on worker 0-0, policy_version 87658 (0.00084) [2022-07-09 04:28:15,909][26022] Updated weights on worker 0-0, policy_version 87668 (0.00092) [2022-07-09 04:28:16,213][25689] Fps is (10 sec: 5977.4, 60 sec: 5700.8, 300 sec: 5681.2). Total num frames: 89774080. Throughput: 0: 5031.8. Samples: 89770526. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:16,214][25689] Avg episode reward: [(0, '-52.089')] [2022-07-09 04:28:17,863][26022] Updated weights on worker 0-0, policy_version 87678 (0.00088) [2022-07-09 04:28:19,408][26022] Updated weights on worker 0-0, policy_version 87688 (0.00086) [2022-07-09 04:28:21,235][25689] Fps is (10 sec: 5663.9, 60 sec: 5666.2, 300 sec: 5667.7). Total num frames: 89800704. Throughput: 0: 5875.7. Samples: 89804860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:21,236][25689] Avg episode reward: [(0, '-51.890')] [2022-07-09 04:28:21,456][26022] Updated weights on worker 0-0, policy_version 87698 (0.00081) [2022-07-09 04:28:23,142][26022] Updated weights on worker 0-0, policy_version 87708 (0.00106) [2022-07-09 04:28:24,921][26022] Updated weights on worker 0-0, policy_version 87718 (0.00089) [2022-07-09 04:28:25,541][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:28:25,558][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000087722_89827328.pth [2022-07-09 04:28:25,559][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000085724_87781376.pth [2022-07-09 04:28:26,245][25689] Fps is (10 sec: 5614.3, 60 sec: 5691.5, 300 sec: 5676.1). Total num frames: 89830400. Throughput: 0: 5971.9. Samples: 89839238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:26,247][25689] Avg episode reward: [(0, '-51.264')] [2022-07-09 04:28:26,821][26022] Updated weights on worker 0-0, policy_version 87728 (0.00089) [2022-07-09 04:28:28,419][26022] Updated weights on worker 0-0, policy_version 87738 (0.00088) [2022-07-09 04:28:30,563][26022] Updated weights on worker 0-0, policy_version 87748 (0.00088) [2022-07-09 04:28:31,308][25689] Fps is (10 sec: 5794.5, 60 sec: 5705.5, 300 sec: 5669.3). Total num frames: 89859072. Throughput: 0: 5106.0. Samples: 89856422. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:31,310][25689] Avg episode reward: [(0, '-51.414')] [2022-07-09 04:28:31,887][26022] Updated weights on worker 0-0, policy_version 87758 (0.00088) [2022-07-09 04:28:34,013][26022] Updated weights on worker 0-0, policy_version 87768 (0.00087) [2022-07-09 04:28:35,686][26022] Updated weights on worker 0-0, policy_version 87778 (0.00114) [2022-07-09 04:28:36,415][25689] Fps is (10 sec: 5638.1, 60 sec: 5673.2, 300 sec: 5670.9). Total num frames: 89887744. Throughput: 0: 5945.8. Samples: 89890644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:36,416][25689] Avg episode reward: [(0, '-51.629')] [2022-07-09 04:28:37,571][26022] Updated weights on worker 0-0, policy_version 87788 (0.00110) [2022-07-09 04:28:39,406][26022] Updated weights on worker 0-0, policy_version 87798 (0.00087) [2022-07-09 04:28:41,091][26022] Updated weights on worker 0-0, policy_version 87808 (0.00091) [2022-07-09 04:28:41,427][25689] Fps is (10 sec: 5768.3, 60 sec: 5710.0, 300 sec: 5674.2). Total num frames: 89917440. Throughput: 0: 5938.7. Samples: 89924772. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 04:28:41,428][25689] Avg episode reward: [(0, '-51.955')] [2022-07-09 04:28:42,939][26022] Updated weights on worker 0-0, policy_version 87818 (0.00087) [2022-07-09 04:28:44,712][26022] Updated weights on worker 0-0, policy_version 87828 (0.00091) [2022-07-09 04:28:46,432][25689] Fps is (10 sec: 5725.1, 60 sec: 5678.3, 300 sec: 5670.8). Total num frames: 89945088. Throughput: 0: 5094.2. Samples: 89942074. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:28:46,432][25689] Avg episode reward: [(0, '-52.993')] [2022-07-09 04:28:46,505][26022] Updated weights on worker 0-0, policy_version 87838 (0.00085) [2022-07-09 04:28:48,224][26022] Updated weights on worker 0-0, policy_version 87848 (0.00090) [2022-07-09 04:28:50,119][26022] Updated weights on worker 0-0, policy_version 87858 (0.00089) [2022-07-09 04:28:51,456][25689] Fps is (10 sec: 5615.9, 60 sec: 5676.4, 300 sec: 5672.4). Total num frames: 89973760. Throughput: 0: 5967.1. Samples: 89976642. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:28:51,456][25689] Avg episode reward: [(0, '-52.190')] [2022-07-09 04:28:51,724][26022] Updated weights on worker 0-0, policy_version 87868 (0.00087) [2022-07-09 04:28:53,792][26022] Updated weights on worker 0-0, policy_version 87878 (0.00089) [2022-07-09 04:28:55,356][26022] Updated weights on worker 0-0, policy_version 87888 (0.00089) [2022-07-09 04:28:56,577][25689] Fps is (10 sec: 5652.1, 60 sec: 5669.6, 300 sec: 5673.9). Total num frames: 90002432. Throughput: 0: 5936.6. Samples: 90010336. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:28:56,578][25689] Avg episode reward: [(0, '-52.918')] [2022-07-09 04:28:57,426][26022] Updated weights on worker 0-0, policy_version 87898 (0.00094) [2022-07-09 04:28:58,998][26022] Updated weights on worker 0-0, policy_version 87908 (0.00087) [2022-07-09 04:29:00,932][26022] Updated weights on worker 0-0, policy_version 87918 (0.00087) [2022-07-09 04:29:01,588][25689] Fps is (10 sec: 5659.6, 60 sec: 5669.8, 300 sec: 5673.8). Total num frames: 90031104. Throughput: 0: 5111.2. Samples: 90027818. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:01,588][25689] Avg episode reward: [(0, '-52.856')] [2022-07-09 04:29:03,112][26022] Updated weights on worker 0-0, policy_version 87928 (0.00085) [2022-07-09 04:29:05,025][26022] Updated weights on worker 0-0, policy_version 87938 (0.00092) [2022-07-09 04:29:06,543][26022] Updated weights on worker 0-0, policy_version 87948 (0.00091) [2022-07-09 04:29:06,637][25689] Fps is (10 sec: 5598.2, 60 sec: 5701.0, 300 sec: 5680.5). Total num frames: 90058752. Throughput: 0: 5841.2. Samples: 90060100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:06,638][25689] Avg episode reward: [(0, '-53.029')] [2022-07-09 04:29:08,446][26022] Updated weights on worker 0-0, policy_version 87958 (0.00086) [2022-07-09 04:29:09,924][26022] Updated weights on worker 0-0, policy_version 87968 (0.00088) [2022-07-09 04:29:11,654][25689] Fps is (10 sec: 5493.1, 60 sec: 5666.0, 300 sec: 5671.1). Total num frames: 90086400. Throughput: 0: 5842.9. Samples: 90094660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:11,655][25689] Avg episode reward: [(0, '-52.843')] [2022-07-09 04:29:12,004][26022] Updated weights on worker 0-0, policy_version 87978 (0.00094) [2022-07-09 04:29:13,613][26022] Updated weights on worker 0-0, policy_version 87988 (0.00089) [2022-07-09 04:29:15,554][26022] Updated weights on worker 0-0, policy_version 87998 (0.00097) [2022-07-09 04:29:16,755][25689] Fps is (10 sec: 5566.4, 60 sec: 5632.3, 300 sec: 5673.6). Total num frames: 90115072. Throughput: 0: 5028.1. Samples: 90111794. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:16,756][25689] Avg episode reward: [(0, '-52.826')] [2022-07-09 04:29:17,391][26022] Updated weights on worker 0-0, policy_version 88008 (0.00085) [2022-07-09 04:29:18,975][26022] Updated weights on worker 0-0, policy_version 88018 (0.00093) [2022-07-09 04:29:20,972][26022] Updated weights on worker 0-0, policy_version 88028 (0.00092) [2022-07-09 04:29:21,779][25689] Fps is (10 sec: 5865.9, 60 sec: 5699.8, 300 sec: 5673.4). Total num frames: 90145792. Throughput: 0: 5854.3. Samples: 90146024. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:21,782][25689] Avg episode reward: [(0, '-53.653')] [2022-07-09 04:29:22,725][26022] Updated weights on worker 0-0, policy_version 88038 (0.00089) [2022-07-09 04:29:24,521][26022] Updated weights on worker 0-0, policy_version 88048 (0.00085) [2022-07-09 04:29:26,562][26022] Updated weights on worker 0-0, policy_version 88058 (0.00095) [2022-07-09 04:29:26,811][25689] Fps is (10 sec: 5702.8, 60 sec: 5647.0, 300 sec: 5669.7). Total num frames: 90172416. Throughput: 0: 5938.6. Samples: 90179902. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:26,811][25689] Avg episode reward: [(0, '-54.053')] [2022-07-09 04:29:28,076][26022] Updated weights on worker 0-0, policy_version 88068 (0.00090) [2022-07-09 04:29:30,158][26022] Updated weights on worker 0-0, policy_version 88078 (0.00088) [2022-07-09 04:29:31,706][26022] Updated weights on worker 0-0, policy_version 88088 (0.00084) [2022-07-09 04:29:31,842][25689] Fps is (10 sec: 5597.2, 60 sec: 5667.0, 300 sec: 5668.1). Total num frames: 90202112. Throughput: 0: 5076.0. Samples: 90197130. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:31,842][25689] Avg episode reward: [(0, '-53.419')] [2022-07-09 04:29:33,598][26022] Updated weights on worker 0-0, policy_version 88098 (0.00083) [2022-07-09 04:29:35,179][26022] Updated weights on worker 0-0, policy_version 88108 (0.00090) [2022-07-09 04:29:36,915][25689] Fps is (10 sec: 5776.8, 60 sec: 5670.2, 300 sec: 5667.0). Total num frames: 90230784. Throughput: 0: 5943.5. Samples: 90231612. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:36,915][25689] Avg episode reward: [(0, '-53.022')] [2022-07-09 04:29:37,189][26022] Updated weights on worker 0-0, policy_version 88118 (0.00469) [2022-07-09 04:29:38,965][26022] Updated weights on worker 0-0, policy_version 88128 (0.00086) [2022-07-09 04:29:40,640][26022] Updated weights on worker 0-0, policy_version 88138 (0.00095) [2022-07-09 04:29:41,919][25689] Fps is (10 sec: 5791.8, 60 sec: 5670.8, 300 sec: 5677.3). Total num frames: 90260480. Throughput: 0: 5960.3. Samples: 90266064. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:41,920][25689] Avg episode reward: [(0, '-52.676')] [2022-07-09 04:29:42,428][26022] Updated weights on worker 0-0, policy_version 88148 (0.00089) [2022-07-09 04:29:44,166][26022] Updated weights on worker 0-0, policy_version 88158 (0.00396) [2022-07-09 04:29:45,914][26022] Updated weights on worker 0-0, policy_version 88168 (0.00098) [2022-07-09 04:29:46,939][25689] Fps is (10 sec: 5720.4, 60 sec: 5669.4, 300 sec: 5670.6). Total num frames: 90288128. Throughput: 0: 5151.0. Samples: 90283584. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:46,941][25689] Avg episode reward: [(0, '-52.064')] [2022-07-09 04:29:47,746][26022] Updated weights on worker 0-0, policy_version 88178 (0.00086) [2022-07-09 04:29:49,476][26022] Updated weights on worker 0-0, policy_version 88188 (0.00090) [2022-07-09 04:29:51,286][26022] Updated weights on worker 0-0, policy_version 88198 (0.00080) [2022-07-09 04:29:51,944][25689] Fps is (10 sec: 5720.3, 60 sec: 5688.1, 300 sec: 5675.3). Total num frames: 90317824. Throughput: 0: 6023.7. Samples: 90318220. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:51,944][25689] Avg episode reward: [(0, '-52.772')] [2022-07-09 04:29:53,117][26022] Updated weights on worker 0-0, policy_version 88208 (0.00084) [2022-07-09 04:29:55,019][26022] Updated weights on worker 0-0, policy_version 88218 (0.00378) [2022-07-09 04:29:56,744][26022] Updated weights on worker 0-0, policy_version 88228 (0.00060) [2022-07-09 04:29:57,005][25689] Fps is (10 sec: 5696.6, 60 sec: 5676.8, 300 sec: 5674.5). Total num frames: 90345472. Throughput: 0: 6006.2. Samples: 90352282. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:29:57,006][25689] Avg episode reward: [(0, '-52.922')] [2022-07-09 04:29:58,572][26022] Updated weights on worker 0-0, policy_version 88238 (0.00084) [2022-07-09 04:30:00,316][26022] Updated weights on worker 0-0, policy_version 88248 (0.00084) [2022-07-09 04:30:02,022][25689] Fps is (10 sec: 5588.4, 60 sec: 5676.3, 300 sec: 5678.4). Total num frames: 90374144. Throughput: 0: 5142.5. Samples: 90369442. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:02,022][25689] Avg episode reward: [(0, '-53.350')] [2022-07-09 04:30:02,440][26022] Updated weights on worker 0-0, policy_version 88258 (0.00096) [2022-07-09 04:30:04,415][26022] Updated weights on worker 0-0, policy_version 88268 (0.00092) [2022-07-09 04:30:05,867][26022] Updated weights on worker 0-0, policy_version 88278 (0.00084) [2022-07-09 04:30:07,039][25689] Fps is (10 sec: 5613.3, 60 sec: 5679.4, 300 sec: 5678.2). Total num frames: 90401792. Throughput: 0: 5885.1. Samples: 90401872. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:07,039][25689] Avg episode reward: [(0, '-53.712')] [2022-07-09 04:30:07,890][26022] Updated weights on worker 0-0, policy_version 88288 (0.00096) [2022-07-09 04:30:09,532][26022] Updated weights on worker 0-0, policy_version 88298 (0.00097) [2022-07-09 04:30:11,373][26022] Updated weights on worker 0-0, policy_version 88308 (0.00081) [2022-07-09 04:30:12,053][25689] Fps is (10 sec: 5716.5, 60 sec: 5713.5, 300 sec: 5682.7). Total num frames: 90431488. Throughput: 0: 5882.9. Samples: 90436522. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:12,054][25689] Avg episode reward: [(0, '-53.900')] [2022-07-09 04:30:13,369][26022] Updated weights on worker 0-0, policy_version 88318 (0.00093) [2022-07-09 04:30:14,655][26022] Updated weights on worker 0-0, policy_version 88328 (0.00088) [2022-07-09 04:30:16,796][26022] Updated weights on worker 0-0, policy_version 88338 (0.00087) [2022-07-09 04:30:17,143][25689] Fps is (10 sec: 5776.5, 60 sec: 5714.5, 300 sec: 5677.8). Total num frames: 90460160. Throughput: 0: 5044.3. Samples: 90453862. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:17,144][25689] Avg episode reward: [(0, '-53.686')] [2022-07-09 04:30:18,482][26022] Updated weights on worker 0-0, policy_version 88348 (0.00091) [2022-07-09 04:30:20,257][26022] Updated weights on worker 0-0, policy_version 88358 (0.00091) [2022-07-09 04:30:21,887][26022] Updated weights on worker 0-0, policy_version 88368 (0.00088) [2022-07-09 04:30:22,235][25689] Fps is (10 sec: 5632.1, 60 sec: 5674.3, 300 sec: 5676.1). Total num frames: 90488832. Throughput: 0: 5875.7. Samples: 90488206. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:22,235][25689] Avg episode reward: [(0, '-53.231')] [2022-07-09 04:30:23,843][26022] Updated weights on worker 0-0, policy_version 88378 (0.00087) [2022-07-09 04:30:25,696][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:30:25,708][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000088388_90509312.pth [2022-07-09 04:30:25,708][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000086390_88463360.pth [2022-07-09 04:30:25,711][26022] Updated weights on worker 0-0, policy_version 88388 (0.00088) [2022-07-09 04:30:27,251][25689] Fps is (10 sec: 5673.4, 60 sec: 5709.6, 300 sec: 5682.9). Total num frames: 90517504. Throughput: 0: 5959.2. Samples: 90522320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:27,251][25689] Avg episode reward: [(0, '-53.166')] [2022-07-09 04:30:27,558][26022] Updated weights on worker 0-0, policy_version 88398 (0.00093) [2022-07-09 04:30:29,188][26022] Updated weights on worker 0-0, policy_version 88408 (0.00087) [2022-07-09 04:30:31,314][26022] Updated weights on worker 0-0, policy_version 88418 (0.00089) [2022-07-09 04:30:32,261][25689] Fps is (10 sec: 5821.5, 60 sec: 5711.5, 300 sec: 5678.0). Total num frames: 90547200. Throughput: 0: 5106.9. Samples: 90539722. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 04:30:32,262][25689] Avg episode reward: [(0, '-52.606')] [2022-07-09 04:30:32,702][26022] Updated weights on worker 0-0, policy_version 88428 (0.00088) [2022-07-09 04:30:34,655][26022] Updated weights on worker 0-0, policy_version 88438 (0.00896) [2022-07-09 04:30:36,331][26022] Updated weights on worker 0-0, policy_version 88448 (0.00089) [2022-07-09 04:30:37,345][25689] Fps is (10 sec: 5680.8, 60 sec: 5693.6, 300 sec: 5677.2). Total num frames: 90574848. Throughput: 0: 5949.4. Samples: 90574052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:30:37,346][25689] Avg episode reward: [(0, '-52.922')] [2022-07-09 04:30:38,433][26022] Updated weights on worker 0-0, policy_version 88458 (0.00081) [2022-07-09 04:30:39,861][26022] Updated weights on worker 0-0, policy_version 88468 (0.00091) [2022-07-09 04:30:41,677][26022] Updated weights on worker 0-0, policy_version 88478 (0.00087) [2022-07-09 04:30:42,360][25689] Fps is (10 sec: 5678.2, 60 sec: 5692.6, 300 sec: 5681.0). Total num frames: 90604544. Throughput: 0: 5982.0. Samples: 90608596. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:30:42,360][25689] Avg episode reward: [(0, '-53.156')] [2022-07-09 04:30:43,660][26022] Updated weights on worker 0-0, policy_version 88488 (0.00094) [2022-07-09 04:30:45,154][26022] Updated weights on worker 0-0, policy_version 88498 (0.00086) [2022-07-09 04:30:47,247][26022] Updated weights on worker 0-0, policy_version 88508 (0.00092) [2022-07-09 04:30:47,368][25689] Fps is (10 sec: 5721.1, 60 sec: 5693.7, 300 sec: 5681.3). Total num frames: 90632192. Throughput: 0: 5144.8. Samples: 90625820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:30:47,369][25689] Avg episode reward: [(0, '-52.912')] [2022-07-09 04:30:48,810][26022] Updated weights on worker 0-0, policy_version 88518 (0.00109) [2022-07-09 04:30:50,760][26022] Updated weights on worker 0-0, policy_version 88528 (0.00087) [2022-07-09 04:30:52,299][26022] Updated weights on worker 0-0, policy_version 88538 (0.00088) [2022-07-09 04:30:52,399][25689] Fps is (10 sec: 5814.3, 60 sec: 5708.2, 300 sec: 5682.9). Total num frames: 90662912. Throughput: 0: 5995.9. Samples: 90660466. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:30:52,399][25689] Avg episode reward: [(0, '-52.765')] [2022-07-09 04:30:54,185][26022] Updated weights on worker 0-0, policy_version 88548 (0.00087) [2022-07-09 04:30:56,244][26022] Updated weights on worker 0-0, policy_version 88558 (0.00089) [2022-07-09 04:30:57,532][25689] Fps is (10 sec: 5843.5, 60 sec: 5718.3, 300 sec: 5684.2). Total num frames: 90691584. Throughput: 0: 5952.4. Samples: 90694214. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:30:57,534][25689] Avg episode reward: [(0, '-53.499')] [2022-07-09 04:30:57,896][26022] Updated weights on worker 0-0, policy_version 88568 (0.00100) [2022-07-09 04:30:59,772][26022] Updated weights on worker 0-0, policy_version 88578 (0.00085) [2022-07-09 04:31:01,698][26022] Updated weights on worker 0-0, policy_version 88588 (0.00086) [2022-07-09 04:31:02,562][25689] Fps is (10 sec: 5440.6, 60 sec: 5683.2, 300 sec: 5680.8). Total num frames: 90718208. Throughput: 0: 5853.0. Samples: 90726840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:02,563][25689] Avg episode reward: [(0, '-54.591')] [2022-07-09 04:31:03,575][26022] Updated weights on worker 0-0, policy_version 88598 (0.00089) [2022-07-09 04:31:05,331][26022] Updated weights on worker 0-0, policy_version 88608 (0.00096) [2022-07-09 04:31:06,996][26022] Updated weights on worker 0-0, policy_version 88618 (0.00091) [2022-07-09 04:31:07,600][25689] Fps is (10 sec: 5492.3, 60 sec: 5698.2, 300 sec: 5683.7). Total num frames: 90746880. Throughput: 0: 5853.8. Samples: 90744252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:07,600][25689] Avg episode reward: [(0, '-54.600')] [2022-07-09 04:31:08,909][26022] Updated weights on worker 0-0, policy_version 88628 (0.00086) [2022-07-09 04:31:10,779][26022] Updated weights on worker 0-0, policy_version 88638 (0.00082) [2022-07-09 04:31:12,432][26022] Updated weights on worker 0-0, policy_version 88648 (0.00082) [2022-07-09 04:31:12,627][25689] Fps is (10 sec: 5901.0, 60 sec: 5713.9, 300 sec: 5688.2). Total num frames: 90777600. Throughput: 0: 5843.1. Samples: 90778662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:12,627][25689] Avg episode reward: [(0, '-54.062')] [2022-07-09 04:31:14,431][26022] Updated weights on worker 0-0, policy_version 88658 (0.00087) [2022-07-09 04:31:15,859][26022] Updated weights on worker 0-0, policy_version 88668 (0.00087) [2022-07-09 04:31:17,680][25689] Fps is (10 sec: 5689.0, 60 sec: 5683.6, 300 sec: 5677.2). Total num frames: 90804224. Throughput: 0: 5914.2. Samples: 90813372. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:17,680][25689] Avg episode reward: [(0, '-54.424')] [2022-07-09 04:31:17,947][26022] Updated weights on worker 0-0, policy_version 88678 (0.00080) [2022-07-09 04:31:19,520][26022] Updated weights on worker 0-0, policy_version 88688 (0.00088) [2022-07-09 04:31:21,339][26022] Updated weights on worker 0-0, policy_version 88698 (0.00089) [2022-07-09 04:31:22,709][25689] Fps is (10 sec: 5586.4, 60 sec: 5706.4, 300 sec: 5687.2). Total num frames: 90833920. Throughput: 0: 5151.6. Samples: 90830626. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:22,709][25689] Avg episode reward: [(0, '-53.196')] [2022-07-09 04:31:23,232][26022] Updated weights on worker 0-0, policy_version 88708 (0.00088) [2022-07-09 04:31:24,839][26022] Updated weights on worker 0-0, policy_version 88718 (0.00086) [2022-07-09 04:31:26,812][26022] Updated weights on worker 0-0, policy_version 88728 (0.00091) [2022-07-09 04:31:27,716][25689] Fps is (10 sec: 5815.7, 60 sec: 5707.2, 300 sec: 5683.7). Total num frames: 90862592. Throughput: 0: 5992.5. Samples: 90864798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:27,717][25689] Avg episode reward: [(0, '-52.626')] [2022-07-09 04:31:28,702][26022] Updated weights on worker 0-0, policy_version 88738 (0.00085) [2022-07-09 04:31:30,295][26022] Updated weights on worker 0-0, policy_version 88748 (0.00083) [2022-07-09 04:31:32,226][26022] Updated weights on worker 0-0, policy_version 88758 (0.00090) [2022-07-09 04:31:32,721][25689] Fps is (10 sec: 5727.3, 60 sec: 5690.8, 300 sec: 5684.9). Total num frames: 90891264. Throughput: 0: 5998.6. Samples: 90899198. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:32,721][25689] Avg episode reward: [(0, '-52.384')] [2022-07-09 04:31:33,930][26022] Updated weights on worker 0-0, policy_version 88768 (0.00098) [2022-07-09 04:31:35,764][26022] Updated weights on worker 0-0, policy_version 88778 (0.00096) [2022-07-09 04:31:37,616][26022] Updated weights on worker 0-0, policy_version 88788 (0.00087) [2022-07-09 04:31:37,839][25689] Fps is (10 sec: 5766.1, 60 sec: 5721.5, 300 sec: 5686.3). Total num frames: 90920960. Throughput: 0: 5105.8. Samples: 90916300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:37,839][25689] Avg episode reward: [(0, '-52.350')] [2022-07-09 04:31:39,342][26022] Updated weights on worker 0-0, policy_version 88798 (0.00088) [2022-07-09 04:31:41,174][26022] Updated weights on worker 0-0, policy_version 88808 (0.00849) [2022-07-09 04:31:42,867][25689] Fps is (10 sec: 5652.0, 60 sec: 5686.4, 300 sec: 5679.0). Total num frames: 90948608. Throughput: 0: 5959.0. Samples: 90950748. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:42,867][25689] Avg episode reward: [(0, '-52.672')] [2022-07-09 04:31:42,900][26022] Updated weights on worker 0-0, policy_version 88818 (0.00084) [2022-07-09 04:31:44,574][26022] Updated weights on worker 0-0, policy_version 88828 (0.00080) [2022-07-09 04:31:46,403][26022] Updated weights on worker 0-0, policy_version 88838 (0.00080) [2022-07-09 04:31:47,915][25689] Fps is (10 sec: 5691.1, 60 sec: 5716.5, 300 sec: 5685.3). Total num frames: 90978304. Throughput: 0: 5980.6. Samples: 90985598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:47,915][25689] Avg episode reward: [(0, '-52.808')] [2022-07-09 04:31:48,111][26022] Updated weights on worker 0-0, policy_version 88848 (0.00096) [2022-07-09 04:31:49,874][26022] Updated weights on worker 0-0, policy_version 88858 (0.00083) [2022-07-09 04:31:51,913][26022] Updated weights on worker 0-0, policy_version 88868 (0.00086) [2022-07-09 04:31:52,931][25689] Fps is (10 sec: 5799.7, 60 sec: 5684.0, 300 sec: 5683.9). Total num frames: 91006976. Throughput: 0: 5133.3. Samples: 91002940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:52,933][25689] Avg episode reward: [(0, '-52.934')] [2022-07-09 04:31:53,424][26022] Updated weights on worker 0-0, policy_version 88878 (0.00082) [2022-07-09 04:31:55,395][26022] Updated weights on worker 0-0, policy_version 88888 (0.00090) [2022-07-09 04:31:57,059][26022] Updated weights on worker 0-0, policy_version 88898 (0.00093) [2022-07-09 04:31:58,052][25689] Fps is (10 sec: 5657.0, 60 sec: 5685.2, 300 sec: 5685.4). Total num frames: 91035648. Throughput: 0: 5964.2. Samples: 91036854. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:31:58,053][25689] Avg episode reward: [(0, '-52.725')] [2022-07-09 04:31:59,027][26022] Updated weights on worker 0-0, policy_version 88908 (0.00092) [2022-07-09 04:32:00,663][26022] Updated weights on worker 0-0, policy_version 88918 (0.00092) [2022-07-09 04:32:02,854][26022] Updated weights on worker 0-0, policy_version 88928 (0.00087) [2022-07-09 04:32:03,098][25689] Fps is (10 sec: 5539.2, 60 sec: 5700.5, 300 sec: 5684.9). Total num frames: 91063296. Throughput: 0: 5863.4. Samples: 91069374. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:32:03,100][25689] Avg episode reward: [(0, '-52.785')] [2022-07-09 04:32:04,607][26022] Updated weights on worker 0-0, policy_version 88938 (0.00448) [2022-07-09 04:32:06,401][26022] Updated weights on worker 0-0, policy_version 88948 (0.00082) [2022-07-09 04:32:08,109][25689] Fps is (10 sec: 5701.5, 60 sec: 5720.0, 300 sec: 5689.5). Total num frames: 91092992. Throughput: 0: 5010.0. Samples: 91086772. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:32:08,110][25689] Avg episode reward: [(0, '-52.385')] [2022-07-09 04:32:08,113][26022] Updated weights on worker 0-0, policy_version 88958 (0.00084) [2022-07-09 04:32:09,912][26022] Updated weights on worker 0-0, policy_version 88968 (0.00087) [2022-07-09 04:32:11,635][26022] Updated weights on worker 0-0, policy_version 88978 (0.00083) [2022-07-09 04:32:13,186][25689] Fps is (10 sec: 5684.5, 60 sec: 5664.5, 300 sec: 5686.1). Total num frames: 91120640. Throughput: 0: 5850.0. Samples: 91121434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:32:13,188][25689] Avg episode reward: [(0, '-52.695')] [2022-07-09 04:32:13,737][26022] Updated weights on worker 0-0, policy_version 88988 (0.01086) [2022-07-09 04:32:15,259][26022] Updated weights on worker 0-0, policy_version 88998 (0.00090) [2022-07-09 04:32:17,238][26022] Updated weights on worker 0-0, policy_version 89008 (0.00087) [2022-07-09 04:32:18,276][25689] Fps is (10 sec: 5640.6, 60 sec: 5711.8, 300 sec: 5688.1). Total num frames: 91150336. Throughput: 0: 5889.8. Samples: 91155970. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 04:32:18,276][25689] Avg episode reward: [(0, '-52.514')] [2022-07-09 04:32:18,767][26022] Updated weights on worker 0-0, policy_version 89018 (0.00094) [2022-07-09 04:32:20,708][26022] Updated weights on worker 0-0, policy_version 89028 (0.00090) [2022-07-09 04:32:22,377][26022] Updated weights on worker 0-0, policy_version 89038 (0.00085) [2022-07-09 04:32:23,295][25689] Fps is (10 sec: 5672.4, 60 sec: 5678.9, 300 sec: 5686.2). Total num frames: 91177984. Throughput: 0: 5135.4. Samples: 91173094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:23,296][25689] Avg episode reward: [(0, '-52.329')] [2022-07-09 04:32:24,147][26022] Updated weights on worker 0-0, policy_version 89048 (0.00083) [2022-07-09 04:32:25,819][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:32:25,840][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000089057_91194368.pth [2022-07-09 04:32:25,840][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000087055_89144320.pth [2022-07-09 04:32:26,170][26022] Updated weights on worker 0-0, policy_version 89058 (0.00314) [2022-07-09 04:32:27,793][26022] Updated weights on worker 0-0, policy_version 89068 (0.00084) [2022-07-09 04:32:28,311][25689] Fps is (10 sec: 5612.2, 60 sec: 5678.1, 300 sec: 5689.9). Total num frames: 91206656. Throughput: 0: 5973.3. Samples: 91207442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:28,311][25689] Avg episode reward: [(0, '-52.189')] [2022-07-09 04:32:29,542][26022] Updated weights on worker 0-0, policy_version 89078 (0.00089) [2022-07-09 04:32:31,515][26022] Updated weights on worker 0-0, policy_version 89088 (0.00092) [2022-07-09 04:32:33,286][26022] Updated weights on worker 0-0, policy_version 89098 (0.00095) [2022-07-09 04:32:33,325][25689] Fps is (10 sec: 5819.2, 60 sec: 5694.1, 300 sec: 5688.6). Total num frames: 91236352. Throughput: 0: 5974.8. Samples: 91241764. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:33,326][25689] Avg episode reward: [(0, '-52.251')] [2022-07-09 04:32:35,149][26022] Updated weights on worker 0-0, policy_version 89108 (0.00082) [2022-07-09 04:32:36,767][26022] Updated weights on worker 0-0, policy_version 89118 (0.00081) [2022-07-09 04:32:38,465][25689] Fps is (10 sec: 5748.0, 60 sec: 5675.1, 300 sec: 5690.2). Total num frames: 91265024. Throughput: 0: 5092.2. Samples: 91258780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:38,466][25689] Avg episode reward: [(0, '-52.445')] [2022-07-09 04:32:38,602][26022] Updated weights on worker 0-0, policy_version 89128 (0.00089) [2022-07-09 04:32:40,506][26022] Updated weights on worker 0-0, policy_version 89138 (0.00096) [2022-07-09 04:32:42,190][26022] Updated weights on worker 0-0, policy_version 89148 (0.00085) [2022-07-09 04:32:43,475][25689] Fps is (10 sec: 5649.7, 60 sec: 5693.7, 300 sec: 5687.0). Total num frames: 91293696. Throughput: 0: 5966.5. Samples: 91293498. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:43,476][25689] Avg episode reward: [(0, '-53.814')] [2022-07-09 04:32:43,906][26022] Updated weights on worker 0-0, policy_version 89158 (0.00089) [2022-07-09 04:32:45,786][26022] Updated weights on worker 0-0, policy_version 89168 (0.00086) [2022-07-09 04:32:47,331][26022] Updated weights on worker 0-0, policy_version 89178 (0.00083) [2022-07-09 04:32:48,479][25689] Fps is (10 sec: 5829.0, 60 sec: 5697.9, 300 sec: 5690.5). Total num frames: 91323392. Throughput: 0: 5979.1. Samples: 91328028. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:48,479][25689] Avg episode reward: [(0, '-53.544')] [2022-07-09 04:32:49,454][26022] Updated weights on worker 0-0, policy_version 89188 (0.00095) [2022-07-09 04:32:51,024][26022] Updated weights on worker 0-0, policy_version 89198 (0.00091) [2022-07-09 04:32:52,896][26022] Updated weights on worker 0-0, policy_version 89208 (0.00087) [2022-07-09 04:32:53,519][25689] Fps is (10 sec: 5913.5, 60 sec: 5712.5, 300 sec: 5694.1). Total num frames: 91353088. Throughput: 0: 5137.4. Samples: 91345506. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:53,519][25689] Avg episode reward: [(0, '-52.997')] [2022-07-09 04:32:54,718][26022] Updated weights on worker 0-0, policy_version 89218 (0.00091) [2022-07-09 04:32:56,463][26022] Updated weights on worker 0-0, policy_version 89228 (0.00090) [2022-07-09 04:32:58,254][26022] Updated weights on worker 0-0, policy_version 89238 (0.00082) [2022-07-09 04:32:58,670][25689] Fps is (10 sec: 5727.3, 60 sec: 5709.7, 300 sec: 5691.5). Total num frames: 91381760. Throughput: 0: 5982.7. Samples: 91379658. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:32:58,672][25689] Avg episode reward: [(0, '-52.708')] [2022-07-09 04:33:00,153][26022] Updated weights on worker 0-0, policy_version 89248 (0.00094) [2022-07-09 04:33:01,633][26022] Updated weights on worker 0-0, policy_version 89258 (0.00089) [2022-07-09 04:33:03,693][25689] Fps is (10 sec: 5334.1, 60 sec: 5678.1, 300 sec: 5691.4). Total num frames: 91407360. Throughput: 0: 5850.1. Samples: 91411776. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:03,694][25689] Avg episode reward: [(0, '-52.477')] [2022-07-09 04:33:03,982][26022] Updated weights on worker 0-0, policy_version 89268 (0.00075) [2022-07-09 04:33:05,924][26022] Updated weights on worker 0-0, policy_version 89278 (0.00089) [2022-07-09 04:33:07,628][26022] Updated weights on worker 0-0, policy_version 89288 (0.00054) [2022-07-09 04:33:08,759][25689] Fps is (10 sec: 5480.8, 60 sec: 5673.0, 300 sec: 5690.3). Total num frames: 91437056. Throughput: 0: 5801.2. Samples: 91445678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:08,760][25689] Avg episode reward: [(0, '-53.126')] [2022-07-09 04:33:09,639][26022] Updated weights on worker 0-0, policy_version 89298 (0.00083) [2022-07-09 04:33:11,152][26022] Updated weights on worker 0-0, policy_version 89308 (0.00091) [2022-07-09 04:33:13,153][26022] Updated weights on worker 0-0, policy_version 89318 (0.00204) [2022-07-09 04:33:13,775][25689] Fps is (10 sec: 5687.9, 60 sec: 5678.6, 300 sec: 5681.6). Total num frames: 91464704. Throughput: 0: 5789.4. Samples: 91462778. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:13,776][25689] Avg episode reward: [(0, '-52.057')] [2022-07-09 04:33:14,839][26022] Updated weights on worker 0-0, policy_version 89328 (0.00090) [2022-07-09 04:33:16,644][26022] Updated weights on worker 0-0, policy_version 89338 (0.00088) [2022-07-09 04:33:18,479][26022] Updated weights on worker 0-0, policy_version 89348 (0.00086) [2022-07-09 04:33:18,859][25689] Fps is (10 sec: 5576.4, 60 sec: 5662.3, 300 sec: 5687.3). Total num frames: 91493376. Throughput: 0: 5813.7. Samples: 91497028. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:18,859][25689] Avg episode reward: [(0, '-52.325')] [2022-07-09 04:33:20,263][26022] Updated weights on worker 0-0, policy_version 89358 (0.00082) [2022-07-09 04:33:22,116][26022] Updated weights on worker 0-0, policy_version 89368 (0.00094) [2022-07-09 04:33:23,894][25689] Fps is (10 sec: 5565.8, 60 sec: 5660.8, 300 sec: 5679.9). Total num frames: 91521024. Throughput: 0: 5903.7. Samples: 91531036. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:23,895][25689] Avg episode reward: [(0, '-53.547')] [2022-07-09 04:33:24,121][26022] Updated weights on worker 0-0, policy_version 89378 (0.00093) [2022-07-09 04:33:25,552][26022] Updated weights on worker 0-0, policy_version 89388 (0.00055) [2022-07-09 04:33:27,700][26022] Updated weights on worker 0-0, policy_version 89398 (0.00093) [2022-07-09 04:33:28,922][25689] Fps is (10 sec: 5799.9, 60 sec: 5693.5, 300 sec: 5687.5). Total num frames: 91551744. Throughput: 0: 5068.3. Samples: 91547870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:28,923][25689] Avg episode reward: [(0, '-53.577')] [2022-07-09 04:33:29,343][26022] Updated weights on worker 0-0, policy_version 89408 (0.00093) [2022-07-09 04:33:31,202][26022] Updated weights on worker 0-0, policy_version 89418 (0.00089) [2022-07-09 04:33:33,168][26022] Updated weights on worker 0-0, policy_version 89428 (0.00087) [2022-07-09 04:33:33,938][25689] Fps is (10 sec: 5811.5, 60 sec: 5659.6, 300 sec: 5685.8). Total num frames: 91579392. Throughput: 0: 5913.4. Samples: 91582006. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:33,938][25689] Avg episode reward: [(0, '-53.375')] [2022-07-09 04:33:34,822][26022] Updated weights on worker 0-0, policy_version 89438 (0.00092) [2022-07-09 04:33:36,643][26022] Updated weights on worker 0-0, policy_version 89448 (0.00074) [2022-07-09 04:33:38,180][26022] Updated weights on worker 0-0, policy_version 89458 (0.00087) [2022-07-09 04:33:39,021][25689] Fps is (10 sec: 5678.3, 60 sec: 5681.8, 300 sec: 5684.4). Total num frames: 91609088. Throughput: 0: 5912.8. Samples: 91616244. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:39,021][25689] Avg episode reward: [(0, '-52.966')] [2022-07-09 04:33:40,110][26022] Updated weights on worker 0-0, policy_version 89468 (0.00101) [2022-07-09 04:33:41,940][26022] Updated weights on worker 0-0, policy_version 89478 (0.00091) [2022-07-09 04:33:43,838][26022] Updated weights on worker 0-0, policy_version 89488 (0.00087) [2022-07-09 04:33:44,051][25689] Fps is (10 sec: 5670.2, 60 sec: 5663.0, 300 sec: 5683.9). Total num frames: 91636736. Throughput: 0: 5080.2. Samples: 91633436. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:44,052][25689] Avg episode reward: [(0, '-53.466')] [2022-07-09 04:33:45,489][26022] Updated weights on worker 0-0, policy_version 89498 (0.00088) [2022-07-09 04:33:47,411][26022] Updated weights on worker 0-0, policy_version 89508 (0.00093) [2022-07-09 04:33:49,072][25689] Fps is (10 sec: 5705.5, 60 sec: 5661.4, 300 sec: 5687.4). Total num frames: 91666432. Throughput: 0: 5960.2. Samples: 91667966. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:49,072][25689] Avg episode reward: [(0, '-54.082')] [2022-07-09 04:33:49,079][26022] Updated weights on worker 0-0, policy_version 89518 (0.00090) [2022-07-09 04:33:51,059][26022] Updated weights on worker 0-0, policy_version 89528 (0.00086) [2022-07-09 04:33:52,799][26022] Updated weights on worker 0-0, policy_version 89538 (0.00061) [2022-07-09 04:33:54,136][25689] Fps is (10 sec: 5686.2, 60 sec: 5625.4, 300 sec: 5685.1). Total num frames: 91694080. Throughput: 0: 5947.5. Samples: 91702134. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:54,136][25689] Avg episode reward: [(0, '-53.686')] [2022-07-09 04:33:54,654][26022] Updated weights on worker 0-0, policy_version 89548 (0.00088) [2022-07-09 04:33:56,475][26022] Updated weights on worker 0-0, policy_version 89558 (0.00089) [2022-07-09 04:33:58,119][26022] Updated weights on worker 0-0, policy_version 89568 (0.00090) [2022-07-09 04:33:59,279][25689] Fps is (10 sec: 5517.6, 60 sec: 5626.1, 300 sec: 5682.6). Total num frames: 91722752. Throughput: 0: 5072.9. Samples: 91719010. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:33:59,280][25689] Avg episode reward: [(0, '-53.828')] [2022-07-09 04:33:59,892][26022] Updated weights on worker 0-0, policy_version 89578 (0.00087) [2022-07-09 04:34:01,883][26022] Updated weights on worker 0-0, policy_version 89588 (0.00075) [2022-07-09 04:34:04,012][26022] Updated weights on worker 0-0, policy_version 89598 (0.00084) [2022-07-09 04:34:04,293][25689] Fps is (10 sec: 5544.8, 60 sec: 5660.8, 300 sec: 5683.2). Total num frames: 91750400. Throughput: 0: 5814.8. Samples: 91751138. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 04:34:04,294][25689] Avg episode reward: [(0, '-54.270')] [2022-07-09 04:34:05,769][26022] Updated weights on worker 0-0, policy_version 89608 (0.00085) [2022-07-09 04:34:07,456][26022] Updated weights on worker 0-0, policy_version 89618 (0.00090) [2022-07-09 04:34:09,259][26022] Updated weights on worker 0-0, policy_version 89628 (0.00090) [2022-07-09 04:34:09,386][25689] Fps is (10 sec: 5572.4, 60 sec: 5641.3, 300 sec: 5685.2). Total num frames: 91779072. Throughput: 0: 5792.4. Samples: 91785634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:09,387][25689] Avg episode reward: [(0, '-53.753')] [2022-07-09 04:34:10,975][26022] Updated weights on worker 0-0, policy_version 89638 (0.00091) [2022-07-09 04:34:12,939][26022] Updated weights on worker 0-0, policy_version 89648 (0.00090) [2022-07-09 04:34:14,472][25689] Fps is (10 sec: 5734.3, 60 sec: 5668.6, 300 sec: 5689.0). Total num frames: 91808768. Throughput: 0: 4953.6. Samples: 91802870. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:14,472][25689] Avg episode reward: [(0, '-53.645')] [2022-07-09 04:34:14,655][26022] Updated weights on worker 0-0, policy_version 89658 (0.00089) [2022-07-09 04:34:16,424][26022] Updated weights on worker 0-0, policy_version 89668 (0.00091) [2022-07-09 04:34:18,175][26022] Updated weights on worker 0-0, policy_version 89678 (0.01020) [2022-07-09 04:34:19,559][25689] Fps is (10 sec: 5838.4, 60 sec: 5685.1, 300 sec: 5684.3). Total num frames: 91838464. Throughput: 0: 5845.4. Samples: 91837552. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:19,559][25689] Avg episode reward: [(0, '-53.379')] [2022-07-09 04:34:19,916][26022] Updated weights on worker 0-0, policy_version 89688 (0.00090) [2022-07-09 04:34:21,742][26022] Updated weights on worker 0-0, policy_version 89698 (0.00091) [2022-07-09 04:34:23,492][26022] Updated weights on worker 0-0, policy_version 89708 (0.00103) [2022-07-09 04:34:24,565][25689] Fps is (10 sec: 5681.2, 60 sec: 5687.8, 300 sec: 5688.2). Total num frames: 91866112. Throughput: 0: 5970.3. Samples: 91872170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:24,566][25689] Avg episode reward: [(0, '-53.340')] [2022-07-09 04:34:25,276][26022] Updated weights on worker 0-0, policy_version 89718 (0.00214) [2022-07-09 04:34:25,878][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:34:25,886][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000089722_91875328.pth [2022-07-09 04:34:25,887][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000087722_89827328.pth [2022-07-09 04:34:27,294][26022] Updated weights on worker 0-0, policy_version 89728 (0.00085) [2022-07-09 04:34:28,602][26022] Updated weights on worker 0-0, policy_version 89738 (0.00089) [2022-07-09 04:34:29,576][25689] Fps is (10 sec: 5724.6, 60 sec: 5672.6, 300 sec: 5688.6). Total num frames: 91895808. Throughput: 0: 5136.9. Samples: 91889346. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:29,576][25689] Avg episode reward: [(0, '-52.465')] [2022-07-09 04:34:30,899][26022] Updated weights on worker 0-0, policy_version 89748 (0.00082) [2022-07-09 04:34:32,115][26022] Updated weights on worker 0-0, policy_version 89758 (0.00094) [2022-07-09 04:34:34,247][26022] Updated weights on worker 0-0, policy_version 89768 (0.00086) [2022-07-09 04:34:34,587][25689] Fps is (10 sec: 5926.6, 60 sec: 5706.8, 300 sec: 5693.2). Total num frames: 91925504. Throughput: 0: 6022.6. Samples: 91924014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:34,587][25689] Avg episode reward: [(0, '-52.442')] [2022-07-09 04:34:35,964][26022] Updated weights on worker 0-0, policy_version 89778 (0.00087) [2022-07-09 04:34:37,688][26022] Updated weights on worker 0-0, policy_version 89788 (0.00083) [2022-07-09 04:34:39,494][26022] Updated weights on worker 0-0, policy_version 89798 (0.00079) [2022-07-09 04:34:39,689][25689] Fps is (10 sec: 5670.0, 60 sec: 5671.2, 300 sec: 5684.5). Total num frames: 91953152. Throughput: 0: 6017.3. Samples: 91958684. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:39,690][25689] Avg episode reward: [(0, '-52.302')] [2022-07-09 04:34:41,167][26022] Updated weights on worker 0-0, policy_version 89808 (0.00603) [2022-07-09 04:34:42,968][26022] Updated weights on worker 0-0, policy_version 89818 (0.00089) [2022-07-09 04:34:44,694][25689] Fps is (10 sec: 5673.5, 60 sec: 5707.4, 300 sec: 5691.7). Total num frames: 91982848. Throughput: 0: 5159.1. Samples: 91976016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:44,696][25689] Avg episode reward: [(0, '-51.823')] [2022-07-09 04:34:45,027][26022] Updated weights on worker 0-0, policy_version 89828 (0.00091) [2022-07-09 04:34:46,515][26022] Updated weights on worker 0-0, policy_version 89838 (0.00093) [2022-07-09 04:34:48,580][26022] Updated weights on worker 0-0, policy_version 89848 (0.00092) [2022-07-09 04:34:49,703][25689] Fps is (10 sec: 5829.0, 60 sec: 5691.6, 300 sec: 5688.1). Total num frames: 92011520. Throughput: 0: 6025.1. Samples: 92010614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:49,704][25689] Avg episode reward: [(0, '-51.285')] [2022-07-09 04:34:50,030][26022] Updated weights on worker 0-0, policy_version 89858 (0.00090) [2022-07-09 04:34:52,026][26022] Updated weights on worker 0-0, policy_version 89868 (0.00085) [2022-07-09 04:34:53,688][26022] Updated weights on worker 0-0, policy_version 89878 (0.00088) [2022-07-09 04:34:54,711][25689] Fps is (10 sec: 5724.8, 60 sec: 5713.8, 300 sec: 5692.6). Total num frames: 92040192. Throughput: 0: 6013.1. Samples: 92045022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:54,712][25689] Avg episode reward: [(0, '-51.408')] [2022-07-09 04:34:55,625][26022] Updated weights on worker 0-0, policy_version 89888 (0.00085) [2022-07-09 04:34:57,286][26022] Updated weights on worker 0-0, policy_version 89898 (0.00088) [2022-07-09 04:34:59,270][26022] Updated weights on worker 0-0, policy_version 89908 (0.00095) [2022-07-09 04:34:59,840][25689] Fps is (10 sec: 5656.8, 60 sec: 5715.1, 300 sec: 5690.5). Total num frames: 92068864. Throughput: 0: 5136.7. Samples: 92062192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:34:59,840][25689] Avg episode reward: [(0, '-50.621')] [2022-07-09 04:35:00,998][26022] Updated weights on worker 0-0, policy_version 89918 (0.00084) [2022-07-09 04:35:03,130][26022] Updated weights on worker 0-0, policy_version 89928 (0.00091) [2022-07-09 04:35:04,869][25689] Fps is (10 sec: 5443.6, 60 sec: 5696.8, 300 sec: 5686.8). Total num frames: 92095488. Throughput: 0: 5846.9. Samples: 92093976. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:04,869][25689] Avg episode reward: [(0, '-50.870')] [2022-07-09 04:35:04,996][26022] Updated weights on worker 0-0, policy_version 89938 (0.00088) [2022-07-09 04:35:06,664][26022] Updated weights on worker 0-0, policy_version 89948 (0.00083) [2022-07-09 04:35:08,365][26022] Updated weights on worker 0-0, policy_version 89958 (0.00093) [2022-07-09 04:35:09,870][25689] Fps is (10 sec: 5411.0, 60 sec: 5688.5, 300 sec: 5680.2). Total num frames: 92123136. Throughput: 0: 5832.1. Samples: 92128230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:09,871][25689] Avg episode reward: [(0, '-51.135')] [2022-07-09 04:35:10,513][26022] Updated weights on worker 0-0, policy_version 89968 (0.00088) [2022-07-09 04:35:12,016][26022] Updated weights on worker 0-0, policy_version 89978 (0.01188) [2022-07-09 04:35:13,896][26022] Updated weights on worker 0-0, policy_version 89988 (0.00084) [2022-07-09 04:35:14,919][25689] Fps is (10 sec: 5705.9, 60 sec: 5692.0, 300 sec: 5684.4). Total num frames: 92152832. Throughput: 0: 5813.3. Samples: 92162496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:14,919][25689] Avg episode reward: [(0, '-51.128')] [2022-07-09 04:35:15,670][26022] Updated weights on worker 0-0, policy_version 89998 (0.00090) [2022-07-09 04:35:17,435][26022] Updated weights on worker 0-0, policy_version 90008 (0.00091) [2022-07-09 04:35:19,251][26022] Updated weights on worker 0-0, policy_version 90018 (0.00083) [2022-07-09 04:35:19,983][25689] Fps is (10 sec: 5771.2, 60 sec: 5677.1, 300 sec: 5684.9). Total num frames: 92181504. Throughput: 0: 5834.5. Samples: 92179720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:19,984][25689] Avg episode reward: [(0, '-50.796')] [2022-07-09 04:35:20,930][26022] Updated weights on worker 0-0, policy_version 90028 (0.00083) [2022-07-09 04:35:22,911][26022] Updated weights on worker 0-0, policy_version 90038 (0.00087) [2022-07-09 04:35:24,608][26022] Updated weights on worker 0-0, policy_version 90048 (0.00090) [2022-07-09 04:35:25,002][25689] Fps is (10 sec: 5788.6, 60 sec: 5709.9, 300 sec: 5688.3). Total num frames: 92211200. Throughput: 0: 5976.0. Samples: 92214292. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:25,003][25689] Avg episode reward: [(0, '-51.003')] [2022-07-09 04:35:26,668][26022] Updated weights on worker 0-0, policy_version 90058 (0.00089) [2022-07-09 04:35:28,260][26022] Updated weights on worker 0-0, policy_version 90068 (0.00087) [2022-07-09 04:35:30,022][25689] Fps is (10 sec: 5610.6, 60 sec: 5658.2, 300 sec: 5677.8). Total num frames: 92237824. Throughput: 0: 5943.9. Samples: 92248010. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:30,022][25689] Avg episode reward: [(0, '-51.420')] [2022-07-09 04:35:30,267][26022] Updated weights on worker 0-0, policy_version 90078 (0.00093) [2022-07-09 04:35:31,992][26022] Updated weights on worker 0-0, policy_version 90088 (0.00089) [2022-07-09 04:35:33,781][26022] Updated weights on worker 0-0, policy_version 90098 (0.00089) [2022-07-09 04:35:35,032][25689] Fps is (10 sec: 5513.3, 60 sec: 5641.4, 300 sec: 5682.6). Total num frames: 92266496. Throughput: 0: 5107.5. Samples: 92265222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:35,032][25689] Avg episode reward: [(0, '-52.488')] [2022-07-09 04:35:35,545][26022] Updated weights on worker 0-0, policy_version 90108 (0.00115) [2022-07-09 04:35:37,546][26022] Updated weights on worker 0-0, policy_version 90118 (0.00059) [2022-07-09 04:35:39,034][26022] Updated weights on worker 0-0, policy_version 90128 (0.00081) [2022-07-09 04:35:40,145][25689] Fps is (10 sec: 5765.4, 60 sec: 5674.2, 300 sec: 5680.8). Total num frames: 92296192. Throughput: 0: 5930.9. Samples: 92299298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:40,146][25689] Avg episode reward: [(0, '-52.912')] [2022-07-09 04:35:41,070][26022] Updated weights on worker 0-0, policy_version 90138 (0.00090) [2022-07-09 04:35:42,649][26022] Updated weights on worker 0-0, policy_version 90148 (0.00086) [2022-07-09 04:35:44,598][26022] Updated weights on worker 0-0, policy_version 90158 (0.00088) [2022-07-09 04:35:45,157][25689] Fps is (10 sec: 5764.4, 60 sec: 5656.6, 300 sec: 5684.1). Total num frames: 92324864. Throughput: 0: 5946.1. Samples: 92334138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:45,163][25689] Avg episode reward: [(0, '-53.223')] [2022-07-09 04:35:46,247][26022] Updated weights on worker 0-0, policy_version 90168 (0.00090) [2022-07-09 04:35:48,000][26022] Updated weights on worker 0-0, policy_version 90178 (0.00091) [2022-07-09 04:35:49,719][26022] Updated weights on worker 0-0, policy_version 90188 (0.00093) [2022-07-09 04:35:50,191][25689] Fps is (10 sec: 5810.4, 60 sec: 5671.2, 300 sec: 5680.6). Total num frames: 92354560. Throughput: 0: 5121.6. Samples: 92351308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:50,192][25689] Avg episode reward: [(0, '-52.626')] [2022-07-09 04:35:51,700][26022] Updated weights on worker 0-0, policy_version 90198 (0.00090) [2022-07-09 04:35:53,313][26022] Updated weights on worker 0-0, policy_version 90208 (0.00090) [2022-07-09 04:35:55,232][25689] Fps is (10 sec: 5692.0, 60 sec: 5651.2, 300 sec: 5679.0). Total num frames: 92382208. Throughput: 0: 5963.5. Samples: 92385686. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 04:35:55,239][25689] Avg episode reward: [(0, '-52.601')] [2022-07-09 04:35:55,298][26022] Updated weights on worker 0-0, policy_version 90218 (0.00094) [2022-07-09 04:35:56,987][26022] Updated weights on worker 0-0, policy_version 90228 (0.00090) [2022-07-09 04:35:58,770][26022] Updated weights on worker 0-0, policy_version 90238 (0.00096) [2022-07-09 04:36:00,333][25689] Fps is (10 sec: 5653.9, 60 sec: 5670.7, 300 sec: 5687.9). Total num frames: 92411904. Throughput: 0: 5979.9. Samples: 92420020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:00,334][25689] Avg episode reward: [(0, '-51.842')] [2022-07-09 04:36:00,623][26022] Updated weights on worker 0-0, policy_version 90248 (0.00088) [2022-07-09 04:36:02,678][26022] Updated weights on worker 0-0, policy_version 90258 (0.00096) [2022-07-09 04:36:04,551][26022] Updated weights on worker 0-0, policy_version 90268 (0.00098) [2022-07-09 04:36:05,350][25689] Fps is (10 sec: 5667.1, 60 sec: 5688.7, 300 sec: 5684.9). Total num frames: 92439552. Throughput: 0: 5000.1. Samples: 92435106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:05,351][25689] Avg episode reward: [(0, '-51.931')] [2022-07-09 04:36:06,512][26022] Updated weights on worker 0-0, policy_version 90278 (0.00088) [2022-07-09 04:36:07,894][26022] Updated weights on worker 0-0, policy_version 90288 (0.00093) [2022-07-09 04:36:10,105][26022] Updated weights on worker 0-0, policy_version 90298 (0.00087) [2022-07-09 04:36:10,359][25689] Fps is (10 sec: 5413.3, 60 sec: 5671.1, 300 sec: 5671.5). Total num frames: 92466176. Throughput: 0: 5868.0. Samples: 92469654. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:10,359][25689] Avg episode reward: [(0, '-52.574')] [2022-07-09 04:36:11,352][26022] Updated weights on worker 0-0, policy_version 90308 (0.00087) [2022-07-09 04:36:13,514][26022] Updated weights on worker 0-0, policy_version 90318 (0.00083) [2022-07-09 04:36:15,175][26022] Updated weights on worker 0-0, policy_version 90328 (0.00082) [2022-07-09 04:36:15,410][25689] Fps is (10 sec: 5598.5, 60 sec: 5670.9, 300 sec: 5681.8). Total num frames: 92495872. Throughput: 0: 5885.7. Samples: 92504450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:15,411][25689] Avg episode reward: [(0, '-52.137')] [2022-07-09 04:36:17,018][26022] Updated weights on worker 0-0, policy_version 90338 (0.00096) [2022-07-09 04:36:18,879][26022] Updated weights on worker 0-0, policy_version 90348 (0.00101) [2022-07-09 04:36:20,443][26022] Updated weights on worker 0-0, policy_version 90358 (0.00063) [2022-07-09 04:36:20,508][25689] Fps is (10 sec: 5952.7, 60 sec: 5701.6, 300 sec: 5683.9). Total num frames: 92526592. Throughput: 0: 5042.2. Samples: 92521746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:20,509][25689] Avg episode reward: [(0, '-52.886')] [2022-07-09 04:36:22,297][26022] Updated weights on worker 0-0, policy_version 90368 (0.00088) [2022-07-09 04:36:24,187][26022] Updated weights on worker 0-0, policy_version 90378 (0.00084) [2022-07-09 04:36:25,516][25689] Fps is (10 sec: 5877.0, 60 sec: 5685.7, 300 sec: 5683.9). Total num frames: 92555264. Throughput: 0: 5996.4. Samples: 92556028. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:25,519][25689] Avg episode reward: [(0, '-53.970')] [2022-07-09 04:36:25,900][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:36:25,918][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000090388_92557312.pth [2022-07-09 04:36:25,919][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000088388_90509312.pth [2022-07-09 04:36:25,923][26022] Updated weights on worker 0-0, policy_version 90388 (0.00095) [2022-07-09 04:36:27,693][26022] Updated weights on worker 0-0, policy_version 90398 (0.00082) [2022-07-09 04:36:29,403][26022] Updated weights on worker 0-0, policy_version 90408 (0.00083) [2022-07-09 04:36:30,543][25689] Fps is (10 sec: 5612.6, 60 sec: 5701.9, 300 sec: 5680.1). Total num frames: 92582912. Throughput: 0: 5997.3. Samples: 92590702. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:30,545][25689] Avg episode reward: [(0, '-54.133')] [2022-07-09 04:36:31,185][26022] Updated weights on worker 0-0, policy_version 90418 (0.00084) [2022-07-09 04:36:33,127][26022] Updated weights on worker 0-0, policy_version 90428 (0.00090) [2022-07-09 04:36:34,779][26022] Updated weights on worker 0-0, policy_version 90438 (0.00088) [2022-07-09 04:36:35,572][25689] Fps is (10 sec: 5702.4, 60 sec: 5717.0, 300 sec: 5681.8). Total num frames: 92612608. Throughput: 0: 5133.4. Samples: 92607950. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:35,573][25689] Avg episode reward: [(0, '-53.950')] [2022-07-09 04:36:36,694][26022] Updated weights on worker 0-0, policy_version 90448 (0.00088) [2022-07-09 04:36:38,398][26022] Updated weights on worker 0-0, policy_version 90458 (0.00509) [2022-07-09 04:36:40,039][26022] Updated weights on worker 0-0, policy_version 90468 (0.00091) [2022-07-09 04:36:40,601][25689] Fps is (10 sec: 5802.8, 60 sec: 5708.1, 300 sec: 5685.2). Total num frames: 92641280. Throughput: 0: 6016.9. Samples: 92642644. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:40,602][25689] Avg episode reward: [(0, '-52.959')] [2022-07-09 04:36:41,860][26022] Updated weights on worker 0-0, policy_version 90478 (0.00085) [2022-07-09 04:36:43,596][26022] Updated weights on worker 0-0, policy_version 90488 (0.00086) [2022-07-09 04:36:45,423][26022] Updated weights on worker 0-0, policy_version 90498 (0.00094) [2022-07-09 04:36:45,621][25689] Fps is (10 sec: 5808.4, 60 sec: 5724.3, 300 sec: 5685.7). Total num frames: 92670976. Throughput: 0: 6032.6. Samples: 92677314. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:45,622][25689] Avg episode reward: [(0, '-53.399')] [2022-07-09 04:36:47,293][26022] Updated weights on worker 0-0, policy_version 90508 (0.00090) [2022-07-09 04:36:48,884][26022] Updated weights on worker 0-0, policy_version 90518 (0.00092) [2022-07-09 04:36:50,639][25689] Fps is (10 sec: 5713.0, 60 sec: 5691.9, 300 sec: 5682.2). Total num frames: 92698624. Throughput: 0: 5179.6. Samples: 92694792. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:50,639][25689] Avg episode reward: [(0, '-53.077')] [2022-07-09 04:36:50,839][26022] Updated weights on worker 0-0, policy_version 90528 (0.00097) [2022-07-09 04:36:52,471][26022] Updated weights on worker 0-0, policy_version 90538 (0.00083) [2022-07-09 04:36:54,480][26022] Updated weights on worker 0-0, policy_version 90548 (0.00089) [2022-07-09 04:36:55,677][25689] Fps is (10 sec: 5702.3, 60 sec: 5726.0, 300 sec: 5687.2). Total num frames: 92728320. Throughput: 0: 6030.3. Samples: 92729190. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:36:55,678][25689] Avg episode reward: [(0, '-53.543')] [2022-07-09 04:36:56,050][26022] Updated weights on worker 0-0, policy_version 90558 (0.00097) [2022-07-09 04:36:57,891][26022] Updated weights on worker 0-0, policy_version 90568 (0.00098) [2022-07-09 04:36:59,735][26022] Updated weights on worker 0-0, policy_version 90578 (0.00094) [2022-07-09 04:37:00,767][25689] Fps is (10 sec: 5864.2, 60 sec: 5727.2, 300 sec: 5693.3). Total num frames: 92758016. Throughput: 0: 5989.2. Samples: 92763418. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:00,767][25689] Avg episode reward: [(0, '-52.386')] [2022-07-09 04:37:01,522][26022] Updated weights on worker 0-0, policy_version 90588 (0.00087) [2022-07-09 04:37:03,596][26022] Updated weights on worker 0-0, policy_version 90598 (0.00091) [2022-07-09 04:37:05,604][26022] Updated weights on worker 0-0, policy_version 90608 (0.00092) [2022-07-09 04:37:05,849][25689] Fps is (10 sec: 5335.7, 60 sec: 5670.2, 300 sec: 5674.8). Total num frames: 92782592. Throughput: 0: 4987.3. Samples: 92778196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:05,849][25689] Avg episode reward: [(0, '-53.563')] [2022-07-09 04:37:07,340][26022] Updated weights on worker 0-0, policy_version 90618 (0.00082) [2022-07-09 04:37:09,210][26022] Updated weights on worker 0-0, policy_version 90628 (0.00089) [2022-07-09 04:37:10,926][25689] Fps is (10 sec: 5443.1, 60 sec: 5731.5, 300 sec: 5685.1). Total num frames: 92813312. Throughput: 0: 5799.9. Samples: 92812454. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:10,926][25689] Avg episode reward: [(0, '-52.801')] [2022-07-09 04:37:10,931][26022] Updated weights on worker 0-0, policy_version 90638 (0.00072) [2022-07-09 04:37:12,747][26022] Updated weights on worker 0-0, policy_version 90648 (0.00096) [2022-07-09 04:37:14,498][26022] Updated weights on worker 0-0, policy_version 90658 (0.00087) [2022-07-09 04:37:15,951][25689] Fps is (10 sec: 5879.2, 60 sec: 5717.0, 300 sec: 5682.9). Total num frames: 92841984. Throughput: 0: 5814.0. Samples: 92847060. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:15,951][25689] Avg episode reward: [(0, '-51.895')] [2022-07-09 04:37:16,247][26022] Updated weights on worker 0-0, policy_version 90668 (0.00089) [2022-07-09 04:37:18,111][26022] Updated weights on worker 0-0, policy_version 90678 (0.00086) [2022-07-09 04:37:19,811][26022] Updated weights on worker 0-0, policy_version 90688 (0.00087) [2022-07-09 04:37:21,000][25689] Fps is (10 sec: 5692.2, 60 sec: 5687.8, 300 sec: 5685.7). Total num frames: 92870656. Throughput: 0: 4981.3. Samples: 92864206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:21,000][25689] Avg episode reward: [(0, '-51.980')] [2022-07-09 04:37:21,686][26022] Updated weights on worker 0-0, policy_version 90698 (0.00093) [2022-07-09 04:37:23,534][26022] Updated weights on worker 0-0, policy_version 90708 (0.00090) [2022-07-09 04:37:25,075][26022] Updated weights on worker 0-0, policy_version 90718 (0.00088) [2022-07-09 04:37:26,015][25689] Fps is (10 sec: 5799.6, 60 sec: 5704.0, 300 sec: 5689.2). Total num frames: 92900352. Throughput: 0: 5979.7. Samples: 92898784. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:26,016][25689] Avg episode reward: [(0, '-51.343')] [2022-07-09 04:37:27,106][26022] Updated weights on worker 0-0, policy_version 90728 (0.00093) [2022-07-09 04:37:28,850][26022] Updated weights on worker 0-0, policy_version 90738 (0.00083) [2022-07-09 04:37:30,684][26022] Updated weights on worker 0-0, policy_version 90748 (0.00092) [2022-07-09 04:37:31,027][25689] Fps is (10 sec: 5718.7, 60 sec: 5705.4, 300 sec: 5682.4). Total num frames: 92928000. Throughput: 0: 6009.1. Samples: 92933246. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:31,028][25689] Avg episode reward: [(0, '-51.259')] [2022-07-09 04:37:32,355][26022] Updated weights on worker 0-0, policy_version 90758 (0.00084) [2022-07-09 04:37:34,035][26022] Updated weights on worker 0-0, policy_version 90768 (0.00088) [2022-07-09 04:37:35,911][26022] Updated weights on worker 0-0, policy_version 90778 (0.00082) [2022-07-09 04:37:36,043][25689] Fps is (10 sec: 5616.5, 60 sec: 5689.8, 300 sec: 5684.7). Total num frames: 92956672. Throughput: 0: 5139.9. Samples: 92950330. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:36,043][25689] Avg episode reward: [(0, '-51.153')] [2022-07-09 04:37:37,708][26022] Updated weights on worker 0-0, policy_version 90788 (0.00095) [2022-07-09 04:37:39,682][26022] Updated weights on worker 0-0, policy_version 90798 (0.00091) [2022-07-09 04:37:41,102][25689] Fps is (10 sec: 5793.5, 60 sec: 5703.9, 300 sec: 5687.2). Total num frames: 92986368. Throughput: 0: 6000.5. Samples: 92984830. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:41,102][25689] Avg episode reward: [(0, '-51.357')] [2022-07-09 04:37:41,360][26022] Updated weights on worker 0-0, policy_version 90808 (0.00090) [2022-07-09 04:37:43,115][26022] Updated weights on worker 0-0, policy_version 90818 (0.00088) [2022-07-09 04:37:44,919][26022] Updated weights on worker 0-0, policy_version 90828 (0.00084) [2022-07-09 04:37:46,139][25689] Fps is (10 sec: 5882.7, 60 sec: 5702.3, 300 sec: 5686.6). Total num frames: 93016064. Throughput: 0: 5998.0. Samples: 93019486. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 04:37:46,139][25689] Avg episode reward: [(0, '-52.593')] [2022-07-09 04:37:46,723][26022] Updated weights on worker 0-0, policy_version 90838 (0.00091) [2022-07-09 04:37:48,499][26022] Updated weights on worker 0-0, policy_version 90848 (0.00081) [2022-07-09 04:37:50,372][26022] Updated weights on worker 0-0, policy_version 90858 (0.00088) [2022-07-09 04:37:51,189][25689] Fps is (10 sec: 5684.7, 60 sec: 5699.2, 300 sec: 5679.5). Total num frames: 93043712. Throughput: 0: 5127.7. Samples: 93036630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:37:51,190][25689] Avg episode reward: [(0, '-53.194')] [2022-07-09 04:37:51,941][26022] Updated weights on worker 0-0, policy_version 90868 (0.00092) [2022-07-09 04:37:53,863][26022] Updated weights on worker 0-0, policy_version 90878 (0.00086) [2022-07-09 04:37:55,663][26022] Updated weights on worker 0-0, policy_version 90888 (0.00095) [2022-07-09 04:37:56,274][25689] Fps is (10 sec: 5556.8, 60 sec: 5677.9, 300 sec: 5680.8). Total num frames: 93072384. Throughput: 0: 5964.8. Samples: 93071006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:37:56,274][25689] Avg episode reward: [(0, '-53.159')] [2022-07-09 04:37:57,460][26022] Updated weights on worker 0-0, policy_version 90898 (0.00087) [2022-07-09 04:37:59,258][26022] Updated weights on worker 0-0, policy_version 90908 (0.00093) [2022-07-09 04:38:00,950][26022] Updated weights on worker 0-0, policy_version 90918 (0.00095) [2022-07-09 04:38:01,391][25689] Fps is (10 sec: 5721.5, 60 sec: 5675.4, 300 sec: 5692.8). Total num frames: 93102080. Throughput: 0: 5939.6. Samples: 93105338. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:01,391][25689] Avg episode reward: [(0, '-53.034')] [2022-07-09 04:38:03,152][26022] Updated weights on worker 0-0, policy_version 90928 (0.00083) [2022-07-09 04:38:04,829][26022] Updated weights on worker 0-0, policy_version 90938 (0.00089) [2022-07-09 04:38:06,401][25689] Fps is (10 sec: 5561.4, 60 sec: 5715.9, 300 sec: 5683.5). Total num frames: 93128704. Throughput: 0: 5825.0. Samples: 93137514. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:06,401][25689] Avg episode reward: [(0, '-52.678')] [2022-07-09 04:38:06,762][26022] Updated weights on worker 0-0, policy_version 90948 (0.00088) [2022-07-09 04:38:08,403][26022] Updated weights on worker 0-0, policy_version 90958 (0.00086) [2022-07-09 04:38:10,195][26022] Updated weights on worker 0-0, policy_version 90968 (0.00087) [2022-07-09 04:38:11,409][25689] Fps is (10 sec: 5519.4, 60 sec: 5688.6, 300 sec: 5687.1). Total num frames: 93157376. Throughput: 0: 5851.1. Samples: 93154938. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:11,410][25689] Avg episode reward: [(0, '-52.599')] [2022-07-09 04:38:11,852][26022] Updated weights on worker 0-0, policy_version 90978 (0.00096) [2022-07-09 04:38:14,025][26022] Updated weights on worker 0-0, policy_version 90988 (0.00069) [2022-07-09 04:38:15,408][26022] Updated weights on worker 0-0, policy_version 90998 (0.00087) [2022-07-09 04:38:16,450][25689] Fps is (10 sec: 5604.4, 60 sec: 5670.2, 300 sec: 5684.5). Total num frames: 93185024. Throughput: 0: 5878.7. Samples: 93189616. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:16,451][25689] Avg episode reward: [(0, '-52.117')] [2022-07-09 04:38:17,428][26022] Updated weights on worker 0-0, policy_version 91008 (0.00080) [2022-07-09 04:38:19,142][26022] Updated weights on worker 0-0, policy_version 91018 (0.00092) [2022-07-09 04:38:20,970][26022] Updated weights on worker 0-0, policy_version 91028 (0.00090) [2022-07-09 04:38:21,512][25689] Fps is (10 sec: 5878.5, 60 sec: 5719.7, 300 sec: 5697.7). Total num frames: 93216768. Throughput: 0: 5908.1. Samples: 93224220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:21,513][25689] Avg episode reward: [(0, '-51.379')] [2022-07-09 04:38:22,829][26022] Updated weights on worker 0-0, policy_version 91038 (0.00089) [2022-07-09 04:38:24,391][26022] Updated weights on worker 0-0, policy_version 91048 (0.00092) [2022-07-09 04:38:25,921][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:38:25,941][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000091056_93241344.pth [2022-07-09 04:38:25,941][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000089057_91194368.pth [2022-07-09 04:38:26,354][26022] Updated weights on worker 0-0, policy_version 91058 (0.00055) [2022-07-09 04:38:26,539][25689] Fps is (10 sec: 5887.0, 60 sec: 5684.8, 300 sec: 5687.4). Total num frames: 93244416. Throughput: 0: 5169.8. Samples: 93241622. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:26,540][25689] Avg episode reward: [(0, '-51.991')] [2022-07-09 04:38:28,006][26022] Updated weights on worker 0-0, policy_version 91068 (0.00545) [2022-07-09 04:38:29,875][26022] Updated weights on worker 0-0, policy_version 91078 (0.00051) [2022-07-09 04:38:31,419][26022] Updated weights on worker 0-0, policy_version 91088 (0.00089) [2022-07-09 04:38:31,555][25689] Fps is (10 sec: 5710.1, 60 sec: 5718.2, 300 sec: 5694.3). Total num frames: 93274112. Throughput: 0: 6021.5. Samples: 93276246. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:31,557][25689] Avg episode reward: [(0, '-52.097')] [2022-07-09 04:38:33,345][26022] Updated weights on worker 0-0, policy_version 91098 (0.00093) [2022-07-09 04:38:35,273][26022] Updated weights on worker 0-0, policy_version 91108 (0.00094) [2022-07-09 04:38:36,567][25689] Fps is (10 sec: 5820.5, 60 sec: 5718.6, 300 sec: 5692.2). Total num frames: 93302784. Throughput: 0: 6003.7. Samples: 93310390. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:36,567][25689] Avg episode reward: [(0, '-52.783')] [2022-07-09 04:38:36,995][26022] Updated weights on worker 0-0, policy_version 91118 (0.00083) [2022-07-09 04:38:38,743][26022] Updated weights on worker 0-0, policy_version 91128 (0.00096) [2022-07-09 04:38:40,564][26022] Updated weights on worker 0-0, policy_version 91138 (0.00083) [2022-07-09 04:38:41,649][25689] Fps is (10 sec: 5681.2, 60 sec: 5699.5, 300 sec: 5694.7). Total num frames: 93331456. Throughput: 0: 5134.1. Samples: 93327602. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:41,649][25689] Avg episode reward: [(0, '-52.011')] [2022-07-09 04:38:42,419][26022] Updated weights on worker 0-0, policy_version 91148 (0.00103) [2022-07-09 04:38:44,154][26022] Updated weights on worker 0-0, policy_version 91158 (0.00088) [2022-07-09 04:38:45,950][26022] Updated weights on worker 0-0, policy_version 91168 (0.00089) [2022-07-09 04:38:46,653][25689] Fps is (10 sec: 5685.2, 60 sec: 5685.6, 300 sec: 5691.5). Total num frames: 93360128. Throughput: 0: 6002.8. Samples: 93362366. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:46,654][25689] Avg episode reward: [(0, '-52.552')] [2022-07-09 04:38:47,733][26022] Updated weights on worker 0-0, policy_version 91178 (0.00091) [2022-07-09 04:38:49,496][26022] Updated weights on worker 0-0, policy_version 91188 (0.00081) [2022-07-09 04:38:51,306][26022] Updated weights on worker 0-0, policy_version 91198 (0.00088) [2022-07-09 04:38:51,690][25689] Fps is (10 sec: 5710.6, 60 sec: 5703.8, 300 sec: 5695.5). Total num frames: 93388800. Throughput: 0: 5973.9. Samples: 93396532. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:51,691][25689] Avg episode reward: [(0, '-53.625')] [2022-07-09 04:38:53,149][26022] Updated weights on worker 0-0, policy_version 91208 (0.00093) [2022-07-09 04:38:54,790][26022] Updated weights on worker 0-0, policy_version 91218 (0.00090) [2022-07-09 04:38:56,641][26022] Updated weights on worker 0-0, policy_version 91228 (0.00084) [2022-07-09 04:38:56,717][25689] Fps is (10 sec: 5697.8, 60 sec: 5709.3, 300 sec: 5697.7). Total num frames: 93417472. Throughput: 0: 5133.7. Samples: 93413838. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:38:56,718][25689] Avg episode reward: [(0, '-53.379')] [2022-07-09 04:38:58,487][26022] Updated weights on worker 0-0, policy_version 91238 (0.00083) [2022-07-09 04:39:00,082][26022] Updated weights on worker 0-0, policy_version 91248 (0.00383) [2022-07-09 04:39:01,822][25689] Fps is (10 sec: 5660.0, 60 sec: 5693.5, 300 sec: 5699.4). Total num frames: 93446144. Throughput: 0: 5996.6. Samples: 93448572. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:01,822][25689] Avg episode reward: [(0, '-53.096')] [2022-07-09 04:39:02,530][26022] Updated weights on worker 0-0, policy_version 91258 (0.00090) [2022-07-09 04:39:03,923][26022] Updated weights on worker 0-0, policy_version 91268 (0.00087) [2022-07-09 04:39:05,856][26022] Updated weights on worker 0-0, policy_version 91278 (0.00088) [2022-07-09 04:39:06,828][25689] Fps is (10 sec: 5570.7, 60 sec: 5710.9, 300 sec: 5697.7). Total num frames: 93473792. Throughput: 0: 5877.9. Samples: 93480950. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:06,828][25689] Avg episode reward: [(0, '-53.223')] [2022-07-09 04:39:07,529][26022] Updated weights on worker 0-0, policy_version 91288 (0.00084) [2022-07-09 04:39:09,468][26022] Updated weights on worker 0-0, policy_version 91298 (0.00082) [2022-07-09 04:39:11,218][26022] Updated weights on worker 0-0, policy_version 91308 (0.00088) [2022-07-09 04:39:11,849][25689] Fps is (10 sec: 5718.8, 60 sec: 5726.5, 300 sec: 5698.9). Total num frames: 93503488. Throughput: 0: 5046.8. Samples: 93498270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:11,850][25689] Avg episode reward: [(0, '-53.408')] [2022-07-09 04:39:12,967][26022] Updated weights on worker 0-0, policy_version 91318 (0.00084) [2022-07-09 04:39:14,562][26022] Updated weights on worker 0-0, policy_version 91328 (0.01180) [2022-07-09 04:39:16,786][26022] Updated weights on worker 0-0, policy_version 91338 (0.00090) [2022-07-09 04:39:16,876][25689] Fps is (10 sec: 5604.9, 60 sec: 5710.9, 300 sec: 5689.7). Total num frames: 93530112. Throughput: 0: 5921.1. Samples: 93533198. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:16,877][25689] Avg episode reward: [(0, '-53.534')] [2022-07-09 04:39:18,167][26022] Updated weights on worker 0-0, policy_version 91348 (0.00085) [2022-07-09 04:39:20,212][26022] Updated weights on worker 0-0, policy_version 91358 (0.00096) [2022-07-09 04:39:21,859][26022] Updated weights on worker 0-0, policy_version 91368 (0.00089) [2022-07-09 04:39:21,932][25689] Fps is (10 sec: 5687.3, 60 sec: 5694.6, 300 sec: 5699.1). Total num frames: 93560832. Throughput: 0: 5907.8. Samples: 93567378. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:21,932][25689] Avg episode reward: [(0, '-53.972')] [2022-07-09 04:39:23,778][26022] Updated weights on worker 0-0, policy_version 91378 (0.00064) [2022-07-09 04:39:25,342][26022] Updated weights on worker 0-0, policy_version 91388 (0.00091) [2022-07-09 04:39:26,967][25689] Fps is (10 sec: 5885.8, 60 sec: 5710.7, 300 sec: 5695.2). Total num frames: 93589504. Throughput: 0: 5153.4. Samples: 93584736. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:26,968][25689] Avg episode reward: [(0, '-54.344')] [2022-07-09 04:39:27,497][26022] Updated weights on worker 0-0, policy_version 91398 (0.00087) [2022-07-09 04:39:29,042][26022] Updated weights on worker 0-0, policy_version 91408 (0.00088) [2022-07-09 04:39:31,167][26022] Updated weights on worker 0-0, policy_version 91418 (0.00085) [2022-07-09 04:39:31,988][25689] Fps is (10 sec: 5804.4, 60 sec: 5710.3, 300 sec: 5695.0). Total num frames: 93619200. Throughput: 0: 5996.9. Samples: 93619040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:31,990][25689] Avg episode reward: [(0, '-53.846')] [2022-07-09 04:39:32,337][26022] Updated weights on worker 0-0, policy_version 91428 (0.00088) [2022-07-09 04:39:34,551][26022] Updated weights on worker 0-0, policy_version 91438 (0.00095) [2022-07-09 04:39:36,503][26022] Updated weights on worker 0-0, policy_version 91448 (0.00101) [2022-07-09 04:39:36,993][25689] Fps is (10 sec: 5617.5, 60 sec: 5677.0, 300 sec: 5693.4). Total num frames: 93645824. Throughput: 0: 5971.3. Samples: 93653320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 04:39:36,993][25689] Avg episode reward: [(0, '-54.047')] [2022-07-09 04:39:37,808][26022] Updated weights on worker 0-0, policy_version 91458 (0.00091) [2022-07-09 04:39:40,005][26022] Updated weights on worker 0-0, policy_version 91468 (0.00087) [2022-07-09 04:39:41,640][26022] Updated weights on worker 0-0, policy_version 91478 (0.00093) [2022-07-09 04:39:42,143][25689] Fps is (10 sec: 5546.0, 60 sec: 5687.5, 300 sec: 5690.6). Total num frames: 93675520. Throughput: 0: 5109.7. Samples: 93670650. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:39:42,144][25689] Avg episode reward: [(0, '-54.144')] [2022-07-09 04:39:43,335][26022] Updated weights on worker 0-0, policy_version 91488 (0.00092) [2022-07-09 04:39:45,394][26022] Updated weights on worker 0-0, policy_version 91498 (0.00088) [2022-07-09 04:39:46,761][26022] Updated weights on worker 0-0, policy_version 91508 (0.00087) [2022-07-09 04:39:47,154][25689] Fps is (10 sec: 5845.1, 60 sec: 5703.9, 300 sec: 5694.0). Total num frames: 93705216. Throughput: 0: 5958.8. Samples: 93705026. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:39:47,154][25689] Avg episode reward: [(0, '-53.633')] [2022-07-09 04:39:48,868][26022] Updated weights on worker 0-0, policy_version 91518 (0.00091) [2022-07-09 04:39:50,499][26022] Updated weights on worker 0-0, policy_version 91528 (0.00100) [2022-07-09 04:39:52,164][25689] Fps is (10 sec: 5722.6, 60 sec: 5689.5, 300 sec: 5690.6). Total num frames: 93732864. Throughput: 0: 5969.4. Samples: 93739478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:39:52,164][25689] Avg episode reward: [(0, '-53.614')] [2022-07-09 04:39:52,382][26022] Updated weights on worker 0-0, policy_version 91538 (0.00088) [2022-07-09 04:39:54,275][26022] Updated weights on worker 0-0, policy_version 91548 (0.00087) [2022-07-09 04:39:55,840][26022] Updated weights on worker 0-0, policy_version 91558 (0.00091) [2022-07-09 04:39:57,165][25689] Fps is (10 sec: 5830.0, 60 sec: 5725.8, 300 sec: 5699.9). Total num frames: 93763584. Throughput: 0: 5126.6. Samples: 93756738. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:39:57,166][25689] Avg episode reward: [(0, '-53.810')] [2022-07-09 04:39:57,504][26022] Updated weights on worker 0-0, policy_version 91568 (0.00086) [2022-07-09 04:39:59,471][26022] Updated weights on worker 0-0, policy_version 91578 (0.00087) [2022-07-09 04:40:01,209][26022] Updated weights on worker 0-0, policy_version 91588 (0.00085) [2022-07-09 04:40:02,292][25689] Fps is (10 sec: 5560.4, 60 sec: 5672.8, 300 sec: 5694.6). Total num frames: 93789184. Throughput: 0: 5994.8. Samples: 93791442. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:02,293][25689] Avg episode reward: [(0, '-53.866')] [2022-07-09 04:40:03,302][26022] Updated weights on worker 0-0, policy_version 91598 (0.00087) [2022-07-09 04:40:05,235][26022] Updated weights on worker 0-0, policy_version 91608 (0.00092) [2022-07-09 04:40:06,959][26022] Updated weights on worker 0-0, policy_version 91618 (0.00081) [2022-07-09 04:40:07,370][25689] Fps is (10 sec: 5318.1, 60 sec: 5683.0, 300 sec: 5696.5). Total num frames: 93817856. Throughput: 0: 5865.7. Samples: 93823612. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:07,371][25689] Avg episode reward: [(0, '-53.878')] [2022-07-09 04:40:08,734][26022] Updated weights on worker 0-0, policy_version 91628 (0.00097) [2022-07-09 04:40:10,555][26022] Updated weights on worker 0-0, policy_version 91638 (0.00089) [2022-07-09 04:40:12,171][26022] Updated weights on worker 0-0, policy_version 91648 (0.00098) [2022-07-09 04:40:12,428][25689] Fps is (10 sec: 5859.7, 60 sec: 5696.5, 300 sec: 5699.8). Total num frames: 93848576. Throughput: 0: 5003.6. Samples: 93840874. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:12,429][25689] Avg episode reward: [(0, '-53.505')] [2022-07-09 04:40:14,196][26022] Updated weights on worker 0-0, policy_version 91658 (0.00087) [2022-07-09 04:40:15,770][26022] Updated weights on worker 0-0, policy_version 91668 (0.00089) [2022-07-09 04:40:17,490][25689] Fps is (10 sec: 5767.6, 60 sec: 5710.1, 300 sec: 5696.4). Total num frames: 93876224. Throughput: 0: 5833.3. Samples: 93875300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:17,491][25689] Avg episode reward: [(0, '-53.061')] [2022-07-09 04:40:17,739][26022] Updated weights on worker 0-0, policy_version 91678 (0.00080) [2022-07-09 04:40:19,437][26022] Updated weights on worker 0-0, policy_version 91688 (0.00306) [2022-07-09 04:40:21,195][26022] Updated weights on worker 0-0, policy_version 91698 (0.00090) [2022-07-09 04:40:22,556][25689] Fps is (10 sec: 5662.1, 60 sec: 5692.3, 300 sec: 5695.5). Total num frames: 93905920. Throughput: 0: 5835.9. Samples: 93909698. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:22,556][25689] Avg episode reward: [(0, '-53.090')] [2022-07-09 04:40:23,013][26022] Updated weights on worker 0-0, policy_version 91708 (0.00084) [2022-07-09 04:40:24,780][26022] Updated weights on worker 0-0, policy_version 91718 (0.00089) [2022-07-09 04:40:26,055][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:40:26,065][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000091725_93926400.pth [2022-07-09 04:40:26,066][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000089722_91875328.pth [2022-07-09 04:40:26,549][26022] Updated weights on worker 0-0, policy_version 91728 (0.00092) [2022-07-09 04:40:27,657][25689] Fps is (10 sec: 5741.4, 60 sec: 5686.1, 300 sec: 5700.8). Total num frames: 93934592. Throughput: 0: 5953.4. Samples: 93944384. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:27,657][25689] Avg episode reward: [(0, '-52.805')] [2022-07-09 04:40:28,374][26022] Updated weights on worker 0-0, policy_version 91738 (0.00086) [2022-07-09 04:40:30,083][26022] Updated weights on worker 0-0, policy_version 91748 (0.00090) [2022-07-09 04:40:32,105][26022] Updated weights on worker 0-0, policy_version 91758 (0.00086) [2022-07-09 04:40:32,681][25689] Fps is (10 sec: 5663.7, 60 sec: 5668.9, 300 sec: 5700.6). Total num frames: 93963264. Throughput: 0: 5965.5. Samples: 93961690. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:32,681][25689] Avg episode reward: [(0, '-52.984')] [2022-07-09 04:40:33,559][26022] Updated weights on worker 0-0, policy_version 91768 (0.00088) [2022-07-09 04:40:35,574][26022] Updated weights on worker 0-0, policy_version 91778 (0.00097) [2022-07-09 04:40:37,179][26022] Updated weights on worker 0-0, policy_version 91788 (0.00087) [2022-07-09 04:40:37,696][25689] Fps is (10 sec: 5813.8, 60 sec: 5718.5, 300 sec: 5702.4). Total num frames: 93992960. Throughput: 0: 5966.7. Samples: 93995862. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:37,697][25689] Avg episode reward: [(0, '-52.780')] [2022-07-09 04:40:39,181][26022] Updated weights on worker 0-0, policy_version 91798 (0.00106) [2022-07-09 04:40:41,032][26022] Updated weights on worker 0-0, policy_version 91808 (0.00095) [2022-07-09 04:40:42,726][26022] Updated weights on worker 0-0, policy_version 91818 (0.00118) [2022-07-09 04:40:42,827][25689] Fps is (10 sec: 5752.5, 60 sec: 5703.5, 300 sec: 5700.2). Total num frames: 94021632. Throughput: 0: 5956.5. Samples: 94030444. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:42,828][25689] Avg episode reward: [(0, '-52.991')] [2022-07-09 04:40:44,338][26022] Updated weights on worker 0-0, policy_version 91828 (0.00089) [2022-07-09 04:40:46,295][26022] Updated weights on worker 0-0, policy_version 91838 (0.00086) [2022-07-09 04:40:47,851][25689] Fps is (10 sec: 5747.6, 60 sec: 5702.2, 300 sec: 5700.3). Total num frames: 94051328. Throughput: 0: 5112.9. Samples: 94047638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:47,852][25689] Avg episode reward: [(0, '-52.987')] [2022-07-09 04:40:48,096][26022] Updated weights on worker 0-0, policy_version 91848 (0.00091) [2022-07-09 04:40:49,754][26022] Updated weights on worker 0-0, policy_version 91858 (0.00096) [2022-07-09 04:40:51,515][26022] Updated weights on worker 0-0, policy_version 91868 (0.00092) [2022-07-09 04:40:52,919][25689] Fps is (10 sec: 5783.5, 60 sec: 5713.6, 300 sec: 5703.3). Total num frames: 94080000. Throughput: 0: 5950.2. Samples: 94082114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:52,920][25689] Avg episode reward: [(0, '-52.827')] [2022-07-09 04:40:53,430][26022] Updated weights on worker 0-0, policy_version 91878 (0.00313) [2022-07-09 04:40:55,103][26022] Updated weights on worker 0-0, policy_version 91888 (0.00097) [2022-07-09 04:40:56,970][26022] Updated weights on worker 0-0, policy_version 91898 (0.00094) [2022-07-09 04:40:57,990][25689] Fps is (10 sec: 5554.7, 60 sec: 5656.6, 300 sec: 5697.0). Total num frames: 94107648. Throughput: 0: 5947.5. Samples: 94116562. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:40:57,991][25689] Avg episode reward: [(0, '-52.616')] [2022-07-09 04:40:58,628][26022] Updated weights on worker 0-0, policy_version 91908 (0.00078) [2022-07-09 04:41:00,679][26022] Updated weights on worker 0-0, policy_version 91918 (0.00086) [2022-07-09 04:41:02,657][26022] Updated weights on worker 0-0, policy_version 91928 (0.00087) [2022-07-09 04:41:03,091][25689] Fps is (10 sec: 5536.9, 60 sec: 5709.6, 300 sec: 5698.8). Total num frames: 94136320. Throughput: 0: 5094.3. Samples: 94133674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:41:03,092][25689] Avg episode reward: [(0, '-52.514')] [2022-07-09 04:41:04,578][26022] Updated weights on worker 0-0, policy_version 91938 (0.00086) [2022-07-09 04:41:06,257][26022] Updated weights on worker 0-0, policy_version 91948 (0.00091) [2022-07-09 04:41:08,161][25689] Fps is (10 sec: 5638.3, 60 sec: 5710.4, 300 sec: 5704.5). Total num frames: 94164992. Throughput: 0: 5805.3. Samples: 94165542. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:41:08,161][25689] Avg episode reward: [(0, '-52.367')] [2022-07-09 04:41:08,164][26022] Updated weights on worker 0-0, policy_version 91958 (0.00097) [2022-07-09 04:41:09,945][26022] Updated weights on worker 0-0, policy_version 91968 (0.00087) [2022-07-09 04:41:11,744][26022] Updated weights on worker 0-0, policy_version 91978 (0.00096) [2022-07-09 04:41:13,169][25689] Fps is (10 sec: 5588.2, 60 sec: 5664.4, 300 sec: 5698.5). Total num frames: 94192640. Throughput: 0: 5810.0. Samples: 94199768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:41:13,170][25689] Avg episode reward: [(0, '-53.073')] [2022-07-09 04:41:13,466][26022] Updated weights on worker 0-0, policy_version 91988 (0.00089) [2022-07-09 04:41:15,315][26022] Updated weights on worker 0-0, policy_version 91998 (0.00090) [2022-07-09 04:41:16,938][26022] Updated weights on worker 0-0, policy_version 92008 (0.00083) [2022-07-09 04:41:18,225][25689] Fps is (10 sec: 5697.9, 60 sec: 5698.7, 300 sec: 5695.8). Total num frames: 94222336. Throughput: 0: 4970.2. Samples: 94217132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:41:18,226][25689] Avg episode reward: [(0, '-52.574')] [2022-07-09 04:41:18,929][26022] Updated weights on worker 0-0, policy_version 92018 (0.00085) [2022-07-09 04:41:20,734][26022] Updated weights on worker 0-0, policy_version 92028 (0.00099) [2022-07-09 04:41:22,430][26022] Updated weights on worker 0-0, policy_version 92038 (0.00087) [2022-07-09 04:41:23,265][25689] Fps is (10 sec: 5680.4, 60 sec: 5667.5, 300 sec: 5691.8). Total num frames: 94249984. Throughput: 0: 5838.0. Samples: 94251446. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:41:23,265][25689] Avg episode reward: [(0, '-52.604')] [2022-07-09 04:41:24,396][26022] Updated weights on worker 0-0, policy_version 92048 (0.00093) [2022-07-09 04:41:26,054][26022] Updated weights on worker 0-0, policy_version 92058 (0.00094) [2022-07-09 04:41:27,905][26022] Updated weights on worker 0-0, policy_version 92068 (0.00086) [2022-07-09 04:41:28,283][25689] Fps is (10 sec: 5599.6, 60 sec: 5675.2, 300 sec: 5695.4). Total num frames: 94278656. Throughput: 0: 5974.9. Samples: 94285768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 04:41:28,283][25689] Avg episode reward: [(0, '-52.897')] [2022-07-09 04:41:29,788][26022] Updated weights on worker 0-0, policy_version 92078 (0.00095) [2022-07-09 04:41:31,344][26022] Updated weights on worker 0-0, policy_version 92088 (0.00085) [2022-07-09 04:41:33,315][25689] Fps is (10 sec: 5705.4, 60 sec: 5674.4, 300 sec: 5691.9). Total num frames: 94307328. Throughput: 0: 5117.2. Samples: 94302858. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:41:33,316][25689] Avg episode reward: [(0, '-52.471')] [2022-07-09 04:41:33,378][26022] Updated weights on worker 0-0, policy_version 92098 (0.00065) [2022-07-09 04:41:34,915][26022] Updated weights on worker 0-0, policy_version 92108 (0.00087) [2022-07-09 04:41:36,962][26022] Updated weights on worker 0-0, policy_version 92118 (0.00088) [2022-07-09 04:41:38,331][25689] Fps is (10 sec: 5808.9, 60 sec: 5674.4, 300 sec: 5695.6). Total num frames: 94337024. Throughput: 0: 5973.7. Samples: 94337238. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:41:38,332][25689] Avg episode reward: [(0, '-52.384')] [2022-07-09 04:41:38,602][26022] Updated weights on worker 0-0, policy_version 92128 (0.00093) [2022-07-09 04:41:40,422][26022] Updated weights on worker 0-0, policy_version 92138 (0.00096) [2022-07-09 04:41:42,203][26022] Updated weights on worker 0-0, policy_version 92148 (0.00111) [2022-07-09 04:41:43,362][25689] Fps is (10 sec: 5809.5, 60 sec: 5683.7, 300 sec: 5691.9). Total num frames: 94365696. Throughput: 0: 5978.6. Samples: 94371602. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:41:43,364][25689] Avg episode reward: [(0, '-52.932')] [2022-07-09 04:41:43,904][26022] Updated weights on worker 0-0, policy_version 92158 (0.00089) [2022-07-09 04:41:45,831][26022] Updated weights on worker 0-0, policy_version 92168 (0.00095) [2022-07-09 04:41:47,545][26022] Updated weights on worker 0-0, policy_version 92178 (0.00091) [2022-07-09 04:41:48,387][25689] Fps is (10 sec: 5702.5, 60 sec: 5666.7, 300 sec: 5695.2). Total num frames: 94394368. Throughput: 0: 5129.2. Samples: 94388886. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:41:48,387][25689] Avg episode reward: [(0, '-53.145')] [2022-07-09 04:41:49,256][26022] Updated weights on worker 0-0, policy_version 92188 (0.00097) [2022-07-09 04:41:51,140][26022] Updated weights on worker 0-0, policy_version 92198 (0.00089) [2022-07-09 04:41:52,908][26022] Updated weights on worker 0-0, policy_version 92208 (0.00087) [2022-07-09 04:41:53,395][25689] Fps is (10 sec: 5613.9, 60 sec: 5655.5, 300 sec: 5688.9). Total num frames: 94422016. Throughput: 0: 5992.3. Samples: 94423180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:41:53,395][25689] Avg episode reward: [(0, '-52.605')] [2022-07-09 04:41:54,655][26022] Updated weights on worker 0-0, policy_version 92218 (0.00097) [2022-07-09 04:41:56,696][26022] Updated weights on worker 0-0, policy_version 92228 (0.00087) [2022-07-09 04:41:58,211][26022] Updated weights on worker 0-0, policy_version 92238 (0.00092) [2022-07-09 04:41:58,407][25689] Fps is (10 sec: 5825.3, 60 sec: 5711.8, 300 sec: 5693.9). Total num frames: 94452736. Throughput: 0: 5992.9. Samples: 94457550. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:41:58,407][25689] Avg episode reward: [(0, '-52.444')] [2022-07-09 04:42:00,178][26022] Updated weights on worker 0-0, policy_version 92248 (0.00086) [2022-07-09 04:42:01,865][26022] Updated weights on worker 0-0, policy_version 92258 (0.00096) [2022-07-09 04:42:03,516][25689] Fps is (10 sec: 5665.6, 60 sec: 5677.1, 300 sec: 5700.2). Total num frames: 94479360. Throughput: 0: 5123.3. Samples: 94474856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:03,517][25689] Avg episode reward: [(0, '-52.136')] [2022-07-09 04:42:03,980][26022] Updated weights on worker 0-0, policy_version 92268 (0.00088) [2022-07-09 04:42:05,944][26022] Updated weights on worker 0-0, policy_version 92278 (0.00089) [2022-07-09 04:42:07,567][26022] Updated weights on worker 0-0, policy_version 92288 (0.00088) [2022-07-09 04:42:08,568][25689] Fps is (10 sec: 5442.1, 60 sec: 5678.9, 300 sec: 5693.8). Total num frames: 94508032. Throughput: 0: 5854.4. Samples: 94507030. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:08,568][25689] Avg episode reward: [(0, '-51.190')] [2022-07-09 04:42:09,452][26022] Updated weights on worker 0-0, policy_version 92298 (0.00093) [2022-07-09 04:42:11,401][26022] Updated weights on worker 0-0, policy_version 92308 (0.00091) [2022-07-09 04:42:13,057][26022] Updated weights on worker 0-0, policy_version 92318 (0.00084) [2022-07-09 04:42:13,614][25689] Fps is (10 sec: 5678.8, 60 sec: 5692.2, 300 sec: 5693.4). Total num frames: 94536704. Throughput: 0: 5835.6. Samples: 94541172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:13,614][25689] Avg episode reward: [(0, '-51.322')] [2022-07-09 04:42:14,741][26022] Updated weights on worker 0-0, policy_version 92328 (0.00085) [2022-07-09 04:42:16,604][26022] Updated weights on worker 0-0, policy_version 92338 (0.00096) [2022-07-09 04:42:18,280][26022] Updated weights on worker 0-0, policy_version 92348 (0.00083) [2022-07-09 04:42:18,714][25689] Fps is (10 sec: 5651.5, 60 sec: 5671.1, 300 sec: 5692.4). Total num frames: 94565376. Throughput: 0: 4980.4. Samples: 94558690. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:18,716][25689] Avg episode reward: [(0, '-51.084')] [2022-07-09 04:42:20,018][26022] Updated weights on worker 0-0, policy_version 92358 (0.00084) [2022-07-09 04:42:22,034][26022] Updated weights on worker 0-0, policy_version 92368 (0.00086) [2022-07-09 04:42:23,685][26022] Updated weights on worker 0-0, policy_version 92378 (0.00089) [2022-07-09 04:42:23,773][25689] Fps is (10 sec: 5745.4, 60 sec: 5703.1, 300 sec: 5691.6). Total num frames: 94595072. Throughput: 0: 5844.0. Samples: 94593238. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:23,774][25689] Avg episode reward: [(0, '-51.951')] [2022-07-09 04:42:25,647][26022] Updated weights on worker 0-0, policy_version 92388 (0.00085) [2022-07-09 04:42:26,125][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:42:26,135][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000092392_94609408.pth [2022-07-09 04:42:26,136][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000090388_92557312.pth [2022-07-09 04:42:27,175][26022] Updated weights on worker 0-0, policy_version 92398 (0.00085) [2022-07-09 04:42:28,871][25689] Fps is (10 sec: 5645.8, 60 sec: 5678.7, 300 sec: 5690.0). Total num frames: 94622720. Throughput: 0: 5941.0. Samples: 94627654. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:28,872][25689] Avg episode reward: [(0, '-52.266')] [2022-07-09 04:42:29,219][26022] Updated weights on worker 0-0, policy_version 92408 (0.00099) [2022-07-09 04:42:30,904][26022] Updated weights on worker 0-0, policy_version 92418 (0.00094) [2022-07-09 04:42:32,665][26022] Updated weights on worker 0-0, policy_version 92428 (0.00087) [2022-07-09 04:42:33,891][25689] Fps is (10 sec: 5667.8, 60 sec: 5696.9, 300 sec: 5693.3). Total num frames: 94652416. Throughput: 0: 5963.4. Samples: 94662088. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:33,891][25689] Avg episode reward: [(0, '-53.064')] [2022-07-09 04:42:34,394][26022] Updated weights on worker 0-0, policy_version 92438 (0.00092) [2022-07-09 04:42:36,168][26022] Updated weights on worker 0-0, policy_version 92448 (0.00087) [2022-07-09 04:42:38,006][26022] Updated weights on worker 0-0, policy_version 92458 (0.00088) [2022-07-09 04:42:38,927][25689] Fps is (10 sec: 5906.5, 60 sec: 5694.9, 300 sec: 5693.8). Total num frames: 94682112. Throughput: 0: 5972.9. Samples: 94679416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:38,927][25689] Avg episode reward: [(0, '-52.279')] [2022-07-09 04:42:39,871][26022] Updated weights on worker 0-0, policy_version 92468 (0.00099) [2022-07-09 04:42:41,539][26022] Updated weights on worker 0-0, policy_version 92478 (0.00088) [2022-07-09 04:42:43,295][26022] Updated weights on worker 0-0, policy_version 92488 (0.00091) [2022-07-09 04:42:44,040][25689] Fps is (10 sec: 5851.9, 60 sec: 5704.1, 300 sec: 5692.3). Total num frames: 94711808. Throughput: 0: 5971.0. Samples: 94714250. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:44,040][25689] Avg episode reward: [(0, '-52.045')] [2022-07-09 04:42:45,123][26022] Updated weights on worker 0-0, policy_version 92498 (0.00084) [2022-07-09 04:42:46,949][26022] Updated weights on worker 0-0, policy_version 92508 (0.00080) [2022-07-09 04:42:48,626][26022] Updated weights on worker 0-0, policy_version 92518 (0.00090) [2022-07-09 04:42:49,088][25689] Fps is (10 sec: 5844.9, 60 sec: 5718.8, 300 sec: 5699.2). Total num frames: 94741504. Throughput: 0: 5998.2. Samples: 94748918. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:49,088][25689] Avg episode reward: [(0, '-52.010')] [2022-07-09 04:42:50,663][26022] Updated weights on worker 0-0, policy_version 92528 (0.00092) [2022-07-09 04:42:52,056][26022] Updated weights on worker 0-0, policy_version 92538 (0.00090) [2022-07-09 04:42:54,103][25689] Fps is (10 sec: 5596.7, 60 sec: 5701.2, 300 sec: 5693.7). Total num frames: 94768128. Throughput: 0: 5144.5. Samples: 94766070. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:54,103][25689] Avg episode reward: [(0, '-52.210')] [2022-07-09 04:42:54,145][26022] Updated weights on worker 0-0, policy_version 92548 (0.00093) [2022-07-09 04:42:55,875][26022] Updated weights on worker 0-0, policy_version 92558 (0.00094) [2022-07-09 04:42:57,470][26022] Updated weights on worker 0-0, policy_version 92568 (0.00088) [2022-07-09 04:42:59,151][25689] Fps is (10 sec: 5495.1, 60 sec: 5664.2, 300 sec: 5691.6). Total num frames: 94796800. Throughput: 0: 5984.2. Samples: 94800442. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:42:59,151][25689] Avg episode reward: [(0, '-52.093')] [2022-07-09 04:42:59,624][26022] Updated weights on worker 0-0, policy_version 92578 (0.00085) [2022-07-09 04:43:01,224][26022] Updated weights on worker 0-0, policy_version 92588 (0.00090) [2022-07-09 04:43:03,449][26022] Updated weights on worker 0-0, policy_version 92598 (0.00086) [2022-07-09 04:43:04,207][25689] Fps is (10 sec: 5776.7, 60 sec: 5719.7, 300 sec: 5701.0). Total num frames: 94826496. Throughput: 0: 5870.8. Samples: 94832648. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:43:04,208][25689] Avg episode reward: [(0, '-52.523')] [2022-07-09 04:43:05,182][26022] Updated weights on worker 0-0, policy_version 92608 (0.00081) [2022-07-09 04:43:06,838][26022] Updated weights on worker 0-0, policy_version 92618 (0.00077) [2022-07-09 04:43:08,866][26022] Updated weights on worker 0-0, policy_version 92628 (0.00085) [2022-07-09 04:43:09,216][25689] Fps is (10 sec: 5493.8, 60 sec: 5673.1, 300 sec: 5690.7). Total num frames: 94852096. Throughput: 0: 5012.2. Samples: 94849804. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:43:09,216][25689] Avg episode reward: [(0, '-52.645')] [2022-07-09 04:43:10,477][26022] Updated weights on worker 0-0, policy_version 92638 (0.00098) [2022-07-09 04:43:12,326][26022] Updated weights on worker 0-0, policy_version 92648 (0.00084) [2022-07-09 04:43:14,067][26022] Updated weights on worker 0-0, policy_version 92658 (0.00090) [2022-07-09 04:43:14,228][25689] Fps is (10 sec: 5517.9, 60 sec: 5693.2, 300 sec: 5698.1). Total num frames: 94881792. Throughput: 0: 5868.6. Samples: 94884176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:43:14,229][25689] Avg episode reward: [(0, '-52.699')] [2022-07-09 04:43:15,738][26022] Updated weights on worker 0-0, policy_version 92668 (0.00089) [2022-07-09 04:43:17,802][26022] Updated weights on worker 0-0, policy_version 92678 (0.00099) [2022-07-09 04:43:19,230][25689] Fps is (10 sec: 5930.7, 60 sec: 5719.3, 300 sec: 5692.4). Total num frames: 94911488. Throughput: 0: 5895.2. Samples: 94918816. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 04:43:19,231][25689] Avg episode reward: [(0, '-52.320')] [2022-07-09 04:43:19,346][26022] Updated weights on worker 0-0, policy_version 92688 (0.00053) [2022-07-09 04:43:21,123][26022] Updated weights on worker 0-0, policy_version 92698 (0.00084) [2022-07-09 04:43:22,928][26022] Updated weights on worker 0-0, policy_version 92708 (0.00093) [2022-07-09 04:43:24,327][25689] Fps is (10 sec: 5779.8, 60 sec: 5698.9, 300 sec: 5694.5). Total num frames: 94940160. Throughput: 0: 5134.2. Samples: 94935948. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:24,327][25689] Avg episode reward: [(0, '-52.281')] [2022-07-09 04:43:24,707][26022] Updated weights on worker 0-0, policy_version 92718 (0.00090) [2022-07-09 04:43:26,616][26022] Updated weights on worker 0-0, policy_version 92728 (0.00098) [2022-07-09 04:43:28,383][26022] Updated weights on worker 0-0, policy_version 92738 (0.00090) [2022-07-09 04:43:29,339][25689] Fps is (10 sec: 5672.8, 60 sec: 5723.9, 300 sec: 5691.1). Total num frames: 94968832. Throughput: 0: 6000.4. Samples: 94970550. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:29,339][25689] Avg episode reward: [(0, '-52.277')] [2022-07-09 04:43:29,933][26022] Updated weights on worker 0-0, policy_version 92748 (0.00094) [2022-07-09 04:43:31,888][26022] Updated weights on worker 0-0, policy_version 92758 (0.00091) [2022-07-09 04:43:33,583][26022] Updated weights on worker 0-0, policy_version 92768 (0.00093) [2022-07-09 04:43:34,366][25689] Fps is (10 sec: 5813.9, 60 sec: 5723.2, 300 sec: 5694.2). Total num frames: 94998528. Throughput: 0: 6017.0. Samples: 95005346. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:34,367][25689] Avg episode reward: [(0, '-53.073')] [2022-07-09 04:43:35,482][26022] Updated weights on worker 0-0, policy_version 92778 (0.00057) [2022-07-09 04:43:37,134][26022] Updated weights on worker 0-0, policy_version 92788 (0.00096) [2022-07-09 04:43:38,976][26022] Updated weights on worker 0-0, policy_version 92798 (0.00088) [2022-07-09 04:43:39,371][25689] Fps is (10 sec: 5818.1, 60 sec: 5709.2, 300 sec: 5695.7). Total num frames: 95027200. Throughput: 0: 5149.9. Samples: 95022538. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:39,372][25689] Avg episode reward: [(0, '-52.409')] [2022-07-09 04:43:40,678][26022] Updated weights on worker 0-0, policy_version 92808 (0.00088) [2022-07-09 04:43:42,635][26022] Updated weights on worker 0-0, policy_version 92818 (0.00085) [2022-07-09 04:43:44,419][26022] Updated weights on worker 0-0, policy_version 92828 (0.00088) [2022-07-09 04:43:44,464][25689] Fps is (10 sec: 5780.2, 60 sec: 5711.1, 300 sec: 5697.5). Total num frames: 95056896. Throughput: 0: 6000.9. Samples: 95056790. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:44,465][25689] Avg episode reward: [(0, '-51.684')] [2022-07-09 04:43:46,273][26022] Updated weights on worker 0-0, policy_version 92838 (0.00755) [2022-07-09 04:43:47,876][26022] Updated weights on worker 0-0, policy_version 92848 (0.00088) [2022-07-09 04:43:49,486][25689] Fps is (10 sec: 5770.2, 60 sec: 5696.6, 300 sec: 5697.8). Total num frames: 95085568. Throughput: 0: 6000.3. Samples: 95091442. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:49,487][25689] Avg episode reward: [(0, '-51.553')] [2022-07-09 04:43:49,728][26022] Updated weights on worker 0-0, policy_version 92858 (0.00090) [2022-07-09 04:43:51,528][26022] Updated weights on worker 0-0, policy_version 92868 (0.00093) [2022-07-09 04:43:53,338][26022] Updated weights on worker 0-0, policy_version 92878 (0.00084) [2022-07-09 04:43:54,490][25689] Fps is (10 sec: 5719.6, 60 sec: 5731.6, 300 sec: 5698.2). Total num frames: 95114240. Throughput: 0: 5123.1. Samples: 95108442. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:54,490][25689] Avg episode reward: [(0, '-50.723')] [2022-07-09 04:43:55,264][26022] Updated weights on worker 0-0, policy_version 92888 (0.00087) [2022-07-09 04:43:57,020][26022] Updated weights on worker 0-0, policy_version 92898 (0.00096) [2022-07-09 04:43:58,633][26022] Updated weights on worker 0-0, policy_version 92908 (0.00459) [2022-07-09 04:43:59,551][25689] Fps is (10 sec: 5595.9, 60 sec: 5713.4, 300 sec: 5695.6). Total num frames: 95141888. Throughput: 0: 5967.0. Samples: 95142952. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:43:59,551][25689] Avg episode reward: [(0, '-51.168')] [2022-07-09 04:44:00,496][26022] Updated weights on worker 0-0, policy_version 92918 (0.00089) [2022-07-09 04:44:02,580][26022] Updated weights on worker 0-0, policy_version 92928 (0.00087) [2022-07-09 04:44:04,490][26022] Updated weights on worker 0-0, policy_version 92938 (0.00098) [2022-07-09 04:44:04,604][25689] Fps is (10 sec: 5366.0, 60 sec: 5662.8, 300 sec: 5691.2). Total num frames: 95168512. Throughput: 0: 5891.2. Samples: 95175438. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:04,605][25689] Avg episode reward: [(0, '-51.332')] [2022-07-09 04:44:06,033][26022] Updated weights on worker 0-0, policy_version 92948 (0.00086) [2022-07-09 04:44:08,152][26022] Updated weights on worker 0-0, policy_version 92958 (0.00092) [2022-07-09 04:44:09,633][25689] Fps is (10 sec: 5586.4, 60 sec: 5728.7, 300 sec: 5691.1). Total num frames: 95198208. Throughput: 0: 5028.3. Samples: 95192744. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:09,634][25689] Avg episode reward: [(0, '-52.265')] [2022-07-09 04:44:09,654][26022] Updated weights on worker 0-0, policy_version 92968 (0.00085) [2022-07-09 04:44:11,639][26022] Updated weights on worker 0-0, policy_version 92978 (0.00094) [2022-07-09 04:44:13,419][26022] Updated weights on worker 0-0, policy_version 92988 (0.00086) [2022-07-09 04:44:14,645][25689] Fps is (10 sec: 5813.1, 60 sec: 5711.8, 300 sec: 5698.3). Total num frames: 95226880. Throughput: 0: 5888.1. Samples: 95227116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:14,646][25689] Avg episode reward: [(0, '-51.911')] [2022-07-09 04:44:15,072][26022] Updated weights on worker 0-0, policy_version 92998 (0.00084) [2022-07-09 04:44:16,947][26022] Updated weights on worker 0-0, policy_version 93008 (0.00085) [2022-07-09 04:44:18,544][26022] Updated weights on worker 0-0, policy_version 93018 (0.00090) [2022-07-09 04:44:19,730][25689] Fps is (10 sec: 5679.1, 60 sec: 5687.0, 300 sec: 5690.8). Total num frames: 95255552. Throughput: 0: 5898.1. Samples: 95261970. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:19,731][25689] Avg episode reward: [(0, '-52.442')] [2022-07-09 04:44:20,489][26022] Updated weights on worker 0-0, policy_version 93028 (0.00092) [2022-07-09 04:44:22,258][26022] Updated weights on worker 0-0, policy_version 93038 (0.00081) [2022-07-09 04:44:23,986][26022] Updated weights on worker 0-0, policy_version 93048 (0.00087) [2022-07-09 04:44:24,822][25689] Fps is (10 sec: 5835.8, 60 sec: 5721.3, 300 sec: 5696.6). Total num frames: 95286272. Throughput: 0: 5127.2. Samples: 95279100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:24,823][25689] Avg episode reward: [(0, '-52.253')] [2022-07-09 04:44:25,923][26022] Updated weights on worker 0-0, policy_version 93058 (0.00087) [2022-07-09 04:44:26,182][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:44:26,194][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000093061_95294464.pth [2022-07-09 04:44:26,194][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000091056_93241344.pth [2022-07-09 04:44:27,610][26022] Updated weights on worker 0-0, policy_version 93068 (0.00364) [2022-07-09 04:44:29,391][26022] Updated weights on worker 0-0, policy_version 93078 (0.00084) [2022-07-09 04:44:29,867][25689] Fps is (10 sec: 5758.2, 60 sec: 5701.3, 300 sec: 5689.3). Total num frames: 95313920. Throughput: 0: 5963.4. Samples: 95313406. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:29,867][25689] Avg episode reward: [(0, '-51.727')] [2022-07-09 04:44:31,258][26022] Updated weights on worker 0-0, policy_version 93088 (0.00088) [2022-07-09 04:44:33,001][26022] Updated weights on worker 0-0, policy_version 93098 (0.00090) [2022-07-09 04:44:34,803][26022] Updated weights on worker 0-0, policy_version 93108 (0.00099) [2022-07-09 04:44:34,918][25689] Fps is (10 sec: 5680.0, 60 sec: 5699.1, 300 sec: 5698.7). Total num frames: 95343616. Throughput: 0: 5947.8. Samples: 95347696. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:34,918][25689] Avg episode reward: [(0, '-52.438')] [2022-07-09 04:44:36,633][26022] Updated weights on worker 0-0, policy_version 93118 (0.00095) [2022-07-09 04:44:38,343][26022] Updated weights on worker 0-0, policy_version 93128 (0.00099) [2022-07-09 04:44:39,931][25689] Fps is (10 sec: 5698.0, 60 sec: 5681.4, 300 sec: 5694.5). Total num frames: 95371264. Throughput: 0: 5089.4. Samples: 95364778. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:39,931][25689] Avg episode reward: [(0, '-52.707')] [2022-07-09 04:44:40,298][26022] Updated weights on worker 0-0, policy_version 93138 (0.00087) [2022-07-09 04:44:41,964][26022] Updated weights on worker 0-0, policy_version 93148 (0.00093) [2022-07-09 04:44:43,740][26022] Updated weights on worker 0-0, policy_version 93158 (0.00087) [2022-07-09 04:44:45,023][25689] Fps is (10 sec: 5472.2, 60 sec: 5647.7, 300 sec: 5686.0). Total num frames: 95398912. Throughput: 0: 5920.9. Samples: 95398706. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:45,024][25689] Avg episode reward: [(0, '-53.761')] [2022-07-09 04:44:45,658][26022] Updated weights on worker 0-0, policy_version 93168 (0.00080) [2022-07-09 04:44:47,463][26022] Updated weights on worker 0-0, policy_version 93178 (0.01276) [2022-07-09 04:44:49,083][26022] Updated weights on worker 0-0, policy_version 93188 (0.00091) [2022-07-09 04:44:50,062][25689] Fps is (10 sec: 5760.8, 60 sec: 5679.9, 300 sec: 5695.8). Total num frames: 95429632. Throughput: 0: 5938.0. Samples: 95433330. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:50,064][25689] Avg episode reward: [(0, '-54.352')] [2022-07-09 04:44:51,220][26022] Updated weights on worker 0-0, policy_version 93198 (0.00091) [2022-07-09 04:44:52,563][26022] Updated weights on worker 0-0, policy_version 93208 (0.00090) [2022-07-09 04:44:54,512][26022] Updated weights on worker 0-0, policy_version 93218 (0.00088) [2022-07-09 04:44:55,067][25689] Fps is (10 sec: 5811.0, 60 sec: 5662.8, 300 sec: 5685.4). Total num frames: 95457280. Throughput: 0: 5108.3. Samples: 95450624. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:44:55,068][25689] Avg episode reward: [(0, '-54.959')] [2022-07-09 04:44:56,351][26022] Updated weights on worker 0-0, policy_version 93228 (0.00096) [2022-07-09 04:44:58,082][26022] Updated weights on worker 0-0, policy_version 93238 (0.00093) [2022-07-09 04:44:59,995][26022] Updated weights on worker 0-0, policy_version 93248 (0.00090) [2022-07-09 04:45:00,083][25689] Fps is (10 sec: 5722.5, 60 sec: 5700.9, 300 sec: 5701.3). Total num frames: 95486976. Throughput: 0: 5951.9. Samples: 95484726. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:45:00,084][25689] Avg episode reward: [(0, '-55.156')] [2022-07-09 04:45:02,152][26022] Updated weights on worker 0-0, policy_version 93258 (0.00081) [2022-07-09 04:45:03,751][26022] Updated weights on worker 0-0, policy_version 93268 (0.00097) [2022-07-09 04:45:05,130][25689] Fps is (10 sec: 5494.9, 60 sec: 5684.5, 300 sec: 5691.6). Total num frames: 95512576. Throughput: 0: 5874.2. Samples: 95516822. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:45:05,131][25689] Avg episode reward: [(0, '-54.080')] [2022-07-09 04:45:05,899][26022] Updated weights on worker 0-0, policy_version 93278 (0.00081) [2022-07-09 04:45:07,417][26022] Updated weights on worker 0-0, policy_version 93288 (0.00087) [2022-07-09 04:45:09,338][26022] Updated weights on worker 0-0, policy_version 93298 (0.00090) [2022-07-09 04:45:10,134][25689] Fps is (10 sec: 5501.5, 60 sec: 5686.8, 300 sec: 5689.1). Total num frames: 95542272. Throughput: 0: 5862.8. Samples: 95551008. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 04:45:10,135][25689] Avg episode reward: [(0, '-54.607')] [2022-07-09 04:45:11,223][26022] Updated weights on worker 0-0, policy_version 93308 (0.00073) [2022-07-09 04:45:12,786][26022] Updated weights on worker 0-0, policy_version 93318 (0.00097) [2022-07-09 04:45:14,962][26022] Updated weights on worker 0-0, policy_version 93328 (0.00084) [2022-07-09 04:45:15,147][25689] Fps is (10 sec: 5622.9, 60 sec: 5653.0, 300 sec: 5686.7). Total num frames: 95568896. Throughput: 0: 5831.2. Samples: 95567710. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:15,147][25689] Avg episode reward: [(0, '-54.201')] [2022-07-09 04:45:16,348][26022] Updated weights on worker 0-0, policy_version 93338 (0.00085) [2022-07-09 04:45:18,394][26022] Updated weights on worker 0-0, policy_version 93348 (0.00086) [2022-07-09 04:45:20,151][25689] Fps is (10 sec: 5520.7, 60 sec: 5660.6, 300 sec: 5684.4). Total num frames: 95597568. Throughput: 0: 5865.3. Samples: 95602426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:20,151][25689] Avg episode reward: [(0, '-53.080')] [2022-07-09 04:45:20,220][26022] Updated weights on worker 0-0, policy_version 93358 (0.00090) [2022-07-09 04:45:21,916][26022] Updated weights on worker 0-0, policy_version 93368 (0.00092) [2022-07-09 04:45:23,711][26022] Updated weights on worker 0-0, policy_version 93378 (0.00082) [2022-07-09 04:45:25,208][25689] Fps is (10 sec: 5801.2, 60 sec: 5646.9, 300 sec: 5688.7). Total num frames: 95627264. Throughput: 0: 5962.0. Samples: 95636524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:25,208][25689] Avg episode reward: [(0, '-53.045')] [2022-07-09 04:45:25,457][26022] Updated weights on worker 0-0, policy_version 93388 (0.00088) [2022-07-09 04:45:27,378][26022] Updated weights on worker 0-0, policy_version 93398 (0.00091) [2022-07-09 04:45:29,161][26022] Updated weights on worker 0-0, policy_version 93408 (0.00086) [2022-07-09 04:45:30,235][25689] Fps is (10 sec: 5686.5, 60 sec: 5648.5, 300 sec: 5685.2). Total num frames: 95654912. Throughput: 0: 5095.0. Samples: 95653420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:30,235][25689] Avg episode reward: [(0, '-52.694')] [2022-07-09 04:45:30,751][26022] Updated weights on worker 0-0, policy_version 93418 (0.00112) [2022-07-09 04:45:32,775][26022] Updated weights on worker 0-0, policy_version 93428 (0.00086) [2022-07-09 04:45:34,485][26022] Updated weights on worker 0-0, policy_version 93438 (0.00089) [2022-07-09 04:45:35,310][25689] Fps is (10 sec: 5575.4, 60 sec: 5629.4, 300 sec: 5680.6). Total num frames: 95683584. Throughput: 0: 5940.1. Samples: 95687480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:35,310][25689] Avg episode reward: [(0, '-51.878')] [2022-07-09 04:45:36,327][26022] Updated weights on worker 0-0, policy_version 93448 (0.00090) [2022-07-09 04:45:38,188][26022] Updated weights on worker 0-0, policy_version 93458 (0.00090) [2022-07-09 04:45:39,966][26022] Updated weights on worker 0-0, policy_version 93468 (0.00085) [2022-07-09 04:45:40,314][25689] Fps is (10 sec: 5689.5, 60 sec: 5647.1, 300 sec: 5683.0). Total num frames: 95712256. Throughput: 0: 5915.9. Samples: 95721708. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:40,314][25689] Avg episode reward: [(0, '-51.447')] [2022-07-09 04:45:41,723][26022] Updated weights on worker 0-0, policy_version 93478 (0.00086) [2022-07-09 04:45:43,570][26022] Updated weights on worker 0-0, policy_version 93488 (0.00052) [2022-07-09 04:45:45,226][26022] Updated weights on worker 0-0, policy_version 93498 (0.00086) [2022-07-09 04:45:45,364][25689] Fps is (10 sec: 5805.3, 60 sec: 5685.0, 300 sec: 5682.5). Total num frames: 95741952. Throughput: 0: 5073.6. Samples: 95738788. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:45,364][25689] Avg episode reward: [(0, '-51.133')] [2022-07-09 04:45:47,074][26022] Updated weights on worker 0-0, policy_version 93508 (0.00092) [2022-07-09 04:45:48,981][26022] Updated weights on worker 0-0, policy_version 93518 (0.00090) [2022-07-09 04:45:50,379][25689] Fps is (10 sec: 5798.8, 60 sec: 5653.4, 300 sec: 5683.5). Total num frames: 95770624. Throughput: 0: 5948.8. Samples: 95773256. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:50,380][25689] Avg episode reward: [(0, '-50.863')] [2022-07-09 04:45:50,692][26022] Updated weights on worker 0-0, policy_version 93528 (0.00092) [2022-07-09 04:45:52,509][26022] Updated weights on worker 0-0, policy_version 93538 (0.01127) [2022-07-09 04:45:54,327][26022] Updated weights on worker 0-0, policy_version 93549 (0.00087) [2022-07-09 04:45:55,404][25689] Fps is (10 sec: 5711.4, 60 sec: 5668.4, 300 sec: 5687.9). Total num frames: 95799296. Throughput: 0: 5977.7. Samples: 95807600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:45:55,405][25689] Avg episode reward: [(0, '-50.911')] [2022-07-09 04:45:56,299][26022] Updated weights on worker 0-0, policy_version 93559 (0.00083) [2022-07-09 04:45:58,195][26022] Updated weights on worker 0-0, policy_version 93569 (0.00089) [2022-07-09 04:45:59,985][26022] Updated weights on worker 0-0, policy_version 93579 (0.00088) [2022-07-09 04:46:00,504][25689] Fps is (10 sec: 5664.0, 60 sec: 5643.7, 300 sec: 5687.9). Total num frames: 95827968. Throughput: 0: 5098.2. Samples: 95824640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:00,504][25689] Avg episode reward: [(0, '-51.630')] [2022-07-09 04:46:01,640][26022] Updated weights on worker 0-0, policy_version 93589 (0.00092) [2022-07-09 04:46:03,789][26022] Updated weights on worker 0-0, policy_version 93599 (0.00093) [2022-07-09 04:46:05,501][26022] Updated weights on worker 0-0, policy_version 93609 (0.00100) [2022-07-09 04:46:05,584][25689] Fps is (10 sec: 5532.4, 60 sec: 5674.4, 300 sec: 5684.2). Total num frames: 95855616. Throughput: 0: 5855.7. Samples: 95857192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:05,585][25689] Avg episode reward: [(0, '-52.375')] [2022-07-09 04:46:07,429][26022] Updated weights on worker 0-0, policy_version 93619 (0.00086) [2022-07-09 04:46:09,261][26022] Updated weights on worker 0-0, policy_version 93629 (0.00086) [2022-07-09 04:46:10,600][25689] Fps is (10 sec: 5476.9, 60 sec: 5639.5, 300 sec: 5684.1). Total num frames: 95883264. Throughput: 0: 5835.8. Samples: 95891258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:10,600][25689] Avg episode reward: [(0, '-52.373')] [2022-07-09 04:46:11,019][26022] Updated weights on worker 0-0, policy_version 93639 (0.00089) [2022-07-09 04:46:12,805][26022] Updated weights on worker 0-0, policy_version 93649 (0.00088) [2022-07-09 04:46:14,512][26022] Updated weights on worker 0-0, policy_version 93659 (0.00083) [2022-07-09 04:46:15,605][25689] Fps is (10 sec: 5620.3, 60 sec: 5674.0, 300 sec: 5681.6). Total num frames: 95911936. Throughput: 0: 4986.1. Samples: 95908324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:15,606][25689] Avg episode reward: [(0, '-51.891')] [2022-07-09 04:46:16,570][26022] Updated weights on worker 0-0, policy_version 93669 (0.00095) [2022-07-09 04:46:17,988][26022] Updated weights on worker 0-0, policy_version 93679 (0.00088) [2022-07-09 04:46:20,012][26022] Updated weights on worker 0-0, policy_version 93689 (0.00083) [2022-07-09 04:46:20,622][25689] Fps is (10 sec: 5823.6, 60 sec: 5689.7, 300 sec: 5688.9). Total num frames: 95941632. Throughput: 0: 5868.2. Samples: 95942700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:20,623][25689] Avg episode reward: [(0, '-52.027')] [2022-07-09 04:46:21,646][26022] Updated weights on worker 0-0, policy_version 93699 (0.00090) [2022-07-09 04:46:23,519][26022] Updated weights on worker 0-0, policy_version 93709 (0.00089) [2022-07-09 04:46:25,200][26022] Updated weights on worker 0-0, policy_version 93719 (0.00091) [2022-07-09 04:46:25,664][25689] Fps is (10 sec: 5599.1, 60 sec: 5640.4, 300 sec: 5681.6). Total num frames: 95968256. Throughput: 0: 5973.6. Samples: 95977140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:25,664][25689] Avg episode reward: [(0, '-51.520')] [2022-07-09 04:46:26,225][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:46:26,235][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000093724_95973376.pth [2022-07-09 04:46:26,245][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000091725_93926400.pth [2022-07-09 04:46:27,102][26022] Updated weights on worker 0-0, policy_version 93729 (0.00085) [2022-07-09 04:46:28,836][26022] Updated weights on worker 0-0, policy_version 93739 (0.00087) [2022-07-09 04:46:30,687][25689] Fps is (10 sec: 5596.0, 60 sec: 5674.6, 300 sec: 5685.2). Total num frames: 95997952. Throughput: 0: 5129.3. Samples: 95994290. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:30,687][25689] Avg episode reward: [(0, '-51.281')] [2022-07-09 04:46:30,804][26022] Updated weights on worker 0-0, policy_version 93749 (0.00096) [2022-07-09 04:46:32,231][26022] Updated weights on worker 0-0, policy_version 93759 (0.00088) [2022-07-09 04:46:34,367][26022] Updated weights on worker 0-0, policy_version 93769 (0.00086) [2022-07-09 04:46:35,698][25689] Fps is (10 sec: 6021.2, 60 sec: 5714.5, 300 sec: 5688.8). Total num frames: 96028672. Throughput: 0: 5991.7. Samples: 96028714. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:35,699][25689] Avg episode reward: [(0, '-51.048')] [2022-07-09 04:46:35,887][26022] Updated weights on worker 0-0, policy_version 93779 (0.00088) [2022-07-09 04:46:37,927][26022] Updated weights on worker 0-0, policy_version 93789 (0.00086) [2022-07-09 04:46:39,832][26022] Updated weights on worker 0-0, policy_version 93799 (0.00089) [2022-07-09 04:46:40,701][25689] Fps is (10 sec: 5828.6, 60 sec: 5697.6, 300 sec: 5685.9). Total num frames: 96056320. Throughput: 0: 5992.8. Samples: 96063026. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:40,701][25689] Avg episode reward: [(0, '-51.527')] [2022-07-09 04:46:41,399][26022] Updated weights on worker 0-0, policy_version 93809 (0.00093) [2022-07-09 04:46:43,246][26022] Updated weights on worker 0-0, policy_version 93819 (0.00087) [2022-07-09 04:46:45,091][26022] Updated weights on worker 0-0, policy_version 93829 (0.00087) [2022-07-09 04:46:45,775][25689] Fps is (10 sec: 5385.4, 60 sec: 5644.5, 300 sec: 5678.0). Total num frames: 96082944. Throughput: 0: 5131.7. Samples: 96080346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:45,777][25689] Avg episode reward: [(0, '-52.090')] [2022-07-09 04:46:46,675][26022] Updated weights on worker 0-0, policy_version 93839 (0.00087) [2022-07-09 04:46:48,784][26022] Updated weights on worker 0-0, policy_version 93849 (0.00094) [2022-07-09 04:46:50,233][26022] Updated weights on worker 0-0, policy_version 93859 (0.00056) [2022-07-09 04:46:50,830][25689] Fps is (10 sec: 5661.1, 60 sec: 5674.7, 300 sec: 5687.4). Total num frames: 96113664. Throughput: 0: 5976.1. Samples: 96114670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:50,831][25689] Avg episode reward: [(0, '-52.406')] [2022-07-09 04:46:52,056][26022] Updated weights on worker 0-0, policy_version 93869 (0.00100) [2022-07-09 04:46:54,107][26022] Updated weights on worker 0-0, policy_version 93879 (0.00092) [2022-07-09 04:46:55,855][25689] Fps is (10 sec: 5790.4, 60 sec: 5657.7, 300 sec: 5676.9). Total num frames: 96141312. Throughput: 0: 5962.9. Samples: 96148914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:46:55,857][25689] Avg episode reward: [(0, '-52.160')] [2022-07-09 04:46:55,906][26022] Updated weights on worker 0-0, policy_version 93889 (0.00081) [2022-07-09 04:46:57,690][26022] Updated weights on worker 0-0, policy_version 93899 (0.00093) [2022-07-09 04:46:59,374][26022] Updated weights on worker 0-0, policy_version 93909 (0.00084) [2022-07-09 04:47:00,931][25689] Fps is (10 sec: 5576.0, 60 sec: 5660.0, 300 sec: 5684.4). Total num frames: 96169984. Throughput: 0: 5090.1. Samples: 96165998. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 04:47:00,931][25689] Avg episode reward: [(0, '-53.432')] [2022-07-09 04:47:01,210][26022] Updated weights on worker 0-0, policy_version 93919 (0.00090) [2022-07-09 04:47:03,354][26022] Updated weights on worker 0-0, policy_version 93929 (0.00083) [2022-07-09 04:47:05,042][26022] Updated weights on worker 0-0, policy_version 93939 (0.00095) [2022-07-09 04:47:06,019][25689] Fps is (10 sec: 5440.8, 60 sec: 5642.3, 300 sec: 5676.8). Total num frames: 96196608. Throughput: 0: 5827.1. Samples: 96198306. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:06,019][25689] Avg episode reward: [(0, '-52.979')] [2022-07-09 04:47:06,859][26022] Updated weights on worker 0-0, policy_version 93949 (0.00086) [2022-07-09 04:47:08,852][26022] Updated weights on worker 0-0, policy_version 93959 (0.00093) [2022-07-09 04:47:10,428][26022] Updated weights on worker 0-0, policy_version 93969 (0.00090) [2022-07-09 04:47:11,033][25689] Fps is (10 sec: 5777.6, 60 sec: 5710.2, 300 sec: 5687.8). Total num frames: 96228352. Throughput: 0: 5836.1. Samples: 96232576. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:11,034][25689] Avg episode reward: [(0, '-53.897')] [2022-07-09 04:47:12,293][26022] Updated weights on worker 0-0, policy_version 93979 (0.00087) [2022-07-09 04:47:14,097][26022] Updated weights on worker 0-0, policy_version 93989 (0.00087) [2022-07-09 04:47:15,931][26022] Updated weights on worker 0-0, policy_version 93999 (0.00102) [2022-07-09 04:47:16,047][25689] Fps is (10 sec: 5820.5, 60 sec: 5675.6, 300 sec: 5682.5). Total num frames: 96254976. Throughput: 0: 4991.4. Samples: 96249696. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:16,047][25689] Avg episode reward: [(0, '-53.317')] [2022-07-09 04:47:17,694][26022] Updated weights on worker 0-0, policy_version 94009 (0.00086) [2022-07-09 04:47:19,528][26022] Updated weights on worker 0-0, policy_version 94019 (0.00079) [2022-07-09 04:47:21,049][25689] Fps is (10 sec: 5520.8, 60 sec: 5660.0, 300 sec: 5680.2). Total num frames: 96283648. Throughput: 0: 5872.8. Samples: 96284150. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:21,050][25689] Avg episode reward: [(0, '-53.213')] [2022-07-09 04:47:21,318][26022] Updated weights on worker 0-0, policy_version 94029 (0.00090) [2022-07-09 04:47:23,110][26022] Updated weights on worker 0-0, policy_version 94039 (0.00087) [2022-07-09 04:47:24,890][26022] Updated weights on worker 0-0, policy_version 94049 (0.00096) [2022-07-09 04:47:26,107][25689] Fps is (10 sec: 5598.4, 60 sec: 5675.4, 300 sec: 5680.9). Total num frames: 96311296. Throughput: 0: 5979.3. Samples: 96318418. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:26,108][25689] Avg episode reward: [(0, '-53.498')] [2022-07-09 04:47:26,657][26022] Updated weights on worker 0-0, policy_version 94059 (0.00052) [2022-07-09 04:47:28,660][26022] Updated weights on worker 0-0, policy_version 94069 (0.00089) [2022-07-09 04:47:30,167][26022] Updated weights on worker 0-0, policy_version 94079 (0.00088) [2022-07-09 04:47:31,116][25689] Fps is (10 sec: 5696.4, 60 sec: 5676.7, 300 sec: 5681.1). Total num frames: 96340992. Throughput: 0: 5123.1. Samples: 96335462. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:31,116][25689] Avg episode reward: [(0, '-52.064')] [2022-07-09 04:47:32,109][26022] Updated weights on worker 0-0, policy_version 94089 (0.00086) [2022-07-09 04:47:33,860][26022] Updated weights on worker 0-0, policy_version 94099 (0.00086) [2022-07-09 04:47:35,608][26022] Updated weights on worker 0-0, policy_version 94109 (0.00085) [2022-07-09 04:47:36,144][25689] Fps is (10 sec: 5917.1, 60 sec: 5658.2, 300 sec: 5681.3). Total num frames: 96370688. Throughput: 0: 5985.6. Samples: 96369990. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:36,145][25689] Avg episode reward: [(0, '-51.747')] [2022-07-09 04:47:37,370][26022] Updated weights on worker 0-0, policy_version 94119 (0.00095) [2022-07-09 04:47:39,182][26022] Updated weights on worker 0-0, policy_version 94129 (0.00076) [2022-07-09 04:47:40,996][26022] Updated weights on worker 0-0, policy_version 94139 (0.00092) [2022-07-09 04:47:41,165][25689] Fps is (10 sec: 5808.5, 60 sec: 5673.5, 300 sec: 5679.6). Total num frames: 96399360. Throughput: 0: 5979.4. Samples: 96404428. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:41,165][25689] Avg episode reward: [(0, '-52.147')] [2022-07-09 04:47:42,670][26022] Updated weights on worker 0-0, policy_version 94149 (0.00099) [2022-07-09 04:47:44,512][26022] Updated weights on worker 0-0, policy_version 94159 (0.00087) [2022-07-09 04:47:46,204][25689] Fps is (10 sec: 5700.1, 60 sec: 5710.6, 300 sec: 5676.3). Total num frames: 96428032. Throughput: 0: 5131.3. Samples: 96421544. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:46,205][25689] Avg episode reward: [(0, '-52.128')] [2022-07-09 04:47:46,281][26022] Updated weights on worker 0-0, policy_version 94169 (0.00082) [2022-07-09 04:47:48,155][26022] Updated weights on worker 0-0, policy_version 94179 (0.00093) [2022-07-09 04:47:50,062][26022] Updated weights on worker 0-0, policy_version 94189 (0.00081) [2022-07-09 04:47:51,223][25689] Fps is (10 sec: 5497.5, 60 sec: 5646.2, 300 sec: 5676.2). Total num frames: 96454656. Throughput: 0: 5983.7. Samples: 96455776. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:51,223][25689] Avg episode reward: [(0, '-51.694')] [2022-07-09 04:47:51,608][26022] Updated weights on worker 0-0, policy_version 94199 (0.00084) [2022-07-09 04:47:53,517][26022] Updated weights on worker 0-0, policy_version 94209 (0.00089) [2022-07-09 04:47:55,193][26022] Updated weights on worker 0-0, policy_version 94219 (0.00090) [2022-07-09 04:47:56,231][25689] Fps is (10 sec: 5719.3, 60 sec: 5698.8, 300 sec: 5683.9). Total num frames: 96485376. Throughput: 0: 5999.0. Samples: 96490490. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:47:56,231][25689] Avg episode reward: [(0, '-51.257')] [2022-07-09 04:47:57,199][26022] Updated weights on worker 0-0, policy_version 94229 (0.00087) [2022-07-09 04:47:58,723][26022] Updated weights on worker 0-0, policy_version 94239 (0.00090) [2022-07-09 04:48:00,551][26022] Updated weights on worker 0-0, policy_version 94249 (0.00086) [2022-07-09 04:48:01,240][25689] Fps is (10 sec: 6031.1, 60 sec: 5721.9, 300 sec: 5684.8). Total num frames: 96515072. Throughput: 0: 5156.1. Samples: 96507940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:01,242][25689] Avg episode reward: [(0, '-51.665')] [2022-07-09 04:48:02,707][26022] Updated weights on worker 0-0, policy_version 94259 (0.00085) [2022-07-09 04:48:04,528][26022] Updated weights on worker 0-0, policy_version 94269 (0.00087) [2022-07-09 04:48:06,230][26022] Updated weights on worker 0-0, policy_version 94279 (0.00080) [2022-07-09 04:48:06,347][25689] Fps is (10 sec: 5567.2, 60 sec: 5720.2, 300 sec: 5686.4). Total num frames: 96541696. Throughput: 0: 5897.7. Samples: 96540340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:06,347][25689] Avg episode reward: [(0, '-51.182')] [2022-07-09 04:48:08,167][26022] Updated weights on worker 0-0, policy_version 94289 (0.00093) [2022-07-09 04:48:09,731][26022] Updated weights on worker 0-0, policy_version 94299 (0.00093) [2022-07-09 04:48:11,357][25689] Fps is (10 sec: 5364.5, 60 sec: 5652.7, 300 sec: 5679.5). Total num frames: 96569344. Throughput: 0: 5892.9. Samples: 96574426. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:11,359][25689] Avg episode reward: [(0, '-50.627')] [2022-07-09 04:48:11,883][26022] Updated weights on worker 0-0, policy_version 94309 (0.00897) [2022-07-09 04:48:13,383][26022] Updated weights on worker 0-0, policy_version 94319 (0.00086) [2022-07-09 04:48:15,257][26022] Updated weights on worker 0-0, policy_version 94329 (0.00088) [2022-07-09 04:48:16,383][25689] Fps is (10 sec: 5713.9, 60 sec: 5702.5, 300 sec: 5679.1). Total num frames: 96599040. Throughput: 0: 5030.8. Samples: 96591872. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:16,383][25689] Avg episode reward: [(0, '-50.096')] [2022-07-09 04:48:16,923][26022] Updated weights on worker 0-0, policy_version 94339 (0.00098) [2022-07-09 04:48:18,871][26022] Updated weights on worker 0-0, policy_version 94349 (0.00079) [2022-07-09 04:48:20,531][26022] Updated weights on worker 0-0, policy_version 94359 (0.00092) [2022-07-09 04:48:21,433][25689] Fps is (10 sec: 5792.7, 60 sec: 5697.9, 300 sec: 5680.0). Total num frames: 96627712. Throughput: 0: 5881.5. Samples: 96626704. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:21,434][25689] Avg episode reward: [(0, '-50.269')] [2022-07-09 04:48:22,357][26022] Updated weights on worker 0-0, policy_version 94369 (0.00086) [2022-07-09 04:48:24,098][26022] Updated weights on worker 0-0, policy_version 94379 (0.00090) [2022-07-09 04:48:25,784][26022] Updated weights on worker 0-0, policy_version 94389 (0.00084) [2022-07-09 04:48:26,363][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:48:26,376][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000094392_96657408.pth [2022-07-09 04:48:26,376][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000092392_94609408.pth [2022-07-09 04:48:26,484][25689] Fps is (10 sec: 5778.2, 60 sec: 5732.5, 300 sec: 5682.7). Total num frames: 96657408. Throughput: 0: 5999.4. Samples: 96661152. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:26,486][25689] Avg episode reward: [(0, '-51.536')] [2022-07-09 04:48:27,870][26022] Updated weights on worker 0-0, policy_version 94399 (0.00094) [2022-07-09 04:48:29,400][26022] Updated weights on worker 0-0, policy_version 94409 (0.00093) [2022-07-09 04:48:31,459][26022] Updated weights on worker 0-0, policy_version 94419 (0.00086) [2022-07-09 04:48:31,492][25689] Fps is (10 sec: 5701.1, 60 sec: 5698.7, 300 sec: 5676.1). Total num frames: 96685056. Throughput: 0: 6008.2. Samples: 96695398. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:31,493][25689] Avg episode reward: [(0, '-51.415')] [2022-07-09 04:48:33,137][26022] Updated weights on worker 0-0, policy_version 94429 (0.00080) [2022-07-09 04:48:34,823][26022] Updated weights on worker 0-0, policy_version 94439 (0.00087) [2022-07-09 04:48:36,503][25689] Fps is (10 sec: 5723.8, 60 sec: 5700.3, 300 sec: 5679.5). Total num frames: 96714752. Throughput: 0: 6000.7. Samples: 96712606. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:36,505][25689] Avg episode reward: [(0, '-52.075')] [2022-07-09 04:48:36,758][26022] Updated weights on worker 0-0, policy_version 94449 (0.00089) [2022-07-09 04:48:38,394][26022] Updated weights on worker 0-0, policy_version 94459 (0.00096) [2022-07-09 04:48:40,427][26022] Updated weights on worker 0-0, policy_version 94469 (0.00090) [2022-07-09 04:48:41,531][25689] Fps is (10 sec: 5814.2, 60 sec: 5699.6, 300 sec: 5677.3). Total num frames: 96743424. Throughput: 0: 5981.2. Samples: 96746910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:41,531][25689] Avg episode reward: [(0, '-52.031')] [2022-07-09 04:48:42,009][26022] Updated weights on worker 0-0, policy_version 94479 (0.00089) [2022-07-09 04:48:43,895][26022] Updated weights on worker 0-0, policy_version 94489 (0.00095) [2022-07-09 04:48:45,681][26022] Updated weights on worker 0-0, policy_version 94499 (0.00085) [2022-07-09 04:48:46,587][25689] Fps is (10 sec: 5787.7, 60 sec: 5715.0, 300 sec: 5680.1). Total num frames: 96773120. Throughput: 0: 5981.5. Samples: 96781400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:46,588][25689] Avg episode reward: [(0, '-51.846')] [2022-07-09 04:48:47,397][26022] Updated weights on worker 0-0, policy_version 94509 (0.00096) [2022-07-09 04:48:48,945][26022] Updated weights on worker 0-0, policy_version 94519 (0.00089) [2022-07-09 04:48:51,240][26022] Updated weights on worker 0-0, policy_version 94529 (0.00088) [2022-07-09 04:48:51,617][25689] Fps is (10 sec: 5584.0, 60 sec: 5714.0, 300 sec: 5672.7). Total num frames: 96799744. Throughput: 0: 5138.3. Samples: 96798804. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:51,617][25689] Avg episode reward: [(0, '-51.988')] [2022-07-09 04:48:52,671][26022] Updated weights on worker 0-0, policy_version 94539 (0.00090) [2022-07-09 04:48:54,759][26022] Updated weights on worker 0-0, policy_version 94549 (0.00082) [2022-07-09 04:48:56,275][26022] Updated weights on worker 0-0, policy_version 94559 (0.00092) [2022-07-09 04:48:56,640][25689] Fps is (10 sec: 5602.3, 60 sec: 5695.5, 300 sec: 5680.3). Total num frames: 96829440. Throughput: 0: 5974.5. Samples: 96832918. Policy #0 lag: (min: 0.0, avg: 9.8, max: 19.0) [2022-07-09 04:48:56,641][25689] Avg episode reward: [(0, '-51.320')] [2022-07-09 04:48:58,358][26022] Updated weights on worker 0-0, policy_version 94569 (0.00093) [2022-07-09 04:48:59,818][26022] Updated weights on worker 0-0, policy_version 94579 (0.00092) [2022-07-09 04:49:01,671][25689] Fps is (10 sec: 5601.7, 60 sec: 5642.7, 300 sec: 5680.7). Total num frames: 96856064. Throughput: 0: 5949.7. Samples: 96866738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:01,671][25689] Avg episode reward: [(0, '-49.991')] [2022-07-09 04:49:02,199][26022] Updated weights on worker 0-0, policy_version 94589 (0.00079) [2022-07-09 04:49:03,908][26022] Updated weights on worker 0-0, policy_version 94599 (0.00063) [2022-07-09 04:49:05,892][26022] Updated weights on worker 0-0, policy_version 94609 (0.00087) [2022-07-09 04:49:06,785][25689] Fps is (10 sec: 5451.0, 60 sec: 5675.9, 300 sec: 5675.6). Total num frames: 96884736. Throughput: 0: 4984.3. Samples: 96882068. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:06,785][25689] Avg episode reward: [(0, '-51.342')] [2022-07-09 04:49:07,440][26022] Updated weights on worker 0-0, policy_version 94619 (0.00082) [2022-07-09 04:49:09,401][26022] Updated weights on worker 0-0, policy_version 94629 (0.00088) [2022-07-09 04:49:11,007][26022] Updated weights on worker 0-0, policy_version 94639 (0.00094) [2022-07-09 04:49:11,818][25689] Fps is (10 sec: 5651.1, 60 sec: 5690.7, 300 sec: 5675.2). Total num frames: 96913408. Throughput: 0: 5827.9. Samples: 96916536. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:11,819][25689] Avg episode reward: [(0, '-52.827')] [2022-07-09 04:49:12,914][26022] Updated weights on worker 0-0, policy_version 94649 (0.00084) [2022-07-09 04:49:14,769][26022] Updated weights on worker 0-0, policy_version 94659 (0.00085) [2022-07-09 04:49:16,469][26022] Updated weights on worker 0-0, policy_version 94669 (0.00094) [2022-07-09 04:49:16,836][25689] Fps is (10 sec: 5806.8, 60 sec: 5691.4, 300 sec: 5680.0). Total num frames: 96943104. Throughput: 0: 5843.2. Samples: 96950926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:16,837][25689] Avg episode reward: [(0, '-52.839')] [2022-07-09 04:49:18,295][26022] Updated weights on worker 0-0, policy_version 94679 (0.00086) [2022-07-09 04:49:20,159][26022] Updated weights on worker 0-0, policy_version 94689 (0.00091) [2022-07-09 04:49:21,745][26022] Updated weights on worker 0-0, policy_version 94699 (0.00087) [2022-07-09 04:49:21,846][25689] Fps is (10 sec: 5820.7, 60 sec: 5695.3, 300 sec: 5674.6). Total num frames: 96971776. Throughput: 0: 5016.2. Samples: 96967940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:21,846][25689] Avg episode reward: [(0, '-52.293')] [2022-07-09 04:49:23,711][26022] Updated weights on worker 0-0, policy_version 94709 (0.00084) [2022-07-09 04:49:25,335][26022] Updated weights on worker 0-0, policy_version 94719 (0.00086) [2022-07-09 04:49:26,885][25689] Fps is (10 sec: 5604.4, 60 sec: 5662.4, 300 sec: 5674.7). Total num frames: 96999424. Throughput: 0: 5998.4. Samples: 97002640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:26,886][25689] Avg episode reward: [(0, '-52.363')] [2022-07-09 04:49:27,150][26022] Updated weights on worker 0-0, policy_version 94729 (0.00088) [2022-07-09 04:49:29,035][26022] Updated weights on worker 0-0, policy_version 94739 (0.00090) [2022-07-09 04:49:30,644][26022] Updated weights on worker 0-0, policy_version 94749 (0.00100) [2022-07-09 04:49:31,925][25689] Fps is (10 sec: 5587.5, 60 sec: 5676.3, 300 sec: 5671.5). Total num frames: 97028096. Throughput: 0: 6006.9. Samples: 97037318. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:31,926][25689] Avg episode reward: [(0, '-52.903')] [2022-07-09 04:49:32,583][26022] Updated weights on worker 0-0, policy_version 94759 (0.00087) [2022-07-09 04:49:34,159][26022] Updated weights on worker 0-0, policy_version 94769 (0.00090) [2022-07-09 04:49:36,143][26022] Updated weights on worker 0-0, policy_version 94779 (0.00086) [2022-07-09 04:49:36,976][25689] Fps is (10 sec: 5987.1, 60 sec: 5706.4, 300 sec: 5684.6). Total num frames: 97059840. Throughput: 0: 5150.8. Samples: 97054664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:36,977][25689] Avg episode reward: [(0, '-52.051')] [2022-07-09 04:49:38,004][26022] Updated weights on worker 0-0, policy_version 94789 (0.00089) [2022-07-09 04:49:39,534][26022] Updated weights on worker 0-0, policy_version 94799 (0.00083) [2022-07-09 04:49:41,387][26022] Updated weights on worker 0-0, policy_version 94809 (0.00087) [2022-07-09 04:49:42,006][25689] Fps is (10 sec: 5891.7, 60 sec: 5689.3, 300 sec: 5685.8). Total num frames: 97087488. Throughput: 0: 6020.1. Samples: 97089306. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:42,006][25689] Avg episode reward: [(0, '-51.231')] [2022-07-09 04:49:43,214][26022] Updated weights on worker 0-0, policy_version 94819 (0.00092) [2022-07-09 04:49:44,899][26022] Updated weights on worker 0-0, policy_version 94829 (0.00058) [2022-07-09 04:49:46,690][26022] Updated weights on worker 0-0, policy_version 94839 (0.00095) [2022-07-09 04:49:47,088][25689] Fps is (10 sec: 5570.0, 60 sec: 5670.0, 300 sec: 5678.1). Total num frames: 97116160. Throughput: 0: 5996.5. Samples: 97123784. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:47,088][25689] Avg episode reward: [(0, '-50.769')] [2022-07-09 04:49:48,565][26022] Updated weights on worker 0-0, policy_version 94849 (0.00086) [2022-07-09 04:49:50,336][26022] Updated weights on worker 0-0, policy_version 94859 (0.00088) [2022-07-09 04:49:52,044][26022] Updated weights on worker 0-0, policy_version 94869 (0.00094) [2022-07-09 04:49:52,094][25689] Fps is (10 sec: 5786.0, 60 sec: 5723.0, 300 sec: 5684.9). Total num frames: 97145856. Throughput: 0: 5139.8. Samples: 97140980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:52,094][25689] Avg episode reward: [(0, '-51.192')] [2022-07-09 04:49:54,023][26022] Updated weights on worker 0-0, policy_version 94879 (0.00446) [2022-07-09 04:49:55,636][26022] Updated weights on worker 0-0, policy_version 94889 (0.00083) [2022-07-09 04:49:57,137][25689] Fps is (10 sec: 5808.1, 60 sec: 5704.2, 300 sec: 5681.0). Total num frames: 97174528. Throughput: 0: 5992.9. Samples: 97175486. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:49:57,138][25689] Avg episode reward: [(0, '-51.210')] [2022-07-09 04:49:57,446][26022] Updated weights on worker 0-0, policy_version 94899 (0.00085) [2022-07-09 04:49:59,238][26022] Updated weights on worker 0-0, policy_version 94909 (0.00082) [2022-07-09 04:50:00,981][26022] Updated weights on worker 0-0, policy_version 94919 (0.00093) [2022-07-09 04:50:02,146][25689] Fps is (10 sec: 5500.6, 60 sec: 5706.2, 300 sec: 5685.1). Total num frames: 97201152. Throughput: 0: 5907.8. Samples: 97208292. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:02,147][25689] Avg episode reward: [(0, '-50.814')] [2022-07-09 04:50:03,167][26022] Updated weights on worker 0-0, policy_version 94929 (0.00095) [2022-07-09 04:50:05,075][26022] Updated weights on worker 0-0, policy_version 94939 (0.00095) [2022-07-09 04:50:06,766][26022] Updated weights on worker 0-0, policy_version 94949 (0.00088) [2022-07-09 04:50:07,195][25689] Fps is (10 sec: 5599.7, 60 sec: 5729.4, 300 sec: 5684.3). Total num frames: 97230848. Throughput: 0: 5036.0. Samples: 97225042. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:07,195][25689] Avg episode reward: [(0, '-50.446')] [2022-07-09 04:50:08,631][26022] Updated weights on worker 0-0, policy_version 94959 (0.00094) [2022-07-09 04:50:10,284][26022] Updated weights on worker 0-0, policy_version 94969 (0.00094) [2022-07-09 04:50:12,026][26022] Updated weights on worker 0-0, policy_version 94979 (0.00083) [2022-07-09 04:50:12,210][25689] Fps is (10 sec: 5800.1, 60 sec: 5731.1, 300 sec: 5691.1). Total num frames: 97259520. Throughput: 0: 5890.0. Samples: 97259464. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:12,210][25689] Avg episode reward: [(0, '-50.030')] [2022-07-09 04:50:13,711][26022] Updated weights on worker 0-0, policy_version 94989 (0.00080) [2022-07-09 04:50:15,691][26022] Updated weights on worker 0-0, policy_version 94999 (0.00095) [2022-07-09 04:50:17,240][25689] Fps is (10 sec: 5606.8, 60 sec: 5696.1, 300 sec: 5687.2). Total num frames: 97287168. Throughput: 0: 5918.1. Samples: 97294456. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:17,240][25689] Avg episode reward: [(0, '-49.646')] [2022-07-09 04:50:17,363][26022] Updated weights on worker 0-0, policy_version 95009 (0.00096) [2022-07-09 04:50:19,170][26022] Updated weights on worker 0-0, policy_version 95019 (0.00086) [2022-07-09 04:50:21,053][26022] Updated weights on worker 0-0, policy_version 95029 (0.00086) [2022-07-09 04:50:22,260][25689] Fps is (10 sec: 5604.1, 60 sec: 5695.1, 300 sec: 5684.5). Total num frames: 97315840. Throughput: 0: 5128.8. Samples: 97311448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:22,260][25689] Avg episode reward: [(0, '-49.748')] [2022-07-09 04:50:22,930][26022] Updated weights on worker 0-0, policy_version 95039 (0.00089) [2022-07-09 04:50:24,604][26022] Updated weights on worker 0-0, policy_version 95049 (0.00093) [2022-07-09 04:50:26,377][26022] Updated weights on worker 0-0, policy_version 95059 (0.00084) [2022-07-09 04:50:26,521][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:50:26,530][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000095060_97341440.pth [2022-07-09 04:50:26,530][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000093061_95294464.pth [2022-07-09 04:50:27,387][25689] Fps is (10 sec: 5752.4, 60 sec: 5720.7, 300 sec: 5689.4). Total num frames: 97345536. Throughput: 0: 5984.2. Samples: 97345874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:27,387][25689] Avg episode reward: [(0, '-49.623')] [2022-07-09 04:50:28,215][26022] Updated weights on worker 0-0, policy_version 95069 (0.00083) [2022-07-09 04:50:29,957][26022] Updated weights on worker 0-0, policy_version 95079 (0.00087) [2022-07-09 04:50:31,835][26022] Updated weights on worker 0-0, policy_version 95089 (0.00095) [2022-07-09 04:50:32,403][25689] Fps is (10 sec: 5653.4, 60 sec: 5706.1, 300 sec: 5687.1). Total num frames: 97373184. Throughput: 0: 5971.6. Samples: 97380048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:32,403][25689] Avg episode reward: [(0, '-49.813')] [2022-07-09 04:50:33,546][26022] Updated weights on worker 0-0, policy_version 95099 (0.00089) [2022-07-09 04:50:35,460][26022] Updated weights on worker 0-0, policy_version 95109 (0.00086) [2022-07-09 04:50:37,280][26022] Updated weights on worker 0-0, policy_version 95119 (0.00085) [2022-07-09 04:50:37,424][25689] Fps is (10 sec: 5610.9, 60 sec: 5658.0, 300 sec: 5686.8). Total num frames: 97401856. Throughput: 0: 5092.3. Samples: 97397240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:37,425][25689] Avg episode reward: [(0, '-50.000')] [2022-07-09 04:50:38,783][26022] Updated weights on worker 0-0, policy_version 95129 (0.00090) [2022-07-09 04:50:40,976][26022] Updated weights on worker 0-0, policy_version 95139 (0.00086) [2022-07-09 04:50:42,444][25689] Fps is (10 sec: 5813.0, 60 sec: 5692.9, 300 sec: 5687.4). Total num frames: 97431552. Throughput: 0: 5948.8. Samples: 97431520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:42,444][25689] Avg episode reward: [(0, '-50.147')] [2022-07-09 04:50:42,564][26022] Updated weights on worker 0-0, policy_version 95149 (0.00086) [2022-07-09 04:50:44,390][26022] Updated weights on worker 0-0, policy_version 95159 (0.00092) [2022-07-09 04:50:45,984][26022] Updated weights on worker 0-0, policy_version 95169 (0.00083) [2022-07-09 04:50:47,563][25689] Fps is (10 sec: 5757.0, 60 sec: 5689.4, 300 sec: 5685.4). Total num frames: 97460224. Throughput: 0: 5947.6. Samples: 97465874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:47,563][25689] Avg episode reward: [(0, '-51.024')] [2022-07-09 04:50:47,889][26022] Updated weights on worker 0-0, policy_version 95179 (0.00099) [2022-07-09 04:50:49,637][26022] Updated weights on worker 0-0, policy_version 95189 (0.00091) [2022-07-09 04:50:51,369][26022] Updated weights on worker 0-0, policy_version 95199 (0.00088) [2022-07-09 04:50:52,576][25689] Fps is (10 sec: 5760.9, 60 sec: 5688.7, 300 sec: 5689.1). Total num frames: 97489920. Throughput: 0: 5122.5. Samples: 97483382. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 04:50:52,576][25689] Avg episode reward: [(0, '-51.091')] [2022-07-09 04:50:53,260][26022] Updated weights on worker 0-0, policy_version 95209 (0.00079) [2022-07-09 04:50:55,010][26022] Updated weights on worker 0-0, policy_version 95219 (0.00091) [2022-07-09 04:50:56,771][26022] Updated weights on worker 0-0, policy_version 95229 (0.00085) [2022-07-09 04:50:57,597][25689] Fps is (10 sec: 5918.9, 60 sec: 5707.7, 300 sec: 5694.0). Total num frames: 97519616. Throughput: 0: 5983.1. Samples: 97517936. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:50:57,598][25689] Avg episode reward: [(0, '-51.650')] [2022-07-09 04:50:58,610][26022] Updated weights on worker 0-0, policy_version 95239 (0.00086) [2022-07-09 04:51:00,187][26022] Updated weights on worker 0-0, policy_version 95249 (0.00085) [2022-07-09 04:51:02,530][26022] Updated weights on worker 0-0, policy_version 95259 (0.00088) [2022-07-09 04:51:02,637][25689] Fps is (10 sec: 5597.7, 60 sec: 5704.9, 300 sec: 5691.3). Total num frames: 97546240. Throughput: 0: 5999.5. Samples: 97552668. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:02,637][25689] Avg episode reward: [(0, '-51.840')] [2022-07-09 04:51:04,240][26022] Updated weights on worker 0-0, policy_version 95269 (0.00085) [2022-07-09 04:51:05,951][26022] Updated weights on worker 0-0, policy_version 95279 (0.00088) [2022-07-09 04:51:07,703][25689] Fps is (10 sec: 5370.5, 60 sec: 5669.3, 300 sec: 5690.4). Total num frames: 97573888. Throughput: 0: 5056.5. Samples: 97567712. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:07,703][25689] Avg episode reward: [(0, '-51.870')] [2022-07-09 04:51:07,840][26022] Updated weights on worker 0-0, policy_version 95289 (0.00091) [2022-07-09 04:51:09,604][26022] Updated weights on worker 0-0, policy_version 95299 (0.00085) [2022-07-09 04:51:11,395][26022] Updated weights on worker 0-0, policy_version 95309 (0.00091) [2022-07-09 04:51:12,736][25689] Fps is (10 sec: 5678.0, 60 sec: 5684.5, 300 sec: 5693.3). Total num frames: 97603584. Throughput: 0: 5878.9. Samples: 97601904. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:12,737][25689] Avg episode reward: [(0, '-52.051')] [2022-07-09 04:51:13,465][26022] Updated weights on worker 0-0, policy_version 95319 (0.00088) [2022-07-09 04:51:14,783][26022] Updated weights on worker 0-0, policy_version 95329 (0.00083) [2022-07-09 04:51:16,891][26022] Updated weights on worker 0-0, policy_version 95339 (0.00086) [2022-07-09 04:51:17,796][25689] Fps is (10 sec: 5884.6, 60 sec: 5715.6, 300 sec: 5692.5). Total num frames: 97633280. Throughput: 0: 5883.3. Samples: 97636770. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:17,796][25689] Avg episode reward: [(0, '-52.104')] [2022-07-09 04:51:18,478][26022] Updated weights on worker 0-0, policy_version 95349 (0.00085) [2022-07-09 04:51:20,257][26022] Updated weights on worker 0-0, policy_version 95359 (0.00090) [2022-07-09 04:51:22,246][26022] Updated weights on worker 0-0, policy_version 95369 (0.00086) [2022-07-09 04:51:22,799][25689] Fps is (10 sec: 5698.7, 60 sec: 5700.2, 300 sec: 5696.6). Total num frames: 97660928. Throughput: 0: 5032.8. Samples: 97654140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:22,799][25689] Avg episode reward: [(0, '-51.752')] [2022-07-09 04:51:23,785][26022] Updated weights on worker 0-0, policy_version 95379 (0.00102) [2022-07-09 04:51:25,615][26022] Updated weights on worker 0-0, policy_version 95389 (0.00086) [2022-07-09 04:51:27,524][26022] Updated weights on worker 0-0, policy_version 95399 (0.00085) [2022-07-09 04:51:27,860][25689] Fps is (10 sec: 5596.3, 60 sec: 5689.6, 300 sec: 5692.5). Total num frames: 97689600. Throughput: 0: 5976.6. Samples: 97688180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:27,860][25689] Avg episode reward: [(0, '-52.016')] [2022-07-09 04:51:29,264][26022] Updated weights on worker 0-0, policy_version 95409 (0.00094) [2022-07-09 04:51:31,194][26022] Updated weights on worker 0-0, policy_version 95419 (0.00092) [2022-07-09 04:51:32,879][25689] Fps is (10 sec: 5688.8, 60 sec: 5706.2, 300 sec: 5685.4). Total num frames: 97718272. Throughput: 0: 5992.6. Samples: 97722612. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:32,879][25689] Avg episode reward: [(0, '-51.967')] [2022-07-09 04:51:32,944][26022] Updated weights on worker 0-0, policy_version 95429 (0.00089) [2022-07-09 04:51:34,652][26022] Updated weights on worker 0-0, policy_version 95439 (0.00095) [2022-07-09 04:51:36,545][26022] Updated weights on worker 0-0, policy_version 95449 (0.00086) [2022-07-09 04:51:37,919][25689] Fps is (10 sec: 5802.4, 60 sec: 5721.4, 300 sec: 5691.6). Total num frames: 97747968. Throughput: 0: 5981.7. Samples: 97757140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:37,919][25689] Avg episode reward: [(0, '-51.896')] [2022-07-09 04:51:38,284][26022] Updated weights on worker 0-0, policy_version 95459 (0.00407) [2022-07-09 04:51:39,961][26022] Updated weights on worker 0-0, policy_version 95469 (0.00087) [2022-07-09 04:51:41,904][26022] Updated weights on worker 0-0, policy_version 95479 (0.00089) [2022-07-09 04:51:42,925][25689] Fps is (10 sec: 5912.1, 60 sec: 5722.7, 300 sec: 5703.3). Total num frames: 97777664. Throughput: 0: 5961.8. Samples: 97774128. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:42,925][25689] Avg episode reward: [(0, '-51.382')] [2022-07-09 04:51:43,485][26022] Updated weights on worker 0-0, policy_version 95489 (0.00083) [2022-07-09 04:51:45,462][26022] Updated weights on worker 0-0, policy_version 95499 (0.00088) [2022-07-09 04:51:47,060][26022] Updated weights on worker 0-0, policy_version 95509 (0.00088) [2022-07-09 04:51:48,048][25689] Fps is (10 sec: 5661.2, 60 sec: 5705.3, 300 sec: 5691.6). Total num frames: 97805312. Throughput: 0: 5980.8. Samples: 97808924. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:48,050][25689] Avg episode reward: [(0, '-51.233')] [2022-07-09 04:51:48,892][26022] Updated weights on worker 0-0, policy_version 95519 (0.00090) [2022-07-09 04:51:50,787][26022] Updated weights on worker 0-0, policy_version 95529 (0.00088) [2022-07-09 04:51:52,404][26022] Updated weights on worker 0-0, policy_version 95539 (0.00092) [2022-07-09 04:51:53,130][25689] Fps is (10 sec: 5719.5, 60 sec: 5715.7, 300 sec: 5700.9). Total num frames: 97836032. Throughput: 0: 5974.6. Samples: 97843604. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:53,132][25689] Avg episode reward: [(0, '-51.017')] [2022-07-09 04:51:54,273][26022] Updated weights on worker 0-0, policy_version 95549 (0.00089) [2022-07-09 04:51:55,960][26022] Updated weights on worker 0-0, policy_version 95559 (0.00088) [2022-07-09 04:51:57,760][26022] Updated weights on worker 0-0, policy_version 95569 (0.00086) [2022-07-09 04:51:58,200][25689] Fps is (10 sec: 5850.5, 60 sec: 5694.3, 300 sec: 5701.0). Total num frames: 97864704. Throughput: 0: 5110.6. Samples: 97860788. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:51:58,200][25689] Avg episode reward: [(0, '-50.821')] [2022-07-09 04:51:59,703][26022] Updated weights on worker 0-0, policy_version 95579 (0.00086) [2022-07-09 04:52:01,440][26022] Updated weights on worker 0-0, policy_version 95589 (0.00086) [2022-07-09 04:52:03,223][25689] Fps is (10 sec: 5377.2, 60 sec: 5678.9, 300 sec: 5698.8). Total num frames: 97890304. Throughput: 0: 5932.2. Samples: 97894540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:03,224][25689] Avg episode reward: [(0, '-51.382')] [2022-07-09 04:52:03,479][26022] Updated weights on worker 0-0, policy_version 95599 (0.00089) [2022-07-09 04:52:05,343][26022] Updated weights on worker 0-0, policy_version 95609 (0.00096) [2022-07-09 04:52:06,885][26022] Updated weights on worker 0-0, policy_version 95619 (0.00087) [2022-07-09 04:52:08,330][25689] Fps is (10 sec: 5458.4, 60 sec: 5708.8, 300 sec: 5690.1). Total num frames: 97920000. Throughput: 0: 5866.3. Samples: 97927906. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:08,331][25689] Avg episode reward: [(0, '-51.638')] [2022-07-09 04:52:09,029][26022] Updated weights on worker 0-0, policy_version 95629 (0.00086) [2022-07-09 04:52:10,526][26022] Updated weights on worker 0-0, policy_version 95639 (0.00089) [2022-07-09 04:52:12,417][26022] Updated weights on worker 0-0, policy_version 95649 (0.00091) [2022-07-09 04:52:13,359][25689] Fps is (10 sec: 5859.5, 60 sec: 5709.3, 300 sec: 5700.1). Total num frames: 97949696. Throughput: 0: 5026.6. Samples: 97945288. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:13,361][25689] Avg episode reward: [(0, '-51.883')] [2022-07-09 04:52:14,285][26022] Updated weights on worker 0-0, policy_version 95659 (0.00090) [2022-07-09 04:52:15,935][26022] Updated weights on worker 0-0, policy_version 95669 (0.00082) [2022-07-09 04:52:17,794][26022] Updated weights on worker 0-0, policy_version 95679 (0.00091) [2022-07-09 04:52:18,441][25689] Fps is (10 sec: 5874.2, 60 sec: 5707.1, 300 sec: 5702.1). Total num frames: 97979392. Throughput: 0: 5863.5. Samples: 97979472. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:18,442][25689] Avg episode reward: [(0, '-52.329')] [2022-07-09 04:52:19,617][26022] Updated weights on worker 0-0, policy_version 95689 (0.00083) [2022-07-09 04:52:21,403][26022] Updated weights on worker 0-0, policy_version 95699 (0.00091) [2022-07-09 04:52:23,250][26022] Updated weights on worker 0-0, policy_version 95709 (0.00086) [2022-07-09 04:52:23,525][25689] Fps is (10 sec: 5640.7, 60 sec: 5699.5, 300 sec: 5701.5). Total num frames: 98007040. Throughput: 0: 5864.6. Samples: 98013604. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:23,527][25689] Avg episode reward: [(0, '-51.966')] [2022-07-09 04:52:24,884][26022] Updated weights on worker 0-0, policy_version 95719 (0.00092) [2022-07-09 04:52:26,839][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:52:26,850][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000095728_98025472.pth [2022-07-09 04:52:26,850][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000093724_95973376.pth [2022-07-09 04:52:26,944][26022] Updated weights on worker 0-0, policy_version 95729 (0.00094) [2022-07-09 04:52:28,558][26022] Updated weights on worker 0-0, policy_version 95739 (0.00084) [2022-07-09 04:52:28,572][25689] Fps is (10 sec: 5660.5, 60 sec: 5717.8, 300 sec: 5700.8). Total num frames: 98036736. Throughput: 0: 5088.5. Samples: 98030904. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:28,572][25689] Avg episode reward: [(0, '-52.264')] [2022-07-09 04:52:30,457][26022] Updated weights on worker 0-0, policy_version 95749 (0.00088) [2022-07-09 04:52:32,074][26022] Updated weights on worker 0-0, policy_version 95759 (0.00092) [2022-07-09 04:52:33,576][25689] Fps is (10 sec: 5807.1, 60 sec: 5719.1, 300 sec: 5697.8). Total num frames: 98065408. Throughput: 0: 5945.4. Samples: 98065488. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:33,577][25689] Avg episode reward: [(0, '-51.524')] [2022-07-09 04:52:33,979][26022] Updated weights on worker 0-0, policy_version 95769 (0.00084) [2022-07-09 04:52:35,646][26022] Updated weights on worker 0-0, policy_version 95779 (0.00089) [2022-07-09 04:52:37,459][26022] Updated weights on worker 0-0, policy_version 95789 (0.00083) [2022-07-09 04:52:38,591][25689] Fps is (10 sec: 5723.4, 60 sec: 5704.7, 300 sec: 5698.0). Total num frames: 98094080. Throughput: 0: 5988.8. Samples: 98100144. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:38,591][25689] Avg episode reward: [(0, '-51.493')] [2022-07-09 04:52:39,367][26022] Updated weights on worker 0-0, policy_version 95799 (0.00087) [2022-07-09 04:52:41,045][26022] Updated weights on worker 0-0, policy_version 95809 (0.00416) [2022-07-09 04:52:42,948][26022] Updated weights on worker 0-0, policy_version 95819 (0.00071) [2022-07-09 04:52:43,592][25689] Fps is (10 sec: 5623.1, 60 sec: 5671.3, 300 sec: 5695.2). Total num frames: 98121728. Throughput: 0: 5155.0. Samples: 98117050. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:43,592][25689] Avg episode reward: [(0, '-51.522')] [2022-07-09 04:52:44,599][26022] Updated weights on worker 0-0, policy_version 95829 (0.00078) [2022-07-09 04:52:46,492][26022] Updated weights on worker 0-0, policy_version 95839 (0.00087) [2022-07-09 04:52:48,391][26022] Updated weights on worker 0-0, policy_version 95849 (0.00092) [2022-07-09 04:52:48,721][25689] Fps is (10 sec: 5559.6, 60 sec: 5687.7, 300 sec: 5700.0). Total num frames: 98150400. Throughput: 0: 5974.2. Samples: 98151280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-09 04:52:48,723][25689] Avg episode reward: [(0, '-52.503')] [2022-07-09 04:52:49,893][26022] Updated weights on worker 0-0, policy_version 95859 (0.00087) [2022-07-09 04:52:51,945][26022] Updated weights on worker 0-0, policy_version 95869 (0.00988) [2022-07-09 04:52:53,515][26022] Updated weights on worker 0-0, policy_version 95879 (0.00082) [2022-07-09 04:52:53,753][25689] Fps is (10 sec: 5744.0, 60 sec: 5675.4, 300 sec: 5696.1). Total num frames: 98180096. Throughput: 0: 5955.5. Samples: 98185654. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:52:53,754][25689] Avg episode reward: [(0, '-52.635')] [2022-07-09 04:52:55,409][26022] Updated weights on worker 0-0, policy_version 95889 (0.00093) [2022-07-09 04:52:57,488][26022] Updated weights on worker 0-0, policy_version 95899 (0.00084) [2022-07-09 04:52:58,848][25689] Fps is (10 sec: 5864.8, 60 sec: 5690.0, 300 sec: 5694.5). Total num frames: 98209792. Throughput: 0: 5071.0. Samples: 98202868. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:52:58,849][25689] Avg episode reward: [(0, '-52.589')] [2022-07-09 04:52:58,851][26022] Updated weights on worker 0-0, policy_version 95909 (0.00090) [2022-07-09 04:53:00,993][26022] Updated weights on worker 0-0, policy_version 95919 (0.00092) [2022-07-09 04:53:02,638][26022] Updated weights on worker 0-0, policy_version 95929 (0.00056) [2022-07-09 04:53:03,949][25689] Fps is (10 sec: 5423.7, 60 sec: 5682.7, 300 sec: 5691.1). Total num frames: 98235392. Throughput: 0: 5827.1. Samples: 98235672. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:03,949][25689] Avg episode reward: [(0, '-52.500')] [2022-07-09 04:53:04,899][26022] Updated weights on worker 0-0, policy_version 95939 (0.00087) [2022-07-09 04:53:06,703][26022] Updated weights on worker 0-0, policy_version 95949 (0.00084) [2022-07-09 04:53:08,250][26022] Updated weights on worker 0-0, policy_version 95959 (0.00098) [2022-07-09 04:53:08,997][25689] Fps is (10 sec: 5448.2, 60 sec: 5688.2, 300 sec: 5697.3). Total num frames: 98265088. Throughput: 0: 5836.0. Samples: 98269612. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:08,999][25689] Avg episode reward: [(0, '-52.070')] [2022-07-09 04:53:10,347][26022] Updated weights on worker 0-0, policy_version 95969 (0.00083) [2022-07-09 04:53:11,887][26022] Updated weights on worker 0-0, policy_version 95979 (0.00086) [2022-07-09 04:53:13,787][26022] Updated weights on worker 0-0, policy_version 95989 (0.00089) [2022-07-09 04:53:14,027][25689] Fps is (10 sec: 5791.7, 60 sec: 5671.3, 300 sec: 5693.8). Total num frames: 98293760. Throughput: 0: 4981.0. Samples: 98286634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:14,028][25689] Avg episode reward: [(0, '-52.292')] [2022-07-09 04:53:15,429][26022] Updated weights on worker 0-0, policy_version 95999 (0.00086) [2022-07-09 04:53:17,146][26022] Updated weights on worker 0-0, policy_version 96009 (0.00085) [2022-07-09 04:53:19,034][25689] Fps is (10 sec: 5713.5, 60 sec: 5661.4, 300 sec: 5694.6). Total num frames: 98322432. Throughput: 0: 5864.5. Samples: 98321250. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:19,035][25689] Avg episode reward: [(0, '-51.722')] [2022-07-09 04:53:19,087][26022] Updated weights on worker 0-0, policy_version 96019 (0.00094) [2022-07-09 04:53:20,639][26022] Updated weights on worker 0-0, policy_version 96029 (0.00094) [2022-07-09 04:53:22,664][26022] Updated weights on worker 0-0, policy_version 96039 (0.00090) [2022-07-09 04:53:24,050][25689] Fps is (10 sec: 5823.4, 60 sec: 5701.6, 300 sec: 5695.3). Total num frames: 98352128. Throughput: 0: 5990.6. Samples: 98356088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:24,052][25689] Avg episode reward: [(0, '-51.116')] [2022-07-09 04:53:24,290][26022] Updated weights on worker 0-0, policy_version 96049 (0.00080) [2022-07-09 04:53:26,135][26022] Updated weights on worker 0-0, policy_version 96059 (0.00082) [2022-07-09 04:53:27,947][26022] Updated weights on worker 0-0, policy_version 96069 (0.00078) [2022-07-09 04:53:29,179][25689] Fps is (10 sec: 5753.4, 60 sec: 5676.9, 300 sec: 5696.4). Total num frames: 98380800. Throughput: 0: 5986.7. Samples: 98390434. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:29,180][25689] Avg episode reward: [(0, '-51.208')] [2022-07-09 04:53:29,799][26022] Updated weights on worker 0-0, policy_version 96079 (0.00087) [2022-07-09 04:53:31,525][26022] Updated weights on worker 0-0, policy_version 96089 (0.00050) [2022-07-09 04:53:33,119][26022] Updated weights on worker 0-0, policy_version 96099 (0.00085) [2022-07-09 04:53:34,206][25689] Fps is (10 sec: 5545.1, 60 sec: 5657.9, 300 sec: 5689.2). Total num frames: 98408448. Throughput: 0: 5996.1. Samples: 98407634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:34,207][25689] Avg episode reward: [(0, '-50.891')] [2022-07-09 04:53:35,069][26022] Updated weights on worker 0-0, policy_version 96109 (0.00093) [2022-07-09 04:53:36,982][26022] Updated weights on worker 0-0, policy_version 96119 (0.00081) [2022-07-09 04:53:38,472][26022] Updated weights on worker 0-0, policy_version 96129 (0.00092) [2022-07-09 04:53:39,278][25689] Fps is (10 sec: 5881.1, 60 sec: 5703.2, 300 sec: 5698.7). Total num frames: 98440192. Throughput: 0: 5970.4. Samples: 98442114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:39,278][25689] Avg episode reward: [(0, '-50.575')] [2022-07-09 04:53:40,563][26022] Updated weights on worker 0-0, policy_version 96139 (0.00083) [2022-07-09 04:53:42,073][26022] Updated weights on worker 0-0, policy_version 96149 (0.00085) [2022-07-09 04:53:44,120][26022] Updated weights on worker 0-0, policy_version 96159 (0.00088) [2022-07-09 04:53:44,303][25689] Fps is (10 sec: 5983.5, 60 sec: 5717.8, 300 sec: 5695.8). Total num frames: 98468864. Throughput: 0: 5949.2. Samples: 98476582. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:44,304][25689] Avg episode reward: [(0, '-51.193')] [2022-07-09 04:53:45,751][26022] Updated weights on worker 0-0, policy_version 96169 (0.00096) [2022-07-09 04:53:47,568][26022] Updated weights on worker 0-0, policy_version 96179 (0.00095) [2022-07-09 04:53:49,340][25689] Fps is (10 sec: 5597.0, 60 sec: 5709.6, 300 sec: 5699.1). Total num frames: 98496512. Throughput: 0: 5123.6. Samples: 98493732. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:49,341][25689] Avg episode reward: [(0, '-50.885')] [2022-07-09 04:53:49,444][26022] Updated weights on worker 0-0, policy_version 96189 (0.00092) [2022-07-09 04:53:51,068][26022] Updated weights on worker 0-0, policy_version 96199 (0.00098) [2022-07-09 04:53:53,014][26022] Updated weights on worker 0-0, policy_version 96209 (0.00091) [2022-07-09 04:53:54,365][25689] Fps is (10 sec: 5597.4, 60 sec: 5693.4, 300 sec: 5695.7). Total num frames: 98525184. Throughput: 0: 5955.4. Samples: 98527688. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:54,366][25689] Avg episode reward: [(0, '-51.030')] [2022-07-09 04:53:54,777][26022] Updated weights on worker 0-0, policy_version 96219 (0.00083) [2022-07-09 04:53:56,590][26022] Updated weights on worker 0-0, policy_version 96229 (0.00091) [2022-07-09 04:53:58,484][26022] Updated weights on worker 0-0, policy_version 96239 (0.00244) [2022-07-09 04:53:59,391][25689] Fps is (10 sec: 5603.6, 60 sec: 5666.1, 300 sec: 5699.2). Total num frames: 98552832. Throughput: 0: 5944.5. Samples: 98561678. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:53:59,391][25689] Avg episode reward: [(0, '-51.181')] [2022-07-09 04:54:00,079][26022] Updated weights on worker 0-0, policy_version 96249 (0.00086) [2022-07-09 04:54:02,362][26022] Updated weights on worker 0-0, policy_version 96259 (0.00090) [2022-07-09 04:54:04,154][26022] Updated weights on worker 0-0, policy_version 96269 (0.00088) [2022-07-09 04:54:04,434][25689] Fps is (10 sec: 5390.3, 60 sec: 5688.4, 300 sec: 5693.7). Total num frames: 98579456. Throughput: 0: 4980.6. Samples: 98576844. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:04,435][25689] Avg episode reward: [(0, '-51.352')] [2022-07-09 04:54:05,792][26022] Updated weights on worker 0-0, policy_version 96279 (0.00082) [2022-07-09 04:54:07,880][26022] Updated weights on worker 0-0, policy_version 96289 (0.00090) [2022-07-09 04:54:09,499][25689] Fps is (10 sec: 5571.8, 60 sec: 5686.9, 300 sec: 5696.5). Total num frames: 98609152. Throughput: 0: 5838.7. Samples: 98611434. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:09,499][25689] Avg episode reward: [(0, '-51.492')] [2022-07-09 04:54:09,531][26022] Updated weights on worker 0-0, policy_version 96299 (0.00078) [2022-07-09 04:54:11,366][26022] Updated weights on worker 0-0, policy_version 96309 (0.00091) [2022-07-09 04:54:13,070][26022] Updated weights on worker 0-0, policy_version 96319 (0.00090) [2022-07-09 04:54:14,562][25689] Fps is (10 sec: 5762.9, 60 sec: 5683.7, 300 sec: 5692.2). Total num frames: 98637824. Throughput: 0: 5852.6. Samples: 98645892. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:14,564][25689] Avg episode reward: [(0, '-51.535')] [2022-07-09 04:54:14,966][26022] Updated weights on worker 0-0, policy_version 96329 (0.00092) [2022-07-09 04:54:16,672][26022] Updated weights on worker 0-0, policy_version 96339 (0.00085) [2022-07-09 04:54:18,478][26022] Updated weights on worker 0-0, policy_version 96349 (0.00085) [2022-07-09 04:54:19,657][25689] Fps is (10 sec: 5846.7, 60 sec: 5709.2, 300 sec: 5697.5). Total num frames: 98668544. Throughput: 0: 5004.6. Samples: 98663102. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:19,658][25689] Avg episode reward: [(0, '-52.332')] [2022-07-09 04:54:20,059][26022] Updated weights on worker 0-0, policy_version 96359 (0.00090) [2022-07-09 04:54:22,094][26022] Updated weights on worker 0-0, policy_version 96369 (0.00088) [2022-07-09 04:54:24,139][26022] Updated weights on worker 0-0, policy_version 96379 (0.00088) [2022-07-09 04:54:24,678][25689] Fps is (10 sec: 5669.0, 60 sec: 5658.1, 300 sec: 5694.4). Total num frames: 98695168. Throughput: 0: 5956.4. Samples: 98697424. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:24,679][25689] Avg episode reward: [(0, '-53.026')] [2022-07-09 04:54:25,678][26022] Updated weights on worker 0-0, policy_version 96389 (0.00089) [2022-07-09 04:54:26,881][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:54:26,890][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000096396_98709504.pth [2022-07-09 04:54:26,892][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000094392_96657408.pth [2022-07-09 04:54:27,675][26022] Updated weights on worker 0-0, policy_version 96399 (0.00094) [2022-07-09 04:54:29,281][26022] Updated weights on worker 0-0, policy_version 96409 (0.00085) [2022-07-09 04:54:29,734][25689] Fps is (10 sec: 5487.3, 60 sec: 5664.9, 300 sec: 5694.1). Total num frames: 98723840. Throughput: 0: 5922.0. Samples: 98731268. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:29,735][25689] Avg episode reward: [(0, '-53.330')] [2022-07-09 04:54:31,267][26022] Updated weights on worker 0-0, policy_version 96419 (0.00093) [2022-07-09 04:54:33,096][26022] Updated weights on worker 0-0, policy_version 96429 (0.00092) [2022-07-09 04:54:34,532][26022] Updated weights on worker 0-0, policy_version 96439 (0.00088) [2022-07-09 04:54:34,762][25689] Fps is (10 sec: 5889.2, 60 sec: 5715.6, 300 sec: 5691.1). Total num frames: 98754560. Throughput: 0: 5072.0. Samples: 98748348. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:34,764][25689] Avg episode reward: [(0, '-53.529')] [2022-07-09 04:54:36,652][26022] Updated weights on worker 0-0, policy_version 96449 (0.00092) [2022-07-09 04:54:38,035][26022] Updated weights on worker 0-0, policy_version 96459 (0.00087) [2022-07-09 04:54:39,798][25689] Fps is (10 sec: 5697.9, 60 sec: 5634.3, 300 sec: 5687.5). Total num frames: 98781184. Throughput: 0: 5955.7. Samples: 98783056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:39,800][25689] Avg episode reward: [(0, '-53.349')] [2022-07-09 04:54:40,029][26022] Updated weights on worker 0-0, policy_version 96469 (0.00080) [2022-07-09 04:54:41,639][26022] Updated weights on worker 0-0, policy_version 96479 (0.00096) [2022-07-09 04:54:43,593][26022] Updated weights on worker 0-0, policy_version 96489 (0.00085) [2022-07-09 04:54:44,813][25689] Fps is (10 sec: 5807.3, 60 sec: 5686.1, 300 sec: 5699.1). Total num frames: 98812928. Throughput: 0: 5981.5. Samples: 98817866. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 04:54:44,814][25689] Avg episode reward: [(0, '-52.379')] [2022-07-09 04:54:45,500][26022] Updated weights on worker 0-0, policy_version 96499 (0.00091) [2022-07-09 04:54:47,082][26022] Updated weights on worker 0-0, policy_version 96509 (0.00087) [2022-07-09 04:54:48,922][26022] Updated weights on worker 0-0, policy_version 96519 (0.00086) [2022-07-09 04:54:49,893][25689] Fps is (10 sec: 5984.6, 60 sec: 5698.9, 300 sec: 5694.2). Total num frames: 98841600. Throughput: 0: 5149.6. Samples: 98835080. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:54:49,894][25689] Avg episode reward: [(0, '-52.390')] [2022-07-09 04:54:50,638][26022] Updated weights on worker 0-0, policy_version 96529 (0.00093) [2022-07-09 04:54:52,458][26022] Updated weights on worker 0-0, policy_version 96539 (0.00088) [2022-07-09 04:54:54,320][26022] Updated weights on worker 0-0, policy_version 96549 (0.00086) [2022-07-09 04:54:54,930][25689] Fps is (10 sec: 5567.0, 60 sec: 5680.9, 300 sec: 5690.9). Total num frames: 98869248. Throughput: 0: 6024.7. Samples: 98869852. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:54:54,931][25689] Avg episode reward: [(0, '-51.211')] [2022-07-09 04:54:55,818][26022] Updated weights on worker 0-0, policy_version 96559 (0.00083) [2022-07-09 04:54:57,975][26022] Updated weights on worker 0-0, policy_version 96569 (0.00115) [2022-07-09 04:54:59,334][26022] Updated weights on worker 0-0, policy_version 96579 (0.00086) [2022-07-09 04:54:59,935][25689] Fps is (10 sec: 5608.9, 60 sec: 5699.8, 300 sec: 5697.9). Total num frames: 98897920. Throughput: 0: 6013.0. Samples: 98904136. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:54:59,935][25689] Avg episode reward: [(0, '-51.506')] [2022-07-09 04:55:01,824][26022] Updated weights on worker 0-0, policy_version 96589 (0.00085) [2022-07-09 04:55:03,305][26022] Updated weights on worker 0-0, policy_version 96599 (0.00079) [2022-07-09 04:55:04,962][25689] Fps is (10 sec: 5511.7, 60 sec: 5701.2, 300 sec: 5688.0). Total num frames: 98924544. Throughput: 0: 5035.8. Samples: 98919334. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:04,963][25689] Avg episode reward: [(0, '-50.884')] [2022-07-09 04:55:05,353][26022] Updated weights on worker 0-0, policy_version 96609 (0.00084) [2022-07-09 04:55:07,055][26022] Updated weights on worker 0-0, policy_version 96619 (0.00089) [2022-07-09 04:55:08,748][26022] Updated weights on worker 0-0, policy_version 96629 (0.00089) [2022-07-09 04:55:10,012][25689] Fps is (10 sec: 5588.7, 60 sec: 5702.7, 300 sec: 5690.8). Total num frames: 98954240. Throughput: 0: 5911.7. Samples: 98954016. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:10,013][25689] Avg episode reward: [(0, '-51.149')] [2022-07-09 04:55:10,680][26022] Updated weights on worker 0-0, policy_version 96639 (0.00092) [2022-07-09 04:55:12,344][26022] Updated weights on worker 0-0, policy_version 96649 (0.00088) [2022-07-09 04:55:14,310][26022] Updated weights on worker 0-0, policy_version 96659 (0.00094) [2022-07-09 04:55:15,080][25689] Fps is (10 sec: 5869.9, 60 sec: 5719.1, 300 sec: 5696.9). Total num frames: 98983936. Throughput: 0: 5882.7. Samples: 98988390. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:15,081][25689] Avg episode reward: [(0, '-51.439')] [2022-07-09 04:55:15,961][26022] Updated weights on worker 0-0, policy_version 96669 (0.00085) [2022-07-09 04:55:17,839][26022] Updated weights on worker 0-0, policy_version 96679 (0.00087) [2022-07-09 04:55:19,567][26022] Updated weights on worker 0-0, policy_version 96689 (0.00092) [2022-07-09 04:55:20,098][25689] Fps is (10 sec: 5787.1, 60 sec: 5692.6, 300 sec: 5696.9). Total num frames: 99012608. Throughput: 0: 5040.3. Samples: 99005766. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:20,099][25689] Avg episode reward: [(0, '-52.903')] [2022-07-09 04:55:21,347][26022] Updated weights on worker 0-0, policy_version 96699 (0.00089) [2022-07-09 04:55:22,972][26022] Updated weights on worker 0-0, policy_version 96709 (0.00088) [2022-07-09 04:55:24,774][26022] Updated weights on worker 0-0, policy_version 96719 (0.00089) [2022-07-09 04:55:25,107][25689] Fps is (10 sec: 5821.4, 60 sec: 5744.5, 300 sec: 5699.2). Total num frames: 99042304. Throughput: 0: 6037.1. Samples: 99040948. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:25,107][25689] Avg episode reward: [(0, '-52.310')] [2022-07-09 04:55:26,617][26022] Updated weights on worker 0-0, policy_version 96729 (0.00090) [2022-07-09 04:55:28,509][26022] Updated weights on worker 0-0, policy_version 96739 (0.00087) [2022-07-09 04:55:30,154][25689] Fps is (10 sec: 5804.4, 60 sec: 5745.4, 300 sec: 5702.1). Total num frames: 99070976. Throughput: 0: 6018.9. Samples: 99075248. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:30,155][25689] Avg episode reward: [(0, '-51.608')] [2022-07-09 04:55:30,160][26022] Updated weights on worker 0-0, policy_version 96749 (0.00090) [2022-07-09 04:55:32,007][26022] Updated weights on worker 0-0, policy_version 96759 (0.00087) [2022-07-09 04:55:33,531][26022] Updated weights on worker 0-0, policy_version 96769 (0.00098) [2022-07-09 04:55:35,211][25689] Fps is (10 sec: 5675.6, 60 sec: 5708.8, 300 sec: 5701.4). Total num frames: 99099648. Throughput: 0: 5182.2. Samples: 99092708. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:35,211][25689] Avg episode reward: [(0, '-51.925')] [2022-07-09 04:55:35,536][26022] Updated weights on worker 0-0, policy_version 96779 (0.00087) [2022-07-09 04:55:37,325][26022] Updated weights on worker 0-0, policy_version 96789 (0.00085) [2022-07-09 04:55:38,821][26022] Updated weights on worker 0-0, policy_version 96799 (0.00088) [2022-07-09 04:55:40,256][25689] Fps is (10 sec: 5676.3, 60 sec: 5741.8, 300 sec: 5697.4). Total num frames: 99128320. Throughput: 0: 6035.6. Samples: 99127432. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:40,257][25689] Avg episode reward: [(0, '-52.388')] [2022-07-09 04:55:40,803][26022] Updated weights on worker 0-0, policy_version 96809 (0.00081) [2022-07-09 04:55:42,490][26022] Updated weights on worker 0-0, policy_version 96819 (0.00092) [2022-07-09 04:55:44,386][26022] Updated weights on worker 0-0, policy_version 96829 (0.00088) [2022-07-09 04:55:45,271][25689] Fps is (10 sec: 5801.5, 60 sec: 5707.9, 300 sec: 5702.9). Total num frames: 99158016. Throughput: 0: 6005.2. Samples: 99162040. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:45,272][25689] Avg episode reward: [(0, '-52.380')] [2022-07-09 04:55:46,052][26022] Updated weights on worker 0-0, policy_version 96839 (0.00092) [2022-07-09 04:55:47,963][26022] Updated weights on worker 0-0, policy_version 96849 (0.00085) [2022-07-09 04:55:49,591][26022] Updated weights on worker 0-0, policy_version 96859 (0.00088) [2022-07-09 04:55:50,337][25689] Fps is (10 sec: 5789.8, 60 sec: 5709.2, 300 sec: 5698.4). Total num frames: 99186688. Throughput: 0: 5147.6. Samples: 99179144. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:50,338][25689] Avg episode reward: [(0, '-52.374')] [2022-07-09 04:55:51,726][26022] Updated weights on worker 0-0, policy_version 96869 (0.00094) [2022-07-09 04:55:53,244][26022] Updated weights on worker 0-0, policy_version 96879 (0.00086) [2022-07-09 04:55:55,048][26022] Updated weights on worker 0-0, policy_version 96889 (0.00081) [2022-07-09 04:55:55,345][25689] Fps is (10 sec: 5692.7, 60 sec: 5728.9, 300 sec: 5695.2). Total num frames: 99215360. Throughput: 0: 5998.7. Samples: 99213486. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:55:55,345][25689] Avg episode reward: [(0, '-51.886')] [2022-07-09 04:55:57,063][26022] Updated weights on worker 0-0, policy_version 96899 (0.00094) [2022-07-09 04:55:58,625][26022] Updated weights on worker 0-0, policy_version 96909 (0.00082) [2022-07-09 04:56:00,356][25689] Fps is (10 sec: 5519.1, 60 sec: 5694.4, 300 sec: 5695.8). Total num frames: 99241984. Throughput: 0: 5991.6. Samples: 99247864. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:00,357][25689] Avg episode reward: [(0, '-51.355')] [2022-07-09 04:56:00,634][26022] Updated weights on worker 0-0, policy_version 96919 (0.00084) [2022-07-09 04:56:02,581][26022] Updated weights on worker 0-0, policy_version 96929 (0.00099) [2022-07-09 04:56:04,313][26022] Updated weights on worker 0-0, policy_version 96939 (0.00092) [2022-07-09 04:56:05,378][25689] Fps is (10 sec: 5613.5, 60 sec: 5745.8, 300 sec: 5703.5). Total num frames: 99271680. Throughput: 0: 5056.3. Samples: 99263704. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:05,378][25689] Avg episode reward: [(0, '-51.314')] [2022-07-09 04:56:06,154][26022] Updated weights on worker 0-0, policy_version 96949 (0.00097) [2022-07-09 04:56:07,888][26022] Updated weights on worker 0-0, policy_version 96959 (0.00082) [2022-07-09 04:56:09,935][26022] Updated weights on worker 0-0, policy_version 96969 (0.00095) [2022-07-09 04:56:10,430][25689] Fps is (10 sec: 5692.3, 60 sec: 5711.7, 300 sec: 5696.3). Total num frames: 99299328. Throughput: 0: 5902.5. Samples: 99297744. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:10,431][25689] Avg episode reward: [(0, '-51.055')] [2022-07-09 04:56:11,507][26022] Updated weights on worker 0-0, policy_version 96979 (0.00086) [2022-07-09 04:56:13,199][26022] Updated weights on worker 0-0, policy_version 96989 (0.00056) [2022-07-09 04:56:15,120][26022] Updated weights on worker 0-0, policy_version 96999 (0.00089) [2022-07-09 04:56:15,449][25689] Fps is (10 sec: 5591.9, 60 sec: 5699.4, 300 sec: 5693.6). Total num frames: 99328000. Throughput: 0: 5917.4. Samples: 99332452. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:15,450][25689] Avg episode reward: [(0, '-50.690')] [2022-07-09 04:56:16,653][26022] Updated weights on worker 0-0, policy_version 97009 (0.00087) [2022-07-09 04:56:18,678][26022] Updated weights on worker 0-0, policy_version 97019 (0.00085) [2022-07-09 04:56:20,229][26022] Updated weights on worker 0-0, policy_version 97029 (0.00084) [2022-07-09 04:56:20,450][25689] Fps is (10 sec: 5927.3, 60 sec: 5734.8, 300 sec: 5704.0). Total num frames: 99358720. Throughput: 0: 5954.8. Samples: 99367520. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:20,451][25689] Avg episode reward: [(0, '-50.248')] [2022-07-09 04:56:22,127][26022] Updated weights on worker 0-0, policy_version 97039 (0.00086) [2022-07-09 04:56:23,822][26022] Updated weights on worker 0-0, policy_version 97049 (0.00088) [2022-07-09 04:56:25,483][25689] Fps is (10 sec: 5817.0, 60 sec: 5698.7, 300 sec: 5701.1). Total num frames: 99386368. Throughput: 0: 6035.0. Samples: 99385042. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:25,484][25689] Avg episode reward: [(0, '-50.125')] [2022-07-09 04:56:25,674][26022] Updated weights on worker 0-0, policy_version 97059 (0.00093) [2022-07-09 04:56:26,950][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:56:26,964][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000097067_99396608.pth [2022-07-09 04:56:26,964][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000095060_97341440.pth [2022-07-09 04:56:27,614][26022] Updated weights on worker 0-0, policy_version 97069 (0.00088) [2022-07-09 04:56:29,093][26022] Updated weights on worker 0-0, policy_version 97079 (0.00087) [2022-07-09 04:56:30,531][25689] Fps is (10 sec: 5688.8, 60 sec: 5715.6, 300 sec: 5704.0). Total num frames: 99416064. Throughput: 0: 6054.9. Samples: 99419450. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:30,531][25689] Avg episode reward: [(0, '-50.247')] [2022-07-09 04:56:31,176][26022] Updated weights on worker 0-0, policy_version 97089 (0.00095) [2022-07-09 04:56:32,600][26022] Updated weights on worker 0-0, policy_version 97099 (0.00086) [2022-07-09 04:56:34,740][26022] Updated weights on worker 0-0, policy_version 97109 (0.00084) [2022-07-09 04:56:35,536][25689] Fps is (10 sec: 5806.0, 60 sec: 5720.4, 300 sec: 5701.2). Total num frames: 99444736. Throughput: 0: 6041.0. Samples: 99453798. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:35,537][25689] Avg episode reward: [(0, '-50.208')] [2022-07-09 04:56:36,317][26022] Updated weights on worker 0-0, policy_version 97119 (0.00077) [2022-07-09 04:56:38,092][26022] Updated weights on worker 0-0, policy_version 97129 (0.00084) [2022-07-09 04:56:39,871][26022] Updated weights on worker 0-0, policy_version 97139 (0.00085) [2022-07-09 04:56:40,555][25689] Fps is (10 sec: 5720.6, 60 sec: 5723.0, 300 sec: 5697.5). Total num frames: 99473408. Throughput: 0: 5157.5. Samples: 99471208. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 04:56:40,555][25689] Avg episode reward: [(0, '-50.437')] [2022-07-09 04:56:41,744][26022] Updated weights on worker 0-0, policy_version 97149 (0.00090) [2022-07-09 04:56:43,326][26022] Updated weights on worker 0-0, policy_version 97159 (0.00089) [2022-07-09 04:56:45,516][26022] Updated weights on worker 0-0, policy_version 97169 (0.00086) [2022-07-09 04:56:45,573][25689] Fps is (10 sec: 5713.4, 60 sec: 5705.7, 300 sec: 5702.9). Total num frames: 99502080. Throughput: 0: 6018.0. Samples: 99505942. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:56:45,580][25689] Avg episode reward: [(0, '-50.172')] [2022-07-09 04:56:46,897][26022] Updated weights on worker 0-0, policy_version 97179 (0.00090) [2022-07-09 04:56:48,879][26022] Updated weights on worker 0-0, policy_version 97189 (0.00081) [2022-07-09 04:56:50,445][26022] Updated weights on worker 0-0, policy_version 97199 (0.00095) [2022-07-09 04:56:50,657][25689] Fps is (10 sec: 5879.1, 60 sec: 5738.0, 300 sec: 5702.9). Total num frames: 99532800. Throughput: 0: 6018.7. Samples: 99540584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:56:50,658][25689] Avg episode reward: [(0, '-50.349')] [2022-07-09 04:56:52,400][26022] Updated weights on worker 0-0, policy_version 97209 (0.00085) [2022-07-09 04:56:54,019][26022] Updated weights on worker 0-0, policy_version 97219 (0.00090) [2022-07-09 04:56:55,659][25689] Fps is (10 sec: 5888.4, 60 sec: 5738.4, 300 sec: 5704.2). Total num frames: 99561472. Throughput: 0: 5187.7. Samples: 99558190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:56:55,660][25689] Avg episode reward: [(0, '-50.768')] [2022-07-09 04:56:55,819][26022] Updated weights on worker 0-0, policy_version 97229 (0.00093) [2022-07-09 04:56:57,690][26022] Updated weights on worker 0-0, policy_version 97239 (0.00105) [2022-07-09 04:56:59,532][26022] Updated weights on worker 0-0, policy_version 97249 (0.00090) [2022-07-09 04:57:00,677][25689] Fps is (10 sec: 5620.6, 60 sec: 5754.8, 300 sec: 5711.2). Total num frames: 99589120. Throughput: 0: 6028.0. Samples: 99592506. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:00,678][25689] Avg episode reward: [(0, '-51.583')] [2022-07-09 04:57:01,220][26022] Updated weights on worker 0-0, policy_version 97259 (0.00082) [2022-07-09 04:57:03,380][26022] Updated weights on worker 0-0, policy_version 97269 (0.00089) [2022-07-09 04:57:05,035][26022] Updated weights on worker 0-0, policy_version 97279 (0.00087) [2022-07-09 04:57:05,694][25689] Fps is (10 sec: 5510.3, 60 sec: 5721.3, 300 sec: 5706.0). Total num frames: 99616768. Throughput: 0: 5918.2. Samples: 99625024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:05,697][25689] Avg episode reward: [(0, '-51.817')] [2022-07-09 04:57:07,220][26022] Updated weights on worker 0-0, policy_version 97289 (0.00091) [2022-07-09 04:57:08,581][26022] Updated weights on worker 0-0, policy_version 97299 (0.00085) [2022-07-09 04:57:10,587][26022] Updated weights on worker 0-0, policy_version 97309 (0.00088) [2022-07-09 04:57:10,760][25689] Fps is (10 sec: 5586.0, 60 sec: 5737.1, 300 sec: 5701.9). Total num frames: 99645440. Throughput: 0: 5062.0. Samples: 99642344. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:10,760][25689] Avg episode reward: [(0, '-52.884')] [2022-07-09 04:57:12,191][26022] Updated weights on worker 0-0, policy_version 97319 (0.00083) [2022-07-09 04:57:14,137][26022] Updated weights on worker 0-0, policy_version 97329 (0.00048) [2022-07-09 04:57:15,737][26022] Updated weights on worker 0-0, policy_version 97339 (0.00087) [2022-07-09 04:57:15,807][25689] Fps is (10 sec: 5771.6, 60 sec: 5751.3, 300 sec: 5702.5). Total num frames: 99675136. Throughput: 0: 5899.8. Samples: 99677060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:15,808][25689] Avg episode reward: [(0, '-52.238')] [2022-07-09 04:57:17,919][26022] Updated weights on worker 0-0, policy_version 97349 (0.00101) [2022-07-09 04:57:19,174][26022] Updated weights on worker 0-0, policy_version 97359 (0.00084) [2022-07-09 04:57:20,835][25689] Fps is (10 sec: 5590.0, 60 sec: 5681.0, 300 sec: 5700.2). Total num frames: 99701760. Throughput: 0: 5917.2. Samples: 99711782. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:20,835][25689] Avg episode reward: [(0, '-51.187')] [2022-07-09 04:57:21,380][26022] Updated weights on worker 0-0, policy_version 97369 (0.00085) [2022-07-09 04:57:22,529][26022] Updated weights on worker 0-0, policy_version 97379 (0.00086) [2022-07-09 04:57:24,876][26022] Updated weights on worker 0-0, policy_version 97389 (0.00086) [2022-07-09 04:57:25,839][25689] Fps is (10 sec: 5716.2, 60 sec: 5734.5, 300 sec: 5704.4). Total num frames: 99732480. Throughput: 0: 5168.0. Samples: 99729132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:25,840][25689] Avg episode reward: [(0, '-50.712')] [2022-07-09 04:57:26,224][26022] Updated weights on worker 0-0, policy_version 97399 (0.00085) [2022-07-09 04:57:28,386][26022] Updated weights on worker 0-0, policy_version 97409 (0.00600) [2022-07-09 04:57:30,153][26022] Updated weights on worker 0-0, policy_version 97419 (0.00082) [2022-07-09 04:57:30,958][25689] Fps is (10 sec: 5866.7, 60 sec: 5710.8, 300 sec: 5702.2). Total num frames: 99761152. Throughput: 0: 5979.1. Samples: 99763114. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:30,959][25689] Avg episode reward: [(0, '-50.077')] [2022-07-09 04:57:31,870][26022] Updated weights on worker 0-0, policy_version 97429 (0.00436) [2022-07-09 04:57:33,573][26022] Updated weights on worker 0-0, policy_version 97439 (0.00091) [2022-07-09 04:57:35,584][26022] Updated weights on worker 0-0, policy_version 97449 (0.00095) [2022-07-09 04:57:35,978][25689] Fps is (10 sec: 5554.7, 60 sec: 5692.5, 300 sec: 5698.7). Total num frames: 99788800. Throughput: 0: 6007.0. Samples: 99798228. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:35,979][25689] Avg episode reward: [(0, '-49.791')] [2022-07-09 04:57:36,952][26022] Updated weights on worker 0-0, policy_version 97459 (0.00089) [2022-07-09 04:57:39,186][26022] Updated weights on worker 0-0, policy_version 97469 (0.00092) [2022-07-09 04:57:40,543][26022] Updated weights on worker 0-0, policy_version 97479 (0.00091) [2022-07-09 04:57:40,988][25689] Fps is (10 sec: 5921.7, 60 sec: 5744.1, 300 sec: 5712.3). Total num frames: 99820544. Throughput: 0: 5157.7. Samples: 99815726. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:40,988][25689] Avg episode reward: [(0, '-49.070')] [2022-07-09 04:57:42,475][26022] Updated weights on worker 0-0, policy_version 97489 (0.00091) [2022-07-09 04:57:44,331][26022] Updated weights on worker 0-0, policy_version 97499 (0.00088) [2022-07-09 04:57:45,958][26022] Updated weights on worker 0-0, policy_version 97509 (0.00097) [2022-07-09 04:57:46,009][25689] Fps is (10 sec: 6023.3, 60 sec: 5743.9, 300 sec: 5714.4). Total num frames: 99849216. Throughput: 0: 6036.2. Samples: 99850878. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:46,009][25689] Avg episode reward: [(0, '-50.254')] [2022-07-09 04:57:47,836][26022] Updated weights on worker 0-0, policy_version 97519 (0.00102) [2022-07-09 04:57:49,432][26022] Updated weights on worker 0-0, policy_version 97529 (0.00611) [2022-07-09 04:57:51,066][25689] Fps is (10 sec: 5791.4, 60 sec: 5729.5, 300 sec: 5713.9). Total num frames: 99878912. Throughput: 0: 6089.3. Samples: 99885558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:51,067][25689] Avg episode reward: [(0, '-50.297')] [2022-07-09 04:57:51,179][26022] Updated weights on worker 0-0, policy_version 97539 (0.00626) [2022-07-09 04:57:52,988][26022] Updated weights on worker 0-0, policy_version 97549 (0.00089) [2022-07-09 04:57:54,877][26022] Updated weights on worker 0-0, policy_version 97559 (0.00089) [2022-07-09 04:57:56,073][25689] Fps is (10 sec: 5799.4, 60 sec: 5729.0, 300 sec: 5712.1). Total num frames: 99907584. Throughput: 0: 5213.2. Samples: 99902986. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:57:56,074][25689] Avg episode reward: [(0, '-50.314')] [2022-07-09 04:57:56,678][26022] Updated weights on worker 0-0, policy_version 97569 (0.00097) [2022-07-09 04:57:58,395][26022] Updated weights on worker 0-0, policy_version 97579 (0.00090) [2022-07-09 04:58:00,265][26022] Updated weights on worker 0-0, policy_version 97589 (0.00090) [2022-07-09 04:58:01,101][25689] Fps is (10 sec: 5612.6, 60 sec: 5728.1, 300 sec: 5720.4). Total num frames: 99935232. Throughput: 0: 6049.9. Samples: 99937408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:01,101][25689] Avg episode reward: [(0, '-50.758')] [2022-07-09 04:58:02,180][26022] Updated weights on worker 0-0, policy_version 97599 (0.00092) [2022-07-09 04:58:04,187][26022] Updated weights on worker 0-0, policy_version 97609 (0.00097) [2022-07-09 04:58:05,635][26022] Updated weights on worker 0-0, policy_version 97619 (0.00093) [2022-07-09 04:58:06,137][25689] Fps is (10 sec: 5698.2, 60 sec: 5760.2, 300 sec: 5720.7). Total num frames: 99964928. Throughput: 0: 5933.6. Samples: 99970310. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:06,137][25689] Avg episode reward: [(0, '-50.197')] [2022-07-09 04:58:07,847][26022] Updated weights on worker 0-0, policy_version 97629 (0.00109) [2022-07-09 04:58:09,164][26022] Updated weights on worker 0-0, policy_version 97639 (0.00098) [2022-07-09 04:58:11,171][26022] Updated weights on worker 0-0, policy_version 97649 (0.00080) [2022-07-09 04:58:11,212][25689] Fps is (10 sec: 5671.3, 60 sec: 5742.3, 300 sec: 5716.4). Total num frames: 99992576. Throughput: 0: 5061.5. Samples: 99987526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:11,212][25689] Avg episode reward: [(0, '-50.313')] [2022-07-09 04:58:12,744][26022] Updated weights on worker 0-0, policy_version 97659 (0.00094) [2022-07-09 04:58:14,728][26022] Updated weights on worker 0-0, policy_version 97669 (0.00093) [2022-07-09 04:58:16,251][25689] Fps is (10 sec: 5568.6, 60 sec: 5726.2, 300 sec: 5715.7). Total num frames: 100021248. Throughput: 0: 5911.5. Samples: 100022264. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:16,251][25689] Avg episode reward: [(0, '-50.556')] [2022-07-09 04:58:16,621][26022] Updated weights on worker 0-0, policy_version 97679 (0.00080) [2022-07-09 04:58:18,326][26022] Updated weights on worker 0-0, policy_version 97689 (0.00094) [2022-07-09 04:58:19,883][26022] Updated weights on worker 0-0, policy_version 97699 (0.00092) [2022-07-09 04:58:21,274][25689] Fps is (10 sec: 5597.2, 60 sec: 5743.5, 300 sec: 5708.7). Total num frames: 100048896. Throughput: 0: 5930.4. Samples: 100057044. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:21,276][25689] Avg episode reward: [(0, '-50.651')] [2022-07-09 04:58:21,732][26022] Updated weights on worker 0-0, policy_version 97709 (0.00085) [2022-07-09 04:58:23,388][26022] Updated weights on worker 0-0, policy_version 97719 (0.00088) [2022-07-09 04:58:25,340][26022] Updated weights on worker 0-0, policy_version 97729 (0.00090) [2022-07-09 04:58:26,282][25689] Fps is (10 sec: 5920.5, 60 sec: 5760.1, 300 sec: 5721.4). Total num frames: 100080640. Throughput: 0: 5176.4. Samples: 100074592. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:26,283][25689] Avg episode reward: [(0, '-50.481')] [2022-07-09 04:58:27,139][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 04:58:27,148][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000097739_100084736.pth [2022-07-09 04:58:27,149][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000095728_98025472.pth [2022-07-09 04:58:27,155][26022] Updated weights on worker 0-0, policy_version 97739 (0.00091) [2022-07-09 04:58:28,831][26022] Updated weights on worker 0-0, policy_version 97749 (0.00095) [2022-07-09 04:58:30,622][26022] Updated weights on worker 0-0, policy_version 97759 (0.00085) [2022-07-09 04:58:31,372][25689] Fps is (10 sec: 5881.9, 60 sec: 5746.0, 300 sec: 5720.2). Total num frames: 100108288. Throughput: 0: 6017.1. Samples: 100108828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:31,372][25689] Avg episode reward: [(0, '-50.891')] [2022-07-09 04:58:32,505][26022] Updated weights on worker 0-0, policy_version 97769 (0.00087) [2022-07-09 04:58:34,236][26022] Updated weights on worker 0-0, policy_version 97779 (0.00086) [2022-07-09 04:58:36,078][26022] Updated weights on worker 0-0, policy_version 97789 (0.00100) [2022-07-09 04:58:36,426][25689] Fps is (10 sec: 5653.3, 60 sec: 5776.7, 300 sec: 5713.6). Total num frames: 100137984. Throughput: 0: 5998.7. Samples: 100143288. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 04:58:36,426][25689] Avg episode reward: [(0, '-51.531')] [2022-07-09 04:58:37,847][26022] Updated weights on worker 0-0, policy_version 97799 (0.00089) [2022-07-09 04:58:39,639][26022] Updated weights on worker 0-0, policy_version 97809 (0.00090) [2022-07-09 04:58:41,441][25689] Fps is (10 sec: 5694.7, 60 sec: 5708.3, 300 sec: 5710.4). Total num frames: 100165632. Throughput: 0: 5132.1. Samples: 100160544. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:58:41,442][25689] Avg episode reward: [(0, '-51.018')] [2022-07-09 04:58:41,460][26022] Updated weights on worker 0-0, policy_version 97819 (0.00084) [2022-07-09 04:58:43,056][26022] Updated weights on worker 0-0, policy_version 97829 (0.00094) [2022-07-09 04:58:44,831][26022] Updated weights on worker 0-0, policy_version 97839 (0.00089) [2022-07-09 04:58:46,456][25689] Fps is (10 sec: 5717.3, 60 sec: 5725.9, 300 sec: 5717.7). Total num frames: 100195328. Throughput: 0: 5977.0. Samples: 100195168. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:58:46,456][25689] Avg episode reward: [(0, '-52.066')] [2022-07-09 04:58:46,627][26022] Updated weights on worker 0-0, policy_version 97849 (0.00086) [2022-07-09 04:58:48,431][26022] Updated weights on worker 0-0, policy_version 97859 (0.00082) [2022-07-09 04:58:50,374][26022] Updated weights on worker 0-0, policy_version 97869 (0.00091) [2022-07-09 04:58:51,511][25689] Fps is (10 sec: 5796.4, 60 sec: 5709.1, 300 sec: 5717.1). Total num frames: 100224000. Throughput: 0: 5983.9. Samples: 100229340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:58:51,511][25689] Avg episode reward: [(0, '-52.464')] [2022-07-09 04:58:51,998][26022] Updated weights on worker 0-0, policy_version 97879 (0.00093) [2022-07-09 04:58:54,033][26022] Updated weights on worker 0-0, policy_version 97889 (0.00085) [2022-07-09 04:58:55,510][26022] Updated weights on worker 0-0, policy_version 97899 (0.00087) [2022-07-09 04:58:56,539][25689] Fps is (10 sec: 5585.6, 60 sec: 5690.3, 300 sec: 5717.1). Total num frames: 100251648. Throughput: 0: 5125.9. Samples: 100246384. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:58:56,539][25689] Avg episode reward: [(0, '-52.901')] [2022-07-09 04:58:57,392][26022] Updated weights on worker 0-0, policy_version 97909 (0.00086) [2022-07-09 04:58:59,352][26022] Updated weights on worker 0-0, policy_version 97919 (0.00089) [2022-07-09 04:59:00,970][26022] Updated weights on worker 0-0, policy_version 97929 (0.00051) [2022-07-09 04:59:01,555][25689] Fps is (10 sec: 5607.3, 60 sec: 5708.3, 300 sec: 5724.5). Total num frames: 100280320. Throughput: 0: 5991.4. Samples: 100281052. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:01,555][25689] Avg episode reward: [(0, '-52.034')] [2022-07-09 04:59:03,486][26022] Updated weights on worker 0-0, policy_version 97939 (0.00086) [2022-07-09 04:59:04,952][26022] Updated weights on worker 0-0, policy_version 97949 (0.00079) [2022-07-09 04:59:06,590][25689] Fps is (10 sec: 5603.3, 60 sec: 5674.5, 300 sec: 5718.2). Total num frames: 100307968. Throughput: 0: 5857.1. Samples: 100313096. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:06,590][25689] Avg episode reward: [(0, '-51.932')] [2022-07-09 04:59:06,953][26022] Updated weights on worker 0-0, policy_version 97959 (0.00089) [2022-07-09 04:59:08,493][26022] Updated weights on worker 0-0, policy_version 97969 (0.00088) [2022-07-09 04:59:10,425][26022] Updated weights on worker 0-0, policy_version 97979 (0.00744) [2022-07-09 04:59:11,652][25689] Fps is (10 sec: 5578.1, 60 sec: 5692.7, 300 sec: 5718.2). Total num frames: 100336640. Throughput: 0: 5010.2. Samples: 100330248. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:11,652][25689] Avg episode reward: [(0, '-52.191')] [2022-07-09 04:59:12,289][26022] Updated weights on worker 0-0, policy_version 97989 (0.00090) [2022-07-09 04:59:13,917][26022] Updated weights on worker 0-0, policy_version 97999 (0.00085) [2022-07-09 04:59:15,691][26022] Updated weights on worker 0-0, policy_version 98009 (0.00091) [2022-07-09 04:59:16,675][25689] Fps is (10 sec: 5787.3, 60 sec: 5711.0, 300 sec: 5716.1). Total num frames: 100366336. Throughput: 0: 5875.7. Samples: 100364702. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:16,676][25689] Avg episode reward: [(0, '-51.336')] [2022-07-09 04:59:17,529][26022] Updated weights on worker 0-0, policy_version 98019 (0.00091) [2022-07-09 04:59:19,539][26022] Updated weights on worker 0-0, policy_version 98029 (0.00086) [2022-07-09 04:59:20,966][26022] Updated weights on worker 0-0, policy_version 98039 (0.00086) [2022-07-09 04:59:21,722][25689] Fps is (10 sec: 5694.5, 60 sec: 5708.9, 300 sec: 5719.1). Total num frames: 100393984. Throughput: 0: 5857.1. Samples: 100399170. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:21,722][25689] Avg episode reward: [(0, '-51.055')] [2022-07-09 04:59:22,967][26022] Updated weights on worker 0-0, policy_version 98049 (0.00082) [2022-07-09 04:59:24,452][26022] Updated weights on worker 0-0, policy_version 98059 (0.00090) [2022-07-09 04:59:26,420][26022] Updated weights on worker 0-0, policy_version 98069 (0.00207) [2022-07-09 04:59:26,794][25689] Fps is (10 sec: 5768.3, 60 sec: 5685.9, 300 sec: 5725.7). Total num frames: 100424704. Throughput: 0: 5117.3. Samples: 100416488. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:26,795][25689] Avg episode reward: [(0, '-50.924')] [2022-07-09 04:59:28,384][26022] Updated weights on worker 0-0, policy_version 98079 (0.00086) [2022-07-09 04:59:29,971][26022] Updated weights on worker 0-0, policy_version 98089 (0.00091) [2022-07-09 04:59:31,855][25689] Fps is (10 sec: 5760.1, 60 sec: 5688.6, 300 sec: 5714.7). Total num frames: 100452352. Throughput: 0: 5978.8. Samples: 100451040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:31,855][25689] Avg episode reward: [(0, '-50.658')] [2022-07-09 04:59:31,942][26022] Updated weights on worker 0-0, policy_version 98099 (0.00081) [2022-07-09 04:59:33,755][26022] Updated weights on worker 0-0, policy_version 98109 (0.00093) [2022-07-09 04:59:35,477][26022] Updated weights on worker 0-0, policy_version 98119 (0.00092) [2022-07-09 04:59:36,956][25689] Fps is (10 sec: 5542.0, 60 sec: 5667.2, 300 sec: 5720.3). Total num frames: 100481024. Throughput: 0: 5947.2. Samples: 100485318. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:36,957][25689] Avg episode reward: [(0, '-51.494')] [2022-07-09 04:59:37,437][26022] Updated weights on worker 0-0, policy_version 98129 (0.00086) [2022-07-09 04:59:38,786][26022] Updated weights on worker 0-0, policy_version 98139 (0.00094) [2022-07-09 04:59:40,906][26022] Updated weights on worker 0-0, policy_version 98149 (0.00093) [2022-07-09 04:59:41,963][25689] Fps is (10 sec: 5875.5, 60 sec: 5718.8, 300 sec: 5717.0). Total num frames: 100511744. Throughput: 0: 5968.1. Samples: 100519974. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:41,965][25689] Avg episode reward: [(0, '-51.656')] [2022-07-09 04:59:42,508][26022] Updated weights on worker 0-0, policy_version 98159 (0.00083) [2022-07-09 04:59:44,377][26022] Updated weights on worker 0-0, policy_version 98169 (0.00088) [2022-07-09 04:59:46,072][26022] Updated weights on worker 0-0, policy_version 98179 (0.00094) [2022-07-09 04:59:47,026][25689] Fps is (10 sec: 5694.9, 60 sec: 5663.5, 300 sec: 5710.5). Total num frames: 100538368. Throughput: 0: 5968.4. Samples: 100537238. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:47,026][25689] Avg episode reward: [(0, '-52.071')] [2022-07-09 04:59:47,904][26022] Updated weights on worker 0-0, policy_version 98189 (0.00083) [2022-07-09 04:59:49,675][26022] Updated weights on worker 0-0, policy_version 98199 (0.00090) [2022-07-09 04:59:51,530][26022] Updated weights on worker 0-0, policy_version 98209 (0.00096) [2022-07-09 04:59:52,126][25689] Fps is (10 sec: 5642.8, 60 sec: 5693.2, 300 sec: 5719.6). Total num frames: 100569088. Throughput: 0: 5943.2. Samples: 100571512. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:52,126][25689] Avg episode reward: [(0, '-51.405')] [2022-07-09 04:59:53,246][26022] Updated weights on worker 0-0, policy_version 98219 (0.00093) [2022-07-09 04:59:55,008][26022] Updated weights on worker 0-0, policy_version 98229 (0.00090) [2022-07-09 04:59:56,970][26022] Updated weights on worker 0-0, policy_version 98239 (0.00092) [2022-07-09 04:59:57,161][25689] Fps is (10 sec: 5759.1, 60 sec: 5692.4, 300 sec: 5715.6). Total num frames: 100596736. Throughput: 0: 5971.6. Samples: 100605970. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 04:59:57,161][25689] Avg episode reward: [(0, '-52.453')] [2022-07-09 04:59:58,700][26022] Updated weights on worker 0-0, policy_version 98249 (0.00090) [2022-07-09 05:00:00,442][26022] Updated weights on worker 0-0, policy_version 98259 (0.00090) [2022-07-09 05:00:02,164][25689] Fps is (10 sec: 5406.2, 60 sec: 5659.9, 300 sec: 5716.0). Total num frames: 100623360. Throughput: 0: 5121.8. Samples: 100623440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:02,165][25689] Avg episode reward: [(0, '-51.988')] [2022-07-09 05:00:02,635][26022] Updated weights on worker 0-0, policy_version 98269 (0.00093) [2022-07-09 05:00:04,394][26022] Updated weights on worker 0-0, policy_version 98279 (0.00082) [2022-07-09 05:00:06,195][26022] Updated weights on worker 0-0, policy_version 98289 (0.00093) [2022-07-09 05:00:07,201][25689] Fps is (10 sec: 5711.1, 60 sec: 5710.3, 300 sec: 5719.7). Total num frames: 100654080. Throughput: 0: 5878.6. Samples: 100655842. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:07,202][25689] Avg episode reward: [(0, '-50.848')] [2022-07-09 05:00:08,020][26022] Updated weights on worker 0-0, policy_version 98299 (0.00091) [2022-07-09 05:00:09,662][26022] Updated weights on worker 0-0, policy_version 98309 (0.00096) [2022-07-09 05:00:11,652][26022] Updated weights on worker 0-0, policy_version 98319 (0.00083) [2022-07-09 05:00:12,235][25689] Fps is (10 sec: 5795.8, 60 sec: 5696.1, 300 sec: 5713.5). Total num frames: 100681728. Throughput: 0: 5902.4. Samples: 100690206. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:12,236][25689] Avg episode reward: [(0, '-50.118')] [2022-07-09 05:00:13,235][26022] Updated weights on worker 0-0, policy_version 98329 (0.00092) [2022-07-09 05:00:15,110][26022] Updated weights on worker 0-0, policy_version 98339 (0.00090) [2022-07-09 05:00:16,895][26022] Updated weights on worker 0-0, policy_version 98349 (0.00087) [2022-07-09 05:00:17,263][25689] Fps is (10 sec: 5699.5, 60 sec: 5695.7, 300 sec: 5716.7). Total num frames: 100711424. Throughput: 0: 5045.5. Samples: 100707394. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:17,263][25689] Avg episode reward: [(0, '-50.337')] [2022-07-09 05:00:18,675][26022] Updated weights on worker 0-0, policy_version 98359 (0.00094) [2022-07-09 05:00:20,341][26022] Updated weights on worker 0-0, policy_version 98369 (0.00087) [2022-07-09 05:00:22,064][26022] Updated weights on worker 0-0, policy_version 98379 (0.00083) [2022-07-09 05:00:22,343][25689] Fps is (10 sec: 5774.5, 60 sec: 5709.4, 300 sec: 5711.9). Total num frames: 100740096. Throughput: 0: 5893.6. Samples: 100742364. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:22,343][25689] Avg episode reward: [(0, '-50.532')] [2022-07-09 05:00:23,783][26022] Updated weights on worker 0-0, policy_version 98389 (0.00091) [2022-07-09 05:00:25,669][26022] Updated weights on worker 0-0, policy_version 98399 (0.00092) [2022-07-09 05:00:27,283][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:00:27,299][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000098408_100769792.pth [2022-07-09 05:00:27,303][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000096396_98709504.pth [2022-07-09 05:00:27,348][25689] Fps is (10 sec: 5787.4, 60 sec: 5698.9, 300 sec: 5716.2). Total num frames: 100769792. Throughput: 0: 6003.3. Samples: 100776788. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:27,348][25689] Avg episode reward: [(0, '-50.731')] [2022-07-09 05:00:27,472][26022] Updated weights on worker 0-0, policy_version 98409 (0.00092) [2022-07-09 05:00:29,348][26022] Updated weights on worker 0-0, policy_version 98419 (0.00081) [2022-07-09 05:00:31,292][26022] Updated weights on worker 0-0, policy_version 98429 (0.00092) [2022-07-09 05:00:32,404][25689] Fps is (10 sec: 5699.6, 60 sec: 5699.3, 300 sec: 5712.8). Total num frames: 100797440. Throughput: 0: 5143.9. Samples: 100793952. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:32,404][25689] Avg episode reward: [(0, '-51.606')] [2022-07-09 05:00:32,815][26022] Updated weights on worker 0-0, policy_version 98439 (0.00096) [2022-07-09 05:00:34,840][26022] Updated weights on worker 0-0, policy_version 98449 (0.00508) [2022-07-09 05:00:36,415][26022] Updated weights on worker 0-0, policy_version 98459 (0.00088) [2022-07-09 05:00:37,483][25689] Fps is (10 sec: 5455.9, 60 sec: 5684.5, 300 sec: 5708.7). Total num frames: 100825088. Throughput: 0: 5957.3. Samples: 100827854. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 05:00:37,484][25689] Avg episode reward: [(0, '-51.246')] [2022-07-09 05:00:38,400][26022] Updated weights on worker 0-0, policy_version 98469 (0.00096) [2022-07-09 05:00:40,187][26022] Updated weights on worker 0-0, policy_version 98479 (0.00087) [2022-07-09 05:00:41,976][26022] Updated weights on worker 0-0, policy_version 98489 (0.00085) [2022-07-09 05:00:42,507][25689] Fps is (10 sec: 5776.9, 60 sec: 5682.9, 300 sec: 5711.9). Total num frames: 100855808. Throughput: 0: 5939.9. Samples: 100862140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:00:42,508][25689] Avg episode reward: [(0, '-51.047')] [2022-07-09 05:00:43,643][26022] Updated weights on worker 0-0, policy_version 98499 (0.00083) [2022-07-09 05:00:45,499][26022] Updated weights on worker 0-0, policy_version 98509 (0.00088) [2022-07-09 05:00:47,082][26022] Updated weights on worker 0-0, policy_version 98519 (0.00091) [2022-07-09 05:00:47,603][25689] Fps is (10 sec: 5970.1, 60 sec: 5730.5, 300 sec: 5714.8). Total num frames: 100885504. Throughput: 0: 5072.0. Samples: 100879522. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:00:47,604][25689] Avg episode reward: [(0, '-51.548')] [2022-07-09 05:00:49,296][26022] Updated weights on worker 0-0, policy_version 98529 (0.00091) [2022-07-09 05:00:50,594][26022] Updated weights on worker 0-0, policy_version 98539 (0.00094) [2022-07-09 05:00:52,673][25689] Fps is (10 sec: 5540.3, 60 sec: 5665.6, 300 sec: 5706.7). Total num frames: 100912128. Throughput: 0: 5914.4. Samples: 100913832. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:00:52,674][25689] Avg episode reward: [(0, '-51.892')] [2022-07-09 05:00:52,897][26022] Updated weights on worker 0-0, policy_version 98549 (0.00086) [2022-07-09 05:00:54,401][26022] Updated weights on worker 0-0, policy_version 98559 (0.00086) [2022-07-09 05:00:56,323][26022] Updated weights on worker 0-0, policy_version 98569 (0.00086) [2022-07-09 05:00:57,742][25689] Fps is (10 sec: 5555.0, 60 sec: 5696.3, 300 sec: 5716.0). Total num frames: 100941824. Throughput: 0: 5922.1. Samples: 100947826. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:00:57,742][25689] Avg episode reward: [(0, '-51.219')] [2022-07-09 05:00:58,247][26022] Updated weights on worker 0-0, policy_version 98579 (0.00089) [2022-07-09 05:00:59,796][26022] Updated weights on worker 0-0, policy_version 98589 (0.00085) [2022-07-09 05:01:01,804][26022] Updated weights on worker 0-0, policy_version 98599 (0.00102) [2022-07-09 05:01:02,767][25689] Fps is (10 sec: 5579.4, 60 sec: 5694.2, 300 sec: 5705.5). Total num frames: 100968448. Throughput: 0: 5086.0. Samples: 100965182. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:02,768][25689] Avg episode reward: [(0, '-51.002')] [2022-07-09 05:01:03,959][26022] Updated weights on worker 0-0, policy_version 98609 (0.00093) [2022-07-09 05:01:05,712][26022] Updated weights on worker 0-0, policy_version 98619 (0.00095) [2022-07-09 05:01:07,710][26022] Updated weights on worker 0-0, policy_version 98629 (0.00098) [2022-07-09 05:01:07,808][25689] Fps is (10 sec: 5391.4, 60 sec: 5643.2, 300 sec: 5705.8). Total num frames: 100996096. Throughput: 0: 5793.2. Samples: 100996574. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:07,808][25689] Avg episode reward: [(0, '-52.212')] [2022-07-09 05:01:09,191][26022] Updated weights on worker 0-0, policy_version 98639 (0.00087) [2022-07-09 05:01:11,374][26022] Updated weights on worker 0-0, policy_version 98649 (0.00092) [2022-07-09 05:01:12,835][26022] Updated weights on worker 0-0, policy_version 98659 (0.00092) [2022-07-09 05:01:12,923][25689] Fps is (10 sec: 5747.1, 60 sec: 5686.2, 300 sec: 5710.8). Total num frames: 101026816. Throughput: 0: 5771.6. Samples: 101030708. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:12,924][25689] Avg episode reward: [(0, '-51.973')] [2022-07-09 05:01:14,827][26022] Updated weights on worker 0-0, policy_version 98669 (0.00088) [2022-07-09 05:01:16,666][26022] Updated weights on worker 0-0, policy_version 98679 (0.00085) [2022-07-09 05:01:17,963][25689] Fps is (10 sec: 5747.9, 60 sec: 5651.4, 300 sec: 5699.7). Total num frames: 101054464. Throughput: 0: 4951.9. Samples: 101047958. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:17,963][25689] Avg episode reward: [(0, '-52.170')] [2022-07-09 05:01:18,346][26022] Updated weights on worker 0-0, policy_version 98689 (0.00079) [2022-07-09 05:01:20,024][26022] Updated weights on worker 0-0, policy_version 98699 (0.00091) [2022-07-09 05:01:21,976][26022] Updated weights on worker 0-0, policy_version 98709 (0.00083) [2022-07-09 05:01:22,996][25689] Fps is (10 sec: 5692.9, 60 sec: 5672.6, 300 sec: 5706.6). Total num frames: 101084160. Throughput: 0: 5804.0. Samples: 101082590. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:22,997][25689] Avg episode reward: [(0, '-51.695')] [2022-07-09 05:01:23,653][26022] Updated weights on worker 0-0, policy_version 98719 (0.00083) [2022-07-09 05:01:25,475][26022] Updated weights on worker 0-0, policy_version 98729 (0.00079) [2022-07-09 05:01:27,262][26022] Updated weights on worker 0-0, policy_version 98739 (0.00098) [2022-07-09 05:01:28,049][25689] Fps is (10 sec: 5685.5, 60 sec: 5634.5, 300 sec: 5699.6). Total num frames: 101111808. Throughput: 0: 5940.6. Samples: 101116816. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:28,049][25689] Avg episode reward: [(0, '-52.331')] [2022-07-09 05:01:29,219][26022] Updated weights on worker 0-0, policy_version 98749 (0.00091) [2022-07-09 05:01:30,892][26022] Updated weights on worker 0-0, policy_version 98759 (0.00091) [2022-07-09 05:01:32,865][26022] Updated weights on worker 0-0, policy_version 98769 (0.00089) [2022-07-09 05:01:33,096][25689] Fps is (10 sec: 5576.5, 60 sec: 5652.2, 300 sec: 5698.8). Total num frames: 101140480. Throughput: 0: 5951.2. Samples: 101150758. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:33,097][25689] Avg episode reward: [(0, '-52.019')] [2022-07-09 05:01:34,604][26022] Updated weights on worker 0-0, policy_version 98779 (0.00087) [2022-07-09 05:01:36,415][26022] Updated weights on worker 0-0, policy_version 98789 (0.00090) [2022-07-09 05:01:38,119][25689] Fps is (10 sec: 5694.5, 60 sec: 5674.3, 300 sec: 5698.8). Total num frames: 101169152. Throughput: 0: 5934.7. Samples: 101167578. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:38,119][25689] Avg episode reward: [(0, '-51.084')] [2022-07-09 05:01:38,279][26022] Updated weights on worker 0-0, policy_version 98799 (0.00097) [2022-07-09 05:01:39,950][26022] Updated weights on worker 0-0, policy_version 98809 (0.00091) [2022-07-09 05:01:41,696][26022] Updated weights on worker 0-0, policy_version 98819 (0.00084) [2022-07-09 05:01:43,155][25689] Fps is (10 sec: 5802.7, 60 sec: 5656.3, 300 sec: 5701.9). Total num frames: 101198848. Throughput: 0: 5930.6. Samples: 101202140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:43,155][25689] Avg episode reward: [(0, '-50.924')] [2022-07-09 05:01:43,577][26022] Updated weights on worker 0-0, policy_version 98829 (0.00084) [2022-07-09 05:01:45,174][26022] Updated weights on worker 0-0, policy_version 98839 (0.00088) [2022-07-09 05:01:47,227][26022] Updated weights on worker 0-0, policy_version 98849 (0.00088) [2022-07-09 05:01:48,156][25689] Fps is (10 sec: 5916.9, 60 sec: 5665.1, 300 sec: 5700.0). Total num frames: 101228544. Throughput: 0: 5957.5. Samples: 101236608. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:48,157][25689] Avg episode reward: [(0, '-50.896')] [2022-07-09 05:01:48,803][26022] Updated weights on worker 0-0, policy_version 98859 (0.00095) [2022-07-09 05:01:50,645][26022] Updated weights on worker 0-0, policy_version 98869 (0.00093) [2022-07-09 05:01:52,382][26022] Updated weights on worker 0-0, policy_version 98879 (0.00084) [2022-07-09 05:01:53,198][25689] Fps is (10 sec: 5607.4, 60 sec: 5667.7, 300 sec: 5692.3). Total num frames: 101255168. Throughput: 0: 5131.0. Samples: 101253904. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:53,200][25689] Avg episode reward: [(0, '-50.743')] [2022-07-09 05:01:54,211][26022] Updated weights on worker 0-0, policy_version 98889 (0.00086) [2022-07-09 05:01:55,919][26022] Updated weights on worker 0-0, policy_version 98899 (0.00087) [2022-07-09 05:01:57,847][26022] Updated weights on worker 0-0, policy_version 98909 (0.00089) [2022-07-09 05:01:58,215][25689] Fps is (10 sec: 5599.4, 60 sec: 5672.6, 300 sec: 5699.3). Total num frames: 101284864. Throughput: 0: 6020.9. Samples: 101288574. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:01:58,216][25689] Avg episode reward: [(0, '-50.779')] [2022-07-09 05:01:59,470][26022] Updated weights on worker 0-0, policy_version 98919 (0.00091) [2022-07-09 05:02:01,335][26022] Updated weights on worker 0-0, policy_version 98929 (0.00091) [2022-07-09 05:02:03,219][25689] Fps is (10 sec: 5620.4, 60 sec: 5674.6, 300 sec: 5696.1). Total num frames: 101311488. Throughput: 0: 5925.5. Samples: 101321032. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:03,220][25689] Avg episode reward: [(0, '-51.100')] [2022-07-09 05:02:03,616][26022] Updated weights on worker 0-0, policy_version 98939 (0.00087) [2022-07-09 05:02:05,338][26022] Updated weights on worker 0-0, policy_version 98949 (0.00088) [2022-07-09 05:02:07,108][26022] Updated weights on worker 0-0, policy_version 98959 (0.00084) [2022-07-09 05:02:08,238][25689] Fps is (10 sec: 5516.5, 60 sec: 5693.5, 300 sec: 5696.9). Total num frames: 101340160. Throughput: 0: 5040.1. Samples: 101337824. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:08,240][25689] Avg episode reward: [(0, '-50.857')] [2022-07-09 05:02:09,186][26022] Updated weights on worker 0-0, policy_version 98969 (0.00093) [2022-07-09 05:02:10,590][26022] Updated weights on worker 0-0, policy_version 98979 (0.00336) [2022-07-09 05:02:12,719][26022] Updated weights on worker 0-0, policy_version 98989 (0.00096) [2022-07-09 05:02:13,364][25689] Fps is (10 sec: 5753.6, 60 sec: 5675.7, 300 sec: 5695.5). Total num frames: 101369856. Throughput: 0: 5871.7. Samples: 101372308. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:13,366][25689] Avg episode reward: [(0, '-50.982')] [2022-07-09 05:02:14,217][26022] Updated weights on worker 0-0, policy_version 98999 (0.00087) [2022-07-09 05:02:16,052][26022] Updated weights on worker 0-0, policy_version 99009 (0.00052) [2022-07-09 05:02:17,919][26022] Updated weights on worker 0-0, policy_version 99019 (0.00099) [2022-07-09 05:02:18,371][25689] Fps is (10 sec: 5659.6, 60 sec: 5678.7, 300 sec: 5699.3). Total num frames: 101397504. Throughput: 0: 5874.9. Samples: 101406988. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:18,371][25689] Avg episode reward: [(0, '-50.904')] [2022-07-09 05:02:19,670][26022] Updated weights on worker 0-0, policy_version 99029 (0.00084) [2022-07-09 05:02:21,359][26022] Updated weights on worker 0-0, policy_version 99039 (0.00081) [2022-07-09 05:02:23,103][26022] Updated weights on worker 0-0, policy_version 99049 (0.00089) [2022-07-09 05:02:23,399][25689] Fps is (10 sec: 5714.3, 60 sec: 5679.2, 300 sec: 5695.4). Total num frames: 101427200. Throughput: 0: 5135.0. Samples: 101424656. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:23,401][25689] Avg episode reward: [(0, '-51.174')] [2022-07-09 05:02:24,887][26022] Updated weights on worker 0-0, policy_version 99059 (0.00095) [2022-07-09 05:02:26,980][26022] Updated weights on worker 0-0, policy_version 99069 (0.00098) [2022-07-09 05:02:27,402][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:02:27,414][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000099072_101449728.pth [2022-07-09 05:02:27,415][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000097067_99396608.pth [2022-07-09 05:02:28,407][25689] Fps is (10 sec: 5815.4, 60 sec: 5700.3, 300 sec: 5697.5). Total num frames: 101455872. Throughput: 0: 6010.6. Samples: 101459052. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:28,408][25689] Avg episode reward: [(0, '-51.697')] [2022-07-09 05:02:28,463][26022] Updated weights on worker 0-0, policy_version 99079 (0.00092) [2022-07-09 05:02:30,564][26022] Updated weights on worker 0-0, policy_version 99089 (0.00092) [2022-07-09 05:02:32,072][26022] Updated weights on worker 0-0, policy_version 99099 (0.00091) [2022-07-09 05:02:33,520][25689] Fps is (10 sec: 5665.8, 60 sec: 5694.2, 300 sec: 5699.2). Total num frames: 101484544. Throughput: 0: 5984.2. Samples: 101492926. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 05:02:33,522][25689] Avg episode reward: [(0, '-52.624')] [2022-07-09 05:02:34,104][26022] Updated weights on worker 0-0, policy_version 99109 (0.00098) [2022-07-09 05:02:35,769][26022] Updated weights on worker 0-0, policy_version 99119 (0.00085) [2022-07-09 05:02:37,510][26022] Updated weights on worker 0-0, policy_version 99129 (0.00090) [2022-07-09 05:02:38,525][25689] Fps is (10 sec: 5667.5, 60 sec: 5695.8, 300 sec: 5688.9). Total num frames: 101513216. Throughput: 0: 5112.0. Samples: 101510020. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:02:38,527][25689] Avg episode reward: [(0, '-53.113')] [2022-07-09 05:02:39,315][26022] Updated weights on worker 0-0, policy_version 99139 (0.00090) [2022-07-09 05:02:41,114][26022] Updated weights on worker 0-0, policy_version 99149 (0.00085) [2022-07-09 05:02:42,819][26022] Updated weights on worker 0-0, policy_version 99159 (0.00081) [2022-07-09 05:02:43,535][25689] Fps is (10 sec: 5623.3, 60 sec: 5664.3, 300 sec: 5685.7). Total num frames: 101540864. Throughput: 0: 5954.6. Samples: 101544560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:02:43,537][25689] Avg episode reward: [(0, '-52.771')] [2022-07-09 05:02:44,665][26022] Updated weights on worker 0-0, policy_version 99169 (0.00086) [2022-07-09 05:02:46,362][26022] Updated weights on worker 0-0, policy_version 99179 (0.00087) [2022-07-09 05:02:48,309][26022] Updated weights on worker 0-0, policy_version 99189 (0.00089) [2022-07-09 05:02:48,542][25689] Fps is (10 sec: 5725.0, 60 sec: 5663.9, 300 sec: 5686.7). Total num frames: 101570560. Throughput: 0: 5968.4. Samples: 101579222. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:02:48,542][25689] Avg episode reward: [(0, '-52.175')] [2022-07-09 05:02:50,030][26022] Updated weights on worker 0-0, policy_version 99199 (0.00087) [2022-07-09 05:02:51,876][26022] Updated weights on worker 0-0, policy_version 99209 (0.00089) [2022-07-09 05:02:53,354][26022] Updated weights on worker 0-0, policy_version 99219 (0.00088) [2022-07-09 05:02:53,670][25689] Fps is (10 sec: 5961.2, 60 sec: 5723.5, 300 sec: 5691.2). Total num frames: 101601280. Throughput: 0: 5141.1. Samples: 101596520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:02:53,671][25689] Avg episode reward: [(0, '-52.357')] [2022-07-09 05:02:55,343][26022] Updated weights on worker 0-0, policy_version 99229 (0.00088) [2022-07-09 05:02:56,959][26022] Updated weights on worker 0-0, policy_version 99239 (0.00590) [2022-07-09 05:02:58,708][25689] Fps is (10 sec: 5741.2, 60 sec: 5687.6, 300 sec: 5691.0). Total num frames: 101628928. Throughput: 0: 6004.3. Samples: 101631204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:02:58,710][25689] Avg episode reward: [(0, '-51.546')] [2022-07-09 05:02:58,830][26022] Updated weights on worker 0-0, policy_version 99249 (0.00092) [2022-07-09 05:03:00,543][26022] Updated weights on worker 0-0, policy_version 99259 (0.00089) [2022-07-09 05:03:02,816][26022] Updated weights on worker 0-0, policy_version 99269 (0.00093) [2022-07-09 05:03:03,731][25689] Fps is (10 sec: 5496.3, 60 sec: 5702.8, 300 sec: 5684.4). Total num frames: 101656576. Throughput: 0: 5892.8. Samples: 101663568. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:03,731][25689] Avg episode reward: [(0, '-51.036')] [2022-07-09 05:03:04,515][26022] Updated weights on worker 0-0, policy_version 99279 (0.00092) [2022-07-09 05:03:06,559][26022] Updated weights on worker 0-0, policy_version 99289 (0.00099) [2022-07-09 05:03:08,247][26022] Updated weights on worker 0-0, policy_version 99299 (0.00084) [2022-07-09 05:03:08,759][25689] Fps is (10 sec: 5501.8, 60 sec: 5685.1, 300 sec: 5685.3). Total num frames: 101684224. Throughput: 0: 5027.9. Samples: 101680870. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:08,760][25689] Avg episode reward: [(0, '-50.636')] [2022-07-09 05:03:10,156][26022] Updated weights on worker 0-0, policy_version 99309 (0.00090) [2022-07-09 05:03:11,847][26022] Updated weights on worker 0-0, policy_version 99319 (0.00091) [2022-07-09 05:03:13,596][26022] Updated weights on worker 0-0, policy_version 99329 (0.00086) [2022-07-09 05:03:13,912][25689] Fps is (10 sec: 5632.2, 60 sec: 5682.5, 300 sec: 5686.6). Total num frames: 101713920. Throughput: 0: 5850.5. Samples: 101714946. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:13,913][25689] Avg episode reward: [(0, '-51.540')] [2022-07-09 05:03:15,555][26022] Updated weights on worker 0-0, policy_version 99339 (0.00081) [2022-07-09 05:03:17,148][26022] Updated weights on worker 0-0, policy_version 99349 (0.00090) [2022-07-09 05:03:18,936][25689] Fps is (10 sec: 5735.0, 60 sec: 5697.7, 300 sec: 5690.0). Total num frames: 101742592. Throughput: 0: 5848.5. Samples: 101749508. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:18,938][25689] Avg episode reward: [(0, '-51.841')] [2022-07-09 05:03:19,104][26022] Updated weights on worker 0-0, policy_version 99359 (0.00082) [2022-07-09 05:03:20,652][26022] Updated weights on worker 0-0, policy_version 99369 (0.00096) [2022-07-09 05:03:22,554][26022] Updated weights on worker 0-0, policy_version 99379 (0.00089) [2022-07-09 05:03:23,960][25689] Fps is (10 sec: 5910.7, 60 sec: 5715.0, 300 sec: 5686.2). Total num frames: 101773312. Throughput: 0: 5112.9. Samples: 101767000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:23,961][25689] Avg episode reward: [(0, '-51.667')] [2022-07-09 05:03:24,126][26022] Updated weights on worker 0-0, policy_version 99389 (0.00088) [2022-07-09 05:03:26,048][26022] Updated weights on worker 0-0, policy_version 99399 (0.00094) [2022-07-09 05:03:27,810][26022] Updated weights on worker 0-0, policy_version 99409 (0.00089) [2022-07-09 05:03:28,971][25689] Fps is (10 sec: 5816.8, 60 sec: 5698.0, 300 sec: 5687.8). Total num frames: 101800960. Throughput: 0: 5968.2. Samples: 101801498. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:28,973][25689] Avg episode reward: [(0, '-51.423')] [2022-07-09 05:03:29,667][26022] Updated weights on worker 0-0, policy_version 99419 (0.00096) [2022-07-09 05:03:31,347][26022] Updated weights on worker 0-0, policy_version 99429 (0.00088) [2022-07-09 05:03:33,184][26022] Updated weights on worker 0-0, policy_version 99439 (0.00084) [2022-07-09 05:03:34,043][25689] Fps is (10 sec: 5687.3, 60 sec: 5718.6, 300 sec: 5687.4). Total num frames: 101830656. Throughput: 0: 6020.0. Samples: 101836136. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:34,044][25689] Avg episode reward: [(0, '-51.030')] [2022-07-09 05:03:35,019][26022] Updated weights on worker 0-0, policy_version 99449 (0.00082) [2022-07-09 05:03:36,736][26022] Updated weights on worker 0-0, policy_version 99459 (0.00088) [2022-07-09 05:03:38,673][26022] Updated weights on worker 0-0, policy_version 99469 (0.00091) [2022-07-09 05:03:39,139][25689] Fps is (10 sec: 5639.8, 60 sec: 5693.3, 300 sec: 5685.9). Total num frames: 101858304. Throughput: 0: 5137.8. Samples: 101853302. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:39,139][25689] Avg episode reward: [(0, '-50.669')] [2022-07-09 05:03:40,358][26022] Updated weights on worker 0-0, policy_version 99479 (0.00093) [2022-07-09 05:03:42,170][26022] Updated weights on worker 0-0, policy_version 99489 (0.00087) [2022-07-09 05:03:43,753][26022] Updated weights on worker 0-0, policy_version 99499 (0.00096) [2022-07-09 05:03:44,168][25689] Fps is (10 sec: 5764.7, 60 sec: 5742.1, 300 sec: 5689.0). Total num frames: 101889024. Throughput: 0: 5983.7. Samples: 101887918. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:44,169][25689] Avg episode reward: [(0, '-50.186')] [2022-07-09 05:03:45,828][26022] Updated weights on worker 0-0, policy_version 99509 (0.00080) [2022-07-09 05:03:47,407][26022] Updated weights on worker 0-0, policy_version 99519 (0.00092) [2022-07-09 05:03:49,207][25689] Fps is (10 sec: 5796.9, 60 sec: 5705.3, 300 sec: 5685.9). Total num frames: 101916672. Throughput: 0: 5977.9. Samples: 101922470. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:49,208][25689] Avg episode reward: [(0, '-50.457')] [2022-07-09 05:03:49,263][26022] Updated weights on worker 0-0, policy_version 99529 (0.00087) [2022-07-09 05:03:51,128][26022] Updated weights on worker 0-0, policy_version 99539 (0.00093) [2022-07-09 05:03:52,787][26022] Updated weights on worker 0-0, policy_version 99549 (0.00083) [2022-07-09 05:03:54,323][25689] Fps is (10 sec: 5646.8, 60 sec: 5689.5, 300 sec: 5691.1). Total num frames: 101946368. Throughput: 0: 5083.4. Samples: 101939228. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:54,331][25689] Avg episode reward: [(0, '-50.451')] [2022-07-09 05:03:54,601][26022] Updated weights on worker 0-0, policy_version 99559 (0.00082) [2022-07-09 05:03:56,465][26022] Updated weights on worker 0-0, policy_version 99569 (0.00086) [2022-07-09 05:03:58,151][26022] Updated weights on worker 0-0, policy_version 99579 (0.00089) [2022-07-09 05:03:59,340][25689] Fps is (10 sec: 5659.6, 60 sec: 5691.6, 300 sec: 5687.7). Total num frames: 101974016. Throughput: 0: 5956.5. Samples: 101973630. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:03:59,340][25689] Avg episode reward: [(0, '-50.449')] [2022-07-09 05:03:59,935][26022] Updated weights on worker 0-0, policy_version 99589 (0.00080) [2022-07-09 05:04:02,224][26022] Updated weights on worker 0-0, policy_version 99599 (0.00113) [2022-07-09 05:04:03,804][26022] Updated weights on worker 0-0, policy_version 99609 (0.00089) [2022-07-09 05:04:04,355][25689] Fps is (10 sec: 5614.0, 60 sec: 5709.1, 300 sec: 5691.5). Total num frames: 102002688. Throughput: 0: 5851.6. Samples: 102006044. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:04,356][25689] Avg episode reward: [(0, '-51.130')] [2022-07-09 05:04:05,778][26022] Updated weights on worker 0-0, policy_version 99619 (0.00085) [2022-07-09 05:04:07,399][26022] Updated weights on worker 0-0, policy_version 99629 (0.00087) [2022-07-09 05:04:09,326][26022] Updated weights on worker 0-0, policy_version 99639 (0.00087) [2022-07-09 05:04:09,367][25689] Fps is (10 sec: 5616.7, 60 sec: 5710.7, 300 sec: 5689.0). Total num frames: 102030336. Throughput: 0: 4998.2. Samples: 102023230. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:09,367][25689] Avg episode reward: [(0, '-51.471')] [2022-07-09 05:04:11,095][26022] Updated weights on worker 0-0, policy_version 99649 (0.00085) [2022-07-09 05:04:12,988][26022] Updated weights on worker 0-0, policy_version 99659 (0.00092) [2022-07-09 05:04:14,467][25689] Fps is (10 sec: 5570.1, 60 sec: 5698.8, 300 sec: 5684.1). Total num frames: 102059008. Throughput: 0: 5856.9. Samples: 102057204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:14,467][25689] Avg episode reward: [(0, '-52.357')] [2022-07-09 05:04:14,887][26022] Updated weights on worker 0-0, policy_version 99669 (0.00087) [2022-07-09 05:04:16,493][26022] Updated weights on worker 0-0, policy_version 99679 (0.00091) [2022-07-09 05:04:18,498][26022] Updated weights on worker 0-0, policy_version 99689 (0.00087) [2022-07-09 05:04:19,507][25689] Fps is (10 sec: 5756.0, 60 sec: 5714.2, 300 sec: 5691.1). Total num frames: 102088704. Throughput: 0: 5853.4. Samples: 102091678. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:19,508][25689] Avg episode reward: [(0, '-52.747')] [2022-07-09 05:04:19,905][26022] Updated weights on worker 0-0, policy_version 99699 (0.00095) [2022-07-09 05:04:21,821][26022] Updated weights on worker 0-0, policy_version 99709 (0.00093) [2022-07-09 05:04:23,700][26022] Updated weights on worker 0-0, policy_version 99719 (0.00092) [2022-07-09 05:04:24,597][25689] Fps is (10 sec: 5761.5, 60 sec: 5674.2, 300 sec: 5683.9). Total num frames: 102117376. Throughput: 0: 5935.5. Samples: 102126190. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:24,598][25689] Avg episode reward: [(0, '-52.526')] [2022-07-09 05:04:25,515][26022] Updated weights on worker 0-0, policy_version 99729 (0.00092) [2022-07-09 05:04:27,180][26022] Updated weights on worker 0-0, policy_version 99739 (0.00085) [2022-07-09 05:04:27,527][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:04:27,541][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000099741_102134784.pth [2022-07-09 05:04:27,541][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000097739_100084736.pth [2022-07-09 05:04:29,080][26022] Updated weights on worker 0-0, policy_version 99749 (0.00089) [2022-07-09 05:04:29,676][25689] Fps is (10 sec: 5739.5, 60 sec: 5701.5, 300 sec: 5690.4). Total num frames: 102147072. Throughput: 0: 5918.2. Samples: 102143428. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:29,677][25689] Avg episode reward: [(0, '-51.847')] [2022-07-09 05:04:30,600][26022] Updated weights on worker 0-0, policy_version 99759 (0.00094) [2022-07-09 05:04:32,615][26022] Updated weights on worker 0-0, policy_version 99769 (0.00087) [2022-07-09 05:04:34,133][26022] Updated weights on worker 0-0, policy_version 99779 (0.00070) [2022-07-09 05:04:34,799][25689] Fps is (10 sec: 5621.3, 60 sec: 5663.1, 300 sec: 5686.6). Total num frames: 102174720. Throughput: 0: 5935.2. Samples: 102177878. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 05:04:34,799][25689] Avg episode reward: [(0, '-51.613')] [2022-07-09 05:04:36,140][26022] Updated weights on worker 0-0, policy_version 99789 (0.00090) [2022-07-09 05:04:37,998][26022] Updated weights on worker 0-0, policy_version 99799 (0.00087) [2022-07-09 05:04:39,550][26022] Updated weights on worker 0-0, policy_version 99809 (0.00089) [2022-07-09 05:04:39,873][25689] Fps is (10 sec: 5724.2, 60 sec: 5715.6, 300 sec: 5685.3). Total num frames: 102205440. Throughput: 0: 5926.8. Samples: 102212384. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:04:39,874][25689] Avg episode reward: [(0, '-51.509')] [2022-07-09 05:04:41,643][26022] Updated weights on worker 0-0, policy_version 99819 (0.00089) [2022-07-09 05:04:43,151][26022] Updated weights on worker 0-0, policy_version 99829 (0.00099) [2022-07-09 05:04:44,943][25689] Fps is (10 sec: 5854.6, 60 sec: 5678.2, 300 sec: 5692.0). Total num frames: 102234112. Throughput: 0: 5092.2. Samples: 102229794. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:04:44,944][25689] Avg episode reward: [(0, '-50.453')] [2022-07-09 05:04:45,045][26022] Updated weights on worker 0-0, policy_version 99839 (0.00093) [2022-07-09 05:04:46,863][26022] Updated weights on worker 0-0, policy_version 99849 (0.00094) [2022-07-09 05:04:48,599][26022] Updated weights on worker 0-0, policy_version 99859 (0.00094) [2022-07-09 05:04:50,030][25689] Fps is (10 sec: 5646.1, 60 sec: 5690.6, 300 sec: 5685.4). Total num frames: 102262784. Throughput: 0: 5943.0. Samples: 102264386. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:04:50,035][25689] Avg episode reward: [(0, '-51.527')] [2022-07-09 05:04:50,528][26022] Updated weights on worker 0-0, policy_version 99869 (0.00093) [2022-07-09 05:04:51,966][26022] Updated weights on worker 0-0, policy_version 99879 (0.00086) [2022-07-09 05:04:54,036][26022] Updated weights on worker 0-0, policy_version 99889 (0.00095) [2022-07-09 05:04:55,128][25689] Fps is (10 sec: 5730.8, 60 sec: 5692.2, 300 sec: 5691.0). Total num frames: 102292480. Throughput: 0: 5946.9. Samples: 102298774. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:04:55,128][25689] Avg episode reward: [(0, '-51.190')] [2022-07-09 05:04:55,752][26022] Updated weights on worker 0-0, policy_version 99899 (0.00092) [2022-07-09 05:04:57,613][26022] Updated weights on worker 0-0, policy_version 99909 (0.00083) [2022-07-09 05:04:59,250][26022] Updated weights on worker 0-0, policy_version 99919 (0.00090) [2022-07-09 05:05:00,148][25689] Fps is (10 sec: 5768.6, 60 sec: 5708.7, 300 sec: 5697.6). Total num frames: 102321152. Throughput: 0: 5103.2. Samples: 102315850. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:00,148][25689] Avg episode reward: [(0, '-52.012')] [2022-07-09 05:05:01,171][26022] Updated weights on worker 0-0, policy_version 99929 (0.00088) [2022-07-09 05:05:03,243][26022] Updated weights on worker 0-0, policy_version 99939 (0.00086) [2022-07-09 05:05:05,163][25689] Fps is (10 sec: 5408.4, 60 sec: 5658.3, 300 sec: 5680.8). Total num frames: 102346752. Throughput: 0: 5873.9. Samples: 102348562. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:05,163][25689] Avg episode reward: [(0, '-51.713')] [2022-07-09 05:05:05,186][26022] Updated weights on worker 0-0, policy_version 99949 (0.00085) [2022-07-09 05:05:06,761][26022] Updated weights on worker 0-0, policy_version 99959 (0.00095) [2022-07-09 05:05:08,624][26022] Updated weights on worker 0-0, policy_version 99969 (0.00101) [2022-07-09 05:05:10,189][25689] Fps is (10 sec: 5608.9, 60 sec: 5707.4, 300 sec: 5691.3). Total num frames: 102377472. Throughput: 0: 5880.0. Samples: 102382924. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:10,190][25689] Avg episode reward: [(0, '-51.575')] [2022-07-09 05:05:10,331][26022] Updated weights on worker 0-0, policy_version 99979 (0.00097) [2022-07-09 05:05:12,240][26022] Updated weights on worker 0-0, policy_version 99989 (0.00087) [2022-07-09 05:05:13,849][26022] Updated weights on worker 0-0, policy_version 99999 (0.00095) [2022-07-09 05:05:15,239][25689] Fps is (10 sec: 5792.8, 60 sec: 5695.2, 300 sec: 5684.0). Total num frames: 102405120. Throughput: 0: 5046.9. Samples: 102400270. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:15,239][25689] Avg episode reward: [(0, '-51.657')] [2022-07-09 05:05:15,814][26022] Updated weights on worker 0-0, policy_version 100009 (0.00086) [2022-07-09 05:05:17,587][26022] Updated weights on worker 0-0, policy_version 100019 (0.00087) [2022-07-09 05:05:19,299][26022] Updated weights on worker 0-0, policy_version 100029 (0.00088) [2022-07-09 05:05:20,244][25689] Fps is (10 sec: 5703.0, 60 sec: 5698.5, 300 sec: 5688.9). Total num frames: 102434816. Throughput: 0: 5918.4. Samples: 102434788. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:20,244][25689] Avg episode reward: [(0, '-50.773')] [2022-07-09 05:05:20,932][26022] Updated weights on worker 0-0, policy_version 100039 (0.00091) [2022-07-09 05:05:22,732][26022] Updated weights on worker 0-0, policy_version 100049 (0.00102) [2022-07-09 05:05:24,790][26022] Updated weights on worker 0-0, policy_version 100059 (0.00094) [2022-07-09 05:05:25,249][25689] Fps is (10 sec: 5626.2, 60 sec: 5672.8, 300 sec: 5678.6). Total num frames: 102461440. Throughput: 0: 6019.3. Samples: 102469468. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:25,250][25689] Avg episode reward: [(0, '-50.739')] [2022-07-09 05:05:26,308][26022] Updated weights on worker 0-0, policy_version 100069 (0.00080) [2022-07-09 05:05:28,290][26022] Updated weights on worker 0-0, policy_version 100079 (0.00088) [2022-07-09 05:05:29,802][26022] Updated weights on worker 0-0, policy_version 100089 (0.00090) [2022-07-09 05:05:30,283][25689] Fps is (10 sec: 5712.3, 60 sec: 5693.9, 300 sec: 5689.3). Total num frames: 102492160. Throughput: 0: 5167.3. Samples: 102486754. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:30,283][25689] Avg episode reward: [(0, '-51.035')] [2022-07-09 05:05:31,824][26022] Updated weights on worker 0-0, policy_version 100099 (0.00088) [2022-07-09 05:05:33,645][26022] Updated weights on worker 0-0, policy_version 100109 (0.00093) [2022-07-09 05:05:35,327][25689] Fps is (10 sec: 5893.5, 60 sec: 5718.2, 300 sec: 5693.4). Total num frames: 102520832. Throughput: 0: 6045.6. Samples: 102521714. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:35,327][25689] Avg episode reward: [(0, '-50.837')] [2022-07-09 05:05:35,350][26022] Updated weights on worker 0-0, policy_version 100119 (0.00100) [2022-07-09 05:05:37,045][26022] Updated weights on worker 0-0, policy_version 100129 (0.00085) [2022-07-09 05:05:38,732][26022] Updated weights on worker 0-0, policy_version 100139 (0.00096) [2022-07-09 05:05:40,341][25689] Fps is (10 sec: 5803.0, 60 sec: 5707.0, 300 sec: 5690.2). Total num frames: 102550528. Throughput: 0: 6021.8. Samples: 102555808. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:40,341][25689] Avg episode reward: [(0, '-51.563')] [2022-07-09 05:05:40,815][26022] Updated weights on worker 0-0, policy_version 100149 (0.00094) [2022-07-09 05:05:42,502][26022] Updated weights on worker 0-0, policy_version 100159 (0.00092) [2022-07-09 05:05:44,254][26022] Updated weights on worker 0-0, policy_version 100169 (0.00088) [2022-07-09 05:05:45,355][25689] Fps is (10 sec: 5820.3, 60 sec: 5712.3, 300 sec: 5688.3). Total num frames: 102579200. Throughput: 0: 5145.1. Samples: 102572914. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:45,355][25689] Avg episode reward: [(0, '-51.596')] [2022-07-09 05:05:45,898][26022] Updated weights on worker 0-0, policy_version 100179 (0.00099) [2022-07-09 05:05:47,778][26022] Updated weights on worker 0-0, policy_version 100189 (0.00087) [2022-07-09 05:05:49,481][26022] Updated weights on worker 0-0, policy_version 100199 (0.00086) [2022-07-09 05:05:50,390][25689] Fps is (10 sec: 5502.9, 60 sec: 5683.2, 300 sec: 5689.0). Total num frames: 102605824. Throughput: 0: 6013.1. Samples: 102607658. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:50,390][25689] Avg episode reward: [(0, '-52.123')] [2022-07-09 05:05:51,368][26022] Updated weights on worker 0-0, policy_version 100209 (0.00098) [2022-07-09 05:05:53,155][26022] Updated weights on worker 0-0, policy_version 100219 (0.00082) [2022-07-09 05:05:54,917][26022] Updated weights on worker 0-0, policy_version 100229 (0.00085) [2022-07-09 05:05:55,426][25689] Fps is (10 sec: 5694.2, 60 sec: 5706.1, 300 sec: 5693.0). Total num frames: 102636544. Throughput: 0: 5979.7. Samples: 102641900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:05:55,426][25689] Avg episode reward: [(0, '-52.317')] [2022-07-09 05:05:56,727][26022] Updated weights on worker 0-0, policy_version 100239 (0.00088) [2022-07-09 05:05:58,395][26022] Updated weights on worker 0-0, policy_version 100249 (0.00392) [2022-07-09 05:06:00,151][26022] Updated weights on worker 0-0, policy_version 100259 (0.00086) [2022-07-09 05:06:00,439][25689] Fps is (10 sec: 5909.8, 60 sec: 5706.7, 300 sec: 5700.1). Total num frames: 102665216. Throughput: 0: 5149.9. Samples: 102659312. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:00,440][25689] Avg episode reward: [(0, '-51.908')] [2022-07-09 05:06:02,344][26022] Updated weights on worker 0-0, policy_version 100269 (0.00100) [2022-07-09 05:06:04,192][26022] Updated weights on worker 0-0, policy_version 100279 (0.00081) [2022-07-09 05:06:05,461][25689] Fps is (10 sec: 5407.9, 60 sec: 5706.0, 300 sec: 5693.6). Total num frames: 102690816. Throughput: 0: 5904.8. Samples: 102691638. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:05,462][25689] Avg episode reward: [(0, '-52.699')] [2022-07-09 05:06:06,040][26022] Updated weights on worker 0-0, policy_version 100289 (0.00086) [2022-07-09 05:06:07,884][26022] Updated weights on worker 0-0, policy_version 100299 (0.00088) [2022-07-09 05:06:09,527][26022] Updated weights on worker 0-0, policy_version 100309 (0.00083) [2022-07-09 05:06:10,495][25689] Fps is (10 sec: 5397.3, 60 sec: 5671.4, 300 sec: 5688.3). Total num frames: 102719488. Throughput: 0: 5864.4. Samples: 102725562. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:10,497][25689] Avg episode reward: [(0, '-52.302')] [2022-07-09 05:06:11,833][26022] Updated weights on worker 0-0, policy_version 100319 (0.00086) [2022-07-09 05:06:13,176][26022] Updated weights on worker 0-0, policy_version 100329 (0.00087) [2022-07-09 05:06:15,319][26022] Updated weights on worker 0-0, policy_version 100339 (0.00085) [2022-07-09 05:06:15,564][25689] Fps is (10 sec: 5777.5, 60 sec: 5703.5, 300 sec: 5694.6). Total num frames: 102749184. Throughput: 0: 4985.8. Samples: 102742306. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:15,566][25689] Avg episode reward: [(0, '-53.278')] [2022-07-09 05:06:16,737][26022] Updated weights on worker 0-0, policy_version 100349 (0.00081) [2022-07-09 05:06:18,661][26022] Updated weights on worker 0-0, policy_version 100359 (0.00091) [2022-07-09 05:06:20,560][26022] Updated weights on worker 0-0, policy_version 100369 (0.00084) [2022-07-09 05:06:20,582][25689] Fps is (10 sec: 5786.5, 60 sec: 5685.3, 300 sec: 5691.5). Total num frames: 102777856. Throughput: 0: 5824.2. Samples: 102776626. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:20,583][25689] Avg episode reward: [(0, '-53.198')] [2022-07-09 05:06:22,180][26022] Updated weights on worker 0-0, policy_version 100379 (0.00084) [2022-07-09 05:06:24,072][26022] Updated weights on worker 0-0, policy_version 100389 (0.00083) [2022-07-09 05:06:25,586][25689] Fps is (10 sec: 5823.9, 60 sec: 5736.3, 300 sec: 5699.3). Total num frames: 102807552. Throughput: 0: 5960.4. Samples: 102811590. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:25,587][25689] Avg episode reward: [(0, '-53.335')] [2022-07-09 05:06:25,746][26022] Updated weights on worker 0-0, policy_version 100399 (0.00099) [2022-07-09 05:06:27,651][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:06:27,662][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000100409_102818816.pth [2022-07-09 05:06:27,662][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000098408_100769792.pth [2022-07-09 05:06:27,674][26022] Updated weights on worker 0-0, policy_version 100409 (0.00089) [2022-07-09 05:06:29,501][26022] Updated weights on worker 0-0, policy_version 100419 (0.00100) [2022-07-09 05:06:30,589][25689] Fps is (10 sec: 5730.3, 60 sec: 5688.3, 300 sec: 5696.7). Total num frames: 102835200. Throughput: 0: 5128.2. Samples: 102828608. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:30,591][25689] Avg episode reward: [(0, '-53.788')] [2022-07-09 05:06:31,375][26022] Updated weights on worker 0-0, policy_version 100429 (0.00090) [2022-07-09 05:06:32,900][26022] Updated weights on worker 0-0, policy_version 100439 (0.00094) [2022-07-09 05:06:34,860][26022] Updated weights on worker 0-0, policy_version 100449 (0.00088) [2022-07-09 05:06:35,678][25689] Fps is (10 sec: 5682.0, 60 sec: 5701.0, 300 sec: 5698.8). Total num frames: 102864896. Throughput: 0: 5992.7. Samples: 102862846. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-09 05:06:35,679][25689] Avg episode reward: [(0, '-53.943')] [2022-07-09 05:06:36,537][26022] Updated weights on worker 0-0, policy_version 100459 (0.00094) [2022-07-09 05:06:38,445][26022] Updated weights on worker 0-0, policy_version 100469 (0.00103) [2022-07-09 05:06:40,177][26022] Updated weights on worker 0-0, policy_version 100479 (0.00093) [2022-07-09 05:06:40,710][25689] Fps is (10 sec: 5665.5, 60 sec: 5665.4, 300 sec: 5692.0). Total num frames: 102892544. Throughput: 0: 5985.6. Samples: 102897108. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:06:40,711][25689] Avg episode reward: [(0, '-53.557')] [2022-07-09 05:06:41,953][26022] Updated weights on worker 0-0, policy_version 100489 (0.00086) [2022-07-09 05:06:43,739][26022] Updated weights on worker 0-0, policy_version 100499 (0.00092) [2022-07-09 05:06:45,594][26022] Updated weights on worker 0-0, policy_version 100509 (0.00089) [2022-07-09 05:06:45,722][25689] Fps is (10 sec: 5607.8, 60 sec: 5665.6, 300 sec: 5688.4). Total num frames: 102921216. Throughput: 0: 5105.8. Samples: 102914398. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:06:45,723][25689] Avg episode reward: [(0, '-53.707')] [2022-07-09 05:06:47,316][26022] Updated weights on worker 0-0, policy_version 100519 (0.00084) [2022-07-09 05:06:49,259][26022] Updated weights on worker 0-0, policy_version 100529 (0.00095) [2022-07-09 05:06:50,731][25689] Fps is (10 sec: 5824.7, 60 sec: 5718.9, 300 sec: 5699.4). Total num frames: 102950912. Throughput: 0: 5990.0. Samples: 102949260. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:06:50,732][25689] Avg episode reward: [(0, '-53.314')] [2022-07-09 05:06:50,746][26022] Updated weights on worker 0-0, policy_version 100539 (0.00090) [2022-07-09 05:06:52,862][26022] Updated weights on worker 0-0, policy_version 100549 (0.00086) [2022-07-09 05:06:54,479][26022] Updated weights on worker 0-0, policy_version 100559 (0.00090) [2022-07-09 05:06:55,775][25689] Fps is (10 sec: 5805.7, 60 sec: 5684.2, 300 sec: 5695.4). Total num frames: 102979584. Throughput: 0: 6004.1. Samples: 102983508. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:06:55,776][25689] Avg episode reward: [(0, '-52.857')] [2022-07-09 05:06:56,371][26022] Updated weights on worker 0-0, policy_version 100569 (0.00087) [2022-07-09 05:06:58,048][26022] Updated weights on worker 0-0, policy_version 100579 (0.00087) [2022-07-09 05:06:59,786][26022] Updated weights on worker 0-0, policy_version 100589 (0.00082) [2022-07-09 05:07:00,783][25689] Fps is (10 sec: 5705.0, 60 sec: 5684.8, 300 sec: 5702.2). Total num frames: 103008256. Throughput: 0: 5160.3. Samples: 103000688. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:00,784][25689] Avg episode reward: [(0, '-51.984')] [2022-07-09 05:07:01,608][26022] Updated weights on worker 0-0, policy_version 100599 (0.00088) [2022-07-09 05:07:03,787][26022] Updated weights on worker 0-0, policy_version 100609 (0.00087) [2022-07-09 05:07:05,336][26022] Updated weights on worker 0-0, policy_version 100619 (0.00082) [2022-07-09 05:07:05,794][25689] Fps is (10 sec: 5621.4, 60 sec: 5719.8, 300 sec: 5698.9). Total num frames: 103035904. Throughput: 0: 5928.4. Samples: 103033396. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:05,795][25689] Avg episode reward: [(0, '-51.816')] [2022-07-09 05:07:07,359][26022] Updated weights on worker 0-0, policy_version 100629 (0.00089) [2022-07-09 05:07:08,999][26022] Updated weights on worker 0-0, policy_version 100639 (0.00088) [2022-07-09 05:07:10,807][25689] Fps is (10 sec: 5516.0, 60 sec: 5704.7, 300 sec: 5694.2). Total num frames: 103063552. Throughput: 0: 5908.0. Samples: 103067870. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:10,808][25689] Avg episode reward: [(0, '-51.382')] [2022-07-09 05:07:10,884][26022] Updated weights on worker 0-0, policy_version 100649 (0.00088) [2022-07-09 05:07:12,704][26022] Updated weights on worker 0-0, policy_version 100659 (0.00093) [2022-07-09 05:07:14,300][26022] Updated weights on worker 0-0, policy_version 100669 (0.00089) [2022-07-09 05:07:15,907][25689] Fps is (10 sec: 5569.3, 60 sec: 5684.9, 300 sec: 5695.8). Total num frames: 103092224. Throughput: 0: 5882.4. Samples: 103101930. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:15,907][25689] Avg episode reward: [(0, '-51.358')] [2022-07-09 05:07:16,241][26022] Updated weights on worker 0-0, policy_version 100679 (0.00079) [2022-07-09 05:07:18,164][26022] Updated weights on worker 0-0, policy_version 100689 (0.00084) [2022-07-09 05:07:19,715][26022] Updated weights on worker 0-0, policy_version 100699 (0.00080) [2022-07-09 05:07:20,930][25689] Fps is (10 sec: 5665.0, 60 sec: 5684.3, 300 sec: 5692.5). Total num frames: 103120896. Throughput: 0: 5881.7. Samples: 103119188. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:20,930][25689] Avg episode reward: [(0, '-51.555')] [2022-07-09 05:07:21,909][26022] Updated weights on worker 0-0, policy_version 100709 (0.00090) [2022-07-09 05:07:23,143][26022] Updated weights on worker 0-0, policy_version 100719 (0.00094) [2022-07-09 05:07:25,389][26022] Updated weights on worker 0-0, policy_version 100729 (0.00091) [2022-07-09 05:07:25,932][25689] Fps is (10 sec: 5822.3, 60 sec: 5684.6, 300 sec: 5696.1). Total num frames: 103150592. Throughput: 0: 5974.6. Samples: 103153710. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:25,932][25689] Avg episode reward: [(0, '-51.105')] [2022-07-09 05:07:26,863][26022] Updated weights on worker 0-0, policy_version 100739 (0.00088) [2022-07-09 05:07:28,751][26022] Updated weights on worker 0-0, policy_version 100749 (0.00095) [2022-07-09 05:07:30,611][26022] Updated weights on worker 0-0, policy_version 100759 (0.00098) [2022-07-09 05:07:30,936][25689] Fps is (10 sec: 5730.8, 60 sec: 5684.4, 300 sec: 5694.7). Total num frames: 103178240. Throughput: 0: 5957.0. Samples: 103187778. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:30,937][25689] Avg episode reward: [(0, '-50.863')] [2022-07-09 05:07:32,348][26022] Updated weights on worker 0-0, policy_version 100769 (0.00082) [2022-07-09 05:07:34,192][26022] Updated weights on worker 0-0, policy_version 100779 (0.00087) [2022-07-09 05:07:36,014][25689] Fps is (10 sec: 5687.5, 60 sec: 5685.5, 300 sec: 5696.8). Total num frames: 103207936. Throughput: 0: 5128.4. Samples: 103205050. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:36,015][25689] Avg episode reward: [(0, '-51.840')] [2022-07-09 05:07:36,019][26022] Updated weights on worker 0-0, policy_version 100789 (0.00087) [2022-07-09 05:07:37,659][26022] Updated weights on worker 0-0, policy_version 100799 (0.00085) [2022-07-09 05:07:39,632][26022] Updated weights on worker 0-0, policy_version 100809 (0.00087) [2022-07-09 05:07:41,070][25689] Fps is (10 sec: 5759.9, 60 sec: 5700.2, 300 sec: 5699.3). Total num frames: 103236608. Throughput: 0: 5960.4. Samples: 103239230. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:41,070][25689] Avg episode reward: [(0, '-52.210')] [2022-07-09 05:07:41,200][26022] Updated weights on worker 0-0, policy_version 100819 (0.00092) [2022-07-09 05:07:43,137][26022] Updated weights on worker 0-0, policy_version 100829 (0.00094) [2022-07-09 05:07:45,018][26022] Updated weights on worker 0-0, policy_version 100839 (0.00081) [2022-07-09 05:07:46,100][25689] Fps is (10 sec: 5787.4, 60 sec: 5715.5, 300 sec: 5698.9). Total num frames: 103266304. Throughput: 0: 5947.3. Samples: 103273654. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:46,100][25689] Avg episode reward: [(0, '-51.272')] [2022-07-09 05:07:46,610][26022] Updated weights on worker 0-0, policy_version 100849 (0.00091) [2022-07-09 05:07:48,651][26022] Updated weights on worker 0-0, policy_version 100859 (0.00093) [2022-07-09 05:07:50,158][26022] Updated weights on worker 0-0, policy_version 100869 (0.00091) [2022-07-09 05:07:51,120][25689] Fps is (10 sec: 5603.9, 60 sec: 5663.6, 300 sec: 5687.2). Total num frames: 103292928. Throughput: 0: 5115.8. Samples: 103291034. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:51,120][25689] Avg episode reward: [(0, '-51.251')] [2022-07-09 05:07:51,922][26022] Updated weights on worker 0-0, policy_version 100879 (0.00085) [2022-07-09 05:07:53,962][26022] Updated weights on worker 0-0, policy_version 100889 (0.00088) [2022-07-09 05:07:55,659][26022] Updated weights on worker 0-0, policy_version 100899 (0.00050) [2022-07-09 05:07:56,161][25689] Fps is (10 sec: 5801.2, 60 sec: 5714.7, 300 sec: 5700.9). Total num frames: 103324672. Throughput: 0: 5971.1. Samples: 103325348. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:07:56,161][25689] Avg episode reward: [(0, '-51.238')] [2022-07-09 05:07:57,498][26022] Updated weights on worker 0-0, policy_version 100909 (0.00083) [2022-07-09 05:07:59,159][26022] Updated weights on worker 0-0, policy_version 100919 (0.00092) [2022-07-09 05:08:00,843][26022] Updated weights on worker 0-0, policy_version 100929 (0.00092) [2022-07-09 05:08:01,204][25689] Fps is (10 sec: 5889.8, 60 sec: 5694.4, 300 sec: 5700.5). Total num frames: 103352320. Throughput: 0: 5992.6. Samples: 103359886. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:01,204][25689] Avg episode reward: [(0, '-50.802')] [2022-07-09 05:08:03,280][26022] Updated weights on worker 0-0, policy_version 100939 (0.00086) [2022-07-09 05:08:04,871][26022] Updated weights on worker 0-0, policy_version 100949 (0.00090) [2022-07-09 05:08:06,208][25689] Fps is (10 sec: 5198.0, 60 sec: 5644.3, 300 sec: 5690.6). Total num frames: 103376896. Throughput: 0: 5046.0. Samples: 103375120. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:06,209][25689] Avg episode reward: [(0, '-50.870')] [2022-07-09 05:08:06,784][26022] Updated weights on worker 0-0, policy_version 100959 (0.00092) [2022-07-09 05:08:08,587][26022] Updated weights on worker 0-0, policy_version 100969 (0.00088) [2022-07-09 05:08:10,319][26022] Updated weights on worker 0-0, policy_version 100979 (0.00086) [2022-07-09 05:08:11,289][25689] Fps is (10 sec: 5584.6, 60 sec: 5705.7, 300 sec: 5698.9). Total num frames: 103408640. Throughput: 0: 5873.0. Samples: 103409484. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:11,291][25689] Avg episode reward: [(0, '-50.200')] [2022-07-09 05:08:12,187][26022] Updated weights on worker 0-0, policy_version 100989 (0.00086) [2022-07-09 05:08:13,897][26022] Updated weights on worker 0-0, policy_version 100999 (0.00621) [2022-07-09 05:08:15,851][26022] Updated weights on worker 0-0, policy_version 101009 (0.00087) [2022-07-09 05:08:16,364][25689] Fps is (10 sec: 5747.0, 60 sec: 5674.1, 300 sec: 5691.0). Total num frames: 103435264. Throughput: 0: 5848.6. Samples: 103443508. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:16,365][25689] Avg episode reward: [(0, '-50.168')] [2022-07-09 05:08:17,593][26022] Updated weights on worker 0-0, policy_version 101019 (0.00085) [2022-07-09 05:08:19,288][26022] Updated weights on worker 0-0, policy_version 101029 (0.00088) [2022-07-09 05:08:20,973][26022] Updated weights on worker 0-0, policy_version 101039 (0.00088) [2022-07-09 05:08:21,378][25689] Fps is (10 sec: 5582.3, 60 sec: 5691.9, 300 sec: 5687.8). Total num frames: 103464960. Throughput: 0: 5010.2. Samples: 103460962. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:21,378][25689] Avg episode reward: [(0, '-51.257')] [2022-07-09 05:08:22,811][26022] Updated weights on worker 0-0, policy_version 101049 (0.00082) [2022-07-09 05:08:24,694][26022] Updated weights on worker 0-0, policy_version 101059 (0.00089) [2022-07-09 05:08:26,383][25689] Fps is (10 sec: 5825.9, 60 sec: 5674.7, 300 sec: 5691.3). Total num frames: 103493632. Throughput: 0: 5972.0. Samples: 103495606. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:26,383][25689] Avg episode reward: [(0, '-52.172')] [2022-07-09 05:08:26,390][26022] Updated weights on worker 0-0, policy_version 101069 (0.00088) [2022-07-09 05:08:27,787][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:08:27,796][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000101077_103502848.pth [2022-07-09 05:08:27,797][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000099072_101449728.pth [2022-07-09 05:08:28,216][26022] Updated weights on worker 0-0, policy_version 101079 (0.00087) [2022-07-09 05:08:29,974][26022] Updated weights on worker 0-0, policy_version 101089 (0.00086) [2022-07-09 05:08:31,391][25689] Fps is (10 sec: 5727.0, 60 sec: 5691.3, 300 sec: 5689.1). Total num frames: 103522304. Throughput: 0: 5977.4. Samples: 103529642. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:31,393][25689] Avg episode reward: [(0, '-51.323')] [2022-07-09 05:08:31,959][26022] Updated weights on worker 0-0, policy_version 101099 (0.00415) [2022-07-09 05:08:33,598][26022] Updated weights on worker 0-0, policy_version 101109 (0.00092) [2022-07-09 05:08:35,435][26022] Updated weights on worker 0-0, policy_version 101119 (0.00083) [2022-07-09 05:08:36,535][25689] Fps is (10 sec: 5749.4, 60 sec: 5685.1, 300 sec: 5695.1). Total num frames: 103552000. Throughput: 0: 5116.1. Samples: 103546704. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 05:08:36,535][25689] Avg episode reward: [(0, '-51.751')] [2022-07-09 05:08:37,183][26022] Updated weights on worker 0-0, policy_version 101129 (0.00083) [2022-07-09 05:08:38,956][26022] Updated weights on worker 0-0, policy_version 101139 (0.00597) [2022-07-09 05:08:40,914][26022] Updated weights on worker 0-0, policy_version 101149 (0.00104) [2022-07-09 05:08:41,557][25689] Fps is (10 sec: 5741.4, 60 sec: 5688.2, 300 sec: 5688.3). Total num frames: 103580672. Throughput: 0: 5946.6. Samples: 103580958. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:08:41,557][25689] Avg episode reward: [(0, '-51.930')] [2022-07-09 05:08:42,431][26022] Updated weights on worker 0-0, policy_version 101159 (0.00097) [2022-07-09 05:08:44,433][26022] Updated weights on worker 0-0, policy_version 101169 (0.00095) [2022-07-09 05:08:46,387][26022] Updated weights on worker 0-0, policy_version 101179 (0.00057) [2022-07-09 05:08:46,568][25689] Fps is (10 sec: 5613.4, 60 sec: 5656.1, 300 sec: 5688.9). Total num frames: 103608320. Throughput: 0: 5937.8. Samples: 103615460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:08:46,568][25689] Avg episode reward: [(0, '-52.687')] [2022-07-09 05:08:47,886][26022] Updated weights on worker 0-0, policy_version 101189 (0.00092) [2022-07-09 05:08:49,614][26022] Updated weights on worker 0-0, policy_version 101199 (0.00089) [2022-07-09 05:08:51,585][25689] Fps is (10 sec: 5616.3, 60 sec: 5690.3, 300 sec: 5687.3). Total num frames: 103636992. Throughput: 0: 5109.6. Samples: 103632826. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:08:51,585][25689] Avg episode reward: [(0, '-53.131')] [2022-07-09 05:08:51,679][26022] Updated weights on worker 0-0, policy_version 101209 (0.00087) [2022-07-09 05:08:53,335][26022] Updated weights on worker 0-0, policy_version 101219 (0.00084) [2022-07-09 05:08:55,467][26022] Updated weights on worker 0-0, policy_version 101229 (0.00085) [2022-07-09 05:08:56,710][25689] Fps is (10 sec: 5855.9, 60 sec: 5665.5, 300 sec: 5695.6). Total num frames: 103667712. Throughput: 0: 5950.2. Samples: 103666752. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:08:56,711][25689] Avg episode reward: [(0, '-52.428')] [2022-07-09 05:08:56,809][26022] Updated weights on worker 0-0, policy_version 101239 (0.00088) [2022-07-09 05:08:59,135][26022] Updated weights on worker 0-0, policy_version 101249 (0.00091) [2022-07-09 05:09:00,379][26022] Updated weights on worker 0-0, policy_version 101259 (0.00100) [2022-07-09 05:09:01,715][25689] Fps is (10 sec: 5559.4, 60 sec: 5635.2, 300 sec: 5685.4). Total num frames: 103693312. Throughput: 0: 5960.6. Samples: 103701116. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:01,716][25689] Avg episode reward: [(0, '-52.276')] [2022-07-09 05:09:02,961][26022] Updated weights on worker 0-0, policy_version 101269 (0.00092) [2022-07-09 05:09:04,330][26022] Updated weights on worker 0-0, policy_version 101279 (0.00089) [2022-07-09 05:09:06,507][26022] Updated weights on worker 0-0, policy_version 101289 (0.00088) [2022-07-09 05:09:06,777][25689] Fps is (10 sec: 5492.6, 60 sec: 5714.3, 300 sec: 5691.4). Total num frames: 103723008. Throughput: 0: 4983.1. Samples: 103716166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:06,778][25689] Avg episode reward: [(0, '-52.176')] [2022-07-09 05:09:07,986][26022] Updated weights on worker 0-0, policy_version 101299 (0.00086) [2022-07-09 05:09:09,894][26022] Updated weights on worker 0-0, policy_version 101309 (0.00082) [2022-07-09 05:09:11,431][26022] Updated weights on worker 0-0, policy_version 101319 (0.00100) [2022-07-09 05:09:11,790][25689] Fps is (10 sec: 5793.8, 60 sec: 5670.0, 300 sec: 5693.0). Total num frames: 103751680. Throughput: 0: 5832.8. Samples: 103750676. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:11,790][25689] Avg episode reward: [(0, '-51.484')] [2022-07-09 05:09:13,570][26022] Updated weights on worker 0-0, policy_version 101329 (0.00100) [2022-07-09 05:09:15,030][26022] Updated weights on worker 0-0, policy_version 101339 (0.00091) [2022-07-09 05:09:16,884][25689] Fps is (10 sec: 5572.4, 60 sec: 5685.1, 300 sec: 5685.1). Total num frames: 103779328. Throughput: 0: 5856.3. Samples: 103784900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:16,886][25689] Avg episode reward: [(0, '-51.000')] [2022-07-09 05:09:17,115][26022] Updated weights on worker 0-0, policy_version 101349 (0.00081) [2022-07-09 05:09:18,700][26022] Updated weights on worker 0-0, policy_version 101359 (0.00084) [2022-07-09 05:09:20,615][26022] Updated weights on worker 0-0, policy_version 101369 (0.00109) [2022-07-09 05:09:21,919][25689] Fps is (10 sec: 5661.3, 60 sec: 5683.2, 300 sec: 5689.6). Total num frames: 103809024. Throughput: 0: 5006.0. Samples: 103802254. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:21,920][25689] Avg episode reward: [(0, '-51.252')] [2022-07-09 05:09:22,371][26022] Updated weights on worker 0-0, policy_version 101379 (0.00086) [2022-07-09 05:09:24,267][26022] Updated weights on worker 0-0, policy_version 101389 (0.00095) [2022-07-09 05:09:25,828][26022] Updated weights on worker 0-0, policy_version 101399 (0.00087) [2022-07-09 05:09:26,944][25689] Fps is (10 sec: 5904.3, 60 sec: 5698.2, 300 sec: 5690.7). Total num frames: 103838720. Throughput: 0: 5981.4. Samples: 103836788. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:26,945][25689] Avg episode reward: [(0, '-51.911')] [2022-07-09 05:09:27,917][26022] Updated weights on worker 0-0, policy_version 101409 (0.00091) [2022-07-09 05:09:29,378][26022] Updated weights on worker 0-0, policy_version 101419 (0.00092) [2022-07-09 05:09:31,362][26022] Updated weights on worker 0-0, policy_version 101429 (0.00094) [2022-07-09 05:09:31,951][25689] Fps is (10 sec: 5818.3, 60 sec: 5698.3, 300 sec: 5696.3). Total num frames: 103867392. Throughput: 0: 5982.1. Samples: 103871280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:31,952][25689] Avg episode reward: [(0, '-51.187')] [2022-07-09 05:09:32,835][26022] Updated weights on worker 0-0, policy_version 101439 (0.00086) [2022-07-09 05:09:34,740][26022] Updated weights on worker 0-0, policy_version 101449 (0.00091) [2022-07-09 05:09:36,615][26022] Updated weights on worker 0-0, policy_version 101459 (0.00084) [2022-07-09 05:09:37,017][25689] Fps is (10 sec: 5692.6, 60 sec: 5688.7, 300 sec: 5689.6). Total num frames: 103896064. Throughput: 0: 5152.6. Samples: 103888636. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:37,018][25689] Avg episode reward: [(0, '-51.426')] [2022-07-09 05:09:38,238][26022] Updated weights on worker 0-0, policy_version 101469 (0.00080) [2022-07-09 05:09:40,095][26022] Updated weights on worker 0-0, policy_version 101479 (0.00094) [2022-07-09 05:09:41,936][26022] Updated weights on worker 0-0, policy_version 101489 (0.00089) [2022-07-09 05:09:42,055][25689] Fps is (10 sec: 5675.2, 60 sec: 5687.2, 300 sec: 5690.2). Total num frames: 103924736. Throughput: 0: 6003.6. Samples: 103923144. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:42,055][25689] Avg episode reward: [(0, '-51.623')] [2022-07-09 05:09:43,688][26022] Updated weights on worker 0-0, policy_version 101499 (0.00099) [2022-07-09 05:09:45,726][26022] Updated weights on worker 0-0, policy_version 101509 (0.00094) [2022-07-09 05:09:47,068][25689] Fps is (10 sec: 5807.1, 60 sec: 5720.8, 300 sec: 5695.0). Total num frames: 103954432. Throughput: 0: 6003.8. Samples: 103957612. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:47,069][25689] Avg episode reward: [(0, '-52.515')] [2022-07-09 05:09:47,178][26022] Updated weights on worker 0-0, policy_version 101519 (0.00083) [2022-07-09 05:09:49,220][26022] Updated weights on worker 0-0, policy_version 101529 (0.00086) [2022-07-09 05:09:50,811][26022] Updated weights on worker 0-0, policy_version 101539 (0.00085) [2022-07-09 05:09:52,076][25689] Fps is (10 sec: 5722.5, 60 sec: 5704.8, 300 sec: 5689.9). Total num frames: 103982080. Throughput: 0: 6007.0. Samples: 103992172. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:52,076][25689] Avg episode reward: [(0, '-51.998')] [2022-07-09 05:09:52,617][26022] Updated weights on worker 0-0, policy_version 101549 (0.00087) [2022-07-09 05:09:54,451][26022] Updated weights on worker 0-0, policy_version 101559 (0.00096) [2022-07-09 05:09:56,379][26022] Updated weights on worker 0-0, policy_version 101569 (0.00090) [2022-07-09 05:09:57,135][25689] Fps is (10 sec: 5594.7, 60 sec: 5677.1, 300 sec: 5689.1). Total num frames: 104010752. Throughput: 0: 6000.7. Samples: 104009358. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:09:57,136][25689] Avg episode reward: [(0, '-52.227')] [2022-07-09 05:09:57,936][26022] Updated weights on worker 0-0, policy_version 101579 (0.00090) [2022-07-09 05:09:59,741][26022] Updated weights on worker 0-0, policy_version 101589 (0.00089) [2022-07-09 05:10:01,655][26022] Updated weights on worker 0-0, policy_version 101599 (0.00090) [2022-07-09 05:10:02,140][25689] Fps is (10 sec: 5697.6, 60 sec: 5728.0, 300 sec: 5699.7). Total num frames: 104039424. Throughput: 0: 6009.9. Samples: 104043856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:02,147][25689] Avg episode reward: [(0, '-53.015')] [2022-07-09 05:10:03,768][26022] Updated weights on worker 0-0, policy_version 101609 (0.00091) [2022-07-09 05:10:05,588][26022] Updated weights on worker 0-0, policy_version 101619 (0.00082) [2022-07-09 05:10:07,183][25689] Fps is (10 sec: 5707.1, 60 sec: 5712.9, 300 sec: 5692.5). Total num frames: 104068096. Throughput: 0: 5898.5. Samples: 104076258. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:07,185][25689] Avg episode reward: [(0, '-53.115')] [2022-07-09 05:10:07,187][26022] Updated weights on worker 0-0, policy_version 101629 (0.00089) [2022-07-09 05:10:09,184][26022] Updated weights on worker 0-0, policy_version 101639 (0.00088) [2022-07-09 05:10:11,104][26022] Updated weights on worker 0-0, policy_version 101649 (0.00087) [2022-07-09 05:10:12,208][25689] Fps is (10 sec: 5594.4, 60 sec: 5694.7, 300 sec: 5692.9). Total num frames: 104095744. Throughput: 0: 5030.6. Samples: 104093448. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:12,208][25689] Avg episode reward: [(0, '-53.039')] [2022-07-09 05:10:12,660][26022] Updated weights on worker 0-0, policy_version 101659 (0.00087) [2022-07-09 05:10:14,781][26022] Updated weights on worker 0-0, policy_version 101669 (0.00086) [2022-07-09 05:10:16,297][26022] Updated weights on worker 0-0, policy_version 101679 (0.00083) [2022-07-09 05:10:17,292][25689] Fps is (10 sec: 5571.1, 60 sec: 5712.7, 300 sec: 5688.0). Total num frames: 104124416. Throughput: 0: 5868.1. Samples: 104127642. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:17,294][25689] Avg episode reward: [(0, '-51.733')] [2022-07-09 05:10:18,189][26022] Updated weights on worker 0-0, policy_version 101689 (0.00092) [2022-07-09 05:10:19,714][26022] Updated weights on worker 0-0, policy_version 101699 (0.00082) [2022-07-09 05:10:21,670][26022] Updated weights on worker 0-0, policy_version 101709 (0.00089) [2022-07-09 05:10:22,345][25689] Fps is (10 sec: 5757.9, 60 sec: 5710.9, 300 sec: 5697.4). Total num frames: 104154112. Throughput: 0: 5860.8. Samples: 104162270. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:22,346][25689] Avg episode reward: [(0, '-51.814')] [2022-07-09 05:10:23,514][26022] Updated weights on worker 0-0, policy_version 101719 (0.00088) [2022-07-09 05:10:25,096][26022] Updated weights on worker 0-0, policy_version 101729 (0.00089) [2022-07-09 05:10:27,014][26022] Updated weights on worker 0-0, policy_version 101739 (0.00090) [2022-07-09 05:10:27,376][25689] Fps is (10 sec: 5788.1, 60 sec: 5693.3, 300 sec: 5690.5). Total num frames: 104182784. Throughput: 0: 5124.6. Samples: 104179742. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:27,377][25689] Avg episode reward: [(0, '-52.627')] [2022-07-09 05:10:27,808][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:10:27,818][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000101744_104185856.pth [2022-07-09 05:10:27,818][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000099741_102134784.pth [2022-07-09 05:10:29,004][26022] Updated weights on worker 0-0, policy_version 101749 (0.00089) [2022-07-09 05:10:30,395][26022] Updated weights on worker 0-0, policy_version 101759 (0.00082) [2022-07-09 05:10:32,419][25689] Fps is (10 sec: 5590.7, 60 sec: 5673.1, 300 sec: 5687.1). Total num frames: 104210432. Throughput: 0: 5957.3. Samples: 104213850. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:32,419][25689] Avg episode reward: [(0, '-52.068')] [2022-07-09 05:10:32,545][26022] Updated weights on worker 0-0, policy_version 101769 (0.00089) [2022-07-09 05:10:34,223][26022] Updated weights on worker 0-0, policy_version 101779 (0.00087) [2022-07-09 05:10:35,826][26022] Updated weights on worker 0-0, policy_version 101789 (0.00088) [2022-07-09 05:10:37,474][25689] Fps is (10 sec: 5679.0, 60 sec: 5691.1, 300 sec: 5686.3). Total num frames: 104240128. Throughput: 0: 5981.4. Samples: 104248356. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 05:10:37,474][25689] Avg episode reward: [(0, '-51.523')] [2022-07-09 05:10:37,874][26022] Updated weights on worker 0-0, policy_version 101799 (0.00086) [2022-07-09 05:10:39,347][26022] Updated weights on worker 0-0, policy_version 101809 (0.00090) [2022-07-09 05:10:41,400][26022] Updated weights on worker 0-0, policy_version 101819 (0.00086) [2022-07-09 05:10:42,520][25689] Fps is (10 sec: 5879.6, 60 sec: 5707.2, 300 sec: 5689.2). Total num frames: 104269824. Throughput: 0: 5116.8. Samples: 104265504. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:10:42,520][25689] Avg episode reward: [(0, '-52.205')] [2022-07-09 05:10:42,945][26022] Updated weights on worker 0-0, policy_version 101829 (0.00092) [2022-07-09 05:10:44,951][26022] Updated weights on worker 0-0, policy_version 101839 (0.00085) [2022-07-09 05:10:46,638][26022] Updated weights on worker 0-0, policy_version 101849 (0.00086) [2022-07-09 05:10:47,576][25689] Fps is (10 sec: 5574.8, 60 sec: 5652.4, 300 sec: 5688.8). Total num frames: 104296448. Throughput: 0: 5950.6. Samples: 104299944. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:10:47,578][25689] Avg episode reward: [(0, '-52.997')] [2022-07-09 05:10:48,456][26022] Updated weights on worker 0-0, policy_version 101859 (0.00088) [2022-07-09 05:10:50,387][26022] Updated weights on worker 0-0, policy_version 101869 (0.00087) [2022-07-09 05:10:51,915][26022] Updated weights on worker 0-0, policy_version 101879 (0.00091) [2022-07-09 05:10:52,629][25689] Fps is (10 sec: 5672.5, 60 sec: 5698.9, 300 sec: 5688.4). Total num frames: 104327168. Throughput: 0: 5958.6. Samples: 104334276. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:10:52,630][25689] Avg episode reward: [(0, '-52.818')] [2022-07-09 05:10:53,914][26022] Updated weights on worker 0-0, policy_version 101889 (0.00095) [2022-07-09 05:10:55,472][26022] Updated weights on worker 0-0, policy_version 101899 (0.00098) [2022-07-09 05:10:57,432][26022] Updated weights on worker 0-0, policy_version 101909 (0.00081) [2022-07-09 05:10:57,757][25689] Fps is (10 sec: 5833.6, 60 sec: 5692.4, 300 sec: 5686.2). Total num frames: 104355840. Throughput: 0: 5084.0. Samples: 104351478. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:10:57,758][25689] Avg episode reward: [(0, '-52.337')] [2022-07-09 05:10:59,248][26022] Updated weights on worker 0-0, policy_version 101919 (0.00082) [2022-07-09 05:11:00,990][26022] Updated weights on worker 0-0, policy_version 101929 (0.00093) [2022-07-09 05:11:02,792][25689] Fps is (10 sec: 5642.3, 60 sec: 5689.6, 300 sec: 5696.3). Total num frames: 104384512. Throughput: 0: 5945.4. Samples: 104386030. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:02,793][25689] Avg episode reward: [(0, '-52.871')] [2022-07-09 05:11:02,982][26022] Updated weights on worker 0-0, policy_version 101939 (0.00085) [2022-07-09 05:11:05,173][26022] Updated weights on worker 0-0, policy_version 101949 (0.00087) [2022-07-09 05:11:06,582][26022] Updated weights on worker 0-0, policy_version 101959 (0.00093) [2022-07-09 05:11:07,831][25689] Fps is (10 sec: 5489.4, 60 sec: 5656.3, 300 sec: 5689.3). Total num frames: 104411136. Throughput: 0: 5852.8. Samples: 104418488. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:07,831][25689] Avg episode reward: [(0, '-52.520')] [2022-07-09 05:11:08,618][26022] Updated weights on worker 0-0, policy_version 101969 (0.00087) [2022-07-09 05:11:10,276][26022] Updated weights on worker 0-0, policy_version 101979 (0.00088) [2022-07-09 05:11:12,096][26022] Updated weights on worker 0-0, policy_version 101989 (0.00087) [2022-07-09 05:11:12,876][25689] Fps is (10 sec: 5686.8, 60 sec: 5705.0, 300 sec: 5693.3). Total num frames: 104441856. Throughput: 0: 5001.7. Samples: 104435546. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:12,876][25689] Avg episode reward: [(0, '-51.761')] [2022-07-09 05:11:13,814][26022] Updated weights on worker 0-0, policy_version 101999 (0.00087) [2022-07-09 05:11:15,684][26022] Updated weights on worker 0-0, policy_version 102009 (0.00088) [2022-07-09 05:11:17,587][26022] Updated weights on worker 0-0, policy_version 102019 (0.00096) [2022-07-09 05:11:17,972][25689] Fps is (10 sec: 5654.4, 60 sec: 5670.1, 300 sec: 5684.9). Total num frames: 104468480. Throughput: 0: 5858.7. Samples: 104469910. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:17,973][25689] Avg episode reward: [(0, '-51.838')] [2022-07-09 05:11:19,248][26022] Updated weights on worker 0-0, policy_version 102029 (0.00097) [2022-07-09 05:11:21,167][26022] Updated weights on worker 0-0, policy_version 102039 (0.00085) [2022-07-09 05:11:22,987][25689] Fps is (10 sec: 5469.1, 60 sec: 5656.8, 300 sec: 5681.2). Total num frames: 104497152. Throughput: 0: 5851.3. Samples: 104504194. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:22,987][25689] Avg episode reward: [(0, '-51.419')] [2022-07-09 05:11:23,036][26022] Updated weights on worker 0-0, policy_version 102049 (0.00086) [2022-07-09 05:11:24,733][26022] Updated weights on worker 0-0, policy_version 102059 (0.00091) [2022-07-09 05:11:26,599][26022] Updated weights on worker 0-0, policy_version 102069 (0.00081) [2022-07-09 05:11:28,012][25689] Fps is (10 sec: 5813.8, 60 sec: 5674.3, 300 sec: 5687.7). Total num frames: 104526848. Throughput: 0: 5098.0. Samples: 104521370. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:28,012][25689] Avg episode reward: [(0, '-51.005')] [2022-07-09 05:11:28,191][26022] Updated weights on worker 0-0, policy_version 102079 (0.00095) [2022-07-09 05:11:30,212][26022] Updated weights on worker 0-0, policy_version 102089 (0.00090) [2022-07-09 05:11:31,836][26022] Updated weights on worker 0-0, policy_version 102099 (0.00093) [2022-07-09 05:11:33,020][25689] Fps is (10 sec: 5817.2, 60 sec: 5694.4, 300 sec: 5685.8). Total num frames: 104555520. Throughput: 0: 5961.4. Samples: 104555634. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:33,021][25689] Avg episode reward: [(0, '-50.780')] [2022-07-09 05:11:33,734][26022] Updated weights on worker 0-0, policy_version 102109 (0.00089) [2022-07-09 05:11:35,414][26022] Updated weights on worker 0-0, policy_version 102119 (0.00090) [2022-07-09 05:11:37,267][26022] Updated weights on worker 0-0, policy_version 102129 (0.00088) [2022-07-09 05:11:38,081][25689] Fps is (10 sec: 5593.4, 60 sec: 5660.1, 300 sec: 5685.2). Total num frames: 104583168. Throughput: 0: 5975.4. Samples: 104590066. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:38,081][25689] Avg episode reward: [(0, '-50.661')] [2022-07-09 05:11:38,986][26022] Updated weights on worker 0-0, policy_version 102139 (0.00102) [2022-07-09 05:11:40,714][26022] Updated weights on worker 0-0, policy_version 102149 (0.00085) [2022-07-09 05:11:42,633][26022] Updated weights on worker 0-0, policy_version 102159 (0.00088) [2022-07-09 05:11:43,105][25689] Fps is (10 sec: 5787.7, 60 sec: 5679.0, 300 sec: 5691.9). Total num frames: 104613888. Throughput: 0: 5124.1. Samples: 104607282. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:43,106][25689] Avg episode reward: [(0, '-50.217')] [2022-07-09 05:11:44,324][26022] Updated weights on worker 0-0, policy_version 102169 (0.00094) [2022-07-09 05:11:46,100][26022] Updated weights on worker 0-0, policy_version 102179 (0.00087) [2022-07-09 05:11:47,939][26022] Updated weights on worker 0-0, policy_version 102189 (0.00087) [2022-07-09 05:11:48,147][25689] Fps is (10 sec: 5900.1, 60 sec: 5714.2, 300 sec: 5687.8). Total num frames: 104642560. Throughput: 0: 5987.3. Samples: 104641926. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:48,147][25689] Avg episode reward: [(0, '-50.853')] [2022-07-09 05:11:49,634][26022] Updated weights on worker 0-0, policy_version 102199 (0.00095) [2022-07-09 05:11:51,572][26022] Updated weights on worker 0-0, policy_version 102209 (0.00089) [2022-07-09 05:11:52,998][26022] Updated weights on worker 0-0, policy_version 102219 (0.00089) [2022-07-09 05:11:53,155][25689] Fps is (10 sec: 5807.7, 60 sec: 5701.5, 300 sec: 5691.9). Total num frames: 104672256. Throughput: 0: 6006.6. Samples: 104676576. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:53,156][25689] Avg episode reward: [(0, '-51.662')] [2022-07-09 05:11:55,176][26022] Updated weights on worker 0-0, policy_version 102229 (0.00081) [2022-07-09 05:11:56,675][26022] Updated weights on worker 0-0, policy_version 102239 (0.00089) [2022-07-09 05:11:58,201][25689] Fps is (10 sec: 5601.6, 60 sec: 5675.4, 300 sec: 5684.3). Total num frames: 104698880. Throughput: 0: 5152.9. Samples: 104693746. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:11:58,203][25689] Avg episode reward: [(0, '-52.415')] [2022-07-09 05:11:58,711][26022] Updated weights on worker 0-0, policy_version 102249 (0.00087) [2022-07-09 05:12:00,268][26022] Updated weights on worker 0-0, policy_version 102259 (0.00090) [2022-07-09 05:12:02,525][26022] Updated weights on worker 0-0, policy_version 102269 (0.00083) [2022-07-09 05:12:03,213][25689] Fps is (10 sec: 5395.8, 60 sec: 5660.5, 300 sec: 5684.3). Total num frames: 104726528. Throughput: 0: 6005.1. Samples: 104728036. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:03,214][25689] Avg episode reward: [(0, '-52.676')] [2022-07-09 05:12:04,511][26022] Updated weights on worker 0-0, policy_version 102279 (0.00053) [2022-07-09 05:12:06,164][26022] Updated weights on worker 0-0, policy_version 102289 (0.00095) [2022-07-09 05:12:08,079][26022] Updated weights on worker 0-0, policy_version 102299 (0.00092) [2022-07-09 05:12:08,239][25689] Fps is (10 sec: 5610.9, 60 sec: 5695.7, 300 sec: 5687.5). Total num frames: 104755200. Throughput: 0: 5887.3. Samples: 104760214. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:08,240][25689] Avg episode reward: [(0, '-53.932')] [2022-07-09 05:12:09,623][26022] Updated weights on worker 0-0, policy_version 102309 (0.00056) [2022-07-09 05:12:11,666][26022] Updated weights on worker 0-0, policy_version 102319 (0.00094) [2022-07-09 05:12:13,302][25689] Fps is (10 sec: 5684.2, 60 sec: 5660.1, 300 sec: 5688.2). Total num frames: 104783872. Throughput: 0: 5004.8. Samples: 104777406. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:13,303][25689] Avg episode reward: [(0, '-53.630')] [2022-07-09 05:12:13,394][26022] Updated weights on worker 0-0, policy_version 102329 (0.00092) [2022-07-09 05:12:15,099][26022] Updated weights on worker 0-0, policy_version 102339 (0.00092) [2022-07-09 05:12:16,935][26022] Updated weights on worker 0-0, policy_version 102349 (0.00083) [2022-07-09 05:12:18,428][25689] Fps is (10 sec: 5828.9, 60 sec: 5725.0, 300 sec: 5693.1). Total num frames: 104814592. Throughput: 0: 5817.1. Samples: 104811408. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:18,429][25689] Avg episode reward: [(0, '-53.780')] [2022-07-09 05:12:18,836][26022] Updated weights on worker 0-0, policy_version 102359 (0.00089) [2022-07-09 05:12:20,647][26022] Updated weights on worker 0-0, policy_version 102369 (0.00084) [2022-07-09 05:12:22,505][26022] Updated weights on worker 0-0, policy_version 102379 (0.00090) [2022-07-09 05:12:23,442][25689] Fps is (10 sec: 5553.9, 60 sec: 5674.2, 300 sec: 5679.1). Total num frames: 104840192. Throughput: 0: 5821.6. Samples: 104845800. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:23,443][25689] Avg episode reward: [(0, '-53.306')] [2022-07-09 05:12:24,021][26022] Updated weights on worker 0-0, policy_version 102389 (0.00085) [2022-07-09 05:12:26,062][26022] Updated weights on worker 0-0, policy_version 102399 (0.00093) [2022-07-09 05:12:27,624][26022] Updated weights on worker 0-0, policy_version 102409 (0.00091) [2022-07-09 05:12:27,883][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:12:27,894][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000102411_104868864.pth [2022-07-09 05:12:27,894][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000100409_102818816.pth [2022-07-09 05:12:28,445][25689] Fps is (10 sec: 5622.6, 60 sec: 5693.3, 300 sec: 5689.5). Total num frames: 104870912. Throughput: 0: 5097.7. Samples: 104863218. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:28,445][25689] Avg episode reward: [(0, '-52.854')] [2022-07-09 05:12:29,626][26022] Updated weights on worker 0-0, policy_version 102419 (0.00092) [2022-07-09 05:12:31,199][26022] Updated weights on worker 0-0, policy_version 102429 (0.00085) [2022-07-09 05:12:33,003][26022] Updated weights on worker 0-0, policy_version 102439 (0.00079) [2022-07-09 05:12:33,496][25689] Fps is (10 sec: 5907.5, 60 sec: 5689.3, 300 sec: 5686.6). Total num frames: 104899584. Throughput: 0: 5961.9. Samples: 104897802. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 05:12:33,497][25689] Avg episode reward: [(0, '-51.596')] [2022-07-09 05:12:34,651][26022] Updated weights on worker 0-0, policy_version 102449 (0.00086) [2022-07-09 05:12:36,604][26022] Updated weights on worker 0-0, policy_version 102459 (0.00087) [2022-07-09 05:12:38,435][26022] Updated weights on worker 0-0, policy_version 102469 (0.00084) [2022-07-09 05:12:38,636][25689] Fps is (10 sec: 5727.3, 60 sec: 5715.6, 300 sec: 5688.4). Total num frames: 104929280. Throughput: 0: 5996.8. Samples: 104932590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:12:38,637][25689] Avg episode reward: [(0, '-52.007')] [2022-07-09 05:12:40,090][26022] Updated weights on worker 0-0, policy_version 102479 (0.00100) [2022-07-09 05:12:41,879][26022] Updated weights on worker 0-0, policy_version 102489 (0.00088) [2022-07-09 05:12:43,533][26022] Updated weights on worker 0-0, policy_version 102499 (0.00087) [2022-07-09 05:12:43,731][25689] Fps is (10 sec: 5802.6, 60 sec: 5692.0, 300 sec: 5687.1). Total num frames: 104958976. Throughput: 0: 5982.7. Samples: 104967184. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:12:43,732][25689] Avg episode reward: [(0, '-51.840')] [2022-07-09 05:12:45,415][26022] Updated weights on worker 0-0, policy_version 102509 (0.00075) [2022-07-09 05:12:47,330][26022] Updated weights on worker 0-0, policy_version 102519 (0.00090) [2022-07-09 05:12:48,770][25689] Fps is (10 sec: 5860.7, 60 sec: 5709.3, 300 sec: 5697.1). Total num frames: 104988672. Throughput: 0: 5970.6. Samples: 104984570. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:12:48,771][25689] Avg episode reward: [(0, '-51.806')] [2022-07-09 05:12:48,924][26022] Updated weights on worker 0-0, policy_version 102529 (0.00087) [2022-07-09 05:12:50,812][26022] Updated weights on worker 0-0, policy_version 102539 (0.00097) [2022-07-09 05:12:52,420][26022] Updated weights on worker 0-0, policy_version 102549 (0.00088) [2022-07-09 05:12:53,857][25689] Fps is (10 sec: 5764.5, 60 sec: 5685.0, 300 sec: 5685.9). Total num frames: 105017344. Throughput: 0: 5981.5. Samples: 105019590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:12:53,857][25689] Avg episode reward: [(0, '-52.293')] [2022-07-09 05:12:54,268][26022] Updated weights on worker 0-0, policy_version 102559 (0.00092) [2022-07-09 05:12:56,000][26022] Updated weights on worker 0-0, policy_version 102569 (0.00091) [2022-07-09 05:12:57,792][26022] Updated weights on worker 0-0, policy_version 102579 (0.00092) [2022-07-09 05:12:58,921][25689] Fps is (10 sec: 5649.3, 60 sec: 5717.1, 300 sec: 5689.0). Total num frames: 105046016. Throughput: 0: 5998.6. Samples: 105054268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:12:58,921][25689] Avg episode reward: [(0, '-52.709')] [2022-07-09 05:12:59,589][26022] Updated weights on worker 0-0, policy_version 102589 (0.00085) [2022-07-09 05:13:01,404][26022] Updated weights on worker 0-0, policy_version 102599 (0.00089) [2022-07-09 05:13:03,588][26022] Updated weights on worker 0-0, policy_version 102609 (0.00098) [2022-07-09 05:13:03,926][25689] Fps is (10 sec: 5695.0, 60 sec: 5734.6, 300 sec: 5702.7). Total num frames: 105074688. Throughput: 0: 5124.3. Samples: 105070668. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:03,927][25689] Avg episode reward: [(0, '-52.695')] [2022-07-09 05:13:05,395][26022] Updated weights on worker 0-0, policy_version 102619 (0.00410) [2022-07-09 05:13:06,979][26022] Updated weights on worker 0-0, policy_version 102629 (0.00086) [2022-07-09 05:13:08,941][25689] Fps is (10 sec: 5518.6, 60 sec: 5701.9, 300 sec: 5686.8). Total num frames: 105101312. Throughput: 0: 5915.6. Samples: 105103892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:08,942][25689] Avg episode reward: [(0, '-52.676')] [2022-07-09 05:13:09,075][26022] Updated weights on worker 0-0, policy_version 102639 (0.00089) [2022-07-09 05:13:10,585][26022] Updated weights on worker 0-0, policy_version 102649 (0.00091) [2022-07-09 05:13:12,398][26022] Updated weights on worker 0-0, policy_version 102659 (0.00087) [2022-07-09 05:13:13,978][25689] Fps is (10 sec: 5603.1, 60 sec: 5721.1, 300 sec: 5697.8). Total num frames: 105131008. Throughput: 0: 5905.9. Samples: 105138422. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:13,978][25689] Avg episode reward: [(0, '-52.160')] [2022-07-09 05:13:14,208][26022] Updated weights on worker 0-0, policy_version 102669 (0.00083) [2022-07-09 05:13:15,942][26022] Updated weights on worker 0-0, policy_version 102679 (0.00093) [2022-07-09 05:13:17,773][26022] Updated weights on worker 0-0, policy_version 102689 (0.00087) [2022-07-09 05:13:19,117][25689] Fps is (10 sec: 5936.8, 60 sec: 5719.9, 300 sec: 5698.8). Total num frames: 105161728. Throughput: 0: 5017.2. Samples: 105155598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:19,119][25689] Avg episode reward: [(0, '-51.449')] [2022-07-09 05:13:19,672][26022] Updated weights on worker 0-0, policy_version 102699 (0.00098) [2022-07-09 05:13:21,416][26022] Updated weights on worker 0-0, policy_version 102709 (0.00084) [2022-07-09 05:13:23,076][26022] Updated weights on worker 0-0, policy_version 102719 (0.00082) [2022-07-09 05:13:24,135][25689] Fps is (10 sec: 5645.8, 60 sec: 5736.5, 300 sec: 5691.7). Total num frames: 105188352. Throughput: 0: 5909.7. Samples: 105190094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:24,136][25689] Avg episode reward: [(0, '-51.445')] [2022-07-09 05:13:25,037][26022] Updated weights on worker 0-0, policy_version 102729 (0.00086) [2022-07-09 05:13:26,541][26022] Updated weights on worker 0-0, policy_version 102739 (0.00083) [2022-07-09 05:13:28,442][26022] Updated weights on worker 0-0, policy_version 102749 (0.00092) [2022-07-09 05:13:29,144][25689] Fps is (10 sec: 5616.7, 60 sec: 5718.9, 300 sec: 5695.1). Total num frames: 105218048. Throughput: 0: 5981.1. Samples: 105224734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:29,146][25689] Avg episode reward: [(0, '-52.040')] [2022-07-09 05:13:30,164][26022] Updated weights on worker 0-0, policy_version 102759 (0.00093) [2022-07-09 05:13:32,006][26022] Updated weights on worker 0-0, policy_version 102769 (0.00079) [2022-07-09 05:13:33,694][26022] Updated weights on worker 0-0, policy_version 102779 (0.00085) [2022-07-09 05:13:34,159][25689] Fps is (10 sec: 5924.7, 60 sec: 5739.2, 300 sec: 5697.6). Total num frames: 105247744. Throughput: 0: 5143.1. Samples: 105242216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:34,161][25689] Avg episode reward: [(0, '-51.575')] [2022-07-09 05:13:35,587][26022] Updated weights on worker 0-0, policy_version 102789 (0.00088) [2022-07-09 05:13:37,169][26022] Updated weights on worker 0-0, policy_version 102799 (0.00085) [2022-07-09 05:13:39,074][26022] Updated weights on worker 0-0, policy_version 102809 (0.00083) [2022-07-09 05:13:39,302][25689] Fps is (10 sec: 5847.2, 60 sec: 5739.0, 300 sec: 5698.7). Total num frames: 105277440. Throughput: 0: 6026.4. Samples: 105277238. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:39,302][25689] Avg episode reward: [(0, '-51.164')] [2022-07-09 05:13:40,778][26022] Updated weights on worker 0-0, policy_version 102819 (0.00086) [2022-07-09 05:13:42,540][26022] Updated weights on worker 0-0, policy_version 102829 (0.00077) [2022-07-09 05:13:44,276][26022] Updated weights on worker 0-0, policy_version 102839 (0.00100) [2022-07-09 05:13:44,366][25689] Fps is (10 sec: 5818.8, 60 sec: 5741.9, 300 sec: 5704.6). Total num frames: 105307136. Throughput: 0: 6034.4. Samples: 105312178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:44,367][25689] Avg episode reward: [(0, '-51.334')] [2022-07-09 05:13:46,149][26022] Updated weights on worker 0-0, policy_version 102849 (0.00376) [2022-07-09 05:13:47,769][26022] Updated weights on worker 0-0, policy_version 102859 (0.00091) [2022-07-09 05:13:49,379][25689] Fps is (10 sec: 5792.2, 60 sec: 5727.5, 300 sec: 5704.7). Total num frames: 105335808. Throughput: 0: 5188.1. Samples: 105329712. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:49,379][25689] Avg episode reward: [(0, '-51.621')] [2022-07-09 05:13:49,542][26022] Updated weights on worker 0-0, policy_version 102869 (0.00095) [2022-07-09 05:13:51,478][26022] Updated weights on worker 0-0, policy_version 102879 (0.00090) [2022-07-09 05:13:53,122][26022] Updated weights on worker 0-0, policy_version 102889 (0.00097) [2022-07-09 05:13:54,392][25689] Fps is (10 sec: 5821.6, 60 sec: 5751.4, 300 sec: 5703.4). Total num frames: 105365504. Throughput: 0: 6030.9. Samples: 105364240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:54,393][25689] Avg episode reward: [(0, '-50.913')] [2022-07-09 05:13:55,122][26022] Updated weights on worker 0-0, policy_version 102899 (0.00666) [2022-07-09 05:13:56,758][26022] Updated weights on worker 0-0, policy_version 102909 (0.00097) [2022-07-09 05:13:58,639][26022] Updated weights on worker 0-0, policy_version 102919 (0.00092) [2022-07-09 05:13:59,475][25689] Fps is (10 sec: 5780.9, 60 sec: 5749.5, 300 sec: 5712.2). Total num frames: 105394176. Throughput: 0: 6001.7. Samples: 105398314. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:13:59,476][25689] Avg episode reward: [(0, '-51.318')] [2022-07-09 05:14:00,377][26022] Updated weights on worker 0-0, policy_version 102929 (0.00086) [2022-07-09 05:14:02,659][26022] Updated weights on worker 0-0, policy_version 102939 (0.00086) [2022-07-09 05:14:04,362][26022] Updated weights on worker 0-0, policy_version 102949 (0.00087) [2022-07-09 05:14:04,534][25689] Fps is (10 sec: 5351.2, 60 sec: 5693.8, 300 sec: 5698.5). Total num frames: 105419776. Throughput: 0: 5054.8. Samples: 105414124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:04,535][25689] Avg episode reward: [(0, '-51.970')] [2022-07-09 05:14:06,114][26022] Updated weights on worker 0-0, policy_version 102959 (0.00088) [2022-07-09 05:14:07,965][26022] Updated weights on worker 0-0, policy_version 102969 (0.00083) [2022-07-09 05:14:09,547][25689] Fps is (10 sec: 5490.4, 60 sec: 5744.6, 300 sec: 5701.9). Total num frames: 105449472. Throughput: 0: 5867.5. Samples: 105448048. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:09,547][25689] Avg episode reward: [(0, '-52.194')] [2022-07-09 05:14:09,620][26022] Updated weights on worker 0-0, policy_version 102979 (0.00095) [2022-07-09 05:14:11,389][26022] Updated weights on worker 0-0, policy_version 102989 (0.00086) [2022-07-09 05:14:13,328][26022] Updated weights on worker 0-0, policy_version 102999 (0.00087) [2022-07-09 05:14:14,569][25689] Fps is (10 sec: 5816.3, 60 sec: 5729.1, 300 sec: 5706.8). Total num frames: 105478144. Throughput: 0: 5868.3. Samples: 105482646. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:14,571][25689] Avg episode reward: [(0, '-52.848')] [2022-07-09 05:14:14,841][26022] Updated weights on worker 0-0, policy_version 103009 (0.00086) [2022-07-09 05:14:16,870][26022] Updated weights on worker 0-0, policy_version 103019 (0.00103) [2022-07-09 05:14:18,461][26022] Updated weights on worker 0-0, policy_version 103029 (0.00089) [2022-07-09 05:14:19,650][25689] Fps is (10 sec: 5675.6, 60 sec: 5700.8, 300 sec: 5702.4). Total num frames: 105506816. Throughput: 0: 5894.1. Samples: 105517226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:19,652][25689] Avg episode reward: [(0, '-52.471')] [2022-07-09 05:14:20,256][26022] Updated weights on worker 0-0, policy_version 103039 (0.00092) [2022-07-09 05:14:22,167][26022] Updated weights on worker 0-0, policy_version 103049 (0.00086) [2022-07-09 05:14:23,859][26022] Updated weights on worker 0-0, policy_version 103059 (0.00083) [2022-07-09 05:14:24,690][25689] Fps is (10 sec: 5767.4, 60 sec: 5749.5, 300 sec: 5702.1). Total num frames: 105536512. Throughput: 0: 5960.6. Samples: 105534264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:24,690][25689] Avg episode reward: [(0, '-52.638')] [2022-07-09 05:14:25,734][26022] Updated weights on worker 0-0, policy_version 103069 (0.00093) [2022-07-09 05:14:27,652][26022] Updated weights on worker 0-0, policy_version 103079 (0.00082) [2022-07-09 05:14:28,043][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:14:28,057][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000103082_105555968.pth [2022-07-09 05:14:28,057][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000101077_103502848.pth [2022-07-09 05:14:29,400][26022] Updated weights on worker 0-0, policy_version 103089 (0.00083) [2022-07-09 05:14:29,717][25689] Fps is (10 sec: 5798.3, 60 sec: 5730.9, 300 sec: 5701.8). Total num frames: 105565184. Throughput: 0: 5993.4. Samples: 105568934. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:29,717][25689] Avg episode reward: [(0, '-53.211')] [2022-07-09 05:14:31,024][26022] Updated weights on worker 0-0, policy_version 103099 (0.00078) [2022-07-09 05:14:32,966][26022] Updated weights on worker 0-0, policy_version 103109 (0.00088) [2022-07-09 05:14:34,520][26022] Updated weights on worker 0-0, policy_version 103119 (0.00087) [2022-07-09 05:14:34,778][25689] Fps is (10 sec: 5684.2, 60 sec: 5709.6, 300 sec: 5701.9). Total num frames: 105593856. Throughput: 0: 5968.9. Samples: 105603268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 05:14:34,779][25689] Avg episode reward: [(0, '-53.063')] [2022-07-09 05:14:36,449][26022] Updated weights on worker 0-0, policy_version 103129 (0.00085) [2022-07-09 05:14:38,180][26022] Updated weights on worker 0-0, policy_version 103139 (0.00085) [2022-07-09 05:14:39,824][25689] Fps is (10 sec: 5673.5, 60 sec: 5701.8, 300 sec: 5701.7). Total num frames: 105622528. Throughput: 0: 5116.7. Samples: 105620450. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:14:39,825][25689] Avg episode reward: [(0, '-52.948')] [2022-07-09 05:14:40,159][26022] Updated weights on worker 0-0, policy_version 103149 (0.00095) [2022-07-09 05:14:41,773][26022] Updated weights on worker 0-0, policy_version 103159 (0.00086) [2022-07-09 05:14:43,516][26022] Updated weights on worker 0-0, policy_version 103169 (0.00084) [2022-07-09 05:14:44,832][25689] Fps is (10 sec: 5703.7, 60 sec: 5690.2, 300 sec: 5698.4). Total num frames: 105651200. Throughput: 0: 6000.4. Samples: 105655126. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:14:44,832][25689] Avg episode reward: [(0, '-53.490')] [2022-07-09 05:14:45,348][26022] Updated weights on worker 0-0, policy_version 103179 (0.00093) [2022-07-09 05:14:46,956][26022] Updated weights on worker 0-0, policy_version 103189 (0.00615) [2022-07-09 05:14:48,905][26022] Updated weights on worker 0-0, policy_version 103199 (0.00090) [2022-07-09 05:14:49,871][25689] Fps is (10 sec: 5809.7, 60 sec: 5704.7, 300 sec: 5704.6). Total num frames: 105680896. Throughput: 0: 5972.8. Samples: 105689310. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:14:49,871][25689] Avg episode reward: [(0, '-53.454')] [2022-07-09 05:14:50,622][26022] Updated weights on worker 0-0, policy_version 103209 (0.00094) [2022-07-09 05:14:52,565][26022] Updated weights on worker 0-0, policy_version 103219 (0.00080) [2022-07-09 05:14:54,350][26022] Updated weights on worker 0-0, policy_version 103229 (0.00097) [2022-07-09 05:14:54,881][25689] Fps is (10 sec: 5706.5, 60 sec: 5671.1, 300 sec: 5702.2). Total num frames: 105708544. Throughput: 0: 5124.8. Samples: 105706290. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:14:54,881][25689] Avg episode reward: [(0, '-52.887')] [2022-07-09 05:14:56,119][26022] Updated weights on worker 0-0, policy_version 103239 (0.00088) [2022-07-09 05:14:57,991][26022] Updated weights on worker 0-0, policy_version 103249 (0.00094) [2022-07-09 05:14:59,670][26022] Updated weights on worker 0-0, policy_version 103259 (0.00101) [2022-07-09 05:14:59,975][25689] Fps is (10 sec: 5675.5, 60 sec: 5687.1, 300 sec: 5703.9). Total num frames: 105738240. Throughput: 0: 5964.7. Samples: 105740642. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:14:59,975][25689] Avg episode reward: [(0, '-52.936')] [2022-07-09 05:15:01,654][26022] Updated weights on worker 0-0, policy_version 103269 (0.00093) [2022-07-09 05:15:03,635][26022] Updated weights on worker 0-0, policy_version 103279 (0.00085) [2022-07-09 05:15:05,015][25689] Fps is (10 sec: 5557.6, 60 sec: 5705.8, 300 sec: 5697.1). Total num frames: 105764864. Throughput: 0: 5847.6. Samples: 105773146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:05,015][25689] Avg episode reward: [(0, '-52.764')] [2022-07-09 05:15:05,493][26022] Updated weights on worker 0-0, policy_version 103289 (0.00098) [2022-07-09 05:15:07,099][26022] Updated weights on worker 0-0, policy_version 103299 (0.00093) [2022-07-09 05:15:09,106][26022] Updated weights on worker 0-0, policy_version 103309 (0.00088) [2022-07-09 05:15:10,027][25689] Fps is (10 sec: 5500.9, 60 sec: 5688.9, 300 sec: 5700.7). Total num frames: 105793536. Throughput: 0: 5017.3. Samples: 105790438. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:10,027][25689] Avg episode reward: [(0, '-52.183')] [2022-07-09 05:15:10,798][26022] Updated weights on worker 0-0, policy_version 103319 (0.00090) [2022-07-09 05:15:12,456][26022] Updated weights on worker 0-0, policy_version 103329 (0.00092) [2022-07-09 05:15:14,290][26022] Updated weights on worker 0-0, policy_version 103339 (0.00100) [2022-07-09 05:15:15,060][25689] Fps is (10 sec: 5708.5, 60 sec: 5687.9, 300 sec: 5701.7). Total num frames: 105822208. Throughput: 0: 5887.7. Samples: 105825100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:15,061][25689] Avg episode reward: [(0, '-51.908')] [2022-07-09 05:15:16,056][26022] Updated weights on worker 0-0, policy_version 103349 (0.00086) [2022-07-09 05:15:18,020][26022] Updated weights on worker 0-0, policy_version 103359 (0.00083) [2022-07-09 05:15:19,567][26022] Updated weights on worker 0-0, policy_version 103369 (0.00085) [2022-07-09 05:15:20,121][25689] Fps is (10 sec: 5782.3, 60 sec: 5706.7, 300 sec: 5701.6). Total num frames: 105851904. Throughput: 0: 5893.3. Samples: 105859370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:20,121][25689] Avg episode reward: [(0, '-51.382')] [2022-07-09 05:15:21,487][26022] Updated weights on worker 0-0, policy_version 103379 (0.00477) [2022-07-09 05:15:23,180][26022] Updated weights on worker 0-0, policy_version 103389 (0.00106) [2022-07-09 05:15:25,046][26022] Updated weights on worker 0-0, policy_version 103399 (0.00089) [2022-07-09 05:15:25,202][25689] Fps is (10 sec: 5755.1, 60 sec: 5685.9, 300 sec: 5700.6). Total num frames: 105880576. Throughput: 0: 5128.7. Samples: 105876680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:25,203][25689] Avg episode reward: [(0, '-51.737')] [2022-07-09 05:15:26,930][26022] Updated weights on worker 0-0, policy_version 103409 (0.00085) [2022-07-09 05:15:28,664][26022] Updated weights on worker 0-0, policy_version 103419 (0.00086) [2022-07-09 05:15:30,297][25689] Fps is (10 sec: 5635.0, 60 sec: 5679.4, 300 sec: 5703.0). Total num frames: 105909248. Throughput: 0: 5951.8. Samples: 105911084. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:30,298][25689] Avg episode reward: [(0, '-51.697')] [2022-07-09 05:15:30,532][26022] Updated weights on worker 0-0, policy_version 103429 (0.00090) [2022-07-09 05:15:32,203][26022] Updated weights on worker 0-0, policy_version 103439 (0.00098) [2022-07-09 05:15:34,128][26022] Updated weights on worker 0-0, policy_version 103449 (0.00087) [2022-07-09 05:15:35,319][25689] Fps is (10 sec: 5668.3, 60 sec: 5683.2, 300 sec: 5700.3). Total num frames: 105937920. Throughput: 0: 5938.8. Samples: 105945412. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:35,319][25689] Avg episode reward: [(0, '-51.395')] [2022-07-09 05:15:35,643][26022] Updated weights on worker 0-0, policy_version 103459 (0.00090) [2022-07-09 05:15:37,550][26022] Updated weights on worker 0-0, policy_version 103469 (0.00083) [2022-07-09 05:15:39,352][26022] Updated weights on worker 0-0, policy_version 103479 (0.00082) [2022-07-09 05:15:40,363][25689] Fps is (10 sec: 5798.9, 60 sec: 5700.3, 300 sec: 5700.3). Total num frames: 105967616. Throughput: 0: 5108.3. Samples: 105962770. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:40,363][25689] Avg episode reward: [(0, '-51.300')] [2022-07-09 05:15:41,253][26022] Updated weights on worker 0-0, policy_version 103489 (0.00086) [2022-07-09 05:15:42,820][26022] Updated weights on worker 0-0, policy_version 103499 (0.00087) [2022-07-09 05:15:44,903][26022] Updated weights on worker 0-0, policy_version 103509 (0.00088) [2022-07-09 05:15:45,375][25689] Fps is (10 sec: 5804.3, 60 sec: 5699.9, 300 sec: 5708.0). Total num frames: 105996288. Throughput: 0: 5985.4. Samples: 105997422. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:45,375][25689] Avg episode reward: [(0, '-51.474')] [2022-07-09 05:15:46,332][26022] Updated weights on worker 0-0, policy_version 103519 (0.00083) [2022-07-09 05:15:48,503][26022] Updated weights on worker 0-0, policy_version 103529 (0.00090) [2022-07-09 05:15:49,873][26022] Updated weights on worker 0-0, policy_version 103539 (0.00094) [2022-07-09 05:15:50,391][25689] Fps is (10 sec: 5820.4, 60 sec: 5702.0, 300 sec: 5705.3). Total num frames: 106025984. Throughput: 0: 6006.8. Samples: 106031782. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:50,391][25689] Avg episode reward: [(0, '-51.955')] [2022-07-09 05:15:52,104][26022] Updated weights on worker 0-0, policy_version 103549 (0.00090) [2022-07-09 05:15:53,602][26022] Updated weights on worker 0-0, policy_version 103559 (0.00082) [2022-07-09 05:15:55,407][25689] Fps is (10 sec: 5716.1, 60 sec: 5701.5, 300 sec: 5704.0). Total num frames: 106053632. Throughput: 0: 5140.5. Samples: 106048674. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:15:55,408][25689] Avg episode reward: [(0, '-52.495')] [2022-07-09 05:15:55,467][26022] Updated weights on worker 0-0, policy_version 103569 (0.00087) [2022-07-09 05:15:57,292][26022] Updated weights on worker 0-0, policy_version 103579 (0.00085) [2022-07-09 05:15:59,276][26022] Updated weights on worker 0-0, policy_version 103589 (0.00096) [2022-07-09 05:16:00,510][25689] Fps is (10 sec: 5565.5, 60 sec: 5683.7, 300 sec: 5702.7). Total num frames: 106082304. Throughput: 0: 5952.1. Samples: 106082690. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:00,511][25689] Avg episode reward: [(0, '-53.030')] [2022-07-09 05:16:00,970][26022] Updated weights on worker 0-0, policy_version 103599 (0.00117) [2022-07-09 05:16:03,154][26022] Updated weights on worker 0-0, policy_version 103609 (0.00083) [2022-07-09 05:16:04,860][26022] Updated weights on worker 0-0, policy_version 103619 (0.00089) [2022-07-09 05:16:05,567][25689] Fps is (10 sec: 5543.1, 60 sec: 5699.0, 300 sec: 5705.8). Total num frames: 106109952. Throughput: 0: 5825.4. Samples: 106115050. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:05,567][25689] Avg episode reward: [(0, '-53.108')] [2022-07-09 05:16:06,618][26022] Updated weights on worker 0-0, policy_version 103629 (0.00091) [2022-07-09 05:16:08,390][26022] Updated weights on worker 0-0, policy_version 103639 (0.00098) [2022-07-09 05:16:10,144][26022] Updated weights on worker 0-0, policy_version 103649 (0.00084) [2022-07-09 05:16:10,585][25689] Fps is (10 sec: 5488.7, 60 sec: 5681.6, 300 sec: 5696.0). Total num frames: 106137600. Throughput: 0: 4986.1. Samples: 106132470. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:10,585][25689] Avg episode reward: [(0, '-53.182')] [2022-07-09 05:16:11,964][26022] Updated weights on worker 0-0, policy_version 103659 (0.00082) [2022-07-09 05:16:13,701][26022] Updated weights on worker 0-0, policy_version 103669 (0.00086) [2022-07-09 05:16:15,520][26022] Updated weights on worker 0-0, policy_version 103679 (0.00088) [2022-07-09 05:16:15,594][25689] Fps is (10 sec: 5719.2, 60 sec: 5700.8, 300 sec: 5708.0). Total num frames: 106167296. Throughput: 0: 5861.3. Samples: 106166996. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:15,594][25689] Avg episode reward: [(0, '-52.048')] [2022-07-09 05:16:17,231][26022] Updated weights on worker 0-0, policy_version 103689 (0.00086) [2022-07-09 05:16:19,099][26022] Updated weights on worker 0-0, policy_version 103699 (0.00085) [2022-07-09 05:16:20,644][25689] Fps is (10 sec: 5802.2, 60 sec: 5684.8, 300 sec: 5707.3). Total num frames: 106195968. Throughput: 0: 5917.9. Samples: 106201842. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:20,645][25689] Avg episode reward: [(0, '-52.087')] [2022-07-09 05:16:20,828][26022] Updated weights on worker 0-0, policy_version 103709 (0.00091) [2022-07-09 05:16:22,596][26022] Updated weights on worker 0-0, policy_version 103719 (0.00090) [2022-07-09 05:16:24,345][26022] Updated weights on worker 0-0, policy_version 103729 (0.00088) [2022-07-09 05:16:25,655][25689] Fps is (10 sec: 5699.3, 60 sec: 5691.4, 300 sec: 5704.1). Total num frames: 106224640. Throughput: 0: 5177.8. Samples: 106219062. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:25,656][25689] Avg episode reward: [(0, '-51.256')] [2022-07-09 05:16:26,291][26022] Updated weights on worker 0-0, policy_version 103739 (0.00099) [2022-07-09 05:16:28,018][26022] Updated weights on worker 0-0, policy_version 103749 (0.00074) [2022-07-09 05:16:28,282][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:16:28,293][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000103750_106240000.pth [2022-07-09 05:16:28,293][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000101744_104185856.pth [2022-07-09 05:16:29,825][26022] Updated weights on worker 0-0, policy_version 103759 (0.00053) [2022-07-09 05:16:30,661][25689] Fps is (10 sec: 5725.0, 60 sec: 5699.9, 300 sec: 5704.2). Total num frames: 106253312. Throughput: 0: 6004.7. Samples: 106253022. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:30,663][25689] Avg episode reward: [(0, '-50.572')] [2022-07-09 05:16:31,879][26022] Updated weights on worker 0-0, policy_version 103769 (0.00097) [2022-07-09 05:16:33,396][26022] Updated weights on worker 0-0, policy_version 103779 (0.00084) [2022-07-09 05:16:35,320][26022] Updated weights on worker 0-0, policy_version 103789 (0.00090) [2022-07-09 05:16:35,672][25689] Fps is (10 sec: 5724.7, 60 sec: 5700.8, 300 sec: 5708.6). Total num frames: 106281984. Throughput: 0: 5985.9. Samples: 106287184. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:35,674][25689] Avg episode reward: [(0, '-51.055')] [2022-07-09 05:16:37,154][26022] Updated weights on worker 0-0, policy_version 103799 (0.00084) [2022-07-09 05:16:38,832][26022] Updated weights on worker 0-0, policy_version 103809 (0.00091) [2022-07-09 05:16:40,722][25689] Fps is (10 sec: 5597.6, 60 sec: 5666.3, 300 sec: 5697.7). Total num frames: 106309632. Throughput: 0: 5097.7. Samples: 106304194. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:40,724][25689] Avg episode reward: [(0, '-51.495')] [2022-07-09 05:16:40,866][26022] Updated weights on worker 0-0, policy_version 103819 (0.00526) [2022-07-09 05:16:42,372][26022] Updated weights on worker 0-0, policy_version 103829 (0.00092) [2022-07-09 05:16:44,208][26022] Updated weights on worker 0-0, policy_version 103839 (0.00091) [2022-07-09 05:16:45,748][25689] Fps is (10 sec: 5691.4, 60 sec: 5682.0, 300 sec: 5701.5). Total num frames: 106339328. Throughput: 0: 5961.2. Samples: 106338836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:45,748][25689] Avg episode reward: [(0, '-52.232')] [2022-07-09 05:16:45,974][26022] Updated weights on worker 0-0, policy_version 103849 (0.00084) [2022-07-09 05:16:47,556][26022] Updated weights on worker 0-0, policy_version 103859 (0.00087) [2022-07-09 05:16:49,540][26022] Updated weights on worker 0-0, policy_version 103869 (0.00054) [2022-07-09 05:16:50,776][25689] Fps is (10 sec: 6009.0, 60 sec: 5697.8, 300 sec: 5704.5). Total num frames: 106370048. Throughput: 0: 6007.7. Samples: 106373870. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:50,777][25689] Avg episode reward: [(0, '-51.848')] [2022-07-09 05:16:51,203][26022] Updated weights on worker 0-0, policy_version 103879 (0.00086) [2022-07-09 05:16:53,198][26022] Updated weights on worker 0-0, policy_version 103889 (0.00085) [2022-07-09 05:16:54,906][26022] Updated weights on worker 0-0, policy_version 103899 (0.00086) [2022-07-09 05:16:55,778][25689] Fps is (10 sec: 5818.8, 60 sec: 5699.1, 300 sec: 5708.8). Total num frames: 106397696. Throughput: 0: 5170.3. Samples: 106391140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:16:55,780][25689] Avg episode reward: [(0, '-52.371')] [2022-07-09 05:16:56,469][26022] Updated weights on worker 0-0, policy_version 103909 (0.00089) [2022-07-09 05:16:58,485][26022] Updated weights on worker 0-0, policy_version 103919 (0.00088) [2022-07-09 05:17:00,241][26022] Updated weights on worker 0-0, policy_version 103929 (0.00090) [2022-07-09 05:17:00,843][25689] Fps is (10 sec: 5594.9, 60 sec: 5702.8, 300 sec: 5711.3). Total num frames: 106426368. Throughput: 0: 6030.5. Samples: 106425530. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:00,843][25689] Avg episode reward: [(0, '-51.686')] [2022-07-09 05:17:02,239][26022] Updated weights on worker 0-0, policy_version 103939 (0.00093) [2022-07-09 05:17:04,102][26022] Updated weights on worker 0-0, policy_version 103949 (0.00842) [2022-07-09 05:17:05,727][26022] Updated weights on worker 0-0, policy_version 103959 (0.00085) [2022-07-09 05:17:05,873][25689] Fps is (10 sec: 5680.5, 60 sec: 5722.3, 300 sec: 5711.2). Total num frames: 106455040. Throughput: 0: 5950.2. Samples: 106458588. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:05,874][25689] Avg episode reward: [(0, '-51.558')] [2022-07-09 05:17:07,696][26022] Updated weights on worker 0-0, policy_version 103969 (0.00087) [2022-07-09 05:17:09,299][26022] Updated weights on worker 0-0, policy_version 103979 (0.00083) [2022-07-09 05:17:10,905][25689] Fps is (10 sec: 5699.2, 60 sec: 5737.9, 300 sec: 5711.8). Total num frames: 106483712. Throughput: 0: 5068.4. Samples: 106475888. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:10,905][25689] Avg episode reward: [(0, '-51.897')] [2022-07-09 05:17:11,025][26022] Updated weights on worker 0-0, policy_version 103989 (0.00082) [2022-07-09 05:17:12,923][26022] Updated weights on worker 0-0, policy_version 103999 (0.00086) [2022-07-09 05:17:14,874][26022] Updated weights on worker 0-0, policy_version 104009 (0.00090) [2022-07-09 05:17:15,910][25689] Fps is (10 sec: 5611.4, 60 sec: 5704.3, 300 sec: 5703.8). Total num frames: 106511360. Throughput: 0: 5933.9. Samples: 106510598. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:15,911][25689] Avg episode reward: [(0, '-50.950')] [2022-07-09 05:17:16,257][26022] Updated weights on worker 0-0, policy_version 104019 (0.00091) [2022-07-09 05:17:18,429][26022] Updated weights on worker 0-0, policy_version 104029 (0.00087) [2022-07-09 05:17:19,843][26022] Updated weights on worker 0-0, policy_version 104039 (0.00096) [2022-07-09 05:17:21,000][25689] Fps is (10 sec: 5578.9, 60 sec: 5700.6, 300 sec: 5712.6). Total num frames: 106540032. Throughput: 0: 5918.6. Samples: 106544832. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:21,000][25689] Avg episode reward: [(0, '-51.101')] [2022-07-09 05:17:21,779][26022] Updated weights on worker 0-0, policy_version 104049 (0.00089) [2022-07-09 05:17:23,501][26022] Updated weights on worker 0-0, policy_version 104059 (0.00092) [2022-07-09 05:17:25,575][26022] Updated weights on worker 0-0, policy_version 104069 (0.00097) [2022-07-09 05:17:26,007][25689] Fps is (10 sec: 5679.2, 60 sec: 5700.9, 300 sec: 5705.7). Total num frames: 106568704. Throughput: 0: 5978.6. Samples: 106578960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:26,008][25689] Avg episode reward: [(0, '-50.866')] [2022-07-09 05:17:27,244][26022] Updated weights on worker 0-0, policy_version 104079 (0.00095) [2022-07-09 05:17:29,194][26022] Updated weights on worker 0-0, policy_version 104089 (0.00115) [2022-07-09 05:17:30,641][26022] Updated weights on worker 0-0, policy_version 104099 (0.00086) [2022-07-09 05:17:31,050][25689] Fps is (10 sec: 5909.2, 60 sec: 5731.3, 300 sec: 5712.7). Total num frames: 106599424. Throughput: 0: 5978.0. Samples: 106596320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:31,051][25689] Avg episode reward: [(0, '-51.596')] [2022-07-09 05:17:32,671][26022] Updated weights on worker 0-0, policy_version 104109 (0.00086) [2022-07-09 05:17:34,226][26022] Updated weights on worker 0-0, policy_version 104119 (0.00092) [2022-07-09 05:17:36,068][25689] Fps is (10 sec: 5801.4, 60 sec: 5713.7, 300 sec: 5708.2). Total num frames: 106627072. Throughput: 0: 5983.4. Samples: 106631212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:36,069][25689] Avg episode reward: [(0, '-52.233')] [2022-07-09 05:17:36,249][26022] Updated weights on worker 0-0, policy_version 104129 (0.00085) [2022-07-09 05:17:37,695][26022] Updated weights on worker 0-0, policy_version 104139 (0.00090) [2022-07-09 05:17:39,703][26022] Updated weights on worker 0-0, policy_version 104149 (0.00074) [2022-07-09 05:17:41,192][25689] Fps is (10 sec: 5755.0, 60 sec: 5757.5, 300 sec: 5711.1). Total num frames: 106657792. Throughput: 0: 5987.1. Samples: 106665728. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:41,193][25689] Avg episode reward: [(0, '-52.096')] [2022-07-09 05:17:41,281][26022] Updated weights on worker 0-0, policy_version 104159 (0.00091) [2022-07-09 05:17:43,443][26022] Updated weights on worker 0-0, policy_version 104169 (0.00087) [2022-07-09 05:17:44,886][26022] Updated weights on worker 0-0, policy_version 104179 (0.00097) [2022-07-09 05:17:46,227][25689] Fps is (10 sec: 5745.7, 60 sec: 5722.8, 300 sec: 5704.2). Total num frames: 106685440. Throughput: 0: 5150.9. Samples: 106683108. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:46,227][25689] Avg episode reward: [(0, '-53.014')] [2022-07-09 05:17:46,879][26022] Updated weights on worker 0-0, policy_version 104189 (0.00085) [2022-07-09 05:17:48,408][26022] Updated weights on worker 0-0, policy_version 104199 (0.00089) [2022-07-09 05:17:50,255][26022] Updated weights on worker 0-0, policy_version 104209 (0.00078) [2022-07-09 05:17:51,325][25689] Fps is (10 sec: 5659.6, 60 sec: 5699.4, 300 sec: 5707.5). Total num frames: 106715136. Throughput: 0: 5988.5. Samples: 106717732. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:51,325][25689] Avg episode reward: [(0, '-53.303')] [2022-07-09 05:17:52,031][26022] Updated weights on worker 0-0, policy_version 104219 (0.00084) [2022-07-09 05:17:53,902][26022] Updated weights on worker 0-0, policy_version 104229 (0.00112) [2022-07-09 05:17:55,612][26022] Updated weights on worker 0-0, policy_version 104239 (0.00090) [2022-07-09 05:17:56,372][25689] Fps is (10 sec: 5753.1, 60 sec: 5712.0, 300 sec: 5707.8). Total num frames: 106743808. Throughput: 0: 5956.2. Samples: 106752146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:17:56,373][25689] Avg episode reward: [(0, '-53.173')] [2022-07-09 05:17:57,364][26022] Updated weights on worker 0-0, policy_version 104249 (0.00085) [2022-07-09 05:17:59,126][26022] Updated weights on worker 0-0, policy_version 104259 (0.00090) [2022-07-09 05:18:01,043][26022] Updated weights on worker 0-0, policy_version 104269 (0.00091) [2022-07-09 05:18:01,495][25689] Fps is (10 sec: 5638.4, 60 sec: 5706.5, 300 sec: 5705.5). Total num frames: 106772480. Throughput: 0: 5103.4. Samples: 106769340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:01,495][25689] Avg episode reward: [(0, '-53.320')] [2022-07-09 05:18:03,136][26022] Updated weights on worker 0-0, policy_version 104279 (0.00094) [2022-07-09 05:18:04,933][26022] Updated weights on worker 0-0, policy_version 104289 (0.00090) [2022-07-09 05:18:06,509][25689] Fps is (10 sec: 5556.3, 60 sec: 5691.2, 300 sec: 5709.0). Total num frames: 106800128. Throughput: 0: 5849.9. Samples: 106801754. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:06,510][25689] Avg episode reward: [(0, '-52.361')] [2022-07-09 05:18:06,759][26022] Updated weights on worker 0-0, policy_version 104299 (0.00084) [2022-07-09 05:18:08,693][26022] Updated weights on worker 0-0, policy_version 104309 (0.00083) [2022-07-09 05:18:10,124][26022] Updated weights on worker 0-0, policy_version 104319 (0.00090) [2022-07-09 05:18:11,531][25689] Fps is (10 sec: 5611.5, 60 sec: 5692.0, 300 sec: 5705.8). Total num frames: 106828800. Throughput: 0: 5870.8. Samples: 106836362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:11,533][25689] Avg episode reward: [(0, '-51.194')] [2022-07-09 05:18:12,171][26022] Updated weights on worker 0-0, policy_version 104329 (0.00080) [2022-07-09 05:18:13,864][26022] Updated weights on worker 0-0, policy_version 104339 (0.00091) [2022-07-09 05:18:15,614][26022] Updated weights on worker 0-0, policy_version 104349 (0.00090) [2022-07-09 05:18:16,565][25689] Fps is (10 sec: 5702.2, 60 sec: 5706.2, 300 sec: 5701.0). Total num frames: 106857472. Throughput: 0: 5029.1. Samples: 106853698. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:16,565][25689] Avg episode reward: [(0, '-51.151')] [2022-07-09 05:18:17,495][26022] Updated weights on worker 0-0, policy_version 104359 (0.00086) [2022-07-09 05:18:18,959][26022] Updated weights on worker 0-0, policy_version 104369 (0.00092) [2022-07-09 05:18:21,038][26022] Updated weights on worker 0-0, policy_version 104379 (0.00087) [2022-07-09 05:18:21,667][25689] Fps is (10 sec: 5960.6, 60 sec: 5755.7, 300 sec: 5716.6). Total num frames: 106889216. Throughput: 0: 5915.5. Samples: 106888670. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:21,669][25689] Avg episode reward: [(0, '-50.671')] [2022-07-09 05:18:22,488][26022] Updated weights on worker 0-0, policy_version 104389 (0.00092) [2022-07-09 05:18:24,434][26022] Updated weights on worker 0-0, policy_version 104399 (0.00084) [2022-07-09 05:18:26,540][26022] Updated weights on worker 0-0, policy_version 104409 (0.00080) [2022-07-09 05:18:26,726][25689] Fps is (10 sec: 5744.4, 60 sec: 5717.1, 300 sec: 5705.3). Total num frames: 106915840. Throughput: 0: 5985.2. Samples: 106922758. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:26,726][25689] Avg episode reward: [(0, '-50.744')] [2022-07-09 05:18:28,121][26022] Updated weights on worker 0-0, policy_version 104419 (0.00086) [2022-07-09 05:18:28,367][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:18:28,378][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000104421_106927104.pth [2022-07-09 05:18:28,378][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000102411_104868864.pth [2022-07-09 05:18:29,925][26022] Updated weights on worker 0-0, policy_version 104429 (0.00089) [2022-07-09 05:18:31,696][26022] Updated weights on worker 0-0, policy_version 104439 (0.00086) [2022-07-09 05:18:31,793][25689] Fps is (10 sec: 5562.3, 60 sec: 5698.0, 300 sec: 5704.3). Total num frames: 106945536. Throughput: 0: 5122.3. Samples: 106940146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:31,793][25689] Avg episode reward: [(0, '-49.798')] [2022-07-09 05:18:33,382][26022] Updated weights on worker 0-0, policy_version 104449 (0.00095) [2022-07-09 05:18:35,331][26022] Updated weights on worker 0-0, policy_version 104459 (0.00085) [2022-07-09 05:18:36,856][25689] Fps is (10 sec: 5863.1, 60 sec: 5727.5, 300 sec: 5705.8). Total num frames: 106975232. Throughput: 0: 5958.1. Samples: 106974592. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 05:18:36,856][25689] Avg episode reward: [(0, '-50.351')] [2022-07-09 05:18:36,942][26022] Updated weights on worker 0-0, policy_version 104469 (0.00087) [2022-07-09 05:18:38,854][26022] Updated weights on worker 0-0, policy_version 104479 (0.00080) [2022-07-09 05:18:40,594][26022] Updated weights on worker 0-0, policy_version 104489 (0.00085) [2022-07-09 05:18:41,911][25689] Fps is (10 sec: 5667.3, 60 sec: 5683.4, 300 sec: 5699.1). Total num frames: 107002880. Throughput: 0: 5939.3. Samples: 107008904. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:18:41,912][25689] Avg episode reward: [(0, '-50.916')] [2022-07-09 05:18:42,419][26022] Updated weights on worker 0-0, policy_version 104499 (0.00100) [2022-07-09 05:18:44,397][26022] Updated weights on worker 0-0, policy_version 104509 (0.00084) [2022-07-09 05:18:45,995][26022] Updated weights on worker 0-0, policy_version 104519 (0.00082) [2022-07-09 05:18:46,920][25689] Fps is (10 sec: 5494.0, 60 sec: 5685.7, 300 sec: 5695.7). Total num frames: 107030528. Throughput: 0: 5116.0. Samples: 107026074. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:18:46,921][25689] Avg episode reward: [(0, '-50.479')] [2022-07-09 05:18:47,776][26022] Updated weights on worker 0-0, policy_version 104529 (0.00080) [2022-07-09 05:18:49,565][26022] Updated weights on worker 0-0, policy_version 104539 (0.00084) [2022-07-09 05:18:51,383][26022] Updated weights on worker 0-0, policy_version 104549 (0.00096) [2022-07-09 05:18:51,939][25689] Fps is (10 sec: 5616.5, 60 sec: 5676.3, 300 sec: 5692.2). Total num frames: 107059200. Throughput: 0: 5971.8. Samples: 107060454. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:18:51,939][25689] Avg episode reward: [(0, '-50.732')] [2022-07-09 05:18:53,219][26022] Updated weights on worker 0-0, policy_version 104559 (0.00086) [2022-07-09 05:18:55,096][26022] Updated weights on worker 0-0, policy_version 104569 (0.00093) [2022-07-09 05:18:56,755][26022] Updated weights on worker 0-0, policy_version 104579 (0.00090) [2022-07-09 05:18:56,980][25689] Fps is (10 sec: 5904.2, 60 sec: 5710.7, 300 sec: 5699.9). Total num frames: 107089920. Throughput: 0: 5953.8. Samples: 107094406. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:18:56,980][25689] Avg episode reward: [(0, '-50.614')] [2022-07-09 05:18:58,632][26022] Updated weights on worker 0-0, policy_version 104589 (0.00089) [2022-07-09 05:19:00,354][26022] Updated weights on worker 0-0, policy_version 104599 (0.00093) [2022-07-09 05:19:02,122][25689] Fps is (10 sec: 5732.0, 60 sec: 5692.0, 300 sec: 5705.2). Total num frames: 107117568. Throughput: 0: 5085.6. Samples: 107111686. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:02,122][25689] Avg episode reward: [(0, '-50.819')] [2022-07-09 05:19:02,672][26022] Updated weights on worker 0-0, policy_version 104609 (0.00106) [2022-07-09 05:19:04,305][26022] Updated weights on worker 0-0, policy_version 104619 (0.00083) [2022-07-09 05:19:06,240][26022] Updated weights on worker 0-0, policy_version 104629 (0.00084) [2022-07-09 05:19:07,143][25689] Fps is (10 sec: 5440.8, 60 sec: 5691.3, 300 sec: 5698.1). Total num frames: 107145216. Throughput: 0: 5841.9. Samples: 107144212. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:07,143][25689] Avg episode reward: [(0, '-50.373')] [2022-07-09 05:19:07,750][26022] Updated weights on worker 0-0, policy_version 104639 (0.00087) [2022-07-09 05:19:09,674][26022] Updated weights on worker 0-0, policy_version 104649 (0.00090) [2022-07-09 05:19:11,254][26022] Updated weights on worker 0-0, policy_version 104659 (0.00523) [2022-07-09 05:19:12,227][25689] Fps is (10 sec: 5674.8, 60 sec: 5702.5, 300 sec: 5700.4). Total num frames: 107174912. Throughput: 0: 5838.0. Samples: 107178894. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:12,228][25689] Avg episode reward: [(0, '-50.558')] [2022-07-09 05:19:13,421][26022] Updated weights on worker 0-0, policy_version 104669 (0.00089) [2022-07-09 05:19:14,827][26022] Updated weights on worker 0-0, policy_version 104679 (0.00094) [2022-07-09 05:19:16,981][26022] Updated weights on worker 0-0, policy_version 104689 (0.00096) [2022-07-09 05:19:17,255][25689] Fps is (10 sec: 5771.8, 60 sec: 5702.9, 300 sec: 5701.4). Total num frames: 107203584. Throughput: 0: 5016.0. Samples: 107196110. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:17,256][25689] Avg episode reward: [(0, '-50.362')] [2022-07-09 05:19:18,505][26022] Updated weights on worker 0-0, policy_version 104699 (0.00084) [2022-07-09 05:19:20,391][26022] Updated weights on worker 0-0, policy_version 104709 (0.00325) [2022-07-09 05:19:22,125][26022] Updated weights on worker 0-0, policy_version 104719 (0.00087) [2022-07-09 05:19:22,322][25689] Fps is (10 sec: 5680.5, 60 sec: 5655.7, 300 sec: 5697.5). Total num frames: 107232256. Throughput: 0: 5891.0. Samples: 107230684. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:22,322][25689] Avg episode reward: [(0, '-50.451')] [2022-07-09 05:19:23,856][26022] Updated weights on worker 0-0, policy_version 104729 (0.00096) [2022-07-09 05:19:25,809][26022] Updated weights on worker 0-0, policy_version 104739 (0.00085) [2022-07-09 05:19:27,404][25689] Fps is (10 sec: 5751.6, 60 sec: 5704.1, 300 sec: 5699.8). Total num frames: 107261952. Throughput: 0: 5977.5. Samples: 107265320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:27,404][25689] Avg episode reward: [(0, '-50.593')] [2022-07-09 05:19:27,523][26022] Updated weights on worker 0-0, policy_version 104749 (0.00094) [2022-07-09 05:19:29,519][26022] Updated weights on worker 0-0, policy_version 104759 (0.00098) [2022-07-09 05:19:31,019][26022] Updated weights on worker 0-0, policy_version 104769 (0.00087) [2022-07-09 05:19:32,429][25689] Fps is (10 sec: 5774.5, 60 sec: 5691.1, 300 sec: 5700.5). Total num frames: 107290624. Throughput: 0: 5982.7. Samples: 107299762. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:32,431][25689] Avg episode reward: [(0, '-50.369')] [2022-07-09 05:19:32,688][26022] Updated weights on worker 0-0, policy_version 104779 (0.00088) [2022-07-09 05:19:34,418][26022] Updated weights on worker 0-0, policy_version 104789 (0.00085) [2022-07-09 05:19:36,236][26022] Updated weights on worker 0-0, policy_version 104799 (0.00432) [2022-07-09 05:19:37,515][25689] Fps is (10 sec: 5772.7, 60 sec: 5689.0, 300 sec: 5703.2). Total num frames: 107320320. Throughput: 0: 5980.1. Samples: 107317262. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:37,515][25689] Avg episode reward: [(0, '-51.403')] [2022-07-09 05:19:38,247][26022] Updated weights on worker 0-0, policy_version 104809 (0.00092) [2022-07-09 05:19:39,997][26022] Updated weights on worker 0-0, policy_version 104819 (0.00080) [2022-07-09 05:19:41,667][26022] Updated weights on worker 0-0, policy_version 104829 (0.00086) [2022-07-09 05:19:42,583][25689] Fps is (10 sec: 5748.7, 60 sec: 5704.7, 300 sec: 5702.1). Total num frames: 107348992. Throughput: 0: 5983.1. Samples: 107351908. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:42,583][25689] Avg episode reward: [(0, '-50.853')] [2022-07-09 05:19:43,428][26022] Updated weights on worker 0-0, policy_version 104839 (0.00086) [2022-07-09 05:19:45,276][26022] Updated weights on worker 0-0, policy_version 104849 (0.00085) [2022-07-09 05:19:47,016][26022] Updated weights on worker 0-0, policy_version 104859 (0.00092) [2022-07-09 05:19:47,625][25689] Fps is (10 sec: 5773.3, 60 sec: 5735.4, 300 sec: 5702.0). Total num frames: 107378688. Throughput: 0: 5989.6. Samples: 107386436. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:47,625][25689] Avg episode reward: [(0, '-50.942')] [2022-07-09 05:19:49,047][26022] Updated weights on worker 0-0, policy_version 104869 (0.00093) [2022-07-09 05:19:50,591][26022] Updated weights on worker 0-0, policy_version 104879 (0.00092) [2022-07-09 05:19:52,473][26022] Updated weights on worker 0-0, policy_version 104889 (0.00086) [2022-07-09 05:19:52,632][25689] Fps is (10 sec: 5808.2, 60 sec: 5736.4, 300 sec: 5705.5). Total num frames: 107407360. Throughput: 0: 5151.6. Samples: 107403836. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:52,632][25689] Avg episode reward: [(0, '-51.593')] [2022-07-09 05:19:53,945][26022] Updated weights on worker 0-0, policy_version 104899 (0.00084) [2022-07-09 05:19:55,944][26022] Updated weights on worker 0-0, policy_version 104909 (0.00095) [2022-07-09 05:19:57,691][25689] Fps is (10 sec: 5696.8, 60 sec: 5701.0, 300 sec: 5702.7). Total num frames: 107436032. Throughput: 0: 6008.1. Samples: 107438482. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:19:57,691][25689] Avg episode reward: [(0, '-52.012')] [2022-07-09 05:19:57,747][26022] Updated weights on worker 0-0, policy_version 104919 (0.00082) [2022-07-09 05:19:59,622][26022] Updated weights on worker 0-0, policy_version 104929 (0.00092) [2022-07-09 05:20:01,308][26022] Updated weights on worker 0-0, policy_version 104939 (0.00089) [2022-07-09 05:20:02,774][25689] Fps is (10 sec: 5451.9, 60 sec: 5689.6, 300 sec: 5701.9). Total num frames: 107462656. Throughput: 0: 5898.5. Samples: 107471010. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:02,775][25689] Avg episode reward: [(0, '-52.386')] [2022-07-09 05:20:03,610][26022] Updated weights on worker 0-0, policy_version 104949 (0.00096) [2022-07-09 05:20:05,256][26022] Updated weights on worker 0-0, policy_version 104959 (0.00083) [2022-07-09 05:20:07,396][26022] Updated weights on worker 0-0, policy_version 104969 (0.00090) [2022-07-09 05:20:07,874][25689] Fps is (10 sec: 5530.7, 60 sec: 5716.0, 300 sec: 5703.7). Total num frames: 107492352. Throughput: 0: 4988.3. Samples: 107487444. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:07,874][25689] Avg episode reward: [(0, '-51.899')] [2022-07-09 05:20:08,705][26022] Updated weights on worker 0-0, policy_version 104979 (0.00094) [2022-07-09 05:20:10,730][26022] Updated weights on worker 0-0, policy_version 104989 (0.00087) [2022-07-09 05:20:12,454][26022] Updated weights on worker 0-0, policy_version 104999 (0.00090) [2022-07-09 05:20:12,889][25689] Fps is (10 sec: 5770.5, 60 sec: 5705.5, 300 sec: 5704.0). Total num frames: 107521024. Throughput: 0: 5838.6. Samples: 107522112. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:12,890][25689] Avg episode reward: [(0, '-52.631')] [2022-07-09 05:20:14,262][26022] Updated weights on worker 0-0, policy_version 105009 (0.00087) [2022-07-09 05:20:15,906][26022] Updated weights on worker 0-0, policy_version 105019 (0.00084) [2022-07-09 05:20:17,677][26022] Updated weights on worker 0-0, policy_version 105029 (0.00092) [2022-07-09 05:20:17,898][25689] Fps is (10 sec: 5720.5, 60 sec: 5707.4, 300 sec: 5701.6). Total num frames: 107549696. Throughput: 0: 5889.5. Samples: 107557492. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:17,899][25689] Avg episode reward: [(0, '-53.171')] [2022-07-09 05:20:19,385][26022] Updated weights on worker 0-0, policy_version 105039 (0.00093) [2022-07-09 05:20:21,406][26022] Updated weights on worker 0-0, policy_version 105049 (0.00098) [2022-07-09 05:20:22,756][26022] Updated weights on worker 0-0, policy_version 105059 (0.00090) [2022-07-09 05:20:22,991][25689] Fps is (10 sec: 5981.2, 60 sec: 5755.6, 300 sec: 5711.7). Total num frames: 107581440. Throughput: 0: 5144.9. Samples: 107575020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:22,991][25689] Avg episode reward: [(0, '-51.929')] [2022-07-09 05:20:24,495][26022] Updated weights on worker 0-0, policy_version 105069 (0.00088) [2022-07-09 05:20:26,590][26022] Updated weights on worker 0-0, policy_version 105079 (0.00089) [2022-07-09 05:20:28,005][25689] Fps is (10 sec: 5977.8, 60 sec: 5745.1, 300 sec: 5713.2). Total num frames: 107610112. Throughput: 0: 6087.0. Samples: 107609980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:28,005][25689] Avg episode reward: [(0, '-52.648')] [2022-07-09 05:20:28,022][26022] Updated weights on worker 0-0, policy_version 105089 (0.00095) [2022-07-09 05:20:28,402][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:20:28,435][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000105091_107613184.pth [2022-07-09 05:20:28,435][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000103082_105555968.pth [2022-07-09 05:20:30,101][26022] Updated weights on worker 0-0, policy_version 105099 (0.00086) [2022-07-09 05:20:31,507][26022] Updated weights on worker 0-0, policy_version 105109 (0.00090) [2022-07-09 05:20:33,011][25689] Fps is (10 sec: 5620.7, 60 sec: 5730.1, 300 sec: 5710.1). Total num frames: 107637760. Throughput: 0: 6073.1. Samples: 107644310. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:33,011][25689] Avg episode reward: [(0, '-52.375')] [2022-07-09 05:20:33,525][26022] Updated weights on worker 0-0, policy_version 105119 (0.00090) [2022-07-09 05:20:35,336][26022] Updated weights on worker 0-0, policy_version 105129 (0.00090) [2022-07-09 05:20:37,120][26022] Updated weights on worker 0-0, policy_version 105139 (0.00082) [2022-07-09 05:20:38,044][25689] Fps is (10 sec: 5711.9, 60 sec: 5735.0, 300 sec: 5710.3). Total num frames: 107667456. Throughput: 0: 5171.4. Samples: 107661676. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 05:20:38,045][25689] Avg episode reward: [(0, '-52.199')] [2022-07-09 05:20:38,927][26022] Updated weights on worker 0-0, policy_version 105149 (0.00083) [2022-07-09 05:20:40,738][26022] Updated weights on worker 0-0, policy_version 105159 (0.00051) [2022-07-09 05:20:42,323][26022] Updated weights on worker 0-0, policy_version 105169 (0.00086) [2022-07-09 05:20:43,089][25689] Fps is (10 sec: 5791.3, 60 sec: 5737.2, 300 sec: 5709.6). Total num frames: 107696128. Throughput: 0: 6043.1. Samples: 107696478. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:20:43,090][25689] Avg episode reward: [(0, '-51.473')] [2022-07-09 05:20:44,417][26022] Updated weights on worker 0-0, policy_version 105179 (0.00095) [2022-07-09 05:20:46,007][26022] Updated weights on worker 0-0, policy_version 105189 (0.00105) [2022-07-09 05:20:47,754][26022] Updated weights on worker 0-0, policy_version 105199 (0.00084) [2022-07-09 05:20:48,102][25689] Fps is (10 sec: 5701.3, 60 sec: 5723.0, 300 sec: 5706.3). Total num frames: 107724800. Throughput: 0: 6036.8. Samples: 107731304. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:20:48,103][25689] Avg episode reward: [(0, '-51.221')] [2022-07-09 05:20:49,530][26022] Updated weights on worker 0-0, policy_version 105209 (0.00088) [2022-07-09 05:20:51,227][26022] Updated weights on worker 0-0, policy_version 105219 (0.00085) [2022-07-09 05:20:53,135][25689] Fps is (10 sec: 5708.3, 60 sec: 5720.6, 300 sec: 5709.4). Total num frames: 107753472. Throughput: 0: 5188.4. Samples: 107748722. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:20:53,135][25689] Avg episode reward: [(0, '-51.275')] [2022-07-09 05:20:53,176][26022] Updated weights on worker 0-0, policy_version 105229 (0.00085) [2022-07-09 05:20:54,963][26022] Updated weights on worker 0-0, policy_version 105239 (0.00083) [2022-07-09 05:20:56,541][26022] Updated weights on worker 0-0, policy_version 105249 (0.00087) [2022-07-09 05:20:58,147][25689] Fps is (10 sec: 5811.0, 60 sec: 5742.0, 300 sec: 5714.6). Total num frames: 107783168. Throughput: 0: 6056.8. Samples: 107783434. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:20:58,147][25689] Avg episode reward: [(0, '-50.259')] [2022-07-09 05:20:58,283][26022] Updated weights on worker 0-0, policy_version 105259 (0.00089) [2022-07-09 05:21:00,143][26022] Updated weights on worker 0-0, policy_version 105269 (0.00091) [2022-07-09 05:21:02,504][26022] Updated weights on worker 0-0, policy_version 105279 (0.00087) [2022-07-09 05:21:03,207][25689] Fps is (10 sec: 5591.8, 60 sec: 5744.2, 300 sec: 5711.1). Total num frames: 107809792. Throughput: 0: 5916.8. Samples: 107815510. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:03,207][25689] Avg episode reward: [(0, '-50.911')] [2022-07-09 05:21:04,108][26022] Updated weights on worker 0-0, policy_version 105289 (0.00087) [2022-07-09 05:21:06,060][26022] Updated weights on worker 0-0, policy_version 105299 (0.00086) [2022-07-09 05:21:07,587][26022] Updated weights on worker 0-0, policy_version 105309 (0.00089) [2022-07-09 05:21:08,251][25689] Fps is (10 sec: 5472.7, 60 sec: 5732.5, 300 sec: 5714.0). Total num frames: 107838464. Throughput: 0: 5042.8. Samples: 107832908. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:08,251][25689] Avg episode reward: [(0, '-50.756')] [2022-07-09 05:21:09,631][26022] Updated weights on worker 0-0, policy_version 105319 (0.00091) [2022-07-09 05:21:11,226][26022] Updated weights on worker 0-0, policy_version 105329 (0.00086) [2022-07-09 05:21:13,112][26022] Updated weights on worker 0-0, policy_version 105339 (0.00090) [2022-07-09 05:21:13,268][25689] Fps is (10 sec: 5902.9, 60 sec: 5766.3, 300 sec: 5717.3). Total num frames: 107869184. Throughput: 0: 5908.7. Samples: 107867684. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:13,269][25689] Avg episode reward: [(0, '-50.752')] [2022-07-09 05:21:14,752][26022] Updated weights on worker 0-0, policy_version 105349 (0.00089) [2022-07-09 05:21:16,640][26022] Updated weights on worker 0-0, policy_version 105359 (0.00086) [2022-07-09 05:21:18,309][25689] Fps is (10 sec: 5803.0, 60 sec: 5746.2, 300 sec: 5714.0). Total num frames: 107896832. Throughput: 0: 5888.1. Samples: 107902152. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:18,309][25689] Avg episode reward: [(0, '-51.247')] [2022-07-09 05:21:18,331][26022] Updated weights on worker 0-0, policy_version 105369 (0.00087) [2022-07-09 05:21:20,342][26022] Updated weights on worker 0-0, policy_version 105379 (0.00091) [2022-07-09 05:21:21,933][26022] Updated weights on worker 0-0, policy_version 105389 (0.00091) [2022-07-09 05:21:23,351][25689] Fps is (10 sec: 5585.8, 60 sec: 5700.2, 300 sec: 5713.4). Total num frames: 107925504. Throughput: 0: 5149.5. Samples: 107919244. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:23,351][25689] Avg episode reward: [(0, '-50.783')] [2022-07-09 05:21:23,856][26022] Updated weights on worker 0-0, policy_version 105399 (0.00085) [2022-07-09 05:21:25,624][26022] Updated weights on worker 0-0, policy_version 105409 (0.00092) [2022-07-09 05:21:27,395][26022] Updated weights on worker 0-0, policy_version 105419 (0.00087) [2022-07-09 05:21:28,365][25689] Fps is (10 sec: 5804.5, 60 sec: 5717.2, 300 sec: 5716.7). Total num frames: 107955200. Throughput: 0: 6010.2. Samples: 107953796. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:28,366][25689] Avg episode reward: [(0, '-50.563')] [2022-07-09 05:21:29,134][26022] Updated weights on worker 0-0, policy_version 105429 (0.00086) [2022-07-09 05:21:31,143][26022] Updated weights on worker 0-0, policy_version 105439 (0.00498) [2022-07-09 05:21:32,628][26022] Updated weights on worker 0-0, policy_version 105449 (0.00091) [2022-07-09 05:21:33,374][25689] Fps is (10 sec: 5619.1, 60 sec: 5699.9, 300 sec: 5709.9). Total num frames: 107981824. Throughput: 0: 5973.7. Samples: 107987790. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:33,376][25689] Avg episode reward: [(0, '-51.023')] [2022-07-09 05:21:34,805][26022] Updated weights on worker 0-0, policy_version 105459 (0.01109) [2022-07-09 05:21:36,198][26022] Updated weights on worker 0-0, policy_version 105469 (0.00085) [2022-07-09 05:21:38,232][26022] Updated weights on worker 0-0, policy_version 105479 (0.00094) [2022-07-09 05:21:38,383][25689] Fps is (10 sec: 5519.2, 60 sec: 5685.2, 300 sec: 5714.1). Total num frames: 108010496. Throughput: 0: 5119.1. Samples: 108004916. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:38,384][25689] Avg episode reward: [(0, '-51.200')] [2022-07-09 05:21:40,115][26022] Updated weights on worker 0-0, policy_version 105489 (0.00087) [2022-07-09 05:21:41,765][26022] Updated weights on worker 0-0, policy_version 105499 (0.00096) [2022-07-09 05:21:43,423][25689] Fps is (10 sec: 5808.1, 60 sec: 5702.6, 300 sec: 5713.8). Total num frames: 108040192. Throughput: 0: 5969.2. Samples: 108039060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:43,425][25689] Avg episode reward: [(0, '-51.005')] [2022-07-09 05:21:43,518][26022] Updated weights on worker 0-0, policy_version 105509 (0.00093) [2022-07-09 05:21:45,607][26022] Updated weights on worker 0-0, policy_version 105519 (0.00096) [2022-07-09 05:21:46,904][26022] Updated weights on worker 0-0, policy_version 105529 (0.00082) [2022-07-09 05:21:48,430][25689] Fps is (10 sec: 5708.0, 60 sec: 5686.3, 300 sec: 5703.9). Total num frames: 108067840. Throughput: 0: 5957.9. Samples: 108073340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:48,431][25689] Avg episode reward: [(0, '-50.651')] [2022-07-09 05:21:49,096][26022] Updated weights on worker 0-0, policy_version 105539 (0.00101) [2022-07-09 05:21:50,413][26022] Updated weights on worker 0-0, policy_version 105549 (0.00090) [2022-07-09 05:21:52,554][26022] Updated weights on worker 0-0, policy_version 105559 (0.00089) [2022-07-09 05:21:53,434][25689] Fps is (10 sec: 5830.9, 60 sec: 5723.0, 300 sec: 5714.2). Total num frames: 108098560. Throughput: 0: 5127.6. Samples: 108090648. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:53,434][25689] Avg episode reward: [(0, '-51.213')] [2022-07-09 05:21:54,411][26022] Updated weights on worker 0-0, policy_version 105569 (0.00086) [2022-07-09 05:21:56,038][26022] Updated weights on worker 0-0, policy_version 105579 (0.00089) [2022-07-09 05:21:57,947][26022] Updated weights on worker 0-0, policy_version 105589 (0.00089) [2022-07-09 05:21:58,467][25689] Fps is (10 sec: 5815.1, 60 sec: 5687.0, 300 sec: 5711.4). Total num frames: 108126208. Throughput: 0: 5973.8. Samples: 108124890. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:21:58,468][25689] Avg episode reward: [(0, '-51.114')] [2022-07-09 05:21:59,742][26022] Updated weights on worker 0-0, policy_version 105599 (0.00091) [2022-07-09 05:22:01,288][26022] Updated weights on worker 0-0, policy_version 105609 (0.00090) [2022-07-09 05:22:03,562][25689] Fps is (10 sec: 5358.6, 60 sec: 5683.7, 300 sec: 5703.3). Total num frames: 108152832. Throughput: 0: 5849.2. Samples: 108156852. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:03,562][25689] Avg episode reward: [(0, '-50.546')] [2022-07-09 05:22:03,739][26022] Updated weights on worker 0-0, policy_version 105619 (0.00082) [2022-07-09 05:22:05,423][26022] Updated weights on worker 0-0, policy_version 105629 (0.00089) [2022-07-09 05:22:07,333][26022] Updated weights on worker 0-0, policy_version 105639 (0.00091) [2022-07-09 05:22:08,601][25689] Fps is (10 sec: 5456.6, 60 sec: 5684.2, 300 sec: 5703.1). Total num frames: 108181504. Throughput: 0: 5850.0. Samples: 108191342. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:08,602][25689] Avg episode reward: [(0, '-50.951')] [2022-07-09 05:22:08,983][26022] Updated weights on worker 0-0, policy_version 105649 (0.00090) [2022-07-09 05:22:10,915][26022] Updated weights on worker 0-0, policy_version 105659 (0.00086) [2022-07-09 05:22:12,529][26022] Updated weights on worker 0-0, policy_version 105669 (0.00086) [2022-07-09 05:22:13,629][25689] Fps is (10 sec: 5696.2, 60 sec: 5649.3, 300 sec: 5706.1). Total num frames: 108210176. Throughput: 0: 5839.8. Samples: 108208582. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:13,629][25689] Avg episode reward: [(0, '-51.217')] [2022-07-09 05:22:14,470][26022] Updated weights on worker 0-0, policy_version 105679 (0.00084) [2022-07-09 05:22:16,011][26022] Updated weights on worker 0-0, policy_version 105689 (0.00088) [2022-07-09 05:22:18,219][26022] Updated weights on worker 0-0, policy_version 105699 (0.00088) [2022-07-09 05:22:18,670][25689] Fps is (10 sec: 5695.0, 60 sec: 5666.2, 300 sec: 5707.0). Total num frames: 108238848. Throughput: 0: 5840.7. Samples: 108242888. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:18,670][25689] Avg episode reward: [(0, '-51.411')] [2022-07-09 05:22:19,654][26022] Updated weights on worker 0-0, policy_version 105709 (0.00084) [2022-07-09 05:22:21,571][26022] Updated weights on worker 0-0, policy_version 105719 (0.00086) [2022-07-09 05:22:23,319][26022] Updated weights on worker 0-0, policy_version 105729 (0.00094) [2022-07-09 05:22:23,732][25689] Fps is (10 sec: 5777.2, 60 sec: 5681.3, 300 sec: 5709.4). Total num frames: 108268544. Throughput: 0: 5970.0. Samples: 108277268. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:23,732][25689] Avg episode reward: [(0, '-51.448')] [2022-07-09 05:22:25,271][26022] Updated weights on worker 0-0, policy_version 105739 (0.00089) [2022-07-09 05:22:26,862][26022] Updated weights on worker 0-0, policy_version 105749 (0.00096) [2022-07-09 05:22:28,441][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:22:28,464][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000105758_108296192.pth [2022-07-09 05:22:28,465][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000103750_106240000.pth [2022-07-09 05:22:28,754][25689] Fps is (10 sec: 5788.0, 60 sec: 5663.5, 300 sec: 5703.0). Total num frames: 108297216. Throughput: 0: 5118.1. Samples: 108294490. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:28,755][25689] Avg episode reward: [(0, '-52.127')] [2022-07-09 05:22:28,756][26022] Updated weights on worker 0-0, policy_version 105759 (0.00083) [2022-07-09 05:22:30,264][26022] Updated weights on worker 0-0, policy_version 105769 (0.00077) [2022-07-09 05:22:32,400][26022] Updated weights on worker 0-0, policy_version 105779 (0.00094) [2022-07-09 05:22:33,803][25689] Fps is (10 sec: 5795.4, 60 sec: 5710.6, 300 sec: 5709.2). Total num frames: 108326912. Throughput: 0: 5969.7. Samples: 108329018. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:33,804][25689] Avg episode reward: [(0, '-51.885')] [2022-07-09 05:22:34,087][26022] Updated weights on worker 0-0, policy_version 105789 (0.00091) [2022-07-09 05:22:35,930][26022] Updated weights on worker 0-0, policy_version 105799 (0.00100) [2022-07-09 05:22:37,612][26022] Updated weights on worker 0-0, policy_version 105809 (0.00085) [2022-07-09 05:22:38,804][25689] Fps is (10 sec: 5807.6, 60 sec: 5711.4, 300 sec: 5704.7). Total num frames: 108355584. Throughput: 0: 5991.3. Samples: 108363520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:22:38,805][25689] Avg episode reward: [(0, '-50.877')] [2022-07-09 05:22:39,475][26022] Updated weights on worker 0-0, policy_version 105819 (0.00090) [2022-07-09 05:22:41,079][26022] Updated weights on worker 0-0, policy_version 105829 (0.00087) [2022-07-09 05:22:43,072][26022] Updated weights on worker 0-0, policy_version 105839 (0.00089) [2022-07-09 05:22:43,903][25689] Fps is (10 sec: 5677.8, 60 sec: 5688.9, 300 sec: 5706.9). Total num frames: 108384256. Throughput: 0: 5135.9. Samples: 108380864. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:22:43,903][25689] Avg episode reward: [(0, '-50.819')] [2022-07-09 05:22:44,830][26022] Updated weights on worker 0-0, policy_version 105849 (0.00099) [2022-07-09 05:22:46,744][26022] Updated weights on worker 0-0, policy_version 105859 (0.00093) [2022-07-09 05:22:48,561][26022] Updated weights on worker 0-0, policy_version 105869 (0.00090) [2022-07-09 05:22:48,950][25689] Fps is (10 sec: 5551.2, 60 sec: 5685.1, 300 sec: 5701.0). Total num frames: 108411904. Throughput: 0: 5958.1. Samples: 108414818. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:22:48,950][25689] Avg episode reward: [(0, '-50.243')] [2022-07-09 05:22:50,239][26022] Updated weights on worker 0-0, policy_version 105879 (0.00091) [2022-07-09 05:22:52,014][26022] Updated weights on worker 0-0, policy_version 105889 (0.00087) [2022-07-09 05:22:53,718][26022] Updated weights on worker 0-0, policy_version 105899 (0.00089) [2022-07-09 05:22:53,970][25689] Fps is (10 sec: 5594.4, 60 sec: 5649.7, 300 sec: 5701.5). Total num frames: 108440576. Throughput: 0: 5960.0. Samples: 108449212. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:22:53,971][25689] Avg episode reward: [(0, '-49.135')] [2022-07-09 05:22:55,395][26022] Updated weights on worker 0-0, policy_version 105909 (0.00080) [2022-07-09 05:22:57,339][26022] Updated weights on worker 0-0, policy_version 105919 (0.00107) [2022-07-09 05:22:58,978][25689] Fps is (10 sec: 5718.6, 60 sec: 5669.1, 300 sec: 5703.8). Total num frames: 108469248. Throughput: 0: 5112.5. Samples: 108466656. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:22:58,978][25689] Avg episode reward: [(0, '-49.630')] [2022-07-09 05:22:59,206][26022] Updated weights on worker 0-0, policy_version 105929 (0.00085) [2022-07-09 05:23:00,730][26022] Updated weights on worker 0-0, policy_version 105939 (0.00094) [2022-07-09 05:23:03,039][26022] Updated weights on worker 0-0, policy_version 105949 (0.00089) [2022-07-09 05:23:04,092][25689] Fps is (10 sec: 5665.6, 60 sec: 5701.1, 300 sec: 5705.3). Total num frames: 108497920. Throughput: 0: 5850.0. Samples: 108498968. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:04,092][25689] Avg episode reward: [(0, '-49.565')] [2022-07-09 05:23:04,878][26022] Updated weights on worker 0-0, policy_version 105959 (0.00093) [2022-07-09 05:23:06,570][26022] Updated weights on worker 0-0, policy_version 105969 (0.00107) [2022-07-09 05:23:08,371][26022] Updated weights on worker 0-0, policy_version 105979 (0.00098) [2022-07-09 05:23:09,111][25689] Fps is (10 sec: 5557.8, 60 sec: 5686.0, 300 sec: 5701.9). Total num frames: 108525568. Throughput: 0: 5888.7. Samples: 108533542. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:09,112][25689] Avg episode reward: [(0, '-49.726')] [2022-07-09 05:23:10,024][26022] Updated weights on worker 0-0, policy_version 105989 (0.00086) [2022-07-09 05:23:11,929][26022] Updated weights on worker 0-0, policy_version 105999 (0.00088) [2022-07-09 05:23:13,693][26022] Updated weights on worker 0-0, policy_version 106009 (0.00089) [2022-07-09 05:23:14,130][25689] Fps is (10 sec: 5610.6, 60 sec: 5686.9, 300 sec: 5702.2). Total num frames: 108554240. Throughput: 0: 5046.5. Samples: 108550948. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:14,131][25689] Avg episode reward: [(0, '-49.551')] [2022-07-09 05:23:15,536][26022] Updated weights on worker 0-0, policy_version 106019 (0.00096) [2022-07-09 05:23:17,390][26022] Updated weights on worker 0-0, policy_version 106029 (0.00081) [2022-07-09 05:23:19,143][25689] Fps is (10 sec: 5715.9, 60 sec: 5689.5, 300 sec: 5693.6). Total num frames: 108582912. Throughput: 0: 5899.4. Samples: 108585622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:19,144][25689] Avg episode reward: [(0, '-50.119')] [2022-07-09 05:23:19,159][26022] Updated weights on worker 0-0, policy_version 106039 (0.00082) [2022-07-09 05:23:20,756][26022] Updated weights on worker 0-0, policy_version 106049 (0.00083) [2022-07-09 05:23:22,679][26022] Updated weights on worker 0-0, policy_version 106059 (0.00086) [2022-07-09 05:23:24,222][25689] Fps is (10 sec: 5884.6, 60 sec: 5704.8, 300 sec: 5706.9). Total num frames: 108613632. Throughput: 0: 6017.6. Samples: 108620108. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:24,223][25689] Avg episode reward: [(0, '-50.475')] [2022-07-09 05:23:24,323][26022] Updated weights on worker 0-0, policy_version 106069 (0.00089) [2022-07-09 05:23:26,263][26022] Updated weights on worker 0-0, policy_version 106079 (0.00088) [2022-07-09 05:23:28,003][26022] Updated weights on worker 0-0, policy_version 106089 (0.00094) [2022-07-09 05:23:29,226][25689] Fps is (10 sec: 5687.4, 60 sec: 5672.7, 300 sec: 5697.8). Total num frames: 108640256. Throughput: 0: 5154.1. Samples: 108637216. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:29,227][25689] Avg episode reward: [(0, '-50.574')] [2022-07-09 05:23:29,747][26022] Updated weights on worker 0-0, policy_version 106099 (0.00085) [2022-07-09 05:23:31,889][26022] Updated weights on worker 0-0, policy_version 106109 (0.00097) [2022-07-09 05:23:33,291][26022] Updated weights on worker 0-0, policy_version 106119 (0.00095) [2022-07-09 05:23:34,304][25689] Fps is (10 sec: 5687.9, 60 sec: 5686.9, 300 sec: 5701.0). Total num frames: 108670976. Throughput: 0: 5970.2. Samples: 108671394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:34,305][25689] Avg episode reward: [(0, '-51.375')] [2022-07-09 05:23:35,468][26022] Updated weights on worker 0-0, policy_version 106129 (0.00086) [2022-07-09 05:23:36,984][26022] Updated weights on worker 0-0, policy_version 106139 (0.00089) [2022-07-09 05:23:38,982][26022] Updated weights on worker 0-0, policy_version 106149 (0.00085) [2022-07-09 05:23:39,327][25689] Fps is (10 sec: 5778.7, 60 sec: 5668.0, 300 sec: 5701.6). Total num frames: 108698624. Throughput: 0: 5942.8. Samples: 108705568. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:39,327][25689] Avg episode reward: [(0, '-51.496')] [2022-07-09 05:23:40,615][26022] Updated weights on worker 0-0, policy_version 106159 (0.00097) [2022-07-09 05:23:42,632][26022] Updated weights on worker 0-0, policy_version 106169 (0.00092) [2022-07-09 05:23:44,155][26022] Updated weights on worker 0-0, policy_version 106179 (0.00090) [2022-07-09 05:23:44,460][25689] Fps is (10 sec: 5646.7, 60 sec: 5681.7, 300 sec: 5706.1). Total num frames: 108728320. Throughput: 0: 5064.3. Samples: 108722594. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:44,460][25689] Avg episode reward: [(0, '-50.823')] [2022-07-09 05:23:46,167][26022] Updated weights on worker 0-0, policy_version 106189 (0.00092) [2022-07-09 05:23:47,736][26022] Updated weights on worker 0-0, policy_version 106199 (0.00093) [2022-07-09 05:23:49,478][25689] Fps is (10 sec: 5649.1, 60 sec: 5684.4, 300 sec: 5702.7). Total num frames: 108755968. Throughput: 0: 5914.1. Samples: 108756986. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:49,478][25689] Avg episode reward: [(0, '-50.315')] [2022-07-09 05:23:49,778][26022] Updated weights on worker 0-0, policy_version 106209 (0.00096) [2022-07-09 05:23:51,468][26022] Updated weights on worker 0-0, policy_version 106219 (0.00086) [2022-07-09 05:23:53,498][26022] Updated weights on worker 0-0, policy_version 106229 (0.00093) [2022-07-09 05:23:54,486][25689] Fps is (10 sec: 5616.9, 60 sec: 5685.5, 300 sec: 5696.4). Total num frames: 108784640. Throughput: 0: 5925.7. Samples: 108790988. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:54,487][25689] Avg episode reward: [(0, '-50.521')] [2022-07-09 05:23:55,018][26022] Updated weights on worker 0-0, policy_version 106239 (0.00095) [2022-07-09 05:23:56,792][26022] Updated weights on worker 0-0, policy_version 106249 (0.00088) [2022-07-09 05:23:58,569][26022] Updated weights on worker 0-0, policy_version 106259 (0.00086) [2022-07-09 05:23:59,521][25689] Fps is (10 sec: 5811.8, 60 sec: 5699.9, 300 sec: 5705.4). Total num frames: 108814336. Throughput: 0: 5091.2. Samples: 108808380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:23:59,522][25689] Avg episode reward: [(0, '-50.181')] [2022-07-09 05:24:00,371][26022] Updated weights on worker 0-0, policy_version 106269 (0.00099) [2022-07-09 05:24:02,431][26022] Updated weights on worker 0-0, policy_version 106279 (0.00100) [2022-07-09 05:24:04,460][26022] Updated weights on worker 0-0, policy_version 106289 (0.00096) [2022-07-09 05:24:04,560][25689] Fps is (10 sec: 5590.6, 60 sec: 5673.0, 300 sec: 5701.6). Total num frames: 108840960. Throughput: 0: 5873.4. Samples: 108840654. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:04,561][25689] Avg episode reward: [(0, '-51.148')] [2022-07-09 05:24:06,314][26022] Updated weights on worker 0-0, policy_version 106299 (0.00087) [2022-07-09 05:24:07,848][26022] Updated weights on worker 0-0, policy_version 106309 (0.00096) [2022-07-09 05:24:09,564][25689] Fps is (10 sec: 5301.7, 60 sec: 5657.5, 300 sec: 5692.8). Total num frames: 108867584. Throughput: 0: 5868.1. Samples: 108874856. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:09,567][25689] Avg episode reward: [(0, '-51.681')] [2022-07-09 05:24:09,865][26022] Updated weights on worker 0-0, policy_version 106319 (0.00084) [2022-07-09 05:24:11,279][26022] Updated weights on worker 0-0, policy_version 106329 (0.00088) [2022-07-09 05:24:13,371][26022] Updated weights on worker 0-0, policy_version 106339 (0.00083) [2022-07-09 05:24:14,596][25689] Fps is (10 sec: 5713.9, 60 sec: 5690.1, 300 sec: 5699.6). Total num frames: 108898304. Throughput: 0: 5030.9. Samples: 108892160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:14,597][25689] Avg episode reward: [(0, '-52.170')] [2022-07-09 05:24:15,229][26022] Updated weights on worker 0-0, policy_version 106349 (0.00089) [2022-07-09 05:24:16,780][26022] Updated weights on worker 0-0, policy_version 106359 (0.00089) [2022-07-09 05:24:18,869][26022] Updated weights on worker 0-0, policy_version 106369 (0.00096) [2022-07-09 05:24:19,625][25689] Fps is (10 sec: 5903.1, 60 sec: 5688.7, 300 sec: 5700.3). Total num frames: 108926976. Throughput: 0: 5881.7. Samples: 108926630. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:19,627][25689] Avg episode reward: [(0, '-52.390')] [2022-07-09 05:24:20,417][26022] Updated weights on worker 0-0, policy_version 106379 (0.00083) [2022-07-09 05:24:22,251][26022] Updated weights on worker 0-0, policy_version 106389 (0.00085) [2022-07-09 05:24:24,024][26022] Updated weights on worker 0-0, policy_version 106399 (0.00089) [2022-07-09 05:24:24,675][25689] Fps is (10 sec: 5791.4, 60 sec: 5674.6, 300 sec: 5700.9). Total num frames: 108956672. Throughput: 0: 6001.5. Samples: 108961370. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:24,675][25689] Avg episode reward: [(0, '-53.053')] [2022-07-09 05:24:25,785][26022] Updated weights on worker 0-0, policy_version 106409 (0.00637) [2022-07-09 05:24:27,529][26022] Updated weights on worker 0-0, policy_version 106419 (0.00093) [2022-07-09 05:24:28,670][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:24:28,682][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000106426_108980224.pth [2022-07-09 05:24:28,682][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000104421_106927104.pth [2022-07-09 05:24:29,455][26022] Updated weights on worker 0-0, policy_version 106429 (0.00085) [2022-07-09 05:24:29,678][25689] Fps is (10 sec: 5704.2, 60 sec: 5691.5, 300 sec: 5697.9). Total num frames: 108984320. Throughput: 0: 5165.6. Samples: 108978758. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:29,683][25689] Avg episode reward: [(0, '-52.703')] [2022-07-09 05:24:30,896][26022] Updated weights on worker 0-0, policy_version 106439 (0.00093) [2022-07-09 05:24:33,039][26022] Updated weights on worker 0-0, policy_version 106449 (0.00093) [2022-07-09 05:24:34,499][26022] Updated weights on worker 0-0, policy_version 106459 (0.00096) [2022-07-09 05:24:34,692][25689] Fps is (10 sec: 5827.0, 60 sec: 5697.6, 300 sec: 5702.7). Total num frames: 109015040. Throughput: 0: 6052.0. Samples: 109013776. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:34,692][25689] Avg episode reward: [(0, '-52.801')] [2022-07-09 05:24:36,491][26022] Updated weights on worker 0-0, policy_version 106469 (0.00086) [2022-07-09 05:24:38,095][26022] Updated weights on worker 0-0, policy_version 106479 (0.00083) [2022-07-09 05:24:39,703][25689] Fps is (10 sec: 5822.7, 60 sec: 5698.7, 300 sec: 5700.4). Total num frames: 109042688. Throughput: 0: 6053.0. Samples: 109048158. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 05:24:39,703][25689] Avg episode reward: [(0, '-52.343')] [2022-07-09 05:24:40,037][26022] Updated weights on worker 0-0, policy_version 106489 (0.00052) [2022-07-09 05:24:41,663][26022] Updated weights on worker 0-0, policy_version 106499 (0.00085) [2022-07-09 05:24:43,830][26022] Updated weights on worker 0-0, policy_version 106509 (0.00093) [2022-07-09 05:24:44,768][25689] Fps is (10 sec: 5589.5, 60 sec: 5688.1, 300 sec: 5696.5). Total num frames: 109071360. Throughput: 0: 5178.7. Samples: 109065424. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:24:44,768][25689] Avg episode reward: [(0, '-51.835')] [2022-07-09 05:24:45,144][26022] Updated weights on worker 0-0, policy_version 106519 (0.00091) [2022-07-09 05:24:47,335][26022] Updated weights on worker 0-0, policy_version 106529 (0.00081) [2022-07-09 05:24:48,846][26022] Updated weights on worker 0-0, policy_version 106539 (0.00082) [2022-07-09 05:24:49,778][25689] Fps is (10 sec: 5691.5, 60 sec: 5705.8, 300 sec: 5696.4). Total num frames: 109100032. Throughput: 0: 6017.3. Samples: 109099704. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:24:49,779][25689] Avg episode reward: [(0, '-51.400')] [2022-07-09 05:24:50,804][26022] Updated weights on worker 0-0, policy_version 106549 (0.00087) [2022-07-09 05:24:52,532][26022] Updated weights on worker 0-0, policy_version 106559 (0.00087) [2022-07-09 05:24:54,386][26022] Updated weights on worker 0-0, policy_version 106569 (0.00086) [2022-07-09 05:24:54,803][25689] Fps is (10 sec: 5816.3, 60 sec: 5721.3, 300 sec: 5700.5). Total num frames: 109129728. Throughput: 0: 5983.6. Samples: 109134114. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:24:54,803][25689] Avg episode reward: [(0, '-50.603')] [2022-07-09 05:24:56,078][26022] Updated weights on worker 0-0, policy_version 106579 (0.00084) [2022-07-09 05:24:57,781][26022] Updated weights on worker 0-0, policy_version 106589 (0.00093) [2022-07-09 05:24:59,671][26022] Updated weights on worker 0-0, policy_version 106599 (0.00088) [2022-07-09 05:24:59,809][25689] Fps is (10 sec: 5716.5, 60 sec: 5690.0, 300 sec: 5705.5). Total num frames: 109157376. Throughput: 0: 5136.4. Samples: 109151434. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:24:59,810][25689] Avg episode reward: [(0, '-50.807')] [2022-07-09 05:25:01,402][26022] Updated weights on worker 0-0, policy_version 106609 (0.00086) [2022-07-09 05:25:03,730][26022] Updated weights on worker 0-0, policy_version 106619 (0.00083) [2022-07-09 05:25:04,861][25689] Fps is (10 sec: 5395.8, 60 sec: 5688.8, 300 sec: 5696.0). Total num frames: 109184000. Throughput: 0: 5888.2. Samples: 109183738. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:04,866][25689] Avg episode reward: [(0, '-50.572')] [2022-07-09 05:25:05,279][26022] Updated weights on worker 0-0, policy_version 106629 (0.00092) [2022-07-09 05:25:07,189][26022] Updated weights on worker 0-0, policy_version 106639 (0.00092) [2022-07-09 05:25:08,785][26022] Updated weights on worker 0-0, policy_version 106649 (0.00086) [2022-07-09 05:25:09,901][25689] Fps is (10 sec: 5479.5, 60 sec: 5719.4, 300 sec: 5695.6). Total num frames: 109212672. Throughput: 0: 5874.8. Samples: 109217920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:09,901][25689] Avg episode reward: [(0, '-50.969')] [2022-07-09 05:25:10,799][26022] Updated weights on worker 0-0, policy_version 106659 (0.00090) [2022-07-09 05:25:12,462][26022] Updated weights on worker 0-0, policy_version 106669 (0.00090) [2022-07-09 05:25:14,309][26022] Updated weights on worker 0-0, policy_version 106679 (0.00087) [2022-07-09 05:25:14,943][25689] Fps is (10 sec: 5789.3, 60 sec: 5701.5, 300 sec: 5698.4). Total num frames: 109242368. Throughput: 0: 5019.0. Samples: 109235198. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:14,943][25689] Avg episode reward: [(0, '-51.119')] [2022-07-09 05:25:16,050][26022] Updated weights on worker 0-0, policy_version 106689 (0.00094) [2022-07-09 05:25:17,939][26022] Updated weights on worker 0-0, policy_version 106699 (0.00087) [2022-07-09 05:25:19,520][26022] Updated weights on worker 0-0, policy_version 106709 (0.00090) [2022-07-09 05:25:19,957][25689] Fps is (10 sec: 5906.1, 60 sec: 5719.9, 300 sec: 5693.0). Total num frames: 109272064. Throughput: 0: 5878.5. Samples: 109269872. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:19,957][25689] Avg episode reward: [(0, '-51.273')] [2022-07-09 05:25:21,473][26022] Updated weights on worker 0-0, policy_version 106719 (0.00092) [2022-07-09 05:25:23,232][26022] Updated weights on worker 0-0, policy_version 106729 (0.00088) [2022-07-09 05:25:24,943][26022] Updated weights on worker 0-0, policy_version 106739 (0.00088) [2022-07-09 05:25:25,036][25689] Fps is (10 sec: 5782.8, 60 sec: 5700.0, 300 sec: 5691.7). Total num frames: 109300736. Throughput: 0: 5982.7. Samples: 109304442. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:25,037][25689] Avg episode reward: [(0, '-51.252')] [2022-07-09 05:25:26,741][26022] Updated weights on worker 0-0, policy_version 106749 (0.00082) [2022-07-09 05:25:28,509][26022] Updated weights on worker 0-0, policy_version 106759 (0.00088) [2022-07-09 05:25:30,048][25689] Fps is (10 sec: 5784.0, 60 sec: 5733.2, 300 sec: 5698.5). Total num frames: 109330432. Throughput: 0: 6016.3. Samples: 109339132. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:30,048][25689] Avg episode reward: [(0, '-50.573')] [2022-07-09 05:25:30,349][26022] Updated weights on worker 0-0, policy_version 106769 (0.00084) [2022-07-09 05:25:32,055][26022] Updated weights on worker 0-0, policy_version 106779 (0.00087) [2022-07-09 05:25:33,803][26022] Updated weights on worker 0-0, policy_version 106789 (0.00087) [2022-07-09 05:25:35,064][25689] Fps is (10 sec: 5718.6, 60 sec: 5682.1, 300 sec: 5692.0). Total num frames: 109358080. Throughput: 0: 6022.8. Samples: 109356384. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:35,065][25689] Avg episode reward: [(0, '-50.739')] [2022-07-09 05:25:35,692][26022] Updated weights on worker 0-0, policy_version 106799 (0.00093) [2022-07-09 05:25:37,396][26022] Updated weights on worker 0-0, policy_version 106809 (0.00092) [2022-07-09 05:25:39,051][26022] Updated weights on worker 0-0, policy_version 106819 (0.00088) [2022-07-09 05:25:40,073][25689] Fps is (10 sec: 5617.7, 60 sec: 5699.2, 300 sec: 5692.6). Total num frames: 109386752. Throughput: 0: 6008.5. Samples: 109390742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:40,074][25689] Avg episode reward: [(0, '-51.027')] [2022-07-09 05:25:40,995][26022] Updated weights on worker 0-0, policy_version 106829 (0.00088) [2022-07-09 05:25:42,926][26022] Updated weights on worker 0-0, policy_version 106839 (0.00092) [2022-07-09 05:25:44,543][26022] Updated weights on worker 0-0, policy_version 106849 (0.00088) [2022-07-09 05:25:45,139][25689] Fps is (10 sec: 5793.6, 60 sec: 5716.1, 300 sec: 5695.1). Total num frames: 109416448. Throughput: 0: 6010.3. Samples: 109425262. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:45,141][25689] Avg episode reward: [(0, '-51.265')] [2022-07-09 05:25:46,570][26022] Updated weights on worker 0-0, policy_version 106859 (0.00087) [2022-07-09 05:25:48,146][26022] Updated weights on worker 0-0, policy_version 106869 (0.00084) [2022-07-09 05:25:50,030][26022] Updated weights on worker 0-0, policy_version 106879 (0.00084) [2022-07-09 05:25:50,155][25689] Fps is (10 sec: 5688.2, 60 sec: 5698.6, 300 sec: 5692.0). Total num frames: 109444096. Throughput: 0: 5149.9. Samples: 109442678. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:50,156][25689] Avg episode reward: [(0, '-51.272')] [2022-07-09 05:25:51,719][26022] Updated weights on worker 0-0, policy_version 106889 (0.00094) [2022-07-09 05:25:53,543][26022] Updated weights on worker 0-0, policy_version 106899 (0.00089) [2022-07-09 05:25:55,169][25689] Fps is (10 sec: 5716.8, 60 sec: 5699.6, 300 sec: 5691.9). Total num frames: 109473792. Throughput: 0: 6021.7. Samples: 109477452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:25:55,170][25689] Avg episode reward: [(0, '-51.610')] [2022-07-09 05:25:55,262][26022] Updated weights on worker 0-0, policy_version 106909 (0.00093) [2022-07-09 05:25:56,959][26022] Updated weights on worker 0-0, policy_version 106919 (0.00096) [2022-07-09 05:25:58,766][26022] Updated weights on worker 0-0, policy_version 106929 (0.00088) [2022-07-09 05:26:00,191][25689] Fps is (10 sec: 5815.7, 60 sec: 5715.1, 300 sec: 5699.5). Total num frames: 109502464. Throughput: 0: 6040.9. Samples: 109512270. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:00,192][25689] Avg episode reward: [(0, '-52.196')] [2022-07-09 05:26:00,707][26022] Updated weights on worker 0-0, policy_version 106939 (0.00087) [2022-07-09 05:26:02,617][26022] Updated weights on worker 0-0, policy_version 106949 (0.00091) [2022-07-09 05:26:04,603][26022] Updated weights on worker 0-0, policy_version 106959 (0.00087) [2022-07-09 05:26:05,236][25689] Fps is (10 sec: 5594.8, 60 sec: 5732.7, 300 sec: 5696.1). Total num frames: 109530112. Throughput: 0: 5088.0. Samples: 109527516. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:05,237][25689] Avg episode reward: [(0, '-52.041')] [2022-07-09 05:26:06,116][26022] Updated weights on worker 0-0, policy_version 106969 (0.00085) [2022-07-09 05:26:08,085][26022] Updated weights on worker 0-0, policy_version 106979 (0.00085) [2022-07-09 05:26:09,636][26022] Updated weights on worker 0-0, policy_version 106989 (0.00083) [2022-07-09 05:26:10,337][25689] Fps is (10 sec: 5652.1, 60 sec: 5743.9, 300 sec: 5691.0). Total num frames: 109559808. Throughput: 0: 5898.0. Samples: 109561712. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:10,337][25689] Avg episode reward: [(0, '-51.974')] [2022-07-09 05:26:11,692][26022] Updated weights on worker 0-0, policy_version 106999 (0.00088) [2022-07-09 05:26:13,152][26022] Updated weights on worker 0-0, policy_version 107009 (0.00073) [2022-07-09 05:26:15,129][26022] Updated weights on worker 0-0, policy_version 107019 (0.00090) [2022-07-09 05:26:15,354][25689] Fps is (10 sec: 5768.8, 60 sec: 5729.3, 300 sec: 5694.9). Total num frames: 109588480. Throughput: 0: 5906.0. Samples: 109596660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:15,354][25689] Avg episode reward: [(0, '-51.468')] [2022-07-09 05:26:16,770][26022] Updated weights on worker 0-0, policy_version 107029 (0.00084) [2022-07-09 05:26:18,583][26022] Updated weights on worker 0-0, policy_version 107039 (0.00057) [2022-07-09 05:26:20,377][25689] Fps is (10 sec: 5711.2, 60 sec: 5711.4, 300 sec: 5695.3). Total num frames: 109617152. Throughput: 0: 5048.4. Samples: 109614178. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:20,378][25689] Avg episode reward: [(0, '-52.033')] [2022-07-09 05:26:20,442][26022] Updated weights on worker 0-0, policy_version 107049 (0.00089) [2022-07-09 05:26:22,249][26022] Updated weights on worker 0-0, policy_version 107059 (0.00086) [2022-07-09 05:26:24,078][26022] Updated weights on worker 0-0, policy_version 107069 (0.00095) [2022-07-09 05:26:25,444][25689] Fps is (10 sec: 5683.2, 60 sec: 5712.7, 300 sec: 5690.8). Total num frames: 109645824. Throughput: 0: 5990.4. Samples: 109648570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:25,444][25689] Avg episode reward: [(0, '-52.414')] [2022-07-09 05:26:25,860][26022] Updated weights on worker 0-0, policy_version 107079 (0.00090) [2022-07-09 05:26:27,555][26022] Updated weights on worker 0-0, policy_version 107089 (0.00049) [2022-07-09 05:26:28,779][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:26:28,794][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000107096_109666304.pth [2022-07-09 05:26:28,795][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000105091_107613184.pth [2022-07-09 05:26:29,444][26022] Updated weights on worker 0-0, policy_version 107099 (0.00089) [2022-07-09 05:26:30,460][25689] Fps is (10 sec: 5687.3, 60 sec: 5695.3, 300 sec: 5697.6). Total num frames: 109674496. Throughput: 0: 6029.2. Samples: 109683040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:30,461][25689] Avg episode reward: [(0, '-52.341')] [2022-07-09 05:26:31,171][26022] Updated weights on worker 0-0, policy_version 107109 (0.00088) [2022-07-09 05:26:32,870][26022] Updated weights on worker 0-0, policy_version 107119 (0.00099) [2022-07-09 05:26:34,681][26022] Updated weights on worker 0-0, policy_version 107129 (0.00088) [2022-07-09 05:26:35,477][25689] Fps is (10 sec: 5817.4, 60 sec: 5729.1, 300 sec: 5700.9). Total num frames: 109704192. Throughput: 0: 5157.9. Samples: 109700454. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:35,478][25689] Avg episode reward: [(0, '-51.344')] [2022-07-09 05:26:36,595][26022] Updated weights on worker 0-0, policy_version 107139 (0.00107) [2022-07-09 05:26:38,193][26022] Updated weights on worker 0-0, policy_version 107149 (0.01025) [2022-07-09 05:26:40,149][26022] Updated weights on worker 0-0, policy_version 107159 (0.00088) [2022-07-09 05:26:40,531][25689] Fps is (10 sec: 5694.1, 60 sec: 5707.9, 300 sec: 5693.7). Total num frames: 109731840. Throughput: 0: 5979.6. Samples: 109734688. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 05:26:40,531][25689] Avg episode reward: [(0, '-52.164')] [2022-07-09 05:26:41,747][26022] Updated weights on worker 0-0, policy_version 107169 (0.00082) [2022-07-09 05:26:43,679][26022] Updated weights on worker 0-0, policy_version 107179 (0.00094) [2022-07-09 05:26:45,488][26022] Updated weights on worker 0-0, policy_version 107189 (0.00083) [2022-07-09 05:26:45,572][25689] Fps is (10 sec: 5680.5, 60 sec: 5710.2, 300 sec: 5699.9). Total num frames: 109761536. Throughput: 0: 5992.5. Samples: 109769188. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:26:45,572][25689] Avg episode reward: [(0, '-52.209')] [2022-07-09 05:26:47,249][26022] Updated weights on worker 0-0, policy_version 107199 (0.00084) [2022-07-09 05:26:48,975][26022] Updated weights on worker 0-0, policy_version 107209 (0.00095) [2022-07-09 05:26:50,595][25689] Fps is (10 sec: 5799.9, 60 sec: 5726.5, 300 sec: 5692.7). Total num frames: 109790208. Throughput: 0: 5134.2. Samples: 109786414. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:26:50,595][25689] Avg episode reward: [(0, '-51.681')] [2022-07-09 05:26:50,975][26022] Updated weights on worker 0-0, policy_version 107219 (0.00091) [2022-07-09 05:26:52,524][26022] Updated weights on worker 0-0, policy_version 107229 (0.00088) [2022-07-09 05:26:54,585][26022] Updated weights on worker 0-0, policy_version 107239 (0.00084) [2022-07-09 05:26:55,618][25689] Fps is (10 sec: 5708.3, 60 sec: 5708.8, 300 sec: 5696.3). Total num frames: 109818880. Throughput: 0: 5969.1. Samples: 109820674. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:26:55,619][25689] Avg episode reward: [(0, '-51.956')] [2022-07-09 05:26:56,130][26022] Updated weights on worker 0-0, policy_version 107249 (0.00099) [2022-07-09 05:26:58,015][26022] Updated weights on worker 0-0, policy_version 107259 (0.00100) [2022-07-09 05:26:59,910][26022] Updated weights on worker 0-0, policy_version 107269 (0.00100) [2022-07-09 05:27:00,621][25689] Fps is (10 sec: 5719.1, 60 sec: 5710.5, 300 sec: 5705.0). Total num frames: 109847552. Throughput: 0: 5991.8. Samples: 109855066. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:00,623][25689] Avg episode reward: [(0, '-52.208')] [2022-07-09 05:27:01,700][26022] Updated weights on worker 0-0, policy_version 107279 (0.00091) [2022-07-09 05:27:03,811][26022] Updated weights on worker 0-0, policy_version 107289 (0.00082) [2022-07-09 05:27:05,405][26022] Updated weights on worker 0-0, policy_version 107299 (0.00083) [2022-07-09 05:27:05,735][25689] Fps is (10 sec: 5465.5, 60 sec: 5687.1, 300 sec: 5696.7). Total num frames: 109874176. Throughput: 0: 4999.8. Samples: 109869998. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:05,735][25689] Avg episode reward: [(0, '-52.808')] [2022-07-09 05:27:07,401][26022] Updated weights on worker 0-0, policy_version 107309 (0.00089) [2022-07-09 05:27:09,282][26022] Updated weights on worker 0-0, policy_version 107319 (0.00093) [2022-07-09 05:27:10,758][25689] Fps is (10 sec: 5454.9, 60 sec: 5677.4, 300 sec: 5696.7). Total num frames: 109902848. Throughput: 0: 5850.0. Samples: 109904372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:10,760][25689] Avg episode reward: [(0, '-51.961')] [2022-07-09 05:27:11,016][26022] Updated weights on worker 0-0, policy_version 107329 (0.00593) [2022-07-09 05:27:12,583][26022] Updated weights on worker 0-0, policy_version 107339 (0.00090) [2022-07-09 05:27:14,683][26022] Updated weights on worker 0-0, policy_version 107349 (0.00087) [2022-07-09 05:27:15,770][25689] Fps is (10 sec: 5918.5, 60 sec: 5711.9, 300 sec: 5704.2). Total num frames: 109933568. Throughput: 0: 5868.0. Samples: 109938926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:15,770][25689] Avg episode reward: [(0, '-52.794')] [2022-07-09 05:27:16,157][26022] Updated weights on worker 0-0, policy_version 107359 (0.00082) [2022-07-09 05:27:18,144][26022] Updated weights on worker 0-0, policy_version 107369 (0.00088) [2022-07-09 05:27:19,952][26022] Updated weights on worker 0-0, policy_version 107379 (0.00087) [2022-07-09 05:27:20,800][25689] Fps is (10 sec: 5710.6, 60 sec: 5677.3, 300 sec: 5694.5). Total num frames: 109960192. Throughput: 0: 5010.9. Samples: 109956180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:20,801][25689] Avg episode reward: [(0, '-52.065')] [2022-07-09 05:27:21,475][26022] Updated weights on worker 0-0, policy_version 107389 (0.00088) [2022-07-09 05:27:23,565][26022] Updated weights on worker 0-0, policy_version 107399 (0.00095) [2022-07-09 05:27:25,017][26022] Updated weights on worker 0-0, policy_version 107409 (0.00091) [2022-07-09 05:27:25,838][25689] Fps is (10 sec: 5594.0, 60 sec: 5697.0, 300 sec: 5697.6). Total num frames: 109989888. Throughput: 0: 6019.6. Samples: 109991008. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:25,838][25689] Avg episode reward: [(0, '-52.017')] [2022-07-09 05:27:26,980][26022] Updated weights on worker 0-0, policy_version 107419 (0.00091) [2022-07-09 05:27:28,678][26022] Updated weights on worker 0-0, policy_version 107429 (0.00083) [2022-07-09 05:27:30,476][26022] Updated weights on worker 0-0, policy_version 107439 (0.00090) [2022-07-09 05:27:30,889][25689] Fps is (10 sec: 5886.9, 60 sec: 5710.7, 300 sec: 5697.6). Total num frames: 110019584. Throughput: 0: 6023.4. Samples: 110025626. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:30,890][25689] Avg episode reward: [(0, '-52.685')] [2022-07-09 05:27:32,245][26022] Updated weights on worker 0-0, policy_version 107449 (0.00088) [2022-07-09 05:27:34,103][26022] Updated weights on worker 0-0, policy_version 107459 (0.00098) [2022-07-09 05:27:35,841][26022] Updated weights on worker 0-0, policy_version 107469 (0.00093) [2022-07-09 05:27:35,915][25689] Fps is (10 sec: 5792.2, 60 sec: 5692.9, 300 sec: 5697.1). Total num frames: 110048256. Throughput: 0: 5171.6. Samples: 110043108. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:35,915][25689] Avg episode reward: [(0, '-52.705')] [2022-07-09 05:27:37,533][26022] Updated weights on worker 0-0, policy_version 107479 (0.00087) [2022-07-09 05:27:39,474][26022] Updated weights on worker 0-0, policy_version 107489 (0.00088) [2022-07-09 05:27:40,922][25689] Fps is (10 sec: 5817.5, 60 sec: 5731.2, 300 sec: 5702.3). Total num frames: 110077952. Throughput: 0: 6033.8. Samples: 110077592. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:40,923][25689] Avg episode reward: [(0, '-52.941')] [2022-07-09 05:27:40,951][26022] Updated weights on worker 0-0, policy_version 107499 (0.00086) [2022-07-09 05:27:43,156][26022] Updated weights on worker 0-0, policy_version 107509 (0.00086) [2022-07-09 05:27:44,544][26022] Updated weights on worker 0-0, policy_version 107519 (0.00084) [2022-07-09 05:27:46,036][25689] Fps is (10 sec: 5665.8, 60 sec: 5690.4, 300 sec: 5701.0). Total num frames: 110105600. Throughput: 0: 6019.9. Samples: 110112598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:46,036][25689] Avg episode reward: [(0, '-52.887')] [2022-07-09 05:27:46,388][26022] Updated weights on worker 0-0, policy_version 107529 (0.00090) [2022-07-09 05:27:48,304][26022] Updated weights on worker 0-0, policy_version 107539 (0.00093) [2022-07-09 05:27:49,897][26022] Updated weights on worker 0-0, policy_version 107549 (0.00095) [2022-07-09 05:27:51,063][25689] Fps is (10 sec: 5755.9, 60 sec: 5723.9, 300 sec: 5707.8). Total num frames: 110136320. Throughput: 0: 5171.0. Samples: 110129946. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:51,064][25689] Avg episode reward: [(0, '-52.637')] [2022-07-09 05:27:51,811][26022] Updated weights on worker 0-0, policy_version 107559 (0.00085) [2022-07-09 05:27:53,646][26022] Updated weights on worker 0-0, policy_version 107569 (0.00091) [2022-07-09 05:27:55,218][26022] Updated weights on worker 0-0, policy_version 107579 (0.00089) [2022-07-09 05:27:56,092][25689] Fps is (10 sec: 5906.0, 60 sec: 5723.3, 300 sec: 5707.4). Total num frames: 110164992. Throughput: 0: 6025.5. Samples: 110164686. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:27:56,092][25689] Avg episode reward: [(0, '-52.232')] [2022-07-09 05:27:57,095][26022] Updated weights on worker 0-0, policy_version 107589 (0.00089) [2022-07-09 05:27:58,756][26022] Updated weights on worker 0-0, policy_version 107599 (0.00090) [2022-07-09 05:28:00,763][26022] Updated weights on worker 0-0, policy_version 107609 (0.00092) [2022-07-09 05:28:01,127][25689] Fps is (10 sec: 5799.4, 60 sec: 5737.2, 300 sec: 5712.3). Total num frames: 110194688. Throughput: 0: 6014.2. Samples: 110199110. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:01,128][25689] Avg episode reward: [(0, '-51.392')] [2022-07-09 05:28:02,895][26022] Updated weights on worker 0-0, policy_version 107619 (0.00088) [2022-07-09 05:28:04,589][26022] Updated weights on worker 0-0, policy_version 107629 (0.00087) [2022-07-09 05:28:06,173][25689] Fps is (10 sec: 5485.2, 60 sec: 5726.7, 300 sec: 5704.9). Total num frames: 110220288. Throughput: 0: 5049.5. Samples: 110214280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:06,173][25689] Avg episode reward: [(0, '-51.052')] [2022-07-09 05:28:06,503][26022] Updated weights on worker 0-0, policy_version 107639 (0.00103) [2022-07-09 05:28:08,154][26022] Updated weights on worker 0-0, policy_version 107649 (0.00113) [2022-07-09 05:28:09,943][26022] Updated weights on worker 0-0, policy_version 107659 (0.00089) [2022-07-09 05:28:11,189][25689] Fps is (10 sec: 5393.9, 60 sec: 5727.4, 300 sec: 5705.0). Total num frames: 110248960. Throughput: 0: 5888.5. Samples: 110248458. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:11,189][25689] Avg episode reward: [(0, '-50.978')] [2022-07-09 05:28:11,936][26022] Updated weights on worker 0-0, policy_version 107669 (0.00087) [2022-07-09 05:28:13,422][26022] Updated weights on worker 0-0, policy_version 107679 (0.00085) [2022-07-09 05:28:15,530][26022] Updated weights on worker 0-0, policy_version 107689 (0.00090) [2022-07-09 05:28:16,216][25689] Fps is (10 sec: 5913.5, 60 sec: 5725.9, 300 sec: 5711.6). Total num frames: 110279680. Throughput: 0: 5886.7. Samples: 110283152. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:16,217][25689] Avg episode reward: [(0, '-51.399')] [2022-07-09 05:28:16,906][26022] Updated weights on worker 0-0, policy_version 107699 (0.00090) [2022-07-09 05:28:19,006][26022] Updated weights on worker 0-0, policy_version 107709 (0.00092) [2022-07-09 05:28:20,748][26022] Updated weights on worker 0-0, policy_version 107719 (0.00088) [2022-07-09 05:28:21,293][25689] Fps is (10 sec: 5776.5, 60 sec: 5738.4, 300 sec: 5701.3). Total num frames: 110307328. Throughput: 0: 5028.3. Samples: 110300512. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:21,295][25689] Avg episode reward: [(0, '-51.361')] [2022-07-09 05:28:22,525][26022] Updated weights on worker 0-0, policy_version 107729 (0.00088) [2022-07-09 05:28:24,349][26022] Updated weights on worker 0-0, policy_version 107739 (0.00085) [2022-07-09 05:28:26,053][26022] Updated weights on worker 0-0, policy_version 107749 (0.00099) [2022-07-09 05:28:26,375][25689] Fps is (10 sec: 5645.1, 60 sec: 5734.3, 300 sec: 5710.2). Total num frames: 110337024. Throughput: 0: 5971.6. Samples: 110334916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:26,375][25689] Avg episode reward: [(0, '-51.418')] [2022-07-09 05:28:27,786][26022] Updated weights on worker 0-0, policy_version 107759 (0.00089) [2022-07-09 05:28:28,808][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:28:28,823][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000107764_110350336.pth [2022-07-09 05:28:28,827][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000105758_108296192.pth [2022-07-09 05:28:29,465][26022] Updated weights on worker 0-0, policy_version 107769 (0.00086) [2022-07-09 05:28:31,413][25689] Fps is (10 sec: 5666.5, 60 sec: 5701.6, 300 sec: 5700.6). Total num frames: 110364672. Throughput: 0: 5978.6. Samples: 110369370. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:31,414][25689] Avg episode reward: [(0, '-51.861')] [2022-07-09 05:28:31,459][26022] Updated weights on worker 0-0, policy_version 107779 (0.00100) [2022-07-09 05:28:33,089][26022] Updated weights on worker 0-0, policy_version 107789 (0.00091) [2022-07-09 05:28:35,033][26022] Updated weights on worker 0-0, policy_version 107799 (0.00095) [2022-07-09 05:28:36,431][25689] Fps is (10 sec: 5702.3, 60 sec: 5719.3, 300 sec: 5707.6). Total num frames: 110394368. Throughput: 0: 5948.4. Samples: 110403396. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:36,431][25689] Avg episode reward: [(0, '-52.324')] [2022-07-09 05:28:36,619][26022] Updated weights on worker 0-0, policy_version 107809 (0.00089) [2022-07-09 05:28:38,684][26022] Updated weights on worker 0-0, policy_version 107819 (0.00089) [2022-07-09 05:28:40,180][26022] Updated weights on worker 0-0, policy_version 107829 (0.00088) [2022-07-09 05:28:41,446][25689] Fps is (10 sec: 5715.5, 60 sec: 5684.7, 300 sec: 5702.9). Total num frames: 110422016. Throughput: 0: 5956.4. Samples: 110420552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 05:28:41,447][25689] Avg episode reward: [(0, '-52.390')] [2022-07-09 05:28:42,354][26022] Updated weights on worker 0-0, policy_version 107839 (0.00092) [2022-07-09 05:28:43,789][26022] Updated weights on worker 0-0, policy_version 107849 (0.00086) [2022-07-09 05:28:45,902][26022] Updated weights on worker 0-0, policy_version 107859 (0.00094) [2022-07-09 05:28:46,529][25689] Fps is (10 sec: 5779.9, 60 sec: 5738.4, 300 sec: 5712.0). Total num frames: 110452736. Throughput: 0: 5981.7. Samples: 110455476. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:28:46,530][25689] Avg episode reward: [(0, '-52.049')] [2022-07-09 05:28:47,399][26022] Updated weights on worker 0-0, policy_version 107869 (0.00083) [2022-07-09 05:28:49,259][26022] Updated weights on worker 0-0, policy_version 107879 (0.00087) [2022-07-09 05:28:51,063][26022] Updated weights on worker 0-0, policy_version 107889 (0.00091) [2022-07-09 05:28:51,537][25689] Fps is (10 sec: 5784.2, 60 sec: 5689.4, 300 sec: 5708.6). Total num frames: 110480384. Throughput: 0: 5978.8. Samples: 110489688. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:28:51,538][25689] Avg episode reward: [(0, '-52.445')] [2022-07-09 05:28:52,878][26022] Updated weights on worker 0-0, policy_version 107899 (0.00094) [2022-07-09 05:28:54,682][26022] Updated weights on worker 0-0, policy_version 107909 (0.00094) [2022-07-09 05:28:56,372][26022] Updated weights on worker 0-0, policy_version 107919 (0.00057) [2022-07-09 05:28:56,559][25689] Fps is (10 sec: 5717.7, 60 sec: 5707.1, 300 sec: 5708.9). Total num frames: 110510080. Throughput: 0: 5159.7. Samples: 110507248. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:28:56,559][25689] Avg episode reward: [(0, '-52.555')] [2022-07-09 05:28:58,319][26022] Updated weights on worker 0-0, policy_version 107929 (0.00066) [2022-07-09 05:29:00,052][26022] Updated weights on worker 0-0, policy_version 107939 (0.00088) [2022-07-09 05:29:01,568][25689] Fps is (10 sec: 5819.2, 60 sec: 5692.6, 300 sec: 5716.3). Total num frames: 110538752. Throughput: 0: 6038.8. Samples: 110542060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:01,568][25689] Avg episode reward: [(0, '-52.419')] [2022-07-09 05:29:01,738][26022] Updated weights on worker 0-0, policy_version 107949 (0.00093) [2022-07-09 05:29:04,076][26022] Updated weights on worker 0-0, policy_version 107959 (0.00086) [2022-07-09 05:29:05,665][26022] Updated weights on worker 0-0, policy_version 107969 (0.00084) [2022-07-09 05:29:06,683][25689] Fps is (10 sec: 5461.8, 60 sec: 5703.0, 300 sec: 5714.2). Total num frames: 110565376. Throughput: 0: 5900.7. Samples: 110574394. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:06,684][25689] Avg episode reward: [(0, '-52.312')] [2022-07-09 05:29:07,468][26022] Updated weights on worker 0-0, policy_version 107979 (0.00087) [2022-07-09 05:29:09,244][26022] Updated weights on worker 0-0, policy_version 107989 (0.00094) [2022-07-09 05:29:11,002][26022] Updated weights on worker 0-0, policy_version 107999 (0.00087) [2022-07-09 05:29:11,694][25689] Fps is (10 sec: 5460.6, 60 sec: 5703.5, 300 sec: 5707.7). Total num frames: 110594048. Throughput: 0: 5047.0. Samples: 110591418. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:11,694][25689] Avg episode reward: [(0, '-51.613')] [2022-07-09 05:29:12,789][26022] Updated weights on worker 0-0, policy_version 108009 (0.00088) [2022-07-09 05:29:14,828][26022] Updated weights on worker 0-0, policy_version 108019 (0.00085) [2022-07-09 05:29:16,415][26022] Updated weights on worker 0-0, policy_version 108029 (0.00082) [2022-07-09 05:29:16,710][25689] Fps is (10 sec: 5718.9, 60 sec: 5670.7, 300 sec: 5708.0). Total num frames: 110622720. Throughput: 0: 5879.7. Samples: 110625730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:16,710][25689] Avg episode reward: [(0, '-51.640')] [2022-07-09 05:29:18,171][26022] Updated weights on worker 0-0, policy_version 108039 (0.00087) [2022-07-09 05:29:20,052][26022] Updated weights on worker 0-0, policy_version 108049 (0.00090) [2022-07-09 05:29:21,746][25689] Fps is (10 sec: 5704.5, 60 sec: 5691.4, 300 sec: 5704.8). Total num frames: 110651392. Throughput: 0: 5845.8. Samples: 110660020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:21,747][25689] Avg episode reward: [(0, '-50.594')] [2022-07-09 05:29:21,943][26022] Updated weights on worker 0-0, policy_version 108059 (0.00083) [2022-07-09 05:29:23,578][26022] Updated weights on worker 0-0, policy_version 108069 (0.00091) [2022-07-09 05:29:25,598][26022] Updated weights on worker 0-0, policy_version 108079 (0.00091) [2022-07-09 05:29:26,882][25689] Fps is (10 sec: 5838.7, 60 sec: 5703.2, 300 sec: 5712.6). Total num frames: 110682112. Throughput: 0: 5081.9. Samples: 110677044. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:26,883][25689] Avg episode reward: [(0, '-50.913')] [2022-07-09 05:29:27,016][26022] Updated weights on worker 0-0, policy_version 108089 (0.00094) [2022-07-09 05:29:29,094][26022] Updated weights on worker 0-0, policy_version 108099 (0.01201) [2022-07-09 05:29:30,751][26022] Updated weights on worker 0-0, policy_version 108109 (0.00086) [2022-07-09 05:29:31,896][25689] Fps is (10 sec: 5750.9, 60 sec: 5705.6, 300 sec: 5702.2). Total num frames: 110709760. Throughput: 0: 5951.1. Samples: 110711640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:31,896][25689] Avg episode reward: [(0, '-50.723')] [2022-07-09 05:29:32,416][26022] Updated weights on worker 0-0, policy_version 108119 (0.00085) [2022-07-09 05:29:34,359][26022] Updated weights on worker 0-0, policy_version 108129 (0.00085) [2022-07-09 05:29:35,999][26022] Updated weights on worker 0-0, policy_version 108139 (0.00088) [2022-07-09 05:29:36,972][25689] Fps is (10 sec: 5581.8, 60 sec: 5683.2, 300 sec: 5704.4). Total num frames: 110738432. Throughput: 0: 5951.4. Samples: 110746318. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:36,972][25689] Avg episode reward: [(0, '-51.167')] [2022-07-09 05:29:37,838][26022] Updated weights on worker 0-0, policy_version 108149 (0.00092) [2022-07-09 05:29:39,577][26022] Updated weights on worker 0-0, policy_version 108159 (0.00085) [2022-07-09 05:29:41,563][26022] Updated weights on worker 0-0, policy_version 108169 (0.00082) [2022-07-09 05:29:42,018][25689] Fps is (10 sec: 5664.8, 60 sec: 5697.1, 300 sec: 5704.8). Total num frames: 110767104. Throughput: 0: 5109.2. Samples: 110763596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:42,019][25689] Avg episode reward: [(0, '-51.521')] [2022-07-09 05:29:43,106][26022] Updated weights on worker 0-0, policy_version 108179 (0.00089) [2022-07-09 05:29:45,044][26022] Updated weights on worker 0-0, policy_version 108189 (0.00085) [2022-07-09 05:29:46,711][26022] Updated weights on worker 0-0, policy_version 108199 (0.00091) [2022-07-09 05:29:47,120][25689] Fps is (10 sec: 5852.6, 60 sec: 5695.4, 300 sec: 5709.9). Total num frames: 110797824. Throughput: 0: 5971.5. Samples: 110797896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:47,120][25689] Avg episode reward: [(0, '-52.455')] [2022-07-09 05:29:48,674][26022] Updated weights on worker 0-0, policy_version 108209 (0.00090) [2022-07-09 05:29:50,187][26022] Updated weights on worker 0-0, policy_version 108219 (0.00087) [2022-07-09 05:29:52,146][25689] Fps is (10 sec: 5763.5, 60 sec: 5693.7, 300 sec: 5703.0). Total num frames: 110825472. Throughput: 0: 5964.2. Samples: 110832416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:52,146][25689] Avg episode reward: [(0, '-52.756')] [2022-07-09 05:29:52,360][26022] Updated weights on worker 0-0, policy_version 108229 (0.00087) [2022-07-09 05:29:53,900][26022] Updated weights on worker 0-0, policy_version 108239 (0.00111) [2022-07-09 05:29:55,857][26022] Updated weights on worker 0-0, policy_version 108249 (0.00090) [2022-07-09 05:29:57,202][25689] Fps is (10 sec: 5789.5, 60 sec: 5707.4, 300 sec: 5712.4). Total num frames: 110856192. Throughput: 0: 5103.3. Samples: 110849556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:29:57,202][25689] Avg episode reward: [(0, '-51.936')] [2022-07-09 05:29:57,400][26022] Updated weights on worker 0-0, policy_version 108259 (0.00086) [2022-07-09 05:29:59,403][26022] Updated weights on worker 0-0, policy_version 108269 (0.00086) [2022-07-09 05:30:01,045][26022] Updated weights on worker 0-0, policy_version 108279 (0.00090) [2022-07-09 05:30:02,247][25689] Fps is (10 sec: 5575.7, 60 sec: 5653.3, 300 sec: 5709.1). Total num frames: 110881792. Throughput: 0: 5971.5. Samples: 110884390. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:02,247][25689] Avg episode reward: [(0, '-52.026')] [2022-07-09 05:30:03,213][26022] Updated weights on worker 0-0, policy_version 108289 (0.00087) [2022-07-09 05:30:04,981][26022] Updated weights on worker 0-0, policy_version 108299 (0.00086) [2022-07-09 05:30:06,612][26022] Updated weights on worker 0-0, policy_version 108309 (0.00081) [2022-07-09 05:30:07,295][25689] Fps is (10 sec: 5478.4, 60 sec: 5710.3, 300 sec: 5712.3). Total num frames: 110911488. Throughput: 0: 5899.8. Samples: 110916926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:07,296][25689] Avg episode reward: [(0, '-52.510')] [2022-07-09 05:30:08,624][26022] Updated weights on worker 0-0, policy_version 108319 (0.00082) [2022-07-09 05:30:10,201][26022] Updated weights on worker 0-0, policy_version 108329 (0.00091) [2022-07-09 05:30:12,267][26022] Updated weights on worker 0-0, policy_version 108339 (0.00093) [2022-07-09 05:30:12,326][25689] Fps is (10 sec: 5689.5, 60 sec: 5691.5, 300 sec: 5705.7). Total num frames: 110939136. Throughput: 0: 5046.1. Samples: 110934246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:12,326][25689] Avg episode reward: [(0, '-51.885')] [2022-07-09 05:30:13,950][26022] Updated weights on worker 0-0, policy_version 108349 (0.00094) [2022-07-09 05:30:15,783][26022] Updated weights on worker 0-0, policy_version 108359 (0.00081) [2022-07-09 05:30:17,347][25689] Fps is (10 sec: 5603.1, 60 sec: 5691.1, 300 sec: 5702.1). Total num frames: 110967808. Throughput: 0: 5899.8. Samples: 110968408. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:17,347][25689] Avg episode reward: [(0, '-51.375')] [2022-07-09 05:30:17,608][26022] Updated weights on worker 0-0, policy_version 108369 (0.00097) [2022-07-09 05:30:19,167][26022] Updated weights on worker 0-0, policy_version 108379 (0.00085) [2022-07-09 05:30:21,154][26022] Updated weights on worker 0-0, policy_version 108389 (0.00091) [2022-07-09 05:30:22,382][25689] Fps is (10 sec: 5804.2, 60 sec: 5708.1, 300 sec: 5706.4). Total num frames: 110997504. Throughput: 0: 5875.9. Samples: 111002702. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:22,382][25689] Avg episode reward: [(0, '-52.042')] [2022-07-09 05:30:22,926][26022] Updated weights on worker 0-0, policy_version 108399 (0.00096) [2022-07-09 05:30:24,734][26022] Updated weights on worker 0-0, policy_version 108409 (0.00087) [2022-07-09 05:30:26,439][26022] Updated weights on worker 0-0, policy_version 108419 (0.00080) [2022-07-09 05:30:27,423][25689] Fps is (10 sec: 5691.1, 60 sec: 5666.3, 300 sec: 5698.9). Total num frames: 111025152. Throughput: 0: 5115.5. Samples: 111019890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:27,423][25689] Avg episode reward: [(0, '-52.325')] [2022-07-09 05:30:28,269][26022] Updated weights on worker 0-0, policy_version 108429 (0.00089) [2022-07-09 05:30:28,897][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:30:28,906][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000108433_111035392.pth [2022-07-09 05:30:28,908][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000106426_108980224.pth [2022-07-09 05:30:30,011][26022] Updated weights on worker 0-0, policy_version 108439 (0.00083) [2022-07-09 05:30:31,906][26022] Updated weights on worker 0-0, policy_version 108449 (0.00093) [2022-07-09 05:30:32,424][25689] Fps is (10 sec: 5710.3, 60 sec: 5701.3, 300 sec: 5706.1). Total num frames: 111054848. Throughput: 0: 5967.4. Samples: 111054180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:32,425][25689] Avg episode reward: [(0, '-52.216')] [2022-07-09 05:30:33,569][26022] Updated weights on worker 0-0, policy_version 108459 (0.00083) [2022-07-09 05:30:35,445][26022] Updated weights on worker 0-0, policy_version 108469 (0.00091) [2022-07-09 05:30:37,155][26022] Updated weights on worker 0-0, policy_version 108479 (0.00086) [2022-07-09 05:30:37,431][25689] Fps is (10 sec: 5934.4, 60 sec: 5724.7, 300 sec: 5709.6). Total num frames: 111084544. Throughput: 0: 5998.5. Samples: 111088882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:37,431][25689] Avg episode reward: [(0, '-51.621')] [2022-07-09 05:30:39,049][26022] Updated weights on worker 0-0, policy_version 108489 (0.00081) [2022-07-09 05:30:40,715][26022] Updated weights on worker 0-0, policy_version 108499 (0.00087) [2022-07-09 05:30:42,441][25689] Fps is (10 sec: 5622.2, 60 sec: 5694.3, 300 sec: 5700.3). Total num frames: 111111168. Throughput: 0: 5154.2. Samples: 111106094. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:30:42,442][25689] Avg episode reward: [(0, '-51.854')] [2022-07-09 05:30:42,677][26022] Updated weights on worker 0-0, policy_version 108509 (0.00085) [2022-07-09 05:30:44,356][26022] Updated weights on worker 0-0, policy_version 108519 (0.00377) [2022-07-09 05:30:46,168][26022] Updated weights on worker 0-0, policy_version 108529 (0.00104) [2022-07-09 05:30:47,528][25689] Fps is (10 sec: 5780.8, 60 sec: 5712.6, 300 sec: 5712.7). Total num frames: 111142912. Throughput: 0: 6002.8. Samples: 111140576. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:30:47,528][25689] Avg episode reward: [(0, '-52.732')] [2022-07-09 05:30:47,725][26022] Updated weights on worker 0-0, policy_version 108539 (0.00086) [2022-07-09 05:30:49,648][26022] Updated weights on worker 0-0, policy_version 108549 (0.00084) [2022-07-09 05:30:51,579][26022] Updated weights on worker 0-0, policy_version 108559 (0.00094) [2022-07-09 05:30:52,567][25689] Fps is (10 sec: 5764.5, 60 sec: 5694.4, 300 sec: 5701.9). Total num frames: 111169536. Throughput: 0: 6003.6. Samples: 111175110. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:30:52,567][25689] Avg episode reward: [(0, '-52.181')] [2022-07-09 05:30:53,096][26022] Updated weights on worker 0-0, policy_version 108569 (0.00499) [2022-07-09 05:30:55,055][26022] Updated weights on worker 0-0, policy_version 108579 (0.00084) [2022-07-09 05:30:56,513][26022] Updated weights on worker 0-0, policy_version 108589 (0.00093) [2022-07-09 05:30:57,603][25689] Fps is (10 sec: 5589.8, 60 sec: 5679.3, 300 sec: 5705.1). Total num frames: 111199232. Throughput: 0: 5144.9. Samples: 111192668. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:30:57,604][25689] Avg episode reward: [(0, '-52.697')] [2022-07-09 05:30:58,631][26022] Updated weights on worker 0-0, policy_version 108599 (0.00085) [2022-07-09 05:31:00,381][26022] Updated weights on worker 0-0, policy_version 108609 (0.00081) [2022-07-09 05:31:02,326][26022] Updated weights on worker 0-0, policy_version 108619 (0.00091) [2022-07-09 05:31:02,610][25689] Fps is (10 sec: 5811.4, 60 sec: 5733.8, 300 sec: 5709.3). Total num frames: 111227904. Throughput: 0: 6020.5. Samples: 111227522. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:02,611][25689] Avg episode reward: [(0, '-52.382')] [2022-07-09 05:31:04,435][26022] Updated weights on worker 0-0, policy_version 108629 (0.00084) [2022-07-09 05:31:05,802][26022] Updated weights on worker 0-0, policy_version 108639 (0.00086) [2022-07-09 05:31:07,656][25689] Fps is (10 sec: 5704.4, 60 sec: 5717.1, 300 sec: 5706.9). Total num frames: 111256576. Throughput: 0: 5938.3. Samples: 111260102. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:07,656][25689] Avg episode reward: [(0, '-52.678')] [2022-07-09 05:31:07,663][26022] Updated weights on worker 0-0, policy_version 108649 (0.00100) [2022-07-09 05:31:09,509][26022] Updated weights on worker 0-0, policy_version 108659 (0.00476) [2022-07-09 05:31:11,336][26022] Updated weights on worker 0-0, policy_version 108669 (0.00082) [2022-07-09 05:31:12,704][25689] Fps is (10 sec: 5579.6, 60 sec: 5715.4, 300 sec: 5702.8). Total num frames: 111284224. Throughput: 0: 5085.3. Samples: 111277516. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:12,705][25689] Avg episode reward: [(0, '-53.270')] [2022-07-09 05:31:13,062][26022] Updated weights on worker 0-0, policy_version 108679 (0.00084) [2022-07-09 05:31:14,826][26022] Updated weights on worker 0-0, policy_version 108689 (0.00086) [2022-07-09 05:31:16,553][26022] Updated weights on worker 0-0, policy_version 108699 (0.00087) [2022-07-09 05:31:17,711][25689] Fps is (10 sec: 5702.9, 60 sec: 5733.7, 300 sec: 5706.6). Total num frames: 111313920. Throughput: 0: 5950.0. Samples: 111312310. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:17,713][25689] Avg episode reward: [(0, '-52.448')] [2022-07-09 05:31:18,387][26022] Updated weights on worker 0-0, policy_version 108709 (0.00090) [2022-07-09 05:31:19,987][26022] Updated weights on worker 0-0, policy_version 108719 (0.00086) [2022-07-09 05:31:21,885][26022] Updated weights on worker 0-0, policy_version 108729 (0.00093) [2022-07-09 05:31:22,719][25689] Fps is (10 sec: 5930.3, 60 sec: 5736.3, 300 sec: 5711.2). Total num frames: 111343616. Throughput: 0: 5946.8. Samples: 111347106. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:22,720][25689] Avg episode reward: [(0, '-51.946')] [2022-07-09 05:31:23,719][26022] Updated weights on worker 0-0, policy_version 108739 (0.00085) [2022-07-09 05:31:25,552][26022] Updated weights on worker 0-0, policy_version 108749 (0.00090) [2022-07-09 05:31:27,492][26022] Updated weights on worker 0-0, policy_version 108759 (0.00088) [2022-07-09 05:31:27,773][25689] Fps is (10 sec: 5698.9, 60 sec: 5735.0, 300 sec: 5707.0). Total num frames: 111371264. Throughput: 0: 6039.0. Samples: 111381592. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:27,774][25689] Avg episode reward: [(0, '-51.852')] [2022-07-09 05:31:29,046][26022] Updated weights on worker 0-0, policy_version 108769 (0.00092) [2022-07-09 05:31:30,919][26022] Updated weights on worker 0-0, policy_version 108779 (0.00090) [2022-07-09 05:31:32,741][26022] Updated weights on worker 0-0, policy_version 108789 (0.00081) [2022-07-09 05:31:32,831][25689] Fps is (10 sec: 5670.9, 60 sec: 5729.6, 300 sec: 5706.2). Total num frames: 111400960. Throughput: 0: 6020.4. Samples: 111398690. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:32,833][25689] Avg episode reward: [(0, '-51.523')] [2022-07-09 05:31:34,319][26022] Updated weights on worker 0-0, policy_version 108799 (0.00095) [2022-07-09 05:31:36,130][26022] Updated weights on worker 0-0, policy_version 108809 (0.00094) [2022-07-09 05:31:37,881][25689] Fps is (10 sec: 5774.9, 60 sec: 5708.7, 300 sec: 5709.7). Total num frames: 111429632. Throughput: 0: 6001.7. Samples: 111433362. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:37,881][25689] Avg episode reward: [(0, '-51.031')] [2022-07-09 05:31:37,989][26022] Updated weights on worker 0-0, policy_version 108819 (0.00086) [2022-07-09 05:31:39,739][26022] Updated weights on worker 0-0, policy_version 108829 (0.00089) [2022-07-09 05:31:41,597][26022] Updated weights on worker 0-0, policy_version 108839 (0.00092) [2022-07-09 05:31:42,887][25689] Fps is (10 sec: 5702.7, 60 sec: 5743.0, 300 sec: 5707.0). Total num frames: 111458304. Throughput: 0: 5998.7. Samples: 111468086. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:42,887][25689] Avg episode reward: [(0, '-51.037')] [2022-07-09 05:31:43,138][26022] Updated weights on worker 0-0, policy_version 108849 (0.00090) [2022-07-09 05:31:45,268][26022] Updated weights on worker 0-0, policy_version 108859 (0.00093) [2022-07-09 05:31:46,779][26022] Updated weights on worker 0-0, policy_version 108869 (0.00088) [2022-07-09 05:31:48,018][25689] Fps is (10 sec: 5757.8, 60 sec: 5704.9, 300 sec: 5708.3). Total num frames: 111488000. Throughput: 0: 5111.3. Samples: 111485066. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:48,018][25689] Avg episode reward: [(0, '-52.184')] [2022-07-09 05:31:48,700][26022] Updated weights on worker 0-0, policy_version 108879 (0.00085) [2022-07-09 05:31:50,311][26022] Updated weights on worker 0-0, policy_version 108889 (0.00085) [2022-07-09 05:31:52,169][26022] Updated weights on worker 0-0, policy_version 108899 (0.00087) [2022-07-09 05:31:53,094][25689] Fps is (10 sec: 5718.4, 60 sec: 5735.2, 300 sec: 5707.3). Total num frames: 111516672. Throughput: 0: 5987.6. Samples: 111520014. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:53,099][25689] Avg episode reward: [(0, '-51.690')] [2022-07-09 05:31:53,895][26022] Updated weights on worker 0-0, policy_version 108909 (0.00097) [2022-07-09 05:31:55,792][26022] Updated weights on worker 0-0, policy_version 108919 (0.00086) [2022-07-09 05:31:57,398][26022] Updated weights on worker 0-0, policy_version 108929 (0.00104) [2022-07-09 05:31:58,123][25689] Fps is (10 sec: 5775.8, 60 sec: 5735.9, 300 sec: 5710.3). Total num frames: 111546368. Throughput: 0: 5974.2. Samples: 111554296. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:31:58,124][25689] Avg episode reward: [(0, '-51.831')] [2022-07-09 05:31:59,329][26022] Updated weights on worker 0-0, policy_version 108939 (0.00093) [2022-07-09 05:32:01,151][26022] Updated weights on worker 0-0, policy_version 108949 (0.00062) [2022-07-09 05:32:03,129][25689] Fps is (10 sec: 5612.5, 60 sec: 5702.2, 300 sec: 5712.3). Total num frames: 111572992. Throughput: 0: 5101.3. Samples: 111571346. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:03,131][25689] Avg episode reward: [(0, '-51.983')] [2022-07-09 05:32:03,271][26022] Updated weights on worker 0-0, policy_version 108959 (0.00098) [2022-07-09 05:32:05,232][26022] Updated weights on worker 0-0, policy_version 108969 (0.00088) [2022-07-09 05:32:07,015][26022] Updated weights on worker 0-0, policy_version 108979 (0.00083) [2022-07-09 05:32:08,265][25689] Fps is (10 sec: 5351.2, 60 sec: 5676.7, 300 sec: 5706.7). Total num frames: 111600640. Throughput: 0: 5826.0. Samples: 111603030. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:08,266][25689] Avg episode reward: [(0, '-51.880')] [2022-07-09 05:32:08,697][26022] Updated weights on worker 0-0, policy_version 108989 (0.00083) [2022-07-09 05:32:10,674][26022] Updated weights on worker 0-0, policy_version 108999 (0.00100) [2022-07-09 05:32:12,176][26022] Updated weights on worker 0-0, policy_version 109009 (0.00084) [2022-07-09 05:32:13,341][25689] Fps is (10 sec: 5615.0, 60 sec: 5707.9, 300 sec: 5702.0). Total num frames: 111630336. Throughput: 0: 5803.4. Samples: 111637518. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:13,342][25689] Avg episode reward: [(0, '-52.438')] [2022-07-09 05:32:14,168][26022] Updated weights on worker 0-0, policy_version 109019 (0.00085) [2022-07-09 05:32:15,728][26022] Updated weights on worker 0-0, policy_version 109029 (0.00090) [2022-07-09 05:32:17,734][26022] Updated weights on worker 0-0, policy_version 109039 (0.00088) [2022-07-09 05:32:18,382][25689] Fps is (10 sec: 5769.5, 60 sec: 5687.9, 300 sec: 5708.7). Total num frames: 111659008. Throughput: 0: 4950.0. Samples: 111654576. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:18,383][25689] Avg episode reward: [(0, '-52.477')] [2022-07-09 05:32:19,535][26022] Updated weights on worker 0-0, policy_version 109049 (0.00095) [2022-07-09 05:32:21,122][26022] Updated weights on worker 0-0, policy_version 109059 (0.00083) [2022-07-09 05:32:22,899][26022] Updated weights on worker 0-0, policy_version 109069 (0.00089) [2022-07-09 05:32:23,471][25689] Fps is (10 sec: 5761.9, 60 sec: 5680.3, 300 sec: 5707.7). Total num frames: 111688704. Throughput: 0: 5803.4. Samples: 111689402. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:23,472][25689] Avg episode reward: [(0, '-53.305')] [2022-07-09 05:32:24,848][26022] Updated weights on worker 0-0, policy_version 109079 (0.00048) [2022-07-09 05:32:26,516][26022] Updated weights on worker 0-0, policy_version 109089 (0.00091) [2022-07-09 05:32:28,458][26022] Updated weights on worker 0-0, policy_version 109099 (0.00085) [2022-07-09 05:32:28,593][25689] Fps is (10 sec: 5716.5, 60 sec: 5690.8, 300 sec: 5703.0). Total num frames: 111717376. Throughput: 0: 5958.1. Samples: 111724140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:28,593][25689] Avg episode reward: [(0, '-53.328')] [2022-07-09 05:32:28,990][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:32:29,001][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000109102_111720448.pth [2022-07-09 05:32:29,001][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000107096_109666304.pth [2022-07-09 05:32:30,163][26022] Updated weights on worker 0-0, policy_version 109109 (0.00092) [2022-07-09 05:32:31,863][26022] Updated weights on worker 0-0, policy_version 109119 (0.00091) [2022-07-09 05:32:33,594][25689] Fps is (10 sec: 5867.0, 60 sec: 5713.0, 300 sec: 5710.3). Total num frames: 111748096. Throughput: 0: 5134.3. Samples: 111741502. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:33,595][25689] Avg episode reward: [(0, '-54.212')] [2022-07-09 05:32:33,595][26022] Updated weights on worker 0-0, policy_version 109129 (0.00080) [2022-07-09 05:32:35,543][26022] Updated weights on worker 0-0, policy_version 109139 (0.00085) [2022-07-09 05:32:37,174][26022] Updated weights on worker 0-0, policy_version 109149 (0.00084) [2022-07-09 05:32:38,597][25689] Fps is (10 sec: 5834.4, 60 sec: 5700.5, 300 sec: 5703.5). Total num frames: 111775744. Throughput: 0: 6027.1. Samples: 111776410. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:38,598][25689] Avg episode reward: [(0, '-53.235')] [2022-07-09 05:32:39,115][26022] Updated weights on worker 0-0, policy_version 109159 (0.00086) [2022-07-09 05:32:40,732][26022] Updated weights on worker 0-0, policy_version 109169 (0.00093) [2022-07-09 05:32:42,532][26022] Updated weights on worker 0-0, policy_version 109179 (0.00093) [2022-07-09 05:32:43,622][25689] Fps is (10 sec: 5718.8, 60 sec: 5715.6, 300 sec: 5712.1). Total num frames: 111805440. Throughput: 0: 6056.4. Samples: 111811440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 05:32:43,622][25689] Avg episode reward: [(0, '-52.889')] [2022-07-09 05:32:44,087][26022] Updated weights on worker 0-0, policy_version 109189 (0.00078) [2022-07-09 05:32:46,079][26022] Updated weights on worker 0-0, policy_version 109199 (0.00096) [2022-07-09 05:32:47,872][26022] Updated weights on worker 0-0, policy_version 109209 (0.00092) [2022-07-09 05:32:48,707][25689] Fps is (10 sec: 5773.1, 60 sec: 5703.0, 300 sec: 5704.1). Total num frames: 111834112. Throughput: 0: 5180.8. Samples: 111828348. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:32:48,709][25689] Avg episode reward: [(0, '-52.656')] [2022-07-09 05:32:49,631][26022] Updated weights on worker 0-0, policy_version 109219 (0.00088) [2022-07-09 05:32:51,298][26022] Updated weights on worker 0-0, policy_version 109229 (0.00085) [2022-07-09 05:32:53,100][26022] Updated weights on worker 0-0, policy_version 109239 (0.00086) [2022-07-09 05:32:53,712][25689] Fps is (10 sec: 5784.4, 60 sec: 5726.6, 300 sec: 5708.0). Total num frames: 111863808. Throughput: 0: 6042.8. Samples: 111863070. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:32:53,714][25689] Avg episode reward: [(0, '-52.192')] [2022-07-09 05:32:54,891][26022] Updated weights on worker 0-0, policy_version 109249 (0.00088) [2022-07-09 05:32:56,628][26022] Updated weights on worker 0-0, policy_version 109259 (0.00087) [2022-07-09 05:32:58,569][26022] Updated weights on worker 0-0, policy_version 109269 (0.00092) [2022-07-09 05:32:58,722][25689] Fps is (10 sec: 5725.8, 60 sec: 5694.7, 300 sec: 5701.6). Total num frames: 111891456. Throughput: 0: 6032.2. Samples: 111897810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:32:58,723][25689] Avg episode reward: [(0, '-52.443')] [2022-07-09 05:33:00,095][26022] Updated weights on worker 0-0, policy_version 109279 (0.00077) [2022-07-09 05:33:02,364][26022] Updated weights on worker 0-0, policy_version 109289 (0.00086) [2022-07-09 05:33:03,759][25689] Fps is (10 sec: 5605.8, 60 sec: 5725.5, 300 sec: 5712.1). Total num frames: 111920128. Throughput: 0: 5162.1. Samples: 111915390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:03,761][25689] Avg episode reward: [(0, '-51.598')] [2022-07-09 05:33:04,018][26022] Updated weights on worker 0-0, policy_version 109299 (0.00100) [2022-07-09 05:33:05,899][26022] Updated weights on worker 0-0, policy_version 109309 (0.00087) [2022-07-09 05:33:07,652][26022] Updated weights on worker 0-0, policy_version 109319 (0.00090) [2022-07-09 05:33:08,828][25689] Fps is (10 sec: 5775.9, 60 sec: 5765.7, 300 sec: 5714.5). Total num frames: 111949824. Throughput: 0: 5962.0. Samples: 111948306. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:08,828][25689] Avg episode reward: [(0, '-53.101')] [2022-07-09 05:33:09,399][26022] Updated weights on worker 0-0, policy_version 109329 (0.00086) [2022-07-09 05:33:11,192][26022] Updated weights on worker 0-0, policy_version 109339 (0.00087) [2022-07-09 05:33:12,816][26022] Updated weights on worker 0-0, policy_version 109349 (0.00079) [2022-07-09 05:33:13,871][25689] Fps is (10 sec: 5772.3, 60 sec: 5751.9, 300 sec: 5707.3). Total num frames: 111978496. Throughput: 0: 5960.1. Samples: 111983216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:13,872][25689] Avg episode reward: [(0, '-54.010')] [2022-07-09 05:33:14,684][26022] Updated weights on worker 0-0, policy_version 109359 (0.00090) [2022-07-09 05:33:16,558][26022] Updated weights on worker 0-0, policy_version 109369 (0.00086) [2022-07-09 05:33:18,258][26022] Updated weights on worker 0-0, policy_version 109379 (0.00087) [2022-07-09 05:33:18,875][25689] Fps is (10 sec: 5707.5, 60 sec: 5755.4, 300 sec: 5712.2). Total num frames: 112007168. Throughput: 0: 5090.1. Samples: 112000390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:18,875][25689] Avg episode reward: [(0, '-53.776')] [2022-07-09 05:33:20,103][26022] Updated weights on worker 0-0, policy_version 109389 (0.00089) [2022-07-09 05:33:21,719][26022] Updated weights on worker 0-0, policy_version 109399 (0.00086) [2022-07-09 05:33:23,655][26022] Updated weights on worker 0-0, policy_version 109409 (0.00088) [2022-07-09 05:33:23,903][25689] Fps is (10 sec: 5818.0, 60 sec: 5761.2, 300 sec: 5713.2). Total num frames: 112036864. Throughput: 0: 5945.0. Samples: 112035146. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:23,905][25689] Avg episode reward: [(0, '-53.880')] [2022-07-09 05:33:25,331][26022] Updated weights on worker 0-0, policy_version 109419 (0.00053) [2022-07-09 05:33:27,103][26022] Updated weights on worker 0-0, policy_version 109429 (0.00083) [2022-07-09 05:33:28,979][25689] Fps is (10 sec: 5675.3, 60 sec: 5748.6, 300 sec: 5712.5). Total num frames: 112064512. Throughput: 0: 6011.1. Samples: 112069438. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:28,979][25689] Avg episode reward: [(0, '-53.778')] [2022-07-09 05:33:29,044][26022] Updated weights on worker 0-0, policy_version 109439 (0.00087) [2022-07-09 05:33:30,891][26022] Updated weights on worker 0-0, policy_version 109449 (0.00094) [2022-07-09 05:33:32,401][26022] Updated weights on worker 0-0, policy_version 109459 (0.00090) [2022-07-09 05:33:33,985][25689] Fps is (10 sec: 5586.3, 60 sec: 5714.3, 300 sec: 5709.3). Total num frames: 112093184. Throughput: 0: 5145.3. Samples: 112086708. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:33,985][25689] Avg episode reward: [(0, '-52.655')] [2022-07-09 05:33:34,352][26022] Updated weights on worker 0-0, policy_version 109469 (0.00088) [2022-07-09 05:33:35,929][26022] Updated weights on worker 0-0, policy_version 109479 (0.00085) [2022-07-09 05:33:37,860][26022] Updated weights on worker 0-0, policy_version 109489 (0.00090) [2022-07-09 05:33:38,986][25689] Fps is (10 sec: 5730.0, 60 sec: 5731.3, 300 sec: 5713.0). Total num frames: 112121856. Throughput: 0: 6015.9. Samples: 112121380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:38,987][25689] Avg episode reward: [(0, '-52.347')] [2022-07-09 05:33:39,583][26022] Updated weights on worker 0-0, policy_version 109499 (0.00085) [2022-07-09 05:33:41,423][26022] Updated weights on worker 0-0, policy_version 109509 (0.00088) [2022-07-09 05:33:43,003][26022] Updated weights on worker 0-0, policy_version 109519 (0.00085) [2022-07-09 05:33:44,017][25689] Fps is (10 sec: 5817.8, 60 sec: 5730.7, 300 sec: 5710.5). Total num frames: 112151552. Throughput: 0: 6013.3. Samples: 112156100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:44,018][25689] Avg episode reward: [(0, '-51.886')] [2022-07-09 05:33:44,973][26022] Updated weights on worker 0-0, policy_version 109529 (0.00086) [2022-07-09 05:33:46,712][26022] Updated weights on worker 0-0, policy_version 109539 (0.00091) [2022-07-09 05:33:48,379][26022] Updated weights on worker 0-0, policy_version 109549 (0.00084) [2022-07-09 05:33:49,109][25689] Fps is (10 sec: 5968.4, 60 sec: 5764.0, 300 sec: 5719.3). Total num frames: 112182272. Throughput: 0: 6016.7. Samples: 112190556. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:49,110][25689] Avg episode reward: [(0, '-51.852')] [2022-07-09 05:33:50,269][26022] Updated weights on worker 0-0, policy_version 109559 (0.00085) [2022-07-09 05:33:51,915][26022] Updated weights on worker 0-0, policy_version 109569 (0.00095) [2022-07-09 05:33:53,863][26022] Updated weights on worker 0-0, policy_version 109579 (0.00091) [2022-07-09 05:33:54,120][25689] Fps is (10 sec: 5676.3, 60 sec: 5712.7, 300 sec: 5709.1). Total num frames: 112208896. Throughput: 0: 6007.2. Samples: 112207664. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:54,120][25689] Avg episode reward: [(0, '-52.420')] [2022-07-09 05:33:55,622][26022] Updated weights on worker 0-0, policy_version 109589 (0.00091) [2022-07-09 05:33:57,452][26022] Updated weights on worker 0-0, policy_version 109599 (0.00611) [2022-07-09 05:33:59,152][25689] Fps is (10 sec: 5709.7, 60 sec: 5761.3, 300 sec: 5715.6). Total num frames: 112239616. Throughput: 0: 6000.1. Samples: 112242378. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:33:59,153][25689] Avg episode reward: [(0, '-52.763')] [2022-07-09 05:33:59,167][26022] Updated weights on worker 0-0, policy_version 109609 (0.00095) [2022-07-09 05:34:00,900][26022] Updated weights on worker 0-0, policy_version 109619 (0.00083) [2022-07-09 05:34:02,972][26022] Updated weights on worker 0-0, policy_version 109629 (0.00093) [2022-07-09 05:34:04,201][25689] Fps is (10 sec: 5586.6, 60 sec: 5709.4, 300 sec: 5713.4). Total num frames: 112265216. Throughput: 0: 5877.6. Samples: 112274732. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:04,202][25689] Avg episode reward: [(0, '-52.092')] [2022-07-09 05:34:05,060][26022] Updated weights on worker 0-0, policy_version 109639 (0.00092) [2022-07-09 05:34:06,692][26022] Updated weights on worker 0-0, policy_version 109649 (0.00085) [2022-07-09 05:34:08,523][26022] Updated weights on worker 0-0, policy_version 109659 (0.00088) [2022-07-09 05:34:09,266][25689] Fps is (10 sec: 5366.1, 60 sec: 5692.8, 300 sec: 5712.4). Total num frames: 112293888. Throughput: 0: 5020.8. Samples: 112291762. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:09,267][25689] Avg episode reward: [(0, '-52.060')] [2022-07-09 05:34:10,307][26022] Updated weights on worker 0-0, policy_version 109669 (0.00084) [2022-07-09 05:34:12,019][26022] Updated weights on worker 0-0, policy_version 109679 (0.00091) [2022-07-09 05:34:13,716][26022] Updated weights on worker 0-0, policy_version 109689 (0.00091) [2022-07-09 05:34:14,280][25689] Fps is (10 sec: 5791.2, 60 sec: 5712.5, 300 sec: 5715.8). Total num frames: 112323584. Throughput: 0: 5902.9. Samples: 112326666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:14,280][25689] Avg episode reward: [(0, '-51.422')] [2022-07-09 05:34:15,703][26022] Updated weights on worker 0-0, policy_version 109699 (0.00084) [2022-07-09 05:34:17,359][26022] Updated weights on worker 0-0, policy_version 109709 (0.00095) [2022-07-09 05:34:19,288][26022] Updated weights on worker 0-0, policy_version 109719 (0.00089) [2022-07-09 05:34:19,299][25689] Fps is (10 sec: 5817.9, 60 sec: 5711.1, 300 sec: 5716.2). Total num frames: 112352256. Throughput: 0: 5893.3. Samples: 112361106. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:19,299][25689] Avg episode reward: [(0, '-52.190')] [2022-07-09 05:34:20,939][26022] Updated weights on worker 0-0, policy_version 109729 (0.00085) [2022-07-09 05:34:22,699][26022] Updated weights on worker 0-0, policy_version 109739 (0.00080) [2022-07-09 05:34:24,320][25689] Fps is (10 sec: 5813.3, 60 sec: 5711.7, 300 sec: 5714.9). Total num frames: 112381952. Throughput: 0: 5162.6. Samples: 112378600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:24,322][25689] Avg episode reward: [(0, '-51.578')] [2022-07-09 05:34:24,351][26022] Updated weights on worker 0-0, policy_version 109749 (0.00082) [2022-07-09 05:34:26,411][26022] Updated weights on worker 0-0, policy_version 109759 (0.00098) [2022-07-09 05:34:27,896][26022] Updated weights on worker 0-0, policy_version 109769 (0.00091) [2022-07-09 05:34:29,136][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:34:29,144][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000109775_112409600.pth [2022-07-09 05:34:29,145][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000107764_110350336.pth [2022-07-09 05:34:29,375][25689] Fps is (10 sec: 5792.5, 60 sec: 5730.7, 300 sec: 5717.6). Total num frames: 112410624. Throughput: 0: 6036.6. Samples: 112413152. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:29,376][25689] Avg episode reward: [(0, '-51.382')] [2022-07-09 05:34:29,988][26022] Updated weights on worker 0-0, policy_version 109779 (0.00089) [2022-07-09 05:34:31,584][26022] Updated weights on worker 0-0, policy_version 109789 (0.00082) [2022-07-09 05:34:33,346][26022] Updated weights on worker 0-0, policy_version 109799 (0.00085) [2022-07-09 05:34:34,434][25689] Fps is (10 sec: 5771.4, 60 sec: 5742.6, 300 sec: 5721.4). Total num frames: 112440320. Throughput: 0: 6011.1. Samples: 112447812. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:34,436][25689] Avg episode reward: [(0, '-51.295')] [2022-07-09 05:34:35,231][26022] Updated weights on worker 0-0, policy_version 109809 (0.00090) [2022-07-09 05:34:36,912][26022] Updated weights on worker 0-0, policy_version 109819 (0.00092) [2022-07-09 05:34:38,691][26022] Updated weights on worker 0-0, policy_version 109829 (0.00086) [2022-07-09 05:34:39,454][25689] Fps is (10 sec: 5892.5, 60 sec: 5757.8, 300 sec: 5725.3). Total num frames: 112470016. Throughput: 0: 5161.7. Samples: 112465144. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:39,455][25689] Avg episode reward: [(0, '-51.487')] [2022-07-09 05:34:40,620][26022] Updated weights on worker 0-0, policy_version 109839 (0.00077) [2022-07-09 05:34:42,162][26022] Updated weights on worker 0-0, policy_version 109849 (0.00092) [2022-07-09 05:34:44,168][26022] Updated weights on worker 0-0, policy_version 109859 (0.00086) [2022-07-09 05:34:44,520][25689] Fps is (10 sec: 5786.7, 60 sec: 5737.5, 300 sec: 5719.1). Total num frames: 112498688. Throughput: 0: 5993.2. Samples: 112499662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 05:34:44,521][25689] Avg episode reward: [(0, '-51.330')] [2022-07-09 05:34:45,687][26022] Updated weights on worker 0-0, policy_version 109869 (0.00086) [2022-07-09 05:34:47,607][26022] Updated weights on worker 0-0, policy_version 109879 (0.00092) [2022-07-09 05:34:49,577][26022] Updated weights on worker 0-0, policy_version 109889 (0.00090) [2022-07-09 05:34:49,589][25689] Fps is (10 sec: 5557.3, 60 sec: 5688.9, 300 sec: 5718.3). Total num frames: 112526336. Throughput: 0: 5981.5. Samples: 112534060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:34:49,589][25689] Avg episode reward: [(0, '-51.974')] [2022-07-09 05:34:51,248][26022] Updated weights on worker 0-0, policy_version 109899 (0.00087) [2022-07-09 05:34:52,984][26022] Updated weights on worker 0-0, policy_version 109909 (0.00087) [2022-07-09 05:34:54,627][25689] Fps is (10 sec: 5674.0, 60 sec: 5737.1, 300 sec: 5715.2). Total num frames: 112556032. Throughput: 0: 5130.9. Samples: 112551420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:34:54,627][25689] Avg episode reward: [(0, '-52.549')] [2022-07-09 05:34:54,767][26022] Updated weights on worker 0-0, policy_version 109919 (0.00090) [2022-07-09 05:34:56,371][26022] Updated weights on worker 0-0, policy_version 109929 (0.00087) [2022-07-09 05:34:58,252][26022] Updated weights on worker 0-0, policy_version 109939 (0.00087) [2022-07-09 05:34:59,663][25689] Fps is (10 sec: 5895.2, 60 sec: 5719.8, 300 sec: 5729.1). Total num frames: 112585728. Throughput: 0: 5997.4. Samples: 112586346. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:34:59,664][25689] Avg episode reward: [(0, '-52.975')] [2022-07-09 05:35:00,096][26022] Updated weights on worker 0-0, policy_version 109949 (0.00095) [2022-07-09 05:35:01,888][26022] Updated weights on worker 0-0, policy_version 109959 (0.00103) [2022-07-09 05:35:04,153][26022] Updated weights on worker 0-0, policy_version 109969 (0.00085) [2022-07-09 05:35:04,713][25689] Fps is (10 sec: 5482.0, 60 sec: 5719.7, 300 sec: 5715.3). Total num frames: 112611328. Throughput: 0: 5895.0. Samples: 112618702. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:04,714][25689] Avg episode reward: [(0, '-53.275')] [2022-07-09 05:35:05,623][26022] Updated weights on worker 0-0, policy_version 109979 (0.00092) [2022-07-09 05:35:07,680][26022] Updated weights on worker 0-0, policy_version 109989 (0.00082) [2022-07-09 05:35:09,112][26022] Updated weights on worker 0-0, policy_version 109999 (0.00095) [2022-07-09 05:35:09,783][25689] Fps is (10 sec: 5565.7, 60 sec: 5753.2, 300 sec: 5724.9). Total num frames: 112642048. Throughput: 0: 5046.2. Samples: 112635962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:09,783][25689] Avg episode reward: [(0, '-53.260')] [2022-07-09 05:35:11,189][26022] Updated weights on worker 0-0, policy_version 110009 (0.00091) [2022-07-09 05:35:12,704][26022] Updated weights on worker 0-0, policy_version 110019 (0.00092) [2022-07-09 05:35:14,760][26022] Updated weights on worker 0-0, policy_version 110029 (0.00090) [2022-07-09 05:35:14,801][25689] Fps is (10 sec: 5786.0, 60 sec: 5718.8, 300 sec: 5721.5). Total num frames: 112669696. Throughput: 0: 5903.3. Samples: 112670516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:14,802][25689] Avg episode reward: [(0, '-52.547')] [2022-07-09 05:35:16,374][26022] Updated weights on worker 0-0, policy_version 110039 (0.00087) [2022-07-09 05:35:18,512][26022] Updated weights on worker 0-0, policy_version 110049 (0.00089) [2022-07-09 05:35:19,809][25689] Fps is (10 sec: 5719.4, 60 sec: 5736.8, 300 sec: 5722.0). Total num frames: 112699392. Throughput: 0: 5890.9. Samples: 112705020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:19,809][25689] Avg episode reward: [(0, '-52.137')] [2022-07-09 05:35:19,823][26022] Updated weights on worker 0-0, policy_version 110059 (0.00096) [2022-07-09 05:35:21,954][26022] Updated weights on worker 0-0, policy_version 110069 (0.00088) [2022-07-09 05:35:23,491][26022] Updated weights on worker 0-0, policy_version 110079 (0.00087) [2022-07-09 05:35:24,896][25689] Fps is (10 sec: 5680.6, 60 sec: 5696.8, 300 sec: 5721.1). Total num frames: 112727040. Throughput: 0: 5128.9. Samples: 112722216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:24,896][25689] Avg episode reward: [(0, '-51.576')] [2022-07-09 05:35:25,274][26022] Updated weights on worker 0-0, policy_version 110089 (0.00087) [2022-07-09 05:35:27,111][26022] Updated weights on worker 0-0, policy_version 110099 (0.00088) [2022-07-09 05:35:28,965][26022] Updated weights on worker 0-0, policy_version 110109 (0.00084) [2022-07-09 05:35:29,968][25689] Fps is (10 sec: 5543.8, 60 sec: 5695.2, 300 sec: 5716.3). Total num frames: 112755712. Throughput: 0: 5986.0. Samples: 112756792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:29,968][25689] Avg episode reward: [(0, '-51.212')] [2022-07-09 05:35:30,606][26022] Updated weights on worker 0-0, policy_version 110119 (0.00092) [2022-07-09 05:35:32,739][26022] Updated weights on worker 0-0, policy_version 110129 (0.00090) [2022-07-09 05:35:34,187][26022] Updated weights on worker 0-0, policy_version 110139 (0.00088) [2022-07-09 05:35:34,983][25689] Fps is (10 sec: 5888.1, 60 sec: 5716.2, 300 sec: 5719.6). Total num frames: 112786432. Throughput: 0: 5987.7. Samples: 112791356. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:34,983][25689] Avg episode reward: [(0, '-51.981')] [2022-07-09 05:35:36,220][26022] Updated weights on worker 0-0, policy_version 110149 (0.00082) [2022-07-09 05:35:37,626][26022] Updated weights on worker 0-0, policy_version 110159 (0.00089) [2022-07-09 05:35:39,633][26022] Updated weights on worker 0-0, policy_version 110169 (0.00087) [2022-07-09 05:35:39,999][25689] Fps is (10 sec: 5920.8, 60 sec: 5699.7, 300 sec: 5726.4). Total num frames: 112815104. Throughput: 0: 5125.0. Samples: 112808494. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:40,000][25689] Avg episode reward: [(0, '-52.000')] [2022-07-09 05:35:41,708][26022] Updated weights on worker 0-0, policy_version 110179 (0.00088) [2022-07-09 05:35:42,975][26022] Updated weights on worker 0-0, policy_version 110189 (0.00090) [2022-07-09 05:35:45,002][25689] Fps is (10 sec: 5621.2, 60 sec: 5688.7, 300 sec: 5714.2). Total num frames: 112842752. Throughput: 0: 6016.6. Samples: 112843186. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:45,002][25689] Avg episode reward: [(0, '-52.763')] [2022-07-09 05:35:45,155][26022] Updated weights on worker 0-0, policy_version 110199 (0.00086) [2022-07-09 05:35:46,438][26022] Updated weights on worker 0-0, policy_version 110209 (0.00081) [2022-07-09 05:35:48,561][26022] Updated weights on worker 0-0, policy_version 110219 (0.00079) [2022-07-09 05:35:50,127][25689] Fps is (10 sec: 5763.2, 60 sec: 5734.2, 300 sec: 5726.4). Total num frames: 112873472. Throughput: 0: 5995.2. Samples: 112877648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:50,127][25689] Avg episode reward: [(0, '-52.785')] [2022-07-09 05:35:50,143][26022] Updated weights on worker 0-0, policy_version 110229 (0.00092) [2022-07-09 05:35:51,839][26022] Updated weights on worker 0-0, policy_version 110239 (0.00086) [2022-07-09 05:35:54,012][26022] Updated weights on worker 0-0, policy_version 110249 (0.00083) [2022-07-09 05:35:55,155][25689] Fps is (10 sec: 5849.5, 60 sec: 5718.2, 300 sec: 5723.1). Total num frames: 112902144. Throughput: 0: 5131.9. Samples: 112894880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:35:55,157][25689] Avg episode reward: [(0, '-52.795')] [2022-07-09 05:35:55,498][26022] Updated weights on worker 0-0, policy_version 110259 (0.00086) [2022-07-09 05:35:57,281][26022] Updated weights on worker 0-0, policy_version 110269 (0.00085) [2022-07-09 05:35:59,004][26022] Updated weights on worker 0-0, policy_version 110279 (0.00094) [2022-07-09 05:36:00,200][25689] Fps is (10 sec: 5794.3, 60 sec: 5717.4, 300 sec: 5725.8). Total num frames: 112931840. Throughput: 0: 6016.4. Samples: 112930032. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:00,200][25689] Avg episode reward: [(0, '-53.190')] [2022-07-09 05:36:00,978][26022] Updated weights on worker 0-0, policy_version 110289 (0.00092) [2022-07-09 05:36:03,058][26022] Updated weights on worker 0-0, policy_version 110299 (0.00088) [2022-07-09 05:36:05,056][26022] Updated weights on worker 0-0, policy_version 110309 (0.00088) [2022-07-09 05:36:05,259][25689] Fps is (10 sec: 5472.6, 60 sec: 5716.5, 300 sec: 5715.2). Total num frames: 112957440. Throughput: 0: 5893.0. Samples: 112962562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:05,260][25689] Avg episode reward: [(0, '-52.565')] [2022-07-09 05:36:06,624][26022] Updated weights on worker 0-0, policy_version 110319 (0.00086) [2022-07-09 05:36:08,667][26022] Updated weights on worker 0-0, policy_version 110329 (0.00094) [2022-07-09 05:36:10,138][26022] Updated weights on worker 0-0, policy_version 110339 (0.00092) [2022-07-09 05:36:10,312][25689] Fps is (10 sec: 5569.8, 60 sec: 5718.1, 300 sec: 5725.5). Total num frames: 112988160. Throughput: 0: 5915.5. Samples: 112997052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:10,312][25689] Avg episode reward: [(0, '-51.906')] [2022-07-09 05:36:12,132][26022] Updated weights on worker 0-0, policy_version 110349 (0.00087) [2022-07-09 05:36:13,680][26022] Updated weights on worker 0-0, policy_version 110359 (0.00087) [2022-07-09 05:36:15,339][25689] Fps is (10 sec: 5790.8, 60 sec: 5717.3, 300 sec: 5718.2). Total num frames: 113015808. Throughput: 0: 5916.7. Samples: 113014300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:15,339][25689] Avg episode reward: [(0, '-51.433')] [2022-07-09 05:36:15,599][26022] Updated weights on worker 0-0, policy_version 110369 (0.00092) [2022-07-09 05:36:17,202][26022] Updated weights on worker 0-0, policy_version 110379 (0.00089) [2022-07-09 05:36:19,031][26022] Updated weights on worker 0-0, policy_version 110389 (0.00097) [2022-07-09 05:36:20,347][25689] Fps is (10 sec: 5714.3, 60 sec: 5717.3, 300 sec: 5718.2). Total num frames: 113045504. Throughput: 0: 5897.3. Samples: 113048844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:20,347][25689] Avg episode reward: [(0, '-50.523')] [2022-07-09 05:36:20,762][26022] Updated weights on worker 0-0, policy_version 110399 (0.00091) [2022-07-09 05:36:22,675][26022] Updated weights on worker 0-0, policy_version 110409 (0.00087) [2022-07-09 05:36:24,291][26022] Updated weights on worker 0-0, policy_version 110419 (0.00090) [2022-07-09 05:36:25,366][25689] Fps is (10 sec: 5718.6, 60 sec: 5723.7, 300 sec: 5718.8). Total num frames: 113073152. Throughput: 0: 6019.5. Samples: 113083596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:25,367][25689] Avg episode reward: [(0, '-50.006')] [2022-07-09 05:36:26,286][26022] Updated weights on worker 0-0, policy_version 110429 (0.00618) [2022-07-09 05:36:27,808][26022] Updated weights on worker 0-0, policy_version 110439 (0.00085) [2022-07-09 05:36:29,321][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:36:29,332][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000110446_113096704.pth [2022-07-09 05:36:29,334][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000108433_111035392.pth [2022-07-09 05:36:29,821][26022] Updated weights on worker 0-0, policy_version 110449 (0.00093) [2022-07-09 05:36:30,488][25689] Fps is (10 sec: 5553.6, 60 sec: 5719.0, 300 sec: 5714.2). Total num frames: 113101824. Throughput: 0: 5136.8. Samples: 113100694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:30,488][25689] Avg episode reward: [(0, '-50.321')] [2022-07-09 05:36:31,646][26022] Updated weights on worker 0-0, policy_version 110459 (0.00092) [2022-07-09 05:36:33,331][26022] Updated weights on worker 0-0, policy_version 110469 (0.00097) [2022-07-09 05:36:35,245][26022] Updated weights on worker 0-0, policy_version 110479 (0.00081) [2022-07-09 05:36:35,518][25689] Fps is (10 sec: 5951.2, 60 sec: 5734.4, 300 sec: 5724.9). Total num frames: 113133568. Throughput: 0: 5985.5. Samples: 113135086. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:35,519][25689] Avg episode reward: [(0, '-50.351')] [2022-07-09 05:36:37,209][26022] Updated weights on worker 0-0, policy_version 110489 (0.00164) [2022-07-09 05:36:38,708][26022] Updated weights on worker 0-0, policy_version 110499 (0.00087) [2022-07-09 05:36:40,601][25689] Fps is (10 sec: 5771.7, 60 sec: 5694.4, 300 sec: 5716.5). Total num frames: 113160192. Throughput: 0: 5959.9. Samples: 113169556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:40,601][25689] Avg episode reward: [(0, '-49.976')] [2022-07-09 05:36:40,822][26022] Updated weights on worker 0-0, policy_version 110509 (0.00094) [2022-07-09 05:36:42,077][26022] Updated weights on worker 0-0, policy_version 110519 (0.00078) [2022-07-09 05:36:44,147][26022] Updated weights on worker 0-0, policy_version 110529 (0.00093) [2022-07-09 05:36:45,607][25689] Fps is (10 sec: 5582.6, 60 sec: 5727.9, 300 sec: 5718.9). Total num frames: 113189888. Throughput: 0: 5108.9. Samples: 113187002. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:45,607][25689] Avg episode reward: [(0, '-50.521')] [2022-07-09 05:36:45,667][26022] Updated weights on worker 0-0, policy_version 110539 (0.00084) [2022-07-09 05:36:47,558][26022] Updated weights on worker 0-0, policy_version 110549 (0.00085) [2022-07-09 05:36:49,432][26022] Updated weights on worker 0-0, policy_version 110559 (0.00088) [2022-07-09 05:36:50,723][25689] Fps is (10 sec: 5867.2, 60 sec: 5711.7, 300 sec: 5721.6). Total num frames: 113219584. Throughput: 0: 5971.1. Samples: 113221522. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 05:36:50,724][25689] Avg episode reward: [(0, '-51.907')] [2022-07-09 05:36:51,146][26022] Updated weights on worker 0-0, policy_version 110569 (0.00084) [2022-07-09 05:36:52,905][26022] Updated weights on worker 0-0, policy_version 110579 (0.00091) [2022-07-09 05:36:54,784][26022] Updated weights on worker 0-0, policy_version 110589 (0.00093) [2022-07-09 05:36:55,729][25689] Fps is (10 sec: 5766.2, 60 sec: 5713.9, 300 sec: 5718.6). Total num frames: 113248256. Throughput: 0: 5997.7. Samples: 113256306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:36:55,730][25689] Avg episode reward: [(0, '-51.794')] [2022-07-09 05:36:56,435][26022] Updated weights on worker 0-0, policy_version 110599 (0.00083) [2022-07-09 05:36:58,291][26022] Updated weights on worker 0-0, policy_version 110609 (0.00087) [2022-07-09 05:36:59,894][26022] Updated weights on worker 0-0, policy_version 110619 (0.00095) [2022-07-09 05:37:00,773][25689] Fps is (10 sec: 5705.9, 60 sec: 5697.0, 300 sec: 5724.7). Total num frames: 113276928. Throughput: 0: 5155.2. Samples: 113273550. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:00,774][25689] Avg episode reward: [(0, '-52.060')] [2022-07-09 05:37:02,134][26022] Updated weights on worker 0-0, policy_version 110629 (0.00092) [2022-07-09 05:37:03,914][26022] Updated weights on worker 0-0, policy_version 110639 (0.00106) [2022-07-09 05:37:05,683][26022] Updated weights on worker 0-0, policy_version 110649 (0.00088) [2022-07-09 05:37:05,782][25689] Fps is (10 sec: 5602.7, 60 sec: 5735.7, 300 sec: 5727.2). Total num frames: 113304576. Throughput: 0: 5902.4. Samples: 113306082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:05,782][25689] Avg episode reward: [(0, '-52.561')] [2022-07-09 05:37:07,678][26022] Updated weights on worker 0-0, policy_version 110659 (0.00081) [2022-07-09 05:37:09,098][26022] Updated weights on worker 0-0, policy_version 110669 (0.00083) [2022-07-09 05:37:10,843][25689] Fps is (10 sec: 5491.0, 60 sec: 5684.0, 300 sec: 5720.6). Total num frames: 113332224. Throughput: 0: 5912.9. Samples: 113340490. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:10,844][25689] Avg episode reward: [(0, '-52.738')] [2022-07-09 05:37:11,322][26022] Updated weights on worker 0-0, policy_version 110679 (0.00084) [2022-07-09 05:37:12,592][26022] Updated weights on worker 0-0, policy_version 110689 (0.00087) [2022-07-09 05:37:14,893][26022] Updated weights on worker 0-0, policy_version 110699 (0.00091) [2022-07-09 05:37:15,891][25689] Fps is (10 sec: 5773.9, 60 sec: 5732.9, 300 sec: 5727.4). Total num frames: 113362944. Throughput: 0: 5018.6. Samples: 113357488. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:15,891][25689] Avg episode reward: [(0, '-52.594')] [2022-07-09 05:37:16,475][26022] Updated weights on worker 0-0, policy_version 110709 (0.00090) [2022-07-09 05:37:18,191][26022] Updated weights on worker 0-0, policy_version 110719 (0.00091) [2022-07-09 05:37:20,139][26022] Updated weights on worker 0-0, policy_version 110729 (0.00078) [2022-07-09 05:37:20,978][25689] Fps is (10 sec: 5860.3, 60 sec: 5708.5, 300 sec: 5723.9). Total num frames: 113391616. Throughput: 0: 5871.5. Samples: 113392182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:20,978][25689] Avg episode reward: [(0, '-52.624')] [2022-07-09 05:37:21,692][26022] Updated weights on worker 0-0, policy_version 110739 (0.00092) [2022-07-09 05:37:23,667][26022] Updated weights on worker 0-0, policy_version 110749 (0.00087) [2022-07-09 05:37:25,440][26022] Updated weights on worker 0-0, policy_version 110759 (0.01398) [2022-07-09 05:37:25,984][25689] Fps is (10 sec: 5580.0, 60 sec: 5709.8, 300 sec: 5722.7). Total num frames: 113419264. Throughput: 0: 5974.0. Samples: 113426770. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:25,984][25689] Avg episode reward: [(0, '-53.249')] [2022-07-09 05:37:26,942][26022] Updated weights on worker 0-0, policy_version 110769 (0.00084) [2022-07-09 05:37:29,296][26022] Updated weights on worker 0-0, policy_version 110779 (0.00089) [2022-07-09 05:37:30,710][26022] Updated weights on worker 0-0, policy_version 110789 (0.00099) [2022-07-09 05:37:31,113][25689] Fps is (10 sec: 5758.9, 60 sec: 5742.8, 300 sec: 5720.3). Total num frames: 113449984. Throughput: 0: 5088.6. Samples: 113443634. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:31,114][25689] Avg episode reward: [(0, '-53.135')] [2022-07-09 05:37:32,872][26022] Updated weights on worker 0-0, policy_version 110799 (0.00078) [2022-07-09 05:37:34,435][26022] Updated weights on worker 0-0, policy_version 110809 (0.00084) [2022-07-09 05:37:36,151][25689] Fps is (10 sec: 5740.7, 60 sec: 5674.6, 300 sec: 5719.6). Total num frames: 113477632. Throughput: 0: 5942.2. Samples: 113477880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:36,151][25689] Avg episode reward: [(0, '-53.221')] [2022-07-09 05:37:36,211][26022] Updated weights on worker 0-0, policy_version 110819 (0.00091) [2022-07-09 05:37:38,225][26022] Updated weights on worker 0-0, policy_version 110829 (0.00088) [2022-07-09 05:37:39,719][26022] Updated weights on worker 0-0, policy_version 110839 (0.00088) [2022-07-09 05:37:41,191][25689] Fps is (10 sec: 5487.0, 60 sec: 5695.4, 300 sec: 5712.4). Total num frames: 113505280. Throughput: 0: 5943.8. Samples: 113512324. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:41,191][25689] Avg episode reward: [(0, '-53.205')] [2022-07-09 05:37:41,694][26022] Updated weights on worker 0-0, policy_version 110849 (0.00086) [2022-07-09 05:37:43,274][26022] Updated weights on worker 0-0, policy_version 110859 (0.00087) [2022-07-09 05:37:45,232][26022] Updated weights on worker 0-0, policy_version 110869 (0.00085) [2022-07-09 05:37:46,225][25689] Fps is (10 sec: 5895.6, 60 sec: 5726.6, 300 sec: 5723.7). Total num frames: 113537024. Throughput: 0: 5079.6. Samples: 113529588. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:46,225][25689] Avg episode reward: [(0, '-53.044')] [2022-07-09 05:37:46,851][26022] Updated weights on worker 0-0, policy_version 110879 (0.00085) [2022-07-09 05:37:48,675][26022] Updated weights on worker 0-0, policy_version 110889 (0.00092) [2022-07-09 05:37:50,393][26022] Updated weights on worker 0-0, policy_version 110899 (0.00073) [2022-07-09 05:37:51,366][25689] Fps is (10 sec: 5837.2, 60 sec: 5690.5, 300 sec: 5714.2). Total num frames: 113564672. Throughput: 0: 5940.0. Samples: 113563934. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:51,366][25689] Avg episode reward: [(0, '-51.878')] [2022-07-09 05:37:52,261][26022] Updated weights on worker 0-0, policy_version 110909 (0.00090) [2022-07-09 05:37:54,014][26022] Updated weights on worker 0-0, policy_version 110919 (0.00087) [2022-07-09 05:37:55,838][26022] Updated weights on worker 0-0, policy_version 110929 (0.00084) [2022-07-09 05:37:56,368][25689] Fps is (10 sec: 5552.7, 60 sec: 5690.9, 300 sec: 5717.8). Total num frames: 113593344. Throughput: 0: 5972.2. Samples: 113598618. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:37:56,369][25689] Avg episode reward: [(0, '-51.302')] [2022-07-09 05:37:57,572][26022] Updated weights on worker 0-0, policy_version 110939 (0.00090) [2022-07-09 05:37:59,501][26022] Updated weights on worker 0-0, policy_version 110949 (0.00086) [2022-07-09 05:38:01,248][26022] Updated weights on worker 0-0, policy_version 110959 (0.00086) [2022-07-09 05:38:01,378][25689] Fps is (10 sec: 5829.7, 60 sec: 5711.0, 300 sec: 5721.8). Total num frames: 113623040. Throughput: 0: 5132.3. Samples: 113615930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:01,383][25689] Avg episode reward: [(0, '-51.410')] [2022-07-09 05:38:03,353][26022] Updated weights on worker 0-0, policy_version 110969 (0.00081) [2022-07-09 05:38:05,101][26022] Updated weights on worker 0-0, policy_version 110979 (0.00108) [2022-07-09 05:38:06,398][25689] Fps is (10 sec: 5615.2, 60 sec: 5693.0, 300 sec: 5712.4). Total num frames: 113649664. Throughput: 0: 5887.4. Samples: 113648354. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:06,398][25689] Avg episode reward: [(0, '-51.105')] [2022-07-09 05:38:06,987][26022] Updated weights on worker 0-0, policy_version 110989 (0.00092) [2022-07-09 05:38:08,491][26022] Updated weights on worker 0-0, policy_version 110999 (0.00083) [2022-07-09 05:38:10,651][26022] Updated weights on worker 0-0, policy_version 111009 (0.00091) [2022-07-09 05:38:11,483][25689] Fps is (10 sec: 5573.0, 60 sec: 5724.5, 300 sec: 5715.0). Total num frames: 113679360. Throughput: 0: 5921.8. Samples: 113683068. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:11,484][25689] Avg episode reward: [(0, '-51.372')] [2022-07-09 05:38:12,147][26022] Updated weights on worker 0-0, policy_version 111019 (0.00086) [2022-07-09 05:38:14,103][26022] Updated weights on worker 0-0, policy_version 111029 (0.00084) [2022-07-09 05:38:15,707][26022] Updated weights on worker 0-0, policy_version 111039 (0.00084) [2022-07-09 05:38:16,507][25689] Fps is (10 sec: 5773.9, 60 sec: 5693.0, 300 sec: 5714.7). Total num frames: 113708032. Throughput: 0: 5054.4. Samples: 113700408. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:16,508][25689] Avg episode reward: [(0, '-52.909')] [2022-07-09 05:38:17,589][26022] Updated weights on worker 0-0, policy_version 111049 (0.00054) [2022-07-09 05:38:19,264][26022] Updated weights on worker 0-0, policy_version 111059 (0.00081) [2022-07-09 05:38:21,315][26022] Updated weights on worker 0-0, policy_version 111069 (0.00087) [2022-07-09 05:38:21,535][25689] Fps is (10 sec: 5705.1, 60 sec: 5698.6, 300 sec: 5711.2). Total num frames: 113736704. Throughput: 0: 5915.4. Samples: 113735168. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:21,535][25689] Avg episode reward: [(0, '-52.964')] [2022-07-09 05:38:22,817][26022] Updated weights on worker 0-0, policy_version 111079 (0.00088) [2022-07-09 05:38:24,619][26022] Updated weights on worker 0-0, policy_version 111089 (0.00083) [2022-07-09 05:38:26,438][26022] Updated weights on worker 0-0, policy_version 111099 (0.00050) [2022-07-09 05:38:26,537][25689] Fps is (10 sec: 5716.9, 60 sec: 5715.8, 300 sec: 5716.1). Total num frames: 113765376. Throughput: 0: 6029.1. Samples: 113769778. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:26,538][25689] Avg episode reward: [(0, '-53.244')] [2022-07-09 05:38:28,120][26022] Updated weights on worker 0-0, policy_version 111109 (0.00088) [2022-07-09 05:38:29,429][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:38:29,443][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000111116_113782784.pth [2022-07-09 05:38:29,443][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000109102_111720448.pth [2022-07-09 05:38:29,444][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000111116_113782784.pth.milestone [2022-07-09 05:38:30,050][26022] Updated weights on worker 0-0, policy_version 111119 (0.00089) [2022-07-09 05:38:31,526][26022] Updated weights on worker 0-0, policy_version 111129 (0.00086) [2022-07-09 05:38:31,584][25689] Fps is (10 sec: 5910.2, 60 sec: 5723.6, 300 sec: 5722.2). Total num frames: 113796096. Throughput: 0: 5158.9. Samples: 113786764. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:31,584][25689] Avg episode reward: [(0, '-54.169')] [2022-07-09 05:38:33,612][26022] Updated weights on worker 0-0, policy_version 111139 (0.00092) [2022-07-09 05:38:35,176][26022] Updated weights on worker 0-0, policy_version 111149 (0.00093) [2022-07-09 05:38:36,643][25689] Fps is (10 sec: 5674.7, 60 sec: 5704.7, 300 sec: 5714.2). Total num frames: 113822720. Throughput: 0: 6023.9. Samples: 113821704. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:36,643][25689] Avg episode reward: [(0, '-54.029')] [2022-07-09 05:38:37,056][26022] Updated weights on worker 0-0, policy_version 111159 (0.00080) [2022-07-09 05:38:38,943][26022] Updated weights on worker 0-0, policy_version 111169 (0.00093) [2022-07-09 05:38:40,407][26022] Updated weights on worker 0-0, policy_version 111179 (0.00087) [2022-07-09 05:38:41,686][25689] Fps is (10 sec: 5575.0, 60 sec: 5738.2, 300 sec: 5713.9). Total num frames: 113852416. Throughput: 0: 6018.9. Samples: 113856456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:41,687][25689] Avg episode reward: [(0, '-54.144')] [2022-07-09 05:38:42,369][26022] Updated weights on worker 0-0, policy_version 111189 (0.00076) [2022-07-09 05:38:43,996][26022] Updated weights on worker 0-0, policy_version 111199 (0.00095) [2022-07-09 05:38:46,111][26022] Updated weights on worker 0-0, policy_version 111209 (0.00101) [2022-07-09 05:38:46,757][25689] Fps is (10 sec: 5973.1, 60 sec: 5717.8, 300 sec: 5714.3). Total num frames: 113883136. Throughput: 0: 5989.1. Samples: 113890878. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:46,758][25689] Avg episode reward: [(0, '-55.188')] [2022-07-09 05:38:47,836][26022] Updated weights on worker 0-0, policy_version 111219 (0.00085) [2022-07-09 05:38:49,467][26022] Updated weights on worker 0-0, policy_version 111229 (0.00092) [2022-07-09 05:38:51,328][26022] Updated weights on worker 0-0, policy_version 111239 (0.00090) [2022-07-09 05:38:51,888][25689] Fps is (10 sec: 5721.5, 60 sec: 5718.8, 300 sec: 5715.5). Total num frames: 113910784. Throughput: 0: 5984.7. Samples: 113908276. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 05:38:51,888][25689] Avg episode reward: [(0, '-55.104')] [2022-07-09 05:38:53,183][26022] Updated weights on worker 0-0, policy_version 111249 (0.00085) [2022-07-09 05:38:54,957][26022] Updated weights on worker 0-0, policy_version 111259 (0.00092) [2022-07-09 05:38:56,811][26022] Updated weights on worker 0-0, policy_version 111269 (0.00080) [2022-07-09 05:38:56,909][25689] Fps is (10 sec: 5547.8, 60 sec: 5716.9, 300 sec: 5708.8). Total num frames: 113939456. Throughput: 0: 5945.8. Samples: 113942204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:38:56,910][25689] Avg episode reward: [(0, '-54.070')] [2022-07-09 05:38:58,543][26022] Updated weights on worker 0-0, policy_version 111279 (0.00092) [2022-07-09 05:39:00,242][26022] Updated weights on worker 0-0, policy_version 111289 (0.00085) [2022-07-09 05:39:01,944][25689] Fps is (10 sec: 5803.8, 60 sec: 5714.6, 300 sec: 5722.8). Total num frames: 113969152. Throughput: 0: 5949.9. Samples: 113976992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:01,945][25689] Avg episode reward: [(0, '-54.038')] [2022-07-09 05:39:02,211][26022] Updated weights on worker 0-0, policy_version 111299 (0.00090) [2022-07-09 05:39:04,191][26022] Updated weights on worker 0-0, policy_version 111309 (0.00093) [2022-07-09 05:39:06,030][26022] Updated weights on worker 0-0, policy_version 111319 (0.00085) [2022-07-09 05:39:07,006][25689] Fps is (10 sec: 5577.7, 60 sec: 5710.6, 300 sec: 5716.0). Total num frames: 113995776. Throughput: 0: 5007.9. Samples: 113992284. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:07,007][25689] Avg episode reward: [(0, '-54.087')] [2022-07-09 05:39:07,782][26022] Updated weights on worker 0-0, policy_version 111329 (0.00091) [2022-07-09 05:39:09,590][26022] Updated weights on worker 0-0, policy_version 111339 (0.00088) [2022-07-09 05:39:11,299][26022] Updated weights on worker 0-0, policy_version 111349 (0.00088) [2022-07-09 05:39:12,103][25689] Fps is (10 sec: 5544.3, 60 sec: 5709.6, 300 sec: 5714.4). Total num frames: 114025472. Throughput: 0: 5875.4. Samples: 114027048. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:12,103][25689] Avg episode reward: [(0, '-53.740')] [2022-07-09 05:39:13,035][26022] Updated weights on worker 0-0, policy_version 111359 (0.00087) [2022-07-09 05:39:14,738][26022] Updated weights on worker 0-0, policy_version 111369 (0.00086) [2022-07-09 05:39:16,531][26022] Updated weights on worker 0-0, policy_version 111379 (0.00086) [2022-07-09 05:39:17,151][25689] Fps is (10 sec: 5753.9, 60 sec: 5707.3, 300 sec: 5713.9). Total num frames: 114054144. Throughput: 0: 5897.8. Samples: 114061582. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:17,151][25689] Avg episode reward: [(0, '-53.506')] [2022-07-09 05:39:18,416][26022] Updated weights on worker 0-0, policy_version 111389 (0.00091) [2022-07-09 05:39:20,238][26022] Updated weights on worker 0-0, policy_version 111399 (0.00364) [2022-07-09 05:39:21,974][26022] Updated weights on worker 0-0, policy_version 111409 (0.00107) [2022-07-09 05:39:22,188][25689] Fps is (10 sec: 5787.6, 60 sec: 5723.3, 300 sec: 5713.6). Total num frames: 114083840. Throughput: 0: 5027.2. Samples: 114078758. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:22,188][25689] Avg episode reward: [(0, '-52.934')] [2022-07-09 05:39:23,803][26022] Updated weights on worker 0-0, policy_version 111419 (0.00082) [2022-07-09 05:39:25,610][26022] Updated weights on worker 0-0, policy_version 111429 (0.00089) [2022-07-09 05:39:27,191][25689] Fps is (10 sec: 5813.3, 60 sec: 5723.3, 300 sec: 5714.6). Total num frames: 114112512. Throughput: 0: 5994.0. Samples: 114113270. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:27,193][25689] Avg episode reward: [(0, '-53.096')] [2022-07-09 05:39:27,394][26022] Updated weights on worker 0-0, policy_version 111439 (0.00052) [2022-07-09 05:39:29,053][26022] Updated weights on worker 0-0, policy_version 111449 (0.00087) [2022-07-09 05:39:30,925][26022] Updated weights on worker 0-0, policy_version 111459 (0.00080) [2022-07-09 05:39:32,262][25689] Fps is (10 sec: 5793.8, 60 sec: 5704.1, 300 sec: 5714.3). Total num frames: 114142208. Throughput: 0: 5976.1. Samples: 114147522. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:32,262][25689] Avg episode reward: [(0, '-52.059')] [2022-07-09 05:39:32,538][26022] Updated weights on worker 0-0, policy_version 111469 (0.00095) [2022-07-09 05:39:34,531][26022] Updated weights on worker 0-0, policy_version 111479 (0.00089) [2022-07-09 05:39:36,203][26022] Updated weights on worker 0-0, policy_version 111489 (0.00088) [2022-07-09 05:39:37,271][25689] Fps is (10 sec: 5688.6, 60 sec: 5725.6, 300 sec: 5707.7). Total num frames: 114169856. Throughput: 0: 5133.0. Samples: 114164862. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:37,273][25689] Avg episode reward: [(0, '-51.806')] [2022-07-09 05:39:38,148][26022] Updated weights on worker 0-0, policy_version 111499 (0.00095) [2022-07-09 05:39:39,974][26022] Updated weights on worker 0-0, policy_version 111509 (0.00096) [2022-07-09 05:39:41,621][26022] Updated weights on worker 0-0, policy_version 111519 (0.00089) [2022-07-09 05:39:42,362][25689] Fps is (10 sec: 5576.3, 60 sec: 5704.3, 300 sec: 5707.2). Total num frames: 114198528. Throughput: 0: 5975.0. Samples: 114199298. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:42,362][25689] Avg episode reward: [(0, '-52.863')] [2022-07-09 05:39:43,333][26022] Updated weights on worker 0-0, policy_version 111529 (0.00090) [2022-07-09 05:39:45,177][26022] Updated weights on worker 0-0, policy_version 111539 (0.00094) [2022-07-09 05:39:46,815][26022] Updated weights on worker 0-0, policy_version 111549 (0.00093) [2022-07-09 05:39:47,383][25689] Fps is (10 sec: 5772.5, 60 sec: 5692.1, 300 sec: 5715.0). Total num frames: 114228224. Throughput: 0: 5991.9. Samples: 114234258. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:47,383][25689] Avg episode reward: [(0, '-52.177')] [2022-07-09 05:39:48,842][26022] Updated weights on worker 0-0, policy_version 111559 (0.00084) [2022-07-09 05:39:50,374][26022] Updated weights on worker 0-0, policy_version 111569 (0.00054) [2022-07-09 05:39:52,354][26022] Updated weights on worker 0-0, policy_version 111579 (0.00093) [2022-07-09 05:39:52,443][25689] Fps is (10 sec: 5789.8, 60 sec: 5715.6, 300 sec: 5711.1). Total num frames: 114256896. Throughput: 0: 5151.6. Samples: 114251486. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:52,443][25689] Avg episode reward: [(0, '-52.259')] [2022-07-09 05:39:53,975][26022] Updated weights on worker 0-0, policy_version 111589 (0.00091) [2022-07-09 05:39:56,099][26022] Updated weights on worker 0-0, policy_version 111599 (0.00102) [2022-07-09 05:39:57,449][25689] Fps is (10 sec: 5798.5, 60 sec: 5734.0, 300 sec: 5711.7). Total num frames: 114286592. Throughput: 0: 5987.4. Samples: 114285672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:39:57,449][25689] Avg episode reward: [(0, '-53.218')] [2022-07-09 05:39:57,584][26022] Updated weights on worker 0-0, policy_version 111609 (0.00085) [2022-07-09 05:39:59,620][26022] Updated weights on worker 0-0, policy_version 111619 (0.00092) [2022-07-09 05:40:01,067][26022] Updated weights on worker 0-0, policy_version 111629 (0.00094) [2022-07-09 05:40:02,465][25689] Fps is (10 sec: 5619.9, 60 sec: 5685.1, 300 sec: 5715.8). Total num frames: 114313216. Throughput: 0: 6011.8. Samples: 114320152. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:02,465][25689] Avg episode reward: [(0, '-53.626')] [2022-07-09 05:40:03,539][26022] Updated weights on worker 0-0, policy_version 111639 (0.00085) [2022-07-09 05:40:05,146][26022] Updated weights on worker 0-0, policy_version 111649 (0.00092) [2022-07-09 05:40:07,065][26022] Updated weights on worker 0-0, policy_version 111659 (0.00086) [2022-07-09 05:40:07,507][25689] Fps is (10 sec: 5599.3, 60 sec: 5737.7, 300 sec: 5712.9). Total num frames: 114342912. Throughput: 0: 5036.4. Samples: 114335612. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:07,508][25689] Avg episode reward: [(0, '-53.899')] [2022-07-09 05:40:08,835][26022] Updated weights on worker 0-0, policy_version 111669 (0.00087) [2022-07-09 05:40:10,517][26022] Updated weights on worker 0-0, policy_version 111679 (0.00089) [2022-07-09 05:40:12,352][26022] Updated weights on worker 0-0, policy_version 111689 (0.00087) [2022-07-09 05:40:12,572][25689] Fps is (10 sec: 5673.5, 60 sec: 5706.8, 300 sec: 5712.0). Total num frames: 114370560. Throughput: 0: 5884.2. Samples: 114369928. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:12,572][25689] Avg episode reward: [(0, '-53.176')] [2022-07-09 05:40:14,041][26022] Updated weights on worker 0-0, policy_version 111699 (0.00090) [2022-07-09 05:40:15,851][26022] Updated weights on worker 0-0, policy_version 111709 (0.00096) [2022-07-09 05:40:17,594][25689] Fps is (10 sec: 5583.5, 60 sec: 5709.3, 300 sec: 5708.3). Total num frames: 114399232. Throughput: 0: 5898.9. Samples: 114404506. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:17,595][25689] Avg episode reward: [(0, '-53.609')] [2022-07-09 05:40:17,752][26022] Updated weights on worker 0-0, policy_version 111719 (0.00087) [2022-07-09 05:40:19,346][26022] Updated weights on worker 0-0, policy_version 111729 (0.00088) [2022-07-09 05:40:21,169][26022] Updated weights on worker 0-0, policy_version 111739 (0.00086) [2022-07-09 05:40:22,622][25689] Fps is (10 sec: 5909.3, 60 sec: 5727.0, 300 sec: 5719.7). Total num frames: 114429952. Throughput: 0: 5065.2. Samples: 114422256. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:22,623][25689] Avg episode reward: [(0, '-53.995')] [2022-07-09 05:40:22,905][26022] Updated weights on worker 0-0, policy_version 111749 (0.00089) [2022-07-09 05:40:24,512][26022] Updated weights on worker 0-0, policy_version 111759 (0.00092) [2022-07-09 05:40:26,627][26022] Updated weights on worker 0-0, policy_version 111769 (0.00088) [2022-07-09 05:40:27,712][25689] Fps is (10 sec: 5768.6, 60 sec: 5701.9, 300 sec: 5715.9). Total num frames: 114457600. Throughput: 0: 5999.8. Samples: 114456838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:27,713][25689] Avg episode reward: [(0, '-54.055')] [2022-07-09 05:40:28,278][26022] Updated weights on worker 0-0, policy_version 111779 (0.00090) [2022-07-09 05:40:29,471][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:40:29,480][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000111785_114467840.pth [2022-07-09 05:40:29,483][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000109775_112409600.pth [2022-07-09 05:40:29,977][26022] Updated weights on worker 0-0, policy_version 111789 (0.00091) [2022-07-09 05:40:31,961][26022] Updated weights on worker 0-0, policy_version 111799 (0.00091) [2022-07-09 05:40:32,781][25689] Fps is (10 sec: 5544.4, 60 sec: 5685.3, 300 sec: 5708.0). Total num frames: 114486272. Throughput: 0: 6008.3. Samples: 114491348. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:32,781][25689] Avg episode reward: [(0, '-53.411')] [2022-07-09 05:40:33,369][26022] Updated weights on worker 0-0, policy_version 111809 (0.00309) [2022-07-09 05:40:35,595][26022] Updated weights on worker 0-0, policy_version 111819 (0.00085) [2022-07-09 05:40:37,022][26022] Updated weights on worker 0-0, policy_version 111829 (0.00079) [2022-07-09 05:40:37,828][25689] Fps is (10 sec: 5770.0, 60 sec: 5715.5, 300 sec: 5710.9). Total num frames: 114515968. Throughput: 0: 6010.5. Samples: 114526122. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:37,829][25689] Avg episode reward: [(0, '-53.865')] [2022-07-09 05:40:39,027][26022] Updated weights on worker 0-0, policy_version 111839 (0.00073) [2022-07-09 05:40:40,737][26022] Updated weights on worker 0-0, policy_version 111849 (0.00083) [2022-07-09 05:40:42,304][26022] Updated weights on worker 0-0, policy_version 111859 (0.00085) [2022-07-09 05:40:42,912][25689] Fps is (10 sec: 5861.9, 60 sec: 5733.0, 300 sec: 5716.2). Total num frames: 114545664. Throughput: 0: 5977.6. Samples: 114543540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:42,913][25689] Avg episode reward: [(0, '-53.042')] [2022-07-09 05:40:44,224][26022] Updated weights on worker 0-0, policy_version 111869 (0.00092) [2022-07-09 05:40:46,138][26022] Updated weights on worker 0-0, policy_version 111879 (0.00084) [2022-07-09 05:40:47,709][26022] Updated weights on worker 0-0, policy_version 111889 (0.00084) [2022-07-09 05:40:47,937][25689] Fps is (10 sec: 5875.0, 60 sec: 5732.6, 300 sec: 5714.7). Total num frames: 114575360. Throughput: 0: 6000.0. Samples: 114578188. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:47,939][25689] Avg episode reward: [(0, '-52.906')] [2022-07-09 05:40:49,691][26022] Updated weights on worker 0-0, policy_version 111899 (0.00082) [2022-07-09 05:40:51,083][26022] Updated weights on worker 0-0, policy_version 111909 (0.00089) [2022-07-09 05:40:53,026][25689] Fps is (10 sec: 5771.1, 60 sec: 5729.9, 300 sec: 5713.5). Total num frames: 114604032. Throughput: 0: 5994.4. Samples: 114612708. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 05:40:53,027][25689] Avg episode reward: [(0, '-52.460')] [2022-07-09 05:40:53,247][26022] Updated weights on worker 0-0, policy_version 111919 (0.00361) [2022-07-09 05:40:54,953][26022] Updated weights on worker 0-0, policy_version 111929 (0.00093) [2022-07-09 05:40:56,642][26022] Updated weights on worker 0-0, policy_version 111939 (0.00085) [2022-07-09 05:40:58,073][25689] Fps is (10 sec: 5556.8, 60 sec: 5692.3, 300 sec: 5706.6). Total num frames: 114631680. Throughput: 0: 5122.6. Samples: 114629828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:40:58,073][25689] Avg episode reward: [(0, '-52.789')] [2022-07-09 05:40:58,453][26022] Updated weights on worker 0-0, policy_version 111949 (0.00079) [2022-07-09 05:41:00,347][26022] Updated weights on worker 0-0, policy_version 111959 (0.00084) [2022-07-09 05:41:02,464][26022] Updated weights on worker 0-0, policy_version 111969 (0.00132) [2022-07-09 05:41:03,083][25689] Fps is (10 sec: 5600.4, 60 sec: 5726.6, 300 sec: 5717.9). Total num frames: 114660352. Throughput: 0: 5924.6. Samples: 114663042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:03,083][25689] Avg episode reward: [(0, '-52.710')] [2022-07-09 05:41:04,271][26022] Updated weights on worker 0-0, policy_version 111979 (0.00083) [2022-07-09 05:41:05,927][26022] Updated weights on worker 0-0, policy_version 111989 (0.00081) [2022-07-09 05:41:07,823][26022] Updated weights on worker 0-0, policy_version 111999 (0.00087) [2022-07-09 05:41:08,097][25689] Fps is (10 sec: 5720.5, 60 sec: 5712.4, 300 sec: 5711.7). Total num frames: 114689024. Throughput: 0: 5907.9. Samples: 114697290. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:08,097][25689] Avg episode reward: [(0, '-52.428')] [2022-07-09 05:41:09,533][26022] Updated weights on worker 0-0, policy_version 112009 (0.00084) [2022-07-09 05:41:11,240][26022] Updated weights on worker 0-0, policy_version 112019 (0.00092) [2022-07-09 05:41:13,091][26022] Updated weights on worker 0-0, policy_version 112029 (0.00081) [2022-07-09 05:41:13,184][25689] Fps is (10 sec: 5778.5, 60 sec: 5744.1, 300 sec: 5717.4). Total num frames: 114718720. Throughput: 0: 5057.1. Samples: 114714646. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:13,184][25689] Avg episode reward: [(0, '-52.778')] [2022-07-09 05:41:14,847][26022] Updated weights on worker 0-0, policy_version 112039 (0.00087) [2022-07-09 05:41:16,625][26022] Updated weights on worker 0-0, policy_version 112049 (0.00094) [2022-07-09 05:41:18,256][25689] Fps is (10 sec: 5745.6, 60 sec: 5739.4, 300 sec: 5712.8). Total num frames: 114747392. Throughput: 0: 5940.1. Samples: 114749718. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:18,256][25689] Avg episode reward: [(0, '-52.794')] [2022-07-09 05:41:18,339][26022] Updated weights on worker 0-0, policy_version 112059 (0.00090) [2022-07-09 05:41:20,016][26022] Updated weights on worker 0-0, policy_version 112069 (0.00620) [2022-07-09 05:41:21,767][26022] Updated weights on worker 0-0, policy_version 112079 (0.00090) [2022-07-09 05:41:23,299][25689] Fps is (10 sec: 5770.3, 60 sec: 5721.1, 300 sec: 5719.2). Total num frames: 114777088. Throughput: 0: 6014.7. Samples: 114784636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:23,304][25689] Avg episode reward: [(0, '-53.202')] [2022-07-09 05:41:23,575][26022] Updated weights on worker 0-0, policy_version 112089 (0.00089) [2022-07-09 05:41:25,474][26022] Updated weights on worker 0-0, policy_version 112099 (0.00085) [2022-07-09 05:41:27,239][26022] Updated weights on worker 0-0, policy_version 112109 (0.00087) [2022-07-09 05:41:28,343][25689] Fps is (10 sec: 5684.9, 60 sec: 5725.4, 300 sec: 5717.3). Total num frames: 114804736. Throughput: 0: 5170.4. Samples: 114801968. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:28,343][25689] Avg episode reward: [(0, '-52.746')] [2022-07-09 05:41:28,896][26022] Updated weights on worker 0-0, policy_version 112119 (0.00090) [2022-07-09 05:41:30,854][26022] Updated weights on worker 0-0, policy_version 112129 (0.00095) [2022-07-09 05:41:32,451][26022] Updated weights on worker 0-0, policy_version 112139 (0.00100) [2022-07-09 05:41:33,390][25689] Fps is (10 sec: 5784.1, 60 sec: 5761.2, 300 sec: 5713.5). Total num frames: 114835456. Throughput: 0: 6030.7. Samples: 114836504. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:33,392][25689] Avg episode reward: [(0, '-52.147')] [2022-07-09 05:41:34,419][26022] Updated weights on worker 0-0, policy_version 112149 (0.00094) [2022-07-09 05:41:36,011][26022] Updated weights on worker 0-0, policy_version 112159 (0.00085) [2022-07-09 05:41:37,886][26022] Updated weights on worker 0-0, policy_version 112169 (0.00081) [2022-07-09 05:41:38,432][25689] Fps is (10 sec: 5886.7, 60 sec: 5744.8, 300 sec: 5721.1). Total num frames: 114864128. Throughput: 0: 6013.2. Samples: 114871042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:38,434][25689] Avg episode reward: [(0, '-53.435')] [2022-07-09 05:41:39,639][26022] Updated weights on worker 0-0, policy_version 112179 (0.00090) [2022-07-09 05:41:41,298][26022] Updated weights on worker 0-0, policy_version 112189 (0.00093) [2022-07-09 05:41:43,245][26022] Updated weights on worker 0-0, policy_version 112199 (0.00054) [2022-07-09 05:41:43,453][25689] Fps is (10 sec: 5800.2, 60 sec: 5750.8, 300 sec: 5720.9). Total num frames: 114893824. Throughput: 0: 5148.4. Samples: 114888398. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:43,455][25689] Avg episode reward: [(0, '-52.844')] [2022-07-09 05:41:45,119][26022] Updated weights on worker 0-0, policy_version 112209 (0.00077) [2022-07-09 05:41:46,799][26022] Updated weights on worker 0-0, policy_version 112219 (0.00096) [2022-07-09 05:41:48,503][25689] Fps is (10 sec: 5592.1, 60 sec: 5697.7, 300 sec: 5711.8). Total num frames: 114920448. Throughput: 0: 5989.3. Samples: 114922716. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:48,504][25689] Avg episode reward: [(0, '-52.557')] [2022-07-09 05:41:48,723][26022] Updated weights on worker 0-0, policy_version 112229 (0.00082) [2022-07-09 05:41:50,182][26022] Updated weights on worker 0-0, policy_version 112239 (0.00087) [2022-07-09 05:41:52,183][26022] Updated weights on worker 0-0, policy_version 112249 (0.00083) [2022-07-09 05:41:53,624][25689] Fps is (10 sec: 5638.0, 60 sec: 5728.5, 300 sec: 5716.5). Total num frames: 114951168. Throughput: 0: 5963.5. Samples: 114957170. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:53,625][25689] Avg episode reward: [(0, '-52.360')] [2022-07-09 05:41:54,050][26022] Updated weights on worker 0-0, policy_version 112259 (0.00096) [2022-07-09 05:41:55,583][26022] Updated weights on worker 0-0, policy_version 112269 (0.00081) [2022-07-09 05:41:57,781][26022] Updated weights on worker 0-0, policy_version 112279 (0.00086) [2022-07-09 05:41:58,679][25689] Fps is (10 sec: 5836.7, 60 sec: 5744.6, 300 sec: 5716.3). Total num frames: 114979840. Throughput: 0: 5101.4. Samples: 114974330. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:41:58,680][25689] Avg episode reward: [(0, '-52.835')] [2022-07-09 05:41:59,348][26022] Updated weights on worker 0-0, policy_version 112289 (0.00091) [2022-07-09 05:42:01,208][26022] Updated weights on worker 0-0, policy_version 112299 (0.00081) [2022-07-09 05:42:03,399][26022] Updated weights on worker 0-0, policy_version 112309 (0.00089) [2022-07-09 05:42:03,686][25689] Fps is (10 sec: 5394.0, 60 sec: 5694.2, 300 sec: 5709.4). Total num frames: 115005440. Throughput: 0: 5827.3. Samples: 115006300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:03,687][25689] Avg episode reward: [(0, '-53.008')] [2022-07-09 05:42:04,837][26022] Updated weights on worker 0-0, policy_version 112319 (0.00083) [2022-07-09 05:42:07,112][26022] Updated weights on worker 0-0, policy_version 112329 (0.00098) [2022-07-09 05:42:08,678][26022] Updated weights on worker 0-0, policy_version 112339 (0.00087) [2022-07-09 05:42:08,699][25689] Fps is (10 sec: 5518.4, 60 sec: 5711.2, 300 sec: 5717.2). Total num frames: 115035136. Throughput: 0: 5853.3. Samples: 115040928. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:08,700][25689] Avg episode reward: [(0, '-52.418')] [2022-07-09 05:42:10,495][26022] Updated weights on worker 0-0, policy_version 112349 (0.00089) [2022-07-09 05:42:12,267][26022] Updated weights on worker 0-0, policy_version 112359 (0.00088) [2022-07-09 05:42:13,835][25689] Fps is (10 sec: 5751.0, 60 sec: 5689.7, 300 sec: 5708.7). Total num frames: 115063808. Throughput: 0: 4997.8. Samples: 115058180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:13,835][25689] Avg episode reward: [(0, '-52.502')] [2022-07-09 05:42:14,067][26022] Updated weights on worker 0-0, policy_version 112369 (0.00090) [2022-07-09 05:42:15,883][26022] Updated weights on worker 0-0, policy_version 112379 (0.00084) [2022-07-09 05:42:17,717][26022] Updated weights on worker 0-0, policy_version 112389 (0.00090) [2022-07-09 05:42:18,894][25689] Fps is (10 sec: 5624.7, 60 sec: 5690.9, 300 sec: 5709.2). Total num frames: 115092480. Throughput: 0: 5851.2. Samples: 115092614. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:18,895][25689] Avg episode reward: [(0, '-52.568')] [2022-07-09 05:42:19,310][26022] Updated weights on worker 0-0, policy_version 112399 (0.00097) [2022-07-09 05:42:21,350][26022] Updated weights on worker 0-0, policy_version 112409 (0.00089) [2022-07-09 05:42:22,862][26022] Updated weights on worker 0-0, policy_version 112419 (0.00057) [2022-07-09 05:42:23,949][25689] Fps is (10 sec: 5771.0, 60 sec: 5689.8, 300 sec: 5715.1). Total num frames: 115122176. Throughput: 0: 5951.2. Samples: 115126890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:23,949][25689] Avg episode reward: [(0, '-51.844')] [2022-07-09 05:42:25,053][26022] Updated weights on worker 0-0, policy_version 112429 (0.00508) [2022-07-09 05:42:26,399][26022] Updated weights on worker 0-0, policy_version 112439 (0.00083) [2022-07-09 05:42:28,468][26022] Updated weights on worker 0-0, policy_version 112449 (0.00095) [2022-07-09 05:42:28,962][25689] Fps is (10 sec: 5797.8, 60 sec: 5709.6, 300 sec: 5710.5). Total num frames: 115150848. Throughput: 0: 5080.5. Samples: 115143872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:28,962][25689] Avg episode reward: [(0, '-51.733')] [2022-07-09 05:42:29,601][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:42:29,615][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000112457_115155968.pth [2022-07-09 05:42:29,615][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000110446_113096704.pth [2022-07-09 05:42:30,058][26022] Updated weights on worker 0-0, policy_version 112459 (0.00082) [2022-07-09 05:42:31,899][26022] Updated weights on worker 0-0, policy_version 112469 (0.00090) [2022-07-09 05:42:33,793][26022] Updated weights on worker 0-0, policy_version 112479 (0.00093) [2022-07-09 05:42:34,039][25689] Fps is (10 sec: 5581.8, 60 sec: 5656.2, 300 sec: 5709.7). Total num frames: 115178496. Throughput: 0: 5942.7. Samples: 115178246. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:34,039][25689] Avg episode reward: [(0, '-52.449')] [2022-07-09 05:42:35,537][26022] Updated weights on worker 0-0, policy_version 112489 (0.00087) [2022-07-09 05:42:37,268][26022] Updated weights on worker 0-0, policy_version 112499 (0.00087) [2022-07-09 05:42:39,050][25689] Fps is (10 sec: 5684.4, 60 sec: 5676.0, 300 sec: 5717.2). Total num frames: 115208192. Throughput: 0: 5970.8. Samples: 115212958. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:39,050][25689] Avg episode reward: [(0, '-52.290')] [2022-07-09 05:42:39,099][26022] Updated weights on worker 0-0, policy_version 112509 (0.00088) [2022-07-09 05:42:40,707][26022] Updated weights on worker 0-0, policy_version 112519 (0.00087) [2022-07-09 05:42:42,610][26022] Updated weights on worker 0-0, policy_version 112529 (0.00084) [2022-07-09 05:42:44,064][25689] Fps is (10 sec: 5822.0, 60 sec: 5659.7, 300 sec: 5707.2). Total num frames: 115236864. Throughput: 0: 5143.5. Samples: 115230352. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:44,066][25689] Avg episode reward: [(0, '-52.240')] [2022-07-09 05:42:44,342][26022] Updated weights on worker 0-0, policy_version 112539 (0.00053) [2022-07-09 05:42:46,297][26022] Updated weights on worker 0-0, policy_version 112549 (0.00081) [2022-07-09 05:42:48,033][26022] Updated weights on worker 0-0, policy_version 112559 (0.00083) [2022-07-09 05:42:49,099][25689] Fps is (10 sec: 5808.2, 60 sec: 5711.8, 300 sec: 5716.2). Total num frames: 115266560. Throughput: 0: 6004.1. Samples: 115264778. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:49,100][25689] Avg episode reward: [(0, '-52.758')] [2022-07-09 05:42:49,877][26022] Updated weights on worker 0-0, policy_version 112569 (0.00089) [2022-07-09 05:42:51,489][26022] Updated weights on worker 0-0, policy_version 112579 (0.00083) [2022-07-09 05:42:53,425][26022] Updated weights on worker 0-0, policy_version 112589 (0.00084) [2022-07-09 05:42:54,223][25689] Fps is (10 sec: 5745.8, 60 sec: 5677.8, 300 sec: 5713.8). Total num frames: 115295232. Throughput: 0: 5985.8. Samples: 115299062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-09 05:42:54,224][25689] Avg episode reward: [(0, '-53.110')] [2022-07-09 05:42:55,122][26022] Updated weights on worker 0-0, policy_version 112599 (0.00078) [2022-07-09 05:42:57,058][26022] Updated weights on worker 0-0, policy_version 112609 (0.00088) [2022-07-09 05:42:58,823][26022] Updated weights on worker 0-0, policy_version 112619 (0.00094) [2022-07-09 05:42:59,258][25689] Fps is (10 sec: 5543.5, 60 sec: 5662.6, 300 sec: 5706.4). Total num frames: 115322880. Throughput: 0: 5937.5. Samples: 115332948. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:42:59,259][25689] Avg episode reward: [(0, '-52.585')] [2022-07-09 05:43:00,522][26022] Updated weights on worker 0-0, policy_version 112629 (0.00085) [2022-07-09 05:43:02,819][26022] Updated weights on worker 0-0, policy_version 112639 (0.00091) [2022-07-09 05:43:04,276][25689] Fps is (10 sec: 5500.3, 60 sec: 5695.4, 300 sec: 5709.9). Total num frames: 115350528. Throughput: 0: 5895.7. Samples: 115349514. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:04,278][25689] Avg episode reward: [(0, '-52.713')] [2022-07-09 05:43:04,519][26022] Updated weights on worker 0-0, policy_version 112649 (0.00078) [2022-07-09 05:43:06,230][26022] Updated weights on worker 0-0, policy_version 112659 (0.00089) [2022-07-09 05:43:08,047][26022] Updated weights on worker 0-0, policy_version 112669 (0.00087) [2022-07-09 05:43:09,367][25689] Fps is (10 sec: 5774.4, 60 sec: 5705.1, 300 sec: 5713.3). Total num frames: 115381248. Throughput: 0: 5830.6. Samples: 115382952. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:09,367][25689] Avg episode reward: [(0, '-53.050')] [2022-07-09 05:43:09,707][26022] Updated weights on worker 0-0, policy_version 112679 (0.00085) [2022-07-09 05:43:11,564][26022] Updated weights on worker 0-0, policy_version 112689 (0.00087) [2022-07-09 05:43:13,532][26022] Updated weights on worker 0-0, policy_version 112699 (0.00089) [2022-07-09 05:43:14,416][25689] Fps is (10 sec: 5856.9, 60 sec: 5713.2, 300 sec: 5712.8). Total num frames: 115409920. Throughput: 0: 5885.7. Samples: 115417916. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:14,417][25689] Avg episode reward: [(0, '-53.023')] [2022-07-09 05:43:15,039][26022] Updated weights on worker 0-0, policy_version 112709 (0.00096) [2022-07-09 05:43:17,012][26022] Updated weights on worker 0-0, policy_version 112719 (0.00088) [2022-07-09 05:43:18,401][26022] Updated weights on worker 0-0, policy_version 112729 (0.00082) [2022-07-09 05:43:19,447][25689] Fps is (10 sec: 5587.0, 60 sec: 5699.0, 300 sec: 5709.3). Total num frames: 115437568. Throughput: 0: 5069.0. Samples: 115435284. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:19,448][25689] Avg episode reward: [(0, '-52.631')] [2022-07-09 05:43:20,560][26022] Updated weights on worker 0-0, policy_version 112739 (0.00082) [2022-07-09 05:43:22,047][26022] Updated weights on worker 0-0, policy_version 112749 (0.00085) [2022-07-09 05:43:24,019][26022] Updated weights on worker 0-0, policy_version 112759 (0.00094) [2022-07-09 05:43:24,464][25689] Fps is (10 sec: 5809.3, 60 sec: 5719.5, 300 sec: 5715.9). Total num frames: 115468288. Throughput: 0: 5963.3. Samples: 115469900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:24,464][25689] Avg episode reward: [(0, '-52.760')] [2022-07-09 05:43:25,618][26022] Updated weights on worker 0-0, policy_version 112769 (0.00086) [2022-07-09 05:43:27,664][26022] Updated weights on worker 0-0, policy_version 112779 (0.00083) [2022-07-09 05:43:29,170][26022] Updated weights on worker 0-0, policy_version 112789 (0.00089) [2022-07-09 05:43:29,503][25689] Fps is (10 sec: 5906.3, 60 sec: 5717.0, 300 sec: 5709.2). Total num frames: 115496960. Throughput: 0: 6030.2. Samples: 115504378. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:29,503][25689] Avg episode reward: [(0, '-53.646')] [2022-07-09 05:43:31,319][26022] Updated weights on worker 0-0, policy_version 112799 (0.00087) [2022-07-09 05:43:32,848][26022] Updated weights on worker 0-0, policy_version 112809 (0.00090) [2022-07-09 05:43:34,565][25689] Fps is (10 sec: 5676.8, 60 sec: 5735.3, 300 sec: 5716.0). Total num frames: 115525632. Throughput: 0: 5141.6. Samples: 115521512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:34,565][25689] Avg episode reward: [(0, '-53.939')] [2022-07-09 05:43:34,715][26022] Updated weights on worker 0-0, policy_version 112819 (0.00093) [2022-07-09 05:43:36,723][26022] Updated weights on worker 0-0, policy_version 112829 (0.00081) [2022-07-09 05:43:38,095][26022] Updated weights on worker 0-0, policy_version 112839 (0.00086) [2022-07-09 05:43:39,621][25689] Fps is (10 sec: 5667.4, 60 sec: 5714.1, 300 sec: 5712.3). Total num frames: 115554304. Throughput: 0: 5989.7. Samples: 115556120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:39,621][25689] Avg episode reward: [(0, '-53.821')] [2022-07-09 05:43:40,087][26022] Updated weights on worker 0-0, policy_version 112849 (0.00081) [2022-07-09 05:43:41,898][26022] Updated weights on worker 0-0, policy_version 112859 (0.00091) [2022-07-09 05:43:43,546][26022] Updated weights on worker 0-0, policy_version 112869 (0.00081) [2022-07-09 05:43:44,656][25689] Fps is (10 sec: 5784.3, 60 sec: 5729.1, 300 sec: 5709.5). Total num frames: 115584000. Throughput: 0: 5992.5. Samples: 115590902. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:44,656][25689] Avg episode reward: [(0, '-53.839')] [2022-07-09 05:43:45,451][26022] Updated weights on worker 0-0, policy_version 112879 (0.00092) [2022-07-09 05:43:47,023][26022] Updated weights on worker 0-0, policy_version 112889 (0.00094) [2022-07-09 05:43:48,932][26022] Updated weights on worker 0-0, policy_version 112899 (0.00087) [2022-07-09 05:43:49,672][25689] Fps is (10 sec: 5807.3, 60 sec: 5714.0, 300 sec: 5715.2). Total num frames: 115612672. Throughput: 0: 5147.4. Samples: 115608196. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:49,672][25689] Avg episode reward: [(0, '-53.440')] [2022-07-09 05:43:50,699][26022] Updated weights on worker 0-0, policy_version 112909 (0.00092) [2022-07-09 05:43:52,292][26022] Updated weights on worker 0-0, policy_version 112919 (0.00081) [2022-07-09 05:43:54,204][26022] Updated weights on worker 0-0, policy_version 112929 (0.00094) [2022-07-09 05:43:54,739][25689] Fps is (10 sec: 5788.6, 60 sec: 5736.2, 300 sec: 5717.7). Total num frames: 115642368. Throughput: 0: 6018.1. Samples: 115642922. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:54,740][25689] Avg episode reward: [(0, '-51.657')] [2022-07-09 05:43:56,075][26022] Updated weights on worker 0-0, policy_version 112939 (0.00088) [2022-07-09 05:43:57,799][26022] Updated weights on worker 0-0, policy_version 112949 (0.00091) [2022-07-09 05:43:59,590][26022] Updated weights on worker 0-0, policy_version 112959 (0.00095) [2022-07-09 05:43:59,792][25689] Fps is (10 sec: 5666.1, 60 sec: 5734.6, 300 sec: 5710.5). Total num frames: 115670016. Throughput: 0: 6035.8. Samples: 115677870. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:43:59,793][25689] Avg episode reward: [(0, '-51.415')] [2022-07-09 05:44:01,372][26022] Updated weights on worker 0-0, policy_version 112969 (0.00088) [2022-07-09 05:44:03,627][26022] Updated weights on worker 0-0, policy_version 112979 (0.00086) [2022-07-09 05:44:04,799][25689] Fps is (10 sec: 5598.5, 60 sec: 5752.5, 300 sec: 5718.5). Total num frames: 115698688. Throughput: 0: 5067.9. Samples: 115692986. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:04,800][25689] Avg episode reward: [(0, '-51.529')] [2022-07-09 05:44:05,343][26022] Updated weights on worker 0-0, policy_version 112989 (0.00366) [2022-07-09 05:44:07,203][26022] Updated weights on worker 0-0, policy_version 112999 (0.00081) [2022-07-09 05:44:08,789][26022] Updated weights on worker 0-0, policy_version 113009 (0.00087) [2022-07-09 05:44:09,870][25689] Fps is (10 sec: 5588.5, 60 sec: 5703.6, 300 sec: 5712.1). Total num frames: 115726336. Throughput: 0: 5894.0. Samples: 115727244. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:09,872][25689] Avg episode reward: [(0, '-51.375')] [2022-07-09 05:44:10,752][26022] Updated weights on worker 0-0, policy_version 113019 (0.00094) [2022-07-09 05:44:12,354][26022] Updated weights on worker 0-0, policy_version 113029 (0.00065) [2022-07-09 05:44:14,406][26022] Updated weights on worker 0-0, policy_version 113039 (0.00116) [2022-07-09 05:44:14,989][25689] Fps is (10 sec: 5727.9, 60 sec: 5730.9, 300 sec: 5717.6). Total num frames: 115757056. Throughput: 0: 5880.5. Samples: 115762002. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:14,991][25689] Avg episode reward: [(0, '-52.117')] [2022-07-09 05:44:15,918][26022] Updated weights on worker 0-0, policy_version 113049 (0.00087) [2022-07-09 05:44:17,682][26022] Updated weights on worker 0-0, policy_version 113059 (0.00084) [2022-07-09 05:44:19,453][26022] Updated weights on worker 0-0, policy_version 113069 (0.00087) [2022-07-09 05:44:19,996][25689] Fps is (10 sec: 5865.1, 60 sec: 5750.0, 300 sec: 5714.7). Total num frames: 115785728. Throughput: 0: 5023.4. Samples: 115779364. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:19,997][25689] Avg episode reward: [(0, '-53.324')] [2022-07-09 05:44:21,198][26022] Updated weights on worker 0-0, policy_version 113079 (0.00092) [2022-07-09 05:44:23,074][26022] Updated weights on worker 0-0, policy_version 113089 (0.00089) [2022-07-09 05:44:24,736][26022] Updated weights on worker 0-0, policy_version 113099 (0.00090) [2022-07-09 05:44:25,057][25689] Fps is (10 sec: 5695.6, 60 sec: 5712.0, 300 sec: 5713.6). Total num frames: 115814400. Throughput: 0: 5986.8. Samples: 115814268. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:25,058][25689] Avg episode reward: [(0, '-53.704')] [2022-07-09 05:44:26,558][26022] Updated weights on worker 0-0, policy_version 113109 (0.00088) [2022-07-09 05:44:28,413][26022] Updated weights on worker 0-0, policy_version 113119 (0.00087) [2022-07-09 05:44:29,780][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:44:29,790][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000113128_115843072.pth [2022-07-09 05:44:29,790][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000111116_113782784.pth [2022-07-09 05:44:30,055][26022] Updated weights on worker 0-0, policy_version 113129 (0.00086) [2022-07-09 05:44:30,148][25689] Fps is (10 sec: 5749.3, 60 sec: 5724.0, 300 sec: 5713.2). Total num frames: 115844096. Throughput: 0: 5989.9. Samples: 115848710. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:30,149][25689] Avg episode reward: [(0, '-53.991')] [2022-07-09 05:44:31,993][26022] Updated weights on worker 0-0, policy_version 113139 (0.00099) [2022-07-09 05:44:33,910][26022] Updated weights on worker 0-0, policy_version 113149 (0.00091) [2022-07-09 05:44:35,273][25689] Fps is (10 sec: 5813.7, 60 sec: 5735.0, 300 sec: 5717.9). Total num frames: 115873792. Throughput: 0: 5113.4. Samples: 115865722. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:35,275][25689] Avg episode reward: [(0, '-54.700')] [2022-07-09 05:44:35,387][26022] Updated weights on worker 0-0, policy_version 113159 (0.00091) [2022-07-09 05:44:37,538][26022] Updated weights on worker 0-0, policy_version 113169 (0.00086) [2022-07-09 05:44:38,920][26022] Updated weights on worker 0-0, policy_version 113179 (0.00091) [2022-07-09 05:44:40,300][25689] Fps is (10 sec: 5649.0, 60 sec: 5720.9, 300 sec: 5715.7). Total num frames: 115901440. Throughput: 0: 5954.8. Samples: 115900266. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:40,300][25689] Avg episode reward: [(0, '-54.734')] [2022-07-09 05:44:40,932][26022] Updated weights on worker 0-0, policy_version 113189 (0.00085) [2022-07-09 05:44:42,630][26022] Updated weights on worker 0-0, policy_version 113199 (0.00089) [2022-07-09 05:44:44,318][26022] Updated weights on worker 0-0, policy_version 113209 (0.00089) [2022-07-09 05:44:45,373][25689] Fps is (10 sec: 5677.7, 60 sec: 5717.3, 300 sec: 5714.7). Total num frames: 115931136. Throughput: 0: 5949.0. Samples: 115935126. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:45,373][25689] Avg episode reward: [(0, '-53.626')] [2022-07-09 05:44:46,272][26022] Updated weights on worker 0-0, policy_version 113219 (0.00087) [2022-07-09 05:44:48,056][26022] Updated weights on worker 0-0, policy_version 113229 (0.00089) [2022-07-09 05:44:49,621][26022] Updated weights on worker 0-0, policy_version 113239 (0.00085) [2022-07-09 05:44:50,381][25689] Fps is (10 sec: 5992.7, 60 sec: 5751.7, 300 sec: 5722.5). Total num frames: 115961856. Throughput: 0: 5987.7. Samples: 115969858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:50,383][25689] Avg episode reward: [(0, '-52.946')] [2022-07-09 05:44:51,627][26022] Updated weights on worker 0-0, policy_version 113249 (0.00080) [2022-07-09 05:44:53,079][26022] Updated weights on worker 0-0, policy_version 113259 (0.00086) [2022-07-09 05:44:55,046][26022] Updated weights on worker 0-0, policy_version 113269 (0.00089) [2022-07-09 05:44:55,443][25689] Fps is (10 sec: 5897.7, 60 sec: 5735.3, 300 sec: 5718.0). Total num frames: 115990528. Throughput: 0: 6033.5. Samples: 115987420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 05:44:55,444][25689] Avg episode reward: [(0, '-52.681')] [2022-07-09 05:44:56,867][26022] Updated weights on worker 0-0, policy_version 113279 (0.00079) [2022-07-09 05:44:58,489][26022] Updated weights on worker 0-0, policy_version 113289 (0.00089) [2022-07-09 05:45:00,436][26022] Updated weights on worker 0-0, policy_version 113299 (0.00094) [2022-07-09 05:45:00,530][25689] Fps is (10 sec: 5549.6, 60 sec: 5732.2, 300 sec: 5720.1). Total num frames: 116018176. Throughput: 0: 6026.0. Samples: 116022172. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:00,530][25689] Avg episode reward: [(0, '-52.744')] [2022-07-09 05:45:02,434][26022] Updated weights on worker 0-0, policy_version 113309 (0.00085) [2022-07-09 05:45:04,267][26022] Updated weights on worker 0-0, policy_version 113319 (0.00088) [2022-07-09 05:45:05,544][25689] Fps is (10 sec: 5474.6, 60 sec: 5714.7, 300 sec: 5713.8). Total num frames: 116045824. Throughput: 0: 5911.7. Samples: 116054370. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:05,544][25689] Avg episode reward: [(0, '-52.025')] [2022-07-09 05:45:06,079][26022] Updated weights on worker 0-0, policy_version 113329 (0.00083) [2022-07-09 05:45:07,895][26022] Updated weights on worker 0-0, policy_version 113339 (0.00081) [2022-07-09 05:45:09,599][26022] Updated weights on worker 0-0, policy_version 113349 (0.00090) [2022-07-09 05:45:10,609][25689] Fps is (10 sec: 5587.5, 60 sec: 5732.0, 300 sec: 5717.2). Total num frames: 116074496. Throughput: 0: 5033.9. Samples: 116071684. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:10,610][25689] Avg episode reward: [(0, '-52.724')] [2022-07-09 05:45:11,494][26022] Updated weights on worker 0-0, policy_version 113359 (0.00088) [2022-07-09 05:45:13,065][26022] Updated weights on worker 0-0, policy_version 113369 (0.00085) [2022-07-09 05:45:15,115][26022] Updated weights on worker 0-0, policy_version 113379 (0.00079) [2022-07-09 05:45:15,685][25689] Fps is (10 sec: 5755.8, 60 sec: 5719.3, 300 sec: 5719.6). Total num frames: 116104192. Throughput: 0: 5881.2. Samples: 116106464. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:15,686][25689] Avg episode reward: [(0, '-52.529')] [2022-07-09 05:45:16,675][26022] Updated weights on worker 0-0, policy_version 113389 (0.00088) [2022-07-09 05:45:18,585][26022] Updated weights on worker 0-0, policy_version 113399 (0.00090) [2022-07-09 05:45:20,192][26022] Updated weights on worker 0-0, policy_version 113409 (0.00089) [2022-07-09 05:45:20,688][25689] Fps is (10 sec: 5791.2, 60 sec: 5719.7, 300 sec: 5713.2). Total num frames: 116132864. Throughput: 0: 5901.8. Samples: 116141142. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:20,690][25689] Avg episode reward: [(0, '-52.791')] [2022-07-09 05:45:22,003][26022] Updated weights on worker 0-0, policy_version 113419 (0.00087) [2022-07-09 05:45:23,814][26022] Updated weights on worker 0-0, policy_version 113429 (0.00086) [2022-07-09 05:45:25,606][26022] Updated weights on worker 0-0, policy_version 113439 (0.00093) [2022-07-09 05:45:25,710][25689] Fps is (10 sec: 5720.0, 60 sec: 5723.4, 300 sec: 5718.0). Total num frames: 116161536. Throughput: 0: 5155.9. Samples: 116158344. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:25,711][25689] Avg episode reward: [(0, '-52.159')] [2022-07-09 05:45:27,606][26022] Updated weights on worker 0-0, policy_version 113449 (0.00091) [2022-07-09 05:45:29,232][26022] Updated weights on worker 0-0, policy_version 113459 (0.00090) [2022-07-09 05:45:30,728][25689] Fps is (10 sec: 5711.4, 60 sec: 5713.3, 300 sec: 5718.9). Total num frames: 116190208. Throughput: 0: 5999.4. Samples: 116192386. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:30,729][25689] Avg episode reward: [(0, '-51.819')] [2022-07-09 05:45:31,191][26022] Updated weights on worker 0-0, policy_version 113469 (0.00090) [2022-07-09 05:45:32,964][26022] Updated weights on worker 0-0, policy_version 113479 (0.00090) [2022-07-09 05:45:34,709][26022] Updated weights on worker 0-0, policy_version 113489 (0.00098) [2022-07-09 05:45:35,792][25689] Fps is (10 sec: 5586.1, 60 sec: 5685.2, 300 sec: 5711.7). Total num frames: 116217856. Throughput: 0: 5958.3. Samples: 116226268. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:35,794][25689] Avg episode reward: [(0, '-52.207')] [2022-07-09 05:45:36,752][26022] Updated weights on worker 0-0, policy_version 113499 (0.00091) [2022-07-09 05:45:38,129][26022] Updated weights on worker 0-0, policy_version 113509 (0.00087) [2022-07-09 05:45:40,180][26022] Updated weights on worker 0-0, policy_version 113519 (0.00092) [2022-07-09 05:45:40,831][25689] Fps is (10 sec: 5676.3, 60 sec: 5717.9, 300 sec: 5712.6). Total num frames: 116247552. Throughput: 0: 5076.7. Samples: 116243400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:40,831][25689] Avg episode reward: [(0, '-52.596')] [2022-07-09 05:45:41,846][26022] Updated weights on worker 0-0, policy_version 113529 (0.00089) [2022-07-09 05:45:43,577][26022] Updated weights on worker 0-0, policy_version 113539 (0.00088) [2022-07-09 05:45:45,494][26022] Updated weights on worker 0-0, policy_version 113549 (0.00087) [2022-07-09 05:45:45,861][25689] Fps is (10 sec: 5797.0, 60 sec: 5705.1, 300 sec: 5709.1). Total num frames: 116276224. Throughput: 0: 5941.5. Samples: 116278070. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:45,861][25689] Avg episode reward: [(0, '-52.743')] [2022-07-09 05:45:47,403][26022] Updated weights on worker 0-0, policy_version 113559 (0.00089) [2022-07-09 05:45:48,827][26022] Updated weights on worker 0-0, policy_version 113569 (0.00086) [2022-07-09 05:45:50,879][25689] Fps is (10 sec: 5605.1, 60 sec: 5653.4, 300 sec: 5707.0). Total num frames: 116303872. Throughput: 0: 5970.2. Samples: 116312688. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:50,879][25689] Avg episode reward: [(0, '-52.901')] [2022-07-09 05:45:50,901][26022] Updated weights on worker 0-0, policy_version 113579 (0.00084) [2022-07-09 05:45:52,462][26022] Updated weights on worker 0-0, policy_version 113589 (0.00091) [2022-07-09 05:45:54,376][26022] Updated weights on worker 0-0, policy_version 113599 (0.00086) [2022-07-09 05:45:55,968][25689] Fps is (10 sec: 5875.9, 60 sec: 5701.6, 300 sec: 5719.9). Total num frames: 116335616. Throughput: 0: 5139.0. Samples: 116329954. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:45:55,969][25689] Avg episode reward: [(0, '-53.373')] [2022-07-09 05:45:55,969][26022] Updated weights on worker 0-0, policy_version 113609 (0.00085) [2022-07-09 05:45:57,702][26022] Updated weights on worker 0-0, policy_version 113619 (0.00080) [2022-07-09 05:45:59,487][26022] Updated weights on worker 0-0, policy_version 113629 (0.00083) [2022-07-09 05:46:01,042][25689] Fps is (10 sec: 5944.5, 60 sec: 5719.7, 300 sec: 5718.7). Total num frames: 116364288. Throughput: 0: 6022.2. Samples: 116365116. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:01,043][25689] Avg episode reward: [(0, '-54.021')] [2022-07-09 05:46:01,695][26022] Updated weights on worker 0-0, policy_version 113639 (0.00104) [2022-07-09 05:46:03,467][26022] Updated weights on worker 0-0, policy_version 113649 (0.00085) [2022-07-09 05:46:05,384][26022] Updated weights on worker 0-0, policy_version 113659 (0.00092) [2022-07-09 05:46:06,045][25689] Fps is (10 sec: 5385.8, 60 sec: 5686.9, 300 sec: 5708.6). Total num frames: 116389888. Throughput: 0: 5903.7. Samples: 116397234. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:06,046][25689] Avg episode reward: [(0, '-53.860')] [2022-07-09 05:46:07,052][26022] Updated weights on worker 0-0, policy_version 113669 (0.00089) [2022-07-09 05:46:09,064][26022] Updated weights on worker 0-0, policy_version 113679 (0.00086) [2022-07-09 05:46:10,615][26022] Updated weights on worker 0-0, policy_version 113689 (0.00086) [2022-07-09 05:46:11,074][25689] Fps is (10 sec: 5511.7, 60 sec: 5707.2, 300 sec: 5709.7). Total num frames: 116419584. Throughput: 0: 5040.9. Samples: 116414492. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:11,075][25689] Avg episode reward: [(0, '-53.471')] [2022-07-09 05:46:12,519][26022] Updated weights on worker 0-0, policy_version 113699 (0.00084) [2022-07-09 05:46:14,204][26022] Updated weights on worker 0-0, policy_version 113709 (0.00093) [2022-07-09 05:46:15,883][26022] Updated weights on worker 0-0, policy_version 113719 (0.00091) [2022-07-09 05:46:16,136][25689] Fps is (10 sec: 5886.0, 60 sec: 5708.6, 300 sec: 5713.3). Total num frames: 116449280. Throughput: 0: 5909.9. Samples: 116449140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:16,136][25689] Avg episode reward: [(0, '-53.060')] [2022-07-09 05:46:17,854][26022] Updated weights on worker 0-0, policy_version 113729 (0.00065) [2022-07-09 05:46:19,673][26022] Updated weights on worker 0-0, policy_version 113739 (0.00084) [2022-07-09 05:46:21,146][25689] Fps is (10 sec: 5897.2, 60 sec: 5724.9, 300 sec: 5714.0). Total num frames: 116478976. Throughput: 0: 5901.8. Samples: 116483764. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:21,146][25689] Avg episode reward: [(0, '-53.785')] [2022-07-09 05:46:21,155][26022] Updated weights on worker 0-0, policy_version 113749 (0.00081) [2022-07-09 05:46:23,132][26022] Updated weights on worker 0-0, policy_version 113759 (0.00103) [2022-07-09 05:46:24,607][26022] Updated weights on worker 0-0, policy_version 113769 (0.00059) [2022-07-09 05:46:26,207][25689] Fps is (10 sec: 5693.9, 60 sec: 5704.2, 300 sec: 5713.6). Total num frames: 116506624. Throughput: 0: 5150.1. Samples: 116501068. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:26,207][25689] Avg episode reward: [(0, '-53.987')] [2022-07-09 05:46:26,714][26022] Updated weights on worker 0-0, policy_version 113779 (0.00086) [2022-07-09 05:46:28,323][26022] Updated weights on worker 0-0, policy_version 113789 (0.00091) [2022-07-09 05:46:29,887][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:46:29,899][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000113797_116528128.pth [2022-07-09 05:46:29,900][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000111785_114467840.pth [2022-07-09 05:46:30,285][26022] Updated weights on worker 0-0, policy_version 113799 (0.00089) [2022-07-09 05:46:31,239][25689] Fps is (10 sec: 5681.5, 60 sec: 5719.9, 300 sec: 5710.5). Total num frames: 116536320. Throughput: 0: 6029.2. Samples: 116536066. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:31,239][25689] Avg episode reward: [(0, '-53.021')] [2022-07-09 05:46:31,751][26022] Updated weights on worker 0-0, policy_version 113809 (0.00085) [2022-07-09 05:46:33,788][26022] Updated weights on worker 0-0, policy_version 113819 (0.00099) [2022-07-09 05:46:35,469][26022] Updated weights on worker 0-0, policy_version 113829 (0.00084) [2022-07-09 05:46:36,303][25689] Fps is (10 sec: 5578.4, 60 sec: 5702.9, 300 sec: 5703.2). Total num frames: 116562944. Throughput: 0: 6003.6. Samples: 116570216. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:36,303][25689] Avg episode reward: [(0, '-53.173')] [2022-07-09 05:46:37,283][26022] Updated weights on worker 0-0, policy_version 113839 (0.00086) [2022-07-09 05:46:39,115][26022] Updated weights on worker 0-0, policy_version 113849 (0.00460) [2022-07-09 05:46:41,014][26022] Updated weights on worker 0-0, policy_version 113859 (0.00093) [2022-07-09 05:46:41,349][25689] Fps is (10 sec: 5671.7, 60 sec: 5719.1, 300 sec: 5706.2). Total num frames: 116593664. Throughput: 0: 5124.0. Samples: 116587288. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:41,350][25689] Avg episode reward: [(0, '-52.705')] [2022-07-09 05:46:42,608][26022] Updated weights on worker 0-0, policy_version 113869 (0.00087) [2022-07-09 05:46:44,764][26022] Updated weights on worker 0-0, policy_version 113879 (0.00089) [2022-07-09 05:46:46,035][26022] Updated weights on worker 0-0, policy_version 113889 (0.00092) [2022-07-09 05:46:46,371][25689] Fps is (10 sec: 6102.2, 60 sec: 5753.7, 300 sec: 5720.4). Total num frames: 116624384. Throughput: 0: 6000.3. Samples: 116622062. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:46,372][25689] Avg episode reward: [(0, '-52.758')] [2022-07-09 05:46:48,234][26022] Updated weights on worker 0-0, policy_version 113899 (0.00091) [2022-07-09 05:46:49,707][26022] Updated weights on worker 0-0, policy_version 113909 (0.00088) [2022-07-09 05:46:51,426][25689] Fps is (10 sec: 5588.8, 60 sec: 5716.3, 300 sec: 5704.5). Total num frames: 116649984. Throughput: 0: 5968.6. Samples: 116656558. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:51,428][25689] Avg episode reward: [(0, '-51.998')] [2022-07-09 05:46:51,703][26022] Updated weights on worker 0-0, policy_version 113919 (0.00083) [2022-07-09 05:46:53,336][26022] Updated weights on worker 0-0, policy_version 113929 (0.00091) [2022-07-09 05:46:55,057][26022] Updated weights on worker 0-0, policy_version 113939 (0.00081) [2022-07-09 05:46:56,481][25689] Fps is (10 sec: 5672.1, 60 sec: 5719.6, 300 sec: 5714.8). Total num frames: 116681728. Throughput: 0: 5123.3. Samples: 116673596. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2022-07-09 05:46:56,482][25689] Avg episode reward: [(0, '-52.559')] [2022-07-09 05:46:57,143][26022] Updated weights on worker 0-0, policy_version 113949 (0.00085) [2022-07-09 05:46:58,731][26022] Updated weights on worker 0-0, policy_version 113959 (0.00098) [2022-07-09 05:47:00,558][26022] Updated weights on worker 0-0, policy_version 113969 (0.00085) [2022-07-09 05:47:01,519][25689] Fps is (10 sec: 5884.7, 60 sec: 5706.1, 300 sec: 5721.1). Total num frames: 116709376. Throughput: 0: 5989.4. Samples: 116708094. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:01,521][25689] Avg episode reward: [(0, '-52.580')] [2022-07-09 05:47:02,722][26022] Updated weights on worker 0-0, policy_version 113979 (0.00092) [2022-07-09 05:47:04,498][26022] Updated weights on worker 0-0, policy_version 113989 (0.00086) [2022-07-09 05:47:06,545][25689] Fps is (10 sec: 5290.9, 60 sec: 5703.9, 300 sec: 5707.1). Total num frames: 116734976. Throughput: 0: 5839.5. Samples: 116739868. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:06,546][25689] Avg episode reward: [(0, '-52.500')] [2022-07-09 05:47:06,546][26022] Updated weights on worker 0-0, policy_version 113999 (0.00082) [2022-07-09 05:47:08,058][26022] Updated weights on worker 0-0, policy_version 114009 (0.00088) [2022-07-09 05:47:09,907][26022] Updated weights on worker 0-0, policy_version 114019 (0.00091) [2022-07-09 05:47:11,573][25689] Fps is (10 sec: 5601.5, 60 sec: 5720.9, 300 sec: 5716.0). Total num frames: 116765696. Throughput: 0: 4994.5. Samples: 116757182. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:11,577][26022] Updated weights on worker 0-0, policy_version 114029 (0.00097) [2022-07-09 05:47:11,577][25689] Avg episode reward: [(0, '-53.167')] [2022-07-09 05:47:13,484][26022] Updated weights on worker 0-0, policy_version 114039 (0.00086) [2022-07-09 05:47:15,354][26022] Updated weights on worker 0-0, policy_version 114049 (0.00088) [2022-07-09 05:47:16,645][25689] Fps is (10 sec: 5880.4, 60 sec: 5703.0, 300 sec: 5715.8). Total num frames: 116794368. Throughput: 0: 5860.6. Samples: 116791770. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:16,646][25689] Avg episode reward: [(0, '-53.811')] [2022-07-09 05:47:17,037][26022] Updated weights on worker 0-0, policy_version 114059 (0.00085) [2022-07-09 05:47:18,986][26022] Updated weights on worker 0-0, policy_version 114069 (0.00092) [2022-07-09 05:47:20,595][26022] Updated weights on worker 0-0, policy_version 114079 (0.00088) [2022-07-09 05:47:21,704][25689] Fps is (10 sec: 5761.5, 60 sec: 5698.4, 300 sec: 5715.7). Total num frames: 116824064. Throughput: 0: 5863.4. Samples: 116826448. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:21,705][25689] Avg episode reward: [(0, '-54.857')] [2022-07-09 05:47:22,376][26022] Updated weights on worker 0-0, policy_version 114089 (0.00083) [2022-07-09 05:47:24,214][26022] Updated weights on worker 0-0, policy_version 114099 (0.00096) [2022-07-09 05:47:25,759][26022] Updated weights on worker 0-0, policy_version 114109 (0.00088) [2022-07-09 05:47:26,735][25689] Fps is (10 sec: 5683.3, 60 sec: 5701.2, 300 sec: 5711.9). Total num frames: 116851712. Throughput: 0: 5998.7. Samples: 116860982. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:26,736][25689] Avg episode reward: [(0, '-54.828')] [2022-07-09 05:47:27,799][26022] Updated weights on worker 0-0, policy_version 114119 (0.00100) [2022-07-09 05:47:29,577][26022] Updated weights on worker 0-0, policy_version 114129 (0.00084) [2022-07-09 05:47:31,320][26022] Updated weights on worker 0-0, policy_version 114139 (0.00097) [2022-07-09 05:47:31,763][25689] Fps is (10 sec: 5599.3, 60 sec: 5684.8, 300 sec: 5716.3). Total num frames: 116880384. Throughput: 0: 5995.5. Samples: 116878226. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:31,763][25689] Avg episode reward: [(0, '-55.274')] [2022-07-09 05:47:33,125][26022] Updated weights on worker 0-0, policy_version 114149 (0.00082) [2022-07-09 05:47:34,954][26022] Updated weights on worker 0-0, policy_version 114159 (0.00086) [2022-07-09 05:47:36,806][25689] Fps is (10 sec: 5592.2, 60 sec: 5703.6, 300 sec: 5708.8). Total num frames: 116908032. Throughput: 0: 5961.5. Samples: 116911960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:36,807][25689] Avg episode reward: [(0, '-55.062')] [2022-07-09 05:47:36,842][26022] Updated weights on worker 0-0, policy_version 114169 (0.00081) [2022-07-09 05:47:38,563][26022] Updated weights on worker 0-0, policy_version 114179 (0.00082) [2022-07-09 05:47:40,528][26022] Updated weights on worker 0-0, policy_version 114189 (0.00094) [2022-07-09 05:47:41,810][25689] Fps is (10 sec: 5707.4, 60 sec: 5690.7, 300 sec: 5712.5). Total num frames: 116937728. Throughput: 0: 5954.4. Samples: 116946166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:41,811][25689] Avg episode reward: [(0, '-53.638')] [2022-07-09 05:47:42,088][26022] Updated weights on worker 0-0, policy_version 114199 (0.00106) [2022-07-09 05:47:44,031][26022] Updated weights on worker 0-0, policy_version 114209 (0.00091) [2022-07-09 05:47:45,602][26022] Updated weights on worker 0-0, policy_version 114219 (0.00092) [2022-07-09 05:47:46,847][25689] Fps is (10 sec: 5711.4, 60 sec: 5638.5, 300 sec: 5705.5). Total num frames: 116965376. Throughput: 0: 5092.4. Samples: 116963398. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:46,847][25689] Avg episode reward: [(0, '-53.552')] [2022-07-09 05:47:47,580][26022] Updated weights on worker 0-0, policy_version 114229 (0.00096) [2022-07-09 05:47:49,351][26022] Updated weights on worker 0-0, policy_version 114239 (0.00084) [2022-07-09 05:47:51,011][26022] Updated weights on worker 0-0, policy_version 114249 (0.00086) [2022-07-09 05:47:51,871][25689] Fps is (10 sec: 5699.9, 60 sec: 5709.2, 300 sec: 5710.9). Total num frames: 116995072. Throughput: 0: 5955.6. Samples: 116997982. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:51,871][25689] Avg episode reward: [(0, '-53.000')] [2022-07-09 05:47:52,947][26022] Updated weights on worker 0-0, policy_version 114259 (0.00090) [2022-07-09 05:47:54,620][26022] Updated weights on worker 0-0, policy_version 114269 (0.00095) [2022-07-09 05:47:56,535][26022] Updated weights on worker 0-0, policy_version 114279 (0.00094) [2022-07-09 05:47:56,955][25689] Fps is (10 sec: 5875.5, 60 sec: 5672.5, 300 sec: 5716.9). Total num frames: 117024768. Throughput: 0: 5983.1. Samples: 117032512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:47:56,956][25689] Avg episode reward: [(0, '-53.093')] [2022-07-09 05:47:58,200][26022] Updated weights on worker 0-0, policy_version 114289 (0.00087) [2022-07-09 05:47:59,953][26022] Updated weights on worker 0-0, policy_version 114299 (0.00083) [2022-07-09 05:48:01,973][25689] Fps is (10 sec: 5676.0, 60 sec: 5674.4, 300 sec: 5716.8). Total num frames: 117052416. Throughput: 0: 5140.3. Samples: 117049812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:01,974][25689] Avg episode reward: [(0, '-52.637')] [2022-07-09 05:48:02,000][26022] Updated weights on worker 0-0, policy_version 114309 (0.00095) [2022-07-09 05:48:03,941][26022] Updated weights on worker 0-0, policy_version 114319 (0.00090) [2022-07-09 05:48:05,835][26022] Updated weights on worker 0-0, policy_version 114329 (0.00087) [2022-07-09 05:48:06,996][25689] Fps is (10 sec: 5507.2, 60 sec: 5708.6, 300 sec: 5707.8). Total num frames: 117080064. Throughput: 0: 5872.4. Samples: 117081722. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:06,998][25689] Avg episode reward: [(0, '-54.093')] [2022-07-09 05:48:07,607][26022] Updated weights on worker 0-0, policy_version 114339 (0.00085) [2022-07-09 05:48:09,347][26022] Updated weights on worker 0-0, policy_version 114349 (0.00086) [2022-07-09 05:48:11,112][26022] Updated weights on worker 0-0, policy_version 114359 (0.00091) [2022-07-09 05:48:12,045][25689] Fps is (10 sec: 5592.1, 60 sec: 5672.8, 300 sec: 5707.8). Total num frames: 117108736. Throughput: 0: 5870.4. Samples: 117116412. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:12,045][25689] Avg episode reward: [(0, '-54.080')] [2022-07-09 05:48:12,827][26022] Updated weights on worker 0-0, policy_version 114369 (0.00088) [2022-07-09 05:48:14,838][26022] Updated weights on worker 0-0, policy_version 114379 (0.00091) [2022-07-09 05:48:16,511][26022] Updated weights on worker 0-0, policy_version 114389 (0.00088) [2022-07-09 05:48:17,159][25689] Fps is (10 sec: 5642.0, 60 sec: 5668.8, 300 sec: 5709.7). Total num frames: 117137408. Throughput: 0: 5006.9. Samples: 117133674. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:17,161][25689] Avg episode reward: [(0, '-54.099')] [2022-07-09 05:48:18,279][26022] Updated weights on worker 0-0, policy_version 114399 (0.00091) [2022-07-09 05:48:20,271][26022] Updated weights on worker 0-0, policy_version 114409 (0.00085) [2022-07-09 05:48:21,833][26022] Updated weights on worker 0-0, policy_version 114419 (0.00091) [2022-07-09 05:48:22,197][25689] Fps is (10 sec: 5749.1, 60 sec: 5670.7, 300 sec: 5705.8). Total num frames: 117167104. Throughput: 0: 5846.0. Samples: 117168040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:22,199][25689] Avg episode reward: [(0, '-53.393')] [2022-07-09 05:48:23,631][26022] Updated weights on worker 0-0, policy_version 114429 (0.00091) [2022-07-09 05:48:25,411][26022] Updated weights on worker 0-0, policy_version 114439 (0.00098) [2022-07-09 05:48:27,237][25689] Fps is (10 sec: 5690.0, 60 sec: 5669.9, 300 sec: 5702.4). Total num frames: 117194752. Throughput: 0: 5972.4. Samples: 117202614. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:27,243][25689] Avg episode reward: [(0, '-54.156')] [2022-07-09 05:48:27,256][26022] Updated weights on worker 0-0, policy_version 114449 (0.00093) [2022-07-09 05:48:29,127][26022] Updated weights on worker 0-0, policy_version 114459 (0.00090) [2022-07-09 05:48:29,967][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:48:29,985][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000114465_117212160.pth [2022-07-09 05:48:29,985][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000112457_115155968.pth [2022-07-09 05:48:30,696][26022] Updated weights on worker 0-0, policy_version 114469 (0.00083) [2022-07-09 05:48:32,289][25689] Fps is (10 sec: 5479.2, 60 sec: 5650.7, 300 sec: 5699.1). Total num frames: 117222400. Throughput: 0: 5107.1. Samples: 117219802. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:32,290][25689] Avg episode reward: [(0, '-53.753')] [2022-07-09 05:48:32,678][26022] Updated weights on worker 0-0, policy_version 114479 (0.00087) [2022-07-09 05:48:34,374][26022] Updated weights on worker 0-0, policy_version 114489 (0.00098) [2022-07-09 05:48:36,148][26022] Updated weights on worker 0-0, policy_version 114499 (0.00094) [2022-07-09 05:48:37,337][25689] Fps is (10 sec: 5880.8, 60 sec: 5718.0, 300 sec: 5709.6). Total num frames: 117254144. Throughput: 0: 5966.3. Samples: 117254058. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:37,339][25689] Avg episode reward: [(0, '-53.902')] [2022-07-09 05:48:37,836][26022] Updated weights on worker 0-0, policy_version 114509 (0.00092) [2022-07-09 05:48:39,799][26022] Updated weights on worker 0-0, policy_version 114519 (0.00085) [2022-07-09 05:48:41,320][26022] Updated weights on worker 0-0, policy_version 114529 (0.00083) [2022-07-09 05:48:42,353][25689] Fps is (10 sec: 5901.4, 60 sec: 5682.9, 300 sec: 5703.1). Total num frames: 117281792. Throughput: 0: 5979.9. Samples: 117288572. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:42,355][25689] Avg episode reward: [(0, '-53.368')] [2022-07-09 05:48:43,336][26022] Updated weights on worker 0-0, policy_version 114539 (0.00082) [2022-07-09 05:48:44,979][26022] Updated weights on worker 0-0, policy_version 114549 (0.00105) [2022-07-09 05:48:47,021][26022] Updated weights on worker 0-0, policy_version 114559 (0.00095) [2022-07-09 05:48:47,366][25689] Fps is (10 sec: 5717.9, 60 sec: 5719.1, 300 sec: 5706.6). Total num frames: 117311488. Throughput: 0: 5132.0. Samples: 117305912. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:47,367][25689] Avg episode reward: [(0, '-53.515')] [2022-07-09 05:48:48,645][26022] Updated weights on worker 0-0, policy_version 114569 (0.00089) [2022-07-09 05:48:50,393][26022] Updated weights on worker 0-0, policy_version 114579 (0.00087) [2022-07-09 05:48:52,283][26022] Updated weights on worker 0-0, policy_version 114589 (0.00082) [2022-07-09 05:48:52,379][25689] Fps is (10 sec: 5719.9, 60 sec: 5686.2, 300 sec: 5700.7). Total num frames: 117339136. Throughput: 0: 5999.2. Samples: 117340322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:52,381][25689] Avg episode reward: [(0, '-53.512')] [2022-07-09 05:48:54,235][26022] Updated weights on worker 0-0, policy_version 114599 (0.00085) [2022-07-09 05:48:55,726][26022] Updated weights on worker 0-0, policy_version 114609 (0.00088) [2022-07-09 05:48:57,467][25689] Fps is (10 sec: 5575.2, 60 sec: 5668.9, 300 sec: 5703.5). Total num frames: 117367808. Throughput: 0: 5988.9. Samples: 117374618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 05:48:57,470][25689] Avg episode reward: [(0, '-52.932')] [2022-07-09 05:48:57,676][26022] Updated weights on worker 0-0, policy_version 114619 (0.00092) [2022-07-09 05:48:59,328][26022] Updated weights on worker 0-0, policy_version 114629 (0.00094) [2022-07-09 05:49:01,386][26022] Updated weights on worker 0-0, policy_version 114639 (0.00289) [2022-07-09 05:49:02,482][25689] Fps is (10 sec: 5574.7, 60 sec: 5669.3, 300 sec: 5699.9). Total num frames: 117395456. Throughput: 0: 5142.3. Samples: 117392078. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:02,484][25689] Avg episode reward: [(0, '-52.174')] [2022-07-09 05:49:03,375][26022] Updated weights on worker 0-0, policy_version 114649 (0.00086) [2022-07-09 05:49:05,271][26022] Updated weights on worker 0-0, policy_version 114659 (0.00094) [2022-07-09 05:49:06,924][26022] Updated weights on worker 0-0, policy_version 114669 (0.00083) [2022-07-09 05:49:07,515][25689] Fps is (10 sec: 5503.3, 60 sec: 5668.3, 300 sec: 5700.6). Total num frames: 117423104. Throughput: 0: 5857.4. Samples: 117423938. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:07,517][25689] Avg episode reward: [(0, '-52.298')] [2022-07-09 05:49:08,744][26022] Updated weights on worker 0-0, policy_version 114679 (0.00088) [2022-07-09 05:49:10,635][26022] Updated weights on worker 0-0, policy_version 114689 (0.00091) [2022-07-09 05:49:12,478][26022] Updated weights on worker 0-0, policy_version 114699 (0.00091) [2022-07-09 05:49:12,537][25689] Fps is (10 sec: 5600.8, 60 sec: 5670.8, 300 sec: 5695.6). Total num frames: 117451776. Throughput: 0: 5854.4. Samples: 117458338. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:12,539][25689] Avg episode reward: [(0, '-52.502')] [2022-07-09 05:49:14,099][26022] Updated weights on worker 0-0, policy_version 114709 (0.00098) [2022-07-09 05:49:16,022][26022] Updated weights on worker 0-0, policy_version 114719 (0.00114) [2022-07-09 05:49:17,626][25689] Fps is (10 sec: 5772.9, 60 sec: 5690.2, 300 sec: 5697.5). Total num frames: 117481472. Throughput: 0: 5008.0. Samples: 117475572. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:17,628][25689] Avg episode reward: [(0, '-52.173')] [2022-07-09 05:49:17,647][26022] Updated weights on worker 0-0, policy_version 114729 (0.00092) [2022-07-09 05:49:19,451][26022] Updated weights on worker 0-0, policy_version 114739 (0.00090) [2022-07-09 05:49:21,162][26022] Updated weights on worker 0-0, policy_version 114749 (0.00091) [2022-07-09 05:49:22,670][25689] Fps is (10 sec: 5760.7, 60 sec: 5672.7, 300 sec: 5697.8). Total num frames: 117510144. Throughput: 0: 5850.2. Samples: 117510180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:22,670][25689] Avg episode reward: [(0, '-51.445')] [2022-07-09 05:49:23,166][26022] Updated weights on worker 0-0, policy_version 114759 (0.00092) [2022-07-09 05:49:24,843][26022] Updated weights on worker 0-0, policy_version 114769 (0.00083) [2022-07-09 05:49:26,662][26022] Updated weights on worker 0-0, policy_version 114779 (0.00084) [2022-07-09 05:49:27,728][25689] Fps is (10 sec: 5778.2, 60 sec: 5704.9, 300 sec: 5698.5). Total num frames: 117539840. Throughput: 0: 5973.0. Samples: 117544666. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:27,730][25689] Avg episode reward: [(0, '-51.163')] [2022-07-09 05:49:28,285][26022] Updated weights on worker 0-0, policy_version 114789 (0.00090) [2022-07-09 05:49:30,263][26022] Updated weights on worker 0-0, policy_version 114799 (0.00088) [2022-07-09 05:49:32,033][26022] Updated weights on worker 0-0, policy_version 114809 (0.00089) [2022-07-09 05:49:32,731][25689] Fps is (10 sec: 5902.9, 60 sec: 5743.3, 300 sec: 5700.8). Total num frames: 117569536. Throughput: 0: 5128.7. Samples: 117561904. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:32,732][25689] Avg episode reward: [(0, '-52.048')] [2022-07-09 05:49:33,902][26022] Updated weights on worker 0-0, policy_version 114819 (0.00085) [2022-07-09 05:49:35,617][26022] Updated weights on worker 0-0, policy_version 114829 (0.00088) [2022-07-09 05:49:37,611][26022] Updated weights on worker 0-0, policy_version 114839 (0.00095) [2022-07-09 05:49:37,798][25689] Fps is (10 sec: 5592.3, 60 sec: 5656.7, 300 sec: 5696.6). Total num frames: 117596160. Throughput: 0: 5957.3. Samples: 117595746. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:37,799][25689] Avg episode reward: [(0, '-52.523')] [2022-07-09 05:49:39,206][26022] Updated weights on worker 0-0, policy_version 114849 (0.00091) [2022-07-09 05:49:41,175][26022] Updated weights on worker 0-0, policy_version 114859 (0.00080) [2022-07-09 05:49:42,716][26022] Updated weights on worker 0-0, policy_version 114869 (0.00087) [2022-07-09 05:49:42,802][25689] Fps is (10 sec: 5592.4, 60 sec: 5691.9, 300 sec: 5697.9). Total num frames: 117625856. Throughput: 0: 5939.5. Samples: 117629756. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:42,803][25689] Avg episode reward: [(0, '-52.845')] [2022-07-09 05:49:44,885][26022] Updated weights on worker 0-0, policy_version 114879 (0.00087) [2022-07-09 05:49:46,230][26022] Updated weights on worker 0-0, policy_version 114889 (0.00086) [2022-07-09 05:49:47,816][25689] Fps is (10 sec: 5622.3, 60 sec: 5640.9, 300 sec: 5684.0). Total num frames: 117652480. Throughput: 0: 5097.5. Samples: 117647066. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:47,816][25689] Avg episode reward: [(0, '-52.728')] [2022-07-09 05:49:48,295][26022] Updated weights on worker 0-0, policy_version 114899 (0.00083) [2022-07-09 05:49:49,900][26022] Updated weights on worker 0-0, policy_version 114909 (0.00089) [2022-07-09 05:49:51,856][26022] Updated weights on worker 0-0, policy_version 114919 (0.00091) [2022-07-09 05:49:52,836][25689] Fps is (10 sec: 5816.9, 60 sec: 5708.0, 300 sec: 5695.1). Total num frames: 117684224. Throughput: 0: 5969.4. Samples: 117681920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:52,837][25689] Avg episode reward: [(0, '-53.317')] [2022-07-09 05:49:53,616][26022] Updated weights on worker 0-0, policy_version 114929 (0.00086) [2022-07-09 05:49:55,273][26022] Updated weights on worker 0-0, policy_version 114939 (0.00088) [2022-07-09 05:49:56,991][26022] Updated weights on worker 0-0, policy_version 114949 (0.00094) [2022-07-09 05:49:57,948][25689] Fps is (10 sec: 5861.9, 60 sec: 5688.9, 300 sec: 5694.7). Total num frames: 117711872. Throughput: 0: 6000.3. Samples: 117716646. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:49:57,948][25689] Avg episode reward: [(0, '-53.917')] [2022-07-09 05:49:58,767][26022] Updated weights on worker 0-0, policy_version 114959 (0.00091) [2022-07-09 05:50:00,561][26022] Updated weights on worker 0-0, policy_version 114969 (0.00092) [2022-07-09 05:50:02,784][26022] Updated weights on worker 0-0, policy_version 114979 (0.00057) [2022-07-09 05:50:02,953][25689] Fps is (10 sec: 5466.0, 60 sec: 5689.7, 300 sec: 5694.8). Total num frames: 117739520. Throughput: 0: 5936.6. Samples: 117749382. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:02,953][25689] Avg episode reward: [(0, '-53.198')] [2022-07-09 05:50:04,412][26022] Updated weights on worker 0-0, policy_version 114989 (0.00092) [2022-07-09 05:50:06,367][26022] Updated weights on worker 0-0, policy_version 114999 (0.00089) [2022-07-09 05:50:07,969][25689] Fps is (10 sec: 5620.1, 60 sec: 5708.3, 300 sec: 5695.8). Total num frames: 117768192. Throughput: 0: 5927.0. Samples: 117766514. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:07,969][25689] Avg episode reward: [(0, '-52.841')] [2022-07-09 05:50:08,116][26022] Updated weights on worker 0-0, policy_version 115009 (0.00088) [2022-07-09 05:50:09,970][26022] Updated weights on worker 0-0, policy_version 115019 (0.00091) [2022-07-09 05:50:11,672][26022] Updated weights on worker 0-0, policy_version 115029 (0.00091) [2022-07-09 05:50:12,979][25689] Fps is (10 sec: 5617.1, 60 sec: 5692.5, 300 sec: 5690.1). Total num frames: 117795840. Throughput: 0: 5890.4. Samples: 117800570. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:12,980][25689] Avg episode reward: [(0, '-52.952')] [2022-07-09 05:50:13,368][26022] Updated weights on worker 0-0, policy_version 115039 (0.00084) [2022-07-09 05:50:15,271][26022] Updated weights on worker 0-0, policy_version 115049 (0.00094) [2022-07-09 05:50:17,091][26022] Updated weights on worker 0-0, policy_version 115059 (0.00095) [2022-07-09 05:50:18,085][25689] Fps is (10 sec: 5668.4, 60 sec: 5690.9, 300 sec: 5691.6). Total num frames: 117825536. Throughput: 0: 5901.0. Samples: 117835478. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:18,086][25689] Avg episode reward: [(0, '-52.700')] [2022-07-09 05:50:18,764][26022] Updated weights on worker 0-0, policy_version 115069 (0.00083) [2022-07-09 05:50:20,612][26022] Updated weights on worker 0-0, policy_version 115079 (0.00084) [2022-07-09 05:50:22,298][26022] Updated weights on worker 0-0, policy_version 115089 (0.00091) [2022-07-09 05:50:23,102][25689] Fps is (10 sec: 5867.3, 60 sec: 5710.3, 300 sec: 5695.2). Total num frames: 117855232. Throughput: 0: 5127.4. Samples: 117852694. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:23,102][25689] Avg episode reward: [(0, '-52.653')] [2022-07-09 05:50:24,349][26022] Updated weights on worker 0-0, policy_version 115099 (0.00090) [2022-07-09 05:50:25,876][26022] Updated weights on worker 0-0, policy_version 115109 (0.00089) [2022-07-09 05:50:27,822][26022] Updated weights on worker 0-0, policy_version 115119 (0.00089) [2022-07-09 05:50:28,119][25689] Fps is (10 sec: 5817.3, 60 sec: 5697.3, 300 sec: 5695.2). Total num frames: 117883904. Throughput: 0: 5968.8. Samples: 117886784. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:28,119][25689] Avg episode reward: [(0, '-52.577')] [2022-07-09 05:50:29,540][26022] Updated weights on worker 0-0, policy_version 115129 (0.00088) [2022-07-09 05:50:30,074][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:50:30,096][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000115132_117895168.pth [2022-07-09 05:50:30,096][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000113128_115843072.pth [2022-07-09 05:50:31,422][26022] Updated weights on worker 0-0, policy_version 115139 (0.00087) [2022-07-09 05:50:33,051][26022] Updated weights on worker 0-0, policy_version 115149 (0.00085) [2022-07-09 05:50:33,171][25689] Fps is (10 sec: 5695.1, 60 sec: 5675.7, 300 sec: 5698.9). Total num frames: 117912576. Throughput: 0: 5982.2. Samples: 117921360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:33,171][25689] Avg episode reward: [(0, '-52.839')] [2022-07-09 05:50:34,898][26022] Updated weights on worker 0-0, policy_version 115159 (0.00082) [2022-07-09 05:50:36,751][26022] Updated weights on worker 0-0, policy_version 115169 (0.00088) [2022-07-09 05:50:38,234][25689] Fps is (10 sec: 5770.4, 60 sec: 5727.0, 300 sec: 5698.4). Total num frames: 117942272. Throughput: 0: 5112.1. Samples: 117938478. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:38,235][25689] Avg episode reward: [(0, '-52.811')] [2022-07-09 05:50:38,586][26022] Updated weights on worker 0-0, policy_version 115179 (0.00088) [2022-07-09 05:50:40,391][26022] Updated weights on worker 0-0, policy_version 115189 (0.00090) [2022-07-09 05:50:42,168][26022] Updated weights on worker 0-0, policy_version 115199 (0.00092) [2022-07-09 05:50:43,256][25689] Fps is (10 sec: 5686.0, 60 sec: 5691.3, 300 sec: 5695.1). Total num frames: 117969920. Throughput: 0: 5940.5. Samples: 117972420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:43,256][25689] Avg episode reward: [(0, '-53.425')] [2022-07-09 05:50:43,918][26022] Updated weights on worker 0-0, policy_version 115209 (0.00082) [2022-07-09 05:50:45,858][26022] Updated weights on worker 0-0, policy_version 115219 (0.00082) [2022-07-09 05:50:47,325][26022] Updated weights on worker 0-0, policy_version 115229 (0.00060) [2022-07-09 05:50:48,338][25689] Fps is (10 sec: 5674.9, 60 sec: 5735.6, 300 sec: 5700.8). Total num frames: 117999616. Throughput: 0: 5959.3. Samples: 118007280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:48,339][25689] Avg episode reward: [(0, '-53.143')] [2022-07-09 05:50:49,380][26022] Updated weights on worker 0-0, policy_version 115239 (0.00093) [2022-07-09 05:50:50,790][26022] Updated weights on worker 0-0, policy_version 115249 (0.00071) [2022-07-09 05:50:52,918][26022] Updated weights on worker 0-0, policy_version 115259 (0.00089) [2022-07-09 05:50:53,394][25689] Fps is (10 sec: 5858.2, 60 sec: 5698.5, 300 sec: 5694.5). Total num frames: 118029312. Throughput: 0: 5105.0. Samples: 118024600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:53,395][25689] Avg episode reward: [(0, '-53.237')] [2022-07-09 05:50:54,444][26022] Updated weights on worker 0-0, policy_version 115269 (0.00089) [2022-07-09 05:50:56,247][26022] Updated weights on worker 0-0, policy_version 115279 (0.00084) [2022-07-09 05:50:58,130][26022] Updated weights on worker 0-0, policy_version 115289 (0.00097) [2022-07-09 05:50:58,462][25689] Fps is (10 sec: 5664.6, 60 sec: 5702.6, 300 sec: 5691.2). Total num frames: 118056960. Throughput: 0: 5966.1. Samples: 118059160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:50:58,462][25689] Avg episode reward: [(0, '-53.217')] [2022-07-09 05:50:59,853][26022] Updated weights on worker 0-0, policy_version 115299 (0.00079) [2022-07-09 05:51:01,994][26022] Updated weights on worker 0-0, policy_version 115309 (0.00085) [2022-07-09 05:51:03,532][25689] Fps is (10 sec: 5454.4, 60 sec: 5696.5, 300 sec: 5696.8). Total num frames: 118084608. Throughput: 0: 5883.4. Samples: 118091712. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2022-07-09 05:51:03,532][25689] Avg episode reward: [(0, '-53.429')] [2022-07-09 05:51:03,951][26022] Updated weights on worker 0-0, policy_version 115319 (0.00087) [2022-07-09 05:51:05,516][26022] Updated weights on worker 0-0, policy_version 115329 (0.00097) [2022-07-09 05:51:07,578][26022] Updated weights on worker 0-0, policy_version 115339 (0.00090) [2022-07-09 05:51:08,613][25689] Fps is (10 sec: 5548.3, 60 sec: 5690.4, 300 sec: 5692.4). Total num frames: 118113280. Throughput: 0: 5021.5. Samples: 118109086. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:08,613][25689] Avg episode reward: [(0, '-53.074')] [2022-07-09 05:51:09,022][26022] Updated weights on worker 0-0, policy_version 115349 (0.00088) [2022-07-09 05:51:11,015][26022] Updated weights on worker 0-0, policy_version 115359 (0.00084) [2022-07-09 05:51:12,838][26022] Updated weights on worker 0-0, policy_version 115369 (0.00085) [2022-07-09 05:51:13,637][25689] Fps is (10 sec: 5775.9, 60 sec: 5722.8, 300 sec: 5693.1). Total num frames: 118142976. Throughput: 0: 5861.1. Samples: 118143248. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:13,638][25689] Avg episode reward: [(0, '-53.754')] [2022-07-09 05:51:14,556][26022] Updated weights on worker 0-0, policy_version 115379 (0.00084) [2022-07-09 05:51:16,429][26022] Updated weights on worker 0-0, policy_version 115389 (0.00085) [2022-07-09 05:51:18,134][26022] Updated weights on worker 0-0, policy_version 115399 (0.00089) [2022-07-09 05:51:18,690][25689] Fps is (10 sec: 5791.7, 60 sec: 5710.9, 300 sec: 5688.8). Total num frames: 118171648. Throughput: 0: 5869.1. Samples: 118177884. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:18,691][25689] Avg episode reward: [(0, '-53.067')] [2022-07-09 05:51:19,984][26022] Updated weights on worker 0-0, policy_version 115409 (0.00088) [2022-07-09 05:51:21,671][26022] Updated weights on worker 0-0, policy_version 115419 (0.00081) [2022-07-09 05:51:23,467][26022] Updated weights on worker 0-0, policy_version 115429 (0.00087) [2022-07-09 05:51:23,695][25689] Fps is (10 sec: 5701.8, 60 sec: 5695.2, 300 sec: 5693.3). Total num frames: 118200320. Throughput: 0: 5132.7. Samples: 118195202. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:23,695][25689] Avg episode reward: [(0, '-53.097')] [2022-07-09 05:51:25,235][26022] Updated weights on worker 0-0, policy_version 115439 (0.00085) [2022-07-09 05:51:27,044][26022] Updated weights on worker 0-0, policy_version 115449 (0.00098) [2022-07-09 05:51:28,701][25689] Fps is (10 sec: 5728.5, 60 sec: 5696.2, 300 sec: 5690.4). Total num frames: 118228992. Throughput: 0: 5999.7. Samples: 118229608. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:28,701][25689] Avg episode reward: [(0, '-53.911')] [2022-07-09 05:51:28,758][26022] Updated weights on worker 0-0, policy_version 115459 (0.00089) [2022-07-09 05:51:30,506][26022] Updated weights on worker 0-0, policy_version 115469 (0.00088) [2022-07-09 05:51:32,414][26022] Updated weights on worker 0-0, policy_version 115479 (0.00087) [2022-07-09 05:51:33,718][25689] Fps is (10 sec: 5721.3, 60 sec: 5699.5, 300 sec: 5698.2). Total num frames: 118257664. Throughput: 0: 6031.1. Samples: 118264354. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:33,718][25689] Avg episode reward: [(0, '-54.631')] [2022-07-09 05:51:34,047][26022] Updated weights on worker 0-0, policy_version 115489 (0.00054) [2022-07-09 05:51:36,064][26022] Updated weights on worker 0-0, policy_version 115499 (0.00087) [2022-07-09 05:51:37,772][26022] Updated weights on worker 0-0, policy_version 115509 (0.00093) [2022-07-09 05:51:38,772][25689] Fps is (10 sec: 5592.0, 60 sec: 5666.5, 300 sec: 5687.7). Total num frames: 118285312. Throughput: 0: 5171.0. Samples: 118281728. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:38,773][25689] Avg episode reward: [(0, '-54.561')] [2022-07-09 05:51:39,396][26022] Updated weights on worker 0-0, policy_version 115519 (0.00089) [2022-07-09 05:51:41,507][26022] Updated weights on worker 0-0, policy_version 115529 (0.00089) [2022-07-09 05:51:42,903][26022] Updated weights on worker 0-0, policy_version 115539 (0.00109) [2022-07-09 05:51:43,777][25689] Fps is (10 sec: 5802.6, 60 sec: 5718.9, 300 sec: 5688.0). Total num frames: 118316032. Throughput: 0: 6021.4. Samples: 118316122. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:43,777][25689] Avg episode reward: [(0, '-55.647')] [2022-07-09 05:51:44,883][26022] Updated weights on worker 0-0, policy_version 115549 (0.00080) [2022-07-09 05:51:46,594][26022] Updated weights on worker 0-0, policy_version 115559 (0.00084) [2022-07-09 05:51:48,336][26022] Updated weights on worker 0-0, policy_version 115569 (0.00082) [2022-07-09 05:51:48,794][25689] Fps is (10 sec: 6028.5, 60 sec: 5725.1, 300 sec: 5702.5). Total num frames: 118345728. Throughput: 0: 6031.9. Samples: 118350810. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:48,795][25689] Avg episode reward: [(0, '-55.592')] [2022-07-09 05:51:50,343][26022] Updated weights on worker 0-0, policy_version 115579 (0.00094) [2022-07-09 05:51:51,796][26022] Updated weights on worker 0-0, policy_version 115589 (0.00088) [2022-07-09 05:51:53,815][25689] Fps is (10 sec: 5610.6, 60 sec: 5677.5, 300 sec: 5686.0). Total num frames: 118372352. Throughput: 0: 5162.6. Samples: 118368108. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:53,816][25689] Avg episode reward: [(0, '-55.023')] [2022-07-09 05:51:53,974][26022] Updated weights on worker 0-0, policy_version 115599 (0.00104) [2022-07-09 05:51:55,526][26022] Updated weights on worker 0-0, policy_version 115609 (0.00088) [2022-07-09 05:51:57,365][26022] Updated weights on worker 0-0, policy_version 115619 (0.00087) [2022-07-09 05:51:58,911][25689] Fps is (10 sec: 5667.9, 60 sec: 5725.6, 300 sec: 5695.2). Total num frames: 118403072. Throughput: 0: 5996.3. Samples: 118402488. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:51:58,912][25689] Avg episode reward: [(0, '-53.482')] [2022-07-09 05:51:59,054][26022] Updated weights on worker 0-0, policy_version 115629 (0.00090) [2022-07-09 05:52:00,866][26022] Updated weights on worker 0-0, policy_version 115639 (0.00091) [2022-07-09 05:52:03,171][26022] Updated weights on worker 0-0, policy_version 115649 (0.00090) [2022-07-09 05:52:03,923][25689] Fps is (10 sec: 5774.5, 60 sec: 5731.2, 300 sec: 5702.3). Total num frames: 118430720. Throughput: 0: 5877.8. Samples: 118434536. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:03,925][25689] Avg episode reward: [(0, '-53.403')] [2022-07-09 05:52:04,888][26022] Updated weights on worker 0-0, policy_version 115659 (0.00087) [2022-07-09 05:52:06,624][26022] Updated weights on worker 0-0, policy_version 115669 (0.00089) [2022-07-09 05:52:08,633][26022] Updated weights on worker 0-0, policy_version 115679 (0.00091) [2022-07-09 05:52:08,952][25689] Fps is (10 sec: 5405.3, 60 sec: 5702.1, 300 sec: 5688.5). Total num frames: 118457344. Throughput: 0: 5006.6. Samples: 118451732. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:08,952][25689] Avg episode reward: [(0, '-53.075')] [2022-07-09 05:52:10,048][26022] Updated weights on worker 0-0, policy_version 115689 (0.00088) [2022-07-09 05:52:12,115][26022] Updated weights on worker 0-0, policy_version 115699 (0.00084) [2022-07-09 05:52:13,825][26022] Updated weights on worker 0-0, policy_version 115709 (0.00088) [2022-07-09 05:52:13,961][25689] Fps is (10 sec: 5508.2, 60 sec: 5686.6, 300 sec: 5689.7). Total num frames: 118486016. Throughput: 0: 5847.2. Samples: 118485910. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:13,962][25689] Avg episode reward: [(0, '-53.321')] [2022-07-09 05:52:15,614][26022] Updated weights on worker 0-0, policy_version 115719 (0.00087) [2022-07-09 05:52:17,524][26022] Updated weights on worker 0-0, policy_version 115729 (0.00085) [2022-07-09 05:52:18,905][26022] Updated weights on worker 0-0, policy_version 115739 (0.00091) [2022-07-09 05:52:19,007][25689] Fps is (10 sec: 6008.4, 60 sec: 5738.2, 300 sec: 5696.9). Total num frames: 118517760. Throughput: 0: 5872.9. Samples: 118520508. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:19,008][25689] Avg episode reward: [(0, '-53.713')] [2022-07-09 05:52:21,098][26022] Updated weights on worker 0-0, policy_version 115749 (0.00087) [2022-07-09 05:52:22,791][26022] Updated weights on worker 0-0, policy_version 115759 (0.00468) [2022-07-09 05:52:24,078][25689] Fps is (10 sec: 5668.0, 60 sec: 5681.0, 300 sec: 5689.2). Total num frames: 118543360. Throughput: 0: 5120.6. Samples: 118537746. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:24,079][25689] Avg episode reward: [(0, '-53.144')] [2022-07-09 05:52:24,624][26022] Updated weights on worker 0-0, policy_version 115769 (0.00087) [2022-07-09 05:52:26,322][26022] Updated weights on worker 0-0, policy_version 115779 (0.00087) [2022-07-09 05:52:27,985][26022] Updated weights on worker 0-0, policy_version 115789 (0.00085) [2022-07-09 05:52:29,100][25689] Fps is (10 sec: 5580.5, 60 sec: 5713.5, 300 sec: 5696.2). Total num frames: 118574080. Throughput: 0: 5990.9. Samples: 118572434. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:29,100][25689] Avg episode reward: [(0, '-54.022')] [2022-07-09 05:52:29,839][26022] Updated weights on worker 0-0, policy_version 115799 (0.00084) [2022-07-09 05:52:30,220][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:52:30,238][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000115801_118580224.pth [2022-07-09 05:52:30,238][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000113797_116528128.pth [2022-07-09 05:52:31,573][26022] Updated weights on worker 0-0, policy_version 115809 (0.00084) [2022-07-09 05:52:33,299][26022] Updated weights on worker 0-0, policy_version 115819 (0.00084) [2022-07-09 05:52:34,178][25689] Fps is (10 sec: 5880.9, 60 sec: 5707.7, 300 sec: 5699.0). Total num frames: 118602752. Throughput: 0: 5986.7. Samples: 118606938. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:34,178][25689] Avg episode reward: [(0, '-53.980')] [2022-07-09 05:52:35,303][26022] Updated weights on worker 0-0, policy_version 115829 (0.00054) [2022-07-09 05:52:36,944][26022] Updated weights on worker 0-0, policy_version 115839 (0.00091) [2022-07-09 05:52:38,963][26022] Updated weights on worker 0-0, policy_version 115849 (0.00086) [2022-07-09 05:52:39,218][25689] Fps is (10 sec: 5566.1, 60 sec: 5709.1, 300 sec: 5691.4). Total num frames: 118630400. Throughput: 0: 5108.6. Samples: 118623756. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:39,218][25689] Avg episode reward: [(0, '-53.038')] [2022-07-09 05:52:40,564][26022] Updated weights on worker 0-0, policy_version 115859 (0.00085) [2022-07-09 05:52:42,607][26022] Updated weights on worker 0-0, policy_version 115869 (0.00087) [2022-07-09 05:52:44,143][26022] Updated weights on worker 0-0, policy_version 115879 (0.00091) [2022-07-09 05:52:44,222][25689] Fps is (10 sec: 5709.3, 60 sec: 5692.2, 300 sec: 5698.9). Total num frames: 118660096. Throughput: 0: 5946.5. Samples: 118657526. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:44,222][25689] Avg episode reward: [(0, '-53.206')] [2022-07-09 05:52:46,107][26022] Updated weights on worker 0-0, policy_version 115889 (0.00082) [2022-07-09 05:52:47,810][26022] Updated weights on worker 0-0, policy_version 115899 (0.00096) [2022-07-09 05:52:49,265][25689] Fps is (10 sec: 5809.3, 60 sec: 5672.8, 300 sec: 5695.1). Total num frames: 118688768. Throughput: 0: 5929.6. Samples: 118692008. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:49,266][25689] Avg episode reward: [(0, '-53.241')] [2022-07-09 05:52:49,607][26022] Updated weights on worker 0-0, policy_version 115909 (0.00102) [2022-07-09 05:52:51,287][26022] Updated weights on worker 0-0, policy_version 115919 (0.00071) [2022-07-09 05:52:53,440][26022] Updated weights on worker 0-0, policy_version 115929 (0.00094) [2022-07-09 05:52:54,271][25689] Fps is (10 sec: 5604.3, 60 sec: 5691.1, 300 sec: 5689.7). Total num frames: 118716416. Throughput: 0: 5953.5. Samples: 118726564. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:54,271][25689] Avg episode reward: [(0, '-52.838')] [2022-07-09 05:52:54,981][26022] Updated weights on worker 0-0, policy_version 115939 (0.00085) [2022-07-09 05:52:56,683][26022] Updated weights on worker 0-0, policy_version 115949 (0.00684) [2022-07-09 05:52:58,471][26022] Updated weights on worker 0-0, policy_version 115959 (0.00089) [2022-07-09 05:52:59,373][25689] Fps is (10 sec: 5673.0, 60 sec: 5673.6, 300 sec: 5695.0). Total num frames: 118746112. Throughput: 0: 5956.1. Samples: 118743804. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:52:59,374][25689] Avg episode reward: [(0, '-52.464')] [2022-07-09 05:53:00,446][26022] Updated weights on worker 0-0, policy_version 115969 (0.00084) [2022-07-09 05:53:02,499][26022] Updated weights on worker 0-0, policy_version 115979 (0.00087) [2022-07-09 05:53:04,379][25689] Fps is (10 sec: 5470.7, 60 sec: 5640.3, 300 sec: 5688.5). Total num frames: 118771712. Throughput: 0: 5884.8. Samples: 118776146. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 05:53:04,380][25689] Avg episode reward: [(0, '-52.645')] [2022-07-09 05:53:04,405][26022] Updated weights on worker 0-0, policy_version 115989 (0.00088) [2022-07-09 05:53:05,933][26022] Updated weights on worker 0-0, policy_version 115999 (0.00089) [2022-07-09 05:53:08,013][26022] Updated weights on worker 0-0, policy_version 116009 (0.00099) [2022-07-09 05:53:09,433][25689] Fps is (10 sec: 5598.4, 60 sec: 5705.7, 300 sec: 5695.2). Total num frames: 118802432. Throughput: 0: 5894.5. Samples: 118810890. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:09,434][25689] Avg episode reward: [(0, '-53.249')] [2022-07-09 05:53:09,462][26022] Updated weights on worker 0-0, policy_version 116019 (0.00087) [2022-07-09 05:53:11,474][26022] Updated weights on worker 0-0, policy_version 116029 (0.00098) [2022-07-09 05:53:13,080][26022] Updated weights on worker 0-0, policy_version 116039 (0.00087) [2022-07-09 05:53:14,446][25689] Fps is (10 sec: 5798.0, 60 sec: 5688.5, 300 sec: 5693.7). Total num frames: 118830080. Throughput: 0: 5014.8. Samples: 118827736. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:14,446][25689] Avg episode reward: [(0, '-53.760')] [2022-07-09 05:53:14,864][26022] Updated weights on worker 0-0, policy_version 116049 (0.00085) [2022-07-09 05:53:16,835][26022] Updated weights on worker 0-0, policy_version 116059 (0.00082) [2022-07-09 05:53:18,345][26022] Updated weights on worker 0-0, policy_version 116069 (0.00091) [2022-07-09 05:53:19,538][25689] Fps is (10 sec: 5674.9, 60 sec: 5650.3, 300 sec: 5692.7). Total num frames: 118859776. Throughput: 0: 5881.1. Samples: 118862396. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:19,539][25689] Avg episode reward: [(0, '-53.857')] [2022-07-09 05:53:20,359][26022] Updated weights on worker 0-0, policy_version 116079 (0.00089) [2022-07-09 05:53:22,229][26022] Updated weights on worker 0-0, policy_version 116089 (0.00085) [2022-07-09 05:53:23,911][26022] Updated weights on worker 0-0, policy_version 116099 (0.00087) [2022-07-09 05:53:24,630][25689] Fps is (10 sec: 5731.2, 60 sec: 5699.1, 300 sec: 5695.2). Total num frames: 118888448. Throughput: 0: 5968.8. Samples: 118897020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:24,630][25689] Avg episode reward: [(0, '-54.101')] [2022-07-09 05:53:25,693][26022] Updated weights on worker 0-0, policy_version 116109 (0.00087) [2022-07-09 05:53:27,450][26022] Updated weights on worker 0-0, policy_version 116119 (0.00086) [2022-07-09 05:53:29,325][26022] Updated weights on worker 0-0, policy_version 116129 (0.00086) [2022-07-09 05:53:29,645][25689] Fps is (10 sec: 5876.6, 60 sec: 5699.7, 300 sec: 5706.2). Total num frames: 118919168. Throughput: 0: 5099.6. Samples: 118913960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:29,645][25689] Avg episode reward: [(0, '-54.055')] [2022-07-09 05:53:31,046][26022] Updated weights on worker 0-0, policy_version 116139 (0.00082) [2022-07-09 05:53:32,843][26022] Updated weights on worker 0-0, policy_version 116149 (0.00090) [2022-07-09 05:53:34,710][25689] Fps is (10 sec: 5688.6, 60 sec: 5667.0, 300 sec: 5688.6). Total num frames: 118945792. Throughput: 0: 5952.8. Samples: 118948368. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:34,711][25689] Avg episode reward: [(0, '-52.945')] [2022-07-09 05:53:34,735][26022] Updated weights on worker 0-0, policy_version 116159 (0.00096) [2022-07-09 05:53:36,563][26022] Updated weights on worker 0-0, policy_version 116169 (0.00092) [2022-07-09 05:53:38,246][26022] Updated weights on worker 0-0, policy_version 116179 (0.00086) [2022-07-09 05:53:39,748][25689] Fps is (10 sec: 5473.0, 60 sec: 5684.2, 300 sec: 5691.7). Total num frames: 118974464. Throughput: 0: 5947.8. Samples: 118982600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:39,751][25689] Avg episode reward: [(0, '-52.607')] [2022-07-09 05:53:40,128][26022] Updated weights on worker 0-0, policy_version 116189 (0.00086) [2022-07-09 05:53:41,623][26022] Updated weights on worker 0-0, policy_version 116199 (0.00084) [2022-07-09 05:53:43,826][26022] Updated weights on worker 0-0, policy_version 116209 (0.00084) [2022-07-09 05:53:44,770][25689] Fps is (10 sec: 5903.9, 60 sec: 5699.4, 300 sec: 5694.9). Total num frames: 119005184. Throughput: 0: 5096.1. Samples: 118999658. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:44,771][25689] Avg episode reward: [(0, '-52.641')] [2022-07-09 05:53:45,319][26022] Updated weights on worker 0-0, policy_version 116219 (0.00095) [2022-07-09 05:53:47,219][26022] Updated weights on worker 0-0, policy_version 116229 (0.00093) [2022-07-09 05:53:49,000][26022] Updated weights on worker 0-0, policy_version 116239 (0.00092) [2022-07-09 05:53:49,822][25689] Fps is (10 sec: 5794.3, 60 sec: 5681.7, 300 sec: 5694.2). Total num frames: 119032832. Throughput: 0: 5966.7. Samples: 119034348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:49,823][25689] Avg episode reward: [(0, '-52.333')] [2022-07-09 05:53:50,816][26022] Updated weights on worker 0-0, policy_version 116249 (0.00084) [2022-07-09 05:53:52,511][26022] Updated weights on worker 0-0, policy_version 116259 (0.00092) [2022-07-09 05:53:54,613][26022] Updated weights on worker 0-0, policy_version 116269 (0.00082) [2022-07-09 05:53:54,830][25689] Fps is (10 sec: 5496.5, 60 sec: 5681.4, 300 sec: 5692.3). Total num frames: 119060480. Throughput: 0: 5969.2. Samples: 119068470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:54,831][25689] Avg episode reward: [(0, '-51.440')] [2022-07-09 05:53:55,993][26022] Updated weights on worker 0-0, policy_version 116279 (0.00109) [2022-07-09 05:53:58,063][26022] Updated weights on worker 0-0, policy_version 116289 (0.00092) [2022-07-09 05:53:59,656][26022] Updated weights on worker 0-0, policy_version 116299 (0.00088) [2022-07-09 05:53:59,939][25689] Fps is (10 sec: 5769.1, 60 sec: 5697.8, 300 sec: 5700.8). Total num frames: 119091200. Throughput: 0: 5111.8. Samples: 119085812. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:53:59,939][25689] Avg episode reward: [(0, '-51.928')] [2022-07-09 05:54:01,468][26022] Updated weights on worker 0-0, policy_version 116309 (0.00079) [2022-07-09 05:54:03,528][26022] Updated weights on worker 0-0, policy_version 116319 (0.00103) [2022-07-09 05:54:05,000][25689] Fps is (10 sec: 5739.7, 60 sec: 5726.3, 300 sec: 5700.3). Total num frames: 119118848. Throughput: 0: 5883.9. Samples: 119118686. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:05,000][25689] Avg episode reward: [(0, '-52.421')] [2022-07-09 05:54:05,319][26022] Updated weights on worker 0-0, policy_version 116329 (0.00077) [2022-07-09 05:54:07,178][26022] Updated weights on worker 0-0, policy_version 116339 (0.00086) [2022-07-09 05:54:09,190][26022] Updated weights on worker 0-0, policy_version 116349 (0.00084) [2022-07-09 05:54:10,016][25689] Fps is (10 sec: 5487.5, 60 sec: 5679.3, 300 sec: 5697.0). Total num frames: 119146496. Throughput: 0: 5892.1. Samples: 119153334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:10,017][25689] Avg episode reward: [(0, '-52.576')] [2022-07-09 05:54:10,407][26022] Updated weights on worker 0-0, policy_version 116359 (0.00084) [2022-07-09 05:54:12,712][26022] Updated weights on worker 0-0, policy_version 116369 (0.00086) [2022-07-09 05:54:14,099][26022] Updated weights on worker 0-0, policy_version 116379 (0.00080) [2022-07-09 05:54:15,029][25689] Fps is (10 sec: 5615.6, 60 sec: 5696.1, 300 sec: 5695.0). Total num frames: 119175168. Throughput: 0: 5057.3. Samples: 119170620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:15,029][25689] Avg episode reward: [(0, '-52.791')] [2022-07-09 05:54:16,219][26022] Updated weights on worker 0-0, policy_version 116389 (0.00085) [2022-07-09 05:54:17,979][26022] Updated weights on worker 0-0, policy_version 116399 (0.00084) [2022-07-09 05:54:19,719][26022] Updated weights on worker 0-0, policy_version 116409 (0.00082) [2022-07-09 05:54:20,092][25689] Fps is (10 sec: 5792.5, 60 sec: 5698.8, 300 sec: 5698.0). Total num frames: 119204864. Throughput: 0: 5907.0. Samples: 119204858. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:20,093][25689] Avg episode reward: [(0, '-52.581')] [2022-07-09 05:54:21,627][26022] Updated weights on worker 0-0, policy_version 116419 (0.00094) [2022-07-09 05:54:23,310][26022] Updated weights on worker 0-0, policy_version 116429 (0.00089) [2022-07-09 05:54:24,992][26022] Updated weights on worker 0-0, policy_version 116439 (0.00086) [2022-07-09 05:54:25,134][25689] Fps is (10 sec: 5877.6, 60 sec: 5720.5, 300 sec: 5698.3). Total num frames: 119234560. Throughput: 0: 6003.4. Samples: 119239558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:25,135][25689] Avg episode reward: [(0, '-53.492')] [2022-07-09 05:54:26,895][26022] Updated weights on worker 0-0, policy_version 116449 (0.00087) [2022-07-09 05:54:28,450][26022] Updated weights on worker 0-0, policy_version 116459 (0.00086) [2022-07-09 05:54:30,201][25689] Fps is (10 sec: 5672.8, 60 sec: 5664.8, 300 sec: 5690.2). Total num frames: 119262208. Throughput: 0: 5123.1. Samples: 119256744. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:30,202][25689] Avg episode reward: [(0, '-53.439')] [2022-07-09 05:54:30,243][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:54:30,251][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000116468_119263232.pth [2022-07-09 05:54:30,263][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000114465_117212160.pth [2022-07-09 05:54:30,443][26022] Updated weights on worker 0-0, policy_version 116469 (0.00092) [2022-07-09 05:54:32,141][26022] Updated weights on worker 0-0, policy_version 116479 (0.00087) [2022-07-09 05:54:33,855][26022] Updated weights on worker 0-0, policy_version 116489 (0.00093) [2022-07-09 05:54:35,289][25689] Fps is (10 sec: 5646.6, 60 sec: 5713.4, 300 sec: 5700.2). Total num frames: 119291904. Throughput: 0: 5969.0. Samples: 119291554. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:35,290][25689] Avg episode reward: [(0, '-53.614')] [2022-07-09 05:54:35,673][26022] Updated weights on worker 0-0, policy_version 116499 (0.00090) [2022-07-09 05:54:37,523][26022] Updated weights on worker 0-0, policy_version 116509 (0.00084) [2022-07-09 05:54:39,163][26022] Updated weights on worker 0-0, policy_version 116519 (0.00090) [2022-07-09 05:54:40,344][25689] Fps is (10 sec: 5855.1, 60 sec: 5728.7, 300 sec: 5699.2). Total num frames: 119321600. Throughput: 0: 5993.9. Samples: 119326248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:40,345][25689] Avg episode reward: [(0, '-53.207')] [2022-07-09 05:54:41,183][26022] Updated weights on worker 0-0, policy_version 116529 (0.00089) [2022-07-09 05:54:42,537][26022] Updated weights on worker 0-0, policy_version 116539 (0.00076) [2022-07-09 05:54:44,703][26022] Updated weights on worker 0-0, policy_version 116549 (0.00101) [2022-07-09 05:54:45,375][25689] Fps is (10 sec: 5888.5, 60 sec: 5710.9, 300 sec: 5709.2). Total num frames: 119351296. Throughput: 0: 5135.9. Samples: 119343516. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:45,376][25689] Avg episode reward: [(0, '-53.196')] [2022-07-09 05:54:46,331][26022] Updated weights on worker 0-0, policy_version 116559 (0.00090) [2022-07-09 05:54:47,973][26022] Updated weights on worker 0-0, policy_version 116569 (0.00083) [2022-07-09 05:54:49,880][26022] Updated weights on worker 0-0, policy_version 116579 (0.00083) [2022-07-09 05:54:50,436][25689] Fps is (10 sec: 5783.8, 60 sec: 5727.0, 300 sec: 5698.1). Total num frames: 119379968. Throughput: 0: 6008.4. Samples: 119378324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:50,436][25689] Avg episode reward: [(0, '-52.922')] [2022-07-09 05:54:51,662][26022] Updated weights on worker 0-0, policy_version 116589 (0.00083) [2022-07-09 05:54:53,270][26022] Updated weights on worker 0-0, policy_version 116599 (0.00091) [2022-07-09 05:54:55,203][26022] Updated weights on worker 0-0, policy_version 116609 (0.00086) [2022-07-09 05:54:55,477][25689] Fps is (10 sec: 5778.1, 60 sec: 5757.7, 300 sec: 5706.3). Total num frames: 119409664. Throughput: 0: 6045.2. Samples: 119413592. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:54:55,477][25689] Avg episode reward: [(0, '-52.852')] [2022-07-09 05:54:56,716][26022] Updated weights on worker 0-0, policy_version 116619 (0.00089) [2022-07-09 05:54:58,781][26022] Updated weights on worker 0-0, policy_version 116629 (0.00086) [2022-07-09 05:55:00,336][26022] Updated weights on worker 0-0, policy_version 116639 (0.00093) [2022-07-09 05:55:00,531][25689] Fps is (10 sec: 5883.3, 60 sec: 5746.0, 300 sec: 5712.2). Total num frames: 119439360. Throughput: 0: 6043.9. Samples: 119448254. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:55:00,531][25689] Avg episode reward: [(0, '-53.224')] [2022-07-09 05:55:02,475][26022] Updated weights on worker 0-0, policy_version 116649 (0.00085) [2022-07-09 05:55:04,555][26022] Updated weights on worker 0-0, policy_version 116659 (0.00094) [2022-07-09 05:55:05,558][25689] Fps is (10 sec: 5485.3, 60 sec: 5715.4, 300 sec: 5701.7). Total num frames: 119464960. Throughput: 0: 5928.0. Samples: 119463158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:55:05,558][25689] Avg episode reward: [(0, '-53.500')] [2022-07-09 05:55:05,934][26022] Updated weights on worker 0-0, policy_version 116669 (0.00085) [2022-07-09 05:55:08,105][26022] Updated weights on worker 0-0, policy_version 116679 (0.00083) [2022-07-09 05:55:09,570][26022] Updated weights on worker 0-0, policy_version 116689 (0.00092) [2022-07-09 05:55:10,590][25689] Fps is (10 sec: 5497.1, 60 sec: 5747.7, 300 sec: 5708.2). Total num frames: 119494656. Throughput: 0: 5935.8. Samples: 119497954. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:10,590][25689] Avg episode reward: [(0, '-53.828')] [2022-07-09 05:55:11,534][26022] Updated weights on worker 0-0, policy_version 116699 (0.00461) [2022-07-09 05:55:13,376][26022] Updated weights on worker 0-0, policy_version 116709 (0.00096) [2022-07-09 05:55:15,106][26022] Updated weights on worker 0-0, policy_version 116719 (0.00093) [2022-07-09 05:55:15,635][25689] Fps is (10 sec: 5792.2, 60 sec: 5744.7, 300 sec: 5705.9). Total num frames: 119523328. Throughput: 0: 5884.3. Samples: 119532206. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:15,635][25689] Avg episode reward: [(0, '-54.181')] [2022-07-09 05:55:16,770][26022] Updated weights on worker 0-0, policy_version 116729 (0.00097) [2022-07-09 05:55:18,465][26022] Updated weights on worker 0-0, policy_version 116739 (0.00085) [2022-07-09 05:55:20,535][26022] Updated weights on worker 0-0, policy_version 116749 (0.00082) [2022-07-09 05:55:20,764][25689] Fps is (10 sec: 5636.0, 60 sec: 5721.5, 300 sec: 5700.3). Total num frames: 119552000. Throughput: 0: 5006.5. Samples: 119549554. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:20,765][25689] Avg episode reward: [(0, '-54.144')] [2022-07-09 05:55:21,992][26022] Updated weights on worker 0-0, policy_version 116759 (0.00084) [2022-07-09 05:55:24,002][26022] Updated weights on worker 0-0, policy_version 116769 (0.00081) [2022-07-09 05:55:25,731][26022] Updated weights on worker 0-0, policy_version 116779 (0.00094) [2022-07-09 05:55:25,828][25689] Fps is (10 sec: 5726.2, 60 sec: 5719.4, 300 sec: 5702.9). Total num frames: 119581696. Throughput: 0: 5981.3. Samples: 119584400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:25,828][25689] Avg episode reward: [(0, '-54.027')] [2022-07-09 05:55:27,458][26022] Updated weights on worker 0-0, policy_version 116789 (0.00087) [2022-07-09 05:55:29,382][26022] Updated weights on worker 0-0, policy_version 116799 (0.00094) [2022-07-09 05:55:30,813][26022] Updated weights on worker 0-0, policy_version 116809 (0.00081) [2022-07-09 05:55:30,883][25689] Fps is (10 sec: 5971.0, 60 sec: 5771.2, 300 sec: 5709.7). Total num frames: 119612416. Throughput: 0: 5965.6. Samples: 119619012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:30,883][25689] Avg episode reward: [(0, '-54.832')] [2022-07-09 05:55:32,982][26022] Updated weights on worker 0-0, policy_version 116819 (0.00095) [2022-07-09 05:55:34,642][26022] Updated weights on worker 0-0, policy_version 116829 (0.00085) [2022-07-09 05:55:35,894][25689] Fps is (10 sec: 5798.2, 60 sec: 5744.7, 300 sec: 5703.8). Total num frames: 119640064. Throughput: 0: 5149.1. Samples: 119636526. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:35,895][25689] Avg episode reward: [(0, '-54.921')] [2022-07-09 05:55:36,394][26022] Updated weights on worker 0-0, policy_version 116839 (0.00085) [2022-07-09 05:55:38,159][26022] Updated weights on worker 0-0, policy_version 116849 (0.00082) [2022-07-09 05:55:39,731][26022] Updated weights on worker 0-0, policy_version 116859 (0.00089) [2022-07-09 05:55:41,000][25689] Fps is (10 sec: 5566.6, 60 sec: 5723.0, 300 sec: 5705.6). Total num frames: 119668736. Throughput: 0: 6015.0. Samples: 119671272. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:41,001][25689] Avg episode reward: [(0, '-55.090')] [2022-07-09 05:55:41,687][26022] Updated weights on worker 0-0, policy_version 116869 (0.00084) [2022-07-09 05:55:43,422][26022] Updated weights on worker 0-0, policy_version 116879 (0.00081) [2022-07-09 05:55:45,095][26022] Updated weights on worker 0-0, policy_version 116889 (0.00090) [2022-07-09 05:55:46,056][25689] Fps is (10 sec: 5844.6, 60 sec: 5737.5, 300 sec: 5709.6). Total num frames: 119699456. Throughput: 0: 6012.7. Samples: 119706028. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:46,057][25689] Avg episode reward: [(0, '-54.631')] [2022-07-09 05:55:47,127][26022] Updated weights on worker 0-0, policy_version 116899 (0.00088) [2022-07-09 05:55:48,588][26022] Updated weights on worker 0-0, policy_version 116909 (0.00088) [2022-07-09 05:55:50,583][26022] Updated weights on worker 0-0, policy_version 116919 (0.00089) [2022-07-09 05:55:51,068][25689] Fps is (10 sec: 5899.3, 60 sec: 5742.2, 300 sec: 5707.0). Total num frames: 119728128. Throughput: 0: 5179.8. Samples: 119723568. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:51,068][25689] Avg episode reward: [(0, '-54.584')] [2022-07-09 05:55:52,248][26022] Updated weights on worker 0-0, policy_version 116929 (0.00089) [2022-07-09 05:55:53,994][26022] Updated weights on worker 0-0, policy_version 116939 (0.00084) [2022-07-09 05:55:55,686][26022] Updated weights on worker 0-0, policy_version 116949 (0.00084) [2022-07-09 05:55:56,129][25689] Fps is (10 sec: 5693.1, 60 sec: 5723.4, 300 sec: 5710.5). Total num frames: 119756800. Throughput: 0: 6020.3. Samples: 119758344. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:55:56,129][25689] Avg episode reward: [(0, '-54.113')] [2022-07-09 05:55:57,574][26022] Updated weights on worker 0-0, policy_version 116959 (0.00084) [2022-07-09 05:55:59,193][26022] Updated weights on worker 0-0, policy_version 116969 (0.00084) [2022-07-09 05:56:00,984][26022] Updated weights on worker 0-0, policy_version 116979 (0.00100) [2022-07-09 05:56:01,190][25689] Fps is (10 sec: 5766.1, 60 sec: 5722.7, 300 sec: 5717.6). Total num frames: 119786496. Throughput: 0: 6037.6. Samples: 119793172. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:01,191][25689] Avg episode reward: [(0, '-53.930')] [2022-07-09 05:56:03,286][26022] Updated weights on worker 0-0, policy_version 116989 (0.00086) [2022-07-09 05:56:04,968][26022] Updated weights on worker 0-0, policy_version 116999 (0.00094) [2022-07-09 05:56:06,242][25689] Fps is (10 sec: 5569.2, 60 sec: 5737.3, 300 sec: 5711.3). Total num frames: 119813120. Throughput: 0: 5070.4. Samples: 119808376. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:06,242][25689] Avg episode reward: [(0, '-53.630')] [2022-07-09 05:56:06,867][26022] Updated weights on worker 0-0, policy_version 117009 (0.00089) [2022-07-09 05:56:08,291][26022] Updated weights on worker 0-0, policy_version 117019 (0.00081) [2022-07-09 05:56:10,409][26022] Updated weights on worker 0-0, policy_version 117029 (0.00088) [2022-07-09 05:56:11,271][25689] Fps is (10 sec: 5587.1, 60 sec: 5737.6, 300 sec: 5711.2). Total num frames: 119842816. Throughput: 0: 5918.0. Samples: 119843128. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:11,271][25689] Avg episode reward: [(0, '-53.290')] [2022-07-09 05:56:11,973][26022] Updated weights on worker 0-0, policy_version 117039 (0.00087) [2022-07-09 05:56:13,802][26022] Updated weights on worker 0-0, policy_version 117049 (0.00086) [2022-07-09 05:56:15,653][26022] Updated weights on worker 0-0, policy_version 117059 (0.00098) [2022-07-09 05:56:16,286][25689] Fps is (10 sec: 5709.3, 60 sec: 5723.5, 300 sec: 5708.5). Total num frames: 119870464. Throughput: 0: 5913.8. Samples: 119877546. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:16,286][25689] Avg episode reward: [(0, '-53.866')] [2022-07-09 05:56:17,458][26022] Updated weights on worker 0-0, policy_version 117069 (0.00086) [2022-07-09 05:56:19,270][26022] Updated weights on worker 0-0, policy_version 117079 (0.00095) [2022-07-09 05:56:20,936][26022] Updated weights on worker 0-0, policy_version 117089 (0.00086) [2022-07-09 05:56:21,329][25689] Fps is (10 sec: 5701.1, 60 sec: 5748.6, 300 sec: 5711.2). Total num frames: 119900160. Throughput: 0: 5047.2. Samples: 119894816. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:21,330][25689] Avg episode reward: [(0, '-54.065')] [2022-07-09 05:56:22,660][26022] Updated weights on worker 0-0, policy_version 117099 (0.00060) [2022-07-09 05:56:24,562][26022] Updated weights on worker 0-0, policy_version 117109 (0.00086) [2022-07-09 05:56:26,175][26022] Updated weights on worker 0-0, policy_version 117119 (0.00087) [2022-07-09 05:56:26,397][25689] Fps is (10 sec: 5974.7, 60 sec: 5765.0, 300 sec: 5716.8). Total num frames: 119930880. Throughput: 0: 6035.7. Samples: 119930032. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:26,398][25689] Avg episode reward: [(0, '-54.047')] [2022-07-09 05:56:28,151][26022] Updated weights on worker 0-0, policy_version 117129 (0.00092) [2022-07-09 05:56:29,767][26022] Updated weights on worker 0-0, policy_version 117139 (0.00092) [2022-07-09 05:56:30,450][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:56:30,464][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000117142_119953408.pth [2022-07-09 05:56:30,464][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000115132_117895168.pth [2022-07-09 05:56:31,477][25689] Fps is (10 sec: 5751.7, 60 sec: 5712.0, 300 sec: 5712.2). Total num frames: 119958528. Throughput: 0: 5998.7. Samples: 119964340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:31,478][25689] Avg episode reward: [(0, '-54.509')] [2022-07-09 05:56:31,684][26022] Updated weights on worker 0-0, policy_version 117149 (0.00091) [2022-07-09 05:56:33,494][26022] Updated weights on worker 0-0, policy_version 117159 (0.00086) [2022-07-09 05:56:35,201][26022] Updated weights on worker 0-0, policy_version 117169 (0.00088) [2022-07-09 05:56:36,530][25689] Fps is (10 sec: 5659.4, 60 sec: 5741.8, 300 sec: 5719.1). Total num frames: 119988224. Throughput: 0: 5145.7. Samples: 119981718. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:36,531][25689] Avg episode reward: [(0, '-54.634')] [2022-07-09 05:56:37,020][26022] Updated weights on worker 0-0, policy_version 117179 (0.00083) [2022-07-09 05:56:38,753][26022] Updated weights on worker 0-0, policy_version 117189 (0.00091) [2022-07-09 05:56:40,639][26022] Updated weights on worker 0-0, policy_version 117199 (0.00086) [2022-07-09 05:56:41,599][25689] Fps is (10 sec: 5766.6, 60 sec: 5745.4, 300 sec: 5711.0). Total num frames: 120016896. Throughput: 0: 5981.0. Samples: 120016048. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:41,599][25689] Avg episode reward: [(0, '-54.710')] [2022-07-09 05:56:42,402][26022] Updated weights on worker 0-0, policy_version 117209 (0.00084) [2022-07-09 05:56:44,150][26022] Updated weights on worker 0-0, policy_version 117219 (0.00060) [2022-07-09 05:56:45,906][26022] Updated weights on worker 0-0, policy_version 117229 (0.00089) [2022-07-09 05:56:46,619][25689] Fps is (10 sec: 5684.0, 60 sec: 5715.0, 300 sec: 5707.5). Total num frames: 120045568. Throughput: 0: 5941.0. Samples: 120050166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:46,619][25689] Avg episode reward: [(0, '-54.469')] [2022-07-09 05:56:47,687][26022] Updated weights on worker 0-0, policy_version 117239 (0.00103) [2022-07-09 05:56:49,448][26022] Updated weights on worker 0-0, policy_version 117249 (0.00081) [2022-07-09 05:56:51,479][26022] Updated weights on worker 0-0, policy_version 117259 (0.00109) [2022-07-09 05:56:51,651][25689] Fps is (10 sec: 5806.6, 60 sec: 5729.9, 300 sec: 5717.6). Total num frames: 120075264. Throughput: 0: 5109.6. Samples: 120067420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:51,651][25689] Avg episode reward: [(0, '-53.588')] [2022-07-09 05:56:52,997][26022] Updated weights on worker 0-0, policy_version 117269 (0.00092) [2022-07-09 05:56:54,894][26022] Updated weights on worker 0-0, policy_version 117279 (0.00095) [2022-07-09 05:56:56,621][26022] Updated weights on worker 0-0, policy_version 117289 (0.00081) [2022-07-09 05:56:56,660][25689] Fps is (10 sec: 5812.9, 60 sec: 5734.8, 300 sec: 5712.4). Total num frames: 120103936. Throughput: 0: 5986.1. Samples: 120102218. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:56:56,660][25689] Avg episode reward: [(0, '-53.529')] [2022-07-09 05:56:58,531][26022] Updated weights on worker 0-0, policy_version 117299 (0.00084) [2022-07-09 05:57:00,230][26022] Updated weights on worker 0-0, policy_version 117309 (0.00092) [2022-07-09 05:57:01,715][25689] Fps is (10 sec: 5697.9, 60 sec: 5718.6, 300 sec: 5715.0). Total num frames: 120132608. Throughput: 0: 5995.0. Samples: 120136644. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:57:01,715][25689] Avg episode reward: [(0, '-53.748')] [2022-07-09 05:57:02,379][26022] Updated weights on worker 0-0, policy_version 117319 (0.00074) [2022-07-09 05:57:04,007][26022] Updated weights on worker 0-0, policy_version 117329 (0.00093) [2022-07-09 05:57:06,072][26022] Updated weights on worker 0-0, policy_version 117339 (0.00084) [2022-07-09 05:57:06,717][25689] Fps is (10 sec: 5599.7, 60 sec: 5740.1, 300 sec: 5719.0). Total num frames: 120160256. Throughput: 0: 5063.0. Samples: 120151926. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2022-07-09 05:57:06,719][25689] Avg episode reward: [(0, '-53.435')] [2022-07-09 05:57:07,575][26022] Updated weights on worker 0-0, policy_version 117349 (0.00094) [2022-07-09 05:57:09,548][26022] Updated weights on worker 0-0, policy_version 117359 (0.00092) [2022-07-09 05:57:11,428][26022] Updated weights on worker 0-0, policy_version 117369 (0.00201) [2022-07-09 05:57:11,734][25689] Fps is (10 sec: 5416.6, 60 sec: 5690.4, 300 sec: 5712.0). Total num frames: 120186880. Throughput: 0: 5937.1. Samples: 120186658. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:11,736][25689] Avg episode reward: [(0, '-53.486')] [2022-07-09 05:57:12,859][26022] Updated weights on worker 0-0, policy_version 117379 (0.00096) [2022-07-09 05:57:14,933][26022] Updated weights on worker 0-0, policy_version 117389 (0.00086) [2022-07-09 05:57:16,359][26022] Updated weights on worker 0-0, policy_version 117399 (0.00087) [2022-07-09 05:57:16,775][25689] Fps is (10 sec: 5803.6, 60 sec: 5755.7, 300 sec: 5712.1). Total num frames: 120218624. Throughput: 0: 5921.5. Samples: 120221328. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:16,775][25689] Avg episode reward: [(0, '-53.754')] [2022-07-09 05:57:18,441][26022] Updated weights on worker 0-0, policy_version 117409 (0.00083) [2022-07-09 05:57:20,213][26022] Updated weights on worker 0-0, policy_version 117419 (0.00084) [2022-07-09 05:57:21,742][26022] Updated weights on worker 0-0, policy_version 117429 (0.00095) [2022-07-09 05:57:21,841][25689] Fps is (10 sec: 5977.5, 60 sec: 5736.6, 300 sec: 5722.5). Total num frames: 120247296. Throughput: 0: 5935.2. Samples: 120256100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:21,842][25689] Avg episode reward: [(0, '-53.782')] [2022-07-09 05:57:23,654][26022] Updated weights on worker 0-0, policy_version 117439 (0.00082) [2022-07-09 05:57:25,350][26022] Updated weights on worker 0-0, policy_version 117449 (0.00087) [2022-07-09 05:57:26,850][25689] Fps is (10 sec: 5691.7, 60 sec: 5708.4, 300 sec: 5715.8). Total num frames: 120275968. Throughput: 0: 6029.6. Samples: 120273316. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:26,850][25689] Avg episode reward: [(0, '-53.880')] [2022-07-09 05:57:27,090][26022] Updated weights on worker 0-0, policy_version 117459 (0.00097) [2022-07-09 05:57:29,024][26022] Updated weights on worker 0-0, policy_version 117469 (0.01398) [2022-07-09 05:57:30,697][26022] Updated weights on worker 0-0, policy_version 117479 (0.00089) [2022-07-09 05:57:31,865][25689] Fps is (10 sec: 5618.8, 60 sec: 5714.5, 300 sec: 5713.6). Total num frames: 120303616. Throughput: 0: 6019.2. Samples: 120307828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:31,865][25689] Avg episode reward: [(0, '-53.401')] [2022-07-09 05:57:32,581][26022] Updated weights on worker 0-0, policy_version 117489 (0.00091) [2022-07-09 05:57:34,363][26022] Updated weights on worker 0-0, policy_version 117499 (0.00084) [2022-07-09 05:57:36,107][26022] Updated weights on worker 0-0, policy_version 117509 (0.00081) [2022-07-09 05:57:36,882][25689] Fps is (10 sec: 5716.1, 60 sec: 5718.0, 300 sec: 5720.9). Total num frames: 120333312. Throughput: 0: 6012.9. Samples: 120342230. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:36,882][25689] Avg episode reward: [(0, '-53.295')] [2022-07-09 05:57:37,995][26022] Updated weights on worker 0-0, policy_version 117519 (0.00081) [2022-07-09 05:57:39,883][26022] Updated weights on worker 0-0, policy_version 117529 (0.00087) [2022-07-09 05:57:41,418][26022] Updated weights on worker 0-0, policy_version 117539 (0.00088) [2022-07-09 05:57:41,962][25689] Fps is (10 sec: 5882.0, 60 sec: 5733.8, 300 sec: 5719.4). Total num frames: 120363008. Throughput: 0: 5139.1. Samples: 120359502. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:41,963][25689] Avg episode reward: [(0, '-52.784')] [2022-07-09 05:57:43,392][26022] Updated weights on worker 0-0, policy_version 117549 (0.00085) [2022-07-09 05:57:44,975][26022] Updated weights on worker 0-0, policy_version 117559 (0.00088) [2022-07-09 05:57:46,964][26022] Updated weights on worker 0-0, policy_version 117569 (0.00089) [2022-07-09 05:57:47,002][25689] Fps is (10 sec: 5666.1, 60 sec: 5714.9, 300 sec: 5716.1). Total num frames: 120390656. Throughput: 0: 5987.1. Samples: 120393972. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:47,003][25689] Avg episode reward: [(0, '-52.995')] [2022-07-09 05:57:48,842][26022] Updated weights on worker 0-0, policy_version 117579 (0.00089) [2022-07-09 05:57:50,378][26022] Updated weights on worker 0-0, policy_version 117589 (0.00092) [2022-07-09 05:57:52,064][25689] Fps is (10 sec: 5575.0, 60 sec: 5695.1, 300 sec: 5718.4). Total num frames: 120419328. Throughput: 0: 5948.8. Samples: 120427992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:52,065][25689] Avg episode reward: [(0, '-53.649')] [2022-07-09 05:57:52,346][26022] Updated weights on worker 0-0, policy_version 117599 (0.00087) [2022-07-09 05:57:53,954][26022] Updated weights on worker 0-0, policy_version 117609 (0.00088) [2022-07-09 05:57:55,712][26022] Updated weights on worker 0-0, policy_version 117619 (0.00089) [2022-07-09 05:57:57,078][25689] Fps is (10 sec: 5793.2, 60 sec: 5711.7, 300 sec: 5720.1). Total num frames: 120449024. Throughput: 0: 5100.8. Samples: 120445246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:57:57,078][25689] Avg episode reward: [(0, '-53.746')] [2022-07-09 05:57:57,545][26022] Updated weights on worker 0-0, policy_version 117629 (0.00085) [2022-07-09 05:57:59,358][26022] Updated weights on worker 0-0, policy_version 117639 (0.00085) [2022-07-09 05:58:01,217][26022] Updated weights on worker 0-0, policy_version 117649 (0.00088) [2022-07-09 05:58:02,155][25689] Fps is (10 sec: 5581.6, 60 sec: 5675.7, 300 sec: 5722.2). Total num frames: 120475648. Throughput: 0: 5955.4. Samples: 120479756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:02,155][25689] Avg episode reward: [(0, '-53.740')] [2022-07-09 05:58:03,401][26022] Updated weights on worker 0-0, policy_version 117659 (0.00085) [2022-07-09 05:58:05,293][26022] Updated weights on worker 0-0, policy_version 117669 (0.00093) [2022-07-09 05:58:06,788][26022] Updated weights on worker 0-0, policy_version 117679 (0.00089) [2022-07-09 05:58:07,190][25689] Fps is (10 sec: 5468.0, 60 sec: 5689.6, 300 sec: 5715.7). Total num frames: 120504320. Throughput: 0: 5851.6. Samples: 120512104. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:07,192][25689] Avg episode reward: [(0, '-53.856')] [2022-07-09 05:58:08,755][26022] Updated weights on worker 0-0, policy_version 117689 (0.00091) [2022-07-09 05:58:10,298][26022] Updated weights on worker 0-0, policy_version 117699 (0.00081) [2022-07-09 05:58:12,222][25689] Fps is (10 sec: 5696.2, 60 sec: 5722.0, 300 sec: 5718.8). Total num frames: 120532992. Throughput: 0: 5029.2. Samples: 120529368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:12,224][25689] Avg episode reward: [(0, '-54.606')] [2022-07-09 05:58:12,319][26022] Updated weights on worker 0-0, policy_version 117709 (0.00082) [2022-07-09 05:58:14,085][26022] Updated weights on worker 0-0, policy_version 117719 (0.00084) [2022-07-09 05:58:15,843][26022] Updated weights on worker 0-0, policy_version 117729 (0.00091) [2022-07-09 05:58:17,243][25689] Fps is (10 sec: 5704.0, 60 sec: 5673.0, 300 sec: 5716.7). Total num frames: 120561664. Throughput: 0: 5898.2. Samples: 120564188. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:17,245][25689] Avg episode reward: [(0, '-55.081')] [2022-07-09 05:58:17,763][26022] Updated weights on worker 0-0, policy_version 117739 (0.00089) [2022-07-09 05:58:19,443][26022] Updated weights on worker 0-0, policy_version 117749 (0.00082) [2022-07-09 05:58:21,054][26022] Updated weights on worker 0-0, policy_version 117759 (0.00095) [2022-07-09 05:58:22,345][25689] Fps is (10 sec: 5765.5, 60 sec: 5686.6, 300 sec: 5719.9). Total num frames: 120591360. Throughput: 0: 5881.9. Samples: 120598516. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:22,346][25689] Avg episode reward: [(0, '-54.173')] [2022-07-09 05:58:23,150][26022] Updated weights on worker 0-0, policy_version 117769 (0.00090) [2022-07-09 05:58:24,587][26022] Updated weights on worker 0-0, policy_version 117779 (0.00090) [2022-07-09 05:58:26,672][26022] Updated weights on worker 0-0, policy_version 117789 (0.00090) [2022-07-09 05:58:27,368][25689] Fps is (10 sec: 5765.1, 60 sec: 5685.3, 300 sec: 5712.9). Total num frames: 120620032. Throughput: 0: 5149.0. Samples: 120615998. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:27,368][25689] Avg episode reward: [(0, '-54.432')] [2022-07-09 05:58:28,358][26022] Updated weights on worker 0-0, policy_version 117799 (0.00093) [2022-07-09 05:58:30,147][26022] Updated weights on worker 0-0, policy_version 117809 (0.00090) [2022-07-09 05:58:30,485][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 05:58:30,498][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000117811_120638464.pth [2022-07-09 05:58:30,498][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000115801_118580224.pth [2022-07-09 05:58:31,943][26022] Updated weights on worker 0-0, policy_version 117819 (0.00084) [2022-07-09 05:58:32,430][25689] Fps is (10 sec: 5787.7, 60 sec: 5714.7, 300 sec: 5723.3). Total num frames: 120649728. Throughput: 0: 5986.5. Samples: 120650346. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:32,431][25689] Avg episode reward: [(0, '-54.290')] [2022-07-09 05:58:33,642][26022] Updated weights on worker 0-0, policy_version 117829 (0.00089) [2022-07-09 05:58:35,454][26022] Updated weights on worker 0-0, policy_version 117839 (0.00085) [2022-07-09 05:58:37,211][26022] Updated weights on worker 0-0, policy_version 117849 (0.00089) [2022-07-09 05:58:37,503][25689] Fps is (10 sec: 5658.1, 60 sec: 5675.7, 300 sec: 5719.2). Total num frames: 120677376. Throughput: 0: 5945.9. Samples: 120684648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:37,503][25689] Avg episode reward: [(0, '-54.430')] [2022-07-09 05:58:39,072][26022] Updated weights on worker 0-0, policy_version 117859 (0.00088) [2022-07-09 05:58:40,928][26022] Updated weights on worker 0-0, policy_version 117869 (0.00088) [2022-07-09 05:58:42,557][25689] Fps is (10 sec: 5662.4, 60 sec: 5678.1, 300 sec: 5715.1). Total num frames: 120707072. Throughput: 0: 5109.8. Samples: 120701794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:42,558][25689] Avg episode reward: [(0, '-54.544')] [2022-07-09 05:58:42,760][26022] Updated weights on worker 0-0, policy_version 117879 (0.00093) [2022-07-09 05:58:44,379][26022] Updated weights on worker 0-0, policy_version 117889 (0.00892) [2022-07-09 05:58:46,217][26022] Updated weights on worker 0-0, policy_version 117899 (0.00092) [2022-07-09 05:58:47,563][25689] Fps is (10 sec: 5903.7, 60 sec: 5715.1, 300 sec: 5722.9). Total num frames: 120736768. Throughput: 0: 5973.2. Samples: 120736626. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:47,563][25689] Avg episode reward: [(0, '-54.899')] [2022-07-09 05:58:47,808][26022] Updated weights on worker 0-0, policy_version 117909 (0.00091) [2022-07-09 05:58:49,823][26022] Updated weights on worker 0-0, policy_version 117919 (0.00096) [2022-07-09 05:58:51,395][26022] Updated weights on worker 0-0, policy_version 117929 (0.00087) [2022-07-09 05:58:52,575][25689] Fps is (10 sec: 5622.2, 60 sec: 5686.0, 300 sec: 5719.4). Total num frames: 120763392. Throughput: 0: 5984.7. Samples: 120770904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:52,575][25689] Avg episode reward: [(0, '-53.688')] [2022-07-09 05:58:53,439][26022] Updated weights on worker 0-0, policy_version 117939 (0.00100) [2022-07-09 05:58:54,991][26022] Updated weights on worker 0-0, policy_version 117949 (0.00086) [2022-07-09 05:58:56,802][26022] Updated weights on worker 0-0, policy_version 117959 (0.00086) [2022-07-09 05:58:57,582][25689] Fps is (10 sec: 5621.1, 60 sec: 5686.6, 300 sec: 5717.9). Total num frames: 120793088. Throughput: 0: 5165.0. Samples: 120788360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:58:57,583][25689] Avg episode reward: [(0, '-54.009')] [2022-07-09 05:58:58,582][26022] Updated weights on worker 0-0, policy_version 117969 (0.00083) [2022-07-09 05:59:00,459][26022] Updated weights on worker 0-0, policy_version 117979 (0.00053) [2022-07-09 05:59:02,534][26022] Updated weights on worker 0-0, policy_version 117989 (0.00087) [2022-07-09 05:59:02,655][25689] Fps is (10 sec: 5790.5, 60 sec: 5720.9, 300 sec: 5721.1). Total num frames: 120821760. Throughput: 0: 6023.2. Samples: 120822844. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:59:02,655][25689] Avg episode reward: [(0, '-53.978')] [2022-07-09 05:59:04,435][26022] Updated weights on worker 0-0, policy_version 117999 (0.00093) [2022-07-09 05:59:06,058][26022] Updated weights on worker 0-0, policy_version 118009 (0.00098) [2022-07-09 05:59:07,684][25689] Fps is (10 sec: 5575.3, 60 sec: 5704.5, 300 sec: 5720.8). Total num frames: 120849408. Throughput: 0: 5886.6. Samples: 120855072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:59:07,685][25689] Avg episode reward: [(0, '-54.222')] [2022-07-09 05:59:08,018][26022] Updated weights on worker 0-0, policy_version 118019 (0.00093) [2022-07-09 05:59:09,824][26022] Updated weights on worker 0-0, policy_version 118029 (0.00085) [2022-07-09 05:59:11,657][26022] Updated weights on worker 0-0, policy_version 118039 (0.00086) [2022-07-09 05:59:12,696][25689] Fps is (10 sec: 5608.8, 60 sec: 5706.4, 300 sec: 5720.9). Total num frames: 120878080. Throughput: 0: 5028.5. Samples: 120872084. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 05:59:12,698][25689] Avg episode reward: [(0, '-53.201')] [2022-07-09 05:59:13,388][26022] Updated weights on worker 0-0, policy_version 118049 (0.00092) [2022-07-09 05:59:15,211][26022] Updated weights on worker 0-0, policy_version 118059 (0.00089) [2022-07-09 05:59:16,834][26022] Updated weights on worker 0-0, policy_version 118069 (0.00084) [2022-07-09 05:59:17,723][25689] Fps is (10 sec: 5610.3, 60 sec: 5689.0, 300 sec: 5714.7). Total num frames: 120905728. Throughput: 0: 5861.3. Samples: 120906408. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:17,723][25689] Avg episode reward: [(0, '-52.729')] [2022-07-09 05:59:18,997][26022] Updated weights on worker 0-0, policy_version 118079 (0.00118) [2022-07-09 05:59:20,455][26022] Updated weights on worker 0-0, policy_version 118089 (0.00083) [2022-07-09 05:59:22,450][26022] Updated weights on worker 0-0, policy_version 118099 (0.00086) [2022-07-09 05:59:22,784][25689] Fps is (10 sec: 5684.6, 60 sec: 5692.8, 300 sec: 5714.3). Total num frames: 120935424. Throughput: 0: 5834.8. Samples: 120940292. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:22,784][25689] Avg episode reward: [(0, '-53.801')] [2022-07-09 05:59:24,057][26022] Updated weights on worker 0-0, policy_version 118109 (0.00088) [2022-07-09 05:59:25,863][26022] Updated weights on worker 0-0, policy_version 118119 (0.00089) [2022-07-09 05:59:27,763][26022] Updated weights on worker 0-0, policy_version 118129 (0.00095) [2022-07-09 05:59:27,795][25689] Fps is (10 sec: 5795.1, 60 sec: 5693.9, 300 sec: 5718.8). Total num frames: 120964096. Throughput: 0: 5106.0. Samples: 120957758. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:27,796][25689] Avg episode reward: [(0, '-53.315')] [2022-07-09 05:59:29,654][26022] Updated weights on worker 0-0, policy_version 118139 (0.00088) [2022-07-09 05:59:31,274][26022] Updated weights on worker 0-0, policy_version 118149 (0.00079) [2022-07-09 05:59:32,805][25689] Fps is (10 sec: 5722.1, 60 sec: 5681.8, 300 sec: 5716.9). Total num frames: 120992768. Throughput: 0: 5974.6. Samples: 120992228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:32,806][25689] Avg episode reward: [(0, '-53.336')] [2022-07-09 05:59:33,140][26022] Updated weights on worker 0-0, policy_version 118159 (0.00099) [2022-07-09 05:59:34,713][26022] Updated weights on worker 0-0, policy_version 118169 (0.00091) [2022-07-09 05:59:36,686][26022] Updated weights on worker 0-0, policy_version 118179 (0.00085) [2022-07-09 05:59:37,835][25689] Fps is (10 sec: 5711.4, 60 sec: 5702.8, 300 sec: 5713.9). Total num frames: 121021440. Throughput: 0: 5983.4. Samples: 121026748. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:37,835][25689] Avg episode reward: [(0, '-53.645')] [2022-07-09 05:59:38,516][26022] Updated weights on worker 0-0, policy_version 118189 (0.00092) [2022-07-09 05:59:40,208][26022] Updated weights on worker 0-0, policy_version 118199 (0.00083) [2022-07-09 05:59:42,082][26022] Updated weights on worker 0-0, policy_version 118209 (0.00091) [2022-07-09 05:59:42,903][25689] Fps is (10 sec: 5780.2, 60 sec: 5701.5, 300 sec: 5713.2). Total num frames: 121051136. Throughput: 0: 5146.3. Samples: 121043832. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:42,904][25689] Avg episode reward: [(0, '-53.933')] [2022-07-09 05:59:43,877][26022] Updated weights on worker 0-0, policy_version 118219 (0.00091) [2022-07-09 05:59:45,552][26022] Updated weights on worker 0-0, policy_version 118229 (0.00089) [2022-07-09 05:59:47,351][26022] Updated weights on worker 0-0, policy_version 118239 (0.00086) [2022-07-09 05:59:47,912][25689] Fps is (10 sec: 5690.8, 60 sec: 5667.3, 300 sec: 5710.8). Total num frames: 121078784. Throughput: 0: 6005.5. Samples: 121078570. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:47,912][25689] Avg episode reward: [(0, '-54.724')] [2022-07-09 05:59:49,251][26022] Updated weights on worker 0-0, policy_version 118249 (0.00083) [2022-07-09 05:59:51,074][26022] Updated weights on worker 0-0, policy_version 118259 (0.00093) [2022-07-09 05:59:52,831][26022] Updated weights on worker 0-0, policy_version 118269 (0.00091) [2022-07-09 05:59:52,922][25689] Fps is (10 sec: 5723.5, 60 sec: 5718.3, 300 sec: 5711.4). Total num frames: 121108480. Throughput: 0: 5991.4. Samples: 121112758. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:52,923][25689] Avg episode reward: [(0, '-54.801')] [2022-07-09 05:59:54,441][26022] Updated weights on worker 0-0, policy_version 118279 (0.00086) [2022-07-09 05:59:56,256][26022] Updated weights on worker 0-0, policy_version 118289 (0.00087) [2022-07-09 05:59:57,927][25689] Fps is (10 sec: 5725.8, 60 sec: 5684.7, 300 sec: 5705.4). Total num frames: 121136128. Throughput: 0: 5142.4. Samples: 121130068. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 05:59:57,927][25689] Avg episode reward: [(0, '-54.565')] [2022-07-09 05:59:58,093][26022] Updated weights on worker 0-0, policy_version 118299 (0.00088) [2022-07-09 05:59:59,844][26022] Updated weights on worker 0-0, policy_version 118309 (0.00086) [2022-07-09 06:00:01,888][26022] Updated weights on worker 0-0, policy_version 118319 (0.00091) [2022-07-09 06:00:03,018][25689] Fps is (10 sec: 5477.3, 60 sec: 5666.0, 300 sec: 5711.1). Total num frames: 121163776. Throughput: 0: 6003.3. Samples: 121164586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:03,018][25689] Avg episode reward: [(0, '-54.547')] [2022-07-09 06:00:03,815][26022] Updated weights on worker 0-0, policy_version 118329 (0.00084) [2022-07-09 06:00:05,500][26022] Updated weights on worker 0-0, policy_version 118339 (0.00104) [2022-07-09 06:00:07,428][26022] Updated weights on worker 0-0, policy_version 118349 (0.00086) [2022-07-09 06:00:08,035][25689] Fps is (10 sec: 5571.5, 60 sec: 5684.1, 300 sec: 5707.9). Total num frames: 121192448. Throughput: 0: 5893.0. Samples: 121197158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:08,036][25689] Avg episode reward: [(0, '-54.319')] [2022-07-09 06:00:08,956][26022] Updated weights on worker 0-0, policy_version 118359 (0.00090) [2022-07-09 06:00:10,955][26022] Updated weights on worker 0-0, policy_version 118369 (0.00083) [2022-07-09 06:00:12,463][26022] Updated weights on worker 0-0, policy_version 118379 (0.00085) [2022-07-09 06:00:13,058][25689] Fps is (10 sec: 5813.3, 60 sec: 5700.0, 300 sec: 5711.8). Total num frames: 121222144. Throughput: 0: 5055.3. Samples: 121214550. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:13,059][25689] Avg episode reward: [(0, '-54.398')] [2022-07-09 06:00:14,459][26022] Updated weights on worker 0-0, policy_version 118389 (0.00086) [2022-07-09 06:00:15,936][26022] Updated weights on worker 0-0, policy_version 118399 (0.00085) [2022-07-09 06:00:17,985][26022] Updated weights on worker 0-0, policy_version 118409 (0.00092) [2022-07-09 06:00:18,089][25689] Fps is (10 sec: 5805.5, 60 sec: 5716.5, 300 sec: 5713.7). Total num frames: 121250816. Throughput: 0: 5914.6. Samples: 121249322. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:18,090][25689] Avg episode reward: [(0, '-53.778')] [2022-07-09 06:00:19,619][26022] Updated weights on worker 0-0, policy_version 118419 (0.00094) [2022-07-09 06:00:21,662][26022] Updated weights on worker 0-0, policy_version 118429 (0.00091) [2022-07-09 06:00:23,137][25689] Fps is (10 sec: 5689.4, 60 sec: 5700.8, 300 sec: 5710.5). Total num frames: 121279488. Throughput: 0: 5908.2. Samples: 121283456. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:23,139][25689] Avg episode reward: [(0, '-53.924')] [2022-07-09 06:00:23,391][26022] Updated weights on worker 0-0, policy_version 118439 (0.00089) [2022-07-09 06:00:25,144][26022] Updated weights on worker 0-0, policy_version 118449 (0.00095) [2022-07-09 06:00:26,975][26022] Updated weights on worker 0-0, policy_version 118459 (0.00093) [2022-07-09 06:00:28,163][25689] Fps is (10 sec: 5692.4, 60 sec: 5699.4, 300 sec: 5704.2). Total num frames: 121308160. Throughput: 0: 5981.5. Samples: 121317552. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:28,163][25689] Avg episode reward: [(0, '-54.396')] [2022-07-09 06:00:28,754][26022] Updated weights on worker 0-0, policy_version 118469 (0.00097) [2022-07-09 06:00:30,557][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:00:30,572][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000118479_121322496.pth [2022-07-09 06:00:30,573][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000116468_119263232.pth [2022-07-09 06:00:30,579][26022] Updated weights on worker 0-0, policy_version 118479 (0.00085) [2022-07-09 06:00:32,426][26022] Updated weights on worker 0-0, policy_version 118489 (0.00093) [2022-07-09 06:00:33,174][25689] Fps is (10 sec: 5713.1, 60 sec: 5699.3, 300 sec: 5707.6). Total num frames: 121336832. Throughput: 0: 5970.1. Samples: 121334648. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:33,175][25689] Avg episode reward: [(0, '-53.728')] [2022-07-09 06:00:34,246][26022] Updated weights on worker 0-0, policy_version 118499 (0.00089) [2022-07-09 06:00:35,986][26022] Updated weights on worker 0-0, policy_version 118509 (0.00086) [2022-07-09 06:00:37,731][26022] Updated weights on worker 0-0, policy_version 118519 (0.00086) [2022-07-09 06:00:38,178][25689] Fps is (10 sec: 5725.8, 60 sec: 5701.8, 300 sec: 5709.6). Total num frames: 121365504. Throughput: 0: 5969.8. Samples: 121369248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:38,180][25689] Avg episode reward: [(0, '-54.179')] [2022-07-09 06:00:39,552][26022] Updated weights on worker 0-0, policy_version 118529 (0.00095) [2022-07-09 06:00:41,329][26022] Updated weights on worker 0-0, policy_version 118539 (0.00091) [2022-07-09 06:00:43,113][26022] Updated weights on worker 0-0, policy_version 118549 (0.00390) [2022-07-09 06:00:43,236][25689] Fps is (10 sec: 5699.3, 60 sec: 5685.8, 300 sec: 5702.7). Total num frames: 121394176. Throughput: 0: 5978.1. Samples: 121403610. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:43,238][25689] Avg episode reward: [(0, '-53.889')] [2022-07-09 06:00:44,966][26022] Updated weights on worker 0-0, policy_version 118559 (0.00096) [2022-07-09 06:00:46,623][26022] Updated weights on worker 0-0, policy_version 118569 (0.00096) [2022-07-09 06:00:48,240][25689] Fps is (10 sec: 5699.1, 60 sec: 5703.2, 300 sec: 5702.8). Total num frames: 121422848. Throughput: 0: 5144.4. Samples: 121420836. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:48,242][25689] Avg episode reward: [(0, '-53.736')] [2022-07-09 06:00:48,506][26022] Updated weights on worker 0-0, policy_version 118579 (0.00091) [2022-07-09 06:00:50,406][26022] Updated weights on worker 0-0, policy_version 118589 (0.00085) [2022-07-09 06:00:52,098][26022] Updated weights on worker 0-0, policy_version 118599 (0.00097) [2022-07-09 06:00:53,249][25689] Fps is (10 sec: 5726.9, 60 sec: 5686.3, 300 sec: 5703.8). Total num frames: 121451520. Throughput: 0: 5987.7. Samples: 121454850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:53,250][25689] Avg episode reward: [(0, '-53.470')] [2022-07-09 06:00:53,965][26022] Updated weights on worker 0-0, policy_version 118609 (0.00082) [2022-07-09 06:00:55,550][26022] Updated weights on worker 0-0, policy_version 118619 (0.00090) [2022-07-09 06:00:57,357][26022] Updated weights on worker 0-0, policy_version 118629 (0.00085) [2022-07-09 06:00:58,279][25689] Fps is (10 sec: 5610.3, 60 sec: 5684.0, 300 sec: 5697.5). Total num frames: 121479168. Throughput: 0: 5991.3. Samples: 121489676. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:00:58,280][25689] Avg episode reward: [(0, '-53.521')] [2022-07-09 06:00:59,074][26022] Updated weights on worker 0-0, policy_version 118639 (0.00082) [2022-07-09 06:01:01,077][26022] Updated weights on worker 0-0, policy_version 118649 (0.00084) [2022-07-09 06:01:03,144][26022] Updated weights on worker 0-0, policy_version 118659 (0.00123) [2022-07-09 06:01:03,316][25689] Fps is (10 sec: 5594.4, 60 sec: 5706.0, 300 sec: 5704.7). Total num frames: 121507840. Throughput: 0: 5139.4. Samples: 121506810. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:01:03,317][25689] Avg episode reward: [(0, '-53.244')] [2022-07-09 06:01:04,980][26022] Updated weights on worker 0-0, policy_version 118669 (0.00086) [2022-07-09 06:01:06,699][26022] Updated weights on worker 0-0, policy_version 118679 (0.00084) [2022-07-09 06:01:08,320][25689] Fps is (10 sec: 5609.0, 60 sec: 5690.4, 300 sec: 5698.3). Total num frames: 121535488. Throughput: 0: 5871.1. Samples: 121538726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:01:08,320][25689] Avg episode reward: [(0, '-53.085')] [2022-07-09 06:01:08,679][26022] Updated weights on worker 0-0, policy_version 118689 (0.00090) [2022-07-09 06:01:10,307][26022] Updated weights on worker 0-0, policy_version 118699 (0.00086) [2022-07-09 06:01:12,367][26022] Updated weights on worker 0-0, policy_version 118709 (0.00092) [2022-07-09 06:01:13,324][25689] Fps is (10 sec: 5525.2, 60 sec: 5658.1, 300 sec: 5698.5). Total num frames: 121563136. Throughput: 0: 5890.2. Samples: 121573096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:01:13,325][25689] Avg episode reward: [(0, '-53.270')] [2022-07-09 06:01:13,905][26022] Updated weights on worker 0-0, policy_version 118719 (0.00053) [2022-07-09 06:01:15,933][26022] Updated weights on worker 0-0, policy_version 118729 (0.00085) [2022-07-09 06:01:17,366][26022] Updated weights on worker 0-0, policy_version 118739 (0.00094) [2022-07-09 06:01:18,339][25689] Fps is (10 sec: 5723.6, 60 sec: 5676.7, 300 sec: 5699.0). Total num frames: 121592832. Throughput: 0: 5018.9. Samples: 121590354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:18,339][25689] Avg episode reward: [(0, '-52.635')] [2022-07-09 06:01:19,388][26022] Updated weights on worker 0-0, policy_version 118749 (0.00085) [2022-07-09 06:01:21,027][26022] Updated weights on worker 0-0, policy_version 118759 (0.00083) [2022-07-09 06:01:22,917][26022] Updated weights on worker 0-0, policy_version 118769 (0.00083) [2022-07-09 06:01:23,463][25689] Fps is (10 sec: 5858.0, 60 sec: 5686.4, 300 sec: 5694.5). Total num frames: 121622528. Throughput: 0: 5862.3. Samples: 121624916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:23,464][25689] Avg episode reward: [(0, '-53.155')] [2022-07-09 06:01:24,616][26022] Updated weights on worker 0-0, policy_version 118779 (0.00083) [2022-07-09 06:01:26,467][26022] Updated weights on worker 0-0, policy_version 118789 (0.00089) [2022-07-09 06:01:28,274][26022] Updated weights on worker 0-0, policy_version 118799 (0.00088) [2022-07-09 06:01:28,489][25689] Fps is (10 sec: 5750.2, 60 sec: 5686.4, 300 sec: 5699.0). Total num frames: 121651200. Throughput: 0: 5972.9. Samples: 121659196. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:28,490][25689] Avg episode reward: [(0, '-53.685')] [2022-07-09 06:01:30,190][26022] Updated weights on worker 0-0, policy_version 118809 (0.00086) [2022-07-09 06:01:31,867][26022] Updated weights on worker 0-0, policy_version 118819 (0.00100) [2022-07-09 06:01:33,511][25689] Fps is (10 sec: 5605.1, 60 sec: 5668.5, 300 sec: 5692.7). Total num frames: 121678848. Throughput: 0: 5119.3. Samples: 121676438. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:33,511][25689] Avg episode reward: [(0, '-54.209')] [2022-07-09 06:01:33,658][26022] Updated weights on worker 0-0, policy_version 118829 (0.00087) [2022-07-09 06:01:35,465][26022] Updated weights on worker 0-0, policy_version 118839 (0.00080) [2022-07-09 06:01:37,047][26022] Updated weights on worker 0-0, policy_version 118849 (0.00089) [2022-07-09 06:01:38,543][25689] Fps is (10 sec: 5703.9, 60 sec: 5682.8, 300 sec: 5696.9). Total num frames: 121708544. Throughput: 0: 5981.7. Samples: 121711208. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:38,543][25689] Avg episode reward: [(0, '-54.897')] [2022-07-09 06:01:39,107][26022] Updated weights on worker 0-0, policy_version 118859 (0.00086) [2022-07-09 06:01:40,562][26022] Updated weights on worker 0-0, policy_version 118869 (0.00088) [2022-07-09 06:01:42,687][26022] Updated weights on worker 0-0, policy_version 118879 (0.00085) [2022-07-09 06:01:43,620][25689] Fps is (10 sec: 5976.4, 60 sec: 5714.9, 300 sec: 5702.7). Total num frames: 121739264. Throughput: 0: 5982.6. Samples: 121745508. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:43,621][25689] Avg episode reward: [(0, '-55.485')] [2022-07-09 06:01:44,286][26022] Updated weights on worker 0-0, policy_version 118889 (0.00077) [2022-07-09 06:01:46,189][26022] Updated weights on worker 0-0, policy_version 118899 (0.00069) [2022-07-09 06:01:47,942][26022] Updated weights on worker 0-0, policy_version 118909 (0.00091) [2022-07-09 06:01:48,669][25689] Fps is (10 sec: 5663.0, 60 sec: 5676.8, 300 sec: 5692.0). Total num frames: 121765888. Throughput: 0: 5127.8. Samples: 121762672. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:48,669][25689] Avg episode reward: [(0, '-55.021')] [2022-07-09 06:01:49,740][26022] Updated weights on worker 0-0, policy_version 118919 (0.00082) [2022-07-09 06:01:51,545][26022] Updated weights on worker 0-0, policy_version 118929 (0.00094) [2022-07-09 06:01:53,312][26022] Updated weights on worker 0-0, policy_version 118939 (0.00088) [2022-07-09 06:01:53,709][25689] Fps is (10 sec: 5683.9, 60 sec: 5707.7, 300 sec: 5698.3). Total num frames: 121796608. Throughput: 0: 5978.0. Samples: 121797182. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:53,710][25689] Avg episode reward: [(0, '-55.594')] [2022-07-09 06:01:55,104][26022] Updated weights on worker 0-0, policy_version 118949 (0.00096) [2022-07-09 06:01:56,856][26022] Updated weights on worker 0-0, policy_version 118959 (0.00103) [2022-07-09 06:01:58,588][26022] Updated weights on worker 0-0, policy_version 118969 (0.00090) [2022-07-09 06:01:58,802][25689] Fps is (10 sec: 5760.2, 60 sec: 5701.7, 300 sec: 5694.1). Total num frames: 121824256. Throughput: 0: 5942.9. Samples: 121831606. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:01:58,802][25689] Avg episode reward: [(0, '-55.894')] [2022-07-09 06:02:00,372][26022] Updated weights on worker 0-0, policy_version 118979 (0.00086) [2022-07-09 06:02:02,827][26022] Updated weights on worker 0-0, policy_version 118989 (0.00079) [2022-07-09 06:02:03,909][25689] Fps is (10 sec: 5320.7, 60 sec: 5661.4, 300 sec: 5688.7). Total num frames: 121850880. Throughput: 0: 5087.5. Samples: 121848728. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:03,910][25689] Avg episode reward: [(0, '-56.604')] [2022-07-09 06:02:04,339][26022] Updated weights on worker 0-0, policy_version 118999 (0.00092) [2022-07-09 06:02:06,397][26022] Updated weights on worker 0-0, policy_version 119009 (0.00084) [2022-07-09 06:02:08,013][26022] Updated weights on worker 0-0, policy_version 119019 (0.00091) [2022-07-09 06:02:08,924][25689] Fps is (10 sec: 5462.8, 60 sec: 5677.2, 300 sec: 5695.6). Total num frames: 121879552. Throughput: 0: 5815.9. Samples: 121880474. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:08,925][25689] Avg episode reward: [(0, '-57.022')] [2022-07-09 06:02:09,915][26022] Updated weights on worker 0-0, policy_version 119029 (0.00083) [2022-07-09 06:02:11,734][26022] Updated weights on worker 0-0, policy_version 119039 (0.00088) [2022-07-09 06:02:13,419][26022] Updated weights on worker 0-0, policy_version 119049 (0.00088) [2022-07-09 06:02:14,014][25689] Fps is (10 sec: 5776.3, 60 sec: 5703.0, 300 sec: 5687.8). Total num frames: 121909248. Throughput: 0: 5803.7. Samples: 121915024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:14,014][25689] Avg episode reward: [(0, '-56.792')] [2022-07-09 06:02:15,404][26022] Updated weights on worker 0-0, policy_version 119059 (0.00097) [2022-07-09 06:02:17,015][26022] Updated weights on worker 0-0, policy_version 119069 (0.00087) [2022-07-09 06:02:18,868][26022] Updated weights on worker 0-0, policy_version 119079 (0.00091) [2022-07-09 06:02:19,061][25689] Fps is (10 sec: 5656.8, 60 sec: 5666.2, 300 sec: 5684.7). Total num frames: 121936896. Throughput: 0: 4948.5. Samples: 121931860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:19,062][25689] Avg episode reward: [(0, '-56.934')] [2022-07-09 06:02:20,735][26022] Updated weights on worker 0-0, policy_version 119089 (0.00093) [2022-07-09 06:02:22,520][26022] Updated weights on worker 0-0, policy_version 119099 (0.00093) [2022-07-09 06:02:24,145][25689] Fps is (10 sec: 5660.2, 60 sec: 5670.0, 300 sec: 5686.7). Total num frames: 121966592. Throughput: 0: 5798.2. Samples: 121966060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:24,145][25689] Avg episode reward: [(0, '-56.407')] [2022-07-09 06:02:24,226][26022] Updated weights on worker 0-0, policy_version 119109 (0.00088) [2022-07-09 06:02:26,303][26022] Updated weights on worker 0-0, policy_version 119119 (0.00087) [2022-07-09 06:02:27,775][26022] Updated weights on worker 0-0, policy_version 119129 (0.00099) [2022-07-09 06:02:29,159][25689] Fps is (10 sec: 5679.0, 60 sec: 5654.2, 300 sec: 5686.7). Total num frames: 121994240. Throughput: 0: 5917.9. Samples: 122000220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:29,159][25689] Avg episode reward: [(0, '-55.866')] [2022-07-09 06:02:29,820][26022] Updated weights on worker 0-0, policy_version 119139 (0.00087) [2022-07-09 06:02:30,702][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:02:30,714][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000119146_122005504.pth [2022-07-09 06:02:30,714][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000117142_119953408.pth [2022-07-09 06:02:31,357][26022] Updated weights on worker 0-0, policy_version 119149 (0.00088) [2022-07-09 06:02:33,324][26022] Updated weights on worker 0-0, policy_version 119159 (0.00086) [2022-07-09 06:02:34,186][25689] Fps is (10 sec: 5710.9, 60 sec: 5687.5, 300 sec: 5686.5). Total num frames: 122023936. Throughput: 0: 5064.3. Samples: 122017182. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:34,187][25689] Avg episode reward: [(0, '-55.567')] [2022-07-09 06:02:35,044][26022] Updated weights on worker 0-0, policy_version 119169 (0.00091) [2022-07-09 06:02:36,938][26022] Updated weights on worker 0-0, policy_version 119179 (0.00084) [2022-07-09 06:02:38,472][26022] Updated weights on worker 0-0, policy_version 119189 (0.00083) [2022-07-09 06:02:39,192][25689] Fps is (10 sec: 5919.3, 60 sec: 5689.9, 300 sec: 5687.9). Total num frames: 122053632. Throughput: 0: 5961.7. Samples: 122051876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:39,193][25689] Avg episode reward: [(0, '-54.196')] [2022-07-09 06:02:40,498][26022] Updated weights on worker 0-0, policy_version 119199 (0.00083) [2022-07-09 06:02:42,122][26022] Updated weights on worker 0-0, policy_version 119209 (0.00090) [2022-07-09 06:02:43,907][26022] Updated weights on worker 0-0, policy_version 119219 (0.00094) [2022-07-09 06:02:44,254][25689] Fps is (10 sec: 5797.4, 60 sec: 5657.6, 300 sec: 5691.0). Total num frames: 122082304. Throughput: 0: 5980.3. Samples: 122086318. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:44,255][25689] Avg episode reward: [(0, '-53.913')] [2022-07-09 06:02:45,826][26022] Updated weights on worker 0-0, policy_version 119229 (0.00089) [2022-07-09 06:02:47,315][26022] Updated weights on worker 0-0, policy_version 119239 (0.00086) [2022-07-09 06:02:49,288][25689] Fps is (10 sec: 5578.4, 60 sec: 5675.8, 300 sec: 5688.0). Total num frames: 122109952. Throughput: 0: 5134.6. Samples: 122103578. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:49,289][25689] Avg episode reward: [(0, '-54.036')] [2022-07-09 06:02:49,413][26022] Updated weights on worker 0-0, policy_version 119249 (0.00086) [2022-07-09 06:02:50,887][26022] Updated weights on worker 0-0, policy_version 119259 (0.00086) [2022-07-09 06:02:52,955][26022] Updated weights on worker 0-0, policy_version 119269 (0.00391) [2022-07-09 06:02:54,308][25689] Fps is (10 sec: 5703.5, 60 sec: 5660.8, 300 sec: 5687.9). Total num frames: 122139648. Throughput: 0: 6008.4. Samples: 122138084. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:54,310][25689] Avg episode reward: [(0, '-54.404')] [2022-07-09 06:02:54,646][26022] Updated weights on worker 0-0, policy_version 119279 (0.00080) [2022-07-09 06:02:56,504][26022] Updated weights on worker 0-0, policy_version 119289 (0.00096) [2022-07-09 06:02:58,337][26022] Updated weights on worker 0-0, policy_version 119299 (0.00090) [2022-07-09 06:02:59,331][25689] Fps is (10 sec: 5710.3, 60 sec: 5667.4, 300 sec: 5692.4). Total num frames: 122167296. Throughput: 0: 5960.3. Samples: 122171906. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:02:59,332][25689] Avg episode reward: [(0, '-54.159')] [2022-07-09 06:03:00,227][26022] Updated weights on worker 0-0, policy_version 119309 (0.00087) [2022-07-09 06:03:02,163][26022] Updated weights on worker 0-0, policy_version 119319 (0.00073) [2022-07-09 06:03:04,103][26022] Updated weights on worker 0-0, policy_version 119329 (0.00052) [2022-07-09 06:03:04,410][25689] Fps is (10 sec: 5271.1, 60 sec: 5653.1, 300 sec: 5681.2). Total num frames: 122192896. Throughput: 0: 5816.1. Samples: 122203548. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:03:04,411][25689] Avg episode reward: [(0, '-54.216')] [2022-07-09 06:03:05,887][26022] Updated weights on worker 0-0, policy_version 119339 (0.00096) [2022-07-09 06:03:07,782][26022] Updated weights on worker 0-0, policy_version 119349 (0.00067) [2022-07-09 06:03:09,448][25689] Fps is (10 sec: 5465.3, 60 sec: 5667.8, 300 sec: 5684.6). Total num frames: 122222592. Throughput: 0: 5806.4. Samples: 122220634. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:03:09,450][25689] Avg episode reward: [(0, '-55.463')] [2022-07-09 06:03:09,586][26022] Updated weights on worker 0-0, policy_version 119359 (0.00084) [2022-07-09 06:03:11,594][26022] Updated weights on worker 0-0, policy_version 119369 (0.00083) [2022-07-09 06:03:13,062][26022] Updated weights on worker 0-0, policy_version 119379 (0.00091) [2022-07-09 06:03:14,479][25689] Fps is (10 sec: 5695.1, 60 sec: 5639.5, 300 sec: 5680.9). Total num frames: 122250240. Throughput: 0: 5774.4. Samples: 122254560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 06:03:14,480][25689] Avg episode reward: [(0, '-55.636')] [2022-07-09 06:03:15,187][26022] Updated weights on worker 0-0, policy_version 119389 (0.00088) [2022-07-09 06:03:16,632][26022] Updated weights on worker 0-0, policy_version 119399 (0.00091) [2022-07-09 06:03:18,683][26022] Updated weights on worker 0-0, policy_version 119409 (0.00089) [2022-07-09 06:03:19,507][25689] Fps is (10 sec: 5701.0, 60 sec: 5675.2, 300 sec: 5682.4). Total num frames: 122279936. Throughput: 0: 5819.8. Samples: 122289326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:19,507][25689] Avg episode reward: [(0, '-54.765')] [2022-07-09 06:03:20,204][26022] Updated weights on worker 0-0, policy_version 119419 (0.00084) [2022-07-09 06:03:22,131][26022] Updated weights on worker 0-0, policy_version 119429 (0.00093) [2022-07-09 06:03:23,820][26022] Updated weights on worker 0-0, policy_version 119439 (0.00086) [2022-07-09 06:03:24,565][25689] Fps is (10 sec: 5786.8, 60 sec: 5660.6, 300 sec: 5681.7). Total num frames: 122308608. Throughput: 0: 5115.8. Samples: 122306660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:24,566][25689] Avg episode reward: [(0, '-55.007')] [2022-07-09 06:03:25,890][26022] Updated weights on worker 0-0, policy_version 119449 (0.00098) [2022-07-09 06:03:27,487][26022] Updated weights on worker 0-0, policy_version 119459 (0.00090) [2022-07-09 06:03:29,351][26022] Updated weights on worker 0-0, policy_version 119469 (0.00086) [2022-07-09 06:03:29,588][25689] Fps is (10 sec: 5688.2, 60 sec: 5676.7, 300 sec: 5679.0). Total num frames: 122337280. Throughput: 0: 5970.9. Samples: 122340886. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:29,589][25689] Avg episode reward: [(0, '-54.233')] [2022-07-09 06:03:31,016][26022] Updated weights on worker 0-0, policy_version 119479 (0.00088) [2022-07-09 06:03:33,074][26022] Updated weights on worker 0-0, policy_version 119489 (0.00092) [2022-07-09 06:03:34,543][26022] Updated weights on worker 0-0, policy_version 119499 (0.00088) [2022-07-09 06:03:34,667][25689] Fps is (10 sec: 5778.2, 60 sec: 5671.9, 300 sec: 5685.7). Total num frames: 122366976. Throughput: 0: 5976.0. Samples: 122375202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:34,667][25689] Avg episode reward: [(0, '-54.853')] [2022-07-09 06:03:36,478][26022] Updated weights on worker 0-0, policy_version 119509 (0.00090) [2022-07-09 06:03:38,131][26022] Updated weights on worker 0-0, policy_version 119519 (0.00090) [2022-07-09 06:03:39,684][25689] Fps is (10 sec: 5680.2, 60 sec: 5637.1, 300 sec: 5679.6). Total num frames: 122394624. Throughput: 0: 5114.4. Samples: 122392518. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:39,684][25689] Avg episode reward: [(0, '-54.038')] [2022-07-09 06:03:40,014][26022] Updated weights on worker 0-0, policy_version 119529 (0.00086) [2022-07-09 06:03:41,746][26022] Updated weights on worker 0-0, policy_version 119539 (0.00090) [2022-07-09 06:03:43,580][26022] Updated weights on worker 0-0, policy_version 119549 (0.00087) [2022-07-09 06:03:44,744][25689] Fps is (10 sec: 5690.6, 60 sec: 5654.1, 300 sec: 5678.5). Total num frames: 122424320. Throughput: 0: 5966.8. Samples: 122427060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:44,744][25689] Avg episode reward: [(0, '-55.285')] [2022-07-09 06:03:45,378][26022] Updated weights on worker 0-0, policy_version 119559 (0.00607) [2022-07-09 06:03:47,211][26022] Updated weights on worker 0-0, policy_version 119569 (0.00084) [2022-07-09 06:03:48,984][26022] Updated weights on worker 0-0, policy_version 119579 (0.00868) [2022-07-09 06:03:49,754][25689] Fps is (10 sec: 5795.9, 60 sec: 5673.3, 300 sec: 5685.5). Total num frames: 122452992. Throughput: 0: 5961.0. Samples: 122461096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:49,755][25689] Avg episode reward: [(0, '-54.318')] [2022-07-09 06:03:50,871][26022] Updated weights on worker 0-0, policy_version 119589 (0.00104) [2022-07-09 06:03:52,640][26022] Updated weights on worker 0-0, policy_version 119599 (0.00089) [2022-07-09 06:03:54,406][26022] Updated weights on worker 0-0, policy_version 119609 (0.00090) [2022-07-09 06:03:54,770][25689] Fps is (10 sec: 5617.1, 60 sec: 5639.8, 300 sec: 5678.4). Total num frames: 122480640. Throughput: 0: 5109.1. Samples: 122477912. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:54,771][25689] Avg episode reward: [(0, '-54.378')] [2022-07-09 06:03:56,211][26022] Updated weights on worker 0-0, policy_version 119619 (0.00086) [2022-07-09 06:03:58,118][26022] Updated weights on worker 0-0, policy_version 119629 (0.00095) [2022-07-09 06:03:59,787][25689] Fps is (10 sec: 5715.4, 60 sec: 5674.1, 300 sec: 5682.9). Total num frames: 122510336. Throughput: 0: 5934.7. Samples: 122511828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:03:59,788][25689] Avg episode reward: [(0, '-54.079')] [2022-07-09 06:03:59,792][26022] Updated weights on worker 0-0, policy_version 119639 (0.00086) [2022-07-09 06:04:02,004][26022] Updated weights on worker 0-0, policy_version 119649 (0.00091) [2022-07-09 06:04:03,669][26022] Updated weights on worker 0-0, policy_version 119659 (0.00083) [2022-07-09 06:04:04,886][25689] Fps is (10 sec: 5567.5, 60 sec: 5689.3, 300 sec: 5678.1). Total num frames: 122536960. Throughput: 0: 5810.3. Samples: 122544092. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:04,888][25689] Avg episode reward: [(0, '-53.417')] [2022-07-09 06:04:05,531][26022] Updated weights on worker 0-0, policy_version 119669 (0.00090) [2022-07-09 06:04:07,546][26022] Updated weights on worker 0-0, policy_version 119679 (0.00083) [2022-07-09 06:04:09,172][26022] Updated weights on worker 0-0, policy_version 119689 (0.00097) [2022-07-09 06:04:09,908][25689] Fps is (10 sec: 5362.7, 60 sec: 5657.0, 300 sec: 5674.5). Total num frames: 122564608. Throughput: 0: 4971.1. Samples: 122561280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:09,908][25689] Avg episode reward: [(0, '-53.612')] [2022-07-09 06:04:10,955][26022] Updated weights on worker 0-0, policy_version 119699 (0.00091) [2022-07-09 06:04:12,718][26022] Updated weights on worker 0-0, policy_version 119709 (0.00088) [2022-07-09 06:04:14,408][26022] Updated weights on worker 0-0, policy_version 119719 (0.00084) [2022-07-09 06:04:14,939][25689] Fps is (10 sec: 5602.6, 60 sec: 5673.9, 300 sec: 5677.9). Total num frames: 122593280. Throughput: 0: 5825.7. Samples: 122595406. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:14,939][25689] Avg episode reward: [(0, '-53.799')] [2022-07-09 06:04:16,287][26022] Updated weights on worker 0-0, policy_version 119729 (0.00086) [2022-07-09 06:04:18,163][26022] Updated weights on worker 0-0, policy_version 119739 (0.00055) [2022-07-09 06:04:19,944][25689] Fps is (10 sec: 5815.7, 60 sec: 5676.0, 300 sec: 5678.9). Total num frames: 122622976. Throughput: 0: 5850.0. Samples: 122629744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:19,945][25689] Avg episode reward: [(0, '-53.984')] [2022-07-09 06:04:19,949][26022] Updated weights on worker 0-0, policy_version 119749 (0.00088) [2022-07-09 06:04:21,638][26022] Updated weights on worker 0-0, policy_version 119759 (0.00094) [2022-07-09 06:04:23,511][26022] Updated weights on worker 0-0, policy_version 119769 (0.00095) [2022-07-09 06:04:24,996][25689] Fps is (10 sec: 5701.8, 60 sec: 5659.7, 300 sec: 5674.7). Total num frames: 122650624. Throughput: 0: 5106.4. Samples: 122646780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:24,998][25689] Avg episode reward: [(0, '-54.746')] [2022-07-09 06:04:25,462][26022] Updated weights on worker 0-0, policy_version 119779 (0.00087) [2022-07-09 06:04:27,172][26022] Updated weights on worker 0-0, policy_version 119789 (0.00089) [2022-07-09 06:04:28,960][26022] Updated weights on worker 0-0, policy_version 119799 (0.00092) [2022-07-09 06:04:30,005][25689] Fps is (10 sec: 5597.8, 60 sec: 5660.9, 300 sec: 5674.7). Total num frames: 122679296. Throughput: 0: 5948.9. Samples: 122680838. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:30,006][25689] Avg episode reward: [(0, '-56.057')] [2022-07-09 06:04:30,696][26022] Updated weights on worker 0-0, policy_version 119809 (0.00091) [2022-07-09 06:04:30,924][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:04:30,936][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000119810_122685440.pth [2022-07-09 06:04:30,937][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000117811_120638464.pth [2022-07-09 06:04:32,584][26022] Updated weights on worker 0-0, policy_version 119819 (0.00086) [2022-07-09 06:04:34,443][26022] Updated weights on worker 0-0, policy_version 119829 (0.00087) [2022-07-09 06:04:35,022][25689] Fps is (10 sec: 5821.4, 60 sec: 5666.7, 300 sec: 5678.4). Total num frames: 122708992. Throughput: 0: 5979.4. Samples: 122715494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:35,023][25689] Avg episode reward: [(0, '-55.515')] [2022-07-09 06:04:36,127][26022] Updated weights on worker 0-0, policy_version 119839 (0.00058) [2022-07-09 06:04:37,783][26022] Updated weights on worker 0-0, policy_version 119849 (0.00085) [2022-07-09 06:04:39,591][26022] Updated weights on worker 0-0, policy_version 119859 (0.00083) [2022-07-09 06:04:40,056][25689] Fps is (10 sec: 5807.5, 60 sec: 5682.1, 300 sec: 5675.6). Total num frames: 122737664. Throughput: 0: 5121.6. Samples: 122732748. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:40,057][25689] Avg episode reward: [(0, '-55.698')] [2022-07-09 06:04:41,468][26022] Updated weights on worker 0-0, policy_version 119869 (0.00092) [2022-07-09 06:04:43,196][26022] Updated weights on worker 0-0, policy_version 119879 (0.00084) [2022-07-09 06:04:44,959][26022] Updated weights on worker 0-0, policy_version 119889 (0.00087) [2022-07-09 06:04:45,147][25689] Fps is (10 sec: 5764.9, 60 sec: 5679.2, 300 sec: 5680.9). Total num frames: 122767360. Throughput: 0: 5994.4. Samples: 122767574. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:45,148][25689] Avg episode reward: [(0, '-55.089')] [2022-07-09 06:04:46,874][26022] Updated weights on worker 0-0, policy_version 119899 (0.00080) [2022-07-09 06:04:48,555][26022] Updated weights on worker 0-0, policy_version 119909 (0.00083) [2022-07-09 06:04:50,105][26022] Updated weights on worker 0-0, policy_version 119919 (0.00093) [2022-07-09 06:04:50,215][25689] Fps is (10 sec: 5846.1, 60 sec: 5690.7, 300 sec: 5679.8). Total num frames: 122797056. Throughput: 0: 6007.3. Samples: 122802244. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:50,215][25689] Avg episode reward: [(0, '-55.260')] [2022-07-09 06:04:52,234][26022] Updated weights on worker 0-0, policy_version 119929 (0.00091) [2022-07-09 06:04:53,755][26022] Updated weights on worker 0-0, policy_version 119939 (0.00087) [2022-07-09 06:04:55,249][25689] Fps is (10 sec: 5676.6, 60 sec: 5689.0, 300 sec: 5679.3). Total num frames: 122824704. Throughput: 0: 5143.9. Samples: 122819538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:04:55,249][25689] Avg episode reward: [(0, '-54.678')] [2022-07-09 06:04:55,698][26022] Updated weights on worker 0-0, policy_version 119949 (0.00088) [2022-07-09 06:04:57,421][26022] Updated weights on worker 0-0, policy_version 119959 (0.00084) [2022-07-09 06:04:59,491][26022] Updated weights on worker 0-0, policy_version 119969 (0.00085) [2022-07-09 06:05:00,312][25689] Fps is (10 sec: 5476.3, 60 sec: 5650.9, 300 sec: 5679.8). Total num frames: 122852352. Throughput: 0: 5975.0. Samples: 122853780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:05:00,313][25689] Avg episode reward: [(0, '-54.523')] [2022-07-09 06:05:00,999][26022] Updated weights on worker 0-0, policy_version 119979 (0.00090) [2022-07-09 06:05:03,132][26022] Updated weights on worker 0-0, policy_version 119989 (0.00086) [2022-07-09 06:05:04,907][26022] Updated weights on worker 0-0, policy_version 119999 (0.00094) [2022-07-09 06:05:05,376][25689] Fps is (10 sec: 5561.5, 60 sec: 5688.0, 300 sec: 5678.9). Total num frames: 122881024. Throughput: 0: 5863.7. Samples: 122886188. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:05:05,376][25689] Avg episode reward: [(0, '-55.421')] [2022-07-09 06:05:06,837][26022] Updated weights on worker 0-0, policy_version 120009 (0.00094) [2022-07-09 06:05:08,480][26022] Updated weights on worker 0-0, policy_version 120019 (0.00092) [2022-07-09 06:05:10,370][26022] Updated weights on worker 0-0, policy_version 120029 (0.00057) [2022-07-09 06:05:10,416][25689] Fps is (10 sec: 5675.4, 60 sec: 5703.1, 300 sec: 5675.2). Total num frames: 122909696. Throughput: 0: 5009.5. Samples: 122903442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:05:10,417][25689] Avg episode reward: [(0, '-55.590')] [2022-07-09 06:05:11,907][26022] Updated weights on worker 0-0, policy_version 120039 (0.00089) [2022-07-09 06:05:13,788][26022] Updated weights on worker 0-0, policy_version 120049 (0.00092) [2022-07-09 06:05:15,436][25689] Fps is (10 sec: 5700.2, 60 sec: 5704.2, 300 sec: 5675.4). Total num frames: 122938368. Throughput: 0: 5870.1. Samples: 122938036. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:05:15,436][25689] Avg episode reward: [(0, '-55.036')] [2022-07-09 06:05:15,639][26022] Updated weights on worker 0-0, policy_version 120059 (0.00089) [2022-07-09 06:05:17,482][26022] Updated weights on worker 0-0, policy_version 120069 (0.00087) [2022-07-09 06:05:19,272][26022] Updated weights on worker 0-0, policy_version 120079 (0.00083) [2022-07-09 06:05:20,499][25689] Fps is (10 sec: 5687.1, 60 sec: 5681.8, 300 sec: 5675.1). Total num frames: 122967040. Throughput: 0: 5872.3. Samples: 122972326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 06:05:20,501][25689] Avg episode reward: [(0, '-54.398')] [2022-07-09 06:05:20,948][26022] Updated weights on worker 0-0, policy_version 120089 (0.00089) [2022-07-09 06:05:22,730][26022] Updated weights on worker 0-0, policy_version 120099 (0.00087) [2022-07-09 06:05:24,791][26022] Updated weights on worker 0-0, policy_version 120109 (0.00106) [2022-07-09 06:05:25,577][25689] Fps is (10 sec: 5755.4, 60 sec: 5713.2, 300 sec: 5677.5). Total num frames: 122996736. Throughput: 0: 5114.2. Samples: 122989502. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:25,578][25689] Avg episode reward: [(0, '-53.766')] [2022-07-09 06:05:26,171][26022] Updated weights on worker 0-0, policy_version 120119 (0.00089) [2022-07-09 06:05:28,392][26022] Updated weights on worker 0-0, policy_version 120129 (0.00084) [2022-07-09 06:05:29,843][26022] Updated weights on worker 0-0, policy_version 120139 (0.00082) [2022-07-09 06:05:30,659][25689] Fps is (10 sec: 5745.1, 60 sec: 5706.4, 300 sec: 5676.2). Total num frames: 123025408. Throughput: 0: 5948.3. Samples: 123023852. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:30,661][25689] Avg episode reward: [(0, '-53.621')] [2022-07-09 06:05:31,794][26022] Updated weights on worker 0-0, policy_version 120149 (0.00087) [2022-07-09 06:05:33,572][26022] Updated weights on worker 0-0, policy_version 120159 (0.00089) [2022-07-09 06:05:35,271][26022] Updated weights on worker 0-0, policy_version 120169 (0.00089) [2022-07-09 06:05:35,716][25689] Fps is (10 sec: 5756.9, 60 sec: 5702.6, 300 sec: 5678.6). Total num frames: 123055104. Throughput: 0: 5930.8. Samples: 123058314. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:35,716][25689] Avg episode reward: [(0, '-53.448')] [2022-07-09 06:05:37,179][26022] Updated weights on worker 0-0, policy_version 120179 (0.00097) [2022-07-09 06:05:38,876][26022] Updated weights on worker 0-0, policy_version 120189 (0.00492) [2022-07-09 06:05:40,525][26022] Updated weights on worker 0-0, policy_version 120199 (0.00083) [2022-07-09 06:05:40,739][25689] Fps is (10 sec: 5892.2, 60 sec: 5720.5, 300 sec: 5682.7). Total num frames: 123084800. Throughput: 0: 5970.8. Samples: 123093172. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:40,740][25689] Avg episode reward: [(0, '-53.604')] [2022-07-09 06:05:42,555][26022] Updated weights on worker 0-0, policy_version 120209 (0.00087) [2022-07-09 06:05:44,139][26022] Updated weights on worker 0-0, policy_version 120219 (0.00092) [2022-07-09 06:05:45,795][25689] Fps is (10 sec: 5689.6, 60 sec: 5690.1, 300 sec: 5678.3). Total num frames: 123112448. Throughput: 0: 5980.9. Samples: 123110420. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:45,795][25689] Avg episode reward: [(0, '-53.462')] [2022-07-09 06:05:45,998][26022] Updated weights on worker 0-0, policy_version 120229 (0.00088) [2022-07-09 06:05:47,883][26022] Updated weights on worker 0-0, policy_version 120239 (0.00080) [2022-07-09 06:05:49,636][26022] Updated weights on worker 0-0, policy_version 120249 (0.00082) [2022-07-09 06:05:50,797][25689] Fps is (10 sec: 5701.1, 60 sec: 5696.2, 300 sec: 5681.9). Total num frames: 123142144. Throughput: 0: 5990.6. Samples: 123144492. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:50,797][25689] Avg episode reward: [(0, '-53.719')] [2022-07-09 06:05:51,657][26022] Updated weights on worker 0-0, policy_version 120259 (0.00085) [2022-07-09 06:05:53,339][26022] Updated weights on worker 0-0, policy_version 120269 (0.00086) [2022-07-09 06:05:55,119][26022] Updated weights on worker 0-0, policy_version 120279 (0.00088) [2022-07-09 06:05:55,835][25689] Fps is (10 sec: 5813.5, 60 sec: 5712.8, 300 sec: 5685.1). Total num frames: 123170816. Throughput: 0: 5993.6. Samples: 123178898. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:05:55,835][25689] Avg episode reward: [(0, '-54.108')] [2022-07-09 06:05:56,841][26022] Updated weights on worker 0-0, policy_version 120289 (0.00082) [2022-07-09 06:05:58,644][26022] Updated weights on worker 0-0, policy_version 120299 (0.00091) [2022-07-09 06:06:00,479][26022] Updated weights on worker 0-0, policy_version 120309 (0.00095) [2022-07-09 06:06:00,842][25689] Fps is (10 sec: 5709.0, 60 sec: 5735.0, 300 sec: 5685.7). Total num frames: 123199488. Throughput: 0: 5106.2. Samples: 123195822. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:00,842][25689] Avg episode reward: [(0, '-54.095')] [2022-07-09 06:06:02,644][26022] Updated weights on worker 0-0, policy_version 120319 (0.00090) [2022-07-09 06:06:04,322][26022] Updated weights on worker 0-0, policy_version 120329 (0.00085) [2022-07-09 06:06:05,869][25689] Fps is (10 sec: 5408.4, 60 sec: 5687.6, 300 sec: 5678.4). Total num frames: 123225088. Throughput: 0: 5856.7. Samples: 123227992. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:05,870][25689] Avg episode reward: [(0, '-53.664')] [2022-07-09 06:06:06,262][26022] Updated weights on worker 0-0, policy_version 120339 (0.00093) [2022-07-09 06:06:07,962][26022] Updated weights on worker 0-0, policy_version 120349 (0.00091) [2022-07-09 06:06:09,756][26022] Updated weights on worker 0-0, policy_version 120359 (0.00085) [2022-07-09 06:06:10,891][25689] Fps is (10 sec: 5298.8, 60 sec: 5672.5, 300 sec: 5678.1). Total num frames: 123252736. Throughput: 0: 5872.9. Samples: 123262500. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:10,891][25689] Avg episode reward: [(0, '-54.690')] [2022-07-09 06:06:11,628][26022] Updated weights on worker 0-0, policy_version 120369 (0.00088) [2022-07-09 06:06:13,439][26022] Updated weights on worker 0-0, policy_version 120379 (0.00092) [2022-07-09 06:06:15,199][26022] Updated weights on worker 0-0, policy_version 120389 (0.00085) [2022-07-09 06:06:15,897][25689] Fps is (10 sec: 5820.8, 60 sec: 5707.6, 300 sec: 5681.7). Total num frames: 123283456. Throughput: 0: 5017.5. Samples: 123279560. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:15,897][25689] Avg episode reward: [(0, '-54.521')] [2022-07-09 06:06:17,129][26022] Updated weights on worker 0-0, policy_version 120399 (0.00088) [2022-07-09 06:06:18,556][26022] Updated weights on worker 0-0, policy_version 120409 (0.00088) [2022-07-09 06:06:20,648][26022] Updated weights on worker 0-0, policy_version 120419 (0.00094) [2022-07-09 06:06:20,901][25689] Fps is (10 sec: 5728.5, 60 sec: 5679.3, 300 sec: 5673.6). Total num frames: 123310080. Throughput: 0: 5898.5. Samples: 123314142. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:20,911][25689] Avg episode reward: [(0, '-54.852')] [2022-07-09 06:06:22,331][26022] Updated weights on worker 0-0, policy_version 120429 (0.00104) [2022-07-09 06:06:23,900][26022] Updated weights on worker 0-0, policy_version 120439 (0.00086) [2022-07-09 06:06:26,043][25689] Fps is (10 sec: 5449.9, 60 sec: 5656.3, 300 sec: 5671.4). Total num frames: 123338752. Throughput: 0: 5982.3. Samples: 123348676. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:26,044][25689] Avg episode reward: [(0, '-54.884')] [2022-07-09 06:06:26,060][26022] Updated weights on worker 0-0, policy_version 120449 (0.00093) [2022-07-09 06:06:27,579][26022] Updated weights on worker 0-0, policy_version 120459 (0.00108) [2022-07-09 06:06:29,409][26022] Updated weights on worker 0-0, policy_version 120469 (0.00383) [2022-07-09 06:06:31,060][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:06:31,070][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000120478_123369472.pth [2022-07-09 06:06:31,070][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000118479_121322496.pth [2022-07-09 06:06:31,071][25689] Fps is (10 sec: 5839.8, 60 sec: 5695.3, 300 sec: 5681.6). Total num frames: 123369472. Throughput: 0: 5119.7. Samples: 123365818. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:31,072][25689] Avg episode reward: [(0, '-54.891')] [2022-07-09 06:06:31,450][26022] Updated weights on worker 0-0, policy_version 120479 (0.00093) [2022-07-09 06:06:32,944][26022] Updated weights on worker 0-0, policy_version 120489 (0.00086) [2022-07-09 06:06:35,019][26022] Updated weights on worker 0-0, policy_version 120499 (0.00091) [2022-07-09 06:06:36,135][25689] Fps is (10 sec: 5885.3, 60 sec: 5677.7, 300 sec: 5677.6). Total num frames: 123398144. Throughput: 0: 5967.4. Samples: 123400326. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:36,135][25689] Avg episode reward: [(0, '-54.787')] [2022-07-09 06:06:36,608][26022] Updated weights on worker 0-0, policy_version 120509 (0.00055) [2022-07-09 06:06:38,383][26022] Updated weights on worker 0-0, policy_version 120519 (0.00090) [2022-07-09 06:06:40,263][26022] Updated weights on worker 0-0, policy_version 120529 (0.00093) [2022-07-09 06:06:41,138][25689] Fps is (10 sec: 5797.8, 60 sec: 5679.5, 300 sec: 5675.5). Total num frames: 123427840. Throughput: 0: 5958.8. Samples: 123434732. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:41,139][25689] Avg episode reward: [(0, '-54.645')] [2022-07-09 06:06:42,017][26022] Updated weights on worker 0-0, policy_version 120539 (0.00083) [2022-07-09 06:06:43,711][26022] Updated weights on worker 0-0, policy_version 120549 (0.00079) [2022-07-09 06:06:45,649][26022] Updated weights on worker 0-0, policy_version 120559 (0.00418) [2022-07-09 06:06:46,234][25689] Fps is (10 sec: 5678.0, 60 sec: 5675.8, 300 sec: 5678.1). Total num frames: 123455488. Throughput: 0: 5103.2. Samples: 123451714. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:46,236][25689] Avg episode reward: [(0, '-54.422')] [2022-07-09 06:06:47,270][26022] Updated weights on worker 0-0, policy_version 120569 (0.00090) [2022-07-09 06:06:49,236][26022] Updated weights on worker 0-0, policy_version 120579 (0.00091) [2022-07-09 06:06:50,923][26022] Updated weights on worker 0-0, policy_version 120589 (0.00087) [2022-07-09 06:06:51,273][25689] Fps is (10 sec: 5557.3, 60 sec: 5655.4, 300 sec: 5671.2). Total num frames: 123484160. Throughput: 0: 5955.4. Samples: 123486128. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:51,275][25689] Avg episode reward: [(0, '-54.204')] [2022-07-09 06:06:52,824][26022] Updated weights on worker 0-0, policy_version 120599 (0.00106) [2022-07-09 06:06:54,486][26022] Updated weights on worker 0-0, policy_version 120609 (0.00092) [2022-07-09 06:06:56,281][25689] Fps is (10 sec: 5707.8, 60 sec: 5658.2, 300 sec: 5676.3). Total num frames: 123512832. Throughput: 0: 5984.0. Samples: 123520880. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:06:56,281][25689] Avg episode reward: [(0, '-54.407')] [2022-07-09 06:06:56,412][26022] Updated weights on worker 0-0, policy_version 120619 (0.00085) [2022-07-09 06:06:57,970][26022] Updated weights on worker 0-0, policy_version 120629 (0.00095) [2022-07-09 06:06:59,942][26022] Updated weights on worker 0-0, policy_version 120639 (0.00102) [2022-07-09 06:07:01,302][25689] Fps is (10 sec: 5717.5, 60 sec: 5656.8, 300 sec: 5684.8). Total num frames: 123541504. Throughput: 0: 5110.9. Samples: 123537788. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:07:01,304][25689] Avg episode reward: [(0, '-54.922')] [2022-07-09 06:07:01,767][26022] Updated weights on worker 0-0, policy_version 120649 (0.00086) [2022-07-09 06:07:03,800][26022] Updated weights on worker 0-0, policy_version 120659 (0.00092) [2022-07-09 06:07:05,795][26022] Updated weights on worker 0-0, policy_version 120669 (0.00081) [2022-07-09 06:07:06,400][25689] Fps is (10 sec: 5666.6, 60 sec: 5701.0, 300 sec: 5683.2). Total num frames: 123570176. Throughput: 0: 5856.4. Samples: 123569816. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:07:06,401][25689] Avg episode reward: [(0, '-56.006')] [2022-07-09 06:07:07,460][26022] Updated weights on worker 0-0, policy_version 120679 (0.00091) [2022-07-09 06:07:09,363][26022] Updated weights on worker 0-0, policy_version 120689 (0.00092) [2022-07-09 06:07:11,140][26022] Updated weights on worker 0-0, policy_version 120699 (0.00091) [2022-07-09 06:07:11,404][25689] Fps is (10 sec: 5473.8, 60 sec: 5685.7, 300 sec: 5674.5). Total num frames: 123596800. Throughput: 0: 5859.0. Samples: 123604078. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:07:11,405][25689] Avg episode reward: [(0, '-55.697')] [2022-07-09 06:07:12,839][26022] Updated weights on worker 0-0, policy_version 120709 (0.00087) [2022-07-09 06:07:14,711][26022] Updated weights on worker 0-0, policy_version 120719 (0.00083) [2022-07-09 06:07:16,419][25689] Fps is (10 sec: 5519.4, 60 sec: 5651.1, 300 sec: 5678.6). Total num frames: 123625472. Throughput: 0: 4982.6. Samples: 123621220. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:07:16,420][25689] Avg episode reward: [(0, '-55.952')] [2022-07-09 06:07:16,698][26022] Updated weights on worker 0-0, policy_version 120729 (0.00090) [2022-07-09 06:07:18,102][26022] Updated weights on worker 0-0, policy_version 120739 (0.00086) [2022-07-09 06:07:20,031][26022] Updated weights on worker 0-0, policy_version 120749 (0.00089) [2022-07-09 06:07:21,424][25689] Fps is (10 sec: 5927.6, 60 sec: 5718.7, 300 sec: 5683.5). Total num frames: 123656192. Throughput: 0: 5868.1. Samples: 123655864. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-09 06:07:21,425][25689] Avg episode reward: [(0, '-55.525')] [2022-07-09 06:07:21,743][26022] Updated weights on worker 0-0, policy_version 120759 (0.00083) [2022-07-09 06:07:23,574][26022] Updated weights on worker 0-0, policy_version 120769 (0.00092) [2022-07-09 06:07:25,482][26022] Updated weights on worker 0-0, policy_version 120779 (0.00085) [2022-07-09 06:07:26,545][25689] Fps is (10 sec: 5663.0, 60 sec: 5686.8, 300 sec: 5678.1). Total num frames: 123682816. Throughput: 0: 5959.0. Samples: 123689858. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:26,545][25689] Avg episode reward: [(0, '-54.642')] [2022-07-09 06:07:26,994][26022] Updated weights on worker 0-0, policy_version 120789 (0.00076) [2022-07-09 06:07:29,086][26022] Updated weights on worker 0-0, policy_version 120799 (0.00090) [2022-07-09 06:07:30,796][26022] Updated weights on worker 0-0, policy_version 120809 (0.00290) [2022-07-09 06:07:31,616][25689] Fps is (10 sec: 5526.1, 60 sec: 5665.9, 300 sec: 5677.2). Total num frames: 123712512. Throughput: 0: 5091.6. Samples: 123706986. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:31,616][25689] Avg episode reward: [(0, '-54.958')] [2022-07-09 06:07:32,594][26022] Updated weights on worker 0-0, policy_version 120819 (0.00090) [2022-07-09 06:07:34,342][26022] Updated weights on worker 0-0, policy_version 120829 (0.00089) [2022-07-09 06:07:36,117][26022] Updated weights on worker 0-0, policy_version 120839 (0.00086) [2022-07-09 06:07:36,626][25689] Fps is (10 sec: 5891.6, 60 sec: 5687.9, 300 sec: 5677.2). Total num frames: 123742208. Throughput: 0: 5954.0. Samples: 123741532. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:36,626][25689] Avg episode reward: [(0, '-54.733')] [2022-07-09 06:07:37,891][26022] Updated weights on worker 0-0, policy_version 120849 (0.00356) [2022-07-09 06:07:39,617][26022] Updated weights on worker 0-0, policy_version 120859 (0.00084) [2022-07-09 06:07:41,633][25689] Fps is (10 sec: 5724.7, 60 sec: 5653.7, 300 sec: 5674.8). Total num frames: 123769856. Throughput: 0: 5950.4. Samples: 123776114. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:41,633][25689] Avg episode reward: [(0, '-54.846')] [2022-07-09 06:07:41,637][26022] Updated weights on worker 0-0, policy_version 120869 (0.00209) [2022-07-09 06:07:43,265][26022] Updated weights on worker 0-0, policy_version 120879 (0.00088) [2022-07-09 06:07:45,108][26022] Updated weights on worker 0-0, policy_version 120889 (0.01138) [2022-07-09 06:07:46,712][25689] Fps is (10 sec: 5685.6, 60 sec: 5689.1, 300 sec: 5680.8). Total num frames: 123799552. Throughput: 0: 5145.0. Samples: 123793616. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:46,712][25689] Avg episode reward: [(0, '-54.923')] [2022-07-09 06:07:46,727][26022] Updated weights on worker 0-0, policy_version 120899 (0.00091) [2022-07-09 06:07:48,571][26022] Updated weights on worker 0-0, policy_version 120909 (0.00090) [2022-07-09 06:07:50,324][26022] Updated weights on worker 0-0, policy_version 120919 (0.00095) [2022-07-09 06:07:51,746][25689] Fps is (10 sec: 5771.3, 60 sec: 5689.5, 300 sec: 5677.1). Total num frames: 123828224. Throughput: 0: 6024.1. Samples: 123828254. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:51,747][25689] Avg episode reward: [(0, '-55.865')] [2022-07-09 06:07:52,243][26022] Updated weights on worker 0-0, policy_version 120929 (0.00095) [2022-07-09 06:07:53,715][26022] Updated weights on worker 0-0, policy_version 120939 (0.00082) [2022-07-09 06:07:55,809][26022] Updated weights on worker 0-0, policy_version 120949 (0.00103) [2022-07-09 06:07:56,764][25689] Fps is (10 sec: 5806.5, 60 sec: 5705.5, 300 sec: 5684.0). Total num frames: 123857920. Throughput: 0: 6008.5. Samples: 123862532. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:07:56,765][25689] Avg episode reward: [(0, '-55.949')] [2022-07-09 06:07:57,304][26022] Updated weights on worker 0-0, policy_version 120959 (0.00087) [2022-07-09 06:07:59,333][26022] Updated weights on worker 0-0, policy_version 120969 (0.00090) [2022-07-09 06:08:00,995][26022] Updated weights on worker 0-0, policy_version 120979 (0.00081) [2022-07-09 06:08:01,775][25689] Fps is (10 sec: 5820.0, 60 sec: 5706.5, 300 sec: 5695.7). Total num frames: 123886592. Throughput: 0: 5149.7. Samples: 123879842. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:01,776][25689] Avg episode reward: [(0, '-55.681')] [2022-07-09 06:08:03,156][26022] Updated weights on worker 0-0, policy_version 120989 (0.00083) [2022-07-09 06:08:04,951][26022] Updated weights on worker 0-0, policy_version 120999 (0.00094) [2022-07-09 06:08:06,728][26022] Updated weights on worker 0-0, policy_version 121009 (0.00083) [2022-07-09 06:08:06,831][25689] Fps is (10 sec: 5492.9, 60 sec: 5676.6, 300 sec: 5685.0). Total num frames: 123913216. Throughput: 0: 5904.5. Samples: 123912408. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:06,833][25689] Avg episode reward: [(0, '-56.289')] [2022-07-09 06:08:08,664][26022] Updated weights on worker 0-0, policy_version 121019 (0.00088) [2022-07-09 06:08:10,597][26022] Updated weights on worker 0-0, policy_version 121029 (0.00086) [2022-07-09 06:08:11,850][25689] Fps is (10 sec: 5590.1, 60 sec: 5726.0, 300 sec: 5692.1). Total num frames: 123942912. Throughput: 0: 5872.8. Samples: 123946320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:11,851][25689] Avg episode reward: [(0, '-55.910')] [2022-07-09 06:08:12,041][26022] Updated weights on worker 0-0, policy_version 121039 (0.00083) [2022-07-09 06:08:14,029][26022] Updated weights on worker 0-0, policy_version 121049 (0.00091) [2022-07-09 06:08:16,039][26022] Updated weights on worker 0-0, policy_version 121059 (0.00091) [2022-07-09 06:08:16,872][25689] Fps is (10 sec: 5609.0, 60 sec: 5691.4, 300 sec: 5681.9). Total num frames: 123969536. Throughput: 0: 5020.4. Samples: 123963482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:16,873][25689] Avg episode reward: [(0, '-56.617')] [2022-07-09 06:08:17,645][26022] Updated weights on worker 0-0, policy_version 121069 (0.00089) [2022-07-09 06:08:19,630][26022] Updated weights on worker 0-0, policy_version 121079 (0.00096) [2022-07-09 06:08:20,868][26022] Updated weights on worker 0-0, policy_version 121089 (0.00099) [2022-07-09 06:08:21,906][25689] Fps is (10 sec: 5498.9, 60 sec: 5654.9, 300 sec: 5682.4). Total num frames: 123998208. Throughput: 0: 5851.2. Samples: 123997630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:21,908][25689] Avg episode reward: [(0, '-56.302')] [2022-07-09 06:08:23,164][26022] Updated weights on worker 0-0, policy_version 121099 (0.00084) [2022-07-09 06:08:24,814][26022] Updated weights on worker 0-0, policy_version 121109 (0.00085) [2022-07-09 06:08:26,559][26022] Updated weights on worker 0-0, policy_version 121119 (0.00082) [2022-07-09 06:08:26,958][25689] Fps is (10 sec: 5888.3, 60 sec: 5729.1, 300 sec: 5688.7). Total num frames: 124028928. Throughput: 0: 5948.4. Samples: 124032134. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:26,959][25689] Avg episode reward: [(0, '-56.143')] [2022-07-09 06:08:28,435][26022] Updated weights on worker 0-0, policy_version 121129 (0.00087) [2022-07-09 06:08:30,064][26022] Updated weights on worker 0-0, policy_version 121139 (0.00085) [2022-07-09 06:08:31,316][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:08:31,326][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000121144_124051456.pth [2022-07-09 06:08:31,327][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000119146_122005504.pth [2022-07-09 06:08:31,970][25689] Fps is (10 sec: 5697.8, 60 sec: 5683.8, 300 sec: 5679.6). Total num frames: 124055552. Throughput: 0: 5117.3. Samples: 124049280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:31,971][25689] Avg episode reward: [(0, '-56.437')] [2022-07-09 06:08:32,104][26022] Updated weights on worker 0-0, policy_version 121149 (0.00087) [2022-07-09 06:08:33,767][26022] Updated weights on worker 0-0, policy_version 121159 (0.00088) [2022-07-09 06:08:35,632][26022] Updated weights on worker 0-0, policy_version 121169 (0.00092) [2022-07-09 06:08:36,975][25689] Fps is (10 sec: 5622.4, 60 sec: 5684.2, 300 sec: 5686.8). Total num frames: 124085248. Throughput: 0: 5975.6. Samples: 124083612. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:36,976][25689] Avg episode reward: [(0, '-56.410')] [2022-07-09 06:08:37,132][26022] Updated weights on worker 0-0, policy_version 121179 (0.00082) [2022-07-09 06:08:39,255][26022] Updated weights on worker 0-0, policy_version 121189 (0.00085) [2022-07-09 06:08:40,715][26022] Updated weights on worker 0-0, policy_version 121199 (0.00088) [2022-07-09 06:08:41,977][25689] Fps is (10 sec: 5628.3, 60 sec: 5667.8, 300 sec: 5677.5). Total num frames: 124111872. Throughput: 0: 5996.4. Samples: 124117982. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:41,977][25689] Avg episode reward: [(0, '-55.700')] [2022-07-09 06:08:42,804][26022] Updated weights on worker 0-0, policy_version 121209 (0.00085) [2022-07-09 06:08:44,348][26022] Updated weights on worker 0-0, policy_version 121219 (0.00087) [2022-07-09 06:08:46,355][26022] Updated weights on worker 0-0, policy_version 121229 (0.00085) [2022-07-09 06:08:47,093][25689] Fps is (10 sec: 5768.5, 60 sec: 5698.1, 300 sec: 5685.8). Total num frames: 124143616. Throughput: 0: 5112.0. Samples: 124135066. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:47,094][25689] Avg episode reward: [(0, '-55.773')] [2022-07-09 06:08:48,109][26022] Updated weights on worker 0-0, policy_version 121239 (0.00089) [2022-07-09 06:08:49,855][26022] Updated weights on worker 0-0, policy_version 121249 (0.00284) [2022-07-09 06:08:51,602][26022] Updated weights on worker 0-0, policy_version 121259 (0.00093) [2022-07-09 06:08:52,121][25689] Fps is (10 sec: 5955.6, 60 sec: 5698.8, 300 sec: 5689.1). Total num frames: 124172288. Throughput: 0: 5981.4. Samples: 124169810. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:52,121][25689] Avg episode reward: [(0, '-55.667')] [2022-07-09 06:08:53,395][26022] Updated weights on worker 0-0, policy_version 121269 (0.00083) [2022-07-09 06:08:55,012][26022] Updated weights on worker 0-0, policy_version 121279 (0.00086) [2022-07-09 06:08:56,859][26022] Updated weights on worker 0-0, policy_version 121289 (0.00085) [2022-07-09 06:08:57,178][25689] Fps is (10 sec: 5686.2, 60 sec: 5678.1, 300 sec: 5684.9). Total num frames: 124200960. Throughput: 0: 5985.1. Samples: 124204530. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:08:57,179][25689] Avg episode reward: [(0, '-55.914')] [2022-07-09 06:08:58,656][26022] Updated weights on worker 0-0, policy_version 121299 (0.00079) [2022-07-09 06:09:00,362][26022] Updated weights on worker 0-0, policy_version 121309 (0.00090) [2022-07-09 06:09:02,211][25689] Fps is (10 sec: 5378.9, 60 sec: 5625.3, 300 sec: 5682.7). Total num frames: 124226560. Throughput: 0: 5893.2. Samples: 124237226. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:09:02,211][25689] Avg episode reward: [(0, '-55.705')] [2022-07-09 06:09:02,683][26022] Updated weights on worker 0-0, policy_version 121319 (0.00082) [2022-07-09 06:09:04,085][26022] Updated weights on worker 0-0, policy_version 121329 (0.00086) [2022-07-09 06:09:06,246][26022] Updated weights on worker 0-0, policy_version 121339 (0.00083) [2022-07-09 06:09:07,279][25689] Fps is (10 sec: 5677.5, 60 sec: 5708.9, 300 sec: 5695.6). Total num frames: 124258304. Throughput: 0: 5914.9. Samples: 124254458. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:09:07,279][25689] Avg episode reward: [(0, '-55.874')] [2022-07-09 06:09:07,852][26022] Updated weights on worker 0-0, policy_version 121349 (0.00084) [2022-07-09 06:09:09,627][26022] Updated weights on worker 0-0, policy_version 121359 (0.00089) [2022-07-09 06:09:11,319][26022] Updated weights on worker 0-0, policy_version 121369 (0.00089) [2022-07-09 06:09:12,292][25689] Fps is (10 sec: 5891.5, 60 sec: 5675.6, 300 sec: 5692.5). Total num frames: 124285952. Throughput: 0: 5929.4. Samples: 124289410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:09:12,292][25689] Avg episode reward: [(0, '-55.989')] [2022-07-09 06:09:13,186][26022] Updated weights on worker 0-0, policy_version 121379 (0.00087) [2022-07-09 06:09:14,882][26022] Updated weights on worker 0-0, policy_version 121389 (0.00084) [2022-07-09 06:09:16,652][26022] Updated weights on worker 0-0, policy_version 121399 (0.00087) [2022-07-09 06:09:17,313][25689] Fps is (10 sec: 5714.9, 60 sec: 5726.4, 300 sec: 5692.2). Total num frames: 124315648. Throughput: 0: 5953.5. Samples: 124324400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:09:17,313][25689] Avg episode reward: [(0, '-57.106')] [2022-07-09 06:09:18,468][26022] Updated weights on worker 0-0, policy_version 121409 (0.00087) [2022-07-09 06:09:20,138][26022] Updated weights on worker 0-0, policy_version 121419 (0.00091) [2022-07-09 06:09:22,008][26022] Updated weights on worker 0-0, policy_version 121429 (0.00079) [2022-07-09 06:09:22,339][25689] Fps is (10 sec: 5809.4, 60 sec: 5727.2, 300 sec: 5696.1). Total num frames: 124344320. Throughput: 0: 5191.3. Samples: 124341716. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:09:22,339][25689] Avg episode reward: [(0, '-56.450')] [2022-07-09 06:09:23,614][26022] Updated weights on worker 0-0, policy_version 121439 (0.00081) [2022-07-09 06:09:25,523][26022] Updated weights on worker 0-0, policy_version 121449 (0.00088) [2022-07-09 06:09:27,153][26022] Updated weights on worker 0-0, policy_version 121459 (0.00108) [2022-07-09 06:09:27,416][25689] Fps is (10 sec: 5878.5, 60 sec: 5724.8, 300 sec: 5701.7). Total num frames: 124375040. Throughput: 0: 6085.1. Samples: 124376996. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 06:09:27,416][25689] Avg episode reward: [(0, '-56.200')] [2022-07-09 06:09:29,100][26022] Updated weights on worker 0-0, policy_version 121469 (0.00622) [2022-07-09 06:09:30,816][26022] Updated weights on worker 0-0, policy_version 121479 (0.00097) [2022-07-09 06:09:32,419][25689] Fps is (10 sec: 5790.2, 60 sec: 5742.6, 300 sec: 5695.1). Total num frames: 124402688. Throughput: 0: 6062.6. Samples: 124411436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:09:32,420][25689] Avg episode reward: [(0, '-55.092')] [2022-07-09 06:09:32,644][26022] Updated weights on worker 0-0, policy_version 121489 (0.00088) [2022-07-09 06:09:34,475][26022] Updated weights on worker 0-0, policy_version 121499 (0.00084) [2022-07-09 06:09:35,907][26022] Updated weights on worker 0-0, policy_version 121509 (0.00081) [2022-07-09 06:09:37,425][25689] Fps is (10 sec: 5627.1, 60 sec: 5725.6, 300 sec: 5695.6). Total num frames: 124431360. Throughput: 0: 5192.9. Samples: 124428840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:09:37,425][25689] Avg episode reward: [(0, '-54.913')] [2022-07-09 06:09:38,024][26022] Updated weights on worker 0-0, policy_version 121519 (0.00092) [2022-07-09 06:09:39,679][26022] Updated weights on worker 0-0, policy_version 121529 (0.00089) [2022-07-09 06:09:41,442][26022] Updated weights on worker 0-0, policy_version 121539 (0.00084) [2022-07-09 06:09:42,434][25689] Fps is (10 sec: 5828.3, 60 sec: 5775.7, 300 sec: 5697.2). Total num frames: 124461056. Throughput: 0: 6064.1. Samples: 124463574. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:09:42,434][25689] Avg episode reward: [(0, '-54.424')] [2022-07-09 06:09:43,423][26022] Updated weights on worker 0-0, policy_version 121549 (0.00089) [2022-07-09 06:09:45,022][26022] Updated weights on worker 0-0, policy_version 121559 (0.00087) [2022-07-09 06:09:46,904][26022] Updated weights on worker 0-0, policy_version 121569 (0.00093) [2022-07-09 06:09:47,482][25689] Fps is (10 sec: 5905.1, 60 sec: 5748.3, 300 sec: 5697.5). Total num frames: 124490752. Throughput: 0: 6038.8. Samples: 124498172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:09:47,483][25689] Avg episode reward: [(0, '-53.818')] [2022-07-09 06:09:48,514][26022] Updated weights on worker 0-0, policy_version 121579 (0.00083) [2022-07-09 06:09:50,150][26022] Updated weights on worker 0-0, policy_version 121589 (0.00097) [2022-07-09 06:09:52,093][26022] Updated weights on worker 0-0, policy_version 121599 (0.00094) [2022-07-09 06:09:52,492][25689] Fps is (10 sec: 5701.4, 60 sec: 5733.1, 300 sec: 5698.0). Total num frames: 124518400. Throughput: 0: 5187.3. Samples: 124515558. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:09:52,492][25689] Avg episode reward: [(0, '-53.743')] [2022-07-09 06:09:53,883][26022] Updated weights on worker 0-0, policy_version 121609 (0.00086) [2022-07-09 06:09:55,635][26022] Updated weights on worker 0-0, policy_version 121619 (0.00085) [2022-07-09 06:09:57,323][26022] Updated weights on worker 0-0, policy_version 121629 (0.00090) [2022-07-09 06:09:57,521][25689] Fps is (10 sec: 5814.4, 60 sec: 5769.7, 300 sec: 5709.0). Total num frames: 124549120. Throughput: 0: 6065.2. Samples: 124550728. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:09:57,522][25689] Avg episode reward: [(0, '-54.115')] [2022-07-09 06:09:58,964][26022] Updated weights on worker 0-0, policy_version 121639 (0.00093) [2022-07-09 06:10:01,013][26022] Updated weights on worker 0-0, policy_version 121649 (0.00093) [2022-07-09 06:10:02,524][25689] Fps is (10 sec: 5613.9, 60 sec: 5772.5, 300 sec: 5699.8). Total num frames: 124574720. Throughput: 0: 5972.1. Samples: 124583554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:02,526][25689] Avg episode reward: [(0, '-53.930')] [2022-07-09 06:10:02,979][26022] Updated weights on worker 0-0, policy_version 121659 (0.00089) [2022-07-09 06:10:04,875][26022] Updated weights on worker 0-0, policy_version 121669 (0.00090) [2022-07-09 06:10:06,353][26022] Updated weights on worker 0-0, policy_version 121679 (0.00085) [2022-07-09 06:10:07,630][25689] Fps is (10 sec: 5469.8, 60 sec: 5734.9, 300 sec: 5702.0). Total num frames: 124604416. Throughput: 0: 5103.1. Samples: 124600988. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:07,631][25689] Avg episode reward: [(0, '-52.944')] [2022-07-09 06:10:08,263][26022] Updated weights on worker 0-0, policy_version 121689 (0.00084) [2022-07-09 06:10:09,978][26022] Updated weights on worker 0-0, policy_version 121699 (0.00607) [2022-07-09 06:10:11,783][26022] Updated weights on worker 0-0, policy_version 121709 (0.00086) [2022-07-09 06:10:12,639][25689] Fps is (10 sec: 5973.0, 60 sec: 5786.2, 300 sec: 5709.1). Total num frames: 124635136. Throughput: 0: 5991.4. Samples: 124636268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:12,641][25689] Avg episode reward: [(0, '-53.635')] [2022-07-09 06:10:13,545][26022] Updated weights on worker 0-0, policy_version 121719 (0.00084) [2022-07-09 06:10:15,291][26022] Updated weights on worker 0-0, policy_version 121729 (0.00087) [2022-07-09 06:10:17,175][26022] Updated weights on worker 0-0, policy_version 121739 (0.00085) [2022-07-09 06:10:17,701][25689] Fps is (10 sec: 5795.9, 60 sec: 5748.4, 300 sec: 5705.7). Total num frames: 124662784. Throughput: 0: 5972.3. Samples: 124671248. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:17,702][25689] Avg episode reward: [(0, '-53.726')] [2022-07-09 06:10:18,723][26022] Updated weights on worker 0-0, policy_version 121749 (0.00082) [2022-07-09 06:10:20,793][26022] Updated weights on worker 0-0, policy_version 121759 (0.00087) [2022-07-09 06:10:22,238][26022] Updated weights on worker 0-0, policy_version 121769 (0.00083) [2022-07-09 06:10:22,706][25689] Fps is (10 sec: 5899.3, 60 sec: 5801.2, 300 sec: 5714.0). Total num frames: 124694528. Throughput: 0: 5215.8. Samples: 124688822. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:22,708][25689] Avg episode reward: [(0, '-53.808')] [2022-07-09 06:10:24,155][26022] Updated weights on worker 0-0, policy_version 121779 (0.00084) [2022-07-09 06:10:25,802][26022] Updated weights on worker 0-0, policy_version 121789 (0.00092) [2022-07-09 06:10:27,693][26022] Updated weights on worker 0-0, policy_version 121799 (0.00089) [2022-07-09 06:10:27,854][25689] Fps is (10 sec: 5849.7, 60 sec: 5743.7, 300 sec: 5709.3). Total num frames: 124722176. Throughput: 0: 6063.2. Samples: 124723606. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:27,856][25689] Avg episode reward: [(0, '-53.634')] [2022-07-09 06:10:29,312][26022] Updated weights on worker 0-0, policy_version 121809 (0.00084) [2022-07-09 06:10:31,228][26022] Updated weights on worker 0-0, policy_version 121819 (0.00096) [2022-07-09 06:10:31,496][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:10:31,505][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000121821_124744704.pth [2022-07-09 06:10:31,505][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000119810_122685440.pth [2022-07-09 06:10:32,931][25689] Fps is (10 sec: 5608.5, 60 sec: 5770.5, 300 sec: 5708.9). Total num frames: 124751872. Throughput: 0: 6013.2. Samples: 124758288. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:32,931][25689] Avg episode reward: [(0, '-53.719')] [2022-07-09 06:10:32,966][26022] Updated weights on worker 0-0, policy_version 121829 (0.00085) [2022-07-09 06:10:34,869][26022] Updated weights on worker 0-0, policy_version 121839 (0.00086) [2022-07-09 06:10:36,495][26022] Updated weights on worker 0-0, policy_version 121849 (0.00095) [2022-07-09 06:10:38,017][25689] Fps is (10 sec: 5843.5, 60 sec: 5779.7, 300 sec: 5707.7). Total num frames: 124781568. Throughput: 0: 5141.8. Samples: 124775712. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:38,018][25689] Avg episode reward: [(0, '-54.122')] [2022-07-09 06:10:38,344][26022] Updated weights on worker 0-0, policy_version 121859 (0.00096) [2022-07-09 06:10:39,957][26022] Updated weights on worker 0-0, policy_version 121869 (0.00051) [2022-07-09 06:10:42,122][26022] Updated weights on worker 0-0, policy_version 121880 (0.00094) [2022-07-09 06:10:43,097][25689] Fps is (10 sec: 5842.1, 60 sec: 5773.0, 300 sec: 5714.1). Total num frames: 124811264. Throughput: 0: 5978.0. Samples: 124810718. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:43,098][25689] Avg episode reward: [(0, '-54.587')] [2022-07-09 06:10:43,762][26022] Updated weights on worker 0-0, policy_version 121890 (0.00087) [2022-07-09 06:10:45,536][26022] Updated weights on worker 0-0, policy_version 121900 (0.00090) [2022-07-09 06:10:47,319][26022] Updated weights on worker 0-0, policy_version 121910 (0.00098) [2022-07-09 06:10:48,182][25689] Fps is (10 sec: 5843.2, 60 sec: 5769.5, 300 sec: 5712.5). Total num frames: 124840960. Throughput: 0: 6010.4. Samples: 124845786. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:48,182][25689] Avg episode reward: [(0, '-55.401')] [2022-07-09 06:10:48,957][26022] Updated weights on worker 0-0, policy_version 121920 (0.00086) [2022-07-09 06:10:50,839][26022] Updated weights on worker 0-0, policy_version 121930 (0.00097) [2022-07-09 06:10:52,441][26022] Updated weights on worker 0-0, policy_version 121940 (0.00098) [2022-07-09 06:10:53,231][25689] Fps is (10 sec: 5759.5, 60 sec: 5782.6, 300 sec: 5712.3). Total num frames: 124869632. Throughput: 0: 5169.3. Samples: 124863228. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:53,232][25689] Avg episode reward: [(0, '-55.547')] [2022-07-09 06:10:54,368][26022] Updated weights on worker 0-0, policy_version 121950 (0.00090) [2022-07-09 06:10:56,178][26022] Updated weights on worker 0-0, policy_version 121960 (0.00091) [2022-07-09 06:10:57,987][26022] Updated weights on worker 0-0, policy_version 121970 (0.00084) [2022-07-09 06:10:58,238][25689] Fps is (10 sec: 5702.1, 60 sec: 5751.0, 300 sec: 5712.3). Total num frames: 124898304. Throughput: 0: 6022.7. Samples: 124897496. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:10:58,239][25689] Avg episode reward: [(0, '-56.256')] [2022-07-09 06:10:59,631][26022] Updated weights on worker 0-0, policy_version 121980 (0.00082) [2022-07-09 06:11:01,266][26022] Updated weights on worker 0-0, policy_version 121990 (0.00084) [2022-07-09 06:11:03,243][25689] Fps is (10 sec: 5625.7, 60 sec: 5784.6, 300 sec: 5719.6). Total num frames: 124925952. Throughput: 0: 5991.4. Samples: 124931418. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:11:03,243][25689] Avg episode reward: [(0, '-56.356')] [2022-07-09 06:11:03,507][26022] Updated weights on worker 0-0, policy_version 122000 (0.00086) [2022-07-09 06:11:05,606][26022] Updated weights on worker 0-0, policy_version 122010 (0.00086) [2022-07-09 06:11:07,216][26022] Updated weights on worker 0-0, policy_version 122020 (0.00090) [2022-07-09 06:11:08,328][25689] Fps is (10 sec: 5582.0, 60 sec: 5769.7, 300 sec: 5721.8). Total num frames: 124954624. Throughput: 0: 5072.0. Samples: 124947966. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:11:08,328][25689] Avg episode reward: [(0, '-56.125')] [2022-07-09 06:11:08,948][26022] Updated weights on worker 0-0, policy_version 122030 (0.00086) [2022-07-09 06:11:10,553][26022] Updated weights on worker 0-0, policy_version 122040 (0.00080) [2022-07-09 06:11:12,419][26022] Updated weights on worker 0-0, policy_version 122050 (0.00084) [2022-07-09 06:11:13,352][25689] Fps is (10 sec: 5874.7, 60 sec: 5768.2, 300 sec: 5721.5). Total num frames: 124985344. Throughput: 0: 5947.2. Samples: 124982892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:11:13,353][25689] Avg episode reward: [(0, '-55.558')] [2022-07-09 06:11:14,199][26022] Updated weights on worker 0-0, policy_version 122060 (0.00086) [2022-07-09 06:11:15,884][26022] Updated weights on worker 0-0, policy_version 122070 (0.00083) [2022-07-09 06:11:17,714][26022] Updated weights on worker 0-0, policy_version 122080 (0.00090) [2022-07-09 06:11:18,375][25689] Fps is (10 sec: 5809.4, 60 sec: 5771.9, 300 sec: 5724.6). Total num frames: 125012992. Throughput: 0: 5983.0. Samples: 125017974. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:11:18,375][25689] Avg episode reward: [(0, '-55.978')] [2022-07-09 06:11:19,355][26022] Updated weights on worker 0-0, policy_version 122090 (0.00078) [2022-07-09 06:11:21,327][26022] Updated weights on worker 0-0, policy_version 122100 (0.00085) [2022-07-09 06:11:22,903][26022] Updated weights on worker 0-0, policy_version 122110 (0.00079) [2022-07-09 06:11:23,402][25689] Fps is (10 sec: 5706.1, 60 sec: 5736.2, 300 sec: 5730.2). Total num frames: 125042688. Throughput: 0: 5152.4. Samples: 125035286. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:11:23,403][25689] Avg episode reward: [(0, '-56.173')] [2022-07-09 06:11:24,658][26022] Updated weights on worker 0-0, policy_version 122120 (0.00083) [2022-07-09 06:11:26,643][26022] Updated weights on worker 0-0, policy_version 122130 (0.00083) [2022-07-09 06:11:28,294][26022] Updated weights on worker 0-0, policy_version 122140 (0.00082) [2022-07-09 06:11:28,496][25689] Fps is (10 sec: 5868.1, 60 sec: 5775.0, 300 sec: 5725.5). Total num frames: 125072384. Throughput: 0: 6059.6. Samples: 125070178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:11:28,497][25689] Avg episode reward: [(0, '-55.148')] [2022-07-09 06:11:30,280][26022] Updated weights on worker 0-0, policy_version 122150 (0.00089) [2022-07-09 06:11:31,694][26022] Updated weights on worker 0-0, policy_version 122160 (0.00079) [2022-07-09 06:11:33,519][25689] Fps is (10 sec: 5668.3, 60 sec: 5746.4, 300 sec: 5722.9). Total num frames: 125100032. Throughput: 0: 6056.0. Samples: 125105018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:11:33,519][25689] Avg episode reward: [(0, '-54.890')] [2022-07-09 06:11:33,650][26022] Updated weights on worker 0-0, policy_version 122170 (0.00089) [2022-07-09 06:11:35,113][26022] Updated weights on worker 0-0, policy_version 122180 (0.00093) [2022-07-09 06:11:37,083][26022] Updated weights on worker 0-0, policy_version 122190 (0.00094) [2022-07-09 06:11:38,559][25689] Fps is (10 sec: 5698.8, 60 sec: 5750.8, 300 sec: 5722.2). Total num frames: 125129728. Throughput: 0: 6022.3. Samples: 125139526. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:11:38,559][25689] Avg episode reward: [(0, '-55.301')] [2022-07-09 06:11:39,003][26022] Updated weights on worker 0-0, policy_version 122200 (0.00093) [2022-07-09 06:11:40,724][26022] Updated weights on worker 0-0, policy_version 122210 (0.00090) [2022-07-09 06:11:42,486][26022] Updated weights on worker 0-0, policy_version 122220 (0.00090) [2022-07-09 06:11:43,560][25689] Fps is (10 sec: 5915.0, 60 sec: 5758.3, 300 sec: 5730.9). Total num frames: 125159424. Throughput: 0: 6040.9. Samples: 125157056. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:11:43,560][25689] Avg episode reward: [(0, '-55.145')] [2022-07-09 06:11:44,227][26022] Updated weights on worker 0-0, policy_version 122230 (0.00082) [2022-07-09 06:11:45,863][26022] Updated weights on worker 0-0, policy_version 122240 (0.00096) [2022-07-09 06:11:47,690][26022] Updated weights on worker 0-0, policy_version 122250 (0.00090) [2022-07-09 06:11:48,676][25689] Fps is (10 sec: 5870.1, 60 sec: 5755.2, 300 sec: 5732.8). Total num frames: 125189120. Throughput: 0: 6046.7. Samples: 125192202. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:11:48,677][25689] Avg episode reward: [(0, '-55.298')] [2022-07-09 06:11:49,259][26022] Updated weights on worker 0-0, policy_version 122260 (0.00093) [2022-07-09 06:11:51,311][26022] Updated weights on worker 0-0, policy_version 122270 (0.00079) [2022-07-09 06:11:52,834][26022] Updated weights on worker 0-0, policy_version 122280 (0.00097) [2022-07-09 06:11:53,704][25689] Fps is (10 sec: 5753.5, 60 sec: 5757.3, 300 sec: 5732.4). Total num frames: 125217792. Throughput: 0: 6051.9. Samples: 125227180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:11:53,705][25689] Avg episode reward: [(0, '-55.864')] [2022-07-09 06:11:54,879][26022] Updated weights on worker 0-0, policy_version 122290 (0.00083) [2022-07-09 06:11:56,393][26022] Updated weights on worker 0-0, policy_version 122300 (0.00089) [2022-07-09 06:11:58,379][26022] Updated weights on worker 0-0, policy_version 122310 (0.00086) [2022-07-09 06:11:58,737][25689] Fps is (10 sec: 5801.4, 60 sec: 5771.7, 300 sec: 5735.7). Total num frames: 125247488. Throughput: 0: 5198.4. Samples: 125244422. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:11:58,738][25689] Avg episode reward: [(0, '-56.528')] [2022-07-09 06:12:00,055][26022] Updated weights on worker 0-0, policy_version 122320 (0.00093) [2022-07-09 06:12:02,171][26022] Updated weights on worker 0-0, policy_version 122330 (0.00085) [2022-07-09 06:12:03,769][25689] Fps is (10 sec: 5697.6, 60 sec: 5769.1, 300 sec: 5733.5). Total num frames: 125275136. Throughput: 0: 5947.5. Samples: 125277250. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:03,769][25689] Avg episode reward: [(0, '-56.780')] [2022-07-09 06:12:03,803][26022] Updated weights on worker 0-0, policy_version 122340 (0.00085) [2022-07-09 06:12:05,884][26022] Updated weights on worker 0-0, policy_version 122350 (0.00093) [2022-07-09 06:12:07,639][26022] Updated weights on worker 0-0, policy_version 122360 (0.00082) [2022-07-09 06:12:08,858][25689] Fps is (10 sec: 5564.7, 60 sec: 5768.7, 300 sec: 5738.7). Total num frames: 125303808. Throughput: 0: 5896.3. Samples: 125311200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:08,859][25689] Avg episode reward: [(0, '-56.803')] [2022-07-09 06:12:09,660][26022] Updated weights on worker 0-0, policy_version 122370 (0.00081) [2022-07-09 06:12:11,030][26022] Updated weights on worker 0-0, policy_version 122380 (0.00100) [2022-07-09 06:12:13,115][26022] Updated weights on worker 0-0, policy_version 122390 (0.00088) [2022-07-09 06:12:13,933][25689] Fps is (10 sec: 5642.0, 60 sec: 5730.2, 300 sec: 5737.6). Total num frames: 125332480. Throughput: 0: 5010.6. Samples: 125328536. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:13,933][25689] Avg episode reward: [(0, '-56.689')] [2022-07-09 06:12:14,627][26022] Updated weights on worker 0-0, policy_version 122400 (0.00086) [2022-07-09 06:12:16,680][26022] Updated weights on worker 0-0, policy_version 122410 (0.00086) [2022-07-09 06:12:18,180][26022] Updated weights on worker 0-0, policy_version 122420 (0.00085) [2022-07-09 06:12:18,954][25689] Fps is (10 sec: 5781.6, 60 sec: 5764.1, 300 sec: 5733.8). Total num frames: 125362176. Throughput: 0: 5884.1. Samples: 125363378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:18,954][25689] Avg episode reward: [(0, '-56.895')] [2022-07-09 06:12:20,011][26022] Updated weights on worker 0-0, policy_version 122430 (0.00087) [2022-07-09 06:12:21,801][26022] Updated weights on worker 0-0, policy_version 122440 (0.00088) [2022-07-09 06:12:23,671][26022] Updated weights on worker 0-0, policy_version 122450 (0.00096) [2022-07-09 06:12:23,967][25689] Fps is (10 sec: 5817.0, 60 sec: 5748.6, 300 sec: 5742.8). Total num frames: 125390848. Throughput: 0: 5990.4. Samples: 125398242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:23,967][25689] Avg episode reward: [(0, '-56.160')] [2022-07-09 06:12:25,380][26022] Updated weights on worker 0-0, policy_version 122460 (0.00086) [2022-07-09 06:12:27,197][26022] Updated weights on worker 0-0, policy_version 122470 (0.00091) [2022-07-09 06:12:28,817][26022] Updated weights on worker 0-0, policy_version 122480 (0.00091) [2022-07-09 06:12:29,071][25689] Fps is (10 sec: 5870.5, 60 sec: 5764.5, 300 sec: 5745.6). Total num frames: 125421568. Throughput: 0: 5154.3. Samples: 125415382. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:29,071][25689] Avg episode reward: [(0, '-56.031')] [2022-07-09 06:12:30,688][26022] Updated weights on worker 0-0, policy_version 122490 (0.00088) [2022-07-09 06:12:31,623][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:12:31,634][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000122495_125434880.pth [2022-07-09 06:12:31,634][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000120478_123369472.pth [2022-07-09 06:12:32,336][26022] Updated weights on worker 0-0, policy_version 122500 (0.00090) [2022-07-09 06:12:34,057][26022] Updated weights on worker 0-0, policy_version 122510 (0.00611) [2022-07-09 06:12:34,079][25689] Fps is (10 sec: 5873.2, 60 sec: 5782.8, 300 sec: 5742.2). Total num frames: 125450240. Throughput: 0: 6028.6. Samples: 125449990. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:34,080][25689] Avg episode reward: [(0, '-56.221')] [2022-07-09 06:12:36,060][26022] Updated weights on worker 0-0, policy_version 122520 (0.00081) [2022-07-09 06:12:37,995][26022] Updated weights on worker 0-0, policy_version 122530 (0.00086) [2022-07-09 06:12:39,090][25689] Fps is (10 sec: 5518.7, 60 sec: 5734.8, 300 sec: 5738.7). Total num frames: 125476864. Throughput: 0: 6026.2. Samples: 125484726. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:39,091][25689] Avg episode reward: [(0, '-56.333')] [2022-07-09 06:12:39,669][26022] Updated weights on worker 0-0, policy_version 122540 (0.00093) [2022-07-09 06:12:41,200][26022] Updated weights on worker 0-0, policy_version 122550 (0.00088) [2022-07-09 06:12:43,034][26022] Updated weights on worker 0-0, policy_version 122560 (0.00083) [2022-07-09 06:12:44,118][25689] Fps is (10 sec: 5712.2, 60 sec: 5749.2, 300 sec: 5743.1). Total num frames: 125507584. Throughput: 0: 5157.5. Samples: 125502172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:44,118][25689] Avg episode reward: [(0, '-57.318')] [2022-07-09 06:12:44,979][26022] Updated weights on worker 0-0, policy_version 122570 (0.00075) [2022-07-09 06:12:46,599][26022] Updated weights on worker 0-0, policy_version 122580 (0.00093) [2022-07-09 06:12:48,343][26022] Updated weights on worker 0-0, policy_version 122590 (0.00084) [2022-07-09 06:12:49,226][25689] Fps is (10 sec: 5759.0, 60 sec: 5716.2, 300 sec: 5738.2). Total num frames: 125535232. Throughput: 0: 6032.7. Samples: 125536970. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:49,226][25689] Avg episode reward: [(0, '-56.779')] [2022-07-09 06:12:50,107][26022] Updated weights on worker 0-0, policy_version 122600 (0.00084) [2022-07-09 06:12:51,972][26022] Updated weights on worker 0-0, policy_version 122610 (0.00084) [2022-07-09 06:12:53,484][26022] Updated weights on worker 0-0, policy_version 122620 (0.00083) [2022-07-09 06:12:54,282][25689] Fps is (10 sec: 5843.3, 60 sec: 5764.2, 300 sec: 5744.4). Total num frames: 125566976. Throughput: 0: 6044.1. Samples: 125572098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:54,283][25689] Avg episode reward: [(0, '-56.912')] [2022-07-09 06:12:55,514][26022] Updated weights on worker 0-0, policy_version 122630 (0.00080) [2022-07-09 06:12:56,911][26022] Updated weights on worker 0-0, policy_version 122640 (0.00082) [2022-07-09 06:12:59,059][26022] Updated weights on worker 0-0, policy_version 122650 (0.00088) [2022-07-09 06:12:59,351][25689] Fps is (10 sec: 5966.8, 60 sec: 5743.9, 300 sec: 5743.3). Total num frames: 125595648. Throughput: 0: 5175.2. Samples: 125589578. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:12:59,351][25689] Avg episode reward: [(0, '-56.634')] [2022-07-09 06:13:00,484][26022] Updated weights on worker 0-0, policy_version 122660 (0.00085) [2022-07-09 06:13:02,880][26022] Updated weights on worker 0-0, policy_version 122670 (0.00094) [2022-07-09 06:13:04,409][25689] Fps is (10 sec: 5561.6, 60 sec: 5741.5, 300 sec: 5746.7). Total num frames: 125623296. Throughput: 0: 5927.2. Samples: 125622438. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:13:04,409][25689] Avg episode reward: [(0, '-56.134')] [2022-07-09 06:13:04,473][26022] Updated weights on worker 0-0, policy_version 122680 (0.00088) [2022-07-09 06:13:06,473][26022] Updated weights on worker 0-0, policy_version 122690 (0.00102) [2022-07-09 06:13:08,129][26022] Updated weights on worker 0-0, policy_version 122700 (0.00089) [2022-07-09 06:13:09,472][25689] Fps is (10 sec: 5463.7, 60 sec: 5727.0, 300 sec: 5738.9). Total num frames: 125650944. Throughput: 0: 5922.0. Samples: 125656866. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:13:09,472][25689] Avg episode reward: [(0, '-55.944')] [2022-07-09 06:13:10,166][26022] Updated weights on worker 0-0, policy_version 122710 (0.00086) [2022-07-09 06:13:11,526][26022] Updated weights on worker 0-0, policy_version 122720 (0.00087) [2022-07-09 06:13:13,579][26022] Updated weights on worker 0-0, policy_version 122730 (0.00089) [2022-07-09 06:13:14,503][25689] Fps is (10 sec: 5883.7, 60 sec: 5781.9, 300 sec: 5756.0). Total num frames: 125682688. Throughput: 0: 5056.5. Samples: 125674344. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:13:14,503][25689] Avg episode reward: [(0, '-55.847')] [2022-07-09 06:13:14,990][26022] Updated weights on worker 0-0, policy_version 122740 (0.00102) [2022-07-09 06:13:17,051][26022] Updated weights on worker 0-0, policy_version 122750 (0.00093) [2022-07-09 06:13:18,728][26022] Updated weights on worker 0-0, policy_version 122760 (0.00093) [2022-07-09 06:13:19,508][25689] Fps is (10 sec: 5815.7, 60 sec: 5732.7, 300 sec: 5749.6). Total num frames: 125709312. Throughput: 0: 5932.9. Samples: 125709166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:13:19,508][25689] Avg episode reward: [(0, '-56.530')] [2022-07-09 06:13:20,632][26022] Updated weights on worker 0-0, policy_version 122770 (0.00088) [2022-07-09 06:13:22,277][26022] Updated weights on worker 0-0, policy_version 122780 (0.00085) [2022-07-09 06:13:24,152][26022] Updated weights on worker 0-0, policy_version 122790 (0.00095) [2022-07-09 06:13:24,553][25689] Fps is (10 sec: 5502.0, 60 sec: 5729.6, 300 sec: 5742.9). Total num frames: 125737984. Throughput: 0: 6006.0. Samples: 125743426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:13:24,554][25689] Avg episode reward: [(0, '-56.389')] [2022-07-09 06:13:26,031][26022] Updated weights on worker 0-0, policy_version 122800 (0.00098) [2022-07-09 06:13:27,819][26022] Updated weights on worker 0-0, policy_version 122810 (0.00079) [2022-07-09 06:13:29,478][26022] Updated weights on worker 0-0, policy_version 122820 (0.00086) [2022-07-09 06:13:29,673][25689] Fps is (10 sec: 5741.9, 60 sec: 5711.2, 300 sec: 5751.1). Total num frames: 125767680. Throughput: 0: 5129.4. Samples: 125760488. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 06:13:29,674][25689] Avg episode reward: [(0, '-55.902')] [2022-07-09 06:13:31,516][26022] Updated weights on worker 0-0, policy_version 122830 (0.00084) [2022-07-09 06:13:33,097][26022] Updated weights on worker 0-0, policy_version 122840 (0.00081) [2022-07-09 06:13:34,676][25689] Fps is (10 sec: 5766.1, 60 sec: 5711.7, 300 sec: 5747.7). Total num frames: 125796352. Throughput: 0: 5998.6. Samples: 125795352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:13:34,676][25689] Avg episode reward: [(0, '-56.193')] [2022-07-09 06:13:34,930][26022] Updated weights on worker 0-0, policy_version 122850 (0.00084) [2022-07-09 06:13:36,503][26022] Updated weights on worker 0-0, policy_version 122860 (0.00091) [2022-07-09 06:13:38,449][26022] Updated weights on worker 0-0, policy_version 122870 (0.00094) [2022-07-09 06:13:39,713][25689] Fps is (10 sec: 5915.5, 60 sec: 5776.9, 300 sec: 5760.8). Total num frames: 125827072. Throughput: 0: 5986.7. Samples: 125830130. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:13:39,717][25689] Avg episode reward: [(0, '-55.615')] [2022-07-09 06:13:40,156][26022] Updated weights on worker 0-0, policy_version 122880 (0.00091) [2022-07-09 06:13:42,044][26022] Updated weights on worker 0-0, policy_version 122890 (0.00091) [2022-07-09 06:13:43,833][26022] Updated weights on worker 0-0, policy_version 122900 (0.00086) [2022-07-09 06:13:44,747][25689] Fps is (10 sec: 5795.5, 60 sec: 5725.6, 300 sec: 5748.6). Total num frames: 125854720. Throughput: 0: 6036.9. Samples: 125865334. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:13:44,747][25689] Avg episode reward: [(0, '-55.046')] [2022-07-09 06:13:45,447][26022] Updated weights on worker 0-0, policy_version 122910 (0.00086) [2022-07-09 06:13:47,345][26022] Updated weights on worker 0-0, policy_version 122920 (0.00094) [2022-07-09 06:13:48,872][26022] Updated weights on worker 0-0, policy_version 122930 (0.00089) [2022-07-09 06:13:49,789][25689] Fps is (10 sec: 5792.9, 60 sec: 5782.5, 300 sec: 5755.2). Total num frames: 125885440. Throughput: 0: 6076.2. Samples: 125882716. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:13:49,789][25689] Avg episode reward: [(0, '-55.471')] [2022-07-09 06:13:50,780][26022] Updated weights on worker 0-0, policy_version 122940 (0.00093) [2022-07-09 06:13:52,546][26022] Updated weights on worker 0-0, policy_version 122950 (0.00081) [2022-07-09 06:13:54,422][26022] Updated weights on worker 0-0, policy_version 122960 (0.00084) [2022-07-09 06:13:54,820][25689] Fps is (10 sec: 5896.0, 60 sec: 5734.2, 300 sec: 5755.8). Total num frames: 125914112. Throughput: 0: 6044.4. Samples: 125917114. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:13:54,821][25689] Avg episode reward: [(0, '-54.824')] [2022-07-09 06:13:56,152][26022] Updated weights on worker 0-0, policy_version 122970 (0.00089) [2022-07-09 06:13:57,894][26022] Updated weights on worker 0-0, policy_version 122980 (0.00085) [2022-07-09 06:13:59,688][26022] Updated weights on worker 0-0, policy_version 122990 (0.00091) [2022-07-09 06:13:59,823][25689] Fps is (10 sec: 5817.1, 60 sec: 5757.4, 300 sec: 5770.1). Total num frames: 125943808. Throughput: 0: 6070.2. Samples: 125952200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:13:59,823][25689] Avg episode reward: [(0, '-54.951')] [2022-07-09 06:14:01,410][26022] Updated weights on worker 0-0, policy_version 123000 (0.00093) [2022-07-09 06:14:03,468][26022] Updated weights on worker 0-0, policy_version 123010 (0.00092) [2022-07-09 06:14:04,842][25689] Fps is (10 sec: 5517.6, 60 sec: 5727.2, 300 sec: 5750.4). Total num frames: 125969408. Throughput: 0: 5068.9. Samples: 125967194. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:04,842][25689] Avg episode reward: [(0, '-55.823')] [2022-07-09 06:14:05,486][26022] Updated weights on worker 0-0, policy_version 123020 (0.00089) [2022-07-09 06:14:06,904][26022] Updated weights on worker 0-0, policy_version 123030 (0.00082) [2022-07-09 06:14:09,163][26022] Updated weights on worker 0-0, policy_version 123040 (0.00093) [2022-07-09 06:14:09,902][25689] Fps is (10 sec: 5384.6, 60 sec: 5744.4, 300 sec: 5752.9). Total num frames: 125998080. Throughput: 0: 5903.5. Samples: 126001452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:09,902][25689] Avg episode reward: [(0, '-56.092')] [2022-07-09 06:14:10,637][26022] Updated weights on worker 0-0, policy_version 123050 (0.00092) [2022-07-09 06:14:12,524][26022] Updated weights on worker 0-0, policy_version 123060 (0.00090) [2022-07-09 06:14:14,107][26022] Updated weights on worker 0-0, policy_version 123070 (0.00084) [2022-07-09 06:14:14,912][25689] Fps is (10 sec: 5796.3, 60 sec: 5712.5, 300 sec: 5753.1). Total num frames: 126027776. Throughput: 0: 5954.1. Samples: 126036742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:14,913][25689] Avg episode reward: [(0, '-57.380')] [2022-07-09 06:14:15,895][26022] Updated weights on worker 0-0, policy_version 123080 (0.00081) [2022-07-09 06:14:17,644][26022] Updated weights on worker 0-0, policy_version 123090 (0.00088) [2022-07-09 06:14:19,344][26022] Updated weights on worker 0-0, policy_version 123100 (0.00094) [2022-07-09 06:14:19,930][25689] Fps is (10 sec: 5820.2, 60 sec: 5745.1, 300 sec: 5753.3). Total num frames: 126056448. Throughput: 0: 5084.4. Samples: 126054436. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:19,931][25689] Avg episode reward: [(0, '-56.665')] [2022-07-09 06:14:21,167][26022] Updated weights on worker 0-0, policy_version 123110 (0.00091) [2022-07-09 06:14:22,915][26022] Updated weights on worker 0-0, policy_version 123120 (0.00087) [2022-07-09 06:14:24,536][26022] Updated weights on worker 0-0, policy_version 123130 (0.00100) [2022-07-09 06:14:24,989][25689] Fps is (10 sec: 5995.3, 60 sec: 5794.7, 300 sec: 5757.1). Total num frames: 126088192. Throughput: 0: 6093.3. Samples: 126089956. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:24,990][25689] Avg episode reward: [(0, '-56.342')] [2022-07-09 06:14:26,443][26022] Updated weights on worker 0-0, policy_version 123140 (0.00100) [2022-07-09 06:14:28,051][26022] Updated weights on worker 0-0, policy_version 123150 (0.00086) [2022-07-09 06:14:29,820][26022] Updated weights on worker 0-0, policy_version 123160 (0.00091) [2022-07-09 06:14:30,051][25689] Fps is (10 sec: 5868.5, 60 sec: 5766.4, 300 sec: 5755.9). Total num frames: 126115840. Throughput: 0: 6122.5. Samples: 126124814. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:30,051][25689] Avg episode reward: [(0, '-56.992')] [2022-07-09 06:14:31,546][26022] Updated weights on worker 0-0, policy_version 123170 (0.00172) [2022-07-09 06:14:31,742][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:14:31,752][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000123171_126127104.pth [2022-07-09 06:14:31,763][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000121144_124051456.pth [2022-07-09 06:14:33,414][26022] Updated weights on worker 0-0, policy_version 123180 (0.00084) [2022-07-09 06:14:35,080][25689] Fps is (10 sec: 5784.0, 60 sec: 5797.7, 300 sec: 5762.4). Total num frames: 126146560. Throughput: 0: 5242.8. Samples: 126142482. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:35,081][25689] Avg episode reward: [(0, '-56.048')] [2022-07-09 06:14:35,087][26022] Updated weights on worker 0-0, policy_version 123190 (0.00097) [2022-07-09 06:14:36,942][26022] Updated weights on worker 0-0, policy_version 123200 (0.00081) [2022-07-09 06:14:38,513][26022] Updated weights on worker 0-0, policy_version 123210 (0.00094) [2022-07-09 06:14:40,166][25689] Fps is (10 sec: 5669.2, 60 sec: 5725.3, 300 sec: 5750.6). Total num frames: 126173184. Throughput: 0: 6066.7. Samples: 126177198. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:40,166][25689] Avg episode reward: [(0, '-55.581')] [2022-07-09 06:14:40,539][26022] Updated weights on worker 0-0, policy_version 123220 (0.00080) [2022-07-09 06:14:41,952][26022] Updated weights on worker 0-0, policy_version 123230 (0.00081) [2022-07-09 06:14:43,841][26022] Updated weights on worker 0-0, policy_version 123240 (0.00081) [2022-07-09 06:14:45,167][25689] Fps is (10 sec: 5786.3, 60 sec: 5796.2, 300 sec: 5758.4). Total num frames: 126204928. Throughput: 0: 6078.2. Samples: 126212606. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:45,168][25689] Avg episode reward: [(0, '-56.311')] [2022-07-09 06:14:45,996][26022] Updated weights on worker 0-0, policy_version 123250 (0.01102) [2022-07-09 06:14:47,267][26022] Updated weights on worker 0-0, policy_version 123260 (0.00089) [2022-07-09 06:14:49,263][26022] Updated weights on worker 0-0, policy_version 123270 (0.00087) [2022-07-09 06:14:50,273][25689] Fps is (10 sec: 6180.1, 60 sec: 5790.1, 300 sec: 5766.8). Total num frames: 126235648. Throughput: 0: 5209.9. Samples: 126230170. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:50,274][25689] Avg episode reward: [(0, '-56.670')] [2022-07-09 06:14:50,831][26022] Updated weights on worker 0-0, policy_version 123280 (0.00086) [2022-07-09 06:14:52,594][26022] Updated weights on worker 0-0, policy_version 123290 (0.00084) [2022-07-09 06:14:54,530][26022] Updated weights on worker 0-0, policy_version 123300 (0.00093) [2022-07-09 06:14:55,356][25689] Fps is (10 sec: 5829.2, 60 sec: 5785.1, 300 sec: 5758.9). Total num frames: 126264320. Throughput: 0: 6067.5. Samples: 126265506. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:14:55,357][25689] Avg episode reward: [(0, '-57.005')] [2022-07-09 06:14:56,055][26022] Updated weights on worker 0-0, policy_version 123310 (0.00091) [2022-07-09 06:14:57,942][26022] Updated weights on worker 0-0, policy_version 123320 (0.00085) [2022-07-09 06:14:59,647][26022] Updated weights on worker 0-0, policy_version 123330 (0.00088) [2022-07-09 06:15:00,370][25689] Fps is (10 sec: 5780.5, 60 sec: 5784.0, 300 sec: 5772.4). Total num frames: 126294016. Throughput: 0: 6102.1. Samples: 126300488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:00,371][25689] Avg episode reward: [(0, '-56.635')] [2022-07-09 06:15:01,903][26022] Updated weights on worker 0-0, policy_version 123340 (0.00090) [2022-07-09 06:15:03,560][26022] Updated weights on worker 0-0, policy_version 123350 (0.00053) [2022-07-09 06:15:05,325][26022] Updated weights on worker 0-0, policy_version 123360 (0.00090) [2022-07-09 06:15:05,403][25689] Fps is (10 sec: 5605.9, 60 sec: 5799.6, 300 sec: 5763.5). Total num frames: 126320640. Throughput: 0: 5094.8. Samples: 126315700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:05,403][25689] Avg episode reward: [(0, '-57.168')] [2022-07-09 06:15:06,960][26022] Updated weights on worker 0-0, policy_version 123370 (0.00081) [2022-07-09 06:15:08,860][26022] Updated weights on worker 0-0, policy_version 123380 (0.00099) [2022-07-09 06:15:10,526][25689] Fps is (10 sec: 5444.8, 60 sec: 5793.6, 300 sec: 5754.4). Total num frames: 126349312. Throughput: 0: 5955.8. Samples: 126350794. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:10,528][25689] Avg episode reward: [(0, '-56.527')] [2022-07-09 06:15:10,796][26022] Updated weights on worker 0-0, policy_version 123390 (0.00084) [2022-07-09 06:15:12,362][26022] Updated weights on worker 0-0, policy_version 123400 (0.00090) [2022-07-09 06:15:14,234][26022] Updated weights on worker 0-0, policy_version 123410 (0.00079) [2022-07-09 06:15:15,538][25689] Fps is (10 sec: 5960.8, 60 sec: 5827.2, 300 sec: 5769.2). Total num frames: 126381056. Throughput: 0: 5944.7. Samples: 126385482. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:15,539][25689] Avg episode reward: [(0, '-55.741')] [2022-07-09 06:15:15,647][26022] Updated weights on worker 0-0, policy_version 123420 (0.00091) [2022-07-09 06:15:17,675][26022] Updated weights on worker 0-0, policy_version 123430 (0.00090) [2022-07-09 06:15:19,610][26022] Updated weights on worker 0-0, policy_version 123440 (0.00085) [2022-07-09 06:15:20,554][25689] Fps is (10 sec: 5820.3, 60 sec: 5793.6, 300 sec: 5751.8). Total num frames: 126407680. Throughput: 0: 5083.9. Samples: 126403102. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:20,555][25689] Avg episode reward: [(0, '-55.015')] [2022-07-09 06:15:20,987][26022] Updated weights on worker 0-0, policy_version 123450 (0.00085) [2022-07-09 06:15:23,007][26022] Updated weights on worker 0-0, policy_version 123460 (0.00076) [2022-07-09 06:15:24,546][26022] Updated weights on worker 0-0, policy_version 123470 (0.00091) [2022-07-09 06:15:25,574][25689] Fps is (10 sec: 5611.8, 60 sec: 5763.5, 300 sec: 5761.1). Total num frames: 126437376. Throughput: 0: 6074.7. Samples: 126438236. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:25,575][25689] Avg episode reward: [(0, '-54.671')] [2022-07-09 06:15:26,283][26022] Updated weights on worker 0-0, policy_version 123480 (0.00086) [2022-07-09 06:15:28,392][26022] Updated weights on worker 0-0, policy_version 123490 (0.00085) [2022-07-09 06:15:29,883][26022] Updated weights on worker 0-0, policy_version 123500 (0.00108) [2022-07-09 06:15:30,667][25689] Fps is (10 sec: 5974.1, 60 sec: 5811.2, 300 sec: 5764.2). Total num frames: 126468096. Throughput: 0: 6082.0. Samples: 126473292. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:30,668][25689] Avg episode reward: [(0, '-54.471')] [2022-07-09 06:15:31,490][26022] Updated weights on worker 0-0, policy_version 123510 (0.00083) [2022-07-09 06:15:33,262][26022] Updated weights on worker 0-0, policy_version 123520 (0.00081) [2022-07-09 06:15:35,068][26022] Updated weights on worker 0-0, policy_version 123530 (0.00085) [2022-07-09 06:15:35,741][25689] Fps is (10 sec: 5942.3, 60 sec: 5790.1, 300 sec: 5764.5). Total num frames: 126497792. Throughput: 0: 5231.0. Samples: 126491164. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:15:35,742][25689] Avg episode reward: [(0, '-54.152')] [2022-07-09 06:15:36,801][26022] Updated weights on worker 0-0, policy_version 123540 (0.00080) [2022-07-09 06:15:38,669][26022] Updated weights on worker 0-0, policy_version 123550 (0.00087) [2022-07-09 06:15:40,401][26022] Updated weights on worker 0-0, policy_version 123560 (0.00080) [2022-07-09 06:15:40,825][25689] Fps is (10 sec: 5846.8, 60 sec: 5840.9, 300 sec: 5764.4). Total num frames: 126527488. Throughput: 0: 6088.4. Samples: 126526520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:15:40,826][25689] Avg episode reward: [(0, '-54.215')] [2022-07-09 06:15:42,179][26022] Updated weights on worker 0-0, policy_version 123570 (0.00089) [2022-07-09 06:15:43,766][26022] Updated weights on worker 0-0, policy_version 123580 (0.00084) [2022-07-09 06:15:45,534][26022] Updated weights on worker 0-0, policy_version 123590 (0.00109) [2022-07-09 06:15:45,870][25689] Fps is (10 sec: 5863.6, 60 sec: 5803.0, 300 sec: 5765.2). Total num frames: 126557184. Throughput: 0: 6071.3. Samples: 126561458. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:15:45,872][25689] Avg episode reward: [(0, '-55.392')] [2022-07-09 06:15:47,431][26022] Updated weights on worker 0-0, policy_version 123600 (0.00084) [2022-07-09 06:15:49,157][26022] Updated weights on worker 0-0, policy_version 123610 (0.00085) [2022-07-09 06:15:50,934][25689] Fps is (10 sec: 5774.1, 60 sec: 5773.2, 300 sec: 5764.9). Total num frames: 126585856. Throughput: 0: 5204.7. Samples: 126578774. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:15:50,934][25689] Avg episode reward: [(0, '-56.106')] [2022-07-09 06:15:50,942][26022] Updated weights on worker 0-0, policy_version 123620 (0.00092) [2022-07-09 06:15:52,827][26022] Updated weights on worker 0-0, policy_version 123630 (0.00086) [2022-07-09 06:15:54,472][26022] Updated weights on worker 0-0, policy_version 123640 (0.00085) [2022-07-09 06:15:55,949][25689] Fps is (10 sec: 5790.9, 60 sec: 5796.5, 300 sec: 5768.2). Total num frames: 126615552. Throughput: 0: 6064.4. Samples: 126613714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:15:55,950][25689] Avg episode reward: [(0, '-55.630')] [2022-07-09 06:15:56,240][26022] Updated weights on worker 0-0, policy_version 123650 (0.00087) [2022-07-09 06:15:57,920][26022] Updated weights on worker 0-0, policy_version 123660 (0.00085) [2022-07-09 06:15:59,894][26022] Updated weights on worker 0-0, policy_version 123670 (0.00089) [2022-07-09 06:16:00,976][25689] Fps is (10 sec: 5914.6, 60 sec: 5795.4, 300 sec: 5774.6). Total num frames: 126645248. Throughput: 0: 6078.6. Samples: 126649006. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:00,976][25689] Avg episode reward: [(0, '-55.564')] [2022-07-09 06:16:01,316][26022] Updated weights on worker 0-0, policy_version 123680 (0.00095) [2022-07-09 06:16:03,543][26022] Updated weights on worker 0-0, policy_version 123690 (0.00085) [2022-07-09 06:16:05,245][26022] Updated weights on worker 0-0, policy_version 123700 (0.00085) [2022-07-09 06:16:05,984][25689] Fps is (10 sec: 5612.7, 60 sec: 5797.7, 300 sec: 5769.2). Total num frames: 126671872. Throughput: 0: 5998.6. Samples: 126682112. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:05,984][25689] Avg episode reward: [(0, '-54.804')] [2022-07-09 06:16:07,025][26022] Updated weights on worker 0-0, policy_version 123710 (0.00082) [2022-07-09 06:16:08,836][26022] Updated weights on worker 0-0, policy_version 123720 (0.00086) [2022-07-09 06:16:10,612][26022] Updated weights on worker 0-0, policy_version 123730 (0.00080) [2022-07-09 06:16:11,034][25689] Fps is (10 sec: 5497.7, 60 sec: 5804.8, 300 sec: 5761.9). Total num frames: 126700544. Throughput: 0: 6008.9. Samples: 126699550. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:11,034][25689] Avg episode reward: [(0, '-55.286')] [2022-07-09 06:16:12,386][26022] Updated weights on worker 0-0, policy_version 123740 (0.00079) [2022-07-09 06:16:14,105][26022] Updated weights on worker 0-0, policy_version 123750 (0.00081) [2022-07-09 06:16:15,840][26022] Updated weights on worker 0-0, policy_version 123760 (0.00088) [2022-07-09 06:16:16,039][25689] Fps is (10 sec: 5906.6, 60 sec: 5788.5, 300 sec: 5772.5). Total num frames: 126731264. Throughput: 0: 6008.6. Samples: 126734424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:16,040][25689] Avg episode reward: [(0, '-55.575')] [2022-07-09 06:16:17,892][26022] Updated weights on worker 0-0, policy_version 123770 (0.00086) [2022-07-09 06:16:19,495][26022] Updated weights on worker 0-0, policy_version 123780 (0.00087) [2022-07-09 06:16:21,091][25689] Fps is (10 sec: 5905.3, 60 sec: 5818.9, 300 sec: 5768.6). Total num frames: 126759936. Throughput: 0: 5988.0. Samples: 126769458. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:21,091][25689] Avg episode reward: [(0, '-55.592')] [2022-07-09 06:16:21,282][26022] Updated weights on worker 0-0, policy_version 123790 (0.00084) [2022-07-09 06:16:22,910][26022] Updated weights on worker 0-0, policy_version 123800 (0.00083) [2022-07-09 06:16:24,864][26022] Updated weights on worker 0-0, policy_version 123810 (0.00623) [2022-07-09 06:16:26,094][25689] Fps is (10 sec: 5805.0, 60 sec: 5820.5, 300 sec: 5770.4). Total num frames: 126789632. Throughput: 0: 5216.7. Samples: 126787020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:26,094][25689] Avg episode reward: [(0, '-55.711')] [2022-07-09 06:16:26,383][26022] Updated weights on worker 0-0, policy_version 123820 (0.00084) [2022-07-09 06:16:28,309][26022] Updated weights on worker 0-0, policy_version 123830 (0.00085) [2022-07-09 06:16:29,799][26022] Updated weights on worker 0-0, policy_version 123840 (0.00083) [2022-07-09 06:16:31,209][25689] Fps is (10 sec: 5768.6, 60 sec: 5784.6, 300 sec: 5772.0). Total num frames: 126818304. Throughput: 0: 6058.8. Samples: 126821790. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:31,210][25689] Avg episode reward: [(0, '-55.255')] [2022-07-09 06:16:31,949][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:16:31,966][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000123850_126822400.pth [2022-07-09 06:16:31,966][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000121821_124744704.pth [2022-07-09 06:16:31,974][26022] Updated weights on worker 0-0, policy_version 123850 (0.00082) [2022-07-09 06:16:33,462][26022] Updated weights on worker 0-0, policy_version 123860 (0.00083) [2022-07-09 06:16:35,284][26022] Updated weights on worker 0-0, policy_version 123870 (0.00081) [2022-07-09 06:16:36,211][25689] Fps is (10 sec: 5667.9, 60 sec: 5774.5, 300 sec: 5769.3). Total num frames: 126846976. Throughput: 0: 6057.8. Samples: 126856622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:36,212][25689] Avg episode reward: [(0, '-55.002')] [2022-07-09 06:16:36,992][26022] Updated weights on worker 0-0, policy_version 123880 (0.00087) [2022-07-09 06:16:39,000][26022] Updated weights on worker 0-0, policy_version 123890 (0.00085) [2022-07-09 06:16:40,475][26022] Updated weights on worker 0-0, policy_version 123900 (0.00101) [2022-07-09 06:16:41,219][25689] Fps is (10 sec: 5933.4, 60 sec: 5798.8, 300 sec: 5772.6). Total num frames: 126877696. Throughput: 0: 5202.8. Samples: 126874180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:41,220][25689] Avg episode reward: [(0, '-53.799')] [2022-07-09 06:16:42,550][26022] Updated weights on worker 0-0, policy_version 123910 (0.00097) [2022-07-09 06:16:43,931][26022] Updated weights on worker 0-0, policy_version 123920 (0.00090) [2022-07-09 06:16:46,040][26022] Updated weights on worker 0-0, policy_version 123930 (0.00085) [2022-07-09 06:16:46,249][25689] Fps is (10 sec: 5917.0, 60 sec: 5783.3, 300 sec: 5770.9). Total num frames: 126906368. Throughput: 0: 6055.8. Samples: 126909074. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:46,249][25689] Avg episode reward: [(0, '-53.551')] [2022-07-09 06:16:47,364][26022] Updated weights on worker 0-0, policy_version 123940 (0.00082) [2022-07-09 06:16:49,572][26022] Updated weights on worker 0-0, policy_version 123950 (0.00087) [2022-07-09 06:16:50,981][26022] Updated weights on worker 0-0, policy_version 123960 (0.00086) [2022-07-09 06:16:51,370][25689] Fps is (10 sec: 5749.8, 60 sec: 5794.7, 300 sec: 5772.5). Total num frames: 126936064. Throughput: 0: 6057.4. Samples: 126943912. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:51,371][25689] Avg episode reward: [(0, '-53.095')] [2022-07-09 06:16:52,950][26022] Updated weights on worker 0-0, policy_version 123970 (0.00094) [2022-07-09 06:16:54,697][26022] Updated weights on worker 0-0, policy_version 123980 (0.00082) [2022-07-09 06:16:56,392][25689] Fps is (10 sec: 5855.4, 60 sec: 5794.1, 300 sec: 5772.7). Total num frames: 126965760. Throughput: 0: 5196.6. Samples: 126961492. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:16:56,392][25689] Avg episode reward: [(0, '-53.778')] [2022-07-09 06:16:56,392][26022] Updated weights on worker 0-0, policy_version 123990 (0.00081) [2022-07-09 06:16:58,141][26022] Updated weights on worker 0-0, policy_version 124000 (0.00086) [2022-07-09 06:17:00,087][26022] Updated weights on worker 0-0, policy_version 124010 (0.00087) [2022-07-09 06:17:01,478][25689] Fps is (10 sec: 5774.4, 60 sec: 5771.4, 300 sec: 5775.1). Total num frames: 126994432. Throughput: 0: 6033.6. Samples: 126996416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:01,479][25689] Avg episode reward: [(0, '-55.335')] [2022-07-09 06:17:01,749][26022] Updated weights on worker 0-0, policy_version 124020 (0.00085) [2022-07-09 06:17:03,938][26022] Updated weights on worker 0-0, policy_version 124030 (0.00086) [2022-07-09 06:17:05,345][26022] Updated weights on worker 0-0, policy_version 124040 (0.00092) [2022-07-09 06:17:06,559][25689] Fps is (10 sec: 5640.2, 60 sec: 5798.3, 300 sec: 5775.3). Total num frames: 127023104. Throughput: 0: 5928.0. Samples: 127029472. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:06,561][25689] Avg episode reward: [(0, '-55.487')] [2022-07-09 06:17:07,699][26022] Updated weights on worker 0-0, policy_version 124050 (0.00083) [2022-07-09 06:17:08,803][26022] Updated weights on worker 0-0, policy_version 124060 (0.00085) [2022-07-09 06:17:11,017][26022] Updated weights on worker 0-0, policy_version 124070 (0.00581) [2022-07-09 06:17:11,683][25689] Fps is (10 sec: 5619.5, 60 sec: 5791.2, 300 sec: 5774.3). Total num frames: 127051776. Throughput: 0: 5067.9. Samples: 127046854. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:11,683][25689] Avg episode reward: [(0, '-55.513')] [2022-07-09 06:17:12,485][26022] Updated weights on worker 0-0, policy_version 124080 (0.00086) [2022-07-09 06:17:14,315][26022] Updated weights on worker 0-0, policy_version 124090 (0.00095) [2022-07-09 06:17:16,201][26022] Updated weights on worker 0-0, policy_version 124100 (0.00080) [2022-07-09 06:17:16,691][25689] Fps is (10 sec: 5760.7, 60 sec: 5774.1, 300 sec: 5774.5). Total num frames: 127081472. Throughput: 0: 5935.2. Samples: 127081970. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:16,691][25689] Avg episode reward: [(0, '-56.042')] [2022-07-09 06:17:17,873][26022] Updated weights on worker 0-0, policy_version 124110 (0.00090) [2022-07-09 06:17:19,535][26022] Updated weights on worker 0-0, policy_version 124120 (0.00092) [2022-07-09 06:17:21,519][26022] Updated weights on worker 0-0, policy_version 124130 (0.00087) [2022-07-09 06:17:21,710][25689] Fps is (10 sec: 5820.8, 60 sec: 5777.2, 300 sec: 5774.4). Total num frames: 127110144. Throughput: 0: 5969.4. Samples: 127117188. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:21,711][25689] Avg episode reward: [(0, '-55.142')] [2022-07-09 06:17:22,975][26022] Updated weights on worker 0-0, policy_version 124140 (0.00400) [2022-07-09 06:17:24,922][26022] Updated weights on worker 0-0, policy_version 124150 (0.00085) [2022-07-09 06:17:26,512][26022] Updated weights on worker 0-0, policy_version 124160 (0.00090) [2022-07-09 06:17:26,718][25689] Fps is (10 sec: 5922.8, 60 sec: 5793.6, 300 sec: 5776.3). Total num frames: 127140864. Throughput: 0: 5232.1. Samples: 127134950. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:26,719][25689] Avg episode reward: [(0, '-53.688')] [2022-07-09 06:17:28,463][26022] Updated weights on worker 0-0, policy_version 124170 (0.00092) [2022-07-09 06:17:30,166][26022] Updated weights on worker 0-0, policy_version 124180 (0.00093) [2022-07-09 06:17:31,826][25689] Fps is (10 sec: 5769.7, 60 sec: 5777.4, 300 sec: 5770.9). Total num frames: 127168512. Throughput: 0: 6107.0. Samples: 127169872. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:31,827][25689] Avg episode reward: [(0, '-53.669')] [2022-07-09 06:17:32,026][26022] Updated weights on worker 0-0, policy_version 124190 (0.00089) [2022-07-09 06:17:33,693][26022] Updated weights on worker 0-0, policy_version 124200 (0.00085) [2022-07-09 06:17:35,575][26022] Updated weights on worker 0-0, policy_version 124210 (0.00084) [2022-07-09 06:17:36,838][25689] Fps is (10 sec: 5666.9, 60 sec: 5793.4, 300 sec: 5781.2). Total num frames: 127198208. Throughput: 0: 6095.3. Samples: 127204772. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 06:17:36,839][25689] Avg episode reward: [(0, '-54.315')] [2022-07-09 06:17:37,112][26022] Updated weights on worker 0-0, policy_version 124220 (0.00092) [2022-07-09 06:17:39,237][26022] Updated weights on worker 0-0, policy_version 124230 (0.00071) [2022-07-09 06:17:40,506][26022] Updated weights on worker 0-0, policy_version 124240 (0.00086) [2022-07-09 06:17:41,860][25689] Fps is (10 sec: 5919.0, 60 sec: 5775.1, 300 sec: 5777.9). Total num frames: 127227904. Throughput: 0: 5217.8. Samples: 127222328. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:17:41,861][25689] Avg episode reward: [(0, '-54.407')] [2022-07-09 06:17:42,674][26022] Updated weights on worker 0-0, policy_version 124250 (0.00088) [2022-07-09 06:17:44,083][26022] Updated weights on worker 0-0, policy_version 124260 (0.00092) [2022-07-09 06:17:46,025][26022] Updated weights on worker 0-0, policy_version 124270 (0.00093) [2022-07-09 06:17:46,877][25689] Fps is (10 sec: 5916.0, 60 sec: 5793.2, 300 sec: 5786.5). Total num frames: 127257600. Throughput: 0: 6071.1. Samples: 127257334. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:17:46,877][25689] Avg episode reward: [(0, '-54.352')] [2022-07-09 06:17:47,685][26022] Updated weights on worker 0-0, policy_version 124280 (0.00093) [2022-07-09 06:17:49,560][26022] Updated weights on worker 0-0, policy_version 124290 (0.00081) [2022-07-09 06:17:51,255][26022] Updated weights on worker 0-0, policy_version 124300 (0.00090) [2022-07-09 06:17:51,922][25689] Fps is (10 sec: 5902.8, 60 sec: 5800.5, 300 sec: 5779.9). Total num frames: 127287296. Throughput: 0: 6093.5. Samples: 127292326. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:17:51,924][25689] Avg episode reward: [(0, '-55.955')] [2022-07-09 06:17:52,979][26022] Updated weights on worker 0-0, policy_version 124310 (0.00079) [2022-07-09 06:17:54,702][26022] Updated weights on worker 0-0, policy_version 124320 (0.00087) [2022-07-09 06:17:56,651][26022] Updated weights on worker 0-0, policy_version 124330 (0.00086) [2022-07-09 06:17:56,931][25689] Fps is (10 sec: 5805.0, 60 sec: 5784.8, 300 sec: 5781.0). Total num frames: 127315968. Throughput: 0: 5229.5. Samples: 127309854. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:17:56,933][25689] Avg episode reward: [(0, '-56.058')] [2022-07-09 06:17:58,102][26022] Updated weights on worker 0-0, policy_version 124340 (0.00085) [2022-07-09 06:17:59,940][26022] Updated weights on worker 0-0, policy_version 124350 (0.00086) [2022-07-09 06:18:01,991][25689] Fps is (10 sec: 5593.7, 60 sec: 5770.5, 300 sec: 5781.0). Total num frames: 127343616. Throughput: 0: 6089.5. Samples: 127344912. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:01,991][25689] Avg episode reward: [(0, '-55.444')] [2022-07-09 06:18:02,214][26022] Updated weights on worker 0-0, policy_version 124360 (0.00089) [2022-07-09 06:18:03,881][26022] Updated weights on worker 0-0, policy_version 124370 (0.00083) [2022-07-09 06:18:05,712][26022] Updated weights on worker 0-0, policy_version 124380 (0.00086) [2022-07-09 06:18:07,003][25689] Fps is (10 sec: 5592.0, 60 sec: 5776.9, 300 sec: 5785.4). Total num frames: 127372288. Throughput: 0: 5965.5. Samples: 127377398. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:07,004][25689] Avg episode reward: [(0, '-55.799')] [2022-07-09 06:18:07,487][26022] Updated weights on worker 0-0, policy_version 124390 (0.00089) [2022-07-09 06:18:09,193][26022] Updated weights on worker 0-0, policy_version 124400 (0.00085) [2022-07-09 06:18:11,022][26022] Updated weights on worker 0-0, policy_version 124410 (0.00087) [2022-07-09 06:18:12,061][25689] Fps is (10 sec: 5694.6, 60 sec: 5783.3, 300 sec: 5774.6). Total num frames: 127400960. Throughput: 0: 5090.0. Samples: 127394830. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:12,061][25689] Avg episode reward: [(0, '-56.330')] [2022-07-09 06:18:12,794][26022] Updated weights on worker 0-0, policy_version 124420 (0.00086) [2022-07-09 06:18:14,523][26022] Updated weights on worker 0-0, policy_version 124430 (0.00084) [2022-07-09 06:18:16,358][26022] Updated weights on worker 0-0, policy_version 124440 (0.00091) [2022-07-09 06:18:17,071][25689] Fps is (10 sec: 5695.6, 60 sec: 5766.1, 300 sec: 5781.4). Total num frames: 127429632. Throughput: 0: 5946.3. Samples: 127429610. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:17,076][25689] Avg episode reward: [(0, '-55.256')] [2022-07-09 06:18:17,973][26022] Updated weights on worker 0-0, policy_version 124450 (0.00082) [2022-07-09 06:18:19,936][26022] Updated weights on worker 0-0, policy_version 124460 (0.00092) [2022-07-09 06:18:21,575][26022] Updated weights on worker 0-0, policy_version 124470 (0.00087) [2022-07-09 06:18:22,094][25689] Fps is (10 sec: 5919.2, 60 sec: 5799.7, 300 sec: 5788.7). Total num frames: 127460352. Throughput: 0: 5961.7. Samples: 127464764. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:22,095][25689] Avg episode reward: [(0, '-55.723')] [2022-07-09 06:18:23,359][26022] Updated weights on worker 0-0, policy_version 124480 (0.00089) [2022-07-09 06:18:24,862][26022] Updated weights on worker 0-0, policy_version 124490 (0.00088) [2022-07-09 06:18:26,875][26022] Updated weights on worker 0-0, policy_version 124500 (0.00085) [2022-07-09 06:18:27,180][25689] Fps is (10 sec: 5875.1, 60 sec: 5758.3, 300 sec: 5785.9). Total num frames: 127489024. Throughput: 0: 5210.4. Samples: 127482530. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:27,181][25689] Avg episode reward: [(0, '-56.045')] [2022-07-09 06:18:28,645][26022] Updated weights on worker 0-0, policy_version 124510 (0.00095) [2022-07-09 06:18:30,407][26022] Updated weights on worker 0-0, policy_version 124520 (0.00086) [2022-07-09 06:18:32,030][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:18:32,046][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000124529_127517696.pth [2022-07-09 06:18:32,046][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000122495_125434880.pth [2022-07-09 06:18:32,152][26022] Updated weights on worker 0-0, policy_version 124530 (0.00083) [2022-07-09 06:18:32,257][25689] Fps is (10 sec: 5743.2, 60 sec: 5795.2, 300 sec: 5787.9). Total num frames: 127518720. Throughput: 0: 6038.0. Samples: 127516778. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:32,258][25689] Avg episode reward: [(0, '-56.568')] [2022-07-09 06:18:34,043][26022] Updated weights on worker 0-0, policy_version 124540 (0.00089) [2022-07-09 06:18:35,703][26022] Updated weights on worker 0-0, policy_version 124550 (0.00084) [2022-07-09 06:18:37,259][25689] Fps is (10 sec: 5791.2, 60 sec: 5779.1, 300 sec: 5781.7). Total num frames: 127547392. Throughput: 0: 6033.5. Samples: 127551414. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:37,260][25689] Avg episode reward: [(0, '-56.824')] [2022-07-09 06:18:37,563][26022] Updated weights on worker 0-0, policy_version 124560 (0.00086) [2022-07-09 06:18:39,284][26022] Updated weights on worker 0-0, policy_version 124570 (0.00090) [2022-07-09 06:18:41,112][26022] Updated weights on worker 0-0, policy_version 124580 (0.00090) [2022-07-09 06:18:42,313][25689] Fps is (10 sec: 5804.3, 60 sec: 5776.2, 300 sec: 5788.2). Total num frames: 127577088. Throughput: 0: 5147.5. Samples: 127568840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:42,314][25689] Avg episode reward: [(0, '-56.331')] [2022-07-09 06:18:42,850][26022] Updated weights on worker 0-0, policy_version 124590 (0.00085) [2022-07-09 06:18:44,613][26022] Updated weights on worker 0-0, policy_version 124600 (0.00087) [2022-07-09 06:18:46,428][26022] Updated weights on worker 0-0, policy_version 124610 (0.00084) [2022-07-09 06:18:47,349][25689] Fps is (10 sec: 5683.2, 60 sec: 5740.4, 300 sec: 5778.0). Total num frames: 127604736. Throughput: 0: 5998.3. Samples: 127603510. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:47,350][25689] Avg episode reward: [(0, '-56.185')] [2022-07-09 06:18:48,166][26022] Updated weights on worker 0-0, policy_version 124620 (0.00092) [2022-07-09 06:18:49,994][26022] Updated weights on worker 0-0, policy_version 124630 (0.00088) [2022-07-09 06:18:51,640][26022] Updated weights on worker 0-0, policy_version 124640 (0.00088) [2022-07-09 06:18:52,476][25689] Fps is (10 sec: 5642.3, 60 sec: 5732.6, 300 sec: 5779.5). Total num frames: 127634432. Throughput: 0: 5996.1. Samples: 127638014. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:52,477][25689] Avg episode reward: [(0, '-56.782')] [2022-07-09 06:18:53,647][26022] Updated weights on worker 0-0, policy_version 124650 (0.00085) [2022-07-09 06:18:55,238][26022] Updated weights on worker 0-0, policy_version 124660 (0.00094) [2022-07-09 06:18:57,081][26022] Updated weights on worker 0-0, policy_version 124670 (0.00091) [2022-07-09 06:18:57,485][25689] Fps is (10 sec: 5859.4, 60 sec: 5749.6, 300 sec: 5779.4). Total num frames: 127664128. Throughput: 0: 6026.6. Samples: 127673310. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:18:57,486][25689] Avg episode reward: [(0, '-56.929')] [2022-07-09 06:18:58,741][26022] Updated weights on worker 0-0, policy_version 124680 (0.00091) [2022-07-09 06:19:00,533][26022] Updated weights on worker 0-0, policy_version 124690 (0.00086) [2022-07-09 06:19:02,552][25689] Fps is (10 sec: 5691.5, 60 sec: 5748.9, 300 sec: 5785.4). Total num frames: 127691776. Throughput: 0: 6025.1. Samples: 127690780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:02,552][25689] Avg episode reward: [(0, '-56.105')] [2022-07-09 06:19:02,725][26022] Updated weights on worker 0-0, policy_version 124700 (0.00087) [2022-07-09 06:19:04,334][26022] Updated weights on worker 0-0, policy_version 124710 (0.00096) [2022-07-09 06:19:06,129][26022] Updated weights on worker 0-0, policy_version 124720 (0.00092) [2022-07-09 06:19:07,574][25689] Fps is (10 sec: 5684.0, 60 sec: 5764.9, 300 sec: 5789.6). Total num frames: 127721472. Throughput: 0: 5924.8. Samples: 127723340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:07,575][25689] Avg episode reward: [(0, '-56.331')] [2022-07-09 06:19:08,054][26022] Updated weights on worker 0-0, policy_version 124730 (0.00088) [2022-07-09 06:19:09,795][26022] Updated weights on worker 0-0, policy_version 124740 (0.00082) [2022-07-09 06:19:11,647][26022] Updated weights on worker 0-0, policy_version 124750 (0.00089) [2022-07-09 06:19:12,615][25689] Fps is (10 sec: 5800.4, 60 sec: 5766.5, 300 sec: 5785.5). Total num frames: 127750144. Throughput: 0: 5964.7. Samples: 127758132. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:12,616][25689] Avg episode reward: [(0, '-55.356')] [2022-07-09 06:19:13,288][26022] Updated weights on worker 0-0, policy_version 124760 (0.00099) [2022-07-09 06:19:15,114][26022] Updated weights on worker 0-0, policy_version 124770 (0.00086) [2022-07-09 06:19:16,897][26022] Updated weights on worker 0-0, policy_version 124780 (0.00090) [2022-07-09 06:19:17,628][25689] Fps is (10 sec: 5602.1, 60 sec: 5749.4, 300 sec: 5782.2). Total num frames: 127777792. Throughput: 0: 5068.7. Samples: 127775404. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:17,628][25689] Avg episode reward: [(0, '-55.711')] [2022-07-09 06:19:18,655][26022] Updated weights on worker 0-0, policy_version 124790 (0.00087) [2022-07-09 06:19:20,460][26022] Updated weights on worker 0-0, policy_version 124800 (0.00085) [2022-07-09 06:19:22,079][26022] Updated weights on worker 0-0, policy_version 124810 (0.00086) [2022-07-09 06:19:22,631][25689] Fps is (10 sec: 5725.1, 60 sec: 5734.3, 300 sec: 5776.4). Total num frames: 127807488. Throughput: 0: 5954.0. Samples: 127810330. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:22,631][25689] Avg episode reward: [(0, '-55.937')] [2022-07-09 06:19:23,858][26022] Updated weights on worker 0-0, policy_version 124820 (0.00084) [2022-07-09 06:19:25,827][26022] Updated weights on worker 0-0, policy_version 124830 (0.00092) [2022-07-09 06:19:27,415][26022] Updated weights on worker 0-0, policy_version 124840 (0.00083) [2022-07-09 06:19:27,634][25689] Fps is (10 sec: 6038.0, 60 sec: 5776.1, 300 sec: 5787.9). Total num frames: 127838208. Throughput: 0: 6071.0. Samples: 127845120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:27,635][25689] Avg episode reward: [(0, '-55.747')] [2022-07-09 06:19:29,261][26022] Updated weights on worker 0-0, policy_version 124850 (0.00080) [2022-07-09 06:19:30,815][26022] Updated weights on worker 0-0, policy_version 124860 (0.00053) [2022-07-09 06:19:32,702][25689] Fps is (10 sec: 5694.2, 60 sec: 5726.1, 300 sec: 5773.4). Total num frames: 127864832. Throughput: 0: 5205.3. Samples: 127862690. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:32,702][25689] Avg episode reward: [(0, '-55.413')] [2022-07-09 06:19:32,795][26022] Updated weights on worker 0-0, policy_version 124870 (0.00081) [2022-07-09 06:19:34,365][26022] Updated weights on worker 0-0, policy_version 124880 (0.00086) [2022-07-09 06:19:36,278][26022] Updated weights on worker 0-0, policy_version 124890 (0.00082) [2022-07-09 06:19:37,694][26022] Updated weights on worker 0-0, policy_version 124900 (0.00090) [2022-07-09 06:19:37,720][25689] Fps is (10 sec: 5888.6, 60 sec: 5792.3, 300 sec: 5795.3). Total num frames: 127897600. Throughput: 0: 6089.6. Samples: 127897756. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:37,720][25689] Avg episode reward: [(0, '-56.458')] [2022-07-09 06:19:39,790][26022] Updated weights on worker 0-0, policy_version 124910 (0.00100) [2022-07-09 06:19:41,304][26022] Updated weights on worker 0-0, policy_version 124920 (0.00089) [2022-07-09 06:19:42,738][25689] Fps is (10 sec: 5815.9, 60 sec: 5728.0, 300 sec: 5774.3). Total num frames: 127923200. Throughput: 0: 6079.3. Samples: 127932564. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-09 06:19:42,738][25689] Avg episode reward: [(0, '-57.191')] [2022-07-09 06:19:43,303][26022] Updated weights on worker 0-0, policy_version 124930 (0.00091) [2022-07-09 06:19:45,053][26022] Updated weights on worker 0-0, policy_version 124940 (0.00101) [2022-07-09 06:19:46,833][26022] Updated weights on worker 0-0, policy_version 124950 (0.00082) [2022-07-09 06:19:47,749][25689] Fps is (10 sec: 5615.2, 60 sec: 5781.2, 300 sec: 5776.2). Total num frames: 127953920. Throughput: 0: 5195.1. Samples: 127949624. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:19:47,750][25689] Avg episode reward: [(0, '-56.432')] [2022-07-09 06:19:48,628][26022] Updated weights on worker 0-0, policy_version 124960 (0.00094) [2022-07-09 06:19:50,402][26022] Updated weights on worker 0-0, policy_version 124970 (0.00085) [2022-07-09 06:19:52,097][26022] Updated weights on worker 0-0, policy_version 124980 (0.00086) [2022-07-09 06:19:52,838][25689] Fps is (10 sec: 5981.4, 60 sec: 5784.9, 300 sec: 5779.5). Total num frames: 127983616. Throughput: 0: 6045.3. Samples: 127984422. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:19:52,839][25689] Avg episode reward: [(0, '-56.186')] [2022-07-09 06:19:53,904][26022] Updated weights on worker 0-0, policy_version 124990 (0.00078) [2022-07-09 06:19:55,463][26022] Updated weights on worker 0-0, policy_version 125000 (0.00085) [2022-07-09 06:19:57,450][26022] Updated weights on worker 0-0, policy_version 125010 (0.00088) [2022-07-09 06:19:57,905][25689] Fps is (10 sec: 5848.4, 60 sec: 5779.3, 300 sec: 5778.5). Total num frames: 128013312. Throughput: 0: 6026.8. Samples: 128019408. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:19:57,905][25689] Avg episode reward: [(0, '-56.163')] [2022-07-09 06:19:59,141][26022] Updated weights on worker 0-0, policy_version 125020 (0.00085) [2022-07-09 06:20:00,910][26022] Updated weights on worker 0-0, policy_version 125030 (0.00091) [2022-07-09 06:20:02,917][25689] Fps is (10 sec: 5486.2, 60 sec: 5750.6, 300 sec: 5775.4). Total num frames: 128038912. Throughput: 0: 5171.2. Samples: 128036918. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:02,918][25689] Avg episode reward: [(0, '-56.988')] [2022-07-09 06:20:03,176][26022] Updated weights on worker 0-0, policy_version 125040 (0.00088) [2022-07-09 06:20:04,805][26022] Updated weights on worker 0-0, policy_version 125050 (0.00089) [2022-07-09 06:20:06,810][26022] Updated weights on worker 0-0, policy_version 125060 (0.00088) [2022-07-09 06:20:07,938][25689] Fps is (10 sec: 5511.3, 60 sec: 5750.7, 300 sec: 5780.9). Total num frames: 128068608. Throughput: 0: 5944.3. Samples: 128069632. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:07,939][25689] Avg episode reward: [(0, '-57.024')] [2022-07-09 06:20:08,304][26022] Updated weights on worker 0-0, policy_version 125070 (0.00090) [2022-07-09 06:20:10,110][26022] Updated weights on worker 0-0, policy_version 125080 (0.00064) [2022-07-09 06:20:11,936][26022] Updated weights on worker 0-0, policy_version 125090 (0.00087) [2022-07-09 06:20:13,076][25689] Fps is (10 sec: 5846.3, 60 sec: 5758.4, 300 sec: 5771.5). Total num frames: 128098304. Throughput: 0: 5939.6. Samples: 128104628. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:13,077][25689] Avg episode reward: [(0, '-56.350')] [2022-07-09 06:20:13,640][26022] Updated weights on worker 0-0, policy_version 125100 (0.00082) [2022-07-09 06:20:15,355][26022] Updated weights on worker 0-0, policy_version 125110 (0.00088) [2022-07-09 06:20:17,207][26022] Updated weights on worker 0-0, policy_version 125120 (0.00085) [2022-07-09 06:20:18,091][25689] Fps is (10 sec: 5647.9, 60 sec: 5758.2, 300 sec: 5775.0). Total num frames: 128125952. Throughput: 0: 5076.8. Samples: 128121890. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:18,092][25689] Avg episode reward: [(0, '-56.332')] [2022-07-09 06:20:18,839][26022] Updated weights on worker 0-0, policy_version 125130 (0.00082) [2022-07-09 06:20:20,896][26022] Updated weights on worker 0-0, policy_version 125140 (0.00109) [2022-07-09 06:20:22,502][26022] Updated weights on worker 0-0, policy_version 125150 (0.00090) [2022-07-09 06:20:23,123][25689] Fps is (10 sec: 5809.7, 60 sec: 5772.4, 300 sec: 5778.2). Total num frames: 128156672. Throughput: 0: 5921.0. Samples: 128156556. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:23,124][25689] Avg episode reward: [(0, '-56.699')] [2022-07-09 06:20:24,304][26022] Updated weights on worker 0-0, policy_version 125160 (0.00091) [2022-07-09 06:20:25,999][26022] Updated weights on worker 0-0, policy_version 125170 (0.00097) [2022-07-09 06:20:27,795][26022] Updated weights on worker 0-0, policy_version 125180 (0.00084) [2022-07-09 06:20:28,127][25689] Fps is (10 sec: 5918.2, 60 sec: 5738.4, 300 sec: 5773.1). Total num frames: 128185344. Throughput: 0: 6021.8. Samples: 128191204. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:28,127][25689] Avg episode reward: [(0, '-56.596')] [2022-07-09 06:20:29,616][26022] Updated weights on worker 0-0, policy_version 125190 (0.00080) [2022-07-09 06:20:31,450][26022] Updated weights on worker 0-0, policy_version 125200 (0.00098) [2022-07-09 06:20:32,068][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:20:32,080][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000125203_128207872.pth [2022-07-09 06:20:32,080][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000123171_126127104.pth [2022-07-09 06:20:33,173][25689] Fps is (10 sec: 5807.5, 60 sec: 5791.3, 300 sec: 5773.6). Total num frames: 128215040. Throughput: 0: 5164.7. Samples: 128208424. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:33,174][25689] Avg episode reward: [(0, '-57.005')] [2022-07-09 06:20:33,179][26022] Updated weights on worker 0-0, policy_version 125210 (0.00529) [2022-07-09 06:20:34,973][26022] Updated weights on worker 0-0, policy_version 125220 (0.00189) [2022-07-09 06:20:36,715][26022] Updated weights on worker 0-0, policy_version 125230 (0.00081) [2022-07-09 06:20:38,212][25689] Fps is (10 sec: 5787.2, 60 sec: 5721.5, 300 sec: 5771.0). Total num frames: 128243712. Throughput: 0: 6023.1. Samples: 128243082. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:38,214][25689] Avg episode reward: [(0, '-58.229')] [2022-07-09 06:20:38,372][26022] Updated weights on worker 0-0, policy_version 125240 (0.00101) [2022-07-09 06:20:40,296][26022] Updated weights on worker 0-0, policy_version 125250 (0.00087) [2022-07-09 06:20:42,027][26022] Updated weights on worker 0-0, policy_version 125260 (0.00087) [2022-07-09 06:20:43,270][25689] Fps is (10 sec: 5679.7, 60 sec: 5768.6, 300 sec: 5767.3). Total num frames: 128272384. Throughput: 0: 6029.1. Samples: 128278022. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:43,270][25689] Avg episode reward: [(0, '-57.671')] [2022-07-09 06:20:43,791][26022] Updated weights on worker 0-0, policy_version 125270 (0.00085) [2022-07-09 06:20:45,847][26022] Updated weights on worker 0-0, policy_version 125280 (0.00083) [2022-07-09 06:20:47,329][26022] Updated weights on worker 0-0, policy_version 125290 (0.00511) [2022-07-09 06:20:48,299][25689] Fps is (10 sec: 5583.9, 60 sec: 5716.2, 300 sec: 5764.6). Total num frames: 128300032. Throughput: 0: 5163.4. Samples: 128295360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:48,299][25689] Avg episode reward: [(0, '-57.818')] [2022-07-09 06:20:49,292][26022] Updated weights on worker 0-0, policy_version 125300 (0.00089) [2022-07-09 06:20:50,887][26022] Updated weights on worker 0-0, policy_version 125310 (0.00082) [2022-07-09 06:20:52,648][26022] Updated weights on worker 0-0, policy_version 125320 (0.00085) [2022-07-09 06:20:53,390][25689] Fps is (10 sec: 5868.7, 60 sec: 5749.8, 300 sec: 5770.0). Total num frames: 128331776. Throughput: 0: 6017.0. Samples: 128330066. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:53,390][25689] Avg episode reward: [(0, '-57.957')] [2022-07-09 06:20:54,404][26022] Updated weights on worker 0-0, policy_version 125330 (0.00089) [2022-07-09 06:20:56,259][26022] Updated weights on worker 0-0, policy_version 125340 (0.00087) [2022-07-09 06:20:57,892][26022] Updated weights on worker 0-0, policy_version 125350 (0.00084) [2022-07-09 06:20:58,479][25689] Fps is (10 sec: 5934.8, 60 sec: 5730.8, 300 sec: 5765.3). Total num frames: 128360448. Throughput: 0: 6019.1. Samples: 128365066. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:20:58,479][25689] Avg episode reward: [(0, '-57.430')] [2022-07-09 06:20:59,854][26022] Updated weights on worker 0-0, policy_version 125360 (0.00086) [2022-07-09 06:21:01,595][26022] Updated weights on worker 0-0, policy_version 125370 (0.00090) [2022-07-09 06:21:03,485][25689] Fps is (10 sec: 5477.2, 60 sec: 5748.3, 300 sec: 5765.3). Total num frames: 128387072. Throughput: 0: 5162.4. Samples: 128382384. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:03,486][25689] Avg episode reward: [(0, '-56.175')] [2022-07-09 06:21:03,740][26022] Updated weights on worker 0-0, policy_version 125380 (0.00086) [2022-07-09 06:21:05,550][26022] Updated weights on worker 0-0, policy_version 125390 (0.00088) [2022-07-09 06:21:07,127][26022] Updated weights on worker 0-0, policy_version 125400 (0.00083) [2022-07-09 06:21:08,515][25689] Fps is (10 sec: 5407.6, 60 sec: 5713.6, 300 sec: 5762.3). Total num frames: 128414720. Throughput: 0: 5918.8. Samples: 128415014. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:08,515][25689] Avg episode reward: [(0, '-55.706')] [2022-07-09 06:21:08,906][26022] Updated weights on worker 0-0, policy_version 125410 (0.00083) [2022-07-09 06:21:10,695][26022] Updated weights on worker 0-0, policy_version 125420 (0.00083) [2022-07-09 06:21:12,648][26022] Updated weights on worker 0-0, policy_version 125430 (0.00085) [2022-07-09 06:21:13,614][25689] Fps is (10 sec: 5964.8, 60 sec: 5768.0, 300 sec: 5767.4). Total num frames: 128447488. Throughput: 0: 5929.0. Samples: 128449976. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:13,615][25689] Avg episode reward: [(0, '-55.498')] [2022-07-09 06:21:14,233][26022] Updated weights on worker 0-0, policy_version 125440 (0.00080) [2022-07-09 06:21:16,000][26022] Updated weights on worker 0-0, policy_version 125450 (0.00082) [2022-07-09 06:21:17,741][26022] Updated weights on worker 0-0, policy_version 125460 (0.00087) [2022-07-09 06:21:18,681][25689] Fps is (10 sec: 5942.6, 60 sec: 5763.1, 300 sec: 5763.6). Total num frames: 128475136. Throughput: 0: 5936.4. Samples: 128484998. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:18,682][25689] Avg episode reward: [(0, '-55.440')] [2022-07-09 06:21:19,555][26022] Updated weights on worker 0-0, policy_version 125470 (0.00082) [2022-07-09 06:21:21,246][26022] Updated weights on worker 0-0, policy_version 125480 (0.00085) [2022-07-09 06:21:23,077][26022] Updated weights on worker 0-0, policy_version 125490 (0.00086) [2022-07-09 06:21:23,721][25689] Fps is (10 sec: 5775.3, 60 sec: 5762.3, 300 sec: 5766.4). Total num frames: 128505856. Throughput: 0: 5934.6. Samples: 128502472. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:23,721][25689] Avg episode reward: [(0, '-55.534')] [2022-07-09 06:21:24,658][26022] Updated weights on worker 0-0, policy_version 125500 (0.00079) [2022-07-09 06:21:26,642][26022] Updated weights on worker 0-0, policy_version 125510 (0.00092) [2022-07-09 06:21:28,363][26022] Updated weights on worker 0-0, policy_version 125520 (0.00092) [2022-07-09 06:21:28,780][25689] Fps is (10 sec: 5881.2, 60 sec: 5757.1, 300 sec: 5767.4). Total num frames: 128534528. Throughput: 0: 6049.5. Samples: 128537610. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:28,781][25689] Avg episode reward: [(0, '-56.215')] [2022-07-09 06:21:30,242][26022] Updated weights on worker 0-0, policy_version 125530 (0.00078) [2022-07-09 06:21:31,960][26022] Updated weights on worker 0-0, policy_version 125540 (0.00080) [2022-07-09 06:21:33,526][26022] Updated weights on worker 0-0, policy_version 125550 (0.00090) [2022-07-09 06:21:33,853][25689] Fps is (10 sec: 5861.8, 60 sec: 5771.5, 300 sec: 5773.0). Total num frames: 128565248. Throughput: 0: 6058.3. Samples: 128572588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:33,853][25689] Avg episode reward: [(0, '-56.672')] [2022-07-09 06:21:35,435][26022] Updated weights on worker 0-0, policy_version 125560 (0.00088) [2022-07-09 06:21:37,062][26022] Updated weights on worker 0-0, policy_version 125570 (0.00102) [2022-07-09 06:21:38,747][26022] Updated weights on worker 0-0, policy_version 125580 (0.00085) [2022-07-09 06:21:38,910][25689] Fps is (10 sec: 5862.7, 60 sec: 5769.7, 300 sec: 5765.1). Total num frames: 128593920. Throughput: 0: 5202.1. Samples: 128590232. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:38,911][25689] Avg episode reward: [(0, '-57.289')] [2022-07-09 06:21:40,757][26022] Updated weights on worker 0-0, policy_version 125590 (0.00082) [2022-07-09 06:21:42,234][26022] Updated weights on worker 0-0, policy_version 125600 (0.00081) [2022-07-09 06:21:43,915][25689] Fps is (10 sec: 5597.3, 60 sec: 5757.8, 300 sec: 5762.2). Total num frames: 128621568. Throughput: 0: 6090.6. Samples: 128625466. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:21:43,915][25689] Avg episode reward: [(0, '-57.557')] [2022-07-09 06:21:44,290][26022] Updated weights on worker 0-0, policy_version 125610 (0.00090) [2022-07-09 06:21:45,874][26022] Updated weights on worker 0-0, policy_version 125620 (0.00088) [2022-07-09 06:21:47,617][26022] Updated weights on worker 0-0, policy_version 125630 (0.00081) [2022-07-09 06:21:48,980][25689] Fps is (10 sec: 5796.7, 60 sec: 5805.0, 300 sec: 5766.7). Total num frames: 128652288. Throughput: 0: 6061.0. Samples: 128660040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:21:48,980][25689] Avg episode reward: [(0, '-57.699')] [2022-07-09 06:21:49,657][26022] Updated weights on worker 0-0, policy_version 125640 (0.00081) [2022-07-09 06:21:51,163][26022] Updated weights on worker 0-0, policy_version 125650 (0.00079) [2022-07-09 06:21:52,912][26022] Updated weights on worker 0-0, policy_version 125660 (0.00078) [2022-07-09 06:21:54,027][25689] Fps is (10 sec: 5974.7, 60 sec: 5775.5, 300 sec: 5766.2). Total num frames: 128681984. Throughput: 0: 5197.0. Samples: 128677432. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:21:54,027][25689] Avg episode reward: [(0, '-56.968')] [2022-07-09 06:21:54,635][26022] Updated weights on worker 0-0, policy_version 125670 (0.00086) [2022-07-09 06:21:56,515][26022] Updated weights on worker 0-0, policy_version 125680 (0.00095) [2022-07-09 06:21:58,086][26022] Updated weights on worker 0-0, policy_version 125690 (0.00109) [2022-07-09 06:21:59,045][25689] Fps is (10 sec: 5697.1, 60 sec: 5765.3, 300 sec: 5764.1). Total num frames: 128709632. Throughput: 0: 6080.4. Samples: 128712660. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:21:59,046][25689] Avg episode reward: [(0, '-57.205')] [2022-07-09 06:21:59,987][26022] Updated weights on worker 0-0, policy_version 125700 (0.00080) [2022-07-09 06:22:01,538][26022] Updated weights on worker 0-0, policy_version 125710 (0.00081) [2022-07-09 06:22:03,944][26022] Updated weights on worker 0-0, policy_version 125720 (0.00083) [2022-07-09 06:22:04,063][25689] Fps is (10 sec: 5611.7, 60 sec: 5798.1, 300 sec: 5765.3). Total num frames: 128738304. Throughput: 0: 5958.0. Samples: 128745508. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:04,063][25689] Avg episode reward: [(0, '-57.008')] [2022-07-09 06:22:05,457][26022] Updated weights on worker 0-0, policy_version 125730 (0.00087) [2022-07-09 06:22:07,299][26022] Updated weights on worker 0-0, policy_version 125740 (0.00081) [2022-07-09 06:22:09,080][26022] Updated weights on worker 0-0, policy_version 125750 (0.00092) [2022-07-09 06:22:09,083][25689] Fps is (10 sec: 5713.0, 60 sec: 5815.9, 300 sec: 5767.3). Total num frames: 128766976. Throughput: 0: 5128.0. Samples: 128763128. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:09,083][25689] Avg episode reward: [(0, '-56.825')] [2022-07-09 06:22:11,054][26022] Updated weights on worker 0-0, policy_version 125760 (0.00085) [2022-07-09 06:22:12,618][26022] Updated weights on worker 0-0, policy_version 125770 (0.00086) [2022-07-09 06:22:14,123][25689] Fps is (10 sec: 5802.0, 60 sec: 5770.8, 300 sec: 5766.7). Total num frames: 128796672. Throughput: 0: 5995.9. Samples: 128797926. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:14,123][25689] Avg episode reward: [(0, '-56.675')] [2022-07-09 06:22:14,380][26022] Updated weights on worker 0-0, policy_version 125780 (0.00082) [2022-07-09 06:22:16,099][26022] Updated weights on worker 0-0, policy_version 125790 (0.00088) [2022-07-09 06:22:17,915][26022] Updated weights on worker 0-0, policy_version 125800 (0.00079) [2022-07-09 06:22:19,127][25689] Fps is (10 sec: 5913.1, 60 sec: 5810.7, 300 sec: 5770.4). Total num frames: 128826368. Throughput: 0: 6007.3. Samples: 128833296. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:19,127][25689] Avg episode reward: [(0, '-56.760')] [2022-07-09 06:22:19,712][26022] Updated weights on worker 0-0, policy_version 125810 (0.00089) [2022-07-09 06:22:21,475][26022] Updated weights on worker 0-0, policy_version 125820 (0.00090) [2022-07-09 06:22:23,191][26022] Updated weights on worker 0-0, policy_version 125830 (0.00087) [2022-07-09 06:22:24,130][25689] Fps is (10 sec: 5832.8, 60 sec: 5780.3, 300 sec: 5763.7). Total num frames: 128855040. Throughput: 0: 5252.9. Samples: 128850918. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:24,130][25689] Avg episode reward: [(0, '-56.990')] [2022-07-09 06:22:24,895][26022] Updated weights on worker 0-0, policy_version 125840 (0.00084) [2022-07-09 06:22:26,731][26022] Updated weights on worker 0-0, policy_version 125850 (0.00083) [2022-07-09 06:22:28,471][26022] Updated weights on worker 0-0, policy_version 125860 (0.00084) [2022-07-09 06:22:29,147][25689] Fps is (10 sec: 5825.1, 60 sec: 5801.3, 300 sec: 5772.3). Total num frames: 128884736. Throughput: 0: 6130.3. Samples: 128886128. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:29,147][25689] Avg episode reward: [(0, '-57.015')] [2022-07-09 06:22:30,071][26022] Updated weights on worker 0-0, policy_version 125870 (0.00080) [2022-07-09 06:22:31,965][26022] Updated weights on worker 0-0, policy_version 125880 (0.00080) [2022-07-09 06:22:32,208][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:22:32,221][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000125881_128902144.pth [2022-07-09 06:22:32,221][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000123850_126822400.pth [2022-07-09 06:22:33,747][26022] Updated weights on worker 0-0, policy_version 125890 (0.00091) [2022-07-09 06:22:34,216][25689] Fps is (10 sec: 5786.5, 60 sec: 5767.7, 300 sec: 5767.7). Total num frames: 128913408. Throughput: 0: 6104.8. Samples: 128920594. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:34,217][25689] Avg episode reward: [(0, '-57.452')] [2022-07-09 06:22:35,539][26022] Updated weights on worker 0-0, policy_version 125900 (0.00086) [2022-07-09 06:22:37,271][26022] Updated weights on worker 0-0, policy_version 125910 (0.00089) [2022-07-09 06:22:38,928][26022] Updated weights on worker 0-0, policy_version 125920 (0.00086) [2022-07-09 06:22:39,222][25689] Fps is (10 sec: 5793.2, 60 sec: 5789.7, 300 sec: 5768.1). Total num frames: 128943104. Throughput: 0: 5223.4. Samples: 128938262. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:39,223][25689] Avg episode reward: [(0, '-56.479')] [2022-07-09 06:22:40,814][26022] Updated weights on worker 0-0, policy_version 125930 (0.00087) [2022-07-09 06:22:42,459][26022] Updated weights on worker 0-0, policy_version 125940 (0.00090) [2022-07-09 06:22:44,235][25689] Fps is (10 sec: 5825.9, 60 sec: 5805.8, 300 sec: 5764.7). Total num frames: 128971776. Throughput: 0: 6086.6. Samples: 128973292. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:44,236][25689] Avg episode reward: [(0, '-56.130')] [2022-07-09 06:22:44,322][26022] Updated weights on worker 0-0, policy_version 125950 (0.00084) [2022-07-09 06:22:46,064][26022] Updated weights on worker 0-0, policy_version 125960 (0.00056) [2022-07-09 06:22:47,643][26022] Updated weights on worker 0-0, policy_version 125970 (0.00089) [2022-07-09 06:22:49,238][25689] Fps is (10 sec: 5725.1, 60 sec: 5777.8, 300 sec: 5762.1). Total num frames: 129000448. Throughput: 0: 6084.9. Samples: 129008382. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:49,239][25689] Avg episode reward: [(0, '-55.595')] [2022-07-09 06:22:49,763][26022] Updated weights on worker 0-0, policy_version 125980 (0.00083) [2022-07-09 06:22:51,114][26022] Updated weights on worker 0-0, policy_version 125990 (0.00084) [2022-07-09 06:22:53,275][26022] Updated weights on worker 0-0, policy_version 126000 (0.00089) [2022-07-09 06:22:54,359][25689] Fps is (10 sec: 5866.7, 60 sec: 5787.7, 300 sec: 5766.8). Total num frames: 129031168. Throughput: 0: 5222.1. Samples: 129025778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:54,359][25689] Avg episode reward: [(0, '-55.337')] [2022-07-09 06:22:54,583][26022] Updated weights on worker 0-0, policy_version 126010 (0.00085) [2022-07-09 06:22:56,789][26022] Updated weights on worker 0-0, policy_version 126020 (0.00084) [2022-07-09 06:22:58,271][26022] Updated weights on worker 0-0, policy_version 126030 (0.00169) [2022-07-09 06:22:59,403][25689] Fps is (10 sec: 5742.3, 60 sec: 5785.2, 300 sec: 5767.1). Total num frames: 129058816. Throughput: 0: 6068.9. Samples: 129060738. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:22:59,403][25689] Avg episode reward: [(0, '-55.492')] [2022-07-09 06:23:00,098][26022] Updated weights on worker 0-0, policy_version 126040 (0.00083) [2022-07-09 06:23:02,335][26022] Updated weights on worker 0-0, policy_version 126050 (0.00083) [2022-07-09 06:23:04,033][26022] Updated weights on worker 0-0, policy_version 126060 (0.00080) [2022-07-09 06:23:04,434][25689] Fps is (10 sec: 5488.1, 60 sec: 5767.0, 300 sec: 5763.3). Total num frames: 129086464. Throughput: 0: 5970.0. Samples: 129093880. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:04,437][25689] Avg episode reward: [(0, '-55.504')] [2022-07-09 06:23:05,630][26022] Updated weights on worker 0-0, policy_version 126070 (0.00069) [2022-07-09 06:23:07,546][26022] Updated weights on worker 0-0, policy_version 126080 (0.00085) [2022-07-09 06:23:09,070][26022] Updated weights on worker 0-0, policy_version 126090 (0.00091) [2022-07-09 06:23:09,457][25689] Fps is (10 sec: 5805.2, 60 sec: 5800.6, 300 sec: 5770.8). Total num frames: 129117184. Throughput: 0: 5100.2. Samples: 129111502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:09,458][25689] Avg episode reward: [(0, '-56.279')] [2022-07-09 06:23:11,159][26022] Updated weights on worker 0-0, policy_version 126100 (0.00090) [2022-07-09 06:23:12,651][26022] Updated weights on worker 0-0, policy_version 126110 (0.00084) [2022-07-09 06:23:14,497][25689] Fps is (10 sec: 6004.1, 60 sec: 5800.6, 300 sec: 5773.7). Total num frames: 129146880. Throughput: 0: 6008.8. Samples: 129146784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:14,502][26022] Updated weights on worker 0-0, policy_version 126120 (0.00095) [2022-07-09 06:23:14,502][25689] Avg episode reward: [(0, '-56.435')] [2022-07-09 06:23:16,265][26022] Updated weights on worker 0-0, policy_version 126130 (0.00084) [2022-07-09 06:23:18,021][26022] Updated weights on worker 0-0, policy_version 126140 (0.00082) [2022-07-09 06:23:19,523][25689] Fps is (10 sec: 5697.2, 60 sec: 5764.6, 300 sec: 5763.3). Total num frames: 129174528. Throughput: 0: 6017.2. Samples: 129181802. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:19,523][25689] Avg episode reward: [(0, '-56.042')] [2022-07-09 06:23:19,743][26022] Updated weights on worker 0-0, policy_version 126150 (0.00068) [2022-07-09 06:23:21,547][26022] Updated weights on worker 0-0, policy_version 126160 (0.00083) [2022-07-09 06:23:23,237][26022] Updated weights on worker 0-0, policy_version 126170 (0.00860) [2022-07-09 06:23:24,595][25689] Fps is (10 sec: 5780.0, 60 sec: 5791.9, 300 sec: 5770.5). Total num frames: 129205248. Throughput: 0: 5221.5. Samples: 129199150. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:24,595][25689] Avg episode reward: [(0, '-56.945')] [2022-07-09 06:23:25,029][26022] Updated weights on worker 0-0, policy_version 126180 (0.00090) [2022-07-09 06:23:26,710][26022] Updated weights on worker 0-0, policy_version 126190 (0.00098) [2022-07-09 06:23:28,626][26022] Updated weights on worker 0-0, policy_version 126200 (0.00087) [2022-07-09 06:23:29,636][25689] Fps is (10 sec: 5973.9, 60 sec: 5789.6, 300 sec: 5771.2). Total num frames: 129234944. Throughput: 0: 6094.7. Samples: 129234484. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:29,636][25689] Avg episode reward: [(0, '-56.371')] [2022-07-09 06:23:30,265][26022] Updated weights on worker 0-0, policy_version 126210 (0.00093) [2022-07-09 06:23:32,112][26022] Updated weights on worker 0-0, policy_version 126220 (0.00084) [2022-07-09 06:23:33,714][26022] Updated weights on worker 0-0, policy_version 126230 (0.00089) [2022-07-09 06:23:34,690][25689] Fps is (10 sec: 5781.7, 60 sec: 5791.1, 300 sec: 5770.2). Total num frames: 129263616. Throughput: 0: 6059.8. Samples: 129269152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:34,692][25689] Avg episode reward: [(0, '-55.934')] [2022-07-09 06:23:35,471][26022] Updated weights on worker 0-0, policy_version 126240 (0.00371) [2022-07-09 06:23:37,496][26022] Updated weights on worker 0-0, policy_version 126250 (0.00100) [2022-07-09 06:23:39,134][26022] Updated weights on worker 0-0, policy_version 126260 (0.00090) [2022-07-09 06:23:39,694][25689] Fps is (10 sec: 5803.0, 60 sec: 5791.2, 300 sec: 5771.1). Total num frames: 129293312. Throughput: 0: 5199.1. Samples: 129286676. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:39,696][25689] Avg episode reward: [(0, '-56.983')] [2022-07-09 06:23:40,986][26022] Updated weights on worker 0-0, policy_version 126270 (0.00067) [2022-07-09 06:23:42,707][26022] Updated weights on worker 0-0, policy_version 126280 (0.00085) [2022-07-09 06:23:44,382][26022] Updated weights on worker 0-0, policy_version 126290 (0.00080) [2022-07-09 06:23:44,702][25689] Fps is (10 sec: 5932.4, 60 sec: 5808.7, 300 sec: 5778.6). Total num frames: 129323008. Throughput: 0: 6094.6. Samples: 129321692. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:44,703][25689] Avg episode reward: [(0, '-56.818')] [2022-07-09 06:23:46,147][26022] Updated weights on worker 0-0, policy_version 126300 (0.00083) [2022-07-09 06:23:48,038][26022] Updated weights on worker 0-0, policy_version 126310 (0.00082) [2022-07-09 06:23:49,629][26022] Updated weights on worker 0-0, policy_version 126320 (0.00079) [2022-07-09 06:23:49,722][25689] Fps is (10 sec: 5820.8, 60 sec: 5807.1, 300 sec: 5777.2). Total num frames: 129351680. Throughput: 0: 6082.1. Samples: 129356648. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:23:49,723][25689] Avg episode reward: [(0, '-56.693')] [2022-07-09 06:23:51,546][26022] Updated weights on worker 0-0, policy_version 126330 (0.00305) [2022-07-09 06:23:53,112][26022] Updated weights on worker 0-0, policy_version 126340 (0.00083) [2022-07-09 06:23:54,780][25689] Fps is (10 sec: 5689.9, 60 sec: 5779.2, 300 sec: 5772.8). Total num frames: 129380352. Throughput: 0: 5198.9. Samples: 129373596. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:23:54,782][25689] Avg episode reward: [(0, '-55.297')] [2022-07-09 06:23:55,006][26022] Updated weights on worker 0-0, policy_version 126350 (0.00089) [2022-07-09 06:23:56,838][26022] Updated weights on worker 0-0, policy_version 126360 (0.00084) [2022-07-09 06:23:58,382][26022] Updated weights on worker 0-0, policy_version 126370 (0.01141) [2022-07-09 06:23:59,841][25689] Fps is (10 sec: 5565.5, 60 sec: 5777.5, 300 sec: 5772.9). Total num frames: 129408000. Throughput: 0: 6071.3. Samples: 129408994. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:23:59,843][25689] Avg episode reward: [(0, '-56.313')] [2022-07-09 06:24:00,365][26022] Updated weights on worker 0-0, policy_version 126380 (0.00086) [2022-07-09 06:24:02,255][26022] Updated weights on worker 0-0, policy_version 126390 (0.00087) [2022-07-09 06:24:04,094][26022] Updated weights on worker 0-0, policy_version 126400 (0.00086) [2022-07-09 06:24:04,870][25689] Fps is (10 sec: 5683.4, 60 sec: 5811.7, 300 sec: 5772.8). Total num frames: 129437696. Throughput: 0: 5949.4. Samples: 129441680. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:04,870][25689] Avg episode reward: [(0, '-56.017')] [2022-07-09 06:24:06,004][26022] Updated weights on worker 0-0, policy_version 126410 (0.00081) [2022-07-09 06:24:07,613][26022] Updated weights on worker 0-0, policy_version 126420 (0.00084) [2022-07-09 06:24:09,429][26022] Updated weights on worker 0-0, policy_version 126430 (0.00081) [2022-07-09 06:24:09,906][25689] Fps is (10 sec: 5901.2, 60 sec: 5793.5, 300 sec: 5776.3). Total num frames: 129467392. Throughput: 0: 5067.3. Samples: 129458926. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:09,907][25689] Avg episode reward: [(0, '-55.880')] [2022-07-09 06:24:11,247][26022] Updated weights on worker 0-0, policy_version 126440 (0.00088) [2022-07-09 06:24:12,948][26022] Updated weights on worker 0-0, policy_version 126450 (0.00093) [2022-07-09 06:24:14,831][26022] Updated weights on worker 0-0, policy_version 126460 (0.00089) [2022-07-09 06:24:15,022][25689] Fps is (10 sec: 5748.9, 60 sec: 5769.2, 300 sec: 5777.7). Total num frames: 129496064. Throughput: 0: 5954.1. Samples: 129494122. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:15,023][25689] Avg episode reward: [(0, '-55.785')] [2022-07-09 06:24:16,468][26022] Updated weights on worker 0-0, policy_version 126470 (0.00085) [2022-07-09 06:24:18,334][26022] Updated weights on worker 0-0, policy_version 126480 (0.00086) [2022-07-09 06:24:19,862][26022] Updated weights on worker 0-0, policy_version 126490 (0.00090) [2022-07-09 06:24:20,062][25689] Fps is (10 sec: 5747.0, 60 sec: 5801.7, 300 sec: 5777.0). Total num frames: 129525760. Throughput: 0: 5942.4. Samples: 129529154. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:20,062][25689] Avg episode reward: [(0, '-55.701')] [2022-07-09 06:24:21,819][26022] Updated weights on worker 0-0, policy_version 126500 (0.00190) [2022-07-09 06:24:23,515][26022] Updated weights on worker 0-0, policy_version 126510 (0.00083) [2022-07-09 06:24:25,073][25689] Fps is (10 sec: 5807.7, 60 sec: 5773.8, 300 sec: 5770.0). Total num frames: 129554432. Throughput: 0: 6049.2. Samples: 129563892. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:25,074][25689] Avg episode reward: [(0, '-56.006')] [2022-07-09 06:24:25,393][26022] Updated weights on worker 0-0, policy_version 126520 (0.00077) [2022-07-09 06:24:26,970][26022] Updated weights on worker 0-0, policy_version 126530 (0.00087) [2022-07-09 06:24:28,678][26022] Updated weights on worker 0-0, policy_version 126540 (0.00087) [2022-07-09 06:24:30,088][25689] Fps is (10 sec: 5821.9, 60 sec: 5776.3, 300 sec: 5781.4). Total num frames: 129584128. Throughput: 0: 6071.5. Samples: 129581460. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:30,089][25689] Avg episode reward: [(0, '-55.506')] [2022-07-09 06:24:30,639][26022] Updated weights on worker 0-0, policy_version 126550 (0.00083) [2022-07-09 06:24:32,225][26022] Updated weights on worker 0-0, policy_version 126560 (0.00092) [2022-07-09 06:24:32,592][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:24:32,601][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000126561_129598464.pth [2022-07-09 06:24:32,601][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000124529_127517696.pth [2022-07-09 06:24:34,135][26022] Updated weights on worker 0-0, policy_version 126570 (0.00084) [2022-07-09 06:24:35,204][25689] Fps is (10 sec: 5760.8, 60 sec: 5770.3, 300 sec: 5765.7). Total num frames: 129612800. Throughput: 0: 6039.9. Samples: 129616020. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:35,205][25689] Avg episode reward: [(0, '-54.546')] [2022-07-09 06:24:35,739][26022] Updated weights on worker 0-0, policy_version 126580 (0.00086) [2022-07-09 06:24:37,732][26022] Updated weights on worker 0-0, policy_version 126590 (0.00092) [2022-07-09 06:24:39,466][26022] Updated weights on worker 0-0, policy_version 126600 (0.00084) [2022-07-09 06:24:40,267][25689] Fps is (10 sec: 5733.6, 60 sec: 5764.7, 300 sec: 5778.6). Total num frames: 129642496. Throughput: 0: 6033.6. Samples: 129651066. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:40,268][25689] Avg episode reward: [(0, '-54.899')] [2022-07-09 06:24:41,241][26022] Updated weights on worker 0-0, policy_version 126610 (0.00092) [2022-07-09 06:24:42,959][26022] Updated weights on worker 0-0, policy_version 126620 (0.00080) [2022-07-09 06:24:44,810][26022] Updated weights on worker 0-0, policy_version 126630 (0.00090) [2022-07-09 06:24:45,292][25689] Fps is (10 sec: 5887.9, 60 sec: 5763.1, 300 sec: 5774.9). Total num frames: 129672192. Throughput: 0: 5172.2. Samples: 129668470. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:45,292][25689] Avg episode reward: [(0, '-55.185')] [2022-07-09 06:24:46,544][26022] Updated weights on worker 0-0, policy_version 126640 (0.00081) [2022-07-09 06:24:48,042][26022] Updated weights on worker 0-0, policy_version 126650 (0.01205) [2022-07-09 06:24:50,047][26022] Updated weights on worker 0-0, policy_version 126660 (0.00082) [2022-07-09 06:24:50,321][25689] Fps is (10 sec: 5805.8, 60 sec: 5762.2, 300 sec: 5772.6). Total num frames: 129700864. Throughput: 0: 6031.4. Samples: 129703494. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:50,321][25689] Avg episode reward: [(0, '-55.318')] [2022-07-09 06:24:51,692][26022] Updated weights on worker 0-0, policy_version 126670 (0.00091) [2022-07-09 06:24:53,509][26022] Updated weights on worker 0-0, policy_version 126680 (0.00085) [2022-07-09 06:24:55,211][26022] Updated weights on worker 0-0, policy_version 126690 (0.00085) [2022-07-09 06:24:55,427][25689] Fps is (10 sec: 5860.0, 60 sec: 5791.4, 300 sec: 5775.3). Total num frames: 129731584. Throughput: 0: 6055.2. Samples: 129738470. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:24:55,427][25689] Avg episode reward: [(0, '-56.042')] [2022-07-09 06:24:57,195][26022] Updated weights on worker 0-0, policy_version 126700 (0.00103) [2022-07-09 06:24:58,796][26022] Updated weights on worker 0-0, policy_version 126710 (0.00086) [2022-07-09 06:25:00,446][25689] Fps is (10 sec: 5866.0, 60 sec: 5812.4, 300 sec: 5785.5). Total num frames: 129760256. Throughput: 0: 5187.7. Samples: 129755744. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:00,446][25689] Avg episode reward: [(0, '-56.162')] [2022-07-09 06:25:00,654][26022] Updated weights on worker 0-0, policy_version 126720 (0.00092) [2022-07-09 06:25:02,559][26022] Updated weights on worker 0-0, policy_version 126730 (0.00087) [2022-07-09 06:25:04,740][26022] Updated weights on worker 0-0, policy_version 126740 (0.00087) [2022-07-09 06:25:05,467][25689] Fps is (10 sec: 5507.7, 60 sec: 5762.4, 300 sec: 5775.1). Total num frames: 129786880. Throughput: 0: 5932.1. Samples: 129788150. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:05,467][25689] Avg episode reward: [(0, '-56.460')] [2022-07-09 06:25:06,131][26022] Updated weights on worker 0-0, policy_version 126750 (0.00085) [2022-07-09 06:25:08,122][26022] Updated weights on worker 0-0, policy_version 126760 (0.00082) [2022-07-09 06:25:09,681][26022] Updated weights on worker 0-0, policy_version 126770 (0.00086) [2022-07-09 06:25:10,490][25689] Fps is (10 sec: 5505.0, 60 sec: 5746.7, 300 sec: 5773.9). Total num frames: 129815552. Throughput: 0: 5937.5. Samples: 129823250. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:10,491][25689] Avg episode reward: [(0, '-55.617')] [2022-07-09 06:25:11,560][26022] Updated weights on worker 0-0, policy_version 126780 (0.00094) [2022-07-09 06:25:13,239][26022] Updated weights on worker 0-0, policy_version 126790 (0.00272) [2022-07-09 06:25:15,127][26022] Updated weights on worker 0-0, policy_version 126800 (0.00095) [2022-07-09 06:25:15,594][25689] Fps is (10 sec: 5864.6, 60 sec: 5781.7, 300 sec: 5782.5). Total num frames: 129846272. Throughput: 0: 5073.8. Samples: 129840794. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:15,594][25689] Avg episode reward: [(0, '-55.982')] [2022-07-09 06:25:16,878][26022] Updated weights on worker 0-0, policy_version 126810 (0.00087) [2022-07-09 06:25:18,406][26022] Updated weights on worker 0-0, policy_version 126820 (0.00092) [2022-07-09 06:25:20,320][26022] Updated weights on worker 0-0, policy_version 126830 (0.00086) [2022-07-09 06:25:20,667][25689] Fps is (10 sec: 5936.8, 60 sec: 5778.5, 300 sec: 5778.3). Total num frames: 129875968. Throughput: 0: 5940.9. Samples: 129875876. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:20,667][25689] Avg episode reward: [(0, '-55.631')] [2022-07-09 06:25:22,123][26022] Updated weights on worker 0-0, policy_version 126840 (0.00090) [2022-07-09 06:25:23,765][26022] Updated weights on worker 0-0, policy_version 126850 (0.00084) [2022-07-09 06:25:25,599][26022] Updated weights on worker 0-0, policy_version 126860 (0.00078) [2022-07-09 06:25:25,694][25689] Fps is (10 sec: 5779.0, 60 sec: 5777.0, 300 sec: 5777.8). Total num frames: 129904640. Throughput: 0: 6068.1. Samples: 129910892. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:25,695][25689] Avg episode reward: [(0, '-55.846')] [2022-07-09 06:25:27,271][26022] Updated weights on worker 0-0, policy_version 126870 (0.00085) [2022-07-09 06:25:29,101][26022] Updated weights on worker 0-0, policy_version 126880 (0.00085) [2022-07-09 06:25:30,754][25689] Fps is (10 sec: 5786.2, 60 sec: 5772.7, 300 sec: 5777.6). Total num frames: 129934336. Throughput: 0: 5194.9. Samples: 129928520. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:30,756][25689] Avg episode reward: [(0, '-56.507')] [2022-07-09 06:25:31,078][26022] Updated weights on worker 0-0, policy_version 126890 (0.00086) [2022-07-09 06:25:32,671][26022] Updated weights on worker 0-0, policy_version 126900 (0.00079) [2022-07-09 06:25:34,390][26022] Updated weights on worker 0-0, policy_version 126910 (0.00101) [2022-07-09 06:25:35,807][25689] Fps is (10 sec: 5872.6, 60 sec: 5795.6, 300 sec: 5780.7). Total num frames: 129964032. Throughput: 0: 6066.7. Samples: 129963424. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:35,809][25689] Avg episode reward: [(0, '-57.360')] [2022-07-09 06:25:36,129][26022] Updated weights on worker 0-0, policy_version 126920 (0.00087) [2022-07-09 06:25:37,803][26022] Updated weights on worker 0-0, policy_version 126930 (0.00085) [2022-07-09 06:25:39,716][26022] Updated weights on worker 0-0, policy_version 126940 (0.00087) [2022-07-09 06:25:40,812][25689] Fps is (10 sec: 5803.1, 60 sec: 5784.3, 300 sec: 5781.8). Total num frames: 129992704. Throughput: 0: 6090.2. Samples: 129998566. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:40,813][25689] Avg episode reward: [(0, '-57.852')] [2022-07-09 06:25:41,211][26022] Updated weights on worker 0-0, policy_version 126950 (0.00087) [2022-07-09 06:25:43,251][26022] Updated weights on worker 0-0, policy_version 126960 (0.00087) [2022-07-09 06:25:44,635][26022] Updated weights on worker 0-0, policy_version 126970 (0.00088) [2022-07-09 06:25:45,834][25689] Fps is (10 sec: 5617.3, 60 sec: 5750.7, 300 sec: 5781.9). Total num frames: 130020352. Throughput: 0: 5237.1. Samples: 130016366. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:45,834][25689] Avg episode reward: [(0, '-57.082')] [2022-07-09 06:25:46,653][26022] Updated weights on worker 0-0, policy_version 126980 (0.00082) [2022-07-09 06:25:48,418][26022] Updated weights on worker 0-0, policy_version 126990 (0.00624) [2022-07-09 06:25:50,047][26022] Updated weights on worker 0-0, policy_version 127000 (0.00081) [2022-07-09 06:25:50,863][25689] Fps is (10 sec: 5909.4, 60 sec: 5801.5, 300 sec: 5783.1). Total num frames: 130052096. Throughput: 0: 6119.0. Samples: 130051564. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 06:25:50,864][25689] Avg episode reward: [(0, '-57.319')] [2022-07-09 06:25:51,909][26022] Updated weights on worker 0-0, policy_version 127010 (0.00085) [2022-07-09 06:25:53,722][26022] Updated weights on worker 0-0, policy_version 127020 (0.00093) [2022-07-09 06:25:55,456][26022] Updated weights on worker 0-0, policy_version 127030 (0.00086) [2022-07-09 06:25:55,912][25689] Fps is (10 sec: 5994.8, 60 sec: 5773.1, 300 sec: 5783.9). Total num frames: 130080768. Throughput: 0: 6105.1. Samples: 130086162. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:25:55,912][25689] Avg episode reward: [(0, '-56.961')] [2022-07-09 06:25:57,106][26022] Updated weights on worker 0-0, policy_version 127040 (0.00085) [2022-07-09 06:25:58,936][26022] Updated weights on worker 0-0, policy_version 127050 (0.00082) [2022-07-09 06:26:00,752][26022] Updated weights on worker 0-0, policy_version 127060 (0.00097) [2022-07-09 06:26:00,925][25689] Fps is (10 sec: 5800.5, 60 sec: 5790.5, 300 sec: 5794.1). Total num frames: 130110464. Throughput: 0: 5220.7. Samples: 130103570. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:00,926][25689] Avg episode reward: [(0, '-57.110')] [2022-07-09 06:26:02,969][26022] Updated weights on worker 0-0, policy_version 127070 (0.00083) [2022-07-09 06:26:04,653][26022] Updated weights on worker 0-0, policy_version 127080 (0.00089) [2022-07-09 06:26:05,948][25689] Fps is (10 sec: 5611.9, 60 sec: 5790.4, 300 sec: 5790.8). Total num frames: 130137088. Throughput: 0: 5961.4. Samples: 130136274. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:05,948][25689] Avg episode reward: [(0, '-57.124')] [2022-07-09 06:26:06,588][26022] Updated weights on worker 0-0, policy_version 127090 (0.00088) [2022-07-09 06:26:08,188][26022] Updated weights on worker 0-0, policy_version 127100 (0.00091) [2022-07-09 06:26:09,995][26022] Updated weights on worker 0-0, policy_version 127110 (0.00086) [2022-07-09 06:26:10,977][25689] Fps is (10 sec: 5603.1, 60 sec: 5806.8, 300 sec: 5781.8). Total num frames: 130166784. Throughput: 0: 5949.1. Samples: 130171226. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:10,977][25689] Avg episode reward: [(0, '-57.191')] [2022-07-09 06:26:11,707][26022] Updated weights on worker 0-0, policy_version 127120 (0.00089) [2022-07-09 06:26:13,510][26022] Updated weights on worker 0-0, policy_version 127130 (0.00089) [2022-07-09 06:26:15,304][26022] Updated weights on worker 0-0, policy_version 127140 (0.00078) [2022-07-09 06:26:16,071][25689] Fps is (10 sec: 5765.8, 60 sec: 5773.8, 300 sec: 5784.7). Total num frames: 130195456. Throughput: 0: 5084.3. Samples: 130188656. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:16,071][25689] Avg episode reward: [(0, '-57.878')] [2022-07-09 06:26:16,963][26022] Updated weights on worker 0-0, policy_version 127150 (0.00085) [2022-07-09 06:26:18,759][26022] Updated weights on worker 0-0, policy_version 127160 (0.00093) [2022-07-09 06:26:20,724][26022] Updated weights on worker 0-0, policy_version 127170 (0.00089) [2022-07-09 06:26:21,076][25689] Fps is (10 sec: 5779.6, 60 sec: 5780.3, 300 sec: 5782.0). Total num frames: 130225152. Throughput: 0: 5960.2. Samples: 130223672. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:21,076][25689] Avg episode reward: [(0, '-57.736')] [2022-07-09 06:26:22,180][26022] Updated weights on worker 0-0, policy_version 127180 (0.00091) [2022-07-09 06:26:23,940][26022] Updated weights on worker 0-0, policy_version 127190 (0.00082) [2022-07-09 06:26:25,939][26022] Updated weights on worker 0-0, policy_version 127200 (0.00096) [2022-07-09 06:26:26,084][25689] Fps is (10 sec: 5726.7, 60 sec: 5765.2, 300 sec: 5779.5). Total num frames: 130252800. Throughput: 0: 6068.0. Samples: 130258464. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:26,085][25689] Avg episode reward: [(0, '-58.453')] [2022-07-09 06:26:27,619][26022] Updated weights on worker 0-0, policy_version 127210 (0.00084) [2022-07-09 06:26:29,492][26022] Updated weights on worker 0-0, policy_version 127220 (0.00080) [2022-07-09 06:26:31,101][25689] Fps is (10 sec: 5720.3, 60 sec: 5769.4, 300 sec: 5777.2). Total num frames: 130282496. Throughput: 0: 5202.8. Samples: 130275926. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:31,101][25689] Avg episode reward: [(0, '-57.691')] [2022-07-09 06:26:31,151][26022] Updated weights on worker 0-0, policy_version 127230 (0.00086) [2022-07-09 06:26:32,666][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:26:32,678][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000127239_130292736.pth [2022-07-09 06:26:32,679][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000125203_128207872.pth [2022-07-09 06:26:32,947][26022] Updated weights on worker 0-0, policy_version 127240 (0.00086) [2022-07-09 06:26:34,664][26022] Updated weights on worker 0-0, policy_version 127250 (0.00083) [2022-07-09 06:26:36,156][25689] Fps is (10 sec: 5896.9, 60 sec: 5769.2, 300 sec: 5780.6). Total num frames: 130312192. Throughput: 0: 6054.8. Samples: 130310268. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:36,158][25689] Avg episode reward: [(0, '-58.111')] [2022-07-09 06:26:36,450][26022] Updated weights on worker 0-0, policy_version 127260 (0.00089) [2022-07-09 06:26:38,392][26022] Updated weights on worker 0-0, policy_version 127270 (0.00088) [2022-07-09 06:26:39,883][26022] Updated weights on worker 0-0, policy_version 127280 (0.00089) [2022-07-09 06:26:41,203][25689] Fps is (10 sec: 5777.8, 60 sec: 5765.2, 300 sec: 5783.3). Total num frames: 130340864. Throughput: 0: 6024.4. Samples: 130344924. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:41,203][25689] Avg episode reward: [(0, '-58.515')] [2022-07-09 06:26:41,938][26022] Updated weights on worker 0-0, policy_version 127290 (0.00086) [2022-07-09 06:26:43,625][26022] Updated weights on worker 0-0, policy_version 127300 (0.00082) [2022-07-09 06:26:45,379][26022] Updated weights on worker 0-0, policy_version 127310 (0.00089) [2022-07-09 06:26:46,237][25689] Fps is (10 sec: 5688.1, 60 sec: 5780.8, 300 sec: 5777.0). Total num frames: 130369536. Throughput: 0: 5144.1. Samples: 130362134. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:46,238][25689] Avg episode reward: [(0, '-58.367')] [2022-07-09 06:26:47,294][26022] Updated weights on worker 0-0, policy_version 127320 (0.00084) [2022-07-09 06:26:48,903][26022] Updated weights on worker 0-0, policy_version 127330 (0.00085) [2022-07-09 06:26:50,753][26022] Updated weights on worker 0-0, policy_version 127340 (0.00092) [2022-07-09 06:26:51,339][25689] Fps is (10 sec: 5758.3, 60 sec: 5740.0, 300 sec: 5775.9). Total num frames: 130399232. Throughput: 0: 5972.9. Samples: 130396808. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:51,339][25689] Avg episode reward: [(0, '-58.538')] [2022-07-09 06:26:52,626][26022] Updated weights on worker 0-0, policy_version 127350 (0.00081) [2022-07-09 06:26:54,353][26022] Updated weights on worker 0-0, policy_version 127360 (0.00255) [2022-07-09 06:26:56,107][26022] Updated weights on worker 0-0, policy_version 127370 (0.00094) [2022-07-09 06:26:56,420][25689] Fps is (10 sec: 5832.9, 60 sec: 5754.0, 300 sec: 5781.6). Total num frames: 130428928. Throughput: 0: 5975.3. Samples: 130431350. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:26:56,420][25689] Avg episode reward: [(0, '-58.640')] [2022-07-09 06:26:57,808][26022] Updated weights on worker 0-0, policy_version 127380 (0.00080) [2022-07-09 06:26:59,613][26022] Updated weights on worker 0-0, policy_version 127390 (0.00088) [2022-07-09 06:27:01,399][26022] Updated weights on worker 0-0, policy_version 127400 (0.00087) [2022-07-09 06:27:01,425][25689] Fps is (10 sec: 5787.1, 60 sec: 5737.8, 300 sec: 5781.8). Total num frames: 130457600. Throughput: 0: 6003.8. Samples: 130466334. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:01,425][25689] Avg episode reward: [(0, '-58.541')] [2022-07-09 06:27:03,671][26022] Updated weights on worker 0-0, policy_version 127410 (0.00086) [2022-07-09 06:27:05,406][26022] Updated weights on worker 0-0, policy_version 127420 (0.00091) [2022-07-09 06:27:06,440][25689] Fps is (10 sec: 5416.3, 60 sec: 5721.6, 300 sec: 5771.6). Total num frames: 130483200. Throughput: 0: 5896.9. Samples: 130481264. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:06,440][25689] Avg episode reward: [(0, '-57.557')] [2022-07-09 06:27:07,129][26022] Updated weights on worker 0-0, policy_version 127430 (0.00082) [2022-07-09 06:27:08,857][26022] Updated weights on worker 0-0, policy_version 127440 (0.00084) [2022-07-09 06:27:10,477][26022] Updated weights on worker 0-0, policy_version 127450 (0.00082) [2022-07-09 06:27:11,468][25689] Fps is (10 sec: 5506.0, 60 sec: 5721.7, 300 sec: 5771.8). Total num frames: 130512896. Throughput: 0: 5948.2. Samples: 130516538. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:11,468][25689] Avg episode reward: [(0, '-56.951')] [2022-07-09 06:27:12,519][26022] Updated weights on worker 0-0, policy_version 127460 (0.00088) [2022-07-09 06:27:13,936][26022] Updated weights on worker 0-0, policy_version 127470 (0.00096) [2022-07-09 06:27:15,935][26022] Updated weights on worker 0-0, policy_version 127480 (0.00089) [2022-07-09 06:27:16,556][25689] Fps is (10 sec: 5769.9, 60 sec: 5722.3, 300 sec: 5766.8). Total num frames: 130541568. Throughput: 0: 5965.4. Samples: 130551470. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:16,559][25689] Avg episode reward: [(0, '-56.666')] [2022-07-09 06:27:17,499][26022] Updated weights on worker 0-0, policy_version 127490 (0.00082) [2022-07-09 06:27:19,582][26022] Updated weights on worker 0-0, policy_version 127500 (0.00080) [2022-07-09 06:27:21,134][26022] Updated weights on worker 0-0, policy_version 127510 (0.00086) [2022-07-09 06:27:21,577][25689] Fps is (10 sec: 5875.3, 60 sec: 5737.7, 300 sec: 5773.3). Total num frames: 130572288. Throughput: 0: 5084.8. Samples: 130568802. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:21,578][25689] Avg episode reward: [(0, '-56.375')] [2022-07-09 06:27:23,052][26022] Updated weights on worker 0-0, policy_version 127520 (0.00087) [2022-07-09 06:27:24,708][26022] Updated weights on worker 0-0, policy_version 127530 (0.00087) [2022-07-09 06:27:26,494][26022] Updated weights on worker 0-0, policy_version 127540 (0.00091) [2022-07-09 06:27:26,592][25689] Fps is (10 sec: 6019.6, 60 sec: 5770.9, 300 sec: 5773.3). Total num frames: 130601984. Throughput: 0: 6060.9. Samples: 130603406. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:26,594][25689] Avg episode reward: [(0, '-56.623')] [2022-07-09 06:27:28,507][26022] Updated weights on worker 0-0, policy_version 127550 (0.00093) [2022-07-09 06:27:30,193][26022] Updated weights on worker 0-0, policy_version 127560 (0.00091) [2022-07-09 06:27:31,604][25689] Fps is (10 sec: 5616.7, 60 sec: 5720.6, 300 sec: 5767.6). Total num frames: 130628608. Throughput: 0: 6031.1. Samples: 130637980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:31,606][25689] Avg episode reward: [(0, '-56.452')] [2022-07-09 06:27:31,891][26022] Updated weights on worker 0-0, policy_version 127570 (0.00085) [2022-07-09 06:27:33,650][26022] Updated weights on worker 0-0, policy_version 127580 (0.00086) [2022-07-09 06:27:35,245][26022] Updated weights on worker 0-0, policy_version 127590 (0.00613) [2022-07-09 06:27:36,653][25689] Fps is (10 sec: 5699.9, 60 sec: 5738.1, 300 sec: 5770.2). Total num frames: 130659328. Throughput: 0: 5176.1. Samples: 130655494. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:36,654][25689] Avg episode reward: [(0, '-56.201')] [2022-07-09 06:27:37,242][26022] Updated weights on worker 0-0, policy_version 127600 (0.00099) [2022-07-09 06:27:38,998][26022] Updated weights on worker 0-0, policy_version 127610 (0.00095) [2022-07-09 06:27:40,583][26022] Updated weights on worker 0-0, policy_version 127620 (0.00087) [2022-07-09 06:27:41,728][25689] Fps is (10 sec: 5967.3, 60 sec: 5752.3, 300 sec: 5772.4). Total num frames: 130689024. Throughput: 0: 6037.8. Samples: 130690474. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:41,729][25689] Avg episode reward: [(0, '-56.703')] [2022-07-09 06:27:42,208][26022] Updated weights on worker 0-0, policy_version 127630 (0.00094) [2022-07-09 06:27:44,048][26022] Updated weights on worker 0-0, policy_version 127640 (0.00092) [2022-07-09 06:27:45,963][26022] Updated weights on worker 0-0, policy_version 127650 (0.00087) [2022-07-09 06:27:46,740][25689] Fps is (10 sec: 5786.4, 60 sec: 5754.5, 300 sec: 5772.2). Total num frames: 130717696. Throughput: 0: 6047.7. Samples: 130725254. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:46,740][25689] Avg episode reward: [(0, '-56.795')] [2022-07-09 06:27:47,744][26022] Updated weights on worker 0-0, policy_version 127660 (0.00084) [2022-07-09 06:27:49,423][26022] Updated weights on worker 0-0, policy_version 127670 (0.00080) [2022-07-09 06:27:51,360][26022] Updated weights on worker 0-0, policy_version 127680 (0.00085) [2022-07-09 06:27:51,763][25689] Fps is (10 sec: 5918.5, 60 sec: 5778.9, 300 sec: 5774.1). Total num frames: 130748416. Throughput: 0: 5193.8. Samples: 130742684. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:51,765][25689] Avg episode reward: [(0, '-57.138')] [2022-07-09 06:27:52,850][26022] Updated weights on worker 0-0, policy_version 127690 (0.00106) [2022-07-09 06:27:54,828][26022] Updated weights on worker 0-0, policy_version 127700 (0.00094) [2022-07-09 06:27:56,389][26022] Updated weights on worker 0-0, policy_version 127710 (0.00085) [2022-07-09 06:27:56,897][25689] Fps is (10 sec: 5746.8, 60 sec: 5740.0, 300 sec: 5772.4). Total num frames: 130776064. Throughput: 0: 6035.5. Samples: 130777676. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-09 06:27:56,897][25689] Avg episode reward: [(0, '-57.046')] [2022-07-09 06:27:58,568][26022] Updated weights on worker 0-0, policy_version 127720 (0.00092) [2022-07-09 06:27:59,915][26022] Updated weights on worker 0-0, policy_version 127730 (0.00089) [2022-07-09 06:28:01,964][25689] Fps is (10 sec: 5420.9, 60 sec: 5717.2, 300 sec: 5771.7). Total num frames: 130803712. Throughput: 0: 6037.7. Samples: 130812650. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:01,964][25689] Avg episode reward: [(0, '-56.743')] [2022-07-09 06:28:02,157][26022] Updated weights on worker 0-0, policy_version 127740 (0.00091) [2022-07-09 06:28:03,794][26022] Updated weights on worker 0-0, policy_version 127750 (0.00083) [2022-07-09 06:28:05,722][26022] Updated weights on worker 0-0, policy_version 127760 (0.00537) [2022-07-09 06:28:07,032][25689] Fps is (10 sec: 5759.0, 60 sec: 5796.7, 300 sec: 5770.8). Total num frames: 130834432. Throughput: 0: 5076.0. Samples: 130828256. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:07,032][25689] Avg episode reward: [(0, '-57.215')] [2022-07-09 06:28:07,586][26022] Updated weights on worker 0-0, policy_version 127770 (0.00087) [2022-07-09 06:28:09,248][26022] Updated weights on worker 0-0, policy_version 127780 (0.00097) [2022-07-09 06:28:10,950][26022] Updated weights on worker 0-0, policy_version 127790 (0.00089) [2022-07-09 06:28:12,061][25689] Fps is (10 sec: 5881.8, 60 sec: 5779.6, 300 sec: 5767.6). Total num frames: 130863104. Throughput: 0: 5947.7. Samples: 130863412. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:12,062][25689] Avg episode reward: [(0, '-57.442')] [2022-07-09 06:28:12,629][26022] Updated weights on worker 0-0, policy_version 127800 (0.00086) [2022-07-09 06:28:14,317][26022] Updated weights on worker 0-0, policy_version 127810 (0.00082) [2022-07-09 06:28:16,295][26022] Updated weights on worker 0-0, policy_version 127820 (0.00087) [2022-07-09 06:28:17,099][25689] Fps is (10 sec: 5899.5, 60 sec: 5818.3, 300 sec: 5777.7). Total num frames: 130893824. Throughput: 0: 6003.5. Samples: 130898960. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:17,099][25689] Avg episode reward: [(0, '-57.566')] [2022-07-09 06:28:17,778][26022] Updated weights on worker 0-0, policy_version 127830 (0.00105) [2022-07-09 06:28:19,647][26022] Updated weights on worker 0-0, policy_version 127840 (0.00082) [2022-07-09 06:28:21,180][26022] Updated weights on worker 0-0, policy_version 127850 (0.00085) [2022-07-09 06:28:22,150][25689] Fps is (10 sec: 5887.2, 60 sec: 5781.6, 300 sec: 5771.2). Total num frames: 130922496. Throughput: 0: 5148.2. Samples: 130916572. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:22,150][25689] Avg episode reward: [(0, '-57.622')] [2022-07-09 06:28:23,178][26022] Updated weights on worker 0-0, policy_version 127860 (0.00086) [2022-07-09 06:28:24,847][26022] Updated weights on worker 0-0, policy_version 127870 (0.00088) [2022-07-09 06:28:26,809][26022] Updated weights on worker 0-0, policy_version 127880 (0.00093) [2022-07-09 06:28:27,158][25689] Fps is (10 sec: 5598.7, 60 sec: 5748.4, 300 sec: 5764.9). Total num frames: 130950144. Throughput: 0: 6113.8. Samples: 130951306. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:27,159][25689] Avg episode reward: [(0, '-57.340')] [2022-07-09 06:28:28,434][26022] Updated weights on worker 0-0, policy_version 127890 (0.00091) [2022-07-09 06:28:30,341][26022] Updated weights on worker 0-0, policy_version 127900 (0.00092) [2022-07-09 06:28:32,068][26022] Updated weights on worker 0-0, policy_version 127910 (0.00086) [2022-07-09 06:28:32,218][25689] Fps is (10 sec: 5695.7, 60 sec: 5794.6, 300 sec: 5768.3). Total num frames: 130979840. Throughput: 0: 6080.0. Samples: 130985962. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:32,218][25689] Avg episode reward: [(0, '-57.213')] [2022-07-09 06:28:32,700][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:28:32,722][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000127914_130983936.pth [2022-07-09 06:28:32,723][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000125881_128902144.pth [2022-07-09 06:28:33,795][26022] Updated weights on worker 0-0, policy_version 127920 (0.00081) [2022-07-09 06:28:35,630][26022] Updated weights on worker 0-0, policy_version 127930 (0.00096) [2022-07-09 06:28:37,308][25689] Fps is (10 sec: 5952.6, 60 sec: 5790.6, 300 sec: 5770.1). Total num frames: 131010560. Throughput: 0: 5166.8. Samples: 131003376. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:37,310][26022] Updated weights on worker 0-0, policy_version 127940 (0.00095) [2022-07-09 06:28:37,309][25689] Avg episode reward: [(0, '-57.191')] [2022-07-09 06:28:39,213][26022] Updated weights on worker 0-0, policy_version 127950 (0.00096) [2022-07-09 06:28:40,830][26022] Updated weights on worker 0-0, policy_version 127960 (0.00098) [2022-07-09 06:28:42,380][25689] Fps is (10 sec: 5844.2, 60 sec: 5774.1, 300 sec: 5765.4). Total num frames: 131039232. Throughput: 0: 6004.0. Samples: 131038036. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:42,381][25689] Avg episode reward: [(0, '-56.670')] [2022-07-09 06:28:42,650][26022] Updated weights on worker 0-0, policy_version 127970 (0.00092) [2022-07-09 06:28:44,339][26022] Updated weights on worker 0-0, policy_version 127980 (0.00090) [2022-07-09 06:28:46,123][26022] Updated weights on worker 0-0, policy_version 127990 (0.00085) [2022-07-09 06:28:47,406][25689] Fps is (10 sec: 5678.5, 60 sec: 5772.7, 300 sec: 5765.2). Total num frames: 131067904. Throughput: 0: 5991.6. Samples: 131072624. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:47,407][25689] Avg episode reward: [(0, '-56.884')] [2022-07-09 06:28:48,055][26022] Updated weights on worker 0-0, policy_version 128000 (0.00088) [2022-07-09 06:28:49,679][26022] Updated weights on worker 0-0, policy_version 128010 (0.00082) [2022-07-09 06:28:51,604][26022] Updated weights on worker 0-0, policy_version 128020 (0.00088) [2022-07-09 06:28:52,413][25689] Fps is (10 sec: 5817.9, 60 sec: 5757.4, 300 sec: 5769.7). Total num frames: 131097600. Throughput: 0: 5149.3. Samples: 131089952. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:52,413][25689] Avg episode reward: [(0, '-56.641')] [2022-07-09 06:28:53,354][26022] Updated weights on worker 0-0, policy_version 128030 (0.00081) [2022-07-09 06:28:55,106][26022] Updated weights on worker 0-0, policy_version 128040 (0.00082) [2022-07-09 06:28:57,034][26022] Updated weights on worker 0-0, policy_version 128050 (0.00081) [2022-07-09 06:28:57,457][25689] Fps is (10 sec: 5705.6, 60 sec: 5765.9, 300 sec: 5770.0). Total num frames: 131125248. Throughput: 0: 5986.8. Samples: 131124004. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:28:57,458][25689] Avg episode reward: [(0, '-57.224')] [2022-07-09 06:28:58,768][26022] Updated weights on worker 0-0, policy_version 128060 (0.00086) [2022-07-09 06:29:00,534][26022] Updated weights on worker 0-0, policy_version 128070 (0.00088) [2022-07-09 06:29:02,479][25689] Fps is (10 sec: 5391.6, 60 sec: 5753.3, 300 sec: 5759.8). Total num frames: 131151872. Throughput: 0: 5942.1. Samples: 131157464. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:02,479][25689] Avg episode reward: [(0, '-56.821')] [2022-07-09 06:29:02,783][26022] Updated weights on worker 0-0, policy_version 128080 (0.00093) [2022-07-09 06:29:04,569][26022] Updated weights on worker 0-0, policy_version 128090 (0.00082) [2022-07-09 06:29:06,164][26022] Updated weights on worker 0-0, policy_version 128100 (0.00084) [2022-07-09 06:29:07,495][25689] Fps is (10 sec: 5610.8, 60 sec: 5741.3, 300 sec: 5760.2). Total num frames: 131181568. Throughput: 0: 5051.8. Samples: 131174106. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:07,496][25689] Avg episode reward: [(0, '-56.945')] [2022-07-09 06:29:07,993][26022] Updated weights on worker 0-0, policy_version 128110 (0.00089) [2022-07-09 06:29:09,706][26022] Updated weights on worker 0-0, policy_version 128120 (0.00083) [2022-07-09 06:29:11,458][26022] Updated weights on worker 0-0, policy_version 128130 (0.00076) [2022-07-09 06:29:12,502][25689] Fps is (10 sec: 5823.5, 60 sec: 5743.5, 300 sec: 5762.3). Total num frames: 131210240. Throughput: 0: 5932.8. Samples: 131209134. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:12,503][25689] Avg episode reward: [(0, '-56.460')] [2022-07-09 06:29:13,345][26022] Updated weights on worker 0-0, policy_version 128140 (0.00088) [2022-07-09 06:29:14,895][26022] Updated weights on worker 0-0, policy_version 128150 (0.00088) [2022-07-09 06:29:16,824][26022] Updated weights on worker 0-0, policy_version 128160 (0.00090) [2022-07-09 06:29:17,547][25689] Fps is (10 sec: 5806.5, 60 sec: 5725.8, 300 sec: 5762.2). Total num frames: 131239936. Throughput: 0: 5965.0. Samples: 131243842. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:17,547][25689] Avg episode reward: [(0, '-56.165')] [2022-07-09 06:29:18,620][26022] Updated weights on worker 0-0, policy_version 128170 (0.00094) [2022-07-09 06:29:20,361][26022] Updated weights on worker 0-0, policy_version 128180 (0.00080) [2022-07-09 06:29:22,176][26022] Updated weights on worker 0-0, policy_version 128190 (0.00092) [2022-07-09 06:29:22,559][25689] Fps is (10 sec: 5905.5, 60 sec: 5746.4, 300 sec: 5765.6). Total num frames: 131269632. Throughput: 0: 5171.5. Samples: 131261308. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:22,560][25689] Avg episode reward: [(0, '-55.706')] [2022-07-09 06:29:23,892][26022] Updated weights on worker 0-0, policy_version 128200 (0.00083) [2022-07-09 06:29:25,716][26022] Updated weights on worker 0-0, policy_version 128210 (0.00089) [2022-07-09 06:29:27,516][26022] Updated weights on worker 0-0, policy_version 128220 (0.00086) [2022-07-09 06:29:27,580][25689] Fps is (10 sec: 5715.5, 60 sec: 5745.2, 300 sec: 5758.6). Total num frames: 131297280. Throughput: 0: 6070.0. Samples: 131296024. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:27,581][25689] Avg episode reward: [(0, '-56.355')] [2022-07-09 06:29:29,167][26022] Updated weights on worker 0-0, policy_version 128230 (0.00084) [2022-07-09 06:29:31,107][26022] Updated weights on worker 0-0, policy_version 128240 (0.00081) [2022-07-09 06:29:32,623][25689] Fps is (10 sec: 5697.9, 60 sec: 5746.8, 300 sec: 5763.5). Total num frames: 131326976. Throughput: 0: 6021.8. Samples: 131330300. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:32,624][25689] Avg episode reward: [(0, '-56.621')] [2022-07-09 06:29:32,701][26022] Updated weights on worker 0-0, policy_version 128250 (0.00082) [2022-07-09 06:29:34,767][26022] Updated weights on worker 0-0, policy_version 128260 (0.00083) [2022-07-09 06:29:36,412][26022] Updated weights on worker 0-0, policy_version 128270 (0.00084) [2022-07-09 06:29:37,747][25689] Fps is (10 sec: 5741.0, 60 sec: 5709.7, 300 sec: 5758.9). Total num frames: 131355648. Throughput: 0: 5996.9. Samples: 131364978. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:37,747][25689] Avg episode reward: [(0, '-55.989')] [2022-07-09 06:29:38,204][26022] Updated weights on worker 0-0, policy_version 128280 (0.00088) [2022-07-09 06:29:39,867][26022] Updated weights on worker 0-0, policy_version 128290 (0.00083) [2022-07-09 06:29:41,531][26022] Updated weights on worker 0-0, policy_version 128300 (0.00082) [2022-07-09 06:29:42,779][25689] Fps is (10 sec: 5747.1, 60 sec: 5730.5, 300 sec: 5758.7). Total num frames: 131385344. Throughput: 0: 5997.2. Samples: 131382572. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:42,779][25689] Avg episode reward: [(0, '-56.538')] [2022-07-09 06:29:43,552][26022] Updated weights on worker 0-0, policy_version 128310 (0.00084) [2022-07-09 06:29:45,237][26022] Updated weights on worker 0-0, policy_version 128320 (0.00051) [2022-07-09 06:29:46,887][26022] Updated weights on worker 0-0, policy_version 128330 (0.00099) [2022-07-09 06:29:47,826][25689] Fps is (10 sec: 5892.8, 60 sec: 5745.5, 300 sec: 5761.8). Total num frames: 131415040. Throughput: 0: 5999.0. Samples: 131417478. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:47,826][25689] Avg episode reward: [(0, '-56.634')] [2022-07-09 06:29:48,683][26022] Updated weights on worker 0-0, policy_version 128340 (0.00090) [2022-07-09 06:29:50,535][26022] Updated weights on worker 0-0, policy_version 128350 (0.00081) [2022-07-09 06:29:52,265][26022] Updated weights on worker 0-0, policy_version 128360 (0.00083) [2022-07-09 06:29:52,886][25689] Fps is (10 sec: 5774.9, 60 sec: 5723.4, 300 sec: 5755.8). Total num frames: 131443712. Throughput: 0: 6012.5. Samples: 131452134. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:52,887][25689] Avg episode reward: [(0, '-55.982')] [2022-07-09 06:29:54,084][26022] Updated weights on worker 0-0, policy_version 128370 (0.00082) [2022-07-09 06:29:55,899][26022] Updated weights on worker 0-0, policy_version 128380 (0.00083) [2022-07-09 06:29:57,578][26022] Updated weights on worker 0-0, policy_version 128390 (0.00088) [2022-07-09 06:29:57,948][25689] Fps is (10 sec: 5664.8, 60 sec: 5738.6, 300 sec: 5755.0). Total num frames: 131472384. Throughput: 0: 5162.1. Samples: 131469260. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 06:29:57,949][25689] Avg episode reward: [(0, '-55.299')] [2022-07-09 06:29:59,464][26022] Updated weights on worker 0-0, policy_version 128400 (0.00082) [2022-07-09 06:30:01,124][26022] Updated weights on worker 0-0, policy_version 128410 (0.00083) [2022-07-09 06:30:02,969][25689] Fps is (10 sec: 5484.2, 60 sec: 5738.7, 300 sec: 5755.0). Total num frames: 131499008. Throughput: 0: 6009.2. Samples: 131503898. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:02,969][25689] Avg episode reward: [(0, '-55.449')] [2022-07-09 06:30:03,255][26022] Updated weights on worker 0-0, policy_version 128420 (0.00098) [2022-07-09 06:30:05,097][26022] Updated weights on worker 0-0, policy_version 128430 (0.00086) [2022-07-09 06:30:06,959][26022] Updated weights on worker 0-0, policy_version 128440 (0.00090) [2022-07-09 06:30:08,009][25689] Fps is (10 sec: 5496.2, 60 sec: 5719.5, 300 sec: 5754.7). Total num frames: 131527680. Throughput: 0: 5885.4. Samples: 131536266. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:08,010][25689] Avg episode reward: [(0, '-55.770')] [2022-07-09 06:30:08,787][26022] Updated weights on worker 0-0, policy_version 128450 (0.00089) [2022-07-09 06:30:10,534][26022] Updated weights on worker 0-0, policy_version 128460 (0.00087) [2022-07-09 06:30:12,067][26022] Updated weights on worker 0-0, policy_version 128470 (0.00081) [2022-07-09 06:30:13,046][25689] Fps is (10 sec: 5792.4, 60 sec: 5733.6, 300 sec: 5752.5). Total num frames: 131557376. Throughput: 0: 5032.1. Samples: 131553578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:13,046][25689] Avg episode reward: [(0, '-54.882')] [2022-07-09 06:30:14,110][26022] Updated weights on worker 0-0, policy_version 128480 (0.00089) [2022-07-09 06:30:15,806][26022] Updated weights on worker 0-0, policy_version 128490 (0.00561) [2022-07-09 06:30:17,615][26022] Updated weights on worker 0-0, policy_version 128500 (0.00083) [2022-07-09 06:30:18,097][25689] Fps is (10 sec: 5988.7, 60 sec: 5749.9, 300 sec: 5756.4). Total num frames: 131588096. Throughput: 0: 5920.0. Samples: 131588542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:18,099][25689] Avg episode reward: [(0, '-55.242')] [2022-07-09 06:30:19,334][26022] Updated weights on worker 0-0, policy_version 128510 (0.00081) [2022-07-09 06:30:20,951][26022] Updated weights on worker 0-0, policy_version 128520 (0.00087) [2022-07-09 06:30:22,863][26022] Updated weights on worker 0-0, policy_version 128530 (0.00088) [2022-07-09 06:30:23,203][25689] Fps is (10 sec: 5746.5, 60 sec: 5707.3, 300 sec: 5751.4). Total num frames: 131615744. Throughput: 0: 5913.5. Samples: 131623550. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:23,203][25689] Avg episode reward: [(0, '-55.881')] [2022-07-09 06:30:24,553][26022] Updated weights on worker 0-0, policy_version 128540 (0.00094) [2022-07-09 06:30:26,362][26022] Updated weights on worker 0-0, policy_version 128550 (0.00078) [2022-07-09 06:30:28,221][25689] Fps is (10 sec: 5563.1, 60 sec: 5724.4, 300 sec: 5748.8). Total num frames: 131644416. Throughput: 0: 5192.7. Samples: 131641222. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:28,223][25689] Avg episode reward: [(0, '-55.902')] [2022-07-09 06:30:28,298][26022] Updated weights on worker 0-0, policy_version 128560 (0.00086) [2022-07-09 06:30:29,808][26022] Updated weights on worker 0-0, policy_version 128570 (0.00087) [2022-07-09 06:30:31,776][26022] Updated weights on worker 0-0, policy_version 128580 (0.00091) [2022-07-09 06:30:32,724][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:30:32,749][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000128586_131672064.pth [2022-07-09 06:30:32,750][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000126561_129598464.pth [2022-07-09 06:30:33,239][25689] Fps is (10 sec: 5917.9, 60 sec: 5743.7, 300 sec: 5752.9). Total num frames: 131675136. Throughput: 0: 6049.7. Samples: 131675740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:33,240][25689] Avg episode reward: [(0, '-55.950')] [2022-07-09 06:30:33,371][26022] Updated weights on worker 0-0, policy_version 128590 (0.00086) [2022-07-09 06:30:35,325][26022] Updated weights on worker 0-0, policy_version 128600 (0.00087) [2022-07-09 06:30:37,069][26022] Updated weights on worker 0-0, policy_version 128610 (0.00080) [2022-07-09 06:30:38,376][25689] Fps is (10 sec: 5949.4, 60 sec: 5759.3, 300 sec: 5753.8). Total num frames: 131704832. Throughput: 0: 6011.6. Samples: 131710450. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:38,377][25689] Avg episode reward: [(0, '-56.038')] [2022-07-09 06:30:38,836][26022] Updated weights on worker 0-0, policy_version 128620 (0.00089) [2022-07-09 06:30:40,412][26022] Updated weights on worker 0-0, policy_version 128630 (0.00084) [2022-07-09 06:30:42,336][26022] Updated weights on worker 0-0, policy_version 128640 (0.00089) [2022-07-09 06:30:43,383][25689] Fps is (10 sec: 5653.0, 60 sec: 5728.0, 300 sec: 5754.1). Total num frames: 131732480. Throughput: 0: 5165.8. Samples: 131727796. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:43,383][25689] Avg episode reward: [(0, '-56.501')] [2022-07-09 06:30:44,011][26022] Updated weights on worker 0-0, policy_version 128650 (0.00085) [2022-07-09 06:30:45,969][26022] Updated weights on worker 0-0, policy_version 128660 (0.00082) [2022-07-09 06:30:47,476][26022] Updated weights on worker 0-0, policy_version 128670 (0.00082) [2022-07-09 06:30:48,460][25689] Fps is (10 sec: 5686.5, 60 sec: 5725.0, 300 sec: 5746.3). Total num frames: 131762176. Throughput: 0: 5994.3. Samples: 131762542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:48,461][25689] Avg episode reward: [(0, '-56.743')] [2022-07-09 06:30:49,437][26022] Updated weights on worker 0-0, policy_version 128680 (0.00087) [2022-07-09 06:30:51,032][26022] Updated weights on worker 0-0, policy_version 128690 (0.00085) [2022-07-09 06:30:52,955][26022] Updated weights on worker 0-0, policy_version 128700 (0.00089) [2022-07-09 06:30:53,546][25689] Fps is (10 sec: 5844.0, 60 sec: 5739.6, 300 sec: 5749.0). Total num frames: 131791872. Throughput: 0: 5985.1. Samples: 131797280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:53,546][25689] Avg episode reward: [(0, '-56.275')] [2022-07-09 06:30:54,495][26022] Updated weights on worker 0-0, policy_version 128710 (0.00093) [2022-07-09 06:30:56,522][26022] Updated weights on worker 0-0, policy_version 128720 (0.00085) [2022-07-09 06:30:58,105][26022] Updated weights on worker 0-0, policy_version 128730 (0.00088) [2022-07-09 06:30:58,627][25689] Fps is (10 sec: 5841.7, 60 sec: 5754.6, 300 sec: 5747.7). Total num frames: 131821568. Throughput: 0: 5152.5. Samples: 131814794. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:30:58,628][25689] Avg episode reward: [(0, '-55.945')] [2022-07-09 06:31:00,175][26022] Updated weights on worker 0-0, policy_version 128740 (0.00089) [2022-07-09 06:31:01,784][26022] Updated weights on worker 0-0, policy_version 128750 (0.00094) [2022-07-09 06:31:03,629][25689] Fps is (10 sec: 5483.9, 60 sec: 5739.5, 300 sec: 5744.7). Total num frames: 131847168. Throughput: 0: 5986.9. Samples: 131849008. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:03,629][25689] Avg episode reward: [(0, '-56.033')] [2022-07-09 06:31:03,961][26022] Updated weights on worker 0-0, policy_version 128760 (0.00093) [2022-07-09 06:31:05,769][26022] Updated weights on worker 0-0, policy_version 128770 (0.00080) [2022-07-09 06:31:07,526][26022] Updated weights on worker 0-0, policy_version 128780 (0.00095) [2022-07-09 06:31:08,645][25689] Fps is (10 sec: 5519.7, 60 sec: 5758.7, 300 sec: 5744.9). Total num frames: 131876864. Throughput: 0: 5911.0. Samples: 131881854. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:08,646][25689] Avg episode reward: [(0, '-56.741')] [2022-07-09 06:31:09,323][26022] Updated weights on worker 0-0, policy_version 128790 (0.00091) [2022-07-09 06:31:10,933][26022] Updated weights on worker 0-0, policy_version 128800 (0.00087) [2022-07-09 06:31:12,856][26022] Updated weights on worker 0-0, policy_version 128810 (0.00090) [2022-07-09 06:31:13,659][25689] Fps is (10 sec: 5921.6, 60 sec: 5760.9, 300 sec: 5749.9). Total num frames: 131906560. Throughput: 0: 5061.8. Samples: 131899088. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:13,659][25689] Avg episode reward: [(0, '-57.078')] [2022-07-09 06:31:14,479][26022] Updated weights on worker 0-0, policy_version 128820 (0.00093) [2022-07-09 06:31:16,483][26022] Updated weights on worker 0-0, policy_version 128830 (0.00084) [2022-07-09 06:31:17,994][26022] Updated weights on worker 0-0, policy_version 128840 (0.00086) [2022-07-09 06:31:18,791][25689] Fps is (10 sec: 5651.9, 60 sec: 5702.6, 300 sec: 5740.6). Total num frames: 131934208. Throughput: 0: 5898.5. Samples: 131933730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:18,792][25689] Avg episode reward: [(0, '-56.677')] [2022-07-09 06:31:20,037][26022] Updated weights on worker 0-0, policy_version 128850 (0.00555) [2022-07-09 06:31:21,627][26022] Updated weights on worker 0-0, policy_version 128860 (0.00082) [2022-07-09 06:31:23,541][26022] Updated weights on worker 0-0, policy_version 128870 (0.00085) [2022-07-09 06:31:23,801][25689] Fps is (10 sec: 5653.8, 60 sec: 5745.4, 300 sec: 5747.4). Total num frames: 131963904. Throughput: 0: 5907.4. Samples: 131968174. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:23,802][25689] Avg episode reward: [(0, '-56.832')] [2022-07-09 06:31:25,108][26022] Updated weights on worker 0-0, policy_version 128880 (0.00078) [2022-07-09 06:31:27,104][26022] Updated weights on worker 0-0, policy_version 128890 (0.00087) [2022-07-09 06:31:28,776][26022] Updated weights on worker 0-0, policy_version 128900 (0.00092) [2022-07-09 06:31:28,824][25689] Fps is (10 sec: 5919.6, 60 sec: 5761.8, 300 sec: 5747.3). Total num frames: 131993600. Throughput: 0: 5973.9. Samples: 132002402. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:28,825][25689] Avg episode reward: [(0, '-57.585')] [2022-07-09 06:31:30,675][26022] Updated weights on worker 0-0, policy_version 128910 (0.00096) [2022-07-09 06:31:32,671][26022] Updated weights on worker 0-0, policy_version 128920 (0.00090) [2022-07-09 06:31:33,853][25689] Fps is (10 sec: 5705.0, 60 sec: 5710.1, 300 sec: 5740.9). Total num frames: 132021248. Throughput: 0: 5965.6. Samples: 132019558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:33,853][25689] Avg episode reward: [(0, '-57.488')] [2022-07-09 06:31:34,264][26022] Updated weights on worker 0-0, policy_version 128930 (0.00091) [2022-07-09 06:31:36,020][26022] Updated weights on worker 0-0, policy_version 128940 (0.00088) [2022-07-09 06:31:37,685][26022] Updated weights on worker 0-0, policy_version 128950 (0.00085) [2022-07-09 06:31:38,946][25689] Fps is (10 sec: 5665.6, 60 sec: 5714.3, 300 sec: 5743.5). Total num frames: 132050944. Throughput: 0: 5979.5. Samples: 132054244. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:38,946][25689] Avg episode reward: [(0, '-57.343')] [2022-07-09 06:31:39,598][26022] Updated weights on worker 0-0, policy_version 128960 (0.00089) [2022-07-09 06:31:41,365][26022] Updated weights on worker 0-0, policy_version 128970 (0.00085) [2022-07-09 06:31:43,106][26022] Updated weights on worker 0-0, policy_version 128980 (0.00089) [2022-07-09 06:31:43,947][25689] Fps is (10 sec: 5782.2, 60 sec: 5731.7, 300 sec: 5744.1). Total num frames: 132079616. Throughput: 0: 6016.9. Samples: 132089390. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:43,948][25689] Avg episode reward: [(0, '-57.530')] [2022-07-09 06:31:44,759][26022] Updated weights on worker 0-0, policy_version 128990 (0.00088) [2022-07-09 06:31:46,538][26022] Updated weights on worker 0-0, policy_version 129000 (0.00085) [2022-07-09 06:31:48,277][26022] Updated weights on worker 0-0, policy_version 129010 (0.00098) [2022-07-09 06:31:48,971][25689] Fps is (10 sec: 5822.4, 60 sec: 5736.8, 300 sec: 5745.6). Total num frames: 132109312. Throughput: 0: 5180.2. Samples: 132106764. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:48,971][25689] Avg episode reward: [(0, '-58.502')] [2022-07-09 06:31:50,102][26022] Updated weights on worker 0-0, policy_version 129020 (0.00089) [2022-07-09 06:31:51,875][26022] Updated weights on worker 0-0, policy_version 129030 (0.00087) [2022-07-09 06:31:53,671][26022] Updated weights on worker 0-0, policy_version 129040 (0.00087) [2022-07-09 06:31:54,043][25689] Fps is (10 sec: 5781.4, 60 sec: 5721.1, 300 sec: 5742.3). Total num frames: 132137984. Throughput: 0: 6047.7. Samples: 132141662. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:54,043][25689] Avg episode reward: [(0, '-58.847')] [2022-07-09 06:31:55,405][26022] Updated weights on worker 0-0, policy_version 129050 (0.00085) [2022-07-09 06:31:57,296][26022] Updated weights on worker 0-0, policy_version 129060 (0.00091) [2022-07-09 06:31:58,819][26022] Updated weights on worker 0-0, policy_version 129070 (0.00092) [2022-07-09 06:31:59,129][25689] Fps is (10 sec: 5846.4, 60 sec: 5737.6, 300 sec: 5747.7). Total num frames: 132168704. Throughput: 0: 6060.6. Samples: 132176568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:31:59,131][25689] Avg episode reward: [(0, '-58.894')] [2022-07-09 06:32:00,786][26022] Updated weights on worker 0-0, policy_version 129080 (0.00084) [2022-07-09 06:32:02,777][26022] Updated weights on worker 0-0, policy_version 129090 (0.00088) [2022-07-09 06:32:04,170][25689] Fps is (10 sec: 5561.4, 60 sec: 5733.9, 300 sec: 5747.2). Total num frames: 132194304. Throughput: 0: 5098.4. Samples: 132192494. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 06:32:04,170][25689] Avg episode reward: [(0, '-59.330')] [2022-07-09 06:32:04,785][26022] Updated weights on worker 0-0, policy_version 129100 (0.00086) [2022-07-09 06:32:06,356][26022] Updated weights on worker 0-0, policy_version 129110 (0.00084) [2022-07-09 06:32:08,159][26022] Updated weights on worker 0-0, policy_version 129120 (0.00084) [2022-07-09 06:32:09,265][25689] Fps is (10 sec: 5455.4, 60 sec: 5726.4, 300 sec: 5745.9). Total num frames: 132224000. Throughput: 0: 5903.7. Samples: 132226578. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:09,266][25689] Avg episode reward: [(0, '-58.972')] [2022-07-09 06:32:09,994][26022] Updated weights on worker 0-0, policy_version 129130 (0.00084) [2022-07-09 06:32:11,693][26022] Updated weights on worker 0-0, policy_version 129140 (0.00084) [2022-07-09 06:32:13,483][26022] Updated weights on worker 0-0, policy_version 129150 (0.00084) [2022-07-09 06:32:14,312][25689] Fps is (10 sec: 5957.2, 60 sec: 5740.2, 300 sec: 5753.5). Total num frames: 132254720. Throughput: 0: 5911.2. Samples: 132261474. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:14,312][25689] Avg episode reward: [(0, '-58.534')] [2022-07-09 06:32:15,311][26022] Updated weights on worker 0-0, policy_version 129160 (0.00085) [2022-07-09 06:32:16,903][26022] Updated weights on worker 0-0, policy_version 129170 (0.00087) [2022-07-09 06:32:18,739][26022] Updated weights on worker 0-0, policy_version 129180 (0.00095) [2022-07-09 06:32:19,366][25689] Fps is (10 sec: 5880.3, 60 sec: 5764.6, 300 sec: 5746.0). Total num frames: 132283392. Throughput: 0: 5068.0. Samples: 132279126. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:19,366][25689] Avg episode reward: [(0, '-59.143')] [2022-07-09 06:32:20,387][26022] Updated weights on worker 0-0, policy_version 129190 (0.00085) [2022-07-09 06:32:22,257][26022] Updated weights on worker 0-0, policy_version 129200 (0.00086) [2022-07-09 06:32:24,167][26022] Updated weights on worker 0-0, policy_version 129210 (0.00089) [2022-07-09 06:32:24,399][25689] Fps is (10 sec: 5684.5, 60 sec: 5745.4, 300 sec: 5742.2). Total num frames: 132312064. Throughput: 0: 6024.2. Samples: 132314360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:24,400][25689] Avg episode reward: [(0, '-59.036')] [2022-07-09 06:32:25,541][26022] Updated weights on worker 0-0, policy_version 129220 (0.00081) [2022-07-09 06:32:27,635][26022] Updated weights on worker 0-0, policy_version 129230 (0.00088) [2022-07-09 06:32:29,129][26022] Updated weights on worker 0-0, policy_version 129240 (0.00086) [2022-07-09 06:32:29,404][25689] Fps is (10 sec: 5916.5, 60 sec: 5764.1, 300 sec: 5756.1). Total num frames: 132342784. Throughput: 0: 6099.8. Samples: 132349420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:29,404][25689] Avg episode reward: [(0, '-59.166')] [2022-07-09 06:32:31,195][26022] Updated weights on worker 0-0, policy_version 129250 (0.00080) [2022-07-09 06:32:32,736][26022] Updated weights on worker 0-0, policy_version 129260 (0.00082) [2022-07-09 06:32:32,896][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:32:32,910][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000129261_132363264.pth [2022-07-09 06:32:32,911][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000127239_130292736.pth [2022-07-09 06:32:34,426][25689] Fps is (10 sec: 5821.5, 60 sec: 5764.7, 300 sec: 5746.3). Total num frames: 132370432. Throughput: 0: 5233.9. Samples: 132366750. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:34,426][25689] Avg episode reward: [(0, '-57.899')] [2022-07-09 06:32:34,674][26022] Updated weights on worker 0-0, policy_version 129270 (0.00095) [2022-07-09 06:32:36,309][26022] Updated weights on worker 0-0, policy_version 129280 (0.00089) [2022-07-09 06:32:38,166][26022] Updated weights on worker 0-0, policy_version 129290 (0.00080) [2022-07-09 06:32:39,489][25689] Fps is (10 sec: 5685.9, 60 sec: 5767.5, 300 sec: 5746.6). Total num frames: 132400128. Throughput: 0: 6080.3. Samples: 132401484. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:39,490][25689] Avg episode reward: [(0, '-58.367')] [2022-07-09 06:32:39,930][26022] Updated weights on worker 0-0, policy_version 129300 (0.00097) [2022-07-09 06:32:41,677][26022] Updated weights on worker 0-0, policy_version 129310 (0.00089) [2022-07-09 06:32:43,208][26022] Updated weights on worker 0-0, policy_version 129320 (0.00089) [2022-07-09 06:32:44,502][25689] Fps is (10 sec: 5894.5, 60 sec: 5783.3, 300 sec: 5750.0). Total num frames: 132429824. Throughput: 0: 6079.7. Samples: 132436576. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:44,502][25689] Avg episode reward: [(0, '-58.409')] [2022-07-09 06:32:45,129][26022] Updated weights on worker 0-0, policy_version 129330 (0.00080) [2022-07-09 06:32:46,976][26022] Updated weights on worker 0-0, policy_version 129340 (0.00102) [2022-07-09 06:32:48,506][26022] Updated weights on worker 0-0, policy_version 129350 (0.00079) [2022-07-09 06:32:49,521][25689] Fps is (10 sec: 5920.3, 60 sec: 5783.7, 300 sec: 5746.6). Total num frames: 132459520. Throughput: 0: 5203.3. Samples: 132454096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:49,522][25689] Avg episode reward: [(0, '-57.593')] [2022-07-09 06:32:50,466][26022] Updated weights on worker 0-0, policy_version 129360 (0.00080) [2022-07-09 06:32:51,985][26022] Updated weights on worker 0-0, policy_version 129370 (0.00083) [2022-07-09 06:32:54,044][26022] Updated weights on worker 0-0, policy_version 129380 (0.00092) [2022-07-09 06:32:54,538][25689] Fps is (10 sec: 5815.5, 60 sec: 5789.0, 300 sec: 5752.3). Total num frames: 132488192. Throughput: 0: 6091.3. Samples: 132489262. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:54,539][25689] Avg episode reward: [(0, '-57.694')] [2022-07-09 06:32:55,759][26022] Updated weights on worker 0-0, policy_version 129390 (0.00079) [2022-07-09 06:32:57,315][26022] Updated weights on worker 0-0, policy_version 129400 (0.00080) [2022-07-09 06:32:58,962][26022] Updated weights on worker 0-0, policy_version 129410 (0.00084) [2022-07-09 06:32:59,581][25689] Fps is (10 sec: 5904.1, 60 sec: 5793.2, 300 sec: 5763.1). Total num frames: 132518912. Throughput: 0: 6133.8. Samples: 132524720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:32:59,581][25689] Avg episode reward: [(0, '-58.221')] [2022-07-09 06:33:00,899][26022] Updated weights on worker 0-0, policy_version 129420 (0.00082) [2022-07-09 06:33:02,934][26022] Updated weights on worker 0-0, policy_version 129430 (0.00088) [2022-07-09 06:33:04,587][25689] Fps is (10 sec: 5604.6, 60 sec: 5796.5, 300 sec: 5747.1). Total num frames: 132544512. Throughput: 0: 5148.2. Samples: 132539980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:04,589][25689] Avg episode reward: [(0, '-58.474')] [2022-07-09 06:33:04,843][26022] Updated weights on worker 0-0, policy_version 129440 (0.00093) [2022-07-09 06:33:06,394][26022] Updated weights on worker 0-0, policy_version 129450 (0.00086) [2022-07-09 06:33:08,240][26022] Updated weights on worker 0-0, policy_version 129460 (0.00090) [2022-07-09 06:33:09,603][25689] Fps is (10 sec: 5517.0, 60 sec: 5804.1, 300 sec: 5750.8). Total num frames: 132574208. Throughput: 0: 6014.1. Samples: 132574872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:09,604][25689] Avg episode reward: [(0, '-57.803')] [2022-07-09 06:33:10,028][26022] Updated weights on worker 0-0, policy_version 129470 (0.00557) [2022-07-09 06:33:11,676][26022] Updated weights on worker 0-0, policy_version 129480 (0.00086) [2022-07-09 06:33:13,593][26022] Updated weights on worker 0-0, policy_version 129490 (0.00081) [2022-07-09 06:33:14,615][25689] Fps is (10 sec: 5922.9, 60 sec: 5790.5, 300 sec: 5747.9). Total num frames: 132603904. Throughput: 0: 6019.6. Samples: 132610114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:14,617][25689] Avg episode reward: [(0, '-57.439')] [2022-07-09 06:33:15,171][26022] Updated weights on worker 0-0, policy_version 129500 (0.00087) [2022-07-09 06:33:17,054][26022] Updated weights on worker 0-0, policy_version 129510 (0.00084) [2022-07-09 06:33:18,984][26022] Updated weights on worker 0-0, policy_version 129520 (0.00085) [2022-07-09 06:33:19,761][25689] Fps is (10 sec: 5746.3, 60 sec: 5781.6, 300 sec: 5746.0). Total num frames: 132632576. Throughput: 0: 5087.4. Samples: 132627384. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:19,761][25689] Avg episode reward: [(0, '-57.207')] [2022-07-09 06:33:20,599][26022] Updated weights on worker 0-0, policy_version 129530 (0.00090) [2022-07-09 06:33:22,461][26022] Updated weights on worker 0-0, policy_version 129540 (0.00085) [2022-07-09 06:33:24,071][26022] Updated weights on worker 0-0, policy_version 129550 (0.00085) [2022-07-09 06:33:24,799][25689] Fps is (10 sec: 5731.0, 60 sec: 5798.2, 300 sec: 5752.3). Total num frames: 132662272. Throughput: 0: 6060.6. Samples: 132662478. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:24,800][25689] Avg episode reward: [(0, '-57.687')] [2022-07-09 06:33:25,760][26022] Updated weights on worker 0-0, policy_version 129560 (0.00089) [2022-07-09 06:33:27,827][26022] Updated weights on worker 0-0, policy_version 129570 (0.00089) [2022-07-09 06:33:29,472][26022] Updated weights on worker 0-0, policy_version 129580 (0.00083) [2022-07-09 06:33:29,840][25689] Fps is (10 sec: 5790.9, 60 sec: 5760.8, 300 sec: 5749.2). Total num frames: 132690944. Throughput: 0: 6035.7. Samples: 132697014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:29,841][25689] Avg episode reward: [(0, '-57.256')] [2022-07-09 06:33:31,144][26022] Updated weights on worker 0-0, policy_version 129590 (0.00080) [2022-07-09 06:33:33,256][26022] Updated weights on worker 0-0, policy_version 129600 (0.00096) [2022-07-09 06:33:34,735][26022] Updated weights on worker 0-0, policy_version 129610 (0.00095) [2022-07-09 06:33:34,855][25689] Fps is (10 sec: 5906.3, 60 sec: 5812.3, 300 sec: 5750.7). Total num frames: 132721664. Throughput: 0: 5152.3. Samples: 132714398. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:34,856][25689] Avg episode reward: [(0, '-57.244')] [2022-07-09 06:33:36,567][26022] Updated weights on worker 0-0, policy_version 129620 (0.00087) [2022-07-09 06:33:38,367][26022] Updated weights on worker 0-0, policy_version 129630 (0.00092) [2022-07-09 06:33:40,008][25689] Fps is (10 sec: 5841.0, 60 sec: 5786.8, 300 sec: 5749.1). Total num frames: 132750336. Throughput: 0: 6013.0. Samples: 132749132. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:40,009][25689] Avg episode reward: [(0, '-57.095')] [2022-07-09 06:33:40,037][26022] Updated weights on worker 0-0, policy_version 129640 (0.00086) [2022-07-09 06:33:41,980][26022] Updated weights on worker 0-0, policy_version 129650 (0.00612) [2022-07-09 06:33:43,412][26022] Updated weights on worker 0-0, policy_version 129660 (0.00094) [2022-07-09 06:33:45,069][25689] Fps is (10 sec: 5714.7, 60 sec: 5782.2, 300 sec: 5751.9). Total num frames: 132780032. Throughput: 0: 6004.9. Samples: 132784194. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:45,069][25689] Avg episode reward: [(0, '-56.338')] [2022-07-09 06:33:45,563][26022] Updated weights on worker 0-0, policy_version 129670 (0.00095) [2022-07-09 06:33:46,970][26022] Updated weights on worker 0-0, policy_version 129680 (0.00083) [2022-07-09 06:33:48,873][26022] Updated weights on worker 0-0, policy_version 129690 (0.00085) [2022-07-09 06:33:50,072][25689] Fps is (10 sec: 5901.4, 60 sec: 5783.7, 300 sec: 5752.0). Total num frames: 132809728. Throughput: 0: 5170.9. Samples: 132801632. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:50,073][25689] Avg episode reward: [(0, '-55.684')] [2022-07-09 06:33:50,640][26022] Updated weights on worker 0-0, policy_version 129700 (0.00094) [2022-07-09 06:33:52,372][26022] Updated weights on worker 0-0, policy_version 129710 (0.00085) [2022-07-09 06:33:54,316][26022] Updated weights on worker 0-0, policy_version 129720 (0.00084) [2022-07-09 06:33:55,173][25689] Fps is (10 sec: 5776.8, 60 sec: 5775.7, 300 sec: 5754.3). Total num frames: 132838400. Throughput: 0: 6001.2. Samples: 132836328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:33:55,174][25689] Avg episode reward: [(0, '-55.546')] [2022-07-09 06:33:55,972][26022] Updated weights on worker 0-0, policy_version 129730 (0.00087) [2022-07-09 06:33:57,756][26022] Updated weights on worker 0-0, policy_version 129740 (0.00087) [2022-07-09 06:33:59,600][26022] Updated weights on worker 0-0, policy_version 129750 (0.00086) [2022-07-09 06:34:00,262][25689] Fps is (10 sec: 5627.5, 60 sec: 5737.5, 300 sec: 5759.9). Total num frames: 132867072. Throughput: 0: 6023.1. Samples: 132871124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:34:00,265][25689] Avg episode reward: [(0, '-55.686')] [2022-07-09 06:34:01,196][26022] Updated weights on worker 0-0, policy_version 129760 (0.00201) [2022-07-09 06:34:03,729][26022] Updated weights on worker 0-0, policy_version 129770 (0.00091) [2022-07-09 06:34:05,137][26022] Updated weights on worker 0-0, policy_version 129780 (0.00087) [2022-07-09 06:34:05,365][25689] Fps is (10 sec: 5625.8, 60 sec: 5778.9, 300 sec: 5754.8). Total num frames: 132895744. Throughput: 0: 5874.0. Samples: 132903418. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 06:34:05,366][25689] Avg episode reward: [(0, '-56.571')] [2022-07-09 06:34:07,152][26022] Updated weights on worker 0-0, policy_version 129790 (0.00096) [2022-07-09 06:34:08,664][26022] Updated weights on worker 0-0, policy_version 129800 (0.00502) [2022-07-09 06:34:10,443][25689] Fps is (10 sec: 5632.7, 60 sec: 5756.3, 300 sec: 5753.4). Total num frames: 132924416. Throughput: 0: 5842.7. Samples: 132920650. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:10,443][25689] Avg episode reward: [(0, '-57.082')] [2022-07-09 06:34:10,700][26022] Updated weights on worker 0-0, policy_version 129810 (0.00089) [2022-07-09 06:34:12,330][26022] Updated weights on worker 0-0, policy_version 129820 (0.00094) [2022-07-09 06:34:14,141][26022] Updated weights on worker 0-0, policy_version 129830 (0.00621) [2022-07-09 06:34:15,450][25689] Fps is (10 sec: 5788.1, 60 sec: 5756.7, 300 sec: 5754.1). Total num frames: 132954112. Throughput: 0: 5884.1. Samples: 132955640. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:15,450][25689] Avg episode reward: [(0, '-57.889')] [2022-07-09 06:34:15,750][26022] Updated weights on worker 0-0, policy_version 129840 (0.00085) [2022-07-09 06:34:17,514][26022] Updated weights on worker 0-0, policy_version 129850 (0.00081) [2022-07-09 06:34:19,185][26022] Updated weights on worker 0-0, policy_version 129860 (0.00083) [2022-07-09 06:34:20,518][25689] Fps is (10 sec: 5894.6, 60 sec: 5780.9, 300 sec: 5753.0). Total num frames: 132983808. Throughput: 0: 5912.3. Samples: 132990886. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:20,519][25689] Avg episode reward: [(0, '-57.429')] [2022-07-09 06:34:21,238][26022] Updated weights on worker 0-0, policy_version 129870 (0.00087) [2022-07-09 06:34:22,831][26022] Updated weights on worker 0-0, policy_version 129880 (0.00085) [2022-07-09 06:34:24,699][26022] Updated weights on worker 0-0, policy_version 129890 (0.00100) [2022-07-09 06:34:25,548][25689] Fps is (10 sec: 5881.3, 60 sec: 5781.7, 300 sec: 5759.8). Total num frames: 133013504. Throughput: 0: 5207.8. Samples: 133008524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:25,548][25689] Avg episode reward: [(0, '-56.996')] [2022-07-09 06:34:26,180][26022] Updated weights on worker 0-0, policy_version 129900 (0.00090) [2022-07-09 06:34:28,286][26022] Updated weights on worker 0-0, policy_version 129910 (0.00085) [2022-07-09 06:34:29,679][26022] Updated weights on worker 0-0, policy_version 129920 (0.00086) [2022-07-09 06:34:30,550][25689] Fps is (10 sec: 5818.4, 60 sec: 5785.4, 300 sec: 5757.1). Total num frames: 133042176. Throughput: 0: 6113.7. Samples: 133043580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:30,551][25689] Avg episode reward: [(0, '-56.333')] [2022-07-09 06:34:31,696][26022] Updated weights on worker 0-0, policy_version 129930 (0.00048) [2022-07-09 06:34:33,019][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:34:33,030][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000129939_133057536.pth [2022-07-09 06:34:33,030][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000127914_130983936.pth [2022-07-09 06:34:33,226][26022] Updated weights on worker 0-0, policy_version 129940 (0.00088) [2022-07-09 06:34:35,072][26022] Updated weights on worker 0-0, policy_version 129950 (0.00085) [2022-07-09 06:34:35,555][25689] Fps is (10 sec: 5730.6, 60 sec: 5752.7, 300 sec: 5759.4). Total num frames: 133070848. Throughput: 0: 6113.0. Samples: 133078542. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:35,555][25689] Avg episode reward: [(0, '-55.990')] [2022-07-09 06:34:36,946][26022] Updated weights on worker 0-0, policy_version 129960 (0.00086) [2022-07-09 06:34:38,675][26022] Updated weights on worker 0-0, policy_version 129970 (0.00087) [2022-07-09 06:34:40,367][26022] Updated weights on worker 0-0, policy_version 129980 (0.00088) [2022-07-09 06:34:40,684][25689] Fps is (10 sec: 5860.7, 60 sec: 5788.7, 300 sec: 5761.0). Total num frames: 133101568. Throughput: 0: 5214.9. Samples: 133096048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:40,684][25689] Avg episode reward: [(0, '-56.361')] [2022-07-09 06:34:42,197][26022] Updated weights on worker 0-0, policy_version 129990 (0.00090) [2022-07-09 06:34:43,613][26022] Updated weights on worker 0-0, policy_version 130000 (0.00087) [2022-07-09 06:34:45,650][26022] Updated weights on worker 0-0, policy_version 130010 (0.00610) [2022-07-09 06:34:45,703][25689] Fps is (10 sec: 5953.0, 60 sec: 5792.6, 300 sec: 5761.5). Total num frames: 133131264. Throughput: 0: 6091.9. Samples: 133131310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:45,704][25689] Avg episode reward: [(0, '-57.695')] [2022-07-09 06:34:47,314][26022] Updated weights on worker 0-0, policy_version 130020 (0.00084) [2022-07-09 06:34:49,048][26022] Updated weights on worker 0-0, policy_version 130030 (0.00082) [2022-07-09 06:34:50,731][25689] Fps is (10 sec: 5809.7, 60 sec: 5773.4, 300 sec: 5762.1). Total num frames: 133159936. Throughput: 0: 6105.9. Samples: 133166800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:50,731][25689] Avg episode reward: [(0, '-58.057')] [2022-07-09 06:34:50,919][26022] Updated weights on worker 0-0, policy_version 130040 (0.00089) [2022-07-09 06:34:52,478][26022] Updated weights on worker 0-0, policy_version 130050 (0.00086) [2022-07-09 06:34:54,438][26022] Updated weights on worker 0-0, policy_version 130060 (0.00053) [2022-07-09 06:34:55,750][25689] Fps is (10 sec: 5810.0, 60 sec: 5798.1, 300 sec: 5766.4). Total num frames: 133189632. Throughput: 0: 5239.2. Samples: 133184348. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:34:55,750][25689] Avg episode reward: [(0, '-57.893')] [2022-07-09 06:34:55,998][26022] Updated weights on worker 0-0, policy_version 130070 (0.00087) [2022-07-09 06:34:57,894][26022] Updated weights on worker 0-0, policy_version 130080 (0.00091) [2022-07-09 06:34:59,700][26022] Updated weights on worker 0-0, policy_version 130090 (0.00088) [2022-07-09 06:35:00,848][25689] Fps is (10 sec: 5769.1, 60 sec: 5797.3, 300 sec: 5771.8). Total num frames: 133218304. Throughput: 0: 6106.3. Samples: 133219176. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:00,849][25689] Avg episode reward: [(0, '-58.796')] [2022-07-09 06:35:01,336][26022] Updated weights on worker 0-0, policy_version 130100 (0.00093) [2022-07-09 06:35:03,618][26022] Updated weights on worker 0-0, policy_version 130110 (0.00090) [2022-07-09 06:35:05,401][26022] Updated weights on worker 0-0, policy_version 130120 (0.00082) [2022-07-09 06:35:05,892][25689] Fps is (10 sec: 5654.1, 60 sec: 5803.0, 300 sec: 5771.7). Total num frames: 133246976. Throughput: 0: 5964.3. Samples: 133251718. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:05,892][25689] Avg episode reward: [(0, '-58.373')] [2022-07-09 06:35:07,196][26022] Updated weights on worker 0-0, policy_version 130130 (0.00082) [2022-07-09 06:35:08,879][26022] Updated weights on worker 0-0, policy_version 130140 (0.00086) [2022-07-09 06:35:10,669][26022] Updated weights on worker 0-0, policy_version 130150 (0.00082) [2022-07-09 06:35:10,980][25689] Fps is (10 sec: 5660.0, 60 sec: 5802.0, 300 sec: 5767.3). Total num frames: 133275648. Throughput: 0: 5054.6. Samples: 133269150. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:10,980][25689] Avg episode reward: [(0, '-57.710')] [2022-07-09 06:35:12,479][26022] Updated weights on worker 0-0, policy_version 130160 (0.00083) [2022-07-09 06:35:14,044][26022] Updated weights on worker 0-0, policy_version 130170 (0.00084) [2022-07-09 06:35:16,064][25689] Fps is (10 sec: 5637.1, 60 sec: 5777.6, 300 sec: 5759.8). Total num frames: 133304320. Throughput: 0: 5899.5. Samples: 133304196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:16,066][25689] Avg episode reward: [(0, '-57.974')] [2022-07-09 06:35:16,070][26022] Updated weights on worker 0-0, policy_version 130180 (0.00087) [2022-07-09 06:35:17,540][26022] Updated weights on worker 0-0, policy_version 130190 (0.00084) [2022-07-09 06:35:19,523][26022] Updated weights on worker 0-0, policy_version 130200 (0.00079) [2022-07-09 06:35:21,085][26022] Updated weights on worker 0-0, policy_version 130210 (0.00089) [2022-07-09 06:35:21,119][25689] Fps is (10 sec: 5857.7, 60 sec: 5795.9, 300 sec: 5771.1). Total num frames: 133335040. Throughput: 0: 5920.9. Samples: 133339196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:21,119][25689] Avg episode reward: [(0, '-58.557')] [2022-07-09 06:35:22,866][26022] Updated weights on worker 0-0, policy_version 130220 (0.00097) [2022-07-09 06:35:24,799][26022] Updated weights on worker 0-0, policy_version 130230 (0.00086) [2022-07-09 06:35:26,128][25689] Fps is (10 sec: 5901.8, 60 sec: 5780.9, 300 sec: 5771.3). Total num frames: 133363712. Throughput: 0: 5179.9. Samples: 133356540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:26,128][25689] Avg episode reward: [(0, '-58.663')] [2022-07-09 06:35:26,534][26022] Updated weights on worker 0-0, policy_version 130240 (0.00093) [2022-07-09 06:35:28,316][26022] Updated weights on worker 0-0, policy_version 130250 (0.00095) [2022-07-09 06:35:30,162][26022] Updated weights on worker 0-0, policy_version 130260 (0.00079) [2022-07-09 06:35:31,136][25689] Fps is (10 sec: 5724.7, 60 sec: 5780.4, 300 sec: 5764.6). Total num frames: 133392384. Throughput: 0: 6055.4. Samples: 133391202. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:31,137][25689] Avg episode reward: [(0, '-57.929')] [2022-07-09 06:35:31,819][26022] Updated weights on worker 0-0, policy_version 130270 (0.00086) [2022-07-09 06:35:33,689][26022] Updated weights on worker 0-0, policy_version 130280 (0.00086) [2022-07-09 06:35:35,270][26022] Updated weights on worker 0-0, policy_version 130290 (0.00088) [2022-07-09 06:35:36,159][25689] Fps is (10 sec: 5716.8, 60 sec: 5778.6, 300 sec: 5763.4). Total num frames: 133421056. Throughput: 0: 6054.6. Samples: 133425858. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:36,159][25689] Avg episode reward: [(0, '-58.206')] [2022-07-09 06:35:37,288][26022] Updated weights on worker 0-0, policy_version 130300 (0.00089) [2022-07-09 06:35:38,857][26022] Updated weights on worker 0-0, policy_version 130310 (0.00084) [2022-07-09 06:35:40,698][26022] Updated weights on worker 0-0, policy_version 130320 (0.00086) [2022-07-09 06:35:41,202][25689] Fps is (10 sec: 5696.4, 60 sec: 5753.0, 300 sec: 5766.1). Total num frames: 133449728. Throughput: 0: 5177.4. Samples: 133443176. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:41,203][25689] Avg episode reward: [(0, '-57.833')] [2022-07-09 06:35:42,622][26022] Updated weights on worker 0-0, policy_version 130330 (0.00088) [2022-07-09 06:35:44,324][26022] Updated weights on worker 0-0, policy_version 130340 (0.00080) [2022-07-09 06:35:45,881][26022] Updated weights on worker 0-0, policy_version 130350 (0.00085) [2022-07-09 06:35:46,218][25689] Fps is (10 sec: 5904.4, 60 sec: 5770.3, 300 sec: 5770.7). Total num frames: 133480448. Throughput: 0: 6054.0. Samples: 133478164. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:46,219][25689] Avg episode reward: [(0, '-56.979')] [2022-07-09 06:35:47,939][26022] Updated weights on worker 0-0, policy_version 130360 (0.00083) [2022-07-09 06:35:49,326][26022] Updated weights on worker 0-0, policy_version 130370 (0.00084) [2022-07-09 06:35:51,238][25689] Fps is (10 sec: 5714.2, 60 sec: 5737.1, 300 sec: 5761.7). Total num frames: 133507072. Throughput: 0: 6056.0. Samples: 133512940. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:51,238][25689] Avg episode reward: [(0, '-56.653')] [2022-07-09 06:35:51,457][26022] Updated weights on worker 0-0, policy_version 130380 (0.00084) [2022-07-09 06:35:52,815][26022] Updated weights on worker 0-0, policy_version 130390 (0.00090) [2022-07-09 06:35:54,762][26022] Updated weights on worker 0-0, policy_version 130400 (0.00085) [2022-07-09 06:35:56,291][25689] Fps is (10 sec: 5692.9, 60 sec: 5750.8, 300 sec: 5765.7). Total num frames: 133537792. Throughput: 0: 5180.5. Samples: 133530152. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:35:56,291][25689] Avg episode reward: [(0, '-56.997')] [2022-07-09 06:35:56,711][26022] Updated weights on worker 0-0, policy_version 130410 (0.00082) [2022-07-09 06:35:58,288][26022] Updated weights on worker 0-0, policy_version 130420 (0.00088) [2022-07-09 06:36:00,191][26022] Updated weights on worker 0-0, policy_version 130430 (0.00082) [2022-07-09 06:36:01,351][25689] Fps is (10 sec: 5872.7, 60 sec: 5754.4, 300 sec: 5774.8). Total num frames: 133566464. Throughput: 0: 6054.7. Samples: 133565170. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:36:01,351][25689] Avg episode reward: [(0, '-57.248')] [2022-07-09 06:36:01,860][26022] Updated weights on worker 0-0, policy_version 130440 (0.00092) [2022-07-09 06:36:04,006][26022] Updated weights on worker 0-0, policy_version 130450 (0.00086) [2022-07-09 06:36:05,817][26022] Updated weights on worker 0-0, policy_version 130460 (0.00086) [2022-07-09 06:36:06,378][25689] Fps is (10 sec: 5583.2, 60 sec: 5739.1, 300 sec: 5767.8). Total num frames: 133594112. Throughput: 0: 5940.7. Samples: 133597930. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:36:06,379][25689] Avg episode reward: [(0, '-57.528')] [2022-07-09 06:36:07,615][26022] Updated weights on worker 0-0, policy_version 130470 (0.00086) [2022-07-09 06:36:09,407][26022] Updated weights on worker 0-0, policy_version 130480 (0.00084) [2022-07-09 06:36:11,131][26022] Updated weights on worker 0-0, policy_version 130490 (0.00084) [2022-07-09 06:36:11,400][25689] Fps is (10 sec: 5706.1, 60 sec: 5762.2, 300 sec: 5767.6). Total num frames: 133623808. Throughput: 0: 5930.3. Samples: 133632512. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 06:36:11,401][25689] Avg episode reward: [(0, '-57.952')] [2022-07-09 06:36:12,965][26022] Updated weights on worker 0-0, policy_version 130500 (0.00087) [2022-07-09 06:36:14,555][26022] Updated weights on worker 0-0, policy_version 130510 (0.00085) [2022-07-09 06:36:16,384][26022] Updated weights on worker 0-0, policy_version 130520 (0.00078) [2022-07-09 06:36:16,431][25689] Fps is (10 sec: 5806.2, 60 sec: 5767.5, 300 sec: 5773.0). Total num frames: 133652480. Throughput: 0: 5956.4. Samples: 133650114. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:16,431][25689] Avg episode reward: [(0, '-58.073')] [2022-07-09 06:36:18,041][26022] Updated weights on worker 0-0, policy_version 130530 (0.00084) [2022-07-09 06:36:20,028][26022] Updated weights on worker 0-0, policy_version 130540 (0.00094) [2022-07-09 06:36:21,471][25689] Fps is (10 sec: 5796.1, 60 sec: 5751.9, 300 sec: 5772.4). Total num frames: 133682176. Throughput: 0: 5972.1. Samples: 133685328. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:21,471][25689] Avg episode reward: [(0, '-57.729')] [2022-07-09 06:36:21,631][26022] Updated weights on worker 0-0, policy_version 130550 (0.00058) [2022-07-09 06:36:23,459][26022] Updated weights on worker 0-0, policy_version 130560 (0.00077) [2022-07-09 06:36:25,030][26022] Updated weights on worker 0-0, policy_version 130570 (0.00091) [2022-07-09 06:36:26,525][25689] Fps is (10 sec: 5883.9, 60 sec: 5764.5, 300 sec: 5771.8). Total num frames: 133711872. Throughput: 0: 6089.4. Samples: 133720612. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:26,525][25689] Avg episode reward: [(0, '-57.702')] [2022-07-09 06:36:26,962][26022] Updated weights on worker 0-0, policy_version 130580 (0.00088) [2022-07-09 06:36:28,463][26022] Updated weights on worker 0-0, policy_version 130590 (0.00086) [2022-07-09 06:36:30,504][26022] Updated weights on worker 0-0, policy_version 130600 (0.00097) [2022-07-09 06:36:31,537][25689] Fps is (10 sec: 5900.3, 60 sec: 5781.1, 300 sec: 5779.0). Total num frames: 133741568. Throughput: 0: 5243.9. Samples: 133738104. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:31,537][25689] Avg episode reward: [(0, '-57.102')] [2022-07-09 06:36:32,094][26022] Updated weights on worker 0-0, policy_version 130610 (0.00079) [2022-07-09 06:36:33,092][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:36:33,110][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000130614_133748736.pth [2022-07-09 06:36:33,110][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000128586_131672064.pth [2022-07-09 06:36:33,783][26022] Updated weights on worker 0-0, policy_version 130620 (0.00082) [2022-07-09 06:36:35,983][26022] Updated weights on worker 0-0, policy_version 130630 (0.00089) [2022-07-09 06:36:36,559][25689] Fps is (10 sec: 5817.0, 60 sec: 5781.2, 300 sec: 5777.0). Total num frames: 133770240. Throughput: 0: 6115.7. Samples: 133773210. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:36,559][25689] Avg episode reward: [(0, '-56.766')] [2022-07-09 06:36:37,335][26022] Updated weights on worker 0-0, policy_version 130640 (0.00083) [2022-07-09 06:36:39,341][26022] Updated weights on worker 0-0, policy_version 130650 (0.00084) [2022-07-09 06:36:40,772][26022] Updated weights on worker 0-0, policy_version 130660 (0.00095) [2022-07-09 06:36:41,634][25689] Fps is (10 sec: 5679.1, 60 sec: 5778.2, 300 sec: 5775.5). Total num frames: 133798912. Throughput: 0: 6083.4. Samples: 133807990. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:41,635][25689] Avg episode reward: [(0, '-56.981')] [2022-07-09 06:36:42,851][26022] Updated weights on worker 0-0, policy_version 130670 (0.00090) [2022-07-09 06:36:44,371][26022] Updated weights on worker 0-0, policy_version 130680 (0.00082) [2022-07-09 06:36:46,318][26022] Updated weights on worker 0-0, policy_version 130690 (0.00087) [2022-07-09 06:36:46,654][25689] Fps is (10 sec: 5883.0, 60 sec: 5777.7, 300 sec: 5779.0). Total num frames: 133829632. Throughput: 0: 5211.5. Samples: 133825518. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:46,656][25689] Avg episode reward: [(0, '-56.846')] [2022-07-09 06:36:47,981][26022] Updated weights on worker 0-0, policy_version 130700 (0.00082) [2022-07-09 06:36:49,716][26022] Updated weights on worker 0-0, policy_version 130710 (0.00085) [2022-07-09 06:36:51,567][26022] Updated weights on worker 0-0, policy_version 130720 (0.00089) [2022-07-09 06:36:51,695][25689] Fps is (10 sec: 5801.3, 60 sec: 5792.6, 300 sec: 5776.2). Total num frames: 133857280. Throughput: 0: 6078.0. Samples: 133860628. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:51,697][25689] Avg episode reward: [(0, '-57.472')] [2022-07-09 06:36:53,249][26022] Updated weights on worker 0-0, policy_version 130730 (0.00081) [2022-07-09 06:36:54,910][26022] Updated weights on worker 0-0, policy_version 130740 (0.00079) [2022-07-09 06:36:56,659][26022] Updated weights on worker 0-0, policy_version 130750 (0.00082) [2022-07-09 06:36:56,796][25689] Fps is (10 sec: 5755.4, 60 sec: 5788.1, 300 sec: 5775.9). Total num frames: 133888000. Throughput: 0: 6049.3. Samples: 133895630. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:36:56,797][25689] Avg episode reward: [(0, '-57.927')] [2022-07-09 06:36:58,578][26022] Updated weights on worker 0-0, policy_version 130760 (0.00082) [2022-07-09 06:37:00,187][26022] Updated weights on worker 0-0, policy_version 130770 (0.00094) [2022-07-09 06:37:01,833][25689] Fps is (10 sec: 5757.6, 60 sec: 5773.4, 300 sec: 5782.9). Total num frames: 133915648. Throughput: 0: 5204.1. Samples: 133913106. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:01,833][25689] Avg episode reward: [(0, '-57.763')] [2022-07-09 06:37:02,391][26022] Updated weights on worker 0-0, policy_version 130780 (0.00093) [2022-07-09 06:37:04,128][26022] Updated weights on worker 0-0, policy_version 130790 (0.00095) [2022-07-09 06:37:05,903][26022] Updated weights on worker 0-0, policy_version 130800 (0.00086) [2022-07-09 06:37:06,890][25689] Fps is (10 sec: 5477.9, 60 sec: 5770.5, 300 sec: 5776.7). Total num frames: 133943296. Throughput: 0: 5952.8. Samples: 133945978. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:06,891][25689] Avg episode reward: [(0, '-57.955')] [2022-07-09 06:37:07,600][26022] Updated weights on worker 0-0, policy_version 130810 (0.00085) [2022-07-09 06:37:09,590][26022] Updated weights on worker 0-0, policy_version 130820 (0.00086) [2022-07-09 06:37:11,348][26022] Updated weights on worker 0-0, policy_version 130830 (0.00085) [2022-07-09 06:37:11,899][25689] Fps is (10 sec: 5798.3, 60 sec: 5788.7, 300 sec: 5777.4). Total num frames: 133974016. Throughput: 0: 5939.8. Samples: 133980636. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:11,900][25689] Avg episode reward: [(0, '-58.823')] [2022-07-09 06:37:13,067][26022] Updated weights on worker 0-0, policy_version 130840 (0.00078) [2022-07-09 06:37:14,675][26022] Updated weights on worker 0-0, policy_version 130850 (0.00084) [2022-07-09 06:37:16,612][26022] Updated weights on worker 0-0, policy_version 130860 (0.00083) [2022-07-09 06:37:16,914][25689] Fps is (10 sec: 6027.1, 60 sec: 5807.1, 300 sec: 5781.6). Total num frames: 134003712. Throughput: 0: 5100.6. Samples: 133998246. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:16,915][25689] Avg episode reward: [(0, '-58.838')] [2022-07-09 06:37:18,316][26022] Updated weights on worker 0-0, policy_version 130870 (0.00088) [2022-07-09 06:37:19,959][26022] Updated weights on worker 0-0, policy_version 130880 (0.00091) [2022-07-09 06:37:21,896][26022] Updated weights on worker 0-0, policy_version 130890 (0.00097) [2022-07-09 06:37:22,029][25689] Fps is (10 sec: 5762.1, 60 sec: 5783.0, 300 sec: 5780.1). Total num frames: 134032384. Throughput: 0: 5945.6. Samples: 134033184. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:22,029][25689] Avg episode reward: [(0, '-58.847')] [2022-07-09 06:37:23,491][26022] Updated weights on worker 0-0, policy_version 130900 (0.00093) [2022-07-09 06:37:25,452][26022] Updated weights on worker 0-0, policy_version 130910 (0.00083) [2022-07-09 06:37:27,085][25689] Fps is (10 sec: 5738.8, 60 sec: 5782.8, 300 sec: 5775.6). Total num frames: 134062080. Throughput: 0: 6032.2. Samples: 134067798. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:27,086][25689] Avg episode reward: [(0, '-58.093')] [2022-07-09 06:37:27,087][26022] Updated weights on worker 0-0, policy_version 130920 (0.00098) [2022-07-09 06:37:28,925][26022] Updated weights on worker 0-0, policy_version 130930 (0.00086) [2022-07-09 06:37:30,791][26022] Updated weights on worker 0-0, policy_version 130940 (0.00085) [2022-07-09 06:37:32,096][25689] Fps is (10 sec: 5696.1, 60 sec: 5749.1, 300 sec: 5775.8). Total num frames: 134089728. Throughput: 0: 5183.3. Samples: 134085322. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:32,097][25689] Avg episode reward: [(0, '-57.162')] [2022-07-09 06:37:32,456][26022] Updated weights on worker 0-0, policy_version 130950 (0.00089) [2022-07-09 06:37:34,325][26022] Updated weights on worker 0-0, policy_version 130960 (0.00089) [2022-07-09 06:37:35,979][26022] Updated weights on worker 0-0, policy_version 130970 (0.00080) [2022-07-09 06:37:37,103][25689] Fps is (10 sec: 5621.6, 60 sec: 5750.5, 300 sec: 5773.5). Total num frames: 134118400. Throughput: 0: 6028.5. Samples: 134119956. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:37,104][25689] Avg episode reward: [(0, '-56.388')] [2022-07-09 06:37:37,816][26022] Updated weights on worker 0-0, policy_version 130980 (0.00085) [2022-07-09 06:37:39,422][26022] Updated weights on worker 0-0, policy_version 130990 (0.00088) [2022-07-09 06:37:41,435][26022] Updated weights on worker 0-0, policy_version 131000 (0.00087) [2022-07-09 06:37:42,186][25689] Fps is (10 sec: 5886.3, 60 sec: 5783.6, 300 sec: 5775.6). Total num frames: 134149120. Throughput: 0: 6028.3. Samples: 134154696. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:42,186][25689] Avg episode reward: [(0, '-56.342')] [2022-07-09 06:37:43,070][26022] Updated weights on worker 0-0, policy_version 131010 (0.00083) [2022-07-09 06:37:45,030][26022] Updated weights on worker 0-0, policy_version 131020 (0.00394) [2022-07-09 06:37:46,444][26022] Updated weights on worker 0-0, policy_version 131030 (0.00100) [2022-07-09 06:37:47,227][25689] Fps is (10 sec: 5765.6, 60 sec: 5730.9, 300 sec: 5768.3). Total num frames: 134176768. Throughput: 0: 5180.0. Samples: 134172132. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:47,227][25689] Avg episode reward: [(0, '-56.225')] [2022-07-09 06:37:48,514][26022] Updated weights on worker 0-0, policy_version 131040 (0.00082) [2022-07-09 06:37:49,873][26022] Updated weights on worker 0-0, policy_version 131050 (0.00085) [2022-07-09 06:37:51,884][26022] Updated weights on worker 0-0, policy_version 131060 (0.00615) [2022-07-09 06:37:52,259][25689] Fps is (10 sec: 5794.6, 60 sec: 5782.5, 300 sec: 5774.9). Total num frames: 134207488. Throughput: 0: 6047.3. Samples: 134207252. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:52,259][25689] Avg episode reward: [(0, '-56.412')] [2022-07-09 06:37:53,412][26022] Updated weights on worker 0-0, policy_version 131070 (0.00086) [2022-07-09 06:37:55,379][26022] Updated weights on worker 0-0, policy_version 131080 (0.00096) [2022-07-09 06:37:57,137][26022] Updated weights on worker 0-0, policy_version 131090 (0.00079) [2022-07-09 06:37:57,266][25689] Fps is (10 sec: 5915.9, 60 sec: 5757.5, 300 sec: 5768.7). Total num frames: 134236160. Throughput: 0: 6070.4. Samples: 134242352. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:37:57,267][25689] Avg episode reward: [(0, '-56.924')] [2022-07-09 06:37:58,871][26022] Updated weights on worker 0-0, policy_version 131100 (0.00092) [2022-07-09 06:38:00,561][26022] Updated weights on worker 0-0, policy_version 131110 (0.00079) [2022-07-09 06:38:02,307][25689] Fps is (10 sec: 5605.0, 60 sec: 5757.2, 300 sec: 5774.9). Total num frames: 134263808. Throughput: 0: 5229.9. Samples: 134259926. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:38:02,309][25689] Avg episode reward: [(0, '-56.730')] [2022-07-09 06:38:02,651][26022] Updated weights on worker 0-0, policy_version 131120 (0.00094) [2022-07-09 06:38:04,604][26022] Updated weights on worker 0-0, policy_version 131130 (0.00207) [2022-07-09 06:38:06,127][26022] Updated weights on worker 0-0, policy_version 131140 (0.00085) [2022-07-09 06:38:07,320][25689] Fps is (10 sec: 5601.8, 60 sec: 5778.3, 300 sec: 5771.5). Total num frames: 134292480. Throughput: 0: 6003.4. Samples: 134292760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:38:07,322][25689] Avg episode reward: [(0, '-56.662')] [2022-07-09 06:38:08,094][26022] Updated weights on worker 0-0, policy_version 131150 (0.00084) [2022-07-09 06:38:09,842][26022] Updated weights on worker 0-0, policy_version 131160 (0.00085) [2022-07-09 06:38:11,766][26022] Updated weights on worker 0-0, policy_version 131170 (0.00082) [2022-07-09 06:38:12,323][25689] Fps is (10 sec: 5827.7, 60 sec: 5762.0, 300 sec: 5771.7). Total num frames: 134322176. Throughput: 0: 5983.3. Samples: 134327300. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:38:12,323][25689] Avg episode reward: [(0, '-57.150')] [2022-07-09 06:38:13,371][26022] Updated weights on worker 0-0, policy_version 131180 (0.00086) [2022-07-09 06:38:15,160][26022] Updated weights on worker 0-0, policy_version 131190 (0.00090) [2022-07-09 06:38:17,016][26022] Updated weights on worker 0-0, policy_version 131200 (0.00090) [2022-07-09 06:38:17,410][25689] Fps is (10 sec: 5784.5, 60 sec: 5738.1, 300 sec: 5772.8). Total num frames: 134350848. Throughput: 0: 5086.8. Samples: 134344822. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 06:38:17,411][25689] Avg episode reward: [(0, '-56.974')] [2022-07-09 06:38:18,597][26022] Updated weights on worker 0-0, policy_version 131210 (0.00084) [2022-07-09 06:38:20,559][26022] Updated weights on worker 0-0, policy_version 131220 (0.00086) [2022-07-09 06:38:22,185][26022] Updated weights on worker 0-0, policy_version 131230 (0.00090) [2022-07-09 06:38:22,448][25689] Fps is (10 sec: 5764.7, 60 sec: 5762.4, 300 sec: 5772.8). Total num frames: 134380544. Throughput: 0: 5949.3. Samples: 134379750. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:22,448][25689] Avg episode reward: [(0, '-56.536')] [2022-07-09 06:38:23,969][26022] Updated weights on worker 0-0, policy_version 131240 (0.00083) [2022-07-09 06:38:25,684][26022] Updated weights on worker 0-0, policy_version 131250 (0.00082) [2022-07-09 06:38:27,496][25689] Fps is (10 sec: 5787.4, 60 sec: 5746.3, 300 sec: 5772.7). Total num frames: 134409216. Throughput: 0: 6046.8. Samples: 134414758. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:27,496][25689] Avg episode reward: [(0, '-57.905')] [2022-07-09 06:38:27,535][26022] Updated weights on worker 0-0, policy_version 131260 (0.00094) [2022-07-09 06:38:29,243][26022] Updated weights on worker 0-0, policy_version 131270 (0.00086) [2022-07-09 06:38:30,895][26022] Updated weights on worker 0-0, policy_version 131280 (0.00093) [2022-07-09 06:38:32,516][25689] Fps is (10 sec: 5898.8, 60 sec: 5796.2, 300 sec: 5772.6). Total num frames: 134439936. Throughput: 0: 5203.6. Samples: 134432380. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:32,517][25689] Avg episode reward: [(0, '-58.589')] [2022-07-09 06:38:32,633][26022] Updated weights on worker 0-0, policy_version 131290 (0.00086) [2022-07-09 06:38:33,299][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:38:33,313][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000131294_134445056.pth [2022-07-09 06:38:33,314][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000129261_132363264.pth [2022-07-09 06:38:34,614][26022] Updated weights on worker 0-0, policy_version 131300 (0.00089) [2022-07-09 06:38:36,233][26022] Updated weights on worker 0-0, policy_version 131310 (0.00097) [2022-07-09 06:38:37,586][25689] Fps is (10 sec: 5885.9, 60 sec: 5790.2, 300 sec: 5774.2). Total num frames: 134468608. Throughput: 0: 6079.1. Samples: 134467474. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:37,587][25689] Avg episode reward: [(0, '-58.864')] [2022-07-09 06:38:38,013][26022] Updated weights on worker 0-0, policy_version 131320 (0.00089) [2022-07-09 06:38:39,737][26022] Updated weights on worker 0-0, policy_version 131330 (0.00094) [2022-07-09 06:38:41,590][26022] Updated weights on worker 0-0, policy_version 131340 (0.00088) [2022-07-09 06:38:42,684][25689] Fps is (10 sec: 5740.2, 60 sec: 5771.8, 300 sec: 5773.5). Total num frames: 134498304. Throughput: 0: 6040.5. Samples: 134501990. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:42,690][25689] Avg episode reward: [(0, '-57.533')] [2022-07-09 06:38:43,254][26022] Updated weights on worker 0-0, policy_version 131350 (0.00094) [2022-07-09 06:38:45,211][26022] Updated weights on worker 0-0, policy_version 131360 (0.00094) [2022-07-09 06:38:46,924][26022] Updated weights on worker 0-0, policy_version 131370 (0.00085) [2022-07-09 06:38:47,701][25689] Fps is (10 sec: 5770.8, 60 sec: 5791.1, 300 sec: 5769.8). Total num frames: 134526976. Throughput: 0: 5176.5. Samples: 134519348. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:47,701][25689] Avg episode reward: [(0, '-57.755')] [2022-07-09 06:38:48,684][26022] Updated weights on worker 0-0, policy_version 131380 (0.00091) [2022-07-09 06:38:50,363][26022] Updated weights on worker 0-0, policy_version 131390 (0.00091) [2022-07-09 06:38:52,126][26022] Updated weights on worker 0-0, policy_version 131400 (0.00089) [2022-07-09 06:38:52,722][25689] Fps is (10 sec: 5611.1, 60 sec: 5741.3, 300 sec: 5767.9). Total num frames: 134554624. Throughput: 0: 6025.2. Samples: 134554122. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:52,722][25689] Avg episode reward: [(0, '-56.432')] [2022-07-09 06:38:54,084][26022] Updated weights on worker 0-0, policy_version 131410 (0.00081) [2022-07-09 06:38:55,977][26022] Updated weights on worker 0-0, policy_version 131420 (0.00091) [2022-07-09 06:38:57,609][26022] Updated weights on worker 0-0, policy_version 131430 (0.00081) [2022-07-09 06:38:57,728][25689] Fps is (10 sec: 5820.9, 60 sec: 5775.3, 300 sec: 5776.4). Total num frames: 134585344. Throughput: 0: 6010.5. Samples: 134588536. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:38:57,729][25689] Avg episode reward: [(0, '-55.460')] [2022-07-09 06:38:59,573][26022] Updated weights on worker 0-0, policy_version 131440 (0.00092) [2022-07-09 06:39:01,173][26022] Updated weights on worker 0-0, policy_version 131450 (0.00090) [2022-07-09 06:39:02,833][25689] Fps is (10 sec: 5570.2, 60 sec: 5735.3, 300 sec: 5766.0). Total num frames: 134610944. Throughput: 0: 5917.9. Samples: 134621226. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:02,833][25689] Avg episode reward: [(0, '-55.682')] [2022-07-09 06:39:03,411][26022] Updated weights on worker 0-0, policy_version 131460 (0.00089) [2022-07-09 06:39:04,928][26022] Updated weights on worker 0-0, policy_version 131470 (0.00084) [2022-07-09 06:39:06,815][26022] Updated weights on worker 0-0, policy_version 131480 (0.00082) [2022-07-09 06:39:07,847][25689] Fps is (10 sec: 5667.2, 60 sec: 5786.0, 300 sec: 5777.6). Total num frames: 134642688. Throughput: 0: 5928.6. Samples: 134638784. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:07,848][25689] Avg episode reward: [(0, '-56.958')] [2022-07-09 06:39:08,520][26022] Updated weights on worker 0-0, policy_version 131490 (0.00089) [2022-07-09 06:39:10,324][26022] Updated weights on worker 0-0, policy_version 131500 (0.00095) [2022-07-09 06:39:12,192][26022] Updated weights on worker 0-0, policy_version 131510 (0.00997) [2022-07-09 06:39:12,858][25689] Fps is (10 sec: 5924.1, 60 sec: 5751.3, 300 sec: 5770.6). Total num frames: 134670336. Throughput: 0: 5915.5. Samples: 134673240. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:12,859][25689] Avg episode reward: [(0, '-58.781')] [2022-07-09 06:39:13,977][26022] Updated weights on worker 0-0, policy_version 131520 (0.00082) [2022-07-09 06:39:15,723][26022] Updated weights on worker 0-0, policy_version 131530 (0.00084) [2022-07-09 06:39:17,701][26022] Updated weights on worker 0-0, policy_version 131540 (0.00083) [2022-07-09 06:39:17,895][25689] Fps is (10 sec: 5503.3, 60 sec: 5739.3, 300 sec: 5764.3). Total num frames: 134697984. Throughput: 0: 5926.0. Samples: 134708042. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:17,895][25689] Avg episode reward: [(0, '-58.247')] [2022-07-09 06:39:19,025][26022] Updated weights on worker 0-0, policy_version 131550 (0.00093) [2022-07-09 06:39:21,283][26022] Updated weights on worker 0-0, policy_version 131560 (0.00083) [2022-07-09 06:39:22,783][26022] Updated weights on worker 0-0, policy_version 131570 (0.00081) [2022-07-09 06:39:22,945][25689] Fps is (10 sec: 5786.4, 60 sec: 5754.9, 300 sec: 5767.4). Total num frames: 134728704. Throughput: 0: 5175.7. Samples: 134725324. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:22,946][25689] Avg episode reward: [(0, '-58.528')] [2022-07-09 06:39:24,687][26022] Updated weights on worker 0-0, policy_version 131580 (0.00095) [2022-07-09 06:39:26,519][26022] Updated weights on worker 0-0, policy_version 131590 (0.00086) [2022-07-09 06:39:27,959][25689] Fps is (10 sec: 5799.6, 60 sec: 5741.3, 300 sec: 5763.7). Total num frames: 134756352. Throughput: 0: 6015.6. Samples: 134759770. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:27,959][25689] Avg episode reward: [(0, '-59.196')] [2022-07-09 06:39:28,186][26022] Updated weights on worker 0-0, policy_version 131600 (0.00088) [2022-07-09 06:39:30,003][26022] Updated weights on worker 0-0, policy_version 131610 (0.00086) [2022-07-09 06:39:31,699][26022] Updated weights on worker 0-0, policy_version 131620 (0.00087) [2022-07-09 06:39:32,988][25689] Fps is (10 sec: 5709.9, 60 sec: 5723.5, 300 sec: 5766.7). Total num frames: 134786048. Throughput: 0: 6024.2. Samples: 134794508. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:32,990][25689] Avg episode reward: [(0, '-60.180')] [2022-07-09 06:39:33,523][26022] Updated weights on worker 0-0, policy_version 131630 (0.00086) [2022-07-09 06:39:35,234][26022] Updated weights on worker 0-0, policy_version 131640 (0.00106) [2022-07-09 06:39:37,115][26022] Updated weights on worker 0-0, policy_version 131650 (0.00083) [2022-07-09 06:39:38,008][25689] Fps is (10 sec: 5807.9, 60 sec: 5728.2, 300 sec: 5761.9). Total num frames: 134814720. Throughput: 0: 5159.8. Samples: 134811826. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:38,009][25689] Avg episode reward: [(0, '-58.207')] [2022-07-09 06:39:38,816][26022] Updated weights on worker 0-0, policy_version 131660 (0.00093) [2022-07-09 06:39:40,669][26022] Updated weights on worker 0-0, policy_version 131670 (0.00090) [2022-07-09 06:39:42,339][26022] Updated weights on worker 0-0, policy_version 131680 (0.00085) [2022-07-09 06:39:43,073][25689] Fps is (10 sec: 5787.8, 60 sec: 5731.4, 300 sec: 5761.0). Total num frames: 134844416. Throughput: 0: 6016.9. Samples: 134846430. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:43,073][25689] Avg episode reward: [(0, '-57.310')] [2022-07-09 06:39:44,201][26022] Updated weights on worker 0-0, policy_version 131690 (0.00084) [2022-07-09 06:39:45,668][26022] Updated weights on worker 0-0, policy_version 131700 (0.00095) [2022-07-09 06:39:47,589][26022] Updated weights on worker 0-0, policy_version 131710 (0.00084) [2022-07-09 06:39:48,104][25689] Fps is (10 sec: 5781.6, 60 sec: 5730.0, 300 sec: 5761.0). Total num frames: 134873088. Throughput: 0: 6051.7. Samples: 134881682. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:48,104][25689] Avg episode reward: [(0, '-58.155')] [2022-07-09 06:39:49,247][26022] Updated weights on worker 0-0, policy_version 131720 (0.00085) [2022-07-09 06:39:50,931][26022] Updated weights on worker 0-0, policy_version 131730 (0.00083) [2022-07-09 06:39:52,842][26022] Updated weights on worker 0-0, policy_version 131740 (0.00083) [2022-07-09 06:39:53,110][25689] Fps is (10 sec: 5917.0, 60 sec: 5782.3, 300 sec: 5764.6). Total num frames: 134903808. Throughput: 0: 5200.4. Samples: 134899152. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:53,111][25689] Avg episode reward: [(0, '-58.211')] [2022-07-09 06:39:54,795][26022] Updated weights on worker 0-0, policy_version 131750 (0.00086) [2022-07-09 06:39:56,266][26022] Updated weights on worker 0-0, policy_version 131760 (0.00078) [2022-07-09 06:39:58,128][25689] Fps is (10 sec: 5822.9, 60 sec: 5730.4, 300 sec: 5762.8). Total num frames: 134931456. Throughput: 0: 6076.9. Samples: 134934090. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:39:58,128][25689] Avg episode reward: [(0, '-57.663')] [2022-07-09 06:39:58,315][26022] Updated weights on worker 0-0, policy_version 131770 (0.00095) [2022-07-09 06:39:59,966][26022] Updated weights on worker 0-0, policy_version 131780 (0.00069) [2022-07-09 06:40:01,793][26022] Updated weights on worker 0-0, policy_version 131790 (0.00078) [2022-07-09 06:40:03,249][25689] Fps is (10 sec: 5454.0, 60 sec: 5762.7, 300 sec: 5757.8). Total num frames: 134959104. Throughput: 0: 5960.8. Samples: 134966696. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:40:03,249][25689] Avg episode reward: [(0, '-56.809')] [2022-07-09 06:40:03,986][26022] Updated weights on worker 0-0, policy_version 131801 (0.00095) [2022-07-09 06:40:05,707][26022] Updated weights on worker 0-0, policy_version 131811 (0.00078) [2022-07-09 06:40:07,787][26022] Updated weights on worker 0-0, policy_version 131821 (0.00086) [2022-07-09 06:40:08,332][25689] Fps is (10 sec: 5720.0, 60 sec: 5739.2, 300 sec: 5764.8). Total num frames: 134989824. Throughput: 0: 5063.2. Samples: 134984100. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:40:08,333][25689] Avg episode reward: [(0, '-57.344')] [2022-07-09 06:40:09,265][26022] Updated weights on worker 0-0, policy_version 131831 (0.00074) [2022-07-09 06:40:11,183][26022] Updated weights on worker 0-0, policy_version 131841 (0.00081) [2022-07-09 06:40:12,732][26022] Updated weights on worker 0-0, policy_version 131851 (0.00090) [2022-07-09 06:40:13,381][25689] Fps is (10 sec: 5659.9, 60 sec: 5718.7, 300 sec: 5758.6). Total num frames: 135016448. Throughput: 0: 5895.8. Samples: 135018660. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:40:13,382][25689] Avg episode reward: [(0, '-57.650')] [2022-07-09 06:40:14,706][26022] Updated weights on worker 0-0, policy_version 131861 (0.00088) [2022-07-09 06:40:16,690][26022] Updated weights on worker 0-0, policy_version 131871 (0.00090) [2022-07-09 06:40:18,259][26022] Updated weights on worker 0-0, policy_version 131881 (0.00085) [2022-07-09 06:40:18,417][25689] Fps is (10 sec: 5787.3, 60 sec: 5786.4, 300 sec: 5762.4). Total num frames: 135048192. Throughput: 0: 5868.9. Samples: 135053168. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2022-07-09 06:40:18,418][25689] Avg episode reward: [(0, '-58.111')] [2022-07-09 06:40:20,031][26022] Updated weights on worker 0-0, policy_version 131891 (0.00094) [2022-07-09 06:40:21,928][26022] Updated weights on worker 0-0, policy_version 131901 (0.00085) [2022-07-09 06:40:23,493][25689] Fps is (10 sec: 5873.0, 60 sec: 5733.2, 300 sec: 5757.7). Total num frames: 135075840. Throughput: 0: 5119.6. Samples: 135070340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:23,495][25689] Avg episode reward: [(0, '-57.535')] [2022-07-09 06:40:23,627][26022] Updated weights on worker 0-0, policy_version 131911 (0.00090) [2022-07-09 06:40:25,583][26022] Updated weights on worker 0-0, policy_version 131921 (0.00081) [2022-07-09 06:40:27,102][26022] Updated weights on worker 0-0, policy_version 131931 (0.00092) [2022-07-09 06:40:28,594][25689] Fps is (10 sec: 5534.3, 60 sec: 5741.9, 300 sec: 5755.9). Total num frames: 135104512. Throughput: 0: 5957.5. Samples: 135104812. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:28,594][25689] Avg episode reward: [(0, '-57.127')] [2022-07-09 06:40:29,153][26022] Updated weights on worker 0-0, policy_version 131941 (0.00084) [2022-07-09 06:40:30,714][26022] Updated weights on worker 0-0, policy_version 131951 (0.00091) [2022-07-09 06:40:32,613][26022] Updated weights on worker 0-0, policy_version 131961 (0.00083) [2022-07-09 06:40:33,492][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:40:33,507][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000131967_135134208.pth [2022-07-09 06:40:33,508][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000129939_133057536.pth [2022-07-09 06:40:33,624][25689] Fps is (10 sec: 5761.5, 60 sec: 5741.8, 300 sec: 5759.2). Total num frames: 135134208. Throughput: 0: 5985.7. Samples: 135139832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:33,625][25689] Avg episode reward: [(0, '-56.446')] [2022-07-09 06:40:34,051][26022] Updated weights on worker 0-0, policy_version 131971 (0.00087) [2022-07-09 06:40:36,221][26022] Updated weights on worker 0-0, policy_version 131981 (0.00626) [2022-07-09 06:40:37,689][26022] Updated weights on worker 0-0, policy_version 131991 (0.00085) [2022-07-09 06:40:38,643][25689] Fps is (10 sec: 5910.3, 60 sec: 5758.8, 300 sec: 5763.1). Total num frames: 135163904. Throughput: 0: 5158.3. Samples: 135157498. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:38,644][25689] Avg episode reward: [(0, '-56.726')] [2022-07-09 06:40:39,585][26022] Updated weights on worker 0-0, policy_version 132001 (0.00082) [2022-07-09 06:40:41,215][26022] Updated weights on worker 0-0, policy_version 132011 (0.00081) [2022-07-09 06:40:43,143][26022] Updated weights on worker 0-0, policy_version 132021 (0.00095) [2022-07-09 06:40:43,683][25689] Fps is (10 sec: 5700.5, 60 sec: 5727.3, 300 sec: 5752.3). Total num frames: 135191552. Throughput: 0: 6042.8. Samples: 135192346. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:43,684][25689] Avg episode reward: [(0, '-56.553')] [2022-07-09 06:40:44,700][26022] Updated weights on worker 0-0, policy_version 132031 (0.00081) [2022-07-09 06:40:46,748][26022] Updated weights on worker 0-0, policy_version 132041 (0.00088) [2022-07-09 06:40:48,340][26022] Updated weights on worker 0-0, policy_version 132051 (0.00085) [2022-07-09 06:40:48,698][25689] Fps is (10 sec: 5804.8, 60 sec: 5762.7, 300 sec: 5766.1). Total num frames: 135222272. Throughput: 0: 6069.0. Samples: 135226824. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:48,698][25689] Avg episode reward: [(0, '-56.140')] [2022-07-09 06:40:50,272][26022] Updated weights on worker 0-0, policy_version 132061 (0.00085) [2022-07-09 06:40:51,925][26022] Updated weights on worker 0-0, policy_version 132071 (0.00083) [2022-07-09 06:40:53,709][25689] Fps is (10 sec: 5822.1, 60 sec: 5711.6, 300 sec: 5756.6). Total num frames: 135249920. Throughput: 0: 5201.6. Samples: 135244304. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:53,709][25689] Avg episode reward: [(0, '-56.416')] [2022-07-09 06:40:53,873][26022] Updated weights on worker 0-0, policy_version 132081 (0.00092) [2022-07-09 06:40:55,343][26022] Updated weights on worker 0-0, policy_version 132091 (0.00088) [2022-07-09 06:40:57,250][26022] Updated weights on worker 0-0, policy_version 132101 (0.00090) [2022-07-09 06:40:58,719][25689] Fps is (10 sec: 5722.5, 60 sec: 5746.1, 300 sec: 5761.1). Total num frames: 135279616. Throughput: 0: 6055.6. Samples: 135279070. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:40:58,719][25689] Avg episode reward: [(0, '-56.987')] [2022-07-09 06:40:59,123][26022] Updated weights on worker 0-0, policy_version 132111 (0.00085) [2022-07-09 06:41:00,859][26022] Updated weights on worker 0-0, policy_version 132121 (0.00085) [2022-07-09 06:41:02,953][26022] Updated weights on worker 0-0, policy_version 132131 (0.00420) [2022-07-09 06:41:03,780][25689] Fps is (10 sec: 5693.9, 60 sec: 5751.8, 300 sec: 5760.4). Total num frames: 135307264. Throughput: 0: 5936.9. Samples: 135311656. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:03,780][25689] Avg episode reward: [(0, '-56.729')] [2022-07-09 06:41:04,749][26022] Updated weights on worker 0-0, policy_version 132141 (0.00084) [2022-07-09 06:41:06,326][26022] Updated weights on worker 0-0, policy_version 132151 (0.00091) [2022-07-09 06:41:08,328][26022] Updated weights on worker 0-0, policy_version 132161 (0.00089) [2022-07-09 06:41:08,798][25689] Fps is (10 sec: 5485.8, 60 sec: 5707.1, 300 sec: 5753.6). Total num frames: 135334912. Throughput: 0: 5085.8. Samples: 135329052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:08,799][25689] Avg episode reward: [(0, '-56.288')] [2022-07-09 06:41:09,989][26022] Updated weights on worker 0-0, policy_version 132171 (0.00087) [2022-07-09 06:41:11,914][26022] Updated weights on worker 0-0, policy_version 132181 (0.00089) [2022-07-09 06:41:13,551][26022] Updated weights on worker 0-0, policy_version 132191 (0.00107) [2022-07-09 06:41:13,831][25689] Fps is (10 sec: 5807.3, 60 sec: 5776.4, 300 sec: 5760.4). Total num frames: 135365632. Throughput: 0: 5934.7. Samples: 135363722. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:13,831][25689] Avg episode reward: [(0, '-56.163')] [2022-07-09 06:41:15,409][26022] Updated weights on worker 0-0, policy_version 132201 (0.00084) [2022-07-09 06:41:17,079][26022] Updated weights on worker 0-0, policy_version 132211 (0.00086) [2022-07-09 06:41:18,842][25689] Fps is (10 sec: 5811.4, 60 sec: 5711.0, 300 sec: 5754.1). Total num frames: 135393280. Throughput: 0: 5917.3. Samples: 135398146. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:18,843][25689] Avg episode reward: [(0, '-55.569')] [2022-07-09 06:41:19,002][26022] Updated weights on worker 0-0, policy_version 132221 (0.00089) [2022-07-09 06:41:20,737][26022] Updated weights on worker 0-0, policy_version 132231 (0.00091) [2022-07-09 06:41:22,395][26022] Updated weights on worker 0-0, policy_version 132241 (0.00082) [2022-07-09 06:41:23,892][25689] Fps is (10 sec: 5801.2, 60 sec: 5764.4, 300 sec: 5757.6). Total num frames: 135424000. Throughput: 0: 5171.5. Samples: 135415666. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:23,892][25689] Avg episode reward: [(0, '-54.925')] [2022-07-09 06:41:24,071][26022] Updated weights on worker 0-0, policy_version 132251 (0.00083) [2022-07-09 06:41:25,888][26022] Updated weights on worker 0-0, policy_version 132261 (0.00091) [2022-07-09 06:41:27,824][26022] Updated weights on worker 0-0, policy_version 132271 (0.00090) [2022-07-09 06:41:28,915][25689] Fps is (10 sec: 5693.1, 60 sec: 5737.8, 300 sec: 5747.1). Total num frames: 135450624. Throughput: 0: 6044.4. Samples: 135450644. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:28,915][25689] Avg episode reward: [(0, '-55.547')] [2022-07-09 06:41:29,538][26022] Updated weights on worker 0-0, policy_version 132281 (0.00086) [2022-07-09 06:41:31,279][26022] Updated weights on worker 0-0, policy_version 132291 (0.00097) [2022-07-09 06:41:33,030][26022] Updated weights on worker 0-0, policy_version 132301 (0.00084) [2022-07-09 06:41:33,943][25689] Fps is (10 sec: 5705.4, 60 sec: 5755.0, 300 sec: 5753.9). Total num frames: 135481344. Throughput: 0: 6039.7. Samples: 135485194. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:33,944][25689] Avg episode reward: [(0, '-57.130')] [2022-07-09 06:41:34,879][26022] Updated weights on worker 0-0, policy_version 132311 (0.00086) [2022-07-09 06:41:36,698][26022] Updated weights on worker 0-0, policy_version 132321 (0.00088) [2022-07-09 06:41:38,385][26022] Updated weights on worker 0-0, policy_version 132331 (0.00092) [2022-07-09 06:41:39,020][25689] Fps is (10 sec: 5877.5, 60 sec: 5732.5, 300 sec: 5753.8). Total num frames: 135510016. Throughput: 0: 5171.5. Samples: 135502494. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:39,020][25689] Avg episode reward: [(0, '-57.058')] [2022-07-09 06:41:40,333][26022] Updated weights on worker 0-0, policy_version 132341 (0.00091) [2022-07-09 06:41:41,996][26022] Updated weights on worker 0-0, policy_version 132351 (0.00086) [2022-07-09 06:41:43,780][26022] Updated weights on worker 0-0, policy_version 132361 (0.00091) [2022-07-09 06:41:44,055][25689] Fps is (10 sec: 5670.6, 60 sec: 5750.0, 300 sec: 5746.6). Total num frames: 135538688. Throughput: 0: 6023.9. Samples: 135537128. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:44,056][25689] Avg episode reward: [(0, '-57.700')] [2022-07-09 06:41:45,597][26022] Updated weights on worker 0-0, policy_version 132371 (0.00084) [2022-07-09 06:41:47,114][26022] Updated weights on worker 0-0, policy_version 132381 (0.00083) [2022-07-09 06:41:49,073][25689] Fps is (10 sec: 5602.1, 60 sec: 5698.8, 300 sec: 5747.1). Total num frames: 135566336. Throughput: 0: 5996.6. Samples: 135571524. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:49,075][25689] Avg episode reward: [(0, '-59.365')] [2022-07-09 06:41:49,222][26022] Updated weights on worker 0-0, policy_version 132391 (0.00085) [2022-07-09 06:41:50,777][26022] Updated weights on worker 0-0, policy_version 132401 (0.00085) [2022-07-09 06:41:52,616][26022] Updated weights on worker 0-0, policy_version 132411 (0.00081) [2022-07-09 06:41:54,080][25689] Fps is (10 sec: 5924.6, 60 sec: 5767.0, 300 sec: 5752.3). Total num frames: 135598080. Throughput: 0: 5160.1. Samples: 135589104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:54,080][25689] Avg episode reward: [(0, '-59.921')] [2022-07-09 06:41:54,419][26022] Updated weights on worker 0-0, policy_version 132421 (0.00083) [2022-07-09 06:41:55,931][26022] Updated weights on worker 0-0, policy_version 132431 (0.00079) [2022-07-09 06:41:58,012][26022] Updated weights on worker 0-0, policy_version 132441 (0.00371) [2022-07-09 06:41:59,144][25689] Fps is (10 sec: 5998.8, 60 sec: 5744.8, 300 sec: 5755.3). Total num frames: 135626752. Throughput: 0: 6045.7. Samples: 135624162. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:41:59,156][25689] Avg episode reward: [(0, '-58.272')] [2022-07-09 06:41:59,586][26022] Updated weights on worker 0-0, policy_version 132451 (0.00082) [2022-07-09 06:42:01,563][26022] Updated weights on worker 0-0, policy_version 132461 (0.00094) [2022-07-09 06:42:03,615][26022] Updated weights on worker 0-0, policy_version 132471 (0.00082) [2022-07-09 06:42:04,206][25689] Fps is (10 sec: 5359.4, 60 sec: 5710.9, 300 sec: 5748.3). Total num frames: 135652352. Throughput: 0: 5941.4. Samples: 135656854. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:42:04,207][25689] Avg episode reward: [(0, '-57.060')] [2022-07-09 06:42:05,403][26022] Updated weights on worker 0-0, policy_version 132481 (0.00082) [2022-07-09 06:42:06,999][26022] Updated weights on worker 0-0, policy_version 132491 (0.00084) [2022-07-09 06:42:08,874][26022] Updated weights on worker 0-0, policy_version 132501 (0.00079) [2022-07-09 06:42:09,214][25689] Fps is (10 sec: 5491.4, 60 sec: 5745.8, 300 sec: 5744.9). Total num frames: 135682048. Throughput: 0: 5951.6. Samples: 135691396. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:42:09,215][25689] Avg episode reward: [(0, '-57.442')] [2022-07-09 06:42:10,636][26022] Updated weights on worker 0-0, policy_version 132511 (0.00103) [2022-07-09 06:42:12,364][26022] Updated weights on worker 0-0, policy_version 132521 (0.00095) [2022-07-09 06:42:14,250][25689] Fps is (10 sec: 5811.2, 60 sec: 5711.5, 300 sec: 5741.0). Total num frames: 135710720. Throughput: 0: 5928.1. Samples: 135708676. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:42:14,251][25689] Avg episode reward: [(0, '-57.053')] [2022-07-09 06:42:14,336][26022] Updated weights on worker 0-0, policy_version 132531 (0.00091) [2022-07-09 06:42:16,033][26022] Updated weights on worker 0-0, policy_version 132541 (0.00087) [2022-07-09 06:42:18,096][26022] Updated weights on worker 0-0, policy_version 132551 (0.00083) [2022-07-09 06:42:19,254][25689] Fps is (10 sec: 5813.7, 60 sec: 5746.2, 300 sec: 5746.6). Total num frames: 135740416. Throughput: 0: 5888.3. Samples: 135742572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:42:19,254][25689] Avg episode reward: [(0, '-56.970')] [2022-07-09 06:42:19,682][26022] Updated weights on worker 0-0, policy_version 132561 (0.00089) [2022-07-09 06:42:21,435][26022] Updated weights on worker 0-0, policy_version 132571 (0.00087) [2022-07-09 06:42:23,241][26022] Updated weights on worker 0-0, policy_version 132581 (0.00094) [2022-07-09 06:42:24,336][25689] Fps is (10 sec: 5685.9, 60 sec: 5692.3, 300 sec: 5739.2). Total num frames: 135768064. Throughput: 0: 5978.3. Samples: 135777192. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 06:42:24,336][25689] Avg episode reward: [(0, '-56.459')] [2022-07-09 06:42:24,987][26022] Updated weights on worker 0-0, policy_version 132591 (0.00084) [2022-07-09 06:42:26,810][26022] Updated weights on worker 0-0, policy_version 132601 (0.00088) [2022-07-09 06:42:28,606][26022] Updated weights on worker 0-0, policy_version 132611 (0.00081) [2022-07-09 06:42:29,372][25689] Fps is (10 sec: 5465.1, 60 sec: 5708.0, 300 sec: 5738.7). Total num frames: 135795712. Throughput: 0: 5116.9. Samples: 135794542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:29,372][25689] Avg episode reward: [(0, '-57.357')] [2022-07-09 06:42:30,286][26022] Updated weights on worker 0-0, policy_version 132621 (0.00086) [2022-07-09 06:42:32,136][26022] Updated weights on worker 0-0, policy_version 132631 (0.00082) [2022-07-09 06:42:33,521][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:42:33,534][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000132641_135824384.pth [2022-07-09 06:42:33,535][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000130614_133748736.pth [2022-07-09 06:42:33,539][26022] Updated weights on worker 0-0, policy_version 132641 (0.00086) [2022-07-09 06:42:34,427][25689] Fps is (10 sec: 5885.4, 60 sec: 5722.4, 300 sec: 5748.1). Total num frames: 135827456. Throughput: 0: 5988.0. Samples: 135829494. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:34,427][25689] Avg episode reward: [(0, '-58.074')] [2022-07-09 06:42:35,680][26022] Updated weights on worker 0-0, policy_version 132651 (0.00089) [2022-07-09 06:42:37,254][26022] Updated weights on worker 0-0, policy_version 132661 (0.00082) [2022-07-09 06:42:39,082][26022] Updated weights on worker 0-0, policy_version 132671 (0.00088) [2022-07-09 06:42:39,440][25689] Fps is (10 sec: 6102.1, 60 sec: 5745.3, 300 sec: 5746.0). Total num frames: 135857152. Throughput: 0: 6035.9. Samples: 135864416. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:39,441][25689] Avg episode reward: [(0, '-58.014')] [2022-07-09 06:42:41,020][26022] Updated weights on worker 0-0, policy_version 132681 (0.00084) [2022-07-09 06:42:42,529][26022] Updated weights on worker 0-0, policy_version 132691 (0.00088) [2022-07-09 06:42:44,577][25689] Fps is (10 sec: 5649.5, 60 sec: 5718.8, 300 sec: 5744.2). Total num frames: 135884800. Throughput: 0: 5170.6. Samples: 135881852. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:44,578][25689] Avg episode reward: [(0, '-57.919')] [2022-07-09 06:42:44,582][26022] Updated weights on worker 0-0, policy_version 132701 (0.00092) [2022-07-09 06:42:46,227][26022] Updated weights on worker 0-0, policy_version 132711 (0.00084) [2022-07-09 06:42:48,159][26022] Updated weights on worker 0-0, policy_version 132721 (0.00091) [2022-07-09 06:42:49,612][25689] Fps is (10 sec: 5738.2, 60 sec: 5767.9, 300 sec: 5744.1). Total num frames: 135915520. Throughput: 0: 6019.9. Samples: 135916388. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:49,614][25689] Avg episode reward: [(0, '-57.946')] [2022-07-09 06:42:49,831][26022] Updated weights on worker 0-0, policy_version 132731 (0.00087) [2022-07-09 06:42:51,766][26022] Updated weights on worker 0-0, policy_version 132741 (0.00084) [2022-07-09 06:42:53,298][26022] Updated weights on worker 0-0, policy_version 132751 (0.00091) [2022-07-09 06:42:54,622][25689] Fps is (10 sec: 5810.7, 60 sec: 5700.0, 300 sec: 5740.6). Total num frames: 135943168. Throughput: 0: 6028.9. Samples: 135951248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:54,628][25689] Avg episode reward: [(0, '-58.028')] [2022-07-09 06:42:55,122][26022] Updated weights on worker 0-0, policy_version 132761 (0.00092) [2022-07-09 06:42:56,998][26022] Updated weights on worker 0-0, policy_version 132771 (0.00092) [2022-07-09 06:42:58,695][26022] Updated weights on worker 0-0, policy_version 132781 (0.00080) [2022-07-09 06:42:59,635][25689] Fps is (10 sec: 5823.5, 60 sec: 5738.7, 300 sec: 5751.5). Total num frames: 135973888. Throughput: 0: 5159.6. Samples: 135968612. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:42:59,635][25689] Avg episode reward: [(0, '-58.467')] [2022-07-09 06:43:00,395][26022] Updated weights on worker 0-0, policy_version 132791 (0.00093) [2022-07-09 06:43:02,472][26022] Updated weights on worker 0-0, policy_version 132801 (0.00087) [2022-07-09 06:43:04,372][26022] Updated weights on worker 0-0, policy_version 132811 (0.00091) [2022-07-09 06:43:04,764][25689] Fps is (10 sec: 5654.4, 60 sec: 5749.3, 300 sec: 5742.4). Total num frames: 136000512. Throughput: 0: 5919.2. Samples: 136001340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:04,764][25689] Avg episode reward: [(0, '-58.383')] [2022-07-09 06:43:06,106][26022] Updated weights on worker 0-0, policy_version 132821 (0.00091) [2022-07-09 06:43:07,787][26022] Updated weights on worker 0-0, policy_version 132831 (0.00091) [2022-07-09 06:43:09,626][26022] Updated weights on worker 0-0, policy_version 132841 (0.00086) [2022-07-09 06:43:09,778][25689] Fps is (10 sec: 5451.6, 60 sec: 5731.7, 300 sec: 5738.7). Total num frames: 136029184. Throughput: 0: 5944.0. Samples: 136036254. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:09,779][25689] Avg episode reward: [(0, '-58.293')] [2022-07-09 06:43:11,405][26022] Updated weights on worker 0-0, policy_version 132851 (0.00089) [2022-07-09 06:43:13,112][26022] Updated weights on worker 0-0, policy_version 132861 (0.00086) [2022-07-09 06:43:14,789][25689] Fps is (10 sec: 5720.0, 60 sec: 5734.2, 300 sec: 5740.2). Total num frames: 136057856. Throughput: 0: 5082.2. Samples: 136053740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:14,790][25689] Avg episode reward: [(0, '-58.439')] [2022-07-09 06:43:15,059][26022] Updated weights on worker 0-0, policy_version 132871 (0.00097) [2022-07-09 06:43:16,681][26022] Updated weights on worker 0-0, policy_version 132881 (0.00083) [2022-07-09 06:43:18,502][26022] Updated weights on worker 0-0, policy_version 132891 (0.00086) [2022-07-09 06:43:19,810][25689] Fps is (10 sec: 5818.1, 60 sec: 5732.4, 300 sec: 5740.5). Total num frames: 136087552. Throughput: 0: 5927.0. Samples: 136088190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:19,812][25689] Avg episode reward: [(0, '-59.643')] [2022-07-09 06:43:20,130][26022] Updated weights on worker 0-0, policy_version 132901 (0.00085) [2022-07-09 06:43:22,094][26022] Updated weights on worker 0-0, policy_version 132911 (0.00091) [2022-07-09 06:43:23,721][26022] Updated weights on worker 0-0, policy_version 132921 (0.00089) [2022-07-09 06:43:24,937][25689] Fps is (10 sec: 5751.6, 60 sec: 5745.1, 300 sec: 5739.0). Total num frames: 136116224. Throughput: 0: 6040.2. Samples: 136123190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:24,937][25689] Avg episode reward: [(0, '-58.384')] [2022-07-09 06:43:25,289][26022] Updated weights on worker 0-0, policy_version 132931 (0.00085) [2022-07-09 06:43:27,521][26022] Updated weights on worker 0-0, policy_version 132941 (0.00085) [2022-07-09 06:43:28,979][26022] Updated weights on worker 0-0, policy_version 132951 (0.00086) [2022-07-09 06:43:29,949][25689] Fps is (10 sec: 5756.6, 60 sec: 5781.1, 300 sec: 5735.7). Total num frames: 136145920. Throughput: 0: 5160.8. Samples: 136140354. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:29,950][25689] Avg episode reward: [(0, '-58.149')] [2022-07-09 06:43:31,075][26022] Updated weights on worker 0-0, policy_version 132961 (0.00085) [2022-07-09 06:43:32,593][26022] Updated weights on worker 0-0, policy_version 132971 (0.00080) [2022-07-09 06:43:34,419][26022] Updated weights on worker 0-0, policy_version 132981 (0.00089) [2022-07-09 06:43:34,974][25689] Fps is (10 sec: 6019.2, 60 sec: 5767.1, 300 sec: 5743.5). Total num frames: 136176640. Throughput: 0: 6010.3. Samples: 136175060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:34,975][25689] Avg episode reward: [(0, '-57.429')] [2022-07-09 06:43:36,114][26022] Updated weights on worker 0-0, policy_version 132991 (0.00479) [2022-07-09 06:43:37,843][26022] Updated weights on worker 0-0, policy_version 133001 (0.00083) [2022-07-09 06:43:39,629][26022] Updated weights on worker 0-0, policy_version 133011 (0.00082) [2022-07-09 06:43:39,988][25689] Fps is (10 sec: 5814.2, 60 sec: 5733.2, 300 sec: 5738.2). Total num frames: 136204288. Throughput: 0: 6044.8. Samples: 136210164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:39,989][25689] Avg episode reward: [(0, '-58.189')] [2022-07-09 06:43:41,461][26022] Updated weights on worker 0-0, policy_version 133021 (0.00081) [2022-07-09 06:43:43,078][26022] Updated weights on worker 0-0, policy_version 133031 (0.00092) [2022-07-09 06:43:44,951][26022] Updated weights on worker 0-0, policy_version 133041 (0.00093) [2022-07-09 06:43:45,047][25689] Fps is (10 sec: 5693.1, 60 sec: 5774.6, 300 sec: 5740.8). Total num frames: 136233984. Throughput: 0: 5199.5. Samples: 136227750. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:45,047][25689] Avg episode reward: [(0, '-58.125')] [2022-07-09 06:43:46,619][26022] Updated weights on worker 0-0, policy_version 133051 (0.00085) [2022-07-09 06:43:48,500][26022] Updated weights on worker 0-0, policy_version 133061 (0.00080) [2022-07-09 06:43:50,075][25689] Fps is (10 sec: 5888.4, 60 sec: 5758.3, 300 sec: 5747.6). Total num frames: 136263680. Throughput: 0: 6070.3. Samples: 136262520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:50,075][25689] Avg episode reward: [(0, '-57.511')] [2022-07-09 06:43:50,154][26022] Updated weights on worker 0-0, policy_version 133071 (0.00061) [2022-07-09 06:43:52,113][26022] Updated weights on worker 0-0, policy_version 133081 (0.00092) [2022-07-09 06:43:53,827][26022] Updated weights on worker 0-0, policy_version 133091 (0.00085) [2022-07-09 06:43:55,100][25689] Fps is (10 sec: 5704.1, 60 sec: 5756.8, 300 sec: 5736.9). Total num frames: 136291328. Throughput: 0: 6062.7. Samples: 136297076. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:43:55,101][25689] Avg episode reward: [(0, '-58.069')] [2022-07-09 06:43:55,808][26022] Updated weights on worker 0-0, policy_version 133101 (0.00089) [2022-07-09 06:43:57,399][26022] Updated weights on worker 0-0, policy_version 133111 (0.00092) [2022-07-09 06:43:59,443][26022] Updated weights on worker 0-0, policy_version 133121 (0.00090) [2022-07-09 06:44:00,145][25689] Fps is (10 sec: 5796.0, 60 sec: 5753.8, 300 sec: 5755.2). Total num frames: 136322048. Throughput: 0: 5168.7. Samples: 136314348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:00,146][25689] Avg episode reward: [(0, '-58.748')] [2022-07-09 06:44:00,755][26022] Updated weights on worker 0-0, policy_version 133131 (0.00084) [2022-07-09 06:44:03,218][26022] Updated weights on worker 0-0, policy_version 133141 (0.00085) [2022-07-09 06:44:04,812][26022] Updated weights on worker 0-0, policy_version 133151 (0.00085) [2022-07-09 06:44:05,194][25689] Fps is (10 sec: 5680.8, 60 sec: 5761.3, 300 sec: 5737.3). Total num frames: 136348672. Throughput: 0: 5919.5. Samples: 136347014. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:05,195][25689] Avg episode reward: [(0, '-58.695')] [2022-07-09 06:44:06,640][26022] Updated weights on worker 0-0, policy_version 133161 (0.00086) [2022-07-09 06:44:08,376][26022] Updated weights on worker 0-0, policy_version 133171 (0.00096) [2022-07-09 06:44:10,121][26022] Updated weights on worker 0-0, policy_version 133181 (0.00086) [2022-07-09 06:44:10,219][25689] Fps is (10 sec: 5488.8, 60 sec: 5760.3, 300 sec: 5740.5). Total num frames: 136377344. Throughput: 0: 5921.8. Samples: 136381814. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:10,220][25689] Avg episode reward: [(0, '-57.554')] [2022-07-09 06:44:11,690][26022] Updated weights on worker 0-0, policy_version 133191 (0.00081) [2022-07-09 06:44:13,826][26022] Updated weights on worker 0-0, policy_version 133201 (0.00090) [2022-07-09 06:44:15,247][25689] Fps is (10 sec: 5806.0, 60 sec: 5775.6, 300 sec: 5747.6). Total num frames: 136407040. Throughput: 0: 5066.8. Samples: 136399160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:15,249][25689] Avg episode reward: [(0, '-56.846')] [2022-07-09 06:44:15,338][26022] Updated weights on worker 0-0, policy_version 133211 (0.00091) [2022-07-09 06:44:17,361][26022] Updated weights on worker 0-0, policy_version 133221 (0.00087) [2022-07-09 06:44:19,079][26022] Updated weights on worker 0-0, policy_version 133231 (0.00089) [2022-07-09 06:44:20,256][25689] Fps is (10 sec: 5713.4, 60 sec: 5742.9, 300 sec: 5738.0). Total num frames: 136434688. Throughput: 0: 5936.1. Samples: 136433730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:20,257][25689] Avg episode reward: [(0, '-56.975')] [2022-07-09 06:44:20,879][26022] Updated weights on worker 0-0, policy_version 133241 (0.00087) [2022-07-09 06:44:22,645][26022] Updated weights on worker 0-0, policy_version 133251 (0.00087) [2022-07-09 06:44:24,402][26022] Updated weights on worker 0-0, policy_version 133261 (0.00091) [2022-07-09 06:44:25,302][25689] Fps is (10 sec: 5601.5, 60 sec: 5750.6, 300 sec: 5740.9). Total num frames: 136463360. Throughput: 0: 6050.7. Samples: 136468678. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:25,302][25689] Avg episode reward: [(0, '-57.186')] [2022-07-09 06:44:25,994][26022] Updated weights on worker 0-0, policy_version 133271 (0.00092) [2022-07-09 06:44:27,868][26022] Updated weights on worker 0-0, policy_version 133281 (0.00085) [2022-07-09 06:44:29,639][26022] Updated weights on worker 0-0, policy_version 133291 (0.00083) [2022-07-09 06:44:30,336][25689] Fps is (10 sec: 5790.7, 60 sec: 5748.6, 300 sec: 5740.8). Total num frames: 136493056. Throughput: 0: 5167.2. Samples: 136485758. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 06:44:30,337][25689] Avg episode reward: [(0, '-57.327')] [2022-07-09 06:44:31,553][26022] Updated weights on worker 0-0, policy_version 133301 (0.00085) [2022-07-09 06:44:33,363][26022] Updated weights on worker 0-0, policy_version 133311 (0.00089) [2022-07-09 06:44:33,595][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:44:33,606][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000133312_136511488.pth [2022-07-09 06:44:33,606][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000131294_134445056.pth [2022-07-09 06:44:34,929][26022] Updated weights on worker 0-0, policy_version 133321 (0.00087) [2022-07-09 06:44:35,359][25689] Fps is (10 sec: 5905.6, 60 sec: 5731.8, 300 sec: 5744.2). Total num frames: 136522752. Throughput: 0: 6031.0. Samples: 136520452. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:44:35,359][25689] Avg episode reward: [(0, '-57.910')] [2022-07-09 06:44:37,039][26022] Updated weights on worker 0-0, policy_version 133331 (0.00109) [2022-07-09 06:44:38,585][26022] Updated weights on worker 0-0, policy_version 133341 (0.00084) [2022-07-09 06:44:40,362][25689] Fps is (10 sec: 5617.1, 60 sec: 5715.9, 300 sec: 5735.0). Total num frames: 136549376. Throughput: 0: 6021.8. Samples: 136554804. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:44:40,363][25689] Avg episode reward: [(0, '-57.184')] [2022-07-09 06:44:40,491][26022] Updated weights on worker 0-0, policy_version 133351 (0.00084) [2022-07-09 06:44:42,169][26022] Updated weights on worker 0-0, policy_version 133361 (0.00081) [2022-07-09 06:44:43,957][26022] Updated weights on worker 0-0, policy_version 133371 (0.00087) [2022-07-09 06:44:45,408][25689] Fps is (10 sec: 5604.6, 60 sec: 5717.1, 300 sec: 5738.2). Total num frames: 136579072. Throughput: 0: 5989.4. Samples: 136589100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:44:45,408][25689] Avg episode reward: [(0, '-58.048')] [2022-07-09 06:44:45,816][26022] Updated weights on worker 0-0, policy_version 133381 (0.00100) [2022-07-09 06:44:47,527][26022] Updated weights on worker 0-0, policy_version 133391 (0.00083) [2022-07-09 06:44:49,310][26022] Updated weights on worker 0-0, policy_version 133401 (0.00093) [2022-07-09 06:44:50,476][25689] Fps is (10 sec: 5872.6, 60 sec: 5713.3, 300 sec: 5733.6). Total num frames: 136608768. Throughput: 0: 5976.7. Samples: 136606128. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:44:50,477][25689] Avg episode reward: [(0, '-57.221')] [2022-07-09 06:44:51,295][26022] Updated weights on worker 0-0, policy_version 133411 (0.00084) [2022-07-09 06:44:52,956][26022] Updated weights on worker 0-0, policy_version 133421 (0.00058) [2022-07-09 06:44:54,710][26022] Updated weights on worker 0-0, policy_version 133431 (0.00078) [2022-07-09 06:44:55,481][25689] Fps is (10 sec: 5895.9, 60 sec: 5749.1, 300 sec: 5740.7). Total num frames: 136638464. Throughput: 0: 5986.9. Samples: 136640922. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:44:55,481][25689] Avg episode reward: [(0, '-57.275')] [2022-07-09 06:44:56,543][26022] Updated weights on worker 0-0, policy_version 133441 (0.00093) [2022-07-09 06:44:58,235][26022] Updated weights on worker 0-0, policy_version 133451 (0.00093) [2022-07-09 06:44:59,952][26022] Updated weights on worker 0-0, policy_version 133461 (0.00087) [2022-07-09 06:45:00,503][25689] Fps is (10 sec: 5820.9, 60 sec: 5717.4, 300 sec: 5746.0). Total num frames: 136667136. Throughput: 0: 6018.4. Samples: 136676020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:00,503][25689] Avg episode reward: [(0, '-56.226')] [2022-07-09 06:45:01,791][26022] Updated weights on worker 0-0, policy_version 133471 (0.00082) [2022-07-09 06:45:03,813][26022] Updated weights on worker 0-0, policy_version 133481 (0.00086) [2022-07-09 06:45:05,632][25689] Fps is (10 sec: 5447.1, 60 sec: 5709.8, 300 sec: 5731.4). Total num frames: 136693760. Throughput: 0: 5047.2. Samples: 136691178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:05,633][25689] Avg episode reward: [(0, '-57.472')] [2022-07-09 06:45:05,771][26022] Updated weights on worker 0-0, policy_version 133491 (0.00096) [2022-07-09 06:45:07,293][26022] Updated weights on worker 0-0, policy_version 133501 (0.00080) [2022-07-09 06:45:09,191][26022] Updated weights on worker 0-0, policy_version 133511 (0.00613) [2022-07-09 06:45:10,645][25689] Fps is (10 sec: 5553.3, 60 sec: 5727.9, 300 sec: 5742.4). Total num frames: 136723456. Throughput: 0: 5946.5. Samples: 136726064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:10,645][25689] Avg episode reward: [(0, '-58.756')] [2022-07-09 06:45:11,056][26022] Updated weights on worker 0-0, policy_version 133521 (0.00092) [2022-07-09 06:45:12,831][26022] Updated weights on worker 0-0, policy_version 133531 (0.00085) [2022-07-09 06:45:14,676][26022] Updated weights on worker 0-0, policy_version 133541 (0.00083) [2022-07-09 06:45:15,662][25689] Fps is (10 sec: 5819.5, 60 sec: 5712.0, 300 sec: 5732.4). Total num frames: 136752128. Throughput: 0: 5950.7. Samples: 136761014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:15,663][25689] Avg episode reward: [(0, '-57.867')] [2022-07-09 06:45:16,180][26022] Updated weights on worker 0-0, policy_version 133551 (0.00092) [2022-07-09 06:45:17,900][26022] Updated weights on worker 0-0, policy_version 133561 (0.00088) [2022-07-09 06:45:19,922][26022] Updated weights on worker 0-0, policy_version 133571 (0.00098) [2022-07-09 06:45:20,674][25689] Fps is (10 sec: 5819.5, 60 sec: 5745.5, 300 sec: 5740.6). Total num frames: 136781824. Throughput: 0: 5073.1. Samples: 136778352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:20,675][25689] Avg episode reward: [(0, '-58.246')] [2022-07-09 06:45:21,426][26022] Updated weights on worker 0-0, policy_version 133581 (0.00087) [2022-07-09 06:45:23,405][26022] Updated weights on worker 0-0, policy_version 133591 (0.00086) [2022-07-09 06:45:25,060][26022] Updated weights on worker 0-0, policy_version 133601 (0.00090) [2022-07-09 06:45:25,761][25689] Fps is (10 sec: 5678.0, 60 sec: 5724.7, 300 sec: 5737.4). Total num frames: 136809472. Throughput: 0: 6063.3. Samples: 136813226. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:25,762][25689] Avg episode reward: [(0, '-58.113')] [2022-07-09 06:45:26,812][26022] Updated weights on worker 0-0, policy_version 133611 (0.00085) [2022-07-09 06:45:28,675][26022] Updated weights on worker 0-0, policy_version 133621 (0.00084) [2022-07-09 06:45:30,535][26022] Updated weights on worker 0-0, policy_version 133631 (0.00091) [2022-07-09 06:45:30,847][25689] Fps is (10 sec: 5637.1, 60 sec: 5719.8, 300 sec: 5736.3). Total num frames: 136839168. Throughput: 0: 6022.0. Samples: 136847722. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:30,847][25689] Avg episode reward: [(0, '-58.170')] [2022-07-09 06:45:32,095][26022] Updated weights on worker 0-0, policy_version 133641 (0.00085) [2022-07-09 06:45:34,032][26022] Updated weights on worker 0-0, policy_version 133651 (0.00051) [2022-07-09 06:45:35,728][26022] Updated weights on worker 0-0, policy_version 133661 (0.00091) [2022-07-09 06:45:35,899][25689] Fps is (10 sec: 5858.7, 60 sec: 5717.1, 300 sec: 5735.7). Total num frames: 136868864. Throughput: 0: 5148.6. Samples: 136865204. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:35,899][25689] Avg episode reward: [(0, '-57.614')] [2022-07-09 06:45:37,635][26022] Updated weights on worker 0-0, policy_version 133671 (0.00091) [2022-07-09 06:45:39,147][26022] Updated weights on worker 0-0, policy_version 133681 (0.00085) [2022-07-09 06:45:40,921][25689] Fps is (10 sec: 5895.6, 60 sec: 5766.0, 300 sec: 5742.9). Total num frames: 136898560. Throughput: 0: 6002.4. Samples: 136899880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:40,921][25689] Avg episode reward: [(0, '-58.955')] [2022-07-09 06:45:41,000][26022] Updated weights on worker 0-0, policy_version 133691 (0.00089) [2022-07-09 06:45:42,838][26022] Updated weights on worker 0-0, policy_version 133701 (0.00103) [2022-07-09 06:45:44,561][26022] Updated weights on worker 0-0, policy_version 133711 (0.00087) [2022-07-09 06:45:45,995][25689] Fps is (10 sec: 5781.2, 60 sec: 5746.4, 300 sec: 5734.9). Total num frames: 136927232. Throughput: 0: 6003.9. Samples: 136934706. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:45,995][25689] Avg episode reward: [(0, '-59.373')] [2022-07-09 06:45:46,357][26022] Updated weights on worker 0-0, policy_version 133721 (0.00082) [2022-07-09 06:45:48,229][26022] Updated weights on worker 0-0, policy_version 133731 (0.00086) [2022-07-09 06:45:49,854][26022] Updated weights on worker 0-0, policy_version 133741 (0.00084) [2022-07-09 06:45:50,998][25689] Fps is (10 sec: 5792.1, 60 sec: 5752.6, 300 sec: 5741.9). Total num frames: 136956928. Throughput: 0: 5171.3. Samples: 136951930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:50,999][25689] Avg episode reward: [(0, '-58.787')] [2022-07-09 06:45:51,978][26022] Updated weights on worker 0-0, policy_version 133751 (0.00091) [2022-07-09 06:45:53,405][26022] Updated weights on worker 0-0, policy_version 133761 (0.00095) [2022-07-09 06:45:55,379][26022] Updated weights on worker 0-0, policy_version 133771 (0.00092) [2022-07-09 06:45:56,024][25689] Fps is (10 sec: 5819.7, 60 sec: 5733.7, 300 sec: 5738.1). Total num frames: 136985600. Throughput: 0: 6026.5. Samples: 136986490. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:45:56,025][25689] Avg episode reward: [(0, '-58.843')] [2022-07-09 06:45:57,023][26022] Updated weights on worker 0-0, policy_version 133781 (0.00084) [2022-07-09 06:45:58,713][26022] Updated weights on worker 0-0, policy_version 133791 (0.00083) [2022-07-09 06:46:00,472][26022] Updated weights on worker 0-0, policy_version 133801 (0.00088) [2022-07-09 06:46:01,049][25689] Fps is (10 sec: 5807.6, 60 sec: 5750.4, 300 sec: 5745.8). Total num frames: 137015296. Throughput: 0: 6047.5. Samples: 137021600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:01,049][25689] Avg episode reward: [(0, '-58.473')] [2022-07-09 06:46:02,862][26022] Updated weights on worker 0-0, policy_version 133811 (0.00084) [2022-07-09 06:46:04,207][26022] Updated weights on worker 0-0, policy_version 133821 (0.00085) [2022-07-09 06:46:06,183][25689] Fps is (10 sec: 5443.3, 60 sec: 5733.0, 300 sec: 5736.6). Total num frames: 137040896. Throughput: 0: 5070.5. Samples: 137037070. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:06,183][25689] Avg episode reward: [(0, '-58.076')] [2022-07-09 06:46:06,375][26022] Updated weights on worker 0-0, policy_version 133831 (0.00084) [2022-07-09 06:46:08,039][26022] Updated weights on worker 0-0, policy_version 133841 (0.00080) [2022-07-09 06:46:09,934][26022] Updated weights on worker 0-0, policy_version 133851 (0.00077) [2022-07-09 06:46:11,196][25689] Fps is (10 sec: 5550.0, 60 sec: 5749.8, 300 sec: 5737.0). Total num frames: 137071616. Throughput: 0: 5923.0. Samples: 137071562. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:11,197][25689] Avg episode reward: [(0, '-57.218')] [2022-07-09 06:46:11,559][26022] Updated weights on worker 0-0, policy_version 133861 (0.00083) [2022-07-09 06:46:13,323][26022] Updated weights on worker 0-0, policy_version 133871 (0.00090) [2022-07-09 06:46:15,118][26022] Updated weights on worker 0-0, policy_version 133881 (0.00082) [2022-07-09 06:46:16,254][25689] Fps is (10 sec: 5897.6, 60 sec: 5746.0, 300 sec: 5739.6). Total num frames: 137100288. Throughput: 0: 5919.3. Samples: 137106232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:16,254][25689] Avg episode reward: [(0, '-58.192')] [2022-07-09 06:46:16,943][26022] Updated weights on worker 0-0, policy_version 133891 (0.00086) [2022-07-09 06:46:18,674][26022] Updated weights on worker 0-0, policy_version 133901 (0.00099) [2022-07-09 06:46:20,486][26022] Updated weights on worker 0-0, policy_version 133911 (0.00092) [2022-07-09 06:46:21,287][25689] Fps is (10 sec: 5682.8, 60 sec: 5727.1, 300 sec: 5733.0). Total num frames: 137128960. Throughput: 0: 5027.1. Samples: 137123342. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:21,288][25689] Avg episode reward: [(0, '-57.896')] [2022-07-09 06:46:22,473][26022] Updated weights on worker 0-0, policy_version 133921 (0.00093) [2022-07-09 06:46:23,792][26022] Updated weights on worker 0-0, policy_version 133931 (0.00085) [2022-07-09 06:46:25,955][26022] Updated weights on worker 0-0, policy_version 133941 (0.00084) [2022-07-09 06:46:26,396][25689] Fps is (10 sec: 5754.9, 60 sec: 5758.8, 300 sec: 5741.6). Total num frames: 137158656. Throughput: 0: 5986.2. Samples: 137158068. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:26,397][25689] Avg episode reward: [(0, '-57.852')] [2022-07-09 06:46:27,599][26022] Updated weights on worker 0-0, policy_version 133951 (0.00090) [2022-07-09 06:46:29,447][26022] Updated weights on worker 0-0, policy_version 133961 (0.00086) [2022-07-09 06:46:31,180][26022] Updated weights on worker 0-0, policy_version 133971 (0.00079) [2022-07-09 06:46:31,465][25689] Fps is (10 sec: 5735.1, 60 sec: 5743.6, 300 sec: 5734.0). Total num frames: 137187328. Throughput: 0: 5984.5. Samples: 137192858. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:31,467][25689] Avg episode reward: [(0, '-58.280')] [2022-07-09 06:46:32,808][26022] Updated weights on worker 0-0, policy_version 133981 (0.00080) [2022-07-09 06:46:33,631][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:46:33,644][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000133986_137201664.pth [2022-07-09 06:46:33,644][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000131967_135134208.pth [2022-07-09 06:46:34,718][26022] Updated weights on worker 0-0, policy_version 133991 (0.00091) [2022-07-09 06:46:36,460][26022] Updated weights on worker 0-0, policy_version 134001 (0.00081) [2022-07-09 06:46:36,491][25689] Fps is (10 sec: 5781.6, 60 sec: 5745.9, 300 sec: 5738.4). Total num frames: 137217024. Throughput: 0: 5132.1. Samples: 137210094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 06:46:36,492][25689] Avg episode reward: [(0, '-58.366')] [2022-07-09 06:46:38,135][26022] Updated weights on worker 0-0, policy_version 134011 (0.00088) [2022-07-09 06:46:39,862][26022] Updated weights on worker 0-0, policy_version 134021 (0.00091) [2022-07-09 06:46:41,555][25689] Fps is (10 sec: 5784.5, 60 sec: 5725.1, 300 sec: 5737.8). Total num frames: 137245696. Throughput: 0: 6003.0. Samples: 137245010. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:46:41,556][25689] Avg episode reward: [(0, '-57.395')] [2022-07-09 06:46:41,702][26022] Updated weights on worker 0-0, policy_version 134031 (0.00084) [2022-07-09 06:46:43,352][26022] Updated weights on worker 0-0, policy_version 134041 (0.00063) [2022-07-09 06:46:45,274][26022] Updated weights on worker 0-0, policy_version 134051 (0.00092) [2022-07-09 06:46:46,646][25689] Fps is (10 sec: 5849.0, 60 sec: 5757.3, 300 sec: 5746.8). Total num frames: 137276416. Throughput: 0: 6013.9. Samples: 137279848. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:46:46,646][25689] Avg episode reward: [(0, '-57.230')] [2022-07-09 06:46:46,955][26022] Updated weights on worker 0-0, policy_version 134061 (0.00087) [2022-07-09 06:46:48,752][26022] Updated weights on worker 0-0, policy_version 134071 (0.00091) [2022-07-09 06:46:50,273][26022] Updated weights on worker 0-0, policy_version 134081 (0.00084) [2022-07-09 06:46:51,738][25689] Fps is (10 sec: 5732.1, 60 sec: 5715.2, 300 sec: 5731.4). Total num frames: 137304064. Throughput: 0: 6017.6. Samples: 137314854. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:46:51,738][25689] Avg episode reward: [(0, '-57.177')] [2022-07-09 06:46:52,311][26022] Updated weights on worker 0-0, policy_version 134091 (0.00088) [2022-07-09 06:46:53,843][26022] Updated weights on worker 0-0, policy_version 134101 (0.00087) [2022-07-09 06:46:55,791][26022] Updated weights on worker 0-0, policy_version 134111 (0.00084) [2022-07-09 06:46:56,751][25689] Fps is (10 sec: 5775.7, 60 sec: 5750.1, 300 sec: 5739.2). Total num frames: 137334784. Throughput: 0: 6037.4. Samples: 137332412. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:46:56,752][25689] Avg episode reward: [(0, '-57.185')] [2022-07-09 06:46:57,346][26022] Updated weights on worker 0-0, policy_version 134121 (0.00051) [2022-07-09 06:46:59,377][26022] Updated weights on worker 0-0, policy_version 134131 (0.00086) [2022-07-09 06:47:01,068][26022] Updated weights on worker 0-0, policy_version 134141 (0.00085) [2022-07-09 06:47:01,813][25689] Fps is (10 sec: 5792.9, 60 sec: 5712.8, 300 sec: 5746.1). Total num frames: 137362432. Throughput: 0: 6041.1. Samples: 137367394. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:01,827][25689] Avg episode reward: [(0, '-57.235')] [2022-07-09 06:47:03,214][26022] Updated weights on worker 0-0, policy_version 134151 (0.00086) [2022-07-09 06:47:04,900][26022] Updated weights on worker 0-0, policy_version 134161 (0.00092) [2022-07-09 06:47:06,813][26022] Updated weights on worker 0-0, policy_version 134171 (0.00091) [2022-07-09 06:47:06,911][25689] Fps is (10 sec: 5644.2, 60 sec: 5783.7, 300 sec: 5744.4). Total num frames: 137392128. Throughput: 0: 5925.9. Samples: 137399940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:06,912][25689] Avg episode reward: [(0, '-56.958')] [2022-07-09 06:47:08,447][26022] Updated weights on worker 0-0, policy_version 134181 (0.00093) [2022-07-09 06:47:10,295][26022] Updated weights on worker 0-0, policy_version 134191 (0.00089) [2022-07-09 06:47:11,935][25689] Fps is (10 sec: 5766.7, 60 sec: 5748.9, 300 sec: 5744.6). Total num frames: 137420800. Throughput: 0: 5079.0. Samples: 137417436. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:11,936][25689] Avg episode reward: [(0, '-56.894')] [2022-07-09 06:47:11,944][26022] Updated weights on worker 0-0, policy_version 134201 (0.00085) [2022-07-09 06:47:13,731][26022] Updated weights on worker 0-0, policy_version 134211 (0.00090) [2022-07-09 06:47:15,580][26022] Updated weights on worker 0-0, policy_version 134221 (0.00088) [2022-07-09 06:47:16,945][25689] Fps is (10 sec: 5817.4, 60 sec: 5770.3, 300 sec: 5744.5). Total num frames: 137450496. Throughput: 0: 5943.5. Samples: 137452430. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:16,945][25689] Avg episode reward: [(0, '-58.064')] [2022-07-09 06:47:17,316][26022] Updated weights on worker 0-0, policy_version 134231 (0.00083) [2022-07-09 06:47:19,170][26022] Updated weights on worker 0-0, policy_version 134241 (0.00085) [2022-07-09 06:47:20,888][26022] Updated weights on worker 0-0, policy_version 134251 (0.00092) [2022-07-09 06:47:21,958][25689] Fps is (10 sec: 5823.2, 60 sec: 5772.2, 300 sec: 5749.2). Total num frames: 137479168. Throughput: 0: 5954.9. Samples: 137487354. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:21,959][25689] Avg episode reward: [(0, '-58.419')] [2022-07-09 06:47:22,560][26022] Updated weights on worker 0-0, policy_version 134261 (0.00085) [2022-07-09 06:47:24,509][26022] Updated weights on worker 0-0, policy_version 134271 (0.00081) [2022-07-09 06:47:26,226][26022] Updated weights on worker 0-0, policy_version 134281 (0.00080) [2022-07-09 06:47:27,073][25689] Fps is (10 sec: 5661.5, 60 sec: 5754.8, 300 sec: 5751.2). Total num frames: 137507840. Throughput: 0: 5196.5. Samples: 137504712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:27,074][25689] Avg episode reward: [(0, '-59.293')] [2022-07-09 06:47:27,931][26022] Updated weights on worker 0-0, policy_version 134291 (0.00093) [2022-07-09 06:47:29,778][26022] Updated weights on worker 0-0, policy_version 134301 (0.00090) [2022-07-09 06:47:31,528][26022] Updated weights on worker 0-0, policy_version 134311 (0.00081) [2022-07-09 06:47:32,093][25689] Fps is (10 sec: 5759.0, 60 sec: 5776.3, 300 sec: 5745.0). Total num frames: 137537536. Throughput: 0: 6046.0. Samples: 137539314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:32,094][25689] Avg episode reward: [(0, '-59.777')] [2022-07-09 06:47:33,324][26022] Updated weights on worker 0-0, policy_version 134321 (0.00095) [2022-07-09 06:47:35,030][26022] Updated weights on worker 0-0, policy_version 134331 (0.00096) [2022-07-09 06:47:37,035][26022] Updated weights on worker 0-0, policy_version 134341 (0.00086) [2022-07-09 06:47:37,118][25689] Fps is (10 sec: 5708.7, 60 sec: 5742.7, 300 sec: 5737.9). Total num frames: 137565184. Throughput: 0: 6003.9. Samples: 137573550. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:37,119][25689] Avg episode reward: [(0, '-60.363')] [2022-07-09 06:47:38,573][26022] Updated weights on worker 0-0, policy_version 134351 (0.00085) [2022-07-09 06:47:40,401][26022] Updated weights on worker 0-0, policy_version 134361 (0.00087) [2022-07-09 06:47:42,136][25689] Fps is (10 sec: 5608.1, 60 sec: 5747.0, 300 sec: 5743.6). Total num frames: 137593856. Throughput: 0: 5122.8. Samples: 137590720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:42,137][25689] Avg episode reward: [(0, '-59.732')] [2022-07-09 06:47:42,328][26022] Updated weights on worker 0-0, policy_version 134371 (0.00090) [2022-07-09 06:47:43,880][26022] Updated weights on worker 0-0, policy_version 134381 (0.00085) [2022-07-09 06:47:45,687][26022] Updated weights on worker 0-0, policy_version 134391 (0.00088) [2022-07-09 06:47:47,198][25689] Fps is (10 sec: 5790.5, 60 sec: 5732.8, 300 sec: 5739.6). Total num frames: 137623552. Throughput: 0: 6007.2. Samples: 137625608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:47,199][25689] Avg episode reward: [(0, '-58.723')] [2022-07-09 06:47:47,626][26022] Updated weights on worker 0-0, policy_version 134401 (0.00088) [2022-07-09 06:47:49,219][26022] Updated weights on worker 0-0, policy_version 134411 (0.00090) [2022-07-09 06:47:51,169][26022] Updated weights on worker 0-0, policy_version 134421 (0.00084) [2022-07-09 06:47:52,277][25689] Fps is (10 sec: 5957.5, 60 sec: 5784.8, 300 sec: 5748.6). Total num frames: 137654272. Throughput: 0: 5999.9. Samples: 137660416. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:52,278][25689] Avg episode reward: [(0, '-57.153')] [2022-07-09 06:47:52,701][26022] Updated weights on worker 0-0, policy_version 134431 (0.00084) [2022-07-09 06:47:54,706][26022] Updated weights on worker 0-0, policy_version 134441 (0.00081) [2022-07-09 06:47:56,315][26022] Updated weights on worker 0-0, policy_version 134451 (0.00081) [2022-07-09 06:47:57,327][25689] Fps is (10 sec: 5863.5, 60 sec: 5747.5, 300 sec: 5741.0). Total num frames: 137682944. Throughput: 0: 5164.5. Samples: 137677920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:47:57,328][25689] Avg episode reward: [(0, '-57.598')] [2022-07-09 06:47:58,223][26022] Updated weights on worker 0-0, policy_version 134461 (0.00085) [2022-07-09 06:47:59,907][26022] Updated weights on worker 0-0, policy_version 134471 (0.00087) [2022-07-09 06:48:02,123][26022] Updated weights on worker 0-0, policy_version 134481 (0.00078) [2022-07-09 06:48:02,334][25689] Fps is (10 sec: 5498.2, 60 sec: 5735.8, 300 sec: 5743.4). Total num frames: 137709568. Throughput: 0: 6032.9. Samples: 137712574. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:02,335][25689] Avg episode reward: [(0, '-58.092')] [2022-07-09 06:48:03,807][26022] Updated weights on worker 0-0, policy_version 134491 (0.00088) [2022-07-09 06:48:05,474][26022] Updated weights on worker 0-0, policy_version 134501 (0.00080) [2022-07-09 06:48:07,354][26022] Updated weights on worker 0-0, policy_version 134511 (0.00082) [2022-07-09 06:48:07,379][25689] Fps is (10 sec: 5603.2, 60 sec: 5740.9, 300 sec: 5746.2). Total num frames: 137739264. Throughput: 0: 5939.5. Samples: 137745470. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:07,379][25689] Avg episode reward: [(0, '-56.883')] [2022-07-09 06:48:08,981][26022] Updated weights on worker 0-0, policy_version 134521 (0.00087) [2022-07-09 06:48:10,664][26022] Updated weights on worker 0-0, policy_version 134531 (0.00089) [2022-07-09 06:48:12,418][25689] Fps is (10 sec: 5788.2, 60 sec: 5739.4, 300 sec: 5745.7). Total num frames: 137767936. Throughput: 0: 5098.0. Samples: 137763094. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:12,419][25689] Avg episode reward: [(0, '-56.351')] [2022-07-09 06:48:12,635][26022] Updated weights on worker 0-0, policy_version 134541 (0.00086) [2022-07-09 06:48:14,307][26022] Updated weights on worker 0-0, policy_version 134551 (0.00088) [2022-07-09 06:48:16,026][26022] Updated weights on worker 0-0, policy_version 134561 (0.00085) [2022-07-09 06:48:17,425][25689] Fps is (10 sec: 5911.7, 60 sec: 5756.5, 300 sec: 5749.4). Total num frames: 137798656. Throughput: 0: 5983.1. Samples: 137798168. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:17,428][25689] Avg episode reward: [(0, '-56.145')] [2022-07-09 06:48:17,682][26022] Updated weights on worker 0-0, policy_version 134571 (0.00093) [2022-07-09 06:48:19,572][26022] Updated weights on worker 0-0, policy_version 134581 (0.00091) [2022-07-09 06:48:21,429][26022] Updated weights on worker 0-0, policy_version 134591 (0.00807) [2022-07-09 06:48:22,448][25689] Fps is (10 sec: 6023.9, 60 sec: 5772.7, 300 sec: 5754.8). Total num frames: 137828352. Throughput: 0: 5995.9. Samples: 137833172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:22,448][25689] Avg episode reward: [(0, '-56.803')] [2022-07-09 06:48:23,270][26022] Updated weights on worker 0-0, policy_version 134601 (0.00099) [2022-07-09 06:48:24,834][26022] Updated weights on worker 0-0, policy_version 134611 (0.00081) [2022-07-09 06:48:26,871][26022] Updated weights on worker 0-0, policy_version 134621 (0.00088) [2022-07-09 06:48:27,508][25689] Fps is (10 sec: 5788.7, 60 sec: 5777.8, 300 sec: 5750.5). Total num frames: 137857024. Throughput: 0: 5222.0. Samples: 137850586. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:27,509][25689] Avg episode reward: [(0, '-56.649')] [2022-07-09 06:48:28,437][26022] Updated weights on worker 0-0, policy_version 134631 (0.00085) [2022-07-09 06:48:30,299][26022] Updated weights on worker 0-0, policy_version 134641 (0.00085) [2022-07-09 06:48:31,907][26022] Updated weights on worker 0-0, policy_version 134651 (0.00084) [2022-07-09 06:48:32,543][25689] Fps is (10 sec: 5680.5, 60 sec: 5759.5, 300 sec: 5743.4). Total num frames: 137885696. Throughput: 0: 6073.9. Samples: 137885326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:32,543][25689] Avg episode reward: [(0, '-57.865')] [2022-07-09 06:48:33,620][26022] Updated weights on worker 0-0, policy_version 134661 (0.00084) [2022-07-09 06:48:33,782][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:48:33,796][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000134662_137893888.pth [2022-07-09 06:48:33,797][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000132641_135824384.pth [2022-07-09 06:48:35,503][26022] Updated weights on worker 0-0, policy_version 134671 (0.00091) [2022-07-09 06:48:37,228][26022] Updated weights on worker 0-0, policy_version 134681 (0.00092) [2022-07-09 06:48:37,545][25689] Fps is (10 sec: 5713.6, 60 sec: 5778.7, 300 sec: 5747.1). Total num frames: 137914368. Throughput: 0: 6060.1. Samples: 137920094. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 06:48:37,545][25689] Avg episode reward: [(0, '-57.084')] [2022-07-09 06:48:39,132][26022] Updated weights on worker 0-0, policy_version 134691 (0.00082) [2022-07-09 06:48:40,916][26022] Updated weights on worker 0-0, policy_version 134701 (0.00088) [2022-07-09 06:48:42,548][25689] Fps is (10 sec: 5833.9, 60 sec: 5797.1, 300 sec: 5748.2). Total num frames: 137944064. Throughput: 0: 5190.3. Samples: 137937496. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:48:42,548][25689] Avg episode reward: [(0, '-58.420')] [2022-07-09 06:48:42,549][26022] Updated weights on worker 0-0, policy_version 134711 (0.00117) [2022-07-09 06:48:44,470][26022] Updated weights on worker 0-0, policy_version 134721 (0.00100) [2022-07-09 06:48:46,039][26022] Updated weights on worker 0-0, policy_version 134731 (0.00087) [2022-07-09 06:48:47,675][25689] Fps is (10 sec: 5660.8, 60 sec: 5756.9, 300 sec: 5739.4). Total num frames: 137971712. Throughput: 0: 6044.4. Samples: 137972480. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:48:47,675][25689] Avg episode reward: [(0, '-58.537')] [2022-07-09 06:48:47,959][26022] Updated weights on worker 0-0, policy_version 134741 (0.00085) [2022-07-09 06:48:49,548][26022] Updated weights on worker 0-0, policy_version 134751 (0.00087) [2022-07-09 06:48:51,507][26022] Updated weights on worker 0-0, policy_version 134761 (0.00081) [2022-07-09 06:48:52,690][25689] Fps is (10 sec: 5754.7, 60 sec: 5763.0, 300 sec: 5749.9). Total num frames: 138002432. Throughput: 0: 6024.3. Samples: 138006702. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:48:52,691][25689] Avg episode reward: [(0, '-58.979')] [2022-07-09 06:48:53,323][26022] Updated weights on worker 0-0, policy_version 134771 (0.00086) [2022-07-09 06:48:54,959][26022] Updated weights on worker 0-0, policy_version 134781 (0.00089) [2022-07-09 06:48:56,979][26022] Updated weights on worker 0-0, policy_version 134791 (0.00089) [2022-07-09 06:48:57,735][25689] Fps is (10 sec: 5903.4, 60 sec: 5763.5, 300 sec: 5743.0). Total num frames: 138031104. Throughput: 0: 5162.3. Samples: 138024324. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:48:57,736][25689] Avg episode reward: [(0, '-58.840')] [2022-07-09 06:48:58,483][26022] Updated weights on worker 0-0, policy_version 134801 (0.00081) [2022-07-09 06:49:00,312][26022] Updated weights on worker 0-0, policy_version 134811 (0.00086) [2022-07-09 06:49:02,032][26022] Updated weights on worker 0-0, policy_version 134821 (0.00089) [2022-07-09 06:49:02,799][25689] Fps is (10 sec: 5571.6, 60 sec: 5775.0, 300 sec: 5746.1). Total num frames: 138058752. Throughput: 0: 6016.9. Samples: 138059344. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:02,799][25689] Avg episode reward: [(0, '-58.752')] [2022-07-09 06:49:04,064][26022] Updated weights on worker 0-0, policy_version 134831 (0.00082) [2022-07-09 06:49:05,980][26022] Updated weights on worker 0-0, policy_version 134841 (0.00092) [2022-07-09 06:49:07,750][26022] Updated weights on worker 0-0, policy_version 134851 (0.00086) [2022-07-09 06:49:07,879][25689] Fps is (10 sec: 5653.3, 60 sec: 5771.6, 300 sec: 5748.5). Total num frames: 138088448. Throughput: 0: 5922.8. Samples: 138092144. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:07,879][25689] Avg episode reward: [(0, '-57.700')] [2022-07-09 06:49:09,407][26022] Updated weights on worker 0-0, policy_version 134861 (0.00096) [2022-07-09 06:49:11,266][26022] Updated weights on worker 0-0, policy_version 134871 (0.00087) [2022-07-09 06:49:12,891][25689] Fps is (10 sec: 5783.3, 60 sec: 5774.2, 300 sec: 5745.4). Total num frames: 138117120. Throughput: 0: 5972.8. Samples: 138127358. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:12,893][25689] Avg episode reward: [(0, '-57.922')] [2022-07-09 06:49:12,934][26022] Updated weights on worker 0-0, policy_version 134881 (0.00085) [2022-07-09 06:49:14,666][26022] Updated weights on worker 0-0, policy_version 134891 (0.00086) [2022-07-09 06:49:16,321][26022] Updated weights on worker 0-0, policy_version 134901 (0.00083) [2022-07-09 06:49:17,907][25689] Fps is (10 sec: 5820.3, 60 sec: 5756.4, 300 sec: 5752.1). Total num frames: 138146816. Throughput: 0: 5982.3. Samples: 138144998. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:17,908][25689] Avg episode reward: [(0, '-57.268')] [2022-07-09 06:49:18,080][26022] Updated weights on worker 0-0, policy_version 134911 (0.00086) [2022-07-09 06:49:19,919][26022] Updated weights on worker 0-0, policy_version 134921 (0.00083) [2022-07-09 06:49:21,729][26022] Updated weights on worker 0-0, policy_version 134931 (0.00087) [2022-07-09 06:49:22,967][25689] Fps is (10 sec: 5792.7, 60 sec: 5735.9, 300 sec: 5751.9). Total num frames: 138175488. Throughput: 0: 5964.8. Samples: 138179646. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:22,968][25689] Avg episode reward: [(0, '-57.162')] [2022-07-09 06:49:23,465][26022] Updated weights on worker 0-0, policy_version 134941 (0.00082) [2022-07-09 06:49:25,441][26022] Updated weights on worker 0-0, policy_version 134951 (0.00086) [2022-07-09 06:49:27,024][26022] Updated weights on worker 0-0, policy_version 134961 (0.00088) [2022-07-09 06:49:28,019][25689] Fps is (10 sec: 5772.2, 60 sec: 5753.7, 300 sec: 5751.5). Total num frames: 138205184. Throughput: 0: 6053.9. Samples: 138214070. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:28,019][25689] Avg episode reward: [(0, '-57.421')] [2022-07-09 06:49:28,906][26022] Updated weights on worker 0-0, policy_version 134971 (0.00080) [2022-07-09 06:49:30,796][26022] Updated weights on worker 0-0, policy_version 134981 (0.00088) [2022-07-09 06:49:32,468][26022] Updated weights on worker 0-0, policy_version 134991 (0.00084) [2022-07-09 06:49:33,100][25689] Fps is (10 sec: 5760.2, 60 sec: 5749.2, 300 sec: 5746.9). Total num frames: 138233856. Throughput: 0: 5137.8. Samples: 138231188. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:33,101][25689] Avg episode reward: [(0, '-58.572')] [2022-07-09 06:49:34,298][26022] Updated weights on worker 0-0, policy_version 135001 (0.00099) [2022-07-09 06:49:36,119][26022] Updated weights on worker 0-0, policy_version 135011 (0.00092) [2022-07-09 06:49:37,771][26022] Updated weights on worker 0-0, policy_version 135021 (0.00085) [2022-07-09 06:49:38,134][25689] Fps is (10 sec: 5872.0, 60 sec: 5780.1, 300 sec: 5760.1). Total num frames: 138264576. Throughput: 0: 5966.6. Samples: 138265680. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:38,134][25689] Avg episode reward: [(0, '-58.101')] [2022-07-09 06:49:39,705][26022] Updated weights on worker 0-0, policy_version 135031 (0.00094) [2022-07-09 06:49:41,415][26022] Updated weights on worker 0-0, policy_version 135041 (0.00096) [2022-07-09 06:49:43,149][25689] Fps is (10 sec: 5706.8, 60 sec: 5728.2, 300 sec: 5750.4). Total num frames: 138291200. Throughput: 0: 5977.6. Samples: 138300282. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:43,149][25689] Avg episode reward: [(0, '-59.342')] [2022-07-09 06:49:43,289][26022] Updated weights on worker 0-0, policy_version 135051 (0.00085) [2022-07-09 06:49:44,862][26022] Updated weights on worker 0-0, policy_version 135061 (0.00088) [2022-07-09 06:49:46,723][26022] Updated weights on worker 0-0, policy_version 135071 (0.00095) [2022-07-09 06:49:48,193][25689] Fps is (10 sec: 5598.4, 60 sec: 5769.9, 300 sec: 5750.8). Total num frames: 138320896. Throughput: 0: 5132.3. Samples: 138317612. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:48,194][25689] Avg episode reward: [(0, '-59.279')] [2022-07-09 06:49:48,474][26022] Updated weights on worker 0-0, policy_version 135081 (0.00080) [2022-07-09 06:49:50,328][26022] Updated weights on worker 0-0, policy_version 135091 (0.00086) [2022-07-09 06:49:52,047][26022] Updated weights on worker 0-0, policy_version 135101 (0.00090) [2022-07-09 06:49:53,271][25689] Fps is (10 sec: 5867.7, 60 sec: 5747.1, 300 sec: 5749.5). Total num frames: 138350592. Throughput: 0: 6030.9. Samples: 138352832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:53,271][25689] Avg episode reward: [(0, '-58.802')] [2022-07-09 06:49:53,918][26022] Updated weights on worker 0-0, policy_version 135111 (0.00086) [2022-07-09 06:49:55,501][26022] Updated weights on worker 0-0, policy_version 135121 (0.00084) [2022-07-09 06:49:57,559][26022] Updated weights on worker 0-0, policy_version 135131 (0.00084) [2022-07-09 06:49:58,309][25689] Fps is (10 sec: 5770.2, 60 sec: 5747.7, 300 sec: 5749.1). Total num frames: 138379264. Throughput: 0: 6013.1. Samples: 138386996. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:49:58,309][25689] Avg episode reward: [(0, '-58.126')] [2022-07-09 06:49:59,088][26022] Updated weights on worker 0-0, policy_version 135141 (0.00085) [2022-07-09 06:50:00,985][26022] Updated weights on worker 0-0, policy_version 135151 (0.00085) [2022-07-09 06:50:03,206][26022] Updated weights on worker 0-0, policy_version 135161 (0.00086) [2022-07-09 06:50:03,407][25689] Fps is (10 sec: 5354.3, 60 sec: 5710.7, 300 sec: 5746.3). Total num frames: 138404864. Throughput: 0: 5142.5. Samples: 138404458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:03,407][25689] Avg episode reward: [(0, '-57.722')] [2022-07-09 06:50:04,773][26022] Updated weights on worker 0-0, policy_version 135171 (0.00087) [2022-07-09 06:50:06,772][26022] Updated weights on worker 0-0, policy_version 135181 (0.00092) [2022-07-09 06:50:08,239][26022] Updated weights on worker 0-0, policy_version 135191 (0.00764) [2022-07-09 06:50:08,477][25689] Fps is (10 sec: 5539.0, 60 sec: 5728.5, 300 sec: 5748.6). Total num frames: 138435584. Throughput: 0: 5876.3. Samples: 138436802. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:08,477][25689] Avg episode reward: [(0, '-56.199')] [2022-07-09 06:50:10,386][26022] Updated weights on worker 0-0, policy_version 135201 (0.00087) [2022-07-09 06:50:11,953][26022] Updated weights on worker 0-0, policy_version 135211 (0.00080) [2022-07-09 06:50:13,487][25689] Fps is (10 sec: 5790.3, 60 sec: 5711.8, 300 sec: 5745.3). Total num frames: 138463232. Throughput: 0: 5866.4. Samples: 138471430. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:13,487][25689] Avg episode reward: [(0, '-56.761')] [2022-07-09 06:50:13,675][26022] Updated weights on worker 0-0, policy_version 135221 (0.00084) [2022-07-09 06:50:15,531][26022] Updated weights on worker 0-0, policy_version 135231 (0.00086) [2022-07-09 06:50:17,232][26022] Updated weights on worker 0-0, policy_version 135241 (0.00092) [2022-07-09 06:50:18,493][25689] Fps is (10 sec: 5827.1, 60 sec: 5729.6, 300 sec: 5748.9). Total num frames: 138493952. Throughput: 0: 5037.7. Samples: 138488680. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:18,494][25689] Avg episode reward: [(0, '-56.883')] [2022-07-09 06:50:18,998][26022] Updated weights on worker 0-0, policy_version 135251 (0.00093) [2022-07-09 06:50:20,775][26022] Updated weights on worker 0-0, policy_version 135261 (0.00088) [2022-07-09 06:50:22,826][26022] Updated weights on worker 0-0, policy_version 135271 (0.00086) [2022-07-09 06:50:23,546][25689] Fps is (10 sec: 5904.4, 60 sec: 5730.3, 300 sec: 5753.0). Total num frames: 138522624. Throughput: 0: 5912.3. Samples: 138523528. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:23,546][25689] Avg episode reward: [(0, '-57.502')] [2022-07-09 06:50:24,300][26022] Updated weights on worker 0-0, policy_version 135281 (0.00083) [2022-07-09 06:50:26,450][26022] Updated weights on worker 0-0, policy_version 135291 (0.00097) [2022-07-09 06:50:27,815][26022] Updated weights on worker 0-0, policy_version 135301 (0.00087) [2022-07-09 06:50:28,658][25689] Fps is (10 sec: 5641.7, 60 sec: 5707.8, 300 sec: 5749.0). Total num frames: 138551296. Throughput: 0: 6001.3. Samples: 138557914. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:28,658][25689] Avg episode reward: [(0, '-57.858')] [2022-07-09 06:50:29,838][26022] Updated weights on worker 0-0, policy_version 135311 (0.00086) [2022-07-09 06:50:31,531][26022] Updated weights on worker 0-0, policy_version 135321 (0.00088) [2022-07-09 06:50:33,256][26022] Updated weights on worker 0-0, policy_version 135331 (0.00094) [2022-07-09 06:50:33,676][25689] Fps is (10 sec: 5761.8, 60 sec: 5730.6, 300 sec: 5749.7). Total num frames: 138580992. Throughput: 0: 5143.8. Samples: 138575278. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:33,677][25689] Avg episode reward: [(0, '-57.968')] [2022-07-09 06:50:33,911][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:50:33,925][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000135334_138582016.pth [2022-07-09 06:50:33,926][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000133312_136511488.pth [2022-07-09 06:50:35,032][26022] Updated weights on worker 0-0, policy_version 135341 (0.00091) [2022-07-09 06:50:36,924][26022] Updated weights on worker 0-0, policy_version 135351 (0.00093) [2022-07-09 06:50:38,569][26022] Updated weights on worker 0-0, policy_version 135361 (0.00086) [2022-07-09 06:50:38,701][25689] Fps is (10 sec: 5811.3, 60 sec: 5697.5, 300 sec: 5746.2). Total num frames: 138609664. Throughput: 0: 5995.8. Samples: 138609844. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:38,702][25689] Avg episode reward: [(0, '-59.063')] [2022-07-09 06:50:40,445][26022] Updated weights on worker 0-0, policy_version 135371 (0.00085) [2022-07-09 06:50:42,079][26022] Updated weights on worker 0-0, policy_version 135381 (0.00091) [2022-07-09 06:50:43,755][25689] Fps is (10 sec: 5689.7, 60 sec: 5727.8, 300 sec: 5746.6). Total num frames: 138638336. Throughput: 0: 5993.3. Samples: 138644646. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 06:50:43,755][25689] Avg episode reward: [(0, '-58.674')] [2022-07-09 06:50:44,060][26022] Updated weights on worker 0-0, policy_version 135391 (0.00093) [2022-07-09 06:50:45,635][26022] Updated weights on worker 0-0, policy_version 135401 (0.00093) [2022-07-09 06:50:47,386][26022] Updated weights on worker 0-0, policy_version 135411 (0.00082) [2022-07-09 06:50:48,859][25689] Fps is (10 sec: 5847.2, 60 sec: 5739.0, 300 sec: 5748.1). Total num frames: 138669056. Throughput: 0: 5168.8. Samples: 138662330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:50:48,859][25689] Avg episode reward: [(0, '-58.316')] [2022-07-09 06:50:49,438][26022] Updated weights on worker 0-0, policy_version 135421 (0.00087) [2022-07-09 06:50:50,859][26022] Updated weights on worker 0-0, policy_version 135431 (0.00073) [2022-07-09 06:50:52,868][26022] Updated weights on worker 0-0, policy_version 135441 (0.00088) [2022-07-09 06:50:53,940][25689] Fps is (10 sec: 5932.0, 60 sec: 5738.7, 300 sec: 5750.5). Total num frames: 138698752. Throughput: 0: 6018.0. Samples: 138697224. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:50:53,940][25689] Avg episode reward: [(0, '-57.703')] [2022-07-09 06:50:54,481][26022] Updated weights on worker 0-0, policy_version 135451 (0.00089) [2022-07-09 06:50:56,230][26022] Updated weights on worker 0-0, policy_version 135461 (0.00101) [2022-07-09 06:50:58,261][26022] Updated weights on worker 0-0, policy_version 135471 (0.00092) [2022-07-09 06:50:58,979][25689] Fps is (10 sec: 5666.3, 60 sec: 5721.7, 300 sec: 5743.3). Total num frames: 138726400. Throughput: 0: 6037.1. Samples: 138732262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:50:58,979][25689] Avg episode reward: [(0, '-57.659')] [2022-07-09 06:50:59,531][26022] Updated weights on worker 0-0, policy_version 135481 (0.00086) [2022-07-09 06:51:01,998][26022] Updated weights on worker 0-0, policy_version 135491 (0.00088) [2022-07-09 06:51:03,492][26022] Updated weights on worker 0-0, policy_version 135501 (0.00083) [2022-07-09 06:51:03,991][25689] Fps is (10 sec: 5602.9, 60 sec: 5780.5, 300 sec: 5756.0). Total num frames: 138755072. Throughput: 0: 5200.1. Samples: 138749880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:03,992][25689] Avg episode reward: [(0, '-57.564')] [2022-07-09 06:51:05,378][26022] Updated weights on worker 0-0, policy_version 135511 (0.00104) [2022-07-09 06:51:07,078][26022] Updated weights on worker 0-0, policy_version 135521 (0.00083) [2022-07-09 06:51:08,934][26022] Updated weights on worker 0-0, policy_version 135531 (0.00085) [2022-07-09 06:51:09,100][25689] Fps is (10 sec: 5665.8, 60 sec: 5743.0, 300 sec: 5747.3). Total num frames: 138783744. Throughput: 0: 5947.0. Samples: 138782704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:09,100][25689] Avg episode reward: [(0, '-58.352')] [2022-07-09 06:51:10,548][26022] Updated weights on worker 0-0, policy_version 135541 (0.00090) [2022-07-09 06:51:12,654][26022] Updated weights on worker 0-0, policy_version 135551 (0.00090) [2022-07-09 06:51:14,118][25689] Fps is (10 sec: 5764.0, 60 sec: 5776.1, 300 sec: 5751.5). Total num frames: 138813440. Throughput: 0: 5960.1. Samples: 138817488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:14,118][25689] Avg episode reward: [(0, '-58.776')] [2022-07-09 06:51:14,233][26022] Updated weights on worker 0-0, policy_version 135561 (0.00107) [2022-07-09 06:51:16,093][26022] Updated weights on worker 0-0, policy_version 135571 (0.00082) [2022-07-09 06:51:17,811][26022] Updated weights on worker 0-0, policy_version 135581 (0.00089) [2022-07-09 06:51:19,151][25689] Fps is (10 sec: 5807.1, 60 sec: 5739.7, 300 sec: 5751.5). Total num frames: 138842112. Throughput: 0: 5940.1. Samples: 138852088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:19,152][25689] Avg episode reward: [(0, '-59.788')] [2022-07-09 06:51:19,635][26022] Updated weights on worker 0-0, policy_version 135591 (0.00086) [2022-07-09 06:51:21,271][26022] Updated weights on worker 0-0, policy_version 135601 (0.00081) [2022-07-09 06:51:23,286][26022] Updated weights on worker 0-0, policy_version 135611 (0.00084) [2022-07-09 06:51:24,240][25689] Fps is (10 sec: 5867.5, 60 sec: 5770.1, 300 sec: 5755.3). Total num frames: 138872832. Throughput: 0: 5907.6. Samples: 138869500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:24,241][25689] Avg episode reward: [(0, '-59.570')] [2022-07-09 06:51:24,868][26022] Updated weights on worker 0-0, policy_version 135621 (0.00089) [2022-07-09 06:51:26,861][26022] Updated weights on worker 0-0, policy_version 135631 (0.00083) [2022-07-09 06:51:28,623][26022] Updated weights on worker 0-0, policy_version 135641 (0.00086) [2022-07-09 06:51:29,340][25689] Fps is (10 sec: 5628.1, 60 sec: 5737.4, 300 sec: 5747.8). Total num frames: 138899456. Throughput: 0: 5969.4. Samples: 138903524. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:29,342][25689] Avg episode reward: [(0, '-58.806')] [2022-07-09 06:51:30,372][26022] Updated weights on worker 0-0, policy_version 135651 (0.00090) [2022-07-09 06:51:32,182][26022] Updated weights on worker 0-0, policy_version 135661 (0.00091) [2022-07-09 06:51:33,907][26022] Updated weights on worker 0-0, policy_version 135671 (0.00089) [2022-07-09 06:51:34,406][25689] Fps is (10 sec: 5640.6, 60 sec: 5749.8, 300 sec: 5750.5). Total num frames: 138930176. Throughput: 0: 5932.3. Samples: 138937846. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:34,408][25689] Avg episode reward: [(0, '-57.678')] [2022-07-09 06:51:35,751][26022] Updated weights on worker 0-0, policy_version 135681 (0.00102) [2022-07-09 06:51:37,341][26022] Updated weights on worker 0-0, policy_version 135691 (0.00086) [2022-07-09 06:51:39,252][26022] Updated weights on worker 0-0, policy_version 135701 (0.00086) [2022-07-09 06:51:39,474][25689] Fps is (10 sec: 5759.6, 60 sec: 5728.9, 300 sec: 5747.0). Total num frames: 138957824. Throughput: 0: 5072.5. Samples: 138955176. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:39,476][25689] Avg episode reward: [(0, '-57.250')] [2022-07-09 06:51:41,072][26022] Updated weights on worker 0-0, policy_version 135711 (0.00089) [2022-07-09 06:51:42,718][26022] Updated weights on worker 0-0, policy_version 135721 (0.00081) [2022-07-09 06:51:44,563][25689] Fps is (10 sec: 5646.0, 60 sec: 5742.4, 300 sec: 5743.6). Total num frames: 138987520. Throughput: 0: 5912.9. Samples: 138989668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:44,564][25689] Avg episode reward: [(0, '-56.781')] [2022-07-09 06:51:44,581][26022] Updated weights on worker 0-0, policy_version 135731 (0.00094) [2022-07-09 06:51:46,364][26022] Updated weights on worker 0-0, policy_version 135741 (0.00097) [2022-07-09 06:51:48,123][26022] Updated weights on worker 0-0, policy_version 135751 (0.00071) [2022-07-09 06:51:49,632][25689] Fps is (10 sec: 5846.7, 60 sec: 5728.8, 300 sec: 5750.9). Total num frames: 139017216. Throughput: 0: 5983.8. Samples: 139024948. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:49,633][25689] Avg episode reward: [(0, '-56.531')] [2022-07-09 06:51:49,918][26022] Updated weights on worker 0-0, policy_version 135761 (0.00086) [2022-07-09 06:51:51,496][26022] Updated weights on worker 0-0, policy_version 135771 (0.00086) [2022-07-09 06:51:53,369][26022] Updated weights on worker 0-0, policy_version 135781 (0.00081) [2022-07-09 06:51:54,676][25689] Fps is (10 sec: 5973.7, 60 sec: 5749.1, 300 sec: 5750.3). Total num frames: 139047936. Throughput: 0: 5164.2. Samples: 139042526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:54,677][25689] Avg episode reward: [(0, '-56.435')] [2022-07-09 06:51:54,826][26022] Updated weights on worker 0-0, policy_version 135791 (0.00086) [2022-07-09 06:51:56,808][26022] Updated weights on worker 0-0, policy_version 135801 (0.00088) [2022-07-09 06:51:58,785][26022] Updated weights on worker 0-0, policy_version 135811 (0.00086) [2022-07-09 06:51:59,684][25689] Fps is (10 sec: 5807.0, 60 sec: 5752.2, 300 sec: 5751.3). Total num frames: 139075584. Throughput: 0: 6043.0. Samples: 139077302. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:51:59,684][25689] Avg episode reward: [(0, '-56.896')] [2022-07-09 06:52:00,281][26022] Updated weights on worker 0-0, policy_version 135821 (0.00086) [2022-07-09 06:52:02,567][26022] Updated weights on worker 0-0, policy_version 135831 (0.00082) [2022-07-09 06:52:04,095][26022] Updated weights on worker 0-0, policy_version 135841 (0.00362) [2022-07-09 06:52:04,705][25689] Fps is (10 sec: 5513.7, 60 sec: 5734.5, 300 sec: 5745.9). Total num frames: 139103232. Throughput: 0: 5997.3. Samples: 139110468. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:04,706][25689] Avg episode reward: [(0, '-58.119')] [2022-07-09 06:52:05,937][26022] Updated weights on worker 0-0, policy_version 135851 (0.00085) [2022-07-09 06:52:07,718][26022] Updated weights on worker 0-0, policy_version 135861 (0.00086) [2022-07-09 06:52:09,472][26022] Updated weights on worker 0-0, policy_version 135871 (0.00087) [2022-07-09 06:52:09,794][25689] Fps is (10 sec: 5671.7, 60 sec: 5753.2, 300 sec: 5748.1). Total num frames: 139132928. Throughput: 0: 5105.8. Samples: 139127890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:09,795][25689] Avg episode reward: [(0, '-57.850')] [2022-07-09 06:52:11,180][26022] Updated weights on worker 0-0, policy_version 135881 (0.00085) [2022-07-09 06:52:12,899][26022] Updated weights on worker 0-0, policy_version 135891 (0.00082) [2022-07-09 06:52:14,785][26022] Updated weights on worker 0-0, policy_version 135901 (0.00086) [2022-07-09 06:52:14,818][25689] Fps is (10 sec: 5872.8, 60 sec: 5752.6, 300 sec: 5747.8). Total num frames: 139162624. Throughput: 0: 5992.7. Samples: 139163230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:14,819][25689] Avg episode reward: [(0, '-57.930')] [2022-07-09 06:52:16,278][26022] Updated weights on worker 0-0, policy_version 135911 (0.00088) [2022-07-09 06:52:18,221][26022] Updated weights on worker 0-0, policy_version 135921 (0.00089) [2022-07-09 06:52:19,827][25689] Fps is (10 sec: 5817.9, 60 sec: 5754.9, 300 sec: 5747.9). Total num frames: 139191296. Throughput: 0: 5998.5. Samples: 139198130. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:19,828][25689] Avg episode reward: [(0, '-57.846')] [2022-07-09 06:52:20,122][26022] Updated weights on worker 0-0, policy_version 135931 (0.00980) [2022-07-09 06:52:21,756][26022] Updated weights on worker 0-0, policy_version 135941 (0.00089) [2022-07-09 06:52:23,611][26022] Updated weights on worker 0-0, policy_version 135951 (0.00089) [2022-07-09 06:52:24,859][25689] Fps is (10 sec: 5812.8, 60 sec: 5743.4, 300 sec: 5753.0). Total num frames: 139220992. Throughput: 0: 5211.1. Samples: 139215494. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:24,860][25689] Avg episode reward: [(0, '-58.080')] [2022-07-09 06:52:25,331][26022] Updated weights on worker 0-0, policy_version 135961 (0.00088) [2022-07-09 06:52:26,986][26022] Updated weights on worker 0-0, policy_version 135971 (0.00086) [2022-07-09 06:52:29,190][26022] Updated weights on worker 0-0, policy_version 135981 (0.00096) [2022-07-09 06:52:29,993][25689] Fps is (10 sec: 5841.9, 60 sec: 5790.8, 300 sec: 5750.8). Total num frames: 139250688. Throughput: 0: 6056.7. Samples: 139250230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:29,994][25689] Avg episode reward: [(0, '-57.277')] [2022-07-09 06:52:30,542][26022] Updated weights on worker 0-0, policy_version 135991 (0.00099) [2022-07-09 06:52:32,598][26022] Updated weights on worker 0-0, policy_version 136001 (0.00087) [2022-07-09 06:52:34,004][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:52:34,018][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000136010_139274240.pth [2022-07-09 06:52:34,018][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000133986_137201664.pth [2022-07-09 06:52:34,116][26022] Updated weights on worker 0-0, policy_version 136011 (0.00088) [2022-07-09 06:52:35,019][25689] Fps is (10 sec: 5745.3, 60 sec: 5760.9, 300 sec: 5754.2). Total num frames: 139279360. Throughput: 0: 6036.2. Samples: 139285164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:35,019][25689] Avg episode reward: [(0, '-57.096')] [2022-07-09 06:52:35,979][26022] Updated weights on worker 0-0, policy_version 136021 (0.00085) [2022-07-09 06:52:37,787][26022] Updated weights on worker 0-0, policy_version 136031 (0.00087) [2022-07-09 06:52:39,490][26022] Updated weights on worker 0-0, policy_version 136041 (0.00097) [2022-07-09 06:52:40,036][25689] Fps is (10 sec: 5608.1, 60 sec: 5765.7, 300 sec: 5750.8). Total num frames: 139307008. Throughput: 0: 5160.5. Samples: 139302420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:40,036][25689] Avg episode reward: [(0, '-57.199')] [2022-07-09 06:52:41,384][26022] Updated weights on worker 0-0, policy_version 136051 (0.00094) [2022-07-09 06:52:42,993][26022] Updated weights on worker 0-0, policy_version 136061 (0.00094) [2022-07-09 06:52:44,853][26022] Updated weights on worker 0-0, policy_version 136071 (0.00094) [2022-07-09 06:52:45,069][25689] Fps is (10 sec: 5807.4, 60 sec: 5787.9, 300 sec: 5754.8). Total num frames: 139337728. Throughput: 0: 6015.8. Samples: 139337074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:45,070][25689] Avg episode reward: [(0, '-57.832')] [2022-07-09 06:52:46,624][26022] Updated weights on worker 0-0, policy_version 136081 (0.00084) [2022-07-09 06:52:48,517][26022] Updated weights on worker 0-0, policy_version 136091 (0.00083) [2022-07-09 06:52:50,067][26022] Updated weights on worker 0-0, policy_version 136101 (0.00086) [2022-07-09 06:52:50,143][25689] Fps is (10 sec: 5977.6, 60 sec: 5787.5, 300 sec: 5751.4). Total num frames: 139367424. Throughput: 0: 6029.5. Samples: 139371722. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 06:52:50,144][25689] Avg episode reward: [(0, '-57.683')] [2022-07-09 06:52:52,115][26022] Updated weights on worker 0-0, policy_version 136111 (0.00100) [2022-07-09 06:52:53,656][26022] Updated weights on worker 0-0, policy_version 136121 (0.00095) [2022-07-09 06:52:55,154][25689] Fps is (10 sec: 5584.7, 60 sec: 5723.0, 300 sec: 5745.3). Total num frames: 139394048. Throughput: 0: 5164.5. Samples: 139389154. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:52:55,155][25689] Avg episode reward: [(0, '-58.497')] [2022-07-09 06:52:55,475][26022] Updated weights on worker 0-0, policy_version 136131 (0.00087) [2022-07-09 06:52:57,083][26022] Updated weights on worker 0-0, policy_version 136141 (0.00085) [2022-07-09 06:52:59,023][26022] Updated weights on worker 0-0, policy_version 136151 (0.00084) [2022-07-09 06:53:00,179][25689] Fps is (10 sec: 5816.2, 60 sec: 5789.0, 300 sec: 5762.2). Total num frames: 139425792. Throughput: 0: 6040.1. Samples: 139424084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:00,179][25689] Avg episode reward: [(0, '-58.207')] [2022-07-09 06:53:00,779][26022] Updated weights on worker 0-0, policy_version 136161 (0.00084) [2022-07-09 06:53:02,899][26022] Updated weights on worker 0-0, policy_version 136171 (0.00089) [2022-07-09 06:53:04,586][26022] Updated weights on worker 0-0, policy_version 136181 (0.00087) [2022-07-09 06:53:05,193][25689] Fps is (10 sec: 5814.1, 60 sec: 5772.8, 300 sec: 5752.4). Total num frames: 139452416. Throughput: 0: 5963.8. Samples: 139457088. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:05,194][25689] Avg episode reward: [(0, '-57.708')] [2022-07-09 06:53:06,499][26022] Updated weights on worker 0-0, policy_version 136191 (0.00084) [2022-07-09 06:53:08,009][26022] Updated weights on worker 0-0, policy_version 136201 (0.00090) [2022-07-09 06:53:10,100][26022] Updated weights on worker 0-0, policy_version 136211 (0.00094) [2022-07-09 06:53:10,302][25689] Fps is (10 sec: 5360.9, 60 sec: 5737.0, 300 sec: 5747.6). Total num frames: 139480064. Throughput: 0: 5093.7. Samples: 139474406. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:10,303][25689] Avg episode reward: [(0, '-57.052')] [2022-07-09 06:53:11,474][26022] Updated weights on worker 0-0, policy_version 136221 (0.00085) [2022-07-09 06:53:13,487][26022] Updated weights on worker 0-0, policy_version 136231 (0.00096) [2022-07-09 06:53:15,322][25689] Fps is (10 sec: 5661.2, 60 sec: 5737.4, 300 sec: 5743.9). Total num frames: 139509760. Throughput: 0: 5967.2. Samples: 139509502. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:15,323][25689] Avg episode reward: [(0, '-58.170')] [2022-07-09 06:53:15,348][26022] Updated weights on worker 0-0, policy_version 136241 (0.00086) [2022-07-09 06:53:16,899][26022] Updated weights on worker 0-0, policy_version 136251 (0.00097) [2022-07-09 06:53:18,815][26022] Updated weights on worker 0-0, policy_version 136261 (0.00090) [2022-07-09 06:53:20,397][25689] Fps is (10 sec: 5984.6, 60 sec: 5764.9, 300 sec: 5746.4). Total num frames: 139540480. Throughput: 0: 5941.9. Samples: 139544222. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:20,398][25689] Avg episode reward: [(0, '-58.524')] [2022-07-09 06:53:20,435][26022] Updated weights on worker 0-0, policy_version 136271 (0.00095) [2022-07-09 06:53:22,183][26022] Updated weights on worker 0-0, policy_version 136281 (0.00085) [2022-07-09 06:53:24,215][26022] Updated weights on worker 0-0, policy_version 136291 (0.00089) [2022-07-09 06:53:25,399][25689] Fps is (10 sec: 5995.9, 60 sec: 5767.9, 300 sec: 5751.0). Total num frames: 139570176. Throughput: 0: 6033.1. Samples: 139578990. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:25,399][25689] Avg episode reward: [(0, '-58.243')] [2022-07-09 06:53:25,633][26022] Updated weights on worker 0-0, policy_version 136301 (0.00090) [2022-07-09 06:53:27,774][26022] Updated weights on worker 0-0, policy_version 136311 (0.00085) [2022-07-09 06:53:29,190][26022] Updated weights on worker 0-0, policy_version 136321 (0.00087) [2022-07-09 06:53:30,443][25689] Fps is (10 sec: 5606.2, 60 sec: 5725.6, 300 sec: 5743.9). Total num frames: 139596800. Throughput: 0: 6036.2. Samples: 139595982. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:30,444][25689] Avg episode reward: [(0, '-59.443')] [2022-07-09 06:53:31,464][26022] Updated weights on worker 0-0, policy_version 136331 (0.00093) [2022-07-09 06:53:32,795][26022] Updated weights on worker 0-0, policy_version 136341 (0.00118) [2022-07-09 06:53:34,737][26022] Updated weights on worker 0-0, policy_version 136351 (0.00087) [2022-07-09 06:53:35,463][25689] Fps is (10 sec: 5799.4, 60 sec: 5776.9, 300 sec: 5753.8). Total num frames: 139628544. Throughput: 0: 6013.9. Samples: 139630626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:35,464][25689] Avg episode reward: [(0, '-60.553')] [2022-07-09 06:53:36,568][26022] Updated weights on worker 0-0, policy_version 136361 (0.00085) [2022-07-09 06:53:38,308][26022] Updated weights on worker 0-0, policy_version 136371 (0.00084) [2022-07-09 06:53:40,256][26022] Updated weights on worker 0-0, policy_version 136381 (0.00093) [2022-07-09 06:53:40,513][25689] Fps is (10 sec: 5898.4, 60 sec: 5773.9, 300 sec: 5746.1). Total num frames: 139656192. Throughput: 0: 6004.5. Samples: 139665004. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:40,513][25689] Avg episode reward: [(0, '-60.634')] [2022-07-09 06:53:41,940][26022] Updated weights on worker 0-0, policy_version 136391 (0.00086) [2022-07-09 06:53:43,599][26022] Updated weights on worker 0-0, policy_version 136401 (0.00083) [2022-07-09 06:53:45,374][26022] Updated weights on worker 0-0, policy_version 136411 (0.00092) [2022-07-09 06:53:45,600][25689] Fps is (10 sec: 5555.7, 60 sec: 5734.9, 300 sec: 5750.3). Total num frames: 139684864. Throughput: 0: 5127.0. Samples: 139682564. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:45,601][25689] Avg episode reward: [(0, '-60.353')] [2022-07-09 06:53:47,026][26022] Updated weights on worker 0-0, policy_version 136421 (0.00082) [2022-07-09 06:53:48,997][26022] Updated weights on worker 0-0, policy_version 136431 (0.00087) [2022-07-09 06:53:50,713][25689] Fps is (10 sec: 5822.5, 60 sec: 5748.1, 300 sec: 5748.4). Total num frames: 139715584. Throughput: 0: 5997.5. Samples: 139717550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:50,715][25689] Avg episode reward: [(0, '-60.532')] [2022-07-09 06:53:50,716][26022] Updated weights on worker 0-0, policy_version 136441 (0.00092) [2022-07-09 06:53:52,680][26022] Updated weights on worker 0-0, policy_version 136451 (0.00087) [2022-07-09 06:53:54,237][26022] Updated weights on worker 0-0, policy_version 136461 (0.00085) [2022-07-09 06:53:55,725][25689] Fps is (10 sec: 5764.8, 60 sec: 5764.9, 300 sec: 5745.6). Total num frames: 139743232. Throughput: 0: 5991.7. Samples: 139752030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:53:55,726][25689] Avg episode reward: [(0, '-58.813')] [2022-07-09 06:53:56,192][26022] Updated weights on worker 0-0, policy_version 136471 (0.00094) [2022-07-09 06:53:57,891][26022] Updated weights on worker 0-0, policy_version 136481 (0.00088) [2022-07-09 06:53:59,758][26022] Updated weights on worker 0-0, policy_version 136491 (0.00081) [2022-07-09 06:54:00,745][25689] Fps is (10 sec: 5716.3, 60 sec: 5731.5, 300 sec: 5753.3). Total num frames: 139772928. Throughput: 0: 5152.9. Samples: 139769258. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:00,745][25689] Avg episode reward: [(0, '-58.075')] [2022-07-09 06:54:01,493][26022] Updated weights on worker 0-0, policy_version 136501 (0.00085) [2022-07-09 06:54:03,580][26022] Updated weights on worker 0-0, policy_version 136511 (0.00084) [2022-07-09 06:54:05,275][26022] Updated weights on worker 0-0, policy_version 136521 (0.00087) [2022-07-09 06:54:05,783][25689] Fps is (10 sec: 5497.6, 60 sec: 5712.3, 300 sec: 5740.4). Total num frames: 139798528. Throughput: 0: 5900.0. Samples: 139801644. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:05,785][25689] Avg episode reward: [(0, '-57.964')] [2022-07-09 06:54:07,240][26022] Updated weights on worker 0-0, policy_version 136531 (0.00087) [2022-07-09 06:54:08,886][26022] Updated weights on worker 0-0, policy_version 136541 (0.00097) [2022-07-09 06:54:10,858][25689] Fps is (10 sec: 5366.3, 60 sec: 5732.5, 300 sec: 5739.2). Total num frames: 139827200. Throughput: 0: 5881.3. Samples: 139836028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:10,859][25689] Avg episode reward: [(0, '-57.641')] [2022-07-09 06:54:10,921][26022] Updated weights on worker 0-0, policy_version 136551 (0.00084) [2022-07-09 06:54:12,416][26022] Updated weights on worker 0-0, policy_version 136561 (0.00563) [2022-07-09 06:54:14,484][26022] Updated weights on worker 0-0, policy_version 136571 (0.00084) [2022-07-09 06:54:15,878][25689] Fps is (10 sec: 5883.2, 60 sec: 5749.4, 300 sec: 5742.5). Total num frames: 139857920. Throughput: 0: 5027.5. Samples: 139853350. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:15,879][25689] Avg episode reward: [(0, '-57.718')] [2022-07-09 06:54:16,003][26022] Updated weights on worker 0-0, policy_version 136581 (0.00079) [2022-07-09 06:54:17,834][26022] Updated weights on worker 0-0, policy_version 136591 (0.00084) [2022-07-09 06:54:19,680][26022] Updated weights on worker 0-0, policy_version 136601 (0.00084) [2022-07-09 06:54:20,903][25689] Fps is (10 sec: 5912.8, 60 sec: 5720.3, 300 sec: 5743.2). Total num frames: 139886592. Throughput: 0: 5891.8. Samples: 139888026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:20,903][25689] Avg episode reward: [(0, '-57.670')] [2022-07-09 06:54:21,282][26022] Updated weights on worker 0-0, policy_version 136611 (0.00093) [2022-07-09 06:54:23,179][26022] Updated weights on worker 0-0, policy_version 136621 (0.00087) [2022-07-09 06:54:24,859][26022] Updated weights on worker 0-0, policy_version 136631 (0.00084) [2022-07-09 06:54:25,943][25689] Fps is (10 sec: 5697.8, 60 sec: 5699.8, 300 sec: 5740.0). Total num frames: 139915264. Throughput: 0: 6009.8. Samples: 139922798. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:25,943][25689] Avg episode reward: [(0, '-58.584')] [2022-07-09 06:54:26,787][26022] Updated weights on worker 0-0, policy_version 136641 (0.00087) [2022-07-09 06:54:28,584][26022] Updated weights on worker 0-0, policy_version 136651 (0.00089) [2022-07-09 06:54:30,322][26022] Updated weights on worker 0-0, policy_version 136661 (0.00093) [2022-07-09 06:54:31,037][25689] Fps is (10 sec: 5759.6, 60 sec: 5745.8, 300 sec: 5743.2). Total num frames: 139944960. Throughput: 0: 5160.0. Samples: 139940150. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:31,037][25689] Avg episode reward: [(0, '-59.060')] [2022-07-09 06:54:32,151][26022] Updated weights on worker 0-0, policy_version 136671 (0.00088) [2022-07-09 06:54:33,779][26022] Updated weights on worker 0-0, policy_version 136681 (0.00084) [2022-07-09 06:54:34,145][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:54:34,153][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000136683_139963392.pth [2022-07-09 06:54:34,154][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000134662_137893888.pth [2022-07-09 06:54:35,620][26022] Updated weights on worker 0-0, policy_version 136691 (0.00081) [2022-07-09 06:54:36,066][25689] Fps is (10 sec: 5765.7, 60 sec: 5694.2, 300 sec: 5736.4). Total num frames: 139973632. Throughput: 0: 6005.5. Samples: 139974586. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:36,067][25689] Avg episode reward: [(0, '-58.825')] [2022-07-09 06:54:37,244][26022] Updated weights on worker 0-0, policy_version 136701 (0.00090) [2022-07-09 06:54:39,250][26022] Updated weights on worker 0-0, policy_version 136711 (0.00096) [2022-07-09 06:54:40,720][26022] Updated weights on worker 0-0, policy_version 136721 (0.00084) [2022-07-09 06:54:41,087][25689] Fps is (10 sec: 5706.0, 60 sec: 5713.9, 300 sec: 5743.2). Total num frames: 140002304. Throughput: 0: 5988.7. Samples: 140008900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:41,087][25689] Avg episode reward: [(0, '-58.660')] [2022-07-09 06:54:42,829][26022] Updated weights on worker 0-0, policy_version 136731 (0.00066) [2022-07-09 06:54:44,520][26022] Updated weights on worker 0-0, policy_version 136741 (0.00082) [2022-07-09 06:54:46,107][25689] Fps is (10 sec: 5711.2, 60 sec: 5720.3, 300 sec: 5740.2). Total num frames: 140030976. Throughput: 0: 5128.4. Samples: 140026206. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:46,107][25689] Avg episode reward: [(0, '-57.571')] [2022-07-09 06:54:46,379][26022] Updated weights on worker 0-0, policy_version 136751 (0.00081) [2022-07-09 06:54:48,289][26022] Updated weights on worker 0-0, policy_version 136761 (0.00086) [2022-07-09 06:54:49,979][26022] Updated weights on worker 0-0, policy_version 136771 (0.00090) [2022-07-09 06:54:51,209][25689] Fps is (10 sec: 5867.5, 60 sec: 5721.3, 300 sec: 5743.1). Total num frames: 140061696. Throughput: 0: 5973.4. Samples: 140060646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 06:54:51,210][25689] Avg episode reward: [(0, '-57.636')] [2022-07-09 06:54:51,519][26022] Updated weights on worker 0-0, policy_version 136781 (0.00087) [2022-07-09 06:54:53,568][26022] Updated weights on worker 0-0, policy_version 136791 (0.00080) [2022-07-09 06:54:55,319][26022] Updated weights on worker 0-0, policy_version 136801 (0.00085) [2022-07-09 06:54:56,258][25689] Fps is (10 sec: 5749.9, 60 sec: 5717.8, 300 sec: 5739.5). Total num frames: 140089344. Throughput: 0: 5982.7. Samples: 140095388. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:54:56,258][25689] Avg episode reward: [(0, '-56.926')] [2022-07-09 06:54:56,997][26022] Updated weights on worker 0-0, policy_version 136811 (0.00090) [2022-07-09 06:54:58,844][26022] Updated weights on worker 0-0, policy_version 136821 (0.00083) [2022-07-09 06:55:00,583][26022] Updated weights on worker 0-0, policy_version 136831 (0.00086) [2022-07-09 06:55:01,279][25689] Fps is (10 sec: 5694.6, 60 sec: 5717.6, 300 sec: 5754.7). Total num frames: 140119040. Throughput: 0: 5138.0. Samples: 140112648. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:01,279][25689] Avg episode reward: [(0, '-57.287')] [2022-07-09 06:55:02,763][26022] Updated weights on worker 0-0, policy_version 136841 (0.00087) [2022-07-09 06:55:04,597][26022] Updated weights on worker 0-0, policy_version 136851 (0.00093) [2022-07-09 06:55:06,294][25689] Fps is (10 sec: 5509.7, 60 sec: 5719.9, 300 sec: 5738.6). Total num frames: 140144640. Throughput: 0: 5888.5. Samples: 140145078. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:06,294][25689] Avg episode reward: [(0, '-57.901')] [2022-07-09 06:55:06,443][26022] Updated weights on worker 0-0, policy_version 136861 (0.00093) [2022-07-09 06:55:07,981][26022] Updated weights on worker 0-0, policy_version 136871 (0.00086) [2022-07-09 06:55:09,979][26022] Updated weights on worker 0-0, policy_version 136881 (0.00099) [2022-07-09 06:55:11,361][25689] Fps is (10 sec: 5586.3, 60 sec: 5754.5, 300 sec: 5747.8). Total num frames: 140175360. Throughput: 0: 5913.1. Samples: 140179806. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:11,361][25689] Avg episode reward: [(0, '-58.211')] [2022-07-09 06:55:11,458][26022] Updated weights on worker 0-0, policy_version 136891 (0.00089) [2022-07-09 06:55:13,551][26022] Updated weights on worker 0-0, policy_version 136901 (0.00083) [2022-07-09 06:55:14,947][26022] Updated weights on worker 0-0, policy_version 136911 (0.00087) [2022-07-09 06:55:16,370][25689] Fps is (10 sec: 5894.4, 60 sec: 5721.7, 300 sec: 5740.9). Total num frames: 140204032. Throughput: 0: 5076.8. Samples: 140197494. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:16,370][25689] Avg episode reward: [(0, '-58.170')] [2022-07-09 06:55:16,843][26022] Updated weights on worker 0-0, policy_version 136921 (0.00089) [2022-07-09 06:55:18,750][26022] Updated weights on worker 0-0, policy_version 136931 (0.00087) [2022-07-09 06:55:20,549][26022] Updated weights on worker 0-0, policy_version 136941 (0.00086) [2022-07-09 06:55:21,459][25689] Fps is (10 sec: 5476.0, 60 sec: 5681.8, 300 sec: 5733.3). Total num frames: 140230656. Throughput: 0: 5867.5. Samples: 140231054. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:21,459][25689] Avg episode reward: [(0, '-59.210')] [2022-07-09 06:55:22,426][26022] Updated weights on worker 0-0, policy_version 136951 (0.00080) [2022-07-09 06:55:24,318][26022] Updated weights on worker 0-0, policy_version 136961 (0.00093) [2022-07-09 06:55:25,867][26022] Updated weights on worker 0-0, policy_version 136971 (0.00093) [2022-07-09 06:55:26,495][25689] Fps is (10 sec: 5663.7, 60 sec: 5715.9, 300 sec: 5741.7). Total num frames: 140261376. Throughput: 0: 5971.1. Samples: 140265700. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:26,496][25689] Avg episode reward: [(0, '-59.245')] [2022-07-09 06:55:27,821][26022] Updated weights on worker 0-0, policy_version 136981 (0.00085) [2022-07-09 06:55:29,471][26022] Updated weights on worker 0-0, policy_version 136991 (0.00095) [2022-07-09 06:55:31,413][26022] Updated weights on worker 0-0, policy_version 137001 (0.00090) [2022-07-09 06:55:31,569][25689] Fps is (10 sec: 5773.2, 60 sec: 5684.0, 300 sec: 5733.7). Total num frames: 140289024. Throughput: 0: 5099.4. Samples: 140282854. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:31,569][25689] Avg episode reward: [(0, '-59.192')] [2022-07-09 06:55:33,170][26022] Updated weights on worker 0-0, policy_version 137011 (0.00094) [2022-07-09 06:55:34,883][26022] Updated weights on worker 0-0, policy_version 137021 (0.00085) [2022-07-09 06:55:36,599][25689] Fps is (10 sec: 5675.4, 60 sec: 5700.9, 300 sec: 5737.1). Total num frames: 140318720. Throughput: 0: 5919.8. Samples: 140317244. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:36,601][25689] Avg episode reward: [(0, '-58.989')] [2022-07-09 06:55:36,639][26022] Updated weights on worker 0-0, policy_version 137031 (0.00093) [2022-07-09 06:55:38,577][26022] Updated weights on worker 0-0, policy_version 137041 (0.00089) [2022-07-09 06:55:40,254][26022] Updated weights on worker 0-0, policy_version 137051 (0.00086) [2022-07-09 06:55:41,609][25689] Fps is (10 sec: 5915.2, 60 sec: 5718.7, 300 sec: 5741.3). Total num frames: 140348416. Throughput: 0: 6009.3. Samples: 140352146. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:41,612][25689] Avg episode reward: [(0, '-59.111')] [2022-07-09 06:55:41,953][26022] Updated weights on worker 0-0, policy_version 137061 (0.00082) [2022-07-09 06:55:43,739][26022] Updated weights on worker 0-0, policy_version 137071 (0.00082) [2022-07-09 06:55:45,336][26022] Updated weights on worker 0-0, policy_version 137081 (0.00081) [2022-07-09 06:55:46,612][25689] Fps is (10 sec: 5726.5, 60 sec: 5703.4, 300 sec: 5733.0). Total num frames: 140376064. Throughput: 0: 5179.5. Samples: 140369900. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:46,613][25689] Avg episode reward: [(0, '-59.271')] [2022-07-09 06:55:47,211][26022] Updated weights on worker 0-0, policy_version 137091 (0.00089) [2022-07-09 06:55:48,743][26022] Updated weights on worker 0-0, policy_version 137101 (0.00087) [2022-07-09 06:55:50,678][26022] Updated weights on worker 0-0, policy_version 137111 (0.00087) [2022-07-09 06:55:51,679][25689] Fps is (10 sec: 5796.5, 60 sec: 5706.8, 300 sec: 5736.7). Total num frames: 140406784. Throughput: 0: 6083.8. Samples: 140405200. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:51,679][25689] Avg episode reward: [(0, '-58.573')] [2022-07-09 06:55:52,349][26022] Updated weights on worker 0-0, policy_version 137121 (0.00078) [2022-07-09 06:55:54,347][26022] Updated weights on worker 0-0, policy_version 137131 (0.00088) [2022-07-09 06:55:55,964][26022] Updated weights on worker 0-0, policy_version 137141 (0.00077) [2022-07-09 06:55:56,707][25689] Fps is (10 sec: 5985.2, 60 sec: 5742.6, 300 sec: 5743.8). Total num frames: 140436480. Throughput: 0: 6102.2. Samples: 140439946. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:55:56,707][25689] Avg episode reward: [(0, '-58.371')] [2022-07-09 06:55:57,902][26022] Updated weights on worker 0-0, policy_version 137151 (0.00086) [2022-07-09 06:55:59,366][26022] Updated weights on worker 0-0, policy_version 137161 (0.00091) [2022-07-09 06:56:01,423][26022] Updated weights on worker 0-0, policy_version 137171 (0.00097) [2022-07-09 06:56:01,766][25689] Fps is (10 sec: 5684.8, 60 sec: 5705.1, 300 sec: 5739.4). Total num frames: 140464128. Throughput: 0: 6013.7. Samples: 140473362. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:01,766][25689] Avg episode reward: [(0, '-58.145')] [2022-07-09 06:56:03,283][26022] Updated weights on worker 0-0, policy_version 137181 (0.00086) [2022-07-09 06:56:05,163][26022] Updated weights on worker 0-0, policy_version 137191 (0.00092) [2022-07-09 06:56:06,783][25689] Fps is (10 sec: 5487.9, 60 sec: 5738.8, 300 sec: 5737.8). Total num frames: 140491776. Throughput: 0: 5940.6. Samples: 140489724. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:06,783][25689] Avg episode reward: [(0, '-57.646')] [2022-07-09 06:56:07,140][26022] Updated weights on worker 0-0, policy_version 137201 (0.00083) [2022-07-09 06:56:08,734][26022] Updated weights on worker 0-0, policy_version 137211 (0.00084) [2022-07-09 06:56:10,458][26022] Updated weights on worker 0-0, policy_version 137221 (0.00091) [2022-07-09 06:56:11,834][25689] Fps is (10 sec: 5593.7, 60 sec: 5706.4, 300 sec: 5733.7). Total num frames: 140520448. Throughput: 0: 5919.7. Samples: 140524516. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:11,835][25689] Avg episode reward: [(0, '-58.612')] [2022-07-09 06:56:12,318][26022] Updated weights on worker 0-0, policy_version 137231 (0.00148) [2022-07-09 06:56:13,887][26022] Updated weights on worker 0-0, policy_version 137241 (0.00088) [2022-07-09 06:56:15,719][26022] Updated weights on worker 0-0, policy_version 137251 (0.00094) [2022-07-09 06:56:16,923][25689] Fps is (10 sec: 5957.9, 60 sec: 5749.7, 300 sec: 5742.9). Total num frames: 140552192. Throughput: 0: 5913.7. Samples: 140559500. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:16,924][25689] Avg episode reward: [(0, '-58.349')] [2022-07-09 06:56:17,499][26022] Updated weights on worker 0-0, policy_version 137261 (0.00087) [2022-07-09 06:56:19,172][26022] Updated weights on worker 0-0, policy_version 137271 (0.00083) [2022-07-09 06:56:21,411][26022] Updated weights on worker 0-0, policy_version 137281 (0.00094) [2022-07-09 06:56:22,005][25689] Fps is (10 sec: 5839.5, 60 sec: 5767.2, 300 sec: 5732.7). Total num frames: 140579840. Throughput: 0: 5118.5. Samples: 140576956. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:22,006][25689] Avg episode reward: [(0, '-59.616')] [2022-07-09 06:56:22,641][26022] Updated weights on worker 0-0, policy_version 137291 (0.00091) [2022-07-09 06:56:24,646][26022] Updated weights on worker 0-0, policy_version 137301 (0.00080) [2022-07-09 06:56:26,338][26022] Updated weights on worker 0-0, policy_version 137311 (0.00049) [2022-07-09 06:56:27,037][25689] Fps is (10 sec: 5568.4, 60 sec: 5733.8, 300 sec: 5740.9). Total num frames: 140608512. Throughput: 0: 6035.0. Samples: 140611960. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:27,038][25689] Avg episode reward: [(0, '-59.219')] [2022-07-09 06:56:28,069][26022] Updated weights on worker 0-0, policy_version 137321 (0.00087) [2022-07-09 06:56:30,091][26022] Updated weights on worker 0-0, policy_version 137331 (0.00084) [2022-07-09 06:56:31,715][26022] Updated weights on worker 0-0, policy_version 137341 (0.00429) [2022-07-09 06:56:32,119][25689] Fps is (10 sec: 5771.2, 60 sec: 5766.9, 300 sec: 5737.2). Total num frames: 140638208. Throughput: 0: 6009.6. Samples: 140646416. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:32,119][25689] Avg episode reward: [(0, '-59.565')] [2022-07-09 06:56:33,650][26022] Updated weights on worker 0-0, policy_version 137351 (0.00091) [2022-07-09 06:56:34,241][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:56:34,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000137356_140652544.pth [2022-07-09 06:56:34,253][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000135334_138582016.pth [2022-07-09 06:56:35,390][26022] Updated weights on worker 0-0, policy_version 137361 (0.00080) [2022-07-09 06:56:37,028][26022] Updated weights on worker 0-0, policy_version 137371 (0.00088) [2022-07-09 06:56:37,159][25689] Fps is (10 sec: 6070.4, 60 sec: 5799.7, 300 sec: 5751.5). Total num frames: 140669952. Throughput: 0: 5147.9. Samples: 140663670. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:37,159][25689] Avg episode reward: [(0, '-58.459')] [2022-07-09 06:56:38,909][26022] Updated weights on worker 0-0, policy_version 137381 (0.00082) [2022-07-09 06:56:40,584][26022] Updated weights on worker 0-0, policy_version 137391 (0.00079) [2022-07-09 06:56:42,172][25689] Fps is (10 sec: 5907.7, 60 sec: 5765.7, 300 sec: 5746.1). Total num frames: 140697600. Throughput: 0: 6026.8. Samples: 140698498. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:42,172][25689] Avg episode reward: [(0, '-58.452')] [2022-07-09 06:56:42,355][26022] Updated weights on worker 0-0, policy_version 137401 (0.00090) [2022-07-09 06:56:44,111][26022] Updated weights on worker 0-0, policy_version 137411 (0.00078) [2022-07-09 06:56:45,793][26022] Updated weights on worker 0-0, policy_version 137421 (0.00086) [2022-07-09 06:56:47,212][25689] Fps is (10 sec: 5704.0, 60 sec: 5796.0, 300 sec: 5746.6). Total num frames: 140727296. Throughput: 0: 6039.0. Samples: 140733794. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:47,212][25689] Avg episode reward: [(0, '-57.259')] [2022-07-09 06:56:47,596][26022] Updated weights on worker 0-0, policy_version 137431 (0.00086) [2022-07-09 06:56:49,233][26022] Updated weights on worker 0-0, policy_version 137441 (0.00087) [2022-07-09 06:56:51,199][26022] Updated weights on worker 0-0, policy_version 137451 (0.00088) [2022-07-09 06:56:52,290][25689] Fps is (10 sec: 5870.0, 60 sec: 5777.9, 300 sec: 5742.5). Total num frames: 140756992. Throughput: 0: 5199.9. Samples: 140751304. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:52,291][25689] Avg episode reward: [(0, '-57.905')] [2022-07-09 06:56:52,819][26022] Updated weights on worker 0-0, policy_version 137461 (0.00085) [2022-07-09 06:56:54,598][26022] Updated weights on worker 0-0, policy_version 137471 (0.00079) [2022-07-09 06:56:56,483][26022] Updated weights on worker 0-0, policy_version 137481 (0.00084) [2022-07-09 06:56:57,332][25689] Fps is (10 sec: 5666.6, 60 sec: 5742.9, 300 sec: 5741.9). Total num frames: 140784640. Throughput: 0: 6071.6. Samples: 140786152. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2022-07-09 06:56:57,332][25689] Avg episode reward: [(0, '-58.267')] [2022-07-09 06:56:58,235][26022] Updated weights on worker 0-0, policy_version 137491 (0.00083) [2022-07-09 06:57:00,075][26022] Updated weights on worker 0-0, policy_version 137501 (0.00090) [2022-07-09 06:57:02,059][26022] Updated weights on worker 0-0, policy_version 137511 (0.00094) [2022-07-09 06:57:02,358][25689] Fps is (10 sec: 5390.3, 60 sec: 5729.0, 300 sec: 5738.3). Total num frames: 140811264. Throughput: 0: 5974.3. Samples: 140819098. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:02,359][25689] Avg episode reward: [(0, '-58.388')] [2022-07-09 06:57:03,855][26022] Updated weights on worker 0-0, policy_version 137521 (0.00096) [2022-07-09 06:57:05,618][26022] Updated weights on worker 0-0, policy_version 137531 (0.00087) [2022-07-09 06:57:07,308][26022] Updated weights on worker 0-0, policy_version 137541 (0.00090) [2022-07-09 06:57:07,378][25689] Fps is (10 sec: 5707.7, 60 sec: 5779.4, 300 sec: 5743.1). Total num frames: 140841984. Throughput: 0: 5077.4. Samples: 140836188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:07,379][25689] Avg episode reward: [(0, '-58.601')] [2022-07-09 06:57:09,117][26022] Updated weights on worker 0-0, policy_version 137551 (0.00084) [2022-07-09 06:57:10,885][26022] Updated weights on worker 0-0, policy_version 137561 (0.00081) [2022-07-09 06:57:12,447][25689] Fps is (10 sec: 5887.1, 60 sec: 5777.8, 300 sec: 5738.8). Total num frames: 140870656. Throughput: 0: 5949.7. Samples: 140871232. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:12,447][25689] Avg episode reward: [(0, '-59.289')] [2022-07-09 06:57:12,659][26022] Updated weights on worker 0-0, policy_version 137571 (0.00093) [2022-07-09 06:57:14,454][26022] Updated weights on worker 0-0, policy_version 137581 (0.00344) [2022-07-09 06:57:16,112][26022] Updated weights on worker 0-0, policy_version 137591 (0.00093) [2022-07-09 06:57:17,479][25689] Fps is (10 sec: 5677.5, 60 sec: 5732.5, 300 sec: 5738.4). Total num frames: 140899328. Throughput: 0: 5961.8. Samples: 140906266. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:17,479][25689] Avg episode reward: [(0, '-59.358')] [2022-07-09 06:57:17,918][26022] Updated weights on worker 0-0, policy_version 137601 (0.00109) [2022-07-09 06:57:19,660][26022] Updated weights on worker 0-0, policy_version 137611 (0.00085) [2022-07-09 06:57:21,430][26022] Updated weights on worker 0-0, policy_version 137621 (0.00091) [2022-07-09 06:57:22,495][25689] Fps is (10 sec: 5808.8, 60 sec: 5772.6, 300 sec: 5738.7). Total num frames: 140929024. Throughput: 0: 5183.8. Samples: 140923482. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:22,496][25689] Avg episode reward: [(0, '-59.090')] [2022-07-09 06:57:23,257][26022] Updated weights on worker 0-0, policy_version 137631 (0.00085) [2022-07-09 06:57:25,107][26022] Updated weights on worker 0-0, policy_version 137641 (0.00087) [2022-07-09 06:57:26,694][26022] Updated weights on worker 0-0, policy_version 137651 (0.00086) [2022-07-09 06:57:27,527][25689] Fps is (10 sec: 5910.6, 60 sec: 5789.6, 300 sec: 5740.6). Total num frames: 140958720. Throughput: 0: 6055.6. Samples: 140958200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:27,528][25689] Avg episode reward: [(0, '-59.470')] [2022-07-09 06:57:28,760][26022] Updated weights on worker 0-0, policy_version 137661 (0.00093) [2022-07-09 06:57:30,287][26022] Updated weights on worker 0-0, policy_version 137671 (0.00085) [2022-07-09 06:57:32,298][26022] Updated weights on worker 0-0, policy_version 137681 (0.00087) [2022-07-09 06:57:32,575][25689] Fps is (10 sec: 5790.7, 60 sec: 5775.8, 300 sec: 5740.2). Total num frames: 140987392. Throughput: 0: 6046.1. Samples: 140992926. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:32,575][25689] Avg episode reward: [(0, '-59.968')] [2022-07-09 06:57:33,836][26022] Updated weights on worker 0-0, policy_version 137691 (0.00085) [2022-07-09 06:57:35,691][26022] Updated weights on worker 0-0, policy_version 137701 (0.00086) [2022-07-09 06:57:37,420][26022] Updated weights on worker 0-0, policy_version 137711 (0.00091) [2022-07-09 06:57:37,630][25689] Fps is (10 sec: 5777.2, 60 sec: 5740.5, 300 sec: 5746.3). Total num frames: 141017088. Throughput: 0: 5152.7. Samples: 141010100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:37,631][25689] Avg episode reward: [(0, '-61.033')] [2022-07-09 06:57:39,297][26022] Updated weights on worker 0-0, policy_version 137721 (0.00080) [2022-07-09 06:57:41,018][26022] Updated weights on worker 0-0, policy_version 137731 (0.00093) [2022-07-09 06:57:42,658][25689] Fps is (10 sec: 5788.3, 60 sec: 5756.0, 300 sec: 5739.6). Total num frames: 141045760. Throughput: 0: 6022.9. Samples: 141044924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:42,660][25689] Avg episode reward: [(0, '-60.664')] [2022-07-09 06:57:42,671][26022] Updated weights on worker 0-0, policy_version 137741 (0.00086) [2022-07-09 06:57:44,679][26022] Updated weights on worker 0-0, policy_version 137751 (0.00085) [2022-07-09 06:57:46,370][26022] Updated weights on worker 0-0, policy_version 137761 (0.00087) [2022-07-09 06:57:47,663][25689] Fps is (10 sec: 5613.5, 60 sec: 5725.5, 300 sec: 5734.0). Total num frames: 141073408. Throughput: 0: 6027.6. Samples: 141079572. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:47,665][25689] Avg episode reward: [(0, '-60.769')] [2022-07-09 06:57:48,079][26022] Updated weights on worker 0-0, policy_version 137771 (0.00084) [2022-07-09 06:57:49,821][26022] Updated weights on worker 0-0, policy_version 137781 (0.00095) [2022-07-09 06:57:51,732][26022] Updated weights on worker 0-0, policy_version 137791 (0.00094) [2022-07-09 06:57:52,807][25689] Fps is (10 sec: 5751.6, 60 sec: 5736.2, 300 sec: 5745.2). Total num frames: 141104128. Throughput: 0: 5140.1. Samples: 141096920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:52,807][25689] Avg episode reward: [(0, '-61.120')] [2022-07-09 06:57:53,416][26022] Updated weights on worker 0-0, policy_version 137801 (0.00086) [2022-07-09 06:57:55,260][26022] Updated weights on worker 0-0, policy_version 137811 (0.00093) [2022-07-09 06:57:56,880][26022] Updated weights on worker 0-0, policy_version 137821 (0.00086) [2022-07-09 06:57:57,844][25689] Fps is (10 sec: 5732.9, 60 sec: 5736.6, 300 sec: 5731.2). Total num frames: 141131776. Throughput: 0: 6006.2. Samples: 141131510. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:57:57,846][25689] Avg episode reward: [(0, '-60.350')] [2022-07-09 06:57:58,863][26022] Updated weights on worker 0-0, policy_version 137831 (0.00086) [2022-07-09 06:58:00,545][26022] Updated weights on worker 0-0, policy_version 137841 (0.00091) [2022-07-09 06:58:02,847][26022] Updated weights on worker 0-0, policy_version 137851 (0.00067) [2022-07-09 06:58:02,851][25689] Fps is (10 sec: 5505.4, 60 sec: 5755.5, 300 sec: 5734.8). Total num frames: 141159424. Throughput: 0: 6002.1. Samples: 141166120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:02,851][25689] Avg episode reward: [(0, '-60.782')] [2022-07-09 06:58:04,519][26022] Updated weights on worker 0-0, policy_version 137861 (0.00091) [2022-07-09 06:58:06,412][26022] Updated weights on worker 0-0, policy_version 137871 (0.00091) [2022-07-09 06:58:07,859][25689] Fps is (10 sec: 5623.6, 60 sec: 5722.7, 300 sec: 5740.2). Total num frames: 141188096. Throughput: 0: 5024.9. Samples: 141181056. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:07,861][25689] Avg episode reward: [(0, '-60.176')] [2022-07-09 06:58:08,083][26022] Updated weights on worker 0-0, policy_version 137881 (0.00086) [2022-07-09 06:58:09,774][26022] Updated weights on worker 0-0, policy_version 137891 (0.00086) [2022-07-09 06:58:11,699][26022] Updated weights on worker 0-0, policy_version 137901 (0.00091) [2022-07-09 06:58:12,920][25689] Fps is (10 sec: 5695.0, 60 sec: 5723.4, 300 sec: 5736.0). Total num frames: 141216768. Throughput: 0: 5902.2. Samples: 141215632. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:12,920][25689] Avg episode reward: [(0, '-60.233')] [2022-07-09 06:58:13,435][26022] Updated weights on worker 0-0, policy_version 137911 (0.00096) [2022-07-09 06:58:15,095][26022] Updated weights on worker 0-0, policy_version 137921 (0.00089) [2022-07-09 06:58:16,840][26022] Updated weights on worker 0-0, policy_version 137931 (0.00078) [2022-07-09 06:58:17,973][25689] Fps is (10 sec: 5568.5, 60 sec: 5704.4, 300 sec: 5726.1). Total num frames: 141244416. Throughput: 0: 5908.8. Samples: 141250448. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:17,974][25689] Avg episode reward: [(0, '-59.377')] [2022-07-09 06:58:18,686][26022] Updated weights on worker 0-0, policy_version 137941 (0.00082) [2022-07-09 06:58:20,478][26022] Updated weights on worker 0-0, policy_version 137951 (0.00082) [2022-07-09 06:58:22,140][26022] Updated weights on worker 0-0, policy_version 137961 (0.00086) [2022-07-09 06:58:22,998][25689] Fps is (10 sec: 5791.2, 60 sec: 5720.5, 300 sec: 5729.0). Total num frames: 141275136. Throughput: 0: 5915.8. Samples: 141285312. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:22,999][25689] Avg episode reward: [(0, '-58.863')] [2022-07-09 06:58:24,041][26022] Updated weights on worker 0-0, policy_version 137971 (0.00084) [2022-07-09 06:58:25,880][26022] Updated weights on worker 0-0, policy_version 137981 (0.00090) [2022-07-09 06:58:27,866][26022] Updated weights on worker 0-0, policy_version 137991 (0.00088) [2022-07-09 06:58:28,044][25689] Fps is (10 sec: 5999.3, 60 sec: 5719.3, 300 sec: 5739.3). Total num frames: 141304832. Throughput: 0: 6011.5. Samples: 141302396. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:28,044][25689] Avg episode reward: [(0, '-60.029')] [2022-07-09 06:58:29,358][26022] Updated weights on worker 0-0, policy_version 138001 (0.00090) [2022-07-09 06:58:31,064][26022] Updated weights on worker 0-0, policy_version 138011 (0.00086) [2022-07-09 06:58:32,929][26022] Updated weights on worker 0-0, policy_version 138021 (0.00081) [2022-07-09 06:58:33,090][25689] Fps is (10 sec: 5783.8, 60 sec: 5719.4, 300 sec: 5728.5). Total num frames: 141333504. Throughput: 0: 6023.2. Samples: 141337122. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:33,091][25689] Avg episode reward: [(0, '-59.617')] [2022-07-09 06:58:34,274][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 06:58:34,288][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000138029_141341696.pth [2022-07-09 06:58:34,288][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000136010_139274240.pth [2022-07-09 06:58:34,807][26022] Updated weights on worker 0-0, policy_version 138031 (0.00086) [2022-07-09 06:58:36,499][26022] Updated weights on worker 0-0, policy_version 138041 (0.00086) [2022-07-09 06:58:38,102][25689] Fps is (10 sec: 5802.8, 60 sec: 5723.5, 300 sec: 5736.1). Total num frames: 141363200. Throughput: 0: 6041.8. Samples: 141372064. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:38,103][25689] Avg episode reward: [(0, '-60.421')] [2022-07-09 06:58:38,227][26022] Updated weights on worker 0-0, policy_version 138051 (0.00088) [2022-07-09 06:58:40,125][26022] Updated weights on worker 0-0, policy_version 138061 (0.00085) [2022-07-09 06:58:41,766][26022] Updated weights on worker 0-0, policy_version 138071 (0.00088) [2022-07-09 06:58:43,114][25689] Fps is (10 sec: 5720.7, 60 sec: 5708.1, 300 sec: 5734.1). Total num frames: 141390848. Throughput: 0: 5168.8. Samples: 141389286. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:43,115][25689] Avg episode reward: [(0, '-61.228')] [2022-07-09 06:58:43,688][26022] Updated weights on worker 0-0, policy_version 138081 (0.00085) [2022-07-09 06:58:45,254][26022] Updated weights on worker 0-0, policy_version 138091 (0.00078) [2022-07-09 06:58:47,186][26022] Updated weights on worker 0-0, policy_version 138101 (0.00090) [2022-07-09 06:58:48,133][25689] Fps is (10 sec: 5819.2, 60 sec: 5757.6, 300 sec: 5736.0). Total num frames: 141421568. Throughput: 0: 6065.3. Samples: 141424240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:48,133][25689] Avg episode reward: [(0, '-60.981')] [2022-07-09 06:58:48,772][26022] Updated weights on worker 0-0, policy_version 138111 (0.00059) [2022-07-09 06:58:50,671][26022] Updated weights on worker 0-0, policy_version 138121 (0.00090) [2022-07-09 06:58:52,359][26022] Updated weights on worker 0-0, policy_version 138131 (0.00090) [2022-07-09 06:58:53,279][25689] Fps is (10 sec: 5842.9, 60 sec: 5723.5, 300 sec: 5736.8). Total num frames: 141450240. Throughput: 0: 6025.1. Samples: 141458762. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:53,279][25689] Avg episode reward: [(0, '-60.189')] [2022-07-09 06:58:54,311][26022] Updated weights on worker 0-0, policy_version 138141 (0.00086) [2022-07-09 06:58:56,003][26022] Updated weights on worker 0-0, policy_version 138151 (0.00086) [2022-07-09 06:58:57,771][26022] Updated weights on worker 0-0, policy_version 138161 (0.00088) [2022-07-09 06:58:58,303][25689] Fps is (10 sec: 5638.3, 60 sec: 5741.7, 300 sec: 5733.3). Total num frames: 141478912. Throughput: 0: 5142.5. Samples: 141475946. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:58:58,304][25689] Avg episode reward: [(0, '-60.828')] [2022-07-09 06:58:59,495][26022] Updated weights on worker 0-0, policy_version 138171 (0.00088) [2022-07-09 06:59:01,258][26022] Updated weights on worker 0-0, policy_version 138181 (0.00087) [2022-07-09 06:59:03,267][26022] Updated weights on worker 0-0, policy_version 138191 (0.00097) [2022-07-09 06:59:03,311][25689] Fps is (10 sec: 5716.1, 60 sec: 5758.5, 300 sec: 5744.2). Total num frames: 141507584. Throughput: 0: 5983.6. Samples: 141510136. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 06:59:03,311][25689] Avg episode reward: [(0, '-60.450')] [2022-07-09 06:59:05,367][26022] Updated weights on worker 0-0, policy_version 138201 (0.00088) [2022-07-09 06:59:06,957][26022] Updated weights on worker 0-0, policy_version 138211 (0.00080) [2022-07-09 06:59:08,320][25689] Fps is (10 sec: 5520.3, 60 sec: 5724.6, 300 sec: 5738.6). Total num frames: 141534208. Throughput: 0: 5895.6. Samples: 141543256. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:08,322][25689] Avg episode reward: [(0, '-59.559')] [2022-07-09 06:59:08,956][26022] Updated weights on worker 0-0, policy_version 138221 (0.00079) [2022-07-09 06:59:10,551][26022] Updated weights on worker 0-0, policy_version 138231 (0.00088) [2022-07-09 06:59:12,389][26022] Updated weights on worker 0-0, policy_version 138241 (0.00095) [2022-07-09 06:59:13,427][25689] Fps is (10 sec: 5567.5, 60 sec: 5737.1, 300 sec: 5733.5). Total num frames: 141563904. Throughput: 0: 5043.8. Samples: 141560384. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:13,427][25689] Avg episode reward: [(0, '-59.013')] [2022-07-09 06:59:14,235][26022] Updated weights on worker 0-0, policy_version 138251 (0.00088) [2022-07-09 06:59:15,944][26022] Updated weights on worker 0-0, policy_version 138261 (0.00087) [2022-07-09 06:59:17,667][26022] Updated weights on worker 0-0, policy_version 138271 (0.00084) [2022-07-09 06:59:18,445][25689] Fps is (10 sec: 5967.2, 60 sec: 5791.3, 300 sec: 5740.5). Total num frames: 141594624. Throughput: 0: 5924.4. Samples: 141595272. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:18,446][25689] Avg episode reward: [(0, '-58.360')] [2022-07-09 06:59:19,547][26022] Updated weights on worker 0-0, policy_version 138281 (0.00084) [2022-07-09 06:59:21,191][26022] Updated weights on worker 0-0, policy_version 138291 (0.00084) [2022-07-09 06:59:23,172][26022] Updated weights on worker 0-0, policy_version 138301 (0.00061) [2022-07-09 06:59:23,453][25689] Fps is (10 sec: 5821.7, 60 sec: 5742.1, 300 sec: 5737.6). Total num frames: 141622272. Throughput: 0: 5954.5. Samples: 141630070. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:23,454][25689] Avg episode reward: [(0, '-57.931')] [2022-07-09 06:59:24,766][26022] Updated weights on worker 0-0, policy_version 138311 (0.00081) [2022-07-09 06:59:26,507][26022] Updated weights on worker 0-0, policy_version 138321 (0.00093) [2022-07-09 06:59:28,336][26022] Updated weights on worker 0-0, policy_version 138331 (0.00085) [2022-07-09 06:59:28,459][25689] Fps is (10 sec: 5624.0, 60 sec: 5728.9, 300 sec: 5735.9). Total num frames: 141650944. Throughput: 0: 5172.8. Samples: 141647430. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:28,459][25689] Avg episode reward: [(0, '-57.514')] [2022-07-09 06:59:30,086][26022] Updated weights on worker 0-0, policy_version 138341 (0.00082) [2022-07-09 06:59:31,906][26022] Updated weights on worker 0-0, policy_version 138351 (0.00049) [2022-07-09 06:59:33,501][26022] Updated weights on worker 0-0, policy_version 138361 (0.00086) [2022-07-09 06:59:33,599][25689] Fps is (10 sec: 5853.7, 60 sec: 5753.9, 300 sec: 5740.6). Total num frames: 141681664. Throughput: 0: 6051.6. Samples: 141682458. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:33,599][25689] Avg episode reward: [(0, '-56.718')] [2022-07-09 06:59:35,485][26022] Updated weights on worker 0-0, policy_version 138371 (0.00086) [2022-07-09 06:59:37,037][26022] Updated weights on worker 0-0, policy_version 138381 (0.00089) [2022-07-09 06:59:38,677][25689] Fps is (10 sec: 5812.3, 60 sec: 5730.7, 300 sec: 5739.5). Total num frames: 141710336. Throughput: 0: 6022.0. Samples: 141717112. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:38,678][25689] Avg episode reward: [(0, '-57.222')] [2022-07-09 06:59:38,905][26022] Updated weights on worker 0-0, policy_version 138391 (0.00079) [2022-07-09 06:59:40,716][26022] Updated weights on worker 0-0, policy_version 138401 (0.00091) [2022-07-09 06:59:42,551][26022] Updated weights on worker 0-0, policy_version 138411 (0.00093) [2022-07-09 06:59:43,721][25689] Fps is (10 sec: 5665.1, 60 sec: 5744.5, 300 sec: 5739.1). Total num frames: 141739008. Throughput: 0: 5153.6. Samples: 141734526. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:43,722][25689] Avg episode reward: [(0, '-57.596')] [2022-07-09 06:59:44,252][26022] Updated weights on worker 0-0, policy_version 138421 (0.00087) [2022-07-09 06:59:45,984][26022] Updated weights on worker 0-0, policy_version 138431 (0.00085) [2022-07-09 06:59:47,618][26022] Updated weights on worker 0-0, policy_version 138441 (0.00084) [2022-07-09 06:59:48,748][25689] Fps is (10 sec: 5795.5, 60 sec: 5726.8, 300 sec: 5737.1). Total num frames: 141768704. Throughput: 0: 6024.9. Samples: 141769674. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:48,749][25689] Avg episode reward: [(0, '-58.314')] [2022-07-09 06:59:49,519][26022] Updated weights on worker 0-0, policy_version 138451 (0.00080) [2022-07-09 06:59:51,264][26022] Updated weights on worker 0-0, policy_version 138461 (0.00085) [2022-07-09 06:59:52,999][26022] Updated weights on worker 0-0, policy_version 138471 (0.00086) [2022-07-09 06:59:53,806][25689] Fps is (10 sec: 5889.2, 60 sec: 5752.1, 300 sec: 5743.8). Total num frames: 141798400. Throughput: 0: 6033.4. Samples: 141804376. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:53,807][25689] Avg episode reward: [(0, '-57.760')] [2022-07-09 06:59:54,883][26022] Updated weights on worker 0-0, policy_version 138481 (0.00092) [2022-07-09 06:59:56,418][26022] Updated weights on worker 0-0, policy_version 138491 (0.00097) [2022-07-09 06:59:58,511][26022] Updated weights on worker 0-0, policy_version 138501 (0.00093) [2022-07-09 06:59:58,839][25689] Fps is (10 sec: 5784.2, 60 sec: 5751.3, 300 sec: 5740.1). Total num frames: 141827072. Throughput: 0: 5195.2. Samples: 141821860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 06:59:58,840][25689] Avg episode reward: [(0, '-58.869')] [2022-07-09 06:59:59,969][26022] Updated weights on worker 0-0, policy_version 138511 (0.00080) [2022-07-09 07:00:02,295][26022] Updated weights on worker 0-0, policy_version 138521 (0.00090) [2022-07-09 07:00:03,893][25689] Fps is (10 sec: 5583.3, 60 sec: 5730.0, 300 sec: 5746.2). Total num frames: 141854720. Throughput: 0: 5954.1. Samples: 141854632. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:03,894][25689] Avg episode reward: [(0, '-59.011')] [2022-07-09 07:00:03,914][26022] Updated weights on worker 0-0, policy_version 138531 (0.00082) [2022-07-09 07:00:05,802][26022] Updated weights on worker 0-0, policy_version 138541 (0.00083) [2022-07-09 07:00:07,658][26022] Updated weights on worker 0-0, policy_version 138551 (0.00085) [2022-07-09 07:00:08,903][25689] Fps is (10 sec: 5596.6, 60 sec: 5763.8, 300 sec: 5740.5). Total num frames: 141883392. Throughput: 0: 5909.5. Samples: 141888776. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:08,903][25689] Avg episode reward: [(0, '-59.374')] [2022-07-09 07:00:09,561][26022] Updated weights on worker 0-0, policy_version 138561 (0.00084) [2022-07-09 07:00:11,079][26022] Updated weights on worker 0-0, policy_version 138571 (0.00086) [2022-07-09 07:00:13,247][26022] Updated weights on worker 0-0, policy_version 138581 (0.00081) [2022-07-09 07:00:14,002][25689] Fps is (10 sec: 5875.5, 60 sec: 5781.4, 300 sec: 5745.6). Total num frames: 141914112. Throughput: 0: 5043.4. Samples: 141906230. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:14,002][25689] Avg episode reward: [(0, '-58.781')] [2022-07-09 07:00:14,571][26022] Updated weights on worker 0-0, policy_version 138591 (0.00087) [2022-07-09 07:00:16,494][26022] Updated weights on worker 0-0, policy_version 138601 (0.00084) [2022-07-09 07:00:18,095][26022] Updated weights on worker 0-0, policy_version 138611 (0.00083) [2022-07-09 07:00:19,065][25689] Fps is (10 sec: 5743.5, 60 sec: 5726.4, 300 sec: 5749.6). Total num frames: 141941760. Throughput: 0: 5910.1. Samples: 141941396. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:19,066][25689] Avg episode reward: [(0, '-58.550')] [2022-07-09 07:00:19,823][26022] Updated weights on worker 0-0, policy_version 138621 (0.00085) [2022-07-09 07:00:21,767][26022] Updated weights on worker 0-0, policy_version 138631 (0.00086) [2022-07-09 07:00:23,342][26022] Updated weights on worker 0-0, policy_version 138641 (0.00083) [2022-07-09 07:00:24,092][25689] Fps is (10 sec: 5682.9, 60 sec: 5758.3, 300 sec: 5746.3). Total num frames: 141971456. Throughput: 0: 6050.0. Samples: 141976836. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:24,093][25689] Avg episode reward: [(0, '-58.794')] [2022-07-09 07:00:25,140][26022] Updated weights on worker 0-0, policy_version 138651 (0.00086) [2022-07-09 07:00:26,908][26022] Updated weights on worker 0-0, policy_version 138661 (0.00085) [2022-07-09 07:00:28,634][26022] Updated weights on worker 0-0, policy_version 138671 (0.00085) [2022-07-09 07:00:29,111][25689] Fps is (10 sec: 6014.1, 60 sec: 5790.9, 300 sec: 5757.7). Total num frames: 142002176. Throughput: 0: 5214.5. Samples: 141994152. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:29,111][25689] Avg episode reward: [(0, '-58.434')] [2022-07-09 07:00:30,548][26022] Updated weights on worker 0-0, policy_version 138681 (0.00090) [2022-07-09 07:00:32,178][26022] Updated weights on worker 0-0, policy_version 138691 (0.00764) [2022-07-09 07:00:33,946][26022] Updated weights on worker 0-0, policy_version 138701 (0.00088) [2022-07-09 07:00:34,174][25689] Fps is (10 sec: 5891.0, 60 sec: 5764.5, 300 sec: 5753.6). Total num frames: 142030848. Throughput: 0: 6104.2. Samples: 142029368. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:34,175][25689] Avg episode reward: [(0, '-57.551')] [2022-07-09 07:00:34,413][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:00:34,430][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000138704_142032896.pth [2022-07-09 07:00:34,430][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000136683_139963392.pth [2022-07-09 07:00:35,742][26022] Updated weights on worker 0-0, policy_version 138711 (0.00569) [2022-07-09 07:00:37,458][26022] Updated weights on worker 0-0, policy_version 138721 (0.00083) [2022-07-09 07:00:39,195][25689] Fps is (10 sec: 5686.7, 60 sec: 5769.9, 300 sec: 5750.0). Total num frames: 142059520. Throughput: 0: 6126.7. Samples: 142064724. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:39,195][25689] Avg episode reward: [(0, '-57.665')] [2022-07-09 07:00:39,272][26022] Updated weights on worker 0-0, policy_version 138731 (0.00087) [2022-07-09 07:00:40,863][26022] Updated weights on worker 0-0, policy_version 138741 (0.00080) [2022-07-09 07:00:42,743][26022] Updated weights on worker 0-0, policy_version 138751 (0.00086) [2022-07-09 07:00:44,210][25689] Fps is (10 sec: 5816.1, 60 sec: 5789.7, 300 sec: 5756.6). Total num frames: 142089216. Throughput: 0: 5227.8. Samples: 142082006. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:44,212][25689] Avg episode reward: [(0, '-57.007')] [2022-07-09 07:00:44,519][26022] Updated weights on worker 0-0, policy_version 138761 (0.00087) [2022-07-09 07:00:46,251][26022] Updated weights on worker 0-0, policy_version 138771 (0.00084) [2022-07-09 07:00:47,993][26022] Updated weights on worker 0-0, policy_version 138781 (0.00088) [2022-07-09 07:00:49,215][25689] Fps is (10 sec: 5927.2, 60 sec: 5791.8, 300 sec: 5754.3). Total num frames: 142118912. Throughput: 0: 6117.5. Samples: 142117138. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:49,217][25689] Avg episode reward: [(0, '-57.002')] [2022-07-09 07:00:49,644][26022] Updated weights on worker 0-0, policy_version 138791 (0.00085) [2022-07-09 07:00:51,659][26022] Updated weights on worker 0-0, policy_version 138801 (0.00083) [2022-07-09 07:00:53,102][26022] Updated weights on worker 0-0, policy_version 138811 (0.00083) [2022-07-09 07:00:54,330][25689] Fps is (10 sec: 5767.4, 60 sec: 5769.3, 300 sec: 5749.2). Total num frames: 142147584. Throughput: 0: 6085.5. Samples: 142152026. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:54,331][25689] Avg episode reward: [(0, '-57.155')] [2022-07-09 07:00:55,180][26022] Updated weights on worker 0-0, policy_version 138821 (0.00085) [2022-07-09 07:00:56,691][26022] Updated weights on worker 0-0, policy_version 138831 (0.00096) [2022-07-09 07:00:58,610][26022] Updated weights on worker 0-0, policy_version 138841 (0.00095) [2022-07-09 07:00:59,363][25689] Fps is (10 sec: 5650.9, 60 sec: 5769.4, 300 sec: 5753.2). Total num frames: 142176256. Throughput: 0: 6051.9. Samples: 142186778. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:00:59,367][25689] Avg episode reward: [(0, '-57.371')] [2022-07-09 07:01:00,305][26022] Updated weights on worker 0-0, policy_version 138851 (0.00126) [2022-07-09 07:01:02,515][26022] Updated weights on worker 0-0, policy_version 138861 (0.00092) [2022-07-09 07:01:04,178][26022] Updated weights on worker 0-0, policy_version 138871 (0.00093) [2022-07-09 07:01:04,377][25689] Fps is (10 sec: 5605.7, 60 sec: 5773.2, 300 sec: 5753.2). Total num frames: 142203904. Throughput: 0: 5938.5. Samples: 142201768. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:01:04,378][25689] Avg episode reward: [(0, '-57.261')] [2022-07-09 07:01:06,197][26022] Updated weights on worker 0-0, policy_version 138881 (0.00083) [2022-07-09 07:01:08,055][26022] Updated weights on worker 0-0, policy_version 138891 (0.00084) [2022-07-09 07:01:09,387][25689] Fps is (10 sec: 5618.5, 60 sec: 5773.1, 300 sec: 5754.0). Total num frames: 142232576. Throughput: 0: 5917.0. Samples: 142236494. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:01:09,387][25689] Avg episode reward: [(0, '-57.527')] [2022-07-09 07:01:09,527][26022] Updated weights on worker 0-0, policy_version 138901 (0.00085) [2022-07-09 07:01:11,351][26022] Updated weights on worker 0-0, policy_version 138911 (0.00086) [2022-07-09 07:01:13,155][26022] Updated weights on worker 0-0, policy_version 138921 (0.00085) [2022-07-09 07:01:14,496][25689] Fps is (10 sec: 5768.3, 60 sec: 5755.3, 300 sec: 5746.8). Total num frames: 142262272. Throughput: 0: 5915.3. Samples: 142271312. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:14,496][25689] Avg episode reward: [(0, '-57.480')] [2022-07-09 07:01:14,992][26022] Updated weights on worker 0-0, policy_version 138931 (0.00089) [2022-07-09 07:01:16,863][26022] Updated weights on worker 0-0, policy_version 138941 (0.00083) [2022-07-09 07:01:18,275][26022] Updated weights on worker 0-0, policy_version 138951 (0.00092) [2022-07-09 07:01:19,535][25689] Fps is (10 sec: 5650.8, 60 sec: 5757.6, 300 sec: 5747.6). Total num frames: 142289920. Throughput: 0: 5053.9. Samples: 142288724. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:19,535][25689] Avg episode reward: [(0, '-57.710')] [2022-07-09 07:01:20,275][26022] Updated weights on worker 0-0, policy_version 138961 (0.00083) [2022-07-09 07:01:21,799][26022] Updated weights on worker 0-0, policy_version 138971 (0.00091) [2022-07-09 07:01:23,793][26022] Updated weights on worker 0-0, policy_version 138981 (0.00048) [2022-07-09 07:01:24,540][25689] Fps is (10 sec: 5811.1, 60 sec: 5776.6, 300 sec: 5755.0). Total num frames: 142320640. Throughput: 0: 6054.2. Samples: 142323838. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:24,540][25689] Avg episode reward: [(0, '-56.695')] [2022-07-09 07:01:25,270][26022] Updated weights on worker 0-0, policy_version 138991 (0.00086) [2022-07-09 07:01:27,265][26022] Updated weights on worker 0-0, policy_version 139001 (0.00084) [2022-07-09 07:01:29,223][26022] Updated weights on worker 0-0, policy_version 139011 (0.00085) [2022-07-09 07:01:29,574][25689] Fps is (10 sec: 5915.7, 60 sec: 5741.2, 300 sec: 5752.5). Total num frames: 142349312. Throughput: 0: 6047.0. Samples: 142358568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:29,575][25689] Avg episode reward: [(0, '-57.497')] [2022-07-09 07:01:30,743][26022] Updated weights on worker 0-0, policy_version 139021 (0.00085) [2022-07-09 07:01:32,607][26022] Updated weights on worker 0-0, policy_version 139031 (0.00082) [2022-07-09 07:01:34,351][26022] Updated weights on worker 0-0, policy_version 139041 (0.00084) [2022-07-09 07:01:34,625][25689] Fps is (10 sec: 5787.8, 60 sec: 5759.4, 300 sec: 5745.4). Total num frames: 142379008. Throughput: 0: 5201.5. Samples: 142376012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:34,625][25689] Avg episode reward: [(0, '-57.364')] [2022-07-09 07:01:36,028][26022] Updated weights on worker 0-0, policy_version 139051 (0.00083) [2022-07-09 07:01:38,019][26022] Updated weights on worker 0-0, policy_version 139061 (0.00091) [2022-07-09 07:01:39,574][26022] Updated weights on worker 0-0, policy_version 139071 (0.00097) [2022-07-09 07:01:39,636][25689] Fps is (10 sec: 5902.8, 60 sec: 5777.2, 300 sec: 5752.3). Total num frames: 142408704. Throughput: 0: 6073.9. Samples: 142410818. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:39,637][25689] Avg episode reward: [(0, '-57.716')] [2022-07-09 07:01:41,706][26022] Updated weights on worker 0-0, policy_version 139081 (0.00087) [2022-07-09 07:01:43,287][26022] Updated weights on worker 0-0, policy_version 139091 (0.00079) [2022-07-09 07:01:44,656][25689] Fps is (10 sec: 5614.3, 60 sec: 5725.9, 300 sec: 5742.4). Total num frames: 142435328. Throughput: 0: 5995.2. Samples: 142444438. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:44,657][25689] Avg episode reward: [(0, '-57.809')] [2022-07-09 07:01:45,144][26022] Updated weights on worker 0-0, policy_version 139101 (0.00090) [2022-07-09 07:01:46,862][26022] Updated weights on worker 0-0, policy_version 139111 (0.00085) [2022-07-09 07:01:48,798][26022] Updated weights on worker 0-0, policy_version 139121 (0.00091) [2022-07-09 07:01:49,666][25689] Fps is (10 sec: 5513.2, 60 sec: 5708.6, 300 sec: 5740.2). Total num frames: 142464000. Throughput: 0: 5120.2. Samples: 142461440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:49,666][25689] Avg episode reward: [(0, '-57.843')] [2022-07-09 07:01:50,485][26022] Updated weights on worker 0-0, policy_version 139131 (0.00084) [2022-07-09 07:01:52,515][26022] Updated weights on worker 0-0, policy_version 139141 (0.00083) [2022-07-09 07:01:53,987][26022] Updated weights on worker 0-0, policy_version 139151 (0.00092) [2022-07-09 07:01:54,831][25689] Fps is (10 sec: 5736.7, 60 sec: 5720.8, 300 sec: 5744.7). Total num frames: 142493696. Throughput: 0: 5926.0. Samples: 142495754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:54,831][25689] Avg episode reward: [(0, '-58.043')] [2022-07-09 07:01:56,148][26022] Updated weights on worker 0-0, policy_version 139161 (0.00089) [2022-07-09 07:01:57,729][26022] Updated weights on worker 0-0, policy_version 139171 (0.00096) [2022-07-09 07:01:59,579][26022] Updated weights on worker 0-0, policy_version 139181 (0.00091) [2022-07-09 07:01:59,907][25689] Fps is (10 sec: 5699.5, 60 sec: 5716.7, 300 sec: 5750.7). Total num frames: 142522368. Throughput: 0: 5877.3. Samples: 142529954. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:01:59,907][25689] Avg episode reward: [(0, '-58.240')] [2022-07-09 07:02:01,421][26022] Updated weights on worker 0-0, policy_version 139191 (0.00086) [2022-07-09 07:02:03,440][26022] Updated weights on worker 0-0, policy_version 139201 (0.00094) [2022-07-09 07:02:04,930][25689] Fps is (10 sec: 5475.1, 60 sec: 5698.9, 300 sec: 5736.8). Total num frames: 142548992. Throughput: 0: 4993.6. Samples: 142545678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:04,931][25689] Avg episode reward: [(0, '-59.350')] [2022-07-09 07:02:05,279][26022] Updated weights on worker 0-0, policy_version 139211 (0.00086) [2022-07-09 07:02:07,167][26022] Updated weights on worker 0-0, policy_version 139221 (0.00085) [2022-07-09 07:02:08,754][26022] Updated weights on worker 0-0, policy_version 139231 (0.00087) [2022-07-09 07:02:09,961][25689] Fps is (10 sec: 5499.7, 60 sec: 5696.9, 300 sec: 5737.6). Total num frames: 142577664. Throughput: 0: 5814.3. Samples: 142579442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:09,962][25689] Avg episode reward: [(0, '-59.179')] [2022-07-09 07:02:10,957][26022] Updated weights on worker 0-0, policy_version 139241 (0.00082) [2022-07-09 07:02:12,252][26022] Updated weights on worker 0-0, policy_version 139251 (0.00085) [2022-07-09 07:02:14,424][26022] Updated weights on worker 0-0, policy_version 139261 (0.00090) [2022-07-09 07:02:15,059][25689] Fps is (10 sec: 5863.9, 60 sec: 5714.9, 300 sec: 5743.2). Total num frames: 142608384. Throughput: 0: 5837.9. Samples: 142613842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:15,059][25689] Avg episode reward: [(0, '-59.830')] [2022-07-09 07:02:16,027][26022] Updated weights on worker 0-0, policy_version 139271 (0.00081) [2022-07-09 07:02:17,775][26022] Updated weights on worker 0-0, policy_version 139281 (0.00084) [2022-07-09 07:02:19,613][26022] Updated weights on worker 0-0, policy_version 139291 (0.00085) [2022-07-09 07:02:20,124][25689] Fps is (10 sec: 5844.2, 60 sec: 5729.4, 300 sec: 5738.8). Total num frames: 142637056. Throughput: 0: 5011.9. Samples: 142631280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:20,124][25689] Avg episode reward: [(0, '-59.595')] [2022-07-09 07:02:21,258][26022] Updated weights on worker 0-0, policy_version 139301 (0.00082) [2022-07-09 07:02:23,091][26022] Updated weights on worker 0-0, policy_version 139311 (0.00095) [2022-07-09 07:02:24,876][26022] Updated weights on worker 0-0, policy_version 139321 (0.00082) [2022-07-09 07:02:25,203][25689] Fps is (10 sec: 5653.0, 60 sec: 5688.6, 300 sec: 5734.5). Total num frames: 142665728. Throughput: 0: 5939.7. Samples: 142666090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:25,203][25689] Avg episode reward: [(0, '-58.831')] [2022-07-09 07:02:26,480][26022] Updated weights on worker 0-0, policy_version 139331 (0.00085) [2022-07-09 07:02:28,533][26022] Updated weights on worker 0-0, policy_version 139341 (0.00108) [2022-07-09 07:02:30,180][26022] Updated weights on worker 0-0, policy_version 139351 (0.00084) [2022-07-09 07:02:30,230][25689] Fps is (10 sec: 5775.4, 60 sec: 5706.2, 300 sec: 5738.3). Total num frames: 142695424. Throughput: 0: 5994.5. Samples: 142700944. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:30,231][25689] Avg episode reward: [(0, '-58.340')] [2022-07-09 07:02:31,952][26022] Updated weights on worker 0-0, policy_version 139361 (0.00083) [2022-07-09 07:02:33,783][26022] Updated weights on worker 0-0, policy_version 139371 (0.00091) [2022-07-09 07:02:34,608][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:02:34,631][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000139375_142720000.pth [2022-07-09 07:02:34,632][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000137356_140652544.pth [2022-07-09 07:02:35,268][25689] Fps is (10 sec: 5900.8, 60 sec: 5707.4, 300 sec: 5738.6). Total num frames: 142725120. Throughput: 0: 5171.7. Samples: 142718358. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:35,270][25689] Avg episode reward: [(0, '-58.047')] [2022-07-09 07:02:35,365][26022] Updated weights on worker 0-0, policy_version 139381 (0.00087) [2022-07-09 07:02:37,350][26022] Updated weights on worker 0-0, policy_version 139391 (0.00090) [2022-07-09 07:02:39,067][26022] Updated weights on worker 0-0, policy_version 139401 (0.00100) [2022-07-09 07:02:40,271][25689] Fps is (10 sec: 5812.8, 60 sec: 5691.2, 300 sec: 5739.1). Total num frames: 142753792. Throughput: 0: 6059.7. Samples: 142753368. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:40,273][25689] Avg episode reward: [(0, '-58.417')] [2022-07-09 07:02:40,785][26022] Updated weights on worker 0-0, policy_version 139411 (0.00085) [2022-07-09 07:02:42,798][26022] Updated weights on worker 0-0, policy_version 139421 (0.00098) [2022-07-09 07:02:44,173][26022] Updated weights on worker 0-0, policy_version 139431 (0.00094) [2022-07-09 07:02:45,276][25689] Fps is (10 sec: 5729.6, 60 sec: 5726.4, 300 sec: 5742.5). Total num frames: 142782464. Throughput: 0: 6077.8. Samples: 142788092. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:45,277][25689] Avg episode reward: [(0, '-57.379')] [2022-07-09 07:02:46,159][26022] Updated weights on worker 0-0, policy_version 139441 (0.00084) [2022-07-09 07:02:47,672][26022] Updated weights on worker 0-0, policy_version 139451 (0.00086) [2022-07-09 07:02:49,411][26022] Updated weights on worker 0-0, policy_version 139461 (0.00084) [2022-07-09 07:02:50,281][25689] Fps is (10 sec: 5728.6, 60 sec: 5726.9, 300 sec: 5738.3). Total num frames: 142811136. Throughput: 0: 5224.5. Samples: 142805702. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:50,283][25689] Avg episode reward: [(0, '-57.153')] [2022-07-09 07:02:51,395][26022] Updated weights on worker 0-0, policy_version 139471 (0.00088) [2022-07-09 07:02:53,031][26022] Updated weights on worker 0-0, policy_version 139481 (0.00080) [2022-07-09 07:02:54,867][26022] Updated weights on worker 0-0, policy_version 139491 (0.00093) [2022-07-09 07:02:55,360][25689] Fps is (10 sec: 5991.5, 60 sec: 5768.9, 300 sec: 5751.3). Total num frames: 142842880. Throughput: 0: 6097.1. Samples: 142840860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:02:55,362][25689] Avg episode reward: [(0, '-57.869')] [2022-07-09 07:02:56,706][26022] Updated weights on worker 0-0, policy_version 139501 (0.00087) [2022-07-09 07:02:58,436][26022] Updated weights on worker 0-0, policy_version 139511 (0.00087) [2022-07-09 07:03:00,282][26022] Updated weights on worker 0-0, policy_version 139521 (0.00089) [2022-07-09 07:03:00,371][25689] Fps is (10 sec: 5785.1, 60 sec: 5741.2, 300 sec: 5747.8). Total num frames: 142869504. Throughput: 0: 6077.3. Samples: 142875518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:03:00,371][25689] Avg episode reward: [(0, '-58.440')] [2022-07-09 07:03:02,104][26022] Updated weights on worker 0-0, policy_version 139531 (0.00098) [2022-07-09 07:03:04,063][26022] Updated weights on worker 0-0, policy_version 139541 (0.00083) [2022-07-09 07:03:05,375][25689] Fps is (10 sec: 5316.4, 60 sec: 5743.0, 300 sec: 5741.0). Total num frames: 142896128. Throughput: 0: 5114.4. Samples: 142890888. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:03:05,376][25689] Avg episode reward: [(0, '-59.099')] [2022-07-09 07:03:05,808][26022] Updated weights on worker 0-0, policy_version 139551 (0.00087) [2022-07-09 07:03:07,477][26022] Updated weights on worker 0-0, policy_version 139561 (0.00101) [2022-07-09 07:03:09,284][26022] Updated weights on worker 0-0, policy_version 139571 (0.00087) [2022-07-09 07:03:10,393][25689] Fps is (10 sec: 5619.5, 60 sec: 5761.2, 300 sec: 5745.2). Total num frames: 142925824. Throughput: 0: 5965.3. Samples: 142925674. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:03:10,395][25689] Avg episode reward: [(0, '-59.160')] [2022-07-09 07:03:11,211][26022] Updated weights on worker 0-0, policy_version 139581 (0.00088) [2022-07-09 07:03:12,866][26022] Updated weights on worker 0-0, policy_version 139591 (0.00091) [2022-07-09 07:03:14,763][26022] Updated weights on worker 0-0, policy_version 139601 (0.00088) [2022-07-09 07:03:15,459][25689] Fps is (10 sec: 5889.7, 60 sec: 5747.2, 300 sec: 5751.9). Total num frames: 142955520. Throughput: 0: 5919.6. Samples: 142959842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 07:03:15,460][25689] Avg episode reward: [(0, '-59.673')] [2022-07-09 07:03:16,628][26022] Updated weights on worker 0-0, policy_version 139611 (0.00097) [2022-07-09 07:03:18,248][26022] Updated weights on worker 0-0, policy_version 139621 (0.00079) [2022-07-09 07:03:19,999][26022] Updated weights on worker 0-0, policy_version 139631 (0.00088) [2022-07-09 07:03:20,529][25689] Fps is (10 sec: 5657.5, 60 sec: 5729.9, 300 sec: 5740.7). Total num frames: 142983168. Throughput: 0: 5046.8. Samples: 142977252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:20,529][25689] Avg episode reward: [(0, '-58.598')] [2022-07-09 07:03:21,736][26022] Updated weights on worker 0-0, policy_version 139641 (0.00082) [2022-07-09 07:03:23,614][26022] Updated weights on worker 0-0, policy_version 139651 (0.00051) [2022-07-09 07:03:25,258][26022] Updated weights on worker 0-0, policy_version 139661 (0.00081) [2022-07-09 07:03:25,539][25689] Fps is (10 sec: 5790.9, 60 sec: 5770.3, 300 sec: 5744.8). Total num frames: 143013888. Throughput: 0: 6029.6. Samples: 143012466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:25,539][25689] Avg episode reward: [(0, '-57.793')] [2022-07-09 07:03:27,167][26022] Updated weights on worker 0-0, policy_version 139671 (0.00085) [2022-07-09 07:03:28,921][26022] Updated weights on worker 0-0, policy_version 139681 (0.00092) [2022-07-09 07:03:30,545][25689] Fps is (10 sec: 5929.5, 60 sec: 5755.4, 300 sec: 5745.6). Total num frames: 143042560. Throughput: 0: 6034.6. Samples: 143047286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:30,546][25689] Avg episode reward: [(0, '-57.310')] [2022-07-09 07:03:30,557][26022] Updated weights on worker 0-0, policy_version 139691 (0.00087) [2022-07-09 07:03:32,209][26022] Updated weights on worker 0-0, policy_version 139701 (0.00085) [2022-07-09 07:03:34,306][26022] Updated weights on worker 0-0, policy_version 139711 (0.00084) [2022-07-09 07:03:35,613][25689] Fps is (10 sec: 5895.3, 60 sec: 5769.4, 300 sec: 5748.0). Total num frames: 143073280. Throughput: 0: 5202.7. Samples: 143064696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:35,614][25689] Avg episode reward: [(0, '-57.279')] [2022-07-09 07:03:35,835][26022] Updated weights on worker 0-0, policy_version 139721 (0.00886) [2022-07-09 07:03:37,781][26022] Updated weights on worker 0-0, policy_version 139731 (0.00087) [2022-07-09 07:03:39,303][26022] Updated weights on worker 0-0, policy_version 139741 (0.00084) [2022-07-09 07:03:40,676][25689] Fps is (10 sec: 5761.6, 60 sec: 5746.9, 300 sec: 5747.0). Total num frames: 143100928. Throughput: 0: 6079.1. Samples: 143099726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:40,676][25689] Avg episode reward: [(0, '-57.452')] [2022-07-09 07:03:41,334][26022] Updated weights on worker 0-0, policy_version 139751 (0.00089) [2022-07-09 07:03:43,021][26022] Updated weights on worker 0-0, policy_version 139761 (0.00089) [2022-07-09 07:03:44,943][26022] Updated weights on worker 0-0, policy_version 139771 (0.00082) [2022-07-09 07:03:45,714][25689] Fps is (10 sec: 5677.0, 60 sec: 5760.6, 300 sec: 5743.1). Total num frames: 143130624. Throughput: 0: 6043.7. Samples: 143134402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:45,715][25689] Avg episode reward: [(0, '-57.101')] [2022-07-09 07:03:46,407][26022] Updated weights on worker 0-0, policy_version 139781 (0.00089) [2022-07-09 07:03:48,340][26022] Updated weights on worker 0-0, policy_version 139791 (0.00090) [2022-07-09 07:03:50,142][26022] Updated weights on worker 0-0, policy_version 139801 (0.00087) [2022-07-09 07:03:50,777][25689] Fps is (10 sec: 5778.0, 60 sec: 5755.1, 300 sec: 5744.8). Total num frames: 143159296. Throughput: 0: 5164.4. Samples: 143151774. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:50,778][25689] Avg episode reward: [(0, '-58.353')] [2022-07-09 07:03:51,966][26022] Updated weights on worker 0-0, policy_version 139811 (0.00098) [2022-07-09 07:03:53,638][26022] Updated weights on worker 0-0, policy_version 139821 (0.00079) [2022-07-09 07:03:55,612][26022] Updated weights on worker 0-0, policy_version 139831 (0.00096) [2022-07-09 07:03:55,858][25689] Fps is (10 sec: 5552.2, 60 sec: 5687.1, 300 sec: 5740.2). Total num frames: 143186944. Throughput: 0: 5996.4. Samples: 143186092. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:03:55,859][25689] Avg episode reward: [(0, '-58.424')] [2022-07-09 07:03:57,215][26022] Updated weights on worker 0-0, policy_version 139841 (0.00088) [2022-07-09 07:03:59,139][26022] Updated weights on worker 0-0, policy_version 139851 (0.00086) [2022-07-09 07:04:00,626][26022] Updated weights on worker 0-0, policy_version 139861 (0.00092) [2022-07-09 07:04:00,872][25689] Fps is (10 sec: 5782.3, 60 sec: 5754.6, 300 sec: 5747.0). Total num frames: 143217664. Throughput: 0: 5998.6. Samples: 143220874. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:00,872][25689] Avg episode reward: [(0, '-57.090')] [2022-07-09 07:04:02,880][26022] Updated weights on worker 0-0, policy_version 139871 (0.00088) [2022-07-09 07:04:04,805][26022] Updated weights on worker 0-0, policy_version 139881 (0.00086) [2022-07-09 07:04:05,928][25689] Fps is (10 sec: 5694.9, 60 sec: 5749.7, 300 sec: 5746.1). Total num frames: 143244288. Throughput: 0: 5899.7. Samples: 143253654. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:05,928][25689] Avg episode reward: [(0, '-56.829')] [2022-07-09 07:04:06,452][26022] Updated weights on worker 0-0, policy_version 139891 (0.00086) [2022-07-09 07:04:08,312][26022] Updated weights on worker 0-0, policy_version 139901 (0.00082) [2022-07-09 07:04:09,943][26022] Updated weights on worker 0-0, policy_version 139911 (0.00085) [2022-07-09 07:04:10,974][25689] Fps is (10 sec: 5372.4, 60 sec: 5713.2, 300 sec: 5740.4). Total num frames: 143271936. Throughput: 0: 5903.4. Samples: 143271000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:10,974][25689] Avg episode reward: [(0, '-57.156')] [2022-07-09 07:04:11,591][26022] Updated weights on worker 0-0, policy_version 139921 (0.00083) [2022-07-09 07:04:13,934][26022] Updated weights on worker 0-0, policy_version 139931 (0.00087) [2022-07-09 07:04:15,270][26022] Updated weights on worker 0-0, policy_version 139941 (0.00091) [2022-07-09 07:04:16,028][25689] Fps is (10 sec: 5880.1, 60 sec: 5748.2, 300 sec: 5743.1). Total num frames: 143303680. Throughput: 0: 5920.9. Samples: 143305516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:16,030][25689] Avg episode reward: [(0, '-57.776')] [2022-07-09 07:04:17,295][26022] Updated weights on worker 0-0, policy_version 139951 (0.00089) [2022-07-09 07:04:18,863][26022] Updated weights on worker 0-0, policy_version 139961 (0.00082) [2022-07-09 07:04:20,638][26022] Updated weights on worker 0-0, policy_version 139971 (0.00094) [2022-07-09 07:04:21,037][25689] Fps is (10 sec: 6003.9, 60 sec: 5770.9, 300 sec: 5746.5). Total num frames: 143332352. Throughput: 0: 5936.7. Samples: 143340586. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:21,038][25689] Avg episode reward: [(0, '-58.186')] [2022-07-09 07:04:22,364][26022] Updated weights on worker 0-0, policy_version 139981 (0.00091) [2022-07-09 07:04:24,200][26022] Updated weights on worker 0-0, policy_version 139991 (0.00100) [2022-07-09 07:04:25,977][26022] Updated weights on worker 0-0, policy_version 140001 (0.00099) [2022-07-09 07:04:26,052][25689] Fps is (10 sec: 5822.8, 60 sec: 5753.4, 300 sec: 5749.8). Total num frames: 143362048. Throughput: 0: 5190.6. Samples: 143358112. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:26,053][25689] Avg episode reward: [(0, '-58.701')] [2022-07-09 07:04:27,826][26022] Updated weights on worker 0-0, policy_version 140011 (0.00102) [2022-07-09 07:04:29,259][26022] Updated weights on worker 0-0, policy_version 140021 (0.00084) [2022-07-09 07:04:31,055][25689] Fps is (10 sec: 5723.9, 60 sec: 5736.9, 300 sec: 5742.1). Total num frames: 143389696. Throughput: 0: 6076.6. Samples: 143393024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:31,056][25689] Avg episode reward: [(0, '-59.112')] [2022-07-09 07:04:31,232][26022] Updated weights on worker 0-0, policy_version 140031 (0.00085) [2022-07-09 07:04:32,881][26022] Updated weights on worker 0-0, policy_version 140041 (0.00086) [2022-07-09 07:04:34,823][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:04:34,834][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000140051_143412224.pth [2022-07-09 07:04:34,835][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000138029_141341696.pth [2022-07-09 07:04:34,839][26022] Updated weights on worker 0-0, policy_version 140051 (0.00079) [2022-07-09 07:04:36,142][25689] Fps is (10 sec: 5683.4, 60 sec: 5718.2, 300 sec: 5745.4). Total num frames: 143419392. Throughput: 0: 6077.9. Samples: 143427766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:36,142][25689] Avg episode reward: [(0, '-59.068')] [2022-07-09 07:04:36,383][26022] Updated weights on worker 0-0, policy_version 140061 (0.00083) [2022-07-09 07:04:38,367][26022] Updated weights on worker 0-0, policy_version 140071 (0.00098) [2022-07-09 07:04:40,055][26022] Updated weights on worker 0-0, policy_version 140081 (0.00089) [2022-07-09 07:04:41,148][25689] Fps is (10 sec: 5782.7, 60 sec: 5740.4, 300 sec: 5746.1). Total num frames: 143448064. Throughput: 0: 5199.2. Samples: 143445152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:41,149][25689] Avg episode reward: [(0, '-58.355')] [2022-07-09 07:04:42,025][26022] Updated weights on worker 0-0, policy_version 140091 (0.00081) [2022-07-09 07:04:43,600][26022] Updated weights on worker 0-0, policy_version 140101 (0.00053) [2022-07-09 07:04:45,632][26022] Updated weights on worker 0-0, policy_version 140111 (0.00091) [2022-07-09 07:04:46,239][25689] Fps is (10 sec: 5780.6, 60 sec: 5735.4, 300 sec: 5744.9). Total num frames: 143477760. Throughput: 0: 6016.4. Samples: 143479566. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:46,240][25689] Avg episode reward: [(0, '-57.716')] [2022-07-09 07:04:47,239][26022] Updated weights on worker 0-0, policy_version 140121 (0.00088) [2022-07-09 07:04:48,931][26022] Updated weights on worker 0-0, policy_version 140131 (0.00084) [2022-07-09 07:04:50,756][26022] Updated weights on worker 0-0, policy_version 140141 (0.00085) [2022-07-09 07:04:51,288][25689] Fps is (10 sec: 5857.5, 60 sec: 5753.7, 300 sec: 5745.1). Total num frames: 143507456. Throughput: 0: 5998.9. Samples: 143514400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:51,288][25689] Avg episode reward: [(0, '-58.298')] [2022-07-09 07:04:52,611][26022] Updated weights on worker 0-0, policy_version 140151 (0.00081) [2022-07-09 07:04:54,227][26022] Updated weights on worker 0-0, policy_version 140161 (0.00085) [2022-07-09 07:04:56,151][26022] Updated weights on worker 0-0, policy_version 140171 (0.00097) [2022-07-09 07:04:56,371][25689] Fps is (10 sec: 5760.7, 60 sec: 5770.4, 300 sec: 5744.1). Total num frames: 143536128. Throughput: 0: 5141.9. Samples: 143531782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:04:56,372][25689] Avg episode reward: [(0, '-58.250')] [2022-07-09 07:04:57,800][26022] Updated weights on worker 0-0, policy_version 140181 (0.00083) [2022-07-09 07:04:59,612][26022] Updated weights on worker 0-0, policy_version 140191 (0.00090) [2022-07-09 07:05:01,373][25689] Fps is (10 sec: 5685.9, 60 sec: 5737.6, 300 sec: 5748.6). Total num frames: 143564800. Throughput: 0: 5993.5. Samples: 143566368. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:05:01,375][25689] Avg episode reward: [(0, '-58.617')] [2022-07-09 07:05:01,608][26022] Updated weights on worker 0-0, policy_version 140201 (0.00104) [2022-07-09 07:05:03,537][26022] Updated weights on worker 0-0, policy_version 140211 (0.00086) [2022-07-09 07:05:05,528][26022] Updated weights on worker 0-0, policy_version 140221 (0.00084) [2022-07-09 07:05:06,383][25689] Fps is (10 sec: 5625.2, 60 sec: 5758.9, 300 sec: 5745.1). Total num frames: 143592448. Throughput: 0: 5902.5. Samples: 143598466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:05:06,384][25689] Avg episode reward: [(0, '-58.975')] [2022-07-09 07:05:07,303][26022] Updated weights on worker 0-0, policy_version 140231 (0.00086) [2022-07-09 07:05:09,043][26022] Updated weights on worker 0-0, policy_version 140241 (0.00082) [2022-07-09 07:05:10,874][26022] Updated weights on worker 0-0, policy_version 140251 (0.00086) [2022-07-09 07:05:11,431][25689] Fps is (10 sec: 5599.6, 60 sec: 5775.7, 300 sec: 5739.2). Total num frames: 143621120. Throughput: 0: 5014.0. Samples: 143615398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:05:11,431][25689] Avg episode reward: [(0, '-58.027')] [2022-07-09 07:05:12,529][26022] Updated weights on worker 0-0, policy_version 140261 (0.00087) [2022-07-09 07:05:14,340][26022] Updated weights on worker 0-0, policy_version 140271 (0.00094) [2022-07-09 07:05:16,260][26022] Updated weights on worker 0-0, policy_version 140281 (0.00092) [2022-07-09 07:05:16,514][25689] Fps is (10 sec: 5559.1, 60 sec: 5705.2, 300 sec: 5738.8). Total num frames: 143648768. Throughput: 0: 5855.7. Samples: 143649734. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:05:16,515][25689] Avg episode reward: [(0, '-58.280')] [2022-07-09 07:05:17,796][26022] Updated weights on worker 0-0, policy_version 140291 (0.00077) [2022-07-09 07:05:19,713][26022] Updated weights on worker 0-0, policy_version 140301 (0.00088) [2022-07-09 07:05:21,511][26022] Updated weights on worker 0-0, policy_version 140311 (0.00086) [2022-07-09 07:05:21,604][25689] Fps is (10 sec: 5636.9, 60 sec: 5714.5, 300 sec: 5737.6). Total num frames: 143678464. Throughput: 0: 5851.4. Samples: 143684746. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:05:21,605][25689] Avg episode reward: [(0, '-57.252')] [2022-07-09 07:05:23,175][26022] Updated weights on worker 0-0, policy_version 140321 (0.00086) [2022-07-09 07:05:25,021][26022] Updated weights on worker 0-0, policy_version 140331 (0.00089) [2022-07-09 07:05:26,706][25689] Fps is (10 sec: 5827.6, 60 sec: 5706.4, 300 sec: 5732.6). Total num frames: 143708160. Throughput: 0: 5102.8. Samples: 143702168. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:26,706][25689] Avg episode reward: [(0, '-57.740')] [2022-07-09 07:05:26,741][26022] Updated weights on worker 0-0, policy_version 140341 (0.00085) [2022-07-09 07:05:28,509][26022] Updated weights on worker 0-0, policy_version 140351 (0.00094) [2022-07-09 07:05:30,478][26022] Updated weights on worker 0-0, policy_version 140361 (0.00081) [2022-07-09 07:05:31,755][25689] Fps is (10 sec: 5850.9, 60 sec: 5735.7, 300 sec: 5736.3). Total num frames: 143737856. Throughput: 0: 5963.0. Samples: 143736590. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:31,757][25689] Avg episode reward: [(0, '-58.040')] [2022-07-09 07:05:32,046][26022] Updated weights on worker 0-0, policy_version 140371 (0.00079) [2022-07-09 07:05:34,032][26022] Updated weights on worker 0-0, policy_version 140381 (0.00088) [2022-07-09 07:05:35,552][26022] Updated weights on worker 0-0, policy_version 140391 (0.00088) [2022-07-09 07:05:36,835][25689] Fps is (10 sec: 5762.2, 60 sec: 5719.5, 300 sec: 5735.1). Total num frames: 143766528. Throughput: 0: 5981.6. Samples: 143771286. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:36,836][25689] Avg episode reward: [(0, '-58.341')] [2022-07-09 07:05:37,665][26022] Updated weights on worker 0-0, policy_version 140401 (0.00093) [2022-07-09 07:05:39,173][26022] Updated weights on worker 0-0, policy_version 140411 (0.00051) [2022-07-09 07:05:41,115][26022] Updated weights on worker 0-0, policy_version 140421 (0.00094) [2022-07-09 07:05:41,847][25689] Fps is (10 sec: 5580.7, 60 sec: 5702.2, 300 sec: 5728.3). Total num frames: 143794176. Throughput: 0: 5119.0. Samples: 143788370. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:41,847][25689] Avg episode reward: [(0, '-58.241')] [2022-07-09 07:05:42,818][26022] Updated weights on worker 0-0, policy_version 140431 (0.00091) [2022-07-09 07:05:44,819][26022] Updated weights on worker 0-0, policy_version 140441 (0.00086) [2022-07-09 07:05:46,226][26022] Updated weights on worker 0-0, policy_version 140451 (0.00090) [2022-07-09 07:05:46,913][25689] Fps is (10 sec: 5792.0, 60 sec: 5721.4, 300 sec: 5730.6). Total num frames: 143824896. Throughput: 0: 5977.5. Samples: 143822954. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:46,913][25689] Avg episode reward: [(0, '-58.397')] [2022-07-09 07:05:48,199][26022] Updated weights on worker 0-0, policy_version 140461 (0.00086) [2022-07-09 07:05:49,818][26022] Updated weights on worker 0-0, policy_version 140471 (0.00084) [2022-07-09 07:05:51,831][26022] Updated weights on worker 0-0, policy_version 140481 (0.00089) [2022-07-09 07:05:51,932][25689] Fps is (10 sec: 5889.2, 60 sec: 5707.3, 300 sec: 5732.4). Total num frames: 143853568. Throughput: 0: 6007.4. Samples: 143857800. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:51,932][25689] Avg episode reward: [(0, '-58.356')] [2022-07-09 07:05:53,439][26022] Updated weights on worker 0-0, policy_version 140491 (0.00097) [2022-07-09 07:05:55,076][26022] Updated weights on worker 0-0, policy_version 140501 (0.00088) [2022-07-09 07:05:56,896][26022] Updated weights on worker 0-0, policy_version 140511 (0.00111) [2022-07-09 07:05:56,982][25689] Fps is (10 sec: 5796.6, 60 sec: 5727.3, 300 sec: 5735.5). Total num frames: 143883264. Throughput: 0: 5158.1. Samples: 143875206. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:05:56,983][25689] Avg episode reward: [(0, '-57.928')] [2022-07-09 07:05:58,998][26022] Updated weights on worker 0-0, policy_version 140521 (0.00096) [2022-07-09 07:06:00,467][26022] Updated weights on worker 0-0, policy_version 140531 (0.00079) [2022-07-09 07:06:02,020][25689] Fps is (10 sec: 5684.0, 60 sec: 5707.0, 300 sec: 5735.1). Total num frames: 143910912. Throughput: 0: 6029.0. Samples: 143909996. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:02,021][25689] Avg episode reward: [(0, '-58.582')] [2022-07-09 07:06:02,808][26022] Updated weights on worker 0-0, policy_version 140541 (0.00090) [2022-07-09 07:06:04,422][26022] Updated weights on worker 0-0, policy_version 140551 (0.00085) [2022-07-09 07:06:06,192][26022] Updated weights on worker 0-0, policy_version 140561 (0.00088) [2022-07-09 07:06:07,039][25689] Fps is (10 sec: 5701.9, 60 sec: 5739.9, 300 sec: 5738.3). Total num frames: 143940608. Throughput: 0: 5965.2. Samples: 143943012. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:07,040][25689] Avg episode reward: [(0, '-58.619')] [2022-07-09 07:06:08,004][26022] Updated weights on worker 0-0, policy_version 140571 (0.00085) [2022-07-09 07:06:09,517][26022] Updated weights on worker 0-0, policy_version 140581 (0.00086) [2022-07-09 07:06:11,520][26022] Updated weights on worker 0-0, policy_version 140591 (0.00092) [2022-07-09 07:06:12,041][25689] Fps is (10 sec: 5722.4, 60 sec: 5727.3, 300 sec: 5733.5). Total num frames: 143968256. Throughput: 0: 5116.5. Samples: 143960690. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:12,043][25689] Avg episode reward: [(0, '-59.103')] [2022-07-09 07:06:13,282][26022] Updated weights on worker 0-0, policy_version 140601 (0.00083) [2022-07-09 07:06:14,945][26022] Updated weights on worker 0-0, policy_version 140611 (0.00086) [2022-07-09 07:06:16,686][26022] Updated weights on worker 0-0, policy_version 140621 (0.00084) [2022-07-09 07:06:17,167][25689] Fps is (10 sec: 5560.6, 60 sec: 5740.2, 300 sec: 5735.3). Total num frames: 143996928. Throughput: 0: 5945.4. Samples: 143995216. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:17,169][25689] Avg episode reward: [(0, '-58.620')] [2022-07-09 07:06:18,461][26022] Updated weights on worker 0-0, policy_version 140631 (0.00093) [2022-07-09 07:06:20,364][26022] Updated weights on worker 0-0, policy_version 140641 (0.00082) [2022-07-09 07:06:22,034][26022] Updated weights on worker 0-0, policy_version 140651 (0.00083) [2022-07-09 07:06:22,233][25689] Fps is (10 sec: 5827.8, 60 sec: 5759.4, 300 sec: 5734.1). Total num frames: 144027648. Throughput: 0: 5952.3. Samples: 144030304. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:22,233][25689] Avg episode reward: [(0, '-58.561')] [2022-07-09 07:06:23,843][26022] Updated weights on worker 0-0, policy_version 140661 (0.00084) [2022-07-09 07:06:25,512][26022] Updated weights on worker 0-0, policy_version 140671 (0.00091) [2022-07-09 07:06:27,331][25689] Fps is (10 sec: 5843.8, 60 sec: 5742.9, 300 sec: 5732.9). Total num frames: 144056320. Throughput: 0: 6021.2. Samples: 144065192. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:27,331][25689] Avg episode reward: [(0, '-57.835')] [2022-07-09 07:06:27,420][26022] Updated weights on worker 0-0, policy_version 140681 (0.00090) [2022-07-09 07:06:29,169][26022] Updated weights on worker 0-0, policy_version 140691 (0.00081) [2022-07-09 07:06:30,880][26022] Updated weights on worker 0-0, policy_version 140701 (0.00093) [2022-07-09 07:06:32,339][25689] Fps is (10 sec: 5775.3, 60 sec: 5746.7, 300 sec: 5733.7). Total num frames: 144086016. Throughput: 0: 6009.4. Samples: 144082666. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:32,340][25689] Avg episode reward: [(0, '-57.479')] [2022-07-09 07:06:32,660][26022] Updated weights on worker 0-0, policy_version 140711 (0.00088) [2022-07-09 07:06:34,441][26022] Updated weights on worker 0-0, policy_version 140721 (0.00084) [2022-07-09 07:06:34,873][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:06:34,890][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000140723_144100352.pth [2022-07-09 07:06:34,891][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000138704_142032896.pth [2022-07-09 07:06:36,096][26022] Updated weights on worker 0-0, policy_version 140731 (0.00090) [2022-07-09 07:06:37,410][25689] Fps is (10 sec: 5689.0, 60 sec: 5730.7, 300 sec: 5725.7). Total num frames: 144113664. Throughput: 0: 6041.4. Samples: 144117512. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:37,411][25689] Avg episode reward: [(0, '-57.889')] [2022-07-09 07:06:37,994][26022] Updated weights on worker 0-0, policy_version 140741 (0.00085) [2022-07-09 07:06:39,688][26022] Updated weights on worker 0-0, policy_version 140751 (0.00085) [2022-07-09 07:06:41,468][26022] Updated weights on worker 0-0, policy_version 140761 (0.00093) [2022-07-09 07:06:42,438][25689] Fps is (10 sec: 5779.9, 60 sec: 5779.9, 300 sec: 5739.3). Total num frames: 144144384. Throughput: 0: 6034.7. Samples: 144152234. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:42,438][25689] Avg episode reward: [(0, '-57.876')] [2022-07-09 07:06:43,166][26022] Updated weights on worker 0-0, policy_version 140771 (0.00092) [2022-07-09 07:06:44,974][26022] Updated weights on worker 0-0, policy_version 140781 (0.00084) [2022-07-09 07:06:46,837][26022] Updated weights on worker 0-0, policy_version 140791 (0.00093) [2022-07-09 07:06:47,443][25689] Fps is (10 sec: 6022.2, 60 sec: 5768.8, 300 sec: 5742.8). Total num frames: 144174080. Throughput: 0: 5184.4. Samples: 144169460. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:47,443][25689] Avg episode reward: [(0, '-58.795')] [2022-07-09 07:06:48,670][26022] Updated weights on worker 0-0, policy_version 140801 (0.00090) [2022-07-09 07:06:50,445][26022] Updated weights on worker 0-0, policy_version 140811 (0.00093) [2022-07-09 07:06:52,141][26022] Updated weights on worker 0-0, policy_version 140821 (0.00088) [2022-07-09 07:06:52,460][25689] Fps is (10 sec: 5823.9, 60 sec: 5769.0, 300 sec: 5742.2). Total num frames: 144202752. Throughput: 0: 6033.5. Samples: 144204064. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:52,460][25689] Avg episode reward: [(0, '-60.091')] [2022-07-09 07:06:53,900][26022] Updated weights on worker 0-0, policy_version 140831 (0.00098) [2022-07-09 07:06:55,455][26022] Updated weights on worker 0-0, policy_version 140841 (0.00094) [2022-07-09 07:06:57,511][26022] Updated weights on worker 0-0, policy_version 140851 (0.00086) [2022-07-09 07:06:57,525][25689] Fps is (10 sec: 5687.3, 60 sec: 5750.6, 300 sec: 5742.4). Total num frames: 144231424. Throughput: 0: 6038.1. Samples: 144238968. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:06:57,526][25689] Avg episode reward: [(0, '-60.262')] [2022-07-09 07:06:59,214][26022] Updated weights on worker 0-0, policy_version 140861 (0.00086) [2022-07-09 07:07:00,976][26022] Updated weights on worker 0-0, policy_version 140871 (0.00085) [2022-07-09 07:07:02,555][25689] Fps is (10 sec: 5477.1, 60 sec: 5734.5, 300 sec: 5742.3). Total num frames: 144258048. Throughput: 0: 5188.6. Samples: 144256618. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:07:02,556][25689] Avg episode reward: [(0, '-59.714')] [2022-07-09 07:07:03,034][26022] Updated weights on worker 0-0, policy_version 140881 (0.00080) [2022-07-09 07:07:04,870][26022] Updated weights on worker 0-0, policy_version 140891 (0.00105) [2022-07-09 07:07:06,559][26022] Updated weights on worker 0-0, policy_version 140901 (0.00086) [2022-07-09 07:07:07,570][25689] Fps is (10 sec: 5708.7, 60 sec: 5751.7, 300 sec: 5749.5). Total num frames: 144288768. Throughput: 0: 5960.5. Samples: 144289432. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:07:07,571][25689] Avg episode reward: [(0, '-59.831')] [2022-07-09 07:07:08,341][26022] Updated weights on worker 0-0, policy_version 140911 (0.00096) [2022-07-09 07:07:10,011][26022] Updated weights on worker 0-0, policy_version 140921 (0.00089) [2022-07-09 07:07:12,050][26022] Updated weights on worker 0-0, policy_version 140931 (0.00093) [2022-07-09 07:07:12,589][25689] Fps is (10 sec: 5919.4, 60 sec: 5767.1, 300 sec: 5744.1). Total num frames: 144317440. Throughput: 0: 5985.6. Samples: 144324550. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:07:12,589][25689] Avg episode reward: [(0, '-59.538')] [2022-07-09 07:07:13,612][26022] Updated weights on worker 0-0, policy_version 140941 (0.00079) [2022-07-09 07:07:15,620][26022] Updated weights on worker 0-0, policy_version 140951 (0.00095) [2022-07-09 07:07:17,151][26022] Updated weights on worker 0-0, policy_version 140961 (0.00072) [2022-07-09 07:07:17,630][25689] Fps is (10 sec: 5802.4, 60 sec: 5792.2, 300 sec: 5748.1). Total num frames: 144347136. Throughput: 0: 5110.6. Samples: 144341712. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:07:17,630][25689] Avg episode reward: [(0, '-58.539')] [2022-07-09 07:07:19,095][26022] Updated weights on worker 0-0, policy_version 140971 (0.00082) [2022-07-09 07:07:20,645][26022] Updated weights on worker 0-0, policy_version 140981 (0.00084) [2022-07-09 07:07:22,526][26022] Updated weights on worker 0-0, policy_version 140991 (0.00094) [2022-07-09 07:07:22,642][25689] Fps is (10 sec: 5704.0, 60 sec: 5746.4, 300 sec: 5745.9). Total num frames: 144374784. Throughput: 0: 5972.3. Samples: 144376582. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:07:22,643][25689] Avg episode reward: [(0, '-57.658')] [2022-07-09 07:07:24,323][26022] Updated weights on worker 0-0, policy_version 141001 (0.00085) [2022-07-09 07:07:26,178][26022] Updated weights on worker 0-0, policy_version 141011 (0.00080) [2022-07-09 07:07:27,678][25689] Fps is (10 sec: 5706.7, 60 sec: 5769.3, 300 sec: 5745.7). Total num frames: 144404480. Throughput: 0: 6076.3. Samples: 144411614. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 07:07:27,679][25689] Avg episode reward: [(0, '-57.937')] [2022-07-09 07:07:27,726][26022] Updated weights on worker 0-0, policy_version 141021 (0.00086) [2022-07-09 07:07:29,533][26022] Updated weights on worker 0-0, policy_version 141031 (0.00084) [2022-07-09 07:07:31,353][26022] Updated weights on worker 0-0, policy_version 141041 (0.00086) [2022-07-09 07:07:32,688][25689] Fps is (10 sec: 5912.1, 60 sec: 5769.1, 300 sec: 5746.3). Total num frames: 144434176. Throughput: 0: 5203.3. Samples: 144429132. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:07:32,689][25689] Avg episode reward: [(0, '-57.129')] [2022-07-09 07:07:33,075][26022] Updated weights on worker 0-0, policy_version 141051 (0.00082) [2022-07-09 07:07:34,903][26022] Updated weights on worker 0-0, policy_version 141061 (0.00089) [2022-07-09 07:07:36,639][26022] Updated weights on worker 0-0, policy_version 141071 (0.00089) [2022-07-09 07:07:37,820][25689] Fps is (10 sec: 5755.4, 60 sec: 5780.3, 300 sec: 5743.8). Total num frames: 144462848. Throughput: 0: 6045.0. Samples: 144463760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:07:37,820][25689] Avg episode reward: [(0, '-57.881')] [2022-07-09 07:07:38,327][26022] Updated weights on worker 0-0, policy_version 141081 (0.00089) [2022-07-09 07:07:40,127][26022] Updated weights on worker 0-0, policy_version 141091 (0.00084) [2022-07-09 07:07:41,778][26022] Updated weights on worker 0-0, policy_version 141101 (0.00099) [2022-07-09 07:07:42,915][25689] Fps is (10 sec: 5607.3, 60 sec: 5739.9, 300 sec: 5742.0). Total num frames: 144491520. Throughput: 0: 6026.8. Samples: 144498760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:07:42,915][25689] Avg episode reward: [(0, '-57.770')] [2022-07-09 07:07:43,631][26022] Updated weights on worker 0-0, policy_version 141111 (0.00094) [2022-07-09 07:07:45,646][26022] Updated weights on worker 0-0, policy_version 141121 (0.00105) [2022-07-09 07:07:47,116][26022] Updated weights on worker 0-0, policy_version 141131 (0.00084) [2022-07-09 07:07:47,968][25689] Fps is (10 sec: 5852.7, 60 sec: 5752.3, 300 sec: 5748.0). Total num frames: 144522240. Throughput: 0: 5149.2. Samples: 144516088. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:07:47,968][25689] Avg episode reward: [(0, '-58.613')] [2022-07-09 07:07:49,099][26022] Updated weights on worker 0-0, policy_version 141141 (0.00081) [2022-07-09 07:07:50,510][26022] Updated weights on worker 0-0, policy_version 141151 (0.00385) [2022-07-09 07:07:52,625][26022] Updated weights on worker 0-0, policy_version 141161 (0.00091) [2022-07-09 07:07:52,969][25689] Fps is (10 sec: 6008.9, 60 sec: 5770.7, 300 sec: 5742.6). Total num frames: 144551936. Throughput: 0: 5987.8. Samples: 144550572. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:07:52,970][25689] Avg episode reward: [(0, '-59.227')] [2022-07-09 07:07:54,456][26022] Updated weights on worker 0-0, policy_version 141171 (0.00091) [2022-07-09 07:07:55,891][26022] Updated weights on worker 0-0, policy_version 141181 (0.00085) [2022-07-09 07:07:58,083][25689] Fps is (10 sec: 5567.9, 60 sec: 5732.4, 300 sec: 5740.6). Total num frames: 144578560. Throughput: 0: 6003.5. Samples: 144585408. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:07:58,083][25689] Avg episode reward: [(0, '-59.171')] [2022-07-09 07:07:58,134][26022] Updated weights on worker 0-0, policy_version 141191 (0.00092) [2022-07-09 07:07:59,376][26022] Updated weights on worker 0-0, policy_version 141201 (0.00085) [2022-07-09 07:08:01,516][26022] Updated weights on worker 0-0, policy_version 141211 (0.00083) [2022-07-09 07:08:03,091][25689] Fps is (10 sec: 5564.2, 60 sec: 5785.2, 300 sec: 5750.9). Total num frames: 144608256. Throughput: 0: 5159.5. Samples: 144602860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:03,093][25689] Avg episode reward: [(0, '-58.743')] [2022-07-09 07:08:03,296][26022] Updated weights on worker 0-0, policy_version 141221 (0.00086) [2022-07-09 07:08:05,291][26022] Updated weights on worker 0-0, policy_version 141231 (0.00086) [2022-07-09 07:08:06,820][26022] Updated weights on worker 0-0, policy_version 141241 (0.00085) [2022-07-09 07:08:08,158][25689] Fps is (10 sec: 5793.0, 60 sec: 5746.4, 300 sec: 5746.5). Total num frames: 144636928. Throughput: 0: 5931.9. Samples: 144635858. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:08,159][25689] Avg episode reward: [(0, '-58.308')] [2022-07-09 07:08:08,803][26022] Updated weights on worker 0-0, policy_version 141251 (0.00085) [2022-07-09 07:08:10,458][26022] Updated weights on worker 0-0, policy_version 141261 (0.00083) [2022-07-09 07:08:12,503][26022] Updated weights on worker 0-0, policy_version 141271 (0.00087) [2022-07-09 07:08:13,255][25689] Fps is (10 sec: 5843.2, 60 sec: 5772.7, 300 sec: 5749.3). Total num frames: 144667648. Throughput: 0: 5925.4. Samples: 144670774. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:13,257][25689] Avg episode reward: [(0, '-58.789')] [2022-07-09 07:08:14,008][26022] Updated weights on worker 0-0, policy_version 141281 (0.00087) [2022-07-09 07:08:16,009][26022] Updated weights on worker 0-0, policy_version 141291 (0.00089) [2022-07-09 07:08:17,579][26022] Updated weights on worker 0-0, policy_version 141301 (0.00084) [2022-07-09 07:08:18,305][25689] Fps is (10 sec: 5752.3, 60 sec: 5738.0, 300 sec: 5749.7). Total num frames: 144695296. Throughput: 0: 5935.4. Samples: 144705438. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:18,306][25689] Avg episode reward: [(0, '-57.741')] [2022-07-09 07:08:19,272][26022] Updated weights on worker 0-0, policy_version 141311 (0.00098) [2022-07-09 07:08:21,139][26022] Updated weights on worker 0-0, policy_version 141321 (0.00086) [2022-07-09 07:08:22,723][26022] Updated weights on worker 0-0, policy_version 141331 (0.00083) [2022-07-09 07:08:23,368][25689] Fps is (10 sec: 5670.6, 60 sec: 5767.1, 300 sec: 5745.3). Total num frames: 144724992. Throughput: 0: 5936.5. Samples: 144723234. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:23,368][25689] Avg episode reward: [(0, '-58.329')] [2022-07-09 07:08:24,569][26022] Updated weights on worker 0-0, policy_version 141341 (0.00087) [2022-07-09 07:08:26,403][26022] Updated weights on worker 0-0, policy_version 141351 (0.00082) [2022-07-09 07:08:28,126][26022] Updated weights on worker 0-0, policy_version 141361 (0.00077) [2022-07-09 07:08:28,402][25689] Fps is (10 sec: 5983.6, 60 sec: 5784.1, 300 sec: 5751.6). Total num frames: 144755712. Throughput: 0: 6050.0. Samples: 144758334. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:28,403][25689] Avg episode reward: [(0, '-59.057')] [2022-07-09 07:08:29,874][26022] Updated weights on worker 0-0, policy_version 141371 (0.00087) [2022-07-09 07:08:31,545][26022] Updated weights on worker 0-0, policy_version 141381 (0.00726) [2022-07-09 07:08:33,281][26022] Updated weights on worker 0-0, policy_version 141391 (0.00086) [2022-07-09 07:08:33,443][25689] Fps is (10 sec: 5996.2, 60 sec: 5781.1, 300 sec: 5748.7). Total num frames: 144785408. Throughput: 0: 6081.3. Samples: 144793544. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:33,444][25689] Avg episode reward: [(0, '-58.506')] [2022-07-09 07:08:35,096][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:08:35,108][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000141400_144793600.pth [2022-07-09 07:08:35,109][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000139375_142720000.pth [2022-07-09 07:08:35,175][26022] Updated weights on worker 0-0, policy_version 141401 (0.00083) [2022-07-09 07:08:36,728][26022] Updated weights on worker 0-0, policy_version 141411 (0.00086) [2022-07-09 07:08:38,487][25689] Fps is (10 sec: 5787.7, 60 sec: 5789.5, 300 sec: 5752.5). Total num frames: 144814080. Throughput: 0: 5228.9. Samples: 144810966. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:38,488][25689] Avg episode reward: [(0, '-58.731')] [2022-07-09 07:08:38,624][26022] Updated weights on worker 0-0, policy_version 141421 (0.00092) [2022-07-09 07:08:40,361][26022] Updated weights on worker 0-0, policy_version 141431 (0.00086) [2022-07-09 07:08:42,031][26022] Updated weights on worker 0-0, policy_version 141441 (0.00085) [2022-07-09 07:08:43,521][25689] Fps is (10 sec: 5792.1, 60 sec: 5812.2, 300 sec: 5752.6). Total num frames: 144843776. Throughput: 0: 6095.7. Samples: 144846080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:43,521][25689] Avg episode reward: [(0, '-58.420')] [2022-07-09 07:08:44,120][26022] Updated weights on worker 0-0, policy_version 141451 (0.00084) [2022-07-09 07:08:45,527][26022] Updated weights on worker 0-0, policy_version 141461 (0.00088) [2022-07-09 07:08:47,503][26022] Updated weights on worker 0-0, policy_version 141471 (0.00085) [2022-07-09 07:08:48,551][25689] Fps is (10 sec: 5800.0, 60 sec: 5780.6, 300 sec: 5753.2). Total num frames: 144872448. Throughput: 0: 6083.9. Samples: 144880914. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:48,551][25689] Avg episode reward: [(0, '-58.629')] [2022-07-09 07:08:49,150][26022] Updated weights on worker 0-0, policy_version 141481 (0.00079) [2022-07-09 07:08:51,069][26022] Updated weights on worker 0-0, policy_version 141491 (0.00094) [2022-07-09 07:08:52,788][26022] Updated weights on worker 0-0, policy_version 141501 (0.00093) [2022-07-09 07:08:53,560][25689] Fps is (10 sec: 5712.0, 60 sec: 5763.0, 300 sec: 5758.0). Total num frames: 144901120. Throughput: 0: 5190.5. Samples: 144897958. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:53,560][25689] Avg episode reward: [(0, '-58.535')] [2022-07-09 07:08:54,430][26022] Updated weights on worker 0-0, policy_version 141511 (0.00084) [2022-07-09 07:08:56,252][26022] Updated weights on worker 0-0, policy_version 141521 (0.00087) [2022-07-09 07:08:58,186][26022] Updated weights on worker 0-0, policy_version 141531 (0.00081) [2022-07-09 07:08:58,646][25689] Fps is (10 sec: 5579.0, 60 sec: 5782.5, 300 sec: 5746.3). Total num frames: 144928768. Throughput: 0: 6049.6. Samples: 144932918. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:08:58,646][25689] Avg episode reward: [(0, '-59.346')] [2022-07-09 07:08:59,746][26022] Updated weights on worker 0-0, policy_version 141541 (0.00090) [2022-07-09 07:09:02,118][26022] Updated weights on worker 0-0, policy_version 141551 (0.00086) [2022-07-09 07:09:03,616][26022] Updated weights on worker 0-0, policy_version 141561 (0.00094) [2022-07-09 07:09:03,667][25689] Fps is (10 sec: 5673.8, 60 sec: 5781.3, 300 sec: 5757.3). Total num frames: 144958464. Throughput: 0: 5943.6. Samples: 144965820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:03,667][25689] Avg episode reward: [(0, '-59.108')] [2022-07-09 07:09:05,640][26022] Updated weights on worker 0-0, policy_version 141571 (0.00095) [2022-07-09 07:09:07,391][26022] Updated weights on worker 0-0, policy_version 141581 (0.00095) [2022-07-09 07:09:08,674][25689] Fps is (10 sec: 5820.6, 60 sec: 5787.1, 300 sec: 5761.5). Total num frames: 144987136. Throughput: 0: 5087.9. Samples: 144983296. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:08,674][25689] Avg episode reward: [(0, '-59.745')] [2022-07-09 07:09:09,191][26022] Updated weights on worker 0-0, policy_version 141591 (0.00082) [2022-07-09 07:09:10,736][26022] Updated weights on worker 0-0, policy_version 141601 (0.00086) [2022-07-09 07:09:12,510][26022] Updated weights on worker 0-0, policy_version 141611 (0.00091) [2022-07-09 07:09:13,701][25689] Fps is (10 sec: 5816.7, 60 sec: 5776.8, 300 sec: 5755.1). Total num frames: 145016832. Throughput: 0: 5984.9. Samples: 145018502. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:13,702][25689] Avg episode reward: [(0, '-59.752')] [2022-07-09 07:09:14,269][26022] Updated weights on worker 0-0, policy_version 141621 (0.00614) [2022-07-09 07:09:16,119][26022] Updated weights on worker 0-0, policy_version 141631 (0.00091) [2022-07-09 07:09:17,763][26022] Updated weights on worker 0-0, policy_version 141641 (0.00090) [2022-07-09 07:09:18,758][25689] Fps is (10 sec: 5788.1, 60 sec: 5793.1, 300 sec: 5754.2). Total num frames: 145045504. Throughput: 0: 5978.0. Samples: 145053146. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:18,758][25689] Avg episode reward: [(0, '-59.828')] [2022-07-09 07:09:19,587][26022] Updated weights on worker 0-0, policy_version 141651 (0.00090) [2022-07-09 07:09:21,401][26022] Updated weights on worker 0-0, policy_version 141661 (0.00091) [2022-07-09 07:09:23,181][26022] Updated weights on worker 0-0, policy_version 141671 (0.00090) [2022-07-09 07:09:23,841][25689] Fps is (10 sec: 5655.6, 60 sec: 5774.2, 300 sec: 5749.5). Total num frames: 145074176. Throughput: 0: 5199.6. Samples: 145070716. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:23,841][25689] Avg episode reward: [(0, '-58.996')] [2022-07-09 07:09:24,732][26022] Updated weights on worker 0-0, policy_version 141681 (0.00084) [2022-07-09 07:09:26,526][26022] Updated weights on worker 0-0, policy_version 141691 (0.00083) [2022-07-09 07:09:28,524][26022] Updated weights on worker 0-0, policy_version 141701 (0.00092) [2022-07-09 07:09:28,851][25689] Fps is (10 sec: 5681.2, 60 sec: 5742.6, 300 sec: 5752.8). Total num frames: 145102848. Throughput: 0: 6091.7. Samples: 145106212. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:28,853][25689] Avg episode reward: [(0, '-58.497')] [2022-07-09 07:09:30,079][26022] Updated weights on worker 0-0, policy_version 141711 (0.00887) [2022-07-09 07:09:32,105][26022] Updated weights on worker 0-0, policy_version 141721 (0.00087) [2022-07-09 07:09:33,387][26022] Updated weights on worker 0-0, policy_version 141731 (0.00090) [2022-07-09 07:09:33,889][25689] Fps is (10 sec: 5910.4, 60 sec: 5759.9, 300 sec: 5757.1). Total num frames: 145133568. Throughput: 0: 6058.1. Samples: 145140804. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-09 07:09:33,890][25689] Avg episode reward: [(0, '-58.714')] [2022-07-09 07:09:35,574][26022] Updated weights on worker 0-0, policy_version 141741 (0.00084) [2022-07-09 07:09:37,125][26022] Updated weights on worker 0-0, policy_version 141751 (0.00086) [2022-07-09 07:09:38,932][25689] Fps is (10 sec: 5891.4, 60 sec: 5759.9, 300 sec: 5756.4). Total num frames: 145162240. Throughput: 0: 5206.7. Samples: 145158190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:09:38,933][25689] Avg episode reward: [(0, '-58.397')] [2022-07-09 07:09:38,958][26022] Updated weights on worker 0-0, policy_version 141761 (0.00079) [2022-07-09 07:09:40,707][26022] Updated weights on worker 0-0, policy_version 141771 (0.00084) [2022-07-09 07:09:42,572][26022] Updated weights on worker 0-0, policy_version 141781 (0.00087) [2022-07-09 07:09:44,018][25689] Fps is (10 sec: 5863.5, 60 sec: 5771.9, 300 sec: 5760.0). Total num frames: 145192960. Throughput: 0: 6070.8. Samples: 145193212. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:09:44,019][25689] Avg episode reward: [(0, '-58.797')] [2022-07-09 07:09:44,147][26022] Updated weights on worker 0-0, policy_version 141791 (0.00081) [2022-07-09 07:09:46,056][26022] Updated weights on worker 0-0, policy_version 141801 (0.00084) [2022-07-09 07:09:47,585][26022] Updated weights on worker 0-0, policy_version 141811 (0.00081) [2022-07-09 07:09:49,032][25689] Fps is (10 sec: 5678.1, 60 sec: 5739.6, 300 sec: 5750.3). Total num frames: 145219584. Throughput: 0: 6041.1. Samples: 145228122. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:09:49,032][25689] Avg episode reward: [(0, '-58.770')] [2022-07-09 07:09:49,540][26022] Updated weights on worker 0-0, policy_version 141821 (0.00084) [2022-07-09 07:09:51,342][26022] Updated weights on worker 0-0, policy_version 141831 (0.00083) [2022-07-09 07:09:53,085][26022] Updated weights on worker 0-0, policy_version 141841 (0.00615) [2022-07-09 07:09:54,116][25689] Fps is (10 sec: 5780.4, 60 sec: 5783.2, 300 sec: 5760.6). Total num frames: 145251328. Throughput: 0: 5168.3. Samples: 145245342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:09:54,124][25689] Avg episode reward: [(0, '-58.235')] [2022-07-09 07:09:54,958][26022] Updated weights on worker 0-0, policy_version 141851 (0.00311) [2022-07-09 07:09:56,590][26022] Updated weights on worker 0-0, policy_version 141861 (0.00089) [2022-07-09 07:09:58,445][26022] Updated weights on worker 0-0, policy_version 141871 (0.00087) [2022-07-09 07:09:59,218][25689] Fps is (10 sec: 5931.0, 60 sec: 5798.6, 300 sec: 5758.7). Total num frames: 145280000. Throughput: 0: 6024.0. Samples: 145280390. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:09:59,218][25689] Avg episode reward: [(0, '-58.620')] [2022-07-09 07:09:59,990][26022] Updated weights on worker 0-0, policy_version 141881 (0.00105) [2022-07-09 07:10:02,251][26022] Updated weights on worker 0-0, policy_version 141891 (0.00090) [2022-07-09 07:10:04,159][26022] Updated weights on worker 0-0, policy_version 141901 (0.00089) [2022-07-09 07:10:04,225][25689] Fps is (10 sec: 5469.7, 60 sec: 5749.1, 300 sec: 5755.3). Total num frames: 145306624. Throughput: 0: 5947.8. Samples: 145313400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:04,226][25689] Avg episode reward: [(0, '-58.169')] [2022-07-09 07:10:05,721][26022] Updated weights on worker 0-0, policy_version 141911 (0.00097) [2022-07-09 07:10:07,614][26022] Updated weights on worker 0-0, policy_version 141921 (0.00089) [2022-07-09 07:10:09,243][25689] Fps is (10 sec: 5617.7, 60 sec: 5765.0, 300 sec: 5759.3). Total num frames: 145336320. Throughput: 0: 5076.1. Samples: 145330720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:09,244][25689] Avg episode reward: [(0, '-57.500')] [2022-07-09 07:10:09,244][26022] Updated weights on worker 0-0, policy_version 141931 (0.00080) [2022-07-09 07:10:11,106][26022] Updated weights on worker 0-0, policy_version 141941 (0.00084) [2022-07-09 07:10:12,833][26022] Updated weights on worker 0-0, policy_version 141951 (0.00083) [2022-07-09 07:10:14,251][25689] Fps is (10 sec: 5822.1, 60 sec: 5750.0, 300 sec: 5764.2). Total num frames: 145364992. Throughput: 0: 5958.4. Samples: 145365314. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:14,251][25689] Avg episode reward: [(0, '-57.512')] [2022-07-09 07:10:14,756][26022] Updated weights on worker 0-0, policy_version 141961 (0.00085) [2022-07-09 07:10:16,304][26022] Updated weights on worker 0-0, policy_version 141971 (0.00085) [2022-07-09 07:10:18,322][26022] Updated weights on worker 0-0, policy_version 141981 (0.00083) [2022-07-09 07:10:19,295][25689] Fps is (10 sec: 5908.7, 60 sec: 5785.0, 300 sec: 5768.6). Total num frames: 145395712. Throughput: 0: 5961.7. Samples: 145400082. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:19,297][25689] Avg episode reward: [(0, '-57.845')] [2022-07-09 07:10:19,894][26022] Updated weights on worker 0-0, policy_version 141991 (0.00088) [2022-07-09 07:10:21,937][26022] Updated weights on worker 0-0, policy_version 142001 (0.00048) [2022-07-09 07:10:23,516][26022] Updated weights on worker 0-0, policy_version 142011 (0.00090) [2022-07-09 07:10:24,300][25689] Fps is (10 sec: 5706.3, 60 sec: 5758.6, 300 sec: 5760.1). Total num frames: 145422336. Throughput: 0: 5163.8. Samples: 145417062. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:24,300][25689] Avg episode reward: [(0, '-58.156')] [2022-07-09 07:10:25,334][26022] Updated weights on worker 0-0, policy_version 142021 (0.00087) [2022-07-09 07:10:27,163][26022] Updated weights on worker 0-0, policy_version 142031 (0.00527) [2022-07-09 07:10:28,959][26022] Updated weights on worker 0-0, policy_version 142041 (0.00080) [2022-07-09 07:10:29,303][25689] Fps is (10 sec: 5627.2, 60 sec: 5776.2, 300 sec: 5761.0). Total num frames: 145452032. Throughput: 0: 6021.3. Samples: 145451508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:29,305][25689] Avg episode reward: [(0, '-58.866')] [2022-07-09 07:10:30,823][26022] Updated weights on worker 0-0, policy_version 142051 (0.00084) [2022-07-09 07:10:32,371][26022] Updated weights on worker 0-0, policy_version 142061 (0.00086) [2022-07-09 07:10:34,318][25689] Fps is (10 sec: 5723.9, 60 sec: 5727.6, 300 sec: 5758.8). Total num frames: 145479680. Throughput: 0: 6051.4. Samples: 145486752. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:34,319][25689] Avg episode reward: [(0, '-58.633')] [2022-07-09 07:10:34,338][26022] Updated weights on worker 0-0, policy_version 142071 (0.00089) [2022-07-09 07:10:35,224][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:10:35,232][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000142077_145486848.pth [2022-07-09 07:10:35,233][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000140051_143412224.pth [2022-07-09 07:10:35,888][26022] Updated weights on worker 0-0, policy_version 142081 (0.00090) [2022-07-09 07:10:37,888][26022] Updated weights on worker 0-0, policy_version 142091 (0.00092) [2022-07-09 07:10:39,385][25689] Fps is (10 sec: 5789.6, 60 sec: 5759.2, 300 sec: 5768.1). Total num frames: 145510400. Throughput: 0: 5178.1. Samples: 145504110. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:39,386][25689] Avg episode reward: [(0, '-58.357')] [2022-07-09 07:10:39,432][26022] Updated weights on worker 0-0, policy_version 142101 (0.00086) [2022-07-09 07:10:41,300][26022] Updated weights on worker 0-0, policy_version 142111 (0.00088) [2022-07-09 07:10:42,939][26022] Updated weights on worker 0-0, policy_version 142121 (0.00092) [2022-07-09 07:10:44,416][25689] Fps is (10 sec: 5881.6, 60 sec: 5730.5, 300 sec: 5761.9). Total num frames: 145539072. Throughput: 0: 6079.3. Samples: 145539356. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:44,417][25689] Avg episode reward: [(0, '-58.180')] [2022-07-09 07:10:44,899][26022] Updated weights on worker 0-0, policy_version 142131 (0.00084) [2022-07-09 07:10:46,303][26022] Updated weights on worker 0-0, policy_version 142141 (0.00088) [2022-07-09 07:10:48,254][26022] Updated weights on worker 0-0, policy_version 142151 (0.00088) [2022-07-09 07:10:49,430][25689] Fps is (10 sec: 5912.7, 60 sec: 5798.3, 300 sec: 5768.9). Total num frames: 145569792. Throughput: 0: 6106.9. Samples: 145574418. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:49,430][25689] Avg episode reward: [(0, '-58.146')] [2022-07-09 07:10:49,925][26022] Updated weights on worker 0-0, policy_version 142161 (0.00097) [2022-07-09 07:10:51,662][26022] Updated weights on worker 0-0, policy_version 142171 (0.00098) [2022-07-09 07:10:53,521][26022] Updated weights on worker 0-0, policy_version 142181 (0.00083) [2022-07-09 07:10:54,459][25689] Fps is (10 sec: 5914.2, 60 sec: 5752.7, 300 sec: 5765.8). Total num frames: 145598464. Throughput: 0: 5226.2. Samples: 145592010. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:54,459][25689] Avg episode reward: [(0, '-58.153')] [2022-07-09 07:10:55,450][26022] Updated weights on worker 0-0, policy_version 142191 (0.00087) [2022-07-09 07:10:57,138][26022] Updated weights on worker 0-0, policy_version 142201 (0.00088) [2022-07-09 07:10:59,005][26022] Updated weights on worker 0-0, policy_version 142211 (0.00086) [2022-07-09 07:10:59,497][25689] Fps is (10 sec: 5797.9, 60 sec: 5775.8, 300 sec: 5772.7). Total num frames: 145628160. Throughput: 0: 6077.7. Samples: 145626344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:10:59,498][25689] Avg episode reward: [(0, '-59.032')] [2022-07-09 07:11:00,622][26022] Updated weights on worker 0-0, policy_version 142221 (0.00090) [2022-07-09 07:11:02,851][26022] Updated weights on worker 0-0, policy_version 142231 (0.00087) [2022-07-09 07:11:04,454][26022] Updated weights on worker 0-0, policy_version 142241 (0.00091) [2022-07-09 07:11:04,500][25689] Fps is (10 sec: 5608.9, 60 sec: 5776.2, 300 sec: 5762.7). Total num frames: 145654784. Throughput: 0: 5965.0. Samples: 145659154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:04,501][25689] Avg episode reward: [(0, '-59.296')] [2022-07-09 07:11:06,313][26022] Updated weights on worker 0-0, policy_version 142251 (0.00089) [2022-07-09 07:11:08,133][26022] Updated weights on worker 0-0, policy_version 142261 (0.00085) [2022-07-09 07:11:09,517][25689] Fps is (10 sec: 5416.3, 60 sec: 5742.3, 300 sec: 5762.4). Total num frames: 145682432. Throughput: 0: 5090.7. Samples: 145676676. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:09,518][25689] Avg episode reward: [(0, '-59.025')] [2022-07-09 07:11:09,870][26022] Updated weights on worker 0-0, policy_version 142271 (0.00096) [2022-07-09 07:11:11,577][26022] Updated weights on worker 0-0, policy_version 142281 (0.00081) [2022-07-09 07:11:13,503][26022] Updated weights on worker 0-0, policy_version 142291 (0.00094) [2022-07-09 07:11:14,524][25689] Fps is (10 sec: 5822.8, 60 sec: 5776.4, 300 sec: 5771.6). Total num frames: 145713152. Throughput: 0: 5937.5. Samples: 145711146. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:14,527][25689] Avg episode reward: [(0, '-58.274')] [2022-07-09 07:11:15,226][26022] Updated weights on worker 0-0, policy_version 142301 (0.00084) [2022-07-09 07:11:17,104][26022] Updated weights on worker 0-0, policy_version 142311 (0.00083) [2022-07-09 07:11:18,756][26022] Updated weights on worker 0-0, policy_version 142321 (0.00080) [2022-07-09 07:11:19,608][25689] Fps is (10 sec: 5784.0, 60 sec: 5721.6, 300 sec: 5760.9). Total num frames: 145740800. Throughput: 0: 5935.0. Samples: 145745706. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:19,610][25689] Avg episode reward: [(0, '-58.112')] [2022-07-09 07:11:20,497][26022] Updated weights on worker 0-0, policy_version 142331 (0.00090) [2022-07-09 07:11:22,385][26022] Updated weights on worker 0-0, policy_version 142341 (0.00059) [2022-07-09 07:11:24,024][26022] Updated weights on worker 0-0, policy_version 142351 (0.00088) [2022-07-09 07:11:24,660][25689] Fps is (10 sec: 5657.5, 60 sec: 5768.1, 300 sec: 5765.2). Total num frames: 145770496. Throughput: 0: 5158.8. Samples: 145763156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:24,660][25689] Avg episode reward: [(0, '-57.884')] [2022-07-09 07:11:26,080][26022] Updated weights on worker 0-0, policy_version 142361 (0.00082) [2022-07-09 07:11:27,565][26022] Updated weights on worker 0-0, policy_version 142371 (0.00087) [2022-07-09 07:11:29,449][26022] Updated weights on worker 0-0, policy_version 142381 (0.00088) [2022-07-09 07:11:29,693][25689] Fps is (10 sec: 5787.6, 60 sec: 5748.2, 300 sec: 5761.3). Total num frames: 145799168. Throughput: 0: 5996.6. Samples: 145797666. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:29,694][25689] Avg episode reward: [(0, '-56.907')] [2022-07-09 07:11:31,278][26022] Updated weights on worker 0-0, policy_version 142391 (0.00082) [2022-07-09 07:11:32,885][26022] Updated weights on worker 0-0, policy_version 142401 (0.00087) [2022-07-09 07:11:34,711][25689] Fps is (10 sec: 5807.1, 60 sec: 5781.9, 300 sec: 5769.2). Total num frames: 145828864. Throughput: 0: 6021.2. Samples: 145832696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-09 07:11:34,712][25689] Avg episode reward: [(0, '-56.607')] [2022-07-09 07:11:34,719][26022] Updated weights on worker 0-0, policy_version 142411 (0.00093) [2022-07-09 07:11:36,423][26022] Updated weights on worker 0-0, policy_version 142421 (0.00095) [2022-07-09 07:11:38,241][26022] Updated weights on worker 0-0, policy_version 142431 (0.00089) [2022-07-09 07:11:39,844][25689] Fps is (10 sec: 5850.8, 60 sec: 5758.6, 300 sec: 5763.7). Total num frames: 145858560. Throughput: 0: 6006.0. Samples: 145867244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:11:39,847][25689] Avg episode reward: [(0, '-56.767')] [2022-07-09 07:11:40,214][26022] Updated weights on worker 0-0, policy_version 142441 (0.00087) [2022-07-09 07:11:41,837][26022] Updated weights on worker 0-0, policy_version 142451 (0.00091) [2022-07-09 07:11:43,648][26022] Updated weights on worker 0-0, policy_version 142461 (0.00078) [2022-07-09 07:11:44,868][25689] Fps is (10 sec: 5645.9, 60 sec: 5742.4, 300 sec: 5756.5). Total num frames: 145886208. Throughput: 0: 6012.3. Samples: 145884652. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:11:44,868][25689] Avg episode reward: [(0, '-57.177')] [2022-07-09 07:11:45,305][26022] Updated weights on worker 0-0, policy_version 142471 (0.00087) [2022-07-09 07:11:46,978][26022] Updated weights on worker 0-0, policy_version 142481 (0.00090) [2022-07-09 07:11:48,723][26022] Updated weights on worker 0-0, policy_version 142491 (0.00095) [2022-07-09 07:11:49,874][25689] Fps is (10 sec: 5819.6, 60 sec: 5743.1, 300 sec: 5763.6). Total num frames: 145916928. Throughput: 0: 6057.3. Samples: 145919906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:11:49,875][25689] Avg episode reward: [(0, '-57.358')] [2022-07-09 07:11:50,566][26022] Updated weights on worker 0-0, policy_version 142501 (0.00107) [2022-07-09 07:11:52,455][26022] Updated weights on worker 0-0, policy_version 142511 (0.00084) [2022-07-09 07:11:54,175][26022] Updated weights on worker 0-0, policy_version 142521 (0.00083) [2022-07-09 07:11:54,905][25689] Fps is (10 sec: 5815.3, 60 sec: 5726.0, 300 sec: 5760.8). Total num frames: 145944576. Throughput: 0: 6020.1. Samples: 145954264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:11:54,905][25689] Avg episode reward: [(0, '-57.472')] [2022-07-09 07:11:55,992][26022] Updated weights on worker 0-0, policy_version 142531 (0.00098) [2022-07-09 07:11:57,907][26022] Updated weights on worker 0-0, policy_version 142541 (0.00089) [2022-07-09 07:11:59,434][26022] Updated weights on worker 0-0, policy_version 142551 (0.00085) [2022-07-09 07:12:00,039][25689] Fps is (10 sec: 5742.2, 60 sec: 5733.8, 300 sec: 5772.6). Total num frames: 145975296. Throughput: 0: 5163.7. Samples: 145971522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:00,039][25689] Avg episode reward: [(0, '-57.684')] [2022-07-09 07:12:01,416][26022] Updated weights on worker 0-0, policy_version 142561 (0.00082) [2022-07-09 07:12:03,499][26022] Updated weights on worker 0-0, policy_version 142571 (0.00094) [2022-07-09 07:12:05,066][25689] Fps is (10 sec: 5643.3, 60 sec: 5731.5, 300 sec: 5758.6). Total num frames: 146001920. Throughput: 0: 5921.5. Samples: 146004258. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:05,067][25689] Avg episode reward: [(0, '-58.137')] [2022-07-09 07:12:05,228][26022] Updated weights on worker 0-0, policy_version 142581 (0.00083) [2022-07-09 07:12:06,923][26022] Updated weights on worker 0-0, policy_version 142591 (0.00090) [2022-07-09 07:12:08,739][26022] Updated weights on worker 0-0, policy_version 142601 (0.00088) [2022-07-09 07:12:10,079][25689] Fps is (10 sec: 5507.5, 60 sec: 5748.8, 300 sec: 5758.7). Total num frames: 146030592. Throughput: 0: 5885.8. Samples: 146038828. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:10,080][25689] Avg episode reward: [(0, '-58.831')] [2022-07-09 07:12:10,430][26022] Updated weights on worker 0-0, policy_version 142611 (0.00084) [2022-07-09 07:12:12,370][26022] Updated weights on worker 0-0, policy_version 142621 (0.00089) [2022-07-09 07:12:13,955][26022] Updated weights on worker 0-0, policy_version 142631 (0.00087) [2022-07-09 07:12:15,091][25689] Fps is (10 sec: 5618.2, 60 sec: 5697.6, 300 sec: 5752.4). Total num frames: 146058240. Throughput: 0: 5048.6. Samples: 146056176. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:15,091][25689] Avg episode reward: [(0, '-59.638')] [2022-07-09 07:12:15,796][26022] Updated weights on worker 0-0, policy_version 142641 (0.00091) [2022-07-09 07:12:17,656][26022] Updated weights on worker 0-0, policy_version 142651 (0.00092) [2022-07-09 07:12:19,380][26022] Updated weights on worker 0-0, policy_version 142661 (0.00082) [2022-07-09 07:12:20,137][25689] Fps is (10 sec: 5803.0, 60 sec: 5752.0, 300 sec: 5762.0). Total num frames: 146088960. Throughput: 0: 5932.7. Samples: 146090760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:20,138][25689] Avg episode reward: [(0, '-59.839')] [2022-07-09 07:12:21,292][26022] Updated weights on worker 0-0, policy_version 142671 (0.00086) [2022-07-09 07:12:22,792][26022] Updated weights on worker 0-0, policy_version 142681 (0.00083) [2022-07-09 07:12:24,850][26022] Updated weights on worker 0-0, policy_version 142691 (0.00084) [2022-07-09 07:12:25,162][25689] Fps is (10 sec: 5897.3, 60 sec: 5737.6, 300 sec: 5758.8). Total num frames: 146117632. Throughput: 0: 6012.4. Samples: 146125082. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:25,162][25689] Avg episode reward: [(0, '-60.212')] [2022-07-09 07:12:26,565][26022] Updated weights on worker 0-0, policy_version 142701 (0.00094) [2022-07-09 07:12:28,384][26022] Updated weights on worker 0-0, policy_version 142711 (0.00083) [2022-07-09 07:12:30,012][26022] Updated weights on worker 0-0, policy_version 142721 (0.00094) [2022-07-09 07:12:30,199][25689] Fps is (10 sec: 5699.3, 60 sec: 5737.3, 300 sec: 5754.8). Total num frames: 146146304. Throughput: 0: 5150.7. Samples: 146142458. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:30,199][25689] Avg episode reward: [(0, '-60.176')] [2022-07-09 07:12:31,944][26022] Updated weights on worker 0-0, policy_version 142731 (0.00089) [2022-07-09 07:12:33,600][26022] Updated weights on worker 0-0, policy_version 142741 (0.00090) [2022-07-09 07:12:35,218][25689] Fps is (10 sec: 5804.3, 60 sec: 5737.1, 300 sec: 5760.5). Total num frames: 146176000. Throughput: 0: 6018.2. Samples: 146177304. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:35,219][25689] Avg episode reward: [(0, '-59.878')] [2022-07-09 07:12:35,378][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:12:35,390][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000142751_146177024.pth [2022-07-09 07:12:35,390][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000140723_144100352.pth [2022-07-09 07:12:35,399][26022] Updated weights on worker 0-0, policy_version 142751 (0.00086) [2022-07-09 07:12:37,067][26022] Updated weights on worker 0-0, policy_version 142761 (0.00121) [2022-07-09 07:12:38,938][26022] Updated weights on worker 0-0, policy_version 142771 (0.00086) [2022-07-09 07:12:40,259][25689] Fps is (10 sec: 5801.8, 60 sec: 5729.0, 300 sec: 5761.5). Total num frames: 146204672. Throughput: 0: 6035.1. Samples: 146212198. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:40,259][25689] Avg episode reward: [(0, '-59.181')] [2022-07-09 07:12:40,614][26022] Updated weights on worker 0-0, policy_version 142781 (0.00085) [2022-07-09 07:12:42,393][26022] Updated weights on worker 0-0, policy_version 142791 (0.00095) [2022-07-09 07:12:44,306][26022] Updated weights on worker 0-0, policy_version 142801 (0.00079) [2022-07-09 07:12:45,269][25689] Fps is (10 sec: 5807.2, 60 sec: 5764.1, 300 sec: 5758.9). Total num frames: 146234368. Throughput: 0: 5213.5. Samples: 146229910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:45,269][25689] Avg episode reward: [(0, '-57.406')] [2022-07-09 07:12:45,937][26022] Updated weights on worker 0-0, policy_version 142811 (0.00088) [2022-07-09 07:12:47,613][26022] Updated weights on worker 0-0, policy_version 142821 (0.00084) [2022-07-09 07:12:49,366][26022] Updated weights on worker 0-0, policy_version 142831 (0.00078) [2022-07-09 07:12:50,297][25689] Fps is (10 sec: 5916.6, 60 sec: 5745.1, 300 sec: 5758.4). Total num frames: 146264064. Throughput: 0: 6108.8. Samples: 146265236. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:50,298][25689] Avg episode reward: [(0, '-57.675')] [2022-07-09 07:12:51,103][26022] Updated weights on worker 0-0, policy_version 142841 (0.00084) [2022-07-09 07:12:52,958][26022] Updated weights on worker 0-0, policy_version 142851 (0.00087) [2022-07-09 07:12:54,843][26022] Updated weights on worker 0-0, policy_version 142861 (0.00081) [2022-07-09 07:12:55,326][25689] Fps is (10 sec: 5701.9, 60 sec: 5745.3, 300 sec: 5763.5). Total num frames: 146291712. Throughput: 0: 6113.7. Samples: 146300238. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:12:55,328][25689] Avg episode reward: [(0, '-56.854')] [2022-07-09 07:12:56,466][26022] Updated weights on worker 0-0, policy_version 142871 (0.00086) [2022-07-09 07:12:58,240][26022] Updated weights on worker 0-0, policy_version 142881 (0.00092) [2022-07-09 07:12:59,975][26022] Updated weights on worker 0-0, policy_version 142891 (0.00100) [2022-07-09 07:13:00,415][25689] Fps is (10 sec: 5769.0, 60 sec: 5749.6, 300 sec: 5765.4). Total num frames: 146322432. Throughput: 0: 5222.7. Samples: 146317466. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:00,415][25689] Avg episode reward: [(0, '-56.723')] [2022-07-09 07:13:01,847][26022] Updated weights on worker 0-0, policy_version 142901 (0.00090) [2022-07-09 07:13:04,167][26022] Updated weights on worker 0-0, policy_version 142911 (0.00076) [2022-07-09 07:13:05,417][25689] Fps is (10 sec: 5783.8, 60 sec: 5768.9, 300 sec: 5763.2). Total num frames: 146350080. Throughput: 0: 5970.6. Samples: 146350208. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:05,418][25689] Avg episode reward: [(0, '-56.472')] [2022-07-09 07:13:05,455][26022] Updated weights on worker 0-0, policy_version 142921 (0.00080) [2022-07-09 07:13:07,552][26022] Updated weights on worker 0-0, policy_version 142931 (0.00076) [2022-07-09 07:13:09,191][26022] Updated weights on worker 0-0, policy_version 142941 (0.00086) [2022-07-09 07:13:10,441][25689] Fps is (10 sec: 5412.8, 60 sec: 5733.9, 300 sec: 5750.8). Total num frames: 146376704. Throughput: 0: 5935.7. Samples: 146384806. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:10,442][25689] Avg episode reward: [(0, '-57.381')] [2022-07-09 07:13:10,953][26022] Updated weights on worker 0-0, policy_version 142951 (0.00080) [2022-07-09 07:13:12,786][26022] Updated weights on worker 0-0, policy_version 142961 (0.00092) [2022-07-09 07:13:14,496][26022] Updated weights on worker 0-0, policy_version 142971 (0.00084) [2022-07-09 07:13:15,470][25689] Fps is (10 sec: 5602.8, 60 sec: 5766.3, 300 sec: 5758.1). Total num frames: 146406400. Throughput: 0: 5071.7. Samples: 146402400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:15,471][25689] Avg episode reward: [(0, '-56.860')] [2022-07-09 07:13:16,276][26022] Updated weights on worker 0-0, policy_version 142981 (0.00095) [2022-07-09 07:13:18,240][26022] Updated weights on worker 0-0, policy_version 142991 (0.00079) [2022-07-09 07:13:19,729][26022] Updated weights on worker 0-0, policy_version 143001 (0.00082) [2022-07-09 07:13:20,577][25689] Fps is (10 sec: 5859.6, 60 sec: 5743.5, 300 sec: 5757.2). Total num frames: 146436096. Throughput: 0: 5940.6. Samples: 146437244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:20,578][25689] Avg episode reward: [(0, '-56.497')] [2022-07-09 07:13:21,765][26022] Updated weights on worker 0-0, policy_version 143011 (0.00087) [2022-07-09 07:13:23,335][26022] Updated weights on worker 0-0, policy_version 143021 (0.00093) [2022-07-09 07:13:25,149][26022] Updated weights on worker 0-0, policy_version 143031 (0.00087) [2022-07-09 07:13:25,639][25689] Fps is (10 sec: 5840.5, 60 sec: 5756.9, 300 sec: 5753.3). Total num frames: 146465792. Throughput: 0: 6017.2. Samples: 146471882. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:25,639][25689] Avg episode reward: [(0, '-56.558')] [2022-07-09 07:13:27,002][26022] Updated weights on worker 0-0, policy_version 143041 (0.00086) [2022-07-09 07:13:28,808][26022] Updated weights on worker 0-0, policy_version 143051 (0.00080) [2022-07-09 07:13:30,559][26022] Updated weights on worker 0-0, policy_version 143061 (0.00098) [2022-07-09 07:13:30,647][25689] Fps is (10 sec: 5796.4, 60 sec: 5759.6, 300 sec: 5750.5). Total num frames: 146494464. Throughput: 0: 5161.6. Samples: 146489098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:30,647][25689] Avg episode reward: [(0, '-56.008')] [2022-07-09 07:13:32,331][26022] Updated weights on worker 0-0, policy_version 143071 (0.00088) [2022-07-09 07:13:34,099][26022] Updated weights on worker 0-0, policy_version 143081 (0.00085) [2022-07-09 07:13:35,660][25689] Fps is (10 sec: 5722.0, 60 sec: 5743.3, 300 sec: 5751.0). Total num frames: 146523136. Throughput: 0: 6022.9. Samples: 146524006. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:35,661][25689] Avg episode reward: [(0, '-57.058')] [2022-07-09 07:13:35,860][26022] Updated weights on worker 0-0, policy_version 143091 (0.00081) [2022-07-09 07:13:37,717][26022] Updated weights on worker 0-0, policy_version 143101 (0.00086) [2022-07-09 07:13:39,405][26022] Updated weights on worker 0-0, policy_version 143111 (0.00088) [2022-07-09 07:13:40,727][25689] Fps is (10 sec: 5790.2, 60 sec: 5757.7, 300 sec: 5750.4). Total num frames: 146552832. Throughput: 0: 6031.4. Samples: 146558778. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:40,735][25689] Avg episode reward: [(0, '-56.610')] [2022-07-09 07:13:41,014][26022] Updated weights on worker 0-0, policy_version 143121 (0.00078) [2022-07-09 07:13:43,028][26022] Updated weights on worker 0-0, policy_version 143131 (0.00079) [2022-07-09 07:13:44,567][26022] Updated weights on worker 0-0, policy_version 143141 (0.00089) [2022-07-09 07:13:45,753][25689] Fps is (10 sec: 5783.0, 60 sec: 5739.3, 300 sec: 5750.5). Total num frames: 146581504. Throughput: 0: 5197.7. Samples: 146576432. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:13:45,754][25689] Avg episode reward: [(0, '-57.016')] [2022-07-09 07:13:46,550][26022] Updated weights on worker 0-0, policy_version 143151 (0.00090) [2022-07-09 07:13:47,928][26022] Updated weights on worker 0-0, policy_version 143161 (0.00086) [2022-07-09 07:13:50,048][26022] Updated weights on worker 0-0, policy_version 143171 (0.00091) [2022-07-09 07:13:50,764][25689] Fps is (10 sec: 5815.4, 60 sec: 5740.9, 300 sec: 5753.9). Total num frames: 146611200. Throughput: 0: 6067.0. Samples: 146611150. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:13:50,765][25689] Avg episode reward: [(0, '-57.112')] [2022-07-09 07:13:51,951][26022] Updated weights on worker 0-0, policy_version 143181 (0.00083) [2022-07-09 07:13:53,508][26022] Updated weights on worker 0-0, policy_version 143191 (0.00092) [2022-07-09 07:13:55,610][26022] Updated weights on worker 0-0, policy_version 143201 (0.00081) [2022-07-09 07:13:55,797][25689] Fps is (10 sec: 5709.3, 60 sec: 5740.5, 300 sec: 5754.9). Total num frames: 146638848. Throughput: 0: 6024.6. Samples: 146645324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:13:55,798][25689] Avg episode reward: [(0, '-56.995')] [2022-07-09 07:13:57,143][26022] Updated weights on worker 0-0, policy_version 143211 (0.00089) [2022-07-09 07:13:59,053][26022] Updated weights on worker 0-0, policy_version 143221 (0.00085) [2022-07-09 07:14:00,751][26022] Updated weights on worker 0-0, policy_version 143231 (0.00087) [2022-07-09 07:14:00,903][25689] Fps is (10 sec: 5655.8, 60 sec: 5721.9, 300 sec: 5753.3). Total num frames: 146668544. Throughput: 0: 5130.7. Samples: 146662294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:00,904][25689] Avg episode reward: [(0, '-57.287')] [2022-07-09 07:14:02,760][26022] Updated weights on worker 0-0, policy_version 143241 (0.00085) [2022-07-09 07:14:04,704][26022] Updated weights on worker 0-0, policy_version 143251 (0.00098) [2022-07-09 07:14:05,941][25689] Fps is (10 sec: 5652.9, 60 sec: 5718.6, 300 sec: 5749.2). Total num frames: 146696192. Throughput: 0: 5882.3. Samples: 146695186. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:05,942][25689] Avg episode reward: [(0, '-56.561')] [2022-07-09 07:14:06,309][26022] Updated weights on worker 0-0, policy_version 143261 (0.00048) [2022-07-09 07:14:08,269][26022] Updated weights on worker 0-0, policy_version 143271 (0.00083) [2022-07-09 07:14:09,888][26022] Updated weights on worker 0-0, policy_version 143281 (0.00091) [2022-07-09 07:14:10,946][25689] Fps is (10 sec: 5607.9, 60 sec: 5754.2, 300 sec: 5746.2). Total num frames: 146724864. Throughput: 0: 5889.6. Samples: 146730014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:10,947][25689] Avg episode reward: [(0, '-56.725')] [2022-07-09 07:14:11,812][26022] Updated weights on worker 0-0, policy_version 143291 (0.00088) [2022-07-09 07:14:13,500][26022] Updated weights on worker 0-0, policy_version 143301 (0.00096) [2022-07-09 07:14:15,152][26022] Updated weights on worker 0-0, policy_version 143311 (0.00087) [2022-07-09 07:14:16,000][25689] Fps is (10 sec: 5904.8, 60 sec: 5768.8, 300 sec: 5753.2). Total num frames: 146755584. Throughput: 0: 5067.0. Samples: 146747690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:16,000][25689] Avg episode reward: [(0, '-57.059')] [2022-07-09 07:14:17,002][26022] Updated weights on worker 0-0, policy_version 143321 (0.00083) [2022-07-09 07:14:18,838][26022] Updated weights on worker 0-0, policy_version 143331 (0.00085) [2022-07-09 07:14:20,576][26022] Updated weights on worker 0-0, policy_version 143341 (0.00084) [2022-07-09 07:14:21,068][25689] Fps is (10 sec: 5867.4, 60 sec: 5755.5, 300 sec: 5753.4). Total num frames: 146784256. Throughput: 0: 5956.0. Samples: 146782398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:21,069][25689] Avg episode reward: [(0, '-57.123')] [2022-07-09 07:14:22,307][26022] Updated weights on worker 0-0, policy_version 143351 (0.00085) [2022-07-09 07:14:23,912][26022] Updated weights on worker 0-0, policy_version 143361 (0.00087) [2022-07-09 07:14:26,065][26022] Updated weights on worker 0-0, policy_version 143371 (0.00093) [2022-07-09 07:14:26,152][25689] Fps is (10 sec: 5547.3, 60 sec: 5719.5, 300 sec: 5748.6). Total num frames: 146811904. Throughput: 0: 6037.5. Samples: 146817210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:26,153][25689] Avg episode reward: [(0, '-57.741')] [2022-07-09 07:14:27,443][26022] Updated weights on worker 0-0, policy_version 143381 (0.00085) [2022-07-09 07:14:29,566][26022] Updated weights on worker 0-0, policy_version 143391 (0.00090) [2022-07-09 07:14:31,127][26022] Updated weights on worker 0-0, policy_version 143401 (0.00086) [2022-07-09 07:14:31,238][25689] Fps is (10 sec: 5739.7, 60 sec: 5746.1, 300 sec: 5747.7). Total num frames: 146842624. Throughput: 0: 5975.7. Samples: 146851270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:31,238][25689] Avg episode reward: [(0, '-57.853')] [2022-07-09 07:14:32,995][26022] Updated weights on worker 0-0, policy_version 143411 (0.00086) [2022-07-09 07:14:34,842][26022] Updated weights on worker 0-0, policy_version 143421 (0.00085) [2022-07-09 07:14:35,434][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:14:35,449][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000143424_146866176.pth [2022-07-09 07:14:35,450][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000141400_144793600.pth [2022-07-09 07:14:36,263][25689] Fps is (10 sec: 5975.7, 60 sec: 5761.9, 300 sec: 5751.4). Total num frames: 146872320. Throughput: 0: 5978.1. Samples: 146868824. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:36,263][25689] Avg episode reward: [(0, '-57.125')] [2022-07-09 07:14:36,330][26022] Updated weights on worker 0-0, policy_version 143431 (0.00089) [2022-07-09 07:14:38,613][26022] Updated weights on worker 0-0, policy_version 143441 (0.00090) [2022-07-09 07:14:39,954][26022] Updated weights on worker 0-0, policy_version 143451 (0.00091) [2022-07-09 07:14:41,312][25689] Fps is (10 sec: 5793.7, 60 sec: 5746.7, 300 sec: 5745.3). Total num frames: 146900992. Throughput: 0: 5985.3. Samples: 146903560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:41,312][25689] Avg episode reward: [(0, '-56.475')] [2022-07-09 07:14:41,962][26022] Updated weights on worker 0-0, policy_version 143461 (0.00095) [2022-07-09 07:14:43,409][26022] Updated weights on worker 0-0, policy_version 143471 (0.00089) [2022-07-09 07:14:45,408][26022] Updated weights on worker 0-0, policy_version 143481 (0.00089) [2022-07-09 07:14:46,409][25689] Fps is (10 sec: 5752.7, 60 sec: 5756.8, 300 sec: 5754.0). Total num frames: 146930688. Throughput: 0: 5980.7. Samples: 146938358. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:46,409][25689] Avg episode reward: [(0, '-56.099')] [2022-07-09 07:14:47,242][26022] Updated weights on worker 0-0, policy_version 143491 (0.00084) [2022-07-09 07:14:49,001][26022] Updated weights on worker 0-0, policy_version 143501 (0.00086) [2022-07-09 07:14:50,724][26022] Updated weights on worker 0-0, policy_version 143511 (0.00084) [2022-07-09 07:14:51,417][25689] Fps is (10 sec: 5877.2, 60 sec: 5757.1, 300 sec: 5748.6). Total num frames: 146960384. Throughput: 0: 5184.4. Samples: 146955890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:51,418][25689] Avg episode reward: [(0, '-56.787')] [2022-07-09 07:14:52,495][26022] Updated weights on worker 0-0, policy_version 143521 (0.00085) [2022-07-09 07:14:54,343][26022] Updated weights on worker 0-0, policy_version 143531 (0.00086) [2022-07-09 07:14:56,049][26022] Updated weights on worker 0-0, policy_version 143541 (0.00093) [2022-07-09 07:14:56,449][25689] Fps is (10 sec: 5813.3, 60 sec: 5774.0, 300 sec: 5749.9). Total num frames: 146989056. Throughput: 0: 6040.7. Samples: 146990766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:14:56,451][25689] Avg episode reward: [(0, '-57.051')] [2022-07-09 07:14:57,850][26022] Updated weights on worker 0-0, policy_version 143551 (0.00102) [2022-07-09 07:14:59,471][26022] Updated weights on worker 0-0, policy_version 143561 (0.00092) [2022-07-09 07:15:01,513][25689] Fps is (10 sec: 5477.5, 60 sec: 5727.4, 300 sec: 5748.8). Total num frames: 147015680. Throughput: 0: 6017.3. Samples: 147025114. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:01,520][25689] Avg episode reward: [(0, '-56.974')] [2022-07-09 07:15:01,688][26022] Updated weights on worker 0-0, policy_version 143571 (0.00086) [2022-07-09 07:15:03,529][26022] Updated weights on worker 0-0, policy_version 143581 (0.00080) [2022-07-09 07:15:05,219][26022] Updated weights on worker 0-0, policy_version 143591 (0.00085) [2022-07-09 07:15:06,535][25689] Fps is (10 sec: 5482.8, 60 sec: 5745.9, 300 sec: 5745.3). Total num frames: 147044352. Throughput: 0: 5093.7. Samples: 147040872. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:06,537][25689] Avg episode reward: [(0, '-57.656')] [2022-07-09 07:15:06,917][26022] Updated weights on worker 0-0, policy_version 143601 (0.00087) [2022-07-09 07:15:08,617][26022] Updated weights on worker 0-0, policy_version 143611 (0.00089) [2022-07-09 07:15:10,261][26022] Updated weights on worker 0-0, policy_version 143621 (0.00086) [2022-07-09 07:15:11,554][25689] Fps is (10 sec: 5812.8, 60 sec: 5761.4, 300 sec: 5748.5). Total num frames: 147074048. Throughput: 0: 5949.7. Samples: 147075696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:11,554][25689] Avg episode reward: [(0, '-58.364')] [2022-07-09 07:15:12,421][26022] Updated weights on worker 0-0, policy_version 143631 (0.00085) [2022-07-09 07:15:13,879][26022] Updated weights on worker 0-0, policy_version 143641 (0.00090) [2022-07-09 07:15:15,826][26022] Updated weights on worker 0-0, policy_version 143651 (0.00086) [2022-07-09 07:15:16,575][25689] Fps is (10 sec: 5915.2, 60 sec: 5747.5, 300 sec: 5745.5). Total num frames: 147103744. Throughput: 0: 5973.9. Samples: 147110996. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:16,576][25689] Avg episode reward: [(0, '-57.684')] [2022-07-09 07:15:17,530][26022] Updated weights on worker 0-0, policy_version 143661 (0.00084) [2022-07-09 07:15:19,195][26022] Updated weights on worker 0-0, policy_version 143671 (0.00094) [2022-07-09 07:15:21,349][26022] Updated weights on worker 0-0, policy_version 143681 (0.01015) [2022-07-09 07:15:21,671][25689] Fps is (10 sec: 5668.0, 60 sec: 5728.1, 300 sec: 5747.2). Total num frames: 147131392. Throughput: 0: 5118.8. Samples: 147128302. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:21,671][25689] Avg episode reward: [(0, '-57.419')] [2022-07-09 07:15:22,662][26022] Updated weights on worker 0-0, policy_version 143691 (0.00086) [2022-07-09 07:15:24,758][26022] Updated weights on worker 0-0, policy_version 143701 (0.00083) [2022-07-09 07:15:26,327][26022] Updated weights on worker 0-0, policy_version 143711 (0.00097) [2022-07-09 07:15:26,677][25689] Fps is (10 sec: 5676.8, 60 sec: 5769.3, 300 sec: 5747.2). Total num frames: 147161088. Throughput: 0: 6076.3. Samples: 147163262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:26,677][25689] Avg episode reward: [(0, '-57.798')] [2022-07-09 07:15:28,192][26022] Updated weights on worker 0-0, policy_version 143721 (0.00097) [2022-07-09 07:15:30,108][26022] Updated weights on worker 0-0, policy_version 143731 (0.00087) [2022-07-09 07:15:31,613][26022] Updated weights on worker 0-0, policy_version 143741 (0.00091) [2022-07-09 07:15:31,697][25689] Fps is (10 sec: 5923.8, 60 sec: 5758.6, 300 sec: 5754.0). Total num frames: 147190784. Throughput: 0: 6055.6. Samples: 147197676. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:31,699][25689] Avg episode reward: [(0, '-57.537')] [2022-07-09 07:15:33,657][26022] Updated weights on worker 0-0, policy_version 143751 (0.00093) [2022-07-09 07:15:35,077][26022] Updated weights on worker 0-0, policy_version 143761 (0.00094) [2022-07-09 07:15:36,727][25689] Fps is (10 sec: 5705.6, 60 sec: 5724.2, 300 sec: 5744.3). Total num frames: 147218432. Throughput: 0: 5170.0. Samples: 147215182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:36,728][25689] Avg episode reward: [(0, '-57.474')] [2022-07-09 07:15:37,118][26022] Updated weights on worker 0-0, policy_version 143771 (0.00082) [2022-07-09 07:15:38,720][26022] Updated weights on worker 0-0, policy_version 143781 (0.00094) [2022-07-09 07:15:40,535][26022] Updated weights on worker 0-0, policy_version 143791 (0.00082) [2022-07-09 07:15:41,805][25689] Fps is (10 sec: 5774.4, 60 sec: 5755.4, 300 sec: 5750.3). Total num frames: 147249152. Throughput: 0: 6048.4. Samples: 147250082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:41,806][25689] Avg episode reward: [(0, '-57.199')] [2022-07-09 07:15:42,333][26022] Updated weights on worker 0-0, policy_version 143801 (0.00090) [2022-07-09 07:15:44,120][26022] Updated weights on worker 0-0, policy_version 143811 (0.00086) [2022-07-09 07:15:45,704][26022] Updated weights on worker 0-0, policy_version 143821 (0.00092) [2022-07-09 07:15:46,832][25689] Fps is (10 sec: 5877.4, 60 sec: 5745.0, 300 sec: 5743.2). Total num frames: 147277824. Throughput: 0: 6039.2. Samples: 147284986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:15:46,833][25689] Avg episode reward: [(0, '-57.189')] [2022-07-09 07:15:47,607][26022] Updated weights on worker 0-0, policy_version 143831 (0.00083) [2022-07-09 07:15:49,140][26022] Updated weights on worker 0-0, policy_version 143841 (0.00090) [2022-07-09 07:15:51,090][26022] Updated weights on worker 0-0, policy_version 143851 (0.00094) [2022-07-09 07:15:51,884][25689] Fps is (10 sec: 5791.0, 60 sec: 5740.9, 300 sec: 5746.2). Total num frames: 147307520. Throughput: 0: 5185.1. Samples: 147302348. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:15:51,885][25689] Avg episode reward: [(0, '-55.869')] [2022-07-09 07:15:52,752][26022] Updated weights on worker 0-0, policy_version 143861 (0.00077) [2022-07-09 07:15:54,706][26022] Updated weights on worker 0-0, policy_version 143871 (0.00084) [2022-07-09 07:15:56,282][26022] Updated weights on worker 0-0, policy_version 143881 (0.00086) [2022-07-09 07:15:56,891][25689] Fps is (10 sec: 5701.2, 60 sec: 5726.4, 300 sec: 5739.9). Total num frames: 147335168. Throughput: 0: 6042.9. Samples: 147337028. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:15:56,891][25689] Avg episode reward: [(0, '-56.298')] [2022-07-09 07:15:58,172][26022] Updated weights on worker 0-0, policy_version 143891 (0.00093) [2022-07-09 07:16:00,148][26022] Updated weights on worker 0-0, policy_version 143901 (0.00084) [2022-07-09 07:16:01,721][26022] Updated weights on worker 0-0, policy_version 143911 (0.00087) [2022-07-09 07:16:01,961][25689] Fps is (10 sec: 5792.3, 60 sec: 5793.5, 300 sec: 5752.4). Total num frames: 147365888. Throughput: 0: 6014.2. Samples: 147371302. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:01,961][25689] Avg episode reward: [(0, '-56.093')] [2022-07-09 07:16:03,939][26022] Updated weights on worker 0-0, policy_version 143921 (0.00561) [2022-07-09 07:16:05,745][26022] Updated weights on worker 0-0, policy_version 143931 (0.00086) [2022-07-09 07:16:06,998][25689] Fps is (10 sec: 5673.3, 60 sec: 5758.2, 300 sec: 5748.5). Total num frames: 147392512. Throughput: 0: 5046.6. Samples: 147386756. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:06,999][25689] Avg episode reward: [(0, '-56.456')] [2022-07-09 07:16:07,590][26022] Updated weights on worker 0-0, policy_version 143941 (0.00095) [2022-07-09 07:16:09,334][26022] Updated weights on worker 0-0, policy_version 143951 (0.00089) [2022-07-09 07:16:10,958][26022] Updated weights on worker 0-0, policy_version 143961 (0.00086) [2022-07-09 07:16:12,046][25689] Fps is (10 sec: 5482.9, 60 sec: 5738.5, 300 sec: 5740.9). Total num frames: 147421184. Throughput: 0: 5897.1. Samples: 147421244. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:12,047][25689] Avg episode reward: [(0, '-56.717')] [2022-07-09 07:16:12,963][26022] Updated weights on worker 0-0, policy_version 143971 (0.00094) [2022-07-09 07:16:14,504][26022] Updated weights on worker 0-0, policy_version 143981 (0.00086) [2022-07-09 07:16:16,470][26022] Updated weights on worker 0-0, policy_version 143991 (0.00085) [2022-07-09 07:16:17,050][25689] Fps is (10 sec: 5806.8, 60 sec: 5740.2, 300 sec: 5749.3). Total num frames: 147450880. Throughput: 0: 5912.7. Samples: 147456224. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:17,052][25689] Avg episode reward: [(0, '-57.418')] [2022-07-09 07:16:17,966][26022] Updated weights on worker 0-0, policy_version 144001 (0.00086) [2022-07-09 07:16:19,907][26022] Updated weights on worker 0-0, policy_version 144011 (0.00091) [2022-07-09 07:16:21,625][26022] Updated weights on worker 0-0, policy_version 144021 (0.00088) [2022-07-09 07:16:22,146][25689] Fps is (10 sec: 5778.9, 60 sec: 5757.1, 300 sec: 5745.0). Total num frames: 147479552. Throughput: 0: 5060.3. Samples: 147473446. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:22,147][25689] Avg episode reward: [(0, '-58.110')] [2022-07-09 07:16:23,485][26022] Updated weights on worker 0-0, policy_version 144031 (0.00084) [2022-07-09 07:16:25,249][26022] Updated weights on worker 0-0, policy_version 144041 (0.00089) [2022-07-09 07:16:26,915][26022] Updated weights on worker 0-0, policy_version 144051 (0.00087) [2022-07-09 07:16:27,196][25689] Fps is (10 sec: 5752.6, 60 sec: 5752.8, 300 sec: 5748.1). Total num frames: 147509248. Throughput: 0: 6014.4. Samples: 147508236. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:27,198][25689] Avg episode reward: [(0, '-57.847')] [2022-07-09 07:16:28,890][26022] Updated weights on worker 0-0, policy_version 144061 (0.00092) [2022-07-09 07:16:30,590][26022] Updated weights on worker 0-0, policy_version 144071 (0.00090) [2022-07-09 07:16:32,299][25689] Fps is (10 sec: 5749.3, 60 sec: 5728.2, 300 sec: 5743.0). Total num frames: 147537920. Throughput: 0: 6005.8. Samples: 147542876. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:32,299][25689] Avg episode reward: [(0, '-57.953')] [2022-07-09 07:16:32,334][26022] Updated weights on worker 0-0, policy_version 144081 (0.00094) [2022-07-09 07:16:34,075][26022] Updated weights on worker 0-0, policy_version 144091 (0.00090) [2022-07-09 07:16:35,516][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:16:35,530][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000144099_147557376.pth [2022-07-09 07:16:35,531][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000142077_145486848.pth [2022-07-09 07:16:35,846][26022] Updated weights on worker 0-0, policy_version 144101 (0.00087) [2022-07-09 07:16:37,361][25689] Fps is (10 sec: 5641.3, 60 sec: 5742.0, 300 sec: 5741.0). Total num frames: 147566592. Throughput: 0: 5130.0. Samples: 147560422. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:37,362][25689] Avg episode reward: [(0, '-57.120')] [2022-07-09 07:16:37,583][26022] Updated weights on worker 0-0, policy_version 144111 (0.00083) [2022-07-09 07:16:39,370][26022] Updated weights on worker 0-0, policy_version 144121 (0.00101) [2022-07-09 07:16:41,118][26022] Updated weights on worker 0-0, policy_version 144131 (0.00091) [2022-07-09 07:16:42,418][25689] Fps is (10 sec: 5767.8, 60 sec: 5727.1, 300 sec: 5747.2). Total num frames: 147596288. Throughput: 0: 6014.0. Samples: 147595360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:42,420][25689] Avg episode reward: [(0, '-56.949')] [2022-07-09 07:16:42,953][26022] Updated weights on worker 0-0, policy_version 144141 (0.00087) [2022-07-09 07:16:44,590][26022] Updated weights on worker 0-0, policy_version 144151 (0.00096) [2022-07-09 07:16:46,408][26022] Updated weights on worker 0-0, policy_version 144161 (0.00089) [2022-07-09 07:16:47,482][25689] Fps is (10 sec: 5868.6, 60 sec: 5740.5, 300 sec: 5742.6). Total num frames: 147625984. Throughput: 0: 6028.9. Samples: 147630534. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:47,483][25689] Avg episode reward: [(0, '-57.832')] [2022-07-09 07:16:48,150][26022] Updated weights on worker 0-0, policy_version 144171 (0.00088) [2022-07-09 07:16:50,072][26022] Updated weights on worker 0-0, policy_version 144181 (0.00084) [2022-07-09 07:16:51,681][26022] Updated weights on worker 0-0, policy_version 144191 (0.00526) [2022-07-09 07:16:52,485][25689] Fps is (10 sec: 5899.7, 60 sec: 5745.1, 300 sec: 5750.1). Total num frames: 147655680. Throughput: 0: 6067.4. Samples: 147665356. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:52,486][25689] Avg episode reward: [(0, '-57.624')] [2022-07-09 07:16:53,618][26022] Updated weights on worker 0-0, policy_version 144201 (0.00611) [2022-07-09 07:16:55,087][26022] Updated weights on worker 0-0, policy_version 144211 (0.00085) [2022-07-09 07:16:56,947][26022] Updated weights on worker 0-0, policy_version 144221 (0.00070) [2022-07-09 07:16:57,525][25689] Fps is (10 sec: 5811.5, 60 sec: 5758.8, 300 sec: 5745.0). Total num frames: 147684352. Throughput: 0: 6068.8. Samples: 147682792. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:16:57,527][25689] Avg episode reward: [(0, '-57.365')] [2022-07-09 07:16:58,888][26022] Updated weights on worker 0-0, policy_version 144231 (0.00087) [2022-07-09 07:17:00,514][26022] Updated weights on worker 0-0, policy_version 144241 (0.00083) [2022-07-09 07:17:02,610][25689] Fps is (10 sec: 5461.2, 60 sec: 5689.9, 300 sec: 5743.9). Total num frames: 147710976. Throughput: 0: 6025.3. Samples: 147717024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:02,611][25689] Avg episode reward: [(0, '-58.017')] [2022-07-09 07:17:02,903][26022] Updated weights on worker 0-0, policy_version 144251 (0.00093) [2022-07-09 07:17:04,504][26022] Updated weights on worker 0-0, policy_version 144261 (0.00092) [2022-07-09 07:17:06,359][26022] Updated weights on worker 0-0, policy_version 144271 (0.00085) [2022-07-09 07:17:07,689][25689] Fps is (10 sec: 5541.3, 60 sec: 5736.6, 300 sec: 5746.0). Total num frames: 147740672. Throughput: 0: 5867.0. Samples: 147749090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:07,690][25689] Avg episode reward: [(0, '-58.420')] [2022-07-09 07:17:08,152][26022] Updated weights on worker 0-0, policy_version 144281 (0.00092) [2022-07-09 07:17:09,858][26022] Updated weights on worker 0-0, policy_version 144291 (0.00089) [2022-07-09 07:17:11,691][26022] Updated weights on worker 0-0, policy_version 144301 (0.00585) [2022-07-09 07:17:12,695][25689] Fps is (10 sec: 5787.8, 60 sec: 5740.6, 300 sec: 5749.6). Total num frames: 147769344. Throughput: 0: 4996.7. Samples: 147766338. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:12,696][25689] Avg episode reward: [(0, '-57.687')] [2022-07-09 07:17:13,562][26022] Updated weights on worker 0-0, policy_version 144311 (0.00085) [2022-07-09 07:17:15,223][26022] Updated weights on worker 0-0, policy_version 144321 (0.00089) [2022-07-09 07:17:17,030][26022] Updated weights on worker 0-0, policy_version 144331 (0.00088) [2022-07-09 07:17:17,766][25689] Fps is (10 sec: 5792.1, 60 sec: 5734.2, 300 sec: 5745.7). Total num frames: 147799040. Throughput: 0: 5848.9. Samples: 147801180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:17,767][25689] Avg episode reward: [(0, '-57.848')] [2022-07-09 07:17:18,696][26022] Updated weights on worker 0-0, policy_version 144341 (0.00089) [2022-07-09 07:17:20,634][26022] Updated weights on worker 0-0, policy_version 144351 (0.00083) [2022-07-09 07:17:22,395][26022] Updated weights on worker 0-0, policy_version 144361 (0.00087) [2022-07-09 07:17:22,856][25689] Fps is (10 sec: 5744.4, 60 sec: 5734.8, 300 sec: 5744.4). Total num frames: 147827712. Throughput: 0: 5874.4. Samples: 147835954. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:22,857][25689] Avg episode reward: [(0, '-57.986')] [2022-07-09 07:17:24,113][26022] Updated weights on worker 0-0, policy_version 144371 (0.00087) [2022-07-09 07:17:25,815][26022] Updated weights on worker 0-0, policy_version 144381 (0.00363) [2022-07-09 07:17:27,735][26022] Updated weights on worker 0-0, policy_version 144391 (0.00088) [2022-07-09 07:17:27,892][25689] Fps is (10 sec: 5764.1, 60 sec: 5736.1, 300 sec: 5747.9). Total num frames: 147857408. Throughput: 0: 5161.9. Samples: 147853378. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:27,893][25689] Avg episode reward: [(0, '-57.996')] [2022-07-09 07:17:29,375][26022] Updated weights on worker 0-0, policy_version 144401 (0.00086) [2022-07-09 07:17:31,361][26022] Updated weights on worker 0-0, policy_version 144411 (0.00091) [2022-07-09 07:17:32,896][25689] Fps is (10 sec: 5711.4, 60 sec: 5728.5, 300 sec: 5741.3). Total num frames: 147885056. Throughput: 0: 6000.3. Samples: 147887550. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:32,897][25689] Avg episode reward: [(0, '-58.565')] [2022-07-09 07:17:33,131][26022] Updated weights on worker 0-0, policy_version 144421 (0.00079) [2022-07-09 07:17:34,761][26022] Updated weights on worker 0-0, policy_version 144431 (0.00083) [2022-07-09 07:17:36,700][26022] Updated weights on worker 0-0, policy_version 144441 (0.00088) [2022-07-09 07:17:37,911][25689] Fps is (10 sec: 5723.7, 60 sec: 5750.0, 300 sec: 5745.2). Total num frames: 147914752. Throughput: 0: 6010.9. Samples: 147922268. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:37,913][25689] Avg episode reward: [(0, '-58.139')] [2022-07-09 07:17:38,401][26022] Updated weights on worker 0-0, policy_version 144451 (0.00087) [2022-07-09 07:17:40,272][26022] Updated weights on worker 0-0, policy_version 144461 (0.00082) [2022-07-09 07:17:41,867][26022] Updated weights on worker 0-0, policy_version 144471 (0.00080) [2022-07-09 07:17:42,964][25689] Fps is (10 sec: 5797.5, 60 sec: 5733.4, 300 sec: 5741.0). Total num frames: 147943424. Throughput: 0: 5156.0. Samples: 147939632. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:42,964][25689] Avg episode reward: [(0, '-57.485')] [2022-07-09 07:17:43,883][26022] Updated weights on worker 0-0, policy_version 144481 (0.00084) [2022-07-09 07:17:45,433][26022] Updated weights on worker 0-0, policy_version 144491 (0.00087) [2022-07-09 07:17:47,278][26022] Updated weights on worker 0-0, policy_version 144501 (0.01174) [2022-07-09 07:17:47,991][25689] Fps is (10 sec: 5892.2, 60 sec: 5753.9, 300 sec: 5744.4). Total num frames: 147974144. Throughput: 0: 6035.3. Samples: 147974678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:47,992][25689] Avg episode reward: [(0, '-58.481')] [2022-07-09 07:17:48,979][26022] Updated weights on worker 0-0, policy_version 144511 (0.00085) [2022-07-09 07:17:50,885][26022] Updated weights on worker 0-0, policy_version 144521 (0.00092) [2022-07-09 07:17:52,461][26022] Updated weights on worker 0-0, policy_version 144531 (0.00085) [2022-07-09 07:17:53,001][25689] Fps is (10 sec: 5917.1, 60 sec: 5736.2, 300 sec: 5748.2). Total num frames: 148002816. Throughput: 0: 6065.0. Samples: 148009488. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 07:17:53,002][25689] Avg episode reward: [(0, '-59.399')] [2022-07-09 07:17:54,378][26022] Updated weights on worker 0-0, policy_version 144541 (0.00080) [2022-07-09 07:17:55,879][26022] Updated weights on worker 0-0, policy_version 144551 (0.00089) [2022-07-09 07:17:58,031][25689] Fps is (10 sec: 5507.6, 60 sec: 5703.4, 300 sec: 5735.6). Total num frames: 148029440. Throughput: 0: 5194.5. Samples: 148026780. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:17:58,031][25689] Avg episode reward: [(0, '-59.138')] [2022-07-09 07:17:58,071][26022] Updated weights on worker 0-0, policy_version 144561 (0.00089) [2022-07-09 07:17:59,376][26022] Updated weights on worker 0-0, policy_version 144571 (0.00090) [2022-07-09 07:18:01,573][26022] Updated weights on worker 0-0, policy_version 144581 (0.00082) [2022-07-09 07:18:03,085][25689] Fps is (10 sec: 5483.5, 60 sec: 5740.1, 300 sec: 5738.0). Total num frames: 148058112. Throughput: 0: 6045.6. Samples: 148061278. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:03,086][25689] Avg episode reward: [(0, '-58.459')] [2022-07-09 07:18:03,641][26022] Updated weights on worker 0-0, policy_version 144591 (0.00087) [2022-07-09 07:18:05,356][26022] Updated weights on worker 0-0, policy_version 144601 (0.00083) [2022-07-09 07:18:07,102][26022] Updated weights on worker 0-0, policy_version 144611 (0.00082) [2022-07-09 07:18:08,095][25689] Fps is (10 sec: 5799.5, 60 sec: 5746.7, 300 sec: 5748.6). Total num frames: 148087808. Throughput: 0: 5920.5. Samples: 148093706. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:08,095][25689] Avg episode reward: [(0, '-58.756')] [2022-07-09 07:18:09,112][26022] Updated weights on worker 0-0, policy_version 144621 (0.00088) [2022-07-09 07:18:10,725][26022] Updated weights on worker 0-0, policy_version 144631 (0.00085) [2022-07-09 07:18:12,515][26022] Updated weights on worker 0-0, policy_version 144641 (0.00090) [2022-07-09 07:18:13,106][25689] Fps is (10 sec: 5722.5, 60 sec: 5729.3, 300 sec: 5742.1). Total num frames: 148115456. Throughput: 0: 5045.5. Samples: 148110926. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:13,107][25689] Avg episode reward: [(0, '-59.656')] [2022-07-09 07:18:14,187][26022] Updated weights on worker 0-0, policy_version 144651 (0.00090) [2022-07-09 07:18:16,103][26022] Updated weights on worker 0-0, policy_version 144661 (0.00088) [2022-07-09 07:18:17,757][26022] Updated weights on worker 0-0, policy_version 144671 (0.00085) [2022-07-09 07:18:18,111][25689] Fps is (10 sec: 5622.8, 60 sec: 5718.6, 300 sec: 5740.6). Total num frames: 148144128. Throughput: 0: 5922.1. Samples: 148145700. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:18,112][25689] Avg episode reward: [(0, '-59.434')] [2022-07-09 07:18:19,603][26022] Updated weights on worker 0-0, policy_version 144681 (0.00096) [2022-07-09 07:18:21,603][26022] Updated weights on worker 0-0, policy_version 144691 (0.00097) [2022-07-09 07:18:23,115][26022] Updated weights on worker 0-0, policy_version 144701 (0.00087) [2022-07-09 07:18:23,166][25689] Fps is (10 sec: 5802.0, 60 sec: 5738.9, 300 sec: 5740.7). Total num frames: 148173824. Throughput: 0: 5924.8. Samples: 148180252. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:23,167][25689] Avg episode reward: [(0, '-59.392')] [2022-07-09 07:18:24,993][26022] Updated weights on worker 0-0, policy_version 144711 (0.00076) [2022-07-09 07:18:26,777][26022] Updated weights on worker 0-0, policy_version 144721 (0.00088) [2022-07-09 07:18:28,188][25689] Fps is (10 sec: 5792.4, 60 sec: 5723.3, 300 sec: 5740.5). Total num frames: 148202496. Throughput: 0: 5176.5. Samples: 148197718. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:28,188][25689] Avg episode reward: [(0, '-59.274')] [2022-07-09 07:18:28,479][26022] Updated weights on worker 0-0, policy_version 144731 (0.00083) [2022-07-09 07:18:30,429][26022] Updated weights on worker 0-0, policy_version 144741 (0.00089) [2022-07-09 07:18:32,077][26022] Updated weights on worker 0-0, policy_version 144751 (0.00106) [2022-07-09 07:18:33,190][25689] Fps is (10 sec: 5720.8, 60 sec: 5740.4, 300 sec: 5740.7). Total num frames: 148231168. Throughput: 0: 6030.6. Samples: 148232044. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:33,191][25689] Avg episode reward: [(0, '-59.458')] [2022-07-09 07:18:33,985][26022] Updated weights on worker 0-0, policy_version 144761 (0.00087) [2022-07-09 07:18:35,536][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:18:35,546][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000144771_148245504.pth [2022-07-09 07:18:35,546][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000142751_146177024.pth [2022-07-09 07:18:35,555][26022] Updated weights on worker 0-0, policy_version 144771 (0.00095) [2022-07-09 07:18:37,515][26022] Updated weights on worker 0-0, policy_version 144781 (0.00090) [2022-07-09 07:18:38,196][25689] Fps is (10 sec: 5730.0, 60 sec: 5724.3, 300 sec: 5738.5). Total num frames: 148259840. Throughput: 0: 6017.3. Samples: 148266554. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:38,196][25689] Avg episode reward: [(0, '-58.907')] [2022-07-09 07:18:39,211][26022] Updated weights on worker 0-0, policy_version 144791 (0.00086) [2022-07-09 07:18:41,129][26022] Updated weights on worker 0-0, policy_version 144801 (0.00086) [2022-07-09 07:18:42,768][26022] Updated weights on worker 0-0, policy_version 144811 (0.00086) [2022-07-09 07:18:43,340][25689] Fps is (10 sec: 5649.4, 60 sec: 5715.6, 300 sec: 5736.2). Total num frames: 148288512. Throughput: 0: 5140.0. Samples: 148283948. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:43,341][25689] Avg episode reward: [(0, '-57.896')] [2022-07-09 07:18:44,634][26022] Updated weights on worker 0-0, policy_version 144821 (0.00086) [2022-07-09 07:18:46,319][26022] Updated weights on worker 0-0, policy_version 144831 (0.00085) [2022-07-09 07:18:48,045][26022] Updated weights on worker 0-0, policy_version 144841 (0.00084) [2022-07-09 07:18:48,377][25689] Fps is (10 sec: 5733.1, 60 sec: 5697.8, 300 sec: 5735.7). Total num frames: 148318208. Throughput: 0: 5987.9. Samples: 148318606. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:48,377][25689] Avg episode reward: [(0, '-57.033')] [2022-07-09 07:18:49,761][26022] Updated weights on worker 0-0, policy_version 144851 (0.00088) [2022-07-09 07:18:51,554][26022] Updated weights on worker 0-0, policy_version 144861 (0.00091) [2022-07-09 07:18:53,272][26022] Updated weights on worker 0-0, policy_version 144871 (0.00216) [2022-07-09 07:18:53,380][25689] Fps is (10 sec: 5915.8, 60 sec: 5715.4, 300 sec: 5743.1). Total num frames: 148347904. Throughput: 0: 6009.5. Samples: 148353376. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:53,381][25689] Avg episode reward: [(0, '-57.163')] [2022-07-09 07:18:55,136][26022] Updated weights on worker 0-0, policy_version 144881 (0.00088) [2022-07-09 07:18:56,863][26022] Updated weights on worker 0-0, policy_version 144891 (0.00080) [2022-07-09 07:18:58,401][25689] Fps is (10 sec: 5720.6, 60 sec: 5733.2, 300 sec: 5737.9). Total num frames: 148375552. Throughput: 0: 5159.2. Samples: 148370796. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:18:58,403][25689] Avg episode reward: [(0, '-57.283')] [2022-07-09 07:18:58,822][26022] Updated weights on worker 0-0, policy_version 144901 (0.00084) [2022-07-09 07:19:00,402][26022] Updated weights on worker 0-0, policy_version 144911 (0.00086) [2022-07-09 07:19:02,772][26022] Updated weights on worker 0-0, policy_version 144921 (0.00088) [2022-07-09 07:19:03,495][25689] Fps is (10 sec: 5568.2, 60 sec: 5729.5, 300 sec: 5740.3). Total num frames: 148404224. Throughput: 0: 5942.7. Samples: 148403718. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:03,495][25689] Avg episode reward: [(0, '-57.124')] [2022-07-09 07:19:04,492][26022] Updated weights on worker 0-0, policy_version 144931 (0.00086) [2022-07-09 07:19:06,277][26022] Updated weights on worker 0-0, policy_version 144941 (0.00087) [2022-07-09 07:19:07,867][26022] Updated weights on worker 0-0, policy_version 144951 (0.00080) [2022-07-09 07:19:08,507][25689] Fps is (10 sec: 5471.2, 60 sec: 5678.3, 300 sec: 5733.2). Total num frames: 148430848. Throughput: 0: 5917.6. Samples: 148437732. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:08,508][25689] Avg episode reward: [(0, '-57.918')] [2022-07-09 07:19:09,711][26022] Updated weights on worker 0-0, policy_version 144961 (0.00097) [2022-07-09 07:19:11,603][26022] Updated weights on worker 0-0, policy_version 144971 (0.00090) [2022-07-09 07:19:13,299][26022] Updated weights on worker 0-0, policy_version 144981 (0.00083) [2022-07-09 07:19:13,519][25689] Fps is (10 sec: 5618.5, 60 sec: 5712.2, 300 sec: 5730.6). Total num frames: 148460544. Throughput: 0: 5052.0. Samples: 148455116. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:13,519][25689] Avg episode reward: [(0, '-58.178')] [2022-07-09 07:19:14,963][26022] Updated weights on worker 0-0, policy_version 144991 (0.00085) [2022-07-09 07:19:16,913][26022] Updated weights on worker 0-0, policy_version 145001 (0.00086) [2022-07-09 07:19:18,503][26022] Updated weights on worker 0-0, policy_version 145011 (0.00088) [2022-07-09 07:19:18,536][25689] Fps is (10 sec: 6024.6, 60 sec: 5745.0, 300 sec: 5738.5). Total num frames: 148491264. Throughput: 0: 5920.8. Samples: 148490010. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:18,536][25689] Avg episode reward: [(0, '-58.018')] [2022-07-09 07:19:20,399][26022] Updated weights on worker 0-0, policy_version 145021 (0.00086) [2022-07-09 07:19:22,027][26022] Updated weights on worker 0-0, policy_version 145031 (0.00088) [2022-07-09 07:19:23,643][25689] Fps is (10 sec: 5866.2, 60 sec: 5723.0, 300 sec: 5741.5). Total num frames: 148519936. Throughput: 0: 6013.9. Samples: 148524888. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:23,644][25689] Avg episode reward: [(0, '-57.579')] [2022-07-09 07:19:23,967][26022] Updated weights on worker 0-0, policy_version 145041 (0.00085) [2022-07-09 07:19:25,582][26022] Updated weights on worker 0-0, policy_version 145051 (0.00089) [2022-07-09 07:19:27,326][26022] Updated weights on worker 0-0, policy_version 145061 (0.00086) [2022-07-09 07:19:28,644][25689] Fps is (10 sec: 5875.3, 60 sec: 5758.9, 300 sec: 5743.1). Total num frames: 148550656. Throughput: 0: 5200.1. Samples: 148542446. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:28,645][25689] Avg episode reward: [(0, '-57.753')] [2022-07-09 07:19:29,003][26022] Updated weights on worker 0-0, policy_version 145071 (0.00088) [2022-07-09 07:19:30,932][26022] Updated weights on worker 0-0, policy_version 145081 (0.00093) [2022-07-09 07:19:32,817][26022] Updated weights on worker 0-0, policy_version 145091 (0.00086) [2022-07-09 07:19:33,661][25689] Fps is (10 sec: 5724.3, 60 sec: 5723.6, 300 sec: 5732.9). Total num frames: 148577280. Throughput: 0: 6052.8. Samples: 148577032. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:33,661][25689] Avg episode reward: [(0, '-56.030')] [2022-07-09 07:19:34,550][26022] Updated weights on worker 0-0, policy_version 145101 (0.00092) [2022-07-09 07:19:36,382][26022] Updated weights on worker 0-0, policy_version 145111 (0.00080) [2022-07-09 07:19:38,029][26022] Updated weights on worker 0-0, policy_version 145121 (0.00083) [2022-07-09 07:19:38,671][25689] Fps is (10 sec: 5617.0, 60 sec: 5740.1, 300 sec: 5737.1). Total num frames: 148606976. Throughput: 0: 6055.1. Samples: 148611932. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:38,671][25689] Avg episode reward: [(0, '-56.505')] [2022-07-09 07:19:39,873][26022] Updated weights on worker 0-0, policy_version 145131 (0.00086) [2022-07-09 07:19:41,563][26022] Updated weights on worker 0-0, policy_version 145141 (0.00085) [2022-07-09 07:19:43,461][26022] Updated weights on worker 0-0, policy_version 145151 (0.00085) [2022-07-09 07:19:43,701][25689] Fps is (10 sec: 5813.1, 60 sec: 5751.0, 300 sec: 5735.0). Total num frames: 148635648. Throughput: 0: 5206.7. Samples: 148629326. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:43,702][25689] Avg episode reward: [(0, '-57.346')] [2022-07-09 07:19:45,135][26022] Updated weights on worker 0-0, policy_version 145161 (0.00090) [2022-07-09 07:19:46,932][26022] Updated weights on worker 0-0, policy_version 145171 (0.00082) [2022-07-09 07:19:48,660][26022] Updated weights on worker 0-0, policy_version 145181 (0.00084) [2022-07-09 07:19:48,707][25689] Fps is (10 sec: 5816.0, 60 sec: 5753.9, 300 sec: 5735.0). Total num frames: 148665344. Throughput: 0: 6054.1. Samples: 148663906. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:48,707][25689] Avg episode reward: [(0, '-56.932')] [2022-07-09 07:19:50,311][26022] Updated weights on worker 0-0, policy_version 145191 (0.00085) [2022-07-09 07:19:52,226][26022] Updated weights on worker 0-0, policy_version 145201 (0.00081) [2022-07-09 07:19:53,715][25689] Fps is (10 sec: 5931.0, 60 sec: 5753.5, 300 sec: 5738.9). Total num frames: 148695040. Throughput: 0: 6070.5. Samples: 148698772. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:53,716][25689] Avg episode reward: [(0, '-56.986')] [2022-07-09 07:19:53,968][26022] Updated weights on worker 0-0, policy_version 145211 (0.00086) [2022-07-09 07:19:55,703][26022] Updated weights on worker 0-0, policy_version 145221 (0.00083) [2022-07-09 07:19:57,527][26022] Updated weights on worker 0-0, policy_version 145231 (0.00086) [2022-07-09 07:19:58,726][25689] Fps is (10 sec: 5825.6, 60 sec: 5771.4, 300 sec: 5746.8). Total num frames: 148723712. Throughput: 0: 5191.2. Samples: 148716042. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:19:58,726][25689] Avg episode reward: [(0, '-57.503')] [2022-07-09 07:19:59,225][26022] Updated weights on worker 0-0, policy_version 145241 (0.00090) [2022-07-09 07:20:01,081][26022] Updated weights on worker 0-0, policy_version 145251 (0.00085) [2022-07-09 07:20:03,280][26022] Updated weights on worker 0-0, policy_version 145261 (0.00089) [2022-07-09 07:20:03,847][25689] Fps is (10 sec: 5356.9, 60 sec: 5717.9, 300 sec: 5734.6). Total num frames: 148749312. Throughput: 0: 5912.3. Samples: 148748430. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 07:20:03,847][25689] Avg episode reward: [(0, '-57.886')] [2022-07-09 07:20:05,029][26022] Updated weights on worker 0-0, policy_version 145271 (0.00090) [2022-07-09 07:20:06,975][26022] Updated weights on worker 0-0, policy_version 145281 (0.00088) [2022-07-09 07:20:08,494][26022] Updated weights on worker 0-0, policy_version 145291 (0.00093) [2022-07-09 07:20:08,875][25689] Fps is (10 sec: 5448.5, 60 sec: 5767.4, 300 sec: 5734.4). Total num frames: 148779008. Throughput: 0: 5900.2. Samples: 148782902. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:08,875][25689] Avg episode reward: [(0, '-56.981')] [2022-07-09 07:20:10,494][26022] Updated weights on worker 0-0, policy_version 145301 (0.00079) [2022-07-09 07:20:12,042][26022] Updated weights on worker 0-0, policy_version 145311 (0.00086) [2022-07-09 07:20:13,970][25689] Fps is (10 sec: 5765.5, 60 sec: 5742.4, 300 sec: 5729.5). Total num frames: 148807680. Throughput: 0: 5867.5. Samples: 148817618. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:13,972][25689] Avg episode reward: [(0, '-56.770')] [2022-07-09 07:20:13,991][26022] Updated weights on worker 0-0, policy_version 145321 (0.00084) [2022-07-09 07:20:15,633][26022] Updated weights on worker 0-0, policy_version 145331 (0.00090) [2022-07-09 07:20:17,478][26022] Updated weights on worker 0-0, policy_version 145341 (0.00088) [2022-07-09 07:20:18,995][25689] Fps is (10 sec: 5868.7, 60 sec: 5741.6, 300 sec: 5741.2). Total num frames: 148838400. Throughput: 0: 5864.3. Samples: 148834906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:18,996][25689] Avg episode reward: [(0, '-57.943')] [2022-07-09 07:20:19,169][26022] Updated weights on worker 0-0, policy_version 145351 (0.00085) [2022-07-09 07:20:21,139][26022] Updated weights on worker 0-0, policy_version 145361 (0.00090) [2022-07-09 07:20:22,824][26022] Updated weights on worker 0-0, policy_version 145371 (0.00090) [2022-07-09 07:20:24,074][25689] Fps is (10 sec: 5776.8, 60 sec: 5727.4, 300 sec: 5732.9). Total num frames: 148866048. Throughput: 0: 5998.5. Samples: 148869764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:24,075][25689] Avg episode reward: [(0, '-57.607')] [2022-07-09 07:20:24,484][26022] Updated weights on worker 0-0, policy_version 145381 (0.00086) [2022-07-09 07:20:26,233][26022] Updated weights on worker 0-0, policy_version 145391 (0.00081) [2022-07-09 07:20:27,974][26022] Updated weights on worker 0-0, policy_version 145401 (0.00088) [2022-07-09 07:20:29,089][25689] Fps is (10 sec: 5579.6, 60 sec: 5692.2, 300 sec: 5729.6). Total num frames: 148894720. Throughput: 0: 6006.2. Samples: 148904312. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:29,089][25689] Avg episode reward: [(0, '-57.305')] [2022-07-09 07:20:29,873][26022] Updated weights on worker 0-0, policy_version 145411 (0.00085) [2022-07-09 07:20:31,616][26022] Updated weights on worker 0-0, policy_version 145421 (0.00081) [2022-07-09 07:20:33,716][26022] Updated weights on worker 0-0, policy_version 145431 (0.00088) [2022-07-09 07:20:34,095][25689] Fps is (10 sec: 5722.6, 60 sec: 5727.1, 300 sec: 5733.5). Total num frames: 148923392. Throughput: 0: 5162.9. Samples: 148921518. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:34,095][25689] Avg episode reward: [(0, '-56.825')] [2022-07-09 07:20:35,340][26022] Updated weights on worker 0-0, policy_version 145441 (0.00085) [2022-07-09 07:20:35,607][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:20:35,616][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000145443_148933632.pth [2022-07-09 07:20:35,617][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000143424_146866176.pth [2022-07-09 07:20:37,240][26022] Updated weights on worker 0-0, policy_version 145451 (0.00085) [2022-07-09 07:20:38,880][26022] Updated weights on worker 0-0, policy_version 145461 (0.00082) [2022-07-09 07:20:39,131][25689] Fps is (10 sec: 5812.5, 60 sec: 5724.7, 300 sec: 5730.9). Total num frames: 148953088. Throughput: 0: 6005.6. Samples: 148955834. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:39,133][25689] Avg episode reward: [(0, '-56.281')] [2022-07-09 07:20:40,811][26022] Updated weights on worker 0-0, policy_version 145471 (0.00090) [2022-07-09 07:20:42,614][26022] Updated weights on worker 0-0, policy_version 145481 (0.00084) [2022-07-09 07:20:44,202][25689] Fps is (10 sec: 5775.1, 60 sec: 5720.9, 300 sec: 5730.0). Total num frames: 148981760. Throughput: 0: 5985.8. Samples: 148990244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:44,202][25689] Avg episode reward: [(0, '-56.812')] [2022-07-09 07:20:44,233][26022] Updated weights on worker 0-0, policy_version 145491 (0.00087) [2022-07-09 07:20:46,224][26022] Updated weights on worker 0-0, policy_version 145501 (0.00086) [2022-07-09 07:20:47,690][26022] Updated weights on worker 0-0, policy_version 145511 (0.00092) [2022-07-09 07:20:49,237][25689] Fps is (10 sec: 5572.6, 60 sec: 5684.1, 300 sec: 5723.5). Total num frames: 149009408. Throughput: 0: 5123.6. Samples: 149007544. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:49,238][25689] Avg episode reward: [(0, '-55.959')] [2022-07-09 07:20:49,728][26022] Updated weights on worker 0-0, policy_version 145521 (0.00088) [2022-07-09 07:20:51,279][26022] Updated weights on worker 0-0, policy_version 145531 (0.00087) [2022-07-09 07:20:53,059][26022] Updated weights on worker 0-0, policy_version 145541 (0.00088) [2022-07-09 07:20:54,250][25689] Fps is (10 sec: 5808.8, 60 sec: 5700.7, 300 sec: 5733.7). Total num frames: 149040128. Throughput: 0: 6002.1. Samples: 149042490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:54,250][25689] Avg episode reward: [(0, '-56.451')] [2022-07-09 07:20:54,823][26022] Updated weights on worker 0-0, policy_version 145551 (0.00085) [2022-07-09 07:20:56,691][26022] Updated weights on worker 0-0, policy_version 145561 (0.00093) [2022-07-09 07:20:58,228][26022] Updated weights on worker 0-0, policy_version 145571 (0.00091) [2022-07-09 07:20:59,262][25689] Fps is (10 sec: 6026.5, 60 sec: 5717.4, 300 sec: 5731.3). Total num frames: 149069824. Throughput: 0: 6055.5. Samples: 149077742. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:20:59,263][25689] Avg episode reward: [(0, '-55.981')] [2022-07-09 07:21:00,250][26022] Updated weights on worker 0-0, policy_version 145581 (0.00083) [2022-07-09 07:21:01,602][26022] Updated weights on worker 0-0, policy_version 145591 (0.00083) [2022-07-09 07:21:04,131][26022] Updated weights on worker 0-0, policy_version 145601 (0.00097) [2022-07-09 07:21:04,333][25689] Fps is (10 sec: 5585.5, 60 sec: 5739.1, 300 sec: 5730.7). Total num frames: 149096448. Throughput: 0: 5204.2. Samples: 149095012. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:04,333][25689] Avg episode reward: [(0, '-57.338')] [2022-07-09 07:21:05,820][26022] Updated weights on worker 0-0, policy_version 145611 (0.00084) [2022-07-09 07:21:07,522][26022] Updated weights on worker 0-0, policy_version 145621 (0.00104) [2022-07-09 07:21:09,342][25689] Fps is (10 sec: 5486.0, 60 sec: 5724.0, 300 sec: 5731.4). Total num frames: 149125120. Throughput: 0: 5977.6. Samples: 149127722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:09,342][25689] Avg episode reward: [(0, '-57.589')] [2022-07-09 07:21:09,402][26022] Updated weights on worker 0-0, policy_version 145631 (0.00093) [2022-07-09 07:21:11,103][26022] Updated weights on worker 0-0, policy_version 145641 (0.00087) [2022-07-09 07:21:12,942][26022] Updated weights on worker 0-0, policy_version 145651 (0.00092) [2022-07-09 07:21:14,357][25689] Fps is (10 sec: 5618.8, 60 sec: 5714.7, 300 sec: 5724.4). Total num frames: 149152768. Throughput: 0: 5944.5. Samples: 149162014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:14,357][25689] Avg episode reward: [(0, '-57.419')] [2022-07-09 07:21:14,684][26022] Updated weights on worker 0-0, policy_version 145661 (0.00089) [2022-07-09 07:21:16,315][26022] Updated weights on worker 0-0, policy_version 145671 (0.00085) [2022-07-09 07:21:18,222][26022] Updated weights on worker 0-0, policy_version 145681 (0.00088) [2022-07-09 07:21:19,362][25689] Fps is (10 sec: 5825.3, 60 sec: 5716.5, 300 sec: 5733.0). Total num frames: 149183488. Throughput: 0: 5059.1. Samples: 149179426. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:19,362][25689] Avg episode reward: [(0, '-58.141')] [2022-07-09 07:21:20,023][26022] Updated weights on worker 0-0, policy_version 145691 (0.00058) [2022-07-09 07:21:21,683][26022] Updated weights on worker 0-0, policy_version 145701 (0.00093) [2022-07-09 07:21:23,542][26022] Updated weights on worker 0-0, policy_version 145711 (0.00091) [2022-07-09 07:21:24,418][25689] Fps is (10 sec: 6004.8, 60 sec: 5752.6, 300 sec: 5732.9). Total num frames: 149213184. Throughput: 0: 5945.5. Samples: 149214426. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:24,418][25689] Avg episode reward: [(0, '-58.122')] [2022-07-09 07:21:25,284][26022] Updated weights on worker 0-0, policy_version 145721 (0.00088) [2022-07-09 07:21:27,099][26022] Updated weights on worker 0-0, policy_version 145731 (0.00102) [2022-07-09 07:21:28,798][26022] Updated weights on worker 0-0, policy_version 145741 (0.00092) [2022-07-09 07:21:29,423][25689] Fps is (10 sec: 5699.2, 60 sec: 5736.5, 300 sec: 5731.3). Total num frames: 149240832. Throughput: 0: 6046.5. Samples: 149249146. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:29,425][25689] Avg episode reward: [(0, '-57.215')] [2022-07-09 07:21:30,760][26022] Updated weights on worker 0-0, policy_version 145751 (0.00085) [2022-07-09 07:21:32,411][26022] Updated weights on worker 0-0, policy_version 145761 (0.00085) [2022-07-09 07:21:34,167][26022] Updated weights on worker 0-0, policy_version 145771 (0.00049) [2022-07-09 07:21:34,477][25689] Fps is (10 sec: 5802.4, 60 sec: 5765.9, 300 sec: 5738.4). Total num frames: 149271552. Throughput: 0: 5201.2. Samples: 149266664. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:34,478][25689] Avg episode reward: [(0, '-56.444')] [2022-07-09 07:21:35,826][26022] Updated weights on worker 0-0, policy_version 145781 (0.00094) [2022-07-09 07:21:37,624][26022] Updated weights on worker 0-0, policy_version 145791 (0.00088) [2022-07-09 07:21:39,508][25689] Fps is (10 sec: 5787.7, 60 sec: 5732.5, 300 sec: 5732.0). Total num frames: 149299200. Throughput: 0: 6045.4. Samples: 149301220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:39,509][25689] Avg episode reward: [(0, '-56.720')] [2022-07-09 07:21:39,527][26022] Updated weights on worker 0-0, policy_version 145801 (0.00087) [2022-07-09 07:21:41,203][26022] Updated weights on worker 0-0, policy_version 145811 (0.00101) [2022-07-09 07:21:42,988][26022] Updated weights on worker 0-0, policy_version 145821 (0.00085) [2022-07-09 07:21:44,569][25689] Fps is (10 sec: 5783.5, 60 sec: 5767.3, 300 sec: 5735.5). Total num frames: 149329920. Throughput: 0: 6045.7. Samples: 149336256. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:44,572][25689] Avg episode reward: [(0, '-57.194')] [2022-07-09 07:21:44,704][26022] Updated weights on worker 0-0, policy_version 145831 (0.00083) [2022-07-09 07:21:46,475][26022] Updated weights on worker 0-0, policy_version 145841 (0.00089) [2022-07-09 07:21:48,257][26022] Updated weights on worker 0-0, policy_version 145851 (0.00083) [2022-07-09 07:21:49,583][25689] Fps is (10 sec: 5895.3, 60 sec: 5786.4, 300 sec: 5731.8). Total num frames: 149358592. Throughput: 0: 5187.8. Samples: 149353728. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:49,584][25689] Avg episode reward: [(0, '-56.883')] [2022-07-09 07:21:50,001][26022] Updated weights on worker 0-0, policy_version 145861 (0.00330) [2022-07-09 07:21:51,841][26022] Updated weights on worker 0-0, policy_version 145871 (0.00087) [2022-07-09 07:21:53,508][26022] Updated weights on worker 0-0, policy_version 145881 (0.00084) [2022-07-09 07:21:54,614][25689] Fps is (10 sec: 5810.7, 60 sec: 5767.7, 300 sec: 5735.4). Total num frames: 149388288. Throughput: 0: 6068.2. Samples: 149388858. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:54,616][25689] Avg episode reward: [(0, '-56.741')] [2022-07-09 07:21:55,319][26022] Updated weights on worker 0-0, policy_version 145891 (0.00082) [2022-07-09 07:21:56,955][26022] Updated weights on worker 0-0, policy_version 145901 (0.00458) [2022-07-09 07:21:58,771][26022] Updated weights on worker 0-0, policy_version 145911 (0.00088) [2022-07-09 07:21:59,645][25689] Fps is (10 sec: 5800.6, 60 sec: 5748.9, 300 sec: 5743.4). Total num frames: 149416960. Throughput: 0: 6089.7. Samples: 149423848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:21:59,646][25689] Avg episode reward: [(0, '-57.980')] [2022-07-09 07:22:00,505][26022] Updated weights on worker 0-0, policy_version 145921 (0.00083) [2022-07-09 07:22:02,718][26022] Updated weights on worker 0-0, policy_version 145931 (0.00084) [2022-07-09 07:22:04,402][26022] Updated weights on worker 0-0, policy_version 145941 (0.00088) [2022-07-09 07:22:04,785][25689] Fps is (10 sec: 5638.1, 60 sec: 5776.2, 300 sec: 5738.8). Total num frames: 149445632. Throughput: 0: 5151.4. Samples: 149440396. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:22:04,785][25689] Avg episode reward: [(0, '-57.594')] [2022-07-09 07:22:06,283][26022] Updated weights on worker 0-0, policy_version 145951 (0.00085) [2022-07-09 07:22:08,115][26022] Updated weights on worker 0-0, policy_version 145961 (0.00087) [2022-07-09 07:22:09,785][26022] Updated weights on worker 0-0, policy_version 145971 (0.00092) [2022-07-09 07:22:09,836][25689] Fps is (10 sec: 5627.0, 60 sec: 5772.2, 300 sec: 5737.9). Total num frames: 149474304. Throughput: 0: 5936.1. Samples: 149473954. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 07:22:09,836][25689] Avg episode reward: [(0, '-56.809')] [2022-07-09 07:22:11,755][26022] Updated weights on worker 0-0, policy_version 145981 (0.00088) [2022-07-09 07:22:13,250][26022] Updated weights on worker 0-0, policy_version 145991 (0.00083) [2022-07-09 07:22:14,855][25689] Fps is (10 sec: 5592.6, 60 sec: 5771.8, 300 sec: 5732.0). Total num frames: 149501952. Throughput: 0: 5918.6. Samples: 149508658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:14,856][25689] Avg episode reward: [(0, '-55.730')] [2022-07-09 07:22:15,159][26022] Updated weights on worker 0-0, policy_version 146001 (0.00085) [2022-07-09 07:22:16,767][26022] Updated weights on worker 0-0, policy_version 146011 (0.00091) [2022-07-09 07:22:18,587][26022] Updated weights on worker 0-0, policy_version 146021 (0.00090) [2022-07-09 07:22:19,883][25689] Fps is (10 sec: 5809.7, 60 sec: 5769.6, 300 sec: 5740.1). Total num frames: 149532672. Throughput: 0: 5047.3. Samples: 149525994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:19,883][25689] Avg episode reward: [(0, '-55.832')] [2022-07-09 07:22:20,390][26022] Updated weights on worker 0-0, policy_version 146031 (0.00086) [2022-07-09 07:22:22,084][26022] Updated weights on worker 0-0, policy_version 146041 (0.00086) [2022-07-09 07:22:24,008][26022] Updated weights on worker 0-0, policy_version 146051 (0.00422) [2022-07-09 07:22:24,941][25689] Fps is (10 sec: 5990.0, 60 sec: 5769.4, 300 sec: 5739.7). Total num frames: 149562368. Throughput: 0: 5986.8. Samples: 149561070. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:24,942][25689] Avg episode reward: [(0, '-54.333')] [2022-07-09 07:22:25,569][26022] Updated weights on worker 0-0, policy_version 146061 (0.00084) [2022-07-09 07:22:27,406][26022] Updated weights on worker 0-0, policy_version 146071 (0.00085) [2022-07-09 07:22:29,346][26022] Updated weights on worker 0-0, policy_version 146081 (0.00090) [2022-07-09 07:22:29,998][25689] Fps is (10 sec: 5770.1, 60 sec: 5781.4, 300 sec: 5742.1). Total num frames: 149591040. Throughput: 0: 6044.8. Samples: 149595832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:29,999][25689] Avg episode reward: [(0, '-54.760')] [2022-07-09 07:22:31,042][26022] Updated weights on worker 0-0, policy_version 146091 (0.00084) [2022-07-09 07:22:32,831][26022] Updated weights on worker 0-0, policy_version 146101 (0.00086) [2022-07-09 07:22:34,539][26022] Updated weights on worker 0-0, policy_version 146111 (0.00089) [2022-07-09 07:22:35,049][25689] Fps is (10 sec: 5673.4, 60 sec: 5747.9, 300 sec: 5737.9). Total num frames: 149619712. Throughput: 0: 5172.8. Samples: 149613116. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:35,049][25689] Avg episode reward: [(0, '-54.080')] [2022-07-09 07:22:35,825][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:22:35,846][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000146118_149624832.pth [2022-07-09 07:22:35,846][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000144099_147557376.pth [2022-07-09 07:22:36,335][26022] Updated weights on worker 0-0, policy_version 146121 (0.00085) [2022-07-09 07:22:38,375][26022] Updated weights on worker 0-0, policy_version 146131 (0.00091) [2022-07-09 07:22:39,690][26022] Updated weights on worker 0-0, policy_version 146141 (0.00078) [2022-07-09 07:22:40,090][25689] Fps is (10 sec: 5783.7, 60 sec: 5780.7, 300 sec: 5741.6). Total num frames: 149649408. Throughput: 0: 6010.3. Samples: 149647446. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:40,091][25689] Avg episode reward: [(0, '-54.482')] [2022-07-09 07:22:41,908][26022] Updated weights on worker 0-0, policy_version 146151 (0.00083) [2022-07-09 07:22:43,355][26022] Updated weights on worker 0-0, policy_version 146161 (0.00082) [2022-07-09 07:22:45,171][25689] Fps is (10 sec: 5665.3, 60 sec: 5728.2, 300 sec: 5730.2). Total num frames: 149677056. Throughput: 0: 5986.5. Samples: 149682174. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:45,171][25689] Avg episode reward: [(0, '-54.279')] [2022-07-09 07:22:45,382][26022] Updated weights on worker 0-0, policy_version 146171 (0.00083) [2022-07-09 07:22:47,043][26022] Updated weights on worker 0-0, policy_version 146181 (0.00093) [2022-07-09 07:22:48,770][26022] Updated weights on worker 0-0, policy_version 146191 (0.00088) [2022-07-09 07:22:50,186][25689] Fps is (10 sec: 5680.0, 60 sec: 5744.9, 300 sec: 5733.6). Total num frames: 149706752. Throughput: 0: 5129.6. Samples: 149699386. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:50,186][25689] Avg episode reward: [(0, '-54.414')] [2022-07-09 07:22:50,719][26022] Updated weights on worker 0-0, policy_version 146201 (0.00564) [2022-07-09 07:22:52,219][26022] Updated weights on worker 0-0, policy_version 146211 (0.00088) [2022-07-09 07:22:54,287][26022] Updated weights on worker 0-0, policy_version 146221 (0.00087) [2022-07-09 07:22:55,248][25689] Fps is (10 sec: 5995.1, 60 sec: 5758.8, 300 sec: 5746.7). Total num frames: 149737472. Throughput: 0: 6000.8. Samples: 149734330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:22:55,249][25689] Avg episode reward: [(0, '-55.009')] [2022-07-09 07:22:55,856][26022] Updated weights on worker 0-0, policy_version 146231 (0.00093) [2022-07-09 07:22:57,603][26022] Updated weights on worker 0-0, policy_version 146241 (0.00088) [2022-07-09 07:22:59,579][26022] Updated weights on worker 0-0, policy_version 146251 (0.00086) [2022-07-09 07:23:00,258][25689] Fps is (10 sec: 5795.1, 60 sec: 5744.0, 300 sec: 5744.2). Total num frames: 149765120. Throughput: 0: 6027.6. Samples: 149769008. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:00,259][25689] Avg episode reward: [(0, '-55.581')] [2022-07-09 07:23:01,132][26022] Updated weights on worker 0-0, policy_version 146261 (0.00083) [2022-07-09 07:23:03,473][26022] Updated weights on worker 0-0, policy_version 146271 (0.00084) [2022-07-09 07:23:05,227][26022] Updated weights on worker 0-0, policy_version 146281 (0.00088) [2022-07-09 07:23:05,377][25689] Fps is (10 sec: 5358.3, 60 sec: 5712.2, 300 sec: 5731.7). Total num frames: 149791744. Throughput: 0: 5909.1. Samples: 149801574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:05,377][25689] Avg episode reward: [(0, '-55.389')] [2022-07-09 07:23:06,930][26022] Updated weights on worker 0-0, policy_version 146291 (0.00082) [2022-07-09 07:23:08,666][26022] Updated weights on worker 0-0, policy_version 146301 (0.00089) [2022-07-09 07:23:10,387][25689] Fps is (10 sec: 5560.2, 60 sec: 5733.0, 300 sec: 5738.6). Total num frames: 149821440. Throughput: 0: 5906.8. Samples: 149818710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:10,387][25689] Avg episode reward: [(0, '-56.319')] [2022-07-09 07:23:10,580][26022] Updated weights on worker 0-0, policy_version 146311 (0.00095) [2022-07-09 07:23:12,259][26022] Updated weights on worker 0-0, policy_version 146321 (0.00095) [2022-07-09 07:23:14,104][26022] Updated weights on worker 0-0, policy_version 146331 (0.00088) [2022-07-09 07:23:15,466][25689] Fps is (10 sec: 5886.5, 60 sec: 5761.0, 300 sec: 5740.6). Total num frames: 149851136. Throughput: 0: 5905.1. Samples: 149853720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:15,467][25689] Avg episode reward: [(0, '-56.145')] [2022-07-09 07:23:15,735][26022] Updated weights on worker 0-0, policy_version 146341 (0.00086) [2022-07-09 07:23:17,568][26022] Updated weights on worker 0-0, policy_version 146351 (0.00087) [2022-07-09 07:23:19,038][26022] Updated weights on worker 0-0, policy_version 146361 (0.00317) [2022-07-09 07:23:20,527][25689] Fps is (10 sec: 5756.3, 60 sec: 5724.1, 300 sec: 5737.1). Total num frames: 149879808. Throughput: 0: 5912.9. Samples: 149888856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:20,528][25689] Avg episode reward: [(0, '-55.789')] [2022-07-09 07:23:20,943][26022] Updated weights on worker 0-0, policy_version 146371 (0.00089) [2022-07-09 07:23:22,703][26022] Updated weights on worker 0-0, policy_version 146381 (0.00084) [2022-07-09 07:23:24,463][26022] Updated weights on worker 0-0, policy_version 146391 (0.00091) [2022-07-09 07:23:25,596][25689] Fps is (10 sec: 5863.1, 60 sec: 5740.0, 300 sec: 5743.1). Total num frames: 149910528. Throughput: 0: 5202.5. Samples: 149906764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:25,597][25689] Avg episode reward: [(0, '-56.412')] [2022-07-09 07:23:26,146][26022] Updated weights on worker 0-0, policy_version 146401 (0.00093) [2022-07-09 07:23:28,119][26022] Updated weights on worker 0-0, policy_version 146411 (0.00090) [2022-07-09 07:23:29,752][26022] Updated weights on worker 0-0, policy_version 146421 (0.00103) [2022-07-09 07:23:30,603][25689] Fps is (10 sec: 5792.9, 60 sec: 5727.9, 300 sec: 5739.5). Total num frames: 149938176. Throughput: 0: 6075.4. Samples: 149941528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:30,603][25689] Avg episode reward: [(0, '-56.087')] [2022-07-09 07:23:31,552][26022] Updated weights on worker 0-0, policy_version 146431 (0.00091) [2022-07-09 07:23:33,514][26022] Updated weights on worker 0-0, policy_version 146441 (0.00087) [2022-07-09 07:23:35,051][26022] Updated weights on worker 0-0, policy_version 146451 (0.00098) [2022-07-09 07:23:35,623][25689] Fps is (10 sec: 5719.3, 60 sec: 5747.7, 300 sec: 5742.7). Total num frames: 149967872. Throughput: 0: 6048.4. Samples: 149975632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:35,623][25689] Avg episode reward: [(0, '-56.152')] [2022-07-09 07:23:36,995][26022] Updated weights on worker 0-0, policy_version 146461 (0.00082) [2022-07-09 07:23:38,892][26022] Updated weights on worker 0-0, policy_version 146471 (0.00070) [2022-07-09 07:23:40,539][26022] Updated weights on worker 0-0, policy_version 146481 (0.00071) [2022-07-09 07:23:40,632][25689] Fps is (10 sec: 5819.5, 60 sec: 5733.8, 300 sec: 5745.3). Total num frames: 149996544. Throughput: 0: 5161.4. Samples: 149992630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:40,633][25689] Avg episode reward: [(0, '-56.961')] [2022-07-09 07:23:42,432][26022] Updated weights on worker 0-0, policy_version 146491 (0.00087) [2022-07-09 07:23:44,112][26022] Updated weights on worker 0-0, policy_version 146501 (0.00078) [2022-07-09 07:23:45,761][25689] Fps is (10 sec: 5757.1, 60 sec: 5763.1, 300 sec: 5743.5). Total num frames: 150026240. Throughput: 0: 5982.5. Samples: 150027400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:45,761][25689] Avg episode reward: [(0, '-56.979')] [2022-07-09 07:23:45,982][26022] Updated weights on worker 0-0, policy_version 146511 (0.00085) [2022-07-09 07:23:47,682][26022] Updated weights on worker 0-0, policy_version 146521 (0.00084) [2022-07-09 07:23:49,483][26022] Updated weights on worker 0-0, policy_version 146531 (0.00084) [2022-07-09 07:23:50,780][25689] Fps is (10 sec: 5853.1, 60 sec: 5762.7, 300 sec: 5743.2). Total num frames: 150055936. Throughput: 0: 5987.9. Samples: 150062346. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:50,780][25689] Avg episode reward: [(0, '-56.918')] [2022-07-09 07:23:51,286][26022] Updated weights on worker 0-0, policy_version 146541 (0.00547) [2022-07-09 07:23:53,007][26022] Updated weights on worker 0-0, policy_version 146551 (0.00083) [2022-07-09 07:23:54,923][26022] Updated weights on worker 0-0, policy_version 146561 (0.00083) [2022-07-09 07:23:55,796][25689] Fps is (10 sec: 5714.2, 60 sec: 5716.3, 300 sec: 5743.3). Total num frames: 150083584. Throughput: 0: 5128.6. Samples: 150079096. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:23:55,798][25689] Avg episode reward: [(0, '-56.338')] [2022-07-09 07:23:56,485][26022] Updated weights on worker 0-0, policy_version 146571 (0.00084) [2022-07-09 07:23:58,255][26022] Updated weights on worker 0-0, policy_version 146581 (0.00085) [2022-07-09 07:24:00,023][26022] Updated weights on worker 0-0, policy_version 146591 (0.00085) [2022-07-09 07:24:00,808][25689] Fps is (10 sec: 5718.1, 60 sec: 5749.9, 300 sec: 5748.3). Total num frames: 150113280. Throughput: 0: 6017.8. Samples: 150114044. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:24:00,810][25689] Avg episode reward: [(0, '-57.289')] [2022-07-09 07:24:02,028][26022] Updated weights on worker 0-0, policy_version 146601 (0.00084) [2022-07-09 07:24:03,944][26022] Updated weights on worker 0-0, policy_version 146611 (0.00086) [2022-07-09 07:24:05,707][26022] Updated weights on worker 0-0, policy_version 146621 (0.00089) [2022-07-09 07:24:05,887][25689] Fps is (10 sec: 5682.8, 60 sec: 5770.7, 300 sec: 5750.5). Total num frames: 150140928. Throughput: 0: 5944.2. Samples: 150147034. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:24:05,888][25689] Avg episode reward: [(0, '-56.797')] [2022-07-09 07:24:07,370][26022] Updated weights on worker 0-0, policy_version 146631 (0.00079) [2022-07-09 07:24:09,402][26022] Updated weights on worker 0-0, policy_version 146641 (0.00083) [2022-07-09 07:24:10,911][25689] Fps is (10 sec: 5675.9, 60 sec: 5769.3, 300 sec: 5750.2). Total num frames: 150170624. Throughput: 0: 5065.8. Samples: 150164328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:24:10,912][25689] Avg episode reward: [(0, '-56.206')] [2022-07-09 07:24:10,923][26022] Updated weights on worker 0-0, policy_version 146651 (0.00081) [2022-07-09 07:24:12,976][26022] Updated weights on worker 0-0, policy_version 146661 (0.00081) [2022-07-09 07:24:14,468][26022] Updated weights on worker 0-0, policy_version 146671 (0.00105) [2022-07-09 07:24:15,991][25689] Fps is (10 sec: 5574.2, 60 sec: 5718.6, 300 sec: 5735.3). Total num frames: 150197248. Throughput: 0: 5938.6. Samples: 150199022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:24:15,993][25689] Avg episode reward: [(0, '-56.684')] [2022-07-09 07:24:16,443][26022] Updated weights on worker 0-0, policy_version 146681 (0.00085) [2022-07-09 07:24:18,035][26022] Updated weights on worker 0-0, policy_version 146691 (0.00089) [2022-07-09 07:24:19,812][26022] Updated weights on worker 0-0, policy_version 146701 (0.00085) [2022-07-09 07:24:21,018][25689] Fps is (10 sec: 5673.9, 60 sec: 5755.6, 300 sec: 5743.7). Total num frames: 150227968. Throughput: 0: 5940.6. Samples: 150234102. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:21,018][25689] Avg episode reward: [(0, '-56.879')] [2022-07-09 07:24:21,563][26022] Updated weights on worker 0-0, policy_version 146711 (0.00088) [2022-07-09 07:24:23,467][26022] Updated weights on worker 0-0, policy_version 146721 (0.00084) [2022-07-09 07:24:25,182][26022] Updated weights on worker 0-0, policy_version 146731 (0.00086) [2022-07-09 07:24:26,149][25689] Fps is (10 sec: 6048.5, 60 sec: 5749.7, 300 sec: 5741.2). Total num frames: 150258688. Throughput: 0: 5153.9. Samples: 150251460. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:26,149][25689] Avg episode reward: [(0, '-56.856')] [2022-07-09 07:24:26,949][26022] Updated weights on worker 0-0, policy_version 146741 (0.00094) [2022-07-09 07:24:28,548][26022] Updated weights on worker 0-0, policy_version 146751 (0.00094) [2022-07-09 07:24:30,455][26022] Updated weights on worker 0-0, policy_version 146761 (0.00088) [2022-07-09 07:24:31,165][25689] Fps is (10 sec: 5853.3, 60 sec: 5765.7, 300 sec: 5748.1). Total num frames: 150287360. Throughput: 0: 6025.4. Samples: 150286362. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:31,165][25689] Avg episode reward: [(0, '-57.200')] [2022-07-09 07:24:32,082][26022] Updated weights on worker 0-0, policy_version 146771 (0.00088) [2022-07-09 07:24:33,994][26022] Updated weights on worker 0-0, policy_version 146781 (0.00086) [2022-07-09 07:24:35,905][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:24:35,917][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000146791_150313984.pth [2022-07-09 07:24:35,918][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000144771_148245504.pth [2022-07-09 07:24:35,919][26022] Updated weights on worker 0-0, policy_version 146791 (0.00079) [2022-07-09 07:24:36,197][25689] Fps is (10 sec: 5605.1, 60 sec: 5730.8, 300 sec: 5740.8). Total num frames: 150315008. Throughput: 0: 6057.8. Samples: 150321424. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:36,197][25689] Avg episode reward: [(0, '-58.121')] [2022-07-09 07:24:37,219][26022] Updated weights on worker 0-0, policy_version 146801 (0.00094) [2022-07-09 07:24:39,326][26022] Updated weights on worker 0-0, policy_version 146811 (0.00052) [2022-07-09 07:24:40,850][26022] Updated weights on worker 0-0, policy_version 146821 (0.00089) [2022-07-09 07:24:41,229][25689] Fps is (10 sec: 5697.6, 60 sec: 5745.5, 300 sec: 5744.2). Total num frames: 150344704. Throughput: 0: 5189.2. Samples: 150338982. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:41,230][25689] Avg episode reward: [(0, '-57.738')] [2022-07-09 07:24:42,875][26022] Updated weights on worker 0-0, policy_version 146831 (0.00092) [2022-07-09 07:24:44,571][26022] Updated weights on worker 0-0, policy_version 146841 (0.00098) [2022-07-09 07:24:46,269][25689] Fps is (10 sec: 5998.4, 60 sec: 5770.9, 300 sec: 5747.0). Total num frames: 150375424. Throughput: 0: 6079.4. Samples: 150373778. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:46,269][25689] Avg episode reward: [(0, '-57.989')] [2022-07-09 07:24:46,274][26022] Updated weights on worker 0-0, policy_version 146851 (0.00086) [2022-07-09 07:24:48,101][26022] Updated weights on worker 0-0, policy_version 146861 (0.00090) [2022-07-09 07:24:49,997][26022] Updated weights on worker 0-0, policy_version 146871 (0.00082) [2022-07-09 07:24:51,303][25689] Fps is (10 sec: 5895.5, 60 sec: 5752.5, 300 sec: 5743.0). Total num frames: 150404096. Throughput: 0: 6065.6. Samples: 150408514. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:51,304][25689] Avg episode reward: [(0, '-57.648')] [2022-07-09 07:24:51,494][26022] Updated weights on worker 0-0, policy_version 146881 (0.00080) [2022-07-09 07:24:53,368][26022] Updated weights on worker 0-0, policy_version 146891 (0.00081) [2022-07-09 07:24:55,099][26022] Updated weights on worker 0-0, policy_version 146901 (0.00088) [2022-07-09 07:24:56,335][25689] Fps is (10 sec: 5595.0, 60 sec: 5751.1, 300 sec: 5739.2). Total num frames: 150431744. Throughput: 0: 5195.6. Samples: 150426054. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:24:56,336][25689] Avg episode reward: [(0, '-57.481')] [2022-07-09 07:24:56,810][26022] Updated weights on worker 0-0, policy_version 146911 (0.00088) [2022-07-09 07:24:58,663][26022] Updated weights on worker 0-0, policy_version 146921 (0.00086) [2022-07-09 07:25:00,210][26022] Updated weights on worker 0-0, policy_version 146931 (0.00087) [2022-07-09 07:25:01,413][25689] Fps is (10 sec: 5570.9, 60 sec: 5727.9, 300 sec: 5750.3). Total num frames: 150460416. Throughput: 0: 6031.4. Samples: 150460718. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:01,413][25689] Avg episode reward: [(0, '-56.473')] [2022-07-09 07:25:02,695][26022] Updated weights on worker 0-0, policy_version 146941 (0.00087) [2022-07-09 07:25:04,429][26022] Updated weights on worker 0-0, policy_version 146951 (0.00089) [2022-07-09 07:25:06,203][26022] Updated weights on worker 0-0, policy_version 146961 (0.00088) [2022-07-09 07:25:06,617][25689] Fps is (10 sec: 5776.3, 60 sec: 5766.7, 300 sec: 5750.4). Total num frames: 150491136. Throughput: 0: 5856.6. Samples: 150492950. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:06,617][25689] Avg episode reward: [(0, '-56.312')] [2022-07-09 07:25:08,124][26022] Updated weights on worker 0-0, policy_version 146971 (0.00260) [2022-07-09 07:25:09,720][26022] Updated weights on worker 0-0, policy_version 146981 (0.00090) [2022-07-09 07:25:11,663][25689] Fps is (10 sec: 5594.6, 60 sec: 5714.0, 300 sec: 5744.5). Total num frames: 150517760. Throughput: 0: 5830.2. Samples: 150527214. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:11,663][25689] Avg episode reward: [(0, '-55.350')] [2022-07-09 07:25:11,708][26022] Updated weights on worker 0-0, policy_version 146991 (0.00084) [2022-07-09 07:25:13,393][26022] Updated weights on worker 0-0, policy_version 147001 (0.00088) [2022-07-09 07:25:15,166][26022] Updated weights on worker 0-0, policy_version 147011 (0.00090) [2022-07-09 07:25:16,671][25689] Fps is (10 sec: 5601.9, 60 sec: 5771.4, 300 sec: 5741.3). Total num frames: 150547456. Throughput: 0: 5826.3. Samples: 150544536. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:16,671][25689] Avg episode reward: [(0, '-56.552')] [2022-07-09 07:25:16,951][26022] Updated weights on worker 0-0, policy_version 147021 (0.00085) [2022-07-09 07:25:18,787][26022] Updated weights on worker 0-0, policy_version 147031 (0.00086) [2022-07-09 07:25:20,374][26022] Updated weights on worker 0-0, policy_version 147041 (0.00088) [2022-07-09 07:25:21,676][25689] Fps is (10 sec: 5829.0, 60 sec: 5739.7, 300 sec: 5746.2). Total num frames: 150576128. Throughput: 0: 5847.8. Samples: 150579214. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:21,677][25689] Avg episode reward: [(0, '-57.406')] [2022-07-09 07:25:22,275][26022] Updated weights on worker 0-0, policy_version 147051 (0.00086) [2022-07-09 07:25:24,055][26022] Updated weights on worker 0-0, policy_version 147061 (0.00086) [2022-07-09 07:25:25,831][26022] Updated weights on worker 0-0, policy_version 147071 (0.00080) [2022-07-09 07:25:26,710][25689] Fps is (10 sec: 5814.2, 60 sec: 5732.0, 300 sec: 5749.3). Total num frames: 150605824. Throughput: 0: 6029.3. Samples: 150614094. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:26,711][25689] Avg episode reward: [(0, '-57.502')] [2022-07-09 07:25:27,617][26022] Updated weights on worker 0-0, policy_version 147081 (0.00090) [2022-07-09 07:25:29,266][26022] Updated weights on worker 0-0, policy_version 147091 (0.00050) [2022-07-09 07:25:31,103][26022] Updated weights on worker 0-0, policy_version 147101 (0.00086) [2022-07-09 07:25:31,733][25689] Fps is (10 sec: 5905.5, 60 sec: 5748.2, 300 sec: 5752.4). Total num frames: 150635520. Throughput: 0: 5199.1. Samples: 150631562. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:31,734][25689] Avg episode reward: [(0, '-57.744')] [2022-07-09 07:25:32,951][26022] Updated weights on worker 0-0, policy_version 147111 (0.00482) [2022-07-09 07:25:34,409][26022] Updated weights on worker 0-0, policy_version 147121 (0.00086) [2022-07-09 07:25:36,319][26022] Updated weights on worker 0-0, policy_version 147131 (0.00641) [2022-07-09 07:25:36,755][25689] Fps is (10 sec: 5810.6, 60 sec: 5766.1, 300 sec: 5749.2). Total num frames: 150664192. Throughput: 0: 6068.8. Samples: 150666422. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:36,755][25689] Avg episode reward: [(0, '-57.818')] [2022-07-09 07:25:38,302][26022] Updated weights on worker 0-0, policy_version 147141 (0.00086) [2022-07-09 07:25:39,973][26022] Updated weights on worker 0-0, policy_version 147151 (0.00092) [2022-07-09 07:25:41,764][25689] Fps is (10 sec: 5614.9, 60 sec: 5734.5, 300 sec: 5747.0). Total num frames: 150691840. Throughput: 0: 6052.0. Samples: 150700784. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:41,764][25689] Avg episode reward: [(0, '-57.753')] [2022-07-09 07:25:42,051][26022] Updated weights on worker 0-0, policy_version 147161 (0.00088) [2022-07-09 07:25:43,487][26022] Updated weights on worker 0-0, policy_version 147171 (0.00085) [2022-07-09 07:25:45,386][26022] Updated weights on worker 0-0, policy_version 147181 (0.00087) [2022-07-09 07:25:46,878][25689] Fps is (10 sec: 5664.3, 60 sec: 5710.5, 300 sec: 5752.3). Total num frames: 150721536. Throughput: 0: 5149.1. Samples: 150717946. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:46,879][25689] Avg episode reward: [(0, '-58.017')] [2022-07-09 07:25:47,140][26022] Updated weights on worker 0-0, policy_version 147191 (0.00089) [2022-07-09 07:25:48,858][26022] Updated weights on worker 0-0, policy_version 147201 (0.00106) [2022-07-09 07:25:50,876][26022] Updated weights on worker 0-0, policy_version 147211 (0.00086) [2022-07-09 07:25:51,882][25689] Fps is (10 sec: 5768.7, 60 sec: 5713.5, 300 sec: 5745.6). Total num frames: 150750208. Throughput: 0: 5996.5. Samples: 150752382. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:51,882][25689] Avg episode reward: [(0, '-57.931')] [2022-07-09 07:25:52,527][26022] Updated weights on worker 0-0, policy_version 147221 (0.00088) [2022-07-09 07:25:54,300][26022] Updated weights on worker 0-0, policy_version 147231 (0.00094) [2022-07-09 07:25:56,004][26022] Updated weights on worker 0-0, policy_version 147241 (0.00094) [2022-07-09 07:25:56,902][25689] Fps is (10 sec: 5720.7, 60 sec: 5731.4, 300 sec: 5742.0). Total num frames: 150778880. Throughput: 0: 5979.0. Samples: 150786886. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:25:56,903][25689] Avg episode reward: [(0, '-59.222')] [2022-07-09 07:25:57,835][26022] Updated weights on worker 0-0, policy_version 147251 (0.00088) [2022-07-09 07:25:59,539][26022] Updated weights on worker 0-0, policy_version 147261 (0.00084) [2022-07-09 07:26:01,368][26022] Updated weights on worker 0-0, policy_version 147271 (0.00091) [2022-07-09 07:26:01,920][25689] Fps is (10 sec: 5712.5, 60 sec: 5737.1, 300 sec: 5749.9). Total num frames: 150807552. Throughput: 0: 5132.0. Samples: 150804228. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:26:01,921][25689] Avg episode reward: [(0, '-59.683')] [2022-07-09 07:26:03,604][26022] Updated weights on worker 0-0, policy_version 147281 (0.00089) [2022-07-09 07:26:05,446][26022] Updated weights on worker 0-0, policy_version 147291 (0.00085) [2022-07-09 07:26:07,011][25689] Fps is (10 sec: 5673.1, 60 sec: 5714.0, 300 sec: 5748.3). Total num frames: 150836224. Throughput: 0: 5902.2. Samples: 150836770. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:26:07,011][25689] Avg episode reward: [(0, '-59.207')] [2022-07-09 07:26:07,012][26022] Updated weights on worker 0-0, policy_version 147301 (0.00082) [2022-07-09 07:26:08,967][26022] Updated weights on worker 0-0, policy_version 147311 (0.00091) [2022-07-09 07:26:10,510][26022] Updated weights on worker 0-0, policy_version 147321 (0.00095) [2022-07-09 07:26:12,105][25689] Fps is (10 sec: 5529.9, 60 sec: 5726.4, 300 sec: 5746.8). Total num frames: 150863872. Throughput: 0: 5889.8. Samples: 150871492. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:26:12,105][25689] Avg episode reward: [(0, '-59.124')] [2022-07-09 07:26:12,501][26022] Updated weights on worker 0-0, policy_version 147331 (0.00091) [2022-07-09 07:26:14,136][26022] Updated weights on worker 0-0, policy_version 147341 (0.00762) [2022-07-09 07:26:15,851][26022] Updated weights on worker 0-0, policy_version 147351 (0.00093) [2022-07-09 07:26:17,174][25689] Fps is (10 sec: 5742.8, 60 sec: 5737.5, 300 sec: 5745.6). Total num frames: 150894592. Throughput: 0: 5045.0. Samples: 150889156. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:26:17,175][25689] Avg episode reward: [(0, '-58.667')] [2022-07-09 07:26:17,752][26022] Updated weights on worker 0-0, policy_version 147361 (0.00087) [2022-07-09 07:26:19,470][26022] Updated weights on worker 0-0, policy_version 147371 (0.00087) [2022-07-09 07:26:21,212][26022] Updated weights on worker 0-0, policy_version 147381 (0.00080) [2022-07-09 07:26:22,214][25689] Fps is (10 sec: 5875.2, 60 sec: 5734.2, 300 sec: 5742.5). Total num frames: 150923264. Throughput: 0: 5898.0. Samples: 150923920. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 07:26:22,214][25689] Avg episode reward: [(0, '-58.403')] [2022-07-09 07:26:23,031][26022] Updated weights on worker 0-0, policy_version 147391 (0.00088) [2022-07-09 07:26:24,560][26022] Updated weights on worker 0-0, policy_version 147401 (0.00361) [2022-07-09 07:26:26,521][26022] Updated weights on worker 0-0, policy_version 147411 (0.00088) [2022-07-09 07:26:27,361][25689] Fps is (10 sec: 5830.5, 60 sec: 5740.4, 300 sec: 5750.0). Total num frames: 150953984. Throughput: 0: 6010.4. Samples: 150959082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:27,361][25689] Avg episode reward: [(0, '-58.266')] [2022-07-09 07:26:28,152][26022] Updated weights on worker 0-0, policy_version 147421 (0.00086) [2022-07-09 07:26:29,875][26022] Updated weights on worker 0-0, policy_version 147431 (0.00083) [2022-07-09 07:26:31,842][26022] Updated weights on worker 0-0, policy_version 147441 (0.00082) [2022-07-09 07:26:32,363][25689] Fps is (10 sec: 5851.7, 60 sec: 5725.5, 300 sec: 5744.2). Total num frames: 150982656. Throughput: 0: 5188.2. Samples: 150976596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:32,364][25689] Avg episode reward: [(0, '-58.413')] [2022-07-09 07:26:33,250][26022] Updated weights on worker 0-0, policy_version 147451 (0.00115) [2022-07-09 07:26:35,432][26022] Updated weights on worker 0-0, policy_version 147461 (0.00078) [2022-07-09 07:26:36,068][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:26:36,077][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000147466_151005184.pth [2022-07-09 07:26:36,084][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000145443_148933632.pth [2022-07-09 07:26:37,059][26022] Updated weights on worker 0-0, policy_version 147471 (0.00088) [2022-07-09 07:26:37,422][25689] Fps is (10 sec: 5903.2, 60 sec: 5755.7, 300 sec: 5753.9). Total num frames: 151013376. Throughput: 0: 6046.9. Samples: 151011590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:37,422][25689] Avg episode reward: [(0, '-59.789')] [2022-07-09 07:26:38,783][26022] Updated weights on worker 0-0, policy_version 147481 (0.00091) [2022-07-09 07:26:40,572][26022] Updated weights on worker 0-0, policy_version 147491 (0.00086) [2022-07-09 07:26:42,466][25689] Fps is (10 sec: 5676.1, 60 sec: 5735.5, 300 sec: 5740.5). Total num frames: 151040000. Throughput: 0: 6026.5. Samples: 151045970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:42,467][25689] Avg episode reward: [(0, '-59.980')] [2022-07-09 07:26:42,536][26022] Updated weights on worker 0-0, policy_version 147501 (0.00079) [2022-07-09 07:26:44,076][26022] Updated weights on worker 0-0, policy_version 147511 (0.00089) [2022-07-09 07:26:45,967][26022] Updated weights on worker 0-0, policy_version 147521 (0.00094) [2022-07-09 07:26:47,512][25689] Fps is (10 sec: 5683.1, 60 sec: 5758.9, 300 sec: 5746.8). Total num frames: 151070720. Throughput: 0: 5176.2. Samples: 151063390. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:47,512][25689] Avg episode reward: [(0, '-59.728')] [2022-07-09 07:26:47,639][26022] Updated weights on worker 0-0, policy_version 147531 (0.00081) [2022-07-09 07:26:49,493][26022] Updated weights on worker 0-0, policy_version 147541 (0.00086) [2022-07-09 07:26:51,164][26022] Updated weights on worker 0-0, policy_version 147551 (0.00078) [2022-07-09 07:26:52,522][25689] Fps is (10 sec: 5804.4, 60 sec: 5741.4, 300 sec: 5740.3). Total num frames: 151098368. Throughput: 0: 6025.5. Samples: 151098062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:52,522][25689] Avg episode reward: [(0, '-59.610')] [2022-07-09 07:26:53,008][26022] Updated weights on worker 0-0, policy_version 147561 (0.00082) [2022-07-09 07:26:54,763][26022] Updated weights on worker 0-0, policy_version 147571 (0.00088) [2022-07-09 07:26:56,498][26022] Updated weights on worker 0-0, policy_version 147581 (0.00099) [2022-07-09 07:26:57,531][25689] Fps is (10 sec: 5723.7, 60 sec: 5759.4, 300 sec: 5744.2). Total num frames: 151128064. Throughput: 0: 6027.4. Samples: 151132794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:26:57,531][25689] Avg episode reward: [(0, '-59.734')] [2022-07-09 07:26:58,403][26022] Updated weights on worker 0-0, policy_version 147591 (0.00089) [2022-07-09 07:27:00,128][26022] Updated weights on worker 0-0, policy_version 147601 (0.00097) [2022-07-09 07:27:02,023][26022] Updated weights on worker 0-0, policy_version 147611 (0.00079) [2022-07-09 07:27:02,570][25689] Fps is (10 sec: 5706.6, 60 sec: 5740.4, 300 sec: 5742.7). Total num frames: 151155712. Throughput: 0: 5188.5. Samples: 151150282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:02,571][25689] Avg episode reward: [(0, '-58.520')] [2022-07-09 07:27:03,868][26022] Updated weights on worker 0-0, policy_version 147621 (0.00086) [2022-07-09 07:27:05,780][26022] Updated weights on worker 0-0, policy_version 147631 (0.00086) [2022-07-09 07:27:07,430][26022] Updated weights on worker 0-0, policy_version 147641 (0.00087) [2022-07-09 07:27:07,695][25689] Fps is (10 sec: 5641.8, 60 sec: 5754.1, 300 sec: 5744.7). Total num frames: 151185408. Throughput: 0: 5935.4. Samples: 151183184. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:07,696][25689] Avg episode reward: [(0, '-58.364')] [2022-07-09 07:27:09,367][26022] Updated weights on worker 0-0, policy_version 147651 (0.00088) [2022-07-09 07:27:10,962][26022] Updated weights on worker 0-0, policy_version 147661 (0.00086) [2022-07-09 07:27:12,779][25689] Fps is (10 sec: 5617.4, 60 sec: 5755.1, 300 sec: 5743.4). Total num frames: 151213056. Throughput: 0: 5905.1. Samples: 151217682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:12,780][25689] Avg episode reward: [(0, '-58.291')] [2022-07-09 07:27:13,108][26022] Updated weights on worker 0-0, policy_version 147671 (0.00090) [2022-07-09 07:27:14,487][26022] Updated weights on worker 0-0, policy_version 147681 (0.00087) [2022-07-09 07:27:16,591][26022] Updated weights on worker 0-0, policy_version 147691 (0.00078) [2022-07-09 07:27:17,796][25689] Fps is (10 sec: 5879.7, 60 sec: 5776.9, 300 sec: 5747.1). Total num frames: 151244800. Throughput: 0: 5909.1. Samples: 151252544. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:17,797][25689] Avg episode reward: [(0, '-58.076')] [2022-07-09 07:27:18,103][26022] Updated weights on worker 0-0, policy_version 147701 (0.00095) [2022-07-09 07:27:19,954][26022] Updated weights on worker 0-0, policy_version 147711 (0.00089) [2022-07-09 07:27:21,602][26022] Updated weights on worker 0-0, policy_version 147721 (0.00117) [2022-07-09 07:27:22,823][25689] Fps is (10 sec: 5811.2, 60 sec: 5744.4, 300 sec: 5737.4). Total num frames: 151271424. Throughput: 0: 5907.1. Samples: 151269914. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:22,823][25689] Avg episode reward: [(0, '-57.387')] [2022-07-09 07:27:23,436][26022] Updated weights on worker 0-0, policy_version 147731 (0.00090) [2022-07-09 07:27:25,203][26022] Updated weights on worker 0-0, policy_version 147741 (0.00087) [2022-07-09 07:27:27,032][26022] Updated weights on worker 0-0, policy_version 147751 (0.00092) [2022-07-09 07:27:27,878][25689] Fps is (10 sec: 5586.3, 60 sec: 5736.2, 300 sec: 5740.8). Total num frames: 151301120. Throughput: 0: 6013.6. Samples: 151304556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:27,878][25689] Avg episode reward: [(0, '-58.085')] [2022-07-09 07:27:28,641][26022] Updated weights on worker 0-0, policy_version 147761 (0.00388) [2022-07-09 07:27:30,507][26022] Updated weights on worker 0-0, policy_version 147771 (0.00088) [2022-07-09 07:27:32,167][26022] Updated weights on worker 0-0, policy_version 147781 (0.00082) [2022-07-09 07:27:32,887][25689] Fps is (10 sec: 5901.3, 60 sec: 5752.4, 300 sec: 5745.1). Total num frames: 151330816. Throughput: 0: 6042.3. Samples: 151339182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:32,887][25689] Avg episode reward: [(0, '-58.519')] [2022-07-09 07:27:34,117][26022] Updated weights on worker 0-0, policy_version 147791 (0.00087) [2022-07-09 07:27:35,983][26022] Updated weights on worker 0-0, policy_version 147801 (0.00084) [2022-07-09 07:27:37,525][26022] Updated weights on worker 0-0, policy_version 147811 (0.00088) [2022-07-09 07:27:37,903][25689] Fps is (10 sec: 5822.3, 60 sec: 5722.7, 300 sec: 5742.1). Total num frames: 151359488. Throughput: 0: 5177.2. Samples: 151356640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:37,903][25689] Avg episode reward: [(0, '-59.480')] [2022-07-09 07:27:39,607][26022] Updated weights on worker 0-0, policy_version 147821 (0.00089) [2022-07-09 07:27:41,193][26022] Updated weights on worker 0-0, policy_version 147831 (0.00086) [2022-07-09 07:27:42,935][25689] Fps is (10 sec: 5605.2, 60 sec: 5740.7, 300 sec: 5743.1). Total num frames: 151387136. Throughput: 0: 6041.3. Samples: 151391416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:42,936][25689] Avg episode reward: [(0, '-59.829')] [2022-07-09 07:27:43,165][26022] Updated weights on worker 0-0, policy_version 147841 (0.00084) [2022-07-09 07:27:44,692][26022] Updated weights on worker 0-0, policy_version 147851 (0.00083) [2022-07-09 07:27:46,380][26022] Updated weights on worker 0-0, policy_version 147861 (0.00082) [2022-07-09 07:27:47,990][25689] Fps is (10 sec: 5786.0, 60 sec: 5739.8, 300 sec: 5745.7). Total num frames: 151417856. Throughput: 0: 6046.2. Samples: 151426162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:47,991][25689] Avg episode reward: [(0, '-60.138')] [2022-07-09 07:27:48,264][26022] Updated weights on worker 0-0, policy_version 147871 (0.00086) [2022-07-09 07:27:50,094][26022] Updated weights on worker 0-0, policy_version 147881 (0.00081) [2022-07-09 07:27:51,979][26022] Updated weights on worker 0-0, policy_version 147891 (0.00092) [2022-07-09 07:27:52,998][25689] Fps is (10 sec: 5901.6, 60 sec: 5756.9, 300 sec: 5739.9). Total num frames: 151446528. Throughput: 0: 5178.1. Samples: 151443322. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:52,999][25689] Avg episode reward: [(0, '-60.076')] [2022-07-09 07:27:53,724][26022] Updated weights on worker 0-0, policy_version 147901 (0.00086) [2022-07-09 07:27:55,365][26022] Updated weights on worker 0-0, policy_version 147911 (0.00083) [2022-07-09 07:27:57,178][26022] Updated weights on worker 0-0, policy_version 147921 (0.00086) [2022-07-09 07:27:58,007][25689] Fps is (10 sec: 5725.1, 60 sec: 5740.1, 300 sec: 5743.4). Total num frames: 151475200. Throughput: 0: 6033.7. Samples: 151477942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:27:58,007][25689] Avg episode reward: [(0, '-59.400')] [2022-07-09 07:27:58,974][26022] Updated weights on worker 0-0, policy_version 147931 (0.00084) [2022-07-09 07:28:00,867][26022] Updated weights on worker 0-0, policy_version 147941 (0.00092) [2022-07-09 07:28:02,872][26022] Updated weights on worker 0-0, policy_version 147951 (0.00086) [2022-07-09 07:28:03,034][25689] Fps is (10 sec: 5612.3, 60 sec: 5741.3, 300 sec: 5748.6). Total num frames: 151502848. Throughput: 0: 5912.2. Samples: 151510244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:28:03,034][25689] Avg episode reward: [(0, '-58.737')] [2022-07-09 07:28:04,578][26022] Updated weights on worker 0-0, policy_version 147961 (0.00090) [2022-07-09 07:28:06,464][26022] Updated weights on worker 0-0, policy_version 147971 (0.00082) [2022-07-09 07:28:08,149][25689] Fps is (10 sec: 5552.9, 60 sec: 5725.2, 300 sec: 5743.1). Total num frames: 151531520. Throughput: 0: 5027.8. Samples: 151527516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:28:08,150][25689] Avg episode reward: [(0, '-57.943')] [2022-07-09 07:28:08,236][26022] Updated weights on worker 0-0, policy_version 147981 (0.00086) [2022-07-09 07:28:10,107][26022] Updated weights on worker 0-0, policy_version 147991 (0.00096) [2022-07-09 07:28:11,721][26022] Updated weights on worker 0-0, policy_version 148001 (0.00085) [2022-07-09 07:28:13,247][25689] Fps is (10 sec: 5614.7, 60 sec: 5740.8, 300 sec: 5739.3). Total num frames: 151560192. Throughput: 0: 5873.7. Samples: 151562256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:28:13,248][25689] Avg episode reward: [(0, '-58.645')] [2022-07-09 07:28:13,485][26022] Updated weights on worker 0-0, policy_version 148011 (0.00085) [2022-07-09 07:28:15,383][26022] Updated weights on worker 0-0, policy_version 148021 (0.00090) [2022-07-09 07:28:17,167][26022] Updated weights on worker 0-0, policy_version 148031 (0.00084) [2022-07-09 07:28:18,277][25689] Fps is (10 sec: 5763.3, 60 sec: 5705.8, 300 sec: 5743.3). Total num frames: 151589888. Throughput: 0: 5869.0. Samples: 151596906. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:28:18,278][25689] Avg episode reward: [(0, '-58.296')] [2022-07-09 07:28:18,853][26022] Updated weights on worker 0-0, policy_version 148041 (0.00451) [2022-07-09 07:28:20,677][26022] Updated weights on worker 0-0, policy_version 148051 (0.00056) [2022-07-09 07:28:22,390][26022] Updated weights on worker 0-0, policy_version 148061 (0.00086) [2022-07-09 07:28:23,316][25689] Fps is (10 sec: 5694.8, 60 sec: 5721.5, 300 sec: 5733.6). Total num frames: 151617536. Throughput: 0: 5126.3. Samples: 151614220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:28:23,317][25689] Avg episode reward: [(0, '-59.006')] [2022-07-09 07:28:24,269][26022] Updated weights on worker 0-0, policy_version 148071 (0.00087) [2022-07-09 07:28:26,061][26022] Updated weights on worker 0-0, policy_version 148081 (0.00082) [2022-07-09 07:28:27,763][26022] Updated weights on worker 0-0, policy_version 148091 (0.00095) [2022-07-09 07:28:28,452][25689] Fps is (10 sec: 5736.4, 60 sec: 5730.8, 300 sec: 5741.4). Total num frames: 151648256. Throughput: 0: 5976.4. Samples: 151648852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 07:28:28,452][25689] Avg episode reward: [(0, '-58.760')] [2022-07-09 07:28:29,494][26022] Updated weights on worker 0-0, policy_version 148101 (0.00087) [2022-07-09 07:28:31,463][26022] Updated weights on worker 0-0, policy_version 148111 (0.00091) [2022-07-09 07:28:33,078][26022] Updated weights on worker 0-0, policy_version 148121 (0.00093) [2022-07-09 07:28:33,473][25689] Fps is (10 sec: 5948.4, 60 sec: 5729.6, 300 sec: 5741.4). Total num frames: 151677952. Throughput: 0: 6013.9. Samples: 151683896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:28:33,474][25689] Avg episode reward: [(0, '-58.929')] [2022-07-09 07:28:34,883][26022] Updated weights on worker 0-0, policy_version 148131 (0.00079) [2022-07-09 07:28:36,299][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:28:36,310][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000148139_151694336.pth [2022-07-09 07:28:36,319][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000146118_149624832.pth [2022-07-09 07:28:36,583][26022] Updated weights on worker 0-0, policy_version 148141 (0.00092) [2022-07-09 07:28:38,315][26022] Updated weights on worker 0-0, policy_version 148151 (0.00087) [2022-07-09 07:28:38,495][25689] Fps is (10 sec: 5913.8, 60 sec: 5746.0, 300 sec: 5744.6). Total num frames: 151707648. Throughput: 0: 5172.0. Samples: 151701474. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:28:38,495][25689] Avg episode reward: [(0, '-58.755')] [2022-07-09 07:28:40,263][26022] Updated weights on worker 0-0, policy_version 148161 (0.00086) [2022-07-09 07:28:41,839][26022] Updated weights on worker 0-0, policy_version 148171 (0.00107) [2022-07-09 07:28:43,501][25689] Fps is (10 sec: 5718.2, 60 sec: 5748.4, 300 sec: 5740.1). Total num frames: 151735296. Throughput: 0: 6060.2. Samples: 151736546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:28:43,503][25689] Avg episode reward: [(0, '-58.205')] [2022-07-09 07:28:43,708][26022] Updated weights on worker 0-0, policy_version 148181 (0.00079) [2022-07-09 07:28:45,475][26022] Updated weights on worker 0-0, policy_version 148191 (0.00079) [2022-07-09 07:28:47,207][26022] Updated weights on worker 0-0, policy_version 148201 (0.00084) [2022-07-09 07:28:48,636][25689] Fps is (10 sec: 5654.5, 60 sec: 5724.0, 300 sec: 5737.9). Total num frames: 151764992. Throughput: 0: 6055.9. Samples: 151771086. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:28:48,637][25689] Avg episode reward: [(0, '-57.961')] [2022-07-09 07:28:48,984][26022] Updated weights on worker 0-0, policy_version 148211 (0.00082) [2022-07-09 07:28:50,751][26022] Updated weights on worker 0-0, policy_version 148221 (0.00102) [2022-07-09 07:28:52,568][26022] Updated weights on worker 0-0, policy_version 148231 (0.00085) [2022-07-09 07:28:53,664][25689] Fps is (10 sec: 5844.0, 60 sec: 5739.0, 300 sec: 5744.5). Total num frames: 151794688. Throughput: 0: 5169.3. Samples: 151788270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:28:53,665][25689] Avg episode reward: [(0, '-57.466')] [2022-07-09 07:28:54,551][26022] Updated weights on worker 0-0, policy_version 148241 (0.00056) [2022-07-09 07:28:56,040][26022] Updated weights on worker 0-0, policy_version 148251 (0.00093) [2022-07-09 07:28:58,073][26022] Updated weights on worker 0-0, policy_version 148261 (0.00084) [2022-07-09 07:28:58,671][25689] Fps is (10 sec: 5714.2, 60 sec: 5722.2, 300 sec: 5737.7). Total num frames: 151822336. Throughput: 0: 6005.8. Samples: 151822652. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:28:58,672][25689] Avg episode reward: [(0, '-58.011')] [2022-07-09 07:28:59,392][26022] Updated weights on worker 0-0, policy_version 148271 (0.00093) [2022-07-09 07:29:01,571][26022] Updated weights on worker 0-0, policy_version 148281 (0.00513) [2022-07-09 07:29:03,518][26022] Updated weights on worker 0-0, policy_version 148291 (0.00087) [2022-07-09 07:29:03,690][25689] Fps is (10 sec: 5617.8, 60 sec: 5739.9, 300 sec: 5742.3). Total num frames: 151851008. Throughput: 0: 5894.5. Samples: 151855546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:03,692][25689] Avg episode reward: [(0, '-57.837')] [2022-07-09 07:29:05,515][26022] Updated weights on worker 0-0, policy_version 148301 (0.00087) [2022-07-09 07:29:07,035][26022] Updated weights on worker 0-0, policy_version 148311 (0.00090) [2022-07-09 07:29:08,778][25689] Fps is (10 sec: 5674.1, 60 sec: 5742.5, 300 sec: 5737.7). Total num frames: 151879680. Throughput: 0: 5053.2. Samples: 151872866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:08,778][25689] Avg episode reward: [(0, '-58.610')] [2022-07-09 07:29:08,916][26022] Updated weights on worker 0-0, policy_version 148321 (0.00088) [2022-07-09 07:29:10,558][26022] Updated weights on worker 0-0, policy_version 148331 (0.00086) [2022-07-09 07:29:12,475][26022] Updated weights on worker 0-0, policy_version 148341 (0.00088) [2022-07-09 07:29:13,830][25689] Fps is (10 sec: 5655.1, 60 sec: 5746.8, 300 sec: 5745.1). Total num frames: 151908352. Throughput: 0: 5915.1. Samples: 151907552. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:13,831][25689] Avg episode reward: [(0, '-58.744')] [2022-07-09 07:29:14,136][26022] Updated weights on worker 0-0, policy_version 148351 (0.00090) [2022-07-09 07:29:15,955][26022] Updated weights on worker 0-0, policy_version 148361 (0.00085) [2022-07-09 07:29:17,921][26022] Updated weights on worker 0-0, policy_version 148371 (0.00087) [2022-07-09 07:29:18,881][25689] Fps is (10 sec: 5777.1, 60 sec: 5744.8, 300 sec: 5741.2). Total num frames: 151938048. Throughput: 0: 5904.1. Samples: 151941972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:18,882][25689] Avg episode reward: [(0, '-59.004')] [2022-07-09 07:29:19,528][26022] Updated weights on worker 0-0, policy_version 148381 (0.00084) [2022-07-09 07:29:21,404][26022] Updated weights on worker 0-0, policy_version 148391 (0.00087) [2022-07-09 07:29:23,208][26022] Updated weights on worker 0-0, policy_version 148401 (0.00087) [2022-07-09 07:29:23,898][25689] Fps is (10 sec: 5695.5, 60 sec: 5747.0, 300 sec: 5733.0). Total num frames: 151965696. Throughput: 0: 5134.4. Samples: 151959304. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:23,900][25689] Avg episode reward: [(0, '-58.877')] [2022-07-09 07:29:25,008][26022] Updated weights on worker 0-0, policy_version 148411 (0.00087) [2022-07-09 07:29:26,776][26022] Updated weights on worker 0-0, policy_version 148421 (0.00092) [2022-07-09 07:29:28,375][26022] Updated weights on worker 0-0, policy_version 148431 (0.00087) [2022-07-09 07:29:29,035][25689] Fps is (10 sec: 5647.7, 60 sec: 5729.9, 300 sec: 5734.2). Total num frames: 151995392. Throughput: 0: 5974.8. Samples: 151993896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:29,035][25689] Avg episode reward: [(0, '-58.807')] [2022-07-09 07:29:30,215][26022] Updated weights on worker 0-0, policy_version 148441 (0.00088) [2022-07-09 07:29:32,222][26022] Updated weights on worker 0-0, policy_version 148451 (0.00086) [2022-07-09 07:29:33,540][26022] Updated weights on worker 0-0, policy_version 148461 (0.00617) [2022-07-09 07:29:34,131][25689] Fps is (10 sec: 5904.1, 60 sec: 5739.7, 300 sec: 5743.2). Total num frames: 152026112. Throughput: 0: 5979.2. Samples: 152028936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:34,132][25689] Avg episode reward: [(0, '-58.715')] [2022-07-09 07:29:35,503][26022] Updated weights on worker 0-0, policy_version 148471 (0.00078) [2022-07-09 07:29:37,140][26022] Updated weights on worker 0-0, policy_version 148481 (0.00279) [2022-07-09 07:29:39,060][26022] Updated weights on worker 0-0, policy_version 148491 (0.00083) [2022-07-09 07:29:39,185][25689] Fps is (10 sec: 5851.1, 60 sec: 5719.8, 300 sec: 5739.4). Total num frames: 152054784. Throughput: 0: 5996.4. Samples: 152063724. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:39,186][25689] Avg episode reward: [(0, '-58.531')] [2022-07-09 07:29:40,927][26022] Updated weights on worker 0-0, policy_version 148501 (0.00099) [2022-07-09 07:29:42,497][26022] Updated weights on worker 0-0, policy_version 148511 (0.00086) [2022-07-09 07:29:44,203][25689] Fps is (10 sec: 5591.9, 60 sec: 5718.8, 300 sec: 5729.5). Total num frames: 152082432. Throughput: 0: 5997.2. Samples: 152081074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:44,203][25689] Avg episode reward: [(0, '-58.029')] [2022-07-09 07:29:44,630][26022] Updated weights on worker 0-0, policy_version 148521 (0.00094) [2022-07-09 07:29:46,040][26022] Updated weights on worker 0-0, policy_version 148531 (0.00094) [2022-07-09 07:29:48,301][26022] Updated weights on worker 0-0, policy_version 148541 (0.00086) [2022-07-09 07:29:49,259][25689] Fps is (10 sec: 5895.9, 60 sec: 5760.0, 300 sec: 5739.4). Total num frames: 152114176. Throughput: 0: 6006.5. Samples: 152115372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:49,259][25689] Avg episode reward: [(0, '-57.759')] [2022-07-09 07:29:49,639][26022] Updated weights on worker 0-0, policy_version 148551 (0.00086) [2022-07-09 07:29:51,699][26022] Updated weights on worker 0-0, policy_version 148561 (0.00089) [2022-07-09 07:29:53,186][26022] Updated weights on worker 0-0, policy_version 148571 (0.00089) [2022-07-09 07:29:54,267][25689] Fps is (10 sec: 5697.6, 60 sec: 5694.2, 300 sec: 5733.0). Total num frames: 152139776. Throughput: 0: 6000.5. Samples: 152149764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:54,268][25689] Avg episode reward: [(0, '-58.427')] [2022-07-09 07:29:55,267][26022] Updated weights on worker 0-0, policy_version 148581 (0.00089) [2022-07-09 07:29:56,802][26022] Updated weights on worker 0-0, policy_version 148591 (0.00086) [2022-07-09 07:29:58,773][26022] Updated weights on worker 0-0, policy_version 148601 (0.00091) [2022-07-09 07:29:59,279][25689] Fps is (10 sec: 5722.7, 60 sec: 5761.4, 300 sec: 5744.5). Total num frames: 152171520. Throughput: 0: 5131.4. Samples: 152166834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:29:59,280][25689] Avg episode reward: [(0, '-58.153')] [2022-07-09 07:30:00,441][26022] Updated weights on worker 0-0, policy_version 148611 (0.00081) [2022-07-09 07:30:02,645][26022] Updated weights on worker 0-0, policy_version 148621 (0.00089) [2022-07-09 07:30:04,291][25689] Fps is (10 sec: 5823.2, 60 sec: 5728.2, 300 sec: 5734.5). Total num frames: 152198144. Throughput: 0: 5902.2. Samples: 152199638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:04,291][25689] Avg episode reward: [(0, '-57.980')] [2022-07-09 07:30:04,295][26022] Updated weights on worker 0-0, policy_version 148631 (0.00084) [2022-07-09 07:30:06,243][26022] Updated weights on worker 0-0, policy_version 148641 (0.00084) [2022-07-09 07:30:07,712][26022] Updated weights on worker 0-0, policy_version 148651 (0.00088) [2022-07-09 07:30:09,379][25689] Fps is (10 sec: 5373.7, 60 sec: 5711.3, 300 sec: 5737.1). Total num frames: 152225792. Throughput: 0: 5914.4. Samples: 152234370. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:09,379][25689] Avg episode reward: [(0, '-58.493')] [2022-07-09 07:30:09,815][26022] Updated weights on worker 0-0, policy_version 148661 (0.00086) [2022-07-09 07:30:11,303][26022] Updated weights on worker 0-0, policy_version 148671 (0.00086) [2022-07-09 07:30:13,173][26022] Updated weights on worker 0-0, policy_version 148681 (0.01001) [2022-07-09 07:30:14,428][25689] Fps is (10 sec: 5656.6, 60 sec: 5728.5, 300 sec: 5736.3). Total num frames: 152255488. Throughput: 0: 5062.6. Samples: 152251832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:14,429][25689] Avg episode reward: [(0, '-58.624')] [2022-07-09 07:30:15,041][26022] Updated weights on worker 0-0, policy_version 148691 (0.00088) [2022-07-09 07:30:16,791][26022] Updated weights on worker 0-0, policy_version 148701 (0.00082) [2022-07-09 07:30:18,603][26022] Updated weights on worker 0-0, policy_version 148711 (0.00093) [2022-07-09 07:30:19,436][25689] Fps is (10 sec: 5803.4, 60 sec: 5715.7, 300 sec: 5736.3). Total num frames: 152284160. Throughput: 0: 5913.5. Samples: 152286032. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:19,437][25689] Avg episode reward: [(0, '-58.349')] [2022-07-09 07:30:20,361][26022] Updated weights on worker 0-0, policy_version 148721 (0.00085) [2022-07-09 07:30:22,268][26022] Updated weights on worker 0-0, policy_version 148731 (0.00087) [2022-07-09 07:30:23,979][26022] Updated weights on worker 0-0, policy_version 148741 (0.00085) [2022-07-09 07:30:24,455][25689] Fps is (10 sec: 5821.5, 60 sec: 5749.3, 300 sec: 5736.6). Total num frames: 152313856. Throughput: 0: 5990.1. Samples: 152320422. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:24,455][25689] Avg episode reward: [(0, '-58.274')] [2022-07-09 07:30:25,739][26022] Updated weights on worker 0-0, policy_version 148751 (0.00098) [2022-07-09 07:30:27,452][26022] Updated weights on worker 0-0, policy_version 148761 (0.00084) [2022-07-09 07:30:29,489][25689] Fps is (10 sec: 5704.5, 60 sec: 5725.2, 300 sec: 5729.5). Total num frames: 152341504. Throughput: 0: 5135.2. Samples: 152337638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:29,489][25689] Avg episode reward: [(0, '-59.005')] [2022-07-09 07:30:29,491][26022] Updated weights on worker 0-0, policy_version 148771 (0.00092) [2022-07-09 07:30:31,017][26022] Updated weights on worker 0-0, policy_version 148781 (0.00085) [2022-07-09 07:30:32,902][26022] Updated weights on worker 0-0, policy_version 148791 (0.00096) [2022-07-09 07:30:34,519][25689] Fps is (10 sec: 5596.2, 60 sec: 5697.6, 300 sec: 5729.3). Total num frames: 152370176. Throughput: 0: 5988.9. Samples: 152372150. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 07:30:34,519][25689] Avg episode reward: [(0, '-58.278')] [2022-07-09 07:30:34,749][26022] Updated weights on worker 0-0, policy_version 148801 (0.00080) [2022-07-09 07:30:36,348][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:30:36,361][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000148811_152382464.pth [2022-07-09 07:30:36,362][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000146791_150313984.pth [2022-07-09 07:30:36,370][26022] Updated weights on worker 0-0, policy_version 148811 (0.00104) [2022-07-09 07:30:38,341][26022] Updated weights on worker 0-0, policy_version 148821 (0.00095) [2022-07-09 07:30:39,541][25689] Fps is (10 sec: 5806.1, 60 sec: 5717.5, 300 sec: 5735.9). Total num frames: 152399872. Throughput: 0: 6001.0. Samples: 152406684. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:30:39,542][25689] Avg episode reward: [(0, '-58.568')] [2022-07-09 07:30:40,002][26022] Updated weights on worker 0-0, policy_version 148831 (0.00086) [2022-07-09 07:30:41,899][26022] Updated weights on worker 0-0, policy_version 148841 (0.00081) [2022-07-09 07:30:43,639][26022] Updated weights on worker 0-0, policy_version 148851 (0.00089) [2022-07-09 07:30:44,551][25689] Fps is (10 sec: 5817.7, 60 sec: 5735.2, 300 sec: 5734.5). Total num frames: 152428544. Throughput: 0: 5173.5. Samples: 152424394. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:30:44,552][25689] Avg episode reward: [(0, '-58.468')] [2022-07-09 07:30:45,283][26022] Updated weights on worker 0-0, policy_version 148861 (0.00088) [2022-07-09 07:30:47,107][26022] Updated weights on worker 0-0, policy_version 148871 (0.00092) [2022-07-09 07:30:48,907][26022] Updated weights on worker 0-0, policy_version 148881 (0.00082) [2022-07-09 07:30:49,618][25689] Fps is (10 sec: 5792.4, 60 sec: 5700.3, 300 sec: 5736.7). Total num frames: 152458240. Throughput: 0: 6036.0. Samples: 152459140. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:30:49,619][25689] Avg episode reward: [(0, '-58.071')] [2022-07-09 07:30:50,635][26022] Updated weights on worker 0-0, policy_version 148891 (0.00089) [2022-07-09 07:30:52,483][26022] Updated weights on worker 0-0, policy_version 148901 (0.00093) [2022-07-09 07:30:54,121][26022] Updated weights on worker 0-0, policy_version 148911 (0.00086) [2022-07-09 07:30:54,623][25689] Fps is (10 sec: 5693.8, 60 sec: 5734.6, 300 sec: 5733.6). Total num frames: 152485888. Throughput: 0: 6047.2. Samples: 152493724. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:30:54,623][25689] Avg episode reward: [(0, '-57.757')] [2022-07-09 07:30:56,099][26022] Updated weights on worker 0-0, policy_version 148921 (0.00094) [2022-07-09 07:30:57,743][26022] Updated weights on worker 0-0, policy_version 148931 (0.00089) [2022-07-09 07:30:59,536][26022] Updated weights on worker 0-0, policy_version 148941 (0.00087) [2022-07-09 07:30:59,643][25689] Fps is (10 sec: 5720.4, 60 sec: 5699.9, 300 sec: 5737.0). Total num frames: 152515584. Throughput: 0: 5195.2. Samples: 152511112. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:30:59,643][25689] Avg episode reward: [(0, '-57.179')] [2022-07-09 07:31:01,276][26022] Updated weights on worker 0-0, policy_version 148951 (0.00205) [2022-07-09 07:31:03,571][26022] Updated weights on worker 0-0, policy_version 148961 (0.00088) [2022-07-09 07:31:04,646][25689] Fps is (10 sec: 5618.7, 60 sec: 5700.6, 300 sec: 5731.8). Total num frames: 152542208. Throughput: 0: 5912.5. Samples: 152543204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:04,647][25689] Avg episode reward: [(0, '-57.875')] [2022-07-09 07:31:05,278][26022] Updated weights on worker 0-0, policy_version 148971 (0.00085) [2022-07-09 07:31:07,153][26022] Updated weights on worker 0-0, policy_version 148981 (0.00087) [2022-07-09 07:31:08,803][26022] Updated weights on worker 0-0, policy_version 148991 (0.00102) [2022-07-09 07:31:09,772][25689] Fps is (10 sec: 5459.2, 60 sec: 5714.1, 300 sec: 5734.6). Total num frames: 152570880. Throughput: 0: 5877.4. Samples: 152577590. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:09,772][25689] Avg episode reward: [(0, '-58.996')] [2022-07-09 07:31:10,784][26022] Updated weights on worker 0-0, policy_version 149001 (0.00083) [2022-07-09 07:31:12,471][26022] Updated weights on worker 0-0, policy_version 149011 (0.00084) [2022-07-09 07:31:14,240][26022] Updated weights on worker 0-0, policy_version 149021 (0.00086) [2022-07-09 07:31:14,804][25689] Fps is (10 sec: 5847.0, 60 sec: 5732.6, 300 sec: 5735.3). Total num frames: 152601600. Throughput: 0: 5014.4. Samples: 152594922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:14,805][25689] Avg episode reward: [(0, '-59.793')] [2022-07-09 07:31:16,038][26022] Updated weights on worker 0-0, policy_version 149031 (0.00087) [2022-07-09 07:31:17,757][26022] Updated weights on worker 0-0, policy_version 149041 (0.00087) [2022-07-09 07:31:19,733][26022] Updated weights on worker 0-0, policy_version 149051 (0.00093) [2022-07-09 07:31:19,819][25689] Fps is (10 sec: 5707.4, 60 sec: 5698.1, 300 sec: 5728.9). Total num frames: 152628224. Throughput: 0: 5859.6. Samples: 152629336. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:19,819][25689] Avg episode reward: [(0, '-60.437')] [2022-07-09 07:31:21,340][26022] Updated weights on worker 0-0, policy_version 149061 (0.00093) [2022-07-09 07:31:23,164][26022] Updated weights on worker 0-0, policy_version 149071 (0.00090) [2022-07-09 07:31:24,827][25689] Fps is (10 sec: 5619.4, 60 sec: 5699.1, 300 sec: 5728.2). Total num frames: 152657920. Throughput: 0: 5997.9. Samples: 152664244. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:24,827][25689] Avg episode reward: [(0, '-60.723')] [2022-07-09 07:31:24,895][26022] Updated weights on worker 0-0, policy_version 149081 (0.00087) [2022-07-09 07:31:26,669][26022] Updated weights on worker 0-0, policy_version 149091 (0.00085) [2022-07-09 07:31:28,456][26022] Updated weights on worker 0-0, policy_version 149101 (0.00096) [2022-07-09 07:31:29,880][25689] Fps is (10 sec: 5902.9, 60 sec: 5731.1, 300 sec: 5730.6). Total num frames: 152687616. Throughput: 0: 5167.8. Samples: 152681508. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:29,881][25689] Avg episode reward: [(0, '-60.593')] [2022-07-09 07:31:30,225][26022] Updated weights on worker 0-0, policy_version 149111 (0.00090) [2022-07-09 07:31:32,004][26022] Updated weights on worker 0-0, policy_version 149121 (0.00084) [2022-07-09 07:31:33,731][26022] Updated weights on worker 0-0, policy_version 149131 (0.00083) [2022-07-09 07:31:34,895][25689] Fps is (10 sec: 5797.2, 60 sec: 5732.6, 300 sec: 5724.6). Total num frames: 152716288. Throughput: 0: 6043.0. Samples: 152716330. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:34,896][25689] Avg episode reward: [(0, '-60.509')] [2022-07-09 07:31:35,404][26022] Updated weights on worker 0-0, policy_version 149141 (0.00081) [2022-07-09 07:31:37,362][26022] Updated weights on worker 0-0, policy_version 149151 (0.00083) [2022-07-09 07:31:39,111][26022] Updated weights on worker 0-0, policy_version 149161 (0.00093) [2022-07-09 07:31:39,971][25689] Fps is (10 sec: 5682.8, 60 sec: 5710.6, 300 sec: 5730.9). Total num frames: 152744960. Throughput: 0: 6046.2. Samples: 152751182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:39,972][25689] Avg episode reward: [(0, '-59.270')] [2022-07-09 07:31:40,804][26022] Updated weights on worker 0-0, policy_version 149171 (0.00081) [2022-07-09 07:31:42,676][26022] Updated weights on worker 0-0, policy_version 149181 (0.00087) [2022-07-09 07:31:44,384][26022] Updated weights on worker 0-0, policy_version 149191 (0.00084) [2022-07-09 07:31:45,015][25689] Fps is (10 sec: 5868.7, 60 sec: 5741.3, 300 sec: 5730.9). Total num frames: 152775680. Throughput: 0: 5167.5. Samples: 152768568. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:45,015][25689] Avg episode reward: [(0, '-59.030')] [2022-07-09 07:31:46,080][26022] Updated weights on worker 0-0, policy_version 149201 (0.00084) [2022-07-09 07:31:47,892][26022] Updated weights on worker 0-0, policy_version 149211 (0.00086) [2022-07-09 07:31:49,927][26022] Updated weights on worker 0-0, policy_version 149221 (0.00084) [2022-07-09 07:31:50,079][25689] Fps is (10 sec: 5774.6, 60 sec: 5707.6, 300 sec: 5729.9). Total num frames: 152803328. Throughput: 0: 6030.4. Samples: 152803314. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:50,081][25689] Avg episode reward: [(0, '-57.874')] [2022-07-09 07:31:51,529][26022] Updated weights on worker 0-0, policy_version 149231 (0.00084) [2022-07-09 07:31:53,268][26022] Updated weights on worker 0-0, policy_version 149241 (0.00090) [2022-07-09 07:31:54,906][26022] Updated weights on worker 0-0, policy_version 149251 (0.00087) [2022-07-09 07:31:55,092][25689] Fps is (10 sec: 5792.0, 60 sec: 5757.6, 300 sec: 5733.2). Total num frames: 152834048. Throughput: 0: 6025.7. Samples: 152838034. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:31:55,093][25689] Avg episode reward: [(0, '-57.680')] [2022-07-09 07:31:56,867][26022] Updated weights on worker 0-0, policy_version 149261 (0.00094) [2022-07-09 07:31:58,554][26022] Updated weights on worker 0-0, policy_version 149271 (0.00086) [2022-07-09 07:32:00,162][25689] Fps is (10 sec: 5788.9, 60 sec: 5719.1, 300 sec: 5732.6). Total num frames: 152861696. Throughput: 0: 5162.5. Samples: 152855416. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:00,162][25689] Avg episode reward: [(0, '-57.293')] [2022-07-09 07:32:00,276][26022] Updated weights on worker 0-0, policy_version 149281 (0.00080) [2022-07-09 07:32:02,649][26022] Updated weights on worker 0-0, policy_version 149292 (0.00087) [2022-07-09 07:32:04,671][26022] Updated weights on worker 0-0, policy_version 149302 (0.00079) [2022-07-09 07:32:05,259][25689] Fps is (10 sec: 5439.1, 60 sec: 5727.2, 300 sec: 5726.3). Total num frames: 152889344. Throughput: 0: 5904.6. Samples: 152888100. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:05,259][25689] Avg episode reward: [(0, '-57.456')] [2022-07-09 07:32:06,179][26022] Updated weights on worker 0-0, policy_version 149312 (0.00085) [2022-07-09 07:32:08,045][26022] Updated weights on worker 0-0, policy_version 149322 (0.00090) [2022-07-09 07:32:09,743][26022] Updated weights on worker 0-0, policy_version 149332 (0.00083) [2022-07-09 07:32:10,385][25689] Fps is (10 sec: 5509.0, 60 sec: 5727.1, 300 sec: 5728.9). Total num frames: 152918016. Throughput: 0: 5871.1. Samples: 152922532. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:10,385][25689] Avg episode reward: [(0, '-56.831')] [2022-07-09 07:32:11,614][26022] Updated weights on worker 0-0, policy_version 149342 (0.00093) [2022-07-09 07:32:13,241][26022] Updated weights on worker 0-0, policy_version 149352 (0.00089) [2022-07-09 07:32:15,178][26022] Updated weights on worker 0-0, policy_version 149362 (0.00091) [2022-07-09 07:32:15,436][25689] Fps is (10 sec: 5735.0, 60 sec: 5708.4, 300 sec: 5721.4). Total num frames: 152947712. Throughput: 0: 5857.4. Samples: 152957196. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:15,437][25689] Avg episode reward: [(0, '-56.288')] [2022-07-09 07:32:16,832][26022] Updated weights on worker 0-0, policy_version 149372 (0.00081) [2022-07-09 07:32:18,779][26022] Updated weights on worker 0-0, policy_version 149382 (0.00093) [2022-07-09 07:32:20,409][26022] Updated weights on worker 0-0, policy_version 149392 (0.00084) [2022-07-09 07:32:20,463][25689] Fps is (10 sec: 5893.3, 60 sec: 5757.9, 300 sec: 5731.7). Total num frames: 152977408. Throughput: 0: 5872.0. Samples: 152974624. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:20,463][25689] Avg episode reward: [(0, '-56.339')] [2022-07-09 07:32:22,255][26022] Updated weights on worker 0-0, policy_version 149402 (0.00088) [2022-07-09 07:32:23,975][26022] Updated weights on worker 0-0, policy_version 149412 (0.00080) [2022-07-09 07:32:25,487][25689] Fps is (10 sec: 5705.6, 60 sec: 5722.6, 300 sec: 5725.4). Total num frames: 153005056. Throughput: 0: 5968.0. Samples: 153008820. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:25,487][25689] Avg episode reward: [(0, '-56.341')] [2022-07-09 07:32:25,801][26022] Updated weights on worker 0-0, policy_version 149422 (0.00091) [2022-07-09 07:32:27,447][26022] Updated weights on worker 0-0, policy_version 149432 (0.00094) [2022-07-09 07:32:29,361][26022] Updated weights on worker 0-0, policy_version 149442 (0.00093) [2022-07-09 07:32:30,561][25689] Fps is (10 sec: 5678.9, 60 sec: 5720.7, 300 sec: 5724.2). Total num frames: 153034752. Throughput: 0: 5989.4. Samples: 153043372. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:30,561][25689] Avg episode reward: [(0, '-57.128')] [2022-07-09 07:32:31,104][26022] Updated weights on worker 0-0, policy_version 149452 (0.00093) [2022-07-09 07:32:33,094][26022] Updated weights on worker 0-0, policy_version 149462 (0.00079) [2022-07-09 07:32:34,620][26022] Updated weights on worker 0-0, policy_version 149472 (0.00085) [2022-07-09 07:32:35,591][25689] Fps is (10 sec: 5776.7, 60 sec: 5719.3, 300 sec: 5723.9). Total num frames: 153063424. Throughput: 0: 5138.9. Samples: 153060768. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:35,591][25689] Avg episode reward: [(0, '-57.731')] [2022-07-09 07:32:36,423][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:32:36,438][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000149482_153069568.pth [2022-07-09 07:32:36,439][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000147466_151005184.pth [2022-07-09 07:32:36,442][26022] Updated weights on worker 0-0, policy_version 149482 (0.00079) [2022-07-09 07:32:38,219][26022] Updated weights on worker 0-0, policy_version 149492 (0.00088) [2022-07-09 07:32:39,923][26022] Updated weights on worker 0-0, policy_version 149502 (0.00052) [2022-07-09 07:32:40,609][25689] Fps is (10 sec: 5808.5, 60 sec: 5741.6, 300 sec: 5731.0). Total num frames: 153093120. Throughput: 0: 6009.2. Samples: 153095688. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 07:32:40,610][25689] Avg episode reward: [(0, '-58.036')] [2022-07-09 07:32:41,820][26022] Updated weights on worker 0-0, policy_version 149512 (0.00086) [2022-07-09 07:32:43,506][26022] Updated weights on worker 0-0, policy_version 149522 (0.00088) [2022-07-09 07:32:45,253][26022] Updated weights on worker 0-0, policy_version 149532 (0.00081) [2022-07-09 07:32:45,615][25689] Fps is (10 sec: 5924.6, 60 sec: 5728.3, 300 sec: 5728.6). Total num frames: 153122816. Throughput: 0: 6051.7. Samples: 153130632. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:32:45,617][25689] Avg episode reward: [(0, '-57.634')] [2022-07-09 07:32:46,913][26022] Updated weights on worker 0-0, policy_version 149542 (0.00085) [2022-07-09 07:32:48,840][26022] Updated weights on worker 0-0, policy_version 149552 (0.01005) [2022-07-09 07:32:50,597][26022] Updated weights on worker 0-0, policy_version 149562 (0.00089) [2022-07-09 07:32:50,709][25689] Fps is (10 sec: 5779.0, 60 sec: 5742.3, 300 sec: 5726.9). Total num frames: 153151488. Throughput: 0: 5193.2. Samples: 153148008. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:32:50,710][25689] Avg episode reward: [(0, '-58.530')] [2022-07-09 07:32:52,398][26022] Updated weights on worker 0-0, policy_version 149572 (0.00086) [2022-07-09 07:32:54,106][26022] Updated weights on worker 0-0, policy_version 149582 (0.00117) [2022-07-09 07:32:55,712][25689] Fps is (10 sec: 5679.2, 60 sec: 5709.5, 300 sec: 5727.0). Total num frames: 153180160. Throughput: 0: 6051.0. Samples: 153182524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:32:55,719][25689] Avg episode reward: [(0, '-59.137')] [2022-07-09 07:32:55,840][26022] Updated weights on worker 0-0, policy_version 149592 (0.00086) [2022-07-09 07:32:57,877][26022] Updated weights on worker 0-0, policy_version 149602 (0.00085) [2022-07-09 07:32:59,470][26022] Updated weights on worker 0-0, policy_version 149612 (0.00095) [2022-07-09 07:33:00,726][25689] Fps is (10 sec: 5622.8, 60 sec: 5714.8, 300 sec: 5727.3). Total num frames: 153207808. Throughput: 0: 6022.6. Samples: 153216840. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:00,727][25689] Avg episode reward: [(0, '-58.415')] [2022-07-09 07:33:01,386][26022] Updated weights on worker 0-0, policy_version 149622 (0.00084) [2022-07-09 07:33:03,519][26022] Updated weights on worker 0-0, policy_version 149632 (0.00089) [2022-07-09 07:33:05,275][26022] Updated weights on worker 0-0, policy_version 149642 (0.00089) [2022-07-09 07:33:05,734][25689] Fps is (10 sec: 5517.8, 60 sec: 5723.2, 300 sec: 5725.9). Total num frames: 153235456. Throughput: 0: 5024.9. Samples: 153231726. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:05,734][25689] Avg episode reward: [(0, '-57.781')] [2022-07-09 07:33:07,150][26022] Updated weights on worker 0-0, policy_version 149652 (0.00088) [2022-07-09 07:33:08,893][26022] Updated weights on worker 0-0, policy_version 149662 (0.00086) [2022-07-09 07:33:10,824][25689] Fps is (10 sec: 5577.0, 60 sec: 5726.6, 300 sec: 5726.0). Total num frames: 153264128. Throughput: 0: 5865.6. Samples: 153265994. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:10,825][25689] Avg episode reward: [(0, '-58.685')] [2022-07-09 07:33:10,841][26022] Updated weights on worker 0-0, policy_version 149672 (0.00090) [2022-07-09 07:33:12,543][26022] Updated weights on worker 0-0, policy_version 149682 (0.00089) [2022-07-09 07:33:14,193][26022] Updated weights on worker 0-0, policy_version 149692 (0.00084) [2022-07-09 07:33:15,897][25689] Fps is (10 sec: 5642.4, 60 sec: 5707.6, 300 sec: 5721.8). Total num frames: 153292800. Throughput: 0: 5851.7. Samples: 153300636. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:15,897][25689] Avg episode reward: [(0, '-58.462')] [2022-07-09 07:33:16,126][26022] Updated weights on worker 0-0, policy_version 149702 (0.00088) [2022-07-09 07:33:17,825][26022] Updated weights on worker 0-0, policy_version 149712 (0.00088) [2022-07-09 07:33:19,807][26022] Updated weights on worker 0-0, policy_version 149722 (0.00086) [2022-07-09 07:33:20,950][25689] Fps is (10 sec: 5865.2, 60 sec: 5722.0, 300 sec: 5731.8). Total num frames: 153323520. Throughput: 0: 5003.7. Samples: 153318038. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:20,951][25689] Avg episode reward: [(0, '-57.698')] [2022-07-09 07:33:21,407][26022] Updated weights on worker 0-0, policy_version 149732 (0.00083) [2022-07-09 07:33:23,125][26022] Updated weights on worker 0-0, policy_version 149742 (0.01179) [2022-07-09 07:33:24,937][26022] Updated weights on worker 0-0, policy_version 149752 (0.00096) [2022-07-09 07:33:26,041][25689] Fps is (10 sec: 5955.9, 60 sec: 5749.6, 300 sec: 5729.2). Total num frames: 153353216. Throughput: 0: 5966.9. Samples: 153352896. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:26,041][25689] Avg episode reward: [(0, '-57.305')] [2022-07-09 07:33:26,738][26022] Updated weights on worker 0-0, policy_version 149762 (0.00085) [2022-07-09 07:33:28,477][26022] Updated weights on worker 0-0, policy_version 149772 (0.00080) [2022-07-09 07:33:30,307][26022] Updated weights on worker 0-0, policy_version 149782 (0.00092) [2022-07-09 07:33:31,098][25689] Fps is (10 sec: 5651.1, 60 sec: 5717.3, 300 sec: 5721.7). Total num frames: 153380864. Throughput: 0: 5995.9. Samples: 153387552. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:31,098][25689] Avg episode reward: [(0, '-57.943')] [2022-07-09 07:33:31,907][26022] Updated weights on worker 0-0, policy_version 149792 (0.00087) [2022-07-09 07:33:33,785][26022] Updated weights on worker 0-0, policy_version 149802 (0.00086) [2022-07-09 07:33:35,510][26022] Updated weights on worker 0-0, policy_version 149812 (0.00368) [2022-07-09 07:33:36,133][25689] Fps is (10 sec: 5682.2, 60 sec: 5733.8, 300 sec: 5721.4). Total num frames: 153410560. Throughput: 0: 5151.8. Samples: 153404886. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:36,133][25689] Avg episode reward: [(0, '-57.972')] [2022-07-09 07:33:37,322][26022] Updated weights on worker 0-0, policy_version 149822 (0.00089) [2022-07-09 07:33:39,112][26022] Updated weights on worker 0-0, policy_version 149832 (0.00613) [2022-07-09 07:33:41,022][26022] Updated weights on worker 0-0, policy_version 149842 (0.00084) [2022-07-09 07:33:41,149][25689] Fps is (10 sec: 5807.1, 60 sec: 5717.1, 300 sec: 5724.7). Total num frames: 153439232. Throughput: 0: 6006.1. Samples: 153439352. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:41,150][25689] Avg episode reward: [(0, '-57.572')] [2022-07-09 07:33:42,630][26022] Updated weights on worker 0-0, policy_version 149852 (0.00081) [2022-07-09 07:33:44,458][26022] Updated weights on worker 0-0, policy_version 149862 (0.00089) [2022-07-09 07:33:46,036][26022] Updated weights on worker 0-0, policy_version 149872 (0.00089) [2022-07-09 07:33:46,154][25689] Fps is (10 sec: 5824.3, 60 sec: 5717.2, 300 sec: 5727.2). Total num frames: 153468928. Throughput: 0: 6038.0. Samples: 153474340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:46,155][25689] Avg episode reward: [(0, '-56.864')] [2022-07-09 07:33:47,919][26022] Updated weights on worker 0-0, policy_version 149882 (0.00089) [2022-07-09 07:33:49,730][26022] Updated weights on worker 0-0, policy_version 149892 (0.00085) [2022-07-09 07:33:51,223][25689] Fps is (10 sec: 5692.3, 60 sec: 5702.6, 300 sec: 5719.5). Total num frames: 153496576. Throughput: 0: 5179.8. Samples: 153491796. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:51,225][25689] Avg episode reward: [(0, '-56.505')] [2022-07-09 07:33:51,741][26022] Updated weights on worker 0-0, policy_version 149902 (0.00686) [2022-07-09 07:33:53,304][26022] Updated weights on worker 0-0, policy_version 149912 (0.00088) [2022-07-09 07:33:55,139][26022] Updated weights on worker 0-0, policy_version 149922 (0.00084) [2022-07-09 07:33:56,232][25689] Fps is (10 sec: 5690.1, 60 sec: 5719.0, 300 sec: 5726.3). Total num frames: 153526272. Throughput: 0: 6003.9. Samples: 153525562. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:33:56,233][25689] Avg episode reward: [(0, '-55.942')] [2022-07-09 07:33:56,751][26022] Updated weights on worker 0-0, policy_version 149932 (0.00090) [2022-07-09 07:33:58,810][26022] Updated weights on worker 0-0, policy_version 149942 (0.00084) [2022-07-09 07:34:00,450][26022] Updated weights on worker 0-0, policy_version 149952 (0.00083) [2022-07-09 07:34:01,266][25689] Fps is (10 sec: 5913.9, 60 sec: 5750.9, 300 sec: 5729.5). Total num frames: 153555968. Throughput: 0: 6021.5. Samples: 153560484. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:01,267][25689] Avg episode reward: [(0, '-54.728')] [2022-07-09 07:34:02,640][26022] Updated weights on worker 0-0, policy_version 149962 (0.00085) [2022-07-09 07:34:04,272][26022] Updated weights on worker 0-0, policy_version 149972 (0.00086) [2022-07-09 07:34:06,265][26022] Updated weights on worker 0-0, policy_version 149982 (0.00088) [2022-07-09 07:34:06,286][25689] Fps is (10 sec: 5499.8, 60 sec: 5715.9, 300 sec: 5720.5). Total num frames: 153581568. Throughput: 0: 5030.3. Samples: 153575612. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:06,287][25689] Avg episode reward: [(0, '-55.180')] [2022-07-09 07:34:07,899][26022] Updated weights on worker 0-0, policy_version 149992 (0.00900) [2022-07-09 07:34:09,790][26022] Updated weights on worker 0-0, policy_version 150002 (0.00092) [2022-07-09 07:34:11,351][25689] Fps is (10 sec: 5483.1, 60 sec: 5735.3, 300 sec: 5723.7). Total num frames: 153611264. Throughput: 0: 5877.1. Samples: 153610088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:11,351][25689] Avg episode reward: [(0, '-54.747')] [2022-07-09 07:34:11,491][26022] Updated weights on worker 0-0, policy_version 150012 (0.00084) [2022-07-09 07:34:13,362][26022] Updated weights on worker 0-0, policy_version 150022 (0.00048) [2022-07-09 07:34:15,085][26022] Updated weights on worker 0-0, policy_version 150032 (0.00080) [2022-07-09 07:34:16,369][25689] Fps is (10 sec: 5788.7, 60 sec: 5740.4, 300 sec: 5720.9). Total num frames: 153639936. Throughput: 0: 5915.0. Samples: 153644674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:16,370][25689] Avg episode reward: [(0, '-55.699')] [2022-07-09 07:34:16,763][26022] Updated weights on worker 0-0, policy_version 150042 (0.00086) [2022-07-09 07:34:18,601][26022] Updated weights on worker 0-0, policy_version 150052 (0.00086) [2022-07-09 07:34:20,424][26022] Updated weights on worker 0-0, policy_version 150062 (0.00094) [2022-07-09 07:34:21,460][25689] Fps is (10 sec: 5672.6, 60 sec: 5703.1, 300 sec: 5722.9). Total num frames: 153668608. Throughput: 0: 5028.2. Samples: 153662022. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:21,460][25689] Avg episode reward: [(0, '-57.203')] [2022-07-09 07:34:22,260][26022] Updated weights on worker 0-0, policy_version 150072 (0.00093) [2022-07-09 07:34:23,847][26022] Updated weights on worker 0-0, policy_version 150082 (0.00095) [2022-07-09 07:34:25,644][26022] Updated weights on worker 0-0, policy_version 150092 (0.00091) [2022-07-09 07:34:26,466][25689] Fps is (10 sec: 5780.7, 60 sec: 5711.0, 300 sec: 5725.4). Total num frames: 153698304. Throughput: 0: 5989.5. Samples: 153696480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:26,468][25689] Avg episode reward: [(0, '-57.923')] [2022-07-09 07:34:27,667][26022] Updated weights on worker 0-0, policy_version 150102 (0.00085) [2022-07-09 07:34:29,221][26022] Updated weights on worker 0-0, policy_version 150112 (0.00091) [2022-07-09 07:34:31,129][26022] Updated weights on worker 0-0, policy_version 150122 (0.00085) [2022-07-09 07:34:31,556][25689] Fps is (10 sec: 5780.9, 60 sec: 5724.8, 300 sec: 5718.6). Total num frames: 153726976. Throughput: 0: 5982.5. Samples: 153730968. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:31,557][25689] Avg episode reward: [(0, '-57.930')] [2022-07-09 07:34:32,926][26022] Updated weights on worker 0-0, policy_version 150132 (0.00088) [2022-07-09 07:34:34,546][26022] Updated weights on worker 0-0, policy_version 150142 (0.00079) [2022-07-09 07:34:36,560][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:34:36,569][25689] Fps is (10 sec: 5473.7, 60 sec: 5676.1, 300 sec: 5712.5). Total num frames: 153753600. Throughput: 0: 5986.9. Samples: 153765604. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:36,569][25689] Avg episode reward: [(0, '-58.156')] [2022-07-09 07:34:36,576][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000150151_153754624.pth [2022-07-09 07:34:36,577][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000148139_151694336.pth [2022-07-09 07:34:36,688][26022] Updated weights on worker 0-0, policy_version 150152 (0.00082) [2022-07-09 07:34:38,170][26022] Updated weights on worker 0-0, policy_version 150162 (0.00054) [2022-07-09 07:34:40,137][26022] Updated weights on worker 0-0, policy_version 150172 (0.00085) [2022-07-09 07:34:41,635][25689] Fps is (10 sec: 5791.3, 60 sec: 5722.2, 300 sec: 5725.4). Total num frames: 153785344. Throughput: 0: 5975.0. Samples: 153782570. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:41,635][25689] Avg episode reward: [(0, '-58.536')] [2022-07-09 07:34:41,731][26022] Updated weights on worker 0-0, policy_version 150182 (0.00090) [2022-07-09 07:34:43,483][26022] Updated weights on worker 0-0, policy_version 150192 (0.00083) [2022-07-09 07:34:45,301][26022] Updated weights on worker 0-0, policy_version 150202 (0.00090) [2022-07-09 07:34:46,681][25689] Fps is (10 sec: 5974.3, 60 sec: 5701.4, 300 sec: 5715.2). Total num frames: 153814016. Throughput: 0: 5996.9. Samples: 153817706. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 07:34:46,682][25689] Avg episode reward: [(0, '-58.944')] [2022-07-09 07:34:47,040][26022] Updated weights on worker 0-0, policy_version 150212 (0.00095) [2022-07-09 07:34:48,637][26022] Updated weights on worker 0-0, policy_version 150222 (0.00088) [2022-07-09 07:34:50,615][26022] Updated weights on worker 0-0, policy_version 150232 (0.00866) [2022-07-09 07:34:51,766][25689] Fps is (10 sec: 5963.3, 60 sec: 5767.5, 300 sec: 5734.4). Total num frames: 153845760. Throughput: 0: 6035.5. Samples: 153852944. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:34:51,767][25689] Avg episode reward: [(0, '-58.655')] [2022-07-09 07:34:52,296][26022] Updated weights on worker 0-0, policy_version 150242 (0.00088) [2022-07-09 07:34:54,199][26022] Updated weights on worker 0-0, policy_version 150252 (0.00083) [2022-07-09 07:34:55,814][26022] Updated weights on worker 0-0, policy_version 150262 (0.00087) [2022-07-09 07:34:56,771][25689] Fps is (10 sec: 5785.0, 60 sec: 5717.2, 300 sec: 5717.3). Total num frames: 153872384. Throughput: 0: 5172.1. Samples: 153870092. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:34:56,771][25689] Avg episode reward: [(0, '-59.074')] [2022-07-09 07:34:57,824][26022] Updated weights on worker 0-0, policy_version 150272 (0.00084) [2022-07-09 07:34:59,300][26022] Updated weights on worker 0-0, policy_version 150282 (0.00089) [2022-07-09 07:35:01,391][26022] Updated weights on worker 0-0, policy_version 150292 (0.00085) [2022-07-09 07:35:01,820][25689] Fps is (10 sec: 5500.1, 60 sec: 5698.9, 300 sec: 5723.5). Total num frames: 153901056. Throughput: 0: 6066.2. Samples: 153905016. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:01,820][25689] Avg episode reward: [(0, '-59.163')] [2022-07-09 07:35:03,140][26022] Updated weights on worker 0-0, policy_version 150302 (0.00978) [2022-07-09 07:35:05,080][26022] Updated weights on worker 0-0, policy_version 150312 (0.00091) [2022-07-09 07:35:06,811][26022] Updated weights on worker 0-0, policy_version 150322 (0.00086) [2022-07-09 07:35:06,906][25689] Fps is (10 sec: 5657.8, 60 sec: 5743.3, 300 sec: 5727.0). Total num frames: 153929728. Throughput: 0: 5938.3. Samples: 153937808. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:06,907][25689] Avg episode reward: [(0, '-58.899')] [2022-07-09 07:35:08,487][26022] Updated weights on worker 0-0, policy_version 150332 (0.00085) [2022-07-09 07:35:10,471][26022] Updated weights on worker 0-0, policy_version 150342 (0.00087) [2022-07-09 07:35:11,908][26022] Updated weights on worker 0-0, policy_version 150352 (0.00089) [2022-07-09 07:35:12,018][25689] Fps is (10 sec: 5824.0, 60 sec: 5755.8, 300 sec: 5729.2). Total num frames: 153960448. Throughput: 0: 5052.9. Samples: 153955272. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:12,018][25689] Avg episode reward: [(0, '-58.611')] [2022-07-09 07:35:13,867][26022] Updated weights on worker 0-0, policy_version 150362 (0.00085) [2022-07-09 07:35:15,696][26022] Updated weights on worker 0-0, policy_version 150372 (0.00616) [2022-07-09 07:35:17,106][25689] Fps is (10 sec: 5822.7, 60 sec: 5749.1, 300 sec: 5727.7). Total num frames: 153989120. Throughput: 0: 5910.6. Samples: 153990286. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:17,107][25689] Avg episode reward: [(0, '-58.216')] [2022-07-09 07:35:17,344][26022] Updated weights on worker 0-0, policy_version 150382 (0.00090) [2022-07-09 07:35:19,396][26022] Updated weights on worker 0-0, policy_version 150392 (0.00087) [2022-07-09 07:35:20,763][26022] Updated weights on worker 0-0, policy_version 150402 (0.00083) [2022-07-09 07:35:22,142][25689] Fps is (10 sec: 5765.1, 60 sec: 5771.2, 300 sec: 5727.4). Total num frames: 154018816. Throughput: 0: 5913.4. Samples: 154025188. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:22,142][25689] Avg episode reward: [(0, '-57.655')] [2022-07-09 07:35:22,709][26022] Updated weights on worker 0-0, policy_version 150412 (0.00080) [2022-07-09 07:35:24,424][26022] Updated weights on worker 0-0, policy_version 150422 (0.00086) [2022-07-09 07:35:26,179][26022] Updated weights on worker 0-0, policy_version 150432 (0.00084) [2022-07-09 07:35:27,148][25689] Fps is (10 sec: 5608.7, 60 sec: 5720.6, 300 sec: 5724.4). Total num frames: 154045440. Throughput: 0: 5170.8. Samples: 154042474. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:27,149][25689] Avg episode reward: [(0, '-56.897')] [2022-07-09 07:35:28,087][26022] Updated weights on worker 0-0, policy_version 150442 (0.00086) [2022-07-09 07:35:29,776][26022] Updated weights on worker 0-0, policy_version 150452 (0.00106) [2022-07-09 07:35:31,495][26022] Updated weights on worker 0-0, policy_version 150462 (0.00085) [2022-07-09 07:35:32,208][25689] Fps is (10 sec: 5798.2, 60 sec: 5774.0, 300 sec: 5734.2). Total num frames: 154077184. Throughput: 0: 6043.3. Samples: 154077292. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:32,210][25689] Avg episode reward: [(0, '-56.321')] [2022-07-09 07:35:33,362][26022] Updated weights on worker 0-0, policy_version 150472 (0.00092) [2022-07-09 07:35:34,950][26022] Updated weights on worker 0-0, policy_version 150482 (0.00083) [2022-07-09 07:35:36,855][26022] Updated weights on worker 0-0, policy_version 150492 (0.00099) [2022-07-09 07:35:37,281][25689] Fps is (10 sec: 5962.2, 60 sec: 5802.0, 300 sec: 5729.8). Total num frames: 154105856. Throughput: 0: 6042.0. Samples: 154112182. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:37,283][25689] Avg episode reward: [(0, '-56.393')] [2022-07-09 07:35:38,677][26022] Updated weights on worker 0-0, policy_version 150502 (0.00089) [2022-07-09 07:35:40,332][26022] Updated weights on worker 0-0, policy_version 150512 (0.00089) [2022-07-09 07:35:42,217][26022] Updated weights on worker 0-0, policy_version 150522 (0.00053) [2022-07-09 07:35:42,286][25689] Fps is (10 sec: 5690.1, 60 sec: 5757.2, 300 sec: 5729.9). Total num frames: 154134528. Throughput: 0: 5172.9. Samples: 154129394. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:42,287][25689] Avg episode reward: [(0, '-56.126')] [2022-07-09 07:35:43,874][26022] Updated weights on worker 0-0, policy_version 150532 (0.00088) [2022-07-09 07:35:45,748][26022] Updated weights on worker 0-0, policy_version 150542 (0.00090) [2022-07-09 07:35:47,299][25689] Fps is (10 sec: 5826.4, 60 sec: 5777.3, 300 sec: 5730.9). Total num frames: 154164224. Throughput: 0: 6046.0. Samples: 154164308. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:47,300][25689] Avg episode reward: [(0, '-55.787')] [2022-07-09 07:35:47,443][26022] Updated weights on worker 0-0, policy_version 150552 (0.00087) [2022-07-09 07:35:49,112][26022] Updated weights on worker 0-0, policy_version 150562 (0.00084) [2022-07-09 07:35:50,925][26022] Updated weights on worker 0-0, policy_version 150572 (0.00085) [2022-07-09 07:35:52,348][25689] Fps is (10 sec: 5800.9, 60 sec: 5730.0, 300 sec: 5733.5). Total num frames: 154192896. Throughput: 0: 6049.2. Samples: 154199122. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:52,350][25689] Avg episode reward: [(0, '-56.073')] [2022-07-09 07:35:52,768][26022] Updated weights on worker 0-0, policy_version 150582 (0.00090) [2022-07-09 07:35:54,479][26022] Updated weights on worker 0-0, policy_version 150592 (0.00084) [2022-07-09 07:35:56,210][26022] Updated weights on worker 0-0, policy_version 150602 (0.00088) [2022-07-09 07:35:57,355][25689] Fps is (10 sec: 5702.4, 60 sec: 5763.6, 300 sec: 5730.3). Total num frames: 154221568. Throughput: 0: 5192.2. Samples: 154216410. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:35:57,356][25689] Avg episode reward: [(0, '-56.715')] [2022-07-09 07:35:57,895][26022] Updated weights on worker 0-0, policy_version 150612 (0.00087) [2022-07-09 07:35:59,878][26022] Updated weights on worker 0-0, policy_version 150622 (0.00079) [2022-07-09 07:36:01,575][26022] Updated weights on worker 0-0, policy_version 150632 (0.00086) [2022-07-09 07:36:02,362][25689] Fps is (10 sec: 5522.1, 60 sec: 5733.8, 300 sec: 5730.2). Total num frames: 154248192. Throughput: 0: 6053.3. Samples: 154250916. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:02,364][25689] Avg episode reward: [(0, '-56.913')] [2022-07-09 07:36:03,855][26022] Updated weights on worker 0-0, policy_version 150642 (0.00097) [2022-07-09 07:36:05,734][26022] Updated weights on worker 0-0, policy_version 150652 (0.00092) [2022-07-09 07:36:07,367][25689] Fps is (10 sec: 5523.2, 60 sec: 5741.5, 300 sec: 5732.6). Total num frames: 154276864. Throughput: 0: 5937.3. Samples: 154283456. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:07,369][25689] Avg episode reward: [(0, '-57.330')] [2022-07-09 07:36:07,419][26022] Updated weights on worker 0-0, policy_version 150662 (0.00094) [2022-07-09 07:36:09,271][26022] Updated weights on worker 0-0, policy_version 150672 (0.00088) [2022-07-09 07:36:11,036][26022] Updated weights on worker 0-0, policy_version 150682 (0.00106) [2022-07-09 07:36:12,411][25689] Fps is (10 sec: 5808.4, 60 sec: 5731.0, 300 sec: 5728.9). Total num frames: 154306560. Throughput: 0: 5052.3. Samples: 154300484. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:12,412][25689] Avg episode reward: [(0, '-58.110')] [2022-07-09 07:36:12,798][26022] Updated weights on worker 0-0, policy_version 150692 (0.00086) [2022-07-09 07:36:14,630][26022] Updated weights on worker 0-0, policy_version 150702 (0.00091) [2022-07-09 07:36:16,235][26022] Updated weights on worker 0-0, policy_version 150712 (0.00092) [2022-07-09 07:36:17,416][25689] Fps is (10 sec: 5808.3, 60 sec: 5738.9, 300 sec: 5736.0). Total num frames: 154335232. Throughput: 0: 5930.2. Samples: 154335374. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:17,416][25689] Avg episode reward: [(0, '-58.237')] [2022-07-09 07:36:17,956][26022] Updated weights on worker 0-0, policy_version 150722 (0.00088) [2022-07-09 07:36:19,702][26022] Updated weights on worker 0-0, policy_version 150732 (0.00085) [2022-07-09 07:36:21,628][26022] Updated weights on worker 0-0, policy_version 150742 (0.00090) [2022-07-09 07:36:22,427][25689] Fps is (10 sec: 5725.2, 60 sec: 5724.3, 300 sec: 5732.5). Total num frames: 154363904. Throughput: 0: 5926.3. Samples: 154369826. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:22,427][25689] Avg episode reward: [(0, '-58.415')] [2022-07-09 07:36:23,490][26022] Updated weights on worker 0-0, policy_version 150752 (0.00096) [2022-07-09 07:36:25,049][26022] Updated weights on worker 0-0, policy_version 150762 (0.00094) [2022-07-09 07:36:26,987][26022] Updated weights on worker 0-0, policy_version 150772 (0.00078) [2022-07-09 07:36:27,430][25689] Fps is (10 sec: 5726.3, 60 sec: 5758.5, 300 sec: 5730.0). Total num frames: 154392576. Throughput: 0: 5154.4. Samples: 154386870. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:27,431][25689] Avg episode reward: [(0, '-57.574')] [2022-07-09 07:36:28,555][26022] Updated weights on worker 0-0, policy_version 150782 (0.00081) [2022-07-09 07:36:30,727][26022] Updated weights on worker 0-0, policy_version 150792 (0.00085) [2022-07-09 07:36:32,281][26022] Updated weights on worker 0-0, policy_version 150802 (0.00093) [2022-07-09 07:36:32,486][25689] Fps is (10 sec: 5700.6, 60 sec: 5708.0, 300 sec: 5729.2). Total num frames: 154421248. Throughput: 0: 6027.6. Samples: 154421490. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:32,486][25689] Avg episode reward: [(0, '-57.811')] [2022-07-09 07:36:34,077][26022] Updated weights on worker 0-0, policy_version 150812 (0.00088) [2022-07-09 07:36:35,905][26022] Updated weights on worker 0-0, policy_version 150822 (0.00086) [2022-07-09 07:36:36,677][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:36:36,685][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000150826_154445824.pth [2022-07-09 07:36:36,691][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000148811_152382464.pth [2022-07-09 07:36:37,497][25689] Fps is (10 sec: 5696.2, 60 sec: 5713.9, 300 sec: 5730.5). Total num frames: 154449920. Throughput: 0: 6019.0. Samples: 154456242. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:37,499][25689] Avg episode reward: [(0, '-57.340')] [2022-07-09 07:36:37,605][26022] Updated weights on worker 0-0, policy_version 150832 (0.00085) [2022-07-09 07:36:39,569][26022] Updated weights on worker 0-0, policy_version 150842 (0.00089) [2022-07-09 07:36:41,164][26022] Updated weights on worker 0-0, policy_version 150852 (0.00087) [2022-07-09 07:36:42,527][25689] Fps is (10 sec: 5812.9, 60 sec: 5728.5, 300 sec: 5727.3). Total num frames: 154479616. Throughput: 0: 5158.7. Samples: 154473518. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:42,528][25689] Avg episode reward: [(0, '-56.597')] [2022-07-09 07:36:43,183][26022] Updated weights on worker 0-0, policy_version 150862 (0.00081) [2022-07-09 07:36:44,875][26022] Updated weights on worker 0-0, policy_version 150872 (0.00089) [2022-07-09 07:36:46,501][26022] Updated weights on worker 0-0, policy_version 150882 (0.00086) [2022-07-09 07:36:47,535][25689] Fps is (10 sec: 5814.9, 60 sec: 5712.0, 300 sec: 5731.8). Total num frames: 154508288. Throughput: 0: 6034.1. Samples: 154508184. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:47,535][25689] Avg episode reward: [(0, '-57.471')] [2022-07-09 07:36:48,350][26022] Updated weights on worker 0-0, policy_version 150892 (0.00081) [2022-07-09 07:36:49,907][26022] Updated weights on worker 0-0, policy_version 150902 (0.00090) [2022-07-09 07:36:51,914][26022] Updated weights on worker 0-0, policy_version 150912 (0.00086) [2022-07-09 07:36:52,611][25689] Fps is (10 sec: 5788.4, 60 sec: 5726.4, 300 sec: 5727.2). Total num frames: 154537984. Throughput: 0: 6031.9. Samples: 154542880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 07:36:52,611][25689] Avg episode reward: [(0, '-57.353')] [2022-07-09 07:36:53,595][26022] Updated weights on worker 0-0, policy_version 150922 (0.00083) [2022-07-09 07:36:55,412][26022] Updated weights on worker 0-0, policy_version 150932 (0.00086) [2022-07-09 07:36:57,210][26022] Updated weights on worker 0-0, policy_version 150942 (0.00086) [2022-07-09 07:36:57,623][25689] Fps is (10 sec: 5684.0, 60 sec: 5708.9, 300 sec: 5728.3). Total num frames: 154565632. Throughput: 0: 5165.4. Samples: 154560202. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:36:57,624][25689] Avg episode reward: [(0, '-58.460')] [2022-07-09 07:36:58,967][26022] Updated weights on worker 0-0, policy_version 150952 (0.00090) [2022-07-09 07:37:00,742][26022] Updated weights on worker 0-0, policy_version 150962 (0.00365) [2022-07-09 07:37:02,639][25689] Fps is (10 sec: 5615.9, 60 sec: 5742.0, 300 sec: 5733.3). Total num frames: 154594304. Throughput: 0: 6036.7. Samples: 154594930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:02,640][25689] Avg episode reward: [(0, '-58.599')] [2022-07-09 07:37:02,855][26022] Updated weights on worker 0-0, policy_version 150972 (0.00095) [2022-07-09 07:37:04,447][26022] Updated weights on worker 0-0, policy_version 150982 (0.00089) [2022-07-09 07:37:06,470][26022] Updated weights on worker 0-0, policy_version 150992 (0.00086) [2022-07-09 07:37:07,671][25689] Fps is (10 sec: 5707.4, 60 sec: 5739.5, 300 sec: 5735.1). Total num frames: 154622976. Throughput: 0: 5922.1. Samples: 154627432. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:07,671][25689] Avg episode reward: [(0, '-58.870')] [2022-07-09 07:37:07,919][26022] Updated weights on worker 0-0, policy_version 151002 (0.00086) [2022-07-09 07:37:10,122][26022] Updated weights on worker 0-0, policy_version 151012 (0.00087) [2022-07-09 07:37:11,580][26022] Updated weights on worker 0-0, policy_version 151022 (0.00088) [2022-07-09 07:37:12,718][25689] Fps is (10 sec: 5588.1, 60 sec: 5705.2, 300 sec: 5728.3). Total num frames: 154650624. Throughput: 0: 5067.2. Samples: 154644768. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:12,718][25689] Avg episode reward: [(0, '-58.654')] [2022-07-09 07:37:13,593][26022] Updated weights on worker 0-0, policy_version 151032 (0.00507) [2022-07-09 07:37:15,361][26022] Updated weights on worker 0-0, policy_version 151042 (0.00086) [2022-07-09 07:37:17,098][26022] Updated weights on worker 0-0, policy_version 151052 (0.00099) [2022-07-09 07:37:17,720][25689] Fps is (10 sec: 5808.1, 60 sec: 5739.4, 300 sec: 5732.2). Total num frames: 154681344. Throughput: 0: 5932.9. Samples: 154679434. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:17,721][25689] Avg episode reward: [(0, '-58.452')] [2022-07-09 07:37:18,831][26022] Updated weights on worker 0-0, policy_version 151062 (0.00097) [2022-07-09 07:37:20,495][26022] Updated weights on worker 0-0, policy_version 151072 (0.00088) [2022-07-09 07:37:22,296][26022] Updated weights on worker 0-0, policy_version 151082 (0.00094) [2022-07-09 07:37:22,738][25689] Fps is (10 sec: 5927.4, 60 sec: 5738.8, 300 sec: 5735.8). Total num frames: 154710016. Throughput: 0: 5945.1. Samples: 154714418. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:22,738][25689] Avg episode reward: [(0, '-57.846')] [2022-07-09 07:37:24,050][26022] Updated weights on worker 0-0, policy_version 151092 (0.00085) [2022-07-09 07:37:25,895][26022] Updated weights on worker 0-0, policy_version 151102 (0.00093) [2022-07-09 07:37:27,712][26022] Updated weights on worker 0-0, policy_version 151112 (0.00091) [2022-07-09 07:37:27,752][25689] Fps is (10 sec: 5716.0, 60 sec: 5737.7, 300 sec: 5733.5). Total num frames: 154738688. Throughput: 0: 5208.3. Samples: 154732022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:27,753][25689] Avg episode reward: [(0, '-56.467')] [2022-07-09 07:37:29,374][26022] Updated weights on worker 0-0, policy_version 151122 (0.00089) [2022-07-09 07:37:31,291][26022] Updated weights on worker 0-0, policy_version 151132 (0.00085) [2022-07-09 07:37:32,819][25689] Fps is (10 sec: 5790.0, 60 sec: 5753.7, 300 sec: 5736.2). Total num frames: 154768384. Throughput: 0: 6038.2. Samples: 154766140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:32,819][25689] Avg episode reward: [(0, '-56.462')] [2022-07-09 07:37:32,828][26022] Updated weights on worker 0-0, policy_version 151142 (0.00087) [2022-07-09 07:37:34,929][26022] Updated weights on worker 0-0, policy_version 151152 (0.00135) [2022-07-09 07:37:36,412][26022] Updated weights on worker 0-0, policy_version 151162 (0.00092) [2022-07-09 07:37:37,842][25689] Fps is (10 sec: 5683.1, 60 sec: 5735.5, 300 sec: 5729.2). Total num frames: 154796032. Throughput: 0: 6037.0. Samples: 154800912. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:37,844][25689] Avg episode reward: [(0, '-55.559')] [2022-07-09 07:37:38,446][26022] Updated weights on worker 0-0, policy_version 151172 (0.00092) [2022-07-09 07:37:40,038][26022] Updated weights on worker 0-0, policy_version 151182 (0.00087) [2022-07-09 07:37:41,901][26022] Updated weights on worker 0-0, policy_version 151192 (0.00088) [2022-07-09 07:37:42,859][25689] Fps is (10 sec: 5711.4, 60 sec: 5736.8, 300 sec: 5729.0). Total num frames: 154825728. Throughput: 0: 5159.7. Samples: 154818238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:42,861][25689] Avg episode reward: [(0, '-55.482')] [2022-07-09 07:37:43,751][26022] Updated weights on worker 0-0, policy_version 151202 (0.00088) [2022-07-09 07:37:45,390][26022] Updated weights on worker 0-0, policy_version 151212 (0.00091) [2022-07-09 07:37:47,272][26022] Updated weights on worker 0-0, policy_version 151222 (0.00085) [2022-07-09 07:37:47,876][25689] Fps is (10 sec: 5919.5, 60 sec: 5752.9, 300 sec: 5733.9). Total num frames: 154855424. Throughput: 0: 6024.7. Samples: 154853260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:47,877][25689] Avg episode reward: [(0, '-55.823')] [2022-07-09 07:37:48,957][26022] Updated weights on worker 0-0, policy_version 151232 (0.00086) [2022-07-09 07:37:50,664][26022] Updated weights on worker 0-0, policy_version 151242 (0.00085) [2022-07-09 07:37:52,519][26022] Updated weights on worker 0-0, policy_version 151252 (0.00083) [2022-07-09 07:37:52,915][25689] Fps is (10 sec: 5702.4, 60 sec: 5722.4, 300 sec: 5729.8). Total num frames: 154883072. Throughput: 0: 6071.3. Samples: 154888152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:52,916][25689] Avg episode reward: [(0, '-55.797')] [2022-07-09 07:37:54,181][26022] Updated weights on worker 0-0, policy_version 151262 (0.00084) [2022-07-09 07:37:56,141][26022] Updated weights on worker 0-0, policy_version 151272 (0.00085) [2022-07-09 07:37:57,599][26022] Updated weights on worker 0-0, policy_version 151282 (0.00096) [2022-07-09 07:37:57,924][25689] Fps is (10 sec: 5808.4, 60 sec: 5773.7, 300 sec: 5740.2). Total num frames: 154913792. Throughput: 0: 5216.7. Samples: 154905674. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:37:57,925][25689] Avg episode reward: [(0, '-55.890')] [2022-07-09 07:37:59,682][26022] Updated weights on worker 0-0, policy_version 151292 (0.00085) [2022-07-09 07:38:01,315][26022] Updated weights on worker 0-0, policy_version 151302 (0.00095) [2022-07-09 07:38:02,937][25689] Fps is (10 sec: 5619.8, 60 sec: 5723.1, 300 sec: 5733.2). Total num frames: 154939392. Throughput: 0: 6082.2. Samples: 154940356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:02,939][25689] Avg episode reward: [(0, '-56.325')] [2022-07-09 07:38:03,359][26022] Updated weights on worker 0-0, policy_version 151312 (0.00088) [2022-07-09 07:38:05,414][26022] Updated weights on worker 0-0, policy_version 151322 (0.00828) [2022-07-09 07:38:06,953][26022] Updated weights on worker 0-0, policy_version 151332 (0.00082) [2022-07-09 07:38:07,954][25689] Fps is (10 sec: 5513.3, 60 sec: 5741.4, 300 sec: 5738.1). Total num frames: 154969088. Throughput: 0: 5964.9. Samples: 154973026. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:07,954][25689] Avg episode reward: [(0, '-56.584')] [2022-07-09 07:38:08,825][26022] Updated weights on worker 0-0, policy_version 151342 (0.00086) [2022-07-09 07:38:10,474][26022] Updated weights on worker 0-0, policy_version 151352 (0.00085) [2022-07-09 07:38:12,095][26022] Updated weights on worker 0-0, policy_version 151362 (0.00088) [2022-07-09 07:38:13,029][25689] Fps is (10 sec: 5783.7, 60 sec: 5755.8, 300 sec: 5738.1). Total num frames: 154997760. Throughput: 0: 5961.3. Samples: 155008054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:13,029][25689] Avg episode reward: [(0, '-56.797')] [2022-07-09 07:38:14,049][26022] Updated weights on worker 0-0, policy_version 151372 (0.00090) [2022-07-09 07:38:15,656][26022] Updated weights on worker 0-0, policy_version 151382 (0.00093) [2022-07-09 07:38:17,572][26022] Updated weights on worker 0-0, policy_version 151392 (0.00084) [2022-07-09 07:38:18,041][25689] Fps is (10 sec: 5786.2, 60 sec: 5737.8, 300 sec: 5735.4). Total num frames: 155027456. Throughput: 0: 5955.4. Samples: 155025478. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:18,042][25689] Avg episode reward: [(0, '-56.136')] [2022-07-09 07:38:19,443][26022] Updated weights on worker 0-0, policy_version 151402 (0.00086) [2022-07-09 07:38:21,085][26022] Updated weights on worker 0-0, policy_version 151412 (0.00553) [2022-07-09 07:38:22,925][26022] Updated weights on worker 0-0, policy_version 151422 (0.00088) [2022-07-09 07:38:23,060][25689] Fps is (10 sec: 5920.8, 60 sec: 5754.7, 300 sec: 5736.8). Total num frames: 155057152. Throughput: 0: 5975.4. Samples: 155060598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:23,060][25689] Avg episode reward: [(0, '-56.250')] [2022-07-09 07:38:24,619][26022] Updated weights on worker 0-0, policy_version 151432 (0.00091) [2022-07-09 07:38:26,243][26022] Updated weights on worker 0-0, policy_version 151442 (0.00091) [2022-07-09 07:38:28,080][25689] Fps is (10 sec: 5610.1, 60 sec: 5720.2, 300 sec: 5734.0). Total num frames: 155083776. Throughput: 0: 6056.9. Samples: 155094930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:28,081][25689] Avg episode reward: [(0, '-56.158')] [2022-07-09 07:38:28,404][26022] Updated weights on worker 0-0, policy_version 151452 (0.00093) [2022-07-09 07:38:29,871][26022] Updated weights on worker 0-0, policy_version 151462 (0.00089) [2022-07-09 07:38:31,882][26022] Updated weights on worker 0-0, policy_version 151472 (0.00085) [2022-07-09 07:38:33,182][25689] Fps is (10 sec: 5766.3, 60 sec: 5750.8, 300 sec: 5739.7). Total num frames: 155115520. Throughput: 0: 5170.7. Samples: 155112260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:33,183][25689] Avg episode reward: [(0, '-55.760')] [2022-07-09 07:38:33,445][26022] Updated weights on worker 0-0, policy_version 151482 (0.00087) [2022-07-09 07:38:35,461][26022] Updated weights on worker 0-0, policy_version 151492 (0.00095) [2022-07-09 07:38:36,778][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:38:36,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000151500_155136000.pth [2022-07-09 07:38:36,788][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000149482_153069568.pth [2022-07-09 07:38:37,087][26022] Updated weights on worker 0-0, policy_version 151502 (0.00089) [2022-07-09 07:38:38,206][25689] Fps is (10 sec: 5865.6, 60 sec: 5750.8, 300 sec: 5736.1). Total num frames: 155143168. Throughput: 0: 6027.8. Samples: 155147026. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:38,206][25689] Avg episode reward: [(0, '-55.188')] [2022-07-09 07:38:38,889][26022] Updated weights on worker 0-0, policy_version 151512 (0.00093) [2022-07-09 07:38:40,595][26022] Updated weights on worker 0-0, policy_version 151522 (0.00083) [2022-07-09 07:38:42,575][26022] Updated weights on worker 0-0, policy_version 151532 (0.00082) [2022-07-09 07:38:43,223][25689] Fps is (10 sec: 5812.8, 60 sec: 5767.6, 300 sec: 5739.3). Total num frames: 155173888. Throughput: 0: 6012.2. Samples: 155181826. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:43,224][25689] Avg episode reward: [(0, '-56.006')] [2022-07-09 07:38:44,152][26022] Updated weights on worker 0-0, policy_version 151542 (0.00089) [2022-07-09 07:38:45,910][26022] Updated weights on worker 0-0, policy_version 151552 (0.00087) [2022-07-09 07:38:47,823][26022] Updated weights on worker 0-0, policy_version 151562 (0.00087) [2022-07-09 07:38:48,290][25689] Fps is (10 sec: 5787.9, 60 sec: 5728.9, 300 sec: 5739.3). Total num frames: 155201536. Throughput: 0: 5157.9. Samples: 155199172. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:48,291][25689] Avg episode reward: [(0, '-56.123')] [2022-07-09 07:38:49,267][26022] Updated weights on worker 0-0, policy_version 151572 (0.00090) [2022-07-09 07:38:51,478][26022] Updated weights on worker 0-0, policy_version 151582 (0.00085) [2022-07-09 07:38:52,886][26022] Updated weights on worker 0-0, policy_version 151592 (0.00086) [2022-07-09 07:38:53,394][25689] Fps is (10 sec: 5739.1, 60 sec: 5773.7, 300 sec: 5740.9). Total num frames: 155232256. Throughput: 0: 6029.2. Samples: 155234120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:53,394][25689] Avg episode reward: [(0, '-55.632')] [2022-07-09 07:38:54,813][26022] Updated weights on worker 0-0, policy_version 151602 (0.00093) [2022-07-09 07:38:56,674][26022] Updated weights on worker 0-0, policy_version 151612 (0.00088) [2022-07-09 07:38:58,329][26022] Updated weights on worker 0-0, policy_version 151622 (0.00088) [2022-07-09 07:38:58,423][25689] Fps is (10 sec: 5861.7, 60 sec: 5738.0, 300 sec: 5737.6). Total num frames: 155260928. Throughput: 0: 6024.6. Samples: 155268824. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:38:58,423][25689] Avg episode reward: [(0, '-55.671')] [2022-07-09 07:39:00,136][26022] Updated weights on worker 0-0, policy_version 151632 (0.00087) [2022-07-09 07:39:01,836][26022] Updated weights on worker 0-0, policy_version 151642 (0.00094) [2022-07-09 07:39:03,452][25689] Fps is (10 sec: 5497.4, 60 sec: 5753.3, 300 sec: 5740.8). Total num frames: 155287552. Throughput: 0: 5163.1. Samples: 155286270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 07:39:03,453][25689] Avg episode reward: [(0, '-55.605')] [2022-07-09 07:39:03,918][26022] Updated weights on worker 0-0, policy_version 151652 (0.00484) [2022-07-09 07:39:05,923][26022] Updated weights on worker 0-0, policy_version 151662 (0.00086) [2022-07-09 07:39:07,334][26022] Updated weights on worker 0-0, policy_version 151672 (0.00090) [2022-07-09 07:39:08,461][25689] Fps is (10 sec: 5610.6, 60 sec: 5754.0, 300 sec: 5741.9). Total num frames: 155317248. Throughput: 0: 5943.8. Samples: 155319060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:08,461][25689] Avg episode reward: [(0, '-56.162')] [2022-07-09 07:39:09,441][26022] Updated weights on worker 0-0, policy_version 151682 (0.00080) [2022-07-09 07:39:10,936][26022] Updated weights on worker 0-0, policy_version 151692 (0.00090) [2022-07-09 07:39:12,811][26022] Updated weights on worker 0-0, policy_version 151702 (0.00088) [2022-07-09 07:39:13,544][25689] Fps is (10 sec: 5783.7, 60 sec: 5753.2, 300 sec: 5740.7). Total num frames: 155345920. Throughput: 0: 5929.1. Samples: 155353594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:13,545][25689] Avg episode reward: [(0, '-55.656')] [2022-07-09 07:39:14,687][26022] Updated weights on worker 0-0, policy_version 151712 (0.00086) [2022-07-09 07:39:16,232][26022] Updated weights on worker 0-0, policy_version 151722 (0.00091) [2022-07-09 07:39:18,307][26022] Updated weights on worker 0-0, policy_version 151732 (0.00092) [2022-07-09 07:39:18,552][25689] Fps is (10 sec: 5784.2, 60 sec: 5753.7, 300 sec: 5745.7). Total num frames: 155375616. Throughput: 0: 5073.8. Samples: 155370954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:18,552][25689] Avg episode reward: [(0, '-55.340')] [2022-07-09 07:39:19,894][26022] Updated weights on worker 0-0, policy_version 151742 (0.00090) [2022-07-09 07:39:21,716][26022] Updated weights on worker 0-0, policy_version 151752 (0.00084) [2022-07-09 07:39:23,523][26022] Updated weights on worker 0-0, policy_version 151762 (0.00085) [2022-07-09 07:39:23,560][25689] Fps is (10 sec: 5827.8, 60 sec: 5737.8, 300 sec: 5742.3). Total num frames: 155404288. Throughput: 0: 5942.0. Samples: 155405748. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:23,560][25689] Avg episode reward: [(0, '-55.668')] [2022-07-09 07:39:25,243][26022] Updated weights on worker 0-0, policy_version 151772 (0.00896) [2022-07-09 07:39:26,862][26022] Updated weights on worker 0-0, policy_version 151782 (0.00093) [2022-07-09 07:39:28,584][25689] Fps is (10 sec: 5715.9, 60 sec: 5771.3, 300 sec: 5743.5). Total num frames: 155432960. Throughput: 0: 6027.0. Samples: 155440344. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:28,585][25689] Avg episode reward: [(0, '-55.915')] [2022-07-09 07:39:28,828][26022] Updated weights on worker 0-0, policy_version 151792 (0.00088) [2022-07-09 07:39:30,648][26022] Updated weights on worker 0-0, policy_version 151802 (0.00088) [2022-07-09 07:39:32,363][26022] Updated weights on worker 0-0, policy_version 151812 (0.00086) [2022-07-09 07:39:33,736][25689] Fps is (10 sec: 5735.8, 60 sec: 5732.7, 300 sec: 5751.2). Total num frames: 155462656. Throughput: 0: 5143.6. Samples: 155457452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:33,736][25689] Avg episode reward: [(0, '-55.687')] [2022-07-09 07:39:34,145][26022] Updated weights on worker 0-0, policy_version 151822 (0.00092) [2022-07-09 07:39:35,859][26022] Updated weights on worker 0-0, policy_version 151832 (0.00087) [2022-07-09 07:39:37,704][26022] Updated weights on worker 0-0, policy_version 151842 (0.00093) [2022-07-09 07:39:38,787][25689] Fps is (10 sec: 5821.0, 60 sec: 5763.9, 300 sec: 5744.6). Total num frames: 155492352. Throughput: 0: 5994.4. Samples: 155492254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:38,788][25689] Avg episode reward: [(0, '-55.584')] [2022-07-09 07:39:39,502][26022] Updated weights on worker 0-0, policy_version 151852 (0.00089) [2022-07-09 07:39:41,307][26022] Updated weights on worker 0-0, policy_version 151862 (0.00089) [2022-07-09 07:39:42,946][26022] Updated weights on worker 0-0, policy_version 151872 (0.00081) [2022-07-09 07:39:43,823][25689] Fps is (10 sec: 5684.9, 60 sec: 5711.5, 300 sec: 5741.3). Total num frames: 155520000. Throughput: 0: 5983.1. Samples: 155526984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:43,823][25689] Avg episode reward: [(0, '-56.464')] [2022-07-09 07:39:44,778][26022] Updated weights on worker 0-0, policy_version 151882 (0.00084) [2022-07-09 07:39:46,560][26022] Updated weights on worker 0-0, policy_version 151892 (0.00087) [2022-07-09 07:39:48,334][26022] Updated weights on worker 0-0, policy_version 151902 (0.00082) [2022-07-09 07:39:48,833][25689] Fps is (10 sec: 5708.4, 60 sec: 5750.7, 300 sec: 5735.9). Total num frames: 155549696. Throughput: 0: 5130.6. Samples: 155544232. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:48,833][25689] Avg episode reward: [(0, '-56.537')] [2022-07-09 07:39:50,032][26022] Updated weights on worker 0-0, policy_version 151912 (0.00088) [2022-07-09 07:39:51,810][26022] Updated weights on worker 0-0, policy_version 151922 (0.00095) [2022-07-09 07:39:53,560][26022] Updated weights on worker 0-0, policy_version 151932 (0.00105) [2022-07-09 07:39:53,907][25689] Fps is (10 sec: 5889.7, 60 sec: 5736.5, 300 sec: 5744.9). Total num frames: 155579392. Throughput: 0: 6043.3. Samples: 155579352. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:53,907][25689] Avg episode reward: [(0, '-57.050')] [2022-07-09 07:39:55,297][26022] Updated weights on worker 0-0, policy_version 151942 (0.00080) [2022-07-09 07:39:57,043][26022] Updated weights on worker 0-0, policy_version 151952 (0.00085) [2022-07-09 07:39:58,915][25689] Fps is (10 sec: 5789.1, 60 sec: 5738.5, 300 sec: 5745.7). Total num frames: 155608064. Throughput: 0: 6045.5. Samples: 155613938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:39:58,917][25689] Avg episode reward: [(0, '-57.169')] [2022-07-09 07:39:58,958][26022] Updated weights on worker 0-0, policy_version 151962 (0.00091) [2022-07-09 07:40:00,938][26022] Updated weights on worker 0-0, policy_version 151972 (0.00081) [2022-07-09 07:40:02,839][26022] Updated weights on worker 0-0, policy_version 151982 (0.00087) [2022-07-09 07:40:04,020][25689] Fps is (10 sec: 5467.9, 60 sec: 5731.4, 300 sec: 5738.4). Total num frames: 155634688. Throughput: 0: 5173.2. Samples: 155631466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:04,022][25689] Avg episode reward: [(0, '-57.151')] [2022-07-09 07:40:04,767][26022] Updated weights on worker 0-0, policy_version 151992 (0.00085) [2022-07-09 07:40:06,447][26022] Updated weights on worker 0-0, policy_version 152002 (0.00083) [2022-07-09 07:40:08,139][26022] Updated weights on worker 0-0, policy_version 152012 (0.00088) [2022-07-09 07:40:09,035][25689] Fps is (10 sec: 5565.4, 60 sec: 5730.8, 300 sec: 5736.8). Total num frames: 155664384. Throughput: 0: 5919.0. Samples: 155663808. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:09,037][25689] Avg episode reward: [(0, '-55.398')] [2022-07-09 07:40:10,163][26022] Updated weights on worker 0-0, policy_version 152022 (0.00082) [2022-07-09 07:40:11,795][26022] Updated weights on worker 0-0, policy_version 152032 (0.00089) [2022-07-09 07:40:13,619][26022] Updated weights on worker 0-0, policy_version 152042 (0.00085) [2022-07-09 07:40:14,114][25689] Fps is (10 sec: 5782.0, 60 sec: 5731.1, 300 sec: 5737.0). Total num frames: 155693056. Throughput: 0: 5887.3. Samples: 155698320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:14,115][25689] Avg episode reward: [(0, '-55.743')] [2022-07-09 07:40:15,523][26022] Updated weights on worker 0-0, policy_version 152052 (0.00084) [2022-07-09 07:40:17,108][26022] Updated weights on worker 0-0, policy_version 152062 (0.00080) [2022-07-09 07:40:19,126][25689] Fps is (10 sec: 5580.9, 60 sec: 5696.9, 300 sec: 5730.6). Total num frames: 155720704. Throughput: 0: 5035.5. Samples: 155715712. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:19,128][25689] Avg episode reward: [(0, '-55.764')] [2022-07-09 07:40:19,127][26022] Updated weights on worker 0-0, policy_version 152072 (0.00080) [2022-07-09 07:40:20,627][26022] Updated weights on worker 0-0, policy_version 152082 (0.00088) [2022-07-09 07:40:22,415][26022] Updated weights on worker 0-0, policy_version 152092 (0.00085) [2022-07-09 07:40:24,098][26022] Updated weights on worker 0-0, policy_version 152102 (0.00082) [2022-07-09 07:40:24,144][25689] Fps is (10 sec: 5921.5, 60 sec: 5746.7, 300 sec: 5747.6). Total num frames: 155752448. Throughput: 0: 5910.5. Samples: 155750414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:24,146][25689] Avg episode reward: [(0, '-55.250')] [2022-07-09 07:40:26,067][26022] Updated weights on worker 0-0, policy_version 152112 (0.00089) [2022-07-09 07:40:27,818][26022] Updated weights on worker 0-0, policy_version 152122 (0.00086) [2022-07-09 07:40:29,152][25689] Fps is (10 sec: 5924.2, 60 sec: 5731.4, 300 sec: 5734.8). Total num frames: 155780096. Throughput: 0: 6020.3. Samples: 155784918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:29,153][25689] Avg episode reward: [(0, '-55.416')] [2022-07-09 07:40:29,787][26022] Updated weights on worker 0-0, policy_version 152132 (0.00088) [2022-07-09 07:40:31,327][26022] Updated weights on worker 0-0, policy_version 152142 (0.00090) [2022-07-09 07:40:33,326][26022] Updated weights on worker 0-0, policy_version 152152 (0.00087) [2022-07-09 07:40:34,243][25689] Fps is (10 sec: 5678.2, 60 sec: 5737.0, 300 sec: 5737.9). Total num frames: 155809792. Throughput: 0: 5163.5. Samples: 155802256. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:34,245][25689] Avg episode reward: [(0, '-56.050')] [2022-07-09 07:40:34,795][26022] Updated weights on worker 0-0, policy_version 152162 (0.00085) [2022-07-09 07:40:36,885][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:40:36,895][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000152172_155824128.pth [2022-07-09 07:40:36,895][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000150151_153754624.pth [2022-07-09 07:40:36,899][26022] Updated weights on worker 0-0, policy_version 152172 (0.00086) [2022-07-09 07:40:38,414][26022] Updated weights on worker 0-0, policy_version 152182 (0.00084) [2022-07-09 07:40:39,274][25689] Fps is (10 sec: 5867.6, 60 sec: 5739.1, 300 sec: 5740.9). Total num frames: 155839488. Throughput: 0: 6012.4. Samples: 155836846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:39,274][25689] Avg episode reward: [(0, '-56.532')] [2022-07-09 07:40:40,350][26022] Updated weights on worker 0-0, policy_version 152192 (0.00484) [2022-07-09 07:40:41,967][26022] Updated weights on worker 0-0, policy_version 152202 (0.00085) [2022-07-09 07:40:43,859][26022] Updated weights on worker 0-0, policy_version 152212 (0.00086) [2022-07-09 07:40:44,340][25689] Fps is (10 sec: 5781.2, 60 sec: 5753.1, 300 sec: 5736.4). Total num frames: 155868160. Throughput: 0: 5996.4. Samples: 155871512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:44,340][25689] Avg episode reward: [(0, '-57.201')] [2022-07-09 07:40:45,462][26022] Updated weights on worker 0-0, policy_version 152222 (0.00081) [2022-07-09 07:40:47,258][26022] Updated weights on worker 0-0, policy_version 152232 (0.00087) [2022-07-09 07:40:48,999][26022] Updated weights on worker 0-0, policy_version 152242 (0.00090) [2022-07-09 07:40:49,373][25689] Fps is (10 sec: 5779.3, 60 sec: 5750.9, 300 sec: 5740.1). Total num frames: 155897856. Throughput: 0: 6015.2. Samples: 155906554. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:49,374][25689] Avg episode reward: [(0, '-57.124')] [2022-07-09 07:40:50,815][26022] Updated weights on worker 0-0, policy_version 152252 (0.00087) [2022-07-09 07:40:52,663][26022] Updated weights on worker 0-0, policy_version 152262 (0.00101) [2022-07-09 07:40:54,424][25689] Fps is (10 sec: 5787.9, 60 sec: 5736.1, 300 sec: 5739.3). Total num frames: 155926528. Throughput: 0: 6023.7. Samples: 155923818. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:54,426][25689] Avg episode reward: [(0, '-55.926')] [2022-07-09 07:40:54,432][26022] Updated weights on worker 0-0, policy_version 152272 (0.00090) [2022-07-09 07:40:56,216][26022] Updated weights on worker 0-0, policy_version 152282 (0.00092) [2022-07-09 07:40:58,089][26022] Updated weights on worker 0-0, policy_version 152292 (0.00630) [2022-07-09 07:40:59,489][25689] Fps is (10 sec: 5769.7, 60 sec: 5747.7, 300 sec: 5748.5). Total num frames: 155956224. Throughput: 0: 6025.6. Samples: 155958658. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:40:59,490][25689] Avg episode reward: [(0, '-55.901')] [2022-07-09 07:40:59,643][26022] Updated weights on worker 0-0, policy_version 152302 (0.00094) [2022-07-09 07:41:01,483][26022] Updated weights on worker 0-0, policy_version 152312 (0.00089) [2022-07-09 07:41:03,586][26022] Updated weights on worker 0-0, policy_version 152322 (0.00090) [2022-07-09 07:41:04,544][25689] Fps is (10 sec: 5767.7, 60 sec: 5786.2, 300 sec: 5747.5). Total num frames: 155984896. Throughput: 0: 5921.9. Samples: 155991160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:41:04,544][25689] Avg episode reward: [(0, '-55.654')] [2022-07-09 07:41:05,537][26022] Updated weights on worker 0-0, policy_version 152332 (0.00084) [2022-07-09 07:41:07,174][26022] Updated weights on worker 0-0, policy_version 152342 (0.00108) [2022-07-09 07:41:08,949][26022] Updated weights on worker 0-0, policy_version 152352 (0.00087) [2022-07-09 07:41:09,619][25689] Fps is (10 sec: 5357.8, 60 sec: 5712.9, 300 sec: 5733.1). Total num frames: 156010496. Throughput: 0: 5042.8. Samples: 156008648. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 07:41:09,621][25689] Avg episode reward: [(0, '-55.644')] [2022-07-09 07:41:10,663][26022] Updated weights on worker 0-0, policy_version 152362 (0.00140) [2022-07-09 07:41:12,631][26022] Updated weights on worker 0-0, policy_version 152372 (0.00085) [2022-07-09 07:41:14,196][26022] Updated weights on worker 0-0, policy_version 152382 (0.00088) [2022-07-09 07:41:14,685][25689] Fps is (10 sec: 5553.4, 60 sec: 5748.0, 300 sec: 5738.8). Total num frames: 156041216. Throughput: 0: 5891.0. Samples: 156043178. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:14,688][25689] Avg episode reward: [(0, '-54.234')] [2022-07-09 07:41:16,103][26022] Updated weights on worker 0-0, policy_version 152392 (0.00085) [2022-07-09 07:41:17,855][26022] Updated weights on worker 0-0, policy_version 152402 (0.01014) [2022-07-09 07:41:19,594][26022] Updated weights on worker 0-0, policy_version 152412 (0.00081) [2022-07-09 07:41:19,718][25689] Fps is (10 sec: 5982.6, 60 sec: 5779.9, 300 sec: 5741.9). Total num frames: 156070912. Throughput: 0: 5891.8. Samples: 156077838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:19,719][25689] Avg episode reward: [(0, '-54.845')] [2022-07-09 07:41:21,247][26022] Updated weights on worker 0-0, policy_version 152422 (0.00081) [2022-07-09 07:41:23,398][26022] Updated weights on worker 0-0, policy_version 152432 (0.00108) [2022-07-09 07:41:24,774][25689] Fps is (10 sec: 5785.7, 60 sec: 5725.6, 300 sec: 5740.9). Total num frames: 156099584. Throughput: 0: 5128.2. Samples: 156094902. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:24,775][25689] Avg episode reward: [(0, '-54.404')] [2022-07-09 07:41:25,085][26022] Updated weights on worker 0-0, policy_version 152442 (0.00095) [2022-07-09 07:41:26,947][26022] Updated weights on worker 0-0, policy_version 152452 (0.00085) [2022-07-09 07:41:28,480][26022] Updated weights on worker 0-0, policy_version 152462 (0.00089) [2022-07-09 07:41:29,792][25689] Fps is (10 sec: 5590.2, 60 sec: 5724.5, 300 sec: 5738.1). Total num frames: 156127232. Throughput: 0: 5984.1. Samples: 156129368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:29,796][25689] Avg episode reward: [(0, '-54.641')] [2022-07-09 07:41:30,376][26022] Updated weights on worker 0-0, policy_version 152472 (0.00084) [2022-07-09 07:41:32,188][26022] Updated weights on worker 0-0, policy_version 152482 (0.00088) [2022-07-09 07:41:34,047][26022] Updated weights on worker 0-0, policy_version 152492 (0.00089) [2022-07-09 07:41:34,920][25689] Fps is (10 sec: 5651.8, 60 sec: 5721.1, 300 sec: 5739.3). Total num frames: 156156928. Throughput: 0: 5950.5. Samples: 156163584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:34,920][25689] Avg episode reward: [(0, '-54.218')] [2022-07-09 07:41:35,771][26022] Updated weights on worker 0-0, policy_version 152502 (0.00089) [2022-07-09 07:41:37,746][26022] Updated weights on worker 0-0, policy_version 152512 (0.00074) [2022-07-09 07:41:39,252][26022] Updated weights on worker 0-0, policy_version 152522 (0.00048) [2022-07-09 07:41:39,935][25689] Fps is (10 sec: 5855.8, 60 sec: 5722.5, 300 sec: 5739.6). Total num frames: 156186624. Throughput: 0: 5086.0. Samples: 156180664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:39,935][25689] Avg episode reward: [(0, '-54.586')] [2022-07-09 07:41:41,347][26022] Updated weights on worker 0-0, policy_version 152532 (0.00085) [2022-07-09 07:41:42,772][26022] Updated weights on worker 0-0, policy_version 152542 (0.00089) [2022-07-09 07:41:44,676][26022] Updated weights on worker 0-0, policy_version 152552 (0.00089) [2022-07-09 07:41:44,949][25689] Fps is (10 sec: 5717.8, 60 sec: 5710.5, 300 sec: 5736.1). Total num frames: 156214272. Throughput: 0: 5973.1. Samples: 156215412. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:44,951][25689] Avg episode reward: [(0, '-53.655')] [2022-07-09 07:41:46,419][26022] Updated weights on worker 0-0, policy_version 152562 (0.00085) [2022-07-09 07:41:48,313][26022] Updated weights on worker 0-0, policy_version 152572 (0.00088) [2022-07-09 07:41:49,999][25689] Fps is (10 sec: 5698.0, 60 sec: 5709.0, 300 sec: 5736.6). Total num frames: 156243968. Throughput: 0: 5967.6. Samples: 156249952. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:49,999][25689] Avg episode reward: [(0, '-54.252')] [2022-07-09 07:41:50,007][26022] Updated weights on worker 0-0, policy_version 152582 (0.00090) [2022-07-09 07:41:51,693][26022] Updated weights on worker 0-0, policy_version 152592 (0.00082) [2022-07-09 07:41:53,438][26022] Updated weights on worker 0-0, policy_version 152602 (0.00085) [2022-07-09 07:41:55,075][25689] Fps is (10 sec: 5764.2, 60 sec: 5706.6, 300 sec: 5738.8). Total num frames: 156272640. Throughput: 0: 5154.7. Samples: 156267478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:41:55,075][25689] Avg episode reward: [(0, '-54.911')] [2022-07-09 07:41:55,339][26022] Updated weights on worker 0-0, policy_version 152612 (0.00085) [2022-07-09 07:41:56,920][26022] Updated weights on worker 0-0, policy_version 152622 (0.00087) [2022-07-09 07:41:58,938][26022] Updated weights on worker 0-0, policy_version 152632 (0.00087) [2022-07-09 07:42:00,082][25689] Fps is (10 sec: 5788.8, 60 sec: 5712.2, 300 sec: 5742.4). Total num frames: 156302336. Throughput: 0: 6054.2. Samples: 156302636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:00,082][25689] Avg episode reward: [(0, '-55.529')] [2022-07-09 07:42:00,423][26022] Updated weights on worker 0-0, policy_version 152642 (0.00085) [2022-07-09 07:42:02,579][26022] Updated weights on worker 0-0, policy_version 152652 (0.00102) [2022-07-09 07:42:04,563][26022] Updated weights on worker 0-0, policy_version 152662 (0.00086) [2022-07-09 07:42:05,133][25689] Fps is (10 sec: 5599.4, 60 sec: 5678.6, 300 sec: 5735.1). Total num frames: 156328960. Throughput: 0: 5942.5. Samples: 156335356. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:05,134][25689] Avg episode reward: [(0, '-56.255')] [2022-07-09 07:42:06,276][26022] Updated weights on worker 0-0, policy_version 152672 (0.00092) [2022-07-09 07:42:08,047][26022] Updated weights on worker 0-0, policy_version 152682 (0.00083) [2022-07-09 07:42:09,673][26022] Updated weights on worker 0-0, policy_version 152692 (0.00088) [2022-07-09 07:42:10,191][25689] Fps is (10 sec: 5672.5, 60 sec: 5764.8, 300 sec: 5745.3). Total num frames: 156359680. Throughput: 0: 5087.2. Samples: 156352672. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:10,191][25689] Avg episode reward: [(0, '-56.107')] [2022-07-09 07:42:11,524][26022] Updated weights on worker 0-0, policy_version 152702 (0.00086) [2022-07-09 07:42:13,125][26022] Updated weights on worker 0-0, policy_version 152712 (0.00085) [2022-07-09 07:42:14,993][26022] Updated weights on worker 0-0, policy_version 152722 (0.00086) [2022-07-09 07:42:15,268][25689] Fps is (10 sec: 5860.3, 60 sec: 5730.0, 300 sec: 5736.9). Total num frames: 156388352. Throughput: 0: 5950.6. Samples: 156387638. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:15,268][25689] Avg episode reward: [(0, '-56.119')] [2022-07-09 07:42:16,739][26022] Updated weights on worker 0-0, policy_version 152732 (0.00091) [2022-07-09 07:42:18,625][26022] Updated weights on worker 0-0, policy_version 152742 (0.00092) [2022-07-09 07:42:20,215][26022] Updated weights on worker 0-0, policy_version 152752 (0.00086) [2022-07-09 07:42:20,301][25689] Fps is (10 sec: 5772.9, 60 sec: 5729.9, 300 sec: 5740.1). Total num frames: 156418048. Throughput: 0: 5909.6. Samples: 156422126. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:20,302][25689] Avg episode reward: [(0, '-54.758')] [2022-07-09 07:42:22,285][26022] Updated weights on worker 0-0, policy_version 152762 (0.00086) [2022-07-09 07:42:23,811][26022] Updated weights on worker 0-0, policy_version 152772 (0.00087) [2022-07-09 07:42:25,347][25689] Fps is (10 sec: 5689.5, 60 sec: 5714.0, 300 sec: 5736.0). Total num frames: 156445696. Throughput: 0: 5160.6. Samples: 156439670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:25,348][25689] Avg episode reward: [(0, '-53.427')] [2022-07-09 07:42:25,760][26022] Updated weights on worker 0-0, policy_version 152782 (0.00090) [2022-07-09 07:42:27,450][26022] Updated weights on worker 0-0, policy_version 152792 (0.00088) [2022-07-09 07:42:29,243][26022] Updated weights on worker 0-0, policy_version 152802 (0.00085) [2022-07-09 07:42:30,362][25689] Fps is (10 sec: 5699.8, 60 sec: 5748.1, 300 sec: 5737.0). Total num frames: 156475392. Throughput: 0: 6047.0. Samples: 156474648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:30,363][25689] Avg episode reward: [(0, '-52.354')] [2022-07-09 07:42:30,967][26022] Updated weights on worker 0-0, policy_version 152812 (0.00082) [2022-07-09 07:42:32,724][26022] Updated weights on worker 0-0, policy_version 152822 (0.00098) [2022-07-09 07:42:34,487][26022] Updated weights on worker 0-0, policy_version 152832 (0.00088) [2022-07-09 07:42:35,420][25689] Fps is (10 sec: 5896.3, 60 sec: 5754.7, 300 sec: 5743.2). Total num frames: 156505088. Throughput: 0: 6013.6. Samples: 156508822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:35,420][25689] Avg episode reward: [(0, '-52.495')] [2022-07-09 07:42:36,309][26022] Updated weights on worker 0-0, policy_version 152842 (0.00086) [2022-07-09 07:42:37,008][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:42:37,023][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000152845_156513280.pth [2022-07-09 07:42:37,024][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000150826_154445824.pth [2022-07-09 07:42:38,151][26022] Updated weights on worker 0-0, policy_version 152852 (0.00091) [2022-07-09 07:42:39,991][26022] Updated weights on worker 0-0, policy_version 152862 (0.00098) [2022-07-09 07:42:40,426][25689] Fps is (10 sec: 5799.6, 60 sec: 5738.6, 300 sec: 5740.0). Total num frames: 156533760. Throughput: 0: 5174.3. Samples: 156526258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:40,427][25689] Avg episode reward: [(0, '-52.778')] [2022-07-09 07:42:41,609][26022] Updated weights on worker 0-0, policy_version 152872 (0.00092) [2022-07-09 07:42:43,634][26022] Updated weights on worker 0-0, policy_version 152882 (0.00092) [2022-07-09 07:42:45,275][26022] Updated weights on worker 0-0, policy_version 152892 (0.00082) [2022-07-09 07:42:45,525][25689] Fps is (10 sec: 5674.4, 60 sec: 5747.5, 300 sec: 5735.0). Total num frames: 156562432. Throughput: 0: 6003.6. Samples: 156560814. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:45,527][25689] Avg episode reward: [(0, '-53.698')] [2022-07-09 07:42:47,131][26022] Updated weights on worker 0-0, policy_version 152902 (0.00107) [2022-07-09 07:42:48,776][26022] Updated weights on worker 0-0, policy_version 152912 (0.00089) [2022-07-09 07:42:50,529][25689] Fps is (10 sec: 5675.9, 60 sec: 5734.9, 300 sec: 5739.1). Total num frames: 156591104. Throughput: 0: 5964.1. Samples: 156594928. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:50,530][25689] Avg episode reward: [(0, '-54.722')] [2022-07-09 07:42:50,961][26022] Updated weights on worker 0-0, policy_version 152922 (0.00087) [2022-07-09 07:42:52,382][26022] Updated weights on worker 0-0, policy_version 152932 (0.00087) [2022-07-09 07:42:54,401][26022] Updated weights on worker 0-0, policy_version 152942 (0.00095) [2022-07-09 07:42:55,581][25689] Fps is (10 sec: 5804.7, 60 sec: 5754.2, 300 sec: 5734.8). Total num frames: 156620800. Throughput: 0: 5995.3. Samples: 156629694. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:42:55,582][25689] Avg episode reward: [(0, '-54.836')] [2022-07-09 07:42:55,835][26022] Updated weights on worker 0-0, policy_version 152952 (0.00822) [2022-07-09 07:42:57,839][26022] Updated weights on worker 0-0, policy_version 152962 (0.00087) [2022-07-09 07:42:59,555][26022] Updated weights on worker 0-0, policy_version 152972 (0.00084) [2022-07-09 07:43:00,657][25689] Fps is (10 sec: 5763.4, 60 sec: 5730.7, 300 sec: 5743.9). Total num frames: 156649472. Throughput: 0: 5970.2. Samples: 156647038. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:43:00,657][25689] Avg episode reward: [(0, '-54.663')] [2022-07-09 07:43:01,550][26022] Updated weights on worker 0-0, policy_version 152982 (0.00095) [2022-07-09 07:43:03,470][26022] Updated weights on worker 0-0, policy_version 152992 (0.00087) [2022-07-09 07:43:05,413][26022] Updated weights on worker 0-0, policy_version 153002 (0.00371) [2022-07-09 07:43:05,719][25689] Fps is (10 sec: 5353.0, 60 sec: 5712.7, 300 sec: 5729.3). Total num frames: 156675072. Throughput: 0: 5864.7. Samples: 156679246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:43:05,720][25689] Avg episode reward: [(0, '-54.141')] [2022-07-09 07:43:06,996][26022] Updated weights on worker 0-0, policy_version 153012 (0.00084) [2022-07-09 07:43:09,063][26022] Updated weights on worker 0-0, policy_version 153022 (0.00086) [2022-07-09 07:43:10,586][26022] Updated weights on worker 0-0, policy_version 153032 (0.00086) [2022-07-09 07:43:10,743][25689] Fps is (10 sec: 5482.5, 60 sec: 5699.0, 300 sec: 5733.7). Total num frames: 156704768. Throughput: 0: 5889.2. Samples: 156713968. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:43:10,743][25689] Avg episode reward: [(0, '-54.760')] [2022-07-09 07:43:12,599][26022] Updated weights on worker 0-0, policy_version 153042 (0.00090) [2022-07-09 07:43:14,164][26022] Updated weights on worker 0-0, policy_version 153052 (0.00081) [2022-07-09 07:43:15,817][25689] Fps is (10 sec: 5881.5, 60 sec: 5716.2, 300 sec: 5732.5). Total num frames: 156734464. Throughput: 0: 5022.2. Samples: 156731326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 07:43:15,818][25689] Avg episode reward: [(0, '-54.850')] [2022-07-09 07:43:16,017][26022] Updated weights on worker 0-0, policy_version 153062 (0.00098) [2022-07-09 07:43:17,773][26022] Updated weights on worker 0-0, policy_version 153072 (0.00092) [2022-07-09 07:43:19,598][26022] Updated weights on worker 0-0, policy_version 153082 (0.00087) [2022-07-09 07:43:20,863][25689] Fps is (10 sec: 5767.7, 60 sec: 5698.2, 300 sec: 5728.6). Total num frames: 156763136. Throughput: 0: 5887.6. Samples: 156766002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:20,863][25689] Avg episode reward: [(0, '-54.337')] [2022-07-09 07:43:21,275][26022] Updated weights on worker 0-0, policy_version 153092 (0.00097) [2022-07-09 07:43:23,351][26022] Updated weights on worker 0-0, policy_version 153102 (0.00084) [2022-07-09 07:43:24,824][26022] Updated weights on worker 0-0, policy_version 153112 (0.00085) [2022-07-09 07:43:25,896][25689] Fps is (10 sec: 5689.5, 60 sec: 5716.2, 300 sec: 5735.2). Total num frames: 156791808. Throughput: 0: 6023.4. Samples: 156800780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:25,897][25689] Avg episode reward: [(0, '-54.190')] [2022-07-09 07:43:26,927][26022] Updated weights on worker 0-0, policy_version 153122 (0.00094) [2022-07-09 07:43:28,242][26022] Updated weights on worker 0-0, policy_version 153132 (0.00081) [2022-07-09 07:43:30,521][26022] Updated weights on worker 0-0, policy_version 153142 (0.00086) [2022-07-09 07:43:30,907][25689] Fps is (10 sec: 5708.9, 60 sec: 5699.7, 300 sec: 5726.6). Total num frames: 156820480. Throughput: 0: 5153.9. Samples: 156817894. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:30,908][25689] Avg episode reward: [(0, '-54.311')] [2022-07-09 07:43:31,888][26022] Updated weights on worker 0-0, policy_version 153152 (0.00090) [2022-07-09 07:43:33,976][26022] Updated weights on worker 0-0, policy_version 153162 (0.00092) [2022-07-09 07:43:35,552][26022] Updated weights on worker 0-0, policy_version 153172 (0.00080) [2022-07-09 07:43:36,024][25689] Fps is (10 sec: 5763.4, 60 sec: 5694.1, 300 sec: 5731.7). Total num frames: 156850176. Throughput: 0: 5956.6. Samples: 156851686. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:36,024][25689] Avg episode reward: [(0, '-54.357')] [2022-07-09 07:43:37,493][26022] Updated weights on worker 0-0, policy_version 153182 (0.00083) [2022-07-09 07:43:39,035][26022] Updated weights on worker 0-0, policy_version 153192 (0.00083) [2022-07-09 07:43:41,063][25689] Fps is (10 sec: 5646.2, 60 sec: 5674.1, 300 sec: 5721.0). Total num frames: 156877824. Throughput: 0: 5965.0. Samples: 156886500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:41,064][25689] Avg episode reward: [(0, '-53.910')] [2022-07-09 07:43:41,081][26022] Updated weights on worker 0-0, policy_version 153202 (0.00086) [2022-07-09 07:43:42,747][26022] Updated weights on worker 0-0, policy_version 153212 (0.00089) [2022-07-09 07:43:44,603][26022] Updated weights on worker 0-0, policy_version 153222 (0.00088) [2022-07-09 07:43:46,147][25689] Fps is (10 sec: 5765.5, 60 sec: 5709.3, 300 sec: 5731.0). Total num frames: 156908544. Throughput: 0: 5088.2. Samples: 156903822. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:46,148][25689] Avg episode reward: [(0, '-54.073')] [2022-07-09 07:43:46,331][26022] Updated weights on worker 0-0, policy_version 153232 (0.00084) [2022-07-09 07:43:47,963][26022] Updated weights on worker 0-0, policy_version 153242 (0.00084) [2022-07-09 07:43:50,014][26022] Updated weights on worker 0-0, policy_version 153252 (0.00082) [2022-07-09 07:43:51,231][25689] Fps is (10 sec: 5942.0, 60 sec: 5718.7, 300 sec: 5727.9). Total num frames: 156938240. Throughput: 0: 5944.3. Samples: 156938704. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:51,231][25689] Avg episode reward: [(0, '-54.101')] [2022-07-09 07:43:51,669][26022] Updated weights on worker 0-0, policy_version 153262 (0.00105) [2022-07-09 07:43:53,520][26022] Updated weights on worker 0-0, policy_version 153272 (0.00190) [2022-07-09 07:43:55,043][26022] Updated weights on worker 0-0, policy_version 153282 (0.00094) [2022-07-09 07:43:56,291][25689] Fps is (10 sec: 5653.0, 60 sec: 5684.1, 300 sec: 5723.9). Total num frames: 156965888. Throughput: 0: 5999.6. Samples: 156973284. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:43:56,292][25689] Avg episode reward: [(0, '-54.225')] [2022-07-09 07:43:56,960][26022] Updated weights on worker 0-0, policy_version 153292 (0.00105) [2022-07-09 07:43:58,631][26022] Updated weights on worker 0-0, policy_version 153302 (0.00084) [2022-07-09 07:44:00,519][26022] Updated weights on worker 0-0, policy_version 153312 (0.00090) [2022-07-09 07:44:01,319][25689] Fps is (10 sec: 5785.9, 60 sec: 5722.4, 300 sec: 5737.7). Total num frames: 156996608. Throughput: 0: 5135.4. Samples: 156990520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:01,321][25689] Avg episode reward: [(0, '-53.778')] [2022-07-09 07:44:02,780][26022] Updated weights on worker 0-0, policy_version 153322 (0.00095) [2022-07-09 07:44:04,327][26022] Updated weights on worker 0-0, policy_version 153332 (0.00097) [2022-07-09 07:44:06,342][25689] Fps is (10 sec: 5501.5, 60 sec: 5709.3, 300 sec: 5720.2). Total num frames: 157021184. Throughput: 0: 5881.1. Samples: 157022590. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:06,344][25689] Avg episode reward: [(0, '-54.088')] [2022-07-09 07:44:06,472][26022] Updated weights on worker 0-0, policy_version 153342 (0.00090) [2022-07-09 07:44:08,074][26022] Updated weights on worker 0-0, policy_version 153352 (0.00087) [2022-07-09 07:44:09,755][26022] Updated weights on worker 0-0, policy_version 153362 (0.00084) [2022-07-09 07:44:11,444][25689] Fps is (10 sec: 5360.5, 60 sec: 5701.9, 300 sec: 5723.3). Total num frames: 157050880. Throughput: 0: 5857.8. Samples: 157057106. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:11,444][25689] Avg episode reward: [(0, '-54.676')] [2022-07-09 07:44:11,695][26022] Updated weights on worker 0-0, policy_version 153372 (0.00090) [2022-07-09 07:44:13,397][26022] Updated weights on worker 0-0, policy_version 153382 (0.00085) [2022-07-09 07:44:15,159][26022] Updated weights on worker 0-0, policy_version 153392 (0.00081) [2022-07-09 07:44:16,564][25689] Fps is (10 sec: 5910.7, 60 sec: 5714.5, 300 sec: 5724.5). Total num frames: 157081600. Throughput: 0: 4992.5. Samples: 157074494. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:16,564][25689] Avg episode reward: [(0, '-54.590')] [2022-07-09 07:44:17,129][26022] Updated weights on worker 0-0, policy_version 153402 (0.00087) [2022-07-09 07:44:18,750][26022] Updated weights on worker 0-0, policy_version 153412 (0.00088) [2022-07-09 07:44:20,540][26022] Updated weights on worker 0-0, policy_version 153422 (0.00080) [2022-07-09 07:44:21,617][25689] Fps is (10 sec: 5838.1, 60 sec: 5713.8, 300 sec: 5723.7). Total num frames: 157110272. Throughput: 0: 5841.3. Samples: 157109086. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:21,617][25689] Avg episode reward: [(0, '-55.036')] [2022-07-09 07:44:22,361][26022] Updated weights on worker 0-0, policy_version 153432 (0.00089) [2022-07-09 07:44:23,996][26022] Updated weights on worker 0-0, policy_version 153442 (0.00080) [2022-07-09 07:44:25,776][26022] Updated weights on worker 0-0, policy_version 153452 (0.00093) [2022-07-09 07:44:26,685][25689] Fps is (10 sec: 5665.6, 60 sec: 5710.5, 300 sec: 5722.8). Total num frames: 157138944. Throughput: 0: 5974.3. Samples: 157144126. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:26,686][25689] Avg episode reward: [(0, '-54.521')] [2022-07-09 07:44:27,431][26022] Updated weights on worker 0-0, policy_version 153462 (0.00095) [2022-07-09 07:44:29,357][26022] Updated weights on worker 0-0, policy_version 153472 (0.00086) [2022-07-09 07:44:31,009][26022] Updated weights on worker 0-0, policy_version 153482 (0.00082) [2022-07-09 07:44:31,719][25689] Fps is (10 sec: 5777.5, 60 sec: 5725.2, 300 sec: 5725.1). Total num frames: 157168640. Throughput: 0: 5152.1. Samples: 157161570. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:31,720][25689] Avg episode reward: [(0, '-54.190')] [2022-07-09 07:44:33,129][26022] Updated weights on worker 0-0, policy_version 153492 (0.00515) [2022-07-09 07:44:34,615][26022] Updated weights on worker 0-0, policy_version 153502 (0.00086) [2022-07-09 07:44:36,459][26022] Updated weights on worker 0-0, policy_version 153512 (0.00087) [2022-07-09 07:44:36,845][25689] Fps is (10 sec: 5745.2, 60 sec: 5707.5, 300 sec: 5720.2). Total num frames: 157197312. Throughput: 0: 5986.3. Samples: 157195902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:36,845][25689] Avg episode reward: [(0, '-54.118')] [2022-07-09 07:44:37,123][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:44:37,134][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000153515_157199360.pth [2022-07-09 07:44:37,135][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000151500_155136000.pth [2022-07-09 07:44:38,153][26022] Updated weights on worker 0-0, policy_version 153522 (0.00090) [2022-07-09 07:44:40,116][26022] Updated weights on worker 0-0, policy_version 153532 (0.00095) [2022-07-09 07:44:41,858][25689] Fps is (10 sec: 5655.8, 60 sec: 5726.8, 300 sec: 5724.1). Total num frames: 157225984. Throughput: 0: 6000.1. Samples: 157230538. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:41,859][25689] Avg episode reward: [(0, '-53.454')] [2022-07-09 07:44:41,868][26022] Updated weights on worker 0-0, policy_version 153542 (0.00087) [2022-07-09 07:44:43,739][26022] Updated weights on worker 0-0, policy_version 153552 (0.00085) [2022-07-09 07:44:45,311][26022] Updated weights on worker 0-0, policy_version 153562 (0.00085) [2022-07-09 07:44:46,935][25689] Fps is (10 sec: 5784.6, 60 sec: 5710.6, 300 sec: 5722.8). Total num frames: 157255680. Throughput: 0: 5976.4. Samples: 157265146. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:46,935][25689] Avg episode reward: [(0, '-53.002')] [2022-07-09 07:44:47,139][26022] Updated weights on worker 0-0, policy_version 153572 (0.00080) [2022-07-09 07:44:48,977][26022] Updated weights on worker 0-0, policy_version 153582 (0.00076) [2022-07-09 07:44:50,622][26022] Updated weights on worker 0-0, policy_version 153592 (0.00087) [2022-07-09 07:44:52,006][25689] Fps is (10 sec: 5852.7, 60 sec: 5711.8, 300 sec: 5722.9). Total num frames: 157285376. Throughput: 0: 5966.4. Samples: 157282610. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:52,007][25689] Avg episode reward: [(0, '-53.187')] [2022-07-09 07:44:52,778][26022] Updated weights on worker 0-0, policy_version 153602 (0.00087) [2022-07-09 07:44:54,138][26022] Updated weights on worker 0-0, policy_version 153612 (0.00088) [2022-07-09 07:44:56,046][26022] Updated weights on worker 0-0, policy_version 153622 (0.00086) [2022-07-09 07:44:57,063][25689] Fps is (10 sec: 5864.2, 60 sec: 5745.9, 300 sec: 5725.4). Total num frames: 157315072. Throughput: 0: 5999.7. Samples: 157317204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:44:57,063][25689] Avg episode reward: [(0, '-52.725')] [2022-07-09 07:44:57,942][26022] Updated weights on worker 0-0, policy_version 153632 (0.00088) [2022-07-09 07:44:59,463][26022] Updated weights on worker 0-0, policy_version 153642 (0.00082) [2022-07-09 07:45:01,377][26022] Updated weights on worker 0-0, policy_version 153652 (0.00117) [2022-07-09 07:45:02,097][25689] Fps is (10 sec: 5682.7, 60 sec: 5694.7, 300 sec: 5730.2). Total num frames: 157342720. Throughput: 0: 5968.5. Samples: 157351334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:45:02,102][25689] Avg episode reward: [(0, '-53.163')] [2022-07-09 07:45:03,439][26022] Updated weights on worker 0-0, policy_version 153662 (0.00086) [2022-07-09 07:45:05,153][26022] Updated weights on worker 0-0, policy_version 153672 (0.00088) [2022-07-09 07:45:07,117][25689] Fps is (10 sec: 5397.8, 60 sec: 5728.7, 300 sec: 5719.8). Total num frames: 157369344. Throughput: 0: 5051.9. Samples: 157367104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:45:07,119][25689] Avg episode reward: [(0, '-53.256')] [2022-07-09 07:45:07,242][26022] Updated weights on worker 0-0, policy_version 153682 (0.00095) [2022-07-09 07:45:08,754][26022] Updated weights on worker 0-0, policy_version 153692 (0.00085) [2022-07-09 07:45:10,581][26022] Updated weights on worker 0-0, policy_version 153702 (0.00053) [2022-07-09 07:45:12,135][25689] Fps is (10 sec: 5610.9, 60 sec: 5736.6, 300 sec: 5724.4). Total num frames: 157399040. Throughput: 0: 5901.2. Samples: 157401392. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:45:12,136][25689] Avg episode reward: [(0, '-53.868')] [2022-07-09 07:45:12,400][26022] Updated weights on worker 0-0, policy_version 153712 (0.00086) [2022-07-09 07:45:14,047][26022] Updated weights on worker 0-0, policy_version 153722 (0.00079) [2022-07-09 07:45:16,017][26022] Updated weights on worker 0-0, policy_version 153732 (0.00088) [2022-07-09 07:45:17,177][25689] Fps is (10 sec: 5903.8, 60 sec: 5727.1, 300 sec: 5730.7). Total num frames: 157428736. Throughput: 0: 5911.6. Samples: 157436112. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:45:17,178][25689] Avg episode reward: [(0, '-54.244')] [2022-07-09 07:45:17,695][26022] Updated weights on worker 0-0, policy_version 153742 (0.00084) [2022-07-09 07:45:19,415][26022] Updated weights on worker 0-0, policy_version 153752 (0.00089) [2022-07-09 07:45:21,207][26022] Updated weights on worker 0-0, policy_version 153762 (0.00087) [2022-07-09 07:45:22,195][25689] Fps is (10 sec: 5699.9, 60 sec: 5713.4, 300 sec: 5716.9). Total num frames: 157456384. Throughput: 0: 5078.3. Samples: 157453398. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:45:22,196][25689] Avg episode reward: [(0, '-54.465')] [2022-07-09 07:45:22,888][26022] Updated weights on worker 0-0, policy_version 153772 (0.00091) [2022-07-09 07:45:24,800][26022] Updated weights on worker 0-0, policy_version 153782 (0.00085) [2022-07-09 07:45:26,595][26022] Updated weights on worker 0-0, policy_version 153792 (0.00084) [2022-07-09 07:45:27,217][25689] Fps is (10 sec: 5813.7, 60 sec: 5751.7, 300 sec: 5727.0). Total num frames: 157487104. Throughput: 0: 6030.3. Samples: 157488312. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:27,219][25689] Avg episode reward: [(0, '-55.015')] [2022-07-09 07:45:28,293][26022] Updated weights on worker 0-0, policy_version 153802 (0.00087) [2022-07-09 07:45:30,211][26022] Updated weights on worker 0-0, policy_version 153812 (0.00084) [2022-07-09 07:45:32,183][26022] Updated weights on worker 0-0, policy_version 153822 (0.00090) [2022-07-09 07:45:32,235][25689] Fps is (10 sec: 5711.9, 60 sec: 5702.5, 300 sec: 5718.1). Total num frames: 157513728. Throughput: 0: 6031.4. Samples: 157522624. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:32,235][25689] Avg episode reward: [(0, '-55.292')] [2022-07-09 07:45:33,622][26022] Updated weights on worker 0-0, policy_version 153832 (0.00080) [2022-07-09 07:45:35,745][26022] Updated weights on worker 0-0, policy_version 153842 (0.00055) [2022-07-09 07:45:37,127][26022] Updated weights on worker 0-0, policy_version 153852 (0.00085) [2022-07-09 07:45:37,267][25689] Fps is (10 sec: 5706.0, 60 sec: 5745.2, 300 sec: 5721.5). Total num frames: 157544448. Throughput: 0: 5169.8. Samples: 157539972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:37,267][25689] Avg episode reward: [(0, '-54.451')] [2022-07-09 07:45:39,385][26022] Updated weights on worker 0-0, policy_version 153862 (0.00087) [2022-07-09 07:45:40,873][26022] Updated weights on worker 0-0, policy_version 153872 (0.00093) [2022-07-09 07:45:42,284][25689] Fps is (10 sec: 5808.2, 60 sec: 5727.9, 300 sec: 5719.0). Total num frames: 157572096. Throughput: 0: 6015.6. Samples: 157574244. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:42,285][25689] Avg episode reward: [(0, '-54.286')] [2022-07-09 07:45:42,866][26022] Updated weights on worker 0-0, policy_version 153882 (0.00087) [2022-07-09 07:45:44,344][26022] Updated weights on worker 0-0, policy_version 153892 (0.00094) [2022-07-09 07:45:46,464][26022] Updated weights on worker 0-0, policy_version 153902 (0.00089) [2022-07-09 07:45:47,307][25689] Fps is (10 sec: 5711.2, 60 sec: 5733.0, 300 sec: 5719.2). Total num frames: 157601792. Throughput: 0: 6013.7. Samples: 157609130. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:47,308][25689] Avg episode reward: [(0, '-54.864')] [2022-07-09 07:45:47,805][26022] Updated weights on worker 0-0, policy_version 153912 (0.00081) [2022-07-09 07:45:49,888][26022] Updated weights on worker 0-0, policy_version 153922 (0.00089) [2022-07-09 07:45:51,359][26022] Updated weights on worker 0-0, policy_version 153932 (0.00091) [2022-07-09 07:45:52,340][25689] Fps is (10 sec: 5804.1, 60 sec: 5719.6, 300 sec: 5719.5). Total num frames: 157630464. Throughput: 0: 5165.5. Samples: 157626480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:52,341][25689] Avg episode reward: [(0, '-54.178')] [2022-07-09 07:45:53,433][26022] Updated weights on worker 0-0, policy_version 153942 (0.00087) [2022-07-09 07:45:55,124][26022] Updated weights on worker 0-0, policy_version 153952 (0.00082) [2022-07-09 07:45:56,778][26022] Updated weights on worker 0-0, policy_version 153962 (0.00086) [2022-07-09 07:45:57,388][25689] Fps is (10 sec: 5790.2, 60 sec: 5720.5, 300 sec: 5719.9). Total num frames: 157660160. Throughput: 0: 6018.7. Samples: 157661074. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:45:57,388][25689] Avg episode reward: [(0, '-53.507')] [2022-07-09 07:45:58,593][26022] Updated weights on worker 0-0, policy_version 153972 (0.00082) [2022-07-09 07:46:00,522][26022] Updated weights on worker 0-0, policy_version 153982 (0.00085) [2022-07-09 07:46:02,401][25689] Fps is (10 sec: 5598.0, 60 sec: 5705.5, 300 sec: 5713.8). Total num frames: 157686784. Throughput: 0: 5977.2. Samples: 157694488. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:02,402][25689] Avg episode reward: [(0, '-53.872')] [2022-07-09 07:46:02,515][26022] Updated weights on worker 0-0, policy_version 153992 (0.00087) [2022-07-09 07:46:04,367][26022] Updated weights on worker 0-0, policy_version 154002 (0.00082) [2022-07-09 07:46:05,816][26022] Updated weights on worker 0-0, policy_version 154012 (0.00114) [2022-07-09 07:46:07,414][25689] Fps is (10 sec: 5310.7, 60 sec: 5706.1, 300 sec: 5718.4). Total num frames: 157713408. Throughput: 0: 5088.8. Samples: 157711450. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:07,415][25689] Avg episode reward: [(0, '-53.580')] [2022-07-09 07:46:07,980][26022] Updated weights on worker 0-0, policy_version 154022 (0.00084) [2022-07-09 07:46:09,560][26022] Updated weights on worker 0-0, policy_version 154032 (0.00090) [2022-07-09 07:46:11,379][26022] Updated weights on worker 0-0, policy_version 154042 (0.00084) [2022-07-09 07:46:12,430][25689] Fps is (10 sec: 5819.8, 60 sec: 5740.3, 300 sec: 5722.8). Total num frames: 157745152. Throughput: 0: 5932.6. Samples: 157745664. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:12,431][25689] Avg episode reward: [(0, '-52.864')] [2022-07-09 07:46:13,427][26022] Updated weights on worker 0-0, policy_version 154052 (0.00091) [2022-07-09 07:46:14,915][26022] Updated weights on worker 0-0, policy_version 154062 (0.00092) [2022-07-09 07:46:16,986][26022] Updated weights on worker 0-0, policy_version 154072 (0.00087) [2022-07-09 07:46:17,471][25689] Fps is (10 sec: 5905.8, 60 sec: 5706.5, 300 sec: 5715.8). Total num frames: 157772800. Throughput: 0: 5918.5. Samples: 157779936. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:17,471][25689] Avg episode reward: [(0, '-52.283')] [2022-07-09 07:46:18,553][26022] Updated weights on worker 0-0, policy_version 154082 (0.00086) [2022-07-09 07:46:20,259][26022] Updated weights on worker 0-0, policy_version 154092 (0.00088) [2022-07-09 07:46:22,169][26022] Updated weights on worker 0-0, policy_version 154102 (0.00083) [2022-07-09 07:46:22,505][25689] Fps is (10 sec: 5590.0, 60 sec: 5721.9, 300 sec: 5716.2). Total num frames: 157801472. Throughput: 0: 5115.8. Samples: 157797338. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:22,506][25689] Avg episode reward: [(0, '-52.542')] [2022-07-09 07:46:23,734][26022] Updated weights on worker 0-0, policy_version 154112 (0.00084) [2022-07-09 07:46:25,676][26022] Updated weights on worker 0-0, policy_version 154122 (0.00082) [2022-07-09 07:46:27,523][25689] Fps is (10 sec: 5704.7, 60 sec: 5688.3, 300 sec: 5719.6). Total num frames: 157830144. Throughput: 0: 6000.1. Samples: 157832102. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:27,523][25689] Avg episode reward: [(0, '-52.952')] [2022-07-09 07:46:27,562][26022] Updated weights on worker 0-0, policy_version 154132 (0.00090) [2022-07-09 07:46:29,285][26022] Updated weights on worker 0-0, policy_version 154142 (0.00084) [2022-07-09 07:46:31,008][26022] Updated weights on worker 0-0, policy_version 154152 (0.00092) [2022-07-09 07:46:32,548][25689] Fps is (10 sec: 5710.1, 60 sec: 5721.6, 300 sec: 5718.2). Total num frames: 157858816. Throughput: 0: 6009.0. Samples: 157866550. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:32,548][25689] Avg episode reward: [(0, '-53.471')] [2022-07-09 07:46:32,890][26022] Updated weights on worker 0-0, policy_version 154162 (0.00127) [2022-07-09 07:46:34,510][26022] Updated weights on worker 0-0, policy_version 154172 (0.00097) [2022-07-09 07:46:36,413][26022] Updated weights on worker 0-0, policy_version 154182 (0.00059) [2022-07-09 07:46:37,214][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:46:37,224][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000154187_157887488.pth [2022-07-09 07:46:37,224][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000152172_155824128.pth [2022-07-09 07:46:37,626][25689] Fps is (10 sec: 5980.0, 60 sec: 5734.2, 300 sec: 5723.8). Total num frames: 157890560. Throughput: 0: 5151.0. Samples: 157883754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:37,626][25689] Avg episode reward: [(0, '-53.252')] [2022-07-09 07:46:38,216][26022] Updated weights on worker 0-0, policy_version 154192 (0.00091) [2022-07-09 07:46:39,953][26022] Updated weights on worker 0-0, policy_version 154202 (0.00090) [2022-07-09 07:46:42,038][26022] Updated weights on worker 0-0, policy_version 154212 (0.00055) [2022-07-09 07:46:42,667][25689] Fps is (10 sec: 5667.0, 60 sec: 5698.1, 300 sec: 5716.4). Total num frames: 157916160. Throughput: 0: 5977.0. Samples: 157917844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:42,667][25689] Avg episode reward: [(0, '-53.758')] [2022-07-09 07:46:43,722][26022] Updated weights on worker 0-0, policy_version 154222 (0.00094) [2022-07-09 07:46:45,440][26022] Updated weights on worker 0-0, policy_version 154232 (0.00092) [2022-07-09 07:46:47,238][26022] Updated weights on worker 0-0, policy_version 154242 (0.00089) [2022-07-09 07:46:47,672][25689] Fps is (10 sec: 5402.2, 60 sec: 5682.8, 300 sec: 5713.9). Total num frames: 157944832. Throughput: 0: 5958.5. Samples: 157952162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:47,673][25689] Avg episode reward: [(0, '-54.523')] [2022-07-09 07:46:48,888][26022] Updated weights on worker 0-0, policy_version 154252 (0.00087) [2022-07-09 07:46:50,957][26022] Updated weights on worker 0-0, policy_version 154262 (0.00086) [2022-07-09 07:46:52,277][26022] Updated weights on worker 0-0, policy_version 154272 (0.00086) [2022-07-09 07:46:52,684][25689] Fps is (10 sec: 5826.5, 60 sec: 5701.7, 300 sec: 5718.5). Total num frames: 157974528. Throughput: 0: 5106.6. Samples: 157969380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:52,685][25689] Avg episode reward: [(0, '-54.408')] [2022-07-09 07:46:54,303][26022] Updated weights on worker 0-0, policy_version 154282 (0.00086) [2022-07-09 07:46:56,264][26022] Updated weights on worker 0-0, policy_version 154292 (0.00083) [2022-07-09 07:46:57,806][25689] Fps is (10 sec: 5860.9, 60 sec: 5694.7, 300 sec: 5716.3). Total num frames: 158004224. Throughput: 0: 5959.3. Samples: 158004012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:46:57,806][25689] Avg episode reward: [(0, '-54.397')] [2022-07-09 07:46:57,902][26022] Updated weights on worker 0-0, policy_version 154302 (0.00089) [2022-07-09 07:46:59,548][26022] Updated weights on worker 0-0, policy_version 154312 (0.00106) [2022-07-09 07:47:01,839][26022] Updated weights on worker 0-0, policy_version 154322 (0.00095) [2022-07-09 07:47:02,823][25689] Fps is (10 sec: 5656.2, 60 sec: 5711.3, 300 sec: 5720.4). Total num frames: 158031872. Throughput: 0: 5903.0. Samples: 158036824. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:47:02,823][25689] Avg episode reward: [(0, '-54.132')] [2022-07-09 07:47:03,422][26022] Updated weights on worker 0-0, policy_version 154332 (0.00086) [2022-07-09 07:47:05,292][26022] Updated weights on worker 0-0, policy_version 154342 (0.00081) [2022-07-09 07:47:07,041][26022] Updated weights on worker 0-0, policy_version 154352 (0.00083) [2022-07-09 07:47:07,831][25689] Fps is (10 sec: 5515.5, 60 sec: 5728.7, 300 sec: 5711.1). Total num frames: 158059520. Throughput: 0: 5052.9. Samples: 158054024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:47:07,832][25689] Avg episode reward: [(0, '-53.715')] [2022-07-09 07:47:08,912][26022] Updated weights on worker 0-0, policy_version 154362 (0.00085) [2022-07-09 07:47:10,863][26022] Updated weights on worker 0-0, policy_version 154372 (0.00084) [2022-07-09 07:47:12,399][26022] Updated weights on worker 0-0, policy_version 154382 (0.00084) [2022-07-09 07:47:12,849][25689] Fps is (10 sec: 5719.2, 60 sec: 5694.6, 300 sec: 5715.6). Total num frames: 158089216. Throughput: 0: 5893.4. Samples: 158088220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:47:12,850][25689] Avg episode reward: [(0, '-53.645')] [2022-07-09 07:47:14,246][26022] Updated weights on worker 0-0, policy_version 154392 (0.00092) [2022-07-09 07:47:15,912][26022] Updated weights on worker 0-0, policy_version 154402 (0.00088) [2022-07-09 07:47:17,830][26022] Updated weights on worker 0-0, policy_version 154412 (0.00086) [2022-07-09 07:47:17,892][25689] Fps is (10 sec: 5801.5, 60 sec: 5711.4, 300 sec: 5712.0). Total num frames: 158117888. Throughput: 0: 5924.8. Samples: 158123020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:47:17,893][25689] Avg episode reward: [(0, '-53.669')] [2022-07-09 07:47:19,370][26022] Updated weights on worker 0-0, policy_version 154422 (0.00089) [2022-07-09 07:47:21,454][26022] Updated weights on worker 0-0, policy_version 154432 (0.00087) [2022-07-09 07:47:22,863][26022] Updated weights on worker 0-0, policy_version 154442 (0.00097) [2022-07-09 07:47:22,913][25689] Fps is (10 sec: 5901.4, 60 sec: 5746.5, 300 sec: 5722.8). Total num frames: 158148608. Throughput: 0: 5155.2. Samples: 158140396. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:47:22,914][25689] Avg episode reward: [(0, '-54.164')] [2022-07-09 07:47:24,824][26022] Updated weights on worker 0-0, policy_version 154452 (0.00093) [2022-07-09 07:47:26,475][26022] Updated weights on worker 0-0, policy_version 154462 (0.00085) [2022-07-09 07:47:27,932][25689] Fps is (10 sec: 5711.6, 60 sec: 5712.5, 300 sec: 5712.4). Total num frames: 158175232. Throughput: 0: 6028.3. Samples: 158175198. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 07:47:27,932][25689] Avg episode reward: [(0, '-54.149')] [2022-07-09 07:47:28,327][26022] Updated weights on worker 0-0, policy_version 154472 (0.00084) [2022-07-09 07:47:30,332][26022] Updated weights on worker 0-0, policy_version 154482 (0.00092) [2022-07-09 07:47:31,926][26022] Updated weights on worker 0-0, policy_version 154492 (0.00088) [2022-07-09 07:47:32,944][25689] Fps is (10 sec: 5512.4, 60 sec: 5713.7, 300 sec: 5709.8). Total num frames: 158203904. Throughput: 0: 6052.0. Samples: 158209836. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:47:32,945][25689] Avg episode reward: [(0, '-53.865')] [2022-07-09 07:47:33,847][26022] Updated weights on worker 0-0, policy_version 154502 (0.00091) [2022-07-09 07:47:35,551][26022] Updated weights on worker 0-0, policy_version 154512 (0.00089) [2022-07-09 07:47:37,254][26022] Updated weights on worker 0-0, policy_version 154522 (0.00092) [2022-07-09 07:47:38,074][25689] Fps is (10 sec: 5856.3, 60 sec: 5691.9, 300 sec: 5714.4). Total num frames: 158234624. Throughput: 0: 5152.4. Samples: 158227004. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:47:38,087][25689] Avg episode reward: [(0, '-53.065')] [2022-07-09 07:47:39,330][26022] Updated weights on worker 0-0, policy_version 154532 (0.00090) [2022-07-09 07:47:40,828][26022] Updated weights on worker 0-0, policy_version 154542 (0.00088) [2022-07-09 07:47:42,884][26022] Updated weights on worker 0-0, policy_version 154552 (0.00548) [2022-07-09 07:47:43,105][25689] Fps is (10 sec: 5744.5, 60 sec: 5726.7, 300 sec: 5712.2). Total num frames: 158262272. Throughput: 0: 6000.4. Samples: 158261556. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:47:43,106][25689] Avg episode reward: [(0, '-52.297')] [2022-07-09 07:47:44,519][26022] Updated weights on worker 0-0, policy_version 154562 (0.00086) [2022-07-09 07:47:46,367][26022] Updated weights on worker 0-0, policy_version 154572 (0.00085) [2022-07-09 07:47:47,946][26022] Updated weights on worker 0-0, policy_version 154582 (0.00086) [2022-07-09 07:47:48,127][25689] Fps is (10 sec: 5704.0, 60 sec: 5742.1, 300 sec: 5715.3). Total num frames: 158291968. Throughput: 0: 6004.6. Samples: 158296460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:47:48,127][25689] Avg episode reward: [(0, '-52.580')] [2022-07-09 07:47:49,684][26022] Updated weights on worker 0-0, policy_version 154592 (0.00081) [2022-07-09 07:47:51,533][26022] Updated weights on worker 0-0, policy_version 154602 (0.00087) [2022-07-09 07:47:53,147][25689] Fps is (10 sec: 5914.6, 60 sec: 5741.4, 300 sec: 5715.9). Total num frames: 158321664. Throughput: 0: 5155.5. Samples: 158313990. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:47:53,147][25689] Avg episode reward: [(0, '-52.336')] [2022-07-09 07:47:53,394][26022] Updated weights on worker 0-0, policy_version 154612 (0.00081) [2022-07-09 07:47:54,914][26022] Updated weights on worker 0-0, policy_version 154622 (0.00087) [2022-07-09 07:47:56,814][26022] Updated weights on worker 0-0, policy_version 154632 (0.00084) [2022-07-09 07:47:58,262][25689] Fps is (10 sec: 5859.9, 60 sec: 5741.9, 300 sec: 5718.6). Total num frames: 158351360. Throughput: 0: 6044.1. Samples: 158349026. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:47:58,263][25689] Avg episode reward: [(0, '-52.458')] [2022-07-09 07:47:58,595][26022] Updated weights on worker 0-0, policy_version 154642 (0.00078) [2022-07-09 07:48:00,275][26022] Updated weights on worker 0-0, policy_version 154652 (0.00084) [2022-07-09 07:48:02,310][26022] Updated weights on worker 0-0, policy_version 154662 (0.00103) [2022-07-09 07:48:03,344][25689] Fps is (10 sec: 5523.3, 60 sec: 5718.9, 300 sec: 5721.7). Total num frames: 158377984. Throughput: 0: 6007.4. Samples: 158383136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:03,344][25689] Avg episode reward: [(0, '-52.519')] [2022-07-09 07:48:04,024][26022] Updated weights on worker 0-0, policy_version 154672 (0.00108) [2022-07-09 07:48:06,172][26022] Updated weights on worker 0-0, policy_version 154682 (0.00091) [2022-07-09 07:48:07,671][26022] Updated weights on worker 0-0, policy_version 154692 (0.00096) [2022-07-09 07:48:08,407][25689] Fps is (10 sec: 5450.8, 60 sec: 5730.6, 300 sec: 5717.5). Total num frames: 158406656. Throughput: 0: 5897.5. Samples: 158416060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:08,407][25689] Avg episode reward: [(0, '-52.282')] [2022-07-09 07:48:09,557][26022] Updated weights on worker 0-0, policy_version 154702 (0.00086) [2022-07-09 07:48:11,454][26022] Updated weights on worker 0-0, policy_version 154712 (0.00080) [2022-07-09 07:48:13,006][26022] Updated weights on worker 0-0, policy_version 154722 (0.00092) [2022-07-09 07:48:13,421][25689] Fps is (10 sec: 5893.9, 60 sec: 5747.9, 300 sec: 5722.1). Total num frames: 158437376. Throughput: 0: 5883.5. Samples: 158433270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:13,421][25689] Avg episode reward: [(0, '-51.948')] [2022-07-09 07:48:14,980][26022] Updated weights on worker 0-0, policy_version 154732 (0.00085) [2022-07-09 07:48:16,593][26022] Updated weights on worker 0-0, policy_version 154742 (0.00090) [2022-07-09 07:48:18,337][26022] Updated weights on worker 0-0, policy_version 154752 (0.00084) [2022-07-09 07:48:18,475][25689] Fps is (10 sec: 6000.8, 60 sec: 5763.8, 300 sec: 5725.4). Total num frames: 158467072. Throughput: 0: 5890.2. Samples: 158468082. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:18,475][25689] Avg episode reward: [(0, '-52.328')] [2022-07-09 07:48:20,150][26022] Updated weights on worker 0-0, policy_version 154762 (0.00085) [2022-07-09 07:48:21,863][26022] Updated weights on worker 0-0, policy_version 154772 (0.00084) [2022-07-09 07:48:23,477][25689] Fps is (10 sec: 5702.1, 60 sec: 5714.8, 300 sec: 5722.5). Total num frames: 158494720. Throughput: 0: 5969.9. Samples: 158503330. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:23,478][25689] Avg episode reward: [(0, '-51.494')] [2022-07-09 07:48:23,712][26022] Updated weights on worker 0-0, policy_version 154782 (0.00089) [2022-07-09 07:48:25,627][26022] Updated weights on worker 0-0, policy_version 154792 (0.00100) [2022-07-09 07:48:27,018][26022] Updated weights on worker 0-0, policy_version 154802 (0.00101) [2022-07-09 07:48:28,490][25689] Fps is (10 sec: 5725.6, 60 sec: 5766.1, 300 sec: 5725.9). Total num frames: 158524416. Throughput: 0: 5219.3. Samples: 158520882. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:28,491][25689] Avg episode reward: [(0, '-52.821')] [2022-07-09 07:48:29,173][26022] Updated weights on worker 0-0, policy_version 154812 (0.00098) [2022-07-09 07:48:30,527][26022] Updated weights on worker 0-0, policy_version 154822 (0.00088) [2022-07-09 07:48:32,597][26022] Updated weights on worker 0-0, policy_version 154832 (0.00086) [2022-07-09 07:48:33,499][25689] Fps is (10 sec: 5824.0, 60 sec: 5766.4, 300 sec: 5724.5). Total num frames: 158553088. Throughput: 0: 6107.2. Samples: 158555894. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:33,500][25689] Avg episode reward: [(0, '-52.892')] [2022-07-09 07:48:34,262][26022] Updated weights on worker 0-0, policy_version 154842 (0.00085) [2022-07-09 07:48:36,175][26022] Updated weights on worker 0-0, policy_version 154852 (0.00083) [2022-07-09 07:48:37,404][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:48:37,417][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000154860_158576640.pth [2022-07-09 07:48:37,418][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000152845_156513280.pth [2022-07-09 07:48:37,836][26022] Updated weights on worker 0-0, policy_version 154862 (0.00083) [2022-07-09 07:48:38,633][25689] Fps is (10 sec: 5855.4, 60 sec: 5765.9, 300 sec: 5733.1). Total num frames: 158583808. Throughput: 0: 6098.0. Samples: 158591008. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:38,634][25689] Avg episode reward: [(0, '-53.281')] [2022-07-09 07:48:39,782][26022] Updated weights on worker 0-0, policy_version 154872 (0.00082) [2022-07-09 07:48:41,196][26022] Updated weights on worker 0-0, policy_version 154882 (0.00089) [2022-07-09 07:48:43,212][26022] Updated weights on worker 0-0, policy_version 154892 (0.00092) [2022-07-09 07:48:43,659][25689] Fps is (10 sec: 5745.2, 60 sec: 5766.5, 300 sec: 5723.9). Total num frames: 158611456. Throughput: 0: 5193.9. Samples: 158608150. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:43,659][25689] Avg episode reward: [(0, '-53.471')] [2022-07-09 07:48:45,018][26022] Updated weights on worker 0-0, policy_version 154902 (0.00083) [2022-07-09 07:48:46,772][26022] Updated weights on worker 0-0, policy_version 154912 (0.00096) [2022-07-09 07:48:48,410][26022] Updated weights on worker 0-0, policy_version 154922 (0.00050) [2022-07-09 07:48:48,678][25689] Fps is (10 sec: 5708.7, 60 sec: 5766.7, 300 sec: 5725.1). Total num frames: 158641152. Throughput: 0: 6050.4. Samples: 158643028. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:48,680][25689] Avg episode reward: [(0, '-54.001')] [2022-07-09 07:48:50,263][26022] Updated weights on worker 0-0, policy_version 154932 (0.00079) [2022-07-09 07:48:51,963][26022] Updated weights on worker 0-0, policy_version 154942 (0.00083) [2022-07-09 07:48:53,706][25689] Fps is (10 sec: 5707.5, 60 sec: 5732.2, 300 sec: 5725.7). Total num frames: 158668800. Throughput: 0: 6016.7. Samples: 158677470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:53,706][25689] Avg episode reward: [(0, '-53.430')] [2022-07-09 07:48:53,783][26022] Updated weights on worker 0-0, policy_version 154952 (0.00088) [2022-07-09 07:48:55,443][26022] Updated weights on worker 0-0, policy_version 154962 (0.00084) [2022-07-09 07:48:57,301][26022] Updated weights on worker 0-0, policy_version 154972 (0.00084) [2022-07-09 07:48:58,758][25689] Fps is (10 sec: 5790.5, 60 sec: 5755.1, 300 sec: 5725.2). Total num frames: 158699520. Throughput: 0: 5150.8. Samples: 158694668. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:48:58,760][25689] Avg episode reward: [(0, '-54.001')] [2022-07-09 07:48:59,116][26022] Updated weights on worker 0-0, policy_version 154982 (0.00084) [2022-07-09 07:49:00,714][26022] Updated weights on worker 0-0, policy_version 154992 (0.00081) [2022-07-09 07:49:03,063][26022] Updated weights on worker 0-0, policy_version 155002 (0.00090) [2022-07-09 07:49:03,763][25689] Fps is (10 sec: 5702.0, 60 sec: 5762.4, 300 sec: 5732.5). Total num frames: 158726144. Throughput: 0: 5962.2. Samples: 158728014. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:03,765][25689] Avg episode reward: [(0, '-54.102')] [2022-07-09 07:49:04,665][26022] Updated weights on worker 0-0, policy_version 155012 (0.00089) [2022-07-09 07:49:06,552][26022] Updated weights on worker 0-0, policy_version 155022 (0.00083) [2022-07-09 07:49:08,189][26022] Updated weights on worker 0-0, policy_version 155032 (0.00056) [2022-07-09 07:49:08,766][25689] Fps is (10 sec: 5525.3, 60 sec: 5768.1, 300 sec: 5730.9). Total num frames: 158754816. Throughput: 0: 5936.0. Samples: 158762268. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:08,768][25689] Avg episode reward: [(0, '-53.880')] [2022-07-09 07:49:10,087][26022] Updated weights on worker 0-0, policy_version 155042 (0.00113) [2022-07-09 07:49:11,842][26022] Updated weights on worker 0-0, policy_version 155052 (0.00082) [2022-07-09 07:49:13,516][26022] Updated weights on worker 0-0, policy_version 155062 (0.00090) [2022-07-09 07:49:13,799][25689] Fps is (10 sec: 5815.8, 60 sec: 5749.4, 300 sec: 5729.2). Total num frames: 158784512. Throughput: 0: 5082.9. Samples: 158779600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:13,800][25689] Avg episode reward: [(0, '-54.065')] [2022-07-09 07:49:15,462][26022] Updated weights on worker 0-0, policy_version 155072 (0.00086) [2022-07-09 07:49:17,109][26022] Updated weights on worker 0-0, policy_version 155082 (0.00080) [2022-07-09 07:49:18,907][25689] Fps is (10 sec: 5755.8, 60 sec: 5727.3, 300 sec: 5728.1). Total num frames: 158813184. Throughput: 0: 5943.7. Samples: 158814424. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:18,909][25689] Avg episode reward: [(0, '-54.425')] [2022-07-09 07:49:18,955][26022] Updated weights on worker 0-0, policy_version 155092 (0.00088) [2022-07-09 07:49:20,659][26022] Updated weights on worker 0-0, policy_version 155102 (0.00095) [2022-07-09 07:49:22,444][26022] Updated weights on worker 0-0, policy_version 155112 (0.00083) [2022-07-09 07:49:23,924][25689] Fps is (10 sec: 5764.3, 60 sec: 5759.7, 300 sec: 5732.5). Total num frames: 158842880. Throughput: 0: 6008.6. Samples: 158849158. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:23,925][25689] Avg episode reward: [(0, '-54.530')] [2022-07-09 07:49:24,206][26022] Updated weights on worker 0-0, policy_version 155122 (0.00087) [2022-07-09 07:49:25,872][26022] Updated weights on worker 0-0, policy_version 155132 (0.00088) [2022-07-09 07:49:28,011][26022] Updated weights on worker 0-0, policy_version 155142 (0.00082) [2022-07-09 07:49:28,967][25689] Fps is (10 sec: 5802.0, 60 sec: 5740.0, 300 sec: 5728.9). Total num frames: 158871552. Throughput: 0: 5152.3. Samples: 158866348. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:28,967][25689] Avg episode reward: [(0, '-54.430')] [2022-07-09 07:49:29,446][26022] Updated weights on worker 0-0, policy_version 155152 (0.00084) [2022-07-09 07:49:31,462][26022] Updated weights on worker 0-0, policy_version 155162 (0.00087) [2022-07-09 07:49:33,034][26022] Updated weights on worker 0-0, policy_version 155172 (0.00083) [2022-07-09 07:49:33,994][25689] Fps is (10 sec: 5694.9, 60 sec: 5738.3, 300 sec: 5730.8). Total num frames: 158900224. Throughput: 0: 6012.1. Samples: 158901014. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:33,994][25689] Avg episode reward: [(0, '-55.129')] [2022-07-09 07:49:35,014][26022] Updated weights on worker 0-0, policy_version 155182 (0.00087) [2022-07-09 07:49:36,515][26022] Updated weights on worker 0-0, policy_version 155192 (0.00091) [2022-07-09 07:49:38,581][26022] Updated weights on worker 0-0, policy_version 155202 (0.00080) [2022-07-09 07:49:39,099][25689] Fps is (10 sec: 5760.8, 60 sec: 5724.1, 300 sec: 5732.5). Total num frames: 158929920. Throughput: 0: 6004.9. Samples: 158935674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 07:49:39,099][25689] Avg episode reward: [(0, '-54.286')] [2022-07-09 07:49:40,079][26022] Updated weights on worker 0-0, policy_version 155212 (0.00087) [2022-07-09 07:49:42,159][26022] Updated weights on worker 0-0, policy_version 155222 (0.00100) [2022-07-09 07:49:43,626][26022] Updated weights on worker 0-0, policy_version 155232 (0.00086) [2022-07-09 07:49:44,125][25689] Fps is (10 sec: 5761.0, 60 sec: 5741.0, 300 sec: 5730.0). Total num frames: 158958592. Throughput: 0: 5125.4. Samples: 158952696. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:49:44,126][25689] Avg episode reward: [(0, '-53.773')] [2022-07-09 07:49:45,688][26022] Updated weights on worker 0-0, policy_version 155242 (0.00094) [2022-07-09 07:49:47,139][26022] Updated weights on worker 0-0, policy_version 155252 (0.00083) [2022-07-09 07:49:49,144][25689] Fps is (10 sec: 5708.5, 60 sec: 5724.1, 300 sec: 5727.6). Total num frames: 158987264. Throughput: 0: 6009.6. Samples: 158987606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:49:49,145][25689] Avg episode reward: [(0, '-53.994')] [2022-07-09 07:49:49,163][26022] Updated weights on worker 0-0, policy_version 155262 (0.00093) [2022-07-09 07:49:50,832][26022] Updated weights on worker 0-0, policy_version 155272 (0.00093) [2022-07-09 07:49:52,704][26022] Updated weights on worker 0-0, policy_version 155282 (0.00100) [2022-07-09 07:49:54,222][25689] Fps is (10 sec: 5882.3, 60 sec: 5770.1, 300 sec: 5730.6). Total num frames: 159017984. Throughput: 0: 6006.5. Samples: 159022516. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:49:54,223][25689] Avg episode reward: [(0, '-54.669')] [2022-07-09 07:49:54,395][26022] Updated weights on worker 0-0, policy_version 155292 (0.00089) [2022-07-09 07:49:56,248][26022] Updated weights on worker 0-0, policy_version 155302 (0.00091) [2022-07-09 07:49:57,788][26022] Updated weights on worker 0-0, policy_version 155312 (0.00082) [2022-07-09 07:49:59,300][25689] Fps is (10 sec: 5847.8, 60 sec: 5733.8, 300 sec: 5733.2). Total num frames: 159046656. Throughput: 0: 5153.3. Samples: 159039778. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:49:59,301][25689] Avg episode reward: [(0, '-54.595')] [2022-07-09 07:49:59,811][26022] Updated weights on worker 0-0, policy_version 155322 (0.00095) [2022-07-09 07:50:01,539][26022] Updated weights on worker 0-0, policy_version 155332 (0.00084) [2022-07-09 07:50:03,549][26022] Updated weights on worker 0-0, policy_version 155342 (0.00084) [2022-07-09 07:50:04,379][25689] Fps is (10 sec: 5444.2, 60 sec: 5726.8, 300 sec: 5732.1). Total num frames: 159073280. Throughput: 0: 5912.2. Samples: 159072440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:04,379][25689] Avg episode reward: [(0, '-53.833')] [2022-07-09 07:50:05,270][26022] Updated weights on worker 0-0, policy_version 155352 (0.00083) [2022-07-09 07:50:07,314][26022] Updated weights on worker 0-0, policy_version 155362 (0.00082) [2022-07-09 07:50:08,978][26022] Updated weights on worker 0-0, policy_version 155372 (0.00088) [2022-07-09 07:50:09,403][25689] Fps is (10 sec: 5676.1, 60 sec: 5758.6, 300 sec: 5735.4). Total num frames: 159104000. Throughput: 0: 5908.6. Samples: 159107308. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:09,403][25689] Avg episode reward: [(0, '-53.470')] [2022-07-09 07:50:10,888][26022] Updated weights on worker 0-0, policy_version 155382 (0.00088) [2022-07-09 07:50:12,453][26022] Updated weights on worker 0-0, policy_version 155392 (0.00095) [2022-07-09 07:50:14,434][25689] Fps is (10 sec: 5805.0, 60 sec: 5725.0, 300 sec: 5728.8). Total num frames: 159131648. Throughput: 0: 5033.0. Samples: 159124242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:14,436][26022] Updated weights on worker 0-0, policy_version 155402 (0.00087) [2022-07-09 07:50:14,439][25689] Avg episode reward: [(0, '-52.792')] [2022-07-09 07:50:16,125][26022] Updated weights on worker 0-0, policy_version 155412 (0.00080) [2022-07-09 07:50:18,043][26022] Updated weights on worker 0-0, policy_version 155422 (0.00083) [2022-07-09 07:50:19,479][25689] Fps is (10 sec: 5690.9, 60 sec: 5747.8, 300 sec: 5735.1). Total num frames: 159161344. Throughput: 0: 5910.2. Samples: 159159040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:19,481][25689] Avg episode reward: [(0, '-52.620')] [2022-07-09 07:50:19,635][26022] Updated weights on worker 0-0, policy_version 155432 (0.00087) [2022-07-09 07:50:21,443][26022] Updated weights on worker 0-0, policy_version 155442 (0.00097) [2022-07-09 07:50:23,146][26022] Updated weights on worker 0-0, policy_version 155452 (0.00093) [2022-07-09 07:50:24,515][25689] Fps is (10 sec: 5790.0, 60 sec: 5729.2, 300 sec: 5728.0). Total num frames: 159190016. Throughput: 0: 6010.9. Samples: 159193474. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:24,516][25689] Avg episode reward: [(0, '-52.632')] [2022-07-09 07:50:25,034][26022] Updated weights on worker 0-0, policy_version 155462 (0.00088) [2022-07-09 07:50:26,889][26022] Updated weights on worker 0-0, policy_version 155472 (0.00085) [2022-07-09 07:50:28,643][26022] Updated weights on worker 0-0, policy_version 155482 (0.00095) [2022-07-09 07:50:29,534][25689] Fps is (10 sec: 5703.3, 60 sec: 5731.4, 300 sec: 5734.8). Total num frames: 159218688. Throughput: 0: 5985.1. Samples: 159227794. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:29,534][25689] Avg episode reward: [(0, '-52.432')] [2022-07-09 07:50:30,451][26022] Updated weights on worker 0-0, policy_version 155492 (0.00080) [2022-07-09 07:50:32,064][26022] Updated weights on worker 0-0, policy_version 155502 (0.00094) [2022-07-09 07:50:33,875][26022] Updated weights on worker 0-0, policy_version 155512 (0.00087) [2022-07-09 07:50:34,551][25689] Fps is (10 sec: 5713.4, 60 sec: 5732.3, 300 sec: 5728.2). Total num frames: 159247360. Throughput: 0: 6003.9. Samples: 159245026. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:34,553][25689] Avg episode reward: [(0, '-52.842')] [2022-07-09 07:50:35,969][26022] Updated weights on worker 0-0, policy_version 155522 (0.00085) [2022-07-09 07:50:37,494][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:50:37,506][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000155532_159264768.pth [2022-07-09 07:50:37,506][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000153515_157199360.pth [2022-07-09 07:50:37,508][26022] Updated weights on worker 0-0, policy_version 155532 (0.00090) [2022-07-09 07:50:39,157][26022] Updated weights on worker 0-0, policy_version 155542 (0.00082) [2022-07-09 07:50:39,607][25689] Fps is (10 sec: 5794.4, 60 sec: 5737.0, 300 sec: 5734.4). Total num frames: 159277056. Throughput: 0: 5998.4. Samples: 159279774. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:39,607][25689] Avg episode reward: [(0, '-53.538')] [2022-07-09 07:50:41,174][26022] Updated weights on worker 0-0, policy_version 155552 (0.00092) [2022-07-09 07:50:42,604][26022] Updated weights on worker 0-0, policy_version 155562 (0.00086) [2022-07-09 07:50:44,614][25689] Fps is (10 sec: 5698.6, 60 sec: 5721.9, 300 sec: 5727.8). Total num frames: 159304704. Throughput: 0: 6013.6. Samples: 159314344. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:44,615][25689] Avg episode reward: [(0, '-54.379')] [2022-07-09 07:50:44,700][26022] Updated weights on worker 0-0, policy_version 155572 (0.00089) [2022-07-09 07:50:46,395][26022] Updated weights on worker 0-0, policy_version 155582 (0.00095) [2022-07-09 07:50:48,205][26022] Updated weights on worker 0-0, policy_version 155592 (0.00090) [2022-07-09 07:50:49,645][25689] Fps is (10 sec: 5712.5, 60 sec: 5737.7, 300 sec: 5731.3). Total num frames: 159334400. Throughput: 0: 5171.7. Samples: 159331806. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:49,646][25689] Avg episode reward: [(0, '-54.298')] [2022-07-09 07:50:49,895][26022] Updated weights on worker 0-0, policy_version 155602 (0.00087) [2022-07-09 07:50:51,911][26022] Updated weights on worker 0-0, policy_version 155612 (0.00081) [2022-07-09 07:50:53,379][26022] Updated weights on worker 0-0, policy_version 155622 (0.00085) [2022-07-09 07:50:54,660][25689] Fps is (10 sec: 5810.2, 60 sec: 5709.8, 300 sec: 5728.5). Total num frames: 159363072. Throughput: 0: 6047.7. Samples: 159366636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:54,660][25689] Avg episode reward: [(0, '-54.348')] [2022-07-09 07:50:55,427][26022] Updated weights on worker 0-0, policy_version 155632 (0.00085) [2022-07-09 07:50:56,802][26022] Updated weights on worker 0-0, policy_version 155642 (0.00086) [2022-07-09 07:50:58,940][26022] Updated weights on worker 0-0, policy_version 155652 (0.00086) [2022-07-09 07:50:59,729][25689] Fps is (10 sec: 5889.8, 60 sec: 5744.5, 300 sec: 5741.2). Total num frames: 159393792. Throughput: 0: 6046.4. Samples: 159401442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:50:59,730][25689] Avg episode reward: [(0, '-54.783')] [2022-07-09 07:51:00,251][26022] Updated weights on worker 0-0, policy_version 155662 (0.00085) [2022-07-09 07:51:02,740][26022] Updated weights on worker 0-0, policy_version 155672 (0.00085) [2022-07-09 07:51:04,353][26022] Updated weights on worker 0-0, policy_version 155682 (0.00087) [2022-07-09 07:51:04,740][25689] Fps is (10 sec: 5587.2, 60 sec: 5734.0, 300 sec: 5737.8). Total num frames: 159419392. Throughput: 0: 5081.9. Samples: 159416622. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:04,741][25689] Avg episode reward: [(0, '-53.759')] [2022-07-09 07:51:06,131][26022] Updated weights on worker 0-0, policy_version 155692 (0.00085) [2022-07-09 07:51:08,121][26022] Updated weights on worker 0-0, policy_version 155702 (0.00085) [2022-07-09 07:51:09,554][26022] Updated weights on worker 0-0, policy_version 155712 (0.00083) [2022-07-09 07:51:09,794][25689] Fps is (10 sec: 5493.9, 60 sec: 5714.2, 300 sec: 5730.1). Total num frames: 159449088. Throughput: 0: 5936.1. Samples: 159451412. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:09,795][25689] Avg episode reward: [(0, '-52.350')] [2022-07-09 07:51:11,382][26022] Updated weights on worker 0-0, policy_version 155722 (0.00091) [2022-07-09 07:51:13,422][26022] Updated weights on worker 0-0, policy_version 155732 (0.00078) [2022-07-09 07:51:14,805][25689] Fps is (10 sec: 5901.0, 60 sec: 5750.0, 300 sec: 5737.6). Total num frames: 159478784. Throughput: 0: 5917.8. Samples: 159485850. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:14,806][25689] Avg episode reward: [(0, '-51.446')] [2022-07-09 07:51:15,000][26022] Updated weights on worker 0-0, policy_version 155742 (0.00089) [2022-07-09 07:51:16,903][26022] Updated weights on worker 0-0, policy_version 155752 (0.00079) [2022-07-09 07:51:18,544][26022] Updated weights on worker 0-0, policy_version 155762 (0.00082) [2022-07-09 07:51:19,924][25689] Fps is (10 sec: 5762.2, 60 sec: 5726.1, 300 sec: 5736.0). Total num frames: 159507456. Throughput: 0: 5055.1. Samples: 159503528. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:19,924][25689] Avg episode reward: [(0, '-51.581')] [2022-07-09 07:51:20,397][26022] Updated weights on worker 0-0, policy_version 155772 (0.00109) [2022-07-09 07:51:22,168][26022] Updated weights on worker 0-0, policy_version 155782 (0.00093) [2022-07-09 07:51:23,777][26022] Updated weights on worker 0-0, policy_version 155792 (0.00085) [2022-07-09 07:51:24,988][25689] Fps is (10 sec: 5731.8, 60 sec: 5740.3, 300 sec: 5738.5). Total num frames: 159537152. Throughput: 0: 6020.2. Samples: 159538518. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:24,990][25689] Avg episode reward: [(0, '-50.884')] [2022-07-09 07:51:25,603][26022] Updated weights on worker 0-0, policy_version 155802 (0.00081) [2022-07-09 07:51:27,519][26022] Updated weights on worker 0-0, policy_version 155812 (0.00092) [2022-07-09 07:51:29,137][26022] Updated weights on worker 0-0, policy_version 155822 (0.00086) [2022-07-09 07:51:30,074][25689] Fps is (10 sec: 5851.1, 60 sec: 5750.8, 300 sec: 5740.8). Total num frames: 159566848. Throughput: 0: 6018.2. Samples: 159573462. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:30,075][25689] Avg episode reward: [(0, '-50.729')] [2022-07-09 07:51:31,112][26022] Updated weights on worker 0-0, policy_version 155832 (0.00089) [2022-07-09 07:51:32,512][26022] Updated weights on worker 0-0, policy_version 155842 (0.00086) [2022-07-09 07:51:34,701][26022] Updated weights on worker 0-0, policy_version 155852 (0.00079) [2022-07-09 07:51:35,132][25689] Fps is (10 sec: 5652.9, 60 sec: 5730.1, 300 sec: 5727.4). Total num frames: 159594496. Throughput: 0: 5162.1. Samples: 159590782. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:35,133][25689] Avg episode reward: [(0, '-50.499')] [2022-07-09 07:51:36,038][26022] Updated weights on worker 0-0, policy_version 155862 (0.00085) [2022-07-09 07:51:38,286][26022] Updated weights on worker 0-0, policy_version 155872 (0.00088) [2022-07-09 07:51:39,694][26022] Updated weights on worker 0-0, policy_version 155882 (0.00087) [2022-07-09 07:51:40,169][25689] Fps is (10 sec: 5680.4, 60 sec: 5731.9, 300 sec: 5741.2). Total num frames: 159624192. Throughput: 0: 6018.4. Samples: 159625374. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:40,171][25689] Avg episode reward: [(0, '-51.521')] [2022-07-09 07:51:41,625][26022] Updated weights on worker 0-0, policy_version 155892 (0.00086) [2022-07-09 07:51:43,270][26022] Updated weights on worker 0-0, policy_version 155902 (0.00093) [2022-07-09 07:51:45,179][25689] Fps is (10 sec: 5809.9, 60 sec: 5748.6, 300 sec: 5741.2). Total num frames: 159652864. Throughput: 0: 6013.2. Samples: 159659928. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 07:51:45,179][25689] Avg episode reward: [(0, '-51.194')] [2022-07-09 07:51:45,192][26022] Updated weights on worker 0-0, policy_version 155912 (0.00081) [2022-07-09 07:51:46,992][26022] Updated weights on worker 0-0, policy_version 155922 (0.00087) [2022-07-09 07:51:48,785][26022] Updated weights on worker 0-0, policy_version 155932 (0.00090) [2022-07-09 07:51:50,203][25689] Fps is (10 sec: 5715.4, 60 sec: 5732.4, 300 sec: 5737.5). Total num frames: 159681536. Throughput: 0: 5150.9. Samples: 159677140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:51:50,203][25689] Avg episode reward: [(0, '-51.810')] [2022-07-09 07:51:50,456][26022] Updated weights on worker 0-0, policy_version 155942 (0.00084) [2022-07-09 07:51:52,235][26022] Updated weights on worker 0-0, policy_version 155952 (0.00094) [2022-07-09 07:51:53,990][26022] Updated weights on worker 0-0, policy_version 155962 (0.00092) [2022-07-09 07:51:55,215][25689] Fps is (10 sec: 5917.9, 60 sec: 5766.4, 300 sec: 5743.0). Total num frames: 159712256. Throughput: 0: 6023.9. Samples: 159711756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:51:55,215][25689] Avg episode reward: [(0, '-52.750')] [2022-07-09 07:51:55,888][26022] Updated weights on worker 0-0, policy_version 155972 (0.00087) [2022-07-09 07:51:57,518][26022] Updated weights on worker 0-0, policy_version 155982 (0.00078) [2022-07-09 07:51:59,362][26022] Updated weights on worker 0-0, policy_version 155992 (0.00083) [2022-07-09 07:52:00,295][25689] Fps is (10 sec: 5885.0, 60 sec: 5731.6, 300 sec: 5745.3). Total num frames: 159740928. Throughput: 0: 6030.5. Samples: 159746740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:00,295][25689] Avg episode reward: [(0, '-52.824')] [2022-07-09 07:52:01,203][26022] Updated weights on worker 0-0, policy_version 156002 (0.00088) [2022-07-09 07:52:03,151][26022] Updated weights on worker 0-0, policy_version 156012 (0.00101) [2022-07-09 07:52:05,015][26022] Updated weights on worker 0-0, policy_version 156022 (0.00085) [2022-07-09 07:52:05,318][25689] Fps is (10 sec: 5574.3, 60 sec: 5764.2, 300 sec: 5745.0). Total num frames: 159768576. Throughput: 0: 5062.6. Samples: 159761886. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:05,319][25689] Avg episode reward: [(0, '-52.019')] [2022-07-09 07:52:06,826][26022] Updated weights on worker 0-0, policy_version 156032 (0.00087) [2022-07-09 07:52:08,589][26022] Updated weights on worker 0-0, policy_version 156042 (0.00085) [2022-07-09 07:52:10,339][25689] Fps is (10 sec: 5505.1, 60 sec: 5733.5, 300 sec: 5738.0). Total num frames: 159796224. Throughput: 0: 5937.2. Samples: 159796694. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:10,347][25689] Avg episode reward: [(0, '-52.016')] [2022-07-09 07:52:10,418][26022] Updated weights on worker 0-0, policy_version 156052 (0.00093) [2022-07-09 07:52:12,185][26022] Updated weights on worker 0-0, policy_version 156062 (0.00086) [2022-07-09 07:52:13,745][26022] Updated weights on worker 0-0, policy_version 156072 (0.00093) [2022-07-09 07:52:15,431][25689] Fps is (10 sec: 5670.6, 60 sec: 5725.9, 300 sec: 5740.5). Total num frames: 159825920. Throughput: 0: 5934.1. Samples: 159831720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:15,431][25689] Avg episode reward: [(0, '-51.611')] [2022-07-09 07:52:15,826][26022] Updated weights on worker 0-0, policy_version 156082 (0.00088) [2022-07-09 07:52:17,375][26022] Updated weights on worker 0-0, policy_version 156092 (0.00082) [2022-07-09 07:52:19,383][26022] Updated weights on worker 0-0, policy_version 156102 (0.00113) [2022-07-09 07:52:20,545][25689] Fps is (10 sec: 5919.7, 60 sec: 5760.1, 300 sec: 5738.7). Total num frames: 159856640. Throughput: 0: 5040.4. Samples: 159848810. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:20,546][25689] Avg episode reward: [(0, '-51.874')] [2022-07-09 07:52:20,983][26022] Updated weights on worker 0-0, policy_version 156112 (0.00085) [2022-07-09 07:52:22,763][26022] Updated weights on worker 0-0, policy_version 156122 (0.00084) [2022-07-09 07:52:24,504][26022] Updated weights on worker 0-0, policy_version 156132 (0.00087) [2022-07-09 07:52:25,567][25689] Fps is (10 sec: 5758.4, 60 sec: 5730.3, 300 sec: 5742.1). Total num frames: 159884288. Throughput: 0: 6010.7. Samples: 159883594. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:25,567][25689] Avg episode reward: [(0, '-51.885')] [2022-07-09 07:52:26,456][26022] Updated weights on worker 0-0, policy_version 156142 (0.00087) [2022-07-09 07:52:27,957][26022] Updated weights on worker 0-0, policy_version 156152 (0.00071) [2022-07-09 07:52:30,178][26022] Updated weights on worker 0-0, policy_version 156162 (0.00090) [2022-07-09 07:52:30,570][25689] Fps is (10 sec: 5720.4, 60 sec: 5738.2, 300 sec: 5745.8). Total num frames: 159913984. Throughput: 0: 6001.7. Samples: 159918110. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:30,570][25689] Avg episode reward: [(0, '-52.332')] [2022-07-09 07:52:31,647][26022] Updated weights on worker 0-0, policy_version 156172 (0.00084) [2022-07-09 07:52:33,331][26022] Updated weights on worker 0-0, policy_version 156182 (0.00082) [2022-07-09 07:52:35,091][26022] Updated weights on worker 0-0, policy_version 156192 (0.00084) [2022-07-09 07:52:35,606][25689] Fps is (10 sec: 5711.9, 60 sec: 5740.3, 300 sec: 5737.2). Total num frames: 159941632. Throughput: 0: 5140.5. Samples: 159935432. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:35,607][25689] Avg episode reward: [(0, '-52.674')] [2022-07-09 07:52:37,164][26022] Updated weights on worker 0-0, policy_version 156202 (0.00091) [2022-07-09 07:52:37,577][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:52:37,596][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000156205_159953920.pth [2022-07-09 07:52:37,596][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000154187_157887488.pth [2022-07-09 07:52:38,912][26022] Updated weights on worker 0-0, policy_version 156212 (0.00089) [2022-07-09 07:52:40,714][25689] Fps is (10 sec: 5551.9, 60 sec: 5716.7, 300 sec: 5739.2). Total num frames: 159970304. Throughput: 0: 6003.0. Samples: 159969884. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:40,714][25689] Avg episode reward: [(0, '-53.039')] [2022-07-09 07:52:40,827][26022] Updated weights on worker 0-0, policy_version 156222 (0.00092) [2022-07-09 07:52:42,403][26022] Updated weights on worker 0-0, policy_version 156232 (0.00082) [2022-07-09 07:52:44,215][26022] Updated weights on worker 0-0, policy_version 156242 (0.00088) [2022-07-09 07:52:45,722][25689] Fps is (10 sec: 5871.3, 60 sec: 5750.6, 300 sec: 5742.9). Total num frames: 160001024. Throughput: 0: 5997.9. Samples: 160004482. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:45,724][25689] Avg episode reward: [(0, '-52.527')] [2022-07-09 07:52:45,857][26022] Updated weights on worker 0-0, policy_version 156252 (0.00087) [2022-07-09 07:52:47,762][26022] Updated weights on worker 0-0, policy_version 156262 (0.00089) [2022-07-09 07:52:49,359][26022] Updated weights on worker 0-0, policy_version 156272 (0.00083) [2022-07-09 07:52:50,728][25689] Fps is (10 sec: 5726.1, 60 sec: 5718.4, 300 sec: 5732.8). Total num frames: 160027648. Throughput: 0: 5994.8. Samples: 160038958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:50,729][25689] Avg episode reward: [(0, '-52.637')] [2022-07-09 07:52:51,291][26022] Updated weights on worker 0-0, policy_version 156282 (0.00088) [2022-07-09 07:52:52,868][26022] Updated weights on worker 0-0, policy_version 156292 (0.00094) [2022-07-09 07:52:54,842][26022] Updated weights on worker 0-0, policy_version 156302 (0.00081) [2022-07-09 07:52:55,746][25689] Fps is (10 sec: 5720.5, 60 sec: 5717.9, 300 sec: 5738.1). Total num frames: 160058368. Throughput: 0: 6012.5. Samples: 160056524. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:52:55,747][25689] Avg episode reward: [(0, '-52.970')] [2022-07-09 07:52:56,462][26022] Updated weights on worker 0-0, policy_version 156312 (0.00085) [2022-07-09 07:52:58,347][26022] Updated weights on worker 0-0, policy_version 156322 (0.00086) [2022-07-09 07:52:59,999][26022] Updated weights on worker 0-0, policy_version 156332 (0.00089) [2022-07-09 07:53:00,837][25689] Fps is (10 sec: 5875.4, 60 sec: 5716.9, 300 sec: 5744.9). Total num frames: 160087040. Throughput: 0: 6033.8. Samples: 160091302. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:00,839][25689] Avg episode reward: [(0, '-52.604')] [2022-07-09 07:53:01,916][26022] Updated weights on worker 0-0, policy_version 156342 (0.00084) [2022-07-09 07:53:04,065][26022] Updated weights on worker 0-0, policy_version 156352 (0.00096) [2022-07-09 07:53:05,889][25689] Fps is (10 sec: 5552.9, 60 sec: 5714.2, 300 sec: 5741.6). Total num frames: 160114688. Throughput: 0: 5928.7. Samples: 160124044. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:05,889][25689] Avg episode reward: [(0, '-52.727')] [2022-07-09 07:53:05,897][26022] Updated weights on worker 0-0, policy_version 156362 (0.00088) [2022-07-09 07:53:07,649][26022] Updated weights on worker 0-0, policy_version 156372 (0.00081) [2022-07-09 07:53:09,440][26022] Updated weights on worker 0-0, policy_version 156382 (0.00089) [2022-07-09 07:53:10,893][25689] Fps is (10 sec: 5600.5, 60 sec: 5732.7, 300 sec: 5734.9). Total num frames: 160143360. Throughput: 0: 5067.0. Samples: 160141134. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:10,894][25689] Avg episode reward: [(0, '-52.465')] [2022-07-09 07:53:11,119][26022] Updated weights on worker 0-0, policy_version 156392 (0.00082) [2022-07-09 07:53:12,943][26022] Updated weights on worker 0-0, policy_version 156402 (0.00059) [2022-07-09 07:53:14,608][26022] Updated weights on worker 0-0, policy_version 156412 (0.00087) [2022-07-09 07:53:15,925][25689] Fps is (10 sec: 5713.5, 60 sec: 5721.4, 300 sec: 5731.9). Total num frames: 160172032. Throughput: 0: 5924.6. Samples: 160176076. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:15,926][25689] Avg episode reward: [(0, '-52.860')] [2022-07-09 07:53:16,530][26022] Updated weights on worker 0-0, policy_version 156422 (0.00094) [2022-07-09 07:53:18,211][26022] Updated weights on worker 0-0, policy_version 156432 (0.00089) [2022-07-09 07:53:20,123][26022] Updated weights on worker 0-0, policy_version 156442 (0.00082) [2022-07-09 07:53:20,987][25689] Fps is (10 sec: 5782.5, 60 sec: 5709.4, 300 sec: 5737.7). Total num frames: 160201728. Throughput: 0: 5911.5. Samples: 160210420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:20,988][25689] Avg episode reward: [(0, '-52.700')] [2022-07-09 07:53:21,615][26022] Updated weights on worker 0-0, policy_version 156452 (0.00087) [2022-07-09 07:53:23,515][26022] Updated weights on worker 0-0, policy_version 156462 (0.00089) [2022-07-09 07:53:25,389][26022] Updated weights on worker 0-0, policy_version 156472 (0.00446) [2022-07-09 07:53:25,998][25689] Fps is (10 sec: 5794.4, 60 sec: 5727.3, 300 sec: 5734.2). Total num frames: 160230400. Throughput: 0: 5164.6. Samples: 160227902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:25,999][25689] Avg episode reward: [(0, '-52.008')] [2022-07-09 07:53:27,189][26022] Updated weights on worker 0-0, policy_version 156482 (0.00090) [2022-07-09 07:53:28,821][26022] Updated weights on worker 0-0, policy_version 156492 (0.00084) [2022-07-09 07:53:30,888][26022] Updated weights on worker 0-0, policy_version 156502 (0.00096) [2022-07-09 07:53:31,043][25689] Fps is (10 sec: 5600.4, 60 sec: 5689.5, 300 sec: 5730.1). Total num frames: 160258048. Throughput: 0: 6024.5. Samples: 160262530. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:31,049][25689] Avg episode reward: [(0, '-53.115')] [2022-07-09 07:53:32,339][26022] Updated weights on worker 0-0, policy_version 156512 (0.00090) [2022-07-09 07:53:34,501][26022] Updated weights on worker 0-0, policy_version 156522 (0.00085) [2022-07-09 07:53:35,747][26022] Updated weights on worker 0-0, policy_version 156532 (0.00092) [2022-07-09 07:53:36,056][25689] Fps is (10 sec: 5904.9, 60 sec: 5759.5, 300 sec: 5735.9). Total num frames: 160289792. Throughput: 0: 6027.2. Samples: 160297412. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:36,057][25689] Avg episode reward: [(0, '-52.838')] [2022-07-09 07:53:37,860][26022] Updated weights on worker 0-0, policy_version 156542 (0.00092) [2022-07-09 07:53:39,609][26022] Updated weights on worker 0-0, policy_version 156552 (0.00085) [2022-07-09 07:53:41,131][25689] Fps is (10 sec: 5887.5, 60 sec: 5745.6, 300 sec: 5734.9). Total num frames: 160317440. Throughput: 0: 5169.1. Samples: 160314548. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:41,132][25689] Avg episode reward: [(0, '-52.263')] [2022-07-09 07:53:41,278][26022] Updated weights on worker 0-0, policy_version 156562 (0.00085) [2022-07-09 07:53:43,043][26022] Updated weights on worker 0-0, policy_version 156572 (0.00897) [2022-07-09 07:53:44,906][26022] Updated weights on worker 0-0, policy_version 156582 (0.00087) [2022-07-09 07:53:46,144][25689] Fps is (10 sec: 5684.8, 60 sec: 5728.3, 300 sec: 5735.1). Total num frames: 160347136. Throughput: 0: 6043.3. Samples: 160349646. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:46,144][25689] Avg episode reward: [(0, '-51.719')] [2022-07-09 07:53:46,449][26022] Updated weights on worker 0-0, policy_version 156592 (0.00086) [2022-07-09 07:53:48,479][26022] Updated weights on worker 0-0, policy_version 156602 (0.00084) [2022-07-09 07:53:49,897][26022] Updated weights on worker 0-0, policy_version 156612 (0.00091) [2022-07-09 07:53:51,199][25689] Fps is (10 sec: 5696.2, 60 sec: 5740.6, 300 sec: 5734.5). Total num frames: 160374784. Throughput: 0: 6041.0. Samples: 160384286. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 07:53:51,199][25689] Avg episode reward: [(0, '-52.613')] [2022-07-09 07:53:51,962][26022] Updated weights on worker 0-0, policy_version 156622 (0.00086) [2022-07-09 07:53:53,415][26022] Updated weights on worker 0-0, policy_version 156632 (0.00076) [2022-07-09 07:53:55,548][26022] Updated weights on worker 0-0, policy_version 156642 (0.00082) [2022-07-09 07:53:56,204][25689] Fps is (10 sec: 5699.8, 60 sec: 5724.8, 300 sec: 5732.0). Total num frames: 160404480. Throughput: 0: 5163.1. Samples: 160401438. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:53:56,205][25689] Avg episode reward: [(0, '-53.092')] [2022-07-09 07:53:57,366][26022] Updated weights on worker 0-0, policy_version 156652 (0.00084) [2022-07-09 07:53:58,954][26022] Updated weights on worker 0-0, policy_version 156662 (0.00093) [2022-07-09 07:54:00,931][26022] Updated weights on worker 0-0, policy_version 156672 (0.00083) [2022-07-09 07:54:01,280][25689] Fps is (10 sec: 5891.3, 60 sec: 5743.2, 300 sec: 5741.0). Total num frames: 160434176. Throughput: 0: 6026.2. Samples: 160435966. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:01,282][25689] Avg episode reward: [(0, '-52.933')] [2022-07-09 07:54:02,801][26022] Updated weights on worker 0-0, policy_version 156682 (0.00086) [2022-07-09 07:54:04,892][26022] Updated weights on worker 0-0, policy_version 156692 (0.00093) [2022-07-09 07:54:06,299][25689] Fps is (10 sec: 5579.4, 60 sec: 5729.4, 300 sec: 5733.8). Total num frames: 160460800. Throughput: 0: 5910.7. Samples: 160468776. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:06,300][25689] Avg episode reward: [(0, '-53.203')] [2022-07-09 07:54:06,424][26022] Updated weights on worker 0-0, policy_version 156702 (0.00091) [2022-07-09 07:54:08,307][26022] Updated weights on worker 0-0, policy_version 156712 (0.00089) [2022-07-09 07:54:10,116][26022] Updated weights on worker 0-0, policy_version 156722 (0.00086) [2022-07-09 07:54:11,309][25689] Fps is (10 sec: 5615.5, 60 sec: 5745.8, 300 sec: 5734.2). Total num frames: 160490496. Throughput: 0: 5054.3. Samples: 160485932. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:11,310][25689] Avg episode reward: [(0, '-53.675')] [2022-07-09 07:54:11,852][26022] Updated weights on worker 0-0, policy_version 156732 (0.00085) [2022-07-09 07:54:13,645][26022] Updated weights on worker 0-0, policy_version 156742 (0.00086) [2022-07-09 07:54:15,321][26022] Updated weights on worker 0-0, policy_version 156752 (0.00091) [2022-07-09 07:54:16,313][25689] Fps is (10 sec: 5828.3, 60 sec: 5748.4, 300 sec: 5736.2). Total num frames: 160519168. Throughput: 0: 5931.4. Samples: 160520712. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:16,315][25689] Avg episode reward: [(0, '-53.799')] [2022-07-09 07:54:17,165][26022] Updated weights on worker 0-0, policy_version 156762 (0.00098) [2022-07-09 07:54:18,989][26022] Updated weights on worker 0-0, policy_version 156772 (0.00086) [2022-07-09 07:54:20,765][26022] Updated weights on worker 0-0, policy_version 156782 (0.00084) [2022-07-09 07:54:21,358][25689] Fps is (10 sec: 5706.7, 60 sec: 5733.1, 300 sec: 5732.2). Total num frames: 160547840. Throughput: 0: 5941.1. Samples: 160555252. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:21,358][25689] Avg episode reward: [(0, '-53.605')] [2022-07-09 07:54:22,536][26022] Updated weights on worker 0-0, policy_version 156792 (0.00091) [2022-07-09 07:54:24,322][26022] Updated weights on worker 0-0, policy_version 156802 (0.00433) [2022-07-09 07:54:25,979][26022] Updated weights on worker 0-0, policy_version 156812 (0.00093) [2022-07-09 07:54:26,387][25689] Fps is (10 sec: 5794.1, 60 sec: 5748.4, 300 sec: 5735.9). Total num frames: 160577536. Throughput: 0: 5166.9. Samples: 160572570. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:26,387][25689] Avg episode reward: [(0, '-53.372')] [2022-07-09 07:54:27,935][26022] Updated weights on worker 0-0, policy_version 156822 (0.00078) [2022-07-09 07:54:29,475][26022] Updated weights on worker 0-0, policy_version 156832 (0.00084) [2022-07-09 07:54:31,353][26022] Updated weights on worker 0-0, policy_version 156842 (0.00091) [2022-07-09 07:54:31,418][25689] Fps is (10 sec: 5801.9, 60 sec: 5766.7, 300 sec: 5735.9). Total num frames: 160606208. Throughput: 0: 6041.2. Samples: 160607414. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:31,419][25689] Avg episode reward: [(0, '-53.555')] [2022-07-09 07:54:33,167][26022] Updated weights on worker 0-0, policy_version 156852 (0.00096) [2022-07-09 07:54:34,907][26022] Updated weights on worker 0-0, policy_version 156862 (0.00084) [2022-07-09 07:54:36,459][25689] Fps is (10 sec: 5591.9, 60 sec: 5696.2, 300 sec: 5730.2). Total num frames: 160633856. Throughput: 0: 6024.3. Samples: 160642074. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:36,459][25689] Avg episode reward: [(0, '-53.490')] [2022-07-09 07:54:36,755][26022] Updated weights on worker 0-0, policy_version 156872 (0.00083) [2022-07-09 07:54:37,627][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:54:37,642][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000156878_160643072.pth [2022-07-09 07:54:37,642][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000154860_158576640.pth [2022-07-09 07:54:38,445][26022] Updated weights on worker 0-0, policy_version 156882 (0.00090) [2022-07-09 07:54:40,308][26022] Updated weights on worker 0-0, policy_version 156892 (0.00086) [2022-07-09 07:54:41,563][25689] Fps is (10 sec: 5652.7, 60 sec: 5727.4, 300 sec: 5732.2). Total num frames: 160663552. Throughput: 0: 5153.8. Samples: 160659382. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:41,563][25689] Avg episode reward: [(0, '-52.605')] [2022-07-09 07:54:42,042][26022] Updated weights on worker 0-0, policy_version 156902 (0.00093) [2022-07-09 07:54:43,905][26022] Updated weights on worker 0-0, policy_version 156912 (0.00087) [2022-07-09 07:54:45,411][26022] Updated weights on worker 0-0, policy_version 156922 (0.00096) [2022-07-09 07:54:46,566][25689] Fps is (10 sec: 5977.5, 60 sec: 5745.2, 300 sec: 5739.3). Total num frames: 160694272. Throughput: 0: 6034.7. Samples: 160694344. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:46,566][25689] Avg episode reward: [(0, '-53.345')] [2022-07-09 07:54:47,321][26022] Updated weights on worker 0-0, policy_version 156932 (0.00084) [2022-07-09 07:54:48,954][26022] Updated weights on worker 0-0, policy_version 156942 (0.00082) [2022-07-09 07:54:51,005][26022] Updated weights on worker 0-0, policy_version 156952 (0.00091) [2022-07-09 07:54:51,602][25689] Fps is (10 sec: 5916.0, 60 sec: 5763.9, 300 sec: 5733.3). Total num frames: 160722944. Throughput: 0: 6029.0. Samples: 160729102. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:51,602][25689] Avg episode reward: [(0, '-53.505')] [2022-07-09 07:54:52,507][26022] Updated weights on worker 0-0, policy_version 156962 (0.00084) [2022-07-09 07:54:54,381][26022] Updated weights on worker 0-0, policy_version 156972 (0.00087) [2022-07-09 07:54:55,996][26022] Updated weights on worker 0-0, policy_version 156982 (0.00089) [2022-07-09 07:54:56,659][25689] Fps is (10 sec: 5681.7, 60 sec: 5742.1, 300 sec: 5733.7). Total num frames: 160751616. Throughput: 0: 5174.6. Samples: 160746602. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:54:56,659][25689] Avg episode reward: [(0, '-53.436')] [2022-07-09 07:54:57,879][26022] Updated weights on worker 0-0, policy_version 156992 (0.00082) [2022-07-09 07:54:59,695][26022] Updated weights on worker 0-0, policy_version 157002 (0.00085) [2022-07-09 07:55:01,418][26022] Updated weights on worker 0-0, policy_version 157012 (0.00085) [2022-07-09 07:55:01,757][25689] Fps is (10 sec: 5747.5, 60 sec: 5739.9, 300 sec: 5743.6). Total num frames: 160781312. Throughput: 0: 6039.4. Samples: 160781344. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:01,758][25689] Avg episode reward: [(0, '-53.687')] [2022-07-09 07:55:03,598][26022] Updated weights on worker 0-0, policy_version 157022 (0.00086) [2022-07-09 07:55:05,583][26022] Updated weights on worker 0-0, policy_version 157032 (0.00083) [2022-07-09 07:55:06,772][25689] Fps is (10 sec: 5771.5, 60 sec: 5774.2, 300 sec: 5736.9). Total num frames: 160809984. Throughput: 0: 5923.8. Samples: 160814040. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:06,772][25689] Avg episode reward: [(0, '-54.017')] [2022-07-09 07:55:07,052][26022] Updated weights on worker 0-0, policy_version 157042 (0.00085) [2022-07-09 07:55:08,966][26022] Updated weights on worker 0-0, policy_version 157052 (0.00093) [2022-07-09 07:55:10,613][26022] Updated weights on worker 0-0, policy_version 157062 (0.00358) [2022-07-09 07:55:11,821][25689] Fps is (10 sec: 5494.5, 60 sec: 5719.8, 300 sec: 5733.1). Total num frames: 160836608. Throughput: 0: 5060.5. Samples: 160831422. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:11,821][25689] Avg episode reward: [(0, '-54.576')] [2022-07-09 07:55:12,295][26022] Updated weights on worker 0-0, policy_version 157072 (0.00089) [2022-07-09 07:55:14,173][26022] Updated weights on worker 0-0, policy_version 157082 (0.00086) [2022-07-09 07:55:15,847][26022] Updated weights on worker 0-0, policy_version 157092 (0.00503) [2022-07-09 07:55:16,839][25689] Fps is (10 sec: 5696.0, 60 sec: 5752.2, 300 sec: 5737.1). Total num frames: 160867328. Throughput: 0: 5926.2. Samples: 160866196. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:16,840][25689] Avg episode reward: [(0, '-53.162')] [2022-07-09 07:55:17,817][26022] Updated weights on worker 0-0, policy_version 157102 (0.00090) [2022-07-09 07:55:19,588][26022] Updated weights on worker 0-0, policy_version 157112 (0.00090) [2022-07-09 07:55:21,203][26022] Updated weights on worker 0-0, policy_version 157122 (0.00078) [2022-07-09 07:55:21,911][25689] Fps is (10 sec: 5987.4, 60 sec: 5766.5, 300 sec: 5739.8). Total num frames: 160897024. Throughput: 0: 5919.9. Samples: 160900656. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:21,912][25689] Avg episode reward: [(0, '-53.902')] [2022-07-09 07:55:23,150][26022] Updated weights on worker 0-0, policy_version 157132 (0.00080) [2022-07-09 07:55:24,766][26022] Updated weights on worker 0-0, policy_version 157142 (0.00085) [2022-07-09 07:55:26,639][26022] Updated weights on worker 0-0, policy_version 157152 (0.00087) [2022-07-09 07:55:26,950][25689] Fps is (10 sec: 5772.9, 60 sec: 5748.7, 300 sec: 5739.5). Total num frames: 160925696. Throughput: 0: 5158.5. Samples: 160918128. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:26,951][25689] Avg episode reward: [(0, '-53.735')] [2022-07-09 07:55:28,351][26022] Updated weights on worker 0-0, policy_version 157162 (0.00084) [2022-07-09 07:55:30,175][26022] Updated weights on worker 0-0, policy_version 157172 (0.00091) [2022-07-09 07:55:32,001][25689] Fps is (10 sec: 5581.8, 60 sec: 5729.9, 300 sec: 5735.4). Total num frames: 160953344. Throughput: 0: 6015.1. Samples: 160952810. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:32,003][25689] Avg episode reward: [(0, '-54.147')] [2022-07-09 07:55:32,101][26022] Updated weights on worker 0-0, policy_version 157182 (0.00084) [2022-07-09 07:55:33,787][26022] Updated weights on worker 0-0, policy_version 157192 (0.00079) [2022-07-09 07:55:35,494][26022] Updated weights on worker 0-0, policy_version 157202 (0.00086) [2022-07-09 07:55:37,068][25689] Fps is (10 sec: 5869.9, 60 sec: 5795.0, 300 sec: 5742.0). Total num frames: 160985088. Throughput: 0: 6004.7. Samples: 160987664. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:37,070][25689] Avg episode reward: [(0, '-54.913')] [2022-07-09 07:55:37,075][26022] Updated weights on worker 0-0, policy_version 157212 (0.00308) [2022-07-09 07:55:38,998][26022] Updated weights on worker 0-0, policy_version 157222 (0.00089) [2022-07-09 07:55:40,766][26022] Updated weights on worker 0-0, policy_version 157232 (0.00086) [2022-07-09 07:55:42,103][25689] Fps is (10 sec: 5879.7, 60 sec: 5767.8, 300 sec: 5741.5). Total num frames: 161012736. Throughput: 0: 5162.0. Samples: 161004884. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:42,103][25689] Avg episode reward: [(0, '-54.593')] [2022-07-09 07:55:42,663][26022] Updated weights on worker 0-0, policy_version 157242 (0.00085) [2022-07-09 07:55:44,335][26022] Updated weights on worker 0-0, policy_version 157252 (0.00082) [2022-07-09 07:55:46,051][26022] Updated weights on worker 0-0, policy_version 157262 (0.00104) [2022-07-09 07:55:47,130][25689] Fps is (10 sec: 5597.3, 60 sec: 5731.7, 300 sec: 5738.1). Total num frames: 161041408. Throughput: 0: 6030.6. Samples: 161039826. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:47,131][25689] Avg episode reward: [(0, '-54.842')] [2022-07-09 07:55:47,776][26022] Updated weights on worker 0-0, policy_version 157272 (0.00092) [2022-07-09 07:55:49,742][26022] Updated weights on worker 0-0, policy_version 157282 (0.00103) [2022-07-09 07:55:51,376][26022] Updated weights on worker 0-0, policy_version 157292 (0.00094) [2022-07-09 07:55:52,141][25689] Fps is (10 sec: 5712.3, 60 sec: 5734.0, 300 sec: 5738.2). Total num frames: 161070080. Throughput: 0: 6038.7. Samples: 161074430. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:52,142][25689] Avg episode reward: [(0, '-53.842')] [2022-07-09 07:55:53,203][26022] Updated weights on worker 0-0, policy_version 157302 (0.00092) [2022-07-09 07:55:54,828][26022] Updated weights on worker 0-0, policy_version 157312 (0.00089) [2022-07-09 07:55:56,959][26022] Updated weights on worker 0-0, policy_version 157322 (0.00102) [2022-07-09 07:55:57,156][25689] Fps is (10 sec: 5822.1, 60 sec: 5755.0, 300 sec: 5735.8). Total num frames: 161099776. Throughput: 0: 6042.5. Samples: 161109042. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:55:57,156][25689] Avg episode reward: [(0, '-53.933')] [2022-07-09 07:55:58,478][26022] Updated weights on worker 0-0, policy_version 157332 (0.00082) [2022-07-09 07:56:00,419][26022] Updated weights on worker 0-0, policy_version 157342 (0.00080) [2022-07-09 07:56:02,203][25689] Fps is (10 sec: 5699.5, 60 sec: 5726.0, 300 sec: 5742.0). Total num frames: 161127424. Throughput: 0: 6052.5. Samples: 161126540. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 07:56:02,203][25689] Avg episode reward: [(0, '-52.978')] [2022-07-09 07:56:02,318][26022] Updated weights on worker 0-0, policy_version 157352 (0.00086) [2022-07-09 07:56:04,279][26022] Updated weights on worker 0-0, policy_version 157362 (0.00084) [2022-07-09 07:56:05,995][26022] Updated weights on worker 0-0, policy_version 157372 (0.00083) [2022-07-09 07:56:07,213][25689] Fps is (10 sec: 5498.3, 60 sec: 5709.5, 300 sec: 5736.0). Total num frames: 161155072. Throughput: 0: 5950.0. Samples: 161159316. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:07,213][25689] Avg episode reward: [(0, '-53.179')] [2022-07-09 07:56:07,736][26022] Updated weights on worker 0-0, policy_version 157382 (0.00085) [2022-07-09 07:56:09,467][26022] Updated weights on worker 0-0, policy_version 157392 (0.00088) [2022-07-09 07:56:11,395][26022] Updated weights on worker 0-0, policy_version 157402 (0.00092) [2022-07-09 07:56:12,258][25689] Fps is (10 sec: 5703.0, 60 sec: 5760.7, 300 sec: 5735.3). Total num frames: 161184768. Throughput: 0: 5937.2. Samples: 161193864. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:12,258][25689] Avg episode reward: [(0, '-52.414')] [2022-07-09 07:56:13,072][26022] Updated weights on worker 0-0, policy_version 157412 (0.00087) [2022-07-09 07:56:14,879][26022] Updated weights on worker 0-0, policy_version 157422 (0.00087) [2022-07-09 07:56:16,645][26022] Updated weights on worker 0-0, policy_version 157432 (0.00097) [2022-07-09 07:56:17,287][25689] Fps is (10 sec: 5793.8, 60 sec: 5725.8, 300 sec: 5737.0). Total num frames: 161213440. Throughput: 0: 5065.9. Samples: 161211024. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:17,288][25689] Avg episode reward: [(0, '-52.670')] [2022-07-09 07:56:18,476][26022] Updated weights on worker 0-0, policy_version 157442 (0.00090) [2022-07-09 07:56:20,188][26022] Updated weights on worker 0-0, policy_version 157452 (0.00094) [2022-07-09 07:56:21,945][26022] Updated weights on worker 0-0, policy_version 157462 (0.00081) [2022-07-09 07:56:22,370][25689] Fps is (10 sec: 5873.2, 60 sec: 5741.7, 300 sec: 5740.1). Total num frames: 161244160. Throughput: 0: 5908.7. Samples: 161245702. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:22,371][25689] Avg episode reward: [(0, '-52.927')] [2022-07-09 07:56:23,967][26022] Updated weights on worker 0-0, policy_version 157472 (0.00088) [2022-07-09 07:56:25,427][26022] Updated weights on worker 0-0, policy_version 157482 (0.00087) [2022-07-09 07:56:27,436][25689] Fps is (10 sec: 5649.8, 60 sec: 5705.2, 300 sec: 5730.2). Total num frames: 161270784. Throughput: 0: 5996.1. Samples: 161280582. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:27,437][25689] Avg episode reward: [(0, '-52.972')] [2022-07-09 07:56:27,448][26022] Updated weights on worker 0-0, policy_version 157492 (0.00089) [2022-07-09 07:56:29,047][26022] Updated weights on worker 0-0, policy_version 157502 (0.00079) [2022-07-09 07:56:30,656][26022] Updated weights on worker 0-0, policy_version 157512 (0.00081) [2022-07-09 07:56:32,444][25689] Fps is (10 sec: 5692.4, 60 sec: 5760.2, 300 sec: 5741.5). Total num frames: 161301504. Throughput: 0: 5160.6. Samples: 161298036. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:32,444][25689] Avg episode reward: [(0, '-52.505')] [2022-07-09 07:56:32,572][26022] Updated weights on worker 0-0, policy_version 157522 (0.00091) [2022-07-09 07:56:34,234][26022] Updated weights on worker 0-0, policy_version 157532 (0.00086) [2022-07-09 07:56:36,130][26022] Updated weights on worker 0-0, policy_version 157542 (0.00083) [2022-07-09 07:56:37,463][25689] Fps is (10 sec: 6025.4, 60 sec: 5730.8, 300 sec: 5741.8). Total num frames: 161331200. Throughput: 0: 6058.5. Samples: 161333264. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:37,464][25689] Avg episode reward: [(0, '-52.685')] [2022-07-09 07:56:37,792][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:56:37,801][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000157552_161333248.pth [2022-07-09 07:56:37,811][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000155532_159264768.pth [2022-07-09 07:56:37,814][26022] Updated weights on worker 0-0, policy_version 157552 (0.00097) [2022-07-09 07:56:39,547][26022] Updated weights on worker 0-0, policy_version 157562 (0.00086) [2022-07-09 07:56:41,310][26022] Updated weights on worker 0-0, policy_version 157572 (0.00096) [2022-07-09 07:56:42,537][25689] Fps is (10 sec: 5884.4, 60 sec: 5761.0, 300 sec: 5744.0). Total num frames: 161360896. Throughput: 0: 6084.2. Samples: 161368402. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:42,537][25689] Avg episode reward: [(0, '-53.646')] [2022-07-09 07:56:43,148][26022] Updated weights on worker 0-0, policy_version 157582 (0.00086) [2022-07-09 07:56:44,654][26022] Updated weights on worker 0-0, policy_version 157592 (0.00088) [2022-07-09 07:56:46,815][26022] Updated weights on worker 0-0, policy_version 157602 (0.00099) [2022-07-09 07:56:47,609][25689] Fps is (10 sec: 5753.3, 60 sec: 5756.8, 300 sec: 5743.1). Total num frames: 161389568. Throughput: 0: 5210.2. Samples: 161385684. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:47,609][25689] Avg episode reward: [(0, '-53.656')] [2022-07-09 07:56:48,154][26022] Updated weights on worker 0-0, policy_version 157612 (0.00082) [2022-07-09 07:56:50,331][26022] Updated weights on worker 0-0, policy_version 157622 (0.00085) [2022-07-09 07:56:51,812][26022] Updated weights on worker 0-0, policy_version 157632 (0.00089) [2022-07-09 07:56:52,635][25689] Fps is (10 sec: 5678.5, 60 sec: 5755.3, 300 sec: 5735.9). Total num frames: 161418240. Throughput: 0: 6058.2. Samples: 161420360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:52,637][25689] Avg episode reward: [(0, '-53.656')] [2022-07-09 07:56:53,770][26022] Updated weights on worker 0-0, policy_version 157642 (0.00101) [2022-07-09 07:56:55,195][26022] Updated weights on worker 0-0, policy_version 157652 (0.00093) [2022-07-09 07:56:57,378][26022] Updated weights on worker 0-0, policy_version 157662 (0.00086) [2022-07-09 07:56:57,658][25689] Fps is (10 sec: 5706.1, 60 sec: 5737.5, 300 sec: 5737.0). Total num frames: 161446912. Throughput: 0: 6035.8. Samples: 161455156. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:56:57,659][25689] Avg episode reward: [(0, '-53.333')] [2022-07-09 07:56:59,039][26022] Updated weights on worker 0-0, policy_version 157672 (0.00085) [2022-07-09 07:57:01,018][26022] Updated weights on worker 0-0, policy_version 157682 (0.00088) [2022-07-09 07:57:02,719][25689] Fps is (10 sec: 5687.0, 60 sec: 5753.2, 300 sec: 5739.8). Total num frames: 161475584. Throughput: 0: 5149.0. Samples: 161472318. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:02,719][25689] Avg episode reward: [(0, '-53.120')] [2022-07-09 07:57:02,800][26022] Updated weights on worker 0-0, policy_version 157692 (0.00086) [2022-07-09 07:57:05,007][26022] Updated weights on worker 0-0, policy_version 157702 (0.00092) [2022-07-09 07:57:06,183][26022] Updated weights on worker 0-0, policy_version 157712 (0.00097) [2022-07-09 07:57:07,757][25689] Fps is (10 sec: 5475.4, 60 sec: 5733.5, 300 sec: 5736.0). Total num frames: 161502208. Throughput: 0: 5943.2. Samples: 161505432. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:07,758][25689] Avg episode reward: [(0, '-52.813')] [2022-07-09 07:57:08,377][26022] Updated weights on worker 0-0, policy_version 157722 (0.00094) [2022-07-09 07:57:09,612][26022] Updated weights on worker 0-0, policy_version 157732 (0.00081) [2022-07-09 07:57:11,800][26022] Updated weights on worker 0-0, policy_version 157742 (0.00102) [2022-07-09 07:57:12,762][25689] Fps is (10 sec: 5811.4, 60 sec: 5771.2, 300 sec: 5744.5). Total num frames: 161533952. Throughput: 0: 5963.7. Samples: 161540392. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:12,764][25689] Avg episode reward: [(0, '-52.685')] [2022-07-09 07:57:13,437][26022] Updated weights on worker 0-0, policy_version 157752 (0.00092) [2022-07-09 07:57:15,207][26022] Updated weights on worker 0-0, policy_version 157762 (0.00087) [2022-07-09 07:57:17,049][26022] Updated weights on worker 0-0, policy_version 157772 (0.00094) [2022-07-09 07:57:17,766][25689] Fps is (10 sec: 6036.2, 60 sec: 5773.6, 300 sec: 5739.8). Total num frames: 161562624. Throughput: 0: 5098.5. Samples: 161557676. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:17,768][25689] Avg episode reward: [(0, '-51.827')] [2022-07-09 07:57:18,968][26022] Updated weights on worker 0-0, policy_version 157782 (0.00086) [2022-07-09 07:57:20,442][26022] Updated weights on worker 0-0, policy_version 157792 (0.00086) [2022-07-09 07:57:22,412][26022] Updated weights on worker 0-0, policy_version 157802 (0.00083) [2022-07-09 07:57:22,909][25689] Fps is (10 sec: 5752.5, 60 sec: 5751.0, 300 sec: 5744.3). Total num frames: 161592320. Throughput: 0: 5937.7. Samples: 161592202. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:22,909][25689] Avg episode reward: [(0, '-51.179')] [2022-07-09 07:57:23,872][26022] Updated weights on worker 0-0, policy_version 157812 (0.00085) [2022-07-09 07:57:25,962][26022] Updated weights on worker 0-0, policy_version 157822 (0.00089) [2022-07-09 07:57:27,679][26022] Updated weights on worker 0-0, policy_version 157832 (0.00086) [2022-07-09 07:57:27,916][25689] Fps is (10 sec: 5750.4, 60 sec: 5790.5, 300 sec: 5740.8). Total num frames: 161620992. Throughput: 0: 6022.6. Samples: 161626842. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:27,917][25689] Avg episode reward: [(0, '-51.721')] [2022-07-09 07:57:29,495][26022] Updated weights on worker 0-0, policy_version 157842 (0.00083) [2022-07-09 07:57:31,174][26022] Updated weights on worker 0-0, policy_version 157852 (0.00082) [2022-07-09 07:57:32,948][25689] Fps is (10 sec: 5712.2, 60 sec: 5754.3, 300 sec: 5744.3). Total num frames: 161649664. Throughput: 0: 5133.0. Samples: 161644004. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:32,950][25689] Avg episode reward: [(0, '-51.822')] [2022-07-09 07:57:33,101][26022] Updated weights on worker 0-0, policy_version 157862 (0.00084) [2022-07-09 07:57:34,702][26022] Updated weights on worker 0-0, policy_version 157872 (0.00080) [2022-07-09 07:57:36,691][26022] Updated weights on worker 0-0, policy_version 157882 (0.00082) [2022-07-09 07:57:37,968][25689] Fps is (10 sec: 5806.9, 60 sec: 5754.3, 300 sec: 5749.5). Total num frames: 161679360. Throughput: 0: 5994.4. Samples: 161678774. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:37,970][25689] Avg episode reward: [(0, '-51.456')] [2022-07-09 07:57:38,363][26022] Updated weights on worker 0-0, policy_version 157892 (0.00095) [2022-07-09 07:57:40,306][26022] Updated weights on worker 0-0, policy_version 157902 (0.00091) [2022-07-09 07:57:41,940][26022] Updated weights on worker 0-0, policy_version 157912 (0.00090) [2022-07-09 07:57:43,020][25689] Fps is (10 sec: 5693.3, 60 sec: 5722.5, 300 sec: 5738.3). Total num frames: 161707008. Throughput: 0: 6007.3. Samples: 161713014. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:43,020][25689] Avg episode reward: [(0, '-52.641')] [2022-07-09 07:57:43,897][26022] Updated weights on worker 0-0, policy_version 157922 (0.00085) [2022-07-09 07:57:45,630][26022] Updated weights on worker 0-0, policy_version 157932 (0.00094) [2022-07-09 07:57:47,346][26022] Updated weights on worker 0-0, policy_version 157942 (0.00083) [2022-07-09 07:57:48,032][25689] Fps is (10 sec: 5697.8, 60 sec: 5745.0, 300 sec: 5748.5). Total num frames: 161736704. Throughput: 0: 5145.7. Samples: 161730352. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:48,033][25689] Avg episode reward: [(0, '-52.098')] [2022-07-09 07:57:49,229][26022] Updated weights on worker 0-0, policy_version 157952 (0.00081) [2022-07-09 07:57:50,735][26022] Updated weights on worker 0-0, policy_version 157962 (0.00087) [2022-07-09 07:57:52,644][26022] Updated weights on worker 0-0, policy_version 157972 (0.00084) [2022-07-09 07:57:53,052][25689] Fps is (10 sec: 5818.4, 60 sec: 5745.7, 300 sec: 5741.6). Total num frames: 161765376. Throughput: 0: 6023.7. Samples: 161765104. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:53,053][25689] Avg episode reward: [(0, '-52.720')] [2022-07-09 07:57:54,378][26022] Updated weights on worker 0-0, policy_version 157982 (0.00080) [2022-07-09 07:57:56,077][26022] Updated weights on worker 0-0, policy_version 157992 (0.00080) [2022-07-09 07:57:58,048][26022] Updated weights on worker 0-0, policy_version 158002 (0.00085) [2022-07-09 07:57:58,065][25689] Fps is (10 sec: 5715.9, 60 sec: 5746.7, 300 sec: 5743.1). Total num frames: 161794048. Throughput: 0: 6023.1. Samples: 161799818. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:57:58,066][25689] Avg episode reward: [(0, '-53.260')] [2022-07-09 07:57:59,823][26022] Updated weights on worker 0-0, policy_version 158012 (0.00090) [2022-07-09 07:58:01,363][26022] Updated weights on worker 0-0, policy_version 158022 (0.00085) [2022-07-09 07:58:03,206][25689] Fps is (10 sec: 5546.1, 60 sec: 5722.0, 300 sec: 5741.4). Total num frames: 161821696. Throughput: 0: 5166.0. Samples: 161817298. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:58:03,207][25689] Avg episode reward: [(0, '-52.914')] [2022-07-09 07:58:03,672][26022] Updated weights on worker 0-0, policy_version 158032 (0.00093) [2022-07-09 07:58:05,312][26022] Updated weights on worker 0-0, policy_version 158042 (0.00089) [2022-07-09 07:58:07,017][26022] Updated weights on worker 0-0, policy_version 158052 (0.00081) [2022-07-09 07:58:08,218][25689] Fps is (10 sec: 5647.9, 60 sec: 5775.4, 300 sec: 5744.7). Total num frames: 161851392. Throughput: 0: 5947.2. Samples: 161850400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 07:58:08,218][25689] Avg episode reward: [(0, '-52.194')] [2022-07-09 07:58:08,943][26022] Updated weights on worker 0-0, policy_version 158062 (0.00091) [2022-07-09 07:58:10,515][26022] Updated weights on worker 0-0, policy_version 158072 (0.00081) [2022-07-09 07:58:12,507][26022] Updated weights on worker 0-0, policy_version 158082 (0.00089) [2022-07-09 07:58:13,259][25689] Fps is (10 sec: 5908.3, 60 sec: 5738.1, 300 sec: 5747.9). Total num frames: 161881088. Throughput: 0: 5955.1. Samples: 161885440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:13,260][25689] Avg episode reward: [(0, '-52.479')] [2022-07-09 07:58:14,054][26022] Updated weights on worker 0-0, policy_version 158092 (0.00084) [2022-07-09 07:58:15,884][26022] Updated weights on worker 0-0, policy_version 158102 (0.00088) [2022-07-09 07:58:17,509][26022] Updated weights on worker 0-0, policy_version 158112 (0.00084) [2022-07-09 07:58:18,289][25689] Fps is (10 sec: 5795.7, 60 sec: 5735.6, 300 sec: 5745.1). Total num frames: 161909760. Throughput: 0: 5102.6. Samples: 161903014. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:18,289][25689] Avg episode reward: [(0, '-52.455')] [2022-07-09 07:58:19,441][26022] Updated weights on worker 0-0, policy_version 158122 (0.00088) [2022-07-09 07:58:21,235][26022] Updated weights on worker 0-0, policy_version 158132 (0.00091) [2022-07-09 07:58:23,117][26022] Updated weights on worker 0-0, policy_version 158142 (0.00082) [2022-07-09 07:58:23,347][25689] Fps is (10 sec: 5684.4, 60 sec: 5726.8, 300 sec: 5744.2). Total num frames: 161938432. Throughput: 0: 5968.7. Samples: 161937508. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:23,347][25689] Avg episode reward: [(0, '-52.593')] [2022-07-09 07:58:24,638][26022] Updated weights on worker 0-0, policy_version 158152 (0.00081) [2022-07-09 07:58:26,767][26022] Updated weights on worker 0-0, policy_version 158162 (0.00082) [2022-07-09 07:58:28,307][26022] Updated weights on worker 0-0, policy_version 158172 (0.00092) [2022-07-09 07:58:28,369][25689] Fps is (10 sec: 5790.2, 60 sec: 5742.3, 300 sec: 5751.5). Total num frames: 161968128. Throughput: 0: 6035.8. Samples: 161972030. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:28,371][25689] Avg episode reward: [(0, '-52.092')] [2022-07-09 07:58:30,208][26022] Updated weights on worker 0-0, policy_version 158182 (0.00080) [2022-07-09 07:58:31,786][26022] Updated weights on worker 0-0, policy_version 158192 (0.00096) [2022-07-09 07:58:33,372][25689] Fps is (10 sec: 5822.0, 60 sec: 5745.0, 300 sec: 5741.4). Total num frames: 161996800. Throughput: 0: 5181.4. Samples: 161989656. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:33,373][25689] Avg episode reward: [(0, '-51.537')] [2022-07-09 07:58:33,587][26022] Updated weights on worker 0-0, policy_version 158202 (0.00087) [2022-07-09 07:58:35,268][26022] Updated weights on worker 0-0, policy_version 158212 (0.00089) [2022-07-09 07:58:37,111][26022] Updated weights on worker 0-0, policy_version 158222 (0.00067) [2022-07-09 07:58:37,845][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 07:58:37,875][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000158226_162023424.pth [2022-07-09 07:58:37,876][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000156205_159953920.pth [2022-07-09 07:58:38,387][25689] Fps is (10 sec: 5724.5, 60 sec: 5728.6, 300 sec: 5746.0). Total num frames: 162025472. Throughput: 0: 6066.1. Samples: 162024930. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:38,387][25689] Avg episode reward: [(0, '-50.963')] [2022-07-09 07:58:38,928][26022] Updated weights on worker 0-0, policy_version 158232 (0.00089) [2022-07-09 07:58:40,768][26022] Updated weights on worker 0-0, policy_version 158242 (0.00091) [2022-07-09 07:58:42,243][26022] Updated weights on worker 0-0, policy_version 158252 (0.00083) [2022-07-09 07:58:43,430][25689] Fps is (10 sec: 5803.3, 60 sec: 5763.3, 300 sec: 5745.4). Total num frames: 162055168. Throughput: 0: 6089.8. Samples: 162059810. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:43,430][25689] Avg episode reward: [(0, '-51.664')] [2022-07-09 07:58:44,241][26022] Updated weights on worker 0-0, policy_version 158262 (0.00081) [2022-07-09 07:58:45,733][26022] Updated weights on worker 0-0, policy_version 158272 (0.00088) [2022-07-09 07:58:47,647][26022] Updated weights on worker 0-0, policy_version 158282 (0.00087) [2022-07-09 07:58:48,442][25689] Fps is (10 sec: 6008.2, 60 sec: 5780.2, 300 sec: 5756.6). Total num frames: 162085888. Throughput: 0: 5247.2. Samples: 162077356. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:48,443][25689] Avg episode reward: [(0, '-52.058')] [2022-07-09 07:58:49,467][26022] Updated weights on worker 0-0, policy_version 158292 (0.00085) [2022-07-09 07:58:51,164][26022] Updated weights on worker 0-0, policy_version 158302 (0.00092) [2022-07-09 07:58:53,154][26022] Updated weights on worker 0-0, policy_version 158312 (0.00087) [2022-07-09 07:58:53,455][25689] Fps is (10 sec: 5822.4, 60 sec: 5764.0, 300 sec: 5749.6). Total num frames: 162113536. Throughput: 0: 6089.2. Samples: 162111942. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:53,456][25689] Avg episode reward: [(0, '-51.870')] [2022-07-09 07:58:54,691][26022] Updated weights on worker 0-0, policy_version 158322 (0.00084) [2022-07-09 07:58:56,714][26022] Updated weights on worker 0-0, policy_version 158332 (0.00088) [2022-07-09 07:58:58,420][26022] Updated weights on worker 0-0, policy_version 158342 (0.00085) [2022-07-09 07:58:58,466][25689] Fps is (10 sec: 5618.6, 60 sec: 5764.1, 300 sec: 5747.3). Total num frames: 162142208. Throughput: 0: 6043.8. Samples: 162146286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:58:58,468][25689] Avg episode reward: [(0, '-51.584')] [2022-07-09 07:59:00,153][26022] Updated weights on worker 0-0, policy_version 158352 (0.00084) [2022-07-09 07:59:02,202][26022] Updated weights on worker 0-0, policy_version 158362 (0.00090) [2022-07-09 07:59:03,594][25689] Fps is (10 sec: 5453.8, 60 sec: 5748.5, 300 sec: 5745.2). Total num frames: 162168832. Throughput: 0: 5899.0. Samples: 162178756. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:03,596][25689] Avg episode reward: [(0, '-52.295')] [2022-07-09 07:59:04,134][26022] Updated weights on worker 0-0, policy_version 158372 (0.00088) [2022-07-09 07:59:05,742][26022] Updated weights on worker 0-0, policy_version 158382 (0.00086) [2022-07-09 07:59:07,804][26022] Updated weights on worker 0-0, policy_version 158392 (0.00087) [2022-07-09 07:59:08,663][25689] Fps is (10 sec: 5623.4, 60 sec: 5759.9, 300 sec: 5747.5). Total num frames: 162199552. Throughput: 0: 5873.5. Samples: 162196126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:08,665][25689] Avg episode reward: [(0, '-52.287')] [2022-07-09 07:59:09,144][26022] Updated weights on worker 0-0, policy_version 158402 (0.00090) [2022-07-09 07:59:11,265][26022] Updated weights on worker 0-0, policy_version 158412 (0.00085) [2022-07-09 07:59:12,759][26022] Updated weights on worker 0-0, policy_version 158422 (0.00094) [2022-07-09 07:59:13,719][25689] Fps is (10 sec: 5764.8, 60 sec: 5724.7, 300 sec: 5743.1). Total num frames: 162227200. Throughput: 0: 5875.8. Samples: 162231010. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:13,719][25689] Avg episode reward: [(0, '-51.560')] [2022-07-09 07:59:14,666][26022] Updated weights on worker 0-0, policy_version 158432 (0.00090) [2022-07-09 07:59:16,464][26022] Updated weights on worker 0-0, policy_version 158442 (0.00178) [2022-07-09 07:59:18,389][26022] Updated weights on worker 0-0, policy_version 158452 (0.00091) [2022-07-09 07:59:18,815][25689] Fps is (10 sec: 5548.2, 60 sec: 5718.4, 300 sec: 5742.1). Total num frames: 162255872. Throughput: 0: 5865.0. Samples: 162265630. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:18,816][25689] Avg episode reward: [(0, '-51.834')] [2022-07-09 07:59:20,029][26022] Updated weights on worker 0-0, policy_version 158462 (0.00089) [2022-07-09 07:59:21,939][26022] Updated weights on worker 0-0, policy_version 158472 (0.00090) [2022-07-09 07:59:23,367][26022] Updated weights on worker 0-0, policy_version 158482 (0.00085) [2022-07-09 07:59:23,893][25689] Fps is (10 sec: 5938.3, 60 sec: 5767.3, 300 sec: 5748.1). Total num frames: 162287616. Throughput: 0: 5125.4. Samples: 162282796. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:23,893][25689] Avg episode reward: [(0, '-52.190')] [2022-07-09 07:59:25,466][26022] Updated weights on worker 0-0, policy_version 158492 (0.00084) [2022-07-09 07:59:27,213][26022] Updated weights on worker 0-0, policy_version 158502 (0.00082) [2022-07-09 07:59:28,934][25689] Fps is (10 sec: 5869.3, 60 sec: 5731.7, 300 sec: 5744.4). Total num frames: 162315264. Throughput: 0: 5987.8. Samples: 162317500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:28,935][25689] Avg episode reward: [(0, '-52.064')] [2022-07-09 07:59:29,047][26022] Updated weights on worker 0-0, policy_version 158512 (0.00522) [2022-07-09 07:59:30,616][26022] Updated weights on worker 0-0, policy_version 158522 (0.00090) [2022-07-09 07:59:32,404][26022] Updated weights on worker 0-0, policy_version 158532 (0.00085) [2022-07-09 07:59:33,973][25689] Fps is (10 sec: 5688.6, 60 sec: 5745.1, 300 sec: 5751.3). Total num frames: 162344960. Throughput: 0: 5979.3. Samples: 162352118. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:33,974][25689] Avg episode reward: [(0, '-52.477')] [2022-07-09 07:59:34,278][26022] Updated weights on worker 0-0, policy_version 158542 (0.00087) [2022-07-09 07:59:36,212][26022] Updated weights on worker 0-0, policy_version 158552 (0.00088) [2022-07-09 07:59:37,734][26022] Updated weights on worker 0-0, policy_version 158562 (0.00093) [2022-07-09 07:59:38,976][25689] Fps is (10 sec: 5608.1, 60 sec: 5712.4, 300 sec: 5742.9). Total num frames: 162371584. Throughput: 0: 5146.0. Samples: 162369382. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:38,979][25689] Avg episode reward: [(0, '-53.019')] [2022-07-09 07:59:39,611][26022] Updated weights on worker 0-0, policy_version 158572 (0.00092) [2022-07-09 07:59:41,398][26022] Updated weights on worker 0-0, policy_version 158582 (0.00085) [2022-07-09 07:59:43,251][26022] Updated weights on worker 0-0, policy_version 158592 (0.00089) [2022-07-09 07:59:44,034][25689] Fps is (10 sec: 5801.7, 60 sec: 5744.9, 300 sec: 5745.4). Total num frames: 162403328. Throughput: 0: 6008.6. Samples: 162403816. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:44,034][25689] Avg episode reward: [(0, '-52.449')] [2022-07-09 07:59:45,066][26022] Updated weights on worker 0-0, policy_version 158602 (0.00087) [2022-07-09 07:59:46,661][26022] Updated weights on worker 0-0, policy_version 158612 (0.00083) [2022-07-09 07:59:48,490][26022] Updated weights on worker 0-0, policy_version 158622 (0.00097) [2022-07-09 07:59:49,090][25689] Fps is (10 sec: 5973.9, 60 sec: 5707.0, 300 sec: 5745.0). Total num frames: 162432000. Throughput: 0: 6001.3. Samples: 162438462. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:49,090][25689] Avg episode reward: [(0, '-52.504')] [2022-07-09 07:59:50,302][26022] Updated weights on worker 0-0, policy_version 158632 (0.00084) [2022-07-09 07:59:52,122][26022] Updated weights on worker 0-0, policy_version 158642 (0.00089) [2022-07-09 07:59:53,807][26022] Updated weights on worker 0-0, policy_version 158652 (0.00085) [2022-07-09 07:59:54,107][25689] Fps is (10 sec: 5692.7, 60 sec: 5723.4, 300 sec: 5745.7). Total num frames: 162460672. Throughput: 0: 5155.8. Samples: 162455924. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:54,107][25689] Avg episode reward: [(0, '-51.864')] [2022-07-09 07:59:55,671][26022] Updated weights on worker 0-0, policy_version 158662 (0.00084) [2022-07-09 07:59:57,314][26022] Updated weights on worker 0-0, policy_version 158672 (0.00611) [2022-07-09 07:59:59,129][25689] Fps is (10 sec: 5813.7, 60 sec: 5739.2, 300 sec: 5747.2). Total num frames: 162490368. Throughput: 0: 6026.1. Samples: 162490826. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 07:59:59,130][25689] Avg episode reward: [(0, '-51.024')] [2022-07-09 07:59:59,136][26022] Updated weights on worker 0-0, policy_version 158682 (0.00091) [2022-07-09 08:00:00,947][26022] Updated weights on worker 0-0, policy_version 158692 (0.00086) [2022-07-09 08:00:03,099][26022] Updated weights on worker 0-0, policy_version 158702 (0.00083) [2022-07-09 08:00:04,180][25689] Fps is (10 sec: 5591.1, 60 sec: 5746.5, 300 sec: 5739.6). Total num frames: 162516992. Throughput: 0: 5939.1. Samples: 162523468. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 08:00:04,180][25689] Avg episode reward: [(0, '-51.120')] [2022-07-09 08:00:04,794][26022] Updated weights on worker 0-0, policy_version 158712 (0.00085) [2022-07-09 08:00:06,523][26022] Updated weights on worker 0-0, policy_version 158722 (0.00432) [2022-07-09 08:00:08,566][26022] Updated weights on worker 0-0, policy_version 158732 (0.00087) [2022-07-09 08:00:09,203][25689] Fps is (10 sec: 5489.3, 60 sec: 5717.1, 300 sec: 5747.0). Total num frames: 162545664. Throughput: 0: 5083.2. Samples: 162540702. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 08:00:09,203][25689] Avg episode reward: [(0, '-51.335')] [2022-07-09 08:00:10,163][26022] Updated weights on worker 0-0, policy_version 158742 (0.00095) [2022-07-09 08:00:12,103][26022] Updated weights on worker 0-0, policy_version 158752 (0.00086) [2022-07-09 08:00:13,539][26022] Updated weights on worker 0-0, policy_version 158762 (0.00089) [2022-07-09 08:00:14,222][25689] Fps is (10 sec: 5710.5, 60 sec: 5737.5, 300 sec: 5740.1). Total num frames: 162574336. Throughput: 0: 5937.6. Samples: 162575358. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 08:00:14,223][25689] Avg episode reward: [(0, '-52.017')] [2022-07-09 08:00:15,591][26022] Updated weights on worker 0-0, policy_version 158772 (0.00089) [2022-07-09 08:00:17,230][26022] Updated weights on worker 0-0, policy_version 158782 (0.00096) [2022-07-09 08:00:18,998][26022] Updated weights on worker 0-0, policy_version 158792 (0.00087) [2022-07-09 08:00:19,225][25689] Fps is (10 sec: 5823.9, 60 sec: 5763.3, 300 sec: 5741.4). Total num frames: 162604032. Throughput: 0: 5951.4. Samples: 162610422. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:19,225][25689] Avg episode reward: [(0, '-51.873')] [2022-07-09 08:00:20,649][26022] Updated weights on worker 0-0, policy_version 158802 (0.00091) [2022-07-09 08:00:22,552][26022] Updated weights on worker 0-0, policy_version 158812 (0.00093) [2022-07-09 08:00:24,262][25689] Fps is (10 sec: 5813.6, 60 sec: 5716.3, 300 sec: 5741.5). Total num frames: 162632704. Throughput: 0: 5203.1. Samples: 162627954. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:24,262][25689] Avg episode reward: [(0, '-52.622')] [2022-07-09 08:00:24,351][26022] Updated weights on worker 0-0, policy_version 158822 (0.00435) [2022-07-09 08:00:26,194][26022] Updated weights on worker 0-0, policy_version 158832 (0.00091) [2022-07-09 08:00:27,781][26022] Updated weights on worker 0-0, policy_version 158842 (0.00089) [2022-07-09 08:00:29,265][25689] Fps is (10 sec: 5711.3, 60 sec: 5736.8, 300 sec: 5745.8). Total num frames: 162661376. Throughput: 0: 6070.4. Samples: 162662490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:29,267][25689] Avg episode reward: [(0, '-53.262')] [2022-07-09 08:00:29,640][26022] Updated weights on worker 0-0, policy_version 158852 (0.00092) [2022-07-09 08:00:31,535][26022] Updated weights on worker 0-0, policy_version 158862 (0.00096) [2022-07-09 08:00:33,154][26022] Updated weights on worker 0-0, policy_version 158872 (0.00086) [2022-07-09 08:00:34,291][25689] Fps is (10 sec: 5717.7, 60 sec: 5721.2, 300 sec: 5736.3). Total num frames: 162690048. Throughput: 0: 6058.8. Samples: 162696952. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:34,292][25689] Avg episode reward: [(0, '-52.857')] [2022-07-09 08:00:35,131][26022] Updated weights on worker 0-0, policy_version 158882 (0.00272) [2022-07-09 08:00:36,896][26022] Updated weights on worker 0-0, policy_version 158892 (0.00093) [2022-07-09 08:00:37,907][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:00:37,920][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000158899_162712576.pth [2022-07-09 08:00:37,921][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000156878_160643072.pth [2022-07-09 08:00:38,651][26022] Updated weights on worker 0-0, policy_version 158902 (0.00087) [2022-07-09 08:00:39,300][25689] Fps is (10 sec: 5816.7, 60 sec: 5771.6, 300 sec: 5743.7). Total num frames: 162719744. Throughput: 0: 5186.1. Samples: 162714532. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:39,301][25689] Avg episode reward: [(0, '-52.298')] [2022-07-09 08:00:40,275][26022] Updated weights on worker 0-0, policy_version 158912 (0.00592) [2022-07-09 08:00:42,003][26022] Updated weights on worker 0-0, policy_version 158922 (0.00070) [2022-07-09 08:00:43,796][26022] Updated weights on worker 0-0, policy_version 158932 (0.00342) [2022-07-09 08:00:44,347][25689] Fps is (10 sec: 6008.0, 60 sec: 5755.6, 300 sec: 5750.2). Total num frames: 162750464. Throughput: 0: 6060.5. Samples: 162749680. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:44,347][25689] Avg episode reward: [(0, '-52.982')] [2022-07-09 08:00:45,547][26022] Updated weights on worker 0-0, policy_version 158942 (0.00089) [2022-07-09 08:00:47,295][26022] Updated weights on worker 0-0, policy_version 158953 (0.00089) [2022-07-09 08:00:49,217][26022] Updated weights on worker 0-0, policy_version 158963 (0.00088) [2022-07-09 08:00:49,349][25689] Fps is (10 sec: 5808.4, 60 sec: 5743.7, 300 sec: 5746.9). Total num frames: 162778112. Throughput: 0: 6095.2. Samples: 162784902. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:49,349][25689] Avg episode reward: [(0, '-53.256')] [2022-07-09 08:00:50,729][26022] Updated weights on worker 0-0, policy_version 158973 (0.00049) [2022-07-09 08:00:52,672][26022] Updated weights on worker 0-0, policy_version 158983 (0.00081) [2022-07-09 08:00:54,318][26022] Updated weights on worker 0-0, policy_version 158993 (0.00089) [2022-07-09 08:00:54,439][25689] Fps is (10 sec: 5884.8, 60 sec: 5787.7, 300 sec: 5752.4). Total num frames: 162809856. Throughput: 0: 5240.4. Samples: 162802536. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:54,439][25689] Avg episode reward: [(0, '-52.358')] [2022-07-09 08:00:56,167][26022] Updated weights on worker 0-0, policy_version 159003 (0.00098) [2022-07-09 08:00:57,989][26022] Updated weights on worker 0-0, policy_version 159013 (0.00087) [2022-07-09 08:00:59,454][25689] Fps is (10 sec: 5877.1, 60 sec: 5754.4, 300 sec: 5753.0). Total num frames: 162837504. Throughput: 0: 6099.3. Samples: 162837462. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:00:59,455][25689] Avg episode reward: [(0, '-51.870')] [2022-07-09 08:00:59,784][26022] Updated weights on worker 0-0, policy_version 159023 (0.00086) [2022-07-09 08:01:01,425][26022] Updated weights on worker 0-0, policy_version 159033 (0.00082) [2022-07-09 08:01:03,834][26022] Updated weights on worker 0-0, policy_version 159043 (0.00084) [2022-07-09 08:01:04,494][25689] Fps is (10 sec: 5397.3, 60 sec: 5755.4, 300 sec: 5748.9). Total num frames: 162864128. Throughput: 0: 5959.9. Samples: 162869758. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:04,495][25689] Avg episode reward: [(0, '-52.054')] [2022-07-09 08:01:05,451][26022] Updated weights on worker 0-0, policy_version 159053 (0.00091) [2022-07-09 08:01:07,386][26022] Updated weights on worker 0-0, policy_version 159063 (0.00086) [2022-07-09 08:01:09,115][26022] Updated weights on worker 0-0, policy_version 159073 (0.00088) [2022-07-09 08:01:09,522][25689] Fps is (10 sec: 5594.2, 60 sec: 5772.0, 300 sec: 5749.3). Total num frames: 162893824. Throughput: 0: 5073.8. Samples: 162887258. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:09,522][25689] Avg episode reward: [(0, '-51.684')] [2022-07-09 08:01:10,674][26022] Updated weights on worker 0-0, policy_version 159083 (0.00081) [2022-07-09 08:01:12,483][26022] Updated weights on worker 0-0, policy_version 159093 (0.00514) [2022-07-09 08:01:14,423][26022] Updated weights on worker 0-0, policy_version 159103 (0.00082) [2022-07-09 08:01:14,539][25689] Fps is (10 sec: 5810.8, 60 sec: 5772.1, 300 sec: 5749.5). Total num frames: 162922496. Throughput: 0: 5951.7. Samples: 162922166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:14,541][25689] Avg episode reward: [(0, '-50.503')] [2022-07-09 08:01:15,998][26022] Updated weights on worker 0-0, policy_version 159113 (0.00092) [2022-07-09 08:01:17,723][26022] Updated weights on worker 0-0, policy_version 159123 (0.00080) [2022-07-09 08:01:19,461][26022] Updated weights on worker 0-0, policy_version 159133 (0.00085) [2022-07-09 08:01:19,553][25689] Fps is (10 sec: 5818.5, 60 sec: 5771.1, 300 sec: 5747.4). Total num frames: 162952192. Throughput: 0: 5959.2. Samples: 162957236. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:19,554][25689] Avg episode reward: [(0, '-51.266')] [2022-07-09 08:01:21,226][26022] Updated weights on worker 0-0, policy_version 159143 (0.00090) [2022-07-09 08:01:23,156][26022] Updated weights on worker 0-0, policy_version 159153 (0.00079) [2022-07-09 08:01:24,612][25689] Fps is (10 sec: 5895.9, 60 sec: 5785.9, 300 sec: 5757.9). Total num frames: 162981888. Throughput: 0: 5222.3. Samples: 162974820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:24,613][25689] Avg episode reward: [(0, '-51.565')] [2022-07-09 08:01:24,695][26022] Updated weights on worker 0-0, policy_version 159163 (0.00082) [2022-07-09 08:01:26,559][26022] Updated weights on worker 0-0, policy_version 159173 (0.00088) [2022-07-09 08:01:28,351][26022] Updated weights on worker 0-0, policy_version 159183 (0.00096) [2022-07-09 08:01:29,615][25689] Fps is (10 sec: 5801.1, 60 sec: 5786.0, 300 sec: 5751.1). Total num frames: 163010560. Throughput: 0: 6088.0. Samples: 163009584. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:29,615][25689] Avg episode reward: [(0, '-52.074')] [2022-07-09 08:01:30,067][26022] Updated weights on worker 0-0, policy_version 159193 (0.00086) [2022-07-09 08:01:31,875][26022] Updated weights on worker 0-0, policy_version 159203 (0.00085) [2022-07-09 08:01:33,619][26022] Updated weights on worker 0-0, policy_version 159213 (0.00086) [2022-07-09 08:01:34,659][25689] Fps is (10 sec: 5707.6, 60 sec: 5784.2, 300 sec: 5747.2). Total num frames: 163039232. Throughput: 0: 6070.5. Samples: 163044306. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:34,660][25689] Avg episode reward: [(0, '-53.355')] [2022-07-09 08:01:35,511][26022] Updated weights on worker 0-0, policy_version 159223 (0.00088) [2022-07-09 08:01:37,155][26022] Updated weights on worker 0-0, policy_version 159233 (0.00081) [2022-07-09 08:01:39,039][26022] Updated weights on worker 0-0, policy_version 159243 (0.00084) [2022-07-09 08:01:39,679][25689] Fps is (10 sec: 5697.7, 60 sec: 5766.2, 300 sec: 5744.7). Total num frames: 163067904. Throughput: 0: 5200.8. Samples: 163061908. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:39,680][25689] Avg episode reward: [(0, '-53.549')] [2022-07-09 08:01:40,707][26022] Updated weights on worker 0-0, policy_version 159253 (0.00086) [2022-07-09 08:01:42,331][26022] Updated weights on worker 0-0, policy_version 159263 (0.00057) [2022-07-09 08:01:44,136][26022] Updated weights on worker 0-0, policy_version 159273 (0.00093) [2022-07-09 08:01:44,769][25689] Fps is (10 sec: 5874.7, 60 sec: 5762.1, 300 sec: 5751.3). Total num frames: 163098624. Throughput: 0: 6060.0. Samples: 163096970. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:44,771][25689] Avg episode reward: [(0, '-52.776')] [2022-07-09 08:01:46,075][26022] Updated weights on worker 0-0, policy_version 159283 (0.00080) [2022-07-09 08:01:47,622][26022] Updated weights on worker 0-0, policy_version 159293 (0.00090) [2022-07-09 08:01:49,552][26022] Updated weights on worker 0-0, policy_version 159303 (0.00092) [2022-07-09 08:01:49,780][25689] Fps is (10 sec: 5880.1, 60 sec: 5778.2, 300 sec: 5751.6). Total num frames: 163127296. Throughput: 0: 6064.1. Samples: 163131866. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:49,782][25689] Avg episode reward: [(0, '-53.581')] [2022-07-09 08:01:51,107][26022] Updated weights on worker 0-0, policy_version 159313 (0.00079) [2022-07-09 08:01:53,119][26022] Updated weights on worker 0-0, policy_version 159323 (0.00085) [2022-07-09 08:01:54,662][26022] Updated weights on worker 0-0, policy_version 159333 (0.00053) [2022-07-09 08:01:54,790][25689] Fps is (10 sec: 5926.6, 60 sec: 5768.9, 300 sec: 5758.7). Total num frames: 163158016. Throughput: 0: 5231.4. Samples: 163149620. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:54,791][25689] Avg episode reward: [(0, '-52.452')] [2022-07-09 08:01:56,540][26022] Updated weights on worker 0-0, policy_version 159343 (0.00087) [2022-07-09 08:01:58,356][26022] Updated weights on worker 0-0, policy_version 159353 (0.00095) [2022-07-09 08:01:59,818][25689] Fps is (10 sec: 5916.5, 60 sec: 5784.6, 300 sec: 5759.4). Total num frames: 163186688. Throughput: 0: 6072.8. Samples: 163184208. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:01:59,819][25689] Avg episode reward: [(0, '-51.723')] [2022-07-09 08:02:00,035][26022] Updated weights on worker 0-0, policy_version 159363 (0.00079) [2022-07-09 08:02:02,166][26022] Updated weights on worker 0-0, policy_version 159373 (0.00085) [2022-07-09 08:02:03,975][26022] Updated weights on worker 0-0, policy_version 159383 (0.00533) [2022-07-09 08:02:04,857][25689] Fps is (10 sec: 5289.6, 60 sec: 5750.8, 300 sec: 5752.5). Total num frames: 163211264. Throughput: 0: 5949.5. Samples: 163216482. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:02:04,857][25689] Avg episode reward: [(0, '-51.120')] [2022-07-09 08:02:05,731][26022] Updated weights on worker 0-0, policy_version 159393 (0.00088) [2022-07-09 08:02:07,739][26022] Updated weights on worker 0-0, policy_version 159403 (0.00083) [2022-07-09 08:02:09,323][26022] Updated weights on worker 0-0, policy_version 159413 (0.00093) [2022-07-09 08:02:09,861][25689] Fps is (10 sec: 5505.8, 60 sec: 5770.0, 300 sec: 5749.0). Total num frames: 163241984. Throughput: 0: 5079.6. Samples: 163233872. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:02:09,862][25689] Avg episode reward: [(0, '-51.119')] [2022-07-09 08:02:11,167][26022] Updated weights on worker 0-0, policy_version 159423 (0.00087) [2022-07-09 08:02:12,880][26022] Updated weights on worker 0-0, policy_version 159433 (0.00283) [2022-07-09 08:02:14,554][26022] Updated weights on worker 0-0, policy_version 159443 (0.00091) [2022-07-09 08:02:14,912][25689] Fps is (10 sec: 5906.5, 60 sec: 5766.8, 300 sec: 5748.1). Total num frames: 163270656. Throughput: 0: 5930.4. Samples: 163268952. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:02:14,913][25689] Avg episode reward: [(0, '-51.692')] [2022-07-09 08:02:16,413][26022] Updated weights on worker 0-0, policy_version 159453 (0.00087) [2022-07-09 08:02:18,042][26022] Updated weights on worker 0-0, policy_version 159463 (0.00086) [2022-07-09 08:02:19,834][26022] Updated weights on worker 0-0, policy_version 159473 (0.00088) [2022-07-09 08:02:19,959][25689] Fps is (10 sec: 5882.1, 60 sec: 5780.7, 300 sec: 5753.4). Total num frames: 163301376. Throughput: 0: 5942.5. Samples: 163303892. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:02:19,959][25689] Avg episode reward: [(0, '-51.042')] [2022-07-09 08:02:21,737][26022] Updated weights on worker 0-0, policy_version 159483 (0.00084) [2022-07-09 08:02:23,414][26022] Updated weights on worker 0-0, policy_version 159493 (0.00086) [2022-07-09 08:02:25,000][25689] Fps is (10 sec: 5786.0, 60 sec: 5748.4, 300 sec: 5749.3). Total num frames: 163329024. Throughput: 0: 6065.9. Samples: 163338670. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 08:02:25,001][25689] Avg episode reward: [(0, '-51.448')] [2022-07-09 08:02:25,273][26022] Updated weights on worker 0-0, policy_version 159503 (0.00084) [2022-07-09 08:02:27,069][26022] Updated weights on worker 0-0, policy_version 159513 (0.00081) [2022-07-09 08:02:28,568][26022] Updated weights on worker 0-0, policy_version 159523 (0.00095) [2022-07-09 08:02:30,005][25689] Fps is (10 sec: 5606.0, 60 sec: 5748.2, 300 sec: 5749.8). Total num frames: 163357696. Throughput: 0: 6072.0. Samples: 163356182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:02:30,006][25689] Avg episode reward: [(0, '-51.693')] [2022-07-09 08:02:30,515][26022] Updated weights on worker 0-0, policy_version 159533 (0.00092) [2022-07-09 08:02:32,214][26022] Updated weights on worker 0-0, policy_version 159543 (0.00081) [2022-07-09 08:02:34,025][26022] Updated weights on worker 0-0, policy_version 159553 (0.00087) [2022-07-09 08:02:35,031][25689] Fps is (10 sec: 5920.8, 60 sec: 5783.9, 300 sec: 5753.2). Total num frames: 163388416. Throughput: 0: 6074.1. Samples: 163391156. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:02:35,032][25689] Avg episode reward: [(0, '-51.651')] [2022-07-09 08:02:35,792][26022] Updated weights on worker 0-0, policy_version 159563 (0.00090) [2022-07-09 08:02:37,471][26022] Updated weights on worker 0-0, policy_version 159573 (0.00083) [2022-07-09 08:02:37,943][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:02:37,955][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000159576_163405824.pth [2022-07-09 08:02:37,956][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000157552_161333248.pth [2022-07-09 08:02:39,239][26022] Updated weights on worker 0-0, policy_version 159583 (0.00090) [2022-07-09 08:02:40,033][25689] Fps is (10 sec: 5820.5, 60 sec: 5768.7, 300 sec: 5754.1). Total num frames: 163416064. Throughput: 0: 6090.0. Samples: 163426144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:02:40,035][25689] Avg episode reward: [(0, '-52.181')] [2022-07-09 08:02:41,175][26022] Updated weights on worker 0-0, policy_version 159593 (0.00087) [2022-07-09 08:02:42,855][26022] Updated weights on worker 0-0, policy_version 159603 (0.00096) [2022-07-09 08:02:44,558][26022] Updated weights on worker 0-0, policy_version 159613 (0.00089) [2022-07-09 08:02:45,067][25689] Fps is (10 sec: 5816.2, 60 sec: 5774.0, 300 sec: 5757.2). Total num frames: 163446784. Throughput: 0: 5218.8. Samples: 163443392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:02:45,069][25689] Avg episode reward: [(0, '-51.525')] [2022-07-09 08:02:46,370][26022] Updated weights on worker 0-0, policy_version 159623 (0.00101) [2022-07-09 08:02:48,182][26022] Updated weights on worker 0-0, policy_version 159633 (0.00082) [2022-07-09 08:02:49,876][26022] Updated weights on worker 0-0, policy_version 159643 (0.00086) [2022-07-09 08:02:50,136][25689] Fps is (10 sec: 5979.9, 60 sec: 5785.4, 300 sec: 5759.7). Total num frames: 163476480. Throughput: 0: 6066.8. Samples: 163478314. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:02:50,137][25689] Avg episode reward: [(0, '-51.594')] [2022-07-09 08:02:51,459][26022] Updated weights on worker 0-0, policy_version 159653 (0.00085) [2022-07-09 08:02:53,331][26022] Updated weights on worker 0-0, policy_version 159663 (0.00089) [2022-07-09 08:02:55,189][25689] Fps is (10 sec: 5766.1, 60 sec: 5747.4, 300 sec: 5758.9). Total num frames: 163505152. Throughput: 0: 6068.1. Samples: 163513478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:02:55,190][25689] Avg episode reward: [(0, '-52.808')] [2022-07-09 08:02:55,191][26022] Updated weights on worker 0-0, policy_version 159673 (0.00081) [2022-07-09 08:02:56,845][26022] Updated weights on worker 0-0, policy_version 159683 (0.00049) [2022-07-09 08:02:58,619][26022] Updated weights on worker 0-0, policy_version 159693 (0.00100) [2022-07-09 08:03:00,238][25689] Fps is (10 sec: 5777.9, 60 sec: 5762.4, 300 sec: 5767.6). Total num frames: 163534848. Throughput: 0: 5190.2. Samples: 163531012. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:00,239][25689] Avg episode reward: [(0, '-52.257')] [2022-07-09 08:03:00,308][26022] Updated weights on worker 0-0, policy_version 159703 (0.00090) [2022-07-09 08:03:02,648][26022] Updated weights on worker 0-0, policy_version 159713 (0.00097) [2022-07-09 08:03:04,451][26022] Updated weights on worker 0-0, policy_version 159723 (0.00085) [2022-07-09 08:03:05,327][25689] Fps is (10 sec: 5454.4, 60 sec: 5774.5, 300 sec: 5752.3). Total num frames: 163560448. Throughput: 0: 5921.2. Samples: 163563358. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:05,328][25689] Avg episode reward: [(0, '-52.180')] [2022-07-09 08:03:06,043][26022] Updated weights on worker 0-0, policy_version 159733 (0.00077) [2022-07-09 08:03:07,947][26022] Updated weights on worker 0-0, policy_version 159743 (0.00084) [2022-07-09 08:03:09,592][26022] Updated weights on worker 0-0, policy_version 159753 (0.00086) [2022-07-09 08:03:10,334][25689] Fps is (10 sec: 5679.9, 60 sec: 5791.2, 300 sec: 5759.8). Total num frames: 163592192. Throughput: 0: 5949.8. Samples: 163598488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:10,334][25689] Avg episode reward: [(0, '-52.663')] [2022-07-09 08:03:11,468][26022] Updated weights on worker 0-0, policy_version 159763 (0.00089) [2022-07-09 08:03:13,207][26022] Updated weights on worker 0-0, policy_version 159773 (0.00081) [2022-07-09 08:03:14,952][26022] Updated weights on worker 0-0, policy_version 159783 (0.00086) [2022-07-09 08:03:15,341][25689] Fps is (10 sec: 6033.2, 60 sec: 5795.4, 300 sec: 5760.3). Total num frames: 163620864. Throughput: 0: 5089.6. Samples: 163616044. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:15,342][25689] Avg episode reward: [(0, '-52.701')] [2022-07-09 08:03:16,863][26022] Updated weights on worker 0-0, policy_version 159793 (0.00082) [2022-07-09 08:03:18,352][26022] Updated weights on worker 0-0, policy_version 159803 (0.00090) [2022-07-09 08:03:20,196][26022] Updated weights on worker 0-0, policy_version 159813 (0.00087) [2022-07-09 08:03:20,360][25689] Fps is (10 sec: 5617.1, 60 sec: 5747.1, 300 sec: 5757.6). Total num frames: 163648512. Throughput: 0: 5953.8. Samples: 163650816. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:20,361][25689] Avg episode reward: [(0, '-52.114')] [2022-07-09 08:03:21,970][26022] Updated weights on worker 0-0, policy_version 159823 (0.00402) [2022-07-09 08:03:23,759][26022] Updated weights on worker 0-0, policy_version 159833 (0.00084) [2022-07-09 08:03:25,463][25689] Fps is (10 sec: 5665.5, 60 sec: 5775.2, 300 sec: 5756.0). Total num frames: 163678208. Throughput: 0: 6093.1. Samples: 163686046. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:25,463][25689] Avg episode reward: [(0, '-52.039')] [2022-07-09 08:03:25,576][26022] Updated weights on worker 0-0, policy_version 159843 (0.00080) [2022-07-09 08:03:27,296][26022] Updated weights on worker 0-0, policy_version 159853 (0.00084) [2022-07-09 08:03:29,123][26022] Updated weights on worker 0-0, policy_version 159863 (0.00087) [2022-07-09 08:03:30,481][25689] Fps is (10 sec: 5868.7, 60 sec: 5790.9, 300 sec: 5759.2). Total num frames: 163707904. Throughput: 0: 5195.7. Samples: 163703164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:30,481][25689] Avg episode reward: [(0, '-52.297')] [2022-07-09 08:03:30,903][26022] Updated weights on worker 0-0, policy_version 159873 (0.00086) [2022-07-09 08:03:32,646][26022] Updated weights on worker 0-0, policy_version 159883 (0.00097) [2022-07-09 08:03:34,453][26022] Updated weights on worker 0-0, policy_version 159893 (0.00084) [2022-07-09 08:03:35,528][25689] Fps is (10 sec: 5900.5, 60 sec: 5771.9, 300 sec: 5762.0). Total num frames: 163737600. Throughput: 0: 6030.6. Samples: 163737786. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:35,529][25689] Avg episode reward: [(0, '-53.106')] [2022-07-09 08:03:36,152][26022] Updated weights on worker 0-0, policy_version 159903 (0.00099) [2022-07-09 08:03:38,030][26022] Updated weights on worker 0-0, policy_version 159913 (0.00084) [2022-07-09 08:03:39,699][26022] Updated weights on worker 0-0, policy_version 159923 (0.00085) [2022-07-09 08:03:40,530][25689] Fps is (10 sec: 5808.0, 60 sec: 5788.8, 300 sec: 5759.3). Total num frames: 163766272. Throughput: 0: 6041.2. Samples: 163772664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:40,531][25689] Avg episode reward: [(0, '-52.767')] [2022-07-09 08:03:41,517][26022] Updated weights on worker 0-0, policy_version 159933 (0.00081) [2022-07-09 08:03:43,289][26022] Updated weights on worker 0-0, policy_version 159943 (0.00094) [2022-07-09 08:03:44,928][26022] Updated weights on worker 0-0, policy_version 159953 (0.00082) [2022-07-09 08:03:45,645][25689] Fps is (10 sec: 5668.4, 60 sec: 5747.3, 300 sec: 5750.5). Total num frames: 163794944. Throughput: 0: 5161.7. Samples: 163790220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:45,646][25689] Avg episode reward: [(0, '-53.714')] [2022-07-09 08:03:46,696][26022] Updated weights on worker 0-0, policy_version 159963 (0.00088) [2022-07-09 08:03:48,555][26022] Updated weights on worker 0-0, policy_version 159973 (0.00493) [2022-07-09 08:03:50,125][26022] Updated weights on worker 0-0, policy_version 159983 (0.00082) [2022-07-09 08:03:50,711][25689] Fps is (10 sec: 5833.9, 60 sec: 5764.5, 300 sec: 5759.8). Total num frames: 163825664. Throughput: 0: 6030.8. Samples: 163825168. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:50,711][25689] Avg episode reward: [(0, '-53.475')] [2022-07-09 08:03:52,243][26022] Updated weights on worker 0-0, policy_version 159993 (0.00087) [2022-07-09 08:03:53,759][26022] Updated weights on worker 0-0, policy_version 160003 (0.00089) [2022-07-09 08:03:55,740][25689] Fps is (10 sec: 5782.0, 60 sec: 5749.9, 300 sec: 5756.0). Total num frames: 163853312. Throughput: 0: 6021.0. Samples: 163859478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:03:55,740][25689] Avg episode reward: [(0, '-53.744')] [2022-07-09 08:03:55,742][26022] Updated weights on worker 0-0, policy_version 160013 (0.00090) [2022-07-09 08:03:57,435][26022] Updated weights on worker 0-0, policy_version 160023 (0.00081) [2022-07-09 08:03:59,138][26022] Updated weights on worker 0-0, policy_version 160033 (0.00081) [2022-07-09 08:04:00,798][25689] Fps is (10 sec: 5583.0, 60 sec: 5732.1, 300 sec: 5764.2). Total num frames: 163881984. Throughput: 0: 5140.6. Samples: 163876856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:00,799][25689] Avg episode reward: [(0, '-52.925')] [2022-07-09 08:04:00,975][26022] Updated weights on worker 0-0, policy_version 160043 (0.00085) [2022-07-09 08:04:03,161][26022] Updated weights on worker 0-0, policy_version 160053 (0.00090) [2022-07-09 08:04:04,710][26022] Updated weights on worker 0-0, policy_version 160063 (0.00087) [2022-07-09 08:04:05,874][25689] Fps is (10 sec: 5557.4, 60 sec: 5767.2, 300 sec: 5753.8). Total num frames: 163909632. Throughput: 0: 5892.0. Samples: 163909410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:05,874][25689] Avg episode reward: [(0, '-52.359')] [2022-07-09 08:04:06,598][26022] Updated weights on worker 0-0, policy_version 160073 (0.00079) [2022-07-09 08:04:08,205][26022] Updated weights on worker 0-0, policy_version 160083 (0.00083) [2022-07-09 08:04:10,199][26022] Updated weights on worker 0-0, policy_version 160093 (0.00085) [2022-07-09 08:04:10,966][25689] Fps is (10 sec: 5740.7, 60 sec: 5742.2, 300 sec: 5763.4). Total num frames: 163940352. Throughput: 0: 5883.3. Samples: 163944336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:10,967][25689] Avg episode reward: [(0, '-51.914')] [2022-07-09 08:04:11,888][26022] Updated weights on worker 0-0, policy_version 160103 (0.00085) [2022-07-09 08:04:13,679][26022] Updated weights on worker 0-0, policy_version 160113 (0.00089) [2022-07-09 08:04:15,513][26022] Updated weights on worker 0-0, policy_version 160123 (0.00050) [2022-07-09 08:04:16,002][25689] Fps is (10 sec: 5762.7, 60 sec: 5722.5, 300 sec: 5761.1). Total num frames: 163968000. Throughput: 0: 5043.1. Samples: 163961666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:16,003][25689] Avg episode reward: [(0, '-52.052')] [2022-07-09 08:04:17,390][26022] Updated weights on worker 0-0, policy_version 160133 (0.00089) [2022-07-09 08:04:19,091][26022] Updated weights on worker 0-0, policy_version 160143 (0.00086) [2022-07-09 08:04:20,919][26022] Updated weights on worker 0-0, policy_version 160153 (0.00080) [2022-07-09 08:04:21,016][25689] Fps is (10 sec: 5604.0, 60 sec: 5740.0, 300 sec: 5752.0). Total num frames: 163996672. Throughput: 0: 5907.9. Samples: 163996298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:21,016][25689] Avg episode reward: [(0, '-51.230')] [2022-07-09 08:04:22,511][26022] Updated weights on worker 0-0, policy_version 160163 (0.00085) [2022-07-09 08:04:24,447][26022] Updated weights on worker 0-0, policy_version 160173 (0.00078) [2022-07-09 08:04:26,062][25689] Fps is (10 sec: 5802.0, 60 sec: 5745.3, 300 sec: 5758.8). Total num frames: 164026368. Throughput: 0: 6020.9. Samples: 164030962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:26,063][25689] Avg episode reward: [(0, '-51.429')] [2022-07-09 08:04:26,191][26022] Updated weights on worker 0-0, policy_version 160183 (0.00080) [2022-07-09 08:04:27,910][26022] Updated weights on worker 0-0, policy_version 160193 (0.00423) [2022-07-09 08:04:29,646][26022] Updated weights on worker 0-0, policy_version 160203 (0.00094) [2022-07-09 08:04:31,079][25689] Fps is (10 sec: 5800.3, 60 sec: 5728.5, 300 sec: 5755.8). Total num frames: 164055040. Throughput: 0: 5173.4. Samples: 164048388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 08:04:31,079][25689] Avg episode reward: [(0, '-51.548')] [2022-07-09 08:04:31,487][26022] Updated weights on worker 0-0, policy_version 160213 (0.00099) [2022-07-09 08:04:33,130][26022] Updated weights on worker 0-0, policy_version 160223 (0.00088) [2022-07-09 08:04:35,051][26022] Updated weights on worker 0-0, policy_version 160233 (0.00089) [2022-07-09 08:04:36,103][25689] Fps is (10 sec: 5813.4, 60 sec: 5730.8, 300 sec: 5765.7). Total num frames: 164084736. Throughput: 0: 6039.3. Samples: 164083058. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:04:36,103][25689] Avg episode reward: [(0, '-52.026')] [2022-07-09 08:04:36,483][26022] Updated weights on worker 0-0, policy_version 160243 (0.00089) [2022-07-09 08:04:37,983][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:04:37,994][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000160249_164094976.pth [2022-07-09 08:04:37,995][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000158226_162023424.pth [2022-07-09 08:04:38,655][26022] Updated weights on worker 0-0, policy_version 160253 (0.00085) [2022-07-09 08:04:40,308][26022] Updated weights on worker 0-0, policy_version 160263 (0.00085) [2022-07-09 08:04:41,165][25689] Fps is (10 sec: 5685.2, 60 sec: 5708.1, 300 sec: 5751.8). Total num frames: 164112384. Throughput: 0: 6013.8. Samples: 164117474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:04:41,166][25689] Avg episode reward: [(0, '-51.613')] [2022-07-09 08:04:42,255][26022] Updated weights on worker 0-0, policy_version 160273 (0.00086) [2022-07-09 08:04:44,052][26022] Updated weights on worker 0-0, policy_version 160283 (0.00788) [2022-07-09 08:04:45,713][26022] Updated weights on worker 0-0, policy_version 160293 (0.00086) [2022-07-09 08:04:46,226][25689] Fps is (10 sec: 5664.3, 60 sec: 5730.1, 300 sec: 5755.2). Total num frames: 164142080. Throughput: 0: 5994.6. Samples: 164151838. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:04:46,227][25689] Avg episode reward: [(0, '-51.323')] [2022-07-09 08:04:47,676][26022] Updated weights on worker 0-0, policy_version 160303 (0.00086) [2022-07-09 08:04:49,249][26022] Updated weights on worker 0-0, policy_version 160313 (0.00085) [2022-07-09 08:04:50,927][26022] Updated weights on worker 0-0, policy_version 160323 (0.00087) [2022-07-09 08:04:51,243][25689] Fps is (10 sec: 5995.0, 60 sec: 5734.7, 300 sec: 5762.1). Total num frames: 164172800. Throughput: 0: 6003.4. Samples: 164169442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:04:51,244][25689] Avg episode reward: [(0, '-51.092')] [2022-07-09 08:04:52,907][26022] Updated weights on worker 0-0, policy_version 160333 (0.00091) [2022-07-09 08:04:54,550][26022] Updated weights on worker 0-0, policy_version 160343 (0.00095) [2022-07-09 08:04:56,248][25689] Fps is (10 sec: 5722.0, 60 sec: 5720.1, 300 sec: 5752.1). Total num frames: 164199424. Throughput: 0: 6019.4. Samples: 164204322. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:04:56,249][25689] Avg episode reward: [(0, '-51.187')] [2022-07-09 08:04:56,404][26022] Updated weights on worker 0-0, policy_version 160353 (0.00049) [2022-07-09 08:04:58,223][26022] Updated weights on worker 0-0, policy_version 160363 (0.00086) [2022-07-09 08:04:59,831][26022] Updated weights on worker 0-0, policy_version 160373 (0.00100) [2022-07-09 08:05:01,286][25689] Fps is (10 sec: 5608.1, 60 sec: 5739.0, 300 sec: 5762.7). Total num frames: 164229120. Throughput: 0: 6036.7. Samples: 164238934. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:01,287][25689] Avg episode reward: [(0, '-51.461')] [2022-07-09 08:05:02,239][26022] Updated weights on worker 0-0, policy_version 160383 (0.00341) [2022-07-09 08:05:03,834][26022] Updated weights on worker 0-0, policy_version 160393 (0.00087) [2022-07-09 08:05:05,627][26022] Updated weights on worker 0-0, policy_version 160403 (0.00059) [2022-07-09 08:05:06,416][25689] Fps is (10 sec: 5639.6, 60 sec: 5733.8, 300 sec: 5757.2). Total num frames: 164256768. Throughput: 0: 5062.4. Samples: 164254046. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:06,417][25689] Avg episode reward: [(0, '-51.425')] [2022-07-09 08:05:07,527][26022] Updated weights on worker 0-0, policy_version 160413 (0.00083) [2022-07-09 08:05:09,261][26022] Updated weights on worker 0-0, policy_version 160423 (0.00084) [2022-07-09 08:05:11,070][26022] Updated weights on worker 0-0, policy_version 160433 (0.00086) [2022-07-09 08:05:11,486][25689] Fps is (10 sec: 5521.3, 60 sec: 5702.0, 300 sec: 5756.2). Total num frames: 164285440. Throughput: 0: 5883.2. Samples: 164288538. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:11,487][25689] Avg episode reward: [(0, '-52.763')] [2022-07-09 08:05:12,721][26022] Updated weights on worker 0-0, policy_version 160443 (0.00081) [2022-07-09 08:05:14,651][26022] Updated weights on worker 0-0, policy_version 160453 (0.00085) [2022-07-09 08:05:16,201][26022] Updated weights on worker 0-0, policy_version 160463 (0.00079) [2022-07-09 08:05:16,535][25689] Fps is (10 sec: 5869.2, 60 sec: 5751.6, 300 sec: 5758.7). Total num frames: 164316160. Throughput: 0: 5867.6. Samples: 164323360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:16,536][25689] Avg episode reward: [(0, '-52.859')] [2022-07-09 08:05:18,151][26022] Updated weights on worker 0-0, policy_version 160473 (0.00088) [2022-07-09 08:05:19,838][26022] Updated weights on worker 0-0, policy_version 160483 (0.00081) [2022-07-09 08:05:21,581][25689] Fps is (10 sec: 5883.3, 60 sec: 5748.5, 300 sec: 5758.5). Total num frames: 164344832. Throughput: 0: 5023.9. Samples: 164340896. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:21,582][25689] Avg episode reward: [(0, '-52.888')] [2022-07-09 08:05:21,583][26022] Updated weights on worker 0-0, policy_version 160493 (0.00092) [2022-07-09 08:05:23,360][26022] Updated weights on worker 0-0, policy_version 160503 (0.00088) [2022-07-09 08:05:24,974][26022] Updated weights on worker 0-0, policy_version 160513 (0.00085) [2022-07-09 08:05:26,691][25689] Fps is (10 sec: 5646.5, 60 sec: 5725.6, 300 sec: 5756.5). Total num frames: 164373504. Throughput: 0: 6019.0. Samples: 164376080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:26,692][25689] Avg episode reward: [(0, '-52.517')] [2022-07-09 08:05:26,951][26022] Updated weights on worker 0-0, policy_version 160523 (0.00095) [2022-07-09 08:05:28,483][26022] Updated weights on worker 0-0, policy_version 160533 (0.00085) [2022-07-09 08:05:30,549][26022] Updated weights on worker 0-0, policy_version 160543 (0.01039) [2022-07-09 08:05:31,786][25689] Fps is (10 sec: 5820.2, 60 sec: 5752.0, 300 sec: 5762.0). Total num frames: 164404224. Throughput: 0: 6019.2. Samples: 164410724. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:31,786][25689] Avg episode reward: [(0, '-52.648')] [2022-07-09 08:05:32,179][26022] Updated weights on worker 0-0, policy_version 160553 (0.00085) [2022-07-09 08:05:34,037][26022] Updated weights on worker 0-0, policy_version 160563 (0.00091) [2022-07-09 08:05:35,713][26022] Updated weights on worker 0-0, policy_version 160573 (0.00085) [2022-07-09 08:05:36,791][25689] Fps is (10 sec: 5880.5, 60 sec: 5736.9, 300 sec: 5758.7). Total num frames: 164432896. Throughput: 0: 5163.4. Samples: 164427948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:36,791][25689] Avg episode reward: [(0, '-52.096')] [2022-07-09 08:05:37,505][26022] Updated weights on worker 0-0, policy_version 160583 (0.00085) [2022-07-09 08:05:39,199][26022] Updated weights on worker 0-0, policy_version 160593 (0.00083) [2022-07-09 08:05:41,191][26022] Updated weights on worker 0-0, policy_version 160603 (0.00099) [2022-07-09 08:05:41,798][25689] Fps is (10 sec: 5727.7, 60 sec: 5759.1, 300 sec: 5752.6). Total num frames: 164461568. Throughput: 0: 6033.2. Samples: 164462864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:41,798][25689] Avg episode reward: [(0, '-50.668')] [2022-07-09 08:05:42,631][26022] Updated weights on worker 0-0, policy_version 160613 (0.00090) [2022-07-09 08:05:44,668][26022] Updated weights on worker 0-0, policy_version 160623 (0.00080) [2022-07-09 08:05:46,151][26022] Updated weights on worker 0-0, policy_version 160633 (0.00090) [2022-07-09 08:05:46,876][25689] Fps is (10 sec: 5787.6, 60 sec: 5757.4, 300 sec: 5758.0). Total num frames: 164491264. Throughput: 0: 6023.1. Samples: 164497656. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:46,876][25689] Avg episode reward: [(0, '-50.278')] [2022-07-09 08:05:48,028][26022] Updated weights on worker 0-0, policy_version 160643 (0.00087) [2022-07-09 08:05:49,722][26022] Updated weights on worker 0-0, policy_version 160653 (0.00087) [2022-07-09 08:05:51,640][26022] Updated weights on worker 0-0, policy_version 160663 (0.00085) [2022-07-09 08:05:51,971][25689] Fps is (10 sec: 5737.3, 60 sec: 5716.3, 300 sec: 5747.5). Total num frames: 164519936. Throughput: 0: 5171.0. Samples: 164515102. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:51,972][25689] Avg episode reward: [(0, '-50.661')] [2022-07-09 08:05:53,299][26022] Updated weights on worker 0-0, policy_version 160673 (0.00081) [2022-07-09 08:05:55,252][26022] Updated weights on worker 0-0, policy_version 160683 (0.00092) [2022-07-09 08:05:56,741][26022] Updated weights on worker 0-0, policy_version 160693 (0.00083) [2022-07-09 08:05:56,979][25689] Fps is (10 sec: 5878.7, 60 sec: 5783.4, 300 sec: 5758.0). Total num frames: 164550656. Throughput: 0: 6048.6. Samples: 164550056. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:05:56,979][25689] Avg episode reward: [(0, '-50.391')] [2022-07-09 08:05:58,643][26022] Updated weights on worker 0-0, policy_version 160703 (0.00087) [2022-07-09 08:06:00,295][26022] Updated weights on worker 0-0, policy_version 160713 (0.00088) [2022-07-09 08:06:01,993][25689] Fps is (10 sec: 5926.2, 60 sec: 5768.8, 300 sec: 5765.4). Total num frames: 164579328. Throughput: 0: 6052.4. Samples: 164585094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:01,994][25689] Avg episode reward: [(0, '-50.830')] [2022-07-09 08:06:02,199][26022] Updated weights on worker 0-0, policy_version 160723 (0.00087) [2022-07-09 08:06:04,327][26022] Updated weights on worker 0-0, policy_version 160733 (0.00089) [2022-07-09 08:06:05,944][26022] Updated weights on worker 0-0, policy_version 160743 (0.00089) [2022-07-09 08:06:07,074][25689] Fps is (10 sec: 5376.2, 60 sec: 5739.8, 300 sec: 5750.6). Total num frames: 164604928. Throughput: 0: 5066.9. Samples: 164599996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:07,074][25689] Avg episode reward: [(0, '-51.069')] [2022-07-09 08:06:07,865][26022] Updated weights on worker 0-0, policy_version 160753 (0.00084) [2022-07-09 08:06:09,650][26022] Updated weights on worker 0-0, policy_version 160763 (0.00089) [2022-07-09 08:06:11,179][26022] Updated weights on worker 0-0, policy_version 160773 (0.00079) [2022-07-09 08:06:12,132][25689] Fps is (10 sec: 5453.6, 60 sec: 5757.7, 300 sec: 5753.2). Total num frames: 164634624. Throughput: 0: 5953.0. Samples: 164635122. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:12,133][25689] Avg episode reward: [(0, '-51.979')] [2022-07-09 08:06:13,083][26022] Updated weights on worker 0-0, policy_version 160783 (0.00083) [2022-07-09 08:06:15,070][26022] Updated weights on worker 0-0, policy_version 160793 (0.00093) [2022-07-09 08:06:16,466][26022] Updated weights on worker 0-0, policy_version 160803 (0.00087) [2022-07-09 08:06:17,137][25689] Fps is (10 sec: 6105.4, 60 sec: 5778.9, 300 sec: 5760.3). Total num frames: 164666368. Throughput: 0: 5960.3. Samples: 164670204. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:17,137][25689] Avg episode reward: [(0, '-52.488')] [2022-07-09 08:06:18,521][26022] Updated weights on worker 0-0, policy_version 160813 (0.00082) [2022-07-09 08:06:20,043][26022] Updated weights on worker 0-0, policy_version 160823 (0.00080) [2022-07-09 08:06:21,823][26022] Updated weights on worker 0-0, policy_version 160833 (0.00083) [2022-07-09 08:06:22,139][25689] Fps is (10 sec: 5935.4, 60 sec: 5766.2, 300 sec: 5754.5). Total num frames: 164694016. Throughput: 0: 5097.0. Samples: 164687778. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:22,139][25689] Avg episode reward: [(0, '-52.697')] [2022-07-09 08:06:23,720][26022] Updated weights on worker 0-0, policy_version 160843 (0.00089) [2022-07-09 08:06:25,321][26022] Updated weights on worker 0-0, policy_version 160853 (0.00089) [2022-07-09 08:06:27,206][25689] Fps is (10 sec: 5593.4, 60 sec: 5770.2, 300 sec: 5753.3). Total num frames: 164722688. Throughput: 0: 6087.1. Samples: 164722542. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:27,206][25689] Avg episode reward: [(0, '-52.370')] [2022-07-09 08:06:27,258][26022] Updated weights on worker 0-0, policy_version 160863 (0.00085) [2022-07-09 08:06:29,242][26022] Updated weights on worker 0-0, policy_version 160873 (0.00085) [2022-07-09 08:06:30,793][26022] Updated weights on worker 0-0, policy_version 160883 (0.00092) [2022-07-09 08:06:32,208][25689] Fps is (10 sec: 5694.7, 60 sec: 5745.1, 300 sec: 5754.1). Total num frames: 164751360. Throughput: 0: 6082.0. Samples: 164757224. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:32,211][25689] Avg episode reward: [(0, '-52.077')] [2022-07-09 08:06:32,662][26022] Updated weights on worker 0-0, policy_version 160893 (0.00095) [2022-07-09 08:06:34,461][26022] Updated weights on worker 0-0, policy_version 160903 (0.00613) [2022-07-09 08:06:36,199][26022] Updated weights on worker 0-0, policy_version 160913 (0.00088) [2022-07-09 08:06:37,217][25689] Fps is (10 sec: 5830.4, 60 sec: 5761.8, 300 sec: 5757.8). Total num frames: 164781056. Throughput: 0: 5186.1. Samples: 164774342. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:37,217][25689] Avg episode reward: [(0, '-52.893')] [2022-07-09 08:06:37,950][26022] Updated weights on worker 0-0, policy_version 160923 (0.00085) [2022-07-09 08:06:38,173][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:06:38,188][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000160924_164786176.pth [2022-07-09 08:06:38,188][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000158899_162712576.pth [2022-07-09 08:06:39,719][26022] Updated weights on worker 0-0, policy_version 160933 (0.00085) [2022-07-09 08:06:41,456][26022] Updated weights on worker 0-0, policy_version 160943 (0.00083) [2022-07-09 08:06:42,233][25689] Fps is (10 sec: 5924.7, 60 sec: 5777.8, 300 sec: 5755.8). Total num frames: 164810752. Throughput: 0: 6033.6. Samples: 164809018. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 08:06:42,234][25689] Avg episode reward: [(0, '-52.895')] [2022-07-09 08:06:43,107][26022] Updated weights on worker 0-0, policy_version 160953 (0.00087) [2022-07-09 08:06:44,887][26022] Updated weights on worker 0-0, policy_version 160963 (0.00085) [2022-07-09 08:06:46,851][26022] Updated weights on worker 0-0, policy_version 160973 (0.00093) [2022-07-09 08:06:47,359][25689] Fps is (10 sec: 5755.2, 60 sec: 5756.4, 300 sec: 5753.5). Total num frames: 164839424. Throughput: 0: 6027.2. Samples: 164844008. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:06:47,359][25689] Avg episode reward: [(0, '-53.654')] [2022-07-09 08:06:48,442][26022] Updated weights on worker 0-0, policy_version 160983 (0.00087) [2022-07-09 08:06:50,299][26022] Updated weights on worker 0-0, policy_version 160993 (0.00088) [2022-07-09 08:06:51,997][26022] Updated weights on worker 0-0, policy_version 161003 (0.00085) [2022-07-09 08:06:52,395][25689] Fps is (10 sec: 5743.8, 60 sec: 5778.9, 300 sec: 5749.6). Total num frames: 164869120. Throughput: 0: 5170.9. Samples: 164861606. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:06:52,395][25689] Avg episode reward: [(0, '-53.596')] [2022-07-09 08:06:53,884][26022] Updated weights on worker 0-0, policy_version 161013 (0.00093) [2022-07-09 08:06:55,569][26022] Updated weights on worker 0-0, policy_version 161023 (0.00088) [2022-07-09 08:06:57,354][26022] Updated weights on worker 0-0, policy_version 161033 (0.00083) [2022-07-09 08:06:57,402][25689] Fps is (10 sec: 5811.6, 60 sec: 5745.1, 300 sec: 5750.0). Total num frames: 164897792. Throughput: 0: 6028.0. Samples: 164896018. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:06:57,403][25689] Avg episode reward: [(0, '-53.749')] [2022-07-09 08:06:59,241][26022] Updated weights on worker 0-0, policy_version 161043 (0.00081) [2022-07-09 08:07:00,771][26022] Updated weights on worker 0-0, policy_version 161053 (0.00083) [2022-07-09 08:07:02,441][25689] Fps is (10 sec: 5504.2, 60 sec: 5708.8, 300 sec: 5756.9). Total num frames: 164924416. Throughput: 0: 6031.7. Samples: 164930906. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:02,442][25689] Avg episode reward: [(0, '-53.116')] [2022-07-09 08:07:02,947][26022] Updated weights on worker 0-0, policy_version 161063 (0.00096) [2022-07-09 08:07:04,791][26022] Updated weights on worker 0-0, policy_version 161073 (0.00088) [2022-07-09 08:07:06,623][26022] Updated weights on worker 0-0, policy_version 161083 (0.00076) [2022-07-09 08:07:07,569][25689] Fps is (10 sec: 5539.5, 60 sec: 5772.1, 300 sec: 5751.0). Total num frames: 164954112. Throughput: 0: 5887.2. Samples: 164962990. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:07,570][25689] Avg episode reward: [(0, '-51.822')] [2022-07-09 08:07:08,445][26022] Updated weights on worker 0-0, policy_version 161093 (0.00094) [2022-07-09 08:07:10,313][26022] Updated weights on worker 0-0, policy_version 161103 (0.00090) [2022-07-09 08:07:11,950][26022] Updated weights on worker 0-0, policy_version 161113 (0.00099) [2022-07-09 08:07:12,593][25689] Fps is (10 sec: 5648.6, 60 sec: 5741.5, 300 sec: 5748.1). Total num frames: 164981760. Throughput: 0: 5869.4. Samples: 164980156. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:12,593][25689] Avg episode reward: [(0, '-51.975')] [2022-07-09 08:07:13,763][26022] Updated weights on worker 0-0, policy_version 161123 (0.00084) [2022-07-09 08:07:15,581][26022] Updated weights on worker 0-0, policy_version 161133 (0.00088) [2022-07-09 08:07:17,353][26022] Updated weights on worker 0-0, policy_version 161143 (0.00082) [2022-07-09 08:07:17,625][25689] Fps is (10 sec: 5804.4, 60 sec: 5722.0, 300 sec: 5748.4). Total num frames: 165012480. Throughput: 0: 5878.1. Samples: 165014890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:17,625][25689] Avg episode reward: [(0, '-51.758')] [2022-07-09 08:07:19,184][26022] Updated weights on worker 0-0, policy_version 161153 (0.00087) [2022-07-09 08:07:20,794][26022] Updated weights on worker 0-0, policy_version 161163 (0.00090) [2022-07-09 08:07:22,642][25689] Fps is (10 sec: 5808.1, 60 sec: 5720.5, 300 sec: 5748.9). Total num frames: 165040128. Throughput: 0: 5894.1. Samples: 165049974. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:22,643][25689] Avg episode reward: [(0, '-51.309')] [2022-07-09 08:07:22,693][26022] Updated weights on worker 0-0, policy_version 161173 (0.00082) [2022-07-09 08:07:24,246][26022] Updated weights on worker 0-0, policy_version 161183 (0.00083) [2022-07-09 08:07:26,146][26022] Updated weights on worker 0-0, policy_version 161193 (0.00060) [2022-07-09 08:07:27,700][25689] Fps is (10 sec: 5793.0, 60 sec: 5755.2, 300 sec: 5754.7). Total num frames: 165070848. Throughput: 0: 5193.9. Samples: 165067550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:27,702][25689] Avg episode reward: [(0, '-52.632')] [2022-07-09 08:07:27,919][26022] Updated weights on worker 0-0, policy_version 161203 (0.00086) [2022-07-09 08:07:29,475][26022] Updated weights on worker 0-0, policy_version 161213 (0.00085) [2022-07-09 08:07:31,610][26022] Updated weights on worker 0-0, policy_version 161223 (0.00865) [2022-07-09 08:07:32,727][25689] Fps is (10 sec: 5990.8, 60 sec: 5769.9, 300 sec: 5751.3). Total num frames: 165100544. Throughput: 0: 6066.4. Samples: 165102298. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:32,727][25689] Avg episode reward: [(0, '-53.602')] [2022-07-09 08:07:33,063][26022] Updated weights on worker 0-0, policy_version 161233 (0.00088) [2022-07-09 08:07:35,071][26022] Updated weights on worker 0-0, policy_version 161243 (0.00095) [2022-07-09 08:07:36,949][26022] Updated weights on worker 0-0, policy_version 161253 (0.00091) [2022-07-09 08:07:37,729][25689] Fps is (10 sec: 5615.8, 60 sec: 5719.7, 300 sec: 5747.8). Total num frames: 165127168. Throughput: 0: 6040.5. Samples: 165136330. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:37,730][25689] Avg episode reward: [(0, '-54.191')] [2022-07-09 08:07:38,567][26022] Updated weights on worker 0-0, policy_version 161263 (0.00278) [2022-07-09 08:07:40,477][26022] Updated weights on worker 0-0, policy_version 161273 (0.00086) [2022-07-09 08:07:42,087][26022] Updated weights on worker 0-0, policy_version 161283 (0.00083) [2022-07-09 08:07:42,750][25689] Fps is (10 sec: 5517.0, 60 sec: 5702.4, 300 sec: 5741.2). Total num frames: 165155840. Throughput: 0: 5159.8. Samples: 165153724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:42,751][25689] Avg episode reward: [(0, '-54.092')] [2022-07-09 08:07:43,897][26022] Updated weights on worker 0-0, policy_version 161293 (0.00090) [2022-07-09 08:07:45,635][26022] Updated weights on worker 0-0, policy_version 161303 (0.00094) [2022-07-09 08:07:47,257][26022] Updated weights on worker 0-0, policy_version 161313 (0.00083) [2022-07-09 08:07:47,844][25689] Fps is (10 sec: 5871.6, 60 sec: 5739.2, 300 sec: 5744.1). Total num frames: 165186560. Throughput: 0: 6023.1. Samples: 165188878. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:47,846][25689] Avg episode reward: [(0, '-54.207')] [2022-07-09 08:07:49,243][26022] Updated weights on worker 0-0, policy_version 161323 (0.00083) [2022-07-09 08:07:50,823][26022] Updated weights on worker 0-0, policy_version 161333 (0.00099) [2022-07-09 08:07:52,638][26022] Updated weights on worker 0-0, policy_version 161343 (0.00093) [2022-07-09 08:07:52,847][25689] Fps is (10 sec: 5983.0, 60 sec: 5742.3, 300 sec: 5748.6). Total num frames: 165216256. Throughput: 0: 6052.9. Samples: 165224086. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:52,849][25689] Avg episode reward: [(0, '-53.814')] [2022-07-09 08:07:54,412][26022] Updated weights on worker 0-0, policy_version 161353 (0.00090) [2022-07-09 08:07:56,158][26022] Updated weights on worker 0-0, policy_version 161363 (0.00081) [2022-07-09 08:07:57,768][26022] Updated weights on worker 0-0, policy_version 161373 (0.00085) [2022-07-09 08:07:57,894][25689] Fps is (10 sec: 5909.5, 60 sec: 5755.4, 300 sec: 5748.6). Total num frames: 165245952. Throughput: 0: 5226.1. Samples: 165241716. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:07:57,896][25689] Avg episode reward: [(0, '-53.403')] [2022-07-09 08:07:59,715][26022] Updated weights on worker 0-0, policy_version 161383 (0.00089) [2022-07-09 08:08:01,459][26022] Updated weights on worker 0-0, policy_version 161393 (0.00084) [2022-07-09 08:08:02,916][25689] Fps is (10 sec: 5593.8, 60 sec: 5757.1, 300 sec: 5753.3). Total num frames: 165272576. Throughput: 0: 6091.2. Samples: 165276560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:02,918][25689] Avg episode reward: [(0, '-53.034')] [2022-07-09 08:08:03,607][26022] Updated weights on worker 0-0, policy_version 161403 (0.00085) [2022-07-09 08:08:05,458][26022] Updated weights on worker 0-0, policy_version 161413 (0.00089) [2022-07-09 08:08:06,953][26022] Updated weights on worker 0-0, policy_version 161423 (0.00084) [2022-07-09 08:08:07,994][25689] Fps is (10 sec: 5475.0, 60 sec: 5744.9, 300 sec: 5741.6). Total num frames: 165301248. Throughput: 0: 5974.0. Samples: 165309254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:07,995][25689] Avg episode reward: [(0, '-53.396')] [2022-07-09 08:08:08,846][26022] Updated weights on worker 0-0, policy_version 161433 (0.00087) [2022-07-09 08:08:10,719][26022] Updated weights on worker 0-0, policy_version 161443 (0.00086) [2022-07-09 08:08:12,312][26022] Updated weights on worker 0-0, policy_version 161453 (0.00090) [2022-07-09 08:08:13,027][25689] Fps is (10 sec: 5874.1, 60 sec: 5794.9, 300 sec: 5748.0). Total num frames: 165331968. Throughput: 0: 5085.6. Samples: 165326710. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:13,027][25689] Avg episode reward: [(0, '-53.694')] [2022-07-09 08:08:14,241][26022] Updated weights on worker 0-0, policy_version 161463 (0.00109) [2022-07-09 08:08:15,755][26022] Updated weights on worker 0-0, policy_version 161473 (0.00093) [2022-07-09 08:08:17,750][26022] Updated weights on worker 0-0, policy_version 161483 (0.00098) [2022-07-09 08:08:18,035][25689] Fps is (10 sec: 5813.3, 60 sec: 5746.3, 300 sec: 5748.2). Total num frames: 165359616. Throughput: 0: 5957.8. Samples: 165361708. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:18,035][25689] Avg episode reward: [(0, '-53.014')] [2022-07-09 08:08:19,367][26022] Updated weights on worker 0-0, policy_version 161493 (0.00092) [2022-07-09 08:08:21,149][26022] Updated weights on worker 0-0, policy_version 161503 (0.00083) [2022-07-09 08:08:22,991][26022] Updated weights on worker 0-0, policy_version 161513 (0.00088) [2022-07-09 08:08:23,048][25689] Fps is (10 sec: 5722.3, 60 sec: 5780.6, 300 sec: 5750.0). Total num frames: 165389312. Throughput: 0: 5970.5. Samples: 165396758. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:23,048][25689] Avg episode reward: [(0, '-53.255')] [2022-07-09 08:08:24,802][26022] Updated weights on worker 0-0, policy_version 161523 (0.00577) [2022-07-09 08:08:26,446][26022] Updated weights on worker 0-0, policy_version 161533 (0.00086) [2022-07-09 08:08:28,134][25689] Fps is (10 sec: 5880.6, 60 sec: 5761.0, 300 sec: 5748.6). Total num frames: 165419008. Throughput: 0: 5217.4. Samples: 165414332. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:28,135][25689] Avg episode reward: [(0, '-53.452')] [2022-07-09 08:08:28,352][26022] Updated weights on worker 0-0, policy_version 161543 (0.00101) [2022-07-09 08:08:29,913][26022] Updated weights on worker 0-0, policy_version 161553 (0.00093) [2022-07-09 08:08:32,063][26022] Updated weights on worker 0-0, policy_version 161563 (0.00086) [2022-07-09 08:08:33,155][25689] Fps is (10 sec: 5876.4, 60 sec: 5761.6, 300 sec: 5749.2). Total num frames: 165448704. Throughput: 0: 6086.4. Samples: 165449216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:33,155][25689] Avg episode reward: [(0, '-53.679')] [2022-07-09 08:08:33,559][26022] Updated weights on worker 0-0, policy_version 161573 (0.00086) [2022-07-09 08:08:35,486][26022] Updated weights on worker 0-0, policy_version 161583 (0.00081) [2022-07-09 08:08:36,939][26022] Updated weights on worker 0-0, policy_version 161593 (0.00085) [2022-07-09 08:08:38,184][25689] Fps is (10 sec: 5705.9, 60 sec: 5775.9, 300 sec: 5745.2). Total num frames: 165476352. Throughput: 0: 6063.6. Samples: 165483886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:38,185][25689] Avg episode reward: [(0, '-53.932')] [2022-07-09 08:08:38,307][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:08:38,316][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000161599_165477376.pth [2022-07-09 08:08:38,316][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000159576_163405824.pth [2022-07-09 08:08:39,015][26022] Updated weights on worker 0-0, policy_version 161603 (0.00078) [2022-07-09 08:08:40,480][26022] Updated weights on worker 0-0, policy_version 161613 (0.00086) [2022-07-09 08:08:42,541][26022] Updated weights on worker 0-0, policy_version 161623 (0.00082) [2022-07-09 08:08:43,192][25689] Fps is (10 sec: 5815.0, 60 sec: 5811.0, 300 sec: 5754.1). Total num frames: 165507072. Throughput: 0: 5201.0. Samples: 165501528. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:43,193][25689] Avg episode reward: [(0, '-53.089')] [2022-07-09 08:08:43,932][26022] Updated weights on worker 0-0, policy_version 161633 (0.00085) [2022-07-09 08:08:45,950][26022] Updated weights on worker 0-0, policy_version 161643 (0.00090) [2022-07-09 08:08:47,599][26022] Updated weights on worker 0-0, policy_version 161653 (0.00084) [2022-07-09 08:08:48,263][25689] Fps is (10 sec: 5892.6, 60 sec: 5779.4, 300 sec: 5747.1). Total num frames: 165535744. Throughput: 0: 6058.4. Samples: 165536282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:48,264][25689] Avg episode reward: [(0, '-52.608')] [2022-07-09 08:08:49,606][26022] Updated weights on worker 0-0, policy_version 161663 (0.00394) [2022-07-09 08:08:51,286][26022] Updated weights on worker 0-0, policy_version 161673 (0.00096) [2022-07-09 08:08:53,207][26022] Updated weights on worker 0-0, policy_version 161683 (0.00088) [2022-07-09 08:08:53,274][25689] Fps is (10 sec: 5586.5, 60 sec: 5744.8, 300 sec: 5747.5). Total num frames: 165563392. Throughput: 0: 6029.8. Samples: 165570530. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:53,274][25689] Avg episode reward: [(0, '-53.198')] [2022-07-09 08:08:54,842][26022] Updated weights on worker 0-0, policy_version 161693 (0.00083) [2022-07-09 08:08:56,774][26022] Updated weights on worker 0-0, policy_version 161703 (0.00082) [2022-07-09 08:08:58,297][25689] Fps is (10 sec: 5715.4, 60 sec: 5747.1, 300 sec: 5751.6). Total num frames: 165593088. Throughput: 0: 5165.0. Samples: 165587766. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:08:58,297][25689] Avg episode reward: [(0, '-52.680')] [2022-07-09 08:08:58,327][26022] Updated weights on worker 0-0, policy_version 161713 (0.00080) [2022-07-09 08:09:00,285][26022] Updated weights on worker 0-0, policy_version 161723 (0.00090) [2022-07-09 08:09:02,359][26022] Updated weights on worker 0-0, policy_version 161733 (0.00089) [2022-07-09 08:09:03,320][25689] Fps is (10 sec: 5504.0, 60 sec: 5729.9, 300 sec: 5745.8). Total num frames: 165618688. Throughput: 0: 5957.1. Samples: 165621434. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:03,321][25689] Avg episode reward: [(0, '-52.198')] [2022-07-09 08:09:04,086][26022] Updated weights on worker 0-0, policy_version 161743 (0.00087) [2022-07-09 08:09:06,110][26022] Updated weights on worker 0-0, policy_version 161753 (0.00094) [2022-07-09 08:09:07,779][26022] Updated weights on worker 0-0, policy_version 161763 (0.00093) [2022-07-09 08:09:08,410][25689] Fps is (10 sec: 5467.5, 60 sec: 5745.8, 300 sec: 5742.4). Total num frames: 165648384. Throughput: 0: 5854.4. Samples: 165654230. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:08,411][25689] Avg episode reward: [(0, '-52.129')] [2022-07-09 08:09:09,594][26022] Updated weights on worker 0-0, policy_version 161773 (0.00091) [2022-07-09 08:09:11,426][26022] Updated weights on worker 0-0, policy_version 161783 (0.00074) [2022-07-09 08:09:12,954][26022] Updated weights on worker 0-0, policy_version 161793 (0.00088) [2022-07-09 08:09:13,452][25689] Fps is (10 sec: 5862.1, 60 sec: 5728.0, 300 sec: 5749.1). Total num frames: 165678080. Throughput: 0: 5002.2. Samples: 165671466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:13,460][25689] Avg episode reward: [(0, '-52.473')] [2022-07-09 08:09:14,973][26022] Updated weights on worker 0-0, policy_version 161803 (0.00082) [2022-07-09 08:09:16,689][26022] Updated weights on worker 0-0, policy_version 161813 (0.00087) [2022-07-09 08:09:18,541][25689] Fps is (10 sec: 5660.2, 60 sec: 5720.2, 300 sec: 5744.2). Total num frames: 165705728. Throughput: 0: 5837.9. Samples: 165705952. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:18,543][25689] Avg episode reward: [(0, '-52.960')] [2022-07-09 08:09:18,543][26022] Updated weights on worker 0-0, policy_version 161823 (0.00095) [2022-07-09 08:09:20,258][26022] Updated weights on worker 0-0, policy_version 161833 (0.00108) [2022-07-09 08:09:22,095][26022] Updated weights on worker 0-0, policy_version 161843 (0.00084) [2022-07-09 08:09:23,639][25689] Fps is (10 sec: 5629.2, 60 sec: 5712.3, 300 sec: 5743.3). Total num frames: 165735424. Throughput: 0: 5866.2. Samples: 165740626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:23,639][25689] Avg episode reward: [(0, '-52.990')] [2022-07-09 08:09:23,796][26022] Updated weights on worker 0-0, policy_version 161853 (0.00082) [2022-07-09 08:09:25,747][26022] Updated weights on worker 0-0, policy_version 161863 (0.00085) [2022-07-09 08:09:27,479][26022] Updated weights on worker 0-0, policy_version 161873 (0.00080) [2022-07-09 08:09:28,687][25689] Fps is (10 sec: 5652.2, 60 sec: 5682.1, 300 sec: 5739.2). Total num frames: 165763072. Throughput: 0: 5090.5. Samples: 165757448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:28,687][25689] Avg episode reward: [(0, '-53.025')] [2022-07-09 08:09:29,191][26022] Updated weights on worker 0-0, policy_version 161883 (0.00085) [2022-07-09 08:09:31,121][26022] Updated weights on worker 0-0, policy_version 161893 (0.00087) [2022-07-09 08:09:32,894][26022] Updated weights on worker 0-0, policy_version 161903 (0.00085) [2022-07-09 08:09:33,697][25689] Fps is (10 sec: 5802.7, 60 sec: 5699.9, 300 sec: 5742.9). Total num frames: 165793792. Throughput: 0: 5932.7. Samples: 165791576. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:33,698][25689] Avg episode reward: [(0, '-52.747')] [2022-07-09 08:09:34,717][26022] Updated weights on worker 0-0, policy_version 161913 (0.00089) [2022-07-09 08:09:36,485][26022] Updated weights on worker 0-0, policy_version 161923 (0.00086) [2022-07-09 08:09:38,326][26022] Updated weights on worker 0-0, policy_version 161933 (0.00088) [2022-07-09 08:09:38,721][25689] Fps is (10 sec: 5714.8, 60 sec: 5683.6, 300 sec: 5740.2). Total num frames: 165820416. Throughput: 0: 5935.2. Samples: 165825722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:38,721][25689] Avg episode reward: [(0, '-53.017')] [2022-07-09 08:09:40,068][26022] Updated weights on worker 0-0, policy_version 161943 (0.00082) [2022-07-09 08:09:41,885][26022] Updated weights on worker 0-0, policy_version 161953 (0.00085) [2022-07-09 08:09:43,432][26022] Updated weights on worker 0-0, policy_version 161963 (0.00092) [2022-07-09 08:09:43,734][25689] Fps is (10 sec: 5611.4, 60 sec: 5666.2, 300 sec: 5741.1). Total num frames: 165850112. Throughput: 0: 5092.3. Samples: 165842958. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:43,735][25689] Avg episode reward: [(0, '-53.124')] [2022-07-09 08:09:45,339][26022] Updated weights on worker 0-0, policy_version 161973 (0.00695) [2022-07-09 08:09:47,003][26022] Updated weights on worker 0-0, policy_version 161983 (0.00088) [2022-07-09 08:09:48,845][25689] Fps is (10 sec: 5866.7, 60 sec: 5679.4, 300 sec: 5735.9). Total num frames: 165879808. Throughput: 0: 5977.1. Samples: 165877934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:48,845][25689] Avg episode reward: [(0, '-53.190')] [2022-07-09 08:09:48,916][26022] Updated weights on worker 0-0, policy_version 161993 (0.00092) [2022-07-09 08:09:50,699][26022] Updated weights on worker 0-0, policy_version 162003 (0.00089) [2022-07-09 08:09:52,411][26022] Updated weights on worker 0-0, policy_version 162013 (0.00087) [2022-07-09 08:09:53,878][25689] Fps is (10 sec: 5854.9, 60 sec: 5711.0, 300 sec: 5745.7). Total num frames: 165909504. Throughput: 0: 6011.2. Samples: 165912886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:53,883][25689] Avg episode reward: [(0, '-53.750')] [2022-07-09 08:09:54,203][26022] Updated weights on worker 0-0, policy_version 162023 (0.00092) [2022-07-09 08:09:56,135][26022] Updated weights on worker 0-0, policy_version 162033 (0.00111) [2022-07-09 08:09:57,809][26022] Updated weights on worker 0-0, policy_version 162043 (0.00086) [2022-07-09 08:09:58,942][25689] Fps is (10 sec: 5679.1, 60 sec: 5673.4, 300 sec: 5738.3). Total num frames: 165937152. Throughput: 0: 6008.1. Samples: 165947212. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:09:58,942][25689] Avg episode reward: [(0, '-53.627')] [2022-07-09 08:09:59,683][26022] Updated weights on worker 0-0, policy_version 162053 (0.00362) [2022-07-09 08:10:01,455][26022] Updated weights on worker 0-0, policy_version 162063 (0.00084) [2022-07-09 08:10:03,631][26022] Updated weights on worker 0-0, policy_version 162073 (0.00088) [2022-07-09 08:10:03,946][25689] Fps is (10 sec: 5390.8, 60 sec: 5692.1, 300 sec: 5737.3). Total num frames: 165963776. Throughput: 0: 5904.2. Samples: 165962290. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:03,946][25689] Avg episode reward: [(0, '-53.812')] [2022-07-09 08:10:05,316][26022] Updated weights on worker 0-0, policy_version 162083 (0.00085) [2022-07-09 08:10:07,300][26022] Updated weights on worker 0-0, policy_version 162093 (0.00090) [2022-07-09 08:10:08,946][26022] Updated weights on worker 0-0, policy_version 162103 (0.00078) [2022-07-09 08:10:09,011][25689] Fps is (10 sec: 5593.3, 60 sec: 5694.4, 300 sec: 5740.8). Total num frames: 165993472. Throughput: 0: 5870.0. Samples: 165996312. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:09,012][25689] Avg episode reward: [(0, '-53.135')] [2022-07-09 08:10:10,907][26022] Updated weights on worker 0-0, policy_version 162113 (0.00087) [2022-07-09 08:10:12,552][26022] Updated weights on worker 0-0, policy_version 162123 (0.00091) [2022-07-09 08:10:14,037][25689] Fps is (10 sec: 5682.5, 60 sec: 5662.1, 300 sec: 5730.9). Total num frames: 166021120. Throughput: 0: 5845.2. Samples: 166030718. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:14,037][25689] Avg episode reward: [(0, '-52.972')] [2022-07-09 08:10:14,512][26022] Updated weights on worker 0-0, policy_version 162133 (0.00090) [2022-07-09 08:10:15,970][26022] Updated weights on worker 0-0, policy_version 162143 (0.00084) [2022-07-09 08:10:17,964][26022] Updated weights on worker 0-0, policy_version 162153 (0.00086) [2022-07-09 08:10:19,079][25689] Fps is (10 sec: 5797.7, 60 sec: 5717.3, 300 sec: 5737.9). Total num frames: 166051840. Throughput: 0: 5005.0. Samples: 166047996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:19,079][25689] Avg episode reward: [(0, '-53.179')] [2022-07-09 08:10:19,551][26022] Updated weights on worker 0-0, policy_version 162163 (0.00088) [2022-07-09 08:10:21,520][26022] Updated weights on worker 0-0, policy_version 162173 (0.00094) [2022-07-09 08:10:23,173][26022] Updated weights on worker 0-0, policy_version 162183 (0.00105) [2022-07-09 08:10:24,090][25689] Fps is (10 sec: 5806.0, 60 sec: 5691.6, 300 sec: 5736.4). Total num frames: 166079488. Throughput: 0: 5960.5. Samples: 166082360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:24,091][25689] Avg episode reward: [(0, '-53.678')] [2022-07-09 08:10:25,035][26022] Updated weights on worker 0-0, policy_version 162193 (0.00969) [2022-07-09 08:10:26,740][26022] Updated weights on worker 0-0, policy_version 162203 (0.00092) [2022-07-09 08:10:28,783][26022] Updated weights on worker 0-0, policy_version 162213 (0.00087) [2022-07-09 08:10:29,171][25689] Fps is (10 sec: 5580.5, 60 sec: 5705.4, 300 sec: 5729.7). Total num frames: 166108160. Throughput: 0: 5973.8. Samples: 166116742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:29,172][25689] Avg episode reward: [(0, '-53.309')] [2022-07-09 08:10:30,290][26022] Updated weights on worker 0-0, policy_version 162223 (0.00093) [2022-07-09 08:10:32,210][26022] Updated weights on worker 0-0, policy_version 162233 (0.00088) [2022-07-09 08:10:33,993][26022] Updated weights on worker 0-0, policy_version 162243 (0.00053) [2022-07-09 08:10:34,176][25689] Fps is (10 sec: 5685.6, 60 sec: 5672.1, 300 sec: 5729.7). Total num frames: 166136832. Throughput: 0: 5134.8. Samples: 166134128. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:34,176][25689] Avg episode reward: [(0, '-53.527')] [2022-07-09 08:10:35,863][26022] Updated weights on worker 0-0, policy_version 162253 (0.00091) [2022-07-09 08:10:37,566][26022] Updated weights on worker 0-0, policy_version 162263 (0.00419) [2022-07-09 08:10:38,375][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:10:38,389][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000162267_166161408.pth [2022-07-09 08:10:38,389][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000160249_164094976.pth [2022-07-09 08:10:39,181][25689] Fps is (10 sec: 5830.9, 60 sec: 5724.6, 300 sec: 5733.2). Total num frames: 166166528. Throughput: 0: 6004.3. Samples: 166168696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:39,182][25689] Avg episode reward: [(0, '-53.368')] [2022-07-09 08:10:39,225][26022] Updated weights on worker 0-0, policy_version 162273 (0.00089) [2022-07-09 08:10:41,383][26022] Updated weights on worker 0-0, policy_version 162283 (0.00091) [2022-07-09 08:10:42,727][26022] Updated weights on worker 0-0, policy_version 162293 (0.00091) [2022-07-09 08:10:44,213][25689] Fps is (10 sec: 5713.0, 60 sec: 5689.0, 300 sec: 5727.2). Total num frames: 166194176. Throughput: 0: 6002.3. Samples: 166203146. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:44,215][25689] Avg episode reward: [(0, '-52.883')] [2022-07-09 08:10:44,760][26022] Updated weights on worker 0-0, policy_version 162303 (0.00079) [2022-07-09 08:10:46,206][26022] Updated weights on worker 0-0, policy_version 162313 (0.00084) [2022-07-09 08:10:48,268][26022] Updated weights on worker 0-0, policy_version 162323 (0.00090) [2022-07-09 08:10:49,281][25689] Fps is (10 sec: 5779.1, 60 sec: 5709.9, 300 sec: 5734.6). Total num frames: 166224896. Throughput: 0: 5163.5. Samples: 166220580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:49,283][25689] Avg episode reward: [(0, '-53.066')] [2022-07-09 08:10:49,868][26022] Updated weights on worker 0-0, policy_version 162333 (0.00991) [2022-07-09 08:10:51,836][26022] Updated weights on worker 0-0, policy_version 162343 (0.00596) [2022-07-09 08:10:53,340][26022] Updated weights on worker 0-0, policy_version 162353 (0.00085) [2022-07-09 08:10:54,317][25689] Fps is (10 sec: 5878.3, 60 sec: 5692.8, 300 sec: 5727.2). Total num frames: 166253568. Throughput: 0: 6015.5. Samples: 166255286. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 08:10:54,318][25689] Avg episode reward: [(0, '-52.192')] [2022-07-09 08:10:55,367][26022] Updated weights on worker 0-0, policy_version 162363 (0.00086) [2022-07-09 08:10:57,090][26022] Updated weights on worker 0-0, policy_version 162373 (0.00099) [2022-07-09 08:10:59,158][26022] Updated weights on worker 0-0, policy_version 162383 (0.00080) [2022-07-09 08:10:59,376][25689] Fps is (10 sec: 5680.6, 60 sec: 5710.2, 300 sec: 5726.3). Total num frames: 166282240. Throughput: 0: 5977.4. Samples: 166289406. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:10:59,377][25689] Avg episode reward: [(0, '-52.810')] [2022-07-09 08:11:00,632][26022] Updated weights on worker 0-0, policy_version 162393 (0.00085) [2022-07-09 08:11:02,908][26022] Updated weights on worker 0-0, policy_version 162403 (0.00085) [2022-07-09 08:11:04,387][25689] Fps is (10 sec: 5593.0, 60 sec: 5726.4, 300 sec: 5734.6). Total num frames: 166309888. Throughput: 0: 5142.6. Samples: 166306890. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:04,387][25689] Avg episode reward: [(0, '-53.043')] [2022-07-09 08:11:04,562][26022] Updated weights on worker 0-0, policy_version 162413 (0.00082) [2022-07-09 08:11:06,453][26022] Updated weights on worker 0-0, policy_version 162423 (0.00081) [2022-07-09 08:11:08,061][26022] Updated weights on worker 0-0, policy_version 162433 (0.00085) [2022-07-09 08:11:09,445][25689] Fps is (10 sec: 5593.4, 60 sec: 5710.2, 300 sec: 5731.1). Total num frames: 166338560. Throughput: 0: 5903.7. Samples: 166339620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:09,445][25689] Avg episode reward: [(0, '-53.200')] [2022-07-09 08:11:10,107][26022] Updated weights on worker 0-0, policy_version 162443 (0.00082) [2022-07-09 08:11:11,622][26022] Updated weights on worker 0-0, policy_version 162453 (0.00087) [2022-07-09 08:11:13,538][26022] Updated weights on worker 0-0, policy_version 162463 (0.00084) [2022-07-09 08:11:14,457][25689] Fps is (10 sec: 5694.6, 60 sec: 5728.5, 300 sec: 5720.7). Total num frames: 166367232. Throughput: 0: 5900.6. Samples: 166374122. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:14,457][25689] Avg episode reward: [(0, '-53.010')] [2022-07-09 08:11:15,355][26022] Updated weights on worker 0-0, policy_version 162473 (0.00098) [2022-07-09 08:11:17,066][26022] Updated weights on worker 0-0, policy_version 162483 (0.00087) [2022-07-09 08:11:19,028][26022] Updated weights on worker 0-0, policy_version 162493 (0.00091) [2022-07-09 08:11:19,463][25689] Fps is (10 sec: 5724.1, 60 sec: 5697.9, 300 sec: 5724.0). Total num frames: 166395904. Throughput: 0: 5077.9. Samples: 166391406. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:19,463][25689] Avg episode reward: [(0, '-53.317')] [2022-07-09 08:11:20,590][26022] Updated weights on worker 0-0, policy_version 162503 (0.00085) [2022-07-09 08:11:22,383][26022] Updated weights on worker 0-0, policy_version 162513 (0.00089) [2022-07-09 08:11:24,264][26022] Updated weights on worker 0-0, policy_version 162523 (0.00087) [2022-07-09 08:11:24,469][25689] Fps is (10 sec: 5624.8, 60 sec: 5698.4, 300 sec: 5721.7). Total num frames: 166423552. Throughput: 0: 5933.1. Samples: 166426042. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:24,470][25689] Avg episode reward: [(0, '-53.014')] [2022-07-09 08:11:25,822][26022] Updated weights on worker 0-0, policy_version 162533 (0.00082) [2022-07-09 08:11:27,875][26022] Updated weights on worker 0-0, policy_version 162543 (0.00090) [2022-07-09 08:11:29,455][26022] Updated weights on worker 0-0, policy_version 162553 (0.00084) [2022-07-09 08:11:29,527][25689] Fps is (10 sec: 5799.5, 60 sec: 5734.5, 300 sec: 5727.6). Total num frames: 166454272. Throughput: 0: 6027.1. Samples: 166460658. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:29,531][25689] Avg episode reward: [(0, '-52.551')] [2022-07-09 08:11:31,211][26022] Updated weights on worker 0-0, policy_version 162563 (0.00087) [2022-07-09 08:11:32,971][26022] Updated weights on worker 0-0, policy_version 162573 (0.00091) [2022-07-09 08:11:34,534][25689] Fps is (10 sec: 5901.1, 60 sec: 5734.3, 300 sec: 5724.2). Total num frames: 166482944. Throughput: 0: 5175.9. Samples: 166478040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:34,534][25689] Avg episode reward: [(0, '-52.110')] [2022-07-09 08:11:34,839][26022] Updated weights on worker 0-0, policy_version 162583 (0.00084) [2022-07-09 08:11:36,576][26022] Updated weights on worker 0-0, policy_version 162593 (0.00080) [2022-07-09 08:11:38,238][26022] Updated weights on worker 0-0, policy_version 162603 (0.00959) [2022-07-09 08:11:39,543][25689] Fps is (10 sec: 5725.3, 60 sec: 5717.0, 300 sec: 5720.8). Total num frames: 166511616. Throughput: 0: 6044.4. Samples: 166512778. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:39,544][25689] Avg episode reward: [(0, '-53.019')] [2022-07-09 08:11:40,347][26022] Updated weights on worker 0-0, policy_version 162613 (0.00098) [2022-07-09 08:11:41,864][26022] Updated weights on worker 0-0, policy_version 162623 (0.00093) [2022-07-09 08:11:43,979][26022] Updated weights on worker 0-0, policy_version 162633 (0.00092) [2022-07-09 08:11:44,555][25689] Fps is (10 sec: 5824.8, 60 sec: 5752.9, 300 sec: 5726.5). Total num frames: 166541312. Throughput: 0: 6030.3. Samples: 166547162. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:44,556][25689] Avg episode reward: [(0, '-52.492')] [2022-07-09 08:11:45,720][26022] Updated weights on worker 0-0, policy_version 162643 (0.00089) [2022-07-09 08:11:47,284][26022] Updated weights on worker 0-0, policy_version 162653 (0.00102) [2022-07-09 08:11:49,090][26022] Updated weights on worker 0-0, policy_version 162663 (0.00089) [2022-07-09 08:11:49,679][25689] Fps is (10 sec: 5758.2, 60 sec: 5713.5, 300 sec: 5721.3). Total num frames: 166569984. Throughput: 0: 5143.1. Samples: 166564304. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:49,680][25689] Avg episode reward: [(0, '-53.107')] [2022-07-09 08:11:50,707][26022] Updated weights on worker 0-0, policy_version 162673 (0.00096) [2022-07-09 08:11:52,600][26022] Updated weights on worker 0-0, policy_version 162683 (0.00088) [2022-07-09 08:11:54,439][26022] Updated weights on worker 0-0, policy_version 162693 (0.00102) [2022-07-09 08:11:54,719][25689] Fps is (10 sec: 5641.8, 60 sec: 5713.2, 300 sec: 5720.7). Total num frames: 166598656. Throughput: 0: 5997.1. Samples: 166599090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:54,719][25689] Avg episode reward: [(0, '-53.010')] [2022-07-09 08:11:56,164][26022] Updated weights on worker 0-0, policy_version 162703 (0.00084) [2022-07-09 08:11:58,083][26022] Updated weights on worker 0-0, policy_version 162713 (0.00085) [2022-07-09 08:11:59,563][26022] Updated weights on worker 0-0, policy_version 162723 (0.00088) [2022-07-09 08:11:59,765][25689] Fps is (10 sec: 5888.9, 60 sec: 5748.3, 300 sec: 5734.4). Total num frames: 166629376. Throughput: 0: 5977.6. Samples: 166633656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:11:59,765][25689] Avg episode reward: [(0, '-53.057')] [2022-07-09 08:12:01,683][26022] Updated weights on worker 0-0, policy_version 162733 (0.00091) [2022-07-09 08:12:03,454][26022] Updated weights on worker 0-0, policy_version 162743 (0.00090) [2022-07-09 08:12:04,855][25689] Fps is (10 sec: 5455.2, 60 sec: 5690.0, 300 sec: 5717.9). Total num frames: 166653952. Throughput: 0: 5087.7. Samples: 166650444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:04,855][25689] Avg episode reward: [(0, '-52.190')] [2022-07-09 08:12:05,470][26022] Updated weights on worker 0-0, policy_version 162753 (0.00089) [2022-07-09 08:12:07,047][26022] Updated weights on worker 0-0, policy_version 162763 (0.00083) [2022-07-09 08:12:09,160][26022] Updated weights on worker 0-0, policy_version 162773 (0.00081) [2022-07-09 08:12:09,924][25689] Fps is (10 sec: 5442.6, 60 sec: 5722.8, 300 sec: 5727.3). Total num frames: 166684672. Throughput: 0: 5895.3. Samples: 166683656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:09,925][25689] Avg episode reward: [(0, '-52.597')] [2022-07-09 08:12:10,741][26022] Updated weights on worker 0-0, policy_version 162783 (0.00095) [2022-07-09 08:12:12,627][26022] Updated weights on worker 0-0, policy_version 162793 (0.00098) [2022-07-09 08:12:14,287][26022] Updated weights on worker 0-0, policy_version 162803 (0.00083) [2022-07-09 08:12:14,972][25689] Fps is (10 sec: 5769.3, 60 sec: 5702.5, 300 sec: 5716.7). Total num frames: 166712320. Throughput: 0: 5844.5. Samples: 166717460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:14,972][25689] Avg episode reward: [(0, '-52.173')] [2022-07-09 08:12:16,336][26022] Updated weights on worker 0-0, policy_version 162813 (0.00092) [2022-07-09 08:12:17,879][26022] Updated weights on worker 0-0, policy_version 162823 (0.00089) [2022-07-09 08:12:19,970][26022] Updated weights on worker 0-0, policy_version 162833 (0.00088) [2022-07-09 08:12:19,985][25689] Fps is (10 sec: 5598.0, 60 sec: 5701.8, 300 sec: 5720.2). Total num frames: 166740992. Throughput: 0: 4993.7. Samples: 166734632. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:19,986][25689] Avg episode reward: [(0, '-52.131')] [2022-07-09 08:12:21,446][26022] Updated weights on worker 0-0, policy_version 162843 (0.00082) [2022-07-09 08:12:23,359][26022] Updated weights on worker 0-0, policy_version 162853 (0.00086) [2022-07-09 08:12:25,020][25689] Fps is (10 sec: 5808.7, 60 sec: 5733.0, 300 sec: 5717.2). Total num frames: 166770688. Throughput: 0: 5903.7. Samples: 166769494. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:25,021][25689] Avg episode reward: [(0, '-52.267')] [2022-07-09 08:12:25,075][26022] Updated weights on worker 0-0, policy_version 162863 (0.00082) [2022-07-09 08:12:26,926][26022] Updated weights on worker 0-0, policy_version 162873 (0.00093) [2022-07-09 08:12:28,709][26022] Updated weights on worker 0-0, policy_version 162883 (0.00097) [2022-07-09 08:12:30,103][25689] Fps is (10 sec: 5768.8, 60 sec: 5696.8, 300 sec: 5712.7). Total num frames: 166799360. Throughput: 0: 5944.3. Samples: 166803604. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:30,103][25689] Avg episode reward: [(0, '-52.360')] [2022-07-09 08:12:30,608][26022] Updated weights on worker 0-0, policy_version 162893 (0.00083) [2022-07-09 08:12:32,143][26022] Updated weights on worker 0-0, policy_version 162903 (0.00084) [2022-07-09 08:12:34,250][26022] Updated weights on worker 0-0, policy_version 162913 (0.00080) [2022-07-09 08:12:35,121][25689] Fps is (10 sec: 5778.6, 60 sec: 5712.6, 300 sec: 5722.7). Total num frames: 166829056. Throughput: 0: 5124.3. Samples: 166820710. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:35,122][25689] Avg episode reward: [(0, '-52.942')] [2022-07-09 08:12:35,907][26022] Updated weights on worker 0-0, policy_version 162923 (0.00088) [2022-07-09 08:12:37,715][26022] Updated weights on worker 0-0, policy_version 162933 (0.00087) [2022-07-09 08:12:38,391][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:12:38,406][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000162937_166847488.pth [2022-07-09 08:12:38,407][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000160924_164786176.pth [2022-07-09 08:12:39,415][26022] Updated weights on worker 0-0, policy_version 162943 (0.00080) [2022-07-09 08:12:40,217][25689] Fps is (10 sec: 5771.0, 60 sec: 5704.4, 300 sec: 5721.3). Total num frames: 166857728. Throughput: 0: 5972.0. Samples: 166855456. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:40,217][25689] Avg episode reward: [(0, '-53.058')] [2022-07-09 08:12:41,270][26022] Updated weights on worker 0-0, policy_version 162953 (0.00086) [2022-07-09 08:12:43,062][26022] Updated weights on worker 0-0, policy_version 162963 (0.00089) [2022-07-09 08:12:44,958][26022] Updated weights on worker 0-0, policy_version 162973 (0.00094) [2022-07-09 08:12:45,256][25689] Fps is (10 sec: 5557.0, 60 sec: 5668.1, 300 sec: 5712.0). Total num frames: 166885376. Throughput: 0: 5952.5. Samples: 166889946. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:45,256][25689] Avg episode reward: [(0, '-54.720')] [2022-07-09 08:12:46,587][26022] Updated weights on worker 0-0, policy_version 162983 (0.00081) [2022-07-09 08:12:48,441][26022] Updated weights on worker 0-0, policy_version 162993 (0.00085) [2022-07-09 08:12:49,989][26022] Updated weights on worker 0-0, policy_version 163003 (0.00089) [2022-07-09 08:12:50,375][25689] Fps is (10 sec: 5746.1, 60 sec: 5702.4, 300 sec: 5713.2). Total num frames: 166916096. Throughput: 0: 5967.7. Samples: 166924580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:50,377][25689] Avg episode reward: [(0, '-53.778')] [2022-07-09 08:12:51,842][26022] Updated weights on worker 0-0, policy_version 163013 (0.00086) [2022-07-09 08:12:53,695][26022] Updated weights on worker 0-0, policy_version 163023 (0.00090) [2022-07-09 08:12:55,380][26022] Updated weights on worker 0-0, policy_version 163033 (0.00090) [2022-07-09 08:12:55,472][25689] Fps is (10 sec: 5913.5, 60 sec: 5713.8, 300 sec: 5712.3). Total num frames: 166945792. Throughput: 0: 5964.0. Samples: 166942086. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:12:55,473][25689] Avg episode reward: [(0, '-53.098')] [2022-07-09 08:12:57,325][26022] Updated weights on worker 0-0, policy_version 163043 (0.00086) [2022-07-09 08:12:58,948][26022] Updated weights on worker 0-0, policy_version 163053 (0.00094) [2022-07-09 08:13:00,571][25689] Fps is (10 sec: 5724.3, 60 sec: 5675.2, 300 sec: 5717.6). Total num frames: 166974464. Throughput: 0: 5953.3. Samples: 166976632. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:13:00,572][25689] Avg episode reward: [(0, '-53.693')] [2022-07-09 08:13:00,817][26022] Updated weights on worker 0-0, policy_version 163063 (0.00713) [2022-07-09 08:13:03,066][26022] Updated weights on worker 0-0, policy_version 163073 (0.00079) [2022-07-09 08:13:04,574][26022] Updated weights on worker 0-0, policy_version 163083 (0.00079) [2022-07-09 08:13:05,578][25689] Fps is (10 sec: 5573.4, 60 sec: 5733.6, 300 sec: 5715.6). Total num frames: 167002112. Throughput: 0: 5872.1. Samples: 167009278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 26.0) [2022-07-09 08:13:05,578][25689] Avg episode reward: [(0, '-53.302')] [2022-07-09 08:13:06,496][26022] Updated weights on worker 0-0, policy_version 163093 (0.00087) [2022-07-09 08:13:08,022][26022] Updated weights on worker 0-0, policy_version 163103 (0.00083) [2022-07-09 08:13:10,061][26022] Updated weights on worker 0-0, policy_version 163113 (0.00091) [2022-07-09 08:13:10,626][25689] Fps is (10 sec: 5601.3, 60 sec: 5701.8, 300 sec: 5708.4). Total num frames: 167030784. Throughput: 0: 5030.7. Samples: 167026458. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:10,627][25689] Avg episode reward: [(0, '-52.893')] [2022-07-09 08:13:11,918][26022] Updated weights on worker 0-0, policy_version 163123 (0.00094) [2022-07-09 08:13:13,477][26022] Updated weights on worker 0-0, policy_version 163133 (0.00080) [2022-07-09 08:13:15,549][26022] Updated weights on worker 0-0, policy_version 163143 (0.00081) [2022-07-09 08:13:15,648][25689] Fps is (10 sec: 5592.8, 60 sec: 5704.3, 300 sec: 5708.1). Total num frames: 167058432. Throughput: 0: 5883.3. Samples: 167060784. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:15,648][25689] Avg episode reward: [(0, '-52.845')] [2022-07-09 08:13:17,084][26022] Updated weights on worker 0-0, policy_version 163153 (0.00086) [2022-07-09 08:13:19,028][26022] Updated weights on worker 0-0, policy_version 163163 (0.00092) [2022-07-09 08:13:20,692][25689] Fps is (10 sec: 5594.9, 60 sec: 5701.3, 300 sec: 5704.1). Total num frames: 167087104. Throughput: 0: 5889.8. Samples: 167095140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:20,693][25689] Avg episode reward: [(0, '-53.772')] [2022-07-09 08:13:20,994][26022] Updated weights on worker 0-0, policy_version 163173 (0.00078) [2022-07-09 08:13:22,544][26022] Updated weights on worker 0-0, policy_version 163183 (0.00088) [2022-07-09 08:13:24,656][26022] Updated weights on worker 0-0, policy_version 163193 (0.00086) [2022-07-09 08:13:25,703][25689] Fps is (10 sec: 5804.6, 60 sec: 5703.6, 300 sec: 5705.5). Total num frames: 167116800. Throughput: 0: 5121.3. Samples: 167112348. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:25,704][25689] Avg episode reward: [(0, '-53.414')] [2022-07-09 08:13:26,083][26022] Updated weights on worker 0-0, policy_version 163203 (0.00084) [2022-07-09 08:13:27,916][26022] Updated weights on worker 0-0, policy_version 163213 (0.00084) [2022-07-09 08:13:29,735][26022] Updated weights on worker 0-0, policy_version 163223 (0.00090) [2022-07-09 08:13:30,793][25689] Fps is (10 sec: 5880.1, 60 sec: 5719.8, 300 sec: 5704.2). Total num frames: 167146496. Throughput: 0: 5950.9. Samples: 167146468. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:30,793][25689] Avg episode reward: [(0, '-53.431')] [2022-07-09 08:13:31,885][26022] Updated weights on worker 0-0, policy_version 163233 (0.00094) [2022-07-09 08:13:33,390][26022] Updated weights on worker 0-0, policy_version 163243 (0.00097) [2022-07-09 08:13:35,319][26022] Updated weights on worker 0-0, policy_version 163253 (0.00086) [2022-07-09 08:13:35,847][25689] Fps is (10 sec: 5552.4, 60 sec: 5665.9, 300 sec: 5700.3). Total num frames: 167173120. Throughput: 0: 5937.5. Samples: 167180714. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:35,847][25689] Avg episode reward: [(0, '-53.105')] [2022-07-09 08:13:36,896][26022] Updated weights on worker 0-0, policy_version 163263 (0.00089) [2022-07-09 08:13:38,802][26022] Updated weights on worker 0-0, policy_version 163273 (0.00089) [2022-07-09 08:13:40,736][26022] Updated weights on worker 0-0, policy_version 163283 (0.00087) [2022-07-09 08:13:40,859][25689] Fps is (10 sec: 5493.1, 60 sec: 5673.7, 300 sec: 5693.3). Total num frames: 167201792. Throughput: 0: 5089.8. Samples: 167197786. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:40,860][25689] Avg episode reward: [(0, '-53.571')] [2022-07-09 08:13:42,195][26022] Updated weights on worker 0-0, policy_version 163293 (0.00094) [2022-07-09 08:13:44,284][26022] Updated weights on worker 0-0, policy_version 163303 (0.00096) [2022-07-09 08:13:45,901][25689] Fps is (10 sec: 5805.4, 60 sec: 5707.2, 300 sec: 5697.3). Total num frames: 167231488. Throughput: 0: 5921.7. Samples: 167231952. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:45,901][25689] Avg episode reward: [(0, '-53.212')] [2022-07-09 08:13:45,911][26022] Updated weights on worker 0-0, policy_version 163313 (0.00087) [2022-07-09 08:13:47,858][26022] Updated weights on worker 0-0, policy_version 163323 (0.00087) [2022-07-09 08:13:49,949][26022] Updated weights on worker 0-0, policy_version 163333 (0.00086) [2022-07-09 08:13:50,948][25689] Fps is (10 sec: 5785.6, 60 sec: 5680.2, 300 sec: 5700.1). Total num frames: 167260160. Throughput: 0: 5934.4. Samples: 167266076. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:50,948][25689] Avg episode reward: [(0, '-53.828')] [2022-07-09 08:13:51,464][26022] Updated weights on worker 0-0, policy_version 163343 (0.00093) [2022-07-09 08:13:53,395][26022] Updated weights on worker 0-0, policy_version 163353 (0.00084) [2022-07-09 08:13:55,138][26022] Updated weights on worker 0-0, policy_version 163363 (0.00087) [2022-07-09 08:13:56,008][25689] Fps is (10 sec: 5572.4, 60 sec: 5649.9, 300 sec: 5692.5). Total num frames: 167287808. Throughput: 0: 5067.4. Samples: 167282878. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:13:56,008][25689] Avg episode reward: [(0, '-53.809')] [2022-07-09 08:13:56,731][26022] Updated weights on worker 0-0, policy_version 163373 (0.00080) [2022-07-09 08:13:58,879][26022] Updated weights on worker 0-0, policy_version 163383 (0.00096) [2022-07-09 08:14:00,184][26022] Updated weights on worker 0-0, policy_version 163393 (0.00088) [2022-07-09 08:14:01,042][25689] Fps is (10 sec: 5681.0, 60 sec: 5672.9, 300 sec: 5706.1). Total num frames: 167317504. Throughput: 0: 5927.4. Samples: 167317416. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:01,043][25689] Avg episode reward: [(0, '-53.517')] [2022-07-09 08:14:02,687][26022] Updated weights on worker 0-0, policy_version 163403 (0.00095) [2022-07-09 08:14:04,302][26022] Updated weights on worker 0-0, policy_version 163413 (0.00088) [2022-07-09 08:14:06,099][25689] Fps is (10 sec: 5581.1, 60 sec: 5651.2, 300 sec: 5696.4). Total num frames: 167344128. Throughput: 0: 5823.4. Samples: 167349574. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:06,099][25689] Avg episode reward: [(0, '-53.302')] [2022-07-09 08:14:06,180][26022] Updated weights on worker 0-0, policy_version 163423 (0.00090) [2022-07-09 08:14:08,020][26022] Updated weights on worker 0-0, policy_version 163433 (0.00065) [2022-07-09 08:14:09,904][26022] Updated weights on worker 0-0, policy_version 163443 (0.00086) [2022-07-09 08:14:11,159][25689] Fps is (10 sec: 5465.5, 60 sec: 5650.1, 300 sec: 5692.6). Total num frames: 167372800. Throughput: 0: 4977.7. Samples: 167366682. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:11,161][25689] Avg episode reward: [(0, '-52.096')] [2022-07-09 08:14:11,384][26022] Updated weights on worker 0-0, policy_version 163453 (0.00083) [2022-07-09 08:14:13,528][26022] Updated weights on worker 0-0, policy_version 163463 (0.00632) [2022-07-09 08:14:15,150][26022] Updated weights on worker 0-0, policy_version 163473 (0.00088) [2022-07-09 08:14:16,261][25689] Fps is (10 sec: 5542.2, 60 sec: 5642.6, 300 sec: 5692.3). Total num frames: 167400448. Throughput: 0: 5806.6. Samples: 167400480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:16,262][25689] Avg episode reward: [(0, '-51.368')] [2022-07-09 08:14:17,136][26022] Updated weights on worker 0-0, policy_version 163483 (0.00097) [2022-07-09 08:14:18,614][26022] Updated weights on worker 0-0, policy_version 163493 (0.00086) [2022-07-09 08:14:20,591][26022] Updated weights on worker 0-0, policy_version 163503 (0.00085) [2022-07-09 08:14:21,291][25689] Fps is (10 sec: 5760.8, 60 sec: 5677.8, 300 sec: 5697.1). Total num frames: 167431168. Throughput: 0: 5796.9. Samples: 167434798. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:21,291][25689] Avg episode reward: [(0, '-51.728')] [2022-07-09 08:14:22,487][26022] Updated weights on worker 0-0, policy_version 163513 (0.00085) [2022-07-09 08:14:24,126][26022] Updated weights on worker 0-0, policy_version 163523 (0.00093) [2022-07-09 08:14:26,171][26022] Updated weights on worker 0-0, policy_version 163533 (0.00093) [2022-07-09 08:14:26,386][25689] Fps is (10 sec: 5663.8, 60 sec: 5619.3, 300 sec: 5692.7). Total num frames: 167457792. Throughput: 0: 5893.7. Samples: 167469138. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:26,386][25689] Avg episode reward: [(0, '-51.726')] [2022-07-09 08:14:27,877][26022] Updated weights on worker 0-0, policy_version 163543 (0.00092) [2022-07-09 08:14:29,720][26022] Updated weights on worker 0-0, policy_version 163553 (0.00082) [2022-07-09 08:14:31,448][26022] Updated weights on worker 0-0, policy_version 163563 (0.00082) [2022-07-09 08:14:31,450][25689] Fps is (10 sec: 5543.6, 60 sec: 5621.6, 300 sec: 5688.3). Total num frames: 167487488. Throughput: 0: 5883.8. Samples: 167486072. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:31,451][25689] Avg episode reward: [(0, '-51.003')] [2022-07-09 08:14:33,216][26022] Updated weights on worker 0-0, policy_version 163573 (0.00092) [2022-07-09 08:14:35,315][26022] Updated weights on worker 0-0, policy_version 163584 (0.00093) [2022-07-09 08:14:36,453][25689] Fps is (10 sec: 5899.4, 60 sec: 5677.0, 300 sec: 5699.0). Total num frames: 167517184. Throughput: 0: 5930.2. Samples: 167520222. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:36,454][25689] Avg episode reward: [(0, '-51.771')] [2022-07-09 08:14:36,913][26022] Updated weights on worker 0-0, policy_version 163594 (0.00084) [2022-07-09 08:14:38,456][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:14:38,470][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000163601_167527424.pth [2022-07-09 08:14:38,471][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000161599_165477376.pth [2022-07-09 08:14:38,858][26022] Updated weights on worker 0-0, policy_version 163604 (0.00093) [2022-07-09 08:14:40,598][26022] Updated weights on worker 0-0, policy_version 163614 (0.00089) [2022-07-09 08:14:41,473][25689] Fps is (10 sec: 5721.3, 60 sec: 5659.4, 300 sec: 5692.0). Total num frames: 167544832. Throughput: 0: 5922.7. Samples: 167554332. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:41,474][25689] Avg episode reward: [(0, '-52.126')] [2022-07-09 08:14:42,507][26022] Updated weights on worker 0-0, policy_version 163624 (0.00090) [2022-07-09 08:14:44,276][26022] Updated weights on worker 0-0, policy_version 163634 (0.00050) [2022-07-09 08:14:46,043][26022] Updated weights on worker 0-0, policy_version 163644 (0.00091) [2022-07-09 08:14:46,476][25689] Fps is (10 sec: 5619.3, 60 sec: 5646.2, 300 sec: 5690.6). Total num frames: 167573504. Throughput: 0: 5088.2. Samples: 167571360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:46,477][25689] Avg episode reward: [(0, '-52.573')] [2022-07-09 08:14:48,023][26022] Updated weights on worker 0-0, policy_version 163654 (0.00081) [2022-07-09 08:14:49,763][26022] Updated weights on worker 0-0, policy_version 163664 (0.00048) [2022-07-09 08:14:51,485][26022] Updated weights on worker 0-0, policy_version 163674 (0.00088) [2022-07-09 08:14:51,635][25689] Fps is (10 sec: 5744.0, 60 sec: 5652.6, 300 sec: 5688.2). Total num frames: 167603200. Throughput: 0: 5915.7. Samples: 167605476. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:51,635][25689] Avg episode reward: [(0, '-52.919')] [2022-07-09 08:14:53,387][26022] Updated weights on worker 0-0, policy_version 163684 (0.00084) [2022-07-09 08:14:54,879][26022] Updated weights on worker 0-0, policy_version 163694 (0.00093) [2022-07-09 08:14:56,651][25689] Fps is (10 sec: 5534.7, 60 sec: 5639.8, 300 sec: 5685.7). Total num frames: 167629824. Throughput: 0: 5902.3. Samples: 167639438. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:14:56,652][25689] Avg episode reward: [(0, '-53.786')] [2022-07-09 08:14:56,955][26022] Updated weights on worker 0-0, policy_version 163704 (0.00083) [2022-07-09 08:14:58,615][26022] Updated weights on worker 0-0, policy_version 163714 (0.00087) [2022-07-09 08:15:00,491][26022] Updated weights on worker 0-0, policy_version 163724 (0.00097) [2022-07-09 08:15:01,659][25689] Fps is (10 sec: 5414.3, 60 sec: 5608.5, 300 sec: 5689.0). Total num frames: 167657472. Throughput: 0: 5062.2. Samples: 167656514. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:15:01,659][25689] Avg episode reward: [(0, '-52.836')] [2022-07-09 08:15:02,767][26022] Updated weights on worker 0-0, policy_version 163734 (0.00085) [2022-07-09 08:15:04,427][26022] Updated weights on worker 0-0, policy_version 163744 (0.00086) [2022-07-09 08:15:06,354][26022] Updated weights on worker 0-0, policy_version 163754 (0.00080) [2022-07-09 08:15:06,668][25689] Fps is (10 sec: 5520.4, 60 sec: 5629.8, 300 sec: 5683.2). Total num frames: 167685120. Throughput: 0: 5807.7. Samples: 167688632. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:15:06,669][25689] Avg episode reward: [(0, '-53.132')] [2022-07-09 08:15:08,127][26022] Updated weights on worker 0-0, policy_version 163764 (0.00083) [2022-07-09 08:15:09,947][26022] Updated weights on worker 0-0, policy_version 163774 (0.00089) [2022-07-09 08:15:11,583][26022] Updated weights on worker 0-0, policy_version 163784 (0.00087) [2022-07-09 08:15:11,718][25689] Fps is (10 sec: 5700.4, 60 sec: 5647.6, 300 sec: 5689.6). Total num frames: 167714816. Throughput: 0: 5851.3. Samples: 167722990. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-09 08:15:11,719][25689] Avg episode reward: [(0, '-52.793')] [2022-07-09 08:15:13,516][26022] Updated weights on worker 0-0, policy_version 163794 (0.00086) [2022-07-09 08:15:15,280][26022] Updated weights on worker 0-0, policy_version 163804 (0.00090) [2022-07-09 08:15:16,740][25689] Fps is (10 sec: 5693.5, 60 sec: 5655.1, 300 sec: 5679.7). Total num frames: 167742464. Throughput: 0: 5007.3. Samples: 167740028. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:16,741][25689] Avg episode reward: [(0, '-53.008')] [2022-07-09 08:15:17,108][26022] Updated weights on worker 0-0, policy_version 163814 (0.00084) [2022-07-09 08:15:19,141][26022] Updated weights on worker 0-0, policy_version 163824 (0.00085) [2022-07-09 08:15:20,450][26022] Updated weights on worker 0-0, policy_version 163834 (0.00088) [2022-07-09 08:15:21,757][25689] Fps is (10 sec: 5610.2, 60 sec: 5622.4, 300 sec: 5683.0). Total num frames: 167771136. Throughput: 0: 5853.7. Samples: 167774164. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:21,758][25689] Avg episode reward: [(0, '-52.972')] [2022-07-09 08:15:22,674][26022] Updated weights on worker 0-0, policy_version 163844 (0.00095) [2022-07-09 08:15:24,011][26022] Updated weights on worker 0-0, policy_version 163854 (0.00090) [2022-07-09 08:15:26,209][26022] Updated weights on worker 0-0, policy_version 163864 (0.00083) [2022-07-09 08:15:26,783][25689] Fps is (10 sec: 5812.0, 60 sec: 5679.8, 300 sec: 5687.5). Total num frames: 167800832. Throughput: 0: 5961.8. Samples: 167808550. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:26,784][25689] Avg episode reward: [(0, '-52.077')] [2022-07-09 08:15:27,664][26022] Updated weights on worker 0-0, policy_version 163874 (0.00090) [2022-07-09 08:15:29,488][26022] Updated weights on worker 0-0, policy_version 163884 (0.00085) [2022-07-09 08:15:31,519][26022] Updated weights on worker 0-0, policy_version 163894 (0.00090) [2022-07-09 08:15:31,891][25689] Fps is (10 sec: 5557.8, 60 sec: 5624.9, 300 sec: 5678.7). Total num frames: 167827456. Throughput: 0: 5097.1. Samples: 167825810. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:31,891][25689] Avg episode reward: [(0, '-52.127')] [2022-07-09 08:15:32,958][26022] Updated weights on worker 0-0, policy_version 163904 (0.00088) [2022-07-09 08:15:35,075][26022] Updated weights on worker 0-0, policy_version 163914 (0.00089) [2022-07-09 08:15:36,628][26022] Updated weights on worker 0-0, policy_version 163924 (0.00090) [2022-07-09 08:15:36,896][25689] Fps is (10 sec: 5771.3, 60 sec: 5658.5, 300 sec: 5685.5). Total num frames: 167859200. Throughput: 0: 5972.8. Samples: 167860416. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:36,897][25689] Avg episode reward: [(0, '-52.581')] [2022-07-09 08:15:38,574][26022] Updated weights on worker 0-0, policy_version 163934 (0.00090) [2022-07-09 08:15:40,453][26022] Updated weights on worker 0-0, policy_version 163944 (0.00085) [2022-07-09 08:15:41,917][25689] Fps is (10 sec: 5923.3, 60 sec: 5658.4, 300 sec: 5685.8). Total num frames: 167886848. Throughput: 0: 5975.8. Samples: 167894638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:41,918][25689] Avg episode reward: [(0, '-52.111')] [2022-07-09 08:15:42,119][26022] Updated weights on worker 0-0, policy_version 163954 (0.00088) [2022-07-09 08:15:44,001][26022] Updated weights on worker 0-0, policy_version 163964 (0.00083) [2022-07-09 08:15:45,675][26022] Updated weights on worker 0-0, policy_version 163974 (0.00083) [2022-07-09 08:15:46,931][25689] Fps is (10 sec: 5612.6, 60 sec: 5657.4, 300 sec: 5679.9). Total num frames: 167915520. Throughput: 0: 5132.0. Samples: 167911950. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:46,931][25689] Avg episode reward: [(0, '-52.429')] [2022-07-09 08:15:47,467][26022] Updated weights on worker 0-0, policy_version 163984 (0.00078) [2022-07-09 08:15:49,449][26022] Updated weights on worker 0-0, policy_version 163994 (0.00082) [2022-07-09 08:15:50,822][26022] Updated weights on worker 0-0, policy_version 164004 (0.00088) [2022-07-09 08:15:51,999][25689] Fps is (10 sec: 5688.0, 60 sec: 5648.9, 300 sec: 5679.3). Total num frames: 167944192. Throughput: 0: 5991.1. Samples: 167946282. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:51,999][25689] Avg episode reward: [(0, '-52.474')] [2022-07-09 08:15:52,947][26022] Updated weights on worker 0-0, policy_version 164014 (0.00088) [2022-07-09 08:15:54,543][26022] Updated weights on worker 0-0, policy_version 164024 (0.00089) [2022-07-09 08:15:56,400][26022] Updated weights on worker 0-0, policy_version 164034 (0.00092) [2022-07-09 08:15:57,004][25689] Fps is (10 sec: 5794.1, 60 sec: 5700.9, 300 sec: 5683.8). Total num frames: 167973888. Throughput: 0: 5982.2. Samples: 167980708. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:15:57,005][25689] Avg episode reward: [(0, '-52.407')] [2022-07-09 08:15:58,346][26022] Updated weights on worker 0-0, policy_version 164044 (0.00090) [2022-07-09 08:15:59,932][26022] Updated weights on worker 0-0, policy_version 164054 (0.00086) [2022-07-09 08:16:01,835][26022] Updated weights on worker 0-0, policy_version 164064 (0.00093) [2022-07-09 08:16:02,021][25689] Fps is (10 sec: 5824.0, 60 sec: 5716.9, 300 sec: 5687.1). Total num frames: 168002560. Throughput: 0: 5138.5. Samples: 167997940. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:02,022][25689] Avg episode reward: [(0, '-51.980')] [2022-07-09 08:16:04,102][26022] Updated weights on worker 0-0, policy_version 164074 (0.00084) [2022-07-09 08:16:05,794][26022] Updated weights on worker 0-0, policy_version 164084 (0.00085) [2022-07-09 08:16:07,063][25689] Fps is (10 sec: 5497.6, 60 sec: 5696.9, 300 sec: 5680.5). Total num frames: 168029184. Throughput: 0: 5874.8. Samples: 168030220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:07,063][25689] Avg episode reward: [(0, '-51.624')] [2022-07-09 08:16:07,670][26022] Updated weights on worker 0-0, policy_version 164094 (0.00087) [2022-07-09 08:16:09,369][26022] Updated weights on worker 0-0, policy_version 164104 (0.00085) [2022-07-09 08:16:11,214][26022] Updated weights on worker 0-0, policy_version 164114 (0.00087) [2022-07-09 08:16:12,115][25689] Fps is (10 sec: 5376.5, 60 sec: 5662.8, 300 sec: 5676.3). Total num frames: 168056832. Throughput: 0: 5872.4. Samples: 168064414. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:12,117][25689] Avg episode reward: [(0, '-51.270')] [2022-07-09 08:16:13,039][26022] Updated weights on worker 0-0, policy_version 164124 (0.00620) [2022-07-09 08:16:14,748][26022] Updated weights on worker 0-0, policy_version 164134 (0.00088) [2022-07-09 08:16:16,561][26022] Updated weights on worker 0-0, policy_version 164144 (0.00095) [2022-07-09 08:16:17,127][25689] Fps is (10 sec: 5595.8, 60 sec: 5680.7, 300 sec: 5676.2). Total num frames: 168085504. Throughput: 0: 5011.4. Samples: 168081552. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:17,128][25689] Avg episode reward: [(0, '-50.704')] [2022-07-09 08:16:18,531][26022] Updated weights on worker 0-0, policy_version 164154 (0.00093) [2022-07-09 08:16:20,336][26022] Updated weights on worker 0-0, policy_version 164164 (0.00092) [2022-07-09 08:16:22,003][26022] Updated weights on worker 0-0, policy_version 164174 (0.00095) [2022-07-09 08:16:22,129][25689] Fps is (10 sec: 5828.7, 60 sec: 5699.1, 300 sec: 5683.2). Total num frames: 168115200. Throughput: 0: 5846.2. Samples: 168115496. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:22,131][25689] Avg episode reward: [(0, '-51.289')] [2022-07-09 08:16:23,916][26022] Updated weights on worker 0-0, policy_version 164184 (0.00091) [2022-07-09 08:16:25,598][26022] Updated weights on worker 0-0, policy_version 164194 (0.00093) [2022-07-09 08:16:27,146][25689] Fps is (10 sec: 5621.3, 60 sec: 5649.0, 300 sec: 5670.2). Total num frames: 168141824. Throughput: 0: 5959.5. Samples: 168149910. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:27,148][25689] Avg episode reward: [(0, '-52.108')] [2022-07-09 08:16:27,448][26022] Updated weights on worker 0-0, policy_version 164204 (0.00099) [2022-07-09 08:16:29,108][26022] Updated weights on worker 0-0, policy_version 164214 (0.00090) [2022-07-09 08:16:31,124][26022] Updated weights on worker 0-0, policy_version 164224 (0.00047) [2022-07-09 08:16:32,210][25689] Fps is (10 sec: 5789.9, 60 sec: 5738.0, 300 sec: 5679.4). Total num frames: 168173568. Throughput: 0: 5107.9. Samples: 168167056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:32,211][25689] Avg episode reward: [(0, '-52.145')] [2022-07-09 08:16:32,670][26022] Updated weights on worker 0-0, policy_version 164234 (0.00086) [2022-07-09 08:16:34,688][26022] Updated weights on worker 0-0, policy_version 164244 (0.00091) [2022-07-09 08:16:36,399][26022] Updated weights on worker 0-0, policy_version 164254 (0.00095) [2022-07-09 08:16:37,218][25689] Fps is (10 sec: 5896.8, 60 sec: 5669.8, 300 sec: 5676.0). Total num frames: 168201216. Throughput: 0: 5987.0. Samples: 168201838. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:37,219][25689] Avg episode reward: [(0, '-52.412')] [2022-07-09 08:16:38,001][26022] Updated weights on worker 0-0, policy_version 164264 (0.00061) [2022-07-09 08:16:38,572][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:16:38,592][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000164267_168209408.pth [2022-07-09 08:16:38,592][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000162267_166161408.pth [2022-07-09 08:16:39,732][26022] Updated weights on worker 0-0, policy_version 164274 (0.00085) [2022-07-09 08:16:41,494][26022] Updated weights on worker 0-0, policy_version 164284 (0.00093) [2022-07-09 08:16:42,255][25689] Fps is (10 sec: 5505.1, 60 sec: 5668.4, 300 sec: 5668.6). Total num frames: 168228864. Throughput: 0: 6011.4. Samples: 168236478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:42,255][25689] Avg episode reward: [(0, '-52.326')] [2022-07-09 08:16:43,293][26022] Updated weights on worker 0-0, policy_version 164294 (0.00086) [2022-07-09 08:16:45,238][26022] Updated weights on worker 0-0, policy_version 164304 (0.00117) [2022-07-09 08:16:46,971][26022] Updated weights on worker 0-0, policy_version 164314 (0.00085) [2022-07-09 08:16:47,271][25689] Fps is (10 sec: 5806.1, 60 sec: 5702.0, 300 sec: 5677.6). Total num frames: 168259584. Throughput: 0: 5178.3. Samples: 168254122. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:47,272][25689] Avg episode reward: [(0, '-52.234')] [2022-07-09 08:16:48,713][26022] Updated weights on worker 0-0, policy_version 164324 (0.00082) [2022-07-09 08:16:50,398][26022] Updated weights on worker 0-0, policy_version 164334 (0.00089) [2022-07-09 08:16:52,126][26022] Updated weights on worker 0-0, policy_version 164344 (0.00087) [2022-07-09 08:16:52,400][25689] Fps is (10 sec: 5955.1, 60 sec: 5713.3, 300 sec: 5679.3). Total num frames: 168289280. Throughput: 0: 6011.3. Samples: 168288422. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:52,400][25689] Avg episode reward: [(0, '-51.136')] [2022-07-09 08:16:54,072][26022] Updated weights on worker 0-0, policy_version 164354 (0.00087) [2022-07-09 08:16:55,739][26022] Updated weights on worker 0-0, policy_version 164364 (0.00094) [2022-07-09 08:16:57,427][25689] Fps is (10 sec: 5646.6, 60 sec: 5677.4, 300 sec: 5669.4). Total num frames: 168316928. Throughput: 0: 6003.5. Samples: 168323158. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:16:57,427][25689] Avg episode reward: [(0, '-50.555')] [2022-07-09 08:16:57,532][26022] Updated weights on worker 0-0, policy_version 164374 (0.00089) [2022-07-09 08:16:59,334][26022] Updated weights on worker 0-0, policy_version 164384 (0.00090) [2022-07-09 08:17:01,107][26022] Updated weights on worker 0-0, policy_version 164394 (0.00055) [2022-07-09 08:17:02,438][25689] Fps is (10 sec: 5508.6, 60 sec: 5660.9, 300 sec: 5681.2). Total num frames: 168344576. Throughput: 0: 5150.1. Samples: 168340424. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:17:02,439][25689] Avg episode reward: [(0, '-49.757')] [2022-07-09 08:17:03,323][26022] Updated weights on worker 0-0, policy_version 164404 (0.00093) [2022-07-09 08:17:05,027][26022] Updated weights on worker 0-0, policy_version 164414 (0.00085) [2022-07-09 08:17:06,872][26022] Updated weights on worker 0-0, policy_version 164424 (0.00093) [2022-07-09 08:17:07,470][25689] Fps is (10 sec: 5607.5, 60 sec: 5695.7, 300 sec: 5675.0). Total num frames: 168373248. Throughput: 0: 5881.2. Samples: 168372918. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:17:07,471][25689] Avg episode reward: [(0, '-49.617')] [2022-07-09 08:17:08,587][26022] Updated weights on worker 0-0, policy_version 164434 (0.00087) [2022-07-09 08:17:10,406][26022] Updated weights on worker 0-0, policy_version 164444 (0.00054) [2022-07-09 08:17:12,305][26022] Updated weights on worker 0-0, policy_version 164454 (0.00083) [2022-07-09 08:17:12,617][25689] Fps is (10 sec: 5734.0, 60 sec: 5720.7, 300 sec: 5680.0). Total num frames: 168402944. Throughput: 0: 5888.6. Samples: 168407474. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:17:12,618][25689] Avg episode reward: [(0, '-49.845')] [2022-07-09 08:17:13,927][26022] Updated weights on worker 0-0, policy_version 164464 (0.00089) [2022-07-09 08:17:15,763][26022] Updated weights on worker 0-0, policy_version 164474 (0.00084) [2022-07-09 08:17:17,512][26022] Updated weights on worker 0-0, policy_version 164484 (0.00092) [2022-07-09 08:17:17,670][25689] Fps is (10 sec: 5823.1, 60 sec: 5733.8, 300 sec: 5682.7). Total num frames: 168432640. Throughput: 0: 5022.9. Samples: 168424836. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:17:17,670][25689] Avg episode reward: [(0, '-50.969')] [2022-07-09 08:17:19,402][26022] Updated weights on worker 0-0, policy_version 164494 (0.00094) [2022-07-09 08:17:20,941][26022] Updated weights on worker 0-0, policy_version 164504 (0.00082) [2022-07-09 08:17:22,680][25689] Fps is (10 sec: 5698.9, 60 sec: 5699.2, 300 sec: 5676.3). Total num frames: 168460288. Throughput: 0: 5848.0. Samples: 168458796. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:17:22,680][25689] Avg episode reward: [(0, '-51.709')] [2022-07-09 08:17:22,970][26022] Updated weights on worker 0-0, policy_version 164514 (0.00094) [2022-07-09 08:17:24,701][26022] Updated weights on worker 0-0, policy_version 164524 (0.00090) [2022-07-09 08:17:26,715][26022] Updated weights on worker 0-0, policy_version 164534 (0.00083) [2022-07-09 08:17:27,688][25689] Fps is (10 sec: 5621.7, 60 sec: 5733.8, 300 sec: 5677.7). Total num frames: 168488960. Throughput: 0: 5933.8. Samples: 168492886. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:27,689][25689] Avg episode reward: [(0, '-51.639')] [2022-07-09 08:17:28,517][26022] Updated weights on worker 0-0, policy_version 164544 (0.00083) [2022-07-09 08:17:30,183][26022] Updated weights on worker 0-0, policy_version 164554 (0.00088) [2022-07-09 08:17:31,932][26022] Updated weights on worker 0-0, policy_version 164564 (0.00094) [2022-07-09 08:17:32,750][25689] Fps is (10 sec: 5592.4, 60 sec: 5666.3, 300 sec: 5670.0). Total num frames: 168516608. Throughput: 0: 5919.8. Samples: 168526658. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:32,751][25689] Avg episode reward: [(0, '-52.196')] [2022-07-09 08:17:33,884][26022] Updated weights on worker 0-0, policy_version 164574 (0.00087) [2022-07-09 08:17:35,550][26022] Updated weights on worker 0-0, policy_version 164584 (0.00096) [2022-07-09 08:17:37,446][26022] Updated weights on worker 0-0, policy_version 164594 (0.00086) [2022-07-09 08:17:37,831][25689] Fps is (10 sec: 5653.7, 60 sec: 5693.4, 300 sec: 5673.7). Total num frames: 168546304. Throughput: 0: 5901.3. Samples: 168543812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:37,832][25689] Avg episode reward: [(0, '-52.175')] [2022-07-09 08:17:39,312][26022] Updated weights on worker 0-0, policy_version 164604 (0.00088) [2022-07-09 08:17:40,866][26022] Updated weights on worker 0-0, policy_version 164614 (0.00084) [2022-07-09 08:17:42,836][25689] Fps is (10 sec: 5685.5, 60 sec: 5696.2, 300 sec: 5674.3). Total num frames: 168573952. Throughput: 0: 5929.9. Samples: 168578324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:42,837][25689] Avg episode reward: [(0, '-52.623')] [2022-07-09 08:17:43,042][26022] Updated weights on worker 0-0, policy_version 164624 (0.00091) [2022-07-09 08:17:44,415][26022] Updated weights on worker 0-0, policy_version 164634 (0.00092) [2022-07-09 08:17:46,552][26022] Updated weights on worker 0-0, policy_version 164644 (0.00093) [2022-07-09 08:17:47,893][25689] Fps is (10 sec: 5698.7, 60 sec: 5675.6, 300 sec: 5672.1). Total num frames: 168603648. Throughput: 0: 5897.4. Samples: 168612044. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:47,894][25689] Avg episode reward: [(0, '-53.188')] [2022-07-09 08:17:48,267][26022] Updated weights on worker 0-0, policy_version 164654 (0.00066) [2022-07-09 08:17:50,050][26022] Updated weights on worker 0-0, policy_version 164664 (0.00090) [2022-07-09 08:17:52,211][26022] Updated weights on worker 0-0, policy_version 164674 (0.00087) [2022-07-09 08:17:52,996][25689] Fps is (10 sec: 5745.2, 60 sec: 5661.1, 300 sec: 5668.6). Total num frames: 168632320. Throughput: 0: 5058.1. Samples: 168629062. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:52,997][25689] Avg episode reward: [(0, '-53.235')] [2022-07-09 08:17:53,778][26022] Updated weights on worker 0-0, policy_version 164684 (0.00077) [2022-07-09 08:17:55,674][26022] Updated weights on worker 0-0, policy_version 164694 (0.00091) [2022-07-09 08:17:57,273][26022] Updated weights on worker 0-0, policy_version 164704 (0.00089) [2022-07-09 08:17:58,084][25689] Fps is (10 sec: 5627.2, 60 sec: 5672.3, 300 sec: 5668.8). Total num frames: 168660992. Throughput: 0: 5887.6. Samples: 168663054. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:17:58,085][25689] Avg episode reward: [(0, '-53.736')] [2022-07-09 08:17:59,127][26022] Updated weights on worker 0-0, policy_version 164714 (0.00085) [2022-07-09 08:18:00,902][26022] Updated weights on worker 0-0, policy_version 164724 (0.00974) [2022-07-09 08:18:03,006][26022] Updated weights on worker 0-0, policy_version 164734 (0.00086) [2022-07-09 08:18:03,103][25689] Fps is (10 sec: 5471.1, 60 sec: 5654.7, 300 sec: 5665.1). Total num frames: 168687616. Throughput: 0: 5775.8. Samples: 168695378. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:03,103][25689] Avg episode reward: [(0, '-54.046')] [2022-07-09 08:18:04,787][26022] Updated weights on worker 0-0, policy_version 164744 (0.00087) [2022-07-09 08:18:06,916][26022] Updated weights on worker 0-0, policy_version 164754 (0.00094) [2022-07-09 08:18:08,157][25689] Fps is (10 sec: 5489.2, 60 sec: 5652.6, 300 sec: 5665.0). Total num frames: 168716288. Throughput: 0: 4949.5. Samples: 168712342. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:08,159][25689] Avg episode reward: [(0, '-53.607')] [2022-07-09 08:18:08,445][26022] Updated weights on worker 0-0, policy_version 164764 (0.00092) [2022-07-09 08:18:10,476][26022] Updated weights on worker 0-0, policy_version 164774 (0.00094) [2022-07-09 08:18:11,845][26022] Updated weights on worker 0-0, policy_version 164784 (0.00092) [2022-07-09 08:18:13,255][25689] Fps is (10 sec: 5648.4, 60 sec: 5640.3, 300 sec: 5667.0). Total num frames: 168744960. Throughput: 0: 5817.0. Samples: 168746910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:13,256][25689] Avg episode reward: [(0, '-53.012')] [2022-07-09 08:18:14,007][26022] Updated weights on worker 0-0, policy_version 164794 (0.00084) [2022-07-09 08:18:15,527][26022] Updated weights on worker 0-0, policy_version 164804 (0.00094) [2022-07-09 08:18:17,415][26022] Updated weights on worker 0-0, policy_version 164814 (0.00084) [2022-07-09 08:18:18,257][25689] Fps is (10 sec: 5779.1, 60 sec: 5645.0, 300 sec: 5671.2). Total num frames: 168774656. Throughput: 0: 5878.0. Samples: 168781634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:18,258][25689] Avg episode reward: [(0, '-53.049')] [2022-07-09 08:18:19,167][26022] Updated weights on worker 0-0, policy_version 164824 (0.00083) [2022-07-09 08:18:20,886][26022] Updated weights on worker 0-0, policy_version 164834 (0.00090) [2022-07-09 08:18:22,663][26022] Updated weights on worker 0-0, policy_version 164844 (0.00083) [2022-07-09 08:18:23,278][25689] Fps is (10 sec: 5721.2, 60 sec: 5644.0, 300 sec: 5664.1). Total num frames: 168802304. Throughput: 0: 5137.0. Samples: 168799020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:23,279][25689] Avg episode reward: [(0, '-53.104')] [2022-07-09 08:18:24,454][26022] Updated weights on worker 0-0, policy_version 164854 (0.00092) [2022-07-09 08:18:26,252][26022] Updated weights on worker 0-0, policy_version 164864 (0.00080) [2022-07-09 08:18:28,081][26022] Updated weights on worker 0-0, policy_version 164874 (0.00086) [2022-07-09 08:18:28,282][25689] Fps is (10 sec: 5720.4, 60 sec: 5661.3, 300 sec: 5665.8). Total num frames: 168832000. Throughput: 0: 6018.5. Samples: 168833460. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:28,282][25689] Avg episode reward: [(0, '-53.256')] [2022-07-09 08:18:29,867][26022] Updated weights on worker 0-0, policy_version 164884 (0.00095) [2022-07-09 08:18:31,537][26022] Updated weights on worker 0-0, policy_version 164894 (0.00084) [2022-07-09 08:18:33,338][25689] Fps is (10 sec: 5700.0, 60 sec: 5661.8, 300 sec: 5669.2). Total num frames: 168859648. Throughput: 0: 6013.0. Samples: 168867672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:33,339][25689] Avg episode reward: [(0, '-51.802')] [2022-07-09 08:18:33,569][26022] Updated weights on worker 0-0, policy_version 164904 (0.00091) [2022-07-09 08:18:35,437][26022] Updated weights on worker 0-0, policy_version 164914 (0.00576) [2022-07-09 08:18:37,014][26022] Updated weights on worker 0-0, policy_version 164924 (0.00084) [2022-07-09 08:18:38,341][25689] Fps is (10 sec: 5700.7, 60 sec: 5669.1, 300 sec: 5672.8). Total num frames: 168889344. Throughput: 0: 5135.1. Samples: 168884766. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:38,341][25689] Avg episode reward: [(0, '-51.884')] [2022-07-09 08:18:38,711][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:18:38,731][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000164933_168891392.pth [2022-07-09 08:18:38,732][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000162937_166847488.pth [2022-07-09 08:18:39,038][26022] Updated weights on worker 0-0, policy_version 164934 (0.00083) [2022-07-09 08:18:40,461][26022] Updated weights on worker 0-0, policy_version 164944 (0.00088) [2022-07-09 08:18:42,435][26022] Updated weights on worker 0-0, policy_version 164954 (0.00088) [2022-07-09 08:18:43,345][25689] Fps is (10 sec: 5935.1, 60 sec: 5703.1, 300 sec: 5673.5). Total num frames: 168919040. Throughput: 0: 5992.7. Samples: 168919276. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:43,346][25689] Avg episode reward: [(0, '-52.425')] [2022-07-09 08:18:44,061][26022] Updated weights on worker 0-0, policy_version 164964 (0.00095) [2022-07-09 08:18:46,025][26022] Updated weights on worker 0-0, policy_version 164974 (0.00087) [2022-07-09 08:18:47,844][26022] Updated weights on worker 0-0, policy_version 164984 (0.00095) [2022-07-09 08:18:48,349][25689] Fps is (10 sec: 5627.6, 60 sec: 5657.3, 300 sec: 5667.4). Total num frames: 168945664. Throughput: 0: 5998.3. Samples: 168953828. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:48,350][25689] Avg episode reward: [(0, '-51.270')] [2022-07-09 08:18:49,430][26022] Updated weights on worker 0-0, policy_version 164994 (0.00097) [2022-07-09 08:18:51,536][26022] Updated weights on worker 0-0, policy_version 165004 (0.00083) [2022-07-09 08:18:53,256][26022] Updated weights on worker 0-0, policy_version 165014 (0.00087) [2022-07-09 08:18:53,397][25689] Fps is (10 sec: 5501.1, 60 sec: 5662.4, 300 sec: 5671.1). Total num frames: 168974336. Throughput: 0: 5140.0. Samples: 168970774. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:53,398][25689] Avg episode reward: [(0, '-51.522')] [2022-07-09 08:18:54,824][26022] Updated weights on worker 0-0, policy_version 165024 (0.00093) [2022-07-09 08:18:56,813][26022] Updated weights on worker 0-0, policy_version 165034 (0.00087) [2022-07-09 08:18:58,252][26022] Updated weights on worker 0-0, policy_version 165044 (0.00080) [2022-07-09 08:18:58,415][25689] Fps is (10 sec: 5900.4, 60 sec: 5703.0, 300 sec: 5674.9). Total num frames: 169005056. Throughput: 0: 5996.6. Samples: 169005140. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:18:58,419][25689] Avg episode reward: [(0, '-51.340')] [2022-07-09 08:19:00,313][26022] Updated weights on worker 0-0, policy_version 165054 (0.00387) [2022-07-09 08:19:02,402][26022] Updated weights on worker 0-0, policy_version 165064 (0.00089) [2022-07-09 08:19:03,435][25689] Fps is (10 sec: 5508.9, 60 sec: 5668.9, 300 sec: 5668.7). Total num frames: 169029632. Throughput: 0: 5878.4. Samples: 169037370. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:03,437][25689] Avg episode reward: [(0, '-52.200')] [2022-07-09 08:19:04,076][26022] Updated weights on worker 0-0, policy_version 165074 (0.00091) [2022-07-09 08:19:06,251][26022] Updated weights on worker 0-0, policy_version 165084 (0.00097) [2022-07-09 08:19:07,835][26022] Updated weights on worker 0-0, policy_version 165094 (0.00056) [2022-07-09 08:19:08,452][25689] Fps is (10 sec: 5305.0, 60 sec: 5672.4, 300 sec: 5669.5). Total num frames: 169058304. Throughput: 0: 5007.8. Samples: 169054500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:08,452][25689] Avg episode reward: [(0, '-51.898')] [2022-07-09 08:19:09,725][26022] Updated weights on worker 0-0, policy_version 165104 (0.00094) [2022-07-09 08:19:11,596][26022] Updated weights on worker 0-0, policy_version 165114 (0.00084) [2022-07-09 08:19:12,980][26022] Updated weights on worker 0-0, policy_version 165124 (0.00082) [2022-07-09 08:19:13,544][25689] Fps is (10 sec: 5976.3, 60 sec: 5723.9, 300 sec: 5683.5). Total num frames: 169090048. Throughput: 0: 5874.6. Samples: 169089128. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:13,544][25689] Avg episode reward: [(0, '-51.026')] [2022-07-09 08:19:15,058][26022] Updated weights on worker 0-0, policy_version 165134 (0.00801) [2022-07-09 08:19:16,653][26022] Updated weights on worker 0-0, policy_version 165144 (0.00093) [2022-07-09 08:19:18,557][25689] Fps is (10 sec: 5877.2, 60 sec: 5688.9, 300 sec: 5673.5). Total num frames: 169117696. Throughput: 0: 5863.8. Samples: 169123252. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:18,558][25689] Avg episode reward: [(0, '-51.117')] [2022-07-09 08:19:18,562][26022] Updated weights on worker 0-0, policy_version 165154 (0.00093) [2022-07-09 08:19:20,472][26022] Updated weights on worker 0-0, policy_version 165164 (0.00093) [2022-07-09 08:19:22,217][26022] Updated weights on worker 0-0, policy_version 165174 (0.00085) [2022-07-09 08:19:23,614][25689] Fps is (10 sec: 5491.1, 60 sec: 5685.5, 300 sec: 5677.6). Total num frames: 169145344. Throughput: 0: 5094.7. Samples: 169140178. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:23,615][25689] Avg episode reward: [(0, '-51.555')] [2022-07-09 08:19:23,901][26022] Updated weights on worker 0-0, policy_version 165184 (0.00095) [2022-07-09 08:19:25,961][26022] Updated weights on worker 0-0, policy_version 165194 (0.00092) [2022-07-09 08:19:27,428][26022] Updated weights on worker 0-0, policy_version 165204 (0.00093) [2022-07-09 08:19:28,636][25689] Fps is (10 sec: 5588.0, 60 sec: 5666.8, 300 sec: 5675.0). Total num frames: 169174016. Throughput: 0: 5939.4. Samples: 169174382. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:28,637][25689] Avg episode reward: [(0, '-51.002')] [2022-07-09 08:19:29,649][26022] Updated weights on worker 0-0, policy_version 165214 (0.00086) [2022-07-09 08:19:31,188][26022] Updated weights on worker 0-0, policy_version 165224 (0.00086) [2022-07-09 08:19:32,992][26022] Updated weights on worker 0-0, policy_version 165234 (0.00095) [2022-07-09 08:19:33,748][25689] Fps is (10 sec: 5759.9, 60 sec: 5695.6, 300 sec: 5672.9). Total num frames: 169203712. Throughput: 0: 5916.0. Samples: 169208652. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 08:19:33,748][25689] Avg episode reward: [(0, '-51.142')] [2022-07-09 08:19:34,833][26022] Updated weights on worker 0-0, policy_version 165244 (0.00092) [2022-07-09 08:19:36,536][26022] Updated weights on worker 0-0, policy_version 165254 (0.00092) [2022-07-09 08:19:38,372][26022] Updated weights on worker 0-0, policy_version 165264 (0.00090) [2022-07-09 08:19:38,824][25689] Fps is (10 sec: 5729.4, 60 sec: 5671.7, 300 sec: 5675.3). Total num frames: 169232384. Throughput: 0: 5057.5. Samples: 169225748. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:19:38,824][25689] Avg episode reward: [(0, '-52.365')] [2022-07-09 08:19:40,194][26022] Updated weights on worker 0-0, policy_version 165274 (0.00093) [2022-07-09 08:19:41,962][26022] Updated weights on worker 0-0, policy_version 165284 (0.00084) [2022-07-09 08:19:43,841][25689] Fps is (10 sec: 5478.4, 60 sec: 5619.7, 300 sec: 5668.1). Total num frames: 169259008. Throughput: 0: 5915.9. Samples: 169259838. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:19:43,842][25689] Avg episode reward: [(0, '-52.021')] [2022-07-09 08:19:43,983][26022] Updated weights on worker 0-0, policy_version 165294 (0.00088) [2022-07-09 08:19:45,660][26022] Updated weights on worker 0-0, policy_version 165304 (0.00092) [2022-07-09 08:19:47,329][26022] Updated weights on worker 0-0, policy_version 165314 (0.00089) [2022-07-09 08:19:48,851][25689] Fps is (10 sec: 5616.6, 60 sec: 5669.9, 300 sec: 5671.0). Total num frames: 169288704. Throughput: 0: 5915.9. Samples: 169293972. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:19:48,852][25689] Avg episode reward: [(0, '-51.814')] [2022-07-09 08:19:49,610][26022] Updated weights on worker 0-0, policy_version 165324 (0.00088) [2022-07-09 08:19:50,995][26022] Updated weights on worker 0-0, policy_version 165334 (0.00091) [2022-07-09 08:19:53,001][26022] Updated weights on worker 0-0, policy_version 165344 (0.00080) [2022-07-09 08:19:53,927][25689] Fps is (10 sec: 5990.3, 60 sec: 5701.1, 300 sec: 5683.6). Total num frames: 169319424. Throughput: 0: 5070.5. Samples: 169310970. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:19:53,927][25689] Avg episode reward: [(0, '-52.125')] [2022-07-09 08:19:54,602][26022] Updated weights on worker 0-0, policy_version 165354 (0.00080) [2022-07-09 08:19:56,480][26022] Updated weights on worker 0-0, policy_version 165364 (0.00089) [2022-07-09 08:19:58,229][26022] Updated weights on worker 0-0, policy_version 165374 (0.00088) [2022-07-09 08:19:58,942][25689] Fps is (10 sec: 5682.9, 60 sec: 5633.7, 300 sec: 5680.0). Total num frames: 169346048. Throughput: 0: 5954.8. Samples: 169345548. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:19:58,942][25689] Avg episode reward: [(0, '-51.997')] [2022-07-09 08:19:59,824][26022] Updated weights on worker 0-0, policy_version 165384 (0.00089) [2022-07-09 08:20:02,092][26022] Updated weights on worker 0-0, policy_version 165394 (0.00095) [2022-07-09 08:20:03,947][25689] Fps is (10 sec: 5314.1, 60 sec: 5669.0, 300 sec: 5676.7). Total num frames: 169372672. Throughput: 0: 5880.6. Samples: 169378072. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:03,947][25689] Avg episode reward: [(0, '-51.765')] [2022-07-09 08:20:03,972][26022] Updated weights on worker 0-0, policy_version 165404 (0.00098) [2022-07-09 08:20:05,520][26022] Updated weights on worker 0-0, policy_version 165414 (0.00087) [2022-07-09 08:20:07,576][26022] Updated weights on worker 0-0, policy_version 165424 (0.00090) [2022-07-09 08:20:08,973][25689] Fps is (10 sec: 5614.2, 60 sec: 5685.0, 300 sec: 5677.1). Total num frames: 169402368. Throughput: 0: 5030.7. Samples: 169395202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:08,974][25689] Avg episode reward: [(0, '-51.484')] [2022-07-09 08:20:09,127][26022] Updated weights on worker 0-0, policy_version 165434 (0.00083) [2022-07-09 08:20:11,047][26022] Updated weights on worker 0-0, policy_version 165444 (0.00089) [2022-07-09 08:20:12,843][26022] Updated weights on worker 0-0, policy_version 165454 (0.00100) [2022-07-09 08:20:14,020][25689] Fps is (10 sec: 5895.7, 60 sec: 5655.4, 300 sec: 5683.5). Total num frames: 169432064. Throughput: 0: 5906.4. Samples: 169429652. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:14,021][25689] Avg episode reward: [(0, '-51.247')] [2022-07-09 08:20:14,623][26022] Updated weights on worker 0-0, policy_version 165464 (0.00086) [2022-07-09 08:20:16,411][26022] Updated weights on worker 0-0, policy_version 165474 (0.00644) [2022-07-09 08:20:18,304][26022] Updated weights on worker 0-0, policy_version 165484 (0.00092) [2022-07-09 08:20:19,034][25689] Fps is (10 sec: 5699.9, 60 sec: 5655.3, 300 sec: 5680.1). Total num frames: 169459712. Throughput: 0: 5888.7. Samples: 169463864. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:19,034][25689] Avg episode reward: [(0, '-51.863')] [2022-07-09 08:20:20,109][26022] Updated weights on worker 0-0, policy_version 165494 (0.00083) [2022-07-09 08:20:21,863][26022] Updated weights on worker 0-0, policy_version 165504 (0.00088) [2022-07-09 08:20:23,734][26022] Updated weights on worker 0-0, policy_version 165514 (0.00093) [2022-07-09 08:20:24,039][25689] Fps is (10 sec: 5621.8, 60 sec: 5677.2, 300 sec: 5677.1). Total num frames: 169488384. Throughput: 0: 5948.1. Samples: 169497580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:24,039][25689] Avg episode reward: [(0, '-51.405')] [2022-07-09 08:20:25,524][26022] Updated weights on worker 0-0, policy_version 165524 (0.00088) [2022-07-09 08:20:27,278][26022] Updated weights on worker 0-0, policy_version 165534 (0.00086) [2022-07-09 08:20:29,072][25689] Fps is (10 sec: 5712.7, 60 sec: 5676.1, 300 sec: 5685.4). Total num frames: 169517056. Throughput: 0: 5939.8. Samples: 169514584. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:29,072][25689] Avg episode reward: [(0, '-51.674')] [2022-07-09 08:20:29,079][26022] Updated weights on worker 0-0, policy_version 165544 (0.00098) [2022-07-09 08:20:30,863][26022] Updated weights on worker 0-0, policy_version 165554 (0.00101) [2022-07-09 08:20:32,967][26022] Updated weights on worker 0-0, policy_version 165564 (0.00082) [2022-07-09 08:20:34,195][25689] Fps is (10 sec: 5747.0, 60 sec: 5675.0, 300 sec: 5676.3). Total num frames: 169546752. Throughput: 0: 5898.8. Samples: 169548656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:34,195][25689] Avg episode reward: [(0, '-52.497')] [2022-07-09 08:20:34,363][26022] Updated weights on worker 0-0, policy_version 165574 (0.00932) [2022-07-09 08:20:36,460][26022] Updated weights on worker 0-0, policy_version 165584 (0.00084) [2022-07-09 08:20:37,977][26022] Updated weights on worker 0-0, policy_version 165594 (0.00087) [2022-07-09 08:20:38,922][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:20:38,936][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000165598_169572352.pth [2022-07-09 08:20:38,937][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000163601_167527424.pth [2022-07-09 08:20:39,196][25689] Fps is (10 sec: 5563.0, 60 sec: 5648.2, 300 sec: 5673.2). Total num frames: 169573376. Throughput: 0: 5901.9. Samples: 169582858. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:39,196][25689] Avg episode reward: [(0, '-52.833')] [2022-07-09 08:20:40,052][26022] Updated weights on worker 0-0, policy_version 165604 (0.00095) [2022-07-09 08:20:41,687][26022] Updated weights on worker 0-0, policy_version 165614 (0.00053) [2022-07-09 08:20:43,491][26022] Updated weights on worker 0-0, policy_version 165624 (0.00086) [2022-07-09 08:20:44,233][25689] Fps is (10 sec: 5610.3, 60 sec: 5697.1, 300 sec: 5676.2). Total num frames: 169603072. Throughput: 0: 5065.6. Samples: 169599878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:44,234][25689] Avg episode reward: [(0, '-53.214')] [2022-07-09 08:20:45,386][26022] Updated weights on worker 0-0, policy_version 165634 (0.00080) [2022-07-09 08:20:47,039][26022] Updated weights on worker 0-0, policy_version 165644 (0.00094) [2022-07-09 08:20:48,950][26022] Updated weights on worker 0-0, policy_version 165654 (0.00094) [2022-07-09 08:20:49,245][25689] Fps is (10 sec: 5808.0, 60 sec: 5680.0, 300 sec: 5677.3). Total num frames: 169631744. Throughput: 0: 5925.5. Samples: 169634122. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:49,246][25689] Avg episode reward: [(0, '-51.941')] [2022-07-09 08:20:50,843][26022] Updated weights on worker 0-0, policy_version 165664 (0.00085) [2022-07-09 08:20:52,720][26022] Updated weights on worker 0-0, policy_version 165674 (0.00094) [2022-07-09 08:20:54,353][25689] Fps is (10 sec: 5464.3, 60 sec: 5609.2, 300 sec: 5665.0). Total num frames: 169658368. Throughput: 0: 5918.0. Samples: 169667952. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:54,354][25689] Avg episode reward: [(0, '-51.805')] [2022-07-09 08:20:54,552][26022] Updated weights on worker 0-0, policy_version 165684 (0.00085) [2022-07-09 08:20:56,170][26022] Updated weights on worker 0-0, policy_version 165694 (0.00095) [2022-07-09 08:20:58,096][26022] Updated weights on worker 0-0, policy_version 165704 (0.00096) [2022-07-09 08:20:59,381][25689] Fps is (10 sec: 5455.4, 60 sec: 5641.9, 300 sec: 5664.8). Total num frames: 169687040. Throughput: 0: 5056.6. Samples: 169684928. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:20:59,382][25689] Avg episode reward: [(0, '-52.431')] [2022-07-09 08:20:59,810][26022] Updated weights on worker 0-0, policy_version 165714 (0.00090) [2022-07-09 08:21:01,594][26022] Updated weights on worker 0-0, policy_version 165724 (0.00082) [2022-07-09 08:21:03,843][26022] Updated weights on worker 0-0, policy_version 165734 (0.00085) [2022-07-09 08:21:04,454][25689] Fps is (10 sec: 5676.9, 60 sec: 5669.4, 300 sec: 5671.1). Total num frames: 169715712. Throughput: 0: 5795.6. Samples: 169717070. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:04,455][25689] Avg episode reward: [(0, '-52.119')] [2022-07-09 08:21:05,564][26022] Updated weights on worker 0-0, policy_version 165744 (0.00095) [2022-07-09 08:21:07,460][26022] Updated weights on worker 0-0, policy_version 165754 (0.00095) [2022-07-09 08:21:09,388][26022] Updated weights on worker 0-0, policy_version 165764 (0.00094) [2022-07-09 08:21:09,499][25689] Fps is (10 sec: 5465.5, 60 sec: 5617.0, 300 sec: 5667.8). Total num frames: 169742336. Throughput: 0: 5775.0. Samples: 169751084. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:09,499][25689] Avg episode reward: [(0, '-52.526')] [2022-07-09 08:21:10,980][26022] Updated weights on worker 0-0, policy_version 165774 (0.00084) [2022-07-09 08:21:12,819][26022] Updated weights on worker 0-0, policy_version 165784 (0.00085) [2022-07-09 08:21:14,641][25689] Fps is (10 sec: 5528.7, 60 sec: 5608.1, 300 sec: 5668.7). Total num frames: 169772032. Throughput: 0: 4937.4. Samples: 169768124. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:14,642][25689] Avg episode reward: [(0, '-52.648')] [2022-07-09 08:21:14,715][26022] Updated weights on worker 0-0, policy_version 165794 (0.00089) [2022-07-09 08:21:16,373][26022] Updated weights on worker 0-0, policy_version 165804 (0.00088) [2022-07-09 08:21:18,432][26022] Updated weights on worker 0-0, policy_version 165814 (0.00089) [2022-07-09 08:21:19,659][25689] Fps is (10 sec: 5946.2, 60 sec: 5658.4, 300 sec: 5671.9). Total num frames: 169802752. Throughput: 0: 5792.2. Samples: 169802380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:19,659][25689] Avg episode reward: [(0, '-52.565')] [2022-07-09 08:21:20,128][26022] Updated weights on worker 0-0, policy_version 165824 (0.00082) [2022-07-09 08:21:21,799][26022] Updated weights on worker 0-0, policy_version 165834 (0.00084) [2022-07-09 08:21:23,773][26022] Updated weights on worker 0-0, policy_version 165844 (0.00093) [2022-07-09 08:21:24,759][25689] Fps is (10 sec: 5769.2, 60 sec: 5632.7, 300 sec: 5673.7). Total num frames: 169830400. Throughput: 0: 5892.1. Samples: 169836704. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:24,760][25689] Avg episode reward: [(0, '-52.757')] [2022-07-09 08:21:25,323][26022] Updated weights on worker 0-0, policy_version 165854 (0.00093) [2022-07-09 08:21:27,243][26022] Updated weights on worker 0-0, policy_version 165864 (0.00469) [2022-07-09 08:21:29,011][26022] Updated weights on worker 0-0, policy_version 165874 (0.00086) [2022-07-09 08:21:29,777][25689] Fps is (10 sec: 5465.0, 60 sec: 5617.2, 300 sec: 5660.8). Total num frames: 169858048. Throughput: 0: 5063.0. Samples: 169853754. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:29,779][25689] Avg episode reward: [(0, '-53.057')] [2022-07-09 08:21:30,759][26022] Updated weights on worker 0-0, policy_version 165884 (0.00086) [2022-07-09 08:21:32,812][26022] Updated weights on worker 0-0, policy_version 165894 (0.00084) [2022-07-09 08:21:34,340][26022] Updated weights on worker 0-0, policy_version 165904 (0.00088) [2022-07-09 08:21:34,916][25689] Fps is (10 sec: 5746.1, 60 sec: 5632.5, 300 sec: 5668.7). Total num frames: 169888768. Throughput: 0: 5909.0. Samples: 169887930. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:34,917][25689] Avg episode reward: [(0, '-52.876')] [2022-07-09 08:21:36,215][26022] Updated weights on worker 0-0, policy_version 165914 (0.00087) [2022-07-09 08:21:38,005][26022] Updated weights on worker 0-0, policy_version 165924 (0.00087) [2022-07-09 08:21:39,877][26022] Updated weights on worker 0-0, policy_version 165934 (0.00088) [2022-07-09 08:21:39,977][25689] Fps is (10 sec: 5722.8, 60 sec: 5643.9, 300 sec: 5668.2). Total num frames: 169916416. Throughput: 0: 5890.0. Samples: 169922050. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 08:21:39,978][25689] Avg episode reward: [(0, '-51.650')] [2022-07-09 08:21:41,648][26022] Updated weights on worker 0-0, policy_version 165944 (0.00086) [2022-07-09 08:21:43,491][26022] Updated weights on worker 0-0, policy_version 165954 (0.00094) [2022-07-09 08:21:45,051][25689] Fps is (10 sec: 5557.3, 60 sec: 5623.7, 300 sec: 5660.2). Total num frames: 169945088. Throughput: 0: 5043.5. Samples: 169939048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:21:45,051][25689] Avg episode reward: [(0, '-52.048')] [2022-07-09 08:21:45,153][26022] Updated weights on worker 0-0, policy_version 165964 (0.00087) [2022-07-09 08:21:47,202][26022] Updated weights on worker 0-0, policy_version 165974 (0.00091) [2022-07-09 08:21:48,683][26022] Updated weights on worker 0-0, policy_version 165984 (0.00096) [2022-07-09 08:21:50,109][25689] Fps is (10 sec: 5659.5, 60 sec: 5619.4, 300 sec: 5658.1). Total num frames: 169973760. Throughput: 0: 5875.9. Samples: 169973220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:21:50,109][25689] Avg episode reward: [(0, '-52.183')] [2022-07-09 08:21:50,626][26022] Updated weights on worker 0-0, policy_version 165994 (0.00087) [2022-07-09 08:21:52,299][26022] Updated weights on worker 0-0, policy_version 166004 (0.00876) [2022-07-09 08:21:54,297][26022] Updated weights on worker 0-0, policy_version 166014 (0.00090) [2022-07-09 08:21:55,165][25689] Fps is (10 sec: 5669.7, 60 sec: 5657.8, 300 sec: 5661.0). Total num frames: 170002432. Throughput: 0: 5892.8. Samples: 170007252. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:21:55,165][25689] Avg episode reward: [(0, '-51.741')] [2022-07-09 08:21:55,958][26022] Updated weights on worker 0-0, policy_version 166024 (0.00109) [2022-07-09 08:21:57,946][26022] Updated weights on worker 0-0, policy_version 166034 (0.00093) [2022-07-09 08:21:59,535][26022] Updated weights on worker 0-0, policy_version 166044 (0.00091) [2022-07-09 08:22:00,192][25689] Fps is (10 sec: 5788.7, 60 sec: 5674.8, 300 sec: 5667.6). Total num frames: 170032128. Throughput: 0: 5917.8. Samples: 170041680. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:00,192][25689] Avg episode reward: [(0, '-52.159')] [2022-07-09 08:22:01,339][26022] Updated weights on worker 0-0, policy_version 166054 (0.00089) [2022-07-09 08:22:03,569][26022] Updated weights on worker 0-0, policy_version 166064 (0.00082) [2022-07-09 08:22:05,193][25689] Fps is (10 sec: 5718.3, 60 sec: 5664.7, 300 sec: 5664.8). Total num frames: 170059776. Throughput: 0: 5842.2. Samples: 170056724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:05,194][25689] Avg episode reward: [(0, '-52.028')] [2022-07-09 08:22:05,209][26022] Updated weights on worker 0-0, policy_version 166074 (0.00086) [2022-07-09 08:22:07,214][26022] Updated weights on worker 0-0, policy_version 166084 (0.00088) [2022-07-09 08:22:08,871][26022] Updated weights on worker 0-0, policy_version 166094 (0.00082) [2022-07-09 08:22:10,242][25689] Fps is (10 sec: 5502.0, 60 sec: 5681.1, 300 sec: 5659.7). Total num frames: 170087424. Throughput: 0: 5844.8. Samples: 170090894. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:10,243][25689] Avg episode reward: [(0, '-52.250')] [2022-07-09 08:22:10,691][26022] Updated weights on worker 0-0, policy_version 166104 (0.00081) [2022-07-09 08:22:12,639][26022] Updated weights on worker 0-0, policy_version 166114 (0.00088) [2022-07-09 08:22:14,215][26022] Updated weights on worker 0-0, policy_version 166124 (0.00095) [2022-07-09 08:22:15,349][25689] Fps is (10 sec: 5545.8, 60 sec: 5667.6, 300 sec: 5655.3). Total num frames: 170116096. Throughput: 0: 5853.1. Samples: 170125388. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:15,349][25689] Avg episode reward: [(0, '-52.383')] [2022-07-09 08:22:16,013][26022] Updated weights on worker 0-0, policy_version 166134 (0.00085) [2022-07-09 08:22:17,836][26022] Updated weights on worker 0-0, policy_version 166144 (0.00085) [2022-07-09 08:22:19,540][26022] Updated weights on worker 0-0, policy_version 166154 (0.00087) [2022-07-09 08:22:20,393][25689] Fps is (10 sec: 5750.2, 60 sec: 5648.3, 300 sec: 5661.5). Total num frames: 170145792. Throughput: 0: 4987.4. Samples: 170142430. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:20,393][25689] Avg episode reward: [(0, '-52.525')] [2022-07-09 08:22:21,460][26022] Updated weights on worker 0-0, policy_version 166164 (0.00087) [2022-07-09 08:22:23,208][26022] Updated weights on worker 0-0, policy_version 166174 (0.00109) [2022-07-09 08:22:25,189][26022] Updated weights on worker 0-0, policy_version 166184 (0.00090) [2022-07-09 08:22:25,475][25689] Fps is (10 sec: 5763.9, 60 sec: 5666.7, 300 sec: 5660.1). Total num frames: 170174464. Throughput: 0: 5917.5. Samples: 170176742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:25,476][25689] Avg episode reward: [(0, '-52.450')] [2022-07-09 08:22:26,796][26022] Updated weights on worker 0-0, policy_version 166194 (0.00089) [2022-07-09 08:22:28,709][26022] Updated weights on worker 0-0, policy_version 166204 (0.00094) [2022-07-09 08:22:30,443][26022] Updated weights on worker 0-0, policy_version 166214 (0.00087) [2022-07-09 08:22:30,542][25689] Fps is (10 sec: 5649.9, 60 sec: 5679.1, 300 sec: 5663.4). Total num frames: 170203136. Throughput: 0: 5921.7. Samples: 170211106. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:30,543][25689] Avg episode reward: [(0, '-52.208')] [2022-07-09 08:22:32,183][26022] Updated weights on worker 0-0, policy_version 166224 (0.00093) [2022-07-09 08:22:34,024][26022] Updated weights on worker 0-0, policy_version 166234 (0.00082) [2022-07-09 08:22:35,605][25689] Fps is (10 sec: 5660.6, 60 sec: 5652.4, 300 sec: 5660.3). Total num frames: 170231808. Throughput: 0: 5078.0. Samples: 170228252. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:35,606][25689] Avg episode reward: [(0, '-52.477')] [2022-07-09 08:22:35,871][26022] Updated weights on worker 0-0, policy_version 166244 (0.00087) [2022-07-09 08:22:37,544][26022] Updated weights on worker 0-0, policy_version 166254 (0.00089) [2022-07-09 08:22:39,091][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:22:39,109][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000166262_170252288.pth [2022-07-09 08:22:39,109][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000164267_168209408.pth [2022-07-09 08:22:39,394][26022] Updated weights on worker 0-0, policy_version 166264 (0.00086) [2022-07-09 08:22:40,617][25689] Fps is (10 sec: 5793.3, 60 sec: 5690.7, 300 sec: 5667.1). Total num frames: 170261504. Throughput: 0: 5935.5. Samples: 170262472. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:40,618][25689] Avg episode reward: [(0, '-53.168')] [2022-07-09 08:22:41,203][26022] Updated weights on worker 0-0, policy_version 166274 (0.00103) [2022-07-09 08:22:43,286][26022] Updated weights on worker 0-0, policy_version 166284 (0.00094) [2022-07-09 08:22:44,891][26022] Updated weights on worker 0-0, policy_version 166294 (0.00089) [2022-07-09 08:22:45,638][25689] Fps is (10 sec: 5716.1, 60 sec: 5678.9, 300 sec: 5660.9). Total num frames: 170289152. Throughput: 0: 5941.2. Samples: 170296532. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:45,638][25689] Avg episode reward: [(0, '-53.187')] [2022-07-09 08:22:46,722][26022] Updated weights on worker 0-0, policy_version 166304 (0.00080) [2022-07-09 08:22:48,552][26022] Updated weights on worker 0-0, policy_version 166314 (0.00091) [2022-07-09 08:22:50,196][26022] Updated weights on worker 0-0, policy_version 166324 (0.00092) [2022-07-09 08:22:50,675][25689] Fps is (10 sec: 5701.4, 60 sec: 5697.7, 300 sec: 5665.6). Total num frames: 170318848. Throughput: 0: 5100.5. Samples: 170313794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:50,675][25689] Avg episode reward: [(0, '-52.924')] [2022-07-09 08:22:52,171][26022] Updated weights on worker 0-0, policy_version 166334 (0.00088) [2022-07-09 08:22:53,678][26022] Updated weights on worker 0-0, policy_version 166344 (0.00092) [2022-07-09 08:22:55,575][26022] Updated weights on worker 0-0, policy_version 166354 (0.00089) [2022-07-09 08:22:55,709][25689] Fps is (10 sec: 5795.2, 60 sec: 5699.8, 300 sec: 5666.6). Total num frames: 170347520. Throughput: 0: 5961.8. Samples: 170348106. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:22:55,710][25689] Avg episode reward: [(0, '-52.744')] [2022-07-09 08:22:57,326][26022] Updated weights on worker 0-0, policy_version 166364 (0.00082) [2022-07-09 08:22:59,135][26022] Updated weights on worker 0-0, policy_version 166374 (0.00082) [2022-07-09 08:23:00,744][25689] Fps is (10 sec: 5593.6, 60 sec: 5665.2, 300 sec: 5669.8). Total num frames: 170375168. Throughput: 0: 5945.2. Samples: 170382128. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:00,744][25689] Avg episode reward: [(0, '-52.739')] [2022-07-09 08:23:01,039][26022] Updated weights on worker 0-0, policy_version 166384 (0.00091) [2022-07-09 08:23:03,244][26022] Updated weights on worker 0-0, policy_version 166394 (0.00110) [2022-07-09 08:23:04,969][26022] Updated weights on worker 0-0, policy_version 166404 (0.00086) [2022-07-09 08:23:05,780][25689] Fps is (10 sec: 5388.9, 60 sec: 5645.0, 300 sec: 5663.2). Total num frames: 170401792. Throughput: 0: 4989.6. Samples: 170397036. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:05,783][25689] Avg episode reward: [(0, '-52.846')] [2022-07-09 08:23:06,951][26022] Updated weights on worker 0-0, policy_version 166414 (0.00088) [2022-07-09 08:23:08,483][26022] Updated weights on worker 0-0, policy_version 166424 (0.00089) [2022-07-09 08:23:10,415][26022] Updated weights on worker 0-0, policy_version 166434 (0.00085) [2022-07-09 08:23:10,814][25689] Fps is (10 sec: 5490.9, 60 sec: 5663.3, 300 sec: 5664.4). Total num frames: 170430464. Throughput: 0: 5836.9. Samples: 170431344. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:10,815][25689] Avg episode reward: [(0, '-53.410')] [2022-07-09 08:23:12,163][26022] Updated weights on worker 0-0, policy_version 166444 (0.00087) [2022-07-09 08:23:13,981][26022] Updated weights on worker 0-0, policy_version 166454 (0.00091) [2022-07-09 08:23:15,807][26022] Updated weights on worker 0-0, policy_version 166464 (0.00218) [2022-07-09 08:23:15,894][25689] Fps is (10 sec: 5669.8, 60 sec: 5665.8, 300 sec: 5659.5). Total num frames: 170459136. Throughput: 0: 5822.2. Samples: 170465628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:15,895][25689] Avg episode reward: [(0, '-53.019')] [2022-07-09 08:23:17,644][26022] Updated weights on worker 0-0, policy_version 166474 (0.00101) [2022-07-09 08:23:19,395][26022] Updated weights on worker 0-0, policy_version 166484 (0.00093) [2022-07-09 08:23:20,917][25689] Fps is (10 sec: 5574.8, 60 sec: 5634.0, 300 sec: 5659.5). Total num frames: 170486784. Throughput: 0: 4981.7. Samples: 170482624. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:20,917][25689] Avg episode reward: [(0, '-53.620')] [2022-07-09 08:23:21,206][26022] Updated weights on worker 0-0, policy_version 166494 (0.00091) [2022-07-09 08:23:22,847][26022] Updated weights on worker 0-0, policy_version 166504 (0.00087) [2022-07-09 08:23:24,856][26022] Updated weights on worker 0-0, policy_version 166514 (0.00089) [2022-07-09 08:23:25,924][25689] Fps is (10 sec: 5717.4, 60 sec: 5657.9, 300 sec: 5659.4). Total num frames: 170516480. Throughput: 0: 5946.6. Samples: 170516822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:25,924][25689] Avg episode reward: [(0, '-53.359')] [2022-07-09 08:23:26,724][26022] Updated weights on worker 0-0, policy_version 166524 (0.00087) [2022-07-09 08:23:28,519][26022] Updated weights on worker 0-0, policy_version 166534 (0.00095) [2022-07-09 08:23:30,419][26022] Updated weights on worker 0-0, policy_version 166544 (0.00085) [2022-07-09 08:23:30,932][25689] Fps is (10 sec: 5827.9, 60 sec: 5663.5, 300 sec: 5663.8). Total num frames: 170545152. Throughput: 0: 5931.4. Samples: 170550670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:30,933][25689] Avg episode reward: [(0, '-53.118')] [2022-07-09 08:23:32,122][26022] Updated weights on worker 0-0, policy_version 166554 (0.00085) [2022-07-09 08:23:33,658][26022] Updated weights on worker 0-0, policy_version 166564 (0.00100) [2022-07-09 08:23:35,791][26022] Updated weights on worker 0-0, policy_version 166574 (0.00087) [2022-07-09 08:23:35,979][25689] Fps is (10 sec: 5601.2, 60 sec: 5648.0, 300 sec: 5656.1). Total num frames: 170572800. Throughput: 0: 5092.5. Samples: 170567908. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:35,979][25689] Avg episode reward: [(0, '-52.231')] [2022-07-09 08:23:37,170][26022] Updated weights on worker 0-0, policy_version 166584 (0.00091) [2022-07-09 08:23:39,303][26022] Updated weights on worker 0-0, policy_version 166594 (0.00084) [2022-07-09 08:23:40,767][26022] Updated weights on worker 0-0, policy_version 166604 (0.00087) [2022-07-09 08:23:40,990][25689] Fps is (10 sec: 5701.5, 60 sec: 5648.1, 300 sec: 5655.9). Total num frames: 170602496. Throughput: 0: 5966.0. Samples: 170602378. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:40,990][25689] Avg episode reward: [(0, '-50.442')] [2022-07-09 08:23:42,862][26022] Updated weights on worker 0-0, policy_version 166614 (0.00092) [2022-07-09 08:23:44,580][26022] Updated weights on worker 0-0, policy_version 166624 (0.00083) [2022-07-09 08:23:46,011][25689] Fps is (10 sec: 5716.1, 60 sec: 5648.0, 300 sec: 5659.0). Total num frames: 170630144. Throughput: 0: 5975.8. Samples: 170636858. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:46,011][25689] Avg episode reward: [(0, '-49.678')] [2022-07-09 08:23:46,410][26022] Updated weights on worker 0-0, policy_version 166634 (0.00093) [2022-07-09 08:23:47,955][26022] Updated weights on worker 0-0, policy_version 166644 (0.00096) [2022-07-09 08:23:50,050][26022] Updated weights on worker 0-0, policy_version 166654 (0.00087) [2022-07-09 08:23:51,012][25689] Fps is (10 sec: 5721.7, 60 sec: 5651.5, 300 sec: 5663.4). Total num frames: 170659840. Throughput: 0: 5148.8. Samples: 170654058. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 08:23:51,012][25689] Avg episode reward: [(0, '-50.207')] [2022-07-09 08:23:51,533][26022] Updated weights on worker 0-0, policy_version 166664 (0.00088) [2022-07-09 08:23:53,774][26022] Updated weights on worker 0-0, policy_version 166674 (0.00090) [2022-07-09 08:23:55,122][26022] Updated weights on worker 0-0, policy_version 166684 (0.00092) [2022-07-09 08:23:56,090][25689] Fps is (10 sec: 5689.5, 60 sec: 5630.4, 300 sec: 5651.9). Total num frames: 170687488. Throughput: 0: 5963.9. Samples: 170687848. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:23:56,090][25689] Avg episode reward: [(0, '-50.008')] [2022-07-09 08:23:57,309][26022] Updated weights on worker 0-0, policy_version 166694 (0.00081) [2022-07-09 08:23:58,863][26022] Updated weights on worker 0-0, policy_version 166704 (0.00085) [2022-07-09 08:24:00,670][26022] Updated weights on worker 0-0, policy_version 166714 (0.00092) [2022-07-09 08:24:01,103][25689] Fps is (10 sec: 5682.8, 60 sec: 5666.4, 300 sec: 5669.2). Total num frames: 170717184. Throughput: 0: 5953.1. Samples: 170722114. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:01,103][25689] Avg episode reward: [(0, '-50.051')] [2022-07-09 08:24:02,979][26022] Updated weights on worker 0-0, policy_version 166724 (0.00086) [2022-07-09 08:24:04,701][26022] Updated weights on worker 0-0, policy_version 166734 (0.00095) [2022-07-09 08:24:06,159][25689] Fps is (10 sec: 5593.3, 60 sec: 5664.5, 300 sec: 5661.6). Total num frames: 170743808. Throughput: 0: 4978.2. Samples: 170737160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:06,160][25689] Avg episode reward: [(0, '-50.674')] [2022-07-09 08:24:06,436][26022] Updated weights on worker 0-0, policy_version 166744 (0.00087) [2022-07-09 08:24:08,329][26022] Updated weights on worker 0-0, policy_version 166754 (0.00115) [2022-07-09 08:24:10,137][26022] Updated weights on worker 0-0, policy_version 166764 (0.00615) [2022-07-09 08:24:11,163][25689] Fps is (10 sec: 5394.6, 60 sec: 5650.3, 300 sec: 5649.5). Total num frames: 170771456. Throughput: 0: 5824.3. Samples: 170771424. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:11,164][25689] Avg episode reward: [(0, '-52.100')] [2022-07-09 08:24:11,910][26022] Updated weights on worker 0-0, policy_version 166774 (0.00088) [2022-07-09 08:24:13,708][26022] Updated weights on worker 0-0, policy_version 166784 (0.00088) [2022-07-09 08:24:15,400][26022] Updated weights on worker 0-0, policy_version 166794 (0.00098) [2022-07-09 08:24:16,242][25689] Fps is (10 sec: 5687.3, 60 sec: 5667.4, 300 sec: 5655.2). Total num frames: 170801152. Throughput: 0: 5852.1. Samples: 170805780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:16,244][25689] Avg episode reward: [(0, '-51.857')] [2022-07-09 08:24:17,183][26022] Updated weights on worker 0-0, policy_version 166804 (0.00093) [2022-07-09 08:24:19,266][26022] Updated weights on worker 0-0, policy_version 166814 (0.00083) [2022-07-09 08:24:20,854][26022] Updated weights on worker 0-0, policy_version 166824 (0.00106) [2022-07-09 08:24:21,303][25689] Fps is (10 sec: 5756.6, 60 sec: 5680.8, 300 sec: 5658.5). Total num frames: 170829824. Throughput: 0: 4982.1. Samples: 170822752. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:21,303][25689] Avg episode reward: [(0, '-51.460')] [2022-07-09 08:24:22,843][26022] Updated weights on worker 0-0, policy_version 166834 (0.00087) [2022-07-09 08:24:24,450][26022] Updated weights on worker 0-0, policy_version 166844 (0.00103) [2022-07-09 08:24:26,247][26022] Updated weights on worker 0-0, policy_version 166854 (0.00096) [2022-07-09 08:24:26,330][25689] Fps is (10 sec: 5684.5, 60 sec: 5662.0, 300 sec: 5658.4). Total num frames: 170858496. Throughput: 0: 5940.8. Samples: 170856990. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:26,330][25689] Avg episode reward: [(0, '-51.769')] [2022-07-09 08:24:28,098][26022] Updated weights on worker 0-0, policy_version 166864 (0.00082) [2022-07-09 08:24:29,852][26022] Updated weights on worker 0-0, policy_version 166874 (0.00090) [2022-07-09 08:24:31,340][25689] Fps is (10 sec: 5713.1, 60 sec: 5661.8, 300 sec: 5656.9). Total num frames: 170887168. Throughput: 0: 5950.7. Samples: 170891490. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:31,340][25689] Avg episode reward: [(0, '-52.117')] [2022-07-09 08:24:31,654][26022] Updated weights on worker 0-0, policy_version 166884 (0.00084) [2022-07-09 08:24:33,543][26022] Updated weights on worker 0-0, policy_version 166894 (0.00086) [2022-07-09 08:24:35,092][26022] Updated weights on worker 0-0, policy_version 166904 (0.00090) [2022-07-09 08:24:36,400][25689] Fps is (10 sec: 5694.5, 60 sec: 5677.5, 300 sec: 5657.2). Total num frames: 170915840. Throughput: 0: 5966.6. Samples: 170926054. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:36,400][25689] Avg episode reward: [(0, '-51.841')] [2022-07-09 08:24:36,924][26022] Updated weights on worker 0-0, policy_version 166914 (0.00089) [2022-07-09 08:24:38,804][26022] Updated weights on worker 0-0, policy_version 166924 (0.00090) [2022-07-09 08:24:39,136][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:24:39,157][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000166926_170932224.pth [2022-07-09 08:24:39,157][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000164933_168891392.pth [2022-07-09 08:24:40,511][26022] Updated weights on worker 0-0, policy_version 166934 (0.00090) [2022-07-09 08:24:41,406][25689] Fps is (10 sec: 5900.0, 60 sec: 5694.9, 300 sec: 5671.2). Total num frames: 170946560. Throughput: 0: 6007.1. Samples: 170943516. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:41,407][25689] Avg episode reward: [(0, '-52.066')] [2022-07-09 08:24:42,293][26022] Updated weights on worker 0-0, policy_version 166944 (0.00083) [2022-07-09 08:24:43,927][26022] Updated weights on worker 0-0, policy_version 166954 (0.00088) [2022-07-09 08:24:45,621][26022] Updated weights on worker 0-0, policy_version 166964 (0.00082) [2022-07-09 08:24:46,465][25689] Fps is (10 sec: 6002.5, 60 sec: 5725.2, 300 sec: 5670.3). Total num frames: 170976256. Throughput: 0: 6058.8. Samples: 170978984. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:46,465][25689] Avg episode reward: [(0, '-51.444')] [2022-07-09 08:24:47,417][26022] Updated weights on worker 0-0, policy_version 166974 (0.00096) [2022-07-09 08:24:49,153][26022] Updated weights on worker 0-0, policy_version 166984 (0.00091) [2022-07-09 08:24:50,983][26022] Updated weights on worker 0-0, policy_version 166994 (0.00094) [2022-07-09 08:24:51,487][25689] Fps is (10 sec: 5789.8, 60 sec: 5706.2, 300 sec: 5664.4). Total num frames: 171004928. Throughput: 0: 6086.0. Samples: 171014108. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:51,488][25689] Avg episode reward: [(0, '-52.203')] [2022-07-09 08:24:52,865][26022] Updated weights on worker 0-0, policy_version 167004 (0.00087) [2022-07-09 08:24:54,383][26022] Updated weights on worker 0-0, policy_version 167014 (0.00083) [2022-07-09 08:24:56,174][26022] Updated weights on worker 0-0, policy_version 167024 (0.00079) [2022-07-09 08:24:56,548][25689] Fps is (10 sec: 5788.6, 60 sec: 5741.7, 300 sec: 5673.9). Total num frames: 171034624. Throughput: 0: 5253.5. Samples: 171031904. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:24:56,549][25689] Avg episode reward: [(0, '-51.072')] [2022-07-09 08:24:58,080][26022] Updated weights on worker 0-0, policy_version 167034 (0.00083) [2022-07-09 08:24:59,743][26022] Updated weights on worker 0-0, policy_version 167044 (0.00086) [2022-07-09 08:25:01,555][25689] Fps is (10 sec: 5594.3, 60 sec: 5691.5, 300 sec: 5673.8). Total num frames: 171061248. Throughput: 0: 6102.2. Samples: 171066466. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:01,557][25689] Avg episode reward: [(0, '-51.319')] [2022-07-09 08:25:02,053][26022] Updated weights on worker 0-0, policy_version 167054 (0.00085) [2022-07-09 08:25:03,527][26022] Updated weights on worker 0-0, policy_version 167064 (0.00096) [2022-07-09 08:25:05,628][26022] Updated weights on worker 0-0, policy_version 167074 (0.00097) [2022-07-09 08:25:06,567][25689] Fps is (10 sec: 5519.0, 60 sec: 5729.5, 300 sec: 5670.7). Total num frames: 171089920. Throughput: 0: 5970.7. Samples: 171099010. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:06,569][25689] Avg episode reward: [(0, '-50.184')] [2022-07-09 08:25:07,050][26022] Updated weights on worker 0-0, policy_version 167084 (0.00098) [2022-07-09 08:25:09,011][26022] Updated weights on worker 0-0, policy_version 167094 (0.00083) [2022-07-09 08:25:10,608][26022] Updated weights on worker 0-0, policy_version 167104 (0.00094) [2022-07-09 08:25:11,571][25689] Fps is (10 sec: 5725.0, 60 sec: 5746.5, 300 sec: 5668.0). Total num frames: 171118592. Throughput: 0: 5105.6. Samples: 171116648. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:11,572][25689] Avg episode reward: [(0, '-50.543')] [2022-07-09 08:25:12,439][26022] Updated weights on worker 0-0, policy_version 167114 (0.00093) [2022-07-09 08:25:14,261][26022] Updated weights on worker 0-0, policy_version 167124 (0.00085) [2022-07-09 08:25:15,919][26022] Updated weights on worker 0-0, policy_version 167134 (0.00088) [2022-07-09 08:25:16,612][25689] Fps is (10 sec: 5811.0, 60 sec: 5750.2, 300 sec: 5674.4). Total num frames: 171148288. Throughput: 0: 5970.6. Samples: 171151696. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:16,612][25689] Avg episode reward: [(0, '-50.708')] [2022-07-09 08:25:17,830][26022] Updated weights on worker 0-0, policy_version 167144 (0.00091) [2022-07-09 08:25:19,581][26022] Updated weights on worker 0-0, policy_version 167154 (0.00084) [2022-07-09 08:25:21,207][26022] Updated weights on worker 0-0, policy_version 167164 (0.00084) [2022-07-09 08:25:21,616][25689] Fps is (10 sec: 5912.5, 60 sec: 5772.5, 300 sec: 5677.8). Total num frames: 171177984. Throughput: 0: 5977.5. Samples: 171186384. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:21,619][25689] Avg episode reward: [(0, '-50.076')] [2022-07-09 08:25:23,063][26022] Updated weights on worker 0-0, policy_version 167174 (0.00094) [2022-07-09 08:25:24,798][26022] Updated weights on worker 0-0, policy_version 167184 (0.00087) [2022-07-09 08:25:26,643][25689] Fps is (10 sec: 5614.7, 60 sec: 5738.6, 300 sec: 5671.1). Total num frames: 171204608. Throughput: 0: 5222.8. Samples: 171203860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:26,643][25689] Avg episode reward: [(0, '-50.651')] [2022-07-09 08:25:26,876][26022] Updated weights on worker 0-0, policy_version 167194 (0.00086) [2022-07-09 08:25:28,400][26022] Updated weights on worker 0-0, policy_version 167204 (0.00081) [2022-07-09 08:25:30,319][26022] Updated weights on worker 0-0, policy_version 167214 (0.00085) [2022-07-09 08:25:31,659][25689] Fps is (10 sec: 5710.2, 60 sec: 5772.0, 300 sec: 5676.6). Total num frames: 171235328. Throughput: 0: 6068.4. Samples: 171238548. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:31,660][25689] Avg episode reward: [(0, '-50.670')] [2022-07-09 08:25:31,887][26022] Updated weights on worker 0-0, policy_version 167224 (0.00090) [2022-07-09 08:25:33,687][26022] Updated weights on worker 0-0, policy_version 167234 (0.00088) [2022-07-09 08:25:35,507][26022] Updated weights on worker 0-0, policy_version 167244 (0.00083) [2022-07-09 08:25:36,785][25689] Fps is (10 sec: 6057.5, 60 sec: 5799.5, 300 sec: 5687.9). Total num frames: 171266048. Throughput: 0: 6040.8. Samples: 171273560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:36,786][25689] Avg episode reward: [(0, '-50.210')] [2022-07-09 08:25:37,303][26022] Updated weights on worker 0-0, policy_version 167254 (0.00086) [2022-07-09 08:25:38,959][26022] Updated weights on worker 0-0, policy_version 167264 (0.00087) [2022-07-09 08:25:40,859][26022] Updated weights on worker 0-0, policy_version 167274 (0.00085) [2022-07-09 08:25:41,815][25689] Fps is (10 sec: 5747.1, 60 sec: 5746.5, 300 sec: 5681.2). Total num frames: 171293696. Throughput: 0: 5178.7. Samples: 171290986. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:41,815][25689] Avg episode reward: [(0, '-50.795')] [2022-07-09 08:25:42,393][26022] Updated weights on worker 0-0, policy_version 167284 (0.00080) [2022-07-09 08:25:44,297][26022] Updated weights on worker 0-0, policy_version 167294 (0.00082) [2022-07-09 08:25:45,783][26022] Updated weights on worker 0-0, policy_version 167304 (0.00085) [2022-07-09 08:25:46,819][25689] Fps is (10 sec: 5715.0, 60 sec: 5751.6, 300 sec: 5684.8). Total num frames: 171323392. Throughput: 0: 6053.3. Samples: 171325996. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:46,820][25689] Avg episode reward: [(0, '-51.055')] [2022-07-09 08:25:47,901][26022] Updated weights on worker 0-0, policy_version 167314 (0.00086) [2022-07-09 08:25:49,339][26022] Updated weights on worker 0-0, policy_version 167324 (0.00088) [2022-07-09 08:25:51,427][26022] Updated weights on worker 0-0, policy_version 167334 (0.00093) [2022-07-09 08:25:51,822][25689] Fps is (10 sec: 5934.9, 60 sec: 5770.5, 300 sec: 5697.1). Total num frames: 171353088. Throughput: 0: 6062.7. Samples: 171360792. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:51,823][25689] Avg episode reward: [(0, '-51.161')] [2022-07-09 08:25:53,007][26022] Updated weights on worker 0-0, policy_version 167344 (0.00090) [2022-07-09 08:25:54,880][26022] Updated weights on worker 0-0, policy_version 167354 (0.00088) [2022-07-09 08:25:56,471][26022] Updated weights on worker 0-0, policy_version 167364 (0.00085) [2022-07-09 08:25:56,901][25689] Fps is (10 sec: 5789.4, 60 sec: 5751.8, 300 sec: 5696.2). Total num frames: 171381760. Throughput: 0: 5205.1. Samples: 171378264. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-09 08:25:56,902][25689] Avg episode reward: [(0, '-51.825')] [2022-07-09 08:25:58,459][26022] Updated weights on worker 0-0, policy_version 167374 (0.00087) [2022-07-09 08:26:00,196][26022] Updated weights on worker 0-0, policy_version 167384 (0.00089) [2022-07-09 08:26:01,896][26022] Updated weights on worker 0-0, policy_version 167394 (0.00085) [2022-07-09 08:26:01,985][25689] Fps is (10 sec: 5743.0, 60 sec: 5795.2, 300 sec: 5699.4). Total num frames: 171411456. Throughput: 0: 6047.2. Samples: 171412962. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:01,986][25689] Avg episode reward: [(0, '-51.477')] [2022-07-09 08:26:04,098][26022] Updated weights on worker 0-0, policy_version 167404 (0.00093) [2022-07-09 08:26:05,779][26022] Updated weights on worker 0-0, policy_version 167414 (0.00096) [2022-07-09 08:26:07,032][25689] Fps is (10 sec: 5660.4, 60 sec: 5775.0, 300 sec: 5702.8). Total num frames: 171439104. Throughput: 0: 5934.3. Samples: 171445944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:07,032][25689] Avg episode reward: [(0, '-51.862')] [2022-07-09 08:26:07,727][26022] Updated weights on worker 0-0, policy_version 167424 (0.00086) [2022-07-09 08:26:09,263][26022] Updated weights on worker 0-0, policy_version 167434 (0.00090) [2022-07-09 08:26:10,967][26022] Updated weights on worker 0-0, policy_version 167444 (0.00087) [2022-07-09 08:26:12,059][25689] Fps is (10 sec: 5489.2, 60 sec: 5755.9, 300 sec: 5698.1). Total num frames: 171466752. Throughput: 0: 5074.1. Samples: 171463476. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:12,060][25689] Avg episode reward: [(0, '-52.625')] [2022-07-09 08:26:12,839][26022] Updated weights on worker 0-0, policy_version 167454 (0.00088) [2022-07-09 08:26:14,549][26022] Updated weights on worker 0-0, policy_version 167464 (0.00088) [2022-07-09 08:26:16,243][26022] Updated weights on worker 0-0, policy_version 167474 (0.00086) [2022-07-09 08:26:17,115][25689] Fps is (10 sec: 5890.4, 60 sec: 5788.3, 300 sec: 5700.8). Total num frames: 171498496. Throughput: 0: 5956.2. Samples: 171498660. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:17,115][25689] Avg episode reward: [(0, '-52.889')] [2022-07-09 08:26:17,965][26022] Updated weights on worker 0-0, policy_version 167484 (0.00093) [2022-07-09 08:26:19,792][26022] Updated weights on worker 0-0, policy_version 167494 (0.00084) [2022-07-09 08:26:21,593][26022] Updated weights on worker 0-0, policy_version 167504 (0.00084) [2022-07-09 08:26:22,122][25689] Fps is (10 sec: 6003.7, 60 sec: 5771.1, 300 sec: 5706.0). Total num frames: 171527168. Throughput: 0: 5996.9. Samples: 171533720. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:22,123][25689] Avg episode reward: [(0, '-53.460')] [2022-07-09 08:26:23,394][26022] Updated weights on worker 0-0, policy_version 167514 (0.00086) [2022-07-09 08:26:24,831][26022] Updated weights on worker 0-0, policy_version 167524 (0.00085) [2022-07-09 08:26:26,905][26022] Updated weights on worker 0-0, policy_version 167534 (0.00084) [2022-07-09 08:26:27,150][25689] Fps is (10 sec: 5714.2, 60 sec: 5804.8, 300 sec: 5709.3). Total num frames: 171555840. Throughput: 0: 5241.9. Samples: 171551402. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:27,150][25689] Avg episode reward: [(0, '-53.192')] [2022-07-09 08:26:28,591][26022] Updated weights on worker 0-0, policy_version 167544 (0.00084) [2022-07-09 08:26:30,432][26022] Updated weights on worker 0-0, policy_version 167554 (0.00081) [2022-07-09 08:26:32,159][25689] Fps is (10 sec: 5713.2, 60 sec: 5771.6, 300 sec: 5704.9). Total num frames: 171584512. Throughput: 0: 6094.9. Samples: 171585984. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:32,160][25689] Avg episode reward: [(0, '-53.983')] [2022-07-09 08:26:32,247][26022] Updated weights on worker 0-0, policy_version 167564 (0.00086) [2022-07-09 08:26:33,982][26022] Updated weights on worker 0-0, policy_version 167574 (0.00084) [2022-07-09 08:26:35,654][26022] Updated weights on worker 0-0, policy_version 167584 (0.00611) [2022-07-09 08:26:37,206][25689] Fps is (10 sec: 5804.3, 60 sec: 5762.3, 300 sec: 5712.1). Total num frames: 171614208. Throughput: 0: 6093.1. Samples: 171621078. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:37,207][25689] Avg episode reward: [(0, '-52.812')] [2022-07-09 08:26:37,407][26022] Updated weights on worker 0-0, policy_version 167594 (0.00088) [2022-07-09 08:26:39,042][26022] Updated weights on worker 0-0, policy_version 167604 (0.00086) [2022-07-09 08:26:39,274][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:26:39,286][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000167605_171627520.pth [2022-07-09 08:26:39,286][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000165598_169572352.pth [2022-07-09 08:26:39,287][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000167605_171627520.pth.milestone [2022-07-09 08:26:40,951][26022] Updated weights on worker 0-0, policy_version 167614 (0.00094) [2022-07-09 08:26:42,222][25689] Fps is (10 sec: 6004.0, 60 sec: 5814.4, 300 sec: 5720.1). Total num frames: 171644928. Throughput: 0: 5207.4. Samples: 171638388. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:42,222][25689] Avg episode reward: [(0, '-53.468')] [2022-07-09 08:26:42,691][26022] Updated weights on worker 0-0, policy_version 167624 (0.00049) [2022-07-09 08:26:44,435][26022] Updated weights on worker 0-0, policy_version 167634 (0.00087) [2022-07-09 08:26:46,247][26022] Updated weights on worker 0-0, policy_version 167644 (0.00083) [2022-07-09 08:26:47,237][25689] Fps is (10 sec: 5818.9, 60 sec: 5779.5, 300 sec: 5717.5). Total num frames: 171672576. Throughput: 0: 6075.9. Samples: 171673448. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:47,237][25689] Avg episode reward: [(0, '-53.643')] [2022-07-09 08:26:48,006][26022] Updated weights on worker 0-0, policy_version 167654 (0.00085) [2022-07-09 08:26:49,730][26022] Updated weights on worker 0-0, policy_version 167664 (0.00091) [2022-07-09 08:26:51,466][26022] Updated weights on worker 0-0, policy_version 167674 (0.00085) [2022-07-09 08:26:52,247][25689] Fps is (10 sec: 5617.6, 60 sec: 5761.8, 300 sec: 5718.3). Total num frames: 171701248. Throughput: 0: 6100.4. Samples: 171708530. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:52,249][25689] Avg episode reward: [(0, '-54.299')] [2022-07-09 08:26:53,158][26022] Updated weights on worker 0-0, policy_version 167684 (0.00082) [2022-07-09 08:26:55,054][26022] Updated weights on worker 0-0, policy_version 167694 (0.00092) [2022-07-09 08:26:56,904][26022] Updated weights on worker 0-0, policy_version 167704 (0.00080) [2022-07-09 08:26:57,299][25689] Fps is (10 sec: 5902.5, 60 sec: 5798.3, 300 sec: 5721.3). Total num frames: 171731968. Throughput: 0: 5216.8. Samples: 171725898. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:26:57,300][25689] Avg episode reward: [(0, '-54.546')] [2022-07-09 08:26:58,710][26022] Updated weights on worker 0-0, policy_version 167714 (0.00098) [2022-07-09 08:27:00,273][26022] Updated weights on worker 0-0, policy_version 167724 (0.00083) [2022-07-09 08:27:02,316][25689] Fps is (10 sec: 5593.6, 60 sec: 5736.9, 300 sec: 5714.1). Total num frames: 171757568. Throughput: 0: 6078.9. Samples: 171760538. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:02,317][25689] Avg episode reward: [(0, '-54.532')] [2022-07-09 08:27:02,597][26022] Updated weights on worker 0-0, policy_version 167734 (0.00084) [2022-07-09 08:27:04,178][26022] Updated weights on worker 0-0, policy_version 167744 (0.00098) [2022-07-09 08:27:06,185][26022] Updated weights on worker 0-0, policy_version 167754 (0.00077) [2022-07-09 08:27:07,325][25689] Fps is (10 sec: 5617.2, 60 sec: 5791.4, 300 sec: 5725.2). Total num frames: 171788288. Throughput: 0: 5950.5. Samples: 171792986. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:07,326][25689] Avg episode reward: [(0, '-53.979')] [2022-07-09 08:27:07,685][26022] Updated weights on worker 0-0, policy_version 167764 (0.00085) [2022-07-09 08:27:09,703][26022] Updated weights on worker 0-0, policy_version 167774 (0.00093) [2022-07-09 08:27:11,278][26022] Updated weights on worker 0-0, policy_version 167784 (0.00068) [2022-07-09 08:27:12,344][25689] Fps is (10 sec: 5820.6, 60 sec: 5792.2, 300 sec: 5723.5). Total num frames: 171815936. Throughput: 0: 5067.5. Samples: 171810368. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:12,344][25689] Avg episode reward: [(0, '-53.764')] [2022-07-09 08:27:13,243][26022] Updated weights on worker 0-0, policy_version 167794 (0.00088) [2022-07-09 08:27:14,895][26022] Updated weights on worker 0-0, policy_version 167804 (0.00089) [2022-07-09 08:27:16,733][26022] Updated weights on worker 0-0, policy_version 167814 (0.00086) [2022-07-09 08:27:17,474][25689] Fps is (10 sec: 5549.6, 60 sec: 5734.2, 300 sec: 5718.4). Total num frames: 171844608. Throughput: 0: 5906.3. Samples: 171845058. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:17,474][25689] Avg episode reward: [(0, '-53.613')] [2022-07-09 08:27:18,397][26022] Updated weights on worker 0-0, policy_version 167824 (0.00082) [2022-07-09 08:27:20,447][26022] Updated weights on worker 0-0, policy_version 167834 (0.00088) [2022-07-09 08:27:21,860][26022] Updated weights on worker 0-0, policy_version 167844 (0.00088) [2022-07-09 08:27:22,505][25689] Fps is (10 sec: 5845.1, 60 sec: 5765.9, 300 sec: 5726.2). Total num frames: 171875328. Throughput: 0: 5910.0. Samples: 171879854. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:22,505][25689] Avg episode reward: [(0, '-52.727')] [2022-07-09 08:27:23,919][26022] Updated weights on worker 0-0, policy_version 167854 (0.00089) [2022-07-09 08:27:25,500][26022] Updated weights on worker 0-0, policy_version 167864 (0.00085) [2022-07-09 08:27:27,398][26022] Updated weights on worker 0-0, policy_version 167874 (0.00081) [2022-07-09 08:27:27,511][25689] Fps is (10 sec: 5815.4, 60 sec: 5751.0, 300 sec: 5724.0). Total num frames: 171902976. Throughput: 0: 5167.3. Samples: 171897290. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:27,511][25689] Avg episode reward: [(0, '-52.679')] [2022-07-09 08:27:28,919][26022] Updated weights on worker 0-0, policy_version 167884 (0.00083) [2022-07-09 08:27:31,217][26022] Updated weights on worker 0-0, policy_version 167894 (0.00081) [2022-07-09 08:27:32,411][26022] Updated weights on worker 0-0, policy_version 167904 (0.00090) [2022-07-09 08:27:32,518][25689] Fps is (10 sec: 5829.4, 60 sec: 5785.2, 300 sec: 5732.0). Total num frames: 171933696. Throughput: 0: 6024.1. Samples: 171931896. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:32,518][25689] Avg episode reward: [(0, '-51.920')] [2022-07-09 08:27:34,489][26022] Updated weights on worker 0-0, policy_version 167914 (0.00085) [2022-07-09 08:27:35,954][26022] Updated weights on worker 0-0, policy_version 167924 (0.00095) [2022-07-09 08:27:37,619][25689] Fps is (10 sec: 5875.8, 60 sec: 5763.0, 300 sec: 5726.8). Total num frames: 171962368. Throughput: 0: 6057.5. Samples: 171967084. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:37,619][25689] Avg episode reward: [(0, '-51.695')] [2022-07-09 08:27:37,885][26022] Updated weights on worker 0-0, policy_version 167934 (0.00072) [2022-07-09 08:27:39,717][26022] Updated weights on worker 0-0, policy_version 167944 (0.00095) [2022-07-09 08:27:41,298][26022] Updated weights on worker 0-0, policy_version 167954 (0.00085) [2022-07-09 08:27:42,654][25689] Fps is (10 sec: 5556.4, 60 sec: 5710.4, 300 sec: 5726.5). Total num frames: 171990016. Throughput: 0: 5193.6. Samples: 171984500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:42,655][25689] Avg episode reward: [(0, '-51.714')] [2022-07-09 08:27:43,161][26022] Updated weights on worker 0-0, policy_version 167964 (0.00084) [2022-07-09 08:27:45,012][26022] Updated weights on worker 0-0, policy_version 167974 (0.00088) [2022-07-09 08:27:46,714][26022] Updated weights on worker 0-0, policy_version 167984 (0.00085) [2022-07-09 08:27:47,668][25689] Fps is (10 sec: 5910.2, 60 sec: 5778.2, 300 sec: 5733.9). Total num frames: 172021760. Throughput: 0: 6043.6. Samples: 172019110. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:47,668][25689] Avg episode reward: [(0, '-51.481')] [2022-07-09 08:27:48,707][26022] Updated weights on worker 0-0, policy_version 167994 (0.00089) [2022-07-09 08:27:50,287][26022] Updated weights on worker 0-0, policy_version 168004 (0.00084) [2022-07-09 08:27:52,125][26022] Updated weights on worker 0-0, policy_version 168014 (0.00081) [2022-07-09 08:27:52,676][25689] Fps is (10 sec: 5926.1, 60 sec: 5761.5, 300 sec: 5730.9). Total num frames: 172049408. Throughput: 0: 6042.1. Samples: 172053694. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:52,678][25689] Avg episode reward: [(0, '-51.401')] [2022-07-09 08:27:53,923][26022] Updated weights on worker 0-0, policy_version 168024 (0.00088) [2022-07-09 08:27:55,593][26022] Updated weights on worker 0-0, policy_version 168034 (0.00080) [2022-07-09 08:27:57,481][26022] Updated weights on worker 0-0, policy_version 168044 (0.00463) [2022-07-09 08:27:57,765][25689] Fps is (10 sec: 5678.9, 60 sec: 5741.0, 300 sec: 5736.7). Total num frames: 172079104. Throughput: 0: 5157.4. Samples: 172070990. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:27:57,767][25689] Avg episode reward: [(0, '-50.799')] [2022-07-09 08:27:59,216][26022] Updated weights on worker 0-0, policy_version 168054 (0.00085) [2022-07-09 08:28:00,934][26022] Updated weights on worker 0-0, policy_version 168064 (0.00091) [2022-07-09 08:28:02,785][25689] Fps is (10 sec: 5469.8, 60 sec: 5740.7, 300 sec: 5733.6). Total num frames: 172104704. Throughput: 0: 6002.5. Samples: 172105340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:28:02,787][25689] Avg episode reward: [(0, '-52.227')] [2022-07-09 08:28:03,355][26022] Updated weights on worker 0-0, policy_version 168074 (0.00088) [2022-07-09 08:28:04,713][26022] Updated weights on worker 0-0, policy_version 168084 (0.00089) [2022-07-09 08:28:06,772][26022] Updated weights on worker 0-0, policy_version 168094 (0.00086) [2022-07-09 08:28:07,830][25689] Fps is (10 sec: 5494.2, 60 sec: 5720.4, 300 sec: 5736.9). Total num frames: 172134400. Throughput: 0: 5907.6. Samples: 172138220. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 08:28:07,831][25689] Avg episode reward: [(0, '-51.520')] [2022-07-09 08:28:08,266][26022] Updated weights on worker 0-0, policy_version 168104 (0.00085) [2022-07-09 08:28:10,337][26022] Updated weights on worker 0-0, policy_version 168114 (0.00096) [2022-07-09 08:28:11,869][26022] Updated weights on worker 0-0, policy_version 168124 (0.00090) [2022-07-09 08:28:12,841][25689] Fps is (10 sec: 5804.2, 60 sec: 5738.0, 300 sec: 5738.2). Total num frames: 172163072. Throughput: 0: 5918.8. Samples: 172173050. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:12,842][25689] Avg episode reward: [(0, '-50.892')] [2022-07-09 08:28:13,848][26022] Updated weights on worker 0-0, policy_version 168134 (0.00087) [2022-07-09 08:28:15,304][26022] Updated weights on worker 0-0, policy_version 168144 (0.00090) [2022-07-09 08:28:17,493][26022] Updated weights on worker 0-0, policy_version 168154 (0.00050) [2022-07-09 08:28:17,885][25689] Fps is (10 sec: 5702.9, 60 sec: 5746.2, 300 sec: 5741.2). Total num frames: 172191744. Throughput: 0: 5940.7. Samples: 172190514. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:17,886][25689] Avg episode reward: [(0, '-50.763')] [2022-07-09 08:28:19,018][26022] Updated weights on worker 0-0, policy_version 168164 (0.00095) [2022-07-09 08:28:20,828][26022] Updated weights on worker 0-0, policy_version 168174 (0.00104) [2022-07-09 08:28:22,441][26022] Updated weights on worker 0-0, policy_version 168184 (0.00336) [2022-07-09 08:28:22,907][25689] Fps is (10 sec: 5900.7, 60 sec: 5747.1, 300 sec: 5744.4). Total num frames: 172222464. Throughput: 0: 5970.0. Samples: 172225466. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:22,907][25689] Avg episode reward: [(0, '-50.946')] [2022-07-09 08:28:24,483][26022] Updated weights on worker 0-0, policy_version 168194 (0.00089) [2022-07-09 08:28:26,217][26022] Updated weights on worker 0-0, policy_version 168204 (0.00086) [2022-07-09 08:28:27,898][26022] Updated weights on worker 0-0, policy_version 168214 (0.00091) [2022-07-09 08:28:27,914][25689] Fps is (10 sec: 5921.9, 60 sec: 5763.9, 300 sec: 5744.4). Total num frames: 172251136. Throughput: 0: 6053.8. Samples: 172259808. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:27,915][25689] Avg episode reward: [(0, '-50.834')] [2022-07-09 08:28:29,736][26022] Updated weights on worker 0-0, policy_version 168224 (0.00092) [2022-07-09 08:28:31,667][26022] Updated weights on worker 0-0, policy_version 168234 (0.00086) [2022-07-09 08:28:32,924][25689] Fps is (10 sec: 5622.5, 60 sec: 5712.8, 300 sec: 5745.1). Total num frames: 172278784. Throughput: 0: 5165.3. Samples: 172276780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:32,924][25689] Avg episode reward: [(0, '-51.529')] [2022-07-09 08:28:33,338][26022] Updated weights on worker 0-0, policy_version 168244 (0.00090) [2022-07-09 08:28:35,318][26022] Updated weights on worker 0-0, policy_version 168254 (0.00083) [2022-07-09 08:28:36,852][26022] Updated weights on worker 0-0, policy_version 168264 (0.00089) [2022-07-09 08:28:37,968][25689] Fps is (10 sec: 5704.0, 60 sec: 5735.2, 300 sec: 5744.5). Total num frames: 172308480. Throughput: 0: 6019.5. Samples: 172311402. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:37,969][25689] Avg episode reward: [(0, '-52.035')] [2022-07-09 08:28:38,728][26022] Updated weights on worker 0-0, policy_version 168274 (0.00091) [2022-07-09 08:28:39,414][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:28:39,430][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000168277_172315648.pth [2022-07-09 08:28:39,431][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000166262_170252288.pth [2022-07-09 08:28:40,487][26022] Updated weights on worker 0-0, policy_version 168284 (0.00094) [2022-07-09 08:28:42,487][26022] Updated weights on worker 0-0, policy_version 168294 (0.00089) [2022-07-09 08:28:42,991][25689] Fps is (10 sec: 5594.4, 60 sec: 5719.3, 300 sec: 5741.0). Total num frames: 172335104. Throughput: 0: 5984.8. Samples: 172345664. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:42,992][25689] Avg episode reward: [(0, '-52.981')] [2022-07-09 08:28:44,108][26022] Updated weights on worker 0-0, policy_version 168304 (0.00088) [2022-07-09 08:28:46,088][26022] Updated weights on worker 0-0, policy_version 168314 (0.00093) [2022-07-09 08:28:47,551][26022] Updated weights on worker 0-0, policy_version 168324 (0.00085) [2022-07-09 08:28:47,999][25689] Fps is (10 sec: 5818.7, 60 sec: 5719.9, 300 sec: 5747.8). Total num frames: 172366848. Throughput: 0: 5137.6. Samples: 172362994. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:47,999][25689] Avg episode reward: [(0, '-53.226')] [2022-07-09 08:28:49,739][26022] Updated weights on worker 0-0, policy_version 168334 (0.00088) [2022-07-09 08:28:50,995][26022] Updated weights on worker 0-0, policy_version 168344 (0.00089) [2022-07-09 08:28:53,021][25689] Fps is (10 sec: 5716.9, 60 sec: 5684.6, 300 sec: 5741.9). Total num frames: 172392448. Throughput: 0: 5986.2. Samples: 172397090. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:53,023][25689] Avg episode reward: [(0, '-52.884')] [2022-07-09 08:28:53,235][26022] Updated weights on worker 0-0, policy_version 168354 (0.00095) [2022-07-09 08:28:54,781][26022] Updated weights on worker 0-0, policy_version 168364 (0.00092) [2022-07-09 08:28:56,695][26022] Updated weights on worker 0-0, policy_version 168374 (0.00087) [2022-07-09 08:28:58,091][25689] Fps is (10 sec: 5478.8, 60 sec: 5686.4, 300 sec: 5740.8). Total num frames: 172422144. Throughput: 0: 5969.7. Samples: 172431534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:28:58,092][25689] Avg episode reward: [(0, '-52.342')] [2022-07-09 08:28:58,559][26022] Updated weights on worker 0-0, policy_version 168384 (0.00088) [2022-07-09 08:29:00,267][26022] Updated weights on worker 0-0, policy_version 168394 (0.00093) [2022-07-09 08:29:02,242][26022] Updated weights on worker 0-0, policy_version 168404 (0.00098) [2022-07-09 08:29:03,106][25689] Fps is (10 sec: 5686.2, 60 sec: 5720.9, 300 sec: 5745.1). Total num frames: 172449792. Throughput: 0: 5121.3. Samples: 172448682. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:03,108][25689] Avg episode reward: [(0, '-50.864')] [2022-07-09 08:29:04,365][26022] Updated weights on worker 0-0, policy_version 168414 (0.00093) [2022-07-09 08:29:05,815][26022] Updated weights on worker 0-0, policy_version 168424 (0.00087) [2022-07-09 08:29:07,903][26022] Updated weights on worker 0-0, policy_version 168434 (0.00086) [2022-07-09 08:29:08,110][25689] Fps is (10 sec: 5519.4, 60 sec: 5690.8, 300 sec: 5745.1). Total num frames: 172477440. Throughput: 0: 5874.2. Samples: 172481132. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:08,110][25689] Avg episode reward: [(0, '-51.298')] [2022-07-09 08:29:09,667][26022] Updated weights on worker 0-0, policy_version 168444 (0.00441) [2022-07-09 08:29:11,502][26022] Updated weights on worker 0-0, policy_version 168454 (0.00082) [2022-07-09 08:29:13,123][25689] Fps is (10 sec: 5725.0, 60 sec: 5707.6, 300 sec: 5746.4). Total num frames: 172507136. Throughput: 0: 5881.4. Samples: 172515314. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:13,123][25689] Avg episode reward: [(0, '-51.030')] [2022-07-09 08:29:13,130][26022] Updated weights on worker 0-0, policy_version 168464 (0.00089) [2022-07-09 08:29:14,927][26022] Updated weights on worker 0-0, policy_version 168474 (0.00100) [2022-07-09 08:29:16,608][26022] Updated weights on worker 0-0, policy_version 168484 (0.00095) [2022-07-09 08:29:18,240][25689] Fps is (10 sec: 5761.6, 60 sec: 5700.6, 300 sec: 5745.3). Total num frames: 172535808. Throughput: 0: 5023.5. Samples: 172532754. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:18,241][25689] Avg episode reward: [(0, '-50.397')] [2022-07-09 08:29:18,416][26022] Updated weights on worker 0-0, policy_version 168494 (0.00088) [2022-07-09 08:29:20,090][26022] Updated weights on worker 0-0, policy_version 168504 (0.00097) [2022-07-09 08:29:22,183][26022] Updated weights on worker 0-0, policy_version 168514 (0.00090) [2022-07-09 08:29:23,272][25689] Fps is (10 sec: 5649.9, 60 sec: 5665.8, 300 sec: 5745.2). Total num frames: 172564480. Throughput: 0: 5892.6. Samples: 172567512. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:23,273][25689] Avg episode reward: [(0, '-51.191')] [2022-07-09 08:29:23,561][26022] Updated weights on worker 0-0, policy_version 168524 (0.00087) [2022-07-09 08:29:25,802][26022] Updated weights on worker 0-0, policy_version 168534 (0.00089) [2022-07-09 08:29:27,151][26022] Updated weights on worker 0-0, policy_version 168544 (0.00090) [2022-07-09 08:29:28,277][25689] Fps is (10 sec: 5713.6, 60 sec: 5666.0, 300 sec: 5745.3). Total num frames: 172593152. Throughput: 0: 5984.2. Samples: 172601814. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:28,277][25689] Avg episode reward: [(0, '-51.512')] [2022-07-09 08:29:29,278][26022] Updated weights on worker 0-0, policy_version 168554 (0.00107) [2022-07-09 08:29:30,909][26022] Updated weights on worker 0-0, policy_version 168564 (0.00095) [2022-07-09 08:29:32,831][26022] Updated weights on worker 0-0, policy_version 168574 (0.00091) [2022-07-09 08:29:33,288][25689] Fps is (10 sec: 5725.4, 60 sec: 5682.8, 300 sec: 5746.2). Total num frames: 172621824. Throughput: 0: 5141.1. Samples: 172618986. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:33,293][25689] Avg episode reward: [(0, '-51.348')] [2022-07-09 08:29:34,733][26022] Updated weights on worker 0-0, policy_version 168584 (0.00084) [2022-07-09 08:29:36,459][26022] Updated weights on worker 0-0, policy_version 168594 (0.00099) [2022-07-09 08:29:38,222][26022] Updated weights on worker 0-0, policy_version 168604 (0.01203) [2022-07-09 08:29:38,391][25689] Fps is (10 sec: 5669.5, 60 sec: 5660.3, 300 sec: 5737.4). Total num frames: 172650496. Throughput: 0: 5966.8. Samples: 172652990. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:38,392][25689] Avg episode reward: [(0, '-50.725')] [2022-07-09 08:29:40,221][26022] Updated weights on worker 0-0, policy_version 168614 (0.00088) [2022-07-09 08:29:41,641][26022] Updated weights on worker 0-0, policy_version 168624 (0.00085) [2022-07-09 08:29:43,396][25689] Fps is (10 sec: 5571.5, 60 sec: 5678.9, 300 sec: 5731.6). Total num frames: 172678144. Throughput: 0: 5942.2. Samples: 172687094. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:43,397][25689] Avg episode reward: [(0, '-51.483')] [2022-07-09 08:29:43,750][26022] Updated weights on worker 0-0, policy_version 168634 (0.00093) [2022-07-09 08:29:45,352][26022] Updated weights on worker 0-0, policy_version 168644 (0.00087) [2022-07-09 08:29:47,260][26022] Updated weights on worker 0-0, policy_version 168654 (0.00084) [2022-07-09 08:29:48,496][25689] Fps is (10 sec: 5776.5, 60 sec: 5653.4, 300 sec: 5737.0). Total num frames: 172708864. Throughput: 0: 5075.4. Samples: 172704432. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:48,496][25689] Avg episode reward: [(0, '-51.851')] [2022-07-09 08:29:48,964][26022] Updated weights on worker 0-0, policy_version 168664 (0.00092) [2022-07-09 08:29:50,594][26022] Updated weights on worker 0-0, policy_version 168674 (0.00081) [2022-07-09 08:29:52,455][26022] Updated weights on worker 0-0, policy_version 168684 (0.00084) [2022-07-09 08:29:53,534][25689] Fps is (10 sec: 5959.4, 60 sec: 5719.6, 300 sec: 5737.4). Total num frames: 172738560. Throughput: 0: 5960.1. Samples: 172739656. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:53,536][25689] Avg episode reward: [(0, '-51.329')] [2022-07-09 08:29:54,203][26022] Updated weights on worker 0-0, policy_version 168694 (0.00081) [2022-07-09 08:29:55,818][26022] Updated weights on worker 0-0, policy_version 168704 (0.00083) [2022-07-09 08:29:57,691][26022] Updated weights on worker 0-0, policy_version 168714 (0.00082) [2022-07-09 08:29:58,664][25689] Fps is (10 sec: 5740.0, 60 sec: 5697.1, 300 sec: 5741.9). Total num frames: 172767232. Throughput: 0: 6001.0. Samples: 172774648. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:29:58,665][25689] Avg episode reward: [(0, '-51.599')] [2022-07-09 08:29:59,360][26022] Updated weights on worker 0-0, policy_version 168724 (0.00082) [2022-07-09 08:30:01,286][26022] Updated weights on worker 0-0, policy_version 168734 (0.00090) [2022-07-09 08:30:03,384][26022] Updated weights on worker 0-0, policy_version 168744 (0.00083) [2022-07-09 08:30:03,722][25689] Fps is (10 sec: 5528.1, 60 sec: 5693.0, 300 sec: 5737.6). Total num frames: 172794880. Throughput: 0: 5159.7. Samples: 172791968. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:30:03,723][25689] Avg episode reward: [(0, '-51.018')] [2022-07-09 08:30:05,240][26022] Updated weights on worker 0-0, policy_version 168754 (0.00094) [2022-07-09 08:30:06,907][26022] Updated weights on worker 0-0, policy_version 168764 (0.00083) [2022-07-09 08:30:08,670][26022] Updated weights on worker 0-0, policy_version 168774 (0.00106) [2022-07-09 08:30:08,747][25689] Fps is (10 sec: 5687.3, 60 sec: 5724.8, 300 sec: 5740.7). Total num frames: 172824576. Throughput: 0: 5936.6. Samples: 172824656. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:30:08,748][25689] Avg episode reward: [(0, '-51.660')] [2022-07-09 08:30:10,293][26022] Updated weights on worker 0-0, policy_version 168784 (0.00084) [2022-07-09 08:30:12,197][26022] Updated weights on worker 0-0, policy_version 168794 (0.00085) [2022-07-09 08:30:13,735][26022] Updated weights on worker 0-0, policy_version 168804 (0.00083) [2022-07-09 08:30:13,763][25689] Fps is (10 sec: 6017.3, 60 sec: 5741.4, 300 sec: 5744.6). Total num frames: 172855296. Throughput: 0: 5949.7. Samples: 172860008. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:30:13,763][25689] Avg episode reward: [(0, '-50.122')] [2022-07-09 08:30:15,650][26022] Updated weights on worker 0-0, policy_version 168814 (0.00616) [2022-07-09 08:30:17,334][26022] Updated weights on worker 0-0, policy_version 168824 (0.00078) [2022-07-09 08:30:18,817][25689] Fps is (10 sec: 5897.9, 60 sec: 5747.4, 300 sec: 5740.2). Total num frames: 172883968. Throughput: 0: 5118.2. Samples: 172877794. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-09 08:30:18,819][25689] Avg episode reward: [(0, '-49.937')] [2022-07-09 08:30:19,183][26022] Updated weights on worker 0-0, policy_version 168834 (0.00081) [2022-07-09 08:30:21,108][26022] Updated weights on worker 0-0, policy_version 168844 (0.00107) [2022-07-09 08:30:22,399][26022] Updated weights on worker 0-0, policy_version 168854 (0.00896) [2022-07-09 08:30:23,905][25689] Fps is (10 sec: 5754.9, 60 sec: 5759.0, 300 sec: 5749.3). Total num frames: 172913664. Throughput: 0: 5991.3. Samples: 172912888. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:23,905][25689] Avg episode reward: [(0, '-49.422')] [2022-07-09 08:30:24,715][26022] Updated weights on worker 0-0, policy_version 168864 (0.00091) [2022-07-09 08:30:26,024][26022] Updated weights on worker 0-0, policy_version 168874 (0.00094) [2022-07-09 08:30:28,092][26022] Updated weights on worker 0-0, policy_version 168884 (0.00089) [2022-07-09 08:30:28,921][25689] Fps is (10 sec: 5675.3, 60 sec: 5741.0, 300 sec: 5739.0). Total num frames: 172941312. Throughput: 0: 6104.4. Samples: 172947808. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:28,922][25689] Avg episode reward: [(0, '-49.631')] [2022-07-09 08:30:29,565][26022] Updated weights on worker 0-0, policy_version 168894 (0.00090) [2022-07-09 08:30:31,281][26022] Updated weights on worker 0-0, policy_version 168904 (0.00080) [2022-07-09 08:30:33,092][26022] Updated weights on worker 0-0, policy_version 168914 (0.00082) [2022-07-09 08:30:33,942][25689] Fps is (10 sec: 5713.2, 60 sec: 5757.0, 300 sec: 5737.6). Total num frames: 172971008. Throughput: 0: 5204.3. Samples: 172965028. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:33,943][25689] Avg episode reward: [(0, '-49.735')] [2022-07-09 08:30:34,962][26022] Updated weights on worker 0-0, policy_version 168924 (0.00095) [2022-07-09 08:30:36,806][26022] Updated weights on worker 0-0, policy_version 168934 (0.00087) [2022-07-09 08:30:38,864][26022] Updated weights on worker 0-0, policy_version 168944 (0.00085) [2022-07-09 08:30:38,990][25689] Fps is (10 sec: 5695.5, 60 sec: 5745.4, 300 sec: 5737.2). Total num frames: 172998656. Throughput: 0: 6017.1. Samples: 172999174. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:38,990][25689] Avg episode reward: [(0, '-49.716')] [2022-07-09 08:30:39,725][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:30:39,735][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000168948_173002752.pth [2022-07-09 08:30:39,744][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000166926_170932224.pth [2022-07-09 08:30:40,165][26022] Updated weights on worker 0-0, policy_version 168954 (0.00086) [2022-07-09 08:30:42,379][26022] Updated weights on worker 0-0, policy_version 168964 (0.00087) [2022-07-09 08:30:43,805][26022] Updated weights on worker 0-0, policy_version 168974 (0.00087) [2022-07-09 08:30:44,015][25689] Fps is (10 sec: 5896.1, 60 sec: 5811.0, 300 sec: 5743.7). Total num frames: 173030400. Throughput: 0: 6025.3. Samples: 173034058. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:44,016][25689] Avg episode reward: [(0, '-50.162')] [2022-07-09 08:30:45,868][26022] Updated weights on worker 0-0, policy_version 168984 (0.00093) [2022-07-09 08:30:47,529][26022] Updated weights on worker 0-0, policy_version 168994 (0.00084) [2022-07-09 08:30:49,063][25689] Fps is (10 sec: 5896.0, 60 sec: 5765.2, 300 sec: 5736.0). Total num frames: 173058048. Throughput: 0: 5141.2. Samples: 173051360. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:49,063][25689] Avg episode reward: [(0, '-50.711')] [2022-07-09 08:30:49,426][26022] Updated weights on worker 0-0, policy_version 169004 (0.00092) [2022-07-09 08:30:51,011][26022] Updated weights on worker 0-0, policy_version 169014 (0.00093) [2022-07-09 08:30:52,895][26022] Updated weights on worker 0-0, policy_version 169024 (0.00105) [2022-07-09 08:30:54,067][25689] Fps is (10 sec: 5603.0, 60 sec: 5751.6, 300 sec: 5737.4). Total num frames: 173086720. Throughput: 0: 6004.1. Samples: 173085860. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:54,070][25689] Avg episode reward: [(0, '-50.824')] [2022-07-09 08:30:54,526][26022] Updated weights on worker 0-0, policy_version 169034 (0.00089) [2022-07-09 08:30:56,431][26022] Updated weights on worker 0-0, policy_version 169044 (0.00088) [2022-07-09 08:30:58,142][26022] Updated weights on worker 0-0, policy_version 169054 (0.00089) [2022-07-09 08:30:59,127][25689] Fps is (10 sec: 5799.7, 60 sec: 5775.2, 300 sec: 5737.9). Total num frames: 173116416. Throughput: 0: 6016.0. Samples: 173120318. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:30:59,127][25689] Avg episode reward: [(0, '-50.926')] [2022-07-09 08:30:59,857][26022] Updated weights on worker 0-0, policy_version 169064 (0.00088) [2022-07-09 08:31:02,223][26022] Updated weights on worker 0-0, policy_version 169074 (0.00095) [2022-07-09 08:31:03,928][26022] Updated weights on worker 0-0, policy_version 169084 (0.00083) [2022-07-09 08:31:04,130][25689] Fps is (10 sec: 5494.8, 60 sec: 5746.5, 300 sec: 5731.8). Total num frames: 173142016. Throughput: 0: 5127.1. Samples: 173137190. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:04,131][25689] Avg episode reward: [(0, '-51.684')] [2022-07-09 08:31:05,582][26022] Updated weights on worker 0-0, policy_version 169094 (0.00091) [2022-07-09 08:31:07,554][26022] Updated weights on worker 0-0, policy_version 169104 (0.00082) [2022-07-09 08:31:09,135][25689] Fps is (10 sec: 5525.2, 60 sec: 5748.5, 300 sec: 5739.1). Total num frames: 173171712. Throughput: 0: 5901.9. Samples: 173169822. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:09,138][25689] Avg episode reward: [(0, '-51.441')] [2022-07-09 08:31:09,259][26022] Updated weights on worker 0-0, policy_version 169114 (0.00083) [2022-07-09 08:31:11,302][26022] Updated weights on worker 0-0, policy_version 169124 (0.00092) [2022-07-09 08:31:12,915][26022] Updated weights on worker 0-0, policy_version 169134 (0.00087) [2022-07-09 08:31:14,146][25689] Fps is (10 sec: 5827.5, 60 sec: 5714.9, 300 sec: 5729.7). Total num frames: 173200384. Throughput: 0: 5901.2. Samples: 173204352. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:14,147][25689] Avg episode reward: [(0, '-51.514')] [2022-07-09 08:31:14,711][26022] Updated weights on worker 0-0, policy_version 169144 (0.00087) [2022-07-09 08:31:16,518][26022] Updated weights on worker 0-0, policy_version 169154 (0.01177) [2022-07-09 08:31:18,218][26022] Updated weights on worker 0-0, policy_version 169164 (0.00091) [2022-07-09 08:31:19,200][25689] Fps is (10 sec: 5595.3, 60 sec: 5698.0, 300 sec: 5725.3). Total num frames: 173228032. Throughput: 0: 5050.7. Samples: 173221704. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:19,201][25689] Avg episode reward: [(0, '-51.312')] [2022-07-09 08:31:20,096][26022] Updated weights on worker 0-0, policy_version 169174 (0.00098) [2022-07-09 08:31:21,651][26022] Updated weights on worker 0-0, policy_version 169184 (0.00088) [2022-07-09 08:31:23,515][26022] Updated weights on worker 0-0, policy_version 169194 (0.00085) [2022-07-09 08:31:24,201][25689] Fps is (10 sec: 5804.8, 60 sec: 5723.2, 300 sec: 5732.7). Total num frames: 173258752. Throughput: 0: 5955.0. Samples: 173256714. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:24,202][25689] Avg episode reward: [(0, '-51.886')] [2022-07-09 08:31:25,295][26022] Updated weights on worker 0-0, policy_version 169204 (0.00088) [2022-07-09 08:31:27,063][26022] Updated weights on worker 0-0, policy_version 169214 (0.00082) [2022-07-09 08:31:28,798][26022] Updated weights on worker 0-0, policy_version 169224 (0.00084) [2022-07-09 08:31:29,215][25689] Fps is (10 sec: 5930.5, 60 sec: 5740.4, 300 sec: 5732.6). Total num frames: 173287424. Throughput: 0: 6073.7. Samples: 173291782. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:29,217][25689] Avg episode reward: [(0, '-51.708')] [2022-07-09 08:31:30,360][26022] Updated weights on worker 0-0, policy_version 169234 (0.00086) [2022-07-09 08:31:32,436][26022] Updated weights on worker 0-0, policy_version 169244 (0.00085) [2022-07-09 08:31:33,971][26022] Updated weights on worker 0-0, policy_version 169254 (0.00090) [2022-07-09 08:31:34,235][25689] Fps is (10 sec: 5715.0, 60 sec: 5723.5, 300 sec: 5729.7). Total num frames: 173316096. Throughput: 0: 5230.2. Samples: 173309422. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:34,237][25689] Avg episode reward: [(0, '-51.525')] [2022-07-09 08:31:35,731][26022] Updated weights on worker 0-0, policy_version 169264 (0.00076) [2022-07-09 08:31:37,703][26022] Updated weights on worker 0-0, policy_version 169274 (0.00083) [2022-07-09 08:31:39,308][25689] Fps is (10 sec: 5884.4, 60 sec: 5772.0, 300 sec: 5728.6). Total num frames: 173346816. Throughput: 0: 6090.7. Samples: 173344174. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:39,308][25689] Avg episode reward: [(0, '-52.708')] [2022-07-09 08:31:39,310][26022] Updated weights on worker 0-0, policy_version 169284 (0.00088) [2022-07-09 08:31:41,165][26022] Updated weights on worker 0-0, policy_version 169294 (0.00086) [2022-07-09 08:31:42,939][26022] Updated weights on worker 0-0, policy_version 169304 (0.00085) [2022-07-09 08:31:44,343][25689] Fps is (10 sec: 5774.7, 60 sec: 5703.3, 300 sec: 5728.2). Total num frames: 173374464. Throughput: 0: 6063.3. Samples: 173378838. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:44,344][25689] Avg episode reward: [(0, '-53.231')] [2022-07-09 08:31:44,768][26022] Updated weights on worker 0-0, policy_version 169314 (0.00089) [2022-07-09 08:31:46,305][26022] Updated weights on worker 0-0, policy_version 169324 (0.00086) [2022-07-09 08:31:48,185][26022] Updated weights on worker 0-0, policy_version 169334 (0.00085) [2022-07-09 08:31:49,400][25689] Fps is (10 sec: 5783.8, 60 sec: 5753.3, 300 sec: 5734.2). Total num frames: 173405184. Throughput: 0: 6047.1. Samples: 173413842. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:49,400][25689] Avg episode reward: [(0, '-53.054')] [2022-07-09 08:31:50,011][26022] Updated weights on worker 0-0, policy_version 169344 (0.00098) [2022-07-09 08:31:51,650][26022] Updated weights on worker 0-0, policy_version 169354 (0.00094) [2022-07-09 08:31:53,533][26022] Updated weights on worker 0-0, policy_version 169364 (0.00080) [2022-07-09 08:31:54,430][25689] Fps is (10 sec: 5989.1, 60 sec: 5767.7, 300 sec: 5731.2). Total num frames: 173434880. Throughput: 0: 6038.9. Samples: 173431378. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:54,431][25689] Avg episode reward: [(0, '-53.234')] [2022-07-09 08:31:55,107][26022] Updated weights on worker 0-0, policy_version 169374 (0.00083) [2022-07-09 08:31:57,035][26022] Updated weights on worker 0-0, policy_version 169384 (0.00085) [2022-07-09 08:31:58,796][26022] Updated weights on worker 0-0, policy_version 169394 (0.00089) [2022-07-09 08:31:59,498][25689] Fps is (10 sec: 5779.8, 60 sec: 5750.0, 300 sec: 5740.5). Total num frames: 173463552. Throughput: 0: 6053.1. Samples: 173466388. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:31:59,499][25689] Avg episode reward: [(0, '-53.404')] [2022-07-09 08:32:00,510][26022] Updated weights on worker 0-0, policy_version 169404 (0.00082) [2022-07-09 08:32:02,600][26022] Updated weights on worker 0-0, policy_version 169414 (0.00084) [2022-07-09 08:32:04,515][25689] Fps is (10 sec: 5381.8, 60 sec: 5748.8, 300 sec: 5723.2). Total num frames: 173489152. Throughput: 0: 5971.0. Samples: 173499286. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:32:04,515][25689] Avg episode reward: [(0, '-52.275')] [2022-07-09 08:32:04,544][26022] Updated weights on worker 0-0, policy_version 169424 (0.00088) [2022-07-09 08:32:06,179][26022] Updated weights on worker 0-0, policy_version 169434 (0.00103) [2022-07-09 08:32:07,967][26022] Updated weights on worker 0-0, policy_version 169444 (0.00093) [2022-07-09 08:32:09,524][25689] Fps is (10 sec: 5617.5, 60 sec: 5765.3, 300 sec: 5733.7). Total num frames: 173519872. Throughput: 0: 5106.7. Samples: 173516612. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:32:09,524][25689] Avg episode reward: [(0, '-51.556')] [2022-07-09 08:32:09,589][26022] Updated weights on worker 0-0, policy_version 169454 (0.00085) [2022-07-09 08:32:11,581][26022] Updated weights on worker 0-0, policy_version 169464 (0.00088) [2022-07-09 08:32:13,304][26022] Updated weights on worker 0-0, policy_version 169474 (0.00085) [2022-07-09 08:32:14,554][25689] Fps is (10 sec: 5916.1, 60 sec: 5763.5, 300 sec: 5735.6). Total num frames: 173548544. Throughput: 0: 5964.4. Samples: 173551402. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:32:14,554][25689] Avg episode reward: [(0, '-51.252')] [2022-07-09 08:32:14,896][26022] Updated weights on worker 0-0, policy_version 169484 (0.00082) [2022-07-09 08:32:16,720][26022] Updated weights on worker 0-0, policy_version 169494 (0.00094) [2022-07-09 08:32:18,542][26022] Updated weights on worker 0-0, policy_version 169504 (0.00081) [2022-07-09 08:32:19,688][25689] Fps is (10 sec: 5742.3, 60 sec: 5789.7, 300 sec: 5730.2). Total num frames: 173578240. Throughput: 0: 5939.1. Samples: 173586300. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:32:19,689][25689] Avg episode reward: [(0, '-51.597')] [2022-07-09 08:32:20,147][26022] Updated weights on worker 0-0, policy_version 169514 (0.00093) [2022-07-09 08:32:22,033][26022] Updated weights on worker 0-0, policy_version 169524 (0.00080) [2022-07-09 08:32:23,792][26022] Updated weights on worker 0-0, policy_version 169534 (0.00091) [2022-07-09 08:32:24,721][25689] Fps is (10 sec: 5740.7, 60 sec: 5752.8, 300 sec: 5733.1). Total num frames: 173606912. Throughput: 0: 5166.3. Samples: 173603678. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2022-07-09 08:32:24,722][25689] Avg episode reward: [(0, '-51.185')] [2022-07-09 08:32:25,642][26022] Updated weights on worker 0-0, policy_version 169544 (0.00087) [2022-07-09 08:32:27,544][26022] Updated weights on worker 0-0, policy_version 169554 (0.00092) [2022-07-09 08:32:29,211][26022] Updated weights on worker 0-0, policy_version 169564 (0.00085) [2022-07-09 08:32:29,813][25689] Fps is (10 sec: 5865.9, 60 sec: 5779.1, 300 sec: 5731.5). Total num frames: 173637632. Throughput: 0: 5983.1. Samples: 173638006. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:32:29,814][25689] Avg episode reward: [(0, '-52.071')] [2022-07-09 08:32:31,074][26022] Updated weights on worker 0-0, policy_version 169574 (0.00082) [2022-07-09 08:32:32,628][26022] Updated weights on worker 0-0, policy_version 169584 (0.00112) [2022-07-09 08:32:34,708][26022] Updated weights on worker 0-0, policy_version 169594 (0.00097) [2022-07-09 08:32:34,892][25689] Fps is (10 sec: 5638.0, 60 sec: 5739.8, 300 sec: 5725.0). Total num frames: 173664256. Throughput: 0: 5967.8. Samples: 173672778. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:32:34,893][25689] Avg episode reward: [(0, '-52.160')] [2022-07-09 08:32:36,127][26022] Updated weights on worker 0-0, policy_version 169604 (0.00085) [2022-07-09 08:32:38,235][26022] Updated weights on worker 0-0, policy_version 169614 (0.00090) [2022-07-09 08:32:39,646][26022] Updated weights on worker 0-0, policy_version 169624 (0.00087) [2022-07-09 08:32:39,925][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:32:39,935][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000169625_173696000.pth [2022-07-09 08:32:39,935][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000167605_171627520.pth [2022-07-09 08:32:39,936][25689] Fps is (10 sec: 5765.9, 60 sec: 5759.4, 300 sec: 5738.6). Total num frames: 173696000. Throughput: 0: 5118.4. Samples: 173689936. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:32:39,937][25689] Avg episode reward: [(0, '-52.767')] [2022-07-09 08:32:41,635][26022] Updated weights on worker 0-0, policy_version 169634 (0.00079) [2022-07-09 08:32:43,354][26022] Updated weights on worker 0-0, policy_version 169644 (0.00265) [2022-07-09 08:32:44,990][25689] Fps is (10 sec: 5983.0, 60 sec: 5774.5, 300 sec: 5727.5). Total num frames: 173724672. Throughput: 0: 5967.4. Samples: 173724630. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:32:44,991][25689] Avg episode reward: [(0, '-52.715')] [2022-07-09 08:32:45,110][26022] Updated weights on worker 0-0, policy_version 169654 (0.00095) [2022-07-09 08:32:46,870][26022] Updated weights on worker 0-0, policy_version 169664 (0.00087) [2022-07-09 08:32:48,716][26022] Updated weights on worker 0-0, policy_version 169674 (0.00087) [2022-07-09 08:32:50,007][25689] Fps is (10 sec: 5694.4, 60 sec: 5744.5, 300 sec: 5730.8). Total num frames: 173753344. Throughput: 0: 6006.7. Samples: 173759300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:32:50,007][25689] Avg episode reward: [(0, '-52.786')] [2022-07-09 08:32:50,467][26022] Updated weights on worker 0-0, policy_version 169684 (0.00088) [2022-07-09 08:32:52,318][26022] Updated weights on worker 0-0, policy_version 169694 (0.00093) [2022-07-09 08:32:53,905][26022] Updated weights on worker 0-0, policy_version 169704 (0.00089) [2022-07-09 08:32:55,030][25689] Fps is (10 sec: 5609.5, 60 sec: 5711.4, 300 sec: 5725.2). Total num frames: 173780992. Throughput: 0: 5158.3. Samples: 173776656. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:32:55,032][25689] Avg episode reward: [(0, '-52.535')] [2022-07-09 08:32:55,877][26022] Updated weights on worker 0-0, policy_version 169714 (0.00082) [2022-07-09 08:32:57,748][26022] Updated weights on worker 0-0, policy_version 169724 (0.00057) [2022-07-09 08:32:59,255][26022] Updated weights on worker 0-0, policy_version 169734 (0.00081) [2022-07-09 08:33:00,131][25689] Fps is (10 sec: 5765.3, 60 sec: 5742.1, 300 sec: 5740.8). Total num frames: 173811712. Throughput: 0: 6002.6. Samples: 173811154. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:00,131][25689] Avg episode reward: [(0, '-52.258')] [2022-07-09 08:33:01,463][26022] Updated weights on worker 0-0, policy_version 169744 (0.00090) [2022-07-09 08:33:03,147][26022] Updated weights on worker 0-0, policy_version 169754 (0.00078) [2022-07-09 08:33:05,182][25689] Fps is (10 sec: 5446.7, 60 sec: 5721.9, 300 sec: 5723.5). Total num frames: 173836288. Throughput: 0: 5898.4. Samples: 173843732. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:05,183][25689] Avg episode reward: [(0, '-52.295')] [2022-07-09 08:33:05,288][26022] Updated weights on worker 0-0, policy_version 169764 (0.00088) [2022-07-09 08:33:06,766][26022] Updated weights on worker 0-0, policy_version 169774 (0.00054) [2022-07-09 08:33:08,875][26022] Updated weights on worker 0-0, policy_version 169784 (0.00076) [2022-07-09 08:33:10,277][25689] Fps is (10 sec: 5449.7, 60 sec: 5713.9, 300 sec: 5728.8). Total num frames: 173867008. Throughput: 0: 5016.0. Samples: 173860978. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:10,278][25689] Avg episode reward: [(0, '-51.179')] [2022-07-09 08:33:10,471][26022] Updated weights on worker 0-0, policy_version 169794 (0.00080) [2022-07-09 08:33:12,330][26022] Updated weights on worker 0-0, policy_version 169804 (0.00081) [2022-07-09 08:33:13,966][26022] Updated weights on worker 0-0, policy_version 169814 (0.00090) [2022-07-09 08:33:15,335][25689] Fps is (10 sec: 5950.6, 60 sec: 5728.1, 300 sec: 5731.9). Total num frames: 173896704. Throughput: 0: 5852.7. Samples: 173895496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:15,336][25689] Avg episode reward: [(0, '-52.071')] [2022-07-09 08:33:16,018][26022] Updated weights on worker 0-0, policy_version 169824 (0.00083) [2022-07-09 08:33:17,466][26022] Updated weights on worker 0-0, policy_version 169834 (0.00081) [2022-07-09 08:33:19,362][26022] Updated weights on worker 0-0, policy_version 169844 (0.00081) [2022-07-09 08:33:20,386][25689] Fps is (10 sec: 5875.4, 60 sec: 5736.0, 300 sec: 5727.9). Total num frames: 173926400. Throughput: 0: 5877.2. Samples: 173930196. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:20,386][25689] Avg episode reward: [(0, '-51.873')] [2022-07-09 08:33:21,025][26022] Updated weights on worker 0-0, policy_version 169854 (0.00093) [2022-07-09 08:33:22,868][26022] Updated weights on worker 0-0, policy_version 169864 (0.00093) [2022-07-09 08:33:24,571][26022] Updated weights on worker 0-0, policy_version 169874 (0.00084) [2022-07-09 08:33:25,432][25689] Fps is (10 sec: 5781.2, 60 sec: 5734.8, 300 sec: 5727.2). Total num frames: 173955072. Throughput: 0: 5134.6. Samples: 173947704. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:25,432][25689] Avg episode reward: [(0, '-51.732')] [2022-07-09 08:33:26,472][26022] Updated weights on worker 0-0, policy_version 169884 (0.00086) [2022-07-09 08:33:28,192][26022] Updated weights on worker 0-0, policy_version 169894 (0.00079) [2022-07-09 08:33:30,039][26022] Updated weights on worker 0-0, policy_version 169904 (0.00082) [2022-07-09 08:33:30,463][25689] Fps is (10 sec: 5792.2, 60 sec: 5723.7, 300 sec: 5733.6). Total num frames: 173984768. Throughput: 0: 6017.9. Samples: 173982452. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:30,463][25689] Avg episode reward: [(0, '-51.726')] [2022-07-09 08:33:31,673][26022] Updated weights on worker 0-0, policy_version 169914 (0.00092) [2022-07-09 08:33:33,403][26022] Updated weights on worker 0-0, policy_version 169924 (0.00091) [2022-07-09 08:33:35,233][26022] Updated weights on worker 0-0, policy_version 169934 (0.00084) [2022-07-09 08:33:35,501][25689] Fps is (10 sec: 5796.6, 60 sec: 5761.3, 300 sec: 5730.3). Total num frames: 174013440. Throughput: 0: 6055.0. Samples: 174017598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:35,501][25689] Avg episode reward: [(0, '-52.372')] [2022-07-09 08:33:36,982][26022] Updated weights on worker 0-0, policy_version 169944 (0.00084) [2022-07-09 08:33:38,793][26022] Updated weights on worker 0-0, policy_version 169954 (0.00085) [2022-07-09 08:33:40,537][25689] Fps is (10 sec: 5794.0, 60 sec: 5728.3, 300 sec: 5740.4). Total num frames: 174043136. Throughput: 0: 6051.5. Samples: 174052140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:40,538][25689] Avg episode reward: [(0, '-52.337')] [2022-07-09 08:33:40,539][26022] Updated weights on worker 0-0, policy_version 169964 (0.00088) [2022-07-09 08:33:42,347][26022] Updated weights on worker 0-0, policy_version 169974 (0.00085) [2022-07-09 08:33:44,207][26022] Updated weights on worker 0-0, policy_version 169984 (0.00081) [2022-07-09 08:33:45,616][25689] Fps is (10 sec: 5770.5, 60 sec: 5725.9, 300 sec: 5728.7). Total num frames: 174071808. Throughput: 0: 6040.1. Samples: 174069618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:45,617][25689] Avg episode reward: [(0, '-53.150')] [2022-07-09 08:33:45,773][26022] Updated weights on worker 0-0, policy_version 169994 (0.00087) [2022-07-09 08:33:47,546][26022] Updated weights on worker 0-0, policy_version 170004 (0.00086) [2022-07-09 08:33:49,392][26022] Updated weights on worker 0-0, policy_version 170014 (0.00080) [2022-07-09 08:33:50,640][25689] Fps is (10 sec: 5777.3, 60 sec: 5742.1, 300 sec: 5742.4). Total num frames: 174101504. Throughput: 0: 6054.8. Samples: 174104620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:50,640][25689] Avg episode reward: [(0, '-53.246')] [2022-07-09 08:33:51,153][26022] Updated weights on worker 0-0, policy_version 170024 (0.00101) [2022-07-09 08:33:52,874][26022] Updated weights on worker 0-0, policy_version 170034 (0.00086) [2022-07-09 08:33:54,618][26022] Updated weights on worker 0-0, policy_version 170044 (0.00087) [2022-07-09 08:33:55,724][25689] Fps is (10 sec: 5875.4, 60 sec: 5770.1, 300 sec: 5742.2). Total num frames: 174131200. Throughput: 0: 6012.1. Samples: 174139184. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:33:55,725][25689] Avg episode reward: [(0, '-53.478')] [2022-07-09 08:33:56,451][26022] Updated weights on worker 0-0, policy_version 170054 (0.00086) [2022-07-09 08:33:58,451][26022] Updated weights on worker 0-0, policy_version 170064 (0.00087) [2022-07-09 08:34:00,067][26022] Updated weights on worker 0-0, policy_version 170074 (0.00091) [2022-07-09 08:34:00,802][25689] Fps is (10 sec: 5844.4, 60 sec: 5755.3, 300 sec: 5747.8). Total num frames: 174160896. Throughput: 0: 5146.0. Samples: 174156426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:00,802][25689] Avg episode reward: [(0, '-53.637')] [2022-07-09 08:34:02,260][26022] Updated weights on worker 0-0, policy_version 170084 (0.00083) [2022-07-09 08:34:03,957][26022] Updated weights on worker 0-0, policy_version 170094 (0.00091) [2022-07-09 08:34:05,844][25689] Fps is (10 sec: 5464.1, 60 sec: 5773.1, 300 sec: 5740.2). Total num frames: 174186496. Throughput: 0: 5910.7. Samples: 174189182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:05,845][25689] Avg episode reward: [(0, '-53.861')] [2022-07-09 08:34:05,850][26022] Updated weights on worker 0-0, policy_version 170104 (0.00088) [2022-07-09 08:34:07,366][26022] Updated weights on worker 0-0, policy_version 170114 (0.00084) [2022-07-09 08:34:09,259][26022] Updated weights on worker 0-0, policy_version 170124 (0.00069) [2022-07-09 08:34:10,890][25689] Fps is (10 sec: 5481.2, 60 sec: 5760.9, 300 sec: 5739.6). Total num frames: 174216192. Throughput: 0: 5878.6. Samples: 174223664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:10,891][25689] Avg episode reward: [(0, '-53.972')] [2022-07-09 08:34:11,060][26022] Updated weights on worker 0-0, policy_version 170134 (0.00081) [2022-07-09 08:34:12,808][26022] Updated weights on worker 0-0, policy_version 170144 (0.00083) [2022-07-09 08:34:14,581][26022] Updated weights on worker 0-0, policy_version 170154 (0.00088) [2022-07-09 08:34:15,904][25689] Fps is (10 sec: 5903.7, 60 sec: 5765.1, 300 sec: 5745.0). Total num frames: 174245888. Throughput: 0: 5054.3. Samples: 174241180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:15,906][25689] Avg episode reward: [(0, '-53.751')] [2022-07-09 08:34:16,423][26022] Updated weights on worker 0-0, policy_version 170164 (0.00085) [2022-07-09 08:34:18,255][26022] Updated weights on worker 0-0, policy_version 170174 (0.00087) [2022-07-09 08:34:19,909][26022] Updated weights on worker 0-0, policy_version 170184 (0.00087) [2022-07-09 08:34:21,025][25689] Fps is (10 sec: 5657.8, 60 sec: 5724.5, 300 sec: 5739.8). Total num frames: 174273536. Throughput: 0: 5914.2. Samples: 174276032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:21,027][25689] Avg episode reward: [(0, '-53.723')] [2022-07-09 08:34:21,561][26022] Updated weights on worker 0-0, policy_version 170194 (0.00089) [2022-07-09 08:34:23,287][26022] Updated weights on worker 0-0, policy_version 170204 (0.00085) [2022-07-09 08:34:25,122][26022] Updated weights on worker 0-0, policy_version 170214 (0.00095) [2022-07-09 08:34:26,057][25689] Fps is (10 sec: 5850.0, 60 sec: 5776.6, 300 sec: 5749.6). Total num frames: 174305280. Throughput: 0: 6030.9. Samples: 174311082. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:26,059][25689] Avg episode reward: [(0, '-53.136')] [2022-07-09 08:34:27,026][26022] Updated weights on worker 0-0, policy_version 170224 (0.00081) [2022-07-09 08:34:28,566][26022] Updated weights on worker 0-0, policy_version 170234 (0.00081) [2022-07-09 08:34:30,528][26022] Updated weights on worker 0-0, policy_version 170244 (0.00091) [2022-07-09 08:34:31,089][25689] Fps is (10 sec: 5901.9, 60 sec: 5742.7, 300 sec: 5745.8). Total num frames: 174332928. Throughput: 0: 5187.9. Samples: 174328452. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:31,089][25689] Avg episode reward: [(0, '-52.438')] [2022-07-09 08:34:31,959][26022] Updated weights on worker 0-0, policy_version 170254 (0.00084) [2022-07-09 08:34:33,962][26022] Updated weights on worker 0-0, policy_version 170264 (0.00085) [2022-07-09 08:34:35,783][26022] Updated weights on worker 0-0, policy_version 170274 (0.00087) [2022-07-09 08:34:36,107][25689] Fps is (10 sec: 5603.8, 60 sec: 5744.6, 300 sec: 5747.4). Total num frames: 174361600. Throughput: 0: 6044.8. Samples: 174363302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 08:34:36,109][25689] Avg episode reward: [(0, '-51.988')] [2022-07-09 08:34:37,366][26022] Updated weights on worker 0-0, policy_version 170284 (0.00084) [2022-07-09 08:34:39,361][26022] Updated weights on worker 0-0, policy_version 170294 (0.00092) [2022-07-09 08:34:40,050][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:34:40,067][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000170298_174385152.pth [2022-07-09 08:34:40,067][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000168277_172315648.pth [2022-07-09 08:34:40,854][26022] Updated weights on worker 0-0, policy_version 170304 (0.00084) [2022-07-09 08:34:41,211][25689] Fps is (10 sec: 5867.6, 60 sec: 5755.1, 300 sec: 5755.9). Total num frames: 174392320. Throughput: 0: 6031.6. Samples: 174397782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:34:41,211][25689] Avg episode reward: [(0, '-51.843')] [2022-07-09 08:34:42,961][26022] Updated weights on worker 0-0, policy_version 170314 (0.00084) [2022-07-09 08:34:44,598][26022] Updated weights on worker 0-0, policy_version 170324 (0.00090) [2022-07-09 08:34:46,220][25689] Fps is (10 sec: 5873.1, 60 sec: 5761.7, 300 sec: 5750.7). Total num frames: 174420992. Throughput: 0: 5166.6. Samples: 174415256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:34:46,220][25689] Avg episode reward: [(0, '-51.353')] [2022-07-09 08:34:46,325][26022] Updated weights on worker 0-0, policy_version 170334 (0.00081) [2022-07-09 08:34:48,030][26022] Updated weights on worker 0-0, policy_version 170344 (0.00087) [2022-07-09 08:34:50,100][26022] Updated weights on worker 0-0, policy_version 170354 (0.00083) [2022-07-09 08:34:51,235][25689] Fps is (10 sec: 5822.8, 60 sec: 5762.6, 300 sec: 5751.2). Total num frames: 174450688. Throughput: 0: 6052.2. Samples: 174450380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:34:51,235][25689] Avg episode reward: [(0, '-51.922')] [2022-07-09 08:34:51,407][26022] Updated weights on worker 0-0, policy_version 170364 (0.00086) [2022-07-09 08:34:53,738][26022] Updated weights on worker 0-0, policy_version 170374 (0.00090) [2022-07-09 08:34:54,861][26022] Updated weights on worker 0-0, policy_version 170384 (0.00091) [2022-07-09 08:34:56,250][25689] Fps is (10 sec: 5717.3, 60 sec: 5735.4, 300 sec: 5750.0). Total num frames: 174478336. Throughput: 0: 6061.3. Samples: 174485392. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:34:56,251][25689] Avg episode reward: [(0, '-52.480')] [2022-07-09 08:34:57,013][26022] Updated weights on worker 0-0, policy_version 170394 (0.00091) [2022-07-09 08:34:58,605][26022] Updated weights on worker 0-0, policy_version 170404 (0.00087) [2022-07-09 08:35:00,599][26022] Updated weights on worker 0-0, policy_version 170414 (0.00067) [2022-07-09 08:35:01,327][25689] Fps is (10 sec: 5682.1, 60 sec: 5735.4, 300 sec: 5756.5). Total num frames: 174508032. Throughput: 0: 5209.8. Samples: 174502582. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:01,328][25689] Avg episode reward: [(0, '-52.758')] [2022-07-09 08:35:02,505][26022] Updated weights on worker 0-0, policy_version 170424 (0.00086) [2022-07-09 08:35:04,352][26022] Updated weights on worker 0-0, policy_version 170434 (0.00092) [2022-07-09 08:35:05,955][26022] Updated weights on worker 0-0, policy_version 170444 (0.00087) [2022-07-09 08:35:06,354][25689] Fps is (10 sec: 5675.1, 60 sec: 5770.7, 300 sec: 5749.5). Total num frames: 174535680. Throughput: 0: 5967.6. Samples: 174535410. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:06,356][25689] Avg episode reward: [(0, '-52.954')] [2022-07-09 08:35:08,044][26022] Updated weights on worker 0-0, policy_version 170454 (0.00094) [2022-07-09 08:35:09,679][26022] Updated weights on worker 0-0, policy_version 170464 (0.00097) [2022-07-09 08:35:11,363][25689] Fps is (10 sec: 5509.8, 60 sec: 5740.4, 300 sec: 5739.4). Total num frames: 174563328. Throughput: 0: 5923.9. Samples: 174569616. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:11,363][25689] Avg episode reward: [(0, '-51.602')] [2022-07-09 08:35:11,635][26022] Updated weights on worker 0-0, policy_version 170474 (0.00074) [2022-07-09 08:35:13,211][26022] Updated weights on worker 0-0, policy_version 170484 (0.00092) [2022-07-09 08:35:15,215][26022] Updated weights on worker 0-0, policy_version 170494 (0.00083) [2022-07-09 08:35:16,383][25689] Fps is (10 sec: 5820.1, 60 sec: 5756.8, 300 sec: 5746.9). Total num frames: 174594048. Throughput: 0: 5047.0. Samples: 174587004. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:16,384][25689] Avg episode reward: [(0, '-51.420')] [2022-07-09 08:35:16,699][26022] Updated weights on worker 0-0, policy_version 170504 (0.00048) [2022-07-09 08:35:18,622][26022] Updated weights on worker 0-0, policy_version 170514 (0.00090) [2022-07-09 08:35:20,228][26022] Updated weights on worker 0-0, policy_version 170524 (0.00080) [2022-07-09 08:35:21,430][25689] Fps is (10 sec: 5797.9, 60 sec: 5763.8, 300 sec: 5740.8). Total num frames: 174621696. Throughput: 0: 5945.3. Samples: 174622100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:21,434][25689] Avg episode reward: [(0, '-51.679')] [2022-07-09 08:35:22,215][26022] Updated weights on worker 0-0, policy_version 170534 (0.00086) [2022-07-09 08:35:23,748][26022] Updated weights on worker 0-0, policy_version 170544 (0.00085) [2022-07-09 08:35:25,705][26022] Updated weights on worker 0-0, policy_version 170554 (0.00090) [2022-07-09 08:35:26,451][25689] Fps is (10 sec: 5695.7, 60 sec: 5730.9, 300 sec: 5747.6). Total num frames: 174651392. Throughput: 0: 6054.6. Samples: 174657088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:26,451][25689] Avg episode reward: [(0, '-52.096')] [2022-07-09 08:35:27,437][26022] Updated weights on worker 0-0, policy_version 170564 (0.00086) [2022-07-09 08:35:29,186][26022] Updated weights on worker 0-0, policy_version 170574 (0.00089) [2022-07-09 08:35:30,920][26022] Updated weights on worker 0-0, policy_version 170584 (0.00080) [2022-07-09 08:35:31,460][25689] Fps is (10 sec: 5819.5, 60 sec: 5750.1, 300 sec: 5744.4). Total num frames: 174680064. Throughput: 0: 5208.4. Samples: 174674288. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:31,461][25689] Avg episode reward: [(0, '-51.267')] [2022-07-09 08:35:32,808][26022] Updated weights on worker 0-0, policy_version 170594 (0.00103) [2022-07-09 08:35:34,391][26022] Updated weights on worker 0-0, policy_version 170604 (0.00093) [2022-07-09 08:35:36,204][26022] Updated weights on worker 0-0, policy_version 170614 (0.00087) [2022-07-09 08:35:36,475][25689] Fps is (10 sec: 5720.7, 60 sec: 5750.4, 300 sec: 5748.5). Total num frames: 174708736. Throughput: 0: 6064.2. Samples: 174708846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:36,475][25689] Avg episode reward: [(0, '-52.320')] [2022-07-09 08:35:37,902][26022] Updated weights on worker 0-0, policy_version 170624 (0.00086) [2022-07-09 08:35:40,000][26022] Updated weights on worker 0-0, policy_version 170634 (0.00093) [2022-07-09 08:35:41,578][25689] Fps is (10 sec: 5768.5, 60 sec: 5733.4, 300 sec: 5740.1). Total num frames: 174738432. Throughput: 0: 5994.5. Samples: 174742878. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:41,579][25689] Avg episode reward: [(0, '-52.631')] [2022-07-09 08:35:41,660][26022] Updated weights on worker 0-0, policy_version 170644 (0.00089) [2022-07-09 08:35:43,671][26022] Updated weights on worker 0-0, policy_version 170654 (0.00086) [2022-07-09 08:35:45,232][26022] Updated weights on worker 0-0, policy_version 170664 (0.00090) [2022-07-09 08:35:46,583][25689] Fps is (10 sec: 5672.8, 60 sec: 5716.8, 300 sec: 5740.9). Total num frames: 174766080. Throughput: 0: 5118.2. Samples: 174760130. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:46,584][25689] Avg episode reward: [(0, '-53.098')] [2022-07-09 08:35:47,295][26022] Updated weights on worker 0-0, policy_version 170674 (0.00088) [2022-07-09 08:35:48,808][26022] Updated weights on worker 0-0, policy_version 170684 (0.00091) [2022-07-09 08:35:50,890][26022] Updated weights on worker 0-0, policy_version 170694 (0.00091) [2022-07-09 08:35:51,601][25689] Fps is (10 sec: 5516.7, 60 sec: 5682.6, 300 sec: 5737.2). Total num frames: 174793728. Throughput: 0: 5963.0. Samples: 174794394. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:51,602][25689] Avg episode reward: [(0, '-52.572')] [2022-07-09 08:35:52,482][26022] Updated weights on worker 0-0, policy_version 170704 (0.00085) [2022-07-09 08:35:54,443][26022] Updated weights on worker 0-0, policy_version 170714 (0.00086) [2022-07-09 08:35:56,053][26022] Updated weights on worker 0-0, policy_version 170724 (0.00081) [2022-07-09 08:35:56,647][25689] Fps is (10 sec: 5800.1, 60 sec: 5730.6, 300 sec: 5740.9). Total num frames: 174824448. Throughput: 0: 5942.2. Samples: 174828712. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:35:56,647][25689] Avg episode reward: [(0, '-51.712')] [2022-07-09 08:35:58,000][26022] Updated weights on worker 0-0, policy_version 170734 (0.00089) [2022-07-09 08:35:59,732][26022] Updated weights on worker 0-0, policy_version 170744 (0.00095) [2022-07-09 08:36:01,572][26022] Updated weights on worker 0-0, policy_version 170754 (0.00087) [2022-07-09 08:36:01,756][25689] Fps is (10 sec: 5848.7, 60 sec: 5710.6, 300 sec: 5749.2). Total num frames: 174853120. Throughput: 0: 5103.5. Samples: 174845858. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:01,757][25689] Avg episode reward: [(0, '-52.397')] [2022-07-09 08:36:03,622][26022] Updated weights on worker 0-0, policy_version 170764 (0.00088) [2022-07-09 08:36:05,617][26022] Updated weights on worker 0-0, policy_version 170774 (0.00087) [2022-07-09 08:36:06,771][25689] Fps is (10 sec: 5461.9, 60 sec: 5694.9, 300 sec: 5738.7). Total num frames: 174879744. Throughput: 0: 5828.7. Samples: 174877796. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:06,771][25689] Avg episode reward: [(0, '-52.956')] [2022-07-09 08:36:07,148][26022] Updated weights on worker 0-0, policy_version 170784 (0.00095) [2022-07-09 08:36:09,263][26022] Updated weights on worker 0-0, policy_version 170794 (0.00379) [2022-07-09 08:36:10,735][26022] Updated weights on worker 0-0, policy_version 170804 (0.00096) [2022-07-09 08:36:11,817][25689] Fps is (10 sec: 5496.2, 60 sec: 5708.2, 300 sec: 5738.0). Total num frames: 174908416. Throughput: 0: 5828.7. Samples: 174912224. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:11,818][25689] Avg episode reward: [(0, '-52.725')] [2022-07-09 08:36:12,673][26022] Updated weights on worker 0-0, policy_version 170814 (0.00087) [2022-07-09 08:36:14,431][26022] Updated weights on worker 0-0, policy_version 170824 (0.00088) [2022-07-09 08:36:16,295][26022] Updated weights on worker 0-0, policy_version 170834 (0.00082) [2022-07-09 08:36:16,836][25689] Fps is (10 sec: 5697.4, 60 sec: 5674.5, 300 sec: 5742.1). Total num frames: 174937088. Throughput: 0: 5822.9. Samples: 174946270. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:16,836][25689] Avg episode reward: [(0, '-52.644')] [2022-07-09 08:36:18,077][26022] Updated weights on worker 0-0, policy_version 170844 (0.00085) [2022-07-09 08:36:20,092][26022] Updated weights on worker 0-0, policy_version 170854 (0.00096) [2022-07-09 08:36:21,568][26022] Updated weights on worker 0-0, policy_version 170864 (0.00093) [2022-07-09 08:36:21,900][25689] Fps is (10 sec: 5788.9, 60 sec: 5706.8, 300 sec: 5737.5). Total num frames: 174966784. Throughput: 0: 5840.5. Samples: 174963506. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:21,900][25689] Avg episode reward: [(0, '-52.567')] [2022-07-09 08:36:23,697][26022] Updated weights on worker 0-0, policy_version 170874 (0.00755) [2022-07-09 08:36:25,011][26022] Updated weights on worker 0-0, policy_version 170884 (0.01304) [2022-07-09 08:36:26,909][25689] Fps is (10 sec: 5489.4, 60 sec: 5640.1, 300 sec: 5727.2). Total num frames: 174992384. Throughput: 0: 5952.9. Samples: 174997676. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:26,909][25689] Avg episode reward: [(0, '-52.714')] [2022-07-09 08:36:27,147][26022] Updated weights on worker 0-0, policy_version 170894 (0.00084) [2022-07-09 08:36:28,810][26022] Updated weights on worker 0-0, policy_version 170904 (0.00087) [2022-07-09 08:36:30,800][26022] Updated weights on worker 0-0, policy_version 170914 (0.00084) [2022-07-09 08:36:31,913][25689] Fps is (10 sec: 5624.8, 60 sec: 5674.5, 300 sec: 5734.4). Total num frames: 175023104. Throughput: 0: 5961.7. Samples: 175032028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:31,914][25689] Avg episode reward: [(0, '-53.001')] [2022-07-09 08:36:32,463][26022] Updated weights on worker 0-0, policy_version 170924 (0.00091) [2022-07-09 08:36:34,290][26022] Updated weights on worker 0-0, policy_version 170934 (0.00089) [2022-07-09 08:36:35,959][26022] Updated weights on worker 0-0, policy_version 170944 (0.00093) [2022-07-09 08:36:36,955][25689] Fps is (10 sec: 5911.9, 60 sec: 5671.9, 300 sec: 5728.1). Total num frames: 175051776. Throughput: 0: 5110.7. Samples: 175049094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:36,956][25689] Avg episode reward: [(0, '-52.481')] [2022-07-09 08:36:37,899][26022] Updated weights on worker 0-0, policy_version 170954 (0.00081) [2022-07-09 08:36:39,677][26022] Updated weights on worker 0-0, policy_version 170964 (0.00089) [2022-07-09 08:36:40,099][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:36:40,110][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000170967_175070208.pth [2022-07-09 08:36:40,110][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000168948_173002752.pth [2022-07-09 08:36:41,435][26022] Updated weights on worker 0-0, policy_version 170974 (0.00110) [2022-07-09 08:36:42,046][25689] Fps is (10 sec: 5659.0, 60 sec: 5656.2, 300 sec: 5730.5). Total num frames: 175080448. Throughput: 0: 5960.1. Samples: 175083578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:42,046][25689] Avg episode reward: [(0, '-51.997')] [2022-07-09 08:36:43,249][26022] Updated weights on worker 0-0, policy_version 170984 (0.00264) [2022-07-09 08:36:45,023][26022] Updated weights on worker 0-0, policy_version 170994 (0.00776) [2022-07-09 08:36:46,752][26022] Updated weights on worker 0-0, policy_version 171004 (0.00092) [2022-07-09 08:36:47,097][25689] Fps is (10 sec: 5653.8, 60 sec: 5668.8, 300 sec: 5723.7). Total num frames: 175109120. Throughput: 0: 5946.0. Samples: 175117718. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 08:36:47,098][25689] Avg episode reward: [(0, '-52.687')] [2022-07-09 08:36:48,656][26022] Updated weights on worker 0-0, policy_version 171014 (0.00080) [2022-07-09 08:36:50,354][26022] Updated weights on worker 0-0, policy_version 171024 (0.00083) [2022-07-09 08:36:52,157][25689] Fps is (10 sec: 5671.3, 60 sec: 5681.8, 300 sec: 5719.7). Total num frames: 175137792. Throughput: 0: 5094.0. Samples: 175135154. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:36:52,158][25689] Avg episode reward: [(0, '-52.926')] [2022-07-09 08:36:52,223][26022] Updated weights on worker 0-0, policy_version 171034 (0.00091) [2022-07-09 08:36:53,978][26022] Updated weights on worker 0-0, policy_version 171044 (0.00087) [2022-07-09 08:36:55,679][26022] Updated weights on worker 0-0, policy_version 171054 (0.00090) [2022-07-09 08:36:57,236][25689] Fps is (10 sec: 5655.7, 60 sec: 5644.8, 300 sec: 5719.5). Total num frames: 175166464. Throughput: 0: 5963.2. Samples: 175170038. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:36:57,237][25689] Avg episode reward: [(0, '-52.894')] [2022-07-09 08:36:57,512][26022] Updated weights on worker 0-0, policy_version 171064 (0.00086) [2022-07-09 08:36:59,159][26022] Updated weights on worker 0-0, policy_version 171074 (0.01302) [2022-07-09 08:37:01,036][26022] Updated weights on worker 0-0, policy_version 171084 (0.00090) [2022-07-09 08:37:02,295][25689] Fps is (10 sec: 5757.6, 60 sec: 5666.5, 300 sec: 5732.4). Total num frames: 175196160. Throughput: 0: 5937.0. Samples: 175203796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:02,295][25689] Avg episode reward: [(0, '-52.454')] [2022-07-09 08:37:03,187][26022] Updated weights on worker 0-0, policy_version 171094 (0.00086) [2022-07-09 08:37:04,779][26022] Updated weights on worker 0-0, policy_version 171104 (0.00092) [2022-07-09 08:37:06,835][26022] Updated weights on worker 0-0, policy_version 171114 (0.00081) [2022-07-09 08:37:07,310][25689] Fps is (10 sec: 5692.4, 60 sec: 5683.3, 300 sec: 5722.0). Total num frames: 175223808. Throughput: 0: 5052.4. Samples: 175219842. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:07,311][25689] Avg episode reward: [(0, '-52.499')] [2022-07-09 08:37:08,336][26022] Updated weights on worker 0-0, policy_version 171124 (0.00097) [2022-07-09 08:37:10,404][26022] Updated weights on worker 0-0, policy_version 171134 (0.01379) [2022-07-09 08:37:11,988][26022] Updated weights on worker 0-0, policy_version 171144 (0.00098) [2022-07-09 08:37:12,314][25689] Fps is (10 sec: 5620.9, 60 sec: 5687.2, 300 sec: 5722.5). Total num frames: 175252480. Throughput: 0: 5899.9. Samples: 175254080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:12,315][25689] Avg episode reward: [(0, '-52.324')] [2022-07-09 08:37:13,888][26022] Updated weights on worker 0-0, policy_version 171154 (0.00083) [2022-07-09 08:37:15,746][26022] Updated weights on worker 0-0, policy_version 171164 (0.00085) [2022-07-09 08:37:17,319][25689] Fps is (10 sec: 5627.1, 60 sec: 5671.6, 300 sec: 5718.1). Total num frames: 175280128. Throughput: 0: 5887.6. Samples: 175288276. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:17,321][25689] Avg episode reward: [(0, '-51.954')] [2022-07-09 08:37:17,500][26022] Updated weights on worker 0-0, policy_version 171174 (0.00088) [2022-07-09 08:37:19,260][26022] Updated weights on worker 0-0, policy_version 171184 (0.00088) [2022-07-09 08:37:21,124][26022] Updated weights on worker 0-0, policy_version 171194 (0.00089) [2022-07-09 08:37:22,372][25689] Fps is (10 sec: 5701.4, 60 sec: 5672.7, 300 sec: 5721.2). Total num frames: 175309824. Throughput: 0: 5064.1. Samples: 175305472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:22,373][25689] Avg episode reward: [(0, '-50.707')] [2022-07-09 08:37:22,803][26022] Updated weights on worker 0-0, policy_version 171204 (0.00083) [2022-07-09 08:37:24,591][26022] Updated weights on worker 0-0, policy_version 171214 (0.00084) [2022-07-09 08:37:26,281][26022] Updated weights on worker 0-0, policy_version 171224 (0.00089) [2022-07-09 08:37:27,431][25689] Fps is (10 sec: 5670.6, 60 sec: 5701.8, 300 sec: 5711.5). Total num frames: 175337472. Throughput: 0: 5966.6. Samples: 175339896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:27,433][25689] Avg episode reward: [(0, '-51.274')] [2022-07-09 08:37:28,447][26022] Updated weights on worker 0-0, policy_version 171234 (0.00084) [2022-07-09 08:37:29,909][26022] Updated weights on worker 0-0, policy_version 171244 (0.00084) [2022-07-09 08:37:31,878][26022] Updated weights on worker 0-0, policy_version 171254 (0.00083) [2022-07-09 08:37:32,443][25689] Fps is (10 sec: 5795.9, 60 sec: 5701.1, 300 sec: 5726.5). Total num frames: 175368192. Throughput: 0: 5968.4. Samples: 175374214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:32,446][25689] Avg episode reward: [(0, '-51.483')] [2022-07-09 08:37:33,551][26022] Updated weights on worker 0-0, policy_version 171264 (0.00083) [2022-07-09 08:37:35,425][26022] Updated weights on worker 0-0, policy_version 171274 (0.00083) [2022-07-09 08:37:37,263][26022] Updated weights on worker 0-0, policy_version 171284 (0.00085) [2022-07-09 08:37:37,461][25689] Fps is (10 sec: 5717.2, 60 sec: 5669.5, 300 sec: 5709.8). Total num frames: 175394816. Throughput: 0: 5121.0. Samples: 175391426. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:37,463][25689] Avg episode reward: [(0, '-51.087')] [2022-07-09 08:37:39,060][26022] Updated weights on worker 0-0, policy_version 171294 (0.00088) [2022-07-09 08:37:40,858][26022] Updated weights on worker 0-0, policy_version 171304 (0.00090) [2022-07-09 08:37:42,511][25689] Fps is (10 sec: 5594.0, 60 sec: 5690.3, 300 sec: 5713.4). Total num frames: 175424512. Throughput: 0: 5977.2. Samples: 175425844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:42,511][25689] Avg episode reward: [(0, '-52.011')] [2022-07-09 08:37:42,627][26022] Updated weights on worker 0-0, policy_version 171314 (0.00096) [2022-07-09 08:37:44,383][26022] Updated weights on worker 0-0, policy_version 171324 (0.00091) [2022-07-09 08:37:46,120][26022] Updated weights on worker 0-0, policy_version 171334 (0.00085) [2022-07-09 08:37:47,516][25689] Fps is (10 sec: 5805.2, 60 sec: 5694.7, 300 sec: 5713.6). Total num frames: 175453184. Throughput: 0: 5984.7. Samples: 175460096. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:47,517][25689] Avg episode reward: [(0, '-52.235')] [2022-07-09 08:37:47,939][26022] Updated weights on worker 0-0, policy_version 171344 (0.00087) [2022-07-09 08:37:49,907][26022] Updated weights on worker 0-0, policy_version 171354 (0.00082) [2022-07-09 08:37:51,446][26022] Updated weights on worker 0-0, policy_version 171364 (0.00080) [2022-07-09 08:37:52,529][25689] Fps is (10 sec: 5826.4, 60 sec: 5716.1, 300 sec: 5720.7). Total num frames: 175482880. Throughput: 0: 5140.4. Samples: 175477464. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:52,530][25689] Avg episode reward: [(0, '-52.488')] [2022-07-09 08:37:53,419][26022] Updated weights on worker 0-0, policy_version 171374 (0.00086) [2022-07-09 08:37:55,056][26022] Updated weights on worker 0-0, policy_version 171384 (0.00085) [2022-07-09 08:37:56,834][26022] Updated weights on worker 0-0, policy_version 171394 (0.00105) [2022-07-09 08:37:57,547][25689] Fps is (10 sec: 5818.8, 60 sec: 5721.9, 300 sec: 5715.4). Total num frames: 175511552. Throughput: 0: 6006.5. Samples: 175512068. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:37:57,547][25689] Avg episode reward: [(0, '-52.093')] [2022-07-09 08:37:58,554][26022] Updated weights on worker 0-0, policy_version 171404 (0.00087) [2022-07-09 08:38:00,278][26022] Updated weights on worker 0-0, policy_version 171414 (0.00094) [2022-07-09 08:38:02,589][25689] Fps is (10 sec: 5496.7, 60 sec: 5672.5, 300 sec: 5722.4). Total num frames: 175538176. Throughput: 0: 5942.3. Samples: 175545152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:02,589][25689] Avg episode reward: [(0, '-51.963')] [2022-07-09 08:38:02,592][26022] Updated weights on worker 0-0, policy_version 171424 (0.00082) [2022-07-09 08:38:04,181][26022] Updated weights on worker 0-0, policy_version 171434 (0.00081) [2022-07-09 08:38:06,159][26022] Updated weights on worker 0-0, policy_version 171444 (0.00088) [2022-07-09 08:38:07,634][25689] Fps is (10 sec: 5684.6, 60 sec: 5720.6, 300 sec: 5723.4). Total num frames: 175568896. Throughput: 0: 5085.9. Samples: 175562416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:07,635][25689] Avg episode reward: [(0, '-52.314')] [2022-07-09 08:38:07,638][26022] Updated weights on worker 0-0, policy_version 171454 (0.00087) [2022-07-09 08:38:09,557][26022] Updated weights on worker 0-0, policy_version 171464 (0.00081) [2022-07-09 08:38:11,299][26022] Updated weights on worker 0-0, policy_version 171474 (0.00087) [2022-07-09 08:38:12,718][25689] Fps is (10 sec: 5661.2, 60 sec: 5679.2, 300 sec: 5712.6). Total num frames: 175595520. Throughput: 0: 5933.2. Samples: 175597250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:12,719][25689] Avg episode reward: [(0, '-52.243')] [2022-07-09 08:38:13,386][26022] Updated weights on worker 0-0, policy_version 171484 (0.00085) [2022-07-09 08:38:14,905][26022] Updated weights on worker 0-0, policy_version 171494 (0.00083) [2022-07-09 08:38:16,855][26022] Updated weights on worker 0-0, policy_version 171504 (0.00091) [2022-07-09 08:38:17,724][25689] Fps is (10 sec: 5683.5, 60 sec: 5729.9, 300 sec: 5716.9). Total num frames: 175626240. Throughput: 0: 5925.5. Samples: 175631628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:17,724][25689] Avg episode reward: [(0, '-52.206')] [2022-07-09 08:38:18,453][26022] Updated weights on worker 0-0, policy_version 171514 (0.00082) [2022-07-09 08:38:20,265][26022] Updated weights on worker 0-0, policy_version 171524 (0.00086) [2022-07-09 08:38:22,049][26022] Updated weights on worker 0-0, policy_version 171534 (0.00086) [2022-07-09 08:38:22,788][25689] Fps is (10 sec: 5796.0, 60 sec: 5694.9, 300 sec: 5713.1). Total num frames: 175653888. Throughput: 0: 5138.2. Samples: 175648942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:22,789][25689] Avg episode reward: [(0, '-51.394')] [2022-07-09 08:38:23,843][26022] Updated weights on worker 0-0, policy_version 171544 (0.00083) [2022-07-09 08:38:25,577][26022] Updated weights on worker 0-0, policy_version 171554 (0.00086) [2022-07-09 08:38:27,464][26022] Updated weights on worker 0-0, policy_version 171564 (0.00083) [2022-07-09 08:38:27,871][25689] Fps is (10 sec: 5651.1, 60 sec: 5726.5, 300 sec: 5712.1). Total num frames: 175683584. Throughput: 0: 5995.5. Samples: 175683748. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:27,872][25689] Avg episode reward: [(0, '-51.672')] [2022-07-09 08:38:29,156][26022] Updated weights on worker 0-0, policy_version 171574 (0.00089) [2022-07-09 08:38:31,095][26022] Updated weights on worker 0-0, policy_version 171584 (0.00085) [2022-07-09 08:38:32,686][26022] Updated weights on worker 0-0, policy_version 171594 (0.00081) [2022-07-09 08:38:32,965][25689] Fps is (10 sec: 5836.0, 60 sec: 5701.8, 300 sec: 5714.4). Total num frames: 175713280. Throughput: 0: 5984.7. Samples: 175718424. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:32,966][25689] Avg episode reward: [(0, '-52.687')] [2022-07-09 08:38:34,562][26022] Updated weights on worker 0-0, policy_version 171604 (0.00093) [2022-07-09 08:38:36,184][26022] Updated weights on worker 0-0, policy_version 171614 (0.00090) [2022-07-09 08:38:37,995][25689] Fps is (10 sec: 5866.7, 60 sec: 5751.5, 300 sec: 5714.5). Total num frames: 175742976. Throughput: 0: 5155.3. Samples: 175736130. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:37,996][25689] Avg episode reward: [(0, '-52.292')] [2022-07-09 08:38:37,997][26022] Updated weights on worker 0-0, policy_version 171624 (0.00086) [2022-07-09 08:38:39,613][26022] Updated weights on worker 0-0, policy_version 171634 (0.00083) [2022-07-09 08:38:40,276][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:38:40,289][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000171637_175756288.pth [2022-07-09 08:38:40,290][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000169625_173696000.pth [2022-07-09 08:38:41,495][26022] Updated weights on worker 0-0, policy_version 171644 (0.00088) [2022-07-09 08:38:43,087][25689] Fps is (10 sec: 5867.6, 60 sec: 5747.5, 300 sec: 5717.7). Total num frames: 175772672. Throughput: 0: 6019.4. Samples: 175771130. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:43,088][25689] Avg episode reward: [(0, '-52.286')] [2022-07-09 08:38:43,312][26022] Updated weights on worker 0-0, policy_version 171654 (0.00083) [2022-07-09 08:38:45,085][26022] Updated weights on worker 0-0, policy_version 171664 (0.00090) [2022-07-09 08:38:46,722][26022] Updated weights on worker 0-0, policy_version 171674 (0.00089) [2022-07-09 08:38:48,095][25689] Fps is (10 sec: 5677.4, 60 sec: 5730.3, 300 sec: 5711.2). Total num frames: 175800320. Throughput: 0: 6032.4. Samples: 175805748. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:48,096][25689] Avg episode reward: [(0, '-52.863')] [2022-07-09 08:38:48,576][26022] Updated weights on worker 0-0, policy_version 171684 (0.00088) [2022-07-09 08:38:50,170][26022] Updated weights on worker 0-0, policy_version 171694 (0.00049) [2022-07-09 08:38:52,221][26022] Updated weights on worker 0-0, policy_version 171704 (0.00084) [2022-07-09 08:38:53,160][25689] Fps is (10 sec: 5896.4, 60 sec: 5759.2, 300 sec: 5718.4). Total num frames: 175832064. Throughput: 0: 5189.9. Samples: 175823232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 08:38:53,160][25689] Avg episode reward: [(0, '-53.037')] [2022-07-09 08:38:53,781][26022] Updated weights on worker 0-0, policy_version 171714 (0.00085) [2022-07-09 08:38:55,616][26022] Updated weights on worker 0-0, policy_version 171724 (0.00086) [2022-07-09 08:38:57,498][26022] Updated weights on worker 0-0, policy_version 171734 (0.00079) [2022-07-09 08:38:58,180][25689] Fps is (10 sec: 5889.6, 60 sec: 5742.1, 300 sec: 5712.7). Total num frames: 175859712. Throughput: 0: 6041.7. Samples: 175858080. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:38:58,180][25689] Avg episode reward: [(0, '-52.609')] [2022-07-09 08:38:59,131][26022] Updated weights on worker 0-0, policy_version 171744 (0.00086) [2022-07-09 08:39:00,932][26022] Updated weights on worker 0-0, policy_version 171754 (0.00087) [2022-07-09 08:39:03,084][26022] Updated weights on worker 0-0, policy_version 171764 (0.00107) [2022-07-09 08:39:03,261][25689] Fps is (10 sec: 5372.8, 60 sec: 5738.4, 300 sec: 5715.3). Total num frames: 175886336. Throughput: 0: 5925.8. Samples: 175890676. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:03,263][25689] Avg episode reward: [(0, '-51.500')] [2022-07-09 08:39:04,733][26022] Updated weights on worker 0-0, policy_version 171774 (0.00089) [2022-07-09 08:39:06,751][26022] Updated weights on worker 0-0, policy_version 171784 (0.00095) [2022-07-09 08:39:08,249][26022] Updated weights on worker 0-0, policy_version 171794 (0.00087) [2022-07-09 08:39:08,278][25689] Fps is (10 sec: 5678.4, 60 sec: 5741.1, 300 sec: 5719.3). Total num frames: 175917056. Throughput: 0: 5933.3. Samples: 175925500. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:08,279][25689] Avg episode reward: [(0, '-51.552')] [2022-07-09 08:39:10,191][26022] Updated weights on worker 0-0, policy_version 171804 (0.00073) [2022-07-09 08:39:11,943][26022] Updated weights on worker 0-0, policy_version 171814 (0.00086) [2022-07-09 08:39:13,290][25689] Fps is (10 sec: 5819.9, 60 sec: 5764.8, 300 sec: 5712.5). Total num frames: 175944704. Throughput: 0: 5942.1. Samples: 175942848. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:13,290][25689] Avg episode reward: [(0, '-50.925')] [2022-07-09 08:39:13,645][26022] Updated weights on worker 0-0, policy_version 171824 (0.00084) [2022-07-09 08:39:15,555][26022] Updated weights on worker 0-0, policy_version 171834 (0.00084) [2022-07-09 08:39:17,109][26022] Updated weights on worker 0-0, policy_version 171844 (0.00087) [2022-07-09 08:39:18,309][25689] Fps is (10 sec: 5614.5, 60 sec: 5729.7, 300 sec: 5717.9). Total num frames: 175973376. Throughput: 0: 5944.2. Samples: 175977736. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:18,310][25689] Avg episode reward: [(0, '-50.662')] [2022-07-09 08:39:18,978][26022] Updated weights on worker 0-0, policy_version 171854 (0.00094) [2022-07-09 08:39:20,907][26022] Updated weights on worker 0-0, policy_version 171864 (0.00089) [2022-07-09 08:39:22,476][26022] Updated weights on worker 0-0, policy_version 171874 (0.00087) [2022-07-09 08:39:23,373][25689] Fps is (10 sec: 5788.8, 60 sec: 5763.6, 300 sec: 5710.4). Total num frames: 176003072. Throughput: 0: 6041.9. Samples: 176012190. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:23,373][25689] Avg episode reward: [(0, '-50.626')] [2022-07-09 08:39:24,375][26022] Updated weights on worker 0-0, policy_version 171884 (0.00093) [2022-07-09 08:39:26,144][26022] Updated weights on worker 0-0, policy_version 171894 (0.00091) [2022-07-09 08:39:27,749][26022] Updated weights on worker 0-0, policy_version 171904 (0.00083) [2022-07-09 08:39:28,376][25689] Fps is (10 sec: 5900.1, 60 sec: 5771.2, 300 sec: 5717.8). Total num frames: 176032768. Throughput: 0: 5175.9. Samples: 176029522. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:28,376][25689] Avg episode reward: [(0, '-50.768')] [2022-07-09 08:39:29,840][26022] Updated weights on worker 0-0, policy_version 171914 (0.00088) [2022-07-09 08:39:31,319][26022] Updated weights on worker 0-0, policy_version 171924 (0.00085) [2022-07-09 08:39:33,265][26022] Updated weights on worker 0-0, policy_version 171934 (0.00082) [2022-07-09 08:39:33,379][25689] Fps is (10 sec: 5832.9, 60 sec: 5762.9, 300 sec: 5718.1). Total num frames: 176061440. Throughput: 0: 6063.7. Samples: 176064666. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:33,380][25689] Avg episode reward: [(0, '-51.566')] [2022-07-09 08:39:34,992][26022] Updated weights on worker 0-0, policy_version 171944 (0.00086) [2022-07-09 08:39:36,556][26022] Updated weights on worker 0-0, policy_version 171954 (0.00087) [2022-07-09 08:39:38,384][25689] Fps is (10 sec: 5729.8, 60 sec: 5748.3, 300 sec: 5713.1). Total num frames: 176090112. Throughput: 0: 6073.7. Samples: 176099662. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:38,384][25689] Avg episode reward: [(0, '-51.235')] [2022-07-09 08:39:38,534][26022] Updated weights on worker 0-0, policy_version 171964 (0.00081) [2022-07-09 08:39:40,245][26022] Updated weights on worker 0-0, policy_version 171974 (0.00082) [2022-07-09 08:39:41,966][26022] Updated weights on worker 0-0, policy_version 171984 (0.00097) [2022-07-09 08:39:43,470][25689] Fps is (10 sec: 5784.1, 60 sec: 5748.9, 300 sec: 5715.1). Total num frames: 176119808. Throughput: 0: 5217.3. Samples: 176117046. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:43,471][25689] Avg episode reward: [(0, '-50.976')] [2022-07-09 08:39:43,848][26022] Updated weights on worker 0-0, policy_version 171994 (0.00085) [2022-07-09 08:39:45,647][26022] Updated weights on worker 0-0, policy_version 172004 (0.00091) [2022-07-09 08:39:47,390][26022] Updated weights on worker 0-0, policy_version 172014 (0.00092) [2022-07-09 08:39:48,478][25689] Fps is (10 sec: 5782.2, 60 sec: 5765.9, 300 sec: 5711.8). Total num frames: 176148480. Throughput: 0: 6062.6. Samples: 176151394. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:48,479][25689] Avg episode reward: [(0, '-51.197')] [2022-07-09 08:39:48,943][26022] Updated weights on worker 0-0, policy_version 172024 (0.00086) [2022-07-09 08:39:50,873][26022] Updated weights on worker 0-0, policy_version 172034 (0.00079) [2022-07-09 08:39:52,683][26022] Updated weights on worker 0-0, policy_version 172044 (0.00084) [2022-07-09 08:39:53,479][25689] Fps is (10 sec: 5729.3, 60 sec: 5721.0, 300 sec: 5715.5). Total num frames: 176177152. Throughput: 0: 6057.7. Samples: 176186426. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:53,483][25689] Avg episode reward: [(0, '-51.771')] [2022-07-09 08:39:54,399][26022] Updated weights on worker 0-0, policy_version 172054 (0.00087) [2022-07-09 08:39:56,159][26022] Updated weights on worker 0-0, policy_version 172064 (0.00079) [2022-07-09 08:39:57,793][26022] Updated weights on worker 0-0, policy_version 172074 (0.00088) [2022-07-09 08:39:58,486][25689] Fps is (10 sec: 5832.0, 60 sec: 5756.2, 300 sec: 5716.8). Total num frames: 176206848. Throughput: 0: 5181.5. Samples: 176203824. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:39:58,487][25689] Avg episode reward: [(0, '-51.131')] [2022-07-09 08:39:59,761][26022] Updated weights on worker 0-0, policy_version 172084 (0.00082) [2022-07-09 08:40:01,336][26022] Updated weights on worker 0-0, policy_version 172094 (0.00096) [2022-07-09 08:40:03,555][25689] Fps is (10 sec: 5691.1, 60 sec: 5774.3, 300 sec: 5716.0). Total num frames: 176234496. Throughput: 0: 6006.3. Samples: 176237682. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:03,558][25689] Avg episode reward: [(0, '-51.896')] [2022-07-09 08:40:03,563][26022] Updated weights on worker 0-0, policy_version 172104 (0.00089) [2022-07-09 08:40:05,317][26022] Updated weights on worker 0-0, policy_version 172114 (0.00091) [2022-07-09 08:40:07,153][26022] Updated weights on worker 0-0, policy_version 172124 (0.00087) [2022-07-09 08:40:08,636][25689] Fps is (10 sec: 5549.2, 60 sec: 5734.4, 300 sec: 5718.1). Total num frames: 176263168. Throughput: 0: 5969.5. Samples: 176271724. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:08,636][25689] Avg episode reward: [(0, '-51.528')] [2022-07-09 08:40:08,782][26022] Updated weights on worker 0-0, policy_version 172134 (0.00088) [2022-07-09 08:40:10,722][26022] Updated weights on worker 0-0, policy_version 172144 (0.00080) [2022-07-09 08:40:12,211][26022] Updated weights on worker 0-0, policy_version 172154 (0.00086) [2022-07-09 08:40:13,643][25689] Fps is (10 sec: 5786.4, 60 sec: 5768.7, 300 sec: 5714.9). Total num frames: 176292864. Throughput: 0: 5090.2. Samples: 176289060. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:13,643][25689] Avg episode reward: [(0, '-52.712')] [2022-07-09 08:40:14,253][26022] Updated weights on worker 0-0, policy_version 172164 (0.00089) [2022-07-09 08:40:15,775][26022] Updated weights on worker 0-0, policy_version 172174 (0.00096) [2022-07-09 08:40:17,747][26022] Updated weights on worker 0-0, policy_version 172184 (0.00089) [2022-07-09 08:40:18,662][25689] Fps is (10 sec: 5719.2, 60 sec: 5751.8, 300 sec: 5715.4). Total num frames: 176320512. Throughput: 0: 5936.4. Samples: 176323596. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:18,663][25689] Avg episode reward: [(0, '-52.169')] [2022-07-09 08:40:19,532][26022] Updated weights on worker 0-0, policy_version 172194 (0.00092) [2022-07-09 08:40:21,277][26022] Updated weights on worker 0-0, policy_version 172204 (0.00103) [2022-07-09 08:40:23,028][26022] Updated weights on worker 0-0, policy_version 172214 (0.00104) [2022-07-09 08:40:23,723][25689] Fps is (10 sec: 5790.1, 60 sec: 5769.0, 300 sec: 5718.1). Total num frames: 176351232. Throughput: 0: 5986.8. Samples: 176358422. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:23,724][25689] Avg episode reward: [(0, '-51.644')] [2022-07-09 08:40:24,822][26022] Updated weights on worker 0-0, policy_version 172224 (0.00092) [2022-07-09 08:40:26,492][26022] Updated weights on worker 0-0, policy_version 172234 (0.00083) [2022-07-09 08:40:28,199][26022] Updated weights on worker 0-0, policy_version 172244 (0.00087) [2022-07-09 08:40:28,738][25689] Fps is (10 sec: 5894.8, 60 sec: 5750.9, 300 sec: 5718.0). Total num frames: 176379904. Throughput: 0: 5175.2. Samples: 176375756. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:28,738][25689] Avg episode reward: [(0, '-50.704')] [2022-07-09 08:40:30,171][26022] Updated weights on worker 0-0, policy_version 172254 (0.00091) [2022-07-09 08:40:31,774][26022] Updated weights on worker 0-0, policy_version 172264 (0.00079) [2022-07-09 08:40:33,730][26022] Updated weights on worker 0-0, policy_version 172274 (0.00096) [2022-07-09 08:40:33,744][25689] Fps is (10 sec: 5722.5, 60 sec: 5750.7, 300 sec: 5718.2). Total num frames: 176408576. Throughput: 0: 6054.3. Samples: 176410760. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:33,745][25689] Avg episode reward: [(0, '-51.135')] [2022-07-09 08:40:35,301][26022] Updated weights on worker 0-0, policy_version 172284 (0.00055) [2022-07-09 08:40:37,163][26022] Updated weights on worker 0-0, policy_version 172294 (0.00101) [2022-07-09 08:40:38,768][25689] Fps is (10 sec: 5818.9, 60 sec: 5765.7, 300 sec: 5719.7). Total num frames: 176438272. Throughput: 0: 6056.9. Samples: 176445376. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:38,770][25689] Avg episode reward: [(0, '-51.668')] [2022-07-09 08:40:38,819][26022] Updated weights on worker 0-0, policy_version 172304 (0.00082) [2022-07-09 08:40:40,504][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:40:40,516][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000172312_176447488.pth [2022-07-09 08:40:40,517][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000170298_174385152.pth [2022-07-09 08:40:40,811][26022] Updated weights on worker 0-0, policy_version 172314 (0.00091) [2022-07-09 08:40:42,494][26022] Updated weights on worker 0-0, policy_version 172324 (0.00087) [2022-07-09 08:40:43,896][25689] Fps is (10 sec: 5749.4, 60 sec: 5744.8, 300 sec: 5720.8). Total num frames: 176466944. Throughput: 0: 5173.6. Samples: 176462788. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:43,897][25689] Avg episode reward: [(0, '-51.242')] [2022-07-09 08:40:44,262][26022] Updated weights on worker 0-0, policy_version 172334 (0.00085) [2022-07-09 08:40:45,938][26022] Updated weights on worker 0-0, policy_version 172344 (0.00087) [2022-07-09 08:40:48,007][26022] Updated weights on worker 0-0, policy_version 172354 (0.00088) [2022-07-09 08:40:48,943][25689] Fps is (10 sec: 5736.9, 60 sec: 5758.1, 300 sec: 5727.1). Total num frames: 176496640. Throughput: 0: 6024.1. Samples: 176497472. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:48,951][25689] Avg episode reward: [(0, '-51.331')] [2022-07-09 08:40:49,483][26022] Updated weights on worker 0-0, policy_version 172364 (0.00091) [2022-07-09 08:40:51,587][26022] Updated weights on worker 0-0, policy_version 172374 (0.00091) [2022-07-09 08:40:53,035][26022] Updated weights on worker 0-0, policy_version 172384 (0.00089) [2022-07-09 08:40:53,972][25689] Fps is (10 sec: 5894.6, 60 sec: 5772.4, 300 sec: 5724.0). Total num frames: 176526336. Throughput: 0: 5984.3. Samples: 176531808. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:53,972][25689] Avg episode reward: [(0, '-51.444')] [2022-07-09 08:40:55,107][26022] Updated weights on worker 0-0, policy_version 172394 (0.00086) [2022-07-09 08:40:56,777][26022] Updated weights on worker 0-0, policy_version 172404 (0.00087) [2022-07-09 08:40:58,474][26022] Updated weights on worker 0-0, policy_version 172414 (0.00086) [2022-07-09 08:40:58,984][25689] Fps is (10 sec: 5711.0, 60 sec: 5738.1, 300 sec: 5722.4). Total num frames: 176553984. Throughput: 0: 5127.6. Samples: 176549032. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:40:58,984][25689] Avg episode reward: [(0, '-51.345')] [2022-07-09 08:41:00,308][26022] Updated weights on worker 0-0, policy_version 172424 (0.00085) [2022-07-09 08:41:02,563][26022] Updated weights on worker 0-0, policy_version 172434 (0.00074) [2022-07-09 08:41:04,059][25689] Fps is (10 sec: 5481.6, 60 sec: 5737.4, 300 sec: 5724.7). Total num frames: 176581632. Throughput: 0: 5906.4. Samples: 176581880. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 08:41:04,060][25689] Avg episode reward: [(0, '-51.384')] [2022-07-09 08:41:04,176][26022] Updated weights on worker 0-0, policy_version 172444 (0.00081) [2022-07-09 08:41:06,055][26022] Updated weights on worker 0-0, policy_version 172454 (0.00090) [2022-07-09 08:41:07,623][26022] Updated weights on worker 0-0, policy_version 172464 (0.00088) [2022-07-09 08:41:09,104][25689] Fps is (10 sec: 5564.9, 60 sec: 5740.8, 300 sec: 5724.7). Total num frames: 176610304. Throughput: 0: 5899.5. Samples: 176616416. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:09,105][25689] Avg episode reward: [(0, '-51.401')] [2022-07-09 08:41:09,539][26022] Updated weights on worker 0-0, policy_version 172474 (0.00088) [2022-07-09 08:41:11,304][26022] Updated weights on worker 0-0, policy_version 172484 (0.00087) [2022-07-09 08:41:12,971][26022] Updated weights on worker 0-0, policy_version 172494 (0.00082) [2022-07-09 08:41:14,163][25689] Fps is (10 sec: 5574.2, 60 sec: 5702.0, 300 sec: 5720.5). Total num frames: 176637952. Throughput: 0: 5050.1. Samples: 176633776. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:14,163][25689] Avg episode reward: [(0, '-51.121')] [2022-07-09 08:41:14,865][26022] Updated weights on worker 0-0, policy_version 172504 (0.00092) [2022-07-09 08:41:16,531][26022] Updated weights on worker 0-0, policy_version 172514 (0.00086) [2022-07-09 08:41:18,499][26022] Updated weights on worker 0-0, policy_version 172524 (0.00093) [2022-07-09 08:41:19,184][25689] Fps is (10 sec: 5892.0, 60 sec: 5769.5, 300 sec: 5728.2). Total num frames: 176669696. Throughput: 0: 5919.7. Samples: 176668614. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:19,185][25689] Avg episode reward: [(0, '-51.763')] [2022-07-09 08:41:20,329][26022] Updated weights on worker 0-0, policy_version 172534 (0.00092) [2022-07-09 08:41:21,843][26022] Updated weights on worker 0-0, policy_version 172544 (0.00095) [2022-07-09 08:41:23,862][26022] Updated weights on worker 0-0, policy_version 172554 (0.00092) [2022-07-09 08:41:24,297][25689] Fps is (10 sec: 5860.6, 60 sec: 5713.9, 300 sec: 5733.1). Total num frames: 176697344. Throughput: 0: 5987.0. Samples: 176703044. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:24,298][25689] Avg episode reward: [(0, '-52.308')] [2022-07-09 08:41:25,408][26022] Updated weights on worker 0-0, policy_version 172564 (0.00085) [2022-07-09 08:41:27,391][26022] Updated weights on worker 0-0, policy_version 172574 (0.00084) [2022-07-09 08:41:28,925][26022] Updated weights on worker 0-0, policy_version 172584 (0.00085) [2022-07-09 08:41:29,323][25689] Fps is (10 sec: 5656.1, 60 sec: 5729.7, 300 sec: 5729.2). Total num frames: 176727040. Throughput: 0: 5147.2. Samples: 176720484. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:29,323][25689] Avg episode reward: [(0, '-52.587')] [2022-07-09 08:41:30,806][26022] Updated weights on worker 0-0, policy_version 172594 (0.00091) [2022-07-09 08:41:32,595][26022] Updated weights on worker 0-0, policy_version 172604 (0.00088) [2022-07-09 08:41:34,326][25689] Fps is (10 sec: 5819.9, 60 sec: 5730.0, 300 sec: 5730.0). Total num frames: 176755712. Throughput: 0: 6019.1. Samples: 176755140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:34,326][25689] Avg episode reward: [(0, '-52.576')] [2022-07-09 08:41:34,416][26022] Updated weights on worker 0-0, policy_version 172614 (0.00088) [2022-07-09 08:41:36,308][26022] Updated weights on worker 0-0, policy_version 172624 (0.00084) [2022-07-09 08:41:37,838][26022] Updated weights on worker 0-0, policy_version 172634 (0.00084) [2022-07-09 08:41:39,356][25689] Fps is (10 sec: 5715.4, 60 sec: 5712.6, 300 sec: 5731.2). Total num frames: 176784384. Throughput: 0: 5998.7. Samples: 176789618. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:39,356][25689] Avg episode reward: [(0, '-52.643')] [2022-07-09 08:41:39,890][26022] Updated weights on worker 0-0, policy_version 172644 (0.00090) [2022-07-09 08:41:41,352][26022] Updated weights on worker 0-0, policy_version 172654 (0.00089) [2022-07-09 08:41:43,360][26022] Updated weights on worker 0-0, policy_version 172664 (0.00080) [2022-07-09 08:41:44,481][25689] Fps is (10 sec: 5747.7, 60 sec: 5729.7, 300 sec: 5733.2). Total num frames: 176814080. Throughput: 0: 5152.2. Samples: 176807036. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:44,481][25689] Avg episode reward: [(0, '-52.235')] [2022-07-09 08:41:45,068][26022] Updated weights on worker 0-0, policy_version 172674 (0.00089) [2022-07-09 08:41:46,928][26022] Updated weights on worker 0-0, policy_version 172684 (0.00089) [2022-07-09 08:41:48,655][26022] Updated weights on worker 0-0, policy_version 172694 (0.00077) [2022-07-09 08:41:49,516][25689] Fps is (10 sec: 5744.7, 60 sec: 5713.9, 300 sec: 5733.7). Total num frames: 176842752. Throughput: 0: 5995.5. Samples: 176841556. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:49,517][25689] Avg episode reward: [(0, '-52.725')] [2022-07-09 08:41:50,555][26022] Updated weights on worker 0-0, policy_version 172704 (0.00092) [2022-07-09 08:41:52,333][26022] Updated weights on worker 0-0, policy_version 172714 (0.00100) [2022-07-09 08:41:54,185][26022] Updated weights on worker 0-0, policy_version 172724 (0.00090) [2022-07-09 08:41:54,538][25689] Fps is (10 sec: 5702.0, 60 sec: 5697.7, 300 sec: 5734.8). Total num frames: 176871424. Throughput: 0: 5957.1. Samples: 176875544. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:54,538][25689] Avg episode reward: [(0, '-52.638')] [2022-07-09 08:41:55,904][26022] Updated weights on worker 0-0, policy_version 172734 (0.00084) [2022-07-09 08:41:57,712][26022] Updated weights on worker 0-0, policy_version 172744 (0.00090) [2022-07-09 08:41:59,387][26022] Updated weights on worker 0-0, policy_version 172754 (0.00087) [2022-07-09 08:41:59,546][25689] Fps is (10 sec: 5717.6, 60 sec: 5715.0, 300 sec: 5732.3). Total num frames: 176900096. Throughput: 0: 5964.7. Samples: 176910044. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:41:59,546][25689] Avg episode reward: [(0, '-52.246')] [2022-07-09 08:42:01,188][26022] Updated weights on worker 0-0, policy_version 172764 (0.00087) [2022-07-09 08:42:03,405][26022] Updated weights on worker 0-0, policy_version 172774 (0.00095) [2022-07-09 08:42:04,641][25689] Fps is (10 sec: 5473.2, 60 sec: 5696.3, 300 sec: 5727.3). Total num frames: 176926720. Throughput: 0: 5853.7. Samples: 176925046. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:04,641][25689] Avg episode reward: [(0, '-52.175')] [2022-07-09 08:42:05,184][26022] Updated weights on worker 0-0, policy_version 172784 (0.00083) [2022-07-09 08:42:06,997][26022] Updated weights on worker 0-0, policy_version 172794 (0.00094) [2022-07-09 08:42:08,745][26022] Updated weights on worker 0-0, policy_version 172804 (0.00084) [2022-07-09 08:42:09,708][25689] Fps is (10 sec: 5643.0, 60 sec: 5728.0, 300 sec: 5733.0). Total num frames: 176957440. Throughput: 0: 5855.3. Samples: 176959782. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:09,708][25689] Avg episode reward: [(0, '-52.799')] [2022-07-09 08:42:10,594][26022] Updated weights on worker 0-0, policy_version 172814 (0.00085) [2022-07-09 08:42:12,373][26022] Updated weights on worker 0-0, policy_version 172824 (0.00085) [2022-07-09 08:42:14,058][26022] Updated weights on worker 0-0, policy_version 172834 (0.00090) [2022-07-09 08:42:14,723][25689] Fps is (10 sec: 5891.0, 60 sec: 5749.0, 300 sec: 5736.3). Total num frames: 176986112. Throughput: 0: 5908.8. Samples: 176994812. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:14,723][25689] Avg episode reward: [(0, '-52.273')] [2022-07-09 08:42:15,710][26022] Updated weights on worker 0-0, policy_version 172844 (0.00096) [2022-07-09 08:42:17,488][26022] Updated weights on worker 0-0, policy_version 172854 (0.00088) [2022-07-09 08:42:19,347][26022] Updated weights on worker 0-0, policy_version 172864 (0.00080) [2022-07-09 08:42:19,739][25689] Fps is (10 sec: 5716.3, 60 sec: 5698.7, 300 sec: 5733.5). Total num frames: 177014784. Throughput: 0: 5076.4. Samples: 177012556. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:19,740][25689] Avg episode reward: [(0, '-52.351')] [2022-07-09 08:42:21,003][26022] Updated weights on worker 0-0, policy_version 172874 (0.00086) [2022-07-09 08:42:22,770][26022] Updated weights on worker 0-0, policy_version 172884 (0.00088) [2022-07-09 08:42:24,512][26022] Updated weights on worker 0-0, policy_version 172894 (0.00086) [2022-07-09 08:42:24,805][25689] Fps is (10 sec: 5789.0, 60 sec: 5737.0, 300 sec: 5740.3). Total num frames: 177044480. Throughput: 0: 6048.2. Samples: 177047004. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:24,807][25689] Avg episode reward: [(0, '-51.917')] [2022-07-09 08:42:26,281][26022] Updated weights on worker 0-0, policy_version 172904 (0.00100) [2022-07-09 08:42:28,135][26022] Updated weights on worker 0-0, policy_version 172914 (0.00083) [2022-07-09 08:42:29,642][26022] Updated weights on worker 0-0, policy_version 172924 (0.00091) [2022-07-09 08:42:29,844][25689] Fps is (10 sec: 5978.9, 60 sec: 5752.7, 300 sec: 5739.7). Total num frames: 177075200. Throughput: 0: 6068.6. Samples: 177081982. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:29,845][25689] Avg episode reward: [(0, '-51.928')] [2022-07-09 08:42:31,631][26022] Updated weights on worker 0-0, policy_version 172934 (0.00091) [2022-07-09 08:42:33,268][26022] Updated weights on worker 0-0, policy_version 172944 (0.00090) [2022-07-09 08:42:34,921][25689] Fps is (10 sec: 5769.9, 60 sec: 5728.8, 300 sec: 5742.1). Total num frames: 177102848. Throughput: 0: 5181.3. Samples: 177099466. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:34,922][25689] Avg episode reward: [(0, '-51.548')] [2022-07-09 08:42:35,205][26022] Updated weights on worker 0-0, policy_version 172954 (0.00086) [2022-07-09 08:42:36,932][26022] Updated weights on worker 0-0, policy_version 172964 (0.00188) [2022-07-09 08:42:38,655][26022] Updated weights on worker 0-0, policy_version 172974 (0.00076) [2022-07-09 08:42:39,945][25689] Fps is (10 sec: 5677.0, 60 sec: 5746.2, 300 sec: 5742.5). Total num frames: 177132544. Throughput: 0: 6031.5. Samples: 177134428. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:39,947][25689] Avg episode reward: [(0, '-51.634')] [2022-07-09 08:42:40,466][26022] Updated weights on worker 0-0, policy_version 172984 (0.00085) [2022-07-09 08:42:40,663][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:42:40,678][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000172985_177136640.pth [2022-07-09 08:42:40,678][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000170967_175070208.pth [2022-07-09 08:42:42,347][26022] Updated weights on worker 0-0, policy_version 172994 (0.00083) [2022-07-09 08:42:44,062][26022] Updated weights on worker 0-0, policy_version 173004 (0.00087) [2022-07-09 08:42:45,024][25689] Fps is (10 sec: 5777.2, 60 sec: 5733.7, 300 sec: 5741.1). Total num frames: 177161216. Throughput: 0: 6034.1. Samples: 177169008. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:45,025][25689] Avg episode reward: [(0, '-52.423')] [2022-07-09 08:42:45,931][26022] Updated weights on worker 0-0, policy_version 173014 (0.00092) [2022-07-09 08:42:47,557][26022] Updated weights on worker 0-0, policy_version 173024 (0.00087) [2022-07-09 08:42:49,555][26022] Updated weights on worker 0-0, policy_version 173034 (0.00087) [2022-07-09 08:42:50,072][25689] Fps is (10 sec: 5764.0, 60 sec: 5749.4, 300 sec: 5740.5). Total num frames: 177190912. Throughput: 0: 5149.9. Samples: 177186152. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:50,074][25689] Avg episode reward: [(0, '-52.327')] [2022-07-09 08:42:51,067][26022] Updated weights on worker 0-0, policy_version 173044 (0.00088) [2022-07-09 08:42:53,090][26022] Updated weights on worker 0-0, policy_version 173054 (0.00094) [2022-07-09 08:42:54,598][26022] Updated weights on worker 0-0, policy_version 173064 (0.00088) [2022-07-09 08:42:55,093][25689] Fps is (10 sec: 5898.9, 60 sec: 5766.4, 300 sec: 5743.8). Total num frames: 177220608. Throughput: 0: 6021.2. Samples: 177220922. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:42:55,098][25689] Avg episode reward: [(0, '-51.901')] [2022-07-09 08:42:56,521][26022] Updated weights on worker 0-0, policy_version 173074 (0.00086) [2022-07-09 08:42:58,000][26022] Updated weights on worker 0-0, policy_version 173084 (0.00086) [2022-07-09 08:43:00,096][26022] Updated weights on worker 0-0, policy_version 173094 (0.00094) [2022-07-09 08:43:00,175][25689] Fps is (10 sec: 5777.3, 60 sec: 5759.4, 300 sec: 5749.9). Total num frames: 177249280. Throughput: 0: 5994.3. Samples: 177255688. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:43:00,175][25689] Avg episode reward: [(0, '-52.037')] [2022-07-09 08:43:01,791][26022] Updated weights on worker 0-0, policy_version 173104 (0.00086) [2022-07-09 08:43:03,963][26022] Updated weights on worker 0-0, policy_version 173114 (0.00083) [2022-07-09 08:43:05,254][25689] Fps is (10 sec: 5340.8, 60 sec: 5743.9, 300 sec: 5732.1). Total num frames: 177274880. Throughput: 0: 5027.0. Samples: 177270706. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:43:05,255][25689] Avg episode reward: [(0, '-51.656')] [2022-07-09 08:43:05,525][26022] Updated weights on worker 0-0, policy_version 173124 (0.00095) [2022-07-09 08:43:07,667][26022] Updated weights on worker 0-0, policy_version 173134 (0.00086) [2022-07-09 08:43:09,215][26022] Updated weights on worker 0-0, policy_version 173144 (0.00087) [2022-07-09 08:43:10,265][25689] Fps is (10 sec: 5581.4, 60 sec: 5749.2, 300 sec: 5747.3). Total num frames: 177305600. Throughput: 0: 5911.1. Samples: 177305518. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:43:10,266][25689] Avg episode reward: [(0, '-51.069')] [2022-07-09 08:43:11,105][26022] Updated weights on worker 0-0, policy_version 173154 (0.00090) [2022-07-09 08:43:12,589][26022] Updated weights on worker 0-0, policy_version 173164 (0.00088) [2022-07-09 08:43:14,792][26022] Updated weights on worker 0-0, policy_version 173174 (0.00103) [2022-07-09 08:43:15,317][25689] Fps is (10 sec: 5800.3, 60 sec: 5728.8, 300 sec: 5736.0). Total num frames: 177333248. Throughput: 0: 5906.2. Samples: 177340372. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 08:43:15,318][25689] Avg episode reward: [(0, '-51.490')] [2022-07-09 08:43:16,318][26022] Updated weights on worker 0-0, policy_version 173184 (0.00090) [2022-07-09 08:43:18,305][26022] Updated weights on worker 0-0, policy_version 173194 (0.00095) [2022-07-09 08:43:19,753][26022] Updated weights on worker 0-0, policy_version 173204 (0.00087) [2022-07-09 08:43:20,333][25689] Fps is (10 sec: 5696.0, 60 sec: 5745.8, 300 sec: 5743.9). Total num frames: 177362944. Throughput: 0: 5053.5. Samples: 177357556. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:20,333][25689] Avg episode reward: [(0, '-51.478')] [2022-07-09 08:43:21,832][26022] Updated weights on worker 0-0, policy_version 173214 (0.00095) [2022-07-09 08:43:23,670][26022] Updated weights on worker 0-0, policy_version 173224 (0.00097) [2022-07-09 08:43:25,407][25689] Fps is (10 sec: 5785.1, 60 sec: 5728.2, 300 sec: 5740.6). Total num frames: 177391616. Throughput: 0: 5991.5. Samples: 177391448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:25,411][25689] Avg episode reward: [(0, '-51.501')] [2022-07-09 08:43:25,413][26022] Updated weights on worker 0-0, policy_version 173234 (0.00087) [2022-07-09 08:43:27,259][26022] Updated weights on worker 0-0, policy_version 173244 (0.00084) [2022-07-09 08:43:28,935][26022] Updated weights on worker 0-0, policy_version 173254 (0.00092) [2022-07-09 08:43:30,460][25689] Fps is (10 sec: 5561.4, 60 sec: 5676.2, 300 sec: 5734.5). Total num frames: 177419264. Throughput: 0: 5941.7. Samples: 177425506. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:30,460][25689] Avg episode reward: [(0, '-51.276')] [2022-07-09 08:43:30,778][26022] Updated weights on worker 0-0, policy_version 173264 (0.00085) [2022-07-09 08:43:32,902][26022] Updated weights on worker 0-0, policy_version 173274 (0.00086) [2022-07-09 08:43:34,287][26022] Updated weights on worker 0-0, policy_version 173284 (0.00092) [2022-07-09 08:43:35,470][25689] Fps is (10 sec: 5596.5, 60 sec: 5699.3, 300 sec: 5731.4). Total num frames: 177447936. Throughput: 0: 5076.4. Samples: 177442676. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:35,471][25689] Avg episode reward: [(0, '-51.408')] [2022-07-09 08:43:36,319][26022] Updated weights on worker 0-0, policy_version 173294 (0.00089) [2022-07-09 08:43:37,855][26022] Updated weights on worker 0-0, policy_version 173304 (0.00095) [2022-07-09 08:43:39,831][26022] Updated weights on worker 0-0, policy_version 173314 (0.00100) [2022-07-09 08:43:40,486][25689] Fps is (10 sec: 5821.6, 60 sec: 5700.1, 300 sec: 5732.9). Total num frames: 177477632. Throughput: 0: 5926.4. Samples: 177476992. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:40,486][25689] Avg episode reward: [(0, '-51.340')] [2022-07-09 08:43:41,668][26022] Updated weights on worker 0-0, policy_version 173324 (0.00073) [2022-07-09 08:43:43,427][26022] Updated weights on worker 0-0, policy_version 173334 (0.00085) [2022-07-09 08:43:45,139][26022] Updated weights on worker 0-0, policy_version 173344 (0.00086) [2022-07-09 08:43:45,620][25689] Fps is (10 sec: 5851.2, 60 sec: 5711.8, 300 sec: 5737.4). Total num frames: 177507328. Throughput: 0: 5938.3. Samples: 177511484. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:45,621][25689] Avg episode reward: [(0, '-51.336')] [2022-07-09 08:43:46,894][26022] Updated weights on worker 0-0, policy_version 173354 (0.00788) [2022-07-09 08:43:48,575][26022] Updated weights on worker 0-0, policy_version 173364 (0.00088) [2022-07-09 08:43:50,662][25689] Fps is (10 sec: 5635.3, 60 sec: 5678.6, 300 sec: 5724.0). Total num frames: 177534976. Throughput: 0: 5116.5. Samples: 177528870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:50,662][25689] Avg episode reward: [(0, '-51.438')] [2022-07-09 08:43:50,679][26022] Updated weights on worker 0-0, policy_version 173374 (0.00101) [2022-07-09 08:43:52,241][26022] Updated weights on worker 0-0, policy_version 173384 (0.00086) [2022-07-09 08:43:54,373][26022] Updated weights on worker 0-0, policy_version 173394 (0.00085) [2022-07-09 08:43:55,674][25689] Fps is (10 sec: 5602.1, 60 sec: 5662.5, 300 sec: 5727.6). Total num frames: 177563648. Throughput: 0: 5937.2. Samples: 177562630. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:43:55,674][25689] Avg episode reward: [(0, '-52.526')] [2022-07-09 08:43:56,128][26022] Updated weights on worker 0-0, policy_version 173404 (0.00104) [2022-07-09 08:43:57,917][26022] Updated weights on worker 0-0, policy_version 173414 (0.00094) [2022-07-09 08:43:59,423][26022] Updated weights on worker 0-0, policy_version 173424 (0.00085) [2022-07-09 08:44:00,729][25689] Fps is (10 sec: 5695.9, 60 sec: 5665.0, 300 sec: 5735.0). Total num frames: 177592320. Throughput: 0: 5932.8. Samples: 177597092. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:00,730][25689] Avg episode reward: [(0, '-52.726')] [2022-07-09 08:44:01,311][26022] Updated weights on worker 0-0, policy_version 173434 (0.00081) [2022-07-09 08:44:03,485][26022] Updated weights on worker 0-0, policy_version 173444 (0.00089) [2022-07-09 08:44:05,390][26022] Updated weights on worker 0-0, policy_version 173454 (0.00091) [2022-07-09 08:44:05,776][25689] Fps is (10 sec: 5574.8, 60 sec: 5701.9, 300 sec: 5724.1). Total num frames: 177619968. Throughput: 0: 4999.3. Samples: 177612250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:05,777][25689] Avg episode reward: [(0, '-52.742')] [2022-07-09 08:44:06,987][26022] Updated weights on worker 0-0, policy_version 173464 (0.00088) [2022-07-09 08:44:08,972][26022] Updated weights on worker 0-0, policy_version 173474 (0.00097) [2022-07-09 08:44:10,371][26022] Updated weights on worker 0-0, policy_version 173484 (0.00087) [2022-07-09 08:44:10,841][25689] Fps is (10 sec: 5670.8, 60 sec: 5679.9, 300 sec: 5730.0). Total num frames: 177649664. Throughput: 0: 5836.6. Samples: 177646652. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:10,842][25689] Avg episode reward: [(0, '-52.607')] [2022-07-09 08:44:12,590][26022] Updated weights on worker 0-0, policy_version 173494 (0.00093) [2022-07-09 08:44:14,132][26022] Updated weights on worker 0-0, policy_version 173504 (0.00079) [2022-07-09 08:44:15,887][25689] Fps is (10 sec: 5570.6, 60 sec: 5663.6, 300 sec: 5722.6). Total num frames: 177676288. Throughput: 0: 5860.0. Samples: 177681078. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:15,888][25689] Avg episode reward: [(0, '-52.687')] [2022-07-09 08:44:16,024][26022] Updated weights on worker 0-0, policy_version 173514 (0.00083) [2022-07-09 08:44:17,822][26022] Updated weights on worker 0-0, policy_version 173524 (0.00087) [2022-07-09 08:44:19,570][26022] Updated weights on worker 0-0, policy_version 173534 (0.00090) [2022-07-09 08:44:20,900][25689] Fps is (10 sec: 5599.1, 60 sec: 5663.8, 300 sec: 5723.5). Total num frames: 177705984. Throughput: 0: 5867.7. Samples: 177715448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:20,901][25689] Avg episode reward: [(0, '-52.433')] [2022-07-09 08:44:21,412][26022] Updated weights on worker 0-0, policy_version 173544 (0.00089) [2022-07-09 08:44:23,066][26022] Updated weights on worker 0-0, policy_version 173554 (0.00084) [2022-07-09 08:44:25,021][26022] Updated weights on worker 0-0, policy_version 173564 (0.00088) [2022-07-09 08:44:25,976][25689] Fps is (10 sec: 5988.2, 60 sec: 5697.4, 300 sec: 5725.6). Total num frames: 177736704. Throughput: 0: 5957.1. Samples: 177732580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:25,976][25689] Avg episode reward: [(0, '-53.035')] [2022-07-09 08:44:26,782][26022] Updated weights on worker 0-0, policy_version 173574 (0.00088) [2022-07-09 08:44:28,399][26022] Updated weights on worker 0-0, policy_version 173584 (0.00115) [2022-07-09 08:44:30,315][26022] Updated weights on worker 0-0, policy_version 173594 (0.00086) [2022-07-09 08:44:31,048][25689] Fps is (10 sec: 5752.0, 60 sec: 5695.7, 300 sec: 5720.8). Total num frames: 177764352. Throughput: 0: 5956.6. Samples: 177767012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:31,048][25689] Avg episode reward: [(0, '-52.855')] [2022-07-09 08:44:32,103][26022] Updated weights on worker 0-0, policy_version 173604 (0.00081) [2022-07-09 08:44:33,848][26022] Updated weights on worker 0-0, policy_version 173614 (0.00103) [2022-07-09 08:44:35,758][26022] Updated weights on worker 0-0, policy_version 173624 (0.00087) [2022-07-09 08:44:36,068][25689] Fps is (10 sec: 5479.1, 60 sec: 5677.8, 300 sec: 5717.1). Total num frames: 177792000. Throughput: 0: 5952.4. Samples: 177801206. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:36,068][25689] Avg episode reward: [(0, '-52.760')] [2022-07-09 08:44:37,431][26022] Updated weights on worker 0-0, policy_version 173634 (0.00742) [2022-07-09 08:44:39,388][26022] Updated weights on worker 0-0, policy_version 173644 (0.00084) [2022-07-09 08:44:40,699][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:44:40,714][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000173653_177820672.pth [2022-07-09 08:44:40,714][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000171637_175756288.pth [2022-07-09 08:44:40,857][26022] Updated weights on worker 0-0, policy_version 173654 (0.00084) [2022-07-09 08:44:41,101][25689] Fps is (10 sec: 5703.7, 60 sec: 5676.2, 300 sec: 5718.1). Total num frames: 177821696. Throughput: 0: 5100.6. Samples: 177818486. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:41,102][25689] Avg episode reward: [(0, '-52.934')] [2022-07-09 08:44:42,793][26022] Updated weights on worker 0-0, policy_version 173664 (0.00090) [2022-07-09 08:44:44,605][26022] Updated weights on worker 0-0, policy_version 173674 (0.00084) [2022-07-09 08:44:46,216][25689] Fps is (10 sec: 5852.3, 60 sec: 5678.0, 300 sec: 5719.5). Total num frames: 177851392. Throughput: 0: 5964.5. Samples: 177853306. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:46,217][25689] Avg episode reward: [(0, '-53.305')] [2022-07-09 08:44:46,347][26022] Updated weights on worker 0-0, policy_version 173684 (0.00088) [2022-07-09 08:44:48,367][26022] Updated weights on worker 0-0, policy_version 173694 (0.00084) [2022-07-09 08:44:49,751][26022] Updated weights on worker 0-0, policy_version 173704 (0.00101) [2022-07-09 08:44:51,277][25689] Fps is (10 sec: 5735.8, 60 sec: 5693.0, 300 sec: 5718.3). Total num frames: 177880064. Throughput: 0: 5988.8. Samples: 177888164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:51,278][25689] Avg episode reward: [(0, '-52.571')] [2022-07-09 08:44:51,841][26022] Updated weights on worker 0-0, policy_version 173714 (0.00088) [2022-07-09 08:44:53,517][26022] Updated weights on worker 0-0, policy_version 173724 (0.00087) [2022-07-09 08:44:55,234][26022] Updated weights on worker 0-0, policy_version 173734 (0.00094) [2022-07-09 08:44:56,297][25689] Fps is (10 sec: 5790.1, 60 sec: 5709.2, 300 sec: 5718.1). Total num frames: 177909760. Throughput: 0: 5140.8. Samples: 177905200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:44:56,298][25689] Avg episode reward: [(0, '-52.230')] [2022-07-09 08:44:57,287][26022] Updated weights on worker 0-0, policy_version 173744 (0.00095) [2022-07-09 08:44:58,695][26022] Updated weights on worker 0-0, policy_version 173754 (0.00087) [2022-07-09 08:45:00,736][26022] Updated weights on worker 0-0, policy_version 173764 (0.00083) [2022-07-09 08:45:01,332][25689] Fps is (10 sec: 5805.1, 60 sec: 5711.2, 300 sec: 5722.2). Total num frames: 177938432. Throughput: 0: 5996.4. Samples: 177939796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:45:01,333][25689] Avg episode reward: [(0, '-51.941')] [2022-07-09 08:45:02,529][26022] Updated weights on worker 0-0, policy_version 173774 (0.00101) [2022-07-09 08:45:04,543][26022] Updated weights on worker 0-0, policy_version 173784 (0.00090) [2022-07-09 08:45:06,293][26022] Updated weights on worker 0-0, policy_version 173794 (0.00085) [2022-07-09 08:45:06,396][25689] Fps is (10 sec: 5576.8, 60 sec: 5709.6, 300 sec: 5719.1). Total num frames: 177966080. Throughput: 0: 5890.3. Samples: 177972170. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:45:06,397][25689] Avg episode reward: [(0, '-52.554')] [2022-07-09 08:45:08,175][26022] Updated weights on worker 0-0, policy_version 173804 (0.00096) [2022-07-09 08:45:09,960][26022] Updated weights on worker 0-0, policy_version 173814 (0.00086) [2022-07-09 08:45:11,467][25689] Fps is (10 sec: 5355.2, 60 sec: 5658.4, 300 sec: 5707.5). Total num frames: 177992704. Throughput: 0: 5017.9. Samples: 177989470. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:45:11,467][25689] Avg episode reward: [(0, '-51.652')] [2022-07-09 08:45:11,720][26022] Updated weights on worker 0-0, policy_version 173824 (0.00078) [2022-07-09 08:45:13,462][26022] Updated weights on worker 0-0, policy_version 173834 (0.00093) [2022-07-09 08:45:15,146][26022] Updated weights on worker 0-0, policy_version 173844 (0.00053) [2022-07-09 08:45:16,503][25689] Fps is (10 sec: 5572.7, 60 sec: 5709.9, 300 sec: 5714.1). Total num frames: 178022400. Throughput: 0: 5874.9. Samples: 178023906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:45:16,503][25689] Avg episode reward: [(0, '-51.441')] [2022-07-09 08:45:17,075][26022] Updated weights on worker 0-0, policy_version 173854 (0.00088) [2022-07-09 08:45:18,842][26022] Updated weights on worker 0-0, policy_version 173864 (0.00089) [2022-07-09 08:45:20,593][26022] Updated weights on worker 0-0, policy_version 173874 (0.00092) [2022-07-09 08:45:21,546][25689] Fps is (10 sec: 5994.2, 60 sec: 5724.0, 300 sec: 5714.4). Total num frames: 178053120. Throughput: 0: 5867.9. Samples: 178058408. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:45:21,546][25689] Avg episode reward: [(0, '-52.215')] [2022-07-09 08:45:22,492][26022] Updated weights on worker 0-0, policy_version 173884 (0.00060) [2022-07-09 08:45:24,231][26022] Updated weights on worker 0-0, policy_version 173894 (0.00114) [2022-07-09 08:45:26,027][26022] Updated weights on worker 0-0, policy_version 173904 (0.00094) [2022-07-09 08:45:26,580][25689] Fps is (10 sec: 5792.2, 60 sec: 5677.3, 300 sec: 5710.6). Total num frames: 178080768. Throughput: 0: 5117.1. Samples: 178075452. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-09 08:45:26,580][25689] Avg episode reward: [(0, '-52.039')] [2022-07-09 08:45:27,739][26022] Updated weights on worker 0-0, policy_version 173914 (0.00084) [2022-07-09 08:45:29,568][26022] Updated weights on worker 0-0, policy_version 173924 (0.00086) [2022-07-09 08:45:31,520][26022] Updated weights on worker 0-0, policy_version 173934 (0.00086) [2022-07-09 08:45:31,616][25689] Fps is (10 sec: 5490.9, 60 sec: 5680.6, 300 sec: 5706.6). Total num frames: 178108416. Throughput: 0: 5965.6. Samples: 178109674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:45:31,617][25689] Avg episode reward: [(0, '-51.319')] [2022-07-09 08:45:33,195][26022] Updated weights on worker 0-0, policy_version 173944 (0.00086) [2022-07-09 08:45:34,961][26022] Updated weights on worker 0-0, policy_version 173954 (0.00087) [2022-07-09 08:45:36,652][25689] Fps is (10 sec: 5693.3, 60 sec: 5712.9, 300 sec: 5706.4). Total num frames: 178138112. Throughput: 0: 5980.2. Samples: 178144402. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:45:36,652][25689] Avg episode reward: [(0, '-51.578')] [2022-07-09 08:45:36,672][26022] Updated weights on worker 0-0, policy_version 173964 (0.00091) [2022-07-09 08:45:38,557][26022] Updated weights on worker 0-0, policy_version 173974 (0.00088) [2022-07-09 08:45:40,270][26022] Updated weights on worker 0-0, policy_version 173984 (0.00089) [2022-07-09 08:45:41,666][25689] Fps is (10 sec: 5706.2, 60 sec: 5681.0, 300 sec: 5705.1). Total num frames: 178165760. Throughput: 0: 5113.1. Samples: 178161284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:45:41,666][25689] Avg episode reward: [(0, '-51.702')] [2022-07-09 08:45:42,044][26022] Updated weights on worker 0-0, policy_version 173994 (0.00088) [2022-07-09 08:45:44,209][26022] Updated weights on worker 0-0, policy_version 174004 (0.00093) [2022-07-09 08:45:45,678][26022] Updated weights on worker 0-0, policy_version 174014 (0.00096) [2022-07-09 08:45:46,810][25689] Fps is (10 sec: 5645.0, 60 sec: 5678.2, 300 sec: 5703.3). Total num frames: 178195456. Throughput: 0: 5934.6. Samples: 178195512. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:45:46,811][25689] Avg episode reward: [(0, '-51.553')] [2022-07-09 08:45:47,566][26022] Updated weights on worker 0-0, policy_version 174024 (0.00090) [2022-07-09 08:45:49,356][26022] Updated weights on worker 0-0, policy_version 174034 (0.00081) [2022-07-09 08:45:51,196][26022] Updated weights on worker 0-0, policy_version 174044 (0.00096) [2022-07-09 08:45:51,847][25689] Fps is (10 sec: 5732.8, 60 sec: 5680.5, 300 sec: 5699.7). Total num frames: 178224128. Throughput: 0: 5925.4. Samples: 178229550. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:45:51,848][25689] Avg episode reward: [(0, '-51.862')] [2022-07-09 08:45:52,851][26022] Updated weights on worker 0-0, policy_version 174054 (0.00095) [2022-07-09 08:45:54,716][26022] Updated weights on worker 0-0, policy_version 174064 (0.00081) [2022-07-09 08:45:56,625][26022] Updated weights on worker 0-0, policy_version 174074 (0.00089) [2022-07-09 08:45:56,850][25689] Fps is (10 sec: 5711.8, 60 sec: 5665.2, 300 sec: 5703.3). Total num frames: 178252800. Throughput: 0: 5052.8. Samples: 178246458. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:45:56,850][25689] Avg episode reward: [(0, '-52.541')] [2022-07-09 08:45:58,325][26022] Updated weights on worker 0-0, policy_version 174084 (0.00090) [2022-07-09 08:46:00,241][26022] Updated weights on worker 0-0, policy_version 174094 (0.00087) [2022-07-09 08:46:01,871][25689] Fps is (10 sec: 5720.4, 60 sec: 5666.4, 300 sec: 5707.8). Total num frames: 178281472. Throughput: 0: 5909.8. Samples: 178280696. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:01,872][25689] Avg episode reward: [(0, '-52.836')] [2022-07-09 08:46:02,144][26022] Updated weights on worker 0-0, policy_version 174104 (0.00086) [2022-07-09 08:46:04,140][26022] Updated weights on worker 0-0, policy_version 174114 (0.00091) [2022-07-09 08:46:05,873][26022] Updated weights on worker 0-0, policy_version 174124 (0.00090) [2022-07-09 08:46:07,000][25689] Fps is (10 sec: 5447.6, 60 sec: 5643.4, 300 sec: 5699.3). Total num frames: 178308096. Throughput: 0: 5819.6. Samples: 178313012. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:07,001][25689] Avg episode reward: [(0, '-52.164')] [2022-07-09 08:46:07,735][26022] Updated weights on worker 0-0, policy_version 174134 (0.00089) [2022-07-09 08:46:09,455][26022] Updated weights on worker 0-0, policy_version 174144 (0.00084) [2022-07-09 08:46:11,419][26022] Updated weights on worker 0-0, policy_version 174154 (0.00088) [2022-07-09 08:46:12,031][25689] Fps is (10 sec: 5543.7, 60 sec: 5697.9, 300 sec: 5706.7). Total num frames: 178337792. Throughput: 0: 4962.3. Samples: 178329708. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:12,032][25689] Avg episode reward: [(0, '-51.866')] [2022-07-09 08:46:13,072][26022] Updated weights on worker 0-0, policy_version 174164 (0.00087) [2022-07-09 08:46:15,007][26022] Updated weights on worker 0-0, policy_version 174174 (0.00084) [2022-07-09 08:46:16,614][26022] Updated weights on worker 0-0, policy_version 174184 (0.00092) [2022-07-09 08:46:17,055][25689] Fps is (10 sec: 5804.9, 60 sec: 5682.1, 300 sec: 5696.3). Total num frames: 178366464. Throughput: 0: 5829.1. Samples: 178364238. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:17,056][25689] Avg episode reward: [(0, '-52.704')] [2022-07-09 08:46:18,634][26022] Updated weights on worker 0-0, policy_version 174194 (0.00085) [2022-07-09 08:46:20,362][26022] Updated weights on worker 0-0, policy_version 174204 (0.00051) [2022-07-09 08:46:22,070][25689] Fps is (10 sec: 5609.9, 60 sec: 5634.0, 300 sec: 5698.2). Total num frames: 178394112. Throughput: 0: 5843.8. Samples: 178398732. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:22,071][25689] Avg episode reward: [(0, '-52.740')] [2022-07-09 08:46:22,174][26022] Updated weights on worker 0-0, policy_version 174214 (0.00090) [2022-07-09 08:46:23,802][26022] Updated weights on worker 0-0, policy_version 174224 (0.00090) [2022-07-09 08:46:25,687][26022] Updated weights on worker 0-0, policy_version 174234 (0.00089) [2022-07-09 08:46:27,149][25689] Fps is (10 sec: 5782.8, 60 sec: 5680.5, 300 sec: 5700.6). Total num frames: 178424832. Throughput: 0: 5945.0. Samples: 178432794. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:27,149][25689] Avg episode reward: [(0, '-52.922')] [2022-07-09 08:46:27,325][26022] Updated weights on worker 0-0, policy_version 174244 (0.00079) [2022-07-09 08:46:29,331][26022] Updated weights on worker 0-0, policy_version 174254 (0.00090) [2022-07-09 08:46:31,125][26022] Updated weights on worker 0-0, policy_version 174264 (0.00077) [2022-07-09 08:46:32,182][25689] Fps is (10 sec: 5670.8, 60 sec: 5663.9, 300 sec: 5693.2). Total num frames: 178451456. Throughput: 0: 5964.5. Samples: 178449902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:32,184][25689] Avg episode reward: [(0, '-53.166')] [2022-07-09 08:46:32,794][26022] Updated weights on worker 0-0, policy_version 174274 (0.00086) [2022-07-09 08:46:34,690][26022] Updated weights on worker 0-0, policy_version 174284 (0.00086) [2022-07-09 08:46:36,483][26022] Updated weights on worker 0-0, policy_version 174294 (0.00093) [2022-07-09 08:46:37,196][25689] Fps is (10 sec: 5503.3, 60 sec: 5649.0, 300 sec: 5693.5). Total num frames: 178480128. Throughput: 0: 5950.2. Samples: 178484082. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:37,197][25689] Avg episode reward: [(0, '-52.867')] [2022-07-09 08:46:38,318][26022] Updated weights on worker 0-0, policy_version 174304 (0.00089) [2022-07-09 08:46:40,215][26022] Updated weights on worker 0-0, policy_version 174314 (0.00085) [2022-07-09 08:46:40,750][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:46:40,760][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000174317_178500608.pth [2022-07-09 08:46:40,761][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000172312_176447488.pth [2022-07-09 08:46:41,573][26022] Updated weights on worker 0-0, policy_version 174324 (0.00090) [2022-07-09 08:46:42,248][25689] Fps is (10 sec: 5798.5, 60 sec: 5679.3, 300 sec: 5694.9). Total num frames: 178509824. Throughput: 0: 5925.0. Samples: 178518288. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:42,249][25689] Avg episode reward: [(0, '-52.776')] [2022-07-09 08:46:43,758][26022] Updated weights on worker 0-0, policy_version 174334 (0.00091) [2022-07-09 08:46:45,370][26022] Updated weights on worker 0-0, policy_version 174344 (0.00066) [2022-07-09 08:46:47,114][26022] Updated weights on worker 0-0, policy_version 174354 (0.00089) [2022-07-09 08:46:47,289][25689] Fps is (10 sec: 5783.1, 60 sec: 5672.0, 300 sec: 5694.8). Total num frames: 178538496. Throughput: 0: 5099.7. Samples: 178535506. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:47,290][25689] Avg episode reward: [(0, '-52.258')] [2022-07-09 08:46:48,868][26022] Updated weights on worker 0-0, policy_version 174364 (0.00094) [2022-07-09 08:46:50,551][26022] Updated weights on worker 0-0, policy_version 174374 (0.00088) [2022-07-09 08:46:52,291][25689] Fps is (10 sec: 5812.2, 60 sec: 5692.3, 300 sec: 5698.6). Total num frames: 178568192. Throughput: 0: 6000.4. Samples: 178570560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:52,291][25689] Avg episode reward: [(0, '-51.758')] [2022-07-09 08:46:52,363][26022] Updated weights on worker 0-0, policy_version 174384 (0.00090) [2022-07-09 08:46:54,257][26022] Updated weights on worker 0-0, policy_version 174394 (0.00611) [2022-07-09 08:46:56,023][26022] Updated weights on worker 0-0, policy_version 174404 (0.00095) [2022-07-09 08:46:57,307][25689] Fps is (10 sec: 5826.5, 60 sec: 5691.0, 300 sec: 5698.4). Total num frames: 178596864. Throughput: 0: 6040.2. Samples: 178605554. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:46:57,307][25689] Avg episode reward: [(0, '-51.115')] [2022-07-09 08:46:57,559][26022] Updated weights on worker 0-0, policy_version 174414 (0.00087) [2022-07-09 08:46:59,503][26022] Updated weights on worker 0-0, policy_version 174424 (0.00084) [2022-07-09 08:47:01,235][26022] Updated weights on worker 0-0, policy_version 174434 (0.00087) [2022-07-09 08:47:02,312][25689] Fps is (10 sec: 5517.7, 60 sec: 5658.7, 300 sec: 5700.1). Total num frames: 178623488. Throughput: 0: 5211.4. Samples: 178622848. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:02,313][25689] Avg episode reward: [(0, '-51.754')] [2022-07-09 08:47:03,574][26022] Updated weights on worker 0-0, policy_version 174444 (0.00081) [2022-07-09 08:47:05,046][26022] Updated weights on worker 0-0, policy_version 174454 (0.00083) [2022-07-09 08:47:06,962][26022] Updated weights on worker 0-0, policy_version 174464 (0.00092) [2022-07-09 08:47:07,429][25689] Fps is (10 sec: 5564.4, 60 sec: 5710.7, 300 sec: 5695.8). Total num frames: 178653184. Throughput: 0: 5963.7. Samples: 178655610. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:07,431][25689] Avg episode reward: [(0, '-52.112')] [2022-07-09 08:47:08,810][26022] Updated weights on worker 0-0, policy_version 174474 (0.00089) [2022-07-09 08:47:10,498][26022] Updated weights on worker 0-0, policy_version 174484 (0.00102) [2022-07-09 08:47:12,249][26022] Updated weights on worker 0-0, policy_version 174494 (0.00084) [2022-07-09 08:47:12,455][25689] Fps is (10 sec: 5855.4, 60 sec: 5711.0, 300 sec: 5699.0). Total num frames: 178682880. Throughput: 0: 5940.4. Samples: 178690346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:12,456][25689] Avg episode reward: [(0, '-51.447')] [2022-07-09 08:47:14,207][26022] Updated weights on worker 0-0, policy_version 174504 (0.00082) [2022-07-09 08:47:15,679][26022] Updated weights on worker 0-0, policy_version 174514 (0.00096) [2022-07-09 08:47:17,485][25689] Fps is (10 sec: 5702.1, 60 sec: 5693.6, 300 sec: 5695.3). Total num frames: 178710528. Throughput: 0: 5055.7. Samples: 178707570. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:17,486][25689] Avg episode reward: [(0, '-51.339')] [2022-07-09 08:47:17,860][26022] Updated weights on worker 0-0, policy_version 174524 (0.00082) [2022-07-09 08:47:19,284][26022] Updated weights on worker 0-0, policy_version 174534 (0.00083) [2022-07-09 08:47:21,219][26022] Updated weights on worker 0-0, policy_version 174544 (0.00091) [2022-07-09 08:47:22,580][25689] Fps is (10 sec: 5663.7, 60 sec: 5719.9, 300 sec: 5694.7). Total num frames: 178740224. Throughput: 0: 5886.9. Samples: 178742164. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:22,581][25689] Avg episode reward: [(0, '-51.621')] [2022-07-09 08:47:23,212][26022] Updated weights on worker 0-0, policy_version 174554 (0.00089) [2022-07-09 08:47:24,707][26022] Updated weights on worker 0-0, policy_version 174564 (0.00091) [2022-07-09 08:47:26,624][26022] Updated weights on worker 0-0, policy_version 174574 (0.00086) [2022-07-09 08:47:27,673][25689] Fps is (10 sec: 5930.0, 60 sec: 5718.5, 300 sec: 5693.7). Total num frames: 178770944. Throughput: 0: 5962.8. Samples: 178776326. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:27,674][25689] Avg episode reward: [(0, '-52.185')] [2022-07-09 08:47:28,325][26022] Updated weights on worker 0-0, policy_version 174584 (0.00079) [2022-07-09 08:47:30,223][26022] Updated weights on worker 0-0, policy_version 174594 (0.00086) [2022-07-09 08:47:32,077][26022] Updated weights on worker 0-0, policy_version 174604 (0.00068) [2022-07-09 08:47:32,763][25689] Fps is (10 sec: 5631.6, 60 sec: 5713.2, 300 sec: 5690.0). Total num frames: 178797568. Throughput: 0: 5081.7. Samples: 178793542. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 08:47:32,763][25689] Avg episode reward: [(0, '-53.317')] [2022-07-09 08:47:33,652][26022] Updated weights on worker 0-0, policy_version 174614 (0.00085) [2022-07-09 08:47:35,552][26022] Updated weights on worker 0-0, policy_version 174624 (0.00087) [2022-07-09 08:47:37,231][26022] Updated weights on worker 0-0, policy_version 174634 (0.00090) [2022-07-09 08:47:37,783][25689] Fps is (10 sec: 5672.4, 60 sec: 5746.5, 300 sec: 5693.5). Total num frames: 178828288. Throughput: 0: 5948.4. Samples: 178828308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:47:37,783][25689] Avg episode reward: [(0, '-52.804')] [2022-07-09 08:47:39,120][26022] Updated weights on worker 0-0, policy_version 174644 (0.00088) [2022-07-09 08:47:40,810][26022] Updated weights on worker 0-0, policy_version 174654 (0.00090) [2022-07-09 08:47:42,832][26022] Updated weights on worker 0-0, policy_version 174664 (0.00080) [2022-07-09 08:47:42,852][25689] Fps is (10 sec: 5683.7, 60 sec: 5694.2, 300 sec: 5686.8). Total num frames: 178854912. Throughput: 0: 5936.7. Samples: 178862512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:47:42,853][25689] Avg episode reward: [(0, '-53.135')] [2022-07-09 08:47:44,323][26022] Updated weights on worker 0-0, policy_version 174674 (0.00089) [2022-07-09 08:47:46,344][26022] Updated weights on worker 0-0, policy_version 174684 (0.00083) [2022-07-09 08:47:47,733][26022] Updated weights on worker 0-0, policy_version 174694 (0.00084) [2022-07-09 08:47:47,977][25689] Fps is (10 sec: 5725.6, 60 sec: 5736.9, 300 sec: 5692.2). Total num frames: 178886656. Throughput: 0: 5110.1. Samples: 178880080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:47:47,977][25689] Avg episode reward: [(0, '-53.288')] [2022-07-09 08:47:49,739][26022] Updated weights on worker 0-0, policy_version 174704 (0.00088) [2022-07-09 08:47:51,622][26022] Updated weights on worker 0-0, policy_version 174714 (0.00087) [2022-07-09 08:47:52,994][25689] Fps is (10 sec: 6058.1, 60 sec: 5735.4, 300 sec: 5692.3). Total num frames: 178916352. Throughput: 0: 5995.4. Samples: 178914836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:47:52,994][25689] Avg episode reward: [(0, '-53.446')] [2022-07-09 08:47:53,148][26022] Updated weights on worker 0-0, policy_version 174724 (0.00084) [2022-07-09 08:47:55,210][26022] Updated weights on worker 0-0, policy_version 174734 (0.00090) [2022-07-09 08:47:56,731][26022] Updated weights on worker 0-0, policy_version 174744 (0.00085) [2022-07-09 08:47:58,002][25689] Fps is (10 sec: 5720.1, 60 sec: 5719.3, 300 sec: 5690.3). Total num frames: 178944000. Throughput: 0: 5999.1. Samples: 178949606. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:47:58,007][25689] Avg episode reward: [(0, '-52.891')] [2022-07-09 08:47:58,526][26022] Updated weights on worker 0-0, policy_version 174754 (0.00081) [2022-07-09 08:48:00,585][26022] Updated weights on worker 0-0, policy_version 174764 (0.00084) [2022-07-09 08:48:02,324][26022] Updated weights on worker 0-0, policy_version 174774 (0.00084) [2022-07-09 08:48:03,048][25689] Fps is (10 sec: 5499.9, 60 sec: 5732.4, 300 sec: 5697.8). Total num frames: 178971648. Throughput: 0: 5181.4. Samples: 178967154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:03,048][25689] Avg episode reward: [(0, '-52.971')] [2022-07-09 08:48:04,318][26022] Updated weights on worker 0-0, policy_version 174784 (0.00087) [2022-07-09 08:48:05,771][26022] Updated weights on worker 0-0, policy_version 174794 (0.00087) [2022-07-09 08:48:08,051][26022] Updated weights on worker 0-0, policy_version 174804 (0.00090) [2022-07-09 08:48:08,151][25689] Fps is (10 sec: 5448.5, 60 sec: 5699.9, 300 sec: 5685.7). Total num frames: 178999296. Throughput: 0: 5930.9. Samples: 178999730. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:08,151][25689] Avg episode reward: [(0, '-52.571')] [2022-07-09 08:48:09,709][26022] Updated weights on worker 0-0, policy_version 174814 (0.00091) [2022-07-09 08:48:11,540][26022] Updated weights on worker 0-0, policy_version 174824 (0.00081) [2022-07-09 08:48:13,134][26022] Updated weights on worker 0-0, policy_version 174834 (0.00082) [2022-07-09 08:48:13,161][25689] Fps is (10 sec: 5771.3, 60 sec: 5718.3, 300 sec: 5696.8). Total num frames: 179030016. Throughput: 0: 5914.2. Samples: 179034110. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:13,162][25689] Avg episode reward: [(0, '-51.895')] [2022-07-09 08:48:15,127][26022] Updated weights on worker 0-0, policy_version 174844 (0.00086) [2022-07-09 08:48:16,560][26022] Updated weights on worker 0-0, policy_version 174854 (0.00084) [2022-07-09 08:48:18,210][25689] Fps is (10 sec: 5904.2, 60 sec: 5733.4, 300 sec: 5692.8). Total num frames: 179058688. Throughput: 0: 5036.1. Samples: 179051376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:18,211][25689] Avg episode reward: [(0, '-51.464')] [2022-07-09 08:48:18,708][26022] Updated weights on worker 0-0, policy_version 174864 (0.00084) [2022-07-09 08:48:20,289][26022] Updated weights on worker 0-0, policy_version 174874 (0.00082) [2022-07-09 08:48:22,171][26022] Updated weights on worker 0-0, policy_version 174884 (0.00082) [2022-07-09 08:48:23,227][25689] Fps is (10 sec: 5697.2, 60 sec: 5723.9, 300 sec: 5693.9). Total num frames: 179087360. Throughput: 0: 5889.7. Samples: 179086002. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:23,227][25689] Avg episode reward: [(0, '-50.958')] [2022-07-09 08:48:23,713][26022] Updated weights on worker 0-0, policy_version 174894 (0.00087) [2022-07-09 08:48:25,619][26022] Updated weights on worker 0-0, policy_version 174904 (0.00086) [2022-07-09 08:48:27,416][26022] Updated weights on worker 0-0, policy_version 174914 (0.00054) [2022-07-09 08:48:28,273][25689] Fps is (10 sec: 5698.6, 60 sec: 5694.5, 300 sec: 5697.4). Total num frames: 179116032. Throughput: 0: 5988.9. Samples: 179120240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:28,274][25689] Avg episode reward: [(0, '-51.197')] [2022-07-09 08:48:29,292][26022] Updated weights on worker 0-0, policy_version 174924 (0.00083) [2022-07-09 08:48:31,143][26022] Updated weights on worker 0-0, policy_version 174934 (0.00105) [2022-07-09 08:48:32,863][26022] Updated weights on worker 0-0, policy_version 174944 (0.00090) [2022-07-09 08:48:33,279][25689] Fps is (10 sec: 5704.7, 60 sec: 5736.3, 300 sec: 5697.5). Total num frames: 179144704. Throughput: 0: 5126.5. Samples: 179137240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:33,279][25689] Avg episode reward: [(0, '-51.830')] [2022-07-09 08:48:34,666][26022] Updated weights on worker 0-0, policy_version 174954 (0.00089) [2022-07-09 08:48:36,359][26022] Updated weights on worker 0-0, policy_version 174964 (0.00086) [2022-07-09 08:48:38,290][25689] Fps is (10 sec: 5622.3, 60 sec: 5686.3, 300 sec: 5690.7). Total num frames: 179172352. Throughput: 0: 5966.6. Samples: 179171186. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:38,291][25689] Avg episode reward: [(0, '-51.989')] [2022-07-09 08:48:38,348][26022] Updated weights on worker 0-0, policy_version 174974 (0.00094) [2022-07-09 08:48:40,095][26022] Updated weights on worker 0-0, policy_version 174984 (0.00090) [2022-07-09 08:48:40,764][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:48:40,773][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000174987_179186688.pth [2022-07-09 08:48:40,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000172985_177136640.pth [2022-07-09 08:48:42,001][26022] Updated weights on worker 0-0, policy_version 174994 (0.00090) [2022-07-09 08:48:43,293][25689] Fps is (10 sec: 5623.9, 60 sec: 5726.5, 300 sec: 5689.8). Total num frames: 179201024. Throughput: 0: 5945.2. Samples: 179205300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:43,295][25689] Avg episode reward: [(0, '-51.697')] [2022-07-09 08:48:43,611][26022] Updated weights on worker 0-0, policy_version 175004 (0.00692) [2022-07-09 08:48:45,450][26022] Updated weights on worker 0-0, policy_version 175014 (0.00087) [2022-07-09 08:48:47,111][26022] Updated weights on worker 0-0, policy_version 175024 (0.01281) [2022-07-09 08:48:48,424][25689] Fps is (10 sec: 5759.8, 60 sec: 5692.0, 300 sec: 5695.0). Total num frames: 179230720. Throughput: 0: 5949.5. Samples: 179240128. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:48,424][25689] Avg episode reward: [(0, '-51.838')] [2022-07-09 08:48:49,187][26022] Updated weights on worker 0-0, policy_version 175034 (0.00439) [2022-07-09 08:48:50,583][26022] Updated weights on worker 0-0, policy_version 175044 (0.00086) [2022-07-09 08:48:52,676][26022] Updated weights on worker 0-0, policy_version 175054 (0.00096) [2022-07-09 08:48:53,476][25689] Fps is (10 sec: 5732.0, 60 sec: 5671.8, 300 sec: 5694.2). Total num frames: 179259392. Throughput: 0: 5934.5. Samples: 179257100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:53,477][25689] Avg episode reward: [(0, '-52.544')] [2022-07-09 08:48:54,307][26022] Updated weights on worker 0-0, policy_version 175064 (0.00083) [2022-07-09 08:48:56,259][26022] Updated weights on worker 0-0, policy_version 175074 (0.00088) [2022-07-09 08:48:57,971][26022] Updated weights on worker 0-0, policy_version 175084 (0.00084) [2022-07-09 08:48:58,489][25689] Fps is (10 sec: 5697.2, 60 sec: 5688.2, 300 sec: 5695.0). Total num frames: 179288064. Throughput: 0: 5944.8. Samples: 179291264. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:48:58,491][25689] Avg episode reward: [(0, '-53.264')] [2022-07-09 08:49:00,047][26022] Updated weights on worker 0-0, policy_version 175094 (0.00081) [2022-07-09 08:49:01,690][26022] Updated weights on worker 0-0, policy_version 175104 (0.00092) [2022-07-09 08:49:03,556][25689] Fps is (10 sec: 5485.6, 60 sec: 5669.3, 300 sec: 5691.2). Total num frames: 179314688. Throughput: 0: 5853.6. Samples: 179323912. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:03,557][25689] Avg episode reward: [(0, '-53.041')] [2022-07-09 08:49:03,902][26022] Updated weights on worker 0-0, policy_version 175114 (0.00083) [2022-07-09 08:49:05,366][26022] Updated weights on worker 0-0, policy_version 175124 (0.00086) [2022-07-09 08:49:07,372][26022] Updated weights on worker 0-0, policy_version 175134 (0.00090) [2022-07-09 08:49:08,629][25689] Fps is (10 sec: 5554.1, 60 sec: 5706.0, 300 sec: 5691.0). Total num frames: 179344384. Throughput: 0: 4992.1. Samples: 179340996. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:08,630][25689] Avg episode reward: [(0, '-53.233')] [2022-07-09 08:49:09,031][26022] Updated weights on worker 0-0, policy_version 175144 (0.00086) [2022-07-09 08:49:10,799][26022] Updated weights on worker 0-0, policy_version 175154 (0.00085) [2022-07-09 08:49:12,697][26022] Updated weights on worker 0-0, policy_version 175164 (0.00084) [2022-07-09 08:49:13,672][25689] Fps is (10 sec: 5871.2, 60 sec: 5686.0, 300 sec: 5701.4). Total num frames: 179374080. Throughput: 0: 5861.2. Samples: 179375474. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:13,673][25689] Avg episode reward: [(0, '-52.483')] [2022-07-09 08:49:14,376][26022] Updated weights on worker 0-0, policy_version 175174 (0.00091) [2022-07-09 08:49:16,239][26022] Updated weights on worker 0-0, policy_version 175184 (0.00085) [2022-07-09 08:49:18,128][26022] Updated weights on worker 0-0, policy_version 175194 (0.00077) [2022-07-09 08:49:18,699][25689] Fps is (10 sec: 5592.8, 60 sec: 5654.2, 300 sec: 5690.8). Total num frames: 179400704. Throughput: 0: 5890.6. Samples: 179410314. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:18,700][25689] Avg episode reward: [(0, '-52.477')] [2022-07-09 08:49:19,548][26022] Updated weights on worker 0-0, policy_version 175204 (0.00090) [2022-07-09 08:49:21,835][26022] Updated weights on worker 0-0, policy_version 175214 (0.00082) [2022-07-09 08:49:23,259][26022] Updated weights on worker 0-0, policy_version 175224 (0.00087) [2022-07-09 08:49:23,735][25689] Fps is (10 sec: 5698.6, 60 sec: 5686.2, 300 sec: 5691.6). Total num frames: 179431424. Throughput: 0: 5128.2. Samples: 179427392. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:23,736][25689] Avg episode reward: [(0, '-52.062')] [2022-07-09 08:49:25,114][26022] Updated weights on worker 0-0, policy_version 175234 (0.00095) [2022-07-09 08:49:26,931][26022] Updated weights on worker 0-0, policy_version 175244 (0.00084) [2022-07-09 08:49:28,626][26022] Updated weights on worker 0-0, policy_version 175254 (0.00081) [2022-07-09 08:49:28,782][25689] Fps is (10 sec: 5991.9, 60 sec: 5703.1, 300 sec: 5698.9). Total num frames: 179461120. Throughput: 0: 5975.0. Samples: 179461410. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:28,783][25689] Avg episode reward: [(0, '-51.628')] [2022-07-09 08:49:30,722][26022] Updated weights on worker 0-0, policy_version 175264 (0.00087) [2022-07-09 08:49:32,255][26022] Updated weights on worker 0-0, policy_version 175274 (0.00096) [2022-07-09 08:49:33,785][25689] Fps is (10 sec: 5603.9, 60 sec: 5669.5, 300 sec: 5695.8). Total num frames: 179487744. Throughput: 0: 5995.3. Samples: 179496058. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:33,785][25689] Avg episode reward: [(0, '-52.119')] [2022-07-09 08:49:34,139][26022] Updated weights on worker 0-0, policy_version 175284 (0.00097) [2022-07-09 08:49:35,737][26022] Updated weights on worker 0-0, policy_version 175294 (0.00084) [2022-07-09 08:49:37,594][26022] Updated weights on worker 0-0, policy_version 175304 (0.00092) [2022-07-09 08:49:38,791][25689] Fps is (10 sec: 5627.3, 60 sec: 5703.9, 300 sec: 5696.4). Total num frames: 179517440. Throughput: 0: 5126.3. Samples: 179513308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:38,791][25689] Avg episode reward: [(0, '-51.781')] [2022-07-09 08:49:39,548][26022] Updated weights on worker 0-0, policy_version 175314 (0.00087) [2022-07-09 08:49:41,262][26022] Updated weights on worker 0-0, policy_version 175324 (0.00073) [2022-07-09 08:49:43,199][26022] Updated weights on worker 0-0, policy_version 175334 (0.00079) [2022-07-09 08:49:43,808][25689] Fps is (10 sec: 5721.5, 60 sec: 5685.6, 300 sec: 5691.4). Total num frames: 179545088. Throughput: 0: 5991.8. Samples: 179547664. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 08:49:43,808][25689] Avg episode reward: [(0, '-51.711')] [2022-07-09 08:49:44,822][26022] Updated weights on worker 0-0, policy_version 175344 (0.00101) [2022-07-09 08:49:46,680][26022] Updated weights on worker 0-0, policy_version 175354 (0.00090) [2022-07-09 08:49:48,239][26022] Updated weights on worker 0-0, policy_version 175364 (0.00088) [2022-07-09 08:49:48,851][25689] Fps is (10 sec: 5802.0, 60 sec: 5710.9, 300 sec: 5698.6). Total num frames: 179575808. Throughput: 0: 6026.2. Samples: 179582346. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:49:48,851][25689] Avg episode reward: [(0, '-51.061')] [2022-07-09 08:49:50,317][26022] Updated weights on worker 0-0, policy_version 175374 (0.00084) [2022-07-09 08:49:51,932][26022] Updated weights on worker 0-0, policy_version 175384 (0.00087) [2022-07-09 08:49:53,811][26022] Updated weights on worker 0-0, policy_version 175394 (0.00089) [2022-07-09 08:49:53,861][25689] Fps is (10 sec: 5805.9, 60 sec: 5697.8, 300 sec: 5691.9). Total num frames: 179603456. Throughput: 0: 5159.1. Samples: 179599632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:49:53,861][25689] Avg episode reward: [(0, '-50.891')] [2022-07-09 08:49:55,475][26022] Updated weights on worker 0-0, policy_version 175404 (0.00088) [2022-07-09 08:49:57,404][26022] Updated weights on worker 0-0, policy_version 175414 (0.00094) [2022-07-09 08:49:58,872][25689] Fps is (10 sec: 5619.8, 60 sec: 5698.0, 300 sec: 5692.3). Total num frames: 179632128. Throughput: 0: 6018.9. Samples: 179634178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:49:58,873][25689] Avg episode reward: [(0, '-50.114')] [2022-07-09 08:49:58,990][26022] Updated weights on worker 0-0, policy_version 175424 (0.00102) [2022-07-09 08:50:00,795][26022] Updated weights on worker 0-0, policy_version 175434 (0.00084) [2022-07-09 08:50:02,984][26022] Updated weights on worker 0-0, policy_version 175444 (0.00957) [2022-07-09 08:50:03,879][25689] Fps is (10 sec: 5621.7, 60 sec: 5720.7, 300 sec: 5693.4). Total num frames: 179659776. Throughput: 0: 5926.9. Samples: 179666626. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:03,880][25689] Avg episode reward: [(0, '-50.492')] [2022-07-09 08:50:04,950][26022] Updated weights on worker 0-0, policy_version 175454 (0.00088) [2022-07-09 08:50:06,474][26022] Updated weights on worker 0-0, policy_version 175464 (0.00087) [2022-07-09 08:50:08,311][26022] Updated weights on worker 0-0, policy_version 175474 (0.00089) [2022-07-09 08:50:08,935][25689] Fps is (10 sec: 5597.1, 60 sec: 5705.4, 300 sec: 5700.6). Total num frames: 179688448. Throughput: 0: 5060.3. Samples: 179683978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:08,935][25689] Avg episode reward: [(0, '-51.071')] [2022-07-09 08:50:10,018][26022] Updated weights on worker 0-0, policy_version 175484 (0.00093) [2022-07-09 08:50:12,047][26022] Updated weights on worker 0-0, policy_version 175494 (0.00085) [2022-07-09 08:50:13,714][26022] Updated weights on worker 0-0, policy_version 175504 (0.00102) [2022-07-09 08:50:13,950][25689] Fps is (10 sec: 5795.9, 60 sec: 5708.0, 300 sec: 5701.0). Total num frames: 179718144. Throughput: 0: 5925.6. Samples: 179718670. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:13,950][25689] Avg episode reward: [(0, '-50.877')] [2022-07-09 08:50:15,353][26022] Updated weights on worker 0-0, policy_version 175514 (0.00094) [2022-07-09 08:50:17,342][26022] Updated weights on worker 0-0, policy_version 175524 (0.00089) [2022-07-09 08:50:18,649][26022] Updated weights on worker 0-0, policy_version 175534 (0.00091) [2022-07-09 08:50:18,955][25689] Fps is (10 sec: 5825.2, 60 sec: 5744.1, 300 sec: 5694.8). Total num frames: 179746816. Throughput: 0: 5928.1. Samples: 179753226. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:18,955][25689] Avg episode reward: [(0, '-51.592')] [2022-07-09 08:50:20,947][26022] Updated weights on worker 0-0, policy_version 175544 (0.00082) [2022-07-09 08:50:22,314][26022] Updated weights on worker 0-0, policy_version 175554 (0.00081) [2022-07-09 08:50:23,961][25689] Fps is (10 sec: 5523.2, 60 sec: 5678.8, 300 sec: 5691.9). Total num frames: 179773440. Throughput: 0: 5164.7. Samples: 179770344. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:23,962][25689] Avg episode reward: [(0, '-51.863')] [2022-07-09 08:50:24,350][26022] Updated weights on worker 0-0, policy_version 175564 (0.00092) [2022-07-09 08:50:26,195][26022] Updated weights on worker 0-0, policy_version 175574 (0.00086) [2022-07-09 08:50:28,012][26022] Updated weights on worker 0-0, policy_version 175584 (0.00089) [2022-07-09 08:50:28,996][25689] Fps is (10 sec: 5609.0, 60 sec: 5680.1, 300 sec: 5698.8). Total num frames: 179803136. Throughput: 0: 6005.6. Samples: 179804456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:28,996][25689] Avg episode reward: [(0, '-52.432')] [2022-07-09 08:50:29,889][26022] Updated weights on worker 0-0, policy_version 175594 (0.00097) [2022-07-09 08:50:31,593][26022] Updated weights on worker 0-0, policy_version 175604 (0.00086) [2022-07-09 08:50:33,467][26022] Updated weights on worker 0-0, policy_version 175614 (0.00088) [2022-07-09 08:50:34,004][25689] Fps is (10 sec: 5812.0, 60 sec: 5713.5, 300 sec: 5695.9). Total num frames: 179831808. Throughput: 0: 5988.6. Samples: 179838768. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:34,004][25689] Avg episode reward: [(0, '-52.493')] [2022-07-09 08:50:35,316][26022] Updated weights on worker 0-0, policy_version 175624 (0.00085) [2022-07-09 08:50:36,822][26022] Updated weights on worker 0-0, policy_version 175634 (0.00092) [2022-07-09 08:50:38,802][26022] Updated weights on worker 0-0, policy_version 175644 (0.00086) [2022-07-09 08:50:39,009][25689] Fps is (10 sec: 5726.6, 60 sec: 5696.6, 300 sec: 5699.5). Total num frames: 179860480. Throughput: 0: 5117.4. Samples: 179855854. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:39,010][25689] Avg episode reward: [(0, '-53.814')] [2022-07-09 08:50:40,525][26022] Updated weights on worker 0-0, policy_version 175654 (0.00088) [2022-07-09 08:50:40,816][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:50:40,841][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000175655_179870720.pth [2022-07-09 08:50:40,842][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000173653_177820672.pth [2022-07-09 08:50:42,453][26022] Updated weights on worker 0-0, policy_version 175664 (0.00085) [2022-07-09 08:50:44,023][25689] Fps is (10 sec: 5723.6, 60 sec: 5713.9, 300 sec: 5698.6). Total num frames: 179889152. Throughput: 0: 5971.2. Samples: 179890136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:44,024][25689] Avg episode reward: [(0, '-53.386')] [2022-07-09 08:50:44,155][26022] Updated weights on worker 0-0, policy_version 175674 (0.00093) [2022-07-09 08:50:46,022][26022] Updated weights on worker 0-0, policy_version 175684 (0.00094) [2022-07-09 08:50:47,644][26022] Updated weights on worker 0-0, policy_version 175694 (0.00088) [2022-07-09 08:50:49,076][25689] Fps is (10 sec: 5696.5, 60 sec: 5679.0, 300 sec: 5698.3). Total num frames: 179917824. Throughput: 0: 5983.5. Samples: 179924606. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:49,076][25689] Avg episode reward: [(0, '-53.137')] [2022-07-09 08:50:49,591][26022] Updated weights on worker 0-0, policy_version 175704 (0.00094) [2022-07-09 08:50:51,371][26022] Updated weights on worker 0-0, policy_version 175714 (0.00083) [2022-07-09 08:50:53,184][26022] Updated weights on worker 0-0, policy_version 175724 (0.00089) [2022-07-09 08:50:54,077][25689] Fps is (10 sec: 5703.4, 60 sec: 5696.8, 300 sec: 5698.3). Total num frames: 179946496. Throughput: 0: 5122.4. Samples: 179941590. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:54,078][25689] Avg episode reward: [(0, '-53.732')] [2022-07-09 08:50:54,971][26022] Updated weights on worker 0-0, policy_version 175734 (0.00095) [2022-07-09 08:50:56,710][26022] Updated weights on worker 0-0, policy_version 175744 (0.00092) [2022-07-09 08:50:58,596][26022] Updated weights on worker 0-0, policy_version 175754 (0.00079) [2022-07-09 08:50:59,087][25689] Fps is (10 sec: 5625.6, 60 sec: 5679.9, 300 sec: 5695.1). Total num frames: 179974144. Throughput: 0: 5982.0. Samples: 179975960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:50:59,088][25689] Avg episode reward: [(0, '-52.509')] [2022-07-09 08:51:00,181][26022] Updated weights on worker 0-0, policy_version 175764 (0.00085) [2022-07-09 08:51:02,537][26022] Updated weights on worker 0-0, policy_version 175774 (0.00087) [2022-07-09 08:51:04,108][25689] Fps is (10 sec: 5512.7, 60 sec: 5678.6, 300 sec: 5700.6). Total num frames: 180001792. Throughput: 0: 5865.3. Samples: 180007940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:04,108][25689] Avg episode reward: [(0, '-52.685')] [2022-07-09 08:51:04,304][26022] Updated weights on worker 0-0, policy_version 175784 (0.00086) [2022-07-09 08:51:06,140][26022] Updated weights on worker 0-0, policy_version 175794 (0.00092) [2022-07-09 08:51:07,874][26022] Updated weights on worker 0-0, policy_version 175804 (0.00080) [2022-07-09 08:51:09,143][25689] Fps is (10 sec: 5600.6, 60 sec: 5680.5, 300 sec: 5697.1). Total num frames: 180030464. Throughput: 0: 5006.0. Samples: 180025064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:09,144][25689] Avg episode reward: [(0, '-52.821')] [2022-07-09 08:51:09,699][26022] Updated weights on worker 0-0, policy_version 175814 (0.00087) [2022-07-09 08:51:11,503][26022] Updated weights on worker 0-0, policy_version 175824 (0.00092) [2022-07-09 08:51:13,565][26022] Updated weights on worker 0-0, policy_version 175834 (0.00092) [2022-07-09 08:51:14,148][25689] Fps is (10 sec: 5609.4, 60 sec: 5647.5, 300 sec: 5694.0). Total num frames: 180058112. Throughput: 0: 5868.0. Samples: 180059366. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:14,149][25689] Avg episode reward: [(0, '-53.295')] [2022-07-09 08:51:15,047][26022] Updated weights on worker 0-0, policy_version 175844 (0.00080) [2022-07-09 08:51:16,901][26022] Updated weights on worker 0-0, policy_version 175854 (0.00083) [2022-07-09 08:51:18,671][26022] Updated weights on worker 0-0, policy_version 175864 (0.00096) [2022-07-09 08:51:19,172][25689] Fps is (10 sec: 5615.7, 60 sec: 5645.7, 300 sec: 5697.3). Total num frames: 180086784. Throughput: 0: 5871.0. Samples: 180093880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:19,173][25689] Avg episode reward: [(0, '-52.262')] [2022-07-09 08:51:20,386][26022] Updated weights on worker 0-0, policy_version 175874 (0.00078) [2022-07-09 08:51:22,286][26022] Updated weights on worker 0-0, policy_version 175884 (0.00087) [2022-07-09 08:51:24,071][26022] Updated weights on worker 0-0, policy_version 175894 (0.00092) [2022-07-09 08:51:24,203][25689] Fps is (10 sec: 5703.2, 60 sec: 5677.4, 300 sec: 5691.3). Total num frames: 180115456. Throughput: 0: 5124.9. Samples: 180110924. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:24,204][25689] Avg episode reward: [(0, '-53.388')] [2022-07-09 08:51:25,935][26022] Updated weights on worker 0-0, policy_version 175904 (0.00084) [2022-07-09 08:51:27,611][26022] Updated weights on worker 0-0, policy_version 175914 (0.00088) [2022-07-09 08:51:29,270][25689] Fps is (10 sec: 5679.1, 60 sec: 5657.4, 300 sec: 5697.6). Total num frames: 180144128. Throughput: 0: 5947.6. Samples: 180144768. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:29,271][25689] Avg episode reward: [(0, '-53.773')] [2022-07-09 08:51:29,708][26022] Updated weights on worker 0-0, policy_version 175924 (0.00091) [2022-07-09 08:51:31,325][26022] Updated weights on worker 0-0, policy_version 175934 (0.00083) [2022-07-09 08:51:33,283][26022] Updated weights on worker 0-0, policy_version 175944 (0.00080) [2022-07-09 08:51:34,272][25689] Fps is (10 sec: 5796.8, 60 sec: 5674.9, 300 sec: 5701.2). Total num frames: 180173824. Throughput: 0: 5943.1. Samples: 180178962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:34,272][25689] Avg episode reward: [(0, '-53.380')] [2022-07-09 08:51:34,823][26022] Updated weights on worker 0-0, policy_version 175954 (0.00093) [2022-07-09 08:51:36,787][26022] Updated weights on worker 0-0, policy_version 175964 (0.00489) [2022-07-09 08:51:38,427][26022] Updated weights on worker 0-0, policy_version 175974 (0.00097) [2022-07-09 08:51:39,274][25689] Fps is (10 sec: 5731.5, 60 sec: 5658.2, 300 sec: 5695.3). Total num frames: 180201472. Throughput: 0: 5092.6. Samples: 180196254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:39,276][25689] Avg episode reward: [(0, '-52.932')] [2022-07-09 08:51:40,218][26022] Updated weights on worker 0-0, policy_version 175984 (0.00086) [2022-07-09 08:51:42,035][26022] Updated weights on worker 0-0, policy_version 175994 (0.00088) [2022-07-09 08:51:43,619][26022] Updated weights on worker 0-0, policy_version 176004 (0.00097) [2022-07-09 08:51:44,281][25689] Fps is (10 sec: 5524.2, 60 sec: 5641.8, 300 sec: 5692.5). Total num frames: 180229120. Throughput: 0: 5966.4. Samples: 180230722. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:44,287][25689] Avg episode reward: [(0, '-52.998')] [2022-07-09 08:51:45,704][26022] Updated weights on worker 0-0, policy_version 176014 (0.00089) [2022-07-09 08:51:47,455][26022] Updated weights on worker 0-0, policy_version 176024 (0.00080) [2022-07-09 08:51:49,135][26022] Updated weights on worker 0-0, policy_version 176034 (0.00089) [2022-07-09 08:51:49,383][25689] Fps is (10 sec: 5773.9, 60 sec: 5671.2, 300 sec: 5694.0). Total num frames: 180259840. Throughput: 0: 6003.1. Samples: 180265514. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:49,384][25689] Avg episode reward: [(0, '-53.196')] [2022-07-09 08:51:51,009][26022] Updated weights on worker 0-0, policy_version 176044 (0.00088) [2022-07-09 08:51:52,651][26022] Updated weights on worker 0-0, policy_version 176054 (0.00081) [2022-07-09 08:51:54,438][25689] Fps is (10 sec: 5847.5, 60 sec: 5666.2, 300 sec: 5693.3). Total num frames: 180288512. Throughput: 0: 5149.6. Samples: 180282812. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 08:51:54,439][25689] Avg episode reward: [(0, '-52.184')] [2022-07-09 08:51:54,571][26022] Updated weights on worker 0-0, policy_version 176064 (0.00090) [2022-07-09 08:51:56,203][26022] Updated weights on worker 0-0, policy_version 176074 (0.00095) [2022-07-09 08:51:57,950][26022] Updated weights on worker 0-0, policy_version 176084 (0.00086) [2022-07-09 08:51:59,459][25689] Fps is (10 sec: 5792.9, 60 sec: 5699.1, 300 sec: 5703.3). Total num frames: 180318208. Throughput: 0: 5990.0. Samples: 180317160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:51:59,460][25689] Avg episode reward: [(0, '-52.732')] [2022-07-09 08:51:59,750][26022] Updated weights on worker 0-0, policy_version 176094 (0.00089) [2022-07-09 08:52:01,828][26022] Updated weights on worker 0-0, policy_version 176104 (0.00087) [2022-07-09 08:52:03,897][26022] Updated weights on worker 0-0, policy_version 176114 (0.00085) [2022-07-09 08:52:04,503][25689] Fps is (10 sec: 5697.4, 60 sec: 5696.9, 300 sec: 5697.8). Total num frames: 180345856. Throughput: 0: 5871.6. Samples: 180349456. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:04,504][25689] Avg episode reward: [(0, '-52.289')] [2022-07-09 08:52:05,632][26022] Updated weights on worker 0-0, policy_version 176124 (0.00087) [2022-07-09 08:52:07,343][26022] Updated weights on worker 0-0, policy_version 176134 (0.00085) [2022-07-09 08:52:09,351][26022] Updated weights on worker 0-0, policy_version 176144 (0.00088) [2022-07-09 08:52:09,628][25689] Fps is (10 sec: 5438.0, 60 sec: 5671.5, 300 sec: 5689.0). Total num frames: 180373504. Throughput: 0: 4997.9. Samples: 180366692. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:09,628][25689] Avg episode reward: [(0, '-51.740')] [2022-07-09 08:52:10,935][26022] Updated weights on worker 0-0, policy_version 176154 (0.00078) [2022-07-09 08:52:12,775][26022] Updated weights on worker 0-0, policy_version 176164 (0.00086) [2022-07-09 08:52:14,455][26022] Updated weights on worker 0-0, policy_version 176174 (0.00090) [2022-07-09 08:52:14,660][25689] Fps is (10 sec: 5545.0, 60 sec: 5685.9, 300 sec: 5692.4). Total num frames: 180402176. Throughput: 0: 5832.9. Samples: 180400764. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:14,660][25689] Avg episode reward: [(0, '-51.507')] [2022-07-09 08:52:16,252][26022] Updated weights on worker 0-0, policy_version 176184 (0.00090) [2022-07-09 08:52:18,167][26022] Updated weights on worker 0-0, policy_version 176194 (0.00091) [2022-07-09 08:52:19,684][25689] Fps is (10 sec: 5702.3, 60 sec: 5685.9, 300 sec: 5690.4). Total num frames: 180430848. Throughput: 0: 5832.4. Samples: 180435118. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:19,684][25689] Avg episode reward: [(0, '-51.591')] [2022-07-09 08:52:19,856][26022] Updated weights on worker 0-0, policy_version 176204 (0.00083) [2022-07-09 08:52:21,955][26022] Updated weights on worker 0-0, policy_version 176214 (0.00089) [2022-07-09 08:52:23,404][26022] Updated weights on worker 0-0, policy_version 176224 (0.00090) [2022-07-09 08:52:24,737][25689] Fps is (10 sec: 5589.1, 60 sec: 5666.9, 300 sec: 5680.8). Total num frames: 180458496. Throughput: 0: 5928.8. Samples: 180469418. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:24,738][25689] Avg episode reward: [(0, '-51.829')] [2022-07-09 08:52:25,468][26022] Updated weights on worker 0-0, policy_version 176234 (0.00093) [2022-07-09 08:52:27,134][26022] Updated weights on worker 0-0, policy_version 176244 (0.00094) [2022-07-09 08:52:29,089][26022] Updated weights on worker 0-0, policy_version 176254 (0.00094) [2022-07-09 08:52:29,795][25689] Fps is (10 sec: 5671.3, 60 sec: 5684.6, 300 sec: 5691.7). Total num frames: 180488192. Throughput: 0: 5930.9. Samples: 180486304. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:29,796][25689] Avg episode reward: [(0, '-51.937')] [2022-07-09 08:52:30,758][26022] Updated weights on worker 0-0, policy_version 176264 (0.00089) [2022-07-09 08:52:32,569][26022] Updated weights on worker 0-0, policy_version 176274 (0.00087) [2022-07-09 08:52:34,331][26022] Updated weights on worker 0-0, policy_version 176284 (0.00086) [2022-07-09 08:52:34,845][25689] Fps is (10 sec: 5774.1, 60 sec: 5663.2, 300 sec: 5684.2). Total num frames: 180516864. Throughput: 0: 5924.1. Samples: 180520344. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:34,847][25689] Avg episode reward: [(0, '-53.320')] [2022-07-09 08:52:36,383][26022] Updated weights on worker 0-0, policy_version 176294 (0.00088) [2022-07-09 08:52:37,721][26022] Updated weights on worker 0-0, policy_version 176304 (0.00082) [2022-07-09 08:52:39,864][25689] Fps is (10 sec: 5593.7, 60 sec: 5661.7, 300 sec: 5688.7). Total num frames: 180544512. Throughput: 0: 5926.3. Samples: 180554710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:39,866][25689] Avg episode reward: [(0, '-52.979')] [2022-07-09 08:52:39,887][26022] Updated weights on worker 0-0, policy_version 176314 (0.00084) [2022-07-09 08:52:40,980][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:52:40,996][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000176322_180553728.pth [2022-07-09 08:52:40,997][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000174317_178500608.pth [2022-07-09 08:52:41,576][26022] Updated weights on worker 0-0, policy_version 176324 (0.00091) [2022-07-09 08:52:43,331][26022] Updated weights on worker 0-0, policy_version 176334 (0.00498) [2022-07-09 08:52:44,882][25689] Fps is (10 sec: 5611.6, 60 sec: 5677.6, 300 sec: 5680.4). Total num frames: 180573184. Throughput: 0: 5076.1. Samples: 180571676. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:44,882][25689] Avg episode reward: [(0, '-53.113')] [2022-07-09 08:52:45,241][26022] Updated weights on worker 0-0, policy_version 176344 (0.00082) [2022-07-09 08:52:46,998][26022] Updated weights on worker 0-0, policy_version 176354 (0.00088) [2022-07-09 08:52:48,758][26022] Updated weights on worker 0-0, policy_version 176364 (0.00091) [2022-07-09 08:52:49,975][25689] Fps is (10 sec: 5772.6, 60 sec: 5661.5, 300 sec: 5678.9). Total num frames: 180602880. Throughput: 0: 5934.9. Samples: 180606070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:49,975][25689] Avg episode reward: [(0, '-52.535')] [2022-07-09 08:52:50,586][26022] Updated weights on worker 0-0, policy_version 176374 (0.00089) [2022-07-09 08:52:52,122][26022] Updated weights on worker 0-0, policy_version 176384 (0.00086) [2022-07-09 08:52:54,172][26022] Updated weights on worker 0-0, policy_version 176394 (0.00083) [2022-07-09 08:52:55,023][25689] Fps is (10 sec: 5957.1, 60 sec: 5695.9, 300 sec: 5688.5). Total num frames: 180633600. Throughput: 0: 5970.4. Samples: 180640816. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:52:55,024][25689] Avg episode reward: [(0, '-52.759')] [2022-07-09 08:52:55,899][26022] Updated weights on worker 0-0, policy_version 176404 (0.00089) [2022-07-09 08:52:57,716][26022] Updated weights on worker 0-0, policy_version 176414 (0.00090) [2022-07-09 08:52:59,506][26022] Updated weights on worker 0-0, policy_version 176424 (0.00088) [2022-07-09 08:53:00,059][25689] Fps is (10 sec: 5686.7, 60 sec: 5643.9, 300 sec: 5685.2). Total num frames: 180660224. Throughput: 0: 5110.1. Samples: 180657910. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:00,059][25689] Avg episode reward: [(0, '-53.276')] [2022-07-09 08:53:01,091][26022] Updated weights on worker 0-0, policy_version 176434 (0.00089) [2022-07-09 08:53:03,574][26022] Updated weights on worker 0-0, policy_version 176444 (0.00091) [2022-07-09 08:53:05,148][25689] Fps is (10 sec: 5259.2, 60 sec: 5622.8, 300 sec: 5682.0). Total num frames: 180686848. Throughput: 0: 5822.6. Samples: 180689680. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:05,150][25689] Avg episode reward: [(0, '-52.878')] [2022-07-09 08:53:05,424][26022] Updated weights on worker 0-0, policy_version 176454 (0.00084) [2022-07-09 08:53:07,067][26022] Updated weights on worker 0-0, policy_version 176464 (0.00086) [2022-07-09 08:53:08,980][26022] Updated weights on worker 0-0, policy_version 176474 (0.00091) [2022-07-09 08:53:10,202][25689] Fps is (10 sec: 5552.6, 60 sec: 5663.1, 300 sec: 5677.8). Total num frames: 180716544. Throughput: 0: 5819.3. Samples: 180723776. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:10,202][25689] Avg episode reward: [(0, '-52.433')] [2022-07-09 08:53:10,603][26022] Updated weights on worker 0-0, policy_version 176484 (0.00082) [2022-07-09 08:53:12,679][26022] Updated weights on worker 0-0, policy_version 176494 (0.00085) [2022-07-09 08:53:14,267][26022] Updated weights on worker 0-0, policy_version 176504 (0.00093) [2022-07-09 08:53:15,207][25689] Fps is (10 sec: 5701.0, 60 sec: 5648.8, 300 sec: 5675.2). Total num frames: 180744192. Throughput: 0: 4952.4. Samples: 180740772. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:15,207][25689] Avg episode reward: [(0, '-51.973')] [2022-07-09 08:53:15,967][26022] Updated weights on worker 0-0, policy_version 176514 (0.00090) [2022-07-09 08:53:18,006][26022] Updated weights on worker 0-0, policy_version 176524 (0.00088) [2022-07-09 08:53:19,829][26022] Updated weights on worker 0-0, policy_version 176534 (0.00085) [2022-07-09 08:53:20,217][25689] Fps is (10 sec: 5623.4, 60 sec: 5650.0, 300 sec: 5675.3). Total num frames: 180772864. Throughput: 0: 5807.9. Samples: 180774988. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:20,217][25689] Avg episode reward: [(0, '-51.236')] [2022-07-09 08:53:21,551][26022] Updated weights on worker 0-0, policy_version 176544 (0.00087) [2022-07-09 08:53:23,299][26022] Updated weights on worker 0-0, policy_version 176554 (0.00089) [2022-07-09 08:53:25,215][26022] Updated weights on worker 0-0, policy_version 176564 (0.00090) [2022-07-09 08:53:25,235][25689] Fps is (10 sec: 5718.4, 60 sec: 5670.3, 300 sec: 5675.8). Total num frames: 180801536. Throughput: 0: 5958.7. Samples: 180809370. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:25,235][25689] Avg episode reward: [(0, '-50.860')] [2022-07-09 08:53:26,936][26022] Updated weights on worker 0-0, policy_version 176574 (0.00087) [2022-07-09 08:53:28,749][26022] Updated weights on worker 0-0, policy_version 176584 (0.00087) [2022-07-09 08:53:30,289][25689] Fps is (10 sec: 5795.1, 60 sec: 5670.6, 300 sec: 5678.3). Total num frames: 180831232. Throughput: 0: 5098.3. Samples: 180826186. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:30,290][25689] Avg episode reward: [(0, '-50.627')] [2022-07-09 08:53:30,558][26022] Updated weights on worker 0-0, policy_version 176594 (0.00103) [2022-07-09 08:53:32,399][26022] Updated weights on worker 0-0, policy_version 176604 (0.00082) [2022-07-09 08:53:34,159][26022] Updated weights on worker 0-0, policy_version 176614 (0.00097) [2022-07-09 08:53:35,317][25689] Fps is (10 sec: 5789.1, 60 sec: 5672.7, 300 sec: 5681.5). Total num frames: 180859904. Throughput: 0: 5956.1. Samples: 180860552. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:35,318][25689] Avg episode reward: [(0, '-50.369')] [2022-07-09 08:53:36,055][26022] Updated weights on worker 0-0, policy_version 176624 (0.00086) [2022-07-09 08:53:37,684][26022] Updated weights on worker 0-0, policy_version 176634 (0.00087) [2022-07-09 08:53:39,602][26022] Updated weights on worker 0-0, policy_version 176644 (0.00088) [2022-07-09 08:53:40,332][25689] Fps is (10 sec: 5608.1, 60 sec: 5673.1, 300 sec: 5677.8). Total num frames: 180887552. Throughput: 0: 5952.5. Samples: 180894720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:40,332][25689] Avg episode reward: [(0, '-51.866')] [2022-07-09 08:53:41,351][26022] Updated weights on worker 0-0, policy_version 176654 (0.00095) [2022-07-09 08:53:43,320][26022] Updated weights on worker 0-0, policy_version 176664 (0.00084) [2022-07-09 08:53:45,003][26022] Updated weights on worker 0-0, policy_version 176674 (0.00086) [2022-07-09 08:53:45,357][25689] Fps is (10 sec: 5711.7, 60 sec: 5689.3, 300 sec: 5679.8). Total num frames: 180917248. Throughput: 0: 5090.6. Samples: 180911804. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:45,358][25689] Avg episode reward: [(0, '-51.554')] [2022-07-09 08:53:46,818][26022] Updated weights on worker 0-0, policy_version 176684 (0.00087) [2022-07-09 08:53:48,371][26022] Updated weights on worker 0-0, policy_version 176694 (0.00105) [2022-07-09 08:53:50,366][26022] Updated weights on worker 0-0, policy_version 176704 (0.00085) [2022-07-09 08:53:50,463][25689] Fps is (10 sec: 5660.3, 60 sec: 5654.3, 300 sec: 5675.4). Total num frames: 180944896. Throughput: 0: 5935.9. Samples: 180945934. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:50,463][25689] Avg episode reward: [(0, '-51.857')] [2022-07-09 08:53:52,176][26022] Updated weights on worker 0-0, policy_version 176714 (0.00081) [2022-07-09 08:53:54,007][26022] Updated weights on worker 0-0, policy_version 176724 (0.00092) [2022-07-09 08:53:55,471][25689] Fps is (10 sec: 5467.1, 60 sec: 5607.2, 300 sec: 5672.0). Total num frames: 180972544. Throughput: 0: 5926.1. Samples: 180979988. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:53:55,473][25689] Avg episode reward: [(0, '-51.510')] [2022-07-09 08:53:55,815][26022] Updated weights on worker 0-0, policy_version 176734 (0.00086) [2022-07-09 08:53:57,646][26022] Updated weights on worker 0-0, policy_version 176744 (0.00088) [2022-07-09 08:53:59,336][26022] Updated weights on worker 0-0, policy_version 176754 (0.00122) [2022-07-09 08:54:00,482][25689] Fps is (10 sec: 5723.4, 60 sec: 5660.4, 300 sec: 5683.4). Total num frames: 181002240. Throughput: 0: 5077.0. Samples: 180997022. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:54:00,483][25689] Avg episode reward: [(0, '-51.861')] [2022-07-09 08:54:01,173][26022] Updated weights on worker 0-0, policy_version 176764 (0.00819) [2022-07-09 08:54:03,387][26022] Updated weights on worker 0-0, policy_version 176774 (0.00074) [2022-07-09 08:54:05,035][26022] Updated weights on worker 0-0, policy_version 176784 (0.00089) [2022-07-09 08:54:05,517][25689] Fps is (10 sec: 5606.5, 60 sec: 5665.5, 300 sec: 5673.8). Total num frames: 181028864. Throughput: 0: 5827.0. Samples: 181029276. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 08:54:05,519][25689] Avg episode reward: [(0, '-51.758')] [2022-07-09 08:54:06,953][26022] Updated weights on worker 0-0, policy_version 176794 (0.00063) [2022-07-09 08:54:08,692][26022] Updated weights on worker 0-0, policy_version 176804 (0.00085) [2022-07-09 08:54:10,595][25689] Fps is (10 sec: 5467.5, 60 sec: 5646.2, 300 sec: 5669.7). Total num frames: 181057536. Throughput: 0: 5829.5. Samples: 181063298. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:10,596][25689] Avg episode reward: [(0, '-52.526')] [2022-07-09 08:54:10,608][26022] Updated weights on worker 0-0, policy_version 176814 (0.00104) [2022-07-09 08:54:12,375][26022] Updated weights on worker 0-0, policy_version 176824 (0.00081) [2022-07-09 08:54:14,118][26022] Updated weights on worker 0-0, policy_version 176834 (0.00090) [2022-07-09 08:54:15,636][25689] Fps is (10 sec: 5666.5, 60 sec: 5659.7, 300 sec: 5676.3). Total num frames: 181086208. Throughput: 0: 4995.8. Samples: 181080730. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:15,637][25689] Avg episode reward: [(0, '-52.746')] [2022-07-09 08:54:15,952][26022] Updated weights on worker 0-0, policy_version 176844 (0.00090) [2022-07-09 08:54:17,576][26022] Updated weights on worker 0-0, policy_version 176854 (0.00086) [2022-07-09 08:54:19,475][26022] Updated weights on worker 0-0, policy_version 176864 (0.00080) [2022-07-09 08:54:20,677][25689] Fps is (10 sec: 5688.1, 60 sec: 5656.9, 300 sec: 5669.3). Total num frames: 181114880. Throughput: 0: 5863.0. Samples: 181115424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:20,677][25689] Avg episode reward: [(0, '-52.871')] [2022-07-09 08:54:21,225][26022] Updated weights on worker 0-0, policy_version 176874 (0.00084) [2022-07-09 08:54:22,906][26022] Updated weights on worker 0-0, policy_version 176884 (0.00086) [2022-07-09 08:54:24,734][26022] Updated weights on worker 0-0, policy_version 176894 (0.00091) [2022-07-09 08:54:25,692][25689] Fps is (10 sec: 5804.6, 60 sec: 5674.1, 300 sec: 5669.9). Total num frames: 181144576. Throughput: 0: 5977.9. Samples: 181149882. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:25,692][25689] Avg episode reward: [(0, '-52.881')] [2022-07-09 08:54:26,570][26022] Updated weights on worker 0-0, policy_version 176904 (0.00081) [2022-07-09 08:54:28,213][26022] Updated weights on worker 0-0, policy_version 176914 (0.00089) [2022-07-09 08:54:30,201][26022] Updated weights on worker 0-0, policy_version 176924 (0.00085) [2022-07-09 08:54:30,731][25689] Fps is (10 sec: 5703.5, 60 sec: 5641.7, 300 sec: 5672.7). Total num frames: 181172224. Throughput: 0: 5157.3. Samples: 181167144. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:30,731][25689] Avg episode reward: [(0, '-53.539')] [2022-07-09 08:54:31,783][26022] Updated weights on worker 0-0, policy_version 176934 (0.00087) [2022-07-09 08:54:33,780][26022] Updated weights on worker 0-0, policy_version 176944 (0.00087) [2022-07-09 08:54:35,495][26022] Updated weights on worker 0-0, policy_version 176954 (0.00090) [2022-07-09 08:54:35,738][25689] Fps is (10 sec: 5708.0, 60 sec: 5660.5, 300 sec: 5672.7). Total num frames: 181201920. Throughput: 0: 6008.9. Samples: 181201520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:35,739][25689] Avg episode reward: [(0, '-53.297')] [2022-07-09 08:54:37,401][26022] Updated weights on worker 0-0, policy_version 176964 (0.00084) [2022-07-09 08:54:39,004][26022] Updated weights on worker 0-0, policy_version 176974 (0.00085) [2022-07-09 08:54:40,776][25689] Fps is (10 sec: 5810.4, 60 sec: 5675.3, 300 sec: 5675.7). Total num frames: 181230592. Throughput: 0: 5989.1. Samples: 181235804. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:40,777][25689] Avg episode reward: [(0, '-52.965')] [2022-07-09 08:54:40,904][26022] Updated weights on worker 0-0, policy_version 176984 (0.00092) [2022-07-09 08:54:41,036][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:54:41,049][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000176985_181232640.pth [2022-07-09 08:54:41,049][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000174987_179186688.pth [2022-07-09 08:54:42,757][26022] Updated weights on worker 0-0, policy_version 176994 (0.00081) [2022-07-09 08:54:44,435][26022] Updated weights on worker 0-0, policy_version 177004 (0.00092) [2022-07-09 08:54:45,788][25689] Fps is (10 sec: 5604.0, 60 sec: 5642.6, 300 sec: 5665.9). Total num frames: 181258240. Throughput: 0: 5130.7. Samples: 181252990. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:45,789][25689] Avg episode reward: [(0, '-52.218')] [2022-07-09 08:54:46,288][26022] Updated weights on worker 0-0, policy_version 177014 (0.00089) [2022-07-09 08:54:48,087][26022] Updated weights on worker 0-0, policy_version 177024 (0.00094) [2022-07-09 08:54:50,043][26022] Updated weights on worker 0-0, policy_version 177034 (0.00089) [2022-07-09 08:54:50,839][25689] Fps is (10 sec: 5698.6, 60 sec: 5681.7, 300 sec: 5672.1). Total num frames: 181287936. Throughput: 0: 5970.0. Samples: 181287190. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:50,840][25689] Avg episode reward: [(0, '-53.190')] [2022-07-09 08:54:51,719][26022] Updated weights on worker 0-0, policy_version 177044 (0.00088) [2022-07-09 08:54:53,504][26022] Updated weights on worker 0-0, policy_version 177054 (0.00089) [2022-07-09 08:54:55,156][26022] Updated weights on worker 0-0, policy_version 177064 (0.00085) [2022-07-09 08:54:55,864][25689] Fps is (10 sec: 5793.0, 60 sec: 5697.1, 300 sec: 5671.8). Total num frames: 181316608. Throughput: 0: 5976.9. Samples: 181321808. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:54:55,864][25689] Avg episode reward: [(0, '-52.523')] [2022-07-09 08:54:57,022][26022] Updated weights on worker 0-0, policy_version 177074 (0.01278) [2022-07-09 08:54:58,725][26022] Updated weights on worker 0-0, policy_version 177084 (0.00083) [2022-07-09 08:55:00,636][26022] Updated weights on worker 0-0, policy_version 177094 (0.00089) [2022-07-09 08:55:00,867][25689] Fps is (10 sec: 5718.7, 60 sec: 5680.9, 300 sec: 5675.3). Total num frames: 181345280. Throughput: 0: 5144.2. Samples: 181339152. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:00,867][25689] Avg episode reward: [(0, '-52.351')] [2022-07-09 08:55:02,706][26022] Updated weights on worker 0-0, policy_version 177104 (0.00088) [2022-07-09 08:55:04,399][26022] Updated weights on worker 0-0, policy_version 177114 (0.00096) [2022-07-09 08:55:05,909][25689] Fps is (10 sec: 5606.6, 60 sec: 5697.1, 300 sec: 5672.1). Total num frames: 181372928. Throughput: 0: 5897.4. Samples: 181371650. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:05,910][25689] Avg episode reward: [(0, '-52.079')] [2022-07-09 08:55:06,209][26022] Updated weights on worker 0-0, policy_version 177124 (0.00090) [2022-07-09 08:55:08,091][26022] Updated weights on worker 0-0, policy_version 177134 (0.00086) [2022-07-09 08:55:09,855][26022] Updated weights on worker 0-0, policy_version 177144 (0.00103) [2022-07-09 08:55:10,950][25689] Fps is (10 sec: 5686.8, 60 sec: 5717.6, 300 sec: 5671.6). Total num frames: 181402624. Throughput: 0: 5891.3. Samples: 181405672. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:10,951][25689] Avg episode reward: [(0, '-52.955')] [2022-07-09 08:55:11,642][26022] Updated weights on worker 0-0, policy_version 177154 (0.00052) [2022-07-09 08:55:13,438][26022] Updated weights on worker 0-0, policy_version 177164 (0.00083) [2022-07-09 08:55:15,312][26022] Updated weights on worker 0-0, policy_version 177174 (0.00086) [2022-07-09 08:55:15,973][25689] Fps is (10 sec: 5596.4, 60 sec: 5685.5, 300 sec: 5664.4). Total num frames: 181429248. Throughput: 0: 5033.2. Samples: 181423020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:15,973][25689] Avg episode reward: [(0, '-52.684')] [2022-07-09 08:55:17,014][26022] Updated weights on worker 0-0, policy_version 177184 (0.00083) [2022-07-09 08:55:18,935][26022] Updated weights on worker 0-0, policy_version 177194 (0.00089) [2022-07-09 08:55:20,680][26022] Updated weights on worker 0-0, policy_version 177204 (0.00098) [2022-07-09 08:55:20,974][25689] Fps is (10 sec: 5618.7, 60 sec: 5706.1, 300 sec: 5674.9). Total num frames: 181458944. Throughput: 0: 5871.8. Samples: 181457218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:20,975][25689] Avg episode reward: [(0, '-52.613')] [2022-07-09 08:55:22,514][26022] Updated weights on worker 0-0, policy_version 177214 (0.00047) [2022-07-09 08:55:24,289][26022] Updated weights on worker 0-0, policy_version 177224 (0.00091) [2022-07-09 08:55:25,992][25689] Fps is (10 sec: 5723.2, 60 sec: 5671.9, 300 sec: 5668.3). Total num frames: 181486592. Throughput: 0: 5961.1. Samples: 181491368. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:25,994][25689] Avg episode reward: [(0, '-53.077')] [2022-07-09 08:55:26,147][26022] Updated weights on worker 0-0, policy_version 177234 (0.00091) [2022-07-09 08:55:27,712][26022] Updated weights on worker 0-0, policy_version 177244 (0.00093) [2022-07-09 08:55:29,749][26022] Updated weights on worker 0-0, policy_version 177254 (0.00086) [2022-07-09 08:55:31,065][25689] Fps is (10 sec: 5682.9, 60 sec: 5702.7, 300 sec: 5670.5). Total num frames: 181516288. Throughput: 0: 5106.6. Samples: 181508386. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:31,065][25689] Avg episode reward: [(0, '-52.584')] [2022-07-09 08:55:31,539][26022] Updated weights on worker 0-0, policy_version 177264 (0.00092) [2022-07-09 08:55:33,151][26022] Updated weights on worker 0-0, policy_version 177274 (0.00086) [2022-07-09 08:55:35,126][26022] Updated weights on worker 0-0, policy_version 177284 (0.00084) [2022-07-09 08:55:36,073][25689] Fps is (10 sec: 5790.1, 60 sec: 5685.6, 300 sec: 5670.4). Total num frames: 181544960. Throughput: 0: 5945.6. Samples: 181542528. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:36,074][25689] Avg episode reward: [(0, '-52.501')] [2022-07-09 08:55:36,837][26022] Updated weights on worker 0-0, policy_version 177294 (0.00094) [2022-07-09 08:55:38,792][26022] Updated weights on worker 0-0, policy_version 177304 (0.00053) [2022-07-09 08:55:40,467][26022] Updated weights on worker 0-0, policy_version 177314 (0.00101) [2022-07-09 08:55:41,083][25689] Fps is (10 sec: 5621.5, 60 sec: 5671.3, 300 sec: 5667.0). Total num frames: 181572608. Throughput: 0: 5947.5. Samples: 181576818. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:41,084][25689] Avg episode reward: [(0, '-52.232')] [2022-07-09 08:55:42,168][26022] Updated weights on worker 0-0, policy_version 177324 (0.00092) [2022-07-09 08:55:44,067][26022] Updated weights on worker 0-0, policy_version 177334 (0.00089) [2022-07-09 08:55:45,787][26022] Updated weights on worker 0-0, policy_version 177344 (0.00091) [2022-07-09 08:55:46,095][25689] Fps is (10 sec: 5619.6, 60 sec: 5688.2, 300 sec: 5667.8). Total num frames: 181601280. Throughput: 0: 5960.5. Samples: 181611190. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:46,096][25689] Avg episode reward: [(0, '-52.852')] [2022-07-09 08:55:47,599][26022] Updated weights on worker 0-0, policy_version 177354 (0.00082) [2022-07-09 08:55:49,465][26022] Updated weights on worker 0-0, policy_version 177364 (0.00084) [2022-07-09 08:55:51,117][26022] Updated weights on worker 0-0, policy_version 177374 (0.00093) [2022-07-09 08:55:51,214][25689] Fps is (10 sec: 5761.6, 60 sec: 5681.9, 300 sec: 5669.0). Total num frames: 181630976. Throughput: 0: 5945.8. Samples: 181628190. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:51,214][25689] Avg episode reward: [(0, '-52.538')] [2022-07-09 08:55:53,133][26022] Updated weights on worker 0-0, policy_version 177384 (0.00090) [2022-07-09 08:55:54,957][26022] Updated weights on worker 0-0, policy_version 177394 (0.00088) [2022-07-09 08:55:56,286][25689] Fps is (10 sec: 5626.8, 60 sec: 5660.4, 300 sec: 5667.8). Total num frames: 181658624. Throughput: 0: 5921.7. Samples: 181662226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:55:56,287][25689] Avg episode reward: [(0, '-52.179')] [2022-07-09 08:55:56,555][26022] Updated weights on worker 0-0, policy_version 177404 (0.00089) [2022-07-09 08:55:58,623][26022] Updated weights on worker 0-0, policy_version 177414 (0.00080) [2022-07-09 08:56:00,078][26022] Updated weights on worker 0-0, policy_version 177424 (0.00086) [2022-07-09 08:56:01,378][25689] Fps is (10 sec: 5541.3, 60 sec: 5652.1, 300 sec: 5669.9). Total num frames: 181687296. Throughput: 0: 5890.2. Samples: 181696356. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:56:01,378][25689] Avg episode reward: [(0, '-52.453')] [2022-07-09 08:56:02,492][26022] Updated weights on worker 0-0, policy_version 177434 (0.00079) [2022-07-09 08:56:04,142][26022] Updated weights on worker 0-0, policy_version 177444 (0.00092) [2022-07-09 08:56:05,967][26022] Updated weights on worker 0-0, policy_version 177454 (0.00088) [2022-07-09 08:56:06,465][25689] Fps is (10 sec: 5533.0, 60 sec: 5647.9, 300 sec: 5665.5). Total num frames: 181714944. Throughput: 0: 4920.6. Samples: 181711424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:56:06,466][25689] Avg episode reward: [(0, '-52.975')] [2022-07-09 08:56:07,885][26022] Updated weights on worker 0-0, policy_version 177464 (0.00088) [2022-07-09 08:56:09,651][26022] Updated weights on worker 0-0, policy_version 177474 (0.00087) [2022-07-09 08:56:11,301][26022] Updated weights on worker 0-0, policy_version 177484 (0.00089) [2022-07-09 08:56:11,576][25689] Fps is (10 sec: 5622.6, 60 sec: 5641.4, 300 sec: 5670.4). Total num frames: 181744640. Throughput: 0: 5762.6. Samples: 181745530. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:56:11,577][25689] Avg episode reward: [(0, '-52.066')] [2022-07-09 08:56:13,106][26022] Updated weights on worker 0-0, policy_version 177494 (0.00508) [2022-07-09 08:56:15,017][26022] Updated weights on worker 0-0, policy_version 177504 (0.00086) [2022-07-09 08:56:16,614][25689] Fps is (10 sec: 5751.3, 60 sec: 5673.8, 300 sec: 5670.1). Total num frames: 181773312. Throughput: 0: 5820.1. Samples: 181780534. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 08:56:16,614][25689] Avg episode reward: [(0, '-51.905')] [2022-07-09 08:56:16,810][26022] Updated weights on worker 0-0, policy_version 177514 (0.00080) [2022-07-09 08:56:18,546][26022] Updated weights on worker 0-0, policy_version 177524 (0.00083) [2022-07-09 08:56:20,332][26022] Updated weights on worker 0-0, policy_version 177534 (0.00094) [2022-07-09 08:56:21,621][25689] Fps is (10 sec: 5607.0, 60 sec: 5639.5, 300 sec: 5667.1). Total num frames: 181800960. Throughput: 0: 5007.4. Samples: 181797722. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:21,621][25689] Avg episode reward: [(0, '-52.087')] [2022-07-09 08:56:21,995][26022] Updated weights on worker 0-0, policy_version 177544 (0.00083) [2022-07-09 08:56:23,839][26022] Updated weights on worker 0-0, policy_version 177554 (0.00084) [2022-07-09 08:56:25,717][26022] Updated weights on worker 0-0, policy_version 177564 (0.00092) [2022-07-09 08:56:26,698][25689] Fps is (10 sec: 5788.4, 60 sec: 5684.6, 300 sec: 5673.8). Total num frames: 181831680. Throughput: 0: 5970.1. Samples: 181832212. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:26,698][25689] Avg episode reward: [(0, '-51.865')] [2022-07-09 08:56:27,463][26022] Updated weights on worker 0-0, policy_version 177574 (0.00088) [2022-07-09 08:56:29,182][26022] Updated weights on worker 0-0, policy_version 177584 (0.00072) [2022-07-09 08:56:31,151][26022] Updated weights on worker 0-0, policy_version 177594 (0.00086) [2022-07-09 08:56:31,760][25689] Fps is (10 sec: 5756.6, 60 sec: 5651.8, 300 sec: 5665.8). Total num frames: 181859328. Throughput: 0: 5994.8. Samples: 181866530. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:31,762][25689] Avg episode reward: [(0, '-50.727')] [2022-07-09 08:56:32,795][26022] Updated weights on worker 0-0, policy_version 177604 (0.00089) [2022-07-09 08:56:34,661][26022] Updated weights on worker 0-0, policy_version 177614 (0.00093) [2022-07-09 08:56:36,381][26022] Updated weights on worker 0-0, policy_version 177624 (0.00088) [2022-07-09 08:56:36,766][25689] Fps is (10 sec: 5695.1, 60 sec: 5668.9, 300 sec: 5672.6). Total num frames: 181889024. Throughput: 0: 5129.3. Samples: 181883902. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:36,768][25689] Avg episode reward: [(0, '-50.923')] [2022-07-09 08:56:38,236][26022] Updated weights on worker 0-0, policy_version 177634 (0.00084) [2022-07-09 08:56:39,980][26022] Updated weights on worker 0-0, policy_version 177644 (0.00083) [2022-07-09 08:56:41,115][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:56:41,130][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000177651_181914624.pth [2022-07-09 08:56:41,131][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000175655_179870720.pth [2022-07-09 08:56:41,622][26022] Updated weights on worker 0-0, policy_version 177654 (0.00090) [2022-07-09 08:56:41,775][25689] Fps is (10 sec: 5828.0, 60 sec: 5685.9, 300 sec: 5676.0). Total num frames: 181917696. Throughput: 0: 5972.3. Samples: 181918090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:41,777][25689] Avg episode reward: [(0, '-50.951')] [2022-07-09 08:56:43,580][26022] Updated weights on worker 0-0, policy_version 177664 (0.00085) [2022-07-09 08:56:45,273][26022] Updated weights on worker 0-0, policy_version 177674 (0.00088) [2022-07-09 08:56:46,793][25689] Fps is (10 sec: 5616.9, 60 sec: 5668.4, 300 sec: 5667.3). Total num frames: 181945344. Throughput: 0: 6003.7. Samples: 181952862. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:46,794][25689] Avg episode reward: [(0, '-51.234')] [2022-07-09 08:56:47,122][26022] Updated weights on worker 0-0, policy_version 177684 (0.00082) [2022-07-09 08:56:48,745][26022] Updated weights on worker 0-0, policy_version 177694 (0.00090) [2022-07-09 08:56:50,748][26022] Updated weights on worker 0-0, policy_version 177704 (0.00088) [2022-07-09 08:56:51,920][25689] Fps is (10 sec: 5753.4, 60 sec: 5684.5, 300 sec: 5672.8). Total num frames: 181976064. Throughput: 0: 5130.2. Samples: 181969954. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:51,923][25689] Avg episode reward: [(0, '-50.470')] [2022-07-09 08:56:52,462][26022] Updated weights on worker 0-0, policy_version 177714 (0.00089) [2022-07-09 08:56:54,265][26022] Updated weights on worker 0-0, policy_version 177724 (0.00086) [2022-07-09 08:56:55,878][26022] Updated weights on worker 0-0, policy_version 177734 (0.00083) [2022-07-09 08:56:56,951][25689] Fps is (10 sec: 5645.2, 60 sec: 5671.5, 300 sec: 5662.3). Total num frames: 182002688. Throughput: 0: 5982.8. Samples: 182004664. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:56:56,952][25689] Avg episode reward: [(0, '-50.267')] [2022-07-09 08:56:57,723][26022] Updated weights on worker 0-0, policy_version 177744 (0.00097) [2022-07-09 08:56:59,451][26022] Updated weights on worker 0-0, policy_version 177754 (0.00084) [2022-07-09 08:57:01,280][26022] Updated weights on worker 0-0, policy_version 177764 (0.00087) [2022-07-09 08:57:01,953][25689] Fps is (10 sec: 5715.8, 60 sec: 5713.7, 300 sec: 5673.4). Total num frames: 182033408. Throughput: 0: 6009.7. Samples: 182039350. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:01,953][25689] Avg episode reward: [(0, '-50.942')] [2022-07-09 08:57:03,463][26022] Updated weights on worker 0-0, policy_version 177774 (0.00087) [2022-07-09 08:57:05,098][26022] Updated weights on worker 0-0, policy_version 177784 (0.00097) [2022-07-09 08:57:07,007][25689] Fps is (10 sec: 5702.8, 60 sec: 5700.0, 300 sec: 5671.3). Total num frames: 182060032. Throughput: 0: 5032.8. Samples: 182054592. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:07,007][25689] Avg episode reward: [(0, '-50.996')] [2022-07-09 08:57:07,193][26022] Updated weights on worker 0-0, policy_version 177794 (0.00082) [2022-07-09 08:57:08,666][26022] Updated weights on worker 0-0, policy_version 177804 (0.00080) [2022-07-09 08:57:10,737][26022] Updated weights on worker 0-0, policy_version 177814 (0.00617) [2022-07-09 08:57:12,070][25689] Fps is (10 sec: 5566.9, 60 sec: 5704.5, 300 sec: 5674.2). Total num frames: 182089728. Throughput: 0: 5904.3. Samples: 182088922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:12,071][25689] Avg episode reward: [(0, '-50.399')] [2022-07-09 08:57:12,275][26022] Updated weights on worker 0-0, policy_version 177824 (0.00078) [2022-07-09 08:57:14,190][26022] Updated weights on worker 0-0, policy_version 177834 (0.00085) [2022-07-09 08:57:15,763][26022] Updated weights on worker 0-0, policy_version 177844 (0.00088) [2022-07-09 08:57:17,073][25689] Fps is (10 sec: 5798.6, 60 sec: 5707.8, 300 sec: 5674.6). Total num frames: 182118400. Throughput: 0: 5914.6. Samples: 182123674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:17,073][25689] Avg episode reward: [(0, '-50.709')] [2022-07-09 08:57:17,832][26022] Updated weights on worker 0-0, policy_version 177854 (0.00080) [2022-07-09 08:57:19,360][26022] Updated weights on worker 0-0, policy_version 177864 (0.00090) [2022-07-09 08:57:21,317][26022] Updated weights on worker 0-0, policy_version 177874 (0.00087) [2022-07-09 08:57:22,083][25689] Fps is (10 sec: 5829.2, 60 sec: 5741.4, 300 sec: 5682.3). Total num frames: 182148096. Throughput: 0: 5044.9. Samples: 182140904. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:22,083][25689] Avg episode reward: [(0, '-51.339')] [2022-07-09 08:57:23,005][26022] Updated weights on worker 0-0, policy_version 177884 (0.00080) [2022-07-09 08:57:24,851][26022] Updated weights on worker 0-0, policy_version 177894 (0.00079) [2022-07-09 08:57:26,517][26022] Updated weights on worker 0-0, policy_version 177904 (0.00084) [2022-07-09 08:57:27,107][25689] Fps is (10 sec: 5714.6, 60 sec: 5695.5, 300 sec: 5676.0). Total num frames: 182175744. Throughput: 0: 6021.9. Samples: 182175634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:27,108][25689] Avg episode reward: [(0, '-51.563')] [2022-07-09 08:57:28,520][26022] Updated weights on worker 0-0, policy_version 177914 (0.00093) [2022-07-09 08:57:30,207][26022] Updated weights on worker 0-0, policy_version 177924 (0.00085) [2022-07-09 08:57:32,143][26022] Updated weights on worker 0-0, policy_version 177934 (0.00078) [2022-07-09 08:57:32,212][25689] Fps is (10 sec: 5560.4, 60 sec: 5708.5, 300 sec: 5675.0). Total num frames: 182204416. Throughput: 0: 6000.5. Samples: 182209782. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:32,212][25689] Avg episode reward: [(0, '-52.028')] [2022-07-09 08:57:33,616][26022] Updated weights on worker 0-0, policy_version 177944 (0.00086) [2022-07-09 08:57:35,799][26022] Updated weights on worker 0-0, policy_version 177954 (0.00084) [2022-07-09 08:57:37,251][25689] Fps is (10 sec: 5754.0, 60 sec: 5705.4, 300 sec: 5681.5). Total num frames: 182234112. Throughput: 0: 5127.1. Samples: 182227130. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:37,252][25689] Avg episode reward: [(0, '-51.112')] [2022-07-09 08:57:37,296][26022] Updated weights on worker 0-0, policy_version 177964 (0.00092) [2022-07-09 08:57:39,280][26022] Updated weights on worker 0-0, policy_version 177974 (0.00087) [2022-07-09 08:57:40,765][26022] Updated weights on worker 0-0, policy_version 177984 (0.00091) [2022-07-09 08:57:42,279][25689] Fps is (10 sec: 5695.9, 60 sec: 5686.6, 300 sec: 5677.8). Total num frames: 182261760. Throughput: 0: 5962.5. Samples: 182261324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:42,280][25689] Avg episode reward: [(0, '-52.013')] [2022-07-09 08:57:42,819][26022] Updated weights on worker 0-0, policy_version 177994 (0.00096) [2022-07-09 08:57:44,308][26022] Updated weights on worker 0-0, policy_version 178004 (0.00087) [2022-07-09 08:57:46,339][26022] Updated weights on worker 0-0, policy_version 178014 (0.00093) [2022-07-09 08:57:47,343][25689] Fps is (10 sec: 5682.0, 60 sec: 5716.1, 300 sec: 5678.4). Total num frames: 182291456. Throughput: 0: 5933.3. Samples: 182295700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:47,345][25689] Avg episode reward: [(0, '-52.104')] [2022-07-09 08:57:47,979][26022] Updated weights on worker 0-0, policy_version 178024 (0.00082) [2022-07-09 08:57:50,015][26022] Updated weights on worker 0-0, policy_version 178034 (0.00089) [2022-07-09 08:57:51,604][26022] Updated weights on worker 0-0, policy_version 178044 (0.00085) [2022-07-09 08:57:52,440][25689] Fps is (10 sec: 5845.5, 60 sec: 5702.1, 300 sec: 5674.0). Total num frames: 182321152. Throughput: 0: 5943.1. Samples: 182329998. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:52,440][25689] Avg episode reward: [(0, '-51.370')] [2022-07-09 08:57:53,616][26022] Updated weights on worker 0-0, policy_version 178054 (0.00085) [2022-07-09 08:57:55,220][26022] Updated weights on worker 0-0, policy_version 178064 (0.00092) [2022-07-09 08:57:57,261][26022] Updated weights on worker 0-0, policy_version 178074 (0.00091) [2022-07-09 08:57:57,457][25689] Fps is (10 sec: 5669.9, 60 sec: 5720.3, 300 sec: 5677.8). Total num frames: 182348800. Throughput: 0: 5935.5. Samples: 182347062. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:57:57,458][25689] Avg episode reward: [(0, '-51.533')] [2022-07-09 08:57:58,912][26022] Updated weights on worker 0-0, policy_version 178084 (0.00091) [2022-07-09 08:58:00,902][26022] Updated weights on worker 0-0, policy_version 178094 (0.00093) [2022-07-09 08:58:02,474][25689] Fps is (10 sec: 5408.4, 60 sec: 5651.2, 300 sec: 5679.2). Total num frames: 182375424. Throughput: 0: 5931.7. Samples: 182381116. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:58:02,475][25689] Avg episode reward: [(0, '-51.648')] [2022-07-09 08:58:02,873][26022] Updated weights on worker 0-0, policy_version 178104 (0.00084) [2022-07-09 08:58:04,725][26022] Updated weights on worker 0-0, policy_version 178114 (0.00084) [2022-07-09 08:58:06,424][26022] Updated weights on worker 0-0, policy_version 178124 (0.00076) [2022-07-09 08:58:07,503][25689] Fps is (10 sec: 5504.4, 60 sec: 5687.4, 300 sec: 5676.2). Total num frames: 182404096. Throughput: 0: 5849.7. Samples: 182413628. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:58:07,503][25689] Avg episode reward: [(0, '-51.722')] [2022-07-09 08:58:08,113][26022] Updated weights on worker 0-0, policy_version 178134 (0.00088) [2022-07-09 08:58:09,961][26022] Updated weights on worker 0-0, policy_version 178144 (0.00087) [2022-07-09 08:58:11,786][26022] Updated weights on worker 0-0, policy_version 178154 (0.00086) [2022-07-09 08:58:12,597][25689] Fps is (10 sec: 5665.1, 60 sec: 5667.6, 300 sec: 5678.0). Total num frames: 182432768. Throughput: 0: 4984.5. Samples: 182430472. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:58:12,597][25689] Avg episode reward: [(0, '-51.238')] [2022-07-09 08:58:13,606][26022] Updated weights on worker 0-0, policy_version 178164 (0.00088) [2022-07-09 08:58:15,357][26022] Updated weights on worker 0-0, policy_version 178174 (0.00090) [2022-07-09 08:58:17,084][26022] Updated weights on worker 0-0, policy_version 178184 (0.00086) [2022-07-09 08:58:17,632][25689] Fps is (10 sec: 5762.6, 60 sec: 5681.5, 300 sec: 5681.0). Total num frames: 182462464. Throughput: 0: 5849.1. Samples: 182465064. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:58:17,632][25689] Avg episode reward: [(0, '-51.706')] [2022-07-09 08:58:19,067][26022] Updated weights on worker 0-0, policy_version 178194 (0.00090) [2022-07-09 08:58:20,660][26022] Updated weights on worker 0-0, policy_version 178204 (0.00090) [2022-07-09 08:58:22,640][26022] Updated weights on worker 0-0, policy_version 178214 (0.00090) [2022-07-09 08:58:22,730][25689] Fps is (10 sec: 5759.9, 60 sec: 5656.3, 300 sec: 5679.4). Total num frames: 182491136. Throughput: 0: 5852.8. Samples: 182499670. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:58:22,731][25689] Avg episode reward: [(0, '-51.922')] [2022-07-09 08:58:24,248][26022] Updated weights on worker 0-0, policy_version 178224 (0.00084) [2022-07-09 08:58:26,372][26022] Updated weights on worker 0-0, policy_version 178234 (0.00056) [2022-07-09 08:58:27,732][25689] Fps is (10 sec: 5677.4, 60 sec: 5675.3, 300 sec: 5677.0). Total num frames: 182519808. Throughput: 0: 5102.3. Samples: 182516844. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 08:58:27,732][25689] Avg episode reward: [(0, '-50.833')] [2022-07-09 08:58:27,866][26022] Updated weights on worker 0-0, policy_version 178244 (0.00086) [2022-07-09 08:58:29,743][26022] Updated weights on worker 0-0, policy_version 178254 (0.00090) [2022-07-09 08:58:31,437][26022] Updated weights on worker 0-0, policy_version 178264 (0.00085) [2022-07-09 08:58:32,806][25689] Fps is (10 sec: 5691.6, 60 sec: 5678.2, 300 sec: 5676.1). Total num frames: 182548480. Throughput: 0: 5970.5. Samples: 182551128. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:58:32,806][25689] Avg episode reward: [(0, '-49.968')] [2022-07-09 08:58:33,391][26022] Updated weights on worker 0-0, policy_version 178274 (0.00086) [2022-07-09 08:58:35,183][26022] Updated weights on worker 0-0, policy_version 178284 (0.00079) [2022-07-09 08:58:37,011][26022] Updated weights on worker 0-0, policy_version 178294 (0.00083) [2022-07-09 08:58:37,813][25689] Fps is (10 sec: 5790.0, 60 sec: 5681.2, 300 sec: 5683.1). Total num frames: 182578176. Throughput: 0: 5963.2. Samples: 182585408. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:58:37,813][25689] Avg episode reward: [(0, '-50.344')] [2022-07-09 08:58:38,688][26022] Updated weights on worker 0-0, policy_version 178304 (0.00089) [2022-07-09 08:58:40,671][26022] Updated weights on worker 0-0, policy_version 178314 (0.00088) [2022-07-09 08:58:41,425][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 08:58:41,438][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000178319_182598656.pth [2022-07-09 08:58:41,439][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000176322_180553728.pth [2022-07-09 08:58:42,202][26022] Updated weights on worker 0-0, policy_version 178324 (0.00093) [2022-07-09 08:58:42,819][25689] Fps is (10 sec: 5829.1, 60 sec: 5700.2, 300 sec: 5680.1). Total num frames: 182606848. Throughput: 0: 5114.8. Samples: 182602418. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:58:42,819][25689] Avg episode reward: [(0, '-50.077')] [2022-07-09 08:58:44,173][26022] Updated weights on worker 0-0, policy_version 178334 (0.00085) [2022-07-09 08:58:45,914][26022] Updated weights on worker 0-0, policy_version 178344 (0.00056) [2022-07-09 08:58:47,672][26022] Updated weights on worker 0-0, policy_version 178354 (0.00077) [2022-07-09 08:58:47,844][25689] Fps is (10 sec: 5716.3, 60 sec: 5686.9, 300 sec: 5685.0). Total num frames: 182635520. Throughput: 0: 5958.8. Samples: 182636690. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:58:47,845][25689] Avg episode reward: [(0, '-49.211')] [2022-07-09 08:58:49,574][26022] Updated weights on worker 0-0, policy_version 178364 (0.00088) [2022-07-09 08:58:51,362][26022] Updated weights on worker 0-0, policy_version 178374 (0.00084) [2022-07-09 08:58:52,900][25689] Fps is (10 sec: 5688.1, 60 sec: 5673.8, 300 sec: 5687.6). Total num frames: 182664192. Throughput: 0: 5951.7. Samples: 182670724. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:58:52,901][25689] Avg episode reward: [(0, '-49.445')] [2022-07-09 08:58:53,070][26022] Updated weights on worker 0-0, policy_version 178384 (0.00092) [2022-07-09 08:58:55,070][26022] Updated weights on worker 0-0, policy_version 178394 (0.00620) [2022-07-09 08:58:56,642][26022] Updated weights on worker 0-0, policy_version 178404 (0.00085) [2022-07-09 08:58:57,904][25689] Fps is (10 sec: 5496.7, 60 sec: 5658.1, 300 sec: 5677.4). Total num frames: 182690816. Throughput: 0: 5100.7. Samples: 182687890. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:58:57,905][25689] Avg episode reward: [(0, '-50.013')] [2022-07-09 08:58:58,619][26022] Updated weights on worker 0-0, policy_version 178414 (0.00087) [2022-07-09 08:59:00,483][26022] Updated weights on worker 0-0, policy_version 178424 (0.00085) [2022-07-09 08:59:02,484][26022] Updated weights on worker 0-0, policy_version 178434 (0.00090) [2022-07-09 08:59:02,940][25689] Fps is (10 sec: 5405.8, 60 sec: 5673.3, 300 sec: 5680.8). Total num frames: 182718464. Throughput: 0: 5905.7. Samples: 182721246. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:02,940][25689] Avg episode reward: [(0, '-50.342')] [2022-07-09 08:59:04,307][26022] Updated weights on worker 0-0, policy_version 178444 (0.00097) [2022-07-09 08:59:06,004][26022] Updated weights on worker 0-0, policy_version 178454 (0.00089) [2022-07-09 08:59:07,803][26022] Updated weights on worker 0-0, policy_version 178464 (0.00099) [2022-07-09 08:59:07,951][25689] Fps is (10 sec: 5606.0, 60 sec: 5675.0, 300 sec: 5682.1). Total num frames: 182747136. Throughput: 0: 5875.5. Samples: 182754824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:07,951][25689] Avg episode reward: [(0, '-50.059')] [2022-07-09 08:59:09,766][26022] Updated weights on worker 0-0, policy_version 178474 (0.00056) [2022-07-09 08:59:11,506][26022] Updated weights on worker 0-0, policy_version 178484 (0.00413) [2022-07-09 08:59:13,023][25689] Fps is (10 sec: 5687.0, 60 sec: 5677.0, 300 sec: 5681.5). Total num frames: 182775808. Throughput: 0: 5019.1. Samples: 182771724. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:13,024][25689] Avg episode reward: [(0, '-49.966')] [2022-07-09 08:59:13,358][26022] Updated weights on worker 0-0, policy_version 178494 (0.00087) [2022-07-09 08:59:15,015][26022] Updated weights on worker 0-0, policy_version 178504 (0.00099) [2022-07-09 08:59:16,735][26022] Updated weights on worker 0-0, policy_version 178514 (0.00085) [2022-07-09 08:59:18,042][25689] Fps is (10 sec: 5885.4, 60 sec: 5695.4, 300 sec: 5688.8). Total num frames: 182806528. Throughput: 0: 5885.2. Samples: 182806404. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:18,043][25689] Avg episode reward: [(0, '-50.181')] [2022-07-09 08:59:18,690][26022] Updated weights on worker 0-0, policy_version 178524 (0.00094) [2022-07-09 08:59:20,335][26022] Updated weights on worker 0-0, policy_version 178534 (0.00087) [2022-07-09 08:59:22,192][26022] Updated weights on worker 0-0, policy_version 178544 (0.00090) [2022-07-09 08:59:23,067][25689] Fps is (10 sec: 5811.5, 60 sec: 5685.5, 300 sec: 5681.7). Total num frames: 182834176. Throughput: 0: 5952.8. Samples: 182841058. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:23,067][25689] Avg episode reward: [(0, '-50.027')] [2022-07-09 08:59:23,871][26022] Updated weights on worker 0-0, policy_version 178554 (0.00108) [2022-07-09 08:59:25,708][26022] Updated weights on worker 0-0, policy_version 178564 (0.00090) [2022-07-09 08:59:27,539][26022] Updated weights on worker 0-0, policy_version 178574 (0.00093) [2022-07-09 08:59:28,087][25689] Fps is (10 sec: 5607.1, 60 sec: 5683.7, 300 sec: 5685.5). Total num frames: 182862848. Throughput: 0: 5131.9. Samples: 182858160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:28,087][25689] Avg episode reward: [(0, '-50.180')] [2022-07-09 08:59:29,438][26022] Updated weights on worker 0-0, policy_version 178584 (0.00091) [2022-07-09 08:59:31,051][26022] Updated weights on worker 0-0, policy_version 178594 (0.00082) [2022-07-09 08:59:32,881][26022] Updated weights on worker 0-0, policy_version 178604 (0.00603) [2022-07-09 08:59:33,198][25689] Fps is (10 sec: 5659.8, 60 sec: 5680.1, 300 sec: 5680.1). Total num frames: 182891520. Throughput: 0: 5976.0. Samples: 182892292. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:33,199][25689] Avg episode reward: [(0, '-50.763')] [2022-07-09 08:59:34,799][26022] Updated weights on worker 0-0, policy_version 178614 (0.00088) [2022-07-09 08:59:36,567][26022] Updated weights on worker 0-0, policy_version 178624 (0.00085) [2022-07-09 08:59:38,289][25689] Fps is (10 sec: 5620.8, 60 sec: 5655.4, 300 sec: 5679.1). Total num frames: 182920192. Throughput: 0: 5934.9. Samples: 182926566. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:38,289][25689] Avg episode reward: [(0, '-50.437')] [2022-07-09 08:59:38,307][26022] Updated weights on worker 0-0, policy_version 178634 (0.00081) [2022-07-09 08:59:40,129][26022] Updated weights on worker 0-0, policy_version 178644 (0.00085) [2022-07-09 08:59:42,059][26022] Updated weights on worker 0-0, policy_version 178654 (0.00079) [2022-07-09 08:59:43,388][25689] Fps is (10 sec: 5627.9, 60 sec: 5646.7, 300 sec: 5680.9). Total num frames: 182948864. Throughput: 0: 5042.8. Samples: 182943532. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:43,388][25689] Avg episode reward: [(0, '-50.946')] [2022-07-09 08:59:43,696][26022] Updated weights on worker 0-0, policy_version 178664 (0.00086) [2022-07-09 08:59:45,534][26022] Updated weights on worker 0-0, policy_version 178674 (0.00053) [2022-07-09 08:59:47,068][26022] Updated weights on worker 0-0, policy_version 178684 (0.00087) [2022-07-09 08:59:48,390][25689] Fps is (10 sec: 5677.1, 60 sec: 5648.9, 300 sec: 5678.4). Total num frames: 182977536. Throughput: 0: 5923.8. Samples: 182978430. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:48,390][25689] Avg episode reward: [(0, '-51.559')] [2022-07-09 08:59:49,168][26022] Updated weights on worker 0-0, policy_version 178694 (0.00095) [2022-07-09 08:59:50,627][26022] Updated weights on worker 0-0, policy_version 178704 (0.00082) [2022-07-09 08:59:52,567][26022] Updated weights on worker 0-0, policy_version 178714 (0.00087) [2022-07-09 08:59:53,424][25689] Fps is (10 sec: 6019.9, 60 sec: 5701.7, 300 sec: 5688.5). Total num frames: 183009280. Throughput: 0: 5963.3. Samples: 183012902. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:53,424][25689] Avg episode reward: [(0, '-51.598')] [2022-07-09 08:59:54,158][26022] Updated weights on worker 0-0, policy_version 178724 (0.00099) [2022-07-09 08:59:56,182][26022] Updated weights on worker 0-0, policy_version 178734 (0.00086) [2022-07-09 08:59:57,815][26022] Updated weights on worker 0-0, policy_version 178744 (0.00082) [2022-07-09 08:59:58,501][25689] Fps is (10 sec: 5873.8, 60 sec: 5711.7, 300 sec: 5683.6). Total num frames: 183036928. Throughput: 0: 5134.2. Samples: 183030340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 08:59:58,502][25689] Avg episode reward: [(0, '-51.766')] [2022-07-09 08:59:59,761][26022] Updated weights on worker 0-0, policy_version 178754 (0.00091) [2022-07-09 09:00:01,399][26022] Updated weights on worker 0-0, policy_version 178764 (0.00087) [2022-07-09 09:00:03,575][25689] Fps is (10 sec: 5346.5, 60 sec: 5691.2, 300 sec: 5679.6). Total num frames: 183063552. Throughput: 0: 6014.0. Samples: 183064936. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:03,575][25689] Avg episode reward: [(0, '-50.978')] [2022-07-09 09:00:03,677][26022] Updated weights on worker 0-0, policy_version 178774 (0.00087) [2022-07-09 09:00:05,143][26022] Updated weights on worker 0-0, policy_version 178784 (0.00085) [2022-07-09 09:00:07,195][26022] Updated weights on worker 0-0, policy_version 178794 (0.00106) [2022-07-09 09:00:08,623][25689] Fps is (10 sec: 5665.4, 60 sec: 5721.5, 300 sec: 5682.9). Total num frames: 183094272. Throughput: 0: 5890.8. Samples: 183097620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:08,624][25689] Avg episode reward: [(0, '-51.021')] [2022-07-09 09:00:08,727][26022] Updated weights on worker 0-0, policy_version 178804 (0.00088) [2022-07-09 09:00:10,807][26022] Updated weights on worker 0-0, policy_version 178814 (0.00096) [2022-07-09 09:00:12,458][26022] Updated weights on worker 0-0, policy_version 178824 (0.00088) [2022-07-09 09:00:13,666][25689] Fps is (10 sec: 5784.0, 60 sec: 5707.4, 300 sec: 5686.0). Total num frames: 183121920. Throughput: 0: 5879.3. Samples: 183131912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:13,666][25689] Avg episode reward: [(0, '-50.923')] [2022-07-09 09:00:14,335][26022] Updated weights on worker 0-0, policy_version 178834 (0.00072) [2022-07-09 09:00:16,027][26022] Updated weights on worker 0-0, policy_version 178844 (0.00085) [2022-07-09 09:00:17,938][26022] Updated weights on worker 0-0, policy_version 178854 (0.00088) [2022-07-09 09:00:18,686][25689] Fps is (10 sec: 5596.9, 60 sec: 5673.6, 300 sec: 5682.2). Total num frames: 183150592. Throughput: 0: 5880.7. Samples: 183149038. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:18,687][25689] Avg episode reward: [(0, '-51.243')] [2022-07-09 09:00:19,561][26022] Updated weights on worker 0-0, policy_version 178864 (0.00096) [2022-07-09 09:00:21,509][26022] Updated weights on worker 0-0, policy_version 178874 (0.01444) [2022-07-09 09:00:23,189][26022] Updated weights on worker 0-0, policy_version 178884 (0.00087) [2022-07-09 09:00:23,741][25689] Fps is (10 sec: 5793.5, 60 sec: 5704.4, 300 sec: 5688.3). Total num frames: 183180288. Throughput: 0: 5896.1. Samples: 183183838. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:23,741][25689] Avg episode reward: [(0, '-50.938')] [2022-07-09 09:00:24,922][26022] Updated weights on worker 0-0, policy_version 178894 (0.00081) [2022-07-09 09:00:26,679][26022] Updated weights on worker 0-0, policy_version 178904 (0.00092) [2022-07-09 09:00:28,481][26022] Updated weights on worker 0-0, policy_version 178914 (0.00080) [2022-07-09 09:00:28,779][25689] Fps is (10 sec: 5782.9, 60 sec: 5702.7, 300 sec: 5685.6). Total num frames: 183208960. Throughput: 0: 6020.2. Samples: 183218964. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:28,781][25689] Avg episode reward: [(0, '-51.821')] [2022-07-09 09:00:30,219][26022] Updated weights on worker 0-0, policy_version 178924 (0.00089) [2022-07-09 09:00:32,115][26022] Updated weights on worker 0-0, policy_version 178934 (0.00086) [2022-07-09 09:00:33,611][26022] Updated weights on worker 0-0, policy_version 178944 (0.00093) [2022-07-09 09:00:33,842][25689] Fps is (10 sec: 5879.7, 60 sec: 5741.1, 300 sec: 5691.4). Total num frames: 183239680. Throughput: 0: 5181.9. Samples: 183236462. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:33,843][25689] Avg episode reward: [(0, '-52.489')] [2022-07-09 09:00:35,553][26022] Updated weights on worker 0-0, policy_version 178954 (0.00092) [2022-07-09 09:00:37,127][26022] Updated weights on worker 0-0, policy_version 178964 (0.00085) [2022-07-09 09:00:38,867][25689] Fps is (10 sec: 5887.0, 60 sec: 5747.2, 300 sec: 5694.5). Total num frames: 183268352. Throughput: 0: 6048.8. Samples: 183271114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 09:00:38,868][25689] Avg episode reward: [(0, '-51.718')] [2022-07-09 09:00:38,975][26022] Updated weights on worker 0-0, policy_version 178974 (0.00090) [2022-07-09 09:00:40,888][26022] Updated weights on worker 0-0, policy_version 178984 (0.00084) [2022-07-09 09:00:41,457][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:00:41,466][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000178988_183283712.pth [2022-07-09 09:00:41,466][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000176985_181232640.pth [2022-07-09 09:00:42,432][26022] Updated weights on worker 0-0, policy_version 178994 (0.00087) [2022-07-09 09:00:43,911][25689] Fps is (10 sec: 5694.9, 60 sec: 5752.5, 300 sec: 5693.9). Total num frames: 183297024. Throughput: 0: 6037.9. Samples: 183305626. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:00:43,912][25689] Avg episode reward: [(0, '-51.230')] [2022-07-09 09:00:44,331][26022] Updated weights on worker 0-0, policy_version 179004 (0.00096) [2022-07-09 09:00:46,099][26022] Updated weights on worker 0-0, policy_version 179014 (0.00078) [2022-07-09 09:00:47,873][26022] Updated weights on worker 0-0, policy_version 179024 (0.00082) [2022-07-09 09:00:48,989][25689] Fps is (10 sec: 5665.7, 60 sec: 5745.3, 300 sec: 5691.3). Total num frames: 183325696. Throughput: 0: 5159.0. Samples: 183323226. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:00:48,989][25689] Avg episode reward: [(0, '-51.271')] [2022-07-09 09:00:49,642][26022] Updated weights on worker 0-0, policy_version 179034 (0.00090) [2022-07-09 09:00:51,411][26022] Updated weights on worker 0-0, policy_version 179044 (0.00087) [2022-07-09 09:00:53,221][26022] Updated weights on worker 0-0, policy_version 179054 (0.00085) [2022-07-09 09:00:54,096][25689] Fps is (10 sec: 5932.3, 60 sec: 5738.4, 300 sec: 5704.4). Total num frames: 183357440. Throughput: 0: 6004.8. Samples: 183358082. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:00:54,096][25689] Avg episode reward: [(0, '-52.278')] [2022-07-09 09:00:54,856][26022] Updated weights on worker 0-0, policy_version 179064 (0.00092) [2022-07-09 09:00:56,804][26022] Updated weights on worker 0-0, policy_version 179074 (0.00084) [2022-07-09 09:00:58,399][26022] Updated weights on worker 0-0, policy_version 179084 (0.00088) [2022-07-09 09:00:59,102][25689] Fps is (10 sec: 5872.9, 60 sec: 5745.2, 300 sec: 5702.6). Total num frames: 183385088. Throughput: 0: 6018.2. Samples: 183392886. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:00:59,102][25689] Avg episode reward: [(0, '-51.011')] [2022-07-09 09:01:00,282][26022] Updated weights on worker 0-0, policy_version 179094 (0.00088) [2022-07-09 09:01:02,332][26022] Updated weights on worker 0-0, policy_version 179104 (0.00099) [2022-07-09 09:01:04,125][25689] Fps is (10 sec: 5411.3, 60 sec: 5749.9, 300 sec: 5700.4). Total num frames: 183411712. Throughput: 0: 5168.8. Samples: 183410106. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:04,126][25689] Avg episode reward: [(0, '-50.928')] [2022-07-09 09:01:04,227][26022] Updated weights on worker 0-0, policy_version 179114 (0.00086) [2022-07-09 09:01:05,991][26022] Updated weights on worker 0-0, policy_version 179124 (0.00088) [2022-07-09 09:01:07,808][26022] Updated weights on worker 0-0, policy_version 179134 (0.00088) [2022-07-09 09:01:09,141][25689] Fps is (10 sec: 5609.9, 60 sec: 5736.1, 300 sec: 5702.2). Total num frames: 183441408. Throughput: 0: 5900.0. Samples: 183442124. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:09,142][25689] Avg episode reward: [(0, '-51.130')] [2022-07-09 09:01:09,544][26022] Updated weights on worker 0-0, policy_version 179144 (0.00093) [2022-07-09 09:01:11,358][26022] Updated weights on worker 0-0, policy_version 179154 (0.00093) [2022-07-09 09:01:13,113][26022] Updated weights on worker 0-0, policy_version 179164 (0.00084) [2022-07-09 09:01:14,213][25689] Fps is (10 sec: 5684.5, 60 sec: 5733.3, 300 sec: 5698.1). Total num frames: 183469056. Throughput: 0: 5904.1. Samples: 183476856. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:14,213][25689] Avg episode reward: [(0, '-50.865')] [2022-07-09 09:01:14,962][26022] Updated weights on worker 0-0, policy_version 179174 (0.00087) [2022-07-09 09:01:16,767][26022] Updated weights on worker 0-0, policy_version 179184 (0.00086) [2022-07-09 09:01:18,583][26022] Updated weights on worker 0-0, policy_version 179194 (0.00941) [2022-07-09 09:01:19,255][25689] Fps is (10 sec: 5771.0, 60 sec: 5765.0, 300 sec: 5707.7). Total num frames: 183499776. Throughput: 0: 5023.5. Samples: 183494128. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:19,256][25689] Avg episode reward: [(0, '-50.231')] [2022-07-09 09:01:20,318][26022] Updated weights on worker 0-0, policy_version 179204 (0.00081) [2022-07-09 09:01:22,002][26022] Updated weights on worker 0-0, policy_version 179214 (0.00091) [2022-07-09 09:01:23,863][26022] Updated weights on worker 0-0, policy_version 179224 (0.00091) [2022-07-09 09:01:24,288][25689] Fps is (10 sec: 5793.3, 60 sec: 5733.3, 300 sec: 5698.2). Total num frames: 183527424. Throughput: 0: 5878.4. Samples: 183528630. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:24,288][25689] Avg episode reward: [(0, '-49.348')] [2022-07-09 09:01:25,557][26022] Updated weights on worker 0-0, policy_version 179234 (0.00087) [2022-07-09 09:01:27,352][26022] Updated weights on worker 0-0, policy_version 179244 (0.00080) [2022-07-09 09:01:29,087][26022] Updated weights on worker 0-0, policy_version 179254 (0.00094) [2022-07-09 09:01:29,332][25689] Fps is (10 sec: 5690.4, 60 sec: 5749.6, 300 sec: 5705.5). Total num frames: 183557120. Throughput: 0: 5997.1. Samples: 183563214. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:29,333][25689] Avg episode reward: [(0, '-50.039')] [2022-07-09 09:01:31,090][26022] Updated weights on worker 0-0, policy_version 179264 (0.00087) [2022-07-09 09:01:32,870][26022] Updated weights on worker 0-0, policy_version 179274 (0.00087) [2022-07-09 09:01:34,463][25689] Fps is (10 sec: 5736.2, 60 sec: 5709.4, 300 sec: 5699.7). Total num frames: 183585792. Throughput: 0: 5106.8. Samples: 183580270. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:34,464][25689] Avg episode reward: [(0, '-50.278')] [2022-07-09 09:01:34,568][26022] Updated weights on worker 0-0, policy_version 179284 (0.00080) [2022-07-09 09:01:36,382][26022] Updated weights on worker 0-0, policy_version 179294 (0.00084) [2022-07-09 09:01:38,264][26022] Updated weights on worker 0-0, policy_version 179304 (0.00089) [2022-07-09 09:01:39,504][25689] Fps is (10 sec: 5637.6, 60 sec: 5708.0, 300 sec: 5699.0). Total num frames: 183614464. Throughput: 0: 5940.1. Samples: 183614408. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:39,505][25689] Avg episode reward: [(0, '-50.845')] [2022-07-09 09:01:39,952][26022] Updated weights on worker 0-0, policy_version 179314 (0.00085) [2022-07-09 09:01:41,830][26022] Updated weights on worker 0-0, policy_version 179324 (0.00093) [2022-07-09 09:01:43,660][26022] Updated weights on worker 0-0, policy_version 179334 (0.00090) [2022-07-09 09:01:44,511][25689] Fps is (10 sec: 5605.3, 60 sec: 5694.6, 300 sec: 5699.3). Total num frames: 183642112. Throughput: 0: 5940.0. Samples: 183648754. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:44,513][25689] Avg episode reward: [(0, '-50.619')] [2022-07-09 09:01:45,445][26022] Updated weights on worker 0-0, policy_version 179344 (0.00081) [2022-07-09 09:01:47,132][26022] Updated weights on worker 0-0, policy_version 179354 (0.00089) [2022-07-09 09:01:48,941][26022] Updated weights on worker 0-0, policy_version 179364 (0.00092) [2022-07-09 09:01:49,569][25689] Fps is (10 sec: 5697.6, 60 sec: 5713.3, 300 sec: 5697.1). Total num frames: 183671808. Throughput: 0: 5060.9. Samples: 183665628. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:49,571][25689] Avg episode reward: [(0, '-50.600')] [2022-07-09 09:01:50,804][26022] Updated weights on worker 0-0, policy_version 179374 (0.00094) [2022-07-09 09:01:52,651][26022] Updated weights on worker 0-0, policy_version 179384 (0.00083) [2022-07-09 09:01:54,430][26022] Updated weights on worker 0-0, policy_version 179394 (0.00085) [2022-07-09 09:01:54,719][25689] Fps is (10 sec: 5617.6, 60 sec: 5641.7, 300 sec: 5698.3). Total num frames: 183699456. Throughput: 0: 5917.7. Samples: 183700136. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:54,719][25689] Avg episode reward: [(0, '-49.307')] [2022-07-09 09:01:56,135][26022] Updated weights on worker 0-0, policy_version 179404 (0.00086) [2022-07-09 09:01:58,039][26022] Updated weights on worker 0-0, policy_version 179414 (0.00097) [2022-07-09 09:01:59,523][26022] Updated weights on worker 0-0, policy_version 179424 (0.00085) [2022-07-09 09:01:59,724][25689] Fps is (10 sec: 5747.7, 60 sec: 5692.4, 300 sec: 5698.2). Total num frames: 183730176. Throughput: 0: 5951.6. Samples: 183734748. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:01:59,724][25689] Avg episode reward: [(0, '-48.944')] [2022-07-09 09:02:01,596][26022] Updated weights on worker 0-0, policy_version 179434 (0.00087) [2022-07-09 09:02:03,515][26022] Updated weights on worker 0-0, policy_version 179444 (0.00085) [2022-07-09 09:02:04,745][25689] Fps is (10 sec: 5617.0, 60 sec: 5675.7, 300 sec: 5695.4). Total num frames: 183755776. Throughput: 0: 5056.4. Samples: 183751068. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:04,746][25689] Avg episode reward: [(0, '-49.208')] [2022-07-09 09:02:05,420][26022] Updated weights on worker 0-0, policy_version 179454 (0.00089) [2022-07-09 09:02:07,279][26022] Updated weights on worker 0-0, policy_version 179464 (0.00087) [2022-07-09 09:02:08,893][26022] Updated weights on worker 0-0, policy_version 179474 (0.00093) [2022-07-09 09:02:09,753][25689] Fps is (10 sec: 5513.5, 60 sec: 5676.5, 300 sec: 5696.5). Total num frames: 183785472. Throughput: 0: 5877.9. Samples: 183784270. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:09,753][25689] Avg episode reward: [(0, '-49.153')] [2022-07-09 09:02:10,795][26022] Updated weights on worker 0-0, policy_version 179484 (0.00085) [2022-07-09 09:02:12,660][26022] Updated weights on worker 0-0, policy_version 179494 (0.00082) [2022-07-09 09:02:14,407][26022] Updated weights on worker 0-0, policy_version 179504 (0.00086) [2022-07-09 09:02:14,819][25689] Fps is (10 sec: 5793.9, 60 sec: 5693.9, 300 sec: 5695.3). Total num frames: 183814144. Throughput: 0: 5887.5. Samples: 183818480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:14,820][25689] Avg episode reward: [(0, '-50.344')] [2022-07-09 09:02:16,344][26022] Updated weights on worker 0-0, policy_version 179514 (0.00094) [2022-07-09 09:02:17,950][26022] Updated weights on worker 0-0, policy_version 179524 (0.00100) [2022-07-09 09:02:19,709][26022] Updated weights on worker 0-0, policy_version 179534 (0.00094) [2022-07-09 09:02:19,859][25689] Fps is (10 sec: 5674.2, 60 sec: 5660.4, 300 sec: 5691.3). Total num frames: 183842816. Throughput: 0: 5000.0. Samples: 183835424. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:19,860][25689] Avg episode reward: [(0, '-51.931')] [2022-07-09 09:02:21,543][26022] Updated weights on worker 0-0, policy_version 179544 (0.00086) [2022-07-09 09:02:23,198][26022] Updated weights on worker 0-0, policy_version 179554 (0.00056) [2022-07-09 09:02:24,880][25689] Fps is (10 sec: 5598.0, 60 sec: 5661.4, 300 sec: 5691.3). Total num frames: 183870464. Throughput: 0: 5914.1. Samples: 183870146. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:24,881][25689] Avg episode reward: [(0, '-52.191')] [2022-07-09 09:02:25,212][26022] Updated weights on worker 0-0, policy_version 179564 (0.00093) [2022-07-09 09:02:26,795][26022] Updated weights on worker 0-0, policy_version 179574 (0.00086) [2022-07-09 09:02:28,719][26022] Updated weights on worker 0-0, policy_version 179584 (0.00087) [2022-07-09 09:02:29,898][25689] Fps is (10 sec: 5814.3, 60 sec: 5680.9, 300 sec: 5699.9). Total num frames: 183901184. Throughput: 0: 5972.7. Samples: 183904588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:29,898][25689] Avg episode reward: [(0, '-52.134')] [2022-07-09 09:02:30,620][26022] Updated weights on worker 0-0, policy_version 179594 (0.00090) [2022-07-09 09:02:32,335][26022] Updated weights on worker 0-0, policy_version 179604 (0.00100) [2022-07-09 09:02:34,203][26022] Updated weights on worker 0-0, policy_version 179614 (0.00087) [2022-07-09 09:02:34,962][25689] Fps is (10 sec: 5789.6, 60 sec: 5670.2, 300 sec: 5692.5). Total num frames: 183928832. Throughput: 0: 5984.9. Samples: 183939030. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:34,962][25689] Avg episode reward: [(0, '-51.710')] [2022-07-09 09:02:35,878][26022] Updated weights on worker 0-0, policy_version 179624 (0.00082) [2022-07-09 09:02:37,556][26022] Updated weights on worker 0-0, policy_version 179634 (0.00091) [2022-07-09 09:02:39,474][26022] Updated weights on worker 0-0, policy_version 179644 (0.00081) [2022-07-09 09:02:40,048][25689] Fps is (10 sec: 5649.2, 60 sec: 5682.8, 300 sec: 5698.3). Total num frames: 183958528. Throughput: 0: 5992.3. Samples: 183956406. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:40,049][25689] Avg episode reward: [(0, '-49.941')] [2022-07-09 09:02:41,073][26022] Updated weights on worker 0-0, policy_version 179654 (0.00088) [2022-07-09 09:02:41,766][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:02:41,779][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000179657_183968768.pth [2022-07-09 09:02:41,780][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000177651_181914624.pth [2022-07-09 09:02:43,014][26022] Updated weights on worker 0-0, policy_version 179664 (0.00083) [2022-07-09 09:02:44,664][26022] Updated weights on worker 0-0, policy_version 179674 (0.00079) [2022-07-09 09:02:45,079][25689] Fps is (10 sec: 5769.0, 60 sec: 5697.5, 300 sec: 5695.5). Total num frames: 183987200. Throughput: 0: 5991.4. Samples: 183991166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:45,080][25689] Avg episode reward: [(0, '-50.240')] [2022-07-09 09:02:46,475][26022] Updated weights on worker 0-0, policy_version 179684 (0.00089) [2022-07-09 09:02:48,442][26022] Updated weights on worker 0-0, policy_version 179694 (0.00080) [2022-07-09 09:02:49,970][26022] Updated weights on worker 0-0, policy_version 179704 (0.00099) [2022-07-09 09:02:50,122][25689] Fps is (10 sec: 5794.4, 60 sec: 5698.9, 300 sec: 5696.5). Total num frames: 184016896. Throughput: 0: 5957.5. Samples: 184025072. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 09:02:50,122][25689] Avg episode reward: [(0, '-50.421')] [2022-07-09 09:02:52,148][26022] Updated weights on worker 0-0, policy_version 179714 (0.00094) [2022-07-09 09:02:53,783][26022] Updated weights on worker 0-0, policy_version 179724 (0.00087) [2022-07-09 09:02:55,231][25689] Fps is (10 sec: 5648.8, 60 sec: 5702.8, 300 sec: 5694.7). Total num frames: 184044544. Throughput: 0: 5079.2. Samples: 184041980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:02:55,231][25689] Avg episode reward: [(0, '-50.288')] [2022-07-09 09:02:55,663][26022] Updated weights on worker 0-0, policy_version 179734 (0.00060) [2022-07-09 09:02:57,283][26022] Updated weights on worker 0-0, policy_version 179744 (0.00083) [2022-07-09 09:02:59,202][26022] Updated weights on worker 0-0, policy_version 179754 (0.00088) [2022-07-09 09:03:00,255][25689] Fps is (10 sec: 5659.1, 60 sec: 5684.1, 300 sec: 5704.9). Total num frames: 184074240. Throughput: 0: 5949.7. Samples: 184076628. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:00,256][25689] Avg episode reward: [(0, '-51.113')] [2022-07-09 09:03:00,866][26022] Updated weights on worker 0-0, policy_version 179764 (0.00087) [2022-07-09 09:03:03,241][26022] Updated weights on worker 0-0, policy_version 179774 (0.00083) [2022-07-09 09:03:04,680][26022] Updated weights on worker 0-0, policy_version 179784 (0.00096) [2022-07-09 09:03:05,287][25689] Fps is (10 sec: 5600.8, 60 sec: 5700.1, 300 sec: 5698.0). Total num frames: 184100864. Throughput: 0: 5831.3. Samples: 184109002. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:05,287][25689] Avg episode reward: [(0, '-50.978')] [2022-07-09 09:03:06,685][26022] Updated weights on worker 0-0, policy_version 179794 (0.00081) [2022-07-09 09:03:08,357][26022] Updated weights on worker 0-0, policy_version 179804 (0.00087) [2022-07-09 09:03:10,176][26022] Updated weights on worker 0-0, policy_version 179814 (0.00085) [2022-07-09 09:03:10,335][25689] Fps is (10 sec: 5688.9, 60 sec: 5713.1, 300 sec: 5705.7). Total num frames: 184131584. Throughput: 0: 5009.7. Samples: 184126334. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:10,335][25689] Avg episode reward: [(0, '-50.570')] [2022-07-09 09:03:12,278][26022] Updated weights on worker 0-0, policy_version 179824 (0.00980) [2022-07-09 09:03:13,645][26022] Updated weights on worker 0-0, policy_version 179834 (0.00088) [2022-07-09 09:03:15,399][25689] Fps is (10 sec: 5569.4, 60 sec: 5662.6, 300 sec: 5691.4). Total num frames: 184157184. Throughput: 0: 5883.8. Samples: 184160646. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:15,400][25689] Avg episode reward: [(0, '-50.491')] [2022-07-09 09:03:15,743][26022] Updated weights on worker 0-0, policy_version 179844 (0.00089) [2022-07-09 09:03:17,565][26022] Updated weights on worker 0-0, policy_version 179854 (0.00087) [2022-07-09 09:03:19,127][26022] Updated weights on worker 0-0, policy_version 179864 (0.00079) [2022-07-09 09:03:20,494][25689] Fps is (10 sec: 5443.0, 60 sec: 5674.4, 300 sec: 5694.9). Total num frames: 184186880. Throughput: 0: 5827.5. Samples: 184194572. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:20,499][25689] Avg episode reward: [(0, '-49.676')] [2022-07-09 09:03:21,167][26022] Updated weights on worker 0-0, policy_version 179874 (0.00087) [2022-07-09 09:03:22,868][26022] Updated weights on worker 0-0, policy_version 179884 (0.00088) [2022-07-09 09:03:24,723][26022] Updated weights on worker 0-0, policy_version 179894 (0.00084) [2022-07-09 09:03:25,540][25689] Fps is (10 sec: 5856.5, 60 sec: 5705.8, 300 sec: 5697.5). Total num frames: 184216576. Throughput: 0: 5074.6. Samples: 184211780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:25,541][25689] Avg episode reward: [(0, '-49.992')] [2022-07-09 09:03:26,382][26022] Updated weights on worker 0-0, policy_version 179904 (0.00084) [2022-07-09 09:03:28,148][26022] Updated weights on worker 0-0, policy_version 179914 (0.00089) [2022-07-09 09:03:29,944][26022] Updated weights on worker 0-0, policy_version 179924 (0.00087) [2022-07-09 09:03:30,557][25689] Fps is (10 sec: 5800.2, 60 sec: 5672.1, 300 sec: 5698.6). Total num frames: 184245248. Throughput: 0: 5919.3. Samples: 184246036. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:30,557][25689] Avg episode reward: [(0, '-49.410')] [2022-07-09 09:03:31,831][26022] Updated weights on worker 0-0, policy_version 179934 (0.00091) [2022-07-09 09:03:33,537][26022] Updated weights on worker 0-0, policy_version 179944 (0.00086) [2022-07-09 09:03:35,480][26022] Updated weights on worker 0-0, policy_version 179954 (0.00091) [2022-07-09 09:03:35,648][25689] Fps is (10 sec: 5673.1, 60 sec: 5686.4, 300 sec: 5693.6). Total num frames: 184273920. Throughput: 0: 5919.7. Samples: 184280516. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:35,649][25689] Avg episode reward: [(0, '-49.595')] [2022-07-09 09:03:37,062][26022] Updated weights on worker 0-0, policy_version 179964 (0.00085) [2022-07-09 09:03:39,025][26022] Updated weights on worker 0-0, policy_version 179974 (0.00096) [2022-07-09 09:03:40,677][25689] Fps is (10 sec: 5666.2, 60 sec: 5674.9, 300 sec: 5693.1). Total num frames: 184302592. Throughput: 0: 5110.9. Samples: 184297728. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:40,678][25689] Avg episode reward: [(0, '-50.722')] [2022-07-09 09:03:40,736][26022] Updated weights on worker 0-0, policy_version 179984 (0.00087) [2022-07-09 09:03:42,494][26022] Updated weights on worker 0-0, policy_version 179994 (0.00092) [2022-07-09 09:03:44,312][26022] Updated weights on worker 0-0, policy_version 180004 (0.00089) [2022-07-09 09:03:45,695][25689] Fps is (10 sec: 5809.4, 60 sec: 5693.0, 300 sec: 5696.7). Total num frames: 184332288. Throughput: 0: 5979.4. Samples: 184332296. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:45,697][25689] Avg episode reward: [(0, '-50.198')] [2022-07-09 09:03:46,115][26022] Updated weights on worker 0-0, policy_version 180014 (0.00096) [2022-07-09 09:03:48,059][26022] Updated weights on worker 0-0, policy_version 180024 (0.00082) [2022-07-09 09:03:49,598][26022] Updated weights on worker 0-0, policy_version 180034 (0.00080) [2022-07-09 09:03:50,713][25689] Fps is (10 sec: 5713.9, 60 sec: 5661.5, 300 sec: 5694.0). Total num frames: 184359936. Throughput: 0: 5970.9. Samples: 184366386. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:50,715][25689] Avg episode reward: [(0, '-51.339')] [2022-07-09 09:03:51,690][26022] Updated weights on worker 0-0, policy_version 180044 (0.00088) [2022-07-09 09:03:53,139][26022] Updated weights on worker 0-0, policy_version 180054 (0.00085) [2022-07-09 09:03:55,231][26022] Updated weights on worker 0-0, policy_version 180064 (0.00073) [2022-07-09 09:03:55,799][25689] Fps is (10 sec: 5472.9, 60 sec: 5663.7, 300 sec: 5695.9). Total num frames: 184387584. Throughput: 0: 5089.3. Samples: 184383070. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:03:55,799][25689] Avg episode reward: [(0, '-52.031')] [2022-07-09 09:03:56,973][26022] Updated weights on worker 0-0, policy_version 180074 (0.00091) [2022-07-09 09:03:58,772][26022] Updated weights on worker 0-0, policy_version 180084 (0.00096) [2022-07-09 09:04:00,537][26022] Updated weights on worker 0-0, policy_version 180094 (0.00086) [2022-07-09 09:04:00,814][25689] Fps is (10 sec: 5575.6, 60 sec: 5647.6, 300 sec: 5699.7). Total num frames: 184416256. Throughput: 0: 5940.0. Samples: 184417342. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:00,815][25689] Avg episode reward: [(0, '-53.120')] [2022-07-09 09:04:02,485][26022] Updated weights on worker 0-0, policy_version 180104 (0.00089) [2022-07-09 09:04:04,505][26022] Updated weights on worker 0-0, policy_version 180114 (0.00095) [2022-07-09 09:04:05,829][25689] Fps is (10 sec: 5614.9, 60 sec: 5666.1, 300 sec: 5696.2). Total num frames: 184443904. Throughput: 0: 5814.6. Samples: 184449368. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:05,830][25689] Avg episode reward: [(0, '-52.428')] [2022-07-09 09:04:06,404][26022] Updated weights on worker 0-0, policy_version 180124 (0.00085) [2022-07-09 09:04:08,024][26022] Updated weights on worker 0-0, policy_version 180134 (0.00087) [2022-07-09 09:04:10,103][26022] Updated weights on worker 0-0, policy_version 180144 (0.00093) [2022-07-09 09:04:10,835][25689] Fps is (10 sec: 5518.1, 60 sec: 5619.3, 300 sec: 5694.0). Total num frames: 184471552. Throughput: 0: 4960.1. Samples: 184466194. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:10,837][25689] Avg episode reward: [(0, '-51.958')] [2022-07-09 09:04:11,652][26022] Updated weights on worker 0-0, policy_version 180154 (0.00092) [2022-07-09 09:04:13,455][26022] Updated weights on worker 0-0, policy_version 180164 (0.00081) [2022-07-09 09:04:15,298][26022] Updated weights on worker 0-0, policy_version 180174 (0.00086) [2022-07-09 09:04:15,890][25689] Fps is (10 sec: 5597.9, 60 sec: 5670.9, 300 sec: 5686.4). Total num frames: 184500224. Throughput: 0: 5863.3. Samples: 184500872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:15,891][25689] Avg episode reward: [(0, '-51.894')] [2022-07-09 09:04:17,049][26022] Updated weights on worker 0-0, policy_version 180184 (0.00092) [2022-07-09 09:04:19,123][26022] Updated weights on worker 0-0, policy_version 180194 (0.00093) [2022-07-09 09:04:20,694][26022] Updated weights on worker 0-0, policy_version 180204 (0.00080) [2022-07-09 09:04:20,895][25689] Fps is (10 sec: 5700.1, 60 sec: 5662.4, 300 sec: 5690.3). Total num frames: 184528896. Throughput: 0: 5855.5. Samples: 184534926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:20,897][25689] Avg episode reward: [(0, '-51.081')] [2022-07-09 09:04:22,646][26022] Updated weights on worker 0-0, policy_version 180214 (0.00083) [2022-07-09 09:04:24,196][26022] Updated weights on worker 0-0, policy_version 180224 (0.00098) [2022-07-09 09:04:25,907][25689] Fps is (10 sec: 5725.0, 60 sec: 5648.7, 300 sec: 5690.4). Total num frames: 184557568. Throughput: 0: 5122.1. Samples: 184552206. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:25,908][25689] Avg episode reward: [(0, '-50.637')] [2022-07-09 09:04:26,075][26022] Updated weights on worker 0-0, policy_version 180234 (0.00085) [2022-07-09 09:04:27,810][26022] Updated weights on worker 0-0, policy_version 180244 (0.00092) [2022-07-09 09:04:29,586][26022] Updated weights on worker 0-0, policy_version 180254 (0.00095) [2022-07-09 09:04:30,930][25689] Fps is (10 sec: 5816.5, 60 sec: 5665.0, 300 sec: 5695.5). Total num frames: 184587264. Throughput: 0: 6018.5. Samples: 184587136. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:30,931][25689] Avg episode reward: [(0, '-49.693')] [2022-07-09 09:04:31,613][26022] Updated weights on worker 0-0, policy_version 180264 (0.00086) [2022-07-09 09:04:33,110][26022] Updated weights on worker 0-0, policy_version 180274 (0.00085) [2022-07-09 09:04:35,036][26022] Updated weights on worker 0-0, policy_version 180284 (0.00090) [2022-07-09 09:04:36,063][25689] Fps is (10 sec: 5847.9, 60 sec: 5678.1, 300 sec: 5698.2). Total num frames: 184616960. Throughput: 0: 5983.9. Samples: 184621582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:36,063][25689] Avg episode reward: [(0, '-51.183')] [2022-07-09 09:04:36,748][26022] Updated weights on worker 0-0, policy_version 180294 (0.00084) [2022-07-09 09:04:38,529][26022] Updated weights on worker 0-0, policy_version 180304 (0.00090) [2022-07-09 09:04:40,368][26022] Updated weights on worker 0-0, policy_version 180314 (0.00086) [2022-07-09 09:04:41,087][25689] Fps is (10 sec: 5847.5, 60 sec: 5695.5, 300 sec: 5703.0). Total num frames: 184646656. Throughput: 0: 5142.5. Samples: 184638762. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:41,087][25689] Avg episode reward: [(0, '-52.258')] [2022-07-09 09:04:41,907][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:04:41,921][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000180323_184650752.pth [2022-07-09 09:04:41,922][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000178319_182598656.pth [2022-07-09 09:04:42,161][26022] Updated weights on worker 0-0, policy_version 180324 (0.00092) [2022-07-09 09:04:43,788][26022] Updated weights on worker 0-0, policy_version 180334 (0.00084) [2022-07-09 09:04:45,911][26022] Updated weights on worker 0-0, policy_version 180344 (0.00095) [2022-07-09 09:04:46,106][25689] Fps is (10 sec: 5506.0, 60 sec: 5627.6, 300 sec: 5692.4). Total num frames: 184672256. Throughput: 0: 5980.8. Samples: 184673012. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:46,106][25689] Avg episode reward: [(0, '-51.689')] [2022-07-09 09:04:47,339][26022] Updated weights on worker 0-0, policy_version 180354 (0.00071) [2022-07-09 09:04:49,570][26022] Updated weights on worker 0-0, policy_version 180364 (0.00085) [2022-07-09 09:04:50,991][26022] Updated weights on worker 0-0, policy_version 180374 (0.00086) [2022-07-09 09:04:51,121][25689] Fps is (10 sec: 5613.0, 60 sec: 5678.8, 300 sec: 5689.3). Total num frames: 184702976. Throughput: 0: 5942.8. Samples: 184707124. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:51,122][25689] Avg episode reward: [(0, '-51.914')] [2022-07-09 09:04:53,106][26022] Updated weights on worker 0-0, policy_version 180384 (0.00085) [2022-07-09 09:04:54,545][26022] Updated weights on worker 0-0, policy_version 180394 (0.00084) [2022-07-09 09:04:56,203][25689] Fps is (10 sec: 5780.6, 60 sec: 5679.1, 300 sec: 5689.2). Total num frames: 184730624. Throughput: 0: 5102.2. Samples: 184724340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:04:56,203][25689] Avg episode reward: [(0, '-51.797')] [2022-07-09 09:04:56,621][26022] Updated weights on worker 0-0, policy_version 180404 (0.00094) [2022-07-09 09:04:58,518][26022] Updated weights on worker 0-0, policy_version 180414 (0.00112) [2022-07-09 09:05:00,152][26022] Updated weights on worker 0-0, policy_version 180424 (0.00090) [2022-07-09 09:05:01,286][25689] Fps is (10 sec: 5540.3, 60 sec: 5672.7, 300 sec: 5695.9). Total num frames: 184759296. Throughput: 0: 5900.5. Samples: 184757948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 09:05:01,288][25689] Avg episode reward: [(0, '-51.867')] [2022-07-09 09:05:02,343][26022] Updated weights on worker 0-0, policy_version 180434 (0.00087) [2022-07-09 09:05:04,307][26022] Updated weights on worker 0-0, policy_version 180444 (0.00091) [2022-07-09 09:05:05,969][26022] Updated weights on worker 0-0, policy_version 180454 (0.00087) [2022-07-09 09:05:06,311][25689] Fps is (10 sec: 5571.7, 60 sec: 5671.8, 300 sec: 5686.1). Total num frames: 184786944. Throughput: 0: 5794.9. Samples: 184790102. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:06,313][25689] Avg episode reward: [(0, '-51.434')] [2022-07-09 09:05:07,763][26022] Updated weights on worker 0-0, policy_version 180464 (0.00095) [2022-07-09 09:05:09,505][26022] Updated weights on worker 0-0, policy_version 180474 (0.00095) [2022-07-09 09:05:11,323][25689] Fps is (10 sec: 5407.5, 60 sec: 5654.3, 300 sec: 5683.2). Total num frames: 184813568. Throughput: 0: 5805.2. Samples: 184824400. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:11,323][25689] Avg episode reward: [(0, '-51.605')] [2022-07-09 09:05:11,595][26022] Updated weights on worker 0-0, policy_version 180484 (0.00082) [2022-07-09 09:05:13,229][26022] Updated weights on worker 0-0, policy_version 180494 (0.00087) [2022-07-09 09:05:15,051][26022] Updated weights on worker 0-0, policy_version 180504 (0.00084) [2022-07-09 09:05:16,443][25689] Fps is (10 sec: 5659.9, 60 sec: 5682.1, 300 sec: 5688.2). Total num frames: 184844288. Throughput: 0: 5779.2. Samples: 184841310. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:16,445][25689] Avg episode reward: [(0, '-51.314')] [2022-07-09 09:05:16,613][26022] Updated weights on worker 0-0, policy_version 180514 (0.00090) [2022-07-09 09:05:18,594][26022] Updated weights on worker 0-0, policy_version 180524 (0.00088) [2022-07-09 09:05:20,376][26022] Updated weights on worker 0-0, policy_version 180534 (0.00087) [2022-07-09 09:05:21,447][25689] Fps is (10 sec: 5866.0, 60 sec: 5682.1, 300 sec: 5685.7). Total num frames: 184872960. Throughput: 0: 5829.3. Samples: 184875474. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:21,449][25689] Avg episode reward: [(0, '-51.365')] [2022-07-09 09:05:22,216][26022] Updated weights on worker 0-0, policy_version 180544 (0.00092) [2022-07-09 09:05:23,926][26022] Updated weights on worker 0-0, policy_version 180554 (0.00084) [2022-07-09 09:05:25,642][26022] Updated weights on worker 0-0, policy_version 180564 (0.00091) [2022-07-09 09:05:26,456][25689] Fps is (10 sec: 5624.5, 60 sec: 5665.4, 300 sec: 5682.8). Total num frames: 184900608. Throughput: 0: 5943.9. Samples: 184909842. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:26,458][25689] Avg episode reward: [(0, '-50.588')] [2022-07-09 09:05:27,536][26022] Updated weights on worker 0-0, policy_version 180574 (0.00089) [2022-07-09 09:05:29,381][26022] Updated weights on worker 0-0, policy_version 180584 (0.00089) [2022-07-09 09:05:31,194][26022] Updated weights on worker 0-0, policy_version 180594 (0.00093) [2022-07-09 09:05:31,485][25689] Fps is (10 sec: 5814.9, 60 sec: 5681.8, 300 sec: 5683.5). Total num frames: 184931328. Throughput: 0: 5083.3. Samples: 184926892. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:31,487][25689] Avg episode reward: [(0, '-49.642')] [2022-07-09 09:05:33,105][26022] Updated weights on worker 0-0, policy_version 180604 (0.00083) [2022-07-09 09:05:34,383][26022] Updated weights on worker 0-0, policy_version 180614 (0.00085) [2022-07-09 09:05:36,587][25689] Fps is (10 sec: 5761.4, 60 sec: 5650.9, 300 sec: 5678.6). Total num frames: 184958976. Throughput: 0: 5969.7. Samples: 184961566. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:36,588][25689] Avg episode reward: [(0, '-49.459')] [2022-07-09 09:05:36,594][26022] Updated weights on worker 0-0, policy_version 180624 (0.00090) [2022-07-09 09:05:37,987][26022] Updated weights on worker 0-0, policy_version 180634 (0.00088) [2022-07-09 09:05:40,068][26022] Updated weights on worker 0-0, policy_version 180644 (0.00096) [2022-07-09 09:05:41,589][25689] Fps is (10 sec: 5776.8, 60 sec: 5669.9, 300 sec: 5686.3). Total num frames: 184989696. Throughput: 0: 5996.1. Samples: 184996246. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:41,590][26022] Updated weights on worker 0-0, policy_version 180654 (0.00085) [2022-07-09 09:05:41,590][25689] Avg episode reward: [(0, '-49.483')] [2022-07-09 09:05:43,513][26022] Updated weights on worker 0-0, policy_version 180664 (0.00087) [2022-07-09 09:05:45,263][26022] Updated weights on worker 0-0, policy_version 180674 (0.00088) [2022-07-09 09:05:46,611][25689] Fps is (10 sec: 5823.1, 60 sec: 5703.5, 300 sec: 5683.9). Total num frames: 185017344. Throughput: 0: 5158.0. Samples: 185013798. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:46,611][25689] Avg episode reward: [(0, '-49.165')] [2022-07-09 09:05:47,283][26022] Updated weights on worker 0-0, policy_version 180684 (0.00089) [2022-07-09 09:05:48,715][26022] Updated weights on worker 0-0, policy_version 180694 (0.00100) [2022-07-09 09:05:50,952][26022] Updated weights on worker 0-0, policy_version 180704 (0.00091) [2022-07-09 09:05:51,617][25689] Fps is (10 sec: 5514.4, 60 sec: 5653.5, 300 sec: 5672.0). Total num frames: 185044992. Throughput: 0: 6017.0. Samples: 185048022. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:51,617][25689] Avg episode reward: [(0, '-48.444')] [2022-07-09 09:05:52,227][26022] Updated weights on worker 0-0, policy_version 180714 (0.00088) [2022-07-09 09:05:54,406][26022] Updated weights on worker 0-0, policy_version 180724 (0.00090) [2022-07-09 09:05:56,113][26022] Updated weights on worker 0-0, policy_version 180734 (0.00086) [2022-07-09 09:05:56,716][25689] Fps is (10 sec: 5573.5, 60 sec: 5668.9, 300 sec: 5673.7). Total num frames: 185073664. Throughput: 0: 5988.0. Samples: 185082096. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:05:56,716][25689] Avg episode reward: [(0, '-48.868')] [2022-07-09 09:05:57,789][26022] Updated weights on worker 0-0, policy_version 180744 (0.00087) [2022-07-09 09:05:59,818][26022] Updated weights on worker 0-0, policy_version 180754 (0.00098) [2022-07-09 09:06:01,398][26022] Updated weights on worker 0-0, policy_version 180764 (0.00091) [2022-07-09 09:06:01,752][25689] Fps is (10 sec: 5758.9, 60 sec: 5690.2, 300 sec: 5683.8). Total num frames: 185103360. Throughput: 0: 5108.2. Samples: 185099242. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:01,752][25689] Avg episode reward: [(0, '-49.560')] [2022-07-09 09:06:03,763][26022] Updated weights on worker 0-0, policy_version 180774 (0.00095) [2022-07-09 09:06:05,367][26022] Updated weights on worker 0-0, policy_version 180784 (0.00085) [2022-07-09 09:06:06,842][25689] Fps is (10 sec: 5460.9, 60 sec: 5650.3, 300 sec: 5668.6). Total num frames: 185128960. Throughput: 0: 5798.5. Samples: 185131106. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:06,842][25689] Avg episode reward: [(0, '-49.113')] [2022-07-09 09:06:07,138][26022] Updated weights on worker 0-0, policy_version 180794 (0.00092) [2022-07-09 09:06:09,115][26022] Updated weights on worker 0-0, policy_version 180804 (0.00089) [2022-07-09 09:06:10,977][26022] Updated weights on worker 0-0, policy_version 180814 (0.00088) [2022-07-09 09:06:11,891][25689] Fps is (10 sec: 5453.9, 60 sec: 5697.5, 300 sec: 5675.9). Total num frames: 185158656. Throughput: 0: 5773.4. Samples: 185165072. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:11,891][25689] Avg episode reward: [(0, '-50.518')] [2022-07-09 09:06:12,684][26022] Updated weights on worker 0-0, policy_version 180824 (0.00090) [2022-07-09 09:06:14,749][26022] Updated weights on worker 0-0, policy_version 180834 (0.00087) [2022-07-09 09:06:16,119][26022] Updated weights on worker 0-0, policy_version 180844 (0.00092) [2022-07-09 09:06:17,024][25689] Fps is (10 sec: 5631.4, 60 sec: 5645.5, 300 sec: 5663.9). Total num frames: 185186304. Throughput: 0: 4930.0. Samples: 185182218. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:17,025][25689] Avg episode reward: [(0, '-50.675')] [2022-07-09 09:06:18,180][26022] Updated weights on worker 0-0, policy_version 180854 (0.00083) [2022-07-09 09:06:20,139][26022] Updated weights on worker 0-0, policy_version 180864 (0.00907) [2022-07-09 09:06:21,647][26022] Updated weights on worker 0-0, policy_version 180874 (0.00086) [2022-07-09 09:06:22,035][25689] Fps is (10 sec: 5653.0, 60 sec: 5661.9, 300 sec: 5671.2). Total num frames: 185216000. Throughput: 0: 5758.5. Samples: 185216040. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:22,036][25689] Avg episode reward: [(0, '-50.587')] [2022-07-09 09:06:23,752][26022] Updated weights on worker 0-0, policy_version 180884 (0.00086) [2022-07-09 09:06:25,415][26022] Updated weights on worker 0-0, policy_version 180894 (0.00094) [2022-07-09 09:06:27,053][25689] Fps is (10 sec: 5718.3, 60 sec: 5661.1, 300 sec: 5664.8). Total num frames: 185243648. Throughput: 0: 5895.7. Samples: 185250264. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:27,054][25689] Avg episode reward: [(0, '-50.099')] [2022-07-09 09:06:27,204][26022] Updated weights on worker 0-0, policy_version 180904 (0.00084) [2022-07-09 09:06:29,084][26022] Updated weights on worker 0-0, policy_version 180914 (0.00092) [2022-07-09 09:06:30,870][26022] Updated weights on worker 0-0, policy_version 180924 (0.00100) [2022-07-09 09:06:32,074][25689] Fps is (10 sec: 5507.9, 60 sec: 5611.1, 300 sec: 5663.5). Total num frames: 185271296. Throughput: 0: 5064.6. Samples: 185267292. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:32,075][25689] Avg episode reward: [(0, '-50.787')] [2022-07-09 09:06:32,785][26022] Updated weights on worker 0-0, policy_version 180934 (0.00090) [2022-07-09 09:06:34,586][26022] Updated weights on worker 0-0, policy_version 180944 (0.00091) [2022-07-09 09:06:36,220][26022] Updated weights on worker 0-0, policy_version 180954 (0.00093) [2022-07-09 09:06:37,140][25689] Fps is (10 sec: 5684.7, 60 sec: 5648.2, 300 sec: 5666.4). Total num frames: 185300992. Throughput: 0: 5919.6. Samples: 185301294. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:37,145][25689] Avg episode reward: [(0, '-50.791')] [2022-07-09 09:06:38,345][26022] Updated weights on worker 0-0, policy_version 180964 (0.00092) [2022-07-09 09:06:39,904][26022] Updated weights on worker 0-0, policy_version 180974 (0.00087) [2022-07-09 09:06:41,974][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:06:41,988][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000180984_185327616.pth [2022-07-09 09:06:41,989][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000178988_183283712.pth [2022-07-09 09:06:41,991][26022] Updated weights on worker 0-0, policy_version 180984 (0.00083) [2022-07-09 09:06:42,179][25689] Fps is (10 sec: 5776.4, 60 sec: 5611.0, 300 sec: 5669.2). Total num frames: 185329664. Throughput: 0: 5915.8. Samples: 185335208. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:42,179][25689] Avg episode reward: [(0, '-50.039')] [2022-07-09 09:06:43,579][26022] Updated weights on worker 0-0, policy_version 180994 (0.00084) [2022-07-09 09:06:45,290][26022] Updated weights on worker 0-0, policy_version 181004 (0.00088) [2022-07-09 09:06:47,186][25689] Fps is (10 sec: 5606.4, 60 sec: 5612.3, 300 sec: 5663.3). Total num frames: 185357312. Throughput: 0: 5067.9. Samples: 185352296. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:47,186][25689] Avg episode reward: [(0, '-49.563')] [2022-07-09 09:06:47,259][26022] Updated weights on worker 0-0, policy_version 181014 (0.00095) [2022-07-09 09:06:48,821][26022] Updated weights on worker 0-0, policy_version 181024 (0.00087) [2022-07-09 09:06:50,841][26022] Updated weights on worker 0-0, policy_version 181034 (0.00082) [2022-07-09 09:06:52,199][25689] Fps is (10 sec: 5723.0, 60 sec: 5645.5, 300 sec: 5672.8). Total num frames: 185387008. Throughput: 0: 5909.8. Samples: 185386224. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:52,199][25689] Avg episode reward: [(0, '-49.629')] [2022-07-09 09:06:52,623][26022] Updated weights on worker 0-0, policy_version 181044 (0.00091) [2022-07-09 09:06:54,537][26022] Updated weights on worker 0-0, policy_version 181054 (0.00092) [2022-07-09 09:06:56,154][26022] Updated weights on worker 0-0, policy_version 181064 (0.00087) [2022-07-09 09:06:57,318][25689] Fps is (10 sec: 5659.7, 60 sec: 5626.8, 300 sec: 5660.3). Total num frames: 185414656. Throughput: 0: 5893.9. Samples: 185420218. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:06:57,318][25689] Avg episode reward: [(0, '-49.632')] [2022-07-09 09:06:58,164][26022] Updated weights on worker 0-0, policy_version 181074 (0.00086) [2022-07-09 09:06:59,820][26022] Updated weights on worker 0-0, policy_version 181084 (0.00089) [2022-07-09 09:07:02,012][26022] Updated weights on worker 0-0, policy_version 181094 (0.00100) [2022-07-09 09:07:02,344][25689] Fps is (10 sec: 5450.4, 60 sec: 5593.9, 300 sec: 5667.1). Total num frames: 185442304. Throughput: 0: 5063.5. Samples: 185437316. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:07:02,344][25689] Avg episode reward: [(0, '-49.978')] [2022-07-09 09:07:03,788][26022] Updated weights on worker 0-0, policy_version 181104 (0.00090) [2022-07-09 09:07:05,539][26022] Updated weights on worker 0-0, policy_version 181114 (0.00085) [2022-07-09 09:07:07,361][25689] Fps is (10 sec: 5505.8, 60 sec: 5634.4, 300 sec: 5660.1). Total num frames: 185469952. Throughput: 0: 5812.7. Samples: 185469568. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:07:07,361][25689] Avg episode reward: [(0, '-50.228')] [2022-07-09 09:07:07,487][26022] Updated weights on worker 0-0, policy_version 181124 (0.00087) [2022-07-09 09:07:09,450][26022] Updated weights on worker 0-0, policy_version 181134 (0.00090) [2022-07-09 09:07:10,886][26022] Updated weights on worker 0-0, policy_version 181144 (0.00085) [2022-07-09 09:07:12,385][25689] Fps is (10 sec: 5507.2, 60 sec: 5602.9, 300 sec: 5657.4). Total num frames: 185497600. Throughput: 0: 5813.6. Samples: 185503578. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:07:12,385][25689] Avg episode reward: [(0, '-50.368')] [2022-07-09 09:07:12,966][26022] Updated weights on worker 0-0, policy_version 181154 (0.00083) [2022-07-09 09:07:14,493][26022] Updated weights on worker 0-0, policy_version 181164 (0.00085) [2022-07-09 09:07:16,487][26022] Updated weights on worker 0-0, policy_version 181174 (0.00089) [2022-07-09 09:07:17,440][25689] Fps is (10 sec: 5790.8, 60 sec: 5661.0, 300 sec: 5664.0). Total num frames: 185528320. Throughput: 0: 4991.1. Samples: 185520652. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:17,441][25689] Avg episode reward: [(0, '-50.464')] [2022-07-09 09:07:18,121][26022] Updated weights on worker 0-0, policy_version 181184 (0.00091) [2022-07-09 09:07:19,964][26022] Updated weights on worker 0-0, policy_version 181194 (0.00094) [2022-07-09 09:07:21,787][26022] Updated weights on worker 0-0, policy_version 181204 (0.00086) [2022-07-09 09:07:22,501][25689] Fps is (10 sec: 5769.5, 60 sec: 5622.4, 300 sec: 5663.3). Total num frames: 185555968. Throughput: 0: 5825.6. Samples: 185554744. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:22,502][25689] Avg episode reward: [(0, '-50.197')] [2022-07-09 09:07:23,755][26022] Updated weights on worker 0-0, policy_version 181214 (0.00050) [2022-07-09 09:07:25,368][26022] Updated weights on worker 0-0, policy_version 181224 (0.00580) [2022-07-09 09:07:27,337][26022] Updated weights on worker 0-0, policy_version 181234 (0.00091) [2022-07-09 09:07:27,511][25689] Fps is (10 sec: 5592.5, 60 sec: 5640.1, 300 sec: 5656.5). Total num frames: 185584640. Throughput: 0: 5910.7. Samples: 185588668. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:27,511][25689] Avg episode reward: [(0, '-49.841')] [2022-07-09 09:07:29,020][26022] Updated weights on worker 0-0, policy_version 181244 (0.00093) [2022-07-09 09:07:30,932][26022] Updated weights on worker 0-0, policy_version 181254 (0.00085) [2022-07-09 09:07:32,541][25689] Fps is (10 sec: 5711.7, 60 sec: 5656.2, 300 sec: 5660.6). Total num frames: 185613312. Throughput: 0: 5918.9. Samples: 185622882. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:32,542][25689] Avg episode reward: [(0, '-50.480')] [2022-07-09 09:07:32,660][26022] Updated weights on worker 0-0, policy_version 181264 (0.00088) [2022-07-09 09:07:34,511][26022] Updated weights on worker 0-0, policy_version 181274 (0.00095) [2022-07-09 09:07:36,328][26022] Updated weights on worker 0-0, policy_version 181284 (0.00089) [2022-07-09 09:07:37,579][25689] Fps is (10 sec: 5593.8, 60 sec: 5625.0, 300 sec: 5654.6). Total num frames: 185640960. Throughput: 0: 5925.4. Samples: 185639982. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:37,580][25689] Avg episode reward: [(0, '-50.695')] [2022-07-09 09:07:38,278][26022] Updated weights on worker 0-0, policy_version 181294 (0.00086) [2022-07-09 09:07:39,680][26022] Updated weights on worker 0-0, policy_version 181304 (0.00114) [2022-07-09 09:07:41,965][26022] Updated weights on worker 0-0, policy_version 181314 (0.00086) [2022-07-09 09:07:42,591][25689] Fps is (10 sec: 5705.7, 60 sec: 5644.4, 300 sec: 5658.4). Total num frames: 185670656. Throughput: 0: 5940.2. Samples: 185674084. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:42,592][25689] Avg episode reward: [(0, '-51.366')] [2022-07-09 09:07:43,328][26022] Updated weights on worker 0-0, policy_version 181324 (0.00085) [2022-07-09 09:07:45,390][26022] Updated weights on worker 0-0, policy_version 181334 (0.00096) [2022-07-09 09:07:47,095][26022] Updated weights on worker 0-0, policy_version 181344 (0.00084) [2022-07-09 09:07:47,617][25689] Fps is (10 sec: 5814.6, 60 sec: 5659.6, 300 sec: 5655.3). Total num frames: 185699328. Throughput: 0: 5963.6. Samples: 185708574. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:47,619][25689] Avg episode reward: [(0, '-51.577')] [2022-07-09 09:07:48,824][26022] Updated weights on worker 0-0, policy_version 181354 (0.00092) [2022-07-09 09:07:50,671][26022] Updated weights on worker 0-0, policy_version 181364 (0.00100) [2022-07-09 09:07:52,400][26022] Updated weights on worker 0-0, policy_version 181374 (0.00080) [2022-07-09 09:07:52,623][25689] Fps is (10 sec: 5716.6, 60 sec: 5643.3, 300 sec: 5660.7). Total num frames: 185728000. Throughput: 0: 5127.3. Samples: 185725846. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:52,623][25689] Avg episode reward: [(0, '-51.126')] [2022-07-09 09:07:54,187][26022] Updated weights on worker 0-0, policy_version 181384 (0.00102) [2022-07-09 09:07:56,166][26022] Updated weights on worker 0-0, policy_version 181394 (0.00088) [2022-07-09 09:07:57,658][26022] Updated weights on worker 0-0, policy_version 181404 (0.00091) [2022-07-09 09:07:57,712][25689] Fps is (10 sec: 5883.7, 60 sec: 5697.0, 300 sec: 5662.9). Total num frames: 185758720. Throughput: 0: 5968.2. Samples: 185760138. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:07:57,712][25689] Avg episode reward: [(0, '-51.402')] [2022-07-09 09:07:59,795][26022] Updated weights on worker 0-0, policy_version 181414 (0.00089) [2022-07-09 09:08:01,457][26022] Updated weights on worker 0-0, policy_version 181424 (0.00092) [2022-07-09 09:08:02,736][25689] Fps is (10 sec: 5467.7, 60 sec: 5646.3, 300 sec: 5656.2). Total num frames: 185783296. Throughput: 0: 5863.3. Samples: 185792196. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:02,736][25689] Avg episode reward: [(0, '-50.212')] [2022-07-09 09:08:03,598][26022] Updated weights on worker 0-0, policy_version 181434 (0.00085) [2022-07-09 09:08:05,393][26022] Updated weights on worker 0-0, policy_version 181444 (0.00092) [2022-07-09 09:08:07,150][26022] Updated weights on worker 0-0, policy_version 181454 (0.00094) [2022-07-09 09:08:07,790][25689] Fps is (10 sec: 5486.8, 60 sec: 5693.7, 300 sec: 5656.1). Total num frames: 185814016. Throughput: 0: 5000.2. Samples: 185809440. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:07,790][25689] Avg episode reward: [(0, '-50.829')] [2022-07-09 09:08:09,138][26022] Updated weights on worker 0-0, policy_version 181464 (0.00086) [2022-07-09 09:08:10,510][26022] Updated weights on worker 0-0, policy_version 181474 (0.00090) [2022-07-09 09:08:12,570][26022] Updated weights on worker 0-0, policy_version 181484 (0.00081) [2022-07-09 09:08:12,792][25689] Fps is (10 sec: 5702.5, 60 sec: 5678.8, 300 sec: 5660.7). Total num frames: 185840640. Throughput: 0: 5847.2. Samples: 185843780. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:12,793][25689] Avg episode reward: [(0, '-50.498')] [2022-07-09 09:08:14,388][26022] Updated weights on worker 0-0, policy_version 181494 (0.00084) [2022-07-09 09:08:16,020][26022] Updated weights on worker 0-0, policy_version 181504 (0.00090) [2022-07-09 09:08:17,864][25689] Fps is (10 sec: 5489.1, 60 sec: 5643.4, 300 sec: 5657.7). Total num frames: 185869312. Throughput: 0: 5868.3. Samples: 185878396. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:17,864][25689] Avg episode reward: [(0, '-50.647')] [2022-07-09 09:08:17,899][26022] Updated weights on worker 0-0, policy_version 181514 (0.00089) [2022-07-09 09:08:19,499][26022] Updated weights on worker 0-0, policy_version 181524 (0.00087) [2022-07-09 09:08:21,429][26022] Updated weights on worker 0-0, policy_version 181534 (0.00085) [2022-07-09 09:08:22,900][25689] Fps is (10 sec: 5774.1, 60 sec: 5679.6, 300 sec: 5657.9). Total num frames: 185899008. Throughput: 0: 5125.7. Samples: 185895554. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:22,901][25689] Avg episode reward: [(0, '-50.735')] [2022-07-09 09:08:23,295][26022] Updated weights on worker 0-0, policy_version 181544 (0.00091) [2022-07-09 09:08:25,036][26022] Updated weights on worker 0-0, policy_version 181554 (0.00095) [2022-07-09 09:08:26,832][26022] Updated weights on worker 0-0, policy_version 181564 (0.00085) [2022-07-09 09:08:27,918][25689] Fps is (10 sec: 5703.2, 60 sec: 5661.8, 300 sec: 5654.4). Total num frames: 185926656. Throughput: 0: 5962.1. Samples: 185929448. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:27,919][25689] Avg episode reward: [(0, '-50.460')] [2022-07-09 09:08:28,771][26022] Updated weights on worker 0-0, policy_version 181574 (0.00086) [2022-07-09 09:08:30,191][26022] Updated weights on worker 0-0, policy_version 181584 (0.00091) [2022-07-09 09:08:32,457][26022] Updated weights on worker 0-0, policy_version 181594 (0.00093) [2022-07-09 09:08:32,950][25689] Fps is (10 sec: 5807.8, 60 sec: 5695.6, 300 sec: 5662.4). Total num frames: 185957376. Throughput: 0: 5950.8. Samples: 185963740. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:32,951][25689] Avg episode reward: [(0, '-50.015')] [2022-07-09 09:08:34,112][26022] Updated weights on worker 0-0, policy_version 181604 (0.00048) [2022-07-09 09:08:35,774][26022] Updated weights on worker 0-0, policy_version 181614 (0.00093) [2022-07-09 09:08:37,951][26022] Updated weights on worker 0-0, policy_version 181624 (0.00084) [2022-07-09 09:08:38,042][25689] Fps is (10 sec: 5563.5, 60 sec: 5656.7, 300 sec: 5650.9). Total num frames: 185982976. Throughput: 0: 5080.5. Samples: 185980910. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:38,042][25689] Avg episode reward: [(0, '-50.068')] [2022-07-09 09:08:39,354][26022] Updated weights on worker 0-0, policy_version 181634 (0.00089) [2022-07-09 09:08:41,410][26022] Updated weights on worker 0-0, policy_version 181644 (0.00092) [2022-07-09 09:08:42,315][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:08:42,325][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000181649_186008576.pth [2022-07-09 09:08:42,325][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000179657_183968768.pth [2022-07-09 09:08:43,075][25689] Fps is (10 sec: 5461.2, 60 sec: 5654.7, 300 sec: 5650.6). Total num frames: 186012672. Throughput: 0: 5923.6. Samples: 186015064. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:43,076][25689] Avg episode reward: [(0, '-50.085')] [2022-07-09 09:08:43,127][26022] Updated weights on worker 0-0, policy_version 181654 (0.00090) [2022-07-09 09:08:44,730][26022] Updated weights on worker 0-0, policy_version 181664 (0.00087) [2022-07-09 09:08:46,768][26022] Updated weights on worker 0-0, policy_version 181674 (0.00065) [2022-07-09 09:08:48,108][25689] Fps is (10 sec: 6001.6, 60 sec: 5687.9, 300 sec: 5660.7). Total num frames: 186043392. Throughput: 0: 5928.9. Samples: 186049152. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:48,109][25689] Avg episode reward: [(0, '-50.848')] [2022-07-09 09:08:48,318][26022] Updated weights on worker 0-0, policy_version 181684 (0.00088) [2022-07-09 09:08:50,202][26022] Updated weights on worker 0-0, policy_version 181694 (0.00090) [2022-07-09 09:08:52,238][26022] Updated weights on worker 0-0, policy_version 181704 (0.00083) [2022-07-09 09:08:53,141][25689] Fps is (10 sec: 5595.4, 60 sec: 5634.5, 300 sec: 5654.8). Total num frames: 186068992. Throughput: 0: 5074.8. Samples: 186066202. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:53,141][25689] Avg episode reward: [(0, '-50.991')] [2022-07-09 09:08:53,954][26022] Updated weights on worker 0-0, policy_version 181714 (0.00084) [2022-07-09 09:08:56,003][26022] Updated weights on worker 0-0, policy_version 181724 (0.00086) [2022-07-09 09:08:57,582][26022] Updated weights on worker 0-0, policy_version 181734 (0.00081) [2022-07-09 09:08:58,200][25689] Fps is (10 sec: 5682.2, 60 sec: 5654.2, 300 sec: 5664.3). Total num frames: 186100736. Throughput: 0: 5902.1. Samples: 186099888. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:08:58,201][25689] Avg episode reward: [(0, '-50.911')] [2022-07-09 09:08:59,535][26022] Updated weights on worker 0-0, policy_version 181744 (0.00089) [2022-07-09 09:09:01,201][26022] Updated weights on worker 0-0, policy_version 181754 (0.00082) [2022-07-09 09:09:03,220][25689] Fps is (10 sec: 5587.8, 60 sec: 5654.6, 300 sec: 5653.9). Total num frames: 186125312. Throughput: 0: 5797.7. Samples: 186131856. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:09:03,220][25689] Avg episode reward: [(0, '-51.014')] [2022-07-09 09:09:03,367][26022] Updated weights on worker 0-0, policy_version 181764 (0.00089) [2022-07-09 09:09:05,212][26022] Updated weights on worker 0-0, policy_version 181774 (0.00098) [2022-07-09 09:09:07,109][26022] Updated weights on worker 0-0, policy_version 181784 (0.01036) [2022-07-09 09:09:08,225][25689] Fps is (10 sec: 5209.4, 60 sec: 5608.4, 300 sec: 5653.9). Total num frames: 186152960. Throughput: 0: 4955.3. Samples: 186148836. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:09:08,227][25689] Avg episode reward: [(0, '-50.812')] [2022-07-09 09:09:08,684][26022] Updated weights on worker 0-0, policy_version 181794 (0.00089) [2022-07-09 09:09:10,783][26022] Updated weights on worker 0-0, policy_version 181804 (0.00089) [2022-07-09 09:09:12,392][26022] Updated weights on worker 0-0, policy_version 181814 (0.00085) [2022-07-09 09:09:13,234][25689] Fps is (10 sec: 5624.1, 60 sec: 5641.6, 300 sec: 5654.8). Total num frames: 186181632. Throughput: 0: 5813.0. Samples: 186183004. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:09:13,236][25689] Avg episode reward: [(0, '-50.865')] [2022-07-09 09:09:14,243][26022] Updated weights on worker 0-0, policy_version 181824 (0.00091) [2022-07-09 09:09:15,944][26022] Updated weights on worker 0-0, policy_version 181834 (0.00088) [2022-07-09 09:09:17,840][26022] Updated weights on worker 0-0, policy_version 181844 (0.00092) [2022-07-09 09:09:18,302][25689] Fps is (10 sec: 5589.1, 60 sec: 5625.1, 300 sec: 5650.1). Total num frames: 186209280. Throughput: 0: 5822.7. Samples: 186216934. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:09:18,303][25689] Avg episode reward: [(0, '-50.696')] [2022-07-09 09:09:19,702][26022] Updated weights on worker 0-0, policy_version 181854 (0.00098) [2022-07-09 09:09:21,649][26022] Updated weights on worker 0-0, policy_version 181864 (0.00087) [2022-07-09 09:09:23,252][26022] Updated weights on worker 0-0, policy_version 181874 (0.00086) [2022-07-09 09:09:23,403][25689] Fps is (10 sec: 5639.3, 60 sec: 5619.1, 300 sec: 5651.9). Total num frames: 186238976. Throughput: 0: 5044.2. Samples: 186233662. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2022-07-09 09:09:23,403][25689] Avg episode reward: [(0, '-50.363')] [2022-07-09 09:09:25,143][26022] Updated weights on worker 0-0, policy_version 181884 (0.00081) [2022-07-09 09:09:26,902][26022] Updated weights on worker 0-0, policy_version 181894 (0.00090) [2022-07-09 09:09:28,432][25689] Fps is (10 sec: 5761.7, 60 sec: 5634.9, 300 sec: 5648.3). Total num frames: 186267648. Throughput: 0: 5888.7. Samples: 186267830. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:28,433][25689] Avg episode reward: [(0, '-50.414')] [2022-07-09 09:09:28,764][26022] Updated weights on worker 0-0, policy_version 181904 (0.00093) [2022-07-09 09:09:30,499][26022] Updated weights on worker 0-0, policy_version 181914 (0.00084) [2022-07-09 09:09:32,240][26022] Updated weights on worker 0-0, policy_version 181924 (0.00092) [2022-07-09 09:09:33,443][25689] Fps is (10 sec: 5609.4, 60 sec: 5586.1, 300 sec: 5643.7). Total num frames: 186295296. Throughput: 0: 5898.8. Samples: 186302212. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:33,443][25689] Avg episode reward: [(0, '-50.663')] [2022-07-09 09:09:34,157][26022] Updated weights on worker 0-0, policy_version 181934 (0.00090) [2022-07-09 09:09:35,877][26022] Updated weights on worker 0-0, policy_version 181944 (0.00087) [2022-07-09 09:09:37,603][26022] Updated weights on worker 0-0, policy_version 181954 (0.00087) [2022-07-09 09:09:38,539][25689] Fps is (10 sec: 5876.5, 60 sec: 5687.2, 300 sec: 5649.3). Total num frames: 186327040. Throughput: 0: 5062.4. Samples: 186319382. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:38,540][25689] Avg episode reward: [(0, '-50.874')] [2022-07-09 09:09:39,748][26022] Updated weights on worker 0-0, policy_version 181964 (0.00094) [2022-07-09 09:09:41,223][26022] Updated weights on worker 0-0, policy_version 181974 (0.00087) [2022-07-09 09:09:43,047][26022] Updated weights on worker 0-0, policy_version 181984 (0.00085) [2022-07-09 09:09:43,571][25689] Fps is (10 sec: 5864.2, 60 sec: 5653.6, 300 sec: 5655.9). Total num frames: 186354688. Throughput: 0: 5949.7. Samples: 186353656. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:43,571][25689] Avg episode reward: [(0, '-50.924')] [2022-07-09 09:09:44,978][26022] Updated weights on worker 0-0, policy_version 181994 (0.00086) [2022-07-09 09:09:46,527][26022] Updated weights on worker 0-0, policy_version 182004 (0.00063) [2022-07-09 09:09:48,507][26022] Updated weights on worker 0-0, policy_version 182014 (0.00090) [2022-07-09 09:09:48,592][25689] Fps is (10 sec: 5500.7, 60 sec: 5603.9, 300 sec: 5645.5). Total num frames: 186382336. Throughput: 0: 5976.9. Samples: 186388320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:48,592][25689] Avg episode reward: [(0, '-51.483')] [2022-07-09 09:09:49,919][26022] Updated weights on worker 0-0, policy_version 182024 (0.00088) [2022-07-09 09:09:52,001][26022] Updated weights on worker 0-0, policy_version 182034 (0.00088) [2022-07-09 09:09:53,595][25689] Fps is (10 sec: 5720.5, 60 sec: 5674.4, 300 sec: 5653.8). Total num frames: 186412032. Throughput: 0: 5131.0. Samples: 186405612. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:53,595][25689] Avg episode reward: [(0, '-51.566')] [2022-07-09 09:09:53,640][26022] Updated weights on worker 0-0, policy_version 182044 (0.00083) [2022-07-09 09:09:55,577][26022] Updated weights on worker 0-0, policy_version 182054 (0.00081) [2022-07-09 09:09:57,474][26022] Updated weights on worker 0-0, policy_version 182064 (0.00084) [2022-07-09 09:09:58,733][25689] Fps is (10 sec: 5755.3, 60 sec: 5616.2, 300 sec: 5652.8). Total num frames: 186440704. Throughput: 0: 5938.4. Samples: 186439302. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:09:58,734][25689] Avg episode reward: [(0, '-52.147')] [2022-07-09 09:09:59,172][26022] Updated weights on worker 0-0, policy_version 182074 (0.00093) [2022-07-09 09:10:01,288][26022] Updated weights on worker 0-0, policy_version 182084 (0.00087) [2022-07-09 09:10:03,151][26022] Updated weights on worker 0-0, policy_version 182094 (0.00109) [2022-07-09 09:10:03,750][25689] Fps is (10 sec: 5344.2, 60 sec: 5633.4, 300 sec: 5646.1). Total num frames: 186466304. Throughput: 0: 5821.4. Samples: 186471128. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:03,752][25689] Avg episode reward: [(0, '-52.582')] [2022-07-09 09:10:05,192][26022] Updated weights on worker 0-0, policy_version 182104 (0.00093) [2022-07-09 09:10:06,641][26022] Updated weights on worker 0-0, policy_version 182114 (0.00081) [2022-07-09 09:10:08,655][26022] Updated weights on worker 0-0, policy_version 182124 (0.00083) [2022-07-09 09:10:08,782][25689] Fps is (10 sec: 5400.9, 60 sec: 5647.8, 300 sec: 5652.6). Total num frames: 186494976. Throughput: 0: 4937.2. Samples: 186488004. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:08,782][25689] Avg episode reward: [(0, '-52.478')] [2022-07-09 09:10:10,397][26022] Updated weights on worker 0-0, policy_version 182134 (0.00087) [2022-07-09 09:10:12,237][26022] Updated weights on worker 0-0, policy_version 182144 (0.00087) [2022-07-09 09:10:13,810][25689] Fps is (10 sec: 5700.2, 60 sec: 5646.0, 300 sec: 5647.4). Total num frames: 186523648. Throughput: 0: 5773.3. Samples: 186522320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:13,811][25689] Avg episode reward: [(0, '-51.551')] [2022-07-09 09:10:14,152][26022] Updated weights on worker 0-0, policy_version 182154 (0.00084) [2022-07-09 09:10:15,751][26022] Updated weights on worker 0-0, policy_version 182164 (0.00088) [2022-07-09 09:10:17,631][26022] Updated weights on worker 0-0, policy_version 182174 (0.00089) [2022-07-09 09:10:18,953][25689] Fps is (10 sec: 5738.8, 60 sec: 5672.9, 300 sec: 5648.2). Total num frames: 186553344. Throughput: 0: 5796.4. Samples: 186556502. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:18,953][25689] Avg episode reward: [(0, '-51.228')] [2022-07-09 09:10:19,540][26022] Updated weights on worker 0-0, policy_version 182184 (0.00086) [2022-07-09 09:10:21,246][26022] Updated weights on worker 0-0, policy_version 182194 (0.00094) [2022-07-09 09:10:23,183][26022] Updated weights on worker 0-0, policy_version 182204 (0.00081) [2022-07-09 09:10:24,017][25689] Fps is (10 sec: 5718.7, 60 sec: 5659.4, 300 sec: 5650.6). Total num frames: 186582016. Throughput: 0: 5912.2. Samples: 186590948. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:24,018][25689] Avg episode reward: [(0, '-51.162')] [2022-07-09 09:10:24,904][26022] Updated weights on worker 0-0, policy_version 182214 (0.00091) [2022-07-09 09:10:26,724][26022] Updated weights on worker 0-0, policy_version 182224 (0.00094) [2022-07-09 09:10:28,602][26022] Updated weights on worker 0-0, policy_version 182234 (0.00085) [2022-07-09 09:10:29,086][25689] Fps is (10 sec: 5659.1, 60 sec: 5655.7, 300 sec: 5643.0). Total num frames: 186610688. Throughput: 0: 5903.4. Samples: 186607866. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:29,087][25689] Avg episode reward: [(0, '-51.246')] [2022-07-09 09:10:30,239][26022] Updated weights on worker 0-0, policy_version 182244 (0.00090) [2022-07-09 09:10:32,056][26022] Updated weights on worker 0-0, policy_version 182254 (0.00089) [2022-07-09 09:10:33,833][26022] Updated weights on worker 0-0, policy_version 182264 (0.00088) [2022-07-09 09:10:34,119][25689] Fps is (10 sec: 5676.6, 60 sec: 5670.5, 300 sec: 5647.8). Total num frames: 186639360. Throughput: 0: 5900.5. Samples: 186642150. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:34,119][25689] Avg episode reward: [(0, '-51.289')] [2022-07-09 09:10:35,543][26022] Updated weights on worker 0-0, policy_version 182274 (0.00088) [2022-07-09 09:10:37,502][26022] Updated weights on worker 0-0, policy_version 182284 (0.00082) [2022-07-09 09:10:39,225][25689] Fps is (10 sec: 5655.8, 60 sec: 5619.0, 300 sec: 5638.9). Total num frames: 186668032. Throughput: 0: 5911.0. Samples: 186676330. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:39,225][25689] Avg episode reward: [(0, '-50.471')] [2022-07-09 09:10:39,316][26022] Updated weights on worker 0-0, policy_version 182294 (0.00082) [2022-07-09 09:10:40,953][26022] Updated weights on worker 0-0, policy_version 182304 (0.00610) [2022-07-09 09:10:42,449][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:10:42,462][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000182312_186687488.pth [2022-07-09 09:10:42,462][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000180323_184650752.pth [2022-07-09 09:10:42,895][26022] Updated weights on worker 0-0, policy_version 182314 (0.00062) [2022-07-09 09:10:44,251][25689] Fps is (10 sec: 5760.4, 60 sec: 5653.2, 300 sec: 5645.7). Total num frames: 186697728. Throughput: 0: 5065.6. Samples: 186693448. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:44,252][25689] Avg episode reward: [(0, '-50.868')] [2022-07-09 09:10:44,671][26022] Updated weights on worker 0-0, policy_version 182324 (0.00088) [2022-07-09 09:10:46,582][26022] Updated weights on worker 0-0, policy_version 182334 (0.00090) [2022-07-09 09:10:48,243][26022] Updated weights on worker 0-0, policy_version 182344 (0.00087) [2022-07-09 09:10:49,264][25689] Fps is (10 sec: 5711.9, 60 sec: 5654.0, 300 sec: 5645.6). Total num frames: 186725376. Throughput: 0: 5940.0. Samples: 186727724. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:49,265][25689] Avg episode reward: [(0, '-50.724')] [2022-07-09 09:10:50,059][26022] Updated weights on worker 0-0, policy_version 182354 (0.00101) [2022-07-09 09:10:51,922][26022] Updated weights on worker 0-0, policy_version 182364 (0.00087) [2022-07-09 09:10:53,787][26022] Updated weights on worker 0-0, policy_version 182374 (0.00094) [2022-07-09 09:10:54,300][25689] Fps is (10 sec: 5604.8, 60 sec: 5634.1, 300 sec: 5646.8). Total num frames: 186754048. Throughput: 0: 5931.3. Samples: 186761850. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:54,300][25689] Avg episode reward: [(0, '-50.407')] [2022-07-09 09:10:55,516][26022] Updated weights on worker 0-0, policy_version 182384 (0.00091) [2022-07-09 09:10:57,517][26022] Updated weights on worker 0-0, policy_version 182394 (0.00248) [2022-07-09 09:10:59,183][26022] Updated weights on worker 0-0, policy_version 182404 (0.00081) [2022-07-09 09:10:59,426][25689] Fps is (10 sec: 5643.1, 60 sec: 5635.2, 300 sec: 5641.6). Total num frames: 186782720. Throughput: 0: 5060.1. Samples: 186778550. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:10:59,426][25689] Avg episode reward: [(0, '-49.624')] [2022-07-09 09:11:00,994][26022] Updated weights on worker 0-0, policy_version 182414 (0.00084) [2022-07-09 09:11:03,128][26022] Updated weights on worker 0-0, policy_version 182424 (0.00085) [2022-07-09 09:11:04,448][25689] Fps is (10 sec: 5449.1, 60 sec: 5651.6, 300 sec: 5646.3). Total num frames: 186809344. Throughput: 0: 5801.8. Samples: 186810622. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:04,448][25689] Avg episode reward: [(0, '-49.653')] [2022-07-09 09:11:04,906][26022] Updated weights on worker 0-0, policy_version 182434 (0.00065) [2022-07-09 09:11:06,766][26022] Updated weights on worker 0-0, policy_version 182444 (0.00088) [2022-07-09 09:11:08,419][26022] Updated weights on worker 0-0, policy_version 182454 (0.00091) [2022-07-09 09:11:09,474][25689] Fps is (10 sec: 5503.4, 60 sec: 5652.2, 300 sec: 5643.3). Total num frames: 186838016. Throughput: 0: 5806.3. Samples: 186845066. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:09,474][25689] Avg episode reward: [(0, '-49.606')] [2022-07-09 09:11:10,282][26022] Updated weights on worker 0-0, policy_version 182464 (0.00081) [2022-07-09 09:11:12,009][26022] Updated weights on worker 0-0, policy_version 182474 (0.00089) [2022-07-09 09:11:13,799][26022] Updated weights on worker 0-0, policy_version 182484 (0.00066) [2022-07-09 09:11:14,481][25689] Fps is (10 sec: 5919.4, 60 sec: 5687.9, 300 sec: 5656.1). Total num frames: 186868736. Throughput: 0: 4986.3. Samples: 186862480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:14,482][25689] Avg episode reward: [(0, '-49.542')] [2022-07-09 09:11:15,573][26022] Updated weights on worker 0-0, policy_version 182494 (0.00084) [2022-07-09 09:11:17,156][26022] Updated weights on worker 0-0, policy_version 182504 (0.00083) [2022-07-09 09:11:19,372][26022] Updated weights on worker 0-0, policy_version 182514 (0.00084) [2022-07-09 09:11:19,526][25689] Fps is (10 sec: 5704.3, 60 sec: 5646.3, 300 sec: 5645.1). Total num frames: 186895360. Throughput: 0: 5899.3. Samples: 186897128. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:19,527][25689] Avg episode reward: [(0, '-51.846')] [2022-07-09 09:11:20,903][26022] Updated weights on worker 0-0, policy_version 182524 (0.00086) [2022-07-09 09:11:22,644][26022] Updated weights on worker 0-0, policy_version 182534 (0.00055) [2022-07-09 09:11:24,537][25689] Fps is (10 sec: 5600.5, 60 sec: 5668.2, 300 sec: 5652.1). Total num frames: 186925056. Throughput: 0: 6031.0. Samples: 186931782. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:24,538][25689] Avg episode reward: [(0, '-51.875')] [2022-07-09 09:11:24,545][26022] Updated weights on worker 0-0, policy_version 182544 (0.00085) [2022-07-09 09:11:26,163][26022] Updated weights on worker 0-0, policy_version 182554 (0.00093) [2022-07-09 09:11:28,350][26022] Updated weights on worker 0-0, policy_version 182564 (0.00090) [2022-07-09 09:11:29,568][25689] Fps is (10 sec: 5812.5, 60 sec: 5671.8, 300 sec: 5655.3). Total num frames: 186953728. Throughput: 0: 5168.7. Samples: 186948928. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:29,568][25689] Avg episode reward: [(0, '-53.076')] [2022-07-09 09:11:30,016][26022] Updated weights on worker 0-0, policy_version 182574 (0.00099) [2022-07-09 09:11:31,727][26022] Updated weights on worker 0-0, policy_version 182584 (0.00088) [2022-07-09 09:11:33,483][26022] Updated weights on worker 0-0, policy_version 182594 (0.00084) [2022-07-09 09:11:34,591][25689] Fps is (10 sec: 5602.1, 60 sec: 5655.8, 300 sec: 5649.3). Total num frames: 186981376. Throughput: 0: 5994.3. Samples: 186983022. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 09:11:34,592][25689] Avg episode reward: [(0, '-53.049')] [2022-07-09 09:11:35,082][26022] Updated weights on worker 0-0, policy_version 182604 (0.00097) [2022-07-09 09:11:37,250][26022] Updated weights on worker 0-0, policy_version 182614 (0.00095) [2022-07-09 09:11:38,724][26022] Updated weights on worker 0-0, policy_version 182624 (0.00090) [2022-07-09 09:11:39,669][25689] Fps is (10 sec: 5677.1, 60 sec: 5675.3, 300 sec: 5652.0). Total num frames: 187011072. Throughput: 0: 5981.1. Samples: 187017604. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:11:39,670][25689] Avg episode reward: [(0, '-54.094')] [2022-07-09 09:11:40,616][26022] Updated weights on worker 0-0, policy_version 182634 (0.00085) [2022-07-09 09:11:42,399][26022] Updated weights on worker 0-0, policy_version 182644 (0.00088) [2022-07-09 09:11:44,141][26022] Updated weights on worker 0-0, policy_version 182654 (0.00103) [2022-07-09 09:11:44,689][25689] Fps is (10 sec: 5881.2, 60 sec: 5675.9, 300 sec: 5658.6). Total num frames: 187040768. Throughput: 0: 5108.1. Samples: 187034720. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:11:44,690][25689] Avg episode reward: [(0, '-52.778')] [2022-07-09 09:11:46,114][26022] Updated weights on worker 0-0, policy_version 182664 (0.00085) [2022-07-09 09:11:47,646][26022] Updated weights on worker 0-0, policy_version 182674 (0.00087) [2022-07-09 09:11:49,455][26022] Updated weights on worker 0-0, policy_version 182684 (0.00080) [2022-07-09 09:11:49,696][25689] Fps is (10 sec: 5821.1, 60 sec: 5693.4, 300 sec: 5655.3). Total num frames: 187069440. Throughput: 0: 5988.0. Samples: 187069454. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:11:49,696][25689] Avg episode reward: [(0, '-51.775')] [2022-07-09 09:11:51,383][26022] Updated weights on worker 0-0, policy_version 182694 (0.00091) [2022-07-09 09:11:52,894][26022] Updated weights on worker 0-0, policy_version 182704 (0.00092) [2022-07-09 09:11:54,723][25689] Fps is (10 sec: 5613.2, 60 sec: 5677.3, 300 sec: 5657.0). Total num frames: 187097088. Throughput: 0: 6025.4. Samples: 187104326. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:11:54,723][25689] Avg episode reward: [(0, '-51.920')] [2022-07-09 09:11:54,861][26022] Updated weights on worker 0-0, policy_version 182714 (0.00089) [2022-07-09 09:11:56,456][26022] Updated weights on worker 0-0, policy_version 182724 (0.00082) [2022-07-09 09:11:58,484][26022] Updated weights on worker 0-0, policy_version 182734 (0.00081) [2022-07-09 09:11:59,801][25689] Fps is (10 sec: 5674.5, 60 sec: 5698.7, 300 sec: 5662.9). Total num frames: 187126784. Throughput: 0: 5164.7. Samples: 187121582. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:11:59,802][25689] Avg episode reward: [(0, '-51.647')] [2022-07-09 09:12:00,147][26022] Updated weights on worker 0-0, policy_version 182744 (0.00090) [2022-07-09 09:12:02,386][26022] Updated weights on worker 0-0, policy_version 182754 (0.00083) [2022-07-09 09:12:04,045][26022] Updated weights on worker 0-0, policy_version 182764 (0.00092) [2022-07-09 09:12:04,841][25689] Fps is (10 sec: 5667.6, 60 sec: 5714.0, 300 sec: 5662.5). Total num frames: 187154432. Throughput: 0: 5909.0. Samples: 187153794. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:04,841][25689] Avg episode reward: [(0, '-51.583')] [2022-07-09 09:12:05,975][26022] Updated weights on worker 0-0, policy_version 182774 (0.00082) [2022-07-09 09:12:07,579][26022] Updated weights on worker 0-0, policy_version 182784 (0.00051) [2022-07-09 09:12:09,435][26022] Updated weights on worker 0-0, policy_version 182794 (0.00094) [2022-07-09 09:12:09,882][25689] Fps is (10 sec: 5688.3, 60 sec: 5729.5, 300 sec: 5669.0). Total num frames: 187184128. Throughput: 0: 5893.1. Samples: 187188416. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:09,883][25689] Avg episode reward: [(0, '-50.365')] [2022-07-09 09:12:11,395][26022] Updated weights on worker 0-0, policy_version 182804 (0.00091) [2022-07-09 09:12:12,799][26022] Updated weights on worker 0-0, policy_version 182814 (0.00086) [2022-07-09 09:12:14,897][25689] Fps is (10 sec: 5600.5, 60 sec: 5661.0, 300 sec: 5656.1). Total num frames: 187210752. Throughput: 0: 5031.4. Samples: 187205830. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:14,897][25689] Avg episode reward: [(0, '-51.174')] [2022-07-09 09:12:14,980][26022] Updated weights on worker 0-0, policy_version 182824 (0.00088) [2022-07-09 09:12:16,411][26022] Updated weights on worker 0-0, policy_version 182834 (0.00083) [2022-07-09 09:12:18,335][26022] Updated weights on worker 0-0, policy_version 182844 (0.00091) [2022-07-09 09:12:20,013][25689] Fps is (10 sec: 5660.2, 60 sec: 5722.1, 300 sec: 5665.3). Total num frames: 187241472. Throughput: 0: 5888.9. Samples: 187240608. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:20,014][25689] Avg episode reward: [(0, '-50.658')] [2022-07-09 09:12:20,145][26022] Updated weights on worker 0-0, policy_version 182854 (0.00085) [2022-07-09 09:12:21,775][26022] Updated weights on worker 0-0, policy_version 182864 (0.00089) [2022-07-09 09:12:23,821][26022] Updated weights on worker 0-0, policy_version 182874 (0.00087) [2022-07-09 09:12:25,025][25689] Fps is (10 sec: 6066.4, 60 sec: 5739.0, 300 sec: 5672.2). Total num frames: 187272192. Throughput: 0: 6028.1. Samples: 187275466. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:25,025][25689] Avg episode reward: [(0, '-51.246')] [2022-07-09 09:12:25,368][26022] Updated weights on worker 0-0, policy_version 182884 (0.00091) [2022-07-09 09:12:27,057][26022] Updated weights on worker 0-0, policy_version 182894 (0.00084) [2022-07-09 09:12:29,139][26022] Updated weights on worker 0-0, policy_version 182904 (0.00095) [2022-07-09 09:12:30,093][25689] Fps is (10 sec: 5892.4, 60 sec: 5735.4, 300 sec: 5671.5). Total num frames: 187300864. Throughput: 0: 5163.6. Samples: 187292774. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:30,093][25689] Avg episode reward: [(0, '-50.972')] [2022-07-09 09:12:30,684][26022] Updated weights on worker 0-0, policy_version 182914 (0.00086) [2022-07-09 09:12:32,534][26022] Updated weights on worker 0-0, policy_version 182924 (0.00525) [2022-07-09 09:12:34,343][26022] Updated weights on worker 0-0, policy_version 182934 (0.00087) [2022-07-09 09:12:35,097][25689] Fps is (10 sec: 5490.1, 60 sec: 5720.3, 300 sec: 5668.7). Total num frames: 187327488. Throughput: 0: 6018.3. Samples: 187327398. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:35,097][25689] Avg episode reward: [(0, '-51.263')] [2022-07-09 09:12:35,933][26022] Updated weights on worker 0-0, policy_version 182944 (0.00088) [2022-07-09 09:12:37,913][26022] Updated weights on worker 0-0, policy_version 182954 (0.00089) [2022-07-09 09:12:39,519][26022] Updated weights on worker 0-0, policy_version 182964 (0.00089) [2022-07-09 09:12:40,197][25689] Fps is (10 sec: 5675.3, 60 sec: 5735.1, 300 sec: 5670.4). Total num frames: 187358208. Throughput: 0: 6028.4. Samples: 187362282. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:40,199][25689] Avg episode reward: [(0, '-51.126')] [2022-07-09 09:12:41,181][26022] Updated weights on worker 0-0, policy_version 182974 (0.00048) [2022-07-09 09:12:42,768][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:12:42,782][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000182982_187373568.pth [2022-07-09 09:12:42,783][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000180984_185327616.pth [2022-07-09 09:12:43,250][26022] Updated weights on worker 0-0, policy_version 182984 (0.00091) [2022-07-09 09:12:44,763][26022] Updated weights on worker 0-0, policy_version 182994 (0.00089) [2022-07-09 09:12:45,287][25689] Fps is (10 sec: 5828.3, 60 sec: 5711.6, 300 sec: 5669.2). Total num frames: 187386880. Throughput: 0: 5988.6. Samples: 187396808. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:45,288][25689] Avg episode reward: [(0, '-51.595')] [2022-07-09 09:12:46,715][26022] Updated weights on worker 0-0, policy_version 183004 (0.00095) [2022-07-09 09:12:48,473][26022] Updated weights on worker 0-0, policy_version 183014 (0.00084) [2022-07-09 09:12:50,061][26022] Updated weights on worker 0-0, policy_version 183024 (0.00087) [2022-07-09 09:12:50,295][25689] Fps is (10 sec: 5780.1, 60 sec: 5728.4, 300 sec: 5672.6). Total num frames: 187416576. Throughput: 0: 6012.4. Samples: 187414238. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:50,295][25689] Avg episode reward: [(0, '-51.882')] [2022-07-09 09:12:51,939][26022] Updated weights on worker 0-0, policy_version 183034 (0.00083) [2022-07-09 09:12:53,727][26022] Updated weights on worker 0-0, policy_version 183044 (0.00095) [2022-07-09 09:12:55,351][25689] Fps is (10 sec: 5901.3, 60 sec: 5759.4, 300 sec: 5669.8). Total num frames: 187446272. Throughput: 0: 6033.2. Samples: 187449598. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:12:55,352][25689] Avg episode reward: [(0, '-51.694')] [2022-07-09 09:12:55,474][26022] Updated weights on worker 0-0, policy_version 183054 (0.00086) [2022-07-09 09:12:57,163][26022] Updated weights on worker 0-0, policy_version 183064 (0.00078) [2022-07-09 09:12:58,902][26022] Updated weights on worker 0-0, policy_version 183074 (0.00071) [2022-07-09 09:13:00,466][25689] Fps is (10 sec: 5738.5, 60 sec: 5739.1, 300 sec: 5681.8). Total num frames: 187474944. Throughput: 0: 6017.2. Samples: 187484246. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:00,467][25689] Avg episode reward: [(0, '-51.604')] [2022-07-09 09:13:00,638][26022] Updated weights on worker 0-0, policy_version 183084 (0.00084) [2022-07-09 09:13:02,771][26022] Updated weights on worker 0-0, policy_version 183094 (0.00086) [2022-07-09 09:13:04,771][26022] Updated weights on worker 0-0, policy_version 183104 (0.00085) [2022-07-09 09:13:05,509][25689] Fps is (10 sec: 5645.6, 60 sec: 5755.7, 300 sec: 5675.2). Total num frames: 187503616. Throughput: 0: 5068.4. Samples: 187499298. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:05,509][25689] Avg episode reward: [(0, '-51.806')] [2022-07-09 09:13:06,441][26022] Updated weights on worker 0-0, policy_version 183114 (0.00099) [2022-07-09 09:13:08,150][26022] Updated weights on worker 0-0, policy_version 183124 (0.00090) [2022-07-09 09:13:10,014][26022] Updated weights on worker 0-0, policy_version 183134 (0.00094) [2022-07-09 09:13:10,527][25689] Fps is (10 sec: 5699.9, 60 sec: 5741.0, 300 sec: 5681.7). Total num frames: 187532288. Throughput: 0: 5921.4. Samples: 187534038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:10,527][25689] Avg episode reward: [(0, '-51.596')] [2022-07-09 09:13:11,802][26022] Updated weights on worker 0-0, policy_version 183144 (0.00081) [2022-07-09 09:13:13,574][26022] Updated weights on worker 0-0, policy_version 183154 (0.00082) [2022-07-09 09:13:15,345][26022] Updated weights on worker 0-0, policy_version 183164 (0.00084) [2022-07-09 09:13:15,547][25689] Fps is (10 sec: 5712.6, 60 sec: 5774.3, 300 sec: 5682.7). Total num frames: 187560960. Throughput: 0: 5911.7. Samples: 187568988. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:15,547][25689] Avg episode reward: [(0, '-50.994')] [2022-07-09 09:13:17,145][26022] Updated weights on worker 0-0, policy_version 183174 (0.00085) [2022-07-09 09:13:18,843][26022] Updated weights on worker 0-0, policy_version 183184 (0.00084) [2022-07-09 09:13:20,502][26022] Updated weights on worker 0-0, policy_version 183194 (0.00077) [2022-07-09 09:13:20,641][25689] Fps is (10 sec: 5770.6, 60 sec: 5759.4, 300 sec: 5681.6). Total num frames: 187590656. Throughput: 0: 5063.7. Samples: 187586408. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:20,642][25689] Avg episode reward: [(0, '-50.266')] [2022-07-09 09:13:22,336][26022] Updated weights on worker 0-0, policy_version 183204 (0.00086) [2022-07-09 09:13:24,067][26022] Updated weights on worker 0-0, policy_version 183214 (0.00085) [2022-07-09 09:13:25,654][25689] Fps is (10 sec: 5876.4, 60 sec: 5742.5, 300 sec: 5688.6). Total num frames: 187620352. Throughput: 0: 6066.2. Samples: 187621502. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:25,654][25689] Avg episode reward: [(0, '-50.392')] [2022-07-09 09:13:25,774][26022] Updated weights on worker 0-0, policy_version 183224 (0.00090) [2022-07-09 09:13:27,731][26022] Updated weights on worker 0-0, policy_version 183234 (0.00103) [2022-07-09 09:13:29,332][26022] Updated weights on worker 0-0, policy_version 183244 (0.00090) [2022-07-09 09:13:30,672][25689] Fps is (10 sec: 5717.1, 60 sec: 5730.3, 300 sec: 5678.5). Total num frames: 187648000. Throughput: 0: 6053.6. Samples: 187655988. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:30,672][25689] Avg episode reward: [(0, '-49.711')] [2022-07-09 09:13:31,091][26022] Updated weights on worker 0-0, policy_version 183254 (0.00096) [2022-07-09 09:13:32,897][26022] Updated weights on worker 0-0, policy_version 183264 (0.00085) [2022-07-09 09:13:34,624][26022] Updated weights on worker 0-0, policy_version 183274 (0.00085) [2022-07-09 09:13:35,682][25689] Fps is (10 sec: 5615.9, 60 sec: 5763.5, 300 sec: 5690.4). Total num frames: 187676672. Throughput: 0: 5183.6. Samples: 187673362. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:35,684][25689] Avg episode reward: [(0, '-49.014')] [2022-07-09 09:13:36,537][26022] Updated weights on worker 0-0, policy_version 183284 (0.00085) [2022-07-09 09:13:38,263][26022] Updated weights on worker 0-0, policy_version 183294 (0.00086) [2022-07-09 09:13:39,921][26022] Updated weights on worker 0-0, policy_version 183304 (0.00092) [2022-07-09 09:13:40,804][25689] Fps is (10 sec: 5861.7, 60 sec: 5761.4, 300 sec: 5692.2). Total num frames: 187707392. Throughput: 0: 6057.8. Samples: 187708548. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-09 09:13:40,804][25689] Avg episode reward: [(0, '-48.990')] [2022-07-09 09:13:41,722][26022] Updated weights on worker 0-0, policy_version 183314 (0.00087) [2022-07-09 09:13:43,488][26022] Updated weights on worker 0-0, policy_version 183324 (0.00078) [2022-07-09 09:13:45,398][26022] Updated weights on worker 0-0, policy_version 183334 (0.00085) [2022-07-09 09:13:45,890][25689] Fps is (10 sec: 5818.6, 60 sec: 5761.9, 300 sec: 5684.3). Total num frames: 187736064. Throughput: 0: 6024.3. Samples: 187743410. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:13:45,890][25689] Avg episode reward: [(0, '-49.182')] [2022-07-09 09:13:46,979][26022] Updated weights on worker 0-0, policy_version 183344 (0.00092) [2022-07-09 09:13:48,849][26022] Updated weights on worker 0-0, policy_version 183354 (0.00087) [2022-07-09 09:13:50,583][26022] Updated weights on worker 0-0, policy_version 183364 (0.00092) [2022-07-09 09:13:50,955][25689] Fps is (10 sec: 5750.1, 60 sec: 5756.4, 300 sec: 5697.4). Total num frames: 187765760. Throughput: 0: 5169.1. Samples: 187760832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:13:50,955][25689] Avg episode reward: [(0, '-48.850')] [2022-07-09 09:13:52,335][26022] Updated weights on worker 0-0, policy_version 183374 (0.00498) [2022-07-09 09:13:54,329][26022] Updated weights on worker 0-0, policy_version 183384 (0.00103) [2022-07-09 09:13:55,751][26022] Updated weights on worker 0-0, policy_version 183394 (0.00084) [2022-07-09 09:13:55,961][25689] Fps is (10 sec: 5998.8, 60 sec: 5778.1, 300 sec: 5695.0). Total num frames: 187796480. Throughput: 0: 6023.2. Samples: 187795504. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:13:55,961][25689] Avg episode reward: [(0, '-49.302')] [2022-07-09 09:13:57,829][26022] Updated weights on worker 0-0, policy_version 183404 (0.00084) [2022-07-09 09:13:59,363][26022] Updated weights on worker 0-0, policy_version 183414 (0.00102) [2022-07-09 09:14:01,049][25689] Fps is (10 sec: 5782.1, 60 sec: 5763.7, 300 sec: 5704.0). Total num frames: 187824128. Throughput: 0: 6008.9. Samples: 187830200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:01,050][25689] Avg episode reward: [(0, '-49.695')] [2022-07-09 09:14:01,244][26022] Updated weights on worker 0-0, policy_version 183424 (0.00097) [2022-07-09 09:14:03,272][26022] Updated weights on worker 0-0, policy_version 183434 (0.00085) [2022-07-09 09:14:05,056][26022] Updated weights on worker 0-0, policy_version 183444 (0.00087) [2022-07-09 09:14:06,055][25689] Fps is (10 sec: 5579.7, 60 sec: 5767.3, 300 sec: 5707.4). Total num frames: 187852800. Throughput: 0: 5063.2. Samples: 187845510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:06,055][25689] Avg episode reward: [(0, '-49.725')] [2022-07-09 09:14:06,847][26022] Updated weights on worker 0-0, policy_version 183454 (0.00101) [2022-07-09 09:14:08,736][26022] Updated weights on worker 0-0, policy_version 183464 (0.00087) [2022-07-09 09:14:10,243][26022] Updated weights on worker 0-0, policy_version 183474 (0.00083) [2022-07-09 09:14:11,101][25689] Fps is (10 sec: 5603.1, 60 sec: 5747.7, 300 sec: 5703.3). Total num frames: 187880448. Throughput: 0: 5933.8. Samples: 187880376. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:11,102][25689] Avg episode reward: [(0, '-50.730')] [2022-07-09 09:14:12,231][26022] Updated weights on worker 0-0, policy_version 183484 (0.00082) [2022-07-09 09:14:13,951][26022] Updated weights on worker 0-0, policy_version 183494 (0.00089) [2022-07-09 09:14:15,798][26022] Updated weights on worker 0-0, policy_version 183504 (0.00100) [2022-07-09 09:14:16,107][25689] Fps is (10 sec: 5704.6, 60 sec: 5765.9, 300 sec: 5711.3). Total num frames: 187910144. Throughput: 0: 5939.9. Samples: 187915168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:16,107][25689] Avg episode reward: [(0, '-51.274')] [2022-07-09 09:14:17,494][26022] Updated weights on worker 0-0, policy_version 183514 (0.00089) [2022-07-09 09:14:19,285][26022] Updated weights on worker 0-0, policy_version 183524 (0.00961) [2022-07-09 09:14:21,037][26022] Updated weights on worker 0-0, policy_version 183534 (0.00083) [2022-07-09 09:14:21,211][25689] Fps is (10 sec: 5874.5, 60 sec: 5765.0, 300 sec: 5711.3). Total num frames: 187939840. Throughput: 0: 5065.9. Samples: 187932338. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:21,212][25689] Avg episode reward: [(0, '-51.330')] [2022-07-09 09:14:22,910][26022] Updated weights on worker 0-0, policy_version 183544 (0.00101) [2022-07-09 09:14:24,458][26022] Updated weights on worker 0-0, policy_version 183554 (0.00091) [2022-07-09 09:14:26,304][25689] Fps is (10 sec: 5623.4, 60 sec: 5723.5, 300 sec: 5706.6). Total num frames: 187967488. Throughput: 0: 5989.6. Samples: 187966798. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:26,305][25689] Avg episode reward: [(0, '-50.849')] [2022-07-09 09:14:26,485][26022] Updated weights on worker 0-0, policy_version 183564 (0.00085) [2022-07-09 09:14:28,304][26022] Updated weights on worker 0-0, policy_version 183574 (0.00104) [2022-07-09 09:14:29,927][26022] Updated weights on worker 0-0, policy_version 183584 (0.00091) [2022-07-09 09:14:31,313][25689] Fps is (10 sec: 5676.4, 60 sec: 5758.2, 300 sec: 5713.5). Total num frames: 187997184. Throughput: 0: 5970.4. Samples: 188001052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:31,314][25689] Avg episode reward: [(0, '-51.350')] [2022-07-09 09:14:32,010][26022] Updated weights on worker 0-0, policy_version 183594 (0.00092) [2022-07-09 09:14:33,431][26022] Updated weights on worker 0-0, policy_version 183604 (0.00094) [2022-07-09 09:14:35,324][26022] Updated weights on worker 0-0, policy_version 183614 (0.00082) [2022-07-09 09:14:36,346][25689] Fps is (10 sec: 5914.7, 60 sec: 5773.0, 300 sec: 5707.9). Total num frames: 188026880. Throughput: 0: 5114.8. Samples: 188018686. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:36,346][25689] Avg episode reward: [(0, '-51.132')] [2022-07-09 09:14:37,116][26022] Updated weights on worker 0-0, policy_version 183624 (0.00088) [2022-07-09 09:14:38,796][26022] Updated weights on worker 0-0, policy_version 183634 (0.00083) [2022-07-09 09:14:40,619][26022] Updated weights on worker 0-0, policy_version 183644 (0.00090) [2022-07-09 09:14:41,468][25689] Fps is (10 sec: 5747.6, 60 sec: 5739.1, 300 sec: 5709.6). Total num frames: 188055552. Throughput: 0: 5978.2. Samples: 188053440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:41,469][25689] Avg episode reward: [(0, '-50.845')] [2022-07-09 09:14:42,295][26022] Updated weights on worker 0-0, policy_version 183654 (0.00085) [2022-07-09 09:14:42,807][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:14:42,823][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000183656_188063744.pth [2022-07-09 09:14:42,826][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000181649_186008576.pth [2022-07-09 09:14:44,107][26022] Updated weights on worker 0-0, policy_version 183664 (0.00084) [2022-07-09 09:14:46,107][26022] Updated weights on worker 0-0, policy_version 183674 (0.00086) [2022-07-09 09:14:46,507][25689] Fps is (10 sec: 5643.5, 60 sec: 5743.6, 300 sec: 5712.7). Total num frames: 188084224. Throughput: 0: 6008.0. Samples: 188088176. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:46,507][25689] Avg episode reward: [(0, '-51.258')] [2022-07-09 09:14:47,550][26022] Updated weights on worker 0-0, policy_version 183684 (0.00086) [2022-07-09 09:14:49,610][26022] Updated weights on worker 0-0, policy_version 183694 (0.00089) [2022-07-09 09:14:51,251][26022] Updated weights on worker 0-0, policy_version 183704 (0.00086) [2022-07-09 09:14:51,524][25689] Fps is (10 sec: 5702.7, 60 sec: 5731.2, 300 sec: 5709.0). Total num frames: 188112896. Throughput: 0: 6037.9. Samples: 188123082. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:51,525][25689] Avg episode reward: [(0, '-51.552')] [2022-07-09 09:14:52,866][26022] Updated weights on worker 0-0, policy_version 183714 (0.00086) [2022-07-09 09:14:54,744][26022] Updated weights on worker 0-0, policy_version 183724 (0.00083) [2022-07-09 09:14:56,537][25689] Fps is (10 sec: 5819.0, 60 sec: 5713.6, 300 sec: 5714.8). Total num frames: 188142592. Throughput: 0: 6044.6. Samples: 188140736. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:14:56,538][25689] Avg episode reward: [(0, '-51.891')] [2022-07-09 09:14:56,557][26022] Updated weights on worker 0-0, policy_version 183734 (0.00084) [2022-07-09 09:14:58,323][26022] Updated weights on worker 0-0, policy_version 183744 (0.00089) [2022-07-09 09:15:00,183][26022] Updated weights on worker 0-0, policy_version 183754 (0.00088) [2022-07-09 09:15:01,639][25689] Fps is (10 sec: 5871.7, 60 sec: 5746.2, 300 sec: 5726.9). Total num frames: 188172288. Throughput: 0: 6030.0. Samples: 188175068. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:01,640][25689] Avg episode reward: [(0, '-51.453')] [2022-07-09 09:15:02,008][26022] Updated weights on worker 0-0, policy_version 183764 (0.00098) [2022-07-09 09:15:04,127][26022] Updated weights on worker 0-0, policy_version 183774 (0.00084) [2022-07-09 09:15:05,884][26022] Updated weights on worker 0-0, policy_version 183784 (0.00089) [2022-07-09 09:15:06,644][25689] Fps is (10 sec: 5572.4, 60 sec: 5712.4, 300 sec: 5720.5). Total num frames: 188198912. Throughput: 0: 5913.4. Samples: 188207258. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:06,645][25689] Avg episode reward: [(0, '-51.709')] [2022-07-09 09:15:07,602][26022] Updated weights on worker 0-0, policy_version 183794 (0.00087) [2022-07-09 09:15:09,463][26022] Updated weights on worker 0-0, policy_version 183804 (0.00083) [2022-07-09 09:15:11,238][26022] Updated weights on worker 0-0, policy_version 183814 (0.00087) [2022-07-09 09:15:11,741][25689] Fps is (10 sec: 5575.0, 60 sec: 5741.4, 300 sec: 5722.7). Total num frames: 188228608. Throughput: 0: 5029.6. Samples: 188224766. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:11,742][25689] Avg episode reward: [(0, '-51.456')] [2022-07-09 09:15:12,781][26022] Updated weights on worker 0-0, policy_version 183824 (0.00084) [2022-07-09 09:15:14,819][26022] Updated weights on worker 0-0, policy_version 183834 (0.00084) [2022-07-09 09:15:16,499][26022] Updated weights on worker 0-0, policy_version 183844 (0.00085) [2022-07-09 09:15:16,771][25689] Fps is (10 sec: 5865.1, 60 sec: 5739.2, 300 sec: 5724.8). Total num frames: 188258304. Throughput: 0: 5882.2. Samples: 188259752. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:16,771][25689] Avg episode reward: [(0, '-51.795')] [2022-07-09 09:15:18,164][26022] Updated weights on worker 0-0, policy_version 183854 (0.00086) [2022-07-09 09:15:19,966][26022] Updated weights on worker 0-0, policy_version 183864 (0.00090) [2022-07-09 09:15:21,653][26022] Updated weights on worker 0-0, policy_version 183874 (0.00095) [2022-07-09 09:15:21,843][25689] Fps is (10 sec: 5778.1, 60 sec: 5725.3, 300 sec: 5724.7). Total num frames: 188286976. Throughput: 0: 5903.1. Samples: 188294334. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:21,843][25689] Avg episode reward: [(0, '-52.302')] [2022-07-09 09:15:23,516][26022] Updated weights on worker 0-0, policy_version 183884 (0.00091) [2022-07-09 09:15:25,263][26022] Updated weights on worker 0-0, policy_version 183894 (0.00093) [2022-07-09 09:15:26,876][25689] Fps is (10 sec: 5573.1, 60 sec: 5731.0, 300 sec: 5721.9). Total num frames: 188314624. Throughput: 0: 5163.8. Samples: 188311732. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:26,877][25689] Avg episode reward: [(0, '-52.012')] [2022-07-09 09:15:27,133][26022] Updated weights on worker 0-0, policy_version 183904 (0.00080) [2022-07-09 09:15:28,875][26022] Updated weights on worker 0-0, policy_version 183914 (0.00092) [2022-07-09 09:15:30,522][26022] Updated weights on worker 0-0, policy_version 183924 (0.00429) [2022-07-09 09:15:31,950][25689] Fps is (10 sec: 5673.7, 60 sec: 5724.9, 300 sec: 5724.6). Total num frames: 188344320. Throughput: 0: 6009.6. Samples: 188346212. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:31,950][25689] Avg episode reward: [(0, '-51.709')] [2022-07-09 09:15:32,493][26022] Updated weights on worker 0-0, policy_version 183934 (0.00089) [2022-07-09 09:15:34,154][26022] Updated weights on worker 0-0, policy_version 183944 (0.00083) [2022-07-09 09:15:36,090][26022] Updated weights on worker 0-0, policy_version 183954 (0.00093) [2022-07-09 09:15:36,971][25689] Fps is (10 sec: 5883.3, 60 sec: 5725.9, 300 sec: 5729.7). Total num frames: 188374016. Throughput: 0: 5997.5. Samples: 188380906. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:36,972][25689] Avg episode reward: [(0, '-52.453')] [2022-07-09 09:15:37,733][26022] Updated weights on worker 0-0, policy_version 183964 (0.00088) [2022-07-09 09:15:39,329][26022] Updated weights on worker 0-0, policy_version 183974 (0.00086) [2022-07-09 09:15:41,498][26022] Updated weights on worker 0-0, policy_version 183984 (0.00084) [2022-07-09 09:15:42,086][25689] Fps is (10 sec: 5859.4, 60 sec: 5743.5, 300 sec: 5728.0). Total num frames: 188403712. Throughput: 0: 5139.3. Samples: 188398370. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:42,087][25689] Avg episode reward: [(0, '-51.321')] [2022-07-09 09:15:42,942][26022] Updated weights on worker 0-0, policy_version 183994 (0.00091) [2022-07-09 09:15:45,041][26022] Updated weights on worker 0-0, policy_version 184004 (0.00110) [2022-07-09 09:15:46,733][26022] Updated weights on worker 0-0, policy_version 184014 (0.00089) [2022-07-09 09:15:47,108][25689] Fps is (10 sec: 5657.0, 60 sec: 5728.2, 300 sec: 5727.8). Total num frames: 188431360. Throughput: 0: 5970.6. Samples: 188432530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:47,109][25689] Avg episode reward: [(0, '-50.024')] [2022-07-09 09:15:48,499][26022] Updated weights on worker 0-0, policy_version 184024 (0.00051) [2022-07-09 09:15:50,293][26022] Updated weights on worker 0-0, policy_version 184034 (0.00095) [2022-07-09 09:15:52,143][25689] Fps is (10 sec: 5600.4, 60 sec: 5726.6, 300 sec: 5727.8). Total num frames: 188460032. Throughput: 0: 6000.3. Samples: 188467374. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 09:15:52,143][25689] Avg episode reward: [(0, '-49.911')] [2022-07-09 09:15:52,227][26022] Updated weights on worker 0-0, policy_version 184044 (0.00082) [2022-07-09 09:15:53,603][26022] Updated weights on worker 0-0, policy_version 184054 (0.00090) [2022-07-09 09:15:55,681][26022] Updated weights on worker 0-0, policy_version 184064 (0.00087) [2022-07-09 09:15:57,178][25689] Fps is (10 sec: 5898.1, 60 sec: 5741.4, 300 sec: 5736.4). Total num frames: 188490752. Throughput: 0: 5135.6. Samples: 188484680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:15:57,178][25689] Avg episode reward: [(0, '-50.157')] [2022-07-09 09:15:57,331][26022] Updated weights on worker 0-0, policy_version 184074 (0.00088) [2022-07-09 09:15:59,197][26022] Updated weights on worker 0-0, policy_version 184084 (0.00088) [2022-07-09 09:16:00,883][26022] Updated weights on worker 0-0, policy_version 184094 (0.00088) [2022-07-09 09:16:02,315][25689] Fps is (10 sec: 5536.8, 60 sec: 5670.6, 300 sec: 5730.8). Total num frames: 188516352. Throughput: 0: 5983.3. Samples: 188519402. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:02,315][25689] Avg episode reward: [(0, '-50.393')] [2022-07-09 09:16:02,946][26022] Updated weights on worker 0-0, policy_version 184104 (0.00088) [2022-07-09 09:16:04,973][26022] Updated weights on worker 0-0, policy_version 184114 (0.00090) [2022-07-09 09:16:06,567][26022] Updated weights on worker 0-0, policy_version 184124 (0.00089) [2022-07-09 09:16:07,351][25689] Fps is (10 sec: 5636.9, 60 sec: 5752.0, 300 sec: 5740.9). Total num frames: 188548096. Throughput: 0: 5907.1. Samples: 188552106. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:07,351][25689] Avg episode reward: [(0, '-50.772')] [2022-07-09 09:16:08,341][26022] Updated weights on worker 0-0, policy_version 184134 (0.00094) [2022-07-09 09:16:10,048][26022] Updated weights on worker 0-0, policy_version 184144 (0.00090) [2022-07-09 09:16:11,831][26022] Updated weights on worker 0-0, policy_version 184154 (0.00101) [2022-07-09 09:16:12,410][25689] Fps is (10 sec: 5883.3, 60 sec: 5721.9, 300 sec: 5729.6). Total num frames: 188575744. Throughput: 0: 5042.9. Samples: 188569578. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:12,410][25689] Avg episode reward: [(0, '-51.597')] [2022-07-09 09:16:13,742][26022] Updated weights on worker 0-0, policy_version 184164 (0.00092) [2022-07-09 09:16:15,479][26022] Updated weights on worker 0-0, policy_version 184174 (0.00088) [2022-07-09 09:16:17,122][26022] Updated weights on worker 0-0, policy_version 184184 (0.00087) [2022-07-09 09:16:17,439][25689] Fps is (10 sec: 5582.8, 60 sec: 5705.0, 300 sec: 5736.8). Total num frames: 188604416. Throughput: 0: 5902.3. Samples: 188604268. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:17,440][25689] Avg episode reward: [(0, '-51.457')] [2022-07-09 09:16:18,958][26022] Updated weights on worker 0-0, policy_version 184194 (0.00090) [2022-07-09 09:16:20,800][26022] Updated weights on worker 0-0, policy_version 184204 (0.00086) [2022-07-09 09:16:22,545][25689] Fps is (10 sec: 5758.6, 60 sec: 5718.7, 300 sec: 5734.9). Total num frames: 188634112. Throughput: 0: 5912.5. Samples: 188639016. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:22,546][25689] Avg episode reward: [(0, '-51.994')] [2022-07-09 09:16:22,561][26022] Updated weights on worker 0-0, policy_version 184214 (0.00087) [2022-07-09 09:16:24,258][26022] Updated weights on worker 0-0, policy_version 184224 (0.00086) [2022-07-09 09:16:25,769][26022] Updated weights on worker 0-0, policy_version 184234 (0.00084) [2022-07-09 09:16:27,560][25689] Fps is (10 sec: 5867.9, 60 sec: 5754.1, 300 sec: 5738.7). Total num frames: 188663808. Throughput: 0: 5167.0. Samples: 188656530. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:27,561][25689] Avg episode reward: [(0, '-51.846')] [2022-07-09 09:16:27,909][26022] Updated weights on worker 0-0, policy_version 184244 (0.00081) [2022-07-09 09:16:29,549][26022] Updated weights on worker 0-0, policy_version 184254 (0.00078) [2022-07-09 09:16:31,362][26022] Updated weights on worker 0-0, policy_version 184264 (0.00090) [2022-07-09 09:16:32,625][25689] Fps is (10 sec: 5994.0, 60 sec: 5771.9, 300 sec: 5748.2). Total num frames: 188694528. Throughput: 0: 6031.3. Samples: 188691502. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:32,627][25689] Avg episode reward: [(0, '-50.971')] [2022-07-09 09:16:33,003][26022] Updated weights on worker 0-0, policy_version 184274 (0.00080) [2022-07-09 09:16:34,897][26022] Updated weights on worker 0-0, policy_version 184284 (0.00104) [2022-07-09 09:16:36,726][26022] Updated weights on worker 0-0, policy_version 184294 (0.00091) [2022-07-09 09:16:37,669][25689] Fps is (10 sec: 5774.0, 60 sec: 5735.9, 300 sec: 5742.0). Total num frames: 188722176. Throughput: 0: 6038.9. Samples: 188726436. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:37,670][25689] Avg episode reward: [(0, '-50.833')] [2022-07-09 09:16:38,351][26022] Updated weights on worker 0-0, policy_version 184304 (0.00090) [2022-07-09 09:16:40,014][26022] Updated weights on worker 0-0, policy_version 184314 (0.00094) [2022-07-09 09:16:41,993][26022] Updated weights on worker 0-0, policy_version 184324 (0.00082) [2022-07-09 09:16:42,786][25689] Fps is (10 sec: 5643.6, 60 sec: 5735.8, 300 sec: 5740.1). Total num frames: 188751872. Throughput: 0: 6044.6. Samples: 188761360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:42,786][25689] Avg episode reward: [(0, '-51.512')] [2022-07-09 09:16:42,934][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:16:42,946][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000184330_188753920.pth [2022-07-09 09:16:42,947][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000182312_186687488.pth [2022-07-09 09:16:43,575][26022] Updated weights on worker 0-0, policy_version 184334 (0.00085) [2022-07-09 09:16:45,478][26022] Updated weights on worker 0-0, policy_version 184344 (0.00086) [2022-07-09 09:16:47,097][26022] Updated weights on worker 0-0, policy_version 184354 (0.00086) [2022-07-09 09:16:47,859][25689] Fps is (10 sec: 5828.7, 60 sec: 5764.7, 300 sec: 5742.3). Total num frames: 188781568. Throughput: 0: 6026.4. Samples: 188778856. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:47,859][25689] Avg episode reward: [(0, '-51.412')] [2022-07-09 09:16:48,915][26022] Updated weights on worker 0-0, policy_version 184364 (0.00091) [2022-07-09 09:16:50,694][26022] Updated weights on worker 0-0, policy_version 184374 (0.00085) [2022-07-09 09:16:52,452][26022] Updated weights on worker 0-0, policy_version 184384 (0.00084) [2022-07-09 09:16:52,877][25689] Fps is (10 sec: 5987.2, 60 sec: 5800.0, 300 sec: 5752.7). Total num frames: 188812288. Throughput: 0: 6046.8. Samples: 188813962. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:52,877][25689] Avg episode reward: [(0, '-50.151')] [2022-07-09 09:16:54,161][26022] Updated weights on worker 0-0, policy_version 184394 (0.00084) [2022-07-09 09:16:55,917][26022] Updated weights on worker 0-0, policy_version 184404 (0.00088) [2022-07-09 09:16:57,648][26022] Updated weights on worker 0-0, policy_version 184414 (0.00083) [2022-07-09 09:16:57,881][25689] Fps is (10 sec: 5926.2, 60 sec: 5769.2, 300 sec: 5750.7). Total num frames: 188840960. Throughput: 0: 6055.1. Samples: 188848818. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:16:57,881][25689] Avg episode reward: [(0, '-50.035')] [2022-07-09 09:16:59,711][26022] Updated weights on worker 0-0, policy_version 184424 (0.00081) [2022-07-09 09:17:01,193][26022] Updated weights on worker 0-0, policy_version 184434 (0.00083) [2022-07-09 09:17:03,021][25689] Fps is (10 sec: 5350.2, 60 sec: 5768.9, 300 sec: 5741.9). Total num frames: 188866560. Throughput: 0: 5183.4. Samples: 188866246. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:03,021][25689] Avg episode reward: [(0, '-50.500')] [2022-07-09 09:17:03,335][26022] Updated weights on worker 0-0, policy_version 184444 (0.00091) [2022-07-09 09:17:05,327][26022] Updated weights on worker 0-0, policy_version 184454 (0.00088) [2022-07-09 09:17:07,003][26022] Updated weights on worker 0-0, policy_version 184464 (0.00085) [2022-07-09 09:17:08,035][25689] Fps is (10 sec: 5445.4, 60 sec: 5737.2, 300 sec: 5742.4). Total num frames: 188896256. Throughput: 0: 5918.2. Samples: 188898266. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:08,036][25689] Avg episode reward: [(0, '-49.919')] [2022-07-09 09:17:08,777][26022] Updated weights on worker 0-0, policy_version 184474 (0.00088) [2022-07-09 09:17:10,618][26022] Updated weights on worker 0-0, policy_version 184484 (0.00085) [2022-07-09 09:17:12,313][26022] Updated weights on worker 0-0, policy_version 184494 (0.00082) [2022-07-09 09:17:13,049][25689] Fps is (10 sec: 5922.6, 60 sec: 5775.3, 300 sec: 5752.8). Total num frames: 188925952. Throughput: 0: 5912.5. Samples: 188933228. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:13,049][25689] Avg episode reward: [(0, '-50.584')] [2022-07-09 09:17:14,100][26022] Updated weights on worker 0-0, policy_version 184504 (0.00088) [2022-07-09 09:17:15,791][26022] Updated weights on worker 0-0, policy_version 184514 (0.01028) [2022-07-09 09:17:17,447][26022] Updated weights on worker 0-0, policy_version 184524 (0.00081) [2022-07-09 09:17:18,062][25689] Fps is (10 sec: 5821.3, 60 sec: 5776.8, 300 sec: 5747.9). Total num frames: 188954624. Throughput: 0: 5045.2. Samples: 188950636. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:18,062][25689] Avg episode reward: [(0, '-50.848')] [2022-07-09 09:17:19,386][26022] Updated weights on worker 0-0, policy_version 184534 (0.00084) [2022-07-09 09:17:21,207][26022] Updated weights on worker 0-0, policy_version 184544 (0.00095) [2022-07-09 09:17:22,793][26022] Updated weights on worker 0-0, policy_version 184554 (0.00085) [2022-07-09 09:17:23,106][25689] Fps is (10 sec: 5803.2, 60 sec: 5782.7, 300 sec: 5743.8). Total num frames: 188984320. Throughput: 0: 5926.2. Samples: 188985278. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:23,107][25689] Avg episode reward: [(0, '-51.692')] [2022-07-09 09:17:24,818][26022] Updated weights on worker 0-0, policy_version 184564 (0.00098) [2022-07-09 09:17:26,385][26022] Updated weights on worker 0-0, policy_version 184574 (0.00081) [2022-07-09 09:17:28,148][25689] Fps is (10 sec: 5888.5, 60 sec: 5780.2, 300 sec: 5747.8). Total num frames: 189014016. Throughput: 0: 6065.6. Samples: 189020262. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:28,149][25689] Avg episode reward: [(0, '-51.762')] [2022-07-09 09:17:28,161][26022] Updated weights on worker 0-0, policy_version 184584 (0.00086) [2022-07-09 09:17:29,942][26022] Updated weights on worker 0-0, policy_version 184594 (0.00096) [2022-07-09 09:17:31,696][26022] Updated weights on worker 0-0, policy_version 184604 (0.00086) [2022-07-09 09:17:33,205][25689] Fps is (10 sec: 5678.6, 60 sec: 5730.2, 300 sec: 5750.2). Total num frames: 189041664. Throughput: 0: 5186.2. Samples: 189037762. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:33,206][25689] Avg episode reward: [(0, '-51.262')] [2022-07-09 09:17:33,606][26022] Updated weights on worker 0-0, policy_version 184614 (0.00092) [2022-07-09 09:17:35,217][26022] Updated weights on worker 0-0, policy_version 184624 (0.00078) [2022-07-09 09:17:37,047][26022] Updated weights on worker 0-0, policy_version 184634 (0.00098) [2022-07-09 09:17:38,255][25689] Fps is (10 sec: 5775.2, 60 sec: 5780.4, 300 sec: 5751.2). Total num frames: 189072384. Throughput: 0: 6029.3. Samples: 189072384. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:38,256][25689] Avg episode reward: [(0, '-51.571')] [2022-07-09 09:17:38,811][26022] Updated weights on worker 0-0, policy_version 184644 (0.00087) [2022-07-09 09:17:40,448][26022] Updated weights on worker 0-0, policy_version 184654 (0.00085) [2022-07-09 09:17:42,315][26022] Updated weights on worker 0-0, policy_version 184664 (0.00091) [2022-07-09 09:17:43,360][25689] Fps is (10 sec: 5949.3, 60 sec: 5781.5, 300 sec: 5754.3). Total num frames: 189102080. Throughput: 0: 6032.1. Samples: 189107448. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:43,361][25689] Avg episode reward: [(0, '-52.070')] [2022-07-09 09:17:44,043][26022] Updated weights on worker 0-0, policy_version 184674 (0.00096) [2022-07-09 09:17:45,681][26022] Updated weights on worker 0-0, policy_version 184684 (0.00317) [2022-07-09 09:17:47,779][26022] Updated weights on worker 0-0, policy_version 184694 (0.00083) [2022-07-09 09:17:48,401][25689] Fps is (10 sec: 5752.6, 60 sec: 5767.6, 300 sec: 5750.2). Total num frames: 189130752. Throughput: 0: 5164.0. Samples: 189124848. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:48,403][25689] Avg episode reward: [(0, '-51.642')] [2022-07-09 09:17:49,057][26022] Updated weights on worker 0-0, policy_version 184704 (0.00086) [2022-07-09 09:17:50,971][26022] Updated weights on worker 0-0, policy_version 184714 (0.00081) [2022-07-09 09:17:52,670][26022] Updated weights on worker 0-0, policy_version 184724 (0.00085) [2022-07-09 09:17:53,410][25689] Fps is (10 sec: 5705.5, 60 sec: 5734.6, 300 sec: 5747.7). Total num frames: 189159424. Throughput: 0: 6066.2. Samples: 189160334. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:53,412][25689] Avg episode reward: [(0, '-51.448')] [2022-07-09 09:17:54,398][26022] Updated weights on worker 0-0, policy_version 184734 (0.00080) [2022-07-09 09:17:56,342][26022] Updated weights on worker 0-0, policy_version 184744 (0.00086) [2022-07-09 09:17:57,901][26022] Updated weights on worker 0-0, policy_version 184754 (0.00086) [2022-07-09 09:17:58,477][25689] Fps is (10 sec: 5894.5, 60 sec: 5762.5, 300 sec: 5755.5). Total num frames: 189190144. Throughput: 0: 6089.8. Samples: 189195534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:17:58,477][25689] Avg episode reward: [(0, '-51.651')] [2022-07-09 09:17:59,694][26022] Updated weights on worker 0-0, policy_version 184764 (0.00086) [2022-07-09 09:18:01,942][26022] Updated weights on worker 0-0, policy_version 184774 (0.00089) [2022-07-09 09:18:03,577][25689] Fps is (10 sec: 5741.2, 60 sec: 5800.1, 300 sec: 5750.9). Total num frames: 189217792. Throughput: 0: 5207.4. Samples: 189212728. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 09:18:03,578][25689] Avg episode reward: [(0, '-51.784')] [2022-07-09 09:18:03,791][26022] Updated weights on worker 0-0, policy_version 184784 (0.00094) [2022-07-09 09:18:05,642][26022] Updated weights on worker 0-0, policy_version 184794 (0.00105) [2022-07-09 09:18:07,457][26022] Updated weights on worker 0-0, policy_version 184804 (0.00078) [2022-07-09 09:18:08,601][25689] Fps is (10 sec: 5562.9, 60 sec: 5782.3, 300 sec: 5750.8). Total num frames: 189246464. Throughput: 0: 5932.3. Samples: 189244680. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:08,602][25689] Avg episode reward: [(0, '-50.559')] [2022-07-09 09:18:08,971][26022] Updated weights on worker 0-0, policy_version 184814 (0.00082) [2022-07-09 09:18:10,979][26022] Updated weights on worker 0-0, policy_version 184824 (0.00087) [2022-07-09 09:18:12,831][26022] Updated weights on worker 0-0, policy_version 184834 (0.00080) [2022-07-09 09:18:13,622][25689] Fps is (10 sec: 5606.8, 60 sec: 5747.7, 300 sec: 5747.4). Total num frames: 189274112. Throughput: 0: 5893.6. Samples: 189279452. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:13,623][25689] Avg episode reward: [(0, '-50.132')] [2022-07-09 09:18:14,327][26022] Updated weights on worker 0-0, policy_version 184844 (0.00082) [2022-07-09 09:18:16,367][26022] Updated weights on worker 0-0, policy_version 184854 (0.00083) [2022-07-09 09:18:17,971][26022] Updated weights on worker 0-0, policy_version 184864 (0.00087) [2022-07-09 09:18:18,624][25689] Fps is (10 sec: 5619.1, 60 sec: 5748.8, 300 sec: 5745.7). Total num frames: 189302784. Throughput: 0: 5032.9. Samples: 189296934. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:18,625][25689] Avg episode reward: [(0, '-50.494')] [2022-07-09 09:18:19,826][26022] Updated weights on worker 0-0, policy_version 184874 (0.00091) [2022-07-09 09:18:21,770][26022] Updated weights on worker 0-0, policy_version 184884 (0.00086) [2022-07-09 09:18:23,150][26022] Updated weights on worker 0-0, policy_version 184894 (0.00083) [2022-07-09 09:18:23,679][25689] Fps is (10 sec: 5905.4, 60 sec: 5764.7, 300 sec: 5748.3). Total num frames: 189333504. Throughput: 0: 5905.6. Samples: 189331444. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:23,680][25689] Avg episode reward: [(0, '-50.043')] [2022-07-09 09:18:25,329][26022] Updated weights on worker 0-0, policy_version 184904 (0.00080) [2022-07-09 09:18:26,855][26022] Updated weights on worker 0-0, policy_version 184914 (0.00092) [2022-07-09 09:18:28,695][25689] Fps is (10 sec: 5795.9, 60 sec: 5733.4, 300 sec: 5748.4). Total num frames: 189361152. Throughput: 0: 6036.0. Samples: 189365964. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:28,695][25689] Avg episode reward: [(0, '-50.475')] [2022-07-09 09:18:28,766][26022] Updated weights on worker 0-0, policy_version 184924 (0.00094) [2022-07-09 09:18:30,539][26022] Updated weights on worker 0-0, policy_version 184934 (0.00093) [2022-07-09 09:18:32,211][26022] Updated weights on worker 0-0, policy_version 184944 (0.00086) [2022-07-09 09:18:33,697][25689] Fps is (10 sec: 5519.9, 60 sec: 5738.5, 300 sec: 5745.1). Total num frames: 189388800. Throughput: 0: 5162.0. Samples: 189383078. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:33,697][25689] Avg episode reward: [(0, '-50.098')] [2022-07-09 09:18:34,092][26022] Updated weights on worker 0-0, policy_version 184954 (0.00091) [2022-07-09 09:18:35,950][26022] Updated weights on worker 0-0, policy_version 184964 (0.00102) [2022-07-09 09:18:37,506][26022] Updated weights on worker 0-0, policy_version 184974 (0.00090) [2022-07-09 09:18:38,708][25689] Fps is (10 sec: 5726.9, 60 sec: 5725.3, 300 sec: 5743.8). Total num frames: 189418496. Throughput: 0: 6005.6. Samples: 189417548. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:38,708][25689] Avg episode reward: [(0, '-50.378')] [2022-07-09 09:18:39,584][26022] Updated weights on worker 0-0, policy_version 184984 (0.00101) [2022-07-09 09:18:41,142][26022] Updated weights on worker 0-0, policy_version 184994 (0.00089) [2022-07-09 09:18:43,145][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:18:43,151][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000185004_189444096.pth [2022-07-09 09:18:43,152][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000182982_187373568.pth [2022-07-09 09:18:43,153][26022] Updated weights on worker 0-0, policy_version 185004 (0.00050) [2022-07-09 09:18:43,835][25689] Fps is (10 sec: 5858.4, 60 sec: 5723.2, 300 sec: 5746.5). Total num frames: 189448192. Throughput: 0: 5992.8. Samples: 189452232. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:43,835][25689] Avg episode reward: [(0, '-50.794')] [2022-07-09 09:18:44,807][26022] Updated weights on worker 0-0, policy_version 185014 (0.00477) [2022-07-09 09:18:46,565][26022] Updated weights on worker 0-0, policy_version 185024 (0.00088) [2022-07-09 09:18:48,200][26022] Updated weights on worker 0-0, policy_version 185034 (0.00092) [2022-07-09 09:18:48,858][25689] Fps is (10 sec: 5750.7, 60 sec: 5725.0, 300 sec: 5743.8). Total num frames: 189476864. Throughput: 0: 5992.1. Samples: 189486782. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:48,858][25689] Avg episode reward: [(0, '-51.679')] [2022-07-09 09:18:50,220][26022] Updated weights on worker 0-0, policy_version 185044 (0.00088) [2022-07-09 09:18:51,943][26022] Updated weights on worker 0-0, policy_version 185054 (0.00085) [2022-07-09 09:18:53,815][26022] Updated weights on worker 0-0, policy_version 185064 (0.00086) [2022-07-09 09:18:53,879][25689] Fps is (10 sec: 5709.1, 60 sec: 5723.8, 300 sec: 5736.7). Total num frames: 189505536. Throughput: 0: 5987.0. Samples: 189503908. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:53,879][25689] Avg episode reward: [(0, '-52.151')] [2022-07-09 09:18:55,371][26022] Updated weights on worker 0-0, policy_version 185074 (0.00084) [2022-07-09 09:18:57,206][26022] Updated weights on worker 0-0, policy_version 185084 (0.00086) [2022-07-09 09:18:58,883][25689] Fps is (10 sec: 5720.1, 60 sec: 5695.9, 300 sec: 5741.7). Total num frames: 189534208. Throughput: 0: 6032.8. Samples: 189539258. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:18:58,884][25689] Avg episode reward: [(0, '-52.191')] [2022-07-09 09:18:58,981][26022] Updated weights on worker 0-0, policy_version 185094 (0.00088) [2022-07-09 09:19:00,561][26022] Updated weights on worker 0-0, policy_version 185104 (0.00086) [2022-07-09 09:19:02,759][26022] Updated weights on worker 0-0, policy_version 185114 (0.00089) [2022-07-09 09:19:03,940][25689] Fps is (10 sec: 5801.6, 60 sec: 5733.9, 300 sec: 5744.2). Total num frames: 189563904. Throughput: 0: 5951.6. Samples: 189571890. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:03,940][25689] Avg episode reward: [(0, '-52.644')] [2022-07-09 09:19:04,744][26022] Updated weights on worker 0-0, policy_version 185124 (0.00092) [2022-07-09 09:19:06,320][26022] Updated weights on worker 0-0, policy_version 185134 (0.00085) [2022-07-09 09:19:08,130][26022] Updated weights on worker 0-0, policy_version 185144 (0.00089) [2022-07-09 09:19:09,024][25689] Fps is (10 sec: 5654.2, 60 sec: 5711.2, 300 sec: 5743.4). Total num frames: 189591552. Throughput: 0: 5072.6. Samples: 189589080. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:09,025][25689] Avg episode reward: [(0, '-52.721')] [2022-07-09 09:19:09,759][26022] Updated weights on worker 0-0, policy_version 185154 (0.00089) [2022-07-09 09:19:11,695][26022] Updated weights on worker 0-0, policy_version 185164 (0.00082) [2022-07-09 09:19:13,298][26022] Updated weights on worker 0-0, policy_version 185174 (0.00082) [2022-07-09 09:19:14,039][25689] Fps is (10 sec: 5678.1, 60 sec: 5745.7, 300 sec: 5743.3). Total num frames: 189621248. Throughput: 0: 5978.8. Samples: 189624440. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:14,039][25689] Avg episode reward: [(0, '-52.866')] [2022-07-09 09:19:15,147][26022] Updated weights on worker 0-0, policy_version 185184 (0.00086) [2022-07-09 09:19:16,908][26022] Updated weights on worker 0-0, policy_version 185194 (0.00091) [2022-07-09 09:19:18,620][26022] Updated weights on worker 0-0, policy_version 185204 (0.00105) [2022-07-09 09:19:19,114][25689] Fps is (10 sec: 5988.2, 60 sec: 5772.6, 300 sec: 5747.3). Total num frames: 189651968. Throughput: 0: 5934.9. Samples: 189659328. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:19,114][25689] Avg episode reward: [(0, '-52.041')] [2022-07-09 09:19:20,515][26022] Updated weights on worker 0-0, policy_version 185214 (0.00093) [2022-07-09 09:19:22,103][26022] Updated weights on worker 0-0, policy_version 185224 (0.00083) [2022-07-09 09:19:23,927][26022] Updated weights on worker 0-0, policy_version 185234 (0.00099) [2022-07-09 09:19:24,239][25689] Fps is (10 sec: 5922.8, 60 sec: 5749.0, 300 sec: 5753.5). Total num frames: 189681664. Throughput: 0: 5162.1. Samples: 189676678. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:24,240][25689] Avg episode reward: [(0, '-51.728')] [2022-07-09 09:19:25,876][26022] Updated weights on worker 0-0, policy_version 185244 (0.00087) [2022-07-09 09:19:27,567][26022] Updated weights on worker 0-0, policy_version 185254 (0.00089) [2022-07-09 09:19:29,251][25689] Fps is (10 sec: 5656.6, 60 sec: 5749.4, 300 sec: 5746.6). Total num frames: 189709312. Throughput: 0: 6061.2. Samples: 189711678. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:29,252][25689] Avg episode reward: [(0, '-50.576')] [2022-07-09 09:19:29,342][26022] Updated weights on worker 0-0, policy_version 185264 (0.00083) [2022-07-09 09:19:30,841][26022] Updated weights on worker 0-0, policy_version 185274 (0.00085) [2022-07-09 09:19:32,881][26022] Updated weights on worker 0-0, policy_version 185284 (0.00083) [2022-07-09 09:19:34,286][25689] Fps is (10 sec: 5707.6, 60 sec: 5780.0, 300 sec: 5746.5). Total num frames: 189739008. Throughput: 0: 6022.9. Samples: 189746388. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:34,287][25689] Avg episode reward: [(0, '-50.966')] [2022-07-09 09:19:34,619][26022] Updated weights on worker 0-0, policy_version 185294 (0.00089) [2022-07-09 09:19:36,213][26022] Updated weights on worker 0-0, policy_version 185304 (0.00614) [2022-07-09 09:19:38,124][26022] Updated weights on worker 0-0, policy_version 185314 (0.00089) [2022-07-09 09:19:39,295][25689] Fps is (10 sec: 5913.3, 60 sec: 5780.2, 300 sec: 5752.2). Total num frames: 189768704. Throughput: 0: 5170.0. Samples: 189763666. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:39,295][25689] Avg episode reward: [(0, '-50.432')] [2022-07-09 09:19:39,838][26022] Updated weights on worker 0-0, policy_version 185324 (0.00086) [2022-07-09 09:19:41,824][26022] Updated weights on worker 0-0, policy_version 185334 (0.00088) [2022-07-09 09:19:43,359][26022] Updated weights on worker 0-0, policy_version 185344 (0.00084) [2022-07-09 09:19:44,390][25689] Fps is (10 sec: 5776.8, 60 sec: 5766.3, 300 sec: 5751.1). Total num frames: 189797376. Throughput: 0: 6033.1. Samples: 189798250. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:44,391][25689] Avg episode reward: [(0, '-50.037')] [2022-07-09 09:19:45,055][26022] Updated weights on worker 0-0, policy_version 185354 (0.00084) [2022-07-09 09:19:46,896][26022] Updated weights on worker 0-0, policy_version 185364 (0.00085) [2022-07-09 09:19:48,832][26022] Updated weights on worker 0-0, policy_version 185374 (0.00091) [2022-07-09 09:19:49,412][25689] Fps is (10 sec: 5668.2, 60 sec: 5766.5, 300 sec: 5751.0). Total num frames: 189826048. Throughput: 0: 6021.2. Samples: 189833070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:49,412][25689] Avg episode reward: [(0, '-49.434')] [2022-07-09 09:19:50,591][26022] Updated weights on worker 0-0, policy_version 185384 (0.00090) [2022-07-09 09:19:52,195][26022] Updated weights on worker 0-0, policy_version 185394 (0.00247) [2022-07-09 09:19:54,121][26022] Updated weights on worker 0-0, policy_version 185404 (0.00087) [2022-07-09 09:19:54,421][25689] Fps is (10 sec: 5614.7, 60 sec: 5750.7, 300 sec: 5744.2). Total num frames: 189853696. Throughput: 0: 5164.5. Samples: 189850374. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:54,422][25689] Avg episode reward: [(0, '-50.370')] [2022-07-09 09:19:55,851][26022] Updated weights on worker 0-0, policy_version 185414 (0.00089) [2022-07-09 09:19:57,840][26022] Updated weights on worker 0-0, policy_version 185424 (0.00085) [2022-07-09 09:19:59,388][26022] Updated weights on worker 0-0, policy_version 185434 (0.00118) [2022-07-09 09:19:59,451][25689] Fps is (10 sec: 5814.2, 60 sec: 5782.0, 300 sec: 5749.0). Total num frames: 189884416. Throughput: 0: 6017.6. Samples: 189884956. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:19:59,451][25689] Avg episode reward: [(0, '-50.324')] [2022-07-09 09:20:01,286][26022] Updated weights on worker 0-0, policy_version 185444 (0.00094) [2022-07-09 09:20:03,562][26022] Updated weights on worker 0-0, policy_version 185454 (0.00082) [2022-07-09 09:20:04,528][25689] Fps is (10 sec: 5572.3, 60 sec: 5712.5, 300 sec: 5744.2). Total num frames: 189910016. Throughput: 0: 5909.8. Samples: 189917262. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:20:04,529][25689] Avg episode reward: [(0, '-50.510')] [2022-07-09 09:20:05,237][26022] Updated weights on worker 0-0, policy_version 185464 (0.00090) [2022-07-09 09:20:06,947][26022] Updated weights on worker 0-0, policy_version 185474 (0.00109) [2022-07-09 09:20:08,964][26022] Updated weights on worker 0-0, policy_version 185484 (0.00085) [2022-07-09 09:20:09,560][25689] Fps is (10 sec: 5368.5, 60 sec: 5734.4, 300 sec: 5742.0). Total num frames: 189938688. Throughput: 0: 5022.4. Samples: 189934264. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:20:09,561][25689] Avg episode reward: [(0, '-50.034')] [2022-07-09 09:20:10,508][26022] Updated weights on worker 0-0, policy_version 185494 (0.00086) [2022-07-09 09:20:12,495][26022] Updated weights on worker 0-0, policy_version 185504 (0.00095) [2022-07-09 09:20:13,974][26022] Updated weights on worker 0-0, policy_version 185514 (0.00085) [2022-07-09 09:20:14,572][25689] Fps is (10 sec: 5811.3, 60 sec: 5734.6, 300 sec: 5742.3). Total num frames: 189968384. Throughput: 0: 5879.5. Samples: 189968852. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:20:14,573][25689] Avg episode reward: [(0, '-49.870')] [2022-07-09 09:20:15,874][26022] Updated weights on worker 0-0, policy_version 185524 (0.00087) [2022-07-09 09:20:17,633][26022] Updated weights on worker 0-0, policy_version 185534 (0.00082) [2022-07-09 09:20:19,248][26022] Updated weights on worker 0-0, policy_version 185544 (0.00080) [2022-07-09 09:20:19,583][25689] Fps is (10 sec: 5925.8, 60 sec: 5723.8, 300 sec: 5747.0). Total num frames: 189998080. Throughput: 0: 5919.1. Samples: 190004120. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-09 09:20:19,583][25689] Avg episode reward: [(0, '-49.752')] [2022-07-09 09:20:21,324][26022] Updated weights on worker 0-0, policy_version 185554 (0.00092) [2022-07-09 09:20:22,748][26022] Updated weights on worker 0-0, policy_version 185564 (0.00048) [2022-07-09 09:20:24,615][25689] Fps is (10 sec: 5812.1, 60 sec: 5715.7, 300 sec: 5750.4). Total num frames: 190026752. Throughput: 0: 5176.7. Samples: 190021244. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:24,615][25689] Avg episode reward: [(0, '-49.173')] [2022-07-09 09:20:24,816][26022] Updated weights on worker 0-0, policy_version 185574 (0.00096) [2022-07-09 09:20:26,524][26022] Updated weights on worker 0-0, policy_version 185584 (0.00088) [2022-07-09 09:20:28,135][26022] Updated weights on worker 0-0, policy_version 185594 (0.00085) [2022-07-09 09:20:29,626][25689] Fps is (10 sec: 5811.9, 60 sec: 5749.7, 300 sec: 5751.7). Total num frames: 190056448. Throughput: 0: 6069.7. Samples: 190056056. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:29,626][25689] Avg episode reward: [(0, '-48.902')] [2022-07-09 09:20:30,003][26022] Updated weights on worker 0-0, policy_version 185604 (0.00086) [2022-07-09 09:20:31,878][26022] Updated weights on worker 0-0, policy_version 185614 (0.00089) [2022-07-09 09:20:33,652][26022] Updated weights on worker 0-0, policy_version 185624 (0.00089) [2022-07-09 09:20:34,647][25689] Fps is (10 sec: 5818.2, 60 sec: 5734.1, 300 sec: 5748.2). Total num frames: 190085120. Throughput: 0: 6067.9. Samples: 190090662. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:34,647][25689] Avg episode reward: [(0, '-48.885')] [2022-07-09 09:20:35,307][26022] Updated weights on worker 0-0, policy_version 185634 (0.00091) [2022-07-09 09:20:36,963][26022] Updated weights on worker 0-0, policy_version 185644 (0.00088) [2022-07-09 09:20:38,808][26022] Updated weights on worker 0-0, policy_version 185654 (0.00091) [2022-07-09 09:20:39,660][25689] Fps is (10 sec: 5612.8, 60 sec: 5699.8, 300 sec: 5743.3). Total num frames: 190112768. Throughput: 0: 5186.1. Samples: 190108242. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:39,660][25689] Avg episode reward: [(0, '-49.467')] [2022-07-09 09:20:40,625][26022] Updated weights on worker 0-0, policy_version 185664 (0.00087) [2022-07-09 09:20:42,420][26022] Updated weights on worker 0-0, policy_version 185674 (0.00123) [2022-07-09 09:20:43,224][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:20:43,237][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000185678_190134272.pth [2022-07-09 09:20:43,238][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000183656_188063744.pth [2022-07-09 09:20:44,244][26022] Updated weights on worker 0-0, policy_version 185684 (0.00098) [2022-07-09 09:20:44,739][25689] Fps is (10 sec: 5682.1, 60 sec: 5718.3, 300 sec: 5749.1). Total num frames: 190142464. Throughput: 0: 6048.0. Samples: 190142954. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:44,739][25689] Avg episode reward: [(0, '-49.149')] [2022-07-09 09:20:45,938][26022] Updated weights on worker 0-0, policy_version 185694 (0.00091) [2022-07-09 09:20:47,617][26022] Updated weights on worker 0-0, policy_version 185704 (0.00084) [2022-07-09 09:20:49,370][26022] Updated weights on worker 0-0, policy_version 185714 (0.00083) [2022-07-09 09:20:49,769][25689] Fps is (10 sec: 5976.2, 60 sec: 5751.3, 300 sec: 5756.0). Total num frames: 190173184. Throughput: 0: 6047.2. Samples: 190177868. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:49,770][25689] Avg episode reward: [(0, '-50.190')] [2022-07-09 09:20:51,371][26022] Updated weights on worker 0-0, policy_version 185724 (0.00084) [2022-07-09 09:20:52,899][26022] Updated weights on worker 0-0, policy_version 185734 (0.00090) [2022-07-09 09:20:54,783][25689] Fps is (10 sec: 5810.8, 60 sec: 5750.9, 300 sec: 5746.1). Total num frames: 190200832. Throughput: 0: 5190.7. Samples: 190195186. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:54,784][25689] Avg episode reward: [(0, '-50.069')] [2022-07-09 09:20:54,902][26022] Updated weights on worker 0-0, policy_version 185744 (0.00087) [2022-07-09 09:20:56,353][26022] Updated weights on worker 0-0, policy_version 185754 (0.00092) [2022-07-09 09:20:58,346][26022] Updated weights on worker 0-0, policy_version 185764 (0.00088) [2022-07-09 09:20:59,803][25689] Fps is (10 sec: 5714.8, 60 sec: 5734.8, 300 sec: 5762.2). Total num frames: 190230528. Throughput: 0: 6044.7. Samples: 190230006. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:20:59,805][25689] Avg episode reward: [(0, '-50.355')] [2022-07-09 09:21:00,161][26022] Updated weights on worker 0-0, policy_version 185774 (0.00103) [2022-07-09 09:21:02,128][26022] Updated weights on worker 0-0, policy_version 185784 (0.00081) [2022-07-09 09:21:04,142][26022] Updated weights on worker 0-0, policy_version 185794 (0.00082) [2022-07-09 09:21:04,843][25689] Fps is (10 sec: 5598.8, 60 sec: 5755.4, 300 sec: 5744.9). Total num frames: 190257152. Throughput: 0: 5963.1. Samples: 190262836. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:04,843][25689] Avg episode reward: [(0, '-51.255')] [2022-07-09 09:21:05,680][26022] Updated weights on worker 0-0, policy_version 185804 (0.00085) [2022-07-09 09:21:07,753][26022] Updated weights on worker 0-0, policy_version 185814 (0.00086) [2022-07-09 09:21:09,334][26022] Updated weights on worker 0-0, policy_version 185824 (0.00091) [2022-07-09 09:21:09,851][25689] Fps is (10 sec: 5503.6, 60 sec: 5757.7, 300 sec: 5749.3). Total num frames: 190285824. Throughput: 0: 5081.4. Samples: 190279910. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:09,851][25689] Avg episode reward: [(0, '-51.099')] [2022-07-09 09:21:11,053][26022] Updated weights on worker 0-0, policy_version 185834 (0.00090) [2022-07-09 09:21:13,003][26022] Updated weights on worker 0-0, policy_version 185844 (0.00085) [2022-07-09 09:21:14,609][26022] Updated weights on worker 0-0, policy_version 185854 (0.00083) [2022-07-09 09:21:14,871][25689] Fps is (10 sec: 5718.4, 60 sec: 5740.0, 300 sec: 5749.5). Total num frames: 190314496. Throughput: 0: 5938.0. Samples: 190314464. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:14,872][25689] Avg episode reward: [(0, '-50.868')] [2022-07-09 09:21:16,480][26022] Updated weights on worker 0-0, policy_version 185864 (0.00091) [2022-07-09 09:21:18,128][26022] Updated weights on worker 0-0, policy_version 185874 (0.00087) [2022-07-09 09:21:19,824][26022] Updated weights on worker 0-0, policy_version 185884 (0.00093) [2022-07-09 09:21:19,882][25689] Fps is (10 sec: 5920.6, 60 sec: 5756.9, 300 sec: 5754.8). Total num frames: 190345216. Throughput: 0: 5935.0. Samples: 190349172. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:19,883][25689] Avg episode reward: [(0, '-51.496')] [2022-07-09 09:21:21,963][26022] Updated weights on worker 0-0, policy_version 185894 (0.00090) [2022-07-09 09:21:23,497][26022] Updated weights on worker 0-0, policy_version 185904 (0.00085) [2022-07-09 09:21:25,007][25689] Fps is (10 sec: 5657.3, 60 sec: 5714.1, 300 sec: 5742.3). Total num frames: 190371840. Throughput: 0: 5125.6. Samples: 190366190. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:25,007][25689] Avg episode reward: [(0, '-50.687')] [2022-07-09 09:21:25,380][26022] Updated weights on worker 0-0, policy_version 185914 (0.00085) [2022-07-09 09:21:27,146][26022] Updated weights on worker 0-0, policy_version 185924 (0.00095) [2022-07-09 09:21:29,051][26022] Updated weights on worker 0-0, policy_version 185934 (0.00086) [2022-07-09 09:21:30,018][25689] Fps is (10 sec: 5556.3, 60 sec: 5714.1, 300 sec: 5739.9). Total num frames: 190401536. Throughput: 0: 5983.9. Samples: 190400592. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:30,020][25689] Avg episode reward: [(0, '-51.049')] [2022-07-09 09:21:30,740][26022] Updated weights on worker 0-0, policy_version 185944 (0.00082) [2022-07-09 09:21:32,692][26022] Updated weights on worker 0-0, policy_version 185954 (0.00083) [2022-07-09 09:21:34,204][26022] Updated weights on worker 0-0, policy_version 185964 (0.00086) [2022-07-09 09:21:35,029][25689] Fps is (10 sec: 5823.8, 60 sec: 5715.0, 300 sec: 5744.0). Total num frames: 190430208. Throughput: 0: 5990.7. Samples: 190435228. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:35,031][25689] Avg episode reward: [(0, '-51.286')] [2022-07-09 09:21:36,157][26022] Updated weights on worker 0-0, policy_version 185974 (0.00084) [2022-07-09 09:21:37,776][26022] Updated weights on worker 0-0, policy_version 185984 (0.00088) [2022-07-09 09:21:39,459][26022] Updated weights on worker 0-0, policy_version 185994 (0.00084) [2022-07-09 09:21:40,059][25689] Fps is (10 sec: 5915.5, 60 sec: 5764.3, 300 sec: 5749.1). Total num frames: 190460928. Throughput: 0: 5130.1. Samples: 190452680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:40,060][25689] Avg episode reward: [(0, '-52.187')] [2022-07-09 09:21:41,234][26022] Updated weights on worker 0-0, policy_version 186004 (0.00088) [2022-07-09 09:21:43,026][26022] Updated weights on worker 0-0, policy_version 186014 (0.00082) [2022-07-09 09:21:44,892][26022] Updated weights on worker 0-0, policy_version 186024 (0.00092) [2022-07-09 09:21:45,196][25689] Fps is (10 sec: 5841.5, 60 sec: 5741.8, 300 sec: 5744.4). Total num frames: 190489600. Throughput: 0: 6008.4. Samples: 190487496. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:45,197][25689] Avg episode reward: [(0, '-51.984')] [2022-07-09 09:21:46,820][26022] Updated weights on worker 0-0, policy_version 186034 (0.00084) [2022-07-09 09:21:48,236][26022] Updated weights on worker 0-0, policy_version 186044 (0.00090) [2022-07-09 09:21:50,204][25689] Fps is (10 sec: 5551.5, 60 sec: 5693.2, 300 sec: 5734.3). Total num frames: 190517248. Throughput: 0: 6016.1. Samples: 190522028. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:50,204][25689] Avg episode reward: [(0, '-51.262')] [2022-07-09 09:21:50,349][26022] Updated weights on worker 0-0, policy_version 186054 (0.00092) [2022-07-09 09:21:52,058][26022] Updated weights on worker 0-0, policy_version 186064 (0.00084) [2022-07-09 09:21:53,702][26022] Updated weights on worker 0-0, policy_version 186074 (0.00083) [2022-07-09 09:21:55,221][25689] Fps is (10 sec: 5822.5, 60 sec: 5743.7, 300 sec: 5740.9). Total num frames: 190547968. Throughput: 0: 6030.1. Samples: 190556986. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:21:55,221][25689] Avg episode reward: [(0, '-51.422')] [2022-07-09 09:21:55,735][26022] Updated weights on worker 0-0, policy_version 186084 (0.00083) [2022-07-09 09:21:57,192][26022] Updated weights on worker 0-0, policy_version 186094 (0.00101) [2022-07-09 09:21:59,261][26022] Updated weights on worker 0-0, policy_version 186104 (0.00081) [2022-07-09 09:22:00,303][25689] Fps is (10 sec: 5982.3, 60 sec: 5737.9, 300 sec: 5755.8). Total num frames: 190577664. Throughput: 0: 6010.5. Samples: 190574358. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:00,304][25689] Avg episode reward: [(0, '-50.845')] [2022-07-09 09:22:00,925][26022] Updated weights on worker 0-0, policy_version 186114 (0.00084) [2022-07-09 09:22:03,076][26022] Updated weights on worker 0-0, policy_version 186124 (0.00088) [2022-07-09 09:22:04,918][26022] Updated weights on worker 0-0, policy_version 186134 (0.00081) [2022-07-09 09:22:05,355][25689] Fps is (10 sec: 5355.5, 60 sec: 5702.8, 300 sec: 5737.9). Total num frames: 190602240. Throughput: 0: 5909.3. Samples: 190606618. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:05,357][25689] Avg episode reward: [(0, '-50.688')] [2022-07-09 09:22:06,515][26022] Updated weights on worker 0-0, policy_version 186144 (0.00088) [2022-07-09 09:22:08,374][26022] Updated weights on worker 0-0, policy_version 186154 (0.00092) [2022-07-09 09:22:10,125][26022] Updated weights on worker 0-0, policy_version 186164 (0.00087) [2022-07-09 09:22:10,387][25689] Fps is (10 sec: 5483.5, 60 sec: 5734.4, 300 sec: 5741.0). Total num frames: 190632960. Throughput: 0: 5915.1. Samples: 190641414. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:10,388][25689] Avg episode reward: [(0, '-51.269')] [2022-07-09 09:22:11,944][26022] Updated weights on worker 0-0, policy_version 186174 (0.00093) [2022-07-09 09:22:13,765][26022] Updated weights on worker 0-0, policy_version 186184 (0.00251) [2022-07-09 09:22:15,240][26022] Updated weights on worker 0-0, policy_version 186194 (0.00086) [2022-07-09 09:22:15,396][25689] Fps is (10 sec: 6017.0, 60 sec: 5752.3, 300 sec: 5744.5). Total num frames: 190662656. Throughput: 0: 5048.0. Samples: 190658828. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:15,397][25689] Avg episode reward: [(0, '-50.828')] [2022-07-09 09:22:17,305][26022] Updated weights on worker 0-0, policy_version 186204 (0.00093) [2022-07-09 09:22:18,903][26022] Updated weights on worker 0-0, policy_version 186214 (0.00083) [2022-07-09 09:22:20,438][25689] Fps is (10 sec: 5603.1, 60 sec: 5681.7, 300 sec: 5734.2). Total num frames: 190689280. Throughput: 0: 5916.6. Samples: 190693492. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:20,439][25689] Avg episode reward: [(0, '-51.554')] [2022-07-09 09:22:20,735][26022] Updated weights on worker 0-0, policy_version 186224 (0.00080) [2022-07-09 09:22:22,571][26022] Updated weights on worker 0-0, policy_version 186234 (0.00091) [2022-07-09 09:22:24,467][26022] Updated weights on worker 0-0, policy_version 186244 (0.00087) [2022-07-09 09:22:25,570][25689] Fps is (10 sec: 5636.2, 60 sec: 5748.7, 300 sec: 5735.9). Total num frames: 190720000. Throughput: 0: 6018.2. Samples: 190728276. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:25,571][25689] Avg episode reward: [(0, '-51.469')] [2022-07-09 09:22:26,075][26022] Updated weights on worker 0-0, policy_version 186254 (0.00056) [2022-07-09 09:22:28,059][26022] Updated weights on worker 0-0, policy_version 186264 (0.00083) [2022-07-09 09:22:29,530][26022] Updated weights on worker 0-0, policy_version 186274 (0.00091) [2022-07-09 09:22:30,609][25689] Fps is (10 sec: 5839.3, 60 sec: 5729.2, 300 sec: 5739.7). Total num frames: 190748672. Throughput: 0: 5140.3. Samples: 190745364. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 09:22:30,611][25689] Avg episode reward: [(0, '-52.549')] [2022-07-09 09:22:31,686][26022] Updated weights on worker 0-0, policy_version 186284 (0.00082) [2022-07-09 09:22:33,012][26022] Updated weights on worker 0-0, policy_version 186294 (0.00085) [2022-07-09 09:22:35,183][26022] Updated weights on worker 0-0, policy_version 186304 (0.00082) [2022-07-09 09:22:35,650][25689] Fps is (10 sec: 5790.3, 60 sec: 5743.2, 300 sec: 5736.4). Total num frames: 190778368. Throughput: 0: 5981.2. Samples: 190779974. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:22:35,653][25689] Avg episode reward: [(0, '-51.677')] [2022-07-09 09:22:36,732][26022] Updated weights on worker 0-0, policy_version 186314 (0.00089) [2022-07-09 09:22:38,575][26022] Updated weights on worker 0-0, policy_version 186324 (0.00082) [2022-07-09 09:22:40,279][26022] Updated weights on worker 0-0, policy_version 186334 (0.00080) [2022-07-09 09:22:40,703][25689] Fps is (10 sec: 5884.2, 60 sec: 5724.1, 300 sec: 5737.4). Total num frames: 190808064. Throughput: 0: 5983.9. Samples: 190814752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:22:40,704][25689] Avg episode reward: [(0, '-52.347')] [2022-07-09 09:22:42,285][26022] Updated weights on worker 0-0, policy_version 186344 (0.00088) [2022-07-09 09:22:43,278][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:22:43,290][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000186350_190822400.pth [2022-07-09 09:22:43,291][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000184330_188753920.pth [2022-07-09 09:22:43,777][26022] Updated weights on worker 0-0, policy_version 186354 (0.00087) [2022-07-09 09:22:45,767][25689] Fps is (10 sec: 5769.5, 60 sec: 5731.1, 300 sec: 5737.0). Total num frames: 190836736. Throughput: 0: 5133.4. Samples: 190831956. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:22:45,769][25689] Avg episode reward: [(0, '-52.251')] [2022-07-09 09:22:45,771][26022] Updated weights on worker 0-0, policy_version 186364 (0.00096) [2022-07-09 09:22:47,277][26022] Updated weights on worker 0-0, policy_version 186374 (0.00082) [2022-07-09 09:22:49,331][26022] Updated weights on worker 0-0, policy_version 186384 (0.00088) [2022-07-09 09:22:50,779][25689] Fps is (10 sec: 5792.8, 60 sec: 5764.5, 300 sec: 5740.4). Total num frames: 190866432. Throughput: 0: 6020.5. Samples: 190866794. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:22:50,779][25689] Avg episode reward: [(0, '-52.230')] [2022-07-09 09:22:50,902][26022] Updated weights on worker 0-0, policy_version 186394 (0.00083) [2022-07-09 09:22:52,682][26022] Updated weights on worker 0-0, policy_version 186404 (0.00084) [2022-07-09 09:22:54,482][26022] Updated weights on worker 0-0, policy_version 186414 (0.01112) [2022-07-09 09:22:55,800][25689] Fps is (10 sec: 5817.8, 60 sec: 5730.3, 300 sec: 5734.4). Total num frames: 190895104. Throughput: 0: 6045.2. Samples: 190901782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:22:55,800][25689] Avg episode reward: [(0, '-51.960')] [2022-07-09 09:22:56,255][26022] Updated weights on worker 0-0, policy_version 186424 (0.00088) [2022-07-09 09:22:58,114][26022] Updated weights on worker 0-0, policy_version 186434 (0.00082) [2022-07-09 09:22:59,754][26022] Updated weights on worker 0-0, policy_version 186444 (0.01140) [2022-07-09 09:23:00,813][25689] Fps is (10 sec: 5816.6, 60 sec: 5736.8, 300 sec: 5742.9). Total num frames: 190924800. Throughput: 0: 5205.8. Samples: 190919444. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:00,815][25689] Avg episode reward: [(0, '-51.767')] [2022-07-09 09:23:01,650][26022] Updated weights on worker 0-0, policy_version 186454 (0.00087) [2022-07-09 09:23:03,713][26022] Updated weights on worker 0-0, policy_version 186464 (0.00095) [2022-07-09 09:23:05,438][26022] Updated weights on worker 0-0, policy_version 186474 (0.00086) [2022-07-09 09:23:05,887][25689] Fps is (10 sec: 5481.9, 60 sec: 5751.7, 300 sec: 5731.6). Total num frames: 190950400. Throughput: 0: 5972.4. Samples: 190952120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:05,887][25689] Avg episode reward: [(0, '-52.272')] [2022-07-09 09:23:07,126][26022] Updated weights on worker 0-0, policy_version 186484 (0.00086) [2022-07-09 09:23:09,064][26022] Updated weights on worker 0-0, policy_version 186494 (0.00079) [2022-07-09 09:23:10,562][26022] Updated weights on worker 0-0, policy_version 186504 (0.00087) [2022-07-09 09:23:10,889][25689] Fps is (10 sec: 5590.0, 60 sec: 5754.5, 300 sec: 5742.3). Total num frames: 190981120. Throughput: 0: 5969.9. Samples: 190986848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:10,889][25689] Avg episode reward: [(0, '-51.881')] [2022-07-09 09:23:12,611][26022] Updated weights on worker 0-0, policy_version 186514 (0.00098) [2022-07-09 09:23:14,440][26022] Updated weights on worker 0-0, policy_version 186524 (0.00081) [2022-07-09 09:23:15,903][25689] Fps is (10 sec: 5929.2, 60 sec: 5737.0, 300 sec: 5742.1). Total num frames: 191009792. Throughput: 0: 5088.7. Samples: 191004084. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:15,904][25689] Avg episode reward: [(0, '-51.860')] [2022-07-09 09:23:15,964][26022] Updated weights on worker 0-0, policy_version 186534 (0.00423) [2022-07-09 09:23:17,842][26022] Updated weights on worker 0-0, policy_version 186544 (0.00090) [2022-07-09 09:23:19,663][26022] Updated weights on worker 0-0, policy_version 186554 (0.00093) [2022-07-09 09:23:20,914][25689] Fps is (10 sec: 5719.6, 60 sec: 5773.9, 300 sec: 5736.1). Total num frames: 191038464. Throughput: 0: 5951.9. Samples: 191039082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:20,915][25689] Avg episode reward: [(0, '-51.517')] [2022-07-09 09:23:21,371][26022] Updated weights on worker 0-0, policy_version 186564 (0.00086) [2022-07-09 09:23:23,219][26022] Updated weights on worker 0-0, policy_version 186574 (0.00088) [2022-07-09 09:23:24,684][26022] Updated weights on worker 0-0, policy_version 186584 (0.00089) [2022-07-09 09:23:25,984][25689] Fps is (10 sec: 5790.0, 60 sec: 5762.9, 300 sec: 5741.9). Total num frames: 191068160. Throughput: 0: 6065.8. Samples: 191074028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:25,985][25689] Avg episode reward: [(0, '-51.111')] [2022-07-09 09:23:26,828][26022] Updated weights on worker 0-0, policy_version 186594 (0.00096) [2022-07-09 09:23:28,420][26022] Updated weights on worker 0-0, policy_version 186604 (0.00095) [2022-07-09 09:23:30,214][26022] Updated weights on worker 0-0, policy_version 186614 (0.00085) [2022-07-09 09:23:30,992][25689] Fps is (10 sec: 5893.7, 60 sec: 5782.9, 300 sec: 5748.7). Total num frames: 191097856. Throughput: 0: 5185.1. Samples: 191091084. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:31,003][25689] Avg episode reward: [(0, '-50.825')] [2022-07-09 09:23:32,196][26022] Updated weights on worker 0-0, policy_version 186624 (0.00087) [2022-07-09 09:23:33,543][26022] Updated weights on worker 0-0, policy_version 186634 (0.00087) [2022-07-09 09:23:35,704][26022] Updated weights on worker 0-0, policy_version 186644 (0.00065) [2022-07-09 09:23:36,010][25689] Fps is (10 sec: 5821.8, 60 sec: 5768.1, 300 sec: 5745.1). Total num frames: 191126528. Throughput: 0: 6051.9. Samples: 191125766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:36,011][25689] Avg episode reward: [(0, '-50.511')] [2022-07-09 09:23:37,352][26022] Updated weights on worker 0-0, policy_version 186654 (0.00091) [2022-07-09 09:23:39,035][26022] Updated weights on worker 0-0, policy_version 186664 (0.00088) [2022-07-09 09:23:41,039][25689] Fps is (10 sec: 5503.4, 60 sec: 5719.4, 300 sec: 5736.7). Total num frames: 191153152. Throughput: 0: 6029.6. Samples: 191160426. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:41,040][25689] Avg episode reward: [(0, '-50.576')] [2022-07-09 09:23:41,111][26022] Updated weights on worker 0-0, policy_version 186674 (0.00416) [2022-07-09 09:23:42,584][26022] Updated weights on worker 0-0, policy_version 186684 (0.00093) [2022-07-09 09:23:44,538][26022] Updated weights on worker 0-0, policy_version 186694 (0.00082) [2022-07-09 09:23:46,094][25689] Fps is (10 sec: 5686.6, 60 sec: 5754.2, 300 sec: 5742.9). Total num frames: 191183872. Throughput: 0: 5153.7. Samples: 191177668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:46,095][25689] Avg episode reward: [(0, '-50.904')] [2022-07-09 09:23:46,224][26022] Updated weights on worker 0-0, policy_version 186704 (0.00093) [2022-07-09 09:23:47,927][26022] Updated weights on worker 0-0, policy_version 186714 (0.00084) [2022-07-09 09:23:49,857][26022] Updated weights on worker 0-0, policy_version 186724 (0.00084) [2022-07-09 09:23:51,120][25689] Fps is (10 sec: 5992.9, 60 sec: 5752.8, 300 sec: 5746.3). Total num frames: 191213568. Throughput: 0: 6041.8. Samples: 191212698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:51,121][25689] Avg episode reward: [(0, '-50.781')] [2022-07-09 09:23:51,422][26022] Updated weights on worker 0-0, policy_version 186734 (0.00093) [2022-07-09 09:23:53,432][26022] Updated weights on worker 0-0, policy_version 186744 (0.00087) [2022-07-09 09:23:55,110][26022] Updated weights on worker 0-0, policy_version 186754 (0.00053) [2022-07-09 09:23:56,128][25689] Fps is (10 sec: 5714.9, 60 sec: 5737.1, 300 sec: 5742.8). Total num frames: 191241216. Throughput: 0: 6032.9. Samples: 191247138. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:23:56,129][25689] Avg episode reward: [(0, '-50.398')] [2022-07-09 09:23:56,762][26022] Updated weights on worker 0-0, policy_version 186764 (0.00087) [2022-07-09 09:23:58,812][26022] Updated weights on worker 0-0, policy_version 186774 (0.00087) [2022-07-09 09:24:00,263][26022] Updated weights on worker 0-0, policy_version 186784 (0.00478) [2022-07-09 09:24:01,150][25689] Fps is (10 sec: 5615.4, 60 sec: 5719.4, 300 sec: 5740.0). Total num frames: 191269888. Throughput: 0: 5167.6. Samples: 191264350. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:01,151][25689] Avg episode reward: [(0, '-51.233')] [2022-07-09 09:24:02,527][26022] Updated weights on worker 0-0, policy_version 186794 (0.00092) [2022-07-09 09:24:04,297][26022] Updated weights on worker 0-0, policy_version 186804 (0.00051) [2022-07-09 09:24:06,103][26022] Updated weights on worker 0-0, policy_version 186814 (0.00085) [2022-07-09 09:24:06,207][25689] Fps is (10 sec: 5689.9, 60 sec: 5771.9, 300 sec: 5744.0). Total num frames: 191298560. Throughput: 0: 5934.4. Samples: 191297022. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:06,207][25689] Avg episode reward: [(0, '-51.335')] [2022-07-09 09:24:07,783][26022] Updated weights on worker 0-0, policy_version 186824 (0.00084) [2022-07-09 09:24:09,622][26022] Updated weights on worker 0-0, policy_version 186834 (0.00080) [2022-07-09 09:24:11,143][26022] Updated weights on worker 0-0, policy_version 186844 (0.00080) [2022-07-09 09:24:11,215][25689] Fps is (10 sec: 5798.8, 60 sec: 5754.3, 300 sec: 5744.1). Total num frames: 191328256. Throughput: 0: 5940.7. Samples: 191332076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:11,216][25689] Avg episode reward: [(0, '-51.061')] [2022-07-09 09:24:13,211][26022] Updated weights on worker 0-0, policy_version 186854 (0.00081) [2022-07-09 09:24:14,890][26022] Updated weights on worker 0-0, policy_version 186864 (0.00094) [2022-07-09 09:24:16,316][25689] Fps is (10 sec: 5571.0, 60 sec: 5712.2, 300 sec: 5729.8). Total num frames: 191354880. Throughput: 0: 5064.6. Samples: 191349380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:16,316][25689] Avg episode reward: [(0, '-51.757')] [2022-07-09 09:24:16,708][26022] Updated weights on worker 0-0, policy_version 186874 (0.00079) [2022-07-09 09:24:18,468][26022] Updated weights on worker 0-0, policy_version 186884 (0.00091) [2022-07-09 09:24:20,371][26022] Updated weights on worker 0-0, policy_version 186894 (0.00052) [2022-07-09 09:24:21,373][25689] Fps is (10 sec: 5645.6, 60 sec: 5741.7, 300 sec: 5734.6). Total num frames: 191385600. Throughput: 0: 5927.2. Samples: 191384214. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:21,373][25689] Avg episode reward: [(0, '-52.122')] [2022-07-09 09:24:21,977][26022] Updated weights on worker 0-0, policy_version 186904 (0.00094) [2022-07-09 09:24:23,910][26022] Updated weights on worker 0-0, policy_version 186914 (0.00086) [2022-07-09 09:24:25,393][26022] Updated weights on worker 0-0, policy_version 186924 (0.00085) [2022-07-09 09:24:26,484][25689] Fps is (10 sec: 5841.1, 60 sec: 5720.9, 300 sec: 5736.1). Total num frames: 191414272. Throughput: 0: 5998.5. Samples: 191418656. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:26,484][25689] Avg episode reward: [(0, '-52.187')] [2022-07-09 09:24:27,403][26022] Updated weights on worker 0-0, policy_version 186934 (0.00080) [2022-07-09 09:24:29,004][26022] Updated weights on worker 0-0, policy_version 186944 (0.00091) [2022-07-09 09:24:30,853][26022] Updated weights on worker 0-0, policy_version 186954 (0.00086) [2022-07-09 09:24:31,557][25689] Fps is (10 sec: 5731.2, 60 sec: 5714.7, 300 sec: 5735.4). Total num frames: 191443968. Throughput: 0: 5967.0. Samples: 191453454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:31,557][25689] Avg episode reward: [(0, '-51.588')] [2022-07-09 09:24:32,574][26022] Updated weights on worker 0-0, policy_version 186964 (0.00086) [2022-07-09 09:24:34,283][26022] Updated weights on worker 0-0, policy_version 186974 (0.00098) [2022-07-09 09:24:36,222][26022] Updated weights on worker 0-0, policy_version 186984 (0.00095) [2022-07-09 09:24:36,625][25689] Fps is (10 sec: 5856.8, 60 sec: 5726.9, 300 sec: 5734.2). Total num frames: 191473664. Throughput: 0: 5991.4. Samples: 191471056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:36,625][25689] Avg episode reward: [(0, '-51.397')] [2022-07-09 09:24:37,799][26022] Updated weights on worker 0-0, policy_version 186994 (0.00090) [2022-07-09 09:24:39,631][26022] Updated weights on worker 0-0, policy_version 187004 (0.00082) [2022-07-09 09:24:41,250][26022] Updated weights on worker 0-0, policy_version 187014 (0.00087) [2022-07-09 09:24:41,645][25689] Fps is (10 sec: 5887.1, 60 sec: 5778.4, 300 sec: 5739.1). Total num frames: 191503360. Throughput: 0: 6019.2. Samples: 191506240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 24.0) [2022-07-09 09:24:41,646][25689] Avg episode reward: [(0, '-51.574')] [2022-07-09 09:24:43,174][26022] Updated weights on worker 0-0, policy_version 187024 (0.00090) [2022-07-09 09:24:43,385][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:24:43,399][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000187025_191513600.pth [2022-07-09 09:24:43,399][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000185004_189444096.pth [2022-07-09 09:24:44,915][26022] Updated weights on worker 0-0, policy_version 187034 (0.00096) [2022-07-09 09:24:46,694][25689] Fps is (10 sec: 5796.6, 60 sec: 5745.2, 300 sec: 5738.6). Total num frames: 191532032. Throughput: 0: 6059.5. Samples: 191541120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:24:46,694][25689] Avg episode reward: [(0, '-51.051')] [2022-07-09 09:24:46,764][26022] Updated weights on worker 0-0, policy_version 187044 (0.00084) [2022-07-09 09:24:48,304][26022] Updated weights on worker 0-0, policy_version 187054 (0.00097) [2022-07-09 09:24:50,135][26022] Updated weights on worker 0-0, policy_version 187064 (0.00074) [2022-07-09 09:24:51,753][25689] Fps is (10 sec: 5774.5, 60 sec: 5742.1, 300 sec: 5744.5). Total num frames: 191561728. Throughput: 0: 5208.4. Samples: 191558644. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:24:51,754][25689] Avg episode reward: [(0, '-51.342')] [2022-07-09 09:24:51,983][26022] Updated weights on worker 0-0, policy_version 187074 (0.00085) [2022-07-09 09:24:53,789][26022] Updated weights on worker 0-0, policy_version 187084 (0.00092) [2022-07-09 09:24:55,486][26022] Updated weights on worker 0-0, policy_version 187094 (0.00089) [2022-07-09 09:24:56,771][25689] Fps is (10 sec: 5995.5, 60 sec: 5791.8, 300 sec: 5744.8). Total num frames: 191592448. Throughput: 0: 6090.3. Samples: 191593754. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:24:56,771][25689] Avg episode reward: [(0, '-50.724')] [2022-07-09 09:24:57,247][26022] Updated weights on worker 0-0, policy_version 187104 (0.00083) [2022-07-09 09:24:58,951][26022] Updated weights on worker 0-0, policy_version 187114 (0.00087) [2022-07-09 09:25:00,824][26022] Updated weights on worker 0-0, policy_version 187124 (0.00082) [2022-07-09 09:25:01,805][25689] Fps is (10 sec: 5603.0, 60 sec: 5740.0, 300 sec: 5745.6). Total num frames: 191618048. Throughput: 0: 6058.1. Samples: 191628368. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:01,805][25689] Avg episode reward: [(0, '-50.985')] [2022-07-09 09:25:02,751][26022] Updated weights on worker 0-0, policy_version 187134 (0.00085) [2022-07-09 09:25:04,751][26022] Updated weights on worker 0-0, policy_version 187144 (0.00094) [2022-07-09 09:25:06,458][26022] Updated weights on worker 0-0, policy_version 187154 (0.00493) [2022-07-09 09:25:06,946][25689] Fps is (10 sec: 5535.0, 60 sec: 5765.7, 300 sec: 5750.4). Total num frames: 191648768. Throughput: 0: 5074.9. Samples: 191643898. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:06,947][25689] Avg episode reward: [(0, '-50.800')] [2022-07-09 09:25:08,119][26022] Updated weights on worker 0-0, policy_version 187164 (0.00089) [2022-07-09 09:25:09,906][26022] Updated weights on worker 0-0, policy_version 187174 (0.00079) [2022-07-09 09:25:11,560][26022] Updated weights on worker 0-0, policy_version 187184 (0.00088) [2022-07-09 09:25:11,951][25689] Fps is (10 sec: 5853.8, 60 sec: 5749.3, 300 sec: 5747.1). Total num frames: 191677440. Throughput: 0: 5960.7. Samples: 191679036. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:11,951][25689] Avg episode reward: [(0, '-50.533')] [2022-07-09 09:25:13,384][26022] Updated weights on worker 0-0, policy_version 187194 (0.00083) [2022-07-09 09:25:15,236][26022] Updated weights on worker 0-0, policy_version 187204 (0.00088) [2022-07-09 09:25:16,958][25689] Fps is (10 sec: 5829.7, 60 sec: 5808.7, 300 sec: 5747.1). Total num frames: 191707136. Throughput: 0: 5950.3. Samples: 191713876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:16,959][25689] Avg episode reward: [(0, '-51.118')] [2022-07-09 09:25:16,959][26022] Updated weights on worker 0-0, policy_version 187214 (0.00071) [2022-07-09 09:25:18,558][26022] Updated weights on worker 0-0, policy_version 187224 (0.00085) [2022-07-09 09:25:20,594][26022] Updated weights on worker 0-0, policy_version 187234 (0.00100) [2022-07-09 09:25:21,980][25689] Fps is (10 sec: 5819.6, 60 sec: 5778.3, 300 sec: 5747.3). Total num frames: 191735808. Throughput: 0: 5105.7. Samples: 191731378. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:21,981][25689] Avg episode reward: [(0, '-50.371')] [2022-07-09 09:25:22,322][26022] Updated weights on worker 0-0, policy_version 187244 (0.00086) [2022-07-09 09:25:24,110][26022] Updated weights on worker 0-0, policy_version 187254 (0.00094) [2022-07-09 09:25:25,920][26022] Updated weights on worker 0-0, policy_version 187264 (0.00089) [2022-07-09 09:25:27,073][25689] Fps is (10 sec: 5770.7, 60 sec: 5796.9, 300 sec: 5745.7). Total num frames: 191765504. Throughput: 0: 6063.1. Samples: 191765928. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:27,075][25689] Avg episode reward: [(0, '-50.363')] [2022-07-09 09:25:27,700][26022] Updated weights on worker 0-0, policy_version 187274 (0.00082) [2022-07-09 09:25:29,330][26022] Updated weights on worker 0-0, policy_version 187284 (0.00089) [2022-07-09 09:25:31,254][26022] Updated weights on worker 0-0, policy_version 187294 (0.00094) [2022-07-09 09:25:32,138][25689] Fps is (10 sec: 5746.0, 60 sec: 5780.8, 300 sec: 5744.9). Total num frames: 191794176. Throughput: 0: 6015.7. Samples: 191800478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:32,140][25689] Avg episode reward: [(0, '-49.768')] [2022-07-09 09:25:32,921][26022] Updated weights on worker 0-0, policy_version 187304 (0.00084) [2022-07-09 09:25:34,773][26022] Updated weights on worker 0-0, policy_version 187314 (0.00077) [2022-07-09 09:25:36,512][26022] Updated weights on worker 0-0, policy_version 187324 (0.00084) [2022-07-09 09:25:37,166][25689] Fps is (10 sec: 5681.6, 60 sec: 5767.7, 300 sec: 5748.0). Total num frames: 191822848. Throughput: 0: 5140.3. Samples: 191817748. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:37,166][25689] Avg episode reward: [(0, '-50.200')] [2022-07-09 09:25:38,432][26022] Updated weights on worker 0-0, policy_version 187334 (0.00095) [2022-07-09 09:25:39,846][26022] Updated weights on worker 0-0, policy_version 187344 (0.00084) [2022-07-09 09:25:41,978][26022] Updated weights on worker 0-0, policy_version 187354 (0.00087) [2022-07-09 09:25:42,177][25689] Fps is (10 sec: 5814.3, 60 sec: 5768.6, 300 sec: 5749.4). Total num frames: 191852544. Throughput: 0: 5976.0. Samples: 191852074. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:42,178][25689] Avg episode reward: [(0, '-50.747')] [2022-07-09 09:25:43,560][26022] Updated weights on worker 0-0, policy_version 187364 (0.00084) [2022-07-09 09:25:45,422][26022] Updated weights on worker 0-0, policy_version 187374 (0.00087) [2022-07-09 09:25:47,244][25689] Fps is (10 sec: 5690.2, 60 sec: 5750.0, 300 sec: 5738.3). Total num frames: 191880192. Throughput: 0: 5986.0. Samples: 191886670. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:47,244][25689] Avg episode reward: [(0, '-50.706')] [2022-07-09 09:25:47,323][26022] Updated weights on worker 0-0, policy_version 187384 (0.00087) [2022-07-09 09:25:48,894][26022] Updated weights on worker 0-0, policy_version 187394 (0.00089) [2022-07-09 09:25:50,759][26022] Updated weights on worker 0-0, policy_version 187404 (0.00086) [2022-07-09 09:25:52,271][25689] Fps is (10 sec: 5782.6, 60 sec: 5770.0, 300 sec: 5748.4). Total num frames: 191910912. Throughput: 0: 5141.1. Samples: 191903980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:52,271][25689] Avg episode reward: [(0, '-50.825')] [2022-07-09 09:25:52,358][26022] Updated weights on worker 0-0, policy_version 187414 (0.00089) [2022-07-09 09:25:54,287][26022] Updated weights on worker 0-0, policy_version 187424 (0.00079) [2022-07-09 09:25:56,230][26022] Updated weights on worker 0-0, policy_version 187434 (0.00083) [2022-07-09 09:25:57,298][25689] Fps is (10 sec: 5805.1, 60 sec: 5718.3, 300 sec: 5741.4). Total num frames: 191938560. Throughput: 0: 6013.5. Samples: 191938814. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:25:57,299][25689] Avg episode reward: [(0, '-50.347')] [2022-07-09 09:25:57,936][26022] Updated weights on worker 0-0, policy_version 187445 (0.00090) [2022-07-09 09:25:59,880][26022] Updated weights on worker 0-0, policy_version 187455 (0.00081) [2022-07-09 09:26:01,409][26022] Updated weights on worker 0-0, policy_version 187465 (0.00091) [2022-07-09 09:26:02,324][25689] Fps is (10 sec: 5500.5, 60 sec: 5752.9, 300 sec: 5745.1). Total num frames: 191966208. Throughput: 0: 6003.2. Samples: 191973018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:02,324][25689] Avg episode reward: [(0, '-50.848')] [2022-07-09 09:26:03,873][26022] Updated weights on worker 0-0, policy_version 187475 (0.00086) [2022-07-09 09:26:05,409][26022] Updated weights on worker 0-0, policy_version 187485 (0.00092) [2022-07-09 09:26:07,372][25689] Fps is (10 sec: 5590.6, 60 sec: 5727.9, 300 sec: 5744.3). Total num frames: 191994880. Throughput: 0: 5073.7. Samples: 191988796. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:07,373][25689] Avg episode reward: [(0, '-50.935')] [2022-07-09 09:26:07,377][26022] Updated weights on worker 0-0, policy_version 187495 (0.00092) [2022-07-09 09:26:08,908][26022] Updated weights on worker 0-0, policy_version 187505 (0.00086) [2022-07-09 09:26:10,787][26022] Updated weights on worker 0-0, policy_version 187515 (0.00093) [2022-07-09 09:26:12,388][25689] Fps is (10 sec: 5901.2, 60 sec: 5760.7, 300 sec: 5751.3). Total num frames: 192025600. Throughput: 0: 5948.5. Samples: 192023648. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:12,389][25689] Avg episode reward: [(0, '-51.270')] [2022-07-09 09:26:12,402][26022] Updated weights on worker 0-0, policy_version 187525 (0.00086) [2022-07-09 09:26:14,204][26022] Updated weights on worker 0-0, policy_version 187535 (0.00093) [2022-07-09 09:26:15,997][26022] Updated weights on worker 0-0, policy_version 187545 (0.00086) [2022-07-09 09:26:17,396][25689] Fps is (10 sec: 5823.1, 60 sec: 5726.8, 300 sec: 5741.0). Total num frames: 192053248. Throughput: 0: 5939.8. Samples: 192058190. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:17,403][25689] Avg episode reward: [(0, '-50.019')] [2022-07-09 09:26:17,939][26022] Updated weights on worker 0-0, policy_version 187555 (0.00090) [2022-07-09 09:26:19,720][26022] Updated weights on worker 0-0, policy_version 187565 (0.00087) [2022-07-09 09:26:21,360][26022] Updated weights on worker 0-0, policy_version 187575 (0.00080) [2022-07-09 09:26:22,448][25689] Fps is (10 sec: 5598.4, 60 sec: 5723.9, 300 sec: 5749.3). Total num frames: 192081920. Throughput: 0: 5104.8. Samples: 192075750. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:22,450][25689] Avg episode reward: [(0, '-50.704')] [2022-07-09 09:26:23,282][26022] Updated weights on worker 0-0, policy_version 187585 (0.00088) [2022-07-09 09:26:24,901][26022] Updated weights on worker 0-0, policy_version 187595 (0.00081) [2022-07-09 09:26:26,801][26022] Updated weights on worker 0-0, policy_version 187605 (0.00088) [2022-07-09 09:26:27,560][25689] Fps is (10 sec: 5742.5, 60 sec: 5722.1, 300 sec: 5747.4). Total num frames: 192111616. Throughput: 0: 6026.0. Samples: 192110448. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:27,561][25689] Avg episode reward: [(0, '-51.308')] [2022-07-09 09:26:28,468][26022] Updated weights on worker 0-0, policy_version 187615 (0.00090) [2022-07-09 09:26:30,286][26022] Updated weights on worker 0-0, policy_version 187625 (0.00086) [2022-07-09 09:26:32,127][26022] Updated weights on worker 0-0, policy_version 187635 (0.00078) [2022-07-09 09:26:32,570][25689] Fps is (10 sec: 5766.6, 60 sec: 5727.3, 300 sec: 5747.4). Total num frames: 192140288. Throughput: 0: 6002.2. Samples: 192144782. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:32,571][25689] Avg episode reward: [(0, '-50.428')] [2022-07-09 09:26:33,785][26022] Updated weights on worker 0-0, policy_version 187645 (0.00095) [2022-07-09 09:26:35,535][26022] Updated weights on worker 0-0, policy_version 187655 (0.00087) [2022-07-09 09:26:37,586][25689] Fps is (10 sec: 5617.3, 60 sec: 5711.5, 300 sec: 5737.3). Total num frames: 192167936. Throughput: 0: 5163.7. Samples: 192162448. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:37,587][25689] Avg episode reward: [(0, '-49.967')] [2022-07-09 09:26:37,602][26022] Updated weights on worker 0-0, policy_version 187665 (0.00086) [2022-07-09 09:26:39,138][26022] Updated weights on worker 0-0, policy_version 187675 (0.00084) [2022-07-09 09:26:41,103][26022] Updated weights on worker 0-0, policy_version 187685 (0.00086) [2022-07-09 09:26:42,539][26022] Updated weights on worker 0-0, policy_version 187695 (0.00083) [2022-07-09 09:26:42,591][25689] Fps is (10 sec: 5926.6, 60 sec: 5746.0, 300 sec: 5750.2). Total num frames: 192199680. Throughput: 0: 6019.5. Samples: 192197000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:42,592][25689] Avg episode reward: [(0, '-51.288')] [2022-07-09 09:26:43,491][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:26:43,503][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000187699_192203776.pth [2022-07-09 09:26:43,504][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000185678_190134272.pth [2022-07-09 09:26:44,679][26022] Updated weights on worker 0-0, policy_version 187705 (0.00091) [2022-07-09 09:26:46,329][26022] Updated weights on worker 0-0, policy_version 187715 (0.00087) [2022-07-09 09:26:47,662][25689] Fps is (10 sec: 5894.7, 60 sec: 5745.6, 300 sec: 5749.0). Total num frames: 192227328. Throughput: 0: 6016.1. Samples: 192231382. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:47,662][25689] Avg episode reward: [(0, '-51.041')] [2022-07-09 09:26:48,244][26022] Updated weights on worker 0-0, policy_version 187725 (0.00095) [2022-07-09 09:26:49,851][26022] Updated weights on worker 0-0, policy_version 187735 (0.00090) [2022-07-09 09:26:51,759][26022] Updated weights on worker 0-0, policy_version 187745 (0.00588) [2022-07-09 09:26:52,674][25689] Fps is (10 sec: 5687.2, 60 sec: 5730.0, 300 sec: 5745.6). Total num frames: 192257024. Throughput: 0: 6037.8. Samples: 192266166. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-09 09:26:52,675][25689] Avg episode reward: [(0, '-51.097')] [2022-07-09 09:26:53,242][26022] Updated weights on worker 0-0, policy_version 187755 (0.00080) [2022-07-09 09:26:55,069][26022] Updated weights on worker 0-0, policy_version 187765 (0.00084) [2022-07-09 09:26:56,863][26022] Updated weights on worker 0-0, policy_version 187775 (0.00088) [2022-07-09 09:26:57,744][25689] Fps is (10 sec: 5789.2, 60 sec: 5742.9, 300 sec: 5742.4). Total num frames: 192285696. Throughput: 0: 6022.7. Samples: 192283852. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:26:57,744][25689] Avg episode reward: [(0, '-51.265')] [2022-07-09 09:26:58,722][26022] Updated weights on worker 0-0, policy_version 187785 (0.00084) [2022-07-09 09:27:00,511][26022] Updated weights on worker 0-0, policy_version 187795 (0.00092) [2022-07-09 09:27:02,754][25689] Fps is (10 sec: 5485.8, 60 sec: 5727.5, 300 sec: 5750.1). Total num frames: 192312320. Throughput: 0: 5942.6. Samples: 192316820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:02,754][25689] Avg episode reward: [(0, '-51.893')] [2022-07-09 09:27:02,756][26022] Updated weights on worker 0-0, policy_version 187805 (0.00092) [2022-07-09 09:27:04,152][26022] Updated weights on worker 0-0, policy_version 187815 (0.00089) [2022-07-09 09:27:06,408][26022] Updated weights on worker 0-0, policy_version 187825 (0.00093) [2022-07-09 09:27:07,798][25689] Fps is (10 sec: 5601.9, 60 sec: 5744.9, 300 sec: 5746.4). Total num frames: 192342016. Throughput: 0: 5926.8. Samples: 192350724. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:07,798][25689] Avg episode reward: [(0, '-52.497')] [2022-07-09 09:27:07,872][26022] Updated weights on worker 0-0, policy_version 187835 (0.00093) [2022-07-09 09:27:09,787][26022] Updated weights on worker 0-0, policy_version 187845 (0.00090) [2022-07-09 09:27:11,363][26022] Updated weights on worker 0-0, policy_version 187855 (0.00088) [2022-07-09 09:27:12,828][25689] Fps is (10 sec: 5793.8, 60 sec: 5709.6, 300 sec: 5742.6). Total num frames: 192370688. Throughput: 0: 5068.5. Samples: 192368318. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:12,829][25689] Avg episode reward: [(0, '-51.766')] [2022-07-09 09:27:13,346][26022] Updated weights on worker 0-0, policy_version 187865 (0.00086) [2022-07-09 09:27:14,921][26022] Updated weights on worker 0-0, policy_version 187875 (0.00084) [2022-07-09 09:27:17,028][26022] Updated weights on worker 0-0, policy_version 187885 (0.00092) [2022-07-09 09:27:17,882][25689] Fps is (10 sec: 5686.6, 60 sec: 5722.2, 300 sec: 5749.3). Total num frames: 192399360. Throughput: 0: 5893.9. Samples: 192402542. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:17,882][25689] Avg episode reward: [(0, '-52.375')] [2022-07-09 09:27:18,513][26022] Updated weights on worker 0-0, policy_version 187895 (0.00087) [2022-07-09 09:27:20,532][26022] Updated weights on worker 0-0, policy_version 187905 (0.00093) [2022-07-09 09:27:22,020][26022] Updated weights on worker 0-0, policy_version 187915 (0.00082) [2022-07-09 09:27:22,974][25689] Fps is (10 sec: 5753.1, 60 sec: 5735.4, 300 sec: 5746.6). Total num frames: 192429056. Throughput: 0: 5967.0. Samples: 192437470. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:22,974][25689] Avg episode reward: [(0, '-52.544')] [2022-07-09 09:27:24,026][26022] Updated weights on worker 0-0, policy_version 187925 (0.00088) [2022-07-09 09:27:25,889][26022] Updated weights on worker 0-0, policy_version 187935 (0.00092) [2022-07-09 09:27:27,507][26022] Updated weights on worker 0-0, policy_version 187945 (0.00093) [2022-07-09 09:27:28,075][25689] Fps is (10 sec: 5826.8, 60 sec: 5736.4, 300 sec: 5748.8). Total num frames: 192458752. Throughput: 0: 5129.4. Samples: 192454732. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:28,075][25689] Avg episode reward: [(0, '-52.897')] [2022-07-09 09:27:29,292][26022] Updated weights on worker 0-0, policy_version 187955 (0.00090) [2022-07-09 09:27:31,084][26022] Updated weights on worker 0-0, policy_version 187965 (0.00092) [2022-07-09 09:27:32,803][26022] Updated weights on worker 0-0, policy_version 187975 (0.00087) [2022-07-09 09:27:33,091][25689] Fps is (10 sec: 5769.3, 60 sec: 5735.8, 300 sec: 5745.9). Total num frames: 192487424. Throughput: 0: 5974.5. Samples: 192489376. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:33,091][25689] Avg episode reward: [(0, '-52.183')] [2022-07-09 09:27:34,608][26022] Updated weights on worker 0-0, policy_version 187985 (0.00091) [2022-07-09 09:27:36,251][26022] Updated weights on worker 0-0, policy_version 187995 (0.00091) [2022-07-09 09:27:37,934][26022] Updated weights on worker 0-0, policy_version 188005 (0.00101) [2022-07-09 09:27:38,125][25689] Fps is (10 sec: 5909.5, 60 sec: 5784.8, 300 sec: 5749.7). Total num frames: 192518144. Throughput: 0: 6013.3. Samples: 192524270. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:38,126][25689] Avg episode reward: [(0, '-51.105')] [2022-07-09 09:27:40,002][26022] Updated weights on worker 0-0, policy_version 188015 (0.00085) [2022-07-09 09:27:41,416][26022] Updated weights on worker 0-0, policy_version 188025 (0.00086) [2022-07-09 09:27:43,133][25689] Fps is (10 sec: 5710.2, 60 sec: 5700.0, 300 sec: 5743.9). Total num frames: 192544768. Throughput: 0: 5166.5. Samples: 192541624. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:43,135][25689] Avg episode reward: [(0, '-51.324')] [2022-07-09 09:27:43,528][26022] Updated weights on worker 0-0, policy_version 188035 (0.00087) [2022-07-09 09:27:45,083][26022] Updated weights on worker 0-0, policy_version 188045 (0.00087) [2022-07-09 09:27:46,885][26022] Updated weights on worker 0-0, policy_version 188055 (0.00088) [2022-07-09 09:27:48,187][25689] Fps is (10 sec: 5597.2, 60 sec: 5735.4, 300 sec: 5743.0). Total num frames: 192574464. Throughput: 0: 6045.5. Samples: 192576322. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:48,188][25689] Avg episode reward: [(0, '-50.997')] [2022-07-09 09:27:48,852][26022] Updated weights on worker 0-0, policy_version 188065 (0.00086) [2022-07-09 09:27:50,513][26022] Updated weights on worker 0-0, policy_version 188075 (0.00090) [2022-07-09 09:27:52,247][26022] Updated weights on worker 0-0, policy_version 188085 (0.00080) [2022-07-09 09:27:53,204][25689] Fps is (10 sec: 5897.4, 60 sec: 5735.0, 300 sec: 5746.6). Total num frames: 192604160. Throughput: 0: 6056.6. Samples: 192611194. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:53,206][25689] Avg episode reward: [(0, '-50.430')] [2022-07-09 09:27:54,024][26022] Updated weights on worker 0-0, policy_version 188095 (0.00087) [2022-07-09 09:27:55,699][26022] Updated weights on worker 0-0, policy_version 188105 (0.00105) [2022-07-09 09:27:57,684][26022] Updated weights on worker 0-0, policy_version 188115 (0.00089) [2022-07-09 09:27:58,237][25689] Fps is (10 sec: 5909.6, 60 sec: 5755.3, 300 sec: 5746.2). Total num frames: 192633856. Throughput: 0: 5186.6. Samples: 192628584. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:27:58,238][25689] Avg episode reward: [(0, '-50.313')] [2022-07-09 09:27:59,273][26022] Updated weights on worker 0-0, policy_version 188125 (0.00083) [2022-07-09 09:28:01,127][26022] Updated weights on worker 0-0, policy_version 188135 (0.00078) [2022-07-09 09:28:03,133][26022] Updated weights on worker 0-0, policy_version 188145 (0.00084) [2022-07-09 09:28:03,262][25689] Fps is (10 sec: 5599.3, 60 sec: 5753.9, 300 sec: 5750.6). Total num frames: 192660480. Throughput: 0: 6056.3. Samples: 192663532. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:03,264][25689] Avg episode reward: [(0, '-50.800')] [2022-07-09 09:28:05,019][26022] Updated weights on worker 0-0, policy_version 188155 (0.00087) [2022-07-09 09:28:06,851][26022] Updated weights on worker 0-0, policy_version 188165 (0.00097) [2022-07-09 09:28:08,276][26022] Updated weights on worker 0-0, policy_version 188175 (0.00090) [2022-07-09 09:28:08,382][25689] Fps is (10 sec: 5652.7, 60 sec: 5763.6, 300 sec: 5748.3). Total num frames: 192691200. Throughput: 0: 5926.9. Samples: 192696014. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:08,382][25689] Avg episode reward: [(0, '-50.935')] [2022-07-09 09:28:10,354][26022] Updated weights on worker 0-0, policy_version 188185 (0.00090) [2022-07-09 09:28:11,997][26022] Updated weights on worker 0-0, policy_version 188195 (0.00087) [2022-07-09 09:28:13,392][25689] Fps is (10 sec: 5863.2, 60 sec: 5765.6, 300 sec: 5748.4). Total num frames: 192719872. Throughput: 0: 5060.6. Samples: 192713358. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:13,392][25689] Avg episode reward: [(0, '-50.515')] [2022-07-09 09:28:13,734][26022] Updated weights on worker 0-0, policy_version 188205 (0.00084) [2022-07-09 09:28:15,667][26022] Updated weights on worker 0-0, policy_version 188215 (0.00085) [2022-07-09 09:28:17,314][26022] Updated weights on worker 0-0, policy_version 188225 (0.00088) [2022-07-09 09:28:18,422][25689] Fps is (10 sec: 5609.4, 60 sec: 5750.9, 300 sec: 5744.6). Total num frames: 192747520. Throughput: 0: 5926.6. Samples: 192748212. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:18,423][25689] Avg episode reward: [(0, '-50.408')] [2022-07-09 09:28:19,115][26022] Updated weights on worker 0-0, policy_version 188235 (0.00085) [2022-07-09 09:28:20,898][26022] Updated weights on worker 0-0, policy_version 188245 (0.00086) [2022-07-09 09:28:22,713][26022] Updated weights on worker 0-0, policy_version 188255 (0.00081) [2022-07-09 09:28:23,424][25689] Fps is (10 sec: 5715.9, 60 sec: 5759.4, 300 sec: 5745.9). Total num frames: 192777216. Throughput: 0: 5905.4. Samples: 192782596. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:23,425][25689] Avg episode reward: [(0, '-50.749')] [2022-07-09 09:28:24,471][26022] Updated weights on worker 0-0, policy_version 188265 (0.00086) [2022-07-09 09:28:26,345][26022] Updated weights on worker 0-0, policy_version 188275 (0.00088) [2022-07-09 09:28:28,056][26022] Updated weights on worker 0-0, policy_version 188285 (0.00087) [2022-07-09 09:28:28,489][25689] Fps is (10 sec: 5696.2, 60 sec: 5729.0, 300 sec: 5737.9). Total num frames: 192804864. Throughput: 0: 5168.7. Samples: 192799942. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:28,490][25689] Avg episode reward: [(0, '-50.970')] [2022-07-09 09:28:30,057][26022] Updated weights on worker 0-0, policy_version 188295 (0.00088) [2022-07-09 09:28:31,743][26022] Updated weights on worker 0-0, policy_version 188305 (0.00082) [2022-07-09 09:28:33,349][26022] Updated weights on worker 0-0, policy_version 188315 (0.00093) [2022-07-09 09:28:33,495][25689] Fps is (10 sec: 5796.0, 60 sec: 5763.9, 300 sec: 5745.0). Total num frames: 192835584. Throughput: 0: 6022.9. Samples: 192834436. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:33,495][25689] Avg episode reward: [(0, '-51.456')] [2022-07-09 09:28:35,308][26022] Updated weights on worker 0-0, policy_version 188325 (0.00084) [2022-07-09 09:28:36,852][26022] Updated weights on worker 0-0, policy_version 188335 (0.00087) [2022-07-09 09:28:38,516][25689] Fps is (10 sec: 5718.9, 60 sec: 5697.2, 300 sec: 5745.2). Total num frames: 192862208. Throughput: 0: 6017.2. Samples: 192869122. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:38,517][25689] Avg episode reward: [(0, '-51.270')] [2022-07-09 09:28:38,811][26022] Updated weights on worker 0-0, policy_version 188345 (0.00081) [2022-07-09 09:28:40,395][26022] Updated weights on worker 0-0, policy_version 188355 (0.00097) [2022-07-09 09:28:42,256][26022] Updated weights on worker 0-0, policy_version 188365 (0.00085) [2022-07-09 09:28:43,510][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:28:43,522][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000188372_192892928.pth [2022-07-09 09:28:43,522][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000186350_190822400.pth [2022-07-09 09:28:43,523][25689] Fps is (10 sec: 5718.0, 60 sec: 5765.2, 300 sec: 5746.1). Total num frames: 192892928. Throughput: 0: 5172.1. Samples: 192886550. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:43,524][25689] Avg episode reward: [(0, '-52.280')] [2022-07-09 09:28:44,074][26022] Updated weights on worker 0-0, policy_version 188375 (0.00082) [2022-07-09 09:28:45,830][26022] Updated weights on worker 0-0, policy_version 188385 (0.00091) [2022-07-09 09:28:47,879][26022] Updated weights on worker 0-0, policy_version 188395 (0.00084) [2022-07-09 09:28:48,581][25689] Fps is (10 sec: 5799.0, 60 sec: 5730.9, 300 sec: 5738.6). Total num frames: 192920576. Throughput: 0: 6034.6. Samples: 192921190. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:48,582][25689] Avg episode reward: [(0, '-51.486')] [2022-07-09 09:28:49,194][26022] Updated weights on worker 0-0, policy_version 188405 (0.00085) [2022-07-09 09:28:51,365][26022] Updated weights on worker 0-0, policy_version 188415 (0.00085) [2022-07-09 09:28:53,217][26022] Updated weights on worker 0-0, policy_version 188425 (0.00091) [2022-07-09 09:28:53,594][25689] Fps is (10 sec: 5491.0, 60 sec: 5697.4, 300 sec: 5738.5). Total num frames: 192948224. Throughput: 0: 6010.8. Samples: 192955246. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:53,598][25689] Avg episode reward: [(0, '-51.157')] [2022-07-09 09:28:54,733][26022] Updated weights on worker 0-0, policy_version 188435 (0.00081) [2022-07-09 09:28:56,751][26022] Updated weights on worker 0-0, policy_version 188445 (0.00084) [2022-07-09 09:28:58,180][26022] Updated weights on worker 0-0, policy_version 188455 (0.00079) [2022-07-09 09:28:58,607][25689] Fps is (10 sec: 5822.0, 60 sec: 5716.3, 300 sec: 5745.6). Total num frames: 192978944. Throughput: 0: 5157.3. Samples: 192972736. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:28:58,607][25689] Avg episode reward: [(0, '-51.015')] [2022-07-09 09:29:00,099][26022] Updated weights on worker 0-0, policy_version 188465 (0.00086) [2022-07-09 09:29:02,601][26022] Updated weights on worker 0-0, policy_version 188475 (0.00088) [2022-07-09 09:29:03,615][25689] Fps is (10 sec: 5926.3, 60 sec: 5751.7, 300 sec: 5746.5). Total num frames: 193007616. Throughput: 0: 6012.7. Samples: 193007358. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2022-07-09 09:29:03,616][25689] Avg episode reward: [(0, '-50.421')] [2022-07-09 09:29:03,737][26022] Updated weights on worker 0-0, policy_version 188485 (0.00088) [2022-07-09 09:29:05,925][26022] Updated weights on worker 0-0, policy_version 188495 (0.00078) [2022-07-09 09:29:07,388][26022] Updated weights on worker 0-0, policy_version 188505 (0.00095) [2022-07-09 09:29:08,714][25689] Fps is (10 sec: 5572.5, 60 sec: 5702.9, 300 sec: 5737.9). Total num frames: 193035264. Throughput: 0: 5915.1. Samples: 193040274. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:08,714][25689] Avg episode reward: [(0, '-51.504')] [2022-07-09 09:29:09,251][26022] Updated weights on worker 0-0, policy_version 188515 (0.00087) [2022-07-09 09:29:11,289][26022] Updated weights on worker 0-0, policy_version 188525 (0.00085) [2022-07-09 09:29:12,649][26022] Updated weights on worker 0-0, policy_version 188535 (0.00091) [2022-07-09 09:29:13,784][25689] Fps is (10 sec: 5538.4, 60 sec: 5697.1, 300 sec: 5745.3). Total num frames: 193063936. Throughput: 0: 5069.4. Samples: 193057602. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:13,785][25689] Avg episode reward: [(0, '-51.190')] [2022-07-09 09:29:14,687][26022] Updated weights on worker 0-0, policy_version 188545 (0.00087) [2022-07-09 09:29:16,371][26022] Updated weights on worker 0-0, policy_version 188555 (0.00098) [2022-07-09 09:29:18,346][26022] Updated weights on worker 0-0, policy_version 188565 (0.00088) [2022-07-09 09:29:18,802][25689] Fps is (10 sec: 5684.4, 60 sec: 5715.3, 300 sec: 5739.2). Total num frames: 193092608. Throughput: 0: 5920.9. Samples: 193092306. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:18,802][25689] Avg episode reward: [(0, '-52.025')] [2022-07-09 09:29:19,982][26022] Updated weights on worker 0-0, policy_version 188575 (0.00086) [2022-07-09 09:29:22,040][26022] Updated weights on worker 0-0, policy_version 188585 (0.00083) [2022-07-09 09:29:23,488][26022] Updated weights on worker 0-0, policy_version 188595 (0.00093) [2022-07-09 09:29:23,806][25689] Fps is (10 sec: 5824.5, 60 sec: 5715.1, 300 sec: 5744.7). Total num frames: 193122304. Throughput: 0: 5891.8. Samples: 193126314. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:23,806][25689] Avg episode reward: [(0, '-52.673')] [2022-07-09 09:29:25,646][26022] Updated weights on worker 0-0, policy_version 188605 (0.00086) [2022-07-09 09:29:27,143][26022] Updated weights on worker 0-0, policy_version 188615 (0.00085) [2022-07-09 09:29:28,887][25689] Fps is (10 sec: 5686.0, 60 sec: 5713.6, 300 sec: 5737.7). Total num frames: 193149952. Throughput: 0: 5108.2. Samples: 193143322. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:28,887][25689] Avg episode reward: [(0, '-52.721')] [2022-07-09 09:29:29,163][26022] Updated weights on worker 0-0, policy_version 188625 (0.00090) [2022-07-09 09:29:30,860][26022] Updated weights on worker 0-0, policy_version 188635 (0.00091) [2022-07-09 09:29:32,831][26022] Updated weights on worker 0-0, policy_version 188645 (0.00081) [2022-07-09 09:29:33,900][25689] Fps is (10 sec: 5782.1, 60 sec: 5712.9, 300 sec: 5742.2). Total num frames: 193180672. Throughput: 0: 5970.6. Samples: 193177706. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:33,901][25689] Avg episode reward: [(0, '-52.337')] [2022-07-09 09:29:34,254][26022] Updated weights on worker 0-0, policy_version 188655 (0.00087) [2022-07-09 09:29:36,356][26022] Updated weights on worker 0-0, policy_version 188665 (0.00090) [2022-07-09 09:29:37,723][26022] Updated weights on worker 0-0, policy_version 188675 (0.00513) [2022-07-09 09:29:38,907][25689] Fps is (10 sec: 5620.5, 60 sec: 5697.3, 300 sec: 5728.7). Total num frames: 193206272. Throughput: 0: 5961.7. Samples: 193212170. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:38,908][25689] Avg episode reward: [(0, '-51.755')] [2022-07-09 09:29:39,815][26022] Updated weights on worker 0-0, policy_version 188685 (0.00087) [2022-07-09 09:29:41,511][26022] Updated weights on worker 0-0, policy_version 188695 (0.00086) [2022-07-09 09:29:43,181][26022] Updated weights on worker 0-0, policy_version 188705 (0.00085) [2022-07-09 09:29:43,910][25689] Fps is (10 sec: 5626.3, 60 sec: 5697.7, 300 sec: 5736.4). Total num frames: 193236992. Throughput: 0: 5134.0. Samples: 193229532. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:43,911][25689] Avg episode reward: [(0, '-51.697')] [2022-07-09 09:29:45,064][26022] Updated weights on worker 0-0, policy_version 188715 (0.00087) [2022-07-09 09:29:46,751][26022] Updated weights on worker 0-0, policy_version 188725 (0.00089) [2022-07-09 09:29:48,636][26022] Updated weights on worker 0-0, policy_version 188735 (0.00089) [2022-07-09 09:29:49,037][25689] Fps is (10 sec: 5964.3, 60 sec: 5725.1, 300 sec: 5735.1). Total num frames: 193266688. Throughput: 0: 6012.4. Samples: 193264472. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:49,037][25689] Avg episode reward: [(0, '-52.302')] [2022-07-09 09:29:50,280][26022] Updated weights on worker 0-0, policy_version 188745 (0.00757) [2022-07-09 09:29:52,065][26022] Updated weights on worker 0-0, policy_version 188755 (0.00093) [2022-07-09 09:29:53,919][26022] Updated weights on worker 0-0, policy_version 188765 (0.00091) [2022-07-09 09:29:54,048][25689] Fps is (10 sec: 5757.3, 60 sec: 5742.1, 300 sec: 5728.4). Total num frames: 193295360. Throughput: 0: 6040.2. Samples: 193299404. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:54,049][25689] Avg episode reward: [(0, '-52.197')] [2022-07-09 09:29:55,623][26022] Updated weights on worker 0-0, policy_version 188775 (0.00095) [2022-07-09 09:29:57,395][26022] Updated weights on worker 0-0, policy_version 188785 (0.00090) [2022-07-09 09:29:59,084][25689] Fps is (10 sec: 5707.6, 60 sec: 5706.1, 300 sec: 5738.7). Total num frames: 193324032. Throughput: 0: 6053.4. Samples: 193334306. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:29:59,084][25689] Avg episode reward: [(0, '-51.201')] [2022-07-09 09:29:59,294][26022] Updated weights on worker 0-0, policy_version 188795 (0.00089) [2022-07-09 09:30:00,932][26022] Updated weights on worker 0-0, policy_version 188805 (0.00088) [2022-07-09 09:30:03,131][26022] Updated weights on worker 0-0, policy_version 188815 (0.00084) [2022-07-09 09:30:04,119][25689] Fps is (10 sec: 5694.3, 60 sec: 5703.6, 300 sec: 5733.8). Total num frames: 193352704. Throughput: 0: 5980.6. Samples: 193350390. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:04,119][25689] Avg episode reward: [(0, '-51.617')] [2022-07-09 09:30:04,721][26022] Updated weights on worker 0-0, policy_version 188825 (0.00081) [2022-07-09 09:30:06,506][26022] Updated weights on worker 0-0, policy_version 188835 (0.00086) [2022-07-09 09:30:08,383][26022] Updated weights on worker 0-0, policy_version 188845 (0.00098) [2022-07-09 09:30:09,182][25689] Fps is (10 sec: 5780.1, 60 sec: 5740.8, 300 sec: 5736.1). Total num frames: 193382400. Throughput: 0: 5964.3. Samples: 193384624. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:09,182][25689] Avg episode reward: [(0, '-52.461')] [2022-07-09 09:30:10,051][26022] Updated weights on worker 0-0, policy_version 188855 (0.00084) [2022-07-09 09:30:11,707][26022] Updated weights on worker 0-0, policy_version 188865 (0.00085) [2022-07-09 09:30:13,775][26022] Updated weights on worker 0-0, policy_version 188875 (0.00387) [2022-07-09 09:30:14,193][25689] Fps is (10 sec: 5692.0, 60 sec: 5729.5, 300 sec: 5729.2). Total num frames: 193410048. Throughput: 0: 5973.6. Samples: 193419742. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:14,195][25689] Avg episode reward: [(0, '-51.383')] [2022-07-09 09:30:15,169][26022] Updated weights on worker 0-0, policy_version 188885 (0.00086) [2022-07-09 09:30:17,302][26022] Updated weights on worker 0-0, policy_version 188895 (0.00086) [2022-07-09 09:30:18,747][26022] Updated weights on worker 0-0, policy_version 188905 (0.00087) [2022-07-09 09:30:19,199][25689] Fps is (10 sec: 5826.6, 60 sec: 5764.5, 300 sec: 5736.4). Total num frames: 193440768. Throughput: 0: 5120.0. Samples: 193437298. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:19,200][25689] Avg episode reward: [(0, '-51.402')] [2022-07-09 09:30:20,774][26022] Updated weights on worker 0-0, policy_version 188915 (0.00081) [2022-07-09 09:30:22,352][26022] Updated weights on worker 0-0, policy_version 188925 (0.00058) [2022-07-09 09:30:24,215][25689] Fps is (10 sec: 5823.9, 60 sec: 5729.4, 300 sec: 5731.0). Total num frames: 193468416. Throughput: 0: 6041.8. Samples: 193471810. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:24,216][25689] Avg episode reward: [(0, '-51.467')] [2022-07-09 09:30:24,280][26022] Updated weights on worker 0-0, policy_version 188935 (0.00089) [2022-07-09 09:30:25,862][26022] Updated weights on worker 0-0, policy_version 188945 (0.00086) [2022-07-09 09:30:27,862][26022] Updated weights on worker 0-0, policy_version 188955 (0.00095) [2022-07-09 09:30:29,311][25689] Fps is (10 sec: 5671.3, 60 sec: 5762.0, 300 sec: 5733.8). Total num frames: 193498112. Throughput: 0: 6055.7. Samples: 193506518. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:29,312][25689] Avg episode reward: [(0, '-50.690')] [2022-07-09 09:30:29,541][26022] Updated weights on worker 0-0, policy_version 188965 (0.00087) [2022-07-09 09:30:31,180][26022] Updated weights on worker 0-0, policy_version 188975 (0.00092) [2022-07-09 09:30:33,243][26022] Updated weights on worker 0-0, policy_version 188985 (0.00083) [2022-07-09 09:30:34,327][25689] Fps is (10 sec: 5873.8, 60 sec: 5744.8, 300 sec: 5737.5). Total num frames: 193527808. Throughput: 0: 5175.0. Samples: 193523932. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:34,327][25689] Avg episode reward: [(0, '-49.832')] [2022-07-09 09:30:34,759][26022] Updated weights on worker 0-0, policy_version 188995 (0.00090) [2022-07-09 09:30:36,620][26022] Updated weights on worker 0-0, policy_version 189005 (0.00085) [2022-07-09 09:30:38,575][26022] Updated weights on worker 0-0, policy_version 189015 (0.00086) [2022-07-09 09:30:39,335][25689] Fps is (10 sec: 5822.9, 60 sec: 5795.5, 300 sec: 5734.1). Total num frames: 193556480. Throughput: 0: 6044.2. Samples: 193559000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:39,335][25689] Avg episode reward: [(0, '-49.908')] [2022-07-09 09:30:39,917][26022] Updated weights on worker 0-0, policy_version 189025 (0.00080) [2022-07-09 09:30:42,051][26022] Updated weights on worker 0-0, policy_version 189035 (0.00100) [2022-07-09 09:30:43,508][26022] Updated weights on worker 0-0, policy_version 189045 (0.00084) [2022-07-09 09:30:43,662][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:30:43,676][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000189046_193583104.pth [2022-07-09 09:30:43,676][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000187025_191513600.pth [2022-07-09 09:30:44,369][25689] Fps is (10 sec: 5812.1, 60 sec: 5775.6, 300 sec: 5741.6). Total num frames: 193586176. Throughput: 0: 6073.5. Samples: 193594214. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:44,370][25689] Avg episode reward: [(0, '-49.228')] [2022-07-09 09:30:45,475][26022] Updated weights on worker 0-0, policy_version 189055 (0.00084) [2022-07-09 09:30:47,124][26022] Updated weights on worker 0-0, policy_version 189065 (0.00054) [2022-07-09 09:30:48,993][26022] Updated weights on worker 0-0, policy_version 189075 (0.00085) [2022-07-09 09:30:49,405][25689] Fps is (10 sec: 5897.9, 60 sec: 5784.3, 300 sec: 5738.0). Total num frames: 193615872. Throughput: 0: 5223.2. Samples: 193611472. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:49,405][25689] Avg episode reward: [(0, '-49.438')] [2022-07-09 09:30:50,763][26022] Updated weights on worker 0-0, policy_version 189085 (0.00087) [2022-07-09 09:30:52,656][26022] Updated weights on worker 0-0, policy_version 189095 (0.00086) [2022-07-09 09:30:54,400][26022] Updated weights on worker 0-0, policy_version 189105 (0.00084) [2022-07-09 09:30:54,406][25689] Fps is (10 sec: 5713.4, 60 sec: 5768.3, 300 sec: 5738.5). Total num frames: 193643520. Throughput: 0: 6069.6. Samples: 193645804. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:54,406][25689] Avg episode reward: [(0, '-50.186')] [2022-07-09 09:30:56,272][26022] Updated weights on worker 0-0, policy_version 189115 (0.00090) [2022-07-09 09:30:57,900][26022] Updated weights on worker 0-0, policy_version 189125 (0.00092) [2022-07-09 09:30:59,420][25689] Fps is (10 sec: 5521.0, 60 sec: 5753.3, 300 sec: 5738.7). Total num frames: 193671168. Throughput: 0: 6042.8. Samples: 193680372. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:30:59,422][25689] Avg episode reward: [(0, '-50.959')] [2022-07-09 09:30:59,786][26022] Updated weights on worker 0-0, policy_version 189135 (0.00083) [2022-07-09 09:31:01,179][26022] Updated weights on worker 0-0, policy_version 189145 (0.00985) [2022-07-09 09:31:03,647][26022] Updated weights on worker 0-0, policy_version 189155 (0.00091) [2022-07-09 09:31:04,441][25689] Fps is (10 sec: 5714.4, 60 sec: 5771.7, 300 sec: 5742.7). Total num frames: 193700864. Throughput: 0: 5065.6. Samples: 193695890. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:31:04,441][25689] Avg episode reward: [(0, '-50.307')] [2022-07-09 09:31:05,277][26022] Updated weights on worker 0-0, policy_version 189165 (0.00089) [2022-07-09 09:31:06,960][26022] Updated weights on worker 0-0, policy_version 189175 (0.00090) [2022-07-09 09:31:08,835][26022] Updated weights on worker 0-0, policy_version 189185 (0.00088) [2022-07-09 09:31:09,498][25689] Fps is (10 sec: 5690.0, 60 sec: 5738.3, 300 sec: 5731.6). Total num frames: 193728512. Throughput: 0: 5943.2. Samples: 193730892. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:31:09,505][25689] Avg episode reward: [(0, '-50.233')] [2022-07-09 09:31:10,496][26022] Updated weights on worker 0-0, policy_version 189195 (0.00086) [2022-07-09 09:31:12,314][26022] Updated weights on worker 0-0, policy_version 189205 (0.00091) [2022-07-09 09:31:14,087][26022] Updated weights on worker 0-0, policy_version 189215 (0.00083) [2022-07-09 09:31:14,530][25689] Fps is (10 sec: 5683.6, 60 sec: 5770.3, 300 sec: 5738.0). Total num frames: 193758208. Throughput: 0: 5964.8. Samples: 193765842. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 09:31:14,531][25689] Avg episode reward: [(0, '-50.893')] [2022-07-09 09:31:15,638][26022] Updated weights on worker 0-0, policy_version 189225 (0.00090) [2022-07-09 09:31:17,657][26022] Updated weights on worker 0-0, policy_version 189235 (0.00084) [2022-07-09 09:31:19,332][26022] Updated weights on worker 0-0, policy_version 189245 (0.00084) [2022-07-09 09:31:19,535][25689] Fps is (10 sec: 5815.3, 60 sec: 5736.4, 300 sec: 5738.9). Total num frames: 193786880. Throughput: 0: 5123.6. Samples: 193783432. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:19,536][25689] Avg episode reward: [(0, '-50.726')] [2022-07-09 09:31:21,146][26022] Updated weights on worker 0-0, policy_version 189255 (0.00089) [2022-07-09 09:31:22,928][26022] Updated weights on worker 0-0, policy_version 189265 (0.00094) [2022-07-09 09:31:24,547][25689] Fps is (10 sec: 5827.0, 60 sec: 5770.8, 300 sec: 5740.9). Total num frames: 193816576. Throughput: 0: 6080.4. Samples: 193818144. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:24,555][25689] Avg episode reward: [(0, '-50.905')] [2022-07-09 09:31:24,574][26022] Updated weights on worker 0-0, policy_version 189275 (0.00126) [2022-07-09 09:31:26,554][26022] Updated weights on worker 0-0, policy_version 189285 (0.00088) [2022-07-09 09:31:28,265][26022] Updated weights on worker 0-0, policy_version 189295 (0.00084) [2022-07-09 09:31:29,650][25689] Fps is (10 sec: 5871.7, 60 sec: 5770.0, 300 sec: 5742.5). Total num frames: 193846272. Throughput: 0: 6039.4. Samples: 193852598. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:29,651][25689] Avg episode reward: [(0, '-50.055')] [2022-07-09 09:31:29,889][26022] Updated weights on worker 0-0, policy_version 189305 (0.00087) [2022-07-09 09:31:31,857][26022] Updated weights on worker 0-0, policy_version 189315 (0.00083) [2022-07-09 09:31:33,479][26022] Updated weights on worker 0-0, policy_version 189325 (0.00084) [2022-07-09 09:31:34,671][25689] Fps is (10 sec: 5563.1, 60 sec: 5718.7, 300 sec: 5739.0). Total num frames: 193872896. Throughput: 0: 5165.2. Samples: 193869872. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:34,671][25689] Avg episode reward: [(0, '-50.054')] [2022-07-09 09:31:35,349][26022] Updated weights on worker 0-0, policy_version 189335 (0.00091) [2022-07-09 09:31:36,960][26022] Updated weights on worker 0-0, policy_version 189345 (0.00086) [2022-07-09 09:31:38,850][26022] Updated weights on worker 0-0, policy_version 189355 (0.00365) [2022-07-09 09:31:39,673][25689] Fps is (10 sec: 5721.5, 60 sec: 5753.2, 300 sec: 5735.6). Total num frames: 193903616. Throughput: 0: 6016.3. Samples: 193904586. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:39,673][25689] Avg episode reward: [(0, '-50.193')] [2022-07-09 09:31:40,736][26022] Updated weights on worker 0-0, policy_version 189365 (0.00083) [2022-07-09 09:31:42,518][26022] Updated weights on worker 0-0, policy_version 189375 (0.00084) [2022-07-09 09:31:44,064][26022] Updated weights on worker 0-0, policy_version 189385 (0.00089) [2022-07-09 09:31:44,719][25689] Fps is (10 sec: 6012.7, 60 sec: 5752.1, 300 sec: 5742.9). Total num frames: 193933312. Throughput: 0: 6011.7. Samples: 193939412. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:44,719][25689] Avg episode reward: [(0, '-50.798')] [2022-07-09 09:31:46,106][26022] Updated weights on worker 0-0, policy_version 189395 (0.00087) [2022-07-09 09:31:47,527][26022] Updated weights on worker 0-0, policy_version 189405 (0.00089) [2022-07-09 09:31:49,559][26022] Updated weights on worker 0-0, policy_version 189415 (0.00083) [2022-07-09 09:31:49,814][25689] Fps is (10 sec: 5856.2, 60 sec: 5746.4, 300 sec: 5741.3). Total num frames: 193963008. Throughput: 0: 5169.6. Samples: 193956840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:49,815][25689] Avg episode reward: [(0, '-50.245')] [2022-07-09 09:31:51,088][26022] Updated weights on worker 0-0, policy_version 189425 (0.00091) [2022-07-09 09:31:53,252][26022] Updated weights on worker 0-0, policy_version 189435 (0.00097) [2022-07-09 09:31:54,839][25689] Fps is (10 sec: 5666.1, 60 sec: 5744.1, 300 sec: 5738.8). Total num frames: 193990656. Throughput: 0: 6020.2. Samples: 193991292. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:54,840][25689] Avg episode reward: [(0, '-51.298')] [2022-07-09 09:31:54,850][26022] Updated weights on worker 0-0, policy_version 189445 (0.00097) [2022-07-09 09:31:56,752][26022] Updated weights on worker 0-0, policy_version 189455 (0.00085) [2022-07-09 09:31:58,341][26022] Updated weights on worker 0-0, policy_version 189465 (0.00088) [2022-07-09 09:31:59,842][25689] Fps is (10 sec: 5514.2, 60 sec: 5745.2, 300 sec: 5742.3). Total num frames: 194018304. Throughput: 0: 6023.6. Samples: 194026080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:31:59,843][25689] Avg episode reward: [(0, '-51.716')] [2022-07-09 09:32:00,280][26022] Updated weights on worker 0-0, policy_version 189475 (0.00081) [2022-07-09 09:32:02,331][26022] Updated weights on worker 0-0, policy_version 189485 (0.00091) [2022-07-09 09:32:04,149][26022] Updated weights on worker 0-0, policy_version 189495 (0.00085) [2022-07-09 09:32:04,920][25689] Fps is (10 sec: 5587.1, 60 sec: 5722.8, 300 sec: 5738.2). Total num frames: 194046976. Throughput: 0: 5043.1. Samples: 194041286. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:04,920][25689] Avg episode reward: [(0, '-51.240')] [2022-07-09 09:32:05,963][26022] Updated weights on worker 0-0, policy_version 189505 (0.00087) [2022-07-09 09:32:07,718][26022] Updated weights on worker 0-0, policy_version 189515 (0.00091) [2022-07-09 09:32:09,474][26022] Updated weights on worker 0-0, policy_version 189525 (0.00087) [2022-07-09 09:32:09,969][25689] Fps is (10 sec: 5662.5, 60 sec: 5740.5, 300 sec: 5737.9). Total num frames: 194075648. Throughput: 0: 5887.3. Samples: 194075498. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:09,970][25689] Avg episode reward: [(0, '-51.071')] [2022-07-09 09:32:11,297][26022] Updated weights on worker 0-0, policy_version 189535 (0.00085) [2022-07-09 09:32:13,131][26022] Updated weights on worker 0-0, policy_version 189545 (0.01306) [2022-07-09 09:32:14,830][26022] Updated weights on worker 0-0, policy_version 189555 (0.00083) [2022-07-09 09:32:14,978][25689] Fps is (10 sec: 5803.1, 60 sec: 5742.7, 300 sec: 5742.2). Total num frames: 194105344. Throughput: 0: 5906.9. Samples: 194110248. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:14,978][25689] Avg episode reward: [(0, '-51.598')] [2022-07-09 09:32:16,577][26022] Updated weights on worker 0-0, policy_version 189565 (0.00083) [2022-07-09 09:32:18,415][26022] Updated weights on worker 0-0, policy_version 189575 (0.00094) [2022-07-09 09:32:20,019][25689] Fps is (10 sec: 5807.8, 60 sec: 5739.3, 300 sec: 5739.7). Total num frames: 194134016. Throughput: 0: 5037.6. Samples: 194127724. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:20,020][25689] Avg episode reward: [(0, '-51.342')] [2022-07-09 09:32:20,094][26022] Updated weights on worker 0-0, policy_version 189585 (0.00088) [2022-07-09 09:32:21,987][26022] Updated weights on worker 0-0, policy_version 189595 (0.00086) [2022-07-09 09:32:23,646][26022] Updated weights on worker 0-0, policy_version 189605 (0.00080) [2022-07-09 09:32:25,037][25689] Fps is (10 sec: 5599.0, 60 sec: 5704.9, 300 sec: 5734.4). Total num frames: 194161664. Throughput: 0: 6010.6. Samples: 194162202. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:25,037][25689] Avg episode reward: [(0, '-51.007')] [2022-07-09 09:32:25,623][26022] Updated weights on worker 0-0, policy_version 189615 (0.00089) [2022-07-09 09:32:27,370][26022] Updated weights on worker 0-0, policy_version 189625 (0.00096) [2022-07-09 09:32:29,110][26022] Updated weights on worker 0-0, policy_version 189635 (0.00089) [2022-07-09 09:32:30,100][25689] Fps is (10 sec: 5586.7, 60 sec: 5691.7, 300 sec: 5733.5). Total num frames: 194190336. Throughput: 0: 6014.1. Samples: 194196568. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:30,101][25689] Avg episode reward: [(0, '-51.712')] [2022-07-09 09:32:30,668][26022] Updated weights on worker 0-0, policy_version 189645 (0.00091) [2022-07-09 09:32:32,983][26022] Updated weights on worker 0-0, policy_version 189655 (0.00083) [2022-07-09 09:32:34,313][26022] Updated weights on worker 0-0, policy_version 189665 (0.00083) [2022-07-09 09:32:35,106][25689] Fps is (10 sec: 5898.2, 60 sec: 5760.8, 300 sec: 5734.1). Total num frames: 194221056. Throughput: 0: 5147.5. Samples: 194213860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:35,107][25689] Avg episode reward: [(0, '-52.126')] [2022-07-09 09:32:36,367][26022] Updated weights on worker 0-0, policy_version 189675 (0.00085) [2022-07-09 09:32:37,747][26022] Updated weights on worker 0-0, policy_version 189685 (0.00080) [2022-07-09 09:32:40,082][26022] Updated weights on worker 0-0, policy_version 189695 (0.00085) [2022-07-09 09:32:40,114][25689] Fps is (10 sec: 5726.5, 60 sec: 5692.5, 300 sec: 5734.1). Total num frames: 194247680. Throughput: 0: 6013.7. Samples: 194248568. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:40,115][25689] Avg episode reward: [(0, '-52.614')] [2022-07-09 09:32:41,459][26022] Updated weights on worker 0-0, policy_version 189705 (0.00083) [2022-07-09 09:32:43,376][26022] Updated weights on worker 0-0, policy_version 189715 (0.00088) [2022-07-09 09:32:43,872][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:32:43,886][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000189718_194271232.pth [2022-07-09 09:32:43,887][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000187699_192203776.pth [2022-07-09 09:32:45,027][26022] Updated weights on worker 0-0, policy_version 189725 (0.00085) [2022-07-09 09:32:45,121][25689] Fps is (10 sec: 5725.8, 60 sec: 5713.1, 300 sec: 5738.4). Total num frames: 194278400. Throughput: 0: 6027.1. Samples: 194283254. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:45,122][25689] Avg episode reward: [(0, '-52.395')] [2022-07-09 09:32:46,839][26022] Updated weights on worker 0-0, policy_version 189735 (0.00083) [2022-07-09 09:32:48,546][26022] Updated weights on worker 0-0, policy_version 189745 (0.00090) [2022-07-09 09:32:50,199][25689] Fps is (10 sec: 5889.2, 60 sec: 5697.9, 300 sec: 5733.8). Total num frames: 194307072. Throughput: 0: 5167.1. Samples: 194300420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:50,207][25689] Avg episode reward: [(0, '-52.384')] [2022-07-09 09:32:50,559][26022] Updated weights on worker 0-0, policy_version 189755 (0.00280) [2022-07-09 09:32:52,065][26022] Updated weights on worker 0-0, policy_version 189765 (0.00083) [2022-07-09 09:32:54,153][26022] Updated weights on worker 0-0, policy_version 189775 (0.00077) [2022-07-09 09:32:55,274][25689] Fps is (10 sec: 5749.0, 60 sec: 5727.0, 300 sec: 5733.0). Total num frames: 194336768. Throughput: 0: 6014.1. Samples: 194335150. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:32:55,275][25689] Avg episode reward: [(0, '-53.050')] [2022-07-09 09:32:55,707][26022] Updated weights on worker 0-0, policy_version 189785 (0.00083) [2022-07-09 09:32:57,630][26022] Updated weights on worker 0-0, policy_version 189795 (0.00085) [2022-07-09 09:32:59,424][26022] Updated weights on worker 0-0, policy_version 189805 (0.00096) [2022-07-09 09:33:00,279][25689] Fps is (10 sec: 5790.3, 60 sec: 5743.8, 300 sec: 5740.3). Total num frames: 194365440. Throughput: 0: 6003.7. Samples: 194369634. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:00,280][25689] Avg episode reward: [(0, '-53.481')] [2022-07-09 09:33:01,075][26022] Updated weights on worker 0-0, policy_version 189815 (0.00082) [2022-07-09 09:33:03,125][26022] Updated weights on worker 0-0, policy_version 189825 (0.00084) [2022-07-09 09:33:05,311][25689] Fps is (10 sec: 5305.3, 60 sec: 5680.3, 300 sec: 5721.3). Total num frames: 194390016. Throughput: 0: 5076.0. Samples: 194385736. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:05,312][25689] Avg episode reward: [(0, '-53.212')] [2022-07-09 09:33:05,367][26022] Updated weights on worker 0-0, policy_version 189835 (0.00085) [2022-07-09 09:33:06,585][26022] Updated weights on worker 0-0, policy_version 189845 (0.00089) [2022-07-09 09:33:08,716][26022] Updated weights on worker 0-0, policy_version 189855 (0.00085) [2022-07-09 09:33:10,155][26022] Updated weights on worker 0-0, policy_version 189865 (0.00068) [2022-07-09 09:33:10,387][25689] Fps is (10 sec: 5572.2, 60 sec: 5728.7, 300 sec: 5730.4). Total num frames: 194421760. Throughput: 0: 5915.0. Samples: 194419830. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:10,387][25689] Avg episode reward: [(0, '-53.258')] [2022-07-09 09:33:12,226][26022] Updated weights on worker 0-0, policy_version 189875 (0.00080) [2022-07-09 09:33:13,763][26022] Updated weights on worker 0-0, policy_version 189885 (0.00086) [2022-07-09 09:33:15,395][25689] Fps is (10 sec: 5889.6, 60 sec: 5694.8, 300 sec: 5730.8). Total num frames: 194449408. Throughput: 0: 5932.6. Samples: 194454520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:15,396][25689] Avg episode reward: [(0, '-52.522')] [2022-07-09 09:33:15,843][26022] Updated weights on worker 0-0, policy_version 189895 (0.00081) [2022-07-09 09:33:17,283][26022] Updated weights on worker 0-0, policy_version 189905 (0.00093) [2022-07-09 09:33:19,214][26022] Updated weights on worker 0-0, policy_version 189915 (0.00092) [2022-07-09 09:33:20,433][25689] Fps is (10 sec: 5707.8, 60 sec: 5712.1, 300 sec: 5730.1). Total num frames: 194479104. Throughput: 0: 5940.7. Samples: 194489362. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:20,434][25689] Avg episode reward: [(0, '-53.013')] [2022-07-09 09:33:20,792][26022] Updated weights on worker 0-0, policy_version 189925 (0.00086) [2022-07-09 09:33:22,788][26022] Updated weights on worker 0-0, policy_version 189935 (0.00089) [2022-07-09 09:33:24,425][26022] Updated weights on worker 0-0, policy_version 189945 (0.00093) [2022-07-09 09:33:25,437][25689] Fps is (10 sec: 5812.5, 60 sec: 5730.3, 300 sec: 5734.7). Total num frames: 194507776. Throughput: 0: 6008.2. Samples: 194506658. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:25,437][25689] Avg episode reward: [(0, '-52.247')] [2022-07-09 09:33:26,511][26022] Updated weights on worker 0-0, policy_version 189955 (0.00096) [2022-07-09 09:33:28,126][26022] Updated weights on worker 0-0, policy_version 189965 (0.00086) [2022-07-09 09:33:30,146][26022] Updated weights on worker 0-0, policy_version 189975 (0.00087) [2022-07-09 09:33:30,545][25689] Fps is (10 sec: 5772.4, 60 sec: 5743.1, 300 sec: 5729.3). Total num frames: 194537472. Throughput: 0: 6000.2. Samples: 194540782. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 09:33:30,545][25689] Avg episode reward: [(0, '-52.251')] [2022-07-09 09:33:31,767][26022] Updated weights on worker 0-0, policy_version 189985 (0.00082) [2022-07-09 09:33:33,703][26022] Updated weights on worker 0-0, policy_version 189995 (0.00091) [2022-07-09 09:33:35,144][26022] Updated weights on worker 0-0, policy_version 190005 (0.00088) [2022-07-09 09:33:35,604][25689] Fps is (10 sec: 5740.9, 60 sec: 5704.2, 300 sec: 5735.5). Total num frames: 194566144. Throughput: 0: 5967.6. Samples: 194575118. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:33:35,604][25689] Avg episode reward: [(0, '-51.929')] [2022-07-09 09:33:37,205][26022] Updated weights on worker 0-0, policy_version 190015 (0.00083) [2022-07-09 09:33:38,909][26022] Updated weights on worker 0-0, policy_version 190025 (0.00086) [2022-07-09 09:33:40,652][25689] Fps is (10 sec: 5673.8, 60 sec: 5734.2, 300 sec: 5727.8). Total num frames: 194594816. Throughput: 0: 5103.5. Samples: 194592546. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:33:40,652][25689] Avg episode reward: [(0, '-52.198')] [2022-07-09 09:33:40,698][26022] Updated weights on worker 0-0, policy_version 190035 (0.00078) [2022-07-09 09:33:42,447][26022] Updated weights on worker 0-0, policy_version 190045 (0.00083) [2022-07-09 09:33:44,239][26022] Updated weights on worker 0-0, policy_version 190055 (0.00081) [2022-07-09 09:33:45,671][25689] Fps is (10 sec: 5696.3, 60 sec: 5699.3, 300 sec: 5732.0). Total num frames: 194623488. Throughput: 0: 5954.7. Samples: 194627146. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:33:45,671][25689] Avg episode reward: [(0, '-52.139')] [2022-07-09 09:33:45,909][26022] Updated weights on worker 0-0, policy_version 190065 (0.00083) [2022-07-09 09:33:47,731][26022] Updated weights on worker 0-0, policy_version 190075 (0.00084) [2022-07-09 09:33:49,396][26022] Updated weights on worker 0-0, policy_version 190085 (0.00092) [2022-07-09 09:33:50,738][25689] Fps is (10 sec: 5685.4, 60 sec: 5700.3, 300 sec: 5734.4). Total num frames: 194652160. Throughput: 0: 5989.7. Samples: 194661732. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:33:50,738][25689] Avg episode reward: [(0, '-52.469')] [2022-07-09 09:33:51,556][26022] Updated weights on worker 0-0, policy_version 190095 (0.00092) [2022-07-09 09:33:53,172][26022] Updated weights on worker 0-0, policy_version 190105 (0.00087) [2022-07-09 09:33:54,901][26022] Updated weights on worker 0-0, policy_version 190115 (0.00098) [2022-07-09 09:33:55,837][25689] Fps is (10 sec: 5841.9, 60 sec: 5714.9, 300 sec: 5732.7). Total num frames: 194682880. Throughput: 0: 5127.1. Samples: 194678856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:33:55,838][25689] Avg episode reward: [(0, '-51.532')] [2022-07-09 09:33:56,697][26022] Updated weights on worker 0-0, policy_version 190125 (0.00088) [2022-07-09 09:33:58,451][26022] Updated weights on worker 0-0, policy_version 190135 (0.00089) [2022-07-09 09:34:00,436][26022] Updated weights on worker 0-0, policy_version 190145 (0.00082) [2022-07-09 09:34:00,898][25689] Fps is (10 sec: 5946.3, 60 sec: 5726.6, 300 sec: 5735.1). Total num frames: 194712576. Throughput: 0: 5973.1. Samples: 194713480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:00,899][25689] Avg episode reward: [(0, '-51.377')] [2022-07-09 09:34:02,287][26022] Updated weights on worker 0-0, policy_version 190155 (0.00103) [2022-07-09 09:34:04,263][26022] Updated weights on worker 0-0, policy_version 190165 (0.00092) [2022-07-09 09:34:05,907][25689] Fps is (10 sec: 5592.8, 60 sec: 5762.4, 300 sec: 5733.4). Total num frames: 194739200. Throughput: 0: 5870.6. Samples: 194745946. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:05,908][25689] Avg episode reward: [(0, '-51.049')] [2022-07-09 09:34:05,910][26022] Updated weights on worker 0-0, policy_version 190175 (0.00116) [2022-07-09 09:34:07,774][26022] Updated weights on worker 0-0, policy_version 190185 (0.00084) [2022-07-09 09:34:09,364][26022] Updated weights on worker 0-0, policy_version 190195 (0.00089) [2022-07-09 09:34:10,990][25689] Fps is (10 sec: 5377.8, 60 sec: 5694.2, 300 sec: 5729.7). Total num frames: 194766848. Throughput: 0: 5008.9. Samples: 194763172. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:10,990][25689] Avg episode reward: [(0, '-51.148')] [2022-07-09 09:34:11,344][26022] Updated weights on worker 0-0, policy_version 190205 (0.00091) [2022-07-09 09:34:12,890][26022] Updated weights on worker 0-0, policy_version 190215 (0.00094) [2022-07-09 09:34:14,995][26022] Updated weights on worker 0-0, policy_version 190225 (0.00092) [2022-07-09 09:34:15,997][25689] Fps is (10 sec: 5683.6, 60 sec: 5728.2, 300 sec: 5733.4). Total num frames: 194796544. Throughput: 0: 5901.1. Samples: 194797818. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:15,997][25689] Avg episode reward: [(0, '-50.493')] [2022-07-09 09:34:16,575][26022] Updated weights on worker 0-0, policy_version 190235 (0.01298) [2022-07-09 09:34:18,351][26022] Updated weights on worker 0-0, policy_version 190245 (0.00086) [2022-07-09 09:34:20,077][26022] Updated weights on worker 0-0, policy_version 190255 (0.00091) [2022-07-09 09:34:21,034][25689] Fps is (10 sec: 5811.4, 60 sec: 5711.4, 300 sec: 5729.3). Total num frames: 194825216. Throughput: 0: 5929.6. Samples: 194832876. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:21,034][25689] Avg episode reward: [(0, '-50.303')] [2022-07-09 09:34:21,907][26022] Updated weights on worker 0-0, policy_version 190265 (0.00058) [2022-07-09 09:34:23,609][26022] Updated weights on worker 0-0, policy_version 190275 (0.00085) [2022-07-09 09:34:25,639][26022] Updated weights on worker 0-0, policy_version 190285 (0.00092) [2022-07-09 09:34:26,040][25689] Fps is (10 sec: 5709.7, 60 sec: 5711.1, 300 sec: 5734.2). Total num frames: 194853888. Throughput: 0: 5182.4. Samples: 194850280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:26,041][25689] Avg episode reward: [(0, '-50.113')] [2022-07-09 09:34:27,260][26022] Updated weights on worker 0-0, policy_version 190295 (0.00089) [2022-07-09 09:34:29,155][26022] Updated weights on worker 0-0, policy_version 190305 (0.00087) [2022-07-09 09:34:30,809][26022] Updated weights on worker 0-0, policy_version 190315 (0.00093) [2022-07-09 09:34:31,104][25689] Fps is (10 sec: 5694.2, 60 sec: 5698.3, 300 sec: 5726.3). Total num frames: 194882560. Throughput: 0: 6032.1. Samples: 194884504. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:31,105][25689] Avg episode reward: [(0, '-49.680')] [2022-07-09 09:34:32,660][26022] Updated weights on worker 0-0, policy_version 190325 (0.00091) [2022-07-09 09:34:34,337][26022] Updated weights on worker 0-0, policy_version 190335 (0.00079) [2022-07-09 09:34:36,123][25689] Fps is (10 sec: 5788.9, 60 sec: 5719.1, 300 sec: 5739.9). Total num frames: 194912256. Throughput: 0: 6031.5. Samples: 194919208. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:36,123][25689] Avg episode reward: [(0, '-49.509')] [2022-07-09 09:34:36,276][26022] Updated weights on worker 0-0, policy_version 190345 (0.00090) [2022-07-09 09:34:38,035][26022] Updated weights on worker 0-0, policy_version 190355 (0.00081) [2022-07-09 09:34:39,703][26022] Updated weights on worker 0-0, policy_version 190365 (0.00084) [2022-07-09 09:34:41,131][25689] Fps is (10 sec: 5821.4, 60 sec: 5722.8, 300 sec: 5732.9). Total num frames: 194940928. Throughput: 0: 5156.1. Samples: 194936498. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:41,131][25689] Avg episode reward: [(0, '-49.364')] [2022-07-09 09:34:41,429][26022] Updated weights on worker 0-0, policy_version 190375 (0.00087) [2022-07-09 09:34:43,467][26022] Updated weights on worker 0-0, policy_version 190385 (0.00087) [2022-07-09 09:34:44,051][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:34:44,060][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000190390_194959360.pth [2022-07-09 09:34:44,062][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000188372_192892928.pth [2022-07-09 09:34:44,895][26022] Updated weights on worker 0-0, policy_version 190395 (0.00087) [2022-07-09 09:34:46,146][25689] Fps is (10 sec: 5823.3, 60 sec: 5740.2, 300 sec: 5735.1). Total num frames: 194970624. Throughput: 0: 6030.8. Samples: 194971534. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:46,146][25689] Avg episode reward: [(0, '-49.146')] [2022-07-09 09:34:46,746][26022] Updated weights on worker 0-0, policy_version 190405 (0.00084) [2022-07-09 09:34:48,348][26022] Updated weights on worker 0-0, policy_version 190415 (0.00087) [2022-07-09 09:34:50,274][26022] Updated weights on worker 0-0, policy_version 190425 (0.00085) [2022-07-09 09:34:51,209][25689] Fps is (10 sec: 5994.7, 60 sec: 5774.4, 300 sec: 5740.9). Total num frames: 195001344. Throughput: 0: 6070.0. Samples: 195006540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:51,210][25689] Avg episode reward: [(0, '-49.462')] [2022-07-09 09:34:52,020][26022] Updated weights on worker 0-0, policy_version 190435 (0.00105) [2022-07-09 09:34:53,781][26022] Updated weights on worker 0-0, policy_version 190445 (0.00089) [2022-07-09 09:34:55,530][26022] Updated weights on worker 0-0, policy_version 190455 (0.00094) [2022-07-09 09:34:56,292][25689] Fps is (10 sec: 5853.7, 60 sec: 5742.1, 300 sec: 5740.0). Total num frames: 195030016. Throughput: 0: 5209.0. Samples: 195024270. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:34:56,292][25689] Avg episode reward: [(0, '-48.799')] [2022-07-09 09:34:57,397][26022] Updated weights on worker 0-0, policy_version 190465 (0.00086) [2022-07-09 09:34:59,055][26022] Updated weights on worker 0-0, policy_version 190475 (0.00088) [2022-07-09 09:35:01,003][26022] Updated weights on worker 0-0, policy_version 190485 (0.00090) [2022-07-09 09:35:01,342][25689] Fps is (10 sec: 5659.1, 60 sec: 5726.2, 300 sec: 5739.7). Total num frames: 195058688. Throughput: 0: 6047.6. Samples: 195058728. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:01,343][25689] Avg episode reward: [(0, '-48.943')] [2022-07-09 09:35:02,785][26022] Updated weights on worker 0-0, policy_version 190495 (0.00083) [2022-07-09 09:35:04,726][26022] Updated weights on worker 0-0, policy_version 190505 (0.00084) [2022-07-09 09:35:06,311][26022] Updated weights on worker 0-0, policy_version 190515 (0.00090) [2022-07-09 09:35:06,379][25689] Fps is (10 sec: 5684.9, 60 sec: 5757.4, 300 sec: 5736.8). Total num frames: 195087360. Throughput: 0: 5927.5. Samples: 195091466. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:06,381][25689] Avg episode reward: [(0, '-48.674')] [2022-07-09 09:35:08,219][26022] Updated weights on worker 0-0, policy_version 190525 (0.00091) [2022-07-09 09:35:10,193][26022] Updated weights on worker 0-0, policy_version 190535 (0.00092) [2022-07-09 09:35:11,477][25689] Fps is (10 sec: 5658.0, 60 sec: 5772.8, 300 sec: 5738.6). Total num frames: 195116032. Throughput: 0: 5049.2. Samples: 195108878. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:11,479][25689] Avg episode reward: [(0, '-48.861')] [2022-07-09 09:35:11,753][26022] Updated weights on worker 0-0, policy_version 190545 (0.00089) [2022-07-09 09:35:13,737][26022] Updated weights on worker 0-0, policy_version 190555 (0.00083) [2022-07-09 09:35:15,359][26022] Updated weights on worker 0-0, policy_version 190565 (0.00083) [2022-07-09 09:35:16,493][25689] Fps is (10 sec: 5467.3, 60 sec: 5721.2, 300 sec: 5724.6). Total num frames: 195142656. Throughput: 0: 5912.5. Samples: 195143710. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:16,494][25689] Avg episode reward: [(0, '-49.114')] [2022-07-09 09:35:17,255][26022] Updated weights on worker 0-0, policy_version 190575 (0.00092) [2022-07-09 09:35:18,851][26022] Updated weights on worker 0-0, policy_version 190585 (0.00404) [2022-07-09 09:35:20,558][26022] Updated weights on worker 0-0, policy_version 190595 (0.00085) [2022-07-09 09:35:21,531][25689] Fps is (10 sec: 5805.3, 60 sec: 5771.8, 300 sec: 5737.9). Total num frames: 195174400. Throughput: 0: 5949.0. Samples: 195178834. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:21,533][25689] Avg episode reward: [(0, '-50.046')] [2022-07-09 09:35:22,308][26022] Updated weights on worker 0-0, policy_version 190605 (0.00090) [2022-07-09 09:35:24,096][26022] Updated weights on worker 0-0, policy_version 190615 (0.00091) [2022-07-09 09:35:25,829][26022] Updated weights on worker 0-0, policy_version 190625 (0.00096) [2022-07-09 09:35:26,543][25689] Fps is (10 sec: 6011.5, 60 sec: 5771.3, 300 sec: 5736.1). Total num frames: 195203072. Throughput: 0: 5212.2. Samples: 195196564. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:26,544][25689] Avg episode reward: [(0, '-49.684')] [2022-07-09 09:35:27,719][26022] Updated weights on worker 0-0, policy_version 190635 (0.00085) [2022-07-09 09:35:29,440][26022] Updated weights on worker 0-0, policy_version 190645 (0.00086) [2022-07-09 09:35:31,071][26022] Updated weights on worker 0-0, policy_version 190655 (0.00084) [2022-07-09 09:35:31,660][25689] Fps is (10 sec: 5863.9, 60 sec: 5800.2, 300 sec: 5737.6). Total num frames: 195233792. Throughput: 0: 6041.9. Samples: 195230820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:31,660][25689] Avg episode reward: [(0, '-50.260')] [2022-07-09 09:35:33,005][26022] Updated weights on worker 0-0, policy_version 190665 (0.00090) [2022-07-09 09:35:34,805][26022] Updated weights on worker 0-0, policy_version 190675 (0.00087) [2022-07-09 09:35:36,634][26022] Updated weights on worker 0-0, policy_version 190685 (0.00082) [2022-07-09 09:35:36,723][25689] Fps is (10 sec: 5733.5, 60 sec: 5762.1, 300 sec: 5733.1). Total num frames: 195261440. Throughput: 0: 6044.2. Samples: 195265986. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:36,724][25689] Avg episode reward: [(0, '-51.349')] [2022-07-09 09:35:38,144][26022] Updated weights on worker 0-0, policy_version 190695 (0.00084) [2022-07-09 09:35:39,905][26022] Updated weights on worker 0-0, policy_version 190705 (0.00085) [2022-07-09 09:35:41,652][26022] Updated weights on worker 0-0, policy_version 190715 (0.00084) [2022-07-09 09:35:41,779][25689] Fps is (10 sec: 5767.9, 60 sec: 5791.3, 300 sec: 5736.1). Total num frames: 195292160. Throughput: 0: 6032.9. Samples: 195300988. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 09:35:41,780][25689] Avg episode reward: [(0, '-51.045')] [2022-07-09 09:35:43,511][26022] Updated weights on worker 0-0, policy_version 190725 (0.00088) [2022-07-09 09:35:45,316][26022] Updated weights on worker 0-0, policy_version 190735 (0.00097) [2022-07-09 09:35:46,790][25689] Fps is (10 sec: 5900.0, 60 sec: 5774.8, 300 sec: 5733.2). Total num frames: 195320832. Throughput: 0: 6024.1. Samples: 195318532. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:35:46,791][25689] Avg episode reward: [(0, '-50.222')] [2022-07-09 09:35:47,019][26022] Updated weights on worker 0-0, policy_version 190745 (0.00090) [2022-07-09 09:35:48,719][26022] Updated weights on worker 0-0, policy_version 190755 (0.00090) [2022-07-09 09:35:50,656][26022] Updated weights on worker 0-0, policy_version 190765 (0.00097) [2022-07-09 09:35:51,877][25689] Fps is (10 sec: 5780.3, 60 sec: 5755.6, 300 sec: 5738.4). Total num frames: 195350528. Throughput: 0: 6077.5. Samples: 195353690. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:35:51,878][25689] Avg episode reward: [(0, '-50.204')] [2022-07-09 09:35:52,067][26022] Updated weights on worker 0-0, policy_version 190775 (0.00096) [2022-07-09 09:35:54,095][26022] Updated weights on worker 0-0, policy_version 190785 (0.00083) [2022-07-09 09:35:55,629][26022] Updated weights on worker 0-0, policy_version 190795 (0.01158) [2022-07-09 09:35:56,892][25689] Fps is (10 sec: 5777.7, 60 sec: 5762.1, 300 sec: 5741.8). Total num frames: 195379200. Throughput: 0: 6075.6. Samples: 195388524. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:35:56,893][25689] Avg episode reward: [(0, '-50.150')] [2022-07-09 09:35:57,738][26022] Updated weights on worker 0-0, policy_version 190805 (0.00090) [2022-07-09 09:35:59,351][26022] Updated weights on worker 0-0, policy_version 190815 (0.00096) [2022-07-09 09:36:01,251][26022] Updated weights on worker 0-0, policy_version 190825 (0.00093) [2022-07-09 09:36:01,957][25689] Fps is (10 sec: 5485.9, 60 sec: 5726.9, 300 sec: 5730.6). Total num frames: 195405824. Throughput: 0: 5182.2. Samples: 195405552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:01,957][25689] Avg episode reward: [(0, '-49.378')] [2022-07-09 09:36:03,205][26022] Updated weights on worker 0-0, policy_version 190835 (0.00085) [2022-07-09 09:36:05,180][26022] Updated weights on worker 0-0, policy_version 190845 (0.00088) [2022-07-09 09:36:06,965][25689] Fps is (10 sec: 5591.2, 60 sec: 5746.5, 300 sec: 5738.5). Total num frames: 195435520. Throughput: 0: 5930.9. Samples: 195438190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:06,966][25689] Avg episode reward: [(0, '-48.734')] [2022-07-09 09:36:06,971][26022] Updated weights on worker 0-0, policy_version 190855 (0.00088) [2022-07-09 09:36:08,722][26022] Updated weights on worker 0-0, policy_version 190865 (0.00065) [2022-07-09 09:36:10,490][26022] Updated weights on worker 0-0, policy_version 190875 (0.00098) [2022-07-09 09:36:12,082][25689] Fps is (10 sec: 5866.0, 60 sec: 5761.7, 300 sec: 5736.8). Total num frames: 195465216. Throughput: 0: 5888.4. Samples: 195472662. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:12,082][25689] Avg episode reward: [(0, '-49.134')] [2022-07-09 09:36:12,274][26022] Updated weights on worker 0-0, policy_version 190885 (0.00082) [2022-07-09 09:36:14,095][26022] Updated weights on worker 0-0, policy_version 190895 (0.00091) [2022-07-09 09:36:15,922][26022] Updated weights on worker 0-0, policy_version 190905 (0.00086) [2022-07-09 09:36:17,149][25689] Fps is (10 sec: 5731.1, 60 sec: 5790.5, 300 sec: 5735.6). Total num frames: 195493888. Throughput: 0: 5009.0. Samples: 195489994. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:17,150][25689] Avg episode reward: [(0, '-49.688')] [2022-07-09 09:36:17,607][26022] Updated weights on worker 0-0, policy_version 190915 (0.00092) [2022-07-09 09:36:19,485][26022] Updated weights on worker 0-0, policy_version 190925 (0.00087) [2022-07-09 09:36:21,102][26022] Updated weights on worker 0-0, policy_version 190935 (0.00079) [2022-07-09 09:36:22,217][25689] Fps is (10 sec: 5758.8, 60 sec: 5754.0, 300 sec: 5734.6). Total num frames: 195523584. Throughput: 0: 5883.6. Samples: 195524756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:22,218][25689] Avg episode reward: [(0, '-50.978')] [2022-07-09 09:36:22,843][26022] Updated weights on worker 0-0, policy_version 190945 (0.00082) [2022-07-09 09:36:24,567][26022] Updated weights on worker 0-0, policy_version 190955 (0.00086) [2022-07-09 09:36:26,333][26022] Updated weights on worker 0-0, policy_version 190965 (0.00098) [2022-07-09 09:36:27,297][25689] Fps is (10 sec: 5852.9, 60 sec: 5764.4, 300 sec: 5735.0). Total num frames: 195553280. Throughput: 0: 5971.6. Samples: 195559604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:27,298][25689] Avg episode reward: [(0, '-51.377')] [2022-07-09 09:36:28,189][26022] Updated weights on worker 0-0, policy_version 190975 (0.00087) [2022-07-09 09:36:30,033][26022] Updated weights on worker 0-0, policy_version 190985 (0.00084) [2022-07-09 09:36:31,854][26022] Updated weights on worker 0-0, policy_version 190995 (0.00085) [2022-07-09 09:36:32,375][25689] Fps is (10 sec: 5847.1, 60 sec: 5751.2, 300 sec: 5744.2). Total num frames: 195582976. Throughput: 0: 5129.3. Samples: 195576750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:32,376][25689] Avg episode reward: [(0, '-52.086')] [2022-07-09 09:36:33,574][26022] Updated weights on worker 0-0, policy_version 191005 (0.00085) [2022-07-09 09:36:35,170][26022] Updated weights on worker 0-0, policy_version 191015 (0.00089) [2022-07-09 09:36:37,107][26022] Updated weights on worker 0-0, policy_version 191025 (0.00080) [2022-07-09 09:36:37,402][25689] Fps is (10 sec: 5776.0, 60 sec: 5771.5, 300 sec: 5736.9). Total num frames: 195611648. Throughput: 0: 6019.1. Samples: 195611896. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:37,404][25689] Avg episode reward: [(0, '-52.397')] [2022-07-09 09:36:38,703][26022] Updated weights on worker 0-0, policy_version 191035 (0.00053) [2022-07-09 09:36:40,427][26022] Updated weights on worker 0-0, policy_version 191045 (0.00085) [2022-07-09 09:36:42,290][26022] Updated weights on worker 0-0, policy_version 191055 (0.00088) [2022-07-09 09:36:42,484][25689] Fps is (10 sec: 5672.1, 60 sec: 5735.3, 300 sec: 5732.7). Total num frames: 195640320. Throughput: 0: 6037.2. Samples: 195647114. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:42,487][25689] Avg episode reward: [(0, '-52.245')] [2022-07-09 09:36:43,882][26022] Updated weights on worker 0-0, policy_version 191065 (0.00084) [2022-07-09 09:36:44,096][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:36:44,110][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000191066_195651584.pth [2022-07-09 09:36:44,110][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000189046_193583104.pth [2022-07-09 09:36:46,007][26022] Updated weights on worker 0-0, policy_version 191075 (0.00079) [2022-07-09 09:36:47,388][26022] Updated weights on worker 0-0, policy_version 191085 (0.00086) [2022-07-09 09:36:47,538][25689] Fps is (10 sec: 5960.5, 60 sec: 5781.7, 300 sec: 5740.4). Total num frames: 195672064. Throughput: 0: 5181.7. Samples: 195664488. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:47,540][25689] Avg episode reward: [(0, '-52.000')] [2022-07-09 09:36:49,500][26022] Updated weights on worker 0-0, policy_version 191095 (0.00080) [2022-07-09 09:36:50,787][26022] Updated weights on worker 0-0, policy_version 191105 (0.00084) [2022-07-09 09:36:52,631][25689] Fps is (10 sec: 5853.2, 60 sec: 5747.5, 300 sec: 5739.1). Total num frames: 195699712. Throughput: 0: 6052.2. Samples: 195699348. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:52,632][25689] Avg episode reward: [(0, '-50.795')] [2022-07-09 09:36:52,911][26022] Updated weights on worker 0-0, policy_version 191115 (0.00086) [2022-07-09 09:36:54,435][26022] Updated weights on worker 0-0, policy_version 191125 (0.00088) [2022-07-09 09:36:56,429][26022] Updated weights on worker 0-0, policy_version 191135 (0.00056) [2022-07-09 09:36:57,674][25689] Fps is (10 sec: 5657.5, 60 sec: 5761.7, 300 sec: 5745.2). Total num frames: 195729408. Throughput: 0: 6047.0. Samples: 195734480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:36:57,674][25689] Avg episode reward: [(0, '-49.671')] [2022-07-09 09:36:57,978][26022] Updated weights on worker 0-0, policy_version 191145 (0.00086) [2022-07-09 09:36:59,870][26022] Updated weights on worker 0-0, policy_version 191155 (0.00084) [2022-07-09 09:37:02,074][26022] Updated weights on worker 0-0, policy_version 191165 (0.00086) [2022-07-09 09:37:02,682][25689] Fps is (10 sec: 5603.6, 60 sec: 5767.1, 300 sec: 5739.7). Total num frames: 195756032. Throughput: 0: 5176.7. Samples: 195751672. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:02,683][25689] Avg episode reward: [(0, '-48.833')] [2022-07-09 09:37:03,766][26022] Updated weights on worker 0-0, policy_version 191175 (0.00086) [2022-07-09 09:37:05,577][26022] Updated weights on worker 0-0, policy_version 191185 (0.00086) [2022-07-09 09:37:07,290][26022] Updated weights on worker 0-0, policy_version 191195 (0.00095) [2022-07-09 09:37:07,699][25689] Fps is (10 sec: 5618.1, 60 sec: 5766.3, 300 sec: 5743.7). Total num frames: 195785728. Throughput: 0: 5961.1. Samples: 195784668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:07,699][25689] Avg episode reward: [(0, '-48.919')] [2022-07-09 09:37:09,002][26022] Updated weights on worker 0-0, policy_version 191205 (0.00086) [2022-07-09 09:37:10,909][26022] Updated weights on worker 0-0, policy_version 191215 (0.00093) [2022-07-09 09:37:12,491][26022] Updated weights on worker 0-0, policy_version 191225 (0.00086) [2022-07-09 09:37:12,847][25689] Fps is (10 sec: 5943.6, 60 sec: 5780.1, 300 sec: 5744.5). Total num frames: 195816448. Throughput: 0: 5952.3. Samples: 195819678. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:12,848][25689] Avg episode reward: [(0, '-49.066')] [2022-07-09 09:37:14,269][26022] Updated weights on worker 0-0, policy_version 191235 (0.00083) [2022-07-09 09:37:16,099][26022] Updated weights on worker 0-0, policy_version 191245 (0.00082) [2022-07-09 09:37:17,846][26022] Updated weights on worker 0-0, policy_version 191255 (0.00085) [2022-07-09 09:37:17,863][25689] Fps is (10 sec: 5843.2, 60 sec: 5785.1, 300 sec: 5745.0). Total num frames: 195845120. Throughput: 0: 5961.5. Samples: 195854836. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:17,863][25689] Avg episode reward: [(0, '-49.138')] [2022-07-09 09:37:19,632][26022] Updated weights on worker 0-0, policy_version 191265 (0.00089) [2022-07-09 09:37:21,206][26022] Updated weights on worker 0-0, policy_version 191275 (0.00075) [2022-07-09 09:37:22,940][25689] Fps is (10 sec: 5783.0, 60 sec: 5784.2, 300 sec: 5750.7). Total num frames: 195874816. Throughput: 0: 5957.1. Samples: 195872350. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:22,940][25689] Avg episode reward: [(0, '-49.326')] [2022-07-09 09:37:23,111][26022] Updated weights on worker 0-0, policy_version 191285 (0.00088) [2022-07-09 09:37:24,840][26022] Updated weights on worker 0-0, policy_version 191295 (0.00084) [2022-07-09 09:37:26,446][26022] Updated weights on worker 0-0, policy_version 191305 (0.00082) [2022-07-09 09:37:27,945][25689] Fps is (10 sec: 5890.8, 60 sec: 5791.3, 300 sec: 5755.3). Total num frames: 195904512. Throughput: 0: 6070.4. Samples: 195907572. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:27,946][25689] Avg episode reward: [(0, '-49.156')] [2022-07-09 09:37:28,412][26022] Updated weights on worker 0-0, policy_version 191315 (0.00080) [2022-07-09 09:37:30,077][26022] Updated weights on worker 0-0, policy_version 191325 (0.00086) [2022-07-09 09:37:32,014][26022] Updated weights on worker 0-0, policy_version 191335 (0.00080) [2022-07-09 09:37:32,995][25689] Fps is (10 sec: 5804.8, 60 sec: 5777.1, 300 sec: 5747.5). Total num frames: 195933184. Throughput: 0: 6079.7. Samples: 195942172. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:32,996][25689] Avg episode reward: [(0, '-49.877')] [2022-07-09 09:37:33,476][26022] Updated weights on worker 0-0, policy_version 191345 (0.00090) [2022-07-09 09:37:35,440][26022] Updated weights on worker 0-0, policy_version 191355 (0.00082) [2022-07-09 09:37:37,152][26022] Updated weights on worker 0-0, policy_version 191365 (0.00084) [2022-07-09 09:37:38,028][25689] Fps is (10 sec: 5687.3, 60 sec: 5776.5, 300 sec: 5753.9). Total num frames: 195961856. Throughput: 0: 5198.3. Samples: 195959660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:38,028][25689] Avg episode reward: [(0, '-50.399')] [2022-07-09 09:37:38,876][26022] Updated weights on worker 0-0, policy_version 191375 (0.00092) [2022-07-09 09:37:40,676][26022] Updated weights on worker 0-0, policy_version 191385 (0.00089) [2022-07-09 09:37:42,382][26022] Updated weights on worker 0-0, policy_version 191395 (0.00090) [2022-07-09 09:37:43,052][25689] Fps is (10 sec: 5803.7, 60 sec: 5799.0, 300 sec: 5750.2). Total num frames: 195991552. Throughput: 0: 6077.6. Samples: 195994582. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:43,052][25689] Avg episode reward: [(0, '-50.354')] [2022-07-09 09:37:44,216][26022] Updated weights on worker 0-0, policy_version 191405 (0.00084) [2022-07-09 09:37:45,749][26022] Updated weights on worker 0-0, policy_version 191415 (0.00085) [2022-07-09 09:37:47,752][26022] Updated weights on worker 0-0, policy_version 191425 (0.00085) [2022-07-09 09:37:48,080][25689] Fps is (10 sec: 5806.5, 60 sec: 5750.7, 300 sec: 5751.1). Total num frames: 196020224. Throughput: 0: 6083.9. Samples: 196030072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:48,082][25689] Avg episode reward: [(0, '-50.108')] [2022-07-09 09:37:49,432][26022] Updated weights on worker 0-0, policy_version 191435 (0.00086) [2022-07-09 09:37:51,295][26022] Updated weights on worker 0-0, policy_version 191445 (0.00083) [2022-07-09 09:37:52,708][26022] Updated weights on worker 0-0, policy_version 191455 (0.00389) [2022-07-09 09:37:53,219][25689] Fps is (10 sec: 5942.5, 60 sec: 5814.0, 300 sec: 5756.8). Total num frames: 196051968. Throughput: 0: 5216.8. Samples: 196047676. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 09:37:53,221][25689] Avg episode reward: [(0, '-51.224')] [2022-07-09 09:37:54,663][26022] Updated weights on worker 0-0, policy_version 191465 (0.00088) [2022-07-09 09:37:56,284][26022] Updated weights on worker 0-0, policy_version 191475 (0.00085) [2022-07-09 09:37:58,118][26022] Updated weights on worker 0-0, policy_version 191485 (0.00091) [2022-07-09 09:37:58,276][25689] Fps is (10 sec: 5925.3, 60 sec: 5795.7, 300 sec: 5755.8). Total num frames: 196080640. Throughput: 0: 6067.8. Samples: 196082524. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:37:58,277][25689] Avg episode reward: [(0, '-50.729')] [2022-07-09 09:38:00,076][26022] Updated weights on worker 0-0, policy_version 191495 (0.00085) [2022-07-09 09:38:01,775][26022] Updated weights on worker 0-0, policy_version 191505 (0.00088) [2022-07-09 09:38:03,362][25689] Fps is (10 sec: 5451.7, 60 sec: 5788.3, 300 sec: 5761.6). Total num frames: 196107264. Throughput: 0: 5930.8. Samples: 196115034. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:03,362][25689] Avg episode reward: [(0, '-50.311')] [2022-07-09 09:38:03,943][26022] Updated weights on worker 0-0, policy_version 191515 (0.00407) [2022-07-09 09:38:05,555][26022] Updated weights on worker 0-0, policy_version 191525 (0.00089) [2022-07-09 09:38:07,360][26022] Updated weights on worker 0-0, policy_version 191535 (0.00085) [2022-07-09 09:38:08,376][25689] Fps is (10 sec: 5576.3, 60 sec: 5788.5, 300 sec: 5755.9). Total num frames: 196136960. Throughput: 0: 5050.8. Samples: 196132584. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:08,377][25689] Avg episode reward: [(0, '-50.640')] [2022-07-09 09:38:09,149][26022] Updated weights on worker 0-0, policy_version 191545 (0.00087) [2022-07-09 09:38:11,047][26022] Updated weights on worker 0-0, policy_version 191555 (0.00094) [2022-07-09 09:38:12,740][26022] Updated weights on worker 0-0, policy_version 191565 (0.00056) [2022-07-09 09:38:13,487][25689] Fps is (10 sec: 5764.5, 60 sec: 5758.3, 300 sec: 5757.4). Total num frames: 196165632. Throughput: 0: 5910.9. Samples: 196167480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:13,488][25689] Avg episode reward: [(0, '-51.125')] [2022-07-09 09:38:14,420][26022] Updated weights on worker 0-0, policy_version 191575 (0.00093) [2022-07-09 09:38:16,261][26022] Updated weights on worker 0-0, policy_version 191585 (0.00088) [2022-07-09 09:38:18,001][26022] Updated weights on worker 0-0, policy_version 191595 (0.00085) [2022-07-09 09:38:18,491][25689] Fps is (10 sec: 5770.6, 60 sec: 5776.3, 300 sec: 5758.0). Total num frames: 196195328. Throughput: 0: 5916.6. Samples: 196202126. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:18,492][25689] Avg episode reward: [(0, '-50.732')] [2022-07-09 09:38:19,717][26022] Updated weights on worker 0-0, policy_version 191605 (0.00086) [2022-07-09 09:38:21,528][26022] Updated weights on worker 0-0, policy_version 191615 (0.00083) [2022-07-09 09:38:23,462][26022] Updated weights on worker 0-0, policy_version 191625 (0.00086) [2022-07-09 09:38:23,509][25689] Fps is (10 sec: 5823.9, 60 sec: 5765.0, 300 sec: 5757.7). Total num frames: 196224000. Throughput: 0: 5192.2. Samples: 196219644. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:23,510][25689] Avg episode reward: [(0, '-50.855')] [2022-07-09 09:38:25,032][26022] Updated weights on worker 0-0, policy_version 191635 (0.00082) [2022-07-09 09:38:27,099][26022] Updated weights on worker 0-0, policy_version 191645 (0.00091) [2022-07-09 09:38:28,517][25689] Fps is (10 sec: 5821.8, 60 sec: 5764.8, 300 sec: 5759.7). Total num frames: 196253696. Throughput: 0: 6049.7. Samples: 196254428. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:28,531][25689] Avg episode reward: [(0, '-51.314')] [2022-07-09 09:38:28,537][26022] Updated weights on worker 0-0, policy_version 191655 (0.00109) [2022-07-09 09:38:30,480][26022] Updated weights on worker 0-0, policy_version 191665 (0.00088) [2022-07-09 09:38:32,118][26022] Updated weights on worker 0-0, policy_version 191675 (0.00088) [2022-07-09 09:38:33,615][25689] Fps is (10 sec: 5877.0, 60 sec: 5777.1, 300 sec: 5762.4). Total num frames: 196283392. Throughput: 0: 6038.7. Samples: 196289028. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:33,616][25689] Avg episode reward: [(0, '-50.614')] [2022-07-09 09:38:33,923][26022] Updated weights on worker 0-0, policy_version 191685 (0.00083) [2022-07-09 09:38:35,723][26022] Updated weights on worker 0-0, policy_version 191695 (0.00083) [2022-07-09 09:38:37,407][26022] Updated weights on worker 0-0, policy_version 191705 (0.00089) [2022-07-09 09:38:38,651][25689] Fps is (10 sec: 5658.8, 60 sec: 5760.0, 300 sec: 5759.2). Total num frames: 196311040. Throughput: 0: 5169.6. Samples: 196306342. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:38,652][25689] Avg episode reward: [(0, '-50.321')] [2022-07-09 09:38:39,457][26022] Updated weights on worker 0-0, policy_version 191715 (0.00100) [2022-07-09 09:38:40,961][26022] Updated weights on worker 0-0, policy_version 191725 (0.00093) [2022-07-09 09:38:42,817][26022] Updated weights on worker 0-0, policy_version 191735 (0.00080) [2022-07-09 09:38:43,668][25689] Fps is (10 sec: 5907.7, 60 sec: 5794.3, 300 sec: 5769.5). Total num frames: 196342784. Throughput: 0: 6040.2. Samples: 196341410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:43,671][25689] Avg episode reward: [(0, '-50.403')] [2022-07-09 09:38:44,379][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:38:44,394][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000191744_196345856.pth [2022-07-09 09:38:44,395][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000189718_194271232.pth [2022-07-09 09:38:44,635][26022] Updated weights on worker 0-0, policy_version 191745 (0.00084) [2022-07-09 09:38:46,292][26022] Updated weights on worker 0-0, policy_version 191755 (0.00097) [2022-07-09 09:38:48,325][26022] Updated weights on worker 0-0, policy_version 191765 (0.00085) [2022-07-09 09:38:48,703][25689] Fps is (10 sec: 5806.3, 60 sec: 5759.9, 300 sec: 5763.2). Total num frames: 196369408. Throughput: 0: 6049.5. Samples: 196376544. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:48,704][25689] Avg episode reward: [(0, '-50.282')] [2022-07-09 09:38:49,730][26022] Updated weights on worker 0-0, policy_version 191775 (0.00099) [2022-07-09 09:38:51,754][26022] Updated weights on worker 0-0, policy_version 191785 (0.00056) [2022-07-09 09:38:53,443][26022] Updated weights on worker 0-0, policy_version 191795 (0.00094) [2022-07-09 09:38:53,776][25689] Fps is (10 sec: 5673.4, 60 sec: 5749.3, 300 sec: 5763.8). Total num frames: 196400128. Throughput: 0: 5188.0. Samples: 196393622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:53,776][25689] Avg episode reward: [(0, '-51.321')] [2022-07-09 09:38:55,087][26022] Updated weights on worker 0-0, policy_version 191805 (0.00053) [2022-07-09 09:38:56,951][26022] Updated weights on worker 0-0, policy_version 191815 (0.00087) [2022-07-09 09:38:58,635][26022] Updated weights on worker 0-0, policy_version 191825 (0.00087) [2022-07-09 09:38:58,797][25689] Fps is (10 sec: 5884.1, 60 sec: 5752.8, 300 sec: 5761.1). Total num frames: 196428800. Throughput: 0: 6072.0. Samples: 196428668. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:38:58,798][25689] Avg episode reward: [(0, '-51.361')] [2022-07-09 09:39:00,501][26022] Updated weights on worker 0-0, policy_version 191835 (0.00085) [2022-07-09 09:39:02,792][26022] Updated weights on worker 0-0, policy_version 191845 (0.00089) [2022-07-09 09:39:03,806][25689] Fps is (10 sec: 5615.0, 60 sec: 5776.9, 300 sec: 5764.5). Total num frames: 196456448. Throughput: 0: 5962.6. Samples: 196461484. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:03,808][25689] Avg episode reward: [(0, '-51.327')] [2022-07-09 09:39:04,169][26022] Updated weights on worker 0-0, policy_version 191855 (0.00089) [2022-07-09 09:39:06,198][26022] Updated weights on worker 0-0, policy_version 191865 (0.00089) [2022-07-09 09:39:07,825][26022] Updated weights on worker 0-0, policy_version 191875 (0.00082) [2022-07-09 09:39:08,823][25689] Fps is (10 sec: 5514.9, 60 sec: 5742.8, 300 sec: 5765.8). Total num frames: 196484096. Throughput: 0: 5077.6. Samples: 196478706. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:08,825][25689] Avg episode reward: [(0, '-51.768')] [2022-07-09 09:39:09,629][26022] Updated weights on worker 0-0, policy_version 191885 (0.00082) [2022-07-09 09:39:11,472][26022] Updated weights on worker 0-0, policy_version 191895 (0.00089) [2022-07-09 09:39:13,159][26022] Updated weights on worker 0-0, policy_version 191905 (0.00095) [2022-07-09 09:39:13,874][25689] Fps is (10 sec: 5695.6, 60 sec: 5765.4, 300 sec: 5764.9). Total num frames: 196513792. Throughput: 0: 5968.3. Samples: 196513576. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:13,875][25689] Avg episode reward: [(0, '-51.848')] [2022-07-09 09:39:14,946][26022] Updated weights on worker 0-0, policy_version 191915 (0.00090) [2022-07-09 09:39:16,843][26022] Updated weights on worker 0-0, policy_version 191925 (0.00086) [2022-07-09 09:39:18,498][26022] Updated weights on worker 0-0, policy_version 191935 (0.00085) [2022-07-09 09:39:18,934][25689] Fps is (10 sec: 5874.4, 60 sec: 5760.2, 300 sec: 5767.9). Total num frames: 196543488. Throughput: 0: 5967.6. Samples: 196548838. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:18,934][25689] Avg episode reward: [(0, '-51.394')] [2022-07-09 09:39:20,217][26022] Updated weights on worker 0-0, policy_version 191945 (0.00086) [2022-07-09 09:39:21,871][26022] Updated weights on worker 0-0, policy_version 191955 (0.00082) [2022-07-09 09:39:23,693][26022] Updated weights on worker 0-0, policy_version 191965 (0.00089) [2022-07-09 09:39:23,949][25689] Fps is (10 sec: 5895.1, 60 sec: 5777.4, 300 sec: 5771.2). Total num frames: 196573184. Throughput: 0: 5191.3. Samples: 196566052. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:23,950][25689] Avg episode reward: [(0, '-51.595')] [2022-07-09 09:39:25,654][26022] Updated weights on worker 0-0, policy_version 191975 (0.00084) [2022-07-09 09:39:27,221][26022] Updated weights on worker 0-0, policy_version 191985 (0.00085) [2022-07-09 09:39:28,957][25689] Fps is (10 sec: 5720.9, 60 sec: 5743.5, 300 sec: 5768.9). Total num frames: 196600832. Throughput: 0: 6068.4. Samples: 196600886. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:28,959][25689] Avg episode reward: [(0, '-51.175')] [2022-07-09 09:39:29,199][26022] Updated weights on worker 0-0, policy_version 191995 (0.00090) [2022-07-09 09:39:30,874][26022] Updated weights on worker 0-0, policy_version 192005 (0.00087) [2022-07-09 09:39:32,599][26022] Updated weights on worker 0-0, policy_version 192015 (0.00084) [2022-07-09 09:39:34,087][25689] Fps is (10 sec: 5555.4, 60 sec: 5723.5, 300 sec: 5763.3). Total num frames: 196629504. Throughput: 0: 6012.9. Samples: 196635114. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:34,088][25689] Avg episode reward: [(0, '-51.345')] [2022-07-09 09:39:34,730][26022] Updated weights on worker 0-0, policy_version 192025 (0.00088) [2022-07-09 09:39:36,065][26022] Updated weights on worker 0-0, policy_version 192035 (0.00087) [2022-07-09 09:39:37,964][26022] Updated weights on worker 0-0, policy_version 192045 (0.00079) [2022-07-09 09:39:39,162][25689] Fps is (10 sec: 5920.4, 60 sec: 5787.5, 300 sec: 5772.3). Total num frames: 196661248. Throughput: 0: 5998.9. Samples: 196670184. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:39,162][25689] Avg episode reward: [(0, '-50.984')] [2022-07-09 09:39:39,839][26022] Updated weights on worker 0-0, policy_version 192055 (0.00083) [2022-07-09 09:39:41,562][26022] Updated weights on worker 0-0, policy_version 192065 (0.00092) [2022-07-09 09:39:43,430][26022] Updated weights on worker 0-0, policy_version 192075 (0.00090) [2022-07-09 09:39:44,194][25689] Fps is (10 sec: 6078.9, 60 sec: 5752.3, 300 sec: 5771.9). Total num frames: 196690944. Throughput: 0: 5998.6. Samples: 196687494. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:44,195][25689] Avg episode reward: [(0, '-51.322')] [2022-07-09 09:39:45,170][26022] Updated weights on worker 0-0, policy_version 192085 (0.00097) [2022-07-09 09:39:46,778][26022] Updated weights on worker 0-0, policy_version 192095 (0.00087) [2022-07-09 09:39:48,842][26022] Updated weights on worker 0-0, policy_version 192105 (0.00088) [2022-07-09 09:39:49,223][25689] Fps is (10 sec: 5699.7, 60 sec: 5769.8, 300 sec: 5762.3). Total num frames: 196718592. Throughput: 0: 5984.7. Samples: 196722170. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:49,224][25689] Avg episode reward: [(0, '-51.167')] [2022-07-09 09:39:50,337][26022] Updated weights on worker 0-0, policy_version 192115 (0.00088) [2022-07-09 09:39:52,227][26022] Updated weights on worker 0-0, policy_version 192125 (0.00094) [2022-07-09 09:39:53,910][26022] Updated weights on worker 0-0, policy_version 192135 (0.00085) [2022-07-09 09:39:54,322][25689] Fps is (10 sec: 5561.0, 60 sec: 5733.4, 300 sec: 5762.0). Total num frames: 196747264. Throughput: 0: 6021.6. Samples: 196756960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:54,322][25689] Avg episode reward: [(0, '-51.389')] [2022-07-09 09:39:55,583][26022] Updated weights on worker 0-0, policy_version 192145 (0.00083) [2022-07-09 09:39:57,614][26022] Updated weights on worker 0-0, policy_version 192155 (0.00080) [2022-07-09 09:39:59,202][26022] Updated weights on worker 0-0, policy_version 192165 (0.00082) [2022-07-09 09:39:59,355][25689] Fps is (10 sec: 5861.5, 60 sec: 5766.0, 300 sec: 5769.2). Total num frames: 196777984. Throughput: 0: 5158.1. Samples: 196774344. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:39:59,356][25689] Avg episode reward: [(0, '-50.971')] [2022-07-09 09:40:01,082][26022] Updated weights on worker 0-0, policy_version 192175 (0.00095) [2022-07-09 09:40:03,096][26022] Updated weights on worker 0-0, policy_version 192185 (0.00084) [2022-07-09 09:40:04,405][25689] Fps is (10 sec: 5585.5, 60 sec: 5728.4, 300 sec: 5758.6). Total num frames: 196803584. Throughput: 0: 5896.5. Samples: 196806668. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 09:40:04,406][25689] Avg episode reward: [(0, '-50.330')] [2022-07-09 09:40:05,004][26022] Updated weights on worker 0-0, policy_version 192195 (0.00087) [2022-07-09 09:40:06,735][26022] Updated weights on worker 0-0, policy_version 192205 (0.00086) [2022-07-09 09:40:08,520][26022] Updated weights on worker 0-0, policy_version 192215 (0.00093) [2022-07-09 09:40:09,438][25689] Fps is (10 sec: 5484.3, 60 sec: 5760.7, 300 sec: 5763.3). Total num frames: 196833280. Throughput: 0: 5889.4. Samples: 196841226. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:09,439][25689] Avg episode reward: [(0, '-49.857')] [2022-07-09 09:40:10,371][26022] Updated weights on worker 0-0, policy_version 192225 (0.00099) [2022-07-09 09:40:12,142][26022] Updated weights on worker 0-0, policy_version 192235 (0.00082) [2022-07-09 09:40:13,757][26022] Updated weights on worker 0-0, policy_version 192245 (0.00084) [2022-07-09 09:40:14,537][25689] Fps is (10 sec: 5862.1, 60 sec: 5756.2, 300 sec: 5772.0). Total num frames: 196862976. Throughput: 0: 5029.7. Samples: 196858634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:14,537][25689] Avg episode reward: [(0, '-50.610')] [2022-07-09 09:40:15,726][26022] Updated weights on worker 0-0, policy_version 192255 (0.00532) [2022-07-09 09:40:17,270][26022] Updated weights on worker 0-0, policy_version 192265 (0.00091) [2022-07-09 09:40:19,185][26022] Updated weights on worker 0-0, policy_version 192275 (0.00089) [2022-07-09 09:40:19,561][25689] Fps is (10 sec: 5765.7, 60 sec: 5742.6, 300 sec: 5762.0). Total num frames: 196891648. Throughput: 0: 5898.3. Samples: 196893526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:19,563][25689] Avg episode reward: [(0, '-50.999')] [2022-07-09 09:40:20,840][26022] Updated weights on worker 0-0, policy_version 192285 (0.00084) [2022-07-09 09:40:22,755][26022] Updated weights on worker 0-0, policy_version 192295 (0.00085) [2022-07-09 09:40:24,507][26022] Updated weights on worker 0-0, policy_version 192305 (0.00084) [2022-07-09 09:40:24,567][25689] Fps is (10 sec: 5717.3, 60 sec: 5726.7, 300 sec: 5762.1). Total num frames: 196920320. Throughput: 0: 6050.2. Samples: 196928648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:24,569][25689] Avg episode reward: [(0, '-51.329')] [2022-07-09 09:40:26,195][26022] Updated weights on worker 0-0, policy_version 192315 (0.00092) [2022-07-09 09:40:27,992][26022] Updated weights on worker 0-0, policy_version 192325 (0.00080) [2022-07-09 09:40:29,570][25689] Fps is (10 sec: 5831.8, 60 sec: 5760.9, 300 sec: 5760.9). Total num frames: 196950016. Throughput: 0: 5213.8. Samples: 196946190. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:29,571][25689] Avg episode reward: [(0, '-51.733')] [2022-07-09 09:40:29,787][26022] Updated weights on worker 0-0, policy_version 192335 (0.00103) [2022-07-09 09:40:31,552][26022] Updated weights on worker 0-0, policy_version 192345 (0.00090) [2022-07-09 09:40:33,437][26022] Updated weights on worker 0-0, policy_version 192355 (0.00088) [2022-07-09 09:40:34,691][25689] Fps is (10 sec: 5664.0, 60 sec: 5744.8, 300 sec: 5759.8). Total num frames: 196977664. Throughput: 0: 6030.5. Samples: 196980174. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:34,692][25689] Avg episode reward: [(0, '-51.849')] [2022-07-09 09:40:35,172][26022] Updated weights on worker 0-0, policy_version 192365 (0.00092) [2022-07-09 09:40:37,021][26022] Updated weights on worker 0-0, policy_version 192375 (0.00088) [2022-07-09 09:40:38,607][26022] Updated weights on worker 0-0, policy_version 192385 (0.00083) [2022-07-09 09:40:39,764][25689] Fps is (10 sec: 5625.5, 60 sec: 5711.3, 300 sec: 5756.0). Total num frames: 197007360. Throughput: 0: 6011.2. Samples: 197014966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:39,764][25689] Avg episode reward: [(0, '-51.880')] [2022-07-09 09:40:40,443][26022] Updated weights on worker 0-0, policy_version 192395 (0.00093) [2022-07-09 09:40:42,196][26022] Updated weights on worker 0-0, policy_version 192405 (0.00086) [2022-07-09 09:40:43,989][26022] Updated weights on worker 0-0, policy_version 192415 (0.00085) [2022-07-09 09:40:44,519][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:40:44,536][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000192419_197037056.pth [2022-07-09 09:40:44,537][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000190390_194959360.pth [2022-07-09 09:40:44,785][25689] Fps is (10 sec: 5883.7, 60 sec: 5712.2, 300 sec: 5759.2). Total num frames: 197037056. Throughput: 0: 5132.3. Samples: 197032416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:44,786][25689] Avg episode reward: [(0, '-52.061')] [2022-07-09 09:40:45,726][26022] Updated weights on worker 0-0, policy_version 192425 (0.00103) [2022-07-09 09:40:47,415][26022] Updated weights on worker 0-0, policy_version 192435 (0.00084) [2022-07-09 09:40:49,272][26022] Updated weights on worker 0-0, policy_version 192445 (0.00088) [2022-07-09 09:40:49,802][25689] Fps is (10 sec: 5814.6, 60 sec: 5730.3, 300 sec: 5757.2). Total num frames: 197065728. Throughput: 0: 5986.3. Samples: 197067304. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:49,802][25689] Avg episode reward: [(0, '-51.164')] [2022-07-09 09:40:50,926][26022] Updated weights on worker 0-0, policy_version 192455 (0.00084) [2022-07-09 09:40:52,789][26022] Updated weights on worker 0-0, policy_version 192465 (0.00082) [2022-07-09 09:40:54,498][26022] Updated weights on worker 0-0, policy_version 192475 (0.00089) [2022-07-09 09:40:54,918][25689] Fps is (10 sec: 5760.3, 60 sec: 5745.6, 300 sec: 5758.6). Total num frames: 197095424. Throughput: 0: 6025.5. Samples: 197102054. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:54,919][25689] Avg episode reward: [(0, '-51.059')] [2022-07-09 09:40:56,386][26022] Updated weights on worker 0-0, policy_version 192485 (0.00086) [2022-07-09 09:40:58,086][26022] Updated weights on worker 0-0, policy_version 192495 (0.00088) [2022-07-09 09:40:59,899][26022] Updated weights on worker 0-0, policy_version 192505 (0.00081) [2022-07-09 09:40:59,925][25689] Fps is (10 sec: 5867.1, 60 sec: 5731.2, 300 sec: 5770.1). Total num frames: 197125120. Throughput: 0: 5193.1. Samples: 197119664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:40:59,925][25689] Avg episode reward: [(0, '-51.106')] [2022-07-09 09:41:01,960][26022] Updated weights on worker 0-0, policy_version 192515 (0.00081) [2022-07-09 09:41:03,769][26022] Updated weights on worker 0-0, policy_version 192525 (0.00090) [2022-07-09 09:41:04,933][25689] Fps is (10 sec: 5521.3, 60 sec: 5735.1, 300 sec: 5756.3). Total num frames: 197150720. Throughput: 0: 5951.8. Samples: 197152332. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:04,934][25689] Avg episode reward: [(0, '-51.471')] [2022-07-09 09:41:05,545][26022] Updated weights on worker 0-0, policy_version 192535 (0.00079) [2022-07-09 09:41:07,498][26022] Updated weights on worker 0-0, policy_version 192545 (0.00086) [2022-07-09 09:41:09,132][26022] Updated weights on worker 0-0, policy_version 192555 (0.00086) [2022-07-09 09:41:09,952][25689] Fps is (10 sec: 5514.8, 60 sec: 5736.5, 300 sec: 5758.2). Total num frames: 197180416. Throughput: 0: 5900.3. Samples: 197186194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:09,952][25689] Avg episode reward: [(0, '-51.427')] [2022-07-09 09:41:11,170][26022] Updated weights on worker 0-0, policy_version 192565 (0.00092) [2022-07-09 09:41:12,602][26022] Updated weights on worker 0-0, policy_version 192575 (0.00083) [2022-07-09 09:41:14,579][26022] Updated weights on worker 0-0, policy_version 192585 (0.00085) [2022-07-09 09:41:14,991][25689] Fps is (10 sec: 5803.5, 60 sec: 5725.2, 300 sec: 5758.8). Total num frames: 197209088. Throughput: 0: 5045.4. Samples: 197203328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:14,991][25689] Avg episode reward: [(0, '-52.064')] [2022-07-09 09:41:16,253][26022] Updated weights on worker 0-0, policy_version 192595 (0.00085) [2022-07-09 09:41:18,134][26022] Updated weights on worker 0-0, policy_version 192605 (0.00089) [2022-07-09 09:41:19,962][26022] Updated weights on worker 0-0, policy_version 192615 (0.00082) [2022-07-09 09:41:19,997][25689] Fps is (10 sec: 5708.7, 60 sec: 5727.0, 300 sec: 5756.5). Total num frames: 197237760. Throughput: 0: 5899.2. Samples: 197238072. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:19,997][25689] Avg episode reward: [(0, '-51.312')] [2022-07-09 09:41:21,654][26022] Updated weights on worker 0-0, policy_version 192625 (0.00082) [2022-07-09 09:41:23,436][26022] Updated weights on worker 0-0, policy_version 192635 (0.00080) [2022-07-09 09:41:25,003][25689] Fps is (10 sec: 5829.8, 60 sec: 5743.9, 300 sec: 5758.0). Total num frames: 197267456. Throughput: 0: 6005.2. Samples: 197272852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:25,003][25689] Avg episode reward: [(0, '-51.269')] [2022-07-09 09:41:25,247][26022] Updated weights on worker 0-0, policy_version 192645 (0.00083) [2022-07-09 09:41:26,898][26022] Updated weights on worker 0-0, policy_version 192655 (0.00086) [2022-07-09 09:41:28,911][26022] Updated weights on worker 0-0, policy_version 192665 (0.00089) [2022-07-09 09:41:30,008][25689] Fps is (10 sec: 5829.9, 60 sec: 5726.7, 300 sec: 5755.9). Total num frames: 197296128. Throughput: 0: 5185.8. Samples: 197290204. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:30,009][25689] Avg episode reward: [(0, '-51.004')] [2022-07-09 09:41:30,427][26022] Updated weights on worker 0-0, policy_version 192675 (0.00086) [2022-07-09 09:41:32,397][26022] Updated weights on worker 0-0, policy_version 192685 (0.00086) [2022-07-09 09:41:34,127][26022] Updated weights on worker 0-0, policy_version 192695 (0.00088) [2022-07-09 09:41:35,078][25689] Fps is (10 sec: 5589.7, 60 sec: 5731.5, 300 sec: 5751.6). Total num frames: 197323776. Throughput: 0: 6039.6. Samples: 197324650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:35,080][25689] Avg episode reward: [(0, '-50.636')] [2022-07-09 09:41:35,936][26022] Updated weights on worker 0-0, policy_version 192705 (0.00089) [2022-07-09 09:41:37,630][26022] Updated weights on worker 0-0, policy_version 192715 (0.00556) [2022-07-09 09:41:39,492][26022] Updated weights on worker 0-0, policy_version 192725 (0.00087) [2022-07-09 09:41:40,086][25689] Fps is (10 sec: 5690.2, 60 sec: 5737.7, 300 sec: 5756.5). Total num frames: 197353472. Throughput: 0: 6027.6. Samples: 197359164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:40,091][25689] Avg episode reward: [(0, '-49.904')] [2022-07-09 09:41:41,185][26022] Updated weights on worker 0-0, policy_version 192735 (0.00610) [2022-07-09 09:41:43,168][26022] Updated weights on worker 0-0, policy_version 192745 (0.00090) [2022-07-09 09:41:44,688][26022] Updated weights on worker 0-0, policy_version 192755 (0.00092) [2022-07-09 09:41:45,115][25689] Fps is (10 sec: 5917.8, 60 sec: 5737.0, 300 sec: 5750.1). Total num frames: 197383168. Throughput: 0: 5158.9. Samples: 197376608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:45,115][25689] Avg episode reward: [(0, '-50.217')] [2022-07-09 09:41:46,824][26022] Updated weights on worker 0-0, policy_version 192765 (0.00091) [2022-07-09 09:41:48,171][26022] Updated weights on worker 0-0, policy_version 192775 (0.00085) [2022-07-09 09:41:50,043][26022] Updated weights on worker 0-0, policy_version 192785 (0.00086) [2022-07-09 09:41:50,126][25689] Fps is (10 sec: 5813.9, 60 sec: 5737.6, 300 sec: 5755.2). Total num frames: 197411840. Throughput: 0: 6016.5. Samples: 197411240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:50,127][25689] Avg episode reward: [(0, '-49.733')] [2022-07-09 09:41:51,879][26022] Updated weights on worker 0-0, policy_version 192795 (0.00092) [2022-07-09 09:41:53,400][26022] Updated weights on worker 0-0, policy_version 192805 (0.00088) [2022-07-09 09:41:55,256][25689] Fps is (10 sec: 5654.3, 60 sec: 5719.2, 300 sec: 5750.0). Total num frames: 197440512. Throughput: 0: 6039.3. Samples: 197446510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:41:55,257][25689] Avg episode reward: [(0, '-50.255')] [2022-07-09 09:41:55,401][26022] Updated weights on worker 0-0, policy_version 192815 (0.00090) [2022-07-09 09:41:56,934][26022] Updated weights on worker 0-0, policy_version 192825 (0.00083) [2022-07-09 09:41:58,899][26022] Updated weights on worker 0-0, policy_version 192835 (0.00084) [2022-07-09 09:42:00,309][25689] Fps is (10 sec: 5732.0, 60 sec: 5714.9, 300 sec: 5759.5). Total num frames: 197470208. Throughput: 0: 5177.5. Samples: 197463866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:42:00,309][25689] Avg episode reward: [(0, '-50.060')] [2022-07-09 09:42:00,599][26022] Updated weights on worker 0-0, policy_version 192845 (0.00093) [2022-07-09 09:42:02,734][26022] Updated weights on worker 0-0, policy_version 192855 (0.00083) [2022-07-09 09:42:04,435][26022] Updated weights on worker 0-0, policy_version 192865 (0.00088) [2022-07-09 09:42:05,328][25689] Fps is (10 sec: 5693.7, 60 sec: 5747.8, 300 sec: 5752.6). Total num frames: 197497856. Throughput: 0: 5937.1. Samples: 197496614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:42:05,328][25689] Avg episode reward: [(0, '-50.578')] [2022-07-09 09:42:06,446][26022] Updated weights on worker 0-0, policy_version 192875 (0.00085) [2022-07-09 09:42:08,006][26022] Updated weights on worker 0-0, policy_version 192885 (0.00082) [2022-07-09 09:42:10,007][26022] Updated weights on worker 0-0, policy_version 192895 (0.00091) [2022-07-09 09:42:10,331][25689] Fps is (10 sec: 5517.4, 60 sec: 5715.4, 300 sec: 5745.0). Total num frames: 197525504. Throughput: 0: 5934.1. Samples: 197531138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:42:10,331][25689] Avg episode reward: [(0, '-50.478')] [2022-07-09 09:42:11,568][26022] Updated weights on worker 0-0, policy_version 192905 (0.00092) [2022-07-09 09:42:13,468][26022] Updated weights on worker 0-0, policy_version 192915 (0.00086) [2022-07-09 09:42:15,215][26022] Updated weights on worker 0-0, policy_version 192925 (0.00084) [2022-07-09 09:42:15,382][25689] Fps is (10 sec: 5805.2, 60 sec: 5748.1, 300 sec: 5751.2). Total num frames: 197556224. Throughput: 0: 5060.3. Samples: 197548354. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:42:15,383][25689] Avg episode reward: [(0, '-50.062')] [2022-07-09 09:42:17,001][26022] Updated weights on worker 0-0, policy_version 192935 (0.00083) [2022-07-09 09:42:18,787][26022] Updated weights on worker 0-0, policy_version 192945 (0.00091) [2022-07-09 09:42:20,494][25689] Fps is (10 sec: 5743.0, 60 sec: 5721.1, 300 sec: 5743.7). Total num frames: 197583872. Throughput: 0: 5908.1. Samples: 197583124. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-09 09:42:20,495][25689] Avg episode reward: [(0, '-51.026')] [2022-07-09 09:42:20,707][26022] Updated weights on worker 0-0, policy_version 192955 (0.00086) [2022-07-09 09:42:22,311][26022] Updated weights on worker 0-0, policy_version 192965 (0.00085) [2022-07-09 09:42:24,239][26022] Updated weights on worker 0-0, policy_version 192975 (0.00083) [2022-07-09 09:42:25,515][25689] Fps is (10 sec: 5760.1, 60 sec: 5736.6, 300 sec: 5746.8). Total num frames: 197614592. Throughput: 0: 6008.5. Samples: 197617912. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:25,516][25689] Avg episode reward: [(0, '-51.175')] [2022-07-09 09:42:25,862][26022] Updated weights on worker 0-0, policy_version 192985 (0.00092) [2022-07-09 09:42:27,810][26022] Updated weights on worker 0-0, policy_version 192995 (0.00093) [2022-07-09 09:42:29,432][26022] Updated weights on worker 0-0, policy_version 193005 (0.00087) [2022-07-09 09:42:30,517][25689] Fps is (10 sec: 5823.4, 60 sec: 5720.1, 300 sec: 5744.3). Total num frames: 197642240. Throughput: 0: 6007.1. Samples: 197652400. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:30,517][25689] Avg episode reward: [(0, '-51.873')] [2022-07-09 09:42:31,235][26022] Updated weights on worker 0-0, policy_version 193015 (0.00089) [2022-07-09 09:42:32,842][26022] Updated weights on worker 0-0, policy_version 193025 (0.00089) [2022-07-09 09:42:34,947][26022] Updated weights on worker 0-0, policy_version 193035 (0.00087) [2022-07-09 09:42:35,590][25689] Fps is (10 sec: 5692.1, 60 sec: 5753.6, 300 sec: 5747.0). Total num frames: 197671936. Throughput: 0: 6008.9. Samples: 197669780. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:35,590][25689] Avg episode reward: [(0, '-51.718')] [2022-07-09 09:42:36,611][26022] Updated weights on worker 0-0, policy_version 193045 (0.00101) [2022-07-09 09:42:38,513][26022] Updated weights on worker 0-0, policy_version 193055 (0.00098) [2022-07-09 09:42:40,041][26022] Updated weights on worker 0-0, policy_version 193065 (0.00087) [2022-07-09 09:42:40,607][25689] Fps is (10 sec: 5784.9, 60 sec: 5735.8, 300 sec: 5743.7). Total num frames: 197700608. Throughput: 0: 6027.5. Samples: 197704354. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:40,607][25689] Avg episode reward: [(0, '-51.618')] [2022-07-09 09:42:41,923][26022] Updated weights on worker 0-0, policy_version 193075 (0.00085) [2022-07-09 09:42:43,620][26022] Updated weights on worker 0-0, policy_version 193085 (0.00086) [2022-07-09 09:42:44,785][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:42:44,794][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000193091_197725184.pth [2022-07-09 09:42:44,794][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000191066_195651584.pth [2022-07-09 09:42:45,505][26022] Updated weights on worker 0-0, policy_version 193095 (0.00080) [2022-07-09 09:42:45,631][25689] Fps is (10 sec: 5812.9, 60 sec: 5736.3, 300 sec: 5747.2). Total num frames: 197730304. Throughput: 0: 6033.1. Samples: 197739270. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:45,631][25689] Avg episode reward: [(0, '-50.921')] [2022-07-09 09:42:47,036][26022] Updated weights on worker 0-0, policy_version 193105 (0.00083) [2022-07-09 09:42:49,029][26022] Updated weights on worker 0-0, policy_version 193115 (0.00085) [2022-07-09 09:42:50,642][25689] Fps is (10 sec: 5816.3, 60 sec: 5736.2, 300 sec: 5739.3). Total num frames: 197758976. Throughput: 0: 5182.2. Samples: 197756692. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:50,642][25689] Avg episode reward: [(0, '-50.104')] [2022-07-09 09:42:50,797][26022] Updated weights on worker 0-0, policy_version 193125 (0.00081) [2022-07-09 09:42:52,384][26022] Updated weights on worker 0-0, policy_version 193135 (0.00087) [2022-07-09 09:42:54,139][26022] Updated weights on worker 0-0, policy_version 193145 (0.00089) [2022-07-09 09:42:55,697][25689] Fps is (10 sec: 5798.4, 60 sec: 5760.3, 300 sec: 5742.8). Total num frames: 197788672. Throughput: 0: 6065.6. Samples: 197791742. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:42:55,699][25689] Avg episode reward: [(0, '-50.262')] [2022-07-09 09:42:56,047][26022] Updated weights on worker 0-0, policy_version 193155 (0.00090) [2022-07-09 09:42:57,697][26022] Updated weights on worker 0-0, policy_version 193165 (0.00081) [2022-07-09 09:42:59,633][26022] Updated weights on worker 0-0, policy_version 193175 (0.00089) [2022-07-09 09:43:00,707][25689] Fps is (10 sec: 5901.1, 60 sec: 5764.4, 300 sec: 5754.6). Total num frames: 197818368. Throughput: 0: 6073.6. Samples: 197826432. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:00,707][25689] Avg episode reward: [(0, '-49.563')] [2022-07-09 09:43:01,016][26022] Updated weights on worker 0-0, policy_version 193185 (0.00107) [2022-07-09 09:43:03,467][26022] Updated weights on worker 0-0, policy_version 193195 (0.00086) [2022-07-09 09:43:05,149][26022] Updated weights on worker 0-0, policy_version 193205 (0.00105) [2022-07-09 09:43:05,708][25689] Fps is (10 sec: 5625.9, 60 sec: 5749.2, 300 sec: 5744.5). Total num frames: 197844992. Throughput: 0: 5105.4. Samples: 197841772. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:05,708][25689] Avg episode reward: [(0, '-48.611')] [2022-07-09 09:43:06,993][26022] Updated weights on worker 0-0, policy_version 193215 (0.00083) [2022-07-09 09:43:08,797][26022] Updated weights on worker 0-0, policy_version 193225 (0.00085) [2022-07-09 09:43:10,578][26022] Updated weights on worker 0-0, policy_version 193235 (0.00094) [2022-07-09 09:43:10,741][25689] Fps is (10 sec: 5510.8, 60 sec: 5763.3, 300 sec: 5746.0). Total num frames: 197873664. Throughput: 0: 5949.1. Samples: 197876260. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:10,741][25689] Avg episode reward: [(0, '-48.527')] [2022-07-09 09:43:12,500][26022] Updated weights on worker 0-0, policy_version 193245 (0.00088) [2022-07-09 09:43:14,209][26022] Updated weights on worker 0-0, policy_version 193255 (0.00088) [2022-07-09 09:43:15,695][26022] Updated weights on worker 0-0, policy_version 193265 (0.00088) [2022-07-09 09:43:15,821][25689] Fps is (10 sec: 5771.7, 60 sec: 5743.6, 300 sec: 5744.6). Total num frames: 197903360. Throughput: 0: 5921.7. Samples: 197910908. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:15,821][25689] Avg episode reward: [(0, '-48.591')] [2022-07-09 09:43:17,880][26022] Updated weights on worker 0-0, policy_version 193275 (0.00083) [2022-07-09 09:43:19,311][26022] Updated weights on worker 0-0, policy_version 193285 (0.00083) [2022-07-09 09:43:20,842][25689] Fps is (10 sec: 5575.3, 60 sec: 5735.3, 300 sec: 5737.6). Total num frames: 197929984. Throughput: 0: 5050.4. Samples: 197928126. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:20,843][25689] Avg episode reward: [(0, '-48.790')] [2022-07-09 09:43:21,373][26022] Updated weights on worker 0-0, policy_version 193295 (0.00083) [2022-07-09 09:43:22,990][26022] Updated weights on worker 0-0, policy_version 193305 (0.00079) [2022-07-09 09:43:24,739][26022] Updated weights on worker 0-0, policy_version 193315 (0.00083) [2022-07-09 09:43:25,864][25689] Fps is (10 sec: 5709.6, 60 sec: 5735.2, 300 sec: 5740.8). Total num frames: 197960704. Throughput: 0: 6014.7. Samples: 197963006. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:25,865][25689] Avg episode reward: [(0, '-49.194')] [2022-07-09 09:43:26,663][26022] Updated weights on worker 0-0, policy_version 193325 (0.00084) [2022-07-09 09:43:28,373][26022] Updated weights on worker 0-0, policy_version 193335 (0.00088) [2022-07-09 09:43:30,081][26022] Updated weights on worker 0-0, policy_version 193345 (0.00090) [2022-07-09 09:43:30,879][25689] Fps is (10 sec: 5815.6, 60 sec: 5734.0, 300 sec: 5735.5). Total num frames: 197988352. Throughput: 0: 6010.2. Samples: 197997294. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:30,879][25689] Avg episode reward: [(0, '-48.805')] [2022-07-09 09:43:31,939][26022] Updated weights on worker 0-0, policy_version 193355 (0.00083) [2022-07-09 09:43:33,898][26022] Updated weights on worker 0-0, policy_version 193365 (0.00092) [2022-07-09 09:43:35,466][26022] Updated weights on worker 0-0, policy_version 193375 (0.00091) [2022-07-09 09:43:36,015][25689] Fps is (10 sec: 5649.1, 60 sec: 5727.9, 300 sec: 5740.5). Total num frames: 198018048. Throughput: 0: 5127.9. Samples: 198014462. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:36,016][25689] Avg episode reward: [(0, '-49.971')] [2022-07-09 09:43:37,442][26022] Updated weights on worker 0-0, policy_version 193385 (0.00095) [2022-07-09 09:43:39,011][26022] Updated weights on worker 0-0, policy_version 193395 (0.00081) [2022-07-09 09:43:41,044][25689] Fps is (10 sec: 5741.6, 60 sec: 5726.8, 300 sec: 5729.9). Total num frames: 198046720. Throughput: 0: 5965.2. Samples: 198048636. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:41,045][25689] Avg episode reward: [(0, '-49.951')] [2022-07-09 09:43:41,052][26022] Updated weights on worker 0-0, policy_version 193405 (0.00092) [2022-07-09 09:43:42,983][26022] Updated weights on worker 0-0, policy_version 193415 (0.00083) [2022-07-09 09:43:44,444][26022] Updated weights on worker 0-0, policy_version 193425 (0.00085) [2022-07-09 09:43:46,080][25689] Fps is (10 sec: 5697.4, 60 sec: 5708.7, 300 sec: 5736.8). Total num frames: 198075392. Throughput: 0: 5945.0. Samples: 198083190. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:46,082][25689] Avg episode reward: [(0, '-50.141')] [2022-07-09 09:43:46,397][26022] Updated weights on worker 0-0, policy_version 193435 (0.00087) [2022-07-09 09:43:47,849][26022] Updated weights on worker 0-0, policy_version 193445 (0.00094) [2022-07-09 09:43:49,871][26022] Updated weights on worker 0-0, policy_version 193455 (0.00078) [2022-07-09 09:43:51,124][25689] Fps is (10 sec: 5790.6, 60 sec: 5722.5, 300 sec: 5733.9). Total num frames: 198105088. Throughput: 0: 5104.1. Samples: 198100630. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:51,125][25689] Avg episode reward: [(0, '-49.979')] [2022-07-09 09:43:51,885][26022] Updated weights on worker 0-0, policy_version 193465 (0.00086) [2022-07-09 09:43:53,550][26022] Updated weights on worker 0-0, policy_version 193475 (0.00083) [2022-07-09 09:43:55,391][26022] Updated weights on worker 0-0, policy_version 193485 (0.00091) [2022-07-09 09:43:56,206][25689] Fps is (10 sec: 5865.3, 60 sec: 5720.0, 300 sec: 5736.1). Total num frames: 198134784. Throughput: 0: 5958.3. Samples: 198134766. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:43:56,207][25689] Avg episode reward: [(0, '-50.296')] [2022-07-09 09:43:57,122][26022] Updated weights on worker 0-0, policy_version 193495 (0.00083) [2022-07-09 09:43:58,802][26022] Updated weights on worker 0-0, policy_version 193505 (0.00078) [2022-07-09 09:44:00,841][26022] Updated weights on worker 0-0, policy_version 193515 (0.00086) [2022-07-09 09:44:01,214][25689] Fps is (10 sec: 5683.5, 60 sec: 5686.3, 300 sec: 5736.2). Total num frames: 198162432. Throughput: 0: 5976.7. Samples: 198169182. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:01,214][25689] Avg episode reward: [(0, '-50.013')] [2022-07-09 09:44:02,707][26022] Updated weights on worker 0-0, policy_version 193525 (0.00082) [2022-07-09 09:44:04,655][26022] Updated weights on worker 0-0, policy_version 193535 (0.00091) [2022-07-09 09:44:06,274][25689] Fps is (10 sec: 5289.0, 60 sec: 5663.9, 300 sec: 5728.5). Total num frames: 198188032. Throughput: 0: 5017.6. Samples: 198184514. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:06,276][25689] Avg episode reward: [(0, '-50.484')] [2022-07-09 09:44:06,368][26022] Updated weights on worker 0-0, policy_version 193545 (0.00089) [2022-07-09 09:44:07,961][26022] Updated weights on worker 0-0, policy_version 193555 (0.00097) [2022-07-09 09:44:10,143][26022] Updated weights on worker 0-0, policy_version 193565 (0.00086) [2022-07-09 09:44:11,280][25689] Fps is (10 sec: 5594.8, 60 sec: 5700.2, 300 sec: 5732.8). Total num frames: 198218752. Throughput: 0: 5859.2. Samples: 198218728. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:11,281][25689] Avg episode reward: [(0, '-51.411')] [2022-07-09 09:44:11,525][26022] Updated weights on worker 0-0, policy_version 193575 (0.00082) [2022-07-09 09:44:13,575][26022] Updated weights on worker 0-0, policy_version 193585 (0.00086) [2022-07-09 09:44:15,288][26022] Updated weights on worker 0-0, policy_version 193595 (0.00092) [2022-07-09 09:44:16,414][25689] Fps is (10 sec: 5857.1, 60 sec: 5678.3, 300 sec: 5727.9). Total num frames: 198247424. Throughput: 0: 5868.3. Samples: 198253350. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:16,414][25689] Avg episode reward: [(0, '-52.011')] [2022-07-09 09:44:17,045][26022] Updated weights on worker 0-0, policy_version 193605 (0.00090) [2022-07-09 09:44:18,938][26022] Updated weights on worker 0-0, policy_version 193615 (0.00093) [2022-07-09 09:44:20,423][26022] Updated weights on worker 0-0, policy_version 193625 (0.00081) [2022-07-09 09:44:21,432][25689] Fps is (10 sec: 5648.5, 60 sec: 5712.4, 300 sec: 5724.4). Total num frames: 198276096. Throughput: 0: 5027.3. Samples: 198270826. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:21,433][25689] Avg episode reward: [(0, '-51.538')] [2022-07-09 09:44:22,328][26022] Updated weights on worker 0-0, policy_version 193635 (0.00086) [2022-07-09 09:44:24,035][26022] Updated weights on worker 0-0, policy_version 193645 (0.00096) [2022-07-09 09:44:25,988][26022] Updated weights on worker 0-0, policy_version 193655 (0.00092) [2022-07-09 09:44:26,487][25689] Fps is (10 sec: 5794.5, 60 sec: 5692.4, 300 sec: 5730.4). Total num frames: 198305792. Throughput: 0: 5997.9. Samples: 198305750. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:26,487][25689] Avg episode reward: [(0, '-52.576')] [2022-07-09 09:44:27,586][26022] Updated weights on worker 0-0, policy_version 193665 (0.00091) [2022-07-09 09:44:29,759][26022] Updated weights on worker 0-0, policy_version 193675 (0.00089) [2022-07-09 09:44:31,107][26022] Updated weights on worker 0-0, policy_version 193685 (0.00084) [2022-07-09 09:44:31,545][25689] Fps is (10 sec: 5974.0, 60 sec: 5738.9, 300 sec: 5738.6). Total num frames: 198336512. Throughput: 0: 6003.5. Samples: 198340390. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 09:44:31,546][25689] Avg episode reward: [(0, '-52.211')] [2022-07-09 09:44:33,226][26022] Updated weights on worker 0-0, policy_version 193695 (0.00719) [2022-07-09 09:44:34,583][26022] Updated weights on worker 0-0, policy_version 193705 (0.00087) [2022-07-09 09:44:36,659][25689] Fps is (10 sec: 5637.0, 60 sec: 5690.4, 300 sec: 5720.7). Total num frames: 198363136. Throughput: 0: 5145.3. Samples: 198357520. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:44:36,660][25689] Avg episode reward: [(0, '-51.881')] [2022-07-09 09:44:36,701][26022] Updated weights on worker 0-0, policy_version 193715 (0.00079) [2022-07-09 09:44:38,467][26022] Updated weights on worker 0-0, policy_version 193725 (0.00092) [2022-07-09 09:44:40,135][26022] Updated weights on worker 0-0, policy_version 193735 (0.00093) [2022-07-09 09:44:41,670][25689] Fps is (10 sec: 5461.3, 60 sec: 5692.1, 300 sec: 5717.6). Total num frames: 198391808. Throughput: 0: 5975.7. Samples: 198391764. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:44:41,671][25689] Avg episode reward: [(0, '-50.512')] [2022-07-09 09:44:42,006][26022] Updated weights on worker 0-0, policy_version 193745 (0.00469) [2022-07-09 09:44:43,776][26022] Updated weights on worker 0-0, policy_version 193755 (0.00091) [2022-07-09 09:44:44,850][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:44:44,869][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000193760_198410240.pth [2022-07-09 09:44:44,870][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000191744_196345856.pth [2022-07-09 09:44:45,453][26022] Updated weights on worker 0-0, policy_version 193765 (0.00090) [2022-07-09 09:44:46,727][25689] Fps is (10 sec: 5899.6, 60 sec: 5723.9, 300 sec: 5727.4). Total num frames: 198422528. Throughput: 0: 5964.6. Samples: 198426472. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:44:46,727][25689] Avg episode reward: [(0, '-50.301')] [2022-07-09 09:44:47,324][26022] Updated weights on worker 0-0, policy_version 193775 (0.00079) [2022-07-09 09:44:49,028][26022] Updated weights on worker 0-0, policy_version 193785 (0.00091) [2022-07-09 09:44:50,898][26022] Updated weights on worker 0-0, policy_version 193795 (0.00087) [2022-07-09 09:44:51,747][25689] Fps is (10 sec: 5893.8, 60 sec: 5709.2, 300 sec: 5728.9). Total num frames: 198451200. Throughput: 0: 5117.0. Samples: 198443764. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:44:51,748][25689] Avg episode reward: [(0, '-49.660')] [2022-07-09 09:44:52,568][26022] Updated weights on worker 0-0, policy_version 193805 (0.00083) [2022-07-09 09:44:54,288][26022] Updated weights on worker 0-0, policy_version 193815 (0.00088) [2022-07-09 09:44:56,218][26022] Updated weights on worker 0-0, policy_version 193825 (0.00089) [2022-07-09 09:44:56,873][25689] Fps is (10 sec: 5752.8, 60 sec: 5705.2, 300 sec: 5723.7). Total num frames: 198480896. Throughput: 0: 5982.4. Samples: 198478442. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:44:56,874][25689] Avg episode reward: [(0, '-49.401')] [2022-07-09 09:44:57,858][26022] Updated weights on worker 0-0, policy_version 193835 (0.00089) [2022-07-09 09:44:59,840][26022] Updated weights on worker 0-0, policy_version 193846 (0.00088) [2022-07-09 09:45:01,897][25689] Fps is (10 sec: 5549.1, 60 sec: 5686.7, 300 sec: 5727.7). Total num frames: 198507520. Throughput: 0: 5979.9. Samples: 198512716. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:01,899][25689] Avg episode reward: [(0, '-50.321')] [2022-07-09 09:45:02,015][26022] Updated weights on worker 0-0, policy_version 193856 (0.00084) [2022-07-09 09:45:03,893][26022] Updated weights on worker 0-0, policy_version 193866 (0.00091) [2022-07-09 09:45:05,811][26022] Updated weights on worker 0-0, policy_version 193876 (0.00094) [2022-07-09 09:45:06,938][25689] Fps is (10 sec: 5493.8, 60 sec: 5739.1, 300 sec: 5724.1). Total num frames: 198536192. Throughput: 0: 5861.7. Samples: 198544944. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:06,938][25689] Avg episode reward: [(0, '-50.514')] [2022-07-09 09:45:07,464][26022] Updated weights on worker 0-0, policy_version 193886 (0.00091) [2022-07-09 09:45:09,263][26022] Updated weights on worker 0-0, policy_version 193896 (0.00085) [2022-07-09 09:45:11,200][26022] Updated weights on worker 0-0, policy_version 193906 (0.00092) [2022-07-09 09:45:12,038][25689] Fps is (10 sec: 5553.8, 60 sec: 5679.8, 300 sec: 5717.2). Total num frames: 198563840. Throughput: 0: 5815.2. Samples: 198561756. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:12,039][25689] Avg episode reward: [(0, '-51.467')] [2022-07-09 09:45:12,787][26022] Updated weights on worker 0-0, policy_version 193916 (0.00084) [2022-07-09 09:45:14,833][26022] Updated weights on worker 0-0, policy_version 193926 (0.00090) [2022-07-09 09:45:16,241][26022] Updated weights on worker 0-0, policy_version 193936 (0.00087) [2022-07-09 09:45:17,180][25689] Fps is (10 sec: 5598.8, 60 sec: 5695.8, 300 sec: 5718.4). Total num frames: 198593536. Throughput: 0: 5803.4. Samples: 198596294. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:17,180][25689] Avg episode reward: [(0, '-52.323')] [2022-07-09 09:45:18,354][26022] Updated weights on worker 0-0, policy_version 193946 (0.00093) [2022-07-09 09:45:20,101][26022] Updated weights on worker 0-0, policy_version 193956 (0.00081) [2022-07-09 09:45:21,805][26022] Updated weights on worker 0-0, policy_version 193966 (0.00094) [2022-07-09 09:45:22,207][25689] Fps is (10 sec: 5739.7, 60 sec: 5695.0, 300 sec: 5717.9). Total num frames: 198622208. Throughput: 0: 5799.5. Samples: 198630504. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:22,208][25689] Avg episode reward: [(0, '-52.010')] [2022-07-09 09:45:23,812][26022] Updated weights on worker 0-0, policy_version 193976 (0.00092) [2022-07-09 09:45:25,683][26022] Updated weights on worker 0-0, policy_version 193986 (0.00085) [2022-07-09 09:45:27,178][26022] Updated weights on worker 0-0, policy_version 193996 (0.00087) [2022-07-09 09:45:27,292][25689] Fps is (10 sec: 5772.3, 60 sec: 5692.2, 300 sec: 5716.4). Total num frames: 198651904. Throughput: 0: 5035.5. Samples: 198647436. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:27,293][25689] Avg episode reward: [(0, '-50.440')] [2022-07-09 09:45:29,139][26022] Updated weights on worker 0-0, policy_version 194006 (0.00085) [2022-07-09 09:45:30,647][26022] Updated weights on worker 0-0, policy_version 194016 (0.00091) [2022-07-09 09:45:32,296][25689] Fps is (10 sec: 5683.7, 60 sec: 5646.7, 300 sec: 5718.6). Total num frames: 198679552. Throughput: 0: 5945.7. Samples: 198682206. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:32,297][25689] Avg episode reward: [(0, '-50.153')] [2022-07-09 09:45:32,682][26022] Updated weights on worker 0-0, policy_version 194026 (0.00079) [2022-07-09 09:45:34,344][26022] Updated weights on worker 0-0, policy_version 194036 (0.00098) [2022-07-09 09:45:36,229][26022] Updated weights on worker 0-0, policy_version 194046 (0.00086) [2022-07-09 09:45:37,344][25689] Fps is (10 sec: 5704.5, 60 sec: 5703.5, 300 sec: 5719.1). Total num frames: 198709248. Throughput: 0: 5975.3. Samples: 198716780. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:37,345][25689] Avg episode reward: [(0, '-49.103')] [2022-07-09 09:45:37,921][26022] Updated weights on worker 0-0, policy_version 194056 (0.00085) [2022-07-09 09:45:39,735][26022] Updated weights on worker 0-0, policy_version 194066 (0.00085) [2022-07-09 09:45:41,569][26022] Updated weights on worker 0-0, policy_version 194076 (0.00090) [2022-07-09 09:45:42,353][25689] Fps is (10 sec: 5905.6, 60 sec: 5720.5, 300 sec: 5719.3). Total num frames: 198738944. Throughput: 0: 5131.4. Samples: 198733882. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:42,353][25689] Avg episode reward: [(0, '-49.424')] [2022-07-09 09:45:43,354][26022] Updated weights on worker 0-0, policy_version 194086 (0.00085) [2022-07-09 09:45:45,142][26022] Updated weights on worker 0-0, policy_version 194096 (0.01093) [2022-07-09 09:45:46,899][26022] Updated weights on worker 0-0, policy_version 194106 (0.00086) [2022-07-09 09:45:47,369][25689] Fps is (10 sec: 5720.3, 60 sec: 5673.7, 300 sec: 5715.9). Total num frames: 198766592. Throughput: 0: 6016.0. Samples: 198768220. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:47,369][25689] Avg episode reward: [(0, '-49.546')] [2022-07-09 09:45:48,707][26022] Updated weights on worker 0-0, policy_version 194116 (0.00085) [2022-07-09 09:45:50,419][26022] Updated weights on worker 0-0, policy_version 194126 (0.00083) [2022-07-09 09:45:52,268][26022] Updated weights on worker 0-0, policy_version 194136 (0.00094) [2022-07-09 09:45:52,373][25689] Fps is (10 sec: 5722.6, 60 sec: 5692.1, 300 sec: 5718.1). Total num frames: 198796288. Throughput: 0: 6006.6. Samples: 198802804. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:52,374][25689] Avg episode reward: [(0, '-49.049')] [2022-07-09 09:45:54,199][26022] Updated weights on worker 0-0, policy_version 194146 (0.00082) [2022-07-09 09:45:55,775][26022] Updated weights on worker 0-0, policy_version 194156 (0.00081) [2022-07-09 09:45:57,416][25689] Fps is (10 sec: 5707.1, 60 sec: 5666.0, 300 sec: 5710.5). Total num frames: 198823936. Throughput: 0: 5139.2. Samples: 198819936. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:45:57,417][25689] Avg episode reward: [(0, '-49.287')] [2022-07-09 09:45:57,703][26022] Updated weights on worker 0-0, policy_version 194166 (0.00086) [2022-07-09 09:45:59,160][26022] Updated weights on worker 0-0, policy_version 194176 (0.00087) [2022-07-09 09:46:01,355][26022] Updated weights on worker 0-0, policy_version 194186 (0.00079) [2022-07-09 09:46:02,474][25689] Fps is (10 sec: 5576.2, 60 sec: 5696.7, 300 sec: 5719.9). Total num frames: 198852608. Throughput: 0: 6001.4. Samples: 198854634. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:02,474][25689] Avg episode reward: [(0, '-49.308')] [2022-07-09 09:46:03,246][26022] Updated weights on worker 0-0, policy_version 194196 (0.00087) [2022-07-09 09:46:05,124][26022] Updated weights on worker 0-0, policy_version 194206 (0.00091) [2022-07-09 09:46:06,932][26022] Updated weights on worker 0-0, policy_version 194216 (0.00092) [2022-07-09 09:46:07,493][25689] Fps is (10 sec: 5589.4, 60 sec: 5681.8, 300 sec: 5713.0). Total num frames: 198880256. Throughput: 0: 5890.0. Samples: 198886752. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:07,493][25689] Avg episode reward: [(0, '-48.814')] [2022-07-09 09:46:08,610][26022] Updated weights on worker 0-0, policy_version 194226 (0.00093) [2022-07-09 09:46:10,595][26022] Updated weights on worker 0-0, policy_version 194236 (0.00088) [2022-07-09 09:46:12,181][26022] Updated weights on worker 0-0, policy_version 194246 (0.00090) [2022-07-09 09:46:12,505][25689] Fps is (10 sec: 5614.1, 60 sec: 5707.0, 300 sec: 5713.5). Total num frames: 198908928. Throughput: 0: 5013.3. Samples: 198903732. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:12,506][25689] Avg episode reward: [(0, '-47.599')] [2022-07-09 09:46:14,129][26022] Updated weights on worker 0-0, policy_version 194256 (0.00089) [2022-07-09 09:46:15,830][26022] Updated weights on worker 0-0, policy_version 194266 (0.00090) [2022-07-09 09:46:17,575][25689] Fps is (10 sec: 5687.8, 60 sec: 5696.9, 300 sec: 5712.2). Total num frames: 198937600. Throughput: 0: 5869.4. Samples: 198938252. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:17,575][25689] Avg episode reward: [(0, '-47.406')] [2022-07-09 09:46:17,678][26022] Updated weights on worker 0-0, policy_version 194276 (0.00090) [2022-07-09 09:46:19,700][26022] Updated weights on worker 0-0, policy_version 194286 (0.00095) [2022-07-09 09:46:21,197][26022] Updated weights on worker 0-0, policy_version 194296 (0.00088) [2022-07-09 09:46:22,583][25689] Fps is (10 sec: 5588.9, 60 sec: 5681.7, 300 sec: 5705.3). Total num frames: 198965248. Throughput: 0: 5860.5. Samples: 198972482. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:22,583][25689] Avg episode reward: [(0, '-49.074')] [2022-07-09 09:46:23,008][26022] Updated weights on worker 0-0, policy_version 194306 (0.00083) [2022-07-09 09:46:24,912][26022] Updated weights on worker 0-0, policy_version 194316 (0.00089) [2022-07-09 09:46:26,514][26022] Updated weights on worker 0-0, policy_version 194326 (0.00082) [2022-07-09 09:46:27,586][25689] Fps is (10 sec: 5727.6, 60 sec: 5689.4, 300 sec: 5708.8). Total num frames: 198994944. Throughput: 0: 5128.7. Samples: 198989806. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:27,587][25689] Avg episode reward: [(0, '-49.906')] [2022-07-09 09:46:28,532][26022] Updated weights on worker 0-0, policy_version 194336 (0.00093) [2022-07-09 09:46:29,999][26022] Updated weights on worker 0-0, policy_version 194346 (0.00090) [2022-07-09 09:46:31,982][26022] Updated weights on worker 0-0, policy_version 194356 (0.00084) [2022-07-09 09:46:32,595][25689] Fps is (10 sec: 5829.8, 60 sec: 5706.0, 300 sec: 5713.4). Total num frames: 199023616. Throughput: 0: 5996.5. Samples: 199024198. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:32,596][25689] Avg episode reward: [(0, '-50.109')] [2022-07-09 09:46:33,710][26022] Updated weights on worker 0-0, policy_version 194366 (0.00095) [2022-07-09 09:46:35,551][26022] Updated weights on worker 0-0, policy_version 194376 (0.00088) [2022-07-09 09:46:37,539][26022] Updated weights on worker 0-0, policy_version 194386 (0.00078) [2022-07-09 09:46:37,626][25689] Fps is (10 sec: 5711.8, 60 sec: 5690.6, 300 sec: 5709.5). Total num frames: 199052288. Throughput: 0: 6001.4. Samples: 199058588. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:37,635][25689] Avg episode reward: [(0, '-50.768')] [2022-07-09 09:46:39,057][26022] Updated weights on worker 0-0, policy_version 194396 (0.00085) [2022-07-09 09:46:41,043][26022] Updated weights on worker 0-0, policy_version 194406 (0.00093) [2022-07-09 09:46:42,640][25689] Fps is (10 sec: 5708.7, 60 sec: 5673.2, 300 sec: 5706.4). Total num frames: 199080960. Throughput: 0: 5136.0. Samples: 199075494. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 09:46:42,640][25689] Avg episode reward: [(0, '-51.481')] [2022-07-09 09:46:42,823][26022] Updated weights on worker 0-0, policy_version 194416 (0.00093) [2022-07-09 09:46:44,535][26022] Updated weights on worker 0-0, policy_version 194426 (0.00084) [2022-07-09 09:46:45,176][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:46:45,190][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000194430_199096320.pth [2022-07-09 09:46:45,191][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000192419_197037056.pth [2022-07-09 09:46:46,453][26022] Updated weights on worker 0-0, policy_version 194436 (0.00097) [2022-07-09 09:46:47,643][25689] Fps is (10 sec: 5724.9, 60 sec: 5691.3, 300 sec: 5706.5). Total num frames: 199109632. Throughput: 0: 5995.2. Samples: 199110046. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:46:47,643][25689] Avg episode reward: [(0, '-51.552')] [2022-07-09 09:46:47,985][26022] Updated weights on worker 0-0, policy_version 194446 (0.00086) [2022-07-09 09:46:49,996][26022] Updated weights on worker 0-0, policy_version 194456 (0.00084) [2022-07-09 09:46:51,703][26022] Updated weights on worker 0-0, policy_version 194466 (0.00088) [2022-07-09 09:46:52,667][25689] Fps is (10 sec: 5616.8, 60 sec: 5655.6, 300 sec: 5705.1). Total num frames: 199137280. Throughput: 0: 5991.3. Samples: 199144456. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:46:52,667][25689] Avg episode reward: [(0, '-50.222')] [2022-07-09 09:46:53,459][26022] Updated weights on worker 0-0, policy_version 194476 (0.00079) [2022-07-09 09:46:55,249][26022] Updated weights on worker 0-0, policy_version 194486 (0.00091) [2022-07-09 09:46:57,010][26022] Updated weights on worker 0-0, policy_version 194496 (0.00096) [2022-07-09 09:46:57,731][25689] Fps is (10 sec: 5684.3, 60 sec: 5687.5, 300 sec: 5704.9). Total num frames: 199166976. Throughput: 0: 5136.5. Samples: 199161856. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:46:57,731][25689] Avg episode reward: [(0, '-49.683')] [2022-07-09 09:46:58,837][26022] Updated weights on worker 0-0, policy_version 194506 (0.00084) [2022-07-09 09:47:00,537][26022] Updated weights on worker 0-0, policy_version 194516 (0.00086) [2022-07-09 09:47:02,727][26022] Updated weights on worker 0-0, policy_version 194526 (0.00088) [2022-07-09 09:47:02,754][25689] Fps is (10 sec: 5684.5, 60 sec: 5673.7, 300 sec: 5704.8). Total num frames: 199194624. Throughput: 0: 6019.8. Samples: 199196582. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:02,755][25689] Avg episode reward: [(0, '-49.556')] [2022-07-09 09:47:04,452][26022] Updated weights on worker 0-0, policy_version 194536 (0.00077) [2022-07-09 09:47:06,337][26022] Updated weights on worker 0-0, policy_version 194546 (0.00080) [2022-07-09 09:47:07,758][25689] Fps is (10 sec: 5616.6, 60 sec: 5692.2, 300 sec: 5708.3). Total num frames: 199223296. Throughput: 0: 5935.2. Samples: 199229436. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:07,759][25689] Avg episode reward: [(0, '-49.236')] [2022-07-09 09:47:08,119][26022] Updated weights on worker 0-0, policy_version 194556 (0.00050) [2022-07-09 09:47:09,749][26022] Updated weights on worker 0-0, policy_version 194566 (0.00085) [2022-07-09 09:47:11,605][26022] Updated weights on worker 0-0, policy_version 194576 (0.00092) [2022-07-09 09:47:12,767][25689] Fps is (10 sec: 5829.7, 60 sec: 5709.5, 300 sec: 5705.6). Total num frames: 199252992. Throughput: 0: 5094.8. Samples: 199246862. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:12,767][25689] Avg episode reward: [(0, '-48.628')] [2022-07-09 09:47:13,286][26022] Updated weights on worker 0-0, policy_version 194586 (0.00087) [2022-07-09 09:47:15,146][26022] Updated weights on worker 0-0, policy_version 194596 (0.00086) [2022-07-09 09:47:16,908][26022] Updated weights on worker 0-0, policy_version 194606 (0.00093) [2022-07-09 09:47:17,833][25689] Fps is (10 sec: 5793.5, 60 sec: 5709.8, 300 sec: 5710.0). Total num frames: 199281664. Throughput: 0: 5934.2. Samples: 199281146. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:17,833][25689] Avg episode reward: [(0, '-49.310')] [2022-07-09 09:47:18,676][26022] Updated weights on worker 0-0, policy_version 194616 (0.00090) [2022-07-09 09:47:20,523][26022] Updated weights on worker 0-0, policy_version 194626 (0.00086) [2022-07-09 09:47:22,195][26022] Updated weights on worker 0-0, policy_version 194636 (0.00498) [2022-07-09 09:47:22,896][25689] Fps is (10 sec: 5863.1, 60 sec: 5755.5, 300 sec: 5709.1). Total num frames: 199312384. Throughput: 0: 5925.2. Samples: 199315928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:22,897][25689] Avg episode reward: [(0, '-48.958')] [2022-07-09 09:47:24,047][26022] Updated weights on worker 0-0, policy_version 194646 (0.00090) [2022-07-09 09:47:25,650][26022] Updated weights on worker 0-0, policy_version 194656 (0.00120) [2022-07-09 09:47:27,567][26022] Updated weights on worker 0-0, policy_version 194666 (0.00087) [2022-07-09 09:47:27,935][25689] Fps is (10 sec: 5777.6, 60 sec: 5718.2, 300 sec: 5708.4). Total num frames: 199340032. Throughput: 0: 5143.2. Samples: 199333210. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:27,936][25689] Avg episode reward: [(0, '-48.864')] [2022-07-09 09:47:29,284][26022] Updated weights on worker 0-0, policy_version 194676 (0.00088) [2022-07-09 09:47:31,051][26022] Updated weights on worker 0-0, policy_version 194686 (0.00086) [2022-07-09 09:47:32,833][26022] Updated weights on worker 0-0, policy_version 194696 (0.00087) [2022-07-09 09:47:32,962][25689] Fps is (10 sec: 5595.5, 60 sec: 5716.5, 300 sec: 5705.9). Total num frames: 199368704. Throughput: 0: 6009.2. Samples: 199368218. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:32,962][25689] Avg episode reward: [(0, '-49.352')] [2022-07-09 09:47:34,524][26022] Updated weights on worker 0-0, policy_version 194706 (0.00081) [2022-07-09 09:47:36,371][26022] Updated weights on worker 0-0, policy_version 194716 (0.00084) [2022-07-09 09:47:38,105][25689] Fps is (10 sec: 5739.5, 60 sec: 5722.9, 300 sec: 5706.9). Total num frames: 199398400. Throughput: 0: 6004.4. Samples: 199402866. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:38,105][25689] Avg episode reward: [(0, '-50.268')] [2022-07-09 09:47:38,287][26022] Updated weights on worker 0-0, policy_version 194726 (0.00086) [2022-07-09 09:47:39,826][26022] Updated weights on worker 0-0, policy_version 194736 (0.00086) [2022-07-09 09:47:41,837][26022] Updated weights on worker 0-0, policy_version 194746 (0.00090) [2022-07-09 09:47:43,152][25689] Fps is (10 sec: 5828.4, 60 sec: 5736.6, 300 sec: 5706.5). Total num frames: 199428096. Throughput: 0: 5983.7. Samples: 199437130. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:43,154][25689] Avg episode reward: [(0, '-50.095')] [2022-07-09 09:47:43,408][26022] Updated weights on worker 0-0, policy_version 194756 (0.00094) [2022-07-09 09:47:45,450][26022] Updated weights on worker 0-0, policy_version 194766 (0.00086) [2022-07-09 09:47:46,887][26022] Updated weights on worker 0-0, policy_version 194776 (0.00089) [2022-07-09 09:47:48,198][25689] Fps is (10 sec: 5681.3, 60 sec: 5715.6, 300 sec: 5702.3). Total num frames: 199455744. Throughput: 0: 5995.6. Samples: 199454698. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:48,200][25689] Avg episode reward: [(0, '-50.219')] [2022-07-09 09:47:48,840][26022] Updated weights on worker 0-0, policy_version 194786 (0.00093) [2022-07-09 09:47:50,539][26022] Updated weights on worker 0-0, policy_version 194796 (0.00094) [2022-07-09 09:47:52,419][26022] Updated weights on worker 0-0, policy_version 194806 (0.00088) [2022-07-09 09:47:53,222][25689] Fps is (10 sec: 5796.3, 60 sec: 5766.4, 300 sec: 5706.4). Total num frames: 199486464. Throughput: 0: 6000.5. Samples: 199489788. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:53,223][25689] Avg episode reward: [(0, '-50.532')] [2022-07-09 09:47:54,171][26022] Updated weights on worker 0-0, policy_version 194816 (0.00082) [2022-07-09 09:47:55,862][26022] Updated weights on worker 0-0, policy_version 194826 (0.00093) [2022-07-09 09:47:57,624][26022] Updated weights on worker 0-0, policy_version 194836 (0.00103) [2022-07-09 09:47:58,288][25689] Fps is (10 sec: 5987.7, 60 sec: 5766.2, 300 sec: 5705.3). Total num frames: 199516160. Throughput: 0: 6038.8. Samples: 199524750. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:47:58,289][25689] Avg episode reward: [(0, '-49.705')] [2022-07-09 09:47:59,206][26022] Updated weights on worker 0-0, policy_version 194846 (0.00081) [2022-07-09 09:48:01,166][26022] Updated weights on worker 0-0, policy_version 194856 (0.00087) [2022-07-09 09:48:03,204][26022] Updated weights on worker 0-0, policy_version 194866 (0.00088) [2022-07-09 09:48:03,301][25689] Fps is (10 sec: 5587.9, 60 sec: 5750.3, 300 sec: 5705.1). Total num frames: 199542784. Throughput: 0: 5210.5. Samples: 199542116. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:03,301][25689] Avg episode reward: [(0, '-50.429')] [2022-07-09 09:48:04,930][26022] Updated weights on worker 0-0, policy_version 194876 (0.00088) [2022-07-09 09:48:06,877][26022] Updated weights on worker 0-0, policy_version 194886 (0.00089) [2022-07-09 09:48:08,303][25689] Fps is (10 sec: 5623.5, 60 sec: 5767.3, 300 sec: 5709.1). Total num frames: 199572480. Throughput: 0: 5981.9. Samples: 199574964. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:08,304][25689] Avg episode reward: [(0, '-50.254')] [2022-07-09 09:48:08,434][26022] Updated weights on worker 0-0, policy_version 194896 (0.00083) [2022-07-09 09:48:10,494][26022] Updated weights on worker 0-0, policy_version 194906 (0.00086) [2022-07-09 09:48:12,007][26022] Updated weights on worker 0-0, policy_version 194916 (0.00549) [2022-07-09 09:48:13,317][25689] Fps is (10 sec: 5622.9, 60 sec: 5716.1, 300 sec: 5700.1). Total num frames: 199599104. Throughput: 0: 5955.2. Samples: 199609458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:13,319][25689] Avg episode reward: [(0, '-49.825')] [2022-07-09 09:48:13,907][26022] Updated weights on worker 0-0, policy_version 194926 (0.00085) [2022-07-09 09:48:15,593][26022] Updated weights on worker 0-0, policy_version 194936 (0.00088) [2022-07-09 09:48:17,312][26022] Updated weights on worker 0-0, policy_version 194946 (0.00085) [2022-07-09 09:48:18,383][25689] Fps is (10 sec: 5689.2, 60 sec: 5750.0, 300 sec: 5713.0). Total num frames: 199629824. Throughput: 0: 5089.5. Samples: 199627022. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:18,383][25689] Avg episode reward: [(0, '-50.323')] [2022-07-09 09:48:19,295][26022] Updated weights on worker 0-0, policy_version 194956 (0.00083) [2022-07-09 09:48:20,831][26022] Updated weights on worker 0-0, policy_version 194966 (0.00096) [2022-07-09 09:48:22,718][26022] Updated weights on worker 0-0, policy_version 194976 (0.00088) [2022-07-09 09:48:23,403][25689] Fps is (10 sec: 5990.1, 60 sec: 5737.2, 300 sec: 5709.6). Total num frames: 199659520. Throughput: 0: 5968.1. Samples: 199662088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:23,403][25689] Avg episode reward: [(0, '-49.988')] [2022-07-09 09:48:24,572][26022] Updated weights on worker 0-0, policy_version 194986 (0.00088) [2022-07-09 09:48:26,299][26022] Updated weights on worker 0-0, policy_version 194996 (0.00085) [2022-07-09 09:48:28,039][26022] Updated weights on worker 0-0, policy_version 195006 (0.00082) [2022-07-09 09:48:28,416][25689] Fps is (10 sec: 5715.2, 60 sec: 5739.6, 300 sec: 5709.6). Total num frames: 199687168. Throughput: 0: 6037.8. Samples: 199696402. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:28,417][25689] Avg episode reward: [(0, '-50.508')] [2022-07-09 09:48:29,670][26022] Updated weights on worker 0-0, policy_version 195016 (0.00094) [2022-07-09 09:48:31,676][26022] Updated weights on worker 0-0, policy_version 195026 (0.00087) [2022-07-09 09:48:33,389][26022] Updated weights on worker 0-0, policy_version 195036 (0.00089) [2022-07-09 09:48:33,427][25689] Fps is (10 sec: 5720.8, 60 sec: 5758.1, 300 sec: 5712.0). Total num frames: 199716864. Throughput: 0: 5181.3. Samples: 199713652. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:33,427][25689] Avg episode reward: [(0, '-49.707')] [2022-07-09 09:48:35,254][26022] Updated weights on worker 0-0, policy_version 195046 (0.00085) [2022-07-09 09:48:37,015][26022] Updated weights on worker 0-0, policy_version 195056 (0.00087) [2022-07-09 09:48:38,503][25689] Fps is (10 sec: 5888.2, 60 sec: 5764.4, 300 sec: 5714.5). Total num frames: 199746560. Throughput: 0: 6027.6. Samples: 199748298. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:38,504][25689] Avg episode reward: [(0, '-50.240')] [2022-07-09 09:48:38,591][26022] Updated weights on worker 0-0, policy_version 195066 (0.00092) [2022-07-09 09:48:40,388][26022] Updated weights on worker 0-0, policy_version 195076 (0.00083) [2022-07-09 09:48:42,194][26022] Updated weights on worker 0-0, policy_version 195086 (0.00086) [2022-07-09 09:48:43,516][25689] Fps is (10 sec: 5785.1, 60 sec: 5750.7, 300 sec: 5715.0). Total num frames: 199775232. Throughput: 0: 6010.5. Samples: 199782978. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:43,517][25689] Avg episode reward: [(0, '-49.384')] [2022-07-09 09:48:43,986][26022] Updated weights on worker 0-0, policy_version 195096 (0.00086) [2022-07-09 09:48:45,448][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:48:45,462][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000195103_199785472.pth [2022-07-09 09:48:45,462][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000193091_197725184.pth [2022-07-09 09:48:45,964][26022] Updated weights on worker 0-0, policy_version 195106 (0.00084) [2022-07-09 09:48:47,617][26022] Updated weights on worker 0-0, policy_version 195116 (0.00082) [2022-07-09 09:48:48,541][25689] Fps is (10 sec: 5508.5, 60 sec: 5735.7, 300 sec: 5705.0). Total num frames: 199801856. Throughput: 0: 5165.7. Samples: 199800362. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:48,543][25689] Avg episode reward: [(0, '-49.204')] [2022-07-09 09:48:49,447][26022] Updated weights on worker 0-0, policy_version 195126 (0.00091) [2022-07-09 09:48:51,175][26022] Updated weights on worker 0-0, policy_version 195136 (0.00109) [2022-07-09 09:48:53,152][26022] Updated weights on worker 0-0, policy_version 195146 (0.00086) [2022-07-09 09:48:53,551][25689] Fps is (10 sec: 5816.7, 60 sec: 5754.1, 300 sec: 5713.3). Total num frames: 199833600. Throughput: 0: 6017.7. Samples: 199834752. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:53,551][25689] Avg episode reward: [(0, '-48.565')] [2022-07-09 09:48:54,739][26022] Updated weights on worker 0-0, policy_version 195156 (0.00082) [2022-07-09 09:48:56,445][26022] Updated weights on worker 0-0, policy_version 195166 (0.00093) [2022-07-09 09:48:58,102][26022] Updated weights on worker 0-0, policy_version 195176 (0.00087) [2022-07-09 09:48:58,605][25689] Fps is (10 sec: 6003.1, 60 sec: 5738.2, 300 sec: 5715.8). Total num frames: 199862272. Throughput: 0: 6056.1. Samples: 199870040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 09:48:58,608][25689] Avg episode reward: [(0, '-48.748')] [2022-07-09 09:48:59,944][26022] Updated weights on worker 0-0, policy_version 195186 (0.00095) [2022-07-09 09:49:02,199][26022] Updated weights on worker 0-0, policy_version 195196 (0.00092) [2022-07-09 09:49:03,622][25689] Fps is (10 sec: 5592.0, 60 sec: 5754.8, 300 sec: 5723.5). Total num frames: 199889920. Throughput: 0: 5211.3. Samples: 199887756. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:03,624][25689] Avg episode reward: [(0, '-48.519')] [2022-07-09 09:49:03,718][26022] Updated weights on worker 0-0, policy_version 195206 (0.00087) [2022-07-09 09:49:05,638][26022] Updated weights on worker 0-0, policy_version 195216 (0.00072) [2022-07-09 09:49:07,126][26022] Updated weights on worker 0-0, policy_version 195226 (0.00060) [2022-07-09 09:49:08,626][25689] Fps is (10 sec: 5620.3, 60 sec: 5737.6, 300 sec: 5716.7). Total num frames: 199918592. Throughput: 0: 5988.4. Samples: 199920638. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:08,627][25689] Avg episode reward: [(0, '-47.601')] [2022-07-09 09:49:09,050][26022] Updated weights on worker 0-0, policy_version 195236 (0.00089) [2022-07-09 09:49:10,945][26022] Updated weights on worker 0-0, policy_version 195246 (0.00085) [2022-07-09 09:49:12,502][26022] Updated weights on worker 0-0, policy_version 195256 (0.00086) [2022-07-09 09:49:13,631][25689] Fps is (10 sec: 5627.1, 60 sec: 5755.5, 300 sec: 5715.7). Total num frames: 199946240. Throughput: 0: 6010.1. Samples: 199955436. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:13,631][25689] Avg episode reward: [(0, '-48.494')] [2022-07-09 09:49:14,495][26022] Updated weights on worker 0-0, policy_version 195266 (0.00086) [2022-07-09 09:49:16,039][26022] Updated weights on worker 0-0, policy_version 195276 (0.00088) [2022-07-09 09:49:18,020][26022] Updated weights on worker 0-0, policy_version 195286 (0.00086) [2022-07-09 09:49:18,698][25689] Fps is (10 sec: 5795.0, 60 sec: 5755.3, 300 sec: 5721.7). Total num frames: 199976960. Throughput: 0: 5117.8. Samples: 199972872. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:18,699][25689] Avg episode reward: [(0, '-50.353')] [2022-07-09 09:49:19,633][26022] Updated weights on worker 0-0, policy_version 195296 (0.00078) [2022-07-09 09:49:21,608][26022] Updated weights on worker 0-0, policy_version 195306 (0.00083) [2022-07-09 09:49:23,133][26022] Updated weights on worker 0-0, policy_version 195316 (0.00080) [2022-07-09 09:49:23,739][25689] Fps is (10 sec: 6078.4, 60 sec: 5770.4, 300 sec: 5725.4). Total num frames: 200007680. Throughput: 0: 5957.2. Samples: 200007594. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:23,739][25689] Avg episode reward: [(0, '-50.402')] [2022-07-09 09:49:25,155][26022] Updated weights on worker 0-0, policy_version 195326 (0.00090) [2022-07-09 09:49:26,591][26022] Updated weights on worker 0-0, policy_version 195336 (0.00083) [2022-07-09 09:49:28,750][25689] Fps is (10 sec: 5705.1, 60 sec: 5753.6, 300 sec: 5712.5). Total num frames: 200034304. Throughput: 0: 6049.9. Samples: 200042384. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:28,750][25689] Avg episode reward: [(0, '-51.180')] [2022-07-09 09:49:28,751][26022] Updated weights on worker 0-0, policy_version 195346 (0.00085) [2022-07-09 09:49:30,207][26022] Updated weights on worker 0-0, policy_version 195356 (0.00087) [2022-07-09 09:49:32,203][26022] Updated weights on worker 0-0, policy_version 195366 (0.00087) [2022-07-09 09:49:33,827][25689] Fps is (10 sec: 5582.8, 60 sec: 5747.3, 300 sec: 5723.6). Total num frames: 200064000. Throughput: 0: 5164.6. Samples: 200059740. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:33,827][25689] Avg episode reward: [(0, '-51.375')] [2022-07-09 09:49:33,904][26022] Updated weights on worker 0-0, policy_version 195376 (0.00085) [2022-07-09 09:49:35,684][26022] Updated weights on worker 0-0, policy_version 195386 (0.00086) [2022-07-09 09:49:37,318][26022] Updated weights on worker 0-0, policy_version 195396 (0.00089) [2022-07-09 09:49:38,881][25689] Fps is (10 sec: 5761.4, 60 sec: 5732.5, 300 sec: 5722.8). Total num frames: 200092672. Throughput: 0: 6021.9. Samples: 200094408. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:38,881][25689] Avg episode reward: [(0, '-51.635')] [2022-07-09 09:49:39,180][26022] Updated weights on worker 0-0, policy_version 195406 (0.00077) [2022-07-09 09:49:40,844][26022] Updated weights on worker 0-0, policy_version 195416 (0.00086) [2022-07-09 09:49:42,732][26022] Updated weights on worker 0-0, policy_version 195426 (0.00088) [2022-07-09 09:49:43,903][25689] Fps is (10 sec: 5691.2, 60 sec: 5731.6, 300 sec: 5716.5). Total num frames: 200121344. Throughput: 0: 6049.7. Samples: 200129580. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:43,903][25689] Avg episode reward: [(0, '-51.302')] [2022-07-09 09:49:44,421][26022] Updated weights on worker 0-0, policy_version 195436 (0.00085) [2022-07-09 09:49:46,163][26022] Updated weights on worker 0-0, policy_version 195446 (0.00082) [2022-07-09 09:49:48,141][26022] Updated weights on worker 0-0, policy_version 195456 (0.00082) [2022-07-09 09:49:48,931][25689] Fps is (10 sec: 5909.6, 60 sec: 5799.2, 300 sec: 5723.3). Total num frames: 200152064. Throughput: 0: 5170.6. Samples: 200146732. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:48,931][25689] Avg episode reward: [(0, '-50.024')] [2022-07-09 09:49:49,799][26022] Updated weights on worker 0-0, policy_version 195466 (0.00091) [2022-07-09 09:49:51,625][26022] Updated weights on worker 0-0, policy_version 195476 (0.00096) [2022-07-09 09:49:53,280][26022] Updated weights on worker 0-0, policy_version 195486 (0.00090) [2022-07-09 09:49:53,954][25689] Fps is (10 sec: 5908.7, 60 sec: 5746.9, 300 sec: 5721.8). Total num frames: 200180736. Throughput: 0: 6060.6. Samples: 200181724. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:53,955][25689] Avg episode reward: [(0, '-50.178')] [2022-07-09 09:49:55,150][26022] Updated weights on worker 0-0, policy_version 195496 (0.00088) [2022-07-09 09:49:56,887][26022] Updated weights on worker 0-0, policy_version 195506 (0.00086) [2022-07-09 09:49:58,575][26022] Updated weights on worker 0-0, policy_version 195516 (0.00090) [2022-07-09 09:49:59,020][25689] Fps is (10 sec: 5785.2, 60 sec: 5762.9, 300 sec: 5731.3). Total num frames: 200210432. Throughput: 0: 6058.4. Samples: 200216420. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:49:59,020][25689] Avg episode reward: [(0, '-49.583')] [2022-07-09 09:50:00,387][26022] Updated weights on worker 0-0, policy_version 195526 (0.00080) [2022-07-09 09:50:02,715][26022] Updated weights on worker 0-0, policy_version 195536 (0.00083) [2022-07-09 09:50:04,049][25689] Fps is (10 sec: 5579.4, 60 sec: 5744.8, 300 sec: 5724.7). Total num frames: 200237056. Throughput: 0: 5128.4. Samples: 200232896. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:04,049][25689] Avg episode reward: [(0, '-49.765')] [2022-07-09 09:50:04,280][26022] Updated weights on worker 0-0, policy_version 195546 (0.00093) [2022-07-09 09:50:06,125][26022] Updated weights on worker 0-0, policy_version 195556 (0.00089) [2022-07-09 09:50:07,950][26022] Updated weights on worker 0-0, policy_version 195566 (0.00086) [2022-07-09 09:50:09,050][25689] Fps is (10 sec: 5410.9, 60 sec: 5728.1, 300 sec: 5726.6). Total num frames: 200264704. Throughput: 0: 5939.8. Samples: 200266236. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:09,050][25689] Avg episode reward: [(0, '-49.517')] [2022-07-09 09:50:09,576][26022] Updated weights on worker 0-0, policy_version 195576 (0.00080) [2022-07-09 09:50:11,491][26022] Updated weights on worker 0-0, policy_version 195586 (0.00094) [2022-07-09 09:50:13,170][26022] Updated weights on worker 0-0, policy_version 195596 (0.00088) [2022-07-09 09:50:14,064][25689] Fps is (10 sec: 5725.7, 60 sec: 5761.1, 300 sec: 5729.1). Total num frames: 200294400. Throughput: 0: 5938.8. Samples: 200301150. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:14,064][25689] Avg episode reward: [(0, '-48.903')] [2022-07-09 09:50:15,087][26022] Updated weights on worker 0-0, policy_version 195606 (0.00087) [2022-07-09 09:50:16,754][26022] Updated weights on worker 0-0, policy_version 195616 (0.00086) [2022-07-09 09:50:18,407][26022] Updated weights on worker 0-0, policy_version 195626 (0.00085) [2022-07-09 09:50:19,191][25689] Fps is (10 sec: 5957.4, 60 sec: 5755.4, 300 sec: 5734.0). Total num frames: 200325120. Throughput: 0: 5041.3. Samples: 200318110. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:19,192][25689] Avg episode reward: [(0, '-49.189')] [2022-07-09 09:50:20,400][26022] Updated weights on worker 0-0, policy_version 195636 (0.00364) [2022-07-09 09:50:22,093][26022] Updated weights on worker 0-0, policy_version 195646 (0.00090) [2022-07-09 09:50:23,879][26022] Updated weights on worker 0-0, policy_version 195656 (0.00083) [2022-07-09 09:50:24,195][25689] Fps is (10 sec: 5862.6, 60 sec: 5725.0, 300 sec: 5732.1). Total num frames: 200353792. Throughput: 0: 5958.8. Samples: 200352942. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:24,195][25689] Avg episode reward: [(0, '-49.507')] [2022-07-09 09:50:25,797][26022] Updated weights on worker 0-0, policy_version 195666 (0.00095) [2022-07-09 09:50:27,379][26022] Updated weights on worker 0-0, policy_version 195676 (0.00088) [2022-07-09 09:50:29,222][25689] Fps is (10 sec: 5512.6, 60 sec: 5723.5, 300 sec: 5728.2). Total num frames: 200380416. Throughput: 0: 6006.1. Samples: 200387392. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:29,226][25689] Avg episode reward: [(0, '-48.673')] [2022-07-09 09:50:29,397][26022] Updated weights on worker 0-0, policy_version 195686 (0.00092) [2022-07-09 09:50:30,921][26022] Updated weights on worker 0-0, policy_version 195696 (0.00081) [2022-07-09 09:50:32,795][26022] Updated weights on worker 0-0, policy_version 195706 (0.00084) [2022-07-09 09:50:34,269][25689] Fps is (10 sec: 5793.5, 60 sec: 5760.2, 300 sec: 5735.2). Total num frames: 200412160. Throughput: 0: 5132.1. Samples: 200404846. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:34,270][25689] Avg episode reward: [(0, '-47.748')] [2022-07-09 09:50:34,435][26022] Updated weights on worker 0-0, policy_version 195716 (0.00053) [2022-07-09 09:50:36,208][26022] Updated weights on worker 0-0, policy_version 195726 (0.00080) [2022-07-09 09:50:37,980][26022] Updated weights on worker 0-0, policy_version 195736 (0.00090) [2022-07-09 09:50:39,375][25689] Fps is (10 sec: 6051.5, 60 sec: 5772.2, 300 sec: 5733.3). Total num frames: 200441856. Throughput: 0: 6040.8. Samples: 200440036. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:39,375][25689] Avg episode reward: [(0, '-47.450')] [2022-07-09 09:50:39,891][26022] Updated weights on worker 0-0, policy_version 195746 (0.00412) [2022-07-09 09:50:41,599][26022] Updated weights on worker 0-0, policy_version 195756 (0.00093) [2022-07-09 09:50:43,465][26022] Updated weights on worker 0-0, policy_version 195766 (0.00081) [2022-07-09 09:50:44,455][25689] Fps is (10 sec: 5730.4, 60 sec: 5766.7, 300 sec: 5735.5). Total num frames: 200470528. Throughput: 0: 6006.6. Samples: 200474638. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:44,455][25689] Avg episode reward: [(0, '-47.817')] [2022-07-09 09:50:45,085][26022] Updated weights on worker 0-0, policy_version 195776 (0.00081) [2022-07-09 09:50:45,857][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:50:45,866][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000195779_200477696.pth [2022-07-09 09:50:45,867][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000193760_198410240.pth [2022-07-09 09:50:46,827][26022] Updated weights on worker 0-0, policy_version 195786 (0.00095) [2022-07-09 09:50:48,774][26022] Updated weights on worker 0-0, policy_version 195796 (0.00087) [2022-07-09 09:50:49,517][25689] Fps is (10 sec: 5754.8, 60 sec: 5746.5, 300 sec: 5734.4). Total num frames: 200500224. Throughput: 0: 6012.7. Samples: 200509422. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:49,518][25689] Avg episode reward: [(0, '-48.269')] [2022-07-09 09:50:50,504][26022] Updated weights on worker 0-0, policy_version 195806 (0.00091) [2022-07-09 09:50:52,122][26022] Updated weights on worker 0-0, policy_version 195816 (0.00082) [2022-07-09 09:50:53,873][26022] Updated weights on worker 0-0, policy_version 195826 (0.00086) [2022-07-09 09:50:54,586][25689] Fps is (10 sec: 5761.5, 60 sec: 5742.2, 300 sec: 5737.3). Total num frames: 200528896. Throughput: 0: 5998.5. Samples: 200526714. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:54,586][25689] Avg episode reward: [(0, '-49.234')] [2022-07-09 09:50:55,595][26022] Updated weights on worker 0-0, policy_version 195836 (0.00086) [2022-07-09 09:50:57,510][26022] Updated weights on worker 0-0, policy_version 195846 (0.00091) [2022-07-09 09:50:59,285][26022] Updated weights on worker 0-0, policy_version 195856 (0.00561) [2022-07-09 09:50:59,665][25689] Fps is (10 sec: 5752.0, 60 sec: 5740.9, 300 sec: 5740.4). Total num frames: 200558592. Throughput: 0: 5982.5. Samples: 200561420. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:50:59,665][25689] Avg episode reward: [(0, '-49.127')] [2022-07-09 09:51:01,115][26022] Updated weights on worker 0-0, policy_version 195866 (0.00086) [2022-07-09 09:51:03,123][26022] Updated weights on worker 0-0, policy_version 195876 (0.00089) [2022-07-09 09:51:04,696][25689] Fps is (10 sec: 5570.6, 60 sec: 5740.7, 300 sec: 5736.7). Total num frames: 200585216. Throughput: 0: 5911.6. Samples: 200594294. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:51:04,696][25689] Avg episode reward: [(0, '-48.989')] [2022-07-09 09:51:04,923][26022] Updated weights on worker 0-0, policy_version 195886 (0.00080) [2022-07-09 09:51:06,548][26022] Updated weights on worker 0-0, policy_version 195896 (0.00085) [2022-07-09 09:51:08,502][26022] Updated weights on worker 0-0, policy_version 195906 (0.00086) [2022-07-09 09:51:09,734][25689] Fps is (10 sec: 5593.4, 60 sec: 5771.0, 300 sec: 5739.6). Total num frames: 200614912. Throughput: 0: 5070.8. Samples: 200611936. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 09:51:09,734][25689] Avg episode reward: [(0, '-49.427')] [2022-07-09 09:51:10,168][26022] Updated weights on worker 0-0, policy_version 195916 (0.00091) [2022-07-09 09:51:12,033][26022] Updated weights on worker 0-0, policy_version 195926 (0.00087) [2022-07-09 09:51:13,660][26022] Updated weights on worker 0-0, policy_version 195936 (0.00091) [2022-07-09 09:51:14,751][25689] Fps is (10 sec: 5804.9, 60 sec: 5753.9, 300 sec: 5740.6). Total num frames: 200643584. Throughput: 0: 5963.5. Samples: 200646970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:14,751][25689] Avg episode reward: [(0, '-49.351')] [2022-07-09 09:51:15,448][26022] Updated weights on worker 0-0, policy_version 195946 (0.00087) [2022-07-09 09:51:17,281][26022] Updated weights on worker 0-0, policy_version 195956 (0.00090) [2022-07-09 09:51:19,127][26022] Updated weights on worker 0-0, policy_version 195966 (0.00083) [2022-07-09 09:51:19,827][25689] Fps is (10 sec: 5783.2, 60 sec: 5741.9, 300 sec: 5746.2). Total num frames: 200673280. Throughput: 0: 5942.9. Samples: 200681240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:19,827][25689] Avg episode reward: [(0, '-49.333')] [2022-07-09 09:51:20,671][26022] Updated weights on worker 0-0, policy_version 195976 (0.00087) [2022-07-09 09:51:22,490][26022] Updated weights on worker 0-0, policy_version 195986 (0.00089) [2022-07-09 09:51:24,219][26022] Updated weights on worker 0-0, policy_version 195996 (0.00825) [2022-07-09 09:51:24,855][25689] Fps is (10 sec: 5776.7, 60 sec: 5739.5, 300 sec: 5742.3). Total num frames: 200701952. Throughput: 0: 5185.7. Samples: 200698832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:24,855][25689] Avg episode reward: [(0, '-48.942')] [2022-07-09 09:51:26,407][26022] Updated weights on worker 0-0, policy_version 196006 (0.00088) [2022-07-09 09:51:27,707][26022] Updated weights on worker 0-0, policy_version 196016 (0.00085) [2022-07-09 09:51:29,808][26022] Updated weights on worker 0-0, policy_version 196026 (0.00096) [2022-07-09 09:51:29,869][25689] Fps is (10 sec: 5710.3, 60 sec: 5774.5, 300 sec: 5742.2). Total num frames: 200730624. Throughput: 0: 6023.5. Samples: 200733220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:29,869][25689] Avg episode reward: [(0, '-48.171')] [2022-07-09 09:51:31,406][26022] Updated weights on worker 0-0, policy_version 196036 (0.00087) [2022-07-09 09:51:33,370][26022] Updated weights on worker 0-0, policy_version 196046 (0.00102) [2022-07-09 09:51:34,919][25689] Fps is (10 sec: 5799.4, 60 sec: 5740.5, 300 sec: 5745.3). Total num frames: 200760320. Throughput: 0: 5995.1. Samples: 200767882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:34,921][25689] Avg episode reward: [(0, '-48.776')] [2022-07-09 09:51:35,123][26022] Updated weights on worker 0-0, policy_version 196056 (0.00087) [2022-07-09 09:51:36,907][26022] Updated weights on worker 0-0, policy_version 196066 (0.00089) [2022-07-09 09:51:38,644][26022] Updated weights on worker 0-0, policy_version 196076 (0.00086) [2022-07-09 09:51:39,969][25689] Fps is (10 sec: 5778.9, 60 sec: 5728.9, 300 sec: 5744.6). Total num frames: 200788992. Throughput: 0: 5158.0. Samples: 200785134. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:39,970][25689] Avg episode reward: [(0, '-48.895')] [2022-07-09 09:51:40,483][26022] Updated weights on worker 0-0, policy_version 196086 (0.00089) [2022-07-09 09:51:42,143][26022] Updated weights on worker 0-0, policy_version 196096 (0.00056) [2022-07-09 09:51:43,904][26022] Updated weights on worker 0-0, policy_version 196106 (0.00090) [2022-07-09 09:51:45,055][25689] Fps is (10 sec: 5758.9, 60 sec: 5745.2, 300 sec: 5746.4). Total num frames: 200818688. Throughput: 0: 5994.8. Samples: 200819926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:45,056][25689] Avg episode reward: [(0, '-49.473')] [2022-07-09 09:51:45,588][26022] Updated weights on worker 0-0, policy_version 196116 (0.00087) [2022-07-09 09:51:47,471][26022] Updated weights on worker 0-0, policy_version 196126 (0.00101) [2022-07-09 09:51:49,454][26022] Updated weights on worker 0-0, policy_version 196136 (0.00093) [2022-07-09 09:51:50,142][25689] Fps is (10 sec: 5737.7, 60 sec: 5726.0, 300 sec: 5748.6). Total num frames: 200847360. Throughput: 0: 5970.8. Samples: 200854266. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:50,143][25689] Avg episode reward: [(0, '-50.552')] [2022-07-09 09:51:50,985][26022] Updated weights on worker 0-0, policy_version 196146 (0.00092) [2022-07-09 09:51:52,953][26022] Updated weights on worker 0-0, policy_version 196156 (0.00089) [2022-07-09 09:51:54,501][26022] Updated weights on worker 0-0, policy_version 196166 (0.00082) [2022-07-09 09:51:55,157][25689] Fps is (10 sec: 5778.0, 60 sec: 5748.0, 300 sec: 5749.6). Total num frames: 200877056. Throughput: 0: 5138.8. Samples: 200871876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:51:55,157][25689] Avg episode reward: [(0, '-50.111')] [2022-07-09 09:51:56,243][26022] Updated weights on worker 0-0, policy_version 196176 (0.00093) [2022-07-09 09:51:58,035][26022] Updated weights on worker 0-0, policy_version 196186 (0.00674) [2022-07-09 09:51:59,851][26022] Updated weights on worker 0-0, policy_version 196196 (0.00087) [2022-07-09 09:52:00,202][25689] Fps is (10 sec: 5801.7, 60 sec: 5734.2, 300 sec: 5752.6). Total num frames: 200905728. Throughput: 0: 6002.3. Samples: 200906580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:00,203][25689] Avg episode reward: [(0, '-49.852')] [2022-07-09 09:52:01,485][26022] Updated weights on worker 0-0, policy_version 196206 (0.00092) [2022-07-09 09:52:03,959][26022] Updated weights on worker 0-0, policy_version 196216 (0.00084) [2022-07-09 09:52:05,217][25689] Fps is (10 sec: 5496.1, 60 sec: 5735.7, 300 sec: 5745.5). Total num frames: 200932352. Throughput: 0: 5901.4. Samples: 200938914. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:05,218][25689] Avg episode reward: [(0, '-50.621')] [2022-07-09 09:52:05,503][26022] Updated weights on worker 0-0, policy_version 196226 (0.00083) [2022-07-09 09:52:07,453][26022] Updated weights on worker 0-0, policy_version 196236 (0.00086) [2022-07-09 09:52:09,076][26022] Updated weights on worker 0-0, policy_version 196246 (0.00347) [2022-07-09 09:52:10,260][25689] Fps is (10 sec: 5396.2, 60 sec: 5701.5, 300 sec: 5738.0). Total num frames: 200960000. Throughput: 0: 5075.6. Samples: 200956376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:10,262][25689] Avg episode reward: [(0, '-50.728')] [2022-07-09 09:52:11,059][26022] Updated weights on worker 0-0, policy_version 196256 (0.00089) [2022-07-09 09:52:12,630][26022] Updated weights on worker 0-0, policy_version 196266 (0.00084) [2022-07-09 09:52:14,419][26022] Updated weights on worker 0-0, policy_version 196276 (0.00056) [2022-07-09 09:52:15,270][25689] Fps is (10 sec: 5908.3, 60 sec: 5752.9, 300 sec: 5749.4). Total num frames: 200991744. Throughput: 0: 5936.1. Samples: 200991270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:15,270][25689] Avg episode reward: [(0, '-50.051')] [2022-07-09 09:52:16,188][26022] Updated weights on worker 0-0, policy_version 196286 (0.00090) [2022-07-09 09:52:18,061][26022] Updated weights on worker 0-0, policy_version 196296 (0.00086) [2022-07-09 09:52:19,615][26022] Updated weights on worker 0-0, policy_version 196306 (0.00090) [2022-07-09 09:52:20,359][25689] Fps is (10 sec: 5982.1, 60 sec: 5734.7, 300 sec: 5742.0). Total num frames: 201020416. Throughput: 0: 5927.4. Samples: 201026058. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:20,360][25689] Avg episode reward: [(0, '-49.455')] [2022-07-09 09:52:21,672][26022] Updated weights on worker 0-0, policy_version 196316 (0.00097) [2022-07-09 09:52:23,126][26022] Updated weights on worker 0-0, policy_version 196326 (0.00078) [2022-07-09 09:52:25,085][26022] Updated weights on worker 0-0, policy_version 196336 (0.00086) [2022-07-09 09:52:25,413][25689] Fps is (10 sec: 5653.6, 60 sec: 5732.3, 300 sec: 5745.2). Total num frames: 201049088. Throughput: 0: 5185.8. Samples: 201043642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:25,414][25689] Avg episode reward: [(0, '-50.433')] [2022-07-09 09:52:26,851][26022] Updated weights on worker 0-0, policy_version 196346 (0.00089) [2022-07-09 09:52:28,636][26022] Updated weights on worker 0-0, policy_version 196356 (0.00086) [2022-07-09 09:52:30,429][25689] Fps is (10 sec: 5694.4, 60 sec: 5732.1, 300 sec: 5745.4). Total num frames: 201077760. Throughput: 0: 6042.1. Samples: 201078244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:30,430][25689] Avg episode reward: [(0, '-50.636')] [2022-07-09 09:52:30,458][26022] Updated weights on worker 0-0, policy_version 196366 (0.00081) [2022-07-09 09:52:32,188][26022] Updated weights on worker 0-0, policy_version 196376 (0.00081) [2022-07-09 09:52:33,776][26022] Updated weights on worker 0-0, policy_version 196386 (0.00078) [2022-07-09 09:52:35,481][25689] Fps is (10 sec: 5797.5, 60 sec: 5732.0, 300 sec: 5747.1). Total num frames: 201107456. Throughput: 0: 6034.4. Samples: 201113230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:35,481][25689] Avg episode reward: [(0, '-50.579')] [2022-07-09 09:52:35,878][26022] Updated weights on worker 0-0, policy_version 196396 (0.00086) [2022-07-09 09:52:37,261][26022] Updated weights on worker 0-0, policy_version 196406 (0.00092) [2022-07-09 09:52:39,367][26022] Updated weights on worker 0-0, policy_version 196416 (0.00084) [2022-07-09 09:52:40,526][25689] Fps is (10 sec: 5983.7, 60 sec: 5766.2, 300 sec: 5750.6). Total num frames: 201138176. Throughput: 0: 5181.8. Samples: 201130560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:40,527][25689] Avg episode reward: [(0, '-50.781')] [2022-07-09 09:52:40,832][26022] Updated weights on worker 0-0, policy_version 196426 (0.00084) [2022-07-09 09:52:42,807][26022] Updated weights on worker 0-0, policy_version 196436 (0.00081) [2022-07-09 09:52:44,308][26022] Updated weights on worker 0-0, policy_version 196446 (0.00055) [2022-07-09 09:52:45,531][25689] Fps is (10 sec: 5807.7, 60 sec: 5740.1, 300 sec: 5751.4). Total num frames: 201165824. Throughput: 0: 6064.1. Samples: 201165640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:45,531][25689] Avg episode reward: [(0, '-50.194')] [2022-07-09 09:52:45,882][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:52:45,894][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000196453_201167872.pth [2022-07-09 09:52:45,895][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000194430_199096320.pth [2022-07-09 09:52:46,292][26022] Updated weights on worker 0-0, policy_version 196456 (0.00089) [2022-07-09 09:52:47,985][26022] Updated weights on worker 0-0, policy_version 196466 (0.00092) [2022-07-09 09:52:49,757][26022] Updated weights on worker 0-0, policy_version 196476 (0.00077) [2022-07-09 09:52:50,543][25689] Fps is (10 sec: 5725.0, 60 sec: 5764.2, 300 sec: 5748.2). Total num frames: 201195520. Throughput: 0: 6062.5. Samples: 201200180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:50,543][25689] Avg episode reward: [(0, '-51.229')] [2022-07-09 09:52:51,666][26022] Updated weights on worker 0-0, policy_version 196486 (0.00089) [2022-07-09 09:52:53,470][26022] Updated weights on worker 0-0, policy_version 196496 (0.00082) [2022-07-09 09:52:55,142][26022] Updated weights on worker 0-0, policy_version 196506 (0.00086) [2022-07-09 09:52:55,547][25689] Fps is (10 sec: 5827.3, 60 sec: 5748.2, 300 sec: 5745.9). Total num frames: 201224192. Throughput: 0: 5194.8. Samples: 201217472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:52:55,548][25689] Avg episode reward: [(0, '-50.741')] [2022-07-09 09:52:56,988][26022] Updated weights on worker 0-0, policy_version 196516 (0.00084) [2022-07-09 09:52:58,606][26022] Updated weights on worker 0-0, policy_version 196526 (0.00086) [2022-07-09 09:53:00,663][25689] Fps is (10 sec: 5565.0, 60 sec: 5724.6, 300 sec: 5747.4). Total num frames: 201251840. Throughput: 0: 6047.6. Samples: 201252340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:53:00,663][25689] Avg episode reward: [(0, '-50.552')] [2022-07-09 09:53:00,707][26022] Updated weights on worker 0-0, policy_version 196536 (0.00094) [2022-07-09 09:53:02,349][26022] Updated weights on worker 0-0, policy_version 196546 (0.00085) [2022-07-09 09:53:04,395][26022] Updated weights on worker 0-0, policy_version 196556 (0.00094) [2022-07-09 09:53:05,699][25689] Fps is (10 sec: 5648.3, 60 sec: 5773.4, 300 sec: 5746.7). Total num frames: 201281536. Throughput: 0: 5940.9. Samples: 201285460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:53:05,700][25689] Avg episode reward: [(0, '-49.764')] [2022-07-09 09:53:06,129][26022] Updated weights on worker 0-0, policy_version 196566 (0.00088) [2022-07-09 09:53:07,773][26022] Updated weights on worker 0-0, policy_version 196576 (0.00086) [2022-07-09 09:53:09,583][26022] Updated weights on worker 0-0, policy_version 196586 (0.00089) [2022-07-09 09:53:10,723][25689] Fps is (10 sec: 5801.9, 60 sec: 5792.1, 300 sec: 5753.4). Total num frames: 201310208. Throughput: 0: 5092.4. Samples: 201302948. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:53:10,724][25689] Avg episode reward: [(0, '-49.827')] [2022-07-09 09:53:11,346][26022] Updated weights on worker 0-0, policy_version 196596 (0.00084) [2022-07-09 09:53:13,221][26022] Updated weights on worker 0-0, policy_version 196606 (0.00082) [2022-07-09 09:53:14,803][26022] Updated weights on worker 0-0, policy_version 196616 (0.00084) [2022-07-09 09:53:15,746][25689] Fps is (10 sec: 5707.8, 60 sec: 5740.0, 300 sec: 5747.4). Total num frames: 201338880. Throughput: 0: 5979.4. Samples: 201338250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:53:15,747][25689] Avg episode reward: [(0, '-49.475')] [2022-07-09 09:53:16,434][26022] Updated weights on worker 0-0, policy_version 196626 (0.00085) [2022-07-09 09:53:18,542][26022] Updated weights on worker 0-0, policy_version 196636 (0.00086) [2022-07-09 09:53:19,817][26022] Updated weights on worker 0-0, policy_version 196646 (0.00088) [2022-07-09 09:53:20,782][25689] Fps is (10 sec: 5802.7, 60 sec: 5762.1, 300 sec: 5747.1). Total num frames: 201368576. Throughput: 0: 6020.9. Samples: 201373474. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 09:53:20,782][25689] Avg episode reward: [(0, '-48.818')] [2022-07-09 09:53:21,981][26022] Updated weights on worker 0-0, policy_version 196656 (0.00087) [2022-07-09 09:53:23,531][26022] Updated weights on worker 0-0, policy_version 196666 (0.00085) [2022-07-09 09:53:25,395][26022] Updated weights on worker 0-0, policy_version 196676 (0.00085) [2022-07-09 09:53:25,798][25689] Fps is (10 sec: 5908.6, 60 sec: 5782.6, 300 sec: 5753.9). Total num frames: 201398272. Throughput: 0: 5253.4. Samples: 201391044. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:25,798][25689] Avg episode reward: [(0, '-49.322')] [2022-07-09 09:53:27,208][26022] Updated weights on worker 0-0, policy_version 196686 (0.00084) [2022-07-09 09:53:28,947][26022] Updated weights on worker 0-0, policy_version 196696 (0.00093) [2022-07-09 09:53:30,559][26022] Updated weights on worker 0-0, policy_version 196706 (0.00094) [2022-07-09 09:53:30,807][25689] Fps is (10 sec: 5924.1, 60 sec: 5800.3, 300 sec: 5753.9). Total num frames: 201427968. Throughput: 0: 6118.8. Samples: 201425840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:30,808][25689] Avg episode reward: [(0, '-50.084')] [2022-07-09 09:53:32,416][26022] Updated weights on worker 0-0, policy_version 196716 (0.00096) [2022-07-09 09:53:33,965][26022] Updated weights on worker 0-0, policy_version 196726 (0.00086) [2022-07-09 09:53:35,820][25689] Fps is (10 sec: 5824.0, 60 sec: 5787.0, 300 sec: 5751.7). Total num frames: 201456640. Throughput: 0: 6104.4. Samples: 201460788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:35,820][25689] Avg episode reward: [(0, '-50.299')] [2022-07-09 09:53:36,027][26022] Updated weights on worker 0-0, policy_version 196736 (0.00086) [2022-07-09 09:53:37,620][26022] Updated weights on worker 0-0, policy_version 196746 (0.00082) [2022-07-09 09:53:39,483][26022] Updated weights on worker 0-0, policy_version 196756 (0.00085) [2022-07-09 09:53:40,875][25689] Fps is (10 sec: 5797.8, 60 sec: 5769.2, 300 sec: 5754.3). Total num frames: 201486336. Throughput: 0: 6067.5. Samples: 201495386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:40,875][25689] Avg episode reward: [(0, '-50.146')] [2022-07-09 09:53:41,070][26022] Updated weights on worker 0-0, policy_version 196766 (0.00097) [2022-07-09 09:53:43,162][26022] Updated weights on worker 0-0, policy_version 196776 (0.00093) [2022-07-09 09:53:44,531][26022] Updated weights on worker 0-0, policy_version 196786 (0.00095) [2022-07-09 09:53:45,876][25689] Fps is (10 sec: 5702.3, 60 sec: 5769.5, 300 sec: 5758.3). Total num frames: 201513984. Throughput: 0: 6073.3. Samples: 201512986. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:45,876][25689] Avg episode reward: [(0, '-50.496')] [2022-07-09 09:53:46,691][26022] Updated weights on worker 0-0, policy_version 196796 (0.00089) [2022-07-09 09:53:48,130][26022] Updated weights on worker 0-0, policy_version 196806 (0.00092) [2022-07-09 09:53:50,231][26022] Updated weights on worker 0-0, policy_version 196816 (0.00094) [2022-07-09 09:53:50,881][25689] Fps is (10 sec: 5833.1, 60 sec: 5787.1, 300 sec: 5754.9). Total num frames: 201544704. Throughput: 0: 6074.8. Samples: 201547784. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:50,882][25689] Avg episode reward: [(0, '-50.266')] [2022-07-09 09:53:52,064][26022] Updated weights on worker 0-0, policy_version 196826 (0.00090) [2022-07-09 09:53:53,770][26022] Updated weights on worker 0-0, policy_version 196836 (0.00089) [2022-07-09 09:53:55,461][26022] Updated weights on worker 0-0, policy_version 196846 (0.00087) [2022-07-09 09:53:55,894][25689] Fps is (10 sec: 5826.4, 60 sec: 5769.3, 300 sec: 5752.3). Total num frames: 201572352. Throughput: 0: 5173.7. Samples: 201564646. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:53:55,894][25689] Avg episode reward: [(0, '-50.623')] [2022-07-09 09:53:57,323][26022] Updated weights on worker 0-0, policy_version 196856 (0.00086) [2022-07-09 09:53:58,911][26022] Updated weights on worker 0-0, policy_version 196866 (0.00079) [2022-07-09 09:54:00,933][25689] Fps is (10 sec: 5500.8, 60 sec: 5776.6, 300 sec: 5751.8). Total num frames: 201600000. Throughput: 0: 5194.1. Samples: 201599572. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:00,934][25689] Avg episode reward: [(0, '-49.738')] [2022-07-09 09:54:01,015][26022] Updated weights on worker 0-0, policy_version 196876 (0.00113) [2022-07-09 09:54:02,884][26022] Updated weights on worker 0-0, policy_version 196886 (0.00084) [2022-07-09 09:54:04,802][26022] Updated weights on worker 0-0, policy_version 196896 (0.00091) [2022-07-09 09:54:05,955][25689] Fps is (10 sec: 5597.9, 60 sec: 5761.1, 300 sec: 5751.5). Total num frames: 201628672. Throughput: 0: 5929.6. Samples: 201632044. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:05,955][25689] Avg episode reward: [(0, '-49.535')] [2022-07-09 09:54:06,342][26022] Updated weights on worker 0-0, policy_version 196906 (0.00082) [2022-07-09 09:54:08,402][26022] Updated weights on worker 0-0, policy_version 196916 (0.00092) [2022-07-09 09:54:09,988][26022] Updated weights on worker 0-0, policy_version 196926 (0.00084) [2022-07-09 09:54:10,959][25689] Fps is (10 sec: 5617.7, 60 sec: 5746.0, 300 sec: 5751.5). Total num frames: 201656320. Throughput: 0: 5928.5. Samples: 201666814. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:10,961][25689] Avg episode reward: [(0, '-49.617')] [2022-07-09 09:54:11,905][26022] Updated weights on worker 0-0, policy_version 196936 (0.00075) [2022-07-09 09:54:13,475][26022] Updated weights on worker 0-0, policy_version 196946 (0.00079) [2022-07-09 09:54:15,364][26022] Updated weights on worker 0-0, policy_version 196956 (0.00086) [2022-07-09 09:54:15,967][25689] Fps is (10 sec: 5931.9, 60 sec: 5798.4, 300 sec: 5756.1). Total num frames: 201688064. Throughput: 0: 5962.6. Samples: 201684334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:15,968][25689] Avg episode reward: [(0, '-49.937')] [2022-07-09 09:54:17,012][26022] Updated weights on worker 0-0, policy_version 196966 (0.00085) [2022-07-09 09:54:18,798][26022] Updated weights on worker 0-0, policy_version 196976 (0.00086) [2022-07-09 09:54:20,391][26022] Updated weights on worker 0-0, policy_version 196986 (0.00088) [2022-07-09 09:54:21,038][25689] Fps is (10 sec: 5994.4, 60 sec: 5778.1, 300 sec: 5748.6). Total num frames: 201716736. Throughput: 0: 5957.9. Samples: 201719350. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:21,038][25689] Avg episode reward: [(0, '-50.255')] [2022-07-09 09:54:22,438][26022] Updated weights on worker 0-0, policy_version 196996 (0.00811) [2022-07-09 09:54:23,957][26022] Updated weights on worker 0-0, policy_version 197006 (0.00091) [2022-07-09 09:54:26,059][25689] Fps is (10 sec: 5479.0, 60 sec: 5726.5, 300 sec: 5748.4). Total num frames: 201743360. Throughput: 0: 6061.4. Samples: 201753908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:26,060][25689] Avg episode reward: [(0, '-50.135')] [2022-07-09 09:54:26,170][26022] Updated weights on worker 0-0, policy_version 197016 (0.00094) [2022-07-09 09:54:27,525][26022] Updated weights on worker 0-0, policy_version 197026 (0.00086) [2022-07-09 09:54:29,661][26022] Updated weights on worker 0-0, policy_version 197036 (0.00084) [2022-07-09 09:54:31,075][25689] Fps is (10 sec: 5713.2, 60 sec: 5743.0, 300 sec: 5753.0). Total num frames: 201774080. Throughput: 0: 5175.6. Samples: 201770926. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:31,075][25689] Avg episode reward: [(0, '-50.546')] [2022-07-09 09:54:31,127][26022] Updated weights on worker 0-0, policy_version 197046 (0.01303) [2022-07-09 09:54:33,227][26022] Updated weights on worker 0-0, policy_version 197056 (0.00088) [2022-07-09 09:54:34,826][26022] Updated weights on worker 0-0, policy_version 197066 (0.00087) [2022-07-09 09:54:36,091][25689] Fps is (10 sec: 5818.6, 60 sec: 5725.6, 300 sec: 5750.3). Total num frames: 201801728. Throughput: 0: 6016.4. Samples: 201805406. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:36,091][25689] Avg episode reward: [(0, '-50.635')] [2022-07-09 09:54:36,731][26022] Updated weights on worker 0-0, policy_version 197076 (0.00088) [2022-07-09 09:54:38,352][26022] Updated weights on worker 0-0, policy_version 197086 (0.00090) [2022-07-09 09:54:40,239][26022] Updated weights on worker 0-0, policy_version 197096 (0.00077) [2022-07-09 09:54:41,159][25689] Fps is (10 sec: 5686.6, 60 sec: 5724.4, 300 sec: 5752.9). Total num frames: 201831424. Throughput: 0: 5998.9. Samples: 201840054. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:41,159][25689] Avg episode reward: [(0, '-51.104')] [2022-07-09 09:54:41,841][26022] Updated weights on worker 0-0, policy_version 197106 (0.00332) [2022-07-09 09:54:43,715][26022] Updated weights on worker 0-0, policy_version 197116 (0.00087) [2022-07-09 09:54:45,408][26022] Updated weights on worker 0-0, policy_version 197126 (0.00095) [2022-07-09 09:54:45,951][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:54:45,972][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000197129_201860096.pth [2022-07-09 09:54:45,973][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000195103_199785472.pth [2022-07-09 09:54:46,177][25689] Fps is (10 sec: 5787.1, 60 sec: 5739.8, 300 sec: 5746.2). Total num frames: 201860096. Throughput: 0: 5145.4. Samples: 201857418. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:46,177][25689] Avg episode reward: [(0, '-50.823')] [2022-07-09 09:54:47,315][26022] Updated weights on worker 0-0, policy_version 197136 (0.00087) [2022-07-09 09:54:48,911][26022] Updated weights on worker 0-0, policy_version 197146 (0.00092) [2022-07-09 09:54:50,762][26022] Updated weights on worker 0-0, policy_version 197156 (0.00079) [2022-07-09 09:54:51,184][25689] Fps is (10 sec: 5822.2, 60 sec: 5722.6, 300 sec: 5750.0). Total num frames: 201889792. Throughput: 0: 6034.9. Samples: 201892284. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:51,184][25689] Avg episode reward: [(0, '-51.297')] [2022-07-09 09:54:52,548][26022] Updated weights on worker 0-0, policy_version 197166 (0.00093) [2022-07-09 09:54:54,294][26022] Updated weights on worker 0-0, policy_version 197176 (0.00084) [2022-07-09 09:54:56,221][25689] Fps is (10 sec: 5709.3, 60 sec: 5720.3, 300 sec: 5743.6). Total num frames: 201917440. Throughput: 0: 6054.5. Samples: 201927282. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:54:56,221][25689] Avg episode reward: [(0, '-51.241')] [2022-07-09 09:54:56,230][26022] Updated weights on worker 0-0, policy_version 197186 (0.00086) [2022-07-09 09:54:57,843][26022] Updated weights on worker 0-0, policy_version 197196 (0.00089) [2022-07-09 09:54:59,635][26022] Updated weights on worker 0-0, policy_version 197206 (0.00090) [2022-07-09 09:55:01,291][25689] Fps is (10 sec: 5775.0, 60 sec: 5768.3, 300 sec: 5756.6). Total num frames: 201948160. Throughput: 0: 5189.9. Samples: 201944538. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:01,291][25689] Avg episode reward: [(0, '-50.892')] [2022-07-09 09:55:01,463][26022] Updated weights on worker 0-0, policy_version 197216 (0.00088) [2022-07-09 09:55:03,607][26022] Updated weights on worker 0-0, policy_version 197226 (0.00086) [2022-07-09 09:55:05,437][26022] Updated weights on worker 0-0, policy_version 197236 (0.00085) [2022-07-09 09:55:06,348][25689] Fps is (10 sec: 5763.5, 60 sec: 5747.9, 300 sec: 5755.5). Total num frames: 201975808. Throughput: 0: 5930.3. Samples: 201977038. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:06,348][25689] Avg episode reward: [(0, '-50.406')] [2022-07-09 09:55:07,000][26022] Updated weights on worker 0-0, policy_version 197246 (0.00121) [2022-07-09 09:55:09,084][26022] Updated weights on worker 0-0, policy_version 197256 (0.00087) [2022-07-09 09:55:10,703][26022] Updated weights on worker 0-0, policy_version 197266 (0.00092) [2022-07-09 09:55:11,401][25689] Fps is (10 sec: 5570.4, 60 sec: 5760.2, 300 sec: 5751.3). Total num frames: 202004480. Throughput: 0: 5909.5. Samples: 202011760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:11,403][25689] Avg episode reward: [(0, '-49.818')] [2022-07-09 09:55:12,452][26022] Updated weights on worker 0-0, policy_version 197276 (0.00079) [2022-07-09 09:55:14,139][26022] Updated weights on worker 0-0, policy_version 197286 (0.00091) [2022-07-09 09:55:15,948][26022] Updated weights on worker 0-0, policy_version 197296 (0.00098) [2022-07-09 09:55:16,419][25689] Fps is (10 sec: 5693.9, 60 sec: 5708.5, 300 sec: 5746.5). Total num frames: 202033152. Throughput: 0: 5043.3. Samples: 202029148. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:16,420][25689] Avg episode reward: [(0, '-50.280')] [2022-07-09 09:55:17,758][26022] Updated weights on worker 0-0, policy_version 197306 (0.00088) [2022-07-09 09:55:19,520][26022] Updated weights on worker 0-0, policy_version 197316 (0.00084) [2022-07-09 09:55:21,259][26022] Updated weights on worker 0-0, policy_version 197326 (0.00086) [2022-07-09 09:55:21,520][25689] Fps is (10 sec: 5768.5, 60 sec: 5722.5, 300 sec: 5748.1). Total num frames: 202062848. Throughput: 0: 5888.2. Samples: 202063650. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:21,520][25689] Avg episode reward: [(0, '-49.496')] [2022-07-09 09:55:23,016][26022] Updated weights on worker 0-0, policy_version 197336 (0.00083) [2022-07-09 09:55:24,973][26022] Updated weights on worker 0-0, policy_version 197346 (0.00082) [2022-07-09 09:55:26,565][25689] Fps is (10 sec: 5853.8, 60 sec: 5771.1, 300 sec: 5758.1). Total num frames: 202092544. Throughput: 0: 5992.2. Samples: 202098182. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:26,565][25689] Avg episode reward: [(0, '-49.672')] [2022-07-09 09:55:26,570][26022] Updated weights on worker 0-0, policy_version 197356 (0.00078) [2022-07-09 09:55:28,447][26022] Updated weights on worker 0-0, policy_version 197366 (0.00089) [2022-07-09 09:55:30,175][26022] Updated weights on worker 0-0, policy_version 197376 (0.00086) [2022-07-09 09:55:31,584][25689] Fps is (10 sec: 5697.8, 60 sec: 5720.0, 300 sec: 5744.9). Total num frames: 202120192. Throughput: 0: 5137.7. Samples: 202115452. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:31,585][25689] Avg episode reward: [(0, '-48.772')] [2022-07-09 09:55:31,987][26022] Updated weights on worker 0-0, policy_version 197386 (0.00088) [2022-07-09 09:55:33,865][26022] Updated weights on worker 0-0, policy_version 197396 (0.00102) [2022-07-09 09:55:35,288][26022] Updated weights on worker 0-0, policy_version 197406 (0.00087) [2022-07-09 09:55:36,590][25689] Fps is (10 sec: 5720.0, 60 sec: 5754.8, 300 sec: 5746.8). Total num frames: 202149888. Throughput: 0: 6012.0. Samples: 202150416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 09:55:36,592][25689] Avg episode reward: [(0, '-48.958')] [2022-07-09 09:55:37,337][26022] Updated weights on worker 0-0, policy_version 197416 (0.00086) [2022-07-09 09:55:39,183][26022] Updated weights on worker 0-0, policy_version 197426 (0.00084) [2022-07-09 09:55:40,755][26022] Updated weights on worker 0-0, policy_version 197436 (0.00480) [2022-07-09 09:55:41,699][25689] Fps is (10 sec: 5770.3, 60 sec: 5733.9, 300 sec: 5746.2). Total num frames: 202178560. Throughput: 0: 6030.8. Samples: 202185348. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:55:41,700][25689] Avg episode reward: [(0, '-48.780')] [2022-07-09 09:55:42,591][26022] Updated weights on worker 0-0, policy_version 197446 (0.00087) [2022-07-09 09:55:44,285][26022] Updated weights on worker 0-0, policy_version 197456 (0.00096) [2022-07-09 09:55:46,083][26022] Updated weights on worker 0-0, policy_version 197466 (0.00088) [2022-07-09 09:55:46,708][25689] Fps is (10 sec: 5971.3, 60 sec: 5785.6, 300 sec: 5754.2). Total num frames: 202210304. Throughput: 0: 5196.4. Samples: 202202852. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:55:46,708][25689] Avg episode reward: [(0, '-49.136')] [2022-07-09 09:55:48,011][26022] Updated weights on worker 0-0, policy_version 197476 (0.00092) [2022-07-09 09:55:49,464][26022] Updated weights on worker 0-0, policy_version 197486 (0.00084) [2022-07-09 09:55:51,503][26022] Updated weights on worker 0-0, policy_version 197496 (0.00092) [2022-07-09 09:55:51,719][25689] Fps is (10 sec: 5723.2, 60 sec: 5717.5, 300 sec: 5744.9). Total num frames: 202235904. Throughput: 0: 6057.7. Samples: 202237422. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:55:51,719][25689] Avg episode reward: [(0, '-49.636')] [2022-07-09 09:55:53,227][26022] Updated weights on worker 0-0, policy_version 197506 (0.00087) [2022-07-09 09:55:55,119][26022] Updated weights on worker 0-0, policy_version 197516 (0.00084) [2022-07-09 09:55:56,773][25689] Fps is (10 sec: 5493.5, 60 sec: 5749.7, 300 sec: 5745.4). Total num frames: 202265600. Throughput: 0: 6036.8. Samples: 202272260. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:55:56,774][25689] Avg episode reward: [(0, '-50.611')] [2022-07-09 09:55:56,795][26022] Updated weights on worker 0-0, policy_version 197526 (0.00080) [2022-07-09 09:55:58,671][26022] Updated weights on worker 0-0, policy_version 197536 (0.00086) [2022-07-09 09:56:00,200][26022] Updated weights on worker 0-0, policy_version 197546 (0.00213) [2022-07-09 09:56:01,892][25689] Fps is (10 sec: 5838.3, 60 sec: 5728.2, 300 sec: 5754.0). Total num frames: 202295296. Throughput: 0: 5170.9. Samples: 202289762. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:01,892][25689] Avg episode reward: [(0, '-50.361')] [2022-07-09 09:56:02,576][26022] Updated weights on worker 0-0, policy_version 197556 (0.00087) [2022-07-09 09:56:04,093][26022] Updated weights on worker 0-0, policy_version 197566 (0.00091) [2022-07-09 09:56:05,941][26022] Updated weights on worker 0-0, policy_version 197576 (0.00090) [2022-07-09 09:56:06,937][25689] Fps is (10 sec: 5642.2, 60 sec: 5729.4, 300 sec: 5747.0). Total num frames: 202322944. Throughput: 0: 5907.9. Samples: 202322364. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:06,937][25689] Avg episode reward: [(0, '-49.705')] [2022-07-09 09:56:07,651][26022] Updated weights on worker 0-0, policy_version 197586 (0.00091) [2022-07-09 09:56:09,353][26022] Updated weights on worker 0-0, policy_version 197596 (0.00090) [2022-07-09 09:56:11,100][26022] Updated weights on worker 0-0, policy_version 197606 (0.00086) [2022-07-09 09:56:11,938][25689] Fps is (10 sec: 5708.1, 60 sec: 5751.3, 300 sec: 5750.8). Total num frames: 202352640. Throughput: 0: 5930.3. Samples: 202357326. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:11,938][25689] Avg episode reward: [(0, '-50.015')] [2022-07-09 09:56:12,705][26022] Updated weights on worker 0-0, policy_version 197616 (0.00088) [2022-07-09 09:56:14,716][26022] Updated weights on worker 0-0, policy_version 197626 (0.00084) [2022-07-09 09:56:16,638][26022] Updated weights on worker 0-0, policy_version 197636 (0.00086) [2022-07-09 09:56:16,953][25689] Fps is (10 sec: 5724.9, 60 sec: 5734.5, 300 sec: 5745.0). Total num frames: 202380288. Throughput: 0: 5084.5. Samples: 202374866. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:16,954][25689] Avg episode reward: [(0, '-49.038')] [2022-07-09 09:56:18,204][26022] Updated weights on worker 0-0, policy_version 197646 (0.00083) [2022-07-09 09:56:20,247][26022] Updated weights on worker 0-0, policy_version 197656 (0.00084) [2022-07-09 09:56:21,898][26022] Updated weights on worker 0-0, policy_version 197666 (0.00085) [2022-07-09 09:56:21,996][25689] Fps is (10 sec: 5701.0, 60 sec: 5740.0, 300 sec: 5748.2). Total num frames: 202409984. Throughput: 0: 5965.2. Samples: 202409690. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:21,997][25689] Avg episode reward: [(0, '-48.023')] [2022-07-09 09:56:23,701][26022] Updated weights on worker 0-0, policy_version 197676 (0.00086) [2022-07-09 09:56:25,467][26022] Updated weights on worker 0-0, policy_version 197686 (0.00086) [2022-07-09 09:56:27,012][25689] Fps is (10 sec: 5802.8, 60 sec: 5725.9, 300 sec: 5748.2). Total num frames: 202438656. Throughput: 0: 6037.3. Samples: 202443564. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:27,013][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 09:56:27,521][26022] Updated weights on worker 0-0, policy_version 197696 (0.00082) [2022-07-09 09:56:29,064][26022] Updated weights on worker 0-0, policy_version 197706 (0.00098) [2022-07-09 09:56:31,078][26022] Updated weights on worker 0-0, policy_version 197716 (0.00085) [2022-07-09 09:56:32,089][25689] Fps is (10 sec: 5682.0, 60 sec: 5737.4, 300 sec: 5744.2). Total num frames: 202467328. Throughput: 0: 5131.0. Samples: 202460720. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:32,089][25689] Avg episode reward: [(0, '-47.366')] [2022-07-09 09:56:32,349][26022] Updated weights on worker 0-0, policy_version 197726 (0.00084) [2022-07-09 09:56:34,501][26022] Updated weights on worker 0-0, policy_version 197736 (0.00083) [2022-07-09 09:56:36,013][26022] Updated weights on worker 0-0, policy_version 197746 (0.00627) [2022-07-09 09:56:37,123][25689] Fps is (10 sec: 5671.8, 60 sec: 5717.8, 300 sec: 5744.5). Total num frames: 202496000. Throughput: 0: 5984.9. Samples: 202495576. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:37,123][25689] Avg episode reward: [(0, '-47.616')] [2022-07-09 09:56:37,971][26022] Updated weights on worker 0-0, policy_version 197756 (0.00098) [2022-07-09 09:56:39,781][26022] Updated weights on worker 0-0, policy_version 197766 (0.00083) [2022-07-09 09:56:41,631][26022] Updated weights on worker 0-0, policy_version 197776 (0.00089) [2022-07-09 09:56:42,188][25689] Fps is (10 sec: 5779.3, 60 sec: 5738.8, 300 sec: 5744.9). Total num frames: 202525696. Throughput: 0: 5957.9. Samples: 202529990. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:42,189][25689] Avg episode reward: [(0, '-48.070')] [2022-07-09 09:56:43,277][26022] Updated weights on worker 0-0, policy_version 197786 (0.00091) [2022-07-09 09:56:45,244][26022] Updated weights on worker 0-0, policy_version 197796 (0.00091) [2022-07-09 09:56:45,985][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:56:45,997][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000197802_202549248.pth [2022-07-09 09:56:45,998][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000195779_200477696.pth [2022-07-09 09:56:46,847][26022] Updated weights on worker 0-0, policy_version 197806 (0.00068) [2022-07-09 09:56:47,193][25689] Fps is (10 sec: 5795.8, 60 sec: 5688.4, 300 sec: 5746.5). Total num frames: 202554368. Throughput: 0: 5984.4. Samples: 202564336. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:47,194][25689] Avg episode reward: [(0, '-48.218')] [2022-07-09 09:56:48,774][26022] Updated weights on worker 0-0, policy_version 197816 (0.00091) [2022-07-09 09:56:50,444][26022] Updated weights on worker 0-0, policy_version 197826 (0.00083) [2022-07-09 09:56:52,080][26022] Updated weights on worker 0-0, policy_version 197836 (0.00086) [2022-07-09 09:56:52,195][25689] Fps is (10 sec: 5832.7, 60 sec: 5757.0, 300 sec: 5746.8). Total num frames: 202584064. Throughput: 0: 6012.5. Samples: 202581610. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:52,196][25689] Avg episode reward: [(0, '-47.892')] [2022-07-09 09:56:53,981][26022] Updated weights on worker 0-0, policy_version 197846 (0.00093) [2022-07-09 09:56:55,812][26022] Updated weights on worker 0-0, policy_version 197856 (0.00089) [2022-07-09 09:56:57,196][25689] Fps is (10 sec: 5732.6, 60 sec: 5728.2, 300 sec: 5744.2). Total num frames: 202611712. Throughput: 0: 6009.0. Samples: 202616200. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:56:57,197][25689] Avg episode reward: [(0, '-48.022')] [2022-07-09 09:56:57,453][26022] Updated weights on worker 0-0, policy_version 197866 (0.00086) [2022-07-09 09:56:59,562][26022] Updated weights on worker 0-0, policy_version 197876 (0.00092) [2022-07-09 09:57:01,054][26022] Updated weights on worker 0-0, policy_version 197886 (0.00085) [2022-07-09 09:57:02,338][25689] Fps is (10 sec: 5351.0, 60 sec: 5675.2, 300 sec: 5741.7). Total num frames: 202638336. Throughput: 0: 5889.6. Samples: 202648662. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:02,338][25689] Avg episode reward: [(0, '-47.426')] [2022-07-09 09:57:03,292][26022] Updated weights on worker 0-0, policy_version 197896 (0.00080) [2022-07-09 09:57:05,107][26022] Updated weights on worker 0-0, policy_version 197906 (0.00085) [2022-07-09 09:57:06,755][26022] Updated weights on worker 0-0, policy_version 197916 (0.00088) [2022-07-09 09:57:07,373][25689] Fps is (10 sec: 5634.8, 60 sec: 5726.9, 300 sec: 5752.2). Total num frames: 202669056. Throughput: 0: 5023.9. Samples: 202665718. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:07,374][25689] Avg episode reward: [(0, '-47.634')] [2022-07-09 09:57:08,847][26022] Updated weights on worker 0-0, policy_version 197926 (0.00082) [2022-07-09 09:57:10,115][26022] Updated weights on worker 0-0, policy_version 197936 (0.00106) [2022-07-09 09:57:12,219][26022] Updated weights on worker 0-0, policy_version 197946 (0.00089) [2022-07-09 09:57:12,428][25689] Fps is (10 sec: 5987.6, 60 sec: 5721.8, 300 sec: 5744.4). Total num frames: 202698752. Throughput: 0: 5885.2. Samples: 202700682. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:12,428][25689] Avg episode reward: [(0, '-47.806')] [2022-07-09 09:57:14,215][26022] Updated weights on worker 0-0, policy_version 197956 (0.00081) [2022-07-09 09:57:15,470][26022] Updated weights on worker 0-0, policy_version 197966 (0.00840) [2022-07-09 09:57:17,525][25689] Fps is (10 sec: 5547.9, 60 sec: 5697.2, 300 sec: 5737.4). Total num frames: 202725376. Throughput: 0: 5863.3. Samples: 202735390. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:17,525][25689] Avg episode reward: [(0, '-48.833')] [2022-07-09 09:57:17,795][26022] Updated weights on worker 0-0, policy_version 197976 (0.00087) [2022-07-09 09:57:18,995][26022] Updated weights on worker 0-0, policy_version 197986 (0.00084) [2022-07-09 09:57:21,199][26022] Updated weights on worker 0-0, policy_version 197996 (0.00094) [2022-07-09 09:57:22,629][25689] Fps is (10 sec: 5721.5, 60 sec: 5725.2, 300 sec: 5746.8). Total num frames: 202757120. Throughput: 0: 5132.0. Samples: 202752800. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:22,631][25689] Avg episode reward: [(0, '-49.104')] [2022-07-09 09:57:22,759][26022] Updated weights on worker 0-0, policy_version 198006 (0.00087) [2022-07-09 09:57:24,689][26022] Updated weights on worker 0-0, policy_version 198016 (0.00094) [2022-07-09 09:57:26,410][26022] Updated weights on worker 0-0, policy_version 198026 (0.00099) [2022-07-09 09:57:27,642][25689] Fps is (10 sec: 5971.5, 60 sec: 5725.5, 300 sec: 5746.8). Total num frames: 202785792. Throughput: 0: 5993.8. Samples: 202787206. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:27,643][25689] Avg episode reward: [(0, '-49.074')] [2022-07-09 09:57:28,313][26022] Updated weights on worker 0-0, policy_version 198036 (0.00085) [2022-07-09 09:57:29,731][26022] Updated weights on worker 0-0, policy_version 198046 (0.00078) [2022-07-09 09:57:31,939][26022] Updated weights on worker 0-0, policy_version 198056 (0.00085) [2022-07-09 09:57:32,654][25689] Fps is (10 sec: 5720.3, 60 sec: 5731.6, 300 sec: 5744.1). Total num frames: 202814464. Throughput: 0: 5979.0. Samples: 202821614. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:32,655][25689] Avg episode reward: [(0, '-48.960')] [2022-07-09 09:57:33,342][26022] Updated weights on worker 0-0, policy_version 198066 (0.00088) [2022-07-09 09:57:35,267][26022] Updated weights on worker 0-0, policy_version 198076 (0.00090) [2022-07-09 09:57:37,112][26022] Updated weights on worker 0-0, policy_version 198086 (0.00091) [2022-07-09 09:57:37,707][25689] Fps is (10 sec: 5595.8, 60 sec: 5712.9, 300 sec: 5733.7). Total num frames: 202842112. Throughput: 0: 5139.2. Samples: 202839110. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:37,707][25689] Avg episode reward: [(0, '-49.575')] [2022-07-09 09:57:38,918][26022] Updated weights on worker 0-0, policy_version 198096 (0.00090) [2022-07-09 09:57:40,699][26022] Updated weights on worker 0-0, policy_version 198106 (0.00085) [2022-07-09 09:57:42,205][26022] Updated weights on worker 0-0, policy_version 198116 (0.00081) [2022-07-09 09:57:42,777][25689] Fps is (10 sec: 5664.8, 60 sec: 5712.5, 300 sec: 5739.3). Total num frames: 202871808. Throughput: 0: 6000.7. Samples: 202873700. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:42,779][25689] Avg episode reward: [(0, '-49.503')] [2022-07-09 09:57:44,104][26022] Updated weights on worker 0-0, policy_version 198126 (0.00083) [2022-07-09 09:57:45,914][26022] Updated weights on worker 0-0, policy_version 198136 (0.00082) [2022-07-09 09:57:47,714][26022] Updated weights on worker 0-0, policy_version 198146 (0.00082) [2022-07-09 09:57:47,782][25689] Fps is (10 sec: 5895.0, 60 sec: 5729.4, 300 sec: 5739.4). Total num frames: 202901504. Throughput: 0: 6047.6. Samples: 202909004. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 09:57:47,783][25689] Avg episode reward: [(0, '-49.411')] [2022-07-09 09:57:49,423][26022] Updated weights on worker 0-0, policy_version 198156 (0.00100) [2022-07-09 09:57:51,155][26022] Updated weights on worker 0-0, policy_version 198166 (0.00096) [2022-07-09 09:57:52,738][26022] Updated weights on worker 0-0, policy_version 198176 (0.00093) [2022-07-09 09:57:52,792][25689] Fps is (10 sec: 6032.6, 60 sec: 5745.5, 300 sec: 5746.2). Total num frames: 202932224. Throughput: 0: 5212.9. Samples: 202926590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:57:52,793][25689] Avg episode reward: [(0, '-50.016')] [2022-07-09 09:57:54,699][26022] Updated weights on worker 0-0, policy_version 198186 (0.00088) [2022-07-09 09:57:56,327][26022] Updated weights on worker 0-0, policy_version 198196 (0.00084) [2022-07-09 09:57:57,810][25689] Fps is (10 sec: 5820.2, 60 sec: 5743.9, 300 sec: 5748.1). Total num frames: 202959872. Throughput: 0: 6067.9. Samples: 202961096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:57:57,812][25689] Avg episode reward: [(0, '-49.852')] [2022-07-09 09:57:58,173][26022] Updated weights on worker 0-0, policy_version 198206 (0.00086) [2022-07-09 09:58:00,216][26022] Updated weights on worker 0-0, policy_version 198216 (0.00091) [2022-07-09 09:58:02,128][26022] Updated weights on worker 0-0, policy_version 198226 (0.00082) [2022-07-09 09:58:02,855][25689] Fps is (10 sec: 5596.7, 60 sec: 5786.9, 300 sec: 5744.5). Total num frames: 202988544. Throughput: 0: 5959.9. Samples: 202993362. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:02,855][25689] Avg episode reward: [(0, '-50.254')] [2022-07-09 09:58:04,024][26022] Updated weights on worker 0-0, policy_version 198236 (0.00084) [2022-07-09 09:58:05,711][26022] Updated weights on worker 0-0, policy_version 198246 (0.00086) [2022-07-09 09:58:07,486][26022] Updated weights on worker 0-0, policy_version 198256 (0.00088) [2022-07-09 09:58:07,861][25689] Fps is (10 sec: 5603.6, 60 sec: 5738.9, 300 sec: 5741.4). Total num frames: 203016192. Throughput: 0: 5063.1. Samples: 203010668. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:07,861][25689] Avg episode reward: [(0, '-49.317')] [2022-07-09 09:58:09,263][26022] Updated weights on worker 0-0, policy_version 198266 (0.00094) [2022-07-09 09:58:10,994][26022] Updated weights on worker 0-0, policy_version 198276 (0.00088) [2022-07-09 09:58:12,833][26022] Updated weights on worker 0-0, policy_version 198286 (0.00093) [2022-07-09 09:58:12,870][25689] Fps is (10 sec: 5623.1, 60 sec: 5726.3, 300 sec: 5741.6). Total num frames: 203044864. Throughput: 0: 5907.2. Samples: 203045200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:12,871][25689] Avg episode reward: [(0, '-49.195')] [2022-07-09 09:58:14,732][26022] Updated weights on worker 0-0, policy_version 198296 (0.00090) [2022-07-09 09:58:16,302][26022] Updated weights on worker 0-0, policy_version 198306 (0.00098) [2022-07-09 09:58:17,884][25689] Fps is (10 sec: 5619.0, 60 sec: 5751.1, 300 sec: 5735.2). Total num frames: 203072512. Throughput: 0: 5924.1. Samples: 203080016. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:17,885][25689] Avg episode reward: [(0, '-47.995')] [2022-07-09 09:58:18,218][26022] Updated weights on worker 0-0, policy_version 198316 (0.00082) [2022-07-09 09:58:19,807][26022] Updated weights on worker 0-0, policy_version 198326 (0.00089) [2022-07-09 09:58:21,663][26022] Updated weights on worker 0-0, policy_version 198336 (0.00087) [2022-07-09 09:58:22,940][25689] Fps is (10 sec: 5796.5, 60 sec: 5738.8, 300 sec: 5737.9). Total num frames: 203103232. Throughput: 0: 5175.3. Samples: 203097310. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:22,940][25689] Avg episode reward: [(0, '-48.321')] [2022-07-09 09:58:23,543][26022] Updated weights on worker 0-0, policy_version 198346 (0.00094) [2022-07-09 09:58:25,172][26022] Updated weights on worker 0-0, policy_version 198356 (0.00089) [2022-07-09 09:58:27,233][26022] Updated weights on worker 0-0, policy_version 198366 (0.00092) [2022-07-09 09:58:27,954][25689] Fps is (10 sec: 5999.8, 60 sec: 5755.7, 300 sec: 5737.8). Total num frames: 203132928. Throughput: 0: 6020.4. Samples: 203131634. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:27,954][25689] Avg episode reward: [(0, '-48.584')] [2022-07-09 09:58:28,824][26022] Updated weights on worker 0-0, policy_version 198376 (0.00095) [2022-07-09 09:58:30,682][26022] Updated weights on worker 0-0, policy_version 198386 (0.00092) [2022-07-09 09:58:32,739][26022] Updated weights on worker 0-0, policy_version 198396 (0.00086) [2022-07-09 09:58:32,968][25689] Fps is (10 sec: 5514.0, 60 sec: 5704.5, 300 sec: 5727.4). Total num frames: 203158528. Throughput: 0: 6011.5. Samples: 203166018. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:32,969][25689] Avg episode reward: [(0, '-48.518')] [2022-07-09 09:58:34,127][26022] Updated weights on worker 0-0, policy_version 198406 (0.00095) [2022-07-09 09:58:36,306][26022] Updated weights on worker 0-0, policy_version 198416 (0.00093) [2022-07-09 09:58:37,750][26022] Updated weights on worker 0-0, policy_version 198426 (0.00090) [2022-07-09 09:58:37,972][25689] Fps is (10 sec: 5519.7, 60 sec: 5743.2, 300 sec: 5728.4). Total num frames: 203188224. Throughput: 0: 5152.9. Samples: 203183526. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:37,972][25689] Avg episode reward: [(0, '-48.373')] [2022-07-09 09:58:39,635][26022] Updated weights on worker 0-0, policy_version 198436 (0.00091) [2022-07-09 09:58:41,418][26022] Updated weights on worker 0-0, policy_version 198446 (0.00089) [2022-07-09 09:58:43,049][25689] Fps is (10 sec: 5891.6, 60 sec: 5742.5, 300 sec: 5733.8). Total num frames: 203217920. Throughput: 0: 6005.6. Samples: 203218078. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:43,050][25689] Avg episode reward: [(0, '-48.660')] [2022-07-09 09:58:43,088][26022] Updated weights on worker 0-0, policy_version 198456 (0.00079) [2022-07-09 09:58:44,824][26022] Updated weights on worker 0-0, policy_version 198466 (0.00075) [2022-07-09 09:58:46,142][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 09:58:46,155][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000198473_203236352.pth [2022-07-09 09:58:46,156][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000196453_201167872.pth [2022-07-09 09:58:46,736][26022] Updated weights on worker 0-0, policy_version 198476 (0.00087) [2022-07-09 09:58:48,091][25689] Fps is (10 sec: 5768.2, 60 sec: 5722.0, 300 sec: 5726.2). Total num frames: 203246592. Throughput: 0: 6038.2. Samples: 203253224. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:48,091][25689] Avg episode reward: [(0, '-48.818')] [2022-07-09 09:58:48,344][26022] Updated weights on worker 0-0, policy_version 198486 (0.00084) [2022-07-09 09:58:50,272][26022] Updated weights on worker 0-0, policy_version 198496 (0.00084) [2022-07-09 09:58:52,018][26022] Updated weights on worker 0-0, policy_version 198506 (0.00069) [2022-07-09 09:58:53,106][25689] Fps is (10 sec: 5701.8, 60 sec: 5687.5, 300 sec: 5729.6). Total num frames: 203275264. Throughput: 0: 5192.8. Samples: 203270590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:53,107][25689] Avg episode reward: [(0, '-48.337')] [2022-07-09 09:58:53,639][26022] Updated weights on worker 0-0, policy_version 198516 (0.00083) [2022-07-09 09:58:55,632][26022] Updated weights on worker 0-0, policy_version 198526 (0.00087) [2022-07-09 09:58:57,209][26022] Updated weights on worker 0-0, policy_version 198536 (0.00085) [2022-07-09 09:58:58,135][25689] Fps is (10 sec: 5811.0, 60 sec: 5720.5, 300 sec: 5736.7). Total num frames: 203304960. Throughput: 0: 6019.4. Samples: 203304896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:58:58,136][25689] Avg episode reward: [(0, '-47.879')] [2022-07-09 09:58:59,237][26022] Updated weights on worker 0-0, policy_version 198546 (0.00087) [2022-07-09 09:59:00,882][26022] Updated weights on worker 0-0, policy_version 198556 (0.00086) [2022-07-09 09:59:02,982][26022] Updated weights on worker 0-0, policy_version 198566 (0.00071) [2022-07-09 09:59:03,259][25689] Fps is (10 sec: 5648.3, 60 sec: 5696.0, 300 sec: 5731.3). Total num frames: 203332608. Throughput: 0: 5905.5. Samples: 203337426. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:03,261][25689] Avg episode reward: [(0, '-47.887')] [2022-07-09 09:59:04,910][26022] Updated weights on worker 0-0, policy_version 198576 (0.00096) [2022-07-09 09:59:06,497][26022] Updated weights on worker 0-0, policy_version 198586 (0.00088) [2022-07-09 09:59:08,267][25689] Fps is (10 sec: 5558.5, 60 sec: 5712.8, 300 sec: 5734.6). Total num frames: 203361280. Throughput: 0: 5045.2. Samples: 203355020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:08,269][25689] Avg episode reward: [(0, '-48.907')] [2022-07-09 09:59:08,465][26022] Updated weights on worker 0-0, policy_version 198596 (0.00094) [2022-07-09 09:59:09,892][26022] Updated weights on worker 0-0, policy_version 198606 (0.00087) [2022-07-09 09:59:11,819][26022] Updated weights on worker 0-0, policy_version 198616 (0.00087) [2022-07-09 09:59:13,283][25689] Fps is (10 sec: 5822.8, 60 sec: 5729.2, 300 sec: 5727.6). Total num frames: 203390976. Throughput: 0: 5924.6. Samples: 203390128. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:13,283][25689] Avg episode reward: [(0, '-49.431')] [2022-07-09 09:59:13,635][26022] Updated weights on worker 0-0, policy_version 198626 (0.00091) [2022-07-09 09:59:15,231][26022] Updated weights on worker 0-0, policy_version 198636 (0.00087) [2022-07-09 09:59:17,048][26022] Updated weights on worker 0-0, policy_version 198646 (0.00084) [2022-07-09 09:59:18,306][25689] Fps is (10 sec: 5916.6, 60 sec: 5762.2, 300 sec: 5732.0). Total num frames: 203420672. Throughput: 0: 5976.6. Samples: 203425446. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:18,307][25689] Avg episode reward: [(0, '-50.876')] [2022-07-09 09:59:18,805][26022] Updated weights on worker 0-0, policy_version 198656 (0.00065) [2022-07-09 09:59:20,544][26022] Updated weights on worker 0-0, policy_version 198666 (0.00094) [2022-07-09 09:59:22,419][26022] Updated weights on worker 0-0, policy_version 198676 (0.00081) [2022-07-09 09:59:23,431][25689] Fps is (10 sec: 5852.6, 60 sec: 5738.7, 300 sec: 5740.3). Total num frames: 203450368. Throughput: 0: 5229.8. Samples: 203442922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:23,431][25689] Avg episode reward: [(0, '-50.457')] [2022-07-09 09:59:23,904][26022] Updated weights on worker 0-0, policy_version 198686 (0.00081) [2022-07-09 09:59:25,864][26022] Updated weights on worker 0-0, policy_version 198696 (0.00104) [2022-07-09 09:59:27,485][26022] Updated weights on worker 0-0, policy_version 198706 (0.00084) [2022-07-09 09:59:28,457][25689] Fps is (10 sec: 5648.6, 60 sec: 5703.6, 300 sec: 5729.8). Total num frames: 203478016. Throughput: 0: 6069.5. Samples: 203477564. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:28,458][25689] Avg episode reward: [(0, '-50.486')] [2022-07-09 09:59:29,415][26022] Updated weights on worker 0-0, policy_version 198716 (0.00089) [2022-07-09 09:59:31,341][26022] Updated weights on worker 0-0, policy_version 198726 (0.00083) [2022-07-09 09:59:32,813][26022] Updated weights on worker 0-0, policy_version 198736 (0.00084) [2022-07-09 09:59:33,471][25689] Fps is (10 sec: 5915.5, 60 sec: 5805.3, 300 sec: 5743.6). Total num frames: 203509760. Throughput: 0: 6049.0. Samples: 203512244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:33,471][25689] Avg episode reward: [(0, '-50.500')] [2022-07-09 09:59:34,751][26022] Updated weights on worker 0-0, policy_version 198746 (0.00085) [2022-07-09 09:59:36,483][26022] Updated weights on worker 0-0, policy_version 198756 (0.00083) [2022-07-09 09:59:38,100][26022] Updated weights on worker 0-0, policy_version 198766 (0.00089) [2022-07-09 09:59:38,475][25689] Fps is (10 sec: 5826.3, 60 sec: 5754.4, 300 sec: 5734.5). Total num frames: 203536384. Throughput: 0: 5173.8. Samples: 203529800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:38,476][25689] Avg episode reward: [(0, '-50.626')] [2022-07-09 09:59:39,980][26022] Updated weights on worker 0-0, policy_version 198776 (0.00092) [2022-07-09 09:59:41,747][26022] Updated weights on worker 0-0, policy_version 198786 (0.00083) [2022-07-09 09:59:43,525][25689] Fps is (10 sec: 5601.6, 60 sec: 5757.0, 300 sec: 5737.3). Total num frames: 203566080. Throughput: 0: 6044.6. Samples: 203564384. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:43,525][25689] Avg episode reward: [(0, '-49.421')] [2022-07-09 09:59:43,572][26022] Updated weights on worker 0-0, policy_version 198796 (0.00082) [2022-07-09 09:59:45,426][26022] Updated weights on worker 0-0, policy_version 198806 (0.00091) [2022-07-09 09:59:46,963][26022] Updated weights on worker 0-0, policy_version 198816 (0.00090) [2022-07-09 09:59:48,527][25689] Fps is (10 sec: 5806.9, 60 sec: 5760.8, 300 sec: 5734.0). Total num frames: 203594752. Throughput: 0: 6079.3. Samples: 203599572. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:48,527][25689] Avg episode reward: [(0, '-49.502')] [2022-07-09 09:59:48,960][26022] Updated weights on worker 0-0, policy_version 198826 (0.00087) [2022-07-09 09:59:50,579][26022] Updated weights on worker 0-0, policy_version 198836 (0.00089) [2022-07-09 09:59:52,384][26022] Updated weights on worker 0-0, policy_version 198846 (0.00079) [2022-07-09 09:59:53,571][25689] Fps is (10 sec: 6013.9, 60 sec: 5808.9, 300 sec: 5747.6). Total num frames: 203626496. Throughput: 0: 6096.2. Samples: 203634780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:53,571][25689] Avg episode reward: [(0, '-49.578')] [2022-07-09 09:59:54,108][26022] Updated weights on worker 0-0, policy_version 198856 (0.00088) [2022-07-09 09:59:55,748][26022] Updated weights on worker 0-0, policy_version 198866 (0.00087) [2022-07-09 09:59:57,692][26022] Updated weights on worker 0-0, policy_version 198876 (0.00093) [2022-07-09 09:59:58,588][25689] Fps is (10 sec: 5903.2, 60 sec: 5776.1, 300 sec: 5738.3). Total num frames: 203654144. Throughput: 0: 6084.7. Samples: 203652178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 09:59:58,588][25689] Avg episode reward: [(0, '-50.885')] [2022-07-09 09:59:59,237][26022] Updated weights on worker 0-0, policy_version 198886 (0.00088) [2022-07-09 10:00:01,364][26022] Updated weights on worker 0-0, policy_version 198896 (0.00085) [2022-07-09 10:00:03,248][26022] Updated weights on worker 0-0, policy_version 198906 (0.00087) [2022-07-09 10:00:03,637][25689] Fps is (10 sec: 5391.3, 60 sec: 5766.3, 300 sec: 5735.0). Total num frames: 203680768. Throughput: 0: 5997.7. Samples: 203685014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 10:00:03,638][25689] Avg episode reward: [(0, '-49.978')] [2022-07-09 10:00:05,141][26022] Updated weights on worker 0-0, policy_version 198916 (0.00084) [2022-07-09 10:00:06,976][26022] Updated weights on worker 0-0, policy_version 198926 (0.00081) [2022-07-09 10:00:08,695][25689] Fps is (10 sec: 5470.9, 60 sec: 5761.6, 300 sec: 5734.9). Total num frames: 203709440. Throughput: 0: 5945.7. Samples: 203719488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:08,696][25689] Avg episode reward: [(0, '-49.275')] [2022-07-09 10:00:08,725][26022] Updated weights on worker 0-0, policy_version 198936 (0.00082) [2022-07-09 10:00:10,479][26022] Updated weights on worker 0-0, policy_version 198946 (0.00086) [2022-07-09 10:00:12,315][26022] Updated weights on worker 0-0, policy_version 198956 (0.00082) [2022-07-09 10:00:13,717][25689] Fps is (10 sec: 5790.6, 60 sec: 5760.9, 300 sec: 5738.3). Total num frames: 203739136. Throughput: 0: 5058.9. Samples: 203736698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:13,718][25689] Avg episode reward: [(0, '-49.324')] [2022-07-09 10:00:13,842][26022] Updated weights on worker 0-0, policy_version 198966 (0.00093) [2022-07-09 10:00:15,852][26022] Updated weights on worker 0-0, policy_version 198976 (0.00092) [2022-07-09 10:00:17,380][26022] Updated weights on worker 0-0, policy_version 198986 (0.00088) [2022-07-09 10:00:18,759][25689] Fps is (10 sec: 5800.1, 60 sec: 5742.2, 300 sec: 5736.0). Total num frames: 203767808. Throughput: 0: 5912.9. Samples: 203771446. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:18,759][25689] Avg episode reward: [(0, '-49.094')] [2022-07-09 10:00:19,352][26022] Updated weights on worker 0-0, policy_version 198996 (0.00088) [2022-07-09 10:00:21,213][26022] Updated weights on worker 0-0, policy_version 199006 (0.00095) [2022-07-09 10:00:22,751][26022] Updated weights on worker 0-0, policy_version 199016 (0.00085) [2022-07-09 10:00:23,855][25689] Fps is (10 sec: 5757.8, 60 sec: 5745.0, 300 sec: 5735.0). Total num frames: 203797504. Throughput: 0: 5993.8. Samples: 203806192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:23,856][25689] Avg episode reward: [(0, '-48.517')] [2022-07-09 10:00:24,567][26022] Updated weights on worker 0-0, policy_version 199026 (0.00087) [2022-07-09 10:00:26,258][26022] Updated weights on worker 0-0, policy_version 199036 (0.00610) [2022-07-09 10:00:28,271][26022] Updated weights on worker 0-0, policy_version 199046 (0.00095) [2022-07-09 10:00:28,878][25689] Fps is (10 sec: 5767.7, 60 sec: 5762.2, 300 sec: 5738.4). Total num frames: 203826176. Throughput: 0: 5159.0. Samples: 203823612. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:28,880][25689] Avg episode reward: [(0, '-49.173')] [2022-07-09 10:00:30,062][26022] Updated weights on worker 0-0, policy_version 199056 (0.00081) [2022-07-09 10:00:31,675][26022] Updated weights on worker 0-0, policy_version 199066 (0.00092) [2022-07-09 10:00:33,395][26022] Updated weights on worker 0-0, policy_version 199076 (0.00092) [2022-07-09 10:00:33,912][25689] Fps is (10 sec: 5803.4, 60 sec: 5726.4, 300 sec: 5737.8). Total num frames: 203855872. Throughput: 0: 6017.6. Samples: 203858222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:33,913][25689] Avg episode reward: [(0, '-49.529')] [2022-07-09 10:00:35,347][26022] Updated weights on worker 0-0, policy_version 199086 (0.00091) [2022-07-09 10:00:36,899][26022] Updated weights on worker 0-0, policy_version 199096 (0.00085) [2022-07-09 10:00:38,831][26022] Updated weights on worker 0-0, policy_version 199106 (0.00085) [2022-07-09 10:00:38,951][25689] Fps is (10 sec: 5896.2, 60 sec: 5773.9, 300 sec: 5742.6). Total num frames: 203885568. Throughput: 0: 6024.7. Samples: 203893102. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:38,952][25689] Avg episode reward: [(0, '-49.301')] [2022-07-09 10:00:40,551][26022] Updated weights on worker 0-0, policy_version 199116 (0.00538) [2022-07-09 10:00:42,383][26022] Updated weights on worker 0-0, policy_version 199126 (0.00089) [2022-07-09 10:00:44,035][25689] Fps is (10 sec: 5765.8, 60 sec: 5753.7, 300 sec: 5730.8). Total num frames: 203914240. Throughput: 0: 5163.0. Samples: 203910386. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:44,036][25689] Avg episode reward: [(0, '-49.438')] [2022-07-09 10:00:44,125][26022] Updated weights on worker 0-0, policy_version 199136 (0.00091) [2022-07-09 10:00:45,875][26022] Updated weights on worker 0-0, policy_version 199146 (0.00093) [2022-07-09 10:00:46,287][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:00:46,300][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000199149_203928576.pth [2022-07-09 10:00:46,300][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000197129_201860096.pth [2022-07-09 10:00:47,707][26022] Updated weights on worker 0-0, policy_version 199156 (0.00086) [2022-07-09 10:00:49,075][25689] Fps is (10 sec: 5765.6, 60 sec: 5767.1, 300 sec: 5744.1). Total num frames: 203943936. Throughput: 0: 6024.5. Samples: 203945286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:49,075][25689] Avg episode reward: [(0, '-49.319')] [2022-07-09 10:00:49,487][26022] Updated weights on worker 0-0, policy_version 199166 (0.00086) [2022-07-09 10:00:51,272][26022] Updated weights on worker 0-0, policy_version 199176 (0.00085) [2022-07-09 10:00:52,993][26022] Updated weights on worker 0-0, policy_version 199186 (0.00082) [2022-07-09 10:00:54,140][25689] Fps is (10 sec: 5877.7, 60 sec: 5731.3, 300 sec: 5743.9). Total num frames: 203973632. Throughput: 0: 6027.4. Samples: 203980144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:54,140][25689] Avg episode reward: [(0, '-49.111')] [2022-07-09 10:00:54,556][26022] Updated weights on worker 0-0, policy_version 199196 (0.00096) [2022-07-09 10:00:56,479][26022] Updated weights on worker 0-0, policy_version 199206 (0.00093) [2022-07-09 10:00:58,461][26022] Updated weights on worker 0-0, policy_version 199216 (0.00086) [2022-07-09 10:00:59,186][25689] Fps is (10 sec: 5772.9, 60 sec: 5745.4, 300 sec: 5741.8). Total num frames: 204002304. Throughput: 0: 5149.5. Samples: 203997298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:00:59,186][25689] Avg episode reward: [(0, '-49.023')] [2022-07-09 10:00:59,890][26022] Updated weights on worker 0-0, policy_version 199226 (0.00091) [2022-07-09 10:01:02,338][26022] Updated weights on worker 0-0, policy_version 199236 (0.00091) [2022-07-09 10:01:04,011][26022] Updated weights on worker 0-0, policy_version 199246 (0.00088) [2022-07-09 10:01:04,261][25689] Fps is (10 sec: 5463.3, 60 sec: 5743.0, 300 sec: 5737.8). Total num frames: 204028928. Throughput: 0: 5908.3. Samples: 204029886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:04,262][25689] Avg episode reward: [(0, '-48.669')] [2022-07-09 10:01:05,613][26022] Updated weights on worker 0-0, policy_version 199256 (0.00083) [2022-07-09 10:01:07,537][26022] Updated weights on worker 0-0, policy_version 199266 (0.00089) [2022-07-09 10:01:09,115][26022] Updated weights on worker 0-0, policy_version 199276 (0.00083) [2022-07-09 10:01:09,342][25689] Fps is (10 sec: 5646.4, 60 sec: 5774.6, 300 sec: 5739.7). Total num frames: 204059648. Throughput: 0: 5899.0. Samples: 204064838. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:09,343][25689] Avg episode reward: [(0, '-48.591')] [2022-07-09 10:01:11,006][26022] Updated weights on worker 0-0, policy_version 199286 (0.00079) [2022-07-09 10:01:12,677][26022] Updated weights on worker 0-0, policy_version 199296 (0.00095) [2022-07-09 10:01:14,347][25689] Fps is (10 sec: 5990.4, 60 sec: 5776.2, 300 sec: 5746.8). Total num frames: 204089344. Throughput: 0: 5052.7. Samples: 204082244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:14,347][25689] Avg episode reward: [(0, '-48.447')] [2022-07-09 10:01:14,348][26022] Updated weights on worker 0-0, policy_version 199306 (0.00107) [2022-07-09 10:01:16,318][26022] Updated weights on worker 0-0, policy_version 199316 (0.00088) [2022-07-09 10:01:18,103][26022] Updated weights on worker 0-0, policy_version 199326 (0.00085) [2022-07-09 10:01:19,365][25689] Fps is (10 sec: 5720.8, 60 sec: 5761.5, 300 sec: 5740.4). Total num frames: 204116992. Throughput: 0: 5940.8. Samples: 204117180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:19,366][25689] Avg episode reward: [(0, '-48.827')] [2022-07-09 10:01:19,896][26022] Updated weights on worker 0-0, policy_version 199336 (0.00084) [2022-07-09 10:01:21,759][26022] Updated weights on worker 0-0, policy_version 199346 (0.00092) [2022-07-09 10:01:23,268][26022] Updated weights on worker 0-0, policy_version 199356 (0.00086) [2022-07-09 10:01:24,414][25689] Fps is (10 sec: 5594.2, 60 sec: 5749.0, 300 sec: 5739.7). Total num frames: 204145664. Throughput: 0: 6056.4. Samples: 204151942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:24,416][25689] Avg episode reward: [(0, '-48.699')] [2022-07-09 10:01:25,098][26022] Updated weights on worker 0-0, policy_version 199366 (0.00087) [2022-07-09 10:01:26,939][26022] Updated weights on worker 0-0, policy_version 199376 (0.00091) [2022-07-09 10:01:28,787][26022] Updated weights on worker 0-0, policy_version 199386 (0.00078) [2022-07-09 10:01:29,474][25689] Fps is (10 sec: 5875.5, 60 sec: 5779.5, 300 sec: 5746.9). Total num frames: 204176384. Throughput: 0: 5183.5. Samples: 204169194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:29,474][25689] Avg episode reward: [(0, '-49.318')] [2022-07-09 10:01:30,730][26022] Updated weights on worker 0-0, policy_version 199396 (0.00091) [2022-07-09 10:01:32,182][26022] Updated weights on worker 0-0, policy_version 199406 (0.00078) [2022-07-09 10:01:34,111][26022] Updated weights on worker 0-0, policy_version 199416 (0.00088) [2022-07-09 10:01:34,513][25689] Fps is (10 sec: 5780.1, 60 sec: 5745.2, 300 sec: 5743.4). Total num frames: 204204032. Throughput: 0: 6015.6. Samples: 204203554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:34,514][25689] Avg episode reward: [(0, '-49.592')] [2022-07-09 10:01:35,910][26022] Updated weights on worker 0-0, policy_version 199426 (0.00087) [2022-07-09 10:01:37,516][26022] Updated weights on worker 0-0, policy_version 199436 (0.00087) [2022-07-09 10:01:39,385][26022] Updated weights on worker 0-0, policy_version 199446 (0.00082) [2022-07-09 10:01:39,521][25689] Fps is (10 sec: 5707.3, 60 sec: 5748.1, 300 sec: 5744.5). Total num frames: 204233728. Throughput: 0: 6018.1. Samples: 204238480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:39,523][25689] Avg episode reward: [(0, '-50.239')] [2022-07-09 10:01:41,206][26022] Updated weights on worker 0-0, policy_version 199456 (0.00080) [2022-07-09 10:01:42,911][26022] Updated weights on worker 0-0, policy_version 199466 (0.00117) [2022-07-09 10:01:44,668][25689] Fps is (10 sec: 5747.7, 60 sec: 5742.2, 300 sec: 5741.8). Total num frames: 204262400. Throughput: 0: 5139.7. Samples: 204256038. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:44,668][25689] Avg episode reward: [(0, '-50.405')] [2022-07-09 10:01:44,723][26022] Updated weights on worker 0-0, policy_version 199476 (0.00081) [2022-07-09 10:01:46,259][26022] Updated weights on worker 0-0, policy_version 199486 (0.00085) [2022-07-09 10:01:48,246][26022] Updated weights on worker 0-0, policy_version 199496 (0.00078) [2022-07-09 10:01:49,692][25689] Fps is (10 sec: 5738.6, 60 sec: 5743.6, 300 sec: 5741.3). Total num frames: 204292096. Throughput: 0: 6018.1. Samples: 204290872. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:49,693][25689] Avg episode reward: [(0, '-50.754')] [2022-07-09 10:01:49,823][26022] Updated weights on worker 0-0, policy_version 199506 (0.00086) [2022-07-09 10:01:51,859][26022] Updated weights on worker 0-0, policy_version 199516 (0.00084) [2022-07-09 10:01:53,456][26022] Updated weights on worker 0-0, policy_version 199526 (0.00079) [2022-07-09 10:01:54,728][25689] Fps is (10 sec: 5903.3, 60 sec: 5746.3, 300 sec: 5747.5). Total num frames: 204321792. Throughput: 0: 6054.0. Samples: 204325942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:54,729][25689] Avg episode reward: [(0, '-49.520')] [2022-07-09 10:01:55,311][26022] Updated weights on worker 0-0, policy_version 199536 (0.00092) [2022-07-09 10:01:57,060][26022] Updated weights on worker 0-0, policy_version 199546 (0.00086) [2022-07-09 10:01:58,719][26022] Updated weights on worker 0-0, policy_version 199556 (0.00085) [2022-07-09 10:01:59,751][25689] Fps is (10 sec: 5803.0, 60 sec: 5748.5, 300 sec: 5756.7). Total num frames: 204350464. Throughput: 0: 5189.0. Samples: 204343450. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:01:59,751][25689] Avg episode reward: [(0, '-48.969')] [2022-07-09 10:02:00,478][26022] Updated weights on worker 0-0, policy_version 199566 (0.00086) [2022-07-09 10:02:02,678][26022] Updated weights on worker 0-0, policy_version 199576 (0.00086) [2022-07-09 10:02:04,523][26022] Updated weights on worker 0-0, policy_version 199586 (0.00084) [2022-07-09 10:02:04,888][25689] Fps is (10 sec: 5442.6, 60 sec: 5742.6, 300 sec: 5741.0). Total num frames: 204377088. Throughput: 0: 5924.7. Samples: 204375840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:02:04,889][25689] Avg episode reward: [(0, '-48.248')] [2022-07-09 10:02:06,179][26022] Updated weights on worker 0-0, policy_version 199596 (0.00084) [2022-07-09 10:02:07,943][26022] Updated weights on worker 0-0, policy_version 199606 (0.00087) [2022-07-09 10:02:09,661][26022] Updated weights on worker 0-0, policy_version 199616 (0.00095) [2022-07-09 10:02:09,914][25689] Fps is (10 sec: 5642.5, 60 sec: 5747.9, 300 sec: 5745.0). Total num frames: 204407808. Throughput: 0: 5934.4. Samples: 204410872. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:02:09,914][25689] Avg episode reward: [(0, '-48.111')] [2022-07-09 10:02:11,502][26022] Updated weights on worker 0-0, policy_version 199626 (0.00085) [2022-07-09 10:02:13,120][26022] Updated weights on worker 0-0, policy_version 199636 (0.00333) [2022-07-09 10:02:14,987][25689] Fps is (10 sec: 5779.8, 60 sec: 5707.6, 300 sec: 5748.9). Total num frames: 204435456. Throughput: 0: 5917.2. Samples: 204445816. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 10:02:14,988][25689] Avg episode reward: [(0, '-47.685')] [2022-07-09 10:02:15,182][26022] Updated weights on worker 0-0, policy_version 199646 (0.00084) [2022-07-09 10:02:16,571][26022] Updated weights on worker 0-0, policy_version 199656 (0.00094) [2022-07-09 10:02:18,787][26022] Updated weights on worker 0-0, policy_version 199666 (0.00083) [2022-07-09 10:02:20,071][25689] Fps is (10 sec: 5847.0, 60 sec: 5768.9, 300 sec: 5749.3). Total num frames: 204467200. Throughput: 0: 5898.5. Samples: 204463310. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:20,072][25689] Avg episode reward: [(0, '-47.346')] [2022-07-09 10:02:20,095][26022] Updated weights on worker 0-0, policy_version 199676 (0.00093) [2022-07-09 10:02:22,067][26022] Updated weights on worker 0-0, policy_version 199686 (0.00088) [2022-07-09 10:02:23,763][26022] Updated weights on worker 0-0, policy_version 199696 (0.00096) [2022-07-09 10:02:25,138][25689] Fps is (10 sec: 5850.6, 60 sec: 5750.3, 300 sec: 5744.8). Total num frames: 204494848. Throughput: 0: 6033.0. Samples: 204498008. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:25,139][25689] Avg episode reward: [(0, '-47.146')] [2022-07-09 10:02:25,623][26022] Updated weights on worker 0-0, policy_version 199706 (0.00087) [2022-07-09 10:02:27,483][26022] Updated weights on worker 0-0, policy_version 199716 (0.00081) [2022-07-09 10:02:29,099][26022] Updated weights on worker 0-0, policy_version 199726 (0.00087) [2022-07-09 10:02:30,188][25689] Fps is (10 sec: 5668.1, 60 sec: 5734.4, 300 sec: 5747.5). Total num frames: 204524544. Throughput: 0: 6008.5. Samples: 204532692. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:30,189][25689] Avg episode reward: [(0, '-47.906')] [2022-07-09 10:02:31,041][26022] Updated weights on worker 0-0, policy_version 199736 (0.00086) [2022-07-09 10:02:32,704][26022] Updated weights on worker 0-0, policy_version 199746 (0.00091) [2022-07-09 10:02:34,661][26022] Updated weights on worker 0-0, policy_version 199756 (0.00088) [2022-07-09 10:02:35,275][25689] Fps is (10 sec: 5758.3, 60 sec: 5746.7, 300 sec: 5750.3). Total num frames: 204553216. Throughput: 0: 5117.1. Samples: 204549630. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:35,275][25689] Avg episode reward: [(0, '-48.493')] [2022-07-09 10:02:36,197][26022] Updated weights on worker 0-0, policy_version 199766 (0.00088) [2022-07-09 10:02:38,049][26022] Updated weights on worker 0-0, policy_version 199776 (0.00090) [2022-07-09 10:02:39,760][26022] Updated weights on worker 0-0, policy_version 199786 (0.00091) [2022-07-09 10:02:40,343][25689] Fps is (10 sec: 5748.0, 60 sec: 5741.1, 300 sec: 5750.3). Total num frames: 204582912. Throughput: 0: 5980.3. Samples: 204584540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:40,343][25689] Avg episode reward: [(0, '-48.638')] [2022-07-09 10:02:41,633][26022] Updated weights on worker 0-0, policy_version 199796 (0.00085) [2022-07-09 10:02:43,538][26022] Updated weights on worker 0-0, policy_version 199806 (0.00081) [2022-07-09 10:02:45,107][26022] Updated weights on worker 0-0, policy_version 199816 (0.00084) [2022-07-09 10:02:45,402][25689] Fps is (10 sec: 5965.9, 60 sec: 5783.1, 300 sec: 5752.7). Total num frames: 204613632. Throughput: 0: 5990.8. Samples: 204619402. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:45,402][25689] Avg episode reward: [(0, '-47.754')] [2022-07-09 10:02:46,536][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:02:46,553][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000199823_204618752.pth [2022-07-09 10:02:46,554][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000197802_202549248.pth [2022-07-09 10:02:46,982][26022] Updated weights on worker 0-0, policy_version 199826 (0.00083) [2022-07-09 10:02:48,651][26022] Updated weights on worker 0-0, policy_version 199836 (0.00086) [2022-07-09 10:02:50,436][25689] Fps is (10 sec: 5884.1, 60 sec: 5765.3, 300 sec: 5745.4). Total num frames: 204642304. Throughput: 0: 5148.2. Samples: 204636934. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:50,437][25689] Avg episode reward: [(0, '-48.535')] [2022-07-09 10:02:50,448][26022] Updated weights on worker 0-0, policy_version 199846 (0.00096) [2022-07-09 10:02:52,347][26022] Updated weights on worker 0-0, policy_version 199856 (0.00077) [2022-07-09 10:02:53,926][26022] Updated weights on worker 0-0, policy_version 199866 (0.00087) [2022-07-09 10:02:55,475][25689] Fps is (10 sec: 5692.4, 60 sec: 5748.2, 300 sec: 5748.4). Total num frames: 204670976. Throughput: 0: 6050.8. Samples: 204671860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:02:55,476][25689] Avg episode reward: [(0, '-48.322')] [2022-07-09 10:02:55,745][26022] Updated weights on worker 0-0, policy_version 199876 (0.00085) [2022-07-09 10:02:57,463][26022] Updated weights on worker 0-0, policy_version 199886 (0.00089) [2022-07-09 10:02:59,282][26022] Updated weights on worker 0-0, policy_version 199896 (0.00087) [2022-07-09 10:03:00,479][25689] Fps is (10 sec: 5710.3, 60 sec: 5750.0, 300 sec: 5749.2). Total num frames: 204699648. Throughput: 0: 6065.3. Samples: 204706670. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:00,480][25689] Avg episode reward: [(0, '-47.653')] [2022-07-09 10:03:01,109][26022] Updated weights on worker 0-0, policy_version 199906 (0.00087) [2022-07-09 10:03:03,176][26022] Updated weights on worker 0-0, policy_version 199916 (0.00072) [2022-07-09 10:03:05,155][26022] Updated weights on worker 0-0, policy_version 199926 (0.00088) [2022-07-09 10:03:05,584][25689] Fps is (10 sec: 5571.4, 60 sec: 5769.9, 300 sec: 5747.3). Total num frames: 204727296. Throughput: 0: 5052.2. Samples: 204721368. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:05,585][25689] Avg episode reward: [(0, '-48.233')] [2022-07-09 10:03:06,612][26022] Updated weights on worker 0-0, policy_version 199936 (0.00104) [2022-07-09 10:03:08,695][26022] Updated weights on worker 0-0, policy_version 199946 (0.00086) [2022-07-09 10:03:10,405][26022] Updated weights on worker 0-0, policy_version 199956 (0.00083) [2022-07-09 10:03:10,605][25689] Fps is (10 sec: 5562.0, 60 sec: 5736.6, 300 sec: 5747.1). Total num frames: 204755968. Throughput: 0: 5909.2. Samples: 204756112. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:10,605][25689] Avg episode reward: [(0, '-48.491')] [2022-07-09 10:03:12,285][26022] Updated weights on worker 0-0, policy_version 199966 (0.00093) [2022-07-09 10:03:14,087][26022] Updated weights on worker 0-0, policy_version 199976 (0.00099) [2022-07-09 10:03:15,651][25689] Fps is (10 sec: 5696.5, 60 sec: 5756.1, 300 sec: 5749.9). Total num frames: 204784640. Throughput: 0: 5895.1. Samples: 204790796. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:15,651][25689] Avg episode reward: [(0, '-49.663')] [2022-07-09 10:03:15,698][26022] Updated weights on worker 0-0, policy_version 199986 (0.00086) [2022-07-09 10:03:17,573][26022] Updated weights on worker 0-0, policy_version 199996 (0.00086) [2022-07-09 10:03:19,072][26022] Updated weights on worker 0-0, policy_version 200006 (0.00079) [2022-07-09 10:03:20,655][25689] Fps is (10 sec: 5705.6, 60 sec: 5712.9, 300 sec: 5744.0). Total num frames: 204813312. Throughput: 0: 5041.0. Samples: 204808380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:20,656][25689] Avg episode reward: [(0, '-49.278')] [2022-07-09 10:03:21,031][26022] Updated weights on worker 0-0, policy_version 200016 (0.00079) [2022-07-09 10:03:22,770][26022] Updated weights on worker 0-0, policy_version 200026 (0.00088) [2022-07-09 10:03:24,609][26022] Updated weights on worker 0-0, policy_version 200036 (0.00092) [2022-07-09 10:03:25,732][25689] Fps is (10 sec: 5891.5, 60 sec: 5762.8, 300 sec: 5746.2). Total num frames: 204844032. Throughput: 0: 6043.8. Samples: 204843136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:25,732][25689] Avg episode reward: [(0, '-50.221')] [2022-07-09 10:03:26,203][26022] Updated weights on worker 0-0, policy_version 200046 (0.00084) [2022-07-09 10:03:28,149][26022] Updated weights on worker 0-0, policy_version 200056 (0.00091) [2022-07-09 10:03:30,002][26022] Updated weights on worker 0-0, policy_version 200066 (0.00082) [2022-07-09 10:03:30,793][25689] Fps is (10 sec: 5757.2, 60 sec: 5727.8, 300 sec: 5752.2). Total num frames: 204871680. Throughput: 0: 6036.3. Samples: 204877978. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:30,794][25689] Avg episode reward: [(0, '-49.955')] [2022-07-09 10:03:31,525][26022] Updated weights on worker 0-0, policy_version 200076 (0.00084) [2022-07-09 10:03:33,328][26022] Updated weights on worker 0-0, policy_version 200086 (0.00089) [2022-07-09 10:03:35,280][26022] Updated weights on worker 0-0, policy_version 200096 (0.00090) [2022-07-09 10:03:35,875][25689] Fps is (10 sec: 5653.6, 60 sec: 5745.2, 300 sec: 5750.7). Total num frames: 204901376. Throughput: 0: 5165.0. Samples: 204895252. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:35,875][25689] Avg episode reward: [(0, '-49.623')] [2022-07-09 10:03:37,111][26022] Updated weights on worker 0-0, policy_version 200106 (0.00094) [2022-07-09 10:03:38,848][26022] Updated weights on worker 0-0, policy_version 200116 (0.00109) [2022-07-09 10:03:40,564][26022] Updated weights on worker 0-0, policy_version 200126 (0.00091) [2022-07-09 10:03:40,903][25689] Fps is (10 sec: 5773.3, 60 sec: 5732.0, 300 sec: 5748.2). Total num frames: 204930048. Throughput: 0: 5952.1. Samples: 204928900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:40,904][25689] Avg episode reward: [(0, '-50.003')] [2022-07-09 10:03:42,758][26022] Updated weights on worker 0-0, policy_version 200136 (0.00090) [2022-07-09 10:03:44,439][26022] Updated weights on worker 0-0, policy_version 200146 (0.00091) [2022-07-09 10:03:45,954][25689] Fps is (10 sec: 5689.4, 60 sec: 5699.0, 300 sec: 5748.0). Total num frames: 204958720. Throughput: 0: 5917.6. Samples: 204962802. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:45,954][25689] Avg episode reward: [(0, '-50.420')] [2022-07-09 10:03:46,044][26022] Updated weights on worker 0-0, policy_version 200156 (0.00085) [2022-07-09 10:03:47,920][26022] Updated weights on worker 0-0, policy_version 200166 (0.00092) [2022-07-09 10:03:49,580][26022] Updated weights on worker 0-0, policy_version 200176 (0.00083) [2022-07-09 10:03:50,962][25689] Fps is (10 sec: 5700.8, 60 sec: 5701.5, 300 sec: 5748.2). Total num frames: 204987392. Throughput: 0: 5047.0. Samples: 204979770. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:50,963][25689] Avg episode reward: [(0, '-50.445')] [2022-07-09 10:03:51,473][26022] Updated weights on worker 0-0, policy_version 200186 (0.00806) [2022-07-09 10:03:53,352][26022] Updated weights on worker 0-0, policy_version 200196 (0.00085) [2022-07-09 10:03:55,109][26022] Updated weights on worker 0-0, policy_version 200206 (0.00087) [2022-07-09 10:03:55,974][25689] Fps is (10 sec: 5722.5, 60 sec: 5704.0, 300 sec: 5745.1). Total num frames: 205016064. Throughput: 0: 5929.8. Samples: 205014440. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:03:55,976][25689] Avg episode reward: [(0, '-50.251')] [2022-07-09 10:03:56,977][26022] Updated weights on worker 0-0, policy_version 200216 (0.00094) [2022-07-09 10:03:58,581][26022] Updated weights on worker 0-0, policy_version 200226 (0.00090) [2022-07-09 10:04:00,420][26022] Updated weights on worker 0-0, policy_version 200236 (0.00088) [2022-07-09 10:04:01,023][25689] Fps is (10 sec: 5597.9, 60 sec: 5682.8, 300 sec: 5746.5). Total num frames: 205043712. Throughput: 0: 5969.5. Samples: 205049004. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:04:01,023][25689] Avg episode reward: [(0, '-50.965')] [2022-07-09 10:04:02,628][26022] Updated weights on worker 0-0, policy_version 200246 (0.00126) [2022-07-09 10:04:04,467][26022] Updated weights on worker 0-0, policy_version 200256 (0.00086) [2022-07-09 10:04:06,035][26022] Updated weights on worker 0-0, policy_version 200266 (0.00085) [2022-07-09 10:04:06,120][25689] Fps is (10 sec: 5551.1, 60 sec: 5700.5, 300 sec: 5744.8). Total num frames: 205072384. Throughput: 0: 5856.7. Samples: 205080912. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:04:06,121][25689] Avg episode reward: [(0, '-51.229')] [2022-07-09 10:04:08,075][26022] Updated weights on worker 0-0, policy_version 200276 (0.00085) [2022-07-09 10:04:09,540][26022] Updated weights on worker 0-0, policy_version 200286 (0.00090) [2022-07-09 10:04:11,130][25689] Fps is (10 sec: 5471.2, 60 sec: 5667.7, 300 sec: 5734.6). Total num frames: 205099008. Throughput: 0: 5876.4. Samples: 205098282. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:04:11,130][25689] Avg episode reward: [(0, '-50.639')] [2022-07-09 10:04:11,678][26022] Updated weights on worker 0-0, policy_version 200296 (0.00052) [2022-07-09 10:04:13,196][26022] Updated weights on worker 0-0, policy_version 200306 (0.00093) [2022-07-09 10:04:15,099][26022] Updated weights on worker 0-0, policy_version 200316 (0.00085) [2022-07-09 10:04:16,151][25689] Fps is (10 sec: 5717.0, 60 sec: 5703.9, 300 sec: 5738.1). Total num frames: 205129728. Throughput: 0: 5863.4. Samples: 205132742. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:04:16,153][25689] Avg episode reward: [(0, '-49.832')] [2022-07-09 10:04:16,811][26022] Updated weights on worker 0-0, policy_version 200326 (0.00087) [2022-07-09 10:04:18,719][26022] Updated weights on worker 0-0, policy_version 200336 (0.00085) [2022-07-09 10:04:20,499][26022] Updated weights on worker 0-0, policy_version 200346 (0.00090) [2022-07-09 10:04:21,176][25689] Fps is (10 sec: 5912.0, 60 sec: 5702.0, 300 sec: 5736.6). Total num frames: 205158400. Throughput: 0: 5862.4. Samples: 205167148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:04:21,178][25689] Avg episode reward: [(0, '-48.196')] [2022-07-09 10:04:22,300][26022] Updated weights on worker 0-0, policy_version 200356 (0.00088) [2022-07-09 10:04:24,014][26022] Updated weights on worker 0-0, policy_version 200366 (0.00086) [2022-07-09 10:04:25,991][26022] Updated weights on worker 0-0, policy_version 200376 (0.00093) [2022-07-09 10:04:26,221][25689] Fps is (10 sec: 5694.1, 60 sec: 5671.0, 300 sec: 5739.6). Total num frames: 205187072. Throughput: 0: 5139.6. Samples: 205184224. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 10:04:26,224][25689] Avg episode reward: [(0, '-47.724')] [2022-07-09 10:04:27,626][26022] Updated weights on worker 0-0, policy_version 200386 (0.00085) [2022-07-09 10:04:29,497][26022] Updated weights on worker 0-0, policy_version 200396 (0.00089) [2022-07-09 10:04:31,221][26022] Updated weights on worker 0-0, policy_version 200406 (0.00079) [2022-07-09 10:04:31,258][25689] Fps is (10 sec: 5687.4, 60 sec: 5690.3, 300 sec: 5728.9). Total num frames: 205215744. Throughput: 0: 5972.5. Samples: 205218500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:04:31,259][25689] Avg episode reward: [(0, '-46.874')] [2022-07-09 10:04:32,967][26022] Updated weights on worker 0-0, policy_version 200416 (0.00094) [2022-07-09 10:04:34,919][26022] Updated weights on worker 0-0, policy_version 200426 (0.00088) [2022-07-09 10:04:36,281][25689] Fps is (10 sec: 5700.5, 60 sec: 5678.9, 300 sec: 5735.4). Total num frames: 205244416. Throughput: 0: 5962.5. Samples: 205252770. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:04:36,281][25689] Avg episode reward: [(0, '-46.701')] [2022-07-09 10:04:36,599][26022] Updated weights on worker 0-0, policy_version 200436 (0.00090) [2022-07-09 10:04:38,487][26022] Updated weights on worker 0-0, policy_version 200446 (0.00085) [2022-07-09 10:04:40,145][26022] Updated weights on worker 0-0, policy_version 200456 (0.00085) [2022-07-09 10:04:41,293][25689] Fps is (10 sec: 5612.8, 60 sec: 5663.5, 300 sec: 5729.2). Total num frames: 205272064. Throughput: 0: 5110.2. Samples: 205269950. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:04:41,293][25689] Avg episode reward: [(0, '-47.045')] [2022-07-09 10:04:42,143][26022] Updated weights on worker 0-0, policy_version 200466 (0.00085) [2022-07-09 10:04:43,699][26022] Updated weights on worker 0-0, policy_version 200476 (0.00580) [2022-07-09 10:04:45,675][26022] Updated weights on worker 0-0, policy_version 200486 (0.00095) [2022-07-09 10:04:46,407][25689] Fps is (10 sec: 5763.9, 60 sec: 5691.3, 300 sec: 5733.9). Total num frames: 205302784. Throughput: 0: 5961.8. Samples: 205304570. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:04:46,408][25689] Avg episode reward: [(0, '-47.548')] [2022-07-09 10:04:46,702][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:04:46,710][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000200493_205304832.pth [2022-07-09 10:04:46,711][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000198473_203236352.pth [2022-07-09 10:04:47,219][26022] Updated weights on worker 0-0, policy_version 200496 (0.00086) [2022-07-09 10:04:49,199][26022] Updated weights on worker 0-0, policy_version 200506 (0.00086) [2022-07-09 10:04:50,651][26022] Updated weights on worker 0-0, policy_version 200516 (0.00090) [2022-07-09 10:04:51,419][25689] Fps is (10 sec: 5764.1, 60 sec: 5674.1, 300 sec: 5720.8). Total num frames: 205330432. Throughput: 0: 5993.8. Samples: 205339338. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:04:51,419][25689] Avg episode reward: [(0, '-47.838')] [2022-07-09 10:04:52,690][26022] Updated weights on worker 0-0, policy_version 200526 (0.00088) [2022-07-09 10:04:54,483][26022] Updated weights on worker 0-0, policy_version 200536 (0.00086) [2022-07-09 10:04:56,300][26022] Updated weights on worker 0-0, policy_version 200546 (0.00090) [2022-07-09 10:04:56,425][25689] Fps is (10 sec: 5724.2, 60 sec: 5691.6, 300 sec: 5727.9). Total num frames: 205360128. Throughput: 0: 5155.8. Samples: 205356630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:04:56,426][25689] Avg episode reward: [(0, '-49.084')] [2022-07-09 10:04:58,067][26022] Updated weights on worker 0-0, policy_version 200556 (0.00097) [2022-07-09 10:04:59,817][26022] Updated weights on worker 0-0, policy_version 200566 (0.00090) [2022-07-09 10:05:01,446][25689] Fps is (10 sec: 5718.8, 60 sec: 5694.2, 300 sec: 5731.9). Total num frames: 205387776. Throughput: 0: 6002.2. Samples: 205390914. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:01,447][25689] Avg episode reward: [(0, '-48.699')] [2022-07-09 10:05:01,555][26022] Updated weights on worker 0-0, policy_version 200576 (0.00094) [2022-07-09 10:05:03,888][26022] Updated weights on worker 0-0, policy_version 200586 (0.00090) [2022-07-09 10:05:05,485][26022] Updated weights on worker 0-0, policy_version 200596 (0.00051) [2022-07-09 10:05:06,579][25689] Fps is (10 sec: 5546.6, 60 sec: 5690.9, 300 sec: 5730.4). Total num frames: 205416448. Throughput: 0: 5856.7. Samples: 205422710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:06,582][25689] Avg episode reward: [(0, '-49.238')] [2022-07-09 10:05:07,547][26022] Updated weights on worker 0-0, policy_version 200606 (0.00090) [2022-07-09 10:05:09,284][26022] Updated weights on worker 0-0, policy_version 200616 (0.00087) [2022-07-09 10:05:10,936][26022] Updated weights on worker 0-0, policy_version 200626 (0.00088) [2022-07-09 10:05:11,605][25689] Fps is (10 sec: 5543.8, 60 sec: 5706.2, 300 sec: 5723.5). Total num frames: 205444096. Throughput: 0: 4983.6. Samples: 205439938. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:11,607][25689] Avg episode reward: [(0, '-49.916')] [2022-07-09 10:05:12,719][26022] Updated weights on worker 0-0, policy_version 200636 (0.00096) [2022-07-09 10:05:14,464][26022] Updated weights on worker 0-0, policy_version 200646 (0.00085) [2022-07-09 10:05:16,298][26022] Updated weights on worker 0-0, policy_version 200656 (0.00090) [2022-07-09 10:05:16,650][25689] Fps is (10 sec: 5592.4, 60 sec: 5670.1, 300 sec: 5723.4). Total num frames: 205472768. Throughput: 0: 5831.3. Samples: 205474566. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:16,651][25689] Avg episode reward: [(0, '-50.002')] [2022-07-09 10:05:18,243][26022] Updated weights on worker 0-0, policy_version 200666 (0.00083) [2022-07-09 10:05:19,742][26022] Updated weights on worker 0-0, policy_version 200676 (0.00087) [2022-07-09 10:05:21,719][25689] Fps is (10 sec: 5669.9, 60 sec: 5666.0, 300 sec: 5720.5). Total num frames: 205501440. Throughput: 0: 5831.3. Samples: 205509130. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:21,721][25689] Avg episode reward: [(0, '-50.328')] [2022-07-09 10:05:21,776][26022] Updated weights on worker 0-0, policy_version 200686 (0.00081) [2022-07-09 10:05:23,189][26022] Updated weights on worker 0-0, policy_version 200696 (0.00088) [2022-07-09 10:05:25,513][26022] Updated weights on worker 0-0, policy_version 200706 (0.00090) [2022-07-09 10:05:26,792][25689] Fps is (10 sec: 5754.8, 60 sec: 5680.3, 300 sec: 5723.0). Total num frames: 205531136. Throughput: 0: 5114.2. Samples: 205526084. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:26,793][25689] Avg episode reward: [(0, '-49.407')] [2022-07-09 10:05:27,055][26022] Updated weights on worker 0-0, policy_version 200716 (0.00095) [2022-07-09 10:05:29,099][26022] Updated weights on worker 0-0, policy_version 200726 (0.00093) [2022-07-09 10:05:30,705][26022] Updated weights on worker 0-0, policy_version 200736 (0.00089) [2022-07-09 10:05:31,795][25689] Fps is (10 sec: 5691.1, 60 sec: 5666.7, 300 sec: 5716.7). Total num frames: 205558784. Throughput: 0: 5947.0. Samples: 205560006. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:31,795][25689] Avg episode reward: [(0, '-49.497')] [2022-07-09 10:05:32,705][26022] Updated weights on worker 0-0, policy_version 200746 (0.00087) [2022-07-09 10:05:34,259][26022] Updated weights on worker 0-0, policy_version 200756 (0.00096) [2022-07-09 10:05:36,141][26022] Updated weights on worker 0-0, policy_version 200766 (0.00092) [2022-07-09 10:05:36,823][25689] Fps is (10 sec: 5615.0, 60 sec: 5666.2, 300 sec: 5713.5). Total num frames: 205587456. Throughput: 0: 5923.5. Samples: 205594060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:36,824][25689] Avg episode reward: [(0, '-49.467')] [2022-07-09 10:05:37,871][26022] Updated weights on worker 0-0, policy_version 200776 (0.00090) [2022-07-09 10:05:39,823][26022] Updated weights on worker 0-0, policy_version 200786 (0.00078) [2022-07-09 10:05:41,449][26022] Updated weights on worker 0-0, policy_version 200796 (0.00084) [2022-07-09 10:05:41,828][25689] Fps is (10 sec: 5817.7, 60 sec: 5700.6, 300 sec: 5718.5). Total num frames: 205617152. Throughput: 0: 5089.4. Samples: 205611472. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:41,829][25689] Avg episode reward: [(0, '-49.357')] [2022-07-09 10:05:43,271][26022] Updated weights on worker 0-0, policy_version 200806 (0.00090) [2022-07-09 10:05:44,882][26022] Updated weights on worker 0-0, policy_version 200816 (0.00091) [2022-07-09 10:05:46,690][26022] Updated weights on worker 0-0, policy_version 200826 (0.00903) [2022-07-09 10:05:46,928][25689] Fps is (10 sec: 5877.5, 60 sec: 5685.1, 300 sec: 5717.3). Total num frames: 205646848. Throughput: 0: 5984.8. Samples: 205646590. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:46,930][25689] Avg episode reward: [(0, '-48.661')] [2022-07-09 10:05:48,456][26022] Updated weights on worker 0-0, policy_version 200836 (0.00086) [2022-07-09 10:05:50,226][26022] Updated weights on worker 0-0, policy_version 200846 (0.00082) [2022-07-09 10:05:51,978][25689] Fps is (10 sec: 5750.5, 60 sec: 5698.4, 300 sec: 5714.1). Total num frames: 205675520. Throughput: 0: 6025.7. Samples: 205681622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:51,978][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 10:05:52,122][26022] Updated weights on worker 0-0, policy_version 200856 (0.00473) [2022-07-09 10:05:53,819][26022] Updated weights on worker 0-0, policy_version 200866 (0.00088) [2022-07-09 10:05:55,733][26022] Updated weights on worker 0-0, policy_version 200876 (0.00090) [2022-07-09 10:05:57,021][25689] Fps is (10 sec: 5884.2, 60 sec: 5711.8, 300 sec: 5721.1). Total num frames: 205706240. Throughput: 0: 5192.5. Samples: 205698940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:05:57,022][25689] Avg episode reward: [(0, '-49.016')] [2022-07-09 10:05:57,155][26022] Updated weights on worker 0-0, policy_version 200886 (0.00087) [2022-07-09 10:05:59,128][26022] Updated weights on worker 0-0, policy_version 200896 (0.00094) [2022-07-09 10:06:01,017][26022] Updated weights on worker 0-0, policy_version 200906 (0.00083) [2022-07-09 10:06:02,047][25689] Fps is (10 sec: 5593.6, 60 sec: 5677.6, 300 sec: 5718.6). Total num frames: 205731840. Throughput: 0: 6042.7. Samples: 205733648. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:02,049][25689] Avg episode reward: [(0, '-48.239')] [2022-07-09 10:06:03,009][26022] Updated weights on worker 0-0, policy_version 200916 (0.00047) [2022-07-09 10:06:04,989][26022] Updated weights on worker 0-0, policy_version 200926 (0.00081) [2022-07-09 10:06:06,193][26022] Updated weights on worker 0-0, policy_version 200936 (0.00089) [2022-07-09 10:06:07,131][25689] Fps is (10 sec: 5368.4, 60 sec: 5682.2, 300 sec: 5711.6). Total num frames: 205760512. Throughput: 0: 5927.4. Samples: 205766340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:07,131][25689] Avg episode reward: [(0, '-48.827')] [2022-07-09 10:06:08,278][26022] Updated weights on worker 0-0, policy_version 200946 (0.00084) [2022-07-09 10:06:10,271][26022] Updated weights on worker 0-0, policy_version 200956 (0.00086) [2022-07-09 10:06:11,782][26022] Updated weights on worker 0-0, policy_version 200966 (0.00371) [2022-07-09 10:06:12,140][25689] Fps is (10 sec: 5985.5, 60 sec: 5751.4, 300 sec: 5718.4). Total num frames: 205792256. Throughput: 0: 5058.9. Samples: 205783622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:12,141][25689] Avg episode reward: [(0, '-49.195')] [2022-07-09 10:06:13,865][26022] Updated weights on worker 0-0, policy_version 200976 (0.00090) [2022-07-09 10:06:15,200][26022] Updated weights on worker 0-0, policy_version 200986 (0.00090) [2022-07-09 10:06:17,205][25689] Fps is (10 sec: 5895.4, 60 sec: 5732.6, 300 sec: 5717.5). Total num frames: 205819904. Throughput: 0: 5934.6. Samples: 205818724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:17,207][25689] Avg episode reward: [(0, '-48.610')] [2022-07-09 10:06:17,210][26022] Updated weights on worker 0-0, policy_version 200996 (0.00095) [2022-07-09 10:06:18,966][26022] Updated weights on worker 0-0, policy_version 201006 (0.00085) [2022-07-09 10:06:20,813][26022] Updated weights on worker 0-0, policy_version 201016 (0.00080) [2022-07-09 10:06:22,234][25689] Fps is (10 sec: 5579.6, 60 sec: 5736.3, 300 sec: 5717.9). Total num frames: 205848576. Throughput: 0: 5946.1. Samples: 205853686. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:22,236][25689] Avg episode reward: [(0, '-48.747')] [2022-07-09 10:06:22,543][26022] Updated weights on worker 0-0, policy_version 201026 (0.00091) [2022-07-09 10:06:24,439][26022] Updated weights on worker 0-0, policy_version 201036 (0.00085) [2022-07-09 10:06:25,980][26022] Updated weights on worker 0-0, policy_version 201046 (0.00091) [2022-07-09 10:06:27,322][25689] Fps is (10 sec: 5769.6, 60 sec: 5735.0, 300 sec: 5714.0). Total num frames: 205878272. Throughput: 0: 5198.0. Samples: 205871292. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:27,322][25689] Avg episode reward: [(0, '-48.941')] [2022-07-09 10:06:27,919][26022] Updated weights on worker 0-0, policy_version 201056 (0.00081) [2022-07-09 10:06:29,434][26022] Updated weights on worker 0-0, policy_version 201066 (0.00091) [2022-07-09 10:06:31,335][26022] Updated weights on worker 0-0, policy_version 201076 (0.00100) [2022-07-09 10:06:32,373][25689] Fps is (10 sec: 5857.9, 60 sec: 5764.2, 300 sec: 5720.6). Total num frames: 205907968. Throughput: 0: 6051.8. Samples: 205906066. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:32,374][25689] Avg episode reward: [(0, '-49.035')] [2022-07-09 10:06:33,131][26022] Updated weights on worker 0-0, policy_version 201086 (0.00939) [2022-07-09 10:06:34,757][26022] Updated weights on worker 0-0, policy_version 201096 (0.00093) [2022-07-09 10:06:36,563][26022] Updated weights on worker 0-0, policy_version 201106 (0.00093) [2022-07-09 10:06:37,429][25689] Fps is (10 sec: 5774.7, 60 sec: 5761.5, 300 sec: 5716.2). Total num frames: 205936640. Throughput: 0: 6045.3. Samples: 205940984. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:37,430][25689] Avg episode reward: [(0, '-48.930')] [2022-07-09 10:06:38,470][26022] Updated weights on worker 0-0, policy_version 201116 (0.00091) [2022-07-09 10:06:40,029][26022] Updated weights on worker 0-0, policy_version 201126 (0.00081) [2022-07-09 10:06:42,015][26022] Updated weights on worker 0-0, policy_version 201136 (0.00088) [2022-07-09 10:06:42,447][25689] Fps is (10 sec: 5692.2, 60 sec: 5743.4, 300 sec: 5718.7). Total num frames: 205965312. Throughput: 0: 6044.7. Samples: 205975868. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-09 10:06:42,449][25689] Avg episode reward: [(0, '-48.415')] [2022-07-09 10:06:43,523][26022] Updated weights on worker 0-0, policy_version 201146 (0.00085) [2022-07-09 10:06:45,582][26022] Updated weights on worker 0-0, policy_version 201156 (0.00091) [2022-07-09 10:06:46,786][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:06:46,804][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000201164_205991936.pth [2022-07-09 10:06:46,805][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000199149_203928576.pth [2022-07-09 10:06:47,202][26022] Updated weights on worker 0-0, policy_version 201166 (0.00086) [2022-07-09 10:06:47,509][25689] Fps is (10 sec: 5892.0, 60 sec: 5763.9, 300 sec: 5721.4). Total num frames: 205996032. Throughput: 0: 6037.5. Samples: 205993176. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:06:47,510][25689] Avg episode reward: [(0, '-48.654')] [2022-07-09 10:06:48,859][26022] Updated weights on worker 0-0, policy_version 201176 (0.00084) [2022-07-09 10:06:50,663][26022] Updated weights on worker 0-0, policy_version 201186 (0.00086) [2022-07-09 10:06:52,512][25689] Fps is (10 sec: 5799.2, 60 sec: 5751.5, 300 sec: 5715.2). Total num frames: 206023680. Throughput: 0: 6063.6. Samples: 206028182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:06:52,513][25689] Avg episode reward: [(0, '-49.486')] [2022-07-09 10:06:52,683][26022] Updated weights on worker 0-0, policy_version 201196 (0.00090) [2022-07-09 10:06:54,236][26022] Updated weights on worker 0-0, policy_version 201206 (0.00089) [2022-07-09 10:06:56,051][26022] Updated weights on worker 0-0, policy_version 201216 (0.00083) [2022-07-09 10:06:57,516][25689] Fps is (10 sec: 5730.6, 60 sec: 5738.3, 300 sec: 5719.0). Total num frames: 206053376. Throughput: 0: 6080.1. Samples: 206063114. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:06:57,516][25689] Avg episode reward: [(0, '-48.598')] [2022-07-09 10:06:57,667][26022] Updated weights on worker 0-0, policy_version 201226 (0.00096) [2022-07-09 10:06:59,510][26022] Updated weights on worker 0-0, policy_version 201236 (0.00108) [2022-07-09 10:07:01,241][26022] Updated weights on worker 0-0, policy_version 201246 (0.00088) [2022-07-09 10:07:02,535][25689] Fps is (10 sec: 5721.5, 60 sec: 5772.8, 300 sec: 5724.7). Total num frames: 206081024. Throughput: 0: 5210.1. Samples: 206080526. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:02,536][25689] Avg episode reward: [(0, '-50.066')] [2022-07-09 10:07:03,533][26022] Updated weights on worker 0-0, policy_version 201256 (0.00085) [2022-07-09 10:07:05,159][26022] Updated weights on worker 0-0, policy_version 201266 (0.00098) [2022-07-09 10:07:07,247][26022] Updated weights on worker 0-0, policy_version 201276 (0.00100) [2022-07-09 10:07:07,613][25689] Fps is (10 sec: 5476.5, 60 sec: 5756.4, 300 sec: 5713.4). Total num frames: 206108672. Throughput: 0: 5952.1. Samples: 206112834. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:07,613][25689] Avg episode reward: [(0, '-50.493')] [2022-07-09 10:07:08,666][26022] Updated weights on worker 0-0, policy_version 201286 (0.00079) [2022-07-09 10:07:10,755][26022] Updated weights on worker 0-0, policy_version 201296 (0.00088) [2022-07-09 10:07:12,138][26022] Updated weights on worker 0-0, policy_version 201306 (0.00089) [2022-07-09 10:07:12,623][25689] Fps is (10 sec: 5786.0, 60 sec: 5739.5, 300 sec: 5724.9). Total num frames: 206139392. Throughput: 0: 5930.6. Samples: 206147448. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:12,623][25689] Avg episode reward: [(0, '-50.327')] [2022-07-09 10:07:14,267][26022] Updated weights on worker 0-0, policy_version 201316 (0.00087) [2022-07-09 10:07:15,794][26022] Updated weights on worker 0-0, policy_version 201326 (0.00089) [2022-07-09 10:07:17,647][25689] Fps is (10 sec: 5817.1, 60 sec: 5743.3, 300 sec: 5712.3). Total num frames: 206167040. Throughput: 0: 5058.0. Samples: 206164936. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:17,647][25689] Avg episode reward: [(0, '-50.343')] [2022-07-09 10:07:17,673][26022] Updated weights on worker 0-0, policy_version 201336 (0.00096) [2022-07-09 10:07:19,261][26022] Updated weights on worker 0-0, policy_version 201346 (0.00089) [2022-07-09 10:07:21,201][26022] Updated weights on worker 0-0, policy_version 201356 (0.00091) [2022-07-09 10:07:22,655][25689] Fps is (10 sec: 5818.1, 60 sec: 5779.2, 300 sec: 5723.8). Total num frames: 206197760. Throughput: 0: 5919.3. Samples: 206199622. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:22,655][25689] Avg episode reward: [(0, '-49.347')] [2022-07-09 10:07:22,791][26022] Updated weights on worker 0-0, policy_version 201366 (0.00089) [2022-07-09 10:07:24,863][26022] Updated weights on worker 0-0, policy_version 201376 (0.00093) [2022-07-09 10:07:26,344][26022] Updated weights on worker 0-0, policy_version 201386 (0.00091) [2022-07-09 10:07:27,794][25689] Fps is (10 sec: 5752.2, 60 sec: 5740.4, 300 sec: 5715.2). Total num frames: 206225408. Throughput: 0: 6027.6. Samples: 206234476. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:27,794][25689] Avg episode reward: [(0, '-48.404')] [2022-07-09 10:07:28,348][26022] Updated weights on worker 0-0, policy_version 201396 (0.00089) [2022-07-09 10:07:29,906][26022] Updated weights on worker 0-0, policy_version 201406 (0.00084) [2022-07-09 10:07:32,034][26022] Updated weights on worker 0-0, policy_version 201416 (0.00097) [2022-07-09 10:07:32,806][25689] Fps is (10 sec: 5649.1, 60 sec: 5744.2, 300 sec: 5720.0). Total num frames: 206255104. Throughput: 0: 5176.3. Samples: 206251922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:32,806][25689] Avg episode reward: [(0, '-48.615')] [2022-07-09 10:07:33,618][26022] Updated weights on worker 0-0, policy_version 201426 (0.00095) [2022-07-09 10:07:35,344][26022] Updated weights on worker 0-0, policy_version 201436 (0.00092) [2022-07-09 10:07:37,001][26022] Updated weights on worker 0-0, policy_version 201446 (0.00097) [2022-07-09 10:07:37,823][25689] Fps is (10 sec: 5819.9, 60 sec: 5747.9, 300 sec: 5717.6). Total num frames: 206283776. Throughput: 0: 6034.0. Samples: 206286678. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:37,823][25689] Avg episode reward: [(0, '-48.399')] [2022-07-09 10:07:38,968][26022] Updated weights on worker 0-0, policy_version 201456 (0.00084) [2022-07-09 10:07:40,675][26022] Updated weights on worker 0-0, policy_version 201466 (0.00084) [2022-07-09 10:07:42,427][26022] Updated weights on worker 0-0, policy_version 201476 (0.00093) [2022-07-09 10:07:42,918][25689] Fps is (10 sec: 5772.1, 60 sec: 5757.5, 300 sec: 5713.4). Total num frames: 206313472. Throughput: 0: 6003.1. Samples: 206321264. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:42,919][25689] Avg episode reward: [(0, '-47.631')] [2022-07-09 10:07:44,407][26022] Updated weights on worker 0-0, policy_version 201486 (0.00089) [2022-07-09 10:07:46,099][26022] Updated weights on worker 0-0, policy_version 201496 (0.00088) [2022-07-09 10:07:47,633][26022] Updated weights on worker 0-0, policy_version 201506 (0.00091) [2022-07-09 10:07:48,034][25689] Fps is (10 sec: 5916.5, 60 sec: 5752.3, 300 sec: 5718.8). Total num frames: 206344192. Throughput: 0: 5156.5. Samples: 206338846. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:48,035][25689] Avg episode reward: [(0, '-47.851')] [2022-07-09 10:07:49,607][26022] Updated weights on worker 0-0, policy_version 201516 (0.00089) [2022-07-09 10:07:51,062][26022] Updated weights on worker 0-0, policy_version 201526 (0.00095) [2022-07-09 10:07:53,059][25689] Fps is (10 sec: 5755.8, 60 sec: 5750.3, 300 sec: 5715.6). Total num frames: 206371840. Throughput: 0: 6020.7. Samples: 206373858. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:53,059][25689] Avg episode reward: [(0, '-49.026')] [2022-07-09 10:07:53,081][26022] Updated weights on worker 0-0, policy_version 201536 (0.00081) [2022-07-09 10:07:54,720][26022] Updated weights on worker 0-0, policy_version 201546 (0.00622) [2022-07-09 10:07:56,532][26022] Updated weights on worker 0-0, policy_version 201556 (0.00081) [2022-07-09 10:07:58,084][25689] Fps is (10 sec: 5706.1, 60 sec: 5748.3, 300 sec: 5718.6). Total num frames: 206401536. Throughput: 0: 6025.8. Samples: 206408768. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:07:58,084][25689] Avg episode reward: [(0, '-49.983')] [2022-07-09 10:07:58,231][26022] Updated weights on worker 0-0, policy_version 201566 (0.00089) [2022-07-09 10:07:59,951][26022] Updated weights on worker 0-0, policy_version 201576 (0.00091) [2022-07-09 10:08:02,239][26022] Updated weights on worker 0-0, policy_version 201586 (0.00081) [2022-07-09 10:08:03,089][25689] Fps is (10 sec: 5716.8, 60 sec: 5749.5, 300 sec: 5720.5). Total num frames: 206429184. Throughput: 0: 5208.6. Samples: 206426330. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:03,090][25689] Avg episode reward: [(0, '-50.246')] [2022-07-09 10:08:03,995][26022] Updated weights on worker 0-0, policy_version 201596 (0.00082) [2022-07-09 10:08:05,823][26022] Updated weights on worker 0-0, policy_version 201606 (0.00097) [2022-07-09 10:08:07,545][26022] Updated weights on worker 0-0, policy_version 201616 (0.00083) [2022-07-09 10:08:08,144][25689] Fps is (10 sec: 5598.6, 60 sec: 5768.7, 300 sec: 5719.9). Total num frames: 206457856. Throughput: 0: 5962.0. Samples: 206458738. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:08,144][25689] Avg episode reward: [(0, '-50.387')] [2022-07-09 10:08:09,249][26022] Updated weights on worker 0-0, policy_version 201626 (0.00083) [2022-07-09 10:08:11,134][26022] Updated weights on worker 0-0, policy_version 201636 (0.00090) [2022-07-09 10:08:12,865][26022] Updated weights on worker 0-0, policy_version 201646 (0.00082) [2022-07-09 10:08:13,186][25689] Fps is (10 sec: 5781.1, 60 sec: 5748.7, 300 sec: 5723.4). Total num frames: 206487552. Throughput: 0: 5945.7. Samples: 206493528. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:13,186][25689] Avg episode reward: [(0, '-50.317')] [2022-07-09 10:08:14,667][26022] Updated weights on worker 0-0, policy_version 201656 (0.00080) [2022-07-09 10:08:16,261][26022] Updated weights on worker 0-0, policy_version 201666 (0.00086) [2022-07-09 10:08:18,242][25689] Fps is (10 sec: 5678.5, 60 sec: 5745.7, 300 sec: 5719.0). Total num frames: 206515200. Throughput: 0: 5064.4. Samples: 206510856. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:18,243][25689] Avg episode reward: [(0, '-49.852')] [2022-07-09 10:08:18,284][26022] Updated weights on worker 0-0, policy_version 201676 (0.00089) [2022-07-09 10:08:20,048][26022] Updated weights on worker 0-0, policy_version 201686 (0.00081) [2022-07-09 10:08:21,809][26022] Updated weights on worker 0-0, policy_version 201696 (0.00083) [2022-07-09 10:08:23,284][25689] Fps is (10 sec: 5678.7, 60 sec: 5725.6, 300 sec: 5716.2). Total num frames: 206544896. Throughput: 0: 5905.4. Samples: 206545588. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:23,284][25689] Avg episode reward: [(0, '-49.921')] [2022-07-09 10:08:23,535][26022] Updated weights on worker 0-0, policy_version 201706 (0.00094) [2022-07-09 10:08:25,236][26022] Updated weights on worker 0-0, policy_version 201716 (0.00091) [2022-07-09 10:08:27,162][26022] Updated weights on worker 0-0, policy_version 201726 (0.00087) [2022-07-09 10:08:28,354][25689] Fps is (10 sec: 5873.3, 60 sec: 5765.9, 300 sec: 5722.9). Total num frames: 206574592. Throughput: 0: 5997.5. Samples: 206579950. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:28,355][25689] Avg episode reward: [(0, '-49.150')] [2022-07-09 10:08:28,918][26022] Updated weights on worker 0-0, policy_version 201736 (0.00080) [2022-07-09 10:08:30,593][26022] Updated weights on worker 0-0, policy_version 201746 (0.00086) [2022-07-09 10:08:32,570][26022] Updated weights on worker 0-0, policy_version 201756 (0.00099) [2022-07-09 10:08:33,357][25689] Fps is (10 sec: 5794.2, 60 sec: 5749.8, 300 sec: 5721.0). Total num frames: 206603264. Throughput: 0: 5141.3. Samples: 206597236. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:33,358][25689] Avg episode reward: [(0, '-48.588')] [2022-07-09 10:08:34,134][26022] Updated weights on worker 0-0, policy_version 201766 (0.00094) [2022-07-09 10:08:36,073][26022] Updated weights on worker 0-0, policy_version 201776 (0.00092) [2022-07-09 10:08:37,786][26022] Updated weights on worker 0-0, policy_version 201786 (0.00084) [2022-07-09 10:08:38,458][25689] Fps is (10 sec: 5776.9, 60 sec: 5758.8, 300 sec: 5723.0). Total num frames: 206632960. Throughput: 0: 5977.7. Samples: 206631700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:38,458][25689] Avg episode reward: [(0, '-48.971')] [2022-07-09 10:08:39,547][26022] Updated weights on worker 0-0, policy_version 201796 (0.00082) [2022-07-09 10:08:41,375][26022] Updated weights on worker 0-0, policy_version 201806 (0.00086) [2022-07-09 10:08:43,055][26022] Updated weights on worker 0-0, policy_version 201816 (0.00084) [2022-07-09 10:08:43,499][25689] Fps is (10 sec: 5755.3, 60 sec: 5747.0, 300 sec: 5723.2). Total num frames: 206661632. Throughput: 0: 5974.4. Samples: 206666360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:43,499][25689] Avg episode reward: [(0, '-48.648')] [2022-07-09 10:08:44,988][26022] Updated weights on worker 0-0, policy_version 201826 (0.00087) [2022-07-09 10:08:46,598][26022] Updated weights on worker 0-0, policy_version 201836 (0.00086) [2022-07-09 10:08:46,917][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:08:46,926][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000201838_206682112.pth [2022-07-09 10:08:46,927][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000199823_204618752.pth [2022-07-09 10:08:48,370][26022] Updated weights on worker 0-0, policy_version 201846 (0.00089) [2022-07-09 10:08:48,582][25689] Fps is (10 sec: 5764.9, 60 sec: 5733.2, 300 sec: 5725.2). Total num frames: 206691328. Throughput: 0: 5129.7. Samples: 206683716. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:48,583][25689] Avg episode reward: [(0, '-47.572')] [2022-07-09 10:08:50,354][26022] Updated weights on worker 0-0, policy_version 201856 (0.00082) [2022-07-09 10:08:51,907][26022] Updated weights on worker 0-0, policy_version 201866 (0.00098) [2022-07-09 10:08:53,670][25689] Fps is (10 sec: 5638.1, 60 sec: 5727.2, 300 sec: 5720.3). Total num frames: 206718976. Throughput: 0: 5946.9. Samples: 206718032. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 10:08:53,670][25689] Avg episode reward: [(0, '-47.048')] [2022-07-09 10:08:53,931][26022] Updated weights on worker 0-0, policy_version 201876 (0.00083) [2022-07-09 10:08:55,333][26022] Updated weights on worker 0-0, policy_version 201886 (0.00085) [2022-07-09 10:08:57,401][26022] Updated weights on worker 0-0, policy_version 201896 (0.00090) [2022-07-09 10:08:58,696][25689] Fps is (10 sec: 5771.3, 60 sec: 5744.1, 300 sec: 5731.1). Total num frames: 206749696. Throughput: 0: 6002.8. Samples: 206753186. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:08:58,697][25689] Avg episode reward: [(0, '-48.137')] [2022-07-09 10:08:59,068][26022] Updated weights on worker 0-0, policy_version 201906 (0.00093) [2022-07-09 10:09:00,802][26022] Updated weights on worker 0-0, policy_version 201916 (0.00091) [2022-07-09 10:09:02,912][26022] Updated weights on worker 0-0, policy_version 201926 (0.00074) [2022-07-09 10:09:03,715][25689] Fps is (10 sec: 5708.8, 60 sec: 5725.9, 300 sec: 5725.7). Total num frames: 206776320. Throughput: 0: 5159.2. Samples: 206770658. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:03,715][25689] Avg episode reward: [(0, '-48.257')] [2022-07-09 10:09:04,812][26022] Updated weights on worker 0-0, policy_version 201936 (0.00092) [2022-07-09 10:09:06,285][26022] Updated weights on worker 0-0, policy_version 201946 (0.00089) [2022-07-09 10:09:08,306][26022] Updated weights on worker 0-0, policy_version 201956 (0.00444) [2022-07-09 10:09:08,825][25689] Fps is (10 sec: 5459.1, 60 sec: 5720.6, 300 sec: 5730.6). Total num frames: 206804992. Throughput: 0: 5923.4. Samples: 206803622. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:08,826][25689] Avg episode reward: [(0, '-47.679')] [2022-07-09 10:09:09,767][26022] Updated weights on worker 0-0, policy_version 201966 (0.00089) [2022-07-09 10:09:11,766][26022] Updated weights on worker 0-0, policy_version 201976 (0.00090) [2022-07-09 10:09:13,692][26022] Updated weights on worker 0-0, policy_version 201986 (0.00084) [2022-07-09 10:09:13,841][25689] Fps is (10 sec: 5865.0, 60 sec: 5740.0, 300 sec: 5730.7). Total num frames: 206835712. Throughput: 0: 5962.3. Samples: 206838300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:13,842][25689] Avg episode reward: [(0, '-48.770')] [2022-07-09 10:09:15,264][26022] Updated weights on worker 0-0, policy_version 201996 (0.00085) [2022-07-09 10:09:17,128][26022] Updated weights on worker 0-0, policy_version 202006 (0.00089) [2022-07-09 10:09:18,684][26022] Updated weights on worker 0-0, policy_version 202016 (0.00084) [2022-07-09 10:09:18,875][25689] Fps is (10 sec: 5909.8, 60 sec: 5759.0, 300 sec: 5730.6). Total num frames: 206864384. Throughput: 0: 5954.7. Samples: 206873346. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:18,875][25689] Avg episode reward: [(0, '-49.520')] [2022-07-09 10:09:20,720][26022] Updated weights on worker 0-0, policy_version 202026 (0.00086) [2022-07-09 10:09:22,370][26022] Updated weights on worker 0-0, policy_version 202036 (0.00086) [2022-07-09 10:09:23,908][25689] Fps is (10 sec: 5798.3, 60 sec: 5759.9, 300 sec: 5734.3). Total num frames: 206894080. Throughput: 0: 5941.7. Samples: 206890638. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:23,908][25689] Avg episode reward: [(0, '-49.037')] [2022-07-09 10:09:23,915][26022] Updated weights on worker 0-0, policy_version 202046 (0.00088) [2022-07-09 10:09:26,124][26022] Updated weights on worker 0-0, policy_version 202056 (0.00085) [2022-07-09 10:09:27,595][26022] Updated weights on worker 0-0, policy_version 202066 (0.00061) [2022-07-09 10:09:28,952][25689] Fps is (10 sec: 5791.9, 60 sec: 5745.4, 300 sec: 5734.1). Total num frames: 206922752. Throughput: 0: 6036.5. Samples: 206925120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:28,953][25689] Avg episode reward: [(0, '-47.482')] [2022-07-09 10:09:29,633][26022] Updated weights on worker 0-0, policy_version 202076 (0.00088) [2022-07-09 10:09:30,962][26022] Updated weights on worker 0-0, policy_version 202086 (0.00087) [2022-07-09 10:09:32,926][26022] Updated weights on worker 0-0, policy_version 202096 (0.00098) [2022-07-09 10:09:34,018][25689] Fps is (10 sec: 5773.1, 60 sec: 5756.3, 300 sec: 5736.7). Total num frames: 206952448. Throughput: 0: 6038.8. Samples: 206960144. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:34,019][25689] Avg episode reward: [(0, '-47.935')] [2022-07-09 10:09:34,453][26022] Updated weights on worker 0-0, policy_version 202106 (0.00087) [2022-07-09 10:09:36,654][26022] Updated weights on worker 0-0, policy_version 202116 (0.00086) [2022-07-09 10:09:38,339][26022] Updated weights on worker 0-0, policy_version 202126 (0.00092) [2022-07-09 10:09:39,114][25689] Fps is (10 sec: 5744.0, 60 sec: 5739.9, 300 sec: 5738.5). Total num frames: 206981120. Throughput: 0: 5136.6. Samples: 206977306. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:39,114][25689] Avg episode reward: [(0, '-47.561')] [2022-07-09 10:09:40,042][26022] Updated weights on worker 0-0, policy_version 202136 (0.00091) [2022-07-09 10:09:41,940][26022] Updated weights on worker 0-0, policy_version 202146 (0.00088) [2022-07-09 10:09:43,783][26022] Updated weights on worker 0-0, policy_version 202156 (0.00084) [2022-07-09 10:09:44,146][25689] Fps is (10 sec: 5662.2, 60 sec: 5740.8, 300 sec: 5733.3). Total num frames: 207009792. Throughput: 0: 5995.6. Samples: 207011976. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:44,146][25689] Avg episode reward: [(0, '-46.656')] [2022-07-09 10:09:45,376][26022] Updated weights on worker 0-0, policy_version 202166 (0.00081) [2022-07-09 10:09:47,243][26022] Updated weights on worker 0-0, policy_version 202176 (0.00080) [2022-07-09 10:09:48,869][26022] Updated weights on worker 0-0, policy_version 202186 (0.00090) [2022-07-09 10:09:49,215][25689] Fps is (10 sec: 5778.6, 60 sec: 5742.2, 300 sec: 5739.0). Total num frames: 207039488. Throughput: 0: 6007.7. Samples: 207046850. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:49,215][25689] Avg episode reward: [(0, '-47.144')] [2022-07-09 10:09:50,874][26022] Updated weights on worker 0-0, policy_version 202196 (0.00052) [2022-07-09 10:09:52,300][26022] Updated weights on worker 0-0, policy_version 202206 (0.00085) [2022-07-09 10:09:54,288][25689] Fps is (10 sec: 5754.7, 60 sec: 5760.4, 300 sec: 5734.3). Total num frames: 207068160. Throughput: 0: 5140.4. Samples: 207064342. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:54,289][25689] Avg episode reward: [(0, '-47.754')] [2022-07-09 10:09:54,296][26022] Updated weights on worker 0-0, policy_version 202216 (0.00082) [2022-07-09 10:09:55,943][26022] Updated weights on worker 0-0, policy_version 202226 (0.00095) [2022-07-09 10:09:57,953][26022] Updated weights on worker 0-0, policy_version 202236 (0.00094) [2022-07-09 10:09:59,313][25689] Fps is (10 sec: 5779.8, 60 sec: 5743.6, 300 sec: 5741.1). Total num frames: 207097856. Throughput: 0: 6026.8. Samples: 207099044. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:09:59,314][25689] Avg episode reward: [(0, '-48.083')] [2022-07-09 10:09:59,511][26022] Updated weights on worker 0-0, policy_version 202246 (0.00088) [2022-07-09 10:10:01,378][26022] Updated weights on worker 0-0, policy_version 202256 (0.00085) [2022-07-09 10:10:03,353][26022] Updated weights on worker 0-0, policy_version 202266 (0.00089) [2022-07-09 10:10:04,408][25689] Fps is (10 sec: 5666.4, 60 sec: 5753.2, 300 sec: 5738.4). Total num frames: 207125504. Throughput: 0: 5902.9. Samples: 207131584. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:04,409][25689] Avg episode reward: [(0, '-48.543')] [2022-07-09 10:10:05,293][26022] Updated weights on worker 0-0, policy_version 202276 (0.00084) [2022-07-09 10:10:07,096][26022] Updated weights on worker 0-0, policy_version 202286 (0.00077) [2022-07-09 10:10:08,847][26022] Updated weights on worker 0-0, policy_version 202296 (0.00083) [2022-07-09 10:10:09,501][25689] Fps is (10 sec: 5628.7, 60 sec: 5771.8, 300 sec: 5744.0). Total num frames: 207155200. Throughput: 0: 5038.2. Samples: 207149056. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:09,501][25689] Avg episode reward: [(0, '-48.291')] [2022-07-09 10:10:10,383][26022] Updated weights on worker 0-0, policy_version 202306 (0.00085) [2022-07-09 10:10:12,345][26022] Updated weights on worker 0-0, policy_version 202316 (0.00088) [2022-07-09 10:10:14,137][26022] Updated weights on worker 0-0, policy_version 202326 (0.00084) [2022-07-09 10:10:14,522][25689] Fps is (10 sec: 5771.0, 60 sec: 5737.5, 300 sec: 5744.4). Total num frames: 207183872. Throughput: 0: 5892.2. Samples: 207183564. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:14,523][25689] Avg episode reward: [(0, '-47.544')] [2022-07-09 10:10:15,852][26022] Updated weights on worker 0-0, policy_version 202336 (0.00091) [2022-07-09 10:10:17,626][26022] Updated weights on worker 0-0, policy_version 202346 (0.00088) [2022-07-09 10:10:19,390][26022] Updated weights on worker 0-0, policy_version 202356 (0.00087) [2022-07-09 10:10:19,587][25689] Fps is (10 sec: 5787.0, 60 sec: 5751.5, 300 sec: 5747.9). Total num frames: 207213568. Throughput: 0: 5901.5. Samples: 207218690. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:19,587][25689] Avg episode reward: [(0, '-47.761')] [2022-07-09 10:10:21,153][26022] Updated weights on worker 0-0, policy_version 202366 (0.00085) [2022-07-09 10:10:23,029][26022] Updated weights on worker 0-0, policy_version 202376 (0.00086) [2022-07-09 10:10:24,624][25689] Fps is (10 sec: 5879.6, 60 sec: 5751.1, 300 sec: 5748.6). Total num frames: 207243264. Throughput: 0: 5172.6. Samples: 207236150. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:24,624][25689] Avg episode reward: [(0, '-47.713')] [2022-07-09 10:10:24,631][26022] Updated weights on worker 0-0, policy_version 202386 (0.00087) [2022-07-09 10:10:26,514][26022] Updated weights on worker 0-0, policy_version 202396 (0.00086) [2022-07-09 10:10:28,220][26022] Updated weights on worker 0-0, policy_version 202406 (0.00089) [2022-07-09 10:10:29,683][25689] Fps is (10 sec: 5680.0, 60 sec: 5732.9, 300 sec: 5747.6). Total num frames: 207270912. Throughput: 0: 6035.9. Samples: 207270872. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:29,683][25689] Avg episode reward: [(0, '-47.890')] [2022-07-09 10:10:29,910][26022] Updated weights on worker 0-0, policy_version 202416 (0.00088) [2022-07-09 10:10:31,944][26022] Updated weights on worker 0-0, policy_version 202426 (0.00095) [2022-07-09 10:10:33,443][26022] Updated weights on worker 0-0, policy_version 202436 (0.00078) [2022-07-09 10:10:34,717][25689] Fps is (10 sec: 5579.8, 60 sec: 5718.9, 300 sec: 5747.4). Total num frames: 207299584. Throughput: 0: 6039.8. Samples: 207305538. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:34,718][25689] Avg episode reward: [(0, '-48.464')] [2022-07-09 10:10:35,330][26022] Updated weights on worker 0-0, policy_version 202446 (0.00093) [2022-07-09 10:10:37,141][26022] Updated weights on worker 0-0, policy_version 202456 (0.00084) [2022-07-09 10:10:38,873][26022] Updated weights on worker 0-0, policy_version 202466 (0.00086) [2022-07-09 10:10:39,724][25689] Fps is (10 sec: 5710.7, 60 sec: 5727.3, 300 sec: 5744.0). Total num frames: 207328256. Throughput: 0: 5177.9. Samples: 207322960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:39,725][25689] Avg episode reward: [(0, '-48.635')] [2022-07-09 10:10:40,735][26022] Updated weights on worker 0-0, policy_version 202476 (0.00085) [2022-07-09 10:10:42,303][26022] Updated weights on worker 0-0, policy_version 202486 (0.00094) [2022-07-09 10:10:44,200][26022] Updated weights on worker 0-0, policy_version 202496 (0.00076) [2022-07-09 10:10:44,735][25689] Fps is (10 sec: 5928.9, 60 sec: 5763.2, 300 sec: 5749.1). Total num frames: 207358976. Throughput: 0: 6050.3. Samples: 207357828. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:44,735][25689] Avg episode reward: [(0, '-48.185')] [2022-07-09 10:10:45,977][26022] Updated weights on worker 0-0, policy_version 202506 (0.00618) [2022-07-09 10:10:46,951][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:10:46,962][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000202511_207371264.pth [2022-07-09 10:10:46,962][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000200493_205304832.pth [2022-07-09 10:10:47,742][26022] Updated weights on worker 0-0, policy_version 202516 (0.00086) [2022-07-09 10:10:49,603][26022] Updated weights on worker 0-0, policy_version 202526 (0.00085) [2022-07-09 10:10:49,851][25689] Fps is (10 sec: 5865.0, 60 sec: 5741.8, 300 sec: 5747.8). Total num frames: 207387648. Throughput: 0: 6023.2. Samples: 207392348. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:49,851][25689] Avg episode reward: [(0, '-47.790')] [2022-07-09 10:10:51,077][26022] Updated weights on worker 0-0, policy_version 202536 (0.00085) [2022-07-09 10:10:53,072][26022] Updated weights on worker 0-0, policy_version 202546 (0.00078) [2022-07-09 10:10:54,803][26022] Updated weights on worker 0-0, policy_version 202556 (0.00098) [2022-07-09 10:10:54,900][25689] Fps is (10 sec: 5742.0, 60 sec: 5761.0, 300 sec: 5744.3). Total num frames: 207417344. Throughput: 0: 5168.6. Samples: 207409852. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:54,900][25689] Avg episode reward: [(0, '-48.161')] [2022-07-09 10:10:56,538][26022] Updated weights on worker 0-0, policy_version 202566 (0.00090) [2022-07-09 10:10:58,484][26022] Updated weights on worker 0-0, policy_version 202576 (0.00080) [2022-07-09 10:10:59,913][25689] Fps is (10 sec: 5800.6, 60 sec: 5745.2, 300 sec: 5754.8). Total num frames: 207446016. Throughput: 0: 6019.6. Samples: 207444488. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:10:59,914][25689] Avg episode reward: [(0, '-47.428')] [2022-07-09 10:11:00,060][26022] Updated weights on worker 0-0, policy_version 202586 (0.00087) [2022-07-09 10:11:02,529][26022] Updated weights on worker 0-0, policy_version 202596 (0.00084) [2022-07-09 10:11:04,096][26022] Updated weights on worker 0-0, policy_version 202606 (0.00085) [2022-07-09 10:11:04,919][25689] Fps is (10 sec: 5621.4, 60 sec: 5753.7, 300 sec: 5752.9). Total num frames: 207473664. Throughput: 0: 5894.4. Samples: 207476800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:11:04,920][25689] Avg episode reward: [(0, '-47.488')] [2022-07-09 10:11:05,963][26022] Updated weights on worker 0-0, policy_version 202616 (0.00083) [2022-07-09 10:11:07,607][26022] Updated weights on worker 0-0, policy_version 202626 (0.00082) [2022-07-09 10:11:09,526][26022] Updated weights on worker 0-0, policy_version 202636 (0.00098) [2022-07-09 10:11:09,983][25689] Fps is (10 sec: 5491.4, 60 sec: 5722.6, 300 sec: 5738.1). Total num frames: 207501312. Throughput: 0: 5897.6. Samples: 207511078. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 10:11:09,984][25689] Avg episode reward: [(0, '-47.711')] [2022-07-09 10:11:11,446][26022] Updated weights on worker 0-0, policy_version 202646 (0.00085) [2022-07-09 10:11:12,996][26022] Updated weights on worker 0-0, policy_version 202656 (0.00085) [2022-07-09 10:11:14,891][26022] Updated weights on worker 0-0, policy_version 202666 (0.00094) [2022-07-09 10:11:14,987][25689] Fps is (10 sec: 5594.0, 60 sec: 5724.2, 300 sec: 5742.7). Total num frames: 207529984. Throughput: 0: 5912.5. Samples: 207528616. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:14,989][25689] Avg episode reward: [(0, '-48.084')] [2022-07-09 10:11:16,567][26022] Updated weights on worker 0-0, policy_version 202676 (0.00091) [2022-07-09 10:11:18,456][26022] Updated weights on worker 0-0, policy_version 202686 (0.00091) [2022-07-09 10:11:19,967][26022] Updated weights on worker 0-0, policy_version 202696 (0.00089) [2022-07-09 10:11:19,998][25689] Fps is (10 sec: 5929.8, 60 sec: 5746.2, 300 sec: 5749.9). Total num frames: 207560704. Throughput: 0: 5908.0. Samples: 207563152. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:20,000][25689] Avg episode reward: [(0, '-47.402')] [2022-07-09 10:11:21,922][26022] Updated weights on worker 0-0, policy_version 202706 (0.00085) [2022-07-09 10:11:23,669][26022] Updated weights on worker 0-0, policy_version 202716 (0.00082) [2022-07-09 10:11:25,012][25689] Fps is (10 sec: 5822.1, 60 sec: 5714.5, 300 sec: 5744.5). Total num frames: 207588352. Throughput: 0: 6032.2. Samples: 207598008. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:25,012][25689] Avg episode reward: [(0, '-46.713')] [2022-07-09 10:11:25,501][26022] Updated weights on worker 0-0, policy_version 202726 (0.00085) [2022-07-09 10:11:27,207][26022] Updated weights on worker 0-0, policy_version 202736 (0.00088) [2022-07-09 10:11:29,061][26022] Updated weights on worker 0-0, policy_version 202746 (0.00091) [2022-07-09 10:11:30,075][25689] Fps is (10 sec: 5691.0, 60 sec: 5748.1, 300 sec: 5744.3). Total num frames: 207618048. Throughput: 0: 5186.6. Samples: 207615288. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:30,075][25689] Avg episode reward: [(0, '-47.735')] [2022-07-09 10:11:30,703][26022] Updated weights on worker 0-0, policy_version 202756 (0.00080) [2022-07-09 10:11:32,571][26022] Updated weights on worker 0-0, policy_version 202766 (0.00084) [2022-07-09 10:11:34,379][26022] Updated weights on worker 0-0, policy_version 202776 (0.00086) [2022-07-09 10:11:35,080][25689] Fps is (10 sec: 5695.8, 60 sec: 5733.9, 300 sec: 5741.8). Total num frames: 207645696. Throughput: 0: 6033.3. Samples: 207649844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:35,080][25689] Avg episode reward: [(0, '-47.391')] [2022-07-09 10:11:36,004][26022] Updated weights on worker 0-0, policy_version 202786 (0.00084) [2022-07-09 10:11:38,012][26022] Updated weights on worker 0-0, policy_version 202796 (0.00092) [2022-07-09 10:11:39,639][26022] Updated weights on worker 0-0, policy_version 202806 (0.00114) [2022-07-09 10:11:40,085][25689] Fps is (10 sec: 5831.0, 60 sec: 5768.0, 300 sec: 5749.0). Total num frames: 207676416. Throughput: 0: 6039.2. Samples: 207684456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:40,085][25689] Avg episode reward: [(0, '-47.690')] [2022-07-09 10:11:41,577][26022] Updated weights on worker 0-0, policy_version 202816 (0.00084) [2022-07-09 10:11:43,146][26022] Updated weights on worker 0-0, policy_version 202826 (0.00085) [2022-07-09 10:11:44,945][26022] Updated weights on worker 0-0, policy_version 202836 (0.00092) [2022-07-09 10:11:45,107][25689] Fps is (10 sec: 5821.2, 60 sec: 5716.0, 300 sec: 5739.4). Total num frames: 207704064. Throughput: 0: 5167.4. Samples: 207701844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:45,107][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 10:11:46,873][26022] Updated weights on worker 0-0, policy_version 202846 (0.00091) [2022-07-09 10:11:48,759][26022] Updated weights on worker 0-0, policy_version 202856 (0.00090) [2022-07-09 10:11:50,245][25689] Fps is (10 sec: 5543.1, 60 sec: 5713.9, 300 sec: 5740.2). Total num frames: 207732736. Throughput: 0: 5992.7. Samples: 207736164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:50,246][25689] Avg episode reward: [(0, '-47.559')] [2022-07-09 10:11:50,488][26022] Updated weights on worker 0-0, policy_version 202866 (0.00082) [2022-07-09 10:11:52,143][26022] Updated weights on worker 0-0, policy_version 202876 (0.00094) [2022-07-09 10:11:54,032][26022] Updated weights on worker 0-0, policy_version 202886 (0.00263) [2022-07-09 10:11:55,319][25689] Fps is (10 sec: 5715.1, 60 sec: 5711.5, 300 sec: 5738.9). Total num frames: 207762432. Throughput: 0: 5980.9. Samples: 207770896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:11:55,320][25689] Avg episode reward: [(0, '-47.878')] [2022-07-09 10:11:55,705][26022] Updated weights on worker 0-0, policy_version 202896 (0.00507) [2022-07-09 10:11:57,571][26022] Updated weights on worker 0-0, policy_version 202906 (0.00082) [2022-07-09 10:11:59,003][26022] Updated weights on worker 0-0, policy_version 202916 (0.00091) [2022-07-09 10:12:00,369][25689] Fps is (10 sec: 5866.7, 60 sec: 5725.1, 300 sec: 5745.2). Total num frames: 207792128. Throughput: 0: 5129.7. Samples: 207788508. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:00,369][25689] Avg episode reward: [(0, '-47.950')] [2022-07-09 10:12:01,211][26022] Updated weights on worker 0-0, policy_version 202926 (0.00068) [2022-07-09 10:12:03,116][26022] Updated weights on worker 0-0, policy_version 202936 (0.00087) [2022-07-09 10:12:05,028][26022] Updated weights on worker 0-0, policy_version 202946 (0.00090) [2022-07-09 10:12:05,436][25689] Fps is (10 sec: 5567.3, 60 sec: 5702.3, 300 sec: 5741.9). Total num frames: 207818752. Throughput: 0: 5862.4. Samples: 207821022. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:05,436][25689] Avg episode reward: [(0, '-47.172')] [2022-07-09 10:12:06,584][26022] Updated weights on worker 0-0, policy_version 202956 (0.00092) [2022-07-09 10:12:08,487][26022] Updated weights on worker 0-0, policy_version 202966 (0.00086) [2022-07-09 10:12:10,403][26022] Updated weights on worker 0-0, policy_version 202976 (0.00085) [2022-07-09 10:12:10,560][25689] Fps is (10 sec: 5425.6, 60 sec: 5713.5, 300 sec: 5732.8). Total num frames: 207847424. Throughput: 0: 5884.8. Samples: 207855716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:10,561][25689] Avg episode reward: [(0, '-48.822')] [2022-07-09 10:12:12,042][26022] Updated weights on worker 0-0, policy_version 202986 (0.00081) [2022-07-09 10:12:13,796][26022] Updated weights on worker 0-0, policy_version 202996 (0.00085) [2022-07-09 10:12:15,636][25689] Fps is (10 sec: 5722.1, 60 sec: 5723.6, 300 sec: 5738.7). Total num frames: 207877120. Throughput: 0: 5033.0. Samples: 207873150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:15,637][25689] Avg episode reward: [(0, '-50.073')] [2022-07-09 10:12:15,673][26022] Updated weights on worker 0-0, policy_version 203006 (0.00096) [2022-07-09 10:12:17,387][26022] Updated weights on worker 0-0, policy_version 203016 (0.00088) [2022-07-09 10:12:19,447][26022] Updated weights on worker 0-0, policy_version 203026 (0.00092) [2022-07-09 10:12:20,668][25689] Fps is (10 sec: 5875.8, 60 sec: 5704.9, 300 sec: 5734.8). Total num frames: 207906816. Throughput: 0: 5855.4. Samples: 207907370. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:20,669][25689] Avg episode reward: [(0, '-48.654')] [2022-07-09 10:12:20,871][26022] Updated weights on worker 0-0, policy_version 203036 (0.00093) [2022-07-09 10:12:22,869][26022] Updated weights on worker 0-0, policy_version 203046 (0.00089) [2022-07-09 10:12:24,665][26022] Updated weights on worker 0-0, policy_version 203056 (0.00084) [2022-07-09 10:12:25,672][25689] Fps is (10 sec: 5713.8, 60 sec: 5705.7, 300 sec: 5737.4). Total num frames: 207934464. Throughput: 0: 5993.0. Samples: 207942300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:25,673][25689] Avg episode reward: [(0, '-48.604')] [2022-07-09 10:12:26,117][26022] Updated weights on worker 0-0, policy_version 203066 (0.00088) [2022-07-09 10:12:28,271][26022] Updated weights on worker 0-0, policy_version 203076 (0.00615) [2022-07-09 10:12:29,810][26022] Updated weights on worker 0-0, policy_version 203086 (0.00093) [2022-07-09 10:12:30,723][25689] Fps is (10 sec: 5601.5, 60 sec: 5690.0, 300 sec: 5733.2). Total num frames: 207963136. Throughput: 0: 5149.1. Samples: 207959534. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:30,723][25689] Avg episode reward: [(0, '-48.338')] [2022-07-09 10:12:31,762][26022] Updated weights on worker 0-0, policy_version 203096 (0.00080) [2022-07-09 10:12:33,526][26022] Updated weights on worker 0-0, policy_version 203106 (0.00087) [2022-07-09 10:12:35,056][26022] Updated weights on worker 0-0, policy_version 203116 (0.00086) [2022-07-09 10:12:35,795][25689] Fps is (10 sec: 5867.2, 60 sec: 5734.3, 300 sec: 5739.1). Total num frames: 207993856. Throughput: 0: 6009.2. Samples: 207994288. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:35,797][25689] Avg episode reward: [(0, '-48.316')] [2022-07-09 10:12:37,115][26022] Updated weights on worker 0-0, policy_version 203126 (0.00089) [2022-07-09 10:12:38,597][26022] Updated weights on worker 0-0, policy_version 203136 (0.00092) [2022-07-09 10:12:40,595][26022] Updated weights on worker 0-0, policy_version 203146 (0.00085) [2022-07-09 10:12:40,863][25689] Fps is (10 sec: 5856.9, 60 sec: 5694.7, 300 sec: 5736.1). Total num frames: 208022528. Throughput: 0: 6007.8. Samples: 208028696. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:40,865][25689] Avg episode reward: [(0, '-46.089')] [2022-07-09 10:12:42,227][26022] Updated weights on worker 0-0, policy_version 203156 (0.00096) [2022-07-09 10:12:44,091][26022] Updated weights on worker 0-0, policy_version 203166 (0.00083) [2022-07-09 10:12:45,882][25689] Fps is (10 sec: 5684.9, 60 sec: 5711.8, 300 sec: 5731.1). Total num frames: 208051200. Throughput: 0: 5133.3. Samples: 208046044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:45,883][25689] Avg episode reward: [(0, '-46.378')] [2022-07-09 10:12:45,891][26022] Updated weights on worker 0-0, policy_version 203176 (0.00087) [2022-07-09 10:12:47,188][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:12:47,199][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000203182_208058368.pth [2022-07-09 10:12:47,200][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000201164_205991936.pth [2022-07-09 10:12:47,703][26022] Updated weights on worker 0-0, policy_version 203186 (0.00077) [2022-07-09 10:12:49,326][26022] Updated weights on worker 0-0, policy_version 203196 (0.00084) [2022-07-09 10:12:50,925][25689] Fps is (10 sec: 5801.1, 60 sec: 5737.7, 300 sec: 5737.7). Total num frames: 208080896. Throughput: 0: 5989.2. Samples: 208080528. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:50,925][25689] Avg episode reward: [(0, '-47.822')] [2022-07-09 10:12:51,250][26022] Updated weights on worker 0-0, policy_version 203206 (0.00080) [2022-07-09 10:12:52,891][26022] Updated weights on worker 0-0, policy_version 203216 (0.00083) [2022-07-09 10:12:54,833][26022] Updated weights on worker 0-0, policy_version 203226 (0.00089) [2022-07-09 10:12:55,945][25689] Fps is (10 sec: 5901.9, 60 sec: 5742.8, 300 sec: 5737.8). Total num frames: 208110592. Throughput: 0: 6011.6. Samples: 208115424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:12:55,946][25689] Avg episode reward: [(0, '-47.829')] [2022-07-09 10:12:56,423][26022] Updated weights on worker 0-0, policy_version 203236 (0.00079) [2022-07-09 10:12:58,433][26022] Updated weights on worker 0-0, policy_version 203246 (0.00089) [2022-07-09 10:12:59,856][26022] Updated weights on worker 0-0, policy_version 203256 (0.00107) [2022-07-09 10:13:00,962][25689] Fps is (10 sec: 5815.0, 60 sec: 5728.9, 300 sec: 5741.0). Total num frames: 208139264. Throughput: 0: 5180.7. Samples: 208132822. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:13:00,963][25689] Avg episode reward: [(0, '-48.068')] [2022-07-09 10:13:02,252][26022] Updated weights on worker 0-0, policy_version 203266 (0.00085) [2022-07-09 10:13:03,884][26022] Updated weights on worker 0-0, policy_version 203276 (0.00086) [2022-07-09 10:13:05,741][26022] Updated weights on worker 0-0, policy_version 203286 (0.00084) [2022-07-09 10:13:06,050][25689] Fps is (10 sec: 5573.6, 60 sec: 5743.9, 300 sec: 5736.9). Total num frames: 208166912. Throughput: 0: 5936.2. Samples: 208165766. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:13:06,051][25689] Avg episode reward: [(0, '-48.228')] [2022-07-09 10:13:07,347][26022] Updated weights on worker 0-0, policy_version 203296 (0.00084) [2022-07-09 10:13:09,287][26022] Updated weights on worker 0-0, policy_version 203306 (0.00084) [2022-07-09 10:13:10,936][26022] Updated weights on worker 0-0, policy_version 203316 (0.00087) [2022-07-09 10:13:11,142][25689] Fps is (10 sec: 5532.5, 60 sec: 5747.0, 300 sec: 5732.5). Total num frames: 208195584. Throughput: 0: 5938.9. Samples: 208200598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:13:11,143][25689] Avg episode reward: [(0, '-49.663')] [2022-07-09 10:13:12,683][26022] Updated weights on worker 0-0, policy_version 203326 (0.00083) [2022-07-09 10:13:14,567][26022] Updated weights on worker 0-0, policy_version 203336 (0.00086) [2022-07-09 10:13:16,141][26022] Updated weights on worker 0-0, policy_version 203346 (0.00090) [2022-07-09 10:13:16,240][25689] Fps is (10 sec: 5828.5, 60 sec: 5761.7, 300 sec: 5742.0). Total num frames: 208226304. Throughput: 0: 5062.2. Samples: 208218160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:13:16,241][25689] Avg episode reward: [(0, '-49.471')] [2022-07-09 10:13:18,125][26022] Updated weights on worker 0-0, policy_version 203356 (0.00082) [2022-07-09 10:13:19,995][26022] Updated weights on worker 0-0, policy_version 203366 (0.00086) [2022-07-09 10:13:21,308][25689] Fps is (10 sec: 5741.6, 60 sec: 5724.6, 300 sec: 5734.6). Total num frames: 208253952. Throughput: 0: 5898.2. Samples: 208252826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 10:13:21,308][25689] Avg episode reward: [(0, '-49.343')] [2022-07-09 10:13:21,584][26022] Updated weights on worker 0-0, policy_version 203376 (0.00086) [2022-07-09 10:13:23,447][26022] Updated weights on worker 0-0, policy_version 203386 (0.00085) [2022-07-09 10:13:25,082][26022] Updated weights on worker 0-0, policy_version 203396 (0.00085) [2022-07-09 10:13:26,334][25689] Fps is (10 sec: 5579.6, 60 sec: 5739.4, 300 sec: 5732.0). Total num frames: 208282624. Throughput: 0: 6006.8. Samples: 208287608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:26,335][25689] Avg episode reward: [(0, '-50.025')] [2022-07-09 10:13:26,962][26022] Updated weights on worker 0-0, policy_version 203406 (0.00097) [2022-07-09 10:13:28,668][26022] Updated weights on worker 0-0, policy_version 203416 (0.00090) [2022-07-09 10:13:30,408][26022] Updated weights on worker 0-0, policy_version 203426 (0.00092) [2022-07-09 10:13:31,367][25689] Fps is (10 sec: 5904.5, 60 sec: 5774.8, 300 sec: 5738.3). Total num frames: 208313344. Throughput: 0: 6002.3. Samples: 208321992. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:31,367][25689] Avg episode reward: [(0, '-50.667')] [2022-07-09 10:13:32,266][26022] Updated weights on worker 0-0, policy_version 203436 (0.00090) [2022-07-09 10:13:33,901][26022] Updated weights on worker 0-0, policy_version 203446 (0.00089) [2022-07-09 10:13:35,964][26022] Updated weights on worker 0-0, policy_version 203456 (0.00090) [2022-07-09 10:13:36,373][25689] Fps is (10 sec: 5814.1, 60 sec: 5730.4, 300 sec: 5733.3). Total num frames: 208340992. Throughput: 0: 6016.0. Samples: 208339280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:36,374][25689] Avg episode reward: [(0, '-49.751')] [2022-07-09 10:13:37,553][26022] Updated weights on worker 0-0, policy_version 203466 (0.00084) [2022-07-09 10:13:39,357][26022] Updated weights on worker 0-0, policy_version 203476 (0.00080) [2022-07-09 10:13:41,299][26022] Updated weights on worker 0-0, policy_version 203486 (0.00087) [2022-07-09 10:13:41,417][25689] Fps is (10 sec: 5705.5, 60 sec: 5749.6, 300 sec: 5736.7). Total num frames: 208370688. Throughput: 0: 6021.9. Samples: 208373922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:41,418][25689] Avg episode reward: [(0, '-49.795')] [2022-07-09 10:13:42,908][26022] Updated weights on worker 0-0, policy_version 203496 (0.00088) [2022-07-09 10:13:44,683][26022] Updated weights on worker 0-0, policy_version 203506 (0.00083) [2022-07-09 10:13:46,421][25689] Fps is (10 sec: 5706.9, 60 sec: 5734.1, 300 sec: 5731.3). Total num frames: 208398336. Throughput: 0: 6035.1. Samples: 208408836. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:46,422][25689] Avg episode reward: [(0, '-49.494')] [2022-07-09 10:13:46,649][26022] Updated weights on worker 0-0, policy_version 203516 (0.00090) [2022-07-09 10:13:48,038][26022] Updated weights on worker 0-0, policy_version 203526 (0.00082) [2022-07-09 10:13:50,247][26022] Updated weights on worker 0-0, policy_version 203536 (0.00081) [2022-07-09 10:13:51,477][25689] Fps is (10 sec: 5903.8, 60 sec: 5766.6, 300 sec: 5745.7). Total num frames: 208430080. Throughput: 0: 5173.7. Samples: 208426038. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:51,478][25689] Avg episode reward: [(0, '-49.618')] [2022-07-09 10:13:51,555][26022] Updated weights on worker 0-0, policy_version 203546 (0.00084) [2022-07-09 10:13:53,607][26022] Updated weights on worker 0-0, policy_version 203556 (0.00086) [2022-07-09 10:13:55,328][26022] Updated weights on worker 0-0, policy_version 203566 (0.00083) [2022-07-09 10:13:56,497][25689] Fps is (10 sec: 5792.7, 60 sec: 5715.9, 300 sec: 5732.0). Total num frames: 208456704. Throughput: 0: 6035.7. Samples: 208460744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:13:56,499][25689] Avg episode reward: [(0, '-47.947')] [2022-07-09 10:13:57,071][26022] Updated weights on worker 0-0, policy_version 203576 (0.00082) [2022-07-09 10:13:58,935][26022] Updated weights on worker 0-0, policy_version 203586 (0.00109) [2022-07-09 10:14:00,640][26022] Updated weights on worker 0-0, policy_version 203596 (0.00625) [2022-07-09 10:14:01,501][25689] Fps is (10 sec: 5618.4, 60 sec: 5734.1, 300 sec: 5742.6). Total num frames: 208486400. Throughput: 0: 6060.7. Samples: 208495646. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:01,503][25689] Avg episode reward: [(0, '-47.624')] [2022-07-09 10:14:02,728][26022] Updated weights on worker 0-0, policy_version 203606 (0.00088) [2022-07-09 10:14:04,609][26022] Updated weights on worker 0-0, policy_version 203616 (0.00093) [2022-07-09 10:14:06,195][26022] Updated weights on worker 0-0, policy_version 203626 (0.00090) [2022-07-09 10:14:06,515][25689] Fps is (10 sec: 5724.4, 60 sec: 5741.1, 300 sec: 5741.1). Total num frames: 208514048. Throughput: 0: 5090.6. Samples: 208511124. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:06,520][25689] Avg episode reward: [(0, '-48.266')] [2022-07-09 10:14:07,990][26022] Updated weights on worker 0-0, policy_version 203636 (0.00087) [2022-07-09 10:14:09,900][26022] Updated weights on worker 0-0, policy_version 203646 (0.00092) [2022-07-09 10:14:11,570][26022] Updated weights on worker 0-0, policy_version 203656 (0.00085) [2022-07-09 10:14:11,575][25689] Fps is (10 sec: 5590.9, 60 sec: 5744.2, 300 sec: 5733.3). Total num frames: 208542720. Throughput: 0: 5965.0. Samples: 208545920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:11,575][25689] Avg episode reward: [(0, '-47.402')] [2022-07-09 10:14:13,322][26022] Updated weights on worker 0-0, policy_version 203666 (0.00088) [2022-07-09 10:14:15,153][26022] Updated weights on worker 0-0, policy_version 203676 (0.00089) [2022-07-09 10:14:16,591][25689] Fps is (10 sec: 5792.8, 60 sec: 5735.0, 300 sec: 5737.1). Total num frames: 208572416. Throughput: 0: 5970.5. Samples: 208580712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:16,591][25689] Avg episode reward: [(0, '-46.975')] [2022-07-09 10:14:16,900][26022] Updated weights on worker 0-0, policy_version 203686 (0.00090) [2022-07-09 10:14:18,735][26022] Updated weights on worker 0-0, policy_version 203696 (0.00094) [2022-07-09 10:14:20,296][26022] Updated weights on worker 0-0, policy_version 203706 (0.00091) [2022-07-09 10:14:21,595][25689] Fps is (10 sec: 5825.0, 60 sec: 5758.0, 300 sec: 5734.2). Total num frames: 208601088. Throughput: 0: 5100.7. Samples: 208598138. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:21,596][25689] Avg episode reward: [(0, '-46.899')] [2022-07-09 10:14:22,249][26022] Updated weights on worker 0-0, policy_version 203716 (0.00088) [2022-07-09 10:14:24,054][26022] Updated weights on worker 0-0, policy_version 203726 (0.00090) [2022-07-09 10:14:25,775][26022] Updated weights on worker 0-0, policy_version 203736 (0.00091) [2022-07-09 10:14:26,608][25689] Fps is (10 sec: 5724.7, 60 sec: 5759.3, 300 sec: 5734.8). Total num frames: 208629760. Throughput: 0: 6059.8. Samples: 208632882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:26,608][25689] Avg episode reward: [(0, '-47.852')] [2022-07-09 10:14:27,485][26022] Updated weights on worker 0-0, policy_version 203746 (0.00085) [2022-07-09 10:14:29,342][26022] Updated weights on worker 0-0, policy_version 203756 (0.00086) [2022-07-09 10:14:31,004][26022] Updated weights on worker 0-0, policy_version 203766 (0.00082) [2022-07-09 10:14:31,664][25689] Fps is (10 sec: 5797.0, 60 sec: 5740.1, 300 sec: 5735.0). Total num frames: 208659456. Throughput: 0: 6052.1. Samples: 208667500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:31,664][25689] Avg episode reward: [(0, '-47.717')] [2022-07-09 10:14:32,984][26022] Updated weights on worker 0-0, policy_version 203776 (0.00049) [2022-07-09 10:14:34,544][26022] Updated weights on worker 0-0, policy_version 203786 (0.00080) [2022-07-09 10:14:36,407][26022] Updated weights on worker 0-0, policy_version 203796 (0.00089) [2022-07-09 10:14:36,675][25689] Fps is (10 sec: 5797.8, 60 sec: 5756.6, 300 sec: 5736.7). Total num frames: 208688128. Throughput: 0: 5197.9. Samples: 208685108. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:36,676][25689] Avg episode reward: [(0, '-47.900')] [2022-07-09 10:14:38,029][26022] Updated weights on worker 0-0, policy_version 203806 (0.00089) [2022-07-09 10:14:39,955][26022] Updated weights on worker 0-0, policy_version 203816 (0.00083) [2022-07-09 10:14:41,678][25689] Fps is (10 sec: 5828.3, 60 sec: 5760.5, 300 sec: 5740.7). Total num frames: 208717824. Throughput: 0: 6072.7. Samples: 208720098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:41,680][25689] Avg episode reward: [(0, '-47.746')] [2022-07-09 10:14:41,684][26022] Updated weights on worker 0-0, policy_version 203826 (0.00088) [2022-07-09 10:14:43,503][26022] Updated weights on worker 0-0, policy_version 203836 (0.00092) [2022-07-09 10:14:45,356][26022] Updated weights on worker 0-0, policy_version 203846 (0.00118) [2022-07-09 10:14:46,686][25689] Fps is (10 sec: 5932.5, 60 sec: 5794.1, 300 sec: 5741.8). Total num frames: 208747520. Throughput: 0: 6096.3. Samples: 208755288. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:46,687][25689] Avg episode reward: [(0, '-47.915')] [2022-07-09 10:14:46,847][26022] Updated weights on worker 0-0, policy_version 203856 (0.00094) [2022-07-09 10:14:47,238][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:14:47,250][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000203858_208750592.pth [2022-07-09 10:14:47,250][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000201838_206682112.pth [2022-07-09 10:14:48,726][26022] Updated weights on worker 0-0, policy_version 203866 (0.00082) [2022-07-09 10:14:50,515][26022] Updated weights on worker 0-0, policy_version 203876 (0.00086) [2022-07-09 10:14:51,738][25689] Fps is (10 sec: 5802.3, 60 sec: 5743.6, 300 sec: 5742.3). Total num frames: 208776192. Throughput: 0: 5228.5. Samples: 208772458. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:51,740][25689] Avg episode reward: [(0, '-46.763')] [2022-07-09 10:14:52,197][26022] Updated weights on worker 0-0, policy_version 203886 (0.00086) [2022-07-09 10:14:54,063][26022] Updated weights on worker 0-0, policy_version 203896 (0.00084) [2022-07-09 10:14:55,533][26022] Updated weights on worker 0-0, policy_version 203906 (0.00081) [2022-07-09 10:14:56,743][25689] Fps is (10 sec: 5702.2, 60 sec: 5779.0, 300 sec: 5739.2). Total num frames: 208804864. Throughput: 0: 6105.7. Samples: 208807636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:14:56,743][25689] Avg episode reward: [(0, '-48.162')] [2022-07-09 10:14:57,546][26022] Updated weights on worker 0-0, policy_version 203916 (0.00086) [2022-07-09 10:14:59,148][26022] Updated weights on worker 0-0, policy_version 203926 (0.00089) [2022-07-09 10:15:01,010][26022] Updated weights on worker 0-0, policy_version 203936 (0.00095) [2022-07-09 10:15:01,763][25689] Fps is (10 sec: 5720.1, 60 sec: 5760.5, 300 sec: 5744.1). Total num frames: 208833536. Throughput: 0: 6076.0. Samples: 208842132. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:01,765][25689] Avg episode reward: [(0, '-48.472')] [2022-07-09 10:15:03,310][26022] Updated weights on worker 0-0, policy_version 203946 (0.00087) [2022-07-09 10:15:04,952][26022] Updated weights on worker 0-0, policy_version 203956 (0.00092) [2022-07-09 10:15:06,724][26022] Updated weights on worker 0-0, policy_version 203966 (0.00079) [2022-07-09 10:15:06,810][25689] Fps is (10 sec: 5594.4, 60 sec: 5757.3, 300 sec: 5738.1). Total num frames: 208861184. Throughput: 0: 5070.1. Samples: 208857316. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:06,812][25689] Avg episode reward: [(0, '-48.127')] [2022-07-09 10:15:08,476][26022] Updated weights on worker 0-0, policy_version 203976 (0.00094) [2022-07-09 10:15:10,330][26022] Updated weights on worker 0-0, policy_version 203986 (0.00093) [2022-07-09 10:15:11,874][25689] Fps is (10 sec: 5569.9, 60 sec: 5756.9, 300 sec: 5737.3). Total num frames: 208889856. Throughput: 0: 5929.2. Samples: 208891852. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:11,875][25689] Avg episode reward: [(0, '-48.814')] [2022-07-09 10:15:12,134][26022] Updated weights on worker 0-0, policy_version 203996 (0.00083) [2022-07-09 10:15:13,776][26022] Updated weights on worker 0-0, policy_version 204006 (0.00088) [2022-07-09 10:15:15,486][26022] Updated weights on worker 0-0, policy_version 204016 (0.00086) [2022-07-09 10:15:16,908][25689] Fps is (10 sec: 5779.8, 60 sec: 5755.1, 300 sec: 5737.8). Total num frames: 208919552. Throughput: 0: 5914.3. Samples: 208926902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:16,909][25689] Avg episode reward: [(0, '-49.320')] [2022-07-09 10:15:17,369][26022] Updated weights on worker 0-0, policy_version 204026 (0.00087) [2022-07-09 10:15:19,363][26022] Updated weights on worker 0-0, policy_version 204036 (0.00084) [2022-07-09 10:15:21,078][26022] Updated weights on worker 0-0, policy_version 204046 (0.00088) [2022-07-09 10:15:21,939][25689] Fps is (10 sec: 5697.7, 60 sec: 5735.7, 300 sec: 5731.1). Total num frames: 208947200. Throughput: 0: 5036.9. Samples: 208943756. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:21,940][25689] Avg episode reward: [(0, '-49.093')] [2022-07-09 10:15:22,735][26022] Updated weights on worker 0-0, policy_version 204056 (0.00081) [2022-07-09 10:15:24,526][26022] Updated weights on worker 0-0, policy_version 204066 (0.00081) [2022-07-09 10:15:26,087][26022] Updated weights on worker 0-0, policy_version 204076 (0.00081) [2022-07-09 10:15:26,950][25689] Fps is (10 sec: 5812.7, 60 sec: 5769.8, 300 sec: 5742.3). Total num frames: 208977920. Throughput: 0: 6014.5. Samples: 208978448. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:26,951][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 10:15:28,293][26022] Updated weights on worker 0-0, policy_version 204086 (0.00086) [2022-07-09 10:15:29,802][26022] Updated weights on worker 0-0, policy_version 204096 (0.00500) [2022-07-09 10:15:31,614][26022] Updated weights on worker 0-0, policy_version 204106 (0.00049) [2022-07-09 10:15:32,035][25689] Fps is (10 sec: 5882.7, 60 sec: 5750.0, 300 sec: 5741.4). Total num frames: 209006592. Throughput: 0: 6019.5. Samples: 209013208. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:32,035][25689] Avg episode reward: [(0, '-48.496')] [2022-07-09 10:15:33,319][26022] Updated weights on worker 0-0, policy_version 204116 (0.00082) [2022-07-09 10:15:35,077][26022] Updated weights on worker 0-0, policy_version 204126 (0.00083) [2022-07-09 10:15:36,909][26022] Updated weights on worker 0-0, policy_version 204136 (0.00087) [2022-07-09 10:15:37,040][25689] Fps is (10 sec: 5784.7, 60 sec: 5767.6, 300 sec: 5744.8). Total num frames: 209036288. Throughput: 0: 5169.7. Samples: 209030976. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:15:37,040][25689] Avg episode reward: [(0, '-47.856')] [2022-07-09 10:15:38,725][26022] Updated weights on worker 0-0, policy_version 204146 (0.00084) [2022-07-09 10:15:40,389][26022] Updated weights on worker 0-0, policy_version 204156 (0.00081) [2022-07-09 10:15:42,049][25689] Fps is (10 sec: 5828.3, 60 sec: 5750.0, 300 sec: 5738.0). Total num frames: 209064960. Throughput: 0: 6073.0. Samples: 209065888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:15:42,050][25689] Avg episode reward: [(0, '-48.371')] [2022-07-09 10:15:42,167][26022] Updated weights on worker 0-0, policy_version 204166 (0.00080) [2022-07-09 10:15:43,921][26022] Updated weights on worker 0-0, policy_version 204176 (0.00085) [2022-07-09 10:15:45,645][26022] Updated weights on worker 0-0, policy_version 204186 (0.00086) [2022-07-09 10:15:47,080][25689] Fps is (10 sec: 5711.4, 60 sec: 5730.9, 300 sec: 5739.6). Total num frames: 209093632. Throughput: 0: 6074.5. Samples: 209100730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:15:47,081][25689] Avg episode reward: [(0, '-48.095')] [2022-07-09 10:15:47,449][26022] Updated weights on worker 0-0, policy_version 204196 (0.00082) [2022-07-09 10:15:49,423][26022] Updated weights on worker 0-0, policy_version 204206 (0.00085) [2022-07-09 10:15:51,045][26022] Updated weights on worker 0-0, policy_version 204216 (0.00091) [2022-07-09 10:15:52,143][25689] Fps is (10 sec: 5782.8, 60 sec: 5746.8, 300 sec: 5739.4). Total num frames: 209123328. Throughput: 0: 5195.4. Samples: 209117676. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:15:52,143][25689] Avg episode reward: [(0, '-48.807')] [2022-07-09 10:15:52,995][26022] Updated weights on worker 0-0, policy_version 204226 (0.00081) [2022-07-09 10:15:54,534][26022] Updated weights on worker 0-0, policy_version 204236 (0.00103) [2022-07-09 10:15:56,642][26022] Updated weights on worker 0-0, policy_version 204246 (0.00092) [2022-07-09 10:15:57,159][25689] Fps is (10 sec: 5791.3, 60 sec: 5745.8, 300 sec: 5739.3). Total num frames: 209152000. Throughput: 0: 6024.3. Samples: 209152178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:15:57,159][25689] Avg episode reward: [(0, '-49.949')] [2022-07-09 10:15:58,123][26022] Updated weights on worker 0-0, policy_version 204256 (0.00084) [2022-07-09 10:15:59,985][26022] Updated weights on worker 0-0, policy_version 204266 (0.00085) [2022-07-09 10:16:01,946][26022] Updated weights on worker 0-0, policy_version 204276 (0.00096) [2022-07-09 10:16:02,194][25689] Fps is (10 sec: 5603.0, 60 sec: 5727.3, 300 sec: 5738.7). Total num frames: 209179648. Throughput: 0: 5994.6. Samples: 209186650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:02,195][25689] Avg episode reward: [(0, '-50.826')] [2022-07-09 10:16:03,923][26022] Updated weights on worker 0-0, policy_version 204286 (0.00088) [2022-07-09 10:16:05,614][26022] Updated weights on worker 0-0, policy_version 204296 (0.00090) [2022-07-09 10:16:07,214][25689] Fps is (10 sec: 5499.1, 60 sec: 5729.9, 300 sec: 5739.6). Total num frames: 209207296. Throughput: 0: 5038.3. Samples: 209202172. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:07,215][25689] Avg episode reward: [(0, '-51.305')] [2022-07-09 10:16:07,471][26022] Updated weights on worker 0-0, policy_version 204306 (0.00090) [2022-07-09 10:16:09,091][26022] Updated weights on worker 0-0, policy_version 204316 (0.00091) [2022-07-09 10:16:11,119][26022] Updated weights on worker 0-0, policy_version 204326 (0.00090) [2022-07-09 10:16:12,318][25689] Fps is (10 sec: 5664.4, 60 sec: 5743.1, 300 sec: 5741.1). Total num frames: 209236992. Throughput: 0: 5896.2. Samples: 209236634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:12,320][25689] Avg episode reward: [(0, '-50.903')] [2022-07-09 10:16:12,758][26022] Updated weights on worker 0-0, policy_version 204336 (0.00087) [2022-07-09 10:16:14,734][26022] Updated weights on worker 0-0, policy_version 204346 (0.00082) [2022-07-09 10:16:16,141][26022] Updated weights on worker 0-0, policy_version 204356 (0.00088) [2022-07-09 10:16:17,379][25689] Fps is (10 sec: 5742.4, 60 sec: 5723.7, 300 sec: 5733.3). Total num frames: 209265664. Throughput: 0: 5912.5. Samples: 209271728. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:17,381][25689] Avg episode reward: [(0, '-50.659')] [2022-07-09 10:16:18,219][26022] Updated weights on worker 0-0, policy_version 204366 (0.00086) [2022-07-09 10:16:19,696][26022] Updated weights on worker 0-0, policy_version 204376 (0.00554) [2022-07-09 10:16:21,728][26022] Updated weights on worker 0-0, policy_version 204386 (0.00086) [2022-07-09 10:16:22,464][25689] Fps is (10 sec: 5853.8, 60 sec: 5769.2, 300 sec: 5742.2). Total num frames: 209296384. Throughput: 0: 5036.3. Samples: 209288734. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:22,465][25689] Avg episode reward: [(0, '-49.487')] [2022-07-09 10:16:23,593][26022] Updated weights on worker 0-0, policy_version 204396 (0.00088) [2022-07-09 10:16:25,091][26022] Updated weights on worker 0-0, policy_version 204406 (0.00090) [2022-07-09 10:16:27,246][26022] Updated weights on worker 0-0, policy_version 204416 (0.00092) [2022-07-09 10:16:27,510][25689] Fps is (10 sec: 5761.3, 60 sec: 5715.2, 300 sec: 5735.7). Total num frames: 209324032. Throughput: 0: 5996.8. Samples: 209323880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:27,510][25689] Avg episode reward: [(0, '-48.996')] [2022-07-09 10:16:28,658][26022] Updated weights on worker 0-0, policy_version 204426 (0.00087) [2022-07-09 10:16:30,502][26022] Updated weights on worker 0-0, policy_version 204436 (0.00081) [2022-07-09 10:16:32,315][26022] Updated weights on worker 0-0, policy_version 204446 (0.00083) [2022-07-09 10:16:32,587][25689] Fps is (10 sec: 5664.9, 60 sec: 5732.9, 300 sec: 5741.2). Total num frames: 209353728. Throughput: 0: 6016.6. Samples: 209358582. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:32,587][25689] Avg episode reward: [(0, '-48.727')] [2022-07-09 10:16:33,860][26022] Updated weights on worker 0-0, policy_version 204456 (0.00087) [2022-07-09 10:16:36,009][26022] Updated weights on worker 0-0, policy_version 204466 (0.00082) [2022-07-09 10:16:37,523][26022] Updated weights on worker 0-0, policy_version 204476 (0.00086) [2022-07-09 10:16:37,603][25689] Fps is (10 sec: 5884.4, 60 sec: 5731.8, 300 sec: 5737.5). Total num frames: 209383424. Throughput: 0: 5162.1. Samples: 209376126. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:37,603][25689] Avg episode reward: [(0, '-49.239')] [2022-07-09 10:16:39,417][26022] Updated weights on worker 0-0, policy_version 204486 (0.00084) [2022-07-09 10:16:41,059][26022] Updated weights on worker 0-0, policy_version 204496 (0.00079) [2022-07-09 10:16:42,614][25689] Fps is (10 sec: 5922.8, 60 sec: 5748.5, 300 sec: 5744.6). Total num frames: 209413120. Throughput: 0: 6061.4. Samples: 209410874. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:42,615][25689] Avg episode reward: [(0, '-49.674')] [2022-07-09 10:16:42,849][26022] Updated weights on worker 0-0, policy_version 204506 (0.00088) [2022-07-09 10:16:44,549][26022] Updated weights on worker 0-0, policy_version 204516 (0.00097) [2022-07-09 10:16:46,407][26022] Updated weights on worker 0-0, policy_version 204526 (0.00091) [2022-07-09 10:16:47,336][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:16:47,359][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000204531_209439744.pth [2022-07-09 10:16:47,359][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000202511_207371264.pth [2022-07-09 10:16:47,636][25689] Fps is (10 sec: 5817.4, 60 sec: 5749.4, 300 sec: 5746.8). Total num frames: 209441792. Throughput: 0: 6057.7. Samples: 209445800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:47,636][25689] Avg episode reward: [(0, '-50.293')] [2022-07-09 10:16:48,099][26022] Updated weights on worker 0-0, policy_version 204536 (0.00095) [2022-07-09 10:16:49,978][26022] Updated weights on worker 0-0, policy_version 204546 (0.00089) [2022-07-09 10:16:51,579][26022] Updated weights on worker 0-0, policy_version 204556 (0.00090) [2022-07-09 10:16:52,691][25689] Fps is (10 sec: 5792.3, 60 sec: 5750.1, 300 sec: 5747.2). Total num frames: 209471488. Throughput: 0: 6070.1. Samples: 209480618. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:52,692][25689] Avg episode reward: [(0, '-49.755')] [2022-07-09 10:16:53,557][26022] Updated weights on worker 0-0, policy_version 204566 (0.00083) [2022-07-09 10:16:55,030][26022] Updated weights on worker 0-0, policy_version 204576 (0.00101) [2022-07-09 10:16:57,097][26022] Updated weights on worker 0-0, policy_version 204586 (0.00080) [2022-07-09 10:16:57,699][25689] Fps is (10 sec: 5698.4, 60 sec: 5734.0, 300 sec: 5741.1). Total num frames: 209499136. Throughput: 0: 6076.0. Samples: 209498232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:16:57,700][25689] Avg episode reward: [(0, '-50.873')] [2022-07-09 10:16:58,496][26022] Updated weights on worker 0-0, policy_version 204596 (0.00088) [2022-07-09 10:17:00,453][26022] Updated weights on worker 0-0, policy_version 204606 (0.00076) [2022-07-09 10:17:02,497][26022] Updated weights on worker 0-0, policy_version 204616 (0.00085) [2022-07-09 10:17:02,706][25689] Fps is (10 sec: 5623.8, 60 sec: 5753.6, 300 sec: 5749.2). Total num frames: 209527808. Throughput: 0: 6009.5. Samples: 209531614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:02,706][25689] Avg episode reward: [(0, '-50.216')] [2022-07-09 10:17:04,425][26022] Updated weights on worker 0-0, policy_version 204626 (0.00087) [2022-07-09 10:17:06,018][26022] Updated weights on worker 0-0, policy_version 204636 (0.00087) [2022-07-09 10:17:07,715][25689] Fps is (10 sec: 5725.5, 60 sec: 5771.6, 300 sec: 5751.4). Total num frames: 209556480. Throughput: 0: 5127.0. Samples: 209548744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:07,715][25689] Avg episode reward: [(0, '-49.868')] [2022-07-09 10:17:07,868][26022] Updated weights on worker 0-0, policy_version 204646 (0.00082) [2022-07-09 10:17:09,487][26022] Updated weights on worker 0-0, policy_version 204656 (0.00092) [2022-07-09 10:17:11,368][26022] Updated weights on worker 0-0, policy_version 204666 (0.00091) [2022-07-09 10:17:12,774][25689] Fps is (10 sec: 5797.3, 60 sec: 5775.8, 300 sec: 5751.7). Total num frames: 209586176. Throughput: 0: 5130.7. Samples: 209583658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:12,775][25689] Avg episode reward: [(0, '-50.393')] [2022-07-09 10:17:13,125][26022] Updated weights on worker 0-0, policy_version 204676 (0.00086) [2022-07-09 10:17:14,756][26022] Updated weights on worker 0-0, policy_version 204686 (0.00086) [2022-07-09 10:17:16,629][26022] Updated weights on worker 0-0, policy_version 204696 (0.00094) [2022-07-09 10:17:17,796][25689] Fps is (10 sec: 5891.5, 60 sec: 5796.5, 300 sec: 5751.9). Total num frames: 209615872. Throughput: 0: 5991.4. Samples: 209618638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:17,796][25689] Avg episode reward: [(0, '-49.290')] [2022-07-09 10:17:18,580][26022] Updated weights on worker 0-0, policy_version 204706 (0.00084) [2022-07-09 10:17:20,186][26022] Updated weights on worker 0-0, policy_version 204716 (0.00056) [2022-07-09 10:17:22,082][26022] Updated weights on worker 0-0, policy_version 204726 (0.00081) [2022-07-09 10:17:22,811][25689] Fps is (10 sec: 5713.2, 60 sec: 5752.3, 300 sec: 5751.7). Total num frames: 209643520. Throughput: 0: 6056.9. Samples: 209653390. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:22,812][25689] Avg episode reward: [(0, '-48.670')] [2022-07-09 10:17:23,673][26022] Updated weights on worker 0-0, policy_version 204736 (0.00081) [2022-07-09 10:17:25,555][26022] Updated weights on worker 0-0, policy_version 204746 (0.00082) [2022-07-09 10:17:27,368][26022] Updated weights on worker 0-0, policy_version 204756 (0.00090) [2022-07-09 10:17:27,817][25689] Fps is (10 sec: 5620.0, 60 sec: 5773.1, 300 sec: 5752.6). Total num frames: 209672192. Throughput: 0: 6068.5. Samples: 209670736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:27,817][25689] Avg episode reward: [(0, '-48.240')] [2022-07-09 10:17:29,058][26022] Updated weights on worker 0-0, policy_version 204766 (0.00089) [2022-07-09 10:17:30,888][26022] Updated weights on worker 0-0, policy_version 204776 (0.00086) [2022-07-09 10:17:32,678][26022] Updated weights on worker 0-0, policy_version 204786 (0.00089) [2022-07-09 10:17:32,942][25689] Fps is (10 sec: 5761.7, 60 sec: 5768.5, 300 sec: 5748.1). Total num frames: 209701888. Throughput: 0: 6032.9. Samples: 209705326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:32,942][25689] Avg episode reward: [(0, '-48.396')] [2022-07-09 10:17:34,371][26022] Updated weights on worker 0-0, policy_version 204796 (0.00093) [2022-07-09 10:17:36,049][26022] Updated weights on worker 0-0, policy_version 204806 (0.00078) [2022-07-09 10:17:37,809][26022] Updated weights on worker 0-0, policy_version 204816 (0.00081) [2022-07-09 10:17:38,026][25689] Fps is (10 sec: 5817.5, 60 sec: 5762.0, 300 sec: 5751.2). Total num frames: 209731584. Throughput: 0: 6030.6. Samples: 209740640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:38,027][25689] Avg episode reward: [(0, '-47.238')] [2022-07-09 10:17:39,627][26022] Updated weights on worker 0-0, policy_version 204826 (0.00089) [2022-07-09 10:17:41,562][26022] Updated weights on worker 0-0, policy_version 204836 (0.00097) [2022-07-09 10:17:43,007][26022] Updated weights on worker 0-0, policy_version 204846 (0.00080) [2022-07-09 10:17:43,098][25689] Fps is (10 sec: 6049.3, 60 sec: 5790.1, 300 sec: 5760.5). Total num frames: 209763328. Throughput: 0: 5163.4. Samples: 209758140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:43,099][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 10:17:44,887][26022] Updated weights on worker 0-0, policy_version 204856 (0.00083) [2022-07-09 10:17:46,437][26022] Updated weights on worker 0-0, policy_version 204866 (0.00087) [2022-07-09 10:17:48,147][25689] Fps is (10 sec: 5868.1, 60 sec: 5770.6, 300 sec: 5753.5). Total num frames: 209790976. Throughput: 0: 6024.2. Samples: 209793210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:48,149][25689] Avg episode reward: [(0, '-48.408')] [2022-07-09 10:17:48,481][26022] Updated weights on worker 0-0, policy_version 204876 (0.00082) [2022-07-09 10:17:50,105][26022] Updated weights on worker 0-0, policy_version 204886 (0.00085) [2022-07-09 10:17:52,012][26022] Updated weights on worker 0-0, policy_version 204896 (0.00084) [2022-07-09 10:17:53,236][25689] Fps is (10 sec: 5656.2, 60 sec: 5767.3, 300 sec: 5752.2). Total num frames: 209820672. Throughput: 0: 6036.1. Samples: 209827828. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:17:53,237][25689] Avg episode reward: [(0, '-49.044')] [2022-07-09 10:17:53,778][26022] Updated weights on worker 0-0, policy_version 204906 (0.00086) [2022-07-09 10:17:55,692][26022] Updated weights on worker 0-0, policy_version 204916 (0.00087) [2022-07-09 10:17:57,065][26022] Updated weights on worker 0-0, policy_version 204926 (0.00081) [2022-07-09 10:17:58,251][25689] Fps is (10 sec: 5777.0, 60 sec: 5783.6, 300 sec: 5752.3). Total num frames: 209849344. Throughput: 0: 5176.2. Samples: 209845326. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:17:58,251][25689] Avg episode reward: [(0, '-49.443')] [2022-07-09 10:17:59,170][26022] Updated weights on worker 0-0, policy_version 204936 (0.00095) [2022-07-09 10:18:00,525][26022] Updated weights on worker 0-0, policy_version 204946 (0.00083) [2022-07-09 10:18:02,882][26022] Updated weights on worker 0-0, policy_version 204956 (0.00085) [2022-07-09 10:18:03,273][25689] Fps is (10 sec: 5611.3, 60 sec: 5765.2, 300 sec: 5753.5). Total num frames: 209876992. Throughput: 0: 6068.6. Samples: 209880574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:03,274][25689] Avg episode reward: [(0, '-49.240')] [2022-07-09 10:18:04,506][26022] Updated weights on worker 0-0, policy_version 204966 (0.00081) [2022-07-09 10:18:06,465][26022] Updated weights on worker 0-0, policy_version 204976 (0.00096) [2022-07-09 10:18:08,045][26022] Updated weights on worker 0-0, policy_version 204986 (0.00094) [2022-07-09 10:18:08,312][25689] Fps is (10 sec: 5801.0, 60 sec: 5796.1, 300 sec: 5761.4). Total num frames: 209907712. Throughput: 0: 5977.9. Samples: 209913756. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:08,313][25689] Avg episode reward: [(0, '-50.108')] [2022-07-09 10:18:09,860][26022] Updated weights on worker 0-0, policy_version 204996 (0.00092) [2022-07-09 10:18:11,511][26022] Updated weights on worker 0-0, policy_version 205006 (0.00084) [2022-07-09 10:18:13,374][25689] Fps is (10 sec: 5778.4, 60 sec: 5762.1, 300 sec: 5751.8). Total num frames: 209935360. Throughput: 0: 5124.3. Samples: 209931020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:13,375][25689] Avg episode reward: [(0, '-50.087')] [2022-07-09 10:18:13,420][26022] Updated weights on worker 0-0, policy_version 205016 (0.00081) [2022-07-09 10:18:14,916][26022] Updated weights on worker 0-0, policy_version 205026 (0.00091) [2022-07-09 10:18:16,899][26022] Updated weights on worker 0-0, policy_version 205036 (0.00086) [2022-07-09 10:18:18,392][25689] Fps is (10 sec: 5790.7, 60 sec: 5779.3, 300 sec: 5763.1). Total num frames: 209966080. Throughput: 0: 6013.1. Samples: 209966438. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:18,393][25689] Avg episode reward: [(0, '-49.533')] [2022-07-09 10:18:18,579][26022] Updated weights on worker 0-0, policy_version 205046 (0.00085) [2022-07-09 10:18:20,359][26022] Updated weights on worker 0-0, policy_version 205056 (0.00086) [2022-07-09 10:18:22,123][26022] Updated weights on worker 0-0, policy_version 205066 (0.00082) [2022-07-09 10:18:23,415][25689] Fps is (10 sec: 5915.1, 60 sec: 5795.6, 300 sec: 5763.2). Total num frames: 209994752. Throughput: 0: 5995.8. Samples: 210001340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:23,415][25689] Avg episode reward: [(0, '-49.335')] [2022-07-09 10:18:23,927][26022] Updated weights on worker 0-0, policy_version 205076 (0.01115) [2022-07-09 10:18:25,466][26022] Updated weights on worker 0-0, policy_version 205086 (0.00085) [2022-07-09 10:18:27,647][26022] Updated weights on worker 0-0, policy_version 205096 (0.00095) [2022-07-09 10:18:28,450][25689] Fps is (10 sec: 5701.3, 60 sec: 5792.7, 300 sec: 5756.2). Total num frames: 210023424. Throughput: 0: 5211.8. Samples: 210018710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:28,451][25689] Avg episode reward: [(0, '-48.577')] [2022-07-09 10:18:29,103][26022] Updated weights on worker 0-0, policy_version 205106 (0.00091) [2022-07-09 10:18:31,072][26022] Updated weights on worker 0-0, policy_version 205116 (0.00090) [2022-07-09 10:18:32,700][26022] Updated weights on worker 0-0, policy_version 205126 (0.00087) [2022-07-09 10:18:33,527][25689] Fps is (10 sec: 5772.4, 60 sec: 5797.3, 300 sec: 5761.8). Total num frames: 210053120. Throughput: 0: 6065.0. Samples: 210053244. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:33,527][25689] Avg episode reward: [(0, '-47.816')] [2022-07-09 10:18:34,571][26022] Updated weights on worker 0-0, policy_version 205136 (0.00087) [2022-07-09 10:18:36,271][26022] Updated weights on worker 0-0, policy_version 205146 (0.00085) [2022-07-09 10:18:37,999][26022] Updated weights on worker 0-0, policy_version 205156 (0.00089) [2022-07-09 10:18:38,534][25689] Fps is (10 sec: 5889.6, 60 sec: 5804.7, 300 sec: 5762.5). Total num frames: 210082816. Throughput: 0: 6042.8. Samples: 210088154. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:38,535][25689] Avg episode reward: [(0, '-47.264')] [2022-07-09 10:18:39,918][26022] Updated weights on worker 0-0, policy_version 205166 (0.00087) [2022-07-09 10:18:41,482][26022] Updated weights on worker 0-0, policy_version 205176 (0.00083) [2022-07-09 10:18:43,482][26022] Updated weights on worker 0-0, policy_version 205186 (0.00080) [2022-07-09 10:18:43,586][25689] Fps is (10 sec: 5700.4, 60 sec: 5738.9, 300 sec: 5761.5). Total num frames: 210110464. Throughput: 0: 5165.7. Samples: 210105538. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:43,587][25689] Avg episode reward: [(0, '-47.151')] [2022-07-09 10:18:45,058][26022] Updated weights on worker 0-0, policy_version 205196 (0.00087) [2022-07-09 10:18:46,959][26022] Updated weights on worker 0-0, policy_version 205206 (0.00091) [2022-07-09 10:18:47,412][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:18:47,420][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000205210_210135040.pth [2022-07-09 10:18:47,421][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000203182_208058368.pth [2022-07-09 10:18:48,591][25689] Fps is (10 sec: 5600.4, 60 sec: 5760.1, 300 sec: 5752.2). Total num frames: 210139136. Throughput: 0: 6030.9. Samples: 210140176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:48,591][25689] Avg episode reward: [(0, '-47.197')] [2022-07-09 10:18:48,880][26022] Updated weights on worker 0-0, policy_version 205216 (0.00087) [2022-07-09 10:18:50,443][26022] Updated weights on worker 0-0, policy_version 205226 (0.00087) [2022-07-09 10:18:52,450][26022] Updated weights on worker 0-0, policy_version 205236 (0.00085) [2022-07-09 10:18:53,659][25689] Fps is (10 sec: 5794.9, 60 sec: 5762.1, 300 sec: 5761.6). Total num frames: 210168832. Throughput: 0: 6035.5. Samples: 210174750. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:53,659][25689] Avg episode reward: [(0, '-47.458')] [2022-07-09 10:18:54,086][26022] Updated weights on worker 0-0, policy_version 205246 (0.00089) [2022-07-09 10:18:55,860][26022] Updated weights on worker 0-0, policy_version 205256 (0.00085) [2022-07-09 10:18:57,726][26022] Updated weights on worker 0-0, policy_version 205266 (0.00086) [2022-07-09 10:18:58,676][25689] Fps is (10 sec: 5888.9, 60 sec: 5778.8, 300 sec: 5761.3). Total num frames: 210198528. Throughput: 0: 5170.4. Samples: 210192294. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:18:58,677][25689] Avg episode reward: [(0, '-48.433')] [2022-07-09 10:18:59,225][26022] Updated weights on worker 0-0, policy_version 205276 (0.00096) [2022-07-09 10:19:01,303][26022] Updated weights on worker 0-0, policy_version 205286 (0.00081) [2022-07-09 10:19:03,147][26022] Updated weights on worker 0-0, policy_version 205296 (0.00092) [2022-07-09 10:19:03,691][25689] Fps is (10 sec: 5614.1, 60 sec: 5762.6, 300 sec: 5757.9). Total num frames: 210225152. Throughput: 0: 5988.6. Samples: 210225934. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:03,691][25689] Avg episode reward: [(0, '-49.622')] [2022-07-09 10:19:04,982][26022] Updated weights on worker 0-0, policy_version 205306 (0.00090) [2022-07-09 10:19:06,803][26022] Updated weights on worker 0-0, policy_version 205316 (0.00357) [2022-07-09 10:19:08,619][26022] Updated weights on worker 0-0, policy_version 205326 (0.00087) [2022-07-09 10:19:08,776][25689] Fps is (10 sec: 5576.2, 60 sec: 5741.2, 300 sec: 5760.8). Total num frames: 210254848. Throughput: 0: 5921.9. Samples: 210259712. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:08,778][25689] Avg episode reward: [(0, '-49.803')] [2022-07-09 10:19:10,385][26022] Updated weights on worker 0-0, policy_version 205336 (0.00080) [2022-07-09 10:19:12,100][26022] Updated weights on worker 0-0, policy_version 205346 (0.00082) [2022-07-09 10:19:13,779][26022] Updated weights on worker 0-0, policy_version 205356 (0.00080) [2022-07-09 10:19:13,868][25689] Fps is (10 sec: 5835.5, 60 sec: 5772.2, 300 sec: 5759.4). Total num frames: 210284544. Throughput: 0: 5054.7. Samples: 210276904. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:13,869][25689] Avg episode reward: [(0, '-50.091')] [2022-07-09 10:19:15,553][26022] Updated weights on worker 0-0, policy_version 205366 (0.00089) [2022-07-09 10:19:17,524][26022] Updated weights on worker 0-0, policy_version 205376 (0.00082) [2022-07-09 10:19:18,892][25689] Fps is (10 sec: 5769.8, 60 sec: 5737.8, 300 sec: 5759.0). Total num frames: 210313216. Throughput: 0: 5930.8. Samples: 210312192. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:18,893][25689] Avg episode reward: [(0, '-48.829')] [2022-07-09 10:19:19,171][26022] Updated weights on worker 0-0, policy_version 205386 (0.00094) [2022-07-09 10:19:20,988][26022] Updated weights on worker 0-0, policy_version 205396 (0.00086) [2022-07-09 10:19:22,826][26022] Updated weights on worker 0-0, policy_version 205406 (0.00087) [2022-07-09 10:19:23,907][25689] Fps is (10 sec: 5712.4, 60 sec: 5738.6, 300 sec: 5758.9). Total num frames: 210341888. Throughput: 0: 5982.0. Samples: 210346866. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:23,907][25689] Avg episode reward: [(0, '-48.449')] [2022-07-09 10:19:24,459][26022] Updated weights on worker 0-0, policy_version 205416 (0.00094) [2022-07-09 10:19:26,472][26022] Updated weights on worker 0-0, policy_version 205426 (0.00090) [2022-07-09 10:19:27,953][26022] Updated weights on worker 0-0, policy_version 205436 (0.00082) [2022-07-09 10:19:28,914][25689] Fps is (10 sec: 5722.2, 60 sec: 5741.2, 300 sec: 5756.4). Total num frames: 210370560. Throughput: 0: 5187.3. Samples: 210364170. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:28,914][25689] Avg episode reward: [(0, '-47.768')] [2022-07-09 10:19:30,021][26022] Updated weights on worker 0-0, policy_version 205446 (0.00080) [2022-07-09 10:19:31,319][26022] Updated weights on worker 0-0, policy_version 205456 (0.00089) [2022-07-09 10:19:33,446][26022] Updated weights on worker 0-0, policy_version 205466 (0.00086) [2022-07-09 10:19:33,969][25689] Fps is (10 sec: 5902.2, 60 sec: 5760.2, 300 sec: 5762.5). Total num frames: 210401280. Throughput: 0: 6065.8. Samples: 210398834. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:33,970][25689] Avg episode reward: [(0, '-47.090')] [2022-07-09 10:19:34,978][26022] Updated weights on worker 0-0, policy_version 205476 (0.00087) [2022-07-09 10:19:36,984][26022] Updated weights on worker 0-0, policy_version 205486 (0.00083) [2022-07-09 10:19:38,708][26022] Updated weights on worker 0-0, policy_version 205496 (0.00082) [2022-07-09 10:19:38,983][25689] Fps is (10 sec: 5796.7, 60 sec: 5725.7, 300 sec: 5755.4). Total num frames: 210428928. Throughput: 0: 6058.2. Samples: 210433904. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:38,983][25689] Avg episode reward: [(0, '-47.508')] [2022-07-09 10:19:40,454][26022] Updated weights on worker 0-0, policy_version 205506 (0.00090) [2022-07-09 10:19:42,072][26022] Updated weights on worker 0-0, policy_version 205516 (0.00096) [2022-07-09 10:19:44,010][25689] Fps is (10 sec: 5609.2, 60 sec: 5745.0, 300 sec: 5751.6). Total num frames: 210457600. Throughput: 0: 5191.3. Samples: 210451228. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:44,010][25689] Avg episode reward: [(0, '-48.347')] [2022-07-09 10:19:44,106][26022] Updated weights on worker 0-0, policy_version 205526 (0.00091) [2022-07-09 10:19:45,455][26022] Updated weights on worker 0-0, policy_version 205536 (0.00084) [2022-07-09 10:19:47,702][26022] Updated weights on worker 0-0, policy_version 205546 (0.00090) [2022-07-09 10:19:49,025][25689] Fps is (10 sec: 5914.0, 60 sec: 5777.9, 300 sec: 5759.2). Total num frames: 210488320. Throughput: 0: 6056.7. Samples: 210485980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:49,026][25689] Avg episode reward: [(0, '-49.049')] [2022-07-09 10:19:49,100][26022] Updated weights on worker 0-0, policy_version 205556 (0.00081) [2022-07-09 10:19:51,178][26022] Updated weights on worker 0-0, policy_version 205566 (0.00080) [2022-07-09 10:19:52,687][26022] Updated weights on worker 0-0, policy_version 205576 (0.00757) [2022-07-09 10:19:54,101][25689] Fps is (10 sec: 5682.9, 60 sec: 5726.3, 300 sec: 5750.9). Total num frames: 210514944. Throughput: 0: 6050.6. Samples: 210520640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:54,101][25689] Avg episode reward: [(0, '-50.922')] [2022-07-09 10:19:54,598][26022] Updated weights on worker 0-0, policy_version 205586 (0.00087) [2022-07-09 10:19:56,377][26022] Updated weights on worker 0-0, policy_version 205596 (0.00091) [2022-07-09 10:19:58,217][26022] Updated weights on worker 0-0, policy_version 205606 (0.00884) [2022-07-09 10:19:59,115][25689] Fps is (10 sec: 5581.7, 60 sec: 5726.6, 300 sec: 5754.5). Total num frames: 210544640. Throughput: 0: 6028.0. Samples: 210555264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:19:59,117][25689] Avg episode reward: [(0, '-50.535')] [2022-07-09 10:19:59,894][26022] Updated weights on worker 0-0, policy_version 205616 (0.00092) [2022-07-09 10:20:02,199][26022] Updated weights on worker 0-0, policy_version 205626 (0.00087) [2022-07-09 10:20:03,623][26022] Updated weights on worker 0-0, policy_version 205636 (0.00086) [2022-07-09 10:20:04,126][25689] Fps is (10 sec: 5719.7, 60 sec: 5743.9, 300 sec: 5755.2). Total num frames: 210572288. Throughput: 0: 5932.7. Samples: 210570572. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 10:20:04,127][25689] Avg episode reward: [(0, '-51.220')] [2022-07-09 10:20:05,750][26022] Updated weights on worker 0-0, policy_version 205646 (0.00093) [2022-07-09 10:20:07,300][26022] Updated weights on worker 0-0, policy_version 205656 (0.00086) [2022-07-09 10:20:09,161][25689] Fps is (10 sec: 5504.6, 60 sec: 5714.8, 300 sec: 5752.3). Total num frames: 210599936. Throughput: 0: 5917.1. Samples: 210605124. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:09,163][25689] Avg episode reward: [(0, '-49.802')] [2022-07-09 10:20:09,360][26022] Updated weights on worker 0-0, policy_version 205666 (0.00079) [2022-07-09 10:20:10,971][26022] Updated weights on worker 0-0, policy_version 205676 (0.00079) [2022-07-09 10:20:12,825][26022] Updated weights on worker 0-0, policy_version 205686 (0.00084) [2022-07-09 10:20:14,227][25689] Fps is (10 sec: 5778.4, 60 sec: 5734.2, 300 sec: 5755.1). Total num frames: 210630656. Throughput: 0: 5920.8. Samples: 210639806. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:14,228][25689] Avg episode reward: [(0, '-49.670')] [2022-07-09 10:20:14,507][26022] Updated weights on worker 0-0, policy_version 205696 (0.00085) [2022-07-09 10:20:16,301][26022] Updated weights on worker 0-0, policy_version 205706 (0.00084) [2022-07-09 10:20:17,889][26022] Updated weights on worker 0-0, policy_version 205716 (0.00089) [2022-07-09 10:20:19,259][25689] Fps is (10 sec: 5881.7, 60 sec: 5733.5, 300 sec: 5758.5). Total num frames: 210659328. Throughput: 0: 5075.0. Samples: 210657490. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:19,260][25689] Avg episode reward: [(0, '-48.606')] [2022-07-09 10:20:19,744][26022] Updated weights on worker 0-0, policy_version 205726 (0.00094) [2022-07-09 10:20:21,477][26022] Updated weights on worker 0-0, policy_version 205736 (0.00084) [2022-07-09 10:20:23,198][26022] Updated weights on worker 0-0, policy_version 205746 (0.00051) [2022-07-09 10:20:24,301][25689] Fps is (10 sec: 5896.1, 60 sec: 5764.8, 300 sec: 5757.9). Total num frames: 210690048. Throughput: 0: 6049.9. Samples: 210692624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:24,302][25689] Avg episode reward: [(0, '-48.456')] [2022-07-09 10:20:25,112][26022] Updated weights on worker 0-0, policy_version 205756 (0.00087) [2022-07-09 10:20:26,852][26022] Updated weights on worker 0-0, policy_version 205766 (0.00084) [2022-07-09 10:20:28,491][26022] Updated weights on worker 0-0, policy_version 205776 (0.00083) [2022-07-09 10:20:29,326][25689] Fps is (10 sec: 5797.9, 60 sec: 5746.1, 300 sec: 5755.7). Total num frames: 210717696. Throughput: 0: 6056.3. Samples: 210727248. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:29,326][25689] Avg episode reward: [(0, '-48.508')] [2022-07-09 10:20:30,355][26022] Updated weights on worker 0-0, policy_version 205786 (0.00086) [2022-07-09 10:20:32,308][26022] Updated weights on worker 0-0, policy_version 205796 (0.00081) [2022-07-09 10:20:33,927][26022] Updated weights on worker 0-0, policy_version 205806 (0.00087) [2022-07-09 10:20:34,391][25689] Fps is (10 sec: 5784.5, 60 sec: 5745.2, 300 sec: 5757.9). Total num frames: 210748416. Throughput: 0: 5173.7. Samples: 210744126. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:34,391][25689] Avg episode reward: [(0, '-48.633')] [2022-07-09 10:20:35,847][26022] Updated weights on worker 0-0, policy_version 205816 (0.00088) [2022-07-09 10:20:37,352][26022] Updated weights on worker 0-0, policy_version 205826 (0.00076) [2022-07-09 10:20:39,409][25689] Fps is (10 sec: 5788.4, 60 sec: 5744.7, 300 sec: 5754.3). Total num frames: 210776064. Throughput: 0: 6032.8. Samples: 210779056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:39,410][25689] Avg episode reward: [(0, '-49.566')] [2022-07-09 10:20:39,411][26022] Updated weights on worker 0-0, policy_version 205836 (0.00110) [2022-07-09 10:20:40,968][26022] Updated weights on worker 0-0, policy_version 205846 (0.00091) [2022-07-09 10:20:42,675][26022] Updated weights on worker 0-0, policy_version 205856 (0.00088) [2022-07-09 10:20:44,424][25689] Fps is (10 sec: 5715.3, 60 sec: 5762.9, 300 sec: 5758.1). Total num frames: 210805760. Throughput: 0: 6027.4. Samples: 210813918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:44,425][25689] Avg episode reward: [(0, '-48.017')] [2022-07-09 10:20:44,547][26022] Updated weights on worker 0-0, policy_version 205866 (0.00090) [2022-07-09 10:20:46,180][26022] Updated weights on worker 0-0, policy_version 205876 (0.00087) [2022-07-09 10:20:47,552][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:20:47,566][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000205882_210823168.pth [2022-07-09 10:20:47,566][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000203858_208750592.pth [2022-07-09 10:20:48,183][26022] Updated weights on worker 0-0, policy_version 205886 (0.00089) [2022-07-09 10:20:49,455][25689] Fps is (10 sec: 5912.4, 60 sec: 5744.5, 300 sec: 5758.7). Total num frames: 210835456. Throughput: 0: 5175.1. Samples: 210831418. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:49,455][25689] Avg episode reward: [(0, '-49.165')] [2022-07-09 10:20:49,786][26022] Updated weights on worker 0-0, policy_version 205896 (0.00084) [2022-07-09 10:20:51,597][26022] Updated weights on worker 0-0, policy_version 205906 (0.00078) [2022-07-09 10:20:53,530][26022] Updated weights on worker 0-0, policy_version 205916 (0.00086) [2022-07-09 10:20:54,505][25689] Fps is (10 sec: 5891.7, 60 sec: 5797.7, 300 sec: 5761.5). Total num frames: 210865152. Throughput: 0: 6073.5. Samples: 210866290. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:54,506][25689] Avg episode reward: [(0, '-48.993')] [2022-07-09 10:20:54,955][26022] Updated weights on worker 0-0, policy_version 205926 (0.00084) [2022-07-09 10:20:57,124][26022] Updated weights on worker 0-0, policy_version 205936 (0.00087) [2022-07-09 10:20:58,752][26022] Updated weights on worker 0-0, policy_version 205946 (0.00087) [2022-07-09 10:20:59,507][25689] Fps is (10 sec: 5704.4, 60 sec: 5765.0, 300 sec: 5762.1). Total num frames: 210892800. Throughput: 0: 6077.3. Samples: 210901198. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:20:59,508][25689] Avg episode reward: [(0, '-49.566')] [2022-07-09 10:21:00,361][26022] Updated weights on worker 0-0, policy_version 205956 (0.00081) [2022-07-09 10:21:02,376][26022] Updated weights on worker 0-0, policy_version 205966 (0.00086) [2022-07-09 10:21:04,118][26022] Updated weights on worker 0-0, policy_version 205976 (0.00083) [2022-07-09 10:21:04,509][25689] Fps is (10 sec: 5527.4, 60 sec: 5765.9, 300 sec: 5762.5). Total num frames: 210920448. Throughput: 0: 5179.9. Samples: 210917954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:04,510][25689] Avg episode reward: [(0, '-49.528')] [2022-07-09 10:21:06,058][26022] Updated weights on worker 0-0, policy_version 205986 (0.00092) [2022-07-09 10:21:07,872][26022] Updated weights on worker 0-0, policy_version 205996 (0.00085) [2022-07-09 10:21:09,497][26022] Updated weights on worker 0-0, policy_version 206006 (0.00089) [2022-07-09 10:21:09,551][25689] Fps is (10 sec: 5709.5, 60 sec: 5799.1, 300 sec: 5763.7). Total num frames: 210950144. Throughput: 0: 6008.8. Samples: 210952172. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:09,551][25689] Avg episode reward: [(0, '-50.063')] [2022-07-09 10:21:11,537][26022] Updated weights on worker 0-0, policy_version 206016 (0.00085) [2022-07-09 10:21:13,086][26022] Updated weights on worker 0-0, policy_version 206026 (0.00084) [2022-07-09 10:21:14,679][25689] Fps is (10 sec: 5739.1, 60 sec: 5759.3, 300 sec: 5762.4). Total num frames: 210978816. Throughput: 0: 5986.6. Samples: 210987064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:14,680][25689] Avg episode reward: [(0, '-49.726')] [2022-07-09 10:21:15,023][26022] Updated weights on worker 0-0, policy_version 206036 (0.00082) [2022-07-09 10:21:16,562][26022] Updated weights on worker 0-0, policy_version 206046 (0.00083) [2022-07-09 10:21:18,407][26022] Updated weights on worker 0-0, policy_version 206056 (0.00095) [2022-07-09 10:21:19,695][25689] Fps is (10 sec: 5753.9, 60 sec: 5777.7, 300 sec: 5760.3). Total num frames: 211008512. Throughput: 0: 5112.9. Samples: 211004416. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:19,696][25689] Avg episode reward: [(0, '-49.126')] [2022-07-09 10:21:20,139][26022] Updated weights on worker 0-0, policy_version 206066 (0.00086) [2022-07-09 10:21:21,964][26022] Updated weights on worker 0-0, policy_version 206076 (0.00086) [2022-07-09 10:21:23,755][26022] Updated weights on worker 0-0, policy_version 206086 (0.00085) [2022-07-09 10:21:24,720][25689] Fps is (10 sec: 5813.3, 60 sec: 5745.5, 300 sec: 5764.1). Total num frames: 211037184. Throughput: 0: 5997.1. Samples: 211039158. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:24,720][25689] Avg episode reward: [(0, '-49.102')] [2022-07-09 10:21:25,454][26022] Updated weights on worker 0-0, policy_version 206096 (0.00085) [2022-07-09 10:21:27,310][26022] Updated weights on worker 0-0, policy_version 206106 (0.00084) [2022-07-09 10:21:29,106][26022] Updated weights on worker 0-0, policy_version 206116 (0.00080) [2022-07-09 10:21:29,731][25689] Fps is (10 sec: 5713.9, 60 sec: 5763.8, 300 sec: 5761.9). Total num frames: 211065856. Throughput: 0: 6005.6. Samples: 211073364. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:29,731][25689] Avg episode reward: [(0, '-47.973')] [2022-07-09 10:21:30,927][26022] Updated weights on worker 0-0, policy_version 206126 (0.00081) [2022-07-09 10:21:32,598][26022] Updated weights on worker 0-0, policy_version 206136 (0.00087) [2022-07-09 10:21:34,430][26022] Updated weights on worker 0-0, policy_version 206146 (0.00085) [2022-07-09 10:21:34,811][25689] Fps is (10 sec: 5682.6, 60 sec: 5728.5, 300 sec: 5757.3). Total num frames: 211094528. Throughput: 0: 5146.2. Samples: 211090664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:34,811][25689] Avg episode reward: [(0, '-47.833')] [2022-07-09 10:21:36,317][26022] Updated weights on worker 0-0, policy_version 206156 (0.00087) [2022-07-09 10:21:37,870][26022] Updated weights on worker 0-0, policy_version 206166 (0.00088) [2022-07-09 10:21:39,798][26022] Updated weights on worker 0-0, policy_version 206176 (0.00085) [2022-07-09 10:21:39,812][25689] Fps is (10 sec: 5790.1, 60 sec: 5764.0, 300 sec: 5757.5). Total num frames: 211124224. Throughput: 0: 6020.7. Samples: 211125532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:39,812][25689] Avg episode reward: [(0, '-48.516')] [2022-07-09 10:21:41,579][26022] Updated weights on worker 0-0, policy_version 206186 (0.00081) [2022-07-09 10:21:43,225][26022] Updated weights on worker 0-0, policy_version 206196 (0.00085) [2022-07-09 10:21:44,815][25689] Fps is (10 sec: 5732.0, 60 sec: 5731.2, 300 sec: 5754.4). Total num frames: 211151872. Throughput: 0: 6025.2. Samples: 211160236. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:44,815][25689] Avg episode reward: [(0, '-50.122')] [2022-07-09 10:21:45,178][26022] Updated weights on worker 0-0, policy_version 206206 (0.00090) [2022-07-09 10:21:46,619][26022] Updated weights on worker 0-0, policy_version 206216 (0.00084) [2022-07-09 10:21:48,664][26022] Updated weights on worker 0-0, policy_version 206226 (0.00088) [2022-07-09 10:21:49,838][25689] Fps is (10 sec: 5923.3, 60 sec: 5765.8, 300 sec: 5761.9). Total num frames: 211183616. Throughput: 0: 5193.4. Samples: 211177790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:49,839][25689] Avg episode reward: [(0, '-49.745')] [2022-07-09 10:21:50,086][26022] Updated weights on worker 0-0, policy_version 206236 (0.00083) [2022-07-09 10:21:52,054][26022] Updated weights on worker 0-0, policy_version 206246 (0.00086) [2022-07-09 10:21:53,710][26022] Updated weights on worker 0-0, policy_version 206256 (0.00093) [2022-07-09 10:21:54,881][25689] Fps is (10 sec: 6001.7, 60 sec: 5749.5, 300 sec: 5764.7). Total num frames: 211212288. Throughput: 0: 6088.2. Samples: 211212858. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:54,882][25689] Avg episode reward: [(0, '-50.709')] [2022-07-09 10:21:55,696][26022] Updated weights on worker 0-0, policy_version 206266 (0.00083) [2022-07-09 10:21:57,293][26022] Updated weights on worker 0-0, policy_version 206276 (0.00084) [2022-07-09 10:21:59,083][26022] Updated weights on worker 0-0, policy_version 206286 (0.00082) [2022-07-09 10:21:59,893][25689] Fps is (10 sec: 5703.4, 60 sec: 5765.6, 300 sec: 5764.6). Total num frames: 211240960. Throughput: 0: 6070.0. Samples: 211247426. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:21:59,894][25689] Avg episode reward: [(0, '-50.515')] [2022-07-09 10:22:00,902][26022] Updated weights on worker 0-0, policy_version 206296 (0.00087) [2022-07-09 10:22:03,103][26022] Updated weights on worker 0-0, policy_version 206306 (0.00092) [2022-07-09 10:22:04,903][25689] Fps is (10 sec: 5415.2, 60 sec: 5730.9, 300 sec: 5754.2). Total num frames: 211266560. Throughput: 0: 5098.4. Samples: 211262656. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:22:04,904][25689] Avg episode reward: [(0, '-49.697')] [2022-07-09 10:22:04,975][26022] Updated weights on worker 0-0, policy_version 206316 (0.00088) [2022-07-09 10:22:06,498][26022] Updated weights on worker 0-0, policy_version 206326 (0.00087) [2022-07-09 10:22:08,311][26022] Updated weights on worker 0-0, policy_version 206336 (0.00097) [2022-07-09 10:22:09,928][25689] Fps is (10 sec: 5612.1, 60 sec: 5749.4, 300 sec: 5758.3). Total num frames: 211297280. Throughput: 0: 5963.3. Samples: 211297592. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:22:09,929][25689] Avg episode reward: [(0, '-49.238')] [2022-07-09 10:22:10,134][26022] Updated weights on worker 0-0, policy_version 206346 (0.00098) [2022-07-09 10:22:11,676][26022] Updated weights on worker 0-0, policy_version 206356 (0.00090) [2022-07-09 10:22:13,651][26022] Updated weights on worker 0-0, policy_version 206366 (0.00085) [2022-07-09 10:22:14,996][25689] Fps is (10 sec: 5986.4, 60 sec: 5772.2, 300 sec: 5757.4). Total num frames: 211326976. Throughput: 0: 5953.9. Samples: 211332616. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:22:14,996][25689] Avg episode reward: [(0, '-48.985')] [2022-07-09 10:22:15,295][26022] Updated weights on worker 0-0, policy_version 206376 (0.00079) [2022-07-09 10:22:17,220][26022] Updated weights on worker 0-0, policy_version 206386 (0.00085) [2022-07-09 10:22:18,943][26022] Updated weights on worker 0-0, policy_version 206396 (0.00089) [2022-07-09 10:22:20,003][25689] Fps is (10 sec: 5691.8, 60 sec: 5739.1, 300 sec: 5757.6). Total num frames: 211354624. Throughput: 0: 5104.1. Samples: 211350072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 10:22:20,004][25689] Avg episode reward: [(0, '-48.934')] [2022-07-09 10:22:20,558][26022] Updated weights on worker 0-0, policy_version 206406 (0.00088) [2022-07-09 10:22:22,510][26022] Updated weights on worker 0-0, policy_version 206416 (0.00090) [2022-07-09 10:22:24,217][26022] Updated weights on worker 0-0, policy_version 206426 (0.00092) [2022-07-09 10:22:25,023][25689] Fps is (10 sec: 5719.0, 60 sec: 5756.5, 300 sec: 5760.8). Total num frames: 211384320. Throughput: 0: 6086.1. Samples: 211385104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:25,023][25689] Avg episode reward: [(0, '-48.271')] [2022-07-09 10:22:26,012][26022] Updated weights on worker 0-0, policy_version 206436 (0.00094) [2022-07-09 10:22:27,913][26022] Updated weights on worker 0-0, policy_version 206446 (0.00096) [2022-07-09 10:22:29,291][26022] Updated weights on worker 0-0, policy_version 206456 (0.00095) [2022-07-09 10:22:30,044][25689] Fps is (10 sec: 5914.8, 60 sec: 5772.5, 300 sec: 5762.7). Total num frames: 211414016. Throughput: 0: 6066.2. Samples: 211419620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:30,045][25689] Avg episode reward: [(0, '-47.613')] [2022-07-09 10:22:31,600][26022] Updated weights on worker 0-0, policy_version 206466 (0.00082) [2022-07-09 10:22:32,965][26022] Updated weights on worker 0-0, policy_version 206476 (0.00087) [2022-07-09 10:22:34,963][26022] Updated weights on worker 0-0, policy_version 206486 (0.00081) [2022-07-09 10:22:35,182][25689] Fps is (10 sec: 5745.1, 60 sec: 5767.0, 300 sec: 5758.3). Total num frames: 211442688. Throughput: 0: 5169.4. Samples: 211436968. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:35,183][25689] Avg episode reward: [(0, '-47.932')] [2022-07-09 10:22:36,612][26022] Updated weights on worker 0-0, policy_version 206496 (0.00081) [2022-07-09 10:22:38,482][26022] Updated weights on worker 0-0, policy_version 206506 (0.00098) [2022-07-09 10:22:40,177][26022] Updated weights on worker 0-0, policy_version 206516 (0.00088) [2022-07-09 10:22:40,270][25689] Fps is (10 sec: 5708.2, 60 sec: 5758.7, 300 sec: 5751.1). Total num frames: 211472384. Throughput: 0: 6015.7. Samples: 211471988. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:40,270][25689] Avg episode reward: [(0, '-48.000')] [2022-07-09 10:22:41,961][26022] Updated weights on worker 0-0, policy_version 206526 (0.00087) [2022-07-09 10:22:43,544][26022] Updated weights on worker 0-0, policy_version 206536 (0.00090) [2022-07-09 10:22:45,273][25689] Fps is (10 sec: 5885.7, 60 sec: 5792.6, 300 sec: 5758.9). Total num frames: 211502080. Throughput: 0: 6038.1. Samples: 211507378. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:45,275][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 10:22:45,570][26022] Updated weights on worker 0-0, policy_version 206546 (0.00085) [2022-07-09 10:22:47,084][26022] Updated weights on worker 0-0, policy_version 206556 (0.00086) [2022-07-09 10:22:47,622][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:22:47,631][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000206559_211516416.pth [2022-07-09 10:22:47,647][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000204531_209439744.pth [2022-07-09 10:22:49,019][26022] Updated weights on worker 0-0, policy_version 206566 (0.00087) [2022-07-09 10:22:50,324][25689] Fps is (10 sec: 5906.9, 60 sec: 5756.1, 300 sec: 5759.6). Total num frames: 211531776. Throughput: 0: 5191.3. Samples: 211524900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:50,325][25689] Avg episode reward: [(0, '-46.732')] [2022-07-09 10:22:50,504][26022] Updated weights on worker 0-0, policy_version 206576 (0.00105) [2022-07-09 10:22:52,456][26022] Updated weights on worker 0-0, policy_version 206586 (0.00088) [2022-07-09 10:22:53,994][26022] Updated weights on worker 0-0, policy_version 206596 (0.00088) [2022-07-09 10:22:55,390][25689] Fps is (10 sec: 5769.5, 60 sec: 5753.9, 300 sec: 5758.6). Total num frames: 211560448. Throughput: 0: 6085.4. Samples: 211559938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:22:55,390][25689] Avg episode reward: [(0, '-47.846')] [2022-07-09 10:22:56,128][26022] Updated weights on worker 0-0, policy_version 206606 (0.00089) [2022-07-09 10:22:57,450][26022] Updated weights on worker 0-0, policy_version 206616 (0.00095) [2022-07-09 10:22:59,717][26022] Updated weights on worker 0-0, policy_version 206626 (0.00084) [2022-07-09 10:23:00,438][25689] Fps is (10 sec: 5670.0, 60 sec: 5750.4, 300 sec: 5761.5). Total num frames: 211589120. Throughput: 0: 6073.1. Samples: 211594472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:00,438][25689] Avg episode reward: [(0, '-48.134')] [2022-07-09 10:23:01,167][26022] Updated weights on worker 0-0, policy_version 206636 (0.00088) [2022-07-09 10:23:03,633][26022] Updated weights on worker 0-0, policy_version 206646 (0.00086) [2022-07-09 10:23:05,026][26022] Updated weights on worker 0-0, policy_version 206656 (0.00086) [2022-07-09 10:23:05,460][25689] Fps is (10 sec: 5694.6, 60 sec: 5800.1, 300 sec: 5755.0). Total num frames: 211617792. Throughput: 0: 5052.0. Samples: 211609360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:05,460][25689] Avg episode reward: [(0, '-48.222')] [2022-07-09 10:23:07,028][26022] Updated weights on worker 0-0, policy_version 206666 (0.00085) [2022-07-09 10:23:08,490][26022] Updated weights on worker 0-0, policy_version 206676 (0.00094) [2022-07-09 10:23:10,494][25689] Fps is (10 sec: 5702.3, 60 sec: 5765.4, 300 sec: 5759.0). Total num frames: 211646464. Throughput: 0: 5934.3. Samples: 211644594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:10,495][25689] Avg episode reward: [(0, '-48.668')] [2022-07-09 10:23:10,497][26022] Updated weights on worker 0-0, policy_version 206686 (0.00157) [2022-07-09 10:23:12,264][26022] Updated weights on worker 0-0, policy_version 206696 (0.00087) [2022-07-09 10:23:13,907][26022] Updated weights on worker 0-0, policy_version 206706 (0.00084) [2022-07-09 10:23:15,598][25689] Fps is (10 sec: 5656.1, 60 sec: 5745.0, 300 sec: 5750.4). Total num frames: 211675136. Throughput: 0: 5928.1. Samples: 211679734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:15,599][25689] Avg episode reward: [(0, '-48.494')] [2022-07-09 10:23:15,857][26022] Updated weights on worker 0-0, policy_version 206716 (0.00091) [2022-07-09 10:23:17,478][26022] Updated weights on worker 0-0, policy_version 206726 (0.00085) [2022-07-09 10:23:19,161][26022] Updated weights on worker 0-0, policy_version 206736 (0.00095) [2022-07-09 10:23:20,611][25689] Fps is (10 sec: 5769.6, 60 sec: 5778.3, 300 sec: 5754.1). Total num frames: 211704832. Throughput: 0: 5958.1. Samples: 211714664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:20,611][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 10:23:20,939][26022] Updated weights on worker 0-0, policy_version 206746 (0.00095) [2022-07-09 10:23:22,660][26022] Updated weights on worker 0-0, policy_version 206756 (0.00091) [2022-07-09 10:23:24,620][26022] Updated weights on worker 0-0, policy_version 206766 (0.00079) [2022-07-09 10:23:25,647][25689] Fps is (10 sec: 6012.4, 60 sec: 5793.6, 300 sec: 5760.9). Total num frames: 211735552. Throughput: 0: 6075.7. Samples: 211732010. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:25,647][25689] Avg episode reward: [(0, '-49.006')] [2022-07-09 10:23:26,218][26022] Updated weights on worker 0-0, policy_version 206776 (0.00088) [2022-07-09 10:23:28,004][26022] Updated weights on worker 0-0, policy_version 206786 (0.00090) [2022-07-09 10:23:29,995][26022] Updated weights on worker 0-0, policy_version 206796 (0.00083) [2022-07-09 10:23:30,667][25689] Fps is (10 sec: 5600.7, 60 sec: 5726.2, 300 sec: 5748.2). Total num frames: 211761152. Throughput: 0: 6055.1. Samples: 211766740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:30,667][25689] Avg episode reward: [(0, '-47.774')] [2022-07-09 10:23:31,379][26022] Updated weights on worker 0-0, policy_version 206806 (0.00083) [2022-07-09 10:23:33,561][26022] Updated weights on worker 0-0, policy_version 206816 (0.00082) [2022-07-09 10:23:34,910][26022] Updated weights on worker 0-0, policy_version 206826 (0.00087) [2022-07-09 10:23:35,815][25689] Fps is (10 sec: 5639.4, 60 sec: 5775.9, 300 sec: 5752.4). Total num frames: 211792896. Throughput: 0: 6033.1. Samples: 211801706. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:35,816][25689] Avg episode reward: [(0, '-47.514')] [2022-07-09 10:23:36,955][26022] Updated weights on worker 0-0, policy_version 206836 (0.00082) [2022-07-09 10:23:38,604][26022] Updated weights on worker 0-0, policy_version 206846 (0.00091) [2022-07-09 10:23:40,251][26022] Updated weights on worker 0-0, policy_version 206856 (0.00085) [2022-07-09 10:23:40,835][25689] Fps is (10 sec: 6042.4, 60 sec: 5782.3, 300 sec: 5759.9). Total num frames: 211822592. Throughput: 0: 5170.7. Samples: 211819238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:40,836][25689] Avg episode reward: [(0, '-48.114')] [2022-07-09 10:23:42,114][26022] Updated weights on worker 0-0, policy_version 206866 (0.00086) [2022-07-09 10:23:43,977][26022] Updated weights on worker 0-0, policy_version 206876 (0.00076) [2022-07-09 10:23:45,517][26022] Updated weights on worker 0-0, policy_version 206886 (0.00087) [2022-07-09 10:23:45,852][25689] Fps is (10 sec: 6019.3, 60 sec: 5797.9, 300 sec: 5766.6). Total num frames: 211853312. Throughput: 0: 6068.5. Samples: 211854626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:45,854][25689] Avg episode reward: [(0, '-48.833')] [2022-07-09 10:23:47,415][26022] Updated weights on worker 0-0, policy_version 206896 (0.00082) [2022-07-09 10:23:49,267][26022] Updated weights on worker 0-0, policy_version 206906 (0.00081) [2022-07-09 10:23:50,908][25689] Fps is (10 sec: 5794.2, 60 sec: 5763.6, 300 sec: 5759.9). Total num frames: 211880960. Throughput: 0: 6078.5. Samples: 211889780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:50,909][25689] Avg episode reward: [(0, '-48.073')] [2022-07-09 10:23:50,952][26022] Updated weights on worker 0-0, policy_version 206916 (0.00087) [2022-07-09 10:23:52,684][26022] Updated weights on worker 0-0, policy_version 206926 (0.00085) [2022-07-09 10:23:54,362][26022] Updated weights on worker 0-0, policy_version 206936 (0.00087) [2022-07-09 10:23:56,001][25689] Fps is (10 sec: 5751.3, 60 sec: 5794.8, 300 sec: 5761.9). Total num frames: 211911680. Throughput: 0: 5217.4. Samples: 211907024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:23:56,001][25689] Avg episode reward: [(0, '-49.229')] [2022-07-09 10:23:56,300][26022] Updated weights on worker 0-0, policy_version 206946 (0.00085) [2022-07-09 10:23:57,943][26022] Updated weights on worker 0-0, policy_version 206956 (0.00083) [2022-07-09 10:23:59,648][26022] Updated weights on worker 0-0, policy_version 206966 (0.00087) [2022-07-09 10:24:01,037][25689] Fps is (10 sec: 5863.6, 60 sec: 5796.0, 300 sec: 5768.3). Total num frames: 211940352. Throughput: 0: 6066.7. Samples: 211941800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:01,038][25689] Avg episode reward: [(0, '-49.641')] [2022-07-09 10:24:01,968][26022] Updated weights on worker 0-0, policy_version 206976 (0.00087) [2022-07-09 10:24:03,664][26022] Updated weights on worker 0-0, policy_version 206986 (0.00091) [2022-07-09 10:24:05,351][26022] Updated weights on worker 0-0, policy_version 206996 (0.00083) [2022-07-09 10:24:06,071][25689] Fps is (10 sec: 5491.2, 60 sec: 5761.0, 300 sec: 5759.0). Total num frames: 211966976. Throughput: 0: 5925.0. Samples: 211974422. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:06,072][25689] Avg episode reward: [(0, '-48.875')] [2022-07-09 10:24:07,125][26022] Updated weights on worker 0-0, policy_version 207006 (0.00083) [2022-07-09 10:24:08,892][26022] Updated weights on worker 0-0, policy_version 207016 (0.00086) [2022-07-09 10:24:10,724][26022] Updated weights on worker 0-0, policy_version 207026 (0.00088) [2022-07-09 10:24:11,109][25689] Fps is (10 sec: 5591.8, 60 sec: 5777.6, 300 sec: 5760.1). Total num frames: 211996672. Throughput: 0: 5041.0. Samples: 211991614. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:11,111][25689] Avg episode reward: [(0, '-48.512')] [2022-07-09 10:24:12,579][26022] Updated weights on worker 0-0, policy_version 207036 (0.00082) [2022-07-09 10:24:14,247][26022] Updated weights on worker 0-0, policy_version 207046 (0.00084) [2022-07-09 10:24:16,183][25689] Fps is (10 sec: 5670.7, 60 sec: 5763.5, 300 sec: 5755.7). Total num frames: 212024320. Throughput: 0: 5903.2. Samples: 212026164. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:16,184][25689] Avg episode reward: [(0, '-48.622')] [2022-07-09 10:24:16,306][26022] Updated weights on worker 0-0, policy_version 207056 (0.00090) [2022-07-09 10:24:17,795][26022] Updated weights on worker 0-0, policy_version 207066 (0.00079) [2022-07-09 10:24:19,565][26022] Updated weights on worker 0-0, policy_version 207076 (0.00086) [2022-07-09 10:24:21,201][25689] Fps is (10 sec: 5783.7, 60 sec: 5779.9, 300 sec: 5762.5). Total num frames: 212055040. Throughput: 0: 5916.5. Samples: 212061098. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:21,202][25689] Avg episode reward: [(0, '-48.978')] [2022-07-09 10:24:21,339][26022] Updated weights on worker 0-0, policy_version 207086 (0.00097) [2022-07-09 10:24:23,183][26022] Updated weights on worker 0-0, policy_version 207096 (0.00886) [2022-07-09 10:24:24,992][26022] Updated weights on worker 0-0, policy_version 207106 (0.00081) [2022-07-09 10:24:26,218][25689] Fps is (10 sec: 5918.5, 60 sec: 5747.9, 300 sec: 5762.3). Total num frames: 212083712. Throughput: 0: 5162.5. Samples: 212078432. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:26,219][25689] Avg episode reward: [(0, '-49.244')] [2022-07-09 10:24:26,805][26022] Updated weights on worker 0-0, policy_version 207116 (0.00086) [2022-07-09 10:24:28,518][26022] Updated weights on worker 0-0, policy_version 207126 (0.00082) [2022-07-09 10:24:30,439][26022] Updated weights on worker 0-0, policy_version 207136 (0.00091) [2022-07-09 10:24:31,270][25689] Fps is (10 sec: 5593.4, 60 sec: 5778.7, 300 sec: 5752.0). Total num frames: 212111360. Throughput: 0: 6017.8. Samples: 212112938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:31,271][25689] Avg episode reward: [(0, '-48.549')] [2022-07-09 10:24:32,202][26022] Updated weights on worker 0-0, policy_version 207146 (0.00087) [2022-07-09 10:24:33,856][26022] Updated weights on worker 0-0, policy_version 207156 (0.00092) [2022-07-09 10:24:35,646][26022] Updated weights on worker 0-0, policy_version 207166 (0.00088) [2022-07-09 10:24:36,323][25689] Fps is (10 sec: 5776.1, 60 sec: 5770.9, 300 sec: 5761.6). Total num frames: 212142080. Throughput: 0: 6024.8. Samples: 212147504. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 10:24:36,324][25689] Avg episode reward: [(0, '-49.381')] [2022-07-09 10:24:37,515][26022] Updated weights on worker 0-0, policy_version 207176 (0.00089) [2022-07-09 10:24:39,486][26022] Updated weights on worker 0-0, policy_version 207186 (0.00086) [2022-07-09 10:24:40,960][26022] Updated weights on worker 0-0, policy_version 207196 (0.00094) [2022-07-09 10:24:41,403][25689] Fps is (10 sec: 5861.2, 60 sec: 5748.2, 300 sec: 5760.6). Total num frames: 212170752. Throughput: 0: 5125.9. Samples: 212164652. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:24:41,404][25689] Avg episode reward: [(0, '-49.256')] [2022-07-09 10:24:42,914][26022] Updated weights on worker 0-0, policy_version 207206 (0.00104) [2022-07-09 10:24:44,515][26022] Updated weights on worker 0-0, policy_version 207216 (0.00083) [2022-07-09 10:24:46,435][25689] Fps is (10 sec: 5569.8, 60 sec: 5696.1, 300 sec: 5749.9). Total num frames: 212198400. Throughput: 0: 5984.8. Samples: 212199424. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:24:46,435][25689] Avg episode reward: [(0, '-49.606')] [2022-07-09 10:24:46,442][26022] Updated weights on worker 0-0, policy_version 207226 (0.00083) [2022-07-09 10:24:47,854][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:24:47,876][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000207235_212208640.pth [2022-07-09 10:24:47,876][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000205210_210135040.pth [2022-07-09 10:24:48,142][26022] Updated weights on worker 0-0, policy_version 207236 (0.00082) [2022-07-09 10:24:49,826][26022] Updated weights on worker 0-0, policy_version 207246 (0.00087) [2022-07-09 10:24:51,521][25689] Fps is (10 sec: 5768.8, 60 sec: 5744.0, 300 sec: 5763.5). Total num frames: 212229120. Throughput: 0: 5997.6. Samples: 212234394. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:24:51,521][25689] Avg episode reward: [(0, '-48.934')] [2022-07-09 10:24:51,620][26022] Updated weights on worker 0-0, policy_version 207256 (0.00091) [2022-07-09 10:24:53,383][26022] Updated weights on worker 0-0, policy_version 207266 (0.00088) [2022-07-09 10:24:55,024][26022] Updated weights on worker 0-0, policy_version 207276 (0.00083) [2022-07-09 10:24:56,575][25689] Fps is (10 sec: 5958.3, 60 sec: 5730.8, 300 sec: 5762.7). Total num frames: 212258816. Throughput: 0: 5165.0. Samples: 212252104. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:24:56,575][25689] Avg episode reward: [(0, '-49.352')] [2022-07-09 10:24:56,924][26022] Updated weights on worker 0-0, policy_version 207286 (0.00087) [2022-07-09 10:24:58,660][26022] Updated weights on worker 0-0, policy_version 207296 (0.00094) [2022-07-09 10:25:00,432][26022] Updated weights on worker 0-0, policy_version 207306 (0.00088) [2022-07-09 10:25:01,673][25689] Fps is (10 sec: 5749.4, 60 sec: 5725.0, 300 sec: 5764.5). Total num frames: 212287488. Throughput: 0: 6023.5. Samples: 212286746. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:01,675][25689] Avg episode reward: [(0, '-49.500')] [2022-07-09 10:25:02,286][26022] Updated weights on worker 0-0, policy_version 207316 (0.00090) [2022-07-09 10:25:04,300][26022] Updated weights on worker 0-0, policy_version 207326 (0.00086) [2022-07-09 10:25:06,212][26022] Updated weights on worker 0-0, policy_version 207336 (0.00094) [2022-07-09 10:25:06,696][25689] Fps is (10 sec: 5564.5, 60 sec: 5742.8, 300 sec: 5764.7). Total num frames: 212315136. Throughput: 0: 5922.8. Samples: 212319428. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:06,697][25689] Avg episode reward: [(0, '-49.413')] [2022-07-09 10:25:07,856][26022] Updated weights on worker 0-0, policy_version 207346 (0.00087) [2022-07-09 10:25:09,724][26022] Updated weights on worker 0-0, policy_version 207356 (0.00092) [2022-07-09 10:25:11,364][26022] Updated weights on worker 0-0, policy_version 207366 (0.00620) [2022-07-09 10:25:11,707][25689] Fps is (10 sec: 5817.2, 60 sec: 5762.4, 300 sec: 5765.8). Total num frames: 212345856. Throughput: 0: 5076.0. Samples: 212336856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:11,707][25689] Avg episode reward: [(0, '-49.494')] [2022-07-09 10:25:13,218][26022] Updated weights on worker 0-0, policy_version 207376 (0.00093) [2022-07-09 10:25:14,815][26022] Updated weights on worker 0-0, policy_version 207386 (0.00084) [2022-07-09 10:25:16,487][26022] Updated weights on worker 0-0, policy_version 207396 (0.00086) [2022-07-09 10:25:16,763][25689] Fps is (10 sec: 5900.0, 60 sec: 5781.0, 300 sec: 5765.3). Total num frames: 212374528. Throughput: 0: 5949.9. Samples: 212372220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:16,763][25689] Avg episode reward: [(0, '-50.499')] [2022-07-09 10:25:18,437][26022] Updated weights on worker 0-0, policy_version 207406 (0.00089) [2022-07-09 10:25:19,958][26022] Updated weights on worker 0-0, policy_version 207416 (0.00087) [2022-07-09 10:25:21,765][25689] Fps is (10 sec: 5700.8, 60 sec: 5748.6, 300 sec: 5759.2). Total num frames: 212403200. Throughput: 0: 6004.3. Samples: 212407388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:21,767][25689] Avg episode reward: [(0, '-49.937')] [2022-07-09 10:25:21,958][26022] Updated weights on worker 0-0, policy_version 207426 (0.00081) [2022-07-09 10:25:23,539][26022] Updated weights on worker 0-0, policy_version 207436 (0.00085) [2022-07-09 10:25:25,428][26022] Updated weights on worker 0-0, policy_version 207446 (0.00087) [2022-07-09 10:25:26,778][25689] Fps is (10 sec: 5828.0, 60 sec: 5766.0, 300 sec: 5766.3). Total num frames: 212432896. Throughput: 0: 5256.1. Samples: 212424978. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:26,779][25689] Avg episode reward: [(0, '-50.178')] [2022-07-09 10:25:27,154][26022] Updated weights on worker 0-0, policy_version 207456 (0.00086) [2022-07-09 10:25:28,873][26022] Updated weights on worker 0-0, policy_version 207466 (0.00617) [2022-07-09 10:25:30,796][26022] Updated weights on worker 0-0, policy_version 207476 (0.00100) [2022-07-09 10:25:31,783][25689] Fps is (10 sec: 5826.7, 60 sec: 5787.4, 300 sec: 5760.6). Total num frames: 212461568. Throughput: 0: 6116.5. Samples: 212459652. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:31,783][25689] Avg episode reward: [(0, '-50.113')] [2022-07-09 10:25:32,493][26022] Updated weights on worker 0-0, policy_version 207486 (0.00086) [2022-07-09 10:25:34,301][26022] Updated weights on worker 0-0, policy_version 207496 (0.00094) [2022-07-09 10:25:36,032][26022] Updated weights on worker 0-0, policy_version 207506 (0.01030) [2022-07-09 10:25:36,851][25689] Fps is (10 sec: 5794.5, 60 sec: 5769.0, 300 sec: 5766.5). Total num frames: 212491264. Throughput: 0: 6074.5. Samples: 212494244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:36,851][25689] Avg episode reward: [(0, '-49.565')] [2022-07-09 10:25:37,790][26022] Updated weights on worker 0-0, policy_version 207516 (0.00089) [2022-07-09 10:25:39,595][26022] Updated weights on worker 0-0, policy_version 207526 (0.00086) [2022-07-09 10:25:41,397][26022] Updated weights on worker 0-0, policy_version 207536 (0.00089) [2022-07-09 10:25:41,857][25689] Fps is (10 sec: 5895.4, 60 sec: 5793.0, 300 sec: 5766.7). Total num frames: 212520960. Throughput: 0: 6055.3. Samples: 212529048. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:41,858][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 10:25:43,209][26022] Updated weights on worker 0-0, policy_version 207546 (0.00094) [2022-07-09 10:25:44,985][26022] Updated weights on worker 0-0, policy_version 207556 (0.00091) [2022-07-09 10:25:46,739][26022] Updated weights on worker 0-0, policy_version 207566 (0.00089) [2022-07-09 10:25:46,863][25689] Fps is (10 sec: 5727.1, 60 sec: 5795.4, 300 sec: 5760.3). Total num frames: 212548608. Throughput: 0: 6033.9. Samples: 212546172. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:46,864][25689] Avg episode reward: [(0, '-49.809')] [2022-07-09 10:25:48,493][26022] Updated weights on worker 0-0, policy_version 207576 (0.00082) [2022-07-09 10:25:50,258][26022] Updated weights on worker 0-0, policy_version 207586 (0.00095) [2022-07-09 10:25:51,908][25689] Fps is (10 sec: 5603.2, 60 sec: 5765.4, 300 sec: 5756.9). Total num frames: 212577280. Throughput: 0: 6043.1. Samples: 212581274. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:51,909][25689] Avg episode reward: [(0, '-48.919')] [2022-07-09 10:25:51,912][26022] Updated weights on worker 0-0, policy_version 207596 (0.00083) [2022-07-09 10:25:53,841][26022] Updated weights on worker 0-0, policy_version 207606 (0.00088) [2022-07-09 10:25:55,251][26022] Updated weights on worker 0-0, policy_version 207616 (0.00095) [2022-07-09 10:25:56,968][25689] Fps is (10 sec: 5674.8, 60 sec: 5747.9, 300 sec: 5759.2). Total num frames: 212605952. Throughput: 0: 6061.4. Samples: 212616186. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:25:56,969][25689] Avg episode reward: [(0, '-48.828')] [2022-07-09 10:25:57,399][26022] Updated weights on worker 0-0, policy_version 207626 (0.00086) [2022-07-09 10:25:58,950][26022] Updated weights on worker 0-0, policy_version 207636 (0.00098) [2022-07-09 10:26:00,811][26022] Updated weights on worker 0-0, policy_version 207646 (0.00092) [2022-07-09 10:26:02,000][25689] Fps is (10 sec: 5783.9, 60 sec: 5771.2, 300 sec: 5765.6). Total num frames: 212635648. Throughput: 0: 5196.6. Samples: 212633722. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:02,007][25689] Avg episode reward: [(0, '-49.492')] [2022-07-09 10:26:02,984][26022] Updated weights on worker 0-0, policy_version 207656 (0.00082) [2022-07-09 10:26:04,651][26022] Updated weights on worker 0-0, policy_version 207666 (0.00082) [2022-07-09 10:26:06,513][26022] Updated weights on worker 0-0, policy_version 207676 (0.00080) [2022-07-09 10:26:07,013][25689] Fps is (10 sec: 5708.8, 60 sec: 5772.2, 300 sec: 5759.2). Total num frames: 212663296. Throughput: 0: 5967.2. Samples: 212666410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:07,014][25689] Avg episode reward: [(0, '-49.186')] [2022-07-09 10:26:08,172][26022] Updated weights on worker 0-0, policy_version 207686 (0.00090) [2022-07-09 10:26:10,012][26022] Updated weights on worker 0-0, policy_version 207696 (0.00088) [2022-07-09 10:26:11,652][26022] Updated weights on worker 0-0, policy_version 207706 (0.00083) [2022-07-09 10:26:12,018][25689] Fps is (10 sec: 5723.9, 60 sec: 5755.7, 300 sec: 5765.1). Total num frames: 212692992. Throughput: 0: 5981.3. Samples: 212701558. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:12,019][25689] Avg episode reward: [(0, '-48.655')] [2022-07-09 10:26:13,349][26022] Updated weights on worker 0-0, policy_version 207716 (0.00087) [2022-07-09 10:26:15,106][26022] Updated weights on worker 0-0, policy_version 207726 (0.00086) [2022-07-09 10:26:16,922][26022] Updated weights on worker 0-0, policy_version 207736 (0.00090) [2022-07-09 10:26:17,066][25689] Fps is (10 sec: 5907.8, 60 sec: 5773.4, 300 sec: 5764.4). Total num frames: 212722688. Throughput: 0: 5116.7. Samples: 212719022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:17,067][25689] Avg episode reward: [(0, '-49.113')] [2022-07-09 10:26:18,657][26022] Updated weights on worker 0-0, policy_version 207746 (0.00089) [2022-07-09 10:26:20,718][26022] Updated weights on worker 0-0, policy_version 207756 (0.00092) [2022-07-09 10:26:22,101][25689] Fps is (10 sec: 5789.1, 60 sec: 5770.4, 300 sec: 5764.2). Total num frames: 212751360. Throughput: 0: 5971.9. Samples: 212753762. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:22,101][25689] Avg episode reward: [(0, '-49.263')] [2022-07-09 10:26:22,236][26022] Updated weights on worker 0-0, policy_version 207766 (0.00081) [2022-07-09 10:26:23,998][26022] Updated weights on worker 0-0, policy_version 207776 (0.00086) [2022-07-09 10:26:25,771][26022] Updated weights on worker 0-0, policy_version 207786 (0.00087) [2022-07-09 10:26:27,109][25689] Fps is (10 sec: 5709.9, 60 sec: 5753.8, 300 sec: 5764.3). Total num frames: 212780032. Throughput: 0: 6079.8. Samples: 212788590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:27,110][25689] Avg episode reward: [(0, '-49.181')] [2022-07-09 10:26:27,415][26022] Updated weights on worker 0-0, policy_version 207796 (0.00083) [2022-07-09 10:26:29,377][26022] Updated weights on worker 0-0, policy_version 207806 (0.00087) [2022-07-09 10:26:31,112][26022] Updated weights on worker 0-0, policy_version 207816 (0.00085) [2022-07-09 10:26:32,122][25689] Fps is (10 sec: 5722.0, 60 sec: 5753.0, 300 sec: 5765.6). Total num frames: 212808704. Throughput: 0: 5197.8. Samples: 212806056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:32,123][25689] Avg episode reward: [(0, '-49.551')] [2022-07-09 10:26:32,892][26022] Updated weights on worker 0-0, policy_version 207826 (0.00094) [2022-07-09 10:26:34,607][26022] Updated weights on worker 0-0, policy_version 207836 (0.01065) [2022-07-09 10:26:36,510][26022] Updated weights on worker 0-0, policy_version 207846 (0.00087) [2022-07-09 10:26:37,213][25689] Fps is (10 sec: 5776.7, 60 sec: 5750.9, 300 sec: 5763.8). Total num frames: 212838400. Throughput: 0: 6038.4. Samples: 212840676. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:37,214][25689] Avg episode reward: [(0, '-49.322')] [2022-07-09 10:26:38,099][26022] Updated weights on worker 0-0, policy_version 207856 (0.00089) [2022-07-09 10:26:40,088][26022] Updated weights on worker 0-0, policy_version 207866 (0.00095) [2022-07-09 10:26:41,612][26022] Updated weights on worker 0-0, policy_version 207876 (0.00087) [2022-07-09 10:26:42,217][25689] Fps is (10 sec: 5883.6, 60 sec: 5751.1, 300 sec: 5770.7). Total num frames: 212868096. Throughput: 0: 6062.0. Samples: 212875708. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:42,217][25689] Avg episode reward: [(0, '-49.248')] [2022-07-09 10:26:43,472][26022] Updated weights on worker 0-0, policy_version 207886 (0.00083) [2022-07-09 10:26:45,241][26022] Updated weights on worker 0-0, policy_version 207896 (0.00080) [2022-07-09 10:26:46,917][26022] Updated weights on worker 0-0, policy_version 207906 (0.00094) [2022-07-09 10:26:47,219][25689] Fps is (10 sec: 5833.3, 60 sec: 5768.5, 300 sec: 5760.8). Total num frames: 212896768. Throughput: 0: 5199.3. Samples: 212893148. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 10:26:47,220][25689] Avg episode reward: [(0, '-48.231')] [2022-07-09 10:26:47,993][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:26:48,001][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000207912_212901888.pth [2022-07-09 10:26:48,002][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000205882_210823168.pth [2022-07-09 10:26:48,895][26022] Updated weights on worker 0-0, policy_version 207916 (0.00090) [2022-07-09 10:26:50,264][26022] Updated weights on worker 0-0, policy_version 207926 (0.00081) [2022-07-09 10:26:52,228][25689] Fps is (10 sec: 5728.0, 60 sec: 5771.9, 300 sec: 5761.5). Total num frames: 212925440. Throughput: 0: 6083.1. Samples: 212928362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:26:52,229][25689] Avg episode reward: [(0, '-48.093')] [2022-07-09 10:26:52,278][26022] Updated weights on worker 0-0, policy_version 207936 (0.00080) [2022-07-09 10:26:53,849][26022] Updated weights on worker 0-0, policy_version 207946 (0.00088) [2022-07-09 10:26:55,705][26022] Updated weights on worker 0-0, policy_version 207956 (0.00088) [2022-07-09 10:26:57,266][25689] Fps is (10 sec: 5809.6, 60 sec: 5791.0, 300 sec: 5764.4). Total num frames: 212955136. Throughput: 0: 6104.8. Samples: 212963094. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:26:57,267][25689] Avg episode reward: [(0, '-48.034')] [2022-07-09 10:26:57,747][26022] Updated weights on worker 0-0, policy_version 207966 (0.00083) [2022-07-09 10:26:59,166][26022] Updated weights on worker 0-0, policy_version 207976 (0.00088) [2022-07-09 10:27:01,003][26022] Updated weights on worker 0-0, policy_version 207986 (0.00087) [2022-07-09 10:27:02,279][25689] Fps is (10 sec: 5603.7, 60 sec: 5741.9, 300 sec: 5767.8). Total num frames: 212981760. Throughput: 0: 5231.3. Samples: 212980656. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:02,279][25689] Avg episode reward: [(0, '-47.871')] [2022-07-09 10:27:03,402][26022] Updated weights on worker 0-0, policy_version 207996 (0.00088) [2022-07-09 10:27:04,892][26022] Updated weights on worker 0-0, policy_version 208006 (0.00078) [2022-07-09 10:27:07,127][26022] Updated weights on worker 0-0, policy_version 208016 (0.00086) [2022-07-09 10:27:07,291][25689] Fps is (10 sec: 5413.9, 60 sec: 5742.0, 300 sec: 5757.7). Total num frames: 213009408. Throughput: 0: 5953.7. Samples: 213012648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:07,291][25689] Avg episode reward: [(0, '-47.549')] [2022-07-09 10:27:08,493][26022] Updated weights on worker 0-0, policy_version 208026 (0.00086) [2022-07-09 10:27:10,592][26022] Updated weights on worker 0-0, policy_version 208036 (0.00097) [2022-07-09 10:27:12,068][26022] Updated weights on worker 0-0, policy_version 208046 (0.00079) [2022-07-09 10:27:12,301][25689] Fps is (10 sec: 5721.5, 60 sec: 5741.5, 300 sec: 5758.8). Total num frames: 213039104. Throughput: 0: 5933.3. Samples: 213047462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:12,302][25689] Avg episode reward: [(0, '-47.675')] [2022-07-09 10:27:13,846][26022] Updated weights on worker 0-0, policy_version 208056 (0.00104) [2022-07-09 10:27:15,735][26022] Updated weights on worker 0-0, policy_version 208066 (0.00083) [2022-07-09 10:27:17,411][25689] Fps is (10 sec: 5868.7, 60 sec: 5735.6, 300 sec: 5763.7). Total num frames: 213068800. Throughput: 0: 5048.4. Samples: 213064792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:17,412][25689] Avg episode reward: [(0, '-49.380')] [2022-07-09 10:27:17,600][26022] Updated weights on worker 0-0, policy_version 208076 (0.00086) [2022-07-09 10:27:19,259][26022] Updated weights on worker 0-0, policy_version 208086 (0.00099) [2022-07-09 10:27:21,207][26022] Updated weights on worker 0-0, policy_version 208096 (0.00093) [2022-07-09 10:27:22,414][25689] Fps is (10 sec: 5873.0, 60 sec: 5755.5, 300 sec: 5764.0). Total num frames: 213098496. Throughput: 0: 5918.7. Samples: 213099830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:22,414][25689] Avg episode reward: [(0, '-49.165')] [2022-07-09 10:27:22,750][26022] Updated weights on worker 0-0, policy_version 208106 (0.00097) [2022-07-09 10:27:24,744][26022] Updated weights on worker 0-0, policy_version 208116 (0.00093) [2022-07-09 10:27:26,355][26022] Updated weights on worker 0-0, policy_version 208126 (0.00083) [2022-07-09 10:27:27,440][25689] Fps is (10 sec: 5717.5, 60 sec: 5736.9, 300 sec: 5757.1). Total num frames: 213126144. Throughput: 0: 6052.9. Samples: 213134612. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:27,445][25689] Avg episode reward: [(0, '-48.907')] [2022-07-09 10:27:28,117][26022] Updated weights on worker 0-0, policy_version 208136 (0.00081) [2022-07-09 10:27:30,053][26022] Updated weights on worker 0-0, policy_version 208146 (0.00088) [2022-07-09 10:27:31,547][26022] Updated weights on worker 0-0, policy_version 208156 (0.00082) [2022-07-09 10:27:32,455][25689] Fps is (10 sec: 5812.8, 60 sec: 5770.7, 300 sec: 5766.3). Total num frames: 213156864. Throughput: 0: 5190.4. Samples: 213152070. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:32,456][25689] Avg episode reward: [(0, '-49.882')] [2022-07-09 10:27:33,453][26022] Updated weights on worker 0-0, policy_version 208166 (0.00088) [2022-07-09 10:27:35,206][26022] Updated weights on worker 0-0, policy_version 208176 (0.00079) [2022-07-09 10:27:36,930][26022] Updated weights on worker 0-0, policy_version 208186 (0.00086) [2022-07-09 10:27:37,505][25689] Fps is (10 sec: 5900.8, 60 sec: 5757.6, 300 sec: 5763.6). Total num frames: 213185536. Throughput: 0: 6064.1. Samples: 213186648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:37,506][25689] Avg episode reward: [(0, '-50.509')] [2022-07-09 10:27:38,733][26022] Updated weights on worker 0-0, policy_version 208196 (0.00099) [2022-07-09 10:27:40,574][26022] Updated weights on worker 0-0, policy_version 208206 (0.00084) [2022-07-09 10:27:42,200][26022] Updated weights on worker 0-0, policy_version 208216 (0.00050) [2022-07-09 10:27:42,519][25689] Fps is (10 sec: 5799.8, 60 sec: 5756.6, 300 sec: 5763.4). Total num frames: 213215232. Throughput: 0: 6045.2. Samples: 213221368. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:42,520][25689] Avg episode reward: [(0, '-49.625')] [2022-07-09 10:27:44,228][26022] Updated weights on worker 0-0, policy_version 208226 (0.00081) [2022-07-09 10:27:45,575][26022] Updated weights on worker 0-0, policy_version 208236 (0.00092) [2022-07-09 10:27:47,546][25689] Fps is (10 sec: 5711.1, 60 sec: 5737.3, 300 sec: 5757.0). Total num frames: 213242880. Throughput: 0: 5187.0. Samples: 213238900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:47,551][25689] Avg episode reward: [(0, '-49.517')] [2022-07-09 10:27:47,800][26022] Updated weights on worker 0-0, policy_version 208246 (0.00102) [2022-07-09 10:27:49,239][26022] Updated weights on worker 0-0, policy_version 208256 (0.00090) [2022-07-09 10:27:51,265][26022] Updated weights on worker 0-0, policy_version 208266 (0.00084) [2022-07-09 10:27:52,578][25689] Fps is (10 sec: 5700.6, 60 sec: 5752.0, 300 sec: 5761.1). Total num frames: 213272576. Throughput: 0: 6043.7. Samples: 213273688. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:52,579][25689] Avg episode reward: [(0, '-49.716')] [2022-07-09 10:27:52,918][26022] Updated weights on worker 0-0, policy_version 208276 (0.00090) [2022-07-09 10:27:54,626][26022] Updated weights on worker 0-0, policy_version 208286 (0.00085) [2022-07-09 10:27:56,357][26022] Updated weights on worker 0-0, policy_version 208296 (0.00088) [2022-07-09 10:27:57,616][25689] Fps is (10 sec: 5999.7, 60 sec: 5769.0, 300 sec: 5768.2). Total num frames: 213303296. Throughput: 0: 6060.8. Samples: 213308534. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:27:57,617][25689] Avg episode reward: [(0, '-48.856')] [2022-07-09 10:27:58,374][26022] Updated weights on worker 0-0, policy_version 208306 (0.00091) [2022-07-09 10:27:59,898][26022] Updated weights on worker 0-0, policy_version 208316 (0.00087) [2022-07-09 10:28:02,249][26022] Updated weights on worker 0-0, policy_version 208326 (0.00084) [2022-07-09 10:28:02,667][25689] Fps is (10 sec: 5481.0, 60 sec: 5731.4, 300 sec: 5753.8). Total num frames: 213327872. Throughput: 0: 5192.0. Samples: 213325976. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:02,668][25689] Avg episode reward: [(0, '-48.646')] [2022-07-09 10:28:03,733][26022] Updated weights on worker 0-0, policy_version 208336 (0.00083) [2022-07-09 10:28:05,663][26022] Updated weights on worker 0-0, policy_version 208346 (0.00083) [2022-07-09 10:28:07,539][26022] Updated weights on worker 0-0, policy_version 208356 (0.00085) [2022-07-09 10:28:07,671][25689] Fps is (10 sec: 5397.5, 60 sec: 5766.1, 300 sec: 5757.9). Total num frames: 213357568. Throughput: 0: 5943.9. Samples: 213358520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:07,672][25689] Avg episode reward: [(0, '-49.437')] [2022-07-09 10:28:08,988][26022] Updated weights on worker 0-0, policy_version 208366 (0.00087) [2022-07-09 10:28:11,156][26022] Updated weights on worker 0-0, policy_version 208376 (0.00091) [2022-07-09 10:28:12,587][26022] Updated weights on worker 0-0, policy_version 208386 (0.00095) [2022-07-09 10:28:12,683][25689] Fps is (10 sec: 5930.0, 60 sec: 5766.0, 300 sec: 5763.1). Total num frames: 213387264. Throughput: 0: 5946.3. Samples: 213393234. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:12,683][25689] Avg episode reward: [(0, '-49.376')] [2022-07-09 10:28:14,598][26022] Updated weights on worker 0-0, policy_version 208396 (0.00090) [2022-07-09 10:28:16,310][26022] Updated weights on worker 0-0, policy_version 208406 (0.00092) [2022-07-09 10:28:17,763][25689] Fps is (10 sec: 5884.8, 60 sec: 5768.7, 300 sec: 5761.8). Total num frames: 213416960. Throughput: 0: 5069.4. Samples: 213410670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:17,764][25689] Avg episode reward: [(0, '-48.990')] [2022-07-09 10:28:17,923][26022] Updated weights on worker 0-0, policy_version 208416 (0.00082) [2022-07-09 10:28:19,955][26022] Updated weights on worker 0-0, policy_version 208426 (0.00084) [2022-07-09 10:28:21,253][26022] Updated weights on worker 0-0, policy_version 208436 (0.00087) [2022-07-09 10:28:22,803][25689] Fps is (10 sec: 5767.5, 60 sec: 5748.4, 300 sec: 5754.8). Total num frames: 213445632. Throughput: 0: 5952.8. Samples: 213445840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:22,803][25689] Avg episode reward: [(0, '-49.062')] [2022-07-09 10:28:23,433][26022] Updated weights on worker 0-0, policy_version 208446 (0.00086) [2022-07-09 10:28:24,763][26022] Updated weights on worker 0-0, policy_version 208456 (0.00079) [2022-07-09 10:28:26,879][26022] Updated weights on worker 0-0, policy_version 208466 (0.00084) [2022-07-09 10:28:27,839][25689] Fps is (10 sec: 5691.4, 60 sec: 5764.4, 300 sec: 5764.8). Total num frames: 213474304. Throughput: 0: 6065.1. Samples: 213480840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:27,840][25689] Avg episode reward: [(0, '-49.262')] [2022-07-09 10:28:28,354][26022] Updated weights on worker 0-0, policy_version 208476 (0.00088) [2022-07-09 10:28:30,301][26022] Updated weights on worker 0-0, policy_version 208486 (0.00088) [2022-07-09 10:28:32,018][26022] Updated weights on worker 0-0, policy_version 208496 (0.00089) [2022-07-09 10:28:32,864][25689] Fps is (10 sec: 5801.2, 60 sec: 5746.5, 300 sec: 5760.3). Total num frames: 213504000. Throughput: 0: 5195.2. Samples: 213498080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:32,864][25689] Avg episode reward: [(0, '-48.342')] [2022-07-09 10:28:33,920][26022] Updated weights on worker 0-0, policy_version 208506 (0.00086) [2022-07-09 10:28:35,634][26022] Updated weights on worker 0-0, policy_version 208516 (0.00091) [2022-07-09 10:28:37,638][26022] Updated weights on worker 0-0, policy_version 208526 (0.00084) [2022-07-09 10:28:37,898][25689] Fps is (10 sec: 5802.7, 60 sec: 5748.0, 300 sec: 5756.6). Total num frames: 213532672. Throughput: 0: 6064.7. Samples: 213532778. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:37,898][25689] Avg episode reward: [(0, '-48.878')] [2022-07-09 10:28:39,254][26022] Updated weights on worker 0-0, policy_version 208536 (0.00090) [2022-07-09 10:28:41,074][26022] Updated weights on worker 0-0, policy_version 208546 (0.00082) [2022-07-09 10:28:42,606][26022] Updated weights on worker 0-0, policy_version 208556 (0.00079) [2022-07-09 10:28:42,923][25689] Fps is (10 sec: 5904.5, 60 sec: 5763.9, 300 sec: 5756.5). Total num frames: 213563392. Throughput: 0: 6053.6. Samples: 213567640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:42,923][25689] Avg episode reward: [(0, '-49.873')] [2022-07-09 10:28:44,628][26022] Updated weights on worker 0-0, policy_version 208566 (0.00081) [2022-07-09 10:28:46,182][26022] Updated weights on worker 0-0, policy_version 208576 (0.00091) [2022-07-09 10:28:47,931][25689] Fps is (10 sec: 5919.4, 60 sec: 5782.6, 300 sec: 5760.9). Total num frames: 213592064. Throughput: 0: 5190.0. Samples: 213585118. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:47,933][26022] Updated weights on worker 0-0, policy_version 208586 (0.00087) [2022-07-09 10:28:47,933][25689] Avg episode reward: [(0, '-49.876')] [2022-07-09 10:28:48,116][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:28:48,136][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000208587_213593088.pth [2022-07-09 10:28:48,137][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000206559_211516416.pth [2022-07-09 10:28:49,566][26022] Updated weights on worker 0-0, policy_version 208596 (0.00079) [2022-07-09 10:28:51,545][26022] Updated weights on worker 0-0, policy_version 208606 (0.00063) [2022-07-09 10:28:52,933][25689] Fps is (10 sec: 5626.0, 60 sec: 5751.6, 300 sec: 5752.3). Total num frames: 213619712. Throughput: 0: 6067.5. Samples: 213619852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:52,934][25689] Avg episode reward: [(0, '-49.465')] [2022-07-09 10:28:53,291][26022] Updated weights on worker 0-0, policy_version 208616 (0.00085) [2022-07-09 10:28:55,072][26022] Updated weights on worker 0-0, policy_version 208626 (0.00085) [2022-07-09 10:28:56,951][26022] Updated weights on worker 0-0, policy_version 208636 (0.00481) [2022-07-09 10:28:57,989][25689] Fps is (10 sec: 5599.6, 60 sec: 5715.9, 300 sec: 5751.9). Total num frames: 213648384. Throughput: 0: 6066.8. Samples: 213654668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:28:57,991][25689] Avg episode reward: [(0, '-49.764')] [2022-07-09 10:28:58,578][26022] Updated weights on worker 0-0, policy_version 208646 (0.00559) [2022-07-09 10:29:00,434][26022] Updated weights on worker 0-0, policy_version 208656 (0.00089) [2022-07-09 10:29:02,375][26022] Updated weights on worker 0-0, policy_version 208666 (0.00104) [2022-07-09 10:29:03,019][25689] Fps is (10 sec: 5482.6, 60 sec: 5751.8, 300 sec: 5752.0). Total num frames: 213675008. Throughput: 0: 5202.6. Samples: 213672194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:29:03,020][25689] Avg episode reward: [(0, '-49.619')] [2022-07-09 10:29:04,249][26022] Updated weights on worker 0-0, policy_version 208676 (0.00081) [2022-07-09 10:29:06,293][26022] Updated weights on worker 0-0, policy_version 208686 (0.00086) [2022-07-09 10:29:07,797][26022] Updated weights on worker 0-0, policy_version 208696 (0.00089) [2022-07-09 10:29:08,039][25689] Fps is (10 sec: 5808.0, 60 sec: 5784.3, 300 sec: 5759.2). Total num frames: 213706752. Throughput: 0: 5945.6. Samples: 213704670. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:08,041][25689] Avg episode reward: [(0, '-49.143')] [2022-07-09 10:29:09,785][26022] Updated weights on worker 0-0, policy_version 208706 (0.00090) [2022-07-09 10:29:11,431][26022] Updated weights on worker 0-0, policy_version 208716 (0.00084) [2022-07-09 10:29:13,053][25689] Fps is (10 sec: 5919.0, 60 sec: 5750.1, 300 sec: 5760.4). Total num frames: 213734400. Throughput: 0: 5952.2. Samples: 213739610. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:13,055][25689] Avg episode reward: [(0, '-48.392')] [2022-07-09 10:29:13,203][26022] Updated weights on worker 0-0, policy_version 208726 (0.00092) [2022-07-09 10:29:15,017][26022] Updated weights on worker 0-0, policy_version 208736 (0.00086) [2022-07-09 10:29:16,693][26022] Updated weights on worker 0-0, policy_version 208746 (0.00091) [2022-07-09 10:29:18,210][25689] Fps is (10 sec: 5537.4, 60 sec: 5726.0, 300 sec: 5750.8). Total num frames: 213763072. Throughput: 0: 5059.9. Samples: 213756982. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:18,219][25689] Avg episode reward: [(0, '-47.989')] [2022-07-09 10:29:18,671][26022] Updated weights on worker 0-0, policy_version 208756 (0.00083) [2022-07-09 10:29:20,281][26022] Updated weights on worker 0-0, policy_version 208766 (0.00084) [2022-07-09 10:29:22,155][26022] Updated weights on worker 0-0, policy_version 208776 (0.00088) [2022-07-09 10:29:23,227][25689] Fps is (10 sec: 5736.9, 60 sec: 5744.9, 300 sec: 5754.2). Total num frames: 213792768. Throughput: 0: 5898.3. Samples: 213791388. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:23,228][25689] Avg episode reward: [(0, '-47.985')] [2022-07-09 10:29:23,837][26022] Updated weights on worker 0-0, policy_version 208786 (0.00089) [2022-07-09 10:29:25,497][26022] Updated weights on worker 0-0, policy_version 208796 (0.00087) [2022-07-09 10:29:27,519][26022] Updated weights on worker 0-0, policy_version 208806 (0.00086) [2022-07-09 10:29:28,270][25689] Fps is (10 sec: 5802.0, 60 sec: 5744.4, 300 sec: 5757.9). Total num frames: 213821440. Throughput: 0: 6015.3. Samples: 213826364. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:28,270][25689] Avg episode reward: [(0, '-48.253')] [2022-07-09 10:29:29,247][26022] Updated weights on worker 0-0, policy_version 208816 (0.00085) [2022-07-09 10:29:30,973][26022] Updated weights on worker 0-0, policy_version 208826 (0.00093) [2022-07-09 10:29:32,797][26022] Updated weights on worker 0-0, policy_version 208836 (0.00089) [2022-07-09 10:29:33,271][25689] Fps is (10 sec: 5709.6, 60 sec: 5729.7, 300 sec: 5752.0). Total num frames: 213850112. Throughput: 0: 5138.3. Samples: 213843494. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:33,271][25689] Avg episode reward: [(0, '-48.604')] [2022-07-09 10:29:34,495][26022] Updated weights on worker 0-0, policy_version 208846 (0.00090) [2022-07-09 10:29:36,358][26022] Updated weights on worker 0-0, policy_version 208856 (0.00084) [2022-07-09 10:29:38,053][26022] Updated weights on worker 0-0, policy_version 208866 (0.00089) [2022-07-09 10:29:38,371][25689] Fps is (10 sec: 5676.8, 60 sec: 5723.4, 300 sec: 5751.6). Total num frames: 213878784. Throughput: 0: 6002.9. Samples: 213878010. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:38,372][25689] Avg episode reward: [(0, '-48.459')] [2022-07-09 10:29:39,724][26022] Updated weights on worker 0-0, policy_version 208876 (0.00094) [2022-07-09 10:29:41,974][26022] Updated weights on worker 0-0, policy_version 208886 (0.00088) [2022-07-09 10:29:43,360][26022] Updated weights on worker 0-0, policy_version 208896 (0.00091) [2022-07-09 10:29:43,403][25689] Fps is (10 sec: 5861.9, 60 sec: 5722.8, 300 sec: 5761.9). Total num frames: 213909504. Throughput: 0: 6013.9. Samples: 213912720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:43,403][25689] Avg episode reward: [(0, '-48.952')] [2022-07-09 10:29:45,316][26022] Updated weights on worker 0-0, policy_version 208906 (0.00090) [2022-07-09 10:29:46,908][26022] Updated weights on worker 0-0, policy_version 208916 (0.00093) [2022-07-09 10:29:48,406][25689] Fps is (10 sec: 5918.6, 60 sec: 5723.3, 300 sec: 5756.6). Total num frames: 213938176. Throughput: 0: 6022.1. Samples: 213947626. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:48,406][25689] Avg episode reward: [(0, '-48.825')] [2022-07-09 10:29:48,684][26022] Updated weights on worker 0-0, policy_version 208926 (0.00093) [2022-07-09 10:29:50,646][26022] Updated weights on worker 0-0, policy_version 208936 (0.00090) [2022-07-09 10:29:52,123][26022] Updated weights on worker 0-0, policy_version 208946 (0.00081) [2022-07-09 10:29:53,440][25689] Fps is (10 sec: 5611.1, 60 sec: 5720.2, 300 sec: 5750.1). Total num frames: 213965824. Throughput: 0: 6018.8. Samples: 213964888. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:53,440][25689] Avg episode reward: [(0, '-49.095')] [2022-07-09 10:29:54,208][26022] Updated weights on worker 0-0, policy_version 208956 (0.00996) [2022-07-09 10:29:55,836][26022] Updated weights on worker 0-0, policy_version 208966 (0.00092) [2022-07-09 10:29:57,547][26022] Updated weights on worker 0-0, policy_version 208976 (0.00080) [2022-07-09 10:29:58,528][25689] Fps is (10 sec: 5867.5, 60 sec: 5767.9, 300 sec: 5760.7). Total num frames: 213997568. Throughput: 0: 6037.7. Samples: 213999712. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:29:58,528][25689] Avg episode reward: [(0, '-48.195')] [2022-07-09 10:29:59,315][26022] Updated weights on worker 0-0, policy_version 208986 (0.00092) [2022-07-09 10:30:01,106][26022] Updated weights on worker 0-0, policy_version 208996 (0.00086) [2022-07-09 10:30:03,340][26022] Updated weights on worker 0-0, policy_version 209006 (0.00088) [2022-07-09 10:30:03,541][25689] Fps is (10 sec: 5575.5, 60 sec: 5735.7, 300 sec: 5750.5). Total num frames: 214022144. Throughput: 0: 5947.3. Samples: 214032490. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:03,542][25689] Avg episode reward: [(0, '-47.929')] [2022-07-09 10:30:05,030][26022] Updated weights on worker 0-0, policy_version 209016 (0.00099) [2022-07-09 10:30:06,744][26022] Updated weights on worker 0-0, policy_version 209026 (0.00085) [2022-07-09 10:30:08,544][25689] Fps is (10 sec: 5418.5, 60 sec: 5703.4, 300 sec: 5747.2). Total num frames: 214051840. Throughput: 0: 5076.2. Samples: 214049852. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:08,545][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 10:30:08,708][26022] Updated weights on worker 0-0, policy_version 209036 (0.00091) [2022-07-09 10:30:10,201][26022] Updated weights on worker 0-0, policy_version 209046 (0.00086) [2022-07-09 10:30:12,179][26022] Updated weights on worker 0-0, policy_version 209056 (0.00076) [2022-07-09 10:30:13,575][25689] Fps is (10 sec: 5919.3, 60 sec: 5735.8, 300 sec: 5751.1). Total num frames: 214081536. Throughput: 0: 5937.2. Samples: 214084432. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:13,575][25689] Avg episode reward: [(0, '-48.322')] [2022-07-09 10:30:13,869][26022] Updated weights on worker 0-0, policy_version 209066 (0.00084) [2022-07-09 10:30:15,530][26022] Updated weights on worker 0-0, policy_version 209076 (0.00384) [2022-07-09 10:30:17,431][26022] Updated weights on worker 0-0, policy_version 209086 (0.00080) [2022-07-09 10:30:18,692][25689] Fps is (10 sec: 5852.5, 60 sec: 5756.4, 300 sec: 5752.4). Total num frames: 214111232. Throughput: 0: 5948.8. Samples: 214119664. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:18,693][25689] Avg episode reward: [(0, '-47.890')] [2022-07-09 10:30:19,075][26022] Updated weights on worker 0-0, policy_version 209096 (0.00857) [2022-07-09 10:30:21,012][26022] Updated weights on worker 0-0, policy_version 209106 (0.00087) [2022-07-09 10:30:22,715][26022] Updated weights on worker 0-0, policy_version 209116 (0.00088) [2022-07-09 10:30:23,697][25689] Fps is (10 sec: 5665.1, 60 sec: 5723.7, 300 sec: 5745.6). Total num frames: 214138880. Throughput: 0: 5179.8. Samples: 214136894. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:23,698][25689] Avg episode reward: [(0, '-48.287')] [2022-07-09 10:30:24,401][26022] Updated weights on worker 0-0, policy_version 209126 (0.00486) [2022-07-09 10:30:26,355][26022] Updated weights on worker 0-0, policy_version 209136 (0.00674) [2022-07-09 10:30:28,051][26022] Updated weights on worker 0-0, policy_version 209146 (0.00098) [2022-07-09 10:30:28,751][25689] Fps is (10 sec: 5802.8, 60 sec: 5756.5, 300 sec: 5751.6). Total num frames: 214169600. Throughput: 0: 6023.8. Samples: 214171572. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:28,752][25689] Avg episode reward: [(0, '-48.331')] [2022-07-09 10:30:29,736][26022] Updated weights on worker 0-0, policy_version 209156 (0.01031) [2022-07-09 10:30:31,486][26022] Updated weights on worker 0-0, policy_version 209166 (0.00084) [2022-07-09 10:30:33,371][26022] Updated weights on worker 0-0, policy_version 209176 (0.00086) [2022-07-09 10:30:33,780][25689] Fps is (10 sec: 5890.4, 60 sec: 5753.9, 300 sec: 5748.9). Total num frames: 214198272. Throughput: 0: 6048.1. Samples: 214206634. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:33,780][25689] Avg episode reward: [(0, '-47.837')] [2022-07-09 10:30:35,160][26022] Updated weights on worker 0-0, policy_version 209186 (0.00093) [2022-07-09 10:30:36,957][26022] Updated weights on worker 0-0, policy_version 209196 (0.00087) [2022-07-09 10:30:38,443][26022] Updated weights on worker 0-0, policy_version 209206 (0.00081) [2022-07-09 10:30:38,901][25689] Fps is (10 sec: 5851.0, 60 sec: 5785.7, 300 sec: 5750.1). Total num frames: 214228992. Throughput: 0: 5165.2. Samples: 214224048. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:38,902][25689] Avg episode reward: [(0, '-48.571')] [2022-07-09 10:30:40,592][26022] Updated weights on worker 0-0, policy_version 209216 (0.00100) [2022-07-09 10:30:42,051][26022] Updated weights on worker 0-0, policy_version 209226 (0.00088) [2022-07-09 10:30:43,947][25689] Fps is (10 sec: 5740.8, 60 sec: 5733.6, 300 sec: 5749.3). Total num frames: 214256640. Throughput: 0: 6023.6. Samples: 214258872. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:43,948][25689] Avg episode reward: [(0, '-48.966')] [2022-07-09 10:30:44,005][26022] Updated weights on worker 0-0, policy_version 209236 (0.00099) [2022-07-09 10:30:45,757][26022] Updated weights on worker 0-0, policy_version 209246 (0.00409) [2022-07-09 10:30:47,371][26022] Updated weights on worker 0-0, policy_version 209256 (0.00095) [2022-07-09 10:30:48,228][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:30:48,236][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000209260_214282240.pth [2022-07-09 10:30:48,237][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000207235_212208640.pth [2022-07-09 10:30:48,949][25689] Fps is (10 sec: 5605.1, 60 sec: 5733.7, 300 sec: 5750.2). Total num frames: 214285312. Throughput: 0: 6042.4. Samples: 214293620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:48,950][25689] Avg episode reward: [(0, '-49.393')] [2022-07-09 10:30:49,272][26022] Updated weights on worker 0-0, policy_version 209266 (0.00092) [2022-07-09 10:30:51,167][26022] Updated weights on worker 0-0, policy_version 209276 (0.00086) [2022-07-09 10:30:52,780][26022] Updated weights on worker 0-0, policy_version 209286 (0.00088) [2022-07-09 10:30:53,973][25689] Fps is (10 sec: 5821.7, 60 sec: 5768.5, 300 sec: 5754.3). Total num frames: 214315008. Throughput: 0: 5165.0. Samples: 214310932. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:53,973][25689] Avg episode reward: [(0, '-49.347')] [2022-07-09 10:30:54,655][26022] Updated weights on worker 0-0, policy_version 209296 (0.00085) [2022-07-09 10:30:56,151][26022] Updated weights on worker 0-0, policy_version 209306 (0.00085) [2022-07-09 10:30:58,122][26022] Updated weights on worker 0-0, policy_version 209316 (0.00087) [2022-07-09 10:30:59,110][25689] Fps is (10 sec: 5945.9, 60 sec: 5746.9, 300 sec: 5755.7). Total num frames: 214345728. Throughput: 0: 6022.4. Samples: 214345754. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:30:59,111][25689] Avg episode reward: [(0, '-50.131')] [2022-07-09 10:30:59,783][26022] Updated weights on worker 0-0, policy_version 209326 (0.00095) [2022-07-09 10:31:01,638][26022] Updated weights on worker 0-0, policy_version 209336 (0.00085) [2022-07-09 10:31:03,883][26022] Updated weights on worker 0-0, policy_version 209346 (0.00085) [2022-07-09 10:31:04,156][25689] Fps is (10 sec: 5631.1, 60 sec: 5777.5, 300 sec: 5751.6). Total num frames: 214372352. Throughput: 0: 5921.2. Samples: 214378536. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:31:04,157][25689] Avg episode reward: [(0, '-49.567')] [2022-07-09 10:31:05,594][26022] Updated weights on worker 0-0, policy_version 209356 (0.00089) [2022-07-09 10:31:07,319][26022] Updated weights on worker 0-0, policy_version 209366 (0.00079) [2022-07-09 10:31:09,043][26022] Updated weights on worker 0-0, policy_version 209376 (0.00086) [2022-07-09 10:31:09,222][25689] Fps is (10 sec: 5569.7, 60 sec: 5771.6, 300 sec: 5750.4). Total num frames: 214402048. Throughput: 0: 5039.0. Samples: 214395774. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:31:09,223][25689] Avg episode reward: [(0, '-48.642')] [2022-07-09 10:31:10,751][26022] Updated weights on worker 0-0, policy_version 209386 (0.00092) [2022-07-09 10:31:12,771][26022] Updated weights on worker 0-0, policy_version 209396 (0.00082) [2022-07-09 10:31:14,274][25689] Fps is (10 sec: 5870.1, 60 sec: 5769.5, 300 sec: 5750.4). Total num frames: 214431744. Throughput: 0: 5896.2. Samples: 214430632. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:31:14,275][25689] Avg episode reward: [(0, '-48.808')] [2022-07-09 10:31:14,279][26022] Updated weights on worker 0-0, policy_version 209406 (0.00173) [2022-07-09 10:31:15,959][26022] Updated weights on worker 0-0, policy_version 209416 (0.00086) [2022-07-09 10:31:18,039][26022] Updated weights on worker 0-0, policy_version 209426 (0.00080) [2022-07-09 10:31:19,341][25689] Fps is (10 sec: 5869.4, 60 sec: 5774.4, 300 sec: 5753.2). Total num frames: 214461440. Throughput: 0: 5928.8. Samples: 214465698. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 10:31:19,341][25689] Avg episode reward: [(0, '-47.710')] [2022-07-09 10:31:19,612][26022] Updated weights on worker 0-0, policy_version 209436 (0.00090) [2022-07-09 10:31:21,391][26022] Updated weights on worker 0-0, policy_version 209446 (0.00095) [2022-07-09 10:31:23,405][26022] Updated weights on worker 0-0, policy_version 209456 (0.00086) [2022-07-09 10:31:24,351][25689] Fps is (10 sec: 5792.0, 60 sec: 5790.7, 300 sec: 5753.2). Total num frames: 214490112. Throughput: 0: 5175.6. Samples: 214483058. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:24,352][25689] Avg episode reward: [(0, '-48.239')] [2022-07-09 10:31:24,735][26022] Updated weights on worker 0-0, policy_version 209466 (0.00084) [2022-07-09 10:31:26,982][26022] Updated weights on worker 0-0, policy_version 209476 (0.00091) [2022-07-09 10:31:28,515][26022] Updated weights on worker 0-0, policy_version 209486 (0.00053) [2022-07-09 10:31:29,369][25689] Fps is (10 sec: 5514.1, 60 sec: 5726.6, 300 sec: 5746.2). Total num frames: 214516736. Throughput: 0: 6045.8. Samples: 214517578. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:29,370][25689] Avg episode reward: [(0, '-49.321')] [2022-07-09 10:31:30,367][26022] Updated weights on worker 0-0, policy_version 209496 (0.00092) [2022-07-09 10:31:32,298][26022] Updated weights on worker 0-0, policy_version 209506 (0.00085) [2022-07-09 10:31:33,629][26022] Updated weights on worker 0-0, policy_version 209516 (0.00085) [2022-07-09 10:31:34,401][25689] Fps is (10 sec: 5705.9, 60 sec: 5760.1, 300 sec: 5750.8). Total num frames: 214547456. Throughput: 0: 6046.5. Samples: 214552332. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:34,402][25689] Avg episode reward: [(0, '-49.508')] [2022-07-09 10:31:35,684][26022] Updated weights on worker 0-0, policy_version 209526 (0.00086) [2022-07-09 10:31:37,337][26022] Updated weights on worker 0-0, policy_version 209536 (0.00089) [2022-07-09 10:31:39,060][26022] Updated weights on worker 0-0, policy_version 209546 (0.00647) [2022-07-09 10:31:39,467][25689] Fps is (10 sec: 5881.7, 60 sec: 5731.6, 300 sec: 5746.1). Total num frames: 214576128. Throughput: 0: 5170.2. Samples: 214569752. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:39,467][25689] Avg episode reward: [(0, '-49.279')] [2022-07-09 10:31:40,948][26022] Updated weights on worker 0-0, policy_version 209556 (0.00565) [2022-07-09 10:31:42,615][26022] Updated weights on worker 0-0, policy_version 209566 (0.00088) [2022-07-09 10:31:44,517][25689] Fps is (10 sec: 5567.8, 60 sec: 5731.2, 300 sec: 5741.8). Total num frames: 214603776. Throughput: 0: 6026.3. Samples: 214604580. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:44,517][25689] Avg episode reward: [(0, '-48.837')] [2022-07-09 10:31:44,575][26022] Updated weights on worker 0-0, policy_version 209576 (0.00090) [2022-07-09 10:31:46,311][26022] Updated weights on worker 0-0, policy_version 209586 (0.00090) [2022-07-09 10:31:47,979][26022] Updated weights on worker 0-0, policy_version 209596 (0.00084) [2022-07-09 10:31:49,518][25689] Fps is (10 sec: 5806.9, 60 sec: 5765.1, 300 sec: 5748.8). Total num frames: 214634496. Throughput: 0: 6043.9. Samples: 214639358. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:49,519][25689] Avg episode reward: [(0, '-48.987')] [2022-07-09 10:31:49,866][26022] Updated weights on worker 0-0, policy_version 209606 (0.00083) [2022-07-09 10:31:51,477][26022] Updated weights on worker 0-0, policy_version 209616 (0.00084) [2022-07-09 10:31:53,141][26022] Updated weights on worker 0-0, policy_version 209626 (0.00085) [2022-07-09 10:31:54,530][25689] Fps is (10 sec: 5931.3, 60 sec: 5749.3, 300 sec: 5745.8). Total num frames: 214663168. Throughput: 0: 5196.4. Samples: 214656928. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:54,530][25689] Avg episode reward: [(0, '-48.420')] [2022-07-09 10:31:55,255][26022] Updated weights on worker 0-0, policy_version 209636 (0.00087) [2022-07-09 10:31:56,687][26022] Updated weights on worker 0-0, policy_version 209646 (0.00095) [2022-07-09 10:31:58,623][26022] Updated weights on worker 0-0, policy_version 209656 (0.00085) [2022-07-09 10:31:59,579][25689] Fps is (10 sec: 5801.8, 60 sec: 5740.8, 300 sec: 5755.5). Total num frames: 214692864. Throughput: 0: 6083.2. Samples: 214692096. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:31:59,579][25689] Avg episode reward: [(0, '-47.648')] [2022-07-09 10:32:00,238][26022] Updated weights on worker 0-0, policy_version 209666 (0.00082) [2022-07-09 10:32:02,547][26022] Updated weights on worker 0-0, policy_version 209676 (0.00084) [2022-07-09 10:32:04,275][26022] Updated weights on worker 0-0, policy_version 209686 (0.00093) [2022-07-09 10:32:04,595][25689] Fps is (10 sec: 5595.8, 60 sec: 5743.7, 300 sec: 5752.0). Total num frames: 214719488. Throughput: 0: 5981.1. Samples: 214724668. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:04,595][25689] Avg episode reward: [(0, '-47.969')] [2022-07-09 10:32:05,981][26022] Updated weights on worker 0-0, policy_version 209696 (0.00088) [2022-07-09 10:32:07,838][26022] Updated weights on worker 0-0, policy_version 209706 (0.00085) [2022-07-09 10:32:09,600][25689] Fps is (10 sec: 5517.6, 60 sec: 5732.4, 300 sec: 5748.6). Total num frames: 214748160. Throughput: 0: 5114.2. Samples: 214742062. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:09,601][25689] Avg episode reward: [(0, '-47.128')] [2022-07-09 10:32:09,636][26022] Updated weights on worker 0-0, policy_version 209716 (0.00087) [2022-07-09 10:32:11,226][26022] Updated weights on worker 0-0, policy_version 209726 (0.00092) [2022-07-09 10:32:13,035][26022] Updated weights on worker 0-0, policy_version 209736 (0.00085) [2022-07-09 10:32:14,632][25689] Fps is (10 sec: 5917.3, 60 sec: 5751.3, 300 sec: 5753.6). Total num frames: 214778880. Throughput: 0: 5977.9. Samples: 214777094. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:14,632][25689] Avg episode reward: [(0, '-48.063')] [2022-07-09 10:32:15,015][26022] Updated weights on worker 0-0, policy_version 209746 (0.00094) [2022-07-09 10:32:16,598][26022] Updated weights on worker 0-0, policy_version 209756 (0.00092) [2022-07-09 10:32:18,514][26022] Updated weights on worker 0-0, policy_version 209766 (0.00085) [2022-07-09 10:32:19,724][25689] Fps is (10 sec: 6068.7, 60 sec: 5765.8, 300 sec: 5755.3). Total num frames: 214809600. Throughput: 0: 5964.7. Samples: 214812260. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:19,725][25689] Avg episode reward: [(0, '-47.488')] [2022-07-09 10:32:19,914][26022] Updated weights on worker 0-0, policy_version 209776 (0.00049) [2022-07-09 10:32:22,083][26022] Updated weights on worker 0-0, policy_version 209786 (0.00093) [2022-07-09 10:32:23,544][26022] Updated weights on worker 0-0, policy_version 209796 (0.00085) [2022-07-09 10:32:24,734][25689] Fps is (10 sec: 5575.0, 60 sec: 5715.0, 300 sec: 5748.7). Total num frames: 214835200. Throughput: 0: 5214.3. Samples: 214829680. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:24,735][25689] Avg episode reward: [(0, '-47.702')] [2022-07-09 10:32:25,443][26022] Updated weights on worker 0-0, policy_version 209806 (0.00082) [2022-07-09 10:32:27,176][26022] Updated weights on worker 0-0, policy_version 209816 (0.00089) [2022-07-09 10:32:29,084][26022] Updated weights on worker 0-0, policy_version 209826 (0.00091) [2022-07-09 10:32:29,791][25689] Fps is (10 sec: 5493.1, 60 sec: 5762.2, 300 sec: 5744.5). Total num frames: 214864896. Throughput: 0: 6020.5. Samples: 214863618. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:29,791][25689] Avg episode reward: [(0, '-47.257')] [2022-07-09 10:32:30,837][26022] Updated weights on worker 0-0, policy_version 209836 (0.00083) [2022-07-09 10:32:32,548][26022] Updated weights on worker 0-0, policy_version 209846 (0.00086) [2022-07-09 10:32:34,367][26022] Updated weights on worker 0-0, policy_version 209856 (0.00093) [2022-07-09 10:32:34,820][25689] Fps is (10 sec: 5989.8, 60 sec: 5762.4, 300 sec: 5751.7). Total num frames: 214895616. Throughput: 0: 6009.6. Samples: 214898420. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:34,821][25689] Avg episode reward: [(0, '-48.733')] [2022-07-09 10:32:36,409][26022] Updated weights on worker 0-0, policy_version 209866 (0.00093) [2022-07-09 10:32:37,736][26022] Updated weights on worker 0-0, policy_version 209876 (0.00084) [2022-07-09 10:32:39,744][26022] Updated weights on worker 0-0, policy_version 209886 (0.00091) [2022-07-09 10:32:39,864][25689] Fps is (10 sec: 5794.5, 60 sec: 5747.6, 300 sec: 5744.3). Total num frames: 214923264. Throughput: 0: 5989.5. Samples: 214932886. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:39,864][25689] Avg episode reward: [(0, '-50.123')] [2022-07-09 10:32:41,742][26022] Updated weights on worker 0-0, policy_version 209896 (0.00087) [2022-07-09 10:32:43,332][26022] Updated weights on worker 0-0, policy_version 209906 (0.00093) [2022-07-09 10:32:44,871][25689] Fps is (10 sec: 5501.6, 60 sec: 5751.6, 300 sec: 5744.7). Total num frames: 214950912. Throughput: 0: 5975.1. Samples: 214950002. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:44,872][25689] Avg episode reward: [(0, '-51.003')] [2022-07-09 10:32:45,179][26022] Updated weights on worker 0-0, policy_version 209916 (0.00085) [2022-07-09 10:32:46,909][26022] Updated weights on worker 0-0, policy_version 209926 (0.00092) [2022-07-09 10:32:48,407][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:32:48,424][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000209934_214972416.pth [2022-07-09 10:32:48,425][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000207912_212901888.pth [2022-07-09 10:32:48,626][26022] Updated weights on worker 0-0, policy_version 209936 (0.00090) [2022-07-09 10:32:49,873][25689] Fps is (10 sec: 5831.5, 60 sec: 5751.6, 300 sec: 5748.7). Total num frames: 214981632. Throughput: 0: 6029.2. Samples: 214984696. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:49,875][25689] Avg episode reward: [(0, '-50.751')] [2022-07-09 10:32:50,610][26022] Updated weights on worker 0-0, policy_version 209946 (0.00099) [2022-07-09 10:32:52,002][26022] Updated weights on worker 0-0, policy_version 209956 (0.00085) [2022-07-09 10:32:54,022][26022] Updated weights on worker 0-0, policy_version 209966 (0.00083) [2022-07-09 10:32:54,953][25689] Fps is (10 sec: 5992.2, 60 sec: 5762.0, 300 sec: 5744.4). Total num frames: 215011328. Throughput: 0: 6025.3. Samples: 215019728. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:32:54,955][25689] Avg episode reward: [(0, '-50.926')] [2022-07-09 10:32:55,559][26022] Updated weights on worker 0-0, policy_version 209976 (0.00090) [2022-07-09 10:32:57,490][26022] Updated weights on worker 0-0, policy_version 209986 (0.00098) [2022-07-09 10:32:59,355][26022] Updated weights on worker 0-0, policy_version 209996 (0.00084) [2022-07-09 10:33:00,023][25689] Fps is (10 sec: 5649.7, 60 sec: 5726.2, 300 sec: 5754.4). Total num frames: 215038976. Throughput: 0: 5165.3. Samples: 215037014. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:00,023][25689] Avg episode reward: [(0, '-51.481')] [2022-07-09 10:33:01,074][26022] Updated weights on worker 0-0, policy_version 210006 (0.00088) [2022-07-09 10:33:03,260][26022] Updated weights on worker 0-0, policy_version 210016 (0.00085) [2022-07-09 10:33:04,908][26022] Updated weights on worker 0-0, policy_version 210026 (0.00090) [2022-07-09 10:33:05,057][25689] Fps is (10 sec: 5472.9, 60 sec: 5741.3, 300 sec: 5746.9). Total num frames: 215066624. Throughput: 0: 5923.8. Samples: 215069580. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:05,058][25689] Avg episode reward: [(0, '-51.499')] [2022-07-09 10:33:06,695][26022] Updated weights on worker 0-0, policy_version 210036 (0.00088) [2022-07-09 10:33:08,575][26022] Updated weights on worker 0-0, policy_version 210046 (0.00094) [2022-07-09 10:33:10,067][25689] Fps is (10 sec: 5607.5, 60 sec: 5741.0, 300 sec: 5743.5). Total num frames: 215095296. Throughput: 0: 5918.8. Samples: 215104218. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:10,067][25689] Avg episode reward: [(0, '-50.223')] [2022-07-09 10:33:10,265][26022] Updated weights on worker 0-0, policy_version 210056 (0.00089) [2022-07-09 10:33:12,144][26022] Updated weights on worker 0-0, policy_version 210066 (0.00090) [2022-07-09 10:33:13,811][26022] Updated weights on worker 0-0, policy_version 210076 (0.00085) [2022-07-09 10:33:15,091][25689] Fps is (10 sec: 5715.0, 60 sec: 5707.8, 300 sec: 5741.2). Total num frames: 215123968. Throughput: 0: 5039.4. Samples: 215121210. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:15,092][25689] Avg episode reward: [(0, '-51.039')] [2022-07-09 10:33:15,695][26022] Updated weights on worker 0-0, policy_version 210086 (0.00084) [2022-07-09 10:33:17,442][26022] Updated weights on worker 0-0, policy_version 210096 (0.00080) [2022-07-09 10:33:19,398][26022] Updated weights on worker 0-0, policy_version 210106 (0.00085) [2022-07-09 10:33:20,152][25689] Fps is (10 sec: 5888.9, 60 sec: 5710.8, 300 sec: 5747.6). Total num frames: 215154688. Throughput: 0: 5921.4. Samples: 215156206. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:20,153][25689] Avg episode reward: [(0, '-50.605')] [2022-07-09 10:33:20,750][26022] Updated weights on worker 0-0, policy_version 210116 (0.00090) [2022-07-09 10:33:22,791][26022] Updated weights on worker 0-0, policy_version 210126 (0.00094) [2022-07-09 10:33:24,298][26022] Updated weights on worker 0-0, policy_version 210136 (0.00089) [2022-07-09 10:33:25,159][25689] Fps is (10 sec: 5797.6, 60 sec: 5744.9, 300 sec: 5744.8). Total num frames: 215182336. Throughput: 0: 6063.6. Samples: 215191468. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:25,159][25689] Avg episode reward: [(0, '-51.317')] [2022-07-09 10:33:26,290][26022] Updated weights on worker 0-0, policy_version 210146 (0.00094) [2022-07-09 10:33:28,036][26022] Updated weights on worker 0-0, policy_version 210156 (0.00611) [2022-07-09 10:33:29,589][26022] Updated weights on worker 0-0, policy_version 210166 (0.00087) [2022-07-09 10:33:30,227][25689] Fps is (10 sec: 5793.4, 60 sec: 5760.8, 300 sec: 5747.4). Total num frames: 215213056. Throughput: 0: 5181.9. Samples: 215208686. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:30,227][25689] Avg episode reward: [(0, '-50.751')] [2022-07-09 10:33:31,672][26022] Updated weights on worker 0-0, policy_version 210176 (0.00082) [2022-07-09 10:33:33,391][26022] Updated weights on worker 0-0, policy_version 210186 (0.00093) [2022-07-09 10:33:34,872][26022] Updated weights on worker 0-0, policy_version 210196 (0.00086) [2022-07-09 10:33:35,231][25689] Fps is (10 sec: 5998.0, 60 sec: 5746.2, 300 sec: 5751.4). Total num frames: 215242752. Throughput: 0: 6064.5. Samples: 215243350. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 10:33:35,232][25689] Avg episode reward: [(0, '-49.944')] [2022-07-09 10:33:37,046][26022] Updated weights on worker 0-0, policy_version 210206 (0.00085) [2022-07-09 10:33:38,405][26022] Updated weights on worker 0-0, policy_version 210216 (0.00089) [2022-07-09 10:33:40,348][25689] Fps is (10 sec: 5564.5, 60 sec: 5722.3, 300 sec: 5735.8). Total num frames: 215269376. Throughput: 0: 6022.6. Samples: 215277840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:33:40,349][25689] Avg episode reward: [(0, '-50.827')] [2022-07-09 10:33:40,620][26022] Updated weights on worker 0-0, policy_version 210226 (0.00091) [2022-07-09 10:33:42,207][26022] Updated weights on worker 0-0, policy_version 210236 (0.00085) [2022-07-09 10:33:43,786][26022] Updated weights on worker 0-0, policy_version 210246 (0.00089) [2022-07-09 10:33:45,363][25689] Fps is (10 sec: 5659.7, 60 sec: 5772.4, 300 sec: 5742.6). Total num frames: 215300096. Throughput: 0: 5138.0. Samples: 215295280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:33:45,365][25689] Avg episode reward: [(0, '-50.322')] [2022-07-09 10:33:45,815][26022] Updated weights on worker 0-0, policy_version 210256 (0.00085) [2022-07-09 10:33:47,532][26022] Updated weights on worker 0-0, policy_version 210266 (0.00096) [2022-07-09 10:33:49,446][26022] Updated weights on worker 0-0, policy_version 210276 (0.00091) [2022-07-09 10:33:50,429][25689] Fps is (10 sec: 5993.3, 60 sec: 5749.4, 300 sec: 5748.3). Total num frames: 215329792. Throughput: 0: 6019.3. Samples: 215330288. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:33:50,429][25689] Avg episode reward: [(0, '-50.838')] [2022-07-09 10:33:50,906][26022] Updated weights on worker 0-0, policy_version 210286 (0.00082) [2022-07-09 10:33:52,829][26022] Updated weights on worker 0-0, policy_version 210296 (0.00085) [2022-07-09 10:33:54,710][26022] Updated weights on worker 0-0, policy_version 210306 (0.00090) [2022-07-09 10:33:55,431][25689] Fps is (10 sec: 5594.2, 60 sec: 5706.1, 300 sec: 5742.4). Total num frames: 215356416. Throughput: 0: 6016.0. Samples: 215364870. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:33:55,431][25689] Avg episode reward: [(0, '-50.080')] [2022-07-09 10:33:56,198][26022] Updated weights on worker 0-0, policy_version 210316 (0.00096) [2022-07-09 10:33:58,303][26022] Updated weights on worker 0-0, policy_version 210326 (0.00089) [2022-07-09 10:33:59,683][26022] Updated weights on worker 0-0, policy_version 210336 (0.00500) [2022-07-09 10:34:00,509][25689] Fps is (10 sec: 5688.7, 60 sec: 5756.0, 300 sec: 5755.2). Total num frames: 215387136. Throughput: 0: 5170.6. Samples: 215382084. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:00,531][25689] Avg episode reward: [(0, '-49.794')] [2022-07-09 10:34:02,121][26022] Updated weights on worker 0-0, policy_version 210346 (0.00086) [2022-07-09 10:34:04,007][26022] Updated weights on worker 0-0, policy_version 210356 (0.00088) [2022-07-09 10:34:05,522][26022] Updated weights on worker 0-0, policy_version 210366 (0.00082) [2022-07-09 10:34:05,613][25689] Fps is (10 sec: 5732.4, 60 sec: 5749.4, 300 sec: 5739.8). Total num frames: 215414784. Throughput: 0: 5877.9. Samples: 215414306. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:05,614][25689] Avg episode reward: [(0, '-49.236')] [2022-07-09 10:34:07,520][26022] Updated weights on worker 0-0, policy_version 210376 (0.00085) [2022-07-09 10:34:09,169][26022] Updated weights on worker 0-0, policy_version 210386 (0.00087) [2022-07-09 10:34:10,618][25689] Fps is (10 sec: 5469.9, 60 sec: 5732.9, 300 sec: 5740.0). Total num frames: 215442432. Throughput: 0: 5885.7. Samples: 215449118. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:10,623][25689] Avg episode reward: [(0, '-49.447')] [2022-07-09 10:34:10,861][26022] Updated weights on worker 0-0, policy_version 210396 (0.00093) [2022-07-09 10:34:12,635][26022] Updated weights on worker 0-0, policy_version 210406 (0.00087) [2022-07-09 10:34:14,606][26022] Updated weights on worker 0-0, policy_version 210416 (0.00085) [2022-07-09 10:34:15,626][25689] Fps is (10 sec: 5727.1, 60 sec: 5751.4, 300 sec: 5746.3). Total num frames: 215472128. Throughput: 0: 5039.3. Samples: 215466638. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:15,626][25689] Avg episode reward: [(0, '-48.213')] [2022-07-09 10:34:16,160][26022] Updated weights on worker 0-0, policy_version 210426 (0.00093) [2022-07-09 10:34:18,059][26022] Updated weights on worker 0-0, policy_version 210436 (0.00095) [2022-07-09 10:34:19,726][26022] Updated weights on worker 0-0, policy_version 210446 (0.00089) [2022-07-09 10:34:20,742][25689] Fps is (10 sec: 5664.4, 60 sec: 5695.5, 300 sec: 5737.6). Total num frames: 215499776. Throughput: 0: 5895.9. Samples: 215501374. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:20,742][25689] Avg episode reward: [(0, '-48.390')] [2022-07-09 10:34:21,510][26022] Updated weights on worker 0-0, policy_version 210456 (0.00084) [2022-07-09 10:34:23,442][26022] Updated weights on worker 0-0, policy_version 210466 (0.00082) [2022-07-09 10:34:25,070][26022] Updated weights on worker 0-0, policy_version 210476 (0.00094) [2022-07-09 10:34:25,787][25689] Fps is (10 sec: 5845.0, 60 sec: 5759.4, 300 sec: 5747.8). Total num frames: 215531520. Throughput: 0: 6033.4. Samples: 215536022. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:25,788][25689] Avg episode reward: [(0, '-49.725')] [2022-07-09 10:34:27,075][26022] Updated weights on worker 0-0, policy_version 210486 (0.00089) [2022-07-09 10:34:28,651][26022] Updated weights on worker 0-0, policy_version 210496 (0.00089) [2022-07-09 10:34:30,340][26022] Updated weights on worker 0-0, policy_version 210506 (0.00089) [2022-07-09 10:34:30,809][25689] Fps is (10 sec: 6102.8, 60 sec: 5746.9, 300 sec: 5750.9). Total num frames: 215561216. Throughput: 0: 5160.6. Samples: 215553316. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:30,810][25689] Avg episode reward: [(0, '-48.706')] [2022-07-09 10:34:32,186][26022] Updated weights on worker 0-0, policy_version 210516 (0.00085) [2022-07-09 10:34:33,945][26022] Updated weights on worker 0-0, policy_version 210526 (0.00084) [2022-07-09 10:34:35,809][26022] Updated weights on worker 0-0, policy_version 210536 (0.00084) [2022-07-09 10:34:35,905][25689] Fps is (10 sec: 5667.2, 60 sec: 5704.4, 300 sec: 5747.5). Total num frames: 215588864. Throughput: 0: 5989.3. Samples: 215588100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:35,906][25689] Avg episode reward: [(0, '-48.645')] [2022-07-09 10:34:37,391][26022] Updated weights on worker 0-0, policy_version 210546 (0.00083) [2022-07-09 10:34:39,223][26022] Updated weights on worker 0-0, policy_version 210556 (0.00086) [2022-07-09 10:34:40,949][25689] Fps is (10 sec: 5655.6, 60 sec: 5762.1, 300 sec: 5743.8). Total num frames: 215618560. Throughput: 0: 6000.8. Samples: 215622632. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:40,949][25689] Avg episode reward: [(0, '-48.877')] [2022-07-09 10:34:41,176][26022] Updated weights on worker 0-0, policy_version 210566 (0.00084) [2022-07-09 10:34:42,793][26022] Updated weights on worker 0-0, policy_version 210576 (0.00092) [2022-07-09 10:34:44,681][26022] Updated weights on worker 0-0, policy_version 210586 (0.00091) [2022-07-09 10:34:46,034][25689] Fps is (10 sec: 5863.9, 60 sec: 5738.5, 300 sec: 5745.7). Total num frames: 215648256. Throughput: 0: 5994.7. Samples: 215657396. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:46,034][25689] Avg episode reward: [(0, '-49.453')] [2022-07-09 10:34:46,367][26022] Updated weights on worker 0-0, policy_version 210596 (0.00100) [2022-07-09 10:34:48,220][26022] Updated weights on worker 0-0, policy_version 210606 (0.00083) [2022-07-09 10:34:48,434][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:34:48,442][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000210608_215662592.pth [2022-07-09 10:34:48,443][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000208587_213593088.pth [2022-07-09 10:34:49,963][26022] Updated weights on worker 0-0, policy_version 210616 (0.00345) [2022-07-09 10:34:51,051][25689] Fps is (10 sec: 5676.5, 60 sec: 5709.4, 300 sec: 5746.0). Total num frames: 215675904. Throughput: 0: 6014.3. Samples: 215675054. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:51,051][25689] Avg episode reward: [(0, '-49.864')] [2022-07-09 10:34:51,509][26022] Updated weights on worker 0-0, policy_version 210626 (0.00081) [2022-07-09 10:34:53,279][26022] Updated weights on worker 0-0, policy_version 210636 (0.00089) [2022-07-09 10:34:55,295][26022] Updated weights on worker 0-0, policy_version 210646 (0.00083) [2022-07-09 10:34:56,059][25689] Fps is (10 sec: 5719.8, 60 sec: 5759.4, 300 sec: 5740.7). Total num frames: 215705600. Throughput: 0: 6048.9. Samples: 215710008. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:34:56,060][25689] Avg episode reward: [(0, '-50.456')] [2022-07-09 10:34:56,851][26022] Updated weights on worker 0-0, policy_version 210656 (0.00093) [2022-07-09 10:34:58,806][26022] Updated weights on worker 0-0, policy_version 210666 (0.00089) [2022-07-09 10:35:00,326][26022] Updated weights on worker 0-0, policy_version 210676 (0.00089) [2022-07-09 10:35:01,167][25689] Fps is (10 sec: 5871.0, 60 sec: 5739.8, 300 sec: 5756.1). Total num frames: 215735296. Throughput: 0: 6053.0. Samples: 215745012. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:01,167][25689] Avg episode reward: [(0, '-50.480')] [2022-07-09 10:35:02,612][26022] Updated weights on worker 0-0, policy_version 210686 (0.00089) [2022-07-09 10:35:04,358][26022] Updated weights on worker 0-0, policy_version 210696 (0.00087) [2022-07-09 10:35:05,971][26022] Updated weights on worker 0-0, policy_version 210706 (0.00087) [2022-07-09 10:35:06,225][25689] Fps is (10 sec: 5741.7, 60 sec: 5761.0, 300 sec: 5751.6). Total num frames: 215763968. Throughput: 0: 5108.3. Samples: 215760538. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:06,225][25689] Avg episode reward: [(0, '-49.984')] [2022-07-09 10:35:07,952][26022] Updated weights on worker 0-0, policy_version 210716 (0.00091) [2022-07-09 10:35:09,466][26022] Updated weights on worker 0-0, policy_version 210726 (0.00081) [2022-07-09 10:35:11,227][25689] Fps is (10 sec: 5700.0, 60 sec: 5778.2, 300 sec: 5748.7). Total num frames: 215792640. Throughput: 0: 5958.5. Samples: 215795276. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:11,228][25689] Avg episode reward: [(0, '-49.196')] [2022-07-09 10:35:11,455][26022] Updated weights on worker 0-0, policy_version 210736 (0.00081) [2022-07-09 10:35:13,399][26022] Updated weights on worker 0-0, policy_version 210746 (0.00081) [2022-07-09 10:35:15,023][26022] Updated weights on worker 0-0, policy_version 210756 (0.00086) [2022-07-09 10:35:16,275][25689] Fps is (10 sec: 5603.9, 60 sec: 5740.5, 300 sec: 5743.1). Total num frames: 215820288. Throughput: 0: 5925.9. Samples: 215829804. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:16,277][25689] Avg episode reward: [(0, '-49.073')] [2022-07-09 10:35:16,913][26022] Updated weights on worker 0-0, policy_version 210766 (0.00094) [2022-07-09 10:35:18,501][26022] Updated weights on worker 0-0, policy_version 210776 (0.00077) [2022-07-09 10:35:20,139][26022] Updated weights on worker 0-0, policy_version 210786 (0.00079) [2022-07-09 10:35:21,330][25689] Fps is (10 sec: 5675.9, 60 sec: 5780.1, 300 sec: 5749.0). Total num frames: 215849984. Throughput: 0: 5074.4. Samples: 215847330. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:21,331][25689] Avg episode reward: [(0, '-49.533')] [2022-07-09 10:35:22,040][26022] Updated weights on worker 0-0, policy_version 210796 (0.00083) [2022-07-09 10:35:23,881][26022] Updated weights on worker 0-0, policy_version 210806 (0.00087) [2022-07-09 10:35:25,471][26022] Updated weights on worker 0-0, policy_version 210816 (0.00094) [2022-07-09 10:35:26,346][25689] Fps is (10 sec: 5999.2, 60 sec: 5766.0, 300 sec: 5749.8). Total num frames: 215880704. Throughput: 0: 6055.5. Samples: 215882380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:26,347][25689] Avg episode reward: [(0, '-49.437')] [2022-07-09 10:35:27,415][26022] Updated weights on worker 0-0, policy_version 210826 (0.00090) [2022-07-09 10:35:28,955][26022] Updated weights on worker 0-0, policy_version 210836 (0.00086) [2022-07-09 10:35:31,024][26022] Updated weights on worker 0-0, policy_version 210846 (0.00083) [2022-07-09 10:35:31,358][25689] Fps is (10 sec: 5820.6, 60 sec: 5733.2, 300 sec: 5746.7). Total num frames: 215908352. Throughput: 0: 6046.5. Samples: 215916998. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:31,360][25689] Avg episode reward: [(0, '-49.449')] [2022-07-09 10:35:32,528][26022] Updated weights on worker 0-0, policy_version 210856 (0.00085) [2022-07-09 10:35:34,415][26022] Updated weights on worker 0-0, policy_version 210866 (0.00084) [2022-07-09 10:35:36,163][26022] Updated weights on worker 0-0, policy_version 210876 (0.00090) [2022-07-09 10:35:36,370][25689] Fps is (10 sec: 5720.4, 60 sec: 5775.0, 300 sec: 5745.3). Total num frames: 215938048. Throughput: 0: 5219.8. Samples: 215934698. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:36,371][25689] Avg episode reward: [(0, '-49.920')] [2022-07-09 10:35:37,949][26022] Updated weights on worker 0-0, policy_version 210886 (0.00082) [2022-07-09 10:35:39,603][26022] Updated weights on worker 0-0, policy_version 210896 (0.00084) [2022-07-09 10:35:41,426][25689] Fps is (10 sec: 5899.4, 60 sec: 5773.8, 300 sec: 5752.0). Total num frames: 215967744. Throughput: 0: 6094.5. Samples: 215969802. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:41,426][25689] Avg episode reward: [(0, '-49.682')] [2022-07-09 10:35:41,433][26022] Updated weights on worker 0-0, policy_version 210906 (0.00081) [2022-07-09 10:35:43,025][26022] Updated weights on worker 0-0, policy_version 210916 (0.00087) [2022-07-09 10:35:44,978][26022] Updated weights on worker 0-0, policy_version 210926 (0.00089) [2022-07-09 10:35:46,501][25689] Fps is (10 sec: 5862.6, 60 sec: 5774.8, 300 sec: 5754.1). Total num frames: 215997440. Throughput: 0: 6081.5. Samples: 216004954. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-09 10:35:46,501][25689] Avg episode reward: [(0, '-48.908')] [2022-07-09 10:35:46,555][26022] Updated weights on worker 0-0, policy_version 210936 (0.00085) [2022-07-09 10:35:48,377][26022] Updated weights on worker 0-0, policy_version 210946 (0.00085) [2022-07-09 10:35:50,238][26022] Updated weights on worker 0-0, policy_version 210956 (0.00091) [2022-07-09 10:35:51,545][25689] Fps is (10 sec: 5869.4, 60 sec: 5806.1, 300 sec: 5753.7). Total num frames: 216027136. Throughput: 0: 5235.3. Samples: 216022680. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:35:51,545][25689] Avg episode reward: [(0, '-48.769')] [2022-07-09 10:35:51,731][26022] Updated weights on worker 0-0, policy_version 210966 (0.00084) [2022-07-09 10:35:53,520][26022] Updated weights on worker 0-0, policy_version 210976 (0.00088) [2022-07-09 10:35:55,323][26022] Updated weights on worker 0-0, policy_version 210986 (0.00088) [2022-07-09 10:35:56,615][25689] Fps is (10 sec: 5771.2, 60 sec: 5783.3, 300 sec: 5748.1). Total num frames: 216055808. Throughput: 0: 6068.8. Samples: 216057558. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:35:56,615][25689] Avg episode reward: [(0, '-47.917')] [2022-07-09 10:35:57,219][26022] Updated weights on worker 0-0, policy_version 210996 (0.00085) [2022-07-09 10:35:59,088][26022] Updated weights on worker 0-0, policy_version 211006 (0.00090) [2022-07-09 10:36:00,608][26022] Updated weights on worker 0-0, policy_version 211016 (0.01021) [2022-07-09 10:36:01,722][25689] Fps is (10 sec: 5734.9, 60 sec: 5783.3, 300 sec: 5757.2). Total num frames: 216085504. Throughput: 0: 6041.1. Samples: 216092416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:01,723][25689] Avg episode reward: [(0, '-48.046')] [2022-07-09 10:36:02,873][26022] Updated weights on worker 0-0, policy_version 211026 (0.00082) [2022-07-09 10:36:04,505][26022] Updated weights on worker 0-0, policy_version 211036 (0.00080) [2022-07-09 10:36:06,388][26022] Updated weights on worker 0-0, policy_version 211046 (0.00094) [2022-07-09 10:36:06,725][25689] Fps is (10 sec: 5570.6, 60 sec: 5754.7, 300 sec: 5748.1). Total num frames: 216112128. Throughput: 0: 5089.9. Samples: 216107892. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:06,725][25689] Avg episode reward: [(0, '-47.791')] [2022-07-09 10:36:08,129][26022] Updated weights on worker 0-0, policy_version 211056 (0.00858) [2022-07-09 10:36:09,925][26022] Updated weights on worker 0-0, policy_version 211066 (0.00095) [2022-07-09 10:36:11,747][25689] Fps is (10 sec: 5617.8, 60 sec: 5769.7, 300 sec: 5748.7). Total num frames: 216141824. Throughput: 0: 5941.5. Samples: 216142716. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:11,748][25689] Avg episode reward: [(0, '-48.301')] [2022-07-09 10:36:11,749][26022] Updated weights on worker 0-0, policy_version 211076 (0.00090) [2022-07-09 10:36:13,452][26022] Updated weights on worker 0-0, policy_version 211086 (0.00088) [2022-07-09 10:36:15,312][26022] Updated weights on worker 0-0, policy_version 211096 (0.00081) [2022-07-09 10:36:16,778][25689] Fps is (10 sec: 5907.8, 60 sec: 5805.2, 300 sec: 5749.4). Total num frames: 216171520. Throughput: 0: 5956.2. Samples: 216177654. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:16,779][25689] Avg episode reward: [(0, '-48.213')] [2022-07-09 10:36:16,990][26022] Updated weights on worker 0-0, policy_version 211106 (0.00094) [2022-07-09 10:36:18,515][26022] Updated weights on worker 0-0, policy_version 211116 (0.00096) [2022-07-09 10:36:20,614][26022] Updated weights on worker 0-0, policy_version 211127 (0.00081) [2022-07-09 10:36:21,835][25689] Fps is (10 sec: 5785.9, 60 sec: 5788.1, 300 sec: 5748.5). Total num frames: 216200192. Throughput: 0: 5107.9. Samples: 216195152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:21,837][25689] Avg episode reward: [(0, '-48.084')] [2022-07-09 10:36:22,277][26022] Updated weights on worker 0-0, policy_version 211137 (0.00088) [2022-07-09 10:36:24,043][26022] Updated weights on worker 0-0, policy_version 211147 (0.00089) [2022-07-09 10:36:25,785][26022] Updated weights on worker 0-0, policy_version 211157 (0.00091) [2022-07-09 10:36:26,858][25689] Fps is (10 sec: 5993.7, 60 sec: 5804.3, 300 sec: 5765.6). Total num frames: 216231936. Throughput: 0: 6080.5. Samples: 216230312. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:26,858][25689] Avg episode reward: [(0, '-47.626')] [2022-07-09 10:36:27,693][26022] Updated weights on worker 0-0, policy_version 211167 (0.00085) [2022-07-09 10:36:29,281][26022] Updated weights on worker 0-0, policy_version 211177 (0.00095) [2022-07-09 10:36:31,363][26022] Updated weights on worker 0-0, policy_version 211187 (0.00091) [2022-07-09 10:36:31,879][25689] Fps is (10 sec: 5811.7, 60 sec: 5786.6, 300 sec: 5752.0). Total num frames: 216258560. Throughput: 0: 6067.6. Samples: 216264864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:31,879][25689] Avg episode reward: [(0, '-48.353')] [2022-07-09 10:36:32,794][26022] Updated weights on worker 0-0, policy_version 211197 (0.00091) [2022-07-09 10:36:34,819][26022] Updated weights on worker 0-0, policy_version 211207 (0.00084) [2022-07-09 10:36:36,457][26022] Updated weights on worker 0-0, policy_version 211217 (0.00090) [2022-07-09 10:36:36,904][25689] Fps is (10 sec: 5606.1, 60 sec: 5785.3, 300 sec: 5756.3). Total num frames: 216288256. Throughput: 0: 5204.2. Samples: 216282394. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:36,905][25689] Avg episode reward: [(0, '-48.353')] [2022-07-09 10:36:38,378][26022] Updated weights on worker 0-0, policy_version 211227 (0.00092) [2022-07-09 10:36:39,948][26022] Updated weights on worker 0-0, policy_version 211237 (0.00084) [2022-07-09 10:36:41,995][25689] Fps is (10 sec: 5769.8, 60 sec: 5765.1, 300 sec: 5758.9). Total num frames: 216316928. Throughput: 0: 6047.8. Samples: 216317072. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:41,995][25689] Avg episode reward: [(0, '-49.063')] [2022-07-09 10:36:42,000][26022] Updated weights on worker 0-0, policy_version 211247 (0.00081) [2022-07-09 10:36:43,359][26022] Updated weights on worker 0-0, policy_version 211257 (0.00086) [2022-07-09 10:36:45,409][26022] Updated weights on worker 0-0, policy_version 211267 (0.00093) [2022-07-09 10:36:47,028][25689] Fps is (10 sec: 5866.6, 60 sec: 5786.0, 300 sec: 5758.3). Total num frames: 216347648. Throughput: 0: 6021.7. Samples: 216351768. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:47,028][25689] Avg episode reward: [(0, '-49.523')] [2022-07-09 10:36:47,030][26022] Updated weights on worker 0-0, policy_version 211277 (0.00092) [2022-07-09 10:36:48,526][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:36:48,555][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000211284_216354816.pth [2022-07-09 10:36:48,556][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000209260_214282240.pth [2022-07-09 10:36:48,918][26022] Updated weights on worker 0-0, policy_version 211287 (0.00095) [2022-07-09 10:36:50,647][26022] Updated weights on worker 0-0, policy_version 211297 (0.00082) [2022-07-09 10:36:52,068][25689] Fps is (10 sec: 5895.9, 60 sec: 5769.4, 300 sec: 5757.8). Total num frames: 216376320. Throughput: 0: 5170.2. Samples: 216369248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:52,069][25689] Avg episode reward: [(0, '-49.655')] [2022-07-09 10:36:52,310][26022] Updated weights on worker 0-0, policy_version 211307 (0.00078) [2022-07-09 10:36:54,178][26022] Updated weights on worker 0-0, policy_version 211317 (0.00081) [2022-07-09 10:36:55,982][26022] Updated weights on worker 0-0, policy_version 211327 (0.00095) [2022-07-09 10:36:57,149][25689] Fps is (10 sec: 5665.6, 60 sec: 5768.3, 300 sec: 5753.7). Total num frames: 216404992. Throughput: 0: 6033.7. Samples: 216404548. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:36:57,150][25689] Avg episode reward: [(0, '-49.790')] [2022-07-09 10:36:57,537][26022] Updated weights on worker 0-0, policy_version 211337 (0.00088) [2022-07-09 10:36:59,576][26022] Updated weights on worker 0-0, policy_version 211347 (0.00084) [2022-07-09 10:37:01,152][26022] Updated weights on worker 0-0, policy_version 211357 (0.00086) [2022-07-09 10:37:02,201][25689] Fps is (10 sec: 5457.2, 60 sec: 5722.9, 300 sec: 5753.0). Total num frames: 216431616. Throughput: 0: 6039.1. Samples: 216439098. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:02,201][25689] Avg episode reward: [(0, '-50.495')] [2022-07-09 10:37:03,476][26022] Updated weights on worker 0-0, policy_version 211367 (0.00088) [2022-07-09 10:37:05,099][26022] Updated weights on worker 0-0, policy_version 211377 (0.00084) [2022-07-09 10:37:06,806][26022] Updated weights on worker 0-0, policy_version 211387 (0.00095) [2022-07-09 10:37:07,206][25689] Fps is (10 sec: 5701.8, 60 sec: 5790.3, 300 sec: 5759.9). Total num frames: 216462336. Throughput: 0: 5087.5. Samples: 216454426. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:07,207][25689] Avg episode reward: [(0, '-50.762')] [2022-07-09 10:37:08,836][26022] Updated weights on worker 0-0, policy_version 211397 (0.00095) [2022-07-09 10:37:10,421][26022] Updated weights on worker 0-0, policy_version 211407 (0.00085) [2022-07-09 10:37:12,193][26022] Updated weights on worker 0-0, policy_version 211417 (0.00091) [2022-07-09 10:37:12,214][25689] Fps is (10 sec: 5931.4, 60 sec: 5774.8, 300 sec: 5753.5). Total num frames: 216491008. Throughput: 0: 5948.1. Samples: 216489076. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:12,214][25689] Avg episode reward: [(0, '-50.064')] [2022-07-09 10:37:13,956][26022] Updated weights on worker 0-0, policy_version 211427 (0.00090) [2022-07-09 10:37:15,851][26022] Updated weights on worker 0-0, policy_version 211438 (0.00097) [2022-07-09 10:37:17,241][25689] Fps is (10 sec: 5714.7, 60 sec: 5758.2, 300 sec: 5747.9). Total num frames: 216519680. Throughput: 0: 5935.4. Samples: 216523800. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:17,241][25689] Avg episode reward: [(0, '-49.967')] [2022-07-09 10:37:17,832][26022] Updated weights on worker 0-0, policy_version 211448 (0.00084) [2022-07-09 10:37:19,499][26022] Updated weights on worker 0-0, policy_version 211458 (0.00086) [2022-07-09 10:37:21,087][26022] Updated weights on worker 0-0, policy_version 211468 (0.00085) [2022-07-09 10:37:22,313][25689] Fps is (10 sec: 5677.9, 60 sec: 5756.8, 300 sec: 5757.0). Total num frames: 216548352. Throughput: 0: 5943.8. Samples: 216558642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:22,314][25689] Avg episode reward: [(0, '-50.551')] [2022-07-09 10:37:22,960][26022] Updated weights on worker 0-0, policy_version 211478 (0.00069) [2022-07-09 10:37:24,625][26022] Updated weights on worker 0-0, policy_version 211488 (0.00087) [2022-07-09 10:37:26,523][26022] Updated weights on worker 0-0, policy_version 211498 (0.00084) [2022-07-09 10:37:27,391][25689] Fps is (10 sec: 5851.1, 60 sec: 5734.6, 300 sec: 5760.0). Total num frames: 216579072. Throughput: 0: 6039.5. Samples: 216576334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:27,399][25689] Avg episode reward: [(0, '-49.676')] [2022-07-09 10:37:28,484][26022] Updated weights on worker 0-0, policy_version 211508 (0.00087) [2022-07-09 10:37:30,007][26022] Updated weights on worker 0-0, policy_version 211518 (0.00083) [2022-07-09 10:37:32,136][26022] Updated weights on worker 0-0, policy_version 211528 (0.00083) [2022-07-09 10:37:32,417][25689] Fps is (10 sec: 5777.1, 60 sec: 5751.1, 300 sec: 5749.7). Total num frames: 216606720. Throughput: 0: 6026.1. Samples: 216610820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:32,419][25689] Avg episode reward: [(0, '-49.241')] [2022-07-09 10:37:33,454][26022] Updated weights on worker 0-0, policy_version 211538 (0.00089) [2022-07-09 10:37:35,328][26022] Updated weights on worker 0-0, policy_version 211548 (0.00086) [2022-07-09 10:37:37,022][26022] Updated weights on worker 0-0, policy_version 211558 (0.00085) [2022-07-09 10:37:37,431][25689] Fps is (10 sec: 5609.5, 60 sec: 5735.2, 300 sec: 5753.7). Total num frames: 216635392. Throughput: 0: 6049.1. Samples: 216645934. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:37,433][25689] Avg episode reward: [(0, '-48.785')] [2022-07-09 10:37:38,840][26022] Updated weights on worker 0-0, policy_version 211568 (0.00083) [2022-07-09 10:37:40,631][26022] Updated weights on worker 0-0, policy_version 211578 (0.00093) [2022-07-09 10:37:42,471][25689] Fps is (10 sec: 5805.0, 60 sec: 5756.9, 300 sec: 5760.0). Total num frames: 216665088. Throughput: 0: 5192.8. Samples: 216663318. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:42,473][25689] Avg episode reward: [(0, '-48.521')] [2022-07-09 10:37:42,590][26022] Updated weights on worker 0-0, policy_version 211588 (0.01123) [2022-07-09 10:37:43,974][26022] Updated weights on worker 0-0, policy_version 211598 (0.00095) [2022-07-09 10:37:46,014][26022] Updated weights on worker 0-0, policy_version 211608 (0.00094) [2022-07-09 10:37:47,482][25689] Fps is (10 sec: 5909.4, 60 sec: 5742.2, 300 sec: 5756.4). Total num frames: 216694784. Throughput: 0: 6064.4. Samples: 216698170. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:47,483][25689] Avg episode reward: [(0, '-48.772')] [2022-07-09 10:37:47,703][26022] Updated weights on worker 0-0, policy_version 211618 (0.00086) [2022-07-09 10:37:49,480][26022] Updated weights on worker 0-0, policy_version 211628 (0.00087) [2022-07-09 10:37:51,403][26022] Updated weights on worker 0-0, policy_version 211638 (0.00089) [2022-07-09 10:37:52,525][25689] Fps is (10 sec: 5907.4, 60 sec: 5758.8, 300 sec: 5757.1). Total num frames: 216724480. Throughput: 0: 6069.6. Samples: 216732872. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:52,526][25689] Avg episode reward: [(0, '-49.729')] [2022-07-09 10:37:53,192][26022] Updated weights on worker 0-0, policy_version 211648 (0.00090) [2022-07-09 10:37:54,827][26022] Updated weights on worker 0-0, policy_version 211658 (0.00087) [2022-07-09 10:37:56,559][26022] Updated weights on worker 0-0, policy_version 211668 (0.00086) [2022-07-09 10:37:57,531][25689] Fps is (10 sec: 5706.3, 60 sec: 5749.0, 300 sec: 5758.3). Total num frames: 216752128. Throughput: 0: 5200.2. Samples: 216750456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:37:57,531][25689] Avg episode reward: [(0, '-49.074')] [2022-07-09 10:37:58,135][26022] Updated weights on worker 0-0, policy_version 211678 (0.00100) [2022-07-09 10:38:00,135][26022] Updated weights on worker 0-0, policy_version 211688 (0.00087) [2022-07-09 10:38:02,192][26022] Updated weights on worker 0-0, policy_version 211698 (0.00084) [2022-07-09 10:38:02,647][25689] Fps is (10 sec: 5564.4, 60 sec: 5776.8, 300 sec: 5760.2). Total num frames: 216780800. Throughput: 0: 6022.0. Samples: 216784814. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 10:38:02,647][25689] Avg episode reward: [(0, '-48.301')] [2022-07-09 10:38:03,995][26022] Updated weights on worker 0-0, policy_version 211708 (0.00087) [2022-07-09 10:38:05,753][26022] Updated weights on worker 0-0, policy_version 211718 (0.00087) [2022-07-09 10:38:07,274][26022] Updated weights on worker 0-0, policy_version 211728 (0.00082) [2022-07-09 10:38:07,667][25689] Fps is (10 sec: 5758.5, 60 sec: 5758.5, 300 sec: 5763.4). Total num frames: 216810496. Throughput: 0: 5935.8. Samples: 216817984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:07,667][25689] Avg episode reward: [(0, '-48.582')] [2022-07-09 10:38:09,415][26022] Updated weights on worker 0-0, policy_version 211738 (0.00086) [2022-07-09 10:38:11,227][26022] Updated weights on worker 0-0, policy_version 211748 (0.00087) [2022-07-09 10:38:12,699][25689] Fps is (10 sec: 5806.5, 60 sec: 5756.1, 300 sec: 5763.3). Total num frames: 216839168. Throughput: 0: 5079.7. Samples: 216835348. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:12,700][25689] Avg episode reward: [(0, '-48.481')] [2022-07-09 10:38:12,939][26022] Updated weights on worker 0-0, policy_version 211758 (0.00094) [2022-07-09 10:38:14,835][26022] Updated weights on worker 0-0, policy_version 211768 (0.00088) [2022-07-09 10:38:16,332][26022] Updated weights on worker 0-0, policy_version 211778 (0.00086) [2022-07-09 10:38:17,705][25689] Fps is (10 sec: 5712.6, 60 sec: 5758.1, 300 sec: 5757.5). Total num frames: 216867840. Throughput: 0: 5929.2. Samples: 216870072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:17,706][25689] Avg episode reward: [(0, '-48.092')] [2022-07-09 10:38:18,323][26022] Updated weights on worker 0-0, policy_version 211788 (0.00088) [2022-07-09 10:38:19,865][26022] Updated weights on worker 0-0, policy_version 211798 (0.00082) [2022-07-09 10:38:21,697][26022] Updated weights on worker 0-0, policy_version 211808 (0.00083) [2022-07-09 10:38:22,846][25689] Fps is (10 sec: 5752.1, 60 sec: 5768.5, 300 sec: 5761.7). Total num frames: 216897536. Throughput: 0: 5943.7. Samples: 216904874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:22,847][25689] Avg episode reward: [(0, '-47.611')] [2022-07-09 10:38:23,459][26022] Updated weights on worker 0-0, policy_version 211818 (0.00092) [2022-07-09 10:38:25,197][26022] Updated weights on worker 0-0, policy_version 211828 (0.00615) [2022-07-09 10:38:26,999][26022] Updated weights on worker 0-0, policy_version 211838 (0.00106) [2022-07-09 10:38:27,883][25689] Fps is (10 sec: 5835.6, 60 sec: 5755.5, 300 sec: 5758.9). Total num frames: 216927232. Throughput: 0: 5171.2. Samples: 216922524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:27,883][25689] Avg episode reward: [(0, '-46.876')] [2022-07-09 10:38:28,693][26022] Updated weights on worker 0-0, policy_version 211848 (0.00082) [2022-07-09 10:38:30,679][26022] Updated weights on worker 0-0, policy_version 211858 (0.00087) [2022-07-09 10:38:32,459][26022] Updated weights on worker 0-0, policy_version 211868 (0.00083) [2022-07-09 10:38:32,927][25689] Fps is (10 sec: 5891.7, 60 sec: 5787.6, 300 sec: 5758.1). Total num frames: 216956928. Throughput: 0: 6016.1. Samples: 216957040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:32,928][25689] Avg episode reward: [(0, '-47.172')] [2022-07-09 10:38:34,127][26022] Updated weights on worker 0-0, policy_version 211878 (0.00085) [2022-07-09 10:38:35,784][26022] Updated weights on worker 0-0, policy_version 211888 (0.00083) [2022-07-09 10:38:37,534][26022] Updated weights on worker 0-0, policy_version 211898 (0.00088) [2022-07-09 10:38:37,946][25689] Fps is (10 sec: 5698.1, 60 sec: 5770.2, 300 sec: 5763.5). Total num frames: 216984576. Throughput: 0: 6024.6. Samples: 216992018. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:37,947][25689] Avg episode reward: [(0, '-46.734')] [2022-07-09 10:38:39,315][26022] Updated weights on worker 0-0, policy_version 211908 (0.00084) [2022-07-09 10:38:41,175][26022] Updated weights on worker 0-0, policy_version 211918 (0.00088) [2022-07-09 10:38:42,848][26022] Updated weights on worker 0-0, policy_version 211928 (0.00086) [2022-07-09 10:38:42,989][25689] Fps is (10 sec: 5800.6, 60 sec: 5786.8, 300 sec: 5762.9). Total num frames: 217015296. Throughput: 0: 5195.0. Samples: 217009516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:42,990][25689] Avg episode reward: [(0, '-47.180')] [2022-07-09 10:38:44,590][26022] Updated weights on worker 0-0, policy_version 211938 (0.00089) [2022-07-09 10:38:46,541][26022] Updated weights on worker 0-0, policy_version 211948 (0.00087) [2022-07-09 10:38:48,001][25689] Fps is (10 sec: 5907.0, 60 sec: 5769.8, 300 sec: 5760.5). Total num frames: 217043968. Throughput: 0: 6061.4. Samples: 217044468. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:48,003][25689] Avg episode reward: [(0, '-47.515')] [2022-07-09 10:38:48,047][26022] Updated weights on worker 0-0, policy_version 211958 (0.00087) [2022-07-09 10:38:48,685][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:38:48,700][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000211961_217048064.pth [2022-07-09 10:38:48,701][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000209934_214972416.pth [2022-07-09 10:38:49,955][26022] Updated weights on worker 0-0, policy_version 211968 (0.00085) [2022-07-09 10:38:51,667][26022] Updated weights on worker 0-0, policy_version 211978 (0.00102) [2022-07-09 10:38:53,005][25689] Fps is (10 sec: 5725.7, 60 sec: 5756.7, 300 sec: 5767.4). Total num frames: 217072640. Throughput: 0: 6075.4. Samples: 217079020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:53,005][25689] Avg episode reward: [(0, '-47.939')] [2022-07-09 10:38:53,561][26022] Updated weights on worker 0-0, policy_version 211988 (0.00051) [2022-07-09 10:38:55,174][26022] Updated weights on worker 0-0, policy_version 211998 (0.00089) [2022-07-09 10:38:57,108][26022] Updated weights on worker 0-0, policy_version 212008 (0.00085) [2022-07-09 10:38:58,014][25689] Fps is (10 sec: 5829.3, 60 sec: 5790.2, 300 sec: 5765.3). Total num frames: 217102336. Throughput: 0: 5208.2. Samples: 217096532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:38:58,016][25689] Avg episode reward: [(0, '-47.593')] [2022-07-09 10:38:58,741][26022] Updated weights on worker 0-0, policy_version 212018 (0.00094) [2022-07-09 10:39:00,660][26022] Updated weights on worker 0-0, policy_version 212028 (0.00086) [2022-07-09 10:39:02,552][26022] Updated weights on worker 0-0, policy_version 212038 (0.00088) [2022-07-09 10:39:03,098][25689] Fps is (10 sec: 5579.9, 60 sec: 5759.4, 300 sec: 5762.2). Total num frames: 217128960. Throughput: 0: 6000.4. Samples: 217130176. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:03,098][25689] Avg episode reward: [(0, '-48.119')] [2022-07-09 10:39:04,568][26022] Updated weights on worker 0-0, policy_version 212048 (0.00079) [2022-07-09 10:39:06,163][26022] Updated weights on worker 0-0, policy_version 212058 (0.00091) [2022-07-09 10:39:08,046][26022] Updated weights on worker 0-0, policy_version 212068 (0.00083) [2022-07-09 10:39:08,106][25689] Fps is (10 sec: 5479.0, 60 sec: 5743.6, 300 sec: 5765.6). Total num frames: 217157632. Throughput: 0: 5968.7. Samples: 217164470. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:08,108][25689] Avg episode reward: [(0, '-48.266')] [2022-07-09 10:39:09,643][26022] Updated weights on worker 0-0, policy_version 212078 (0.00089) [2022-07-09 10:39:11,607][26022] Updated weights on worker 0-0, policy_version 212088 (0.00089) [2022-07-09 10:39:13,080][26022] Updated weights on worker 0-0, policy_version 212098 (0.00082) [2022-07-09 10:39:13,115][25689] Fps is (10 sec: 5929.3, 60 sec: 5779.7, 300 sec: 5769.0). Total num frames: 217188352. Throughput: 0: 5108.5. Samples: 217181752. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:13,115][25689] Avg episode reward: [(0, '-47.180')] [2022-07-09 10:39:15,233][26022] Updated weights on worker 0-0, policy_version 212108 (0.00086) [2022-07-09 10:39:16,509][26022] Updated weights on worker 0-0, policy_version 212118 (0.00086) [2022-07-09 10:39:18,127][25689] Fps is (10 sec: 5722.6, 60 sec: 5745.2, 300 sec: 5767.6). Total num frames: 217214976. Throughput: 0: 5983.4. Samples: 217216876. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:18,127][25689] Avg episode reward: [(0, '-48.595')] [2022-07-09 10:39:18,636][26022] Updated weights on worker 0-0, policy_version 212128 (0.00086) [2022-07-09 10:39:20,379][26022] Updated weights on worker 0-0, policy_version 212138 (0.00087) [2022-07-09 10:39:22,143][26022] Updated weights on worker 0-0, policy_version 212148 (0.00083) [2022-07-09 10:39:23,272][25689] Fps is (10 sec: 5645.4, 60 sec: 5761.8, 300 sec: 5762.2). Total num frames: 217245696. Throughput: 0: 6013.8. Samples: 217251500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:23,273][25689] Avg episode reward: [(0, '-49.167')] [2022-07-09 10:39:23,891][26022] Updated weights on worker 0-0, policy_version 212158 (0.00100) [2022-07-09 10:39:25,530][26022] Updated weights on worker 0-0, policy_version 212168 (0.00095) [2022-07-09 10:39:27,392][26022] Updated weights on worker 0-0, policy_version 212178 (0.00084) [2022-07-09 10:39:28,284][25689] Fps is (10 sec: 5847.3, 60 sec: 5747.2, 300 sec: 5759.0). Total num frames: 217274368. Throughput: 0: 5172.0. Samples: 217268828. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:28,284][25689] Avg episode reward: [(0, '-49.164')] [2022-07-09 10:39:29,193][26022] Updated weights on worker 0-0, policy_version 212188 (0.00088) [2022-07-09 10:39:30,963][26022] Updated weights on worker 0-0, policy_version 212198 (0.00081) [2022-07-09 10:39:32,770][26022] Updated weights on worker 0-0, policy_version 212208 (0.00083) [2022-07-09 10:39:33,299][25689] Fps is (10 sec: 5821.1, 60 sec: 5749.9, 300 sec: 5767.4). Total num frames: 217304064. Throughput: 0: 6026.1. Samples: 217303386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:33,300][25689] Avg episode reward: [(0, '-48.588')] [2022-07-09 10:39:34,558][26022] Updated weights on worker 0-0, policy_version 212218 (0.00086) [2022-07-09 10:39:36,285][26022] Updated weights on worker 0-0, policy_version 212228 (0.00084) [2022-07-09 10:39:38,172][26022] Updated weights on worker 0-0, policy_version 212238 (0.00086) [2022-07-09 10:39:38,340][25689] Fps is (10 sec: 5804.4, 60 sec: 5764.9, 300 sec: 5764.0). Total num frames: 217332736. Throughput: 0: 6019.3. Samples: 217338544. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:38,340][25689] Avg episode reward: [(0, '-48.511')] [2022-07-09 10:39:39,917][26022] Updated weights on worker 0-0, policy_version 212248 (0.00089) [2022-07-09 10:39:41,552][26022] Updated weights on worker 0-0, policy_version 212258 (0.00084) [2022-07-09 10:39:43,427][25689] Fps is (10 sec: 5662.1, 60 sec: 5726.8, 300 sec: 5760.6). Total num frames: 217361408. Throughput: 0: 5172.1. Samples: 217355742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:43,427][25689] Avg episode reward: [(0, '-49.055')] [2022-07-09 10:39:43,530][26022] Updated weights on worker 0-0, policy_version 212269 (0.00083) [2022-07-09 10:39:45,265][26022] Updated weights on worker 0-0, policy_version 212279 (0.00086) [2022-07-09 10:39:47,050][26022] Updated weights on worker 0-0, policy_version 212289 (0.00094) [2022-07-09 10:39:48,504][25689] Fps is (10 sec: 5843.2, 60 sec: 5754.4, 300 sec: 5769.7). Total num frames: 217392128. Throughput: 0: 6025.9. Samples: 217390672. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:48,505][25689] Avg episode reward: [(0, '-49.491')] [2022-07-09 10:39:48,855][26022] Updated weights on worker 0-0, policy_version 212299 (0.00095) [2022-07-09 10:39:50,559][26022] Updated weights on worker 0-0, policy_version 212309 (0.00084) [2022-07-09 10:39:52,339][26022] Updated weights on worker 0-0, policy_version 212319 (0.00083) [2022-07-09 10:39:53,605][25689] Fps is (10 sec: 5835.3, 60 sec: 5745.2, 300 sec: 5764.5). Total num frames: 217420800. Throughput: 0: 5994.9. Samples: 217425116. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:53,605][25689] Avg episode reward: [(0, '-49.337')] [2022-07-09 10:39:54,208][26022] Updated weights on worker 0-0, policy_version 212329 (0.00084) [2022-07-09 10:39:55,841][26022] Updated weights on worker 0-0, policy_version 212339 (0.00087) [2022-07-09 10:39:57,804][26022] Updated weights on worker 0-0, policy_version 212349 (0.00084) [2022-07-09 10:39:58,635][25689] Fps is (10 sec: 5862.6, 60 sec: 5760.1, 300 sec: 5769.4). Total num frames: 217451520. Throughput: 0: 5986.8. Samples: 217460046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:39:58,635][25689] Avg episode reward: [(0, '-50.447')] [2022-07-09 10:39:59,269][26022] Updated weights on worker 0-0, policy_version 212359 (0.00083) [2022-07-09 10:40:01,214][26022] Updated weights on worker 0-0, policy_version 212369 (0.00051) [2022-07-09 10:40:03,334][26022] Updated weights on worker 0-0, policy_version 212379 (0.00081) [2022-07-09 10:40:03,686][25689] Fps is (10 sec: 5586.7, 60 sec: 5746.3, 300 sec: 5759.2). Total num frames: 217477120. Throughput: 0: 6014.9. Samples: 217477600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:40:03,691][25689] Avg episode reward: [(0, '-49.298')] [2022-07-09 10:40:05,069][26022] Updated weights on worker 0-0, policy_version 212389 (0.00088) [2022-07-09 10:40:06,773][26022] Updated weights on worker 0-0, policy_version 212399 (0.00093) [2022-07-09 10:40:08,701][25689] Fps is (10 sec: 5391.7, 60 sec: 5745.7, 300 sec: 5759.0). Total num frames: 217505792. Throughput: 0: 5942.1. Samples: 217510682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:40:08,701][25689] Avg episode reward: [(0, '-49.159')] [2022-07-09 10:40:08,766][26022] Updated weights on worker 0-0, policy_version 212409 (0.00099) [2022-07-09 10:40:10,263][26022] Updated weights on worker 0-0, policy_version 212419 (0.00081) [2022-07-09 10:40:12,313][26022] Updated weights on worker 0-0, policy_version 212429 (0.00077) [2022-07-09 10:40:13,541][26022] Updated weights on worker 0-0, policy_version 212439 (0.00086) [2022-07-09 10:40:13,715][25689] Fps is (10 sec: 6024.3, 60 sec: 5762.1, 300 sec: 5773.4). Total num frames: 217537536. Throughput: 0: 5972.4. Samples: 217545220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:40:13,715][25689] Avg episode reward: [(0, '-48.331')] [2022-07-09 10:40:15,831][26022] Updated weights on worker 0-0, policy_version 212449 (0.00091) [2022-07-09 10:40:17,547][26022] Updated weights on worker 0-0, policy_version 212459 (0.00178) [2022-07-09 10:40:18,737][25689] Fps is (10 sec: 5815.6, 60 sec: 5761.1, 300 sec: 5763.7). Total num frames: 217564160. Throughput: 0: 5097.8. Samples: 217562524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 10:40:18,738][25689] Avg episode reward: [(0, '-48.496')] [2022-07-09 10:40:19,367][26022] Updated weights on worker 0-0, policy_version 212469 (0.00084) [2022-07-09 10:40:21,166][26022] Updated weights on worker 0-0, policy_version 212479 (0.00087) [2022-07-09 10:40:22,730][26022] Updated weights on worker 0-0, policy_version 212489 (0.00097) [2022-07-09 10:40:23,843][25689] Fps is (10 sec: 5560.4, 60 sec: 5747.9, 300 sec: 5758.5). Total num frames: 217593856. Throughput: 0: 5933.9. Samples: 217597212. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:23,844][25689] Avg episode reward: [(0, '-49.154')] [2022-07-09 10:40:24,497][26022] Updated weights on worker 0-0, policy_version 212499 (0.00087) [2022-07-09 10:40:26,530][26022] Updated weights on worker 0-0, policy_version 212509 (0.00088) [2022-07-09 10:40:27,991][26022] Updated weights on worker 0-0, policy_version 212519 (0.00085) [2022-07-09 10:40:28,870][25689] Fps is (10 sec: 5760.1, 60 sec: 5746.5, 300 sec: 5761.7). Total num frames: 217622528. Throughput: 0: 6010.8. Samples: 217631918. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:28,871][25689] Avg episode reward: [(0, '-49.118')] [2022-07-09 10:40:30,010][26022] Updated weights on worker 0-0, policy_version 212529 (0.00088) [2022-07-09 10:40:31,759][26022] Updated weights on worker 0-0, policy_version 212539 (0.00080) [2022-07-09 10:40:33,497][26022] Updated weights on worker 0-0, policy_version 212549 (0.00087) [2022-07-09 10:40:33,902][25689] Fps is (10 sec: 5904.9, 60 sec: 5761.9, 300 sec: 5764.8). Total num frames: 217653248. Throughput: 0: 5140.7. Samples: 217648994. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:33,902][25689] Avg episode reward: [(0, '-49.473')] [2022-07-09 10:40:35,305][26022] Updated weights on worker 0-0, policy_version 212559 (0.00085) [2022-07-09 10:40:36,968][26022] Updated weights on worker 0-0, policy_version 212569 (0.00089) [2022-07-09 10:40:38,815][26022] Updated weights on worker 0-0, policy_version 212579 (0.00084) [2022-07-09 10:40:38,914][25689] Fps is (10 sec: 5811.4, 60 sec: 5747.7, 300 sec: 5758.7). Total num frames: 217680896. Throughput: 0: 6026.6. Samples: 217684122. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:38,915][25689] Avg episode reward: [(0, '-49.859')] [2022-07-09 10:40:40,484][26022] Updated weights on worker 0-0, policy_version 212589 (0.00882) [2022-07-09 10:40:42,154][26022] Updated weights on worker 0-0, policy_version 212599 (0.00083) [2022-07-09 10:40:44,011][25689] Fps is (10 sec: 5672.5, 60 sec: 5763.6, 300 sec: 5758.3). Total num frames: 217710592. Throughput: 0: 6035.0. Samples: 217718922. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:44,011][25689] Avg episode reward: [(0, '-49.648')] [2022-07-09 10:40:44,135][26022] Updated weights on worker 0-0, policy_version 212609 (0.00078) [2022-07-09 10:40:45,826][26022] Updated weights on worker 0-0, policy_version 212619 (0.00085) [2022-07-09 10:40:47,545][26022] Updated weights on worker 0-0, policy_version 212629 (0.00083) [2022-07-09 10:40:48,714][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:40:48,725][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000212635_217738240.pth [2022-07-09 10:40:48,725][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000210608_215662592.pth [2022-07-09 10:40:49,036][25689] Fps is (10 sec: 5867.4, 60 sec: 5751.6, 300 sec: 5758.7). Total num frames: 217740288. Throughput: 0: 5180.3. Samples: 217736384. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:49,037][25689] Avg episode reward: [(0, '-49.497')] [2022-07-09 10:40:49,300][26022] Updated weights on worker 0-0, policy_version 212639 (0.00081) [2022-07-09 10:40:50,954][26022] Updated weights on worker 0-0, policy_version 212649 (0.00087) [2022-07-09 10:40:52,957][26022] Updated weights on worker 0-0, policy_version 212659 (0.00090) [2022-07-09 10:40:54,056][25689] Fps is (10 sec: 5810.3, 60 sec: 5759.3, 300 sec: 5759.6). Total num frames: 217768960. Throughput: 0: 6067.0. Samples: 217771274. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:54,057][25689] Avg episode reward: [(0, '-48.542')] [2022-07-09 10:40:54,819][26022] Updated weights on worker 0-0, policy_version 212669 (0.00087) [2022-07-09 10:40:56,351][26022] Updated weights on worker 0-0, policy_version 212679 (0.00088) [2022-07-09 10:40:58,205][26022] Updated weights on worker 0-0, policy_version 212689 (0.00093) [2022-07-09 10:40:59,063][25689] Fps is (10 sec: 5821.2, 60 sec: 5744.6, 300 sec: 5761.6). Total num frames: 217798656. Throughput: 0: 6056.2. Samples: 217806150. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:40:59,064][25689] Avg episode reward: [(0, '-49.000')] [2022-07-09 10:40:59,824][26022] Updated weights on worker 0-0, policy_version 212699 (0.00094) [2022-07-09 10:41:01,838][26022] Updated weights on worker 0-0, policy_version 212709 (0.00095) [2022-07-09 10:41:03,763][26022] Updated weights on worker 0-0, policy_version 212719 (0.00084) [2022-07-09 10:41:04,182][25689] Fps is (10 sec: 5663.2, 60 sec: 5772.0, 300 sec: 5762.8). Total num frames: 217826304. Throughput: 0: 5179.9. Samples: 217823408. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:04,182][25689] Avg episode reward: [(0, '-48.787')] [2022-07-09 10:41:05,531][26022] Updated weights on worker 0-0, policy_version 212729 (0.00084) [2022-07-09 10:41:07,290][26022] Updated weights on worker 0-0, policy_version 212739 (0.00080) [2022-07-09 10:41:08,950][26022] Updated weights on worker 0-0, policy_version 212749 (0.00083) [2022-07-09 10:41:09,194][25689] Fps is (10 sec: 5559.0, 60 sec: 5772.2, 300 sec: 5759.5). Total num frames: 217854976. Throughput: 0: 5970.8. Samples: 217856744. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:09,196][25689] Avg episode reward: [(0, '-48.587')] [2022-07-09 10:41:10,791][26022] Updated weights on worker 0-0, policy_version 212759 (0.00102) [2022-07-09 10:41:12,558][26022] Updated weights on worker 0-0, policy_version 212769 (0.00565) [2022-07-09 10:41:14,178][26022] Updated weights on worker 0-0, policy_version 212779 (0.00085) [2022-07-09 10:41:14,277][25689] Fps is (10 sec: 5883.0, 60 sec: 5748.7, 300 sec: 5761.9). Total num frames: 217885696. Throughput: 0: 5958.2. Samples: 217891756. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:14,278][25689] Avg episode reward: [(0, '-49.562')] [2022-07-09 10:41:16,174][26022] Updated weights on worker 0-0, policy_version 212789 (0.00085) [2022-07-09 10:41:17,990][26022] Updated weights on worker 0-0, policy_version 212799 (0.00077) [2022-07-09 10:41:19,287][25689] Fps is (10 sec: 5783.1, 60 sec: 5766.9, 300 sec: 5759.4). Total num frames: 217913344. Throughput: 0: 5099.3. Samples: 217909278. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:19,288][25689] Avg episode reward: [(0, '-49.582')] [2022-07-09 10:41:19,525][26022] Updated weights on worker 0-0, policy_version 212809 (0.00085) [2022-07-09 10:41:21,382][26022] Updated weights on worker 0-0, policy_version 212819 (0.00087) [2022-07-09 10:41:23,011][26022] Updated weights on worker 0-0, policy_version 212829 (0.00081) [2022-07-09 10:41:24,355][25689] Fps is (10 sec: 5690.5, 60 sec: 5770.6, 300 sec: 5751.7). Total num frames: 217943040. Throughput: 0: 5976.8. Samples: 217943976. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:24,357][25689] Avg episode reward: [(0, '-49.577')] [2022-07-09 10:41:24,801][26022] Updated weights on worker 0-0, policy_version 212839 (0.00089) [2022-07-09 10:41:26,720][26022] Updated weights on worker 0-0, policy_version 212849 (0.00078) [2022-07-09 10:41:28,300][26022] Updated weights on worker 0-0, policy_version 212859 (0.00093) [2022-07-09 10:41:29,427][25689] Fps is (10 sec: 5756.4, 60 sec: 5766.2, 300 sec: 5757.6). Total num frames: 217971712. Throughput: 0: 6045.2. Samples: 217979054. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:29,429][25689] Avg episode reward: [(0, '-49.804')] [2022-07-09 10:41:30,391][26022] Updated weights on worker 0-0, policy_version 212869 (0.00085) [2022-07-09 10:41:32,006][26022] Updated weights on worker 0-0, policy_version 212879 (0.00086) [2022-07-09 10:41:33,719][26022] Updated weights on worker 0-0, policy_version 212889 (0.00085) [2022-07-09 10:41:34,487][25689] Fps is (10 sec: 5962.3, 60 sec: 5780.3, 300 sec: 5763.8). Total num frames: 218003456. Throughput: 0: 5175.2. Samples: 217996346. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:34,490][25689] Avg episode reward: [(0, '-48.740')] [2022-07-09 10:41:35,406][26022] Updated weights on worker 0-0, policy_version 212899 (0.00086) [2022-07-09 10:41:37,131][26022] Updated weights on worker 0-0, policy_version 212909 (0.00090) [2022-07-09 10:41:39,101][26022] Updated weights on worker 0-0, policy_version 212919 (0.00087) [2022-07-09 10:41:39,513][25689] Fps is (10 sec: 5787.1, 60 sec: 5762.2, 300 sec: 5758.1). Total num frames: 218030080. Throughput: 0: 6034.3. Samples: 218031324. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:39,513][25689] Avg episode reward: [(0, '-47.877')] [2022-07-09 10:41:40,807][26022] Updated weights on worker 0-0, policy_version 212929 (0.00087) [2022-07-09 10:41:42,605][26022] Updated weights on worker 0-0, policy_version 212939 (0.00087) [2022-07-09 10:41:44,388][26022] Updated weights on worker 0-0, policy_version 212949 (0.00086) [2022-07-09 10:41:44,586][25689] Fps is (10 sec: 5577.2, 60 sec: 5764.5, 300 sec: 5753.9). Total num frames: 218059776. Throughput: 0: 6049.4. Samples: 218066362. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:44,586][25689] Avg episode reward: [(0, '-47.255')] [2022-07-09 10:41:45,823][26022] Updated weights on worker 0-0, policy_version 212959 (0.00079) [2022-07-09 10:41:47,797][26022] Updated weights on worker 0-0, policy_version 212969 (0.00087) [2022-07-09 10:41:49,596][26022] Updated weights on worker 0-0, policy_version 212979 (0.00085) [2022-07-09 10:41:49,688][25689] Fps is (10 sec: 5937.5, 60 sec: 5774.1, 300 sec: 5759.6). Total num frames: 218090496. Throughput: 0: 5175.9. Samples: 218083924. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:49,689][25689] Avg episode reward: [(0, '-47.352')] [2022-07-09 10:41:51,208][26022] Updated weights on worker 0-0, policy_version 212989 (0.00085) [2022-07-09 10:41:52,963][26022] Updated weights on worker 0-0, policy_version 212999 (0.00096) [2022-07-09 10:41:54,771][25689] Fps is (10 sec: 5931.9, 60 sec: 5785.0, 300 sec: 5763.0). Total num frames: 218120192. Throughput: 0: 6047.1. Samples: 218119000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:54,771][25689] Avg episode reward: [(0, '-47.873')] [2022-07-09 10:41:54,772][26022] Updated weights on worker 0-0, policy_version 213009 (0.00087) [2022-07-09 10:41:56,600][26022] Updated weights on worker 0-0, policy_version 213019 (0.00078) [2022-07-09 10:41:58,355][26022] Updated weights on worker 0-0, policy_version 213029 (0.00083) [2022-07-09 10:41:59,781][25689] Fps is (10 sec: 5884.6, 60 sec: 5784.7, 300 sec: 5774.2). Total num frames: 218149888. Throughput: 0: 6048.1. Samples: 218153908. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:41:59,781][25689] Avg episode reward: [(0, '-46.865')] [2022-07-09 10:42:00,171][26022] Updated weights on worker 0-0, policy_version 213039 (0.00090) [2022-07-09 10:42:02,359][26022] Updated weights on worker 0-0, policy_version 213049 (0.00090) [2022-07-09 10:42:03,971][26022] Updated weights on worker 0-0, policy_version 213059 (0.00087) [2022-07-09 10:42:04,843][25689] Fps is (10 sec: 5489.6, 60 sec: 5756.3, 300 sec: 5755.8). Total num frames: 218175488. Throughput: 0: 5119.0. Samples: 218170060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:04,844][25689] Avg episode reward: [(0, '-46.863')] [2022-07-09 10:42:05,834][26022] Updated weights on worker 0-0, policy_version 213069 (0.00086) [2022-07-09 10:42:07,480][26022] Updated weights on worker 0-0, policy_version 213079 (0.00091) [2022-07-09 10:42:09,327][26022] Updated weights on worker 0-0, policy_version 213089 (0.00089) [2022-07-09 10:42:09,896][25689] Fps is (10 sec: 5466.6, 60 sec: 5769.4, 300 sec: 5758.4). Total num frames: 218205184. Throughput: 0: 5946.0. Samples: 218204080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:09,896][25689] Avg episode reward: [(0, '-47.951')] [2022-07-09 10:42:10,893][26022] Updated weights on worker 0-0, policy_version 213099 (0.00092) [2022-07-09 10:42:12,863][26022] Updated weights on worker 0-0, policy_version 213109 (0.00092) [2022-07-09 10:42:14,506][26022] Updated weights on worker 0-0, policy_version 213119 (0.00086) [2022-07-09 10:42:14,899][25689] Fps is (10 sec: 5906.5, 60 sec: 5760.1, 300 sec: 5762.3). Total num frames: 218234880. Throughput: 0: 5966.6. Samples: 218239096. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:14,899][25689] Avg episode reward: [(0, '-48.443')] [2022-07-09 10:42:16,521][26022] Updated weights on worker 0-0, policy_version 213129 (0.00086) [2022-07-09 10:42:18,182][26022] Updated weights on worker 0-0, policy_version 213139 (0.00084) [2022-07-09 10:42:19,843][26022] Updated weights on worker 0-0, policy_version 213149 (0.00089) [2022-07-09 10:42:19,909][25689] Fps is (10 sec: 5931.4, 60 sec: 5793.8, 300 sec: 5767.0). Total num frames: 218264576. Throughput: 0: 5971.2. Samples: 218274098. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:19,910][25689] Avg episode reward: [(0, '-49.058')] [2022-07-09 10:42:21,675][26022] Updated weights on worker 0-0, policy_version 213159 (0.00083) [2022-07-09 10:42:23,411][26022] Updated weights on worker 0-0, policy_version 213169 (0.00086) [2022-07-09 10:42:24,974][25689] Fps is (10 sec: 5793.5, 60 sec: 5777.2, 300 sec: 5760.3). Total num frames: 218293248. Throughput: 0: 6027.2. Samples: 218291388. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:24,974][25689] Avg episode reward: [(0, '-49.222')] [2022-07-09 10:42:25,322][26022] Updated weights on worker 0-0, policy_version 213179 (0.00085) [2022-07-09 10:42:27,039][26022] Updated weights on worker 0-0, policy_version 213189 (0.00083) [2022-07-09 10:42:28,773][26022] Updated weights on worker 0-0, policy_version 213199 (0.00091) [2022-07-09 10:42:29,987][25689] Fps is (10 sec: 5689.9, 60 sec: 5782.8, 300 sec: 5764.0). Total num frames: 218321920. Throughput: 0: 6058.2. Samples: 218325798. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:29,988][25689] Avg episode reward: [(0, '-49.486')] [2022-07-09 10:42:30,500][26022] Updated weights on worker 0-0, policy_version 213209 (0.00098) [2022-07-09 10:42:32,129][26022] Updated weights on worker 0-0, policy_version 213219 (0.00083) [2022-07-09 10:42:34,282][26022] Updated weights on worker 0-0, policy_version 213229 (0.00086) [2022-07-09 10:42:34,990][25689] Fps is (10 sec: 5929.5, 60 sec: 5771.4, 300 sec: 5771.1). Total num frames: 218352640. Throughput: 0: 6058.1. Samples: 218360810. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2022-07-09 10:42:34,992][25689] Avg episode reward: [(0, '-49.866')] [2022-07-09 10:42:35,662][26022] Updated weights on worker 0-0, policy_version 213239 (0.00094) [2022-07-09 10:42:37,629][26022] Updated weights on worker 0-0, policy_version 213249 (0.00084) [2022-07-09 10:42:39,265][26022] Updated weights on worker 0-0, policy_version 213259 (0.00081) [2022-07-09 10:42:40,025][25689] Fps is (10 sec: 5712.7, 60 sec: 5770.4, 300 sec: 5760.9). Total num frames: 218379264. Throughput: 0: 5180.6. Samples: 218378312. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:42:40,027][25689] Avg episode reward: [(0, '-49.426')] [2022-07-09 10:42:41,054][26022] Updated weights on worker 0-0, policy_version 213269 (0.00081) [2022-07-09 10:42:42,998][26022] Updated weights on worker 0-0, policy_version 213279 (0.00087) [2022-07-09 10:42:44,684][26022] Updated weights on worker 0-0, policy_version 213289 (0.00085) [2022-07-09 10:42:45,103][25689] Fps is (10 sec: 5771.3, 60 sec: 5803.8, 300 sec: 5766.5). Total num frames: 218411008. Throughput: 0: 6058.8. Samples: 218413350. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:42:45,107][25689] Avg episode reward: [(0, '-48.688')] [2022-07-09 10:42:46,279][26022] Updated weights on worker 0-0, policy_version 213299 (0.00085) [2022-07-09 10:42:48,204][26022] Updated weights on worker 0-0, policy_version 213309 (0.00093) [2022-07-09 10:42:48,742][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:42:48,754][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000213312_218431488.pth [2022-07-09 10:42:48,754][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000211284_216354816.pth [2022-07-09 10:42:49,844][26022] Updated weights on worker 0-0, policy_version 213319 (0.00087) [2022-07-09 10:42:50,141][25689] Fps is (10 sec: 6073.7, 60 sec: 5793.1, 300 sec: 5766.6). Total num frames: 218440704. Throughput: 0: 6067.2. Samples: 218448074. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:42:50,141][25689] Avg episode reward: [(0, '-48.613')] [2022-07-09 10:42:51,800][26022] Updated weights on worker 0-0, policy_version 213329 (0.00089) [2022-07-09 10:42:53,266][26022] Updated weights on worker 0-0, policy_version 213339 (0.00084) [2022-07-09 10:42:55,164][25689] Fps is (10 sec: 5699.7, 60 sec: 5764.9, 300 sec: 5766.2). Total num frames: 218468352. Throughput: 0: 5205.6. Samples: 218465830. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:42:55,166][25689] Avg episode reward: [(0, '-48.786')] [2022-07-09 10:42:55,229][26022] Updated weights on worker 0-0, policy_version 213349 (0.00087) [2022-07-09 10:42:56,913][26022] Updated weights on worker 0-0, policy_version 213359 (0.00099) [2022-07-09 10:42:58,755][26022] Updated weights on worker 0-0, policy_version 213369 (0.00085) [2022-07-09 10:43:00,247][25689] Fps is (10 sec: 5674.4, 60 sec: 5758.0, 300 sec: 5770.3). Total num frames: 218498048. Throughput: 0: 6050.6. Samples: 218500664. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:00,247][25689] Avg episode reward: [(0, '-49.396')] [2022-07-09 10:43:00,507][26022] Updated weights on worker 0-0, policy_version 213379 (0.00522) [2022-07-09 10:43:02,759][26022] Updated weights on worker 0-0, policy_version 213389 (0.00087) [2022-07-09 10:43:04,203][26022] Updated weights on worker 0-0, policy_version 213399 (0.00090) [2022-07-09 10:43:05,369][25689] Fps is (10 sec: 5518.9, 60 sec: 5769.2, 300 sec: 5758.0). Total num frames: 218524672. Throughput: 0: 5931.2. Samples: 218533550. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:05,370][25689] Avg episode reward: [(0, '-48.613')] [2022-07-09 10:43:06,175][26022] Updated weights on worker 0-0, policy_version 213409 (0.00085) [2022-07-09 10:43:07,724][26022] Updated weights on worker 0-0, policy_version 213419 (0.00082) [2022-07-09 10:43:09,624][26022] Updated weights on worker 0-0, policy_version 213429 (0.00090) [2022-07-09 10:43:10,393][25689] Fps is (10 sec: 5651.5, 60 sec: 5788.8, 300 sec: 5765.1). Total num frames: 218555392. Throughput: 0: 5082.9. Samples: 218551016. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:10,394][25689] Avg episode reward: [(0, '-47.609')] [2022-07-09 10:43:11,387][26022] Updated weights on worker 0-0, policy_version 213439 (0.00084) [2022-07-09 10:43:13,295][26022] Updated weights on worker 0-0, policy_version 213449 (0.00084) [2022-07-09 10:43:14,870][26022] Updated weights on worker 0-0, policy_version 213459 (0.00096) [2022-07-09 10:43:15,413][25689] Fps is (10 sec: 6015.1, 60 sec: 5787.2, 300 sec: 5768.2). Total num frames: 218585088. Throughput: 0: 5931.7. Samples: 218585942. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:15,414][25689] Avg episode reward: [(0, '-47.352')] [2022-07-09 10:43:16,852][26022] Updated weights on worker 0-0, policy_version 213469 (0.00944) [2022-07-09 10:43:18,435][26022] Updated weights on worker 0-0, policy_version 213479 (0.00082) [2022-07-09 10:43:20,270][26022] Updated weights on worker 0-0, policy_version 213489 (0.00084) [2022-07-09 10:43:20,419][25689] Fps is (10 sec: 5821.9, 60 sec: 5770.7, 300 sec: 5767.4). Total num frames: 218613760. Throughput: 0: 5950.5. Samples: 218620698. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:20,419][25689] Avg episode reward: [(0, '-47.965')] [2022-07-09 10:43:21,943][26022] Updated weights on worker 0-0, policy_version 213499 (0.00363) [2022-07-09 10:43:23,899][26022] Updated weights on worker 0-0, policy_version 213509 (0.00091) [2022-07-09 10:43:25,525][25689] Fps is (10 sec: 5772.5, 60 sec: 5783.7, 300 sec: 5766.1). Total num frames: 218643456. Throughput: 0: 5178.6. Samples: 218637928. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:25,525][25689] Avg episode reward: [(0, '-48.127')] [2022-07-09 10:43:25,531][26022] Updated weights on worker 0-0, policy_version 213519 (0.00096) [2022-07-09 10:43:27,475][26022] Updated weights on worker 0-0, policy_version 213529 (0.00094) [2022-07-09 10:43:29,287][26022] Updated weights on worker 0-0, policy_version 213539 (0.00083) [2022-07-09 10:43:30,544][25689] Fps is (10 sec: 5663.3, 60 sec: 5766.2, 300 sec: 5759.7). Total num frames: 218671104. Throughput: 0: 6015.3. Samples: 218672230. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:30,545][25689] Avg episode reward: [(0, '-47.213')] [2022-07-09 10:43:31,001][26022] Updated weights on worker 0-0, policy_version 213549 (0.00084) [2022-07-09 10:43:32,900][26022] Updated weights on worker 0-0, policy_version 213559 (0.00092) [2022-07-09 10:43:34,594][26022] Updated weights on worker 0-0, policy_version 213569 (0.00085) [2022-07-09 10:43:35,575][25689] Fps is (10 sec: 5705.9, 60 sec: 5746.7, 300 sec: 5766.3). Total num frames: 218700800. Throughput: 0: 5991.5. Samples: 218706738. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:35,575][25689] Avg episode reward: [(0, '-48.530')] [2022-07-09 10:43:36,458][26022] Updated weights on worker 0-0, policy_version 213579 (0.00085) [2022-07-09 10:43:38,184][26022] Updated weights on worker 0-0, policy_version 213589 (0.00081) [2022-07-09 10:43:39,990][26022] Updated weights on worker 0-0, policy_version 213599 (0.00086) [2022-07-09 10:43:40,594][25689] Fps is (10 sec: 5706.2, 60 sec: 5765.1, 300 sec: 5756.5). Total num frames: 218728448. Throughput: 0: 5128.8. Samples: 218724172. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:40,594][25689] Avg episode reward: [(0, '-49.102')] [2022-07-09 10:43:41,565][26022] Updated weights on worker 0-0, policy_version 213609 (0.00079) [2022-07-09 10:43:43,491][26022] Updated weights on worker 0-0, policy_version 213619 (0.00083) [2022-07-09 10:43:45,149][26022] Updated weights on worker 0-0, policy_version 213629 (0.00085) [2022-07-09 10:43:45,710][25689] Fps is (10 sec: 5759.1, 60 sec: 5744.6, 300 sec: 5761.3). Total num frames: 218759168. Throughput: 0: 6017.5. Samples: 218759390. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:45,710][25689] Avg episode reward: [(0, '-49.303')] [2022-07-09 10:43:46,963][26022] Updated weights on worker 0-0, policy_version 213639 (0.00091) [2022-07-09 10:43:48,734][26022] Updated weights on worker 0-0, policy_version 213649 (0.00093) [2022-07-09 10:43:50,274][26022] Updated weights on worker 0-0, policy_version 213659 (0.00083) [2022-07-09 10:43:50,714][25689] Fps is (10 sec: 5970.2, 60 sec: 5747.8, 300 sec: 5764.8). Total num frames: 218788864. Throughput: 0: 6059.7. Samples: 218794450. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:50,714][25689] Avg episode reward: [(0, '-48.283')] [2022-07-09 10:43:52,348][26022] Updated weights on worker 0-0, policy_version 213669 (0.00087) [2022-07-09 10:43:53,815][26022] Updated weights on worker 0-0, policy_version 213679 (0.00088) [2022-07-09 10:43:55,740][25689] Fps is (10 sec: 5717.1, 60 sec: 5747.5, 300 sec: 5757.6). Total num frames: 218816512. Throughput: 0: 5196.4. Samples: 218811524. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:43:55,742][25689] Avg episode reward: [(0, '-48.560')] [2022-07-09 10:43:55,806][26022] Updated weights on worker 0-0, policy_version 213689 (0.00088) [2022-07-09 10:43:57,748][26022] Updated weights on worker 0-0, policy_version 213699 (0.00086) [2022-07-09 10:43:59,195][26022] Updated weights on worker 0-0, policy_version 213709 (0.00089) [2022-07-09 10:44:00,782][25689] Fps is (10 sec: 5695.4, 60 sec: 5751.3, 300 sec: 5768.7). Total num frames: 218846208. Throughput: 0: 6049.0. Samples: 218846290. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:00,783][25689] Avg episode reward: [(0, '-48.761')] [2022-07-09 10:44:01,262][26022] Updated weights on worker 0-0, policy_version 213719 (0.00080) [2022-07-09 10:44:02,975][26022] Updated weights on worker 0-0, policy_version 213729 (0.00089) [2022-07-09 10:44:05,033][26022] Updated weights on worker 0-0, policy_version 213739 (0.00085) [2022-07-09 10:44:05,854][25689] Fps is (10 sec: 5568.7, 60 sec: 5756.2, 300 sec: 5760.6). Total num frames: 218872832. Throughput: 0: 5954.3. Samples: 218879334. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:05,854][25689] Avg episode reward: [(0, '-48.439')] [2022-07-09 10:44:06,718][26022] Updated weights on worker 0-0, policy_version 213749 (0.00053) [2022-07-09 10:44:08,547][26022] Updated weights on worker 0-0, policy_version 213759 (0.00083) [2022-07-09 10:44:10,227][26022] Updated weights on worker 0-0, policy_version 213769 (0.00086) [2022-07-09 10:44:10,868][25689] Fps is (10 sec: 5685.8, 60 sec: 5757.2, 300 sec: 5760.5). Total num frames: 218903552. Throughput: 0: 5072.8. Samples: 218896688. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:10,868][25689] Avg episode reward: [(0, '-48.812')] [2022-07-09 10:44:12,128][26022] Updated weights on worker 0-0, policy_version 213779 (0.00079) [2022-07-09 10:44:13,566][26022] Updated weights on worker 0-0, policy_version 213789 (0.00086) [2022-07-09 10:44:15,531][26022] Updated weights on worker 0-0, policy_version 213799 (0.00092) [2022-07-09 10:44:15,873][25689] Fps is (10 sec: 5927.8, 60 sec: 5741.6, 300 sec: 5767.5). Total num frames: 218932224. Throughput: 0: 5985.0. Samples: 218932020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:15,874][25689] Avg episode reward: [(0, '-46.952')] [2022-07-09 10:44:17,017][26022] Updated weights on worker 0-0, policy_version 213809 (0.00078) [2022-07-09 10:44:19,004][26022] Updated weights on worker 0-0, policy_version 213819 (0.00090) [2022-07-09 10:44:20,768][26022] Updated weights on worker 0-0, policy_version 213829 (0.00095) [2022-07-09 10:44:20,896][25689] Fps is (10 sec: 5718.4, 60 sec: 5740.0, 300 sec: 5763.0). Total num frames: 218960896. Throughput: 0: 6021.9. Samples: 218967412. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:20,897][25689] Avg episode reward: [(0, '-47.578')] [2022-07-09 10:44:22,495][26022] Updated weights on worker 0-0, policy_version 213839 (0.00082) [2022-07-09 10:44:24,031][26022] Updated weights on worker 0-0, policy_version 213849 (0.00087) [2022-07-09 10:44:25,966][25689] Fps is (10 sec: 5884.2, 60 sec: 5760.3, 300 sec: 5768.7). Total num frames: 218991616. Throughput: 0: 5255.3. Samples: 218985032. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:25,967][25689] Avg episode reward: [(0, '-47.356')] [2022-07-09 10:44:25,975][26022] Updated weights on worker 0-0, policy_version 213859 (0.00643) [2022-07-09 10:44:27,580][26022] Updated weights on worker 0-0, policy_version 213869 (0.00088) [2022-07-09 10:44:29,419][26022] Updated weights on worker 0-0, policy_version 213879 (0.00089) [2022-07-09 10:44:31,038][25689] Fps is (10 sec: 5855.6, 60 sec: 5772.2, 300 sec: 5764.2). Total num frames: 219020288. Throughput: 0: 6111.0. Samples: 219019952. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:31,039][25689] Avg episode reward: [(0, '-47.495')] [2022-07-09 10:44:31,191][26022] Updated weights on worker 0-0, policy_version 213889 (0.00091) [2022-07-09 10:44:32,963][26022] Updated weights on worker 0-0, policy_version 213899 (0.00086) [2022-07-09 10:44:34,788][26022] Updated weights on worker 0-0, policy_version 213909 (0.00087) [2022-07-09 10:44:36,052][25689] Fps is (10 sec: 5787.2, 60 sec: 5773.8, 300 sec: 5768.1). Total num frames: 219049984. Throughput: 0: 6078.6. Samples: 219054682. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:36,052][25689] Avg episode reward: [(0, '-48.438')] [2022-07-09 10:44:36,525][26022] Updated weights on worker 0-0, policy_version 213919 (0.00082) [2022-07-09 10:44:38,174][26022] Updated weights on worker 0-0, policy_version 213929 (0.00090) [2022-07-09 10:44:40,092][26022] Updated weights on worker 0-0, policy_version 213939 (0.00090) [2022-07-09 10:44:41,142][25689] Fps is (10 sec: 5878.2, 60 sec: 5800.9, 300 sec: 5771.5). Total num frames: 219079680. Throughput: 0: 5184.3. Samples: 219072376. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:41,143][25689] Avg episode reward: [(0, '-48.254')] [2022-07-09 10:44:41,843][26022] Updated weights on worker 0-0, policy_version 213949 (0.00092) [2022-07-09 10:44:43,493][26022] Updated weights on worker 0-0, policy_version 213959 (0.00090) [2022-07-09 10:44:45,195][26022] Updated weights on worker 0-0, policy_version 213969 (0.00091) [2022-07-09 10:44:46,198][25689] Fps is (10 sec: 5853.4, 60 sec: 5789.6, 300 sec: 5768.5). Total num frames: 219109376. Throughput: 0: 6043.8. Samples: 219107312. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 10:44:46,199][25689] Avg episode reward: [(0, '-48.704')] [2022-07-09 10:44:47,090][26022] Updated weights on worker 0-0, policy_version 213979 (0.00059) [2022-07-09 10:44:48,744][26022] Updated weights on worker 0-0, policy_version 213989 (0.00087) [2022-07-09 10:44:48,884][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:44:48,901][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000213990_219125760.pth [2022-07-09 10:44:48,901][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000211961_217048064.pth [2022-07-09 10:44:50,741][26022] Updated weights on worker 0-0, policy_version 213999 (0.00091) [2022-07-09 10:44:51,205][25689] Fps is (10 sec: 5698.2, 60 sec: 5755.5, 300 sec: 5766.9). Total num frames: 219137024. Throughput: 0: 6064.5. Samples: 219142256. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:44:51,206][25689] Avg episode reward: [(0, '-49.219')] [2022-07-09 10:44:52,163][26022] Updated weights on worker 0-0, policy_version 214009 (0.00084) [2022-07-09 10:44:54,248][26022] Updated weights on worker 0-0, policy_version 214019 (0.00087) [2022-07-09 10:44:55,826][26022] Updated weights on worker 0-0, policy_version 214029 (0.00090) [2022-07-09 10:44:56,235][25689] Fps is (10 sec: 5815.7, 60 sec: 5806.0, 300 sec: 5766.9). Total num frames: 219167744. Throughput: 0: 5206.0. Samples: 219159754. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:44:56,235][25689] Avg episode reward: [(0, '-49.771')] [2022-07-09 10:44:57,708][26022] Updated weights on worker 0-0, policy_version 214039 (0.00087) [2022-07-09 10:44:59,495][26022] Updated weights on worker 0-0, policy_version 214049 (0.00092) [2022-07-09 10:45:01,250][25689] Fps is (10 sec: 5913.1, 60 sec: 5791.7, 300 sec: 5777.9). Total num frames: 219196416. Throughput: 0: 6078.7. Samples: 219194604. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:01,250][25689] Avg episode reward: [(0, '-49.282')] [2022-07-09 10:45:01,255][26022] Updated weights on worker 0-0, policy_version 214059 (0.00251) [2022-07-09 10:45:03,268][26022] Updated weights on worker 0-0, policy_version 214069 (0.00087) [2022-07-09 10:45:05,367][26022] Updated weights on worker 0-0, policy_version 214079 (0.00083) [2022-07-09 10:45:06,292][25689] Fps is (10 sec: 5497.8, 60 sec: 5794.4, 300 sec: 5770.5). Total num frames: 219223040. Throughput: 0: 5978.5. Samples: 219227444. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:06,293][25689] Avg episode reward: [(0, '-50.388')] [2022-07-09 10:45:06,724][26022] Updated weights on worker 0-0, policy_version 214089 (0.00089) [2022-07-09 10:45:08,772][26022] Updated weights on worker 0-0, policy_version 214099 (0.00086) [2022-07-09 10:45:10,231][26022] Updated weights on worker 0-0, policy_version 214109 (0.00091) [2022-07-09 10:45:11,297][25689] Fps is (10 sec: 5401.6, 60 sec: 5744.5, 300 sec: 5756.9). Total num frames: 219250688. Throughput: 0: 5107.4. Samples: 219244872. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:11,298][25689] Avg episode reward: [(0, '-50.482')] [2022-07-09 10:45:12,167][26022] Updated weights on worker 0-0, policy_version 214119 (0.00088) [2022-07-09 10:45:14,016][26022] Updated weights on worker 0-0, policy_version 214129 (0.00086) [2022-07-09 10:45:15,642][26022] Updated weights on worker 0-0, policy_version 214139 (0.00091) [2022-07-09 10:45:16,323][25689] Fps is (10 sec: 5819.2, 60 sec: 5776.4, 300 sec: 5770.6). Total num frames: 219281408. Throughput: 0: 5980.3. Samples: 219279886. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:16,323][25689] Avg episode reward: [(0, '-50.049')] [2022-07-09 10:45:17,583][26022] Updated weights on worker 0-0, policy_version 214149 (0.00082) [2022-07-09 10:45:19,285][26022] Updated weights on worker 0-0, policy_version 214159 (0.00094) [2022-07-09 10:45:20,928][26022] Updated weights on worker 0-0, policy_version 214169 (0.00090) [2022-07-09 10:45:21,343][25689] Fps is (10 sec: 5912.3, 60 sec: 5776.7, 300 sec: 5768.8). Total num frames: 219310080. Throughput: 0: 5965.6. Samples: 219314470. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:21,344][25689] Avg episode reward: [(0, '-49.477')] [2022-07-09 10:45:22,815][26022] Updated weights on worker 0-0, policy_version 214179 (0.00090) [2022-07-09 10:45:24,528][26022] Updated weights on worker 0-0, policy_version 214189 (0.00085) [2022-07-09 10:45:26,361][26022] Updated weights on worker 0-0, policy_version 214199 (0.00088) [2022-07-09 10:45:26,381][25689] Fps is (10 sec: 5802.7, 60 sec: 5762.8, 300 sec: 5772.0). Total num frames: 219339776. Throughput: 0: 6054.6. Samples: 219349074. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:26,382][25689] Avg episode reward: [(0, '-49.591')] [2022-07-09 10:45:28,169][26022] Updated weights on worker 0-0, policy_version 214209 (0.00090) [2022-07-09 10:45:29,795][26022] Updated weights on worker 0-0, policy_version 214219 (0.00093) [2022-07-09 10:45:31,412][25689] Fps is (10 sec: 5695.1, 60 sec: 5749.8, 300 sec: 5761.7). Total num frames: 219367424. Throughput: 0: 6035.6. Samples: 219366276. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:31,412][25689] Avg episode reward: [(0, '-49.458')] [2022-07-09 10:45:31,779][26022] Updated weights on worker 0-0, policy_version 214229 (0.00100) [2022-07-09 10:45:33,356][26022] Updated weights on worker 0-0, policy_version 214239 (0.00091) [2022-07-09 10:45:35,146][26022] Updated weights on worker 0-0, policy_version 214249 (0.00085) [2022-07-09 10:45:36,423][25689] Fps is (10 sec: 5812.9, 60 sec: 5767.0, 300 sec: 5772.1). Total num frames: 219398144. Throughput: 0: 6023.9. Samples: 219400966. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:36,423][25689] Avg episode reward: [(0, '-47.413')] [2022-07-09 10:45:36,991][26022] Updated weights on worker 0-0, policy_version 214259 (0.00085) [2022-07-09 10:45:38,675][26022] Updated weights on worker 0-0, policy_version 214269 (0.00081) [2022-07-09 10:45:40,674][26022] Updated weights on worker 0-0, policy_version 214279 (0.00091) [2022-07-09 10:45:41,450][25689] Fps is (10 sec: 6018.8, 60 sec: 5773.1, 300 sec: 5773.4). Total num frames: 219427840. Throughput: 0: 6048.9. Samples: 219436094. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:41,450][25689] Avg episode reward: [(0, '-49.035')] [2022-07-09 10:45:42,281][26022] Updated weights on worker 0-0, policy_version 214289 (0.00085) [2022-07-09 10:45:44,079][26022] Updated weights on worker 0-0, policy_version 214299 (0.00087) [2022-07-09 10:45:46,047][26022] Updated weights on worker 0-0, policy_version 214309 (0.00084) [2022-07-09 10:45:46,538][25689] Fps is (10 sec: 5668.7, 60 sec: 5736.0, 300 sec: 5765.3). Total num frames: 219455488. Throughput: 0: 5172.3. Samples: 219453332. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:46,539][25689] Avg episode reward: [(0, '-48.970')] [2022-07-09 10:45:47,618][26022] Updated weights on worker 0-0, policy_version 214319 (0.00098) [2022-07-09 10:45:49,520][26022] Updated weights on worker 0-0, policy_version 214329 (0.00088) [2022-07-09 10:45:51,128][26022] Updated weights on worker 0-0, policy_version 214339 (0.00093) [2022-07-09 10:45:51,552][25689] Fps is (10 sec: 5575.1, 60 sec: 5752.4, 300 sec: 5765.5). Total num frames: 219484160. Throughput: 0: 6043.2. Samples: 219487986. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:51,552][25689] Avg episode reward: [(0, '-48.741')] [2022-07-09 10:45:52,943][26022] Updated weights on worker 0-0, policy_version 214349 (0.00089) [2022-07-09 10:45:54,801][26022] Updated weights on worker 0-0, policy_version 214359 (0.00086) [2022-07-09 10:45:56,342][26022] Updated weights on worker 0-0, policy_version 214369 (0.00097) [2022-07-09 10:45:56,565][25689] Fps is (10 sec: 5923.5, 60 sec: 5753.9, 300 sec: 5768.8). Total num frames: 219514880. Throughput: 0: 6046.9. Samples: 219522764. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:45:56,565][25689] Avg episode reward: [(0, '-50.438')] [2022-07-09 10:45:58,341][26022] Updated weights on worker 0-0, policy_version 214379 (0.00097) [2022-07-09 10:45:59,863][26022] Updated weights on worker 0-0, policy_version 214389 (0.00091) [2022-07-09 10:46:01,601][25689] Fps is (10 sec: 5910.4, 60 sec: 5752.0, 300 sec: 5773.8). Total num frames: 219543552. Throughput: 0: 5171.9. Samples: 219540312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:01,601][25689] Avg episode reward: [(0, '-51.001')] [2022-07-09 10:46:01,951][26022] Updated weights on worker 0-0, policy_version 214399 (0.00507) [2022-07-09 10:46:04,030][26022] Updated weights on worker 0-0, policy_version 214409 (0.00097) [2022-07-09 10:46:05,591][26022] Updated weights on worker 0-0, policy_version 214419 (0.00085) [2022-07-09 10:46:06,686][25689] Fps is (10 sec: 5362.4, 60 sec: 5731.0, 300 sec: 5762.1). Total num frames: 219569152. Throughput: 0: 5919.1. Samples: 219572586. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:06,686][25689] Avg episode reward: [(0, '-51.137')] [2022-07-09 10:46:07,427][26022] Updated weights on worker 0-0, policy_version 214429 (0.00081) [2022-07-09 10:46:09,346][26022] Updated weights on worker 0-0, policy_version 214439 (0.00080) [2022-07-09 10:46:10,853][26022] Updated weights on worker 0-0, policy_version 214449 (0.00087) [2022-07-09 10:46:11,725][25689] Fps is (10 sec: 5461.4, 60 sec: 5761.5, 300 sec: 5759.5). Total num frames: 219598848. Throughput: 0: 5929.6. Samples: 219607608. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:11,726][25689] Avg episode reward: [(0, '-50.175')] [2022-07-09 10:46:12,856][26022] Updated weights on worker 0-0, policy_version 214459 (0.00092) [2022-07-09 10:46:14,556][26022] Updated weights on worker 0-0, policy_version 214469 (0.00090) [2022-07-09 10:46:16,207][26022] Updated weights on worker 0-0, policy_version 214479 (0.00083) [2022-07-09 10:46:16,729][25689] Fps is (10 sec: 6015.5, 60 sec: 5763.6, 300 sec: 5769.9). Total num frames: 219629568. Throughput: 0: 5075.2. Samples: 219625102. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:16,730][25689] Avg episode reward: [(0, '-50.561')] [2022-07-09 10:46:18,158][26022] Updated weights on worker 0-0, policy_version 214489 (0.00083) [2022-07-09 10:46:19,703][26022] Updated weights on worker 0-0, policy_version 214499 (0.00086) [2022-07-09 10:46:21,446][26022] Updated weights on worker 0-0, policy_version 214509 (0.00089) [2022-07-09 10:46:21,748][25689] Fps is (10 sec: 5925.8, 60 sec: 5763.7, 300 sec: 5767.4). Total num frames: 219658240. Throughput: 0: 5946.6. Samples: 219660122. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:21,749][25689] Avg episode reward: [(0, '-50.728')] [2022-07-09 10:46:23,235][26022] Updated weights on worker 0-0, policy_version 214519 (0.00086) [2022-07-09 10:46:25,061][26022] Updated weights on worker 0-0, policy_version 214529 (0.00054) [2022-07-09 10:46:26,792][25689] Fps is (10 sec: 5698.4, 60 sec: 5746.2, 300 sec: 5768.0). Total num frames: 219686912. Throughput: 0: 6095.3. Samples: 219695142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:26,794][25689] Avg episode reward: [(0, '-48.375')] [2022-07-09 10:46:26,887][26022] Updated weights on worker 0-0, policy_version 214539 (0.00097) [2022-07-09 10:46:28,580][26022] Updated weights on worker 0-0, policy_version 214549 (0.00087) [2022-07-09 10:46:30,461][26022] Updated weights on worker 0-0, policy_version 214559 (0.00628) [2022-07-09 10:46:31,810][25689] Fps is (10 sec: 5699.0, 60 sec: 5764.4, 300 sec: 5758.5). Total num frames: 219715584. Throughput: 0: 5215.9. Samples: 219712368. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:31,812][25689] Avg episode reward: [(0, '-47.823')] [2022-07-09 10:46:32,254][26022] Updated weights on worker 0-0, policy_version 214569 (0.00090) [2022-07-09 10:46:33,877][26022] Updated weights on worker 0-0, policy_version 214579 (0.00091) [2022-07-09 10:46:35,841][26022] Updated weights on worker 0-0, policy_version 214589 (0.00094) [2022-07-09 10:46:36,847][25689] Fps is (10 sec: 5805.3, 60 sec: 5745.0, 300 sec: 5768.6). Total num frames: 219745280. Throughput: 0: 6065.0. Samples: 219747114. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:36,847][25689] Avg episode reward: [(0, '-47.357')] [2022-07-09 10:46:37,472][26022] Updated weights on worker 0-0, policy_version 214599 (0.00108) [2022-07-09 10:46:39,133][26022] Updated weights on worker 0-0, policy_version 214609 (0.00101) [2022-07-09 10:46:41,303][26022] Updated weights on worker 0-0, policy_version 214619 (0.00085) [2022-07-09 10:46:41,850][25689] Fps is (10 sec: 5915.6, 60 sec: 5747.3, 300 sec: 5770.0). Total num frames: 219774976. Throughput: 0: 6058.0. Samples: 219781900. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:41,852][25689] Avg episode reward: [(0, '-47.160')] [2022-07-09 10:46:42,616][26022] Updated weights on worker 0-0, policy_version 214629 (0.00083) [2022-07-09 10:46:44,547][26022] Updated weights on worker 0-0, policy_version 214639 (0.00091) [2022-07-09 10:46:46,489][26022] Updated weights on worker 0-0, policy_version 214649 (0.00083) [2022-07-09 10:46:46,895][25689] Fps is (10 sec: 5808.9, 60 sec: 5768.4, 300 sec: 5764.2). Total num frames: 219803648. Throughput: 0: 5186.3. Samples: 219799398. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:46,895][25689] Avg episode reward: [(0, '-45.653')] [2022-07-09 10:46:48,070][26022] Updated weights on worker 0-0, policy_version 214659 (0.00080) [2022-07-09 10:46:49,181][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:46:49,213][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000214665_219816960.pth [2022-07-09 10:46:49,213][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000212635_217738240.pth [2022-07-09 10:46:49,908][26022] Updated weights on worker 0-0, policy_version 214669 (0.00575) [2022-07-09 10:46:51,422][26022] Updated weights on worker 0-0, policy_version 214679 (0.00083) [2022-07-09 10:46:51,918][25689] Fps is (10 sec: 5695.6, 60 sec: 5767.5, 300 sec: 5761.9). Total num frames: 219832320. Throughput: 0: 6079.9. Samples: 219834622. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:51,919][25689] Avg episode reward: [(0, '-45.534')] [2022-07-09 10:46:53,121][26022] Updated weights on worker 0-0, policy_version 214689 (0.00081) [2022-07-09 10:46:55,154][26022] Updated weights on worker 0-0, policy_version 214699 (0.00088) [2022-07-09 10:46:56,844][26022] Updated weights on worker 0-0, policy_version 214710 (0.00087) [2022-07-09 10:46:56,954][25689] Fps is (10 sec: 5904.0, 60 sec: 5765.3, 300 sec: 5764.8). Total num frames: 219863040. Throughput: 0: 6097.2. Samples: 219869714. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:46:56,955][25689] Avg episode reward: [(0, '-46.683')] [2022-07-09 10:46:58,738][26022] Updated weights on worker 0-0, policy_version 214720 (0.00086) [2022-07-09 10:47:00,708][26022] Updated weights on worker 0-0, policy_version 214730 (0.00087) [2022-07-09 10:47:01,971][25689] Fps is (10 sec: 5602.4, 60 sec: 5716.2, 300 sec: 5765.7). Total num frames: 219888640. Throughput: 0: 5225.2. Samples: 219887036. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 10:47:01,971][25689] Avg episode reward: [(0, '-46.922')] [2022-07-09 10:47:02,450][26022] Updated weights on worker 0-0, policy_version 214740 (0.00088) [2022-07-09 10:47:04,531][26022] Updated weights on worker 0-0, policy_version 214750 (0.00087) [2022-07-09 10:47:06,118][26022] Updated weights on worker 0-0, policy_version 214760 (0.00087) [2022-07-09 10:47:07,047][25689] Fps is (10 sec: 5377.4, 60 sec: 5767.9, 300 sec: 5761.8). Total num frames: 219917312. Throughput: 0: 5954.5. Samples: 219919394. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:07,047][25689] Avg episode reward: [(0, '-47.909')] [2022-07-09 10:47:08,040][26022] Updated weights on worker 0-0, policy_version 214770 (0.00094) [2022-07-09 10:47:09,923][26022] Updated weights on worker 0-0, policy_version 214780 (0.00090) [2022-07-09 10:47:11,373][26022] Updated weights on worker 0-0, policy_version 214790 (0.00083) [2022-07-09 10:47:12,082][25689] Fps is (10 sec: 5772.7, 60 sec: 5768.4, 300 sec: 5761.2). Total num frames: 219947008. Throughput: 0: 5916.0. Samples: 219953912. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:12,082][25689] Avg episode reward: [(0, '-48.388')] [2022-07-09 10:47:13,405][26022] Updated weights on worker 0-0, policy_version 214800 (0.00087) [2022-07-09 10:47:15,019][26022] Updated weights on worker 0-0, policy_version 214810 (0.00084) [2022-07-09 10:47:16,874][26022] Updated weights on worker 0-0, policy_version 214820 (0.00080) [2022-07-09 10:47:17,106][25689] Fps is (10 sec: 5904.5, 60 sec: 5749.5, 300 sec: 5760.9). Total num frames: 219976704. Throughput: 0: 5030.5. Samples: 219971086. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:17,106][25689] Avg episode reward: [(0, '-49.333')] [2022-07-09 10:47:18,719][26022] Updated weights on worker 0-0, policy_version 214830 (0.00446) [2022-07-09 10:47:20,226][26022] Updated weights on worker 0-0, policy_version 214840 (0.00082) [2022-07-09 10:47:22,168][25689] Fps is (10 sec: 5787.1, 60 sec: 5745.4, 300 sec: 5761.0). Total num frames: 220005376. Throughput: 0: 5924.9. Samples: 220006700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:22,168][25689] Avg episode reward: [(0, '-50.035')] [2022-07-09 10:47:22,248][26022] Updated weights on worker 0-0, policy_version 214850 (0.00340) [2022-07-09 10:47:23,640][26022] Updated weights on worker 0-0, policy_version 214860 (0.00088) [2022-07-09 10:47:25,602][26022] Updated weights on worker 0-0, policy_version 214870 (0.00087) [2022-07-09 10:47:27,221][25689] Fps is (10 sec: 5871.7, 60 sec: 5778.5, 300 sec: 5767.1). Total num frames: 220036096. Throughput: 0: 6071.0. Samples: 220041868. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:27,221][25689] Avg episode reward: [(0, '-50.392')] [2022-07-09 10:47:27,489][26022] Updated weights on worker 0-0, policy_version 214880 (0.00083) [2022-07-09 10:47:29,170][26022] Updated weights on worker 0-0, policy_version 214890 (0.00088) [2022-07-09 10:47:30,983][26022] Updated weights on worker 0-0, policy_version 214900 (0.00085) [2022-07-09 10:47:32,287][25689] Fps is (10 sec: 5869.1, 60 sec: 5773.8, 300 sec: 5759.0). Total num frames: 220064768. Throughput: 0: 5209.5. Samples: 220059172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:32,288][25689] Avg episode reward: [(0, '-50.448')] [2022-07-09 10:47:32,687][26022] Updated weights on worker 0-0, policy_version 214910 (0.00085) [2022-07-09 10:47:34,379][26022] Updated weights on worker 0-0, policy_version 214920 (0.00090) [2022-07-09 10:47:36,181][26022] Updated weights on worker 0-0, policy_version 214930 (0.00061) [2022-07-09 10:47:37,349][25689] Fps is (10 sec: 5762.9, 60 sec: 5771.4, 300 sec: 5768.8). Total num frames: 220094464. Throughput: 0: 6070.7. Samples: 220093978. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:37,351][25689] Avg episode reward: [(0, '-50.230')] [2022-07-09 10:47:37,983][26022] Updated weights on worker 0-0, policy_version 214940 (0.00082) [2022-07-09 10:47:39,575][26022] Updated weights on worker 0-0, policy_version 214950 (0.00084) [2022-07-09 10:47:41,448][26022] Updated weights on worker 0-0, policy_version 214960 (0.00086) [2022-07-09 10:47:42,355][25689] Fps is (10 sec: 5797.3, 60 sec: 5754.2, 300 sec: 5759.9). Total num frames: 220123136. Throughput: 0: 6060.9. Samples: 220129054. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:42,357][25689] Avg episode reward: [(0, '-49.542')] [2022-07-09 10:47:43,161][26022] Updated weights on worker 0-0, policy_version 214970 (0.00083) [2022-07-09 10:47:44,874][26022] Updated weights on worker 0-0, policy_version 214980 (0.00084) [2022-07-09 10:47:46,901][26022] Updated weights on worker 0-0, policy_version 214990 (0.00085) [2022-07-09 10:47:47,459][25689] Fps is (10 sec: 5874.7, 60 sec: 5782.4, 300 sec: 5762.0). Total num frames: 220153856. Throughput: 0: 5182.3. Samples: 220146748. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:47,459][25689] Avg episode reward: [(0, '-49.982')] [2022-07-09 10:47:48,483][26022] Updated weights on worker 0-0, policy_version 215000 (0.00084) [2022-07-09 10:47:50,202][26022] Updated weights on worker 0-0, policy_version 215010 (0.00088) [2022-07-09 10:47:51,955][26022] Updated weights on worker 0-0, policy_version 215020 (0.00079) [2022-07-09 10:47:52,514][25689] Fps is (10 sec: 5846.6, 60 sec: 5779.4, 300 sec: 5764.9). Total num frames: 220182528. Throughput: 0: 6072.9. Samples: 220182006. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:52,514][25689] Avg episode reward: [(0, '-49.194')] [2022-07-09 10:47:53,546][26022] Updated weights on worker 0-0, policy_version 215030 (0.00087) [2022-07-09 10:47:55,347][26022] Updated weights on worker 0-0, policy_version 215040 (0.00086) [2022-07-09 10:47:57,298][26022] Updated weights on worker 0-0, policy_version 215050 (0.00083) [2022-07-09 10:47:57,537][25689] Fps is (10 sec: 5791.6, 60 sec: 5763.8, 300 sec: 5766.0). Total num frames: 220212224. Throughput: 0: 6109.7. Samples: 220217320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:47:57,538][25689] Avg episode reward: [(0, '-48.776')] [2022-07-09 10:47:58,933][26022] Updated weights on worker 0-0, policy_version 215060 (0.00092) [2022-07-09 10:48:01,008][26022] Updated weights on worker 0-0, policy_version 215070 (0.00085) [2022-07-09 10:48:02,545][25689] Fps is (10 sec: 5818.6, 60 sec: 5815.3, 300 sec: 5775.1). Total num frames: 220240896. Throughput: 0: 5229.3. Samples: 220234632. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:02,547][25689] Avg episode reward: [(0, '-48.652')] [2022-07-09 10:48:02,671][26022] Updated weights on worker 0-0, policy_version 215080 (0.00074) [2022-07-09 10:48:04,687][26022] Updated weights on worker 0-0, policy_version 215090 (0.00085) [2022-07-09 10:48:06,299][26022] Updated weights on worker 0-0, policy_version 215100 (0.00085) [2022-07-09 10:48:07,656][25689] Fps is (10 sec: 5666.8, 60 sec: 5811.9, 300 sec: 5766.5). Total num frames: 220269568. Throughput: 0: 5983.0. Samples: 220267590. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:07,657][25689] Avg episode reward: [(0, '-49.418')] [2022-07-09 10:48:08,087][26022] Updated weights on worker 0-0, policy_version 215110 (0.00087) [2022-07-09 10:48:09,876][26022] Updated weights on worker 0-0, policy_version 215120 (0.00086) [2022-07-09 10:48:11,620][26022] Updated weights on worker 0-0, policy_version 215130 (0.00082) [2022-07-09 10:48:12,671][25689] Fps is (10 sec: 5663.1, 60 sec: 5797.0, 300 sec: 5763.2). Total num frames: 220298240. Throughput: 0: 5979.2. Samples: 220302530. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:12,672][25689] Avg episode reward: [(0, '-49.505')] [2022-07-09 10:48:13,336][26022] Updated weights on worker 0-0, policy_version 215140 (0.00082) [2022-07-09 10:48:15,187][26022] Updated weights on worker 0-0, policy_version 215150 (0.00085) [2022-07-09 10:48:17,076][26022] Updated weights on worker 0-0, policy_version 215160 (0.00084) [2022-07-09 10:48:17,674][25689] Fps is (10 sec: 5724.5, 60 sec: 5782.1, 300 sec: 5763.2). Total num frames: 220326912. Throughput: 0: 5096.1. Samples: 220319938. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:17,674][25689] Avg episode reward: [(0, '-48.976')] [2022-07-09 10:48:18,672][26022] Updated weights on worker 0-0, policy_version 215170 (0.00083) [2022-07-09 10:48:20,624][26022] Updated weights on worker 0-0, policy_version 215180 (0.00084) [2022-07-09 10:48:21,958][26022] Updated weights on worker 0-0, policy_version 215190 (0.00081) [2022-07-09 10:48:22,773][25689] Fps is (10 sec: 5879.2, 60 sec: 5812.3, 300 sec: 5766.8). Total num frames: 220357632. Throughput: 0: 5951.5. Samples: 220355020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:22,774][25689] Avg episode reward: [(0, '-48.117')] [2022-07-09 10:48:24,114][26022] Updated weights on worker 0-0, policy_version 215200 (0.00083) [2022-07-09 10:48:25,665][26022] Updated weights on worker 0-0, policy_version 215210 (0.00087) [2022-07-09 10:48:27,723][26022] Updated weights on worker 0-0, policy_version 215220 (0.00086) [2022-07-09 10:48:27,879][25689] Fps is (10 sec: 5819.7, 60 sec: 5773.5, 300 sec: 5768.6). Total num frames: 220386304. Throughput: 0: 6032.2. Samples: 220389576. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:27,879][25689] Avg episode reward: [(0, '-48.882')] [2022-07-09 10:48:29,407][26022] Updated weights on worker 0-0, policy_version 215230 (0.00085) [2022-07-09 10:48:31,203][26022] Updated weights on worker 0-0, policy_version 215240 (0.00085) [2022-07-09 10:48:32,771][26022] Updated weights on worker 0-0, policy_version 215250 (0.00088) [2022-07-09 10:48:32,919][25689] Fps is (10 sec: 5752.9, 60 sec: 5792.9, 300 sec: 5768.4). Total num frames: 220416000. Throughput: 0: 6019.9. Samples: 220424420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:32,919][25689] Avg episode reward: [(0, '-48.858')] [2022-07-09 10:48:34,678][26022] Updated weights on worker 0-0, policy_version 215260 (0.00104) [2022-07-09 10:48:36,275][26022] Updated weights on worker 0-0, policy_version 215270 (0.00081) [2022-07-09 10:48:37,932][25689] Fps is (10 sec: 5806.0, 60 sec: 5780.6, 300 sec: 5772.0). Total num frames: 220444672. Throughput: 0: 6017.9. Samples: 220441852. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:37,932][25689] Avg episode reward: [(0, '-49.251')] [2022-07-09 10:48:38,015][26022] Updated weights on worker 0-0, policy_version 215280 (0.00082) [2022-07-09 10:48:39,723][26022] Updated weights on worker 0-0, policy_version 215290 (0.00100) [2022-07-09 10:48:41,507][26022] Updated weights on worker 0-0, policy_version 215300 (0.00084) [2022-07-09 10:48:42,942][25689] Fps is (10 sec: 5823.1, 60 sec: 5797.1, 300 sec: 5770.6). Total num frames: 220474368. Throughput: 0: 6072.9. Samples: 220477508. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:42,943][25689] Avg episode reward: [(0, '-48.788')] [2022-07-09 10:48:43,378][26022] Updated weights on worker 0-0, policy_version 215310 (0.00082) [2022-07-09 10:48:44,975][26022] Updated weights on worker 0-0, policy_version 215320 (0.00086) [2022-07-09 10:48:46,963][26022] Updated weights on worker 0-0, policy_version 215330 (0.00086) [2022-07-09 10:48:48,074][25689] Fps is (10 sec: 5957.1, 60 sec: 5794.5, 300 sec: 5771.5). Total num frames: 220505088. Throughput: 0: 6078.7. Samples: 220512336. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:48,074][25689] Avg episode reward: [(0, '-48.065')] [2022-07-09 10:48:48,759][26022] Updated weights on worker 0-0, policy_version 215340 (0.00081) [2022-07-09 10:48:49,340][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:48:49,348][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000215344_220512256.pth [2022-07-09 10:48:49,349][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000213312_218431488.pth [2022-07-09 10:48:50,451][26022] Updated weights on worker 0-0, policy_version 215350 (0.00083) [2022-07-09 10:48:52,144][26022] Updated weights on worker 0-0, policy_version 215360 (0.00087) [2022-07-09 10:48:53,103][25689] Fps is (10 sec: 5845.5, 60 sec: 5797.0, 300 sec: 5774.9). Total num frames: 220533760. Throughput: 0: 5218.1. Samples: 220529742. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:53,105][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 10:48:53,951][26022] Updated weights on worker 0-0, policy_version 215370 (0.00096) [2022-07-09 10:48:55,791][26022] Updated weights on worker 0-0, policy_version 215380 (0.00087) [2022-07-09 10:48:57,618][26022] Updated weights on worker 0-0, policy_version 215390 (0.00080) [2022-07-09 10:48:58,127][25689] Fps is (10 sec: 5806.1, 60 sec: 5796.9, 300 sec: 5775.3). Total num frames: 220563456. Throughput: 0: 6089.5. Samples: 220564828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:48:58,128][25689] Avg episode reward: [(0, '-47.670')] [2022-07-09 10:48:59,161][26022] Updated weights on worker 0-0, policy_version 215400 (0.00085) [2022-07-09 10:49:00,970][26022] Updated weights on worker 0-0, policy_version 215410 (0.00092) [2022-07-09 10:49:03,130][25689] Fps is (10 sec: 5514.4, 60 sec: 5746.6, 300 sec: 5773.2). Total num frames: 220589056. Throughput: 0: 5946.2. Samples: 220597548. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:49:03,131][25689] Avg episode reward: [(0, '-47.643')] [2022-07-09 10:49:03,184][26022] Updated weights on worker 0-0, policy_version 215420 (0.00090) [2022-07-09 10:49:04,822][26022] Updated weights on worker 0-0, policy_version 215430 (0.00359) [2022-07-09 10:49:06,668][26022] Updated weights on worker 0-0, policy_version 215440 (0.00350) [2022-07-09 10:49:08,241][25689] Fps is (10 sec: 5568.3, 60 sec: 5780.5, 300 sec: 5771.3). Total num frames: 220619776. Throughput: 0: 5082.7. Samples: 220614840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:49:08,242][25689] Avg episode reward: [(0, '-47.498')] [2022-07-09 10:49:08,351][26022] Updated weights on worker 0-0, policy_version 215450 (0.00092) [2022-07-09 10:49:10,222][26022] Updated weights on worker 0-0, policy_version 215460 (0.00089) [2022-07-09 10:49:11,931][26022] Updated weights on worker 0-0, policy_version 215470 (0.00085) [2022-07-09 10:49:13,337][25689] Fps is (10 sec: 5818.9, 60 sec: 5772.7, 300 sec: 5769.5). Total num frames: 220648448. Throughput: 0: 5931.1. Samples: 220649754. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:49:13,337][25689] Avg episode reward: [(0, '-48.614')] [2022-07-09 10:49:13,844][26022] Updated weights on worker 0-0, policy_version 215480 (0.00086) [2022-07-09 10:49:15,368][26022] Updated weights on worker 0-0, policy_version 215490 (0.00091) [2022-07-09 10:49:17,357][26022] Updated weights on worker 0-0, policy_version 215500 (0.00080) [2022-07-09 10:49:18,349][25689] Fps is (10 sec: 5774.3, 60 sec: 5788.7, 300 sec: 5773.2). Total num frames: 220678144. Throughput: 0: 5925.3. Samples: 220684654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 10:49:18,350][25689] Avg episode reward: [(0, '-49.533')] [2022-07-09 10:49:18,865][26022] Updated weights on worker 0-0, policy_version 215510 (0.00090) [2022-07-09 10:49:20,988][26022] Updated weights on worker 0-0, policy_version 215520 (0.00082) [2022-07-09 10:49:22,602][26022] Updated weights on worker 0-0, policy_version 215531 (0.00046) [2022-07-09 10:49:23,379][25689] Fps is (10 sec: 5812.6, 60 sec: 5761.6, 300 sec: 5767.1). Total num frames: 220706816. Throughput: 0: 5161.1. Samples: 220702052. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:23,379][25689] Avg episode reward: [(0, '-49.827')] [2022-07-09 10:49:24,538][26022] Updated weights on worker 0-0, policy_version 215541 (0.00091) [2022-07-09 10:49:26,334][26022] Updated weights on worker 0-0, policy_version 215551 (0.00093) [2022-07-09 10:49:28,137][26022] Updated weights on worker 0-0, policy_version 215561 (0.00095) [2022-07-09 10:49:28,432][25689] Fps is (10 sec: 5789.1, 60 sec: 5783.5, 300 sec: 5770.9). Total num frames: 220736512. Throughput: 0: 6031.6. Samples: 220736622. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:28,432][25689] Avg episode reward: [(0, '-49.500')] [2022-07-09 10:49:29,693][26022] Updated weights on worker 0-0, policy_version 215571 (0.00057) [2022-07-09 10:49:31,596][26022] Updated weights on worker 0-0, policy_version 215581 (0.00092) [2022-07-09 10:49:33,331][26022] Updated weights on worker 0-0, policy_version 215591 (0.00051) [2022-07-09 10:49:33,459][25689] Fps is (10 sec: 5891.8, 60 sec: 5784.7, 300 sec: 5770.6). Total num frames: 220766208. Throughput: 0: 6045.1. Samples: 220771394. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:33,460][25689] Avg episode reward: [(0, '-49.642')] [2022-07-09 10:49:35,231][26022] Updated weights on worker 0-0, policy_version 215601 (0.00090) [2022-07-09 10:49:36,801][26022] Updated weights on worker 0-0, policy_version 215611 (0.00092) [2022-07-09 10:49:38,467][25689] Fps is (10 sec: 5714.5, 60 sec: 5768.3, 300 sec: 5765.3). Total num frames: 220793856. Throughput: 0: 5180.2. Samples: 220788866. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:38,468][25689] Avg episode reward: [(0, '-50.159')] [2022-07-09 10:49:38,787][26022] Updated weights on worker 0-0, policy_version 215621 (0.00087) [2022-07-09 10:49:40,269][26022] Updated weights on worker 0-0, policy_version 215631 (0.00096) [2022-07-09 10:49:42,160][26022] Updated weights on worker 0-0, policy_version 215641 (0.00085) [2022-07-09 10:49:43,480][25689] Fps is (10 sec: 5722.3, 60 sec: 5768.0, 300 sec: 5766.2). Total num frames: 220823552. Throughput: 0: 6067.2. Samples: 220824012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:43,482][25689] Avg episode reward: [(0, '-49.671')] [2022-07-09 10:49:43,922][26022] Updated weights on worker 0-0, policy_version 215651 (0.00209) [2022-07-09 10:49:45,674][26022] Updated weights on worker 0-0, policy_version 215661 (0.00092) [2022-07-09 10:49:47,247][26022] Updated weights on worker 0-0, policy_version 215671 (0.00092) [2022-07-09 10:49:48,532][25689] Fps is (10 sec: 5798.8, 60 sec: 5741.8, 300 sec: 5768.7). Total num frames: 220852224. Throughput: 0: 6085.6. Samples: 220858944. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:48,533][25689] Avg episode reward: [(0, '-49.696')] [2022-07-09 10:49:49,288][26022] Updated weights on worker 0-0, policy_version 215681 (0.00095) [2022-07-09 10:49:50,798][26022] Updated weights on worker 0-0, policy_version 215691 (0.00085) [2022-07-09 10:49:52,719][26022] Updated weights on worker 0-0, policy_version 215701 (0.00086) [2022-07-09 10:49:53,552][25689] Fps is (10 sec: 5897.1, 60 sec: 5776.5, 300 sec: 5768.9). Total num frames: 220882944. Throughput: 0: 5225.4. Samples: 220876386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:53,552][25689] Avg episode reward: [(0, '-49.866')] [2022-07-09 10:49:54,270][26022] Updated weights on worker 0-0, policy_version 215711 (0.00088) [2022-07-09 10:49:56,314][26022] Updated weights on worker 0-0, policy_version 215721 (0.00081) [2022-07-09 10:49:58,105][26022] Updated weights on worker 0-0, policy_version 215731 (0.00082) [2022-07-09 10:49:58,574][25689] Fps is (10 sec: 5914.3, 60 sec: 5759.7, 300 sec: 5768.8). Total num frames: 220911616. Throughput: 0: 6083.7. Samples: 220911194. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:49:58,575][25689] Avg episode reward: [(0, '-49.162')] [2022-07-09 10:49:59,833][26022] Updated weights on worker 0-0, policy_version 215741 (0.00090) [2022-07-09 10:50:01,564][26022] Updated weights on worker 0-0, policy_version 215751 (0.00095) [2022-07-09 10:50:03,594][25689] Fps is (10 sec: 5506.2, 60 sec: 5775.1, 300 sec: 5769.2). Total num frames: 220938240. Throughput: 0: 5944.8. Samples: 220943586. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:03,595][25689] Avg episode reward: [(0, '-49.401')] [2022-07-09 10:50:03,659][26022] Updated weights on worker 0-0, policy_version 215761 (0.00088) [2022-07-09 10:50:05,671][26022] Updated weights on worker 0-0, policy_version 215771 (0.00085) [2022-07-09 10:50:07,377][26022] Updated weights on worker 0-0, policy_version 215781 (0.00085) [2022-07-09 10:50:08,688][25689] Fps is (10 sec: 5467.5, 60 sec: 5742.9, 300 sec: 5770.9). Total num frames: 220966912. Throughput: 0: 5060.8. Samples: 220960948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:08,689][25689] Avg episode reward: [(0, '-48.784')] [2022-07-09 10:50:09,037][26022] Updated weights on worker 0-0, policy_version 215791 (0.00087) [2022-07-09 10:50:10,813][26022] Updated weights on worker 0-0, policy_version 215801 (0.00086) [2022-07-09 10:50:12,543][26022] Updated weights on worker 0-0, policy_version 215811 (0.00089) [2022-07-09 10:50:13,699][25689] Fps is (10 sec: 5877.8, 60 sec: 5784.9, 300 sec: 5771.2). Total num frames: 220997632. Throughput: 0: 5941.5. Samples: 220996090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:13,700][25689] Avg episode reward: [(0, '-49.208')] [2022-07-09 10:50:14,295][26022] Updated weights on worker 0-0, policy_version 215821 (0.00083) [2022-07-09 10:50:16,097][26022] Updated weights on worker 0-0, policy_version 215831 (0.00081) [2022-07-09 10:50:18,030][26022] Updated weights on worker 0-0, policy_version 215841 (0.00089) [2022-07-09 10:50:18,732][25689] Fps is (10 sec: 5811.0, 60 sec: 5748.9, 300 sec: 5767.5). Total num frames: 221025280. Throughput: 0: 5939.2. Samples: 221030916. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:18,734][25689] Avg episode reward: [(0, '-48.674')] [2022-07-09 10:50:19,495][26022] Updated weights on worker 0-0, policy_version 215851 (0.00087) [2022-07-09 10:50:21,638][26022] Updated weights on worker 0-0, policy_version 215861 (0.00090) [2022-07-09 10:50:23,096][26022] Updated weights on worker 0-0, policy_version 215871 (0.00085) [2022-07-09 10:50:23,762][25689] Fps is (10 sec: 5596.3, 60 sec: 5748.8, 300 sec: 5764.2). Total num frames: 221053952. Throughput: 0: 5185.9. Samples: 221048176. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:23,763][25689] Avg episode reward: [(0, '-49.263')] [2022-07-09 10:50:24,981][26022] Updated weights on worker 0-0, policy_version 215881 (0.00087) [2022-07-09 10:50:26,955][26022] Updated weights on worker 0-0, policy_version 215891 (0.00087) [2022-07-09 10:50:28,363][26022] Updated weights on worker 0-0, policy_version 215901 (0.00093) [2022-07-09 10:50:28,813][25689] Fps is (10 sec: 5891.4, 60 sec: 5766.0, 300 sec: 5774.2). Total num frames: 221084672. Throughput: 0: 6059.9. Samples: 221082906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:28,814][25689] Avg episode reward: [(0, '-49.371')] [2022-07-09 10:50:30,577][26022] Updated weights on worker 0-0, policy_version 215911 (0.00081) [2022-07-09 10:50:31,890][26022] Updated weights on worker 0-0, policy_version 215921 (0.00091) [2022-07-09 10:50:33,823][25689] Fps is (10 sec: 5801.8, 60 sec: 5733.8, 300 sec: 5763.9). Total num frames: 221112320. Throughput: 0: 6029.0. Samples: 221117418. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:33,823][25689] Avg episode reward: [(0, '-49.689')] [2022-07-09 10:50:33,900][26022] Updated weights on worker 0-0, policy_version 215931 (0.00087) [2022-07-09 10:50:35,617][26022] Updated weights on worker 0-0, policy_version 215941 (0.00094) [2022-07-09 10:50:37,161][26022] Updated weights on worker 0-0, policy_version 215951 (0.00079) [2022-07-09 10:50:38,852][25689] Fps is (10 sec: 5610.1, 60 sec: 5748.7, 300 sec: 5760.4). Total num frames: 221140992. Throughput: 0: 5160.0. Samples: 221134736. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:38,853][25689] Avg episode reward: [(0, '-50.318')] [2022-07-09 10:50:39,201][26022] Updated weights on worker 0-0, policy_version 215961 (0.00081) [2022-07-09 10:50:40,971][26022] Updated weights on worker 0-0, policy_version 215971 (0.00090) [2022-07-09 10:50:42,641][26022] Updated weights on worker 0-0, policy_version 215981 (0.00083) [2022-07-09 10:50:43,859][25689] Fps is (10 sec: 5917.8, 60 sec: 5766.3, 300 sec: 5772.3). Total num frames: 221171712. Throughput: 0: 6056.6. Samples: 221169894. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:43,860][25689] Avg episode reward: [(0, '-50.825')] [2022-07-09 10:50:44,683][26022] Updated weights on worker 0-0, policy_version 215991 (0.00094) [2022-07-09 10:50:45,914][26022] Updated weights on worker 0-0, policy_version 216001 (0.00087) [2022-07-09 10:50:48,010][26022] Updated weights on worker 0-0, policy_version 216011 (0.00083) [2022-07-09 10:50:48,927][25689] Fps is (10 sec: 5996.5, 60 sec: 5781.6, 300 sec: 5774.7). Total num frames: 221201408. Throughput: 0: 6068.7. Samples: 221204974. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:48,928][25689] Avg episode reward: [(0, '-51.068')] [2022-07-09 10:50:49,438][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:50:49,455][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000216020_221204480.pth [2022-07-09 10:50:49,455][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000213990_219125760.pth [2022-07-09 10:50:49,635][26022] Updated weights on worker 0-0, policy_version 216021 (0.00085) [2022-07-09 10:50:51,571][26022] Updated weights on worker 0-0, policy_version 216031 (0.00086) [2022-07-09 10:50:53,244][26022] Updated weights on worker 0-0, policy_version 216041 (0.00090) [2022-07-09 10:50:53,947][25689] Fps is (10 sec: 5786.1, 60 sec: 5747.8, 300 sec: 5767.7). Total num frames: 221230080. Throughput: 0: 5214.0. Samples: 221222344. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:53,947][25689] Avg episode reward: [(0, '-51.161')] [2022-07-09 10:50:55,177][26022] Updated weights on worker 0-0, policy_version 216051 (0.00085) [2022-07-09 10:50:56,702][26022] Updated weights on worker 0-0, policy_version 216061 (0.00090) [2022-07-09 10:50:58,762][26022] Updated weights on worker 0-0, policy_version 216071 (0.00083) [2022-07-09 10:50:58,963][25689] Fps is (10 sec: 5510.2, 60 sec: 5714.5, 300 sec: 5761.2). Total num frames: 221256704. Throughput: 0: 6077.2. Samples: 221256952. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:50:58,963][25689] Avg episode reward: [(0, '-50.336')] [2022-07-09 10:51:00,191][26022] Updated weights on worker 0-0, policy_version 216081 (0.00090) [2022-07-09 10:51:02,559][26022] Updated weights on worker 0-0, policy_version 216091 (0.00080) [2022-07-09 10:51:03,967][25689] Fps is (10 sec: 5518.5, 60 sec: 5749.9, 300 sec: 5773.1). Total num frames: 221285376. Throughput: 0: 5955.4. Samples: 221289644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:03,967][25689] Avg episode reward: [(0, '-49.698')] [2022-07-09 10:51:04,115][26022] Updated weights on worker 0-0, policy_version 216101 (0.00092) [2022-07-09 10:51:05,924][26022] Updated weights on worker 0-0, policy_version 216111 (0.00085) [2022-07-09 10:51:07,797][26022] Updated weights on worker 0-0, policy_version 216121 (0.00080) [2022-07-09 10:51:09,022][25689] Fps is (10 sec: 5802.3, 60 sec: 5770.5, 300 sec: 5772.8). Total num frames: 221315072. Throughput: 0: 5944.7. Samples: 221324432. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:09,024][25689] Avg episode reward: [(0, '-48.908')] [2022-07-09 10:51:09,375][26022] Updated weights on worker 0-0, policy_version 216131 (0.00094) [2022-07-09 10:51:11,226][26022] Updated weights on worker 0-0, policy_version 216141 (0.00090) [2022-07-09 10:51:13,032][26022] Updated weights on worker 0-0, policy_version 216151 (0.00087) [2022-07-09 10:51:14,028][25689] Fps is (10 sec: 5597.8, 60 sec: 5703.1, 300 sec: 5759.0). Total num frames: 221341696. Throughput: 0: 5958.2. Samples: 221341992. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:14,029][25689] Avg episode reward: [(0, '-49.538')] [2022-07-09 10:51:14,730][26022] Updated weights on worker 0-0, policy_version 216161 (0.00097) [2022-07-09 10:51:16,714][26022] Updated weights on worker 0-0, policy_version 216171 (0.00084) [2022-07-09 10:51:18,247][26022] Updated weights on worker 0-0, policy_version 216181 (0.00084) [2022-07-09 10:51:19,059][25689] Fps is (10 sec: 5815.3, 60 sec: 5771.2, 300 sec: 5769.0). Total num frames: 221373440. Throughput: 0: 5957.6. Samples: 221376678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:19,060][25689] Avg episode reward: [(0, '-49.211')] [2022-07-09 10:51:20,186][26022] Updated weights on worker 0-0, policy_version 216191 (0.00085) [2022-07-09 10:51:21,962][26022] Updated weights on worker 0-0, policy_version 216201 (0.00083) [2022-07-09 10:51:23,782][26022] Updated weights on worker 0-0, policy_version 216211 (0.00087) [2022-07-09 10:51:24,111][25689] Fps is (10 sec: 5991.9, 60 sec: 5769.2, 300 sec: 5768.9). Total num frames: 221402112. Throughput: 0: 6070.5. Samples: 221411928. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:24,113][25689] Avg episode reward: [(0, '-49.824')] [2022-07-09 10:51:25,414][26022] Updated weights on worker 0-0, policy_version 216221 (0.00086) [2022-07-09 10:51:27,284][26022] Updated weights on worker 0-0, policy_version 216231 (0.00064) [2022-07-09 10:51:28,861][26022] Updated weights on worker 0-0, policy_version 216241 (0.00089) [2022-07-09 10:51:29,174][25689] Fps is (10 sec: 5770.2, 60 sec: 5751.0, 300 sec: 5771.4). Total num frames: 221431808. Throughput: 0: 5199.9. Samples: 221429222. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:29,175][25689] Avg episode reward: [(0, '-50.582')] [2022-07-09 10:51:30,846][26022] Updated weights on worker 0-0, policy_version 216251 (0.00113) [2022-07-09 10:51:32,312][26022] Updated weights on worker 0-0, policy_version 216261 (0.00080) [2022-07-09 10:51:34,202][25689] Fps is (10 sec: 5682.3, 60 sec: 5749.2, 300 sec: 5764.7). Total num frames: 221459456. Throughput: 0: 6056.9. Samples: 221464188. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 10:51:34,204][25689] Avg episode reward: [(0, '-50.958')] [2022-07-09 10:51:34,393][26022] Updated weights on worker 0-0, policy_version 216271 (0.00078) [2022-07-09 10:51:35,883][26022] Updated weights on worker 0-0, policy_version 216281 (0.00087) [2022-07-09 10:51:37,935][26022] Updated weights on worker 0-0, policy_version 216291 (0.00084) [2022-07-09 10:51:39,210][25689] Fps is (10 sec: 5815.9, 60 sec: 5785.2, 300 sec: 5768.1). Total num frames: 221490176. Throughput: 0: 6061.1. Samples: 221498816. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:51:39,211][25689] Avg episode reward: [(0, '-50.458')] [2022-07-09 10:51:39,625][26022] Updated weights on worker 0-0, policy_version 216301 (0.00087) [2022-07-09 10:51:41,288][26022] Updated weights on worker 0-0, policy_version 216311 (0.00104) [2022-07-09 10:51:43,264][26022] Updated weights on worker 0-0, policy_version 216321 (0.00090) [2022-07-09 10:51:44,220][25689] Fps is (10 sec: 5928.9, 60 sec: 5751.0, 300 sec: 5768.8). Total num frames: 221518848. Throughput: 0: 5185.3. Samples: 221516200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:51:44,220][25689] Avg episode reward: [(0, '-50.729')] [2022-07-09 10:51:44,903][26022] Updated weights on worker 0-0, policy_version 216331 (0.00081) [2022-07-09 10:51:46,587][26022] Updated weights on worker 0-0, policy_version 216341 (0.00083) [2022-07-09 10:51:48,491][26022] Updated weights on worker 0-0, policy_version 216351 (0.00082) [2022-07-09 10:51:49,331][25689] Fps is (10 sec: 5767.3, 60 sec: 5746.9, 300 sec: 5770.5). Total num frames: 221548544. Throughput: 0: 6033.1. Samples: 221550828. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:51:49,331][25689] Avg episode reward: [(0, '-51.051')] [2022-07-09 10:51:50,259][26022] Updated weights on worker 0-0, policy_version 216361 (0.00093) [2022-07-09 10:51:51,963][26022] Updated weights on worker 0-0, policy_version 216371 (0.00084) [2022-07-09 10:51:53,700][26022] Updated weights on worker 0-0, policy_version 216381 (0.00087) [2022-07-09 10:51:54,364][25689] Fps is (10 sec: 5753.8, 60 sec: 5745.6, 300 sec: 5763.7). Total num frames: 221577216. Throughput: 0: 6031.2. Samples: 221585786. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:51:54,364][25689] Avg episode reward: [(0, '-49.968')] [2022-07-09 10:51:55,309][26022] Updated weights on worker 0-0, policy_version 216391 (0.00374) [2022-07-09 10:51:57,467][26022] Updated weights on worker 0-0, policy_version 216401 (0.00080) [2022-07-09 10:51:58,999][26022] Updated weights on worker 0-0, policy_version 216411 (0.00092) [2022-07-09 10:51:59,367][25689] Fps is (10 sec: 5714.0, 60 sec: 5780.8, 300 sec: 5774.3). Total num frames: 221605888. Throughput: 0: 5173.2. Samples: 221603088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:51:59,367][25689] Avg episode reward: [(0, '-50.491')] [2022-07-09 10:52:00,856][26022] Updated weights on worker 0-0, policy_version 216421 (0.00111) [2022-07-09 10:52:02,833][26022] Updated weights on worker 0-0, policy_version 216431 (0.00088) [2022-07-09 10:52:04,397][25689] Fps is (10 sec: 5511.3, 60 sec: 5744.4, 300 sec: 5768.3). Total num frames: 221632512. Throughput: 0: 5925.8. Samples: 221635768. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:04,399][25689] Avg episode reward: [(0, '-50.532')] [2022-07-09 10:52:04,745][26022] Updated weights on worker 0-0, policy_version 216441 (0.00084) [2022-07-09 10:52:06,492][26022] Updated weights on worker 0-0, policy_version 216451 (0.00085) [2022-07-09 10:52:08,291][26022] Updated weights on worker 0-0, policy_version 216461 (0.00088) [2022-07-09 10:52:09,520][25689] Fps is (10 sec: 5647.8, 60 sec: 5754.9, 300 sec: 5770.0). Total num frames: 221663232. Throughput: 0: 5914.3. Samples: 221670232. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:09,521][25689] Avg episode reward: [(0, '-49.618')] [2022-07-09 10:52:10,135][26022] Updated weights on worker 0-0, policy_version 216471 (0.00090) [2022-07-09 10:52:11,828][26022] Updated weights on worker 0-0, policy_version 216481 (0.00085) [2022-07-09 10:52:13,675][26022] Updated weights on worker 0-0, policy_version 216491 (0.00084) [2022-07-09 10:52:14,523][25689] Fps is (10 sec: 5764.3, 60 sec: 5772.1, 300 sec: 5763.5). Total num frames: 221690880. Throughput: 0: 5042.3. Samples: 221687434. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:14,524][25689] Avg episode reward: [(0, '-49.567')] [2022-07-09 10:52:15,331][26022] Updated weights on worker 0-0, policy_version 216501 (0.00096) [2022-07-09 10:52:17,143][26022] Updated weights on worker 0-0, policy_version 216511 (0.00087) [2022-07-09 10:52:18,864][26022] Updated weights on worker 0-0, policy_version 216521 (0.00083) [2022-07-09 10:52:19,603][25689] Fps is (10 sec: 5687.3, 60 sec: 5733.7, 300 sec: 5766.6). Total num frames: 221720576. Throughput: 0: 5900.9. Samples: 221722500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:19,603][25689] Avg episode reward: [(0, '-49.395')] [2022-07-09 10:52:20,696][26022] Updated weights on worker 0-0, policy_version 216531 (0.00093) [2022-07-09 10:52:22,299][26022] Updated weights on worker 0-0, policy_version 216541 (0.00081) [2022-07-09 10:52:24,146][26022] Updated weights on worker 0-0, policy_version 216551 (0.00091) [2022-07-09 10:52:24,609][25689] Fps is (10 sec: 5888.3, 60 sec: 5754.8, 300 sec: 5764.1). Total num frames: 221750272. Throughput: 0: 6018.2. Samples: 221757410. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:24,610][25689] Avg episode reward: [(0, '-49.721')] [2022-07-09 10:52:26,183][26022] Updated weights on worker 0-0, policy_version 216561 (0.00097) [2022-07-09 10:52:27,540][26022] Updated weights on worker 0-0, policy_version 216571 (0.00081) [2022-07-09 10:52:29,654][25689] Fps is (10 sec: 5705.3, 60 sec: 5722.8, 300 sec: 5761.1). Total num frames: 221777920. Throughput: 0: 5185.9. Samples: 221774644. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:29,654][25689] Avg episode reward: [(0, '-49.493')] [2022-07-09 10:52:29,663][26022] Updated weights on worker 0-0, policy_version 216581 (0.00411) [2022-07-09 10:52:31,058][26022] Updated weights on worker 0-0, policy_version 216591 (0.00075) [2022-07-09 10:52:33,198][26022] Updated weights on worker 0-0, policy_version 216601 (0.00513) [2022-07-09 10:52:34,624][26022] Updated weights on worker 0-0, policy_version 216611 (0.00080) [2022-07-09 10:52:34,718][25689] Fps is (10 sec: 5875.3, 60 sec: 5787.1, 300 sec: 5767.9). Total num frames: 221809664. Throughput: 0: 6042.1. Samples: 221809456. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:34,719][25689] Avg episode reward: [(0, '-50.335')] [2022-07-09 10:52:36,609][26022] Updated weights on worker 0-0, policy_version 216621 (0.00093) [2022-07-09 10:52:38,415][26022] Updated weights on worker 0-0, policy_version 216631 (0.00096) [2022-07-09 10:52:39,781][25689] Fps is (10 sec: 5864.3, 60 sec: 5731.1, 300 sec: 5763.4). Total num frames: 221837312. Throughput: 0: 6023.2. Samples: 221844040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:39,782][25689] Avg episode reward: [(0, '-50.079')] [2022-07-09 10:52:40,202][26022] Updated weights on worker 0-0, policy_version 216641 (0.00079) [2022-07-09 10:52:41,870][26022] Updated weights on worker 0-0, policy_version 216651 (0.00092) [2022-07-09 10:52:43,700][26022] Updated weights on worker 0-0, policy_version 216661 (0.00079) [2022-07-09 10:52:44,819][25689] Fps is (10 sec: 5677.3, 60 sec: 5745.3, 300 sec: 5761.2). Total num frames: 221867008. Throughput: 0: 5157.8. Samples: 221861646. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:44,821][25689] Avg episode reward: [(0, '-50.505')] [2022-07-09 10:52:45,358][26022] Updated weights on worker 0-0, policy_version 216671 (0.00091) [2022-07-09 10:52:47,059][26022] Updated weights on worker 0-0, policy_version 216681 (0.00082) [2022-07-09 10:52:48,903][26022] Updated weights on worker 0-0, policy_version 216691 (0.00088) [2022-07-09 10:52:49,477][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:52:49,490][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000216695_221895680.pth [2022-07-09 10:52:49,491][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000214665_219816960.pth [2022-07-09 10:52:49,902][25689] Fps is (10 sec: 5969.5, 60 sec: 5764.9, 300 sec: 5767.5). Total num frames: 221897728. Throughput: 0: 6047.9. Samples: 221897104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:49,904][25689] Avg episode reward: [(0, '-50.289')] [2022-07-09 10:52:50,621][26022] Updated weights on worker 0-0, policy_version 216701 (0.00082) [2022-07-09 10:52:52,239][26022] Updated weights on worker 0-0, policy_version 216711 (0.00083) [2022-07-09 10:52:54,132][26022] Updated weights on worker 0-0, policy_version 216721 (0.00079) [2022-07-09 10:52:54,950][25689] Fps is (10 sec: 5963.2, 60 sec: 5780.3, 300 sec: 5767.1). Total num frames: 221927424. Throughput: 0: 6070.4. Samples: 221932272. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:54,959][25689] Avg episode reward: [(0, '-49.832')] [2022-07-09 10:52:55,924][26022] Updated weights on worker 0-0, policy_version 216731 (0.00054) [2022-07-09 10:52:57,560][26022] Updated weights on worker 0-0, policy_version 216741 (0.00090) [2022-07-09 10:52:59,534][26022] Updated weights on worker 0-0, policy_version 216751 (0.00087) [2022-07-09 10:52:59,973][25689] Fps is (10 sec: 5694.0, 60 sec: 5761.5, 300 sec: 5763.3). Total num frames: 221955072. Throughput: 0: 5238.0. Samples: 221949804. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:52:59,974][25689] Avg episode reward: [(0, '-49.832')] [2022-07-09 10:53:00,955][26022] Updated weights on worker 0-0, policy_version 216761 (0.00093) [2022-07-09 10:53:03,516][26022] Updated weights on worker 0-0, policy_version 216771 (0.00092) [2022-07-09 10:53:04,879][26022] Updated weights on worker 0-0, policy_version 216781 (0.00093) [2022-07-09 10:53:04,977][25689] Fps is (10 sec: 5617.2, 60 sec: 5797.9, 300 sec: 5765.4). Total num frames: 221983744. Throughput: 0: 5996.1. Samples: 221982514. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:04,979][25689] Avg episode reward: [(0, '-49.614')] [2022-07-09 10:53:06,918][26022] Updated weights on worker 0-0, policy_version 216791 (0.00083) [2022-07-09 10:53:08,596][26022] Updated weights on worker 0-0, policy_version 216801 (0.00086) [2022-07-09 10:53:10,102][25689] Fps is (10 sec: 5560.2, 60 sec: 5746.9, 300 sec: 5759.8). Total num frames: 222011392. Throughput: 0: 5939.3. Samples: 222017078. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:10,103][25689] Avg episode reward: [(0, '-50.835')] [2022-07-09 10:53:10,374][26022] Updated weights on worker 0-0, policy_version 216811 (0.00088) [2022-07-09 10:53:12,061][26022] Updated weights on worker 0-0, policy_version 216821 (0.00087) [2022-07-09 10:53:13,810][26022] Updated weights on worker 0-0, policy_version 216831 (0.00087) [2022-07-09 10:53:15,108][25689] Fps is (10 sec: 5660.2, 60 sec: 5780.5, 300 sec: 5763.2). Total num frames: 222041088. Throughput: 0: 5079.1. Samples: 222034654. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:15,108][25689] Avg episode reward: [(0, '-51.258')] [2022-07-09 10:53:15,694][26022] Updated weights on worker 0-0, policy_version 216841 (0.00088) [2022-07-09 10:53:17,492][26022] Updated weights on worker 0-0, policy_version 216851 (0.00092) [2022-07-09 10:53:19,095][26022] Updated weights on worker 0-0, policy_version 216861 (0.00087) [2022-07-09 10:53:20,138][25689] Fps is (10 sec: 5917.9, 60 sec: 5785.2, 300 sec: 5761.1). Total num frames: 222070784. Throughput: 0: 5944.0. Samples: 222069666. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:20,139][25689] Avg episode reward: [(0, '-51.074')] [2022-07-09 10:53:20,979][26022] Updated weights on worker 0-0, policy_version 216871 (0.00082) [2022-07-09 10:53:22,475][26022] Updated weights on worker 0-0, policy_version 216881 (0.00093) [2022-07-09 10:53:24,439][26022] Updated weights on worker 0-0, policy_version 216891 (0.00088) [2022-07-09 10:53:25,172][25689] Fps is (10 sec: 5901.5, 60 sec: 5782.6, 300 sec: 5765.9). Total num frames: 222100480. Throughput: 0: 6062.5. Samples: 222104946. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:25,172][25689] Avg episode reward: [(0, '-51.836')] [2022-07-09 10:53:26,144][26022] Updated weights on worker 0-0, policy_version 216901 (0.00087) [2022-07-09 10:53:27,932][26022] Updated weights on worker 0-0, policy_version 216911 (0.00593) [2022-07-09 10:53:29,652][26022] Updated weights on worker 0-0, policy_version 216921 (0.00087) [2022-07-09 10:53:30,266][25689] Fps is (10 sec: 5661.8, 60 sec: 5777.8, 300 sec: 5758.0). Total num frames: 222128128. Throughput: 0: 6073.8. Samples: 222139552. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:30,267][25689] Avg episode reward: [(0, '-51.896')] [2022-07-09 10:53:31,555][26022] Updated weights on worker 0-0, policy_version 216931 (0.00087) [2022-07-09 10:53:33,330][26022] Updated weights on worker 0-0, policy_version 216941 (0.00096) [2022-07-09 10:53:34,954][26022] Updated weights on worker 0-0, policy_version 216951 (0.00083) [2022-07-09 10:53:35,296][25689] Fps is (10 sec: 5765.2, 60 sec: 5764.2, 300 sec: 5764.6). Total num frames: 222158848. Throughput: 0: 6049.7. Samples: 222156786. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:35,296][25689] Avg episode reward: [(0, '-52.143')] [2022-07-09 10:53:36,605][26022] Updated weights on worker 0-0, policy_version 216961 (0.00086) [2022-07-09 10:53:38,732][26022] Updated weights on worker 0-0, policy_version 216971 (0.00094) [2022-07-09 10:53:40,247][26022] Updated weights on worker 0-0, policy_version 216981 (0.00092) [2022-07-09 10:53:40,321][25689] Fps is (10 sec: 6008.6, 60 sec: 5801.6, 300 sec: 5764.3). Total num frames: 222188544. Throughput: 0: 6026.2. Samples: 222191294. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:40,322][25689] Avg episode reward: [(0, '-51.019')] [2022-07-09 10:53:42,132][26022] Updated weights on worker 0-0, policy_version 216991 (0.00084) [2022-07-09 10:53:43,915][26022] Updated weights on worker 0-0, policy_version 217001 (0.00087) [2022-07-09 10:53:45,333][25689] Fps is (10 sec: 5713.1, 60 sec: 5770.3, 300 sec: 5756.3). Total num frames: 222216192. Throughput: 0: 6033.9. Samples: 222226598. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:45,334][25689] Avg episode reward: [(0, '-50.618')] [2022-07-09 10:53:45,585][26022] Updated weights on worker 0-0, policy_version 217011 (0.00087) [2022-07-09 10:53:47,407][26022] Updated weights on worker 0-0, policy_version 217021 (0.00095) [2022-07-09 10:53:49,096][26022] Updated weights on worker 0-0, policy_version 217031 (0.00086) [2022-07-09 10:53:50,420][25689] Fps is (10 sec: 5779.9, 60 sec: 5769.9, 300 sec: 5762.0). Total num frames: 222246912. Throughput: 0: 5192.2. Samples: 222244192. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 10:53:50,420][25689] Avg episode reward: [(0, '-51.319')] [2022-07-09 10:53:50,864][26022] Updated weights on worker 0-0, policy_version 217041 (0.00089) [2022-07-09 10:53:52,720][26022] Updated weights on worker 0-0, policy_version 217051 (0.00089) [2022-07-09 10:53:54,318][26022] Updated weights on worker 0-0, policy_version 217061 (0.00084) [2022-07-09 10:53:55,476][25689] Fps is (10 sec: 5754.5, 60 sec: 5735.3, 300 sec: 5754.5). Total num frames: 222274560. Throughput: 0: 6041.6. Samples: 222278708. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:53:55,477][25689] Avg episode reward: [(0, '-50.589')] [2022-07-09 10:53:56,451][26022] Updated weights on worker 0-0, policy_version 217071 (0.00084) [2022-07-09 10:53:57,927][26022] Updated weights on worker 0-0, policy_version 217081 (0.00097) [2022-07-09 10:53:59,808][26022] Updated weights on worker 0-0, policy_version 217091 (0.00085) [2022-07-09 10:54:00,482][25689] Fps is (10 sec: 5801.0, 60 sec: 5787.7, 300 sec: 5771.7). Total num frames: 222305280. Throughput: 0: 6046.9. Samples: 222313200. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:00,482][25689] Avg episode reward: [(0, '-50.050')] [2022-07-09 10:54:01,698][26022] Updated weights on worker 0-0, policy_version 217101 (0.00083) [2022-07-09 10:54:03,638][26022] Updated weights on worker 0-0, policy_version 217111 (0.00087) [2022-07-09 10:54:05,495][25689] Fps is (10 sec: 5621.6, 60 sec: 5736.1, 300 sec: 5756.4). Total num frames: 222330880. Throughput: 0: 5054.0. Samples: 222328494. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:05,497][25689] Avg episode reward: [(0, '-49.661')] [2022-07-09 10:54:05,542][26022] Updated weights on worker 0-0, policy_version 217121 (0.00095) [2022-07-09 10:54:07,288][26022] Updated weights on worker 0-0, policy_version 217131 (0.00089) [2022-07-09 10:54:08,893][26022] Updated weights on worker 0-0, policy_version 217141 (0.00085) [2022-07-09 10:54:10,564][25689] Fps is (10 sec: 5484.5, 60 sec: 5775.3, 300 sec: 5760.3). Total num frames: 222360576. Throughput: 0: 5913.1. Samples: 222363304. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:10,564][25689] Avg episode reward: [(0, '-49.617')] [2022-07-09 10:54:10,795][26022] Updated weights on worker 0-0, policy_version 217151 (0.00078) [2022-07-09 10:54:12,504][26022] Updated weights on worker 0-0, policy_version 217161 (0.00099) [2022-07-09 10:54:14,287][26022] Updated weights on worker 0-0, policy_version 217171 (0.00102) [2022-07-09 10:54:15,591][25689] Fps is (10 sec: 5882.5, 60 sec: 5773.2, 300 sec: 5760.0). Total num frames: 222390272. Throughput: 0: 5957.2. Samples: 222398536. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:15,592][25689] Avg episode reward: [(0, '-50.048')] [2022-07-09 10:54:16,018][26022] Updated weights on worker 0-0, policy_version 217181 (0.00091) [2022-07-09 10:54:17,782][26022] Updated weights on worker 0-0, policy_version 217191 (0.00095) [2022-07-09 10:54:19,631][26022] Updated weights on worker 0-0, policy_version 217201 (0.00089) [2022-07-09 10:54:20,606][25689] Fps is (10 sec: 5812.2, 60 sec: 5757.8, 300 sec: 5760.3). Total num frames: 222418944. Throughput: 0: 5106.6. Samples: 222415968. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:20,607][25689] Avg episode reward: [(0, '-50.092')] [2022-07-09 10:54:21,129][26022] Updated weights on worker 0-0, policy_version 217211 (0.00079) [2022-07-09 10:54:22,939][26022] Updated weights on worker 0-0, policy_version 217221 (0.00081) [2022-07-09 10:54:24,778][26022] Updated weights on worker 0-0, policy_version 217231 (0.00085) [2022-07-09 10:54:25,656][25689] Fps is (10 sec: 5799.3, 60 sec: 5756.2, 300 sec: 5760.4). Total num frames: 222448640. Throughput: 0: 6077.1. Samples: 222451014. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:25,657][25689] Avg episode reward: [(0, '-50.454')] [2022-07-09 10:54:26,669][26022] Updated weights on worker 0-0, policy_version 217241 (0.00087) [2022-07-09 10:54:28,255][26022] Updated weights on worker 0-0, policy_version 217251 (0.00078) [2022-07-09 10:54:30,243][26022] Updated weights on worker 0-0, policy_version 217261 (0.00056) [2022-07-09 10:54:30,730][25689] Fps is (10 sec: 5765.3, 60 sec: 5775.1, 300 sec: 5756.0). Total num frames: 222477312. Throughput: 0: 6069.0. Samples: 222485694. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:30,731][25689] Avg episode reward: [(0, '-50.385')] [2022-07-09 10:54:31,935][26022] Updated weights on worker 0-0, policy_version 217271 (0.00094) [2022-07-09 10:54:33,630][26022] Updated weights on worker 0-0, policy_version 217281 (0.00086) [2022-07-09 10:54:35,323][26022] Updated weights on worker 0-0, policy_version 217291 (0.00096) [2022-07-09 10:54:35,745][25689] Fps is (10 sec: 5785.4, 60 sec: 5759.6, 300 sec: 5762.8). Total num frames: 222507008. Throughput: 0: 5201.3. Samples: 222503360. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:35,745][25689] Avg episode reward: [(0, '-51.045')] [2022-07-09 10:54:37,192][26022] Updated weights on worker 0-0, policy_version 217301 (0.00086) [2022-07-09 10:54:38,872][26022] Updated weights on worker 0-0, policy_version 217311 (0.00111) [2022-07-09 10:54:40,759][25689] Fps is (10 sec: 5922.3, 60 sec: 5760.7, 300 sec: 5762.8). Total num frames: 222536704. Throughput: 0: 6057.5. Samples: 222538042. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:40,760][25689] Avg episode reward: [(0, '-49.831')] [2022-07-09 10:54:40,763][26022] Updated weights on worker 0-0, policy_version 217321 (0.00090) [2022-07-09 10:54:42,517][26022] Updated weights on worker 0-0, policy_version 217331 (0.00085) [2022-07-09 10:54:44,147][26022] Updated weights on worker 0-0, policy_version 217341 (0.00082) [2022-07-09 10:54:45,771][25689] Fps is (10 sec: 5923.9, 60 sec: 5794.6, 300 sec: 5767.0). Total num frames: 222566400. Throughput: 0: 6091.2. Samples: 222573536. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:45,771][25689] Avg episode reward: [(0, '-49.021')] [2022-07-09 10:54:46,027][26022] Updated weights on worker 0-0, policy_version 217351 (0.00087) [2022-07-09 10:54:47,583][26022] Updated weights on worker 0-0, policy_version 217361 (0.00088) [2022-07-09 10:54:49,409][26022] Updated weights on worker 0-0, policy_version 217371 (0.00081) [2022-07-09 10:54:49,710][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:54:49,722][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000217372_222588928.pth [2022-07-09 10:54:49,723][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000215344_220512256.pth [2022-07-09 10:54:50,831][25689] Fps is (10 sec: 5896.4, 60 sec: 5780.1, 300 sec: 5762.8). Total num frames: 222596096. Throughput: 0: 5249.1. Samples: 222591206. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:50,832][25689] Avg episode reward: [(0, '-48.985')] [2022-07-09 10:54:51,355][26022] Updated weights on worker 0-0, policy_version 217381 (0.00102) [2022-07-09 10:54:52,720][26022] Updated weights on worker 0-0, policy_version 217391 (0.00092) [2022-07-09 10:54:54,760][26022] Updated weights on worker 0-0, policy_version 217401 (0.00080) [2022-07-09 10:54:55,846][25689] Fps is (10 sec: 5894.9, 60 sec: 5818.0, 300 sec: 5766.4). Total num frames: 222625792. Throughput: 0: 6120.2. Samples: 222626382. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:54:55,846][25689] Avg episode reward: [(0, '-48.884')] [2022-07-09 10:54:56,420][26022] Updated weights on worker 0-0, policy_version 217411 (0.00102) [2022-07-09 10:54:58,194][26022] Updated weights on worker 0-0, policy_version 217421 (0.00081) [2022-07-09 10:54:59,930][26022] Updated weights on worker 0-0, policy_version 217431 (0.00092) [2022-07-09 10:55:00,848][25689] Fps is (10 sec: 5724.6, 60 sec: 5767.4, 300 sec: 5770.1). Total num frames: 222653440. Throughput: 0: 6144.5. Samples: 222661484. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:00,850][25689] Avg episode reward: [(0, '-48.248')] [2022-07-09 10:55:02,029][26022] Updated weights on worker 0-0, policy_version 217441 (0.00076) [2022-07-09 10:55:03,941][26022] Updated weights on worker 0-0, policy_version 217451 (0.00081) [2022-07-09 10:55:05,611][26022] Updated weights on worker 0-0, policy_version 217461 (0.00088) [2022-07-09 10:55:05,875][25689] Fps is (10 sec: 5411.7, 60 sec: 5783.1, 300 sec: 5764.6). Total num frames: 222680064. Throughput: 0: 5132.4. Samples: 222676716. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:05,877][25689] Avg episode reward: [(0, '-47.222')] [2022-07-09 10:55:07,349][26022] Updated weights on worker 0-0, policy_version 217471 (0.00082) [2022-07-09 10:55:09,511][26022] Updated weights on worker 0-0, policy_version 217481 (0.00089) [2022-07-09 10:55:10,919][25689] Fps is (10 sec: 5592.7, 60 sec: 5785.5, 300 sec: 5760.5). Total num frames: 222709760. Throughput: 0: 5965.3. Samples: 222711034. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:10,920][25689] Avg episode reward: [(0, '-47.560')] [2022-07-09 10:55:11,103][26022] Updated weights on worker 0-0, policy_version 217491 (0.00081) [2022-07-09 10:55:12,795][26022] Updated weights on worker 0-0, policy_version 217501 (0.00082) [2022-07-09 10:55:14,603][26022] Updated weights on worker 0-0, policy_version 217511 (0.00081) [2022-07-09 10:55:15,927][25689] Fps is (10 sec: 5806.2, 60 sec: 5770.4, 300 sec: 5764.4). Total num frames: 222738432. Throughput: 0: 5970.6. Samples: 222746280. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:15,928][25689] Avg episode reward: [(0, '-47.603')] [2022-07-09 10:55:16,156][26022] Updated weights on worker 0-0, policy_version 217521 (0.00088) [2022-07-09 10:55:18,032][26022] Updated weights on worker 0-0, policy_version 217531 (0.00086) [2022-07-09 10:55:19,809][26022] Updated weights on worker 0-0, policy_version 217541 (0.00092) [2022-07-09 10:55:20,951][25689] Fps is (10 sec: 5817.8, 60 sec: 5786.5, 300 sec: 5768.0). Total num frames: 222768128. Throughput: 0: 5092.3. Samples: 222763854. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:20,952][25689] Avg episode reward: [(0, '-48.164')] [2022-07-09 10:55:21,503][26022] Updated weights on worker 0-0, policy_version 217551 (0.00087) [2022-07-09 10:55:23,474][26022] Updated weights on worker 0-0, policy_version 217561 (0.00084) [2022-07-09 10:55:24,939][26022] Updated weights on worker 0-0, policy_version 217571 (0.00084) [2022-07-09 10:55:25,967][25689] Fps is (10 sec: 5813.7, 60 sec: 5772.8, 300 sec: 5761.8). Total num frames: 222796800. Throughput: 0: 6074.9. Samples: 222798776. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:25,967][25689] Avg episode reward: [(0, '-47.951')] [2022-07-09 10:55:26,806][26022] Updated weights on worker 0-0, policy_version 217581 (0.00095) [2022-07-09 10:55:28,601][26022] Updated weights on worker 0-0, policy_version 217591 (0.00087) [2022-07-09 10:55:30,290][26022] Updated weights on worker 0-0, policy_version 217601 (0.00098) [2022-07-09 10:55:31,024][25689] Fps is (10 sec: 5794.4, 60 sec: 5791.3, 300 sec: 5767.7). Total num frames: 222826496. Throughput: 0: 6102.5. Samples: 222833730. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:31,025][25689] Avg episode reward: [(0, '-48.166')] [2022-07-09 10:55:32,147][26022] Updated weights on worker 0-0, policy_version 217611 (0.00085) [2022-07-09 10:55:33,797][26022] Updated weights on worker 0-0, policy_version 217621 (0.00266) [2022-07-09 10:55:35,435][26022] Updated weights on worker 0-0, policy_version 217631 (0.00100) [2022-07-09 10:55:36,035][25689] Fps is (10 sec: 5899.2, 60 sec: 5791.7, 300 sec: 5771.5). Total num frames: 222856192. Throughput: 0: 5215.8. Samples: 222851156. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:36,035][25689] Avg episode reward: [(0, '-49.557')] [2022-07-09 10:55:37,342][26022] Updated weights on worker 0-0, policy_version 217641 (0.00088) [2022-07-09 10:55:38,980][26022] Updated weights on worker 0-0, policy_version 217651 (0.00083) [2022-07-09 10:55:40,994][26022] Updated weights on worker 0-0, policy_version 217661 (0.00087) [2022-07-09 10:55:41,047][25689] Fps is (10 sec: 5823.5, 60 sec: 5774.9, 300 sec: 5764.5). Total num frames: 222884864. Throughput: 0: 6082.5. Samples: 222886088. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:41,048][25689] Avg episode reward: [(0, '-49.631')] [2022-07-09 10:55:42,658][26022] Updated weights on worker 0-0, policy_version 217671 (0.00086) [2022-07-09 10:55:44,490][26022] Updated weights on worker 0-0, policy_version 217681 (0.00081) [2022-07-09 10:55:46,079][25689] Fps is (10 sec: 5709.0, 60 sec: 5756.0, 300 sec: 5761.8). Total num frames: 222913536. Throughput: 0: 6074.6. Samples: 222920950. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:46,080][25689] Avg episode reward: [(0, '-49.475')] [2022-07-09 10:55:46,283][26022] Updated weights on worker 0-0, policy_version 217691 (0.00085) [2022-07-09 10:55:48,031][26022] Updated weights on worker 0-0, policy_version 217701 (0.00281) [2022-07-09 10:55:49,697][26022] Updated weights on worker 0-0, policy_version 217711 (0.00087) [2022-07-09 10:55:51,139][25689] Fps is (10 sec: 5885.1, 60 sec: 5773.0, 300 sec: 5767.9). Total num frames: 222944256. Throughput: 0: 5210.0. Samples: 222938528. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:51,140][25689] Avg episode reward: [(0, '-49.652')] [2022-07-09 10:55:51,663][26022] Updated weights on worker 0-0, policy_version 217721 (0.00094) [2022-07-09 10:55:53,223][26022] Updated weights on worker 0-0, policy_version 217731 (0.00080) [2022-07-09 10:55:55,171][26022] Updated weights on worker 0-0, policy_version 217741 (0.00087) [2022-07-09 10:55:56,213][25689] Fps is (10 sec: 5962.1, 60 sec: 5767.4, 300 sec: 5777.1). Total num frames: 222973952. Throughput: 0: 6051.7. Samples: 222973266. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:55:56,213][25689] Avg episode reward: [(0, '-50.094')] [2022-07-09 10:55:56,746][26022] Updated weights on worker 0-0, policy_version 217751 (0.00079) [2022-07-09 10:55:58,577][26022] Updated weights on worker 0-0, policy_version 217761 (0.00611) [2022-07-09 10:56:00,248][26022] Updated weights on worker 0-0, policy_version 217771 (0.00088) [2022-07-09 10:56:01,244][25689] Fps is (10 sec: 5776.3, 60 sec: 5781.6, 300 sec: 5776.6). Total num frames: 223002624. Throughput: 0: 6058.4. Samples: 223008448. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:56:01,244][25689] Avg episode reward: [(0, '-50.518')] [2022-07-09 10:56:02,487][26022] Updated weights on worker 0-0, policy_version 217781 (0.00090) [2022-07-09 10:56:04,074][26022] Updated weights on worker 0-0, policy_version 217791 (0.00083) [2022-07-09 10:56:05,895][26022] Updated weights on worker 0-0, policy_version 217801 (0.00092) [2022-07-09 10:56:06,250][25689] Fps is (10 sec: 5407.1, 60 sec: 5766.6, 300 sec: 5763.8). Total num frames: 223028224. Throughput: 0: 5105.0. Samples: 223023920. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 10:56:06,250][25689] Avg episode reward: [(0, '-49.979')] [2022-07-09 10:56:07,576][26022] Updated weights on worker 0-0, policy_version 217811 (0.00094) [2022-07-09 10:56:09,686][26022] Updated weights on worker 0-0, policy_version 217821 (0.00081) [2022-07-09 10:56:11,213][26022] Updated weights on worker 0-0, policy_version 217831 (0.00082) [2022-07-09 10:56:11,296][25689] Fps is (10 sec: 5704.8, 60 sec: 5800.3, 300 sec: 5780.2). Total num frames: 223059968. Throughput: 0: 5961.9. Samples: 223058700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:11,296][25689] Avg episode reward: [(0, '-49.976')] [2022-07-09 10:56:13,179][26022] Updated weights on worker 0-0, policy_version 217841 (0.00083) [2022-07-09 10:56:14,545][26022] Updated weights on worker 0-0, policy_version 217851 (0.00103) [2022-07-09 10:56:16,299][25689] Fps is (10 sec: 5808.5, 60 sec: 5767.0, 300 sec: 5763.5). Total num frames: 223086592. Throughput: 0: 5969.6. Samples: 223093170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:16,299][25689] Avg episode reward: [(0, '-50.118')] [2022-07-09 10:56:16,628][26022] Updated weights on worker 0-0, policy_version 217861 (0.00091) [2022-07-09 10:56:18,395][26022] Updated weights on worker 0-0, policy_version 217871 (0.00087) [2022-07-09 10:56:20,041][26022] Updated weights on worker 0-0, policy_version 217881 (0.00462) [2022-07-09 10:56:21,319][25689] Fps is (10 sec: 5619.2, 60 sec: 5767.4, 300 sec: 5767.6). Total num frames: 223116288. Throughput: 0: 5094.8. Samples: 223110724. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:21,319][25689] Avg episode reward: [(0, '-50.338')] [2022-07-09 10:56:21,828][26022] Updated weights on worker 0-0, policy_version 217891 (0.00089) [2022-07-09 10:56:23,522][26022] Updated weights on worker 0-0, policy_version 217901 (0.00089) [2022-07-09 10:56:25,459][26022] Updated weights on worker 0-0, policy_version 217911 (0.00087) [2022-07-09 10:56:26,338][25689] Fps is (10 sec: 6119.9, 60 sec: 5817.9, 300 sec: 5775.4). Total num frames: 223148032. Throughput: 0: 6066.1. Samples: 223145776. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:26,338][25689] Avg episode reward: [(0, '-49.788')] [2022-07-09 10:56:27,271][26022] Updated weights on worker 0-0, policy_version 217921 (0.00083) [2022-07-09 10:56:28,783][26022] Updated weights on worker 0-0, policy_version 217931 (0.00096) [2022-07-09 10:56:30,762][26022] Updated weights on worker 0-0, policy_version 217941 (0.00093) [2022-07-09 10:56:31,397][25689] Fps is (10 sec: 5791.4, 60 sec: 5766.9, 300 sec: 5771.3). Total num frames: 223174656. Throughput: 0: 6070.6. Samples: 223180726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:31,398][25689] Avg episode reward: [(0, '-50.482')] [2022-07-09 10:56:32,383][26022] Updated weights on worker 0-0, policy_version 217951 (0.00084) [2022-07-09 10:56:34,203][26022] Updated weights on worker 0-0, policy_version 217961 (0.00085) [2022-07-09 10:56:35,772][26022] Updated weights on worker 0-0, policy_version 217971 (0.00084) [2022-07-09 10:56:36,405][25689] Fps is (10 sec: 5594.4, 60 sec: 5767.1, 300 sec: 5767.9). Total num frames: 223204352. Throughput: 0: 5226.8. Samples: 223198262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:36,405][25689] Avg episode reward: [(0, '-50.220')] [2022-07-09 10:56:37,503][26022] Updated weights on worker 0-0, policy_version 217981 (0.00085) [2022-07-09 10:56:39,404][26022] Updated weights on worker 0-0, policy_version 217991 (0.00101) [2022-07-09 10:56:41,417][25689] Fps is (10 sec: 5723.1, 60 sec: 5750.2, 300 sec: 5764.4). Total num frames: 223232000. Throughput: 0: 6094.1. Samples: 223233204. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:41,417][25689] Avg episode reward: [(0, '-49.892')] [2022-07-09 10:56:41,434][26022] Updated weights on worker 0-0, policy_version 218001 (0.00082) [2022-07-09 10:56:42,878][26022] Updated weights on worker 0-0, policy_version 218011 (0.00089) [2022-07-09 10:56:44,782][26022] Updated weights on worker 0-0, policy_version 218021 (0.00080) [2022-07-09 10:56:46,415][26022] Updated weights on worker 0-0, policy_version 218031 (0.00089) [2022-07-09 10:56:46,491][25689] Fps is (10 sec: 5888.7, 60 sec: 5797.0, 300 sec: 5772.0). Total num frames: 223263744. Throughput: 0: 6073.9. Samples: 223268184. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:46,491][25689] Avg episode reward: [(0, '-50.251')] [2022-07-09 10:56:48,515][26022] Updated weights on worker 0-0, policy_version 218041 (0.00096) [2022-07-09 10:56:49,737][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:56:49,745][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000218050_223283200.pth [2022-07-09 10:56:49,746][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000216020_221204480.pth [2022-07-09 10:56:49,886][26022] Updated weights on worker 0-0, policy_version 218051 (0.00084) [2022-07-09 10:56:51,607][25689] Fps is (10 sec: 5928.8, 60 sec: 5757.8, 300 sec: 5770.4). Total num frames: 223292416. Throughput: 0: 5183.6. Samples: 223285486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:51,607][25689] Avg episode reward: [(0, '-50.024')] [2022-07-09 10:56:51,795][26022] Updated weights on worker 0-0, policy_version 218061 (0.00090) [2022-07-09 10:56:53,548][26022] Updated weights on worker 0-0, policy_version 218071 (0.00085) [2022-07-09 10:56:55,420][26022] Updated weights on worker 0-0, policy_version 218081 (0.00088) [2022-07-09 10:56:56,615][25689] Fps is (10 sec: 5765.0, 60 sec: 5764.0, 300 sec: 5773.7). Total num frames: 223322112. Throughput: 0: 6050.6. Samples: 223320546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:56:56,616][25689] Avg episode reward: [(0, '-49.355')] [2022-07-09 10:56:56,990][26022] Updated weights on worker 0-0, policy_version 218091 (0.00118) [2022-07-09 10:56:58,801][26022] Updated weights on worker 0-0, policy_version 218101 (0.00086) [2022-07-09 10:57:00,479][26022] Updated weights on worker 0-0, policy_version 218111 (0.00083) [2022-07-09 10:57:01,642][25689] Fps is (10 sec: 5918.6, 60 sec: 5781.4, 300 sec: 5784.1). Total num frames: 223351808. Throughput: 0: 6057.8. Samples: 223355722. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:01,642][25689] Avg episode reward: [(0, '-49.271')] [2022-07-09 10:57:02,734][26022] Updated weights on worker 0-0, policy_version 218121 (0.00086) [2022-07-09 10:57:04,306][26022] Updated weights on worker 0-0, policy_version 218131 (0.00090) [2022-07-09 10:57:06,171][26022] Updated weights on worker 0-0, policy_version 218141 (0.00086) [2022-07-09 10:57:06,650][25689] Fps is (10 sec: 5612.4, 60 sec: 5798.1, 300 sec: 5772.5). Total num frames: 223378432. Throughput: 0: 5982.7. Samples: 223388792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:06,651][25689] Avg episode reward: [(0, '-49.230')] [2022-07-09 10:57:07,885][26022] Updated weights on worker 0-0, policy_version 218151 (0.00087) [2022-07-09 10:57:09,759][26022] Updated weights on worker 0-0, policy_version 218161 (0.00083) [2022-07-09 10:57:11,459][26022] Updated weights on worker 0-0, policy_version 218171 (0.00109) [2022-07-09 10:57:11,714][25689] Fps is (10 sec: 5591.3, 60 sec: 5762.5, 300 sec: 5778.3). Total num frames: 223408128. Throughput: 0: 6009.1. Samples: 223406314. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:11,715][25689] Avg episode reward: [(0, '-49.215')] [2022-07-09 10:57:13,237][26022] Updated weights on worker 0-0, policy_version 218181 (0.00093) [2022-07-09 10:57:14,838][26022] Updated weights on worker 0-0, policy_version 218191 (0.00084) [2022-07-09 10:57:16,751][25689] Fps is (10 sec: 5778.7, 60 sec: 5793.1, 300 sec: 5775.6). Total num frames: 223436800. Throughput: 0: 5982.8. Samples: 223441014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:16,751][25689] Avg episode reward: [(0, '-49.859')] [2022-07-09 10:57:16,839][26022] Updated weights on worker 0-0, policy_version 218201 (0.00086) [2022-07-09 10:57:18,392][26022] Updated weights on worker 0-0, policy_version 218211 (0.00089) [2022-07-09 10:57:20,423][26022] Updated weights on worker 0-0, policy_version 218221 (0.00059) [2022-07-09 10:57:21,791][25689] Fps is (10 sec: 5894.1, 60 sec: 5808.1, 300 sec: 5778.4). Total num frames: 223467520. Throughput: 0: 5962.4. Samples: 223475860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:21,791][25689] Avg episode reward: [(0, '-49.501')] [2022-07-09 10:57:22,125][26022] Updated weights on worker 0-0, policy_version 218231 (0.00082) [2022-07-09 10:57:23,835][26022] Updated weights on worker 0-0, policy_version 218241 (0.00086) [2022-07-09 10:57:25,475][26022] Updated weights on worker 0-0, policy_version 218251 (0.00083) [2022-07-09 10:57:26,867][25689] Fps is (10 sec: 5972.3, 60 sec: 5768.9, 300 sec: 5784.7). Total num frames: 223497216. Throughput: 0: 5184.1. Samples: 223493598. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:26,868][25689] Avg episode reward: [(0, '-50.580')] [2022-07-09 10:57:27,223][26022] Updated weights on worker 0-0, policy_version 218261 (0.00088) [2022-07-09 10:57:29,024][26022] Updated weights on worker 0-0, policy_version 218271 (0.00078) [2022-07-09 10:57:31,009][26022] Updated weights on worker 0-0, policy_version 218281 (0.00091) [2022-07-09 10:57:31,921][25689] Fps is (10 sec: 5761.9, 60 sec: 5803.2, 300 sec: 5774.6). Total num frames: 223525888. Throughput: 0: 6044.5. Samples: 223528452. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:31,921][25689] Avg episode reward: [(0, '-51.528')] [2022-07-09 10:57:32,486][26022] Updated weights on worker 0-0, policy_version 218291 (0.00086) [2022-07-09 10:57:34,467][26022] Updated weights on worker 0-0, policy_version 218301 (0.00083) [2022-07-09 10:57:35,998][26022] Updated weights on worker 0-0, policy_version 218311 (0.00081) [2022-07-09 10:57:36,983][25689] Fps is (10 sec: 5668.8, 60 sec: 5781.1, 300 sec: 5778.1). Total num frames: 223554560. Throughput: 0: 6042.6. Samples: 223563266. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:36,983][25689] Avg episode reward: [(0, '-51.527')] [2022-07-09 10:57:37,905][26022] Updated weights on worker 0-0, policy_version 218321 (0.00089) [2022-07-09 10:57:39,508][26022] Updated weights on worker 0-0, policy_version 218331 (0.00082) [2022-07-09 10:57:41,395][26022] Updated weights on worker 0-0, policy_version 218341 (0.00087) [2022-07-09 10:57:41,998][25689] Fps is (10 sec: 5690.5, 60 sec: 5797.7, 300 sec: 5775.0). Total num frames: 223583232. Throughput: 0: 5183.1. Samples: 223580596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:42,000][25689] Avg episode reward: [(0, '-50.911')] [2022-07-09 10:57:43,224][26022] Updated weights on worker 0-0, policy_version 218351 (0.00083) [2022-07-09 10:57:45,038][26022] Updated weights on worker 0-0, policy_version 218361 (0.00083) [2022-07-09 10:57:46,822][26022] Updated weights on worker 0-0, policy_version 218371 (0.00086) [2022-07-09 10:57:47,003][25689] Fps is (10 sec: 5824.7, 60 sec: 5770.4, 300 sec: 5773.1). Total num frames: 223612928. Throughput: 0: 6052.6. Samples: 223615476. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:47,004][25689] Avg episode reward: [(0, '-50.159')] [2022-07-09 10:57:48,534][26022] Updated weights on worker 0-0, policy_version 218381 (0.00086) [2022-07-09 10:57:50,183][26022] Updated weights on worker 0-0, policy_version 218391 (0.00086) [2022-07-09 10:57:51,886][26022] Updated weights on worker 0-0, policy_version 218401 (0.00089) [2022-07-09 10:57:52,056][25689] Fps is (10 sec: 5905.2, 60 sec: 5793.5, 300 sec: 5773.0). Total num frames: 223642624. Throughput: 0: 6075.0. Samples: 223650772. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:52,056][25689] Avg episode reward: [(0, '-50.820')] [2022-07-09 10:57:53,678][26022] Updated weights on worker 0-0, policy_version 218411 (0.00087) [2022-07-09 10:57:55,439][26022] Updated weights on worker 0-0, policy_version 218421 (0.00086) [2022-07-09 10:57:57,087][25689] Fps is (10 sec: 5890.1, 60 sec: 5791.3, 300 sec: 5779.8). Total num frames: 223672320. Throughput: 0: 5235.9. Samples: 223668528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:57:57,087][25689] Avg episode reward: [(0, '-50.906')] [2022-07-09 10:57:57,247][26022] Updated weights on worker 0-0, policy_version 218431 (0.00086) [2022-07-09 10:57:59,066][26022] Updated weights on worker 0-0, policy_version 218441 (0.00082) [2022-07-09 10:58:00,725][26022] Updated weights on worker 0-0, policy_version 218451 (0.00087) [2022-07-09 10:58:02,128][25689] Fps is (10 sec: 5693.4, 60 sec: 5756.0, 300 sec: 5775.6). Total num frames: 223699968. Throughput: 0: 6101.2. Samples: 223703410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:58:02,128][25689] Avg episode reward: [(0, '-50.410')] [2022-07-09 10:58:02,821][26022] Updated weights on worker 0-0, policy_version 218461 (0.00088) [2022-07-09 10:58:04,590][26022] Updated weights on worker 0-0, policy_version 218471 (0.00088) [2022-07-09 10:58:06,359][26022] Updated weights on worker 0-0, policy_version 218481 (0.00095) [2022-07-09 10:58:07,151][25689] Fps is (10 sec: 5596.3, 60 sec: 5788.5, 300 sec: 5781.0). Total num frames: 223728640. Throughput: 0: 6008.0. Samples: 223736520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:58:07,151][25689] Avg episode reward: [(0, '-50.456')] [2022-07-09 10:58:07,980][26022] Updated weights on worker 0-0, policy_version 218491 (0.00089) [2022-07-09 10:58:09,822][26022] Updated weights on worker 0-0, policy_version 218501 (0.00079) [2022-07-09 10:58:11,571][26022] Updated weights on worker 0-0, policy_version 218511 (0.00084) [2022-07-09 10:58:12,221][25689] Fps is (10 sec: 5782.7, 60 sec: 5787.9, 300 sec: 5779.8). Total num frames: 223758336. Throughput: 0: 5136.1. Samples: 223754344. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:58:12,222][25689] Avg episode reward: [(0, '-50.364')] [2022-07-09 10:58:13,281][26022] Updated weights on worker 0-0, policy_version 218521 (0.00097) [2022-07-09 10:58:14,979][26022] Updated weights on worker 0-0, policy_version 218531 (0.00086) [2022-07-09 10:58:16,966][26022] Updated weights on worker 0-0, policy_version 218541 (0.00087) [2022-07-09 10:58:17,237][25689] Fps is (10 sec: 5888.2, 60 sec: 5806.8, 300 sec: 5780.1). Total num frames: 223788032. Throughput: 0: 5986.6. Samples: 223789162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:58:17,238][25689] Avg episode reward: [(0, '-50.866')] [2022-07-09 10:58:18,705][26022] Updated weights on worker 0-0, policy_version 218551 (0.00084) [2022-07-09 10:58:20,435][26022] Updated weights on worker 0-0, policy_version 218561 (0.00081) [2022-07-09 10:58:21,962][26022] Updated weights on worker 0-0, policy_version 218571 (0.00086) [2022-07-09 10:58:22,269][25689] Fps is (10 sec: 5910.8, 60 sec: 5790.6, 300 sec: 5780.1). Total num frames: 223817728. Throughput: 0: 5995.2. Samples: 223824164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 10:58:22,270][25689] Avg episode reward: [(0, '-50.682')] [2022-07-09 10:58:24,037][26022] Updated weights on worker 0-0, policy_version 218581 (0.00091) [2022-07-09 10:58:25,582][26022] Updated weights on worker 0-0, policy_version 218591 (0.00084) [2022-07-09 10:58:27,273][25689] Fps is (10 sec: 5714.1, 60 sec: 5763.7, 300 sec: 5781.9). Total num frames: 223845376. Throughput: 0: 5227.4. Samples: 223841708. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:27,273][25689] Avg episode reward: [(0, '-50.444')] [2022-07-09 10:58:27,511][26022] Updated weights on worker 0-0, policy_version 218601 (0.00081) [2022-07-09 10:58:29,311][26022] Updated weights on worker 0-0, policy_version 218611 (0.00083) [2022-07-09 10:58:30,893][26022] Updated weights on worker 0-0, policy_version 218621 (0.00091) [2022-07-09 10:58:32,380][25689] Fps is (10 sec: 5671.4, 60 sec: 5775.5, 300 sec: 5776.9). Total num frames: 223875072. Throughput: 0: 6034.6. Samples: 223875996. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:32,381][25689] Avg episode reward: [(0, '-50.587')] [2022-07-09 10:58:32,951][26022] Updated weights on worker 0-0, policy_version 218631 (0.00082) [2022-07-09 10:58:34,487][26022] Updated weights on worker 0-0, policy_version 218641 (0.00084) [2022-07-09 10:58:36,403][26022] Updated weights on worker 0-0, policy_version 218651 (0.00086) [2022-07-09 10:58:37,454][25689] Fps is (10 sec: 5833.5, 60 sec: 5791.3, 300 sec: 5776.0). Total num frames: 223904768. Throughput: 0: 6010.8. Samples: 223910680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:37,455][25689] Avg episode reward: [(0, '-50.369')] [2022-07-09 10:58:38,074][26022] Updated weights on worker 0-0, policy_version 218661 (0.00086) [2022-07-09 10:58:39,867][26022] Updated weights on worker 0-0, policy_version 218671 (0.00083) [2022-07-09 10:58:41,708][26022] Updated weights on worker 0-0, policy_version 218681 (0.00085) [2022-07-09 10:58:42,503][25689] Fps is (10 sec: 5766.0, 60 sec: 5788.1, 300 sec: 5778.7). Total num frames: 223933440. Throughput: 0: 5148.1. Samples: 223928330. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:42,504][25689] Avg episode reward: [(0, '-50.043')] [2022-07-09 10:58:43,365][26022] Updated weights on worker 0-0, policy_version 218691 (0.00084) [2022-07-09 10:58:45,154][26022] Updated weights on worker 0-0, policy_version 218701 (0.00092) [2022-07-09 10:58:47,029][26022] Updated weights on worker 0-0, policy_version 218711 (0.00094) [2022-07-09 10:58:47,599][25689] Fps is (10 sec: 5652.5, 60 sec: 5762.5, 300 sec: 5771.6). Total num frames: 223962112. Throughput: 0: 5950.8. Samples: 223962664. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:47,599][25689] Avg episode reward: [(0, '-49.786')] [2022-07-09 10:58:48,765][26022] Updated weights on worker 0-0, policy_version 218721 (0.00085) [2022-07-09 10:58:49,753][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 10:58:49,768][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000218726_223975424.pth [2022-07-09 10:58:49,769][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000216695_221895680.pth [2022-07-09 10:58:50,563][26022] Updated weights on worker 0-0, policy_version 218731 (0.00094) [2022-07-09 10:58:52,343][26022] Updated weights on worker 0-0, policy_version 218741 (0.00092) [2022-07-09 10:58:52,638][25689] Fps is (10 sec: 5658.4, 60 sec: 5746.9, 300 sec: 5775.4). Total num frames: 223990784. Throughput: 0: 5966.9. Samples: 223996868. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:52,638][25689] Avg episode reward: [(0, '-49.431')] [2022-07-09 10:58:54,183][26022] Updated weights on worker 0-0, policy_version 218751 (0.00085) [2022-07-09 10:58:56,048][26022] Updated weights on worker 0-0, policy_version 218761 (0.00088) [2022-07-09 10:58:57,501][26022] Updated weights on worker 0-0, policy_version 218771 (0.00086) [2022-07-09 10:58:57,639][25689] Fps is (10 sec: 5915.6, 60 sec: 5766.7, 300 sec: 5775.5). Total num frames: 224021504. Throughput: 0: 5130.0. Samples: 224014228. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:58:57,639][25689] Avg episode reward: [(0, '-50.463')] [2022-07-09 10:58:59,562][26022] Updated weights on worker 0-0, policy_version 218781 (0.00091) [2022-07-09 10:59:01,368][26022] Updated weights on worker 0-0, policy_version 218791 (0.00092) [2022-07-09 10:59:02,652][25689] Fps is (10 sec: 5521.9, 60 sec: 5718.6, 300 sec: 5772.0). Total num frames: 224046080. Throughput: 0: 5973.1. Samples: 224048678. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:02,652][25689] Avg episode reward: [(0, '-49.572')] [2022-07-09 10:59:03,315][26022] Updated weights on worker 0-0, policy_version 218801 (0.00080) [2022-07-09 10:59:05,384][26022] Updated weights on worker 0-0, policy_version 218811 (0.00090) [2022-07-09 10:59:06,877][26022] Updated weights on worker 0-0, policy_version 218821 (0.00095) [2022-07-09 10:59:07,724][25689] Fps is (10 sec: 5381.1, 60 sec: 5730.8, 300 sec: 5772.0). Total num frames: 224075776. Throughput: 0: 5881.6. Samples: 224081034. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:07,725][25689] Avg episode reward: [(0, '-49.647')] [2022-07-09 10:59:08,832][26022] Updated weights on worker 0-0, policy_version 218831 (0.00091) [2022-07-09 10:59:10,597][26022] Updated weights on worker 0-0, policy_version 218841 (0.00087) [2022-07-09 10:59:12,302][26022] Updated weights on worker 0-0, policy_version 218851 (0.00604) [2022-07-09 10:59:12,825][25689] Fps is (10 sec: 5838.0, 60 sec: 5728.0, 300 sec: 5770.5). Total num frames: 224105472. Throughput: 0: 5014.4. Samples: 224098096. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:12,826][25689] Avg episode reward: [(0, '-48.458')] [2022-07-09 10:59:14,236][26022] Updated weights on worker 0-0, policy_version 218861 (0.00089) [2022-07-09 10:59:16,120][26022] Updated weights on worker 0-0, policy_version 218871 (0.00090) [2022-07-09 10:59:17,867][25689] Fps is (10 sec: 5653.9, 60 sec: 5691.7, 300 sec: 5766.6). Total num frames: 224133120. Throughput: 0: 5834.1. Samples: 224132242. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:17,867][25689] Avg episode reward: [(0, '-49.142')] [2022-07-09 10:59:17,893][26022] Updated weights on worker 0-0, policy_version 218881 (0.00093) [2022-07-09 10:59:19,851][26022] Updated weights on worker 0-0, policy_version 218891 (0.00092) [2022-07-09 10:59:21,430][26022] Updated weights on worker 0-0, policy_version 218901 (0.00103) [2022-07-09 10:59:22,900][25689] Fps is (10 sec: 5691.5, 60 sec: 5691.6, 300 sec: 5766.9). Total num frames: 224162816. Throughput: 0: 5814.2. Samples: 224166408. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:22,901][25689] Avg episode reward: [(0, '-47.982')] [2022-07-09 10:59:23,318][26022] Updated weights on worker 0-0, policy_version 218911 (0.00095) [2022-07-09 10:59:24,995][26022] Updated weights on worker 0-0, policy_version 218921 (0.00081) [2022-07-09 10:59:26,992][26022] Updated weights on worker 0-0, policy_version 218931 (0.00089) [2022-07-09 10:59:27,912][25689] Fps is (10 sec: 5810.4, 60 sec: 5707.7, 300 sec: 5768.1). Total num frames: 224191488. Throughput: 0: 5926.1. Samples: 224200670. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:27,913][25689] Avg episode reward: [(0, '-48.966')] [2022-07-09 10:59:28,732][26022] Updated weights on worker 0-0, policy_version 218941 (0.00086) [2022-07-09 10:59:30,543][26022] Updated weights on worker 0-0, policy_version 218951 (0.00088) [2022-07-09 10:59:32,197][26022] Updated weights on worker 0-0, policy_version 218961 (0.00080) [2022-07-09 10:59:32,991][25689] Fps is (10 sec: 5581.3, 60 sec: 5676.6, 300 sec: 5760.0). Total num frames: 224219136. Throughput: 0: 5942.0. Samples: 224217924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:32,992][25689] Avg episode reward: [(0, '-49.763')] [2022-07-09 10:59:34,002][26022] Updated weights on worker 0-0, policy_version 218971 (0.00105) [2022-07-09 10:59:35,972][26022] Updated weights on worker 0-0, policy_version 218981 (0.00085) [2022-07-09 10:59:37,539][26022] Updated weights on worker 0-0, policy_version 218991 (0.00095) [2022-07-09 10:59:38,018][25689] Fps is (10 sec: 5674.4, 60 sec: 5681.0, 300 sec: 5759.7). Total num frames: 224248832. Throughput: 0: 5955.5. Samples: 224252252. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:38,019][25689] Avg episode reward: [(0, '-48.956')] [2022-07-09 10:59:39,607][26022] Updated weights on worker 0-0, policy_version 219001 (0.00081) [2022-07-09 10:59:41,155][26022] Updated weights on worker 0-0, policy_version 219011 (0.00081) [2022-07-09 10:59:43,012][26022] Updated weights on worker 0-0, policy_version 219021 (0.00082) [2022-07-09 10:59:43,032][25689] Fps is (10 sec: 5813.1, 60 sec: 5684.3, 300 sec: 5756.2). Total num frames: 224277504. Throughput: 0: 5961.7. Samples: 224286426. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:43,033][25689] Avg episode reward: [(0, '-49.741')] [2022-07-09 10:59:44,966][26022] Updated weights on worker 0-0, policy_version 219031 (0.00082) [2022-07-09 10:59:46,521][26022] Updated weights on worker 0-0, policy_version 219041 (0.00092) [2022-07-09 10:59:48,061][25689] Fps is (10 sec: 5506.0, 60 sec: 5656.7, 300 sec: 5746.5). Total num frames: 224304128. Throughput: 0: 5097.4. Samples: 224303374. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:48,061][25689] Avg episode reward: [(0, '-49.715')] [2022-07-09 10:59:48,563][26022] Updated weights on worker 0-0, policy_version 219051 (0.00082) [2022-07-09 10:59:50,127][26022] Updated weights on worker 0-0, policy_version 219061 (0.00089) [2022-07-09 10:59:51,927][26022] Updated weights on worker 0-0, policy_version 219071 (0.00080) [2022-07-09 10:59:53,158][25689] Fps is (10 sec: 5663.0, 60 sec: 5685.1, 300 sec: 5748.4). Total num frames: 224334848. Throughput: 0: 5941.0. Samples: 224337736. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:53,159][25689] Avg episode reward: [(0, '-49.080')] [2022-07-09 10:59:53,909][26022] Updated weights on worker 0-0, policy_version 219081 (0.00086) [2022-07-09 10:59:55,451][26022] Updated weights on worker 0-0, policy_version 219091 (0.00092) [2022-07-09 10:59:57,100][26022] Updated weights on worker 0-0, policy_version 219101 (0.00081) [2022-07-09 10:59:58,179][25689] Fps is (10 sec: 5970.9, 60 sec: 5666.3, 300 sec: 5754.9). Total num frames: 224364544. Throughput: 0: 6000.3. Samples: 224373226. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 10:59:58,180][25689] Avg episode reward: [(0, '-49.280')] [2022-07-09 10:59:58,913][26022] Updated weights on worker 0-0, policy_version 219111 (0.00086) [2022-07-09 11:00:00,620][26022] Updated weights on worker 0-0, policy_version 219121 (0.00086) [2022-07-09 11:00:02,825][26022] Updated weights on worker 0-0, policy_version 219131 (0.00086) [2022-07-09 11:00:03,199][25689] Fps is (10 sec: 5813.2, 60 sec: 5733.3, 300 sec: 5761.9). Total num frames: 224393216. Throughput: 0: 5175.9. Samples: 224390806. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:03,199][25689] Avg episode reward: [(0, '-49.705')] [2022-07-09 11:00:04,645][26022] Updated weights on worker 0-0, policy_version 219141 (0.00368) [2022-07-09 11:00:06,194][26022] Updated weights on worker 0-0, policy_version 219151 (0.00084) [2022-07-09 11:00:08,172][26022] Updated weights on worker 0-0, policy_version 219161 (0.00092) [2022-07-09 11:00:08,237][25689] Fps is (10 sec: 5599.5, 60 sec: 5702.7, 300 sec: 5755.1). Total num frames: 224420864. Throughput: 0: 5974.6. Samples: 224423920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:08,238][25689] Avg episode reward: [(0, '-49.583')] [2022-07-09 11:00:09,696][26022] Updated weights on worker 0-0, policy_version 219171 (0.00079) [2022-07-09 11:00:11,527][26022] Updated weights on worker 0-0, policy_version 219181 (0.00082) [2022-07-09 11:00:13,206][26022] Updated weights on worker 0-0, policy_version 219191 (0.00084) [2022-07-09 11:00:13,365][25689] Fps is (10 sec: 5741.3, 60 sec: 5717.1, 300 sec: 5759.7). Total num frames: 224451584. Throughput: 0: 6006.8. Samples: 224459114. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:13,365][25689] Avg episode reward: [(0, '-49.496')] [2022-07-09 11:00:14,978][26022] Updated weights on worker 0-0, policy_version 219201 (0.00089) [2022-07-09 11:00:16,937][26022] Updated weights on worker 0-0, policy_version 219211 (0.00080) [2022-07-09 11:00:18,377][25689] Fps is (10 sec: 5756.6, 60 sec: 5719.9, 300 sec: 5753.1). Total num frames: 224479232. Throughput: 0: 5097.8. Samples: 224476188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:18,377][25689] Avg episode reward: [(0, '-49.821')] [2022-07-09 11:00:18,734][26022] Updated weights on worker 0-0, policy_version 219221 (0.00091) [2022-07-09 11:00:20,329][26022] Updated weights on worker 0-0, policy_version 219231 (0.00084) [2022-07-09 11:00:22,395][26022] Updated weights on worker 0-0, policy_version 219241 (0.00086) [2022-07-09 11:00:23,386][25689] Fps is (10 sec: 5722.2, 60 sec: 5722.2, 300 sec: 5756.6). Total num frames: 224508928. Throughput: 0: 5950.0. Samples: 224510922. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:23,387][25689] Avg episode reward: [(0, '-50.011')] [2022-07-09 11:00:23,928][26022] Updated weights on worker 0-0, policy_version 219251 (0.00095) [2022-07-09 11:00:25,731][26022] Updated weights on worker 0-0, policy_version 219261 (0.00088) [2022-07-09 11:00:27,439][26022] Updated weights on worker 0-0, policy_version 219271 (0.00088) [2022-07-09 11:00:28,392][25689] Fps is (10 sec: 5828.0, 60 sec: 5722.8, 300 sec: 5754.2). Total num frames: 224537600. Throughput: 0: 6037.4. Samples: 224545600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:28,394][25689] Avg episode reward: [(0, '-49.671')] [2022-07-09 11:00:29,262][26022] Updated weights on worker 0-0, policy_version 219281 (0.00084) [2022-07-09 11:00:31,057][26022] Updated weights on worker 0-0, policy_version 219291 (0.00092) [2022-07-09 11:00:32,781][26022] Updated weights on worker 0-0, policy_version 219301 (0.00085) [2022-07-09 11:00:33,504][25689] Fps is (10 sec: 5768.7, 60 sec: 5753.5, 300 sec: 5752.2). Total num frames: 224567296. Throughput: 0: 5168.2. Samples: 224563198. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:33,505][25689] Avg episode reward: [(0, '-49.992')] [2022-07-09 11:00:34,559][26022] Updated weights on worker 0-0, policy_version 219311 (0.00089) [2022-07-09 11:00:36,279][26022] Updated weights on worker 0-0, policy_version 219321 (0.00087) [2022-07-09 11:00:38,039][26022] Updated weights on worker 0-0, policy_version 219331 (0.00082) [2022-07-09 11:00:38,509][25689] Fps is (10 sec: 5668.0, 60 sec: 5721.7, 300 sec: 5748.9). Total num frames: 224594944. Throughput: 0: 6061.3. Samples: 224598216. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 11:00:38,509][25689] Avg episode reward: [(0, '-49.454')] [2022-07-09 11:00:39,783][26022] Updated weights on worker 0-0, policy_version 219341 (0.00085) [2022-07-09 11:00:41,892][26022] Updated weights on worker 0-0, policy_version 219351 (0.00094) [2022-07-09 11:00:43,286][26022] Updated weights on worker 0-0, policy_version 219361 (0.00088) [2022-07-09 11:00:43,528][25689] Fps is (10 sec: 5924.8, 60 sec: 5772.0, 300 sec: 5759.5). Total num frames: 224626688. Throughput: 0: 6075.4. Samples: 224633294. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:00:43,529][25689] Avg episode reward: [(0, '-49.515')] [2022-07-09 11:00:45,172][26022] Updated weights on worker 0-0, policy_version 219371 (0.00087) [2022-07-09 11:00:46,847][26022] Updated weights on worker 0-0, policy_version 219381 (0.00088) [2022-07-09 11:00:48,535][25689] Fps is (10 sec: 6026.2, 60 sec: 5808.0, 300 sec: 5753.6). Total num frames: 224655360. Throughput: 0: 5227.1. Samples: 224650888. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:00:48,535][25689] Avg episode reward: [(0, '-49.514')] [2022-07-09 11:00:48,743][26022] Updated weights on worker 0-0, policy_version 219391 (0.00085) [2022-07-09 11:00:50,049][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:00:50,063][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000219398_224663552.pth [2022-07-09 11:00:50,064][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000217372_222588928.pth [2022-07-09 11:00:50,536][26022] Updated weights on worker 0-0, policy_version 219401 (0.00085) [2022-07-09 11:00:52,151][26022] Updated weights on worker 0-0, policy_version 219411 (0.00098) [2022-07-09 11:00:53,584][25689] Fps is (10 sec: 5702.9, 60 sec: 5778.7, 300 sec: 5750.7). Total num frames: 224684032. Throughput: 0: 6098.4. Samples: 224685652. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:00:53,584][25689] Avg episode reward: [(0, '-48.884')] [2022-07-09 11:00:53,989][26022] Updated weights on worker 0-0, policy_version 219421 (0.00086) [2022-07-09 11:00:55,694][26022] Updated weights on worker 0-0, policy_version 219431 (0.00085) [2022-07-09 11:00:57,437][26022] Updated weights on worker 0-0, policy_version 219441 (0.00085) [2022-07-09 11:00:58,593][25689] Fps is (10 sec: 5904.5, 60 sec: 5796.8, 300 sec: 5758.0). Total num frames: 224714752. Throughput: 0: 6102.7. Samples: 224720784. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:00:58,594][25689] Avg episode reward: [(0, '-48.378')] [2022-07-09 11:00:59,146][26022] Updated weights on worker 0-0, policy_version 219451 (0.00086) [2022-07-09 11:01:00,952][26022] Updated weights on worker 0-0, policy_version 219461 (0.00084) [2022-07-09 11:01:03,119][26022] Updated weights on worker 0-0, policy_version 219471 (0.00098) [2022-07-09 11:01:03,638][25689] Fps is (10 sec: 5601.7, 60 sec: 5743.6, 300 sec: 5757.2). Total num frames: 224740352. Throughput: 0: 5218.5. Samples: 224738236. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:03,639][25689] Avg episode reward: [(0, '-49.123')] [2022-07-09 11:01:04,931][26022] Updated weights on worker 0-0, policy_version 219481 (0.00089) [2022-07-09 11:01:06,621][26022] Updated weights on worker 0-0, policy_version 219491 (0.00087) [2022-07-09 11:01:08,399][26022] Updated weights on worker 0-0, policy_version 219501 (0.00082) [2022-07-09 11:01:08,639][25689] Fps is (10 sec: 5606.2, 60 sec: 5798.0, 300 sec: 5754.6). Total num frames: 224771072. Throughput: 0: 5993.5. Samples: 224771386. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:08,640][25689] Avg episode reward: [(0, '-49.489')] [2022-07-09 11:01:10,123][26022] Updated weights on worker 0-0, policy_version 219511 (0.00101) [2022-07-09 11:01:11,920][26022] Updated weights on worker 0-0, policy_version 219521 (0.00088) [2022-07-09 11:01:13,675][26022] Updated weights on worker 0-0, policy_version 219531 (0.00081) [2022-07-09 11:01:13,705][25689] Fps is (10 sec: 5899.8, 60 sec: 5770.0, 300 sec: 5760.3). Total num frames: 224799744. Throughput: 0: 6003.2. Samples: 224806442. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:13,714][25689] Avg episode reward: [(0, '-49.345')] [2022-07-09 11:01:15,340][26022] Updated weights on worker 0-0, policy_version 219541 (0.00084) [2022-07-09 11:01:17,176][26022] Updated weights on worker 0-0, policy_version 219551 (0.00086) [2022-07-09 11:01:18,759][25689] Fps is (10 sec: 5869.2, 60 sec: 5816.8, 300 sec: 5763.1). Total num frames: 224830464. Throughput: 0: 5122.0. Samples: 224824072. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:18,759][25689] Avg episode reward: [(0, '-49.392')] [2022-07-09 11:01:18,762][26022] Updated weights on worker 0-0, policy_version 219561 (0.00086) [2022-07-09 11:01:20,828][26022] Updated weights on worker 0-0, policy_version 219571 (0.00090) [2022-07-09 11:01:22,335][26022] Updated weights on worker 0-0, policy_version 219581 (0.00108) [2022-07-09 11:01:23,803][25689] Fps is (10 sec: 5780.0, 60 sec: 5779.6, 300 sec: 5748.8). Total num frames: 224858112. Throughput: 0: 5985.0. Samples: 224858924. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:23,804][25689] Avg episode reward: [(0, '-50.151')] [2022-07-09 11:01:24,240][26022] Updated weights on worker 0-0, policy_version 219591 (0.00085) [2022-07-09 11:01:25,924][26022] Updated weights on worker 0-0, policy_version 219601 (0.00094) [2022-07-09 11:01:27,713][26022] Updated weights on worker 0-0, policy_version 219611 (0.00083) [2022-07-09 11:01:28,852][25689] Fps is (10 sec: 5580.0, 60 sec: 5775.5, 300 sec: 5755.9). Total num frames: 224886784. Throughput: 0: 6067.0. Samples: 224894016. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:28,853][25689] Avg episode reward: [(0, '-50.120')] [2022-07-09 11:01:29,327][26022] Updated weights on worker 0-0, policy_version 219621 (0.00086) [2022-07-09 11:01:31,193][26022] Updated weights on worker 0-0, policy_version 219631 (0.00086) [2022-07-09 11:01:32,822][26022] Updated weights on worker 0-0, policy_version 219641 (0.00090) [2022-07-09 11:01:33,907][25689] Fps is (10 sec: 5979.4, 60 sec: 5814.8, 300 sec: 5761.9). Total num frames: 224918528. Throughput: 0: 5204.3. Samples: 224911580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:33,909][25689] Avg episode reward: [(0, '-49.552')] [2022-07-09 11:01:34,737][26022] Updated weights on worker 0-0, policy_version 219651 (0.00090) [2022-07-09 11:01:36,366][26022] Updated weights on worker 0-0, policy_version 219661 (0.00092) [2022-07-09 11:01:38,250][26022] Updated weights on worker 0-0, policy_version 219671 (0.00094) [2022-07-09 11:01:38,927][25689] Fps is (10 sec: 5894.7, 60 sec: 5813.3, 300 sec: 5761.7). Total num frames: 224946176. Throughput: 0: 6084.6. Samples: 224946790. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:38,928][25689] Avg episode reward: [(0, '-49.655')] [2022-07-09 11:01:39,924][26022] Updated weights on worker 0-0, policy_version 219681 (0.00081) [2022-07-09 11:01:41,780][26022] Updated weights on worker 0-0, policy_version 219691 (0.00088) [2022-07-09 11:01:43,323][26022] Updated weights on worker 0-0, policy_version 219701 (0.00084) [2022-07-09 11:01:43,929][25689] Fps is (10 sec: 5824.1, 60 sec: 5798.1, 300 sec: 5759.7). Total num frames: 224976896. Throughput: 0: 6099.6. Samples: 224981684. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:43,931][25689] Avg episode reward: [(0, '-50.057')] [2022-07-09 11:01:45,406][26022] Updated weights on worker 0-0, policy_version 219711 (0.00087) [2022-07-09 11:01:46,871][26022] Updated weights on worker 0-0, policy_version 219721 (0.00093) [2022-07-09 11:01:48,812][26022] Updated weights on worker 0-0, policy_version 219731 (0.00088) [2022-07-09 11:01:48,935][25689] Fps is (10 sec: 5832.4, 60 sec: 5781.2, 300 sec: 5758.4). Total num frames: 225004544. Throughput: 0: 5237.7. Samples: 224999204. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:48,935][25689] Avg episode reward: [(0, '-49.778')] [2022-07-09 11:01:50,681][26022] Updated weights on worker 0-0, policy_version 219741 (0.00096) [2022-07-09 11:01:52,235][26022] Updated weights on worker 0-0, policy_version 219751 (0.00079) [2022-07-09 11:01:54,010][25689] Fps is (10 sec: 5586.5, 60 sec: 5778.7, 300 sec: 5753.6). Total num frames: 225033216. Throughput: 0: 6091.8. Samples: 225034046. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:54,011][25689] Avg episode reward: [(0, '-48.820')] [2022-07-09 11:01:54,201][26022] Updated weights on worker 0-0, policy_version 219761 (0.00086) [2022-07-09 11:01:55,643][26022] Updated weights on worker 0-0, policy_version 219771 (0.00096) [2022-07-09 11:01:57,634][26022] Updated weights on worker 0-0, policy_version 219781 (0.01336) [2022-07-09 11:01:59,026][25689] Fps is (10 sec: 5885.5, 60 sec: 5778.1, 300 sec: 5757.3). Total num frames: 225063936. Throughput: 0: 6082.6. Samples: 225069042. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:01:59,027][25689] Avg episode reward: [(0, '-49.786')] [2022-07-09 11:01:59,447][26022] Updated weights on worker 0-0, policy_version 219791 (0.00087) [2022-07-09 11:02:01,079][26022] Updated weights on worker 0-0, policy_version 219801 (0.00088) [2022-07-09 11:02:03,265][26022] Updated weights on worker 0-0, policy_version 219811 (0.00082) [2022-07-09 11:02:04,064][25689] Fps is (10 sec: 5703.7, 60 sec: 5795.6, 300 sec: 5756.7). Total num frames: 225090560. Throughput: 0: 5205.5. Samples: 225086498. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:04,065][25689] Avg episode reward: [(0, '-49.600')] [2022-07-09 11:02:04,782][26022] Updated weights on worker 0-0, policy_version 219821 (0.00085) [2022-07-09 11:02:06,939][26022] Updated weights on worker 0-0, policy_version 219831 (0.00088) [2022-07-09 11:02:08,525][26022] Updated weights on worker 0-0, policy_version 219841 (0.00093) [2022-07-09 11:02:09,098][25689] Fps is (10 sec: 5591.6, 60 sec: 5775.5, 300 sec: 5757.3). Total num frames: 225120256. Throughput: 0: 5963.2. Samples: 225119444. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:09,100][25689] Avg episode reward: [(0, '-47.782')] [2022-07-09 11:02:10,301][26022] Updated weights on worker 0-0, policy_version 219851 (0.00082) [2022-07-09 11:02:12,045][26022] Updated weights on worker 0-0, policy_version 219861 (0.00089) [2022-07-09 11:02:13,986][26022] Updated weights on worker 0-0, policy_version 219871 (0.00090) [2022-07-09 11:02:14,167][25689] Fps is (10 sec: 5777.5, 60 sec: 5775.3, 300 sec: 5756.7). Total num frames: 225148928. Throughput: 0: 5973.1. Samples: 225154442. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:14,168][25689] Avg episode reward: [(0, '-48.189')] [2022-07-09 11:02:15,567][26022] Updated weights on worker 0-0, policy_version 219881 (0.00095) [2022-07-09 11:02:17,405][26022] Updated weights on worker 0-0, policy_version 219891 (0.00081) [2022-07-09 11:02:19,138][26022] Updated weights on worker 0-0, policy_version 219901 (0.00089) [2022-07-09 11:02:19,233][25689] Fps is (10 sec: 5759.0, 60 sec: 5757.1, 300 sec: 5752.7). Total num frames: 225178624. Throughput: 0: 5082.1. Samples: 225171738. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:19,235][25689] Avg episode reward: [(0, '-47.930')] [2022-07-09 11:02:21,119][26022] Updated weights on worker 0-0, policy_version 219911 (0.00082) [2022-07-09 11:02:22,778][26022] Updated weights on worker 0-0, policy_version 219921 (0.00093) [2022-07-09 11:02:24,258][25689] Fps is (10 sec: 5885.3, 60 sec: 5792.8, 300 sec: 5753.7). Total num frames: 225208320. Throughput: 0: 5924.2. Samples: 225206132. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:24,260][25689] Avg episode reward: [(0, '-47.561')] [2022-07-09 11:02:24,619][26022] Updated weights on worker 0-0, policy_version 219931 (0.00084) [2022-07-09 11:02:26,220][26022] Updated weights on worker 0-0, policy_version 219941 (0.00082) [2022-07-09 11:02:28,092][26022] Updated weights on worker 0-0, policy_version 219951 (0.00091) [2022-07-09 11:02:29,261][25689] Fps is (10 sec: 5821.0, 60 sec: 5797.3, 300 sec: 5754.7). Total num frames: 225236992. Throughput: 0: 6015.0. Samples: 225240720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:29,262][25689] Avg episode reward: [(0, '-47.254')] [2022-07-09 11:02:29,678][26022] Updated weights on worker 0-0, policy_version 219961 (0.00083) [2022-07-09 11:02:31,689][26022] Updated weights on worker 0-0, policy_version 219971 (0.00086) [2022-07-09 11:02:33,616][26022] Updated weights on worker 0-0, policy_version 219981 (0.00086) [2022-07-09 11:02:34,315][25689] Fps is (10 sec: 5600.5, 60 sec: 5729.6, 300 sec: 5751.4). Total num frames: 225264640. Throughput: 0: 5136.8. Samples: 225257938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:34,315][25689] Avg episode reward: [(0, '-47.067')] [2022-07-09 11:02:35,124][26022] Updated weights on worker 0-0, policy_version 219991 (0.00088) [2022-07-09 11:02:37,092][26022] Updated weights on worker 0-0, policy_version 220001 (0.00091) [2022-07-09 11:02:38,558][26022] Updated weights on worker 0-0, policy_version 220011 (0.00081) [2022-07-09 11:02:39,333][25689] Fps is (10 sec: 5693.3, 60 sec: 5763.7, 300 sec: 5754.8). Total num frames: 225294336. Throughput: 0: 6028.2. Samples: 225292902. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:39,333][25689] Avg episode reward: [(0, '-48.790')] [2022-07-09 11:02:40,369][26022] Updated weights on worker 0-0, policy_version 220021 (0.00091) [2022-07-09 11:02:42,324][26022] Updated weights on worker 0-0, policy_version 220031 (0.00060) [2022-07-09 11:02:43,876][26022] Updated weights on worker 0-0, policy_version 220041 (0.00095) [2022-07-09 11:02:44,344][25689] Fps is (10 sec: 6023.9, 60 sec: 5762.8, 300 sec: 5758.1). Total num frames: 225325056. Throughput: 0: 6062.1. Samples: 225327892. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:44,348][25689] Avg episode reward: [(0, '-49.414')] [2022-07-09 11:02:45,939][26022] Updated weights on worker 0-0, policy_version 220051 (0.00091) [2022-07-09 11:02:47,471][26022] Updated weights on worker 0-0, policy_version 220061 (0.00080) [2022-07-09 11:02:49,376][25689] Fps is (10 sec: 5811.5, 60 sec: 5760.3, 300 sec: 5751.6). Total num frames: 225352704. Throughput: 0: 6076.2. Samples: 225362946. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:49,377][25689] Avg episode reward: [(0, '-50.083')] [2022-07-09 11:02:49,384][26022] Updated weights on worker 0-0, policy_version 220071 (0.00090) [2022-07-09 11:02:50,385][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:02:50,401][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000220078_225359872.pth [2022-07-09 11:02:50,402][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000218050_223283200.pth [2022-07-09 11:02:51,132][26022] Updated weights on worker 0-0, policy_version 220081 (0.00083) [2022-07-09 11:02:52,781][26022] Updated weights on worker 0-0, policy_version 220091 (0.00083) [2022-07-09 11:02:54,467][26022] Updated weights on worker 0-0, policy_version 220101 (0.00084) [2022-07-09 11:02:54,561][25689] Fps is (10 sec: 5712.7, 60 sec: 5783.7, 300 sec: 5752.1). Total num frames: 225383424. Throughput: 0: 6044.5. Samples: 225380318. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:54,562][25689] Avg episode reward: [(0, '-49.853')] [2022-07-09 11:02:56,421][26022] Updated weights on worker 0-0, policy_version 220111 (0.00081) [2022-07-09 11:02:58,117][26022] Updated weights on worker 0-0, policy_version 220121 (0.00085) [2022-07-09 11:02:59,564][25689] Fps is (10 sec: 5729.3, 60 sec: 5734.2, 300 sec: 5752.8). Total num frames: 225411072. Throughput: 0: 6046.0. Samples: 225415218. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:02:59,564][25689] Avg episode reward: [(0, '-50.271')] [2022-07-09 11:02:59,886][26022] Updated weights on worker 0-0, policy_version 220131 (0.00085) [2022-07-09 11:03:01,442][26022] Updated weights on worker 0-0, policy_version 220141 (0.00085) [2022-07-09 11:03:03,695][26022] Updated weights on worker 0-0, policy_version 220151 (0.00084) [2022-07-09 11:03:04,648][25689] Fps is (10 sec: 5685.2, 60 sec: 5780.6, 300 sec: 5755.1). Total num frames: 225440768. Throughput: 0: 5924.0. Samples: 225448172. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:04,648][25689] Avg episode reward: [(0, '-50.525')] [2022-07-09 11:03:05,575][26022] Updated weights on worker 0-0, policy_version 220161 (0.00091) [2022-07-09 11:03:07,023][26022] Updated weights on worker 0-0, policy_version 220171 (0.00091) [2022-07-09 11:03:08,901][26022] Updated weights on worker 0-0, policy_version 220181 (0.00088) [2022-07-09 11:03:09,667][25689] Fps is (10 sec: 5777.0, 60 sec: 5765.1, 300 sec: 5752.7). Total num frames: 225469440. Throughput: 0: 5058.9. Samples: 225465596. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:09,668][25689] Avg episode reward: [(0, '-50.684')] [2022-07-09 11:03:10,679][26022] Updated weights on worker 0-0, policy_version 220191 (0.00095) [2022-07-09 11:03:12,515][26022] Updated weights on worker 0-0, policy_version 220201 (0.00083) [2022-07-09 11:03:14,356][26022] Updated weights on worker 0-0, policy_version 220211 (0.00084) [2022-07-09 11:03:14,797][25689] Fps is (10 sec: 5851.6, 60 sec: 5793.0, 300 sec: 5753.9). Total num frames: 225500160. Throughput: 0: 5944.9. Samples: 225500620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:14,799][25689] Avg episode reward: [(0, '-50.950')] [2022-07-09 11:03:15,905][26022] Updated weights on worker 0-0, policy_version 220221 (0.00094) [2022-07-09 11:03:17,663][26022] Updated weights on worker 0-0, policy_version 220231 (0.00089) [2022-07-09 11:03:19,487][26022] Updated weights on worker 0-0, policy_version 220241 (0.00085) [2022-07-09 11:03:19,897][25689] Fps is (10 sec: 5806.0, 60 sec: 5773.0, 300 sec: 5749.1). Total num frames: 225528832. Throughput: 0: 5928.1. Samples: 225535754. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:19,897][25689] Avg episode reward: [(0, '-50.825')] [2022-07-09 11:03:21,214][26022] Updated weights on worker 0-0, policy_version 220251 (0.00084) [2022-07-09 11:03:22,961][26022] Updated weights on worker 0-0, policy_version 220261 (0.00076) [2022-07-09 11:03:24,651][26022] Updated weights on worker 0-0, policy_version 220271 (0.00087) [2022-07-09 11:03:24,943][25689] Fps is (10 sec: 5652.1, 60 sec: 5754.1, 300 sec: 5751.8). Total num frames: 225557504. Throughput: 0: 5160.6. Samples: 225552908. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:24,944][25689] Avg episode reward: [(0, '-50.581')] [2022-07-09 11:03:26,608][26022] Updated weights on worker 0-0, policy_version 220281 (0.00084) [2022-07-09 11:03:28,553][26022] Updated weights on worker 0-0, policy_version 220291 (0.00092) [2022-07-09 11:03:29,972][25689] Fps is (10 sec: 5691.7, 60 sec: 5751.6, 300 sec: 5749.8). Total num frames: 225586176. Throughput: 0: 5989.5. Samples: 225587210. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:29,972][25689] Avg episode reward: [(0, '-50.835')] [2022-07-09 11:03:30,213][26022] Updated weights on worker 0-0, policy_version 220301 (0.00089) [2022-07-09 11:03:32,348][26022] Updated weights on worker 0-0, policy_version 220311 (0.00085) [2022-07-09 11:03:33,454][26022] Updated weights on worker 0-0, policy_version 220321 (0.00079) [2022-07-09 11:03:35,055][25689] Fps is (10 sec: 5772.2, 60 sec: 5782.5, 300 sec: 5749.7). Total num frames: 225615872. Throughput: 0: 5973.7. Samples: 225621634. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:35,055][25689] Avg episode reward: [(0, '-50.347')] [2022-07-09 11:03:35,767][26022] Updated weights on worker 0-0, policy_version 220331 (0.00086) [2022-07-09 11:03:37,434][26022] Updated weights on worker 0-0, policy_version 220341 (0.00087) [2022-07-09 11:03:39,008][26022] Updated weights on worker 0-0, policy_version 220351 (0.00089) [2022-07-09 11:03:40,101][25689] Fps is (10 sec: 5762.6, 60 sec: 5763.1, 300 sec: 5749.7). Total num frames: 225644544. Throughput: 0: 5113.4. Samples: 225639064. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:40,103][25689] Avg episode reward: [(0, '-49.477')] [2022-07-09 11:03:40,892][26022] Updated weights on worker 0-0, policy_version 220361 (0.00092) [2022-07-09 11:03:42,677][26022] Updated weights on worker 0-0, policy_version 220371 (0.00083) [2022-07-09 11:03:44,321][26022] Updated weights on worker 0-0, policy_version 220381 (0.00082) [2022-07-09 11:03:45,112][25689] Fps is (10 sec: 6007.5, 60 sec: 5779.9, 300 sec: 5761.7). Total num frames: 225676288. Throughput: 0: 5987.3. Samples: 225673664. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:45,116][25689] Avg episode reward: [(0, '-48.968')] [2022-07-09 11:03:46,347][26022] Updated weights on worker 0-0, policy_version 220391 (0.00091) [2022-07-09 11:03:47,825][26022] Updated weights on worker 0-0, policy_version 220401 (0.00084) [2022-07-09 11:03:49,906][26022] Updated weights on worker 0-0, policy_version 220411 (0.00093) [2022-07-09 11:03:50,135][25689] Fps is (10 sec: 5714.9, 60 sec: 5747.1, 300 sec: 5751.7). Total num frames: 225701888. Throughput: 0: 6025.2. Samples: 225708696. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:50,135][25689] Avg episode reward: [(0, '-48.182')] [2022-07-09 11:03:51,435][26022] Updated weights on worker 0-0, policy_version 220421 (0.00083) [2022-07-09 11:03:53,360][26022] Updated weights on worker 0-0, policy_version 220431 (0.00090) [2022-07-09 11:03:55,253][25689] Fps is (10 sec: 5351.4, 60 sec: 5719.6, 300 sec: 5742.5). Total num frames: 225730560. Throughput: 0: 5141.2. Samples: 225725478. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:03:55,254][25689] Avg episode reward: [(0, '-48.510')] [2022-07-09 11:03:55,304][26022] Updated weights on worker 0-0, policy_version 220441 (0.00089) [2022-07-09 11:03:56,822][26022] Updated weights on worker 0-0, policy_version 220451 (0.00083) [2022-07-09 11:03:58,655][26022] Updated weights on worker 0-0, policy_version 220461 (0.00085) [2022-07-09 11:04:00,256][25689] Fps is (10 sec: 5766.7, 60 sec: 5753.3, 300 sec: 5759.9). Total num frames: 225760256. Throughput: 0: 6014.4. Samples: 225760288. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:00,257][25689] Avg episode reward: [(0, '-47.958')] [2022-07-09 11:04:00,392][26022] Updated weights on worker 0-0, policy_version 220471 (0.00082) [2022-07-09 11:04:02,493][26022] Updated weights on worker 0-0, policy_version 220481 (0.00085) [2022-07-09 11:04:04,216][26022] Updated weights on worker 0-0, policy_version 220491 (0.00088) [2022-07-09 11:04:05,312][25689] Fps is (10 sec: 5701.0, 60 sec: 5722.2, 300 sec: 5753.4). Total num frames: 225787904. Throughput: 0: 5909.3. Samples: 225793034. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:05,313][25689] Avg episode reward: [(0, '-48.258')] [2022-07-09 11:04:06,091][26022] Updated weights on worker 0-0, policy_version 220501 (0.00091) [2022-07-09 11:04:07,827][26022] Updated weights on worker 0-0, policy_version 220511 (0.00093) [2022-07-09 11:04:09,559][26022] Updated weights on worker 0-0, policy_version 220521 (0.00082) [2022-07-09 11:04:10,359][25689] Fps is (10 sec: 5777.4, 60 sec: 5753.4, 300 sec: 5757.8). Total num frames: 225818624. Throughput: 0: 5036.4. Samples: 225810554. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:10,360][25689] Avg episode reward: [(0, '-49.938')] [2022-07-09 11:04:11,264][26022] Updated weights on worker 0-0, policy_version 220531 (0.00091) [2022-07-09 11:04:12,924][26022] Updated weights on worker 0-0, policy_version 220541 (0.00094) [2022-07-09 11:04:15,011][26022] Updated weights on worker 0-0, policy_version 220551 (0.00593) [2022-07-09 11:04:15,406][25689] Fps is (10 sec: 5985.6, 60 sec: 5744.4, 300 sec: 5764.6). Total num frames: 225848320. Throughput: 0: 5969.1. Samples: 225845768. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:15,406][25689] Avg episode reward: [(0, '-50.033')] [2022-07-09 11:04:16,537][26022] Updated weights on worker 0-0, policy_version 220561 (0.00088) [2022-07-09 11:04:18,377][26022] Updated weights on worker 0-0, policy_version 220571 (0.00084) [2022-07-09 11:04:19,977][26022] Updated weights on worker 0-0, policy_version 220581 (0.00083) [2022-07-09 11:04:20,452][25689] Fps is (10 sec: 5681.4, 60 sec: 5732.5, 300 sec: 5757.5). Total num frames: 225875968. Throughput: 0: 5972.6. Samples: 225880912. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:20,453][25689] Avg episode reward: [(0, '-49.465')] [2022-07-09 11:04:21,839][26022] Updated weights on worker 0-0, policy_version 220591 (0.00083) [2022-07-09 11:04:23,846][26022] Updated weights on worker 0-0, policy_version 220601 (0.00078) [2022-07-09 11:04:25,356][26022] Updated weights on worker 0-0, policy_version 220611 (0.00091) [2022-07-09 11:04:25,489][25689] Fps is (10 sec: 5788.4, 60 sec: 5767.2, 300 sec: 5763.9). Total num frames: 225906688. Throughput: 0: 5211.2. Samples: 225898182. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:25,490][25689] Avg episode reward: [(0, '-50.079')] [2022-07-09 11:04:27,362][26022] Updated weights on worker 0-0, policy_version 220621 (0.00088) [2022-07-09 11:04:28,922][26022] Updated weights on worker 0-0, policy_version 220631 (0.00094) [2022-07-09 11:04:30,512][25689] Fps is (10 sec: 5802.1, 60 sec: 5750.9, 300 sec: 5765.0). Total num frames: 225934336. Throughput: 0: 6072.2. Samples: 225932928. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:30,513][25689] Avg episode reward: [(0, '-50.245')] [2022-07-09 11:04:30,831][26022] Updated weights on worker 0-0, policy_version 220641 (0.00083) [2022-07-09 11:04:32,603][26022] Updated weights on worker 0-0, policy_version 220651 (0.00088) [2022-07-09 11:04:34,139][26022] Updated weights on worker 0-0, policy_version 220661 (0.00097) [2022-07-09 11:04:35,598][25689] Fps is (10 sec: 5672.7, 60 sec: 5750.6, 300 sec: 5763.8). Total num frames: 225964032. Throughput: 0: 6027.6. Samples: 225967480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:35,599][25689] Avg episode reward: [(0, '-49.485')] [2022-07-09 11:04:36,440][26022] Updated weights on worker 0-0, policy_version 220671 (0.00098) [2022-07-09 11:04:37,733][26022] Updated weights on worker 0-0, policy_version 220681 (0.00092) [2022-07-09 11:04:39,691][26022] Updated weights on worker 0-0, policy_version 220691 (0.00079) [2022-07-09 11:04:40,649][25689] Fps is (10 sec: 5758.2, 60 sec: 5750.1, 300 sec: 5763.1). Total num frames: 225992704. Throughput: 0: 6008.6. Samples: 226002262. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:40,649][25689] Avg episode reward: [(0, '-49.575')] [2022-07-09 11:04:41,119][26022] Updated weights on worker 0-0, policy_version 220701 (0.00083) [2022-07-09 11:04:43,075][26022] Updated weights on worker 0-0, policy_version 220711 (0.00095) [2022-07-09 11:04:44,952][26022] Updated weights on worker 0-0, policy_version 220721 (0.00102) [2022-07-09 11:04:45,687][25689] Fps is (10 sec: 5684.1, 60 sec: 5696.9, 300 sec: 5769.8). Total num frames: 226021376. Throughput: 0: 6010.4. Samples: 226019576. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:45,687][25689] Avg episode reward: [(0, '-49.371')] [2022-07-09 11:04:46,474][26022] Updated weights on worker 0-0, policy_version 220731 (0.00086) [2022-07-09 11:04:48,619][26022] Updated weights on worker 0-0, policy_version 220741 (0.00084) [2022-07-09 11:04:50,027][26022] Updated weights on worker 0-0, policy_version 220751 (0.00091) [2022-07-09 11:04:50,532][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:04:50,545][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000220752_226050048.pth [2022-07-09 11:04:50,546][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000218726_223975424.pth [2022-07-09 11:04:50,695][25689] Fps is (10 sec: 5708.0, 60 sec: 5749.0, 300 sec: 5764.7). Total num frames: 226050048. Throughput: 0: 6014.3. Samples: 226054312. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:50,695][25689] Avg episode reward: [(0, '-49.142')] [2022-07-09 11:04:52,096][26022] Updated weights on worker 0-0, policy_version 220761 (0.00088) [2022-07-09 11:04:54,018][26022] Updated weights on worker 0-0, policy_version 220771 (0.00085) [2022-07-09 11:04:55,521][26022] Updated weights on worker 0-0, policy_version 220781 (0.00092) [2022-07-09 11:04:55,747][25689] Fps is (10 sec: 6005.2, 60 sec: 5806.1, 300 sec: 5770.9). Total num frames: 226081792. Throughput: 0: 6018.0. Samples: 226088738. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:04:55,748][25689] Avg episode reward: [(0, '-49.202')] [2022-07-09 11:04:57,567][26022] Updated weights on worker 0-0, policy_version 220791 (0.00090) [2022-07-09 11:04:59,298][26022] Updated weights on worker 0-0, policy_version 220801 (0.00091) [2022-07-09 11:05:00,842][25689] Fps is (10 sec: 5752.2, 60 sec: 5746.6, 300 sec: 5762.6). Total num frames: 226108416. Throughput: 0: 5114.2. Samples: 226105534. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:05:00,842][25689] Avg episode reward: [(0, '-48.849')] [2022-07-09 11:05:00,991][26022] Updated weights on worker 0-0, policy_version 220811 (0.00087) [2022-07-09 11:05:03,423][26022] Updated weights on worker 0-0, policy_version 220821 (0.00086) [2022-07-09 11:05:04,831][26022] Updated weights on worker 0-0, policy_version 220831 (0.00084) [2022-07-09 11:05:05,900][25689] Fps is (10 sec: 5244.7, 60 sec: 5729.4, 300 sec: 5758.8). Total num frames: 226135040. Throughput: 0: 5841.7. Samples: 226137656. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:05:05,901][25689] Avg episode reward: [(0, '-48.536')] [2022-07-09 11:05:06,976][26022] Updated weights on worker 0-0, policy_version 220841 (0.00089) [2022-07-09 11:05:08,412][26022] Updated weights on worker 0-0, policy_version 220851 (0.00093) [2022-07-09 11:05:10,578][26022] Updated weights on worker 0-0, policy_version 220861 (0.00095) [2022-07-09 11:05:10,969][25689] Fps is (10 sec: 5460.0, 60 sec: 5693.6, 300 sec: 5753.0). Total num frames: 226163712. Throughput: 0: 5812.3. Samples: 226172154. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 11:05:10,970][25689] Avg episode reward: [(0, '-47.669')] [2022-07-09 11:05:12,229][26022] Updated weights on worker 0-0, policy_version 220871 (0.00080) [2022-07-09 11:05:14,034][26022] Updated weights on worker 0-0, policy_version 220881 (0.01036) [2022-07-09 11:05:15,806][26022] Updated weights on worker 0-0, policy_version 220891 (0.00083) [2022-07-09 11:05:16,027][25689] Fps is (10 sec: 5662.7, 60 sec: 5675.7, 300 sec: 5755.6). Total num frames: 226192384. Throughput: 0: 4972.1. Samples: 226189568. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:16,027][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 11:05:17,523][26022] Updated weights on worker 0-0, policy_version 220901 (0.00081) [2022-07-09 11:05:19,344][26022] Updated weights on worker 0-0, policy_version 220911 (0.00095) [2022-07-09 11:05:21,054][25689] Fps is (10 sec: 5787.6, 60 sec: 5711.2, 300 sec: 5755.2). Total num frames: 226222080. Throughput: 0: 5867.0. Samples: 226224120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:21,055][25689] Avg episode reward: [(0, '-48.488')] [2022-07-09 11:05:21,190][26022] Updated weights on worker 0-0, policy_version 220921 (0.00085) [2022-07-09 11:05:23,011][26022] Updated weights on worker 0-0, policy_version 220931 (0.00095) [2022-07-09 11:05:24,526][26022] Updated weights on worker 0-0, policy_version 220941 (0.00092) [2022-07-09 11:05:26,061][25689] Fps is (10 sec: 5714.6, 60 sec: 5663.3, 300 sec: 5751.8). Total num frames: 226249728. Throughput: 0: 5970.7. Samples: 226258032. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:26,062][25689] Avg episode reward: [(0, '-48.166')] [2022-07-09 11:05:26,703][26022] Updated weights on worker 0-0, policy_version 220951 (0.00087) [2022-07-09 11:05:28,192][26022] Updated weights on worker 0-0, policy_version 220961 (0.00085) [2022-07-09 11:05:30,084][26022] Updated weights on worker 0-0, policy_version 220971 (0.00087) [2022-07-09 11:05:31,077][25689] Fps is (10 sec: 5721.4, 60 sec: 5697.8, 300 sec: 5753.6). Total num frames: 226279424. Throughput: 0: 5129.9. Samples: 226275304. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:31,078][25689] Avg episode reward: [(0, '-48.535')] [2022-07-09 11:05:31,956][26022] Updated weights on worker 0-0, policy_version 220981 (0.00079) [2022-07-09 11:05:33,671][26022] Updated weights on worker 0-0, policy_version 220991 (0.00095) [2022-07-09 11:05:35,523][26022] Updated weights on worker 0-0, policy_version 221001 (0.00082) [2022-07-09 11:05:36,141][25689] Fps is (10 sec: 5892.0, 60 sec: 5699.9, 300 sec: 5759.4). Total num frames: 226309120. Throughput: 0: 5976.5. Samples: 226309784. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:36,142][25689] Avg episode reward: [(0, '-48.850')] [2022-07-09 11:05:37,256][26022] Updated weights on worker 0-0, policy_version 221011 (0.00085) [2022-07-09 11:05:38,788][26022] Updated weights on worker 0-0, policy_version 221021 (0.00092) [2022-07-09 11:05:40,633][26022] Updated weights on worker 0-0, policy_version 221031 (0.00087) [2022-07-09 11:05:41,155][25689] Fps is (10 sec: 5690.3, 60 sec: 5686.4, 300 sec: 5745.7). Total num frames: 226336768. Throughput: 0: 5990.0. Samples: 226344520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:41,155][25689] Avg episode reward: [(0, '-49.133')] [2022-07-09 11:05:42,624][26022] Updated weights on worker 0-0, policy_version 221041 (0.00081) [2022-07-09 11:05:44,299][26022] Updated weights on worker 0-0, policy_version 221051 (0.00087) [2022-07-09 11:05:46,159][25689] Fps is (10 sec: 5622.1, 60 sec: 5689.6, 300 sec: 5745.8). Total num frames: 226365440. Throughput: 0: 5165.1. Samples: 226361838. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:46,159][25689] Avg episode reward: [(0, '-49.230')] [2022-07-09 11:05:46,233][26022] Updated weights on worker 0-0, policy_version 221061 (0.00080) [2022-07-09 11:05:47,667][26022] Updated weights on worker 0-0, policy_version 221071 (0.00083) [2022-07-09 11:05:49,850][26022] Updated weights on worker 0-0, policy_version 221081 (0.00091) [2022-07-09 11:05:51,162][26022] Updated weights on worker 0-0, policy_version 221091 (0.00080) [2022-07-09 11:05:51,198][25689] Fps is (10 sec: 6015.5, 60 sec: 5737.5, 300 sec: 5756.3). Total num frames: 226397184. Throughput: 0: 6031.3. Samples: 226396660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:51,198][25689] Avg episode reward: [(0, '-48.608')] [2022-07-09 11:05:53,215][26022] Updated weights on worker 0-0, policy_version 221101 (0.00091) [2022-07-09 11:05:54,991][26022] Updated weights on worker 0-0, policy_version 221111 (0.00089) [2022-07-09 11:05:56,309][25689] Fps is (10 sec: 5952.5, 60 sec: 5681.2, 300 sec: 5747.4). Total num frames: 226425856. Throughput: 0: 6025.0. Samples: 226431292. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:05:56,310][25689] Avg episode reward: [(0, '-49.233')] [2022-07-09 11:05:56,595][26022] Updated weights on worker 0-0, policy_version 221121 (0.00082) [2022-07-09 11:05:58,752][26022] Updated weights on worker 0-0, policy_version 221131 (0.00089) [2022-07-09 11:06:00,042][26022] Updated weights on worker 0-0, policy_version 221141 (0.00097) [2022-07-09 11:06:01,351][25689] Fps is (10 sec: 5547.2, 60 sec: 5703.1, 300 sec: 5754.4). Total num frames: 226453504. Throughput: 0: 5155.7. Samples: 226448646. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:01,352][25689] Avg episode reward: [(0, '-49.136')] [2022-07-09 11:06:02,131][26022] Updated weights on worker 0-0, policy_version 221151 (0.00088) [2022-07-09 11:06:04,339][26022] Updated weights on worker 0-0, policy_version 221161 (0.00089) [2022-07-09 11:06:06,072][26022] Updated weights on worker 0-0, policy_version 221171 (0.00064) [2022-07-09 11:06:06,381][25689] Fps is (10 sec: 5489.9, 60 sec: 5722.7, 300 sec: 5743.5). Total num frames: 226481152. Throughput: 0: 5885.4. Samples: 226480854. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:06,382][25689] Avg episode reward: [(0, '-49.364')] [2022-07-09 11:06:07,767][26022] Updated weights on worker 0-0, policy_version 221181 (0.00087) [2022-07-09 11:06:09,535][26022] Updated weights on worker 0-0, policy_version 221191 (0.00081) [2022-07-09 11:06:11,195][26022] Updated weights on worker 0-0, policy_version 221201 (0.00090) [2022-07-09 11:06:11,421][25689] Fps is (10 sec: 5592.6, 60 sec: 5725.4, 300 sec: 5744.0). Total num frames: 226509824. Throughput: 0: 5894.1. Samples: 226515858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:11,422][25689] Avg episode reward: [(0, '-49.369')] [2022-07-09 11:06:13,148][26022] Updated weights on worker 0-0, policy_version 221211 (0.00085) [2022-07-09 11:06:14,890][26022] Updated weights on worker 0-0, policy_version 221221 (0.00062) [2022-07-09 11:06:16,524][25689] Fps is (10 sec: 5855.7, 60 sec: 5755.0, 300 sec: 5743.1). Total num frames: 226540544. Throughput: 0: 5054.3. Samples: 226533464. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:16,524][25689] Avg episode reward: [(0, '-48.985')] [2022-07-09 11:06:16,535][26022] Updated weights on worker 0-0, policy_version 221231 (0.00086) [2022-07-09 11:06:18,303][26022] Updated weights on worker 0-0, policy_version 221241 (0.00087) [2022-07-09 11:06:20,142][26022] Updated weights on worker 0-0, policy_version 221251 (0.00085) [2022-07-09 11:06:21,549][25689] Fps is (10 sec: 5864.3, 60 sec: 5738.3, 300 sec: 5746.9). Total num frames: 226569216. Throughput: 0: 5928.9. Samples: 226568400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:21,550][25689] Avg episode reward: [(0, '-49.661')] [2022-07-09 11:06:21,835][26022] Updated weights on worker 0-0, policy_version 221261 (0.00085) [2022-07-09 11:06:23,628][26022] Updated weights on worker 0-0, policy_version 221271 (0.00084) [2022-07-09 11:06:25,176][26022] Updated weights on worker 0-0, policy_version 221281 (0.00088) [2022-07-09 11:06:26,562][25689] Fps is (10 sec: 5610.3, 60 sec: 5737.7, 300 sec: 5744.1). Total num frames: 226596864. Throughput: 0: 6070.8. Samples: 226603370. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:26,565][25689] Avg episode reward: [(0, '-47.622')] [2022-07-09 11:06:27,154][26022] Updated weights on worker 0-0, policy_version 221291 (0.00090) [2022-07-09 11:06:28,979][26022] Updated weights on worker 0-0, policy_version 221301 (0.00081) [2022-07-09 11:06:30,522][26022] Updated weights on worker 0-0, policy_version 221311 (0.00086) [2022-07-09 11:06:31,573][25689] Fps is (10 sec: 5720.7, 60 sec: 5738.2, 300 sec: 5738.1). Total num frames: 226626560. Throughput: 0: 5202.4. Samples: 226620694. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:31,574][25689] Avg episode reward: [(0, '-47.332')] [2022-07-09 11:06:32,555][26022] Updated weights on worker 0-0, policy_version 221321 (0.00086) [2022-07-09 11:06:34,006][26022] Updated weights on worker 0-0, policy_version 221331 (0.00085) [2022-07-09 11:06:35,935][26022] Updated weights on worker 0-0, policy_version 221341 (0.00087) [2022-07-09 11:06:36,620][25689] Fps is (10 sec: 5905.2, 60 sec: 5739.8, 300 sec: 5744.4). Total num frames: 226656256. Throughput: 0: 6071.8. Samples: 226655484. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:36,620][25689] Avg episode reward: [(0, '-47.677')] [2022-07-09 11:06:37,595][26022] Updated weights on worker 0-0, policy_version 221351 (0.00090) [2022-07-09 11:06:39,271][26022] Updated weights on worker 0-0, policy_version 221361 (0.00086) [2022-07-09 11:06:41,351][26022] Updated weights on worker 0-0, policy_version 221371 (0.00086) [2022-07-09 11:06:41,656][25689] Fps is (10 sec: 5788.6, 60 sec: 5754.6, 300 sec: 5736.9). Total num frames: 226684928. Throughput: 0: 6061.5. Samples: 226690280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:41,657][25689] Avg episode reward: [(0, '-48.063')] [2022-07-09 11:06:42,912][26022] Updated weights on worker 0-0, policy_version 221381 (0.00091) [2022-07-09 11:06:44,770][26022] Updated weights on worker 0-0, policy_version 221391 (0.00088) [2022-07-09 11:06:46,582][26022] Updated weights on worker 0-0, policy_version 221401 (0.00088) [2022-07-09 11:06:46,672][25689] Fps is (10 sec: 5806.8, 60 sec: 5770.4, 300 sec: 5743.6). Total num frames: 226714624. Throughput: 0: 5184.0. Samples: 226707616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:46,674][25689] Avg episode reward: [(0, '-48.623')] [2022-07-09 11:06:48,303][26022] Updated weights on worker 0-0, policy_version 221411 (0.00085) [2022-07-09 11:06:50,110][26022] Updated weights on worker 0-0, policy_version 221421 (0.00089) [2022-07-09 11:06:50,558][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:06:50,566][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000221424_226738176.pth [2022-07-09 11:06:50,569][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000219398_224663552.pth [2022-07-09 11:06:51,679][25689] Fps is (10 sec: 5925.6, 60 sec: 5739.6, 300 sec: 5748.3). Total num frames: 226744320. Throughput: 0: 6065.7. Samples: 226742652. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:51,680][25689] Avg episode reward: [(0, '-48.570')] [2022-07-09 11:06:51,831][26022] Updated weights on worker 0-0, policy_version 221431 (0.00100) [2022-07-09 11:06:53,477][26022] Updated weights on worker 0-0, policy_version 221441 (0.00084) [2022-07-09 11:06:55,535][26022] Updated weights on worker 0-0, policy_version 221451 (0.00086) [2022-07-09 11:06:56,734][25689] Fps is (10 sec: 5800.6, 60 sec: 5744.9, 300 sec: 5740.7). Total num frames: 226772992. Throughput: 0: 6050.4. Samples: 226777182. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:06:56,735][25689] Avg episode reward: [(0, '-50.209')] [2022-07-09 11:06:57,151][26022] Updated weights on worker 0-0, policy_version 221461 (0.00082) [2022-07-09 11:06:58,949][26022] Updated weights on worker 0-0, policy_version 221471 (0.00081) [2022-07-09 11:07:00,972][26022] Updated weights on worker 0-0, policy_version 221481 (0.00084) [2022-07-09 11:07:01,742][25689] Fps is (10 sec: 5698.7, 60 sec: 5765.1, 300 sec: 5748.2). Total num frames: 226801664. Throughput: 0: 5192.7. Samples: 226794578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:07:01,742][25689] Avg episode reward: [(0, '-49.647')] [2022-07-09 11:07:02,895][26022] Updated weights on worker 0-0, policy_version 221491 (0.00082) [2022-07-09 11:07:04,943][26022] Updated weights on worker 0-0, policy_version 221501 (0.00089) [2022-07-09 11:07:06,371][26022] Updated weights on worker 0-0, policy_version 221511 (0.00102) [2022-07-09 11:07:06,752][25689] Fps is (10 sec: 5621.9, 60 sec: 5767.0, 300 sec: 5741.8). Total num frames: 226829312. Throughput: 0: 5962.9. Samples: 226827354. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:07:06,753][25689] Avg episode reward: [(0, '-49.249')] [2022-07-09 11:07:08,407][26022] Updated weights on worker 0-0, policy_version 221521 (0.00435) [2022-07-09 11:07:09,955][26022] Updated weights on worker 0-0, policy_version 221531 (0.00083) [2022-07-09 11:07:11,770][25689] Fps is (10 sec: 5616.5, 60 sec: 5769.2, 300 sec: 5742.7). Total num frames: 226857984. Throughput: 0: 5928.8. Samples: 226861764. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:07:11,770][25689] Avg episode reward: [(0, '-48.489')] [2022-07-09 11:07:11,772][26022] Updated weights on worker 0-0, policy_version 221541 (0.00091) [2022-07-09 11:07:13,532][26022] Updated weights on worker 0-0, policy_version 221551 (0.00107) [2022-07-09 11:07:15,410][26022] Updated weights on worker 0-0, policy_version 221561 (0.00085) [2022-07-09 11:07:16,851][25689] Fps is (10 sec: 5779.8, 60 sec: 5754.2, 300 sec: 5742.5). Total num frames: 226887680. Throughput: 0: 5067.2. Samples: 226879116. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:07:16,851][25689] Avg episode reward: [(0, '-48.545')] [2022-07-09 11:07:17,050][26022] Updated weights on worker 0-0, policy_version 221571 (0.00086) [2022-07-09 11:07:19,033][26022] Updated weights on worker 0-0, policy_version 221581 (0.00087) [2022-07-09 11:07:20,677][26022] Updated weights on worker 0-0, policy_version 221591 (0.00079) [2022-07-09 11:07:21,864][25689] Fps is (10 sec: 5680.6, 60 sec: 5738.4, 300 sec: 5735.8). Total num frames: 226915328. Throughput: 0: 5924.4. Samples: 226913790. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:07:21,866][25689] Avg episode reward: [(0, '-47.780')] [2022-07-09 11:07:22,559][26022] Updated weights on worker 0-0, policy_version 221601 (0.00089) [2022-07-09 11:07:24,389][26022] Updated weights on worker 0-0, policy_version 221611 (0.00425) [2022-07-09 11:07:25,982][26022] Updated weights on worker 0-0, policy_version 221621 (0.00097) [2022-07-09 11:07:26,878][25689] Fps is (10 sec: 5514.4, 60 sec: 5738.3, 300 sec: 5732.1). Total num frames: 226942976. Throughput: 0: 5993.7. Samples: 226947984. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 11:07:26,879][25689] Avg episode reward: [(0, '-47.388')] [2022-07-09 11:07:28,005][26022] Updated weights on worker 0-0, policy_version 221631 (0.00070) [2022-07-09 11:07:29,624][26022] Updated weights on worker 0-0, policy_version 221641 (0.00090) [2022-07-09 11:07:31,457][26022] Updated weights on worker 0-0, policy_version 221651 (0.00080) [2022-07-09 11:07:31,904][25689] Fps is (10 sec: 5813.6, 60 sec: 5753.8, 300 sec: 5743.0). Total num frames: 226973696. Throughput: 0: 5148.0. Samples: 226965416. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:07:31,906][25689] Avg episode reward: [(0, '-47.277')] [2022-07-09 11:07:33,487][26022] Updated weights on worker 0-0, policy_version 221661 (0.00084) [2022-07-09 11:07:34,799][26022] Updated weights on worker 0-0, policy_version 221671 (0.00084) [2022-07-09 11:07:36,899][26022] Updated weights on worker 0-0, policy_version 221681 (0.00085) [2022-07-09 11:07:36,996][25689] Fps is (10 sec: 5869.9, 60 sec: 5732.6, 300 sec: 5738.1). Total num frames: 227002368. Throughput: 0: 6006.7. Samples: 227000126. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:07:36,998][25689] Avg episode reward: [(0, '-48.403')] [2022-07-09 11:07:38,403][26022] Updated weights on worker 0-0, policy_version 221691 (0.00075) [2022-07-09 11:07:40,363][26022] Updated weights on worker 0-0, policy_version 221701 (0.00090) [2022-07-09 11:07:41,951][26022] Updated weights on worker 0-0, policy_version 221711 (0.00096) [2022-07-09 11:07:42,066][25689] Fps is (10 sec: 5743.9, 60 sec: 5746.4, 300 sec: 5733.6). Total num frames: 227032064. Throughput: 0: 6014.4. Samples: 227035290. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:07:42,066][25689] Avg episode reward: [(0, '-47.925')] [2022-07-09 11:07:43,859][26022] Updated weights on worker 0-0, policy_version 221721 (0.00090) [2022-07-09 11:07:45,667][26022] Updated weights on worker 0-0, policy_version 221731 (0.00086) [2022-07-09 11:07:47,089][25689] Fps is (10 sec: 5783.2, 60 sec: 5728.7, 300 sec: 5737.2). Total num frames: 227060736. Throughput: 0: 6064.3. Samples: 227070548. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:07:47,090][25689] Avg episode reward: [(0, '-48.058')] [2022-07-09 11:07:47,285][26022] Updated weights on worker 0-0, policy_version 221741 (0.00086) [2022-07-09 11:07:48,970][26022] Updated weights on worker 0-0, policy_version 221751 (0.00089) [2022-07-09 11:07:50,728][26022] Updated weights on worker 0-0, policy_version 221761 (0.00087) [2022-07-09 11:07:52,103][25689] Fps is (10 sec: 5815.2, 60 sec: 5728.1, 300 sec: 5737.0). Total num frames: 227090432. Throughput: 0: 6067.5. Samples: 227087972. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:07:52,104][25689] Avg episode reward: [(0, '-48.295')] [2022-07-09 11:07:52,729][26022] Updated weights on worker 0-0, policy_version 221771 (0.00088) [2022-07-09 11:07:54,310][26022] Updated weights on worker 0-0, policy_version 221781 (0.00090) [2022-07-09 11:07:56,253][26022] Updated weights on worker 0-0, policy_version 221791 (0.00288) [2022-07-09 11:07:57,144][25689] Fps is (10 sec: 5805.2, 60 sec: 5729.4, 300 sec: 5739.7). Total num frames: 227119104. Throughput: 0: 6063.2. Samples: 227122282. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:07:57,144][25689] Avg episode reward: [(0, '-48.925')] [2022-07-09 11:07:57,943][26022] Updated weights on worker 0-0, policy_version 221801 (0.00093) [2022-07-09 11:07:59,672][26022] Updated weights on worker 0-0, policy_version 221811 (0.00091) [2022-07-09 11:08:01,483][26022] Updated weights on worker 0-0, policy_version 221821 (0.00092) [2022-07-09 11:08:02,159][25689] Fps is (10 sec: 5600.8, 60 sec: 5711.8, 300 sec: 5734.2). Total num frames: 227146752. Throughput: 0: 6009.4. Samples: 227156036. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:02,159][25689] Avg episode reward: [(0, '-47.729')] [2022-07-09 11:08:03,614][26022] Updated weights on worker 0-0, policy_version 221831 (0.00084) [2022-07-09 11:08:05,455][26022] Updated weights on worker 0-0, policy_version 221841 (0.00083) [2022-07-09 11:08:07,111][26022] Updated weights on worker 0-0, policy_version 221851 (0.00085) [2022-07-09 11:08:07,208][25689] Fps is (10 sec: 5596.1, 60 sec: 5725.1, 300 sec: 5733.6). Total num frames: 227175424. Throughput: 0: 5074.4. Samples: 227172638. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:07,208][25689] Avg episode reward: [(0, '-48.097')] [2022-07-09 11:08:09,003][26022] Updated weights on worker 0-0, policy_version 221861 (0.00087) [2022-07-09 11:08:10,625][26022] Updated weights on worker 0-0, policy_version 221871 (0.00084) [2022-07-09 11:08:12,274][25689] Fps is (10 sec: 5668.9, 60 sec: 5720.4, 300 sec: 5728.0). Total num frames: 227204096. Throughput: 0: 5940.4. Samples: 227207798. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:12,280][25689] Avg episode reward: [(0, '-48.889')] [2022-07-09 11:08:12,533][26022] Updated weights on worker 0-0, policy_version 221881 (0.00096) [2022-07-09 11:08:14,142][26022] Updated weights on worker 0-0, policy_version 221891 (0.00087) [2022-07-09 11:08:15,928][26022] Updated weights on worker 0-0, policy_version 221901 (0.00084) [2022-07-09 11:08:17,389][25689] Fps is (10 sec: 5833.3, 60 sec: 5734.1, 300 sec: 5734.5). Total num frames: 227234816. Throughput: 0: 5965.7. Samples: 227243062. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:17,390][25689] Avg episode reward: [(0, '-48.460')] [2022-07-09 11:08:17,558][26022] Updated weights on worker 0-0, policy_version 221911 (0.00086) [2022-07-09 11:08:19,357][26022] Updated weights on worker 0-0, policy_version 221921 (0.00087) [2022-07-09 11:08:21,215][26022] Updated weights on worker 0-0, policy_version 221931 (0.00086) [2022-07-09 11:08:22,459][25689] Fps is (10 sec: 5831.4, 60 sec: 5745.7, 300 sec: 5734.1). Total num frames: 227263488. Throughput: 0: 5142.0. Samples: 227260420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:22,460][25689] Avg episode reward: [(0, '-47.862')] [2022-07-09 11:08:22,994][26022] Updated weights on worker 0-0, policy_version 221941 (0.00090) [2022-07-09 11:08:24,548][26022] Updated weights on worker 0-0, policy_version 221951 (0.00088) [2022-07-09 11:08:26,559][26022] Updated weights on worker 0-0, policy_version 221961 (0.00095) [2022-07-09 11:08:27,502][25689] Fps is (10 sec: 5872.9, 60 sec: 5793.6, 300 sec: 5740.7). Total num frames: 227294208. Throughput: 0: 6028.9. Samples: 227294992. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:27,503][25689] Avg episode reward: [(0, '-48.652')] [2022-07-09 11:08:28,203][26022] Updated weights on worker 0-0, policy_version 221971 (0.00083) [2022-07-09 11:08:30,050][26022] Updated weights on worker 0-0, policy_version 221981 (0.00086) [2022-07-09 11:08:31,959][26022] Updated weights on worker 0-0, policy_version 221991 (0.00087) [2022-07-09 11:08:32,523][25689] Fps is (10 sec: 5799.9, 60 sec: 5743.4, 300 sec: 5735.0). Total num frames: 227321856. Throughput: 0: 6031.5. Samples: 227329926. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:32,523][25689] Avg episode reward: [(0, '-49.280')] [2022-07-09 11:08:33,627][26022] Updated weights on worker 0-0, policy_version 222001 (0.00092) [2022-07-09 11:08:35,433][26022] Updated weights on worker 0-0, policy_version 222011 (0.00085) [2022-07-09 11:08:37,088][26022] Updated weights on worker 0-0, policy_version 222021 (0.00083) [2022-07-09 11:08:37,605][25689] Fps is (10 sec: 5676.0, 60 sec: 5761.3, 300 sec: 5737.8). Total num frames: 227351552. Throughput: 0: 5146.4. Samples: 227347100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:37,606][25689] Avg episode reward: [(0, '-48.789')] [2022-07-09 11:08:38,810][26022] Updated weights on worker 0-0, policy_version 222031 (0.00084) [2022-07-09 11:08:40,667][26022] Updated weights on worker 0-0, policy_version 222041 (0.00085) [2022-07-09 11:08:42,273][26022] Updated weights on worker 0-0, policy_version 222051 (0.00083) [2022-07-09 11:08:42,611][25689] Fps is (10 sec: 5785.9, 60 sec: 5750.4, 300 sec: 5727.5). Total num frames: 227380224. Throughput: 0: 6041.2. Samples: 227382160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:42,611][25689] Avg episode reward: [(0, '-48.286')] [2022-07-09 11:08:44,028][26022] Updated weights on worker 0-0, policy_version 222061 (0.00086) [2022-07-09 11:08:46,033][26022] Updated weights on worker 0-0, policy_version 222071 (0.00098) [2022-07-09 11:08:47,615][25689] Fps is (10 sec: 5831.3, 60 sec: 5769.2, 300 sec: 5741.7). Total num frames: 227409920. Throughput: 0: 6064.2. Samples: 227416958. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:47,615][25689] Avg episode reward: [(0, '-49.210')] [2022-07-09 11:08:47,648][26022] Updated weights on worker 0-0, policy_version 222081 (0.00086) [2022-07-09 11:08:49,443][26022] Updated weights on worker 0-0, policy_version 222091 (0.00091) [2022-07-09 11:08:50,729][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:08:50,738][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000222097_227427328.pth [2022-07-09 11:08:50,739][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000220078_225359872.pth [2022-07-09 11:08:51,218][26022] Updated weights on worker 0-0, policy_version 222101 (0.00088) [2022-07-09 11:08:52,683][25689] Fps is (10 sec: 5896.7, 60 sec: 5764.0, 300 sec: 5746.1). Total num frames: 227439616. Throughput: 0: 5186.6. Samples: 227434488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:52,684][25689] Avg episode reward: [(0, '-49.423')] [2022-07-09 11:08:53,068][26022] Updated weights on worker 0-0, policy_version 222111 (0.00090) [2022-07-09 11:08:54,835][26022] Updated weights on worker 0-0, policy_version 222121 (0.00087) [2022-07-09 11:08:56,878][26022] Updated weights on worker 0-0, policy_version 222131 (0.00088) [2022-07-09 11:08:57,808][25689] Fps is (10 sec: 5625.7, 60 sec: 5739.1, 300 sec: 5736.8). Total num frames: 227467264. Throughput: 0: 6017.1. Samples: 227468662. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:08:57,808][25689] Avg episode reward: [(0, '-48.602')] [2022-07-09 11:08:58,370][26022] Updated weights on worker 0-0, policy_version 222141 (0.00111) [2022-07-09 11:09:00,351][26022] Updated weights on worker 0-0, policy_version 222151 (0.00094) [2022-07-09 11:09:02,287][26022] Updated weights on worker 0-0, policy_version 222161 (0.00097) [2022-07-09 11:09:02,887][25689] Fps is (10 sec: 5619.6, 60 sec: 5766.8, 300 sec: 5743.3). Total num frames: 227496960. Throughput: 0: 5860.7. Samples: 227500992. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:02,888][25689] Avg episode reward: [(0, '-48.925')] [2022-07-09 11:09:04,165][26022] Updated weights on worker 0-0, policy_version 222171 (0.00083) [2022-07-09 11:09:05,871][26022] Updated weights on worker 0-0, policy_version 222181 (0.00088) [2022-07-09 11:09:07,690][26022] Updated weights on worker 0-0, policy_version 222191 (0.00090) [2022-07-09 11:09:07,920][25689] Fps is (10 sec: 5569.4, 60 sec: 5734.6, 300 sec: 5729.8). Total num frames: 227523584. Throughput: 0: 4994.7. Samples: 227518378. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:07,921][25689] Avg episode reward: [(0, '-48.639')] [2022-07-09 11:09:09,311][26022] Updated weights on worker 0-0, policy_version 222201 (0.00086) [2022-07-09 11:09:11,421][26022] Updated weights on worker 0-0, policy_version 222211 (0.00089) [2022-07-09 11:09:12,890][26022] Updated weights on worker 0-0, policy_version 222221 (0.00102) [2022-07-09 11:09:12,959][25689] Fps is (10 sec: 5794.9, 60 sec: 5787.8, 300 sec: 5736.8). Total num frames: 227555328. Throughput: 0: 5843.1. Samples: 227552966. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:12,961][25689] Avg episode reward: [(0, '-49.509')] [2022-07-09 11:09:14,930][26022] Updated weights on worker 0-0, policy_version 222231 (0.00089) [2022-07-09 11:09:16,355][26022] Updated weights on worker 0-0, policy_version 222241 (0.00085) [2022-07-09 11:09:18,010][25689] Fps is (10 sec: 5886.6, 60 sec: 5743.3, 300 sec: 5736.7). Total num frames: 227582976. Throughput: 0: 5904.0. Samples: 227587934. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:18,011][25689] Avg episode reward: [(0, '-48.714')] [2022-07-09 11:09:18,462][26022] Updated weights on worker 0-0, policy_version 222251 (0.00086) [2022-07-09 11:09:19,913][26022] Updated weights on worker 0-0, policy_version 222261 (0.00086) [2022-07-09 11:09:21,878][26022] Updated weights on worker 0-0, policy_version 222271 (0.00081) [2022-07-09 11:09:23,020][25689] Fps is (10 sec: 5699.6, 60 sec: 5765.8, 300 sec: 5733.8). Total num frames: 227612672. Throughput: 0: 5186.0. Samples: 227605404. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:23,021][25689] Avg episode reward: [(0, '-49.292')] [2022-07-09 11:09:23,489][26022] Updated weights on worker 0-0, policy_version 222281 (0.00067) [2022-07-09 11:09:25,443][26022] Updated weights on worker 0-0, policy_version 222291 (0.00090) [2022-07-09 11:09:26,987][26022] Updated weights on worker 0-0, policy_version 222301 (0.00087) [2022-07-09 11:09:28,102][25689] Fps is (10 sec: 5783.4, 60 sec: 5728.4, 300 sec: 5736.1). Total num frames: 227641344. Throughput: 0: 6045.4. Samples: 227640384. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:28,102][25689] Avg episode reward: [(0, '-48.645')] [2022-07-09 11:09:29,073][26022] Updated weights on worker 0-0, policy_version 222311 (0.00084) [2022-07-09 11:09:30,511][26022] Updated weights on worker 0-0, policy_version 222321 (0.00498) [2022-07-09 11:09:32,623][26022] Updated weights on worker 0-0, policy_version 222331 (0.00100) [2022-07-09 11:09:33,188][25689] Fps is (10 sec: 5740.6, 60 sec: 5755.9, 300 sec: 5736.1). Total num frames: 227671040. Throughput: 0: 6029.3. Samples: 227674930. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:33,189][25689] Avg episode reward: [(0, '-48.760')] [2022-07-09 11:09:33,985][26022] Updated weights on worker 0-0, policy_version 222341 (0.00079) [2022-07-09 11:09:36,050][26022] Updated weights on worker 0-0, policy_version 222351 (0.00089) [2022-07-09 11:09:37,743][26022] Updated weights on worker 0-0, policy_version 222361 (0.00085) [2022-07-09 11:09:38,314][25689] Fps is (10 sec: 5715.8, 60 sec: 5735.0, 300 sec: 5734.7). Total num frames: 227699712. Throughput: 0: 5992.4. Samples: 227709604. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:38,314][25689] Avg episode reward: [(0, '-48.839')] [2022-07-09 11:09:39,573][26022] Updated weights on worker 0-0, policy_version 222371 (0.00088) [2022-07-09 11:09:41,323][26022] Updated weights on worker 0-0, policy_version 222381 (0.00082) [2022-07-09 11:09:43,070][26022] Updated weights on worker 0-0, policy_version 222391 (0.00086) [2022-07-09 11:09:43,327][25689] Fps is (10 sec: 5756.7, 60 sec: 5751.1, 300 sec: 5738.6). Total num frames: 227729408. Throughput: 0: 5990.3. Samples: 227727048. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-09 11:09:43,328][25689] Avg episode reward: [(0, '-48.147')] [2022-07-09 11:09:44,754][26022] Updated weights on worker 0-0, policy_version 222401 (0.00086) [2022-07-09 11:09:46,588][26022] Updated weights on worker 0-0, policy_version 222411 (0.00089) [2022-07-09 11:09:48,155][26022] Updated weights on worker 0-0, policy_version 222421 (0.00082) [2022-07-09 11:09:48,371][25689] Fps is (10 sec: 5905.4, 60 sec: 5747.3, 300 sec: 5741.3). Total num frames: 227759104. Throughput: 0: 5995.8. Samples: 227761912. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:09:48,371][25689] Avg episode reward: [(0, '-48.659')] [2022-07-09 11:09:50,136][26022] Updated weights on worker 0-0, policy_version 222431 (0.00088) [2022-07-09 11:09:51,971][26022] Updated weights on worker 0-0, policy_version 222441 (0.00091) [2022-07-09 11:09:53,378][25689] Fps is (10 sec: 5909.3, 60 sec: 5753.1, 300 sec: 5735.3). Total num frames: 227788800. Throughput: 0: 6051.4. Samples: 227797106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:09:53,378][25689] Avg episode reward: [(0, '-48.297')] [2022-07-09 11:09:53,478][26022] Updated weights on worker 0-0, policy_version 222451 (0.00080) [2022-07-09 11:09:55,432][26022] Updated weights on worker 0-0, policy_version 222461 (0.00083) [2022-07-09 11:09:57,005][26022] Updated weights on worker 0-0, policy_version 222471 (0.00089) [2022-07-09 11:09:58,449][25689] Fps is (10 sec: 5791.3, 60 sec: 5775.0, 300 sec: 5742.7). Total num frames: 227817472. Throughput: 0: 5208.2. Samples: 227814474. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:09:58,453][25689] Avg episode reward: [(0, '-47.363')] [2022-07-09 11:09:59,013][26022] Updated weights on worker 0-0, policy_version 222481 (0.00092) [2022-07-09 11:10:00,542][26022] Updated weights on worker 0-0, policy_version 222491 (0.00103) [2022-07-09 11:10:02,980][26022] Updated weights on worker 0-0, policy_version 222501 (0.00086) [2022-07-09 11:10:03,472][25689] Fps is (10 sec: 5477.9, 60 sec: 5729.7, 300 sec: 5743.3). Total num frames: 227844096. Throughput: 0: 5956.6. Samples: 227847042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:03,472][25689] Avg episode reward: [(0, '-47.452')] [2022-07-09 11:10:04,414][26022] Updated weights on worker 0-0, policy_version 222511 (0.00081) [2022-07-09 11:10:06,378][26022] Updated weights on worker 0-0, policy_version 222521 (0.00091) [2022-07-09 11:10:07,978][26022] Updated weights on worker 0-0, policy_version 222531 (0.00086) [2022-07-09 11:10:08,483][25689] Fps is (10 sec: 5612.9, 60 sec: 5782.5, 300 sec: 5747.9). Total num frames: 227873792. Throughput: 0: 5969.9. Samples: 227881982. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:08,484][25689] Avg episode reward: [(0, '-47.726')] [2022-07-09 11:10:10,100][26022] Updated weights on worker 0-0, policy_version 222541 (0.00089) [2022-07-09 11:10:11,521][26022] Updated weights on worker 0-0, policy_version 222551 (0.00085) [2022-07-09 11:10:13,409][26022] Updated weights on worker 0-0, policy_version 222561 (0.00084) [2022-07-09 11:10:13,518][25689] Fps is (10 sec: 5911.9, 60 sec: 5749.1, 300 sec: 5751.8). Total num frames: 227903488. Throughput: 0: 5082.3. Samples: 227899466. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:13,519][25689] Avg episode reward: [(0, '-47.608')] [2022-07-09 11:10:15,204][26022] Updated weights on worker 0-0, policy_version 222571 (0.00096) [2022-07-09 11:10:16,840][26022] Updated weights on worker 0-0, policy_version 222581 (0.00089) [2022-07-09 11:10:18,585][25689] Fps is (10 sec: 5677.0, 60 sec: 5747.6, 300 sec: 5744.1). Total num frames: 227931136. Throughput: 0: 5964.5. Samples: 227934570. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:18,585][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 11:10:18,666][26022] Updated weights on worker 0-0, policy_version 222591 (0.00331) [2022-07-09 11:10:20,355][26022] Updated weights on worker 0-0, policy_version 222601 (0.00087) [2022-07-09 11:10:22,097][26022] Updated weights on worker 0-0, policy_version 222611 (0.00083) [2022-07-09 11:10:23,589][25689] Fps is (10 sec: 5795.9, 60 sec: 5765.1, 300 sec: 5754.5). Total num frames: 227961856. Throughput: 0: 6086.4. Samples: 227969480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:23,589][25689] Avg episode reward: [(0, '-47.493')] [2022-07-09 11:10:24,047][26022] Updated weights on worker 0-0, policy_version 222621 (0.00090) [2022-07-09 11:10:25,667][26022] Updated weights on worker 0-0, policy_version 222631 (0.00084) [2022-07-09 11:10:27,542][26022] Updated weights on worker 0-0, policy_version 222641 (0.00089) [2022-07-09 11:10:28,619][25689] Fps is (10 sec: 5918.7, 60 sec: 5770.0, 300 sec: 5750.8). Total num frames: 227990528. Throughput: 0: 5207.1. Samples: 227986832. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:28,620][25689] Avg episode reward: [(0, '-47.713')] [2022-07-09 11:10:29,401][26022] Updated weights on worker 0-0, policy_version 222651 (0.00084) [2022-07-09 11:10:30,873][26022] Updated weights on worker 0-0, policy_version 222661 (0.00085) [2022-07-09 11:10:32,895][26022] Updated weights on worker 0-0, policy_version 222671 (0.00086) [2022-07-09 11:10:33,639][25689] Fps is (10 sec: 5807.7, 60 sec: 5776.3, 300 sec: 5751.7). Total num frames: 228020224. Throughput: 0: 6066.6. Samples: 228021530. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:33,640][25689] Avg episode reward: [(0, '-47.271')] [2022-07-09 11:10:34,760][26022] Updated weights on worker 0-0, policy_version 222681 (0.00104) [2022-07-09 11:10:36,299][26022] Updated weights on worker 0-0, policy_version 222691 (0.00088) [2022-07-09 11:10:38,275][26022] Updated weights on worker 0-0, policy_version 222701 (0.00088) [2022-07-09 11:10:38,694][25689] Fps is (10 sec: 5793.8, 60 sec: 5783.1, 300 sec: 5754.3). Total num frames: 228048896. Throughput: 0: 6049.4. Samples: 228056218. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:38,694][25689] Avg episode reward: [(0, '-48.121')] [2022-07-09 11:10:39,891][26022] Updated weights on worker 0-0, policy_version 222711 (0.00088) [2022-07-09 11:10:41,645][26022] Updated weights on worker 0-0, policy_version 222721 (0.01063) [2022-07-09 11:10:43,426][26022] Updated weights on worker 0-0, policy_version 222731 (0.00087) [2022-07-09 11:10:43,749][25689] Fps is (10 sec: 5773.3, 60 sec: 5779.1, 300 sec: 5756.8). Total num frames: 228078592. Throughput: 0: 5161.6. Samples: 228073538. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:43,749][25689] Avg episode reward: [(0, '-47.986')] [2022-07-09 11:10:45,384][26022] Updated weights on worker 0-0, policy_version 222741 (0.00093) [2022-07-09 11:10:46,879][26022] Updated weights on worker 0-0, policy_version 222751 (0.00085) [2022-07-09 11:10:48,783][25689] Fps is (10 sec: 5582.1, 60 sec: 5729.2, 300 sec: 5739.6). Total num frames: 228105216. Throughput: 0: 6022.4. Samples: 228108266. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:48,784][25689] Avg episode reward: [(0, '-48.758')] [2022-07-09 11:10:49,007][26022] Updated weights on worker 0-0, policy_version 222761 (0.00096) [2022-07-09 11:10:50,188][26022] Updated weights on worker 0-0, policy_version 222771 (0.00081) [2022-07-09 11:10:50,830][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:10:50,841][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000222774_228120576.pth [2022-07-09 11:10:50,842][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000220752_226050048.pth [2022-07-09 11:10:52,496][26022] Updated weights on worker 0-0, policy_version 222781 (0.00085) [2022-07-09 11:10:53,801][25689] Fps is (10 sec: 5806.4, 60 sec: 5762.0, 300 sec: 5751.8). Total num frames: 228136960. Throughput: 0: 6039.9. Samples: 228143310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:53,802][25689] Avg episode reward: [(0, '-48.785')] [2022-07-09 11:10:53,934][26022] Updated weights on worker 0-0, policy_version 222791 (0.00086) [2022-07-09 11:10:55,882][26022] Updated weights on worker 0-0, policy_version 222801 (0.00086) [2022-07-09 11:10:57,639][26022] Updated weights on worker 0-0, policy_version 222811 (0.00091) [2022-07-09 11:10:58,862][25689] Fps is (10 sec: 5994.6, 60 sec: 5763.1, 300 sec: 5754.8). Total num frames: 228165632. Throughput: 0: 5189.0. Samples: 228160870. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:10:58,862][25689] Avg episode reward: [(0, '-48.415')] [2022-07-09 11:10:59,275][26022] Updated weights on worker 0-0, policy_version 222821 (0.00381) [2022-07-09 11:11:01,355][26022] Updated weights on worker 0-0, policy_version 222831 (0.00086) [2022-07-09 11:11:03,239][26022] Updated weights on worker 0-0, policy_version 222841 (0.00082) [2022-07-09 11:11:03,946][25689] Fps is (10 sec: 5450.9, 60 sec: 5757.2, 300 sec: 5750.4). Total num frames: 228192256. Throughput: 0: 5942.2. Samples: 228193552. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:03,946][25689] Avg episode reward: [(0, '-48.171')] [2022-07-09 11:11:04,864][26022] Updated weights on worker 0-0, policy_version 222851 (0.00090) [2022-07-09 11:11:06,838][26022] Updated weights on worker 0-0, policy_version 222861 (0.00092) [2022-07-09 11:11:08,425][26022] Updated weights on worker 0-0, policy_version 222871 (0.00086) [2022-07-09 11:11:09,026][25689] Fps is (10 sec: 5541.0, 60 sec: 5750.7, 300 sec: 5753.0). Total num frames: 228221952. Throughput: 0: 5937.9. Samples: 228228466. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:09,026][25689] Avg episode reward: [(0, '-48.200')] [2022-07-09 11:11:10,310][26022] Updated weights on worker 0-0, policy_version 222881 (0.00093) [2022-07-09 11:11:12,175][26022] Updated weights on worker 0-0, policy_version 222891 (0.00086) [2022-07-09 11:11:13,713][26022] Updated weights on worker 0-0, policy_version 222901 (0.01128) [2022-07-09 11:11:14,082][25689] Fps is (10 sec: 5859.6, 60 sec: 5748.7, 300 sec: 5750.5). Total num frames: 228251648. Throughput: 0: 5063.8. Samples: 228246006. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:14,082][25689] Avg episode reward: [(0, '-48.076')] [2022-07-09 11:11:15,623][26022] Updated weights on worker 0-0, policy_version 222911 (0.00087) [2022-07-09 11:11:17,225][26022] Updated weights on worker 0-0, policy_version 222921 (0.00085) [2022-07-09 11:11:19,105][26022] Updated weights on worker 0-0, policy_version 222931 (0.00079) [2022-07-09 11:11:19,195][25689] Fps is (10 sec: 5940.9, 60 sec: 5794.9, 300 sec: 5755.7). Total num frames: 228282368. Throughput: 0: 5912.1. Samples: 228281086. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:19,196][25689] Avg episode reward: [(0, '-47.368')] [2022-07-09 11:11:20,988][26022] Updated weights on worker 0-0, policy_version 222941 (0.00082) [2022-07-09 11:11:22,517][26022] Updated weights on worker 0-0, policy_version 222951 (0.00083) [2022-07-09 11:11:24,236][25689] Fps is (10 sec: 5748.0, 60 sec: 5740.7, 300 sec: 5755.1). Total num frames: 228310016. Throughput: 0: 6033.5. Samples: 228315974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:24,237][25689] Avg episode reward: [(0, '-47.457')] [2022-07-09 11:11:24,607][26022] Updated weights on worker 0-0, policy_version 222961 (0.00092) [2022-07-09 11:11:26,149][26022] Updated weights on worker 0-0, policy_version 222971 (0.00081) [2022-07-09 11:11:28,041][26022] Updated weights on worker 0-0, policy_version 222981 (0.00090) [2022-07-09 11:11:29,244][25689] Fps is (10 sec: 5604.5, 60 sec: 5742.9, 300 sec: 5751.7). Total num frames: 228338688. Throughput: 0: 5175.4. Samples: 228333104. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:29,246][25689] Avg episode reward: [(0, '-47.854')] [2022-07-09 11:11:29,665][26022] Updated weights on worker 0-0, policy_version 222991 (0.00085) [2022-07-09 11:11:31,371][26022] Updated weights on worker 0-0, policy_version 223001 (0.00084) [2022-07-09 11:11:33,295][26022] Updated weights on worker 0-0, policy_version 223011 (0.00082) [2022-07-09 11:11:34,278][25689] Fps is (10 sec: 5812.2, 60 sec: 5741.5, 300 sec: 5752.0). Total num frames: 228368384. Throughput: 0: 6026.0. Samples: 228367712. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:34,279][25689] Avg episode reward: [(0, '-47.912')] [2022-07-09 11:11:35,095][26022] Updated weights on worker 0-0, policy_version 223021 (0.00086) [2022-07-09 11:11:36,827][26022] Updated weights on worker 0-0, policy_version 223031 (0.00091) [2022-07-09 11:11:38,689][26022] Updated weights on worker 0-0, policy_version 223041 (0.00087) [2022-07-09 11:11:39,342][25689] Fps is (10 sec: 5881.5, 60 sec: 5757.5, 300 sec: 5754.9). Total num frames: 228398080. Throughput: 0: 6028.5. Samples: 228402542. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:39,343][25689] Avg episode reward: [(0, '-47.694')] [2022-07-09 11:11:40,379][26022] Updated weights on worker 0-0, policy_version 223051 (0.00090) [2022-07-09 11:11:42,064][26022] Updated weights on worker 0-0, policy_version 223061 (0.00086) [2022-07-09 11:11:43,759][26022] Updated weights on worker 0-0, policy_version 223071 (0.00083) [2022-07-09 11:11:44,412][25689] Fps is (10 sec: 5861.1, 60 sec: 5756.1, 300 sec: 5753.9). Total num frames: 228427776. Throughput: 0: 5154.6. Samples: 228419972. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:44,413][25689] Avg episode reward: [(0, '-46.491')] [2022-07-09 11:11:45,673][26022] Updated weights on worker 0-0, policy_version 223081 (0.00086) [2022-07-09 11:11:47,415][26022] Updated weights on worker 0-0, policy_version 223091 (0.00087) [2022-07-09 11:11:49,105][26022] Updated weights on worker 0-0, policy_version 223101 (0.00086) [2022-07-09 11:11:49,445][25689] Fps is (10 sec: 5878.9, 60 sec: 5806.9, 300 sec: 5753.4). Total num frames: 228457472. Throughput: 0: 6036.7. Samples: 228455048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:49,445][25689] Avg episode reward: [(0, '-47.049')] [2022-07-09 11:11:51,070][26022] Updated weights on worker 0-0, policy_version 223111 (0.00079) [2022-07-09 11:11:52,573][26022] Updated weights on worker 0-0, policy_version 223121 (0.00081) [2022-07-09 11:11:54,470][25689] Fps is (10 sec: 5700.9, 60 sec: 5738.7, 300 sec: 5750.5). Total num frames: 228485120. Throughput: 0: 6052.6. Samples: 228489926. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:54,471][25689] Avg episode reward: [(0, '-47.094')] [2022-07-09 11:11:54,515][26022] Updated weights on worker 0-0, policy_version 223131 (0.00084) [2022-07-09 11:11:56,170][26022] Updated weights on worker 0-0, policy_version 223141 (0.00082) [2022-07-09 11:11:58,004][26022] Updated weights on worker 0-0, policy_version 223151 (0.00096) [2022-07-09 11:11:59,520][25689] Fps is (10 sec: 5691.5, 60 sec: 5756.5, 300 sec: 5753.1). Total num frames: 228514816. Throughput: 0: 6058.1. Samples: 228524782. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 11:11:59,521][25689] Avg episode reward: [(0, '-46.728')] [2022-07-09 11:11:59,819][26022] Updated weights on worker 0-0, policy_version 223161 (0.00095) [2022-07-09 11:12:01,581][26022] Updated weights on worker 0-0, policy_version 223171 (0.00085) [2022-07-09 11:12:03,775][26022] Updated weights on worker 0-0, policy_version 223181 (0.00094) [2022-07-09 11:12:04,547][25689] Fps is (10 sec: 5589.3, 60 sec: 5762.0, 300 sec: 5749.4). Total num frames: 228541440. Throughput: 0: 5964.1. Samples: 228540060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:04,547][25689] Avg episode reward: [(0, '-46.330')] [2022-07-09 11:12:05,537][26022] Updated weights on worker 0-0, policy_version 223191 (0.00083) [2022-07-09 11:12:07,395][26022] Updated weights on worker 0-0, policy_version 223201 (0.00087) [2022-07-09 11:12:09,081][26022] Updated weights on worker 0-0, policy_version 223211 (0.00090) [2022-07-09 11:12:09,584][25689] Fps is (10 sec: 5596.1, 60 sec: 5766.0, 300 sec: 5752.4). Total num frames: 228571136. Throughput: 0: 5914.9. Samples: 228574172. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:09,585][25689] Avg episode reward: [(0, '-47.305')] [2022-07-09 11:12:10,915][26022] Updated weights on worker 0-0, policy_version 223221 (0.00084) [2022-07-09 11:12:12,384][26022] Updated weights on worker 0-0, policy_version 223231 (0.00086) [2022-07-09 11:12:14,339][26022] Updated weights on worker 0-0, policy_version 223241 (0.00092) [2022-07-09 11:12:14,615][25689] Fps is (10 sec: 5797.2, 60 sec: 5751.5, 300 sec: 5749.9). Total num frames: 228599808. Throughput: 0: 5916.7. Samples: 228609116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:14,616][25689] Avg episode reward: [(0, '-47.487')] [2022-07-09 11:12:16,159][26022] Updated weights on worker 0-0, policy_version 223251 (0.00084) [2022-07-09 11:12:17,823][26022] Updated weights on worker 0-0, policy_version 223261 (0.00087) [2022-07-09 11:12:19,628][26022] Updated weights on worker 0-0, policy_version 223271 (0.00085) [2022-07-09 11:12:19,765][25689] Fps is (10 sec: 5733.2, 60 sec: 5731.2, 300 sec: 5754.2). Total num frames: 228629504. Throughput: 0: 5030.0. Samples: 228626614. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:19,766][25689] Avg episode reward: [(0, '-47.860')] [2022-07-09 11:12:21,519][26022] Updated weights on worker 0-0, policy_version 223281 (0.00084) [2022-07-09 11:12:23,139][26022] Updated weights on worker 0-0, policy_version 223291 (0.00091) [2022-07-09 11:12:24,796][25689] Fps is (10 sec: 5732.9, 60 sec: 5749.0, 300 sec: 5757.3). Total num frames: 228658176. Throughput: 0: 5998.8. Samples: 228661532. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:24,797][25689] Avg episode reward: [(0, '-48.292')] [2022-07-09 11:12:24,995][26022] Updated weights on worker 0-0, policy_version 223301 (0.00092) [2022-07-09 11:12:26,691][26022] Updated weights on worker 0-0, policy_version 223311 (0.00094) [2022-07-09 11:12:28,488][26022] Updated weights on worker 0-0, policy_version 223321 (0.00085) [2022-07-09 11:12:29,797][25689] Fps is (10 sec: 5920.1, 60 sec: 5783.5, 300 sec: 5757.8). Total num frames: 228688896. Throughput: 0: 6040.6. Samples: 228696270. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:29,798][25689] Avg episode reward: [(0, '-48.194')] [2022-07-09 11:12:30,319][26022] Updated weights on worker 0-0, policy_version 223331 (0.00089) [2022-07-09 11:12:31,890][26022] Updated weights on worker 0-0, policy_version 223341 (0.00092) [2022-07-09 11:12:33,915][26022] Updated weights on worker 0-0, policy_version 223351 (0.00086) [2022-07-09 11:12:34,835][25689] Fps is (10 sec: 5814.3, 60 sec: 5749.3, 300 sec: 5755.4). Total num frames: 228716544. Throughput: 0: 5168.6. Samples: 228713624. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:34,835][25689] Avg episode reward: [(0, '-47.802')] [2022-07-09 11:12:35,576][26022] Updated weights on worker 0-0, policy_version 223361 (0.00096) [2022-07-09 11:12:37,347][26022] Updated weights on worker 0-0, policy_version 223371 (0.00085) [2022-07-09 11:12:39,068][26022] Updated weights on worker 0-0, policy_version 223381 (0.00078) [2022-07-09 11:12:39,875][25689] Fps is (10 sec: 5690.4, 60 sec: 5751.6, 300 sec: 5756.0). Total num frames: 228746240. Throughput: 0: 6037.0. Samples: 228748014. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:39,875][25689] Avg episode reward: [(0, '-48.218')] [2022-07-09 11:12:41,037][26022] Updated weights on worker 0-0, policy_version 223391 (0.00093) [2022-07-09 11:12:42,595][26022] Updated weights on worker 0-0, policy_version 223401 (0.00078) [2022-07-09 11:12:44,467][26022] Updated weights on worker 0-0, policy_version 223411 (0.00086) [2022-07-09 11:12:44,878][25689] Fps is (10 sec: 5710.1, 60 sec: 5724.1, 300 sec: 5752.9). Total num frames: 228773888. Throughput: 0: 6031.9. Samples: 228782660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:44,878][25689] Avg episode reward: [(0, '-48.857')] [2022-07-09 11:12:46,356][26022] Updated weights on worker 0-0, policy_version 223421 (0.00079) [2022-07-09 11:12:48,138][26022] Updated weights on worker 0-0, policy_version 223431 (0.00085) [2022-07-09 11:12:49,797][26022] Updated weights on worker 0-0, policy_version 223441 (0.00086) [2022-07-09 11:12:49,883][25689] Fps is (10 sec: 5832.0, 60 sec: 5743.7, 300 sec: 5756.5). Total num frames: 228804608. Throughput: 0: 5170.2. Samples: 228800116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:49,884][25689] Avg episode reward: [(0, '-47.605')] [2022-07-09 11:12:51,053][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:12:51,071][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000223448_228810752.pth [2022-07-09 11:12:51,074][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000221424_226738176.pth [2022-07-09 11:12:51,645][26022] Updated weights on worker 0-0, policy_version 223451 (0.00085) [2022-07-09 11:12:53,152][26022] Updated weights on worker 0-0, policy_version 223461 (0.00095) [2022-07-09 11:12:54,890][25689] Fps is (10 sec: 5829.5, 60 sec: 5745.4, 300 sec: 5753.7). Total num frames: 228832256. Throughput: 0: 6050.9. Samples: 228834976. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:54,891][25689] Avg episode reward: [(0, '-47.263')] [2022-07-09 11:12:55,267][26022] Updated weights on worker 0-0, policy_version 223471 (0.00053) [2022-07-09 11:12:56,763][26022] Updated weights on worker 0-0, policy_version 223481 (0.00087) [2022-07-09 11:12:58,866][26022] Updated weights on worker 0-0, policy_version 223491 (0.00083) [2022-07-09 11:12:59,940][25689] Fps is (10 sec: 5803.5, 60 sec: 5762.3, 300 sec: 5763.4). Total num frames: 228862976. Throughput: 0: 6061.9. Samples: 228869650. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:12:59,941][25689] Avg episode reward: [(0, '-47.387')] [2022-07-09 11:13:00,431][26022] Updated weights on worker 0-0, policy_version 223501 (0.00086) [2022-07-09 11:13:02,462][26022] Updated weights on worker 0-0, policy_version 223511 (0.00075) [2022-07-09 11:13:04,325][26022] Updated weights on worker 0-0, policy_version 223521 (0.00082) [2022-07-09 11:13:04,957][25689] Fps is (10 sec: 5492.9, 60 sec: 5729.4, 300 sec: 5750.2). Total num frames: 228887552. Throughput: 0: 5096.4. Samples: 228884992. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:04,957][25689] Avg episode reward: [(0, '-46.358')] [2022-07-09 11:13:06,053][26022] Updated weights on worker 0-0, policy_version 223531 (0.00087) [2022-07-09 11:13:08,053][26022] Updated weights on worker 0-0, policy_version 223541 (0.00080) [2022-07-09 11:13:09,547][26022] Updated weights on worker 0-0, policy_version 223551 (0.00076) [2022-07-09 11:13:09,964][25689] Fps is (10 sec: 5414.2, 60 sec: 5732.2, 300 sec: 5754.8). Total num frames: 228917248. Throughput: 0: 5942.4. Samples: 228919448. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:09,965][25689] Avg episode reward: [(0, '-47.152')] [2022-07-09 11:13:11,461][26022] Updated weights on worker 0-0, policy_version 223561 (0.00085) [2022-07-09 11:13:13,439][26022] Updated weights on worker 0-0, policy_version 223571 (0.00083) [2022-07-09 11:13:14,973][25689] Fps is (10 sec: 5827.6, 60 sec: 5734.4, 300 sec: 5750.0). Total num frames: 228945920. Throughput: 0: 5941.8. Samples: 228954302. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:14,973][25689] Avg episode reward: [(0, '-47.538')] [2022-07-09 11:13:14,997][26022] Updated weights on worker 0-0, policy_version 223581 (0.00086) [2022-07-09 11:13:16,946][26022] Updated weights on worker 0-0, policy_version 223591 (0.00085) [2022-07-09 11:13:18,329][26022] Updated weights on worker 0-0, policy_version 223601 (0.00103) [2022-07-09 11:13:20,102][25689] Fps is (10 sec: 5757.4, 60 sec: 5736.3, 300 sec: 5752.3). Total num frames: 228975616. Throughput: 0: 5067.8. Samples: 228971826. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:20,103][25689] Avg episode reward: [(0, '-47.927')] [2022-07-09 11:13:20,223][26022] Updated weights on worker 0-0, policy_version 223611 (0.00088) [2022-07-09 11:13:21,933][26022] Updated weights on worker 0-0, policy_version 223621 (0.00088) [2022-07-09 11:13:23,759][26022] Updated weights on worker 0-0, policy_version 223631 (0.00081) [2022-07-09 11:13:25,108][25689] Fps is (10 sec: 5859.7, 60 sec: 5755.7, 300 sec: 5749.6). Total num frames: 229005312. Throughput: 0: 6047.6. Samples: 229006860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:25,109][25689] Avg episode reward: [(0, '-48.199')] [2022-07-09 11:13:25,628][26022] Updated weights on worker 0-0, policy_version 223641 (0.00088) [2022-07-09 11:13:27,344][26022] Updated weights on worker 0-0, policy_version 223651 (0.00085) [2022-07-09 11:13:29,219][26022] Updated weights on worker 0-0, policy_version 223661 (0.00087) [2022-07-09 11:13:30,161][25689] Fps is (10 sec: 5802.7, 60 sec: 5716.8, 300 sec: 5752.4). Total num frames: 229033984. Throughput: 0: 6034.6. Samples: 229041328. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:30,162][25689] Avg episode reward: [(0, '-47.971')] [2022-07-09 11:13:31,004][26022] Updated weights on worker 0-0, policy_version 223671 (0.00094) [2022-07-09 11:13:32,667][26022] Updated weights on worker 0-0, policy_version 223681 (0.00081) [2022-07-09 11:13:34,414][26022] Updated weights on worker 0-0, policy_version 223691 (0.00095) [2022-07-09 11:13:35,176][25689] Fps is (10 sec: 5797.8, 60 sec: 5752.9, 300 sec: 5753.7). Total num frames: 229063680. Throughput: 0: 5161.8. Samples: 229058586. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:35,176][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 11:13:36,348][26022] Updated weights on worker 0-0, policy_version 223701 (0.00087) [2022-07-09 11:13:38,113][26022] Updated weights on worker 0-0, policy_version 223711 (0.00090) [2022-07-09 11:13:39,896][26022] Updated weights on worker 0-0, policy_version 223721 (0.00087) [2022-07-09 11:13:40,267][25689] Fps is (10 sec: 5674.4, 60 sec: 5714.1, 300 sec: 5748.6). Total num frames: 229091328. Throughput: 0: 6003.9. Samples: 229092892. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:40,269][25689] Avg episode reward: [(0, '-47.718')] [2022-07-09 11:13:41,751][26022] Updated weights on worker 0-0, policy_version 223731 (0.00096) [2022-07-09 11:13:43,389][26022] Updated weights on worker 0-0, policy_version 223741 (0.00086) [2022-07-09 11:13:45,182][26022] Updated weights on worker 0-0, policy_version 223751 (0.00615) [2022-07-09 11:13:45,275][25689] Fps is (10 sec: 5677.9, 60 sec: 5747.5, 300 sec: 5748.5). Total num frames: 229121024. Throughput: 0: 6015.4. Samples: 229128172. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:45,277][25689] Avg episode reward: [(0, '-47.481')] [2022-07-09 11:13:46,942][26022] Updated weights on worker 0-0, policy_version 223761 (0.00099) [2022-07-09 11:13:48,499][26022] Updated weights on worker 0-0, policy_version 223771 (0.00079) [2022-07-09 11:13:50,318][25689] Fps is (10 sec: 5908.9, 60 sec: 5727.0, 300 sec: 5749.0). Total num frames: 229150720. Throughput: 0: 5176.9. Samples: 229145678. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:50,319][25689] Avg episode reward: [(0, '-47.541')] [2022-07-09 11:13:50,336][26022] Updated weights on worker 0-0, policy_version 223781 (0.00081) [2022-07-09 11:13:52,130][26022] Updated weights on worker 0-0, policy_version 223791 (0.00082) [2022-07-09 11:13:53,958][26022] Updated weights on worker 0-0, policy_version 223801 (0.00087) [2022-07-09 11:13:55,374][25689] Fps is (10 sec: 5779.7, 60 sec: 5739.3, 300 sec: 5753.8). Total num frames: 229179392. Throughput: 0: 6031.0. Samples: 229180402. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:13:55,375][25689] Avg episode reward: [(0, '-47.308')] [2022-07-09 11:13:55,719][26022] Updated weights on worker 0-0, policy_version 223811 (0.00051) [2022-07-09 11:13:57,393][26022] Updated weights on worker 0-0, policy_version 223821 (0.00091) [2022-07-09 11:13:59,182][26022] Updated weights on worker 0-0, policy_version 223831 (0.00083) [2022-07-09 11:14:00,442][25689] Fps is (10 sec: 5866.6, 60 sec: 5737.6, 300 sec: 5757.4). Total num frames: 229210112. Throughput: 0: 6080.9. Samples: 229215574. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:14:00,442][25689] Avg episode reward: [(0, '-47.673')] [2022-07-09 11:14:00,751][26022] Updated weights on worker 0-0, policy_version 223841 (0.00084) [2022-07-09 11:14:03,084][26022] Updated weights on worker 0-0, policy_version 223851 (0.00086) [2022-07-09 11:14:05,023][26022] Updated weights on worker 0-0, policy_version 223861 (0.00091) [2022-07-09 11:14:05,454][25689] Fps is (10 sec: 5587.2, 60 sec: 5755.0, 300 sec: 5754.4). Total num frames: 229235712. Throughput: 0: 5086.5. Samples: 229230812. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:14:05,456][25689] Avg episode reward: [(0, '-48.255')] [2022-07-09 11:14:06,638][26022] Updated weights on worker 0-0, policy_version 223871 (0.00095) [2022-07-09 11:14:08,515][26022] Updated weights on worker 0-0, policy_version 223881 (0.00096) [2022-07-09 11:14:10,232][26022] Updated weights on worker 0-0, policy_version 223891 (0.00090) [2022-07-09 11:14:10,464][25689] Fps is (10 sec: 5517.6, 60 sec: 5754.8, 300 sec: 5748.1). Total num frames: 229265408. Throughput: 0: 5931.3. Samples: 229265168. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:14:10,464][25689] Avg episode reward: [(0, '-48.458')] [2022-07-09 11:14:12,211][26022] Updated weights on worker 0-0, policy_version 223901 (0.00085) [2022-07-09 11:14:13,705][26022] Updated weights on worker 0-0, policy_version 223911 (0.00089) [2022-07-09 11:14:15,503][25689] Fps is (10 sec: 5808.7, 60 sec: 5751.9, 300 sec: 5751.7). Total num frames: 229294080. Throughput: 0: 5943.2. Samples: 229300030. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 11:14:15,503][25689] Avg episode reward: [(0, '-47.696')] [2022-07-09 11:14:15,563][26022] Updated weights on worker 0-0, policy_version 223921 (0.00087) [2022-07-09 11:14:17,211][26022] Updated weights on worker 0-0, policy_version 223931 (0.00091) [2022-07-09 11:14:19,174][26022] Updated weights on worker 0-0, policy_version 223941 (0.00081) [2022-07-09 11:14:20,608][25689] Fps is (10 sec: 5854.7, 60 sec: 5771.1, 300 sec: 5753.4). Total num frames: 229324800. Throughput: 0: 5038.1. Samples: 229317172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:20,608][25689] Avg episode reward: [(0, '-48.627')] [2022-07-09 11:14:20,822][26022] Updated weights on worker 0-0, policy_version 223951 (0.00084) [2022-07-09 11:14:22,592][26022] Updated weights on worker 0-0, policy_version 223961 (0.00086) [2022-07-09 11:14:24,391][26022] Updated weights on worker 0-0, policy_version 223971 (0.00096) [2022-07-09 11:14:25,624][25689] Fps is (10 sec: 5767.0, 60 sec: 5736.3, 300 sec: 5751.2). Total num frames: 229352448. Throughput: 0: 6009.6. Samples: 229352022. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:25,624][25689] Avg episode reward: [(0, '-47.742')] [2022-07-09 11:14:26,239][26022] Updated weights on worker 0-0, policy_version 223981 (0.00090) [2022-07-09 11:14:27,919][26022] Updated weights on worker 0-0, policy_version 223991 (0.00086) [2022-07-09 11:14:29,980][26022] Updated weights on worker 0-0, policy_version 224001 (0.00091) [2022-07-09 11:14:30,669][25689] Fps is (10 sec: 5597.6, 60 sec: 5737.0, 300 sec: 5748.5). Total num frames: 229381120. Throughput: 0: 6007.6. Samples: 229386556. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:30,670][25689] Avg episode reward: [(0, '-47.903')] [2022-07-09 11:14:31,401][26022] Updated weights on worker 0-0, policy_version 224011 (0.00087) [2022-07-09 11:14:33,318][26022] Updated weights on worker 0-0, policy_version 224021 (0.00084) [2022-07-09 11:14:35,055][26022] Updated weights on worker 0-0, policy_version 224031 (0.00087) [2022-07-09 11:14:35,714][25689] Fps is (10 sec: 5784.3, 60 sec: 5734.1, 300 sec: 5753.5). Total num frames: 229410816. Throughput: 0: 5155.5. Samples: 229404232. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:35,715][25689] Avg episode reward: [(0, '-47.159')] [2022-07-09 11:14:36,782][26022] Updated weights on worker 0-0, policy_version 224041 (0.00083) [2022-07-09 11:14:38,598][26022] Updated weights on worker 0-0, policy_version 224051 (0.00082) [2022-07-09 11:14:40,219][26022] Updated weights on worker 0-0, policy_version 224061 (0.00087) [2022-07-09 11:14:40,838][25689] Fps is (10 sec: 5840.6, 60 sec: 5764.8, 300 sec: 5751.4). Total num frames: 229440512. Throughput: 0: 6023.2. Samples: 229439022. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:40,839][25689] Avg episode reward: [(0, '-47.548')] [2022-07-09 11:14:42,009][26022] Updated weights on worker 0-0, policy_version 224071 (0.00090) [2022-07-09 11:14:43,747][26022] Updated weights on worker 0-0, policy_version 224081 (0.00098) [2022-07-09 11:14:45,541][26022] Updated weights on worker 0-0, policy_version 224091 (0.00081) [2022-07-09 11:14:45,873][25689] Fps is (10 sec: 5846.3, 60 sec: 5762.3, 300 sec: 5751.6). Total num frames: 229470208. Throughput: 0: 6045.8. Samples: 229474448. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:45,874][25689] Avg episode reward: [(0, '-47.512')] [2022-07-09 11:14:47,276][26022] Updated weights on worker 0-0, policy_version 224101 (0.00084) [2022-07-09 11:14:49,063][26022] Updated weights on worker 0-0, policy_version 224111 (0.00088) [2022-07-09 11:14:50,800][26022] Updated weights on worker 0-0, policy_version 224121 (0.00080) [2022-07-09 11:14:50,887][25689] Fps is (10 sec: 6011.9, 60 sec: 5781.9, 300 sec: 5754.9). Total num frames: 229500928. Throughput: 0: 6085.4. Samples: 229509592. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:50,888][25689] Avg episode reward: [(0, '-47.536')] [2022-07-09 11:14:51,124][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:14:51,140][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000224123_229501952.pth [2022-07-09 11:14:51,141][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000222097_227427328.pth [2022-07-09 11:14:51,141][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000224123_229501952.pth.milestone [2022-07-09 11:14:52,428][26022] Updated weights on worker 0-0, policy_version 224131 (0.00092) [2022-07-09 11:14:54,159][26022] Updated weights on worker 0-0, policy_version 224141 (0.00084) [2022-07-09 11:14:55,907][25689] Fps is (10 sec: 5817.4, 60 sec: 5768.5, 300 sec: 5752.4). Total num frames: 229528576. Throughput: 0: 6066.7. Samples: 229526732. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:14:55,908][25689] Avg episode reward: [(0, '-47.510')] [2022-07-09 11:14:56,210][26022] Updated weights on worker 0-0, policy_version 224151 (0.00093) [2022-07-09 11:14:57,887][26022] Updated weights on worker 0-0, policy_version 224161 (0.00092) [2022-07-09 11:14:59,644][26022] Updated weights on worker 0-0, policy_version 224171 (0.00086) [2022-07-09 11:15:00,961][25689] Fps is (10 sec: 5794.0, 60 sec: 5769.8, 300 sec: 5765.6). Total num frames: 229559296. Throughput: 0: 6104.0. Samples: 229561854. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:00,962][25689] Avg episode reward: [(0, '-47.747')] [2022-07-09 11:15:01,372][26022] Updated weights on worker 0-0, policy_version 224181 (0.00089) [2022-07-09 11:15:03,523][26022] Updated weights on worker 0-0, policy_version 224191 (0.00084) [2022-07-09 11:15:05,207][26022] Updated weights on worker 0-0, policy_version 224201 (0.00962) [2022-07-09 11:15:06,040][25689] Fps is (10 sec: 5557.8, 60 sec: 5763.4, 300 sec: 5750.5). Total num frames: 229584896. Throughput: 0: 5947.3. Samples: 229594386. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:06,041][25689] Avg episode reward: [(0, '-48.050')] [2022-07-09 11:15:07,143][26022] Updated weights on worker 0-0, policy_version 224211 (0.00091) [2022-07-09 11:15:08,889][26022] Updated weights on worker 0-0, policy_version 224221 (0.00099) [2022-07-09 11:15:10,547][26022] Updated weights on worker 0-0, policy_version 224231 (0.00083) [2022-07-09 11:15:11,056][25689] Fps is (10 sec: 5579.1, 60 sec: 5779.7, 300 sec: 5754.3). Total num frames: 229615616. Throughput: 0: 5061.2. Samples: 229611670. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:11,057][25689] Avg episode reward: [(0, '-48.760')] [2022-07-09 11:15:12,332][26022] Updated weights on worker 0-0, policy_version 224241 (0.00088) [2022-07-09 11:15:13,947][26022] Updated weights on worker 0-0, policy_version 224251 (0.00083) [2022-07-09 11:15:16,025][26022] Updated weights on worker 0-0, policy_version 224261 (0.00087) [2022-07-09 11:15:16,063][25689] Fps is (10 sec: 5823.8, 60 sec: 5765.9, 300 sec: 5755.5). Total num frames: 229643264. Throughput: 0: 5959.6. Samples: 229646852. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:16,063][25689] Avg episode reward: [(0, '-47.791')] [2022-07-09 11:15:17,526][26022] Updated weights on worker 0-0, policy_version 224271 (0.00086) [2022-07-09 11:15:19,542][26022] Updated weights on worker 0-0, policy_version 224281 (0.00091) [2022-07-09 11:15:21,092][26022] Updated weights on worker 0-0, policy_version 224291 (0.00100) [2022-07-09 11:15:21,183][25689] Fps is (10 sec: 5864.6, 60 sec: 5781.3, 300 sec: 5756.7). Total num frames: 229675008. Throughput: 0: 5923.6. Samples: 229681640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:21,184][25689] Avg episode reward: [(0, '-47.691')] [2022-07-09 11:15:22,922][26022] Updated weights on worker 0-0, policy_version 224301 (0.00087) [2022-07-09 11:15:24,772][26022] Updated weights on worker 0-0, policy_version 224311 (0.00083) [2022-07-09 11:15:26,233][25689] Fps is (10 sec: 5840.0, 60 sec: 5778.2, 300 sec: 5752.9). Total num frames: 229702656. Throughput: 0: 5188.4. Samples: 229699152. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:26,233][25689] Avg episode reward: [(0, '-48.237')] [2022-07-09 11:15:26,435][26022] Updated weights on worker 0-0, policy_version 224321 (0.00081) [2022-07-09 11:15:28,251][26022] Updated weights on worker 0-0, policy_version 224331 (0.00090) [2022-07-09 11:15:30,023][26022] Updated weights on worker 0-0, policy_version 224341 (0.00084) [2022-07-09 11:15:31,235][25689] Fps is (10 sec: 5501.0, 60 sec: 5765.3, 300 sec: 5746.3). Total num frames: 229730304. Throughput: 0: 6036.8. Samples: 229733488. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:31,236][25689] Avg episode reward: [(0, '-48.482')] [2022-07-09 11:15:31,883][26022] Updated weights on worker 0-0, policy_version 224351 (0.00089) [2022-07-09 11:15:33,603][26022] Updated weights on worker 0-0, policy_version 224361 (0.00087) [2022-07-09 11:15:35,315][26022] Updated weights on worker 0-0, policy_version 224371 (0.00082) [2022-07-09 11:15:36,253][25689] Fps is (10 sec: 5926.9, 60 sec: 5801.8, 300 sec: 5757.3). Total num frames: 229762048. Throughput: 0: 6020.5. Samples: 229768410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:36,254][25689] Avg episode reward: [(0, '-47.535')] [2022-07-09 11:15:37,179][26022] Updated weights on worker 0-0, policy_version 224381 (0.00087) [2022-07-09 11:15:39,021][26022] Updated weights on worker 0-0, policy_version 224391 (0.00084) [2022-07-09 11:15:40,710][26022] Updated weights on worker 0-0, policy_version 224401 (0.00097) [2022-07-09 11:15:41,337][25689] Fps is (10 sec: 5879.4, 60 sec: 5771.7, 300 sec: 5749.9). Total num frames: 229789696. Throughput: 0: 5152.5. Samples: 229785482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:41,338][25689] Avg episode reward: [(0, '-48.190')] [2022-07-09 11:15:42,419][26022] Updated weights on worker 0-0, policy_version 224411 (0.00079) [2022-07-09 11:15:44,186][26022] Updated weights on worker 0-0, policy_version 224421 (0.00085) [2022-07-09 11:15:45,926][26022] Updated weights on worker 0-0, policy_version 224431 (0.00087) [2022-07-09 11:15:46,391][25689] Fps is (10 sec: 5656.5, 60 sec: 5770.0, 300 sec: 5759.8). Total num frames: 229819392. Throughput: 0: 6006.8. Samples: 229820240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:46,391][25689] Avg episode reward: [(0, '-47.362')] [2022-07-09 11:15:47,875][26022] Updated weights on worker 0-0, policy_version 224441 (0.00088) [2022-07-09 11:15:49,398][26022] Updated weights on worker 0-0, policy_version 224451 (0.00090) [2022-07-09 11:15:51,413][25689] Fps is (10 sec: 5691.1, 60 sec: 5718.4, 300 sec: 5746.0). Total num frames: 229847040. Throughput: 0: 6005.8. Samples: 229854672. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:51,414][25689] Avg episode reward: [(0, '-47.569')] [2022-07-09 11:15:51,567][26022] Updated weights on worker 0-0, policy_version 224461 (0.00080) [2022-07-09 11:15:53,043][26022] Updated weights on worker 0-0, policy_version 224471 (0.00105) [2022-07-09 11:15:55,212][26022] Updated weights on worker 0-0, policy_version 224481 (0.00099) [2022-07-09 11:15:56,443][25689] Fps is (10 sec: 5806.7, 60 sec: 5768.2, 300 sec: 5753.5). Total num frames: 229877760. Throughput: 0: 5129.2. Samples: 229871966. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:15:56,443][25689] Avg episode reward: [(0, '-46.755')] [2022-07-09 11:15:56,531][26022] Updated weights on worker 0-0, policy_version 224491 (0.00088) [2022-07-09 11:15:58,541][26022] Updated weights on worker 0-0, policy_version 224501 (0.00088) [2022-07-09 11:16:00,135][26022] Updated weights on worker 0-0, policy_version 224511 (0.00086) [2022-07-09 11:16:01,527][25689] Fps is (10 sec: 5670.0, 60 sec: 5697.8, 300 sec: 5753.5). Total num frames: 229904384. Throughput: 0: 6012.4. Samples: 229906870. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:01,527][25689] Avg episode reward: [(0, '-46.447')] [2022-07-09 11:16:02,585][26022] Updated weights on worker 0-0, policy_version 224521 (0.00084) [2022-07-09 11:16:03,997][26022] Updated weights on worker 0-0, policy_version 224531 (0.00087) [2022-07-09 11:16:05,859][26022] Updated weights on worker 0-0, policy_version 224541 (0.00080) [2022-07-09 11:16:06,555][25689] Fps is (10 sec: 5671.0, 60 sec: 5787.2, 300 sec: 5757.9). Total num frames: 229935104. Throughput: 0: 5922.6. Samples: 229939662. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:06,555][25689] Avg episode reward: [(0, '-46.279')] [2022-07-09 11:16:07,666][26022] Updated weights on worker 0-0, policy_version 224551 (0.00080) [2022-07-09 11:16:09,515][26022] Updated weights on worker 0-0, policy_version 224561 (0.00085) [2022-07-09 11:16:11,403][26022] Updated weights on worker 0-0, policy_version 224571 (0.00081) [2022-07-09 11:16:11,558][25689] Fps is (10 sec: 5716.4, 60 sec: 5720.7, 300 sec: 5748.6). Total num frames: 229961728. Throughput: 0: 5063.3. Samples: 229956672. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:11,559][25689] Avg episode reward: [(0, '-48.606')] [2022-07-09 11:16:12,994][26022] Updated weights on worker 0-0, policy_version 224581 (0.00087) [2022-07-09 11:16:14,810][26022] Updated weights on worker 0-0, policy_version 224591 (0.00093) [2022-07-09 11:16:16,400][26022] Updated weights on worker 0-0, policy_version 224601 (0.00086) [2022-07-09 11:16:16,616][25689] Fps is (10 sec: 5699.6, 60 sec: 5766.6, 300 sec: 5749.7). Total num frames: 229992448. Throughput: 0: 5939.5. Samples: 229991782. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:16,616][25689] Avg episode reward: [(0, '-48.879')] [2022-07-09 11:16:18,393][26022] Updated weights on worker 0-0, policy_version 224611 (0.00087) [2022-07-09 11:16:19,739][26022] Updated weights on worker 0-0, policy_version 224621 (0.00089) [2022-07-09 11:16:21,739][25689] Fps is (10 sec: 5833.8, 60 sec: 5715.6, 300 sec: 5751.6). Total num frames: 230021120. Throughput: 0: 5921.6. Samples: 230026558. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:21,739][25689] Avg episode reward: [(0, '-48.886')] [2022-07-09 11:16:21,918][26022] Updated weights on worker 0-0, policy_version 224631 (0.00088) [2022-07-09 11:16:23,769][26022] Updated weights on worker 0-0, policy_version 224641 (0.00085) [2022-07-09 11:16:25,282][26022] Updated weights on worker 0-0, policy_version 224651 (0.00087) [2022-07-09 11:16:26,756][25689] Fps is (10 sec: 5655.3, 60 sec: 5735.6, 300 sec: 5751.4). Total num frames: 230049792. Throughput: 0: 5162.0. Samples: 230043940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:26,756][25689] Avg episode reward: [(0, '-48.978')] [2022-07-09 11:16:27,074][26022] Updated weights on worker 0-0, policy_version 224661 (0.00084) [2022-07-09 11:16:28,930][26022] Updated weights on worker 0-0, policy_version 224671 (0.00093) [2022-07-09 11:16:30,678][26022] Updated weights on worker 0-0, policy_version 224681 (0.00086) [2022-07-09 11:16:31,782][25689] Fps is (10 sec: 5709.6, 60 sec: 5750.2, 300 sec: 5748.1). Total num frames: 230078464. Throughput: 0: 6033.7. Samples: 230078698. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 11:16:31,783][25689] Avg episode reward: [(0, '-48.441')] [2022-07-09 11:16:32,527][26022] Updated weights on worker 0-0, policy_version 224691 (0.00096) [2022-07-09 11:16:34,259][26022] Updated weights on worker 0-0, policy_version 224701 (0.00096) [2022-07-09 11:16:35,870][26022] Updated weights on worker 0-0, policy_version 224711 (0.00089) [2022-07-09 11:16:36,851][25689] Fps is (10 sec: 5883.4, 60 sec: 5728.6, 300 sec: 5751.5). Total num frames: 230109184. Throughput: 0: 6012.4. Samples: 230113440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:16:36,851][25689] Avg episode reward: [(0, '-49.366')] [2022-07-09 11:16:37,962][26022] Updated weights on worker 0-0, policy_version 224721 (0.00094) [2022-07-09 11:16:39,488][26022] Updated weights on worker 0-0, policy_version 224731 (0.00086) [2022-07-09 11:16:41,486][26022] Updated weights on worker 0-0, policy_version 224741 (0.00087) [2022-07-09 11:16:41,898][25689] Fps is (10 sec: 5769.9, 60 sec: 5732.0, 300 sec: 5745.0). Total num frames: 230136832. Throughput: 0: 5146.2. Samples: 230130302. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:16:41,899][25689] Avg episode reward: [(0, '-48.412')] [2022-07-09 11:16:43,086][26022] Updated weights on worker 0-0, policy_version 224751 (0.00086) [2022-07-09 11:16:44,938][26022] Updated weights on worker 0-0, policy_version 224761 (0.00091) [2022-07-09 11:16:46,705][26022] Updated weights on worker 0-0, policy_version 224771 (0.00089) [2022-07-09 11:16:46,905][25689] Fps is (10 sec: 5601.5, 60 sec: 5719.5, 300 sec: 5742.1). Total num frames: 230165504. Throughput: 0: 5999.4. Samples: 230164824. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:16:46,906][25689] Avg episode reward: [(0, '-47.902')] [2022-07-09 11:16:48,582][26022] Updated weights on worker 0-0, policy_version 224781 (0.00089) [2022-07-09 11:16:50,307][26022] Updated weights on worker 0-0, policy_version 224791 (0.00092) [2022-07-09 11:16:51,304][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:16:51,316][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000224797_230192128.pth [2022-07-09 11:16:51,316][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000222774_228120576.pth [2022-07-09 11:16:51,937][25689] Fps is (10 sec: 5712.6, 60 sec: 5735.6, 300 sec: 5745.4). Total num frames: 230194176. Throughput: 0: 5988.5. Samples: 230199392. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:16:51,937][25689] Avg episode reward: [(0, '-47.287')] [2022-07-09 11:16:52,235][26022] Updated weights on worker 0-0, policy_version 224801 (0.00086) [2022-07-09 11:16:53,727][26022] Updated weights on worker 0-0, policy_version 224811 (0.00095) [2022-07-09 11:16:55,821][26022] Updated weights on worker 0-0, policy_version 224821 (0.00323) [2022-07-09 11:16:56,946][25689] Fps is (10 sec: 5813.0, 60 sec: 5720.6, 300 sec: 5746.2). Total num frames: 230223872. Throughput: 0: 5149.4. Samples: 230216920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:16:56,947][25689] Avg episode reward: [(0, '-48.586')] [2022-07-09 11:16:57,333][26022] Updated weights on worker 0-0, policy_version 224831 (0.00088) [2022-07-09 11:16:59,179][26022] Updated weights on worker 0-0, policy_version 224841 (0.00096) [2022-07-09 11:17:00,852][26022] Updated weights on worker 0-0, policy_version 224851 (0.00105) [2022-07-09 11:17:01,986][25689] Fps is (10 sec: 5706.3, 60 sec: 5741.7, 300 sec: 5749.3). Total num frames: 230251520. Throughput: 0: 6023.0. Samples: 230251288. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:01,986][25689] Avg episode reward: [(0, '-47.771')] [2022-07-09 11:17:03,092][26022] Updated weights on worker 0-0, policy_version 224861 (0.00083) [2022-07-09 11:17:04,922][26022] Updated weights on worker 0-0, policy_version 224871 (0.00081) [2022-07-09 11:17:06,726][26022] Updated weights on worker 0-0, policy_version 224881 (0.00082) [2022-07-09 11:17:06,991][25689] Fps is (10 sec: 5606.6, 60 sec: 5709.9, 300 sec: 5746.5). Total num frames: 230280192. Throughput: 0: 5931.8. Samples: 230283972. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:06,992][25689] Avg episode reward: [(0, '-47.121')] [2022-07-09 11:17:08,536][26022] Updated weights on worker 0-0, policy_version 224891 (0.00092) [2022-07-09 11:17:10,349][26022] Updated weights on worker 0-0, policy_version 224901 (0.00051) [2022-07-09 11:17:11,851][26022] Updated weights on worker 0-0, policy_version 224911 (0.00086) [2022-07-09 11:17:12,015][25689] Fps is (10 sec: 5717.5, 60 sec: 5741.9, 300 sec: 5746.7). Total num frames: 230308864. Throughput: 0: 5060.6. Samples: 230301002. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:12,016][25689] Avg episode reward: [(0, '-46.558')] [2022-07-09 11:17:13,827][26022] Updated weights on worker 0-0, policy_version 224921 (0.00086) [2022-07-09 11:17:15,564][26022] Updated weights on worker 0-0, policy_version 224931 (0.00080) [2022-07-09 11:17:17,027][25689] Fps is (10 sec: 5714.2, 60 sec: 5712.4, 300 sec: 5745.9). Total num frames: 230337536. Throughput: 0: 5918.6. Samples: 230335770. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:17,027][25689] Avg episode reward: [(0, '-46.176')] [2022-07-09 11:17:17,353][26022] Updated weights on worker 0-0, policy_version 224941 (0.00094) [2022-07-09 11:17:19,095][26022] Updated weights on worker 0-0, policy_version 224951 (0.00086) [2022-07-09 11:17:20,755][26022] Updated weights on worker 0-0, policy_version 224961 (0.00083) [2022-07-09 11:17:22,065][25689] Fps is (10 sec: 5705.8, 60 sec: 5720.3, 300 sec: 5745.7). Total num frames: 230366208. Throughput: 0: 5944.3. Samples: 230370648. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:22,066][25689] Avg episode reward: [(0, '-47.431')] [2022-07-09 11:17:22,641][26022] Updated weights on worker 0-0, policy_version 224971 (0.00085) [2022-07-09 11:17:24,484][26022] Updated weights on worker 0-0, policy_version 224981 (0.00094) [2022-07-09 11:17:25,970][26022] Updated weights on worker 0-0, policy_version 224991 (0.00106) [2022-07-09 11:17:27,071][25689] Fps is (10 sec: 5811.2, 60 sec: 5738.4, 300 sec: 5742.2). Total num frames: 230395904. Throughput: 0: 5196.1. Samples: 230388306. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:27,071][25689] Avg episode reward: [(0, '-47.682')] [2022-07-09 11:17:28,039][26022] Updated weights on worker 0-0, policy_version 225001 (0.00083) [2022-07-09 11:17:29,426][26022] Updated weights on worker 0-0, policy_version 225011 (0.00086) [2022-07-09 11:17:31,461][26022] Updated weights on worker 0-0, policy_version 225021 (0.00088) [2022-07-09 11:17:32,077][25689] Fps is (10 sec: 5932.3, 60 sec: 5757.3, 300 sec: 5749.7). Total num frames: 230425600. Throughput: 0: 6105.1. Samples: 230423480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:32,077][25689] Avg episode reward: [(0, '-48.216')] [2022-07-09 11:17:33,150][26022] Updated weights on worker 0-0, policy_version 225031 (0.00097) [2022-07-09 11:17:34,937][26022] Updated weights on worker 0-0, policy_version 225041 (0.00093) [2022-07-09 11:17:36,679][26022] Updated weights on worker 0-0, policy_version 225051 (0.00088) [2022-07-09 11:17:37,098][25689] Fps is (10 sec: 5821.1, 60 sec: 5727.9, 300 sec: 5746.6). Total num frames: 230454272. Throughput: 0: 6103.6. Samples: 230458276. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:37,098][25689] Avg episode reward: [(0, '-48.954')] [2022-07-09 11:17:38,625][26022] Updated weights on worker 0-0, policy_version 225061 (0.00081) [2022-07-09 11:17:40,184][26022] Updated weights on worker 0-0, policy_version 225071 (0.00084) [2022-07-09 11:17:42,136][26022] Updated weights on worker 0-0, policy_version 225081 (0.00087) [2022-07-09 11:17:42,196][25689] Fps is (10 sec: 5667.2, 60 sec: 5740.1, 300 sec: 5748.2). Total num frames: 230482944. Throughput: 0: 6076.6. Samples: 230492972. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:42,196][25689] Avg episode reward: [(0, '-48.894')] [2022-07-09 11:17:43,704][26022] Updated weights on worker 0-0, policy_version 225091 (0.01016) [2022-07-09 11:17:45,560][26022] Updated weights on worker 0-0, policy_version 225101 (0.00085) [2022-07-09 11:17:47,202][25689] Fps is (10 sec: 5776.6, 60 sec: 5757.1, 300 sec: 5744.8). Total num frames: 230512640. Throughput: 0: 6078.4. Samples: 230510672. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:47,202][25689] Avg episode reward: [(0, '-48.513')] [2022-07-09 11:17:47,357][26022] Updated weights on worker 0-0, policy_version 225111 (0.00080) [2022-07-09 11:17:48,890][26022] Updated weights on worker 0-0, policy_version 225121 (0.00099) [2022-07-09 11:17:50,826][26022] Updated weights on worker 0-0, policy_version 225131 (0.00086) [2022-07-09 11:17:52,205][25689] Fps is (10 sec: 6035.7, 60 sec: 5793.8, 300 sec: 5755.2). Total num frames: 230543360. Throughput: 0: 6077.9. Samples: 230545818. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:52,206][25689] Avg episode reward: [(0, '-48.039')] [2022-07-09 11:17:52,311][26022] Updated weights on worker 0-0, policy_version 225141 (0.00081) [2022-07-09 11:17:54,306][26022] Updated weights on worker 0-0, policy_version 225151 (0.00092) [2022-07-09 11:17:56,041][26022] Updated weights on worker 0-0, policy_version 225161 (0.00093) [2022-07-09 11:17:57,227][25689] Fps is (10 sec: 5822.4, 60 sec: 5758.7, 300 sec: 5745.4). Total num frames: 230571008. Throughput: 0: 6074.0. Samples: 230580540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:17:57,228][25689] Avg episode reward: [(0, '-47.519')] [2022-07-09 11:17:57,846][26022] Updated weights on worker 0-0, policy_version 225171 (0.00092) [2022-07-09 11:17:59,509][26022] Updated weights on worker 0-0, policy_version 225181 (0.00087) [2022-07-09 11:18:01,277][26022] Updated weights on worker 0-0, policy_version 225191 (0.00080) [2022-07-09 11:18:02,283][25689] Fps is (10 sec: 5385.5, 60 sec: 5740.1, 300 sec: 5751.5). Total num frames: 230597632. Throughput: 0: 5225.6. Samples: 230597940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:02,283][25689] Avg episode reward: [(0, '-46.801')] [2022-07-09 11:18:03,366][26022] Updated weights on worker 0-0, policy_version 225201 (0.00081) [2022-07-09 11:18:05,247][26022] Updated weights on worker 0-0, policy_version 225211 (0.00090) [2022-07-09 11:18:06,923][26022] Updated weights on worker 0-0, policy_version 225221 (0.00079) [2022-07-09 11:18:07,336][25689] Fps is (10 sec: 5672.6, 60 sec: 5769.6, 300 sec: 5754.1). Total num frames: 230628352. Throughput: 0: 5956.2. Samples: 230630592. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:07,336][25689] Avg episode reward: [(0, '-46.681')] [2022-07-09 11:18:08,813][26022] Updated weights on worker 0-0, policy_version 225231 (0.00099) [2022-07-09 11:18:10,608][26022] Updated weights on worker 0-0, policy_version 225241 (0.00092) [2022-07-09 11:18:12,240][26022] Updated weights on worker 0-0, policy_version 225251 (0.00079) [2022-07-09 11:18:12,408][25689] Fps is (10 sec: 5966.6, 60 sec: 5781.8, 300 sec: 5756.3). Total num frames: 230658048. Throughput: 0: 5910.0. Samples: 230665218. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:12,409][25689] Avg episode reward: [(0, '-46.852')] [2022-07-09 11:18:14,223][26022] Updated weights on worker 0-0, policy_version 225261 (0.00086) [2022-07-09 11:18:15,815][26022] Updated weights on worker 0-0, policy_version 225271 (0.00089) [2022-07-09 11:18:17,425][25689] Fps is (10 sec: 5683.3, 60 sec: 5764.4, 300 sec: 5751.6). Total num frames: 230685696. Throughput: 0: 5068.4. Samples: 230682916. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:17,426][25689] Avg episode reward: [(0, '-47.581')] [2022-07-09 11:18:17,582][26022] Updated weights on worker 0-0, policy_version 225281 (0.00086) [2022-07-09 11:18:19,337][26022] Updated weights on worker 0-0, policy_version 225291 (0.00077) [2022-07-09 11:18:21,037][26022] Updated weights on worker 0-0, policy_version 225301 (0.00082) [2022-07-09 11:18:22,503][25689] Fps is (10 sec: 5680.5, 60 sec: 5777.6, 300 sec: 5750.2). Total num frames: 230715392. Throughput: 0: 5938.5. Samples: 230718020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:22,504][25689] Avg episode reward: [(0, '-47.658')] [2022-07-09 11:18:22,845][26022] Updated weights on worker 0-0, policy_version 225311 (0.00081) [2022-07-09 11:18:24,526][26022] Updated weights on worker 0-0, policy_version 225321 (0.00088) [2022-07-09 11:18:26,162][26022] Updated weights on worker 0-0, policy_version 225331 (0.00084) [2022-07-09 11:18:27,555][25689] Fps is (10 sec: 5863.2, 60 sec: 5773.1, 300 sec: 5753.7). Total num frames: 230745088. Throughput: 0: 6039.8. Samples: 230752714. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:27,556][25689] Avg episode reward: [(0, '-48.070')] [2022-07-09 11:18:28,283][26022] Updated weights on worker 0-0, policy_version 225341 (0.00089) [2022-07-09 11:18:29,871][26022] Updated weights on worker 0-0, policy_version 225351 (0.00096) [2022-07-09 11:18:31,705][26022] Updated weights on worker 0-0, policy_version 225361 (0.00091) [2022-07-09 11:18:32,595][25689] Fps is (10 sec: 5885.5, 60 sec: 5770.0, 300 sec: 5753.2). Total num frames: 230774784. Throughput: 0: 5196.9. Samples: 230770124. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:32,595][25689] Avg episode reward: [(0, '-48.333')] [2022-07-09 11:18:33,522][26022] Updated weights on worker 0-0, policy_version 225371 (0.00103) [2022-07-09 11:18:35,119][26022] Updated weights on worker 0-0, policy_version 225381 (0.00088) [2022-07-09 11:18:37,086][26022] Updated weights on worker 0-0, policy_version 225391 (0.00090) [2022-07-09 11:18:37,605][25689] Fps is (10 sec: 5706.0, 60 sec: 5754.0, 300 sec: 5754.7). Total num frames: 230802432. Throughput: 0: 6050.1. Samples: 230805004. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:37,605][25689] Avg episode reward: [(0, '-48.128')] [2022-07-09 11:18:38,619][26022] Updated weights on worker 0-0, policy_version 225401 (0.00079) [2022-07-09 11:18:40,590][26022] Updated weights on worker 0-0, policy_version 225411 (0.00087) [2022-07-09 11:18:42,273][26022] Updated weights on worker 0-0, policy_version 225421 (0.00084) [2022-07-09 11:18:42,696][25689] Fps is (10 sec: 5778.2, 60 sec: 5788.5, 300 sec: 5756.6). Total num frames: 230833152. Throughput: 0: 6025.8. Samples: 230839696. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:42,696][25689] Avg episode reward: [(0, '-47.669')] [2022-07-09 11:18:44,093][26022] Updated weights on worker 0-0, policy_version 225431 (0.00088) [2022-07-09 11:18:45,820][26022] Updated weights on worker 0-0, policy_version 225441 (0.00085) [2022-07-09 11:18:47,530][26022] Updated weights on worker 0-0, policy_version 225451 (0.00084) [2022-07-09 11:18:47,730][25689] Fps is (10 sec: 5967.0, 60 sec: 5785.9, 300 sec: 5756.8). Total num frames: 230862848. Throughput: 0: 5179.3. Samples: 230857206. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:47,735][25689] Avg episode reward: [(0, '-47.218')] [2022-07-09 11:18:49,500][26022] Updated weights on worker 0-0, policy_version 225461 (0.00087) [2022-07-09 11:18:51,210][26022] Updated weights on worker 0-0, policy_version 225471 (0.00088) [2022-07-09 11:18:51,426][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:18:51,438][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000225472_230883328.pth [2022-07-09 11:18:51,438][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000223448_228810752.pth [2022-07-09 11:18:52,779][25689] Fps is (10 sec: 5687.2, 60 sec: 5730.8, 300 sec: 5753.4). Total num frames: 230890496. Throughput: 0: 6031.4. Samples: 230891864. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2022-07-09 11:18:52,781][25689] Avg episode reward: [(0, '-47.045')] [2022-07-09 11:18:52,916][26022] Updated weights on worker 0-0, policy_version 225481 (0.00088) [2022-07-09 11:18:54,750][26022] Updated weights on worker 0-0, policy_version 225491 (0.00090) [2022-07-09 11:18:56,369][26022] Updated weights on worker 0-0, policy_version 225501 (0.00092) [2022-07-09 11:18:57,784][25689] Fps is (10 sec: 5601.6, 60 sec: 5749.2, 300 sec: 5747.8). Total num frames: 230919168. Throughput: 0: 6029.8. Samples: 230926682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:18:57,786][25689] Avg episode reward: [(0, '-47.736')] [2022-07-09 11:18:58,264][26022] Updated weights on worker 0-0, policy_version 225511 (0.00092) [2022-07-09 11:19:00,151][26022] Updated weights on worker 0-0, policy_version 225521 (0.00090) [2022-07-09 11:19:02,103][26022] Updated weights on worker 0-0, policy_version 225531 (0.00055) [2022-07-09 11:19:02,931][25689] Fps is (10 sec: 5648.6, 60 sec: 5774.4, 300 sec: 5755.5). Total num frames: 230947840. Throughput: 0: 5149.6. Samples: 230943898. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:02,933][25689] Avg episode reward: [(0, '-47.609')] [2022-07-09 11:19:04,191][26022] Updated weights on worker 0-0, policy_version 225541 (0.00089) [2022-07-09 11:19:05,571][26022] Updated weights on worker 0-0, policy_version 225551 (0.00086) [2022-07-09 11:19:07,751][26022] Updated weights on worker 0-0, policy_version 225561 (0.00085) [2022-07-09 11:19:07,967][25689] Fps is (10 sec: 5732.2, 60 sec: 5759.2, 300 sec: 5755.0). Total num frames: 230977536. Throughput: 0: 5918.8. Samples: 230976984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:07,967][25689] Avg episode reward: [(0, '-47.329')] [2022-07-09 11:19:09,040][26022] Updated weights on worker 0-0, policy_version 225571 (0.00084) [2022-07-09 11:19:11,029][26022] Updated weights on worker 0-0, policy_version 225581 (0.00087) [2022-07-09 11:19:12,665][26022] Updated weights on worker 0-0, policy_version 225591 (0.00086) [2022-07-09 11:19:13,045][25689] Fps is (10 sec: 5669.5, 60 sec: 5724.8, 300 sec: 5750.8). Total num frames: 231005184. Throughput: 0: 5897.1. Samples: 231011378. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:13,047][25689] Avg episode reward: [(0, '-48.243')] [2022-07-09 11:19:14,770][26022] Updated weights on worker 0-0, policy_version 225601 (0.00084) [2022-07-09 11:19:16,307][26022] Updated weights on worker 0-0, policy_version 225611 (0.00092) [2022-07-09 11:19:18,064][25689] Fps is (10 sec: 5678.9, 60 sec: 5758.4, 300 sec: 5749.0). Total num frames: 231034880. Throughput: 0: 5036.9. Samples: 231028832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:18,065][25689] Avg episode reward: [(0, '-47.747')] [2022-07-09 11:19:18,186][26022] Updated weights on worker 0-0, policy_version 225621 (0.00087) [2022-07-09 11:19:19,673][26022] Updated weights on worker 0-0, policy_version 225631 (0.00086) [2022-07-09 11:19:21,691][26022] Updated weights on worker 0-0, policy_version 225641 (0.00096) [2022-07-09 11:19:23,131][25689] Fps is (10 sec: 5889.0, 60 sec: 5759.5, 300 sec: 5754.9). Total num frames: 231064576. Throughput: 0: 5928.7. Samples: 231063656. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:23,131][25689] Avg episode reward: [(0, '-47.590')] [2022-07-09 11:19:23,395][26022] Updated weights on worker 0-0, policy_version 225651 (0.00084) [2022-07-09 11:19:25,087][26022] Updated weights on worker 0-0, policy_version 225661 (0.00088) [2022-07-09 11:19:26,798][26022] Updated weights on worker 0-0, policy_version 225671 (0.00085) [2022-07-09 11:19:28,154][25689] Fps is (10 sec: 5886.3, 60 sec: 5762.2, 300 sec: 5758.8). Total num frames: 231094272. Throughput: 0: 6033.7. Samples: 231098790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:28,155][25689] Avg episode reward: [(0, '-47.265')] [2022-07-09 11:19:28,691][26022] Updated weights on worker 0-0, policy_version 225681 (0.00092) [2022-07-09 11:19:30,374][26022] Updated weights on worker 0-0, policy_version 225691 (0.00079) [2022-07-09 11:19:32,235][26022] Updated weights on worker 0-0, policy_version 225701 (0.00083) [2022-07-09 11:19:33,167][25689] Fps is (10 sec: 5815.8, 60 sec: 5747.9, 300 sec: 5756.0). Total num frames: 231122944. Throughput: 0: 5213.7. Samples: 231116284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:33,167][25689] Avg episode reward: [(0, '-48.221')] [2022-07-09 11:19:33,763][26022] Updated weights on worker 0-0, policy_version 225711 (0.00081) [2022-07-09 11:19:35,806][26022] Updated weights on worker 0-0, policy_version 225721 (0.00092) [2022-07-09 11:19:37,385][26022] Updated weights on worker 0-0, policy_version 225731 (0.00087) [2022-07-09 11:19:38,189][25689] Fps is (10 sec: 5816.9, 60 sec: 5780.6, 300 sec: 5757.9). Total num frames: 231152640. Throughput: 0: 6088.4. Samples: 231151356. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:38,189][25689] Avg episode reward: [(0, '-47.882')] [2022-07-09 11:19:39,380][26022] Updated weights on worker 0-0, policy_version 225741 (0.00091) [2022-07-09 11:19:40,943][26022] Updated weights on worker 0-0, policy_version 225751 (0.00083) [2022-07-09 11:19:42,869][26022] Updated weights on worker 0-0, policy_version 225761 (0.00087) [2022-07-09 11:19:43,282][25689] Fps is (10 sec: 5770.1, 60 sec: 5746.5, 300 sec: 5753.4). Total num frames: 231181312. Throughput: 0: 6063.1. Samples: 231185838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:43,283][25689] Avg episode reward: [(0, '-47.412')] [2022-07-09 11:19:44,666][26022] Updated weights on worker 0-0, policy_version 225771 (0.00084) [2022-07-09 11:19:46,292][26022] Updated weights on worker 0-0, policy_version 225781 (0.00090) [2022-07-09 11:19:48,069][26022] Updated weights on worker 0-0, policy_version 225791 (0.00091) [2022-07-09 11:19:48,287][25689] Fps is (10 sec: 5779.9, 60 sec: 5749.3, 300 sec: 5750.1). Total num frames: 231211008. Throughput: 0: 5200.4. Samples: 231203488. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:48,288][25689] Avg episode reward: [(0, '-47.164')] [2022-07-09 11:19:49,887][26022] Updated weights on worker 0-0, policy_version 225801 (0.00087) [2022-07-09 11:19:51,599][26022] Updated weights on worker 0-0, policy_version 225811 (0.00083) [2022-07-09 11:19:53,339][25689] Fps is (10 sec: 5804.0, 60 sec: 5765.9, 300 sec: 5752.9). Total num frames: 231239680. Throughput: 0: 6054.5. Samples: 231238416. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:53,339][25689] Avg episode reward: [(0, '-47.251')] [2022-07-09 11:19:53,361][26022] Updated weights on worker 0-0, policy_version 225821 (0.00084) [2022-07-09 11:19:54,906][26022] Updated weights on worker 0-0, policy_version 225831 (0.00083) [2022-07-09 11:19:56,982][26022] Updated weights on worker 0-0, policy_version 225841 (0.00108) [2022-07-09 11:19:58,341][25689] Fps is (10 sec: 5907.6, 60 sec: 5800.1, 300 sec: 5753.9). Total num frames: 231270400. Throughput: 0: 6074.9. Samples: 231273776. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:19:58,341][25689] Avg episode reward: [(0, '-46.613')] [2022-07-09 11:19:58,452][26022] Updated weights on worker 0-0, policy_version 225851 (0.00085) [2022-07-09 11:20:00,422][26022] Updated weights on worker 0-0, policy_version 225861 (0.00086) [2022-07-09 11:20:02,376][26022] Updated weights on worker 0-0, policy_version 225871 (0.00083) [2022-07-09 11:20:03,386][25689] Fps is (10 sec: 5707.3, 60 sec: 5775.9, 300 sec: 5758.0). Total num frames: 231297024. Throughput: 0: 6015.6. Samples: 231306776. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:03,387][25689] Avg episode reward: [(0, '-47.451')] [2022-07-09 11:20:04,115][26022] Updated weights on worker 0-0, policy_version 225881 (0.00086) [2022-07-09 11:20:05,795][26022] Updated weights on worker 0-0, policy_version 225891 (0.00089) [2022-07-09 11:20:07,643][26022] Updated weights on worker 0-0, policy_version 225901 (0.00090) [2022-07-09 11:20:08,425][25689] Fps is (10 sec: 5686.6, 60 sec: 5792.6, 300 sec: 5757.6). Total num frames: 231327744. Throughput: 0: 6003.9. Samples: 231324392. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:08,425][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 11:20:09,376][26022] Updated weights on worker 0-0, policy_version 225911 (0.00095) [2022-07-09 11:20:11,019][26022] Updated weights on worker 0-0, policy_version 225921 (0.00086) [2022-07-09 11:20:13,054][26022] Updated weights on worker 0-0, policy_version 225931 (0.00082) [2022-07-09 11:20:13,444][25689] Fps is (10 sec: 5803.2, 60 sec: 5798.3, 300 sec: 5757.3). Total num frames: 231355392. Throughput: 0: 6022.9. Samples: 231359508. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:13,445][25689] Avg episode reward: [(0, '-46.873')] [2022-07-09 11:20:14,648][26022] Updated weights on worker 0-0, policy_version 225941 (0.00093) [2022-07-09 11:20:16,522][26022] Updated weights on worker 0-0, policy_version 225951 (0.00087) [2022-07-09 11:20:18,380][26022] Updated weights on worker 0-0, policy_version 225961 (0.00083) [2022-07-09 11:20:18,467][25689] Fps is (10 sec: 5608.1, 60 sec: 5780.9, 300 sec: 5748.9). Total num frames: 231384064. Throughput: 0: 5993.2. Samples: 231394400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:18,468][25689] Avg episode reward: [(0, '-47.510')] [2022-07-09 11:20:19,825][26022] Updated weights on worker 0-0, policy_version 225971 (0.00095) [2022-07-09 11:20:21,912][26022] Updated weights on worker 0-0, policy_version 225981 (0.00090) [2022-07-09 11:20:23,455][26022] Updated weights on worker 0-0, policy_version 225991 (0.00086) [2022-07-09 11:20:23,510][25689] Fps is (10 sec: 6002.4, 60 sec: 5817.1, 300 sec: 5762.8). Total num frames: 231415808. Throughput: 0: 5231.4. Samples: 231412048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:23,510][25689] Avg episode reward: [(0, '-48.623')] [2022-07-09 11:20:25,346][26022] Updated weights on worker 0-0, policy_version 226001 (0.00089) [2022-07-09 11:20:27,005][26022] Updated weights on worker 0-0, policy_version 226011 (0.00091) [2022-07-09 11:20:28,525][25689] Fps is (10 sec: 5904.8, 60 sec: 5784.0, 300 sec: 5762.6). Total num frames: 231443456. Throughput: 0: 6090.7. Samples: 231446820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:28,530][25689] Avg episode reward: [(0, '-48.782')] [2022-07-09 11:20:28,852][26022] Updated weights on worker 0-0, policy_version 226021 (0.00080) [2022-07-09 11:20:30,612][26022] Updated weights on worker 0-0, policy_version 226031 (0.00083) [2022-07-09 11:20:32,397][26022] Updated weights on worker 0-0, policy_version 226041 (0.00628) [2022-07-09 11:20:33,543][25689] Fps is (10 sec: 5613.4, 60 sec: 5783.5, 300 sec: 5752.2). Total num frames: 231472128. Throughput: 0: 6073.7. Samples: 231481580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:33,543][25689] Avg episode reward: [(0, '-48.187')] [2022-07-09 11:20:34,186][26022] Updated weights on worker 0-0, policy_version 226051 (0.00092) [2022-07-09 11:20:36,085][26022] Updated weights on worker 0-0, policy_version 226061 (0.00087) [2022-07-09 11:20:37,606][26022] Updated weights on worker 0-0, policy_version 226071 (0.00083) [2022-07-09 11:20:38,571][25689] Fps is (10 sec: 5810.5, 60 sec: 5782.9, 300 sec: 5760.2). Total num frames: 231501824. Throughput: 0: 5191.4. Samples: 231498766. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:38,571][25689] Avg episode reward: [(0, '-48.653')] [2022-07-09 11:20:39,689][26022] Updated weights on worker 0-0, policy_version 226081 (0.00082) [2022-07-09 11:20:41,223][26022] Updated weights on worker 0-0, policy_version 226091 (0.00086) [2022-07-09 11:20:43,075][26022] Updated weights on worker 0-0, policy_version 226101 (0.00082) [2022-07-09 11:20:43,621][25689] Fps is (10 sec: 5892.9, 60 sec: 5804.0, 300 sec: 5760.3). Total num frames: 231531520. Throughput: 0: 6039.7. Samples: 231533516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:43,622][25689] Avg episode reward: [(0, '-49.221')] [2022-07-09 11:20:44,863][26022] Updated weights on worker 0-0, policy_version 226111 (0.00085) [2022-07-09 11:20:46,644][26022] Updated weights on worker 0-0, policy_version 226121 (0.00086) [2022-07-09 11:20:48,467][26022] Updated weights on worker 0-0, policy_version 226131 (0.00085) [2022-07-09 11:20:48,645][25689] Fps is (10 sec: 5692.3, 60 sec: 5768.3, 300 sec: 5760.3). Total num frames: 231559168. Throughput: 0: 6019.5. Samples: 231567926. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:48,645][25689] Avg episode reward: [(0, '-49.572')] [2022-07-09 11:20:50,040][26022] Updated weights on worker 0-0, policy_version 226141 (0.00089) [2022-07-09 11:20:51,496][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:20:51,510][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000226148_231575552.pth [2022-07-09 11:20:51,511][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000224123_229501952.pth [2022-07-09 11:20:51,930][26022] Updated weights on worker 0-0, policy_version 226151 (0.00083) [2022-07-09 11:20:53,646][25689] Fps is (10 sec: 5720.0, 60 sec: 5790.1, 300 sec: 5757.3). Total num frames: 231588864. Throughput: 0: 5153.8. Samples: 231585190. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:53,647][25689] Avg episode reward: [(0, '-47.187')] [2022-07-09 11:20:53,647][26022] Updated weights on worker 0-0, policy_version 226161 (0.00086) [2022-07-09 11:20:55,516][26022] Updated weights on worker 0-0, policy_version 226171 (0.00073) [2022-07-09 11:20:57,148][26022] Updated weights on worker 0-0, policy_version 226181 (0.00091) [2022-07-09 11:20:58,666][25689] Fps is (10 sec: 5620.1, 60 sec: 5720.5, 300 sec: 5758.6). Total num frames: 231615488. Throughput: 0: 6018.0. Samples: 231619698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:20:58,667][25689] Avg episode reward: [(0, '-46.853')] [2022-07-09 11:20:59,197][26022] Updated weights on worker 0-0, policy_version 226191 (0.00086) [2022-07-09 11:21:00,652][26022] Updated weights on worker 0-0, policy_version 226201 (0.00094) [2022-07-09 11:21:03,107][26022] Updated weights on worker 0-0, policy_version 226211 (0.00086) [2022-07-09 11:21:03,781][25689] Fps is (10 sec: 5456.2, 60 sec: 5747.8, 300 sec: 5750.0). Total num frames: 231644160. Throughput: 0: 5888.2. Samples: 231652220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:21:03,783][25689] Avg episode reward: [(0, '-47.023')] [2022-07-09 11:21:04,564][26022] Updated weights on worker 0-0, policy_version 226221 (0.00081) [2022-07-09 11:21:06,572][26022] Updated weights on worker 0-0, policy_version 226231 (0.00084) [2022-07-09 11:21:08,304][26022] Updated weights on worker 0-0, policy_version 226241 (0.00080) [2022-07-09 11:21:08,810][25689] Fps is (10 sec: 5754.0, 60 sec: 5731.7, 300 sec: 5759.9). Total num frames: 231673856. Throughput: 0: 5048.0. Samples: 231669720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 11:21:08,810][25689] Avg episode reward: [(0, '-46.702')] [2022-07-09 11:21:09,975][26022] Updated weights on worker 0-0, policy_version 226251 (0.00049) [2022-07-09 11:21:11,787][26022] Updated weights on worker 0-0, policy_version 226261 (0.00081) [2022-07-09 11:21:13,739][26022] Updated weights on worker 0-0, policy_version 226271 (0.00086) [2022-07-09 11:21:13,847][25689] Fps is (10 sec: 5696.8, 60 sec: 5730.1, 300 sec: 5749.9). Total num frames: 231701504. Throughput: 0: 5896.2. Samples: 231704296. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:13,847][25689] Avg episode reward: [(0, '-45.230')] [2022-07-09 11:21:15,283][26022] Updated weights on worker 0-0, policy_version 226281 (0.00083) [2022-07-09 11:21:17,348][26022] Updated weights on worker 0-0, policy_version 226291 (0.00084) [2022-07-09 11:21:18,862][25689] Fps is (10 sec: 5704.5, 60 sec: 5747.8, 300 sec: 5755.5). Total num frames: 231731200. Throughput: 0: 5895.2. Samples: 231738760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:18,863][25689] Avg episode reward: [(0, '-45.772')] [2022-07-09 11:21:19,150][26022] Updated weights on worker 0-0, policy_version 226301 (0.00096) [2022-07-09 11:21:20,806][26022] Updated weights on worker 0-0, policy_version 226311 (0.00079) [2022-07-09 11:21:22,683][26022] Updated weights on worker 0-0, policy_version 226321 (0.00094) [2022-07-09 11:21:23,933][25689] Fps is (10 sec: 5787.0, 60 sec: 5694.2, 300 sec: 5754.4). Total num frames: 231759872. Throughput: 0: 5153.1. Samples: 231756066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:23,935][25689] Avg episode reward: [(0, '-46.848')] [2022-07-09 11:21:24,153][26022] Updated weights on worker 0-0, policy_version 226331 (0.00089) [2022-07-09 11:21:26,262][26022] Updated weights on worker 0-0, policy_version 226341 (0.00093) [2022-07-09 11:21:27,723][26022] Updated weights on worker 0-0, policy_version 226351 (0.00079) [2022-07-09 11:21:28,952][25689] Fps is (10 sec: 5784.5, 60 sec: 5727.8, 300 sec: 5758.0). Total num frames: 231789568. Throughput: 0: 6013.5. Samples: 231790850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:28,954][25689] Avg episode reward: [(0, '-47.873')] [2022-07-09 11:21:29,690][26022] Updated weights on worker 0-0, policy_version 226361 (0.00087) [2022-07-09 11:21:31,468][26022] Updated weights on worker 0-0, policy_version 226371 (0.00087) [2022-07-09 11:21:33,254][26022] Updated weights on worker 0-0, policy_version 226381 (0.00080) [2022-07-09 11:21:33,982][25689] Fps is (10 sec: 5808.3, 60 sec: 5726.6, 300 sec: 5751.8). Total num frames: 231818240. Throughput: 0: 6025.4. Samples: 231825618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:33,987][25689] Avg episode reward: [(0, '-49.049')] [2022-07-09 11:21:35,019][26022] Updated weights on worker 0-0, policy_version 226391 (0.00084) [2022-07-09 11:21:36,668][26022] Updated weights on worker 0-0, policy_version 226401 (0.00086) [2022-07-09 11:21:38,479][26022] Updated weights on worker 0-0, policy_version 226411 (0.00086) [2022-07-09 11:21:39,069][25689] Fps is (10 sec: 5668.4, 60 sec: 5704.1, 300 sec: 5754.5). Total num frames: 231846912. Throughput: 0: 5161.9. Samples: 231843066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:39,071][25689] Avg episode reward: [(0, '-49.094')] [2022-07-09 11:21:40,208][26022] Updated weights on worker 0-0, policy_version 226421 (0.00085) [2022-07-09 11:21:42,143][26022] Updated weights on worker 0-0, policy_version 226431 (0.00088) [2022-07-09 11:21:43,725][26022] Updated weights on worker 0-0, policy_version 226441 (0.00086) [2022-07-09 11:21:44,136][25689] Fps is (10 sec: 5748.1, 60 sec: 5702.5, 300 sec: 5756.8). Total num frames: 231876608. Throughput: 0: 6032.3. Samples: 231877938. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:44,137][25689] Avg episode reward: [(0, '-49.772')] [2022-07-09 11:21:45,758][26022] Updated weights on worker 0-0, policy_version 226451 (0.00085) [2022-07-09 11:21:47,351][26022] Updated weights on worker 0-0, policy_version 226461 (0.00086) [2022-07-09 11:21:49,154][25689] Fps is (10 sec: 5787.5, 60 sec: 5719.9, 300 sec: 5757.1). Total num frames: 231905280. Throughput: 0: 6016.4. Samples: 231912392. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:49,155][25689] Avg episode reward: [(0, '-49.552')] [2022-07-09 11:21:49,218][26022] Updated weights on worker 0-0, policy_version 226471 (0.00084) [2022-07-09 11:21:50,925][26022] Updated weights on worker 0-0, policy_version 226481 (0.01117) [2022-07-09 11:21:52,717][26022] Updated weights on worker 0-0, policy_version 226491 (0.00082) [2022-07-09 11:21:54,202][25689] Fps is (10 sec: 5697.3, 60 sec: 5698.7, 300 sec: 5752.9). Total num frames: 231933952. Throughput: 0: 5140.9. Samples: 231929564. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:54,207][25689] Avg episode reward: [(0, '-49.331')] [2022-07-09 11:21:54,671][26022] Updated weights on worker 0-0, policy_version 226501 (0.00090) [2022-07-09 11:21:56,275][26022] Updated weights on worker 0-0, policy_version 226511 (0.00086) [2022-07-09 11:21:58,002][26022] Updated weights on worker 0-0, policy_version 226521 (0.00091) [2022-07-09 11:21:59,282][25689] Fps is (10 sec: 5763.0, 60 sec: 5743.7, 300 sec: 5759.0). Total num frames: 231963648. Throughput: 0: 6003.8. Samples: 231964422. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:21:59,283][25689] Avg episode reward: [(0, '-49.096')] [2022-07-09 11:21:59,690][26022] Updated weights on worker 0-0, policy_version 226531 (0.00089) [2022-07-09 11:22:01,562][26022] Updated weights on worker 0-0, policy_version 226541 (0.00080) [2022-07-09 11:22:03,703][26022] Updated weights on worker 0-0, policy_version 226551 (0.00083) [2022-07-09 11:22:04,348][25689] Fps is (10 sec: 5651.4, 60 sec: 5731.4, 300 sec: 5754.4). Total num frames: 231991296. Throughput: 0: 5886.0. Samples: 231996906. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:04,349][25689] Avg episode reward: [(0, '-48.368')] [2022-07-09 11:22:05,598][26022] Updated weights on worker 0-0, policy_version 226561 (0.00082) [2022-07-09 11:22:07,225][26022] Updated weights on worker 0-0, policy_version 226571 (0.00086) [2022-07-09 11:22:09,178][26022] Updated weights on worker 0-0, policy_version 226581 (0.00084) [2022-07-09 11:22:09,361][25689] Fps is (10 sec: 5486.5, 60 sec: 5699.1, 300 sec: 5751.2). Total num frames: 232018944. Throughput: 0: 5044.6. Samples: 232014326. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:09,362][25689] Avg episode reward: [(0, '-47.186')] [2022-07-09 11:22:10,579][26022] Updated weights on worker 0-0, policy_version 226591 (0.00094) [2022-07-09 11:22:12,659][26022] Updated weights on worker 0-0, policy_version 226601 (0.00089) [2022-07-09 11:22:14,167][26022] Updated weights on worker 0-0, policy_version 226611 (0.00085) [2022-07-09 11:22:14,364][25689] Fps is (10 sec: 5827.9, 60 sec: 5753.1, 300 sec: 5758.2). Total num frames: 232049664. Throughput: 0: 5948.1. Samples: 232049490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:14,364][25689] Avg episode reward: [(0, '-48.230')] [2022-07-09 11:22:16,127][26022] Updated weights on worker 0-0, policy_version 226621 (0.00085) [2022-07-09 11:22:17,867][26022] Updated weights on worker 0-0, policy_version 226631 (0.00093) [2022-07-09 11:22:19,409][25689] Fps is (10 sec: 5910.8, 60 sec: 5733.3, 300 sec: 5758.1). Total num frames: 232078336. Throughput: 0: 5944.8. Samples: 232084072. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:19,409][25689] Avg episode reward: [(0, '-47.495')] [2022-07-09 11:22:19,715][26022] Updated weights on worker 0-0, policy_version 226641 (0.00086) [2022-07-09 11:22:21,403][26022] Updated weights on worker 0-0, policy_version 226651 (0.00087) [2022-07-09 11:22:23,063][26022] Updated weights on worker 0-0, policy_version 226661 (0.00080) [2022-07-09 11:22:24,519][25689] Fps is (10 sec: 5646.8, 60 sec: 5729.7, 300 sec: 5752.6). Total num frames: 232107008. Throughput: 0: 5176.1. Samples: 232101310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:24,519][25689] Avg episode reward: [(0, '-48.552')] [2022-07-09 11:22:24,927][26022] Updated weights on worker 0-0, policy_version 226671 (0.00089) [2022-07-09 11:22:26,859][26022] Updated weights on worker 0-0, policy_version 226681 (0.00083) [2022-07-09 11:22:28,346][26022] Updated weights on worker 0-0, policy_version 226691 (0.00081) [2022-07-09 11:22:29,582][25689] Fps is (10 sec: 5838.1, 60 sec: 5742.4, 300 sec: 5755.0). Total num frames: 232137728. Throughput: 0: 6034.6. Samples: 232136354. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:29,583][25689] Avg episode reward: [(0, '-48.928')] [2022-07-09 11:22:30,270][26022] Updated weights on worker 0-0, policy_version 226701 (0.00096) [2022-07-09 11:22:31,890][26022] Updated weights on worker 0-0, policy_version 226711 (0.00098) [2022-07-09 11:22:33,760][26022] Updated weights on worker 0-0, policy_version 226721 (0.00089) [2022-07-09 11:22:34,604][25689] Fps is (10 sec: 5889.1, 60 sec: 5743.2, 300 sec: 5754.9). Total num frames: 232166400. Throughput: 0: 6003.3. Samples: 232170998. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:34,604][25689] Avg episode reward: [(0, '-49.573')] [2022-07-09 11:22:35,631][26022] Updated weights on worker 0-0, policy_version 226731 (0.00084) [2022-07-09 11:22:37,357][26022] Updated weights on worker 0-0, policy_version 226741 (0.00085) [2022-07-09 11:22:38,989][26022] Updated weights on worker 0-0, policy_version 226751 (0.00086) [2022-07-09 11:22:39,634][25689] Fps is (10 sec: 5806.8, 60 sec: 5765.5, 300 sec: 5759.7). Total num frames: 232196096. Throughput: 0: 5166.2. Samples: 232188558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:39,634][25689] Avg episode reward: [(0, '-50.126')] [2022-07-09 11:22:40,987][26022] Updated weights on worker 0-0, policy_version 226761 (0.00090) [2022-07-09 11:22:42,429][26022] Updated weights on worker 0-0, policy_version 226771 (0.00702) [2022-07-09 11:22:44,464][26022] Updated weights on worker 0-0, policy_version 226781 (0.00084) [2022-07-09 11:22:44,699][25689] Fps is (10 sec: 5781.6, 60 sec: 5748.7, 300 sec: 5755.1). Total num frames: 232224768. Throughput: 0: 6052.9. Samples: 232223460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:44,700][25689] Avg episode reward: [(0, '-49.842')] [2022-07-09 11:22:45,960][26022] Updated weights on worker 0-0, policy_version 226791 (0.00093) [2022-07-09 11:22:47,910][26022] Updated weights on worker 0-0, policy_version 226801 (0.00083) [2022-07-09 11:22:49,487][26022] Updated weights on worker 0-0, policy_version 226811 (0.00089) [2022-07-09 11:22:49,715][25689] Fps is (10 sec: 5992.7, 60 sec: 5799.6, 300 sec: 5758.3). Total num frames: 232256512. Throughput: 0: 6064.0. Samples: 232258442. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:49,716][25689] Avg episode reward: [(0, '-49.904')] [2022-07-09 11:22:51,632][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:22:51,644][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000226821_232264704.pth [2022-07-09 11:22:51,645][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000224797_230192128.pth [2022-07-09 11:22:51,647][26022] Updated weights on worker 0-0, policy_version 226821 (0.00088) [2022-07-09 11:22:52,839][26022] Updated weights on worker 0-0, policy_version 226831 (0.00094) [2022-07-09 11:22:54,717][25689] Fps is (10 sec: 5826.5, 60 sec: 5770.2, 300 sec: 5755.2). Total num frames: 232283136. Throughput: 0: 5230.8. Samples: 232276206. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:54,719][25689] Avg episode reward: [(0, '-49.092')] [2022-07-09 11:22:55,000][26022] Updated weights on worker 0-0, policy_version 226841 (0.00091) [2022-07-09 11:22:56,383][26022] Updated weights on worker 0-0, policy_version 226851 (0.00091) [2022-07-09 11:22:58,291][26022] Updated weights on worker 0-0, policy_version 226861 (0.00091) [2022-07-09 11:22:59,743][25689] Fps is (10 sec: 5718.6, 60 sec: 5792.3, 300 sec: 5769.6). Total num frames: 232313856. Throughput: 0: 6103.1. Samples: 232311288. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:22:59,745][25689] Avg episode reward: [(0, '-49.172')] [2022-07-09 11:23:00,264][26022] Updated weights on worker 0-0, policy_version 226871 (0.00083) [2022-07-09 11:23:02,207][26022] Updated weights on worker 0-0, policy_version 226881 (0.00100) [2022-07-09 11:23:04,204][26022] Updated weights on worker 0-0, policy_version 226891 (0.00082) [2022-07-09 11:23:04,836][25689] Fps is (10 sec: 5666.8, 60 sec: 5772.8, 300 sec: 5755.0). Total num frames: 232340480. Throughput: 0: 5978.0. Samples: 232343838. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:23:04,837][25689] Avg episode reward: [(0, '-48.188')] [2022-07-09 11:23:05,531][26022] Updated weights on worker 0-0, policy_version 226901 (0.00083) [2022-07-09 11:23:07,687][26022] Updated weights on worker 0-0, policy_version 226911 (0.00083) [2022-07-09 11:23:09,469][26022] Updated weights on worker 0-0, policy_version 226921 (0.00088) [2022-07-09 11:23:09,919][25689] Fps is (10 sec: 5434.1, 60 sec: 5783.0, 300 sec: 5751.4). Total num frames: 232369152. Throughput: 0: 5973.5. Samples: 232379128. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:23:09,919][25689] Avg episode reward: [(0, '-48.657')] [2022-07-09 11:23:10,904][26022] Updated weights on worker 0-0, policy_version 226931 (0.00087) [2022-07-09 11:23:12,755][26022] Updated weights on worker 0-0, policy_version 226941 (0.00081) [2022-07-09 11:23:14,499][26022] Updated weights on worker 0-0, policy_version 226951 (0.00090) [2022-07-09 11:23:14,940][25689] Fps is (10 sec: 5675.6, 60 sec: 5747.5, 300 sec: 5754.8). Total num frames: 232397824. Throughput: 0: 5949.3. Samples: 232396518. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:23:14,940][25689] Avg episode reward: [(0, '-47.130')] [2022-07-09 11:23:16,228][26022] Updated weights on worker 0-0, policy_version 226961 (0.00090) [2022-07-09 11:23:18,303][26022] Updated weights on worker 0-0, policy_version 226971 (0.00092) [2022-07-09 11:23:19,949][25689] Fps is (10 sec: 5819.3, 60 sec: 5767.8, 300 sec: 5756.1). Total num frames: 232427520. Throughput: 0: 5939.0. Samples: 232431290. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:23:19,949][25689] Avg episode reward: [(0, '-46.654')] [2022-07-09 11:23:19,963][26022] Updated weights on worker 0-0, policy_version 226981 (0.00320) [2022-07-09 11:23:21,685][26022] Updated weights on worker 0-0, policy_version 226991 (0.00084) [2022-07-09 11:23:23,620][26022] Updated weights on worker 0-0, policy_version 227001 (0.00080) [2022-07-09 11:23:25,046][25689] Fps is (10 sec: 5978.3, 60 sec: 5802.9, 300 sec: 5758.7). Total num frames: 232458240. Throughput: 0: 6036.6. Samples: 232465834. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:23:25,046][25689] Avg episode reward: [(0, '-47.162')] [2022-07-09 11:23:25,122][26022] Updated weights on worker 0-0, policy_version 227011 (0.00087) [2022-07-09 11:23:27,246][26022] Updated weights on worker 0-0, policy_version 227021 (0.00094) [2022-07-09 11:23:28,809][26022] Updated weights on worker 0-0, policy_version 227031 (0.00091) [2022-07-09 11:23:30,116][25689] Fps is (10 sec: 5740.8, 60 sec: 5751.5, 300 sec: 5751.2). Total num frames: 232485888. Throughput: 0: 5141.4. Samples: 232482970. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:23:30,117][25689] Avg episode reward: [(0, '-47.158')] [2022-07-09 11:23:30,644][26022] Updated weights on worker 0-0, policy_version 227041 (0.00092) [2022-07-09 11:23:32,511][26022] Updated weights on worker 0-0, policy_version 227051 (0.00083) [2022-07-09 11:23:34,205][26022] Updated weights on worker 0-0, policy_version 227061 (0.00086) [2022-07-09 11:23:35,127][25689] Fps is (10 sec: 5586.6, 60 sec: 5752.5, 300 sec: 5754.6). Total num frames: 232514560. Throughput: 0: 5986.4. Samples: 232517366. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:23:35,128][25689] Avg episode reward: [(0, '-47.732')] [2022-07-09 11:23:35,839][26022] Updated weights on worker 0-0, policy_version 227071 (0.00083) [2022-07-09 11:23:37,813][26022] Updated weights on worker 0-0, policy_version 227081 (0.00088) [2022-07-09 11:23:39,338][26022] Updated weights on worker 0-0, policy_version 227091 (0.00085) [2022-07-09 11:23:40,153][25689] Fps is (10 sec: 5917.6, 60 sec: 5769.8, 300 sec: 5755.9). Total num frames: 232545280. Throughput: 0: 6006.6. Samples: 232552646. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:23:40,153][25689] Avg episode reward: [(0, '-48.385')] [2022-07-09 11:23:41,261][26022] Updated weights on worker 0-0, policy_version 227101 (0.00080) [2022-07-09 11:23:42,906][26022] Updated weights on worker 0-0, policy_version 227111 (0.00086) [2022-07-09 11:23:44,714][26022] Updated weights on worker 0-0, policy_version 227121 (0.00082) [2022-07-09 11:23:45,195][25689] Fps is (10 sec: 6102.7, 60 sec: 5805.9, 300 sec: 5759.2). Total num frames: 232576000. Throughput: 0: 5185.4. Samples: 232570314. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:23:45,195][25689] Avg episode reward: [(0, '-48.880')] [2022-07-09 11:23:46,386][26022] Updated weights on worker 0-0, policy_version 227131 (0.00086) [2022-07-09 11:23:48,190][26022] Updated weights on worker 0-0, policy_version 227141 (0.00093) [2022-07-09 11:23:50,079][26022] Updated weights on worker 0-0, policy_version 227151 (0.00085) [2022-07-09 11:23:50,219][25689] Fps is (10 sec: 5798.5, 60 sec: 5737.5, 300 sec: 5759.7). Total num frames: 232603648. Throughput: 0: 6072.6. Samples: 232605044. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:23:50,219][25689] Avg episode reward: [(0, '-48.191')] [2022-07-09 11:23:51,809][26022] Updated weights on worker 0-0, policy_version 227161 (0.00099) [2022-07-09 11:23:53,602][26022] Updated weights on worker 0-0, policy_version 227171 (0.00086) [2022-07-09 11:23:55,221][25689] Fps is (10 sec: 5617.1, 60 sec: 5771.2, 300 sec: 5759.7). Total num frames: 232632320. Throughput: 0: 6075.9. Samples: 232639456. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:23:55,223][25689] Avg episode reward: [(0, '-47.412')] [2022-07-09 11:23:55,308][26022] Updated weights on worker 0-0, policy_version 227181 (0.00091) [2022-07-09 11:23:57,139][26022] Updated weights on worker 0-0, policy_version 227191 (0.00086) [2022-07-09 11:23:59,100][26022] Updated weights on worker 0-0, policy_version 227201 (0.00084) [2022-07-09 11:24:00,260][25689] Fps is (10 sec: 5812.6, 60 sec: 5753.1, 300 sec: 5765.2). Total num frames: 232662016. Throughput: 0: 5181.7. Samples: 232656838. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:00,261][25689] Avg episode reward: [(0, '-47.082')] [2022-07-09 11:24:00,661][26022] Updated weights on worker 0-0, policy_version 227211 (0.00087) [2022-07-09 11:24:02,979][26022] Updated weights on worker 0-0, policy_version 227221 (0.00086) [2022-07-09 11:24:04,593][26022] Updated weights on worker 0-0, policy_version 227231 (0.00080) [2022-07-09 11:24:05,301][25689] Fps is (10 sec: 5485.6, 60 sec: 5741.1, 300 sec: 5751.4). Total num frames: 232687616. Throughput: 0: 5921.4. Samples: 232689374. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:05,302][25689] Avg episode reward: [(0, '-47.007')] [2022-07-09 11:24:06,259][26022] Updated weights on worker 0-0, policy_version 227241 (0.00082) [2022-07-09 11:24:08,191][26022] Updated weights on worker 0-0, policy_version 227251 (0.00081) [2022-07-09 11:24:09,797][26022] Updated weights on worker 0-0, policy_version 227261 (0.00081) [2022-07-09 11:24:10,334][25689] Fps is (10 sec: 5387.3, 60 sec: 5745.8, 300 sec: 5755.7). Total num frames: 232716288. Throughput: 0: 5917.1. Samples: 232724070. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:10,336][25689] Avg episode reward: [(0, '-47.102')] [2022-07-09 11:24:11,703][26022] Updated weights on worker 0-0, policy_version 227271 (0.00089) [2022-07-09 11:24:13,637][26022] Updated weights on worker 0-0, policy_version 227281 (0.00089) [2022-07-09 11:24:14,970][26022] Updated weights on worker 0-0, policy_version 227291 (0.00089) [2022-07-09 11:24:15,398][25689] Fps is (10 sec: 5882.4, 60 sec: 5775.6, 300 sec: 5758.3). Total num frames: 232747008. Throughput: 0: 5059.9. Samples: 232741548. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:15,399][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 11:24:17,143][26022] Updated weights on worker 0-0, policy_version 227301 (0.00084) [2022-07-09 11:24:18,724][26022] Updated weights on worker 0-0, policy_version 227311 (0.00087) [2022-07-09 11:24:20,422][25689] Fps is (10 sec: 5786.1, 60 sec: 5740.4, 300 sec: 5752.2). Total num frames: 232774656. Throughput: 0: 5907.5. Samples: 232775942. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:20,422][25689] Avg episode reward: [(0, '-47.012')] [2022-07-09 11:24:20,491][26022] Updated weights on worker 0-0, policy_version 227321 (0.00088) [2022-07-09 11:24:22,431][26022] Updated weights on worker 0-0, policy_version 227331 (0.00082) [2022-07-09 11:24:23,989][26022] Updated weights on worker 0-0, policy_version 227341 (0.00100) [2022-07-09 11:24:25,534][25689] Fps is (10 sec: 5758.6, 60 sec: 5738.9, 300 sec: 5753.9). Total num frames: 232805376. Throughput: 0: 6024.6. Samples: 232811266. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:25,534][25689] Avg episode reward: [(0, '-47.751')] [2022-07-09 11:24:25,837][26022] Updated weights on worker 0-0, policy_version 227351 (0.00089) [2022-07-09 11:24:27,822][26022] Updated weights on worker 0-0, policy_version 227361 (0.00087) [2022-07-09 11:24:29,133][26022] Updated weights on worker 0-0, policy_version 227371 (0.00088) [2022-07-09 11:24:30,575][25689] Fps is (10 sec: 5749.0, 60 sec: 5741.7, 300 sec: 5749.9). Total num frames: 232833024. Throughput: 0: 5169.3. Samples: 232828698. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:30,575][25689] Avg episode reward: [(0, '-47.728')] [2022-07-09 11:24:31,451][26022] Updated weights on worker 0-0, policy_version 227381 (0.00088) [2022-07-09 11:24:32,729][26022] Updated weights on worker 0-0, policy_version 227391 (0.00091) [2022-07-09 11:24:34,771][26022] Updated weights on worker 0-0, policy_version 227401 (0.00085) [2022-07-09 11:24:35,591][25689] Fps is (10 sec: 5905.6, 60 sec: 5792.0, 300 sec: 5756.9). Total num frames: 232864768. Throughput: 0: 6040.3. Samples: 232863518. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:35,591][25689] Avg episode reward: [(0, '-46.207')] [2022-07-09 11:24:36,499][26022] Updated weights on worker 0-0, policy_version 227411 (0.00098) [2022-07-09 11:24:38,136][26022] Updated weights on worker 0-0, policy_version 227421 (0.00090) [2022-07-09 11:24:40,054][26022] Updated weights on worker 0-0, policy_version 227431 (0.00094) [2022-07-09 11:24:40,598][25689] Fps is (10 sec: 6027.8, 60 sec: 5760.0, 300 sec: 5758.6). Total num frames: 232893440. Throughput: 0: 6060.7. Samples: 232898220. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:40,598][25689] Avg episode reward: [(0, '-46.366')] [2022-07-09 11:24:41,759][26022] Updated weights on worker 0-0, policy_version 227441 (0.00089) [2022-07-09 11:24:43,421][26022] Updated weights on worker 0-0, policy_version 227451 (0.00085) [2022-07-09 11:24:45,439][26022] Updated weights on worker 0-0, policy_version 227461 (0.00087) [2022-07-09 11:24:45,656][25689] Fps is (10 sec: 5595.3, 60 sec: 5707.6, 300 sec: 5750.7). Total num frames: 232921088. Throughput: 0: 5189.5. Samples: 232915692. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:45,657][25689] Avg episode reward: [(0, '-46.164')] [2022-07-09 11:24:46,888][26022] Updated weights on worker 0-0, policy_version 227471 (0.00087) [2022-07-09 11:24:49,006][26022] Updated weights on worker 0-0, policy_version 227481 (0.00095) [2022-07-09 11:24:50,543][26022] Updated weights on worker 0-0, policy_version 227491 (0.00091) [2022-07-09 11:24:50,678][25689] Fps is (10 sec: 5688.4, 60 sec: 5741.6, 300 sec: 5754.7). Total num frames: 232950784. Throughput: 0: 6058.6. Samples: 232950498. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:50,679][25689] Avg episode reward: [(0, '-46.125')] [2022-07-09 11:24:51,914][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:24:51,931][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000227498_232957952.pth [2022-07-09 11:24:51,941][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000225472_230883328.pth [2022-07-09 11:24:52,546][26022] Updated weights on worker 0-0, policy_version 227501 (0.00094) [2022-07-09 11:24:54,018][26022] Updated weights on worker 0-0, policy_version 227511 (0.00099) [2022-07-09 11:24:55,695][25689] Fps is (10 sec: 5814.5, 60 sec: 5740.3, 300 sec: 5747.5). Total num frames: 232979456. Throughput: 0: 6050.4. Samples: 232985156. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:24:55,696][25689] Avg episode reward: [(0, '-45.895')] [2022-07-09 11:24:56,078][26022] Updated weights on worker 0-0, policy_version 227521 (0.00094) [2022-07-09 11:24:57,562][26022] Updated weights on worker 0-0, policy_version 227531 (0.00083) [2022-07-09 11:24:59,394][26022] Updated weights on worker 0-0, policy_version 227541 (0.00085) [2022-07-09 11:25:00,703][25689] Fps is (10 sec: 5822.5, 60 sec: 5743.2, 300 sec: 5758.6). Total num frames: 233009152. Throughput: 0: 5195.6. Samples: 233002680. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:00,704][25689] Avg episode reward: [(0, '-45.169')] [2022-07-09 11:25:01,688][26022] Updated weights on worker 0-0, policy_version 227551 (0.00095) [2022-07-09 11:25:03,207][26022] Updated weights on worker 0-0, policy_version 227561 (0.00084) [2022-07-09 11:25:05,126][26022] Updated weights on worker 0-0, policy_version 227571 (0.00085) [2022-07-09 11:25:05,799][25689] Fps is (10 sec: 5574.0, 60 sec: 5755.0, 300 sec: 5743.7). Total num frames: 233035776. Throughput: 0: 5948.0. Samples: 233035500. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:05,801][25689] Avg episode reward: [(0, '-46.349')] [2022-07-09 11:25:06,676][26022] Updated weights on worker 0-0, policy_version 227581 (0.00090) [2022-07-09 11:25:08,723][26022] Updated weights on worker 0-0, policy_version 227591 (0.00080) [2022-07-09 11:25:10,509][26022] Updated weights on worker 0-0, policy_version 227601 (0.00089) [2022-07-09 11:25:10,850][25689] Fps is (10 sec: 5550.3, 60 sec: 5770.1, 300 sec: 5750.0). Total num frames: 233065472. Throughput: 0: 5944.5. Samples: 233070410. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:10,851][25689] Avg episode reward: [(0, '-46.374')] [2022-07-09 11:25:12,240][26022] Updated weights on worker 0-0, policy_version 227611 (0.00080) [2022-07-09 11:25:14,041][26022] Updated weights on worker 0-0, policy_version 227621 (0.00094) [2022-07-09 11:25:15,577][26022] Updated weights on worker 0-0, policy_version 227631 (0.00095) [2022-07-09 11:25:15,882][25689] Fps is (10 sec: 5890.3, 60 sec: 5756.2, 300 sec: 5753.3). Total num frames: 233095168. Throughput: 0: 5093.7. Samples: 233087984. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:15,884][25689] Avg episode reward: [(0, '-46.729')] [2022-07-09 11:25:17,463][26022] Updated weights on worker 0-0, policy_version 227641 (0.00088) [2022-07-09 11:25:19,354][26022] Updated weights on worker 0-0, policy_version 227651 (0.00082) [2022-07-09 11:25:20,895][25689] Fps is (10 sec: 5811.0, 60 sec: 5774.2, 300 sec: 5743.5). Total num frames: 233123840. Throughput: 0: 5933.1. Samples: 233122480. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:20,896][25689] Avg episode reward: [(0, '-46.324')] [2022-07-09 11:25:21,004][26022] Updated weights on worker 0-0, policy_version 227661 (0.00095) [2022-07-09 11:25:22,790][26022] Updated weights on worker 0-0, policy_version 227671 (0.00092) [2022-07-09 11:25:24,633][26022] Updated weights on worker 0-0, policy_version 227681 (0.00100) [2022-07-09 11:25:25,960][25689] Fps is (10 sec: 5893.2, 60 sec: 5778.7, 300 sec: 5752.9). Total num frames: 233154560. Throughput: 0: 6040.6. Samples: 233157286. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:25,960][25689] Avg episode reward: [(0, '-45.967')] [2022-07-09 11:25:26,359][26022] Updated weights on worker 0-0, policy_version 227691 (0.00087) [2022-07-09 11:25:28,127][26022] Updated weights on worker 0-0, policy_version 227701 (0.00085) [2022-07-09 11:25:29,774][26022] Updated weights on worker 0-0, policy_version 227711 (0.00090) [2022-07-09 11:25:30,977][25689] Fps is (10 sec: 5687.3, 60 sec: 5764.0, 300 sec: 5746.0). Total num frames: 233181184. Throughput: 0: 5188.3. Samples: 233174836. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:30,978][25689] Avg episode reward: [(0, '-45.570')] [2022-07-09 11:25:31,637][26022] Updated weights on worker 0-0, policy_version 227721 (0.00089) [2022-07-09 11:25:33,362][26022] Updated weights on worker 0-0, policy_version 227731 (0.00084) [2022-07-09 11:25:35,277][26022] Updated weights on worker 0-0, policy_version 227741 (0.00082) [2022-07-09 11:25:35,985][25689] Fps is (10 sec: 5516.0, 60 sec: 5714.0, 300 sec: 5742.9). Total num frames: 233209856. Throughput: 0: 6048.5. Samples: 233209576. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:35,985][25689] Avg episode reward: [(0, '-46.263')] [2022-07-09 11:25:36,769][26022] Updated weights on worker 0-0, policy_version 227751 (0.00065) [2022-07-09 11:25:38,851][26022] Updated weights on worker 0-0, policy_version 227761 (0.00085) [2022-07-09 11:25:40,279][26022] Updated weights on worker 0-0, policy_version 227771 (0.00088) [2022-07-09 11:25:41,019][25689] Fps is (10 sec: 6016.7, 60 sec: 5762.2, 300 sec: 5750.1). Total num frames: 233241600. Throughput: 0: 6056.9. Samples: 233244370. Policy #0 lag: (min: 0.0, avg: 6.9, max: 16.0) [2022-07-09 11:25:41,019][25689] Avg episode reward: [(0, '-47.090')] [2022-07-09 11:25:42,419][26022] Updated weights on worker 0-0, policy_version 227781 (0.00086) [2022-07-09 11:25:43,863][26022] Updated weights on worker 0-0, policy_version 227791 (0.00092) [2022-07-09 11:25:45,889][26022] Updated weights on worker 0-0, policy_version 227801 (0.00093) [2022-07-09 11:25:46,060][25689] Fps is (10 sec: 5996.5, 60 sec: 5780.9, 300 sec: 5753.2). Total num frames: 233270272. Throughput: 0: 5192.2. Samples: 233261650. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:25:46,060][25689] Avg episode reward: [(0, '-47.138')] [2022-07-09 11:25:47,555][26022] Updated weights on worker 0-0, policy_version 227811 (0.00093) [2022-07-09 11:25:49,489][26022] Updated weights on worker 0-0, policy_version 227821 (0.00097) [2022-07-09 11:25:51,092][25689] Fps is (10 sec: 5591.1, 60 sec: 5746.0, 300 sec: 5745.8). Total num frames: 233297920. Throughput: 0: 6059.3. Samples: 233296716. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:25:51,094][25689] Avg episode reward: [(0, '-47.211')] [2022-07-09 11:25:51,200][26022] Updated weights on worker 0-0, policy_version 227831 (0.00084) [2022-07-09 11:25:52,959][26022] Updated weights on worker 0-0, policy_version 227841 (0.00093) [2022-07-09 11:25:54,522][26022] Updated weights on worker 0-0, policy_version 227851 (0.00084) [2022-07-09 11:25:56,095][25689] Fps is (10 sec: 5612.0, 60 sec: 5747.2, 300 sec: 5753.0). Total num frames: 233326592. Throughput: 0: 6064.3. Samples: 233331534. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:25:56,096][25689] Avg episode reward: [(0, '-47.442')] [2022-07-09 11:25:56,499][26022] Updated weights on worker 0-0, policy_version 227861 (0.00104) [2022-07-09 11:25:57,873][26022] Updated weights on worker 0-0, policy_version 227871 (0.00078) [2022-07-09 11:25:59,895][26022] Updated weights on worker 0-0, policy_version 227881 (0.00084) [2022-07-09 11:26:01,101][25689] Fps is (10 sec: 6035.9, 60 sec: 5781.3, 300 sec: 5765.4). Total num frames: 233358336. Throughput: 0: 6094.5. Samples: 233366764. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:01,102][25689] Avg episode reward: [(0, '-47.276')] [2022-07-09 11:26:01,641][26022] Updated weights on worker 0-0, policy_version 227891 (0.00090) [2022-07-09 11:26:03,743][26022] Updated weights on worker 0-0, policy_version 227901 (0.00085) [2022-07-09 11:26:05,617][26022] Updated weights on worker 0-0, policy_version 227911 (0.00083) [2022-07-09 11:26:06,248][25689] Fps is (10 sec: 5749.0, 60 sec: 5776.5, 300 sec: 5752.8). Total num frames: 233384960. Throughput: 0: 5967.2. Samples: 233382118. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:06,249][25689] Avg episode reward: [(0, '-46.580')] [2022-07-09 11:26:07,143][26022] Updated weights on worker 0-0, policy_version 227921 (0.00087) [2022-07-09 11:26:08,920][26022] Updated weights on worker 0-0, policy_version 227931 (0.00087) [2022-07-09 11:26:10,932][26022] Updated weights on worker 0-0, policy_version 227941 (0.00081) [2022-07-09 11:26:11,303][25689] Fps is (10 sec: 5420.4, 60 sec: 5759.2, 300 sec: 5755.9). Total num frames: 233413632. Throughput: 0: 5948.7. Samples: 233416944. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:11,303][25689] Avg episode reward: [(0, '-45.948')] [2022-07-09 11:26:12,440][26022] Updated weights on worker 0-0, policy_version 227951 (0.00082) [2022-07-09 11:26:14,464][26022] Updated weights on worker 0-0, policy_version 227961 (0.00091) [2022-07-09 11:26:16,019][26022] Updated weights on worker 0-0, policy_version 227971 (0.00096) [2022-07-09 11:26:16,316][25689] Fps is (10 sec: 5899.3, 60 sec: 5778.0, 300 sec: 5759.4). Total num frames: 233444352. Throughput: 0: 5949.0. Samples: 233451824. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:16,316][25689] Avg episode reward: [(0, '-45.455')] [2022-07-09 11:26:17,943][26022] Updated weights on worker 0-0, policy_version 227981 (0.00089) [2022-07-09 11:26:19,619][26022] Updated weights on worker 0-0, policy_version 227991 (0.00503) [2022-07-09 11:26:21,411][25689] Fps is (10 sec: 5774.3, 60 sec: 5753.1, 300 sec: 5755.5). Total num frames: 233472000. Throughput: 0: 5049.6. Samples: 233469320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:21,412][25689] Avg episode reward: [(0, '-46.798')] [2022-07-09 11:26:21,466][26022] Updated weights on worker 0-0, policy_version 228001 (0.00082) [2022-07-09 11:26:23,112][26022] Updated weights on worker 0-0, policy_version 228011 (0.00085) [2022-07-09 11:26:25,031][26022] Updated weights on worker 0-0, policy_version 228021 (0.00078) [2022-07-09 11:26:26,465][25689] Fps is (10 sec: 5751.0, 60 sec: 5754.2, 300 sec: 5758.3). Total num frames: 233502720. Throughput: 0: 6023.0. Samples: 233503884. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:26,465][25689] Avg episode reward: [(0, '-47.295')] [2022-07-09 11:26:26,716][26022] Updated weights on worker 0-0, policy_version 228031 (0.00087) [2022-07-09 11:26:28,526][26022] Updated weights on worker 0-0, policy_version 228041 (0.00089) [2022-07-09 11:26:30,199][26022] Updated weights on worker 0-0, policy_version 228051 (0.00083) [2022-07-09 11:26:31,521][25689] Fps is (10 sec: 5773.2, 60 sec: 5767.5, 300 sec: 5754.3). Total num frames: 233530368. Throughput: 0: 6015.3. Samples: 233538564. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:31,523][25689] Avg episode reward: [(0, '-47.586')] [2022-07-09 11:26:31,999][26022] Updated weights on worker 0-0, policy_version 228061 (0.00085) [2022-07-09 11:26:33,715][26022] Updated weights on worker 0-0, policy_version 228071 (0.00087) [2022-07-09 11:26:35,619][26022] Updated weights on worker 0-0, policy_version 228081 (0.00089) [2022-07-09 11:26:36,525][25689] Fps is (10 sec: 5598.1, 60 sec: 5767.7, 300 sec: 5755.9). Total num frames: 233559040. Throughput: 0: 5152.6. Samples: 233555956. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:36,527][25689] Avg episode reward: [(0, '-47.395')] [2022-07-09 11:26:37,406][26022] Updated weights on worker 0-0, policy_version 228091 (0.00085) [2022-07-09 11:26:39,091][26022] Updated weights on worker 0-0, policy_version 228101 (0.00084) [2022-07-09 11:26:41,001][26022] Updated weights on worker 0-0, policy_version 228111 (0.00086) [2022-07-09 11:26:41,622][25689] Fps is (10 sec: 5880.1, 60 sec: 5744.9, 300 sec: 5758.8). Total num frames: 233589760. Throughput: 0: 6007.3. Samples: 233590732. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:41,622][25689] Avg episode reward: [(0, '-47.958')] [2022-07-09 11:26:42,787][26022] Updated weights on worker 0-0, policy_version 228121 (0.00082) [2022-07-09 11:26:44,412][26022] Updated weights on worker 0-0, policy_version 228131 (0.01169) [2022-07-09 11:26:46,366][26022] Updated weights on worker 0-0, policy_version 228141 (0.00089) [2022-07-09 11:26:46,663][25689] Fps is (10 sec: 5757.5, 60 sec: 5728.0, 300 sec: 5754.9). Total num frames: 233617408. Throughput: 0: 6022.9. Samples: 233625538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:46,664][25689] Avg episode reward: [(0, '-48.083')] [2022-07-09 11:26:47,864][26022] Updated weights on worker 0-0, policy_version 228151 (0.00082) [2022-07-09 11:26:49,786][26022] Updated weights on worker 0-0, policy_version 228161 (0.00087) [2022-07-09 11:26:51,543][26022] Updated weights on worker 0-0, policy_version 228171 (0.00087) [2022-07-09 11:26:51,671][25689] Fps is (10 sec: 5808.3, 60 sec: 5781.0, 300 sec: 5762.5). Total num frames: 233648128. Throughput: 0: 5181.2. Samples: 233642962. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:51,671][25689] Avg episode reward: [(0, '-48.051')] [2022-07-09 11:26:52,138][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:26:52,158][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000228175_233651200.pth [2022-07-09 11:26:52,158][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000226148_231575552.pth [2022-07-09 11:26:53,383][26022] Updated weights on worker 0-0, policy_version 228181 (0.00085) [2022-07-09 11:26:55,034][26022] Updated weights on worker 0-0, policy_version 228191 (0.00084) [2022-07-09 11:26:56,704][25689] Fps is (10 sec: 5813.1, 60 sec: 5761.3, 300 sec: 5756.6). Total num frames: 233675776. Throughput: 0: 6026.9. Samples: 233677570. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:26:56,705][25689] Avg episode reward: [(0, '-48.291')] [2022-07-09 11:26:56,876][26022] Updated weights on worker 0-0, policy_version 228201 (0.00080) [2022-07-09 11:26:58,615][26022] Updated weights on worker 0-0, policy_version 228211 (0.00086) [2022-07-09 11:27:00,353][26022] Updated weights on worker 0-0, policy_version 228221 (0.00086) [2022-07-09 11:27:01,749][25689] Fps is (10 sec: 5791.7, 60 sec: 5740.7, 300 sec: 5767.3). Total num frames: 233706496. Throughput: 0: 6042.5. Samples: 233712348. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:01,749][25689] Avg episode reward: [(0, '-48.576')] [2022-07-09 11:27:02,486][26022] Updated weights on worker 0-0, policy_version 228231 (0.00079) [2022-07-09 11:27:04,287][26022] Updated weights on worker 0-0, policy_version 228241 (0.00086) [2022-07-09 11:27:05,999][26022] Updated weights on worker 0-0, policy_version 228251 (0.00082) [2022-07-09 11:27:06,851][25689] Fps is (10 sec: 5550.5, 60 sec: 5728.1, 300 sec: 5758.7). Total num frames: 233732096. Throughput: 0: 5056.5. Samples: 233727616. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:06,851][25689] Avg episode reward: [(0, '-48.043')] [2022-07-09 11:27:07,853][26022] Updated weights on worker 0-0, policy_version 228261 (0.00087) [2022-07-09 11:27:09,652][26022] Updated weights on worker 0-0, policy_version 228271 (0.00099) [2022-07-09 11:27:11,343][26022] Updated weights on worker 0-0, policy_version 228281 (0.00084) [2022-07-09 11:27:11,882][25689] Fps is (10 sec: 5456.6, 60 sec: 5747.2, 300 sec: 5754.7). Total num frames: 233761792. Throughput: 0: 5900.8. Samples: 233762226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:11,883][25689] Avg episode reward: [(0, '-48.104')] [2022-07-09 11:27:13,034][26022] Updated weights on worker 0-0, policy_version 228291 (0.00085) [2022-07-09 11:27:14,850][26022] Updated weights on worker 0-0, policy_version 228301 (0.00088) [2022-07-09 11:27:16,681][26022] Updated weights on worker 0-0, policy_version 228311 (0.00088) [2022-07-09 11:27:16,888][25689] Fps is (10 sec: 6019.3, 60 sec: 5747.9, 300 sec: 5762.3). Total num frames: 233792512. Throughput: 0: 5939.0. Samples: 233797442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:16,888][25689] Avg episode reward: [(0, '-47.298')] [2022-07-09 11:27:18,452][26022] Updated weights on worker 0-0, policy_version 228321 (0.00085) [2022-07-09 11:27:20,194][26022] Updated weights on worker 0-0, policy_version 228331 (0.00089) [2022-07-09 11:27:21,920][25689] Fps is (10 sec: 5814.8, 60 sec: 5753.8, 300 sec: 5760.4). Total num frames: 233820160. Throughput: 0: 5093.7. Samples: 233815096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:21,921][25689] Avg episode reward: [(0, '-48.023')] [2022-07-09 11:27:21,976][26022] Updated weights on worker 0-0, policy_version 228341 (0.00111) [2022-07-09 11:27:23,814][26022] Updated weights on worker 0-0, policy_version 228351 (0.00055) [2022-07-09 11:27:25,257][26022] Updated weights on worker 0-0, policy_version 228361 (0.00657) [2022-07-09 11:27:26,975][25689] Fps is (10 sec: 5685.1, 60 sec: 5736.9, 300 sec: 5757.1). Total num frames: 233849856. Throughput: 0: 6079.6. Samples: 233849960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:26,976][25689] Avg episode reward: [(0, '-48.007')] [2022-07-09 11:27:27,335][26022] Updated weights on worker 0-0, policy_version 228371 (0.00083) [2022-07-09 11:27:28,899][26022] Updated weights on worker 0-0, policy_version 228381 (0.00087) [2022-07-09 11:27:30,768][26022] Updated weights on worker 0-0, policy_version 228391 (0.00085) [2022-07-09 11:27:31,982][25689] Fps is (10 sec: 5902.7, 60 sec: 5775.4, 300 sec: 5760.9). Total num frames: 233879552. Throughput: 0: 6113.9. Samples: 233885114. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:31,983][25689] Avg episode reward: [(0, '-49.038')] [2022-07-09 11:27:32,362][26022] Updated weights on worker 0-0, policy_version 228401 (0.00091) [2022-07-09 11:27:34,250][26022] Updated weights on worker 0-0, policy_version 228411 (0.00084) [2022-07-09 11:27:35,919][26022] Updated weights on worker 0-0, policy_version 228421 (0.00085) [2022-07-09 11:27:37,003][25689] Fps is (10 sec: 5718.3, 60 sec: 5756.9, 300 sec: 5754.1). Total num frames: 233907200. Throughput: 0: 5227.7. Samples: 233902596. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:37,003][25689] Avg episode reward: [(0, '-48.279')] [2022-07-09 11:27:37,768][26022] Updated weights on worker 0-0, policy_version 228431 (0.00089) [2022-07-09 11:27:39,287][26022] Updated weights on worker 0-0, policy_version 228441 (0.00093) [2022-07-09 11:27:41,174][26022] Updated weights on worker 0-0, policy_version 228451 (0.00097) [2022-07-09 11:27:42,008][25689] Fps is (10 sec: 5821.8, 60 sec: 5765.6, 300 sec: 5762.2). Total num frames: 233937920. Throughput: 0: 6095.7. Samples: 233937544. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:42,010][25689] Avg episode reward: [(0, '-49.288')] [2022-07-09 11:27:42,895][26022] Updated weights on worker 0-0, policy_version 228461 (0.00085) [2022-07-09 11:27:44,716][26022] Updated weights on worker 0-0, policy_version 228471 (0.00085) [2022-07-09 11:27:46,441][26022] Updated weights on worker 0-0, policy_version 228481 (0.00085) [2022-07-09 11:27:47,059][25689] Fps is (10 sec: 6007.7, 60 sec: 5798.5, 300 sec: 5754.6). Total num frames: 233967616. Throughput: 0: 6098.3. Samples: 233972442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:47,061][25689] Avg episode reward: [(0, '-48.890')] [2022-07-09 11:27:48,199][26022] Updated weights on worker 0-0, policy_version 228491 (0.00083) [2022-07-09 11:27:50,138][26022] Updated weights on worker 0-0, policy_version 228501 (0.00086) [2022-07-09 11:27:52,014][26022] Updated weights on worker 0-0, policy_version 228511 (0.00083) [2022-07-09 11:27:52,066][25689] Fps is (10 sec: 5701.4, 60 sec: 5747.8, 300 sec: 5758.0). Total num frames: 233995264. Throughput: 0: 5219.1. Samples: 233989932. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:52,066][25689] Avg episode reward: [(0, '-47.821')] [2022-07-09 11:27:53,628][26022] Updated weights on worker 0-0, policy_version 228521 (0.00085) [2022-07-09 11:27:55,365][26022] Updated weights on worker 0-0, policy_version 228531 (0.00086) [2022-07-09 11:27:57,087][25689] Fps is (10 sec: 5820.4, 60 sec: 5799.7, 300 sec: 5758.1). Total num frames: 234025984. Throughput: 0: 6088.9. Samples: 234024890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 11:27:57,091][25689] Avg episode reward: [(0, '-47.479')] [2022-07-09 11:27:57,091][26022] Updated weights on worker 0-0, policy_version 228541 (0.00085) [2022-07-09 11:27:58,863][26022] Updated weights on worker 0-0, policy_version 228551 (0.00084) [2022-07-09 11:28:00,572][26022] Updated weights on worker 0-0, policy_version 228561 (0.00081) [2022-07-09 11:28:02,139][25689] Fps is (10 sec: 5794.1, 60 sec: 5748.2, 300 sec: 5762.3). Total num frames: 234053632. Throughput: 0: 6068.0. Samples: 234059702. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:02,140][25689] Avg episode reward: [(0, '-47.360')] [2022-07-09 11:28:02,672][26022] Updated weights on worker 0-0, policy_version 228571 (0.00091) [2022-07-09 11:28:04,578][26022] Updated weights on worker 0-0, policy_version 228581 (0.00083) [2022-07-09 11:28:06,208][26022] Updated weights on worker 0-0, policy_version 228591 (0.00093) [2022-07-09 11:28:07,184][25689] Fps is (10 sec: 5679.6, 60 sec: 5821.6, 300 sec: 5766.5). Total num frames: 234083328. Throughput: 0: 5091.7. Samples: 234074910. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:07,184][25689] Avg episode reward: [(0, '-46.134')] [2022-07-09 11:28:07,922][26022] Updated weights on worker 0-0, policy_version 228601 (0.00083) [2022-07-09 11:28:09,918][26022] Updated weights on worker 0-0, policy_version 228611 (0.00083) [2022-07-09 11:28:11,528][26022] Updated weights on worker 0-0, policy_version 228621 (0.00086) [2022-07-09 11:28:12,192][25689] Fps is (10 sec: 5704.4, 60 sec: 5789.9, 300 sec: 5763.3). Total num frames: 234110976. Throughput: 0: 5961.2. Samples: 234109908. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:12,193][25689] Avg episode reward: [(0, '-46.632')] [2022-07-09 11:28:13,425][26022] Updated weights on worker 0-0, policy_version 228631 (0.00084) [2022-07-09 11:28:14,939][26022] Updated weights on worker 0-0, policy_version 228641 (0.00082) [2022-07-09 11:28:16,992][26022] Updated weights on worker 0-0, policy_version 228651 (0.00086) [2022-07-09 11:28:17,202][25689] Fps is (10 sec: 5621.5, 60 sec: 5755.4, 300 sec: 5759.8). Total num frames: 234139648. Throughput: 0: 5954.9. Samples: 234144674. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:17,205][25689] Avg episode reward: [(0, '-46.639')] [2022-07-09 11:28:18,602][26022] Updated weights on worker 0-0, policy_version 228661 (0.00085) [2022-07-09 11:28:20,442][26022] Updated weights on worker 0-0, policy_version 228671 (0.00086) [2022-07-09 11:28:22,147][26022] Updated weights on worker 0-0, policy_version 228681 (0.00091) [2022-07-09 11:28:22,220][25689] Fps is (10 sec: 5820.5, 60 sec: 5790.8, 300 sec: 5757.9). Total num frames: 234169344. Throughput: 0: 5105.8. Samples: 234162230. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:22,221][25689] Avg episode reward: [(0, '-46.811')] [2022-07-09 11:28:23,836][26022] Updated weights on worker 0-0, policy_version 228691 (0.00091) [2022-07-09 11:28:25,677][26022] Updated weights on worker 0-0, policy_version 228701 (0.00083) [2022-07-09 11:28:27,268][25689] Fps is (10 sec: 5697.2, 60 sec: 5757.5, 300 sec: 5758.4). Total num frames: 234196992. Throughput: 0: 6073.3. Samples: 234196886. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:27,268][25689] Avg episode reward: [(0, '-46.616')] [2022-07-09 11:28:27,642][26022] Updated weights on worker 0-0, policy_version 228711 (0.00091) [2022-07-09 11:28:29,233][26022] Updated weights on worker 0-0, policy_version 228721 (0.00085) [2022-07-09 11:28:31,209][26022] Updated weights on worker 0-0, policy_version 228731 (0.00084) [2022-07-09 11:28:32,283][25689] Fps is (10 sec: 5901.8, 60 sec: 5790.7, 300 sec: 5768.6). Total num frames: 234228736. Throughput: 0: 6056.5. Samples: 234231592. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:32,284][25689] Avg episode reward: [(0, '-46.674')] [2022-07-09 11:28:32,703][26022] Updated weights on worker 0-0, policy_version 228741 (0.00086) [2022-07-09 11:28:34,691][26022] Updated weights on worker 0-0, policy_version 228751 (0.00081) [2022-07-09 11:28:36,371][26022] Updated weights on worker 0-0, policy_version 228761 (0.01111) [2022-07-09 11:28:37,290][25689] Fps is (10 sec: 5926.3, 60 sec: 5792.0, 300 sec: 5758.6). Total num frames: 234256384. Throughput: 0: 5194.2. Samples: 234249010. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:37,290][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 11:28:38,291][26022] Updated weights on worker 0-0, policy_version 228771 (0.00084) [2022-07-09 11:28:39,743][26022] Updated weights on worker 0-0, policy_version 228781 (0.00093) [2022-07-09 11:28:41,714][26022] Updated weights on worker 0-0, policy_version 228791 (0.00086) [2022-07-09 11:28:42,311][25689] Fps is (10 sec: 5616.7, 60 sec: 5756.5, 300 sec: 5752.2). Total num frames: 234285056. Throughput: 0: 6052.9. Samples: 234283836. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:42,311][25689] Avg episode reward: [(0, '-47.389')] [2022-07-09 11:28:43,299][26022] Updated weights on worker 0-0, policy_version 228801 (0.00099) [2022-07-09 11:28:45,252][26022] Updated weights on worker 0-0, policy_version 228811 (0.01028) [2022-07-09 11:28:46,709][26022] Updated weights on worker 0-0, policy_version 228821 (0.00085) [2022-07-09 11:28:47,370][25689] Fps is (10 sec: 5790.5, 60 sec: 5755.8, 300 sec: 5758.4). Total num frames: 234314752. Throughput: 0: 6073.9. Samples: 234318982. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:47,370][25689] Avg episode reward: [(0, '-47.716')] [2022-07-09 11:28:48,831][26022] Updated weights on worker 0-0, policy_version 228831 (0.00084) [2022-07-09 11:28:50,343][26022] Updated weights on worker 0-0, policy_version 228841 (0.00101) [2022-07-09 11:28:52,176][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:28:52,189][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000228850_234342400.pth [2022-07-09 11:28:52,190][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000226821_232264704.pth [2022-07-09 11:28:52,323][26022] Updated weights on worker 0-0, policy_version 228851 (0.00090) [2022-07-09 11:28:52,387][25689] Fps is (10 sec: 5792.3, 60 sec: 5771.7, 300 sec: 5758.1). Total num frames: 234343424. Throughput: 0: 5225.9. Samples: 234336654. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:52,388][25689] Avg episode reward: [(0, '-48.532')] [2022-07-09 11:28:54,035][26022] Updated weights on worker 0-0, policy_version 228861 (0.00088) [2022-07-09 11:28:55,841][26022] Updated weights on worker 0-0, policy_version 228871 (0.00089) [2022-07-09 11:28:57,411][25689] Fps is (10 sec: 5812.7, 60 sec: 5754.6, 300 sec: 5758.4). Total num frames: 234373120. Throughput: 0: 6059.9. Samples: 234370944. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:28:57,412][25689] Avg episode reward: [(0, '-48.223')] [2022-07-09 11:28:57,471][26022] Updated weights on worker 0-0, policy_version 228881 (0.00088) [2022-07-09 11:28:59,443][26022] Updated weights on worker 0-0, policy_version 228891 (0.00079) [2022-07-09 11:29:01,093][26022] Updated weights on worker 0-0, policy_version 228901 (0.00090) [2022-07-09 11:29:02,426][25689] Fps is (10 sec: 5610.2, 60 sec: 5741.1, 300 sec: 5762.3). Total num frames: 234399744. Throughput: 0: 6050.5. Samples: 234405546. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:02,427][25689] Avg episode reward: [(0, '-47.907')] [2022-07-09 11:29:03,265][26022] Updated weights on worker 0-0, policy_version 228911 (0.00050) [2022-07-09 11:29:04,968][26022] Updated weights on worker 0-0, policy_version 228921 (0.00092) [2022-07-09 11:29:06,846][26022] Updated weights on worker 0-0, policy_version 228931 (0.00082) [2022-07-09 11:29:07,477][25689] Fps is (10 sec: 5696.6, 60 sec: 5757.4, 300 sec: 5768.9). Total num frames: 234430464. Throughput: 0: 5945.9. Samples: 234438542. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:07,478][25689] Avg episode reward: [(0, '-48.293')] [2022-07-09 11:29:08,606][26022] Updated weights on worker 0-0, policy_version 228941 (0.00248) [2022-07-09 11:29:10,346][26022] Updated weights on worker 0-0, policy_version 228951 (0.00080) [2022-07-09 11:29:12,060][26022] Updated weights on worker 0-0, policy_version 228961 (0.00093) [2022-07-09 11:29:12,491][25689] Fps is (10 sec: 5799.2, 60 sec: 5756.9, 300 sec: 5759.5). Total num frames: 234458112. Throughput: 0: 5936.5. Samples: 234456000. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:12,492][25689] Avg episode reward: [(0, '-47.880')] [2022-07-09 11:29:14,034][26022] Updated weights on worker 0-0, policy_version 228971 (0.00096) [2022-07-09 11:29:15,728][26022] Updated weights on worker 0-0, policy_version 228981 (0.00086) [2022-07-09 11:29:17,533][25689] Fps is (10 sec: 5600.9, 60 sec: 5753.9, 300 sec: 5762.6). Total num frames: 234486784. Throughput: 0: 5946.5. Samples: 234490600. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:17,533][25689] Avg episode reward: [(0, '-49.160')] [2022-07-09 11:29:17,542][26022] Updated weights on worker 0-0, policy_version 228991 (0.00099) [2022-07-09 11:29:19,189][26022] Updated weights on worker 0-0, policy_version 229001 (0.00085) [2022-07-09 11:29:21,215][26022] Updated weights on worker 0-0, policy_version 229011 (0.00081) [2022-07-09 11:29:22,513][26022] Updated weights on worker 0-0, policy_version 229021 (0.00089) [2022-07-09 11:29:22,547][25689] Fps is (10 sec: 5906.4, 60 sec: 5771.3, 300 sec: 5764.5). Total num frames: 234517504. Throughput: 0: 5952.1. Samples: 234525304. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:22,547][25689] Avg episode reward: [(0, '-48.524')] [2022-07-09 11:29:24,750][26022] Updated weights on worker 0-0, policy_version 229031 (0.00090) [2022-07-09 11:29:26,157][26022] Updated weights on worker 0-0, policy_version 229041 (0.00094) [2022-07-09 11:29:27,614][25689] Fps is (10 sec: 5789.9, 60 sec: 5769.4, 300 sec: 5764.0). Total num frames: 234545152. Throughput: 0: 5158.6. Samples: 234542418. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:27,614][25689] Avg episode reward: [(0, '-48.460')] [2022-07-09 11:29:28,282][26022] Updated weights on worker 0-0, policy_version 229051 (0.00093) [2022-07-09 11:29:29,943][26022] Updated weights on worker 0-0, policy_version 229061 (0.00094) [2022-07-09 11:29:31,602][26022] Updated weights on worker 0-0, policy_version 229071 (0.00082) [2022-07-09 11:29:32,619][25689] Fps is (10 sec: 5591.6, 60 sec: 5719.5, 300 sec: 5753.8). Total num frames: 234573824. Throughput: 0: 6013.3. Samples: 234577034. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:32,619][25689] Avg episode reward: [(0, '-48.867')] [2022-07-09 11:29:33,339][26022] Updated weights on worker 0-0, policy_version 229081 (0.00082) [2022-07-09 11:29:35,279][26022] Updated weights on worker 0-0, policy_version 229091 (0.00089) [2022-07-09 11:29:36,931][26022] Updated weights on worker 0-0, policy_version 229101 (0.00090) [2022-07-09 11:29:37,649][25689] Fps is (10 sec: 5918.2, 60 sec: 5768.1, 300 sec: 5760.3). Total num frames: 234604544. Throughput: 0: 6059.9. Samples: 234612504. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:37,650][25689] Avg episode reward: [(0, '-48.901')] [2022-07-09 11:29:38,719][26022] Updated weights on worker 0-0, policy_version 229111 (0.00080) [2022-07-09 11:29:40,386][26022] Updated weights on worker 0-0, policy_version 229121 (0.00081) [2022-07-09 11:29:42,169][26022] Updated weights on worker 0-0, policy_version 229131 (0.00086) [2022-07-09 11:29:42,652][25689] Fps is (10 sec: 5817.7, 60 sec: 5752.9, 300 sec: 5761.4). Total num frames: 234632192. Throughput: 0: 5205.7. Samples: 234629964. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:42,652][25689] Avg episode reward: [(0, '-48.270')] [2022-07-09 11:29:43,866][26022] Updated weights on worker 0-0, policy_version 229141 (0.00083) [2022-07-09 11:29:45,705][26022] Updated weights on worker 0-0, policy_version 229151 (0.00084) [2022-07-09 11:29:47,343][26022] Updated weights on worker 0-0, policy_version 229161 (0.00084) [2022-07-09 11:29:47,787][25689] Fps is (10 sec: 5757.7, 60 sec: 5762.6, 300 sec: 5762.6). Total num frames: 234662912. Throughput: 0: 6083.0. Samples: 234665130. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:47,791][25689] Avg episode reward: [(0, '-47.968')] [2022-07-09 11:29:49,294][26022] Updated weights on worker 0-0, policy_version 229171 (0.00090) [2022-07-09 11:29:50,740][26022] Updated weights on worker 0-0, policy_version 229181 (0.00081) [2022-07-09 11:29:52,756][26022] Updated weights on worker 0-0, policy_version 229191 (0.00087) [2022-07-09 11:29:52,811][25689] Fps is (10 sec: 5845.9, 60 sec: 5762.0, 300 sec: 5762.5). Total num frames: 234691584. Throughput: 0: 6086.6. Samples: 234699936. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:52,813][25689] Avg episode reward: [(0, '-47.409')] [2022-07-09 11:29:54,265][26022] Updated weights on worker 0-0, policy_version 229201 (0.00085) [2022-07-09 11:29:56,271][26022] Updated weights on worker 0-0, policy_version 229211 (0.00408) [2022-07-09 11:29:57,818][25689] Fps is (10 sec: 5818.9, 60 sec: 5763.6, 300 sec: 5762.5). Total num frames: 234721280. Throughput: 0: 5194.8. Samples: 234717274. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:29:57,818][25689] Avg episode reward: [(0, '-46.804')] [2022-07-09 11:29:57,872][26022] Updated weights on worker 0-0, policy_version 229221 (0.00086) [2022-07-09 11:29:59,806][26022] Updated weights on worker 0-0, policy_version 229231 (0.00092) [2022-07-09 11:30:02,000][26022] Updated weights on worker 0-0, policy_version 229241 (0.00090) [2022-07-09 11:30:02,843][25689] Fps is (10 sec: 5614.4, 60 sec: 5762.7, 300 sec: 5763.9). Total num frames: 234747904. Throughput: 0: 5995.0. Samples: 234751008. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:30:02,843][25689] Avg episode reward: [(0, '-46.317')] [2022-07-09 11:30:03,611][26022] Updated weights on worker 0-0, policy_version 229251 (0.00085) [2022-07-09 11:30:05,292][26022] Updated weights on worker 0-0, policy_version 229261 (0.00081) [2022-07-09 11:30:07,308][26022] Updated weights on worker 0-0, policy_version 229271 (0.00088) [2022-07-09 11:30:07,894][25689] Fps is (10 sec: 5487.6, 60 sec: 5728.7, 300 sec: 5760.4). Total num frames: 234776576. Throughput: 0: 5980.2. Samples: 234785376. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:30:07,895][25689] Avg episode reward: [(0, '-46.512')] [2022-07-09 11:30:08,735][26022] Updated weights on worker 0-0, policy_version 229281 (0.00092) [2022-07-09 11:30:10,898][26022] Updated weights on worker 0-0, policy_version 229291 (0.00087) [2022-07-09 11:30:12,166][26022] Updated weights on worker 0-0, policy_version 229301 (0.00051) [2022-07-09 11:30:12,913][25689] Fps is (10 sec: 5897.6, 60 sec: 5779.1, 300 sec: 5764.1). Total num frames: 234807296. Throughput: 0: 5129.7. Samples: 234803052. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:30:12,916][25689] Avg episode reward: [(0, '-46.936')] [2022-07-09 11:30:14,308][26022] Updated weights on worker 0-0, policy_version 229311 (0.00084) [2022-07-09 11:30:15,646][26022] Updated weights on worker 0-0, policy_version 229321 (0.00096) [2022-07-09 11:30:17,660][26022] Updated weights on worker 0-0, policy_version 229331 (0.00094) [2022-07-09 11:30:18,014][25689] Fps is (10 sec: 5869.1, 60 sec: 5773.5, 300 sec: 5762.4). Total num frames: 234835968. Throughput: 0: 6007.1. Samples: 234838594. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 11:30:18,014][25689] Avg episode reward: [(0, '-47.791')] [2022-07-09 11:30:19,373][26022] Updated weights on worker 0-0, policy_version 229341 (0.00087) [2022-07-09 11:30:21,325][26022] Updated weights on worker 0-0, policy_version 229351 (0.00050) [2022-07-09 11:30:22,856][26022] Updated weights on worker 0-0, policy_version 229361 (0.00094) [2022-07-09 11:30:23,025][25689] Fps is (10 sec: 5974.7, 60 sec: 5790.6, 300 sec: 5766.9). Total num frames: 234867712. Throughput: 0: 6067.2. Samples: 234873460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:23,026][25689] Avg episode reward: [(0, '-48.070')] [2022-07-09 11:30:24,729][26022] Updated weights on worker 0-0, policy_version 229371 (0.00619) [2022-07-09 11:30:26,305][26022] Updated weights on worker 0-0, policy_version 229381 (0.00086) [2022-07-09 11:30:28,079][25689] Fps is (10 sec: 5798.8, 60 sec: 5775.0, 300 sec: 5766.2). Total num frames: 234894336. Throughput: 0: 5206.6. Samples: 234890472. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:28,081][25689] Avg episode reward: [(0, '-48.698')] [2022-07-09 11:30:28,368][26022] Updated weights on worker 0-0, policy_version 229391 (0.00089) [2022-07-09 11:30:29,932][26022] Updated weights on worker 0-0, policy_version 229401 (0.00095) [2022-07-09 11:30:31,853][26022] Updated weights on worker 0-0, policy_version 229411 (0.00081) [2022-07-09 11:30:33,109][25689] Fps is (10 sec: 5686.7, 60 sec: 5806.4, 300 sec: 5772.7). Total num frames: 234925056. Throughput: 0: 6047.2. Samples: 234925180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:33,111][25689] Avg episode reward: [(0, '-49.148')] [2022-07-09 11:30:33,496][26022] Updated weights on worker 0-0, policy_version 229421 (0.00086) [2022-07-09 11:30:35,439][26022] Updated weights on worker 0-0, policy_version 229431 (0.00082) [2022-07-09 11:30:36,934][26022] Updated weights on worker 0-0, policy_version 229441 (0.00082) [2022-07-09 11:30:38,205][25689] Fps is (10 sec: 5865.3, 60 sec: 5766.3, 300 sec: 5761.1). Total num frames: 234953728. Throughput: 0: 6017.3. Samples: 234960092. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:38,211][25689] Avg episode reward: [(0, '-49.072')] [2022-07-09 11:30:39,028][26022] Updated weights on worker 0-0, policy_version 229451 (0.00079) [2022-07-09 11:30:40,414][26022] Updated weights on worker 0-0, policy_version 229461 (0.00092) [2022-07-09 11:30:42,376][26022] Updated weights on worker 0-0, policy_version 229471 (0.00086) [2022-07-09 11:30:43,309][25689] Fps is (10 sec: 5822.8, 60 sec: 5807.3, 300 sec: 5766.8). Total num frames: 234984448. Throughput: 0: 5139.0. Samples: 234977698. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:43,314][25689] Avg episode reward: [(0, '-48.895')] [2022-07-09 11:30:43,787][26022] Updated weights on worker 0-0, policy_version 229481 (0.00080) [2022-07-09 11:30:45,973][26022] Updated weights on worker 0-0, policy_version 229491 (0.00088) [2022-07-09 11:30:47,632][26022] Updated weights on worker 0-0, policy_version 229501 (0.00084) [2022-07-09 11:30:48,392][25689] Fps is (10 sec: 5830.4, 60 sec: 5778.5, 300 sec: 5769.3). Total num frames: 235013120. Throughput: 0: 6013.1. Samples: 235012614. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:48,392][25689] Avg episode reward: [(0, '-48.155')] [2022-07-09 11:30:49,231][26022] Updated weights on worker 0-0, policy_version 229511 (0.00086) [2022-07-09 11:30:51,123][26022] Updated weights on worker 0-0, policy_version 229521 (0.00081) [2022-07-09 11:30:52,349][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:30:52,360][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000229528_235036672.pth [2022-07-09 11:30:52,360][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000227498_232957952.pth [2022-07-09 11:30:52,806][26022] Updated weights on worker 0-0, policy_version 229531 (0.00085) [2022-07-09 11:30:53,433][25689] Fps is (10 sec: 5664.0, 60 sec: 5776.9, 300 sec: 5768.5). Total num frames: 235041792. Throughput: 0: 6027.6. Samples: 235047686. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:53,435][25689] Avg episode reward: [(0, '-48.051')] [2022-07-09 11:30:54,654][26022] Updated weights on worker 0-0, policy_version 229541 (0.00093) [2022-07-09 11:30:56,514][26022] Updated weights on worker 0-0, policy_version 229551 (0.00087) [2022-07-09 11:30:58,163][26022] Updated weights on worker 0-0, policy_version 229561 (0.00088) [2022-07-09 11:30:58,453][25689] Fps is (10 sec: 5801.0, 60 sec: 5775.6, 300 sec: 5761.4). Total num frames: 235071488. Throughput: 0: 5185.8. Samples: 235065096. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:30:58,454][25689] Avg episode reward: [(0, '-47.751')] [2022-07-09 11:30:59,988][26022] Updated weights on worker 0-0, policy_version 229571 (0.00052) [2022-07-09 11:31:01,576][26022] Updated weights on worker 0-0, policy_version 229581 (0.00086) [2022-07-09 11:31:03,455][25689] Fps is (10 sec: 5619.7, 60 sec: 5777.8, 300 sec: 5764.2). Total num frames: 235098112. Throughput: 0: 6051.0. Samples: 235099602. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:03,456][25689] Avg episode reward: [(0, '-48.221')] [2022-07-09 11:31:03,884][26022] Updated weights on worker 0-0, policy_version 229591 (0.00084) [2022-07-09 11:31:05,680][26022] Updated weights on worker 0-0, policy_version 229601 (0.00087) [2022-07-09 11:31:07,398][26022] Updated weights on worker 0-0, policy_version 229611 (0.00085) [2022-07-09 11:31:08,530][25689] Fps is (10 sec: 5690.8, 60 sec: 5809.4, 300 sec: 5770.7). Total num frames: 235128832. Throughput: 0: 5954.8. Samples: 235132534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:08,530][25689] Avg episode reward: [(0, '-47.856')] [2022-07-09 11:31:09,073][26022] Updated weights on worker 0-0, policy_version 229621 (0.00082) [2022-07-09 11:31:10,911][26022] Updated weights on worker 0-0, policy_version 229631 (0.00086) [2022-07-09 11:31:12,745][26022] Updated weights on worker 0-0, policy_version 229641 (0.00090) [2022-07-09 11:31:13,610][25689] Fps is (10 sec: 5848.7, 60 sec: 5769.8, 300 sec: 5762.5). Total num frames: 235157504. Throughput: 0: 5075.8. Samples: 235150100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:13,611][25689] Avg episode reward: [(0, '-48.846')] [2022-07-09 11:31:14,411][26022] Updated weights on worker 0-0, policy_version 229651 (0.00086) [2022-07-09 11:31:16,145][26022] Updated weights on worker 0-0, policy_version 229661 (0.00087) [2022-07-09 11:31:17,829][26022] Updated weights on worker 0-0, policy_version 229671 (0.00090) [2022-07-09 11:31:18,614][25689] Fps is (10 sec: 5788.1, 60 sec: 5795.9, 300 sec: 5771.1). Total num frames: 235187200. Throughput: 0: 5971.1. Samples: 235185478. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:18,615][25689] Avg episode reward: [(0, '-48.379')] [2022-07-09 11:31:19,796][26022] Updated weights on worker 0-0, policy_version 229681 (0.00081) [2022-07-09 11:31:21,484][26022] Updated weights on worker 0-0, policy_version 229691 (0.00085) [2022-07-09 11:31:23,139][26022] Updated weights on worker 0-0, policy_version 229701 (0.00087) [2022-07-09 11:31:23,629][25689] Fps is (10 sec: 5927.8, 60 sec: 5761.7, 300 sec: 5768.4). Total num frames: 235216896. Throughput: 0: 5978.6. Samples: 235220214. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:23,631][25689] Avg episode reward: [(0, '-48.540')] [2022-07-09 11:31:25,125][26022] Updated weights on worker 0-0, policy_version 229711 (0.00085) [2022-07-09 11:31:26,615][26022] Updated weights on worker 0-0, policy_version 229721 (0.00088) [2022-07-09 11:31:28,624][26022] Updated weights on worker 0-0, policy_version 229731 (0.00089) [2022-07-09 11:31:28,695][25689] Fps is (10 sec: 5688.3, 60 sec: 5777.5, 300 sec: 5768.3). Total num frames: 235244544. Throughput: 0: 6046.5. Samples: 235254462. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:28,697][25689] Avg episode reward: [(0, '-48.332')] [2022-07-09 11:31:30,252][26022] Updated weights on worker 0-0, policy_version 229741 (0.00085) [2022-07-09 11:31:32,102][26022] Updated weights on worker 0-0, policy_version 229751 (0.00079) [2022-07-09 11:31:33,701][26022] Updated weights on worker 0-0, policy_version 229761 (0.00083) [2022-07-09 11:31:33,792][25689] Fps is (10 sec: 5743.5, 60 sec: 5771.1, 300 sec: 5773.4). Total num frames: 235275264. Throughput: 0: 6031.7. Samples: 235271830. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:33,793][25689] Avg episode reward: [(0, '-47.551')] [2022-07-09 11:31:35,779][26022] Updated weights on worker 0-0, policy_version 229771 (0.00086) [2022-07-09 11:31:37,184][26022] Updated weights on worker 0-0, policy_version 229781 (0.00101) [2022-07-09 11:31:38,796][25689] Fps is (10 sec: 5778.5, 60 sec: 5763.0, 300 sec: 5764.8). Total num frames: 235302912. Throughput: 0: 6018.3. Samples: 235306940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:38,798][25689] Avg episode reward: [(0, '-48.428')] [2022-07-09 11:31:39,287][26022] Updated weights on worker 0-0, policy_version 229791 (0.00091) [2022-07-09 11:31:40,947][26022] Updated weights on worker 0-0, policy_version 229801 (0.00074) [2022-07-09 11:31:42,631][26022] Updated weights on worker 0-0, policy_version 229811 (0.00086) [2022-07-09 11:31:43,840][25689] Fps is (10 sec: 5707.0, 60 sec: 5751.8, 300 sec: 5771.7). Total num frames: 235332608. Throughput: 0: 6024.3. Samples: 235341968. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:43,840][25689] Avg episode reward: [(0, '-47.744')] [2022-07-09 11:31:44,419][26022] Updated weights on worker 0-0, policy_version 229821 (0.00576) [2022-07-09 11:31:46,253][26022] Updated weights on worker 0-0, policy_version 229831 (0.00081) [2022-07-09 11:31:47,922][26022] Updated weights on worker 0-0, policy_version 229841 (0.00087) [2022-07-09 11:31:48,935][25689] Fps is (10 sec: 5858.0, 60 sec: 5767.5, 300 sec: 5766.5). Total num frames: 235362304. Throughput: 0: 5182.5. Samples: 235359354. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:48,936][25689] Avg episode reward: [(0, '-47.149')] [2022-07-09 11:31:49,709][26022] Updated weights on worker 0-0, policy_version 229851 (0.00082) [2022-07-09 11:31:51,434][26022] Updated weights on worker 0-0, policy_version 229861 (0.00081) [2022-07-09 11:31:53,066][26022] Updated weights on worker 0-0, policy_version 229871 (0.00086) [2022-07-09 11:31:53,940][25689] Fps is (10 sec: 5981.9, 60 sec: 5804.9, 300 sec: 5777.4). Total num frames: 235393024. Throughput: 0: 6089.6. Samples: 235394522. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:53,940][25689] Avg episode reward: [(0, '-47.557')] [2022-07-09 11:31:55,087][26022] Updated weights on worker 0-0, policy_version 229881 (0.00087) [2022-07-09 11:31:56,605][26022] Updated weights on worker 0-0, policy_version 229891 (0.00085) [2022-07-09 11:31:58,644][26022] Updated weights on worker 0-0, policy_version 229901 (0.00093) [2022-07-09 11:31:58,947][25689] Fps is (10 sec: 5727.2, 60 sec: 5755.3, 300 sec: 5764.3). Total num frames: 235419648. Throughput: 0: 6077.0. Samples: 235429398. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:31:58,948][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 11:32:00,152][26022] Updated weights on worker 0-0, policy_version 229911 (0.00083) [2022-07-09 11:32:02,568][26022] Updated weights on worker 0-0, policy_version 229921 (0.00091) [2022-07-09 11:32:03,951][25689] Fps is (10 sec: 5421.2, 60 sec: 5772.1, 300 sec: 5773.1). Total num frames: 235447296. Throughput: 0: 5202.4. Samples: 235446592. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:03,951][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 11:32:04,083][26022] Updated weights on worker 0-0, policy_version 229931 (0.00092) [2022-07-09 11:32:05,853][26022] Updated weights on worker 0-0, policy_version 229941 (0.00092) [2022-07-09 11:32:07,809][26022] Updated weights on worker 0-0, policy_version 229951 (0.00081) [2022-07-09 11:32:09,014][25689] Fps is (10 sec: 5696.2, 60 sec: 5756.2, 300 sec: 5772.5). Total num frames: 235476992. Throughput: 0: 5987.6. Samples: 235479582. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:09,015][25689] Avg episode reward: [(0, '-46.622')] [2022-07-09 11:32:09,353][26022] Updated weights on worker 0-0, policy_version 229961 (0.00081) [2022-07-09 11:32:11,232][26022] Updated weights on worker 0-0, policy_version 229971 (0.00083) [2022-07-09 11:32:12,773][26022] Updated weights on worker 0-0, policy_version 229981 (0.00081) [2022-07-09 11:32:14,023][25689] Fps is (10 sec: 5795.1, 60 sec: 5763.0, 300 sec: 5765.6). Total num frames: 235505664. Throughput: 0: 5990.7. Samples: 235514834. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:14,023][25689] Avg episode reward: [(0, '-47.020')] [2022-07-09 11:32:14,656][26022] Updated weights on worker 0-0, policy_version 229991 (0.00086) [2022-07-09 11:32:16,308][26022] Updated weights on worker 0-0, policy_version 230001 (0.00090) [2022-07-09 11:32:18,191][26022] Updated weights on worker 0-0, policy_version 230011 (0.00088) [2022-07-09 11:32:19,091][25689] Fps is (10 sec: 5894.0, 60 sec: 5773.8, 300 sec: 5775.2). Total num frames: 235536384. Throughput: 0: 5118.2. Samples: 235532498. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:19,092][25689] Avg episode reward: [(0, '-47.369')] [2022-07-09 11:32:19,718][26022] Updated weights on worker 0-0, policy_version 230021 (0.00086) [2022-07-09 11:32:21,807][26022] Updated weights on worker 0-0, policy_version 230031 (0.00091) [2022-07-09 11:32:23,434][26022] Updated weights on worker 0-0, policy_version 230041 (0.00092) [2022-07-09 11:32:24,102][25689] Fps is (10 sec: 5892.4, 60 sec: 5757.3, 300 sec: 5772.6). Total num frames: 235565056. Throughput: 0: 5965.6. Samples: 235566808. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:24,103][25689] Avg episode reward: [(0, '-47.614')] [2022-07-09 11:32:25,460][26022] Updated weights on worker 0-0, policy_version 230051 (0.00082) [2022-07-09 11:32:27,160][26022] Updated weights on worker 0-0, policy_version 230061 (0.00089) [2022-07-09 11:32:28,935][26022] Updated weights on worker 0-0, policy_version 230071 (0.00089) [2022-07-09 11:32:29,228][25689] Fps is (10 sec: 5657.0, 60 sec: 5768.5, 300 sec: 5766.9). Total num frames: 235593728. Throughput: 0: 6008.8. Samples: 235601040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:29,228][25689] Avg episode reward: [(0, '-48.355')] [2022-07-09 11:32:30,710][26022] Updated weights on worker 0-0, policy_version 230081 (0.00092) [2022-07-09 11:32:32,623][26022] Updated weights on worker 0-0, policy_version 230091 (0.00081) [2022-07-09 11:32:33,998][26022] Updated weights on worker 0-0, policy_version 230101 (0.00080) [2022-07-09 11:32:34,230][25689] Fps is (10 sec: 5864.1, 60 sec: 5777.5, 300 sec: 5777.6). Total num frames: 235624448. Throughput: 0: 5123.2. Samples: 235618360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 11:32:34,231][25689] Avg episode reward: [(0, '-48.402')] [2022-07-09 11:32:36,219][26022] Updated weights on worker 0-0, policy_version 230111 (0.00092) [2022-07-09 11:32:37,668][26022] Updated weights on worker 0-0, policy_version 230121 (0.00104) [2022-07-09 11:32:39,236][25689] Fps is (10 sec: 5627.3, 60 sec: 5743.5, 300 sec: 5760.3). Total num frames: 235650048. Throughput: 0: 5984.4. Samples: 235653054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:32:39,237][25689] Avg episode reward: [(0, '-48.582')] [2022-07-09 11:32:39,742][26022] Updated weights on worker 0-0, policy_version 230131 (0.00099) [2022-07-09 11:32:41,322][26022] Updated weights on worker 0-0, policy_version 230141 (0.00054) [2022-07-09 11:32:43,008][26022] Updated weights on worker 0-0, policy_version 230151 (0.00087) [2022-07-09 11:32:44,237][25689] Fps is (10 sec: 5627.9, 60 sec: 5764.5, 300 sec: 5764.7). Total num frames: 235680768. Throughput: 0: 6023.8. Samples: 235688098. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:32:44,238][25689] Avg episode reward: [(0, '-48.080')] [2022-07-09 11:32:44,934][26022] Updated weights on worker 0-0, policy_version 230161 (0.00091) [2022-07-09 11:32:46,544][26022] Updated weights on worker 0-0, policy_version 230171 (0.00605) [2022-07-09 11:32:48,187][26022] Updated weights on worker 0-0, policy_version 230181 (0.00083) [2022-07-09 11:32:49,281][25689] Fps is (10 sec: 6014.3, 60 sec: 5769.3, 300 sec: 5770.9). Total num frames: 235710464. Throughput: 0: 5220.8. Samples: 235705736. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:32:49,282][25689] Avg episode reward: [(0, '-48.870')] [2022-07-09 11:32:50,223][26022] Updated weights on worker 0-0, policy_version 230191 (0.00089) [2022-07-09 11:32:51,662][26022] Updated weights on worker 0-0, policy_version 230201 (0.00092) [2022-07-09 11:32:52,463][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:32:52,474][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000230205_235729920.pth [2022-07-09 11:32:52,474][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000228175_233651200.pth [2022-07-09 11:32:53,709][26022] Updated weights on worker 0-0, policy_version 230211 (0.00086) [2022-07-09 11:32:54,368][25689] Fps is (10 sec: 5963.5, 60 sec: 5761.5, 300 sec: 5769.6). Total num frames: 235741184. Throughput: 0: 6077.2. Samples: 235740744. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:32:54,369][25689] Avg episode reward: [(0, '-48.298')] [2022-07-09 11:32:55,293][26022] Updated weights on worker 0-0, policy_version 230221 (0.00086) [2022-07-09 11:32:57,115][26022] Updated weights on worker 0-0, policy_version 230231 (0.00095) [2022-07-09 11:32:59,046][26022] Updated weights on worker 0-0, policy_version 230241 (0.00084) [2022-07-09 11:32:59,387][25689] Fps is (10 sec: 5775.9, 60 sec: 5777.4, 300 sec: 5770.3). Total num frames: 235768832. Throughput: 0: 6095.1. Samples: 235775876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:32:59,387][25689] Avg episode reward: [(0, '-48.508')] [2022-07-09 11:33:00,708][26022] Updated weights on worker 0-0, policy_version 230251 (0.00091) [2022-07-09 11:33:02,819][26022] Updated weights on worker 0-0, policy_version 230261 (0.00084) [2022-07-09 11:33:04,414][25689] Fps is (10 sec: 5402.4, 60 sec: 5758.2, 300 sec: 5760.3). Total num frames: 235795456. Throughput: 0: 5124.9. Samples: 235791502. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:04,415][25689] Avg episode reward: [(0, '-48.949')] [2022-07-09 11:33:04,750][26022] Updated weights on worker 0-0, policy_version 230271 (0.00086) [2022-07-09 11:33:06,472][26022] Updated weights on worker 0-0, policy_version 230281 (0.00090) [2022-07-09 11:33:08,239][26022] Updated weights on worker 0-0, policy_version 230291 (0.00089) [2022-07-09 11:33:09,500][25689] Fps is (10 sec: 5670.0, 60 sec: 5773.0, 300 sec: 5769.1). Total num frames: 235826176. Throughput: 0: 5923.9. Samples: 235825510. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:09,501][25689] Avg episode reward: [(0, '-48.535')] [2022-07-09 11:33:09,900][26022] Updated weights on worker 0-0, policy_version 230301 (0.00080) [2022-07-09 11:33:11,725][26022] Updated weights on worker 0-0, policy_version 230311 (0.00087) [2022-07-09 11:33:13,652][26022] Updated weights on worker 0-0, policy_version 230321 (0.00089) [2022-07-09 11:33:14,527][25689] Fps is (10 sec: 5771.7, 60 sec: 5754.3, 300 sec: 5765.3). Total num frames: 235853824. Throughput: 0: 5936.0. Samples: 235860404. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:14,527][25689] Avg episode reward: [(0, '-47.434')] [2022-07-09 11:33:15,242][26022] Updated weights on worker 0-0, policy_version 230331 (0.00086) [2022-07-09 11:33:17,091][26022] Updated weights on worker 0-0, policy_version 230341 (0.00093) [2022-07-09 11:33:18,718][26022] Updated weights on worker 0-0, policy_version 230351 (0.00085) [2022-07-09 11:33:19,538][25689] Fps is (10 sec: 5712.8, 60 sec: 5742.8, 300 sec: 5765.5). Total num frames: 235883520. Throughput: 0: 5065.6. Samples: 235877954. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:19,539][25689] Avg episode reward: [(0, '-47.742')] [2022-07-09 11:33:20,454][26022] Updated weights on worker 0-0, policy_version 230361 (0.00087) [2022-07-09 11:33:22,324][26022] Updated weights on worker 0-0, policy_version 230371 (0.00085) [2022-07-09 11:33:24,034][26022] Updated weights on worker 0-0, policy_version 230381 (0.00090) [2022-07-09 11:33:24,590][25689] Fps is (10 sec: 5799.9, 60 sec: 5738.9, 300 sec: 5768.8). Total num frames: 235912192. Throughput: 0: 6017.6. Samples: 235912914. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:24,591][25689] Avg episode reward: [(0, '-47.243')] [2022-07-09 11:33:25,676][26022] Updated weights on worker 0-0, policy_version 230391 (0.00082) [2022-07-09 11:33:27,652][26022] Updated weights on worker 0-0, policy_version 230401 (0.00086) [2022-07-09 11:33:29,291][26022] Updated weights on worker 0-0, policy_version 230411 (0.00080) [2022-07-09 11:33:29,713][25689] Fps is (10 sec: 5837.1, 60 sec: 5773.0, 300 sec: 5763.3). Total num frames: 235942912. Throughput: 0: 6042.3. Samples: 235947642. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:29,714][25689] Avg episode reward: [(0, '-47.124')] [2022-07-09 11:33:31,156][26022] Updated weights on worker 0-0, policy_version 230421 (0.00091) [2022-07-09 11:33:33,015][26022] Updated weights on worker 0-0, policy_version 230431 (0.00085) [2022-07-09 11:33:34,566][26022] Updated weights on worker 0-0, policy_version 230441 (0.00081) [2022-07-09 11:33:34,719][25689] Fps is (10 sec: 5864.0, 60 sec: 5738.9, 300 sec: 5766.8). Total num frames: 235971584. Throughput: 0: 5177.0. Samples: 235964938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:34,719][25689] Avg episode reward: [(0, '-47.797')] [2022-07-09 11:33:36,479][26022] Updated weights on worker 0-0, policy_version 230451 (0.00081) [2022-07-09 11:33:38,182][26022] Updated weights on worker 0-0, policy_version 230461 (0.00087) [2022-07-09 11:33:39,808][25689] Fps is (10 sec: 5883.1, 60 sec: 5815.4, 300 sec: 5772.3). Total num frames: 236002304. Throughput: 0: 6032.0. Samples: 236000226. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:39,809][25689] Avg episode reward: [(0, '-48.572')] [2022-07-09 11:33:39,823][26022] Updated weights on worker 0-0, policy_version 230471 (0.00082) [2022-07-09 11:33:41,683][26022] Updated weights on worker 0-0, policy_version 230481 (0.00082) [2022-07-09 11:33:43,421][26022] Updated weights on worker 0-0, policy_version 230491 (0.00089) [2022-07-09 11:33:44,832][25689] Fps is (10 sec: 5872.9, 60 sec: 5779.5, 300 sec: 5769.6). Total num frames: 236030976. Throughput: 0: 6059.1. Samples: 236035558. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:44,832][25689] Avg episode reward: [(0, '-48.360')] [2022-07-09 11:33:45,215][26022] Updated weights on worker 0-0, policy_version 230501 (0.00091) [2022-07-09 11:33:46,804][26022] Updated weights on worker 0-0, policy_version 230511 (0.00089) [2022-07-09 11:33:48,664][26022] Updated weights on worker 0-0, policy_version 230521 (0.00092) [2022-07-09 11:33:49,894][25689] Fps is (10 sec: 5685.9, 60 sec: 5760.9, 300 sec: 5768.7). Total num frames: 236059648. Throughput: 0: 5218.9. Samples: 236052962. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:49,894][25689] Avg episode reward: [(0, '-47.727')] [2022-07-09 11:33:50,363][26022] Updated weights on worker 0-0, policy_version 230531 (0.00089) [2022-07-09 11:33:52,120][26022] Updated weights on worker 0-0, policy_version 230541 (0.00080) [2022-07-09 11:33:53,952][26022] Updated weights on worker 0-0, policy_version 230551 (0.00086) [2022-07-09 11:33:54,912][25689] Fps is (10 sec: 5790.3, 60 sec: 5750.6, 300 sec: 5768.8). Total num frames: 236089344. Throughput: 0: 6075.7. Samples: 236087626. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:54,915][25689] Avg episode reward: [(0, '-48.029')] [2022-07-09 11:33:55,754][26022] Updated weights on worker 0-0, policy_version 230561 (0.00084) [2022-07-09 11:33:57,584][26022] Updated weights on worker 0-0, policy_version 230571 (0.00082) [2022-07-09 11:33:59,323][26022] Updated weights on worker 0-0, policy_version 230581 (0.00083) [2022-07-09 11:33:59,974][25689] Fps is (10 sec: 5891.8, 60 sec: 5780.2, 300 sec: 5778.2). Total num frames: 236119040. Throughput: 0: 6061.6. Samples: 236122462. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:33:59,975][25689] Avg episode reward: [(0, '-47.201')] [2022-07-09 11:34:01,076][26022] Updated weights on worker 0-0, policy_version 230591 (0.00091) [2022-07-09 11:34:03,234][26022] Updated weights on worker 0-0, policy_version 230601 (0.00086) [2022-07-09 11:34:04,993][25689] Fps is (10 sec: 5485.4, 60 sec: 5764.2, 300 sec: 5761.6). Total num frames: 236144640. Throughput: 0: 5049.8. Samples: 236137366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:04,993][25689] Avg episode reward: [(0, '-46.524')] [2022-07-09 11:34:05,089][26022] Updated weights on worker 0-0, policy_version 230611 (0.00095) [2022-07-09 11:34:06,588][26022] Updated weights on worker 0-0, policy_version 230621 (0.00093) [2022-07-09 11:34:08,747][26022] Updated weights on worker 0-0, policy_version 230631 (0.00091) [2022-07-09 11:34:10,031][25689] Fps is (10 sec: 5498.5, 60 sec: 5751.8, 300 sec: 5768.1). Total num frames: 236174336. Throughput: 0: 5923.2. Samples: 236172236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:10,031][25689] Avg episode reward: [(0, '-46.539')] [2022-07-09 11:34:10,211][26022] Updated weights on worker 0-0, policy_version 230641 (0.00082) [2022-07-09 11:34:12,216][26022] Updated weights on worker 0-0, policy_version 230651 (0.00089) [2022-07-09 11:34:13,974][26022] Updated weights on worker 0-0, policy_version 230661 (0.00085) [2022-07-09 11:34:15,054][25689] Fps is (10 sec: 5801.1, 60 sec: 5769.0, 300 sec: 5768.4). Total num frames: 236203008. Throughput: 0: 5929.2. Samples: 236207052. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:15,055][25689] Avg episode reward: [(0, '-46.389')] [2022-07-09 11:34:15,620][26022] Updated weights on worker 0-0, policy_version 230671 (0.00084) [2022-07-09 11:34:17,541][26022] Updated weights on worker 0-0, policy_version 230681 (0.00086) [2022-07-09 11:34:19,040][26022] Updated weights on worker 0-0, policy_version 230691 (0.01048) [2022-07-09 11:34:20,077][25689] Fps is (10 sec: 5707.7, 60 sec: 5751.0, 300 sec: 5761.3). Total num frames: 236231680. Throughput: 0: 5937.4. Samples: 236241824. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:20,078][25689] Avg episode reward: [(0, '-46.530')] [2022-07-09 11:34:20,892][26022] Updated weights on worker 0-0, policy_version 230701 (0.00087) [2022-07-09 11:34:22,691][26022] Updated weights on worker 0-0, policy_version 230711 (0.00083) [2022-07-09 11:34:24,620][26022] Updated weights on worker 0-0, policy_version 230721 (0.00089) [2022-07-09 11:34:25,084][25689] Fps is (10 sec: 5819.3, 60 sec: 5772.2, 300 sec: 5769.4). Total num frames: 236261376. Throughput: 0: 6058.9. Samples: 236259098. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:25,085][25689] Avg episode reward: [(0, '-46.680')] [2022-07-09 11:34:26,354][26022] Updated weights on worker 0-0, policy_version 230731 (0.00079) [2022-07-09 11:34:28,268][26022] Updated weights on worker 0-0, policy_version 230741 (0.00087) [2022-07-09 11:34:29,861][26022] Updated weights on worker 0-0, policy_version 230751 (0.00086) [2022-07-09 11:34:30,212][25689] Fps is (10 sec: 5860.3, 60 sec: 5754.8, 300 sec: 5770.5). Total num frames: 236291072. Throughput: 0: 6017.3. Samples: 236293674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:30,213][25689] Avg episode reward: [(0, '-46.603')] [2022-07-09 11:34:31,794][26022] Updated weights on worker 0-0, policy_version 230761 (0.00087) [2022-07-09 11:34:33,411][26022] Updated weights on worker 0-0, policy_version 230771 (0.00082) [2022-07-09 11:34:35,219][25689] Fps is (10 sec: 5658.3, 60 sec: 5737.8, 300 sec: 5760.6). Total num frames: 236318720. Throughput: 0: 5983.2. Samples: 236327700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:35,219][25689] Avg episode reward: [(0, '-46.799')] [2022-07-09 11:34:35,338][26022] Updated weights on worker 0-0, policy_version 230781 (0.00744) [2022-07-09 11:34:37,174][26022] Updated weights on worker 0-0, policy_version 230791 (0.00048) [2022-07-09 11:34:38,864][26022] Updated weights on worker 0-0, policy_version 230801 (0.00087) [2022-07-09 11:34:40,231][25689] Fps is (10 sec: 5518.8, 60 sec: 5694.3, 300 sec: 5760.4). Total num frames: 236346368. Throughput: 0: 5125.5. Samples: 236345124. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:40,232][25689] Avg episode reward: [(0, '-46.870')] [2022-07-09 11:34:40,563][26022] Updated weights on worker 0-0, policy_version 230811 (0.00083) [2022-07-09 11:34:42,291][26022] Updated weights on worker 0-0, policy_version 230821 (0.00094) [2022-07-09 11:34:44,089][26022] Updated weights on worker 0-0, policy_version 230831 (0.00094) [2022-07-09 11:34:45,239][25689] Fps is (10 sec: 5927.3, 60 sec: 5746.6, 300 sec: 5766.3). Total num frames: 236378112. Throughput: 0: 6000.0. Samples: 236380026. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:45,239][25689] Avg episode reward: [(0, '-48.020')] [2022-07-09 11:34:45,946][26022] Updated weights on worker 0-0, policy_version 230841 (0.00086) [2022-07-09 11:34:47,571][26022] Updated weights on worker 0-0, policy_version 230851 (0.00091) [2022-07-09 11:34:49,455][26022] Updated weights on worker 0-0, policy_version 230861 (0.00084) [2022-07-09 11:34:50,307][25689] Fps is (10 sec: 6097.9, 60 sec: 5763.0, 300 sec: 5768.9). Total num frames: 236407808. Throughput: 0: 6039.1. Samples: 236415032. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 11:34:50,308][25689] Avg episode reward: [(0, '-46.938')] [2022-07-09 11:34:51,029][26022] Updated weights on worker 0-0, policy_version 230871 (0.00086) [2022-07-09 11:34:52,532][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:34:52,544][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000230878_236419072.pth [2022-07-09 11:34:52,544][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000228850_234342400.pth [2022-07-09 11:34:52,930][26022] Updated weights on worker 0-0, policy_version 230881 (0.00098) [2022-07-09 11:34:54,675][26022] Updated weights on worker 0-0, policy_version 230891 (0.00082) [2022-07-09 11:34:55,321][25689] Fps is (10 sec: 5687.9, 60 sec: 5729.5, 300 sec: 5761.9). Total num frames: 236435456. Throughput: 0: 5202.9. Samples: 236432288. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:34:55,322][25689] Avg episode reward: [(0, '-47.543')] [2022-07-09 11:34:56,343][26022] Updated weights on worker 0-0, policy_version 230901 (0.00087) [2022-07-09 11:34:58,372][26022] Updated weights on worker 0-0, policy_version 230911 (0.00090) [2022-07-09 11:35:00,023][26022] Updated weights on worker 0-0, policy_version 230921 (0.00107) [2022-07-09 11:35:00,356][25689] Fps is (10 sec: 5604.8, 60 sec: 5715.1, 300 sec: 5768.6). Total num frames: 236464128. Throughput: 0: 6059.9. Samples: 236467074. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:00,358][25689] Avg episode reward: [(0, '-47.764')] [2022-07-09 11:35:01,746][26022] Updated weights on worker 0-0, policy_version 230931 (0.00094) [2022-07-09 11:35:04,206][26022] Updated weights on worker 0-0, policy_version 230941 (0.00083) [2022-07-09 11:35:05,423][25689] Fps is (10 sec: 5676.5, 60 sec: 5761.4, 300 sec: 5768.3). Total num frames: 236492800. Throughput: 0: 5930.3. Samples: 236499722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:05,423][25689] Avg episode reward: [(0, '-47.613')] [2022-07-09 11:35:05,750][26022] Updated weights on worker 0-0, policy_version 230951 (0.00089) [2022-07-09 11:35:07,431][26022] Updated weights on worker 0-0, policy_version 230961 (0.00574) [2022-07-09 11:35:09,321][26022] Updated weights on worker 0-0, policy_version 230971 (0.00087) [2022-07-09 11:35:10,478][25689] Fps is (10 sec: 5665.1, 60 sec: 5742.8, 300 sec: 5760.7). Total num frames: 236521472. Throughput: 0: 5069.0. Samples: 236517278. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:10,480][25689] Avg episode reward: [(0, '-46.920')] [2022-07-09 11:35:10,905][26022] Updated weights on worker 0-0, policy_version 230981 (0.00091) [2022-07-09 11:35:12,739][26022] Updated weights on worker 0-0, policy_version 230991 (0.00080) [2022-07-09 11:35:14,429][26022] Updated weights on worker 0-0, policy_version 231001 (0.00083) [2022-07-09 11:35:15,554][25689] Fps is (10 sec: 5660.0, 60 sec: 5737.8, 300 sec: 5761.1). Total num frames: 236550144. Throughput: 0: 5929.2. Samples: 236552256. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:15,555][25689] Avg episode reward: [(0, '-47.180')] [2022-07-09 11:35:16,264][26022] Updated weights on worker 0-0, policy_version 231011 (0.00088) [2022-07-09 11:35:18,051][26022] Updated weights on worker 0-0, policy_version 231021 (0.00095) [2022-07-09 11:35:19,657][26022] Updated weights on worker 0-0, policy_version 231031 (0.00092) [2022-07-09 11:35:20,649][25689] Fps is (10 sec: 5638.0, 60 sec: 5731.0, 300 sec: 5749.2). Total num frames: 236578816. Throughput: 0: 5917.7. Samples: 236587164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:20,650][25689] Avg episode reward: [(0, '-47.480')] [2022-07-09 11:35:21,399][26022] Updated weights on worker 0-0, policy_version 231041 (0.00086) [2022-07-09 11:35:23,516][26022] Updated weights on worker 0-0, policy_version 231051 (0.00086) [2022-07-09 11:35:25,048][26022] Updated weights on worker 0-0, policy_version 231061 (0.00082) [2022-07-09 11:35:25,667][25689] Fps is (10 sec: 5873.1, 60 sec: 5746.9, 300 sec: 5763.7). Total num frames: 236609536. Throughput: 0: 5180.4. Samples: 236604594. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:25,667][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 11:35:27,032][26022] Updated weights on worker 0-0, policy_version 231071 (0.00091) [2022-07-09 11:35:28,719][26022] Updated weights on worker 0-0, policy_version 231081 (0.00090) [2022-07-09 11:35:30,447][26022] Updated weights on worker 0-0, policy_version 231091 (0.00088) [2022-07-09 11:35:30,719][25689] Fps is (10 sec: 5898.2, 60 sec: 5737.1, 300 sec: 5756.4). Total num frames: 236638208. Throughput: 0: 6011.3. Samples: 236638950. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:30,719][25689] Avg episode reward: [(0, '-46.651')] [2022-07-09 11:35:32,391][26022] Updated weights on worker 0-0, policy_version 231101 (0.00098) [2022-07-09 11:35:33,997][26022] Updated weights on worker 0-0, policy_version 231111 (0.00089) [2022-07-09 11:35:35,754][25689] Fps is (10 sec: 5583.0, 60 sec: 5734.4, 300 sec: 5754.1). Total num frames: 236665856. Throughput: 0: 5988.5. Samples: 236673226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:35,755][25689] Avg episode reward: [(0, '-47.144')] [2022-07-09 11:35:36,059][26022] Updated weights on worker 0-0, policy_version 231121 (0.00085) [2022-07-09 11:35:37,579][26022] Updated weights on worker 0-0, policy_version 231131 (0.00085) [2022-07-09 11:35:39,539][26022] Updated weights on worker 0-0, policy_version 231141 (0.00084) [2022-07-09 11:35:40,762][25689] Fps is (10 sec: 5811.5, 60 sec: 5785.6, 300 sec: 5756.0). Total num frames: 236696576. Throughput: 0: 5144.2. Samples: 236690630. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:40,763][25689] Avg episode reward: [(0, '-47.561')] [2022-07-09 11:35:41,215][26022] Updated weights on worker 0-0, policy_version 231151 (0.00072) [2022-07-09 11:35:42,977][26022] Updated weights on worker 0-0, policy_version 231161 (0.00082) [2022-07-09 11:35:44,854][26022] Updated weights on worker 0-0, policy_version 231171 (0.00084) [2022-07-09 11:35:45,795][25689] Fps is (10 sec: 5813.2, 60 sec: 5715.6, 300 sec: 5753.5). Total num frames: 236724224. Throughput: 0: 6007.2. Samples: 236725508. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:45,795][25689] Avg episode reward: [(0, '-46.824')] [2022-07-09 11:35:46,371][26022] Updated weights on worker 0-0, policy_version 231181 (0.00090) [2022-07-09 11:35:48,392][26022] Updated weights on worker 0-0, policy_version 231191 (0.00084) [2022-07-09 11:35:49,768][26022] Updated weights on worker 0-0, policy_version 231201 (0.00084) [2022-07-09 11:35:50,848][25689] Fps is (10 sec: 5685.6, 60 sec: 5717.0, 300 sec: 5756.7). Total num frames: 236753920. Throughput: 0: 6046.0. Samples: 236760652. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:50,848][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 11:35:51,798][26022] Updated weights on worker 0-0, policy_version 231211 (0.00095) [2022-07-09 11:35:53,636][26022] Updated weights on worker 0-0, policy_version 231221 (0.00092) [2022-07-09 11:35:55,251][26022] Updated weights on worker 0-0, policy_version 231231 (0.00098) [2022-07-09 11:35:55,873][25689] Fps is (10 sec: 5994.6, 60 sec: 5766.6, 300 sec: 5760.0). Total num frames: 236784640. Throughput: 0: 5200.8. Samples: 236777860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:35:55,874][25689] Avg episode reward: [(0, '-46.439')] [2022-07-09 11:35:57,190][26022] Updated weights on worker 0-0, policy_version 231241 (0.00103) [2022-07-09 11:35:58,702][26022] Updated weights on worker 0-0, policy_version 231251 (0.00082) [2022-07-09 11:36:00,783][26022] Updated weights on worker 0-0, policy_version 231261 (0.00091) [2022-07-09 11:36:00,948][25689] Fps is (10 sec: 5677.3, 60 sec: 5729.0, 300 sec: 5758.6). Total num frames: 236811264. Throughput: 0: 6049.9. Samples: 236812756. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:00,949][25689] Avg episode reward: [(0, '-46.858')] [2022-07-09 11:36:02,548][26022] Updated weights on worker 0-0, policy_version 231271 (0.00089) [2022-07-09 11:36:04,666][26022] Updated weights on worker 0-0, policy_version 231281 (0.00082) [2022-07-09 11:36:05,974][25689] Fps is (10 sec: 5474.5, 60 sec: 5732.9, 300 sec: 5752.7). Total num frames: 236839936. Throughput: 0: 5951.2. Samples: 236845598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:05,975][25689] Avg episode reward: [(0, '-47.308')] [2022-07-09 11:36:06,269][26022] Updated weights on worker 0-0, policy_version 231291 (0.00089) [2022-07-09 11:36:08,097][26022] Updated weights on worker 0-0, policy_version 231301 (0.00092) [2022-07-09 11:36:09,596][26022] Updated weights on worker 0-0, policy_version 231311 (0.00088) [2022-07-09 11:36:11,019][25689] Fps is (10 sec: 5796.0, 60 sec: 5750.8, 300 sec: 5756.8). Total num frames: 236869632. Throughput: 0: 5077.9. Samples: 236863078. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:11,019][25689] Avg episode reward: [(0, '-47.952')] [2022-07-09 11:36:11,587][26022] Updated weights on worker 0-0, policy_version 231321 (0.00084) [2022-07-09 11:36:13,202][26022] Updated weights on worker 0-0, policy_version 231331 (0.00093) [2022-07-09 11:36:15,204][26022] Updated weights on worker 0-0, policy_version 231341 (0.00085) [2022-07-09 11:36:16,021][25689] Fps is (10 sec: 5809.3, 60 sec: 5757.8, 300 sec: 5753.4). Total num frames: 236898304. Throughput: 0: 5956.8. Samples: 236897878. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:16,022][25689] Avg episode reward: [(0, '-48.105')] [2022-07-09 11:36:16,638][26022] Updated weights on worker 0-0, policy_version 231351 (0.00087) [2022-07-09 11:36:18,631][26022] Updated weights on worker 0-0, policy_version 231361 (0.00078) [2022-07-09 11:36:20,332][26022] Updated weights on worker 0-0, policy_version 231371 (0.00090) [2022-07-09 11:36:21,028][25689] Fps is (10 sec: 5729.2, 60 sec: 5766.3, 300 sec: 5750.1). Total num frames: 236926976. Throughput: 0: 5988.3. Samples: 236932998. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:21,028][25689] Avg episode reward: [(0, '-48.774')] [2022-07-09 11:36:22,142][26022] Updated weights on worker 0-0, policy_version 231381 (0.00082) [2022-07-09 11:36:23,900][26022] Updated weights on worker 0-0, policy_version 231391 (0.00086) [2022-07-09 11:36:25,636][26022] Updated weights on worker 0-0, policy_version 231401 (0.00091) [2022-07-09 11:36:26,030][25689] Fps is (10 sec: 5831.5, 60 sec: 5750.7, 300 sec: 5758.2). Total num frames: 236956672. Throughput: 0: 5211.7. Samples: 236950126. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:26,032][25689] Avg episode reward: [(0, '-49.836')] [2022-07-09 11:36:27,555][26022] Updated weights on worker 0-0, policy_version 231411 (0.00087) [2022-07-09 11:36:29,315][26022] Updated weights on worker 0-0, policy_version 231421 (0.00083) [2022-07-09 11:36:30,940][26022] Updated weights on worker 0-0, policy_version 231431 (0.01231) [2022-07-09 11:36:31,061][25689] Fps is (10 sec: 5817.6, 60 sec: 5752.8, 300 sec: 5752.6). Total num frames: 236985344. Throughput: 0: 6070.3. Samples: 236984740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:31,061][25689] Avg episode reward: [(0, '-49.010')] [2022-07-09 11:36:32,841][26022] Updated weights on worker 0-0, policy_version 231441 (0.00071) [2022-07-09 11:36:34,431][26022] Updated weights on worker 0-0, policy_version 231451 (0.00087) [2022-07-09 11:36:36,081][25689] Fps is (10 sec: 5705.6, 60 sec: 5771.3, 300 sec: 5755.8). Total num frames: 237014016. Throughput: 0: 6068.2. Samples: 237019604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:36,081][25689] Avg episode reward: [(0, '-48.256')] [2022-07-09 11:36:36,515][26022] Updated weights on worker 0-0, policy_version 231461 (0.00085) [2022-07-09 11:36:38,152][26022] Updated weights on worker 0-0, policy_version 231471 (0.00091) [2022-07-09 11:36:39,842][26022] Updated weights on worker 0-0, policy_version 231481 (0.00091) [2022-07-09 11:36:41,083][25689] Fps is (10 sec: 5722.1, 60 sec: 5737.9, 300 sec: 5753.1). Total num frames: 237042688. Throughput: 0: 5183.1. Samples: 237036942. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:41,083][25689] Avg episode reward: [(0, '-47.519')] [2022-07-09 11:36:41,638][26022] Updated weights on worker 0-0, policy_version 231491 (0.00093) [2022-07-09 11:36:43,725][26022] Updated weights on worker 0-0, policy_version 231501 (0.00085) [2022-07-09 11:36:45,205][26022] Updated weights on worker 0-0, policy_version 231511 (0.00083) [2022-07-09 11:36:46,095][25689] Fps is (10 sec: 5828.8, 60 sec: 5773.8, 300 sec: 5754.7). Total num frames: 237072384. Throughput: 0: 6060.4. Samples: 237071724. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:46,095][25689] Avg episode reward: [(0, '-47.428')] [2022-07-09 11:36:47,065][26022] Updated weights on worker 0-0, policy_version 231521 (0.00092) [2022-07-09 11:36:48,648][26022] Updated weights on worker 0-0, policy_version 231531 (0.01007) [2022-07-09 11:36:50,507][26022] Updated weights on worker 0-0, policy_version 231541 (0.00091) [2022-07-09 11:36:51,166][25689] Fps is (10 sec: 5788.3, 60 sec: 5755.1, 300 sec: 5746.6). Total num frames: 237101056. Throughput: 0: 6064.0. Samples: 237106658. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:51,167][25689] Avg episode reward: [(0, '-46.917')] [2022-07-09 11:36:52,223][26022] Updated weights on worker 0-0, policy_version 231551 (0.00087) [2022-07-09 11:36:52,626][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:36:52,650][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000231554_237111296.pth [2022-07-09 11:36:52,651][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000229528_235036672.pth [2022-07-09 11:36:53,876][26022] Updated weights on worker 0-0, policy_version 231561 (0.00087) [2022-07-09 11:36:55,846][26022] Updated weights on worker 0-0, policy_version 231571 (0.00089) [2022-07-09 11:36:56,168][25689] Fps is (10 sec: 5997.5, 60 sec: 5774.3, 300 sec: 5763.9). Total num frames: 237132800. Throughput: 0: 5208.4. Samples: 237124226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:36:56,169][25689] Avg episode reward: [(0, '-46.715')] [2022-07-09 11:36:57,482][26022] Updated weights on worker 0-0, policy_version 231581 (0.00092) [2022-07-09 11:36:59,190][26022] Updated weights on worker 0-0, policy_version 231591 (0.00086) [2022-07-09 11:37:01,177][25689] Fps is (10 sec: 5830.3, 60 sec: 5780.6, 300 sec: 5760.3). Total num frames: 237159424. Throughput: 0: 6088.8. Samples: 237159296. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:37:01,178][25689] Avg episode reward: [(0, '-47.680')] [2022-07-09 11:37:01,184][26022] Updated weights on worker 0-0, policy_version 231601 (0.00088) [2022-07-09 11:37:03,119][26022] Updated weights on worker 0-0, policy_version 231611 (0.00097) [2022-07-09 11:37:05,051][26022] Updated weights on worker 0-0, policy_version 231621 (0.00082) [2022-07-09 11:37:06,185][25689] Fps is (10 sec: 5315.9, 60 sec: 5748.3, 300 sec: 5751.1). Total num frames: 237186048. Throughput: 0: 5967.0. Samples: 237191602. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:37:06,185][25689] Avg episode reward: [(0, '-47.714')] [2022-07-09 11:37:06,726][26022] Updated weights on worker 0-0, policy_version 231631 (0.00091) [2022-07-09 11:37:08,321][26022] Updated weights on worker 0-0, policy_version 231641 (0.00088) [2022-07-09 11:37:10,447][26022] Updated weights on worker 0-0, policy_version 231651 (0.00081) [2022-07-09 11:37:11,313][25689] Fps is (10 sec: 5657.5, 60 sec: 5757.4, 300 sec: 5755.7). Total num frames: 237216768. Throughput: 0: 5086.5. Samples: 237209136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 11:37:11,314][25689] Avg episode reward: [(0, '-49.411')] [2022-07-09 11:37:11,881][26022] Updated weights on worker 0-0, policy_version 231661 (0.00085) [2022-07-09 11:37:13,835][26022] Updated weights on worker 0-0, policy_version 231671 (0.00089) [2022-07-09 11:37:15,455][26022] Updated weights on worker 0-0, policy_version 231681 (0.00087) [2022-07-09 11:37:16,316][25689] Fps is (10 sec: 5761.5, 60 sec: 5740.4, 300 sec: 5746.6). Total num frames: 237244416. Throughput: 0: 5954.4. Samples: 237244192. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:16,316][25689] Avg episode reward: [(0, '-49.782')] [2022-07-09 11:37:17,094][26022] Updated weights on worker 0-0, policy_version 231691 (0.00092) [2022-07-09 11:37:19,127][26022] Updated weights on worker 0-0, policy_version 231701 (0.00078) [2022-07-09 11:37:20,847][26022] Updated weights on worker 0-0, policy_version 231711 (0.00091) [2022-07-09 11:37:21,317][25689] Fps is (10 sec: 5732.1, 60 sec: 5757.8, 300 sec: 5750.2). Total num frames: 237274112. Throughput: 0: 5947.5. Samples: 237279078. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:21,318][25689] Avg episode reward: [(0, '-50.920')] [2022-07-09 11:37:22,560][26022] Updated weights on worker 0-0, policy_version 231721 (0.00088) [2022-07-09 11:37:24,394][26022] Updated weights on worker 0-0, policy_version 231731 (0.00092) [2022-07-09 11:37:26,045][26022] Updated weights on worker 0-0, policy_version 231741 (0.00084) [2022-07-09 11:37:26,338][25689] Fps is (10 sec: 5925.8, 60 sec: 5756.1, 300 sec: 5755.7). Total num frames: 237303808. Throughput: 0: 5206.1. Samples: 237296522. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:26,339][25689] Avg episode reward: [(0, '-50.815')] [2022-07-09 11:37:28,041][26022] Updated weights on worker 0-0, policy_version 231751 (0.00091) [2022-07-09 11:37:29,955][26022] Updated weights on worker 0-0, policy_version 231761 (0.00081) [2022-07-09 11:37:31,430][25689] Fps is (10 sec: 5771.5, 60 sec: 5750.2, 300 sec: 5747.1). Total num frames: 237332480. Throughput: 0: 6039.6. Samples: 237330636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:31,431][25689] Avg episode reward: [(0, '-50.208')] [2022-07-09 11:37:31,515][26022] Updated weights on worker 0-0, policy_version 231771 (0.00094) [2022-07-09 11:37:33,527][26022] Updated weights on worker 0-0, policy_version 231781 (0.00077) [2022-07-09 11:37:35,094][26022] Updated weights on worker 0-0, policy_version 231791 (0.00089) [2022-07-09 11:37:36,449][25689] Fps is (10 sec: 5671.4, 60 sec: 5750.3, 300 sec: 5757.2). Total num frames: 237361152. Throughput: 0: 6013.2. Samples: 237365260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:36,450][25689] Avg episode reward: [(0, '-47.830')] [2022-07-09 11:37:37,039][26022] Updated weights on worker 0-0, policy_version 231801 (0.00090) [2022-07-09 11:37:38,515][26022] Updated weights on worker 0-0, policy_version 231811 (0.00086) [2022-07-09 11:37:40,451][26022] Updated weights on worker 0-0, policy_version 231821 (0.00088) [2022-07-09 11:37:41,459][25689] Fps is (10 sec: 5718.2, 60 sec: 5749.5, 300 sec: 5750.1). Total num frames: 237389824. Throughput: 0: 6012.4. Samples: 237400178. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:41,459][25689] Avg episode reward: [(0, '-47.313')] [2022-07-09 11:37:42,139][26022] Updated weights on worker 0-0, policy_version 231831 (0.00082) [2022-07-09 11:37:44,043][26022] Updated weights on worker 0-0, policy_version 231841 (0.00092) [2022-07-09 11:37:45,543][26022] Updated weights on worker 0-0, policy_version 231851 (0.00095) [2022-07-09 11:37:46,544][25689] Fps is (10 sec: 5883.5, 60 sec: 5759.5, 300 sec: 5752.8). Total num frames: 237420544. Throughput: 0: 6001.8. Samples: 237417792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:46,545][25689] Avg episode reward: [(0, '-46.984')] [2022-07-09 11:37:47,458][26022] Updated weights on worker 0-0, policy_version 231861 (0.00083) [2022-07-09 11:37:49,185][26022] Updated weights on worker 0-0, policy_version 231871 (0.00080) [2022-07-09 11:37:51,051][26022] Updated weights on worker 0-0, policy_version 231881 (0.00087) [2022-07-09 11:37:51,620][25689] Fps is (10 sec: 5946.0, 60 sec: 5776.1, 300 sec: 5749.5). Total num frames: 237450240. Throughput: 0: 6025.5. Samples: 237452286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:51,620][25689] Avg episode reward: [(0, '-46.848')] [2022-07-09 11:37:52,854][26022] Updated weights on worker 0-0, policy_version 231891 (0.00090) [2022-07-09 11:37:54,355][26022] Updated weights on worker 0-0, policy_version 231901 (0.00087) [2022-07-09 11:37:56,407][26022] Updated weights on worker 0-0, policy_version 231911 (0.00091) [2022-07-09 11:37:56,624][25689] Fps is (10 sec: 5689.1, 60 sec: 5708.1, 300 sec: 5749.8). Total num frames: 237477888. Throughput: 0: 6044.6. Samples: 237487206. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:37:56,624][25689] Avg episode reward: [(0, '-47.545')] [2022-07-09 11:37:58,036][26022] Updated weights on worker 0-0, policy_version 231921 (0.00079) [2022-07-09 11:37:59,763][26022] Updated weights on worker 0-0, policy_version 231931 (0.00085) [2022-07-09 11:38:01,635][25689] Fps is (10 sec: 5623.5, 60 sec: 5741.8, 300 sec: 5757.0). Total num frames: 237506560. Throughput: 0: 5187.8. Samples: 237504846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:01,635][25689] Avg episode reward: [(0, '-48.246')] [2022-07-09 11:38:01,900][26022] Updated weights on worker 0-0, policy_version 231941 (0.00073) [2022-07-09 11:38:03,679][26022] Updated weights on worker 0-0, policy_version 231951 (0.00090) [2022-07-09 11:38:05,620][26022] Updated weights on worker 0-0, policy_version 231961 (0.00092) [2022-07-09 11:38:06,722][25689] Fps is (10 sec: 5577.3, 60 sec: 5751.2, 300 sec: 5746.7). Total num frames: 237534208. Throughput: 0: 5913.7. Samples: 237537116. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:06,722][25689] Avg episode reward: [(0, '-48.703')] [2022-07-09 11:38:07,475][26022] Updated weights on worker 0-0, policy_version 231971 (0.00091) [2022-07-09 11:38:09,035][26022] Updated weights on worker 0-0, policy_version 231981 (0.00072) [2022-07-09 11:38:10,882][26022] Updated weights on worker 0-0, policy_version 231991 (0.00083) [2022-07-09 11:38:11,815][25689] Fps is (10 sec: 5733.6, 60 sec: 5754.6, 300 sec: 5755.7). Total num frames: 237564928. Throughput: 0: 5924.3. Samples: 237571928. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:11,815][25689] Avg episode reward: [(0, '-48.697')] [2022-07-09 11:38:12,522][26022] Updated weights on worker 0-0, policy_version 232001 (0.00082) [2022-07-09 11:38:14,475][26022] Updated weights on worker 0-0, policy_version 232011 (0.00083) [2022-07-09 11:38:16,242][26022] Updated weights on worker 0-0, policy_version 232021 (0.00086) [2022-07-09 11:38:16,817][25689] Fps is (10 sec: 5781.8, 60 sec: 5754.6, 300 sec: 5749.0). Total num frames: 237592576. Throughput: 0: 5060.1. Samples: 237589386. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:16,818][25689] Avg episode reward: [(0, '-47.762')] [2022-07-09 11:38:17,891][26022] Updated weights on worker 0-0, policy_version 232031 (0.00089) [2022-07-09 11:38:19,786][26022] Updated weights on worker 0-0, policy_version 232041 (0.01128) [2022-07-09 11:38:21,452][26022] Updated weights on worker 0-0, policy_version 232051 (0.00089) [2022-07-09 11:38:21,857][25689] Fps is (10 sec: 5710.2, 60 sec: 5750.9, 300 sec: 5752.7). Total num frames: 237622272. Throughput: 0: 5911.1. Samples: 237624382. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:21,858][25689] Avg episode reward: [(0, '-46.520')] [2022-07-09 11:38:23,159][26022] Updated weights on worker 0-0, policy_version 232061 (0.00086) [2022-07-09 11:38:24,924][26022] Updated weights on worker 0-0, policy_version 232071 (0.00085) [2022-07-09 11:38:26,544][26022] Updated weights on worker 0-0, policy_version 232081 (0.00085) [2022-07-09 11:38:26,879][25689] Fps is (10 sec: 6004.4, 60 sec: 5767.8, 300 sec: 5754.6). Total num frames: 237652992. Throughput: 0: 6060.5. Samples: 237659278. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:26,879][25689] Avg episode reward: [(0, '-46.890')] [2022-07-09 11:38:28,709][26022] Updated weights on worker 0-0, policy_version 232091 (0.00092) [2022-07-09 11:38:30,045][26022] Updated weights on worker 0-0, policy_version 232101 (0.00092) [2022-07-09 11:38:31,987][25689] Fps is (10 sec: 5762.1, 60 sec: 5749.4, 300 sec: 5749.2). Total num frames: 237680640. Throughput: 0: 5203.9. Samples: 237676900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:31,987][25689] Avg episode reward: [(0, '-47.246')] [2022-07-09 11:38:32,095][26022] Updated weights on worker 0-0, policy_version 232111 (0.00089) [2022-07-09 11:38:33,621][26022] Updated weights on worker 0-0, policy_version 232121 (0.00068) [2022-07-09 11:38:35,392][26022] Updated weights on worker 0-0, policy_version 232131 (0.00082) [2022-07-09 11:38:37,045][25689] Fps is (10 sec: 5741.2, 60 sec: 5779.4, 300 sec: 5749.8). Total num frames: 237711360. Throughput: 0: 6069.1. Samples: 237712154. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:37,047][25689] Avg episode reward: [(0, '-47.502')] [2022-07-09 11:38:37,111][26022] Updated weights on worker 0-0, policy_version 232141 (0.00094) [2022-07-09 11:38:39,038][26022] Updated weights on worker 0-0, policy_version 232151 (0.00086) [2022-07-09 11:38:40,710][26022] Updated weights on worker 0-0, policy_version 232161 (0.00096) [2022-07-09 11:38:42,075][25689] Fps is (10 sec: 5887.4, 60 sec: 5777.5, 300 sec: 5749.7). Total num frames: 237740032. Throughput: 0: 6051.7. Samples: 237746732. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:42,075][25689] Avg episode reward: [(0, '-47.463')] [2022-07-09 11:38:42,659][26022] Updated weights on worker 0-0, policy_version 232171 (0.00087) [2022-07-09 11:38:44,096][26022] Updated weights on worker 0-0, policy_version 232181 (0.00091) [2022-07-09 11:38:46,051][26022] Updated weights on worker 0-0, policy_version 232191 (0.00087) [2022-07-09 11:38:47,082][25689] Fps is (10 sec: 5917.7, 60 sec: 5785.0, 300 sec: 5757.7). Total num frames: 237770752. Throughput: 0: 5199.1. Samples: 237764314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:47,082][25689] Avg episode reward: [(0, '-48.329')] [2022-07-09 11:38:47,686][26022] Updated weights on worker 0-0, policy_version 232201 (0.00085) [2022-07-09 11:38:49,493][26022] Updated weights on worker 0-0, policy_version 232211 (0.00084) [2022-07-09 11:38:51,435][26022] Updated weights on worker 0-0, policy_version 232221 (0.00078) [2022-07-09 11:38:52,142][25689] Fps is (10 sec: 5797.6, 60 sec: 5752.5, 300 sec: 5749.9). Total num frames: 237798400. Throughput: 0: 6083.3. Samples: 237799512. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:52,143][25689] Avg episode reward: [(0, '-48.617')] [2022-07-09 11:38:52,687][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:38:52,694][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000232229_237802496.pth [2022-07-09 11:38:52,695][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000230205_235729920.pth [2022-07-09 11:38:53,002][26022] Updated weights on worker 0-0, policy_version 232231 (0.00087) [2022-07-09 11:38:55,026][26022] Updated weights on worker 0-0, policy_version 232241 (0.00084) [2022-07-09 11:38:56,546][26022] Updated weights on worker 0-0, policy_version 232251 (0.00082) [2022-07-09 11:38:57,217][25689] Fps is (10 sec: 5759.0, 60 sec: 5796.6, 300 sec: 5753.1). Total num frames: 237829120. Throughput: 0: 6052.1. Samples: 237834234. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:38:57,224][25689] Avg episode reward: [(0, '-47.418')] [2022-07-09 11:38:58,271][26022] Updated weights on worker 0-0, policy_version 232261 (0.00088) [2022-07-09 11:39:00,198][26022] Updated weights on worker 0-0, policy_version 232271 (0.00091) [2022-07-09 11:39:02,149][26022] Updated weights on worker 0-0, policy_version 232281 (0.00086) [2022-07-09 11:39:02,295][25689] Fps is (10 sec: 5648.3, 60 sec: 5756.4, 300 sec: 5755.4). Total num frames: 237855744. Throughput: 0: 5195.0. Samples: 237851774. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:39:02,297][25689] Avg episode reward: [(0, '-47.266')] [2022-07-09 11:39:04,031][26022] Updated weights on worker 0-0, policy_version 232291 (0.00088) [2022-07-09 11:39:05,660][26022] Updated weights on worker 0-0, policy_version 232301 (0.00090) [2022-07-09 11:39:07,373][25689] Fps is (10 sec: 5444.3, 60 sec: 5774.1, 300 sec: 5751.2). Total num frames: 237884416. Throughput: 0: 5926.2. Samples: 237884566. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:39:07,375][25689] Avg episode reward: [(0, '-47.761')] [2022-07-09 11:39:07,596][26022] Updated weights on worker 0-0, policy_version 232311 (0.00083) [2022-07-09 11:39:09,335][26022] Updated weights on worker 0-0, policy_version 232321 (0.00079) [2022-07-09 11:39:11,173][26022] Updated weights on worker 0-0, policy_version 232331 (0.00086) [2022-07-09 11:39:12,431][25689] Fps is (10 sec: 5859.5, 60 sec: 5777.5, 300 sec: 5757.4). Total num frames: 237915136. Throughput: 0: 5902.7. Samples: 237919268. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:39:12,432][25689] Avg episode reward: [(0, '-46.665')] [2022-07-09 11:39:12,639][26022] Updated weights on worker 0-0, policy_version 232341 (0.00087) [2022-07-09 11:39:14,794][26022] Updated weights on worker 0-0, policy_version 232351 (0.00090) [2022-07-09 11:39:16,140][26022] Updated weights on worker 0-0, policy_version 232361 (0.00084) [2022-07-09 11:39:17,442][25689] Fps is (10 sec: 5797.2, 60 sec: 5776.6, 300 sec: 5754.2). Total num frames: 237942784. Throughput: 0: 5077.4. Samples: 237936924. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:39:17,442][25689] Avg episode reward: [(0, '-47.060')] [2022-07-09 11:39:18,104][26022] Updated weights on worker 0-0, policy_version 232371 (0.00093) [2022-07-09 11:39:19,768][26022] Updated weights on worker 0-0, policy_version 232381 (0.00089) [2022-07-09 11:39:21,466][26022] Updated weights on worker 0-0, policy_version 232391 (0.00084) [2022-07-09 11:39:22,446][25689] Fps is (10 sec: 5827.7, 60 sec: 5796.9, 300 sec: 5757.7). Total num frames: 237973504. Throughput: 0: 5967.1. Samples: 237972020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:39:22,447][25689] Avg episode reward: [(0, '-48.379')] [2022-07-09 11:39:23,531][26022] Updated weights on worker 0-0, policy_version 232401 (0.00092) [2022-07-09 11:39:25,066][26022] Updated weights on worker 0-0, policy_version 232411 (0.00089) [2022-07-09 11:39:26,928][26022] Updated weights on worker 0-0, policy_version 232421 (0.00092) [2022-07-09 11:39:27,475][25689] Fps is (10 sec: 5919.7, 60 sec: 5762.5, 300 sec: 5756.2). Total num frames: 238002176. Throughput: 0: 6098.5. Samples: 238007154. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:39:27,475][25689] Avg episode reward: [(0, '-48.041')] [2022-07-09 11:39:28,607][26022] Updated weights on worker 0-0, policy_version 232431 (0.00087) [2022-07-09 11:39:30,420][26022] Updated weights on worker 0-0, policy_version 232441 (0.00090) [2022-07-09 11:39:32,211][26022] Updated weights on worker 0-0, policy_version 232451 (0.00084) [2022-07-09 11:39:32,601][25689] Fps is (10 sec: 5748.1, 60 sec: 5794.6, 300 sec: 5760.8). Total num frames: 238031872. Throughput: 0: 5209.5. Samples: 238024344. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:39:32,601][25689] Avg episode reward: [(0, '-47.795')] [2022-07-09 11:39:33,927][26022] Updated weights on worker 0-0, policy_version 232461 (0.00090) [2022-07-09 11:39:35,625][26022] Updated weights on worker 0-0, policy_version 232471 (0.00092) [2022-07-09 11:39:37,687][25689] Fps is (10 sec: 5715.7, 60 sec: 5758.2, 300 sec: 5762.8). Total num frames: 238060544. Throughput: 0: 6033.7. Samples: 238059076. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:39:37,687][25689] Avg episode reward: [(0, '-48.023')] [2022-07-09 11:39:37,692][26022] Updated weights on worker 0-0, policy_version 232481 (0.00090) [2022-07-09 11:39:39,262][26022] Updated weights on worker 0-0, policy_version 232491 (0.00088) [2022-07-09 11:39:41,207][26022] Updated weights on worker 0-0, policy_version 232501 (0.01291) [2022-07-09 11:39:42,747][25689] Fps is (10 sec: 5752.6, 60 sec: 5772.1, 300 sec: 5754.9). Total num frames: 238090240. Throughput: 0: 5988.8. Samples: 238093596. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:39:42,748][25689] Avg episode reward: [(0, '-48.380')] [2022-07-09 11:39:42,775][26022] Updated weights on worker 0-0, policy_version 232511 (0.00102) [2022-07-09 11:39:44,699][26022] Updated weights on worker 0-0, policy_version 232521 (0.00093) [2022-07-09 11:39:46,324][26022] Updated weights on worker 0-0, policy_version 232531 (0.00085) [2022-07-09 11:39:47,807][25689] Fps is (10 sec: 5868.6, 60 sec: 5750.2, 300 sec: 5755.1). Total num frames: 238119936. Throughput: 0: 5988.1. Samples: 238128906. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:39:47,807][25689] Avg episode reward: [(0, '-47.801')] [2022-07-09 11:39:48,259][26022] Updated weights on worker 0-0, policy_version 232541 (0.00098) [2022-07-09 11:39:49,767][26022] Updated weights on worker 0-0, policy_version 232551 (0.00083) [2022-07-09 11:39:51,626][26022] Updated weights on worker 0-0, policy_version 232561 (0.00086) [2022-07-09 11:39:52,863][25689] Fps is (10 sec: 5870.8, 60 sec: 5784.3, 300 sec: 5761.1). Total num frames: 238149632. Throughput: 0: 6018.9. Samples: 238146304. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:39:52,864][25689] Avg episode reward: [(0, '-48.348')] [2022-07-09 11:39:53,339][26022] Updated weights on worker 0-0, policy_version 232571 (0.00086) [2022-07-09 11:39:55,161][26022] Updated weights on worker 0-0, policy_version 232581 (0.00086) [2022-07-09 11:39:56,883][26022] Updated weights on worker 0-0, policy_version 232591 (0.00085) [2022-07-09 11:39:57,896][25689] Fps is (10 sec: 5886.9, 60 sec: 5771.5, 300 sec: 5764.6). Total num frames: 238179328. Throughput: 0: 6041.2. Samples: 238181162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:39:57,896][25689] Avg episode reward: [(0, '-48.235')] [2022-07-09 11:39:58,633][26022] Updated weights on worker 0-0, policy_version 232601 (0.00063) [2022-07-09 11:40:00,458][26022] Updated weights on worker 0-0, policy_version 232611 (0.00092) [2022-07-09 11:40:02,728][26022] Updated weights on worker 0-0, policy_version 232621 (0.00087) [2022-07-09 11:40:02,922][25689] Fps is (10 sec: 5497.3, 60 sec: 5759.5, 300 sec: 5755.1). Total num frames: 238204928. Throughput: 0: 5963.2. Samples: 238213906. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:02,923][25689] Avg episode reward: [(0, '-47.735')] [2022-07-09 11:40:04,330][26022] Updated weights on worker 0-0, policy_version 232631 (0.00093) [2022-07-09 11:40:06,355][26022] Updated weights on worker 0-0, policy_version 232641 (0.00086) [2022-07-09 11:40:07,830][26022] Updated weights on worker 0-0, policy_version 232651 (0.00095) [2022-07-09 11:40:07,928][25689] Fps is (10 sec: 5511.9, 60 sec: 5783.4, 300 sec: 5759.5). Total num frames: 238234624. Throughput: 0: 5075.5. Samples: 238231030. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:07,928][25689] Avg episode reward: [(0, '-47.359')] [2022-07-09 11:40:09,930][26022] Updated weights on worker 0-0, policy_version 232661 (0.00089) [2022-07-09 11:40:11,542][26022] Updated weights on worker 0-0, policy_version 232671 (0.00085) [2022-07-09 11:40:12,989][25689] Fps is (10 sec: 5696.4, 60 sec: 5732.3, 300 sec: 5756.3). Total num frames: 238262272. Throughput: 0: 5906.0. Samples: 238265164. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:12,989][25689] Avg episode reward: [(0, '-46.958')] [2022-07-09 11:40:13,380][26022] Updated weights on worker 0-0, policy_version 232681 (0.00085) [2022-07-09 11:40:15,279][26022] Updated weights on worker 0-0, policy_version 232691 (0.00087) [2022-07-09 11:40:16,742][26022] Updated weights on worker 0-0, policy_version 232701 (0.00083) [2022-07-09 11:40:18,083][25689] Fps is (10 sec: 5545.7, 60 sec: 5741.3, 300 sec: 5756.3). Total num frames: 238290944. Throughput: 0: 5887.2. Samples: 238300010. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:18,084][25689] Avg episode reward: [(0, '-47.042')] [2022-07-09 11:40:18,778][26022] Updated weights on worker 0-0, policy_version 232711 (0.00082) [2022-07-09 11:40:20,481][26022] Updated weights on worker 0-0, policy_version 232721 (0.00082) [2022-07-09 11:40:22,176][26022] Updated weights on worker 0-0, policy_version 232731 (0.00087) [2022-07-09 11:40:23,102][25689] Fps is (10 sec: 5771.7, 60 sec: 5723.1, 300 sec: 5752.9). Total num frames: 238320640. Throughput: 0: 5138.5. Samples: 238317598. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:23,103][25689] Avg episode reward: [(0, '-46.489')] [2022-07-09 11:40:24,039][26022] Updated weights on worker 0-0, policy_version 232741 (0.00090) [2022-07-09 11:40:25,759][26022] Updated weights on worker 0-0, policy_version 232751 (0.00101) [2022-07-09 11:40:27,616][26022] Updated weights on worker 0-0, policy_version 232761 (0.00087) [2022-07-09 11:40:28,119][25689] Fps is (10 sec: 5917.9, 60 sec: 5741.0, 300 sec: 5757.0). Total num frames: 238350336. Throughput: 0: 5993.5. Samples: 238352048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:28,121][25689] Avg episode reward: [(0, '-47.243')] [2022-07-09 11:40:29,319][26022] Updated weights on worker 0-0, policy_version 232771 (0.00080) [2022-07-09 11:40:31,030][26022] Updated weights on worker 0-0, policy_version 232781 (0.00096) [2022-07-09 11:40:33,131][26022] Updated weights on worker 0-0, policy_version 232791 (0.00086) [2022-07-09 11:40:33,206][25689] Fps is (10 sec: 5675.1, 60 sec: 5710.9, 300 sec: 5756.0). Total num frames: 238377984. Throughput: 0: 6024.5. Samples: 238386962. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:33,207][25689] Avg episode reward: [(0, '-47.115')] [2022-07-09 11:40:34,375][26022] Updated weights on worker 0-0, policy_version 232801 (0.00081) [2022-07-09 11:40:36,341][26022] Updated weights on worker 0-0, policy_version 232811 (0.00094) [2022-07-09 11:40:38,032][26022] Updated weights on worker 0-0, policy_version 232821 (0.00078) [2022-07-09 11:40:38,223][25689] Fps is (10 sec: 5878.4, 60 sec: 5768.2, 300 sec: 5759.3). Total num frames: 238409728. Throughput: 0: 5190.1. Samples: 238404536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:38,223][25689] Avg episode reward: [(0, '-47.476')] [2022-07-09 11:40:39,852][26022] Updated weights on worker 0-0, policy_version 232831 (0.00090) [2022-07-09 11:40:41,798][26022] Updated weights on worker 0-0, policy_version 232841 (0.00077) [2022-07-09 11:40:43,183][26022] Updated weights on worker 0-0, policy_version 232851 (0.00092) [2022-07-09 11:40:43,237][25689] Fps is (10 sec: 6124.9, 60 sec: 5772.6, 300 sec: 5766.5). Total num frames: 238439424. Throughput: 0: 6052.1. Samples: 238439462. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:43,238][25689] Avg episode reward: [(0, '-46.880')] [2022-07-09 11:40:45,332][26022] Updated weights on worker 0-0, policy_version 232861 (0.00089) [2022-07-09 11:40:46,859][26022] Updated weights on worker 0-0, policy_version 232871 (0.00089) [2022-07-09 11:40:48,260][25689] Fps is (10 sec: 5712.8, 60 sec: 5742.2, 300 sec: 5760.2). Total num frames: 238467072. Throughput: 0: 6079.9. Samples: 238474504. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:48,261][25689] Avg episode reward: [(0, '-46.255')] [2022-07-09 11:40:48,751][26022] Updated weights on worker 0-0, policy_version 232881 (0.00092) [2022-07-09 11:40:50,399][26022] Updated weights on worker 0-0, policy_version 232891 (0.00085) [2022-07-09 11:40:52,155][26022] Updated weights on worker 0-0, policy_version 232901 (0.00087) [2022-07-09 11:40:52,827][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:40:52,841][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000232904_238493696.pth [2022-07-09 11:40:52,842][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000230878_236419072.pth [2022-07-09 11:40:53,315][25689] Fps is (10 sec: 5792.1, 60 sec: 5759.4, 300 sec: 5759.6). Total num frames: 238497792. Throughput: 0: 5215.7. Samples: 238491842. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:53,315][25689] Avg episode reward: [(0, '-46.573')] [2022-07-09 11:40:53,947][26022] Updated weights on worker 0-0, policy_version 232911 (0.00087) [2022-07-09 11:40:55,804][26022] Updated weights on worker 0-0, policy_version 232921 (0.00086) [2022-07-09 11:40:57,342][26022] Updated weights on worker 0-0, policy_version 232931 (0.00083) [2022-07-09 11:40:58,415][25689] Fps is (10 sec: 5849.0, 60 sec: 5736.0, 300 sec: 5766.0). Total num frames: 238526464. Throughput: 0: 6074.4. Samples: 238527190. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:40:58,415][25689] Avg episode reward: [(0, '-46.173')] [2022-07-09 11:40:59,116][26022] Updated weights on worker 0-0, policy_version 232941 (0.00101) [2022-07-09 11:41:00,746][26022] Updated weights on worker 0-0, policy_version 232951 (0.00086) [2022-07-09 11:41:03,135][26022] Updated weights on worker 0-0, policy_version 232961 (0.00087) [2022-07-09 11:41:03,462][25689] Fps is (10 sec: 5449.4, 60 sec: 5751.0, 300 sec: 5758.7). Total num frames: 238553088. Throughput: 0: 5972.8. Samples: 238560258. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:03,463][25689] Avg episode reward: [(0, '-46.595')] [2022-07-09 11:41:04,718][26022] Updated weights on worker 0-0, policy_version 232971 (0.00091) [2022-07-09 11:41:06,560][26022] Updated weights on worker 0-0, policy_version 232981 (0.00087) [2022-07-09 11:41:08,329][26022] Updated weights on worker 0-0, policy_version 232991 (0.00109) [2022-07-09 11:41:08,506][25689] Fps is (10 sec: 5682.7, 60 sec: 5764.2, 300 sec: 5762.2). Total num frames: 238583808. Throughput: 0: 5097.0. Samples: 238577694. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:08,506][25689] Avg episode reward: [(0, '-47.269')] [2022-07-09 11:41:10,005][26022] Updated weights on worker 0-0, policy_version 233001 (0.00082) [2022-07-09 11:41:12,059][26022] Updated weights on worker 0-0, policy_version 233011 (0.00089) [2022-07-09 11:41:13,620][25689] Fps is (10 sec: 5846.7, 60 sec: 5776.1, 300 sec: 5760.0). Total num frames: 238612480. Throughput: 0: 5932.1. Samples: 238612294. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:13,620][25689] Avg episode reward: [(0, '-47.415')] [2022-07-09 11:41:13,727][26022] Updated weights on worker 0-0, policy_version 233021 (0.00090) [2022-07-09 11:41:15,547][26022] Updated weights on worker 0-0, policy_version 233031 (0.00079) [2022-07-09 11:41:17,319][26022] Updated weights on worker 0-0, policy_version 233041 (0.00090) [2022-07-09 11:41:18,657][25689] Fps is (10 sec: 5749.7, 60 sec: 5798.4, 300 sec: 5762.9). Total num frames: 238642176. Throughput: 0: 5934.3. Samples: 238647314. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:18,658][25689] Avg episode reward: [(0, '-47.399')] [2022-07-09 11:41:19,081][26022] Updated weights on worker 0-0, policy_version 233051 (0.00086) [2022-07-09 11:41:20,711][26022] Updated weights on worker 0-0, policy_version 233061 (0.00096) [2022-07-09 11:41:22,588][26022] Updated weights on worker 0-0, policy_version 233071 (0.00085) [2022-07-09 11:41:23,686][25689] Fps is (10 sec: 6001.7, 60 sec: 5814.3, 300 sec: 5765.8). Total num frames: 238672896. Throughput: 0: 5175.9. Samples: 238664938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:23,687][25689] Avg episode reward: [(0, '-48.093')] [2022-07-09 11:41:24,148][26022] Updated weights on worker 0-0, policy_version 233081 (0.00089) [2022-07-09 11:41:26,393][26022] Updated weights on worker 0-0, policy_version 233091 (0.00091) [2022-07-09 11:41:27,642][26022] Updated weights on worker 0-0, policy_version 233101 (0.00095) [2022-07-09 11:41:28,723][25689] Fps is (10 sec: 5696.8, 60 sec: 5761.8, 300 sec: 5758.8). Total num frames: 238699520. Throughput: 0: 6036.0. Samples: 238699726. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:28,724][25689] Avg episode reward: [(0, '-47.819')] [2022-07-09 11:41:29,766][26022] Updated weights on worker 0-0, policy_version 233111 (0.00618) [2022-07-09 11:41:31,208][26022] Updated weights on worker 0-0, policy_version 233121 (0.00089) [2022-07-09 11:41:33,250][26022] Updated weights on worker 0-0, policy_version 233131 (0.00087) [2022-07-09 11:41:33,781][25689] Fps is (10 sec: 5680.3, 60 sec: 5815.2, 300 sec: 5764.9). Total num frames: 238730240. Throughput: 0: 6063.9. Samples: 238734552. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:33,782][25689] Avg episode reward: [(0, '-46.989')] [2022-07-09 11:41:34,801][26022] Updated weights on worker 0-0, policy_version 233141 (0.00092) [2022-07-09 11:41:36,848][26022] Updated weights on worker 0-0, policy_version 233151 (0.00085) [2022-07-09 11:41:38,337][26022] Updated weights on worker 0-0, policy_version 233161 (0.00090) [2022-07-09 11:41:38,806][25689] Fps is (10 sec: 5890.4, 60 sec: 5763.7, 300 sec: 5764.5). Total num frames: 238758912. Throughput: 0: 5197.8. Samples: 238752044. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:38,807][25689] Avg episode reward: [(0, '-46.674')] [2022-07-09 11:41:40,398][26022] Updated weights on worker 0-0, policy_version 233171 (0.00086) [2022-07-09 11:41:41,952][26022] Updated weights on worker 0-0, policy_version 233181 (0.00082) [2022-07-09 11:41:43,846][25689] Fps is (10 sec: 5596.0, 60 sec: 5727.5, 300 sec: 5757.1). Total num frames: 238786560. Throughput: 0: 6028.0. Samples: 238786460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 11:41:43,847][25689] Avg episode reward: [(0, '-46.389')] [2022-07-09 11:41:43,905][26022] Updated weights on worker 0-0, policy_version 233191 (0.00090) [2022-07-09 11:41:45,458][26022] Updated weights on worker 0-0, policy_version 233201 (0.00083) [2022-07-09 11:41:47,389][26022] Updated weights on worker 0-0, policy_version 233211 (0.00194) [2022-07-09 11:41:48,889][25689] Fps is (10 sec: 5789.0, 60 sec: 5776.3, 300 sec: 5764.5). Total num frames: 238817280. Throughput: 0: 6045.8. Samples: 238821642. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:41:48,889][25689] Avg episode reward: [(0, '-45.982')] [2022-07-09 11:41:48,923][26022] Updated weights on worker 0-0, policy_version 233221 (0.00092) [2022-07-09 11:41:50,788][26022] Updated weights on worker 0-0, policy_version 233231 (0.00091) [2022-07-09 11:41:52,689][26022] Updated weights on worker 0-0, policy_version 233241 (0.00084) [2022-07-09 11:41:53,980][25689] Fps is (10 sec: 6062.6, 60 sec: 5772.8, 300 sec: 5759.3). Total num frames: 238848000. Throughput: 0: 5177.5. Samples: 238839130. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:41:53,981][25689] Avg episode reward: [(0, '-46.149')] [2022-07-09 11:41:54,176][26022] Updated weights on worker 0-0, policy_version 233251 (0.00107) [2022-07-09 11:41:56,244][26022] Updated weights on worker 0-0, policy_version 233261 (0.00090) [2022-07-09 11:41:57,821][26022] Updated weights on worker 0-0, policy_version 233271 (0.00086) [2022-07-09 11:41:59,033][25689] Fps is (10 sec: 5753.5, 60 sec: 5760.3, 300 sec: 5761.9). Total num frames: 238875648. Throughput: 0: 6035.6. Samples: 238874128. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:41:59,034][25689] Avg episode reward: [(0, '-47.141')] [2022-07-09 11:41:59,665][26022] Updated weights on worker 0-0, policy_version 233281 (0.00085) [2022-07-09 11:42:01,195][26022] Updated weights on worker 0-0, policy_version 233291 (0.00090) [2022-07-09 11:42:03,527][26022] Updated weights on worker 0-0, policy_version 233301 (0.00100) [2022-07-09 11:42:04,049][25689] Fps is (10 sec: 5390.0, 60 sec: 5763.3, 300 sec: 5761.8). Total num frames: 238902272. Throughput: 0: 5951.6. Samples: 238906702. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:04,050][25689] Avg episode reward: [(0, '-47.748')] [2022-07-09 11:42:05,241][26022] Updated weights on worker 0-0, policy_version 233311 (0.00089) [2022-07-09 11:42:07,266][26022] Updated weights on worker 0-0, policy_version 233321 (0.00086) [2022-07-09 11:42:08,691][26022] Updated weights on worker 0-0, policy_version 233331 (0.00087) [2022-07-09 11:42:09,060][25689] Fps is (10 sec: 5719.1, 60 sec: 5766.4, 300 sec: 5764.0). Total num frames: 238932992. Throughput: 0: 5958.0. Samples: 238941824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:09,061][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 11:42:10,572][26022] Updated weights on worker 0-0, policy_version 233341 (0.00086) [2022-07-09 11:42:12,439][26022] Updated weights on worker 0-0, policy_version 233351 (0.00093) [2022-07-09 11:42:14,170][25689] Fps is (10 sec: 5868.2, 60 sec: 5766.8, 300 sec: 5765.4). Total num frames: 238961664. Throughput: 0: 5935.8. Samples: 238958974. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:14,171][25689] Avg episode reward: [(0, '-48.529')] [2022-07-09 11:42:14,175][26022] Updated weights on worker 0-0, policy_version 233361 (0.00092) [2022-07-09 11:42:15,954][26022] Updated weights on worker 0-0, policy_version 233371 (0.00084) [2022-07-09 11:42:17,531][26022] Updated weights on worker 0-0, policy_version 233381 (0.00084) [2022-07-09 11:42:19,198][25689] Fps is (10 sec: 5656.3, 60 sec: 5750.8, 300 sec: 5761.4). Total num frames: 238990336. Throughput: 0: 5937.9. Samples: 238993866. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:19,199][25689] Avg episode reward: [(0, '-49.277')] [2022-07-09 11:42:19,479][26022] Updated weights on worker 0-0, policy_version 233391 (0.00088) [2022-07-09 11:42:21,407][26022] Updated weights on worker 0-0, policy_version 233401 (0.00086) [2022-07-09 11:42:22,931][26022] Updated weights on worker 0-0, policy_version 233411 (0.00087) [2022-07-09 11:42:24,223][25689] Fps is (10 sec: 5704.3, 60 sec: 5717.4, 300 sec: 5757.9). Total num frames: 239019008. Throughput: 0: 6054.1. Samples: 239028836. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:24,223][25689] Avg episode reward: [(0, '-49.238')] [2022-07-09 11:42:24,865][26022] Updated weights on worker 0-0, policy_version 233421 (0.00088) [2022-07-09 11:42:26,331][26022] Updated weights on worker 0-0, policy_version 233431 (0.00093) [2022-07-09 11:42:28,355][26022] Updated weights on worker 0-0, policy_version 233441 (0.00085) [2022-07-09 11:42:29,232][25689] Fps is (10 sec: 5816.9, 60 sec: 5770.7, 300 sec: 5763.0). Total num frames: 239048704. Throughput: 0: 5165.5. Samples: 239046024. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:29,234][25689] Avg episode reward: [(0, '-48.753')] [2022-07-09 11:42:30,095][26022] Updated weights on worker 0-0, policy_version 233451 (0.00092) [2022-07-09 11:42:31,730][26022] Updated weights on worker 0-0, policy_version 233461 (0.00087) [2022-07-09 11:42:33,703][26022] Updated weights on worker 0-0, policy_version 233471 (0.00084) [2022-07-09 11:42:34,309][25689] Fps is (10 sec: 5990.0, 60 sec: 5769.0, 300 sec: 5768.7). Total num frames: 239079424. Throughput: 0: 6046.4. Samples: 239080742. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:34,309][25689] Avg episode reward: [(0, '-47.743')] [2022-07-09 11:42:35,300][26022] Updated weights on worker 0-0, policy_version 233481 (0.00091) [2022-07-09 11:42:37,102][26022] Updated weights on worker 0-0, policy_version 233491 (0.00099) [2022-07-09 11:42:38,855][26022] Updated weights on worker 0-0, policy_version 233501 (0.00093) [2022-07-09 11:42:39,345][25689] Fps is (10 sec: 5670.5, 60 sec: 5734.1, 300 sec: 5761.3). Total num frames: 239106048. Throughput: 0: 6028.1. Samples: 239115314. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:39,346][25689] Avg episode reward: [(0, '-47.154')] [2022-07-09 11:42:40,672][26022] Updated weights on worker 0-0, policy_version 233511 (0.00090) [2022-07-09 11:42:42,616][26022] Updated weights on worker 0-0, policy_version 233521 (0.00092) [2022-07-09 11:42:44,363][25689] Fps is (10 sec: 5500.0, 60 sec: 5753.1, 300 sec: 5755.8). Total num frames: 239134720. Throughput: 0: 5157.6. Samples: 239132710. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:44,363][25689] Avg episode reward: [(0, '-46.589')] [2022-07-09 11:42:44,386][26022] Updated weights on worker 0-0, policy_version 233531 (0.00082) [2022-07-09 11:42:46,079][26022] Updated weights on worker 0-0, policy_version 233541 (0.00094) [2022-07-09 11:42:47,819][26022] Updated weights on worker 0-0, policy_version 233551 (0.00087) [2022-07-09 11:42:49,382][25689] Fps is (10 sec: 5815.3, 60 sec: 5738.4, 300 sec: 5756.9). Total num frames: 239164416. Throughput: 0: 6019.1. Samples: 239167308. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:49,386][25689] Avg episode reward: [(0, '-46.804')] [2022-07-09 11:42:49,636][26022] Updated weights on worker 0-0, policy_version 233561 (0.00090) [2022-07-09 11:42:51,355][26022] Updated weights on worker 0-0, policy_version 233571 (0.00088) [2022-07-09 11:42:52,940][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:42:52,958][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000233580_239185920.pth [2022-07-09 11:42:52,958][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000231554_237111296.pth [2022-07-09 11:42:53,075][26022] Updated weights on worker 0-0, policy_version 233581 (0.00085) [2022-07-09 11:42:54,520][25689] Fps is (10 sec: 5847.4, 60 sec: 5717.1, 300 sec: 5761.1). Total num frames: 239194112. Throughput: 0: 6020.5. Samples: 239202422. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:54,520][25689] Avg episode reward: [(0, '-46.682')] [2022-07-09 11:42:54,887][26022] Updated weights on worker 0-0, policy_version 233591 (0.00090) [2022-07-09 11:42:56,675][26022] Updated weights on worker 0-0, policy_version 233601 (0.00084) [2022-07-09 11:42:58,455][26022] Updated weights on worker 0-0, policy_version 233611 (0.00082) [2022-07-09 11:42:59,524][25689] Fps is (10 sec: 5856.0, 60 sec: 5755.6, 300 sec: 5764.7). Total num frames: 239223808. Throughput: 0: 5182.0. Samples: 239219882. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:42:59,531][25689] Avg episode reward: [(0, '-47.868')] [2022-07-09 11:43:00,035][26022] Updated weights on worker 0-0, policy_version 233621 (0.00092) [2022-07-09 11:43:02,311][26022] Updated weights on worker 0-0, policy_version 233631 (0.00089) [2022-07-09 11:43:04,001][26022] Updated weights on worker 0-0, policy_version 233641 (0.00082) [2022-07-09 11:43:04,532][25689] Fps is (10 sec: 5625.2, 60 sec: 5756.3, 300 sec: 5762.8). Total num frames: 239250432. Throughput: 0: 5961.0. Samples: 239252938. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:04,532][25689] Avg episode reward: [(0, '-47.767')] [2022-07-09 11:43:05,713][26022] Updated weights on worker 0-0, policy_version 233651 (0.00093) [2022-07-09 11:43:07,645][26022] Updated weights on worker 0-0, policy_version 233661 (0.00086) [2022-07-09 11:43:09,191][26022] Updated weights on worker 0-0, policy_version 233671 (0.00088) [2022-07-09 11:43:09,599][25689] Fps is (10 sec: 5590.2, 60 sec: 5734.1, 300 sec: 5759.9). Total num frames: 239280128. Throughput: 0: 5957.5. Samples: 239287750. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:09,600][25689] Avg episode reward: [(0, '-47.893')] [2022-07-09 11:43:11,112][26022] Updated weights on worker 0-0, policy_version 233681 (0.00091) [2022-07-09 11:43:12,890][26022] Updated weights on worker 0-0, policy_version 233691 (0.00094) [2022-07-09 11:43:14,551][26022] Updated weights on worker 0-0, policy_version 233701 (0.00086) [2022-07-09 11:43:14,641][25689] Fps is (10 sec: 5875.2, 60 sec: 5757.5, 300 sec: 5766.0). Total num frames: 239309824. Throughput: 0: 5104.6. Samples: 239305132. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:14,641][25689] Avg episode reward: [(0, '-47.359')] [2022-07-09 11:43:16,389][26022] Updated weights on worker 0-0, policy_version 233711 (0.00086) [2022-07-09 11:43:18,023][26022] Updated weights on worker 0-0, policy_version 233721 (0.00087) [2022-07-09 11:43:19,659][25689] Fps is (10 sec: 5802.0, 60 sec: 5758.4, 300 sec: 5763.0). Total num frames: 239338496. Throughput: 0: 5955.2. Samples: 239339788. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:19,660][25689] Avg episode reward: [(0, '-47.682')] [2022-07-09 11:43:20,011][26022] Updated weights on worker 0-0, policy_version 233731 (0.00089) [2022-07-09 11:43:21,716][26022] Updated weights on worker 0-0, policy_version 233741 (0.00081) [2022-07-09 11:43:23,498][26022] Updated weights on worker 0-0, policy_version 233751 (0.00076) [2022-07-09 11:43:24,683][25689] Fps is (10 sec: 5812.4, 60 sec: 5775.5, 300 sec: 5759.5). Total num frames: 239368192. Throughput: 0: 6041.2. Samples: 239374674. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:24,684][25689] Avg episode reward: [(0, '-47.938')] [2022-07-09 11:43:25,130][26022] Updated weights on worker 0-0, policy_version 233761 (0.00096) [2022-07-09 11:43:26,971][26022] Updated weights on worker 0-0, policy_version 233771 (0.00083) [2022-07-09 11:43:28,808][26022] Updated weights on worker 0-0, policy_version 233781 (0.00091) [2022-07-09 11:43:29,702][25689] Fps is (10 sec: 5811.8, 60 sec: 5757.6, 300 sec: 5764.6). Total num frames: 239396864. Throughput: 0: 5192.2. Samples: 239392128. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:29,704][25689] Avg episode reward: [(0, '-47.699')] [2022-07-09 11:43:30,634][26022] Updated weights on worker 0-0, policy_version 233791 (0.00086) [2022-07-09 11:43:32,380][26022] Updated weights on worker 0-0, policy_version 233801 (0.00083) [2022-07-09 11:43:34,131][26022] Updated weights on worker 0-0, policy_version 233811 (0.00087) [2022-07-09 11:43:34,753][25689] Fps is (10 sec: 5796.0, 60 sec: 5743.1, 300 sec: 5761.4). Total num frames: 239426560. Throughput: 0: 6058.0. Samples: 239426974. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:34,754][25689] Avg episode reward: [(0, '-48.568')] [2022-07-09 11:43:35,719][26022] Updated weights on worker 0-0, policy_version 233821 (0.00088) [2022-07-09 11:43:37,552][26022] Updated weights on worker 0-0, policy_version 233831 (0.00088) [2022-07-09 11:43:39,343][26022] Updated weights on worker 0-0, policy_version 233841 (0.00085) [2022-07-09 11:43:39,758][25689] Fps is (10 sec: 5906.0, 60 sec: 5796.9, 300 sec: 5765.3). Total num frames: 239456256. Throughput: 0: 6089.0. Samples: 239462172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:39,767][25689] Avg episode reward: [(0, '-47.815')] [2022-07-09 11:43:41,176][26022] Updated weights on worker 0-0, policy_version 233851 (0.00086) [2022-07-09 11:43:42,663][26022] Updated weights on worker 0-0, policy_version 233861 (0.00089) [2022-07-09 11:43:44,653][26022] Updated weights on worker 0-0, policy_version 233871 (0.00085) [2022-07-09 11:43:44,787][25689] Fps is (10 sec: 5715.0, 60 sec: 5778.9, 300 sec: 5754.5). Total num frames: 239483904. Throughput: 0: 5222.9. Samples: 239479676. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:44,787][25689] Avg episode reward: [(0, '-47.650')] [2022-07-09 11:43:46,210][26022] Updated weights on worker 0-0, policy_version 233881 (0.00094) [2022-07-09 11:43:48,233][26022] Updated weights on worker 0-0, policy_version 233891 (0.00095) [2022-07-09 11:43:49,819][25689] Fps is (10 sec: 5700.0, 60 sec: 5777.7, 300 sec: 5762.0). Total num frames: 239513600. Throughput: 0: 6065.5. Samples: 239514144. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:49,820][25689] Avg episode reward: [(0, '-46.058')] [2022-07-09 11:43:49,917][26022] Updated weights on worker 0-0, policy_version 233901 (0.00090) [2022-07-09 11:43:51,723][26022] Updated weights on worker 0-0, policy_version 233911 (0.00084) [2022-07-09 11:43:53,442][26022] Updated weights on worker 0-0, policy_version 233921 (0.00084) [2022-07-09 11:43:54,872][25689] Fps is (10 sec: 5889.0, 60 sec: 5785.7, 300 sec: 5758.9). Total num frames: 239543296. Throughput: 0: 6058.8. Samples: 239548872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:54,874][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 11:43:55,202][26022] Updated weights on worker 0-0, policy_version 233931 (0.00083) [2022-07-09 11:43:56,910][26022] Updated weights on worker 0-0, policy_version 233941 (0.00102) [2022-07-09 11:43:58,909][26022] Updated weights on worker 0-0, policy_version 233951 (0.00088) [2022-07-09 11:43:59,899][25689] Fps is (10 sec: 5688.8, 60 sec: 5749.7, 300 sec: 5763.4). Total num frames: 239570944. Throughput: 0: 5174.0. Samples: 239566378. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:43:59,901][25689] Avg episode reward: [(0, '-46.757')] [2022-07-09 11:44:00,402][26022] Updated weights on worker 0-0, policy_version 233961 (0.00084) [2022-07-09 11:44:02,791][26022] Updated weights on worker 0-0, policy_version 233971 (0.00095) [2022-07-09 11:44:04,356][26022] Updated weights on worker 0-0, policy_version 233981 (0.00082) [2022-07-09 11:44:04,910][25689] Fps is (10 sec: 5508.7, 60 sec: 5766.3, 300 sec: 5761.2). Total num frames: 239598592. Throughput: 0: 5940.3. Samples: 239599214. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 11:44:04,911][25689] Avg episode reward: [(0, '-46.373')] [2022-07-09 11:44:06,300][26022] Updated weights on worker 0-0, policy_version 233991 (0.00087) [2022-07-09 11:44:07,983][26022] Updated weights on worker 0-0, policy_version 234001 (0.00086) [2022-07-09 11:44:09,511][26022] Updated weights on worker 0-0, policy_version 234011 (0.00086) [2022-07-09 11:44:09,914][25689] Fps is (10 sec: 5725.7, 60 sec: 5772.4, 300 sec: 5758.8). Total num frames: 239628288. Throughput: 0: 5969.5. Samples: 239634102. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:09,915][25689] Avg episode reward: [(0, '-46.672')] [2022-07-09 11:44:11,661][26022] Updated weights on worker 0-0, policy_version 234021 (0.00084) [2022-07-09 11:44:13,151][26022] Updated weights on worker 0-0, policy_version 234031 (0.00100) [2022-07-09 11:44:14,955][25689] Fps is (10 sec: 5810.5, 60 sec: 5755.4, 300 sec: 5761.7). Total num frames: 239656960. Throughput: 0: 5112.2. Samples: 239651542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:14,956][25689] Avg episode reward: [(0, '-46.953')] [2022-07-09 11:44:15,108][26022] Updated weights on worker 0-0, policy_version 234041 (0.00086) [2022-07-09 11:44:16,750][26022] Updated weights on worker 0-0, policy_version 234051 (0.00086) [2022-07-09 11:44:18,509][26022] Updated weights on worker 0-0, policy_version 234061 (0.00092) [2022-07-09 11:44:19,979][25689] Fps is (10 sec: 5798.9, 60 sec: 5771.9, 300 sec: 5757.9). Total num frames: 239686656. Throughput: 0: 5980.7. Samples: 239686474. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:19,980][25689] Avg episode reward: [(0, '-47.361')] [2022-07-09 11:44:20,078][26022] Updated weights on worker 0-0, policy_version 234071 (0.00085) [2022-07-09 11:44:22,005][26022] Updated weights on worker 0-0, policy_version 234081 (0.00102) [2022-07-09 11:44:23,742][26022] Updated weights on worker 0-0, policy_version 234091 (0.00081) [2022-07-09 11:44:24,987][25689] Fps is (10 sec: 5920.3, 60 sec: 5773.4, 300 sec: 5761.7). Total num frames: 239716352. Throughput: 0: 6082.6. Samples: 239721336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:24,988][25689] Avg episode reward: [(0, '-47.221')] [2022-07-09 11:44:25,786][26022] Updated weights on worker 0-0, policy_version 234101 (0.00083) [2022-07-09 11:44:27,515][26022] Updated weights on worker 0-0, policy_version 234111 (0.00087) [2022-07-09 11:44:29,145][26022] Updated weights on worker 0-0, policy_version 234121 (0.00094) [2022-07-09 11:44:30,016][25689] Fps is (10 sec: 5611.4, 60 sec: 5738.5, 300 sec: 5753.2). Total num frames: 239742976. Throughput: 0: 5196.6. Samples: 239738566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:30,017][25689] Avg episode reward: [(0, '-46.305')] [2022-07-09 11:44:31,184][26022] Updated weights on worker 0-0, policy_version 234131 (0.00606) [2022-07-09 11:44:32,574][26022] Updated weights on worker 0-0, policy_version 234141 (0.00086) [2022-07-09 11:44:34,545][26022] Updated weights on worker 0-0, policy_version 234151 (0.00096) [2022-07-09 11:44:35,112][25689] Fps is (10 sec: 5663.9, 60 sec: 5751.2, 300 sec: 5759.9). Total num frames: 239773696. Throughput: 0: 6033.7. Samples: 239773160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:35,112][25689] Avg episode reward: [(0, '-45.897')] [2022-07-09 11:44:36,368][26022] Updated weights on worker 0-0, policy_version 234161 (0.00090) [2022-07-09 11:44:38,275][26022] Updated weights on worker 0-0, policy_version 234171 (0.00087) [2022-07-09 11:44:40,046][26022] Updated weights on worker 0-0, policy_version 234181 (0.00085) [2022-07-09 11:44:40,125][25689] Fps is (10 sec: 5773.7, 60 sec: 5716.5, 300 sec: 5754.0). Total num frames: 239801344. Throughput: 0: 6012.1. Samples: 239807594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:40,126][25689] Avg episode reward: [(0, '-45.621')] [2022-07-09 11:44:41,509][26022] Updated weights on worker 0-0, policy_version 234191 (0.00087) [2022-07-09 11:44:43,500][26022] Updated weights on worker 0-0, policy_version 234201 (0.00081) [2022-07-09 11:44:45,134][25689] Fps is (10 sec: 5721.4, 60 sec: 5752.3, 300 sec: 5754.9). Total num frames: 239831040. Throughput: 0: 5146.6. Samples: 239825026. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:45,135][25689] Avg episode reward: [(0, '-45.153')] [2022-07-09 11:44:45,465][26022] Updated weights on worker 0-0, policy_version 234212 (0.00082) [2022-07-09 11:44:47,156][26022] Updated weights on worker 0-0, policy_version 234222 (0.00081) [2022-07-09 11:44:48,863][26022] Updated weights on worker 0-0, policy_version 234232 (0.00087) [2022-07-09 11:44:50,146][25689] Fps is (10 sec: 5824.7, 60 sec: 5737.2, 300 sec: 5752.4). Total num frames: 239859712. Throughput: 0: 6020.8. Samples: 239859764. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:50,148][25689] Avg episode reward: [(0, '-45.363')] [2022-07-09 11:44:50,622][26022] Updated weights on worker 0-0, policy_version 234242 (0.00083) [2022-07-09 11:44:52,418][26022] Updated weights on worker 0-0, policy_version 234252 (0.00086) [2022-07-09 11:44:52,990][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:44:52,998][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000234254_239876096.pth [2022-07-09 11:44:52,999][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000232229_237802496.pth [2022-07-09 11:44:54,246][26022] Updated weights on worker 0-0, policy_version 234262 (0.00080) [2022-07-09 11:44:55,191][25689] Fps is (10 sec: 5803.8, 60 sec: 5738.0, 300 sec: 5752.1). Total num frames: 239889408. Throughput: 0: 6054.2. Samples: 239894726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:44:55,192][25689] Avg episode reward: [(0, '-45.490')] [2022-07-09 11:44:56,087][26022] Updated weights on worker 0-0, policy_version 234272 (0.00085) [2022-07-09 11:44:57,811][26022] Updated weights on worker 0-0, policy_version 234282 (0.00087) [2022-07-09 11:44:59,537][26022] Updated weights on worker 0-0, policy_version 234292 (0.00081) [2022-07-09 11:45:00,198][25689] Fps is (10 sec: 5806.5, 60 sec: 5756.9, 300 sec: 5762.8). Total num frames: 239918080. Throughput: 0: 5202.8. Samples: 239912030. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:00,199][25689] Avg episode reward: [(0, '-45.224')] [2022-07-09 11:45:01,186][26022] Updated weights on worker 0-0, policy_version 234302 (0.00084) [2022-07-09 11:45:03,495][26022] Updated weights on worker 0-0, policy_version 234312 (0.00084) [2022-07-09 11:45:05,007][26022] Updated weights on worker 0-0, policy_version 234322 (0.00084) [2022-07-09 11:45:05,258][25689] Fps is (10 sec: 5594.9, 60 sec: 5752.3, 300 sec: 5754.9). Total num frames: 239945728. Throughput: 0: 5947.7. Samples: 239944712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:05,258][25689] Avg episode reward: [(0, '-46.072')] [2022-07-09 11:45:07,151][26022] Updated weights on worker 0-0, policy_version 234332 (0.00095) [2022-07-09 11:45:08,738][26022] Updated weights on worker 0-0, policy_version 234342 (0.00083) [2022-07-09 11:45:10,283][25689] Fps is (10 sec: 5584.8, 60 sec: 5733.3, 300 sec: 5759.0). Total num frames: 239974400. Throughput: 0: 5942.9. Samples: 239979434. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:10,285][25689] Avg episode reward: [(0, '-47.426')] [2022-07-09 11:45:10,618][26022] Updated weights on worker 0-0, policy_version 234352 (0.00086) [2022-07-09 11:45:12,412][26022] Updated weights on worker 0-0, policy_version 234362 (0.00093) [2022-07-09 11:45:14,176][26022] Updated weights on worker 0-0, policy_version 234372 (0.00084) [2022-07-09 11:45:15,421][25689] Fps is (10 sec: 5642.3, 60 sec: 5724.1, 300 sec: 5758.2). Total num frames: 240003072. Throughput: 0: 5888.8. Samples: 240013854. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:15,423][25689] Avg episode reward: [(0, '-46.252')] [2022-07-09 11:45:15,955][26022] Updated weights on worker 0-0, policy_version 234382 (0.00084) [2022-07-09 11:45:17,728][26022] Updated weights on worker 0-0, policy_version 234392 (0.00086) [2022-07-09 11:45:19,646][26022] Updated weights on worker 0-0, policy_version 234402 (0.00089) [2022-07-09 11:45:20,423][25689] Fps is (10 sec: 5756.0, 60 sec: 5726.2, 300 sec: 5758.5). Total num frames: 240032768. Throughput: 0: 5886.3. Samples: 240031080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:20,425][25689] Avg episode reward: [(0, '-45.786')] [2022-07-09 11:45:21,327][26022] Updated weights on worker 0-0, policy_version 234412 (0.00091) [2022-07-09 11:45:23,231][26022] Updated weights on worker 0-0, policy_version 234422 (0.00082) [2022-07-09 11:45:24,640][26022] Updated weights on worker 0-0, policy_version 234432 (0.00089) [2022-07-09 11:45:25,426][25689] Fps is (10 sec: 5833.8, 60 sec: 5709.7, 300 sec: 5755.3). Total num frames: 240061440. Throughput: 0: 5999.7. Samples: 240065716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:25,427][25689] Avg episode reward: [(0, '-46.067')] [2022-07-09 11:45:26,766][26022] Updated weights on worker 0-0, policy_version 234442 (0.00081) [2022-07-09 11:45:28,242][26022] Updated weights on worker 0-0, policy_version 234452 (0.00098) [2022-07-09 11:45:30,254][26022] Updated weights on worker 0-0, policy_version 234462 (0.00082) [2022-07-09 11:45:30,458][25689] Fps is (10 sec: 5612.8, 60 sec: 5726.4, 300 sec: 5756.4). Total num frames: 240089088. Throughput: 0: 5131.5. Samples: 240082960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:30,458][25689] Avg episode reward: [(0, '-46.085')] [2022-07-09 11:45:31,801][26022] Updated weights on worker 0-0, policy_version 234472 (0.00082) [2022-07-09 11:45:33,761][26022] Updated weights on worker 0-0, policy_version 234482 (0.00079) [2022-07-09 11:45:35,427][26022] Updated weights on worker 0-0, policy_version 234492 (0.00096) [2022-07-09 11:45:35,524][25689] Fps is (10 sec: 5780.2, 60 sec: 5729.2, 300 sec: 5752.0). Total num frames: 240119808. Throughput: 0: 5165.9. Samples: 240117704. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:35,525][25689] Avg episode reward: [(0, '-45.614')] [2022-07-09 11:45:37,361][26022] Updated weights on worker 0-0, policy_version 234502 (0.00083) [2022-07-09 11:45:38,954][26022] Updated weights on worker 0-0, policy_version 234512 (0.00093) [2022-07-09 11:45:40,586][25689] Fps is (10 sec: 5863.7, 60 sec: 5741.5, 300 sec: 5747.6). Total num frames: 240148480. Throughput: 0: 6026.1. Samples: 240152584. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:40,587][25689] Avg episode reward: [(0, '-46.338')] [2022-07-09 11:45:40,890][26022] Updated weights on worker 0-0, policy_version 234522 (0.00098) [2022-07-09 11:45:42,507][26022] Updated weights on worker 0-0, policy_version 234532 (0.00085) [2022-07-09 11:45:44,194][26022] Updated weights on worker 0-0, policy_version 234542 (0.00086) [2022-07-09 11:45:45,590][25689] Fps is (10 sec: 5900.5, 60 sec: 5759.0, 300 sec: 5758.4). Total num frames: 240179200. Throughput: 0: 6050.8. Samples: 240187722. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:45,590][25689] Avg episode reward: [(0, '-47.421')] [2022-07-09 11:45:46,160][26022] Updated weights on worker 0-0, policy_version 234552 (0.00099) [2022-07-09 11:45:47,710][26022] Updated weights on worker 0-0, policy_version 234562 (0.00096) [2022-07-09 11:45:49,600][26022] Updated weights on worker 0-0, policy_version 234572 (0.00090) [2022-07-09 11:45:50,689][25689] Fps is (10 sec: 5878.8, 60 sec: 5750.6, 300 sec: 5750.6). Total num frames: 240207872. Throughput: 0: 6030.1. Samples: 240204958. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:50,690][25689] Avg episode reward: [(0, '-47.775')] [2022-07-09 11:45:51,271][26022] Updated weights on worker 0-0, policy_version 234582 (0.00095) [2022-07-09 11:45:53,133][26022] Updated weights on worker 0-0, policy_version 234592 (0.00092) [2022-07-09 11:45:54,826][26022] Updated weights on worker 0-0, policy_version 234602 (0.00093) [2022-07-09 11:45:55,764][25689] Fps is (10 sec: 5837.3, 60 sec: 5764.7, 300 sec: 5758.0). Total num frames: 240238592. Throughput: 0: 6035.6. Samples: 240239866. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:45:55,765][25689] Avg episode reward: [(0, '-48.275')] [2022-07-09 11:45:56,543][26022] Updated weights on worker 0-0, policy_version 234612 (0.00083) [2022-07-09 11:45:58,336][26022] Updated weights on worker 0-0, policy_version 234622 (0.00082) [2022-07-09 11:46:00,202][26022] Updated weights on worker 0-0, policy_version 234632 (0.00086) [2022-07-09 11:46:00,786][25689] Fps is (10 sec: 5679.6, 60 sec: 5729.5, 300 sec: 5758.5). Total num frames: 240265216. Throughput: 0: 6030.0. Samples: 240274386. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:46:00,786][25689] Avg episode reward: [(0, '-47.926')] [2022-07-09 11:46:02,159][26022] Updated weights on worker 0-0, policy_version 234642 (0.00093) [2022-07-09 11:46:04,203][26022] Updated weights on worker 0-0, policy_version 234652 (0.00085) [2022-07-09 11:46:05,818][25689] Fps is (10 sec: 5398.1, 60 sec: 5732.1, 300 sec: 5748.3). Total num frames: 240292864. Throughput: 0: 5032.2. Samples: 240289516. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:46:05,819][25689] Avg episode reward: [(0, '-47.762')] [2022-07-09 11:46:05,986][26022] Updated weights on worker 0-0, policy_version 234662 (0.00083) [2022-07-09 11:46:07,467][26022] Updated weights on worker 0-0, policy_version 234672 (0.00095) [2022-07-09 11:46:09,481][26022] Updated weights on worker 0-0, policy_version 234682 (0.00089) [2022-07-09 11:46:10,848][25689] Fps is (10 sec: 5698.8, 60 sec: 5748.5, 300 sec: 5753.4). Total num frames: 240322560. Throughput: 0: 5907.8. Samples: 240324054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:46:10,849][25689] Avg episode reward: [(0, '-48.151')] [2022-07-09 11:46:11,058][26022] Updated weights on worker 0-0, policy_version 234692 (0.00084) [2022-07-09 11:46:12,962][26022] Updated weights on worker 0-0, policy_version 234702 (0.00080) [2022-07-09 11:46:14,923][26022] Updated weights on worker 0-0, policy_version 234712 (0.00093) [2022-07-09 11:46:15,914][25689] Fps is (10 sec: 5781.1, 60 sec: 5755.3, 300 sec: 5749.4). Total num frames: 240351232. Throughput: 0: 5910.3. Samples: 240358960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:46:15,915][25689] Avg episode reward: [(0, '-47.519')] [2022-07-09 11:46:16,434][26022] Updated weights on worker 0-0, policy_version 234722 (0.00082) [2022-07-09 11:46:18,383][26022] Updated weights on worker 0-0, policy_version 234732 (0.00082) [2022-07-09 11:46:20,108][26022] Updated weights on worker 0-0, policy_version 234742 (0.00085) [2022-07-09 11:46:20,952][25689] Fps is (10 sec: 5675.6, 60 sec: 5735.1, 300 sec: 5742.4). Total num frames: 240379904. Throughput: 0: 5057.9. Samples: 240376384. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 11:46:20,952][25689] Avg episode reward: [(0, '-47.881')] [2022-07-09 11:46:21,722][26022] Updated weights on worker 0-0, policy_version 234752 (0.00081) [2022-07-09 11:46:23,647][26022] Updated weights on worker 0-0, policy_version 234762 (0.00080) [2022-07-09 11:46:25,303][26022] Updated weights on worker 0-0, policy_version 234772 (0.00087) [2022-07-09 11:46:25,954][25689] Fps is (10 sec: 5712.1, 60 sec: 5735.2, 300 sec: 5749.9). Total num frames: 240408576. Throughput: 0: 6068.6. Samples: 240411710. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:25,955][25689] Avg episode reward: [(0, '-46.181')] [2022-07-09 11:46:27,126][26022] Updated weights on worker 0-0, policy_version 234782 (0.00082) [2022-07-09 11:46:29,047][26022] Updated weights on worker 0-0, policy_version 234792 (0.00091) [2022-07-09 11:46:30,545][26022] Updated weights on worker 0-0, policy_version 234802 (0.00091) [2022-07-09 11:46:30,960][25689] Fps is (10 sec: 5934.2, 60 sec: 5788.3, 300 sec: 5750.9). Total num frames: 240439296. Throughput: 0: 6066.0. Samples: 240446054. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:30,962][25689] Avg episode reward: [(0, '-46.767')] [2022-07-09 11:46:32,503][26022] Updated weights on worker 0-0, policy_version 234812 (0.00086) [2022-07-09 11:46:34,178][26022] Updated weights on worker 0-0, policy_version 234822 (0.00080) [2022-07-09 11:46:36,030][25689] Fps is (10 sec: 5792.3, 60 sec: 5737.2, 300 sec: 5746.6). Total num frames: 240466944. Throughput: 0: 5185.9. Samples: 240463278. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:36,031][25689] Avg episode reward: [(0, '-46.696')] [2022-07-09 11:46:36,090][26022] Updated weights on worker 0-0, policy_version 234832 (0.00091) [2022-07-09 11:46:37,739][26022] Updated weights on worker 0-0, policy_version 234842 (0.00086) [2022-07-09 11:46:39,757][26022] Updated weights on worker 0-0, policy_version 234852 (0.00092) [2022-07-09 11:46:41,057][25689] Fps is (10 sec: 5781.1, 60 sec: 5774.5, 300 sec: 5757.2). Total num frames: 240497664. Throughput: 0: 6051.7. Samples: 240498052. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:41,057][25689] Avg episode reward: [(0, '-46.580')] [2022-07-09 11:46:41,283][26022] Updated weights on worker 0-0, policy_version 234862 (0.00090) [2022-07-09 11:46:43,331][26022] Updated weights on worker 0-0, policy_version 234872 (0.00104) [2022-07-09 11:46:44,630][26022] Updated weights on worker 0-0, policy_version 234882 (0.00095) [2022-07-09 11:46:46,114][25689] Fps is (10 sec: 5686.7, 60 sec: 5701.6, 300 sec: 5743.1). Total num frames: 240524288. Throughput: 0: 6007.4. Samples: 240532824. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:46,116][25689] Avg episode reward: [(0, '-46.260')] [2022-07-09 11:46:46,859][26022] Updated weights on worker 0-0, policy_version 234892 (0.00087) [2022-07-09 11:46:48,475][26022] Updated weights on worker 0-0, policy_version 234902 (0.00085) [2022-07-09 11:46:50,185][26022] Updated weights on worker 0-0, policy_version 234912 (0.00081) [2022-07-09 11:46:51,120][25689] Fps is (10 sec: 5698.2, 60 sec: 5744.3, 300 sec: 5744.8). Total num frames: 240555008. Throughput: 0: 5158.3. Samples: 240550044. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:51,121][25689] Avg episode reward: [(0, '-45.767')] [2022-07-09 11:46:51,833][26022] Updated weights on worker 0-0, policy_version 234922 (0.00104) [2022-07-09 11:46:53,014][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:46:53,021][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000234928_240566272.pth [2022-07-09 11:46:53,021][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000232904_238493696.pth [2022-07-09 11:46:53,816][26022] Updated weights on worker 0-0, policy_version 234932 (0.00080) [2022-07-09 11:46:55,450][26022] Updated weights on worker 0-0, policy_version 234942 (0.00100) [2022-07-09 11:46:56,199][25689] Fps is (10 sec: 5991.0, 60 sec: 5727.1, 300 sec: 5751.2). Total num frames: 240584704. Throughput: 0: 6043.6. Samples: 240585168. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:46:56,200][25689] Avg episode reward: [(0, '-46.407')] [2022-07-09 11:46:57,484][26022] Updated weights on worker 0-0, policy_version 234952 (0.00083) [2022-07-09 11:46:58,823][26022] Updated weights on worker 0-0, policy_version 234962 (0.00096) [2022-07-09 11:47:00,939][26022] Updated weights on worker 0-0, policy_version 234972 (0.00087) [2022-07-09 11:47:01,223][25689] Fps is (10 sec: 5777.2, 60 sec: 5760.6, 300 sec: 5757.9). Total num frames: 240613376. Throughput: 0: 6061.4. Samples: 240620290. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:01,225][25689] Avg episode reward: [(0, '-47.528')] [2022-07-09 11:47:02,816][26022] Updated weights on worker 0-0, policy_version 234982 (0.00086) [2022-07-09 11:47:04,680][26022] Updated weights on worker 0-0, policy_version 234992 (0.00091) [2022-07-09 11:47:06,287][25689] Fps is (10 sec: 5582.6, 60 sec: 5757.6, 300 sec: 5746.6). Total num frames: 240641024. Throughput: 0: 5092.2. Samples: 240635548. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:06,288][25689] Avg episode reward: [(0, '-48.229')] [2022-07-09 11:47:06,436][26022] Updated weights on worker 0-0, policy_version 235002 (0.00086) [2022-07-09 11:47:08,034][26022] Updated weights on worker 0-0, policy_version 235012 (0.00094) [2022-07-09 11:47:09,943][26022] Updated weights on worker 0-0, policy_version 235022 (0.00085) [2022-07-09 11:47:11,307][25689] Fps is (10 sec: 5585.3, 60 sec: 5741.7, 300 sec: 5748.3). Total num frames: 240669696. Throughput: 0: 5962.2. Samples: 240670402. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:11,307][25689] Avg episode reward: [(0, '-49.168')] [2022-07-09 11:47:11,707][26022] Updated weights on worker 0-0, policy_version 235032 (0.00106) [2022-07-09 11:47:13,493][26022] Updated weights on worker 0-0, policy_version 235042 (0.00082) [2022-07-09 11:47:15,270][26022] Updated weights on worker 0-0, policy_version 235052 (0.00086) [2022-07-09 11:47:16,414][25689] Fps is (10 sec: 5864.9, 60 sec: 5771.7, 300 sec: 5753.7). Total num frames: 240700416. Throughput: 0: 5936.7. Samples: 240705180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:16,414][25689] Avg episode reward: [(0, '-49.084')] [2022-07-09 11:47:16,971][26022] Updated weights on worker 0-0, policy_version 235062 (0.00092) [2022-07-09 11:47:18,845][26022] Updated weights on worker 0-0, policy_version 235072 (0.00089) [2022-07-09 11:47:20,590][26022] Updated weights on worker 0-0, policy_version 235082 (0.00089) [2022-07-09 11:47:21,440][25689] Fps is (10 sec: 5861.3, 60 sec: 5772.8, 300 sec: 5753.6). Total num frames: 240729088. Throughput: 0: 5034.0. Samples: 240722060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:21,442][25689] Avg episode reward: [(0, '-49.041')] [2022-07-09 11:47:22,375][26022] Updated weights on worker 0-0, policy_version 235092 (0.00096) [2022-07-09 11:47:24,162][26022] Updated weights on worker 0-0, policy_version 235102 (0.00092) [2022-07-09 11:47:26,052][26022] Updated weights on worker 0-0, policy_version 235112 (0.00082) [2022-07-09 11:47:26,480][25689] Fps is (10 sec: 5696.9, 60 sec: 5769.1, 300 sec: 5749.6). Total num frames: 240757760. Throughput: 0: 6011.2. Samples: 240756930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:26,480][25689] Avg episode reward: [(0, '-48.609')] [2022-07-09 11:47:27,651][26022] Updated weights on worker 0-0, policy_version 235122 (0.00088) [2022-07-09 11:47:29,447][26022] Updated weights on worker 0-0, policy_version 235132 (0.00082) [2022-07-09 11:47:31,377][26022] Updated weights on worker 0-0, policy_version 235142 (0.00093) [2022-07-09 11:47:31,487][25689] Fps is (10 sec: 5707.7, 60 sec: 5735.3, 300 sec: 5744.1). Total num frames: 240786432. Throughput: 0: 6014.4. Samples: 240791770. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:31,487][25689] Avg episode reward: [(0, '-47.549')] [2022-07-09 11:47:32,920][26022] Updated weights on worker 0-0, policy_version 235152 (0.00107) [2022-07-09 11:47:34,799][26022] Updated weights on worker 0-0, policy_version 235162 (0.00089) [2022-07-09 11:47:36,385][26022] Updated weights on worker 0-0, policy_version 235172 (0.00081) [2022-07-09 11:47:36,593][25689] Fps is (10 sec: 5771.2, 60 sec: 5765.6, 300 sec: 5753.0). Total num frames: 240816128. Throughput: 0: 5153.4. Samples: 240809174. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:36,594][25689] Avg episode reward: [(0, '-47.919')] [2022-07-09 11:47:38,228][26022] Updated weights on worker 0-0, policy_version 235182 (0.00101) [2022-07-09 11:47:40,202][26022] Updated weights on worker 0-0, policy_version 235192 (0.00088) [2022-07-09 11:47:41,598][25689] Fps is (10 sec: 5772.8, 60 sec: 5733.9, 300 sec: 5753.3). Total num frames: 240844800. Throughput: 0: 6047.8. Samples: 240843970. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:41,598][25689] Avg episode reward: [(0, '-47.345')] [2022-07-09 11:47:41,962][26022] Updated weights on worker 0-0, policy_version 235202 (0.00082) [2022-07-09 11:47:43,569][26022] Updated weights on worker 0-0, policy_version 235212 (0.00088) [2022-07-09 11:47:45,388][26022] Updated weights on worker 0-0, policy_version 235222 (0.00094) [2022-07-09 11:47:46,610][25689] Fps is (10 sec: 5724.8, 60 sec: 5772.0, 300 sec: 5750.0). Total num frames: 240873472. Throughput: 0: 6055.1. Samples: 240878822. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:46,612][25689] Avg episode reward: [(0, '-47.681')] [2022-07-09 11:47:46,992][26022] Updated weights on worker 0-0, policy_version 235232 (0.00094) [2022-07-09 11:47:49,083][26022] Updated weights on worker 0-0, policy_version 235242 (0.00085) [2022-07-09 11:47:50,710][26022] Updated weights on worker 0-0, policy_version 235252 (0.00088) [2022-07-09 11:47:51,677][25689] Fps is (10 sec: 5790.6, 60 sec: 5749.3, 300 sec: 5751.3). Total num frames: 240903168. Throughput: 0: 5145.5. Samples: 240895662. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:51,679][25689] Avg episode reward: [(0, '-47.547')] [2022-07-09 11:47:52,691][26022] Updated weights on worker 0-0, policy_version 235262 (0.00084) [2022-07-09 11:47:54,382][26022] Updated weights on worker 0-0, policy_version 235272 (0.00092) [2022-07-09 11:47:56,128][26022] Updated weights on worker 0-0, policy_version 235282 (0.00098) [2022-07-09 11:47:56,827][25689] Fps is (10 sec: 5813.3, 60 sec: 5742.5, 300 sec: 5748.5). Total num frames: 240932864. Throughput: 0: 5987.8. Samples: 240930328. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:47:56,827][25689] Avg episode reward: [(0, '-46.916')] [2022-07-09 11:47:57,788][26022] Updated weights on worker 0-0, policy_version 235292 (0.00090) [2022-07-09 11:47:59,503][26022] Updated weights on worker 0-0, policy_version 235302 (0.00086) [2022-07-09 11:48:01,874][25689] Fps is (10 sec: 5423.1, 60 sec: 5689.8, 300 sec: 5744.3). Total num frames: 240958464. Throughput: 0: 5982.3. Samples: 240965268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:01,874][25689] Avg episode reward: [(0, '-46.542')] [2022-07-09 11:48:01,906][26022] Updated weights on worker 0-0, policy_version 235312 (0.00091) [2022-07-09 11:48:03,626][26022] Updated weights on worker 0-0, policy_version 235322 (0.00081) [2022-07-09 11:48:05,330][26022] Updated weights on worker 0-0, policy_version 235332 (0.00100) [2022-07-09 11:48:06,924][25689] Fps is (10 sec: 5476.2, 60 sec: 5724.8, 300 sec: 5744.7). Total num frames: 240988160. Throughput: 0: 5860.0. Samples: 240997862. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:06,925][25689] Avg episode reward: [(0, '-46.472')] [2022-07-09 11:48:07,018][26022] Updated weights on worker 0-0, policy_version 235342 (0.00090) [2022-07-09 11:48:08,690][26022] Updated weights on worker 0-0, policy_version 235352 (0.00093) [2022-07-09 11:48:10,498][26022] Updated weights on worker 0-0, policy_version 235362 (0.00086) [2022-07-09 11:48:11,962][25689] Fps is (10 sec: 5887.2, 60 sec: 5740.0, 300 sec: 5744.7). Total num frames: 241017856. Throughput: 0: 5893.2. Samples: 241015204. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:11,963][25689] Avg episode reward: [(0, '-46.774')] [2022-07-09 11:48:12,460][26022] Updated weights on worker 0-0, policy_version 235372 (0.00085) [2022-07-09 11:48:13,922][26022] Updated weights on worker 0-0, policy_version 235382 (0.00088) [2022-07-09 11:48:15,955][26022] Updated weights on worker 0-0, policy_version 235392 (0.00081) [2022-07-09 11:48:17,003][25689] Fps is (10 sec: 5893.0, 60 sec: 5729.4, 300 sec: 5747.7). Total num frames: 241047552. Throughput: 0: 5945.5. Samples: 241050284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:17,003][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 11:48:17,415][26022] Updated weights on worker 0-0, policy_version 235402 (0.00087) [2022-07-09 11:48:19,532][26022] Updated weights on worker 0-0, policy_version 235412 (0.00088) [2022-07-09 11:48:21,226][26022] Updated weights on worker 0-0, policy_version 235422 (0.00092) [2022-07-09 11:48:22,009][25689] Fps is (10 sec: 5707.9, 60 sec: 5714.4, 300 sec: 5741.2). Total num frames: 241075200. Throughput: 0: 5932.8. Samples: 241084724. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:22,009][25689] Avg episode reward: [(0, '-48.747')] [2022-07-09 11:48:22,995][26022] Updated weights on worker 0-0, policy_version 235432 (0.00086) [2022-07-09 11:48:24,816][26022] Updated weights on worker 0-0, policy_version 235442 (0.00090) [2022-07-09 11:48:26,517][26022] Updated weights on worker 0-0, policy_version 235452 (0.00088) [2022-07-09 11:48:27,023][25689] Fps is (10 sec: 5723.0, 60 sec: 5733.7, 300 sec: 5744.7). Total num frames: 241104896. Throughput: 0: 5190.6. Samples: 241102184. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:27,023][25689] Avg episode reward: [(0, '-47.793')] [2022-07-09 11:48:28,256][26022] Updated weights on worker 0-0, policy_version 235462 (0.00089) [2022-07-09 11:48:30,369][26022] Updated weights on worker 0-0, policy_version 235472 (0.00615) [2022-07-09 11:48:31,898][26022] Updated weights on worker 0-0, policy_version 235482 (0.00085) [2022-07-09 11:48:32,035][25689] Fps is (10 sec: 5923.8, 60 sec: 5750.1, 300 sec: 5745.5). Total num frames: 241134592. Throughput: 0: 6047.9. Samples: 241136600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:32,035][25689] Avg episode reward: [(0, '-47.682')] [2022-07-09 11:48:33,763][26022] Updated weights on worker 0-0, policy_version 235492 (0.00087) [2022-07-09 11:48:35,315][26022] Updated weights on worker 0-0, policy_version 235502 (0.00078) [2022-07-09 11:48:37,100][25689] Fps is (10 sec: 5791.9, 60 sec: 5737.2, 300 sec: 5740.9). Total num frames: 241163264. Throughput: 0: 6037.6. Samples: 241171624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 11:48:37,101][25689] Avg episode reward: [(0, '-47.776')] [2022-07-09 11:48:37,245][26022] Updated weights on worker 0-0, policy_version 235512 (0.00083) [2022-07-09 11:48:38,870][26022] Updated weights on worker 0-0, policy_version 235522 (0.00099) [2022-07-09 11:48:40,797][26022] Updated weights on worker 0-0, policy_version 235532 (0.01421) [2022-07-09 11:48:42,104][25689] Fps is (10 sec: 5796.7, 60 sec: 5754.1, 300 sec: 5748.3). Total num frames: 241192960. Throughput: 0: 5195.2. Samples: 241189122. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:48:42,104][25689] Avg episode reward: [(0, '-48.259')] [2022-07-09 11:48:42,626][26022] Updated weights on worker 0-0, policy_version 235542 (0.00085) [2022-07-09 11:48:44,079][26022] Updated weights on worker 0-0, policy_version 235552 (0.00090) [2022-07-09 11:48:46,182][26022] Updated weights on worker 0-0, policy_version 235562 (0.00091) [2022-07-09 11:48:47,117][25689] Fps is (10 sec: 5724.8, 60 sec: 5737.2, 300 sec: 5741.7). Total num frames: 241220608. Throughput: 0: 6057.9. Samples: 241223912. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:48:47,117][25689] Avg episode reward: [(0, '-48.201')] [2022-07-09 11:48:47,776][26022] Updated weights on worker 0-0, policy_version 235572 (0.00079) [2022-07-09 11:48:49,487][26022] Updated weights on worker 0-0, policy_version 235582 (0.00086) [2022-07-09 11:48:51,536][26022] Updated weights on worker 0-0, policy_version 235592 (0.00084) [2022-07-09 11:48:52,129][25689] Fps is (10 sec: 5719.8, 60 sec: 5742.3, 300 sec: 5742.5). Total num frames: 241250304. Throughput: 0: 6064.2. Samples: 241258458. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:48:52,130][25689] Avg episode reward: [(0, '-47.800')] [2022-07-09 11:48:53,085][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:48:53,091][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000235602_241256448.pth [2022-07-09 11:48:53,091][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000233580_239185920.pth [2022-07-09 11:48:53,102][26022] Updated weights on worker 0-0, policy_version 235602 (0.01012) [2022-07-09 11:48:55,011][26022] Updated weights on worker 0-0, policy_version 235612 (0.00091) [2022-07-09 11:48:56,595][26022] Updated weights on worker 0-0, policy_version 235622 (0.00085) [2022-07-09 11:48:57,172][25689] Fps is (10 sec: 5906.4, 60 sec: 5752.5, 300 sec: 5749.1). Total num frames: 241280000. Throughput: 0: 5197.6. Samples: 241275952. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:48:57,173][25689] Avg episode reward: [(0, '-48.604')] [2022-07-09 11:48:58,542][26022] Updated weights on worker 0-0, policy_version 235632 (0.00084) [2022-07-09 11:49:00,222][26022] Updated weights on worker 0-0, policy_version 235642 (0.00079) [2022-07-09 11:49:02,179][25689] Fps is (10 sec: 5706.3, 60 sec: 5790.3, 300 sec: 5749.2). Total num frames: 241307648. Throughput: 0: 6065.6. Samples: 241310888. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:02,179][25689] Avg episode reward: [(0, '-48.313')] [2022-07-09 11:49:02,181][26022] Updated weights on worker 0-0, policy_version 235652 (0.00099) [2022-07-09 11:49:04,141][26022] Updated weights on worker 0-0, policy_version 235662 (0.00096) [2022-07-09 11:49:05,803][26022] Updated weights on worker 0-0, policy_version 235672 (0.00077) [2022-07-09 11:49:07,201][25689] Fps is (10 sec: 5513.6, 60 sec: 5759.0, 300 sec: 5741.9). Total num frames: 241335296. Throughput: 0: 5950.5. Samples: 241343426. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:07,202][25689] Avg episode reward: [(0, '-48.378')] [2022-07-09 11:49:07,494][26022] Updated weights on worker 0-0, policy_version 235682 (0.00089) [2022-07-09 11:49:09,425][26022] Updated weights on worker 0-0, policy_version 235692 (0.00089) [2022-07-09 11:49:11,264][26022] Updated weights on worker 0-0, policy_version 235702 (0.00092) [2022-07-09 11:49:12,204][25689] Fps is (10 sec: 5617.8, 60 sec: 5745.4, 300 sec: 5742.7). Total num frames: 241363968. Throughput: 0: 5094.9. Samples: 241360736. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:12,206][25689] Avg episode reward: [(0, '-48.630')] [2022-07-09 11:49:13,011][26022] Updated weights on worker 0-0, policy_version 235712 (0.00089) [2022-07-09 11:49:14,690][26022] Updated weights on worker 0-0, policy_version 235722 (0.00087) [2022-07-09 11:49:16,576][26022] Updated weights on worker 0-0, policy_version 235732 (0.00088) [2022-07-09 11:49:17,272][25689] Fps is (10 sec: 5795.9, 60 sec: 5742.8, 300 sec: 5741.8). Total num frames: 241393664. Throughput: 0: 5945.4. Samples: 241395452. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:17,273][25689] Avg episode reward: [(0, '-49.571')] [2022-07-09 11:49:18,070][26022] Updated weights on worker 0-0, policy_version 235742 (0.00513) [2022-07-09 11:49:20,191][26022] Updated weights on worker 0-0, policy_version 235752 (0.00097) [2022-07-09 11:49:21,717][26022] Updated weights on worker 0-0, policy_version 235762 (0.00086) [2022-07-09 11:49:22,303][25689] Fps is (10 sec: 5779.7, 60 sec: 5757.4, 300 sec: 5738.0). Total num frames: 241422336. Throughput: 0: 5913.3. Samples: 241429888. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:22,303][25689] Avg episode reward: [(0, '-49.325')] [2022-07-09 11:49:23,822][26022] Updated weights on worker 0-0, policy_version 235772 (0.00084) [2022-07-09 11:49:25,219][26022] Updated weights on worker 0-0, policy_version 235782 (0.00094) [2022-07-09 11:49:27,321][25689] Fps is (10 sec: 5604.4, 60 sec: 5723.0, 300 sec: 5741.6). Total num frames: 241449984. Throughput: 0: 5167.9. Samples: 241447402. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:27,322][25689] Avg episode reward: [(0, '-49.228')] [2022-07-09 11:49:27,348][26022] Updated weights on worker 0-0, policy_version 235792 (0.00093) [2022-07-09 11:49:28,878][26022] Updated weights on worker 0-0, policy_version 235802 (0.00083) [2022-07-09 11:49:30,915][26022] Updated weights on worker 0-0, policy_version 235812 (0.00089) [2022-07-09 11:49:32,348][25689] Fps is (10 sec: 5810.5, 60 sec: 5738.6, 300 sec: 5742.9). Total num frames: 241480704. Throughput: 0: 6019.3. Samples: 241481988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:32,348][25689] Avg episode reward: [(0, '-49.368')] [2022-07-09 11:49:32,506][26022] Updated weights on worker 0-0, policy_version 235822 (0.00088) [2022-07-09 11:49:34,339][26022] Updated weights on worker 0-0, policy_version 235832 (0.00085) [2022-07-09 11:49:35,886][26022] Updated weights on worker 0-0, policy_version 235842 (0.00087) [2022-07-09 11:49:37,412][25689] Fps is (10 sec: 5885.5, 60 sec: 5738.7, 300 sec: 5745.4). Total num frames: 241509376. Throughput: 0: 6013.0. Samples: 241516556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:37,413][25689] Avg episode reward: [(0, '-47.954')] [2022-07-09 11:49:37,807][26022] Updated weights on worker 0-0, policy_version 235852 (0.00083) [2022-07-09 11:49:39,671][26022] Updated weights on worker 0-0, policy_version 235862 (0.00083) [2022-07-09 11:49:41,516][26022] Updated weights on worker 0-0, policy_version 235872 (0.00090) [2022-07-09 11:49:42,431][25689] Fps is (10 sec: 5687.2, 60 sec: 5720.3, 300 sec: 5741.8). Total num frames: 241538048. Throughput: 0: 5175.3. Samples: 241534058. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:42,431][25689] Avg episode reward: [(0, '-47.233')] [2022-07-09 11:49:42,986][26022] Updated weights on worker 0-0, policy_version 235882 (0.00095) [2022-07-09 11:49:44,972][26022] Updated weights on worker 0-0, policy_version 235892 (0.00086) [2022-07-09 11:49:46,772][26022] Updated weights on worker 0-0, policy_version 235902 (0.00093) [2022-07-09 11:49:47,516][25689] Fps is (10 sec: 5675.4, 60 sec: 5730.4, 300 sec: 5740.3). Total num frames: 241566720. Throughput: 0: 6010.5. Samples: 241568786. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:47,517][25689] Avg episode reward: [(0, '-47.443')] [2022-07-09 11:49:48,543][26022] Updated weights on worker 0-0, policy_version 235912 (0.00083) [2022-07-09 11:49:50,364][26022] Updated weights on worker 0-0, policy_version 235922 (0.00088) [2022-07-09 11:49:52,040][26022] Updated weights on worker 0-0, policy_version 235932 (0.00092) [2022-07-09 11:49:52,536][25689] Fps is (10 sec: 5877.4, 60 sec: 5746.7, 300 sec: 5744.3). Total num frames: 241597440. Throughput: 0: 6000.4. Samples: 241603126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:52,537][25689] Avg episode reward: [(0, '-47.041')] [2022-07-09 11:49:54,033][26022] Updated weights on worker 0-0, policy_version 235942 (0.00085) [2022-07-09 11:49:55,689][26022] Updated weights on worker 0-0, policy_version 235952 (0.00101) [2022-07-09 11:49:57,308][26022] Updated weights on worker 0-0, policy_version 235962 (0.00083) [2022-07-09 11:49:57,687][25689] Fps is (10 sec: 5839.7, 60 sec: 5719.5, 300 sec: 5741.5). Total num frames: 241626112. Throughput: 0: 5122.9. Samples: 241620420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:49:57,687][25689] Avg episode reward: [(0, '-47.885')] [2022-07-09 11:49:59,345][26022] Updated weights on worker 0-0, policy_version 235972 (0.00084) [2022-07-09 11:50:00,924][26022] Updated weights on worker 0-0, policy_version 235982 (0.00089) [2022-07-09 11:50:02,761][25689] Fps is (10 sec: 5307.9, 60 sec: 5679.3, 300 sec: 5734.3). Total num frames: 241651712. Throughput: 0: 5943.7. Samples: 241654894. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:02,761][25689] Avg episode reward: [(0, '-48.611')] [2022-07-09 11:50:03,179][26022] Updated weights on worker 0-0, policy_version 235992 (0.00096) [2022-07-09 11:50:04,799][26022] Updated weights on worker 0-0, policy_version 236002 (0.00095) [2022-07-09 11:50:06,760][26022] Updated weights on worker 0-0, policy_version 236012 (0.00087) [2022-07-09 11:50:07,770][25689] Fps is (10 sec: 5686.8, 60 sec: 5748.2, 300 sec: 5745.0). Total num frames: 241683456. Throughput: 0: 5848.3. Samples: 241687238. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:07,771][25689] Avg episode reward: [(0, '-48.507')] [2022-07-09 11:50:08,919][26022] Updated weights on worker 0-0, policy_version 236022 (0.00094) [2022-07-09 11:50:10,179][26022] Updated weights on worker 0-0, policy_version 236032 (0.00095) [2022-07-09 11:50:12,245][26022] Updated weights on worker 0-0, policy_version 236042 (0.00084) [2022-07-09 11:50:12,809][25689] Fps is (10 sec: 5808.6, 60 sec: 5710.9, 300 sec: 5740.0). Total num frames: 241710080. Throughput: 0: 5003.5. Samples: 241704564. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:12,810][25689] Avg episode reward: [(0, '-47.982')] [2022-07-09 11:50:13,655][26022] Updated weights on worker 0-0, policy_version 236052 (0.00089) [2022-07-09 11:50:15,614][26022] Updated weights on worker 0-0, policy_version 236062 (0.00088) [2022-07-09 11:50:17,628][26022] Updated weights on worker 0-0, policy_version 236072 (0.00094) [2022-07-09 11:50:17,860][25689] Fps is (10 sec: 5581.9, 60 sec: 5712.5, 300 sec: 5739.1). Total num frames: 241739776. Throughput: 0: 5880.1. Samples: 241739042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:17,861][25689] Avg episode reward: [(0, '-48.617')] [2022-07-09 11:50:19,175][26022] Updated weights on worker 0-0, policy_version 236082 (0.00084) [2022-07-09 11:50:21,046][26022] Updated weights on worker 0-0, policy_version 236092 (0.00087) [2022-07-09 11:50:22,691][26022] Updated weights on worker 0-0, policy_version 236102 (0.00096) [2022-07-09 11:50:22,891][25689] Fps is (10 sec: 5789.3, 60 sec: 5712.5, 300 sec: 5738.5). Total num frames: 241768448. Throughput: 0: 5881.2. Samples: 241773286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:22,893][25689] Avg episode reward: [(0, '-48.838')] [2022-07-09 11:50:24,742][26022] Updated weights on worker 0-0, policy_version 236112 (0.00092) [2022-07-09 11:50:26,540][26022] Updated weights on worker 0-0, policy_version 236122 (0.00087) [2022-07-09 11:50:27,903][25689] Fps is (10 sec: 5608.1, 60 sec: 5713.1, 300 sec: 5738.9). Total num frames: 241796096. Throughput: 0: 5114.4. Samples: 241790204. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:27,903][25689] Avg episode reward: [(0, '-48.490')] [2022-07-09 11:50:28,495][26022] Updated weights on worker 0-0, policy_version 236132 (0.00087) [2022-07-09 11:50:30,016][26022] Updated weights on worker 0-0, policy_version 236142 (0.00086) [2022-07-09 11:50:32,061][26022] Updated weights on worker 0-0, policy_version 236152 (0.00095) [2022-07-09 11:50:32,931][25689] Fps is (10 sec: 5609.6, 60 sec: 5679.2, 300 sec: 5732.7). Total num frames: 241824768. Throughput: 0: 5930.5. Samples: 241823896. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:32,933][25689] Avg episode reward: [(0, '-47.844')] [2022-07-09 11:50:33,723][26022] Updated weights on worker 0-0, policy_version 236162 (0.00085) [2022-07-09 11:50:35,703][26022] Updated weights on worker 0-0, policy_version 236172 (0.00086) [2022-07-09 11:50:37,287][26022] Updated weights on worker 0-0, policy_version 236182 (0.00097) [2022-07-09 11:50:38,042][25689] Fps is (10 sec: 5756.4, 60 sec: 5691.7, 300 sec: 5735.2). Total num frames: 241854464. Throughput: 0: 5903.9. Samples: 241858196. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:38,043][25689] Avg episode reward: [(0, '-48.915')] [2022-07-09 11:50:39,088][26022] Updated weights on worker 0-0, policy_version 236192 (0.00092) [2022-07-09 11:50:40,859][26022] Updated weights on worker 0-0, policy_version 236202 (0.00092) [2022-07-09 11:50:42,594][26022] Updated weights on worker 0-0, policy_version 236212 (0.00085) [2022-07-09 11:50:43,127][25689] Fps is (10 sec: 5724.9, 60 sec: 5685.5, 300 sec: 5726.8). Total num frames: 241883136. Throughput: 0: 5907.2. Samples: 241892820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:43,127][25689] Avg episode reward: [(0, '-48.486')] [2022-07-09 11:50:44,506][26022] Updated weights on worker 0-0, policy_version 236222 (0.00085) [2022-07-09 11:50:46,131][26022] Updated weights on worker 0-0, policy_version 236232 (0.00096) [2022-07-09 11:50:47,980][26022] Updated weights on worker 0-0, policy_version 236242 (0.00087) [2022-07-09 11:50:48,166][25689] Fps is (10 sec: 5765.3, 60 sec: 5706.7, 300 sec: 5731.4). Total num frames: 241912832. Throughput: 0: 5913.6. Samples: 241910034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:48,167][25689] Avg episode reward: [(0, '-48.716')] [2022-07-09 11:50:49,931][26022] Updated weights on worker 0-0, policy_version 236252 (0.00084) [2022-07-09 11:50:51,490][26022] Updated weights on worker 0-0, policy_version 236262 (0.00087) [2022-07-09 11:50:53,109][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:50:53,121][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000236270_241940480.pth [2022-07-09 11:50:53,122][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000234254_239876096.pth [2022-07-09 11:50:53,241][25689] Fps is (10 sec: 5669.4, 60 sec: 5650.9, 300 sec: 5721.1). Total num frames: 241940480. Throughput: 0: 5953.4. Samples: 241944810. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:53,242][25689] Avg episode reward: [(0, '-48.888')] [2022-07-09 11:50:53,394][26022] Updated weights on worker 0-0, policy_version 236272 (0.00088) [2022-07-09 11:50:55,095][26022] Updated weights on worker 0-0, policy_version 236282 (0.00087) [2022-07-09 11:50:56,805][26022] Updated weights on worker 0-0, policy_version 236292 (0.00085) [2022-07-09 11:50:58,357][25689] Fps is (10 sec: 5627.3, 60 sec: 5671.1, 300 sec: 5729.6). Total num frames: 241970176. Throughput: 0: 5975.4. Samples: 241979582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 11:50:58,359][25689] Avg episode reward: [(0, '-49.091')] [2022-07-09 11:50:58,857][26022] Updated weights on worker 0-0, policy_version 236302 (0.00086) [2022-07-09 11:51:00,361][26022] Updated weights on worker 0-0, policy_version 236312 (0.00082) [2022-07-09 11:51:02,632][26022] Updated weights on worker 0-0, policy_version 236322 (0.00613) [2022-07-09 11:51:03,372][25689] Fps is (10 sec: 5761.3, 60 sec: 5727.2, 300 sec: 5733.4). Total num frames: 241998848. Throughput: 0: 5142.3. Samples: 241996930. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:03,373][25689] Avg episode reward: [(0, '-48.559')] [2022-07-09 11:51:04,230][26022] Updated weights on worker 0-0, policy_version 236332 (0.00086) [2022-07-09 11:51:06,022][26022] Updated weights on worker 0-0, policy_version 236342 (0.00084) [2022-07-09 11:51:07,989][26022] Updated weights on worker 0-0, policy_version 236352 (0.00087) [2022-07-09 11:51:08,430][25689] Fps is (10 sec: 5591.1, 60 sec: 5655.2, 300 sec: 5725.9). Total num frames: 242026496. Throughput: 0: 5895.7. Samples: 242029502. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:08,430][25689] Avg episode reward: [(0, '-48.048')] [2022-07-09 11:51:09,667][26022] Updated weights on worker 0-0, policy_version 236362 (0.00091) [2022-07-09 11:51:11,452][26022] Updated weights on worker 0-0, policy_version 236372 (0.00081) [2022-07-09 11:51:13,308][26022] Updated weights on worker 0-0, policy_version 236382 (0.00049) [2022-07-09 11:51:13,469][25689] Fps is (10 sec: 5679.5, 60 sec: 5705.8, 300 sec: 5729.9). Total num frames: 242056192. Throughput: 0: 5902.5. Samples: 242064204. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:13,470][25689] Avg episode reward: [(0, '-48.273')] [2022-07-09 11:51:15,030][26022] Updated weights on worker 0-0, policy_version 236392 (0.00084) [2022-07-09 11:51:16,767][26022] Updated weights on worker 0-0, policy_version 236402 (0.00078) [2022-07-09 11:51:18,534][25689] Fps is (10 sec: 5877.8, 60 sec: 5704.4, 300 sec: 5732.8). Total num frames: 242085888. Throughput: 0: 5051.8. Samples: 242081514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:18,535][25689] Avg episode reward: [(0, '-47.923')] [2022-07-09 11:51:18,536][26022] Updated weights on worker 0-0, policy_version 236412 (0.00095) [2022-07-09 11:51:20,363][26022] Updated weights on worker 0-0, policy_version 236422 (0.00085) [2022-07-09 11:51:22,012][26022] Updated weights on worker 0-0, policy_version 236432 (0.00086) [2022-07-09 11:51:23,549][25689] Fps is (10 sec: 5790.4, 60 sec: 5706.0, 300 sec: 5732.6). Total num frames: 242114560. Throughput: 0: 5916.1. Samples: 242116298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:23,550][25689] Avg episode reward: [(0, '-48.779')] [2022-07-09 11:51:24,041][26022] Updated weights on worker 0-0, policy_version 236442 (0.00096) [2022-07-09 11:51:25,636][26022] Updated weights on worker 0-0, policy_version 236452 (0.00088) [2022-07-09 11:51:27,390][26022] Updated weights on worker 0-0, policy_version 236462 (0.00092) [2022-07-09 11:51:28,551][25689] Fps is (10 sec: 5622.4, 60 sec: 5706.8, 300 sec: 5722.3). Total num frames: 242142208. Throughput: 0: 6018.0. Samples: 242150594. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:28,552][25689] Avg episode reward: [(0, '-48.742')] [2022-07-09 11:51:29,209][26022] Updated weights on worker 0-0, policy_version 236472 (0.00081) [2022-07-09 11:51:30,866][26022] Updated weights on worker 0-0, policy_version 236482 (0.00083) [2022-07-09 11:51:32,997][26022] Updated weights on worker 0-0, policy_version 236492 (0.00089) [2022-07-09 11:51:33,567][25689] Fps is (10 sec: 5724.6, 60 sec: 5725.0, 300 sec: 5730.3). Total num frames: 242171904. Throughput: 0: 5159.0. Samples: 242167886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:33,567][25689] Avg episode reward: [(0, '-49.029')] [2022-07-09 11:51:34,613][26022] Updated weights on worker 0-0, policy_version 236502 (0.00078) [2022-07-09 11:51:36,260][26022] Updated weights on worker 0-0, policy_version 236512 (0.00086) [2022-07-09 11:51:38,031][26022] Updated weights on worker 0-0, policy_version 236522 (0.00614) [2022-07-09 11:51:38,622][25689] Fps is (10 sec: 5796.2, 60 sec: 5713.4, 300 sec: 5722.8). Total num frames: 242200576. Throughput: 0: 6031.1. Samples: 242202662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:38,622][25689] Avg episode reward: [(0, '-49.346')] [2022-07-09 11:51:39,771][26022] Updated weights on worker 0-0, policy_version 236532 (0.00082) [2022-07-09 11:51:41,769][26022] Updated weights on worker 0-0, policy_version 236542 (0.00091) [2022-07-09 11:51:43,233][26022] Updated weights on worker 0-0, policy_version 236552 (0.00090) [2022-07-09 11:51:43,647][25689] Fps is (10 sec: 5790.4, 60 sec: 5735.9, 300 sec: 5733.8). Total num frames: 242230272. Throughput: 0: 6030.4. Samples: 242237494. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:43,647][25689] Avg episode reward: [(0, '-49.213')] [2022-07-09 11:51:45,236][26022] Updated weights on worker 0-0, policy_version 236562 (0.00085) [2022-07-09 11:51:46,954][26022] Updated weights on worker 0-0, policy_version 236572 (0.00082) [2022-07-09 11:51:48,650][25689] Fps is (10 sec: 5820.0, 60 sec: 5722.4, 300 sec: 5726.9). Total num frames: 242258944. Throughput: 0: 5196.0. Samples: 242255028. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:48,651][25689] Avg episode reward: [(0, '-48.559')] [2022-07-09 11:51:48,745][26022] Updated weights on worker 0-0, policy_version 236582 (0.00090) [2022-07-09 11:51:50,618][26022] Updated weights on worker 0-0, policy_version 236592 (0.00091) [2022-07-09 11:51:52,124][26022] Updated weights on worker 0-0, policy_version 236602 (0.00088) [2022-07-09 11:51:53,657][25689] Fps is (10 sec: 5830.8, 60 sec: 5762.7, 300 sec: 5728.3). Total num frames: 242288640. Throughput: 0: 6069.5. Samples: 242289826. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:53,657][25689] Avg episode reward: [(0, '-47.893')] [2022-07-09 11:51:54,023][26022] Updated weights on worker 0-0, policy_version 236612 (0.00089) [2022-07-09 11:51:55,736][26022] Updated weights on worker 0-0, policy_version 236622 (0.00090) [2022-07-09 11:51:57,613][26022] Updated weights on worker 0-0, policy_version 236632 (0.00088) [2022-07-09 11:51:58,725][25689] Fps is (10 sec: 5793.6, 60 sec: 5750.3, 300 sec: 5727.5). Total num frames: 242317312. Throughput: 0: 6050.6. Samples: 242324300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:51:58,725][25689] Avg episode reward: [(0, '-48.515')] [2022-07-09 11:51:59,391][26022] Updated weights on worker 0-0, policy_version 236642 (0.00093) [2022-07-09 11:52:00,938][26022] Updated weights on worker 0-0, policy_version 236652 (0.00085) [2022-07-09 11:52:03,326][26022] Updated weights on worker 0-0, policy_version 236662 (0.00083) [2022-07-09 11:52:03,728][25689] Fps is (10 sec: 5592.5, 60 sec: 5734.6, 300 sec: 5728.7). Total num frames: 242344960. Throughput: 0: 5185.4. Samples: 242341622. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:03,728][25689] Avg episode reward: [(0, '-48.892')] [2022-07-09 11:52:05,087][26022] Updated weights on worker 0-0, policy_version 236672 (0.00086) [2022-07-09 11:52:06,747][26022] Updated weights on worker 0-0, policy_version 236682 (0.00078) [2022-07-09 11:52:08,499][26022] Updated weights on worker 0-0, policy_version 236692 (0.00086) [2022-07-09 11:52:08,765][25689] Fps is (10 sec: 5507.5, 60 sec: 5736.5, 300 sec: 5724.9). Total num frames: 242372608. Throughput: 0: 5923.6. Samples: 242374180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:08,766][25689] Avg episode reward: [(0, '-48.890')] [2022-07-09 11:52:10,383][26022] Updated weights on worker 0-0, policy_version 236702 (0.00084) [2022-07-09 11:52:12,154][26022] Updated weights on worker 0-0, policy_version 236712 (0.00082) [2022-07-09 11:52:13,787][25689] Fps is (10 sec: 5700.8, 60 sec: 5738.2, 300 sec: 5723.1). Total num frames: 242402304. Throughput: 0: 5926.5. Samples: 242409124. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:13,788][25689] Avg episode reward: [(0, '-49.091')] [2022-07-09 11:52:13,796][26022] Updated weights on worker 0-0, policy_version 236722 (0.00086) [2022-07-09 11:52:15,788][26022] Updated weights on worker 0-0, policy_version 236732 (0.00085) [2022-07-09 11:52:17,334][26022] Updated weights on worker 0-0, policy_version 236742 (0.00087) [2022-07-09 11:52:18,867][25689] Fps is (10 sec: 5777.9, 60 sec: 5719.7, 300 sec: 5722.0). Total num frames: 242430976. Throughput: 0: 5076.6. Samples: 242426554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:18,869][25689] Avg episode reward: [(0, '-48.816')] [2022-07-09 11:52:19,369][26022] Updated weights on worker 0-0, policy_version 236752 (0.00091) [2022-07-09 11:52:20,855][26022] Updated weights on worker 0-0, policy_version 236762 (0.00107) [2022-07-09 11:52:22,639][26022] Updated weights on worker 0-0, policy_version 236772 (0.00083) [2022-07-09 11:52:23,891][25689] Fps is (10 sec: 5877.9, 60 sec: 5752.8, 300 sec: 5729.2). Total num frames: 242461696. Throughput: 0: 5941.7. Samples: 242461426. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:23,892][25689] Avg episode reward: [(0, '-49.231')] [2022-07-09 11:52:24,418][26022] Updated weights on worker 0-0, policy_version 236782 (0.00083) [2022-07-09 11:52:26,276][26022] Updated weights on worker 0-0, policy_version 236792 (0.00099) [2022-07-09 11:52:27,950][26022] Updated weights on worker 0-0, policy_version 236802 (0.00091) [2022-07-09 11:52:28,914][25689] Fps is (10 sec: 5911.3, 60 sec: 5767.8, 300 sec: 5728.9). Total num frames: 242490368. Throughput: 0: 6050.8. Samples: 242496100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:28,916][25689] Avg episode reward: [(0, '-48.723')] [2022-07-09 11:52:29,786][26022] Updated weights on worker 0-0, policy_version 236812 (0.00084) [2022-07-09 11:52:31,480][26022] Updated weights on worker 0-0, policy_version 236822 (0.00086) [2022-07-09 11:52:33,388][26022] Updated weights on worker 0-0, policy_version 236832 (0.00086) [2022-07-09 11:52:33,920][25689] Fps is (10 sec: 5717.8, 60 sec: 5751.7, 300 sec: 5727.4). Total num frames: 242519040. Throughput: 0: 5172.0. Samples: 242513254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:33,922][25689] Avg episode reward: [(0, '-47.743')] [2022-07-09 11:52:35,183][26022] Updated weights on worker 0-0, policy_version 236842 (0.00096) [2022-07-09 11:52:36,964][26022] Updated weights on worker 0-0, policy_version 236852 (0.00087) [2022-07-09 11:52:38,672][26022] Updated weights on worker 0-0, policy_version 236862 (0.00081) [2022-07-09 11:52:39,009][25689] Fps is (10 sec: 5782.2, 60 sec: 5765.4, 300 sec: 5729.2). Total num frames: 242548736. Throughput: 0: 6023.4. Samples: 242547876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:39,009][25689] Avg episode reward: [(0, '-46.846')] [2022-07-09 11:52:40,477][26022] Updated weights on worker 0-0, policy_version 236872 (0.00089) [2022-07-09 11:52:42,151][26022] Updated weights on worker 0-0, policy_version 236882 (0.00087) [2022-07-09 11:52:44,035][25689] Fps is (10 sec: 5669.4, 60 sec: 5731.5, 300 sec: 5725.5). Total num frames: 242576384. Throughput: 0: 6032.8. Samples: 242582948. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:44,036][25689] Avg episode reward: [(0, '-46.607')] [2022-07-09 11:52:44,095][26022] Updated weights on worker 0-0, policy_version 236892 (0.00086) [2022-07-09 11:52:45,785][26022] Updated weights on worker 0-0, policy_version 236902 (0.00088) [2022-07-09 11:52:47,521][26022] Updated weights on worker 0-0, policy_version 236912 (0.00091) [2022-07-09 11:52:49,040][25689] Fps is (10 sec: 5716.8, 60 sec: 5748.3, 300 sec: 5726.7). Total num frames: 242606080. Throughput: 0: 5174.6. Samples: 242600240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:49,040][25689] Avg episode reward: [(0, '-47.239')] [2022-07-09 11:52:49,145][26022] Updated weights on worker 0-0, policy_version 236922 (0.00086) [2022-07-09 11:52:51,036][26022] Updated weights on worker 0-0, policy_version 236932 (0.00083) [2022-07-09 11:52:52,963][26022] Updated weights on worker 0-0, policy_version 236942 (0.00097) [2022-07-09 11:52:53,238][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:52:53,251][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000236944_242630656.pth [2022-07-09 11:52:53,252][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000234928_240566272.pth [2022-07-09 11:52:54,044][25689] Fps is (10 sec: 5933.9, 60 sec: 5748.6, 300 sec: 5729.5). Total num frames: 242635776. Throughput: 0: 6064.7. Samples: 242635298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:54,044][25689] Avg episode reward: [(0, '-46.048')] [2022-07-09 11:52:54,615][26022] Updated weights on worker 0-0, policy_version 236952 (0.00055) [2022-07-09 11:52:56,475][26022] Updated weights on worker 0-0, policy_version 236962 (0.00089) [2022-07-09 11:52:58,311][26022] Updated weights on worker 0-0, policy_version 236972 (0.00082) [2022-07-09 11:52:59,173][25689] Fps is (10 sec: 5759.9, 60 sec: 5742.7, 300 sec: 5738.3). Total num frames: 242664448. Throughput: 0: 6054.2. Samples: 242669954. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:52:59,174][25689] Avg episode reward: [(0, '-46.959')] [2022-07-09 11:52:59,799][26022] Updated weights on worker 0-0, policy_version 236982 (0.00086) [2022-07-09 11:53:01,771][26022] Updated weights on worker 0-0, policy_version 236992 (0.00053) [2022-07-09 11:53:03,637][26022] Updated weights on worker 0-0, policy_version 237002 (0.00082) [2022-07-09 11:53:04,213][25689] Fps is (10 sec: 5638.6, 60 sec: 5756.1, 300 sec: 5735.0). Total num frames: 242693120. Throughput: 0: 5168.2. Samples: 242687236. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:53:04,214][25689] Avg episode reward: [(0, '-46.922')] [2022-07-09 11:53:05,814][26022] Updated weights on worker 0-0, policy_version 237012 (0.00083) [2022-07-09 11:53:07,319][26022] Updated weights on worker 0-0, policy_version 237022 (0.00079) [2022-07-09 11:53:09,235][25689] Fps is (10 sec: 5495.7, 60 sec: 5740.7, 300 sec: 5725.0). Total num frames: 242719744. Throughput: 0: 5927.7. Samples: 242719950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:53:09,236][25689] Avg episode reward: [(0, '-48.137')] [2022-07-09 11:53:09,239][26022] Updated weights on worker 0-0, policy_version 237032 (0.00077) [2022-07-09 11:53:10,738][26022] Updated weights on worker 0-0, policy_version 237042 (0.00083) [2022-07-09 11:53:12,658][26022] Updated weights on worker 0-0, policy_version 237052 (0.00087) [2022-07-09 11:53:14,274][25689] Fps is (10 sec: 5699.8, 60 sec: 5755.9, 300 sec: 5728.5). Total num frames: 242750464. Throughput: 0: 5910.4. Samples: 242754870. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 11:53:14,275][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 11:53:14,484][26022] Updated weights on worker 0-0, policy_version 237062 (0.00083) [2022-07-09 11:53:16,196][26022] Updated weights on worker 0-0, policy_version 237072 (0.00088) [2022-07-09 11:53:18,008][26022] Updated weights on worker 0-0, policy_version 237082 (0.00086) [2022-07-09 11:53:19,392][25689] Fps is (10 sec: 5847.1, 60 sec: 5752.3, 300 sec: 5729.8). Total num frames: 242779136. Throughput: 0: 5910.2. Samples: 242789454. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:19,393][25689] Avg episode reward: [(0, '-48.549')] [2022-07-09 11:53:19,816][26022] Updated weights on worker 0-0, policy_version 237092 (0.00090) [2022-07-09 11:53:21,557][26022] Updated weights on worker 0-0, policy_version 237102 (0.00091) [2022-07-09 11:53:23,346][26022] Updated weights on worker 0-0, policy_version 237112 (0.00082) [2022-07-09 11:53:24,493][25689] Fps is (10 sec: 5812.0, 60 sec: 5745.0, 300 sec: 5731.5). Total num frames: 242809856. Throughput: 0: 5892.1. Samples: 242806726. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:24,494][25689] Avg episode reward: [(0, '-47.854')] [2022-07-09 11:53:24,948][26022] Updated weights on worker 0-0, policy_version 237122 (0.00087) [2022-07-09 11:53:26,765][26022] Updated weights on worker 0-0, policy_version 237132 (0.00085) [2022-07-09 11:53:28,492][26022] Updated weights on worker 0-0, policy_version 237142 (0.00083) [2022-07-09 11:53:29,528][25689] Fps is (10 sec: 5758.6, 60 sec: 5727.0, 300 sec: 5724.2). Total num frames: 242837504. Throughput: 0: 5983.5. Samples: 242841374. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:29,529][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 11:53:30,450][26022] Updated weights on worker 0-0, policy_version 237152 (0.00094) [2022-07-09 11:53:32,098][26022] Updated weights on worker 0-0, policy_version 237162 (0.00091) [2022-07-09 11:53:33,766][26022] Updated weights on worker 0-0, policy_version 237172 (0.00086) [2022-07-09 11:53:34,544][25689] Fps is (10 sec: 5705.3, 60 sec: 5742.9, 300 sec: 5728.6). Total num frames: 242867200. Throughput: 0: 5979.1. Samples: 242876066. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:34,545][25689] Avg episode reward: [(0, '-47.984')] [2022-07-09 11:53:35,670][26022] Updated weights on worker 0-0, policy_version 237182 (0.00094) [2022-07-09 11:53:37,346][26022] Updated weights on worker 0-0, policy_version 237192 (0.00094) [2022-07-09 11:53:39,352][26022] Updated weights on worker 0-0, policy_version 237202 (0.00084) [2022-07-09 11:53:39,597][25689] Fps is (10 sec: 5797.0, 60 sec: 5729.5, 300 sec: 5724.2). Total num frames: 242895872. Throughput: 0: 5137.9. Samples: 242893262. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:39,597][25689] Avg episode reward: [(0, '-47.265')] [2022-07-09 11:53:41,118][26022] Updated weights on worker 0-0, policy_version 237212 (0.00095) [2022-07-09 11:53:42,887][26022] Updated weights on worker 0-0, policy_version 237222 (0.00099) [2022-07-09 11:53:44,511][26022] Updated weights on worker 0-0, policy_version 237232 (0.00086) [2022-07-09 11:53:44,602][25689] Fps is (10 sec: 5803.3, 60 sec: 5765.2, 300 sec: 5731.3). Total num frames: 242925568. Throughput: 0: 6025.2. Samples: 242927884. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:44,602][25689] Avg episode reward: [(0, '-46.729')] [2022-07-09 11:53:46,504][26022] Updated weights on worker 0-0, policy_version 237242 (0.00066) [2022-07-09 11:53:48,102][26022] Updated weights on worker 0-0, policy_version 237252 (0.00084) [2022-07-09 11:53:49,683][25689] Fps is (10 sec: 5786.7, 60 sec: 5741.1, 300 sec: 5726.5). Total num frames: 242954240. Throughput: 0: 6030.6. Samples: 242962922. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:49,684][25689] Avg episode reward: [(0, '-47.098')] [2022-07-09 11:53:49,908][26022] Updated weights on worker 0-0, policy_version 237262 (0.00085) [2022-07-09 11:53:51,742][26022] Updated weights on worker 0-0, policy_version 237272 (0.00084) [2022-07-09 11:53:53,431][26022] Updated weights on worker 0-0, policy_version 237282 (0.00089) [2022-07-09 11:53:54,710][25689] Fps is (10 sec: 5774.3, 60 sec: 5738.9, 300 sec: 5726.8). Total num frames: 242983936. Throughput: 0: 5161.6. Samples: 242980154. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:54,712][25689] Avg episode reward: [(0, '-47.197')] [2022-07-09 11:53:55,379][26022] Updated weights on worker 0-0, policy_version 237292 (0.00098) [2022-07-09 11:53:57,330][26022] Updated weights on worker 0-0, policy_version 237302 (0.00093) [2022-07-09 11:53:59,022][26022] Updated weights on worker 0-0, policy_version 237312 (0.00089) [2022-07-09 11:53:59,775][25689] Fps is (10 sec: 5783.9, 60 sec: 5745.1, 300 sec: 5729.1). Total num frames: 243012608. Throughput: 0: 5999.2. Samples: 243014314. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:53:59,775][25689] Avg episode reward: [(0, '-47.442')] [2022-07-09 11:54:00,896][26022] Updated weights on worker 0-0, policy_version 237322 (0.00086) [2022-07-09 11:54:02,771][26022] Updated weights on worker 0-0, policy_version 237332 (0.00086) [2022-07-09 11:54:04,671][26022] Updated weights on worker 0-0, policy_version 237342 (0.00086) [2022-07-09 11:54:04,867][25689] Fps is (10 sec: 5443.9, 60 sec: 5706.3, 300 sec: 5724.3). Total num frames: 243039232. Throughput: 0: 5867.0. Samples: 243046784. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:04,868][25689] Avg episode reward: [(0, '-46.980')] [2022-07-09 11:54:06,459][26022] Updated weights on worker 0-0, policy_version 237352 (0.00088) [2022-07-09 11:54:08,040][26022] Updated weights on worker 0-0, policy_version 237362 (0.00090) [2022-07-09 11:54:09,902][25689] Fps is (10 sec: 5460.3, 60 sec: 5738.9, 300 sec: 5723.7). Total num frames: 243067904. Throughput: 0: 5001.7. Samples: 243064048. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:09,902][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 11:54:10,111][26022] Updated weights on worker 0-0, policy_version 237372 (0.00102) [2022-07-09 11:54:11,624][26022] Updated weights on worker 0-0, policy_version 237382 (0.00093) [2022-07-09 11:54:13,392][26022] Updated weights on worker 0-0, policy_version 237392 (0.00083) [2022-07-09 11:54:14,961][25689] Fps is (10 sec: 5782.6, 60 sec: 5720.1, 300 sec: 5723.9). Total num frames: 243097600. Throughput: 0: 5864.2. Samples: 243098912. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:14,962][25689] Avg episode reward: [(0, '-48.254')] [2022-07-09 11:54:15,283][26022] Updated weights on worker 0-0, policy_version 237402 (0.00084) [2022-07-09 11:54:16,937][26022] Updated weights on worker 0-0, policy_version 237412 (0.00086) [2022-07-09 11:54:18,889][26022] Updated weights on worker 0-0, policy_version 237422 (0.00086) [2022-07-09 11:54:20,039][25689] Fps is (10 sec: 5858.8, 60 sec: 5740.8, 300 sec: 5726.4). Total num frames: 243127296. Throughput: 0: 5884.1. Samples: 243133550. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:20,039][25689] Avg episode reward: [(0, '-48.362')] [2022-07-09 11:54:20,437][26022] Updated weights on worker 0-0, policy_version 237432 (0.00084) [2022-07-09 11:54:22,432][26022] Updated weights on worker 0-0, policy_version 237442 (0.00093) [2022-07-09 11:54:24,154][26022] Updated weights on worker 0-0, policy_version 237452 (0.00086) [2022-07-09 11:54:25,097][25689] Fps is (10 sec: 5758.5, 60 sec: 5711.0, 300 sec: 5729.1). Total num frames: 243155968. Throughput: 0: 5144.2. Samples: 243150850. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:25,098][25689] Avg episode reward: [(0, '-48.028')] [2022-07-09 11:54:26,011][26022] Updated weights on worker 0-0, policy_version 237462 (0.00097) [2022-07-09 11:54:27,661][26022] Updated weights on worker 0-0, policy_version 237472 (0.00088) [2022-07-09 11:54:29,518][26022] Updated weights on worker 0-0, policy_version 237482 (0.00089) [2022-07-09 11:54:30,175][25689] Fps is (10 sec: 5556.1, 60 sec: 5707.0, 300 sec: 5717.8). Total num frames: 243183616. Throughput: 0: 5965.1. Samples: 243184982. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:30,176][25689] Avg episode reward: [(0, '-48.436')] [2022-07-09 11:54:31,279][26022] Updated weights on worker 0-0, policy_version 237492 (0.00090) [2022-07-09 11:54:33,280][26022] Updated weights on worker 0-0, policy_version 237502 (0.00094) [2022-07-09 11:54:34,799][26022] Updated weights on worker 0-0, policy_version 237512 (0.00089) [2022-07-09 11:54:35,192][25689] Fps is (10 sec: 5781.9, 60 sec: 5723.8, 300 sec: 5725.6). Total num frames: 243214336. Throughput: 0: 5968.8. Samples: 243219666. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:35,193][25689] Avg episode reward: [(0, '-47.759')] [2022-07-09 11:54:36,449][26022] Updated weights on worker 0-0, policy_version 237522 (0.00095) [2022-07-09 11:54:38,404][26022] Updated weights on worker 0-0, policy_version 237532 (0.00089) [2022-07-09 11:54:40,319][25689] Fps is (10 sec: 5754.0, 60 sec: 5699.9, 300 sec: 5720.0). Total num frames: 243241984. Throughput: 0: 5098.1. Samples: 243236942. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:40,320][25689] Avg episode reward: [(0, '-46.854')] [2022-07-09 11:54:40,415][26022] Updated weights on worker 0-0, policy_version 237542 (0.00086) [2022-07-09 11:54:41,699][26022] Updated weights on worker 0-0, policy_version 237552 (0.00088) [2022-07-09 11:54:43,812][26022] Updated weights on worker 0-0, policy_version 237562 (0.00091) [2022-07-09 11:54:45,322][26022] Updated weights on worker 0-0, policy_version 237572 (0.00090) [2022-07-09 11:54:45,357][25689] Fps is (10 sec: 5842.7, 60 sec: 5730.5, 300 sec: 5731.3). Total num frames: 243273728. Throughput: 0: 5967.6. Samples: 243271754. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:45,358][25689] Avg episode reward: [(0, '-45.870')] [2022-07-09 11:54:47,335][26022] Updated weights on worker 0-0, policy_version 237582 (0.00091) [2022-07-09 11:54:49,099][26022] Updated weights on worker 0-0, policy_version 237592 (0.00083) [2022-07-09 11:54:50,378][25689] Fps is (10 sec: 5904.6, 60 sec: 5719.4, 300 sec: 5720.9). Total num frames: 243301376. Throughput: 0: 6012.2. Samples: 243306442. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:50,378][25689] Avg episode reward: [(0, '-45.546')] [2022-07-09 11:54:50,792][26022] Updated weights on worker 0-0, policy_version 237602 (0.00093) [2022-07-09 11:54:52,679][26022] Updated weights on worker 0-0, policy_version 237612 (0.00087) [2022-07-09 11:54:53,270][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:54:53,285][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000237616_243318784.pth [2022-07-09 11:54:53,286][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000235602_241256448.pth [2022-07-09 11:54:54,287][26022] Updated weights on worker 0-0, policy_version 237622 (0.00087) [2022-07-09 11:54:55,392][25689] Fps is (10 sec: 5714.7, 60 sec: 5720.6, 300 sec: 5727.0). Total num frames: 243331072. Throughput: 0: 5151.6. Samples: 243323728. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:54:55,392][25689] Avg episode reward: [(0, '-45.583')] [2022-07-09 11:54:56,048][26022] Updated weights on worker 0-0, policy_version 237632 (0.00085) [2022-07-09 11:54:58,033][26022] Updated weights on worker 0-0, policy_version 237642 (0.00088) [2022-07-09 11:54:59,692][26022] Updated weights on worker 0-0, policy_version 237652 (0.00086) [2022-07-09 11:55:00,522][25689] Fps is (10 sec: 5753.5, 60 sec: 5714.4, 300 sec: 5736.2). Total num frames: 243359744. Throughput: 0: 6005.8. Samples: 243358278. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:00,523][25689] Avg episode reward: [(0, '-45.555')] [2022-07-09 11:55:01,650][26022] Updated weights on worker 0-0, policy_version 237662 (0.00086) [2022-07-09 11:55:03,458][26022] Updated weights on worker 0-0, policy_version 237672 (0.00085) [2022-07-09 11:55:05,506][26022] Updated weights on worker 0-0, policy_version 237682 (0.00092) [2022-07-09 11:55:05,597][25689] Fps is (10 sec: 5418.4, 60 sec: 5716.1, 300 sec: 5717.8). Total num frames: 243386368. Throughput: 0: 5894.3. Samples: 243391052. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:05,598][25689] Avg episode reward: [(0, '-45.407')] [2022-07-09 11:55:06,884][26022] Updated weights on worker 0-0, policy_version 237692 (0.00093) [2022-07-09 11:55:08,980][26022] Updated weights on worker 0-0, policy_version 237702 (0.00084) [2022-07-09 11:55:10,511][26022] Updated weights on worker 0-0, policy_version 237712 (0.00084) [2022-07-09 11:55:10,600][25689] Fps is (10 sec: 5690.5, 60 sec: 5752.8, 300 sec: 5732.3). Total num frames: 243417088. Throughput: 0: 5923.6. Samples: 243426228. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:10,600][25689] Avg episode reward: [(0, '-44.828')] [2022-07-09 11:55:12,514][26022] Updated weights on worker 0-0, policy_version 237722 (0.00088) [2022-07-09 11:55:13,938][26022] Updated weights on worker 0-0, policy_version 237732 (0.00085) [2022-07-09 11:55:15,654][25689] Fps is (10 sec: 5803.6, 60 sec: 5719.5, 300 sec: 5725.3). Total num frames: 243444736. Throughput: 0: 5923.8. Samples: 243443758. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:15,655][25689] Avg episode reward: [(0, '-45.415')] [2022-07-09 11:55:15,983][26022] Updated weights on worker 0-0, policy_version 237742 (0.00079) [2022-07-09 11:55:17,613][26022] Updated weights on worker 0-0, policy_version 237752 (0.00086) [2022-07-09 11:55:19,487][26022] Updated weights on worker 0-0, policy_version 237762 (0.00087) [2022-07-09 11:55:20,770][25689] Fps is (10 sec: 5738.8, 60 sec: 5732.8, 300 sec: 5730.5). Total num frames: 243475456. Throughput: 0: 5948.3. Samples: 243478718. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:20,771][25689] Avg episode reward: [(0, '-45.763')] [2022-07-09 11:55:21,262][26022] Updated weights on worker 0-0, policy_version 237772 (0.00081) [2022-07-09 11:55:23,046][26022] Updated weights on worker 0-0, policy_version 237782 (0.00086) [2022-07-09 11:55:24,753][26022] Updated weights on worker 0-0, policy_version 237792 (0.00088) [2022-07-09 11:55:25,818][25689] Fps is (10 sec: 5944.5, 60 sec: 5750.7, 300 sec: 5736.7). Total num frames: 243505152. Throughput: 0: 6051.4. Samples: 243513414. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:25,818][25689] Avg episode reward: [(0, '-45.813')] [2022-07-09 11:55:26,575][26022] Updated weights on worker 0-0, policy_version 237802 (0.00498) [2022-07-09 11:55:28,258][26022] Updated weights on worker 0-0, policy_version 237812 (0.00095) [2022-07-09 11:55:30,007][26022] Updated weights on worker 0-0, policy_version 237822 (0.00091) [2022-07-09 11:55:30,840][25689] Fps is (10 sec: 5694.6, 60 sec: 5755.9, 300 sec: 5733.4). Total num frames: 243532800. Throughput: 0: 5148.4. Samples: 243530436. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:30,841][25689] Avg episode reward: [(0, '-46.592')] [2022-07-09 11:55:31,809][26022] Updated weights on worker 0-0, policy_version 237832 (0.00085) [2022-07-09 11:55:33,799][26022] Updated weights on worker 0-0, policy_version 237842 (0.00086) [2022-07-09 11:55:35,440][26022] Updated weights on worker 0-0, policy_version 237852 (0.00604) [2022-07-09 11:55:35,848][25689] Fps is (10 sec: 5819.1, 60 sec: 5756.8, 300 sec: 5738.8). Total num frames: 243563520. Throughput: 0: 6016.0. Samples: 243565244. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-09 11:55:35,849][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 11:55:37,180][26022] Updated weights on worker 0-0, policy_version 237862 (0.00097) [2022-07-09 11:55:39,017][26022] Updated weights on worker 0-0, policy_version 237872 (0.00083) [2022-07-09 11:55:40,690][26022] Updated weights on worker 0-0, policy_version 237882 (0.00096) [2022-07-09 11:55:40,975][25689] Fps is (10 sec: 5860.4, 60 sec: 5773.7, 300 sec: 5738.0). Total num frames: 243592192. Throughput: 0: 5987.8. Samples: 243599698. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:55:40,975][25689] Avg episode reward: [(0, '-48.271')] [2022-07-09 11:55:42,479][26022] Updated weights on worker 0-0, policy_version 237892 (0.00096) [2022-07-09 11:55:44,227][26022] Updated weights on worker 0-0, policy_version 237902 (0.00085) [2022-07-09 11:55:45,983][25689] Fps is (10 sec: 5657.9, 60 sec: 5725.9, 300 sec: 5735.2). Total num frames: 243620864. Throughput: 0: 5144.2. Samples: 243617150. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:55:45,984][25689] Avg episode reward: [(0, '-48.657')] [2022-07-09 11:55:46,129][26022] Updated weights on worker 0-0, policy_version 237912 (0.00094) [2022-07-09 11:55:47,636][26022] Updated weights on worker 0-0, policy_version 237922 (0.00089) [2022-07-09 11:55:49,525][26022] Updated weights on worker 0-0, policy_version 237932 (0.00088) [2022-07-09 11:55:50,995][25689] Fps is (10 sec: 5824.9, 60 sec: 5760.5, 300 sec: 5743.3). Total num frames: 243650560. Throughput: 0: 6049.3. Samples: 243652360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:55:50,996][25689] Avg episode reward: [(0, '-48.946')] [2022-07-09 11:55:51,395][26022] Updated weights on worker 0-0, policy_version 237942 (0.00084) [2022-07-09 11:55:53,088][26022] Updated weights on worker 0-0, policy_version 237952 (0.00089) [2022-07-09 11:55:55,046][26022] Updated weights on worker 0-0, policy_version 237962 (0.00085) [2022-07-09 11:55:55,999][25689] Fps is (10 sec: 5725.4, 60 sec: 5727.7, 300 sec: 5738.5). Total num frames: 243678208. Throughput: 0: 6040.4. Samples: 243686962. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:55:56,000][25689] Avg episode reward: [(0, '-49.270')] [2022-07-09 11:55:56,535][26022] Updated weights on worker 0-0, policy_version 237972 (0.00087) [2022-07-09 11:55:58,374][26022] Updated weights on worker 0-0, policy_version 237982 (0.00096) [2022-07-09 11:56:00,323][26022] Updated weights on worker 0-0, policy_version 237992 (0.00084) [2022-07-09 11:56:01,053][25689] Fps is (10 sec: 5803.2, 60 sec: 5768.7, 300 sec: 5744.7). Total num frames: 243708928. Throughput: 0: 5199.5. Samples: 243704096. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:01,054][25689] Avg episode reward: [(0, '-50.194')] [2022-07-09 11:56:02,403][26022] Updated weights on worker 0-0, policy_version 238002 (0.00088) [2022-07-09 11:56:04,341][26022] Updated weights on worker 0-0, policy_version 238012 (0.00096) [2022-07-09 11:56:05,791][26022] Updated weights on worker 0-0, policy_version 238022 (0.00084) [2022-07-09 11:56:06,106][25689] Fps is (10 sec: 5673.6, 60 sec: 5770.8, 300 sec: 5741.3). Total num frames: 243735552. Throughput: 0: 5930.9. Samples: 243736496. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:06,107][25689] Avg episode reward: [(0, '-50.452')] [2022-07-09 11:56:07,918][26022] Updated weights on worker 0-0, policy_version 238032 (0.00093) [2022-07-09 11:56:09,511][26022] Updated weights on worker 0-0, policy_version 238042 (0.00089) [2022-07-09 11:56:11,147][25689] Fps is (10 sec: 5478.0, 60 sec: 5733.3, 300 sec: 5737.8). Total num frames: 243764224. Throughput: 0: 5908.6. Samples: 243771432. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:11,148][25689] Avg episode reward: [(0, '-50.205')] [2022-07-09 11:56:11,382][26022] Updated weights on worker 0-0, policy_version 238052 (0.00088) [2022-07-09 11:56:12,859][26022] Updated weights on worker 0-0, policy_version 238062 (0.00083) [2022-07-09 11:56:14,727][26022] Updated weights on worker 0-0, policy_version 238072 (0.00086) [2022-07-09 11:56:16,222][25689] Fps is (10 sec: 5770.1, 60 sec: 5765.2, 300 sec: 5737.7). Total num frames: 243793920. Throughput: 0: 5035.8. Samples: 243788800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:16,222][25689] Avg episode reward: [(0, '-50.262')] [2022-07-09 11:56:16,497][26022] Updated weights on worker 0-0, policy_version 238082 (0.00084) [2022-07-09 11:56:18,315][26022] Updated weights on worker 0-0, policy_version 238092 (0.00089) [2022-07-09 11:56:20,153][26022] Updated weights on worker 0-0, policy_version 238102 (0.00091) [2022-07-09 11:56:21,274][25689] Fps is (10 sec: 5864.8, 60 sec: 5754.3, 300 sec: 5740.4). Total num frames: 243823616. Throughput: 0: 5908.9. Samples: 243823580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:21,275][25689] Avg episode reward: [(0, '-50.349')] [2022-07-09 11:56:21,835][26022] Updated weights on worker 0-0, policy_version 238112 (0.00090) [2022-07-09 11:56:23,620][26022] Updated weights on worker 0-0, policy_version 238122 (0.00085) [2022-07-09 11:56:25,531][26022] Updated weights on worker 0-0, policy_version 238132 (0.00081) [2022-07-09 11:56:26,276][25689] Fps is (10 sec: 5805.0, 60 sec: 5741.7, 300 sec: 5743.8). Total num frames: 243852288. Throughput: 0: 6061.3. Samples: 243858754. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:26,278][25689] Avg episode reward: [(0, '-49.270')] [2022-07-09 11:56:26,906][26022] Updated weights on worker 0-0, policy_version 238142 (0.00085) [2022-07-09 11:56:29,018][26022] Updated weights on worker 0-0, policy_version 238152 (0.00085) [2022-07-09 11:56:30,608][26022] Updated weights on worker 0-0, policy_version 238162 (0.00085) [2022-07-09 11:56:31,284][25689] Fps is (10 sec: 5626.5, 60 sec: 5743.1, 300 sec: 5737.1). Total num frames: 243879936. Throughput: 0: 5206.1. Samples: 243876266. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:31,284][25689] Avg episode reward: [(0, '-48.880')] [2022-07-09 11:56:32,429][26022] Updated weights on worker 0-0, policy_version 238172 (0.00082) [2022-07-09 11:56:34,212][26022] Updated weights on worker 0-0, policy_version 238182 (0.00091) [2022-07-09 11:56:35,807][26022] Updated weights on worker 0-0, policy_version 238192 (0.00094) [2022-07-09 11:56:36,303][25689] Fps is (10 sec: 5821.5, 60 sec: 5742.1, 300 sec: 5744.7). Total num frames: 243910656. Throughput: 0: 6088.5. Samples: 243911064. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:36,303][25689] Avg episode reward: [(0, '-48.549')] [2022-07-09 11:56:37,783][26022] Updated weights on worker 0-0, policy_version 238202 (0.00389) [2022-07-09 11:56:39,490][26022] Updated weights on worker 0-0, policy_version 238212 (0.00086) [2022-07-09 11:56:41,169][26022] Updated weights on worker 0-0, policy_version 238222 (0.00088) [2022-07-09 11:56:41,404][25689] Fps is (10 sec: 6071.1, 60 sec: 5778.4, 300 sec: 5746.6). Total num frames: 243941376. Throughput: 0: 6072.3. Samples: 243945814. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:41,404][25689] Avg episode reward: [(0, '-48.405')] [2022-07-09 11:56:43,250][26022] Updated weights on worker 0-0, policy_version 238232 (0.00082) [2022-07-09 11:56:44,528][26022] Updated weights on worker 0-0, policy_version 238242 (0.00083) [2022-07-09 11:56:46,449][25689] Fps is (10 sec: 5752.6, 60 sec: 5758.0, 300 sec: 5742.4). Total num frames: 243969024. Throughput: 0: 5189.8. Samples: 243963446. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:46,450][25689] Avg episode reward: [(0, '-47.384')] [2022-07-09 11:56:46,605][26022] Updated weights on worker 0-0, policy_version 238252 (0.00082) [2022-07-09 11:56:48,077][26022] Updated weights on worker 0-0, policy_version 238262 (0.00087) [2022-07-09 11:56:50,099][26022] Updated weights on worker 0-0, policy_version 238272 (0.00086) [2022-07-09 11:56:51,473][25689] Fps is (10 sec: 5694.9, 60 sec: 5756.8, 300 sec: 5742.1). Total num frames: 243998720. Throughput: 0: 6059.4. Samples: 243998602. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:51,474][25689] Avg episode reward: [(0, '-46.871')] [2022-07-09 11:56:51,658][26022] Updated weights on worker 0-0, policy_version 238282 (0.00088) [2022-07-09 11:56:53,435][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:56:53,448][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000238291_244009984.pth [2022-07-09 11:56:53,448][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000236270_241940480.pth [2022-07-09 11:56:53,623][26022] Updated weights on worker 0-0, policy_version 238292 (0.00085) [2022-07-09 11:56:55,183][26022] Updated weights on worker 0-0, policy_version 238302 (0.00084) [2022-07-09 11:56:56,508][25689] Fps is (10 sec: 5802.9, 60 sec: 5770.8, 300 sec: 5742.7). Total num frames: 244027392. Throughput: 0: 6069.3. Samples: 244033692. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:56:56,508][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 11:56:57,323][26022] Updated weights on worker 0-0, policy_version 238312 (0.00087) [2022-07-09 11:56:58,654][26022] Updated weights on worker 0-0, policy_version 238322 (0.00083) [2022-07-09 11:57:00,714][26022] Updated weights on worker 0-0, policy_version 238332 (0.00082) [2022-07-09 11:57:01,577][25689] Fps is (10 sec: 5777.1, 60 sec: 5752.5, 300 sec: 5748.3). Total num frames: 244057088. Throughput: 0: 5216.1. Samples: 244051038. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:01,577][25689] Avg episode reward: [(0, '-47.255')] [2022-07-09 11:57:02,717][26022] Updated weights on worker 0-0, policy_version 238342 (0.00085) [2022-07-09 11:57:04,442][26022] Updated weights on worker 0-0, policy_version 238352 (0.00091) [2022-07-09 11:57:06,179][26022] Updated weights on worker 0-0, policy_version 238362 (0.00083) [2022-07-09 11:57:06,583][25689] Fps is (10 sec: 5691.6, 60 sec: 5773.9, 300 sec: 5748.9). Total num frames: 244084736. Throughput: 0: 5971.9. Samples: 244083682. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:06,583][25689] Avg episode reward: [(0, '-47.655')] [2022-07-09 11:57:07,819][26022] Updated weights on worker 0-0, policy_version 238372 (0.00088) [2022-07-09 11:57:09,817][26022] Updated weights on worker 0-0, policy_version 238382 (0.00081) [2022-07-09 11:57:11,458][26022] Updated weights on worker 0-0, policy_version 238392 (0.00087) [2022-07-09 11:57:11,619][25689] Fps is (10 sec: 5608.3, 60 sec: 5774.3, 300 sec: 5745.2). Total num frames: 244113408. Throughput: 0: 5961.5. Samples: 244118700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:11,620][25689] Avg episode reward: [(0, '-48.517')] [2022-07-09 11:57:13,316][26022] Updated weights on worker 0-0, policy_version 238402 (0.00091) [2022-07-09 11:57:15,156][26022] Updated weights on worker 0-0, policy_version 238412 (0.00085) [2022-07-09 11:57:16,645][25689] Fps is (10 sec: 5698.8, 60 sec: 5762.0, 300 sec: 5746.2). Total num frames: 244142080. Throughput: 0: 5081.1. Samples: 244136014. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:16,646][25689] Avg episode reward: [(0, '-48.455')] [2022-07-09 11:57:16,783][26022] Updated weights on worker 0-0, policy_version 238422 (0.00087) [2022-07-09 11:57:18,489][26022] Updated weights on worker 0-0, policy_version 238432 (0.00088) [2022-07-09 11:57:20,427][26022] Updated weights on worker 0-0, policy_version 238442 (0.00095) [2022-07-09 11:57:21,768][25689] Fps is (10 sec: 5851.9, 60 sec: 5772.2, 300 sec: 5744.3). Total num frames: 244172800. Throughput: 0: 5930.3. Samples: 244170780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:21,769][25689] Avg episode reward: [(0, '-48.324')] [2022-07-09 11:57:22,363][26022] Updated weights on worker 0-0, policy_version 238452 (0.00084) [2022-07-09 11:57:23,899][26022] Updated weights on worker 0-0, policy_version 238462 (0.00081) [2022-07-09 11:57:25,741][26022] Updated weights on worker 0-0, policy_version 238472 (0.00092) [2022-07-09 11:57:26,821][25689] Fps is (10 sec: 5735.9, 60 sec: 5750.4, 300 sec: 5740.3). Total num frames: 244200448. Throughput: 0: 6013.2. Samples: 244205380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:26,822][25689] Avg episode reward: [(0, '-48.599')] [2022-07-09 11:57:27,277][26022] Updated weights on worker 0-0, policy_version 238482 (0.00088) [2022-07-09 11:57:29,313][26022] Updated weights on worker 0-0, policy_version 238492 (0.00088) [2022-07-09 11:57:31,176][26022] Updated weights on worker 0-0, policy_version 238502 (0.00080) [2022-07-09 11:57:31,845][25689] Fps is (10 sec: 5690.7, 60 sec: 5782.7, 300 sec: 5743.4). Total num frames: 244230144. Throughput: 0: 5981.7. Samples: 244239686. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:31,845][25689] Avg episode reward: [(0, '-48.060')] [2022-07-09 11:57:32,894][26022] Updated weights on worker 0-0, policy_version 238512 (0.00091) [2022-07-09 11:57:34,518][26022] Updated weights on worker 0-0, policy_version 238522 (0.00093) [2022-07-09 11:57:36,426][26022] Updated weights on worker 0-0, policy_version 238532 (0.00080) [2022-07-09 11:57:36,935][25689] Fps is (10 sec: 5670.0, 60 sec: 5725.3, 300 sec: 5736.5). Total num frames: 244257792. Throughput: 0: 5963.9. Samples: 244257018. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:36,935][25689] Avg episode reward: [(0, '-47.214')] [2022-07-09 11:57:38,163][26022] Updated weights on worker 0-0, policy_version 238542 (0.00086) [2022-07-09 11:57:40,008][26022] Updated weights on worker 0-0, policy_version 238552 (0.00086) [2022-07-09 11:57:41,700][26022] Updated weights on worker 0-0, policy_version 238562 (0.00095) [2022-07-09 11:57:41,973][25689] Fps is (10 sec: 5661.7, 60 sec: 5714.3, 300 sec: 5743.1). Total num frames: 244287488. Throughput: 0: 5967.5. Samples: 244291354. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:41,974][25689] Avg episode reward: [(0, '-47.576')] [2022-07-09 11:57:43,606][26022] Updated weights on worker 0-0, policy_version 238572 (0.00087) [2022-07-09 11:57:45,367][26022] Updated weights on worker 0-0, policy_version 238582 (0.00089) [2022-07-09 11:57:46,979][25689] Fps is (10 sec: 5913.3, 60 sec: 5751.9, 300 sec: 5743.1). Total num frames: 244317184. Throughput: 0: 6000.3. Samples: 244326330. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:46,979][25689] Avg episode reward: [(0, '-47.946')] [2022-07-09 11:57:47,076][26022] Updated weights on worker 0-0, policy_version 238592 (0.00083) [2022-07-09 11:57:48,913][26022] Updated weights on worker 0-0, policy_version 238602 (0.00080) [2022-07-09 11:57:50,749][26022] Updated weights on worker 0-0, policy_version 238612 (0.00054) [2022-07-09 11:57:51,980][25689] Fps is (10 sec: 5935.3, 60 sec: 5754.1, 300 sec: 5743.2). Total num frames: 244346880. Throughput: 0: 5167.3. Samples: 244343728. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 11:57:51,980][25689] Avg episode reward: [(0, '-48.144')] [2022-07-09 11:57:52,496][26022] Updated weights on worker 0-0, policy_version 238622 (0.00100) [2022-07-09 11:57:54,188][26022] Updated weights on worker 0-0, policy_version 238632 (0.00085) [2022-07-09 11:57:55,888][26022] Updated weights on worker 0-0, policy_version 238642 (0.00093) [2022-07-09 11:57:57,011][25689] Fps is (10 sec: 5817.9, 60 sec: 5754.4, 300 sec: 5745.1). Total num frames: 244375552. Throughput: 0: 6064.9. Samples: 244378778. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:57:57,013][25689] Avg episode reward: [(0, '-47.123')] [2022-07-09 11:57:57,780][26022] Updated weights on worker 0-0, policy_version 238652 (0.00091) [2022-07-09 11:57:59,340][26022] Updated weights on worker 0-0, policy_version 238662 (0.00089) [2022-07-09 11:58:01,462][26022] Updated weights on worker 0-0, policy_version 238672 (0.00054) [2022-07-09 11:58:02,094][25689] Fps is (10 sec: 5568.2, 60 sec: 5719.2, 300 sec: 5740.8). Total num frames: 244403200. Throughput: 0: 6037.1. Samples: 244412828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:02,099][25689] Avg episode reward: [(0, '-47.743')] [2022-07-09 11:58:03,244][26022] Updated weights on worker 0-0, policy_version 238682 (0.00089) [2022-07-09 11:58:05,190][26022] Updated weights on worker 0-0, policy_version 238692 (0.01129) [2022-07-09 11:58:06,836][26022] Updated weights on worker 0-0, policy_version 238702 (0.00086) [2022-07-09 11:58:07,111][25689] Fps is (10 sec: 5475.1, 60 sec: 5718.2, 300 sec: 5744.4). Total num frames: 244430848. Throughput: 0: 5056.5. Samples: 244428128. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:07,111][25689] Avg episode reward: [(0, '-47.370')] [2022-07-09 11:58:08,932][26022] Updated weights on worker 0-0, policy_version 238712 (0.00094) [2022-07-09 11:58:10,554][26022] Updated weights on worker 0-0, policy_version 238722 (0.00081) [2022-07-09 11:58:12,159][25689] Fps is (10 sec: 5595.7, 60 sec: 5717.1, 300 sec: 5737.3). Total num frames: 244459520. Throughput: 0: 5900.7. Samples: 244462802. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:12,160][25689] Avg episode reward: [(0, '-47.346')] [2022-07-09 11:58:12,260][26022] Updated weights on worker 0-0, policy_version 238732 (0.00092) [2022-07-09 11:58:14,012][26022] Updated weights on worker 0-0, policy_version 238742 (0.00090) [2022-07-09 11:58:15,814][26022] Updated weights on worker 0-0, policy_version 238752 (0.00082) [2022-07-09 11:58:17,187][25689] Fps is (10 sec: 5792.5, 60 sec: 5733.8, 300 sec: 5742.5). Total num frames: 244489216. Throughput: 0: 5899.7. Samples: 244497810. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:17,188][25689] Avg episode reward: [(0, '-46.464')] [2022-07-09 11:58:17,693][26022] Updated weights on worker 0-0, policy_version 238762 (0.00091) [2022-07-09 11:58:19,326][26022] Updated weights on worker 0-0, policy_version 238772 (0.00091) [2022-07-09 11:58:21,113][26022] Updated weights on worker 0-0, policy_version 238782 (0.00092) [2022-07-09 11:58:22,325][25689] Fps is (10 sec: 5842.5, 60 sec: 5715.5, 300 sec: 5738.3). Total num frames: 244518912. Throughput: 0: 5053.5. Samples: 244515062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:22,325][25689] Avg episode reward: [(0, '-46.661')] [2022-07-09 11:58:22,850][26022] Updated weights on worker 0-0, policy_version 238792 (0.00089) [2022-07-09 11:58:24,601][26022] Updated weights on worker 0-0, policy_version 238802 (0.00070) [2022-07-09 11:58:26,419][26022] Updated weights on worker 0-0, policy_version 238812 (0.00093) [2022-07-09 11:58:27,332][25689] Fps is (10 sec: 5854.6, 60 sec: 5753.7, 300 sec: 5745.7). Total num frames: 244548608. Throughput: 0: 6037.0. Samples: 244550202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:27,332][25689] Avg episode reward: [(0, '-46.633')] [2022-07-09 11:58:28,032][26022] Updated weights on worker 0-0, policy_version 238822 (0.00084) [2022-07-09 11:58:29,955][26022] Updated weights on worker 0-0, policy_version 238832 (0.00085) [2022-07-09 11:58:31,924][26022] Updated weights on worker 0-0, policy_version 238842 (0.00087) [2022-07-09 11:58:32,347][25689] Fps is (10 sec: 5721.9, 60 sec: 5720.7, 300 sec: 5738.9). Total num frames: 244576256. Throughput: 0: 6041.1. Samples: 244584756. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:32,347][25689] Avg episode reward: [(0, '-46.149')] [2022-07-09 11:58:33,327][26022] Updated weights on worker 0-0, policy_version 238852 (0.00089) [2022-07-09 11:58:35,430][26022] Updated weights on worker 0-0, policy_version 238862 (0.00081) [2022-07-09 11:58:36,916][26022] Updated weights on worker 0-0, policy_version 238872 (0.00092) [2022-07-09 11:58:37,362][25689] Fps is (10 sec: 5819.0, 60 sec: 5778.6, 300 sec: 5746.5). Total num frames: 244606976. Throughput: 0: 5177.7. Samples: 244602270. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:37,363][25689] Avg episode reward: [(0, '-47.047')] [2022-07-09 11:58:38,940][26022] Updated weights on worker 0-0, policy_version 238882 (0.00098) [2022-07-09 11:58:40,407][26022] Updated weights on worker 0-0, policy_version 238892 (0.00087) [2022-07-09 11:58:42,444][25689] Fps is (10 sec: 5780.3, 60 sec: 5740.5, 300 sec: 5738.1). Total num frames: 244634624. Throughput: 0: 6048.9. Samples: 244636764. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:42,445][25689] Avg episode reward: [(0, '-47.157')] [2022-07-09 11:58:42,618][26022] Updated weights on worker 0-0, policy_version 238902 (0.00087) [2022-07-09 11:58:44,071][26022] Updated weights on worker 0-0, policy_version 238912 (0.00086) [2022-07-09 11:58:46,095][26022] Updated weights on worker 0-0, policy_version 238922 (0.00086) [2022-07-09 11:58:47,471][25689] Fps is (10 sec: 5774.0, 60 sec: 5755.4, 300 sec: 5746.1). Total num frames: 244665344. Throughput: 0: 6014.6. Samples: 244671332. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:47,472][25689] Avg episode reward: [(0, '-47.569')] [2022-07-09 11:58:47,588][26022] Updated weights on worker 0-0, policy_version 238932 (0.00090) [2022-07-09 11:58:49,587][26022] Updated weights on worker 0-0, policy_version 238942 (0.00088) [2022-07-09 11:58:51,249][26022] Updated weights on worker 0-0, policy_version 238952 (0.00088) [2022-07-09 11:58:52,503][25689] Fps is (10 sec: 5802.7, 60 sec: 5718.7, 300 sec: 5739.1). Total num frames: 244692992. Throughput: 0: 5167.9. Samples: 244688922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:52,504][25689] Avg episode reward: [(0, '-47.952')] [2022-07-09 11:58:53,099][26022] Updated weights on worker 0-0, policy_version 238962 (0.00080) [2022-07-09 11:58:53,600][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 11:58:53,614][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000238964_244699136.pth [2022-07-09 11:58:53,614][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000236944_242630656.pth [2022-07-09 11:58:54,750][26022] Updated weights on worker 0-0, policy_version 238972 (0.00089) [2022-07-09 11:58:56,775][26022] Updated weights on worker 0-0, policy_version 238982 (0.00089) [2022-07-09 11:58:57,527][25689] Fps is (10 sec: 5702.5, 60 sec: 5736.3, 300 sec: 5743.3). Total num frames: 244722688. Throughput: 0: 6004.0. Samples: 244723338. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:58:57,529][25689] Avg episode reward: [(0, '-48.327')] [2022-07-09 11:58:58,426][26022] Updated weights on worker 0-0, policy_version 238992 (0.00092) [2022-07-09 11:59:00,205][26022] Updated weights on worker 0-0, policy_version 239002 (0.00090) [2022-07-09 11:59:02,224][26022] Updated weights on worker 0-0, policy_version 239012 (0.00091) [2022-07-09 11:59:02,629][25689] Fps is (10 sec: 5562.2, 60 sec: 5717.6, 300 sec: 5743.1). Total num frames: 244749312. Throughput: 0: 5957.3. Samples: 244757008. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:02,629][25689] Avg episode reward: [(0, '-48.092')] [2022-07-09 11:59:03,983][26022] Updated weights on worker 0-0, policy_version 239022 (0.00086) [2022-07-09 11:59:05,969][26022] Updated weights on worker 0-0, policy_version 239032 (0.00082) [2022-07-09 11:59:07,458][26022] Updated weights on worker 0-0, policy_version 239042 (0.00079) [2022-07-09 11:59:07,679][25689] Fps is (10 sec: 5547.5, 60 sec: 5748.2, 300 sec: 5746.3). Total num frames: 244779008. Throughput: 0: 5055.8. Samples: 244773500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:07,680][25689] Avg episode reward: [(0, '-48.953')] [2022-07-09 11:59:09,334][26022] Updated weights on worker 0-0, policy_version 239052 (0.00085) [2022-07-09 11:59:11,206][26022] Updated weights on worker 0-0, policy_version 239062 (0.00090) [2022-07-09 11:59:12,689][25689] Fps is (10 sec: 5903.2, 60 sec: 5768.8, 300 sec: 5747.2). Total num frames: 244808704. Throughput: 0: 5927.4. Samples: 244808574. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:12,690][25689] Avg episode reward: [(0, '-49.065')] [2022-07-09 11:59:12,889][26022] Updated weights on worker 0-0, policy_version 239072 (0.00087) [2022-07-09 11:59:14,614][26022] Updated weights on worker 0-0, policy_version 239082 (0.00085) [2022-07-09 11:59:16,258][26022] Updated weights on worker 0-0, policy_version 239092 (0.00082) [2022-07-09 11:59:17,747][25689] Fps is (10 sec: 5797.5, 60 sec: 5749.0, 300 sec: 5744.2). Total num frames: 244837376. Throughput: 0: 5961.4. Samples: 244843878. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:17,748][25689] Avg episode reward: [(0, '-48.997')] [2022-07-09 11:59:18,110][26022] Updated weights on worker 0-0, policy_version 239102 (0.00086) [2022-07-09 11:59:19,728][26022] Updated weights on worker 0-0, policy_version 239112 (0.00089) [2022-07-09 11:59:21,635][26022] Updated weights on worker 0-0, policy_version 239122 (0.00082) [2022-07-09 11:59:22,847][25689] Fps is (10 sec: 5948.1, 60 sec: 5786.5, 300 sec: 5753.7). Total num frames: 244869120. Throughput: 0: 5158.6. Samples: 244861306. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:22,847][25689] Avg episode reward: [(0, '-49.766')] [2022-07-09 11:59:23,148][26022] Updated weights on worker 0-0, policy_version 239132 (0.00087) [2022-07-09 11:59:25,155][26022] Updated weights on worker 0-0, policy_version 239142 (0.00085) [2022-07-09 11:59:26,856][26022] Updated weights on worker 0-0, policy_version 239152 (0.00094) [2022-07-09 11:59:27,908][25689] Fps is (10 sec: 5844.8, 60 sec: 5747.4, 300 sec: 5754.0). Total num frames: 244896768. Throughput: 0: 6055.7. Samples: 244896002. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:27,909][25689] Avg episode reward: [(0, '-50.062')] [2022-07-09 11:59:28,795][26022] Updated weights on worker 0-0, policy_version 239162 (0.00086) [2022-07-09 11:59:30,347][26022] Updated weights on worker 0-0, policy_version 239172 (0.00087) [2022-07-09 11:59:32,286][26022] Updated weights on worker 0-0, policy_version 239182 (0.00084) [2022-07-09 11:59:32,981][25689] Fps is (10 sec: 5658.1, 60 sec: 5775.7, 300 sec: 5749.5). Total num frames: 244926464. Throughput: 0: 5996.8. Samples: 244930260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:32,982][25689] Avg episode reward: [(0, '-49.663')] [2022-07-09 11:59:34,180][26022] Updated weights on worker 0-0, policy_version 239192 (0.00090) [2022-07-09 11:59:35,978][26022] Updated weights on worker 0-0, policy_version 239202 (0.00620) [2022-07-09 11:59:37,846][26022] Updated weights on worker 0-0, policy_version 239212 (0.00093) [2022-07-09 11:59:38,023][25689] Fps is (10 sec: 5669.4, 60 sec: 5722.6, 300 sec: 5751.1). Total num frames: 244954112. Throughput: 0: 5089.1. Samples: 244947066. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:38,023][25689] Avg episode reward: [(0, '-49.664')] [2022-07-09 11:59:39,432][26022] Updated weights on worker 0-0, policy_version 239222 (0.00081) [2022-07-09 11:59:41,190][26022] Updated weights on worker 0-0, policy_version 239232 (0.00091) [2022-07-09 11:59:43,132][25689] Fps is (10 sec: 5548.3, 60 sec: 5736.9, 300 sec: 5739.4). Total num frames: 244982784. Throughput: 0: 5933.0. Samples: 244981660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:43,133][25689] Avg episode reward: [(0, '-49.279')] [2022-07-09 11:59:43,242][26022] Updated weights on worker 0-0, policy_version 239242 (0.00082) [2022-07-09 11:59:44,933][26022] Updated weights on worker 0-0, policy_version 239252 (0.00089) [2022-07-09 11:59:46,623][26022] Updated weights on worker 0-0, policy_version 239262 (0.00084) [2022-07-09 11:59:48,153][25689] Fps is (10 sec: 5761.8, 60 sec: 5720.6, 300 sec: 5746.3). Total num frames: 245012480. Throughput: 0: 5947.2. Samples: 245016402. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:48,154][25689] Avg episode reward: [(0, '-49.140')] [2022-07-09 11:59:48,478][26022] Updated weights on worker 0-0, policy_version 239272 (0.00091) [2022-07-09 11:59:50,137][26022] Updated weights on worker 0-0, policy_version 239282 (0.00098) [2022-07-09 11:59:52,031][26022] Updated weights on worker 0-0, policy_version 239292 (0.00094) [2022-07-09 11:59:53,166][25689] Fps is (10 sec: 5919.2, 60 sec: 5756.1, 300 sec: 5746.3). Total num frames: 245042176. Throughput: 0: 5992.5. Samples: 245051216. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:53,166][25689] Avg episode reward: [(0, '-49.628')] [2022-07-09 11:59:53,768][26022] Updated weights on worker 0-0, policy_version 239302 (0.00097) [2022-07-09 11:59:55,488][26022] Updated weights on worker 0-0, policy_version 239312 (0.00086) [2022-07-09 11:59:57,272][26022] Updated weights on worker 0-0, policy_version 239322 (0.00086) [2022-07-09 11:59:58,175][25689] Fps is (10 sec: 5824.0, 60 sec: 5740.6, 300 sec: 5748.7). Total num frames: 245070848. Throughput: 0: 6018.1. Samples: 245068344. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 11:59:58,176][25689] Avg episode reward: [(0, '-48.671')] [2022-07-09 11:59:58,848][26022] Updated weights on worker 0-0, policy_version 239332 (0.00086) [2022-07-09 12:00:00,790][26022] Updated weights on worker 0-0, policy_version 239342 (0.00090) [2022-07-09 12:00:02,978][26022] Updated weights on worker 0-0, policy_version 239352 (0.00094) [2022-07-09 12:00:03,235][25689] Fps is (10 sec: 5492.0, 60 sec: 5744.6, 300 sec: 5749.0). Total num frames: 245097472. Throughput: 0: 5934.1. Samples: 245100948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 12:00:03,235][25689] Avg episode reward: [(0, '-49.151')] [2022-07-09 12:00:04,752][26022] Updated weights on worker 0-0, policy_version 239362 (0.00087) [2022-07-09 12:00:06,506][26022] Updated weights on worker 0-0, policy_version 239372 (0.00088) [2022-07-09 12:00:08,332][25689] Fps is (10 sec: 5444.1, 60 sec: 5723.3, 300 sec: 5740.2). Total num frames: 245126144. Throughput: 0: 5898.6. Samples: 245135430. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 12:00:08,333][25689] Avg episode reward: [(0, '-49.018')] [2022-07-09 12:00:08,424][26022] Updated weights on worker 0-0, policy_version 239382 (0.00093) [2022-07-09 12:00:10,145][26022] Updated weights on worker 0-0, policy_version 239392 (0.00092) [2022-07-09 12:00:11,935][26022] Updated weights on worker 0-0, policy_version 239402 (0.00082) [2022-07-09 12:00:13,345][25689] Fps is (10 sec: 5772.9, 60 sec: 5723.1, 300 sec: 5747.9). Total num frames: 245155840. Throughput: 0: 5023.7. Samples: 245152590. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 12:00:13,346][25689] Avg episode reward: [(0, '-49.829')] [2022-07-09 12:00:13,827][26022] Updated weights on worker 0-0, policy_version 239412 (0.00079) [2022-07-09 12:00:15,407][26022] Updated weights on worker 0-0, policy_version 239422 (0.00082) [2022-07-09 12:00:17,167][26022] Updated weights on worker 0-0, policy_version 239432 (0.00077) [2022-07-09 12:00:18,391][25689] Fps is (10 sec: 5802.5, 60 sec: 5724.1, 300 sec: 5742.4). Total num frames: 245184512. Throughput: 0: 5886.4. Samples: 245187344. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:18,392][25689] Avg episode reward: [(0, '-49.365')] [2022-07-09 12:00:18,964][26022] Updated weights on worker 0-0, policy_version 239442 (0.00083) [2022-07-09 12:00:20,960][26022] Updated weights on worker 0-0, policy_version 239452 (0.00086) [2022-07-09 12:00:22,430][26022] Updated weights on worker 0-0, policy_version 239462 (0.00086) [2022-07-09 12:00:23,484][25689] Fps is (10 sec: 5757.1, 60 sec: 5691.1, 300 sec: 5741.5). Total num frames: 245214208. Throughput: 0: 5978.6. Samples: 245222008. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:23,484][25689] Avg episode reward: [(0, '-48.932')] [2022-07-09 12:00:24,553][26022] Updated weights on worker 0-0, policy_version 239472 (0.00092) [2022-07-09 12:00:26,063][26022] Updated weights on worker 0-0, policy_version 239482 (0.00093) [2022-07-09 12:00:28,063][26022] Updated weights on worker 0-0, policy_version 239492 (0.00085) [2022-07-09 12:00:28,543][25689] Fps is (10 sec: 5850.6, 60 sec: 5725.1, 300 sec: 5747.7). Total num frames: 245243904. Throughput: 0: 5134.5. Samples: 245239202. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:28,543][25689] Avg episode reward: [(0, '-48.488')] [2022-07-09 12:00:29,575][26022] Updated weights on worker 0-0, policy_version 239502 (0.00084) [2022-07-09 12:00:31,431][26022] Updated weights on worker 0-0, policy_version 239512 (0.00083) [2022-07-09 12:00:33,180][26022] Updated weights on worker 0-0, policy_version 239522 (0.00090) [2022-07-09 12:00:33,566][25689] Fps is (10 sec: 5890.6, 60 sec: 5729.8, 300 sec: 5744.0). Total num frames: 245273600. Throughput: 0: 6017.4. Samples: 245274266. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:33,567][25689] Avg episode reward: [(0, '-48.710')] [2022-07-09 12:00:34,990][26022] Updated weights on worker 0-0, policy_version 239532 (0.00347) [2022-07-09 12:00:36,726][26022] Updated weights on worker 0-0, policy_version 239542 (0.00086) [2022-07-09 12:00:38,579][25689] Fps is (10 sec: 5611.9, 60 sec: 5715.6, 300 sec: 5739.3). Total num frames: 245300224. Throughput: 0: 6008.1. Samples: 245308632. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:38,579][25689] Avg episode reward: [(0, '-48.181')] [2022-07-09 12:00:38,716][26022] Updated weights on worker 0-0, policy_version 239552 (0.00089) [2022-07-09 12:00:40,287][26022] Updated weights on worker 0-0, policy_version 239562 (0.00088) [2022-07-09 12:00:42,362][26022] Updated weights on worker 0-0, policy_version 239572 (0.00086) [2022-07-09 12:00:43,629][25689] Fps is (10 sec: 5596.8, 60 sec: 5738.1, 300 sec: 5741.9). Total num frames: 245329920. Throughput: 0: 5155.0. Samples: 245325858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:43,629][25689] Avg episode reward: [(0, '-48.965')] [2022-07-09 12:00:43,818][26022] Updated weights on worker 0-0, policy_version 239582 (0.00082) [2022-07-09 12:00:45,803][26022] Updated weights on worker 0-0, policy_version 239592 (0.00085) [2022-07-09 12:00:47,334][26022] Updated weights on worker 0-0, policy_version 239602 (0.00097) [2022-07-09 12:00:48,653][25689] Fps is (10 sec: 5793.7, 60 sec: 5720.9, 300 sec: 5738.2). Total num frames: 245358592. Throughput: 0: 6036.0. Samples: 245360588. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:48,655][25689] Avg episode reward: [(0, '-47.869')] [2022-07-09 12:00:49,311][26022] Updated weights on worker 0-0, policy_version 239612 (0.00090) [2022-07-09 12:00:50,976][26022] Updated weights on worker 0-0, policy_version 239622 (0.00087) [2022-07-09 12:00:52,687][26022] Updated weights on worker 0-0, policy_version 239632 (0.00084) [2022-07-09 12:00:53,661][25689] Fps is (10 sec: 5715.9, 60 sec: 5704.4, 300 sec: 5741.6). Total num frames: 245387264. Throughput: 0: 6019.5. Samples: 245395230. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:53,663][25689] Avg episode reward: [(0, '-47.553')] [2022-07-09 12:00:53,720][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:00:53,732][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000239637_245388288.pth [2022-07-09 12:00:53,733][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000237616_243318784.pth [2022-07-09 12:00:54,625][26022] Updated weights on worker 0-0, policy_version 239642 (0.00091) [2022-07-09 12:00:56,188][26022] Updated weights on worker 0-0, policy_version 239652 (0.00078) [2022-07-09 12:00:57,947][26022] Updated weights on worker 0-0, policy_version 239662 (0.00083) [2022-07-09 12:00:58,699][25689] Fps is (10 sec: 5912.2, 60 sec: 5735.5, 300 sec: 5741.9). Total num frames: 245417984. Throughput: 0: 5173.7. Samples: 245412728. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:00:58,699][25689] Avg episode reward: [(0, '-48.738')] [2022-07-09 12:00:59,783][26022] Updated weights on worker 0-0, policy_version 239672 (0.00093) [2022-07-09 12:01:02,025][26022] Updated weights on worker 0-0, policy_version 239682 (0.00086) [2022-07-09 12:01:03,769][25689] Fps is (10 sec: 5673.4, 60 sec: 5734.5, 300 sec: 5741.6). Total num frames: 245444608. Throughput: 0: 5938.5. Samples: 245445458. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:03,769][25689] Avg episode reward: [(0, '-48.723')] [2022-07-09 12:01:03,771][26022] Updated weights on worker 0-0, policy_version 239692 (0.00095) [2022-07-09 12:01:05,633][26022] Updated weights on worker 0-0, policy_version 239702 (0.00093) [2022-07-09 12:01:07,366][26022] Updated weights on worker 0-0, policy_version 239712 (0.00091) [2022-07-09 12:01:08,831][25689] Fps is (10 sec: 5356.3, 60 sec: 5721.0, 300 sec: 5737.7). Total num frames: 245472256. Throughput: 0: 5899.9. Samples: 245479636. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:08,831][25689] Avg episode reward: [(0, '-47.605')] [2022-07-09 12:01:09,254][26022] Updated weights on worker 0-0, policy_version 239722 (0.00079) [2022-07-09 12:01:10,885][26022] Updated weights on worker 0-0, policy_version 239732 (0.00080) [2022-07-09 12:01:12,668][26022] Updated weights on worker 0-0, policy_version 239742 (0.00086) [2022-07-09 12:01:13,864][25689] Fps is (10 sec: 5781.6, 60 sec: 5736.0, 300 sec: 5742.0). Total num frames: 245502976. Throughput: 0: 5031.1. Samples: 245496874. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:13,866][25689] Avg episode reward: [(0, '-47.855')] [2022-07-09 12:01:14,517][26022] Updated weights on worker 0-0, policy_version 239752 (0.00049) [2022-07-09 12:01:16,277][26022] Updated weights on worker 0-0, policy_version 239762 (0.00092) [2022-07-09 12:01:18,172][26022] Updated weights on worker 0-0, policy_version 239772 (0.00087) [2022-07-09 12:01:18,875][25689] Fps is (10 sec: 5811.4, 60 sec: 5722.4, 300 sec: 5735.9). Total num frames: 245530624. Throughput: 0: 5883.4. Samples: 245531432. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:18,875][25689] Avg episode reward: [(0, '-48.814')] [2022-07-09 12:01:19,816][26022] Updated weights on worker 0-0, policy_version 239782 (0.00085) [2022-07-09 12:01:21,679][26022] Updated weights on worker 0-0, policy_version 239792 (0.00084) [2022-07-09 12:01:23,339][26022] Updated weights on worker 0-0, policy_version 239802 (0.00084) [2022-07-09 12:01:23,925][25689] Fps is (10 sec: 5699.5, 60 sec: 5726.4, 300 sec: 5738.4). Total num frames: 245560320. Throughput: 0: 5982.8. Samples: 245566052. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:23,926][25689] Avg episode reward: [(0, '-48.210')] [2022-07-09 12:01:25,162][26022] Updated weights on worker 0-0, policy_version 239812 (0.00083) [2022-07-09 12:01:26,938][26022] Updated weights on worker 0-0, policy_version 239822 (0.00090) [2022-07-09 12:01:28,849][26022] Updated weights on worker 0-0, policy_version 239832 (0.00090) [2022-07-09 12:01:28,952][25689] Fps is (10 sec: 5690.4, 60 sec: 5695.6, 300 sec: 5738.0). Total num frames: 245587968. Throughput: 0: 5148.0. Samples: 245583218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:28,952][25689] Avg episode reward: [(0, '-46.913')] [2022-07-09 12:01:30,559][26022] Updated weights on worker 0-0, policy_version 239842 (0.00093) [2022-07-09 12:01:32,380][26022] Updated weights on worker 0-0, policy_version 239852 (0.00090) [2022-07-09 12:01:33,956][25689] Fps is (10 sec: 5716.8, 60 sec: 5697.3, 300 sec: 5734.9). Total num frames: 245617664. Throughput: 0: 6004.7. Samples: 245617520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:33,958][25689] Avg episode reward: [(0, '-47.985')] [2022-07-09 12:01:34,084][26022] Updated weights on worker 0-0, policy_version 239862 (0.00054) [2022-07-09 12:01:35,983][26022] Updated weights on worker 0-0, policy_version 239872 (0.00082) [2022-07-09 12:01:37,707][26022] Updated weights on worker 0-0, policy_version 239882 (0.00080) [2022-07-09 12:01:38,975][25689] Fps is (10 sec: 5823.4, 60 sec: 5730.7, 300 sec: 5729.6). Total num frames: 245646336. Throughput: 0: 5987.9. Samples: 245651790. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:38,976][25689] Avg episode reward: [(0, '-48.425')] [2022-07-09 12:01:39,402][26022] Updated weights on worker 0-0, policy_version 239892 (0.00086) [2022-07-09 12:01:41,482][26022] Updated weights on worker 0-0, policy_version 239902 (0.00082) [2022-07-09 12:01:43,117][26022] Updated weights on worker 0-0, policy_version 239912 (0.00085) [2022-07-09 12:01:44,062][25689] Fps is (10 sec: 5674.3, 60 sec: 5710.2, 300 sec: 5732.2). Total num frames: 245675008. Throughput: 0: 5117.7. Samples: 245669106. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:44,063][25689] Avg episode reward: [(0, '-48.965')] [2022-07-09 12:01:44,954][26022] Updated weights on worker 0-0, policy_version 239922 (0.00093) [2022-07-09 12:01:46,437][26022] Updated weights on worker 0-0, policy_version 239932 (0.00090) [2022-07-09 12:01:48,353][26022] Updated weights on worker 0-0, policy_version 239942 (0.00619) [2022-07-09 12:01:49,093][25689] Fps is (10 sec: 5667.3, 60 sec: 5709.6, 300 sec: 5728.6). Total num frames: 245703680. Throughput: 0: 6001.6. Samples: 245704098. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:49,094][25689] Avg episode reward: [(0, '-49.461')] [2022-07-09 12:01:50,076][26022] Updated weights on worker 0-0, policy_version 239952 (0.00090) [2022-07-09 12:01:51,827][26022] Updated weights on worker 0-0, policy_version 239962 (0.00086) [2022-07-09 12:01:53,803][26022] Updated weights on worker 0-0, policy_version 239972 (0.00090) [2022-07-09 12:01:54,130][25689] Fps is (10 sec: 5797.4, 60 sec: 5723.8, 300 sec: 5732.0). Total num frames: 245733376. Throughput: 0: 6021.3. Samples: 245738992. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:54,131][25689] Avg episode reward: [(0, '-51.010')] [2022-07-09 12:01:55,463][26022] Updated weights on worker 0-0, policy_version 239982 (0.00082) [2022-07-09 12:01:57,238][26022] Updated weights on worker 0-0, policy_version 239992 (0.00089) [2022-07-09 12:01:58,932][26022] Updated weights on worker 0-0, policy_version 240002 (0.00091) [2022-07-09 12:01:59,136][25689] Fps is (10 sec: 5812.2, 60 sec: 5692.9, 300 sec: 5729.8). Total num frames: 245762048. Throughput: 0: 5187.2. Samples: 245756368. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:01:59,136][25689] Avg episode reward: [(0, '-51.810')] [2022-07-09 12:02:00,812][26022] Updated weights on worker 0-0, policy_version 240012 (0.00091) [2022-07-09 12:02:02,849][26022] Updated weights on worker 0-0, policy_version 240022 (0.00085) [2022-07-09 12:02:04,166][25689] Fps is (10 sec: 5509.9, 60 sec: 5696.7, 300 sec: 5725.9). Total num frames: 245788672. Throughput: 0: 5993.9. Samples: 245789606. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:02:04,166][25689] Avg episode reward: [(0, '-51.133')] [2022-07-09 12:02:04,851][26022] Updated weights on worker 0-0, policy_version 240032 (0.00089) [2022-07-09 12:02:06,298][26022] Updated weights on worker 0-0, policy_version 240042 (0.00085) [2022-07-09 12:02:08,430][26022] Updated weights on worker 0-0, policy_version 240052 (0.00457) [2022-07-09 12:02:09,175][25689] Fps is (10 sec: 5712.1, 60 sec: 5752.6, 300 sec: 5733.3). Total num frames: 245819392. Throughput: 0: 5944.1. Samples: 245823464. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:02:09,175][25689] Avg episode reward: [(0, '-50.641')] [2022-07-09 12:02:09,872][26022] Updated weights on worker 0-0, policy_version 240062 (0.00087) [2022-07-09 12:02:11,828][26022] Updated weights on worker 0-0, policy_version 240072 (0.00086) [2022-07-09 12:02:13,531][26022] Updated weights on worker 0-0, policy_version 240082 (0.00086) [2022-07-09 12:02:14,194][25689] Fps is (10 sec: 5718.4, 60 sec: 5686.1, 300 sec: 5726.6). Total num frames: 245846016. Throughput: 0: 5083.1. Samples: 245840978. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:02:14,195][25689] Avg episode reward: [(0, '-49.570')] [2022-07-09 12:02:15,323][26022] Updated weights on worker 0-0, policy_version 240092 (0.00084) [2022-07-09 12:02:17,199][26022] Updated weights on worker 0-0, policy_version 240102 (0.00094) [2022-07-09 12:02:18,879][26022] Updated weights on worker 0-0, policy_version 240112 (0.00093) [2022-07-09 12:02:19,244][25689] Fps is (10 sec: 5694.9, 60 sec: 5733.2, 300 sec: 5728.0). Total num frames: 245876736. Throughput: 0: 5935.7. Samples: 245875726. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:02:19,246][25689] Avg episode reward: [(0, '-48.990')] [2022-07-09 12:02:20,716][26022] Updated weights on worker 0-0, policy_version 240122 (0.00090) [2022-07-09 12:02:22,381][26022] Updated weights on worker 0-0, policy_version 240132 (0.00091) [2022-07-09 12:02:24,294][25689] Fps is (10 sec: 5778.7, 60 sec: 5699.4, 300 sec: 5728.0). Total num frames: 245904384. Throughput: 0: 6016.0. Samples: 245910700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:02:24,295][25689] Avg episode reward: [(0, '-48.173')] [2022-07-09 12:02:24,376][26022] Updated weights on worker 0-0, policy_version 240142 (0.00089) [2022-07-09 12:02:25,875][26022] Updated weights on worker 0-0, policy_version 240152 (0.00064) [2022-07-09 12:02:27,798][26022] Updated weights on worker 0-0, policy_version 240162 (0.00088) [2022-07-09 12:02:29,309][25689] Fps is (10 sec: 5799.2, 60 sec: 5751.4, 300 sec: 5731.7). Total num frames: 245935104. Throughput: 0: 5191.5. Samples: 245927990. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 12:02:29,309][25689] Avg episode reward: [(0, '-47.876')] [2022-07-09 12:02:29,356][26022] Updated weights on worker 0-0, policy_version 240172 (0.00091) [2022-07-09 12:02:31,369][26022] Updated weights on worker 0-0, policy_version 240182 (0.00088) [2022-07-09 12:02:33,051][26022] Updated weights on worker 0-0, policy_version 240192 (0.00084) [2022-07-09 12:02:34,339][25689] Fps is (10 sec: 5810.9, 60 sec: 5715.0, 300 sec: 5732.8). Total num frames: 245962752. Throughput: 0: 6019.5. Samples: 245962242. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:02:34,341][25689] Avg episode reward: [(0, '-48.017')] [2022-07-09 12:02:34,763][26022] Updated weights on worker 0-0, policy_version 240202 (0.00083) [2022-07-09 12:02:36,744][26022] Updated weights on worker 0-0, policy_version 240212 (0.00091) [2022-07-09 12:02:38,368][26022] Updated weights on worker 0-0, policy_version 240222 (0.00095) [2022-07-09 12:02:39,357][25689] Fps is (10 sec: 5706.9, 60 sec: 5732.0, 300 sec: 5733.2). Total num frames: 245992448. Throughput: 0: 6023.1. Samples: 245996868. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:02:39,358][25689] Avg episode reward: [(0, '-48.833')] [2022-07-09 12:02:40,149][26022] Updated weights on worker 0-0, policy_version 240232 (0.00087) [2022-07-09 12:02:42,244][26022] Updated weights on worker 0-0, policy_version 240242 (0.00086) [2022-07-09 12:02:43,513][26022] Updated weights on worker 0-0, policy_version 240252 (0.00087) [2022-07-09 12:02:44,415][25689] Fps is (10 sec: 5792.9, 60 sec: 5734.8, 300 sec: 5728.7). Total num frames: 246021120. Throughput: 0: 5145.7. Samples: 246014234. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:02:44,418][25689] Avg episode reward: [(0, '-48.326')] [2022-07-09 12:02:45,571][26022] Updated weights on worker 0-0, policy_version 240262 (0.00085) [2022-07-09 12:02:47,158][26022] Updated weights on worker 0-0, policy_version 240272 (0.00082) [2022-07-09 12:02:49,139][26022] Updated weights on worker 0-0, policy_version 240282 (0.00091) [2022-07-09 12:02:49,439][25689] Fps is (10 sec: 5891.2, 60 sec: 5769.5, 300 sec: 5731.8). Total num frames: 246051840. Throughput: 0: 6016.7. Samples: 246049104. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:02:49,439][25689] Avg episode reward: [(0, '-48.623')] [2022-07-09 12:02:50,932][26022] Updated weights on worker 0-0, policy_version 240292 (0.00081) [2022-07-09 12:02:52,551][26022] Updated weights on worker 0-0, policy_version 240302 (0.00097) [2022-07-09 12:02:53,826][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:02:53,843][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000240309_246076416.pth [2022-07-09 12:02:53,844][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000238291_244009984.pth [2022-07-09 12:02:54,239][26022] Updated weights on worker 0-0, policy_version 240312 (0.00091) [2022-07-09 12:02:54,468][25689] Fps is (10 sec: 5805.9, 60 sec: 5736.2, 300 sec: 5728.3). Total num frames: 246079488. Throughput: 0: 6058.2. Samples: 246084188. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:02:54,470][25689] Avg episode reward: [(0, '-47.902')] [2022-07-09 12:02:56,134][26022] Updated weights on worker 0-0, policy_version 240322 (0.00087) [2022-07-09 12:02:57,769][26022] Updated weights on worker 0-0, policy_version 240332 (0.00089) [2022-07-09 12:02:59,494][25689] Fps is (10 sec: 5499.3, 60 sec: 5717.4, 300 sec: 5729.5). Total num frames: 246107136. Throughput: 0: 5190.4. Samples: 246101384. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:02:59,494][25689] Avg episode reward: [(0, '-47.941')] [2022-07-09 12:02:59,787][26022] Updated weights on worker 0-0, policy_version 240342 (0.00088) [2022-07-09 12:03:01,288][26022] Updated weights on worker 0-0, policy_version 240352 (0.00049) [2022-07-09 12:03:03,642][26022] Updated weights on worker 0-0, policy_version 240362 (0.00092) [2022-07-09 12:03:04,538][25689] Fps is (10 sec: 5592.9, 60 sec: 5749.9, 300 sec: 5732.4). Total num frames: 246135808. Throughput: 0: 5942.8. Samples: 246133822. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:04,538][25689] Avg episode reward: [(0, '-47.747')] [2022-07-09 12:03:05,322][26022] Updated weights on worker 0-0, policy_version 240372 (0.00080) [2022-07-09 12:03:07,303][26022] Updated weights on worker 0-0, policy_version 240382 (0.00085) [2022-07-09 12:03:08,955][26022] Updated weights on worker 0-0, policy_version 240392 (0.00090) [2022-07-09 12:03:09,554][25689] Fps is (10 sec: 5699.9, 60 sec: 5715.3, 300 sec: 5733.0). Total num frames: 246164480. Throughput: 0: 5941.2. Samples: 246168614. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:09,555][25689] Avg episode reward: [(0, '-48.135')] [2022-07-09 12:03:10,651][26022] Updated weights on worker 0-0, policy_version 240402 (0.00087) [2022-07-09 12:03:12,463][26022] Updated weights on worker 0-0, policy_version 240412 (0.00094) [2022-07-09 12:03:14,403][26022] Updated weights on worker 0-0, policy_version 240422 (0.00089) [2022-07-09 12:03:14,567][25689] Fps is (10 sec: 5717.9, 60 sec: 5749.9, 300 sec: 5729.8). Total num frames: 246193152. Throughput: 0: 5057.3. Samples: 246185834. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:14,567][25689] Avg episode reward: [(0, '-48.918')] [2022-07-09 12:03:16,064][26022] Updated weights on worker 0-0, policy_version 240432 (0.00094) [2022-07-09 12:03:17,864][26022] Updated weights on worker 0-0, policy_version 240442 (0.00084) [2022-07-09 12:03:19,505][26022] Updated weights on worker 0-0, policy_version 240452 (0.00084) [2022-07-09 12:03:19,573][25689] Fps is (10 sec: 5825.7, 60 sec: 5737.1, 300 sec: 5732.4). Total num frames: 246222848. Throughput: 0: 5933.6. Samples: 246220528. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:19,574][25689] Avg episode reward: [(0, '-48.771')] [2022-07-09 12:03:21,568][26022] Updated weights on worker 0-0, policy_version 240462 (0.00090) [2022-07-09 12:03:23,168][26022] Updated weights on worker 0-0, policy_version 240472 (0.00091) [2022-07-09 12:03:24,629][25689] Fps is (10 sec: 5699.1, 60 sec: 5736.6, 300 sec: 5724.5). Total num frames: 246250496. Throughput: 0: 6037.0. Samples: 246255110. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:24,629][25689] Avg episode reward: [(0, '-49.149')] [2022-07-09 12:03:25,074][26022] Updated weights on worker 0-0, policy_version 240482 (0.00085) [2022-07-09 12:03:26,542][26022] Updated weights on worker 0-0, policy_version 240492 (0.00088) [2022-07-09 12:03:28,642][26022] Updated weights on worker 0-0, policy_version 240502 (0.00083) [2022-07-09 12:03:29,679][25689] Fps is (10 sec: 5775.9, 60 sec: 5733.2, 300 sec: 5734.2). Total num frames: 246281216. Throughput: 0: 5998.7. Samples: 246289334. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:29,679][25689] Avg episode reward: [(0, '-48.147')] [2022-07-09 12:03:30,417][26022] Updated weights on worker 0-0, policy_version 240512 (0.00091) [2022-07-09 12:03:32,108][26022] Updated weights on worker 0-0, policy_version 240522 (0.00085) [2022-07-09 12:03:34,024][26022] Updated weights on worker 0-0, policy_version 240532 (0.00065) [2022-07-09 12:03:34,683][25689] Fps is (10 sec: 5805.5, 60 sec: 5735.7, 300 sec: 5724.1). Total num frames: 246308864. Throughput: 0: 5994.3. Samples: 246306414. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:34,683][25689] Avg episode reward: [(0, '-47.877')] [2022-07-09 12:03:35,743][26022] Updated weights on worker 0-0, policy_version 240542 (0.00098) [2022-07-09 12:03:37,507][26022] Updated weights on worker 0-0, policy_version 240552 (0.00086) [2022-07-09 12:03:39,436][26022] Updated weights on worker 0-0, policy_version 240562 (0.00093) [2022-07-09 12:03:39,704][25689] Fps is (10 sec: 5617.7, 60 sec: 5718.4, 300 sec: 5728.7). Total num frames: 246337536. Throughput: 0: 5973.4. Samples: 246340778. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:39,705][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 12:03:41,034][26022] Updated weights on worker 0-0, policy_version 240572 (0.00088) [2022-07-09 12:03:42,980][26022] Updated weights on worker 0-0, policy_version 240582 (0.00090) [2022-07-09 12:03:44,845][25689] Fps is (10 sec: 5541.7, 60 sec: 5693.5, 300 sec: 5716.2). Total num frames: 246365184. Throughput: 0: 5912.0. Samples: 246374632. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:44,846][25689] Avg episode reward: [(0, '-46.834')] [2022-07-09 12:03:44,892][26022] Updated weights on worker 0-0, policy_version 240592 (0.00088) [2022-07-09 12:03:46,569][26022] Updated weights on worker 0-0, policy_version 240602 (0.00091) [2022-07-09 12:03:48,387][26022] Updated weights on worker 0-0, policy_version 240612 (0.00092) [2022-07-09 12:03:49,870][25689] Fps is (10 sec: 5640.6, 60 sec: 5676.5, 300 sec: 5723.2). Total num frames: 246394880. Throughput: 0: 5074.6. Samples: 246391798. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:49,871][25689] Avg episode reward: [(0, '-47.504')] [2022-07-09 12:03:50,134][26022] Updated weights on worker 0-0, policy_version 240622 (0.01122) [2022-07-09 12:03:51,936][26022] Updated weights on worker 0-0, policy_version 240632 (0.00088) [2022-07-09 12:03:53,857][26022] Updated weights on worker 0-0, policy_version 240642 (0.00092) [2022-07-09 12:03:54,904][25689] Fps is (10 sec: 5802.7, 60 sec: 5693.0, 300 sec: 5719.5). Total num frames: 246423552. Throughput: 0: 5899.3. Samples: 246425708. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:54,905][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 12:03:55,600][26022] Updated weights on worker 0-0, policy_version 240652 (0.00084) [2022-07-09 12:03:57,407][26022] Updated weights on worker 0-0, policy_version 240662 (0.00081) [2022-07-09 12:03:59,373][26022] Updated weights on worker 0-0, policy_version 240672 (0.00092) [2022-07-09 12:03:59,907][25689] Fps is (10 sec: 5713.4, 60 sec: 5712.1, 300 sec: 5728.3). Total num frames: 246452224. Throughput: 0: 5902.9. Samples: 246460034. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:03:59,908][25689] Avg episode reward: [(0, '-47.259')] [2022-07-09 12:04:00,843][26022] Updated weights on worker 0-0, policy_version 240682 (0.00087) [2022-07-09 12:04:03,300][26022] Updated weights on worker 0-0, policy_version 240692 (0.00084) [2022-07-09 12:04:04,947][26022] Updated weights on worker 0-0, policy_version 240702 (0.00087) [2022-07-09 12:04:05,032][25689] Fps is (10 sec: 5460.0, 60 sec: 5670.7, 300 sec: 5716.6). Total num frames: 246478848. Throughput: 0: 4971.3. Samples: 246474984. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:05,032][25689] Avg episode reward: [(0, '-46.621')] [2022-07-09 12:04:06,922][26022] Updated weights on worker 0-0, policy_version 240712 (0.00088) [2022-07-09 12:04:08,546][26022] Updated weights on worker 0-0, policy_version 240722 (0.00086) [2022-07-09 12:04:10,097][25689] Fps is (10 sec: 5426.3, 60 sec: 5666.0, 300 sec: 5712.1). Total num frames: 246507520. Throughput: 0: 5814.5. Samples: 246509408. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:10,098][25689] Avg episode reward: [(0, '-46.740')] [2022-07-09 12:04:10,400][26022] Updated weights on worker 0-0, policy_version 240732 (0.00087) [2022-07-09 12:04:12,047][26022] Updated weights on worker 0-0, policy_version 240742 (0.00093) [2022-07-09 12:04:13,908][26022] Updated weights on worker 0-0, policy_version 240752 (0.00092) [2022-07-09 12:04:15,130][25689] Fps is (10 sec: 5678.7, 60 sec: 5664.2, 300 sec: 5712.6). Total num frames: 246536192. Throughput: 0: 5854.1. Samples: 246544110. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:15,130][25689] Avg episode reward: [(0, '-47.952')] [2022-07-09 12:04:15,519][26022] Updated weights on worker 0-0, policy_version 240762 (0.00088) [2022-07-09 12:04:17,542][26022] Updated weights on worker 0-0, policy_version 240772 (0.00087) [2022-07-09 12:04:19,140][26022] Updated weights on worker 0-0, policy_version 240782 (0.00092) [2022-07-09 12:04:20,173][25689] Fps is (10 sec: 5792.9, 60 sec: 5660.7, 300 sec: 5706.8). Total num frames: 246565888. Throughput: 0: 5003.7. Samples: 246561440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:20,174][25689] Avg episode reward: [(0, '-47.640')] [2022-07-09 12:04:21,038][26022] Updated weights on worker 0-0, policy_version 240792 (0.00085) [2022-07-09 12:04:22,702][26022] Updated weights on worker 0-0, policy_version 240802 (0.00078) [2022-07-09 12:04:24,603][26022] Updated weights on worker 0-0, policy_version 240812 (0.00096) [2022-07-09 12:04:25,233][25689] Fps is (10 sec: 5777.3, 60 sec: 5677.2, 300 sec: 5710.2). Total num frames: 246594560. Throughput: 0: 5991.7. Samples: 246596022. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:25,233][25689] Avg episode reward: [(0, '-47.498')] [2022-07-09 12:04:26,467][26022] Updated weights on worker 0-0, policy_version 240822 (0.00087) [2022-07-09 12:04:28,143][26022] Updated weights on worker 0-0, policy_version 240832 (0.00087) [2022-07-09 12:04:30,051][26022] Updated weights on worker 0-0, policy_version 240842 (0.00087) [2022-07-09 12:04:30,309][25689] Fps is (10 sec: 5657.5, 60 sec: 5641.0, 300 sec: 5706.7). Total num frames: 246623232. Throughput: 0: 5979.7. Samples: 246630268. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:30,310][25689] Avg episode reward: [(0, '-47.823')] [2022-07-09 12:04:31,677][26022] Updated weights on worker 0-0, policy_version 240852 (0.00081) [2022-07-09 12:04:33,582][26022] Updated weights on worker 0-0, policy_version 240862 (0.00087) [2022-07-09 12:04:35,250][26022] Updated weights on worker 0-0, policy_version 240872 (0.01255) [2022-07-09 12:04:35,336][25689] Fps is (10 sec: 5777.3, 60 sec: 5672.7, 300 sec: 5713.9). Total num frames: 246652928. Throughput: 0: 5119.5. Samples: 246647554. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:35,336][25689] Avg episode reward: [(0, '-48.330')] [2022-07-09 12:04:37,086][26022] Updated weights on worker 0-0, policy_version 240882 (0.00084) [2022-07-09 12:04:38,814][26022] Updated weights on worker 0-0, policy_version 240892 (0.00084) [2022-07-09 12:04:40,365][25689] Fps is (10 sec: 5804.5, 60 sec: 5671.9, 300 sec: 5715.4). Total num frames: 246681600. Throughput: 0: 5974.6. Samples: 246682076. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:40,365][25689] Avg episode reward: [(0, '-47.697')] [2022-07-09 12:04:40,722][26022] Updated weights on worker 0-0, policy_version 240902 (0.00082) [2022-07-09 12:04:42,546][26022] Updated weights on worker 0-0, policy_version 240912 (0.00085) [2022-07-09 12:04:44,320][26022] Updated weights on worker 0-0, policy_version 240922 (0.00091) [2022-07-09 12:04:45,402][25689] Fps is (10 sec: 5696.7, 60 sec: 5698.6, 300 sec: 5711.7). Total num frames: 246710272. Throughput: 0: 5976.8. Samples: 246716568. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:45,402][25689] Avg episode reward: [(0, '-47.714')] [2022-07-09 12:04:45,893][26022] Updated weights on worker 0-0, policy_version 240932 (0.00090) [2022-07-09 12:04:47,702][26022] Updated weights on worker 0-0, policy_version 240942 (0.00086) [2022-07-09 12:04:49,551][26022] Updated weights on worker 0-0, policy_version 240952 (0.00089) [2022-07-09 12:04:50,406][25689] Fps is (10 sec: 5710.9, 60 sec: 5683.7, 300 sec: 5708.4). Total num frames: 246738944. Throughput: 0: 5167.1. Samples: 246734110. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-09 12:04:50,406][25689] Avg episode reward: [(0, '-47.281')] [2022-07-09 12:04:51,334][26022] Updated weights on worker 0-0, policy_version 240962 (0.00089) [2022-07-09 12:04:53,116][26022] Updated weights on worker 0-0, policy_version 240972 (0.01430) [2022-07-09 12:04:54,017][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:04:54,031][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000240978_246761472.pth [2022-07-09 12:04:54,032][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000238964_244699136.pth [2022-07-09 12:04:54,807][26022] Updated weights on worker 0-0, policy_version 240982 (0.00096) [2022-07-09 12:04:55,437][25689] Fps is (10 sec: 5816.1, 60 sec: 5700.8, 300 sec: 5711.4). Total num frames: 246768640. Throughput: 0: 6036.6. Samples: 246768900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:04:55,438][25689] Avg episode reward: [(0, '-46.985')] [2022-07-09 12:04:56,432][26022] Updated weights on worker 0-0, policy_version 240992 (0.00093) [2022-07-09 12:04:58,302][26022] Updated weights on worker 0-0, policy_version 241002 (0.00083) [2022-07-09 12:04:59,753][26022] Updated weights on worker 0-0, policy_version 241012 (0.00086) [2022-07-09 12:05:00,453][25689] Fps is (10 sec: 5809.2, 60 sec: 5699.6, 300 sec: 5719.2). Total num frames: 246797312. Throughput: 0: 6053.0. Samples: 246803672. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:00,454][25689] Avg episode reward: [(0, '-46.966')] [2022-07-09 12:05:02,398][26022] Updated weights on worker 0-0, policy_version 241022 (0.00082) [2022-07-09 12:05:03,963][26022] Updated weights on worker 0-0, policy_version 241032 (0.00080) [2022-07-09 12:05:05,517][25689] Fps is (10 sec: 5485.8, 60 sec: 5705.3, 300 sec: 5712.9). Total num frames: 246823936. Throughput: 0: 5085.0. Samples: 246818856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:05,518][25689] Avg episode reward: [(0, '-47.068')] [2022-07-09 12:05:05,938][26022] Updated weights on worker 0-0, policy_version 241042 (0.00086) [2022-07-09 12:05:07,573][26022] Updated weights on worker 0-0, policy_version 241052 (0.00089) [2022-07-09 12:05:09,283][26022] Updated weights on worker 0-0, policy_version 241062 (0.00080) [2022-07-09 12:05:10,569][25689] Fps is (10 sec: 5668.7, 60 sec: 5740.4, 300 sec: 5715.6). Total num frames: 246854656. Throughput: 0: 5921.2. Samples: 246853502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:10,570][25689] Avg episode reward: [(0, '-47.011')] [2022-07-09 12:05:11,235][26022] Updated weights on worker 0-0, policy_version 241072 (0.00094) [2022-07-09 12:05:12,913][26022] Updated weights on worker 0-0, policy_version 241082 (0.00083) [2022-07-09 12:05:14,706][26022] Updated weights on worker 0-0, policy_version 241092 (0.00097) [2022-07-09 12:05:15,583][25689] Fps is (10 sec: 5798.9, 60 sec: 5725.3, 300 sec: 5712.8). Total num frames: 246882304. Throughput: 0: 5913.5. Samples: 246888030. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:15,584][25689] Avg episode reward: [(0, '-46.908')] [2022-07-09 12:05:16,581][26022] Updated weights on worker 0-0, policy_version 241102 (0.00097) [2022-07-09 12:05:18,200][26022] Updated weights on worker 0-0, policy_version 241112 (0.00084) [2022-07-09 12:05:20,109][26022] Updated weights on worker 0-0, policy_version 241122 (0.00084) [2022-07-09 12:05:20,601][25689] Fps is (10 sec: 5716.2, 60 sec: 5727.7, 300 sec: 5714.2). Total num frames: 246912000. Throughput: 0: 5041.2. Samples: 246905242. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:20,602][25689] Avg episode reward: [(0, '-47.735')] [2022-07-09 12:05:21,582][26022] Updated weights on worker 0-0, policy_version 241132 (0.00060) [2022-07-09 12:05:23,550][26022] Updated weights on worker 0-0, policy_version 241142 (0.00079) [2022-07-09 12:05:25,443][26022] Updated weights on worker 0-0, policy_version 241152 (0.00084) [2022-07-09 12:05:25,649][25689] Fps is (10 sec: 5798.4, 60 sec: 5728.8, 300 sec: 5711.0). Total num frames: 246940672. Throughput: 0: 6032.6. Samples: 246940302. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:25,649][25689] Avg episode reward: [(0, '-48.026')] [2022-07-09 12:05:27,059][26022] Updated weights on worker 0-0, policy_version 241162 (0.00094) [2022-07-09 12:05:28,899][26022] Updated weights on worker 0-0, policy_version 241172 (0.00087) [2022-07-09 12:05:30,660][25689] Fps is (10 sec: 5700.6, 60 sec: 5734.9, 300 sec: 5707.8). Total num frames: 246969344. Throughput: 0: 6057.2. Samples: 246975198. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:30,661][25689] Avg episode reward: [(0, '-48.113')] [2022-07-09 12:05:30,690][26022] Updated weights on worker 0-0, policy_version 241182 (0.00094) [2022-07-09 12:05:32,174][26022] Updated weights on worker 0-0, policy_version 241192 (0.00089) [2022-07-09 12:05:34,326][26022] Updated weights on worker 0-0, policy_version 241202 (0.00097) [2022-07-09 12:05:35,682][25689] Fps is (10 sec: 5817.6, 60 sec: 5735.4, 300 sec: 5717.9). Total num frames: 246999040. Throughput: 0: 5194.2. Samples: 246992430. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:35,682][25689] Avg episode reward: [(0, '-48.327')] [2022-07-09 12:05:35,790][26022] Updated weights on worker 0-0, policy_version 241212 (0.00098) [2022-07-09 12:05:37,855][26022] Updated weights on worker 0-0, policy_version 241222 (0.00095) [2022-07-09 12:05:39,555][26022] Updated weights on worker 0-0, policy_version 241232 (0.00085) [2022-07-09 12:05:40,714][25689] Fps is (10 sec: 5703.9, 60 sec: 5718.2, 300 sec: 5711.4). Total num frames: 247026688. Throughput: 0: 6045.2. Samples: 247026828. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:40,714][25689] Avg episode reward: [(0, '-47.954')] [2022-07-09 12:05:41,400][26022] Updated weights on worker 0-0, policy_version 241242 (0.00092) [2022-07-09 12:05:43,187][26022] Updated weights on worker 0-0, policy_version 241252 (0.00055) [2022-07-09 12:05:44,931][26022] Updated weights on worker 0-0, policy_version 241262 (0.00085) [2022-07-09 12:05:45,777][25689] Fps is (10 sec: 5680.2, 60 sec: 5732.7, 300 sec: 5714.1). Total num frames: 247056384. Throughput: 0: 6012.0. Samples: 247061314. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:45,778][25689] Avg episode reward: [(0, '-48.984')] [2022-07-09 12:05:46,798][26022] Updated weights on worker 0-0, policy_version 241272 (0.00083) [2022-07-09 12:05:48,681][26022] Updated weights on worker 0-0, policy_version 241282 (0.00086) [2022-07-09 12:05:50,259][26022] Updated weights on worker 0-0, policy_version 241292 (0.00088) [2022-07-09 12:05:50,790][25689] Fps is (10 sec: 5894.2, 60 sec: 5748.8, 300 sec: 5717.4). Total num frames: 247086080. Throughput: 0: 5138.1. Samples: 247078628. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:50,791][25689] Avg episode reward: [(0, '-48.664')] [2022-07-09 12:05:52,102][26022] Updated weights on worker 0-0, policy_version 241302 (0.00109) [2022-07-09 12:05:53,855][26022] Updated weights on worker 0-0, policy_version 241312 (0.00092) [2022-07-09 12:05:55,475][26022] Updated weights on worker 0-0, policy_version 241322 (0.00089) [2022-07-09 12:05:55,820][25689] Fps is (10 sec: 5914.1, 60 sec: 5749.0, 300 sec: 5714.1). Total num frames: 247115776. Throughput: 0: 5994.1. Samples: 247113138. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:05:55,820][25689] Avg episode reward: [(0, '-49.649')] [2022-07-09 12:05:57,465][26022] Updated weights on worker 0-0, policy_version 241332 (0.00085) [2022-07-09 12:05:59,139][26022] Updated weights on worker 0-0, policy_version 241342 (0.00084) [2022-07-09 12:06:00,883][25689] Fps is (10 sec: 5479.0, 60 sec: 5693.7, 300 sec: 5710.8). Total num frames: 247141376. Throughput: 0: 5990.8. Samples: 247147656. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:00,883][25689] Avg episode reward: [(0, '-49.246')] [2022-07-09 12:06:01,440][26022] Updated weights on worker 0-0, policy_version 241352 (0.00088) [2022-07-09 12:06:03,145][26022] Updated weights on worker 0-0, policy_version 241362 (0.00083) [2022-07-09 12:06:04,970][26022] Updated weights on worker 0-0, policy_version 241372 (0.00080) [2022-07-09 12:06:05,922][25689] Fps is (10 sec: 5473.8, 60 sec: 5746.9, 300 sec: 5718.2). Total num frames: 247171072. Throughput: 0: 5881.9. Samples: 247179802. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:05,922][25689] Avg episode reward: [(0, '-48.668')] [2022-07-09 12:06:06,923][26022] Updated weights on worker 0-0, policy_version 241382 (0.00093) [2022-07-09 12:06:08,332][26022] Updated weights on worker 0-0, policy_version 241392 (0.00092) [2022-07-09 12:06:10,275][26022] Updated weights on worker 0-0, policy_version 241402 (0.00089) [2022-07-09 12:06:10,931][25689] Fps is (10 sec: 5604.9, 60 sec: 5683.1, 300 sec: 5704.8). Total num frames: 247197696. Throughput: 0: 5883.5. Samples: 247197126. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:10,932][25689] Avg episode reward: [(0, '-48.755')] [2022-07-09 12:06:11,840][26022] Updated weights on worker 0-0, policy_version 241412 (0.00086) [2022-07-09 12:06:13,912][26022] Updated weights on worker 0-0, policy_version 241422 (0.00086) [2022-07-09 12:06:15,638][26022] Updated weights on worker 0-0, policy_version 241432 (0.00453) [2022-07-09 12:06:15,939][25689] Fps is (10 sec: 5622.3, 60 sec: 5717.5, 300 sec: 5711.8). Total num frames: 247227392. Throughput: 0: 5889.2. Samples: 247231624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:15,939][25689] Avg episode reward: [(0, '-48.608')] [2022-07-09 12:06:17,324][26022] Updated weights on worker 0-0, policy_version 241442 (0.00091) [2022-07-09 12:06:19,104][26022] Updated weights on worker 0-0, policy_version 241452 (0.00090) [2022-07-09 12:06:20,942][25689] Fps is (10 sec: 5830.2, 60 sec: 5702.0, 300 sec: 5709.2). Total num frames: 247256064. Throughput: 0: 5917.1. Samples: 247266352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:20,944][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 12:06:21,005][26022] Updated weights on worker 0-0, policy_version 241462 (0.00090) [2022-07-09 12:06:22,727][26022] Updated weights on worker 0-0, policy_version 241472 (0.00117) [2022-07-09 12:06:24,641][26022] Updated weights on worker 0-0, policy_version 241482 (0.00085) [2022-07-09 12:06:26,010][25689] Fps is (10 sec: 5897.1, 60 sec: 5734.0, 300 sec: 5718.8). Total num frames: 247286784. Throughput: 0: 5172.4. Samples: 247283708. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:26,011][25689] Avg episode reward: [(0, '-48.983')] [2022-07-09 12:06:26,347][26022] Updated weights on worker 0-0, policy_version 241492 (0.00092) [2022-07-09 12:06:28,015][26022] Updated weights on worker 0-0, policy_version 241502 (0.00088) [2022-07-09 12:06:30,058][26022] Updated weights on worker 0-0, policy_version 241512 (0.00088) [2022-07-09 12:06:31,029][25689] Fps is (10 sec: 5888.3, 60 sec: 5733.3, 300 sec: 5715.1). Total num frames: 247315456. Throughput: 0: 6019.4. Samples: 247318102. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:31,029][25689] Avg episode reward: [(0, '-49.071')] [2022-07-09 12:06:31,657][26022] Updated weights on worker 0-0, policy_version 241522 (0.00084) [2022-07-09 12:06:33,451][26022] Updated weights on worker 0-0, policy_version 241532 (0.00082) [2022-07-09 12:06:35,240][26022] Updated weights on worker 0-0, policy_version 241542 (0.00086) [2022-07-09 12:06:36,102][25689] Fps is (10 sec: 5479.3, 60 sec: 5677.6, 300 sec: 5707.1). Total num frames: 247342080. Throughput: 0: 5995.9. Samples: 247352520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:36,104][25689] Avg episode reward: [(0, '-48.382')] [2022-07-09 12:06:37,060][26022] Updated weights on worker 0-0, policy_version 241552 (0.00093) [2022-07-09 12:06:38,871][26022] Updated weights on worker 0-0, policy_version 241562 (0.00093) [2022-07-09 12:06:40,719][26022] Updated weights on worker 0-0, policy_version 241572 (0.00087) [2022-07-09 12:06:41,109][25689] Fps is (10 sec: 5485.6, 60 sec: 5696.9, 300 sec: 5708.7). Total num frames: 247370752. Throughput: 0: 5118.0. Samples: 247369564. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:41,110][25689] Avg episode reward: [(0, '-47.628')] [2022-07-09 12:06:42,234][26022] Updated weights on worker 0-0, policy_version 241582 (0.00085) [2022-07-09 12:06:44,441][26022] Updated weights on worker 0-0, policy_version 241592 (0.00084) [2022-07-09 12:06:45,801][26022] Updated weights on worker 0-0, policy_version 241602 (0.00084) [2022-07-09 12:06:46,253][25689] Fps is (10 sec: 5850.6, 60 sec: 5706.2, 300 sec: 5713.4). Total num frames: 247401472. Throughput: 0: 5932.5. Samples: 247403800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:46,254][25689] Avg episode reward: [(0, '-48.074')] [2022-07-09 12:06:47,930][26022] Updated weights on worker 0-0, policy_version 241612 (0.00095) [2022-07-09 12:06:49,603][26022] Updated weights on worker 0-0, policy_version 241622 (0.00086) [2022-07-09 12:06:51,304][25689] Fps is (10 sec: 5825.6, 60 sec: 5685.7, 300 sec: 5709.7). Total num frames: 247430144. Throughput: 0: 5930.9. Samples: 247438352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:51,306][25689] Avg episode reward: [(0, '-47.906')] [2022-07-09 12:06:51,321][26022] Updated weights on worker 0-0, policy_version 241632 (0.00086) [2022-07-09 12:06:53,206][26022] Updated weights on worker 0-0, policy_version 241642 (0.00085) [2022-07-09 12:06:54,045][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:06:54,059][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000241647_247446528.pth [2022-07-09 12:06:54,060][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000239637_245388288.pth [2022-07-09 12:06:54,907][26022] Updated weights on worker 0-0, policy_version 241652 (0.00081) [2022-07-09 12:06:56,315][25689] Fps is (10 sec: 5597.6, 60 sec: 5653.6, 300 sec: 5706.1). Total num frames: 247457792. Throughput: 0: 5101.3. Samples: 247455632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:06:56,315][25689] Avg episode reward: [(0, '-48.128')] [2022-07-09 12:06:56,757][26022] Updated weights on worker 0-0, policy_version 241662 (0.00092) [2022-07-09 12:06:58,666][26022] Updated weights on worker 0-0, policy_version 241672 (0.00085) [2022-07-09 12:07:00,176][26022] Updated weights on worker 0-0, policy_version 241682 (0.00082) [2022-07-09 12:07:01,335][25689] Fps is (10 sec: 5818.6, 60 sec: 5742.3, 300 sec: 5720.1). Total num frames: 247488512. Throughput: 0: 5958.2. Samples: 247490076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:07:01,336][25689] Avg episode reward: [(0, '-48.096')] [2022-07-09 12:07:02,693][26022] Updated weights on worker 0-0, policy_version 241692 (0.00084) [2022-07-09 12:07:04,216][26022] Updated weights on worker 0-0, policy_version 241702 (0.00112) [2022-07-09 12:07:06,226][26022] Updated weights on worker 0-0, policy_version 241712 (0.00086) [2022-07-09 12:07:06,414][25689] Fps is (10 sec: 5577.0, 60 sec: 5670.8, 300 sec: 5701.5). Total num frames: 247514112. Throughput: 0: 5871.9. Samples: 247522178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 12:07:06,414][25689] Avg episode reward: [(0, '-48.264')] [2022-07-09 12:07:07,847][26022] Updated weights on worker 0-0, policy_version 241722 (0.00093) [2022-07-09 12:07:09,739][26022] Updated weights on worker 0-0, policy_version 241732 (0.00084) [2022-07-09 12:07:11,420][25689] Fps is (10 sec: 5381.6, 60 sec: 5705.0, 300 sec: 5708.7). Total num frames: 247542784. Throughput: 0: 5020.8. Samples: 247539350. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:11,422][25689] Avg episode reward: [(0, '-48.508')] [2022-07-09 12:07:11,439][26022] Updated weights on worker 0-0, policy_version 241742 (0.00080) [2022-07-09 12:07:13,528][26022] Updated weights on worker 0-0, policy_version 241752 (0.00091) [2022-07-09 12:07:14,925][26022] Updated weights on worker 0-0, policy_version 241762 (0.00084) [2022-07-09 12:07:16,441][25689] Fps is (10 sec: 5718.6, 60 sec: 5686.8, 300 sec: 5702.3). Total num frames: 247571456. Throughput: 0: 5870.0. Samples: 247573774. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:16,442][25689] Avg episode reward: [(0, '-47.678')] [2022-07-09 12:07:16,950][26022] Updated weights on worker 0-0, policy_version 241772 (0.00090) [2022-07-09 12:07:18,490][26022] Updated weights on worker 0-0, policy_version 241782 (0.00086) [2022-07-09 12:07:20,475][26022] Updated weights on worker 0-0, policy_version 241792 (0.00090) [2022-07-09 12:07:21,474][25689] Fps is (10 sec: 5805.4, 60 sec: 5701.0, 300 sec: 5709.6). Total num frames: 247601152. Throughput: 0: 5870.9. Samples: 247608308. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:21,476][25689] Avg episode reward: [(0, '-47.704')] [2022-07-09 12:07:22,228][26022] Updated weights on worker 0-0, policy_version 241802 (0.00088) [2022-07-09 12:07:23,939][26022] Updated weights on worker 0-0, policy_version 241812 (0.00090) [2022-07-09 12:07:25,800][26022] Updated weights on worker 0-0, policy_version 241822 (0.00103) [2022-07-09 12:07:26,596][25689] Fps is (10 sec: 5747.9, 60 sec: 5662.1, 300 sec: 5700.6). Total num frames: 247629824. Throughput: 0: 5124.7. Samples: 247625606. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:26,598][25689] Avg episode reward: [(0, '-48.046')] [2022-07-09 12:07:27,547][26022] Updated weights on worker 0-0, policy_version 241832 (0.00084) [2022-07-09 12:07:29,331][26022] Updated weights on worker 0-0, policy_version 241842 (0.00095) [2022-07-09 12:07:31,255][26022] Updated weights on worker 0-0, policy_version 241852 (0.00087) [2022-07-09 12:07:31,620][25689] Fps is (10 sec: 5752.9, 60 sec: 5678.5, 300 sec: 5707.6). Total num frames: 247659520. Throughput: 0: 5978.2. Samples: 247660110. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:31,621][25689] Avg episode reward: [(0, '-47.603')] [2022-07-09 12:07:32,892][26022] Updated weights on worker 0-0, policy_version 241862 (0.00086) [2022-07-09 12:07:34,650][26022] Updated weights on worker 0-0, policy_version 241872 (0.00090) [2022-07-09 12:07:36,464][26022] Updated weights on worker 0-0, policy_version 241882 (0.00091) [2022-07-09 12:07:36,622][25689] Fps is (10 sec: 5719.4, 60 sec: 5702.0, 300 sec: 5701.0). Total num frames: 247687168. Throughput: 0: 5991.5. Samples: 247694688. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:36,623][25689] Avg episode reward: [(0, '-48.251')] [2022-07-09 12:07:37,978][26022] Updated weights on worker 0-0, policy_version 241892 (0.00092) [2022-07-09 12:07:40,119][26022] Updated weights on worker 0-0, policy_version 241902 (0.00087) [2022-07-09 12:07:41,631][25689] Fps is (10 sec: 5728.3, 60 sec: 5718.8, 300 sec: 5705.4). Total num frames: 247716864. Throughput: 0: 5140.1. Samples: 247711916. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:41,631][25689] Avg episode reward: [(0, '-49.232')] [2022-07-09 12:07:41,766][26022] Updated weights on worker 0-0, policy_version 241912 (0.00086) [2022-07-09 12:07:43,673][26022] Updated weights on worker 0-0, policy_version 241922 (0.00090) [2022-07-09 12:07:45,239][26022] Updated weights on worker 0-0, policy_version 241932 (0.00088) [2022-07-09 12:07:46,749][25689] Fps is (10 sec: 5763.5, 60 sec: 5687.4, 300 sec: 5696.7). Total num frames: 247745536. Throughput: 0: 5977.0. Samples: 247746062. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:46,750][25689] Avg episode reward: [(0, '-49.125')] [2022-07-09 12:07:47,256][26022] Updated weights on worker 0-0, policy_version 241942 (0.00086) [2022-07-09 12:07:48,807][26022] Updated weights on worker 0-0, policy_version 241952 (0.00091) [2022-07-09 12:07:50,865][26022] Updated weights on worker 0-0, policy_version 241962 (0.00084) [2022-07-09 12:07:51,783][25689] Fps is (10 sec: 5749.3, 60 sec: 5705.9, 300 sec: 5703.5). Total num frames: 247775232. Throughput: 0: 5980.0. Samples: 247780684. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:51,783][25689] Avg episode reward: [(0, '-48.665')] [2022-07-09 12:07:52,507][26022] Updated weights on worker 0-0, policy_version 241972 (0.00087) [2022-07-09 12:07:54,281][26022] Updated weights on worker 0-0, policy_version 241982 (0.00090) [2022-07-09 12:07:55,970][26022] Updated weights on worker 0-0, policy_version 241992 (0.00089) [2022-07-09 12:07:56,819][25689] Fps is (10 sec: 5694.8, 60 sec: 5703.6, 300 sec: 5703.3). Total num frames: 247802880. Throughput: 0: 5098.8. Samples: 247797666. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:07:56,819][25689] Avg episode reward: [(0, '-48.214')] [2022-07-09 12:07:57,830][26022] Updated weights on worker 0-0, policy_version 242002 (0.00092) [2022-07-09 12:07:59,507][26022] Updated weights on worker 0-0, policy_version 242012 (0.00087) [2022-07-09 12:08:01,790][26022] Updated weights on worker 0-0, policy_version 242022 (0.00089) [2022-07-09 12:08:01,829][25689] Fps is (10 sec: 5504.4, 60 sec: 5653.8, 300 sec: 5700.5). Total num frames: 247830528. Throughput: 0: 5937.3. Samples: 247831838. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:01,829][25689] Avg episode reward: [(0, '-48.345')] [2022-07-09 12:08:03,503][26022] Updated weights on worker 0-0, policy_version 242032 (0.00079) [2022-07-09 12:08:05,655][26022] Updated weights on worker 0-0, policy_version 242042 (0.00089) [2022-07-09 12:08:06,931][25689] Fps is (10 sec: 5569.4, 60 sec: 5702.3, 300 sec: 5698.9). Total num frames: 247859200. Throughput: 0: 5833.2. Samples: 247863786. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:06,935][25689] Avg episode reward: [(0, '-48.039')] [2022-07-09 12:08:07,215][26022] Updated weights on worker 0-0, policy_version 242052 (0.00082) [2022-07-09 12:08:09,151][26022] Updated weights on worker 0-0, policy_version 242062 (0.00086) [2022-07-09 12:08:10,702][26022] Updated weights on worker 0-0, policy_version 242072 (0.00090) [2022-07-09 12:08:11,999][25689] Fps is (10 sec: 5537.9, 60 sec: 5679.6, 300 sec: 5694.4). Total num frames: 247886848. Throughput: 0: 4954.9. Samples: 247880850. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:11,999][25689] Avg episode reward: [(0, '-48.535')] [2022-07-09 12:08:12,852][26022] Updated weights on worker 0-0, policy_version 242082 (0.00079) [2022-07-09 12:08:14,437][26022] Updated weights on worker 0-0, policy_version 242092 (0.00092) [2022-07-09 12:08:16,343][26022] Updated weights on worker 0-0, policy_version 242102 (0.00082) [2022-07-09 12:08:17,067][25689] Fps is (10 sec: 5657.2, 60 sec: 5692.1, 300 sec: 5693.2). Total num frames: 247916544. Throughput: 0: 5806.5. Samples: 247915240. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:17,068][25689] Avg episode reward: [(0, '-48.415')] [2022-07-09 12:08:18,072][26022] Updated weights on worker 0-0, policy_version 242112 (0.00092) [2022-07-09 12:08:19,910][26022] Updated weights on worker 0-0, policy_version 242122 (0.00087) [2022-07-09 12:08:21,630][26022] Updated weights on worker 0-0, policy_version 242132 (0.00095) [2022-07-09 12:08:22,095][25689] Fps is (10 sec: 5781.0, 60 sec: 5675.6, 300 sec: 5697.2). Total num frames: 247945216. Throughput: 0: 5818.3. Samples: 247949754. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:22,096][25689] Avg episode reward: [(0, '-49.015')] [2022-07-09 12:08:23,563][26022] Updated weights on worker 0-0, policy_version 242142 (0.00084) [2022-07-09 12:08:25,124][26022] Updated weights on worker 0-0, policy_version 242152 (0.00085) [2022-07-09 12:08:27,108][26022] Updated weights on worker 0-0, policy_version 242162 (0.00105) [2022-07-09 12:08:27,147][25689] Fps is (10 sec: 5688.9, 60 sec: 5682.2, 300 sec: 5690.3). Total num frames: 247973888. Throughput: 0: 5103.3. Samples: 247966956. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:27,148][25689] Avg episode reward: [(0, '-49.980')] [2022-07-09 12:08:28,778][26022] Updated weights on worker 0-0, policy_version 242172 (0.00084) [2022-07-09 12:08:30,750][26022] Updated weights on worker 0-0, policy_version 242182 (0.00086) [2022-07-09 12:08:32,167][25689] Fps is (10 sec: 5795.0, 60 sec: 5682.6, 300 sec: 5696.8). Total num frames: 248003584. Throughput: 0: 5953.4. Samples: 248000922. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:32,168][25689] Avg episode reward: [(0, '-49.520')] [2022-07-09 12:08:32,251][26022] Updated weights on worker 0-0, policy_version 242192 (0.00095) [2022-07-09 12:08:34,282][26022] Updated weights on worker 0-0, policy_version 242202 (0.00087) [2022-07-09 12:08:36,125][26022] Updated weights on worker 0-0, policy_version 242212 (0.00084) [2022-07-09 12:08:37,171][25689] Fps is (10 sec: 5721.1, 60 sec: 5682.5, 300 sec: 5693.7). Total num frames: 248031232. Throughput: 0: 5985.7. Samples: 248035572. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:37,171][25689] Avg episode reward: [(0, '-49.786')] [2022-07-09 12:08:37,810][26022] Updated weights on worker 0-0, policy_version 242222 (0.00086) [2022-07-09 12:08:39,528][26022] Updated weights on worker 0-0, policy_version 242232 (0.00083) [2022-07-09 12:08:41,490][26022] Updated weights on worker 0-0, policy_version 242242 (0.00084) [2022-07-09 12:08:42,191][25689] Fps is (10 sec: 5618.5, 60 sec: 5664.4, 300 sec: 5699.5). Total num frames: 248059904. Throughput: 0: 5132.8. Samples: 248052904. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:42,192][25689] Avg episode reward: [(0, '-49.028')] [2022-07-09 12:08:43,094][26022] Updated weights on worker 0-0, policy_version 242252 (0.00089) [2022-07-09 12:08:45,183][26022] Updated weights on worker 0-0, policy_version 242262 (0.00084) [2022-07-09 12:08:46,540][26022] Updated weights on worker 0-0, policy_version 242272 (0.00084) [2022-07-09 12:08:47,295][25689] Fps is (10 sec: 5765.1, 60 sec: 5682.7, 300 sec: 5698.0). Total num frames: 248089600. Throughput: 0: 5977.0. Samples: 248087378. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:47,295][25689] Avg episode reward: [(0, '-49.418')] [2022-07-09 12:08:48,724][26022] Updated weights on worker 0-0, policy_version 242282 (0.00096) [2022-07-09 12:08:50,117][26022] Updated weights on worker 0-0, policy_version 242292 (0.00085) [2022-07-09 12:08:52,294][26022] Updated weights on worker 0-0, policy_version 242302 (0.00091) [2022-07-09 12:08:52,392][25689] Fps is (10 sec: 5621.8, 60 sec: 5643.0, 300 sec: 5693.3). Total num frames: 248117248. Throughput: 0: 5960.7. Samples: 248121472. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:52,392][25689] Avg episode reward: [(0, '-48.259')] [2022-07-09 12:08:53,819][26022] Updated weights on worker 0-0, policy_version 242312 (0.00104) [2022-07-09 12:08:54,168][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:08:54,182][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000242314_248129536.pth [2022-07-09 12:08:54,187][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000240309_246076416.pth [2022-07-09 12:08:55,780][26022] Updated weights on worker 0-0, policy_version 242322 (0.00086) [2022-07-09 12:08:57,427][25689] Fps is (10 sec: 5659.7, 60 sec: 5676.9, 300 sec: 5696.2). Total num frames: 248146944. Throughput: 0: 5918.5. Samples: 248155458. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:08:57,427][25689] Avg episode reward: [(0, '-48.706')] [2022-07-09 12:08:57,587][26022] Updated weights on worker 0-0, policy_version 242332 (0.00087) [2022-07-09 12:08:59,451][26022] Updated weights on worker 0-0, policy_version 242342 (0.00085) [2022-07-09 12:09:00,991][26022] Updated weights on worker 0-0, policy_version 242352 (0.00085) [2022-07-09 12:09:02,473][25689] Fps is (10 sec: 5688.3, 60 sec: 5673.5, 300 sec: 5701.1). Total num frames: 248174592. Throughput: 0: 5912.8. Samples: 248172822. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:09:02,473][25689] Avg episode reward: [(0, '-48.990')] [2022-07-09 12:09:03,495][26022] Updated weights on worker 0-0, policy_version 242362 (0.00054) [2022-07-09 12:09:04,915][26022] Updated weights on worker 0-0, policy_version 242372 (0.00088) [2022-07-09 12:09:06,976][26022] Updated weights on worker 0-0, policy_version 242382 (0.00089) [2022-07-09 12:09:07,512][25689] Fps is (10 sec: 5584.5, 60 sec: 5679.4, 300 sec: 5701.6). Total num frames: 248203264. Throughput: 0: 5839.3. Samples: 248205432. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:09:07,512][25689] Avg episode reward: [(0, '-49.350')] [2022-07-09 12:09:08,520][26022] Updated weights on worker 0-0, policy_version 242392 (0.00101) [2022-07-09 12:09:10,338][26022] Updated weights on worker 0-0, policy_version 242402 (0.00088) [2022-07-09 12:09:11,915][26022] Updated weights on worker 0-0, policy_version 242412 (0.00083) [2022-07-09 12:09:12,606][25689] Fps is (10 sec: 5658.7, 60 sec: 5693.8, 300 sec: 5700.4). Total num frames: 248231936. Throughput: 0: 5885.6. Samples: 248240448. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:09:12,607][25689] Avg episode reward: [(0, '-49.297')] [2022-07-09 12:09:13,900][26022] Updated weights on worker 0-0, policy_version 242422 (0.00318) [2022-07-09 12:09:15,697][26022] Updated weights on worker 0-0, policy_version 242432 (0.00080) [2022-07-09 12:09:17,464][26022] Updated weights on worker 0-0, policy_version 242442 (0.00085) [2022-07-09 12:09:17,636][25689] Fps is (10 sec: 5765.2, 60 sec: 5697.5, 300 sec: 5700.7). Total num frames: 248261632. Throughput: 0: 5061.0. Samples: 248257738. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:09:17,637][25689] Avg episode reward: [(0, '-49.609')] [2022-07-09 12:09:19,299][26022] Updated weights on worker 0-0, policy_version 242452 (0.00089) [2022-07-09 12:09:21,017][26022] Updated weights on worker 0-0, policy_version 242462 (0.00086) [2022-07-09 12:09:22,643][25689] Fps is (10 sec: 5815.7, 60 sec: 5699.5, 300 sec: 5701.7). Total num frames: 248290304. Throughput: 0: 5917.1. Samples: 248292170. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:09:22,644][25689] Avg episode reward: [(0, '-49.978')] [2022-07-09 12:09:22,752][26022] Updated weights on worker 0-0, policy_version 242472 (0.00090) [2022-07-09 12:09:24,700][26022] Updated weights on worker 0-0, policy_version 242482 (0.00083) [2022-07-09 12:09:26,492][26022] Updated weights on worker 0-0, policy_version 242492 (0.00089) [2022-07-09 12:09:27,685][25689] Fps is (10 sec: 5808.5, 60 sec: 5717.3, 300 sec: 5705.8). Total num frames: 248320000. Throughput: 0: 6010.5. Samples: 248326682. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 12:09:27,685][25689] Avg episode reward: [(0, '-49.713')] [2022-07-09 12:09:28,285][26022] Updated weights on worker 0-0, policy_version 242502 (0.00088) [2022-07-09 12:09:29,849][26022] Updated weights on worker 0-0, policy_version 242512 (0.00090) [2022-07-09 12:09:31,911][26022] Updated weights on worker 0-0, policy_version 242522 (0.00095) [2022-07-09 12:09:32,711][25689] Fps is (10 sec: 5594.0, 60 sec: 5666.0, 300 sec: 5695.5). Total num frames: 248346624. Throughput: 0: 5141.1. Samples: 248343806. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:09:32,711][25689] Avg episode reward: [(0, '-49.085')] [2022-07-09 12:09:33,535][26022] Updated weights on worker 0-0, policy_version 242532 (0.00086) [2022-07-09 12:09:35,547][26022] Updated weights on worker 0-0, policy_version 242542 (0.00086) [2022-07-09 12:09:37,144][26022] Updated weights on worker 0-0, policy_version 242552 (0.00086) [2022-07-09 12:09:37,715][25689] Fps is (10 sec: 5615.4, 60 sec: 5699.8, 300 sec: 5699.4). Total num frames: 248376320. Throughput: 0: 6003.4. Samples: 248378276. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:09:37,715][25689] Avg episode reward: [(0, '-49.590')] [2022-07-09 12:09:38,775][26022] Updated weights on worker 0-0, policy_version 242562 (0.00090) [2022-07-09 12:09:40,819][26022] Updated weights on worker 0-0, policy_version 242572 (0.00085) [2022-07-09 12:09:42,344][26022] Updated weights on worker 0-0, policy_version 242582 (0.00084) [2022-07-09 12:09:42,736][25689] Fps is (10 sec: 5822.1, 60 sec: 5699.7, 300 sec: 5699.7). Total num frames: 248404992. Throughput: 0: 6007.5. Samples: 248412880. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:09:42,737][25689] Avg episode reward: [(0, '-49.297')] [2022-07-09 12:09:44,259][26022] Updated weights on worker 0-0, policy_version 242592 (0.00069) [2022-07-09 12:09:46,160][26022] Updated weights on worker 0-0, policy_version 242602 (0.00391) [2022-07-09 12:09:47,721][26022] Updated weights on worker 0-0, policy_version 242612 (0.00087) [2022-07-09 12:09:47,860][25689] Fps is (10 sec: 5753.2, 60 sec: 5697.8, 300 sec: 5700.9). Total num frames: 248434688. Throughput: 0: 5114.0. Samples: 248429858. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:09:47,862][25689] Avg episode reward: [(0, '-49.273')] [2022-07-09 12:09:49,777][26022] Updated weights on worker 0-0, policy_version 242622 (0.00090) [2022-07-09 12:09:51,415][26022] Updated weights on worker 0-0, policy_version 242632 (0.00612) [2022-07-09 12:09:52,875][25689] Fps is (10 sec: 5756.9, 60 sec: 5722.4, 300 sec: 5697.7). Total num frames: 248463360. Throughput: 0: 5953.2. Samples: 248463848. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:09:52,875][25689] Avg episode reward: [(0, '-49.723')] [2022-07-09 12:09:53,204][26022] Updated weights on worker 0-0, policy_version 242642 (0.00085) [2022-07-09 12:09:55,107][26022] Updated weights on worker 0-0, policy_version 242652 (0.00097) [2022-07-09 12:09:56,731][26022] Updated weights on worker 0-0, policy_version 242662 (0.00674) [2022-07-09 12:09:57,880][25689] Fps is (10 sec: 5620.6, 60 sec: 5691.4, 300 sec: 5694.5). Total num frames: 248491008. Throughput: 0: 5966.1. Samples: 248498586. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:09:57,881][25689] Avg episode reward: [(0, '-49.496')] [2022-07-09 12:09:58,607][26022] Updated weights on worker 0-0, policy_version 242672 (0.00085) [2022-07-09 12:10:00,564][26022] Updated weights on worker 0-0, policy_version 242682 (0.00088) [2022-07-09 12:10:02,573][26022] Updated weights on worker 0-0, policy_version 242692 (0.00092) [2022-07-09 12:10:02,927][25689] Fps is (10 sec: 5399.2, 60 sec: 5674.4, 300 sec: 5694.8). Total num frames: 248517632. Throughput: 0: 5099.9. Samples: 248515850. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:02,927][25689] Avg episode reward: [(0, '-49.904')] [2022-07-09 12:10:04,471][26022] Updated weights on worker 0-0, policy_version 242702 (0.00093) [2022-07-09 12:10:06,280][26022] Updated weights on worker 0-0, policy_version 242712 (0.00090) [2022-07-09 12:10:07,939][26022] Updated weights on worker 0-0, policy_version 242722 (0.00079) [2022-07-09 12:10:08,035][25689] Fps is (10 sec: 5546.4, 60 sec: 5684.9, 300 sec: 5690.3). Total num frames: 248547328. Throughput: 0: 5853.2. Samples: 248547944. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:08,035][25689] Avg episode reward: [(0, '-49.875')] [2022-07-09 12:10:09,890][26022] Updated weights on worker 0-0, policy_version 242732 (0.00093) [2022-07-09 12:10:11,355][26022] Updated weights on worker 0-0, policy_version 242742 (0.00084) [2022-07-09 12:10:13,037][25689] Fps is (10 sec: 5671.9, 60 sec: 5676.6, 300 sec: 5690.5). Total num frames: 248574976. Throughput: 0: 5881.0. Samples: 248582420. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:13,037][25689] Avg episode reward: [(0, '-49.828')] [2022-07-09 12:10:13,387][26022] Updated weights on worker 0-0, policy_version 242752 (0.00091) [2022-07-09 12:10:15,288][26022] Updated weights on worker 0-0, policy_version 242762 (0.00094) [2022-07-09 12:10:16,892][26022] Updated weights on worker 0-0, policy_version 242772 (0.00087) [2022-07-09 12:10:18,101][25689] Fps is (10 sec: 5696.7, 60 sec: 5673.4, 300 sec: 5689.7). Total num frames: 248604672. Throughput: 0: 4994.8. Samples: 248599584. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:18,101][25689] Avg episode reward: [(0, '-49.544')] [2022-07-09 12:10:19,002][26022] Updated weights on worker 0-0, policy_version 242782 (0.00085) [2022-07-09 12:10:20,456][26022] Updated weights on worker 0-0, policy_version 242792 (0.00094) [2022-07-09 12:10:22,429][26022] Updated weights on worker 0-0, policy_version 242802 (0.00086) [2022-07-09 12:10:23,121][25689] Fps is (10 sec: 5787.8, 60 sec: 5672.1, 300 sec: 5690.2). Total num frames: 248633344. Throughput: 0: 5820.4. Samples: 248633392. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:23,123][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 12:10:24,204][26022] Updated weights on worker 0-0, policy_version 242812 (0.00084) [2022-07-09 12:10:25,863][26022] Updated weights on worker 0-0, policy_version 242822 (0.00083) [2022-07-09 12:10:27,949][26022] Updated weights on worker 0-0, policy_version 242832 (0.00089) [2022-07-09 12:10:28,217][25689] Fps is (10 sec: 5668.8, 60 sec: 5650.2, 300 sec: 5688.6). Total num frames: 248662016. Throughput: 0: 5933.6. Samples: 248667696. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:28,217][25689] Avg episode reward: [(0, '-49.527')] [2022-07-09 12:10:29,452][26022] Updated weights on worker 0-0, policy_version 242842 (0.00069) [2022-07-09 12:10:31,427][26022] Updated weights on worker 0-0, policy_version 242852 (0.00087) [2022-07-09 12:10:33,162][26022] Updated weights on worker 0-0, policy_version 242862 (0.00372) [2022-07-09 12:10:33,243][25689] Fps is (10 sec: 5665.4, 60 sec: 5684.0, 300 sec: 5685.0). Total num frames: 248690688. Throughput: 0: 5059.4. Samples: 248684652. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:33,244][25689] Avg episode reward: [(0, '-49.438')] [2022-07-09 12:10:34,961][26022] Updated weights on worker 0-0, policy_version 242872 (0.00093) [2022-07-09 12:10:36,651][26022] Updated weights on worker 0-0, policy_version 242882 (0.00089) [2022-07-09 12:10:38,259][25689] Fps is (10 sec: 5710.1, 60 sec: 5665.9, 300 sec: 5688.8). Total num frames: 248719360. Throughput: 0: 5937.2. Samples: 248719268. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:38,260][25689] Avg episode reward: [(0, '-49.678')] [2022-07-09 12:10:38,514][26022] Updated weights on worker 0-0, policy_version 242892 (0.00085) [2022-07-09 12:10:40,310][26022] Updated weights on worker 0-0, policy_version 242902 (0.00097) [2022-07-09 12:10:42,118][26022] Updated weights on worker 0-0, policy_version 242912 (0.00088) [2022-07-09 12:10:43,300][25689] Fps is (10 sec: 5803.6, 60 sec: 5681.0, 300 sec: 5689.2). Total num frames: 248749056. Throughput: 0: 5957.3. Samples: 248753604. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:43,301][25689] Avg episode reward: [(0, '-49.943')] [2022-07-09 12:10:43,731][26022] Updated weights on worker 0-0, policy_version 242922 (0.00088) [2022-07-09 12:10:45,573][26022] Updated weights on worker 0-0, policy_version 242932 (0.00083) [2022-07-09 12:10:47,588][26022] Updated weights on worker 0-0, policy_version 242942 (0.00082) [2022-07-09 12:10:48,407][25689] Fps is (10 sec: 5751.5, 60 sec: 5665.6, 300 sec: 5684.0). Total num frames: 248777728. Throughput: 0: 5109.5. Samples: 248770860. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:48,408][25689] Avg episode reward: [(0, '-50.662')] [2022-07-09 12:10:49,245][26022] Updated weights on worker 0-0, policy_version 242952 (0.00091) [2022-07-09 12:10:51,097][26022] Updated weights on worker 0-0, policy_version 242962 (0.00088) [2022-07-09 12:10:52,679][26022] Updated weights on worker 0-0, policy_version 242972 (0.00082) [2022-07-09 12:10:53,426][25689] Fps is (10 sec: 5562.0, 60 sec: 5648.4, 300 sec: 5677.3). Total num frames: 248805376. Throughput: 0: 5963.3. Samples: 248805010. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:53,427][25689] Avg episode reward: [(0, '-50.148')] [2022-07-09 12:10:54,232][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:10:54,240][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000242979_248810496.pth [2022-07-09 12:10:54,241][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000240978_246761472.pth [2022-07-09 12:10:54,695][26022] Updated weights on worker 0-0, policy_version 242982 (0.00088) [2022-07-09 12:10:56,493][26022] Updated weights on worker 0-0, policy_version 242992 (0.00084) [2022-07-09 12:10:58,088][26022] Updated weights on worker 0-0, policy_version 243002 (0.00086) [2022-07-09 12:10:58,431][25689] Fps is (10 sec: 5721.1, 60 sec: 5682.3, 300 sec: 5692.2). Total num frames: 248835072. Throughput: 0: 5959.1. Samples: 248839472. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:10:58,433][25689] Avg episode reward: [(0, '-49.491')] [2022-07-09 12:11:00,104][26022] Updated weights on worker 0-0, policy_version 243012 (0.00050) [2022-07-09 12:11:01,886][26022] Updated weights on worker 0-0, policy_version 243022 (0.00092) [2022-07-09 12:11:03,448][25689] Fps is (10 sec: 5619.6, 60 sec: 5685.0, 300 sec: 5682.3). Total num frames: 248861696. Throughput: 0: 5116.0. Samples: 248856680. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:03,449][25689] Avg episode reward: [(0, '-49.782')] [2022-07-09 12:11:04,071][26022] Updated weights on worker 0-0, policy_version 243032 (0.00089) [2022-07-09 12:11:05,736][26022] Updated weights on worker 0-0, policy_version 243042 (0.00093) [2022-07-09 12:11:07,623][26022] Updated weights on worker 0-0, policy_version 243052 (0.00086) [2022-07-09 12:11:08,560][25689] Fps is (10 sec: 5357.7, 60 sec: 5650.8, 300 sec: 5683.7). Total num frames: 248889344. Throughput: 0: 5837.9. Samples: 248888510. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:08,561][25689] Avg episode reward: [(0, '-50.300')] [2022-07-09 12:11:09,315][26022] Updated weights on worker 0-0, policy_version 243062 (0.00084) [2022-07-09 12:11:11,290][26022] Updated weights on worker 0-0, policy_version 243072 (0.00097) [2022-07-09 12:11:12,900][26022] Updated weights on worker 0-0, policy_version 243082 (0.00084) [2022-07-09 12:11:13,626][25689] Fps is (10 sec: 5634.1, 60 sec: 5678.6, 300 sec: 5682.6). Total num frames: 248919040. Throughput: 0: 5833.7. Samples: 248922850. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:13,627][25689] Avg episode reward: [(0, '-49.336')] [2022-07-09 12:11:14,985][26022] Updated weights on worker 0-0, policy_version 243092 (0.00086) [2022-07-09 12:11:16,574][26022] Updated weights on worker 0-0, policy_version 243102 (0.00085) [2022-07-09 12:11:18,463][26022] Updated weights on worker 0-0, policy_version 243112 (0.00091) [2022-07-09 12:11:18,708][25689] Fps is (10 sec: 5751.7, 60 sec: 5660.0, 300 sec: 5681.1). Total num frames: 248947712. Throughput: 0: 5806.1. Samples: 248957206. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:18,709][25689] Avg episode reward: [(0, '-49.489')] [2022-07-09 12:11:20,020][26022] Updated weights on worker 0-0, policy_version 243122 (0.00084) [2022-07-09 12:11:22,211][26022] Updated weights on worker 0-0, policy_version 243132 (0.00094) [2022-07-09 12:11:23,455][26022] Updated weights on worker 0-0, policy_version 243142 (0.00098) [2022-07-09 12:11:23,722][25689] Fps is (10 sec: 5781.5, 60 sec: 5677.6, 300 sec: 5678.7). Total num frames: 248977408. Throughput: 0: 5803.6. Samples: 248974340. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:23,723][25689] Avg episode reward: [(0, '-49.850')] [2022-07-09 12:11:25,568][26022] Updated weights on worker 0-0, policy_version 243152 (0.00088) [2022-07-09 12:11:27,222][26022] Updated weights on worker 0-0, policy_version 243162 (0.00086) [2022-07-09 12:11:28,798][25689] Fps is (10 sec: 5581.7, 60 sec: 5645.5, 300 sec: 5670.8). Total num frames: 249004032. Throughput: 0: 5934.6. Samples: 249008614. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:28,799][25689] Avg episode reward: [(0, '-48.883')] [2022-07-09 12:11:29,225][26022] Updated weights on worker 0-0, policy_version 243172 (0.00083) [2022-07-09 12:11:30,651][26022] Updated weights on worker 0-0, policy_version 243182 (0.00093) [2022-07-09 12:11:32,806][26022] Updated weights on worker 0-0, policy_version 243192 (0.00082) [2022-07-09 12:11:33,881][25689] Fps is (10 sec: 5745.6, 60 sec: 5691.0, 300 sec: 5687.8). Total num frames: 249035776. Throughput: 0: 5947.5. Samples: 249043314. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:33,883][25689] Avg episode reward: [(0, '-47.558')] [2022-07-09 12:11:34,291][26022] Updated weights on worker 0-0, policy_version 243202 (0.00086) [2022-07-09 12:11:36,248][26022] Updated weights on worker 0-0, policy_version 243212 (0.00098) [2022-07-09 12:11:37,887][26022] Updated weights on worker 0-0, policy_version 243222 (0.00079) [2022-07-09 12:11:38,904][25689] Fps is (10 sec: 5877.3, 60 sec: 5673.5, 300 sec: 5684.0). Total num frames: 249063424. Throughput: 0: 5114.5. Samples: 249060496. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:38,904][25689] Avg episode reward: [(0, '-47.499')] [2022-07-09 12:11:39,866][26022] Updated weights on worker 0-0, policy_version 243232 (0.00091) [2022-07-09 12:11:41,627][26022] Updated weights on worker 0-0, policy_version 243242 (0.00093) [2022-07-09 12:11:43,436][26022] Updated weights on worker 0-0, policy_version 243252 (0.00083) [2022-07-09 12:11:43,908][25689] Fps is (10 sec: 5719.0, 60 sec: 5676.9, 300 sec: 5683.3). Total num frames: 249093120. Throughput: 0: 5978.7. Samples: 249095024. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 12:11:43,908][25689] Avg episode reward: [(0, '-47.694')] [2022-07-09 12:11:45,213][26022] Updated weights on worker 0-0, policy_version 243262 (0.00093) [2022-07-09 12:11:47,134][26022] Updated weights on worker 0-0, policy_version 243272 (0.00086) [2022-07-09 12:11:48,731][26022] Updated weights on worker 0-0, policy_version 243282 (0.00081) [2022-07-09 12:11:49,021][25689] Fps is (10 sec: 5971.4, 60 sec: 5710.1, 300 sec: 5688.9). Total num frames: 249123840. Throughput: 0: 5972.5. Samples: 249129394. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:11:49,022][25689] Avg episode reward: [(0, '-47.365')] [2022-07-09 12:11:50,820][26022] Updated weights on worker 0-0, policy_version 243292 (0.00081) [2022-07-09 12:11:52,160][26022] Updated weights on worker 0-0, policy_version 243302 (0.00082) [2022-07-09 12:11:54,030][25689] Fps is (10 sec: 5563.7, 60 sec: 5677.2, 300 sec: 5682.1). Total num frames: 249149440. Throughput: 0: 5111.6. Samples: 249146312. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:11:54,031][25689] Avg episode reward: [(0, '-47.480')] [2022-07-09 12:11:54,426][26022] Updated weights on worker 0-0, policy_version 243312 (0.00087) [2022-07-09 12:11:55,723][26022] Updated weights on worker 0-0, policy_version 243322 (0.00093) [2022-07-09 12:11:57,792][26022] Updated weights on worker 0-0, policy_version 243332 (0.00092) [2022-07-09 12:11:59,087][25689] Fps is (10 sec: 5493.4, 60 sec: 5672.4, 300 sec: 5678.0). Total num frames: 249179136. Throughput: 0: 5976.4. Samples: 249181120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:11:59,087][25689] Avg episode reward: [(0, '-47.747')] [2022-07-09 12:11:59,298][26022] Updated weights on worker 0-0, policy_version 243342 (0.00086) [2022-07-09 12:12:01,238][26022] Updated weights on worker 0-0, policy_version 243352 (0.00082) [2022-07-09 12:12:03,461][26022] Updated weights on worker 0-0, policy_version 243362 (0.00087) [2022-07-09 12:12:04,108][25689] Fps is (10 sec: 5690.4, 60 sec: 5688.9, 300 sec: 5685.9). Total num frames: 249206784. Throughput: 0: 5861.8. Samples: 249213434. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:04,108][25689] Avg episode reward: [(0, '-48.040')] [2022-07-09 12:12:05,197][26022] Updated weights on worker 0-0, policy_version 243372 (0.00080) [2022-07-09 12:12:07,107][26022] Updated weights on worker 0-0, policy_version 243382 (0.00084) [2022-07-09 12:12:09,002][26022] Updated weights on worker 0-0, policy_version 243392 (0.00084) [2022-07-09 12:12:09,175][25689] Fps is (10 sec: 5481.3, 60 sec: 5693.1, 300 sec: 5681.3). Total num frames: 249234432. Throughput: 0: 5011.2. Samples: 249230390. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:09,175][25689] Avg episode reward: [(0, '-48.037')] [2022-07-09 12:12:10,571][26022] Updated weights on worker 0-0, policy_version 243402 (0.00091) [2022-07-09 12:12:12,621][26022] Updated weights on worker 0-0, policy_version 243412 (0.00057) [2022-07-09 12:12:14,026][26022] Updated weights on worker 0-0, policy_version 243422 (0.00083) [2022-07-09 12:12:14,211][25689] Fps is (10 sec: 5676.0, 60 sec: 5696.0, 300 sec: 5684.5). Total num frames: 249264128. Throughput: 0: 5863.8. Samples: 249264644. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:14,211][25689] Avg episode reward: [(0, '-49.541')] [2022-07-09 12:12:16,210][26022] Updated weights on worker 0-0, policy_version 243432 (0.00096) [2022-07-09 12:12:17,835][26022] Updated weights on worker 0-0, policy_version 243442 (0.00087) [2022-07-09 12:12:19,227][25689] Fps is (10 sec: 5603.0, 60 sec: 5668.3, 300 sec: 5674.5). Total num frames: 249290752. Throughput: 0: 5847.1. Samples: 249298880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:19,227][25689] Avg episode reward: [(0, '-49.811')] [2022-07-09 12:12:19,643][26022] Updated weights on worker 0-0, policy_version 243452 (0.00087) [2022-07-09 12:12:21,530][26022] Updated weights on worker 0-0, policy_version 243462 (0.00091) [2022-07-09 12:12:23,107][26022] Updated weights on worker 0-0, policy_version 243472 (0.00095) [2022-07-09 12:12:24,243][25689] Fps is (10 sec: 5613.7, 60 sec: 5668.1, 300 sec: 5680.0). Total num frames: 249320448. Throughput: 0: 5085.5. Samples: 249315832. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:24,244][25689] Avg episode reward: [(0, '-50.690')] [2022-07-09 12:12:25,154][26022] Updated weights on worker 0-0, policy_version 243482 (0.00084) [2022-07-09 12:12:26,742][26022] Updated weights on worker 0-0, policy_version 243492 (0.00079) [2022-07-09 12:12:28,708][26022] Updated weights on worker 0-0, policy_version 243502 (0.00090) [2022-07-09 12:12:29,272][25689] Fps is (10 sec: 5912.4, 60 sec: 5723.3, 300 sec: 5679.9). Total num frames: 249350144. Throughput: 0: 5968.2. Samples: 249350334. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:29,272][25689] Avg episode reward: [(0, '-49.851')] [2022-07-09 12:12:30,395][26022] Updated weights on worker 0-0, policy_version 243512 (0.00095) [2022-07-09 12:12:32,087][26022] Updated weights on worker 0-0, policy_version 243522 (0.00088) [2022-07-09 12:12:33,945][26022] Updated weights on worker 0-0, policy_version 243532 (0.00087) [2022-07-09 12:12:34,312][25689] Fps is (10 sec: 5797.1, 60 sec: 5676.6, 300 sec: 5682.6). Total num frames: 249378816. Throughput: 0: 5980.7. Samples: 249384862. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:34,312][25689] Avg episode reward: [(0, '-50.760')] [2022-07-09 12:12:35,734][26022] Updated weights on worker 0-0, policy_version 243542 (0.00094) [2022-07-09 12:12:37,475][26022] Updated weights on worker 0-0, policy_version 243552 (0.00086) [2022-07-09 12:12:39,326][25689] Fps is (10 sec: 5601.6, 60 sec: 5677.3, 300 sec: 5675.6). Total num frames: 249406464. Throughput: 0: 5125.7. Samples: 249401904. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:39,328][25689] Avg episode reward: [(0, '-50.143')] [2022-07-09 12:12:39,568][26022] Updated weights on worker 0-0, policy_version 243562 (0.00090) [2022-07-09 12:12:40,906][26022] Updated weights on worker 0-0, policy_version 243572 (0.00088) [2022-07-09 12:12:42,938][26022] Updated weights on worker 0-0, policy_version 243582 (0.00087) [2022-07-09 12:12:44,343][25689] Fps is (10 sec: 5716.3, 60 sec: 5676.1, 300 sec: 5681.0). Total num frames: 249436160. Throughput: 0: 5995.1. Samples: 249436334. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:44,344][25689] Avg episode reward: [(0, '-48.884')] [2022-07-09 12:12:44,665][26022] Updated weights on worker 0-0, policy_version 243592 (0.00084) [2022-07-09 12:12:46,508][26022] Updated weights on worker 0-0, policy_version 243602 (0.00089) [2022-07-09 12:12:48,390][26022] Updated weights on worker 0-0, policy_version 243612 (0.00096) [2022-07-09 12:12:49,382][25689] Fps is (10 sec: 5804.3, 60 sec: 5649.2, 300 sec: 5677.4). Total num frames: 249464832. Throughput: 0: 5977.7. Samples: 249470546. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:49,383][25689] Avg episode reward: [(0, '-47.721')] [2022-07-09 12:12:49,976][26022] Updated weights on worker 0-0, policy_version 243622 (0.00090) [2022-07-09 12:12:51,817][26022] Updated weights on worker 0-0, policy_version 243632 (0.00086) [2022-07-09 12:12:53,639][26022] Updated weights on worker 0-0, policy_version 243642 (0.00083) [2022-07-09 12:12:54,353][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:12:54,370][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000243646_249493504.pth [2022-07-09 12:12:54,371][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000241647_247446528.pth [2022-07-09 12:12:54,387][25689] Fps is (10 sec: 5709.6, 60 sec: 5700.6, 300 sec: 5681.5). Total num frames: 249493504. Throughput: 0: 5124.0. Samples: 249487726. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:54,388][25689] Avg episode reward: [(0, '-49.120')] [2022-07-09 12:12:55,419][26022] Updated weights on worker 0-0, policy_version 243652 (0.00086) [2022-07-09 12:12:57,283][26022] Updated weights on worker 0-0, policy_version 243662 (0.00096) [2022-07-09 12:12:58,931][26022] Updated weights on worker 0-0, policy_version 243672 (0.00087) [2022-07-09 12:12:59,400][25689] Fps is (10 sec: 5724.2, 60 sec: 5687.7, 300 sec: 5684.9). Total num frames: 249522176. Throughput: 0: 6013.6. Samples: 249522620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:12:59,400][25689] Avg episode reward: [(0, '-48.289')] [2022-07-09 12:13:00,774][26022] Updated weights on worker 0-0, policy_version 243682 (0.00081) [2022-07-09 12:13:02,857][26022] Updated weights on worker 0-0, policy_version 243692 (0.00091) [2022-07-09 12:13:04,443][25689] Fps is (10 sec: 5396.6, 60 sec: 5651.6, 300 sec: 5675.7). Total num frames: 249547776. Throughput: 0: 5895.3. Samples: 249554830. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:04,444][25689] Avg episode reward: [(0, '-48.281')] [2022-07-09 12:13:04,646][26022] Updated weights on worker 0-0, policy_version 243702 (0.00087) [2022-07-09 12:13:06,377][26022] Updated weights on worker 0-0, policy_version 243712 (0.00062) [2022-07-09 12:13:08,487][26022] Updated weights on worker 0-0, policy_version 243722 (0.00094) [2022-07-09 12:13:09,576][25689] Fps is (10 sec: 5635.3, 60 sec: 5713.3, 300 sec: 5688.2). Total num frames: 249579520. Throughput: 0: 5023.0. Samples: 249571978. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:09,576][25689] Avg episode reward: [(0, '-49.317')] [2022-07-09 12:13:09,913][26022] Updated weights on worker 0-0, policy_version 243732 (0.00094) [2022-07-09 12:13:12,049][26022] Updated weights on worker 0-0, policy_version 243742 (0.00090) [2022-07-09 12:13:13,502][26022] Updated weights on worker 0-0, policy_version 243752 (0.00081) [2022-07-09 12:13:14,599][25689] Fps is (10 sec: 5747.3, 60 sec: 5663.6, 300 sec: 5678.7). Total num frames: 249606144. Throughput: 0: 5869.1. Samples: 249606354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:14,600][25689] Avg episode reward: [(0, '-50.173')] [2022-07-09 12:13:15,584][26022] Updated weights on worker 0-0, policy_version 243762 (0.01371) [2022-07-09 12:13:17,051][26022] Updated weights on worker 0-0, policy_version 243772 (0.00079) [2022-07-09 12:13:19,124][26022] Updated weights on worker 0-0, policy_version 243782 (0.00087) [2022-07-09 12:13:19,605][25689] Fps is (10 sec: 5615.7, 60 sec: 5715.4, 300 sec: 5682.6). Total num frames: 249635840. Throughput: 0: 5851.5. Samples: 249640850. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:19,605][25689] Avg episode reward: [(0, '-49.121')] [2022-07-09 12:13:20,845][26022] Updated weights on worker 0-0, policy_version 243792 (0.00105) [2022-07-09 12:13:22,744][26022] Updated weights on worker 0-0, policy_version 243802 (0.00094) [2022-07-09 12:13:24,191][26022] Updated weights on worker 0-0, policy_version 243812 (0.00081) [2022-07-09 12:13:24,662][25689] Fps is (10 sec: 6003.9, 60 sec: 5728.6, 300 sec: 5689.4). Total num frames: 249666560. Throughput: 0: 5096.4. Samples: 249657872. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:24,662][25689] Avg episode reward: [(0, '-48.681')] [2022-07-09 12:13:26,463][26022] Updated weights on worker 0-0, policy_version 243822 (0.00092) [2022-07-09 12:13:27,787][26022] Updated weights on worker 0-0, policy_version 243832 (0.00431) [2022-07-09 12:13:29,704][25689] Fps is (10 sec: 5576.6, 60 sec: 5659.5, 300 sec: 5675.2). Total num frames: 249692160. Throughput: 0: 5957.5. Samples: 249691892. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:29,705][25689] Avg episode reward: [(0, '-49.974')] [2022-07-09 12:13:29,918][26022] Updated weights on worker 0-0, policy_version 243842 (0.00097) [2022-07-09 12:13:31,750][26022] Updated weights on worker 0-0, policy_version 243852 (0.00099) [2022-07-09 12:13:33,530][26022] Updated weights on worker 0-0, policy_version 243862 (0.00096) [2022-07-09 12:13:34,717][25689] Fps is (10 sec: 5397.6, 60 sec: 5662.1, 300 sec: 5678.4). Total num frames: 249720832. Throughput: 0: 5954.9. Samples: 249726152. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:34,717][25689] Avg episode reward: [(0, '-49.927')] [2022-07-09 12:13:35,293][26022] Updated weights on worker 0-0, policy_version 243872 (0.00086) [2022-07-09 12:13:37,095][26022] Updated weights on worker 0-0, policy_version 243882 (0.00054) [2022-07-09 12:13:38,752][26022] Updated weights on worker 0-0, policy_version 243892 (0.00084) [2022-07-09 12:13:39,722][25689] Fps is (10 sec: 5724.1, 60 sec: 5679.9, 300 sec: 5678.7). Total num frames: 249749504. Throughput: 0: 5100.4. Samples: 249743456. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:39,723][25689] Avg episode reward: [(0, '-49.577')] [2022-07-09 12:13:40,984][26022] Updated weights on worker 0-0, policy_version 243902 (0.00099) [2022-07-09 12:13:42,209][26022] Updated weights on worker 0-0, policy_version 243912 (0.00089) [2022-07-09 12:13:44,503][26022] Updated weights on worker 0-0, policy_version 243922 (0.00085) [2022-07-09 12:13:44,736][25689] Fps is (10 sec: 5723.3, 60 sec: 5663.2, 300 sec: 5677.0). Total num frames: 249778176. Throughput: 0: 5965.8. Samples: 249777632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:44,737][25689] Avg episode reward: [(0, '-50.880')] [2022-07-09 12:13:46,063][26022] Updated weights on worker 0-0, policy_version 243932 (0.00099) [2022-07-09 12:13:47,855][26022] Updated weights on worker 0-0, policy_version 243942 (0.00529) [2022-07-09 12:13:49,475][26022] Updated weights on worker 0-0, policy_version 243952 (0.00085) [2022-07-09 12:13:49,796][25689] Fps is (10 sec: 5794.0, 60 sec: 5678.2, 300 sec: 5684.6). Total num frames: 249807872. Throughput: 0: 5980.4. Samples: 249812050. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:49,796][25689] Avg episode reward: [(0, '-50.957')] [2022-07-09 12:13:51,626][26022] Updated weights on worker 0-0, policy_version 243962 (0.00092) [2022-07-09 12:13:53,018][26022] Updated weights on worker 0-0, policy_version 243972 (0.00086) [2022-07-09 12:13:54,800][25689] Fps is (10 sec: 5698.1, 60 sec: 5661.3, 300 sec: 5678.3). Total num frames: 249835520. Throughput: 0: 5123.4. Samples: 249829046. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:54,800][25689] Avg episode reward: [(0, '-50.131')] [2022-07-09 12:13:55,123][26022] Updated weights on worker 0-0, policy_version 243982 (0.00085) [2022-07-09 12:13:56,770][26022] Updated weights on worker 0-0, policy_version 243992 (0.00083) [2022-07-09 12:13:58,602][26022] Updated weights on worker 0-0, policy_version 244002 (0.00090) [2022-07-09 12:13:59,836][25689] Fps is (10 sec: 5711.4, 60 sec: 5676.0, 300 sec: 5685.4). Total num frames: 249865216. Throughput: 0: 5966.0. Samples: 249863456. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:13:59,837][25689] Avg episode reward: [(0, '-49.296')] [2022-07-09 12:14:00,400][26022] Updated weights on worker 0-0, policy_version 244012 (0.00084) [2022-07-09 12:14:02,655][26022] Updated weights on worker 0-0, policy_version 244022 (0.00088) [2022-07-09 12:14:04,343][26022] Updated weights on worker 0-0, policy_version 244032 (0.00091) [2022-07-09 12:14:04,886][25689] Fps is (10 sec: 5482.1, 60 sec: 5675.4, 300 sec: 5674.8). Total num frames: 249890816. Throughput: 0: 5841.9. Samples: 249895348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:14:04,888][25689] Avg episode reward: [(0, '-48.115')] [2022-07-09 12:14:06,090][26022] Updated weights on worker 0-0, policy_version 244042 (0.00093) [2022-07-09 12:14:07,830][26022] Updated weights on worker 0-0, policy_version 244052 (0.00085) [2022-07-09 12:14:09,659][26022] Updated weights on worker 0-0, policy_version 244062 (0.00085) [2022-07-09 12:14:09,939][25689] Fps is (10 sec: 5473.4, 60 sec: 5649.0, 300 sec: 5679.1). Total num frames: 249920512. Throughput: 0: 5848.4. Samples: 249929854. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:09,939][25689] Avg episode reward: [(0, '-47.660')] [2022-07-09 12:14:11,667][26022] Updated weights on worker 0-0, policy_version 244072 (0.00094) [2022-07-09 12:14:13,272][26022] Updated weights on worker 0-0, policy_version 244082 (0.00094) [2022-07-09 12:14:14,962][25689] Fps is (10 sec: 5792.8, 60 sec: 5682.9, 300 sec: 5675.8). Total num frames: 249949184. Throughput: 0: 5856.0. Samples: 249947118. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:14,963][25689] Avg episode reward: [(0, '-47.981')] [2022-07-09 12:14:15,101][26022] Updated weights on worker 0-0, policy_version 244092 (0.00089) [2022-07-09 12:14:16,919][26022] Updated weights on worker 0-0, policy_version 244102 (0.00082) [2022-07-09 12:14:18,835][26022] Updated weights on worker 0-0, policy_version 244112 (0.00098) [2022-07-09 12:14:20,040][25689] Fps is (10 sec: 5778.6, 60 sec: 5676.2, 300 sec: 5677.9). Total num frames: 249978880. Throughput: 0: 5848.5. Samples: 249981616. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:20,040][25689] Avg episode reward: [(0, '-48.018')] [2022-07-09 12:14:20,599][26022] Updated weights on worker 0-0, policy_version 244122 (0.00082) [2022-07-09 12:14:22,223][26022] Updated weights on worker 0-0, policy_version 244132 (0.00086) [2022-07-09 12:14:24,038][26022] Updated weights on worker 0-0, policy_version 244142 (0.00087) [2022-07-09 12:14:25,069][25689] Fps is (10 sec: 5673.8, 60 sec: 5627.9, 300 sec: 5671.2). Total num frames: 250006528. Throughput: 0: 5975.9. Samples: 250015958. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:25,070][25689] Avg episode reward: [(0, '-48.544')] [2022-07-09 12:14:25,850][26022] Updated weights on worker 0-0, policy_version 244152 (0.00087) [2022-07-09 12:14:27,706][26022] Updated weights on worker 0-0, policy_version 244162 (0.00087) [2022-07-09 12:14:29,335][26022] Updated weights on worker 0-0, policy_version 244172 (0.00899) [2022-07-09 12:14:30,163][25689] Fps is (10 sec: 5664.7, 60 sec: 5690.9, 300 sec: 5680.2). Total num frames: 250036224. Throughput: 0: 5099.0. Samples: 250032974. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:30,163][25689] Avg episode reward: [(0, '-48.717')] [2022-07-09 12:14:31,356][26022] Updated weights on worker 0-0, policy_version 244182 (0.00090) [2022-07-09 12:14:32,799][26022] Updated weights on worker 0-0, policy_version 244192 (0.00082) [2022-07-09 12:14:34,923][26022] Updated weights on worker 0-0, policy_version 244202 (0.00092) [2022-07-09 12:14:35,187][25689] Fps is (10 sec: 5768.7, 60 sec: 5689.8, 300 sec: 5676.4). Total num frames: 250064896. Throughput: 0: 5953.1. Samples: 250067518. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:35,188][25689] Avg episode reward: [(0, '-49.563')] [2022-07-09 12:14:36,536][26022] Updated weights on worker 0-0, policy_version 244212 (0.00092) [2022-07-09 12:14:38,376][26022] Updated weights on worker 0-0, policy_version 244222 (0.00094) [2022-07-09 12:14:40,219][25689] Fps is (10 sec: 5600.6, 60 sec: 5670.4, 300 sec: 5672.8). Total num frames: 250092544. Throughput: 0: 5961.7. Samples: 250101918. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:40,220][25689] Avg episode reward: [(0, '-50.121')] [2022-07-09 12:14:40,352][26022] Updated weights on worker 0-0, policy_version 244232 (0.00084) [2022-07-09 12:14:41,993][26022] Updated weights on worker 0-0, policy_version 244242 (0.00090) [2022-07-09 12:14:43,767][26022] Updated weights on worker 0-0, policy_version 244252 (0.00093) [2022-07-09 12:14:45,252][25689] Fps is (10 sec: 5697.4, 60 sec: 5685.5, 300 sec: 5674.5). Total num frames: 250122240. Throughput: 0: 5111.6. Samples: 250119126. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:45,254][25689] Avg episode reward: [(0, '-50.318')] [2022-07-09 12:14:45,603][26022] Updated weights on worker 0-0, policy_version 244262 (0.00092) [2022-07-09 12:14:47,098][26022] Updated weights on worker 0-0, policy_version 244272 (0.00083) [2022-07-09 12:14:49,144][26022] Updated weights on worker 0-0, policy_version 244282 (0.00086) [2022-07-09 12:14:50,293][25689] Fps is (10 sec: 5895.4, 60 sec: 5687.3, 300 sec: 5677.4). Total num frames: 250151936. Throughput: 0: 5999.4. Samples: 250153744. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:50,295][25689] Avg episode reward: [(0, '-50.180')] [2022-07-09 12:14:50,671][26022] Updated weights on worker 0-0, policy_version 244292 (0.00086) [2022-07-09 12:14:52,626][26022] Updated weights on worker 0-0, policy_version 244302 (0.00077) [2022-07-09 12:14:54,481][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:14:54,494][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000244312_250175488.pth [2022-07-09 12:14:54,495][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000242314_248129536.pth [2022-07-09 12:14:54,497][26022] Updated weights on worker 0-0, policy_version 244312 (0.00089) [2022-07-09 12:14:55,323][25689] Fps is (10 sec: 5694.2, 60 sec: 5684.9, 300 sec: 5677.0). Total num frames: 250179584. Throughput: 0: 5978.2. Samples: 250187892. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:14:55,324][25689] Avg episode reward: [(0, '-49.882')] [2022-07-09 12:14:56,338][26022] Updated weights on worker 0-0, policy_version 244322 (0.00087) [2022-07-09 12:14:58,227][26022] Updated weights on worker 0-0, policy_version 244332 (0.00081) [2022-07-09 12:14:59,724][26022] Updated weights on worker 0-0, policy_version 244342 (0.00088) [2022-07-09 12:15:00,381][25689] Fps is (10 sec: 5582.6, 60 sec: 5665.9, 300 sec: 5683.6). Total num frames: 250208256. Throughput: 0: 5123.2. Samples: 250205216. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:00,382][25689] Avg episode reward: [(0, '-48.971')] [2022-07-09 12:15:02,138][26022] Updated weights on worker 0-0, policy_version 244352 (0.00088) [2022-07-09 12:15:03,776][26022] Updated weights on worker 0-0, policy_version 244362 (0.00090) [2022-07-09 12:15:05,409][25689] Fps is (10 sec: 5482.4, 60 sec: 5684.9, 300 sec: 5674.8). Total num frames: 250234880. Throughput: 0: 5876.1. Samples: 250237568. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:05,409][25689] Avg episode reward: [(0, '-48.605')] [2022-07-09 12:15:05,479][26022] Updated weights on worker 0-0, policy_version 244372 (0.00087) [2022-07-09 12:15:07,449][26022] Updated weights on worker 0-0, policy_version 244382 (0.00090) [2022-07-09 12:15:09,051][26022] Updated weights on worker 0-0, policy_version 244392 (0.00089) [2022-07-09 12:15:10,542][25689] Fps is (10 sec: 5442.0, 60 sec: 5660.4, 300 sec: 5675.8). Total num frames: 250263552. Throughput: 0: 5834.5. Samples: 250271888. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:10,544][25689] Avg episode reward: [(0, '-48.826')] [2022-07-09 12:15:11,106][26022] Updated weights on worker 0-0, policy_version 244402 (0.00089) [2022-07-09 12:15:12,699][26022] Updated weights on worker 0-0, policy_version 244412 (0.00092) [2022-07-09 12:15:14,544][26022] Updated weights on worker 0-0, policy_version 244422 (0.00085) [2022-07-09 12:15:15,643][25689] Fps is (10 sec: 5903.0, 60 sec: 5703.8, 300 sec: 5682.0). Total num frames: 250295296. Throughput: 0: 4985.7. Samples: 250289206. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:15,645][25689] Avg episode reward: [(0, '-46.999')] [2022-07-09 12:15:16,357][26022] Updated weights on worker 0-0, policy_version 244432 (0.00077) [2022-07-09 12:15:18,193][26022] Updated weights on worker 0-0, policy_version 244442 (0.00085) [2022-07-09 12:15:19,900][26022] Updated weights on worker 0-0, policy_version 244452 (0.00094) [2022-07-09 12:15:20,733][25689] Fps is (10 sec: 5828.3, 60 sec: 5668.9, 300 sec: 5677.2). Total num frames: 250322944. Throughput: 0: 5814.4. Samples: 250323546. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:20,733][25689] Avg episode reward: [(0, '-46.243')] [2022-07-09 12:15:21,785][26022] Updated weights on worker 0-0, policy_version 244462 (0.00088) [2022-07-09 12:15:23,517][26022] Updated weights on worker 0-0, policy_version 244472 (0.00085) [2022-07-09 12:15:25,377][26022] Updated weights on worker 0-0, policy_version 244482 (0.00082) [2022-07-09 12:15:25,747][25689] Fps is (10 sec: 5574.5, 60 sec: 5687.2, 300 sec: 5678.8). Total num frames: 250351616. Throughput: 0: 5906.4. Samples: 250357692. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:25,747][25689] Avg episode reward: [(0, '-46.930')] [2022-07-09 12:15:27,039][26022] Updated weights on worker 0-0, policy_version 244492 (0.00081) [2022-07-09 12:15:28,983][26022] Updated weights on worker 0-0, policy_version 244502 (0.00086) [2022-07-09 12:15:30,687][26022] Updated weights on worker 0-0, policy_version 244512 (0.00084) [2022-07-09 12:15:30,893][25689] Fps is (10 sec: 5644.2, 60 sec: 5665.5, 300 sec: 5676.5). Total num frames: 250380288. Throughput: 0: 5049.3. Samples: 250374642. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:30,893][25689] Avg episode reward: [(0, '-46.499')] [2022-07-09 12:15:32,628][26022] Updated weights on worker 0-0, policy_version 244522 (0.00087) [2022-07-09 12:15:34,248][26022] Updated weights on worker 0-0, policy_version 244532 (0.00087) [2022-07-09 12:15:35,926][25689] Fps is (10 sec: 5633.2, 60 sec: 5664.6, 300 sec: 5676.1). Total num frames: 250408960. Throughput: 0: 5899.8. Samples: 250408872. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:35,927][25689] Avg episode reward: [(0, '-46.055')] [2022-07-09 12:15:36,156][26022] Updated weights on worker 0-0, policy_version 244542 (0.01131) [2022-07-09 12:15:38,010][26022] Updated weights on worker 0-0, policy_version 244552 (0.00087) [2022-07-09 12:15:39,816][26022] Updated weights on worker 0-0, policy_version 244562 (0.00614) [2022-07-09 12:15:40,954][25689] Fps is (10 sec: 5801.1, 60 sec: 5698.7, 300 sec: 5676.4). Total num frames: 250438656. Throughput: 0: 5908.2. Samples: 250443018. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:40,955][25689] Avg episode reward: [(0, '-46.677')] [2022-07-09 12:15:41,772][26022] Updated weights on worker 0-0, policy_version 244572 (0.00092) [2022-07-09 12:15:43,235][26022] Updated weights on worker 0-0, policy_version 244582 (0.00521) [2022-07-09 12:15:45,413][26022] Updated weights on worker 0-0, policy_version 244592 (0.00087) [2022-07-09 12:15:45,968][25689] Fps is (10 sec: 5812.4, 60 sec: 5683.6, 300 sec: 5678.2). Total num frames: 250467328. Throughput: 0: 5067.1. Samples: 250460158. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:45,969][25689] Avg episode reward: [(0, '-48.245')] [2022-07-09 12:15:46,781][26022] Updated weights on worker 0-0, policy_version 244602 (0.00083) [2022-07-09 12:15:48,819][26022] Updated weights on worker 0-0, policy_version 244612 (0.00090) [2022-07-09 12:15:50,570][26022] Updated weights on worker 0-0, policy_version 244622 (0.00095) [2022-07-09 12:15:51,022][25689] Fps is (10 sec: 5492.5, 60 sec: 5631.9, 300 sec: 5674.1). Total num frames: 250493952. Throughput: 0: 5965.1. Samples: 250494714. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:51,022][25689] Avg episode reward: [(0, '-49.240')] [2022-07-09 12:15:52,211][26022] Updated weights on worker 0-0, policy_version 244632 (0.00085) [2022-07-09 12:15:54,247][26022] Updated weights on worker 0-0, policy_version 244642 (0.00082) [2022-07-09 12:15:55,715][26022] Updated weights on worker 0-0, policy_version 244652 (0.00090) [2022-07-09 12:15:56,046][25689] Fps is (10 sec: 5690.3, 60 sec: 5683.0, 300 sec: 5677.1). Total num frames: 250524672. Throughput: 0: 5968.9. Samples: 250528962. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:15:56,046][25689] Avg episode reward: [(0, '-49.401')] [2022-07-09 12:15:57,652][26022] Updated weights on worker 0-0, policy_version 244662 (0.00087) [2022-07-09 12:15:59,672][26022] Updated weights on worker 0-0, policy_version 244672 (0.00088) [2022-07-09 12:16:01,079][26022] Updated weights on worker 0-0, policy_version 244682 (0.00359) [2022-07-09 12:16:01,090][25689] Fps is (10 sec: 6000.3, 60 sec: 5701.2, 300 sec: 5686.9). Total num frames: 250554368. Throughput: 0: 5124.2. Samples: 250546200. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:16:01,091][25689] Avg episode reward: [(0, '-50.141')] [2022-07-09 12:16:03,552][26022] Updated weights on worker 0-0, policy_version 244692 (0.00090) [2022-07-09 12:16:05,174][26022] Updated weights on worker 0-0, policy_version 244702 (0.00088) [2022-07-09 12:16:06,113][25689] Fps is (10 sec: 5289.3, 60 sec: 5651.0, 300 sec: 5674.9). Total num frames: 250577920. Throughput: 0: 5860.1. Samples: 250578208. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:16:06,113][25689] Avg episode reward: [(0, '-50.757')] [2022-07-09 12:16:07,162][26022] Updated weights on worker 0-0, policy_version 244712 (0.00084) [2022-07-09 12:16:09,134][26022] Updated weights on worker 0-0, policy_version 244722 (0.00095) [2022-07-09 12:16:10,570][26022] Updated weights on worker 0-0, policy_version 244732 (0.00091) [2022-07-09 12:16:11,242][25689] Fps is (10 sec: 5446.8, 60 sec: 5702.0, 300 sec: 5680.6). Total num frames: 250609664. Throughput: 0: 5807.9. Samples: 250612154. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:16:11,243][25689] Avg episode reward: [(0, '-50.207')] [2022-07-09 12:16:12,466][26022] Updated weights on worker 0-0, policy_version 244742 (0.00088) [2022-07-09 12:16:14,281][26022] Updated weights on worker 0-0, policy_version 244752 (0.00088) [2022-07-09 12:16:15,985][26022] Updated weights on worker 0-0, policy_version 244762 (0.00083) [2022-07-09 12:16:16,245][25689] Fps is (10 sec: 5962.9, 60 sec: 5660.6, 300 sec: 5682.1). Total num frames: 250638336. Throughput: 0: 5816.9. Samples: 250646458. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:16:16,247][25689] Avg episode reward: [(0, '-49.841')] [2022-07-09 12:16:17,944][26022] Updated weights on worker 0-0, policy_version 244772 (0.00093) [2022-07-09 12:16:19,591][26022] Updated weights on worker 0-0, policy_version 244782 (0.00108) [2022-07-09 12:16:21,291][25689] Fps is (10 sec: 5401.2, 60 sec: 5630.8, 300 sec: 5667.7). Total num frames: 250663936. Throughput: 0: 5806.8. Samples: 250663498. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:16:21,292][25689] Avg episode reward: [(0, '-49.410')] [2022-07-09 12:16:21,499][26022] Updated weights on worker 0-0, policy_version 244792 (0.00087) [2022-07-09 12:16:23,283][26022] Updated weights on worker 0-0, policy_version 244802 (0.00087) [2022-07-09 12:16:24,998][26022] Updated weights on worker 0-0, policy_version 244812 (0.00086) [2022-07-09 12:16:26,300][25689] Fps is (10 sec: 5499.4, 60 sec: 5648.2, 300 sec: 5679.3). Total num frames: 250693632. Throughput: 0: 5921.0. Samples: 250697734. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-09 12:16:26,301][25689] Avg episode reward: [(0, '-49.159')] [2022-07-09 12:16:26,828][26022] Updated weights on worker 0-0, policy_version 244822 (0.00096) [2022-07-09 12:16:28,868][26022] Updated weights on worker 0-0, policy_version 244832 (0.00098) [2022-07-09 12:16:30,419][26022] Updated weights on worker 0-0, policy_version 244842 (0.00588) [2022-07-09 12:16:31,374][25689] Fps is (10 sec: 5890.1, 60 sec: 5671.8, 300 sec: 5672.6). Total num frames: 250723328. Throughput: 0: 5940.2. Samples: 250731740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:16:31,374][25689] Avg episode reward: [(0, '-49.739')] [2022-07-09 12:16:32,419][26022] Updated weights on worker 0-0, policy_version 244852 (0.00091) [2022-07-09 12:16:33,929][26022] Updated weights on worker 0-0, policy_version 244862 (0.00098) [2022-07-09 12:16:35,962][26022] Updated weights on worker 0-0, policy_version 244872 (0.00092) [2022-07-09 12:16:36,417][25689] Fps is (10 sec: 5769.0, 60 sec: 5670.9, 300 sec: 5675.6). Total num frames: 250752000. Throughput: 0: 5076.0. Samples: 250748852. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:16:36,418][25689] Avg episode reward: [(0, '-49.005')] [2022-07-09 12:16:37,591][26022] Updated weights on worker 0-0, policy_version 244882 (0.00087) [2022-07-09 12:16:39,498][26022] Updated weights on worker 0-0, policy_version 244892 (0.00096) [2022-07-09 12:16:41,382][26022] Updated weights on worker 0-0, policy_version 244902 (0.00084) [2022-07-09 12:16:41,464][25689] Fps is (10 sec: 5683.2, 60 sec: 5652.2, 300 sec: 5671.4). Total num frames: 250780672. Throughput: 0: 5939.7. Samples: 250783322. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:16:41,465][25689] Avg episode reward: [(0, '-48.739')] [2022-07-09 12:16:42,979][26022] Updated weights on worker 0-0, policy_version 244912 (0.00084) [2022-07-09 12:16:44,874][26022] Updated weights on worker 0-0, policy_version 244922 (0.00087) [2022-07-09 12:16:46,501][25689] Fps is (10 sec: 5686.7, 60 sec: 5650.1, 300 sec: 5665.9). Total num frames: 250809344. Throughput: 0: 5940.8. Samples: 250817746. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:16:46,502][25689] Avg episode reward: [(0, '-48.668')] [2022-07-09 12:16:46,688][26022] Updated weights on worker 0-0, policy_version 244932 (0.00089) [2022-07-09 12:16:48,326][26022] Updated weights on worker 0-0, policy_version 244942 (0.00621) [2022-07-09 12:16:50,306][26022] Updated weights on worker 0-0, policy_version 244952 (0.00087) [2022-07-09 12:16:51,608][25689] Fps is (10 sec: 5754.1, 60 sec: 5695.8, 300 sec: 5677.9). Total num frames: 250839040. Throughput: 0: 5102.5. Samples: 250834984. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:16:51,611][25689] Avg episode reward: [(0, '-48.578')] [2022-07-09 12:16:51,811][26022] Updated weights on worker 0-0, policy_version 244962 (0.00085) [2022-07-09 12:16:53,828][26022] Updated weights on worker 0-0, policy_version 244972 (0.00089) [2022-07-09 12:16:54,562][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:16:54,582][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000244977_250856448.pth [2022-07-09 12:16:54,583][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000242979_248810496.pth [2022-07-09 12:16:55,538][26022] Updated weights on worker 0-0, policy_version 244982 (0.00093) [2022-07-09 12:16:56,643][25689] Fps is (10 sec: 5654.3, 60 sec: 5644.1, 300 sec: 5671.4). Total num frames: 250866688. Throughput: 0: 5948.6. Samples: 250869164. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:16:56,643][25689] Avg episode reward: [(0, '-48.419')] [2022-07-09 12:16:57,376][26022] Updated weights on worker 0-0, policy_version 244992 (0.00087) [2022-07-09 12:16:59,367][26022] Updated weights on worker 0-0, policy_version 245002 (0.00094) [2022-07-09 12:17:00,759][26022] Updated weights on worker 0-0, policy_version 245012 (0.00097) [2022-07-09 12:17:01,654][25689] Fps is (10 sec: 5707.8, 60 sec: 5647.2, 300 sec: 5678.4). Total num frames: 250896384. Throughput: 0: 5947.3. Samples: 250903400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:01,655][25689] Avg episode reward: [(0, '-48.007')] [2022-07-09 12:17:03,209][26022] Updated weights on worker 0-0, policy_version 245022 (0.00094) [2022-07-09 12:17:04,766][26022] Updated weights on worker 0-0, policy_version 245032 (0.00095) [2022-07-09 12:17:06,675][25689] Fps is (10 sec: 5409.9, 60 sec: 5664.3, 300 sec: 5669.0). Total num frames: 250920960. Throughput: 0: 4987.8. Samples: 250918368. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:06,675][25689] Avg episode reward: [(0, '-48.260')] [2022-07-09 12:17:06,853][26022] Updated weights on worker 0-0, policy_version 245042 (0.00090) [2022-07-09 12:17:08,655][26022] Updated weights on worker 0-0, policy_version 245052 (0.00089) [2022-07-09 12:17:10,507][26022] Updated weights on worker 0-0, policy_version 245062 (0.00097) [2022-07-09 12:17:11,768][25689] Fps is (10 sec: 5568.6, 60 sec: 5667.7, 300 sec: 5674.8). Total num frames: 250952704. Throughput: 0: 5831.3. Samples: 250952546. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:11,769][25689] Avg episode reward: [(0, '-48.564')] [2022-07-09 12:17:12,048][26022] Updated weights on worker 0-0, policy_version 245072 (0.00088) [2022-07-09 12:17:14,082][26022] Updated weights on worker 0-0, policy_version 245082 (0.00085) [2022-07-09 12:17:15,723][26022] Updated weights on worker 0-0, policy_version 245092 (0.00085) [2022-07-09 12:17:16,840][25689] Fps is (10 sec: 5641.2, 60 sec: 5610.5, 300 sec: 5670.3). Total num frames: 250978304. Throughput: 0: 5836.6. Samples: 250987048. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:16,840][25689] Avg episode reward: [(0, '-48.910')] [2022-07-09 12:17:17,520][26022] Updated weights on worker 0-0, policy_version 245102 (0.00098) [2022-07-09 12:17:19,406][26022] Updated weights on worker 0-0, policy_version 245112 (0.00089) [2022-07-09 12:17:20,948][26022] Updated weights on worker 0-0, policy_version 245122 (0.00092) [2022-07-09 12:17:21,842][25689] Fps is (10 sec: 5692.3, 60 sec: 5716.0, 300 sec: 5677.4). Total num frames: 251010048. Throughput: 0: 5000.2. Samples: 251004344. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:21,843][25689] Avg episode reward: [(0, '-48.857')] [2022-07-09 12:17:22,959][26022] Updated weights on worker 0-0, policy_version 245132 (0.00087) [2022-07-09 12:17:24,617][26022] Updated weights on worker 0-0, policy_version 245142 (0.00086) [2022-07-09 12:17:26,691][26022] Updated weights on worker 0-0, policy_version 245152 (0.00099) [2022-07-09 12:17:26,895][25689] Fps is (10 sec: 5805.0, 60 sec: 5661.2, 300 sec: 5666.7). Total num frames: 251036672. Throughput: 0: 5933.6. Samples: 251038346. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:26,895][25689] Avg episode reward: [(0, '-49.080')] [2022-07-09 12:17:28,366][26022] Updated weights on worker 0-0, policy_version 245162 (0.00091) [2022-07-09 12:17:30,163][26022] Updated weights on worker 0-0, policy_version 245172 (0.00089) [2022-07-09 12:17:31,957][25689] Fps is (10 sec: 5466.7, 60 sec: 5645.4, 300 sec: 5666.2). Total num frames: 251065344. Throughput: 0: 5931.7. Samples: 251072302. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:31,959][25689] Avg episode reward: [(0, '-48.979')] [2022-07-09 12:17:31,970][26022] Updated weights on worker 0-0, policy_version 245182 (0.00086) [2022-07-09 12:17:33,680][26022] Updated weights on worker 0-0, policy_version 245192 (0.00083) [2022-07-09 12:17:35,457][26022] Updated weights on worker 0-0, policy_version 245202 (0.00082) [2022-07-09 12:17:36,982][25689] Fps is (10 sec: 5786.4, 60 sec: 5664.1, 300 sec: 5672.9). Total num frames: 251095040. Throughput: 0: 5091.1. Samples: 251089592. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:36,983][25689] Avg episode reward: [(0, '-48.952')] [2022-07-09 12:17:37,302][26022] Updated weights on worker 0-0, policy_version 245212 (0.00108) [2022-07-09 12:17:39,074][26022] Updated weights on worker 0-0, policy_version 245222 (0.00087) [2022-07-09 12:17:40,802][26022] Updated weights on worker 0-0, policy_version 245232 (0.00089) [2022-07-09 12:17:42,015][25689] Fps is (10 sec: 5701.4, 60 sec: 5648.4, 300 sec: 5665.7). Total num frames: 251122688. Throughput: 0: 5920.0. Samples: 251123768. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:42,017][25689] Avg episode reward: [(0, '-48.456')] [2022-07-09 12:17:42,676][26022] Updated weights on worker 0-0, policy_version 245242 (0.00086) [2022-07-09 12:17:44,447][26022] Updated weights on worker 0-0, policy_version 245252 (0.00085) [2022-07-09 12:17:46,440][26022] Updated weights on worker 0-0, policy_version 245262 (0.00081) [2022-07-09 12:17:47,028][25689] Fps is (10 sec: 5606.1, 60 sec: 5650.7, 300 sec: 5666.2). Total num frames: 251151360. Throughput: 0: 5946.6. Samples: 251158070. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:47,028][25689] Avg episode reward: [(0, '-48.311')] [2022-07-09 12:17:47,975][26022] Updated weights on worker 0-0, policy_version 245272 (0.00092) [2022-07-09 12:17:50,077][26022] Updated weights on worker 0-0, policy_version 245282 (0.00084) [2022-07-09 12:17:51,455][26022] Updated weights on worker 0-0, policy_version 245292 (0.00087) [2022-07-09 12:17:52,148][25689] Fps is (10 sec: 5760.1, 60 sec: 5649.4, 300 sec: 5667.5). Total num frames: 251181056. Throughput: 0: 5096.0. Samples: 251175196. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:52,150][25689] Avg episode reward: [(0, '-47.606')] [2022-07-09 12:17:53,522][26022] Updated weights on worker 0-0, policy_version 245302 (0.00085) [2022-07-09 12:17:55,082][26022] Updated weights on worker 0-0, policy_version 245312 (0.00085) [2022-07-09 12:17:57,163][25689] Fps is (10 sec: 5658.0, 60 sec: 5651.3, 300 sec: 5664.0). Total num frames: 251208704. Throughput: 0: 5961.3. Samples: 251209898. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:17:57,163][25689] Avg episode reward: [(0, '-47.287')] [2022-07-09 12:17:57,179][26022] Updated weights on worker 0-0, policy_version 245322 (0.00084) [2022-07-09 12:17:58,731][26022] Updated weights on worker 0-0, policy_version 245332 (0.00085) [2022-07-09 12:18:00,692][26022] Updated weights on worker 0-0, policy_version 245342 (0.00094) [2022-07-09 12:18:02,186][25689] Fps is (10 sec: 5509.1, 60 sec: 5616.4, 300 sec: 5671.3). Total num frames: 251236352. Throughput: 0: 5967.6. Samples: 251244136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:02,186][25689] Avg episode reward: [(0, '-47.970')] [2022-07-09 12:18:02,651][26022] Updated weights on worker 0-0, policy_version 245352 (0.00087) [2022-07-09 12:18:04,653][26022] Updated weights on worker 0-0, policy_version 245362 (0.00091) [2022-07-09 12:18:06,243][26022] Updated weights on worker 0-0, policy_version 245372 (0.00093) [2022-07-09 12:18:07,241][25689] Fps is (10 sec: 5588.5, 60 sec: 5680.8, 300 sec: 5662.4). Total num frames: 251265024. Throughput: 0: 4998.6. Samples: 251259106. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:07,241][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 12:18:08,224][26022] Updated weights on worker 0-0, policy_version 245382 (0.00101) [2022-07-09 12:18:10,007][26022] Updated weights on worker 0-0, policy_version 245392 (0.00090) [2022-07-09 12:18:11,851][26022] Updated weights on worker 0-0, policy_version 245402 (0.00092) [2022-07-09 12:18:12,284][25689] Fps is (10 sec: 5678.8, 60 sec: 5634.8, 300 sec: 5668.9). Total num frames: 251293696. Throughput: 0: 5860.4. Samples: 251293196. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:12,284][25689] Avg episode reward: [(0, '-47.440')] [2022-07-09 12:18:13,513][26022] Updated weights on worker 0-0, policy_version 245412 (0.00083) [2022-07-09 12:18:15,496][26022] Updated weights on worker 0-0, policy_version 245422 (0.00082) [2022-07-09 12:18:17,099][26022] Updated weights on worker 0-0, policy_version 245432 (0.00079) [2022-07-09 12:18:17,368][25689] Fps is (10 sec: 5864.7, 60 sec: 5718.2, 300 sec: 5670.9). Total num frames: 251324416. Throughput: 0: 5834.4. Samples: 251327782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:17,368][25689] Avg episode reward: [(0, '-48.071')] [2022-07-09 12:18:19,054][26022] Updated weights on worker 0-0, policy_version 245442 (0.00086) [2022-07-09 12:18:20,568][26022] Updated weights on worker 0-0, policy_version 245452 (0.00088) [2022-07-09 12:18:22,410][25689] Fps is (10 sec: 5663.0, 60 sec: 5630.0, 300 sec: 5657.4). Total num frames: 251351040. Throughput: 0: 4996.0. Samples: 251345176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:22,410][25689] Avg episode reward: [(0, '-48.198')] [2022-07-09 12:18:22,595][26022] Updated weights on worker 0-0, policy_version 245462 (0.00091) [2022-07-09 12:18:24,239][26022] Updated weights on worker 0-0, policy_version 245472 (0.00095) [2022-07-09 12:18:26,268][26022] Updated weights on worker 0-0, policy_version 245482 (0.00088) [2022-07-09 12:18:27,495][25689] Fps is (10 sec: 5460.2, 60 sec: 5660.7, 300 sec: 5666.9). Total num frames: 251379712. Throughput: 0: 5947.7. Samples: 251379572. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:27,495][25689] Avg episode reward: [(0, '-47.818')] [2022-07-09 12:18:27,976][26022] Updated weights on worker 0-0, policy_version 245492 (0.00087) [2022-07-09 12:18:29,859][26022] Updated weights on worker 0-0, policy_version 245502 (0.00092) [2022-07-09 12:18:31,494][26022] Updated weights on worker 0-0, policy_version 245512 (0.00089) [2022-07-09 12:18:32,627][25689] Fps is (10 sec: 5712.6, 60 sec: 5671.1, 300 sec: 5668.1). Total num frames: 251409408. Throughput: 0: 5890.8. Samples: 251413036. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:32,628][25689] Avg episode reward: [(0, '-47.722')] [2022-07-09 12:18:33,627][26022] Updated weights on worker 0-0, policy_version 245522 (0.00088) [2022-07-09 12:18:35,087][26022] Updated weights on worker 0-0, policy_version 245532 (0.00094) [2022-07-09 12:18:37,166][26022] Updated weights on worker 0-0, policy_version 245542 (0.00088) [2022-07-09 12:18:37,655][25689] Fps is (10 sec: 5744.6, 60 sec: 5653.9, 300 sec: 5667.6). Total num frames: 251438080. Throughput: 0: 5893.9. Samples: 251447354. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:37,656][25689] Avg episode reward: [(0, '-47.321')] [2022-07-09 12:18:38,560][26022] Updated weights on worker 0-0, policy_version 245552 (0.00085) [2022-07-09 12:18:40,687][26022] Updated weights on worker 0-0, policy_version 245562 (0.00099) [2022-07-09 12:18:42,312][26022] Updated weights on worker 0-0, policy_version 245572 (0.00089) [2022-07-09 12:18:42,685][25689] Fps is (10 sec: 5803.4, 60 sec: 5688.0, 300 sec: 5670.8). Total num frames: 251467776. Throughput: 0: 5895.1. Samples: 251464698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:18:42,685][25689] Avg episode reward: [(0, '-47.057')] [2022-07-09 12:18:44,248][26022] Updated weights on worker 0-0, policy_version 245582 (0.00082) [2022-07-09 12:18:45,867][26022] Updated weights on worker 0-0, policy_version 245592 (0.00087) [2022-07-09 12:18:47,677][26022] Updated weights on worker 0-0, policy_version 245602 (0.00093) [2022-07-09 12:18:47,779][25689] Fps is (10 sec: 5765.6, 60 sec: 5680.4, 300 sec: 5666.7). Total num frames: 251496448. Throughput: 0: 5894.8. Samples: 251499142. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:18:47,780][25689] Avg episode reward: [(0, '-47.768')] [2022-07-09 12:18:49,560][26022] Updated weights on worker 0-0, policy_version 245612 (0.00088) [2022-07-09 12:18:51,484][26022] Updated weights on worker 0-0, policy_version 245622 (0.00088) [2022-07-09 12:18:52,893][25689] Fps is (10 sec: 5617.5, 60 sec: 5664.1, 300 sec: 5668.0). Total num frames: 251525120. Throughput: 0: 5933.6. Samples: 251533284. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:18:52,893][25689] Avg episode reward: [(0, '-48.446')] [2022-07-09 12:18:53,118][26022] Updated weights on worker 0-0, policy_version 245632 (0.00091) [2022-07-09 12:18:54,726][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:18:54,755][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000245640_251535360.pth [2022-07-09 12:18:54,755][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000243646_249493504.pth [2022-07-09 12:18:55,009][26022] Updated weights on worker 0-0, policy_version 245642 (0.00089) [2022-07-09 12:18:56,766][26022] Updated weights on worker 0-0, policy_version 245652 (0.00094) [2022-07-09 12:18:57,894][25689] Fps is (10 sec: 5668.9, 60 sec: 5682.2, 300 sec: 5665.3). Total num frames: 251553792. Throughput: 0: 5083.1. Samples: 251550230. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:18:57,896][25689] Avg episode reward: [(0, '-47.921')] [2022-07-09 12:18:58,661][26022] Updated weights on worker 0-0, policy_version 245662 (0.00084) [2022-07-09 12:19:00,283][26022] Updated weights on worker 0-0, policy_version 245672 (0.00091) [2022-07-09 12:19:02,522][26022] Updated weights on worker 0-0, policy_version 245682 (0.00082) [2022-07-09 12:19:02,979][25689] Fps is (10 sec: 5482.3, 60 sec: 5659.6, 300 sec: 5668.0). Total num frames: 251580416. Throughput: 0: 5851.5. Samples: 251583450. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:02,979][25689] Avg episode reward: [(0, '-47.769')] [2022-07-09 12:19:04,374][26022] Updated weights on worker 0-0, policy_version 245692 (0.00086) [2022-07-09 12:19:06,165][26022] Updated weights on worker 0-0, policy_version 245702 (0.00093) [2022-07-09 12:19:07,975][26022] Updated weights on worker 0-0, policy_version 245712 (0.00088) [2022-07-09 12:19:08,045][25689] Fps is (10 sec: 5447.3, 60 sec: 5658.5, 300 sec: 5664.3). Total num frames: 251609088. Throughput: 0: 5801.0. Samples: 251616708. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:08,047][25689] Avg episode reward: [(0, '-49.821')] [2022-07-09 12:19:09,615][26022] Updated weights on worker 0-0, policy_version 245722 (0.00082) [2022-07-09 12:19:11,472][26022] Updated weights on worker 0-0, policy_version 245732 (0.00091) [2022-07-09 12:19:13,123][25689] Fps is (10 sec: 5652.9, 60 sec: 5655.3, 300 sec: 5663.3). Total num frames: 251637760. Throughput: 0: 4970.8. Samples: 251633846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:13,124][25689] Avg episode reward: [(0, '-50.256')] [2022-07-09 12:19:13,344][26022] Updated weights on worker 0-0, policy_version 245742 (0.00056) [2022-07-09 12:19:14,953][26022] Updated weights on worker 0-0, policy_version 245752 (0.00080) [2022-07-09 12:19:16,967][26022] Updated weights on worker 0-0, policy_version 245762 (0.00093) [2022-07-09 12:19:18,130][25689] Fps is (10 sec: 5788.0, 60 sec: 5645.6, 300 sec: 5664.7). Total num frames: 251667456. Throughput: 0: 5841.5. Samples: 251668436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:18,130][25689] Avg episode reward: [(0, '-49.792')] [2022-07-09 12:19:18,563][26022] Updated weights on worker 0-0, policy_version 245772 (0.00082) [2022-07-09 12:19:20,476][26022] Updated weights on worker 0-0, policy_version 245782 (0.00087) [2022-07-09 12:19:22,294][26022] Updated weights on worker 0-0, policy_version 245792 (0.00088) [2022-07-09 12:19:23,146][25689] Fps is (10 sec: 5619.4, 60 sec: 5648.1, 300 sec: 5661.5). Total num frames: 251694080. Throughput: 0: 5924.6. Samples: 251702930. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:23,146][25689] Avg episode reward: [(0, '-49.265')] [2022-07-09 12:19:23,854][26022] Updated weights on worker 0-0, policy_version 245802 (0.00090) [2022-07-09 12:19:26,002][26022] Updated weights on worker 0-0, policy_version 245812 (0.00091) [2022-07-09 12:19:27,539][26022] Updated weights on worker 0-0, policy_version 245822 (0.00083) [2022-07-09 12:19:28,151][25689] Fps is (10 sec: 5722.4, 60 sec: 5689.3, 300 sec: 5666.6). Total num frames: 251724800. Throughput: 0: 5139.1. Samples: 251720032. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:28,151][25689] Avg episode reward: [(0, '-50.113')] [2022-07-09 12:19:29,520][26022] Updated weights on worker 0-0, policy_version 245832 (0.00094) [2022-07-09 12:19:31,198][26022] Updated weights on worker 0-0, policy_version 245842 (0.00089) [2022-07-09 12:19:33,025][26022] Updated weights on worker 0-0, policy_version 245852 (0.00085) [2022-07-09 12:19:33,252][25689] Fps is (10 sec: 5876.9, 60 sec: 5675.3, 300 sec: 5665.1). Total num frames: 251753472. Throughput: 0: 5965.7. Samples: 251753926. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:33,252][25689] Avg episode reward: [(0, '-50.292')] [2022-07-09 12:19:34,888][26022] Updated weights on worker 0-0, policy_version 245862 (0.00088) [2022-07-09 12:19:36,723][26022] Updated weights on worker 0-0, policy_version 245872 (0.00088) [2022-07-09 12:19:38,269][25689] Fps is (10 sec: 5667.5, 60 sec: 5676.4, 300 sec: 5668.8). Total num frames: 251782144. Throughput: 0: 5945.2. Samples: 251788166. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:38,269][25689] Avg episode reward: [(0, '-49.668')] [2022-07-09 12:19:38,493][26022] Updated weights on worker 0-0, policy_version 245882 (0.00090) [2022-07-09 12:19:40,279][26022] Updated weights on worker 0-0, policy_version 245892 (0.00092) [2022-07-09 12:19:42,025][26022] Updated weights on worker 0-0, policy_version 245902 (0.00081) [2022-07-09 12:19:43,326][25689] Fps is (10 sec: 5590.4, 60 sec: 5640.0, 300 sec: 5661.5). Total num frames: 251809792. Throughput: 0: 5065.9. Samples: 251805162. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:43,327][25689] Avg episode reward: [(0, '-49.462')] [2022-07-09 12:19:43,841][26022] Updated weights on worker 0-0, policy_version 245912 (0.00086) [2022-07-09 12:19:45,478][26022] Updated weights on worker 0-0, policy_version 245922 (0.00086) [2022-07-09 12:19:47,479][26022] Updated weights on worker 0-0, policy_version 245932 (0.00088) [2022-07-09 12:19:48,407][25689] Fps is (10 sec: 5656.2, 60 sec: 5658.1, 300 sec: 5660.8). Total num frames: 251839488. Throughput: 0: 5909.4. Samples: 251839734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:48,407][25689] Avg episode reward: [(0, '-49.411')] [2022-07-09 12:19:49,204][26022] Updated weights on worker 0-0, policy_version 245942 (0.00086) [2022-07-09 12:19:51,137][26022] Updated weights on worker 0-0, policy_version 245952 (0.00097) [2022-07-09 12:19:52,833][26022] Updated weights on worker 0-0, policy_version 245962 (0.00090) [2022-07-09 12:19:53,452][25689] Fps is (10 sec: 5763.9, 60 sec: 5664.5, 300 sec: 5663.9). Total num frames: 251868160. Throughput: 0: 5938.6. Samples: 251873890. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:53,453][25689] Avg episode reward: [(0, '-50.337')] [2022-07-09 12:19:54,690][26022] Updated weights on worker 0-0, policy_version 245972 (0.00094) [2022-07-09 12:19:56,464][26022] Updated weights on worker 0-0, policy_version 245982 (0.00093) [2022-07-09 12:19:58,267][26022] Updated weights on worker 0-0, policy_version 245992 (0.00090) [2022-07-09 12:19:58,474][25689] Fps is (10 sec: 5696.2, 60 sec: 5662.6, 300 sec: 5664.6). Total num frames: 251896832. Throughput: 0: 5079.6. Samples: 251890800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:19:58,474][25689] Avg episode reward: [(0, '-49.534')] [2022-07-09 12:20:00,094][26022] Updated weights on worker 0-0, policy_version 246002 (0.00088) [2022-07-09 12:20:02,219][26022] Updated weights on worker 0-0, policy_version 246012 (0.00088) [2022-07-09 12:20:03,480][25689] Fps is (10 sec: 5412.4, 60 sec: 5653.1, 300 sec: 5661.6). Total num frames: 251922432. Throughput: 0: 5837.9. Samples: 251922818. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:03,480][25689] Avg episode reward: [(0, '-49.136')] [2022-07-09 12:20:03,879][26022] Updated weights on worker 0-0, policy_version 246022 (0.00110) [2022-07-09 12:20:05,978][26022] Updated weights on worker 0-0, policy_version 246032 (0.00092) [2022-07-09 12:20:07,567][26022] Updated weights on worker 0-0, policy_version 246042 (0.00083) [2022-07-09 12:20:08,488][25689] Fps is (10 sec: 5522.0, 60 sec: 5675.5, 300 sec: 5667.4). Total num frames: 251952128. Throughput: 0: 5827.6. Samples: 251956756. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:08,488][25689] Avg episode reward: [(0, '-49.459')] [2022-07-09 12:20:09,579][26022] Updated weights on worker 0-0, policy_version 246052 (0.00090) [2022-07-09 12:20:11,187][26022] Updated weights on worker 0-0, policy_version 246062 (0.00087) [2022-07-09 12:20:13,067][26022] Updated weights on worker 0-0, policy_version 246072 (0.00106) [2022-07-09 12:20:13,563][25689] Fps is (10 sec: 5788.4, 60 sec: 5675.7, 300 sec: 5657.6). Total num frames: 251980800. Throughput: 0: 4976.1. Samples: 251973964. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:13,564][25689] Avg episode reward: [(0, '-49.157')] [2022-07-09 12:20:14,538][26022] Updated weights on worker 0-0, policy_version 246082 (0.00087) [2022-07-09 12:20:16,672][26022] Updated weights on worker 0-0, policy_version 246092 (0.00088) [2022-07-09 12:20:18,196][26022] Updated weights on worker 0-0, policy_version 246102 (0.00094) [2022-07-09 12:20:18,595][25689] Fps is (10 sec: 5673.6, 60 sec: 5656.4, 300 sec: 5662.1). Total num frames: 252009472. Throughput: 0: 5851.5. Samples: 252008536. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:18,595][25689] Avg episode reward: [(0, '-48.660')] [2022-07-09 12:20:20,247][26022] Updated weights on worker 0-0, policy_version 246112 (0.00087) [2022-07-09 12:20:21,723][26022] Updated weights on worker 0-0, policy_version 246122 (0.00086) [2022-07-09 12:20:23,626][25689] Fps is (10 sec: 5596.8, 60 sec: 5671.9, 300 sec: 5658.3). Total num frames: 252037120. Throughput: 0: 5967.4. Samples: 252043040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:23,627][25689] Avg episode reward: [(0, '-48.406')] [2022-07-09 12:20:23,700][26022] Updated weights on worker 0-0, policy_version 246132 (0.00092) [2022-07-09 12:20:25,554][26022] Updated weights on worker 0-0, policy_version 246142 (0.00083) [2022-07-09 12:20:27,295][26022] Updated weights on worker 0-0, policy_version 246152 (0.00084) [2022-07-09 12:20:28,630][25689] Fps is (10 sec: 5714.1, 60 sec: 5655.1, 300 sec: 5664.5). Total num frames: 252066816. Throughput: 0: 5121.3. Samples: 252059912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:28,631][25689] Avg episode reward: [(0, '-48.180')] [2022-07-09 12:20:29,127][26022] Updated weights on worker 0-0, policy_version 246162 (0.00065) [2022-07-09 12:20:30,890][26022] Updated weights on worker 0-0, policy_version 246172 (0.00797) [2022-07-09 12:20:32,713][26022] Updated weights on worker 0-0, policy_version 246182 (0.00088) [2022-07-09 12:20:33,751][25689] Fps is (10 sec: 5764.9, 60 sec: 5653.2, 300 sec: 5662.8). Total num frames: 252095488. Throughput: 0: 5942.1. Samples: 252093920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:33,751][25689] Avg episode reward: [(0, '-48.382')] [2022-07-09 12:20:34,565][26022] Updated weights on worker 0-0, policy_version 246192 (0.00085) [2022-07-09 12:20:36,404][26022] Updated weights on worker 0-0, policy_version 246202 (0.00092) [2022-07-09 12:20:38,097][26022] Updated weights on worker 0-0, policy_version 246212 (0.00087) [2022-07-09 12:20:38,757][25689] Fps is (10 sec: 5561.3, 60 sec: 5637.3, 300 sec: 5656.3). Total num frames: 252123136. Throughput: 0: 5937.2. Samples: 252128246. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:38,758][25689] Avg episode reward: [(0, '-49.284')] [2022-07-09 12:20:40,008][26022] Updated weights on worker 0-0, policy_version 246222 (0.00094) [2022-07-09 12:20:41,664][26022] Updated weights on worker 0-0, policy_version 246232 (0.00080) [2022-07-09 12:20:43,561][26022] Updated weights on worker 0-0, policy_version 246242 (0.00082) [2022-07-09 12:20:43,787][25689] Fps is (10 sec: 5714.0, 60 sec: 5673.8, 300 sec: 5659.5). Total num frames: 252152832. Throughput: 0: 5931.2. Samples: 252162616. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:43,787][25689] Avg episode reward: [(0, '-49.433')] [2022-07-09 12:20:45,257][26022] Updated weights on worker 0-0, policy_version 246252 (0.00086) [2022-07-09 12:20:47,142][26022] Updated weights on worker 0-0, policy_version 246262 (0.00097) [2022-07-09 12:20:48,797][25689] Fps is (10 sec: 5813.9, 60 sec: 5663.5, 300 sec: 5667.2). Total num frames: 252181504. Throughput: 0: 5953.1. Samples: 252179966. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:48,797][25689] Avg episode reward: [(0, '-50.114')] [2022-07-09 12:20:49,001][26022] Updated weights on worker 0-0, policy_version 246272 (0.00087) [2022-07-09 12:20:50,604][26022] Updated weights on worker 0-0, policy_version 246282 (0.00093) [2022-07-09 12:20:52,503][26022] Updated weights on worker 0-0, policy_version 246292 (0.00087) [2022-07-09 12:20:53,900][25689] Fps is (10 sec: 5669.8, 60 sec: 5658.0, 300 sec: 5658.8). Total num frames: 252210176. Throughput: 0: 5958.6. Samples: 252213984. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:53,901][25689] Avg episode reward: [(0, '-51.472')] [2022-07-09 12:20:54,390][26022] Updated weights on worker 0-0, policy_version 246302 (0.00094) [2022-07-09 12:20:54,882][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:20:54,895][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000246305_252216320.pth [2022-07-09 12:20:54,896][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000244312_250175488.pth [2022-07-09 12:20:55,868][26022] Updated weights on worker 0-0, policy_version 246312 (0.00068) [2022-07-09 12:20:57,939][26022] Updated weights on worker 0-0, policy_version 246322 (0.00091) [2022-07-09 12:20:58,949][25689] Fps is (10 sec: 5749.2, 60 sec: 5672.4, 300 sec: 5658.7). Total num frames: 252239872. Throughput: 0: 5951.3. Samples: 252248412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:20:58,950][25689] Avg episode reward: [(0, '-51.175')] [2022-07-09 12:20:59,398][26022] Updated weights on worker 0-0, policy_version 246332 (0.00085) [2022-07-09 12:21:01,576][26022] Updated weights on worker 0-0, policy_version 246342 (0.00090) [2022-07-09 12:21:03,578][26022] Updated weights on worker 0-0, policy_version 246352 (0.00086) [2022-07-09 12:21:04,002][25689] Fps is (10 sec: 5575.3, 60 sec: 5684.9, 300 sec: 5668.5). Total num frames: 252266496. Throughput: 0: 5098.9. Samples: 252265692. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 12:21:04,003][25689] Avg episode reward: [(0, '-50.523')] [2022-07-09 12:21:05,520][26022] Updated weights on worker 0-0, policy_version 246362 (0.00096) [2022-07-09 12:21:07,152][26022] Updated weights on worker 0-0, policy_version 246372 (0.00090) [2022-07-09 12:21:09,002][26022] Updated weights on worker 0-0, policy_version 246382 (0.00087) [2022-07-09 12:21:09,096][25689] Fps is (10 sec: 5449.7, 60 sec: 5660.0, 300 sec: 5658.8). Total num frames: 252295168. Throughput: 0: 5813.9. Samples: 252297982. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:09,096][25689] Avg episode reward: [(0, '-50.254')] [2022-07-09 12:21:10,615][26022] Updated weights on worker 0-0, policy_version 246392 (0.00082) [2022-07-09 12:21:12,641][26022] Updated weights on worker 0-0, policy_version 246402 (0.00088) [2022-07-09 12:21:14,133][25689] Fps is (10 sec: 5761.3, 60 sec: 5680.5, 300 sec: 5661.6). Total num frames: 252324864. Throughput: 0: 5838.8. Samples: 252332118. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:14,134][25689] Avg episode reward: [(0, '-49.524')] [2022-07-09 12:21:14,215][26022] Updated weights on worker 0-0, policy_version 246412 (0.00085) [2022-07-09 12:21:16,324][26022] Updated weights on worker 0-0, policy_version 246422 (0.00090) [2022-07-09 12:21:17,710][26022] Updated weights on worker 0-0, policy_version 246432 (0.00085) [2022-07-09 12:21:19,139][25689] Fps is (10 sec: 5709.7, 60 sec: 5665.9, 300 sec: 5669.2). Total num frames: 252352512. Throughput: 0: 5001.9. Samples: 252349400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:19,139][25689] Avg episode reward: [(0, '-50.079')] [2022-07-09 12:21:19,966][26022] Updated weights on worker 0-0, policy_version 246442 (0.00085) [2022-07-09 12:21:21,305][26022] Updated weights on worker 0-0, policy_version 246452 (0.00086) [2022-07-09 12:21:23,551][26022] Updated weights on worker 0-0, policy_version 246462 (0.00091) [2022-07-09 12:21:24,167][25689] Fps is (10 sec: 5613.0, 60 sec: 5683.2, 300 sec: 5665.5). Total num frames: 252381184. Throughput: 0: 5857.3. Samples: 252383804. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:24,167][25689] Avg episode reward: [(0, '-48.690')] [2022-07-09 12:21:24,969][26022] Updated weights on worker 0-0, policy_version 246472 (0.00095) [2022-07-09 12:21:26,907][26022] Updated weights on worker 0-0, policy_version 246482 (0.00080) [2022-07-09 12:21:28,795][26022] Updated weights on worker 0-0, policy_version 246492 (0.00098) [2022-07-09 12:21:29,176][25689] Fps is (10 sec: 5611.1, 60 sec: 5648.9, 300 sec: 5659.8). Total num frames: 252408832. Throughput: 0: 5963.3. Samples: 252417728. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:29,177][25689] Avg episode reward: [(0, '-48.801')] [2022-07-09 12:21:30,568][26022] Updated weights on worker 0-0, policy_version 246502 (0.00089) [2022-07-09 12:21:32,464][26022] Updated weights on worker 0-0, policy_version 246512 (0.00090) [2022-07-09 12:21:34,266][25689] Fps is (10 sec: 5678.3, 60 sec: 5668.7, 300 sec: 5662.4). Total num frames: 252438528. Throughput: 0: 5091.9. Samples: 252434630. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:34,266][25689] Avg episode reward: [(0, '-48.635')] [2022-07-09 12:21:34,272][26022] Updated weights on worker 0-0, policy_version 246522 (0.00092) [2022-07-09 12:21:36,081][26022] Updated weights on worker 0-0, policy_version 246532 (0.00089) [2022-07-09 12:21:37,693][26022] Updated weights on worker 0-0, policy_version 246542 (0.00089) [2022-07-09 12:21:39,281][25689] Fps is (10 sec: 5674.7, 60 sec: 5667.8, 300 sec: 5659.5). Total num frames: 252466176. Throughput: 0: 5921.2. Samples: 252468668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:39,282][25689] Avg episode reward: [(0, '-48.485')] [2022-07-09 12:21:39,733][26022] Updated weights on worker 0-0, policy_version 246552 (0.00089) [2022-07-09 12:21:41,483][26022] Updated weights on worker 0-0, policy_version 246562 (0.00086) [2022-07-09 12:21:43,295][26022] Updated weights on worker 0-0, policy_version 246572 (0.00077) [2022-07-09 12:21:44,291][25689] Fps is (10 sec: 5720.0, 60 sec: 5669.7, 300 sec: 5663.5). Total num frames: 252495872. Throughput: 0: 5918.9. Samples: 252502914. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:44,292][25689] Avg episode reward: [(0, '-47.954')] [2022-07-09 12:21:44,878][26022] Updated weights on worker 0-0, policy_version 246582 (0.00090) [2022-07-09 12:21:46,878][26022] Updated weights on worker 0-0, policy_version 246592 (0.00081) [2022-07-09 12:21:48,600][26022] Updated weights on worker 0-0, policy_version 246602 (0.00088) [2022-07-09 12:21:49,294][25689] Fps is (10 sec: 5727.0, 60 sec: 5653.4, 300 sec: 5658.6). Total num frames: 252523520. Throughput: 0: 5093.2. Samples: 252520194. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:49,296][25689] Avg episode reward: [(0, '-47.091')] [2022-07-09 12:21:50,461][26022] Updated weights on worker 0-0, policy_version 246612 (0.00091) [2022-07-09 12:21:52,058][26022] Updated weights on worker 0-0, policy_version 246622 (0.00082) [2022-07-09 12:21:53,969][26022] Updated weights on worker 0-0, policy_version 246632 (0.00085) [2022-07-09 12:21:54,390][25689] Fps is (10 sec: 5880.7, 60 sec: 5704.9, 300 sec: 5671.2). Total num frames: 252555264. Throughput: 0: 5975.7. Samples: 252554888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:54,391][25689] Avg episode reward: [(0, '-48.265')] [2022-07-09 12:21:55,827][26022] Updated weights on worker 0-0, policy_version 246642 (0.00097) [2022-07-09 12:21:57,498][26022] Updated weights on worker 0-0, policy_version 246652 (0.00092) [2022-07-09 12:21:59,388][26022] Updated weights on worker 0-0, policy_version 246662 (0.00099) [2022-07-09 12:21:59,456][25689] Fps is (10 sec: 5844.6, 60 sec: 5669.4, 300 sec: 5663.3). Total num frames: 252582912. Throughput: 0: 5972.1. Samples: 252589154. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:21:59,457][25689] Avg episode reward: [(0, '-49.573')] [2022-07-09 12:22:01,037][26022] Updated weights on worker 0-0, policy_version 246672 (0.00090) [2022-07-09 12:22:03,399][26022] Updated weights on worker 0-0, policy_version 246682 (0.00084) [2022-07-09 12:22:04,482][25689] Fps is (10 sec: 5276.2, 60 sec: 5655.0, 300 sec: 5666.6). Total num frames: 252608512. Throughput: 0: 5080.5. Samples: 252605496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:04,483][25689] Avg episode reward: [(0, '-50.067')] [2022-07-09 12:22:05,060][26022] Updated weights on worker 0-0, policy_version 246692 (0.00084) [2022-07-09 12:22:07,002][26022] Updated weights on worker 0-0, policy_version 246702 (0.01075) [2022-07-09 12:22:08,479][26022] Updated weights on worker 0-0, policy_version 246712 (0.00091) [2022-07-09 12:22:09,513][25689] Fps is (10 sec: 5498.4, 60 sec: 5677.9, 300 sec: 5660.9). Total num frames: 252638208. Throughput: 0: 5842.7. Samples: 252638326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:09,513][25689] Avg episode reward: [(0, '-50.513')] [2022-07-09 12:22:10,563][26022] Updated weights on worker 0-0, policy_version 246722 (0.00090) [2022-07-09 12:22:12,204][26022] Updated weights on worker 0-0, policy_version 246732 (0.00087) [2022-07-09 12:22:14,091][26022] Updated weights on worker 0-0, policy_version 246742 (0.00089) [2022-07-09 12:22:14,566][25689] Fps is (10 sec: 5890.0, 60 sec: 5676.4, 300 sec: 5675.0). Total num frames: 252667904. Throughput: 0: 5840.0. Samples: 252672712. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:14,566][25689] Avg episode reward: [(0, '-50.428')] [2022-07-09 12:22:15,831][26022] Updated weights on worker 0-0, policy_version 246752 (0.00097) [2022-07-09 12:22:17,544][26022] Updated weights on worker 0-0, policy_version 246762 (0.00090) [2022-07-09 12:22:19,434][26022] Updated weights on worker 0-0, policy_version 246772 (0.00052) [2022-07-09 12:22:19,589][25689] Fps is (10 sec: 5589.4, 60 sec: 5657.8, 300 sec: 5657.4). Total num frames: 252694528. Throughput: 0: 5017.8. Samples: 252690178. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:19,590][25689] Avg episode reward: [(0, '-50.711')] [2022-07-09 12:22:21,000][26022] Updated weights on worker 0-0, policy_version 246782 (0.00080) [2022-07-09 12:22:22,967][26022] Updated weights on worker 0-0, policy_version 246792 (0.00078) [2022-07-09 12:22:24,599][25689] Fps is (10 sec: 5613.5, 60 sec: 5676.5, 300 sec: 5668.6). Total num frames: 252724224. Throughput: 0: 5933.1. Samples: 252724850. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:24,599][25689] Avg episode reward: [(0, '-50.389')] [2022-07-09 12:22:24,680][26022] Updated weights on worker 0-0, policy_version 246802 (0.00084) [2022-07-09 12:22:26,344][26022] Updated weights on worker 0-0, policy_version 246812 (0.00064) [2022-07-09 12:22:28,396][26022] Updated weights on worker 0-0, policy_version 246822 (0.00091) [2022-07-09 12:22:29,622][25689] Fps is (10 sec: 5817.8, 60 sec: 5692.1, 300 sec: 5669.3). Total num frames: 252752896. Throughput: 0: 6007.5. Samples: 252759130. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:29,622][25689] Avg episode reward: [(0, '-49.067')] [2022-07-09 12:22:29,977][26022] Updated weights on worker 0-0, policy_version 246832 (0.00087) [2022-07-09 12:22:31,943][26022] Updated weights on worker 0-0, policy_version 246842 (0.00082) [2022-07-09 12:22:33,795][26022] Updated weights on worker 0-0, policy_version 246852 (0.00091) [2022-07-09 12:22:34,683][25689] Fps is (10 sec: 5787.7, 60 sec: 5694.7, 300 sec: 5668.6). Total num frames: 252782592. Throughput: 0: 5137.5. Samples: 252776068. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:34,684][25689] Avg episode reward: [(0, '-50.182')] [2022-07-09 12:22:35,348][26022] Updated weights on worker 0-0, policy_version 246862 (0.00091) [2022-07-09 12:22:37,540][26022] Updated weights on worker 0-0, policy_version 246872 (0.00092) [2022-07-09 12:22:38,752][26022] Updated weights on worker 0-0, policy_version 246882 (0.00081) [2022-07-09 12:22:39,702][25689] Fps is (10 sec: 5689.0, 60 sec: 5694.5, 300 sec: 5668.9). Total num frames: 252810240. Throughput: 0: 5973.5. Samples: 252810320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:39,702][25689] Avg episode reward: [(0, '-50.365')] [2022-07-09 12:22:40,909][26022] Updated weights on worker 0-0, policy_version 246892 (0.00091) [2022-07-09 12:22:42,745][26022] Updated weights on worker 0-0, policy_version 246902 (0.00088) [2022-07-09 12:22:44,387][26022] Updated weights on worker 0-0, policy_version 246912 (0.00097) [2022-07-09 12:22:44,717][25689] Fps is (10 sec: 5817.4, 60 sec: 5710.9, 300 sec: 5675.7). Total num frames: 252840960. Throughput: 0: 5952.5. Samples: 252844602. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:44,717][25689] Avg episode reward: [(0, '-50.165')] [2022-07-09 12:22:46,460][26022] Updated weights on worker 0-0, policy_version 246922 (0.00093) [2022-07-09 12:22:47,836][26022] Updated weights on worker 0-0, policy_version 246932 (0.00088) [2022-07-09 12:22:49,751][25689] Fps is (10 sec: 5604.4, 60 sec: 5674.1, 300 sec: 5663.6). Total num frames: 252866560. Throughput: 0: 5090.3. Samples: 252861594. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:49,754][25689] Avg episode reward: [(0, '-49.656')] [2022-07-09 12:22:50,084][26022] Updated weights on worker 0-0, policy_version 246942 (0.00122) [2022-07-09 12:22:51,575][26022] Updated weights on worker 0-0, policy_version 246952 (0.00092) [2022-07-09 12:22:53,540][26022] Updated weights on worker 0-0, policy_version 246962 (0.00096) [2022-07-09 12:22:54,796][25689] Fps is (10 sec: 5587.6, 60 sec: 5661.9, 300 sec: 5673.3). Total num frames: 252897280. Throughput: 0: 5953.8. Samples: 252895816. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:54,797][25689] Avg episode reward: [(0, '-50.393')] [2022-07-09 12:22:54,907][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:22:54,919][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000246971_252898304.pth [2022-07-09 12:22:54,920][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000244977_250856448.pth [2022-07-09 12:22:55,146][26022] Updated weights on worker 0-0, policy_version 246972 (0.00083) [2022-07-09 12:22:56,910][26022] Updated weights on worker 0-0, policy_version 246982 (0.00109) [2022-07-09 12:22:58,854][26022] Updated weights on worker 0-0, policy_version 246992 (0.00093) [2022-07-09 12:22:59,820][25689] Fps is (10 sec: 5695.0, 60 sec: 5648.9, 300 sec: 5669.9). Total num frames: 252923904. Throughput: 0: 5939.9. Samples: 252929822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:22:59,821][25689] Avg episode reward: [(0, '-50.485')] [2022-07-09 12:23:00,673][26022] Updated weights on worker 0-0, policy_version 247002 (0.00088) [2022-07-09 12:23:02,698][26022] Updated weights on worker 0-0, policy_version 247012 (0.00087) [2022-07-09 12:23:04,721][26022] Updated weights on worker 0-0, policy_version 247022 (0.00734) [2022-07-09 12:23:04,826][25689] Fps is (10 sec: 5411.2, 60 sec: 5684.8, 300 sec: 5667.4). Total num frames: 252951552. Throughput: 0: 5010.1. Samples: 252945350. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:23:04,826][25689] Avg episode reward: [(0, '-50.360')] [2022-07-09 12:23:06,518][26022] Updated weights on worker 0-0, policy_version 247032 (0.00078) [2022-07-09 12:23:08,280][26022] Updated weights on worker 0-0, policy_version 247042 (0.00089) [2022-07-09 12:23:09,850][25689] Fps is (10 sec: 5615.0, 60 sec: 5668.4, 300 sec: 5667.7). Total num frames: 252980224. Throughput: 0: 5832.6. Samples: 252978826. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:23:09,851][25689] Avg episode reward: [(0, '-48.736')] [2022-07-09 12:23:09,967][26022] Updated weights on worker 0-0, policy_version 247052 (0.00089) [2022-07-09 12:23:11,837][26022] Updated weights on worker 0-0, policy_version 247062 (0.00082) [2022-07-09 12:23:13,591][26022] Updated weights on worker 0-0, policy_version 247072 (0.00084) [2022-07-09 12:23:14,924][25689] Fps is (10 sec: 5577.1, 60 sec: 5632.5, 300 sec: 5657.6). Total num frames: 253007872. Throughput: 0: 5820.5. Samples: 253012970. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:23:14,925][25689] Avg episode reward: [(0, '-50.128')] [2022-07-09 12:23:15,439][26022] Updated weights on worker 0-0, policy_version 247082 (0.00089) [2022-07-09 12:23:17,206][26022] Updated weights on worker 0-0, policy_version 247092 (0.00086) [2022-07-09 12:23:18,981][26022] Updated weights on worker 0-0, policy_version 247102 (0.00090) [2022-07-09 12:23:19,948][25689] Fps is (10 sec: 5679.3, 60 sec: 5683.4, 300 sec: 5668.3). Total num frames: 253037568. Throughput: 0: 4983.2. Samples: 253030120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:23:19,948][25689] Avg episode reward: [(0, '-50.373')] [2022-07-09 12:23:20,935][26022] Updated weights on worker 0-0, policy_version 247112 (0.00082) [2022-07-09 12:23:22,561][26022] Updated weights on worker 0-0, policy_version 247122 (0.00096) [2022-07-09 12:23:24,384][26022] Updated weights on worker 0-0, policy_version 247132 (0.00088) [2022-07-09 12:23:24,992][25689] Fps is (10 sec: 5899.0, 60 sec: 5680.1, 300 sec: 5672.5). Total num frames: 253067264. Throughput: 0: 5909.6. Samples: 253064526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 12:23:24,993][25689] Avg episode reward: [(0, '-50.230')] [2022-07-09 12:23:26,239][26022] Updated weights on worker 0-0, policy_version 247142 (0.00095) [2022-07-09 12:23:28,027][26022] Updated weights on worker 0-0, policy_version 247152 (0.00092) [2022-07-09 12:23:29,820][26022] Updated weights on worker 0-0, policy_version 247162 (0.00085) [2022-07-09 12:23:30,080][25689] Fps is (10 sec: 5659.3, 60 sec: 5657.1, 300 sec: 5666.5). Total num frames: 253094912. Throughput: 0: 5933.6. Samples: 253098860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:23:30,082][25689] Avg episode reward: [(0, '-50.461')] [2022-07-09 12:23:31,588][26022] Updated weights on worker 0-0, policy_version 247172 (0.00092) [2022-07-09 12:23:33,582][26022] Updated weights on worker 0-0, policy_version 247182 (0.00096) [2022-07-09 12:23:35,157][25689] Fps is (10 sec: 5540.7, 60 sec: 5638.7, 300 sec: 5665.5). Total num frames: 253123584. Throughput: 0: 5933.1. Samples: 253133014. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:23:35,158][25689] Avg episode reward: [(0, '-50.053')] [2022-07-09 12:23:35,271][26022] Updated weights on worker 0-0, policy_version 247192 (0.00103) [2022-07-09 12:23:37,038][26022] Updated weights on worker 0-0, policy_version 247202 (0.00091) [2022-07-09 12:23:38,659][26022] Updated weights on worker 0-0, policy_version 247212 (0.00087) [2022-07-09 12:23:40,213][25689] Fps is (10 sec: 5659.4, 60 sec: 5652.1, 300 sec: 5661.6). Total num frames: 253152256. Throughput: 0: 5917.7. Samples: 253150044. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:23:40,213][25689] Avg episode reward: [(0, '-49.928')] [2022-07-09 12:23:40,542][26022] Updated weights on worker 0-0, policy_version 247222 (0.00086) [2022-07-09 12:23:42,425][26022] Updated weights on worker 0-0, policy_version 247232 (0.00079) [2022-07-09 12:23:44,190][26022] Updated weights on worker 0-0, policy_version 247242 (0.00091) [2022-07-09 12:23:45,251][25689] Fps is (10 sec: 5681.2, 60 sec: 5616.2, 300 sec: 5662.7). Total num frames: 253180928. Throughput: 0: 5920.8. Samples: 253184472. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:23:45,251][25689] Avg episode reward: [(0, '-48.909')] [2022-07-09 12:23:46,131][26022] Updated weights on worker 0-0, policy_version 247252 (0.00092) [2022-07-09 12:23:47,739][26022] Updated weights on worker 0-0, policy_version 247262 (0.00087) [2022-07-09 12:23:49,546][26022] Updated weights on worker 0-0, policy_version 247272 (0.00094) [2022-07-09 12:23:50,265][25689] Fps is (10 sec: 5704.6, 60 sec: 5668.8, 300 sec: 5664.5). Total num frames: 253209600. Throughput: 0: 5929.3. Samples: 253218542. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:23:50,266][25689] Avg episode reward: [(0, '-48.897')] [2022-07-09 12:23:51,364][26022] Updated weights on worker 0-0, policy_version 247282 (0.00091) [2022-07-09 12:23:53,278][26022] Updated weights on worker 0-0, policy_version 247292 (0.00091) [2022-07-09 12:23:55,068][26022] Updated weights on worker 0-0, policy_version 247302 (0.00088) [2022-07-09 12:23:55,327][25689] Fps is (10 sec: 5690.8, 60 sec: 5633.3, 300 sec: 5663.4). Total num frames: 253238272. Throughput: 0: 5071.8. Samples: 253235314. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:23:55,329][25689] Avg episode reward: [(0, '-49.835')] [2022-07-09 12:23:56,883][26022] Updated weights on worker 0-0, policy_version 247312 (0.00101) [2022-07-09 12:23:58,594][26022] Updated weights on worker 0-0, policy_version 247322 (0.00090) [2022-07-09 12:24:00,355][25689] Fps is (10 sec: 5683.1, 60 sec: 5666.8, 300 sec: 5671.4). Total num frames: 253266944. Throughput: 0: 5918.7. Samples: 253269262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:00,356][25689] Avg episode reward: [(0, '-50.263')] [2022-07-09 12:24:00,545][26022] Updated weights on worker 0-0, policy_version 247332 (0.00092) [2022-07-09 12:24:02,627][26022] Updated weights on worker 0-0, policy_version 247342 (0.00092) [2022-07-09 12:24:04,529][26022] Updated weights on worker 0-0, policy_version 247352 (0.00090) [2022-07-09 12:24:05,366][25689] Fps is (10 sec: 5406.5, 60 sec: 5632.5, 300 sec: 5662.1). Total num frames: 253292544. Throughput: 0: 5805.1. Samples: 253301242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:05,366][25689] Avg episode reward: [(0, '-50.595')] [2022-07-09 12:24:06,340][26022] Updated weights on worker 0-0, policy_version 247362 (0.00087) [2022-07-09 12:24:08,154][26022] Updated weights on worker 0-0, policy_version 247372 (0.00084) [2022-07-09 12:24:09,875][26022] Updated weights on worker 0-0, policy_version 247382 (0.00084) [2022-07-09 12:24:10,394][25689] Fps is (10 sec: 5508.2, 60 sec: 5649.1, 300 sec: 5666.5). Total num frames: 253322240. Throughput: 0: 4956.8. Samples: 253318318. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:10,395][25689] Avg episode reward: [(0, '-51.121')] [2022-07-09 12:24:11,758][26022] Updated weights on worker 0-0, policy_version 247392 (0.00091) [2022-07-09 12:24:13,515][26022] Updated weights on worker 0-0, policy_version 247402 (0.00084) [2022-07-09 12:24:15,337][26022] Updated weights on worker 0-0, policy_version 247412 (0.00089) [2022-07-09 12:24:15,480][25689] Fps is (10 sec: 5770.6, 60 sec: 5664.8, 300 sec: 5661.5). Total num frames: 253350912. Throughput: 0: 5815.7. Samples: 253352518. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:15,481][25689] Avg episode reward: [(0, '-51.977')] [2022-07-09 12:24:17,041][26022] Updated weights on worker 0-0, policy_version 247422 (0.00082) [2022-07-09 12:24:18,787][26022] Updated weights on worker 0-0, policy_version 247432 (0.00083) [2022-07-09 12:24:20,484][25689] Fps is (10 sec: 5683.4, 60 sec: 5649.8, 300 sec: 5668.6). Total num frames: 253379584. Throughput: 0: 5841.5. Samples: 253386842. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:20,484][25689] Avg episode reward: [(0, '-50.938')] [2022-07-09 12:24:20,827][26022] Updated weights on worker 0-0, policy_version 247442 (0.00093) [2022-07-09 12:24:22,166][26022] Updated weights on worker 0-0, policy_version 247452 (0.00080) [2022-07-09 12:24:24,479][26022] Updated weights on worker 0-0, policy_version 247462 (0.00093) [2022-07-09 12:24:25,496][25689] Fps is (10 sec: 5725.2, 60 sec: 5635.9, 300 sec: 5661.6). Total num frames: 253408256. Throughput: 0: 5108.1. Samples: 253404072. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:25,497][25689] Avg episode reward: [(0, '-49.745')] [2022-07-09 12:24:26,056][26022] Updated weights on worker 0-0, policy_version 247472 (0.00095) [2022-07-09 12:24:27,883][26022] Updated weights on worker 0-0, policy_version 247482 (0.00096) [2022-07-09 12:24:29,701][26022] Updated weights on worker 0-0, policy_version 247492 (0.00085) [2022-07-09 12:24:30,511][25689] Fps is (10 sec: 5718.7, 60 sec: 5659.6, 300 sec: 5663.3). Total num frames: 253436928. Throughput: 0: 5951.5. Samples: 253438046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:30,512][25689] Avg episode reward: [(0, '-50.335')] [2022-07-09 12:24:31,628][26022] Updated weights on worker 0-0, policy_version 247502 (0.00093) [2022-07-09 12:24:33,205][26022] Updated weights on worker 0-0, policy_version 247512 (0.00087) [2022-07-09 12:24:35,160][26022] Updated weights on worker 0-0, policy_version 247522 (0.00087) [2022-07-09 12:24:35,576][25689] Fps is (10 sec: 5486.0, 60 sec: 5626.9, 300 sec: 5655.5). Total num frames: 253463552. Throughput: 0: 5971.3. Samples: 253472514. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:35,576][25689] Avg episode reward: [(0, '-50.526')] [2022-07-09 12:24:36,612][26022] Updated weights on worker 0-0, policy_version 247532 (0.00100) [2022-07-09 12:24:38,891][26022] Updated weights on worker 0-0, policy_version 247542 (0.00087) [2022-07-09 12:24:40,362][26022] Updated weights on worker 0-0, policy_version 247552 (0.00088) [2022-07-09 12:24:40,622][25689] Fps is (10 sec: 5671.4, 60 sec: 5661.6, 300 sec: 5666.0). Total num frames: 253494272. Throughput: 0: 5104.8. Samples: 253489644. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:40,623][25689] Avg episode reward: [(0, '-49.683')] [2022-07-09 12:24:42,225][26022] Updated weights on worker 0-0, policy_version 247562 (0.00088) [2022-07-09 12:24:44,189][26022] Updated weights on worker 0-0, policy_version 247572 (0.00092) [2022-07-09 12:24:45,653][25689] Fps is (10 sec: 5792.1, 60 sec: 5645.3, 300 sec: 5660.1). Total num frames: 253521920. Throughput: 0: 5939.5. Samples: 253523792. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:45,654][25689] Avg episode reward: [(0, '-50.896')] [2022-07-09 12:24:45,775][26022] Updated weights on worker 0-0, policy_version 247582 (0.00086) [2022-07-09 12:24:47,650][26022] Updated weights on worker 0-0, policy_version 247592 (0.00096) [2022-07-09 12:24:49,270][26022] Updated weights on worker 0-0, policy_version 247602 (0.00084) [2022-07-09 12:24:50,678][25689] Fps is (10 sec: 5600.7, 60 sec: 5644.3, 300 sec: 5660.5). Total num frames: 253550592. Throughput: 0: 5960.2. Samples: 253558244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:50,679][25689] Avg episode reward: [(0, '-51.119')] [2022-07-09 12:24:51,214][26022] Updated weights on worker 0-0, policy_version 247612 (0.00339) [2022-07-09 12:24:53,095][26022] Updated weights on worker 0-0, policy_version 247622 (0.00092) [2022-07-09 12:24:54,835][26022] Updated weights on worker 0-0, policy_version 247632 (0.00085) [2022-07-09 12:24:55,010][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:24:55,024][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000247633_253576192.pth [2022-07-09 12:24:55,024][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000245640_251535360.pth [2022-07-09 12:24:55,811][25689] Fps is (10 sec: 5746.2, 60 sec: 5654.7, 300 sec: 5661.8). Total num frames: 253580288. Throughput: 0: 5086.5. Samples: 253575438. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:24:55,811][25689] Avg episode reward: [(0, '-51.236')] [2022-07-09 12:24:56,638][26022] Updated weights on worker 0-0, policy_version 247642 (0.00087) [2022-07-09 12:24:58,349][26022] Updated weights on worker 0-0, policy_version 247652 (0.00093) [2022-07-09 12:25:00,130][26022] Updated weights on worker 0-0, policy_version 247662 (0.00090) [2022-07-09 12:25:00,836][25689] Fps is (10 sec: 5746.0, 60 sec: 5655.0, 300 sec: 5671.7). Total num frames: 253608960. Throughput: 0: 5952.9. Samples: 253609974. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:00,837][25689] Avg episode reward: [(0, '-50.850')] [2022-07-09 12:25:02,355][26022] Updated weights on worker 0-0, policy_version 247672 (0.00088) [2022-07-09 12:25:04,157][26022] Updated weights on worker 0-0, policy_version 247682 (0.00085) [2022-07-09 12:25:05,848][25689] Fps is (10 sec: 5508.8, 60 sec: 5671.7, 300 sec: 5661.3). Total num frames: 253635584. Throughput: 0: 5858.7. Samples: 253642110. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:05,849][25689] Avg episode reward: [(0, '-50.355')] [2022-07-09 12:25:05,927][26022] Updated weights on worker 0-0, policy_version 247692 (0.00085) [2022-07-09 12:25:07,874][26022] Updated weights on worker 0-0, policy_version 247702 (0.00083) [2022-07-09 12:25:09,600][26022] Updated weights on worker 0-0, policy_version 247712 (0.00086) [2022-07-09 12:25:10,861][25689] Fps is (10 sec: 5413.9, 60 sec: 5639.4, 300 sec: 5659.1). Total num frames: 253663232. Throughput: 0: 4985.9. Samples: 253658872. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:10,861][25689] Avg episode reward: [(0, '-49.444')] [2022-07-09 12:25:11,320][26022] Updated weights on worker 0-0, policy_version 247722 (0.00090) [2022-07-09 12:25:13,142][26022] Updated weights on worker 0-0, policy_version 247732 (0.00387) [2022-07-09 12:25:15,087][26022] Updated weights on worker 0-0, policy_version 247742 (0.00093) [2022-07-09 12:25:15,928][25689] Fps is (10 sec: 5689.3, 60 sec: 5658.1, 300 sec: 5661.9). Total num frames: 253692928. Throughput: 0: 5846.6. Samples: 253693054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:15,928][25689] Avg episode reward: [(0, '-49.051')] [2022-07-09 12:25:16,759][26022] Updated weights on worker 0-0, policy_version 247752 (0.00091) [2022-07-09 12:25:18,755][26022] Updated weights on worker 0-0, policy_version 247762 (0.00084) [2022-07-09 12:25:20,266][26022] Updated weights on worker 0-0, policy_version 247772 (0.00085) [2022-07-09 12:25:20,935][25689] Fps is (10 sec: 5793.7, 60 sec: 5657.7, 300 sec: 5665.8). Total num frames: 253721600. Throughput: 0: 5837.3. Samples: 253727296. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:20,937][25689] Avg episode reward: [(0, '-49.820')] [2022-07-09 12:25:22,400][26022] Updated weights on worker 0-0, policy_version 247782 (0.00091) [2022-07-09 12:25:23,745][26022] Updated weights on worker 0-0, policy_version 247792 (0.00053) [2022-07-09 12:25:25,778][26022] Updated weights on worker 0-0, policy_version 247802 (0.00094) [2022-07-09 12:25:25,966][25689] Fps is (10 sec: 5610.5, 60 sec: 5639.1, 300 sec: 5658.4). Total num frames: 253749248. Throughput: 0: 5086.0. Samples: 253744428. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:25,968][25689] Avg episode reward: [(0, '-50.298')] [2022-07-09 12:25:27,332][26022] Updated weights on worker 0-0, policy_version 247812 (0.00106) [2022-07-09 12:25:29,408][26022] Updated weights on worker 0-0, policy_version 247822 (0.00086) [2022-07-09 12:25:30,989][25689] Fps is (10 sec: 5601.6, 60 sec: 5638.3, 300 sec: 5660.2). Total num frames: 253777920. Throughput: 0: 5942.9. Samples: 253778494. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:30,991][25689] Avg episode reward: [(0, '-50.290')] [2022-07-09 12:25:31,165][26022] Updated weights on worker 0-0, policy_version 247832 (0.00093) [2022-07-09 12:25:32,938][26022] Updated weights on worker 0-0, policy_version 247842 (0.00086) [2022-07-09 12:25:35,067][26022] Updated weights on worker 0-0, policy_version 247852 (0.00085) [2022-07-09 12:25:36,029][25689] Fps is (10 sec: 5800.1, 60 sec: 5691.4, 300 sec: 5666.5). Total num frames: 253807616. Throughput: 0: 5940.5. Samples: 253812468. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:36,030][25689] Avg episode reward: [(0, '-50.444')] [2022-07-09 12:25:36,541][26022] Updated weights on worker 0-0, policy_version 247862 (0.00085) [2022-07-09 12:25:38,451][26022] Updated weights on worker 0-0, policy_version 247872 (0.00093) [2022-07-09 12:25:40,052][26022] Updated weights on worker 0-0, policy_version 247882 (0.00108) [2022-07-09 12:25:41,044][25689] Fps is (10 sec: 5601.1, 60 sec: 5626.5, 300 sec: 5656.4). Total num frames: 253834240. Throughput: 0: 5082.5. Samples: 253829502. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:41,045][25689] Avg episode reward: [(0, '-50.936')] [2022-07-09 12:25:41,909][26022] Updated weights on worker 0-0, policy_version 247892 (0.00089) [2022-07-09 12:25:43,892][26022] Updated weights on worker 0-0, policy_version 247902 (0.00093) [2022-07-09 12:25:45,594][26022] Updated weights on worker 0-0, policy_version 247912 (0.00088) [2022-07-09 12:25:46,087][25689] Fps is (10 sec: 5701.3, 60 sec: 5676.3, 300 sec: 5662.7). Total num frames: 253864960. Throughput: 0: 5929.4. Samples: 253863736. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 12:25:46,088][25689] Avg episode reward: [(0, '-50.506')] [2022-07-09 12:25:47,392][26022] Updated weights on worker 0-0, policy_version 247922 (0.00087) [2022-07-09 12:25:49,234][26022] Updated weights on worker 0-0, policy_version 247932 (0.00090) [2022-07-09 12:25:50,972][26022] Updated weights on worker 0-0, policy_version 247942 (0.00086) [2022-07-09 12:25:51,131][25689] Fps is (10 sec: 5888.3, 60 sec: 5674.5, 300 sec: 5663.8). Total num frames: 253893632. Throughput: 0: 5925.6. Samples: 253897846. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:25:51,132][25689] Avg episode reward: [(0, '-49.638')] [2022-07-09 12:25:52,959][26022] Updated weights on worker 0-0, policy_version 247952 (0.00086) [2022-07-09 12:25:54,462][26022] Updated weights on worker 0-0, policy_version 247962 (0.00087) [2022-07-09 12:25:56,238][25689] Fps is (10 sec: 5649.0, 60 sec: 5659.9, 300 sec: 5659.2). Total num frames: 253922304. Throughput: 0: 5068.5. Samples: 253914898. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:25:56,239][25689] Avg episode reward: [(0, '-50.396')] [2022-07-09 12:25:56,441][26022] Updated weights on worker 0-0, policy_version 247972 (0.00081) [2022-07-09 12:25:58,231][26022] Updated weights on worker 0-0, policy_version 247982 (0.00086) [2022-07-09 12:25:59,936][26022] Updated weights on worker 0-0, policy_version 247992 (0.00092) [2022-07-09 12:26:01,276][25689] Fps is (10 sec: 5551.7, 60 sec: 5641.9, 300 sec: 5663.0). Total num frames: 253949952. Throughput: 0: 5917.8. Samples: 253949226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:01,276][25689] Avg episode reward: [(0, '-49.821')] [2022-07-09 12:26:01,947][26022] Updated weights on worker 0-0, policy_version 248002 (0.00083) [2022-07-09 12:26:03,872][26022] Updated weights on worker 0-0, policy_version 248012 (0.00084) [2022-07-09 12:26:05,701][26022] Updated weights on worker 0-0, policy_version 248022 (0.00090) [2022-07-09 12:26:06,321][25689] Fps is (10 sec: 5484.3, 60 sec: 5655.7, 300 sec: 5660.5). Total num frames: 253977600. Throughput: 0: 5819.8. Samples: 253981494. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:06,322][25689] Avg episode reward: [(0, '-49.046')] [2022-07-09 12:26:07,573][26022] Updated weights on worker 0-0, policy_version 248032 (0.00082) [2022-07-09 12:26:09,459][26022] Updated weights on worker 0-0, policy_version 248042 (0.00084) [2022-07-09 12:26:11,110][26022] Updated weights on worker 0-0, policy_version 248052 (0.00093) [2022-07-09 12:26:11,327][25689] Fps is (10 sec: 5704.9, 60 sec: 5690.1, 300 sec: 5661.1). Total num frames: 254007296. Throughput: 0: 4989.4. Samples: 253998614. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:11,328][25689] Avg episode reward: [(0, '-47.924')] [2022-07-09 12:26:12,931][26022] Updated weights on worker 0-0, policy_version 248062 (0.00086) [2022-07-09 12:26:14,636][26022] Updated weights on worker 0-0, policy_version 248072 (0.00094) [2022-07-09 12:26:16,385][25689] Fps is (10 sec: 5698.2, 60 sec: 5657.2, 300 sec: 5660.1). Total num frames: 254034944. Throughput: 0: 5870.0. Samples: 254033156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:16,387][25689] Avg episode reward: [(0, '-47.845')] [2022-07-09 12:26:16,538][26022] Updated weights on worker 0-0, policy_version 248082 (0.00098) [2022-07-09 12:26:18,398][26022] Updated weights on worker 0-0, policy_version 248092 (0.00082) [2022-07-09 12:26:19,986][26022] Updated weights on worker 0-0, policy_version 248102 (0.00094) [2022-07-09 12:26:21,388][25689] Fps is (10 sec: 5598.0, 60 sec: 5657.5, 300 sec: 5660.5). Total num frames: 254063616. Throughput: 0: 5873.0. Samples: 254067346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:21,389][25689] Avg episode reward: [(0, '-48.202')] [2022-07-09 12:26:22,066][26022] Updated weights on worker 0-0, policy_version 248112 (0.00090) [2022-07-09 12:26:23,630][26022] Updated weights on worker 0-0, policy_version 248122 (0.00086) [2022-07-09 12:26:25,643][26022] Updated weights on worker 0-0, policy_version 248132 (0.00097) [2022-07-09 12:26:26,399][25689] Fps is (10 sec: 5726.3, 60 sec: 5676.4, 300 sec: 5664.0). Total num frames: 254092288. Throughput: 0: 5975.3. Samples: 254101464. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:26,401][25689] Avg episode reward: [(0, '-47.525')] [2022-07-09 12:26:27,412][26022] Updated weights on worker 0-0, policy_version 248142 (0.00090) [2022-07-09 12:26:29,044][26022] Updated weights on worker 0-0, policy_version 248152 (0.00092) [2022-07-09 12:26:31,117][26022] Updated weights on worker 0-0, policy_version 248162 (0.00083) [2022-07-09 12:26:31,490][25689] Fps is (10 sec: 5676.3, 60 sec: 5670.0, 300 sec: 5660.5). Total num frames: 254120960. Throughput: 0: 5949.1. Samples: 254118566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:31,491][25689] Avg episode reward: [(0, '-48.423')] [2022-07-09 12:26:32,537][26022] Updated weights on worker 0-0, policy_version 248172 (0.00092) [2022-07-09 12:26:34,582][26022] Updated weights on worker 0-0, policy_version 248182 (0.00085) [2022-07-09 12:26:36,089][26022] Updated weights on worker 0-0, policy_version 248192 (0.00093) [2022-07-09 12:26:36,545][25689] Fps is (10 sec: 5651.6, 60 sec: 5651.6, 300 sec: 5663.2). Total num frames: 254149632. Throughput: 0: 5941.6. Samples: 254152944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:36,546][25689] Avg episode reward: [(0, '-49.093')] [2022-07-09 12:26:38,054][26022] Updated weights on worker 0-0, policy_version 248202 (0.00082) [2022-07-09 12:26:39,916][26022] Updated weights on worker 0-0, policy_version 248212 (0.00090) [2022-07-09 12:26:41,573][25689] Fps is (10 sec: 5586.1, 60 sec: 5667.4, 300 sec: 5655.9). Total num frames: 254177280. Throughput: 0: 5919.9. Samples: 254186836. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:41,573][25689] Avg episode reward: [(0, '-50.321')] [2022-07-09 12:26:41,760][26022] Updated weights on worker 0-0, policy_version 248222 (0.00089) [2022-07-09 12:26:43,545][26022] Updated weights on worker 0-0, policy_version 248232 (0.00089) [2022-07-09 12:26:45,254][26022] Updated weights on worker 0-0, policy_version 248242 (0.00076) [2022-07-09 12:26:46,588][25689] Fps is (10 sec: 5608.4, 60 sec: 5636.2, 300 sec: 5659.2). Total num frames: 254205952. Throughput: 0: 5073.7. Samples: 254203896. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:46,588][25689] Avg episode reward: [(0, '-50.106')] [2022-07-09 12:26:47,073][26022] Updated weights on worker 0-0, policy_version 248252 (0.00098) [2022-07-09 12:26:49,051][26022] Updated weights on worker 0-0, policy_version 248262 (0.00086) [2022-07-09 12:26:50,676][26022] Updated weights on worker 0-0, policy_version 248272 (0.00088) [2022-07-09 12:26:51,590][25689] Fps is (10 sec: 5826.8, 60 sec: 5657.0, 300 sec: 5654.1). Total num frames: 254235648. Throughput: 0: 5948.5. Samples: 254238126. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:51,590][25689] Avg episode reward: [(0, '-49.777')] [2022-07-09 12:26:52,610][26022] Updated weights on worker 0-0, policy_version 248282 (0.00061) [2022-07-09 12:26:54,240][26022] Updated weights on worker 0-0, policy_version 248292 (0.00094) [2022-07-09 12:26:55,078][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:26:55,092][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000248296_254255104.pth [2022-07-09 12:26:55,092][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000246305_252216320.pth [2022-07-09 12:26:56,318][26022] Updated weights on worker 0-0, policy_version 248302 (0.00087) [2022-07-09 12:26:56,674][25689] Fps is (10 sec: 5786.9, 60 sec: 5659.2, 300 sec: 5657.2). Total num frames: 254264320. Throughput: 0: 5923.0. Samples: 254272164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:26:56,674][25689] Avg episode reward: [(0, '-48.995')] [2022-07-09 12:26:57,743][26022] Updated weights on worker 0-0, policy_version 248312 (0.00083) [2022-07-09 12:26:59,890][26022] Updated weights on worker 0-0, policy_version 248322 (0.00098) [2022-07-09 12:27:01,686][25689] Fps is (10 sec: 5578.1, 60 sec: 5661.5, 300 sec: 5664.3). Total num frames: 254291968. Throughput: 0: 5094.9. Samples: 254289314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:01,687][25689] Avg episode reward: [(0, '-49.664')] [2022-07-09 12:27:01,693][26022] Updated weights on worker 0-0, policy_version 248332 (0.00090) [2022-07-09 12:27:03,661][26022] Updated weights on worker 0-0, policy_version 248342 (0.00090) [2022-07-09 12:27:05,611][26022] Updated weights on worker 0-0, policy_version 248352 (0.00091) [2022-07-09 12:27:06,731][25689] Fps is (10 sec: 5498.0, 60 sec: 5661.6, 300 sec: 5657.1). Total num frames: 254319616. Throughput: 0: 5833.9. Samples: 254321412. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:06,732][25689] Avg episode reward: [(0, '-49.557')] [2022-07-09 12:27:07,273][26022] Updated weights on worker 0-0, policy_version 248362 (0.00086) [2022-07-09 12:27:09,153][26022] Updated weights on worker 0-0, policy_version 248372 (0.00090) [2022-07-09 12:27:11,128][26022] Updated weights on worker 0-0, policy_version 248382 (0.00093) [2022-07-09 12:27:11,733][25689] Fps is (10 sec: 5504.2, 60 sec: 5628.1, 300 sec: 5651.2). Total num frames: 254347264. Throughput: 0: 5830.5. Samples: 254355568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:11,733][25689] Avg episode reward: [(0, '-49.782')] [2022-07-09 12:27:12,784][26022] Updated weights on worker 0-0, policy_version 248392 (0.00085) [2022-07-09 12:27:14,621][26022] Updated weights on worker 0-0, policy_version 248402 (0.00096) [2022-07-09 12:27:16,103][26022] Updated weights on worker 0-0, policy_version 248412 (0.00082) [2022-07-09 12:27:16,782][25689] Fps is (10 sec: 5705.5, 60 sec: 5662.8, 300 sec: 5661.1). Total num frames: 254376960. Throughput: 0: 5003.3. Samples: 254372770. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:16,782][25689] Avg episode reward: [(0, '-48.666')] [2022-07-09 12:27:18,253][26022] Updated weights on worker 0-0, policy_version 248422 (0.00085) [2022-07-09 12:27:19,801][26022] Updated weights on worker 0-0, policy_version 248432 (0.00087) [2022-07-09 12:27:21,802][25689] Fps is (10 sec: 5694.7, 60 sec: 5644.2, 300 sec: 5654.0). Total num frames: 254404608. Throughput: 0: 5857.5. Samples: 254407142. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:21,803][25689] Avg episode reward: [(0, '-48.797')] [2022-07-09 12:27:21,808][26022] Updated weights on worker 0-0, policy_version 248442 (0.00086) [2022-07-09 12:27:23,293][26022] Updated weights on worker 0-0, policy_version 248452 (0.00091) [2022-07-09 12:27:25,206][26022] Updated weights on worker 0-0, policy_version 248462 (0.00097) [2022-07-09 12:27:26,828][25689] Fps is (10 sec: 5606.2, 60 sec: 5642.9, 300 sec: 5653.9). Total num frames: 254433280. Throughput: 0: 5996.0. Samples: 254441910. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:26,828][25689] Avg episode reward: [(0, '-49.559')] [2022-07-09 12:27:27,175][26022] Updated weights on worker 0-0, policy_version 248472 (0.00089) [2022-07-09 12:27:28,875][26022] Updated weights on worker 0-0, policy_version 248482 (0.00381) [2022-07-09 12:27:30,689][26022] Updated weights on worker 0-0, policy_version 248492 (0.00087) [2022-07-09 12:27:31,883][25689] Fps is (10 sec: 5790.3, 60 sec: 5663.3, 300 sec: 5654.1). Total num frames: 254462976. Throughput: 0: 5133.2. Samples: 254459002. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:31,883][25689] Avg episode reward: [(0, '-48.913')] [2022-07-09 12:27:32,553][26022] Updated weights on worker 0-0, policy_version 248502 (0.00085) [2022-07-09 12:27:34,078][26022] Updated weights on worker 0-0, policy_version 248512 (0.00094) [2022-07-09 12:27:36,088][26022] Updated weights on worker 0-0, policy_version 248522 (0.00088) [2022-07-09 12:27:36,954][25689] Fps is (10 sec: 5865.3, 60 sec: 5678.7, 300 sec: 5660.0). Total num frames: 254492672. Throughput: 0: 5966.2. Samples: 254493120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:36,954][25689] Avg episode reward: [(0, '-48.194')] [2022-07-09 12:27:37,629][26022] Updated weights on worker 0-0, policy_version 248532 (0.00081) [2022-07-09 12:27:39,696][26022] Updated weights on worker 0-0, policy_version 248542 (0.00099) [2022-07-09 12:27:41,502][26022] Updated weights on worker 0-0, policy_version 248552 (0.00089) [2022-07-09 12:27:42,020][25689] Fps is (10 sec: 5555.5, 60 sec: 5658.1, 300 sec: 5645.2). Total num frames: 254519296. Throughput: 0: 5935.7. Samples: 254527150. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:42,021][25689] Avg episode reward: [(0, '-49.164')] [2022-07-09 12:27:43,219][26022] Updated weights on worker 0-0, policy_version 248562 (0.00085) [2022-07-09 12:27:44,943][26022] Updated weights on worker 0-0, policy_version 248572 (0.00094) [2022-07-09 12:27:46,695][26022] Updated weights on worker 0-0, policy_version 248582 (0.00102) [2022-07-09 12:27:47,032][25689] Fps is (10 sec: 5689.7, 60 sec: 5692.2, 300 sec: 5662.8). Total num frames: 254550016. Throughput: 0: 5074.8. Samples: 254544444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:47,033][25689] Avg episode reward: [(0, '-49.835')] [2022-07-09 12:27:48,971][26022] Updated weights on worker 0-0, policy_version 248592 (0.00085) [2022-07-09 12:27:50,235][26022] Updated weights on worker 0-0, policy_version 248602 (0.00096) [2022-07-09 12:27:52,061][25689] Fps is (10 sec: 5710.9, 60 sec: 5638.9, 300 sec: 5649.4). Total num frames: 254576640. Throughput: 0: 5930.4. Samples: 254578670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:52,063][25689] Avg episode reward: [(0, '-50.020')] [2022-07-09 12:27:52,488][26022] Updated weights on worker 0-0, policy_version 248612 (0.00090) [2022-07-09 12:27:53,903][26022] Updated weights on worker 0-0, policy_version 248622 (0.00085) [2022-07-09 12:27:55,675][26022] Updated weights on worker 0-0, policy_version 248632 (0.00093) [2022-07-09 12:27:57,124][25689] Fps is (10 sec: 5580.6, 60 sec: 5657.8, 300 sec: 5659.0). Total num frames: 254606336. Throughput: 0: 5951.2. Samples: 254613160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:27:57,125][25689] Avg episode reward: [(0, '-50.174')] [2022-07-09 12:27:57,602][26022] Updated weights on worker 0-0, policy_version 248642 (0.00086) [2022-07-09 12:27:59,378][26022] Updated weights on worker 0-0, policy_version 248652 (0.00095) [2022-07-09 12:28:01,216][26022] Updated weights on worker 0-0, policy_version 248662 (0.00087) [2022-07-09 12:28:02,186][25689] Fps is (10 sec: 5664.0, 60 sec: 5653.2, 300 sec: 5657.9). Total num frames: 254633984. Throughput: 0: 5116.4. Samples: 254630324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:28:02,186][25689] Avg episode reward: [(0, '-50.559')] [2022-07-09 12:28:03,341][26022] Updated weights on worker 0-0, policy_version 248672 (0.00084) [2022-07-09 12:28:05,310][26022] Updated weights on worker 0-0, policy_version 248682 (0.00091) [2022-07-09 12:28:07,043][26022] Updated weights on worker 0-0, policy_version 248692 (0.00087) [2022-07-09 12:28:07,244][25689] Fps is (10 sec: 5464.4, 60 sec: 5652.0, 300 sec: 5653.8). Total num frames: 254661632. Throughput: 0: 5823.8. Samples: 254662152. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:07,244][25689] Avg episode reward: [(0, '-50.179')] [2022-07-09 12:28:08,794][26022] Updated weights on worker 0-0, policy_version 248702 (0.00096) [2022-07-09 12:28:10,698][26022] Updated weights on worker 0-0, policy_version 248712 (0.00381) [2022-07-09 12:28:12,306][25689] Fps is (10 sec: 5565.2, 60 sec: 5663.2, 300 sec: 5657.5). Total num frames: 254690304. Throughput: 0: 5802.7. Samples: 254696142. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:12,306][25689] Avg episode reward: [(0, '-50.094')] [2022-07-09 12:28:12,531][26022] Updated weights on worker 0-0, policy_version 248722 (0.00082) [2022-07-09 12:28:14,185][26022] Updated weights on worker 0-0, policy_version 248732 (0.00096) [2022-07-09 12:28:15,899][26022] Updated weights on worker 0-0, policy_version 248742 (0.00084) [2022-07-09 12:28:17,400][25689] Fps is (10 sec: 5646.4, 60 sec: 5642.2, 300 sec: 5652.7). Total num frames: 254718976. Throughput: 0: 4943.6. Samples: 254713392. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:17,400][25689] Avg episode reward: [(0, '-49.569')] [2022-07-09 12:28:17,796][26022] Updated weights on worker 0-0, policy_version 248752 (0.00090) [2022-07-09 12:28:19,653][26022] Updated weights on worker 0-0, policy_version 248762 (0.00087) [2022-07-09 12:28:21,363][26022] Updated weights on worker 0-0, policy_version 248772 (0.00088) [2022-07-09 12:28:22,424][25689] Fps is (10 sec: 5768.3, 60 sec: 5675.6, 300 sec: 5653.1). Total num frames: 254748672. Throughput: 0: 5788.0. Samples: 254747466. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:22,425][25689] Avg episode reward: [(0, '-49.116')] [2022-07-09 12:28:23,144][26022] Updated weights on worker 0-0, policy_version 248782 (0.00090) [2022-07-09 12:28:25,024][26022] Updated weights on worker 0-0, policy_version 248792 (0.00103) [2022-07-09 12:28:26,739][26022] Updated weights on worker 0-0, policy_version 248802 (0.00089) [2022-07-09 12:28:27,471][25689] Fps is (10 sec: 5694.0, 60 sec: 5656.7, 300 sec: 5653.9). Total num frames: 254776320. Throughput: 0: 5903.9. Samples: 254781570. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:27,471][25689] Avg episode reward: [(0, '-49.317')] [2022-07-09 12:28:28,659][26022] Updated weights on worker 0-0, policy_version 248812 (0.00081) [2022-07-09 12:28:30,439][26022] Updated weights on worker 0-0, policy_version 248822 (0.00097) [2022-07-09 12:28:32,410][26022] Updated weights on worker 0-0, policy_version 248832 (0.00094) [2022-07-09 12:28:32,483][25689] Fps is (10 sec: 5497.5, 60 sec: 5626.9, 300 sec: 5651.7). Total num frames: 254803968. Throughput: 0: 5072.9. Samples: 254798498. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:32,483][25689] Avg episode reward: [(0, '-50.213')] [2022-07-09 12:28:34,014][26022] Updated weights on worker 0-0, policy_version 248842 (0.00107) [2022-07-09 12:28:35,868][26022] Updated weights on worker 0-0, policy_version 248852 (0.00089) [2022-07-09 12:28:37,468][26022] Updated weights on worker 0-0, policy_version 248862 (0.00090) [2022-07-09 12:28:37,535][25689] Fps is (10 sec: 5799.6, 60 sec: 5645.6, 300 sec: 5658.6). Total num frames: 254834688. Throughput: 0: 5919.9. Samples: 254832590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:37,535][25689] Avg episode reward: [(0, '-50.106')] [2022-07-09 12:28:39,482][26022] Updated weights on worker 0-0, policy_version 248872 (0.00086) [2022-07-09 12:28:41,252][26022] Updated weights on worker 0-0, policy_version 248882 (0.00082) [2022-07-09 12:28:42,607][25689] Fps is (10 sec: 5765.4, 60 sec: 5662.0, 300 sec: 5654.5). Total num frames: 254862336. Throughput: 0: 5899.7. Samples: 254866534. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:42,608][25689] Avg episode reward: [(0, '-49.592')] [2022-07-09 12:28:43,068][26022] Updated weights on worker 0-0, policy_version 248892 (0.00085) [2022-07-09 12:28:44,906][26022] Updated weights on worker 0-0, policy_version 248902 (0.00086) [2022-07-09 12:28:46,571][26022] Updated weights on worker 0-0, policy_version 248912 (0.00084) [2022-07-09 12:28:47,635][25689] Fps is (10 sec: 5475.0, 60 sec: 5609.8, 300 sec: 5650.8). Total num frames: 254889984. Throughput: 0: 5064.7. Samples: 254883694. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:47,635][25689] Avg episode reward: [(0, '-50.131')] [2022-07-09 12:28:48,679][26022] Updated weights on worker 0-0, policy_version 248922 (0.00088) [2022-07-09 12:28:50,078][26022] Updated weights on worker 0-0, policy_version 248932 (0.00093) [2022-07-09 12:28:52,065][26022] Updated weights on worker 0-0, policy_version 248942 (0.00085) [2022-07-09 12:28:52,698][25689] Fps is (10 sec: 5783.7, 60 sec: 5674.1, 300 sec: 5657.7). Total num frames: 254920704. Throughput: 0: 5918.1. Samples: 254918136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:52,699][25689] Avg episode reward: [(0, '-49.792')] [2022-07-09 12:28:53,778][26022] Updated weights on worker 0-0, policy_version 248952 (0.00086) [2022-07-09 12:28:55,116][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:28:55,125][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000248958_254932992.pth [2022-07-09 12:28:55,125][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000246971_252898304.pth [2022-07-09 12:28:55,656][26022] Updated weights on worker 0-0, policy_version 248962 (0.00088) [2022-07-09 12:28:57,746][25689] Fps is (10 sec: 5671.3, 60 sec: 5625.0, 300 sec: 5650.4). Total num frames: 254947328. Throughput: 0: 5896.3. Samples: 254951760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:28:57,746][25689] Avg episode reward: [(0, '-49.550')] [2022-07-09 12:28:57,756][26022] Updated weights on worker 0-0, policy_version 248972 (0.00082) [2022-07-09 12:28:59,334][26022] Updated weights on worker 0-0, policy_version 248982 (0.00072) [2022-07-09 12:29:01,107][26022] Updated weights on worker 0-0, policy_version 248992 (0.00085) [2022-07-09 12:29:02,759][25689] Fps is (10 sec: 5394.5, 60 sec: 5629.4, 300 sec: 5657.3). Total num frames: 254974976. Throughput: 0: 5833.9. Samples: 254984102. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:02,760][25689] Avg episode reward: [(0, '-48.726')] [2022-07-09 12:29:03,275][26022] Updated weights on worker 0-0, policy_version 249002 (0.00087) [2022-07-09 12:29:05,078][26022] Updated weights on worker 0-0, policy_version 249012 (0.00089) [2022-07-09 12:29:06,951][26022] Updated weights on worker 0-0, policy_version 249022 (0.00089) [2022-07-09 12:29:07,774][25689] Fps is (10 sec: 5616.0, 60 sec: 5650.3, 300 sec: 5654.1). Total num frames: 255003648. Throughput: 0: 5838.6. Samples: 255001282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:07,775][25689] Avg episode reward: [(0, '-49.409')] [2022-07-09 12:29:08,523][26022] Updated weights on worker 0-0, policy_version 249032 (0.00088) [2022-07-09 12:29:10,538][26022] Updated weights on worker 0-0, policy_version 249042 (0.00087) [2022-07-09 12:29:12,178][26022] Updated weights on worker 0-0, policy_version 249052 (0.00086) [2022-07-09 12:29:12,815][25689] Fps is (10 sec: 5600.9, 60 sec: 5635.4, 300 sec: 5651.5). Total num frames: 255031296. Throughput: 0: 5810.3. Samples: 255035018. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:12,815][25689] Avg episode reward: [(0, '-49.075')] [2022-07-09 12:29:14,179][26022] Updated weights on worker 0-0, policy_version 249062 (0.00081) [2022-07-09 12:29:15,863][26022] Updated weights on worker 0-0, policy_version 249072 (0.00090) [2022-07-09 12:29:17,803][26022] Updated weights on worker 0-0, policy_version 249082 (0.00458) [2022-07-09 12:29:17,930][25689] Fps is (10 sec: 5545.5, 60 sec: 5633.4, 300 sec: 5649.4). Total num frames: 255059968. Throughput: 0: 5831.2. Samples: 255069460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:17,931][25689] Avg episode reward: [(0, '-48.699')] [2022-07-09 12:29:19,486][26022] Updated weights on worker 0-0, policy_version 249092 (0.00376) [2022-07-09 12:29:21,341][26022] Updated weights on worker 0-0, policy_version 249102 (0.00095) [2022-07-09 12:29:22,947][25689] Fps is (10 sec: 5760.4, 60 sec: 5634.1, 300 sec: 5652.7). Total num frames: 255089664. Throughput: 0: 5068.0. Samples: 255086416. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:22,948][25689] Avg episode reward: [(0, '-49.065')] [2022-07-09 12:29:23,119][26022] Updated weights on worker 0-0, policy_version 249112 (0.00093) [2022-07-09 12:29:24,907][26022] Updated weights on worker 0-0, policy_version 249122 (0.00087) [2022-07-09 12:29:26,844][26022] Updated weights on worker 0-0, policy_version 249132 (0.00098) [2022-07-09 12:29:27,955][25689] Fps is (10 sec: 5720.4, 60 sec: 5637.7, 300 sec: 5649.4). Total num frames: 255117312. Throughput: 0: 5899.6. Samples: 255120338. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:27,955][25689] Avg episode reward: [(0, '-49.156')] [2022-07-09 12:29:28,632][26022] Updated weights on worker 0-0, policy_version 249142 (0.00091) [2022-07-09 12:29:30,601][26022] Updated weights on worker 0-0, policy_version 249152 (0.00086) [2022-07-09 12:29:32,144][26022] Updated weights on worker 0-0, policy_version 249162 (0.00087) [2022-07-09 12:29:33,000][25689] Fps is (10 sec: 5704.6, 60 sec: 5668.5, 300 sec: 5660.1). Total num frames: 255147008. Throughput: 0: 5918.0. Samples: 255154472. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:33,000][25689] Avg episode reward: [(0, '-49.123')] [2022-07-09 12:29:34,002][26022] Updated weights on worker 0-0, policy_version 249172 (0.00090) [2022-07-09 12:29:35,860][26022] Updated weights on worker 0-0, policy_version 249182 (0.00089) [2022-07-09 12:29:37,590][26022] Updated weights on worker 0-0, policy_version 249192 (0.00083) [2022-07-09 12:29:38,090][25689] Fps is (10 sec: 5658.2, 60 sec: 5614.2, 300 sec: 5649.0). Total num frames: 255174656. Throughput: 0: 5069.8. Samples: 255171662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:38,090][25689] Avg episode reward: [(0, '-49.014')] [2022-07-09 12:29:39,472][26022] Updated weights on worker 0-0, policy_version 249202 (0.00088) [2022-07-09 12:29:41,347][26022] Updated weights on worker 0-0, policy_version 249212 (0.00089) [2022-07-09 12:29:43,014][26022] Updated weights on worker 0-0, policy_version 249222 (0.00089) [2022-07-09 12:29:43,179][25689] Fps is (10 sec: 5532.9, 60 sec: 5629.5, 300 sec: 5651.3). Total num frames: 255203328. Throughput: 0: 5892.8. Samples: 255205636. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:43,179][25689] Avg episode reward: [(0, '-48.766')] [2022-07-09 12:29:45,032][26022] Updated weights on worker 0-0, policy_version 249232 (0.00081) [2022-07-09 12:29:46,567][26022] Updated weights on worker 0-0, policy_version 249242 (0.00088) [2022-07-09 12:29:48,203][25689] Fps is (10 sec: 5569.0, 60 sec: 5629.9, 300 sec: 5647.9). Total num frames: 255230976. Throughput: 0: 5890.4. Samples: 255239606. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:48,203][25689] Avg episode reward: [(0, '-48.838')] [2022-07-09 12:29:48,514][26022] Updated weights on worker 0-0, policy_version 249252 (0.00086) [2022-07-09 12:29:50,235][26022] Updated weights on worker 0-0, policy_version 249262 (0.00091) [2022-07-09 12:29:51,960][26022] Updated weights on worker 0-0, policy_version 249272 (0.00088) [2022-07-09 12:29:53,221][25689] Fps is (10 sec: 5710.3, 60 sec: 5617.2, 300 sec: 5650.1). Total num frames: 255260672. Throughput: 0: 5057.5. Samples: 255256742. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:53,221][25689] Avg episode reward: [(0, '-48.162')] [2022-07-09 12:29:53,919][26022] Updated weights on worker 0-0, policy_version 249282 (0.00086) [2022-07-09 12:29:55,387][26022] Updated weights on worker 0-0, policy_version 249292 (0.00085) [2022-07-09 12:29:57,627][26022] Updated weights on worker 0-0, policy_version 249302 (0.00084) [2022-07-09 12:29:58,298][25689] Fps is (10 sec: 5883.0, 60 sec: 5665.2, 300 sec: 5652.5). Total num frames: 255290368. Throughput: 0: 5910.5. Samples: 255291106. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:29:58,299][25689] Avg episode reward: [(0, '-47.910')] [2022-07-09 12:29:59,142][26022] Updated weights on worker 0-0, policy_version 249312 (0.00088) [2022-07-09 12:30:01,127][26022] Updated weights on worker 0-0, policy_version 249322 (0.00083) [2022-07-09 12:30:03,329][25689] Fps is (10 sec: 5470.2, 60 sec: 5629.7, 300 sec: 5648.7). Total num frames: 255315968. Throughput: 0: 5833.6. Samples: 255323188. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:30:03,330][25689] Avg episode reward: [(0, '-47.605')] [2022-07-09 12:30:03,331][26022] Updated weights on worker 0-0, policy_version 249332 (0.00079) [2022-07-09 12:30:05,000][26022] Updated weights on worker 0-0, policy_version 249342 (0.00092) [2022-07-09 12:30:06,786][26022] Updated weights on worker 0-0, policy_version 249352 (0.00091) [2022-07-09 12:30:08,410][25689] Fps is (10 sec: 5367.2, 60 sec: 5623.6, 300 sec: 5650.9). Total num frames: 255344640. Throughput: 0: 4978.8. Samples: 255340214. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:30:08,410][25689] Avg episode reward: [(0, '-47.614')] [2022-07-09 12:30:08,747][26022] Updated weights on worker 0-0, policy_version 249362 (0.00084) [2022-07-09 12:30:10,385][26022] Updated weights on worker 0-0, policy_version 249372 (0.00093) [2022-07-09 12:30:12,372][26022] Updated weights on worker 0-0, policy_version 249382 (0.00094) [2022-07-09 12:30:13,439][25689] Fps is (10 sec: 5672.1, 60 sec: 5641.5, 300 sec: 5648.1). Total num frames: 255373312. Throughput: 0: 5822.0. Samples: 255374454. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:30:13,440][25689] Avg episode reward: [(0, '-47.149')] [2022-07-09 12:30:13,974][26022] Updated weights on worker 0-0, policy_version 249392 (0.00085) [2022-07-09 12:30:15,977][26022] Updated weights on worker 0-0, policy_version 249402 (0.00090) [2022-07-09 12:30:17,574][26022] Updated weights on worker 0-0, policy_version 249412 (0.00091) [2022-07-09 12:30:18,488][25689] Fps is (10 sec: 5690.0, 60 sec: 5647.8, 300 sec: 5647.3). Total num frames: 255401984. Throughput: 0: 5798.4. Samples: 255408174. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:30:18,488][25689] Avg episode reward: [(0, '-47.415')] [2022-07-09 12:30:19,652][26022] Updated weights on worker 0-0, policy_version 249422 (0.00082) [2022-07-09 12:30:21,260][26022] Updated weights on worker 0-0, policy_version 249432 (0.00095) [2022-07-09 12:30:23,033][26022] Updated weights on worker 0-0, policy_version 249442 (0.00086) [2022-07-09 12:30:23,504][25689] Fps is (10 sec: 5595.9, 60 sec: 5614.1, 300 sec: 5647.6). Total num frames: 255429632. Throughput: 0: 5052.6. Samples: 255425120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 12:30:23,504][25689] Avg episode reward: [(0, '-48.542')] [2022-07-09 12:30:24,945][26022] Updated weights on worker 0-0, policy_version 249452 (0.00080) [2022-07-09 12:30:26,873][26022] Updated weights on worker 0-0, policy_version 249462 (0.00205) [2022-07-09 12:30:28,483][26022] Updated weights on worker 0-0, policy_version 249472 (0.00087) [2022-07-09 12:30:28,581][25689] Fps is (10 sec: 5681.4, 60 sec: 5641.4, 300 sec: 5650.0). Total num frames: 255459328. Throughput: 0: 5915.4. Samples: 255459536. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:28,582][25689] Avg episode reward: [(0, '-49.460')] [2022-07-09 12:30:30,255][26022] Updated weights on worker 0-0, policy_version 249482 (0.00088) [2022-07-09 12:30:32,008][26022] Updated weights on worker 0-0, policy_version 249492 (0.00087) [2022-07-09 12:30:33,624][25689] Fps is (10 sec: 5767.3, 60 sec: 5624.6, 300 sec: 5646.5). Total num frames: 255488000. Throughput: 0: 5921.4. Samples: 255493978. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:33,625][25689] Avg episode reward: [(0, '-50.002')] [2022-07-09 12:30:33,914][26022] Updated weights on worker 0-0, policy_version 249502 (0.00085) [2022-07-09 12:30:35,407][26022] Updated weights on worker 0-0, policy_version 249512 (0.00098) [2022-07-09 12:30:37,462][26022] Updated weights on worker 0-0, policy_version 249522 (0.00084) [2022-07-09 12:30:38,686][25689] Fps is (10 sec: 5776.1, 60 sec: 5661.0, 300 sec: 5656.0). Total num frames: 255517696. Throughput: 0: 5094.8. Samples: 255511082. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:38,687][25689] Avg episode reward: [(0, '-49.729')] [2022-07-09 12:30:39,114][26022] Updated weights on worker 0-0, policy_version 249532 (0.00086) [2022-07-09 12:30:41,086][26022] Updated weights on worker 0-0, policy_version 249542 (0.00085) [2022-07-09 12:30:42,629][26022] Updated weights on worker 0-0, policy_version 249552 (0.00088) [2022-07-09 12:30:43,695][25689] Fps is (10 sec: 5795.9, 60 sec: 5668.6, 300 sec: 5649.7). Total num frames: 255546368. Throughput: 0: 5969.3. Samples: 255545648. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:43,696][25689] Avg episode reward: [(0, '-49.203')] [2022-07-09 12:30:44,604][26022] Updated weights on worker 0-0, policy_version 249562 (0.00082) [2022-07-09 12:30:46,270][26022] Updated weights on worker 0-0, policy_version 249572 (0.00088) [2022-07-09 12:30:48,333][26022] Updated weights on worker 0-0, policy_version 249582 (0.00096) [2022-07-09 12:30:48,761][25689] Fps is (10 sec: 5691.9, 60 sec: 5681.5, 300 sec: 5649.3). Total num frames: 255575040. Throughput: 0: 5967.1. Samples: 255579952. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:48,762][25689] Avg episode reward: [(0, '-49.327')] [2022-07-09 12:30:49,906][26022] Updated weights on worker 0-0, policy_version 249592 (0.00091) [2022-07-09 12:30:51,819][26022] Updated weights on worker 0-0, policy_version 249602 (0.00083) [2022-07-09 12:30:53,634][26022] Updated weights on worker 0-0, policy_version 249612 (0.00090) [2022-07-09 12:30:53,777][25689] Fps is (10 sec: 5687.6, 60 sec: 5664.8, 300 sec: 5651.1). Total num frames: 255603712. Throughput: 0: 5105.1. Samples: 255596860. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:53,778][25689] Avg episode reward: [(0, '-49.420')] [2022-07-09 12:30:55,180][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:30:55,194][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000249621_255611904.pth [2022-07-09 12:30:55,195][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000247633_253576192.pth [2022-07-09 12:30:55,447][26022] Updated weights on worker 0-0, policy_version 249622 (0.00084) [2022-07-09 12:30:57,186][26022] Updated weights on worker 0-0, policy_version 249632 (0.00106) [2022-07-09 12:30:58,874][25689] Fps is (10 sec: 5670.4, 60 sec: 5646.1, 300 sec: 5653.4). Total num frames: 255632384. Throughput: 0: 5933.8. Samples: 255630870. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:30:58,875][25689] Avg episode reward: [(0, '-49.051')] [2022-07-09 12:30:59,215][26022] Updated weights on worker 0-0, policy_version 249642 (0.00086) [2022-07-09 12:31:00,659][26022] Updated weights on worker 0-0, policy_version 249652 (0.00089) [2022-07-09 12:31:02,928][26022] Updated weights on worker 0-0, policy_version 249662 (0.00087) [2022-07-09 12:31:03,912][25689] Fps is (10 sec: 5658.1, 60 sec: 5696.2, 300 sec: 5656.9). Total num frames: 255661056. Throughput: 0: 5831.6. Samples: 255663546. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:03,913][25689] Avg episode reward: [(0, '-48.792')] [2022-07-09 12:31:04,571][26022] Updated weights on worker 0-0, policy_version 249672 (0.00085) [2022-07-09 12:31:06,594][26022] Updated weights on worker 0-0, policy_version 249682 (0.00092) [2022-07-09 12:31:08,125][26022] Updated weights on worker 0-0, policy_version 249692 (0.00088) [2022-07-09 12:31:08,919][25689] Fps is (10 sec: 5606.7, 60 sec: 5686.2, 300 sec: 5650.1). Total num frames: 255688704. Throughput: 0: 4995.7. Samples: 255680654. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:08,920][25689] Avg episode reward: [(0, '-49.924')] [2022-07-09 12:31:10,216][26022] Updated weights on worker 0-0, policy_version 249702 (0.00102) [2022-07-09 12:31:11,753][26022] Updated weights on worker 0-0, policy_version 249712 (0.00087) [2022-07-09 12:31:13,868][26022] Updated weights on worker 0-0, policy_version 249722 (0.00090) [2022-07-09 12:31:13,968][25689] Fps is (10 sec: 5397.1, 60 sec: 5650.5, 300 sec: 5646.8). Total num frames: 255715328. Throughput: 0: 5840.5. Samples: 255714782. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:13,968][25689] Avg episode reward: [(0, '-49.628')] [2022-07-09 12:31:15,372][26022] Updated weights on worker 0-0, policy_version 249732 (0.00089) [2022-07-09 12:31:17,382][26022] Updated weights on worker 0-0, policy_version 249742 (0.00087) [2022-07-09 12:31:19,015][25689] Fps is (10 sec: 5578.3, 60 sec: 5667.5, 300 sec: 5649.4). Total num frames: 255745024. Throughput: 0: 5854.4. Samples: 255748786. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:19,016][25689] Avg episode reward: [(0, '-48.826')] [2022-07-09 12:31:19,063][26022] Updated weights on worker 0-0, policy_version 249752 (0.00086) [2022-07-09 12:31:21,031][26022] Updated weights on worker 0-0, policy_version 249762 (0.00091) [2022-07-09 12:31:22,799][26022] Updated weights on worker 0-0, policy_version 249772 (0.00085) [2022-07-09 12:31:24,072][25689] Fps is (10 sec: 5776.7, 60 sec: 5680.6, 300 sec: 5648.5). Total num frames: 255773696. Throughput: 0: 5914.1. Samples: 255782774. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:24,072][25689] Avg episode reward: [(0, '-47.756')] [2022-07-09 12:31:24,628][26022] Updated weights on worker 0-0, policy_version 249782 (0.00089) [2022-07-09 12:31:26,308][26022] Updated weights on worker 0-0, policy_version 249792 (0.00092) [2022-07-09 12:31:28,117][26022] Updated weights on worker 0-0, policy_version 249802 (0.00087) [2022-07-09 12:31:29,100][25689] Fps is (10 sec: 5584.8, 60 sec: 5651.4, 300 sec: 5646.3). Total num frames: 255801344. Throughput: 0: 5905.1. Samples: 255799824. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:29,100][25689] Avg episode reward: [(0, '-47.710')] [2022-07-09 12:31:29,934][26022] Updated weights on worker 0-0, policy_version 249812 (0.00090) [2022-07-09 12:31:31,814][26022] Updated weights on worker 0-0, policy_version 249822 (0.00114) [2022-07-09 12:31:33,470][26022] Updated weights on worker 0-0, policy_version 249832 (0.00090) [2022-07-09 12:31:34,123][25689] Fps is (10 sec: 5603.5, 60 sec: 5653.3, 300 sec: 5646.9). Total num frames: 255830016. Throughput: 0: 5912.7. Samples: 255833952. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:34,123][25689] Avg episode reward: [(0, '-47.678')] [2022-07-09 12:31:35,460][26022] Updated weights on worker 0-0, policy_version 249842 (0.00092) [2022-07-09 12:31:37,187][26022] Updated weights on worker 0-0, policy_version 249852 (0.00087) [2022-07-09 12:31:38,951][26022] Updated weights on worker 0-0, policy_version 249862 (0.00082) [2022-07-09 12:31:39,186][25689] Fps is (10 sec: 5787.0, 60 sec: 5653.2, 300 sec: 5653.1). Total num frames: 255859712. Throughput: 0: 5915.4. Samples: 255868104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:39,187][25689] Avg episode reward: [(0, '-47.243')] [2022-07-09 12:31:40,711][26022] Updated weights on worker 0-0, policy_version 249872 (0.00094) [2022-07-09 12:31:42,611][26022] Updated weights on worker 0-0, policy_version 249882 (0.00087) [2022-07-09 12:31:44,260][25689] Fps is (10 sec: 5859.1, 60 sec: 5664.0, 300 sec: 5655.4). Total num frames: 255889408. Throughput: 0: 5073.9. Samples: 255885204. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:44,260][25689] Avg episode reward: [(0, '-48.733')] [2022-07-09 12:31:44,268][26022] Updated weights on worker 0-0, policy_version 249892 (0.00082) [2022-07-09 12:31:46,325][26022] Updated weights on worker 0-0, policy_version 249902 (0.00094) [2022-07-09 12:31:47,924][26022] Updated weights on worker 0-0, policy_version 249912 (0.00092) [2022-07-09 12:31:49,314][25689] Fps is (10 sec: 5560.6, 60 sec: 5631.3, 300 sec: 5644.1). Total num frames: 255916032. Throughput: 0: 5896.8. Samples: 255919026. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:49,315][25689] Avg episode reward: [(0, '-48.977')] [2022-07-09 12:31:49,951][26022] Updated weights on worker 0-0, policy_version 249922 (0.00087) [2022-07-09 12:31:51,460][26022] Updated weights on worker 0-0, policy_version 249932 (0.00087) [2022-07-09 12:31:53,450][26022] Updated weights on worker 0-0, policy_version 249942 (0.00093) [2022-07-09 12:31:54,336][25689] Fps is (10 sec: 5589.0, 60 sec: 5647.6, 300 sec: 5648.7). Total num frames: 255945728. Throughput: 0: 5888.2. Samples: 255952974. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:54,337][25689] Avg episode reward: [(0, '-49.064')] [2022-07-09 12:31:55,368][26022] Updated weights on worker 0-0, policy_version 249952 (0.00088) [2022-07-09 12:31:57,186][26022] Updated weights on worker 0-0, policy_version 249962 (0.00089) [2022-07-09 12:31:58,898][26022] Updated weights on worker 0-0, policy_version 249972 (0.00084) [2022-07-09 12:31:59,431][25689] Fps is (10 sec: 5769.7, 60 sec: 5647.9, 300 sec: 5650.6). Total num frames: 255974400. Throughput: 0: 5025.5. Samples: 255969844. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:31:59,431][25689] Avg episode reward: [(0, '-49.288')] [2022-07-09 12:32:00,645][26022] Updated weights on worker 0-0, policy_version 249982 (0.00094) [2022-07-09 12:32:02,814][26022] Updated weights on worker 0-0, policy_version 249992 (0.00082) [2022-07-09 12:32:04,466][25689] Fps is (10 sec: 5357.7, 60 sec: 5597.4, 300 sec: 5643.9). Total num frames: 256000000. Throughput: 0: 5780.0. Samples: 256001998. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:04,467][25689] Avg episode reward: [(0, '-49.021')] [2022-07-09 12:32:04,731][26022] Updated weights on worker 0-0, policy_version 250002 (0.00084) [2022-07-09 12:32:06,503][26022] Updated weights on worker 0-0, policy_version 250012 (0.00092) [2022-07-09 12:32:08,263][26022] Updated weights on worker 0-0, policy_version 250022 (0.00090) [2022-07-09 12:32:09,472][25689] Fps is (10 sec: 5404.9, 60 sec: 5614.4, 300 sec: 5647.2). Total num frames: 256028672. Throughput: 0: 5808.1. Samples: 256036104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:09,472][25689] Avg episode reward: [(0, '-48.415')] [2022-07-09 12:32:10,016][26022] Updated weights on worker 0-0, policy_version 250032 (0.00091) [2022-07-09 12:32:11,804][26022] Updated weights on worker 0-0, policy_version 250042 (0.00080) [2022-07-09 12:32:13,666][26022] Updated weights on worker 0-0, policy_version 250052 (0.00096) [2022-07-09 12:32:14,474][25689] Fps is (10 sec: 5627.5, 60 sec: 5635.6, 300 sec: 5641.3). Total num frames: 256056320. Throughput: 0: 4971.1. Samples: 256053080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:14,475][25689] Avg episode reward: [(0, '-47.656')] [2022-07-09 12:32:15,494][26022] Updated weights on worker 0-0, policy_version 250062 (0.00084) [2022-07-09 12:32:17,336][26022] Updated weights on worker 0-0, policy_version 250072 (0.00094) [2022-07-09 12:32:19,252][26022] Updated weights on worker 0-0, policy_version 250082 (0.00086) [2022-07-09 12:32:19,605][25689] Fps is (10 sec: 5658.9, 60 sec: 5627.8, 300 sec: 5646.0). Total num frames: 256086016. Throughput: 0: 5803.9. Samples: 256086936. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:19,606][25689] Avg episode reward: [(0, '-47.690')] [2022-07-09 12:32:21,026][26022] Updated weights on worker 0-0, policy_version 250092 (0.00086) [2022-07-09 12:32:22,770][26022] Updated weights on worker 0-0, policy_version 250102 (0.00096) [2022-07-09 12:32:24,686][25689] Fps is (10 sec: 5615.4, 60 sec: 5608.7, 300 sec: 5641.6). Total num frames: 256113664. Throughput: 0: 5884.5. Samples: 256120982. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:24,687][25689] Avg episode reward: [(0, '-47.411')] [2022-07-09 12:32:24,844][26022] Updated weights on worker 0-0, policy_version 250112 (0.00087) [2022-07-09 12:32:26,208][26022] Updated weights on worker 0-0, policy_version 250122 (0.00086) [2022-07-09 12:32:28,284][26022] Updated weights on worker 0-0, policy_version 250132 (0.00092) [2022-07-09 12:32:29,713][25689] Fps is (10 sec: 5673.4, 60 sec: 5642.6, 300 sec: 5642.1). Total num frames: 256143360. Throughput: 0: 5030.0. Samples: 256137916. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:29,719][25689] Avg episode reward: [(0, '-47.645')] [2022-07-09 12:32:29,919][26022] Updated weights on worker 0-0, policy_version 250142 (0.00084) [2022-07-09 12:32:31,774][26022] Updated weights on worker 0-0, policy_version 250152 (0.00084) [2022-07-09 12:32:33,747][26022] Updated weights on worker 0-0, policy_version 250162 (0.00089) [2022-07-09 12:32:34,728][25689] Fps is (10 sec: 5710.7, 60 sec: 5626.5, 300 sec: 5636.3). Total num frames: 256171008. Throughput: 0: 5869.1. Samples: 256171950. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:34,728][25689] Avg episode reward: [(0, '-47.973')] [2022-07-09 12:32:35,296][26022] Updated weights on worker 0-0, policy_version 250172 (0.00092) [2022-07-09 12:32:37,310][26022] Updated weights on worker 0-0, policy_version 250182 (0.00088) [2022-07-09 12:32:38,957][26022] Updated weights on worker 0-0, policy_version 250192 (0.00090) [2022-07-09 12:32:39,772][25689] Fps is (10 sec: 5598.9, 60 sec: 5611.3, 300 sec: 5643.6). Total num frames: 256199680. Throughput: 0: 5904.3. Samples: 256206006. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:39,773][25689] Avg episode reward: [(0, '-48.546')] [2022-07-09 12:32:40,650][26022] Updated weights on worker 0-0, policy_version 250202 (0.00094) [2022-07-09 12:32:42,732][26022] Updated weights on worker 0-0, policy_version 250212 (0.00089) [2022-07-09 12:32:44,455][26022] Updated weights on worker 0-0, policy_version 250222 (0.00084) [2022-07-09 12:32:44,777][25689] Fps is (10 sec: 5808.0, 60 sec: 5617.7, 300 sec: 5640.3). Total num frames: 256229376. Throughput: 0: 5083.1. Samples: 256223106. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 12:32:44,778][25689] Avg episode reward: [(0, '-49.356')] [2022-07-09 12:32:46,311][26022] Updated weights on worker 0-0, policy_version 250232 (0.00086) [2022-07-09 12:32:48,130][26022] Updated weights on worker 0-0, policy_version 250242 (0.00097) [2022-07-09 12:32:49,790][25689] Fps is (10 sec: 5724.0, 60 sec: 5638.5, 300 sec: 5644.0). Total num frames: 256257024. Throughput: 0: 5941.3. Samples: 256257202. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:32:49,791][25689] Avg episode reward: [(0, '-50.294')] [2022-07-09 12:32:49,875][26022] Updated weights on worker 0-0, policy_version 250252 (0.00088) [2022-07-09 12:32:51,762][26022] Updated weights on worker 0-0, policy_version 250262 (0.00128) [2022-07-09 12:32:53,410][26022] Updated weights on worker 0-0, policy_version 250272 (0.00097) [2022-07-09 12:32:54,809][25689] Fps is (10 sec: 5613.9, 60 sec: 5621.8, 300 sec: 5641.4). Total num frames: 256285696. Throughput: 0: 5953.3. Samples: 256291502. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:32:54,810][25689] Avg episode reward: [(0, '-49.737')] [2022-07-09 12:32:55,278][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:32:55,291][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000250282_256288768.pth [2022-07-09 12:32:55,291][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000248296_254255104.pth [2022-07-09 12:32:55,295][26022] Updated weights on worker 0-0, policy_version 250282 (0.00094) [2022-07-09 12:32:57,323][26022] Updated weights on worker 0-0, policy_version 250292 (0.00086) [2022-07-09 12:32:58,842][26022] Updated weights on worker 0-0, policy_version 250302 (0.00089) [2022-07-09 12:32:59,862][25689] Fps is (10 sec: 5592.0, 60 sec: 5608.8, 300 sec: 5641.6). Total num frames: 256313344. Throughput: 0: 5093.0. Samples: 256308322. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:32:59,862][25689] Avg episode reward: [(0, '-49.996')] [2022-07-09 12:33:00,708][26022] Updated weights on worker 0-0, policy_version 250312 (0.00086) [2022-07-09 12:33:02,929][26022] Updated weights on worker 0-0, policy_version 250322 (0.00088) [2022-07-09 12:33:04,602][26022] Updated weights on worker 0-0, policy_version 250332 (0.01003) [2022-07-09 12:33:04,878][25689] Fps is (10 sec: 5593.6, 60 sec: 5661.4, 300 sec: 5645.8). Total num frames: 256342016. Throughput: 0: 5842.8. Samples: 256340552. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:04,878][25689] Avg episode reward: [(0, '-50.409')] [2022-07-09 12:33:06,475][26022] Updated weights on worker 0-0, policy_version 250342 (0.00086) [2022-07-09 12:33:08,244][26022] Updated weights on worker 0-0, policy_version 250352 (0.00089) [2022-07-09 12:33:09,895][25689] Fps is (10 sec: 5613.3, 60 sec: 5643.4, 300 sec: 5643.2). Total num frames: 256369664. Throughput: 0: 5856.5. Samples: 256374946. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:09,896][25689] Avg episode reward: [(0, '-49.489')] [2022-07-09 12:33:09,936][26022] Updated weights on worker 0-0, policy_version 250362 (0.00093) [2022-07-09 12:33:11,878][26022] Updated weights on worker 0-0, policy_version 250372 (0.00093) [2022-07-09 12:33:13,369][26022] Updated weights on worker 0-0, policy_version 250382 (0.00090) [2022-07-09 12:33:14,918][25689] Fps is (10 sec: 5507.5, 60 sec: 5641.5, 300 sec: 5641.1). Total num frames: 256397312. Throughput: 0: 5010.7. Samples: 256392262. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:14,919][25689] Avg episode reward: [(0, '-50.045')] [2022-07-09 12:33:15,417][26022] Updated weights on worker 0-0, policy_version 250392 (0.00084) [2022-07-09 12:33:17,045][26022] Updated weights on worker 0-0, policy_version 250402 (0.00054) [2022-07-09 12:33:19,096][26022] Updated weights on worker 0-0, policy_version 250412 (0.00095) [2022-07-09 12:33:20,047][25689] Fps is (10 sec: 5547.9, 60 sec: 5624.8, 300 sec: 5635.7). Total num frames: 256425984. Throughput: 0: 5846.8. Samples: 256426340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:20,047][25689] Avg episode reward: [(0, '-49.035')] [2022-07-09 12:33:20,871][26022] Updated weights on worker 0-0, policy_version 250422 (0.00089) [2022-07-09 12:33:22,621][26022] Updated weights on worker 0-0, policy_version 250432 (0.00085) [2022-07-09 12:33:24,312][26022] Updated weights on worker 0-0, policy_version 250442 (0.00085) [2022-07-09 12:33:25,051][25689] Fps is (10 sec: 5861.5, 60 sec: 5682.8, 300 sec: 5646.8). Total num frames: 256456704. Throughput: 0: 5948.9. Samples: 256460558. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:25,051][25689] Avg episode reward: [(0, '-49.212')] [2022-07-09 12:33:26,483][26022] Updated weights on worker 0-0, policy_version 250452 (0.00110) [2022-07-09 12:33:27,872][26022] Updated weights on worker 0-0, policy_version 250462 (0.00925) [2022-07-09 12:33:30,053][25689] Fps is (10 sec: 5628.6, 60 sec: 5617.2, 300 sec: 5640.1). Total num frames: 256482304. Throughput: 0: 5084.0. Samples: 256477428. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:30,053][25689] Avg episode reward: [(0, '-48.647')] [2022-07-09 12:33:30,058][26022] Updated weights on worker 0-0, policy_version 250472 (0.00087) [2022-07-09 12:33:31,642][26022] Updated weights on worker 0-0, policy_version 250482 (0.00085) [2022-07-09 12:33:33,535][26022] Updated weights on worker 0-0, policy_version 250492 (0.00099) [2022-07-09 12:33:35,061][25689] Fps is (10 sec: 5523.9, 60 sec: 5651.8, 300 sec: 5637.5). Total num frames: 256512000. Throughput: 0: 5917.4. Samples: 256511456. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:35,062][25689] Avg episode reward: [(0, '-48.145')] [2022-07-09 12:33:35,403][26022] Updated weights on worker 0-0, policy_version 250502 (0.00095) [2022-07-09 12:33:36,957][26022] Updated weights on worker 0-0, policy_version 250512 (0.00085) [2022-07-09 12:33:39,018][26022] Updated weights on worker 0-0, policy_version 250522 (0.00088) [2022-07-09 12:33:40,130][25689] Fps is (10 sec: 5893.6, 60 sec: 5666.5, 300 sec: 5644.5). Total num frames: 256541696. Throughput: 0: 5936.7. Samples: 256545570. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:40,131][25689] Avg episode reward: [(0, '-48.655')] [2022-07-09 12:33:40,535][26022] Updated weights on worker 0-0, policy_version 250532 (0.00085) [2022-07-09 12:33:42,424][26022] Updated weights on worker 0-0, policy_version 250542 (0.00091) [2022-07-09 12:33:44,323][26022] Updated weights on worker 0-0, policy_version 250552 (0.00086) [2022-07-09 12:33:45,196][25689] Fps is (10 sec: 5759.0, 60 sec: 5643.8, 300 sec: 5647.2). Total num frames: 256570368. Throughput: 0: 5071.9. Samples: 256562734. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:45,197][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 12:33:45,980][26022] Updated weights on worker 0-0, policy_version 250562 (0.00086) [2022-07-09 12:33:47,847][26022] Updated weights on worker 0-0, policy_version 250572 (0.00085) [2022-07-09 12:33:49,871][26022] Updated weights on worker 0-0, policy_version 250582 (0.00096) [2022-07-09 12:33:50,229][25689] Fps is (10 sec: 5576.9, 60 sec: 5641.9, 300 sec: 5637.4). Total num frames: 256598016. Throughput: 0: 5925.3. Samples: 256596978. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:50,229][25689] Avg episode reward: [(0, '-48.395')] [2022-07-09 12:33:51,516][26022] Updated weights on worker 0-0, policy_version 250592 (0.00103) [2022-07-09 12:33:53,648][26022] Updated weights on worker 0-0, policy_version 250602 (0.00085) [2022-07-09 12:33:54,865][26022] Updated weights on worker 0-0, policy_version 250612 (0.00081) [2022-07-09 12:33:55,257][25689] Fps is (10 sec: 5699.6, 60 sec: 5658.0, 300 sec: 5648.1). Total num frames: 256627712. Throughput: 0: 5895.0. Samples: 256630512. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:33:55,258][25689] Avg episode reward: [(0, '-47.591')] [2022-07-09 12:33:57,188][26022] Updated weights on worker 0-0, policy_version 250622 (0.00091) [2022-07-09 12:33:58,614][26022] Updated weights on worker 0-0, policy_version 250632 (0.00086) [2022-07-09 12:34:00,315][25689] Fps is (10 sec: 5584.2, 60 sec: 5640.6, 300 sec: 5643.9). Total num frames: 256654336. Throughput: 0: 5907.4. Samples: 256664808. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:00,315][25689] Avg episode reward: [(0, '-47.713')] [2022-07-09 12:34:00,547][26022] Updated weights on worker 0-0, policy_version 250642 (0.00086) [2022-07-09 12:34:02,908][26022] Updated weights on worker 0-0, policy_version 250652 (0.00095) [2022-07-09 12:34:04,488][26022] Updated weights on worker 0-0, policy_version 250662 (0.00088) [2022-07-09 12:34:05,374][25689] Fps is (10 sec: 5364.8, 60 sec: 5619.7, 300 sec: 5639.6). Total num frames: 256681984. Throughput: 0: 5791.3. Samples: 256679588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:05,375][25689] Avg episode reward: [(0, '-47.968')] [2022-07-09 12:34:06,493][26022] Updated weights on worker 0-0, policy_version 250672 (0.00089) [2022-07-09 12:34:08,128][26022] Updated weights on worker 0-0, policy_version 250682 (0.00090) [2022-07-09 12:34:09,937][26022] Updated weights on worker 0-0, policy_version 250692 (0.00097) [2022-07-09 12:34:10,389][25689] Fps is (10 sec: 5590.4, 60 sec: 5636.8, 300 sec: 5643.5). Total num frames: 256710656. Throughput: 0: 5806.1. Samples: 256714030. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:10,391][25689] Avg episode reward: [(0, '-47.698')] [2022-07-09 12:34:11,794][26022] Updated weights on worker 0-0, policy_version 250702 (0.00083) [2022-07-09 12:34:13,459][26022] Updated weights on worker 0-0, policy_version 250712 (0.00090) [2022-07-09 12:34:15,218][26022] Updated weights on worker 0-0, policy_version 250722 (0.00088) [2022-07-09 12:34:15,407][25689] Fps is (10 sec: 5817.5, 60 sec: 5671.2, 300 sec: 5648.8). Total num frames: 256740352. Throughput: 0: 5872.8. Samples: 256748846. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:15,408][25689] Avg episode reward: [(0, '-47.973')] [2022-07-09 12:34:17,103][26022] Updated weights on worker 0-0, policy_version 250732 (0.00090) [2022-07-09 12:34:18,773][26022] Updated weights on worker 0-0, policy_version 250742 (0.00081) [2022-07-09 12:34:20,482][25689] Fps is (10 sec: 5783.0, 60 sec: 5676.2, 300 sec: 5644.2). Total num frames: 256769024. Throughput: 0: 5034.1. Samples: 256766334. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:20,482][25689] Avg episode reward: [(0, '-48.679')] [2022-07-09 12:34:20,514][26022] Updated weights on worker 0-0, policy_version 250752 (0.00108) [2022-07-09 12:34:22,405][26022] Updated weights on worker 0-0, policy_version 250762 (0.00097) [2022-07-09 12:34:24,370][26022] Updated weights on worker 0-0, policy_version 250772 (0.00090) [2022-07-09 12:34:25,558][25689] Fps is (10 sec: 5649.0, 60 sec: 5635.6, 300 sec: 5646.4). Total num frames: 256797696. Throughput: 0: 5979.8. Samples: 256800286. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:25,559][25689] Avg episode reward: [(0, '-49.822')] [2022-07-09 12:34:25,978][26022] Updated weights on worker 0-0, policy_version 250782 (0.00092) [2022-07-09 12:34:27,956][26022] Updated weights on worker 0-0, policy_version 250792 (0.00088) [2022-07-09 12:34:29,502][26022] Updated weights on worker 0-0, policy_version 250802 (0.00083) [2022-07-09 12:34:30,591][25689] Fps is (10 sec: 5672.4, 60 sec: 5683.4, 300 sec: 5643.2). Total num frames: 256826368. Throughput: 0: 5949.1. Samples: 256834216. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:30,592][25689] Avg episode reward: [(0, '-50.696')] [2022-07-09 12:34:31,416][26022] Updated weights on worker 0-0, policy_version 250812 (0.00093) [2022-07-09 12:34:33,318][26022] Updated weights on worker 0-0, policy_version 250822 (0.00088) [2022-07-09 12:34:35,055][26022] Updated weights on worker 0-0, policy_version 250832 (0.00091) [2022-07-09 12:34:35,650][25689] Fps is (10 sec: 5681.9, 60 sec: 5661.7, 300 sec: 5647.2). Total num frames: 256855040. Throughput: 0: 5064.4. Samples: 256851368. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:35,651][25689] Avg episode reward: [(0, '-50.577')] [2022-07-09 12:34:37,090][26022] Updated weights on worker 0-0, policy_version 250842 (0.00098) [2022-07-09 12:34:38,724][26022] Updated weights on worker 0-0, policy_version 250852 (0.00093) [2022-07-09 12:34:40,696][25689] Fps is (10 sec: 5573.6, 60 sec: 5630.1, 300 sec: 5644.6). Total num frames: 256882688. Throughput: 0: 5872.5. Samples: 256885042. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:40,697][25689] Avg episode reward: [(0, '-50.474')] [2022-07-09 12:34:40,704][26022] Updated weights on worker 0-0, policy_version 250862 (0.00088) [2022-07-09 12:34:42,176][26022] Updated weights on worker 0-0, policy_version 250872 (0.00084) [2022-07-09 12:34:44,142][26022] Updated weights on worker 0-0, policy_version 250882 (0.00097) [2022-07-09 12:34:45,792][25689] Fps is (10 sec: 5654.2, 60 sec: 5644.2, 300 sec: 5650.1). Total num frames: 256912384. Throughput: 0: 5889.4. Samples: 256919454. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:45,793][25689] Avg episode reward: [(0, '-50.191')] [2022-07-09 12:34:45,915][26022] Updated weights on worker 0-0, policy_version 250892 (0.00093) [2022-07-09 12:34:47,665][26022] Updated weights on worker 0-0, policy_version 250902 (0.00096) [2022-07-09 12:34:49,482][26022] Updated weights on worker 0-0, policy_version 250912 (0.00085) [2022-07-09 12:34:50,802][25689] Fps is (10 sec: 5775.9, 60 sec: 5663.3, 300 sec: 5646.8). Total num frames: 256941056. Throughput: 0: 5061.1. Samples: 256936502. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:50,802][25689] Avg episode reward: [(0, '-50.070')] [2022-07-09 12:34:51,304][26022] Updated weights on worker 0-0, policy_version 250922 (0.00096) [2022-07-09 12:34:53,295][26022] Updated weights on worker 0-0, policy_version 250932 (0.00085) [2022-07-09 12:34:54,801][26022] Updated weights on worker 0-0, policy_version 250942 (0.00091) [2022-07-09 12:34:55,433][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:34:55,449][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000250945_256967680.pth [2022-07-09 12:34:55,449][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000248958_254932992.pth [2022-07-09 12:34:55,856][25689] Fps is (10 sec: 5596.2, 60 sec: 5627.0, 300 sec: 5640.4). Total num frames: 256968704. Throughput: 0: 5898.7. Samples: 256970556. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:34:55,857][25689] Avg episode reward: [(0, '-49.746')] [2022-07-09 12:34:56,877][26022] Updated weights on worker 0-0, policy_version 250952 (0.00081) [2022-07-09 12:34:58,463][26022] Updated weights on worker 0-0, policy_version 250962 (0.00093) [2022-07-09 12:35:00,357][26022] Updated weights on worker 0-0, policy_version 250972 (0.00086) [2022-07-09 12:35:00,955][25689] Fps is (10 sec: 5647.6, 60 sec: 5673.8, 300 sec: 5652.8). Total num frames: 256998400. Throughput: 0: 5897.9. Samples: 257004530. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:35:00,956][25689] Avg episode reward: [(0, '-49.321')] [2022-07-09 12:35:02,499][26022] Updated weights on worker 0-0, policy_version 250982 (0.00079) [2022-07-09 12:35:04,250][26022] Updated weights on worker 0-0, policy_version 250992 (0.00097) [2022-07-09 12:35:05,993][25689] Fps is (10 sec: 5556.2, 60 sec: 5659.0, 300 sec: 5646.8). Total num frames: 257025024. Throughput: 0: 4957.9. Samples: 257019606. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-09 12:35:05,993][25689] Avg episode reward: [(0, '-49.568')] [2022-07-09 12:35:06,208][26022] Updated weights on worker 0-0, policy_version 251002 (0.00094) [2022-07-09 12:35:07,944][26022] Updated weights on worker 0-0, policy_version 251012 (0.00096) [2022-07-09 12:35:09,899][26022] Updated weights on worker 0-0, policy_version 251022 (0.00090) [2022-07-09 12:35:10,996][25689] Fps is (10 sec: 5405.2, 60 sec: 5643.2, 300 sec: 5643.8). Total num frames: 257052672. Throughput: 0: 5791.6. Samples: 257053460. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:10,997][25689] Avg episode reward: [(0, '-49.257')] [2022-07-09 12:35:11,714][26022] Updated weights on worker 0-0, policy_version 251032 (0.00082) [2022-07-09 12:35:13,202][26022] Updated weights on worker 0-0, policy_version 251042 (0.00086) [2022-07-09 12:35:15,242][26022] Updated weights on worker 0-0, policy_version 251052 (0.00106) [2022-07-09 12:35:16,000][25689] Fps is (10 sec: 5730.0, 60 sec: 5644.4, 300 sec: 5648.1). Total num frames: 257082368. Throughput: 0: 5828.8. Samples: 257087972. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:16,002][25689] Avg episode reward: [(0, '-48.913')] [2022-07-09 12:35:16,870][26022] Updated weights on worker 0-0, policy_version 251062 (0.00474) [2022-07-09 12:35:18,691][26022] Updated weights on worker 0-0, policy_version 251072 (0.00087) [2022-07-09 12:35:20,433][26022] Updated weights on worker 0-0, policy_version 251082 (0.00085) [2022-07-09 12:35:21,104][25689] Fps is (10 sec: 5673.1, 60 sec: 5624.9, 300 sec: 5646.4). Total num frames: 257110016. Throughput: 0: 4988.5. Samples: 257105042. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:21,105][25689] Avg episode reward: [(0, '-49.083')] [2022-07-09 12:35:22,456][26022] Updated weights on worker 0-0, policy_version 251092 (0.00089) [2022-07-09 12:35:24,138][26022] Updated weights on worker 0-0, policy_version 251102 (0.00084) [2022-07-09 12:35:26,043][26022] Updated weights on worker 0-0, policy_version 251112 (0.00099) [2022-07-09 12:35:26,144][25689] Fps is (10 sec: 5552.1, 60 sec: 5628.2, 300 sec: 5643.7). Total num frames: 257138688. Throughput: 0: 5928.5. Samples: 257139074. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:26,146][25689] Avg episode reward: [(0, '-48.943')] [2022-07-09 12:35:27,692][26022] Updated weights on worker 0-0, policy_version 251122 (0.00091) [2022-07-09 12:35:29,667][26022] Updated weights on worker 0-0, policy_version 251132 (0.00084) [2022-07-09 12:35:31,160][25689] Fps is (10 sec: 5804.3, 60 sec: 5646.7, 300 sec: 5647.7). Total num frames: 257168384. Throughput: 0: 5931.6. Samples: 257173064. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:31,161][25689] Avg episode reward: [(0, '-48.663')] [2022-07-09 12:35:31,328][26022] Updated weights on worker 0-0, policy_version 251142 (0.00090) [2022-07-09 12:35:33,198][26022] Updated weights on worker 0-0, policy_version 251152 (0.00086) [2022-07-09 12:35:35,117][26022] Updated weights on worker 0-0, policy_version 251162 (0.00094) [2022-07-09 12:35:36,219][25689] Fps is (10 sec: 5691.9, 60 sec: 5629.9, 300 sec: 5640.9). Total num frames: 257196032. Throughput: 0: 5052.8. Samples: 257190132. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:36,219][25689] Avg episode reward: [(0, '-49.565')] [2022-07-09 12:35:36,958][26022] Updated weights on worker 0-0, policy_version 251172 (0.00087) [2022-07-09 12:35:38,662][26022] Updated weights on worker 0-0, policy_version 251182 (0.00090) [2022-07-09 12:35:40,493][26022] Updated weights on worker 0-0, policy_version 251192 (0.00093) [2022-07-09 12:35:41,336][25689] Fps is (10 sec: 5635.3, 60 sec: 5657.0, 300 sec: 5642.2). Total num frames: 257225728. Throughput: 0: 5904.0. Samples: 257224490. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:41,336][25689] Avg episode reward: [(0, '-49.417')] [2022-07-09 12:35:41,999][26022] Updated weights on worker 0-0, policy_version 251202 (0.00091) [2022-07-09 12:35:44,121][26022] Updated weights on worker 0-0, policy_version 251212 (0.00091) [2022-07-09 12:35:45,582][26022] Updated weights on worker 0-0, policy_version 251222 (0.00087) [2022-07-09 12:35:46,402][25689] Fps is (10 sec: 5731.8, 60 sec: 5642.9, 300 sec: 5642.2). Total num frames: 257254400. Throughput: 0: 5911.8. Samples: 257258834. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:46,402][25689] Avg episode reward: [(0, '-50.369')] [2022-07-09 12:35:47,599][26022] Updated weights on worker 0-0, policy_version 251232 (0.00086) [2022-07-09 12:35:49,152][26022] Updated weights on worker 0-0, policy_version 251242 (0.00086) [2022-07-09 12:35:51,209][26022] Updated weights on worker 0-0, policy_version 251252 (0.00082) [2022-07-09 12:35:51,451][25689] Fps is (10 sec: 5668.9, 60 sec: 5639.2, 300 sec: 5641.6). Total num frames: 257283072. Throughput: 0: 5072.3. Samples: 257275988. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:51,452][25689] Avg episode reward: [(0, '-50.126')] [2022-07-09 12:35:53,014][26022] Updated weights on worker 0-0, policy_version 251262 (0.00109) [2022-07-09 12:35:54,897][26022] Updated weights on worker 0-0, policy_version 251272 (0.00562) [2022-07-09 12:35:56,498][25689] Fps is (10 sec: 5781.2, 60 sec: 5673.7, 300 sec: 5646.0). Total num frames: 257312768. Throughput: 0: 5912.7. Samples: 257310038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:35:56,499][25689] Avg episode reward: [(0, '-50.483')] [2022-07-09 12:35:56,507][26022] Updated weights on worker 0-0, policy_version 251282 (0.00437) [2022-07-09 12:35:58,564][26022] Updated weights on worker 0-0, policy_version 251292 (0.00083) [2022-07-09 12:36:00,127][26022] Updated weights on worker 0-0, policy_version 251302 (0.00094) [2022-07-09 12:36:01,595][25689] Fps is (10 sec: 5552.5, 60 sec: 5623.3, 300 sec: 5638.0). Total num frames: 257339392. Throughput: 0: 5901.4. Samples: 257344044. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:01,595][25689] Avg episode reward: [(0, '-49.369')] [2022-07-09 12:36:02,528][26022] Updated weights on worker 0-0, policy_version 251312 (0.00090) [2022-07-09 12:36:03,971][26022] Updated weights on worker 0-0, policy_version 251322 (0.00091) [2022-07-09 12:36:06,024][26022] Updated weights on worker 0-0, policy_version 251332 (0.00086) [2022-07-09 12:36:06,604][25689] Fps is (10 sec: 5370.5, 60 sec: 5642.8, 300 sec: 5638.0). Total num frames: 257367040. Throughput: 0: 5804.4. Samples: 257376094. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:06,604][25689] Avg episode reward: [(0, '-49.443')] [2022-07-09 12:36:07,563][26022] Updated weights on worker 0-0, policy_version 251342 (0.00086) [2022-07-09 12:36:09,528][26022] Updated weights on worker 0-0, policy_version 251352 (0.00094) [2022-07-09 12:36:11,383][26022] Updated weights on worker 0-0, policy_version 251362 (0.00091) [2022-07-09 12:36:11,627][25689] Fps is (10 sec: 5511.9, 60 sec: 5641.0, 300 sec: 5641.9). Total num frames: 257394688. Throughput: 0: 5807.1. Samples: 257393148. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:11,629][25689] Avg episode reward: [(0, '-49.674')] [2022-07-09 12:36:13,240][26022] Updated weights on worker 0-0, policy_version 251372 (0.00088) [2022-07-09 12:36:15,388][26022] Updated weights on worker 0-0, policy_version 251382 (0.00093) [2022-07-09 12:36:16,642][25689] Fps is (10 sec: 5712.7, 60 sec: 5640.0, 300 sec: 5642.5). Total num frames: 257424384. Throughput: 0: 5809.0. Samples: 257427052. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:16,642][25689] Avg episode reward: [(0, '-48.981')] [2022-07-09 12:36:16,783][26022] Updated weights on worker 0-0, policy_version 251392 (0.00090) [2022-07-09 12:36:18,718][26022] Updated weights on worker 0-0, policy_version 251402 (0.00092) [2022-07-09 12:36:20,257][26022] Updated weights on worker 0-0, policy_version 251412 (0.00086) [2022-07-09 12:36:21,699][25689] Fps is (10 sec: 5591.4, 60 sec: 5627.4, 300 sec: 5635.6). Total num frames: 257451008. Throughput: 0: 5818.1. Samples: 257461016. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:21,700][25689] Avg episode reward: [(0, '-48.793')] [2022-07-09 12:36:22,015][26022] Updated weights on worker 0-0, policy_version 251422 (0.00103) [2022-07-09 12:36:24,341][26022] Updated weights on worker 0-0, policy_version 251432 (0.00085) [2022-07-09 12:36:25,772][26022] Updated weights on worker 0-0, policy_version 251442 (0.00090) [2022-07-09 12:36:26,711][25689] Fps is (10 sec: 5593.3, 60 sec: 5646.9, 300 sec: 5642.8). Total num frames: 257480704. Throughput: 0: 5073.4. Samples: 257478106. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:26,712][25689] Avg episode reward: [(0, '-49.186')] [2022-07-09 12:36:27,792][26022] Updated weights on worker 0-0, policy_version 251452 (0.00086) [2022-07-09 12:36:29,571][26022] Updated weights on worker 0-0, policy_version 251462 (0.00103) [2022-07-09 12:36:31,294][26022] Updated weights on worker 0-0, policy_version 251472 (0.00102) [2022-07-09 12:36:31,742][25689] Fps is (10 sec: 5812.0, 60 sec: 5628.6, 300 sec: 5642.6). Total num frames: 257509376. Throughput: 0: 5905.9. Samples: 257511948. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:31,744][25689] Avg episode reward: [(0, '-49.193')] [2022-07-09 12:36:33,382][26022] Updated weights on worker 0-0, policy_version 251482 (0.00087) [2022-07-09 12:36:34,860][26022] Updated weights on worker 0-0, policy_version 251492 (0.00099) [2022-07-09 12:36:36,763][25689] Fps is (10 sec: 5603.1, 60 sec: 5632.1, 300 sec: 5636.6). Total num frames: 257537024. Throughput: 0: 5912.1. Samples: 257546008. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:36,763][25689] Avg episode reward: [(0, '-49.556')] [2022-07-09 12:36:36,906][26022] Updated weights on worker 0-0, policy_version 251502 (0.00092) [2022-07-09 12:36:38,650][26022] Updated weights on worker 0-0, policy_version 251512 (0.00094) [2022-07-09 12:36:40,368][26022] Updated weights on worker 0-0, policy_version 251522 (0.00084) [2022-07-09 12:36:41,843][25689] Fps is (10 sec: 5677.3, 60 sec: 5635.6, 300 sec: 5636.4). Total num frames: 257566720. Throughput: 0: 5060.3. Samples: 257562948. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:41,844][25689] Avg episode reward: [(0, '-50.095')] [2022-07-09 12:36:42,138][26022] Updated weights on worker 0-0, policy_version 251532 (0.00081) [2022-07-09 12:36:43,994][26022] Updated weights on worker 0-0, policy_version 251542 (0.00085) [2022-07-09 12:36:45,773][26022] Updated weights on worker 0-0, policy_version 251552 (0.00087) [2022-07-09 12:36:46,896][25689] Fps is (10 sec: 5760.0, 60 sec: 5636.8, 300 sec: 5643.4). Total num frames: 257595392. Throughput: 0: 5927.2. Samples: 257597746. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:46,902][25689] Avg episode reward: [(0, '-50.413')] [2022-07-09 12:36:47,473][26022] Updated weights on worker 0-0, policy_version 251562 (0.00086) [2022-07-09 12:36:49,392][26022] Updated weights on worker 0-0, policy_version 251572 (0.00089) [2022-07-09 12:36:51,012][26022] Updated weights on worker 0-0, policy_version 251582 (0.00052) [2022-07-09 12:36:51,975][25689] Fps is (10 sec: 5659.6, 60 sec: 5634.1, 300 sec: 5638.8). Total num frames: 257624064. Throughput: 0: 5932.8. Samples: 257631984. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:51,975][25689] Avg episode reward: [(0, '-50.766')] [2022-07-09 12:36:53,025][26022] Updated weights on worker 0-0, policy_version 251592 (0.00085) [2022-07-09 12:36:54,533][26022] Updated weights on worker 0-0, policy_version 251602 (0.00087) [2022-07-09 12:36:55,641][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:36:55,654][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000251606_257644544.pth [2022-07-09 12:36:55,654][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000249621_255611904.pth [2022-07-09 12:36:56,447][26022] Updated weights on worker 0-0, policy_version 251612 (0.00094) [2022-07-09 12:36:56,986][25689] Fps is (10 sec: 5785.0, 60 sec: 5637.4, 300 sec: 5643.9). Total num frames: 257653760. Throughput: 0: 5092.8. Samples: 257649000. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:36:56,986][25689] Avg episode reward: [(0, '-51.810')] [2022-07-09 12:36:58,312][26022] Updated weights on worker 0-0, policy_version 251622 (0.00093) [2022-07-09 12:36:59,957][26022] Updated weights on worker 0-0, policy_version 251632 (0.00090) [2022-07-09 12:37:02,062][25689] Fps is (10 sec: 5481.8, 60 sec: 5622.3, 300 sec: 5643.1). Total num frames: 257679360. Throughput: 0: 5931.9. Samples: 257682888. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:37:02,063][25689] Avg episode reward: [(0, '-51.495')] [2022-07-09 12:37:02,448][26022] Updated weights on worker 0-0, policy_version 251642 (0.00095) [2022-07-09 12:37:04,102][26022] Updated weights on worker 0-0, policy_version 251652 (0.00092) [2022-07-09 12:37:05,998][26022] Updated weights on worker 0-0, policy_version 251662 (0.00087) [2022-07-09 12:37:07,106][25689] Fps is (10 sec: 5362.5, 60 sec: 5636.0, 300 sec: 5642.4). Total num frames: 257708032. Throughput: 0: 5797.2. Samples: 257714910. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:37:07,107][25689] Avg episode reward: [(0, '-51.667')] [2022-07-09 12:37:07,768][26022] Updated weights on worker 0-0, policy_version 251672 (0.00629) [2022-07-09 12:37:09,431][26022] Updated weights on worker 0-0, policy_version 251682 (0.00086) [2022-07-09 12:37:11,376][26022] Updated weights on worker 0-0, policy_version 251692 (0.00092) [2022-07-09 12:37:12,146][25689] Fps is (10 sec: 5584.9, 60 sec: 5634.4, 300 sec: 5641.6). Total num frames: 257735680. Throughput: 0: 4964.6. Samples: 257732128. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:37:12,147][25689] Avg episode reward: [(0, '-51.407')] [2022-07-09 12:37:12,972][26022] Updated weights on worker 0-0, policy_version 251702 (0.00092) [2022-07-09 12:37:15,035][26022] Updated weights on worker 0-0, policy_version 251712 (0.00089) [2022-07-09 12:37:16,595][26022] Updated weights on worker 0-0, policy_version 251722 (0.00088) [2022-07-09 12:37:17,205][25689] Fps is (10 sec: 5678.7, 60 sec: 5630.4, 300 sec: 5643.0). Total num frames: 257765376. Throughput: 0: 5806.4. Samples: 257766398. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:37:17,205][25689] Avg episode reward: [(0, '-51.314')] [2022-07-09 12:37:18,464][26022] Updated weights on worker 0-0, policy_version 251732 (0.00093) [2022-07-09 12:37:20,514][26022] Updated weights on worker 0-0, policy_version 251742 (0.00085) [2022-07-09 12:37:22,060][26022] Updated weights on worker 0-0, policy_version 251752 (0.00091) [2022-07-09 12:37:22,305][25689] Fps is (10 sec: 5846.4, 60 sec: 5677.1, 300 sec: 5649.5). Total num frames: 257795072. Throughput: 0: 5813.7. Samples: 257800576. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:37:22,306][25689] Avg episode reward: [(0, '-50.122')] [2022-07-09 12:37:23,989][26022] Updated weights on worker 0-0, policy_version 251762 (0.00083) [2022-07-09 12:37:25,769][26022] Updated weights on worker 0-0, policy_version 251772 (0.00094) [2022-07-09 12:37:27,362][25689] Fps is (10 sec: 5746.4, 60 sec: 5656.0, 300 sec: 5645.5). Total num frames: 257823744. Throughput: 0: 5066.5. Samples: 257817534. Policy #0 lag: (min: 0.0, avg: 7.8, max: 21.0) [2022-07-09 12:37:27,362][25689] Avg episode reward: [(0, '-49.441')] [2022-07-09 12:37:27,490][26022] Updated weights on worker 0-0, policy_version 251782 (0.00088) [2022-07-09 12:37:29,488][26022] Updated weights on worker 0-0, policy_version 251792 (0.00094) [2022-07-09 12:37:30,818][26022] Updated weights on worker 0-0, policy_version 251802 (0.00088) [2022-07-09 12:37:32,455][25689] Fps is (10 sec: 5548.9, 60 sec: 5633.3, 300 sec: 5644.0). Total num frames: 257851392. Throughput: 0: 5893.7. Samples: 257851820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:37:32,457][25689] Avg episode reward: [(0, '-49.716')] [2022-07-09 12:37:33,122][26022] Updated weights on worker 0-0, policy_version 251812 (0.00095) [2022-07-09 12:37:34,695][26022] Updated weights on worker 0-0, policy_version 251822 (0.00083) [2022-07-09 12:37:36,465][26022] Updated weights on worker 0-0, policy_version 251832 (0.00098) [2022-07-09 12:37:37,491][25689] Fps is (10 sec: 5560.5, 60 sec: 5648.8, 300 sec: 5644.2). Total num frames: 257880064. Throughput: 0: 5897.9. Samples: 257886042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:37:37,492][25689] Avg episode reward: [(0, '-48.714')] [2022-07-09 12:37:38,420][26022] Updated weights on worker 0-0, policy_version 251842 (0.00091) [2022-07-09 12:37:39,878][26022] Updated weights on worker 0-0, policy_version 251852 (0.00100) [2022-07-09 12:37:42,011][26022] Updated weights on worker 0-0, policy_version 251862 (0.00090) [2022-07-09 12:37:42,536][25689] Fps is (10 sec: 5993.1, 60 sec: 5685.8, 300 sec: 5650.3). Total num frames: 257911808. Throughput: 0: 5061.2. Samples: 257902968. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:37:42,536][25689] Avg episode reward: [(0, '-48.519')] [2022-07-09 12:37:43,960][26022] Updated weights on worker 0-0, policy_version 251872 (0.00085) [2022-07-09 12:37:45,399][26022] Updated weights on worker 0-0, policy_version 251882 (0.00087) [2022-07-09 12:37:47,574][25689] Fps is (10 sec: 5686.8, 60 sec: 5636.6, 300 sec: 5643.0). Total num frames: 257937408. Throughput: 0: 5927.5. Samples: 257937342. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:37:47,575][25689] Avg episode reward: [(0, '-49.367')] [2022-07-09 12:37:47,580][26022] Updated weights on worker 0-0, policy_version 251892 (0.00086) [2022-07-09 12:37:48,981][26022] Updated weights on worker 0-0, policy_version 251902 (0.00095) [2022-07-09 12:37:50,941][26022] Updated weights on worker 0-0, policy_version 251912 (0.00083) [2022-07-09 12:37:52,611][25689] Fps is (10 sec: 5386.7, 60 sec: 5640.5, 300 sec: 5642.6). Total num frames: 257966080. Throughput: 0: 5940.9. Samples: 257971566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:37:52,613][25689] Avg episode reward: [(0, '-49.343')] [2022-07-09 12:37:52,828][26022] Updated weights on worker 0-0, policy_version 251922 (0.00091) [2022-07-09 12:37:54,483][26022] Updated weights on worker 0-0, policy_version 251932 (0.00070) [2022-07-09 12:37:56,382][26022] Updated weights on worker 0-0, policy_version 251942 (0.00089) [2022-07-09 12:37:57,649][25689] Fps is (10 sec: 5895.2, 60 sec: 5654.8, 300 sec: 5653.2). Total num frames: 257996800. Throughput: 0: 5087.6. Samples: 257988600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:37:57,650][25689] Avg episode reward: [(0, '-49.899')] [2022-07-09 12:37:58,136][26022] Updated weights on worker 0-0, policy_version 251952 (0.00089) [2022-07-09 12:37:59,871][26022] Updated weights on worker 0-0, policy_version 251962 (0.00093) [2022-07-09 12:38:02,215][26022] Updated weights on worker 0-0, policy_version 251972 (0.00095) [2022-07-09 12:38:02,727][25689] Fps is (10 sec: 5567.6, 60 sec: 5654.7, 300 sec: 5641.7). Total num frames: 258022400. Throughput: 0: 5940.6. Samples: 258022914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:02,727][25689] Avg episode reward: [(0, '-49.740')] [2022-07-09 12:38:03,728][26022] Updated weights on worker 0-0, policy_version 251982 (0.00080) [2022-07-09 12:38:05,659][26022] Updated weights on worker 0-0, policy_version 251992 (0.00490) [2022-07-09 12:38:07,329][26022] Updated weights on worker 0-0, policy_version 252002 (0.00085) [2022-07-09 12:38:07,735][25689] Fps is (10 sec: 5381.0, 60 sec: 5658.1, 300 sec: 5645.3). Total num frames: 258051072. Throughput: 0: 5850.3. Samples: 258055286. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:07,735][25689] Avg episode reward: [(0, '-50.521')] [2022-07-09 12:38:09,186][26022] Updated weights on worker 0-0, policy_version 252012 (0.00089) [2022-07-09 12:38:11,115][26022] Updated weights on worker 0-0, policy_version 252022 (0.00080) [2022-07-09 12:38:12,713][26022] Updated weights on worker 0-0, policy_version 252032 (0.00102) [2022-07-09 12:38:12,752][25689] Fps is (10 sec: 5822.3, 60 sec: 5694.0, 300 sec: 5652.3). Total num frames: 258080768. Throughput: 0: 5853.6. Samples: 258089460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:12,752][25689] Avg episode reward: [(0, '-47.917')] [2022-07-09 12:38:14,485][26022] Updated weights on worker 0-0, policy_version 252042 (0.00741) [2022-07-09 12:38:16,376][26022] Updated weights on worker 0-0, policy_version 252052 (0.00085) [2022-07-09 12:38:17,767][25689] Fps is (10 sec: 5818.3, 60 sec: 5681.2, 300 sec: 5654.5). Total num frames: 258109440. Throughput: 0: 5882.5. Samples: 258106942. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:17,767][25689] Avg episode reward: [(0, '-48.015')] [2022-07-09 12:38:17,927][26022] Updated weights on worker 0-0, policy_version 252062 (0.00086) [2022-07-09 12:38:19,947][26022] Updated weights on worker 0-0, policy_version 252072 (0.00083) [2022-07-09 12:38:21,519][26022] Updated weights on worker 0-0, policy_version 252082 (0.00086) [2022-07-09 12:38:22,809][25689] Fps is (10 sec: 5599.9, 60 sec: 5652.8, 300 sec: 5643.5). Total num frames: 258137088. Throughput: 0: 5894.5. Samples: 258141288. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:22,810][25689] Avg episode reward: [(0, '-47.748')] [2022-07-09 12:38:23,476][26022] Updated weights on worker 0-0, policy_version 252092 (0.00104) [2022-07-09 12:38:25,435][26022] Updated weights on worker 0-0, policy_version 252102 (0.00112) [2022-07-09 12:38:27,215][26022] Updated weights on worker 0-0, policy_version 252112 (0.00078) [2022-07-09 12:38:27,813][25689] Fps is (10 sec: 5606.2, 60 sec: 5657.8, 300 sec: 5653.7). Total num frames: 258165760. Throughput: 0: 5974.6. Samples: 258175244. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:27,813][25689] Avg episode reward: [(0, '-47.384')] [2022-07-09 12:38:28,867][26022] Updated weights on worker 0-0, policy_version 252122 (0.00089) [2022-07-09 12:38:30,997][26022] Updated weights on worker 0-0, policy_version 252132 (0.00081) [2022-07-09 12:38:32,502][26022] Updated weights on worker 0-0, policy_version 252142 (0.00087) [2022-07-09 12:38:32,825][25689] Fps is (10 sec: 5725.1, 60 sec: 5682.3, 300 sec: 5650.2). Total num frames: 258194432. Throughput: 0: 5120.7. Samples: 258192250. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:32,826][25689] Avg episode reward: [(0, '-47.481')] [2022-07-09 12:38:34,606][26022] Updated weights on worker 0-0, policy_version 252152 (0.00086) [2022-07-09 12:38:36,096][26022] Updated weights on worker 0-0, policy_version 252162 (0.00092) [2022-07-09 12:38:37,844][25689] Fps is (10 sec: 5716.3, 60 sec: 5683.8, 300 sec: 5647.7). Total num frames: 258223104. Throughput: 0: 5944.5. Samples: 258226294. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:37,845][25689] Avg episode reward: [(0, '-47.989')] [2022-07-09 12:38:37,926][26022] Updated weights on worker 0-0, policy_version 252172 (0.00085) [2022-07-09 12:38:39,753][26022] Updated weights on worker 0-0, policy_version 252182 (0.00083) [2022-07-09 12:38:41,649][26022] Updated weights on worker 0-0, policy_version 252192 (0.00087) [2022-07-09 12:38:42,926][25689] Fps is (10 sec: 5778.7, 60 sec: 5646.5, 300 sec: 5650.9). Total num frames: 258252800. Throughput: 0: 5940.2. Samples: 258260786. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:42,926][25689] Avg episode reward: [(0, '-47.888')] [2022-07-09 12:38:43,375][26022] Updated weights on worker 0-0, policy_version 252202 (0.00089) [2022-07-09 12:38:45,242][26022] Updated weights on worker 0-0, policy_version 252212 (0.00088) [2022-07-09 12:38:47,095][26022] Updated weights on worker 0-0, policy_version 252222 (0.00095) [2022-07-09 12:38:47,937][25689] Fps is (10 sec: 5681.4, 60 sec: 5682.9, 300 sec: 5651.3). Total num frames: 258280448. Throughput: 0: 5100.3. Samples: 258277886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:47,939][25689] Avg episode reward: [(0, '-48.571')] [2022-07-09 12:38:48,809][26022] Updated weights on worker 0-0, policy_version 252232 (0.00087) [2022-07-09 12:38:50,425][26022] Updated weights on worker 0-0, policy_version 252242 (0.00092) [2022-07-09 12:38:52,676][26022] Updated weights on worker 0-0, policy_version 252252 (0.00088) [2022-07-09 12:38:52,963][25689] Fps is (10 sec: 5509.2, 60 sec: 5667.1, 300 sec: 5644.4). Total num frames: 258308096. Throughput: 0: 5942.0. Samples: 258311908. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:52,966][25689] Avg episode reward: [(0, '-49.341')] [2022-07-09 12:38:54,154][26022] Updated weights on worker 0-0, policy_version 252262 (0.00084) [2022-07-09 12:38:55,656][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:38:55,672][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000252269_258323456.pth [2022-07-09 12:38:55,672][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000250282_256288768.pth [2022-07-09 12:38:56,260][26022] Updated weights on worker 0-0, policy_version 252272 (0.00095) [2022-07-09 12:38:57,720][26022] Updated weights on worker 0-0, policy_version 252282 (0.00085) [2022-07-09 12:38:57,987][25689] Fps is (10 sec: 5706.3, 60 sec: 5651.4, 300 sec: 5655.4). Total num frames: 258337792. Throughput: 0: 5931.6. Samples: 258345772. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:38:57,987][25689] Avg episode reward: [(0, '-50.291')] [2022-07-09 12:38:59,807][26022] Updated weights on worker 0-0, policy_version 252292 (0.00083) [2022-07-09 12:39:01,422][26022] Updated weights on worker 0-0, policy_version 252302 (0.00087) [2022-07-09 12:39:03,036][25689] Fps is (10 sec: 5387.6, 60 sec: 5637.0, 300 sec: 5645.3). Total num frames: 258362368. Throughput: 0: 5081.2. Samples: 258362972. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:03,037][25689] Avg episode reward: [(0, '-49.783')] [2022-07-09 12:39:03,703][26022] Updated weights on worker 0-0, policy_version 252312 (0.00084) [2022-07-09 12:39:05,622][26022] Updated weights on worker 0-0, policy_version 252322 (0.00091) [2022-07-09 12:39:07,333][26022] Updated weights on worker 0-0, policy_version 252332 (0.00092) [2022-07-09 12:39:08,047][25689] Fps is (10 sec: 5496.6, 60 sec: 5670.8, 300 sec: 5652.2). Total num frames: 258393088. Throughput: 0: 5796.7. Samples: 258394452. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:08,047][25689] Avg episode reward: [(0, '-50.104')] [2022-07-09 12:39:09,139][26022] Updated weights on worker 0-0, policy_version 252342 (0.00088) [2022-07-09 12:39:10,866][26022] Updated weights on worker 0-0, policy_version 252352 (0.00091) [2022-07-09 12:39:12,851][26022] Updated weights on worker 0-0, policy_version 252362 (0.00090) [2022-07-09 12:39:13,053][25689] Fps is (10 sec: 5725.0, 60 sec: 5620.9, 300 sec: 5642.1). Total num frames: 258419712. Throughput: 0: 5805.8. Samples: 258428546. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:13,053][25689] Avg episode reward: [(0, '-50.857')] [2022-07-09 12:39:14,650][26022] Updated weights on worker 0-0, policy_version 252372 (0.00085) [2022-07-09 12:39:16,104][26022] Updated weights on worker 0-0, policy_version 252382 (0.00088) [2022-07-09 12:39:18,077][25689] Fps is (10 sec: 5410.9, 60 sec: 5603.0, 300 sec: 5639.7). Total num frames: 258447360. Throughput: 0: 4975.3. Samples: 258445726. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:18,077][25689] Avg episode reward: [(0, '-50.976')] [2022-07-09 12:39:18,202][26022] Updated weights on worker 0-0, policy_version 252392 (0.00081) [2022-07-09 12:39:19,761][26022] Updated weights on worker 0-0, policy_version 252402 (0.00081) [2022-07-09 12:39:21,785][26022] Updated weights on worker 0-0, policy_version 252412 (0.00085) [2022-07-09 12:39:23,138][25689] Fps is (10 sec: 5787.7, 60 sec: 5652.2, 300 sec: 5646.8). Total num frames: 258478080. Throughput: 0: 5824.1. Samples: 258480044. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:23,138][25689] Avg episode reward: [(0, '-50.855')] [2022-07-09 12:39:23,426][26022] Updated weights on worker 0-0, policy_version 252422 (0.00083) [2022-07-09 12:39:25,166][26022] Updated weights on worker 0-0, policy_version 252432 (0.00088) [2022-07-09 12:39:27,278][26022] Updated weights on worker 0-0, policy_version 252442 (0.00097) [2022-07-09 12:39:28,174][25689] Fps is (10 sec: 5881.9, 60 sec: 5649.2, 300 sec: 5646.8). Total num frames: 258506752. Throughput: 0: 5953.6. Samples: 258514284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:28,175][25689] Avg episode reward: [(0, '-49.934')] [2022-07-09 12:39:28,880][26022] Updated weights on worker 0-0, policy_version 252452 (0.00084) [2022-07-09 12:39:30,675][26022] Updated weights on worker 0-0, policy_version 252462 (0.00092) [2022-07-09 12:39:32,578][26022] Updated weights on worker 0-0, policy_version 252472 (0.00089) [2022-07-09 12:39:33,258][25689] Fps is (10 sec: 5564.9, 60 sec: 5625.5, 300 sec: 5642.9). Total num frames: 258534400. Throughput: 0: 5089.9. Samples: 258531390. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:33,259][25689] Avg episode reward: [(0, '-49.601')] [2022-07-09 12:39:34,264][26022] Updated weights on worker 0-0, policy_version 252482 (0.00108) [2022-07-09 12:39:36,126][26022] Updated weights on worker 0-0, policy_version 252492 (0.00096) [2022-07-09 12:39:37,693][26022] Updated weights on worker 0-0, policy_version 252502 (0.00084) [2022-07-09 12:39:38,294][25689] Fps is (10 sec: 5767.9, 60 sec: 5657.9, 300 sec: 5653.4). Total num frames: 258565120. Throughput: 0: 5948.9. Samples: 258565994. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:38,294][25689] Avg episode reward: [(0, '-50.143')] [2022-07-09 12:39:39,526][26022] Updated weights on worker 0-0, policy_version 252512 (0.00785) [2022-07-09 12:39:41,347][26022] Updated weights on worker 0-0, policy_version 252522 (0.00090) [2022-07-09 12:39:43,037][26022] Updated weights on worker 0-0, policy_version 252532 (0.00087) [2022-07-09 12:39:43,347][25689] Fps is (10 sec: 5886.8, 60 sec: 5643.5, 300 sec: 5650.7). Total num frames: 258593792. Throughput: 0: 5961.9. Samples: 258600532. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 12:39:43,348][25689] Avg episode reward: [(0, '-49.135')] [2022-07-09 12:39:44,820][26022] Updated weights on worker 0-0, policy_version 252542 (0.00086) [2022-07-09 12:39:46,802][26022] Updated weights on worker 0-0, policy_version 252552 (0.00085) [2022-07-09 12:39:48,321][26022] Updated weights on worker 0-0, policy_version 252562 (0.00090) [2022-07-09 12:39:48,408][25689] Fps is (10 sec: 5770.6, 60 sec: 5672.8, 300 sec: 5653.2). Total num frames: 258623488. Throughput: 0: 5117.9. Samples: 258617840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:39:48,409][25689] Avg episode reward: [(0, '-48.542')] [2022-07-09 12:39:50,241][26022] Updated weights on worker 0-0, policy_version 252572 (0.00091) [2022-07-09 12:39:51,828][26022] Updated weights on worker 0-0, policy_version 252582 (0.00083) [2022-07-09 12:39:53,427][25689] Fps is (10 sec: 5689.1, 60 sec: 5673.4, 300 sec: 5653.9). Total num frames: 258651136. Throughput: 0: 6000.6. Samples: 258652414. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:39:53,427][25689] Avg episode reward: [(0, '-48.165')] [2022-07-09 12:39:53,810][26022] Updated weights on worker 0-0, policy_version 252592 (0.00085) [2022-07-09 12:39:55,698][26022] Updated weights on worker 0-0, policy_version 252602 (0.00093) [2022-07-09 12:39:57,459][26022] Updated weights on worker 0-0, policy_version 252612 (0.00083) [2022-07-09 12:39:58,463][25689] Fps is (10 sec: 5499.7, 60 sec: 5638.5, 300 sec: 5648.2). Total num frames: 258678784. Throughput: 0: 5972.7. Samples: 258686458. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:39:58,463][25689] Avg episode reward: [(0, '-47.607')] [2022-07-09 12:39:59,224][26022] Updated weights on worker 0-0, policy_version 252622 (0.00090) [2022-07-09 12:40:01,347][26022] Updated weights on worker 0-0, policy_version 252632 (0.00510) [2022-07-09 12:40:03,276][26022] Updated weights on worker 0-0, policy_version 252642 (0.00081) [2022-07-09 12:40:03,526][25689] Fps is (10 sec: 5475.4, 60 sec: 5688.0, 300 sec: 5651.2). Total num frames: 258706432. Throughput: 0: 5103.6. Samples: 258703516. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:03,528][25689] Avg episode reward: [(0, '-47.553')] [2022-07-09 12:40:05,069][26022] Updated weights on worker 0-0, policy_version 252652 (0.00080) [2022-07-09 12:40:06,914][26022] Updated weights on worker 0-0, policy_version 252662 (0.00089) [2022-07-09 12:40:08,531][25689] Fps is (10 sec: 5593.6, 60 sec: 5654.6, 300 sec: 5654.6). Total num frames: 258735104. Throughput: 0: 5842.7. Samples: 258735414. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:08,533][25689] Avg episode reward: [(0, '-47.233')] [2022-07-09 12:40:08,618][26022] Updated weights on worker 0-0, policy_version 252672 (0.00090) [2022-07-09 12:40:10,662][26022] Updated weights on worker 0-0, policy_version 252682 (0.00093) [2022-07-09 12:40:12,106][26022] Updated weights on worker 0-0, policy_version 252692 (0.00088) [2022-07-09 12:40:13,567][25689] Fps is (10 sec: 5609.1, 60 sec: 5668.8, 300 sec: 5647.1). Total num frames: 258762752. Throughput: 0: 5840.8. Samples: 258770046. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:13,568][25689] Avg episode reward: [(0, '-47.727')] [2022-07-09 12:40:14,239][26022] Updated weights on worker 0-0, policy_version 252702 (0.00498) [2022-07-09 12:40:15,744][26022] Updated weights on worker 0-0, policy_version 252712 (0.00090) [2022-07-09 12:40:17,754][26022] Updated weights on worker 0-0, policy_version 252722 (0.00085) [2022-07-09 12:40:18,588][25689] Fps is (10 sec: 5702.3, 60 sec: 5702.9, 300 sec: 5655.5). Total num frames: 258792448. Throughput: 0: 4999.9. Samples: 258787080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:18,588][25689] Avg episode reward: [(0, '-46.996')] [2022-07-09 12:40:19,314][26022] Updated weights on worker 0-0, policy_version 252732 (0.00080) [2022-07-09 12:40:21,263][26022] Updated weights on worker 0-0, policy_version 252742 (0.00084) [2022-07-09 12:40:23,012][26022] Updated weights on worker 0-0, policy_version 252752 (0.00085) [2022-07-09 12:40:23,687][25689] Fps is (10 sec: 5767.6, 60 sec: 5665.5, 300 sec: 5654.4). Total num frames: 258821120. Throughput: 0: 5840.7. Samples: 258821270. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:23,688][25689] Avg episode reward: [(0, '-47.384')] [2022-07-09 12:40:24,858][26022] Updated weights on worker 0-0, policy_version 252762 (0.00106) [2022-07-09 12:40:26,924][26022] Updated weights on worker 0-0, policy_version 252773 (0.00088) [2022-07-09 12:40:28,675][26022] Updated weights on worker 0-0, policy_version 252783 (0.00097) [2022-07-09 12:40:28,757][25689] Fps is (10 sec: 5639.1, 60 sec: 5662.4, 300 sec: 5650.0). Total num frames: 258849792. Throughput: 0: 5927.5. Samples: 258855300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:28,757][25689] Avg episode reward: [(0, '-47.697')] [2022-07-09 12:40:30,532][26022] Updated weights on worker 0-0, policy_version 252793 (0.00085) [2022-07-09 12:40:32,215][26022] Updated weights on worker 0-0, policy_version 252803 (0.00087) [2022-07-09 12:40:33,792][25689] Fps is (10 sec: 5674.8, 60 sec: 5683.9, 300 sec: 5653.8). Total num frames: 258878464. Throughput: 0: 5053.6. Samples: 258872254. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:33,793][25689] Avg episode reward: [(0, '-48.094')] [2022-07-09 12:40:34,100][26022] Updated weights on worker 0-0, policy_version 252813 (0.00098) [2022-07-09 12:40:35,942][26022] Updated weights on worker 0-0, policy_version 252823 (0.00092) [2022-07-09 12:40:37,618][26022] Updated weights on worker 0-0, policy_version 252833 (0.00092) [2022-07-09 12:40:38,820][25689] Fps is (10 sec: 5698.5, 60 sec: 5650.7, 300 sec: 5652.1). Total num frames: 258907136. Throughput: 0: 5900.7. Samples: 258906464. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:38,821][25689] Avg episode reward: [(0, '-49.074')] [2022-07-09 12:40:39,549][26022] Updated weights on worker 0-0, policy_version 252843 (0.00090) [2022-07-09 12:40:41,108][26022] Updated weights on worker 0-0, policy_version 252853 (0.00087) [2022-07-09 12:40:43,111][26022] Updated weights on worker 0-0, policy_version 252863 (0.00081) [2022-07-09 12:40:43,943][25689] Fps is (10 sec: 5750.0, 60 sec: 5661.1, 300 sec: 5654.5). Total num frames: 258936832. Throughput: 0: 5895.7. Samples: 258940694. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:43,943][25689] Avg episode reward: [(0, '-49.156')] [2022-07-09 12:40:44,924][26022] Updated weights on worker 0-0, policy_version 252873 (0.00090) [2022-07-09 12:40:46,648][26022] Updated weights on worker 0-0, policy_version 252883 (0.00085) [2022-07-09 12:40:48,460][26022] Updated weights on worker 0-0, policy_version 252893 (0.00085) [2022-07-09 12:40:48,967][25689] Fps is (10 sec: 5651.3, 60 sec: 5630.8, 300 sec: 5651.5). Total num frames: 258964480. Throughput: 0: 5073.2. Samples: 258957830. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:48,967][25689] Avg episode reward: [(0, '-51.003')] [2022-07-09 12:40:50,060][26022] Updated weights on worker 0-0, policy_version 252903 (0.00087) [2022-07-09 12:40:52,144][26022] Updated weights on worker 0-0, policy_version 252913 (0.00095) [2022-07-09 12:40:53,664][26022] Updated weights on worker 0-0, policy_version 252923 (0.00084) [2022-07-09 12:40:54,048][25689] Fps is (10 sec: 5674.4, 60 sec: 5658.7, 300 sec: 5650.8). Total num frames: 258994176. Throughput: 0: 5921.4. Samples: 258992202. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:54,049][25689] Avg episode reward: [(0, '-50.957')] [2022-07-09 12:40:55,666][26022] Updated weights on worker 0-0, policy_version 252933 (0.00091) [2022-07-09 12:40:55,835][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:40:55,841][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000252934_259004416.pth [2022-07-09 12:40:55,842][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000250945_256967680.pth [2022-07-09 12:40:57,602][26022] Updated weights on worker 0-0, policy_version 252943 (0.00093) [2022-07-09 12:40:59,054][25689] Fps is (10 sec: 5786.1, 60 sec: 5678.4, 300 sec: 5659.4). Total num frames: 259022848. Throughput: 0: 5913.7. Samples: 259026126. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:40:59,055][25689] Avg episode reward: [(0, '-50.979')] [2022-07-09 12:40:59,367][26022] Updated weights on worker 0-0, policy_version 252953 (0.00086) [2022-07-09 12:41:01,193][26022] Updated weights on worker 0-0, policy_version 252963 (0.00086) [2022-07-09 12:41:03,172][26022] Updated weights on worker 0-0, policy_version 252973 (0.00088) [2022-07-09 12:41:04,158][25689] Fps is (10 sec: 5368.6, 60 sec: 5640.9, 300 sec: 5650.8). Total num frames: 259048448. Throughput: 0: 5792.6. Samples: 259057790. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:04,158][25689] Avg episode reward: [(0, '-50.831')] [2022-07-09 12:41:05,157][26022] Updated weights on worker 0-0, policy_version 252983 (0.00089) [2022-07-09 12:41:06,901][26022] Updated weights on worker 0-0, policy_version 252993 (0.00085) [2022-07-09 12:41:08,758][26022] Updated weights on worker 0-0, policy_version 253003 (0.00083) [2022-07-09 12:41:09,159][25689] Fps is (10 sec: 5371.2, 60 sec: 5641.3, 300 sec: 5654.6). Total num frames: 259077120. Throughput: 0: 5800.6. Samples: 259074954. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:09,159][25689] Avg episode reward: [(0, '-49.909')] [2022-07-09 12:41:10,345][26022] Updated weights on worker 0-0, policy_version 253013 (0.00084) [2022-07-09 12:41:12,319][26022] Updated weights on worker 0-0, policy_version 253023 (0.00084) [2022-07-09 12:41:14,022][26022] Updated weights on worker 0-0, policy_version 253033 (0.00086) [2022-07-09 12:41:14,195][25689] Fps is (10 sec: 5815.1, 60 sec: 5675.0, 300 sec: 5654.2). Total num frames: 259106816. Throughput: 0: 5810.1. Samples: 259109256. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:14,195][25689] Avg episode reward: [(0, '-50.805')] [2022-07-09 12:41:15,935][26022] Updated weights on worker 0-0, policy_version 253043 (0.00084) [2022-07-09 12:41:17,814][26022] Updated weights on worker 0-0, policy_version 253053 (0.00107) [2022-07-09 12:41:19,217][25689] Fps is (10 sec: 5803.1, 60 sec: 5658.0, 300 sec: 5661.8). Total num frames: 259135488. Throughput: 0: 5830.8. Samples: 259143688. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:19,217][25689] Avg episode reward: [(0, '-49.722')] [2022-07-09 12:41:19,385][26022] Updated weights on worker 0-0, policy_version 253063 (0.00087) [2022-07-09 12:41:21,256][26022] Updated weights on worker 0-0, policy_version 253073 (0.00091) [2022-07-09 12:41:23,050][26022] Updated weights on worker 0-0, policy_version 253083 (0.00086) [2022-07-09 12:41:24,333][25689] Fps is (10 sec: 5656.4, 60 sec: 5656.4, 300 sec: 5656.4). Total num frames: 259164160. Throughput: 0: 5101.0. Samples: 259160704. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:24,333][25689] Avg episode reward: [(0, '-50.330')] [2022-07-09 12:41:24,760][26022] Updated weights on worker 0-0, policy_version 253093 (0.00089) [2022-07-09 12:41:26,791][26022] Updated weights on worker 0-0, policy_version 253103 (0.00088) [2022-07-09 12:41:28,476][26022] Updated weights on worker 0-0, policy_version 253113 (0.00097) [2022-07-09 12:41:29,346][25689] Fps is (10 sec: 5560.2, 60 sec: 5644.8, 300 sec: 5653.3). Total num frames: 259191808. Throughput: 0: 5930.4. Samples: 259194672. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:29,346][25689] Avg episode reward: [(0, '-50.243')] [2022-07-09 12:41:30,453][26022] Updated weights on worker 0-0, policy_version 253123 (0.00098) [2022-07-09 12:41:32,028][26022] Updated weights on worker 0-0, policy_version 253133 (0.00086) [2022-07-09 12:41:34,053][26022] Updated weights on worker 0-0, policy_version 253143 (0.00092) [2022-07-09 12:41:34,370][25689] Fps is (10 sec: 5610.9, 60 sec: 5645.8, 300 sec: 5656.6). Total num frames: 259220480. Throughput: 0: 5914.2. Samples: 259228578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:34,371][25689] Avg episode reward: [(0, '-50.856')] [2022-07-09 12:41:35,712][26022] Updated weights on worker 0-0, policy_version 253153 (0.00079) [2022-07-09 12:41:37,455][26022] Updated weights on worker 0-0, policy_version 253163 (0.00100) [2022-07-09 12:41:39,378][25689] Fps is (10 sec: 5613.8, 60 sec: 5630.8, 300 sec: 5651.1). Total num frames: 259248128. Throughput: 0: 5051.6. Samples: 259245536. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:39,379][25689] Avg episode reward: [(0, '-51.603')] [2022-07-09 12:41:39,437][26022] Updated weights on worker 0-0, policy_version 253173 (0.00057) [2022-07-09 12:41:41,017][26022] Updated weights on worker 0-0, policy_version 253183 (0.00094) [2022-07-09 12:41:43,078][26022] Updated weights on worker 0-0, policy_version 253193 (0.00086) [2022-07-09 12:41:44,433][25689] Fps is (10 sec: 5698.6, 60 sec: 5637.1, 300 sec: 5654.5). Total num frames: 259277824. Throughput: 0: 5931.0. Samples: 259279920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:44,435][25689] Avg episode reward: [(0, '-51.139')] [2022-07-09 12:41:44,799][26022] Updated weights on worker 0-0, policy_version 253203 (0.00079) [2022-07-09 12:41:46,679][26022] Updated weights on worker 0-0, policy_version 253213 (0.00088) [2022-07-09 12:41:48,306][26022] Updated weights on worker 0-0, policy_version 253223 (0.00088) [2022-07-09 12:41:49,439][25689] Fps is (10 sec: 5801.7, 60 sec: 5655.8, 300 sec: 5655.9). Total num frames: 259306496. Throughput: 0: 5963.8. Samples: 259314502. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:49,441][25689] Avg episode reward: [(0, '-50.794')] [2022-07-09 12:41:50,020][26022] Updated weights on worker 0-0, policy_version 253233 (0.00085) [2022-07-09 12:41:51,955][26022] Updated weights on worker 0-0, policy_version 253243 (0.00081) [2022-07-09 12:41:53,470][26022] Updated weights on worker 0-0, policy_version 253253 (0.00094) [2022-07-09 12:41:54,443][25689] Fps is (10 sec: 5831.1, 60 sec: 5663.0, 300 sec: 5656.0). Total num frames: 259336192. Throughput: 0: 5145.0. Samples: 259331850. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:54,444][25689] Avg episode reward: [(0, '-49.815')] [2022-07-09 12:41:55,399][26022] Updated weights on worker 0-0, policy_version 253263 (0.00089) [2022-07-09 12:41:57,039][26022] Updated weights on worker 0-0, policy_version 253273 (0.00089) [2022-07-09 12:41:59,133][26022] Updated weights on worker 0-0, policy_version 253283 (0.00054) [2022-07-09 12:41:59,449][25689] Fps is (10 sec: 5728.6, 60 sec: 5646.1, 300 sec: 5664.3). Total num frames: 259363840. Throughput: 0: 6007.8. Samples: 259366116. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:41:59,449][25689] Avg episode reward: [(0, '-49.877')] [2022-07-09 12:42:00,895][26022] Updated weights on worker 0-0, policy_version 253293 (0.00094) [2022-07-09 12:42:03,112][26022] Updated weights on worker 0-0, policy_version 253303 (0.00093) [2022-07-09 12:42:04,554][25689] Fps is (10 sec: 5469.0, 60 sec: 5679.8, 300 sec: 5659.7). Total num frames: 259391488. Throughput: 0: 5857.5. Samples: 259397776. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 12:42:04,554][25689] Avg episode reward: [(0, '-48.206')] [2022-07-09 12:42:04,740][26022] Updated weights on worker 0-0, policy_version 253313 (0.00090) [2022-07-09 12:42:06,850][26022] Updated weights on worker 0-0, policy_version 253323 (0.00095) [2022-07-09 12:42:08,537][26022] Updated weights on worker 0-0, policy_version 253333 (0.00091) [2022-07-09 12:42:09,599][25689] Fps is (10 sec: 5448.1, 60 sec: 5658.8, 300 sec: 5659.6). Total num frames: 259419136. Throughput: 0: 4963.4. Samples: 259414564. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:09,599][25689] Avg episode reward: [(0, '-47.775')] [2022-07-09 12:42:10,446][26022] Updated weights on worker 0-0, policy_version 253343 (0.00082) [2022-07-09 12:42:12,057][26022] Updated weights on worker 0-0, policy_version 253353 (0.00086) [2022-07-09 12:42:14,239][26022] Updated weights on worker 0-0, policy_version 253363 (0.00087) [2022-07-09 12:42:14,618][25689] Fps is (10 sec: 5494.2, 60 sec: 5626.4, 300 sec: 5653.4). Total num frames: 259446784. Throughput: 0: 5786.3. Samples: 259448590. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:14,619][25689] Avg episode reward: [(0, '-47.607')] [2022-07-09 12:42:15,531][26022] Updated weights on worker 0-0, policy_version 253373 (0.00079) [2022-07-09 12:42:17,578][26022] Updated weights on worker 0-0, policy_version 253383 (0.00085) [2022-07-09 12:42:19,125][26022] Updated weights on worker 0-0, policy_version 253393 (0.00083) [2022-07-09 12:42:19,634][25689] Fps is (10 sec: 5612.0, 60 sec: 5627.0, 300 sec: 5651.6). Total num frames: 259475456. Throughput: 0: 5791.2. Samples: 259483014. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:19,635][25689] Avg episode reward: [(0, '-47.811')] [2022-07-09 12:42:21,014][26022] Updated weights on worker 0-0, policy_version 253403 (0.00081) [2022-07-09 12:42:22,754][26022] Updated weights on worker 0-0, policy_version 253413 (0.00085) [2022-07-09 12:42:24,713][25689] Fps is (10 sec: 5782.0, 60 sec: 5647.4, 300 sec: 5654.6). Total num frames: 259505152. Throughput: 0: 5069.6. Samples: 259499976. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:24,713][25689] Avg episode reward: [(0, '-48.602')] [2022-07-09 12:42:24,716][26022] Updated weights on worker 0-0, policy_version 253423 (0.00089) [2022-07-09 12:42:26,413][26022] Updated weights on worker 0-0, policy_version 253433 (0.00090) [2022-07-09 12:42:28,254][26022] Updated weights on worker 0-0, policy_version 253443 (0.00089) [2022-07-09 12:42:29,721][25689] Fps is (10 sec: 5685.0, 60 sec: 5647.9, 300 sec: 5656.2). Total num frames: 259532800. Throughput: 0: 5936.0. Samples: 259534012. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:29,722][25689] Avg episode reward: [(0, '-49.727')] [2022-07-09 12:42:30,100][26022] Updated weights on worker 0-0, policy_version 253453 (0.00091) [2022-07-09 12:42:31,914][26022] Updated weights on worker 0-0, policy_version 253463 (0.00086) [2022-07-09 12:42:33,729][26022] Updated weights on worker 0-0, policy_version 253473 (0.00086) [2022-07-09 12:42:34,759][25689] Fps is (10 sec: 5606.3, 60 sec: 5646.6, 300 sec: 5656.2). Total num frames: 259561472. Throughput: 0: 5930.3. Samples: 259568030. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:34,759][25689] Avg episode reward: [(0, '-49.798')] [2022-07-09 12:42:35,577][26022] Updated weights on worker 0-0, policy_version 253483 (0.00085) [2022-07-09 12:42:37,328][26022] Updated weights on worker 0-0, policy_version 253493 (0.00092) [2022-07-09 12:42:39,332][26022] Updated weights on worker 0-0, policy_version 253503 (0.00093) [2022-07-09 12:42:39,763][25689] Fps is (10 sec: 5608.3, 60 sec: 5647.0, 300 sec: 5643.2). Total num frames: 259589120. Throughput: 0: 5072.6. Samples: 259585122. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:39,764][25689] Avg episode reward: [(0, '-49.372')] [2022-07-09 12:42:40,760][26022] Updated weights on worker 0-0, policy_version 253513 (0.00105) [2022-07-09 12:42:42,936][26022] Updated weights on worker 0-0, policy_version 253523 (0.00089) [2022-07-09 12:42:44,595][26022] Updated weights on worker 0-0, policy_version 253533 (0.00082) [2022-07-09 12:42:44,800][25689] Fps is (10 sec: 5608.7, 60 sec: 5631.7, 300 sec: 5653.6). Total num frames: 259617792. Throughput: 0: 5927.0. Samples: 259619034. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:44,802][25689] Avg episode reward: [(0, '-49.566')] [2022-07-09 12:42:46,467][26022] Updated weights on worker 0-0, policy_version 253543 (0.00088) [2022-07-09 12:42:48,205][26022] Updated weights on worker 0-0, policy_version 253553 (0.00089) [2022-07-09 12:42:49,814][25689] Fps is (10 sec: 5807.4, 60 sec: 5647.9, 300 sec: 5657.4). Total num frames: 259647488. Throughput: 0: 5942.9. Samples: 259653422. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:49,815][25689] Avg episode reward: [(0, '-48.887')] [2022-07-09 12:42:50,084][26022] Updated weights on worker 0-0, policy_version 253563 (0.00102) [2022-07-09 12:42:51,818][26022] Updated weights on worker 0-0, policy_version 253573 (0.00085) [2022-07-09 12:42:53,581][26022] Updated weights on worker 0-0, policy_version 253583 (0.00094) [2022-07-09 12:42:54,843][25689] Fps is (10 sec: 5812.2, 60 sec: 5628.6, 300 sec: 5650.7). Total num frames: 259676160. Throughput: 0: 5101.6. Samples: 259670490. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:54,843][25689] Avg episode reward: [(0, '-48.698')] [2022-07-09 12:42:55,260][26022] Updated weights on worker 0-0, policy_version 253593 (0.00084) [2022-07-09 12:42:55,858][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:42:55,873][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000253596_259682304.pth [2022-07-09 12:42:55,873][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000251606_257644544.pth [2022-07-09 12:42:57,192][26022] Updated weights on worker 0-0, policy_version 253603 (0.00086) [2022-07-09 12:42:58,905][26022] Updated weights on worker 0-0, policy_version 253613 (0.00091) [2022-07-09 12:42:59,869][25689] Fps is (10 sec: 5601.4, 60 sec: 5626.8, 300 sec: 5658.6). Total num frames: 259703808. Throughput: 0: 5943.1. Samples: 259704610. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:42:59,871][25689] Avg episode reward: [(0, '-48.619')] [2022-07-09 12:43:00,750][26022] Updated weights on worker 0-0, policy_version 253623 (0.00089) [2022-07-09 12:43:02,919][26022] Updated weights on worker 0-0, policy_version 253633 (0.00086) [2022-07-09 12:43:04,834][26022] Updated weights on worker 0-0, policy_version 253643 (0.00091) [2022-07-09 12:43:04,938][25689] Fps is (10 sec: 5376.0, 60 sec: 5613.1, 300 sec: 5650.5). Total num frames: 259730432. Throughput: 0: 5843.3. Samples: 259736704. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:04,938][25689] Avg episode reward: [(0, '-49.120')] [2022-07-09 12:43:06,440][26022] Updated weights on worker 0-0, policy_version 253653 (0.00087) [2022-07-09 12:43:08,411][26022] Updated weights on worker 0-0, policy_version 253663 (0.00101) [2022-07-09 12:43:09,965][25689] Fps is (10 sec: 5578.3, 60 sec: 5648.7, 300 sec: 5650.4). Total num frames: 259760128. Throughput: 0: 4980.5. Samples: 259753784. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:09,966][25689] Avg episode reward: [(0, '-48.142')] [2022-07-09 12:43:10,273][26022] Updated weights on worker 0-0, policy_version 253673 (0.00102) [2022-07-09 12:43:11,894][26022] Updated weights on worker 0-0, policy_version 253683 (0.00095) [2022-07-09 12:43:14,018][26022] Updated weights on worker 0-0, policy_version 253693 (0.00089) [2022-07-09 12:43:14,982][25689] Fps is (10 sec: 5811.0, 60 sec: 5665.9, 300 sec: 5650.3). Total num frames: 259788800. Throughput: 0: 5825.1. Samples: 259787806. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:14,983][25689] Avg episode reward: [(0, '-48.752')] [2022-07-09 12:43:15,727][26022] Updated weights on worker 0-0, policy_version 253703 (0.00088) [2022-07-09 12:43:17,381][26022] Updated weights on worker 0-0, policy_version 253713 (0.00088) [2022-07-09 12:43:19,134][26022] Updated weights on worker 0-0, policy_version 253723 (0.00085) [2022-07-09 12:43:19,994][25689] Fps is (10 sec: 5615.4, 60 sec: 5649.3, 300 sec: 5650.9). Total num frames: 259816448. Throughput: 0: 5842.4. Samples: 259822194. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:19,995][25689] Avg episode reward: [(0, '-49.271')] [2022-07-09 12:43:20,848][26022] Updated weights on worker 0-0, policy_version 253733 (0.00085) [2022-07-09 12:43:22,739][26022] Updated weights on worker 0-0, policy_version 253743 (0.00092) [2022-07-09 12:43:24,686][26022] Updated weights on worker 0-0, policy_version 253753 (0.00089) [2022-07-09 12:43:25,128][25689] Fps is (10 sec: 5652.0, 60 sec: 5644.1, 300 sec: 5651.9). Total num frames: 259846144. Throughput: 0: 5080.5. Samples: 259839280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:25,129][25689] Avg episode reward: [(0, '-49.182')] [2022-07-09 12:43:26,363][26022] Updated weights on worker 0-0, policy_version 253763 (0.00108) [2022-07-09 12:43:28,235][26022] Updated weights on worker 0-0, policy_version 253773 (0.00088) [2022-07-09 12:43:29,796][26022] Updated weights on worker 0-0, policy_version 253783 (0.00084) [2022-07-09 12:43:30,144][25689] Fps is (10 sec: 5649.7, 60 sec: 5643.4, 300 sec: 5648.3). Total num frames: 259873792. Throughput: 0: 5923.6. Samples: 259873320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:30,145][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 12:43:31,823][26022] Updated weights on worker 0-0, policy_version 253793 (0.00087) [2022-07-09 12:43:33,970][26022] Updated weights on worker 0-0, policy_version 253803 (0.00108) [2022-07-09 12:43:35,155][25689] Fps is (10 sec: 5616.7, 60 sec: 5645.9, 300 sec: 5648.5). Total num frames: 259902464. Throughput: 0: 5923.6. Samples: 259907302. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:35,156][25689] Avg episode reward: [(0, '-49.126')] [2022-07-09 12:43:35,433][26022] Updated weights on worker 0-0, policy_version 253813 (0.00081) [2022-07-09 12:43:37,377][26022] Updated weights on worker 0-0, policy_version 253823 (0.00084) [2022-07-09 12:43:39,162][26022] Updated weights on worker 0-0, policy_version 253833 (0.00086) [2022-07-09 12:43:40,214][25689] Fps is (10 sec: 5695.0, 60 sec: 5657.8, 300 sec: 5645.5). Total num frames: 259931136. Throughput: 0: 5062.8. Samples: 259924560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:40,214][25689] Avg episode reward: [(0, '-49.430')] [2022-07-09 12:43:40,747][26022] Updated weights on worker 0-0, policy_version 253843 (0.00100) [2022-07-09 12:43:42,575][26022] Updated weights on worker 0-0, policy_version 253853 (0.00085) [2022-07-09 12:43:44,379][26022] Updated weights on worker 0-0, policy_version 253863 (0.00090) [2022-07-09 12:43:45,272][25689] Fps is (10 sec: 5668.4, 60 sec: 5655.8, 300 sec: 5648.1). Total num frames: 259959808. Throughput: 0: 5922.7. Samples: 259958586. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:45,273][25689] Avg episode reward: [(0, '-48.689')] [2022-07-09 12:43:46,098][26022] Updated weights on worker 0-0, policy_version 253873 (0.00090) [2022-07-09 12:43:48,032][26022] Updated weights on worker 0-0, policy_version 253883 (0.00084) [2022-07-09 12:43:49,698][26022] Updated weights on worker 0-0, policy_version 253893 (0.00087) [2022-07-09 12:43:50,299][25689] Fps is (10 sec: 5787.1, 60 sec: 5654.5, 300 sec: 5654.9). Total num frames: 259989504. Throughput: 0: 5956.6. Samples: 259993376. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:50,301][25689] Avg episode reward: [(0, '-49.266')] [2022-07-09 12:43:51,610][26022] Updated weights on worker 0-0, policy_version 253903 (0.00085) [2022-07-09 12:43:53,383][26022] Updated weights on worker 0-0, policy_version 253913 (0.00557) [2022-07-09 12:43:55,071][26022] Updated weights on worker 0-0, policy_version 253923 (0.00090) [2022-07-09 12:43:55,314][25689] Fps is (10 sec: 5812.4, 60 sec: 5655.9, 300 sec: 5651.6). Total num frames: 260018176. Throughput: 0: 5110.0. Samples: 260010312. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:43:55,314][25689] Avg episode reward: [(0, '-49.169')] [2022-07-09 12:43:56,975][26022] Updated weights on worker 0-0, policy_version 253933 (0.00081) [2022-07-09 12:43:58,603][26022] Updated weights on worker 0-0, policy_version 253943 (0.00086) [2022-07-09 12:44:00,319][25689] Fps is (10 sec: 5620.9, 60 sec: 5657.8, 300 sec: 5662.8). Total num frames: 260045824. Throughput: 0: 5973.6. Samples: 260044664. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:44:00,320][25689] Avg episode reward: [(0, '-49.526')] [2022-07-09 12:44:00,512][26022] Updated weights on worker 0-0, policy_version 253953 (0.00091) [2022-07-09 12:44:02,663][26022] Updated weights on worker 0-0, policy_version 253963 (0.00085) [2022-07-09 12:44:04,450][26022] Updated weights on worker 0-0, policy_version 253973 (0.00222) [2022-07-09 12:44:05,380][25689] Fps is (10 sec: 5595.2, 60 sec: 5692.5, 300 sec: 5655.0). Total num frames: 260074496. Throughput: 0: 5876.7. Samples: 260076754. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:44:05,383][25689] Avg episode reward: [(0, '-50.541')] [2022-07-09 12:44:06,370][26022] Updated weights on worker 0-0, policy_version 253983 (0.00089) [2022-07-09 12:44:07,992][26022] Updated weights on worker 0-0, policy_version 253993 (0.00090) [2022-07-09 12:44:09,838][26022] Updated weights on worker 0-0, policy_version 254003 (0.00084) [2022-07-09 12:44:10,461][25689] Fps is (10 sec: 5452.2, 60 sec: 5636.6, 300 sec: 5653.5). Total num frames: 260101120. Throughput: 0: 5845.1. Samples: 260111224. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:44:10,462][25689] Avg episode reward: [(0, '-50.870')] [2022-07-09 12:44:11,501][26022] Updated weights on worker 0-0, policy_version 254013 (0.00082) [2022-07-09 12:44:13,464][26022] Updated weights on worker 0-0, policy_version 254023 (0.00089) [2022-07-09 12:44:15,202][26022] Updated weights on worker 0-0, policy_version 254033 (0.00082) [2022-07-09 12:44:15,507][25689] Fps is (10 sec: 5460.2, 60 sec: 5634.0, 300 sec: 5656.6). Total num frames: 260129792. Throughput: 0: 5853.0. Samples: 260128502. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:44:15,509][25689] Avg episode reward: [(0, '-49.990')] [2022-07-09 12:44:17,051][26022] Updated weights on worker 0-0, policy_version 254043 (0.00084) [2022-07-09 12:44:18,804][26022] Updated weights on worker 0-0, policy_version 254053 (0.00088) [2022-07-09 12:44:20,570][25689] Fps is (10 sec: 5774.0, 60 sec: 5663.0, 300 sec: 5653.1). Total num frames: 260159488. Throughput: 0: 5837.0. Samples: 260162868. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:44:20,571][25689] Avg episode reward: [(0, '-50.030')] [2022-07-09 12:44:20,603][26022] Updated weights on worker 0-0, policy_version 254063 (0.00092) [2022-07-09 12:44:22,433][26022] Updated weights on worker 0-0, policy_version 254073 (0.00085) [2022-07-09 12:44:24,244][26022] Updated weights on worker 0-0, policy_version 254083 (0.00087) [2022-07-09 12:44:25,686][25689] Fps is (10 sec: 5734.1, 60 sec: 5647.8, 300 sec: 5651.6). Total num frames: 260188160. Throughput: 0: 5936.1. Samples: 260197294. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2022-07-09 12:44:25,686][25689] Avg episode reward: [(0, '-50.571')] [2022-07-09 12:44:25,941][26022] Updated weights on worker 0-0, policy_version 254093 (0.00092) [2022-07-09 12:44:27,943][26022] Updated weights on worker 0-0, policy_version 254103 (0.00089) [2022-07-09 12:44:29,472][26022] Updated weights on worker 0-0, policy_version 254113 (0.00090) [2022-07-09 12:44:30,697][25689] Fps is (10 sec: 5763.7, 60 sec: 5682.1, 300 sec: 5659.8). Total num frames: 260217856. Throughput: 0: 5097.1. Samples: 260214368. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:44:30,697][25689] Avg episode reward: [(0, '-50.051')] [2022-07-09 12:44:31,267][26022] Updated weights on worker 0-0, policy_version 254123 (0.00093) [2022-07-09 12:44:33,234][26022] Updated weights on worker 0-0, policy_version 254133 (0.00086) [2022-07-09 12:44:34,871][26022] Updated weights on worker 0-0, policy_version 254143 (0.00092) [2022-07-09 12:44:35,758][25689] Fps is (10 sec: 5795.2, 60 sec: 5677.4, 300 sec: 5652.5). Total num frames: 260246528. Throughput: 0: 5932.6. Samples: 260248642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:44:35,758][25689] Avg episode reward: [(0, '-49.283')] [2022-07-09 12:44:36,847][26022] Updated weights on worker 0-0, policy_version 254153 (0.00094) [2022-07-09 12:44:38,527][26022] Updated weights on worker 0-0, policy_version 254163 (0.00093) [2022-07-09 12:44:40,507][26022] Updated weights on worker 0-0, policy_version 254173 (0.00089) [2022-07-09 12:44:40,760][25689] Fps is (10 sec: 5698.3, 60 sec: 5682.6, 300 sec: 5653.4). Total num frames: 260275200. Throughput: 0: 5951.2. Samples: 260283024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:44:40,767][25689] Avg episode reward: [(0, '-49.771')] [2022-07-09 12:44:41,900][26022] Updated weights on worker 0-0, policy_version 254183 (0.00084) [2022-07-09 12:44:44,151][26022] Updated weights on worker 0-0, policy_version 254193 (0.00087) [2022-07-09 12:44:45,451][26022] Updated weights on worker 0-0, policy_version 254203 (0.00080) [2022-07-09 12:44:45,886][25689] Fps is (10 sec: 5763.0, 60 sec: 5693.2, 300 sec: 5652.2). Total num frames: 260304896. Throughput: 0: 5083.5. Samples: 260299980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:44:45,886][25689] Avg episode reward: [(0, '-49.736')] [2022-07-09 12:44:47,709][26022] Updated weights on worker 0-0, policy_version 254213 (0.00082) [2022-07-09 12:44:49,219][26022] Updated weights on worker 0-0, policy_version 254223 (0.00083) [2022-07-09 12:44:50,941][25689] Fps is (10 sec: 5632.7, 60 sec: 5656.9, 300 sec: 5651.5). Total num frames: 260332544. Throughput: 0: 5945.8. Samples: 260334734. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:44:50,941][25689] Avg episode reward: [(0, '-49.521')] [2022-07-09 12:44:51,095][26022] Updated weights on worker 0-0, policy_version 254233 (0.00087) [2022-07-09 12:44:52,773][26022] Updated weights on worker 0-0, policy_version 254243 (0.00084) [2022-07-09 12:44:54,706][26022] Updated weights on worker 0-0, policy_version 254253 (0.00084) [2022-07-09 12:44:55,875][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:44:55,886][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000254260_260362240.pth [2022-07-09 12:44:55,887][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000252269_258323456.pth [2022-07-09 12:44:55,984][25689] Fps is (10 sec: 5678.5, 60 sec: 5671.1, 300 sec: 5658.3). Total num frames: 260362240. Throughput: 0: 5956.1. Samples: 260369112. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:44:55,985][25689] Avg episode reward: [(0, '-49.461')] [2022-07-09 12:44:56,404][26022] Updated weights on worker 0-0, policy_version 254263 (0.00088) [2022-07-09 12:44:58,415][26022] Updated weights on worker 0-0, policy_version 254273 (0.00080) [2022-07-09 12:44:59,823][26022] Updated weights on worker 0-0, policy_version 254283 (0.00089) [2022-07-09 12:45:00,998][25689] Fps is (10 sec: 5702.0, 60 sec: 5670.3, 300 sec: 5659.2). Total num frames: 260389888. Throughput: 0: 5100.7. Samples: 260386250. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:00,998][25689] Avg episode reward: [(0, '-49.130')] [2022-07-09 12:45:02,097][26022] Updated weights on worker 0-0, policy_version 254293 (0.00088) [2022-07-09 12:45:03,838][26022] Updated weights on worker 0-0, policy_version 254303 (0.00090) [2022-07-09 12:45:05,811][26022] Updated weights on worker 0-0, policy_version 254313 (0.00103) [2022-07-09 12:45:06,079][25689] Fps is (10 sec: 5376.3, 60 sec: 5634.6, 300 sec: 5650.9). Total num frames: 260416512. Throughput: 0: 5852.7. Samples: 260418164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:06,079][25689] Avg episode reward: [(0, '-49.018')] [2022-07-09 12:45:07,660][26022] Updated weights on worker 0-0, policy_version 254323 (0.00092) [2022-07-09 12:45:09,489][26022] Updated weights on worker 0-0, policy_version 254333 (0.00089) [2022-07-09 12:45:11,154][25689] Fps is (10 sec: 5545.5, 60 sec: 5685.8, 300 sec: 5657.0). Total num frames: 260446208. Throughput: 0: 5812.3. Samples: 260452218. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:11,154][25689] Avg episode reward: [(0, '-48.019')] [2022-07-09 12:45:11,312][26022] Updated weights on worker 0-0, policy_version 254343 (0.00091) [2022-07-09 12:45:13,055][26022] Updated weights on worker 0-0, policy_version 254353 (0.00090) [2022-07-09 12:45:14,859][26022] Updated weights on worker 0-0, policy_version 254363 (0.00085) [2022-07-09 12:45:16,240][25689] Fps is (10 sec: 5744.1, 60 sec: 5682.0, 300 sec: 5652.3). Total num frames: 260474880. Throughput: 0: 4938.8. Samples: 260469154. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:16,241][25689] Avg episode reward: [(0, '-48.098')] [2022-07-09 12:45:16,796][26022] Updated weights on worker 0-0, policy_version 254373 (0.00089) [2022-07-09 12:45:18,383][26022] Updated weights on worker 0-0, policy_version 254383 (0.00090) [2022-07-09 12:45:20,261][26022] Updated weights on worker 0-0, policy_version 254393 (0.00098) [2022-07-09 12:45:21,263][25689] Fps is (10 sec: 5672.4, 60 sec: 5668.9, 300 sec: 5653.8). Total num frames: 260503552. Throughput: 0: 5774.2. Samples: 260503266. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:21,263][25689] Avg episode reward: [(0, '-48.452')] [2022-07-09 12:45:21,997][26022] Updated weights on worker 0-0, policy_version 254403 (0.00068) [2022-07-09 12:45:23,843][26022] Updated weights on worker 0-0, policy_version 254413 (0.00095) [2022-07-09 12:45:25,516][26022] Updated weights on worker 0-0, policy_version 254423 (0.00101) [2022-07-09 12:45:26,348][25689] Fps is (10 sec: 5774.7, 60 sec: 5688.7, 300 sec: 5656.9). Total num frames: 260533248. Throughput: 0: 5888.3. Samples: 260537514. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:26,350][25689] Avg episode reward: [(0, '-48.485')] [2022-07-09 12:45:27,577][26022] Updated weights on worker 0-0, policy_version 254433 (0.01019) [2022-07-09 12:45:29,021][26022] Updated weights on worker 0-0, policy_version 254443 (0.00096) [2022-07-09 12:45:31,175][26022] Updated weights on worker 0-0, policy_version 254453 (0.00095) [2022-07-09 12:45:31,372][25689] Fps is (10 sec: 5571.3, 60 sec: 5636.9, 300 sec: 5650.3). Total num frames: 260559872. Throughput: 0: 5064.7. Samples: 260554618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:31,372][25689] Avg episode reward: [(0, '-49.432')] [2022-07-09 12:45:32,788][26022] Updated weights on worker 0-0, policy_version 254463 (0.00087) [2022-07-09 12:45:34,625][26022] Updated weights on worker 0-0, policy_version 254473 (0.00080) [2022-07-09 12:45:36,433][25689] Fps is (10 sec: 5584.5, 60 sec: 5653.7, 300 sec: 5653.1). Total num frames: 260589568. Throughput: 0: 5922.5. Samples: 260588746. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:36,433][25689] Avg episode reward: [(0, '-49.615')] [2022-07-09 12:45:36,511][26022] Updated weights on worker 0-0, policy_version 254483 (0.00097) [2022-07-09 12:45:38,119][26022] Updated weights on worker 0-0, policy_version 254493 (0.00091) [2022-07-09 12:45:40,186][26022] Updated weights on worker 0-0, policy_version 254503 (0.00092) [2022-07-09 12:45:41,491][25689] Fps is (10 sec: 5869.4, 60 sec: 5665.4, 300 sec: 5654.3). Total num frames: 260619264. Throughput: 0: 5916.2. Samples: 260622938. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:41,491][25689] Avg episode reward: [(0, '-49.578')] [2022-07-09 12:45:41,808][26022] Updated weights on worker 0-0, policy_version 254513 (0.00083) [2022-07-09 12:45:43,620][26022] Updated weights on worker 0-0, policy_version 254523 (0.00085) [2022-07-09 12:45:45,385][26022] Updated weights on worker 0-0, policy_version 254533 (0.00090) [2022-07-09 12:45:46,543][25689] Fps is (10 sec: 5773.6, 60 sec: 5655.4, 300 sec: 5657.2). Total num frames: 260647936. Throughput: 0: 5086.7. Samples: 260640238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:46,543][25689] Avg episode reward: [(0, '-49.818')] [2022-07-09 12:45:47,061][26022] Updated weights on worker 0-0, policy_version 254543 (0.00088) [2022-07-09 12:45:49,001][26022] Updated weights on worker 0-0, policy_version 254553 (0.00084) [2022-07-09 12:45:50,832][26022] Updated weights on worker 0-0, policy_version 254563 (0.00080) [2022-07-09 12:45:51,550][25689] Fps is (10 sec: 5700.4, 60 sec: 5676.7, 300 sec: 5655.2). Total num frames: 260676608. Throughput: 0: 5957.9. Samples: 260674840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:51,551][25689] Avg episode reward: [(0, '-48.603')] [2022-07-09 12:45:52,435][26022] Updated weights on worker 0-0, policy_version 254573 (0.00081) [2022-07-09 12:45:54,305][26022] Updated weights on worker 0-0, policy_version 254583 (0.00084) [2022-07-09 12:45:56,117][26022] Updated weights on worker 0-0, policy_version 254593 (0.00092) [2022-07-09 12:45:56,568][25689] Fps is (10 sec: 5617.6, 60 sec: 5645.3, 300 sec: 5651.5). Total num frames: 260704256. Throughput: 0: 5981.1. Samples: 260709176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:45:56,569][25689] Avg episode reward: [(0, '-48.078')] [2022-07-09 12:45:57,996][26022] Updated weights on worker 0-0, policy_version 254603 (0.00090) [2022-07-09 12:45:59,808][26022] Updated weights on worker 0-0, policy_version 254613 (0.00091) [2022-07-09 12:46:01,501][26022] Updated weights on worker 0-0, policy_version 254623 (0.00086) [2022-07-09 12:46:01,587][25689] Fps is (10 sec: 5713.3, 60 sec: 5678.6, 300 sec: 5666.9). Total num frames: 260733952. Throughput: 0: 5995.2. Samples: 260743422. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:01,588][25689] Avg episode reward: [(0, '-48.155')] [2022-07-09 12:46:03,811][26022] Updated weights on worker 0-0, policy_version 254633 (0.00091) [2022-07-09 12:46:05,614][26022] Updated weights on worker 0-0, policy_version 254643 (0.00093) [2022-07-09 12:46:06,659][25689] Fps is (10 sec: 5480.0, 60 sec: 5662.6, 300 sec: 5655.2). Total num frames: 260759552. Throughput: 0: 5860.3. Samples: 260758124. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:06,659][25689] Avg episode reward: [(0, '-48.429')] [2022-07-09 12:46:07,324][26022] Updated weights on worker 0-0, policy_version 254653 (0.00087) [2022-07-09 12:46:09,172][26022] Updated weights on worker 0-0, policy_version 254663 (0.00090) [2022-07-09 12:46:10,831][26022] Updated weights on worker 0-0, policy_version 254673 (0.00093) [2022-07-09 12:46:11,695][25689] Fps is (10 sec: 5268.3, 60 sec: 5632.4, 300 sec: 5648.4). Total num frames: 260787200. Throughput: 0: 5829.1. Samples: 260792262. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:11,695][25689] Avg episode reward: [(0, '-47.832')] [2022-07-09 12:46:12,756][26022] Updated weights on worker 0-0, policy_version 254683 (0.00088) [2022-07-09 12:46:14,623][26022] Updated weights on worker 0-0, policy_version 254693 (0.00082) [2022-07-09 12:46:16,215][26022] Updated weights on worker 0-0, policy_version 254703 (0.00084) [2022-07-09 12:46:16,703][25689] Fps is (10 sec: 5810.9, 60 sec: 5673.6, 300 sec: 5655.5). Total num frames: 260817920. Throughput: 0: 5836.7. Samples: 260826698. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:16,704][25689] Avg episode reward: [(0, '-47.585')] [2022-07-09 12:46:18,217][26022] Updated weights on worker 0-0, policy_version 254713 (0.00087) [2022-07-09 12:46:19,874][26022] Updated weights on worker 0-0, policy_version 254723 (0.00049) [2022-07-09 12:46:21,675][26022] Updated weights on worker 0-0, policy_version 254733 (0.00089) [2022-07-09 12:46:21,718][25689] Fps is (10 sec: 6027.3, 60 sec: 5691.2, 300 sec: 5660.9). Total num frames: 260847616. Throughput: 0: 4997.4. Samples: 260844024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:21,719][25689] Avg episode reward: [(0, '-48.806')] [2022-07-09 12:46:23,775][26022] Updated weights on worker 0-0, policy_version 254743 (0.00083) [2022-07-09 12:46:25,202][26022] Updated weights on worker 0-0, policy_version 254753 (0.00082) [2022-07-09 12:46:26,779][25689] Fps is (10 sec: 5691.4, 60 sec: 5659.6, 300 sec: 5660.0). Total num frames: 260875264. Throughput: 0: 5970.7. Samples: 260878256. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:26,779][25689] Avg episode reward: [(0, '-48.250')] [2022-07-09 12:46:27,258][26022] Updated weights on worker 0-0, policy_version 254763 (0.00086) [2022-07-09 12:46:28,915][26022] Updated weights on worker 0-0, policy_version 254773 (0.00085) [2022-07-09 12:46:30,699][26022] Updated weights on worker 0-0, policy_version 254783 (0.00085) [2022-07-09 12:46:31,789][25689] Fps is (10 sec: 5592.6, 60 sec: 5694.8, 300 sec: 5660.2). Total num frames: 260903936. Throughput: 0: 5978.4. Samples: 260912394. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:31,789][25689] Avg episode reward: [(0, '-48.538')] [2022-07-09 12:46:32,498][26022] Updated weights on worker 0-0, policy_version 254793 (0.00093) [2022-07-09 12:46:34,248][26022] Updated weights on worker 0-0, policy_version 254803 (0.00093) [2022-07-09 12:46:36,236][26022] Updated weights on worker 0-0, policy_version 254813 (0.00101) [2022-07-09 12:46:36,807][25689] Fps is (10 sec: 5820.4, 60 sec: 5698.9, 300 sec: 5666.9). Total num frames: 260933632. Throughput: 0: 5094.0. Samples: 260929106. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:36,807][25689] Avg episode reward: [(0, '-49.215')] [2022-07-09 12:46:38,166][26022] Updated weights on worker 0-0, policy_version 254823 (0.00087) [2022-07-09 12:46:39,695][26022] Updated weights on worker 0-0, policy_version 254833 (0.00093) [2022-07-09 12:46:41,704][26022] Updated weights on worker 0-0, policy_version 254843 (0.00086) [2022-07-09 12:46:41,820][25689] Fps is (10 sec: 5614.4, 60 sec: 5652.2, 300 sec: 5657.4). Total num frames: 260960256. Throughput: 0: 5917.4. Samples: 260962974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:41,822][25689] Avg episode reward: [(0, '-48.780')] [2022-07-09 12:46:43,363][26022] Updated weights on worker 0-0, policy_version 254853 (0.00112) [2022-07-09 12:46:45,336][26022] Updated weights on worker 0-0, policy_version 254863 (0.00089) [2022-07-09 12:46:46,898][25689] Fps is (10 sec: 5479.5, 60 sec: 5649.7, 300 sec: 5656.0). Total num frames: 260988928. Throughput: 0: 5918.5. Samples: 260997334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 12:46:46,899][25689] Avg episode reward: [(0, '-47.952')] [2022-07-09 12:46:47,059][26022] Updated weights on worker 0-0, policy_version 254873 (0.00085) [2022-07-09 12:46:48,627][26022] Updated weights on worker 0-0, policy_version 254883 (0.00086) [2022-07-09 12:46:50,600][26022] Updated weights on worker 0-0, policy_version 254893 (0.00082) [2022-07-09 12:46:51,917][25689] Fps is (10 sec: 5781.0, 60 sec: 5665.7, 300 sec: 5655.7). Total num frames: 261018624. Throughput: 0: 5080.7. Samples: 261014658. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:46:51,917][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 12:46:52,334][26022] Updated weights on worker 0-0, policy_version 254903 (0.00090) [2022-07-09 12:46:54,168][26022] Updated weights on worker 0-0, policy_version 254913 (0.00086) [2022-07-09 12:46:55,946][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:46:55,956][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000254923_261041152.pth [2022-07-09 12:46:55,957][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000252934_259004416.pth [2022-07-09 12:46:55,960][26022] Updated weights on worker 0-0, policy_version 254923 (0.00081) [2022-07-09 12:46:56,987][25689] Fps is (10 sec: 5684.2, 60 sec: 5660.8, 300 sec: 5654.5). Total num frames: 261046272. Throughput: 0: 5936.9. Samples: 261048910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:46:56,987][25689] Avg episode reward: [(0, '-48.415')] [2022-07-09 12:46:57,671][26022] Updated weights on worker 0-0, policy_version 254933 (0.00098) [2022-07-09 12:46:59,603][26022] Updated weights on worker 0-0, policy_version 254943 (0.00090) [2022-07-09 12:47:01,399][26022] Updated weights on worker 0-0, policy_version 254953 (0.00088) [2022-07-09 12:47:02,069][25689] Fps is (10 sec: 5547.3, 60 sec: 5637.9, 300 sec: 5658.4). Total num frames: 261074944. Throughput: 0: 5904.9. Samples: 261082544. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:02,070][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 12:47:03,389][26022] Updated weights on worker 0-0, policy_version 254963 (0.00087) [2022-07-09 12:47:05,343][26022] Updated weights on worker 0-0, policy_version 254973 (0.00093) [2022-07-09 12:47:07,207][25689] Fps is (10 sec: 5310.3, 60 sec: 5631.7, 300 sec: 5649.7). Total num frames: 261100544. Throughput: 0: 4955.2. Samples: 261097966. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:07,207][25689] Avg episode reward: [(0, '-48.017')] [2022-07-09 12:47:07,334][26022] Updated weights on worker 0-0, policy_version 254983 (0.00089) [2022-07-09 12:47:08,944][26022] Updated weights on worker 0-0, policy_version 254993 (0.00093) [2022-07-09 12:47:11,035][26022] Updated weights on worker 0-0, policy_version 255003 (0.00086) [2022-07-09 12:47:12,215][25689] Fps is (10 sec: 5450.4, 60 sec: 5668.2, 300 sec: 5656.8). Total num frames: 261130240. Throughput: 0: 5761.1. Samples: 261131598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:12,215][25689] Avg episode reward: [(0, '-49.269')] [2022-07-09 12:47:12,458][26022] Updated weights on worker 0-0, policy_version 255013 (0.00086) [2022-07-09 12:47:14,597][26022] Updated weights on worker 0-0, policy_version 255023 (0.00083) [2022-07-09 12:47:16,224][26022] Updated weights on worker 0-0, policy_version 255033 (0.00094) [2022-07-09 12:47:17,221][25689] Fps is (10 sec: 5828.4, 60 sec: 5634.6, 300 sec: 5657.0). Total num frames: 261158912. Throughput: 0: 5772.6. Samples: 261165716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:17,222][25689] Avg episode reward: [(0, '-48.935')] [2022-07-09 12:47:18,155][26022] Updated weights on worker 0-0, policy_version 255043 (0.00099) [2022-07-09 12:47:19,996][26022] Updated weights on worker 0-0, policy_version 255053 (0.00089) [2022-07-09 12:47:21,765][26022] Updated weights on worker 0-0, policy_version 255063 (0.00097) [2022-07-09 12:47:22,252][25689] Fps is (10 sec: 5610.8, 60 sec: 5599.2, 300 sec: 5651.0). Total num frames: 261186560. Throughput: 0: 4959.4. Samples: 261182640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:22,254][25689] Avg episode reward: [(0, '-49.309')] [2022-07-09 12:47:23,506][26022] Updated weights on worker 0-0, policy_version 255073 (0.00085) [2022-07-09 12:47:25,282][26022] Updated weights on worker 0-0, policy_version 255083 (0.00093) [2022-07-09 12:47:26,984][26022] Updated weights on worker 0-0, policy_version 255093 (0.00092) [2022-07-09 12:47:27,336][25689] Fps is (10 sec: 5770.8, 60 sec: 5647.8, 300 sec: 5659.9). Total num frames: 261217280. Throughput: 0: 5898.6. Samples: 261216698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:27,336][25689] Avg episode reward: [(0, '-49.696')] [2022-07-09 12:47:28,917][26022] Updated weights on worker 0-0, policy_version 255103 (0.00086) [2022-07-09 12:47:30,575][26022] Updated weights on worker 0-0, policy_version 255113 (0.00091) [2022-07-09 12:47:32,353][25689] Fps is (10 sec: 5677.6, 60 sec: 5613.4, 300 sec: 5653.5). Total num frames: 261243904. Throughput: 0: 5938.8. Samples: 261251192. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:32,354][25689] Avg episode reward: [(0, '-50.285')] [2022-07-09 12:47:32,553][26022] Updated weights on worker 0-0, policy_version 255123 (0.00089) [2022-07-09 12:47:34,225][26022] Updated weights on worker 0-0, policy_version 255133 (0.00082) [2022-07-09 12:47:36,058][26022] Updated weights on worker 0-0, policy_version 255143 (0.00084) [2022-07-09 12:47:37,410][25689] Fps is (10 sec: 5590.3, 60 sec: 5609.7, 300 sec: 5659.3). Total num frames: 261273600. Throughput: 0: 5099.6. Samples: 261268670. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:37,412][25689] Avg episode reward: [(0, '-50.978')] [2022-07-09 12:47:37,950][26022] Updated weights on worker 0-0, policy_version 255153 (0.00102) [2022-07-09 12:47:39,600][26022] Updated weights on worker 0-0, policy_version 255163 (0.00084) [2022-07-09 12:47:41,475][26022] Updated weights on worker 0-0, policy_version 255173 (0.00094) [2022-07-09 12:47:42,472][25689] Fps is (10 sec: 5970.4, 60 sec: 5672.8, 300 sec: 5665.7). Total num frames: 261304320. Throughput: 0: 5963.3. Samples: 261303214. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:42,472][25689] Avg episode reward: [(0, '-50.777')] [2022-07-09 12:47:42,861][26022] Updated weights on worker 0-0, policy_version 255183 (0.00089) [2022-07-09 12:47:45,050][26022] Updated weights on worker 0-0, policy_version 255193 (0.00082) [2022-07-09 12:47:46,703][26022] Updated weights on worker 0-0, policy_version 255203 (0.00088) [2022-07-09 12:47:47,537][25689] Fps is (10 sec: 5763.6, 60 sec: 5657.1, 300 sec: 5657.9). Total num frames: 261331968. Throughput: 0: 5989.0. Samples: 261337684. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:47,539][25689] Avg episode reward: [(0, '-52.169')] [2022-07-09 12:47:48,532][26022] Updated weights on worker 0-0, policy_version 255213 (0.00087) [2022-07-09 12:47:50,284][26022] Updated weights on worker 0-0, policy_version 255223 (0.00090) [2022-07-09 12:47:52,175][26022] Updated weights on worker 0-0, policy_version 255233 (0.00084) [2022-07-09 12:47:52,634][25689] Fps is (10 sec: 5743.5, 60 sec: 5666.6, 300 sec: 5663.5). Total num frames: 261362688. Throughput: 0: 5115.5. Samples: 261354942. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:52,635][25689] Avg episode reward: [(0, '-50.667')] [2022-07-09 12:47:53,918][26022] Updated weights on worker 0-0, policy_version 255243 (0.00091) [2022-07-09 12:47:55,766][26022] Updated weights on worker 0-0, policy_version 255253 (0.00086) [2022-07-09 12:47:57,511][26022] Updated weights on worker 0-0, policy_version 255263 (0.00094) [2022-07-09 12:47:57,668][25689] Fps is (10 sec: 5761.1, 60 sec: 5670.0, 300 sec: 5663.3). Total num frames: 261390336. Throughput: 0: 5945.2. Samples: 261389110. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:47:57,670][25689] Avg episode reward: [(0, '-49.646')] [2022-07-09 12:47:59,242][26022] Updated weights on worker 0-0, policy_version 255273 (0.00084) [2022-07-09 12:48:01,288][26022] Updated weights on worker 0-0, policy_version 255283 (0.00188) [2022-07-09 12:48:02,756][25689] Fps is (10 sec: 5361.9, 60 sec: 5635.8, 300 sec: 5662.9). Total num frames: 261416960. Throughput: 0: 5839.4. Samples: 261421662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:02,757][25689] Avg episode reward: [(0, '-49.262')] [2022-07-09 12:48:03,247][26022] Updated weights on worker 0-0, policy_version 255293 (0.00084) [2022-07-09 12:48:05,132][26022] Updated weights on worker 0-0, policy_version 255303 (0.00083) [2022-07-09 12:48:06,851][26022] Updated weights on worker 0-0, policy_version 255313 (0.00091) [2022-07-09 12:48:07,841][25689] Fps is (10 sec: 5335.0, 60 sec: 5674.4, 300 sec: 5655.0). Total num frames: 261444608. Throughput: 0: 5788.2. Samples: 261455210. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:07,842][25689] Avg episode reward: [(0, '-50.057')] [2022-07-09 12:48:08,666][26022] Updated weights on worker 0-0, policy_version 255323 (0.00085) [2022-07-09 12:48:10,543][26022] Updated weights on worker 0-0, policy_version 255333 (0.00084) [2022-07-09 12:48:12,283][26022] Updated weights on worker 0-0, policy_version 255343 (0.00088) [2022-07-09 12:48:12,927][25689] Fps is (10 sec: 5638.2, 60 sec: 5667.2, 300 sec: 5657.1). Total num frames: 261474304. Throughput: 0: 5789.8. Samples: 261472430. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:12,927][25689] Avg episode reward: [(0, '-49.421')] [2022-07-09 12:48:13,987][26022] Updated weights on worker 0-0, policy_version 255353 (0.00088) [2022-07-09 12:48:15,880][26022] Updated weights on worker 0-0, policy_version 255363 (0.00093) [2022-07-09 12:48:17,551][26022] Updated weights on worker 0-0, policy_version 255373 (0.00087) [2022-07-09 12:48:17,935][25689] Fps is (10 sec: 5985.5, 60 sec: 5700.7, 300 sec: 5667.5). Total num frames: 261505024. Throughput: 0: 5811.0. Samples: 261506880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:17,936][25689] Avg episode reward: [(0, '-47.789')] [2022-07-09 12:48:19,570][26022] Updated weights on worker 0-0, policy_version 255383 (0.00093) [2022-07-09 12:48:21,137][26022] Updated weights on worker 0-0, policy_version 255393 (0.00081) [2022-07-09 12:48:22,949][25689] Fps is (10 sec: 5721.8, 60 sec: 5685.5, 300 sec: 5659.5). Total num frames: 261531648. Throughput: 0: 5915.3. Samples: 261541108. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:22,949][25689] Avg episode reward: [(0, '-48.218')] [2022-07-09 12:48:23,033][26022] Updated weights on worker 0-0, policy_version 255403 (0.00087) [2022-07-09 12:48:24,774][26022] Updated weights on worker 0-0, policy_version 255413 (0.00087) [2022-07-09 12:48:26,663][26022] Updated weights on worker 0-0, policy_version 255423 (0.00085) [2022-07-09 12:48:28,065][25689] Fps is (10 sec: 5458.7, 60 sec: 5648.7, 300 sec: 5661.0). Total num frames: 261560320. Throughput: 0: 5089.0. Samples: 261558128. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:28,066][25689] Avg episode reward: [(0, '-48.454')] [2022-07-09 12:48:28,394][26022] Updated weights on worker 0-0, policy_version 255433 (0.00082) [2022-07-09 12:48:30,276][26022] Updated weights on worker 0-0, policy_version 255443 (0.00090) [2022-07-09 12:48:31,969][26022] Updated weights on worker 0-0, policy_version 255453 (0.00085) [2022-07-09 12:48:33,095][25689] Fps is (10 sec: 5651.7, 60 sec: 5681.2, 300 sec: 5660.6). Total num frames: 261588992. Throughput: 0: 5931.8. Samples: 261592064. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:33,097][25689] Avg episode reward: [(0, '-47.936')] [2022-07-09 12:48:33,947][26022] Updated weights on worker 0-0, policy_version 255463 (0.00084) [2022-07-09 12:48:35,531][26022] Updated weights on worker 0-0, policy_version 255473 (0.00097) [2022-07-09 12:48:37,568][26022] Updated weights on worker 0-0, policy_version 255483 (0.00089) [2022-07-09 12:48:38,115][25689] Fps is (10 sec: 5705.6, 60 sec: 5667.8, 300 sec: 5661.4). Total num frames: 261617664. Throughput: 0: 5898.7. Samples: 261625918. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:38,116][25689] Avg episode reward: [(0, '-47.514')] [2022-07-09 12:48:39,512][26022] Updated weights on worker 0-0, policy_version 255493 (0.00088) [2022-07-09 12:48:41,088][26022] Updated weights on worker 0-0, policy_version 255503 (0.00091) [2022-07-09 12:48:43,003][26022] Updated weights on worker 0-0, policy_version 255513 (0.00083) [2022-07-09 12:48:43,138][25689] Fps is (10 sec: 5709.8, 60 sec: 5637.7, 300 sec: 5662.0). Total num frames: 261646336. Throughput: 0: 5047.5. Samples: 261643014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:43,139][25689] Avg episode reward: [(0, '-48.158')] [2022-07-09 12:48:44,631][26022] Updated weights on worker 0-0, policy_version 255523 (0.00088) [2022-07-09 12:48:46,449][26022] Updated weights on worker 0-0, policy_version 255533 (0.00094) [2022-07-09 12:48:48,182][25689] Fps is (10 sec: 5594.9, 60 sec: 5639.7, 300 sec: 5654.8). Total num frames: 261673984. Throughput: 0: 5931.7. Samples: 261677456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:48,182][25689] Avg episode reward: [(0, '-49.219')] [2022-07-09 12:48:48,352][26022] Updated weights on worker 0-0, policy_version 255543 (0.00084) [2022-07-09 12:48:49,943][26022] Updated weights on worker 0-0, policy_version 255553 (0.00086) [2022-07-09 12:48:52,040][26022] Updated weights on worker 0-0, policy_version 255563 (0.00085) [2022-07-09 12:48:53,206][25689] Fps is (10 sec: 5696.0, 60 sec: 5629.6, 300 sec: 5658.1). Total num frames: 261703680. Throughput: 0: 5937.2. Samples: 261711464. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:53,206][25689] Avg episode reward: [(0, '-49.589')] [2022-07-09 12:48:53,608][26022] Updated weights on worker 0-0, policy_version 255573 (0.00087) [2022-07-09 12:48:55,468][26022] Updated weights on worker 0-0, policy_version 255583 (0.00089) [2022-07-09 12:48:56,113][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:48:56,128][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000255587_261721088.pth [2022-07-09 12:48:56,129][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000253596_259682304.pth [2022-07-09 12:48:57,393][26022] Updated weights on worker 0-0, policy_version 255593 (0.00086) [2022-07-09 12:48:58,216][25689] Fps is (10 sec: 5714.9, 60 sec: 5631.8, 300 sec: 5658.0). Total num frames: 261731328. Throughput: 0: 5107.8. Samples: 261728588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:48:58,218][25689] Avg episode reward: [(0, '-49.202')] [2022-07-09 12:48:58,900][26022] Updated weights on worker 0-0, policy_version 255603 (0.00088) [2022-07-09 12:49:01,009][26022] Updated weights on worker 0-0, policy_version 255613 (0.00087) [2022-07-09 12:49:02,931][26022] Updated weights on worker 0-0, policy_version 255623 (0.00085) [2022-07-09 12:49:03,223][25689] Fps is (10 sec: 5520.2, 60 sec: 5656.3, 300 sec: 5655.6). Total num frames: 261758976. Throughput: 0: 5977.2. Samples: 261763064. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:03,224][25689] Avg episode reward: [(0, '-49.331')] [2022-07-09 12:49:04,746][26022] Updated weights on worker 0-0, policy_version 255633 (0.00092) [2022-07-09 12:49:06,653][26022] Updated weights on worker 0-0, policy_version 255643 (0.00058) [2022-07-09 12:49:08,279][25689] Fps is (10 sec: 5596.8, 60 sec: 5675.9, 300 sec: 5663.0). Total num frames: 261787648. Throughput: 0: 5844.2. Samples: 261794906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:08,282][25689] Avg episode reward: [(0, '-49.223')] [2022-07-09 12:49:08,342][26022] Updated weights on worker 0-0, policy_version 255653 (0.00093) [2022-07-09 12:49:10,360][26022] Updated weights on worker 0-0, policy_version 255663 (0.00093) [2022-07-09 12:49:11,917][26022] Updated weights on worker 0-0, policy_version 255673 (0.00097) [2022-07-09 12:49:13,288][25689] Fps is (10 sec: 5595.6, 60 sec: 5649.2, 300 sec: 5660.2). Total num frames: 261815296. Throughput: 0: 5005.2. Samples: 261811978. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:13,288][25689] Avg episode reward: [(0, '-50.406')] [2022-07-09 12:49:13,805][26022] Updated weights on worker 0-0, policy_version 255683 (0.00091) [2022-07-09 12:49:15,728][26022] Updated weights on worker 0-0, policy_version 255693 (0.00088) [2022-07-09 12:49:17,329][26022] Updated weights on worker 0-0, policy_version 255703 (0.00087) [2022-07-09 12:49:18,290][25689] Fps is (10 sec: 5625.7, 60 sec: 5615.8, 300 sec: 5657.9). Total num frames: 261843968. Throughput: 0: 5865.6. Samples: 261846334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:18,291][25689] Avg episode reward: [(0, '-50.449')] [2022-07-09 12:49:19,311][26022] Updated weights on worker 0-0, policy_version 255713 (0.00086) [2022-07-09 12:49:21,019][26022] Updated weights on worker 0-0, policy_version 255723 (0.00080) [2022-07-09 12:49:22,917][26022] Updated weights on worker 0-0, policy_version 255733 (0.00092) [2022-07-09 12:49:23,323][25689] Fps is (10 sec: 5714.3, 60 sec: 5648.0, 300 sec: 5659.5). Total num frames: 261872640. Throughput: 0: 5836.8. Samples: 261880386. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:23,323][25689] Avg episode reward: [(0, '-50.470')] [2022-07-09 12:49:24,739][26022] Updated weights on worker 0-0, policy_version 255743 (0.00081) [2022-07-09 12:49:26,576][26022] Updated weights on worker 0-0, policy_version 255753 (0.00090) [2022-07-09 12:49:28,326][26022] Updated weights on worker 0-0, policy_version 255763 (0.00117) [2022-07-09 12:49:28,436][25689] Fps is (10 sec: 5752.6, 60 sec: 5665.2, 300 sec: 5657.6). Total num frames: 261902336. Throughput: 0: 5086.0. Samples: 261897428. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:28,437][25689] Avg episode reward: [(0, '-50.644')] [2022-07-09 12:49:30,323][26022] Updated weights on worker 0-0, policy_version 255773 (0.00088) [2022-07-09 12:49:31,873][26022] Updated weights on worker 0-0, policy_version 255783 (0.00089) [2022-07-09 12:49:33,464][25689] Fps is (10 sec: 5654.5, 60 sec: 5648.5, 300 sec: 5654.8). Total num frames: 261929984. Throughput: 0: 5920.2. Samples: 261931426. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:33,465][25689] Avg episode reward: [(0, '-50.288')] [2022-07-09 12:49:33,806][26022] Updated weights on worker 0-0, policy_version 255793 (0.00090) [2022-07-09 12:49:35,687][26022] Updated weights on worker 0-0, policy_version 255803 (0.00095) [2022-07-09 12:49:37,431][26022] Updated weights on worker 0-0, policy_version 255813 (0.00094) [2022-07-09 12:49:38,488][25689] Fps is (10 sec: 5603.1, 60 sec: 5648.1, 300 sec: 5654.3). Total num frames: 261958656. Throughput: 0: 5893.1. Samples: 261965362. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:38,489][25689] Avg episode reward: [(0, '-50.121')] [2022-07-09 12:49:39,188][26022] Updated weights on worker 0-0, policy_version 255823 (0.00100) [2022-07-09 12:49:41,070][26022] Updated weights on worker 0-0, policy_version 255833 (0.00092) [2022-07-09 12:49:42,713][26022] Updated weights on worker 0-0, policy_version 255843 (0.00092) [2022-07-09 12:49:43,513][25689] Fps is (10 sec: 5604.7, 60 sec: 5631.0, 300 sec: 5649.4). Total num frames: 261986304. Throughput: 0: 5050.4. Samples: 261982354. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:43,513][25689] Avg episode reward: [(0, '-49.042')] [2022-07-09 12:49:44,674][26022] Updated weights on worker 0-0, policy_version 255853 (0.00086) [2022-07-09 12:49:46,297][26022] Updated weights on worker 0-0, policy_version 255863 (0.00086) [2022-07-09 12:49:48,166][26022] Updated weights on worker 0-0, policy_version 255873 (0.00101) [2022-07-09 12:49:48,630][25689] Fps is (10 sec: 5755.3, 60 sec: 5675.0, 300 sec: 5658.5). Total num frames: 262017024. Throughput: 0: 5908.3. Samples: 262016736. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:48,630][25689] Avg episode reward: [(0, '-49.023')] [2022-07-09 12:49:49,912][26022] Updated weights on worker 0-0, policy_version 255883 (0.00092) [2022-07-09 12:49:51,745][26022] Updated weights on worker 0-0, policy_version 255893 (0.00079) [2022-07-09 12:49:53,649][25689] Fps is (10 sec: 5556.2, 60 sec: 5607.6, 300 sec: 5645.2). Total num frames: 262042624. Throughput: 0: 5920.3. Samples: 262050928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:53,650][25689] Avg episode reward: [(0, '-49.111')] [2022-07-09 12:49:53,890][26022] Updated weights on worker 0-0, policy_version 255903 (0.00614) [2022-07-09 12:49:55,380][26022] Updated weights on worker 0-0, policy_version 255913 (0.00090) [2022-07-09 12:49:57,327][26022] Updated weights on worker 0-0, policy_version 255923 (0.00091) [2022-07-09 12:49:58,711][25689] Fps is (10 sec: 5688.3, 60 sec: 5670.6, 300 sec: 5658.1). Total num frames: 262074368. Throughput: 0: 5069.6. Samples: 262067882. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:49:58,711][25689] Avg episode reward: [(0, '-49.367')] [2022-07-09 12:49:58,855][26022] Updated weights on worker 0-0, policy_version 255933 (0.00098) [2022-07-09 12:50:00,990][26022] Updated weights on worker 0-0, policy_version 255943 (0.00078) [2022-07-09 12:50:03,084][26022] Updated weights on worker 0-0, policy_version 255953 (0.00093) [2022-07-09 12:50:03,728][25689] Fps is (10 sec: 5588.2, 60 sec: 5618.8, 300 sec: 5652.4). Total num frames: 262098944. Throughput: 0: 5886.7. Samples: 262101352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:03,728][25689] Avg episode reward: [(0, '-49.911')] [2022-07-09 12:50:04,794][26022] Updated weights on worker 0-0, policy_version 255963 (0.00086) [2022-07-09 12:50:06,696][26022] Updated weights on worker 0-0, policy_version 255973 (0.00082) [2022-07-09 12:50:08,432][26022] Updated weights on worker 0-0, policy_version 255983 (0.00094) [2022-07-09 12:50:08,782][25689] Fps is (10 sec: 5286.9, 60 sec: 5619.0, 300 sec: 5649.3). Total num frames: 262127616. Throughput: 0: 5812.0. Samples: 262133864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:08,783][25689] Avg episode reward: [(0, '-49.872')] [2022-07-09 12:50:10,219][26022] Updated weights on worker 0-0, policy_version 255993 (0.00097) [2022-07-09 12:50:11,985][26022] Updated weights on worker 0-0, policy_version 256003 (0.00089) [2022-07-09 12:50:13,792][25689] Fps is (10 sec: 5799.7, 60 sec: 5652.8, 300 sec: 5654.3). Total num frames: 262157312. Throughput: 0: 4956.2. Samples: 262150758. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:13,792][25689] Avg episode reward: [(0, '-49.503')] [2022-07-09 12:50:13,797][26022] Updated weights on worker 0-0, policy_version 256013 (0.00081) [2022-07-09 12:50:15,709][26022] Updated weights on worker 0-0, policy_version 256023 (0.00086) [2022-07-09 12:50:17,295][26022] Updated weights on worker 0-0, policy_version 256033 (0.00084) [2022-07-09 12:50:18,799][25689] Fps is (10 sec: 5725.0, 60 sec: 5635.4, 300 sec: 5651.1). Total num frames: 262184960. Throughput: 0: 5845.9. Samples: 262185314. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:18,800][25689] Avg episode reward: [(0, '-49.414')] [2022-07-09 12:50:19,223][26022] Updated weights on worker 0-0, policy_version 256043 (0.00102) [2022-07-09 12:50:20,922][26022] Updated weights on worker 0-0, policy_version 256053 (0.00085) [2022-07-09 12:50:22,794][26022] Updated weights on worker 0-0, policy_version 256063 (0.00094) [2022-07-09 12:50:23,806][25689] Fps is (10 sec: 5726.1, 60 sec: 5654.7, 300 sec: 5652.6). Total num frames: 262214656. Throughput: 0: 5902.0. Samples: 262219856. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:23,807][25689] Avg episode reward: [(0, '-49.160')] [2022-07-09 12:50:24,464][26022] Updated weights on worker 0-0, policy_version 256073 (0.00077) [2022-07-09 12:50:26,447][26022] Updated weights on worker 0-0, policy_version 256083 (0.00090) [2022-07-09 12:50:28,025][26022] Updated weights on worker 0-0, policy_version 256093 (0.00069) [2022-07-09 12:50:28,855][25689] Fps is (10 sec: 5804.1, 60 sec: 5643.8, 300 sec: 5659.0). Total num frames: 262243328. Throughput: 0: 5130.4. Samples: 262236844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:28,857][25689] Avg episode reward: [(0, '-48.075')] [2022-07-09 12:50:30,066][26022] Updated weights on worker 0-0, policy_version 256103 (0.00082) [2022-07-09 12:50:31,712][26022] Updated weights on worker 0-0, policy_version 256113 (0.00091) [2022-07-09 12:50:33,459][26022] Updated weights on worker 0-0, policy_version 256123 (0.00088) [2022-07-09 12:50:33,954][25689] Fps is (10 sec: 5651.2, 60 sec: 5654.1, 300 sec: 5654.8). Total num frames: 262272000. Throughput: 0: 5970.5. Samples: 262271134. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:33,954][25689] Avg episode reward: [(0, '-47.925')] [2022-07-09 12:50:35,298][26022] Updated weights on worker 0-0, policy_version 256133 (0.00087) [2022-07-09 12:50:37,056][26022] Updated weights on worker 0-0, policy_version 256143 (0.00083) [2022-07-09 12:50:38,979][25689] Fps is (10 sec: 5563.2, 60 sec: 5637.1, 300 sec: 5648.6). Total num frames: 262299648. Throughput: 0: 5960.9. Samples: 262305604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:38,979][25689] Avg episode reward: [(0, '-47.550')] [2022-07-09 12:50:39,074][26022] Updated weights on worker 0-0, policy_version 256153 (0.00083) [2022-07-09 12:50:40,724][26022] Updated weights on worker 0-0, policy_version 256163 (0.00092) [2022-07-09 12:50:42,514][26022] Updated weights on worker 0-0, policy_version 256173 (0.00084) [2022-07-09 12:50:44,070][25689] Fps is (10 sec: 5668.6, 60 sec: 5664.8, 300 sec: 5651.3). Total num frames: 262329344. Throughput: 0: 5066.4. Samples: 262322518. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:44,070][25689] Avg episode reward: [(0, '-47.652')] [2022-07-09 12:50:44,366][26022] Updated weights on worker 0-0, policy_version 256183 (0.00614) [2022-07-09 12:50:46,070][26022] Updated weights on worker 0-0, policy_version 256193 (0.00094) [2022-07-09 12:50:47,960][26022] Updated weights on worker 0-0, policy_version 256203 (0.00087) [2022-07-09 12:50:49,129][25689] Fps is (10 sec: 5952.3, 60 sec: 5670.2, 300 sec: 5657.2). Total num frames: 262360064. Throughput: 0: 5928.2. Samples: 262357030. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:49,129][25689] Avg episode reward: [(0, '-48.274')] [2022-07-09 12:50:49,637][26022] Updated weights on worker 0-0, policy_version 256213 (0.00085) [2022-07-09 12:50:51,553][26022] Updated weights on worker 0-0, policy_version 256223 (0.00085) [2022-07-09 12:50:53,183][26022] Updated weights on worker 0-0, policy_version 256233 (0.00086) [2022-07-09 12:50:54,231][25689] Fps is (10 sec: 5643.3, 60 sec: 5679.3, 300 sec: 5652.1). Total num frames: 262386688. Throughput: 0: 5935.3. Samples: 262391486. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:54,231][25689] Avg episode reward: [(0, '-48.043')] [2022-07-09 12:50:54,987][26022] Updated weights on worker 0-0, policy_version 256243 (0.00084) [2022-07-09 12:50:56,462][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:50:56,476][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000256251_262401024.pth [2022-07-09 12:50:56,476][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000254260_260362240.pth [2022-07-09 12:50:56,718][26022] Updated weights on worker 0-0, policy_version 256253 (0.00085) [2022-07-09 12:50:58,563][26022] Updated weights on worker 0-0, policy_version 256263 (0.00098) [2022-07-09 12:50:59,264][25689] Fps is (10 sec: 5557.0, 60 sec: 5648.2, 300 sec: 5651.9). Total num frames: 262416384. Throughput: 0: 5930.0. Samples: 262425894. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:50:59,264][25689] Avg episode reward: [(0, '-48.190')] [2022-07-09 12:51:00,462][26022] Updated weights on worker 0-0, policy_version 256273 (0.00087) [2022-07-09 12:51:02,671][26022] Updated weights on worker 0-0, policy_version 256283 (0.00101) [2022-07-09 12:51:04,276][25689] Fps is (10 sec: 5708.7, 60 sec: 5699.4, 300 sec: 5659.9). Total num frames: 262444032. Throughput: 0: 5904.2. Samples: 262441820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:51:04,276][25689] Avg episode reward: [(0, '-47.708')] [2022-07-09 12:51:04,278][26022] Updated weights on worker 0-0, policy_version 256293 (0.00084) [2022-07-09 12:51:06,191][26022] Updated weights on worker 0-0, policy_version 256303 (0.00089) [2022-07-09 12:51:07,949][26022] Updated weights on worker 0-0, policy_version 256313 (0.00083) [2022-07-09 12:51:09,409][25689] Fps is (10 sec: 5450.4, 60 sec: 5675.1, 300 sec: 5658.0). Total num frames: 262471680. Throughput: 0: 5821.7. Samples: 262475096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:51:09,409][25689] Avg episode reward: [(0, '-47.832')] [2022-07-09 12:51:09,881][26022] Updated weights on worker 0-0, policy_version 256323 (0.00089) [2022-07-09 12:51:11,527][26022] Updated weights on worker 0-0, policy_version 256333 (0.00089) [2022-07-09 12:51:13,517][26022] Updated weights on worker 0-0, policy_version 256343 (0.00084) [2022-07-09 12:51:14,412][25689] Fps is (10 sec: 5556.4, 60 sec: 5658.8, 300 sec: 5651.3). Total num frames: 262500352. Throughput: 0: 5831.5. Samples: 262509172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:51:14,413][25689] Avg episode reward: [(0, '-48.469')] [2022-07-09 12:51:15,129][26022] Updated weights on worker 0-0, policy_version 256353 (0.00085) [2022-07-09 12:51:17,082][26022] Updated weights on worker 0-0, policy_version 256363 (0.00367) [2022-07-09 12:51:18,800][26022] Updated weights on worker 0-0, policy_version 256373 (0.00088) [2022-07-09 12:51:19,450][25689] Fps is (10 sec: 5608.7, 60 sec: 5655.9, 300 sec: 5643.9). Total num frames: 262528000. Throughput: 0: 4978.9. Samples: 262526402. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:51:19,451][25689] Avg episode reward: [(0, '-48.038')] [2022-07-09 12:51:20,395][26022] Updated weights on worker 0-0, policy_version 256383 (0.00099) [2022-07-09 12:51:22,443][26022] Updated weights on worker 0-0, policy_version 256393 (0.00084) [2022-07-09 12:51:24,120][26022] Updated weights on worker 0-0, policy_version 256403 (0.00084) [2022-07-09 12:51:24,458][25689] Fps is (10 sec: 5708.0, 60 sec: 5655.9, 300 sec: 5651.8). Total num frames: 262557696. Throughput: 0: 5891.4. Samples: 262560722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:51:24,459][25689] Avg episode reward: [(0, '-48.142')] [2022-07-09 12:51:26,080][26022] Updated weights on worker 0-0, policy_version 256413 (0.00086) [2022-07-09 12:51:27,673][26022] Updated weights on worker 0-0, policy_version 256423 (0.00091) [2022-07-09 12:51:29,583][25689] Fps is (10 sec: 5760.5, 60 sec: 5648.8, 300 sec: 5649.7). Total num frames: 262586368. Throughput: 0: 5928.9. Samples: 262594706. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 12:51:29,583][25689] Avg episode reward: [(0, '-47.550')] [2022-07-09 12:51:29,613][26022] Updated weights on worker 0-0, policy_version 256433 (0.00086) [2022-07-09 12:51:31,457][26022] Updated weights on worker 0-0, policy_version 256443 (0.00095) [2022-07-09 12:51:33,301][26022] Updated weights on worker 0-0, policy_version 256453 (0.00087) [2022-07-09 12:51:34,606][25689] Fps is (10 sec: 5651.0, 60 sec: 5655.8, 300 sec: 5646.1). Total num frames: 262615040. Throughput: 0: 5085.6. Samples: 262611870. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:51:34,606][25689] Avg episode reward: [(0, '-48.102')] [2022-07-09 12:51:34,969][26022] Updated weights on worker 0-0, policy_version 256463 (0.00093) [2022-07-09 12:51:36,717][26022] Updated weights on worker 0-0, policy_version 256473 (0.00091) [2022-07-09 12:51:38,427][26022] Updated weights on worker 0-0, policy_version 256483 (0.00088) [2022-07-09 12:51:39,619][25689] Fps is (10 sec: 5713.6, 60 sec: 5673.8, 300 sec: 5653.0). Total num frames: 262643712. Throughput: 0: 5941.0. Samples: 262646226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:51:39,620][25689] Avg episode reward: [(0, '-47.042')] [2022-07-09 12:51:40,466][26022] Updated weights on worker 0-0, policy_version 256493 (0.00091) [2022-07-09 12:51:42,216][26022] Updated weights on worker 0-0, policy_version 256503 (0.00092) [2022-07-09 12:51:43,998][26022] Updated weights on worker 0-0, policy_version 256513 (0.00088) [2022-07-09 12:51:44,682][25689] Fps is (10 sec: 5691.3, 60 sec: 5659.6, 300 sec: 5653.3). Total num frames: 262672384. Throughput: 0: 5908.3. Samples: 262680208. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:51:44,682][25689] Avg episode reward: [(0, '-46.958')] [2022-07-09 12:51:45,846][26022] Updated weights on worker 0-0, policy_version 256523 (0.00098) [2022-07-09 12:51:47,390][26022] Updated weights on worker 0-0, policy_version 256533 (0.00098) [2022-07-09 12:51:49,561][26022] Updated weights on worker 0-0, policy_version 256543 (0.00082) [2022-07-09 12:51:49,815][25689] Fps is (10 sec: 5624.4, 60 sec: 5618.9, 300 sec: 5647.7). Total num frames: 262701056. Throughput: 0: 5075.6. Samples: 262697396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:51:49,815][25689] Avg episode reward: [(0, '-47.098')] [2022-07-09 12:51:51,019][26022] Updated weights on worker 0-0, policy_version 256553 (0.00092) [2022-07-09 12:51:52,928][26022] Updated weights on worker 0-0, policy_version 256563 (0.00082) [2022-07-09 12:51:54,780][26022] Updated weights on worker 0-0, policy_version 256573 (0.00087) [2022-07-09 12:51:54,910][25689] Fps is (10 sec: 5706.2, 60 sec: 5670.2, 300 sec: 5654.1). Total num frames: 262730752. Throughput: 0: 5904.3. Samples: 262731754. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:51:54,911][25689] Avg episode reward: [(0, '-47.418')] [2022-07-09 12:51:56,490][26022] Updated weights on worker 0-0, policy_version 256583 (0.00094) [2022-07-09 12:51:58,429][26022] Updated weights on worker 0-0, policy_version 256593 (0.00092) [2022-07-09 12:52:00,003][25689] Fps is (10 sec: 5829.3, 60 sec: 5664.6, 300 sec: 5657.3). Total num frames: 262760448. Throughput: 0: 5888.8. Samples: 262766264. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:00,004][25689] Avg episode reward: [(0, '-47.559')] [2022-07-09 12:52:00,127][26022] Updated weights on worker 0-0, policy_version 256603 (0.00084) [2022-07-09 12:52:02,323][26022] Updated weights on worker 0-0, policy_version 256613 (0.00080) [2022-07-09 12:52:04,111][26022] Updated weights on worker 0-0, policy_version 256623 (0.00081) [2022-07-09 12:52:05,036][25689] Fps is (10 sec: 5562.1, 60 sec: 5645.8, 300 sec: 5662.7). Total num frames: 262787072. Throughput: 0: 4965.1. Samples: 262781270. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:05,036][25689] Avg episode reward: [(0, '-47.817')] [2022-07-09 12:52:05,925][26022] Updated weights on worker 0-0, policy_version 256633 (0.00096) [2022-07-09 12:52:07,713][26022] Updated weights on worker 0-0, policy_version 256643 (0.00087) [2022-07-09 12:52:09,564][26022] Updated weights on worker 0-0, policy_version 256653 (0.00100) [2022-07-09 12:52:10,162][25689] Fps is (10 sec: 5543.8, 60 sec: 5680.1, 300 sec: 5660.5). Total num frames: 262816768. Throughput: 0: 5796.0. Samples: 262815330. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:10,163][25689] Avg episode reward: [(0, '-47.944')] [2022-07-09 12:52:11,359][26022] Updated weights on worker 0-0, policy_version 256663 (0.00078) [2022-07-09 12:52:13,081][26022] Updated weights on worker 0-0, policy_version 256673 (0.00087) [2022-07-09 12:52:14,757][26022] Updated weights on worker 0-0, policy_version 256683 (0.00090) [2022-07-09 12:52:15,248][25689] Fps is (10 sec: 5615.2, 60 sec: 5655.6, 300 sec: 5655.5). Total num frames: 262844416. Throughput: 0: 5776.0. Samples: 262849226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:15,249][25689] Avg episode reward: [(0, '-47.982')] [2022-07-09 12:52:16,624][26022] Updated weights on worker 0-0, policy_version 256693 (0.00086) [2022-07-09 12:52:18,553][26022] Updated weights on worker 0-0, policy_version 256703 (0.00085) [2022-07-09 12:52:20,307][25689] Fps is (10 sec: 5551.7, 60 sec: 5670.5, 300 sec: 5658.5). Total num frames: 262873088. Throughput: 0: 4927.0. Samples: 262866300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:20,307][25689] Avg episode reward: [(0, '-48.458')] [2022-07-09 12:52:20,413][26022] Updated weights on worker 0-0, policy_version 256713 (0.00083) [2022-07-09 12:52:22,107][26022] Updated weights on worker 0-0, policy_version 256723 (0.00087) [2022-07-09 12:52:23,987][26022] Updated weights on worker 0-0, policy_version 256733 (0.00088) [2022-07-09 12:52:25,346][25689] Fps is (10 sec: 5780.2, 60 sec: 5667.6, 300 sec: 5655.9). Total num frames: 262902784. Throughput: 0: 5869.5. Samples: 262900480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:25,347][25689] Avg episode reward: [(0, '-48.905')] [2022-07-09 12:52:25,755][26022] Updated weights on worker 0-0, policy_version 256743 (0.00097) [2022-07-09 12:52:27,568][26022] Updated weights on worker 0-0, policy_version 256753 (0.00091) [2022-07-09 12:52:29,453][26022] Updated weights on worker 0-0, policy_version 256763 (0.00085) [2022-07-09 12:52:30,451][25689] Fps is (10 sec: 5754.1, 60 sec: 5669.5, 300 sec: 5661.1). Total num frames: 262931456. Throughput: 0: 5861.1. Samples: 262934242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:30,451][25689] Avg episode reward: [(0, '-49.461')] [2022-07-09 12:52:31,364][26022] Updated weights on worker 0-0, policy_version 256773 (0.00086) [2022-07-09 12:52:33,110][26022] Updated weights on worker 0-0, policy_version 256783 (0.00094) [2022-07-09 12:52:34,983][26022] Updated weights on worker 0-0, policy_version 256793 (0.00084) [2022-07-09 12:52:35,469][25689] Fps is (10 sec: 5361.1, 60 sec: 5619.4, 300 sec: 5648.1). Total num frames: 262957056. Throughput: 0: 5867.8. Samples: 262967880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:35,470][25689] Avg episode reward: [(0, '-50.425')] [2022-07-09 12:52:36,647][26022] Updated weights on worker 0-0, policy_version 256803 (0.00085) [2022-07-09 12:52:38,589][26022] Updated weights on worker 0-0, policy_version 256813 (0.00083) [2022-07-09 12:52:40,240][26022] Updated weights on worker 0-0, policy_version 256823 (0.00089) [2022-07-09 12:52:40,484][25689] Fps is (10 sec: 5613.2, 60 sec: 5652.9, 300 sec: 5648.9). Total num frames: 262987776. Throughput: 0: 5873.1. Samples: 262984802. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:40,485][25689] Avg episode reward: [(0, '-50.534')] [2022-07-09 12:52:42,039][26022] Updated weights on worker 0-0, policy_version 256833 (0.00087) [2022-07-09 12:52:43,925][26022] Updated weights on worker 0-0, policy_version 256843 (0.00089) [2022-07-09 12:52:45,491][25689] Fps is (10 sec: 5926.2, 60 sec: 5658.0, 300 sec: 5653.5). Total num frames: 263016448. Throughput: 0: 5885.0. Samples: 263019034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:45,492][25689] Avg episode reward: [(0, '-50.841')] [2022-07-09 12:52:45,575][26022] Updated weights on worker 0-0, policy_version 256853 (0.00095) [2022-07-09 12:52:47,749][26022] Updated weights on worker 0-0, policy_version 256863 (0.00084) [2022-07-09 12:52:49,142][26022] Updated weights on worker 0-0, policy_version 256873 (0.00092) [2022-07-09 12:52:50,557][25689] Fps is (10 sec: 5591.3, 60 sec: 5647.5, 300 sec: 5643.8). Total num frames: 263044096. Throughput: 0: 5916.0. Samples: 263053190. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:50,557][25689] Avg episode reward: [(0, '-50.291')] [2022-07-09 12:52:51,144][26022] Updated weights on worker 0-0, policy_version 256883 (0.00091) [2022-07-09 12:52:52,934][26022] Updated weights on worker 0-0, policy_version 256893 (0.01290) [2022-07-09 12:52:54,489][26022] Updated weights on worker 0-0, policy_version 256903 (0.00086) [2022-07-09 12:52:55,561][25689] Fps is (10 sec: 5593.0, 60 sec: 5639.1, 300 sec: 5647.8). Total num frames: 263072768. Throughput: 0: 5108.7. Samples: 263070522. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:52:55,561][25689] Avg episode reward: [(0, '-49.724')] [2022-07-09 12:52:56,679][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:52:56,694][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000256913_263078912.pth [2022-07-09 12:52:56,695][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000254923_261041152.pth [2022-07-09 12:52:56,699][26022] Updated weights on worker 0-0, policy_version 256913 (0.00091) [2022-07-09 12:52:58,314][26022] Updated weights on worker 0-0, policy_version 256923 (0.00092) [2022-07-09 12:53:00,065][26022] Updated weights on worker 0-0, policy_version 256933 (0.00093) [2022-07-09 12:53:00,569][25689] Fps is (10 sec: 5727.4, 60 sec: 5630.1, 300 sec: 5656.2). Total num frames: 263101440. Throughput: 0: 5954.5. Samples: 263104398. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:00,571][25689] Avg episode reward: [(0, '-49.277')] [2022-07-09 12:53:02,280][26022] Updated weights on worker 0-0, policy_version 256943 (0.00090) [2022-07-09 12:53:04,164][26022] Updated weights on worker 0-0, policy_version 256953 (0.00096) [2022-07-09 12:53:05,576][25689] Fps is (10 sec: 5521.3, 60 sec: 5632.5, 300 sec: 5654.2). Total num frames: 263128064. Throughput: 0: 5843.1. Samples: 263136392. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:05,577][25689] Avg episode reward: [(0, '-49.182')] [2022-07-09 12:53:06,014][26022] Updated weights on worker 0-0, policy_version 256963 (0.00088) [2022-07-09 12:53:07,900][26022] Updated weights on worker 0-0, policy_version 256973 (0.00083) [2022-07-09 12:53:09,496][26022] Updated weights on worker 0-0, policy_version 256983 (0.00085) [2022-07-09 12:53:10,639][25689] Fps is (10 sec: 5389.7, 60 sec: 5604.6, 300 sec: 5647.8). Total num frames: 263155712. Throughput: 0: 4993.4. Samples: 263153466. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:10,639][25689] Avg episode reward: [(0, '-47.688')] [2022-07-09 12:53:11,434][26022] Updated weights on worker 0-0, policy_version 256993 (0.00085) [2022-07-09 12:53:13,245][26022] Updated weights on worker 0-0, policy_version 257003 (0.00088) [2022-07-09 12:53:15,145][26022] Updated weights on worker 0-0, policy_version 257013 (0.00113) [2022-07-09 12:53:15,645][25689] Fps is (10 sec: 5694.9, 60 sec: 5645.8, 300 sec: 5644.4). Total num frames: 263185408. Throughput: 0: 5823.3. Samples: 263187480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:15,646][25689] Avg episode reward: [(0, '-48.047')] [2022-07-09 12:53:16,848][26022] Updated weights on worker 0-0, policy_version 257023 (0.00085) [2022-07-09 12:53:18,508][26022] Updated weights on worker 0-0, policy_version 257033 (0.00084) [2022-07-09 12:53:20,295][26022] Updated weights on worker 0-0, policy_version 257043 (0.00099) [2022-07-09 12:53:20,654][25689] Fps is (10 sec: 5827.8, 60 sec: 5650.5, 300 sec: 5651.3). Total num frames: 263214080. Throughput: 0: 5839.4. Samples: 263221682. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:20,655][25689] Avg episode reward: [(0, '-48.437')] [2022-07-09 12:53:22,131][26022] Updated weights on worker 0-0, policy_version 257053 (0.00092) [2022-07-09 12:53:23,911][26022] Updated weights on worker 0-0, policy_version 257063 (0.00091) [2022-07-09 12:53:25,665][25689] Fps is (10 sec: 5620.9, 60 sec: 5619.2, 300 sec: 5649.9). Total num frames: 263241728. Throughput: 0: 5097.3. Samples: 263238792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:25,666][25689] Avg episode reward: [(0, '-48.221')] [2022-07-09 12:53:25,782][26022] Updated weights on worker 0-0, policy_version 257073 (0.00081) [2022-07-09 12:53:27,653][26022] Updated weights on worker 0-0, policy_version 257083 (0.00087) [2022-07-09 12:53:29,398][26022] Updated weights on worker 0-0, policy_version 257093 (0.00086) [2022-07-09 12:53:30,756][25689] Fps is (10 sec: 5575.4, 60 sec: 5620.5, 300 sec: 5648.7). Total num frames: 263270400. Throughput: 0: 5927.4. Samples: 263272708. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:30,756][25689] Avg episode reward: [(0, '-47.614')] [2022-07-09 12:53:31,282][26022] Updated weights on worker 0-0, policy_version 257103 (0.00082) [2022-07-09 12:53:32,933][26022] Updated weights on worker 0-0, policy_version 257113 (0.00092) [2022-07-09 12:53:34,709][26022] Updated weights on worker 0-0, policy_version 257123 (0.00086) [2022-07-09 12:53:35,835][25689] Fps is (10 sec: 5638.7, 60 sec: 5665.7, 300 sec: 5647.6). Total num frames: 263299072. Throughput: 0: 5921.5. Samples: 263307032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:35,835][25689] Avg episode reward: [(0, '-48.640')] [2022-07-09 12:53:36,753][26022] Updated weights on worker 0-0, policy_version 257133 (0.00097) [2022-07-09 12:53:38,398][26022] Updated weights on worker 0-0, policy_version 257143 (0.00095) [2022-07-09 12:53:40,356][26022] Updated weights on worker 0-0, policy_version 257153 (0.00092) [2022-07-09 12:53:40,839][25689] Fps is (10 sec: 5687.1, 60 sec: 5632.8, 300 sec: 5648.0). Total num frames: 263327744. Throughput: 0: 5073.2. Samples: 263324084. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:40,839][25689] Avg episode reward: [(0, '-48.433')] [2022-07-09 12:53:41,982][26022] Updated weights on worker 0-0, policy_version 257163 (0.00084) [2022-07-09 12:53:43,904][26022] Updated weights on worker 0-0, policy_version 257173 (0.00363) [2022-07-09 12:53:45,520][26022] Updated weights on worker 0-0, policy_version 257183 (0.00085) [2022-07-09 12:53:45,841][25689] Fps is (10 sec: 5833.4, 60 sec: 5650.3, 300 sec: 5655.7). Total num frames: 263357440. Throughput: 0: 5920.4. Samples: 263358240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:45,841][25689] Avg episode reward: [(0, '-48.436')] [2022-07-09 12:53:47,500][26022] Updated weights on worker 0-0, policy_version 257193 (0.00056) [2022-07-09 12:53:49,475][26022] Updated weights on worker 0-0, policy_version 257203 (0.00091) [2022-07-09 12:53:50,957][25689] Fps is (10 sec: 5667.4, 60 sec: 5645.6, 300 sec: 5647.0). Total num frames: 263385088. Throughput: 0: 5911.4. Samples: 263392126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 12:53:50,957][25689] Avg episode reward: [(0, '-48.032')] [2022-07-09 12:53:51,009][26022] Updated weights on worker 0-0, policy_version 257213 (0.00094) [2022-07-09 12:53:53,214][26022] Updated weights on worker 0-0, policy_version 257223 (0.00092) [2022-07-09 12:53:54,752][26022] Updated weights on worker 0-0, policy_version 257233 (0.00086) [2022-07-09 12:53:55,968][25689] Fps is (10 sec: 5460.3, 60 sec: 5628.0, 300 sec: 5647.0). Total num frames: 263412736. Throughput: 0: 5043.1. Samples: 263408564. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:53:55,969][25689] Avg episode reward: [(0, '-48.844')] [2022-07-09 12:53:56,723][26022] Updated weights on worker 0-0, policy_version 257243 (0.00085) [2022-07-09 12:53:58,576][26022] Updated weights on worker 0-0, policy_version 257253 (0.00090) [2022-07-09 12:54:00,054][26022] Updated weights on worker 0-0, policy_version 257263 (0.00085) [2022-07-09 12:54:00,983][25689] Fps is (10 sec: 5617.2, 60 sec: 5627.3, 300 sec: 5650.3). Total num frames: 263441408. Throughput: 0: 5893.1. Samples: 263442798. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:00,984][25689] Avg episode reward: [(0, '-48.198')] [2022-07-09 12:54:02,496][26022] Updated weights on worker 0-0, policy_version 257273 (0.00088) [2022-07-09 12:54:04,163][26022] Updated weights on worker 0-0, policy_version 257283 (0.00084) [2022-07-09 12:54:05,961][26022] Updated weights on worker 0-0, policy_version 257293 (0.00086) [2022-07-09 12:54:06,028][25689] Fps is (10 sec: 5598.1, 60 sec: 5640.7, 300 sec: 5647.1). Total num frames: 263469056. Throughput: 0: 5781.8. Samples: 263474960. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:06,029][25689] Avg episode reward: [(0, '-49.683')] [2022-07-09 12:54:07,876][26022] Updated weights on worker 0-0, policy_version 257303 (0.00091) [2022-07-09 12:54:09,519][26022] Updated weights on worker 0-0, policy_version 257313 (0.00508) [2022-07-09 12:54:11,147][25689] Fps is (10 sec: 5339.8, 60 sec: 5618.5, 300 sec: 5641.5). Total num frames: 263495680. Throughput: 0: 4938.8. Samples: 263491840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:11,156][25689] Avg episode reward: [(0, '-49.564')] [2022-07-09 12:54:11,410][26022] Updated weights on worker 0-0, policy_version 257323 (0.00103) [2022-07-09 12:54:13,125][26022] Updated weights on worker 0-0, policy_version 257333 (0.00085) [2022-07-09 12:54:14,855][26022] Updated weights on worker 0-0, policy_version 257343 (0.00086) [2022-07-09 12:54:16,169][25689] Fps is (10 sec: 5553.6, 60 sec: 5617.1, 300 sec: 5644.6). Total num frames: 263525376. Throughput: 0: 5828.3. Samples: 263526306. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:16,170][25689] Avg episode reward: [(0, '-48.815')] [2022-07-09 12:54:16,794][26022] Updated weights on worker 0-0, policy_version 257353 (0.00089) [2022-07-09 12:54:18,333][26022] Updated weights on worker 0-0, policy_version 257363 (0.00085) [2022-07-09 12:54:20,402][26022] Updated weights on worker 0-0, policy_version 257373 (0.00093) [2022-07-09 12:54:21,183][25689] Fps is (10 sec: 5815.6, 60 sec: 5616.6, 300 sec: 5644.9). Total num frames: 263554048. Throughput: 0: 5839.4. Samples: 263560754. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:21,184][25689] Avg episode reward: [(0, '-49.048')] [2022-07-09 12:54:22,081][26022] Updated weights on worker 0-0, policy_version 257383 (0.00084) [2022-07-09 12:54:23,927][26022] Updated weights on worker 0-0, policy_version 257393 (0.00088) [2022-07-09 12:54:25,676][26022] Updated weights on worker 0-0, policy_version 257403 (0.00093) [2022-07-09 12:54:26,204][25689] Fps is (10 sec: 5816.5, 60 sec: 5649.5, 300 sec: 5646.7). Total num frames: 263583744. Throughput: 0: 5108.4. Samples: 263578028. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:26,206][25689] Avg episode reward: [(0, '-50.202')] [2022-07-09 12:54:27,524][26022] Updated weights on worker 0-0, policy_version 257413 (0.00087) [2022-07-09 12:54:29,142][26022] Updated weights on worker 0-0, policy_version 257423 (0.00091) [2022-07-09 12:54:31,012][26022] Updated weights on worker 0-0, policy_version 257433 (0.00092) [2022-07-09 12:54:31,310][25689] Fps is (10 sec: 5763.7, 60 sec: 5648.1, 300 sec: 5648.7). Total num frames: 263612416. Throughput: 0: 5981.9. Samples: 263612454. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:31,311][25689] Avg episode reward: [(0, '-49.352')] [2022-07-09 12:54:32,787][26022] Updated weights on worker 0-0, policy_version 257443 (0.00093) [2022-07-09 12:54:34,653][26022] Updated weights on worker 0-0, policy_version 257453 (0.00091) [2022-07-09 12:54:36,336][25689] Fps is (10 sec: 5760.8, 60 sec: 5670.0, 300 sec: 5652.1). Total num frames: 263642112. Throughput: 0: 5972.0. Samples: 263646742. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:36,337][25689] Avg episode reward: [(0, '-48.627')] [2022-07-09 12:54:36,341][26022] Updated weights on worker 0-0, policy_version 257463 (0.00095) [2022-07-09 12:54:38,234][26022] Updated weights on worker 0-0, policy_version 257473 (0.00083) [2022-07-09 12:54:39,713][26022] Updated weights on worker 0-0, policy_version 257483 (0.00086) [2022-07-09 12:54:41,367][25689] Fps is (10 sec: 5702.3, 60 sec: 5650.6, 300 sec: 5652.0). Total num frames: 263669760. Throughput: 0: 5113.3. Samples: 263663956. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:41,367][25689] Avg episode reward: [(0, '-48.045')] [2022-07-09 12:54:41,961][26022] Updated weights on worker 0-0, policy_version 257493 (0.00091) [2022-07-09 12:54:43,399][26022] Updated weights on worker 0-0, policy_version 257503 (0.00097) [2022-07-09 12:54:45,394][26022] Updated weights on worker 0-0, policy_version 257513 (0.00086) [2022-07-09 12:54:46,399][25689] Fps is (10 sec: 5597.1, 60 sec: 5630.9, 300 sec: 5646.7). Total num frames: 263698432. Throughput: 0: 5969.2. Samples: 263698572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:46,399][25689] Avg episode reward: [(0, '-47.533')] [2022-07-09 12:54:47,004][26022] Updated weights on worker 0-0, policy_version 257523 (0.00091) [2022-07-09 12:54:49,111][26022] Updated weights on worker 0-0, policy_version 257533 (0.00085) [2022-07-09 12:54:50,809][26022] Updated weights on worker 0-0, policy_version 257543 (0.00086) [2022-07-09 12:54:51,455][25689] Fps is (10 sec: 5684.0, 60 sec: 5653.3, 300 sec: 5656.3). Total num frames: 263727104. Throughput: 0: 5957.8. Samples: 263732474. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:51,456][25689] Avg episode reward: [(0, '-47.045')] [2022-07-09 12:54:52,483][26022] Updated weights on worker 0-0, policy_version 257553 (0.00090) [2022-07-09 12:54:54,211][26022] Updated weights on worker 0-0, policy_version 257563 (0.00089) [2022-07-09 12:54:56,192][26022] Updated weights on worker 0-0, policy_version 257573 (0.00087) [2022-07-09 12:54:56,469][25689] Fps is (10 sec: 5694.3, 60 sec: 5670.0, 300 sec: 5646.9). Total num frames: 263755776. Throughput: 0: 5125.0. Samples: 263749922. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:54:56,469][25689] Avg episode reward: [(0, '-46.728')] [2022-07-09 12:54:56,829][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:54:56,840][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000257577_263758848.pth [2022-07-09 12:54:56,841][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000255587_261721088.pth [2022-07-09 12:54:58,008][26022] Updated weights on worker 0-0, policy_version 257583 (0.00089) [2022-07-09 12:54:59,519][26022] Updated weights on worker 0-0, policy_version 257593 (0.00086) [2022-07-09 12:55:01,473][25689] Fps is (10 sec: 5724.0, 60 sec: 5671.0, 300 sec: 5660.9). Total num frames: 263784448. Throughput: 0: 5978.2. Samples: 263784158. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:01,474][25689] Avg episode reward: [(0, '-46.803')] [2022-07-09 12:55:01,589][26022] Updated weights on worker 0-0, policy_version 257603 (0.00086) [2022-07-09 12:55:03,582][26022] Updated weights on worker 0-0, policy_version 257613 (0.00084) [2022-07-09 12:55:05,538][26022] Updated weights on worker 0-0, policy_version 257623 (0.00067) [2022-07-09 12:55:06,481][25689] Fps is (10 sec: 5522.7, 60 sec: 5657.5, 300 sec: 5654.9). Total num frames: 263811072. Throughput: 0: 5855.0. Samples: 263816158. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:06,482][25689] Avg episode reward: [(0, '-48.234')] [2022-07-09 12:55:07,293][26022] Updated weights on worker 0-0, policy_version 257633 (0.00098) [2022-07-09 12:55:08,995][26022] Updated weights on worker 0-0, policy_version 257643 (0.00095) [2022-07-09 12:55:10,981][26022] Updated weights on worker 0-0, policy_version 257653 (0.00085) [2022-07-09 12:55:11,557][25689] Fps is (10 sec: 5483.8, 60 sec: 5695.5, 300 sec: 5650.2). Total num frames: 263839744. Throughput: 0: 5009.2. Samples: 263833168. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:11,557][25689] Avg episode reward: [(0, '-49.217')] [2022-07-09 12:55:12,686][26022] Updated weights on worker 0-0, policy_version 257663 (0.00101) [2022-07-09 12:55:14,479][26022] Updated weights on worker 0-0, policy_version 257673 (0.00084) [2022-07-09 12:55:16,380][26022] Updated weights on worker 0-0, policy_version 257683 (0.00085) [2022-07-09 12:55:16,631][25689] Fps is (10 sec: 5650.0, 60 sec: 5673.7, 300 sec: 5652.4). Total num frames: 263868416. Throughput: 0: 5834.2. Samples: 263867550. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:16,631][25689] Avg episode reward: [(0, '-50.157')] [2022-07-09 12:55:17,976][26022] Updated weights on worker 0-0, policy_version 257693 (0.00090) [2022-07-09 12:55:19,978][26022] Updated weights on worker 0-0, policy_version 257703 (0.00085) [2022-07-09 12:55:21,583][26022] Updated weights on worker 0-0, policy_version 257713 (0.00086) [2022-07-09 12:55:21,678][25689] Fps is (10 sec: 5767.1, 60 sec: 5687.5, 300 sec: 5651.6). Total num frames: 263898112. Throughput: 0: 5821.6. Samples: 263901780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:21,678][25689] Avg episode reward: [(0, '-50.237')] [2022-07-09 12:55:23,359][26022] Updated weights on worker 0-0, policy_version 257723 (0.00090) [2022-07-09 12:55:25,373][26022] Updated weights on worker 0-0, policy_version 257733 (0.00088) [2022-07-09 12:55:26,694][25689] Fps is (10 sec: 5800.0, 60 sec: 5671.0, 300 sec: 5652.2). Total num frames: 263926784. Throughput: 0: 5936.3. Samples: 263936148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:26,695][25689] Avg episode reward: [(0, '-49.368')] [2022-07-09 12:55:26,811][26022] Updated weights on worker 0-0, policy_version 257743 (0.00389) [2022-07-09 12:55:28,919][26022] Updated weights on worker 0-0, policy_version 257753 (0.00087) [2022-07-09 12:55:30,440][26022] Updated weights on worker 0-0, policy_version 257763 (0.00092) [2022-07-09 12:55:31,776][25689] Fps is (10 sec: 5577.3, 60 sec: 5656.4, 300 sec: 5649.1). Total num frames: 263954432. Throughput: 0: 5933.0. Samples: 263953128. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:31,776][25689] Avg episode reward: [(0, '-50.023')] [2022-07-09 12:55:32,564][26022] Updated weights on worker 0-0, policy_version 257773 (0.00090) [2022-07-09 12:55:34,299][26022] Updated weights on worker 0-0, policy_version 257783 (0.00093) [2022-07-09 12:55:35,922][26022] Updated weights on worker 0-0, policy_version 257793 (0.00092) [2022-07-09 12:55:36,782][25689] Fps is (10 sec: 5582.9, 60 sec: 5641.3, 300 sec: 5652.9). Total num frames: 263983104. Throughput: 0: 5953.3. Samples: 263987518. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:36,783][25689] Avg episode reward: [(0, '-49.389')] [2022-07-09 12:55:37,680][26022] Updated weights on worker 0-0, policy_version 257803 (0.00090) [2022-07-09 12:55:39,660][26022] Updated weights on worker 0-0, policy_version 257813 (0.00089) [2022-07-09 12:55:41,239][26022] Updated weights on worker 0-0, policy_version 257823 (0.00086) [2022-07-09 12:55:41,794][25689] Fps is (10 sec: 5826.1, 60 sec: 5676.9, 300 sec: 5654.4). Total num frames: 264012800. Throughput: 0: 5967.8. Samples: 264021832. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:41,795][25689] Avg episode reward: [(0, '-48.108')] [2022-07-09 12:55:43,282][26022] Updated weights on worker 0-0, policy_version 257833 (0.00084) [2022-07-09 12:55:44,840][26022] Updated weights on worker 0-0, policy_version 257843 (0.00095) [2022-07-09 12:55:46,796][25689] Fps is (10 sec: 5726.6, 60 sec: 5662.8, 300 sec: 5645.2). Total num frames: 264040448. Throughput: 0: 5126.0. Samples: 264039188. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:46,796][25689] Avg episode reward: [(0, '-49.666')] [2022-07-09 12:55:46,966][26022] Updated weights on worker 0-0, policy_version 257853 (0.00050) [2022-07-09 12:55:48,615][26022] Updated weights on worker 0-0, policy_version 257863 (0.00082) [2022-07-09 12:55:50,522][26022] Updated weights on worker 0-0, policy_version 257873 (0.00096) [2022-07-09 12:55:51,883][25689] Fps is (10 sec: 5785.4, 60 sec: 5693.8, 300 sec: 5659.2). Total num frames: 264071168. Throughput: 0: 5966.5. Samples: 264073098. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:51,884][25689] Avg episode reward: [(0, '-49.165')] [2022-07-09 12:55:52,090][26022] Updated weights on worker 0-0, policy_version 257883 (0.00096) [2022-07-09 12:55:54,112][26022] Updated weights on worker 0-0, policy_version 257893 (0.00091) [2022-07-09 12:55:55,854][26022] Updated weights on worker 0-0, policy_version 257903 (0.00081) [2022-07-09 12:55:56,946][25689] Fps is (10 sec: 5750.6, 60 sec: 5672.2, 300 sec: 5651.8). Total num frames: 264098816. Throughput: 0: 5939.7. Samples: 264107284. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:55:56,946][25689] Avg episode reward: [(0, '-49.990')] [2022-07-09 12:55:57,545][26022] Updated weights on worker 0-0, policy_version 257913 (0.00084) [2022-07-09 12:55:59,459][26022] Updated weights on worker 0-0, policy_version 257923 (0.00089) [2022-07-09 12:56:01,286][26022] Updated weights on worker 0-0, policy_version 257933 (0.00087) [2022-07-09 12:56:01,983][25689] Fps is (10 sec: 5373.7, 60 sec: 5635.4, 300 sec: 5647.9). Total num frames: 264125440. Throughput: 0: 5088.1. Samples: 264124552. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:56:01,983][25689] Avg episode reward: [(0, '-49.305')] [2022-07-09 12:56:03,436][26022] Updated weights on worker 0-0, policy_version 257943 (0.00087) [2022-07-09 12:56:05,110][26022] Updated weights on worker 0-0, policy_version 257953 (0.00093) [2022-07-09 12:56:06,893][26022] Updated weights on worker 0-0, policy_version 257963 (0.00089) [2022-07-09 12:56:06,991][25689] Fps is (10 sec: 5504.7, 60 sec: 5669.2, 300 sec: 5653.7). Total num frames: 264154112. Throughput: 0: 5820.4. Samples: 264156732. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:56:06,992][25689] Avg episode reward: [(0, '-49.703')] [2022-07-09 12:56:08,910][26022] Updated weights on worker 0-0, policy_version 257973 (0.00090) [2022-07-09 12:56:10,437][26022] Updated weights on worker 0-0, policy_version 257983 (0.00086) [2022-07-09 12:56:12,047][25689] Fps is (10 sec: 5697.9, 60 sec: 5671.0, 300 sec: 5652.7). Total num frames: 264182784. Throughput: 0: 5846.0. Samples: 264190974. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 12:56:12,047][25689] Avg episode reward: [(0, '-48.761')] [2022-07-09 12:56:12,571][26022] Updated weights on worker 0-0, policy_version 257993 (0.00087) [2022-07-09 12:56:13,975][26022] Updated weights on worker 0-0, policy_version 258003 (0.00083) [2022-07-09 12:56:16,159][26022] Updated weights on worker 0-0, policy_version 258013 (0.00088) [2022-07-09 12:56:17,066][25689] Fps is (10 sec: 5691.7, 60 sec: 5676.2, 300 sec: 5656.5). Total num frames: 264211456. Throughput: 0: 5013.3. Samples: 264208152. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:17,066][25689] Avg episode reward: [(0, '-48.678')] [2022-07-09 12:56:17,506][26022] Updated weights on worker 0-0, policy_version 258023 (0.00086) [2022-07-09 12:56:19,560][26022] Updated weights on worker 0-0, policy_version 258033 (0.00093) [2022-07-09 12:56:21,292][26022] Updated weights on worker 0-0, policy_version 258043 (0.00093) [2022-07-09 12:56:22,091][25689] Fps is (10 sec: 5811.2, 60 sec: 5678.3, 300 sec: 5656.2). Total num frames: 264241152. Throughput: 0: 5863.6. Samples: 264242458. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:22,091][25689] Avg episode reward: [(0, '-48.395')] [2022-07-09 12:56:23,227][26022] Updated weights on worker 0-0, policy_version 258053 (0.00096) [2022-07-09 12:56:24,703][26022] Updated weights on worker 0-0, policy_version 258063 (0.00079) [2022-07-09 12:56:26,823][26022] Updated weights on worker 0-0, policy_version 258073 (0.00087) [2022-07-09 12:56:27,112][25689] Fps is (10 sec: 5606.1, 60 sec: 5643.9, 300 sec: 5651.2). Total num frames: 264267776. Throughput: 0: 5956.4. Samples: 264276582. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:27,113][25689] Avg episode reward: [(0, '-49.053')] [2022-07-09 12:56:28,255][26022] Updated weights on worker 0-0, policy_version 258083 (0.00097) [2022-07-09 12:56:30,492][26022] Updated weights on worker 0-0, policy_version 258093 (0.00090) [2022-07-09 12:56:32,028][26022] Updated weights on worker 0-0, policy_version 258103 (0.00089) [2022-07-09 12:56:32,237][25689] Fps is (10 sec: 5550.5, 60 sec: 5673.7, 300 sec: 5652.7). Total num frames: 264297472. Throughput: 0: 5081.7. Samples: 264293578. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:32,238][25689] Avg episode reward: [(0, '-49.504')] [2022-07-09 12:56:34,111][26022] Updated weights on worker 0-0, policy_version 258113 (0.00098) [2022-07-09 12:56:35,493][26022] Updated weights on worker 0-0, policy_version 258123 (0.00097) [2022-07-09 12:56:37,259][25689] Fps is (10 sec: 5651.7, 60 sec: 5655.4, 300 sec: 5649.1). Total num frames: 264325120. Throughput: 0: 5934.4. Samples: 264327982. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:37,259][25689] Avg episode reward: [(0, '-49.744')] [2022-07-09 12:56:37,540][26022] Updated weights on worker 0-0, policy_version 258133 (0.00089) [2022-07-09 12:56:39,332][26022] Updated weights on worker 0-0, policy_version 258143 (0.00088) [2022-07-09 12:56:41,084][26022] Updated weights on worker 0-0, policy_version 258153 (0.00094) [2022-07-09 12:56:42,270][25689] Fps is (10 sec: 5715.6, 60 sec: 5655.4, 300 sec: 5653.5). Total num frames: 264354816. Throughput: 0: 5933.1. Samples: 264362186. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:42,272][25689] Avg episode reward: [(0, '-50.684')] [2022-07-09 12:56:42,837][26022] Updated weights on worker 0-0, policy_version 258163 (0.00082) [2022-07-09 12:56:44,897][26022] Updated weights on worker 0-0, policy_version 258173 (0.00093) [2022-07-09 12:56:46,344][26022] Updated weights on worker 0-0, policy_version 258183 (0.00089) [2022-07-09 12:56:47,370][25689] Fps is (10 sec: 5873.8, 60 sec: 5680.1, 300 sec: 5657.6). Total num frames: 264384512. Throughput: 0: 5064.7. Samples: 264379182. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:47,371][25689] Avg episode reward: [(0, '-49.920')] [2022-07-09 12:56:48,476][26022] Updated weights on worker 0-0, policy_version 258193 (0.00088) [2022-07-09 12:56:49,903][26022] Updated weights on worker 0-0, policy_version 258203 (0.00087) [2022-07-09 12:56:52,057][26022] Updated weights on worker 0-0, policy_version 258213 (0.00047) [2022-07-09 12:56:52,420][25689] Fps is (10 sec: 5649.8, 60 sec: 5632.8, 300 sec: 5651.6). Total num frames: 264412160. Throughput: 0: 5939.7. Samples: 264413456. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:52,421][25689] Avg episode reward: [(0, '-50.440')] [2022-07-09 12:56:53,388][26022] Updated weights on worker 0-0, policy_version 258223 (0.00101) [2022-07-09 12:56:55,461][26022] Updated weights on worker 0-0, policy_version 258233 (0.00088) [2022-07-09 12:56:56,878][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:56:56,890][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000258241_264438784.pth [2022-07-09 12:56:56,890][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000256251_262401024.pth [2022-07-09 12:56:57,111][26022] Updated weights on worker 0-0, policy_version 258243 (0.00085) [2022-07-09 12:56:57,461][25689] Fps is (10 sec: 5682.8, 60 sec: 5668.7, 300 sec: 5652.6). Total num frames: 264441856. Throughput: 0: 5943.8. Samples: 264448060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:56:57,467][25689] Avg episode reward: [(0, '-50.920')] [2022-07-09 12:56:58,859][26022] Updated weights on worker 0-0, policy_version 258253 (0.00083) [2022-07-09 12:57:00,791][26022] Updated weights on worker 0-0, policy_version 258263 (0.00097) [2022-07-09 12:57:02,508][25689] Fps is (10 sec: 5582.9, 60 sec: 5667.7, 300 sec: 5652.3). Total num frames: 264468480. Throughput: 0: 5093.7. Samples: 264465270. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:02,509][25689] Avg episode reward: [(0, '-50.749')] [2022-07-09 12:57:02,876][26022] Updated weights on worker 0-0, policy_version 258273 (0.00090) [2022-07-09 12:57:04,648][26022] Updated weights on worker 0-0, policy_version 258283 (0.00089) [2022-07-09 12:57:06,637][26022] Updated weights on worker 0-0, policy_version 258293 (0.00092) [2022-07-09 12:57:07,586][25689] Fps is (10 sec: 5562.0, 60 sec: 5678.1, 300 sec: 5653.2). Total num frames: 264498176. Throughput: 0: 5858.8. Samples: 264497628. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:07,587][25689] Avg episode reward: [(0, '-50.446')] [2022-07-09 12:57:08,220][26022] Updated weights on worker 0-0, policy_version 258303 (0.00089) [2022-07-09 12:57:10,162][26022] Updated weights on worker 0-0, policy_version 258313 (0.00088) [2022-07-09 12:57:11,877][26022] Updated weights on worker 0-0, policy_version 258323 (0.00094) [2022-07-09 12:57:12,706][25689] Fps is (10 sec: 5723.7, 60 sec: 5672.1, 300 sec: 5656.0). Total num frames: 264526848. Throughput: 0: 5847.0. Samples: 264532064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:12,706][25689] Avg episode reward: [(0, '-50.291')] [2022-07-09 12:57:13,807][26022] Updated weights on worker 0-0, policy_version 258333 (0.00088) [2022-07-09 12:57:15,457][26022] Updated weights on worker 0-0, policy_version 258343 (0.00083) [2022-07-09 12:57:17,291][26022] Updated weights on worker 0-0, policy_version 258353 (0.00081) [2022-07-09 12:57:17,744][25689] Fps is (10 sec: 5645.2, 60 sec: 5670.3, 300 sec: 5656.4). Total num frames: 264555520. Throughput: 0: 4989.1. Samples: 264549258. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:17,745][25689] Avg episode reward: [(0, '-49.339')] [2022-07-09 12:57:19,068][26022] Updated weights on worker 0-0, policy_version 258363 (0.00091) [2022-07-09 12:57:20,772][26022] Updated weights on worker 0-0, policy_version 258373 (0.00086) [2022-07-09 12:57:22,735][26022] Updated weights on worker 0-0, policy_version 258383 (0.00088) [2022-07-09 12:57:22,789][25689] Fps is (10 sec: 5788.6, 60 sec: 5668.5, 300 sec: 5656.3). Total num frames: 264585216. Throughput: 0: 5829.5. Samples: 264583496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:22,789][25689] Avg episode reward: [(0, '-48.648')] [2022-07-09 12:57:24,513][26022] Updated weights on worker 0-0, policy_version 258393 (0.00095) [2022-07-09 12:57:26,291][26022] Updated weights on worker 0-0, policy_version 258403 (0.00092) [2022-07-09 12:57:27,795][25689] Fps is (10 sec: 5603.8, 60 sec: 5669.9, 300 sec: 5651.3). Total num frames: 264611840. Throughput: 0: 5948.0. Samples: 264617828. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:27,795][25689] Avg episode reward: [(0, '-47.731')] [2022-07-09 12:57:28,110][26022] Updated weights on worker 0-0, policy_version 258413 (0.00093) [2022-07-09 12:57:29,666][26022] Updated weights on worker 0-0, policy_version 258423 (0.00103) [2022-07-09 12:57:31,621][26022] Updated weights on worker 0-0, policy_version 258433 (0.00092) [2022-07-09 12:57:32,895][25689] Fps is (10 sec: 5775.6, 60 sec: 5706.0, 300 sec: 5670.4). Total num frames: 264643584. Throughput: 0: 5095.8. Samples: 264634944. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:32,895][25689] Avg episode reward: [(0, '-47.601')] [2022-07-09 12:57:33,523][26022] Updated weights on worker 0-0, policy_version 258443 (0.00088) [2022-07-09 12:57:35,183][26022] Updated weights on worker 0-0, policy_version 258453 (0.00085) [2022-07-09 12:57:37,138][26022] Updated weights on worker 0-0, policy_version 258463 (0.00095) [2022-07-09 12:57:37,957][25689] Fps is (10 sec: 5844.1, 60 sec: 5702.1, 300 sec: 5659.2). Total num frames: 264671232. Throughput: 0: 5943.7. Samples: 264669398. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:37,958][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 12:57:38,791][26022] Updated weights on worker 0-0, policy_version 258473 (0.00096) [2022-07-09 12:57:40,679][26022] Updated weights on worker 0-0, policy_version 258483 (0.00089) [2022-07-09 12:57:42,248][26022] Updated weights on worker 0-0, policy_version 258493 (0.00085) [2022-07-09 12:57:42,965][25689] Fps is (10 sec: 5491.1, 60 sec: 5668.8, 300 sec: 5655.7). Total num frames: 264698880. Throughput: 0: 5939.3. Samples: 264703328. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:42,965][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 12:57:44,268][26022] Updated weights on worker 0-0, policy_version 258503 (0.00091) [2022-07-09 12:57:46,003][26022] Updated weights on worker 0-0, policy_version 258513 (0.00082) [2022-07-09 12:57:47,884][26022] Updated weights on worker 0-0, policy_version 258523 (0.00087) [2022-07-09 12:57:47,969][25689] Fps is (10 sec: 5625.3, 60 sec: 5660.8, 300 sec: 5660.3). Total num frames: 264727552. Throughput: 0: 5909.4. Samples: 264737048. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:47,970][25689] Avg episode reward: [(0, '-49.491')] [2022-07-09 12:57:49,753][26022] Updated weights on worker 0-0, policy_version 258533 (0.00086) [2022-07-09 12:57:51,442][26022] Updated weights on worker 0-0, policy_version 258543 (0.00082) [2022-07-09 12:57:53,067][25689] Fps is (10 sec: 5676.7, 60 sec: 5673.3, 300 sec: 5658.6). Total num frames: 264756224. Throughput: 0: 5907.6. Samples: 264754110. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:53,068][25689] Avg episode reward: [(0, '-49.347')] [2022-07-09 12:57:53,363][26022] Updated weights on worker 0-0, policy_version 258553 (0.00080) [2022-07-09 12:57:55,119][26022] Updated weights on worker 0-0, policy_version 258563 (0.00090) [2022-07-09 12:57:57,046][26022] Updated weights on worker 0-0, policy_version 258573 (0.00086) [2022-07-09 12:57:58,115][25689] Fps is (10 sec: 5652.0, 60 sec: 5655.7, 300 sec: 5657.8). Total num frames: 264784896. Throughput: 0: 5890.9. Samples: 264788144. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:57:58,120][25689] Avg episode reward: [(0, '-50.034')] [2022-07-09 12:57:58,763][26022] Updated weights on worker 0-0, policy_version 258583 (0.00077) [2022-07-09 12:58:00,650][26022] Updated weights on worker 0-0, policy_version 258593 (0.00097) [2022-07-09 12:58:02,683][26022] Updated weights on worker 0-0, policy_version 258603 (0.00091) [2022-07-09 12:58:03,135][25689] Fps is (10 sec: 5390.4, 60 sec: 5641.3, 300 sec: 5654.1). Total num frames: 264810496. Throughput: 0: 5787.8. Samples: 264820068. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:03,135][25689] Avg episode reward: [(0, '-51.192')] [2022-07-09 12:58:04,572][26022] Updated weights on worker 0-0, policy_version 258613 (0.00080) [2022-07-09 12:58:06,416][26022] Updated weights on worker 0-0, policy_version 258623 (0.00089) [2022-07-09 12:58:08,150][25689] Fps is (10 sec: 5408.1, 60 sec: 5630.3, 300 sec: 5658.5). Total num frames: 264839168. Throughput: 0: 4947.6. Samples: 264836896. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:08,152][25689] Avg episode reward: [(0, '-51.407')] [2022-07-09 12:58:08,175][26022] Updated weights on worker 0-0, policy_version 258633 (0.00089) [2022-07-09 12:58:10,089][26022] Updated weights on worker 0-0, policy_version 258643 (0.00084) [2022-07-09 12:58:11,794][26022] Updated weights on worker 0-0, policy_version 258653 (0.00086) [2022-07-09 12:58:13,193][25689] Fps is (10 sec: 5701.1, 60 sec: 5637.4, 300 sec: 5654.3). Total num frames: 264867840. Throughput: 0: 5808.7. Samples: 264871020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:13,194][25689] Avg episode reward: [(0, '-52.249')] [2022-07-09 12:58:13,600][26022] Updated weights on worker 0-0, policy_version 258663 (0.00081) [2022-07-09 12:58:15,472][26022] Updated weights on worker 0-0, policy_version 258673 (0.00089) [2022-07-09 12:58:17,211][26022] Updated weights on worker 0-0, policy_version 258683 (0.00093) [2022-07-09 12:58:18,203][25689] Fps is (10 sec: 5704.3, 60 sec: 5640.1, 300 sec: 5654.3). Total num frames: 264896512. Throughput: 0: 5841.0. Samples: 264905478. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:18,204][25689] Avg episode reward: [(0, '-51.562')] [2022-07-09 12:58:19,113][26022] Updated weights on worker 0-0, policy_version 258693 (0.00106) [2022-07-09 12:58:20,603][26022] Updated weights on worker 0-0, policy_version 258703 (0.00095) [2022-07-09 12:58:22,517][26022] Updated weights on worker 0-0, policy_version 258713 (0.00085) [2022-07-09 12:58:23,235][25689] Fps is (10 sec: 5812.3, 60 sec: 5641.2, 300 sec: 5660.8). Total num frames: 264926208. Throughput: 0: 5096.7. Samples: 264922514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:23,236][25689] Avg episode reward: [(0, '-51.625')] [2022-07-09 12:58:24,492][26022] Updated weights on worker 0-0, policy_version 258723 (0.00084) [2022-07-09 12:58:26,176][26022] Updated weights on worker 0-0, policy_version 258733 (0.00091) [2022-07-09 12:58:28,090][26022] Updated weights on worker 0-0, policy_version 258743 (0.00098) [2022-07-09 12:58:28,247][25689] Fps is (10 sec: 5709.5, 60 sec: 5657.7, 300 sec: 5658.8). Total num frames: 264953856. Throughput: 0: 5959.5. Samples: 264956660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:28,247][25689] Avg episode reward: [(0, '-50.319')] [2022-07-09 12:58:29,495][26022] Updated weights on worker 0-0, policy_version 258753 (0.00081) [2022-07-09 12:58:31,743][26022] Updated weights on worker 0-0, policy_version 258763 (0.00094) [2022-07-09 12:58:33,274][26022] Updated weights on worker 0-0, policy_version 258773 (0.00088) [2022-07-09 12:58:33,380][25689] Fps is (10 sec: 5652.6, 60 sec: 5620.7, 300 sec: 5661.3). Total num frames: 264983552. Throughput: 0: 5923.9. Samples: 264990606. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 12:58:33,380][25689] Avg episode reward: [(0, '-50.987')] [2022-07-09 12:58:35,278][26022] Updated weights on worker 0-0, policy_version 258783 (0.00092) [2022-07-09 12:58:36,933][26022] Updated weights on worker 0-0, policy_version 258793 (0.00092) [2022-07-09 12:58:38,411][25689] Fps is (10 sec: 5641.7, 60 sec: 5623.6, 300 sec: 5657.3). Total num frames: 265011200. Throughput: 0: 5057.6. Samples: 265007682. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:58:38,411][25689] Avg episode reward: [(0, '-50.813')] [2022-07-09 12:58:38,835][26022] Updated weights on worker 0-0, policy_version 258803 (0.00085) [2022-07-09 12:58:40,543][26022] Updated weights on worker 0-0, policy_version 258813 (0.00093) [2022-07-09 12:58:42,491][26022] Updated weights on worker 0-0, policy_version 258823 (0.00087) [2022-07-09 12:58:43,418][25689] Fps is (10 sec: 5610.5, 60 sec: 5640.6, 300 sec: 5653.8). Total num frames: 265039872. Throughput: 0: 5924.5. Samples: 265042088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:58:43,419][25689] Avg episode reward: [(0, '-51.218')] [2022-07-09 12:58:44,211][26022] Updated weights on worker 0-0, policy_version 258833 (0.00087) [2022-07-09 12:58:46,001][26022] Updated weights on worker 0-0, policy_version 258843 (0.00093) [2022-07-09 12:58:47,909][26022] Updated weights on worker 0-0, policy_version 258853 (0.00088) [2022-07-09 12:58:48,496][25689] Fps is (10 sec: 5686.0, 60 sec: 5633.7, 300 sec: 5657.9). Total num frames: 265068544. Throughput: 0: 5901.2. Samples: 265076156. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:58:48,497][25689] Avg episode reward: [(0, '-51.118')] [2022-07-09 12:58:49,557][26022] Updated weights on worker 0-0, policy_version 258863 (0.00091) [2022-07-09 12:58:51,655][26022] Updated weights on worker 0-0, policy_version 258873 (0.00094) [2022-07-09 12:58:53,215][26022] Updated weights on worker 0-0, policy_version 258883 (0.00093) [2022-07-09 12:58:53,562][25689] Fps is (10 sec: 5552.0, 60 sec: 5619.7, 300 sec: 5656.9). Total num frames: 265096192. Throughput: 0: 5065.8. Samples: 265092846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:58:53,563][25689] Avg episode reward: [(0, '-51.361')] [2022-07-09 12:58:55,218][26022] Updated weights on worker 0-0, policy_version 258893 (0.00082) [2022-07-09 12:58:56,875][26022] Updated weights on worker 0-0, policy_version 258903 (0.00092) [2022-07-09 12:58:57,041][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 12:58:57,047][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000258904_265117696.pth [2022-07-09 12:58:57,048][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000256913_263078912.pth [2022-07-09 12:58:58,609][25689] Fps is (10 sec: 5569.1, 60 sec: 5619.9, 300 sec: 5656.3). Total num frames: 265124864. Throughput: 0: 5894.9. Samples: 265126748. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:58:58,610][25689] Avg episode reward: [(0, '-51.740')] [2022-07-09 12:58:58,813][26022] Updated weights on worker 0-0, policy_version 258913 (0.00096) [2022-07-09 12:59:00,534][26022] Updated weights on worker 0-0, policy_version 258923 (0.00082) [2022-07-09 12:59:02,781][26022] Updated weights on worker 0-0, policy_version 258933 (0.00084) [2022-07-09 12:59:03,625][25689] Fps is (10 sec: 5495.0, 60 sec: 5637.1, 300 sec: 5653.4). Total num frames: 265151488. Throughput: 0: 5781.7. Samples: 265158918. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:03,626][25689] Avg episode reward: [(0, '-51.238')] [2022-07-09 12:59:04,472][26022] Updated weights on worker 0-0, policy_version 258943 (0.00089) [2022-07-09 12:59:06,406][26022] Updated weights on worker 0-0, policy_version 258953 (0.00092) [2022-07-09 12:59:07,909][26022] Updated weights on worker 0-0, policy_version 258963 (0.00080) [2022-07-09 12:59:08,654][25689] Fps is (10 sec: 5606.7, 60 sec: 5652.8, 300 sec: 5665.4). Total num frames: 265181184. Throughput: 0: 4939.3. Samples: 265175722. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:08,655][25689] Avg episode reward: [(0, '-50.566')] [2022-07-09 12:59:10,012][26022] Updated weights on worker 0-0, policy_version 258973 (0.00088) [2022-07-09 12:59:11,567][26022] Updated weights on worker 0-0, policy_version 258983 (0.00098) [2022-07-09 12:59:13,688][26022] Updated weights on worker 0-0, policy_version 258993 (0.00086) [2022-07-09 12:59:13,713][25689] Fps is (10 sec: 5684.4, 60 sec: 5634.3, 300 sec: 5657.8). Total num frames: 265208832. Throughput: 0: 5823.8. Samples: 265210200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:13,714][25689] Avg episode reward: [(0, '-50.980')] [2022-07-09 12:59:15,361][26022] Updated weights on worker 0-0, policy_version 259003 (0.00090) [2022-07-09 12:59:17,180][26022] Updated weights on worker 0-0, policy_version 259013 (0.00080) [2022-07-09 12:59:18,719][25689] Fps is (10 sec: 5595.8, 60 sec: 5634.8, 300 sec: 5658.0). Total num frames: 265237504. Throughput: 0: 5850.9. Samples: 265244408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:18,719][25689] Avg episode reward: [(0, '-51.254')] [2022-07-09 12:59:18,907][26022] Updated weights on worker 0-0, policy_version 259023 (0.00088) [2022-07-09 12:59:20,919][26022] Updated weights on worker 0-0, policy_version 259033 (0.00092) [2022-07-09 12:59:22,443][26022] Updated weights on worker 0-0, policy_version 259043 (0.00084) [2022-07-09 12:59:23,732][25689] Fps is (10 sec: 5621.3, 60 sec: 5602.6, 300 sec: 5651.3). Total num frames: 265265152. Throughput: 0: 5099.2. Samples: 265261448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:23,733][25689] Avg episode reward: [(0, '-51.641')] [2022-07-09 12:59:24,381][26022] Updated weights on worker 0-0, policy_version 259053 (0.00088) [2022-07-09 12:59:26,003][26022] Updated weights on worker 0-0, policy_version 259063 (0.00857) [2022-07-09 12:59:27,996][26022] Updated weights on worker 0-0, policy_version 259073 (0.00088) [2022-07-09 12:59:28,755][25689] Fps is (10 sec: 5815.6, 60 sec: 5652.4, 300 sec: 5659.7). Total num frames: 265295872. Throughput: 0: 5976.4. Samples: 265295854. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:28,756][25689] Avg episode reward: [(0, '-51.542')] [2022-07-09 12:59:29,705][26022] Updated weights on worker 0-0, policy_version 259083 (0.00086) [2022-07-09 12:59:31,463][26022] Updated weights on worker 0-0, policy_version 259093 (0.00082) [2022-07-09 12:59:33,448][26022] Updated weights on worker 0-0, policy_version 259103 (0.00089) [2022-07-09 12:59:33,811][25689] Fps is (10 sec: 5893.0, 60 sec: 5642.7, 300 sec: 5655.7). Total num frames: 265324544. Throughput: 0: 5974.4. Samples: 265330270. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:33,811][25689] Avg episode reward: [(0, '-52.852')] [2022-07-09 12:59:34,909][26022] Updated weights on worker 0-0, policy_version 259113 (0.00087) [2022-07-09 12:59:36,860][26022] Updated weights on worker 0-0, policy_version 259123 (0.00089) [2022-07-09 12:59:38,683][26022] Updated weights on worker 0-0, policy_version 259133 (0.00090) [2022-07-09 12:59:38,816][25689] Fps is (10 sec: 5598.0, 60 sec: 5645.1, 300 sec: 5656.2). Total num frames: 265352192. Throughput: 0: 5129.8. Samples: 265347502. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:38,817][25689] Avg episode reward: [(0, '-51.458')] [2022-07-09 12:59:40,326][26022] Updated weights on worker 0-0, policy_version 259143 (0.00086) [2022-07-09 12:59:42,382][26022] Updated weights on worker 0-0, policy_version 259153 (0.00087) [2022-07-09 12:59:43,819][25689] Fps is (10 sec: 5627.6, 60 sec: 5645.5, 300 sec: 5656.8). Total num frames: 265380864. Throughput: 0: 5996.4. Samples: 265381892. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:43,819][25689] Avg episode reward: [(0, '-52.427')] [2022-07-09 12:59:43,959][26022] Updated weights on worker 0-0, policy_version 259163 (0.00101) [2022-07-09 12:59:45,693][26022] Updated weights on worker 0-0, policy_version 259173 (0.00092) [2022-07-09 12:59:47,805][26022] Updated weights on worker 0-0, policy_version 259183 (0.00090) [2022-07-09 12:59:48,848][25689] Fps is (10 sec: 5818.5, 60 sec: 5667.1, 300 sec: 5660.7). Total num frames: 265410560. Throughput: 0: 5988.7. Samples: 265416180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:48,848][25689] Avg episode reward: [(0, '-51.462')] [2022-07-09 12:59:49,315][26022] Updated weights on worker 0-0, policy_version 259193 (0.00088) [2022-07-09 12:59:51,262][26022] Updated weights on worker 0-0, policy_version 259203 (0.00087) [2022-07-09 12:59:53,100][26022] Updated weights on worker 0-0, policy_version 259213 (0.00092) [2022-07-09 12:59:53,906][25689] Fps is (10 sec: 5583.3, 60 sec: 5650.9, 300 sec: 5653.0). Total num frames: 265437184. Throughput: 0: 5117.4. Samples: 265433104. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:53,906][25689] Avg episode reward: [(0, '-50.220')] [2022-07-09 12:59:54,695][26022] Updated weights on worker 0-0, policy_version 259223 (0.00082) [2022-07-09 12:59:56,964][26022] Updated weights on worker 0-0, policy_version 259233 (0.00087) [2022-07-09 12:59:58,437][26022] Updated weights on worker 0-0, policy_version 259243 (0.00085) [2022-07-09 12:59:58,923][25689] Fps is (10 sec: 5691.6, 60 sec: 5687.6, 300 sec: 5659.6). Total num frames: 265467904. Throughput: 0: 5948.3. Samples: 265467102. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 12:59:58,923][25689] Avg episode reward: [(0, '-49.870')] [2022-07-09 13:00:00,398][26022] Updated weights on worker 0-0, policy_version 259253 (0.00093) [2022-07-09 13:00:02,299][26022] Updated weights on worker 0-0, policy_version 259263 (0.00081) [2022-07-09 13:00:03,953][25689] Fps is (10 sec: 5605.2, 60 sec: 5669.3, 300 sec: 5655.8). Total num frames: 265493504. Throughput: 0: 5818.1. Samples: 265499040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:03,954][25689] Avg episode reward: [(0, '-50.747')] [2022-07-09 13:00:04,372][26022] Updated weights on worker 0-0, policy_version 259273 (0.00095) [2022-07-09 13:00:06,031][26022] Updated weights on worker 0-0, policy_version 259283 (0.00084) [2022-07-09 13:00:07,936][26022] Updated weights on worker 0-0, policy_version 259293 (0.00087) [2022-07-09 13:00:08,967][25689] Fps is (10 sec: 5505.4, 60 sec: 5670.7, 300 sec: 5660.4). Total num frames: 265523200. Throughput: 0: 4968.5. Samples: 265516142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:08,967][25689] Avg episode reward: [(0, '-50.858')] [2022-07-09 13:00:09,602][26022] Updated weights on worker 0-0, policy_version 259303 (0.00092) [2022-07-09 13:00:11,607][26022] Updated weights on worker 0-0, policy_version 259313 (0.00096) [2022-07-09 13:00:13,127][26022] Updated weights on worker 0-0, policy_version 259323 (0.00090) [2022-07-09 13:00:14,099][25689] Fps is (10 sec: 5652.4, 60 sec: 5663.9, 300 sec: 5655.8). Total num frames: 265550848. Throughput: 0: 5798.3. Samples: 265550188. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:14,099][25689] Avg episode reward: [(0, '-50.080')] [2022-07-09 13:00:15,213][26022] Updated weights on worker 0-0, policy_version 259333 (0.00806) [2022-07-09 13:00:16,872][26022] Updated weights on worker 0-0, policy_version 259343 (0.00091) [2022-07-09 13:00:18,630][26022] Updated weights on worker 0-0, policy_version 259353 (0.00087) [2022-07-09 13:00:19,159][25689] Fps is (10 sec: 5626.4, 60 sec: 5675.7, 300 sec: 5655.6). Total num frames: 265580544. Throughput: 0: 5818.7. Samples: 265584848. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:19,159][25689] Avg episode reward: [(0, '-50.119')] [2022-07-09 13:00:20,300][26022] Updated weights on worker 0-0, policy_version 259363 (0.00086) [2022-07-09 13:00:22,367][26022] Updated weights on worker 0-0, policy_version 259373 (0.00087) [2022-07-09 13:00:23,964][26022] Updated weights on worker 0-0, policy_version 259383 (0.00086) [2022-07-09 13:00:24,228][25689] Fps is (10 sec: 5762.4, 60 sec: 5687.4, 300 sec: 5654.6). Total num frames: 265609216. Throughput: 0: 5076.8. Samples: 265601968. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:24,229][25689] Avg episode reward: [(0, '-49.932')] [2022-07-09 13:00:25,930][26022] Updated weights on worker 0-0, policy_version 259393 (0.00088) [2022-07-09 13:00:27,538][26022] Updated weights on worker 0-0, policy_version 259403 (0.00080) [2022-07-09 13:00:29,256][25689] Fps is (10 sec: 5679.1, 60 sec: 5653.1, 300 sec: 5659.1). Total num frames: 265637888. Throughput: 0: 5910.7. Samples: 265636066. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:29,257][25689] Avg episode reward: [(0, '-49.656')] [2022-07-09 13:00:29,409][26022] Updated weights on worker 0-0, policy_version 259413 (0.00089) [2022-07-09 13:00:31,227][26022] Updated weights on worker 0-0, policy_version 259423 (0.00086) [2022-07-09 13:00:33,011][26022] Updated weights on worker 0-0, policy_version 259433 (0.00094) [2022-07-09 13:00:34,320][25689] Fps is (10 sec: 5580.4, 60 sec: 5635.3, 300 sec: 5654.5). Total num frames: 265665536. Throughput: 0: 5929.8. Samples: 265670098. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:34,321][25689] Avg episode reward: [(0, '-49.442')] [2022-07-09 13:00:34,729][26022] Updated weights on worker 0-0, policy_version 259443 (0.00088) [2022-07-09 13:00:36,707][26022] Updated weights on worker 0-0, policy_version 259453 (0.00091) [2022-07-09 13:00:38,255][26022] Updated weights on worker 0-0, policy_version 259463 (0.00088) [2022-07-09 13:00:39,346][25689] Fps is (10 sec: 5683.2, 60 sec: 5667.3, 300 sec: 5654.3). Total num frames: 265695232. Throughput: 0: 5934.4. Samples: 265704648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:39,349][25689] Avg episode reward: [(0, '-49.933')] [2022-07-09 13:00:40,206][26022] Updated weights on worker 0-0, policy_version 259473 (0.00086) [2022-07-09 13:00:41,910][26022] Updated weights on worker 0-0, policy_version 259483 (0.00113) [2022-07-09 13:00:43,642][26022] Updated weights on worker 0-0, policy_version 259493 (0.00086) [2022-07-09 13:00:44,351][25689] Fps is (10 sec: 5819.1, 60 sec: 5667.1, 300 sec: 5657.6). Total num frames: 265723904. Throughput: 0: 5953.7. Samples: 265721774. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:44,351][25689] Avg episode reward: [(0, '-49.964')] [2022-07-09 13:00:45,612][26022] Updated weights on worker 0-0, policy_version 259503 (0.00083) [2022-07-09 13:00:47,458][26022] Updated weights on worker 0-0, policy_version 259513 (0.00339) [2022-07-09 13:00:49,005][26022] Updated weights on worker 0-0, policy_version 259523 (0.00059) [2022-07-09 13:00:49,359][25689] Fps is (10 sec: 5727.0, 60 sec: 5652.1, 300 sec: 5652.3). Total num frames: 265752576. Throughput: 0: 5972.4. Samples: 265756130. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:49,359][25689] Avg episode reward: [(0, '-49.506')] [2022-07-09 13:00:50,945][26022] Updated weights on worker 0-0, policy_version 259533 (0.00085) [2022-07-09 13:00:52,645][26022] Updated weights on worker 0-0, policy_version 259543 (0.00092) [2022-07-09 13:00:54,405][25689] Fps is (10 sec: 5601.7, 60 sec: 5670.2, 300 sec: 5652.6). Total num frames: 265780224. Throughput: 0: 5966.7. Samples: 265789936. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 13:00:54,405][25689] Avg episode reward: [(0, '-49.336')] [2022-07-09 13:00:54,775][26022] Updated weights on worker 0-0, policy_version 259553 (0.00090) [2022-07-09 13:00:56,470][26022] Updated weights on worker 0-0, policy_version 259563 (0.00089) [2022-07-09 13:00:57,176][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:00:57,185][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000259567_265796608.pth [2022-07-09 13:00:57,191][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000257577_263758848.pth [2022-07-09 13:00:58,312][26022] Updated weights on worker 0-0, policy_version 259573 (0.00090) [2022-07-09 13:00:59,412][25689] Fps is (10 sec: 5602.5, 60 sec: 5637.3, 300 sec: 5660.0). Total num frames: 265808896. Throughput: 0: 5095.5. Samples: 265806892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:00:59,412][25689] Avg episode reward: [(0, '-49.089')] [2022-07-09 13:00:59,974][26022] Updated weights on worker 0-0, policy_version 259583 (0.00081) [2022-07-09 13:01:02,173][26022] Updated weights on worker 0-0, policy_version 259593 (0.01042) [2022-07-09 13:01:03,946][26022] Updated weights on worker 0-0, policy_version 259603 (0.00090) [2022-07-09 13:01:04,427][25689] Fps is (10 sec: 5517.5, 60 sec: 5655.7, 300 sec: 5653.0). Total num frames: 265835520. Throughput: 0: 5845.9. Samples: 265839136. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:04,427][25689] Avg episode reward: [(0, '-48.042')] [2022-07-09 13:01:05,877][26022] Updated weights on worker 0-0, policy_version 259613 (0.00059) [2022-07-09 13:01:07,503][26022] Updated weights on worker 0-0, policy_version 259623 (0.00090) [2022-07-09 13:01:09,265][26022] Updated weights on worker 0-0, policy_version 259633 (0.00086) [2022-07-09 13:01:09,435][25689] Fps is (10 sec: 5516.6, 60 sec: 5639.2, 300 sec: 5653.9). Total num frames: 265864192. Throughput: 0: 5847.0. Samples: 265873516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:09,436][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 13:01:11,043][26022] Updated weights on worker 0-0, policy_version 259643 (0.00087) [2022-07-09 13:01:12,946][26022] Updated weights on worker 0-0, policy_version 259653 (0.00087) [2022-07-09 13:01:14,541][25689] Fps is (10 sec: 5669.6, 60 sec: 5658.6, 300 sec: 5652.3). Total num frames: 265892864. Throughput: 0: 5006.4. Samples: 265890746. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:14,541][25689] Avg episode reward: [(0, '-46.960')] [2022-07-09 13:01:14,671][26022] Updated weights on worker 0-0, policy_version 259663 (0.00099) [2022-07-09 13:01:16,628][26022] Updated weights on worker 0-0, policy_version 259673 (0.00092) [2022-07-09 13:01:18,271][26022] Updated weights on worker 0-0, policy_version 259683 (0.00088) [2022-07-09 13:01:19,593][25689] Fps is (10 sec: 5645.2, 60 sec: 5642.3, 300 sec: 5648.3). Total num frames: 265921536. Throughput: 0: 5867.6. Samples: 265925308. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:19,594][25689] Avg episode reward: [(0, '-47.179')] [2022-07-09 13:01:20,110][26022] Updated weights on worker 0-0, policy_version 259693 (0.00085) [2022-07-09 13:01:21,788][26022] Updated weights on worker 0-0, policy_version 259703 (0.00085) [2022-07-09 13:01:23,656][26022] Updated weights on worker 0-0, policy_version 259713 (0.00086) [2022-07-09 13:01:24,632][25689] Fps is (10 sec: 5784.2, 60 sec: 5662.1, 300 sec: 5658.3). Total num frames: 265951232. Throughput: 0: 5953.9. Samples: 265959436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:24,632][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 13:01:25,337][26022] Updated weights on worker 0-0, policy_version 259723 (0.00086) [2022-07-09 13:01:27,375][26022] Updated weights on worker 0-0, policy_version 259733 (0.00088) [2022-07-09 13:01:28,879][26022] Updated weights on worker 0-0, policy_version 259743 (0.00087) [2022-07-09 13:01:29,635][25689] Fps is (10 sec: 5812.3, 60 sec: 5664.4, 300 sec: 5657.2). Total num frames: 265979904. Throughput: 0: 5098.1. Samples: 265976498. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:29,636][25689] Avg episode reward: [(0, '-48.619')] [2022-07-09 13:01:31,096][26022] Updated weights on worker 0-0, policy_version 259753 (0.00084) [2022-07-09 13:01:32,562][26022] Updated weights on worker 0-0, policy_version 259763 (0.00104) [2022-07-09 13:01:34,620][26022] Updated weights on worker 0-0, policy_version 259773 (0.00085) [2022-07-09 13:01:34,704][25689] Fps is (10 sec: 5591.4, 60 sec: 5664.0, 300 sec: 5656.3). Total num frames: 266007552. Throughput: 0: 5946.1. Samples: 266010640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:34,705][25689] Avg episode reward: [(0, '-49.156')] [2022-07-09 13:01:36,194][26022] Updated weights on worker 0-0, policy_version 259783 (0.00086) [2022-07-09 13:01:38,085][26022] Updated weights on worker 0-0, policy_version 259793 (0.00099) [2022-07-09 13:01:39,715][25689] Fps is (10 sec: 5689.0, 60 sec: 5665.4, 300 sec: 5656.3). Total num frames: 266037248. Throughput: 0: 5943.3. Samples: 266044900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:39,716][25689] Avg episode reward: [(0, '-49.373')] [2022-07-09 13:01:39,828][26022] Updated weights on worker 0-0, policy_version 259803 (0.00089) [2022-07-09 13:01:41,734][26022] Updated weights on worker 0-0, policy_version 259813 (0.00092) [2022-07-09 13:01:43,505][26022] Updated weights on worker 0-0, policy_version 259823 (0.00089) [2022-07-09 13:01:44,779][25689] Fps is (10 sec: 5691.8, 60 sec: 5642.9, 300 sec: 5650.1). Total num frames: 266064896. Throughput: 0: 5093.6. Samples: 266062058. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:44,780][25689] Avg episode reward: [(0, '-49.088')] [2022-07-09 13:01:45,241][26022] Updated weights on worker 0-0, policy_version 259833 (0.00093) [2022-07-09 13:01:47,150][26022] Updated weights on worker 0-0, policy_version 259843 (0.00095) [2022-07-09 13:01:48,882][26022] Updated weights on worker 0-0, policy_version 259853 (0.00087) [2022-07-09 13:01:49,803][25689] Fps is (10 sec: 5684.6, 60 sec: 5658.4, 300 sec: 5657.5). Total num frames: 266094592. Throughput: 0: 5930.2. Samples: 266096096. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:49,803][25689] Avg episode reward: [(0, '-49.165')] [2022-07-09 13:01:50,719][26022] Updated weights on worker 0-0, policy_version 259863 (0.00089) [2022-07-09 13:01:52,406][26022] Updated weights on worker 0-0, policy_version 259873 (0.00090) [2022-07-09 13:01:54,580][26022] Updated weights on worker 0-0, policy_version 259883 (0.00098) [2022-07-09 13:01:54,866][25689] Fps is (10 sec: 5583.8, 60 sec: 5639.9, 300 sec: 5646.7). Total num frames: 266121216. Throughput: 0: 5937.2. Samples: 266130340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:54,866][25689] Avg episode reward: [(0, '-48.484')] [2022-07-09 13:01:56,032][26022] Updated weights on worker 0-0, policy_version 259893 (0.00092) [2022-07-09 13:01:58,066][26022] Updated weights on worker 0-0, policy_version 259903 (0.00085) [2022-07-09 13:01:59,596][26022] Updated weights on worker 0-0, policy_version 259913 (0.00091) [2022-07-09 13:01:59,878][25689] Fps is (10 sec: 5691.7, 60 sec: 5673.3, 300 sec: 5661.2). Total num frames: 266151936. Throughput: 0: 5072.3. Samples: 266147170. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:01:59,878][25689] Avg episode reward: [(0, '-48.655')] [2022-07-09 13:02:01,708][26022] Updated weights on worker 0-0, policy_version 259923 (0.00086) [2022-07-09 13:02:03,552][26022] Updated weights on worker 0-0, policy_version 259933 (0.00086) [2022-07-09 13:02:04,910][25689] Fps is (10 sec: 5606.9, 60 sec: 5654.7, 300 sec: 5648.3). Total num frames: 266177536. Throughput: 0: 5819.6. Samples: 266179214. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:04,911][25689] Avg episode reward: [(0, '-48.784')] [2022-07-09 13:02:05,447][26022] Updated weights on worker 0-0, policy_version 259943 (0.00087) [2022-07-09 13:02:07,233][26022] Updated weights on worker 0-0, policy_version 259953 (0.00092) [2022-07-09 13:02:09,190][26022] Updated weights on worker 0-0, policy_version 259963 (0.00094) [2022-07-09 13:02:09,959][25689] Fps is (10 sec: 5383.7, 60 sec: 5651.0, 300 sec: 5649.6). Total num frames: 266206208. Throughput: 0: 5836.9. Samples: 266213744. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:09,959][25689] Avg episode reward: [(0, '-49.045')] [2022-07-09 13:02:10,706][26022] Updated weights on worker 0-0, policy_version 259973 (0.00091) [2022-07-09 13:02:12,697][26022] Updated weights on worker 0-0, policy_version 259983 (0.00094) [2022-07-09 13:02:14,334][26022] Updated weights on worker 0-0, policy_version 259993 (0.00082) [2022-07-09 13:02:15,021][25689] Fps is (10 sec: 5773.2, 60 sec: 5672.0, 300 sec: 5652.6). Total num frames: 266235904. Throughput: 0: 4997.7. Samples: 266231068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:15,021][25689] Avg episode reward: [(0, '-49.612')] [2022-07-09 13:02:16,088][26022] Updated weights on worker 0-0, policy_version 260003 (0.00085) [2022-07-09 13:02:18,100][26022] Updated weights on worker 0-0, policy_version 260013 (0.00093) [2022-07-09 13:02:19,757][26022] Updated weights on worker 0-0, policy_version 260023 (0.00079) [2022-07-09 13:02:20,070][25689] Fps is (10 sec: 5873.7, 60 sec: 5689.2, 300 sec: 5652.5). Total num frames: 266265600. Throughput: 0: 5855.4. Samples: 266265404. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:20,071][25689] Avg episode reward: [(0, '-49.782')] [2022-07-09 13:02:21,491][26022] Updated weights on worker 0-0, policy_version 260033 (0.00089) [2022-07-09 13:02:23,387][26022] Updated weights on worker 0-0, policy_version 260043 (0.00084) [2022-07-09 13:02:25,065][26022] Updated weights on worker 0-0, policy_version 260053 (0.00091) [2022-07-09 13:02:25,162][25689] Fps is (10 sec: 5755.6, 60 sec: 5667.3, 300 sec: 5657.8). Total num frames: 266294272. Throughput: 0: 5959.4. Samples: 266299900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:25,162][25689] Avg episode reward: [(0, '-49.740')] [2022-07-09 13:02:27,047][26022] Updated weights on worker 0-0, policy_version 260063 (0.00083) [2022-07-09 13:02:28,867][26022] Updated weights on worker 0-0, policy_version 260073 (0.00088) [2022-07-09 13:02:30,171][25689] Fps is (10 sec: 5575.7, 60 sec: 5649.8, 300 sec: 5645.7). Total num frames: 266321920. Throughput: 0: 5106.9. Samples: 266316966. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:30,172][25689] Avg episode reward: [(0, '-49.626')] [2022-07-09 13:02:30,612][26022] Updated weights on worker 0-0, policy_version 260083 (0.00088) [2022-07-09 13:02:32,488][26022] Updated weights on worker 0-0, policy_version 260093 (0.00088) [2022-07-09 13:02:34,143][26022] Updated weights on worker 0-0, policy_version 260103 (0.00083) [2022-07-09 13:02:35,287][25689] Fps is (10 sec: 5562.0, 60 sec: 5662.3, 300 sec: 5648.1). Total num frames: 266350592. Throughput: 0: 5918.1. Samples: 266351010. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:35,288][25689] Avg episode reward: [(0, '-49.397')] [2022-07-09 13:02:35,982][26022] Updated weights on worker 0-0, policy_version 260113 (0.00092) [2022-07-09 13:02:37,823][26022] Updated weights on worker 0-0, policy_version 260123 (0.00090) [2022-07-09 13:02:39,629][26022] Updated weights on worker 0-0, policy_version 260133 (0.00090) [2022-07-09 13:02:40,361][25689] Fps is (10 sec: 5828.5, 60 sec: 5673.3, 300 sec: 5657.2). Total num frames: 266381312. Throughput: 0: 5909.4. Samples: 266385310. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:40,371][25689] Avg episode reward: [(0, '-48.108')] [2022-07-09 13:02:41,430][26022] Updated weights on worker 0-0, policy_version 260143 (0.00087) [2022-07-09 13:02:43,061][26022] Updated weights on worker 0-0, policy_version 260153 (0.00088) [2022-07-09 13:02:44,928][26022] Updated weights on worker 0-0, policy_version 260163 (0.00106) [2022-07-09 13:02:45,399][25689] Fps is (10 sec: 5772.5, 60 sec: 5675.8, 300 sec: 5653.1). Total num frames: 266408960. Throughput: 0: 5073.6. Samples: 266402576. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:45,401][25689] Avg episode reward: [(0, '-48.190')] [2022-07-09 13:02:46,548][26022] Updated weights on worker 0-0, policy_version 260173 (0.00083) [2022-07-09 13:02:48,494][26022] Updated weights on worker 0-0, policy_version 260183 (0.00082) [2022-07-09 13:02:50,381][26022] Updated weights on worker 0-0, policy_version 260193 (0.00088) [2022-07-09 13:02:50,480][25689] Fps is (10 sec: 5565.8, 60 sec: 5653.5, 300 sec: 5653.4). Total num frames: 266437632. Throughput: 0: 5896.0. Samples: 266436706. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:50,480][25689] Avg episode reward: [(0, '-48.916')] [2022-07-09 13:02:52,358][26022] Updated weights on worker 0-0, policy_version 260203 (0.00094) [2022-07-09 13:02:53,988][26022] Updated weights on worker 0-0, policy_version 260213 (0.00087) [2022-07-09 13:02:55,573][25689] Fps is (10 sec: 5737.1, 60 sec: 5701.4, 300 sec: 5656.0). Total num frames: 266467328. Throughput: 0: 5901.8. Samples: 266470728. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:02:55,573][25689] Avg episode reward: [(0, '-47.623')] [2022-07-09 13:02:55,822][26022] Updated weights on worker 0-0, policy_version 260223 (0.00081) [2022-07-09 13:02:57,325][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:02:57,342][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000260232_266477568.pth [2022-07-09 13:02:57,342][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000258241_264438784.pth [2022-07-09 13:02:57,676][26022] Updated weights on worker 0-0, policy_version 260233 (0.00093) [2022-07-09 13:02:59,416][26022] Updated weights on worker 0-0, policy_version 260243 (0.00090) [2022-07-09 13:03:00,639][25689] Fps is (10 sec: 5644.6, 60 sec: 5645.7, 300 sec: 5662.0). Total num frames: 266494976. Throughput: 0: 5871.4. Samples: 266504370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:03:00,644][25689] Avg episode reward: [(0, '-47.912')] [2022-07-09 13:03:01,311][26022] Updated weights on worker 0-0, policy_version 260253 (0.00093) [2022-07-09 13:03:03,329][26022] Updated weights on worker 0-0, policy_version 260263 (0.00505) [2022-07-09 13:03:05,324][26022] Updated weights on worker 0-0, policy_version 260273 (0.00082) [2022-07-09 13:03:05,713][25689] Fps is (10 sec: 5352.2, 60 sec: 5658.7, 300 sec: 5654.0). Total num frames: 266521600. Throughput: 0: 5750.2. Samples: 266519384. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:03:05,713][25689] Avg episode reward: [(0, '-48.927')] [2022-07-09 13:03:06,765][26022] Updated weights on worker 0-0, policy_version 260283 (0.00083) [2022-07-09 13:03:08,878][26022] Updated weights on worker 0-0, policy_version 260293 (0.00089) [2022-07-09 13:03:10,371][26022] Updated weights on worker 0-0, policy_version 260303 (0.00090) [2022-07-09 13:03:10,781][25689] Fps is (10 sec: 5553.1, 60 sec: 5673.7, 300 sec: 5657.0). Total num frames: 266551296. Throughput: 0: 5768.5. Samples: 266553814. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:03:10,782][25689] Avg episode reward: [(0, '-49.173')] [2022-07-09 13:03:12,394][26022] Updated weights on worker 0-0, policy_version 260313 (0.00084) [2022-07-09 13:03:14,188][26022] Updated weights on worker 0-0, policy_version 260323 (0.01029) [2022-07-09 13:03:15,888][25689] Fps is (10 sec: 5736.0, 60 sec: 5652.6, 300 sec: 5655.2). Total num frames: 266579968. Throughput: 0: 5794.7. Samples: 266588452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 13:03:15,889][25689] Avg episode reward: [(0, '-48.794')] [2022-07-09 13:03:16,018][26022] Updated weights on worker 0-0, policy_version 260333 (0.00075) [2022-07-09 13:03:17,624][26022] Updated weights on worker 0-0, policy_version 260343 (0.00083) [2022-07-09 13:03:19,497][26022] Updated weights on worker 0-0, policy_version 260353 (0.00091) [2022-07-09 13:03:20,909][25689] Fps is (10 sec: 5762.9, 60 sec: 5655.3, 300 sec: 5655.4). Total num frames: 266609664. Throughput: 0: 5000.0. Samples: 266605722. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:20,910][25689] Avg episode reward: [(0, '-48.947')] [2022-07-09 13:03:21,297][26022] Updated weights on worker 0-0, policy_version 260363 (0.00088) [2022-07-09 13:03:22,922][26022] Updated weights on worker 0-0, policy_version 260373 (0.00088) [2022-07-09 13:03:24,990][26022] Updated weights on worker 0-0, policy_version 260383 (0.00095) [2022-07-09 13:03:25,927][25689] Fps is (10 sec: 5610.5, 60 sec: 5628.5, 300 sec: 5651.8). Total num frames: 266636288. Throughput: 0: 5965.9. Samples: 266639978. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:25,927][25689] Avg episode reward: [(0, '-49.132')] [2022-07-09 13:03:26,557][26022] Updated weights on worker 0-0, policy_version 260393 (0.00085) [2022-07-09 13:03:28,499][26022] Updated weights on worker 0-0, policy_version 260403 (0.00607) [2022-07-09 13:03:30,266][26022] Updated weights on worker 0-0, policy_version 260413 (0.00089) [2022-07-09 13:03:30,941][25689] Fps is (10 sec: 5614.2, 60 sec: 5661.7, 300 sec: 5654.1). Total num frames: 266665984. Throughput: 0: 5958.5. Samples: 266673938. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:30,942][25689] Avg episode reward: [(0, '-49.363')] [2022-07-09 13:03:32,006][26022] Updated weights on worker 0-0, policy_version 260423 (0.00090) [2022-07-09 13:03:33,982][26022] Updated weights on worker 0-0, policy_version 260433 (0.00086) [2022-07-09 13:03:35,627][26022] Updated weights on worker 0-0, policy_version 260443 (0.00090) [2022-07-09 13:03:35,983][25689] Fps is (10 sec: 5803.8, 60 sec: 5668.6, 300 sec: 5657.3). Total num frames: 266694656. Throughput: 0: 5104.3. Samples: 266691024. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:35,984][25689] Avg episode reward: [(0, '-48.922')] [2022-07-09 13:03:37,408][26022] Updated weights on worker 0-0, policy_version 260453 (0.00085) [2022-07-09 13:03:39,243][26022] Updated weights on worker 0-0, policy_version 260463 (0.00083) [2022-07-09 13:03:40,856][26022] Updated weights on worker 0-0, policy_version 260473 (0.00094) [2022-07-09 13:03:40,996][25689] Fps is (10 sec: 5906.7, 60 sec: 5674.3, 300 sec: 5664.1). Total num frames: 266725376. Throughput: 0: 5980.8. Samples: 266725858. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:40,997][25689] Avg episode reward: [(0, '-49.517')] [2022-07-09 13:03:42,877][26022] Updated weights on worker 0-0, policy_version 260483 (0.00083) [2022-07-09 13:03:44,503][26022] Updated weights on worker 0-0, policy_version 260493 (0.00084) [2022-07-09 13:03:46,015][25689] Fps is (10 sec: 5818.3, 60 sec: 5676.1, 300 sec: 5661.8). Total num frames: 266753024. Throughput: 0: 5993.8. Samples: 266760384. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:46,016][25689] Avg episode reward: [(0, '-48.785')] [2022-07-09 13:03:46,422][26022] Updated weights on worker 0-0, policy_version 260503 (0.00090) [2022-07-09 13:03:48,155][26022] Updated weights on worker 0-0, policy_version 260513 (0.00083) [2022-07-09 13:03:49,788][26022] Updated weights on worker 0-0, policy_version 260523 (0.00087) [2022-07-09 13:03:51,017][25689] Fps is (10 sec: 5518.3, 60 sec: 5666.6, 300 sec: 5663.0). Total num frames: 266780672. Throughput: 0: 5170.9. Samples: 266777746. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:51,017][25689] Avg episode reward: [(0, '-48.797')] [2022-07-09 13:03:51,759][26022] Updated weights on worker 0-0, policy_version 260533 (0.00087) [2022-07-09 13:03:53,377][26022] Updated weights on worker 0-0, policy_version 260543 (0.00087) [2022-07-09 13:03:55,432][26022] Updated weights on worker 0-0, policy_version 260553 (0.00092) [2022-07-09 13:03:56,093][25689] Fps is (10 sec: 5791.7, 60 sec: 5685.1, 300 sec: 5669.3). Total num frames: 266811392. Throughput: 0: 6025.0. Samples: 266812184. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:03:56,093][25689] Avg episode reward: [(0, '-48.419')] [2022-07-09 13:03:57,196][26022] Updated weights on worker 0-0, policy_version 260563 (0.00095) [2022-07-09 13:03:58,947][26022] Updated weights on worker 0-0, policy_version 260573 (0.00085) [2022-07-09 13:04:00,625][26022] Updated weights on worker 0-0, policy_version 260583 (0.00085) [2022-07-09 13:04:01,103][25689] Fps is (10 sec: 5888.3, 60 sec: 5707.3, 300 sec: 5676.3). Total num frames: 266840064. Throughput: 0: 5990.3. Samples: 266846304. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:01,104][25689] Avg episode reward: [(0, '-48.562')] [2022-07-09 13:04:02,911][26022] Updated weights on worker 0-0, policy_version 260593 (0.00094) [2022-07-09 13:04:04,553][26022] Updated weights on worker 0-0, policy_version 260603 (0.00093) [2022-07-09 13:04:06,127][25689] Fps is (10 sec: 5408.8, 60 sec: 5695.1, 300 sec: 5662.6). Total num frames: 266865664. Throughput: 0: 5025.0. Samples: 266861446. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:06,128][25689] Avg episode reward: [(0, '-47.927')] [2022-07-09 13:04:06,323][26022] Updated weights on worker 0-0, policy_version 260613 (0.00080) [2022-07-09 13:04:07,953][26022] Updated weights on worker 0-0, policy_version 260623 (0.00080) [2022-07-09 13:04:10,040][26022] Updated weights on worker 0-0, policy_version 260633 (0.00106) [2022-07-09 13:04:11,138][25689] Fps is (10 sec: 5408.4, 60 sec: 5683.5, 300 sec: 5667.0). Total num frames: 266894336. Throughput: 0: 5883.8. Samples: 266896134. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:11,138][25689] Avg episode reward: [(0, '-47.671')] [2022-07-09 13:04:11,838][26022] Updated weights on worker 0-0, policy_version 260643 (0.00090) [2022-07-09 13:04:13,522][26022] Updated weights on worker 0-0, policy_version 260653 (0.00085) [2022-07-09 13:04:15,331][26022] Updated weights on worker 0-0, policy_version 260663 (0.00089) [2022-07-09 13:04:16,206][25689] Fps is (10 sec: 5791.3, 60 sec: 5704.2, 300 sec: 5669.3). Total num frames: 266924032. Throughput: 0: 5882.4. Samples: 266930494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:16,206][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 13:04:17,159][26022] Updated weights on worker 0-0, policy_version 260673 (0.00098) [2022-07-09 13:04:18,749][26022] Updated weights on worker 0-0, policy_version 260683 (0.00091) [2022-07-09 13:04:20,732][26022] Updated weights on worker 0-0, policy_version 260693 (0.00094) [2022-07-09 13:04:21,267][25689] Fps is (10 sec: 5762.6, 60 sec: 5683.5, 300 sec: 5671.8). Total num frames: 266952704. Throughput: 0: 5027.6. Samples: 266947678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:21,267][25689] Avg episode reward: [(0, '-48.432')] [2022-07-09 13:04:22,457][26022] Updated weights on worker 0-0, policy_version 260703 (0.00085) [2022-07-09 13:04:24,271][26022] Updated weights on worker 0-0, policy_version 260713 (0.00093) [2022-07-09 13:04:26,130][26022] Updated weights on worker 0-0, policy_version 260723 (0.00088) [2022-07-09 13:04:26,301][25689] Fps is (10 sec: 5579.1, 60 sec: 5698.9, 300 sec: 5661.3). Total num frames: 266980352. Throughput: 0: 5986.5. Samples: 266982214. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:26,301][25689] Avg episode reward: [(0, '-49.218')] [2022-07-09 13:04:27,980][26022] Updated weights on worker 0-0, policy_version 260733 (0.00102) [2022-07-09 13:04:29,897][26022] Updated weights on worker 0-0, policy_version 260743 (0.00092) [2022-07-09 13:04:31,309][25689] Fps is (10 sec: 5608.1, 60 sec: 5682.5, 300 sec: 5662.1). Total num frames: 267009024. Throughput: 0: 5935.7. Samples: 267015866. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:31,310][25689] Avg episode reward: [(0, '-50.049')] [2022-07-09 13:04:31,584][26022] Updated weights on worker 0-0, policy_version 260753 (0.00086) [2022-07-09 13:04:33,163][26022] Updated weights on worker 0-0, policy_version 260763 (0.00084) [2022-07-09 13:04:35,352][26022] Updated weights on worker 0-0, policy_version 260773 (0.00088) [2022-07-09 13:04:36,460][25689] Fps is (10 sec: 5745.4, 60 sec: 5689.3, 300 sec: 5666.3). Total num frames: 267038720. Throughput: 0: 5059.6. Samples: 267032970. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:36,460][25689] Avg episode reward: [(0, '-49.763')] [2022-07-09 13:04:36,975][26022] Updated weights on worker 0-0, policy_version 260783 (0.00084) [2022-07-09 13:04:38,791][26022] Updated weights on worker 0-0, policy_version 260793 (0.00083) [2022-07-09 13:04:40,451][26022] Updated weights on worker 0-0, policy_version 260803 (0.00153) [2022-07-09 13:04:41,514][25689] Fps is (10 sec: 5619.2, 60 sec: 5634.6, 300 sec: 5661.8). Total num frames: 267066368. Throughput: 0: 5899.8. Samples: 267067134. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:41,515][25689] Avg episode reward: [(0, '-49.792')] [2022-07-09 13:04:42,280][26022] Updated weights on worker 0-0, policy_version 260813 (0.00088) [2022-07-09 13:04:44,242][26022] Updated weights on worker 0-0, policy_version 260823 (0.00083) [2022-07-09 13:04:46,043][26022] Updated weights on worker 0-0, policy_version 260833 (0.00087) [2022-07-09 13:04:46,520][25689] Fps is (10 sec: 5699.9, 60 sec: 5669.6, 300 sec: 5662.3). Total num frames: 267096064. Throughput: 0: 5889.9. Samples: 267101304. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:46,521][25689] Avg episode reward: [(0, '-49.752')] [2022-07-09 13:04:47,506][26022] Updated weights on worker 0-0, policy_version 260843 (0.00099) [2022-07-09 13:04:49,556][26022] Updated weights on worker 0-0, policy_version 260853 (0.00092) [2022-07-09 13:04:51,219][26022] Updated weights on worker 0-0, policy_version 260863 (0.00082) [2022-07-09 13:04:51,522][25689] Fps is (10 sec: 5832.1, 60 sec: 5686.5, 300 sec: 5670.2). Total num frames: 267124736. Throughput: 0: 5077.6. Samples: 267118500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:51,523][25689] Avg episode reward: [(0, '-49.810')] [2022-07-09 13:04:53,199][26022] Updated weights on worker 0-0, policy_version 260873 (0.00502) [2022-07-09 13:04:54,830][26022] Updated weights on worker 0-0, policy_version 260883 (0.00091) [2022-07-09 13:04:56,590][25689] Fps is (10 sec: 5592.9, 60 sec: 5636.5, 300 sec: 5658.9). Total num frames: 267152384. Throughput: 0: 5971.1. Samples: 267153170. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:04:56,590][25689] Avg episode reward: [(0, '-49.482')] [2022-07-09 13:04:56,744][26022] Updated weights on worker 0-0, policy_version 260893 (0.00093) [2022-07-09 13:04:57,424][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:04:57,435][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000260898_267159552.pth [2022-07-09 13:04:57,435][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000258904_265117696.pth [2022-07-09 13:04:58,549][26022] Updated weights on worker 0-0, policy_version 260903 (0.00086) [2022-07-09 13:05:00,504][26022] Updated weights on worker 0-0, policy_version 260913 (0.00094) [2022-07-09 13:05:01,600][25689] Fps is (10 sec: 5791.6, 60 sec: 5670.4, 300 sec: 5676.5). Total num frames: 267183104. Throughput: 0: 5954.9. Samples: 267186744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:01,601][25689] Avg episode reward: [(0, '-49.530')] [2022-07-09 13:05:02,471][26022] Updated weights on worker 0-0, policy_version 260923 (0.00086) [2022-07-09 13:05:04,406][26022] Updated weights on worker 0-0, policy_version 260933 (0.00091) [2022-07-09 13:05:05,967][26022] Updated weights on worker 0-0, policy_version 260943 (0.00557) [2022-07-09 13:05:06,691][25689] Fps is (10 sec: 5474.4, 60 sec: 5647.2, 300 sec: 5657.9). Total num frames: 267207680. Throughput: 0: 4979.3. Samples: 267201740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:06,693][25689] Avg episode reward: [(0, '-49.440')] [2022-07-09 13:05:07,924][26022] Updated weights on worker 0-0, policy_version 260953 (0.00094) [2022-07-09 13:05:09,554][26022] Updated weights on worker 0-0, policy_version 260963 (0.00086) [2022-07-09 13:05:11,455][26022] Updated weights on worker 0-0, policy_version 260973 (0.00084) [2022-07-09 13:05:11,718][25689] Fps is (10 sec: 5465.4, 60 sec: 5679.5, 300 sec: 5670.2). Total num frames: 267238400. Throughput: 0: 5817.8. Samples: 267235992. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:11,719][25689] Avg episode reward: [(0, '-49.155')] [2022-07-09 13:05:13,539][26022] Updated weights on worker 0-0, policy_version 260983 (0.00087) [2022-07-09 13:05:15,121][26022] Updated weights on worker 0-0, policy_version 260993 (0.00062) [2022-07-09 13:05:16,782][25689] Fps is (10 sec: 5784.0, 60 sec: 5646.0, 300 sec: 5663.2). Total num frames: 267266048. Throughput: 0: 5796.8. Samples: 267270220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:16,782][25689] Avg episode reward: [(0, '-48.987')] [2022-07-09 13:05:16,934][26022] Updated weights on worker 0-0, policy_version 261003 (0.00082) [2022-07-09 13:05:18,651][26022] Updated weights on worker 0-0, policy_version 261013 (0.00084) [2022-07-09 13:05:20,400][26022] Updated weights on worker 0-0, policy_version 261023 (0.00089) [2022-07-09 13:05:21,787][25689] Fps is (10 sec: 5491.4, 60 sec: 5634.3, 300 sec: 5661.0). Total num frames: 267293696. Throughput: 0: 4987.4. Samples: 267287424. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:21,789][25689] Avg episode reward: [(0, '-49.101')] [2022-07-09 13:05:22,355][26022] Updated weights on worker 0-0, policy_version 261033 (0.00089) [2022-07-09 13:05:24,039][26022] Updated weights on worker 0-0, policy_version 261043 (0.00092) [2022-07-09 13:05:25,767][26022] Updated weights on worker 0-0, policy_version 261053 (0.00092) [2022-07-09 13:05:26,797][25689] Fps is (10 sec: 5725.6, 60 sec: 5670.4, 300 sec: 5664.8). Total num frames: 267323392. Throughput: 0: 5971.5. Samples: 267321806. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:26,799][25689] Avg episode reward: [(0, '-49.758')] [2022-07-09 13:05:27,822][26022] Updated weights on worker 0-0, policy_version 261063 (0.00086) [2022-07-09 13:05:29,371][26022] Updated weights on worker 0-0, policy_version 261073 (0.00086) [2022-07-09 13:05:31,316][26022] Updated weights on worker 0-0, policy_version 261083 (0.00085) [2022-07-09 13:05:31,813][25689] Fps is (10 sec: 5719.6, 60 sec: 5652.8, 300 sec: 5665.7). Total num frames: 267351040. Throughput: 0: 5961.4. Samples: 267355788. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:31,815][25689] Avg episode reward: [(0, '-49.672')] [2022-07-09 13:05:33,055][26022] Updated weights on worker 0-0, policy_version 261093 (0.00090) [2022-07-09 13:05:34,852][26022] Updated weights on worker 0-0, policy_version 261103 (0.00087) [2022-07-09 13:05:36,607][26022] Updated weights on worker 0-0, policy_version 261113 (0.00055) [2022-07-09 13:05:36,857][25689] Fps is (10 sec: 5598.7, 60 sec: 5645.8, 300 sec: 5661.9). Total num frames: 267379712. Throughput: 0: 5111.9. Samples: 267372840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 13:05:36,859][25689] Avg episode reward: [(0, '-49.391')] [2022-07-09 13:05:38,384][26022] Updated weights on worker 0-0, policy_version 261123 (0.00089) [2022-07-09 13:05:40,235][26022] Updated weights on worker 0-0, policy_version 261133 (0.00085) [2022-07-09 13:05:41,849][26022] Updated weights on worker 0-0, policy_version 261143 (0.00081) [2022-07-09 13:05:41,875][25689] Fps is (10 sec: 5902.8, 60 sec: 5700.1, 300 sec: 5668.5). Total num frames: 267410432. Throughput: 0: 5969.0. Samples: 267407326. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:05:41,875][25689] Avg episode reward: [(0, '-50.462')] [2022-07-09 13:05:43,881][26022] Updated weights on worker 0-0, policy_version 261153 (0.00088) [2022-07-09 13:05:45,745][26022] Updated weights on worker 0-0, policy_version 261163 (0.01243) [2022-07-09 13:05:46,894][25689] Fps is (10 sec: 5713.2, 60 sec: 5648.0, 300 sec: 5661.5). Total num frames: 267437056. Throughput: 0: 5970.8. Samples: 267441798. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:05:46,894][25689] Avg episode reward: [(0, '-50.171')] [2022-07-09 13:05:47,349][26022] Updated weights on worker 0-0, policy_version 261173 (0.00082) [2022-07-09 13:05:49,192][26022] Updated weights on worker 0-0, policy_version 261183 (0.00093) [2022-07-09 13:05:50,943][26022] Updated weights on worker 0-0, policy_version 261193 (0.00083) [2022-07-09 13:05:51,904][25689] Fps is (10 sec: 5615.8, 60 sec: 5664.3, 300 sec: 5669.0). Total num frames: 267466752. Throughput: 0: 5981.6. Samples: 267475962. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:05:51,904][25689] Avg episode reward: [(0, '-50.291')] [2022-07-09 13:05:52,855][26022] Updated weights on worker 0-0, policy_version 261203 (0.00099) [2022-07-09 13:05:54,548][26022] Updated weights on worker 0-0, policy_version 261213 (0.00087) [2022-07-09 13:05:56,589][26022] Updated weights on worker 0-0, policy_version 261223 (0.00096) [2022-07-09 13:05:57,018][25689] Fps is (10 sec: 5562.8, 60 sec: 5642.9, 300 sec: 5660.1). Total num frames: 267493376. Throughput: 0: 5969.9. Samples: 267493202. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:05:57,019][25689] Avg episode reward: [(0, '-49.483')] [2022-07-09 13:05:58,157][26022] Updated weights on worker 0-0, policy_version 261233 (0.00085) [2022-07-09 13:06:00,214][26022] Updated weights on worker 0-0, policy_version 261243 (0.00085) [2022-07-09 13:06:02,039][25689] Fps is (10 sec: 5455.8, 60 sec: 5608.1, 300 sec: 5666.9). Total num frames: 267522048. Throughput: 0: 5942.4. Samples: 267527148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:02,039][25689] Avg episode reward: [(0, '-49.267')] [2022-07-09 13:06:02,340][26022] Updated weights on worker 0-0, policy_version 261253 (0.00086) [2022-07-09 13:06:03,866][26022] Updated weights on worker 0-0, policy_version 261263 (0.00085) [2022-07-09 13:06:05,912][26022] Updated weights on worker 0-0, policy_version 261273 (0.00116) [2022-07-09 13:06:07,085][25689] Fps is (10 sec: 5798.1, 60 sec: 5697.0, 300 sec: 5669.6). Total num frames: 267551744. Throughput: 0: 5815.5. Samples: 267559218. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:07,085][25689] Avg episode reward: [(0, '-48.845')] [2022-07-09 13:06:07,534][26022] Updated weights on worker 0-0, policy_version 261283 (0.00092) [2022-07-09 13:06:09,478][26022] Updated weights on worker 0-0, policy_version 261293 (0.00081) [2022-07-09 13:06:11,286][26022] Updated weights on worker 0-0, policy_version 261303 (0.00080) [2022-07-09 13:06:12,128][25689] Fps is (10 sec: 5683.8, 60 sec: 5644.6, 300 sec: 5667.3). Total num frames: 267579392. Throughput: 0: 4960.4. Samples: 267576282. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:12,128][25689] Avg episode reward: [(0, '-48.460')] [2022-07-09 13:06:13,038][26022] Updated weights on worker 0-0, policy_version 261313 (0.00082) [2022-07-09 13:06:14,849][26022] Updated weights on worker 0-0, policy_version 261323 (0.00081) [2022-07-09 13:06:16,434][26022] Updated weights on worker 0-0, policy_version 261333 (0.00087) [2022-07-09 13:06:17,257][25689] Fps is (10 sec: 5637.4, 60 sec: 5672.4, 300 sec: 5669.3). Total num frames: 267609088. Throughput: 0: 5816.1. Samples: 267610912. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:17,258][25689] Avg episode reward: [(0, '-48.291')] [2022-07-09 13:06:18,275][26022] Updated weights on worker 0-0, policy_version 261343 (0.00098) [2022-07-09 13:06:20,090][26022] Updated weights on worker 0-0, policy_version 261353 (0.00097) [2022-07-09 13:06:21,975][26022] Updated weights on worker 0-0, policy_version 261363 (0.00088) [2022-07-09 13:06:22,318][25689] Fps is (10 sec: 5728.1, 60 sec: 5684.2, 300 sec: 5665.5). Total num frames: 267637760. Throughput: 0: 5829.0. Samples: 267645352. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:22,318][25689] Avg episode reward: [(0, '-48.528')] [2022-07-09 13:06:23,681][26022] Updated weights on worker 0-0, policy_version 261373 (0.00106) [2022-07-09 13:06:25,687][26022] Updated weights on worker 0-0, policy_version 261383 (0.00096) [2022-07-09 13:06:27,173][26022] Updated weights on worker 0-0, policy_version 261393 (0.00087) [2022-07-09 13:06:27,387][25689] Fps is (10 sec: 5660.8, 60 sec: 5661.7, 300 sec: 5664.2). Total num frames: 267666432. Throughput: 0: 5084.2. Samples: 267662438. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:27,388][25689] Avg episode reward: [(0, '-48.645')] [2022-07-09 13:06:29,363][26022] Updated weights on worker 0-0, policy_version 261403 (0.00086) [2022-07-09 13:06:30,716][26022] Updated weights on worker 0-0, policy_version 261413 (0.00086) [2022-07-09 13:06:32,417][25689] Fps is (10 sec: 5576.3, 60 sec: 5660.3, 300 sec: 5664.9). Total num frames: 267694080. Throughput: 0: 5922.6. Samples: 267696450. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:32,418][25689] Avg episode reward: [(0, '-48.431')] [2022-07-09 13:06:32,819][26022] Updated weights on worker 0-0, policy_version 261423 (0.00083) [2022-07-09 13:06:34,592][26022] Updated weights on worker 0-0, policy_version 261433 (0.00095) [2022-07-09 13:06:36,227][26022] Updated weights on worker 0-0, policy_version 261443 (0.00084) [2022-07-09 13:06:37,457][25689] Fps is (10 sec: 5796.4, 60 sec: 5694.5, 300 sec: 5667.8). Total num frames: 267724800. Throughput: 0: 5930.0. Samples: 267730696. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:37,458][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 13:06:38,127][26022] Updated weights on worker 0-0, policy_version 261453 (0.00086) [2022-07-09 13:06:40,041][26022] Updated weights on worker 0-0, policy_version 261463 (0.00098) [2022-07-09 13:06:41,600][26022] Updated weights on worker 0-0, policy_version 261473 (0.00082) [2022-07-09 13:06:42,533][25689] Fps is (10 sec: 5770.2, 60 sec: 5638.4, 300 sec: 5667.6). Total num frames: 267752448. Throughput: 0: 5070.5. Samples: 267747854. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:42,533][25689] Avg episode reward: [(0, '-48.674')] [2022-07-09 13:06:43,657][26022] Updated weights on worker 0-0, policy_version 261483 (0.00080) [2022-07-09 13:06:45,083][26022] Updated weights on worker 0-0, policy_version 261493 (0.00085) [2022-07-09 13:06:47,268][26022] Updated weights on worker 0-0, policy_version 261503 (0.00085) [2022-07-09 13:06:47,536][25689] Fps is (10 sec: 5486.0, 60 sec: 5656.8, 300 sec: 5661.1). Total num frames: 267780096. Throughput: 0: 5936.1. Samples: 267782046. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:47,538][25689] Avg episode reward: [(0, '-48.153')] [2022-07-09 13:06:48,759][26022] Updated weights on worker 0-0, policy_version 261513 (0.00057) [2022-07-09 13:06:50,704][26022] Updated weights on worker 0-0, policy_version 261523 (0.00092) [2022-07-09 13:06:52,413][26022] Updated weights on worker 0-0, policy_version 261533 (0.00103) [2022-07-09 13:06:52,559][25689] Fps is (10 sec: 5719.5, 60 sec: 5655.6, 300 sec: 5672.2). Total num frames: 267809792. Throughput: 0: 5954.9. Samples: 267816392. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:52,560][25689] Avg episode reward: [(0, '-48.353')] [2022-07-09 13:06:54,281][26022] Updated weights on worker 0-0, policy_version 261543 (0.00092) [2022-07-09 13:06:56,046][26022] Updated weights on worker 0-0, policy_version 261553 (0.00089) [2022-07-09 13:06:57,461][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:06:57,474][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000261561_267838464.pth [2022-07-09 13:06:57,474][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000259567_265796608.pth [2022-07-09 13:06:57,609][25689] Fps is (10 sec: 5794.5, 60 sec: 5695.4, 300 sec: 5664.6). Total num frames: 267838464. Throughput: 0: 5106.4. Samples: 267833602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:06:57,610][25689] Avg episode reward: [(0, '-49.372')] [2022-07-09 13:06:57,875][26022] Updated weights on worker 0-0, policy_version 261563 (0.00079) [2022-07-09 13:06:59,552][26022] Updated weights on worker 0-0, policy_version 261573 (0.00086) [2022-07-09 13:07:01,763][26022] Updated weights on worker 0-0, policy_version 261583 (0.00104) [2022-07-09 13:07:02,638][25689] Fps is (10 sec: 5486.1, 60 sec: 5660.8, 300 sec: 5668.1). Total num frames: 267865088. Throughput: 0: 5955.7. Samples: 267867596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:02,639][25689] Avg episode reward: [(0, '-49.808')] [2022-07-09 13:07:03,728][26022] Updated weights on worker 0-0, policy_version 261593 (0.00093) [2022-07-09 13:07:05,476][26022] Updated weights on worker 0-0, policy_version 261603 (0.00092) [2022-07-09 13:07:07,280][26022] Updated weights on worker 0-0, policy_version 261613 (0.00088) [2022-07-09 13:07:07,711][25689] Fps is (10 sec: 5474.1, 60 sec: 5641.4, 300 sec: 5667.6). Total num frames: 267893760. Throughput: 0: 5841.8. Samples: 267899900. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:07,711][25689] Avg episode reward: [(0, '-50.608')] [2022-07-09 13:07:09,106][26022] Updated weights on worker 0-0, policy_version 261623 (0.00092) [2022-07-09 13:07:10,803][26022] Updated weights on worker 0-0, policy_version 261633 (0.00079) [2022-07-09 13:07:12,690][26022] Updated weights on worker 0-0, policy_version 261643 (0.00093) [2022-07-09 13:07:12,725][25689] Fps is (10 sec: 5685.3, 60 sec: 5661.0, 300 sec: 5665.1). Total num frames: 267922432. Throughput: 0: 4985.2. Samples: 267916922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:12,725][25689] Avg episode reward: [(0, '-50.384')] [2022-07-09 13:07:14,230][26022] Updated weights on worker 0-0, policy_version 261653 (0.00086) [2022-07-09 13:07:16,184][26022] Updated weights on worker 0-0, policy_version 261663 (0.00096) [2022-07-09 13:07:17,806][25689] Fps is (10 sec: 5781.6, 60 sec: 5665.5, 300 sec: 5664.5). Total num frames: 267952128. Throughput: 0: 5841.8. Samples: 267951588. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:17,807][25689] Avg episode reward: [(0, '-50.128')] [2022-07-09 13:07:17,974][26022] Updated weights on worker 0-0, policy_version 261673 (0.00056) [2022-07-09 13:07:19,650][26022] Updated weights on worker 0-0, policy_version 261683 (0.00102) [2022-07-09 13:07:21,664][26022] Updated weights on worker 0-0, policy_version 261693 (0.00087) [2022-07-09 13:07:22,823][25689] Fps is (10 sec: 5780.3, 60 sec: 5669.6, 300 sec: 5665.9). Total num frames: 267980800. Throughput: 0: 5857.8. Samples: 267985832. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:22,823][25689] Avg episode reward: [(0, '-49.894')] [2022-07-09 13:07:23,265][26022] Updated weights on worker 0-0, policy_version 261703 (0.00086) [2022-07-09 13:07:25,162][26022] Updated weights on worker 0-0, policy_version 261713 (0.00080) [2022-07-09 13:07:26,861][26022] Updated weights on worker 0-0, policy_version 261723 (0.00091) [2022-07-09 13:07:27,849][25689] Fps is (10 sec: 5710.0, 60 sec: 5673.7, 300 sec: 5669.1). Total num frames: 268009472. Throughput: 0: 5126.4. Samples: 268003136. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:27,849][25689] Avg episode reward: [(0, '-49.270')] [2022-07-09 13:07:28,614][26022] Updated weights on worker 0-0, policy_version 261733 (0.00094) [2022-07-09 13:07:30,540][26022] Updated weights on worker 0-0, policy_version 261743 (0.00095) [2022-07-09 13:07:32,319][26022] Updated weights on worker 0-0, policy_version 261753 (0.00086) [2022-07-09 13:07:32,873][25689] Fps is (10 sec: 5603.5, 60 sec: 5674.2, 300 sec: 5667.4). Total num frames: 268037120. Throughput: 0: 5975.0. Samples: 268037310. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:32,874][25689] Avg episode reward: [(0, '-49.720')] [2022-07-09 13:07:34,096][26022] Updated weights on worker 0-0, policy_version 261763 (0.00088) [2022-07-09 13:07:35,874][26022] Updated weights on worker 0-0, policy_version 261773 (0.00080) [2022-07-09 13:07:37,782][26022] Updated weights on worker 0-0, policy_version 261783 (0.00086) [2022-07-09 13:07:37,929][25689] Fps is (10 sec: 5688.8, 60 sec: 5655.8, 300 sec: 5664.3). Total num frames: 268066816. Throughput: 0: 5960.2. Samples: 268071524. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:37,930][25689] Avg episode reward: [(0, '-49.212')] [2022-07-09 13:07:39,559][26022] Updated weights on worker 0-0, policy_version 261793 (0.00084) [2022-07-09 13:07:41,160][26022] Updated weights on worker 0-0, policy_version 261803 (0.00086) [2022-07-09 13:07:42,857][26022] Updated weights on worker 0-0, policy_version 261813 (0.00082) [2022-07-09 13:07:42,942][25689] Fps is (10 sec: 5898.7, 60 sec: 5695.6, 300 sec: 5671.6). Total num frames: 268096512. Throughput: 0: 5123.2. Samples: 268088908. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:42,942][25689] Avg episode reward: [(0, '-49.191')] [2022-07-09 13:07:44,966][26022] Updated weights on worker 0-0, policy_version 261823 (0.00087) [2022-07-09 13:07:46,618][26022] Updated weights on worker 0-0, policy_version 261833 (0.00085) [2022-07-09 13:07:47,951][25689] Fps is (10 sec: 5722.0, 60 sec: 5695.1, 300 sec: 5669.6). Total num frames: 268124160. Throughput: 0: 5986.1. Samples: 268123468. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:47,951][25689] Avg episode reward: [(0, '-48.388')] [2022-07-09 13:07:48,295][26022] Updated weights on worker 0-0, policy_version 261843 (0.00088) [2022-07-09 13:07:50,072][26022] Updated weights on worker 0-0, policy_version 261853 (0.00084) [2022-07-09 13:07:51,880][26022] Updated weights on worker 0-0, policy_version 261863 (0.00087) [2022-07-09 13:07:52,969][25689] Fps is (10 sec: 5718.8, 60 sec: 5695.5, 300 sec: 5671.0). Total num frames: 268153856. Throughput: 0: 6019.3. Samples: 268158274. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:52,970][25689] Avg episode reward: [(0, '-48.165')] [2022-07-09 13:07:53,738][26022] Updated weights on worker 0-0, policy_version 261873 (0.00085) [2022-07-09 13:07:55,561][26022] Updated weights on worker 0-0, policy_version 261883 (0.00085) [2022-07-09 13:07:57,275][26022] Updated weights on worker 0-0, policy_version 261893 (0.00089) [2022-07-09 13:07:58,027][25689] Fps is (10 sec: 5691.0, 60 sec: 5677.8, 300 sec: 5671.1). Total num frames: 268181504. Throughput: 0: 5170.5. Samples: 268175440. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 13:07:58,027][25689] Avg episode reward: [(0, '-48.000')] [2022-07-09 13:07:59,232][26022] Updated weights on worker 0-0, policy_version 261903 (0.00093) [2022-07-09 13:08:00,772][26022] Updated weights on worker 0-0, policy_version 261913 (0.00092) [2022-07-09 13:08:03,059][25689] Fps is (10 sec: 5379.1, 60 sec: 5677.6, 300 sec: 5671.9). Total num frames: 268208128. Throughput: 0: 5987.8. Samples: 268209364. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:03,059][25689] Avg episode reward: [(0, '-47.505')] [2022-07-09 13:08:03,306][26022] Updated weights on worker 0-0, policy_version 261923 (0.00083) [2022-07-09 13:08:04,978][26022] Updated weights on worker 0-0, policy_version 261933 (0.00459) [2022-07-09 13:08:06,742][26022] Updated weights on worker 0-0, policy_version 261943 (0.00064) [2022-07-09 13:08:08,099][25689] Fps is (10 sec: 5490.2, 60 sec: 5680.6, 300 sec: 5669.0). Total num frames: 268236800. Throughput: 0: 5846.4. Samples: 268241262. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:08,099][25689] Avg episode reward: [(0, '-47.798')] [2022-07-09 13:08:08,589][26022] Updated weights on worker 0-0, policy_version 261953 (0.00086) [2022-07-09 13:08:10,141][26022] Updated weights on worker 0-0, policy_version 261963 (0.00086) [2022-07-09 13:08:12,202][26022] Updated weights on worker 0-0, policy_version 261973 (0.00093) [2022-07-09 13:08:13,127][25689] Fps is (10 sec: 5797.3, 60 sec: 5696.2, 300 sec: 5674.0). Total num frames: 268266496. Throughput: 0: 4964.2. Samples: 268258340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:13,127][25689] Avg episode reward: [(0, '-48.430')] [2022-07-09 13:08:13,742][26022] Updated weights on worker 0-0, policy_version 261983 (0.00090) [2022-07-09 13:08:15,787][26022] Updated weights on worker 0-0, policy_version 261993 (0.00088) [2022-07-09 13:08:17,543][26022] Updated weights on worker 0-0, policy_version 262003 (0.00093) [2022-07-09 13:08:18,204][25689] Fps is (10 sec: 5775.8, 60 sec: 5679.6, 300 sec: 5669.5). Total num frames: 268295168. Throughput: 0: 5805.8. Samples: 268292586. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:18,205][25689] Avg episode reward: [(0, '-48.228')] [2022-07-09 13:08:19,303][26022] Updated weights on worker 0-0, policy_version 262013 (0.00091) [2022-07-09 13:08:20,957][26022] Updated weights on worker 0-0, policy_version 262023 (0.00081) [2022-07-09 13:08:23,062][26022] Updated weights on worker 0-0, policy_version 262033 (0.00079) [2022-07-09 13:08:23,245][25689] Fps is (10 sec: 5566.1, 60 sec: 5660.4, 300 sec: 5672.5). Total num frames: 268322816. Throughput: 0: 5825.6. Samples: 268326964. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:23,246][25689] Avg episode reward: [(0, '-49.043')] [2022-07-09 13:08:24,507][26022] Updated weights on worker 0-0, policy_version 262043 (0.00090) [2022-07-09 13:08:26,584][26022] Updated weights on worker 0-0, policy_version 262053 (0.00091) [2022-07-09 13:08:28,097][26022] Updated weights on worker 0-0, policy_version 262063 (0.00090) [2022-07-09 13:08:28,247][25689] Fps is (10 sec: 5710.1, 60 sec: 5679.7, 300 sec: 5672.7). Total num frames: 268352512. Throughput: 0: 5958.8. Samples: 268361320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:28,247][25689] Avg episode reward: [(0, '-48.877')] [2022-07-09 13:08:30,139][26022] Updated weights on worker 0-0, policy_version 262073 (0.00095) [2022-07-09 13:08:31,789][26022] Updated weights on worker 0-0, policy_version 262083 (0.00080) [2022-07-09 13:08:33,287][25689] Fps is (10 sec: 5710.6, 60 sec: 5678.2, 300 sec: 5669.3). Total num frames: 268380160. Throughput: 0: 5956.3. Samples: 268378420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:33,287][25689] Avg episode reward: [(0, '-49.243')] [2022-07-09 13:08:33,785][26022] Updated weights on worker 0-0, policy_version 262093 (0.00090) [2022-07-09 13:08:35,343][26022] Updated weights on worker 0-0, policy_version 262103 (0.00083) [2022-07-09 13:08:37,359][26022] Updated weights on worker 0-0, policy_version 262113 (0.00083) [2022-07-09 13:08:38,432][25689] Fps is (10 sec: 5630.0, 60 sec: 5669.8, 300 sec: 5663.3). Total num frames: 268409856. Throughput: 0: 5941.9. Samples: 268412780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:38,433][25689] Avg episode reward: [(0, '-50.151')] [2022-07-09 13:08:38,914][26022] Updated weights on worker 0-0, policy_version 262123 (0.00086) [2022-07-09 13:08:40,948][26022] Updated weights on worker 0-0, policy_version 262133 (0.00084) [2022-07-09 13:08:42,471][26022] Updated weights on worker 0-0, policy_version 262143 (0.00089) [2022-07-09 13:08:43,456][25689] Fps is (10 sec: 5739.8, 60 sec: 5651.9, 300 sec: 5666.7). Total num frames: 268438528. Throughput: 0: 5952.5. Samples: 268447268. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:43,456][25689] Avg episode reward: [(0, '-49.846')] [2022-07-09 13:08:44,570][26022] Updated weights on worker 0-0, policy_version 262153 (0.00097) [2022-07-09 13:08:46,134][26022] Updated weights on worker 0-0, policy_version 262163 (0.00089) [2022-07-09 13:08:47,997][26022] Updated weights on worker 0-0, policy_version 262173 (0.00087) [2022-07-09 13:08:48,463][25689] Fps is (10 sec: 5717.2, 60 sec: 5669.0, 300 sec: 5670.0). Total num frames: 268467200. Throughput: 0: 5095.8. Samples: 268464334. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:48,463][25689] Avg episode reward: [(0, '-49.677')] [2022-07-09 13:08:49,671][26022] Updated weights on worker 0-0, policy_version 262183 (0.00095) [2022-07-09 13:08:51,677][26022] Updated weights on worker 0-0, policy_version 262193 (0.00100) [2022-07-09 13:08:53,339][26022] Updated weights on worker 0-0, policy_version 262203 (0.00094) [2022-07-09 13:08:53,486][25689] Fps is (10 sec: 5717.2, 60 sec: 5651.6, 300 sec: 5664.2). Total num frames: 268495872. Throughput: 0: 5941.9. Samples: 268498440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:53,486][25689] Avg episode reward: [(0, '-48.693')] [2022-07-09 13:08:55,336][26022] Updated weights on worker 0-0, policy_version 262213 (0.00098) [2022-07-09 13:08:57,003][26022] Updated weights on worker 0-0, policy_version 262223 (0.00076) [2022-07-09 13:08:57,718][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:08:57,735][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000262226_268519424.pth [2022-07-09 13:08:57,735][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000260232_266477568.pth [2022-07-09 13:08:58,533][25689] Fps is (10 sec: 5694.2, 60 sec: 5669.5, 300 sec: 5663.5). Total num frames: 268524544. Throughput: 0: 5966.3. Samples: 268532708. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:08:58,535][25689] Avg episode reward: [(0, '-48.891')] [2022-07-09 13:08:58,786][26022] Updated weights on worker 0-0, policy_version 262233 (0.00088) [2022-07-09 13:09:00,727][26022] Updated weights on worker 0-0, policy_version 262243 (0.00090) [2022-07-09 13:09:02,616][26022] Updated weights on worker 0-0, policy_version 262253 (0.00091) [2022-07-09 13:09:03,551][25689] Fps is (10 sec: 5494.1, 60 sec: 5670.8, 300 sec: 5667.0). Total num frames: 268551168. Throughput: 0: 5111.0. Samples: 268549972. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:03,553][25689] Avg episode reward: [(0, '-49.183')] [2022-07-09 13:09:04,803][26022] Updated weights on worker 0-0, policy_version 262263 (0.00094) [2022-07-09 13:09:06,377][26022] Updated weights on worker 0-0, policy_version 262273 (0.00088) [2022-07-09 13:09:08,399][26022] Updated weights on worker 0-0, policy_version 262283 (0.00090) [2022-07-09 13:09:08,559][25689] Fps is (10 sec: 5413.2, 60 sec: 5656.9, 300 sec: 5663.6). Total num frames: 268578816. Throughput: 0: 5834.3. Samples: 268581584. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:08,561][25689] Avg episode reward: [(0, '-49.317')] [2022-07-09 13:09:10,146][26022] Updated weights on worker 0-0, policy_version 262293 (0.00090) [2022-07-09 13:09:12,106][26022] Updated weights on worker 0-0, policy_version 262303 (0.00486) [2022-07-09 13:09:13,569][25689] Fps is (10 sec: 5621.6, 60 sec: 5641.6, 300 sec: 5661.3). Total num frames: 268607488. Throughput: 0: 5836.3. Samples: 268615652. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:13,571][25689] Avg episode reward: [(0, '-49.124')] [2022-07-09 13:09:13,595][26022] Updated weights on worker 0-0, policy_version 262313 (0.00083) [2022-07-09 13:09:15,571][26022] Updated weights on worker 0-0, policy_version 262323 (0.00088) [2022-07-09 13:09:17,219][26022] Updated weights on worker 0-0, policy_version 262333 (0.00091) [2022-07-09 13:09:18,691][25689] Fps is (10 sec: 5659.8, 60 sec: 5637.5, 300 sec: 5660.1). Total num frames: 268636160. Throughput: 0: 4977.1. Samples: 268633034. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:18,693][25689] Avg episode reward: [(0, '-48.381')] [2022-07-09 13:09:19,101][26022] Updated weights on worker 0-0, policy_version 262343 (0.00090) [2022-07-09 13:09:20,684][26022] Updated weights on worker 0-0, policy_version 262353 (0.00085) [2022-07-09 13:09:22,650][26022] Updated weights on worker 0-0, policy_version 262363 (0.00089) [2022-07-09 13:09:23,699][25689] Fps is (10 sec: 5863.2, 60 sec: 5691.4, 300 sec: 5670.9). Total num frames: 268666880. Throughput: 0: 5837.5. Samples: 268667588. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:23,699][25689] Avg episode reward: [(0, '-47.689')] [2022-07-09 13:09:24,291][26022] Updated weights on worker 0-0, policy_version 262373 (0.00525) [2022-07-09 13:09:26,196][26022] Updated weights on worker 0-0, policy_version 262383 (0.00094) [2022-07-09 13:09:27,885][26022] Updated weights on worker 0-0, policy_version 262393 (0.00083) [2022-07-09 13:09:28,713][25689] Fps is (10 sec: 5619.5, 60 sec: 5622.4, 300 sec: 5660.5). Total num frames: 268692480. Throughput: 0: 5974.9. Samples: 268702002. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:28,714][25689] Avg episode reward: [(0, '-47.468')] [2022-07-09 13:09:29,695][26022] Updated weights on worker 0-0, policy_version 262403 (0.00079) [2022-07-09 13:09:31,617][26022] Updated weights on worker 0-0, policy_version 262413 (0.00087) [2022-07-09 13:09:33,275][26022] Updated weights on worker 0-0, policy_version 262423 (0.00092) [2022-07-09 13:09:33,730][25689] Fps is (10 sec: 5512.4, 60 sec: 5658.5, 300 sec: 5663.0). Total num frames: 268722176. Throughput: 0: 5128.8. Samples: 268719054. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:33,731][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 13:09:35,257][26022] Updated weights on worker 0-0, policy_version 262433 (0.00086) [2022-07-09 13:09:37,056][26022] Updated weights on worker 0-0, policy_version 262443 (0.00085) [2022-07-09 13:09:38,776][25689] Fps is (10 sec: 5800.1, 60 sec: 5650.8, 300 sec: 5666.6). Total num frames: 268750848. Throughput: 0: 5966.0. Samples: 268752864. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:38,777][25689] Avg episode reward: [(0, '-46.825')] [2022-07-09 13:09:38,887][26022] Updated weights on worker 0-0, policy_version 262453 (0.00100) [2022-07-09 13:09:40,664][26022] Updated weights on worker 0-0, policy_version 262463 (0.00063) [2022-07-09 13:09:42,429][26022] Updated weights on worker 0-0, policy_version 262473 (0.00082) [2022-07-09 13:09:43,804][25689] Fps is (10 sec: 5692.4, 60 sec: 5650.4, 300 sec: 5662.8). Total num frames: 268779520. Throughput: 0: 5943.7. Samples: 268787086. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:43,805][25689] Avg episode reward: [(0, '-47.006')] [2022-07-09 13:09:44,202][26022] Updated weights on worker 0-0, policy_version 262483 (0.00087) [2022-07-09 13:09:45,987][26022] Updated weights on worker 0-0, policy_version 262493 (0.00085) [2022-07-09 13:09:47,906][26022] Updated weights on worker 0-0, policy_version 262503 (0.00093) [2022-07-09 13:09:48,847][25689] Fps is (10 sec: 5592.3, 60 sec: 5630.0, 300 sec: 5658.6). Total num frames: 268807168. Throughput: 0: 5077.6. Samples: 268804236. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:48,848][25689] Avg episode reward: [(0, '-48.859')] [2022-07-09 13:09:49,444][26022] Updated weights on worker 0-0, policy_version 262513 (0.00094) [2022-07-09 13:09:51,378][26022] Updated weights on worker 0-0, policy_version 262523 (0.00091) [2022-07-09 13:09:53,159][26022] Updated weights on worker 0-0, policy_version 262533 (0.00082) [2022-07-09 13:09:53,874][25689] Fps is (10 sec: 5796.3, 60 sec: 5663.6, 300 sec: 5669.7). Total num frames: 268837888. Throughput: 0: 5928.4. Samples: 268838474. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:53,874][25689] Avg episode reward: [(0, '-49.092')] [2022-07-09 13:09:55,076][26022] Updated weights on worker 0-0, policy_version 262543 (0.00091) [2022-07-09 13:09:56,781][26022] Updated weights on worker 0-0, policy_version 262553 (0.00093) [2022-07-09 13:09:58,558][26022] Updated weights on worker 0-0, policy_version 262563 (0.00086) [2022-07-09 13:09:58,924][25689] Fps is (10 sec: 5792.5, 60 sec: 5646.4, 300 sec: 5658.6). Total num frames: 268865536. Throughput: 0: 5956.8. Samples: 268872878. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:09:58,924][25689] Avg episode reward: [(0, '-48.606')] [2022-07-09 13:10:00,296][26022] Updated weights on worker 0-0, policy_version 262573 (0.00443) [2022-07-09 13:10:02,574][26022] Updated weights on worker 0-0, policy_version 262583 (0.00094) [2022-07-09 13:10:03,980][25689] Fps is (10 sec: 5370.3, 60 sec: 5642.8, 300 sec: 5666.1). Total num frames: 268892160. Throughput: 0: 5098.1. Samples: 268889944. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:10:03,980][25689] Avg episode reward: [(0, '-48.951')] [2022-07-09 13:10:04,306][26022] Updated weights on worker 0-0, policy_version 262593 (0.00092) [2022-07-09 13:10:06,252][26022] Updated weights on worker 0-0, policy_version 262603 (0.00081) [2022-07-09 13:10:07,915][26022] Updated weights on worker 0-0, policy_version 262613 (0.00097) [2022-07-09 13:10:08,992][25689] Fps is (10 sec: 5390.6, 60 sec: 5642.5, 300 sec: 5656.1). Total num frames: 268919808. Throughput: 0: 5816.8. Samples: 268921412. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:10:08,992][25689] Avg episode reward: [(0, '-48.659')] [2022-07-09 13:10:09,821][26022] Updated weights on worker 0-0, policy_version 262623 (0.00087) [2022-07-09 13:10:11,635][26022] Updated weights on worker 0-0, policy_version 262633 (0.00092) [2022-07-09 13:10:13,463][26022] Updated weights on worker 0-0, policy_version 262643 (0.00084) [2022-07-09 13:10:14,036][25689] Fps is (10 sec: 5804.3, 60 sec: 5673.2, 300 sec: 5666.8). Total num frames: 268950528. Throughput: 0: 5818.8. Samples: 268955794. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:10:14,038][25689] Avg episode reward: [(0, '-47.767')] [2022-07-09 13:10:15,247][26022] Updated weights on worker 0-0, policy_version 262653 (0.00084) [2022-07-09 13:10:16,827][26022] Updated weights on worker 0-0, policy_version 262663 (0.00091) [2022-07-09 13:10:18,944][26022] Updated weights on worker 0-0, policy_version 262673 (0.00097) [2022-07-09 13:10:19,089][25689] Fps is (10 sec: 5679.3, 60 sec: 5645.8, 300 sec: 5662.5). Total num frames: 268977152. Throughput: 0: 4960.6. Samples: 268972908. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 13:10:19,089][25689] Avg episode reward: [(0, '-47.586')] [2022-07-09 13:10:20,546][26022] Updated weights on worker 0-0, policy_version 262683 (0.00081) [2022-07-09 13:10:22,451][26022] Updated weights on worker 0-0, policy_version 262693 (0.00087) [2022-07-09 13:10:24,022][26022] Updated weights on worker 0-0, policy_version 262703 (0.00089) [2022-07-09 13:10:24,119][25689] Fps is (10 sec: 5687.1, 60 sec: 5643.7, 300 sec: 5665.5). Total num frames: 269007872. Throughput: 0: 5834.0. Samples: 269007436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:24,123][25689] Avg episode reward: [(0, '-47.451')] [2022-07-09 13:10:26,063][26022] Updated weights on worker 0-0, policy_version 262713 (0.00098) [2022-07-09 13:10:27,795][26022] Updated weights on worker 0-0, policy_version 262723 (0.00090) [2022-07-09 13:10:29,152][25689] Fps is (10 sec: 5698.1, 60 sec: 5658.8, 300 sec: 5661.7). Total num frames: 269034496. Throughput: 0: 5956.5. Samples: 269041500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:29,153][25689] Avg episode reward: [(0, '-47.431')] [2022-07-09 13:10:29,548][26022] Updated weights on worker 0-0, policy_version 262733 (0.00088) [2022-07-09 13:10:31,333][26022] Updated weights on worker 0-0, policy_version 262743 (0.00086) [2022-07-09 13:10:33,180][26022] Updated weights on worker 0-0, policy_version 262753 (0.00084) [2022-07-09 13:10:34,179][25689] Fps is (10 sec: 5598.7, 60 sec: 5658.0, 300 sec: 5665.5). Total num frames: 269064192. Throughput: 0: 5099.4. Samples: 269058510. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:34,179][25689] Avg episode reward: [(0, '-48.181')] [2022-07-09 13:10:35,089][26022] Updated weights on worker 0-0, policy_version 262763 (0.00087) [2022-07-09 13:10:36,623][26022] Updated weights on worker 0-0, policy_version 262773 (0.00096) [2022-07-09 13:10:38,797][26022] Updated weights on worker 0-0, policy_version 262783 (0.00087) [2022-07-09 13:10:39,286][25689] Fps is (10 sec: 5760.0, 60 sec: 5652.3, 300 sec: 5656.9). Total num frames: 269092864. Throughput: 0: 5921.2. Samples: 269092498. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:39,286][25689] Avg episode reward: [(0, '-48.643')] [2022-07-09 13:10:40,339][26022] Updated weights on worker 0-0, policy_version 262793 (0.00088) [2022-07-09 13:10:42,385][26022] Updated weights on worker 0-0, policy_version 262803 (0.00087) [2022-07-09 13:10:43,999][26022] Updated weights on worker 0-0, policy_version 262813 (0.00100) [2022-07-09 13:10:44,302][25689] Fps is (10 sec: 5664.6, 60 sec: 5653.4, 300 sec: 5663.9). Total num frames: 269121536. Throughput: 0: 5924.3. Samples: 269127004. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:44,303][25689] Avg episode reward: [(0, '-48.447')] [2022-07-09 13:10:45,709][26022] Updated weights on worker 0-0, policy_version 262823 (0.00088) [2022-07-09 13:10:47,686][26022] Updated weights on worker 0-0, policy_version 262833 (0.00664) [2022-07-09 13:10:49,256][26022] Updated weights on worker 0-0, policy_version 262843 (0.00091) [2022-07-09 13:10:49,353][25689] Fps is (10 sec: 5797.8, 60 sec: 5686.5, 300 sec: 5663.1). Total num frames: 269151232. Throughput: 0: 5083.9. Samples: 269144198. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:49,354][25689] Avg episode reward: [(0, '-48.030')] [2022-07-09 13:10:51,141][26022] Updated weights on worker 0-0, policy_version 262853 (0.00086) [2022-07-09 13:10:52,914][26022] Updated weights on worker 0-0, policy_version 262863 (0.00097) [2022-07-09 13:10:54,397][25689] Fps is (10 sec: 5680.3, 60 sec: 5634.1, 300 sec: 5667.9). Total num frames: 269178880. Throughput: 0: 5927.7. Samples: 269178360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:54,398][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 13:10:54,740][26022] Updated weights on worker 0-0, policy_version 262873 (0.00087) [2022-07-09 13:10:56,524][26022] Updated weights on worker 0-0, policy_version 262883 (0.00090) [2022-07-09 13:10:58,032][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:10:58,044][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000262891_269200384.pth [2022-07-09 13:10:58,045][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000260898_267159552.pth [2022-07-09 13:10:58,609][26022] Updated weights on worker 0-0, policy_version 262893 (0.00087) [2022-07-09 13:10:59,444][25689] Fps is (10 sec: 5682.5, 60 sec: 5668.2, 300 sec: 5670.8). Total num frames: 269208576. Throughput: 0: 5954.1. Samples: 269212524. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:10:59,445][25689] Avg episode reward: [(0, '-48.585')] [2022-07-09 13:11:00,171][26022] Updated weights on worker 0-0, policy_version 262903 (0.00088) [2022-07-09 13:11:02,585][26022] Updated weights on worker 0-0, policy_version 262913 (0.00086) [2022-07-09 13:11:04,179][26022] Updated weights on worker 0-0, policy_version 262923 (0.00085) [2022-07-09 13:11:04,503][25689] Fps is (10 sec: 5572.8, 60 sec: 5667.9, 300 sec: 5660.2). Total num frames: 269235200. Throughput: 0: 5062.1. Samples: 269229266. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:04,504][25689] Avg episode reward: [(0, '-48.455')] [2022-07-09 13:11:06,024][26022] Updated weights on worker 0-0, policy_version 262933 (0.00115) [2022-07-09 13:11:07,674][26022] Updated weights on worker 0-0, policy_version 262943 (0.00095) [2022-07-09 13:11:09,510][25689] Fps is (10 sec: 5391.7, 60 sec: 5668.4, 300 sec: 5660.9). Total num frames: 269262848. Throughput: 0: 5820.6. Samples: 269261524. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:09,511][25689] Avg episode reward: [(0, '-48.270')] [2022-07-09 13:11:09,562][26022] Updated weights on worker 0-0, policy_version 262953 (0.00090) [2022-07-09 13:11:11,601][26022] Updated weights on worker 0-0, policy_version 262963 (0.00723) [2022-07-09 13:11:13,228][26022] Updated weights on worker 0-0, policy_version 262973 (0.00087) [2022-07-09 13:11:14,561][25689] Fps is (10 sec: 5497.7, 60 sec: 5616.9, 300 sec: 5655.5). Total num frames: 269290496. Throughput: 0: 5812.1. Samples: 269295556. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:14,565][25689] Avg episode reward: [(0, '-48.896')] [2022-07-09 13:11:15,102][26022] Updated weights on worker 0-0, policy_version 262983 (0.00084) [2022-07-09 13:11:16,891][26022] Updated weights on worker 0-0, policy_version 262993 (0.00087) [2022-07-09 13:11:18,620][26022] Updated weights on worker 0-0, policy_version 263003 (0.00093) [2022-07-09 13:11:19,666][25689] Fps is (10 sec: 5646.1, 60 sec: 5662.8, 300 sec: 5658.1). Total num frames: 269320192. Throughput: 0: 5791.0. Samples: 269329630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:19,667][25689] Avg episode reward: [(0, '-48.454')] [2022-07-09 13:11:20,493][26022] Updated weights on worker 0-0, policy_version 263013 (0.00088) [2022-07-09 13:11:22,358][26022] Updated weights on worker 0-0, policy_version 263023 (0.00550) [2022-07-09 13:11:23,962][26022] Updated weights on worker 0-0, policy_version 263033 (0.00088) [2022-07-09 13:11:24,765][25689] Fps is (10 sec: 5820.6, 60 sec: 5639.5, 300 sec: 5661.0). Total num frames: 269349888. Throughput: 0: 5793.8. Samples: 269346660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:24,767][25689] Avg episode reward: [(0, '-48.044')] [2022-07-09 13:11:26,052][26022] Updated weights on worker 0-0, policy_version 263043 (0.00092) [2022-07-09 13:11:27,391][26022] Updated weights on worker 0-0, policy_version 263053 (0.00946) [2022-07-09 13:11:29,432][26022] Updated weights on worker 0-0, policy_version 263063 (0.00084) [2022-07-09 13:11:29,803][25689] Fps is (10 sec: 5758.1, 60 sec: 5672.9, 300 sec: 5664.3). Total num frames: 269378560. Throughput: 0: 5881.5. Samples: 269380878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:29,803][25689] Avg episode reward: [(0, '-48.090')] [2022-07-09 13:11:31,417][26022] Updated weights on worker 0-0, policy_version 263073 (0.00085) [2022-07-09 13:11:32,897][26022] Updated weights on worker 0-0, policy_version 263083 (0.00087) [2022-07-09 13:11:34,814][25689] Fps is (10 sec: 5503.0, 60 sec: 5623.7, 300 sec: 5651.1). Total num frames: 269405184. Throughput: 0: 5895.4. Samples: 269414950. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:34,814][25689] Avg episode reward: [(0, '-46.858')] [2022-07-09 13:11:34,999][26022] Updated weights on worker 0-0, policy_version 263093 (0.00085) [2022-07-09 13:11:36,671][26022] Updated weights on worker 0-0, policy_version 263103 (0.00082) [2022-07-09 13:11:38,424][26022] Updated weights on worker 0-0, policy_version 263113 (0.00096) [2022-07-09 13:11:39,921][25689] Fps is (10 sec: 5566.5, 60 sec: 5640.6, 300 sec: 5657.4). Total num frames: 269434880. Throughput: 0: 5052.1. Samples: 269431956. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:39,923][25689] Avg episode reward: [(0, '-47.208')] [2022-07-09 13:11:40,374][26022] Updated weights on worker 0-0, policy_version 263123 (0.00088) [2022-07-09 13:11:42,177][26022] Updated weights on worker 0-0, policy_version 263133 (0.00085) [2022-07-09 13:11:43,791][26022] Updated weights on worker 0-0, policy_version 263143 (0.00089) [2022-07-09 13:11:44,928][25689] Fps is (10 sec: 5871.9, 60 sec: 5658.3, 300 sec: 5664.2). Total num frames: 269464576. Throughput: 0: 5927.5. Samples: 269466176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:44,930][25689] Avg episode reward: [(0, '-48.205')] [2022-07-09 13:11:45,743][26022] Updated weights on worker 0-0, policy_version 263153 (0.00092) [2022-07-09 13:11:47,302][26022] Updated weights on worker 0-0, policy_version 263163 (0.00086) [2022-07-09 13:11:49,364][26022] Updated weights on worker 0-0, policy_version 263173 (0.00100) [2022-07-09 13:11:49,967][25689] Fps is (10 sec: 5708.2, 60 sec: 5625.7, 300 sec: 5657.0). Total num frames: 269492224. Throughput: 0: 5951.4. Samples: 269500880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:49,967][25689] Avg episode reward: [(0, '-47.954')] [2022-07-09 13:11:50,900][26022] Updated weights on worker 0-0, policy_version 263183 (0.00094) [2022-07-09 13:11:52,799][26022] Updated weights on worker 0-0, policy_version 263193 (0.00090) [2022-07-09 13:11:54,698][26022] Updated weights on worker 0-0, policy_version 263203 (0.00087) [2022-07-09 13:11:54,969][25689] Fps is (10 sec: 5711.2, 60 sec: 5663.4, 300 sec: 5661.3). Total num frames: 269521920. Throughput: 0: 5122.9. Samples: 269518204. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:11:54,969][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 13:11:56,432][26022] Updated weights on worker 0-0, policy_version 263213 (0.00090) [2022-07-09 13:11:58,040][26022] Updated weights on worker 0-0, policy_version 263223 (0.00086) [2022-07-09 13:11:59,968][26022] Updated weights on worker 0-0, policy_version 263233 (0.00092) [2022-07-09 13:12:00,018][25689] Fps is (10 sec: 5807.0, 60 sec: 5646.3, 300 sec: 5667.9). Total num frames: 269550592. Throughput: 0: 5996.2. Samples: 269552462. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:00,018][25689] Avg episode reward: [(0, '-48.679')] [2022-07-09 13:12:01,819][26022] Updated weights on worker 0-0, policy_version 263243 (0.00093) [2022-07-09 13:12:04,096][26022] Updated weights on worker 0-0, policy_version 263253 (0.00875) [2022-07-09 13:12:05,024][25689] Fps is (10 sec: 5499.5, 60 sec: 5651.3, 300 sec: 5662.2). Total num frames: 269577216. Throughput: 0: 5894.7. Samples: 269584632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:05,024][25689] Avg episode reward: [(0, '-48.665')] [2022-07-09 13:12:05,740][26022] Updated weights on worker 0-0, policy_version 263263 (0.00092) [2022-07-09 13:12:07,530][26022] Updated weights on worker 0-0, policy_version 263273 (0.00086) [2022-07-09 13:12:09,383][26022] Updated weights on worker 0-0, policy_version 263283 (0.00091) [2022-07-09 13:12:10,029][25689] Fps is (10 sec: 5421.4, 60 sec: 5651.4, 300 sec: 5659.0). Total num frames: 269604864. Throughput: 0: 5026.9. Samples: 269601730. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:10,029][25689] Avg episode reward: [(0, '-47.610')] [2022-07-09 13:12:11,139][26022] Updated weights on worker 0-0, policy_version 263293 (0.00085) [2022-07-09 13:12:12,939][26022] Updated weights on worker 0-0, policy_version 263303 (0.00093) [2022-07-09 13:12:14,619][26022] Updated weights on worker 0-0, policy_version 263313 (0.00087) [2022-07-09 13:12:15,036][25689] Fps is (10 sec: 5625.2, 60 sec: 5672.5, 300 sec: 5656.9). Total num frames: 269633536. Throughput: 0: 5857.0. Samples: 269635736. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:15,036][25689] Avg episode reward: [(0, '-47.774')] [2022-07-09 13:12:16,437][26022] Updated weights on worker 0-0, policy_version 263323 (0.00085) [2022-07-09 13:12:18,275][26022] Updated weights on worker 0-0, policy_version 263333 (0.00086) [2022-07-09 13:12:20,059][26022] Updated weights on worker 0-0, policy_version 263343 (0.00087) [2022-07-09 13:12:20,103][25689] Fps is (10 sec: 5793.8, 60 sec: 5676.0, 300 sec: 5659.4). Total num frames: 269663232. Throughput: 0: 5862.4. Samples: 269670208. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:20,103][25689] Avg episode reward: [(0, '-48.241')] [2022-07-09 13:12:21,851][26022] Updated weights on worker 0-0, policy_version 263353 (0.00086) [2022-07-09 13:12:23,697][26022] Updated weights on worker 0-0, policy_version 263363 (0.00090) [2022-07-09 13:12:25,114][25689] Fps is (10 sec: 5791.6, 60 sec: 5667.3, 300 sec: 5659.7). Total num frames: 269691904. Throughput: 0: 5112.5. Samples: 269687342. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:25,114][25689] Avg episode reward: [(0, '-48.643')] [2022-07-09 13:12:25,428][26022] Updated weights on worker 0-0, policy_version 263373 (0.00095) [2022-07-09 13:12:27,394][26022] Updated weights on worker 0-0, policy_version 263383 (0.00084) [2022-07-09 13:12:28,891][26022] Updated weights on worker 0-0, policy_version 263393 (0.00095) [2022-07-09 13:12:30,127][25689] Fps is (10 sec: 5516.5, 60 sec: 5635.7, 300 sec: 5656.5). Total num frames: 269718528. Throughput: 0: 5978.9. Samples: 269721894. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:30,127][25689] Avg episode reward: [(0, '-49.202')] [2022-07-09 13:12:30,852][26022] Updated weights on worker 0-0, policy_version 263403 (0.00086) [2022-07-09 13:12:32,735][26022] Updated weights on worker 0-0, policy_version 263413 (0.00092) [2022-07-09 13:12:34,283][26022] Updated weights on worker 0-0, policy_version 263423 (0.00094) [2022-07-09 13:12:35,132][25689] Fps is (10 sec: 5723.8, 60 sec: 5704.1, 300 sec: 5660.9). Total num frames: 269749248. Throughput: 0: 5989.3. Samples: 269756100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:35,133][25689] Avg episode reward: [(0, '-49.359')] [2022-07-09 13:12:36,347][26022] Updated weights on worker 0-0, policy_version 263433 (0.00096) [2022-07-09 13:12:37,760][26022] Updated weights on worker 0-0, policy_version 263443 (0.00080) [2022-07-09 13:12:39,906][26022] Updated weights on worker 0-0, policy_version 263453 (0.00085) [2022-07-09 13:12:40,195][25689] Fps is (10 sec: 5797.4, 60 sec: 5674.4, 300 sec: 5653.0). Total num frames: 269776896. Throughput: 0: 5131.0. Samples: 269773300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 13:12:40,196][25689] Avg episode reward: [(0, '-49.748')] [2022-07-09 13:12:41,523][26022] Updated weights on worker 0-0, policy_version 263463 (0.00086) [2022-07-09 13:12:43,533][26022] Updated weights on worker 0-0, policy_version 263473 (0.00092) [2022-07-09 13:12:45,044][26022] Updated weights on worker 0-0, policy_version 263483 (0.00088) [2022-07-09 13:12:45,215][25689] Fps is (10 sec: 5789.1, 60 sec: 5690.2, 300 sec: 5663.2). Total num frames: 269807616. Throughput: 0: 5976.8. Samples: 269807480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:12:45,215][25689] Avg episode reward: [(0, '-50.447')] [2022-07-09 13:12:47,278][26022] Updated weights on worker 0-0, policy_version 263493 (0.00084) [2022-07-09 13:12:48,554][26022] Updated weights on worker 0-0, policy_version 263503 (0.00088) [2022-07-09 13:12:50,225][25689] Fps is (10 sec: 5717.3, 60 sec: 5675.9, 300 sec: 5653.0). Total num frames: 269834240. Throughput: 0: 5977.0. Samples: 269842018. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:12:50,226][25689] Avg episode reward: [(0, '-49.865')] [2022-07-09 13:12:50,750][26022] Updated weights on worker 0-0, policy_version 263513 (0.00099) [2022-07-09 13:12:52,546][26022] Updated weights on worker 0-0, policy_version 263523 (0.00085) [2022-07-09 13:12:54,071][26022] Updated weights on worker 0-0, policy_version 263533 (0.00085) [2022-07-09 13:12:55,247][25689] Fps is (10 sec: 5614.0, 60 sec: 5674.0, 300 sec: 5660.6). Total num frames: 269863936. Throughput: 0: 5123.4. Samples: 269859154. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:12:55,249][25689] Avg episode reward: [(0, '-50.060')] [2022-07-09 13:12:56,127][26022] Updated weights on worker 0-0, policy_version 263543 (0.00092) [2022-07-09 13:12:57,407][26022] Updated weights on worker 0-0, policy_version 263553 (0.00091) [2022-07-09 13:12:58,313][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:12:58,338][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000263556_269881344.pth [2022-07-09 13:12:58,339][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000261561_267838464.pth [2022-07-09 13:12:59,575][26022] Updated weights on worker 0-0, policy_version 263563 (0.00091) [2022-07-09 13:13:00,334][25689] Fps is (10 sec: 5875.1, 60 sec: 5687.4, 300 sec: 5669.8). Total num frames: 269893632. Throughput: 0: 5974.5. Samples: 269893620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:00,334][25689] Avg episode reward: [(0, '-49.670')] [2022-07-09 13:13:01,367][26022] Updated weights on worker 0-0, policy_version 263573 (0.00085) [2022-07-09 13:13:03,371][26022] Updated weights on worker 0-0, policy_version 263583 (0.00085) [2022-07-09 13:13:05,231][26022] Updated weights on worker 0-0, policy_version 263593 (0.00080) [2022-07-09 13:13:05,364][25689] Fps is (10 sec: 5465.5, 60 sec: 5668.1, 300 sec: 5659.7). Total num frames: 269919232. Throughput: 0: 5880.6. Samples: 269925970. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:05,364][25689] Avg episode reward: [(0, '-49.523')] [2022-07-09 13:13:06,831][26022] Updated weights on worker 0-0, policy_version 263603 (0.00089) [2022-07-09 13:13:08,862][26022] Updated weights on worker 0-0, policy_version 263613 (0.00081) [2022-07-09 13:13:10,378][26022] Updated weights on worker 0-0, policy_version 263623 (0.00089) [2022-07-09 13:13:10,380][25689] Fps is (10 sec: 5504.2, 60 sec: 5701.0, 300 sec: 5659.9). Total num frames: 269948928. Throughput: 0: 5029.1. Samples: 269943380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:10,380][25689] Avg episode reward: [(0, '-49.361')] [2022-07-09 13:13:12,449][26022] Updated weights on worker 0-0, policy_version 263633 (0.00091) [2022-07-09 13:13:14,202][26022] Updated weights on worker 0-0, policy_version 263643 (0.00086) [2022-07-09 13:13:15,411][25689] Fps is (10 sec: 5707.7, 60 sec: 5681.8, 300 sec: 5657.4). Total num frames: 269976576. Throughput: 0: 5858.2. Samples: 269977278. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:15,411][25689] Avg episode reward: [(0, '-49.360')] [2022-07-09 13:13:16,071][26022] Updated weights on worker 0-0, policy_version 263653 (0.00093) [2022-07-09 13:13:17,740][26022] Updated weights on worker 0-0, policy_version 263663 (0.00090) [2022-07-09 13:13:19,530][26022] Updated weights on worker 0-0, policy_version 263673 (0.00088) [2022-07-09 13:13:20,469][25689] Fps is (10 sec: 5582.4, 60 sec: 5665.7, 300 sec: 5660.5). Total num frames: 270005248. Throughput: 0: 5856.8. Samples: 270011546. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:20,470][25689] Avg episode reward: [(0, '-50.152')] [2022-07-09 13:13:21,235][26022] Updated weights on worker 0-0, policy_version 263683 (0.00084) [2022-07-09 13:13:23,208][26022] Updated weights on worker 0-0, policy_version 263693 (0.00084) [2022-07-09 13:13:25,131][26022] Updated weights on worker 0-0, policy_version 263703 (0.00090) [2022-07-09 13:13:25,491][25689] Fps is (10 sec: 5688.9, 60 sec: 5664.7, 300 sec: 5656.7). Total num frames: 270033920. Throughput: 0: 5100.5. Samples: 270028626. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:25,492][25689] Avg episode reward: [(0, '-49.632')] [2022-07-09 13:13:26,882][26022] Updated weights on worker 0-0, policy_version 263713 (0.00088) [2022-07-09 13:13:28,797][26022] Updated weights on worker 0-0, policy_version 263723 (0.00090) [2022-07-09 13:13:30,495][25689] Fps is (10 sec: 5617.4, 60 sec: 5682.5, 300 sec: 5657.3). Total num frames: 270061568. Throughput: 0: 5918.7. Samples: 270062434. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:30,495][25689] Avg episode reward: [(0, '-48.942')] [2022-07-09 13:13:30,532][26022] Updated weights on worker 0-0, policy_version 263733 (0.00086) [2022-07-09 13:13:32,214][26022] Updated weights on worker 0-0, policy_version 263743 (0.00086) [2022-07-09 13:13:34,255][26022] Updated weights on worker 0-0, policy_version 263753 (0.00088) [2022-07-09 13:13:35,515][25689] Fps is (10 sec: 5720.7, 60 sec: 5664.2, 300 sec: 5659.7). Total num frames: 270091264. Throughput: 0: 5939.1. Samples: 270096676. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:35,515][25689] Avg episode reward: [(0, '-49.078')] [2022-07-09 13:13:35,763][26022] Updated weights on worker 0-0, policy_version 263763 (0.00321) [2022-07-09 13:13:37,749][26022] Updated weights on worker 0-0, policy_version 263773 (0.00086) [2022-07-09 13:13:39,206][26022] Updated weights on worker 0-0, policy_version 263783 (0.00082) [2022-07-09 13:13:40,632][25689] Fps is (10 sec: 5656.8, 60 sec: 5659.0, 300 sec: 5654.5). Total num frames: 270118912. Throughput: 0: 5068.0. Samples: 270113732. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:40,633][25689] Avg episode reward: [(0, '-48.891')] [2022-07-09 13:13:41,261][26022] Updated weights on worker 0-0, policy_version 263793 (0.00054) [2022-07-09 13:13:43,267][26022] Updated weights on worker 0-0, policy_version 263803 (0.00086) [2022-07-09 13:13:44,856][26022] Updated weights on worker 0-0, policy_version 263813 (0.00094) [2022-07-09 13:13:45,733][25689] Fps is (10 sec: 5611.7, 60 sec: 5634.5, 300 sec: 5656.2). Total num frames: 270148608. Throughput: 0: 5891.3. Samples: 270147880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:45,734][25689] Avg episode reward: [(0, '-48.080')] [2022-07-09 13:13:46,691][26022] Updated weights on worker 0-0, policy_version 263823 (0.00094) [2022-07-09 13:13:48,467][26022] Updated weights on worker 0-0, policy_version 263833 (0.00095) [2022-07-09 13:13:50,264][26022] Updated weights on worker 0-0, policy_version 263843 (0.00082) [2022-07-09 13:13:50,763][25689] Fps is (10 sec: 5761.3, 60 sec: 5666.5, 300 sec: 5656.0). Total num frames: 270177280. Throughput: 0: 5905.7. Samples: 270182132. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:50,764][25689] Avg episode reward: [(0, '-47.438')] [2022-07-09 13:13:52,164][26022] Updated weights on worker 0-0, policy_version 263853 (0.00092) [2022-07-09 13:13:53,736][26022] Updated weights on worker 0-0, policy_version 263863 (0.00087) [2022-07-09 13:13:55,628][26022] Updated weights on worker 0-0, policy_version 263873 (0.00087) [2022-07-09 13:13:55,779][25689] Fps is (10 sec: 5810.4, 60 sec: 5667.1, 300 sec: 5660.1). Total num frames: 270206976. Throughput: 0: 5043.0. Samples: 270198868. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:13:55,780][25689] Avg episode reward: [(0, '-47.936')] [2022-07-09 13:13:57,695][26022] Updated weights on worker 0-0, policy_version 263883 (0.00068) [2022-07-09 13:13:59,150][26022] Updated weights on worker 0-0, policy_version 263893 (0.00100) [2022-07-09 13:14:00,821][25689] Fps is (10 sec: 5701.0, 60 sec: 5637.4, 300 sec: 5663.0). Total num frames: 270234624. Throughput: 0: 5927.0. Samples: 270233396. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:00,822][25689] Avg episode reward: [(0, '-48.041')] [2022-07-09 13:14:01,097][26022] Updated weights on worker 0-0, policy_version 263903 (0.00091) [2022-07-09 13:14:03,050][26022] Updated weights on worker 0-0, policy_version 263913 (0.00098) [2022-07-09 13:14:05,021][26022] Updated weights on worker 0-0, policy_version 263923 (0.00086) [2022-07-09 13:14:05,914][25689] Fps is (10 sec: 5455.6, 60 sec: 5665.4, 300 sec: 5661.4). Total num frames: 270262272. Throughput: 0: 5837.4. Samples: 270265686. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:05,916][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 13:14:07,032][26022] Updated weights on worker 0-0, policy_version 263933 (0.00088) [2022-07-09 13:14:08,479][26022] Updated weights on worker 0-0, policy_version 263943 (0.00085) [2022-07-09 13:14:10,455][26022] Updated weights on worker 0-0, policy_version 263953 (0.00097) [2022-07-09 13:14:10,950][25689] Fps is (10 sec: 5560.3, 60 sec: 5646.6, 300 sec: 5660.9). Total num frames: 270290944. Throughput: 0: 5836.3. Samples: 270299952. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:10,952][25689] Avg episode reward: [(0, '-48.190')] [2022-07-09 13:14:12,204][26022] Updated weights on worker 0-0, policy_version 263963 (0.00091) [2022-07-09 13:14:13,972][26022] Updated weights on worker 0-0, policy_version 263973 (0.00085) [2022-07-09 13:14:15,795][26022] Updated weights on worker 0-0, policy_version 263983 (0.00079) [2022-07-09 13:14:15,955][25689] Fps is (10 sec: 5711.3, 60 sec: 5666.0, 300 sec: 5663.2). Total num frames: 270319616. Throughput: 0: 5854.2. Samples: 270316982. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:15,955][25689] Avg episode reward: [(0, '-49.928')] [2022-07-09 13:14:17,610][26022] Updated weights on worker 0-0, policy_version 263993 (0.00085) [2022-07-09 13:14:19,355][26022] Updated weights on worker 0-0, policy_version 264003 (0.00089) [2022-07-09 13:14:21,016][25689] Fps is (10 sec: 5595.3, 60 sec: 5648.8, 300 sec: 5651.8). Total num frames: 270347264. Throughput: 0: 5848.2. Samples: 270351496. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:21,016][25689] Avg episode reward: [(0, '-49.612')] [2022-07-09 13:14:21,209][26022] Updated weights on worker 0-0, policy_version 264013 (0.00086) [2022-07-09 13:14:22,925][26022] Updated weights on worker 0-0, policy_version 264023 (0.00088) [2022-07-09 13:14:24,689][26022] Updated weights on worker 0-0, policy_version 264033 (0.00088) [2022-07-09 13:14:26,019][25689] Fps is (10 sec: 5697.8, 60 sec: 5667.4, 300 sec: 5665.8). Total num frames: 270376960. Throughput: 0: 5987.7. Samples: 270386066. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:26,020][25689] Avg episode reward: [(0, '-49.474')] [2022-07-09 13:14:26,485][26022] Updated weights on worker 0-0, policy_version 264043 (0.00086) [2022-07-09 13:14:28,228][26022] Updated weights on worker 0-0, policy_version 264053 (0.00092) [2022-07-09 13:14:30,044][26022] Updated weights on worker 0-0, policy_version 264063 (0.00089) [2022-07-09 13:14:31,039][25689] Fps is (10 sec: 5720.8, 60 sec: 5665.9, 300 sec: 5658.9). Total num frames: 270404608. Throughput: 0: 5143.2. Samples: 270403276. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:31,041][25689] Avg episode reward: [(0, '-49.290')] [2022-07-09 13:14:31,791][26022] Updated weights on worker 0-0, policy_version 264073 (0.00087) [2022-07-09 13:14:33,571][26022] Updated weights on worker 0-0, policy_version 264083 (0.00106) [2022-07-09 13:14:35,297][26022] Updated weights on worker 0-0, policy_version 264093 (0.00085) [2022-07-09 13:14:36,053][25689] Fps is (10 sec: 5613.1, 60 sec: 5649.6, 300 sec: 5659.5). Total num frames: 270433280. Throughput: 0: 6028.0. Samples: 270438132. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:36,054][25689] Avg episode reward: [(0, '-49.603')] [2022-07-09 13:14:37,205][26022] Updated weights on worker 0-0, policy_version 264103 (0.00095) [2022-07-09 13:14:39,025][26022] Updated weights on worker 0-0, policy_version 264113 (0.00087) [2022-07-09 13:14:40,796][26022] Updated weights on worker 0-0, policy_version 264123 (0.00083) [2022-07-09 13:14:41,124][25689] Fps is (10 sec: 5788.1, 60 sec: 5687.8, 300 sec: 5662.1). Total num frames: 270462976. Throughput: 0: 6018.9. Samples: 270472522. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:41,125][25689] Avg episode reward: [(0, '-49.706')] [2022-07-09 13:14:42,478][26022] Updated weights on worker 0-0, policy_version 264133 (0.00089) [2022-07-09 13:14:44,237][26022] Updated weights on worker 0-0, policy_version 264143 (0.00087) [2022-07-09 13:14:45,912][26022] Updated weights on worker 0-0, policy_version 264153 (0.00112) [2022-07-09 13:14:46,136][25689] Fps is (10 sec: 5890.2, 60 sec: 5696.2, 300 sec: 5669.6). Total num frames: 270492672. Throughput: 0: 5157.7. Samples: 270489822. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:46,138][25689] Avg episode reward: [(0, '-48.687')] [2022-07-09 13:14:47,917][26022] Updated weights on worker 0-0, policy_version 264163 (0.00098) [2022-07-09 13:14:49,616][26022] Updated weights on worker 0-0, policy_version 264173 (0.00096) [2022-07-09 13:14:51,152][25689] Fps is (10 sec: 5820.1, 60 sec: 5697.4, 300 sec: 5662.9). Total num frames: 270521344. Throughput: 0: 6020.7. Samples: 270524368. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:51,153][25689] Avg episode reward: [(0, '-48.327')] [2022-07-09 13:14:51,449][26022] Updated weights on worker 0-0, policy_version 264183 (0.00087) [2022-07-09 13:14:53,182][26022] Updated weights on worker 0-0, policy_version 264193 (0.00085) [2022-07-09 13:14:54,912][26022] Updated weights on worker 0-0, policy_version 264203 (0.00051) [2022-07-09 13:14:56,161][25689] Fps is (10 sec: 5720.3, 60 sec: 5681.2, 300 sec: 5667.1). Total num frames: 270550016. Throughput: 0: 6003.6. Samples: 270558852. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:14:56,161][25689] Avg episode reward: [(0, '-47.968')] [2022-07-09 13:14:56,836][26022] Updated weights on worker 0-0, policy_version 264213 (0.00104) [2022-07-09 13:14:58,453][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:14:58,469][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000264222_270563328.pth [2022-07-09 13:14:58,469][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000262226_268519424.pth [2022-07-09 13:14:58,552][26022] Updated weights on worker 0-0, policy_version 264223 (0.00286) [2022-07-09 13:15:00,491][26022] Updated weights on worker 0-0, policy_version 264233 (0.00087) [2022-07-09 13:15:01,207][25689] Fps is (10 sec: 5805.1, 60 sec: 5714.8, 300 sec: 5677.7). Total num frames: 270579712. Throughput: 0: 5141.8. Samples: 270575786. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:15:01,209][25689] Avg episode reward: [(0, '-48.152')] [2022-07-09 13:15:02,471][26022] Updated weights on worker 0-0, policy_version 264243 (0.00094) [2022-07-09 13:15:04,568][26022] Updated weights on worker 0-0, policy_version 264253 (0.00092) [2022-07-09 13:15:06,177][26022] Updated weights on worker 0-0, policy_version 264263 (0.00072) [2022-07-09 13:15:06,215][25689] Fps is (10 sec: 5499.6, 60 sec: 5688.8, 300 sec: 5670.8). Total num frames: 270605312. Throughput: 0: 5875.1. Samples: 270607790. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 13:15:06,219][25689] Avg episode reward: [(0, '-48.323')] [2022-07-09 13:15:08,122][26022] Updated weights on worker 0-0, policy_version 264273 (0.00080) [2022-07-09 13:15:09,770][26022] Updated weights on worker 0-0, policy_version 264283 (0.00092) [2022-07-09 13:15:11,245][25689] Fps is (10 sec: 5304.3, 60 sec: 5672.4, 300 sec: 5660.8). Total num frames: 270632960. Throughput: 0: 5855.6. Samples: 270642026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:11,246][25689] Avg episode reward: [(0, '-48.648')] [2022-07-09 13:15:11,784][26022] Updated weights on worker 0-0, policy_version 264293 (0.00092) [2022-07-09 13:15:13,303][26022] Updated weights on worker 0-0, policy_version 264303 (0.00078) [2022-07-09 13:15:15,405][26022] Updated weights on worker 0-0, policy_version 264313 (0.00086) [2022-07-09 13:15:16,267][25689] Fps is (10 sec: 5705.1, 60 sec: 5687.8, 300 sec: 5671.7). Total num frames: 270662656. Throughput: 0: 4968.1. Samples: 270658740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:16,267][25689] Avg episode reward: [(0, '-48.739')] [2022-07-09 13:15:16,977][26022] Updated weights on worker 0-0, policy_version 264323 (0.00088) [2022-07-09 13:15:19,094][26022] Updated weights on worker 0-0, policy_version 264333 (0.00087) [2022-07-09 13:15:20,539][26022] Updated weights on worker 0-0, policy_version 264343 (0.00085) [2022-07-09 13:15:21,335][25689] Fps is (10 sec: 5784.8, 60 sec: 5704.0, 300 sec: 5664.1). Total num frames: 270691328. Throughput: 0: 5830.0. Samples: 270693136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:21,337][25689] Avg episode reward: [(0, '-49.819')] [2022-07-09 13:15:22,663][26022] Updated weights on worker 0-0, policy_version 264353 (0.00096) [2022-07-09 13:15:24,097][26022] Updated weights on worker 0-0, policy_version 264363 (0.00088) [2022-07-09 13:15:26,209][26022] Updated weights on worker 0-0, policy_version 264373 (0.00613) [2022-07-09 13:15:26,353][25689] Fps is (10 sec: 5482.0, 60 sec: 5651.7, 300 sec: 5664.4). Total num frames: 270717952. Throughput: 0: 5951.9. Samples: 270727650. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:26,355][25689] Avg episode reward: [(0, '-50.235')] [2022-07-09 13:15:27,781][26022] Updated weights on worker 0-0, policy_version 264383 (0.00092) [2022-07-09 13:15:29,679][26022] Updated weights on worker 0-0, policy_version 264393 (0.00093) [2022-07-09 13:15:31,364][25689] Fps is (10 sec: 5615.8, 60 sec: 5686.6, 300 sec: 5664.7). Total num frames: 270747648. Throughput: 0: 5105.1. Samples: 270744734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:31,365][25689] Avg episode reward: [(0, '-49.932')] [2022-07-09 13:15:31,598][26022] Updated weights on worker 0-0, policy_version 264403 (0.00083) [2022-07-09 13:15:33,147][26022] Updated weights on worker 0-0, policy_version 264413 (0.00086) [2022-07-09 13:15:35,075][26022] Updated weights on worker 0-0, policy_version 264423 (0.00087) [2022-07-09 13:15:36,388][25689] Fps is (10 sec: 5918.4, 60 sec: 5702.5, 300 sec: 5669.7). Total num frames: 270777344. Throughput: 0: 5972.1. Samples: 270778910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:36,389][25689] Avg episode reward: [(0, '-49.463')] [2022-07-09 13:15:36,920][26022] Updated weights on worker 0-0, policy_version 264433 (0.00090) [2022-07-09 13:15:38,563][26022] Updated weights on worker 0-0, policy_version 264443 (0.00083) [2022-07-09 13:15:40,603][26022] Updated weights on worker 0-0, policy_version 264453 (0.00085) [2022-07-09 13:15:41,477][25689] Fps is (10 sec: 5670.6, 60 sec: 5666.9, 300 sec: 5664.9). Total num frames: 270804992. Throughput: 0: 5951.9. Samples: 270813014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:41,477][25689] Avg episode reward: [(0, '-50.054')] [2022-07-09 13:15:42,092][26022] Updated weights on worker 0-0, policy_version 264463 (0.00088) [2022-07-09 13:15:44,122][26022] Updated weights on worker 0-0, policy_version 264473 (0.00090) [2022-07-09 13:15:45,703][26022] Updated weights on worker 0-0, policy_version 264483 (0.00098) [2022-07-09 13:15:46,523][25689] Fps is (10 sec: 5557.3, 60 sec: 5646.8, 300 sec: 5661.5). Total num frames: 270833664. Throughput: 0: 5084.7. Samples: 270830206. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:46,523][25689] Avg episode reward: [(0, '-49.016')] [2022-07-09 13:15:47,618][26022] Updated weights on worker 0-0, policy_version 264493 (0.00090) [2022-07-09 13:15:49,467][26022] Updated weights on worker 0-0, policy_version 264503 (0.00082) [2022-07-09 13:15:51,192][26022] Updated weights on worker 0-0, policy_version 264513 (0.00087) [2022-07-09 13:15:51,529][25689] Fps is (10 sec: 5806.1, 60 sec: 5664.6, 300 sec: 5669.1). Total num frames: 270863360. Throughput: 0: 5934.9. Samples: 270864414. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:51,530][25689] Avg episode reward: [(0, '-48.489')] [2022-07-09 13:15:53,040][26022] Updated weights on worker 0-0, policy_version 264523 (0.00090) [2022-07-09 13:15:54,808][26022] Updated weights on worker 0-0, policy_version 264533 (0.00081) [2022-07-09 13:15:56,559][25689] Fps is (10 sec: 5713.9, 60 sec: 5645.7, 300 sec: 5662.6). Total num frames: 270891008. Throughput: 0: 5917.4. Samples: 270898264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:15:56,559][25689] Avg episode reward: [(0, '-48.294')] [2022-07-09 13:15:56,699][26022] Updated weights on worker 0-0, policy_version 264543 (0.00096) [2022-07-09 13:15:58,462][26022] Updated weights on worker 0-0, policy_version 264553 (0.00090) [2022-07-09 13:16:00,087][26022] Updated weights on worker 0-0, policy_version 264563 (0.00089) [2022-07-09 13:16:01,603][25689] Fps is (10 sec: 5489.5, 60 sec: 5612.0, 300 sec: 5666.3). Total num frames: 270918656. Throughput: 0: 5095.0. Samples: 270915556. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:01,603][25689] Avg episode reward: [(0, '-47.562')] [2022-07-09 13:16:02,412][26022] Updated weights on worker 0-0, policy_version 264573 (0.00089) [2022-07-09 13:16:04,236][26022] Updated weights on worker 0-0, policy_version 264583 (0.00088) [2022-07-09 13:16:06,021][26022] Updated weights on worker 0-0, policy_version 264593 (0.00089) [2022-07-09 13:16:06,613][25689] Fps is (10 sec: 5601.3, 60 sec: 5662.7, 300 sec: 5669.7). Total num frames: 270947328. Throughput: 0: 5854.3. Samples: 270947822. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:06,614][25689] Avg episode reward: [(0, '-47.606')] [2022-07-09 13:16:07,824][26022] Updated weights on worker 0-0, policy_version 264603 (0.00087) [2022-07-09 13:16:09,428][26022] Updated weights on worker 0-0, policy_version 264613 (0.00084) [2022-07-09 13:16:11,499][26022] Updated weights on worker 0-0, policy_version 264623 (0.00094) [2022-07-09 13:16:11,634][25689] Fps is (10 sec: 5614.3, 60 sec: 5663.6, 300 sec: 5670.3). Total num frames: 270974976. Throughput: 0: 5842.1. Samples: 270981868. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:11,636][25689] Avg episode reward: [(0, '-47.320')] [2022-07-09 13:16:13,194][26022] Updated weights on worker 0-0, policy_version 264633 (0.00094) [2022-07-09 13:16:14,959][26022] Updated weights on worker 0-0, policy_version 264643 (0.00083) [2022-07-09 13:16:16,651][25689] Fps is (10 sec: 5509.1, 60 sec: 5630.1, 300 sec: 5665.1). Total num frames: 271002624. Throughput: 0: 5013.4. Samples: 270998994. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:16,652][25689] Avg episode reward: [(0, '-47.654')] [2022-07-09 13:16:16,915][26022] Updated weights on worker 0-0, policy_version 264653 (0.00092) [2022-07-09 13:16:18,597][26022] Updated weights on worker 0-0, policy_version 264663 (0.00085) [2022-07-09 13:16:20,587][26022] Updated weights on worker 0-0, policy_version 264673 (0.00085) [2022-07-09 13:16:21,761][25689] Fps is (10 sec: 5662.7, 60 sec: 5643.2, 300 sec: 5664.8). Total num frames: 271032320. Throughput: 0: 5849.3. Samples: 271033466. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:21,761][25689] Avg episode reward: [(0, '-48.696')] [2022-07-09 13:16:22,027][26022] Updated weights on worker 0-0, policy_version 264683 (0.00087) [2022-07-09 13:16:24,008][26022] Updated weights on worker 0-0, policy_version 264693 (0.00093) [2022-07-09 13:16:25,674][26022] Updated weights on worker 0-0, policy_version 264703 (0.00093) [2022-07-09 13:16:26,768][25689] Fps is (10 sec: 5769.2, 60 sec: 5678.1, 300 sec: 5665.4). Total num frames: 271060992. Throughput: 0: 5940.1. Samples: 271067542. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:26,770][25689] Avg episode reward: [(0, '-48.724')] [2022-07-09 13:16:27,756][26022] Updated weights on worker 0-0, policy_version 264713 (0.00089) [2022-07-09 13:16:29,363][26022] Updated weights on worker 0-0, policy_version 264723 (0.00094) [2022-07-09 13:16:31,301][26022] Updated weights on worker 0-0, policy_version 264733 (0.00085) [2022-07-09 13:16:31,807][25689] Fps is (10 sec: 5606.3, 60 sec: 5641.6, 300 sec: 5668.3). Total num frames: 271088640. Throughput: 0: 5093.3. Samples: 271084612. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:31,808][25689] Avg episode reward: [(0, '-49.488')] [2022-07-09 13:16:33,062][26022] Updated weights on worker 0-0, policy_version 264743 (0.00085) [2022-07-09 13:16:34,815][26022] Updated weights on worker 0-0, policy_version 264753 (0.00089) [2022-07-09 13:16:36,643][26022] Updated weights on worker 0-0, policy_version 264763 (0.00092) [2022-07-09 13:16:36,876][25689] Fps is (10 sec: 5672.8, 60 sec: 5637.4, 300 sec: 5669.0). Total num frames: 271118336. Throughput: 0: 5905.1. Samples: 271118428. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:36,878][25689] Avg episode reward: [(0, '-50.731')] [2022-07-09 13:16:38,479][26022] Updated weights on worker 0-0, policy_version 264773 (0.00086) [2022-07-09 13:16:40,231][26022] Updated weights on worker 0-0, policy_version 264783 (0.00092) [2022-07-09 13:16:41,963][25689] Fps is (10 sec: 5646.3, 60 sec: 5637.5, 300 sec: 5660.7). Total num frames: 271145984. Throughput: 0: 5884.8. Samples: 271152350. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:41,963][25689] Avg episode reward: [(0, '-51.160')] [2022-07-09 13:16:42,248][26022] Updated weights on worker 0-0, policy_version 264793 (0.00093) [2022-07-09 13:16:43,878][26022] Updated weights on worker 0-0, policy_version 264803 (0.00089) [2022-07-09 13:16:45,915][26022] Updated weights on worker 0-0, policy_version 264813 (0.00084) [2022-07-09 13:16:46,994][25689] Fps is (10 sec: 5566.7, 60 sec: 5638.9, 300 sec: 5664.2). Total num frames: 271174656. Throughput: 0: 5030.1. Samples: 271169280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:46,994][25689] Avg episode reward: [(0, '-50.862')] [2022-07-09 13:16:47,505][26022] Updated weights on worker 0-0, policy_version 264823 (0.00080) [2022-07-09 13:16:49,318][26022] Updated weights on worker 0-0, policy_version 264833 (0.00087) [2022-07-09 13:16:51,187][26022] Updated weights on worker 0-0, policy_version 264843 (0.00087) [2022-07-09 13:16:52,017][25689] Fps is (10 sec: 5907.1, 60 sec: 5654.3, 300 sec: 5667.3). Total num frames: 271205376. Throughput: 0: 5892.2. Samples: 271203696. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:52,017][25689] Avg episode reward: [(0, '-50.539')] [2022-07-09 13:16:52,926][26022] Updated weights on worker 0-0, policy_version 264853 (0.00087) [2022-07-09 13:16:54,546][26022] Updated weights on worker 0-0, policy_version 264863 (0.00089) [2022-07-09 13:16:56,385][26022] Updated weights on worker 0-0, policy_version 264873 (0.00101) [2022-07-09 13:16:57,031][25689] Fps is (10 sec: 5611.3, 60 sec: 5621.9, 300 sec: 5657.6). Total num frames: 271230976. Throughput: 0: 5944.5. Samples: 271238234. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:16:57,031][25689] Avg episode reward: [(0, '-51.398')] [2022-07-09 13:16:58,326][26022] Updated weights on worker 0-0, policy_version 264883 (0.00087) [2022-07-09 13:16:58,547][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:16:58,556][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000264884_271241216.pth [2022-07-09 13:16:58,558][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000262891_269200384.pth [2022-07-09 13:16:59,951][26022] Updated weights on worker 0-0, policy_version 264893 (0.00092) [2022-07-09 13:17:02,121][25689] Fps is (10 sec: 5472.5, 60 sec: 5651.4, 300 sec: 5666.3). Total num frames: 271260672. Throughput: 0: 5108.5. Samples: 271255332. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:17:02,122][25689] Avg episode reward: [(0, '-51.391')] [2022-07-09 13:17:02,135][26022] Updated weights on worker 0-0, policy_version 264903 (0.00088) [2022-07-09 13:17:04,165][26022] Updated weights on worker 0-0, policy_version 264913 (0.00087) [2022-07-09 13:17:05,862][26022] Updated weights on worker 0-0, policy_version 264923 (0.00080) [2022-07-09 13:17:07,209][25689] Fps is (10 sec: 5633.8, 60 sec: 5627.3, 300 sec: 5664.8). Total num frames: 271288320. Throughput: 0: 5847.5. Samples: 271287490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:17:07,210][25689] Avg episode reward: [(0, '-50.418')] [2022-07-09 13:17:07,733][26022] Updated weights on worker 0-0, policy_version 264933 (0.00086) [2022-07-09 13:17:09,419][26022] Updated weights on worker 0-0, policy_version 264943 (0.00092) [2022-07-09 13:17:11,233][26022] Updated weights on worker 0-0, policy_version 264953 (0.00084) [2022-07-09 13:17:12,299][25689] Fps is (10 sec: 5634.5, 60 sec: 5654.7, 300 sec: 5666.6). Total num frames: 271318016. Throughput: 0: 5816.9. Samples: 271321672. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:17:12,299][25689] Avg episode reward: [(0, '-49.282')] [2022-07-09 13:17:13,179][26022] Updated weights on worker 0-0, policy_version 264963 (0.00086) [2022-07-09 13:17:14,807][26022] Updated weights on worker 0-0, policy_version 264973 (0.00092) [2022-07-09 13:17:16,758][26022] Updated weights on worker 0-0, policy_version 264983 (0.00087) [2022-07-09 13:17:17,324][25689] Fps is (10 sec: 5669.4, 60 sec: 5653.9, 300 sec: 5660.5). Total num frames: 271345664. Throughput: 0: 4936.6. Samples: 271338414. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:17:17,324][25689] Avg episode reward: [(0, '-49.202')] [2022-07-09 13:17:18,442][26022] Updated weights on worker 0-0, policy_version 264993 (0.00084) [2022-07-09 13:17:20,174][26022] Updated weights on worker 0-0, policy_version 265003 (0.00090) [2022-07-09 13:17:22,154][26022] Updated weights on worker 0-0, policy_version 265013 (0.00427) [2022-07-09 13:17:22,455][25689] Fps is (10 sec: 5646.1, 60 sec: 5651.9, 300 sec: 5661.7). Total num frames: 271375360. Throughput: 0: 5762.3. Samples: 271372502. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:17:22,455][25689] Avg episode reward: [(0, '-48.926')] [2022-07-09 13:17:23,920][26022] Updated weights on worker 0-0, policy_version 265023 (0.00084) [2022-07-09 13:17:25,623][26022] Updated weights on worker 0-0, policy_version 265033 (0.00083) [2022-07-09 13:17:27,373][26022] Updated weights on worker 0-0, policy_version 265043 (0.00087) [2022-07-09 13:17:27,479][25689] Fps is (10 sec: 5747.5, 60 sec: 5650.3, 300 sec: 5668.4). Total num frames: 271404032. Throughput: 0: 5886.3. Samples: 271406806. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 13:17:27,480][25689] Avg episode reward: [(0, '-48.718')] [2022-07-09 13:17:29,346][26022] Updated weights on worker 0-0, policy_version 265053 (0.00086) [2022-07-09 13:17:30,839][26022] Updated weights on worker 0-0, policy_version 265063 (0.00080) [2022-07-09 13:17:32,520][25689] Fps is (10 sec: 5494.0, 60 sec: 5633.3, 300 sec: 5653.9). Total num frames: 271430656. Throughput: 0: 5915.5. Samples: 271441292. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:17:32,522][25689] Avg episode reward: [(0, '-49.054')] [2022-07-09 13:17:33,096][26022] Updated weights on worker 0-0, policy_version 265073 (0.00052) [2022-07-09 13:17:34,403][26022] Updated weights on worker 0-0, policy_version 265083 (0.00084) [2022-07-09 13:17:36,348][26022] Updated weights on worker 0-0, policy_version 265093 (0.00087) [2022-07-09 13:17:37,531][25689] Fps is (10 sec: 5806.7, 60 sec: 5672.5, 300 sec: 5668.7). Total num frames: 271462400. Throughput: 0: 5951.8. Samples: 271458684. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:17:37,535][25689] Avg episode reward: [(0, '-49.396')] [2022-07-09 13:17:38,314][26022] Updated weights on worker 0-0, policy_version 265103 (0.00091) [2022-07-09 13:17:39,927][26022] Updated weights on worker 0-0, policy_version 265113 (0.00085) [2022-07-09 13:17:41,842][26022] Updated weights on worker 0-0, policy_version 265123 (0.00092) [2022-07-09 13:17:42,585][25689] Fps is (10 sec: 5900.7, 60 sec: 5675.5, 300 sec: 5657.7). Total num frames: 271490048. Throughput: 0: 5978.0. Samples: 271492840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:17:42,586][25689] Avg episode reward: [(0, '-49.998')] [2022-07-09 13:17:43,748][26022] Updated weights on worker 0-0, policy_version 265133 (0.00088) [2022-07-09 13:17:45,460][26022] Updated weights on worker 0-0, policy_version 265143 (0.00098) [2022-07-09 13:17:47,378][26022] Updated weights on worker 0-0, policy_version 265153 (0.00092) [2022-07-09 13:17:47,663][25689] Fps is (10 sec: 5457.8, 60 sec: 5654.3, 300 sec: 5659.9). Total num frames: 271517696. Throughput: 0: 5945.0. Samples: 271526796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:17:47,665][25689] Avg episode reward: [(0, '-50.505')] [2022-07-09 13:17:48,851][26022] Updated weights on worker 0-0, policy_version 265163 (0.00086) [2022-07-09 13:17:51,004][26022] Updated weights on worker 0-0, policy_version 265173 (0.00087) [2022-07-09 13:17:52,519][26022] Updated weights on worker 0-0, policy_version 265183 (0.00083) [2022-07-09 13:17:52,699][25689] Fps is (10 sec: 5771.2, 60 sec: 5653.1, 300 sec: 5663.0). Total num frames: 271548416. Throughput: 0: 5090.7. Samples: 271544016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:17:52,701][25689] Avg episode reward: [(0, '-50.549')] [2022-07-09 13:17:54,594][26022] Updated weights on worker 0-0, policy_version 265193 (0.00081) [2022-07-09 13:17:55,990][26022] Updated weights on worker 0-0, policy_version 265203 (0.00092) [2022-07-09 13:17:57,704][25689] Fps is (10 sec: 5710.6, 60 sec: 5670.7, 300 sec: 5654.2). Total num frames: 271575040. Throughput: 0: 5924.7. Samples: 271578204. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:17:57,706][25689] Avg episode reward: [(0, '-50.514')] [2022-07-09 13:17:58,227][26022] Updated weights on worker 0-0, policy_version 265213 (0.00079) [2022-07-09 13:17:59,468][26022] Updated weights on worker 0-0, policy_version 265223 (0.00088) [2022-07-09 13:18:01,890][26022] Updated weights on worker 0-0, policy_version 265233 (0.00097) [2022-07-09 13:18:02,771][25689] Fps is (10 sec: 5388.2, 60 sec: 5639.2, 300 sec: 5660.4). Total num frames: 271602688. Throughput: 0: 5852.2. Samples: 271610972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:02,772][25689] Avg episode reward: [(0, '-49.730')] [2022-07-09 13:18:03,691][26022] Updated weights on worker 0-0, policy_version 265243 (0.00086) [2022-07-09 13:18:05,656][26022] Updated weights on worker 0-0, policy_version 265253 (0.00096) [2022-07-09 13:18:07,387][26022] Updated weights on worker 0-0, policy_version 265263 (0.00082) [2022-07-09 13:18:07,775][25689] Fps is (10 sec: 5592.6, 60 sec: 5663.9, 300 sec: 5657.2). Total num frames: 271631360. Throughput: 0: 5006.4. Samples: 271627486. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:07,776][25689] Avg episode reward: [(0, '-49.034')] [2022-07-09 13:18:09,355][26022] Updated weights on worker 0-0, policy_version 265273 (0.00089) [2022-07-09 13:18:10,988][26022] Updated weights on worker 0-0, policy_version 265283 (0.00081) [2022-07-09 13:18:12,794][25689] Fps is (10 sec: 5619.1, 60 sec: 5636.6, 300 sec: 5657.5). Total num frames: 271659008. Throughput: 0: 5847.3. Samples: 271661520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:12,795][25689] Avg episode reward: [(0, '-48.386')] [2022-07-09 13:18:13,042][26022] Updated weights on worker 0-0, policy_version 265293 (0.00087) [2022-07-09 13:18:14,466][26022] Updated weights on worker 0-0, policy_version 265303 (0.00088) [2022-07-09 13:18:16,731][26022] Updated weights on worker 0-0, policy_version 265313 (0.00084) [2022-07-09 13:18:17,840][25689] Fps is (10 sec: 5697.3, 60 sec: 5668.6, 300 sec: 5661.1). Total num frames: 271688704. Throughput: 0: 5812.8. Samples: 271695248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:17,840][25689] Avg episode reward: [(0, '-47.306')] [2022-07-09 13:18:18,252][26022] Updated weights on worker 0-0, policy_version 265323 (0.00093) [2022-07-09 13:18:20,225][26022] Updated weights on worker 0-0, policy_version 265333 (0.00083) [2022-07-09 13:18:21,830][26022] Updated weights on worker 0-0, policy_version 265343 (0.00097) [2022-07-09 13:18:22,914][25689] Fps is (10 sec: 5565.2, 60 sec: 5623.1, 300 sec: 5653.2). Total num frames: 271715328. Throughput: 0: 5018.2. Samples: 271712054. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:22,915][25689] Avg episode reward: [(0, '-48.388')] [2022-07-09 13:18:23,754][26022] Updated weights on worker 0-0, policy_version 265353 (0.00084) [2022-07-09 13:18:25,532][26022] Updated weights on worker 0-0, policy_version 265363 (0.00093) [2022-07-09 13:18:27,585][26022] Updated weights on worker 0-0, policy_version 265373 (0.00085) [2022-07-09 13:18:27,919][25689] Fps is (10 sec: 5486.3, 60 sec: 5624.9, 300 sec: 5656.7). Total num frames: 271744000. Throughput: 0: 5882.5. Samples: 271745984. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:27,919][25689] Avg episode reward: [(0, '-47.693')] [2022-07-09 13:18:29,280][26022] Updated weights on worker 0-0, policy_version 265383 (0.00086) [2022-07-09 13:18:31,107][26022] Updated weights on worker 0-0, policy_version 265393 (0.00090) [2022-07-09 13:18:32,795][26022] Updated weights on worker 0-0, policy_version 265403 (0.00088) [2022-07-09 13:18:32,922][25689] Fps is (10 sec: 5730.0, 60 sec: 5662.3, 300 sec: 5653.6). Total num frames: 271772672. Throughput: 0: 5875.8. Samples: 271779786. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:32,922][25689] Avg episode reward: [(0, '-47.848')] [2022-07-09 13:18:34,558][26022] Updated weights on worker 0-0, policy_version 265413 (0.00088) [2022-07-09 13:18:36,660][26022] Updated weights on worker 0-0, policy_version 265423 (0.00055) [2022-07-09 13:18:37,959][25689] Fps is (10 sec: 5813.5, 60 sec: 5626.0, 300 sec: 5662.0). Total num frames: 271802368. Throughput: 0: 5051.5. Samples: 271796880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:37,959][25689] Avg episode reward: [(0, '-47.720')] [2022-07-09 13:18:38,090][26022] Updated weights on worker 0-0, policy_version 265433 (0.00097) [2022-07-09 13:18:40,214][26022] Updated weights on worker 0-0, policy_version 265443 (0.00128) [2022-07-09 13:18:41,547][26022] Updated weights on worker 0-0, policy_version 265453 (0.00096) [2022-07-09 13:18:42,999][25689] Fps is (10 sec: 5588.9, 60 sec: 5610.4, 300 sec: 5652.8). Total num frames: 271828992. Throughput: 0: 5940.2. Samples: 271831362. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:42,999][25689] Avg episode reward: [(0, '-47.525')] [2022-07-09 13:18:43,590][26022] Updated weights on worker 0-0, policy_version 265463 (0.00088) [2022-07-09 13:18:45,474][26022] Updated weights on worker 0-0, policy_version 265473 (0.00084) [2022-07-09 13:18:47,154][26022] Updated weights on worker 0-0, policy_version 265483 (0.00077) [2022-07-09 13:18:48,011][25689] Fps is (10 sec: 5603.0, 60 sec: 5650.4, 300 sec: 5656.6). Total num frames: 271858688. Throughput: 0: 5952.3. Samples: 271865576. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:48,011][25689] Avg episode reward: [(0, '-47.821')] [2022-07-09 13:18:49,013][26022] Updated weights on worker 0-0, policy_version 265493 (0.00085) [2022-07-09 13:18:50,665][26022] Updated weights on worker 0-0, policy_version 265503 (0.00431) [2022-07-09 13:18:52,559][26022] Updated weights on worker 0-0, policy_version 265513 (0.00084) [2022-07-09 13:18:53,041][25689] Fps is (10 sec: 5812.4, 60 sec: 5617.0, 300 sec: 5652.9). Total num frames: 271887360. Throughput: 0: 5116.1. Samples: 271882718. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:53,041][25689] Avg episode reward: [(0, '-47.270')] [2022-07-09 13:18:54,554][26022] Updated weights on worker 0-0, policy_version 265523 (0.00083) [2022-07-09 13:18:56,236][26022] Updated weights on worker 0-0, policy_version 265533 (0.00081) [2022-07-09 13:18:58,060][25689] Fps is (10 sec: 5706.4, 60 sec: 5649.7, 300 sec: 5656.8). Total num frames: 271916032. Throughput: 0: 5961.3. Samples: 271916708. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:18:58,060][25689] Avg episode reward: [(0, '-48.277')] [2022-07-09 13:18:58,066][26022] Updated weights on worker 0-0, policy_version 265543 (0.00081) [2022-07-09 13:18:58,772][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:18:58,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000265547_271920128.pth [2022-07-09 13:18:58,789][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000263556_269881344.pth [2022-07-09 13:18:59,844][26022] Updated weights on worker 0-0, policy_version 265553 (0.00082) [2022-07-09 13:19:01,826][26022] Updated weights on worker 0-0, policy_version 265563 (0.00094) [2022-07-09 13:19:03,152][25689] Fps is (10 sec: 5468.9, 60 sec: 5630.4, 300 sec: 5653.3). Total num frames: 271942656. Throughput: 0: 5816.9. Samples: 271948590. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:03,152][25689] Avg episode reward: [(0, '-48.467')] [2022-07-09 13:19:03,786][26022] Updated weights on worker 0-0, policy_version 265573 (0.00085) [2022-07-09 13:19:05,827][26022] Updated weights on worker 0-0, policy_version 265583 (0.00084) [2022-07-09 13:19:07,197][26022] Updated weights on worker 0-0, policy_version 265593 (0.00096) [2022-07-09 13:19:08,215][25689] Fps is (10 sec: 5344.3, 60 sec: 5607.9, 300 sec: 5649.4). Total num frames: 271970304. Throughput: 0: 4956.0. Samples: 271965708. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:08,216][25689] Avg episode reward: [(0, '-49.004')] [2022-07-09 13:19:09,384][26022] Updated weights on worker 0-0, policy_version 265603 (0.00094) [2022-07-09 13:19:10,908][26022] Updated weights on worker 0-0, policy_version 265613 (0.00090) [2022-07-09 13:19:12,729][26022] Updated weights on worker 0-0, policy_version 265623 (0.00087) [2022-07-09 13:19:13,216][25689] Fps is (10 sec: 5596.1, 60 sec: 5626.6, 300 sec: 5649.4). Total num frames: 271998976. Throughput: 0: 5812.3. Samples: 271999982. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:13,216][25689] Avg episode reward: [(0, '-48.424')] [2022-07-09 13:19:14,593][26022] Updated weights on worker 0-0, policy_version 265633 (0.00106) [2022-07-09 13:19:16,353][26022] Updated weights on worker 0-0, policy_version 265643 (0.00092) [2022-07-09 13:19:18,276][25689] Fps is (10 sec: 5699.6, 60 sec: 5608.3, 300 sec: 5652.9). Total num frames: 272027648. Throughput: 0: 5795.1. Samples: 272033864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:18,278][25689] Avg episode reward: [(0, '-48.705')] [2022-07-09 13:19:18,383][26022] Updated weights on worker 0-0, policy_version 265653 (0.00091) [2022-07-09 13:19:20,094][26022] Updated weights on worker 0-0, policy_version 265663 (0.00094) [2022-07-09 13:19:21,901][26022] Updated weights on worker 0-0, policy_version 265673 (0.00089) [2022-07-09 13:19:23,383][25689] Fps is (10 sec: 5640.0, 60 sec: 5639.1, 300 sec: 5647.5). Total num frames: 272056320. Throughput: 0: 5049.4. Samples: 272050748. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:23,384][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 13:19:23,937][26022] Updated weights on worker 0-0, policy_version 265683 (0.00088) [2022-07-09 13:19:25,332][26022] Updated weights on worker 0-0, policy_version 265693 (0.00089) [2022-07-09 13:19:27,654][26022] Updated weights on worker 0-0, policy_version 265703 (0.00092) [2022-07-09 13:19:28,399][25689] Fps is (10 sec: 5867.3, 60 sec: 5672.0, 300 sec: 5657.9). Total num frames: 272087040. Throughput: 0: 5901.0. Samples: 272084812. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:28,399][25689] Avg episode reward: [(0, '-48.460')] [2022-07-09 13:19:28,934][26022] Updated weights on worker 0-0, policy_version 265713 (0.00084) [2022-07-09 13:19:30,977][26022] Updated weights on worker 0-0, policy_version 265723 (0.00090) [2022-07-09 13:19:32,851][26022] Updated weights on worker 0-0, policy_version 265733 (0.00055) [2022-07-09 13:19:33,407][25689] Fps is (10 sec: 5618.9, 60 sec: 5620.7, 300 sec: 5647.7). Total num frames: 272112640. Throughput: 0: 5884.6. Samples: 272118796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:33,407][25689] Avg episode reward: [(0, '-47.685')] [2022-07-09 13:19:34,362][26022] Updated weights on worker 0-0, policy_version 265743 (0.00106) [2022-07-09 13:19:36,417][26022] Updated weights on worker 0-0, policy_version 265753 (0.00088) [2022-07-09 13:19:38,127][26022] Updated weights on worker 0-0, policy_version 265763 (0.00088) [2022-07-09 13:19:38,416][25689] Fps is (10 sec: 5519.8, 60 sec: 5623.3, 300 sec: 5648.8). Total num frames: 272142336. Throughput: 0: 5065.5. Samples: 272135884. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:38,417][25689] Avg episode reward: [(0, '-47.211')] [2022-07-09 13:19:40,162][26022] Updated weights on worker 0-0, policy_version 265773 (0.00094) [2022-07-09 13:19:41,869][26022] Updated weights on worker 0-0, policy_version 265783 (0.00091) [2022-07-09 13:19:43,489][25689] Fps is (10 sec: 5890.7, 60 sec: 5671.0, 300 sec: 5647.7). Total num frames: 272172032. Throughput: 0: 5922.5. Samples: 272169824. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:43,489][25689] Avg episode reward: [(0, '-47.899')] [2022-07-09 13:19:43,494][26022] Updated weights on worker 0-0, policy_version 265793 (0.00083) [2022-07-09 13:19:45,504][26022] Updated weights on worker 0-0, policy_version 265803 (0.00081) [2022-07-09 13:19:47,116][26022] Updated weights on worker 0-0, policy_version 265813 (0.00086) [2022-07-09 13:19:48,564][25689] Fps is (10 sec: 5550.1, 60 sec: 5614.4, 300 sec: 5639.7). Total num frames: 272198656. Throughput: 0: 5929.5. Samples: 272204382. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 13:19:48,564][25689] Avg episode reward: [(0, '-47.740')] [2022-07-09 13:19:49,091][26022] Updated weights on worker 0-0, policy_version 265823 (0.00077) [2022-07-09 13:19:50,708][26022] Updated weights on worker 0-0, policy_version 265833 (0.00088) [2022-07-09 13:19:52,601][26022] Updated weights on worker 0-0, policy_version 265843 (0.00086) [2022-07-09 13:19:53,569][25689] Fps is (10 sec: 5587.2, 60 sec: 5633.6, 300 sec: 5643.2). Total num frames: 272228352. Throughput: 0: 5099.7. Samples: 272221618. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:19:53,570][25689] Avg episode reward: [(0, '-47.920')] [2022-07-09 13:19:54,331][26022] Updated weights on worker 0-0, policy_version 265853 (0.00085) [2022-07-09 13:19:56,171][26022] Updated weights on worker 0-0, policy_version 265863 (0.00096) [2022-07-09 13:19:57,924][26022] Updated weights on worker 0-0, policy_version 265873 (0.00091) [2022-07-09 13:19:58,583][25689] Fps is (10 sec: 5825.5, 60 sec: 5634.1, 300 sec: 5640.4). Total num frames: 272257024. Throughput: 0: 5951.4. Samples: 272255906. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:19:58,585][25689] Avg episode reward: [(0, '-47.847')] [2022-07-09 13:19:59,903][26022] Updated weights on worker 0-0, policy_version 265883 (0.00087) [2022-07-09 13:20:01,795][26022] Updated weights on worker 0-0, policy_version 265893 (0.00094) [2022-07-09 13:20:03,704][25689] Fps is (10 sec: 5355.0, 60 sec: 5614.5, 300 sec: 5638.2). Total num frames: 272282624. Throughput: 0: 5834.3. Samples: 272287766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:03,705][25689] Avg episode reward: [(0, '-47.703')] [2022-07-09 13:20:03,935][26022] Updated weights on worker 0-0, policy_version 265903 (0.00089) [2022-07-09 13:20:05,411][26022] Updated weights on worker 0-0, policy_version 265913 (0.00079) [2022-07-09 13:20:07,539][26022] Updated weights on worker 0-0, policy_version 265923 (0.00093) [2022-07-09 13:20:08,764][25689] Fps is (10 sec: 5531.8, 60 sec: 5665.5, 300 sec: 5648.0). Total num frames: 272313344. Throughput: 0: 5822.3. Samples: 272321994. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:08,765][25689] Avg episode reward: [(0, '-49.062')] [2022-07-09 13:20:08,993][26022] Updated weights on worker 0-0, policy_version 265933 (0.00081) [2022-07-09 13:20:10,998][26022] Updated weights on worker 0-0, policy_version 265943 (0.00105) [2022-07-09 13:20:12,604][26022] Updated weights on worker 0-0, policy_version 265953 (0.00088) [2022-07-09 13:20:13,831][25689] Fps is (10 sec: 5763.9, 60 sec: 5642.5, 300 sec: 5640.2). Total num frames: 272340992. Throughput: 0: 5802.2. Samples: 272339178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:13,833][25689] Avg episode reward: [(0, '-48.788')] [2022-07-09 13:20:14,596][26022] Updated weights on worker 0-0, policy_version 265963 (0.00404) [2022-07-09 13:20:16,187][26022] Updated weights on worker 0-0, policy_version 265973 (0.00097) [2022-07-09 13:20:18,235][26022] Updated weights on worker 0-0, policy_version 265983 (0.00085) [2022-07-09 13:20:18,846][25689] Fps is (10 sec: 5586.3, 60 sec: 5646.6, 300 sec: 5641.3). Total num frames: 272369664. Throughput: 0: 5797.1. Samples: 272373372. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:18,847][25689] Avg episode reward: [(0, '-49.815')] [2022-07-09 13:20:20,051][26022] Updated weights on worker 0-0, policy_version 265993 (0.00085) [2022-07-09 13:20:21,783][26022] Updated weights on worker 0-0, policy_version 266003 (0.00094) [2022-07-09 13:20:23,399][26022] Updated weights on worker 0-0, policy_version 266013 (0.00093) [2022-07-09 13:20:23,897][25689] Fps is (10 sec: 5798.2, 60 sec: 5668.8, 300 sec: 5651.0). Total num frames: 272399360. Throughput: 0: 5939.6. Samples: 272407704. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:23,899][25689] Avg episode reward: [(0, '-49.863')] [2022-07-09 13:20:25,374][26022] Updated weights on worker 0-0, policy_version 266023 (0.00087) [2022-07-09 13:20:27,124][26022] Updated weights on worker 0-0, policy_version 266033 (0.00083) [2022-07-09 13:20:28,921][25689] Fps is (10 sec: 5691.5, 60 sec: 5617.2, 300 sec: 5643.8). Total num frames: 272427008. Throughput: 0: 5105.5. Samples: 272424906. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:28,922][25689] Avg episode reward: [(0, '-50.129')] [2022-07-09 13:20:28,935][26022] Updated weights on worker 0-0, policy_version 266043 (0.00092) [2022-07-09 13:20:30,534][26022] Updated weights on worker 0-0, policy_version 266053 (0.00086) [2022-07-09 13:20:32,539][26022] Updated weights on worker 0-0, policy_version 266063 (0.00082) [2022-07-09 13:20:33,958][25689] Fps is (10 sec: 5801.4, 60 sec: 5699.1, 300 sec: 5647.0). Total num frames: 272457728. Throughput: 0: 5969.1. Samples: 272459322. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:33,959][25689] Avg episode reward: [(0, '-49.732')] [2022-07-09 13:20:33,993][26022] Updated weights on worker 0-0, policy_version 266073 (0.00086) [2022-07-09 13:20:36,147][26022] Updated weights on worker 0-0, policy_version 266083 (0.00085) [2022-07-09 13:20:37,776][26022] Updated weights on worker 0-0, policy_version 266093 (0.00084) [2022-07-09 13:20:38,979][25689] Fps is (10 sec: 5701.6, 60 sec: 5647.3, 300 sec: 5644.9). Total num frames: 272484352. Throughput: 0: 5966.1. Samples: 272493486. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:38,981][25689] Avg episode reward: [(0, '-49.178')] [2022-07-09 13:20:39,707][26022] Updated weights on worker 0-0, policy_version 266103 (0.00055) [2022-07-09 13:20:41,343][26022] Updated weights on worker 0-0, policy_version 266113 (0.00088) [2022-07-09 13:20:43,287][26022] Updated weights on worker 0-0, policy_version 266123 (0.00089) [2022-07-09 13:20:44,043][25689] Fps is (10 sec: 5584.9, 60 sec: 5648.1, 300 sec: 5648.0). Total num frames: 272514048. Throughput: 0: 5115.0. Samples: 272510748. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:44,043][25689] Avg episode reward: [(0, '-49.125')] [2022-07-09 13:20:44,875][26022] Updated weights on worker 0-0, policy_version 266133 (0.00088) [2022-07-09 13:20:46,869][26022] Updated weights on worker 0-0, policy_version 266143 (0.00086) [2022-07-09 13:20:48,541][26022] Updated weights on worker 0-0, policy_version 266153 (0.00082) [2022-07-09 13:20:49,063][25689] Fps is (10 sec: 5889.5, 60 sec: 5704.0, 300 sec: 5647.7). Total num frames: 272543744. Throughput: 0: 5960.5. Samples: 272544962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:49,064][25689] Avg episode reward: [(0, '-49.034')] [2022-07-09 13:20:50,485][26022] Updated weights on worker 0-0, policy_version 266163 (0.00086) [2022-07-09 13:20:52,344][26022] Updated weights on worker 0-0, policy_version 266173 (0.00100) [2022-07-09 13:20:54,087][25689] Fps is (10 sec: 5708.8, 60 sec: 5668.4, 300 sec: 5647.8). Total num frames: 272571392. Throughput: 0: 5945.0. Samples: 272578990. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:54,088][25689] Avg episode reward: [(0, '-48.939')] [2022-07-09 13:20:54,103][26022] Updated weights on worker 0-0, policy_version 266183 (0.00086) [2022-07-09 13:20:55,790][26022] Updated weights on worker 0-0, policy_version 266193 (0.00092) [2022-07-09 13:20:57,580][26022] Updated weights on worker 0-0, policy_version 266203 (0.00084) [2022-07-09 13:20:58,958][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:20:58,974][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000266210_272599040.pth [2022-07-09 13:20:58,978][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000264222_270563328.pth [2022-07-09 13:20:59,139][25689] Fps is (10 sec: 5691.4, 60 sec: 5681.8, 300 sec: 5654.5). Total num frames: 272601088. Throughput: 0: 5093.5. Samples: 272596172. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:20:59,140][25689] Avg episode reward: [(0, '-49.678')] [2022-07-09 13:20:59,476][26022] Updated weights on worker 0-0, policy_version 266213 (0.00086) [2022-07-09 13:21:01,314][26022] Updated weights on worker 0-0, policy_version 266223 (0.00085) [2022-07-09 13:21:03,420][26022] Updated weights on worker 0-0, policy_version 266233 (0.00092) [2022-07-09 13:21:04,194][25689] Fps is (10 sec: 5471.4, 60 sec: 5688.0, 300 sec: 5643.4). Total num frames: 272626688. Throughput: 0: 5827.9. Samples: 272628188. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:04,194][25689] Avg episode reward: [(0, '-50.397')] [2022-07-09 13:21:05,069][26022] Updated weights on worker 0-0, policy_version 266243 (0.00091) [2022-07-09 13:21:07,039][26022] Updated weights on worker 0-0, policy_version 266253 (0.00097) [2022-07-09 13:21:08,647][26022] Updated weights on worker 0-0, policy_version 266263 (0.00094) [2022-07-09 13:21:09,252][25689] Fps is (10 sec: 5366.5, 60 sec: 5654.3, 300 sec: 5646.1). Total num frames: 272655360. Throughput: 0: 5816.8. Samples: 272662396. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:09,253][25689] Avg episode reward: [(0, '-50.320')] [2022-07-09 13:21:10,730][26022] Updated weights on worker 0-0, policy_version 266273 (0.00099) [2022-07-09 13:21:12,212][26022] Updated weights on worker 0-0, policy_version 266283 (0.00089) [2022-07-09 13:21:14,216][26022] Updated weights on worker 0-0, policy_version 266293 (0.01016) [2022-07-09 13:21:14,266][25689] Fps is (10 sec: 5693.5, 60 sec: 5676.2, 300 sec: 5649.6). Total num frames: 272684032. Throughput: 0: 4984.6. Samples: 272679570. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:14,266][25689] Avg episode reward: [(0, '-50.435')] [2022-07-09 13:21:15,894][26022] Updated weights on worker 0-0, policy_version 266303 (0.00084) [2022-07-09 13:21:17,904][26022] Updated weights on worker 0-0, policy_version 266313 (0.00093) [2022-07-09 13:21:19,315][25689] Fps is (10 sec: 5800.5, 60 sec: 5690.0, 300 sec: 5650.8). Total num frames: 272713728. Throughput: 0: 5842.7. Samples: 272714052. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:19,315][25689] Avg episode reward: [(0, '-50.649')] [2022-07-09 13:21:19,474][26022] Updated weights on worker 0-0, policy_version 266323 (0.00082) [2022-07-09 13:21:21,397][26022] Updated weights on worker 0-0, policy_version 266333 (0.00092) [2022-07-09 13:21:23,143][26022] Updated weights on worker 0-0, policy_version 266343 (0.00087) [2022-07-09 13:21:24,363][25689] Fps is (10 sec: 5678.9, 60 sec: 5656.3, 300 sec: 5646.5). Total num frames: 272741376. Throughput: 0: 5961.5. Samples: 272748428. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:24,364][25689] Avg episode reward: [(0, '-51.646')] [2022-07-09 13:21:24,902][26022] Updated weights on worker 0-0, policy_version 266353 (0.00090) [2022-07-09 13:21:26,764][26022] Updated weights on worker 0-0, policy_version 266363 (0.00085) [2022-07-09 13:21:28,533][26022] Updated weights on worker 0-0, policy_version 266373 (0.00085) [2022-07-09 13:21:29,415][25689] Fps is (10 sec: 5677.3, 60 sec: 5687.6, 300 sec: 5653.2). Total num frames: 272771072. Throughput: 0: 5101.1. Samples: 272765250. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:29,416][25689] Avg episode reward: [(0, '-51.042')] [2022-07-09 13:21:30,552][26022] Updated weights on worker 0-0, policy_version 266383 (0.00117) [2022-07-09 13:21:32,175][26022] Updated weights on worker 0-0, policy_version 266393 (0.00085) [2022-07-09 13:21:33,957][26022] Updated weights on worker 0-0, policy_version 266403 (0.00085) [2022-07-09 13:21:34,430][25689] Fps is (10 sec: 5696.3, 60 sec: 5638.8, 300 sec: 5647.3). Total num frames: 272798720. Throughput: 0: 5936.7. Samples: 272799282. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:34,431][25689] Avg episode reward: [(0, '-51.063')] [2022-07-09 13:21:35,784][26022] Updated weights on worker 0-0, policy_version 266413 (0.00086) [2022-07-09 13:21:37,418][26022] Updated weights on worker 0-0, policy_version 266423 (0.00087) [2022-07-09 13:21:39,449][25689] Fps is (10 sec: 5510.8, 60 sec: 5655.9, 300 sec: 5648.6). Total num frames: 272826368. Throughput: 0: 5939.2. Samples: 272833636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:39,450][25689] Avg episode reward: [(0, '-50.581')] [2022-07-09 13:21:39,493][26022] Updated weights on worker 0-0, policy_version 266433 (0.00091) [2022-07-09 13:21:41,224][26022] Updated weights on worker 0-0, policy_version 266443 (0.00088) [2022-07-09 13:21:42,918][26022] Updated weights on worker 0-0, policy_version 266453 (0.00091) [2022-07-09 13:21:44,565][25689] Fps is (10 sec: 5658.0, 60 sec: 5651.0, 300 sec: 5650.4). Total num frames: 272856064. Throughput: 0: 5063.8. Samples: 272850726. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:44,566][25689] Avg episode reward: [(0, '-50.995')] [2022-07-09 13:21:44,683][26022] Updated weights on worker 0-0, policy_version 266463 (0.00093) [2022-07-09 13:21:46,561][26022] Updated weights on worker 0-0, policy_version 266473 (0.00091) [2022-07-09 13:21:48,301][26022] Updated weights on worker 0-0, policy_version 266483 (0.00089) [2022-07-09 13:21:49,570][25689] Fps is (10 sec: 5767.4, 60 sec: 5635.6, 300 sec: 5643.9). Total num frames: 272884736. Throughput: 0: 5940.4. Samples: 272884976. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:49,570][25689] Avg episode reward: [(0, '-50.423')] [2022-07-09 13:21:50,069][26022] Updated weights on worker 0-0, policy_version 266493 (0.00096) [2022-07-09 13:21:51,783][26022] Updated weights on worker 0-0, policy_version 266503 (0.00092) [2022-07-09 13:21:53,827][26022] Updated weights on worker 0-0, policy_version 266513 (0.00089) [2022-07-09 13:21:54,602][25689] Fps is (10 sec: 5815.6, 60 sec: 5668.7, 300 sec: 5657.3). Total num frames: 272914432. Throughput: 0: 5955.5. Samples: 272919414. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:54,603][25689] Avg episode reward: [(0, '-48.899')] [2022-07-09 13:21:55,576][26022] Updated weights on worker 0-0, policy_version 266523 (0.00090) [2022-07-09 13:21:57,247][26022] Updated weights on worker 0-0, policy_version 266533 (0.00088) [2022-07-09 13:21:59,199][26022] Updated weights on worker 0-0, policy_version 266543 (0.00087) [2022-07-09 13:21:59,637][25689] Fps is (10 sec: 5594.3, 60 sec: 5619.5, 300 sec: 5648.0). Total num frames: 272941056. Throughput: 0: 5089.2. Samples: 272936376. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:21:59,638][25689] Avg episode reward: [(0, '-49.180')] [2022-07-09 13:22:00,822][26022] Updated weights on worker 0-0, policy_version 266553 (0.00087) [2022-07-09 13:22:03,264][26022] Updated weights on worker 0-0, policy_version 266563 (0.00092) [2022-07-09 13:22:04,683][25689] Fps is (10 sec: 5383.6, 60 sec: 5654.2, 300 sec: 5648.9). Total num frames: 272968704. Throughput: 0: 5834.1. Samples: 272968094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:22:04,683][25689] Avg episode reward: [(0, '-49.620')] [2022-07-09 13:22:04,932][26022] Updated weights on worker 0-0, policy_version 266573 (0.00090) [2022-07-09 13:22:06,909][26022] Updated weights on worker 0-0, policy_version 266583 (0.00085) [2022-07-09 13:22:08,451][26022] Updated weights on worker 0-0, policy_version 266593 (0.00097) [2022-07-09 13:22:09,697][25689] Fps is (10 sec: 5598.4, 60 sec: 5658.3, 300 sec: 5646.8). Total num frames: 272997376. Throughput: 0: 5832.3. Samples: 273002364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:22:09,697][25689] Avg episode reward: [(0, '-48.651')] [2022-07-09 13:22:10,339][26022] Updated weights on worker 0-0, policy_version 266603 (0.00090) [2022-07-09 13:22:12,067][26022] Updated weights on worker 0-0, policy_version 266613 (0.00090) [2022-07-09 13:22:14,047][26022] Updated weights on worker 0-0, policy_version 266623 (0.00100) [2022-07-09 13:22:14,721][25689] Fps is (10 sec: 5610.4, 60 sec: 5640.4, 300 sec: 5646.9). Total num frames: 273025024. Throughput: 0: 4979.3. Samples: 273019594. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:14,722][25689] Avg episode reward: [(0, '-48.832')] [2022-07-09 13:22:15,671][26022] Updated weights on worker 0-0, policy_version 266633 (0.00094) [2022-07-09 13:22:17,519][26022] Updated weights on worker 0-0, policy_version 266643 (0.00085) [2022-07-09 13:22:19,140][26022] Updated weights on worker 0-0, policy_version 266653 (0.00081) [2022-07-09 13:22:19,727][25689] Fps is (10 sec: 5717.5, 60 sec: 5644.4, 300 sec: 5649.2). Total num frames: 273054720. Throughput: 0: 5844.6. Samples: 273053790. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:19,727][25689] Avg episode reward: [(0, '-49.165')] [2022-07-09 13:22:21,206][26022] Updated weights on worker 0-0, policy_version 266663 (0.00091) [2022-07-09 13:22:23,093][26022] Updated weights on worker 0-0, policy_version 266673 (0.00087) [2022-07-09 13:22:24,749][26022] Updated weights on worker 0-0, policy_version 266683 (0.00091) [2022-07-09 13:22:24,784][25689] Fps is (10 sec: 5800.6, 60 sec: 5660.6, 300 sec: 5648.6). Total num frames: 273083392. Throughput: 0: 5960.6. Samples: 273087908. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:24,784][25689] Avg episode reward: [(0, '-49.814')] [2022-07-09 13:22:26,629][26022] Updated weights on worker 0-0, policy_version 266693 (0.00081) [2022-07-09 13:22:28,450][26022] Updated weights on worker 0-0, policy_version 266703 (0.00081) [2022-07-09 13:22:29,799][25689] Fps is (10 sec: 5591.5, 60 sec: 5630.1, 300 sec: 5652.5). Total num frames: 273111040. Throughput: 0: 5114.3. Samples: 273105172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:29,800][25689] Avg episode reward: [(0, '-50.388')] [2022-07-09 13:22:30,174][26022] Updated weights on worker 0-0, policy_version 266713 (0.00087) [2022-07-09 13:22:31,905][26022] Updated weights on worker 0-0, policy_version 266723 (0.00084) [2022-07-09 13:22:33,728][26022] Updated weights on worker 0-0, policy_version 266733 (0.00104) [2022-07-09 13:22:34,804][25689] Fps is (10 sec: 5722.7, 60 sec: 5665.0, 300 sec: 5645.8). Total num frames: 273140736. Throughput: 0: 5971.4. Samples: 273139516. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:34,805][25689] Avg episode reward: [(0, '-50.240')] [2022-07-09 13:22:35,455][26022] Updated weights on worker 0-0, policy_version 266743 (0.00090) [2022-07-09 13:22:37,385][26022] Updated weights on worker 0-0, policy_version 266753 (0.00085) [2022-07-09 13:22:38,972][26022] Updated weights on worker 0-0, policy_version 266763 (0.00087) [2022-07-09 13:22:39,806][25689] Fps is (10 sec: 5730.5, 60 sec: 5666.6, 300 sec: 5646.8). Total num frames: 273168384. Throughput: 0: 5979.2. Samples: 273173848. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:39,806][25689] Avg episode reward: [(0, '-50.124')] [2022-07-09 13:22:40,860][26022] Updated weights on worker 0-0, policy_version 266773 (0.00067) [2022-07-09 13:22:42,672][26022] Updated weights on worker 0-0, policy_version 266783 (0.00711) [2022-07-09 13:22:44,297][26022] Updated weights on worker 0-0, policy_version 266793 (0.00087) [2022-07-09 13:22:44,947][25689] Fps is (10 sec: 5754.4, 60 sec: 5681.2, 300 sec: 5655.9). Total num frames: 273199104. Throughput: 0: 5114.5. Samples: 273191034. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:44,948][25689] Avg episode reward: [(0, '-48.909')] [2022-07-09 13:22:46,413][26022] Updated weights on worker 0-0, policy_version 266803 (0.00088) [2022-07-09 13:22:47,946][26022] Updated weights on worker 0-0, policy_version 266813 (0.00086) [2022-07-09 13:22:49,962][25689] Fps is (10 sec: 5646.3, 60 sec: 5646.3, 300 sec: 5642.5). Total num frames: 273225728. Throughput: 0: 5963.0. Samples: 273225404. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:49,963][25689] Avg episode reward: [(0, '-48.935')] [2022-07-09 13:22:50,006][26022] Updated weights on worker 0-0, policy_version 266823 (0.00087) [2022-07-09 13:22:51,629][26022] Updated weights on worker 0-0, policy_version 266833 (0.00089) [2022-07-09 13:22:53,435][26022] Updated weights on worker 0-0, policy_version 266843 (0.00085) [2022-07-09 13:22:54,985][25689] Fps is (10 sec: 5610.7, 60 sec: 5647.1, 300 sec: 5652.5). Total num frames: 273255424. Throughput: 0: 5950.4. Samples: 273259602. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:22:54,986][25689] Avg episode reward: [(0, '-49.736')] [2022-07-09 13:22:55,279][26022] Updated weights on worker 0-0, policy_version 266853 (0.00096) [2022-07-09 13:22:57,074][26022] Updated weights on worker 0-0, policy_version 266863 (0.00089) [2022-07-09 13:22:58,562][26022] Updated weights on worker 0-0, policy_version 266873 (0.00087) [2022-07-09 13:22:59,209][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:22:59,225][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000266875_273280000.pth [2022-07-09 13:22:59,229][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000264884_271241216.pth [2022-07-09 13:23:00,013][25689] Fps is (10 sec: 5807.3, 60 sec: 5681.8, 300 sec: 5656.7). Total num frames: 273284096. Throughput: 0: 5956.8. Samples: 273294216. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:00,013][25689] Avg episode reward: [(0, '-49.152')] [2022-07-09 13:23:00,711][26022] Updated weights on worker 0-0, policy_version 266883 (0.00092) [2022-07-09 13:23:02,803][26022] Updated weights on worker 0-0, policy_version 266893 (0.00086) [2022-07-09 13:23:04,438][26022] Updated weights on worker 0-0, policy_version 266903 (0.00088) [2022-07-09 13:23:05,060][25689] Fps is (10 sec: 5590.3, 60 sec: 5681.6, 300 sec: 5652.4). Total num frames: 273311744. Throughput: 0: 5879.1. Samples: 273309278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:05,060][25689] Avg episode reward: [(0, '-48.923')] [2022-07-09 13:23:06,372][26022] Updated weights on worker 0-0, policy_version 266913 (0.00086) [2022-07-09 13:23:08,076][26022] Updated weights on worker 0-0, policy_version 266923 (0.00107) [2022-07-09 13:23:09,872][26022] Updated weights on worker 0-0, policy_version 266933 (0.00085) [2022-07-09 13:23:10,137][25689] Fps is (10 sec: 5461.6, 60 sec: 5658.7, 300 sec: 5651.3). Total num frames: 273339392. Throughput: 0: 5865.1. Samples: 273343734. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:10,138][25689] Avg episode reward: [(0, '-49.338')] [2022-07-09 13:23:11,614][26022] Updated weights on worker 0-0, policy_version 266943 (0.00086) [2022-07-09 13:23:13,394][26022] Updated weights on worker 0-0, policy_version 266953 (0.00092) [2022-07-09 13:23:15,150][25689] Fps is (10 sec: 5683.0, 60 sec: 5693.7, 300 sec: 5652.0). Total num frames: 273369088. Throughput: 0: 5877.0. Samples: 273378112. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:15,151][25689] Avg episode reward: [(0, '-49.536')] [2022-07-09 13:23:15,423][26022] Updated weights on worker 0-0, policy_version 266963 (0.00095) [2022-07-09 13:23:17,255][26022] Updated weights on worker 0-0, policy_version 266973 (0.00088) [2022-07-09 13:23:18,829][26022] Updated weights on worker 0-0, policy_version 266983 (0.00089) [2022-07-09 13:23:20,198][25689] Fps is (10 sec: 5598.3, 60 sec: 5638.9, 300 sec: 5652.5). Total num frames: 273395712. Throughput: 0: 4983.8. Samples: 273394812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:20,198][25689] Avg episode reward: [(0, '-50.440')] [2022-07-09 13:23:20,866][26022] Updated weights on worker 0-0, policy_version 266993 (0.00083) [2022-07-09 13:23:22,199][26022] Updated weights on worker 0-0, policy_version 267003 (0.00087) [2022-07-09 13:23:24,438][26022] Updated weights on worker 0-0, policy_version 267013 (0.00085) [2022-07-09 13:23:25,279][25689] Fps is (10 sec: 5863.6, 60 sec: 5704.4, 300 sec: 5664.8). Total num frames: 273428480. Throughput: 0: 5926.0. Samples: 273429098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:25,280][25689] Avg episode reward: [(0, '-50.200')] [2022-07-09 13:23:26,172][26022] Updated weights on worker 0-0, policy_version 267023 (0.00084) [2022-07-09 13:23:27,886][26022] Updated weights on worker 0-0, policy_version 267033 (0.00093) [2022-07-09 13:23:29,832][26022] Updated weights on worker 0-0, policy_version 267043 (0.00093) [2022-07-09 13:23:30,360][25689] Fps is (10 sec: 5743.5, 60 sec: 5664.4, 300 sec: 5653.0). Total num frames: 273454080. Throughput: 0: 5904.2. Samples: 273463134. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:30,361][25689] Avg episode reward: [(0, '-50.228')] [2022-07-09 13:23:31,464][26022] Updated weights on worker 0-0, policy_version 267053 (0.00087) [2022-07-09 13:23:33,508][26022] Updated weights on worker 0-0, policy_version 267063 (0.00086) [2022-07-09 13:23:35,083][26022] Updated weights on worker 0-0, policy_version 267073 (0.00080) [2022-07-09 13:23:35,373][25689] Fps is (10 sec: 5376.7, 60 sec: 5646.7, 300 sec: 5650.0). Total num frames: 273482752. Throughput: 0: 5035.4. Samples: 273479942. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:35,373][25689] Avg episode reward: [(0, '-49.701')] [2022-07-09 13:23:37,002][26022] Updated weights on worker 0-0, policy_version 267083 (0.00092) [2022-07-09 13:23:38,966][26022] Updated weights on worker 0-0, policy_version 267093 (0.00084) [2022-07-09 13:23:40,375][25689] Fps is (10 sec: 5827.9, 60 sec: 5680.5, 300 sec: 5661.0). Total num frames: 273512448. Throughput: 0: 5912.4. Samples: 273514112. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:40,376][25689] Avg episode reward: [(0, '-49.960')] [2022-07-09 13:23:40,506][26022] Updated weights on worker 0-0, policy_version 267103 (0.00096) [2022-07-09 13:23:42,554][26022] Updated weights on worker 0-0, policy_version 267113 (0.00089) [2022-07-09 13:23:44,106][26022] Updated weights on worker 0-0, policy_version 267123 (0.00085) [2022-07-09 13:23:45,480][25689] Fps is (10 sec: 5673.5, 60 sec: 5633.2, 300 sec: 5652.4). Total num frames: 273540096. Throughput: 0: 5902.2. Samples: 273548330. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:45,481][25689] Avg episode reward: [(0, '-49.522')] [2022-07-09 13:23:45,995][26022] Updated weights on worker 0-0, policy_version 267133 (0.00086) [2022-07-09 13:23:47,816][26022] Updated weights on worker 0-0, policy_version 267143 (0.00084) [2022-07-09 13:23:49,571][26022] Updated weights on worker 0-0, policy_version 267153 (0.00083) [2022-07-09 13:23:50,502][25689] Fps is (10 sec: 5561.7, 60 sec: 5666.3, 300 sec: 5652.5). Total num frames: 273568768. Throughput: 0: 5074.4. Samples: 273565342. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:50,502][25689] Avg episode reward: [(0, '-49.735')] [2022-07-09 13:23:51,466][26022] Updated weights on worker 0-0, policy_version 267163 (0.00086) [2022-07-09 13:23:53,253][26022] Updated weights on worker 0-0, policy_version 267173 (0.00093) [2022-07-09 13:23:54,955][26022] Updated weights on worker 0-0, policy_version 267183 (0.00088) [2022-07-09 13:23:55,537][25689] Fps is (10 sec: 5701.8, 60 sec: 5648.3, 300 sec: 5652.2). Total num frames: 273597440. Throughput: 0: 5914.9. Samples: 273599214. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:23:55,538][25689] Avg episode reward: [(0, '-49.481')] [2022-07-09 13:23:56,845][26022] Updated weights on worker 0-0, policy_version 267193 (0.01151) [2022-07-09 13:23:58,549][26022] Updated weights on worker 0-0, policy_version 267203 (0.00085) [2022-07-09 13:24:00,355][26022] Updated weights on worker 0-0, policy_version 267213 (0.00085) [2022-07-09 13:24:00,552][25689] Fps is (10 sec: 5807.8, 60 sec: 5666.4, 300 sec: 5664.0). Total num frames: 273627136. Throughput: 0: 5931.6. Samples: 273633790. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:00,552][25689] Avg episode reward: [(0, '-48.430')] [2022-07-09 13:24:02,659][26022] Updated weights on worker 0-0, policy_version 267223 (0.00087) [2022-07-09 13:24:04,227][26022] Updated weights on worker 0-0, policy_version 267233 (0.00080) [2022-07-09 13:24:05,618][25689] Fps is (10 sec: 5586.9, 60 sec: 5647.7, 300 sec: 5660.5). Total num frames: 273653760. Throughput: 0: 4983.1. Samples: 273648680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:05,619][25689] Avg episode reward: [(0, '-49.440')] [2022-07-09 13:24:06,144][26022] Updated weights on worker 0-0, policy_version 267243 (0.00092) [2022-07-09 13:24:08,029][26022] Updated weights on worker 0-0, policy_version 267253 (0.00087) [2022-07-09 13:24:09,758][26022] Updated weights on worker 0-0, policy_version 267263 (0.00083) [2022-07-09 13:24:10,627][25689] Fps is (10 sec: 5488.2, 60 sec: 5671.0, 300 sec: 5660.4). Total num frames: 273682432. Throughput: 0: 5856.3. Samples: 273683204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:10,629][25689] Avg episode reward: [(0, '-49.317')] [2022-07-09 13:24:11,423][26022] Updated weights on worker 0-0, policy_version 267273 (0.00090) [2022-07-09 13:24:13,257][26022] Updated weights on worker 0-0, policy_version 267283 (0.00087) [2022-07-09 13:24:15,117][26022] Updated weights on worker 0-0, policy_version 267293 (0.00088) [2022-07-09 13:24:15,633][25689] Fps is (10 sec: 5725.9, 60 sec: 5654.7, 300 sec: 5661.4). Total num frames: 273711104. Throughput: 0: 5899.2. Samples: 273717762. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:15,634][25689] Avg episode reward: [(0, '-48.988')] [2022-07-09 13:24:16,876][26022] Updated weights on worker 0-0, policy_version 267303 (0.00094) [2022-07-09 13:24:18,749][26022] Updated weights on worker 0-0, policy_version 267313 (0.00087) [2022-07-09 13:24:20,456][26022] Updated weights on worker 0-0, policy_version 267323 (0.00092) [2022-07-09 13:24:20,679][25689] Fps is (10 sec: 5705.1, 60 sec: 5688.8, 300 sec: 5662.6). Total num frames: 273739776. Throughput: 0: 5012.6. Samples: 273734678. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:20,679][25689] Avg episode reward: [(0, '-48.678')] [2022-07-09 13:24:22,299][26022] Updated weights on worker 0-0, policy_version 267333 (0.00089) [2022-07-09 13:24:24,293][26022] Updated weights on worker 0-0, policy_version 267343 (0.00091) [2022-07-09 13:24:25,745][25689] Fps is (10 sec: 5671.2, 60 sec: 5622.5, 300 sec: 5654.7). Total num frames: 273768448. Throughput: 0: 5963.9. Samples: 273768712. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:25,745][25689] Avg episode reward: [(0, '-49.433')] [2022-07-09 13:24:25,828][26022] Updated weights on worker 0-0, policy_version 267353 (0.00091) [2022-07-09 13:24:27,870][26022] Updated weights on worker 0-0, policy_version 267363 (0.00087) [2022-07-09 13:24:29,395][26022] Updated weights on worker 0-0, policy_version 267373 (0.00090) [2022-07-09 13:24:30,756][25689] Fps is (10 sec: 5588.7, 60 sec: 5662.9, 300 sec: 5661.5). Total num frames: 273796096. Throughput: 0: 5955.3. Samples: 273803078. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:30,757][25689] Avg episode reward: [(0, '-50.359')] [2022-07-09 13:24:31,370][26022] Updated weights on worker 0-0, policy_version 267383 (0.00089) [2022-07-09 13:24:33,146][26022] Updated weights on worker 0-0, policy_version 267393 (0.00625) [2022-07-09 13:24:34,885][26022] Updated weights on worker 0-0, policy_version 267403 (0.00085) [2022-07-09 13:24:35,783][25689] Fps is (10 sec: 5610.5, 60 sec: 5661.6, 300 sec: 5657.8). Total num frames: 273824768. Throughput: 0: 5074.8. Samples: 273820018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:24:35,783][25689] Avg episode reward: [(0, '-50.482')] [2022-07-09 13:24:36,738][26022] Updated weights on worker 0-0, policy_version 267413 (0.00082) [2022-07-09 13:24:38,606][26022] Updated weights on worker 0-0, policy_version 267423 (0.00080) [2022-07-09 13:24:40,277][26022] Updated weights on worker 0-0, policy_version 267433 (0.00080) [2022-07-09 13:24:40,786][25689] Fps is (10 sec: 5819.5, 60 sec: 5661.5, 300 sec: 5659.1). Total num frames: 273854464. Throughput: 0: 5945.1. Samples: 273854218. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:24:40,787][25689] Avg episode reward: [(0, '-49.408')] [2022-07-09 13:24:42,110][26022] Updated weights on worker 0-0, policy_version 267443 (0.00084) [2022-07-09 13:24:43,847][26022] Updated weights on worker 0-0, policy_version 267453 (0.00089) [2022-07-09 13:24:45,692][26022] Updated weights on worker 0-0, policy_version 267463 (0.00092) [2022-07-09 13:24:45,908][25689] Fps is (10 sec: 5764.4, 60 sec: 5676.8, 300 sec: 5665.1). Total num frames: 273883136. Throughput: 0: 5930.1. Samples: 273888286. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:24:45,909][25689] Avg episode reward: [(0, '-50.261')] [2022-07-09 13:24:47,567][26022] Updated weights on worker 0-0, policy_version 267473 (0.00088) [2022-07-09 13:24:49,263][26022] Updated weights on worker 0-0, policy_version 267483 (0.00082) [2022-07-09 13:24:50,923][25689] Fps is (10 sec: 5555.7, 60 sec: 5660.5, 300 sec: 5658.0). Total num frames: 273910784. Throughput: 0: 5071.1. Samples: 273905348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:24:50,927][25689] Avg episode reward: [(0, '-49.861')] [2022-07-09 13:24:51,131][26022] Updated weights on worker 0-0, policy_version 267493 (0.00087) [2022-07-09 13:24:53,095][26022] Updated weights on worker 0-0, policy_version 267503 (0.00086) [2022-07-09 13:24:54,724][26022] Updated weights on worker 0-0, policy_version 267513 (0.00100) [2022-07-09 13:24:55,962][25689] Fps is (10 sec: 5601.9, 60 sec: 5660.2, 300 sec: 5657.5). Total num frames: 273939456. Throughput: 0: 5916.6. Samples: 273939412. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:24:55,964][25689] Avg episode reward: [(0, '-48.593')] [2022-07-09 13:24:56,740][26022] Updated weights on worker 0-0, policy_version 267523 (0.00091) [2022-07-09 13:24:58,395][26022] Updated weights on worker 0-0, policy_version 267533 (0.00084) [2022-07-09 13:24:59,280][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:24:59,297][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000267537_273957888.pth [2022-07-09 13:24:59,297][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000265547_271920128.pth [2022-07-09 13:25:00,098][26022] Updated weights on worker 0-0, policy_version 267543 (0.00091) [2022-07-09 13:25:01,045][25689] Fps is (10 sec: 5766.4, 60 sec: 5653.8, 300 sec: 5672.0). Total num frames: 273969152. Throughput: 0: 5893.3. Samples: 273973612. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:01,046][25689] Avg episode reward: [(0, '-48.116')] [2022-07-09 13:25:02,618][26022] Updated weights on worker 0-0, policy_version 267553 (0.00083) [2022-07-09 13:25:04,113][26022] Updated weights on worker 0-0, policy_version 267563 (0.00084) [2022-07-09 13:25:06,041][26022] Updated weights on worker 0-0, policy_version 267573 (0.00092) [2022-07-09 13:25:06,175][25689] Fps is (10 sec: 5514.5, 60 sec: 5647.8, 300 sec: 5656.9). Total num frames: 273995776. Throughput: 0: 4935.5. Samples: 273988314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:06,176][25689] Avg episode reward: [(0, '-48.951')] [2022-07-09 13:25:07,874][26022] Updated weights on worker 0-0, policy_version 267583 (0.00087) [2022-07-09 13:25:09,615][26022] Updated weights on worker 0-0, policy_version 267593 (0.00080) [2022-07-09 13:25:11,242][25689] Fps is (10 sec: 5322.4, 60 sec: 5625.6, 300 sec: 5656.9). Total num frames: 274023424. Throughput: 0: 5762.9. Samples: 274022444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:11,243][25689] Avg episode reward: [(0, '-49.977')] [2022-07-09 13:25:11,486][26022] Updated weights on worker 0-0, policy_version 267603 (0.00085) [2022-07-09 13:25:13,191][26022] Updated weights on worker 0-0, policy_version 267613 (0.00089) [2022-07-09 13:25:15,035][26022] Updated weights on worker 0-0, policy_version 267623 (0.00099) [2022-07-09 13:25:16,296][25689] Fps is (10 sec: 5666.0, 60 sec: 5638.0, 300 sec: 5659.6). Total num frames: 274053120. Throughput: 0: 5755.4. Samples: 274056442. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:16,297][25689] Avg episode reward: [(0, '-48.889')] [2022-07-09 13:25:16,991][26022] Updated weights on worker 0-0, policy_version 267633 (0.00090) [2022-07-09 13:25:18,691][26022] Updated weights on worker 0-0, policy_version 267643 (0.00093) [2022-07-09 13:25:20,508][26022] Updated weights on worker 0-0, policy_version 267653 (0.00093) [2022-07-09 13:25:21,319][25689] Fps is (10 sec: 5792.5, 60 sec: 5640.1, 300 sec: 5656.7). Total num frames: 274081792. Throughput: 0: 5765.1. Samples: 274090490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:21,320][25689] Avg episode reward: [(0, '-49.744')] [2022-07-09 13:25:22,303][26022] Updated weights on worker 0-0, policy_version 267663 (0.00098) [2022-07-09 13:25:24,137][26022] Updated weights on worker 0-0, policy_version 267673 (0.00084) [2022-07-09 13:25:25,859][26022] Updated weights on worker 0-0, policy_version 267683 (0.00095) [2022-07-09 13:25:26,430][25689] Fps is (10 sec: 5557.8, 60 sec: 5619.0, 300 sec: 5655.1). Total num frames: 274109440. Throughput: 0: 5889.6. Samples: 274107604. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:26,430][25689] Avg episode reward: [(0, '-50.188')] [2022-07-09 13:25:27,693][26022] Updated weights on worker 0-0, policy_version 267693 (0.00086) [2022-07-09 13:25:29,502][26022] Updated weights on worker 0-0, policy_version 267703 (0.00097) [2022-07-09 13:25:31,312][26022] Updated weights on worker 0-0, policy_version 267713 (0.00092) [2022-07-09 13:25:31,454][25689] Fps is (10 sec: 5557.1, 60 sec: 5634.8, 300 sec: 5648.4). Total num frames: 274138112. Throughput: 0: 5894.3. Samples: 274141576. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:31,455][25689] Avg episode reward: [(0, '-51.128')] [2022-07-09 13:25:33,102][26022] Updated weights on worker 0-0, policy_version 267723 (0.00092) [2022-07-09 13:25:34,829][26022] Updated weights on worker 0-0, policy_version 267733 (0.00080) [2022-07-09 13:25:36,484][25689] Fps is (10 sec: 5703.3, 60 sec: 5634.4, 300 sec: 5655.1). Total num frames: 274166784. Throughput: 0: 5907.9. Samples: 274175712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:36,485][25689] Avg episode reward: [(0, '-50.686')] [2022-07-09 13:25:36,755][26022] Updated weights on worker 0-0, policy_version 267743 (0.00095) [2022-07-09 13:25:38,598][26022] Updated weights on worker 0-0, policy_version 267753 (0.00079) [2022-07-09 13:25:40,417][26022] Updated weights on worker 0-0, policy_version 267763 (0.00053) [2022-07-09 13:25:41,503][25689] Fps is (10 sec: 5706.1, 60 sec: 5616.1, 300 sec: 5652.5). Total num frames: 274195456. Throughput: 0: 5069.0. Samples: 274192806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:41,504][25689] Avg episode reward: [(0, '-50.488')] [2022-07-09 13:25:42,299][26022] Updated weights on worker 0-0, policy_version 267773 (0.00083) [2022-07-09 13:25:43,995][26022] Updated weights on worker 0-0, policy_version 267783 (0.00084) [2022-07-09 13:25:45,861][26022] Updated weights on worker 0-0, policy_version 267793 (0.00091) [2022-07-09 13:25:46,642][25689] Fps is (10 sec: 5846.8, 60 sec: 5648.2, 300 sec: 5653.7). Total num frames: 274226176. Throughput: 0: 5913.1. Samples: 274227124. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:46,643][25689] Avg episode reward: [(0, '-50.116')] [2022-07-09 13:25:47,583][26022] Updated weights on worker 0-0, policy_version 267803 (0.00086) [2022-07-09 13:25:49,234][26022] Updated weights on worker 0-0, policy_version 267813 (0.00086) [2022-07-09 13:25:51,060][26022] Updated weights on worker 0-0, policy_version 267823 (0.00085) [2022-07-09 13:25:51,701][25689] Fps is (10 sec: 5723.6, 60 sec: 5644.2, 300 sec: 5653.1). Total num frames: 274253824. Throughput: 0: 5934.3. Samples: 274261730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:51,701][25689] Avg episode reward: [(0, '-48.898')] [2022-07-09 13:25:52,969][26022] Updated weights on worker 0-0, policy_version 267833 (0.00093) [2022-07-09 13:25:54,672][26022] Updated weights on worker 0-0, policy_version 267843 (0.00084) [2022-07-09 13:25:56,538][26022] Updated weights on worker 0-0, policy_version 267853 (0.00102) [2022-07-09 13:25:56,702][25689] Fps is (10 sec: 5496.7, 60 sec: 5630.8, 300 sec: 5647.1). Total num frames: 274281472. Throughput: 0: 5105.5. Samples: 274278936. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:25:56,703][25689] Avg episode reward: [(0, '-48.199')] [2022-07-09 13:25:58,191][26022] Updated weights on worker 0-0, policy_version 267863 (0.00082) [2022-07-09 13:26:00,037][26022] Updated weights on worker 0-0, policy_version 267873 (0.00081) [2022-07-09 13:26:01,715][25689] Fps is (10 sec: 5624.3, 60 sec: 5620.5, 300 sec: 5658.3). Total num frames: 274310144. Throughput: 0: 5935.0. Samples: 274312762. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:01,715][25689] Avg episode reward: [(0, '-48.153')] [2022-07-09 13:26:02,219][26022] Updated weights on worker 0-0, policy_version 267883 (0.00093) [2022-07-09 13:26:04,162][26022] Updated weights on worker 0-0, policy_version 267893 (0.00087) [2022-07-09 13:26:05,816][26022] Updated weights on worker 0-0, policy_version 267903 (0.00092) [2022-07-09 13:26:06,876][25689] Fps is (10 sec: 5636.2, 60 sec: 5651.3, 300 sec: 5656.3). Total num frames: 274338816. Throughput: 0: 5818.9. Samples: 274344864. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:06,877][25689] Avg episode reward: [(0, '-48.102')] [2022-07-09 13:26:07,870][26022] Updated weights on worker 0-0, policy_version 267913 (0.00086) [2022-07-09 13:26:09,208][26022] Updated weights on worker 0-0, policy_version 267923 (0.00085) [2022-07-09 13:26:11,423][26022] Updated weights on worker 0-0, policy_version 267933 (0.00085) [2022-07-09 13:26:11,934][25689] Fps is (10 sec: 5611.1, 60 sec: 5669.0, 300 sec: 5655.5). Total num frames: 274367488. Throughput: 0: 4960.0. Samples: 274362088. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:11,935][25689] Avg episode reward: [(0, '-48.117')] [2022-07-09 13:26:12,780][26022] Updated weights on worker 0-0, policy_version 267943 (0.00091) [2022-07-09 13:26:14,936][26022] Updated weights on worker 0-0, policy_version 267953 (0.00087) [2022-07-09 13:26:16,667][26022] Updated weights on worker 0-0, policy_version 267963 (0.00086) [2022-07-09 13:26:16,977][25689] Fps is (10 sec: 5575.7, 60 sec: 5636.3, 300 sec: 5648.7). Total num frames: 274395136. Throughput: 0: 5787.8. Samples: 274396284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:16,978][25689] Avg episode reward: [(0, '-48.851')] [2022-07-09 13:26:18,431][26022] Updated weights on worker 0-0, policy_version 267973 (0.00084) [2022-07-09 13:26:20,333][26022] Updated weights on worker 0-0, policy_version 267983 (0.00089) [2022-07-09 13:26:21,996][25689] Fps is (10 sec: 5597.5, 60 sec: 5636.6, 300 sec: 5652.7). Total num frames: 274423808. Throughput: 0: 5812.0. Samples: 274430636. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:21,999][25689] Avg episode reward: [(0, '-48.674')] [2022-07-09 13:26:22,075][26022] Updated weights on worker 0-0, policy_version 267993 (0.00085) [2022-07-09 13:26:23,814][26022] Updated weights on worker 0-0, policy_version 268003 (0.00088) [2022-07-09 13:26:25,549][26022] Updated weights on worker 0-0, policy_version 268013 (0.01117) [2022-07-09 13:26:27,091][25689] Fps is (10 sec: 5872.5, 60 sec: 5688.8, 300 sec: 5655.3). Total num frames: 274454528. Throughput: 0: 5093.0. Samples: 274447816. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:27,091][25689] Avg episode reward: [(0, '-49.786')] [2022-07-09 13:26:27,209][26022] Updated weights on worker 0-0, policy_version 268023 (0.00086) [2022-07-09 13:26:29,190][26022] Updated weights on worker 0-0, policy_version 268033 (0.00089) [2022-07-09 13:26:31,094][26022] Updated weights on worker 0-0, policy_version 268043 (0.00081) [2022-07-09 13:26:32,094][25689] Fps is (10 sec: 5678.4, 60 sec: 5656.9, 300 sec: 5652.1). Total num frames: 274481152. Throughput: 0: 5950.8. Samples: 274482056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:32,095][25689] Avg episode reward: [(0, '-49.735')] [2022-07-09 13:26:32,791][26022] Updated weights on worker 0-0, policy_version 268053 (0.00090) [2022-07-09 13:26:34,530][26022] Updated weights on worker 0-0, policy_version 268063 (0.00085) [2022-07-09 13:26:36,310][26022] Updated weights on worker 0-0, policy_version 268073 (0.00090) [2022-07-09 13:26:37,134][25689] Fps is (10 sec: 5607.6, 60 sec: 5672.9, 300 sec: 5658.6). Total num frames: 274510848. Throughput: 0: 5952.2. Samples: 274516262. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:37,136][25689] Avg episode reward: [(0, '-49.654')] [2022-07-09 13:26:38,239][26022] Updated weights on worker 0-0, policy_version 268083 (0.00096) [2022-07-09 13:26:40,032][26022] Updated weights on worker 0-0, policy_version 268093 (0.00089) [2022-07-09 13:26:41,940][26022] Updated weights on worker 0-0, policy_version 268103 (0.00084) [2022-07-09 13:26:42,146][25689] Fps is (10 sec: 5704.8, 60 sec: 5656.7, 300 sec: 5653.7). Total num frames: 274538496. Throughput: 0: 5090.7. Samples: 274533216. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:42,147][25689] Avg episode reward: [(0, '-49.293')] [2022-07-09 13:26:43,640][26022] Updated weights on worker 0-0, policy_version 268113 (0.00087) [2022-07-09 13:26:45,528][26022] Updated weights on worker 0-0, policy_version 268123 (0.00096) [2022-07-09 13:26:47,223][25689] Fps is (10 sec: 5582.7, 60 sec: 5628.7, 300 sec: 5652.3). Total num frames: 274567168. Throughput: 0: 5935.6. Samples: 274567310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:47,223][25689] Avg episode reward: [(0, '-49.146')] [2022-07-09 13:26:47,281][26022] Updated weights on worker 0-0, policy_version 268133 (0.00091) [2022-07-09 13:26:49,112][26022] Updated weights on worker 0-0, policy_version 268143 (0.00084) [2022-07-09 13:26:50,867][26022] Updated weights on worker 0-0, policy_version 268153 (0.00091) [2022-07-09 13:26:52,283][25689] Fps is (10 sec: 5657.2, 60 sec: 5645.5, 300 sec: 5648.4). Total num frames: 274595840. Throughput: 0: 5914.3. Samples: 274601456. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:52,283][25689] Avg episode reward: [(0, '-49.385')] [2022-07-09 13:26:52,639][26022] Updated weights on worker 0-0, policy_version 268163 (0.00091) [2022-07-09 13:26:54,425][26022] Updated weights on worker 0-0, policy_version 268173 (0.00093) [2022-07-09 13:26:56,090][26022] Updated weights on worker 0-0, policy_version 268183 (0.00098) [2022-07-09 13:26:57,289][25689] Fps is (10 sec: 5696.6, 60 sec: 5661.9, 300 sec: 5655.8). Total num frames: 274624512. Throughput: 0: 5083.4. Samples: 274618716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 13:26:57,290][25689] Avg episode reward: [(0, '-48.787')] [2022-07-09 13:26:58,038][26022] Updated weights on worker 0-0, policy_version 268193 (0.00092) [2022-07-09 13:26:59,455][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:26:59,464][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000268201_274637824.pth [2022-07-09 13:26:59,464][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000266210_272599040.pth [2022-07-09 13:26:59,791][26022] Updated weights on worker 0-0, policy_version 268203 (0.00082) [2022-07-09 13:27:01,952][26022] Updated weights on worker 0-0, policy_version 268213 (0.00087) [2022-07-09 13:27:02,354][25689] Fps is (10 sec: 5490.6, 60 sec: 5623.3, 300 sec: 5652.0). Total num frames: 274651136. Throughput: 0: 5913.8. Samples: 274652720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:02,354][25689] Avg episode reward: [(0, '-49.033')] [2022-07-09 13:27:03,998][26022] Updated weights on worker 0-0, policy_version 268223 (0.00092) [2022-07-09 13:27:05,604][26022] Updated weights on worker 0-0, policy_version 268233 (0.00089) [2022-07-09 13:27:07,402][25689] Fps is (10 sec: 5468.0, 60 sec: 5633.9, 300 sec: 5651.3). Total num frames: 274679808. Throughput: 0: 5799.7. Samples: 274684342. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:07,402][25689] Avg episode reward: [(0, '-49.667')] [2022-07-09 13:27:07,523][26022] Updated weights on worker 0-0, policy_version 268243 (0.00084) [2022-07-09 13:27:09,199][26022] Updated weights on worker 0-0, policy_version 268253 (0.00083) [2022-07-09 13:27:11,140][26022] Updated weights on worker 0-0, policy_version 268263 (0.00085) [2022-07-09 13:27:12,454][25689] Fps is (10 sec: 5677.4, 60 sec: 5634.4, 300 sec: 5654.2). Total num frames: 274708480. Throughput: 0: 4958.7. Samples: 274701480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:12,455][25689] Avg episode reward: [(0, '-49.095')] [2022-07-09 13:27:12,948][26022] Updated weights on worker 0-0, policy_version 268273 (0.00079) [2022-07-09 13:27:14,667][26022] Updated weights on worker 0-0, policy_version 268283 (0.00095) [2022-07-09 13:27:16,447][26022] Updated weights on worker 0-0, policy_version 268293 (0.00090) [2022-07-09 13:27:17,463][25689] Fps is (10 sec: 5699.6, 60 sec: 5654.5, 300 sec: 5650.7). Total num frames: 274737152. Throughput: 0: 5794.8. Samples: 274735618. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:17,463][25689] Avg episode reward: [(0, '-49.155')] [2022-07-09 13:27:18,356][26022] Updated weights on worker 0-0, policy_version 268303 (0.00088) [2022-07-09 13:27:20,180][26022] Updated weights on worker 0-0, policy_version 268313 (0.00094) [2022-07-09 13:27:21,944][26022] Updated weights on worker 0-0, policy_version 268323 (0.00084) [2022-07-09 13:27:22,474][25689] Fps is (10 sec: 5723.0, 60 sec: 5655.2, 300 sec: 5651.6). Total num frames: 274765824. Throughput: 0: 5809.4. Samples: 274769608. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:22,475][25689] Avg episode reward: [(0, '-49.720')] [2022-07-09 13:27:23,603][26022] Updated weights on worker 0-0, policy_version 268333 (0.00081) [2022-07-09 13:27:25,611][26022] Updated weights on worker 0-0, policy_version 268343 (0.00094) [2022-07-09 13:27:27,375][26022] Updated weights on worker 0-0, policy_version 268353 (0.00087) [2022-07-09 13:27:27,542][25689] Fps is (10 sec: 5689.6, 60 sec: 5623.9, 300 sec: 5654.1). Total num frames: 274794496. Throughput: 0: 5925.4. Samples: 274803678. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:27,542][25689] Avg episode reward: [(0, '-49.473')] [2022-07-09 13:27:29,200][26022] Updated weights on worker 0-0, policy_version 268363 (0.00809) [2022-07-09 13:27:30,909][26022] Updated weights on worker 0-0, policy_version 268373 (0.00178) [2022-07-09 13:27:32,544][25689] Fps is (10 sec: 5592.8, 60 sec: 5640.9, 300 sec: 5647.2). Total num frames: 274822144. Throughput: 0: 5938.9. Samples: 274820792. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:32,545][25689] Avg episode reward: [(0, '-49.241')] [2022-07-09 13:27:32,800][26022] Updated weights on worker 0-0, policy_version 268383 (0.00085) [2022-07-09 13:27:34,444][26022] Updated weights on worker 0-0, policy_version 268393 (0.00089) [2022-07-09 13:27:36,327][26022] Updated weights on worker 0-0, policy_version 268403 (0.00116) [2022-07-09 13:27:37,548][25689] Fps is (10 sec: 5731.0, 60 sec: 5644.3, 300 sec: 5654.1). Total num frames: 274851840. Throughput: 0: 5949.5. Samples: 274855112. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:37,548][25689] Avg episode reward: [(0, '-49.499')] [2022-07-09 13:27:38,372][26022] Updated weights on worker 0-0, policy_version 268413 (0.00096) [2022-07-09 13:27:39,939][26022] Updated weights on worker 0-0, policy_version 268423 (0.00091) [2022-07-09 13:27:41,886][26022] Updated weights on worker 0-0, policy_version 268433 (0.00086) [2022-07-09 13:27:42,570][25689] Fps is (10 sec: 5821.7, 60 sec: 5660.3, 300 sec: 5649.5). Total num frames: 274880512. Throughput: 0: 5969.4. Samples: 274889570. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:42,571][25689] Avg episode reward: [(0, '-49.918')] [2022-07-09 13:27:43,478][26022] Updated weights on worker 0-0, policy_version 268443 (0.00087) [2022-07-09 13:27:45,366][26022] Updated weights on worker 0-0, policy_version 268453 (0.00094) [2022-07-09 13:27:47,307][26022] Updated weights on worker 0-0, policy_version 268463 (0.00088) [2022-07-09 13:27:47,611][25689] Fps is (10 sec: 5596.7, 60 sec: 5646.7, 300 sec: 5652.4). Total num frames: 274908160. Throughput: 0: 5129.5. Samples: 274906622. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:47,611][25689] Avg episode reward: [(0, '-49.499')] [2022-07-09 13:27:48,997][26022] Updated weights on worker 0-0, policy_version 268473 (0.00084) [2022-07-09 13:27:50,707][26022] Updated weights on worker 0-0, policy_version 268483 (0.00081) [2022-07-09 13:27:52,506][26022] Updated weights on worker 0-0, policy_version 268493 (0.00087) [2022-07-09 13:27:52,622][25689] Fps is (10 sec: 5603.0, 60 sec: 5651.2, 300 sec: 5649.2). Total num frames: 274936832. Throughput: 0: 5968.6. Samples: 274940628. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:52,623][25689] Avg episode reward: [(0, '-49.339')] [2022-07-09 13:27:54,334][26022] Updated weights on worker 0-0, policy_version 268503 (0.00089) [2022-07-09 13:27:56,130][26022] Updated weights on worker 0-0, policy_version 268513 (0.00091) [2022-07-09 13:27:57,623][25689] Fps is (10 sec: 5727.1, 60 sec: 5651.7, 300 sec: 5649.7). Total num frames: 274965504. Throughput: 0: 5961.4. Samples: 274974792. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:27:57,624][25689] Avg episode reward: [(0, '-47.510')] [2022-07-09 13:27:58,033][26022] Updated weights on worker 0-0, policy_version 268523 (0.00086) [2022-07-09 13:27:59,706][26022] Updated weights on worker 0-0, policy_version 268533 (0.00089) [2022-07-09 13:28:02,042][26022] Updated weights on worker 0-0, policy_version 268543 (0.00087) [2022-07-09 13:28:02,670][25689] Fps is (10 sec: 5401.4, 60 sec: 5636.5, 300 sec: 5642.8). Total num frames: 274991104. Throughput: 0: 5099.6. Samples: 274992070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:02,672][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 13:28:03,621][26022] Updated weights on worker 0-0, policy_version 268553 (0.00082) [2022-07-09 13:28:05,644][26022] Updated weights on worker 0-0, policy_version 268563 (0.00089) [2022-07-09 13:28:07,262][26022] Updated weights on worker 0-0, policy_version 268573 (0.00086) [2022-07-09 13:28:07,718][25689] Fps is (10 sec: 5477.6, 60 sec: 5653.4, 300 sec: 5650.3). Total num frames: 275020800. Throughput: 0: 5845.5. Samples: 275024160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:07,720][25689] Avg episode reward: [(0, '-47.735')] [2022-07-09 13:28:09,048][26022] Updated weights on worker 0-0, policy_version 268583 (0.00082) [2022-07-09 13:28:10,842][26022] Updated weights on worker 0-0, policy_version 268593 (0.00086) [2022-07-09 13:28:12,708][26022] Updated weights on worker 0-0, policy_version 268603 (0.00079) [2022-07-09 13:28:12,726][25689] Fps is (10 sec: 5804.3, 60 sec: 5657.6, 300 sec: 5646.9). Total num frames: 275049472. Throughput: 0: 5863.5. Samples: 275058506. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:12,728][25689] Avg episode reward: [(0, '-47.430')] [2022-07-09 13:28:14,395][26022] Updated weights on worker 0-0, policy_version 268613 (0.00085) [2022-07-09 13:28:16,280][26022] Updated weights on worker 0-0, policy_version 268623 (0.00089) [2022-07-09 13:28:17,739][25689] Fps is (10 sec: 5722.7, 60 sec: 5657.2, 300 sec: 5654.5). Total num frames: 275078144. Throughput: 0: 5015.9. Samples: 275075688. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:17,739][25689] Avg episode reward: [(0, '-47.742')] [2022-07-09 13:28:18,009][26022] Updated weights on worker 0-0, policy_version 268633 (0.00081) [2022-07-09 13:28:19,907][26022] Updated weights on worker 0-0, policy_version 268643 (0.00090) [2022-07-09 13:28:21,607][26022] Updated weights on worker 0-0, policy_version 268653 (0.00095) [2022-07-09 13:28:22,759][25689] Fps is (10 sec: 5715.4, 60 sec: 5656.3, 300 sec: 5641.8). Total num frames: 275106816. Throughput: 0: 5858.6. Samples: 275109766. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:22,760][25689] Avg episode reward: [(0, '-48.229')] [2022-07-09 13:28:23,468][26022] Updated weights on worker 0-0, policy_version 268663 (0.00087) [2022-07-09 13:28:25,346][26022] Updated weights on worker 0-0, policy_version 268673 (0.00092) [2022-07-09 13:28:27,187][26022] Updated weights on worker 0-0, policy_version 268683 (0.00088) [2022-07-09 13:28:27,828][25689] Fps is (10 sec: 5581.9, 60 sec: 5639.2, 300 sec: 5649.0). Total num frames: 275134464. Throughput: 0: 5949.7. Samples: 275143810. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:27,829][25689] Avg episode reward: [(0, '-49.458')] [2022-07-09 13:28:28,786][26022] Updated weights on worker 0-0, policy_version 268693 (0.00086) [2022-07-09 13:28:30,926][26022] Updated weights on worker 0-0, policy_version 268703 (0.00080) [2022-07-09 13:28:32,434][26022] Updated weights on worker 0-0, policy_version 268713 (0.00087) [2022-07-09 13:28:32,868][25689] Fps is (10 sec: 5571.1, 60 sec: 5652.7, 300 sec: 5648.4). Total num frames: 275163136. Throughput: 0: 5074.1. Samples: 275160712. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:32,869][25689] Avg episode reward: [(0, '-49.900')] [2022-07-09 13:28:34,379][26022] Updated weights on worker 0-0, policy_version 268723 (0.00088) [2022-07-09 13:28:36,176][26022] Updated weights on worker 0-0, policy_version 268733 (0.00119) [2022-07-09 13:28:37,873][25689] Fps is (10 sec: 5709.0, 60 sec: 5635.6, 300 sec: 5645.0). Total num frames: 275191808. Throughput: 0: 5907.7. Samples: 275194634. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:37,873][25689] Avg episode reward: [(0, '-48.838')] [2022-07-09 13:28:37,921][26022] Updated weights on worker 0-0, policy_version 268743 (0.00090) [2022-07-09 13:28:39,887][26022] Updated weights on worker 0-0, policy_version 268753 (0.00091) [2022-07-09 13:28:41,684][26022] Updated weights on worker 0-0, policy_version 268763 (0.00099) [2022-07-09 13:28:42,888][25689] Fps is (10 sec: 5621.1, 60 sec: 5619.3, 300 sec: 5646.7). Total num frames: 275219456. Throughput: 0: 5909.2. Samples: 275228710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:42,888][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 13:28:43,451][26022] Updated weights on worker 0-0, policy_version 268773 (0.00089) [2022-07-09 13:28:45,299][26022] Updated weights on worker 0-0, policy_version 268783 (0.00089) [2022-07-09 13:28:47,227][26022] Updated weights on worker 0-0, policy_version 268793 (0.00091) [2022-07-09 13:28:48,028][25689] Fps is (10 sec: 5647.0, 60 sec: 5644.0, 300 sec: 5647.9). Total num frames: 275249152. Throughput: 0: 5047.2. Samples: 275245760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:48,028][25689] Avg episode reward: [(0, '-47.098')] [2022-07-09 13:28:48,790][26022] Updated weights on worker 0-0, policy_version 268803 (0.00084) [2022-07-09 13:28:50,793][26022] Updated weights on worker 0-0, policy_version 268813 (0.00057) [2022-07-09 13:28:52,462][26022] Updated weights on worker 0-0, policy_version 268823 (0.00091) [2022-07-09 13:28:53,043][25689] Fps is (10 sec: 5747.8, 60 sec: 5643.6, 300 sec: 5648.2). Total num frames: 275277824. Throughput: 0: 5887.1. Samples: 275279480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:53,043][25689] Avg episode reward: [(0, '-47.157')] [2022-07-09 13:28:54,535][26022] Updated weights on worker 0-0, policy_version 268833 (0.00087) [2022-07-09 13:28:56,171][26022] Updated weights on worker 0-0, policy_version 268843 (0.00106) [2022-07-09 13:28:58,027][26022] Updated weights on worker 0-0, policy_version 268853 (0.00088) [2022-07-09 13:28:58,053][25689] Fps is (10 sec: 5617.6, 60 sec: 5625.8, 300 sec: 5641.4). Total num frames: 275305472. Throughput: 0: 5901.4. Samples: 275313726. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:28:58,054][25689] Avg episode reward: [(0, '-46.325')] [2022-07-09 13:28:59,561][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:28:59,574][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000268862_275314688.pth [2022-07-09 13:28:59,574][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000266875_273280000.pth [2022-07-09 13:28:59,703][26022] Updated weights on worker 0-0, policy_version 268863 (0.00086) [2022-07-09 13:29:01,698][26022] Updated weights on worker 0-0, policy_version 268873 (0.00090) [2022-07-09 13:29:03,071][25689] Fps is (10 sec: 5412.0, 60 sec: 5645.5, 300 sec: 5642.4). Total num frames: 275332096. Throughput: 0: 5049.1. Samples: 275330614. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:29:03,071][25689] Avg episode reward: [(0, '-46.600')] [2022-07-09 13:29:03,659][26022] Updated weights on worker 0-0, policy_version 268883 (0.00082) [2022-07-09 13:29:05,701][26022] Updated weights on worker 0-0, policy_version 268893 (0.00088) [2022-07-09 13:29:07,418][26022] Updated weights on worker 0-0, policy_version 268903 (0.00055) [2022-07-09 13:29:08,197][25689] Fps is (10 sec: 5451.4, 60 sec: 5621.3, 300 sec: 5640.2). Total num frames: 275360768. Throughput: 0: 5789.1. Samples: 275362520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:29:08,198][25689] Avg episode reward: [(0, '-47.966')] [2022-07-09 13:29:09,215][26022] Updated weights on worker 0-0, policy_version 268913 (0.00090) [2022-07-09 13:29:11,001][26022] Updated weights on worker 0-0, policy_version 268923 (0.00094) [2022-07-09 13:29:12,764][26022] Updated weights on worker 0-0, policy_version 268933 (0.00090) [2022-07-09 13:29:13,232][25689] Fps is (10 sec: 5744.2, 60 sec: 5635.7, 300 sec: 5643.0). Total num frames: 275390464. Throughput: 0: 5816.8. Samples: 275396918. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:29:13,234][25689] Avg episode reward: [(0, '-48.652')] [2022-07-09 13:29:14,398][26022] Updated weights on worker 0-0, policy_version 268943 (0.00084) [2022-07-09 13:29:16,309][26022] Updated weights on worker 0-0, policy_version 268953 (0.00085) [2022-07-09 13:29:18,120][26022] Updated weights on worker 0-0, policy_version 268963 (0.00090) [2022-07-09 13:29:18,259][25689] Fps is (10 sec: 5699.0, 60 sec: 5617.4, 300 sec: 5639.9). Total num frames: 275418112. Throughput: 0: 4966.5. Samples: 275414076. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 13:29:18,259][25689] Avg episode reward: [(0, '-49.247')] [2022-07-09 13:29:20,038][26022] Updated weights on worker 0-0, policy_version 268973 (0.00091) [2022-07-09 13:29:21,655][26022] Updated weights on worker 0-0, policy_version 268983 (0.00089) [2022-07-09 13:29:23,309][25689] Fps is (10 sec: 5589.1, 60 sec: 5614.7, 300 sec: 5640.3). Total num frames: 275446784. Throughput: 0: 5809.1. Samples: 275448178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:23,310][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 13:29:23,759][26022] Updated weights on worker 0-0, policy_version 268993 (0.00081) [2022-07-09 13:29:25,169][26022] Updated weights on worker 0-0, policy_version 269003 (0.00083) [2022-07-09 13:29:27,235][26022] Updated weights on worker 0-0, policy_version 269013 (0.00094) [2022-07-09 13:29:28,366][25689] Fps is (10 sec: 5673.6, 60 sec: 5632.7, 300 sec: 5642.8). Total num frames: 275475456. Throughput: 0: 5944.0. Samples: 275482406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:28,367][25689] Avg episode reward: [(0, '-49.299')] [2022-07-09 13:29:28,855][26022] Updated weights on worker 0-0, policy_version 269023 (0.00087) [2022-07-09 13:29:30,771][26022] Updated weights on worker 0-0, policy_version 269033 (0.00087) [2022-07-09 13:29:32,486][26022] Updated weights on worker 0-0, policy_version 269043 (0.00087) [2022-07-09 13:29:33,389][25689] Fps is (10 sec: 5688.5, 60 sec: 5634.3, 300 sec: 5642.9). Total num frames: 275504128. Throughput: 0: 5077.9. Samples: 275499276. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:33,390][25689] Avg episode reward: [(0, '-48.445')] [2022-07-09 13:29:34,182][26022] Updated weights on worker 0-0, policy_version 269053 (0.00083) [2022-07-09 13:29:36,235][26022] Updated weights on worker 0-0, policy_version 269063 (0.01008) [2022-07-09 13:29:37,670][26022] Updated weights on worker 0-0, policy_version 269073 (0.00100) [2022-07-09 13:29:38,416][25689] Fps is (10 sec: 5705.8, 60 sec: 5632.2, 300 sec: 5639.0). Total num frames: 275532800. Throughput: 0: 5955.1. Samples: 275534114. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:38,417][25689] Avg episode reward: [(0, '-48.171')] [2022-07-09 13:29:39,615][26022] Updated weights on worker 0-0, policy_version 269083 (0.00086) [2022-07-09 13:29:41,502][26022] Updated weights on worker 0-0, policy_version 269093 (0.00086) [2022-07-09 13:29:43,034][26022] Updated weights on worker 0-0, policy_version 269103 (0.00088) [2022-07-09 13:29:43,477][25689] Fps is (10 sec: 5786.2, 60 sec: 5661.8, 300 sec: 5643.6). Total num frames: 275562496. Throughput: 0: 5979.6. Samples: 275568776. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:43,477][25689] Avg episode reward: [(0, '-47.566')] [2022-07-09 13:29:45,119][26022] Updated weights on worker 0-0, policy_version 269113 (0.00090) [2022-07-09 13:29:46,773][26022] Updated weights on worker 0-0, policy_version 269123 (0.00088) [2022-07-09 13:29:48,525][25689] Fps is (10 sec: 5773.8, 60 sec: 5653.4, 300 sec: 5646.4). Total num frames: 275591168. Throughput: 0: 5134.6. Samples: 275585916. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:48,526][25689] Avg episode reward: [(0, '-48.074')] [2022-07-09 13:29:48,559][26022] Updated weights on worker 0-0, policy_version 269133 (0.00078) [2022-07-09 13:29:50,522][26022] Updated weights on worker 0-0, policy_version 269143 (0.00375) [2022-07-09 13:29:51,998][26022] Updated weights on worker 0-0, policy_version 269153 (0.00084) [2022-07-09 13:29:53,597][25689] Fps is (10 sec: 5666.1, 60 sec: 5648.1, 300 sec: 5645.8). Total num frames: 275619840. Throughput: 0: 5990.3. Samples: 275620328. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:53,598][25689] Avg episode reward: [(0, '-47.437')] [2022-07-09 13:29:54,025][26022] Updated weights on worker 0-0, policy_version 269163 (0.00092) [2022-07-09 13:29:55,627][26022] Updated weights on worker 0-0, policy_version 269173 (0.00083) [2022-07-09 13:29:57,505][26022] Updated weights on worker 0-0, policy_version 269183 (0.00085) [2022-07-09 13:29:58,604][25689] Fps is (10 sec: 5791.3, 60 sec: 5682.3, 300 sec: 5647.2). Total num frames: 275649536. Throughput: 0: 5982.4. Samples: 275654886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:29:58,604][25689] Avg episode reward: [(0, '-48.122')] [2022-07-09 13:29:59,292][26022] Updated weights on worker 0-0, policy_version 269193 (0.00089) [2022-07-09 13:30:00,973][26022] Updated weights on worker 0-0, policy_version 269203 (0.00089) [2022-07-09 13:30:03,316][26022] Updated weights on worker 0-0, policy_version 269213 (0.00090) [2022-07-09 13:30:03,610][25689] Fps is (10 sec: 5522.7, 60 sec: 5666.4, 300 sec: 5646.2). Total num frames: 275675136. Throughput: 0: 5121.9. Samples: 275671896. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:03,610][25689] Avg episode reward: [(0, '-48.051')] [2022-07-09 13:30:05,075][26022] Updated weights on worker 0-0, policy_version 269223 (0.00584) [2022-07-09 13:30:06,924][26022] Updated weights on worker 0-0, policy_version 269233 (0.00086) [2022-07-09 13:30:08,657][25689] Fps is (10 sec: 5398.4, 60 sec: 5673.8, 300 sec: 5650.0). Total num frames: 275703808. Throughput: 0: 5858.2. Samples: 275703854. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:08,658][25689] Avg episode reward: [(0, '-48.059')] [2022-07-09 13:30:08,765][26022] Updated weights on worker 0-0, policy_version 269243 (0.00086) [2022-07-09 13:30:10,419][26022] Updated weights on worker 0-0, policy_version 269253 (0.00086) [2022-07-09 13:30:12,238][26022] Updated weights on worker 0-0, policy_version 269263 (0.00082) [2022-07-09 13:30:13,701][25689] Fps is (10 sec: 5885.4, 60 sec: 5689.9, 300 sec: 5653.6). Total num frames: 275734528. Throughput: 0: 5881.0. Samples: 275738560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:13,702][25689] Avg episode reward: [(0, '-47.545')] [2022-07-09 13:30:13,944][26022] Updated weights on worker 0-0, policy_version 269273 (0.00088) [2022-07-09 13:30:15,784][26022] Updated weights on worker 0-0, policy_version 269283 (0.00093) [2022-07-09 13:30:17,565][26022] Updated weights on worker 0-0, policy_version 269293 (0.00083) [2022-07-09 13:30:18,792][25689] Fps is (10 sec: 5759.0, 60 sec: 5683.9, 300 sec: 5648.9). Total num frames: 275762176. Throughput: 0: 4998.4. Samples: 275755796. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:18,793][25689] Avg episode reward: [(0, '-47.403')] [2022-07-09 13:30:19,359][26022] Updated weights on worker 0-0, policy_version 269303 (0.00085) [2022-07-09 13:30:21,047][26022] Updated weights on worker 0-0, policy_version 269313 (0.00086) [2022-07-09 13:30:22,860][26022] Updated weights on worker 0-0, policy_version 269323 (0.00086) [2022-07-09 13:30:23,885][25689] Fps is (10 sec: 5630.7, 60 sec: 5696.7, 300 sec: 5656.1). Total num frames: 275791872. Throughput: 0: 5833.0. Samples: 275790164. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:23,886][25689] Avg episode reward: [(0, '-47.986')] [2022-07-09 13:30:24,874][26022] Updated weights on worker 0-0, policy_version 269333 (0.00087) [2022-07-09 13:30:26,383][26022] Updated weights on worker 0-0, policy_version 269343 (0.00082) [2022-07-09 13:30:28,482][26022] Updated weights on worker 0-0, policy_version 269353 (0.00086) [2022-07-09 13:30:28,948][25689] Fps is (10 sec: 5646.7, 60 sec: 5679.4, 300 sec: 5651.9). Total num frames: 275819520. Throughput: 0: 5950.6. Samples: 275824594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:28,948][25689] Avg episode reward: [(0, '-47.818')] [2022-07-09 13:30:29,979][26022] Updated weights on worker 0-0, policy_version 269363 (0.00091) [2022-07-09 13:30:32,019][26022] Updated weights on worker 0-0, policy_version 269373 (0.00090) [2022-07-09 13:30:33,586][26022] Updated weights on worker 0-0, policy_version 269383 (0.00088) [2022-07-09 13:30:33,965][25689] Fps is (10 sec: 5688.9, 60 sec: 5696.8, 300 sec: 5655.6). Total num frames: 275849216. Throughput: 0: 5941.7. Samples: 275858964. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:33,966][25689] Avg episode reward: [(0, '-48.430')] [2022-07-09 13:30:35,577][26022] Updated weights on worker 0-0, policy_version 269393 (0.00079) [2022-07-09 13:30:37,204][26022] Updated weights on worker 0-0, policy_version 269403 (0.00084) [2022-07-09 13:30:39,043][25689] Fps is (10 sec: 5781.8, 60 sec: 5692.0, 300 sec: 5654.5). Total num frames: 275877888. Throughput: 0: 5940.2. Samples: 275876088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:39,043][25689] Avg episode reward: [(0, '-49.190')] [2022-07-09 13:30:39,101][26022] Updated weights on worker 0-0, policy_version 269413 (0.00086) [2022-07-09 13:30:40,776][26022] Updated weights on worker 0-0, policy_version 269423 (0.00079) [2022-07-09 13:30:42,558][26022] Updated weights on worker 0-0, policy_version 269433 (0.00088) [2022-07-09 13:30:44,064][25689] Fps is (10 sec: 5780.0, 60 sec: 5695.8, 300 sec: 5653.3). Total num frames: 275907584. Throughput: 0: 5984.9. Samples: 275910928. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:44,064][25689] Avg episode reward: [(0, '-49.430')] [2022-07-09 13:30:44,412][26022] Updated weights on worker 0-0, policy_version 269443 (0.00083) [2022-07-09 13:30:46,171][26022] Updated weights on worker 0-0, policy_version 269453 (0.00089) [2022-07-09 13:30:47,952][26022] Updated weights on worker 0-0, policy_version 269463 (0.00091) [2022-07-09 13:30:49,148][25689] Fps is (10 sec: 5776.2, 60 sec: 5692.4, 300 sec: 5656.2). Total num frames: 275936256. Throughput: 0: 5964.7. Samples: 275945082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:49,148][25689] Avg episode reward: [(0, '-50.450')] [2022-07-09 13:30:49,916][26022] Updated weights on worker 0-0, policy_version 269473 (0.00088) [2022-07-09 13:30:51,591][26022] Updated weights on worker 0-0, policy_version 269483 (0.00086) [2022-07-09 13:30:53,372][26022] Updated weights on worker 0-0, policy_version 269493 (0.00089) [2022-07-09 13:30:54,182][25689] Fps is (10 sec: 5667.3, 60 sec: 5696.0, 300 sec: 5659.1). Total num frames: 275964928. Throughput: 0: 5110.5. Samples: 275962282. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:54,183][25689] Avg episode reward: [(0, '-50.485')] [2022-07-09 13:30:55,200][26022] Updated weights on worker 0-0, policy_version 269503 (0.00092) [2022-07-09 13:30:56,902][26022] Updated weights on worker 0-0, policy_version 269513 (0.00086) [2022-07-09 13:30:58,776][26022] Updated weights on worker 0-0, policy_version 269523 (0.00085) [2022-07-09 13:30:59,190][25689] Fps is (10 sec: 5710.3, 60 sec: 5678.9, 300 sec: 5659.1). Total num frames: 275993600. Throughput: 0: 5997.8. Samples: 275996926. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:30:59,190][25689] Avg episode reward: [(0, '-50.365')] [2022-07-09 13:30:59,630][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:30:59,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000269529_275997696.pth [2022-07-09 13:30:59,649][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000267537_273957888.pth [2022-07-09 13:31:00,569][26022] Updated weights on worker 0-0, policy_version 269533 (0.00081) [2022-07-09 13:31:02,601][26022] Updated weights on worker 0-0, policy_version 269543 (0.00085) [2022-07-09 13:31:04,198][25689] Fps is (10 sec: 5418.4, 60 sec: 5678.7, 300 sec: 5651.7). Total num frames: 276019200. Throughput: 0: 5874.5. Samples: 276029210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:04,199][25689] Avg episode reward: [(0, '-49.935')] [2022-07-09 13:31:04,440][26022] Updated weights on worker 0-0, policy_version 269553 (0.00085) [2022-07-09 13:31:06,044][26022] Updated weights on worker 0-0, policy_version 269563 (0.00091) [2022-07-09 13:31:08,003][26022] Updated weights on worker 0-0, policy_version 269573 (0.00093) [2022-07-09 13:31:09,335][25689] Fps is (10 sec: 5551.5, 60 sec: 5704.1, 300 sec: 5657.1). Total num frames: 276049920. Throughput: 0: 5027.0. Samples: 276046564. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:09,335][25689] Avg episode reward: [(0, '-49.749')] [2022-07-09 13:31:09,766][26022] Updated weights on worker 0-0, policy_version 269583 (0.00090) [2022-07-09 13:31:11,494][26022] Updated weights on worker 0-0, policy_version 269593 (0.00085) [2022-07-09 13:31:13,385][26022] Updated weights on worker 0-0, policy_version 269603 (0.00100) [2022-07-09 13:31:14,414][25689] Fps is (10 sec: 5814.0, 60 sec: 5667.1, 300 sec: 5659.9). Total num frames: 276078592. Throughput: 0: 5876.6. Samples: 276081174. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:14,414][25689] Avg episode reward: [(0, '-49.723')] [2022-07-09 13:31:15,099][26022] Updated weights on worker 0-0, policy_version 269613 (0.00117) [2022-07-09 13:31:16,857][26022] Updated weights on worker 0-0, policy_version 269623 (0.00085) [2022-07-09 13:31:18,775][26022] Updated weights on worker 0-0, policy_version 269633 (0.00091) [2022-07-09 13:31:19,422][25689] Fps is (10 sec: 5786.2, 60 sec: 5708.6, 300 sec: 5663.5). Total num frames: 276108288. Throughput: 0: 5873.2. Samples: 276115754. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:19,423][25689] Avg episode reward: [(0, '-49.341')] [2022-07-09 13:31:20,552][26022] Updated weights on worker 0-0, policy_version 269643 (0.00082) [2022-07-09 13:31:22,280][26022] Updated weights on worker 0-0, policy_version 269653 (0.00083) [2022-07-09 13:31:23,985][26022] Updated weights on worker 0-0, policy_version 269663 (0.00083) [2022-07-09 13:31:24,426][25689] Fps is (10 sec: 5727.2, 60 sec: 5683.2, 300 sec: 5654.9). Total num frames: 276135936. Throughput: 0: 5121.2. Samples: 276132800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:24,427][25689] Avg episode reward: [(0, '-48.912')] [2022-07-09 13:31:25,951][26022] Updated weights on worker 0-0, policy_version 269673 (0.00087) [2022-07-09 13:31:27,452][26022] Updated weights on worker 0-0, policy_version 269683 (0.00089) [2022-07-09 13:31:29,520][25689] Fps is (10 sec: 5577.6, 60 sec: 5697.1, 300 sec: 5660.1). Total num frames: 276164608. Throughput: 0: 5983.3. Samples: 276167336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:29,520][25689] Avg episode reward: [(0, '-49.189')] [2022-07-09 13:31:29,587][26022] Updated weights on worker 0-0, policy_version 269693 (0.00086) [2022-07-09 13:31:31,146][26022] Updated weights on worker 0-0, policy_version 269703 (0.00087) [2022-07-09 13:31:33,054][26022] Updated weights on worker 0-0, policy_version 269713 (0.00087) [2022-07-09 13:31:34,525][25689] Fps is (10 sec: 5780.0, 60 sec: 5698.4, 300 sec: 5660.8). Total num frames: 276194304. Throughput: 0: 5974.3. Samples: 276201322. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:34,529][25689] Avg episode reward: [(0, '-48.873')] [2022-07-09 13:31:34,947][26022] Updated weights on worker 0-0, policy_version 269723 (0.00095) [2022-07-09 13:31:36,598][26022] Updated weights on worker 0-0, policy_version 269733 (0.00081) [2022-07-09 13:31:38,659][26022] Updated weights on worker 0-0, policy_version 269743 (0.00088) [2022-07-09 13:31:39,606][25689] Fps is (10 sec: 5888.7, 60 sec: 5714.9, 300 sec: 5666.3). Total num frames: 276224000. Throughput: 0: 5086.9. Samples: 276218424. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 13:31:39,606][25689] Avg episode reward: [(0, '-49.367')] [2022-07-09 13:31:40,067][26022] Updated weights on worker 0-0, policy_version 269753 (0.00093) [2022-07-09 13:31:42,089][26022] Updated weights on worker 0-0, policy_version 269763 (0.00093) [2022-07-09 13:31:43,614][26022] Updated weights on worker 0-0, policy_version 269773 (0.00090) [2022-07-09 13:31:44,622][25689] Fps is (10 sec: 5679.2, 60 sec: 5681.5, 300 sec: 5664.0). Total num frames: 276251648. Throughput: 0: 5941.1. Samples: 276252784. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:31:44,622][25689] Avg episode reward: [(0, '-49.635')] [2022-07-09 13:31:45,604][26022] Updated weights on worker 0-0, policy_version 269783 (0.00104) [2022-07-09 13:31:47,467][26022] Updated weights on worker 0-0, policy_version 269793 (0.00097) [2022-07-09 13:31:49,143][26022] Updated weights on worker 0-0, policy_version 269803 (0.00083) [2022-07-09 13:31:49,726][25689] Fps is (10 sec: 5565.2, 60 sec: 5679.7, 300 sec: 5663.2). Total num frames: 276280320. Throughput: 0: 5927.1. Samples: 276287098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:31:49,726][25689] Avg episode reward: [(0, '-49.895')] [2022-07-09 13:31:51,006][26022] Updated weights on worker 0-0, policy_version 269813 (0.00092) [2022-07-09 13:31:52,776][26022] Updated weights on worker 0-0, policy_version 269823 (0.00089) [2022-07-09 13:31:54,599][26022] Updated weights on worker 0-0, policy_version 269833 (0.00094) [2022-07-09 13:31:54,765][25689] Fps is (10 sec: 5653.3, 60 sec: 5679.2, 300 sec: 5662.6). Total num frames: 276308992. Throughput: 0: 5083.8. Samples: 276304220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:31:54,766][25689] Avg episode reward: [(0, '-49.142')] [2022-07-09 13:31:56,414][26022] Updated weights on worker 0-0, policy_version 269843 (0.00091) [2022-07-09 13:31:58,416][26022] Updated weights on worker 0-0, policy_version 269853 (0.00091) [2022-07-09 13:31:59,718][26022] Updated weights on worker 0-0, policy_version 269863 (0.00091) [2022-07-09 13:31:59,784][25689] Fps is (10 sec: 5904.6, 60 sec: 5712.0, 300 sec: 5677.2). Total num frames: 276339712. Throughput: 0: 5961.8. Samples: 276338724. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:31:59,785][25689] Avg episode reward: [(0, '-49.300')] [2022-07-09 13:32:02,283][26022] Updated weights on worker 0-0, policy_version 269873 (0.00087) [2022-07-09 13:32:03,788][26022] Updated weights on worker 0-0, policy_version 269883 (0.00087) [2022-07-09 13:32:04,786][25689] Fps is (10 sec: 5518.3, 60 sec: 5695.7, 300 sec: 5664.3). Total num frames: 276364288. Throughput: 0: 5870.9. Samples: 276371164. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:04,786][25689] Avg episode reward: [(0, '-49.500')] [2022-07-09 13:32:05,630][26022] Updated weights on worker 0-0, policy_version 269893 (0.00094) [2022-07-09 13:32:07,791][26022] Updated weights on worker 0-0, policy_version 269903 (0.00089) [2022-07-09 13:32:09,101][26022] Updated weights on worker 0-0, policy_version 269913 (0.00085) [2022-07-09 13:32:09,882][25689] Fps is (10 sec: 5374.6, 60 sec: 5682.6, 300 sec: 5666.9). Total num frames: 276393984. Throughput: 0: 4995.3. Samples: 276387786. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:09,883][25689] Avg episode reward: [(0, '-48.764')] [2022-07-09 13:32:11,313][26022] Updated weights on worker 0-0, policy_version 269923 (0.00086) [2022-07-09 13:32:12,749][26022] Updated weights on worker 0-0, policy_version 269933 (0.00095) [2022-07-09 13:32:14,840][26022] Updated weights on worker 0-0, policy_version 269943 (0.00090) [2022-07-09 13:32:14,894][25689] Fps is (10 sec: 5673.1, 60 sec: 5672.0, 300 sec: 5663.4). Total num frames: 276421632. Throughput: 0: 5847.0. Samples: 276421910. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:14,894][25689] Avg episode reward: [(0, '-48.523')] [2022-07-09 13:32:16,578][26022] Updated weights on worker 0-0, policy_version 269953 (0.00083) [2022-07-09 13:32:18,320][26022] Updated weights on worker 0-0, policy_version 269963 (0.00083) [2022-07-09 13:32:19,902][25689] Fps is (10 sec: 5722.8, 60 sec: 5672.0, 300 sec: 5666.9). Total num frames: 276451328. Throughput: 0: 5841.5. Samples: 276456244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:19,903][25689] Avg episode reward: [(0, '-48.301')] [2022-07-09 13:32:20,039][26022] Updated weights on worker 0-0, policy_version 269973 (0.00084) [2022-07-09 13:32:21,920][26022] Updated weights on worker 0-0, policy_version 269983 (0.00091) [2022-07-09 13:32:23,823][26022] Updated weights on worker 0-0, policy_version 269993 (0.00095) [2022-07-09 13:32:24,918][25689] Fps is (10 sec: 5720.4, 60 sec: 5670.8, 300 sec: 5664.5). Total num frames: 276478976. Throughput: 0: 5083.5. Samples: 276473508. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:24,919][25689] Avg episode reward: [(0, '-49.474')] [2022-07-09 13:32:25,437][26022] Updated weights on worker 0-0, policy_version 270003 (0.00123) [2022-07-09 13:32:27,298][26022] Updated weights on worker 0-0, policy_version 270013 (0.00090) [2022-07-09 13:32:29,045][26022] Updated weights on worker 0-0, policy_version 270023 (0.00086) [2022-07-09 13:32:30,039][25689] Fps is (10 sec: 5657.2, 60 sec: 5685.2, 300 sec: 5669.1). Total num frames: 276508672. Throughput: 0: 5946.9. Samples: 276507656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:30,039][25689] Avg episode reward: [(0, '-49.912')] [2022-07-09 13:32:31,101][26022] Updated weights on worker 0-0, policy_version 270033 (0.00090) [2022-07-09 13:32:32,587][26022] Updated weights on worker 0-0, policy_version 270043 (0.00100) [2022-07-09 13:32:34,642][26022] Updated weights on worker 0-0, policy_version 270053 (0.00094) [2022-07-09 13:32:35,060][25689] Fps is (10 sec: 5553.1, 60 sec: 5632.9, 300 sec: 5658.4). Total num frames: 276535296. Throughput: 0: 5937.8. Samples: 276541654. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:35,062][25689] Avg episode reward: [(0, '-49.774')] [2022-07-09 13:32:36,191][26022] Updated weights on worker 0-0, policy_version 270063 (0.00089) [2022-07-09 13:32:38,469][26022] Updated weights on worker 0-0, policy_version 270073 (0.00111) [2022-07-09 13:32:39,819][26022] Updated weights on worker 0-0, policy_version 270083 (0.00085) [2022-07-09 13:32:40,067][25689] Fps is (10 sec: 5616.1, 60 sec: 5639.8, 300 sec: 5662.2). Total num frames: 276564992. Throughput: 0: 5065.0. Samples: 276558380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:40,068][25689] Avg episode reward: [(0, '-49.113')] [2022-07-09 13:32:41,891][26022] Updated weights on worker 0-0, policy_version 270093 (0.00090) [2022-07-09 13:32:43,421][26022] Updated weights on worker 0-0, policy_version 270103 (0.00085) [2022-07-09 13:32:45,090][25689] Fps is (10 sec: 5717.5, 60 sec: 5639.2, 300 sec: 5662.5). Total num frames: 276592640. Throughput: 0: 5908.0. Samples: 276592682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:45,090][25689] Avg episode reward: [(0, '-49.836')] [2022-07-09 13:32:45,314][26022] Updated weights on worker 0-0, policy_version 270113 (0.00089) [2022-07-09 13:32:47,111][26022] Updated weights on worker 0-0, policy_version 270123 (0.00085) [2022-07-09 13:32:48,994][26022] Updated weights on worker 0-0, policy_version 270133 (0.00091) [2022-07-09 13:32:50,147][25689] Fps is (10 sec: 5688.9, 60 sec: 5660.5, 300 sec: 5665.1). Total num frames: 276622336. Throughput: 0: 5939.0. Samples: 276627080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:50,148][25689] Avg episode reward: [(0, '-49.214')] [2022-07-09 13:32:50,799][26022] Updated weights on worker 0-0, policy_version 270143 (0.00095) [2022-07-09 13:32:52,352][26022] Updated weights on worker 0-0, policy_version 270153 (0.00084) [2022-07-09 13:32:54,360][26022] Updated weights on worker 0-0, policy_version 270163 (0.00089) [2022-07-09 13:32:55,187][25689] Fps is (10 sec: 5882.2, 60 sec: 5677.4, 300 sec: 5667.8). Total num frames: 276652032. Throughput: 0: 5109.4. Samples: 276644488. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:32:55,187][25689] Avg episode reward: [(0, '-48.240')] [2022-07-09 13:32:56,138][26022] Updated weights on worker 0-0, policy_version 270173 (0.00085) [2022-07-09 13:32:57,990][26022] Updated weights on worker 0-0, policy_version 270183 (0.00057) [2022-07-09 13:32:59,538][26022] Updated weights on worker 0-0, policy_version 270193 (0.00082) [2022-07-09 13:32:59,769][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:32:59,783][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000270194_276678656.pth [2022-07-09 13:32:59,783][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000268201_274637824.pth [2022-07-09 13:33:00,199][25689] Fps is (10 sec: 5806.4, 60 sec: 5644.1, 300 sec: 5678.8). Total num frames: 276680704. Throughput: 0: 5987.8. Samples: 276678928. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:00,201][25689] Avg episode reward: [(0, '-48.667')] [2022-07-09 13:33:01,494][26022] Updated weights on worker 0-0, policy_version 270203 (0.00077) [2022-07-09 13:33:03,433][26022] Updated weights on worker 0-0, policy_version 270213 (0.00087) [2022-07-09 13:33:05,211][25689] Fps is (10 sec: 5413.9, 60 sec: 5660.1, 300 sec: 5665.7). Total num frames: 276706304. Throughput: 0: 5891.1. Samples: 276711220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:05,212][25689] Avg episode reward: [(0, '-47.449')] [2022-07-09 13:33:05,410][26022] Updated weights on worker 0-0, policy_version 270223 (0.00095) [2022-07-09 13:33:07,435][26022] Updated weights on worker 0-0, policy_version 270233 (0.00091) [2022-07-09 13:33:08,830][26022] Updated weights on worker 0-0, policy_version 270243 (0.00087) [2022-07-09 13:33:10,372][25689] Fps is (10 sec: 5435.6, 60 sec: 5654.1, 300 sec: 5666.2). Total num frames: 276736000. Throughput: 0: 5845.7. Samples: 276745310. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:10,373][25689] Avg episode reward: [(0, '-47.468')] [2022-07-09 13:33:11,036][26022] Updated weights on worker 0-0, policy_version 270253 (0.00091) [2022-07-09 13:33:12,619][26022] Updated weights on worker 0-0, policy_version 270263 (0.00096) [2022-07-09 13:33:14,573][26022] Updated weights on worker 0-0, policy_version 270273 (0.00091) [2022-07-09 13:33:15,380][25689] Fps is (10 sec: 5840.5, 60 sec: 5688.3, 300 sec: 5669.7). Total num frames: 276765696. Throughput: 0: 5847.2. Samples: 276762562. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:15,381][25689] Avg episode reward: [(0, '-47.943')] [2022-07-09 13:33:16,054][26022] Updated weights on worker 0-0, policy_version 270283 (0.00101) [2022-07-09 13:33:18,180][26022] Updated weights on worker 0-0, policy_version 270293 (0.00082) [2022-07-09 13:33:19,781][26022] Updated weights on worker 0-0, policy_version 270303 (0.00086) [2022-07-09 13:33:20,395][25689] Fps is (10 sec: 5721.6, 60 sec: 5653.9, 300 sec: 5666.4). Total num frames: 276793344. Throughput: 0: 5841.0. Samples: 276796888. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:20,396][25689] Avg episode reward: [(0, '-48.099')] [2022-07-09 13:33:21,661][26022] Updated weights on worker 0-0, policy_version 270313 (0.01208) [2022-07-09 13:33:23,383][26022] Updated weights on worker 0-0, policy_version 270323 (0.00088) [2022-07-09 13:33:25,071][26022] Updated weights on worker 0-0, policy_version 270333 (0.00088) [2022-07-09 13:33:25,405][25689] Fps is (10 sec: 5720.2, 60 sec: 5688.2, 300 sec: 5674.4). Total num frames: 276823040. Throughput: 0: 5934.3. Samples: 276831056. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:25,405][25689] Avg episode reward: [(0, '-48.306')] [2022-07-09 13:33:27,022][26022] Updated weights on worker 0-0, policy_version 270343 (0.00092) [2022-07-09 13:33:28,780][26022] Updated weights on worker 0-0, policy_version 270353 (0.00086) [2022-07-09 13:33:30,472][25689] Fps is (10 sec: 5690.5, 60 sec: 5659.4, 300 sec: 5670.5). Total num frames: 276850688. Throughput: 0: 5118.4. Samples: 276848188. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:30,474][25689] Avg episode reward: [(0, '-49.095')] [2022-07-09 13:33:30,585][26022] Updated weights on worker 0-0, policy_version 270363 (0.00098) [2022-07-09 13:33:32,316][26022] Updated weights on worker 0-0, policy_version 270373 (0.00088) [2022-07-09 13:33:34,422][26022] Updated weights on worker 0-0, policy_version 270383 (0.00087) [2022-07-09 13:33:35,475][25689] Fps is (10 sec: 5694.8, 60 sec: 5712.0, 300 sec: 5673.9). Total num frames: 276880384. Throughput: 0: 5967.0. Samples: 276882466. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:35,477][25689] Avg episode reward: [(0, '-49.091')] [2022-07-09 13:33:35,844][26022] Updated weights on worker 0-0, policy_version 270393 (0.00093) [2022-07-09 13:33:38,011][26022] Updated weights on worker 0-0, policy_version 270403 (0.00081) [2022-07-09 13:33:39,543][26022] Updated weights on worker 0-0, policy_version 270413 (0.00052) [2022-07-09 13:33:40,514][25689] Fps is (10 sec: 5608.1, 60 sec: 5658.1, 300 sec: 5670.0). Total num frames: 276907008. Throughput: 0: 5944.1. Samples: 276916482. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:40,515][25689] Avg episode reward: [(0, '-48.347')] [2022-07-09 13:33:41,527][26022] Updated weights on worker 0-0, policy_version 270423 (0.00092) [2022-07-09 13:33:42,919][26022] Updated weights on worker 0-0, policy_version 270433 (0.00089) [2022-07-09 13:33:44,967][26022] Updated weights on worker 0-0, policy_version 270443 (0.00091) [2022-07-09 13:33:45,535][25689] Fps is (10 sec: 5598.4, 60 sec: 5692.2, 300 sec: 5672.3). Total num frames: 276936704. Throughput: 0: 5094.0. Samples: 276933596. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:45,537][25689] Avg episode reward: [(0, '-48.188')] [2022-07-09 13:33:46,783][26022] Updated weights on worker 0-0, policy_version 270453 (0.00091) [2022-07-09 13:33:48,445][26022] Updated weights on worker 0-0, policy_version 270463 (0.00092) [2022-07-09 13:33:50,512][26022] Updated weights on worker 0-0, policy_version 270473 (0.00096) [2022-07-09 13:33:50,587][25689] Fps is (10 sec: 5794.7, 60 sec: 5675.7, 300 sec: 5671.6). Total num frames: 276965376. Throughput: 0: 5941.7. Samples: 276967708. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:50,589][25689] Avg episode reward: [(0, '-48.032')] [2022-07-09 13:33:52,175][26022] Updated weights on worker 0-0, policy_version 270483 (0.00088) [2022-07-09 13:33:53,907][26022] Updated weights on worker 0-0, policy_version 270493 (0.00089) [2022-07-09 13:33:55,627][25689] Fps is (10 sec: 5580.6, 60 sec: 5641.8, 300 sec: 5671.0). Total num frames: 276993024. Throughput: 0: 5912.3. Samples: 277001612. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:33:55,627][25689] Avg episode reward: [(0, '-46.939')] [2022-07-09 13:33:55,976][26022] Updated weights on worker 0-0, policy_version 270503 (0.00097) [2022-07-09 13:33:57,516][26022] Updated weights on worker 0-0, policy_version 270513 (0.00084) [2022-07-09 13:33:59,610][26022] Updated weights on worker 0-0, policy_version 270523 (0.00095) [2022-07-09 13:34:00,699][25689] Fps is (10 sec: 5569.8, 60 sec: 5636.3, 300 sec: 5676.9). Total num frames: 277021696. Throughput: 0: 5055.9. Samples: 277018532. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:34:00,699][25689] Avg episode reward: [(0, '-47.007')] [2022-07-09 13:34:01,200][26022] Updated weights on worker 0-0, policy_version 270533 (0.00088) [2022-07-09 13:34:03,582][26022] Updated weights on worker 0-0, policy_version 270543 (0.00085) [2022-07-09 13:34:05,200][26022] Updated weights on worker 0-0, policy_version 270553 (0.00092) [2022-07-09 13:34:05,727][25689] Fps is (10 sec: 5474.9, 60 sec: 5651.7, 300 sec: 5671.9). Total num frames: 277048320. Throughput: 0: 5769.7. Samples: 277050098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-09 13:34:05,729][25689] Avg episode reward: [(0, '-47.043')] [2022-07-09 13:34:07,295][26022] Updated weights on worker 0-0, policy_version 270563 (0.00091) [2022-07-09 13:34:08,844][26022] Updated weights on worker 0-0, policy_version 270573 (0.00083) [2022-07-09 13:34:10,809][25689] Fps is (10 sec: 5267.0, 60 sec: 5608.3, 300 sec: 5660.7). Total num frames: 277074944. Throughput: 0: 5752.1. Samples: 277084024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:10,809][25689] Avg episode reward: [(0, '-48.129')] [2022-07-09 13:34:10,835][26022] Updated weights on worker 0-0, policy_version 270583 (0.00084) [2022-07-09 13:34:12,515][26022] Updated weights on worker 0-0, policy_version 270593 (0.00084) [2022-07-09 13:34:14,393][26022] Updated weights on worker 0-0, policy_version 270603 (0.00213) [2022-07-09 13:34:15,819][25689] Fps is (10 sec: 5580.4, 60 sec: 5608.0, 300 sec: 5667.8). Total num frames: 277104640. Throughput: 0: 4927.4. Samples: 277101108. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:15,821][25689] Avg episode reward: [(0, '-48.937')] [2022-07-09 13:34:16,226][26022] Updated weights on worker 0-0, policy_version 270613 (0.00084) [2022-07-09 13:34:17,975][26022] Updated weights on worker 0-0, policy_version 270623 (0.00073) [2022-07-09 13:34:19,574][26022] Updated weights on worker 0-0, policy_version 270633 (0.00083) [2022-07-09 13:34:20,824][25689] Fps is (10 sec: 5930.1, 60 sec: 5642.8, 300 sec: 5672.2). Total num frames: 277134336. Throughput: 0: 5815.9. Samples: 277135576. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:20,825][25689] Avg episode reward: [(0, '-48.672')] [2022-07-09 13:34:21,565][26022] Updated weights on worker 0-0, policy_version 270643 (0.00082) [2022-07-09 13:34:23,179][26022] Updated weights on worker 0-0, policy_version 270653 (0.00096) [2022-07-09 13:34:25,206][26022] Updated weights on worker 0-0, policy_version 270663 (0.00086) [2022-07-09 13:34:25,830][25689] Fps is (10 sec: 5830.7, 60 sec: 5626.3, 300 sec: 5673.1). Total num frames: 277163008. Throughput: 0: 5965.4. Samples: 277170020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:25,836][25689] Avg episode reward: [(0, '-49.466')] [2022-07-09 13:34:26,879][26022] Updated weights on worker 0-0, policy_version 270673 (0.00090) [2022-07-09 13:34:28,572][26022] Updated weights on worker 0-0, policy_version 270683 (0.00084) [2022-07-09 13:34:30,827][26022] Updated weights on worker 0-0, policy_version 270693 (0.00092) [2022-07-09 13:34:30,905][25689] Fps is (10 sec: 5484.8, 60 sec: 5608.5, 300 sec: 5665.3). Total num frames: 277189632. Throughput: 0: 5116.6. Samples: 277186852. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:30,906][25689] Avg episode reward: [(0, '-50.014')] [2022-07-09 13:34:32,217][26022] Updated weights on worker 0-0, policy_version 270703 (0.00088) [2022-07-09 13:34:34,195][26022] Updated weights on worker 0-0, policy_version 270713 (0.00089) [2022-07-09 13:34:35,791][26022] Updated weights on worker 0-0, policy_version 270723 (0.00093) [2022-07-09 13:34:35,927][25689] Fps is (10 sec: 5679.4, 60 sec: 5623.8, 300 sec: 5672.2). Total num frames: 277220352. Throughput: 0: 5974.7. Samples: 277221244. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:35,927][25689] Avg episode reward: [(0, '-49.626')] [2022-07-09 13:34:37,715][26022] Updated weights on worker 0-0, policy_version 270733 (0.00086) [2022-07-09 13:34:39,570][26022] Updated weights on worker 0-0, policy_version 270743 (0.00099) [2022-07-09 13:34:40,943][25689] Fps is (10 sec: 5814.7, 60 sec: 5642.8, 300 sec: 5666.2). Total num frames: 277248000. Throughput: 0: 5943.3. Samples: 277255154. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:40,944][25689] Avg episode reward: [(0, '-48.414')] [2022-07-09 13:34:41,336][26022] Updated weights on worker 0-0, policy_version 270753 (0.00089) [2022-07-09 13:34:43,258][26022] Updated weights on worker 0-0, policy_version 270763 (0.00093) [2022-07-09 13:34:44,877][26022] Updated weights on worker 0-0, policy_version 270773 (0.00086) [2022-07-09 13:34:45,949][25689] Fps is (10 sec: 5619.3, 60 sec: 5627.2, 300 sec: 5667.0). Total num frames: 277276672. Throughput: 0: 5081.5. Samples: 277272258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:45,950][25689] Avg episode reward: [(0, '-48.794')] [2022-07-09 13:34:46,890][26022] Updated weights on worker 0-0, policy_version 270783 (0.00095) [2022-07-09 13:34:48,433][26022] Updated weights on worker 0-0, policy_version 270793 (0.00084) [2022-07-09 13:34:50,410][26022] Updated weights on worker 0-0, policy_version 270803 (0.00098) [2022-07-09 13:34:51,052][25689] Fps is (10 sec: 5875.4, 60 sec: 5656.4, 300 sec: 5673.3). Total num frames: 277307392. Throughput: 0: 5946.5. Samples: 277306654. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:51,053][25689] Avg episode reward: [(0, '-49.216')] [2022-07-09 13:34:52,083][26022] Updated weights on worker 0-0, policy_version 270813 (0.00085) [2022-07-09 13:34:53,899][26022] Updated weights on worker 0-0, policy_version 270823 (0.00095) [2022-07-09 13:34:55,622][26022] Updated weights on worker 0-0, policy_version 270833 (0.00085) [2022-07-09 13:34:56,133][25689] Fps is (10 sec: 5832.2, 60 sec: 5669.5, 300 sec: 5668.5). Total num frames: 277336064. Throughput: 0: 5934.7. Samples: 277341160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:34:56,133][25689] Avg episode reward: [(0, '-49.035')] [2022-07-09 13:34:57,307][26022] Updated weights on worker 0-0, policy_version 270843 (0.00084) [2022-07-09 13:34:59,155][26022] Updated weights on worker 0-0, policy_version 270853 (0.00086) [2022-07-09 13:35:00,071][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:35:00,090][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000270858_277358592.pth [2022-07-09 13:35:00,092][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000268862_275314688.pth [2022-07-09 13:35:01,107][26022] Updated weights on worker 0-0, policy_version 270863 (0.00090) [2022-07-09 13:35:01,139][25689] Fps is (10 sec: 5685.2, 60 sec: 5675.7, 300 sec: 5678.8). Total num frames: 277364736. Throughput: 0: 5115.9. Samples: 277358470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:01,139][25689] Avg episode reward: [(0, '-48.766')] [2022-07-09 13:35:03,132][26022] Updated weights on worker 0-0, policy_version 270873 (0.00118) [2022-07-09 13:35:05,018][26022] Updated weights on worker 0-0, policy_version 270883 (0.00080) [2022-07-09 13:35:06,173][25689] Fps is (10 sec: 5507.5, 60 sec: 5675.2, 300 sec: 5672.1). Total num frames: 277391360. Throughput: 0: 5869.6. Samples: 277390962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:06,173][25689] Avg episode reward: [(0, '-48.654')] [2022-07-09 13:35:06,632][26022] Updated weights on worker 0-0, policy_version 270893 (0.00093) [2022-07-09 13:35:08,712][26022] Updated weights on worker 0-0, policy_version 270903 (0.00320) [2022-07-09 13:35:10,287][26022] Updated weights on worker 0-0, policy_version 270913 (0.00052) [2022-07-09 13:35:11,233][25689] Fps is (10 sec: 5376.6, 60 sec: 5694.2, 300 sec: 5661.5). Total num frames: 277419008. Throughput: 0: 5866.9. Samples: 277425052. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:11,233][25689] Avg episode reward: [(0, '-49.682')] [2022-07-09 13:35:12,203][26022] Updated weights on worker 0-0, policy_version 270923 (0.00097) [2022-07-09 13:35:13,907][26022] Updated weights on worker 0-0, policy_version 270933 (0.00090) [2022-07-09 13:35:15,805][26022] Updated weights on worker 0-0, policy_version 270943 (0.00095) [2022-07-09 13:35:16,265][25689] Fps is (10 sec: 5580.8, 60 sec: 5675.2, 300 sec: 5666.1). Total num frames: 277447680. Throughput: 0: 5022.1. Samples: 277442264. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:16,265][25689] Avg episode reward: [(0, '-48.352')] [2022-07-09 13:35:17,521][26022] Updated weights on worker 0-0, policy_version 270953 (0.00085) [2022-07-09 13:35:19,328][26022] Updated weights on worker 0-0, policy_version 270963 (0.00083) [2022-07-09 13:35:21,063][26022] Updated weights on worker 0-0, policy_version 270973 (0.00117) [2022-07-09 13:35:21,273][25689] Fps is (10 sec: 5915.3, 60 sec: 5691.8, 300 sec: 5671.1). Total num frames: 277478400. Throughput: 0: 5860.2. Samples: 277476462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:21,274][25689] Avg episode reward: [(0, '-47.503')] [2022-07-09 13:35:23,008][26022] Updated weights on worker 0-0, policy_version 270983 (0.00083) [2022-07-09 13:35:24,666][26022] Updated weights on worker 0-0, policy_version 270993 (0.00084) [2022-07-09 13:35:26,301][25689] Fps is (10 sec: 5713.8, 60 sec: 5655.9, 300 sec: 5668.4). Total num frames: 277505024. Throughput: 0: 5932.9. Samples: 277510378. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:26,301][25689] Avg episode reward: [(0, '-47.128')] [2022-07-09 13:35:26,736][26022] Updated weights on worker 0-0, policy_version 271003 (0.00085) [2022-07-09 13:35:28,435][26022] Updated weights on worker 0-0, policy_version 271013 (0.00093) [2022-07-09 13:35:30,225][26022] Updated weights on worker 0-0, policy_version 271023 (0.00073) [2022-07-09 13:35:31,349][25689] Fps is (10 sec: 5284.5, 60 sec: 5658.4, 300 sec: 5657.4). Total num frames: 277531648. Throughput: 0: 5089.0. Samples: 277527426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:31,351][25689] Avg episode reward: [(0, '-46.589')] [2022-07-09 13:35:31,879][26022] Updated weights on worker 0-0, policy_version 271033 (0.00087) [2022-07-09 13:35:34,057][26022] Updated weights on worker 0-0, policy_version 271043 (0.00093) [2022-07-09 13:35:35,532][26022] Updated weights on worker 0-0, policy_version 271053 (0.00080) [2022-07-09 13:35:36,359][25689] Fps is (10 sec: 5802.9, 60 sec: 5676.4, 300 sec: 5669.1). Total num frames: 277563392. Throughput: 0: 5947.7. Samples: 277561780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:36,360][25689] Avg episode reward: [(0, '-46.958')] [2022-07-09 13:35:37,544][26022] Updated weights on worker 0-0, policy_version 271063 (0.00105) [2022-07-09 13:35:39,147][26022] Updated weights on worker 0-0, policy_version 271073 (0.00083) [2022-07-09 13:35:41,035][26022] Updated weights on worker 0-0, policy_version 271083 (0.00090) [2022-07-09 13:35:41,372][25689] Fps is (10 sec: 5823.3, 60 sec: 5659.8, 300 sec: 5658.9). Total num frames: 277590016. Throughput: 0: 5932.7. Samples: 277595706. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:41,374][25689] Avg episode reward: [(0, '-46.971')] [2022-07-09 13:35:42,725][26022] Updated weights on worker 0-0, policy_version 271093 (0.00096) [2022-07-09 13:35:44,731][26022] Updated weights on worker 0-0, policy_version 271103 (0.00087) [2022-07-09 13:35:46,392][25689] Fps is (10 sec: 5613.3, 60 sec: 5675.4, 300 sec: 5663.5). Total num frames: 277619712. Throughput: 0: 5099.8. Samples: 277612842. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:46,394][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 13:35:46,396][26022] Updated weights on worker 0-0, policy_version 271113 (0.00092) [2022-07-09 13:35:48,174][26022] Updated weights on worker 0-0, policy_version 271123 (0.00086) [2022-07-09 13:35:49,940][26022] Updated weights on worker 0-0, policy_version 271133 (0.00083) [2022-07-09 13:35:51,462][25689] Fps is (10 sec: 5683.2, 60 sec: 5627.6, 300 sec: 5659.4). Total num frames: 277647360. Throughput: 0: 5952.4. Samples: 277647148. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:51,463][25689] Avg episode reward: [(0, '-47.348')] [2022-07-09 13:35:51,858][26022] Updated weights on worker 0-0, policy_version 271143 (0.00081) [2022-07-09 13:35:53,623][26022] Updated weights on worker 0-0, policy_version 271153 (0.00095) [2022-07-09 13:35:55,459][26022] Updated weights on worker 0-0, policy_version 271163 (0.00446) [2022-07-09 13:35:56,485][25689] Fps is (10 sec: 5580.4, 60 sec: 5633.1, 300 sec: 5659.1). Total num frames: 277676032. Throughput: 0: 5945.3. Samples: 277681432. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:35:56,485][25689] Avg episode reward: [(0, '-47.803')] [2022-07-09 13:35:57,319][26022] Updated weights on worker 0-0, policy_version 271173 (0.00085) [2022-07-09 13:35:58,864][26022] Updated weights on worker 0-0, policy_version 271183 (0.00096) [2022-07-09 13:36:00,786][26022] Updated weights on worker 0-0, policy_version 271193 (0.00095) [2022-07-09 13:36:01,503][25689] Fps is (10 sec: 5609.3, 60 sec: 5615.0, 300 sec: 5665.8). Total num frames: 277703680. Throughput: 0: 5101.7. Samples: 277698406. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:36:01,503][25689] Avg episode reward: [(0, '-47.686')] [2022-07-09 13:36:02,927][26022] Updated weights on worker 0-0, policy_version 271203 (0.00089) [2022-07-09 13:36:04,836][26022] Updated weights on worker 0-0, policy_version 271213 (0.00092) [2022-07-09 13:36:06,544][25689] Fps is (10 sec: 5497.1, 60 sec: 5631.3, 300 sec: 5657.3). Total num frames: 277731328. Throughput: 0: 5833.3. Samples: 277730392. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:36:06,544][25689] Avg episode reward: [(0, '-47.561')] [2022-07-09 13:36:06,613][26022] Updated weights on worker 0-0, policy_version 271223 (0.00083) [2022-07-09 13:36:08,455][26022] Updated weights on worker 0-0, policy_version 271233 (0.00088) [2022-07-09 13:36:10,202][26022] Updated weights on worker 0-0, policy_version 271243 (0.00059) [2022-07-09 13:36:11,680][25689] Fps is (10 sec: 5534.2, 60 sec: 5641.2, 300 sec: 5656.2). Total num frames: 277760000. Throughput: 0: 5802.2. Samples: 277764452. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:36:11,680][25689] Avg episode reward: [(0, '-47.913')] [2022-07-09 13:36:12,079][26022] Updated weights on worker 0-0, policy_version 271253 (0.00085) [2022-07-09 13:36:13,713][26022] Updated weights on worker 0-0, policy_version 271263 (0.00081) [2022-07-09 13:36:15,527][26022] Updated weights on worker 0-0, policy_version 271273 (0.00082) [2022-07-09 13:36:16,704][25689] Fps is (10 sec: 5744.7, 60 sec: 5658.8, 300 sec: 5655.9). Total num frames: 277789696. Throughput: 0: 5811.8. Samples: 277798944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:36:16,705][25689] Avg episode reward: [(0, '-47.953')] [2022-07-09 13:36:17,547][26022] Updated weights on worker 0-0, policy_version 271283 (0.00097) [2022-07-09 13:36:19,128][26022] Updated weights on worker 0-0, policy_version 271293 (0.00091) [2022-07-09 13:36:21,142][26022] Updated weights on worker 0-0, policy_version 271303 (0.00093) [2022-07-09 13:36:21,762][25689] Fps is (10 sec: 5789.3, 60 sec: 5620.3, 300 sec: 5658.4). Total num frames: 277818368. Throughput: 0: 5804.5. Samples: 277815998. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:36:21,762][25689] Avg episode reward: [(0, '-47.369')] [2022-07-09 13:36:22,731][26022] Updated weights on worker 0-0, policy_version 271313 (0.00097) [2022-07-09 13:36:24,575][26022] Updated weights on worker 0-0, policy_version 271323 (0.00090) [2022-07-09 13:36:26,230][26022] Updated weights on worker 0-0, policy_version 271333 (0.00081) [2022-07-09 13:36:26,825][25689] Fps is (10 sec: 5665.7, 60 sec: 5650.8, 300 sec: 5658.9). Total num frames: 277847040. Throughput: 0: 5914.4. Samples: 277850346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 13:36:26,826][25689] Avg episode reward: [(0, '-47.653')] [2022-07-09 13:36:28,155][26022] Updated weights on worker 0-0, policy_version 271343 (0.00096) [2022-07-09 13:36:30,000][26022] Updated weights on worker 0-0, policy_version 271353 (0.00089) [2022-07-09 13:36:31,876][25689] Fps is (10 sec: 5669.7, 60 sec: 5684.5, 300 sec: 5654.6). Total num frames: 277875712. Throughput: 0: 5934.9. Samples: 277884316. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:36:31,876][25689] Avg episode reward: [(0, '-48.302')] [2022-07-09 13:36:31,878][26022] Updated weights on worker 0-0, policy_version 271363 (0.00092) [2022-07-09 13:36:33,645][26022] Updated weights on worker 0-0, policy_version 271373 (0.00085) [2022-07-09 13:36:35,402][26022] Updated weights on worker 0-0, policy_version 271383 (0.00085) [2022-07-09 13:36:36,926][25689] Fps is (10 sec: 5677.3, 60 sec: 5630.0, 300 sec: 5651.8). Total num frames: 277904384. Throughput: 0: 5059.1. Samples: 277901248. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:36:36,927][25689] Avg episode reward: [(0, '-49.727')] [2022-07-09 13:36:37,160][26022] Updated weights on worker 0-0, policy_version 271393 (0.00086) [2022-07-09 13:36:38,868][26022] Updated weights on worker 0-0, policy_version 271403 (0.00088) [2022-07-09 13:36:41,002][26022] Updated weights on worker 0-0, policy_version 271413 (0.00090) [2022-07-09 13:36:41,976][25689] Fps is (10 sec: 5677.8, 60 sec: 5660.4, 300 sec: 5654.6). Total num frames: 277933056. Throughput: 0: 5899.9. Samples: 277935258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:36:41,976][25689] Avg episode reward: [(0, '-49.745')] [2022-07-09 13:36:42,675][26022] Updated weights on worker 0-0, policy_version 271423 (0.00089) [2022-07-09 13:36:44,431][26022] Updated weights on worker 0-0, policy_version 271433 (0.00051) [2022-07-09 13:36:46,019][26022] Updated weights on worker 0-0, policy_version 271443 (0.00088) [2022-07-09 13:36:46,995][25689] Fps is (10 sec: 5593.6, 60 sec: 5626.7, 300 sec: 5652.7). Total num frames: 277960704. Throughput: 0: 5911.4. Samples: 277969574. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:36:46,995][25689] Avg episode reward: [(0, '-50.234')] [2022-07-09 13:36:48,069][26022] Updated weights on worker 0-0, policy_version 271453 (0.00087) [2022-07-09 13:36:49,755][26022] Updated weights on worker 0-0, policy_version 271463 (0.00084) [2022-07-09 13:36:51,564][26022] Updated weights on worker 0-0, policy_version 271473 (0.00082) [2022-07-09 13:36:52,102][25689] Fps is (10 sec: 5764.2, 60 sec: 5674.0, 300 sec: 5658.3). Total num frames: 277991424. Throughput: 0: 5058.6. Samples: 277986630. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:36:52,102][25689] Avg episode reward: [(0, '-50.222')] [2022-07-09 13:36:53,438][26022] Updated weights on worker 0-0, policy_version 271483 (0.00089) [2022-07-09 13:36:55,050][26022] Updated weights on worker 0-0, policy_version 271493 (0.00112) [2022-07-09 13:36:57,074][26022] Updated weights on worker 0-0, policy_version 271503 (0.00090) [2022-07-09 13:36:57,121][25689] Fps is (10 sec: 5764.1, 60 sec: 5657.4, 300 sec: 5648.0). Total num frames: 278019072. Throughput: 0: 5925.1. Samples: 278020904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:36:57,121][25689] Avg episode reward: [(0, '-49.531')] [2022-07-09 13:36:58,761][26022] Updated weights on worker 0-0, policy_version 271513 (0.00094) [2022-07-09 13:37:00,168][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:37:00,181][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000271520_278036480.pth [2022-07-09 13:37:00,181][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000269529_275997696.pth [2022-07-09 13:37:00,638][26022] Updated weights on worker 0-0, policy_version 271523 (0.00093) [2022-07-09 13:37:02,139][25689] Fps is (10 sec: 5611.0, 60 sec: 5674.3, 300 sec: 5661.5). Total num frames: 278047744. Throughput: 0: 5945.5. Samples: 278055138. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:02,140][25689] Avg episode reward: [(0, '-48.985')] [2022-07-09 13:37:02,756][26022] Updated weights on worker 0-0, policy_version 271533 (0.00086) [2022-07-09 13:37:04,576][26022] Updated weights on worker 0-0, policy_version 271543 (0.00100) [2022-07-09 13:37:06,371][26022] Updated weights on worker 0-0, policy_version 271553 (0.00436) [2022-07-09 13:37:07,149][25689] Fps is (10 sec: 5411.7, 60 sec: 5643.3, 300 sec: 5649.3). Total num frames: 278073344. Throughput: 0: 4988.5. Samples: 278070116. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:07,150][25689] Avg episode reward: [(0, '-48.338')] [2022-07-09 13:37:08,033][26022] Updated weights on worker 0-0, policy_version 271563 (0.00090) [2022-07-09 13:37:09,900][26022] Updated weights on worker 0-0, policy_version 271573 (0.00095) [2022-07-09 13:37:11,786][26022] Updated weights on worker 0-0, policy_version 271583 (0.00086) [2022-07-09 13:37:12,227][25689] Fps is (10 sec: 5481.5, 60 sec: 5665.7, 300 sec: 5655.0). Total num frames: 278103040. Throughput: 0: 5848.5. Samples: 278104332. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:12,227][25689] Avg episode reward: [(0, '-48.616')] [2022-07-09 13:37:13,451][26022] Updated weights on worker 0-0, policy_version 271593 (0.00096) [2022-07-09 13:37:15,453][26022] Updated weights on worker 0-0, policy_version 271603 (0.00094) [2022-07-09 13:37:17,072][26022] Updated weights on worker 0-0, policy_version 271613 (0.00085) [2022-07-09 13:37:17,228][25689] Fps is (10 sec: 5791.3, 60 sec: 5651.0, 300 sec: 5651.7). Total num frames: 278131712. Throughput: 0: 5860.9. Samples: 278138750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:17,228][25689] Avg episode reward: [(0, '-47.962')] [2022-07-09 13:37:19,051][26022] Updated weights on worker 0-0, policy_version 271623 (0.00089) [2022-07-09 13:37:20,622][26022] Updated weights on worker 0-0, policy_version 271633 (0.00089) [2022-07-09 13:37:22,245][25689] Fps is (10 sec: 5621.5, 60 sec: 5637.8, 300 sec: 5651.6). Total num frames: 278159360. Throughput: 0: 5018.5. Samples: 278156042. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:22,246][25689] Avg episode reward: [(0, '-48.228')] [2022-07-09 13:37:22,582][26022] Updated weights on worker 0-0, policy_version 271643 (0.00085) [2022-07-09 13:37:24,124][26022] Updated weights on worker 0-0, policy_version 271653 (0.00088) [2022-07-09 13:37:26,035][26022] Updated weights on worker 0-0, policy_version 271663 (0.00099) [2022-07-09 13:37:27,271][25689] Fps is (10 sec: 5811.8, 60 sec: 5675.2, 300 sec: 5656.9). Total num frames: 278190080. Throughput: 0: 5994.0. Samples: 278190724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:27,271][25689] Avg episode reward: [(0, '-48.326')] [2022-07-09 13:37:27,887][26022] Updated weights on worker 0-0, policy_version 271673 (0.00099) [2022-07-09 13:37:29,669][26022] Updated weights on worker 0-0, policy_version 271683 (0.00089) [2022-07-09 13:37:31,534][26022] Updated weights on worker 0-0, policy_version 271693 (0.00091) [2022-07-09 13:37:32,315][25689] Fps is (10 sec: 5796.2, 60 sec: 5658.8, 300 sec: 5659.9). Total num frames: 278217728. Throughput: 0: 5989.8. Samples: 278224658. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:32,316][25689] Avg episode reward: [(0, '-48.579')] [2022-07-09 13:37:33,308][26022] Updated weights on worker 0-0, policy_version 271703 (0.00079) [2022-07-09 13:37:34,933][26022] Updated weights on worker 0-0, policy_version 271713 (0.00087) [2022-07-09 13:37:37,033][26022] Updated weights on worker 0-0, policy_version 271723 (0.00093) [2022-07-09 13:37:37,327][25689] Fps is (10 sec: 5600.7, 60 sec: 5662.5, 300 sec: 5656.4). Total num frames: 278246400. Throughput: 0: 5141.0. Samples: 278242078. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:37,327][25689] Avg episode reward: [(0, '-48.900')] [2022-07-09 13:37:38,445][26022] Updated weights on worker 0-0, policy_version 271733 (0.00092) [2022-07-09 13:37:40,448][26022] Updated weights on worker 0-0, policy_version 271743 (0.00096) [2022-07-09 13:37:42,082][26022] Updated weights on worker 0-0, policy_version 271753 (0.00088) [2022-07-09 13:37:42,339][25689] Fps is (10 sec: 5823.2, 60 sec: 5683.0, 300 sec: 5663.5). Total num frames: 278276096. Throughput: 0: 5976.6. Samples: 278276132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:42,339][25689] Avg episode reward: [(0, '-49.224')] [2022-07-09 13:37:44,091][26022] Updated weights on worker 0-0, policy_version 271763 (0.00085) [2022-07-09 13:37:46,000][26022] Updated weights on worker 0-0, policy_version 271773 (0.00086) [2022-07-09 13:37:47,357][25689] Fps is (10 sec: 5717.2, 60 sec: 5683.0, 300 sec: 5657.3). Total num frames: 278303744. Throughput: 0: 5962.6. Samples: 278310488. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:47,357][25689] Avg episode reward: [(0, '-49.097')] [2022-07-09 13:37:47,595][26022] Updated weights on worker 0-0, policy_version 271783 (0.00092) [2022-07-09 13:37:49,502][26022] Updated weights on worker 0-0, policy_version 271793 (0.00087) [2022-07-09 13:37:51,254][26022] Updated weights on worker 0-0, policy_version 271803 (0.00089) [2022-07-09 13:37:52,425][25689] Fps is (10 sec: 5685.2, 60 sec: 5669.7, 300 sec: 5656.8). Total num frames: 278333440. Throughput: 0: 5115.1. Samples: 278327522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:52,426][25689] Avg episode reward: [(0, '-48.900')] [2022-07-09 13:37:53,085][26022] Updated weights on worker 0-0, policy_version 271813 (0.00091) [2022-07-09 13:37:54,931][26022] Updated weights on worker 0-0, policy_version 271823 (0.00095) [2022-07-09 13:37:56,583][26022] Updated weights on worker 0-0, policy_version 271833 (0.00083) [2022-07-09 13:37:57,469][25689] Fps is (10 sec: 5670.7, 60 sec: 5667.4, 300 sec: 5652.7). Total num frames: 278361088. Throughput: 0: 5954.5. Samples: 278362016. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:37:57,471][25689] Avg episode reward: [(0, '-49.486')] [2022-07-09 13:37:58,405][26022] Updated weights on worker 0-0, policy_version 271843 (0.00095) [2022-07-09 13:38:00,380][26022] Updated weights on worker 0-0, policy_version 271853 (0.00096) [2022-07-09 13:38:02,140][26022] Updated weights on worker 0-0, policy_version 271863 (0.00087) [2022-07-09 13:38:02,543][25689] Fps is (10 sec: 5465.4, 60 sec: 5645.2, 300 sec: 5658.4). Total num frames: 278388736. Throughput: 0: 5928.9. Samples: 278395920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:02,543][25689] Avg episode reward: [(0, '-49.121')] [2022-07-09 13:38:04,155][26022] Updated weights on worker 0-0, policy_version 271873 (0.00083) [2022-07-09 13:38:05,971][26022] Updated weights on worker 0-0, policy_version 271883 (0.00092) [2022-07-09 13:38:07,551][25689] Fps is (10 sec: 5586.3, 60 sec: 5696.3, 300 sec: 5657.9). Total num frames: 278417408. Throughput: 0: 5002.0. Samples: 278411498. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:07,551][25689] Avg episode reward: [(0, '-49.103')] [2022-07-09 13:38:07,778][26022] Updated weights on worker 0-0, policy_version 271893 (0.00092) [2022-07-09 13:38:09,475][26022] Updated weights on worker 0-0, policy_version 271903 (0.00087) [2022-07-09 13:38:11,421][26022] Updated weights on worker 0-0, policy_version 271913 (0.00091) [2022-07-09 13:38:12,596][25689] Fps is (10 sec: 5704.3, 60 sec: 5682.4, 300 sec: 5653.8). Total num frames: 278446080. Throughput: 0: 5861.4. Samples: 278445748. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:12,596][25689] Avg episode reward: [(0, '-48.592')] [2022-07-09 13:38:13,192][26022] Updated weights on worker 0-0, policy_version 271923 (0.00087) [2022-07-09 13:38:14,794][26022] Updated weights on worker 0-0, policy_version 271933 (0.00087) [2022-07-09 13:38:16,761][26022] Updated weights on worker 0-0, policy_version 271943 (0.00081) [2022-07-09 13:38:17,620][25689] Fps is (10 sec: 5796.8, 60 sec: 5697.2, 300 sec: 5660.5). Total num frames: 278475776. Throughput: 0: 5844.0. Samples: 278479778. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:17,621][25689] Avg episode reward: [(0, '-48.399')] [2022-07-09 13:38:18,430][26022] Updated weights on worker 0-0, policy_version 271953 (0.00095) [2022-07-09 13:38:20,414][26022] Updated weights on worker 0-0, policy_version 271963 (0.00088) [2022-07-09 13:38:22,256][26022] Updated weights on worker 0-0, policy_version 271973 (0.00092) [2022-07-09 13:38:22,627][25689] Fps is (10 sec: 5614.2, 60 sec: 5681.2, 300 sec: 5650.2). Total num frames: 278502400. Throughput: 0: 5035.1. Samples: 278497048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:22,628][25689] Avg episode reward: [(0, '-48.637')] [2022-07-09 13:38:23,887][26022] Updated weights on worker 0-0, policy_version 271983 (0.00089) [2022-07-09 13:38:25,952][26022] Updated weights on worker 0-0, policy_version 271993 (0.00087) [2022-07-09 13:38:27,450][26022] Updated weights on worker 0-0, policy_version 272003 (0.00092) [2022-07-09 13:38:27,654][25689] Fps is (10 sec: 5613.1, 60 sec: 5664.1, 300 sec: 5657.8). Total num frames: 278532096. Throughput: 0: 5951.4. Samples: 278531138. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:27,654][25689] Avg episode reward: [(0, '-48.149')] [2022-07-09 13:38:29,430][26022] Updated weights on worker 0-0, policy_version 272013 (0.00405) [2022-07-09 13:38:30,975][26022] Updated weights on worker 0-0, policy_version 272023 (0.00090) [2022-07-09 13:38:32,779][25689] Fps is (10 sec: 5648.8, 60 sec: 5656.6, 300 sec: 5648.6). Total num frames: 278559744. Throughput: 0: 5915.2. Samples: 278565136. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:32,779][25689] Avg episode reward: [(0, '-47.684')] [2022-07-09 13:38:32,935][26022] Updated weights on worker 0-0, policy_version 272033 (0.00090) [2022-07-09 13:38:34,722][26022] Updated weights on worker 0-0, policy_version 272043 (0.00090) [2022-07-09 13:38:36,474][26022] Updated weights on worker 0-0, policy_version 272053 (0.00092) [2022-07-09 13:38:37,781][25689] Fps is (10 sec: 5662.2, 60 sec: 5674.4, 300 sec: 5659.7). Total num frames: 278589440. Throughput: 0: 5089.4. Samples: 278582386. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:37,783][25689] Avg episode reward: [(0, '-48.273')] [2022-07-09 13:38:38,259][26022] Updated weights on worker 0-0, policy_version 272063 (0.00091) [2022-07-09 13:38:39,986][26022] Updated weights on worker 0-0, policy_version 272073 (0.00502) [2022-07-09 13:38:41,976][26022] Updated weights on worker 0-0, policy_version 272083 (0.00107) [2022-07-09 13:38:42,811][25689] Fps is (10 sec: 5716.2, 60 sec: 5638.8, 300 sec: 5652.6). Total num frames: 278617088. Throughput: 0: 5926.2. Samples: 278616658. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:42,811][25689] Avg episode reward: [(0, '-48.297')] [2022-07-09 13:38:43,625][26022] Updated weights on worker 0-0, policy_version 272093 (0.00084) [2022-07-09 13:38:45,486][26022] Updated weights on worker 0-0, policy_version 272103 (0.00088) [2022-07-09 13:38:47,308][26022] Updated weights on worker 0-0, policy_version 272113 (0.00091) [2022-07-09 13:38:47,884][25689] Fps is (10 sec: 5574.8, 60 sec: 5650.6, 300 sec: 5652.2). Total num frames: 278645760. Throughput: 0: 5908.1. Samples: 278650660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 13:38:47,884][25689] Avg episode reward: [(0, '-48.634')] [2022-07-09 13:38:49,130][26022] Updated weights on worker 0-0, policy_version 272123 (0.00093) [2022-07-09 13:38:51,048][26022] Updated weights on worker 0-0, policy_version 272133 (0.00089) [2022-07-09 13:38:52,734][26022] Updated weights on worker 0-0, policy_version 272143 (0.00090) [2022-07-09 13:38:52,963][25689] Fps is (10 sec: 5749.2, 60 sec: 5649.6, 300 sec: 5658.4). Total num frames: 278675456. Throughput: 0: 5907.6. Samples: 278684376. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:38:52,963][25689] Avg episode reward: [(0, '-48.861')] [2022-07-09 13:38:54,712][26022] Updated weights on worker 0-0, policy_version 272153 (0.00094) [2022-07-09 13:38:56,390][26022] Updated weights on worker 0-0, policy_version 272163 (0.00087) [2022-07-09 13:38:57,982][25689] Fps is (10 sec: 5577.1, 60 sec: 5635.0, 300 sec: 5652.5). Total num frames: 278702080. Throughput: 0: 5886.1. Samples: 278701292. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:38:57,983][25689] Avg episode reward: [(0, '-49.189')] [2022-07-09 13:38:58,233][26022] Updated weights on worker 0-0, policy_version 272173 (0.00089) [2022-07-09 13:39:00,250][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:39:00,262][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000272183_278715392.pth [2022-07-09 13:39:00,262][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000270194_276678656.pth [2022-07-09 13:39:00,266][26022] Updated weights on worker 0-0, policy_version 272183 (0.00078) [2022-07-09 13:39:02,431][26022] Updated weights on worker 0-0, policy_version 272193 (0.00096) [2022-07-09 13:39:02,994][25689] Fps is (10 sec: 5308.4, 60 sec: 5623.9, 300 sec: 5652.8). Total num frames: 278728704. Throughput: 0: 5789.1. Samples: 278733500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:02,996][25689] Avg episode reward: [(0, '-50.316')] [2022-07-09 13:39:04,038][26022] Updated weights on worker 0-0, policy_version 272203 (0.00090) [2022-07-09 13:39:05,831][26022] Updated weights on worker 0-0, policy_version 272213 (0.00089) [2022-07-09 13:39:07,537][26022] Updated weights on worker 0-0, policy_version 272223 (0.00086) [2022-07-09 13:39:08,023][25689] Fps is (10 sec: 5609.2, 60 sec: 5638.8, 300 sec: 5664.1). Total num frames: 278758400. Throughput: 0: 5788.8. Samples: 278767242. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:08,023][25689] Avg episode reward: [(0, '-50.795')] [2022-07-09 13:39:09,803][26022] Updated weights on worker 0-0, policy_version 272233 (0.00106) [2022-07-09 13:39:11,159][26022] Updated weights on worker 0-0, policy_version 272243 (0.00087) [2022-07-09 13:39:13,072][25689] Fps is (10 sec: 5588.1, 60 sec: 5604.5, 300 sec: 5653.0). Total num frames: 278785024. Throughput: 0: 4963.5. Samples: 278784190. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:13,074][25689] Avg episode reward: [(0, '-49.846')] [2022-07-09 13:39:13,272][26022] Updated weights on worker 0-0, policy_version 272253 (0.00090) [2022-07-09 13:39:14,906][26022] Updated weights on worker 0-0, policy_version 272263 (0.00086) [2022-07-09 13:39:16,887][26022] Updated weights on worker 0-0, policy_version 272273 (0.00061) [2022-07-09 13:39:18,088][25689] Fps is (10 sec: 5595.7, 60 sec: 5605.4, 300 sec: 5652.8). Total num frames: 278814720. Throughput: 0: 5813.5. Samples: 278818176. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:18,090][25689] Avg episode reward: [(0, '-50.021')] [2022-07-09 13:39:18,595][26022] Updated weights on worker 0-0, policy_version 272283 (0.00084) [2022-07-09 13:39:20,256][26022] Updated weights on worker 0-0, policy_version 272293 (0.00081) [2022-07-09 13:39:22,207][26022] Updated weights on worker 0-0, policy_version 272303 (0.00087) [2022-07-09 13:39:23,101][25689] Fps is (10 sec: 5819.9, 60 sec: 5638.6, 300 sec: 5652.7). Total num frames: 278843392. Throughput: 0: 5918.3. Samples: 278852504. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:23,103][25689] Avg episode reward: [(0, '-50.272')] [2022-07-09 13:39:24,001][26022] Updated weights on worker 0-0, policy_version 272313 (0.00086) [2022-07-09 13:39:25,652][26022] Updated weights on worker 0-0, policy_version 272323 (0.00093) [2022-07-09 13:39:27,684][26022] Updated weights on worker 0-0, policy_version 272333 (0.00093) [2022-07-09 13:39:28,127][25689] Fps is (10 sec: 5610.1, 60 sec: 5604.9, 300 sec: 5657.1). Total num frames: 278871040. Throughput: 0: 5088.2. Samples: 278869534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:28,128][25689] Avg episode reward: [(0, '-49.485')] [2022-07-09 13:39:29,083][26022] Updated weights on worker 0-0, policy_version 272343 (0.00089) [2022-07-09 13:39:31,303][26022] Updated weights on worker 0-0, policy_version 272353 (0.00091) [2022-07-09 13:39:32,917][26022] Updated weights on worker 0-0, policy_version 272363 (0.00092) [2022-07-09 13:39:33,211][25689] Fps is (10 sec: 5672.1, 60 sec: 5642.5, 300 sec: 5652.4). Total num frames: 278900736. Throughput: 0: 5919.8. Samples: 278903408. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:33,212][25689] Avg episode reward: [(0, '-48.810')] [2022-07-09 13:39:34,894][26022] Updated weights on worker 0-0, policy_version 272373 (0.00090) [2022-07-09 13:39:36,658][26022] Updated weights on worker 0-0, policy_version 272383 (0.00090) [2022-07-09 13:39:38,232][25689] Fps is (10 sec: 5674.4, 60 sec: 5606.9, 300 sec: 5652.4). Total num frames: 278928384. Throughput: 0: 5912.9. Samples: 278937290. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:38,233][25689] Avg episode reward: [(0, '-48.286')] [2022-07-09 13:39:38,424][26022] Updated weights on worker 0-0, policy_version 272393 (0.00088) [2022-07-09 13:39:40,275][26022] Updated weights on worker 0-0, policy_version 272403 (0.00095) [2022-07-09 13:39:42,239][26022] Updated weights on worker 0-0, policy_version 272413 (0.00089) [2022-07-09 13:39:43,255][25689] Fps is (10 sec: 5505.7, 60 sec: 5607.6, 300 sec: 5648.6). Total num frames: 278956032. Throughput: 0: 5033.6. Samples: 278953948. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:43,255][25689] Avg episode reward: [(0, '-49.001')] [2022-07-09 13:39:43,892][26022] Updated weights on worker 0-0, policy_version 272423 (0.00084) [2022-07-09 13:39:45,682][26022] Updated weights on worker 0-0, policy_version 272433 (0.00092) [2022-07-09 13:39:47,643][26022] Updated weights on worker 0-0, policy_version 272443 (0.00085) [2022-07-09 13:39:48,283][25689] Fps is (10 sec: 5501.9, 60 sec: 5594.8, 300 sec: 5639.7). Total num frames: 278983680. Throughput: 0: 5868.1. Samples: 278987812. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:48,283][25689] Avg episode reward: [(0, '-47.906')] [2022-07-09 13:39:49,245][26022] Updated weights on worker 0-0, policy_version 272453 (0.00106) [2022-07-09 13:39:51,508][26022] Updated weights on worker 0-0, policy_version 272463 (0.00090) [2022-07-09 13:39:52,793][26022] Updated weights on worker 0-0, policy_version 272473 (0.00095) [2022-07-09 13:39:53,399][25689] Fps is (10 sec: 5854.4, 60 sec: 5625.2, 300 sec: 5649.3). Total num frames: 279015424. Throughput: 0: 5879.0. Samples: 279022094. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:53,404][25689] Avg episode reward: [(0, '-47.972')] [2022-07-09 13:39:54,882][26022] Updated weights on worker 0-0, policy_version 272483 (0.00091) [2022-07-09 13:39:56,471][26022] Updated weights on worker 0-0, policy_version 272493 (0.00079) [2022-07-09 13:39:58,250][26022] Updated weights on worker 0-0, policy_version 272503 (0.00087) [2022-07-09 13:39:58,439][25689] Fps is (10 sec: 5847.9, 60 sec: 5640.3, 300 sec: 5645.3). Total num frames: 279043072. Throughput: 0: 5043.8. Samples: 279039208. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:39:58,439][25689] Avg episode reward: [(0, '-47.722')] [2022-07-09 13:40:00,255][26022] Updated weights on worker 0-0, policy_version 272513 (0.00094) [2022-07-09 13:40:02,218][26022] Updated weights on worker 0-0, policy_version 272523 (0.00091) [2022-07-09 13:40:03,449][25689] Fps is (10 sec: 5400.7, 60 sec: 5640.4, 300 sec: 5645.7). Total num frames: 279069696. Throughput: 0: 5812.2. Samples: 279071320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:03,449][25689] Avg episode reward: [(0, '-48.450')] [2022-07-09 13:40:03,990][26022] Updated weights on worker 0-0, policy_version 272533 (0.00089) [2022-07-09 13:40:06,172][26022] Updated weights on worker 0-0, policy_version 272543 (0.00094) [2022-07-09 13:40:07,531][26022] Updated weights on worker 0-0, policy_version 272553 (0.00078) [2022-07-09 13:40:08,543][25689] Fps is (10 sec: 5472.8, 60 sec: 5617.5, 300 sec: 5648.5). Total num frames: 279098368. Throughput: 0: 5816.5. Samples: 279105654. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:08,543][25689] Avg episode reward: [(0, '-48.462')] [2022-07-09 13:40:09,687][26022] Updated weights on worker 0-0, policy_version 272563 (0.00086) [2022-07-09 13:40:11,440][26022] Updated weights on worker 0-0, policy_version 272573 (0.00087) [2022-07-09 13:40:13,214][26022] Updated weights on worker 0-0, policy_version 272583 (0.00085) [2022-07-09 13:40:13,587][25689] Fps is (10 sec: 5757.3, 60 sec: 5668.7, 300 sec: 5651.7). Total num frames: 279128064. Throughput: 0: 4976.0. Samples: 279122546. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:13,587][25689] Avg episode reward: [(0, '-49.291')] [2022-07-09 13:40:15,049][26022] Updated weights on worker 0-0, policy_version 272593 (0.00089) [2022-07-09 13:40:16,697][26022] Updated weights on worker 0-0, policy_version 272603 (0.00100) [2022-07-09 13:40:18,583][26022] Updated weights on worker 0-0, policy_version 272613 (0.00098) [2022-07-09 13:40:18,597][25689] Fps is (10 sec: 5703.4, 60 sec: 5635.4, 300 sec: 5641.4). Total num frames: 279155712. Throughput: 0: 5835.0. Samples: 279156832. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:18,598][25689] Avg episode reward: [(0, '-50.316')] [2022-07-09 13:40:20,370][26022] Updated weights on worker 0-0, policy_version 272623 (0.00093) [2022-07-09 13:40:22,085][26022] Updated weights on worker 0-0, policy_version 272633 (0.00087) [2022-07-09 13:40:23,610][25689] Fps is (10 sec: 5516.7, 60 sec: 5618.5, 300 sec: 5645.1). Total num frames: 279183360. Throughput: 0: 5942.6. Samples: 279191132. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:23,611][25689] Avg episode reward: [(0, '-50.090')] [2022-07-09 13:40:23,929][26022] Updated weights on worker 0-0, policy_version 272643 (0.00085) [2022-07-09 13:40:25,520][26022] Updated weights on worker 0-0, policy_version 272653 (0.00093) [2022-07-09 13:40:27,545][26022] Updated weights on worker 0-0, policy_version 272663 (0.00087) [2022-07-09 13:40:28,612][25689] Fps is (10 sec: 5725.7, 60 sec: 5654.5, 300 sec: 5656.3). Total num frames: 279213056. Throughput: 0: 5114.0. Samples: 279208290. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:28,613][25689] Avg episode reward: [(0, '-50.624')] [2022-07-09 13:40:29,131][26022] Updated weights on worker 0-0, policy_version 272673 (0.00050) [2022-07-09 13:40:31,087][26022] Updated weights on worker 0-0, policy_version 272683 (0.00089) [2022-07-09 13:40:33,074][26022] Updated weights on worker 0-0, policy_version 272693 (0.00090) [2022-07-09 13:40:33,683][25689] Fps is (10 sec: 5693.1, 60 sec: 5622.0, 300 sec: 5641.4). Total num frames: 279240704. Throughput: 0: 5962.0. Samples: 279242358. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:33,683][25689] Avg episode reward: [(0, '-50.464')] [2022-07-09 13:40:34,902][26022] Updated weights on worker 0-0, policy_version 272703 (0.00114) [2022-07-09 13:40:36,568][26022] Updated weights on worker 0-0, policy_version 272713 (0.00092) [2022-07-09 13:40:38,480][26022] Updated weights on worker 0-0, policy_version 272723 (0.00096) [2022-07-09 13:40:38,715][25689] Fps is (10 sec: 5574.9, 60 sec: 5637.9, 300 sec: 5647.9). Total num frames: 279269376. Throughput: 0: 5959.7. Samples: 279276728. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:38,715][25689] Avg episode reward: [(0, '-49.546')] [2022-07-09 13:40:39,959][26022] Updated weights on worker 0-0, policy_version 272733 (0.00092) [2022-07-09 13:40:41,923][26022] Updated weights on worker 0-0, policy_version 272743 (0.00082) [2022-07-09 13:40:43,645][26022] Updated weights on worker 0-0, policy_version 272753 (0.00094) [2022-07-09 13:40:43,725][25689] Fps is (10 sec: 5812.4, 60 sec: 5672.9, 300 sec: 5648.1). Total num frames: 279299072. Throughput: 0: 5114.8. Samples: 279294014. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:43,725][25689] Avg episode reward: [(0, '-49.033')] [2022-07-09 13:40:45,428][26022] Updated weights on worker 0-0, policy_version 272763 (0.00084) [2022-07-09 13:40:47,480][26022] Updated weights on worker 0-0, policy_version 272773 (0.00084) [2022-07-09 13:40:48,740][25689] Fps is (10 sec: 5822.2, 60 sec: 5691.0, 300 sec: 5652.6). Total num frames: 279327744. Throughput: 0: 5944.7. Samples: 279327942. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:48,740][25689] Avg episode reward: [(0, '-48.305')] [2022-07-09 13:40:49,266][26022] Updated weights on worker 0-0, policy_version 272783 (0.00086) [2022-07-09 13:40:51,143][26022] Updated weights on worker 0-0, policy_version 272793 (0.00084) [2022-07-09 13:40:52,639][26022] Updated weights on worker 0-0, policy_version 272803 (0.00082) [2022-07-09 13:40:53,858][25689] Fps is (10 sec: 5658.8, 60 sec: 5640.1, 300 sec: 5650.8). Total num frames: 279356416. Throughput: 0: 5942.5. Samples: 279362250. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:53,859][25689] Avg episode reward: [(0, '-48.879')] [2022-07-09 13:40:54,482][26022] Updated weights on worker 0-0, policy_version 272813 (0.00086) [2022-07-09 13:40:56,256][26022] Updated weights on worker 0-0, policy_version 272823 (0.00092) [2022-07-09 13:40:58,116][26022] Updated weights on worker 0-0, policy_version 272833 (0.00092) [2022-07-09 13:40:58,924][25689] Fps is (10 sec: 5731.3, 60 sec: 5671.5, 300 sec: 5656.7). Total num frames: 279386112. Throughput: 0: 5082.1. Samples: 279379432. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:40:58,924][25689] Avg episode reward: [(0, '-47.989')] [2022-07-09 13:40:59,729][26022] Updated weights on worker 0-0, policy_version 272843 (0.00103) [2022-07-09 13:41:00,420][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:41:00,432][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000272846_279394304.pth [2022-07-09 13:41:00,433][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000270858_277358592.pth [2022-07-09 13:41:02,202][26022] Updated weights on worker 0-0, policy_version 272853 (0.00086) [2022-07-09 13:41:03,748][26022] Updated weights on worker 0-0, policy_version 272863 (0.00087) [2022-07-09 13:41:03,935][25689] Fps is (10 sec: 5487.2, 60 sec: 5654.4, 300 sec: 5650.4). Total num frames: 279411712. Throughput: 0: 5829.4. Samples: 279411832. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:41:03,936][25689] Avg episode reward: [(0, '-47.810')] [2022-07-09 13:41:05,764][26022] Updated weights on worker 0-0, policy_version 272873 (0.00090) [2022-07-09 13:41:07,382][26022] Updated weights on worker 0-0, policy_version 272883 (0.00092) [2022-07-09 13:41:08,957][25689] Fps is (10 sec: 5307.3, 60 sec: 5644.2, 300 sec: 5649.1). Total num frames: 279439360. Throughput: 0: 5836.3. Samples: 279445934. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:41:08,957][25689] Avg episode reward: [(0, '-47.401')] [2022-07-09 13:41:09,333][26022] Updated weights on worker 0-0, policy_version 272893 (0.00097) [2022-07-09 13:41:10,947][26022] Updated weights on worker 0-0, policy_version 272903 (0.00085) [2022-07-09 13:41:12,861][26022] Updated weights on worker 0-0, policy_version 272913 (0.00094) [2022-07-09 13:41:14,002][25689] Fps is (10 sec: 5798.4, 60 sec: 5661.1, 300 sec: 5652.2). Total num frames: 279470080. Throughput: 0: 5009.5. Samples: 279463160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 13:41:14,003][25689] Avg episode reward: [(0, '-48.286')] [2022-07-09 13:41:14,563][26022] Updated weights on worker 0-0, policy_version 272923 (0.00090) [2022-07-09 13:41:16,385][26022] Updated weights on worker 0-0, policy_version 272933 (0.00092) [2022-07-09 13:41:18,390][26022] Updated weights on worker 0-0, policy_version 272943 (0.00087) [2022-07-09 13:41:19,020][25689] Fps is (10 sec: 5698.4, 60 sec: 5643.4, 300 sec: 5646.1). Total num frames: 279496704. Throughput: 0: 5860.4. Samples: 279497204. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:19,020][25689] Avg episode reward: [(0, '-48.705')] [2022-07-09 13:41:19,833][26022] Updated weights on worker 0-0, policy_version 272953 (0.00091) [2022-07-09 13:41:21,945][26022] Updated weights on worker 0-0, policy_version 272963 (0.00795) [2022-07-09 13:41:23,560][26022] Updated weights on worker 0-0, policy_version 272973 (0.00087) [2022-07-09 13:41:24,099][25689] Fps is (10 sec: 5476.4, 60 sec: 5654.2, 300 sec: 5645.8). Total num frames: 279525376. Throughput: 0: 5937.0. Samples: 279531542. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:24,099][25689] Avg episode reward: [(0, '-48.034')] [2022-07-09 13:41:25,455][26022] Updated weights on worker 0-0, policy_version 272983 (0.00089) [2022-07-09 13:41:27,307][26022] Updated weights on worker 0-0, policy_version 272993 (0.00084) [2022-07-09 13:41:28,992][26022] Updated weights on worker 0-0, policy_version 273003 (0.00089) [2022-07-09 13:41:29,161][25689] Fps is (10 sec: 5755.4, 60 sec: 5648.6, 300 sec: 5649.0). Total num frames: 279555072. Throughput: 0: 5073.9. Samples: 279548450. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:29,162][25689] Avg episode reward: [(0, '-48.377')] [2022-07-09 13:41:30,890][26022] Updated weights on worker 0-0, policy_version 273013 (0.00096) [2022-07-09 13:41:32,796][26022] Updated weights on worker 0-0, policy_version 273023 (0.00095) [2022-07-09 13:41:34,233][25689] Fps is (10 sec: 5860.5, 60 sec: 5682.3, 300 sec: 5652.0). Total num frames: 279584768. Throughput: 0: 5905.0. Samples: 279582628. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:34,234][25689] Avg episode reward: [(0, '-48.403')] [2022-07-09 13:41:34,291][26022] Updated weights on worker 0-0, policy_version 273033 (0.00088) [2022-07-09 13:41:36,315][26022] Updated weights on worker 0-0, policy_version 273043 (0.00097) [2022-07-09 13:41:37,944][26022] Updated weights on worker 0-0, policy_version 273053 (0.00092) [2022-07-09 13:41:39,247][25689] Fps is (10 sec: 5685.6, 60 sec: 5667.0, 300 sec: 5649.2). Total num frames: 279612416. Throughput: 0: 5922.1. Samples: 279616992. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:39,248][25689] Avg episode reward: [(0, '-47.401')] [2022-07-09 13:41:39,804][26022] Updated weights on worker 0-0, policy_version 273063 (0.00083) [2022-07-09 13:41:41,711][26022] Updated weights on worker 0-0, policy_version 273073 (0.00081) [2022-07-09 13:41:43,501][26022] Updated weights on worker 0-0, policy_version 273083 (0.00092) [2022-07-09 13:41:44,249][25689] Fps is (10 sec: 5520.7, 60 sec: 5633.9, 300 sec: 5649.6). Total num frames: 279640064. Throughput: 0: 5080.3. Samples: 279633912. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:44,250][25689] Avg episode reward: [(0, '-47.598')] [2022-07-09 13:41:45,362][26022] Updated weights on worker 0-0, policy_version 273093 (0.00087) [2022-07-09 13:41:47,200][26022] Updated weights on worker 0-0, policy_version 273103 (0.00100) [2022-07-09 13:41:48,797][26022] Updated weights on worker 0-0, policy_version 273113 (0.00097) [2022-07-09 13:41:49,262][25689] Fps is (10 sec: 5623.6, 60 sec: 5634.2, 300 sec: 5644.5). Total num frames: 279668736. Throughput: 0: 5939.3. Samples: 279667834. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:49,263][25689] Avg episode reward: [(0, '-47.358')] [2022-07-09 13:41:50,744][26022] Updated weights on worker 0-0, policy_version 273123 (0.00083) [2022-07-09 13:41:52,446][26022] Updated weights on worker 0-0, policy_version 273133 (0.00090) [2022-07-09 13:41:54,307][26022] Updated weights on worker 0-0, policy_version 273143 (0.00092) [2022-07-09 13:41:54,309][25689] Fps is (10 sec: 5801.9, 60 sec: 5657.7, 300 sec: 5650.8). Total num frames: 279698432. Throughput: 0: 5951.9. Samples: 279702120. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:54,310][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 13:41:56,188][26022] Updated weights on worker 0-0, policy_version 273153 (0.00090) [2022-07-09 13:41:57,996][26022] Updated weights on worker 0-0, policy_version 273163 (0.00086) [2022-07-09 13:41:59,354][25689] Fps is (10 sec: 5783.6, 60 sec: 5642.7, 300 sec: 5650.3). Total num frames: 279727104. Throughput: 0: 5940.3. Samples: 279736432. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:41:59,354][25689] Avg episode reward: [(0, '-47.802')] [2022-07-09 13:41:59,616][26022] Updated weights on worker 0-0, policy_version 273173 (0.00096) [2022-07-09 13:42:01,508][26022] Updated weights on worker 0-0, policy_version 273183 (0.00089) [2022-07-09 13:42:03,764][26022] Updated weights on worker 0-0, policy_version 273193 (0.00086) [2022-07-09 13:42:04,433][25689] Fps is (10 sec: 5462.3, 60 sec: 5653.4, 300 sec: 5652.5). Total num frames: 279753728. Throughput: 0: 5864.0. Samples: 279752268. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:04,433][25689] Avg episode reward: [(0, '-47.692')] [2022-07-09 13:42:05,483][26022] Updated weights on worker 0-0, policy_version 273203 (0.00087) [2022-07-09 13:42:07,431][26022] Updated weights on worker 0-0, policy_version 273213 (0.00090) [2022-07-09 13:42:08,919][26022] Updated weights on worker 0-0, policy_version 273223 (0.00088) [2022-07-09 13:42:09,476][25689] Fps is (10 sec: 5462.8, 60 sec: 5668.3, 300 sec: 5649.7). Total num frames: 279782400. Throughput: 0: 5827.4. Samples: 279785630. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:09,477][25689] Avg episode reward: [(0, '-48.212')] [2022-07-09 13:42:11,115][26022] Updated weights on worker 0-0, policy_version 273233 (0.00095) [2022-07-09 13:42:12,570][26022] Updated weights on worker 0-0, policy_version 273243 (0.00091) [2022-07-09 13:42:14,456][26022] Updated weights on worker 0-0, policy_version 273253 (0.00089) [2022-07-09 13:42:14,543][25689] Fps is (10 sec: 5671.7, 60 sec: 5632.4, 300 sec: 5648.4). Total num frames: 279811072. Throughput: 0: 5818.1. Samples: 279819842. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:14,544][25689] Avg episode reward: [(0, '-47.750')] [2022-07-09 13:42:16,380][26022] Updated weights on worker 0-0, policy_version 273263 (0.00082) [2022-07-09 13:42:18,056][26022] Updated weights on worker 0-0, policy_version 273273 (0.00088) [2022-07-09 13:42:19,551][25689] Fps is (10 sec: 5590.4, 60 sec: 5650.3, 300 sec: 5648.6). Total num frames: 279838720. Throughput: 0: 4973.4. Samples: 279836878. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:19,551][25689] Avg episode reward: [(0, '-47.975')] [2022-07-09 13:42:20,007][26022] Updated weights on worker 0-0, policy_version 273283 (0.00085) [2022-07-09 13:42:21,652][26022] Updated weights on worker 0-0, policy_version 273293 (0.00090) [2022-07-09 13:42:23,446][26022] Updated weights on worker 0-0, policy_version 273303 (0.00085) [2022-07-09 13:42:24,589][25689] Fps is (10 sec: 5708.4, 60 sec: 5671.0, 300 sec: 5644.9). Total num frames: 279868416. Throughput: 0: 5885.8. Samples: 279870902. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:24,589][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 13:42:25,600][26022] Updated weights on worker 0-0, policy_version 273313 (0.00088) [2022-07-09 13:42:27,014][26022] Updated weights on worker 0-0, policy_version 273323 (0.00085) [2022-07-09 13:42:29,099][26022] Updated weights on worker 0-0, policy_version 273333 (0.00093) [2022-07-09 13:42:29,617][25689] Fps is (10 sec: 5696.8, 60 sec: 5640.4, 300 sec: 5645.2). Total num frames: 279896064. Throughput: 0: 5902.5. Samples: 279904510. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:29,617][25689] Avg episode reward: [(0, '-48.424')] [2022-07-09 13:42:30,845][26022] Updated weights on worker 0-0, policy_version 273343 (0.00094) [2022-07-09 13:42:32,534][26022] Updated weights on worker 0-0, policy_version 273353 (0.00095) [2022-07-09 13:42:34,541][26022] Updated weights on worker 0-0, policy_version 273363 (0.00090) [2022-07-09 13:42:34,666][25689] Fps is (10 sec: 5588.7, 60 sec: 5625.5, 300 sec: 5644.5). Total num frames: 279924736. Throughput: 0: 5044.9. Samples: 279921360. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:34,667][25689] Avg episode reward: [(0, '-48.151')] [2022-07-09 13:42:36,118][26022] Updated weights on worker 0-0, policy_version 273373 (0.00086) [2022-07-09 13:42:37,919][26022] Updated weights on worker 0-0, policy_version 273383 (0.00090) [2022-07-09 13:42:39,682][25689] Fps is (10 sec: 5595.4, 60 sec: 5625.3, 300 sec: 5637.5). Total num frames: 279952384. Throughput: 0: 5908.7. Samples: 279955830. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:39,683][25689] Avg episode reward: [(0, '-48.379')] [2022-07-09 13:42:39,902][26022] Updated weights on worker 0-0, policy_version 273393 (0.00086) [2022-07-09 13:42:41,415][26022] Updated weights on worker 0-0, policy_version 273403 (0.00094) [2022-07-09 13:42:43,565][26022] Updated weights on worker 0-0, policy_version 273413 (0.00051) [2022-07-09 13:42:44,708][25689] Fps is (10 sec: 5812.6, 60 sec: 5674.0, 300 sec: 5647.7). Total num frames: 279983104. Throughput: 0: 5938.0. Samples: 279990370. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:44,709][25689] Avg episode reward: [(0, '-48.611')] [2022-07-09 13:42:44,865][26022] Updated weights on worker 0-0, policy_version 273423 (0.00086) [2022-07-09 13:42:47,155][26022] Updated weights on worker 0-0, policy_version 273433 (0.00085) [2022-07-09 13:42:48,627][26022] Updated weights on worker 0-0, policy_version 273443 (0.00054) [2022-07-09 13:42:49,732][25689] Fps is (10 sec: 5705.8, 60 sec: 5639.0, 300 sec: 5638.2). Total num frames: 280009728. Throughput: 0: 5105.3. Samples: 280007206. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:49,733][25689] Avg episode reward: [(0, '-49.585')] [2022-07-09 13:42:50,694][26022] Updated weights on worker 0-0, policy_version 273453 (0.00089) [2022-07-09 13:42:52,577][26022] Updated weights on worker 0-0, policy_version 273463 (0.00086) [2022-07-09 13:42:54,250][26022] Updated weights on worker 0-0, policy_version 273473 (0.00087) [2022-07-09 13:42:54,788][25689] Fps is (10 sec: 5587.4, 60 sec: 5638.3, 300 sec: 5644.9). Total num frames: 280039424. Throughput: 0: 5956.0. Samples: 280041204. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:54,789][25689] Avg episode reward: [(0, '-48.875')] [2022-07-09 13:42:56,124][26022] Updated weights on worker 0-0, policy_version 273483 (0.00106) [2022-07-09 13:42:57,853][26022] Updated weights on worker 0-0, policy_version 273493 (0.00090) [2022-07-09 13:42:59,480][26022] Updated weights on worker 0-0, policy_version 273503 (0.00092) [2022-07-09 13:42:59,828][25689] Fps is (10 sec: 5679.8, 60 sec: 5621.7, 300 sec: 5645.5). Total num frames: 280067072. Throughput: 0: 5935.4. Samples: 280075406. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:42:59,829][25689] Avg episode reward: [(0, '-49.521')] [2022-07-09 13:43:00,637][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:43:00,647][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000273508_280072192.pth [2022-07-09 13:43:00,647][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000271520_278036480.pth [2022-07-09 13:43:01,625][26022] Updated weights on worker 0-0, policy_version 273514 (0.00088) [2022-07-09 13:43:03,990][26022] Updated weights on worker 0-0, policy_version 273524 (0.00096) [2022-07-09 13:43:04,844][25689] Fps is (10 sec: 5498.8, 60 sec: 5644.5, 300 sec: 5641.9). Total num frames: 280094720. Throughput: 0: 4964.9. Samples: 280090342. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:04,844][25689] Avg episode reward: [(0, '-50.220')] [2022-07-09 13:43:05,631][26022] Updated weights on worker 0-0, policy_version 273534 (0.00092) [2022-07-09 13:43:07,551][26022] Updated weights on worker 0-0, policy_version 273544 (0.00090) [2022-07-09 13:43:09,160][26022] Updated weights on worker 0-0, policy_version 273554 (0.00087) [2022-07-09 13:43:09,869][25689] Fps is (10 sec: 5609.1, 60 sec: 5646.2, 300 sec: 5642.3). Total num frames: 280123392. Throughput: 0: 5828.4. Samples: 280124572. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:09,870][25689] Avg episode reward: [(0, '-50.621')] [2022-07-09 13:43:11,083][26022] Updated weights on worker 0-0, policy_version 273564 (0.00088) [2022-07-09 13:43:12,790][26022] Updated weights on worker 0-0, policy_version 273574 (0.00090) [2022-07-09 13:43:14,651][26022] Updated weights on worker 0-0, policy_version 273584 (0.00092) [2022-07-09 13:43:14,962][25689] Fps is (10 sec: 5768.4, 60 sec: 5660.7, 300 sec: 5641.0). Total num frames: 280153088. Throughput: 0: 5846.4. Samples: 280159152. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:14,963][25689] Avg episode reward: [(0, '-50.907')] [2022-07-09 13:43:16,464][26022] Updated weights on worker 0-0, policy_version 273594 (0.00092) [2022-07-09 13:43:18,199][26022] Updated weights on worker 0-0, policy_version 273604 (0.00077) [2022-07-09 13:43:20,012][25689] Fps is (10 sec: 5553.0, 60 sec: 5639.9, 300 sec: 5640.2). Total num frames: 280179712. Throughput: 0: 4994.1. Samples: 280176202. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:20,012][25689] Avg episode reward: [(0, '-50.893')] [2022-07-09 13:43:20,064][26022] Updated weights on worker 0-0, policy_version 273614 (0.00086) [2022-07-09 13:43:21,779][26022] Updated weights on worker 0-0, policy_version 273624 (0.00089) [2022-07-09 13:43:23,578][26022] Updated weights on worker 0-0, policy_version 273634 (0.00081) [2022-07-09 13:43:25,070][25689] Fps is (10 sec: 5572.2, 60 sec: 5638.0, 300 sec: 5639.6). Total num frames: 280209408. Throughput: 0: 5929.8. Samples: 280210278. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:25,070][25689] Avg episode reward: [(0, '-50.323')] [2022-07-09 13:43:25,223][26022] Updated weights on worker 0-0, policy_version 273644 (0.00094) [2022-07-09 13:43:27,222][26022] Updated weights on worker 0-0, policy_version 273654 (0.00086) [2022-07-09 13:43:28,998][26022] Updated weights on worker 0-0, policy_version 273664 (0.00089) [2022-07-09 13:43:30,099][25689] Fps is (10 sec: 5684.5, 60 sec: 5637.9, 300 sec: 5641.4). Total num frames: 280237056. Throughput: 0: 5911.2. Samples: 280244156. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:30,100][25689] Avg episode reward: [(0, '-49.511')] [2022-07-09 13:43:30,769][26022] Updated weights on worker 0-0, policy_version 273674 (0.00091) [2022-07-09 13:43:32,523][26022] Updated weights on worker 0-0, policy_version 273684 (0.00082) [2022-07-09 13:43:34,370][26022] Updated weights on worker 0-0, policy_version 273694 (0.00091) [2022-07-09 13:43:35,180][25689] Fps is (10 sec: 5671.8, 60 sec: 5651.9, 300 sec: 5639.9). Total num frames: 280266752. Throughput: 0: 5042.2. Samples: 280261094. Policy #0 lag: (min: 0.0, avg: 7.7, max: 18.0) [2022-07-09 13:43:35,181][25689] Avg episode reward: [(0, '-49.059')] [2022-07-09 13:43:36,315][26022] Updated weights on worker 0-0, policy_version 273704 (0.00095) [2022-07-09 13:43:37,973][26022] Updated weights on worker 0-0, policy_version 273714 (0.00088) [2022-07-09 13:43:39,804][26022] Updated weights on worker 0-0, policy_version 273724 (0.00091) [2022-07-09 13:43:40,207][25689] Fps is (10 sec: 5673.3, 60 sec: 5650.8, 300 sec: 5640.0). Total num frames: 280294400. Throughput: 0: 5895.8. Samples: 280295270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:43:40,208][25689] Avg episode reward: [(0, '-49.563')] [2022-07-09 13:43:41,546][26022] Updated weights on worker 0-0, policy_version 273734 (0.00054) [2022-07-09 13:43:43,398][26022] Updated weights on worker 0-0, policy_version 273744 (0.00085) [2022-07-09 13:43:45,064][26022] Updated weights on worker 0-0, policy_version 273754 (0.00090) [2022-07-09 13:43:45,212][25689] Fps is (10 sec: 5818.3, 60 sec: 5652.8, 300 sec: 5648.2). Total num frames: 280325120. Throughput: 0: 5921.2. Samples: 280329544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:43:45,212][25689] Avg episode reward: [(0, '-49.857')] [2022-07-09 13:43:46,876][26022] Updated weights on worker 0-0, policy_version 273764 (0.00106) [2022-07-09 13:43:48,821][26022] Updated weights on worker 0-0, policy_version 273774 (0.00533) [2022-07-09 13:43:50,218][25689] Fps is (10 sec: 5830.5, 60 sec: 5671.4, 300 sec: 5642.7). Total num frames: 280352768. Throughput: 0: 5095.8. Samples: 280346676. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:43:50,218][25689] Avg episode reward: [(0, '-50.100')] [2022-07-09 13:43:50,779][26022] Updated weights on worker 0-0, policy_version 273784 (0.00104) [2022-07-09 13:43:52,439][26022] Updated weights on worker 0-0, policy_version 273794 (0.00089) [2022-07-09 13:43:54,199][26022] Updated weights on worker 0-0, policy_version 273804 (0.00081) [2022-07-09 13:43:55,265][25689] Fps is (10 sec: 5500.2, 60 sec: 5638.3, 300 sec: 5645.6). Total num frames: 280380416. Throughput: 0: 5945.2. Samples: 280380504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:43:55,266][25689] Avg episode reward: [(0, '-49.779')] [2022-07-09 13:43:55,870][26022] Updated weights on worker 0-0, policy_version 273814 (0.00093) [2022-07-09 13:43:57,891][26022] Updated weights on worker 0-0, policy_version 273824 (0.00087) [2022-07-09 13:43:59,606][26022] Updated weights on worker 0-0, policy_version 273834 (0.00089) [2022-07-09 13:44:00,319][25689] Fps is (10 sec: 5474.4, 60 sec: 5637.1, 300 sec: 5648.2). Total num frames: 280408064. Throughput: 0: 5934.9. Samples: 280414630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:00,319][25689] Avg episode reward: [(0, '-49.119')] [2022-07-09 13:44:01,475][26022] Updated weights on worker 0-0, policy_version 273844 (0.00088) [2022-07-09 13:44:03,699][26022] Updated weights on worker 0-0, policy_version 273854 (0.00091) [2022-07-09 13:44:05,357][25689] Fps is (10 sec: 5479.5, 60 sec: 5635.0, 300 sec: 5641.2). Total num frames: 280435712. Throughput: 0: 4954.1. Samples: 280429344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:05,357][25689] Avg episode reward: [(0, '-48.553')] [2022-07-09 13:44:05,390][26022] Updated weights on worker 0-0, policy_version 273864 (0.00089) [2022-07-09 13:44:07,301][26022] Updated weights on worker 0-0, policy_version 273874 (0.00088) [2022-07-09 13:44:09,095][26022] Updated weights on worker 0-0, policy_version 273884 (0.00090) [2022-07-09 13:44:10,376][25689] Fps is (10 sec: 5599.8, 60 sec: 5635.6, 300 sec: 5648.6). Total num frames: 280464384. Throughput: 0: 5806.7. Samples: 280463728. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:10,377][25689] Avg episode reward: [(0, '-47.632')] [2022-07-09 13:44:10,931][26022] Updated weights on worker 0-0, policy_version 273894 (0.00088) [2022-07-09 13:44:12,651][26022] Updated weights on worker 0-0, policy_version 273904 (0.00095) [2022-07-09 13:44:14,399][26022] Updated weights on worker 0-0, policy_version 273914 (0.00090) [2022-07-09 13:44:15,486][25689] Fps is (10 sec: 5661.2, 60 sec: 5617.1, 300 sec: 5643.4). Total num frames: 280493056. Throughput: 0: 5803.9. Samples: 280497864. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:15,487][25689] Avg episode reward: [(0, '-47.375')] [2022-07-09 13:44:16,252][26022] Updated weights on worker 0-0, policy_version 273924 (0.00097) [2022-07-09 13:44:18,163][26022] Updated weights on worker 0-0, policy_version 273934 (0.00082) [2022-07-09 13:44:20,039][26022] Updated weights on worker 0-0, policy_version 273944 (0.00098) [2022-07-09 13:44:20,539][25689] Fps is (10 sec: 5541.6, 60 sec: 5633.7, 300 sec: 5639.2). Total num frames: 280520704. Throughput: 0: 4952.5. Samples: 280514772. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:20,540][25689] Avg episode reward: [(0, '-47.369')] [2022-07-09 13:44:21,836][26022] Updated weights on worker 0-0, policy_version 273954 (0.00093) [2022-07-09 13:44:23,602][26022] Updated weights on worker 0-0, policy_version 273964 (0.00087) [2022-07-09 13:44:25,449][26022] Updated weights on worker 0-0, policy_version 273974 (0.00084) [2022-07-09 13:44:25,550][25689] Fps is (10 sec: 5596.0, 60 sec: 5621.1, 300 sec: 5642.9). Total num frames: 280549376. Throughput: 0: 5902.7. Samples: 280548542. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:25,552][25689] Avg episode reward: [(0, '-48.082')] [2022-07-09 13:44:27,178][26022] Updated weights on worker 0-0, policy_version 273984 (0.00098) [2022-07-09 13:44:29,162][26022] Updated weights on worker 0-0, policy_version 273994 (0.00086) [2022-07-09 13:44:30,568][25689] Fps is (10 sec: 5820.3, 60 sec: 5656.1, 300 sec: 5644.2). Total num frames: 280579072. Throughput: 0: 5889.3. Samples: 280582642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:30,568][25689] Avg episode reward: [(0, '-48.441')] [2022-07-09 13:44:30,689][26022] Updated weights on worker 0-0, policy_version 274004 (0.00088) [2022-07-09 13:44:32,675][26022] Updated weights on worker 0-0, policy_version 274014 (0.00089) [2022-07-09 13:44:34,340][26022] Updated weights on worker 0-0, policy_version 274024 (0.00089) [2022-07-09 13:44:35,630][25689] Fps is (10 sec: 5587.3, 60 sec: 5607.0, 300 sec: 5640.0). Total num frames: 280605696. Throughput: 0: 5903.3. Samples: 280616782. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:35,631][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 13:44:36,269][26022] Updated weights on worker 0-0, policy_version 274034 (0.00085) [2022-07-09 13:44:38,152][26022] Updated weights on worker 0-0, policy_version 274044 (0.00090) [2022-07-09 13:44:39,718][26022] Updated weights on worker 0-0, policy_version 274054 (0.00087) [2022-07-09 13:44:40,655][25689] Fps is (10 sec: 5583.1, 60 sec: 5641.1, 300 sec: 5646.8). Total num frames: 280635392. Throughput: 0: 5917.3. Samples: 280633804. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:40,656][25689] Avg episode reward: [(0, '-48.130')] [2022-07-09 13:44:41,727][26022] Updated weights on worker 0-0, policy_version 274064 (0.00086) [2022-07-09 13:44:43,355][26022] Updated weights on worker 0-0, policy_version 274074 (0.00084) [2022-07-09 13:44:45,215][26022] Updated weights on worker 0-0, policy_version 274084 (0.00093) [2022-07-09 13:44:45,684][25689] Fps is (10 sec: 5805.9, 60 sec: 5605.0, 300 sec: 5650.2). Total num frames: 280664064. Throughput: 0: 5929.1. Samples: 280667914. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:45,684][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 13:44:47,163][26022] Updated weights on worker 0-0, policy_version 274094 (0.00083) [2022-07-09 13:44:48,776][26022] Updated weights on worker 0-0, policy_version 274104 (0.00084) [2022-07-09 13:44:50,765][25689] Fps is (10 sec: 5570.9, 60 sec: 5598.0, 300 sec: 5637.1). Total num frames: 280691712. Throughput: 0: 5902.6. Samples: 280701858. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:50,765][25689] Avg episode reward: [(0, '-47.386')] [2022-07-09 13:44:50,806][26022] Updated weights on worker 0-0, policy_version 274114 (0.00087) [2022-07-09 13:44:52,472][26022] Updated weights on worker 0-0, policy_version 274124 (0.00095) [2022-07-09 13:44:54,373][26022] Updated weights on worker 0-0, policy_version 274134 (0.00091) [2022-07-09 13:44:55,867][25689] Fps is (10 sec: 5731.9, 60 sec: 5643.7, 300 sec: 5646.3). Total num frames: 280722432. Throughput: 0: 5053.5. Samples: 280719046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:44:55,867][25689] Avg episode reward: [(0, '-46.451')] [2022-07-09 13:44:56,094][26022] Updated weights on worker 0-0, policy_version 274144 (0.00086) [2022-07-09 13:44:57,786][26022] Updated weights on worker 0-0, policy_version 274154 (0.00094) [2022-07-09 13:44:59,647][26022] Updated weights on worker 0-0, policy_version 274164 (0.00085) [2022-07-09 13:45:00,678][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:45:00,692][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000274169_280749056.pth [2022-07-09 13:45:00,693][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000272183_278715392.pth [2022-07-09 13:45:00,923][25689] Fps is (10 sec: 5745.9, 60 sec: 5643.4, 300 sec: 5648.8). Total num frames: 280750080. Throughput: 0: 5894.0. Samples: 280753262. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:00,924][25689] Avg episode reward: [(0, '-46.727')] [2022-07-09 13:45:01,485][26022] Updated weights on worker 0-0, policy_version 274174 (0.00080) [2022-07-09 13:45:03,820][26022] Updated weights on worker 0-0, policy_version 274184 (0.00094) [2022-07-09 13:45:05,256][26022] Updated weights on worker 0-0, policy_version 274194 (0.00086) [2022-07-09 13:45:05,940][25689] Fps is (10 sec: 5489.3, 60 sec: 5645.4, 300 sec: 5646.9). Total num frames: 280777728. Throughput: 0: 5780.0. Samples: 280784996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:05,941][25689] Avg episode reward: [(0, '-47.616')] [2022-07-09 13:45:07,463][26022] Updated weights on worker 0-0, policy_version 274204 (0.00091) [2022-07-09 13:45:09,042][26022] Updated weights on worker 0-0, policy_version 274214 (0.00095) [2022-07-09 13:45:10,967][25689] Fps is (10 sec: 5403.6, 60 sec: 5610.9, 300 sec: 5636.9). Total num frames: 280804352. Throughput: 0: 4957.4. Samples: 280802006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:10,967][25689] Avg episode reward: [(0, '-47.591')] [2022-07-09 13:45:11,003][26022] Updated weights on worker 0-0, policy_version 274224 (0.00111) [2022-07-09 13:45:12,811][26022] Updated weights on worker 0-0, policy_version 274234 (0.00089) [2022-07-09 13:45:14,585][26022] Updated weights on worker 0-0, policy_version 274244 (0.00087) [2022-07-09 13:45:16,040][25689] Fps is (10 sec: 5576.3, 60 sec: 5631.2, 300 sec: 5642.5). Total num frames: 280834048. Throughput: 0: 5796.7. Samples: 280835984. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:16,041][25689] Avg episode reward: [(0, '-48.104')] [2022-07-09 13:45:16,346][26022] Updated weights on worker 0-0, policy_version 274254 (0.00102) [2022-07-09 13:45:18,213][26022] Updated weights on worker 0-0, policy_version 274264 (0.00083) [2022-07-09 13:45:19,983][26022] Updated weights on worker 0-0, policy_version 274274 (0.00082) [2022-07-09 13:45:21,122][25689] Fps is (10 sec: 5646.5, 60 sec: 5628.5, 300 sec: 5641.2). Total num frames: 280861696. Throughput: 0: 5755.7. Samples: 280869524. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:21,123][25689] Avg episode reward: [(0, '-47.946')] [2022-07-09 13:45:21,911][26022] Updated weights on worker 0-0, policy_version 274284 (0.00088) [2022-07-09 13:45:23,439][26022] Updated weights on worker 0-0, policy_version 274294 (0.00088) [2022-07-09 13:45:25,527][26022] Updated weights on worker 0-0, policy_version 274304 (0.00100) [2022-07-09 13:45:26,142][25689] Fps is (10 sec: 5473.6, 60 sec: 5610.8, 300 sec: 5634.0). Total num frames: 280889344. Throughput: 0: 5019.0. Samples: 280886390. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:26,143][25689] Avg episode reward: [(0, '-48.371')] [2022-07-09 13:45:27,199][26022] Updated weights on worker 0-0, policy_version 274314 (0.00089) [2022-07-09 13:45:29,231][26022] Updated weights on worker 0-0, policy_version 274324 (0.00085) [2022-07-09 13:45:30,816][26022] Updated weights on worker 0-0, policy_version 274334 (0.00090) [2022-07-09 13:45:31,146][25689] Fps is (10 sec: 5720.7, 60 sec: 5612.0, 300 sec: 5642.2). Total num frames: 280919040. Throughput: 0: 5881.4. Samples: 280920692. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:31,148][25689] Avg episode reward: [(0, '-47.605')] [2022-07-09 13:45:32,854][26022] Updated weights on worker 0-0, policy_version 274344 (0.00090) [2022-07-09 13:45:34,583][26022] Updated weights on worker 0-0, policy_version 274354 (0.00086) [2022-07-09 13:45:36,173][26022] Updated weights on worker 0-0, policy_version 274364 (0.00085) [2022-07-09 13:45:36,273][25689] Fps is (10 sec: 5862.7, 60 sec: 5656.8, 300 sec: 5643.8). Total num frames: 280948736. Throughput: 0: 5877.6. Samples: 280954904. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:36,273][25689] Avg episode reward: [(0, '-47.384')] [2022-07-09 13:45:38,223][26022] Updated weights on worker 0-0, policy_version 274374 (0.00088) [2022-07-09 13:45:39,768][26022] Updated weights on worker 0-0, policy_version 274384 (0.00087) [2022-07-09 13:45:41,282][25689] Fps is (10 sec: 5556.6, 60 sec: 5607.5, 300 sec: 5633.5). Total num frames: 280975360. Throughput: 0: 5089.2. Samples: 280972120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:41,283][25689] Avg episode reward: [(0, '-47.585')] [2022-07-09 13:45:41,748][26022] Updated weights on worker 0-0, policy_version 274394 (0.00091) [2022-07-09 13:45:43,469][26022] Updated weights on worker 0-0, policy_version 274404 (0.00086) [2022-07-09 13:45:45,140][26022] Updated weights on worker 0-0, policy_version 274414 (0.00081) [2022-07-09 13:45:46,329][25689] Fps is (10 sec: 5702.5, 60 sec: 5639.6, 300 sec: 5639.8). Total num frames: 281006080. Throughput: 0: 5944.4. Samples: 281006386. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:46,330][25689] Avg episode reward: [(0, '-47.368')] [2022-07-09 13:45:47,190][26022] Updated weights on worker 0-0, policy_version 274424 (0.00087) [2022-07-09 13:45:48,734][26022] Updated weights on worker 0-0, policy_version 274434 (0.00087) [2022-07-09 13:45:50,482][26022] Updated weights on worker 0-0, policy_version 274444 (0.00106) [2022-07-09 13:45:51,366][25689] Fps is (10 sec: 5788.2, 60 sec: 5643.7, 300 sec: 5637.9). Total num frames: 281033728. Throughput: 0: 5945.6. Samples: 281040910. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:51,368][25689] Avg episode reward: [(0, '-47.163')] [2022-07-09 13:45:52,530][26022] Updated weights on worker 0-0, policy_version 274454 (0.00092) [2022-07-09 13:45:54,227][26022] Updated weights on worker 0-0, policy_version 274464 (0.00952) [2022-07-09 13:45:56,158][26022] Updated weights on worker 0-0, policy_version 274474 (0.00089) [2022-07-09 13:45:56,463][25689] Fps is (10 sec: 5658.2, 60 sec: 5627.2, 300 sec: 5637.3). Total num frames: 281063424. Throughput: 0: 5098.0. Samples: 281057836. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:45:56,464][25689] Avg episode reward: [(0, '-46.694')] [2022-07-09 13:45:57,755][26022] Updated weights on worker 0-0, policy_version 274484 (0.00083) [2022-07-09 13:45:59,618][26022] Updated weights on worker 0-0, policy_version 274494 (0.00096) [2022-07-09 13:46:01,542][25689] Fps is (10 sec: 5635.5, 60 sec: 5625.2, 300 sec: 5642.9). Total num frames: 281091072. Throughput: 0: 5919.7. Samples: 281092050. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 13:46:01,542][25689] Avg episode reward: [(0, '-47.332')] [2022-07-09 13:46:01,864][26022] Updated weights on worker 0-0, policy_version 274504 (0.00087) [2022-07-09 13:46:03,595][26022] Updated weights on worker 0-0, policy_version 274514 (0.00096) [2022-07-09 13:46:05,438][26022] Updated weights on worker 0-0, policy_version 274524 (0.00091) [2022-07-09 13:46:06,582][25689] Fps is (10 sec: 5565.7, 60 sec: 5639.9, 300 sec: 5646.0). Total num frames: 281119744. Throughput: 0: 5811.0. Samples: 281124082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:06,583][25689] Avg episode reward: [(0, '-46.874')] [2022-07-09 13:46:07,163][26022] Updated weights on worker 0-0, policy_version 274534 (0.00083) [2022-07-09 13:46:08,871][26022] Updated weights on worker 0-0, policy_version 274544 (0.00087) [2022-07-09 13:46:10,725][26022] Updated weights on worker 0-0, policy_version 274554 (0.00088) [2022-07-09 13:46:11,585][25689] Fps is (10 sec: 5607.8, 60 sec: 5659.0, 300 sec: 5636.4). Total num frames: 281147392. Throughput: 0: 4964.5. Samples: 281141286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:11,585][25689] Avg episode reward: [(0, '-47.304')] [2022-07-09 13:46:12,530][26022] Updated weights on worker 0-0, policy_version 274564 (0.00088) [2022-07-09 13:46:14,355][26022] Updated weights on worker 0-0, policy_version 274574 (0.00053) [2022-07-09 13:46:16,027][26022] Updated weights on worker 0-0, policy_version 274584 (0.00086) [2022-07-09 13:46:16,639][25689] Fps is (10 sec: 5702.3, 60 sec: 5660.8, 300 sec: 5646.1). Total num frames: 281177088. Throughput: 0: 5839.2. Samples: 281175644. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:16,639][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 13:46:18,009][26022] Updated weights on worker 0-0, policy_version 274594 (0.00095) [2022-07-09 13:46:19,630][26022] Updated weights on worker 0-0, policy_version 274604 (0.00096) [2022-07-09 13:46:21,456][26022] Updated weights on worker 0-0, policy_version 274614 (0.00084) [2022-07-09 13:46:21,646][25689] Fps is (10 sec: 5801.6, 60 sec: 5684.8, 300 sec: 5647.5). Total num frames: 281205760. Throughput: 0: 5871.6. Samples: 281210092. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:21,646][25689] Avg episode reward: [(0, '-47.347')] [2022-07-09 13:46:23,312][26022] Updated weights on worker 0-0, policy_version 274624 (0.00087) [2022-07-09 13:46:25,213][26022] Updated weights on worker 0-0, policy_version 274634 (0.00084) [2022-07-09 13:46:26,666][25689] Fps is (10 sec: 5616.7, 60 sec: 5684.8, 300 sec: 5641.4). Total num frames: 281233408. Throughput: 0: 5122.8. Samples: 281226966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:26,666][25689] Avg episode reward: [(0, '-47.718')] [2022-07-09 13:46:26,841][26022] Updated weights on worker 0-0, policy_version 274644 (0.00307) [2022-07-09 13:46:28,783][26022] Updated weights on worker 0-0, policy_version 274654 (0.00090) [2022-07-09 13:46:30,480][26022] Updated weights on worker 0-0, policy_version 274664 (0.00089) [2022-07-09 13:46:31,681][25689] Fps is (10 sec: 5509.9, 60 sec: 5649.9, 300 sec: 5635.6). Total num frames: 281261056. Throughput: 0: 5954.1. Samples: 281260942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:31,682][25689] Avg episode reward: [(0, '-48.256')] [2022-07-09 13:46:32,487][26022] Updated weights on worker 0-0, policy_version 274674 (0.00052) [2022-07-09 13:46:34,122][26022] Updated weights on worker 0-0, policy_version 274684 (0.00089) [2022-07-09 13:46:35,997][26022] Updated weights on worker 0-0, policy_version 274694 (0.00087) [2022-07-09 13:46:36,795][25689] Fps is (10 sec: 5762.6, 60 sec: 5668.0, 300 sec: 5644.0). Total num frames: 281291776. Throughput: 0: 5914.3. Samples: 281294852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:36,795][25689] Avg episode reward: [(0, '-48.797')] [2022-07-09 13:46:38,019][26022] Updated weights on worker 0-0, policy_version 274704 (0.00084) [2022-07-09 13:46:39,732][26022] Updated weights on worker 0-0, policy_version 274714 (0.00091) [2022-07-09 13:46:41,584][26022] Updated weights on worker 0-0, policy_version 274724 (0.00086) [2022-07-09 13:46:41,811][25689] Fps is (10 sec: 5762.2, 60 sec: 5684.3, 300 sec: 5643.7). Total num frames: 281319424. Throughput: 0: 5048.0. Samples: 281311886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:41,811][25689] Avg episode reward: [(0, '-49.612')] [2022-07-09 13:46:43,399][26022] Updated weights on worker 0-0, policy_version 274734 (0.00093) [2022-07-09 13:46:44,987][26022] Updated weights on worker 0-0, policy_version 274744 (0.00099) [2022-07-09 13:46:46,838][25689] Fps is (10 sec: 5505.5, 60 sec: 5635.3, 300 sec: 5640.0). Total num frames: 281347072. Throughput: 0: 5907.1. Samples: 281346126. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:46,839][25689] Avg episode reward: [(0, '-49.685')] [2022-07-09 13:46:46,976][26022] Updated weights on worker 0-0, policy_version 274754 (0.00084) [2022-07-09 13:46:48,617][26022] Updated weights on worker 0-0, policy_version 274764 (0.00094) [2022-07-09 13:46:50,502][26022] Updated weights on worker 0-0, policy_version 274774 (0.00087) [2022-07-09 13:46:51,843][25689] Fps is (10 sec: 5715.7, 60 sec: 5672.2, 300 sec: 5640.8). Total num frames: 281376768. Throughput: 0: 5927.0. Samples: 281380444. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:51,844][25689] Avg episode reward: [(0, '-50.004')] [2022-07-09 13:46:52,182][26022] Updated weights on worker 0-0, policy_version 274784 (0.00085) [2022-07-09 13:46:54,251][26022] Updated weights on worker 0-0, policy_version 274794 (0.00087) [2022-07-09 13:46:55,755][26022] Updated weights on worker 0-0, policy_version 274804 (0.00091) [2022-07-09 13:46:56,913][25689] Fps is (10 sec: 5691.7, 60 sec: 5640.9, 300 sec: 5636.9). Total num frames: 281404416. Throughput: 0: 5947.4. Samples: 281414506. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:46:56,914][25689] Avg episode reward: [(0, '-49.890')] [2022-07-09 13:46:57,647][26022] Updated weights on worker 0-0, policy_version 274814 (0.00092) [2022-07-09 13:46:59,350][26022] Updated weights on worker 0-0, policy_version 274824 (0.00095) [2022-07-09 13:47:00,771][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:47:00,795][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000274831_281426944.pth [2022-07-09 13:47:00,796][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000272846_279394304.pth [2022-07-09 13:47:01,362][26022] Updated weights on worker 0-0, policy_version 274834 (0.00088) [2022-07-09 13:47:01,937][25689] Fps is (10 sec: 5579.7, 60 sec: 5662.9, 300 sec: 5644.8). Total num frames: 281433088. Throughput: 0: 5942.1. Samples: 281431480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:01,938][25689] Avg episode reward: [(0, '-49.105')] [2022-07-09 13:47:03,387][26022] Updated weights on worker 0-0, policy_version 274844 (0.00082) [2022-07-09 13:47:05,216][26022] Updated weights on worker 0-0, policy_version 274854 (0.00085) [2022-07-09 13:47:06,941][25689] Fps is (10 sec: 5514.6, 60 sec: 5632.5, 300 sec: 5638.7). Total num frames: 281459712. Throughput: 0: 5831.3. Samples: 281463348. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:06,941][25689] Avg episode reward: [(0, '-49.174')] [2022-07-09 13:47:07,035][26022] Updated weights on worker 0-0, policy_version 274864 (0.00084) [2022-07-09 13:47:08,867][26022] Updated weights on worker 0-0, policy_version 274874 (0.00085) [2022-07-09 13:47:10,766][26022] Updated weights on worker 0-0, policy_version 274884 (0.00104) [2022-07-09 13:47:11,960][25689] Fps is (10 sec: 5414.7, 60 sec: 5630.9, 300 sec: 5636.2). Total num frames: 281487360. Throughput: 0: 5788.4. Samples: 281496888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:11,961][25689] Avg episode reward: [(0, '-49.670')] [2022-07-09 13:47:12,632][26022] Updated weights on worker 0-0, policy_version 274894 (0.00089) [2022-07-09 13:47:14,479][26022] Updated weights on worker 0-0, policy_version 274904 (0.00090) [2022-07-09 13:47:16,330][26022] Updated weights on worker 0-0, policy_version 274914 (0.00090) [2022-07-09 13:47:17,095][25689] Fps is (10 sec: 5647.2, 60 sec: 5623.4, 300 sec: 5640.6). Total num frames: 281517056. Throughput: 0: 4922.7. Samples: 281513854. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:17,096][25689] Avg episode reward: [(0, '-49.579')] [2022-07-09 13:47:18,014][26022] Updated weights on worker 0-0, policy_version 274924 (0.00091) [2022-07-09 13:47:19,733][26022] Updated weights on worker 0-0, policy_version 274934 (0.00082) [2022-07-09 13:47:21,691][26022] Updated weights on worker 0-0, policy_version 274944 (0.00087) [2022-07-09 13:47:22,119][25689] Fps is (10 sec: 5644.6, 60 sec: 5604.8, 300 sec: 5634.0). Total num frames: 281544704. Throughput: 0: 5788.9. Samples: 281548312. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:22,120][25689] Avg episode reward: [(0, '-49.081')] [2022-07-09 13:47:23,379][26022] Updated weights on worker 0-0, policy_version 274954 (0.00094) [2022-07-09 13:47:25,140][26022] Updated weights on worker 0-0, policy_version 274964 (0.00084) [2022-07-09 13:47:26,979][26022] Updated weights on worker 0-0, policy_version 274974 (0.00089) [2022-07-09 13:47:27,130][25689] Fps is (10 sec: 5612.3, 60 sec: 5622.6, 300 sec: 5637.8). Total num frames: 281573376. Throughput: 0: 5893.4. Samples: 281582334. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:27,131][25689] Avg episode reward: [(0, '-49.431')] [2022-07-09 13:47:28,832][26022] Updated weights on worker 0-0, policy_version 274984 (0.00089) [2022-07-09 13:47:30,731][26022] Updated weights on worker 0-0, policy_version 274994 (0.00077) [2022-07-09 13:47:32,203][25689] Fps is (10 sec: 5687.1, 60 sec: 5634.3, 300 sec: 5637.3). Total num frames: 281602048. Throughput: 0: 5053.5. Samples: 281599182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:32,203][25689] Avg episode reward: [(0, '-49.038')] [2022-07-09 13:47:32,433][26022] Updated weights on worker 0-0, policy_version 275004 (0.00090) [2022-07-09 13:47:34,269][26022] Updated weights on worker 0-0, policy_version 275014 (0.00089) [2022-07-09 13:47:36,145][26022] Updated weights on worker 0-0, policy_version 275024 (0.00092) [2022-07-09 13:47:37,287][25689] Fps is (10 sec: 5645.9, 60 sec: 5603.1, 300 sec: 5639.5). Total num frames: 281630720. Throughput: 0: 5906.6. Samples: 281633120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:37,288][25689] Avg episode reward: [(0, '-48.022')] [2022-07-09 13:47:37,844][26022] Updated weights on worker 0-0, policy_version 275034 (0.00094) [2022-07-09 13:47:39,705][26022] Updated weights on worker 0-0, policy_version 275044 (0.00088) [2022-07-09 13:47:41,403][26022] Updated weights on worker 0-0, policy_version 275054 (0.00090) [2022-07-09 13:47:42,315][25689] Fps is (10 sec: 5670.5, 60 sec: 5618.9, 300 sec: 5632.6). Total num frames: 281659392. Throughput: 0: 5899.9. Samples: 281667466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:42,316][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 13:47:43,386][26022] Updated weights on worker 0-0, policy_version 275064 (0.00086) [2022-07-09 13:47:44,917][26022] Updated weights on worker 0-0, policy_version 275074 (0.00094) [2022-07-09 13:47:46,859][26022] Updated weights on worker 0-0, policy_version 275084 (0.00082) [2022-07-09 13:47:47,329][25689] Fps is (10 sec: 5710.3, 60 sec: 5637.1, 300 sec: 5639.6). Total num frames: 281688064. Throughput: 0: 5065.8. Samples: 281684658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:47,330][25689] Avg episode reward: [(0, '-47.267')] [2022-07-09 13:47:48,770][26022] Updated weights on worker 0-0, policy_version 275094 (0.00101) [2022-07-09 13:47:50,416][26022] Updated weights on worker 0-0, policy_version 275104 (0.00095) [2022-07-09 13:47:52,239][26022] Updated weights on worker 0-0, policy_version 275114 (0.00083) [2022-07-09 13:47:52,359][25689] Fps is (10 sec: 5709.6, 60 sec: 5617.9, 300 sec: 5636.7). Total num frames: 281716736. Throughput: 0: 5926.7. Samples: 281718642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:52,359][25689] Avg episode reward: [(0, '-47.727')] [2022-07-09 13:47:54,041][26022] Updated weights on worker 0-0, policy_version 275124 (0.00091) [2022-07-09 13:47:55,835][26022] Updated weights on worker 0-0, policy_version 275134 (0.00088) [2022-07-09 13:47:57,401][25689] Fps is (10 sec: 5693.6, 60 sec: 5637.4, 300 sec: 5640.1). Total num frames: 281745408. Throughput: 0: 5961.9. Samples: 281753036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:47:57,402][25689] Avg episode reward: [(0, '-48.479')] [2022-07-09 13:47:57,738][26022] Updated weights on worker 0-0, policy_version 275144 (0.00086) [2022-07-09 13:47:59,419][26022] Updated weights on worker 0-0, policy_version 275154 (0.00099) [2022-07-09 13:48:01,275][26022] Updated weights on worker 0-0, policy_version 275164 (0.00092) [2022-07-09 13:48:02,432][25689] Fps is (10 sec: 5387.5, 60 sec: 5585.9, 300 sec: 5632.9). Total num frames: 281771008. Throughput: 0: 5100.7. Samples: 281770078. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:48:02,433][25689] Avg episode reward: [(0, '-48.531')] [2022-07-09 13:48:03,355][26022] Updated weights on worker 0-0, policy_version 275174 (0.00073) [2022-07-09 13:48:05,292][26022] Updated weights on worker 0-0, policy_version 275184 (0.00091) [2022-07-09 13:48:07,025][26022] Updated weights on worker 0-0, policy_version 275194 (0.00088) [2022-07-09 13:48:07,434][25689] Fps is (10 sec: 5511.5, 60 sec: 5636.9, 300 sec: 5636.8). Total num frames: 281800704. Throughput: 0: 5844.4. Samples: 281802156. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:48:07,434][25689] Avg episode reward: [(0, '-49.340')] [2022-07-09 13:48:08,806][26022] Updated weights on worker 0-0, policy_version 275204 (0.00091) [2022-07-09 13:48:10,712][26022] Updated weights on worker 0-0, policy_version 275214 (0.00092) [2022-07-09 13:48:12,466][25689] Fps is (10 sec: 5817.3, 60 sec: 5652.6, 300 sec: 5634.5). Total num frames: 281829376. Throughput: 0: 5853.6. Samples: 281836340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:48:12,466][25689] Avg episode reward: [(0, '-50.323')] [2022-07-09 13:48:12,470][26022] Updated weights on worker 0-0, policy_version 275224 (0.00092) [2022-07-09 13:48:14,417][26022] Updated weights on worker 0-0, policy_version 275234 (0.00083) [2022-07-09 13:48:15,938][26022] Updated weights on worker 0-0, policy_version 275244 (0.00080) [2022-07-09 13:48:17,564][25689] Fps is (10 sec: 5559.6, 60 sec: 5622.2, 300 sec: 5637.0). Total num frames: 281857024. Throughput: 0: 4982.2. Samples: 281853492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:48:17,564][25689] Avg episode reward: [(0, '-49.520')] [2022-07-09 13:48:18,067][26022] Updated weights on worker 0-0, policy_version 275254 (0.00079) [2022-07-09 13:48:19,543][26022] Updated weights on worker 0-0, policy_version 275264 (0.00075) [2022-07-09 13:48:21,999][26022] Updated weights on worker 0-0, policy_version 275274 (0.00096) [2022-07-09 13:48:22,599][25689] Fps is (10 sec: 5457.0, 60 sec: 5621.2, 300 sec: 5630.6). Total num frames: 281884672. Throughput: 0: 5764.5. Samples: 281886328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 13:48:22,599][25689] Avg episode reward: [(0, '-48.539')] [2022-07-09 13:48:23,479][26022] Updated weights on worker 0-0, policy_version 275284 (0.00094) [2022-07-09 13:48:25,772][26022] Updated weights on worker 0-0, policy_version 275294 (0.00086) [2022-07-09 13:48:27,376][26022] Updated weights on worker 0-0, policy_version 275304 (0.00093) [2022-07-09 13:48:27,682][25689] Fps is (10 sec: 5464.8, 60 sec: 5597.6, 300 sec: 5629.6). Total num frames: 281912320. Throughput: 0: 5698.8. Samples: 281917550. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:27,683][25689] Avg episode reward: [(0, '-47.955')] [2022-07-09 13:48:29,565][26022] Updated weights on worker 0-0, policy_version 275314 (0.00094) [2022-07-09 13:48:31,223][26022] Updated weights on worker 0-0, policy_version 275324 (0.00080) [2022-07-09 13:48:32,722][25689] Fps is (10 sec: 5361.1, 60 sec: 5566.7, 300 sec: 5620.0). Total num frames: 281938944. Throughput: 0: 4810.2. Samples: 281933778. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:32,723][25689] Avg episode reward: [(0, '-48.146')] [2022-07-09 13:48:33,291][26022] Updated weights on worker 0-0, policy_version 275334 (0.00086) [2022-07-09 13:48:34,803][26022] Updated weights on worker 0-0, policy_version 275344 (0.00089) [2022-07-09 13:48:36,841][26022] Updated weights on worker 0-0, policy_version 275354 (0.00096) [2022-07-09 13:48:37,774][25689] Fps is (10 sec: 5479.2, 60 sec: 5569.7, 300 sec: 5623.0). Total num frames: 281967616. Throughput: 0: 5671.1. Samples: 281968108. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:37,775][25689] Avg episode reward: [(0, '-47.212')] [2022-07-09 13:48:38,473][26022] Updated weights on worker 0-0, policy_version 275364 (0.00092) [2022-07-09 13:48:40,458][26022] Updated weights on worker 0-0, policy_version 275374 (0.00081) [2022-07-09 13:48:42,091][26022] Updated weights on worker 0-0, policy_version 275384 (0.00091) [2022-07-09 13:48:42,794][25689] Fps is (10 sec: 5795.3, 60 sec: 5587.4, 300 sec: 5619.3). Total num frames: 281997312. Throughput: 0: 5748.9. Samples: 282002426. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:42,794][25689] Avg episode reward: [(0, '-47.632')] [2022-07-09 13:48:43,989][26022] Updated weights on worker 0-0, policy_version 275394 (0.00089) [2022-07-09 13:48:45,566][26022] Updated weights on worker 0-0, policy_version 275404 (0.00088) [2022-07-09 13:48:47,599][26022] Updated weights on worker 0-0, policy_version 275414 (0.00088) [2022-07-09 13:48:47,808][25689] Fps is (10 sec: 5613.2, 60 sec: 5553.6, 300 sec: 5615.7). Total num frames: 282023936. Throughput: 0: 5061.0. Samples: 282019406. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:47,808][25689] Avg episode reward: [(0, '-47.606')] [2022-07-09 13:48:49,314][26022] Updated weights on worker 0-0, policy_version 275424 (0.00085) [2022-07-09 13:48:51,396][26022] Updated weights on worker 0-0, policy_version 275434 (0.00090) [2022-07-09 13:48:52,833][25689] Fps is (10 sec: 5507.7, 60 sec: 5553.9, 300 sec: 5619.5). Total num frames: 282052608. Throughput: 0: 5934.9. Samples: 282053136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:52,834][25689] Avg episode reward: [(0, '-48.141')] [2022-07-09 13:48:53,124][26022] Updated weights on worker 0-0, policy_version 275444 (0.00085) [2022-07-09 13:48:54,868][26022] Updated weights on worker 0-0, policy_version 275454 (0.00088) [2022-07-09 13:48:56,524][26022] Updated weights on worker 0-0, policy_version 275464 (0.00627) [2022-07-09 13:48:57,871][25689] Fps is (10 sec: 5698.5, 60 sec: 5554.4, 300 sec: 5623.3). Total num frames: 282081280. Throughput: 0: 5919.2. Samples: 282087062. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:48:57,871][25689] Avg episode reward: [(0, '-47.373')] [2022-07-09 13:48:58,603][26022] Updated weights on worker 0-0, policy_version 275474 (0.00092) [2022-07-09 13:49:00,154][26022] Updated weights on worker 0-0, policy_version 275484 (0.00091) [2022-07-09 13:49:00,879][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:49:00,896][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000275487_282098688.pth [2022-07-09 13:49:00,896][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000273508_280072192.pth [2022-07-09 13:49:02,495][26022] Updated weights on worker 0-0, policy_version 275494 (0.00085) [2022-07-09 13:49:02,911][25689] Fps is (10 sec: 5487.2, 60 sec: 5570.5, 300 sec: 5619.8). Total num frames: 282107904. Throughput: 0: 5053.7. Samples: 282104092. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:02,911][25689] Avg episode reward: [(0, '-46.722')] [2022-07-09 13:49:04,343][26022] Updated weights on worker 0-0, policy_version 275504 (0.00086) [2022-07-09 13:49:06,012][26022] Updated weights on worker 0-0, policy_version 275514 (0.00087) [2022-07-09 13:49:07,918][25689] Fps is (10 sec: 5401.8, 60 sec: 5536.1, 300 sec: 5616.6). Total num frames: 282135552. Throughput: 0: 5810.4. Samples: 282136254. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:07,918][25689] Avg episode reward: [(0, '-46.885')] [2022-07-09 13:49:07,977][26022] Updated weights on worker 0-0, policy_version 275524 (0.00083) [2022-07-09 13:49:09,503][26022] Updated weights on worker 0-0, policy_version 275534 (0.00086) [2022-07-09 13:49:11,451][26022] Updated weights on worker 0-0, policy_version 275544 (0.00077) [2022-07-09 13:49:12,931][25689] Fps is (10 sec: 5722.9, 60 sec: 5554.8, 300 sec: 5621.9). Total num frames: 282165248. Throughput: 0: 5839.9. Samples: 282170502. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:12,933][25689] Avg episode reward: [(0, '-46.452')] [2022-07-09 13:49:13,385][26022] Updated weights on worker 0-0, policy_version 275554 (0.00090) [2022-07-09 13:49:15,022][26022] Updated weights on worker 0-0, policy_version 275564 (0.00090) [2022-07-09 13:49:17,002][26022] Updated weights on worker 0-0, policy_version 275574 (0.00094) [2022-07-09 13:49:17,996][25689] Fps is (10 sec: 5893.4, 60 sec: 5591.8, 300 sec: 5628.5). Total num frames: 282194944. Throughput: 0: 4998.7. Samples: 282187660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:17,996][25689] Avg episode reward: [(0, '-46.632')] [2022-07-09 13:49:18,541][26022] Updated weights on worker 0-0, policy_version 275584 (0.00086) [2022-07-09 13:49:20,499][26022] Updated weights on worker 0-0, policy_version 275594 (0.00083) [2022-07-09 13:49:22,331][26022] Updated weights on worker 0-0, policy_version 275604 (0.00083) [2022-07-09 13:49:23,007][25689] Fps is (10 sec: 5691.2, 60 sec: 5594.0, 300 sec: 5625.1). Total num frames: 282222592. Throughput: 0: 5848.5. Samples: 282221624. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:23,008][25689] Avg episode reward: [(0, '-46.796')] [2022-07-09 13:49:23,991][26022] Updated weights on worker 0-0, policy_version 275614 (0.00085) [2022-07-09 13:49:25,943][26022] Updated weights on worker 0-0, policy_version 275624 (0.00092) [2022-07-09 13:49:27,626][26022] Updated weights on worker 0-0, policy_version 275634 (0.00091) [2022-07-09 13:49:28,033][25689] Fps is (10 sec: 5509.1, 60 sec: 5599.3, 300 sec: 5618.0). Total num frames: 282250240. Throughput: 0: 5943.5. Samples: 282255806. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:28,034][25689] Avg episode reward: [(0, '-47.551')] [2022-07-09 13:49:29,462][26022] Updated weights on worker 0-0, policy_version 275644 (0.00091) [2022-07-09 13:49:31,399][26022] Updated weights on worker 0-0, policy_version 275654 (0.00092) [2022-07-09 13:49:33,040][25689] Fps is (10 sec: 5613.2, 60 sec: 5636.3, 300 sec: 5626.0). Total num frames: 282278912. Throughput: 0: 5099.1. Samples: 282273042. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:33,041][25689] Avg episode reward: [(0, '-47.759')] [2022-07-09 13:49:33,113][26022] Updated weights on worker 0-0, policy_version 275664 (0.00083) [2022-07-09 13:49:34,805][26022] Updated weights on worker 0-0, policy_version 275674 (0.00090) [2022-07-09 13:49:36,647][26022] Updated weights on worker 0-0, policy_version 275684 (0.00087) [2022-07-09 13:49:38,145][25689] Fps is (10 sec: 5772.2, 60 sec: 5648.4, 300 sec: 5624.5). Total num frames: 282308608. Throughput: 0: 5942.0. Samples: 282307386. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:38,145][25689] Avg episode reward: [(0, '-48.127')] [2022-07-09 13:49:38,686][26022] Updated weights on worker 0-0, policy_version 275694 (0.00087) [2022-07-09 13:49:40,228][26022] Updated weights on worker 0-0, policy_version 275704 (0.00088) [2022-07-09 13:49:42,193][26022] Updated weights on worker 0-0, policy_version 275714 (0.00087) [2022-07-09 13:49:43,168][25689] Fps is (10 sec: 5763.1, 60 sec: 5631.0, 300 sec: 5624.6). Total num frames: 282337280. Throughput: 0: 5938.8. Samples: 282341358. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:43,169][25689] Avg episode reward: [(0, '-48.278')] [2022-07-09 13:49:43,725][26022] Updated weights on worker 0-0, policy_version 275724 (0.00089) [2022-07-09 13:49:45,881][26022] Updated weights on worker 0-0, policy_version 275734 (0.00095) [2022-07-09 13:49:47,433][26022] Updated weights on worker 0-0, policy_version 275744 (0.00089) [2022-07-09 13:49:48,178][25689] Fps is (10 sec: 5613.1, 60 sec: 5648.3, 300 sec: 5625.9). Total num frames: 282364928. Throughput: 0: 5088.8. Samples: 282358322. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:48,179][25689] Avg episode reward: [(0, '-48.279')] [2022-07-09 13:49:49,267][26022] Updated weights on worker 0-0, policy_version 275754 (0.00573) [2022-07-09 13:49:51,206][26022] Updated weights on worker 0-0, policy_version 275764 (0.00088) [2022-07-09 13:49:52,848][26022] Updated weights on worker 0-0, policy_version 275774 (0.00092) [2022-07-09 13:49:53,252][25689] Fps is (10 sec: 5788.4, 60 sec: 5677.8, 300 sec: 5626.4). Total num frames: 282395648. Throughput: 0: 5902.0. Samples: 282392330. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:53,252][25689] Avg episode reward: [(0, '-48.632')] [2022-07-09 13:49:54,856][26022] Updated weights on worker 0-0, policy_version 275784 (0.00095) [2022-07-09 13:49:56,623][26022] Updated weights on worker 0-0, policy_version 275794 (0.00051) [2022-07-09 13:49:58,330][25689] Fps is (10 sec: 5648.8, 60 sec: 5640.1, 300 sec: 5622.6). Total num frames: 282422272. Throughput: 0: 5895.3. Samples: 282426384. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:49:58,330][25689] Avg episode reward: [(0, '-47.726')] [2022-07-09 13:49:58,341][26022] Updated weights on worker 0-0, policy_version 275804 (0.00100) [2022-07-09 13:50:00,148][26022] Updated weights on worker 0-0, policy_version 275814 (0.00101) [2022-07-09 13:50:02,281][26022] Updated weights on worker 0-0, policy_version 275824 (0.00079) [2022-07-09 13:50:03,336][25689] Fps is (10 sec: 5280.4, 60 sec: 5643.2, 300 sec: 5619.3). Total num frames: 282448896. Throughput: 0: 5804.7. Samples: 282458426. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:03,336][25689] Avg episode reward: [(0, '-47.686')] [2022-07-09 13:50:04,201][26022] Updated weights on worker 0-0, policy_version 275834 (0.00084) [2022-07-09 13:50:06,018][26022] Updated weights on worker 0-0, policy_version 275844 (0.00089) [2022-07-09 13:50:07,611][26022] Updated weights on worker 0-0, policy_version 275854 (0.00081) [2022-07-09 13:50:08,346][25689] Fps is (10 sec: 5622.6, 60 sec: 5676.8, 300 sec: 5630.0). Total num frames: 282478592. Throughput: 0: 5800.0. Samples: 282475298. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:08,347][25689] Avg episode reward: [(0, '-46.955')] [2022-07-09 13:50:09,858][26022] Updated weights on worker 0-0, policy_version 275864 (0.00091) [2022-07-09 13:50:11,219][26022] Updated weights on worker 0-0, policy_version 275874 (0.00088) [2022-07-09 13:50:13,359][25689] Fps is (10 sec: 5516.9, 60 sec: 5609.1, 300 sec: 5617.4). Total num frames: 282504192. Throughput: 0: 5824.4. Samples: 282509442. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:13,359][25689] Avg episode reward: [(0, '-47.054')] [2022-07-09 13:50:13,415][26022] Updated weights on worker 0-0, policy_version 275884 (0.00101) [2022-07-09 13:50:14,880][26022] Updated weights on worker 0-0, policy_version 275894 (0.00095) [2022-07-09 13:50:16,823][26022] Updated weights on worker 0-0, policy_version 275904 (0.00088) [2022-07-09 13:50:18,463][25689] Fps is (10 sec: 5567.0, 60 sec: 5622.3, 300 sec: 5627.3). Total num frames: 282534912. Throughput: 0: 5811.1. Samples: 282543382. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:18,464][25689] Avg episode reward: [(0, '-47.349')] [2022-07-09 13:50:18,653][26022] Updated weights on worker 0-0, policy_version 275914 (0.00083) [2022-07-09 13:50:20,460][26022] Updated weights on worker 0-0, policy_version 275924 (0.00085) [2022-07-09 13:50:22,276][26022] Updated weights on worker 0-0, policy_version 275934 (0.00098) [2022-07-09 13:50:23,548][25689] Fps is (10 sec: 5829.1, 60 sec: 5632.4, 300 sec: 5629.5). Total num frames: 282563584. Throughput: 0: 5047.4. Samples: 282560444. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:23,548][25689] Avg episode reward: [(0, '-47.068')] [2022-07-09 13:50:24,084][26022] Updated weights on worker 0-0, policy_version 275944 (0.00419) [2022-07-09 13:50:25,831][26022] Updated weights on worker 0-0, policy_version 275954 (0.00094) [2022-07-09 13:50:27,793][26022] Updated weights on worker 0-0, policy_version 275964 (0.00099) [2022-07-09 13:50:28,622][25689] Fps is (10 sec: 5644.6, 60 sec: 5644.8, 300 sec: 5624.7). Total num frames: 282592256. Throughput: 0: 5864.8. Samples: 282594212. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:28,623][25689] Avg episode reward: [(0, '-48.524')] [2022-07-09 13:50:29,544][26022] Updated weights on worker 0-0, policy_version 275974 (0.00083) [2022-07-09 13:50:31,362][26022] Updated weights on worker 0-0, policy_version 275984 (0.00090) [2022-07-09 13:50:33,130][26022] Updated weights on worker 0-0, policy_version 275994 (0.00087) [2022-07-09 13:50:33,635][25689] Fps is (10 sec: 5583.3, 60 sec: 5627.4, 300 sec: 5620.0). Total num frames: 282619904. Throughput: 0: 5856.1. Samples: 282628182. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:33,635][25689] Avg episode reward: [(0, '-48.402')] [2022-07-09 13:50:34,985][26022] Updated weights on worker 0-0, policy_version 276004 (0.00087) [2022-07-09 13:50:36,771][26022] Updated weights on worker 0-0, policy_version 276014 (0.00083) [2022-07-09 13:50:38,690][25689] Fps is (10 sec: 5594.1, 60 sec: 5615.1, 300 sec: 5626.0). Total num frames: 282648576. Throughput: 0: 5035.0. Samples: 282645224. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:38,691][25689] Avg episode reward: [(0, '-48.079')] [2022-07-09 13:50:38,694][26022] Updated weights on worker 0-0, policy_version 276024 (0.00084) [2022-07-09 13:50:40,347][26022] Updated weights on worker 0-0, policy_version 276034 (0.00085) [2022-07-09 13:50:42,057][26022] Updated weights on worker 0-0, policy_version 276044 (0.00090) [2022-07-09 13:50:43,745][25689] Fps is (10 sec: 5671.8, 60 sec: 5612.2, 300 sec: 5618.9). Total num frames: 282677248. Throughput: 0: 5898.3. Samples: 282679576. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:43,746][25689] Avg episode reward: [(0, '-48.499')] [2022-07-09 13:50:43,989][26022] Updated weights on worker 0-0, policy_version 276054 (0.00088) [2022-07-09 13:50:45,723][26022] Updated weights on worker 0-0, policy_version 276064 (0.00083) [2022-07-09 13:50:47,652][26022] Updated weights on worker 0-0, policy_version 276074 (0.00086) [2022-07-09 13:50:48,797][25689] Fps is (10 sec: 5775.1, 60 sec: 5642.1, 300 sec: 5625.6). Total num frames: 282706944. Throughput: 0: 5920.2. Samples: 282713650. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-09 13:50:48,798][25689] Avg episode reward: [(0, '-48.100')] [2022-07-09 13:50:49,452][26022] Updated weights on worker 0-0, policy_version 276084 (0.00087) [2022-07-09 13:50:51,125][26022] Updated weights on worker 0-0, policy_version 276094 (0.00090) [2022-07-09 13:50:53,171][26022] Updated weights on worker 0-0, policy_version 276104 (0.00092) [2022-07-09 13:50:53,800][25689] Fps is (10 sec: 5805.1, 60 sec: 5614.8, 300 sec: 5623.9). Total num frames: 282735616. Throughput: 0: 5076.3. Samples: 282730544. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:50:53,800][25689] Avg episode reward: [(0, '-47.459')] [2022-07-09 13:50:54,664][26022] Updated weights on worker 0-0, policy_version 276114 (0.00092) [2022-07-09 13:50:56,735][26022] Updated weights on worker 0-0, policy_version 276124 (0.00089) [2022-07-09 13:50:58,323][26022] Updated weights on worker 0-0, policy_version 276134 (0.00106) [2022-07-09 13:50:58,893][25689] Fps is (10 sec: 5680.1, 60 sec: 5647.3, 300 sec: 5627.1). Total num frames: 282764288. Throughput: 0: 5910.7. Samples: 282764634. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:50:58,893][25689] Avg episode reward: [(0, '-47.277')] [2022-07-09 13:51:00,178][26022] Updated weights on worker 0-0, policy_version 276144 (0.00094) [2022-07-09 13:51:01,077][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:51:01,086][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000276150_282777600.pth [2022-07-09 13:51:01,086][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000274169_280749056.pth [2022-07-09 13:51:01,824][26022] Updated weights on worker 0-0, policy_version 276154 (0.00092) [2022-07-09 13:51:03,919][25689] Fps is (10 sec: 5262.2, 60 sec: 5611.6, 300 sec: 5613.6). Total num frames: 282788864. Throughput: 0: 5819.4. Samples: 282796974. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:03,919][25689] Avg episode reward: [(0, '-48.226')] [2022-07-09 13:51:04,283][26022] Updated weights on worker 0-0, policy_version 276164 (0.00503) [2022-07-09 13:51:05,883][26022] Updated weights on worker 0-0, policy_version 276174 (0.00089) [2022-07-09 13:51:07,767][26022] Updated weights on worker 0-0, policy_version 276184 (0.00087) [2022-07-09 13:51:08,969][25689] Fps is (10 sec: 5386.0, 60 sec: 5607.9, 300 sec: 5619.6). Total num frames: 282818560. Throughput: 0: 4981.8. Samples: 282814144. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:08,970][25689] Avg episode reward: [(0, '-47.762')] [2022-07-09 13:51:09,494][26022] Updated weights on worker 0-0, policy_version 276194 (0.00099) [2022-07-09 13:51:11,378][26022] Updated weights on worker 0-0, policy_version 276204 (0.00086) [2022-07-09 13:51:13,067][26022] Updated weights on worker 0-0, policy_version 276214 (0.00077) [2022-07-09 13:51:13,979][25689] Fps is (10 sec: 5700.2, 60 sec: 5642.0, 300 sec: 5613.5). Total num frames: 282846208. Throughput: 0: 5829.9. Samples: 282848184. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:13,979][25689] Avg episode reward: [(0, '-48.836')] [2022-07-09 13:51:15,122][26022] Updated weights on worker 0-0, policy_version 276224 (0.00092) [2022-07-09 13:51:16,678][26022] Updated weights on worker 0-0, policy_version 276234 (0.00088) [2022-07-09 13:51:18,661][26022] Updated weights on worker 0-0, policy_version 276244 (0.00086) [2022-07-09 13:51:19,106][25689] Fps is (10 sec: 5758.2, 60 sec: 5639.9, 300 sec: 5618.1). Total num frames: 282876928. Throughput: 0: 5836.9. Samples: 282882616. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:19,106][25689] Avg episode reward: [(0, '-49.075')] [2022-07-09 13:51:20,223][26022] Updated weights on worker 0-0, policy_version 276254 (0.00101) [2022-07-09 13:51:22,217][26022] Updated weights on worker 0-0, policy_version 276264 (0.00093) [2022-07-09 13:51:23,905][26022] Updated weights on worker 0-0, policy_version 276274 (0.00089) [2022-07-09 13:51:24,142][25689] Fps is (10 sec: 5742.7, 60 sec: 5627.4, 300 sec: 5617.8). Total num frames: 282904576. Throughput: 0: 5069.7. Samples: 282899502. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:24,143][25689] Avg episode reward: [(0, '-48.948')] [2022-07-09 13:51:25,894][26022] Updated weights on worker 0-0, policy_version 276284 (0.00091) [2022-07-09 13:51:27,524][26022] Updated weights on worker 0-0, policy_version 276294 (0.00092) [2022-07-09 13:51:29,145][25689] Fps is (10 sec: 5712.2, 60 sec: 5651.1, 300 sec: 5624.9). Total num frames: 282934272. Throughput: 0: 5921.4. Samples: 282933610. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:29,145][25689] Avg episode reward: [(0, '-48.849')] [2022-07-09 13:51:29,284][26022] Updated weights on worker 0-0, policy_version 276304 (0.00083) [2022-07-09 13:51:31,161][26022] Updated weights on worker 0-0, policy_version 276314 (0.00090) [2022-07-09 13:51:33,035][26022] Updated weights on worker 0-0, policy_version 276324 (0.00085) [2022-07-09 13:51:34,178][25689] Fps is (10 sec: 5612.0, 60 sec: 5632.2, 300 sec: 5612.7). Total num frames: 282960896. Throughput: 0: 5907.7. Samples: 282967516. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:34,179][25689] Avg episode reward: [(0, '-49.472')] [2022-07-09 13:51:34,831][26022] Updated weights on worker 0-0, policy_version 276334 (0.00083) [2022-07-09 13:51:36,868][26022] Updated weights on worker 0-0, policy_version 276344 (0.00091) [2022-07-09 13:51:38,324][26022] Updated weights on worker 0-0, policy_version 276354 (0.00093) [2022-07-09 13:51:39,285][25689] Fps is (10 sec: 5654.9, 60 sec: 5661.2, 300 sec: 5621.3). Total num frames: 282991616. Throughput: 0: 5057.5. Samples: 282984674. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:39,286][25689] Avg episode reward: [(0, '-49.303')] [2022-07-09 13:51:40,419][26022] Updated weights on worker 0-0, policy_version 276364 (0.00092) [2022-07-09 13:51:41,933][26022] Updated weights on worker 0-0, policy_version 276374 (0.00086) [2022-07-09 13:51:43,926][26022] Updated weights on worker 0-0, policy_version 276384 (0.00086) [2022-07-09 13:51:44,298][25689] Fps is (10 sec: 5767.5, 60 sec: 5648.2, 300 sec: 5621.6). Total num frames: 283019264. Throughput: 0: 5922.9. Samples: 283018882. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:44,299][25689] Avg episode reward: [(0, '-48.930')] [2022-07-09 13:51:45,620][26022] Updated weights on worker 0-0, policy_version 276394 (0.00089) [2022-07-09 13:51:47,390][26022] Updated weights on worker 0-0, policy_version 276404 (0.00094) [2022-07-09 13:51:49,345][25689] Fps is (10 sec: 5496.7, 60 sec: 5614.8, 300 sec: 5613.9). Total num frames: 283046912. Throughput: 0: 5919.0. Samples: 283053178. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:49,347][25689] Avg episode reward: [(0, '-48.540')] [2022-07-09 13:51:49,362][26022] Updated weights on worker 0-0, policy_version 276414 (0.00081) [2022-07-09 13:51:50,813][26022] Updated weights on worker 0-0, policy_version 276424 (0.00085) [2022-07-09 13:51:53,039][26022] Updated weights on worker 0-0, policy_version 276434 (0.00088) [2022-07-09 13:51:54,383][25689] Fps is (10 sec: 5787.9, 60 sec: 5645.4, 300 sec: 5624.8). Total num frames: 283077632. Throughput: 0: 5088.7. Samples: 283070332. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:54,383][25689] Avg episode reward: [(0, '-48.386')] [2022-07-09 13:51:54,619][26022] Updated weights on worker 0-0, policy_version 276444 (0.00080) [2022-07-09 13:51:56,604][26022] Updated weights on worker 0-0, policy_version 276454 (0.00168) [2022-07-09 13:51:58,189][26022] Updated weights on worker 0-0, policy_version 276464 (0.00093) [2022-07-09 13:51:59,444][25689] Fps is (10 sec: 5881.0, 60 sec: 5648.3, 300 sec: 5624.1). Total num frames: 283106304. Throughput: 0: 5948.6. Samples: 283104592. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:51:59,445][25689] Avg episode reward: [(0, '-47.556')] [2022-07-09 13:51:59,972][26022] Updated weights on worker 0-0, policy_version 276474 (0.00088) [2022-07-09 13:52:02,070][26022] Updated weights on worker 0-0, policy_version 276484 (0.00096) [2022-07-09 13:52:04,039][26022] Updated weights on worker 0-0, policy_version 276494 (0.00095) [2022-07-09 13:52:04,483][25689] Fps is (10 sec: 5272.1, 60 sec: 5647.2, 300 sec: 5616.6). Total num frames: 283130880. Throughput: 0: 5833.1. Samples: 283136620. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:04,483][25689] Avg episode reward: [(0, '-46.219')] [2022-07-09 13:52:05,831][26022] Updated weights on worker 0-0, policy_version 276504 (0.00086) [2022-07-09 13:52:07,725][26022] Updated weights on worker 0-0, policy_version 276514 (0.00088) [2022-07-09 13:52:09,215][26022] Updated weights on worker 0-0, policy_version 276524 (0.00085) [2022-07-09 13:52:09,503][25689] Fps is (10 sec: 5497.5, 60 sec: 5666.9, 300 sec: 5626.9). Total num frames: 283161600. Throughput: 0: 4989.3. Samples: 283153750. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:09,503][25689] Avg episode reward: [(0, '-45.411')] [2022-07-09 13:52:11,210][26022] Updated weights on worker 0-0, policy_version 276534 (0.00095) [2022-07-09 13:52:13,144][26022] Updated weights on worker 0-0, policy_version 276544 (0.00092) [2022-07-09 13:52:14,524][25689] Fps is (10 sec: 5812.8, 60 sec: 5665.8, 300 sec: 5622.2). Total num frames: 283189248. Throughput: 0: 5831.4. Samples: 283187784. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:14,524][25689] Avg episode reward: [(0, '-44.983')] [2022-07-09 13:52:14,746][26022] Updated weights on worker 0-0, policy_version 276554 (0.00087) [2022-07-09 13:52:16,663][26022] Updated weights on worker 0-0, policy_version 276564 (0.00082) [2022-07-09 13:52:18,278][26022] Updated weights on worker 0-0, policy_version 276574 (0.00091) [2022-07-09 13:52:19,627][25689] Fps is (10 sec: 5563.0, 60 sec: 5634.3, 300 sec: 5624.1). Total num frames: 283217920. Throughput: 0: 5831.5. Samples: 283222286. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:19,627][25689] Avg episode reward: [(0, '-45.755')] [2022-07-09 13:52:20,111][26022] Updated weights on worker 0-0, policy_version 276584 (0.00095) [2022-07-09 13:52:22,152][26022] Updated weights on worker 0-0, policy_version 276594 (0.00087) [2022-07-09 13:52:23,731][26022] Updated weights on worker 0-0, policy_version 276604 (0.00091) [2022-07-09 13:52:24,638][25689] Fps is (10 sec: 5770.8, 60 sec: 5670.5, 300 sec: 5627.5). Total num frames: 283247616. Throughput: 0: 5943.0. Samples: 283256404. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:24,639][25689] Avg episode reward: [(0, '-44.936')] [2022-07-09 13:52:25,770][26022] Updated weights on worker 0-0, policy_version 276614 (0.00088) [2022-07-09 13:52:27,422][26022] Updated weights on worker 0-0, policy_version 276624 (0.00087) [2022-07-09 13:52:29,157][26022] Updated weights on worker 0-0, policy_version 276634 (0.00087) [2022-07-09 13:52:29,655][25689] Fps is (10 sec: 5820.3, 60 sec: 5652.2, 300 sec: 5628.6). Total num frames: 283276288. Throughput: 0: 5943.3. Samples: 283273522. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:29,656][25689] Avg episode reward: [(0, '-46.044')] [2022-07-09 13:52:31,023][26022] Updated weights on worker 0-0, policy_version 276644 (0.00548) [2022-07-09 13:52:32,695][26022] Updated weights on worker 0-0, policy_version 276654 (0.00087) [2022-07-09 13:52:34,580][26022] Updated weights on worker 0-0, policy_version 276664 (0.00088) [2022-07-09 13:52:34,686][25689] Fps is (10 sec: 5605.3, 60 sec: 5669.3, 300 sec: 5626.2). Total num frames: 283303936. Throughput: 0: 5953.0. Samples: 283307810. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:34,687][25689] Avg episode reward: [(0, '-47.124')] [2022-07-09 13:52:36,625][26022] Updated weights on worker 0-0, policy_version 276674 (0.00086) [2022-07-09 13:52:37,995][26022] Updated weights on worker 0-0, policy_version 276684 (0.00082) [2022-07-09 13:52:39,739][25689] Fps is (10 sec: 5483.3, 60 sec: 5623.6, 300 sec: 5622.2). Total num frames: 283331584. Throughput: 0: 5946.0. Samples: 283341876. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:39,740][25689] Avg episode reward: [(0, '-47.800')] [2022-07-09 13:52:40,323][26022] Updated weights on worker 0-0, policy_version 276694 (0.00083) [2022-07-09 13:52:41,684][26022] Updated weights on worker 0-0, policy_version 276704 (0.00086) [2022-07-09 13:52:43,681][26022] Updated weights on worker 0-0, policy_version 276714 (0.00363) [2022-07-09 13:52:44,781][25689] Fps is (10 sec: 5782.0, 60 sec: 5671.7, 300 sec: 5628.6). Total num frames: 283362304. Throughput: 0: 5093.2. Samples: 283358994. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:44,781][25689] Avg episode reward: [(0, '-48.297')] [2022-07-09 13:52:45,313][26022] Updated weights on worker 0-0, policy_version 276724 (0.00096) [2022-07-09 13:52:47,048][26022] Updated weights on worker 0-0, policy_version 276734 (0.00099) [2022-07-09 13:52:49,079][26022] Updated weights on worker 0-0, policy_version 276744 (0.00089) [2022-07-09 13:52:49,788][25689] Fps is (10 sec: 5910.4, 60 sec: 5692.4, 300 sec: 5629.0). Total num frames: 283390976. Throughput: 0: 5946.4. Samples: 283393242. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:49,789][25689] Avg episode reward: [(0, '-48.728')] [2022-07-09 13:52:50,655][26022] Updated weights on worker 0-0, policy_version 276754 (0.00088) [2022-07-09 13:52:52,529][26022] Updated weights on worker 0-0, policy_version 276764 (0.00088) [2022-07-09 13:52:54,375][26022] Updated weights on worker 0-0, policy_version 276774 (0.00083) [2022-07-09 13:52:54,798][25689] Fps is (10 sec: 5520.0, 60 sec: 5627.2, 300 sec: 5622.8). Total num frames: 283417600. Throughput: 0: 5945.0. Samples: 283427376. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:54,800][25689] Avg episode reward: [(0, '-48.234')] [2022-07-09 13:52:55,980][26022] Updated weights on worker 0-0, policy_version 276784 (0.00093) [2022-07-09 13:52:58,035][26022] Updated weights on worker 0-0, policy_version 276794 (0.00101) [2022-07-09 13:52:59,741][26022] Updated weights on worker 0-0, policy_version 276804 (0.01104) [2022-07-09 13:52:59,890][25689] Fps is (10 sec: 5676.9, 60 sec: 5658.3, 300 sec: 5638.8). Total num frames: 283448320. Throughput: 0: 5093.9. Samples: 283444518. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:52:59,890][25689] Avg episode reward: [(0, '-48.135')] [2022-07-09 13:53:01,405][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:53:01,416][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000276813_283456512.pth [2022-07-09 13:53:01,424][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000274831_281426944.pth [2022-07-09 13:53:01,533][26022] Updated weights on worker 0-0, policy_version 276814 (0.00053) [2022-07-09 13:53:03,782][26022] Updated weights on worker 0-0, policy_version 276824 (0.00093) [2022-07-09 13:53:04,940][25689] Fps is (10 sec: 5755.1, 60 sec: 5707.9, 300 sec: 5631.0). Total num frames: 283475968. Throughput: 0: 5832.1. Samples: 283476566. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:53:04,943][25689] Avg episode reward: [(0, '-47.328')] [2022-07-09 13:53:05,530][26022] Updated weights on worker 0-0, policy_version 276834 (0.00090) [2022-07-09 13:53:07,335][26022] Updated weights on worker 0-0, policy_version 276844 (0.00088) [2022-07-09 13:53:09,176][26022] Updated weights on worker 0-0, policy_version 276854 (0.00095) [2022-07-09 13:53:09,972][25689] Fps is (10 sec: 5281.0, 60 sec: 5622.1, 300 sec: 5620.7). Total num frames: 283501568. Throughput: 0: 5824.9. Samples: 283510812. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 13:53:09,973][25689] Avg episode reward: [(0, '-47.214')] [2022-07-09 13:53:10,791][26022] Updated weights on worker 0-0, policy_version 276864 (0.00094) [2022-07-09 13:53:12,968][26022] Updated weights on worker 0-0, policy_version 276874 (0.00519) [2022-07-09 13:53:14,569][26022] Updated weights on worker 0-0, policy_version 276884 (0.00693) [2022-07-09 13:53:15,064][25689] Fps is (10 sec: 5462.1, 60 sec: 5649.4, 300 sec: 5627.7). Total num frames: 283531264. Throughput: 0: 4961.6. Samples: 283527926. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:15,064][25689] Avg episode reward: [(0, '-47.528')] [2022-07-09 13:53:16,404][26022] Updated weights on worker 0-0, policy_version 276894 (0.00086) [2022-07-09 13:53:18,227][26022] Updated weights on worker 0-0, policy_version 276904 (0.00093) [2022-07-09 13:53:19,900][26022] Updated weights on worker 0-0, policy_version 276914 (0.00089) [2022-07-09 13:53:20,129][25689] Fps is (10 sec: 5847.9, 60 sec: 5669.9, 300 sec: 5634.0). Total num frames: 283560960. Throughput: 0: 5804.1. Samples: 283561988. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:20,129][25689] Avg episode reward: [(0, '-48.173')] [2022-07-09 13:53:21,782][26022] Updated weights on worker 0-0, policy_version 276924 (0.00087) [2022-07-09 13:53:23,531][26022] Updated weights on worker 0-0, policy_version 276934 (0.00081) [2022-07-09 13:53:25,151][25689] Fps is (10 sec: 5684.6, 60 sec: 5635.0, 300 sec: 5635.2). Total num frames: 283588608. Throughput: 0: 5920.1. Samples: 283596218. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:25,152][25689] Avg episode reward: [(0, '-48.714')] [2022-07-09 13:53:25,348][26022] Updated weights on worker 0-0, policy_version 276944 (0.00088) [2022-07-09 13:53:27,142][26022] Updated weights on worker 0-0, policy_version 276954 (0.00090) [2022-07-09 13:53:28,940][26022] Updated weights on worker 0-0, policy_version 276964 (0.00093) [2022-07-09 13:53:30,179][25689] Fps is (10 sec: 5705.8, 60 sec: 5650.9, 300 sec: 5645.7). Total num frames: 283618304. Throughput: 0: 5072.5. Samples: 283613308. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:30,182][25689] Avg episode reward: [(0, '-49.098')] [2022-07-09 13:53:30,705][26022] Updated weights on worker 0-0, policy_version 276974 (0.00620) [2022-07-09 13:53:32,614][26022] Updated weights on worker 0-0, policy_version 276984 (0.00081) [2022-07-09 13:53:34,499][26022] Updated weights on worker 0-0, policy_version 276994 (0.00085) [2022-07-09 13:53:35,267][25689] Fps is (10 sec: 5668.9, 60 sec: 5645.6, 300 sec: 5641.6). Total num frames: 283645952. Throughput: 0: 5902.0. Samples: 283647164. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:35,267][25689] Avg episode reward: [(0, '-49.349')] [2022-07-09 13:53:36,198][26022] Updated weights on worker 0-0, policy_version 277004 (0.00091) [2022-07-09 13:53:38,191][26022] Updated weights on worker 0-0, policy_version 277014 (0.00087) [2022-07-09 13:53:39,624][26022] Updated weights on worker 0-0, policy_version 277024 (0.00092) [2022-07-09 13:53:40,332][25689] Fps is (10 sec: 5546.9, 60 sec: 5661.4, 300 sec: 5637.3). Total num frames: 283674624. Throughput: 0: 5902.8. Samples: 283681244. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:40,333][25689] Avg episode reward: [(0, '-48.192')] [2022-07-09 13:53:41,701][26022] Updated weights on worker 0-0, policy_version 277034 (0.00087) [2022-07-09 13:53:43,292][26022] Updated weights on worker 0-0, policy_version 277044 (0.00094) [2022-07-09 13:53:45,203][26022] Updated weights on worker 0-0, policy_version 277054 (0.00086) [2022-07-09 13:53:45,383][25689] Fps is (10 sec: 5769.8, 60 sec: 5643.6, 300 sec: 5646.9). Total num frames: 283704320. Throughput: 0: 5054.8. Samples: 283698484. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:45,383][25689] Avg episode reward: [(0, '-47.810')] [2022-07-09 13:53:46,940][26022] Updated weights on worker 0-0, policy_version 277064 (0.00083) [2022-07-09 13:53:48,771][26022] Updated weights on worker 0-0, policy_version 277074 (0.00081) [2022-07-09 13:53:50,435][25689] Fps is (10 sec: 5676.0, 60 sec: 5622.6, 300 sec: 5643.0). Total num frames: 283731968. Throughput: 0: 5881.9. Samples: 283732452. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:50,435][25689] Avg episode reward: [(0, '-47.694')] [2022-07-09 13:53:50,723][26022] Updated weights on worker 0-0, policy_version 277084 (0.00096) [2022-07-09 13:53:52,558][26022] Updated weights on worker 0-0, policy_version 277094 (0.00088) [2022-07-09 13:53:54,434][26022] Updated weights on worker 0-0, policy_version 277104 (0.00082) [2022-07-09 13:53:55,483][25689] Fps is (10 sec: 5576.3, 60 sec: 5652.8, 300 sec: 5642.8). Total num frames: 283760640. Throughput: 0: 5878.7. Samples: 283766008. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:53:55,483][25689] Avg episode reward: [(0, '-47.401')] [2022-07-09 13:53:56,180][26022] Updated weights on worker 0-0, policy_version 277114 (0.00089) [2022-07-09 13:53:57,908][26022] Updated weights on worker 0-0, policy_version 277124 (0.00054) [2022-07-09 13:53:59,687][26022] Updated weights on worker 0-0, policy_version 277134 (0.00083) [2022-07-09 13:54:00,525][25689] Fps is (10 sec: 5682.9, 60 sec: 5623.6, 300 sec: 5649.6). Total num frames: 283789312. Throughput: 0: 5042.4. Samples: 283783068. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:00,526][25689] Avg episode reward: [(0, '-48.499')] [2022-07-09 13:54:01,599][26022] Updated weights on worker 0-0, policy_version 277144 (0.00091) [2022-07-09 13:54:03,915][26022] Updated weights on worker 0-0, policy_version 277154 (0.00088) [2022-07-09 13:54:05,477][26022] Updated weights on worker 0-0, policy_version 277164 (0.00089) [2022-07-09 13:54:05,545][25689] Fps is (10 sec: 5597.0, 60 sec: 5626.5, 300 sec: 5649.4). Total num frames: 283816960. Throughput: 0: 5777.2. Samples: 283814966. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:05,546][25689] Avg episode reward: [(0, '-48.325')] [2022-07-09 13:54:07,506][26022] Updated weights on worker 0-0, policy_version 277174 (0.00095) [2022-07-09 13:54:09,140][26022] Updated weights on worker 0-0, policy_version 277184 (0.00090) [2022-07-09 13:54:10,551][25689] Fps is (10 sec: 5413.3, 60 sec: 5645.8, 300 sec: 5639.2). Total num frames: 283843584. Throughput: 0: 5789.7. Samples: 283848920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:10,552][25689] Avg episode reward: [(0, '-48.401')] [2022-07-09 13:54:11,174][26022] Updated weights on worker 0-0, policy_version 277194 (0.00086) [2022-07-09 13:54:12,736][26022] Updated weights on worker 0-0, policy_version 277204 (0.00092) [2022-07-09 13:54:14,696][26022] Updated weights on worker 0-0, policy_version 277214 (0.00085) [2022-07-09 13:54:15,555][25689] Fps is (10 sec: 5421.8, 60 sec: 5620.1, 300 sec: 5633.5). Total num frames: 283871232. Throughput: 0: 4975.9. Samples: 283865886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:15,556][25689] Avg episode reward: [(0, '-49.003')] [2022-07-09 13:54:16,547][26022] Updated weights on worker 0-0, policy_version 277224 (0.00092) [2022-07-09 13:54:18,396][26022] Updated weights on worker 0-0, policy_version 277234 (0.00094) [2022-07-09 13:54:20,108][26022] Updated weights on worker 0-0, policy_version 277244 (0.00090) [2022-07-09 13:54:20,638][25689] Fps is (10 sec: 5786.3, 60 sec: 5635.3, 300 sec: 5642.4). Total num frames: 283901952. Throughput: 0: 5807.7. Samples: 283899878. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:20,641][25689] Avg episode reward: [(0, '-48.959')] [2022-07-09 13:54:22,142][26022] Updated weights on worker 0-0, policy_version 277254 (0.00087) [2022-07-09 13:54:23,497][26022] Updated weights on worker 0-0, policy_version 277264 (0.00091) [2022-07-09 13:54:25,713][25689] Fps is (10 sec: 5645.6, 60 sec: 5613.6, 300 sec: 5638.1). Total num frames: 283928576. Throughput: 0: 5895.7. Samples: 283933866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:25,713][25689] Avg episode reward: [(0, '-49.170')] [2022-07-09 13:54:25,718][26022] Updated weights on worker 0-0, policy_version 277274 (0.00092) [2022-07-09 13:54:27,191][26022] Updated weights on worker 0-0, policy_version 277284 (0.00096) [2022-07-09 13:54:29,288][26022] Updated weights on worker 0-0, policy_version 277294 (0.00088) [2022-07-09 13:54:30,753][25689] Fps is (10 sec: 5467.1, 60 sec: 5595.5, 300 sec: 5637.4). Total num frames: 283957248. Throughput: 0: 5042.1. Samples: 283950778. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:30,753][25689] Avg episode reward: [(0, '-49.279')] [2022-07-09 13:54:30,992][26022] Updated weights on worker 0-0, policy_version 277304 (0.00099) [2022-07-09 13:54:32,785][26022] Updated weights on worker 0-0, policy_version 277314 (0.00089) [2022-07-09 13:54:34,699][26022] Updated weights on worker 0-0, policy_version 277324 (0.00083) [2022-07-09 13:54:35,795][25689] Fps is (10 sec: 5687.5, 60 sec: 5616.7, 300 sec: 5635.2). Total num frames: 283985920. Throughput: 0: 5847.8. Samples: 283984244. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:35,796][25689] Avg episode reward: [(0, '-48.817')] [2022-07-09 13:54:36,295][26022] Updated weights on worker 0-0, policy_version 277334 (0.00083) [2022-07-09 13:54:38,195][26022] Updated weights on worker 0-0, policy_version 277344 (0.00095) [2022-07-09 13:54:40,262][26022] Updated weights on worker 0-0, policy_version 277354 (0.00092) [2022-07-09 13:54:40,849][25689] Fps is (10 sec: 5578.1, 60 sec: 5600.8, 300 sec: 5631.2). Total num frames: 284013568. Throughput: 0: 5865.0. Samples: 284018414. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:40,850][25689] Avg episode reward: [(0, '-48.525')] [2022-07-09 13:54:41,742][26022] Updated weights on worker 0-0, policy_version 277364 (0.01262) [2022-07-09 13:54:43,843][26022] Updated weights on worker 0-0, policy_version 277374 (0.00091) [2022-07-09 13:54:45,399][26022] Updated weights on worker 0-0, policy_version 277384 (0.00082) [2022-07-09 13:54:45,855][25689] Fps is (10 sec: 5700.4, 60 sec: 5605.0, 300 sec: 5638.1). Total num frames: 284043264. Throughput: 0: 5884.5. Samples: 284052392. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:45,855][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 13:54:47,400][26022] Updated weights on worker 0-0, policy_version 277394 (0.00093) [2022-07-09 13:54:49,162][26022] Updated weights on worker 0-0, policy_version 277404 (0.00091) [2022-07-09 13:54:50,856][25689] Fps is (10 sec: 5730.5, 60 sec: 5609.7, 300 sec: 5629.2). Total num frames: 284070912. Throughput: 0: 5898.6. Samples: 284069360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:50,857][25689] Avg episode reward: [(0, '-47.796')] [2022-07-09 13:54:50,870][26022] Updated weights on worker 0-0, policy_version 277414 (0.00099) [2022-07-09 13:54:52,686][26022] Updated weights on worker 0-0, policy_version 277424 (0.00092) [2022-07-09 13:54:54,684][26022] Updated weights on worker 0-0, policy_version 277434 (0.00090) [2022-07-09 13:54:55,913][25689] Fps is (10 sec: 5599.3, 60 sec: 5608.8, 300 sec: 5636.4). Total num frames: 284099584. Throughput: 0: 5939.8. Samples: 284103742. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:54:55,914][25689] Avg episode reward: [(0, '-47.441')] [2022-07-09 13:54:56,216][26022] Updated weights on worker 0-0, policy_version 277444 (0.00093) [2022-07-09 13:54:58,296][26022] Updated weights on worker 0-0, policy_version 277454 (0.00094) [2022-07-09 13:54:59,735][26022] Updated weights on worker 0-0, policy_version 277464 (0.00091) [2022-07-09 13:55:00,963][25689] Fps is (10 sec: 5572.6, 60 sec: 5591.3, 300 sec: 5639.1). Total num frames: 284127232. Throughput: 0: 5932.7. Samples: 284137740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:00,963][25689] Avg episode reward: [(0, '-47.323')] [2022-07-09 13:55:01,477][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:55:01,489][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000277472_284131328.pth [2022-07-09 13:55:01,490][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000275487_282098688.pth [2022-07-09 13:55:02,215][26022] Updated weights on worker 0-0, policy_version 277474 (0.00096) [2022-07-09 13:55:03,955][26022] Updated weights on worker 0-0, policy_version 277484 (0.00055) [2022-07-09 13:55:05,709][26022] Updated weights on worker 0-0, policy_version 277494 (0.00080) [2022-07-09 13:55:05,987][25689] Fps is (10 sec: 5590.6, 60 sec: 5607.8, 300 sec: 5635.4). Total num frames: 284155904. Throughput: 0: 4971.9. Samples: 284152488. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:05,988][25689] Avg episode reward: [(0, '-47.784')] [2022-07-09 13:55:07,610][26022] Updated weights on worker 0-0, policy_version 277504 (0.00087) [2022-07-09 13:55:09,454][26022] Updated weights on worker 0-0, policy_version 277514 (0.00091) [2022-07-09 13:55:11,007][25689] Fps is (10 sec: 5607.3, 60 sec: 5623.4, 300 sec: 5642.1). Total num frames: 284183552. Throughput: 0: 5822.8. Samples: 284186692. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:11,007][25689] Avg episode reward: [(0, '-48.116')] [2022-07-09 13:55:11,020][26022] Updated weights on worker 0-0, policy_version 277524 (0.00088) [2022-07-09 13:55:13,293][26022] Updated weights on worker 0-0, policy_version 277534 (0.00086) [2022-07-09 13:55:14,565][26022] Updated weights on worker 0-0, policy_version 277544 (0.00086) [2022-07-09 13:55:16,022][25689] Fps is (10 sec: 5408.3, 60 sec: 5605.4, 300 sec: 5630.0). Total num frames: 284210176. Throughput: 0: 5810.4. Samples: 284220584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:16,023][25689] Avg episode reward: [(0, '-47.758')] [2022-07-09 13:55:16,818][26022] Updated weights on worker 0-0, policy_version 277554 (0.00090) [2022-07-09 13:55:18,154][26022] Updated weights on worker 0-0, policy_version 277564 (0.00089) [2022-07-09 13:55:20,362][26022] Updated weights on worker 0-0, policy_version 277574 (0.00087) [2022-07-09 13:55:21,067][25689] Fps is (10 sec: 5700.3, 60 sec: 5609.0, 300 sec: 5637.7). Total num frames: 284240896. Throughput: 0: 4963.0. Samples: 284237516. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:21,067][25689] Avg episode reward: [(0, '-46.886')] [2022-07-09 13:55:22,174][26022] Updated weights on worker 0-0, policy_version 277584 (0.00093) [2022-07-09 13:55:24,018][26022] Updated weights on worker 0-0, policy_version 277594 (0.00079) [2022-07-09 13:55:25,585][26022] Updated weights on worker 0-0, policy_version 277604 (0.00086) [2022-07-09 13:55:26,071][25689] Fps is (10 sec: 5808.2, 60 sec: 5632.5, 300 sec: 5635.5). Total num frames: 284268544. Throughput: 0: 5926.2. Samples: 284271510. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:26,072][25689] Avg episode reward: [(0, '-46.459')] [2022-07-09 13:55:27,815][26022] Updated weights on worker 0-0, policy_version 277614 (0.00090) [2022-07-09 13:55:29,273][26022] Updated weights on worker 0-0, policy_version 277624 (0.00086) [2022-07-09 13:55:31,087][25689] Fps is (10 sec: 5416.4, 60 sec: 5600.8, 300 sec: 5632.1). Total num frames: 284295168. Throughput: 0: 5923.5. Samples: 284305636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:31,087][25689] Avg episode reward: [(0, '-46.615')] [2022-07-09 13:55:31,254][26022] Updated weights on worker 0-0, policy_version 277634 (0.00090) [2022-07-09 13:55:33,035][26022] Updated weights on worker 0-0, policy_version 277644 (0.00093) [2022-07-09 13:55:34,628][26022] Updated weights on worker 0-0, policy_version 277654 (0.00094) [2022-07-09 13:55:36,098][25689] Fps is (10 sec: 5617.0, 60 sec: 5620.7, 300 sec: 5636.3). Total num frames: 284324864. Throughput: 0: 5087.0. Samples: 284322710. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 13:55:36,099][25689] Avg episode reward: [(0, '-46.759')] [2022-07-09 13:55:36,790][26022] Updated weights on worker 0-0, policy_version 277664 (0.00082) [2022-07-09 13:55:38,288][26022] Updated weights on worker 0-0, policy_version 277674 (0.00088) [2022-07-09 13:55:40,111][26022] Updated weights on worker 0-0, policy_version 277684 (0.00088) [2022-07-09 13:55:41,217][25689] Fps is (10 sec: 5862.7, 60 sec: 5648.6, 300 sec: 5638.6). Total num frames: 284354560. Throughput: 0: 5909.1. Samples: 284356588. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:55:41,218][25689] Avg episode reward: [(0, '-46.780')] [2022-07-09 13:55:42,052][26022] Updated weights on worker 0-0, policy_version 277694 (0.00083) [2022-07-09 13:55:43,756][26022] Updated weights on worker 0-0, policy_version 277704 (0.00086) [2022-07-09 13:55:45,685][26022] Updated weights on worker 0-0, policy_version 277714 (0.00094) [2022-07-09 13:55:46,241][25689] Fps is (10 sec: 5653.8, 60 sec: 5613.0, 300 sec: 5632.2). Total num frames: 284382208. Throughput: 0: 5906.4. Samples: 284390638. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:55:46,241][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 13:55:47,371][26022] Updated weights on worker 0-0, policy_version 277724 (0.00088) [2022-07-09 13:55:49,048][26022] Updated weights on worker 0-0, policy_version 277734 (0.00087) [2022-07-09 13:55:51,133][26022] Updated weights on worker 0-0, policy_version 277744 (0.00087) [2022-07-09 13:55:51,247][25689] Fps is (10 sec: 5512.9, 60 sec: 5612.5, 300 sec: 5628.7). Total num frames: 284409856. Throughput: 0: 5063.7. Samples: 284407724. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:55:51,248][25689] Avg episode reward: [(0, '-47.867')] [2022-07-09 13:55:52,769][26022] Updated weights on worker 0-0, policy_version 277754 (0.00093) [2022-07-09 13:55:54,764][26022] Updated weights on worker 0-0, policy_version 277764 (0.00091) [2022-07-09 13:55:56,260][25689] Fps is (10 sec: 5723.5, 60 sec: 5633.6, 300 sec: 5633.7). Total num frames: 284439552. Throughput: 0: 5901.4. Samples: 284441692. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:55:56,260][25689] Avg episode reward: [(0, '-48.096')] [2022-07-09 13:55:56,542][26022] Updated weights on worker 0-0, policy_version 277774 (0.00083) [2022-07-09 13:55:58,397][26022] Updated weights on worker 0-0, policy_version 277784 (0.00081) [2022-07-09 13:56:00,158][26022] Updated weights on worker 0-0, policy_version 277794 (0.00085) [2022-07-09 13:56:01,373][25689] Fps is (10 sec: 5663.0, 60 sec: 5627.6, 300 sec: 5642.3). Total num frames: 284467200. Throughput: 0: 5913.2. Samples: 284475776. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:01,374][25689] Avg episode reward: [(0, '-47.730')] [2022-07-09 13:56:01,855][26022] Updated weights on worker 0-0, policy_version 277804 (0.00089) [2022-07-09 13:56:04,005][26022] Updated weights on worker 0-0, policy_version 277814 (0.00092) [2022-07-09 13:56:05,757][26022] Updated weights on worker 0-0, policy_version 277824 (0.00082) [2022-07-09 13:56:06,375][25689] Fps is (10 sec: 5365.5, 60 sec: 5595.9, 300 sec: 5632.9). Total num frames: 284493824. Throughput: 0: 4979.1. Samples: 284490888. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:06,376][25689] Avg episode reward: [(0, '-47.874')] [2022-07-09 13:56:07,791][26022] Updated weights on worker 0-0, policy_version 277834 (0.00086) [2022-07-09 13:56:09,680][26022] Updated weights on worker 0-0, policy_version 277844 (0.00092) [2022-07-09 13:56:11,258][26022] Updated weights on worker 0-0, policy_version 277854 (0.00082) [2022-07-09 13:56:11,415][25689] Fps is (10 sec: 5506.9, 60 sec: 5610.9, 300 sec: 5635.8). Total num frames: 284522496. Throughput: 0: 5794.1. Samples: 284524576. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:11,418][25689] Avg episode reward: [(0, '-47.525')] [2022-07-09 13:56:13,129][26022] Updated weights on worker 0-0, policy_version 277864 (0.00087) [2022-07-09 13:56:14,840][26022] Updated weights on worker 0-0, policy_version 277874 (0.00085) [2022-07-09 13:56:16,435][25689] Fps is (10 sec: 5700.3, 60 sec: 5644.4, 300 sec: 5630.9). Total num frames: 284551168. Throughput: 0: 5805.8. Samples: 284558824. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:16,435][25689] Avg episode reward: [(0, '-47.377')] [2022-07-09 13:56:16,716][26022] Updated weights on worker 0-0, policy_version 277884 (0.00083) [2022-07-09 13:56:18,654][26022] Updated weights on worker 0-0, policy_version 277894 (0.00085) [2022-07-09 13:56:20,406][26022] Updated weights on worker 0-0, policy_version 277904 (0.00094) [2022-07-09 13:56:21,476][25689] Fps is (10 sec: 5699.7, 60 sec: 5610.8, 300 sec: 5634.3). Total num frames: 284579840. Throughput: 0: 4984.9. Samples: 284575982. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:21,476][25689] Avg episode reward: [(0, '-47.498')] [2022-07-09 13:56:22,173][26022] Updated weights on worker 0-0, policy_version 277914 (0.00084) [2022-07-09 13:56:24,023][26022] Updated weights on worker 0-0, policy_version 277924 (0.00084) [2022-07-09 13:56:25,692][26022] Updated weights on worker 0-0, policy_version 277934 (0.00091) [2022-07-09 13:56:26,485][25689] Fps is (10 sec: 5705.4, 60 sec: 5627.3, 300 sec: 5630.7). Total num frames: 284608512. Throughput: 0: 5922.3. Samples: 284609990. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:26,486][25689] Avg episode reward: [(0, '-47.377')] [2022-07-09 13:56:27,644][26022] Updated weights on worker 0-0, policy_version 277944 (0.00088) [2022-07-09 13:56:29,360][26022] Updated weights on worker 0-0, policy_version 277954 (0.00086) [2022-07-09 13:56:31,190][26022] Updated weights on worker 0-0, policy_version 277964 (0.00089) [2022-07-09 13:56:31,498][25689] Fps is (10 sec: 5721.7, 60 sec: 5661.5, 300 sec: 5638.0). Total num frames: 284637184. Throughput: 0: 5936.8. Samples: 284643806. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:31,498][25689] Avg episode reward: [(0, '-47.758')] [2022-07-09 13:56:32,884][26022] Updated weights on worker 0-0, policy_version 277974 (0.00090) [2022-07-09 13:56:34,629][26022] Updated weights on worker 0-0, policy_version 277984 (0.00093) [2022-07-09 13:56:36,490][26022] Updated weights on worker 0-0, policy_version 277994 (0.00095) [2022-07-09 13:56:36,547][25689] Fps is (10 sec: 5699.6, 60 sec: 5641.0, 300 sec: 5632.2). Total num frames: 284665856. Throughput: 0: 5078.3. Samples: 284660958. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:36,547][25689] Avg episode reward: [(0, '-48.843')] [2022-07-09 13:56:38,363][26022] Updated weights on worker 0-0, policy_version 278004 (0.00100) [2022-07-09 13:56:40,218][26022] Updated weights on worker 0-0, policy_version 278014 (0.00089) [2022-07-09 13:56:41,692][25689] Fps is (10 sec: 5625.4, 60 sec: 5621.7, 300 sec: 5633.2). Total num frames: 284694528. Throughput: 0: 5874.3. Samples: 284694738. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:41,692][25689] Avg episode reward: [(0, '-48.773')] [2022-07-09 13:56:41,910][26022] Updated weights on worker 0-0, policy_version 278024 (0.00098) [2022-07-09 13:56:43,838][26022] Updated weights on worker 0-0, policy_version 278034 (0.00095) [2022-07-09 13:56:45,530][26022] Updated weights on worker 0-0, policy_version 278044 (0.00092) [2022-07-09 13:56:46,708][25689] Fps is (10 sec: 5542.7, 60 sec: 5622.4, 300 sec: 5633.7). Total num frames: 284722176. Throughput: 0: 5882.9. Samples: 284728956. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:46,709][25689] Avg episode reward: [(0, '-48.157')] [2022-07-09 13:56:47,439][26022] Updated weights on worker 0-0, policy_version 278054 (0.00082) [2022-07-09 13:56:49,179][26022] Updated weights on worker 0-0, policy_version 278064 (0.00087) [2022-07-09 13:56:51,028][26022] Updated weights on worker 0-0, policy_version 278074 (0.00084) [2022-07-09 13:56:51,728][25689] Fps is (10 sec: 5509.7, 60 sec: 5621.1, 300 sec: 5623.7). Total num frames: 284749824. Throughput: 0: 5044.2. Samples: 284745850. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:51,728][25689] Avg episode reward: [(0, '-47.651')] [2022-07-09 13:56:52,852][26022] Updated weights on worker 0-0, policy_version 278084 (0.00098) [2022-07-09 13:56:54,655][26022] Updated weights on worker 0-0, policy_version 278094 (0.00372) [2022-07-09 13:56:56,379][26022] Updated weights on worker 0-0, policy_version 278104 (0.00087) [2022-07-09 13:56:56,811][25689] Fps is (10 sec: 5878.5, 60 sec: 5648.4, 300 sec: 5633.7). Total num frames: 284781568. Throughput: 0: 5883.2. Samples: 284780178. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:56:56,812][25689] Avg episode reward: [(0, '-47.549')] [2022-07-09 13:56:58,581][26022] Updated weights on worker 0-0, policy_version 278114 (0.00085) [2022-07-09 13:56:59,882][26022] Updated weights on worker 0-0, policy_version 278124 (0.00095) [2022-07-09 13:57:01,532][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:57:01,546][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000278132_284807168.pth [2022-07-09 13:57:01,546][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000276150_282777600.pth [2022-07-09 13:57:01,893][25689] Fps is (10 sec: 5741.9, 60 sec: 5634.4, 300 sec: 5639.7). Total num frames: 284808192. Throughput: 0: 5915.8. Samples: 284814246. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:01,894][25689] Avg episode reward: [(0, '-46.611')] [2022-07-09 13:57:02,619][26022] Updated weights on worker 0-0, policy_version 278134 (0.00088) [2022-07-09 13:57:03,872][26022] Updated weights on worker 0-0, policy_version 278144 (0.00084) [2022-07-09 13:57:06,053][26022] Updated weights on worker 0-0, policy_version 278154 (0.00099) [2022-07-09 13:57:06,962][25689] Fps is (10 sec: 5245.8, 60 sec: 5628.1, 300 sec: 5625.0). Total num frames: 284834816. Throughput: 0: 4940.8. Samples: 284829026. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:06,962][25689] Avg episode reward: [(0, '-46.727')] [2022-07-09 13:57:07,636][26022] Updated weights on worker 0-0, policy_version 278164 (0.00093) [2022-07-09 13:57:09,521][26022] Updated weights on worker 0-0, policy_version 278174 (0.00087) [2022-07-09 13:57:11,235][26022] Updated weights on worker 0-0, policy_version 278184 (0.00090) [2022-07-09 13:57:12,004][25689] Fps is (10 sec: 5468.8, 60 sec: 5627.9, 300 sec: 5628.1). Total num frames: 284863488. Throughput: 0: 5774.7. Samples: 284862940. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:12,005][25689] Avg episode reward: [(0, '-47.463')] [2022-07-09 13:57:13,006][26022] Updated weights on worker 0-0, policy_version 278194 (0.00084) [2022-07-09 13:57:14,969][26022] Updated weights on worker 0-0, policy_version 278204 (0.00082) [2022-07-09 13:57:16,646][26022] Updated weights on worker 0-0, policy_version 278214 (0.00093) [2022-07-09 13:57:17,047][25689] Fps is (10 sec: 5787.6, 60 sec: 5642.7, 300 sec: 5632.7). Total num frames: 284893184. Throughput: 0: 5784.3. Samples: 284897228. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:17,047][25689] Avg episode reward: [(0, '-48.171')] [2022-07-09 13:57:18,601][26022] Updated weights on worker 0-0, policy_version 278224 (0.00085) [2022-07-09 13:57:20,303][26022] Updated weights on worker 0-0, policy_version 278234 (0.00087) [2022-07-09 13:57:22,132][25689] Fps is (10 sec: 5763.1, 60 sec: 5638.6, 300 sec: 5627.8). Total num frames: 284921856. Throughput: 0: 4945.6. Samples: 284914340. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:22,132][25689] Avg episode reward: [(0, '-49.250')] [2022-07-09 13:57:22,139][26022] Updated weights on worker 0-0, policy_version 278244 (0.00054) [2022-07-09 13:57:23,894][26022] Updated weights on worker 0-0, policy_version 278254 (0.00664) [2022-07-09 13:57:25,504][26022] Updated weights on worker 0-0, policy_version 278264 (0.00087) [2022-07-09 13:57:27,137][25689] Fps is (10 sec: 5683.2, 60 sec: 5639.0, 300 sec: 5628.0). Total num frames: 284950528. Throughput: 0: 5944.0. Samples: 284948948. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:27,137][25689] Avg episode reward: [(0, '-48.949')] [2022-07-09 13:57:27,406][26022] Updated weights on worker 0-0, policy_version 278274 (0.00619) [2022-07-09 13:57:29,214][26022] Updated weights on worker 0-0, policy_version 278284 (0.00090) [2022-07-09 13:57:31,141][26022] Updated weights on worker 0-0, policy_version 278294 (0.01049) [2022-07-09 13:57:32,166][25689] Fps is (10 sec: 5715.1, 60 sec: 5637.5, 300 sec: 5631.5). Total num frames: 284979200. Throughput: 0: 5953.9. Samples: 284982982. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:32,166][25689] Avg episode reward: [(0, '-49.382')] [2022-07-09 13:57:32,918][26022] Updated weights on worker 0-0, policy_version 278304 (0.00093) [2022-07-09 13:57:34,676][26022] Updated weights on worker 0-0, policy_version 278314 (0.00090) [2022-07-09 13:57:36,524][26022] Updated weights on worker 0-0, policy_version 278324 (0.00091) [2022-07-09 13:57:37,256][25689] Fps is (10 sec: 5666.8, 60 sec: 5633.6, 300 sec: 5634.3). Total num frames: 285007872. Throughput: 0: 5920.7. Samples: 285016882. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:37,257][25689] Avg episode reward: [(0, '-48.851')] [2022-07-09 13:57:38,146][26022] Updated weights on worker 0-0, policy_version 278334 (0.00087) [2022-07-09 13:57:40,207][26022] Updated weights on worker 0-0, policy_version 278344 (0.00084) [2022-07-09 13:57:41,743][26022] Updated weights on worker 0-0, policy_version 278354 (0.00088) [2022-07-09 13:57:42,369][25689] Fps is (10 sec: 5620.0, 60 sec: 5636.6, 300 sec: 5626.0). Total num frames: 285036544. Throughput: 0: 5913.8. Samples: 285034020. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:42,370][25689] Avg episode reward: [(0, '-48.258')] [2022-07-09 13:57:43,770][26022] Updated weights on worker 0-0, policy_version 278364 (0.00087) [2022-07-09 13:57:45,451][26022] Updated weights on worker 0-0, policy_version 278374 (0.00079) [2022-07-09 13:57:47,291][26022] Updated weights on worker 0-0, policy_version 278384 (0.00094) [2022-07-09 13:57:47,394][25689] Fps is (10 sec: 5757.7, 60 sec: 5669.6, 300 sec: 5629.1). Total num frames: 285066240. Throughput: 0: 5890.0. Samples: 285068260. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:47,394][25689] Avg episode reward: [(0, '-48.209')] [2022-07-09 13:57:49,349][26022] Updated weights on worker 0-0, policy_version 278394 (0.00089) [2022-07-09 13:57:50,798][26022] Updated weights on worker 0-0, policy_version 278404 (0.00089) [2022-07-09 13:57:52,448][25689] Fps is (10 sec: 5587.9, 60 sec: 5649.5, 300 sec: 5628.3). Total num frames: 285092864. Throughput: 0: 5887.5. Samples: 285102394. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:52,449][25689] Avg episode reward: [(0, '-47.954')] [2022-07-09 13:57:52,826][26022] Updated weights on worker 0-0, policy_version 278414 (0.00093) [2022-07-09 13:57:54,363][26022] Updated weights on worker 0-0, policy_version 278424 (0.00088) [2022-07-09 13:57:56,404][26022] Updated weights on worker 0-0, policy_version 278434 (0.00087) [2022-07-09 13:57:57,474][25689] Fps is (10 sec: 5586.9, 60 sec: 5621.1, 300 sec: 5626.1). Total num frames: 285122560. Throughput: 0: 5078.5. Samples: 285119560. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 13:57:57,475][25689] Avg episode reward: [(0, '-47.752')] [2022-07-09 13:57:58,062][26022] Updated weights on worker 0-0, policy_version 278444 (0.00090) [2022-07-09 13:57:59,975][26022] Updated weights on worker 0-0, policy_version 278454 (0.00084) [2022-07-09 13:58:01,819][26022] Updated weights on worker 0-0, policy_version 278464 (0.00085) [2022-07-09 13:58:02,539][25689] Fps is (10 sec: 5581.4, 60 sec: 5622.7, 300 sec: 5622.4). Total num frames: 285149184. Throughput: 0: 5925.5. Samples: 285153534. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:02,539][25689] Avg episode reward: [(0, '-47.219')] [2022-07-09 13:58:04,002][26022] Updated weights on worker 0-0, policy_version 278474 (0.00088) [2022-07-09 13:58:05,601][26022] Updated weights on worker 0-0, policy_version 278484 (0.00092) [2022-07-09 13:58:07,553][25689] Fps is (10 sec: 5486.6, 60 sec: 5661.6, 300 sec: 5633.1). Total num frames: 285177856. Throughput: 0: 5818.4. Samples: 285185552. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:07,554][25689] Avg episode reward: [(0, '-47.254')] [2022-07-09 13:58:07,559][26022] Updated weights on worker 0-0, policy_version 278494 (0.00084) [2022-07-09 13:58:09,182][26022] Updated weights on worker 0-0, policy_version 278504 (0.00099) [2022-07-09 13:58:11,098][26022] Updated weights on worker 0-0, policy_version 278514 (0.00081) [2022-07-09 13:58:12,603][25689] Fps is (10 sec: 5596.3, 60 sec: 5644.0, 300 sec: 5627.0). Total num frames: 285205504. Throughput: 0: 4972.4. Samples: 285202610. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:12,603][25689] Avg episode reward: [(0, '-47.681')] [2022-07-09 13:58:13,010][26022] Updated weights on worker 0-0, policy_version 278524 (0.00089) [2022-07-09 13:58:14,747][26022] Updated weights on worker 0-0, policy_version 278534 (0.00096) [2022-07-09 13:58:16,561][26022] Updated weights on worker 0-0, policy_version 278544 (0.00088) [2022-07-09 13:58:17,622][25689] Fps is (10 sec: 5593.2, 60 sec: 5629.2, 300 sec: 5624.4). Total num frames: 285234176. Throughput: 0: 5799.7. Samples: 285236410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:17,622][25689] Avg episode reward: [(0, '-47.469')] [2022-07-09 13:58:18,342][26022] Updated weights on worker 0-0, policy_version 278554 (0.00087) [2022-07-09 13:58:20,031][26022] Updated weights on worker 0-0, policy_version 278564 (0.00088) [2022-07-09 13:58:22,136][26022] Updated weights on worker 0-0, policy_version 278574 (0.00089) [2022-07-09 13:58:22,742][25689] Fps is (10 sec: 5756.4, 60 sec: 5642.9, 300 sec: 5629.4). Total num frames: 285263872. Throughput: 0: 5800.9. Samples: 285270732. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:22,743][25689] Avg episode reward: [(0, '-47.496')] [2022-07-09 13:58:23,904][26022] Updated weights on worker 0-0, policy_version 278584 (0.00089) [2022-07-09 13:58:25,279][26022] Updated weights on worker 0-0, policy_version 278594 (0.00088) [2022-07-09 13:58:27,679][26022] Updated weights on worker 0-0, policy_version 278604 (0.00082) [2022-07-09 13:58:27,757][25689] Fps is (10 sec: 5657.7, 60 sec: 5625.0, 300 sec: 5622.8). Total num frames: 285291520. Throughput: 0: 5053.0. Samples: 285287648. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:27,758][25689] Avg episode reward: [(0, '-48.134')] [2022-07-09 13:58:29,059][26022] Updated weights on worker 0-0, policy_version 278614 (0.00093) [2022-07-09 13:58:31,170][26022] Updated weights on worker 0-0, policy_version 278624 (0.00097) [2022-07-09 13:58:32,763][25689] Fps is (10 sec: 5620.4, 60 sec: 5627.2, 300 sec: 5627.8). Total num frames: 285320192. Throughput: 0: 5898.1. Samples: 285321518. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:32,764][25689] Avg episode reward: [(0, '-47.902')] [2022-07-09 13:58:32,815][26022] Updated weights on worker 0-0, policy_version 278634 (0.00085) [2022-07-09 13:58:34,490][26022] Updated weights on worker 0-0, policy_version 278644 (0.00097) [2022-07-09 13:58:36,674][26022] Updated weights on worker 0-0, policy_version 278654 (0.00090) [2022-07-09 13:58:37,798][25689] Fps is (10 sec: 5813.0, 60 sec: 5649.2, 300 sec: 5631.8). Total num frames: 285349888. Throughput: 0: 5903.9. Samples: 285355530. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:37,799][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 13:58:38,090][26022] Updated weights on worker 0-0, policy_version 278664 (0.00093) [2022-07-09 13:58:40,103][26022] Updated weights on worker 0-0, policy_version 278674 (0.00091) [2022-07-09 13:58:41,731][26022] Updated weights on worker 0-0, policy_version 278684 (0.00091) [2022-07-09 13:58:42,866][25689] Fps is (10 sec: 5675.7, 60 sec: 5636.5, 300 sec: 5624.6). Total num frames: 285377536. Throughput: 0: 5056.5. Samples: 285372490. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:42,867][25689] Avg episode reward: [(0, '-46.651')] [2022-07-09 13:58:43,708][26022] Updated weights on worker 0-0, policy_version 278694 (0.00087) [2022-07-09 13:58:45,701][26022] Updated weights on worker 0-0, policy_version 278704 (0.00093) [2022-07-09 13:58:47,239][26022] Updated weights on worker 0-0, policy_version 278714 (0.00088) [2022-07-09 13:58:47,901][25689] Fps is (10 sec: 5574.6, 60 sec: 5618.6, 300 sec: 5628.3). Total num frames: 285406208. Throughput: 0: 5904.8. Samples: 285406594. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:47,902][25689] Avg episode reward: [(0, '-46.718')] [2022-07-09 13:58:49,010][26022] Updated weights on worker 0-0, policy_version 278724 (0.00054) [2022-07-09 13:58:51,145][26022] Updated weights on worker 0-0, policy_version 278734 (0.00095) [2022-07-09 13:58:52,679][26022] Updated weights on worker 0-0, policy_version 278744 (0.00093) [2022-07-09 13:58:52,965][25689] Fps is (10 sec: 5678.2, 60 sec: 5651.5, 300 sec: 5628.0). Total num frames: 285434880. Throughput: 0: 5895.8. Samples: 285440628. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:52,966][25689] Avg episode reward: [(0, '-46.282')] [2022-07-09 13:58:54,528][26022] Updated weights on worker 0-0, policy_version 278754 (0.00520) [2022-07-09 13:58:56,448][26022] Updated weights on worker 0-0, policy_version 278764 (0.00086) [2022-07-09 13:58:57,939][26022] Updated weights on worker 0-0, policy_version 278774 (0.00098) [2022-07-09 13:58:57,980][25689] Fps is (10 sec: 5791.4, 60 sec: 5652.6, 300 sec: 5632.0). Total num frames: 285464576. Throughput: 0: 5068.4. Samples: 285457818. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:58:57,980][25689] Avg episode reward: [(0, '-46.563')] [2022-07-09 13:59:00,091][26022] Updated weights on worker 0-0, policy_version 278784 (0.00086) [2022-07-09 13:59:02,209][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 13:59:02,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000278794_285485056.pth [2022-07-09 13:59:02,227][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000276813_283456512.pth [2022-07-09 13:59:02,232][26022] Updated weights on worker 0-0, policy_version 278794 (0.00091) [2022-07-09 13:59:03,091][25689] Fps is (10 sec: 5359.9, 60 sec: 5614.5, 300 sec: 5619.9). Total num frames: 285489152. Throughput: 0: 5911.1. Samples: 285492040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:03,092][25689] Avg episode reward: [(0, '-47.209')] [2022-07-09 13:59:03,879][26022] Updated weights on worker 0-0, policy_version 278804 (0.00092) [2022-07-09 13:59:05,867][26022] Updated weights on worker 0-0, policy_version 278814 (0.00083) [2022-07-09 13:59:07,477][26022] Updated weights on worker 0-0, policy_version 278824 (0.00090) [2022-07-09 13:59:08,129][25689] Fps is (10 sec: 5347.6, 60 sec: 5629.2, 300 sec: 5629.7). Total num frames: 285518848. Throughput: 0: 5808.6. Samples: 285524086. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:08,131][25689] Avg episode reward: [(0, '-47.442')] [2022-07-09 13:59:09,557][26022] Updated weights on worker 0-0, policy_version 278834 (0.00091) [2022-07-09 13:59:11,197][26022] Updated weights on worker 0-0, policy_version 278844 (0.00083) [2022-07-09 13:59:12,894][26022] Updated weights on worker 0-0, policy_version 278854 (0.00095) [2022-07-09 13:59:13,176][25689] Fps is (10 sec: 5787.7, 60 sec: 5646.3, 300 sec: 5632.3). Total num frames: 285547520. Throughput: 0: 4964.7. Samples: 285540966. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:13,176][25689] Avg episode reward: [(0, '-48.036')] [2022-07-09 13:59:14,740][26022] Updated weights on worker 0-0, policy_version 278864 (0.00085) [2022-07-09 13:59:16,455][26022] Updated weights on worker 0-0, policy_version 278874 (0.00088) [2022-07-09 13:59:18,207][25689] Fps is (10 sec: 5690.0, 60 sec: 5645.3, 300 sec: 5626.4). Total num frames: 285576192. Throughput: 0: 5816.8. Samples: 285575474. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:18,207][25689] Avg episode reward: [(0, '-47.933')] [2022-07-09 13:59:18,224][26022] Updated weights on worker 0-0, policy_version 278884 (0.00087) [2022-07-09 13:59:20,334][26022] Updated weights on worker 0-0, policy_version 278894 (0.00081) [2022-07-09 13:59:21,820][26022] Updated weights on worker 0-0, policy_version 278904 (0.00097) [2022-07-09 13:59:23,259][25689] Fps is (10 sec: 5686.9, 60 sec: 5634.6, 300 sec: 5633.7). Total num frames: 285604864. Throughput: 0: 5837.4. Samples: 285609770. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:23,260][25689] Avg episode reward: [(0, '-47.523')] [2022-07-09 13:59:23,739][26022] Updated weights on worker 0-0, policy_version 278914 (0.00087) [2022-07-09 13:59:25,469][26022] Updated weights on worker 0-0, policy_version 278924 (0.00091) [2022-07-09 13:59:27,154][26022] Updated weights on worker 0-0, policy_version 278934 (0.00090) [2022-07-09 13:59:28,273][25689] Fps is (10 sec: 5696.8, 60 sec: 5651.7, 300 sec: 5634.2). Total num frames: 285633536. Throughput: 0: 5100.3. Samples: 285626826. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:28,273][25689] Avg episode reward: [(0, '-46.260')] [2022-07-09 13:59:29,135][26022] Updated weights on worker 0-0, policy_version 278944 (0.00084) [2022-07-09 13:59:30,971][26022] Updated weights on worker 0-0, policy_version 278954 (0.00087) [2022-07-09 13:59:32,808][26022] Updated weights on worker 0-0, policy_version 278964 (0.00088) [2022-07-09 13:59:33,281][25689] Fps is (10 sec: 5721.8, 60 sec: 5651.4, 300 sec: 5634.8). Total num frames: 285662208. Throughput: 0: 5962.1. Samples: 285660838. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:33,282][25689] Avg episode reward: [(0, '-46.619')] [2022-07-09 13:59:34,492][26022] Updated weights on worker 0-0, policy_version 278974 (0.00086) [2022-07-09 13:59:36,291][26022] Updated weights on worker 0-0, policy_version 278984 (0.00084) [2022-07-09 13:59:37,965][26022] Updated weights on worker 0-0, policy_version 278994 (0.00092) [2022-07-09 13:59:38,301][25689] Fps is (10 sec: 5616.2, 60 sec: 5619.1, 300 sec: 5635.5). Total num frames: 285689856. Throughput: 0: 5939.6. Samples: 285694824. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:38,301][25689] Avg episode reward: [(0, '-46.428')] [2022-07-09 13:59:39,987][26022] Updated weights on worker 0-0, policy_version 279004 (0.00087) [2022-07-09 13:59:41,805][26022] Updated weights on worker 0-0, policy_version 279014 (0.00087) [2022-07-09 13:59:43,419][25689] Fps is (10 sec: 5555.6, 60 sec: 5631.3, 300 sec: 5629.9). Total num frames: 285718528. Throughput: 0: 5898.1. Samples: 285728672. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:43,419][25689] Avg episode reward: [(0, '-45.681')] [2022-07-09 13:59:43,745][26022] Updated weights on worker 0-0, policy_version 279024 (0.00092) [2022-07-09 13:59:45,396][26022] Updated weights on worker 0-0, policy_version 279034 (0.00088) [2022-07-09 13:59:47,290][26022] Updated weights on worker 0-0, policy_version 279044 (0.00096) [2022-07-09 13:59:48,454][25689] Fps is (10 sec: 5748.6, 60 sec: 5648.2, 300 sec: 5636.2). Total num frames: 285748224. Throughput: 0: 5886.6. Samples: 285745626. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:48,455][25689] Avg episode reward: [(0, '-46.138')] [2022-07-09 13:59:48,976][26022] Updated weights on worker 0-0, policy_version 279054 (0.00094) [2022-07-09 13:59:50,727][26022] Updated weights on worker 0-0, policy_version 279064 (0.00090) [2022-07-09 13:59:52,758][26022] Updated weights on worker 0-0, policy_version 279074 (0.00084) [2022-07-09 13:59:53,479][25689] Fps is (10 sec: 5700.3, 60 sec: 5635.0, 300 sec: 5633.3). Total num frames: 285775872. Throughput: 0: 5872.8. Samples: 285779454. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:53,479][25689] Avg episode reward: [(0, '-47.303')] [2022-07-09 13:59:54,411][26022] Updated weights on worker 0-0, policy_version 279084 (0.00095) [2022-07-09 13:59:56,129][26022] Updated weights on worker 0-0, policy_version 279094 (0.00086) [2022-07-09 13:59:58,145][26022] Updated weights on worker 0-0, policy_version 279104 (0.00086) [2022-07-09 13:59:58,539][25689] Fps is (10 sec: 5381.5, 60 sec: 5580.0, 300 sec: 5629.7). Total num frames: 285802496. Throughput: 0: 5870.6. Samples: 285813636. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 13:59:58,540][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 13:59:59,866][26022] Updated weights on worker 0-0, policy_version 279114 (0.00086) [2022-07-09 14:00:02,205][26022] Updated weights on worker 0-0, policy_version 279124 (0.00091) [2022-07-09 14:00:03,612][25689] Fps is (10 sec: 5457.0, 60 sec: 5651.2, 300 sec: 5628.8). Total num frames: 285831168. Throughput: 0: 5004.7. Samples: 285829732. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 14:00:03,612][25689] Avg episode reward: [(0, '-48.187')] [2022-07-09 14:00:03,979][26022] Updated weights on worker 0-0, policy_version 279134 (0.00089) [2022-07-09 14:00:05,771][26022] Updated weights on worker 0-0, policy_version 279144 (0.00089) [2022-07-09 14:00:07,563][26022] Updated weights on worker 0-0, policy_version 279154 (0.00088) [2022-07-09 14:00:08,686][25689] Fps is (10 sec: 5651.5, 60 sec: 5630.9, 300 sec: 5631.2). Total num frames: 285859840. Throughput: 0: 5769.6. Samples: 285862354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 14:00:08,686][25689] Avg episode reward: [(0, '-48.773')] [2022-07-09 14:00:09,340][26022] Updated weights on worker 0-0, policy_version 279164 (0.00094) [2022-07-09 14:00:11,164][26022] Updated weights on worker 0-0, policy_version 279174 (0.00085) [2022-07-09 14:00:13,006][26022] Updated weights on worker 0-0, policy_version 279184 (0.00089) [2022-07-09 14:00:13,697][25689] Fps is (10 sec: 5584.4, 60 sec: 5617.3, 300 sec: 5634.7). Total num frames: 285887488. Throughput: 0: 5785.3. Samples: 285896424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 14:00:13,698][25689] Avg episode reward: [(0, '-48.428')] [2022-07-09 14:00:14,709][26022] Updated weights on worker 0-0, policy_version 279194 (0.00085) [2022-07-09 14:00:16,639][26022] Updated weights on worker 0-0, policy_version 279204 (0.00088) [2022-07-09 14:00:18,239][26022] Updated weights on worker 0-0, policy_version 279214 (0.00088) [2022-07-09 14:00:18,719][25689] Fps is (10 sec: 5715.2, 60 sec: 5635.1, 300 sec: 5631.7). Total num frames: 285917184. Throughput: 0: 4949.5. Samples: 285913518. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 14:00:18,720][25689] Avg episode reward: [(0, '-49.114')] [2022-07-09 14:00:20,380][26022] Updated weights on worker 0-0, policy_version 279224 (0.00085) [2022-07-09 14:00:21,971][26022] Updated weights on worker 0-0, policy_version 279234 (0.00087) [2022-07-09 14:00:23,785][25689] Fps is (10 sec: 5684.5, 60 sec: 5616.9, 300 sec: 5630.5). Total num frames: 285944832. Throughput: 0: 5862.0. Samples: 285947986. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 14:00:23,785][25689] Avg episode reward: [(0, '-48.672')] [2022-07-09 14:00:23,887][26022] Updated weights on worker 0-0, policy_version 279244 (0.00088) [2022-07-09 14:00:25,549][26022] Updated weights on worker 0-0, policy_version 279254 (0.00090) [2022-07-09 14:00:27,460][26022] Updated weights on worker 0-0, policy_version 279264 (0.00086) [2022-07-09 14:00:28,799][25689] Fps is (10 sec: 5689.1, 60 sec: 5633.8, 300 sec: 5640.9). Total num frames: 285974528. Throughput: 0: 5954.3. Samples: 285982114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:28,799][25689] Avg episode reward: [(0, '-49.088')] [2022-07-09 14:00:29,188][26022] Updated weights on worker 0-0, policy_version 279274 (0.00089) [2022-07-09 14:00:31,102][26022] Updated weights on worker 0-0, policy_version 279284 (0.00088) [2022-07-09 14:00:32,869][26022] Updated weights on worker 0-0, policy_version 279294 (0.00087) [2022-07-09 14:00:33,808][25689] Fps is (10 sec: 5721.0, 60 sec: 5616.8, 300 sec: 5634.0). Total num frames: 286002176. Throughput: 0: 5101.9. Samples: 285999028. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:33,809][25689] Avg episode reward: [(0, '-48.685')] [2022-07-09 14:00:34,743][26022] Updated weights on worker 0-0, policy_version 279304 (0.00109) [2022-07-09 14:00:36,458][26022] Updated weights on worker 0-0, policy_version 279314 (0.00399) [2022-07-09 14:00:38,235][26022] Updated weights on worker 0-0, policy_version 279324 (0.00089) [2022-07-09 14:00:38,839][25689] Fps is (10 sec: 5609.6, 60 sec: 5632.7, 300 sec: 5632.3). Total num frames: 286030848. Throughput: 0: 5946.2. Samples: 286033152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:38,839][25689] Avg episode reward: [(0, '-49.137')] [2022-07-09 14:00:40,063][26022] Updated weights on worker 0-0, policy_version 279334 (0.00094) [2022-07-09 14:00:41,928][26022] Updated weights on worker 0-0, policy_version 279344 (0.00109) [2022-07-09 14:00:43,561][26022] Updated weights on worker 0-0, policy_version 279354 (0.00091) [2022-07-09 14:00:44,003][25689] Fps is (10 sec: 5725.6, 60 sec: 5645.4, 300 sec: 5636.5). Total num frames: 286060544. Throughput: 0: 5886.7. Samples: 286067000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:44,003][25689] Avg episode reward: [(0, '-49.451')] [2022-07-09 14:00:45,643][26022] Updated weights on worker 0-0, policy_version 279364 (0.00076) [2022-07-09 14:00:47,340][26022] Updated weights on worker 0-0, policy_version 279374 (0.00110) [2022-07-09 14:00:49,078][25689] Fps is (10 sec: 5600.5, 60 sec: 5607.8, 300 sec: 5635.2). Total num frames: 286088192. Throughput: 0: 5027.4. Samples: 286084064. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:49,078][25689] Avg episode reward: [(0, '-49.421')] [2022-07-09 14:00:49,266][26022] Updated weights on worker 0-0, policy_version 279384 (0.00090) [2022-07-09 14:00:50,855][26022] Updated weights on worker 0-0, policy_version 279394 (0.00106) [2022-07-09 14:00:52,773][26022] Updated weights on worker 0-0, policy_version 279404 (0.00087) [2022-07-09 14:00:54,088][25689] Fps is (10 sec: 5584.3, 60 sec: 5626.1, 300 sec: 5631.8). Total num frames: 286116864. Throughput: 0: 5871.5. Samples: 286118100. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:54,089][25689] Avg episode reward: [(0, '-49.018')] [2022-07-09 14:00:54,320][26022] Updated weights on worker 0-0, policy_version 279414 (0.00103) [2022-07-09 14:00:56,397][26022] Updated weights on worker 0-0, policy_version 279424 (0.00090) [2022-07-09 14:00:58,140][26022] Updated weights on worker 0-0, policy_version 279434 (0.00096) [2022-07-09 14:00:59,134][25689] Fps is (10 sec: 5702.2, 60 sec: 5661.2, 300 sec: 5636.5). Total num frames: 286145536. Throughput: 0: 5853.1. Samples: 286151944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:00:59,135][25689] Avg episode reward: [(0, '-49.137')] [2022-07-09 14:00:59,837][26022] Updated weights on worker 0-0, policy_version 279444 (0.00086) [2022-07-09 14:01:01,902][26022] Updated weights on worker 0-0, policy_version 279454 (0.00094) [2022-07-09 14:01:02,489][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:01:02,499][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000279456_286162944.pth [2022-07-09 14:01:02,500][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000277472_284131328.pth [2022-07-09 14:01:04,201][26022] Updated weights on worker 0-0, policy_version 279464 (0.00086) [2022-07-09 14:01:04,215][25689] Fps is (10 sec: 5258.1, 60 sec: 5592.9, 300 sec: 5628.2). Total num frames: 286170112. Throughput: 0: 5035.9. Samples: 286168786. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:04,215][25689] Avg episode reward: [(0, '-48.435')] [2022-07-09 14:01:05,756][26022] Updated weights on worker 0-0, policy_version 279474 (0.00093) [2022-07-09 14:01:07,846][26022] Updated weights on worker 0-0, policy_version 279484 (0.00091) [2022-07-09 14:01:09,223][25689] Fps is (10 sec: 5582.8, 60 sec: 5649.7, 300 sec: 5639.1). Total num frames: 286201856. Throughput: 0: 5804.5. Samples: 286200994. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:09,223][25689] Avg episode reward: [(0, '-48.065')] [2022-07-09 14:01:09,235][26022] Updated weights on worker 0-0, policy_version 279494 (0.00088) [2022-07-09 14:01:11,228][26022] Updated weights on worker 0-0, policy_version 279504 (0.00083) [2022-07-09 14:01:12,831][26022] Updated weights on worker 0-0, policy_version 279514 (0.00085) [2022-07-09 14:01:14,254][25689] Fps is (10 sec: 5814.0, 60 sec: 5631.0, 300 sec: 5632.0). Total num frames: 286228480. Throughput: 0: 5800.6. Samples: 286235074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:14,254][25689] Avg episode reward: [(0, '-48.008')] [2022-07-09 14:01:14,736][26022] Updated weights on worker 0-0, policy_version 279524 (0.00091) [2022-07-09 14:01:16,496][26022] Updated weights on worker 0-0, policy_version 279534 (0.00085) [2022-07-09 14:01:18,322][26022] Updated weights on worker 0-0, policy_version 279544 (0.00087) [2022-07-09 14:01:19,264][25689] Fps is (10 sec: 5506.6, 60 sec: 5615.1, 300 sec: 5632.6). Total num frames: 286257152. Throughput: 0: 4988.1. Samples: 286252354. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:19,266][25689] Avg episode reward: [(0, '-47.582')] [2022-07-09 14:01:20,185][26022] Updated weights on worker 0-0, policy_version 279554 (0.00085) [2022-07-09 14:01:21,872][26022] Updated weights on worker 0-0, policy_version 279564 (0.00080) [2022-07-09 14:01:23,752][26022] Updated weights on worker 0-0, policy_version 279574 (0.00986) [2022-07-09 14:01:24,369][25689] Fps is (10 sec: 5770.5, 60 sec: 5645.3, 300 sec: 5634.2). Total num frames: 286286848. Throughput: 0: 5835.4. Samples: 286286392. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:24,370][25689] Avg episode reward: [(0, '-48.117')] [2022-07-09 14:01:25,432][26022] Updated weights on worker 0-0, policy_version 279584 (0.00085) [2022-07-09 14:01:27,187][26022] Updated weights on worker 0-0, policy_version 279594 (0.00089) [2022-07-09 14:01:29,134][26022] Updated weights on worker 0-0, policy_version 279604 (0.00088) [2022-07-09 14:01:29,423][25689] Fps is (10 sec: 5745.7, 60 sec: 5624.7, 300 sec: 5633.4). Total num frames: 286315520. Throughput: 0: 5925.3. Samples: 286320686. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:29,423][25689] Avg episode reward: [(0, '-48.002')] [2022-07-09 14:01:31,099][26022] Updated weights on worker 0-0, policy_version 279614 (0.00091) [2022-07-09 14:01:32,810][26022] Updated weights on worker 0-0, policy_version 279624 (0.00087) [2022-07-09 14:01:34,444][25689] Fps is (10 sec: 5590.0, 60 sec: 5623.6, 300 sec: 5630.5). Total num frames: 286343168. Throughput: 0: 5080.2. Samples: 286337642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:34,451][25689] Avg episode reward: [(0, '-47.742')] [2022-07-09 14:01:34,609][26022] Updated weights on worker 0-0, policy_version 279634 (0.00087) [2022-07-09 14:01:36,385][26022] Updated weights on worker 0-0, policy_version 279644 (0.00493) [2022-07-09 14:01:38,273][26022] Updated weights on worker 0-0, policy_version 279654 (0.00084) [2022-07-09 14:01:39,460][25689] Fps is (10 sec: 5611.3, 60 sec: 5625.0, 300 sec: 5632.9). Total num frames: 286371840. Throughput: 0: 5916.1. Samples: 286371832. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:39,460][25689] Avg episode reward: [(0, '-48.156')] [2022-07-09 14:01:39,914][26022] Updated weights on worker 0-0, policy_version 279664 (0.00081) [2022-07-09 14:01:41,934][26022] Updated weights on worker 0-0, policy_version 279674 (0.00083) [2022-07-09 14:01:43,385][26022] Updated weights on worker 0-0, policy_version 279684 (0.00087) [2022-07-09 14:01:44,537][25689] Fps is (10 sec: 5782.8, 60 sec: 5633.0, 300 sec: 5638.7). Total num frames: 286401536. Throughput: 0: 5920.6. Samples: 286405802. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:44,538][25689] Avg episode reward: [(0, '-48.975')] [2022-07-09 14:01:45,582][26022] Updated weights on worker 0-0, policy_version 279694 (0.00079) [2022-07-09 14:01:47,052][26022] Updated weights on worker 0-0, policy_version 279704 (0.00094) [2022-07-09 14:01:49,063][26022] Updated weights on worker 0-0, policy_version 279714 (0.00092) [2022-07-09 14:01:49,578][25689] Fps is (10 sec: 5768.7, 60 sec: 5653.2, 300 sec: 5641.7). Total num frames: 286430208. Throughput: 0: 5072.4. Samples: 286422920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:49,578][25689] Avg episode reward: [(0, '-48.496')] [2022-07-09 14:01:50,736][26022] Updated weights on worker 0-0, policy_version 279724 (0.00081) [2022-07-09 14:01:52,586][26022] Updated weights on worker 0-0, policy_version 279734 (0.00061) [2022-07-09 14:01:54,503][26022] Updated weights on worker 0-0, policy_version 279744 (0.00093) [2022-07-09 14:01:54,599][25689] Fps is (10 sec: 5597.6, 60 sec: 5635.2, 300 sec: 5629.1). Total num frames: 286457856. Throughput: 0: 5918.5. Samples: 286456928. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:54,599][25689] Avg episode reward: [(0, '-48.168')] [2022-07-09 14:01:56,359][26022] Updated weights on worker 0-0, policy_version 279754 (0.00084) [2022-07-09 14:01:57,866][26022] Updated weights on worker 0-0, policy_version 279764 (0.00081) [2022-07-09 14:01:59,677][25689] Fps is (10 sec: 5475.1, 60 sec: 5615.3, 300 sec: 5632.6). Total num frames: 286485504. Throughput: 0: 5890.0. Samples: 286490914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:01:59,678][25689] Avg episode reward: [(0, '-48.047')] [2022-07-09 14:02:00,067][26022] Updated weights on worker 0-0, policy_version 279774 (0.00087) [2022-07-09 14:02:01,695][26022] Updated weights on worker 0-0, policy_version 279784 (0.00097) [2022-07-09 14:02:03,934][26022] Updated weights on worker 0-0, policy_version 279794 (0.00094) [2022-07-09 14:02:04,797][25689] Fps is (10 sec: 5522.2, 60 sec: 5679.2, 300 sec: 5638.5). Total num frames: 286514176. Throughput: 0: 5774.9. Samples: 286522804. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:04,798][25689] Avg episode reward: [(0, '-48.027')] [2022-07-09 14:02:05,434][26022] Updated weights on worker 0-0, policy_version 279804 (0.00084) [2022-07-09 14:02:07,458][26022] Updated weights on worker 0-0, policy_version 279814 (0.00088) [2022-07-09 14:02:09,491][26022] Updated weights on worker 0-0, policy_version 279824 (0.00086) [2022-07-09 14:02:09,824][25689] Fps is (10 sec: 5550.6, 60 sec: 5609.9, 300 sec: 5635.4). Total num frames: 286541824. Throughput: 0: 5781.0. Samples: 286539964. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:09,824][25689] Avg episode reward: [(0, '-47.251')] [2022-07-09 14:02:10,984][26022] Updated weights on worker 0-0, policy_version 279834 (0.00094) [2022-07-09 14:02:12,926][26022] Updated weights on worker 0-0, policy_version 279844 (0.00094) [2022-07-09 14:02:14,711][26022] Updated weights on worker 0-0, policy_version 279854 (0.00083) [2022-07-09 14:02:14,839][25689] Fps is (10 sec: 5710.6, 60 sec: 5662.1, 300 sec: 5635.9). Total num frames: 286571520. Throughput: 0: 5787.3. Samples: 286574066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:14,840][25689] Avg episode reward: [(0, '-46.160')] [2022-07-09 14:02:16,469][26022] Updated weights on worker 0-0, policy_version 279864 (0.00081) [2022-07-09 14:02:18,439][26022] Updated weights on worker 0-0, policy_version 279874 (0.00086) [2022-07-09 14:02:19,844][25689] Fps is (10 sec: 5825.2, 60 sec: 5662.6, 300 sec: 5637.5). Total num frames: 286600192. Throughput: 0: 5810.8. Samples: 286608098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:19,851][25689] Avg episode reward: [(0, '-46.570')] [2022-07-09 14:02:20,151][26022] Updated weights on worker 0-0, policy_version 279884 (0.00084) [2022-07-09 14:02:21,983][26022] Updated weights on worker 0-0, policy_version 279894 (0.00084) [2022-07-09 14:02:23,602][26022] Updated weights on worker 0-0, policy_version 279904 (0.00091) [2022-07-09 14:02:24,911][25689] Fps is (10 sec: 5592.0, 60 sec: 5632.3, 300 sec: 5632.8). Total num frames: 286627840. Throughput: 0: 5093.2. Samples: 286625244. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:24,911][25689] Avg episode reward: [(0, '-46.121')] [2022-07-09 14:02:25,601][26022] Updated weights on worker 0-0, policy_version 279914 (0.00086) [2022-07-09 14:02:27,368][26022] Updated weights on worker 0-0, policy_version 279924 (0.00091) [2022-07-09 14:02:29,272][26022] Updated weights on worker 0-0, policy_version 279934 (0.00092) [2022-07-09 14:02:29,943][25689] Fps is (10 sec: 5576.9, 60 sec: 5634.4, 300 sec: 5632.8). Total num frames: 286656512. Throughput: 0: 5922.0. Samples: 286659108. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:29,943][25689] Avg episode reward: [(0, '-45.423')] [2022-07-09 14:02:30,703][26022] Updated weights on worker 0-0, policy_version 279944 (0.00089) [2022-07-09 14:02:32,814][26022] Updated weights on worker 0-0, policy_version 279954 (0.00089) [2022-07-09 14:02:34,423][26022] Updated weights on worker 0-0, policy_version 279964 (0.00092) [2022-07-09 14:02:35,007][25689] Fps is (10 sec: 5578.4, 60 sec: 5630.4, 300 sec: 5629.8). Total num frames: 286684160. Throughput: 0: 5900.0. Samples: 286693054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:35,007][25689] Avg episode reward: [(0, '-45.464')] [2022-07-09 14:02:36,551][26022] Updated weights on worker 0-0, policy_version 279974 (0.00091) [2022-07-09 14:02:38,151][26022] Updated weights on worker 0-0, policy_version 279984 (0.00092) [2022-07-09 14:02:40,038][25689] Fps is (10 sec: 5579.0, 60 sec: 5629.0, 300 sec: 5631.4). Total num frames: 286712832. Throughput: 0: 5051.0. Samples: 286710098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:40,040][25689] Avg episode reward: [(0, '-46.352')] [2022-07-09 14:02:40,081][26022] Updated weights on worker 0-0, policy_version 279994 (0.00090) [2022-07-09 14:02:41,755][26022] Updated weights on worker 0-0, policy_version 280004 (0.00079) [2022-07-09 14:02:43,527][26022] Updated weights on worker 0-0, policy_version 280014 (0.00081) [2022-07-09 14:02:45,149][25689] Fps is (10 sec: 5754.8, 60 sec: 5625.8, 300 sec: 5629.7). Total num frames: 286742528. Throughput: 0: 5889.7. Samples: 286744444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 14:02:45,150][25689] Avg episode reward: [(0, '-46.686')] [2022-07-09 14:02:45,358][26022] Updated weights on worker 0-0, policy_version 280024 (0.00084) [2022-07-09 14:02:47,055][26022] Updated weights on worker 0-0, policy_version 280034 (0.00088) [2022-07-09 14:02:49,036][26022] Updated weights on worker 0-0, policy_version 280044 (0.00090) [2022-07-09 14:02:50,234][25689] Fps is (10 sec: 5623.8, 60 sec: 5604.8, 300 sec: 5632.6). Total num frames: 286770176. Throughput: 0: 5868.7. Samples: 286778194. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:02:50,236][25689] Avg episode reward: [(0, '-45.893')] [2022-07-09 14:02:50,817][26022] Updated weights on worker 0-0, policy_version 280054 (0.00087) [2022-07-09 14:02:52,720][26022] Updated weights on worker 0-0, policy_version 280064 (0.00090) [2022-07-09 14:02:54,358][26022] Updated weights on worker 0-0, policy_version 280074 (0.00089) [2022-07-09 14:02:55,261][25689] Fps is (10 sec: 5670.7, 60 sec: 5638.0, 300 sec: 5632.6). Total num frames: 286799872. Throughput: 0: 5037.0. Samples: 286795076. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:02:55,267][25689] Avg episode reward: [(0, '-45.876')] [2022-07-09 14:02:56,326][26022] Updated weights on worker 0-0, policy_version 280084 (0.00089) [2022-07-09 14:02:57,903][26022] Updated weights on worker 0-0, policy_version 280094 (0.00082) [2022-07-09 14:02:59,822][26022] Updated weights on worker 0-0, policy_version 280104 (0.00088) [2022-07-09 14:03:00,317][25689] Fps is (10 sec: 5687.2, 60 sec: 5640.1, 300 sec: 5636.2). Total num frames: 286827520. Throughput: 0: 5888.0. Samples: 286829504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:00,319][25689] Avg episode reward: [(0, '-46.499')] [2022-07-09 14:03:01,726][26022] Updated weights on worker 0-0, policy_version 280114 (0.00093) [2022-07-09 14:03:02,633][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:03:02,643][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000280117_286839808.pth [2022-07-09 14:03:02,653][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000278132_284807168.pth [2022-07-09 14:03:02,654][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000280117_286839808.pth.milestone [2022-07-09 14:03:03,847][26022] Updated weights on worker 0-0, policy_version 280124 (0.00484) [2022-07-09 14:03:05,420][25689] Fps is (10 sec: 5443.2, 60 sec: 5624.8, 300 sec: 5631.1). Total num frames: 286855168. Throughput: 0: 5761.8. Samples: 286861242. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:05,421][25689] Avg episode reward: [(0, '-46.574')] [2022-07-09 14:03:05,707][26022] Updated weights on worker 0-0, policy_version 280134 (0.00095) [2022-07-09 14:03:07,498][26022] Updated weights on worker 0-0, policy_version 280144 (0.00086) [2022-07-09 14:03:09,210][26022] Updated weights on worker 0-0, policy_version 280154 (0.00093) [2022-07-09 14:03:10,467][25689] Fps is (10 sec: 5549.1, 60 sec: 5639.8, 300 sec: 5634.6). Total num frames: 286883840. Throughput: 0: 4945.3. Samples: 286878250. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:10,468][25689] Avg episode reward: [(0, '-46.061')] [2022-07-09 14:03:11,018][26022] Updated weights on worker 0-0, policy_version 280164 (0.00087) [2022-07-09 14:03:12,873][26022] Updated weights on worker 0-0, policy_version 280174 (0.00083) [2022-07-09 14:03:14,660][26022] Updated weights on worker 0-0, policy_version 280184 (0.00109) [2022-07-09 14:03:15,471][25689] Fps is (10 sec: 5705.5, 60 sec: 5624.0, 300 sec: 5634.9). Total num frames: 286912512. Throughput: 0: 5801.7. Samples: 286912324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:15,473][25689] Avg episode reward: [(0, '-45.578')] [2022-07-09 14:03:16,580][26022] Updated weights on worker 0-0, policy_version 280194 (0.00094) [2022-07-09 14:03:18,170][26022] Updated weights on worker 0-0, policy_version 280204 (0.00087) [2022-07-09 14:03:20,297][26022] Updated weights on worker 0-0, policy_version 280214 (0.00076) [2022-07-09 14:03:20,522][25689] Fps is (10 sec: 5601.3, 60 sec: 5602.8, 300 sec: 5629.3). Total num frames: 286940160. Throughput: 0: 5784.0. Samples: 286946366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:20,523][25689] Avg episode reward: [(0, '-46.473')] [2022-07-09 14:03:21,755][26022] Updated weights on worker 0-0, policy_version 280224 (0.00082) [2022-07-09 14:03:23,715][26022] Updated weights on worker 0-0, policy_version 280234 (0.00086) [2022-07-09 14:03:25,617][25689] Fps is (10 sec: 5550.8, 60 sec: 5617.1, 300 sec: 5631.2). Total num frames: 286968832. Throughput: 0: 5066.9. Samples: 286963578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:25,618][25689] Avg episode reward: [(0, '-46.398')] [2022-07-09 14:03:25,619][26022] Updated weights on worker 0-0, policy_version 280244 (0.00087) [2022-07-09 14:03:27,254][26022] Updated weights on worker 0-0, policy_version 280254 (0.00086) [2022-07-09 14:03:29,443][26022] Updated weights on worker 0-0, policy_version 280264 (0.00086) [2022-07-09 14:03:30,633][25689] Fps is (10 sec: 5873.6, 60 sec: 5652.3, 300 sec: 5637.9). Total num frames: 286999552. Throughput: 0: 5916.5. Samples: 286997566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:30,634][25689] Avg episode reward: [(0, '-46.494')] [2022-07-09 14:03:30,800][26022] Updated weights on worker 0-0, policy_version 280274 (0.00092) [2022-07-09 14:03:32,830][26022] Updated weights on worker 0-0, policy_version 280284 (0.00094) [2022-07-09 14:03:34,529][26022] Updated weights on worker 0-0, policy_version 280294 (0.00090) [2022-07-09 14:03:35,689][25689] Fps is (10 sec: 5693.6, 60 sec: 5636.2, 300 sec: 5627.2). Total num frames: 287026176. Throughput: 0: 5906.5. Samples: 287031742. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:35,689][25689] Avg episode reward: [(0, '-46.705')] [2022-07-09 14:03:36,338][26022] Updated weights on worker 0-0, policy_version 280304 (0.00095) [2022-07-09 14:03:38,193][26022] Updated weights on worker 0-0, policy_version 280314 (0.00094) [2022-07-09 14:03:40,233][26022] Updated weights on worker 0-0, policy_version 280324 (0.00089) [2022-07-09 14:03:40,732][25689] Fps is (10 sec: 5374.1, 60 sec: 5618.2, 300 sec: 5627.7). Total num frames: 287053824. Throughput: 0: 5062.9. Samples: 287048690. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:40,733][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 14:03:41,726][26022] Updated weights on worker 0-0, policy_version 280334 (0.00088) [2022-07-09 14:03:43,751][26022] Updated weights on worker 0-0, policy_version 280344 (0.00083) [2022-07-09 14:03:45,277][26022] Updated weights on worker 0-0, policy_version 280354 (0.00092) [2022-07-09 14:03:45,840][25689] Fps is (10 sec: 5648.9, 60 sec: 5618.5, 300 sec: 5629.7). Total num frames: 287083520. Throughput: 0: 5882.0. Samples: 287082530. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:45,841][25689] Avg episode reward: [(0, '-47.219')] [2022-07-09 14:03:47,354][26022] Updated weights on worker 0-0, policy_version 280364 (0.00091) [2022-07-09 14:03:49,051][26022] Updated weights on worker 0-0, policy_version 280374 (0.00087) [2022-07-09 14:03:50,807][26022] Updated weights on worker 0-0, policy_version 280384 (0.00086) [2022-07-09 14:03:50,902][25689] Fps is (10 sec: 5840.0, 60 sec: 5654.4, 300 sec: 5633.2). Total num frames: 287113216. Throughput: 0: 5866.0. Samples: 287116462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:50,903][25689] Avg episode reward: [(0, '-47.098')] [2022-07-09 14:03:52,928][26022] Updated weights on worker 0-0, policy_version 280394 (0.00092) [2022-07-09 14:03:54,434][26022] Updated weights on worker 0-0, policy_version 280404 (0.00089) [2022-07-09 14:03:55,928][25689] Fps is (10 sec: 5684.6, 60 sec: 5620.8, 300 sec: 5626.1). Total num frames: 287140864. Throughput: 0: 5854.3. Samples: 287150228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:03:55,928][25689] Avg episode reward: [(0, '-47.363')] [2022-07-09 14:03:56,379][26022] Updated weights on worker 0-0, policy_version 280414 (0.00088) [2022-07-09 14:03:58,183][26022] Updated weights on worker 0-0, policy_version 280424 (0.00083) [2022-07-09 14:03:59,935][26022] Updated weights on worker 0-0, policy_version 280434 (0.00097) [2022-07-09 14:04:00,931][25689] Fps is (10 sec: 5615.8, 60 sec: 5642.6, 300 sec: 5641.9). Total num frames: 287169536. Throughput: 0: 5873.6. Samples: 287167330. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:00,932][25689] Avg episode reward: [(0, '-47.534')] [2022-07-09 14:04:02,132][26022] Updated weights on worker 0-0, policy_version 280444 (0.00097) [2022-07-09 14:04:04,111][26022] Updated weights on worker 0-0, policy_version 280454 (0.00091) [2022-07-09 14:04:05,774][26022] Updated weights on worker 0-0, policy_version 280464 (0.00082) [2022-07-09 14:04:05,985][25689] Fps is (10 sec: 5498.1, 60 sec: 5630.2, 300 sec: 5631.3). Total num frames: 287196160. Throughput: 0: 5816.4. Samples: 287199700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:05,985][25689] Avg episode reward: [(0, '-48.128')] [2022-07-09 14:04:07,562][26022] Updated weights on worker 0-0, policy_version 280474 (0.00091) [2022-07-09 14:04:09,262][26022] Updated weights on worker 0-0, policy_version 280484 (0.00089) [2022-07-09 14:04:10,997][25689] Fps is (10 sec: 5493.5, 60 sec: 5633.5, 300 sec: 5631.9). Total num frames: 287224832. Throughput: 0: 5852.3. Samples: 287234062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:10,997][25689] Avg episode reward: [(0, '-48.510')] [2022-07-09 14:04:11,108][26022] Updated weights on worker 0-0, policy_version 280494 (0.00078) [2022-07-09 14:04:13,024][26022] Updated weights on worker 0-0, policy_version 280504 (0.00083) [2022-07-09 14:04:14,519][26022] Updated weights on worker 0-0, policy_version 280514 (0.00459) [2022-07-09 14:04:15,998][25689] Fps is (10 sec: 5726.7, 60 sec: 5633.7, 300 sec: 5632.5). Total num frames: 287253504. Throughput: 0: 5036.6. Samples: 287251316. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:15,999][25689] Avg episode reward: [(0, '-49.274')] [2022-07-09 14:04:16,670][26022] Updated weights on worker 0-0, policy_version 280524 (0.00083) [2022-07-09 14:04:18,158][26022] Updated weights on worker 0-0, policy_version 280534 (0.00089) [2022-07-09 14:04:20,176][26022] Updated weights on worker 0-0, policy_version 280544 (0.00091) [2022-07-09 14:04:21,017][25689] Fps is (10 sec: 5722.7, 60 sec: 5653.6, 300 sec: 5633.1). Total num frames: 287282176. Throughput: 0: 5878.9. Samples: 287285414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:21,019][25689] Avg episode reward: [(0, '-49.485')] [2022-07-09 14:04:21,859][26022] Updated weights on worker 0-0, policy_version 280554 (0.00087) [2022-07-09 14:04:23,643][26022] Updated weights on worker 0-0, policy_version 280564 (0.00095) [2022-07-09 14:04:25,504][26022] Updated weights on worker 0-0, policy_version 280574 (0.00083) [2022-07-09 14:04:26,075][25689] Fps is (10 sec: 5792.7, 60 sec: 5674.1, 300 sec: 5635.7). Total num frames: 287311872. Throughput: 0: 5960.2. Samples: 287319438. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:26,077][25689] Avg episode reward: [(0, '-48.979')] [2022-07-09 14:04:27,418][26022] Updated weights on worker 0-0, policy_version 280584 (0.00087) [2022-07-09 14:04:28,898][26022] Updated weights on worker 0-0, policy_version 280594 (0.00088) [2022-07-09 14:04:31,088][25689] Fps is (10 sec: 5490.9, 60 sec: 5589.7, 300 sec: 5625.4). Total num frames: 287337472. Throughput: 0: 5106.4. Samples: 287336654. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:31,088][25689] Avg episode reward: [(0, '-48.729')] [2022-07-09 14:04:31,124][26022] Updated weights on worker 0-0, policy_version 280604 (0.00086) [2022-07-09 14:04:32,703][26022] Updated weights on worker 0-0, policy_version 280614 (0.00091) [2022-07-09 14:04:34,634][26022] Updated weights on worker 0-0, policy_version 280624 (0.00091) [2022-07-09 14:04:36,096][25689] Fps is (10 sec: 5517.6, 60 sec: 5644.8, 300 sec: 5632.4). Total num frames: 287367168. Throughput: 0: 5942.4. Samples: 287370746. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:36,097][25689] Avg episode reward: [(0, '-48.674')] [2022-07-09 14:04:36,438][26022] Updated weights on worker 0-0, policy_version 280634 (0.00084) [2022-07-09 14:04:38,080][26022] Updated weights on worker 0-0, policy_version 280644 (0.00086) [2022-07-09 14:04:40,285][26022] Updated weights on worker 0-0, policy_version 280654 (0.00085) [2022-07-09 14:04:41,114][25689] Fps is (10 sec: 5923.5, 60 sec: 5681.2, 300 sec: 5637.8). Total num frames: 287396864. Throughput: 0: 5937.0. Samples: 287404730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:41,115][25689] Avg episode reward: [(0, '-47.600')] [2022-07-09 14:04:41,626][26022] Updated weights on worker 0-0, policy_version 280664 (0.00090) [2022-07-09 14:04:43,719][26022] Updated weights on worker 0-0, policy_version 280674 (0.00086) [2022-07-09 14:04:45,211][26022] Updated weights on worker 0-0, policy_version 280684 (0.00085) [2022-07-09 14:04:46,179][25689] Fps is (10 sec: 5687.4, 60 sec: 5651.3, 300 sec: 5630.3). Total num frames: 287424512. Throughput: 0: 5096.4. Samples: 287421898. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:46,180][25689] Avg episode reward: [(0, '-47.124')] [2022-07-09 14:04:47,243][26022] Updated weights on worker 0-0, policy_version 280694 (0.00085) [2022-07-09 14:04:48,999][26022] Updated weights on worker 0-0, policy_version 280704 (0.00092) [2022-07-09 14:04:50,810][26022] Updated weights on worker 0-0, policy_version 280714 (0.00098) [2022-07-09 14:04:51,191][25689] Fps is (10 sec: 5487.6, 60 sec: 5622.0, 300 sec: 5630.6). Total num frames: 287452160. Throughput: 0: 5933.0. Samples: 287455926. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:51,191][25689] Avg episode reward: [(0, '-47.332')] [2022-07-09 14:04:52,483][26022] Updated weights on worker 0-0, policy_version 280724 (0.00082) [2022-07-09 14:04:54,490][26022] Updated weights on worker 0-0, policy_version 280734 (0.00095) [2022-07-09 14:04:55,968][26022] Updated weights on worker 0-0, policy_version 280744 (0.00090) [2022-07-09 14:04:56,207][25689] Fps is (10 sec: 5820.6, 60 sec: 5673.9, 300 sec: 5645.2). Total num frames: 287482880. Throughput: 0: 5926.0. Samples: 287489920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:04:56,208][25689] Avg episode reward: [(0, '-46.599')] [2022-07-09 14:04:58,010][26022] Updated weights on worker 0-0, policy_version 280754 (0.00090) [2022-07-09 14:04:59,766][26022] Updated weights on worker 0-0, policy_version 280764 (0.00088) [2022-07-09 14:05:01,222][25689] Fps is (10 sec: 5716.3, 60 sec: 5638.7, 300 sec: 5639.4). Total num frames: 287509504. Throughput: 0: 5081.6. Samples: 287506912. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:05:01,223][25689] Avg episode reward: [(0, '-46.854')] [2022-07-09 14:05:02,038][26022] Updated weights on worker 0-0, policy_version 280774 (0.00086) [2022-07-09 14:05:02,738][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:05:02,750][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000280778_287516672.pth [2022-07-09 14:05:02,751][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000278794_285485056.pth [2022-07-09 14:05:03,975][26022] Updated weights on worker 0-0, policy_version 280784 (0.00084) [2022-07-09 14:05:05,543][26022] Updated weights on worker 0-0, policy_version 280794 (0.00086) [2022-07-09 14:05:06,296][25689] Fps is (10 sec: 5277.8, 60 sec: 5636.9, 300 sec: 5632.5). Total num frames: 287536128. Throughput: 0: 5792.6. Samples: 287538428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:05:06,297][25689] Avg episode reward: [(0, '-46.451')] [2022-07-09 14:05:07,534][26022] Updated weights on worker 0-0, policy_version 280804 (0.00085) [2022-07-09 14:05:09,535][26022] Updated weights on worker 0-0, policy_version 280814 (0.00094) [2022-07-09 14:05:11,024][26022] Updated weights on worker 0-0, policy_version 280824 (0.00090) [2022-07-09 14:05:11,312][25689] Fps is (10 sec: 5582.2, 60 sec: 5653.5, 300 sec: 5639.3). Total num frames: 287565824. Throughput: 0: 5803.0. Samples: 287572688. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 14:05:11,312][25689] Avg episode reward: [(0, '-46.720')] [2022-07-09 14:05:13,163][26022] Updated weights on worker 0-0, policy_version 280834 (0.00087) [2022-07-09 14:05:14,555][26022] Updated weights on worker 0-0, policy_version 280844 (0.00086) [2022-07-09 14:05:16,364][25689] Fps is (10 sec: 5593.9, 60 sec: 5614.8, 300 sec: 5628.4). Total num frames: 287592448. Throughput: 0: 4960.2. Samples: 287589904. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:16,365][25689] Avg episode reward: [(0, '-46.489')] [2022-07-09 14:05:16,653][26022] Updated weights on worker 0-0, policy_version 280854 (0.00086) [2022-07-09 14:05:18,134][26022] Updated weights on worker 0-0, policy_version 280864 (0.00083) [2022-07-09 14:05:20,204][26022] Updated weights on worker 0-0, policy_version 280874 (0.00089) [2022-07-09 14:05:21,399][25689] Fps is (10 sec: 5583.0, 60 sec: 5630.3, 300 sec: 5635.9). Total num frames: 287622144. Throughput: 0: 5798.6. Samples: 287623910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:21,400][25689] Avg episode reward: [(0, '-46.267')] [2022-07-09 14:05:21,881][26022] Updated weights on worker 0-0, policy_version 280884 (0.00084) [2022-07-09 14:05:23,754][26022] Updated weights on worker 0-0, policy_version 280894 (0.00089) [2022-07-09 14:05:25,259][26022] Updated weights on worker 0-0, policy_version 280904 (0.00079) [2022-07-09 14:05:26,495][25689] Fps is (10 sec: 5660.3, 60 sec: 5592.8, 300 sec: 5627.4). Total num frames: 287649792. Throughput: 0: 5919.5. Samples: 287657996. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:26,496][25689] Avg episode reward: [(0, '-47.337')] [2022-07-09 14:05:27,400][26022] Updated weights on worker 0-0, policy_version 280914 (0.00091) [2022-07-09 14:05:29,017][26022] Updated weights on worker 0-0, policy_version 280924 (0.00091) [2022-07-09 14:05:30,878][26022] Updated weights on worker 0-0, policy_version 280934 (0.00090) [2022-07-09 14:05:31,523][25689] Fps is (10 sec: 5765.8, 60 sec: 5676.2, 300 sec: 5637.4). Total num frames: 287680512. Throughput: 0: 5061.6. Samples: 287674988. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:31,523][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 14:05:32,751][26022] Updated weights on worker 0-0, policy_version 280944 (0.00088) [2022-07-09 14:05:34,548][26022] Updated weights on worker 0-0, policy_version 280954 (0.00094) [2022-07-09 14:05:36,365][26022] Updated weights on worker 0-0, policy_version 280964 (0.00087) [2022-07-09 14:05:36,532][25689] Fps is (10 sec: 5815.4, 60 sec: 5642.2, 300 sec: 5634.4). Total num frames: 287708160. Throughput: 0: 5909.1. Samples: 287709078. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:36,533][25689] Avg episode reward: [(0, '-49.081')] [2022-07-09 14:05:38,023][26022] Updated weights on worker 0-0, policy_version 280974 (0.00084) [2022-07-09 14:05:39,850][26022] Updated weights on worker 0-0, policy_version 280984 (0.00099) [2022-07-09 14:05:41,590][25689] Fps is (10 sec: 5594.3, 60 sec: 5621.6, 300 sec: 5632.9). Total num frames: 287736832. Throughput: 0: 5913.7. Samples: 287743310. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:41,592][25689] Avg episode reward: [(0, '-48.278')] [2022-07-09 14:05:41,898][26022] Updated weights on worker 0-0, policy_version 280994 (0.00089) [2022-07-09 14:05:43,547][26022] Updated weights on worker 0-0, policy_version 281004 (0.00090) [2022-07-09 14:05:45,380][26022] Updated weights on worker 0-0, policy_version 281014 (0.00109) [2022-07-09 14:05:46,668][25689] Fps is (10 sec: 5657.5, 60 sec: 5637.3, 300 sec: 5636.3). Total num frames: 287765504. Throughput: 0: 5079.5. Samples: 287760464. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:46,669][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 14:05:47,119][26022] Updated weights on worker 0-0, policy_version 281024 (0.00092) [2022-07-09 14:05:48,971][26022] Updated weights on worker 0-0, policy_version 281034 (0.00086) [2022-07-09 14:05:50,777][26022] Updated weights on worker 0-0, policy_version 281044 (0.00088) [2022-07-09 14:05:51,670][25689] Fps is (10 sec: 5688.9, 60 sec: 5655.1, 300 sec: 5636.5). Total num frames: 287794176. Throughput: 0: 5936.5. Samples: 287794592. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:51,670][25689] Avg episode reward: [(0, '-48.808')] [2022-07-09 14:05:52,667][26022] Updated weights on worker 0-0, policy_version 281054 (0.00089) [2022-07-09 14:05:54,258][26022] Updated weights on worker 0-0, policy_version 281064 (0.00089) [2022-07-09 14:05:56,406][26022] Updated weights on worker 0-0, policy_version 281074 (0.00092) [2022-07-09 14:05:56,673][25689] Fps is (10 sec: 5629.0, 60 sec: 5605.5, 300 sec: 5633.8). Total num frames: 287821824. Throughput: 0: 5929.0. Samples: 287828496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:05:56,674][25689] Avg episode reward: [(0, '-48.484')] [2022-07-09 14:05:57,727][26022] Updated weights on worker 0-0, policy_version 281084 (0.00087) [2022-07-09 14:05:59,876][26022] Updated weights on worker 0-0, policy_version 281094 (0.00091) [2022-07-09 14:06:01,547][26022] Updated weights on worker 0-0, policy_version 281104 (0.00080) [2022-07-09 14:06:01,674][25689] Fps is (10 sec: 5731.8, 60 sec: 5657.6, 300 sec: 5652.6). Total num frames: 287851520. Throughput: 0: 5103.5. Samples: 287845812. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:01,675][25689] Avg episode reward: [(0, '-47.868')] [2022-07-09 14:06:03,713][26022] Updated weights on worker 0-0, policy_version 281114 (0.00087) [2022-07-09 14:06:05,552][26022] Updated weights on worker 0-0, policy_version 281124 (0.00089) [2022-07-09 14:06:06,806][25689] Fps is (10 sec: 5457.0, 60 sec: 5635.3, 300 sec: 5629.6). Total num frames: 287877120. Throughput: 0: 5823.0. Samples: 287877730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:06,807][25689] Avg episode reward: [(0, '-47.715')] [2022-07-09 14:06:07,453][26022] Updated weights on worker 0-0, policy_version 281134 (0.00086) [2022-07-09 14:06:09,061][26022] Updated weights on worker 0-0, policy_version 281144 (0.00083) [2022-07-09 14:06:11,162][26022] Updated weights on worker 0-0, policy_version 281154 (0.00091) [2022-07-09 14:06:11,830][25689] Fps is (10 sec: 5444.9, 60 sec: 5634.5, 300 sec: 5640.0). Total num frames: 287906816. Throughput: 0: 5835.8. Samples: 287912242. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:11,830][25689] Avg episode reward: [(0, '-48.923')] [2022-07-09 14:06:12,727][26022] Updated weights on worker 0-0, policy_version 281164 (0.00089) [2022-07-09 14:06:14,461][26022] Updated weights on worker 0-0, policy_version 281174 (0.00086) [2022-07-09 14:06:16,225][26022] Updated weights on worker 0-0, policy_version 281184 (0.00085) [2022-07-09 14:06:16,907][25689] Fps is (10 sec: 5778.7, 60 sec: 5666.1, 300 sec: 5638.8). Total num frames: 287935488. Throughput: 0: 4988.1. Samples: 287929422. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:16,909][25689] Avg episode reward: [(0, '-49.273')] [2022-07-09 14:06:18,158][26022] Updated weights on worker 0-0, policy_version 281194 (0.00087) [2022-07-09 14:06:19,935][26022] Updated weights on worker 0-0, policy_version 281204 (0.00082) [2022-07-09 14:06:21,943][25689] Fps is (10 sec: 5467.5, 60 sec: 5615.2, 300 sec: 5629.7). Total num frames: 287962112. Throughput: 0: 5793.9. Samples: 287963250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:21,945][25689] Avg episode reward: [(0, '-48.790')] [2022-07-09 14:06:21,995][26022] Updated weights on worker 0-0, policy_version 281214 (0.00087) [2022-07-09 14:06:23,514][26022] Updated weights on worker 0-0, policy_version 281224 (0.00087) [2022-07-09 14:06:25,453][26022] Updated weights on worker 0-0, policy_version 281234 (0.00089) [2022-07-09 14:06:27,011][25689] Fps is (10 sec: 5675.1, 60 sec: 5668.6, 300 sec: 5636.3). Total num frames: 287992832. Throughput: 0: 5905.0. Samples: 287997040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:27,012][25689] Avg episode reward: [(0, '-49.134')] [2022-07-09 14:06:27,110][26022] Updated weights on worker 0-0, policy_version 281244 (0.00091) [2022-07-09 14:06:29,216][26022] Updated weights on worker 0-0, policy_version 281254 (0.00089) [2022-07-09 14:06:30,797][26022] Updated weights on worker 0-0, policy_version 281264 (0.00092) [2022-07-09 14:06:32,027][25689] Fps is (10 sec: 5687.1, 60 sec: 5602.0, 300 sec: 5633.0). Total num frames: 288019456. Throughput: 0: 5034.4. Samples: 288013920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:32,028][25689] Avg episode reward: [(0, '-48.648')] [2022-07-09 14:06:32,944][26022] Updated weights on worker 0-0, policy_version 281274 (0.00094) [2022-07-09 14:06:34,391][26022] Updated weights on worker 0-0, policy_version 281284 (0.00085) [2022-07-09 14:06:36,535][26022] Updated weights on worker 0-0, policy_version 281294 (0.00087) [2022-07-09 14:06:37,041][25689] Fps is (10 sec: 5411.3, 60 sec: 5601.6, 300 sec: 5629.6). Total num frames: 288047104. Throughput: 0: 5880.1. Samples: 288047810. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:37,041][25689] Avg episode reward: [(0, '-48.048')] [2022-07-09 14:06:37,949][26022] Updated weights on worker 0-0, policy_version 281304 (0.00084) [2022-07-09 14:06:40,174][26022] Updated weights on worker 0-0, policy_version 281314 (0.00082) [2022-07-09 14:06:41,614][26022] Updated weights on worker 0-0, policy_version 281324 (0.00081) [2022-07-09 14:06:42,051][25689] Fps is (10 sec: 5822.6, 60 sec: 5639.9, 300 sec: 5634.3). Total num frames: 288077824. Throughput: 0: 5905.4. Samples: 288081994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:42,053][25689] Avg episode reward: [(0, '-47.200')] [2022-07-09 14:06:43,629][26022] Updated weights on worker 0-0, policy_version 281334 (0.00084) [2022-07-09 14:06:44,983][26022] Updated weights on worker 0-0, policy_version 281344 (0.00104) [2022-07-09 14:06:47,114][25689] Fps is (10 sec: 5794.1, 60 sec: 5624.3, 300 sec: 5630.4). Total num frames: 288105472. Throughput: 0: 5078.7. Samples: 288099138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:47,115][25689] Avg episode reward: [(0, '-46.577')] [2022-07-09 14:06:47,135][26022] Updated weights on worker 0-0, policy_version 281354 (0.00085) [2022-07-09 14:06:48,913][26022] Updated weights on worker 0-0, policy_version 281364 (0.00088) [2022-07-09 14:06:50,694][26022] Updated weights on worker 0-0, policy_version 281374 (0.00090) [2022-07-09 14:06:52,132][25689] Fps is (10 sec: 5688.5, 60 sec: 5639.8, 300 sec: 5637.4). Total num frames: 288135168. Throughput: 0: 5939.6. Samples: 288133336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:52,133][25689] Avg episode reward: [(0, '-46.562')] [2022-07-09 14:06:52,373][26022] Updated weights on worker 0-0, policy_version 281384 (0.00082) [2022-07-09 14:06:54,276][26022] Updated weights on worker 0-0, policy_version 281394 (0.00105) [2022-07-09 14:06:56,202][26022] Updated weights on worker 0-0, policy_version 281404 (0.00091) [2022-07-09 14:06:57,154][25689] Fps is (10 sec: 5813.6, 60 sec: 5654.9, 300 sec: 5641.9). Total num frames: 288163840. Throughput: 0: 5953.3. Samples: 288167552. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:06:57,155][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 14:06:57,970][26022] Updated weights on worker 0-0, policy_version 281414 (0.00096) [2022-07-09 14:06:59,798][26022] Updated weights on worker 0-0, policy_version 281424 (0.00051) [2022-07-09 14:07:01,920][26022] Updated weights on worker 0-0, policy_version 281434 (0.00090) [2022-07-09 14:07:02,200][25689] Fps is (10 sec: 5390.2, 60 sec: 5583.1, 300 sec: 5633.0). Total num frames: 288189440. Throughput: 0: 5103.2. Samples: 288184820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:02,207][25689] Avg episode reward: [(0, '-45.682')] [2022-07-09 14:07:02,866][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:07:02,885][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000281440_288194560.pth [2022-07-09 14:07:02,886][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000279456_286162944.pth [2022-07-09 14:07:03,537][26022] Updated weights on worker 0-0, policy_version 281444 (0.00085) [2022-07-09 14:07:05,632][26022] Updated weights on worker 0-0, policy_version 281454 (0.00094) [2022-07-09 14:07:07,303][25689] Fps is (10 sec: 5347.9, 60 sec: 5636.6, 300 sec: 5635.0). Total num frames: 288218112. Throughput: 0: 5816.1. Samples: 288216554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:07,303][25689] Avg episode reward: [(0, '-46.215')] [2022-07-09 14:07:07,407][26022] Updated weights on worker 0-0, policy_version 281464 (0.00080) [2022-07-09 14:07:09,051][26022] Updated weights on worker 0-0, policy_version 281474 (0.00090) [2022-07-09 14:07:11,198][26022] Updated weights on worker 0-0, policy_version 281484 (0.00090) [2022-07-09 14:07:12,381][25689] Fps is (10 sec: 5733.1, 60 sec: 5631.5, 300 sec: 5633.8). Total num frames: 288247808. Throughput: 0: 5801.5. Samples: 288250812. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:12,382][25689] Avg episode reward: [(0, '-45.841')] [2022-07-09 14:07:12,457][26022] Updated weights on worker 0-0, policy_version 281494 (0.00090) [2022-07-09 14:07:14,639][26022] Updated weights on worker 0-0, policy_version 281504 (0.00082) [2022-07-09 14:07:16,383][26022] Updated weights on worker 0-0, policy_version 281514 (0.00091) [2022-07-09 14:07:17,419][25689] Fps is (10 sec: 5668.2, 60 sec: 5618.1, 300 sec: 5629.7). Total num frames: 288275456. Throughput: 0: 5800.1. Samples: 288285092. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:17,420][25689] Avg episode reward: [(0, '-46.358')] [2022-07-09 14:07:18,023][26022] Updated weights on worker 0-0, policy_version 281524 (0.00091) [2022-07-09 14:07:19,993][26022] Updated weights on worker 0-0, policy_version 281534 (0.00091) [2022-07-09 14:07:21,884][26022] Updated weights on worker 0-0, policy_version 281544 (0.00056) [2022-07-09 14:07:22,441][25689] Fps is (10 sec: 5395.1, 60 sec: 5619.6, 300 sec: 5627.1). Total num frames: 288302080. Throughput: 0: 5788.9. Samples: 288301990. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:22,446][25689] Avg episode reward: [(0, '-45.643')] [2022-07-09 14:07:23,476][26022] Updated weights on worker 0-0, policy_version 281554 (0.00088) [2022-07-09 14:07:25,515][26022] Updated weights on worker 0-0, policy_version 281564 (0.00126) [2022-07-09 14:07:26,988][26022] Updated weights on worker 0-0, policy_version 281574 (0.00093) [2022-07-09 14:07:27,518][25689] Fps is (10 sec: 5779.9, 60 sec: 5635.6, 300 sec: 5636.6). Total num frames: 288333824. Throughput: 0: 5902.7. Samples: 288335878. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:27,521][25689] Avg episode reward: [(0, '-46.147')] [2022-07-09 14:07:29,257][26022] Updated weights on worker 0-0, policy_version 281584 (0.00087) [2022-07-09 14:07:30,860][26022] Updated weights on worker 0-0, policy_version 281594 (0.00091) [2022-07-09 14:07:32,592][25689] Fps is (10 sec: 5951.6, 60 sec: 5664.0, 300 sec: 5639.8). Total num frames: 288362496. Throughput: 0: 5911.2. Samples: 288370282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:32,592][25689] Avg episode reward: [(0, '-46.155')] [2022-07-09 14:07:32,595][26022] Updated weights on worker 0-0, policy_version 281604 (0.00086) [2022-07-09 14:07:34,548][26022] Updated weights on worker 0-0, policy_version 281614 (0.00090) [2022-07-09 14:07:36,150][26022] Updated weights on worker 0-0, policy_version 281624 (0.00094) [2022-07-09 14:07:37,668][25689] Fps is (10 sec: 5548.4, 60 sec: 5658.1, 300 sec: 5635.5). Total num frames: 288390144. Throughput: 0: 5049.2. Samples: 288387334. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2022-07-09 14:07:37,669][25689] Avg episode reward: [(0, '-46.015')] [2022-07-09 14:07:38,027][26022] Updated weights on worker 0-0, policy_version 281634 (0.00086) [2022-07-09 14:07:39,915][26022] Updated weights on worker 0-0, policy_version 281644 (0.00057) [2022-07-09 14:07:41,407][26022] Updated weights on worker 0-0, policy_version 281654 (0.00096) [2022-07-09 14:07:42,726][25689] Fps is (10 sec: 5658.6, 60 sec: 5636.9, 300 sec: 5636.6). Total num frames: 288419840. Throughput: 0: 5893.4. Samples: 288421538. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:07:42,726][25689] Avg episode reward: [(0, '-46.673')] [2022-07-09 14:07:43,569][26022] Updated weights on worker 0-0, policy_version 281664 (0.00086) [2022-07-09 14:07:45,091][26022] Updated weights on worker 0-0, policy_version 281674 (0.00095) [2022-07-09 14:07:47,246][26022] Updated weights on worker 0-0, policy_version 281684 (0.00092) [2022-07-09 14:07:47,797][25689] Fps is (10 sec: 5863.9, 60 sec: 5669.9, 300 sec: 5643.7). Total num frames: 288449536. Throughput: 0: 5891.6. Samples: 288455354. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:07:47,797][25689] Avg episode reward: [(0, '-47.041')] [2022-07-09 14:07:48,758][26022] Updated weights on worker 0-0, policy_version 281694 (0.00089) [2022-07-09 14:07:50,684][26022] Updated weights on worker 0-0, policy_version 281704 (0.00628) [2022-07-09 14:07:52,482][26022] Updated weights on worker 0-0, policy_version 281714 (0.00089) [2022-07-09 14:07:52,823][25689] Fps is (10 sec: 5578.0, 60 sec: 5618.5, 300 sec: 5633.4). Total num frames: 288476160. Throughput: 0: 5038.0. Samples: 288472204. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:07:52,823][25689] Avg episode reward: [(0, '-47.996')] [2022-07-09 14:07:54,600][26022] Updated weights on worker 0-0, policy_version 281724 (0.00086) [2022-07-09 14:07:56,210][26022] Updated weights on worker 0-0, policy_version 281734 (0.00096) [2022-07-09 14:07:57,832][25689] Fps is (10 sec: 5408.3, 60 sec: 5602.9, 300 sec: 5634.3). Total num frames: 288503808. Throughput: 0: 5888.7. Samples: 288506070. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:07:57,832][25689] Avg episode reward: [(0, '-48.126')] [2022-07-09 14:07:58,169][26022] Updated weights on worker 0-0, policy_version 281744 (0.00083) [2022-07-09 14:07:59,635][26022] Updated weights on worker 0-0, policy_version 281754 (0.00089) [2022-07-09 14:08:01,770][26022] Updated weights on worker 0-0, policy_version 281764 (0.00091) [2022-07-09 14:08:02,870][25689] Fps is (10 sec: 5503.6, 60 sec: 5637.4, 300 sec: 5635.5). Total num frames: 288531456. Throughput: 0: 5786.8. Samples: 288538106. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:02,870][25689] Avg episode reward: [(0, '-48.576')] [2022-07-09 14:08:03,665][26022] Updated weights on worker 0-0, policy_version 281774 (0.00084) [2022-07-09 14:08:05,555][26022] Updated weights on worker 0-0, policy_version 281784 (0.00084) [2022-07-09 14:08:07,428][26022] Updated weights on worker 0-0, policy_version 281794 (0.00090) [2022-07-09 14:08:07,940][25689] Fps is (10 sec: 5571.7, 60 sec: 5640.3, 300 sec: 5635.1). Total num frames: 288560128. Throughput: 0: 4955.0. Samples: 288555164. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:07,941][25689] Avg episode reward: [(0, '-48.711')] [2022-07-09 14:08:09,151][26022] Updated weights on worker 0-0, policy_version 281804 (0.00093) [2022-07-09 14:08:10,943][26022] Updated weights on worker 0-0, policy_version 281814 (0.00087) [2022-07-09 14:08:12,728][26022] Updated weights on worker 0-0, policy_version 281824 (0.00088) [2022-07-09 14:08:12,974][25689] Fps is (10 sec: 5675.0, 60 sec: 5627.6, 300 sec: 5634.5). Total num frames: 288588800. Throughput: 0: 5814.7. Samples: 288589380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:12,976][25689] Avg episode reward: [(0, '-48.271')] [2022-07-09 14:08:14,463][26022] Updated weights on worker 0-0, policy_version 281834 (0.00085) [2022-07-09 14:08:16,351][26022] Updated weights on worker 0-0, policy_version 281844 (0.00084) [2022-07-09 14:08:17,991][25689] Fps is (10 sec: 5603.6, 60 sec: 5629.6, 300 sec: 5635.2). Total num frames: 288616448. Throughput: 0: 5847.0. Samples: 288623940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:17,991][25689] Avg episode reward: [(0, '-47.949')] [2022-07-09 14:08:18,186][26022] Updated weights on worker 0-0, policy_version 281854 (0.00092) [2022-07-09 14:08:19,880][26022] Updated weights on worker 0-0, policy_version 281864 (0.00091) [2022-07-09 14:08:21,683][26022] Updated weights on worker 0-0, policy_version 281874 (0.00085) [2022-07-09 14:08:23,024][25689] Fps is (10 sec: 5706.4, 60 sec: 5679.2, 300 sec: 5639.8). Total num frames: 288646144. Throughput: 0: 5106.9. Samples: 288641030. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:23,025][25689] Avg episode reward: [(0, '-47.506')] [2022-07-09 14:08:23,480][26022] Updated weights on worker 0-0, policy_version 281884 (0.00079) [2022-07-09 14:08:25,233][26022] Updated weights on worker 0-0, policy_version 281894 (0.00084) [2022-07-09 14:08:27,192][26022] Updated weights on worker 0-0, policy_version 281904 (0.00087) [2022-07-09 14:08:28,161][25689] Fps is (10 sec: 5739.3, 60 sec: 5623.0, 300 sec: 5630.6). Total num frames: 288674816. Throughput: 0: 5931.0. Samples: 288675092. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:28,161][25689] Avg episode reward: [(0, '-47.509')] [2022-07-09 14:08:29,015][26022] Updated weights on worker 0-0, policy_version 281914 (0.00081) [2022-07-09 14:08:30,670][26022] Updated weights on worker 0-0, policy_version 281924 (0.00094) [2022-07-09 14:08:32,595][26022] Updated weights on worker 0-0, policy_version 281934 (0.00092) [2022-07-09 14:08:33,166][25689] Fps is (10 sec: 5653.9, 60 sec: 5629.4, 300 sec: 5638.5). Total num frames: 288703488. Throughput: 0: 5932.5. Samples: 288709166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:33,166][25689] Avg episode reward: [(0, '-46.791')] [2022-07-09 14:08:34,340][26022] Updated weights on worker 0-0, policy_version 281944 (0.00086) [2022-07-09 14:08:36,227][26022] Updated weights on worker 0-0, policy_version 281954 (0.00084) [2022-07-09 14:08:37,986][26022] Updated weights on worker 0-0, policy_version 281964 (0.00089) [2022-07-09 14:08:38,194][25689] Fps is (10 sec: 5817.2, 60 sec: 5667.7, 300 sec: 5645.6). Total num frames: 288733184. Throughput: 0: 5056.1. Samples: 288726092. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:38,195][25689] Avg episode reward: [(0, '-46.974')] [2022-07-09 14:08:39,841][26022] Updated weights on worker 0-0, policy_version 281974 (0.00087) [2022-07-09 14:08:41,536][26022] Updated weights on worker 0-0, policy_version 281984 (0.00091) [2022-07-09 14:08:43,236][25689] Fps is (10 sec: 5694.3, 60 sec: 5635.3, 300 sec: 5640.0). Total num frames: 288760832. Throughput: 0: 5913.2. Samples: 288760552. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:43,237][25689] Avg episode reward: [(0, '-47.427')] [2022-07-09 14:08:43,434][26022] Updated weights on worker 0-0, policy_version 281994 (0.00085) [2022-07-09 14:08:45,221][26022] Updated weights on worker 0-0, policy_version 282004 (0.00087) [2022-07-09 14:08:46,972][26022] Updated weights on worker 0-0, policy_version 282014 (0.00083) [2022-07-09 14:08:48,310][25689] Fps is (10 sec: 5567.6, 60 sec: 5618.1, 300 sec: 5636.3). Total num frames: 288789504. Throughput: 0: 5934.6. Samples: 288794672. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:48,311][25689] Avg episode reward: [(0, '-47.548')] [2022-07-09 14:08:48,696][26022] Updated weights on worker 0-0, policy_version 282024 (0.00090) [2022-07-09 14:08:50,770][26022] Updated weights on worker 0-0, policy_version 282034 (0.00088) [2022-07-09 14:08:52,386][26022] Updated weights on worker 0-0, policy_version 282044 (0.00092) [2022-07-09 14:08:53,367][25689] Fps is (10 sec: 5660.3, 60 sec: 5649.0, 300 sec: 5639.2). Total num frames: 288818176. Throughput: 0: 5071.5. Samples: 288811616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:53,368][25689] Avg episode reward: [(0, '-47.207')] [2022-07-09 14:08:54,404][26022] Updated weights on worker 0-0, policy_version 282054 (0.00086) [2022-07-09 14:08:56,009][26022] Updated weights on worker 0-0, policy_version 282064 (0.00087) [2022-07-09 14:08:57,946][26022] Updated weights on worker 0-0, policy_version 282074 (0.00094) [2022-07-09 14:08:58,380][25689] Fps is (10 sec: 5592.7, 60 sec: 5648.6, 300 sec: 5635.5). Total num frames: 288845824. Throughput: 0: 5911.3. Samples: 288845418. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:08:58,381][25689] Avg episode reward: [(0, '-47.847')] [2022-07-09 14:08:59,563][26022] Updated weights on worker 0-0, policy_version 282084 (0.00079) [2022-07-09 14:09:01,924][26022] Updated weights on worker 0-0, policy_version 282094 (0.00095) [2022-07-09 14:09:03,099][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:09:03,115][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000282101_288871424.pth [2022-07-09 14:09:03,116][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000280117_286839808.pth [2022-07-09 14:09:03,407][25689] Fps is (10 sec: 5405.4, 60 sec: 5632.7, 300 sec: 5636.1). Total num frames: 288872448. Throughput: 0: 5774.3. Samples: 288877026. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:03,408][25689] Avg episode reward: [(0, '-47.894')] [2022-07-09 14:09:03,667][26022] Updated weights on worker 0-0, policy_version 282104 (0.00091) [2022-07-09 14:09:05,477][26022] Updated weights on worker 0-0, policy_version 282114 (0.00090) [2022-07-09 14:09:07,273][26022] Updated weights on worker 0-0, policy_version 282124 (0.00087) [2022-07-09 14:09:08,454][25689] Fps is (10 sec: 5590.8, 60 sec: 5651.9, 300 sec: 5638.8). Total num frames: 288902144. Throughput: 0: 4941.0. Samples: 288894204. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:08,454][25689] Avg episode reward: [(0, '-48.175')] [2022-07-09 14:09:09,165][26022] Updated weights on worker 0-0, policy_version 282134 (0.00087) [2022-07-09 14:09:10,939][26022] Updated weights on worker 0-0, policy_version 282144 (0.00093) [2022-07-09 14:09:12,820][26022] Updated weights on worker 0-0, policy_version 282154 (0.00089) [2022-07-09 14:09:13,482][25689] Fps is (10 sec: 5590.3, 60 sec: 5618.6, 300 sec: 5631.4). Total num frames: 288928768. Throughput: 0: 5794.4. Samples: 288928170. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:13,483][25689] Avg episode reward: [(0, '-47.330')] [2022-07-09 14:09:14,429][26022] Updated weights on worker 0-0, policy_version 282164 (0.00090) [2022-07-09 14:09:16,549][26022] Updated weights on worker 0-0, policy_version 282174 (0.00079) [2022-07-09 14:09:17,962][26022] Updated weights on worker 0-0, policy_version 282184 (0.00079) [2022-07-09 14:09:18,490][25689] Fps is (10 sec: 5611.5, 60 sec: 5653.2, 300 sec: 5635.1). Total num frames: 288958464. Throughput: 0: 5821.3. Samples: 288962486. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:18,492][25689] Avg episode reward: [(0, '-47.797')] [2022-07-09 14:09:20,095][26022] Updated weights on worker 0-0, policy_version 282194 (0.00094) [2022-07-09 14:09:21,639][26022] Updated weights on worker 0-0, policy_version 282204 (0.00086) [2022-07-09 14:09:23,539][25689] Fps is (10 sec: 5702.0, 60 sec: 5617.9, 300 sec: 5628.4). Total num frames: 288986112. Throughput: 0: 5083.5. Samples: 288979364. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:23,541][25689] Avg episode reward: [(0, '-47.824')] [2022-07-09 14:09:23,655][26022] Updated weights on worker 0-0, policy_version 282214 (0.00093) [2022-07-09 14:09:25,244][26022] Updated weights on worker 0-0, policy_version 282224 (0.00090) [2022-07-09 14:09:27,299][26022] Updated weights on worker 0-0, policy_version 282234 (0.00099) [2022-07-09 14:09:28,664][25689] Fps is (10 sec: 5636.5, 60 sec: 5635.9, 300 sec: 5640.0). Total num frames: 289015808. Throughput: 0: 5903.2. Samples: 289013508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:28,665][25689] Avg episode reward: [(0, '-48.671')] [2022-07-09 14:09:28,927][26022] Updated weights on worker 0-0, policy_version 282244 (0.00082) [2022-07-09 14:09:30,726][26022] Updated weights on worker 0-0, policy_version 282254 (0.00083) [2022-07-09 14:09:32,442][26022] Updated weights on worker 0-0, policy_version 282264 (0.00090) [2022-07-09 14:09:33,703][25689] Fps is (10 sec: 5742.7, 60 sec: 5632.8, 300 sec: 5636.0). Total num frames: 289044480. Throughput: 0: 5904.9. Samples: 289047570. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:33,703][25689] Avg episode reward: [(0, '-47.877')] [2022-07-09 14:09:34,423][26022] Updated weights on worker 0-0, policy_version 282274 (0.00069) [2022-07-09 14:09:36,302][26022] Updated weights on worker 0-0, policy_version 282284 (0.00084) [2022-07-09 14:09:37,980][26022] Updated weights on worker 0-0, policy_version 282294 (0.00052) [2022-07-09 14:09:38,739][25689] Fps is (10 sec: 5691.6, 60 sec: 5615.1, 300 sec: 5632.2). Total num frames: 289073152. Throughput: 0: 5887.1. Samples: 289081694. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:38,740][25689] Avg episode reward: [(0, '-48.388')] [2022-07-09 14:09:39,844][26022] Updated weights on worker 0-0, policy_version 282304 (0.00089) [2022-07-09 14:09:41,725][26022] Updated weights on worker 0-0, policy_version 282314 (0.00088) [2022-07-09 14:09:43,475][26022] Updated weights on worker 0-0, policy_version 282324 (0.00089) [2022-07-09 14:09:43,787][25689] Fps is (10 sec: 5686.2, 60 sec: 5631.5, 300 sec: 5636.0). Total num frames: 289101824. Throughput: 0: 5902.7. Samples: 289098886. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:43,788][25689] Avg episode reward: [(0, '-48.085')] [2022-07-09 14:09:45,416][26022] Updated weights on worker 0-0, policy_version 282334 (0.00086) [2022-07-09 14:09:47,012][26022] Updated weights on worker 0-0, policy_version 282344 (0.00085) [2022-07-09 14:09:48,878][25689] Fps is (10 sec: 5555.2, 60 sec: 5613.0, 300 sec: 5634.5). Total num frames: 289129472. Throughput: 0: 5895.6. Samples: 289132678. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:48,878][25689] Avg episode reward: [(0, '-47.428')] [2022-07-09 14:09:48,959][26022] Updated weights on worker 0-0, policy_version 282354 (0.00092) [2022-07-09 14:09:50,722][26022] Updated weights on worker 0-0, policy_version 282364 (0.00094) [2022-07-09 14:09:52,431][26022] Updated weights on worker 0-0, policy_version 282374 (0.00090) [2022-07-09 14:09:53,904][25689] Fps is (10 sec: 5567.1, 60 sec: 5615.9, 300 sec: 5627.4). Total num frames: 289158144. Throughput: 0: 5904.1. Samples: 289166842. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:53,905][25689] Avg episode reward: [(0, '-46.636')] [2022-07-09 14:09:54,335][26022] Updated weights on worker 0-0, policy_version 282384 (0.00082) [2022-07-09 14:09:55,999][26022] Updated weights on worker 0-0, policy_version 282394 (0.00089) [2022-07-09 14:09:57,932][26022] Updated weights on worker 0-0, policy_version 282404 (0.00092) [2022-07-09 14:09:58,909][25689] Fps is (10 sec: 5716.7, 60 sec: 5633.6, 300 sec: 5634.5). Total num frames: 289186816. Throughput: 0: 5066.7. Samples: 289183886. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:09:58,909][25689] Avg episode reward: [(0, '-45.542')] [2022-07-09 14:09:59,660][26022] Updated weights on worker 0-0, policy_version 282414 (0.00093) [2022-07-09 14:10:01,611][26022] Updated weights on worker 0-0, policy_version 282424 (0.00091) [2022-07-09 14:10:03,664][26022] Updated weights on worker 0-0, policy_version 282434 (0.00084) [2022-07-09 14:10:03,934][25689] Fps is (10 sec: 5513.1, 60 sec: 5633.8, 300 sec: 5635.4). Total num frames: 289213440. Throughput: 0: 5813.8. Samples: 289216014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:03,935][25689] Avg episode reward: [(0, '-46.083')] [2022-07-09 14:10:05,591][26022] Updated weights on worker 0-0, policy_version 282444 (0.00093) [2022-07-09 14:10:07,367][26022] Updated weights on worker 0-0, policy_version 282454 (0.00093) [2022-07-09 14:10:08,976][25689] Fps is (10 sec: 5390.7, 60 sec: 5600.3, 300 sec: 5628.0). Total num frames: 289241088. Throughput: 0: 5814.9. Samples: 289249550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:08,977][25689] Avg episode reward: [(0, '-46.331')] [2022-07-09 14:10:09,157][26022] Updated weights on worker 0-0, policy_version 282464 (0.00089) [2022-07-09 14:10:11,135][26022] Updated weights on worker 0-0, policy_version 282474 (0.00386) [2022-07-09 14:10:12,767][26022] Updated weights on worker 0-0, policy_version 282484 (0.00088) [2022-07-09 14:10:14,068][25689] Fps is (10 sec: 5557.3, 60 sec: 5628.2, 300 sec: 5634.2). Total num frames: 289269760. Throughput: 0: 4937.0. Samples: 289266394. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:14,069][25689] Avg episode reward: [(0, '-47.389')] [2022-07-09 14:10:14,642][26022] Updated weights on worker 0-0, policy_version 282494 (0.00086) [2022-07-09 14:10:16,533][26022] Updated weights on worker 0-0, policy_version 282504 (0.00093) [2022-07-09 14:10:18,332][26022] Updated weights on worker 0-0, policy_version 282514 (0.00091) [2022-07-09 14:10:19,134][25689] Fps is (10 sec: 5847.0, 60 sec: 5639.8, 300 sec: 5637.0). Total num frames: 289300480. Throughput: 0: 5760.5. Samples: 289300394. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:19,135][25689] Avg episode reward: [(0, '-47.184')] [2022-07-09 14:10:20,268][26022] Updated weights on worker 0-0, policy_version 282524 (0.00091) [2022-07-09 14:10:21,874][26022] Updated weights on worker 0-0, policy_version 282534 (0.00092) [2022-07-09 14:10:23,731][26022] Updated weights on worker 0-0, policy_version 282544 (0.00089) [2022-07-09 14:10:24,179][25689] Fps is (10 sec: 5773.3, 60 sec: 5640.1, 300 sec: 5638.0). Total num frames: 289328128. Throughput: 0: 5853.8. Samples: 289334522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:24,179][25689] Avg episode reward: [(0, '-47.411')] [2022-07-09 14:10:25,476][26022] Updated weights on worker 0-0, policy_version 282554 (0.00357) [2022-07-09 14:10:27,335][26022] Updated weights on worker 0-0, policy_version 282564 (0.00088) [2022-07-09 14:10:29,144][26022] Updated weights on worker 0-0, policy_version 282574 (0.00091) [2022-07-09 14:10:29,305][25689] Fps is (10 sec: 5537.5, 60 sec: 5623.2, 300 sec: 5629.2). Total num frames: 289356800. Throughput: 0: 5019.0. Samples: 289351578. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:29,305][25689] Avg episode reward: [(0, '-47.349')] [2022-07-09 14:10:30,959][26022] Updated weights on worker 0-0, policy_version 282584 (0.00101) [2022-07-09 14:10:32,633][26022] Updated weights on worker 0-0, policy_version 282594 (0.00091) [2022-07-09 14:10:34,324][25689] Fps is (10 sec: 5551.1, 60 sec: 5608.0, 300 sec: 5629.1). Total num frames: 289384448. Throughput: 0: 5895.1. Samples: 289385804. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:34,326][25689] Avg episode reward: [(0, '-48.222')] [2022-07-09 14:10:34,612][26022] Updated weights on worker 0-0, policy_version 282604 (0.00091) [2022-07-09 14:10:36,260][26022] Updated weights on worker 0-0, policy_version 282614 (0.00084) [2022-07-09 14:10:38,306][26022] Updated weights on worker 0-0, policy_version 282624 (0.00059) [2022-07-09 14:10:39,380][25689] Fps is (10 sec: 5589.8, 60 sec: 5606.2, 300 sec: 5629.1). Total num frames: 289413120. Throughput: 0: 5891.1. Samples: 289419666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:39,381][25689] Avg episode reward: [(0, '-47.943')] [2022-07-09 14:10:39,879][26022] Updated weights on worker 0-0, policy_version 282634 (0.00092) [2022-07-09 14:10:41,791][26022] Updated weights on worker 0-0, policy_version 282644 (0.00098) [2022-07-09 14:10:43,778][26022] Updated weights on worker 0-0, policy_version 282654 (0.00086) [2022-07-09 14:10:44,402][25689] Fps is (10 sec: 5791.9, 60 sec: 5625.6, 300 sec: 5633.6). Total num frames: 289442816. Throughput: 0: 5050.5. Samples: 289436660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:44,402][25689] Avg episode reward: [(0, '-47.379')] [2022-07-09 14:10:45,338][26022] Updated weights on worker 0-0, policy_version 282664 (0.00087) [2022-07-09 14:10:47,291][26022] Updated weights on worker 0-0, policy_version 282674 (0.00094) [2022-07-09 14:10:48,936][26022] Updated weights on worker 0-0, policy_version 282684 (0.00093) [2022-07-09 14:10:49,502][25689] Fps is (10 sec: 5665.4, 60 sec: 5624.6, 300 sec: 5628.3). Total num frames: 289470464. Throughput: 0: 5914.9. Samples: 289471044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:49,503][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 14:10:50,652][26022] Updated weights on worker 0-0, policy_version 282694 (0.00089) [2022-07-09 14:10:52,604][26022] Updated weights on worker 0-0, policy_version 282704 (0.00090) [2022-07-09 14:10:54,090][26022] Updated weights on worker 0-0, policy_version 282714 (0.00089) [2022-07-09 14:10:54,516][25689] Fps is (10 sec: 5669.6, 60 sec: 5642.7, 300 sec: 5635.0). Total num frames: 289500160. Throughput: 0: 5906.6. Samples: 289505070. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:54,517][25689] Avg episode reward: [(0, '-47.487')] [2022-07-09 14:10:56,349][26022] Updated weights on worker 0-0, policy_version 282724 (0.00079) [2022-07-09 14:10:57,779][26022] Updated weights on worker 0-0, policy_version 282734 (0.00086) [2022-07-09 14:10:59,526][25689] Fps is (10 sec: 5720.8, 60 sec: 5625.3, 300 sec: 5627.9). Total num frames: 289527808. Throughput: 0: 5078.0. Samples: 289521966. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:10:59,526][25689] Avg episode reward: [(0, '-47.025')] [2022-07-09 14:10:59,731][26022] Updated weights on worker 0-0, policy_version 282744 (0.00083) [2022-07-09 14:11:01,525][26022] Updated weights on worker 0-0, policy_version 282754 (0.00091) [2022-07-09 14:11:03,190][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:11:03,203][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000282761_289547264.pth [2022-07-09 14:11:03,204][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000280778_287516672.pth [2022-07-09 14:11:03,963][26022] Updated weights on worker 0-0, policy_version 282764 (0.00098) [2022-07-09 14:11:04,572][25689] Fps is (10 sec: 5396.8, 60 sec: 5623.4, 300 sec: 5633.0). Total num frames: 289554432. Throughput: 0: 5829.5. Samples: 289554244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:04,573][25689] Avg episode reward: [(0, '-46.849')] [2022-07-09 14:11:05,607][26022] Updated weights on worker 0-0, policy_version 282774 (0.00094) [2022-07-09 14:11:07,467][26022] Updated weights on worker 0-0, policy_version 282784 (0.00095) [2022-07-09 14:11:09,096][26022] Updated weights on worker 0-0, policy_version 282794 (0.00089) [2022-07-09 14:11:09,633][25689] Fps is (10 sec: 5471.1, 60 sec: 5638.6, 300 sec: 5628.9). Total num frames: 289583104. Throughput: 0: 5823.5. Samples: 289588274. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:09,633][25689] Avg episode reward: [(0, '-46.401')] [2022-07-09 14:11:11,008][26022] Updated weights on worker 0-0, policy_version 282804 (0.00091) [2022-07-09 14:11:12,804][26022] Updated weights on worker 0-0, policy_version 282814 (0.00091) [2022-07-09 14:11:14,423][26022] Updated weights on worker 0-0, policy_version 282824 (0.00083) [2022-07-09 14:11:14,648][25689] Fps is (10 sec: 5691.5, 60 sec: 5645.7, 300 sec: 5630.0). Total num frames: 289611776. Throughput: 0: 4977.3. Samples: 289605272. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:14,648][25689] Avg episode reward: [(0, '-47.927')] [2022-07-09 14:11:16,381][26022] Updated weights on worker 0-0, policy_version 282834 (0.00085) [2022-07-09 14:11:18,084][26022] Updated weights on worker 0-0, policy_version 282844 (0.00090) [2022-07-09 14:11:19,667][25689] Fps is (10 sec: 5714.9, 60 sec: 5616.3, 300 sec: 5637.2). Total num frames: 289640448. Throughput: 0: 5835.5. Samples: 289639500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:19,667][25689] Avg episode reward: [(0, '-47.806')] [2022-07-09 14:11:19,838][26022] Updated weights on worker 0-0, policy_version 282854 (0.00092) [2022-07-09 14:11:21,784][26022] Updated weights on worker 0-0, policy_version 282864 (0.00090) [2022-07-09 14:11:23,622][26022] Updated weights on worker 0-0, policy_version 282874 (0.00093) [2022-07-09 14:11:24,682][25689] Fps is (10 sec: 5612.6, 60 sec: 5619.0, 300 sec: 5627.9). Total num frames: 289668096. Throughput: 0: 5929.4. Samples: 289673484. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:24,683][25689] Avg episode reward: [(0, '-48.685')] [2022-07-09 14:11:25,410][26022] Updated weights on worker 0-0, policy_version 282884 (0.00634) [2022-07-09 14:11:27,327][26022] Updated weights on worker 0-0, policy_version 282894 (0.00086) [2022-07-09 14:11:28,877][26022] Updated weights on worker 0-0, policy_version 282904 (0.00085) [2022-07-09 14:11:29,744][25689] Fps is (10 sec: 5589.2, 60 sec: 5625.0, 300 sec: 5633.9). Total num frames: 289696768. Throughput: 0: 5064.5. Samples: 289690124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:29,744][25689] Avg episode reward: [(0, '-47.895')] [2022-07-09 14:11:30,972][26022] Updated weights on worker 0-0, policy_version 282914 (0.00094) [2022-07-09 14:11:32,570][26022] Updated weights on worker 0-0, policy_version 282924 (0.00086) [2022-07-09 14:11:34,522][26022] Updated weights on worker 0-0, policy_version 282934 (0.00087) [2022-07-09 14:11:34,825][25689] Fps is (10 sec: 5653.9, 60 sec: 5636.2, 300 sec: 5636.1). Total num frames: 289725440. Throughput: 0: 5880.8. Samples: 289723928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:34,825][25689] Avg episode reward: [(0, '-47.449')] [2022-07-09 14:11:36,415][26022] Updated weights on worker 0-0, policy_version 282944 (0.00087) [2022-07-09 14:11:38,109][26022] Updated weights on worker 0-0, policy_version 282954 (0.00090) [2022-07-09 14:11:39,823][26022] Updated weights on worker 0-0, policy_version 282964 (0.00087) [2022-07-09 14:11:39,921][25689] Fps is (10 sec: 5735.0, 60 sec: 5649.3, 300 sec: 5631.0). Total num frames: 289755136. Throughput: 0: 5855.8. Samples: 289758104. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:39,922][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 14:11:41,702][26022] Updated weights on worker 0-0, policy_version 282974 (0.00059) [2022-07-09 14:11:43,541][26022] Updated weights on worker 0-0, policy_version 282984 (0.00098) [2022-07-09 14:11:44,978][25689] Fps is (10 sec: 5547.0, 60 sec: 5595.3, 300 sec: 5627.7). Total num frames: 289781760. Throughput: 0: 5852.1. Samples: 289792256. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:44,987][25689] Avg episode reward: [(0, '-46.773')] [2022-07-09 14:11:45,355][26022] Updated weights on worker 0-0, policy_version 282994 (0.00080) [2022-07-09 14:11:47,195][26022] Updated weights on worker 0-0, policy_version 283004 (0.00083) [2022-07-09 14:11:48,932][26022] Updated weights on worker 0-0, policy_version 283014 (0.00092) [2022-07-09 14:11:50,044][25689] Fps is (10 sec: 5563.5, 60 sec: 5632.3, 300 sec: 5626.8). Total num frames: 289811456. Throughput: 0: 5872.4. Samples: 289809338. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:50,046][25689] Avg episode reward: [(0, '-46.968')] [2022-07-09 14:11:50,810][26022] Updated weights on worker 0-0, policy_version 283024 (0.00089) [2022-07-09 14:11:52,595][26022] Updated weights on worker 0-0, policy_version 283034 (0.00089) [2022-07-09 14:11:54,267][26022] Updated weights on worker 0-0, policy_version 283044 (0.00085) [2022-07-09 14:11:55,069][25689] Fps is (10 sec: 5885.6, 60 sec: 5631.3, 300 sec: 5630.2). Total num frames: 289841152. Throughput: 0: 5924.6. Samples: 289843868. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:11:55,070][25689] Avg episode reward: [(0, '-46.898')] [2022-07-09 14:11:56,228][26022] Updated weights on worker 0-0, policy_version 283054 (0.00093) [2022-07-09 14:11:57,983][26022] Updated weights on worker 0-0, policy_version 283064 (0.00094) [2022-07-09 14:11:59,741][26022] Updated weights on worker 0-0, policy_version 283074 (0.00086) [2022-07-09 14:12:00,081][25689] Fps is (10 sec: 5815.5, 60 sec: 5648.0, 300 sec: 5641.1). Total num frames: 289869824. Throughput: 0: 5935.7. Samples: 289877766. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:12:00,081][25689] Avg episode reward: [(0, '-47.323')] [2022-07-09 14:12:01,643][26022] Updated weights on worker 0-0, policy_version 283084 (0.00081) [2022-07-09 14:12:03,742][26022] Updated weights on worker 0-0, policy_version 283094 (0.00085) [2022-07-09 14:12:05,114][25689] Fps is (10 sec: 5403.0, 60 sec: 5632.4, 300 sec: 5632.1). Total num frames: 289895424. Throughput: 0: 4985.4. Samples: 289892642. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:12:05,116][25689] Avg episode reward: [(0, '-47.538')] [2022-07-09 14:12:05,682][26022] Updated weights on worker 0-0, policy_version 283104 (0.00090) [2022-07-09 14:12:07,200][26022] Updated weights on worker 0-0, policy_version 283114 (0.00088) [2022-07-09 14:12:09,250][26022] Updated weights on worker 0-0, policy_version 283124 (0.00085) [2022-07-09 14:12:10,160][25689] Fps is (10 sec: 5486.2, 60 sec: 5650.6, 300 sec: 5632.7). Total num frames: 289925120. Throughput: 0: 5834.4. Samples: 289926702. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:12:10,160][25689] Avg episode reward: [(0, '-48.281')] [2022-07-09 14:12:10,913][26022] Updated weights on worker 0-0, policy_version 283134 (0.00091) [2022-07-09 14:12:12,824][26022] Updated weights on worker 0-0, policy_version 283144 (0.00090) [2022-07-09 14:12:14,772][26022] Updated weights on worker 0-0, policy_version 283154 (0.00085) [2022-07-09 14:12:15,239][25689] Fps is (10 sec: 5562.2, 60 sec: 5610.8, 300 sec: 5628.5). Total num frames: 289951744. Throughput: 0: 5806.8. Samples: 289960992. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:12:15,240][25689] Avg episode reward: [(0, '-48.031')] [2022-07-09 14:12:16,290][26022] Updated weights on worker 0-0, policy_version 283164 (0.00089) [2022-07-09 14:12:18,283][26022] Updated weights on worker 0-0, policy_version 283174 (0.00093) [2022-07-09 14:12:20,043][26022] Updated weights on worker 0-0, policy_version 283184 (0.00102) [2022-07-09 14:12:20,271][25689] Fps is (10 sec: 5570.1, 60 sec: 5626.5, 300 sec: 5638.6). Total num frames: 289981440. Throughput: 0: 4972.5. Samples: 289978162. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:12:20,272][25689] Avg episode reward: [(0, '-48.087')] [2022-07-09 14:12:21,792][26022] Updated weights on worker 0-0, policy_version 283194 (0.00089) [2022-07-09 14:12:23,602][26022] Updated weights on worker 0-0, policy_version 283204 (0.00085) [2022-07-09 14:12:25,274][25689] Fps is (10 sec: 5816.7, 60 sec: 5644.6, 300 sec: 5629.7). Total num frames: 290010112. Throughput: 0: 5925.2. Samples: 290012094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 14:12:25,274][25689] Avg episode reward: [(0, '-47.588')] [2022-07-09 14:12:25,338][26022] Updated weights on worker 0-0, policy_version 283214 (0.00087) [2022-07-09 14:12:27,438][26022] Updated weights on worker 0-0, policy_version 283224 (0.00091) [2022-07-09 14:12:29,110][26022] Updated weights on worker 0-0, policy_version 283234 (0.00097) [2022-07-09 14:12:30,371][25689] Fps is (10 sec: 5576.4, 60 sec: 5624.4, 300 sec: 5625.9). Total num frames: 290037760. Throughput: 0: 5887.1. Samples: 290045686. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:12:30,371][25689] Avg episode reward: [(0, '-47.436')] [2022-07-09 14:12:30,984][26022] Updated weights on worker 0-0, policy_version 283244 (0.00103) [2022-07-09 14:12:32,694][26022] Updated weights on worker 0-0, policy_version 283254 (0.00097) [2022-07-09 14:12:34,538][26022] Updated weights on worker 0-0, policy_version 283264 (0.00091) [2022-07-09 14:12:35,398][25689] Fps is (10 sec: 5664.2, 60 sec: 5646.3, 300 sec: 5633.7). Total num frames: 290067456. Throughput: 0: 5049.7. Samples: 290062786. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:12:35,398][25689] Avg episode reward: [(0, '-47.904')] [2022-07-09 14:12:36,484][26022] Updated weights on worker 0-0, policy_version 283274 (0.00082) [2022-07-09 14:12:38,129][26022] Updated weights on worker 0-0, policy_version 283284 (0.00086) [2022-07-09 14:12:40,118][26022] Updated weights on worker 0-0, policy_version 283294 (0.00094) [2022-07-09 14:12:40,428][25689] Fps is (10 sec: 5702.1, 60 sec: 5618.7, 300 sec: 5627.3). Total num frames: 290095104. Throughput: 0: 5888.4. Samples: 290096852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:12:40,428][25689] Avg episode reward: [(0, '-47.013')] [2022-07-09 14:12:41,771][26022] Updated weights on worker 0-0, policy_version 283304 (0.00088) [2022-07-09 14:12:43,414][26022] Updated weights on worker 0-0, policy_version 283314 (0.00081) [2022-07-09 14:12:45,448][25689] Fps is (10 sec: 5501.9, 60 sec: 5639.0, 300 sec: 5621.4). Total num frames: 290122752. Throughput: 0: 5900.9. Samples: 290131140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:12:45,449][25689] Avg episode reward: [(0, '-46.779')] [2022-07-09 14:12:45,529][26022] Updated weights on worker 0-0, policy_version 283324 (0.00090) [2022-07-09 14:12:47,045][26022] Updated weights on worker 0-0, policy_version 283334 (0.00092) [2022-07-09 14:12:49,274][26022] Updated weights on worker 0-0, policy_version 283344 (0.00082) [2022-07-09 14:12:50,511][25689] Fps is (10 sec: 5788.4, 60 sec: 5656.3, 300 sec: 5634.5). Total num frames: 290153472. Throughput: 0: 5078.4. Samples: 290147966. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:12:50,512][25689] Avg episode reward: [(0, '-47.268')] [2022-07-09 14:12:50,714][26022] Updated weights on worker 0-0, policy_version 283354 (0.00083) [2022-07-09 14:12:52,861][26022] Updated weights on worker 0-0, policy_version 283364 (0.00086) [2022-07-09 14:12:54,486][26022] Updated weights on worker 0-0, policy_version 283374 (0.00090) [2022-07-09 14:12:55,534][25689] Fps is (10 sec: 5584.0, 60 sec: 5588.7, 300 sec: 5627.3). Total num frames: 290179072. Throughput: 0: 5911.9. Samples: 290181830. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:12:55,535][25689] Avg episode reward: [(0, '-46.812')] [2022-07-09 14:12:56,282][26022] Updated weights on worker 0-0, policy_version 283384 (0.00092) [2022-07-09 14:12:58,262][26022] Updated weights on worker 0-0, policy_version 283394 (0.00088) [2022-07-09 14:12:59,870][26022] Updated weights on worker 0-0, policy_version 283404 (0.00088) [2022-07-09 14:13:00,573][25689] Fps is (10 sec: 5495.6, 60 sec: 5603.1, 300 sec: 5634.2). Total num frames: 290208768. Throughput: 0: 5902.5. Samples: 290215762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:00,574][25689] Avg episode reward: [(0, '-47.352')] [2022-07-09 14:13:02,235][26022] Updated weights on worker 0-0, policy_version 283414 (0.00092) [2022-07-09 14:13:03,462][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:13:03,477][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000283421_290223104.pth [2022-07-09 14:13:03,477][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000281440_288194560.pth [2022-07-09 14:13:03,977][26022] Updated weights on worker 0-0, policy_version 283424 (0.00087) [2022-07-09 14:13:05,523][26022] Updated weights on worker 0-0, policy_version 283434 (0.00087) [2022-07-09 14:13:05,619][25689] Fps is (10 sec: 5686.4, 60 sec: 5635.8, 300 sec: 5631.2). Total num frames: 290236416. Throughput: 0: 4939.8. Samples: 290230778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:05,619][25689] Avg episode reward: [(0, '-47.305')] [2022-07-09 14:13:07,652][26022] Updated weights on worker 0-0, policy_version 283444 (0.00096) [2022-07-09 14:13:09,154][26022] Updated weights on worker 0-0, policy_version 283454 (0.00083) [2022-07-09 14:13:10,655][25689] Fps is (10 sec: 5281.7, 60 sec: 5569.0, 300 sec: 5620.9). Total num frames: 290262016. Throughput: 0: 5796.0. Samples: 290264718. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:10,655][25689] Avg episode reward: [(0, '-47.390')] [2022-07-09 14:13:11,196][26022] Updated weights on worker 0-0, policy_version 283464 (0.00085) [2022-07-09 14:13:12,918][26022] Updated weights on worker 0-0, policy_version 283474 (0.00097) [2022-07-09 14:13:14,699][26022] Updated weights on worker 0-0, policy_version 283484 (0.00084) [2022-07-09 14:13:15,678][25689] Fps is (10 sec: 5700.7, 60 sec: 5658.9, 300 sec: 5634.5). Total num frames: 290293760. Throughput: 0: 5817.9. Samples: 290299024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:15,678][25689] Avg episode reward: [(0, '-47.536')] [2022-07-09 14:13:16,641][26022] Updated weights on worker 0-0, policy_version 283494 (0.00089) [2022-07-09 14:13:18,566][26022] Updated weights on worker 0-0, policy_version 283504 (0.00085) [2022-07-09 14:13:20,089][26022] Updated weights on worker 0-0, policy_version 283514 (0.01164) [2022-07-09 14:13:20,682][25689] Fps is (10 sec: 5922.9, 60 sec: 5627.6, 300 sec: 5628.1). Total num frames: 290321408. Throughput: 0: 4991.7. Samples: 290316142. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:20,683][25689] Avg episode reward: [(0, '-47.192')] [2022-07-09 14:13:21,946][26022] Updated weights on worker 0-0, policy_version 283524 (0.00084) [2022-07-09 14:13:23,721][26022] Updated weights on worker 0-0, policy_version 283534 (0.00087) [2022-07-09 14:13:25,426][26022] Updated weights on worker 0-0, policy_version 283544 (0.00089) [2022-07-09 14:13:25,704][25689] Fps is (10 sec: 5617.2, 60 sec: 5625.8, 300 sec: 5630.3). Total num frames: 290350080. Throughput: 0: 5959.0. Samples: 290350468. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:25,706][25689] Avg episode reward: [(0, '-47.990')] [2022-07-09 14:13:27,620][26022] Updated weights on worker 0-0, policy_version 283554 (0.00092) [2022-07-09 14:13:29,023][26022] Updated weights on worker 0-0, policy_version 283564 (0.00087) [2022-07-09 14:13:30,749][25689] Fps is (10 sec: 5594.9, 60 sec: 5630.7, 300 sec: 5626.1). Total num frames: 290377728. Throughput: 0: 5946.8. Samples: 290384212. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:30,749][25689] Avg episode reward: [(0, '-47.480')] [2022-07-09 14:13:31,066][26022] Updated weights on worker 0-0, policy_version 283574 (0.00086) [2022-07-09 14:13:32,860][26022] Updated weights on worker 0-0, policy_version 283584 (0.00093) [2022-07-09 14:13:34,392][26022] Updated weights on worker 0-0, policy_version 283594 (0.00085) [2022-07-09 14:13:35,752][25689] Fps is (10 sec: 5604.9, 60 sec: 5615.9, 300 sec: 5623.2). Total num frames: 290406400. Throughput: 0: 5104.2. Samples: 290401490. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:35,753][25689] Avg episode reward: [(0, '-47.076')] [2022-07-09 14:13:36,685][26022] Updated weights on worker 0-0, policy_version 283604 (0.00088) [2022-07-09 14:13:38,079][26022] Updated weights on worker 0-0, policy_version 283614 (0.00088) [2022-07-09 14:13:40,145][26022] Updated weights on worker 0-0, policy_version 283624 (0.00095) [2022-07-09 14:13:40,769][25689] Fps is (10 sec: 5620.5, 60 sec: 5617.1, 300 sec: 5623.6). Total num frames: 290434048. Throughput: 0: 5938.8. Samples: 290435434. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:40,770][25689] Avg episode reward: [(0, '-47.226')] [2022-07-09 14:13:41,594][26022] Updated weights on worker 0-0, policy_version 283634 (0.00082) [2022-07-09 14:13:43,643][26022] Updated weights on worker 0-0, policy_version 283644 (0.00093) [2022-07-09 14:13:45,415][26022] Updated weights on worker 0-0, policy_version 283654 (0.00083) [2022-07-09 14:13:45,790][25689] Fps is (10 sec: 5713.2, 60 sec: 5651.1, 300 sec: 5628.1). Total num frames: 290463744. Throughput: 0: 5939.5. Samples: 290469766. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:45,790][25689] Avg episode reward: [(0, '-47.717')] [2022-07-09 14:13:47,068][26022] Updated weights on worker 0-0, policy_version 283664 (0.00089) [2022-07-09 14:13:49,032][26022] Updated weights on worker 0-0, policy_version 283674 (0.00091) [2022-07-09 14:13:50,842][25689] Fps is (10 sec: 5693.1, 60 sec: 5601.2, 300 sec: 5624.7). Total num frames: 290491392. Throughput: 0: 5118.4. Samples: 290487056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:50,842][25689] Avg episode reward: [(0, '-48.101')] [2022-07-09 14:13:50,874][26022] Updated weights on worker 0-0, policy_version 283684 (0.00085) [2022-07-09 14:13:52,571][26022] Updated weights on worker 0-0, policy_version 283694 (0.00097) [2022-07-09 14:13:54,461][26022] Updated weights on worker 0-0, policy_version 283704 (0.00086) [2022-07-09 14:13:55,874][25689] Fps is (10 sec: 5686.5, 60 sec: 5668.2, 300 sec: 5631.3). Total num frames: 290521088. Throughput: 0: 5941.4. Samples: 290521038. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:13:55,874][25689] Avg episode reward: [(0, '-48.061')] [2022-07-09 14:13:56,109][26022] Updated weights on worker 0-0, policy_version 283714 (0.00089) [2022-07-09 14:13:58,067][26022] Updated weights on worker 0-0, policy_version 283724 (0.00096) [2022-07-09 14:13:59,619][26022] Updated weights on worker 0-0, policy_version 283734 (0.00094) [2022-07-09 14:14:00,913][25689] Fps is (10 sec: 5693.8, 60 sec: 5634.2, 300 sec: 5634.5). Total num frames: 290548736. Throughput: 0: 5948.2. Samples: 290555254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:00,914][25689] Avg episode reward: [(0, '-47.349')] [2022-07-09 14:14:01,976][26022] Updated weights on worker 0-0, policy_version 283744 (0.00109) [2022-07-09 14:14:03,615][26022] Updated weights on worker 0-0, policy_version 283754 (0.00092) [2022-07-09 14:14:05,446][26022] Updated weights on worker 0-0, policy_version 283764 (0.00090) [2022-07-09 14:14:05,939][25689] Fps is (10 sec: 5595.2, 60 sec: 5653.0, 300 sec: 5631.4). Total num frames: 290577408. Throughput: 0: 4982.4. Samples: 290570160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:05,940][25689] Avg episode reward: [(0, '-47.940')] [2022-07-09 14:14:07,560][26022] Updated weights on worker 0-0, policy_version 283774 (0.00081) [2022-07-09 14:14:09,063][26022] Updated weights on worker 0-0, policy_version 283784 (0.00093) [2022-07-09 14:14:11,000][25689] Fps is (10 sec: 5481.9, 60 sec: 5667.7, 300 sec: 5630.8). Total num frames: 290604032. Throughput: 0: 5814.7. Samples: 290604270. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:11,000][25689] Avg episode reward: [(0, '-47.181')] [2022-07-09 14:14:11,171][26022] Updated weights on worker 0-0, policy_version 283794 (0.00087) [2022-07-09 14:14:12,609][26022] Updated weights on worker 0-0, policy_version 283804 (0.00083) [2022-07-09 14:14:14,592][26022] Updated weights on worker 0-0, policy_version 283814 (0.00104) [2022-07-09 14:14:16,002][25689] Fps is (10 sec: 5495.5, 60 sec: 5618.7, 300 sec: 5627.5). Total num frames: 290632704. Throughput: 0: 5835.7. Samples: 290638498. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:16,002][25689] Avg episode reward: [(0, '-46.592')] [2022-07-09 14:14:16,481][26022] Updated weights on worker 0-0, policy_version 283824 (0.00083) [2022-07-09 14:14:18,107][26022] Updated weights on worker 0-0, policy_version 283834 (0.00083) [2022-07-09 14:14:20,032][26022] Updated weights on worker 0-0, policy_version 283844 (0.00088) [2022-07-09 14:14:21,009][25689] Fps is (10 sec: 5831.5, 60 sec: 5652.4, 300 sec: 5635.1). Total num frames: 290662400. Throughput: 0: 4993.7. Samples: 290655608. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:21,011][25689] Avg episode reward: [(0, '-45.634')] [2022-07-09 14:14:21,747][26022] Updated weights on worker 0-0, policy_version 283854 (0.00092) [2022-07-09 14:14:23,644][26022] Updated weights on worker 0-0, policy_version 283864 (0.00445) [2022-07-09 14:14:25,523][26022] Updated weights on worker 0-0, policy_version 283874 (0.00085) [2022-07-09 14:14:26,015][25689] Fps is (10 sec: 5726.8, 60 sec: 5636.9, 300 sec: 5630.5). Total num frames: 290690048. Throughput: 0: 5956.9. Samples: 290689748. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:26,017][25689] Avg episode reward: [(0, '-46.086')] [2022-07-09 14:14:27,188][26022] Updated weights on worker 0-0, policy_version 283884 (0.00084) [2022-07-09 14:14:29,004][26022] Updated weights on worker 0-0, policy_version 283894 (0.00090) [2022-07-09 14:14:31,077][25689] Fps is (10 sec: 5492.3, 60 sec: 5635.3, 300 sec: 5626.6). Total num frames: 290717696. Throughput: 0: 5942.7. Samples: 290723582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:31,079][25689] Avg episode reward: [(0, '-46.053')] [2022-07-09 14:14:31,080][26022] Updated weights on worker 0-0, policy_version 283904 (0.00095) [2022-07-09 14:14:32,576][26022] Updated weights on worker 0-0, policy_version 283914 (0.00088) [2022-07-09 14:14:34,544][26022] Updated weights on worker 0-0, policy_version 283924 (0.00080) [2022-07-09 14:14:36,086][25689] Fps is (10 sec: 5693.9, 60 sec: 5651.8, 300 sec: 5630.6). Total num frames: 290747392. Throughput: 0: 5091.5. Samples: 290740758. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:36,088][25689] Avg episode reward: [(0, '-45.627')] [2022-07-09 14:14:36,174][26022] Updated weights on worker 0-0, policy_version 283934 (0.00085) [2022-07-09 14:14:38,076][26022] Updated weights on worker 0-0, policy_version 283944 (0.00094) [2022-07-09 14:14:39,867][26022] Updated weights on worker 0-0, policy_version 283954 (0.00083) [2022-07-09 14:14:41,118][25689] Fps is (10 sec: 5711.2, 60 sec: 5650.4, 300 sec: 5627.5). Total num frames: 290775040. Throughput: 0: 5935.1. Samples: 290774954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:41,119][25689] Avg episode reward: [(0, '-46.175')] [2022-07-09 14:14:41,749][26022] Updated weights on worker 0-0, policy_version 283964 (0.00086) [2022-07-09 14:14:43,345][26022] Updated weights on worker 0-0, policy_version 283974 (0.00086) [2022-07-09 14:14:45,426][26022] Updated weights on worker 0-0, policy_version 283984 (0.00091) [2022-07-09 14:14:46,123][25689] Fps is (10 sec: 5509.3, 60 sec: 5617.9, 300 sec: 5629.1). Total num frames: 290802688. Throughput: 0: 5934.3. Samples: 290809074. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:46,123][25689] Avg episode reward: [(0, '-46.750')] [2022-07-09 14:14:46,904][26022] Updated weights on worker 0-0, policy_version 283994 (0.00089) [2022-07-09 14:14:49,022][26022] Updated weights on worker 0-0, policy_version 284004 (0.00090) [2022-07-09 14:14:50,515][26022] Updated weights on worker 0-0, policy_version 284014 (0.00095) [2022-07-09 14:14:51,222][25689] Fps is (10 sec: 5776.6, 60 sec: 5664.4, 300 sec: 5634.6). Total num frames: 290833408. Throughput: 0: 5089.1. Samples: 290826102. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 14:14:51,222][25689] Avg episode reward: [(0, '-47.217')] [2022-07-09 14:14:52,567][26022] Updated weights on worker 0-0, policy_version 284024 (0.00093) [2022-07-09 14:14:54,348][26022] Updated weights on worker 0-0, policy_version 284034 (0.00093) [2022-07-09 14:14:56,215][26022] Updated weights on worker 0-0, policy_version 284044 (0.00091) [2022-07-09 14:14:56,313][25689] Fps is (10 sec: 5727.7, 60 sec: 5624.9, 300 sec: 5629.5). Total num frames: 290861056. Throughput: 0: 5915.2. Samples: 290860406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:14:56,314][25689] Avg episode reward: [(0, '-47.786')] [2022-07-09 14:14:57,849][26022] Updated weights on worker 0-0, policy_version 284054 (0.00089) [2022-07-09 14:14:59,865][26022] Updated weights on worker 0-0, policy_version 284064 (0.00098) [2022-07-09 14:15:01,338][25689] Fps is (10 sec: 5668.5, 60 sec: 5660.2, 300 sec: 5639.8). Total num frames: 290890752. Throughput: 0: 5896.2. Samples: 290894178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:01,339][25689] Avg episode reward: [(0, '-48.183')] [2022-07-09 14:15:01,554][26022] Updated weights on worker 0-0, policy_version 284074 (0.00091) [2022-07-09 14:15:03,519][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:15:03,529][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000284082_290899968.pth [2022-07-09 14:15:03,530][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000282101_288871424.pth [2022-07-09 14:15:03,956][26022] Updated weights on worker 0-0, policy_version 284084 (0.00088) [2022-07-09 14:15:05,452][26022] Updated weights on worker 0-0, policy_version 284094 (0.00082) [2022-07-09 14:15:06,343][25689] Fps is (10 sec: 5411.2, 60 sec: 5594.4, 300 sec: 5630.2). Total num frames: 290915328. Throughput: 0: 5791.1. Samples: 290926170. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:06,343][25689] Avg episode reward: [(0, '-48.330')] [2022-07-09 14:15:07,395][26022] Updated weights on worker 0-0, policy_version 284104 (0.00087) [2022-07-09 14:15:09,211][26022] Updated weights on worker 0-0, policy_version 284114 (0.00062) [2022-07-09 14:15:11,004][26022] Updated weights on worker 0-0, policy_version 284124 (0.00093) [2022-07-09 14:15:11,413][25689] Fps is (10 sec: 5387.1, 60 sec: 5644.4, 300 sec: 5634.1). Total num frames: 290945024. Throughput: 0: 5798.8. Samples: 290943184. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:11,413][25689] Avg episode reward: [(0, '-48.275')] [2022-07-09 14:15:12,682][26022] Updated weights on worker 0-0, policy_version 284134 (0.00093) [2022-07-09 14:15:14,696][26022] Updated weights on worker 0-0, policy_version 284144 (0.01062) [2022-07-09 14:15:16,291][26022] Updated weights on worker 0-0, policy_version 284154 (0.00095) [2022-07-09 14:15:16,416][25689] Fps is (10 sec: 5794.2, 60 sec: 5644.2, 300 sec: 5628.4). Total num frames: 290973696. Throughput: 0: 5808.2. Samples: 290977168. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:16,417][25689] Avg episode reward: [(0, '-48.178')] [2022-07-09 14:15:18,371][26022] Updated weights on worker 0-0, policy_version 284164 (0.00092) [2022-07-09 14:15:19,787][26022] Updated weights on worker 0-0, policy_version 284174 (0.00893) [2022-07-09 14:15:21,484][25689] Fps is (10 sec: 5592.0, 60 sec: 5604.7, 300 sec: 5627.9). Total num frames: 291001344. Throughput: 0: 5801.6. Samples: 291011058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:21,485][25689] Avg episode reward: [(0, '-47.519')] [2022-07-09 14:15:21,852][26022] Updated weights on worker 0-0, policy_version 284184 (0.00100) [2022-07-09 14:15:23,622][26022] Updated weights on worker 0-0, policy_version 284194 (0.00083) [2022-07-09 14:15:25,461][26022] Updated weights on worker 0-0, policy_version 284204 (0.00084) [2022-07-09 14:15:26,490][25689] Fps is (10 sec: 5692.6, 60 sec: 5638.6, 300 sec: 5633.7). Total num frames: 291031040. Throughput: 0: 5059.8. Samples: 291028110. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:26,490][25689] Avg episode reward: [(0, '-47.389')] [2022-07-09 14:15:27,390][26022] Updated weights on worker 0-0, policy_version 284214 (0.00092) [2022-07-09 14:15:29,113][26022] Updated weights on worker 0-0, policy_version 284224 (0.00088) [2022-07-09 14:15:30,845][26022] Updated weights on worker 0-0, policy_version 284234 (0.00082) [2022-07-09 14:15:31,600][25689] Fps is (10 sec: 5770.2, 60 sec: 5651.0, 300 sec: 5635.4). Total num frames: 291059712. Throughput: 0: 5874.5. Samples: 291061774. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:31,600][25689] Avg episode reward: [(0, '-47.598')] [2022-07-09 14:15:32,906][26022] Updated weights on worker 0-0, policy_version 284244 (0.00087) [2022-07-09 14:15:34,383][26022] Updated weights on worker 0-0, policy_version 284254 (0.00081) [2022-07-09 14:15:36,406][26022] Updated weights on worker 0-0, policy_version 284264 (0.00383) [2022-07-09 14:15:36,657][25689] Fps is (10 sec: 5539.4, 60 sec: 5612.7, 300 sec: 5631.9). Total num frames: 291087360. Throughput: 0: 5870.9. Samples: 291095998. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:36,658][25689] Avg episode reward: [(0, '-47.147')] [2022-07-09 14:15:38,012][26022] Updated weights on worker 0-0, policy_version 284274 (0.00085) [2022-07-09 14:15:39,779][26022] Updated weights on worker 0-0, policy_version 284284 (0.00080) [2022-07-09 14:15:41,708][25689] Fps is (10 sec: 5571.5, 60 sec: 5627.8, 300 sec: 5627.9). Total num frames: 291116032. Throughput: 0: 5041.3. Samples: 291113014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:41,709][25689] Avg episode reward: [(0, '-47.307')] [2022-07-09 14:15:41,719][26022] Updated weights on worker 0-0, policy_version 284294 (0.00089) [2022-07-09 14:15:43,562][26022] Updated weights on worker 0-0, policy_version 284304 (0.00085) [2022-07-09 14:15:45,402][26022] Updated weights on worker 0-0, policy_version 284314 (0.00090) [2022-07-09 14:15:46,766][25689] Fps is (10 sec: 5672.8, 60 sec: 5639.9, 300 sec: 5632.2). Total num frames: 291144704. Throughput: 0: 5854.8. Samples: 291146820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:46,766][25689] Avg episode reward: [(0, '-46.445')] [2022-07-09 14:15:47,401][26022] Updated weights on worker 0-0, policy_version 284324 (0.00086) [2022-07-09 14:15:48,935][26022] Updated weights on worker 0-0, policy_version 284334 (0.00089) [2022-07-09 14:15:50,976][26022] Updated weights on worker 0-0, policy_version 284344 (0.00095) [2022-07-09 14:15:51,815][25689] Fps is (10 sec: 5673.9, 60 sec: 5610.7, 300 sec: 5628.1). Total num frames: 291173376. Throughput: 0: 5882.3. Samples: 291180686. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:51,815][25689] Avg episode reward: [(0, '-46.193')] [2022-07-09 14:15:52,574][26022] Updated weights on worker 0-0, policy_version 284354 (0.00085) [2022-07-09 14:15:54,506][26022] Updated weights on worker 0-0, policy_version 284364 (0.00093) [2022-07-09 14:15:56,428][26022] Updated weights on worker 0-0, policy_version 284374 (0.00087) [2022-07-09 14:15:56,820][25689] Fps is (10 sec: 5601.3, 60 sec: 5618.7, 300 sec: 5628.1). Total num frames: 291201024. Throughput: 0: 5034.4. Samples: 291197508. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:15:56,821][25689] Avg episode reward: [(0, '-46.204')] [2022-07-09 14:15:58,148][26022] Updated weights on worker 0-0, policy_version 284384 (0.00083) [2022-07-09 14:15:59,913][26022] Updated weights on worker 0-0, policy_version 284394 (0.00092) [2022-07-09 14:16:01,835][25689] Fps is (10 sec: 5518.6, 60 sec: 5585.8, 300 sec: 5632.2). Total num frames: 291228672. Throughput: 0: 5880.7. Samples: 291231372. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:01,835][25689] Avg episode reward: [(0, '-46.870')] [2022-07-09 14:16:01,878][26022] Updated weights on worker 0-0, policy_version 284404 (0.00085) [2022-07-09 14:16:03,904][26022] Updated weights on worker 0-0, policy_version 284414 (0.00086) [2022-07-09 14:16:05,699][26022] Updated weights on worker 0-0, policy_version 284424 (0.00090) [2022-07-09 14:16:06,840][25689] Fps is (10 sec: 5518.8, 60 sec: 5636.5, 300 sec: 5629.8). Total num frames: 291256320. Throughput: 0: 5799.0. Samples: 291263232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:06,841][25689] Avg episode reward: [(0, '-46.078')] [2022-07-09 14:16:07,444][26022] Updated weights on worker 0-0, policy_version 284434 (0.00091) [2022-07-09 14:16:09,318][26022] Updated weights on worker 0-0, policy_version 284444 (0.00096) [2022-07-09 14:16:11,222][26022] Updated weights on worker 0-0, policy_version 284454 (0.00088) [2022-07-09 14:16:11,964][25689] Fps is (10 sec: 5560.3, 60 sec: 5614.6, 300 sec: 5627.7). Total num frames: 291284992. Throughput: 0: 4944.4. Samples: 291280310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:11,964][25689] Avg episode reward: [(0, '-45.888')] [2022-07-09 14:16:13,064][26022] Updated weights on worker 0-0, policy_version 284464 (0.00089) [2022-07-09 14:16:14,587][26022] Updated weights on worker 0-0, policy_version 284474 (0.00087) [2022-07-09 14:16:16,639][26022] Updated weights on worker 0-0, policy_version 284484 (0.00083) [2022-07-09 14:16:17,003][25689] Fps is (10 sec: 5642.6, 60 sec: 5611.3, 300 sec: 5627.4). Total num frames: 291313664. Throughput: 0: 5803.0. Samples: 291314626. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:17,003][25689] Avg episode reward: [(0, '-45.931')] [2022-07-09 14:16:18,180][26022] Updated weights on worker 0-0, policy_version 284494 (0.00098) [2022-07-09 14:16:20,071][26022] Updated weights on worker 0-0, policy_version 284504 (0.00084) [2022-07-09 14:16:22,044][25689] Fps is (10 sec: 5587.3, 60 sec: 5613.8, 300 sec: 5626.9). Total num frames: 291341312. Throughput: 0: 5813.3. Samples: 291348854. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:22,046][25689] Avg episode reward: [(0, '-46.390')] [2022-07-09 14:16:22,047][26022] Updated weights on worker 0-0, policy_version 284514 (0.00099) [2022-07-09 14:16:23,580][26022] Updated weights on worker 0-0, policy_version 284524 (0.00091) [2022-07-09 14:16:25,672][26022] Updated weights on worker 0-0, policy_version 284534 (0.00088) [2022-07-09 14:16:27,056][25689] Fps is (10 sec: 5704.2, 60 sec: 5613.2, 300 sec: 5631.3). Total num frames: 291371008. Throughput: 0: 5089.0. Samples: 291366110. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:27,056][25689] Avg episode reward: [(0, '-47.036')] [2022-07-09 14:16:27,309][26022] Updated weights on worker 0-0, policy_version 284544 (0.00090) [2022-07-09 14:16:29,343][26022] Updated weights on worker 0-0, policy_version 284554 (0.00086) [2022-07-09 14:16:30,843][26022] Updated weights on worker 0-0, policy_version 284564 (0.00093) [2022-07-09 14:16:32,129][25689] Fps is (10 sec: 5686.2, 60 sec: 5599.7, 300 sec: 5628.0). Total num frames: 291398656. Throughput: 0: 5932.7. Samples: 291399942. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:32,129][25689] Avg episode reward: [(0, '-46.346')] [2022-07-09 14:16:33,012][26022] Updated weights on worker 0-0, policy_version 284574 (0.00091) [2022-07-09 14:16:34,588][26022] Updated weights on worker 0-0, policy_version 284584 (0.00086) [2022-07-09 14:16:36,554][26022] Updated weights on worker 0-0, policy_version 284594 (0.00085) [2022-07-09 14:16:37,143][25689] Fps is (10 sec: 5583.4, 60 sec: 5620.6, 300 sec: 5626.1). Total num frames: 291427328. Throughput: 0: 5911.0. Samples: 291433674. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:37,144][25689] Avg episode reward: [(0, '-46.783')] [2022-07-09 14:16:38,199][26022] Updated weights on worker 0-0, policy_version 284604 (0.00093) [2022-07-09 14:16:40,071][26022] Updated weights on worker 0-0, policy_version 284614 (0.00085) [2022-07-09 14:16:42,067][26022] Updated weights on worker 0-0, policy_version 284624 (0.00095) [2022-07-09 14:16:42,171][25689] Fps is (10 sec: 5608.2, 60 sec: 5605.8, 300 sec: 5630.1). Total num frames: 291454976. Throughput: 0: 5065.5. Samples: 291450808. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:42,171][25689] Avg episode reward: [(0, '-47.792')] [2022-07-09 14:16:43,586][26022] Updated weights on worker 0-0, policy_version 284634 (0.00092) [2022-07-09 14:16:45,586][26022] Updated weights on worker 0-0, policy_version 284644 (0.00050) [2022-07-09 14:16:47,122][26022] Updated weights on worker 0-0, policy_version 284654 (0.00086) [2022-07-09 14:16:47,179][25689] Fps is (10 sec: 5815.6, 60 sec: 5644.3, 300 sec: 5634.6). Total num frames: 291485696. Throughput: 0: 5908.5. Samples: 291485010. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:47,181][25689] Avg episode reward: [(0, '-48.640')] [2022-07-09 14:16:49,149][26022] Updated weights on worker 0-0, policy_version 284664 (0.00087) [2022-07-09 14:16:50,788][26022] Updated weights on worker 0-0, policy_version 284674 (0.00089) [2022-07-09 14:16:52,242][25689] Fps is (10 sec: 5795.5, 60 sec: 5626.0, 300 sec: 5627.0). Total num frames: 291513344. Throughput: 0: 5920.3. Samples: 291519022. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:52,244][25689] Avg episode reward: [(0, '-48.310')] [2022-07-09 14:16:52,678][26022] Updated weights on worker 0-0, policy_version 284684 (0.00114) [2022-07-09 14:16:54,559][26022] Updated weights on worker 0-0, policy_version 284694 (0.00106) [2022-07-09 14:16:56,437][26022] Updated weights on worker 0-0, policy_version 284704 (0.00091) [2022-07-09 14:16:57,303][25689] Fps is (10 sec: 5462.1, 60 sec: 5620.9, 300 sec: 5622.6). Total num frames: 291540992. Throughput: 0: 5073.1. Samples: 291535948. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:16:57,304][25689] Avg episode reward: [(0, '-47.645')] [2022-07-09 14:16:58,041][26022] Updated weights on worker 0-0, policy_version 284714 (0.00081) [2022-07-09 14:16:59,895][26022] Updated weights on worker 0-0, policy_version 284724 (0.00088) [2022-07-09 14:17:01,644][26022] Updated weights on worker 0-0, policy_version 284734 (0.00096) [2022-07-09 14:17:02,327][25689] Fps is (10 sec: 5483.3, 60 sec: 5620.1, 300 sec: 5629.7). Total num frames: 291568640. Throughput: 0: 5925.1. Samples: 291570232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:17:02,328][25689] Avg episode reward: [(0, '-48.081')] [2022-07-09 14:17:03,601][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:17:03,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000284742_291575808.pth [2022-07-09 14:17:03,618][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000282761_289547264.pth [2022-07-09 14:17:03,937][26022] Updated weights on worker 0-0, policy_version 284744 (0.00089) [2022-07-09 14:17:05,776][26022] Updated weights on worker 0-0, policy_version 284754 (0.00093) [2022-07-09 14:17:07,343][25689] Fps is (10 sec: 5609.2, 60 sec: 5635.9, 300 sec: 5626.8). Total num frames: 291597312. Throughput: 0: 5796.4. Samples: 291601890. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:17:07,344][25689] Avg episode reward: [(0, '-49.391')] [2022-07-09 14:17:07,429][26022] Updated weights on worker 0-0, policy_version 284764 (0.00090) [2022-07-09 14:17:09,472][26022] Updated weights on worker 0-0, policy_version 284774 (0.00092) [2022-07-09 14:17:11,017][26022] Updated weights on worker 0-0, policy_version 284784 (0.00094) [2022-07-09 14:17:12,447][25689] Fps is (10 sec: 5565.1, 60 sec: 5620.9, 300 sec: 5629.8). Total num frames: 291624960. Throughput: 0: 4949.3. Samples: 291619020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 14:17:12,447][25689] Avg episode reward: [(0, '-49.445')] [2022-07-09 14:17:13,239][26022] Updated weights on worker 0-0, policy_version 284794 (0.00089) [2022-07-09 14:17:14,651][26022] Updated weights on worker 0-0, policy_version 284804 (0.00100) [2022-07-09 14:17:16,732][26022] Updated weights on worker 0-0, policy_version 284814 (0.00090) [2022-07-09 14:17:17,459][25689] Fps is (10 sec: 5466.5, 60 sec: 5606.4, 300 sec: 5623.3). Total num frames: 291652608. Throughput: 0: 5800.0. Samples: 291652854. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:17,459][25689] Avg episode reward: [(0, '-49.611')] [2022-07-09 14:17:18,225][26022] Updated weights on worker 0-0, policy_version 284824 (0.00087) [2022-07-09 14:17:20,562][26022] Updated weights on worker 0-0, policy_version 284834 (0.00055) [2022-07-09 14:17:22,026][26022] Updated weights on worker 0-0, policy_version 284844 (0.00088) [2022-07-09 14:17:22,464][25689] Fps is (10 sec: 5724.8, 60 sec: 5643.7, 300 sec: 5626.7). Total num frames: 291682304. Throughput: 0: 5784.7. Samples: 291686718. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:22,464][25689] Avg episode reward: [(0, '-50.054')] [2022-07-09 14:17:23,825][26022] Updated weights on worker 0-0, policy_version 284854 (0.00090) [2022-07-09 14:17:25,688][26022] Updated weights on worker 0-0, policy_version 284864 (0.00093) [2022-07-09 14:17:27,515][25689] Fps is (10 sec: 5702.5, 60 sec: 5606.2, 300 sec: 5627.5). Total num frames: 291709952. Throughput: 0: 5050.7. Samples: 291703770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:27,515][25689] Avg episode reward: [(0, '-49.836')] [2022-07-09 14:17:27,755][26022] Updated weights on worker 0-0, policy_version 284874 (0.00079) [2022-07-09 14:17:29,319][26022] Updated weights on worker 0-0, policy_version 284884 (0.00081) [2022-07-09 14:17:31,368][26022] Updated weights on worker 0-0, policy_version 284894 (0.00083) [2022-07-09 14:17:32,604][25689] Fps is (10 sec: 5654.9, 60 sec: 5638.5, 300 sec: 5626.4). Total num frames: 291739648. Throughput: 0: 5876.7. Samples: 291737478. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:32,605][25689] Avg episode reward: [(0, '-49.489')] [2022-07-09 14:17:32,876][26022] Updated weights on worker 0-0, policy_version 284904 (0.00087) [2022-07-09 14:17:34,916][26022] Updated weights on worker 0-0, policy_version 284914 (0.00083) [2022-07-09 14:17:36,572][26022] Updated weights on worker 0-0, policy_version 284924 (0.00093) [2022-07-09 14:17:37,654][25689] Fps is (10 sec: 5655.8, 60 sec: 5618.3, 300 sec: 5626.0). Total num frames: 291767296. Throughput: 0: 5873.2. Samples: 291771462. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:37,654][25689] Avg episode reward: [(0, '-48.541')] [2022-07-09 14:17:38,376][26022] Updated weights on worker 0-0, policy_version 284934 (0.00105) [2022-07-09 14:17:40,315][26022] Updated weights on worker 0-0, policy_version 284944 (0.00093) [2022-07-09 14:17:42,087][26022] Updated weights on worker 0-0, policy_version 284954 (0.00088) [2022-07-09 14:17:42,666][25689] Fps is (10 sec: 5597.3, 60 sec: 5636.7, 300 sec: 5629.6). Total num frames: 291795968. Throughput: 0: 5874.8. Samples: 291805404. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:42,667][25689] Avg episode reward: [(0, '-48.430')] [2022-07-09 14:17:43,727][26022] Updated weights on worker 0-0, policy_version 284964 (0.00087) [2022-07-09 14:17:45,683][26022] Updated weights on worker 0-0, policy_version 284974 (0.00085) [2022-07-09 14:17:47,382][26022] Updated weights on worker 0-0, policy_version 284984 (0.00094) [2022-07-09 14:17:47,698][25689] Fps is (10 sec: 5709.0, 60 sec: 5600.6, 300 sec: 5623.3). Total num frames: 291824640. Throughput: 0: 5873.4. Samples: 291822316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:47,699][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 14:17:49,482][26022] Updated weights on worker 0-0, policy_version 284994 (0.00090) [2022-07-09 14:17:51,140][26022] Updated weights on worker 0-0, policy_version 285004 (0.00082) [2022-07-09 14:17:52,752][25689] Fps is (10 sec: 5786.9, 60 sec: 5635.3, 300 sec: 5636.5). Total num frames: 291854336. Throughput: 0: 5896.1. Samples: 291856274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:52,753][25689] Avg episode reward: [(0, '-48.482')] [2022-07-09 14:17:52,758][26022] Updated weights on worker 0-0, policy_version 285014 (0.00084) [2022-07-09 14:17:54,873][26022] Updated weights on worker 0-0, policy_version 285024 (0.00087) [2022-07-09 14:17:56,513][26022] Updated weights on worker 0-0, policy_version 285034 (0.00084) [2022-07-09 14:17:57,811][25689] Fps is (10 sec: 5569.2, 60 sec: 5618.5, 300 sec: 5625.8). Total num frames: 291880960. Throughput: 0: 5903.8. Samples: 291890466. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:17:57,811][25689] Avg episode reward: [(0, '-48.322')] [2022-07-09 14:17:58,319][26022] Updated weights on worker 0-0, policy_version 285044 (0.00085) [2022-07-09 14:18:00,171][26022] Updated weights on worker 0-0, policy_version 285054 (0.00083) [2022-07-09 14:18:02,195][26022] Updated weights on worker 0-0, policy_version 285064 (0.00085) [2022-07-09 14:18:02,825][25689] Fps is (10 sec: 5388.0, 60 sec: 5619.4, 300 sec: 5626.4). Total num frames: 291908608. Throughput: 0: 5069.3. Samples: 291907596. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:02,826][25689] Avg episode reward: [(0, '-48.776')] [2022-07-09 14:18:04,287][26022] Updated weights on worker 0-0, policy_version 285074 (0.00079) [2022-07-09 14:18:05,946][26022] Updated weights on worker 0-0, policy_version 285084 (0.00088) [2022-07-09 14:18:07,755][26022] Updated weights on worker 0-0, policy_version 285094 (0.00093) [2022-07-09 14:18:07,857][25689] Fps is (10 sec: 5504.2, 60 sec: 5601.1, 300 sec: 5633.3). Total num frames: 291936256. Throughput: 0: 5795.4. Samples: 291939144. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:07,858][25689] Avg episode reward: [(0, '-48.316')] [2022-07-09 14:18:09,725][26022] Updated weights on worker 0-0, policy_version 285104 (0.00087) [2022-07-09 14:18:11,302][26022] Updated weights on worker 0-0, policy_version 285114 (0.00094) [2022-07-09 14:18:12,917][25689] Fps is (10 sec: 5378.0, 60 sec: 5588.2, 300 sec: 5615.4). Total num frames: 291962880. Throughput: 0: 5796.9. Samples: 291973164. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:12,917][25689] Avg episode reward: [(0, '-47.883')] [2022-07-09 14:18:13,269][26022] Updated weights on worker 0-0, policy_version 285124 (0.00086) [2022-07-09 14:18:14,789][26022] Updated weights on worker 0-0, policy_version 285134 (0.00090) [2022-07-09 14:18:16,850][26022] Updated weights on worker 0-0, policy_version 285144 (0.00091) [2022-07-09 14:18:17,962][25689] Fps is (10 sec: 5675.1, 60 sec: 5636.0, 300 sec: 5625.0). Total num frames: 291993600. Throughput: 0: 4950.7. Samples: 291990226. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:17,962][25689] Avg episode reward: [(0, '-46.521')] [2022-07-09 14:18:18,362][26022] Updated weights on worker 0-0, policy_version 285154 (0.00090) [2022-07-09 14:18:20,531][26022] Updated weights on worker 0-0, policy_version 285164 (0.00089) [2022-07-09 14:18:21,952][26022] Updated weights on worker 0-0, policy_version 285174 (0.00089) [2022-07-09 14:18:22,990][25689] Fps is (10 sec: 5794.5, 60 sec: 5599.9, 300 sec: 5621.4). Total num frames: 292021248. Throughput: 0: 5786.4. Samples: 292024276. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:22,991][25689] Avg episode reward: [(0, '-46.833')] [2022-07-09 14:18:24,178][26022] Updated weights on worker 0-0, policy_version 285184 (0.00355) [2022-07-09 14:18:25,626][26022] Updated weights on worker 0-0, policy_version 285194 (0.00091) [2022-07-09 14:18:27,762][26022] Updated weights on worker 0-0, policy_version 285204 (0.00085) [2022-07-09 14:18:28,005][25689] Fps is (10 sec: 5709.6, 60 sec: 5637.1, 300 sec: 5628.9). Total num frames: 292050944. Throughput: 0: 5897.8. Samples: 292057972. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:28,006][25689] Avg episode reward: [(0, '-46.697')] [2022-07-09 14:18:29,471][26022] Updated weights on worker 0-0, policy_version 285214 (0.00090) [2022-07-09 14:18:31,391][26022] Updated weights on worker 0-0, policy_version 285224 (0.00080) [2022-07-09 14:18:33,047][26022] Updated weights on worker 0-0, policy_version 285234 (0.00098) [2022-07-09 14:18:33,123][25689] Fps is (10 sec: 5760.1, 60 sec: 5617.6, 300 sec: 5626.7). Total num frames: 292079616. Throughput: 0: 5040.4. Samples: 292075012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:33,123][25689] Avg episode reward: [(0, '-46.821')] [2022-07-09 14:18:35,079][26022] Updated weights on worker 0-0, policy_version 285244 (0.00347) [2022-07-09 14:18:36,641][26022] Updated weights on worker 0-0, policy_version 285254 (0.00059) [2022-07-09 14:18:38,128][25689] Fps is (10 sec: 5563.9, 60 sec: 5621.7, 300 sec: 5626.9). Total num frames: 292107264. Throughput: 0: 5895.4. Samples: 292109112. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:38,128][25689] Avg episode reward: [(0, '-47.072')] [2022-07-09 14:18:38,668][26022] Updated weights on worker 0-0, policy_version 285264 (0.00089) [2022-07-09 14:18:40,316][26022] Updated weights on worker 0-0, policy_version 285274 (0.00093) [2022-07-09 14:18:42,325][26022] Updated weights on worker 0-0, policy_version 285284 (0.00092) [2022-07-09 14:18:43,131][25689] Fps is (10 sec: 5525.3, 60 sec: 5605.7, 300 sec: 5620.4). Total num frames: 292134912. Throughput: 0: 5890.0. Samples: 292142906. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:43,131][25689] Avg episode reward: [(0, '-47.775')] [2022-07-09 14:18:43,955][26022] Updated weights on worker 0-0, policy_version 285294 (0.00088) [2022-07-09 14:18:45,955][26022] Updated weights on worker 0-0, policy_version 285304 (0.00084) [2022-07-09 14:18:47,595][26022] Updated weights on worker 0-0, policy_version 285314 (0.00084) [2022-07-09 14:18:48,138][25689] Fps is (10 sec: 5728.5, 60 sec: 5624.9, 300 sec: 5628.1). Total num frames: 292164608. Throughput: 0: 5065.2. Samples: 292159948. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:48,138][25689] Avg episode reward: [(0, '-47.814')] [2022-07-09 14:18:49,629][26022] Updated weights on worker 0-0, policy_version 285324 (0.00087) [2022-07-09 14:18:51,116][26022] Updated weights on worker 0-0, policy_version 285334 (0.00094) [2022-07-09 14:18:53,275][25689] Fps is (10 sec: 5552.1, 60 sec: 5566.5, 300 sec: 5615.8). Total num frames: 292191232. Throughput: 0: 5882.2. Samples: 292193548. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:53,275][25689] Avg episode reward: [(0, '-48.380')] [2022-07-09 14:18:53,366][26022] Updated weights on worker 0-0, policy_version 285344 (0.00577) [2022-07-09 14:18:54,939][26022] Updated weights on worker 0-0, policy_version 285354 (0.00091) [2022-07-09 14:18:56,726][26022] Updated weights on worker 0-0, policy_version 285364 (0.00094) [2022-07-09 14:18:58,283][25689] Fps is (10 sec: 5753.4, 60 sec: 5655.8, 300 sec: 5630.2). Total num frames: 292222976. Throughput: 0: 5877.8. Samples: 292227582. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:18:58,283][25689] Avg episode reward: [(0, '-48.502')] [2022-07-09 14:18:58,287][26022] Updated weights on worker 0-0, policy_version 285374 (0.00084) [2022-07-09 14:19:00,432][26022] Updated weights on worker 0-0, policy_version 285384 (0.00085) [2022-07-09 14:19:02,537][26022] Updated weights on worker 0-0, policy_version 285394 (0.00087) [2022-07-09 14:19:03,319][25689] Fps is (10 sec: 5505.2, 60 sec: 5586.0, 300 sec: 5612.8). Total num frames: 292246528. Throughput: 0: 5029.6. Samples: 292244444. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:03,319][25689] Avg episode reward: [(0, '-48.661')] [2022-07-09 14:19:03,777][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:19:03,794][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000285400_292249600.pth [2022-07-09 14:19:03,795][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000283421_290223104.pth [2022-07-09 14:19:04,488][26022] Updated weights on worker 0-0, policy_version 285404 (0.00089) [2022-07-09 14:19:06,047][26022] Updated weights on worker 0-0, policy_version 285414 (0.00095) [2022-07-09 14:19:08,180][26022] Updated weights on worker 0-0, policy_version 285424 (0.00090) [2022-07-09 14:19:08,343][25689] Fps is (10 sec: 5191.0, 60 sec: 5603.6, 300 sec: 5620.3). Total num frames: 292275200. Throughput: 0: 5751.5. Samples: 292276160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:08,344][25689] Avg episode reward: [(0, '-49.197')] [2022-07-09 14:19:09,896][26022] Updated weights on worker 0-0, policy_version 285434 (0.00091) [2022-07-09 14:19:11,644][26022] Updated weights on worker 0-0, policy_version 285444 (0.00091) [2022-07-09 14:19:13,430][25689] Fps is (10 sec: 5772.8, 60 sec: 5652.0, 300 sec: 5622.2). Total num frames: 292304896. Throughput: 0: 5791.4. Samples: 292310274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:13,431][26022] Updated weights on worker 0-0, policy_version 285454 (0.00083) [2022-07-09 14:19:13,434][25689] Avg episode reward: [(0, '-49.770')] [2022-07-09 14:19:15,235][26022] Updated weights on worker 0-0, policy_version 285464 (0.00079) [2022-07-09 14:19:17,164][26022] Updated weights on worker 0-0, policy_version 285474 (0.00091) [2022-07-09 14:19:18,526][25689] Fps is (10 sec: 5631.3, 60 sec: 5596.4, 300 sec: 5613.6). Total num frames: 292332544. Throughput: 0: 4937.3. Samples: 292327530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:18,527][25689] Avg episode reward: [(0, '-49.762')] [2022-07-09 14:19:18,940][26022] Updated weights on worker 0-0, policy_version 285484 (0.00098) [2022-07-09 14:19:20,660][26022] Updated weights on worker 0-0, policy_version 285494 (0.00083) [2022-07-09 14:19:22,591][26022] Updated weights on worker 0-0, policy_version 285504 (0.00092) [2022-07-09 14:19:23,559][25689] Fps is (10 sec: 5762.2, 60 sec: 5646.7, 300 sec: 5623.4). Total num frames: 292363264. Throughput: 0: 5796.2. Samples: 292361760. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:23,560][25689] Avg episode reward: [(0, '-49.448')] [2022-07-09 14:19:24,402][26022] Updated weights on worker 0-0, policy_version 285514 (0.00088) [2022-07-09 14:19:25,983][26022] Updated weights on worker 0-0, policy_version 285524 (0.00090) [2022-07-09 14:19:27,964][26022] Updated weights on worker 0-0, policy_version 285534 (0.00084) [2022-07-09 14:19:28,568][25689] Fps is (10 sec: 5710.6, 60 sec: 5596.5, 300 sec: 5621.0). Total num frames: 292389888. Throughput: 0: 5918.4. Samples: 292395858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:28,569][25689] Avg episode reward: [(0, '-49.772')] [2022-07-09 14:19:29,633][26022] Updated weights on worker 0-0, policy_version 285544 (0.00092) [2022-07-09 14:19:31,582][26022] Updated weights on worker 0-0, policy_version 285554 (0.00086) [2022-07-09 14:19:33,195][26022] Updated weights on worker 0-0, policy_version 285564 (0.00085) [2022-07-09 14:19:33,622][25689] Fps is (10 sec: 5495.1, 60 sec: 5602.5, 300 sec: 5616.7). Total num frames: 292418560. Throughput: 0: 5072.3. Samples: 292412696. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:33,624][25689] Avg episode reward: [(0, '-48.713')] [2022-07-09 14:19:35,240][26022] Updated weights on worker 0-0, policy_version 285574 (0.00085) [2022-07-09 14:19:37,054][26022] Updated weights on worker 0-0, policy_version 285584 (0.00086) [2022-07-09 14:19:38,664][25689] Fps is (10 sec: 5679.7, 60 sec: 5615.9, 300 sec: 5619.9). Total num frames: 292447232. Throughput: 0: 5905.5. Samples: 292446454. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-09 14:19:38,666][25689] Avg episode reward: [(0, '-48.817')] [2022-07-09 14:19:38,691][26022] Updated weights on worker 0-0, policy_version 285594 (0.00090) [2022-07-09 14:19:40,670][26022] Updated weights on worker 0-0, policy_version 285604 (0.00091) [2022-07-09 14:19:42,381][26022] Updated weights on worker 0-0, policy_version 285614 (0.00085) [2022-07-09 14:19:43,738][25689] Fps is (10 sec: 5567.4, 60 sec: 5609.4, 300 sec: 5618.6). Total num frames: 292474880. Throughput: 0: 5883.5. Samples: 292480482. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:19:43,739][25689] Avg episode reward: [(0, '-48.061')] [2022-07-09 14:19:44,230][26022] Updated weights on worker 0-0, policy_version 285624 (0.00504) [2022-07-09 14:19:46,275][26022] Updated weights on worker 0-0, policy_version 285634 (0.00090) [2022-07-09 14:19:47,865][26022] Updated weights on worker 0-0, policy_version 285644 (0.00086) [2022-07-09 14:19:48,790][25689] Fps is (10 sec: 5663.3, 60 sec: 5605.2, 300 sec: 5616.1). Total num frames: 292504576. Throughput: 0: 5019.2. Samples: 292497356. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:19:48,790][25689] Avg episode reward: [(0, '-47.882')] [2022-07-09 14:19:49,793][26022] Updated weights on worker 0-0, policy_version 285654 (0.00096) [2022-07-09 14:19:51,367][26022] Updated weights on worker 0-0, policy_version 285664 (0.00085) [2022-07-09 14:19:53,353][26022] Updated weights on worker 0-0, policy_version 285674 (0.00090) [2022-07-09 14:19:53,906][25689] Fps is (10 sec: 5740.2, 60 sec: 5640.9, 300 sec: 5619.1). Total num frames: 292533248. Throughput: 0: 5846.1. Samples: 292531280. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:19:53,907][25689] Avg episode reward: [(0, '-47.560')] [2022-07-09 14:19:55,066][26022] Updated weights on worker 0-0, policy_version 285684 (0.00093) [2022-07-09 14:19:56,998][26022] Updated weights on worker 0-0, policy_version 285694 (0.00095) [2022-07-09 14:19:58,727][26022] Updated weights on worker 0-0, policy_version 285704 (0.00078) [2022-07-09 14:19:58,980][25689] Fps is (10 sec: 5627.4, 60 sec: 5584.2, 300 sec: 5614.7). Total num frames: 292561920. Throughput: 0: 5852.1. Samples: 292565344. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:19:58,981][25689] Avg episode reward: [(0, '-47.932')] [2022-07-09 14:20:00,428][26022] Updated weights on worker 0-0, policy_version 285714 (0.00089) [2022-07-09 14:20:02,775][26022] Updated weights on worker 0-0, policy_version 285724 (0.00083) [2022-07-09 14:20:03,998][25689] Fps is (10 sec: 5479.1, 60 sec: 5636.4, 300 sec: 5621.3). Total num frames: 292588544. Throughput: 0: 5745.8. Samples: 292596894. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:03,999][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 14:20:04,823][26022] Updated weights on worker 0-0, policy_version 285734 (0.00110) [2022-07-09 14:20:06,469][26022] Updated weights on worker 0-0, policy_version 285744 (0.00087) [2022-07-09 14:20:08,375][26022] Updated weights on worker 0-0, policy_version 285754 (0.00095) [2022-07-09 14:20:09,048][25689] Fps is (10 sec: 5390.5, 60 sec: 5617.2, 300 sec: 5614.8). Total num frames: 292616192. Throughput: 0: 5743.8. Samples: 292613714. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:09,049][25689] Avg episode reward: [(0, '-48.825')] [2022-07-09 14:20:10,138][26022] Updated weights on worker 0-0, policy_version 285764 (0.00090) [2022-07-09 14:20:11,854][26022] Updated weights on worker 0-0, policy_version 285774 (0.00092) [2022-07-09 14:20:13,647][26022] Updated weights on worker 0-0, policy_version 285784 (0.00097) [2022-07-09 14:20:14,103][25689] Fps is (10 sec: 5573.7, 60 sec: 5603.2, 300 sec: 5613.8). Total num frames: 292644864. Throughput: 0: 5773.4. Samples: 292647884. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:14,104][25689] Avg episode reward: [(0, '-48.326')] [2022-07-09 14:20:15,485][26022] Updated weights on worker 0-0, policy_version 285794 (0.00094) [2022-07-09 14:20:17,339][26022] Updated weights on worker 0-0, policy_version 285804 (0.00094) [2022-07-09 14:20:19,134][25689] Fps is (10 sec: 5584.3, 60 sec: 5609.4, 300 sec: 5614.6). Total num frames: 292672512. Throughput: 0: 5774.1. Samples: 292681712. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:19,134][25689] Avg episode reward: [(0, '-47.840')] [2022-07-09 14:20:19,225][26022] Updated weights on worker 0-0, policy_version 285814 (0.00087) [2022-07-09 14:20:20,957][26022] Updated weights on worker 0-0, policy_version 285824 (0.00102) [2022-07-09 14:20:22,784][26022] Updated weights on worker 0-0, policy_version 285834 (0.00089) [2022-07-09 14:20:24,167][25689] Fps is (10 sec: 5596.5, 60 sec: 5575.6, 300 sec: 5610.6). Total num frames: 292701184. Throughput: 0: 5048.1. Samples: 292698704. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:24,167][25689] Avg episode reward: [(0, '-47.195')] [2022-07-09 14:20:24,544][26022] Updated weights on worker 0-0, policy_version 285844 (0.00088) [2022-07-09 14:20:26,078][26022] Updated weights on worker 0-0, policy_version 285854 (0.00087) [2022-07-09 14:20:28,349][26022] Updated weights on worker 0-0, policy_version 285864 (0.00088) [2022-07-09 14:20:29,173][25689] Fps is (10 sec: 5916.0, 60 sec: 5643.4, 300 sec: 5619.4). Total num frames: 292731904. Throughput: 0: 5913.4. Samples: 292732720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:29,173][25689] Avg episode reward: [(0, '-47.405')] [2022-07-09 14:20:29,991][26022] Updated weights on worker 0-0, policy_version 285874 (0.00085) [2022-07-09 14:20:31,879][26022] Updated weights on worker 0-0, policy_version 285884 (0.00088) [2022-07-09 14:20:33,443][26022] Updated weights on worker 0-0, policy_version 285894 (0.00090) [2022-07-09 14:20:34,279][25689] Fps is (10 sec: 5569.5, 60 sec: 5587.9, 300 sec: 5611.6). Total num frames: 292757504. Throughput: 0: 5878.9. Samples: 292766494. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:34,281][25689] Avg episode reward: [(0, '-46.614')] [2022-07-09 14:20:35,620][26022] Updated weights on worker 0-0, policy_version 285904 (0.00090) [2022-07-09 14:20:37,544][26022] Updated weights on worker 0-0, policy_version 285914 (0.00093) [2022-07-09 14:20:39,098][26022] Updated weights on worker 0-0, policy_version 285924 (0.00086) [2022-07-09 14:20:39,319][25689] Fps is (10 sec: 5450.0, 60 sec: 5605.0, 300 sec: 5615.3). Total num frames: 292787200. Throughput: 0: 5013.2. Samples: 292782904. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:39,320][25689] Avg episode reward: [(0, '-47.342')] [2022-07-09 14:20:41,003][26022] Updated weights on worker 0-0, policy_version 285934 (0.00090) [2022-07-09 14:20:42,634][26022] Updated weights on worker 0-0, policy_version 285944 (0.00086) [2022-07-09 14:20:44,351][25689] Fps is (10 sec: 5795.1, 60 sec: 5625.8, 300 sec: 5615.8). Total num frames: 292815872. Throughput: 0: 5840.1. Samples: 292816582. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:44,352][25689] Avg episode reward: [(0, '-47.327')] [2022-07-09 14:20:44,616][26022] Updated weights on worker 0-0, policy_version 285954 (0.00089) [2022-07-09 14:20:46,536][26022] Updated weights on worker 0-0, policy_version 285964 (0.00088) [2022-07-09 14:20:48,064][26022] Updated weights on worker 0-0, policy_version 285974 (0.00085) [2022-07-09 14:20:49,367][25689] Fps is (10 sec: 5503.2, 60 sec: 5578.4, 300 sec: 5609.5). Total num frames: 292842496. Throughput: 0: 5850.7. Samples: 292850868. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:49,369][25689] Avg episode reward: [(0, '-48.448')] [2022-07-09 14:20:50,075][26022] Updated weights on worker 0-0, policy_version 285984 (0.00084) [2022-07-09 14:20:51,714][26022] Updated weights on worker 0-0, policy_version 285994 (0.00100) [2022-07-09 14:20:53,716][26022] Updated weights on worker 0-0, policy_version 286004 (0.00093) [2022-07-09 14:20:54,491][25689] Fps is (10 sec: 5655.2, 60 sec: 5611.5, 300 sec: 5617.6). Total num frames: 292873216. Throughput: 0: 5029.1. Samples: 292868142. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:54,491][25689] Avg episode reward: [(0, '-48.972')] [2022-07-09 14:20:55,383][26022] Updated weights on worker 0-0, policy_version 286014 (0.00051) [2022-07-09 14:20:57,243][26022] Updated weights on worker 0-0, policy_version 286024 (0.00093) [2022-07-09 14:20:59,107][26022] Updated weights on worker 0-0, policy_version 286034 (0.00086) [2022-07-09 14:20:59,575][25689] Fps is (10 sec: 5717.7, 60 sec: 5593.6, 300 sec: 5616.3). Total num frames: 292900864. Throughput: 0: 5906.6. Samples: 292902550. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:20:59,576][25689] Avg episode reward: [(0, '-49.561')] [2022-07-09 14:21:00,790][26022] Updated weights on worker 0-0, policy_version 286044 (0.00095) [2022-07-09 14:21:03,042][26022] Updated weights on worker 0-0, policy_version 286054 (0.00091) [2022-07-09 14:21:03,892][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:21:03,910][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000286059_292924416.pth [2022-07-09 14:21:03,911][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000284082_290899968.pth [2022-07-09 14:21:04,603][25689] Fps is (10 sec: 5367.1, 60 sec: 5592.7, 300 sec: 5612.4). Total num frames: 292927488. Throughput: 0: 5817.4. Samples: 292934396. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:04,603][25689] Avg episode reward: [(0, '-48.755')] [2022-07-09 14:21:04,626][26022] Updated weights on worker 0-0, policy_version 286064 (0.00084) [2022-07-09 14:21:06,481][26022] Updated weights on worker 0-0, policy_version 286074 (0.00092) [2022-07-09 14:21:08,524][26022] Updated weights on worker 0-0, policy_version 286084 (0.00086) [2022-07-09 14:21:09,613][25689] Fps is (10 sec: 5610.4, 60 sec: 5630.2, 300 sec: 5618.0). Total num frames: 292957184. Throughput: 0: 4967.1. Samples: 292951436. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:09,614][25689] Avg episode reward: [(0, '-48.389')] [2022-07-09 14:21:10,178][26022] Updated weights on worker 0-0, policy_version 286094 (0.00091) [2022-07-09 14:21:12,214][26022] Updated weights on worker 0-0, policy_version 286104 (0.00083) [2022-07-09 14:21:13,928][26022] Updated weights on worker 0-0, policy_version 286114 (0.00086) [2022-07-09 14:21:14,673][25689] Fps is (10 sec: 5592.8, 60 sec: 5595.9, 300 sec: 5610.7). Total num frames: 292983808. Throughput: 0: 5784.0. Samples: 292984876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:14,673][25689] Avg episode reward: [(0, '-47.880')] [2022-07-09 14:21:15,760][26022] Updated weights on worker 0-0, policy_version 286124 (0.00093) [2022-07-09 14:21:17,444][26022] Updated weights on worker 0-0, policy_version 286134 (0.00089) [2022-07-09 14:21:19,444][26022] Updated weights on worker 0-0, policy_version 286144 (0.00086) [2022-07-09 14:21:19,740][25689] Fps is (10 sec: 5460.3, 60 sec: 5609.4, 300 sec: 5613.7). Total num frames: 293012480. Throughput: 0: 5776.5. Samples: 293019034. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:19,741][25689] Avg episode reward: [(0, '-47.915')] [2022-07-09 14:21:20,998][26022] Updated weights on worker 0-0, policy_version 286154 (0.00092) [2022-07-09 14:21:22,808][26022] Updated weights on worker 0-0, policy_version 286164 (0.00085) [2022-07-09 14:21:24,775][25689] Fps is (10 sec: 5676.8, 60 sec: 5609.3, 300 sec: 5609.8). Total num frames: 293041152. Throughput: 0: 5049.0. Samples: 293036242. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:24,775][25689] Avg episode reward: [(0, '-48.054')] [2022-07-09 14:21:24,776][26022] Updated weights on worker 0-0, policy_version 286174 (0.00095) [2022-07-09 14:21:26,347][26022] Updated weights on worker 0-0, policy_version 286184 (0.00088) [2022-07-09 14:21:28,399][26022] Updated weights on worker 0-0, policy_version 286194 (0.00085) [2022-07-09 14:21:29,790][25689] Fps is (10 sec: 5807.8, 60 sec: 5591.5, 300 sec: 5617.8). Total num frames: 293070848. Throughput: 0: 5894.2. Samples: 293070362. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:29,792][25689] Avg episode reward: [(0, '-48.552')] [2022-07-09 14:21:30,012][26022] Updated weights on worker 0-0, policy_version 286204 (0.00086) [2022-07-09 14:21:31,930][26022] Updated weights on worker 0-0, policy_version 286214 (0.00093) [2022-07-09 14:21:33,779][26022] Updated weights on worker 0-0, policy_version 286224 (0.00077) [2022-07-09 14:21:34,891][25689] Fps is (10 sec: 5870.8, 60 sec: 5659.6, 300 sec: 5619.6). Total num frames: 293100544. Throughput: 0: 5916.6. Samples: 293104498. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:34,893][25689] Avg episode reward: [(0, '-49.459')] [2022-07-09 14:21:35,551][26022] Updated weights on worker 0-0, policy_version 286234 (0.00085) [2022-07-09 14:21:37,255][26022] Updated weights on worker 0-0, policy_version 286244 (0.00085) [2022-07-09 14:21:39,253][26022] Updated weights on worker 0-0, policy_version 286254 (0.00096) [2022-07-09 14:21:39,969][25689] Fps is (10 sec: 5633.7, 60 sec: 5622.3, 300 sec: 5618.6). Total num frames: 293128192. Throughput: 0: 5073.8. Samples: 293121670. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:39,970][25689] Avg episode reward: [(0, '-49.540')] [2022-07-09 14:21:40,922][26022] Updated weights on worker 0-0, policy_version 286264 (0.00085) [2022-07-09 14:21:42,832][26022] Updated weights on worker 0-0, policy_version 286274 (0.00093) [2022-07-09 14:21:44,560][26022] Updated weights on worker 0-0, policy_version 286284 (0.00089) [2022-07-09 14:21:45,026][25689] Fps is (10 sec: 5557.1, 60 sec: 5620.0, 300 sec: 5610.8). Total num frames: 293156864. Throughput: 0: 5890.6. Samples: 293155534. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:45,026][25689] Avg episode reward: [(0, '-50.140')] [2022-07-09 14:21:46,466][26022] Updated weights on worker 0-0, policy_version 286294 (0.00087) [2022-07-09 14:21:48,130][26022] Updated weights on worker 0-0, policy_version 286304 (0.00087) [2022-07-09 14:21:50,037][25689] Fps is (10 sec: 5695.9, 60 sec: 5654.2, 300 sec: 5615.3). Total num frames: 293185536. Throughput: 0: 5888.0. Samples: 293189572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:50,037][25689] Avg episode reward: [(0, '-50.449')] [2022-07-09 14:21:50,043][26022] Updated weights on worker 0-0, policy_version 286314 (0.00086) [2022-07-09 14:21:51,883][26022] Updated weights on worker 0-0, policy_version 286324 (0.00097) [2022-07-09 14:21:53,765][26022] Updated weights on worker 0-0, policy_version 286334 (0.00097) [2022-07-09 14:21:55,128][25689] Fps is (10 sec: 5676.3, 60 sec: 5623.5, 300 sec: 5618.1). Total num frames: 293214208. Throughput: 0: 5877.7. Samples: 293223446. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:21:55,129][25689] Avg episode reward: [(0, '-50.744')] [2022-07-09 14:21:55,490][26022] Updated weights on worker 0-0, policy_version 286344 (0.00084) [2022-07-09 14:21:57,199][26022] Updated weights on worker 0-0, policy_version 286354 (0.00090) [2022-07-09 14:21:59,134][26022] Updated weights on worker 0-0, policy_version 286364 (0.00092) [2022-07-09 14:22:00,135][25689] Fps is (10 sec: 5678.7, 60 sec: 5647.6, 300 sec: 5621.9). Total num frames: 293242880. Throughput: 0: 5892.9. Samples: 293240504. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:22:00,136][25689] Avg episode reward: [(0, '-49.621')] [2022-07-09 14:22:00,828][26022] Updated weights on worker 0-0, policy_version 286374 (0.00086) [2022-07-09 14:22:03,165][26022] Updated weights on worker 0-0, policy_version 286384 (0.00090) [2022-07-09 14:22:05,003][26022] Updated weights on worker 0-0, policy_version 286394 (0.00083) [2022-07-09 14:22:05,210][25689] Fps is (10 sec: 5281.5, 60 sec: 5609.4, 300 sec: 5607.0). Total num frames: 293267456. Throughput: 0: 5786.0. Samples: 293272320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 14:22:05,211][25689] Avg episode reward: [(0, '-49.375')] [2022-07-09 14:22:06,615][26022] Updated weights on worker 0-0, policy_version 286404 (0.00091) [2022-07-09 14:22:08,804][26022] Updated weights on worker 0-0, policy_version 286414 (0.00097) [2022-07-09 14:22:10,198][26022] Updated weights on worker 0-0, policy_version 286424 (0.00872) [2022-07-09 14:22:10,242][25689] Fps is (10 sec: 5470.6, 60 sec: 5624.2, 300 sec: 5618.7). Total num frames: 293298176. Throughput: 0: 5767.7. Samples: 293306114. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:10,243][25689] Avg episode reward: [(0, '-48.870')] [2022-07-09 14:22:12,458][26022] Updated weights on worker 0-0, policy_version 286434 (0.00087) [2022-07-09 14:22:13,715][26022] Updated weights on worker 0-0, policy_version 286444 (0.00093) [2022-07-09 14:22:15,302][25689] Fps is (10 sec: 5682.4, 60 sec: 5624.3, 300 sec: 5614.3). Total num frames: 293324800. Throughput: 0: 4948.3. Samples: 293323270. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:15,302][25689] Avg episode reward: [(0, '-48.764')] [2022-07-09 14:22:15,935][26022] Updated weights on worker 0-0, policy_version 286454 (0.00098) [2022-07-09 14:22:17,557][26022] Updated weights on worker 0-0, policy_version 286464 (0.00098) [2022-07-09 14:22:19,338][26022] Updated weights on worker 0-0, policy_version 286474 (0.00084) [2022-07-09 14:22:20,383][25689] Fps is (10 sec: 5553.7, 60 sec: 5639.8, 300 sec: 5612.9). Total num frames: 293354496. Throughput: 0: 5767.6. Samples: 293357290. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:20,384][25689] Avg episode reward: [(0, '-48.539')] [2022-07-09 14:22:21,282][26022] Updated weights on worker 0-0, policy_version 286484 (0.00083) [2022-07-09 14:22:22,974][26022] Updated weights on worker 0-0, policy_version 286494 (0.00091) [2022-07-09 14:22:24,797][26022] Updated weights on worker 0-0, policy_version 286504 (0.00093) [2022-07-09 14:22:25,430][25689] Fps is (10 sec: 5762.7, 60 sec: 5638.6, 300 sec: 5616.4). Total num frames: 293383168. Throughput: 0: 5895.4. Samples: 293391526. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:25,431][25689] Avg episode reward: [(0, '-48.004')] [2022-07-09 14:22:26,927][26022] Updated weights on worker 0-0, policy_version 286514 (0.00087) [2022-07-09 14:22:28,300][26022] Updated weights on worker 0-0, policy_version 286524 (0.00090) [2022-07-09 14:22:30,460][25689] Fps is (10 sec: 5589.5, 60 sec: 5603.6, 300 sec: 5610.7). Total num frames: 293410816. Throughput: 0: 5063.3. Samples: 293408480. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:30,460][25689] Avg episode reward: [(0, '-48.579')] [2022-07-09 14:22:30,480][26022] Updated weights on worker 0-0, policy_version 286534 (0.00096) [2022-07-09 14:22:32,026][26022] Updated weights on worker 0-0, policy_version 286544 (0.00096) [2022-07-09 14:22:33,994][26022] Updated weights on worker 0-0, policy_version 286554 (0.00088) [2022-07-09 14:22:35,534][25689] Fps is (10 sec: 5574.3, 60 sec: 5589.2, 300 sec: 5613.6). Total num frames: 293439488. Throughput: 0: 5882.0. Samples: 293442276. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:35,534][25689] Avg episode reward: [(0, '-49.283')] [2022-07-09 14:22:35,783][26022] Updated weights on worker 0-0, policy_version 286564 (0.00089) [2022-07-09 14:22:37,543][26022] Updated weights on worker 0-0, policy_version 286574 (0.00090) [2022-07-09 14:22:39,189][26022] Updated weights on worker 0-0, policy_version 286584 (0.00087) [2022-07-09 14:22:40,600][25689] Fps is (10 sec: 5654.8, 60 sec: 5607.1, 300 sec: 5612.6). Total num frames: 293468160. Throughput: 0: 5891.8. Samples: 293476404. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:40,601][25689] Avg episode reward: [(0, '-49.037')] [2022-07-09 14:22:41,080][26022] Updated weights on worker 0-0, policy_version 286594 (0.00090) [2022-07-09 14:22:42,939][26022] Updated weights on worker 0-0, policy_version 286604 (0.00087) [2022-07-09 14:22:44,820][26022] Updated weights on worker 0-0, policy_version 286614 (0.00093) [2022-07-09 14:22:45,602][25689] Fps is (10 sec: 5695.7, 60 sec: 5612.3, 300 sec: 5613.2). Total num frames: 293496832. Throughput: 0: 5054.2. Samples: 293493476. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:45,602][25689] Avg episode reward: [(0, '-49.352')] [2022-07-09 14:22:46,591][26022] Updated weights on worker 0-0, policy_version 286624 (0.00092) [2022-07-09 14:22:48,198][26022] Updated weights on worker 0-0, policy_version 286634 (0.00098) [2022-07-09 14:22:50,275][26022] Updated weights on worker 0-0, policy_version 286644 (0.00085) [2022-07-09 14:22:50,635][25689] Fps is (10 sec: 5714.7, 60 sec: 5610.2, 300 sec: 5610.2). Total num frames: 293525504. Throughput: 0: 5894.0. Samples: 293527392. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:50,635][25689] Avg episode reward: [(0, '-49.038')] [2022-07-09 14:22:52,031][26022] Updated weights on worker 0-0, policy_version 286654 (0.00094) [2022-07-09 14:22:53,771][26022] Updated weights on worker 0-0, policy_version 286664 (0.00093) [2022-07-09 14:22:55,697][25689] Fps is (10 sec: 5680.1, 60 sec: 5612.9, 300 sec: 5617.0). Total num frames: 293554176. Throughput: 0: 5908.9. Samples: 293561420. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:22:55,698][25689] Avg episode reward: [(0, '-48.672')] [2022-07-09 14:22:55,702][26022] Updated weights on worker 0-0, policy_version 286674 (0.00083) [2022-07-09 14:22:57,477][26022] Updated weights on worker 0-0, policy_version 286684 (0.00078) [2022-07-09 14:22:59,307][26022] Updated weights on worker 0-0, policy_version 286694 (0.00080) [2022-07-09 14:23:00,716][25689] Fps is (10 sec: 5688.2, 60 sec: 5611.8, 300 sec: 5620.3). Total num frames: 293582848. Throughput: 0: 5074.7. Samples: 293578484. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:00,717][25689] Avg episode reward: [(0, '-48.406')] [2022-07-09 14:23:01,075][26022] Updated weights on worker 0-0, policy_version 286704 (0.00087) [2022-07-09 14:23:03,182][26022] Updated weights on worker 0-0, policy_version 286714 (0.00087) [2022-07-09 14:23:04,202][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:23:04,213][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000286719_293600256.pth [2022-07-09 14:23:04,213][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000284742_291575808.pth [2022-07-09 14:23:05,087][26022] Updated weights on worker 0-0, policy_version 286724 (0.00094) [2022-07-09 14:23:05,719][25689] Fps is (10 sec: 5517.5, 60 sec: 5652.3, 300 sec: 5617.4). Total num frames: 293609472. Throughput: 0: 5812.5. Samples: 293610408. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:05,720][25689] Avg episode reward: [(0, '-48.118')] [2022-07-09 14:23:06,809][26022] Updated weights on worker 0-0, policy_version 286734 (0.00083) [2022-07-09 14:23:08,767][26022] Updated weights on worker 0-0, policy_version 286744 (0.00090) [2022-07-09 14:23:10,614][26022] Updated weights on worker 0-0, policy_version 286754 (0.00096) [2022-07-09 14:23:10,758][25689] Fps is (10 sec: 5404.4, 60 sec: 5600.9, 300 sec: 5621.2). Total num frames: 293637120. Throughput: 0: 5805.2. Samples: 293644212. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:10,759][25689] Avg episode reward: [(0, '-48.514')] [2022-07-09 14:23:12,201][26022] Updated weights on worker 0-0, policy_version 286764 (0.00083) [2022-07-09 14:23:14,410][26022] Updated weights on worker 0-0, policy_version 286774 (0.00088) [2022-07-09 14:23:15,845][25689] Fps is (10 sec: 5562.3, 60 sec: 5632.2, 300 sec: 5613.6). Total num frames: 293665792. Throughput: 0: 4933.7. Samples: 293660820. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:15,845][25689] Avg episode reward: [(0, '-48.303')] [2022-07-09 14:23:16,007][26022] Updated weights on worker 0-0, policy_version 286784 (0.00086) [2022-07-09 14:23:17,775][26022] Updated weights on worker 0-0, policy_version 286794 (0.00093) [2022-07-09 14:23:19,743][26022] Updated weights on worker 0-0, policy_version 286804 (0.00088) [2022-07-09 14:23:20,920][25689] Fps is (10 sec: 5542.3, 60 sec: 5599.0, 300 sec: 5612.7). Total num frames: 293693440. Throughput: 0: 5748.8. Samples: 293694634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:20,921][25689] Avg episode reward: [(0, '-48.907')] [2022-07-09 14:23:21,351][26022] Updated weights on worker 0-0, policy_version 286814 (0.00095) [2022-07-09 14:23:23,315][26022] Updated weights on worker 0-0, policy_version 286824 (0.00083) [2022-07-09 14:23:24,978][26022] Updated weights on worker 0-0, policy_version 286834 (0.00084) [2022-07-09 14:23:25,924][25689] Fps is (10 sec: 5587.7, 60 sec: 5603.0, 300 sec: 5609.5). Total num frames: 293722112. Throughput: 0: 5865.7. Samples: 293728922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:25,924][25689] Avg episode reward: [(0, '-49.051')] [2022-07-09 14:23:26,866][26022] Updated weights on worker 0-0, policy_version 286844 (0.00077) [2022-07-09 14:23:28,693][26022] Updated weights on worker 0-0, policy_version 286854 (0.00089) [2022-07-09 14:23:30,607][26022] Updated weights on worker 0-0, policy_version 286864 (0.00084) [2022-07-09 14:23:30,976][25689] Fps is (10 sec: 5702.2, 60 sec: 5617.7, 300 sec: 5610.7). Total num frames: 293750784. Throughput: 0: 5028.1. Samples: 293745872. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:30,977][25689] Avg episode reward: [(0, '-49.076')] [2022-07-09 14:23:32,155][26022] Updated weights on worker 0-0, policy_version 286874 (0.00081) [2022-07-09 14:23:34,187][26022] Updated weights on worker 0-0, policy_version 286884 (0.00092) [2022-07-09 14:23:35,742][26022] Updated weights on worker 0-0, policy_version 286894 (0.00094) [2022-07-09 14:23:36,118][25689] Fps is (10 sec: 5926.7, 60 sec: 5662.2, 300 sec: 5621.9). Total num frames: 293782528. Throughput: 0: 5876.7. Samples: 293779958. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:36,118][25689] Avg episode reward: [(0, '-49.369')] [2022-07-09 14:23:37,824][26022] Updated weights on worker 0-0, policy_version 286904 (0.00083) [2022-07-09 14:23:39,053][26022] Updated weights on worker 0-0, policy_version 286914 (0.00094) [2022-07-09 14:23:41,167][25689] Fps is (10 sec: 5526.7, 60 sec: 5596.2, 300 sec: 5610.7). Total num frames: 293807104. Throughput: 0: 5903.3. Samples: 293814158. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:41,168][25689] Avg episode reward: [(0, '-48.821')] [2022-07-09 14:23:41,421][26022] Updated weights on worker 0-0, policy_version 286924 (0.00094) [2022-07-09 14:23:42,979][26022] Updated weights on worker 0-0, policy_version 286934 (0.00090) [2022-07-09 14:23:44,960][26022] Updated weights on worker 0-0, policy_version 286944 (0.00093) [2022-07-09 14:23:46,170][25689] Fps is (10 sec: 5500.7, 60 sec: 5629.9, 300 sec: 5614.2). Total num frames: 293837824. Throughput: 0: 5048.7. Samples: 293831140. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:46,171][25689] Avg episode reward: [(0, '-48.318')] [2022-07-09 14:23:46,886][26022] Updated weights on worker 0-0, policy_version 286954 (0.00096) [2022-07-09 14:23:48,573][26022] Updated weights on worker 0-0, policy_version 286964 (0.00089) [2022-07-09 14:23:50,266][26022] Updated weights on worker 0-0, policy_version 286974 (0.00091) [2022-07-09 14:23:51,183][25689] Fps is (10 sec: 5725.2, 60 sec: 5597.9, 300 sec: 5616.5). Total num frames: 293864448. Throughput: 0: 5902.8. Samples: 293865148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:51,184][25689] Avg episode reward: [(0, '-48.224')] [2022-07-09 14:23:52,380][26022] Updated weights on worker 0-0, policy_version 286984 (0.00095) [2022-07-09 14:23:53,931][26022] Updated weights on worker 0-0, policy_version 286994 (0.00085) [2022-07-09 14:23:55,883][26022] Updated weights on worker 0-0, policy_version 287004 (0.00094) [2022-07-09 14:23:56,297][25689] Fps is (10 sec: 5662.9, 60 sec: 5627.0, 300 sec: 5611.1). Total num frames: 293895168. Throughput: 0: 5891.7. Samples: 293898846. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:23:56,297][25689] Avg episode reward: [(0, '-47.468')] [2022-07-09 14:23:57,755][26022] Updated weights on worker 0-0, policy_version 287014 (0.00087) [2022-07-09 14:23:59,483][26022] Updated weights on worker 0-0, policy_version 287024 (0.00086) [2022-07-09 14:24:01,333][25689] Fps is (10 sec: 5750.7, 60 sec: 5608.5, 300 sec: 5624.9). Total num frames: 293922816. Throughput: 0: 5039.1. Samples: 293915776. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:24:01,333][25689] Avg episode reward: [(0, '-46.965')] [2022-07-09 14:24:01,334][26022] Updated weights on worker 0-0, policy_version 287034 (0.00090) [2022-07-09 14:24:03,523][26022] Updated weights on worker 0-0, policy_version 287044 (0.00089) [2022-07-09 14:24:05,359][26022] Updated weights on worker 0-0, policy_version 287054 (0.00087) [2022-07-09 14:24:06,400][25689] Fps is (10 sec: 5371.4, 60 sec: 5602.5, 300 sec: 5617.2). Total num frames: 293949440. Throughput: 0: 5735.5. Samples: 293947170. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:24:06,401][25689] Avg episode reward: [(0, '-48.183')] [2022-07-09 14:24:07,197][26022] Updated weights on worker 0-0, policy_version 287064 (0.00083) [2022-07-09 14:24:08,991][26022] Updated weights on worker 0-0, policy_version 287074 (0.00098) [2022-07-09 14:24:10,814][26022] Updated weights on worker 0-0, policy_version 287084 (0.00083) [2022-07-09 14:24:11,451][25689] Fps is (10 sec: 5262.8, 60 sec: 5584.6, 300 sec: 5607.6). Total num frames: 293976064. Throughput: 0: 5730.0. Samples: 293981280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:24:11,451][25689] Avg episode reward: [(0, '-48.182')] [2022-07-09 14:24:12,525][26022] Updated weights on worker 0-0, policy_version 287094 (0.00095) [2022-07-09 14:24:14,611][26022] Updated weights on worker 0-0, policy_version 287104 (0.00087) [2022-07-09 14:24:16,171][26022] Updated weights on worker 0-0, policy_version 287114 (0.00080) [2022-07-09 14:24:16,512][25689] Fps is (10 sec: 5569.9, 60 sec: 5603.8, 300 sec: 5615.1). Total num frames: 294005760. Throughput: 0: 5753.3. Samples: 294015150. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:24:16,512][25689] Avg episode reward: [(0, '-48.021')] [2022-07-09 14:24:18,187][26022] Updated weights on worker 0-0, policy_version 287124 (0.00085) [2022-07-09 14:24:19,811][26022] Updated weights on worker 0-0, policy_version 287134 (0.00085) [2022-07-09 14:24:21,519][25689] Fps is (10 sec: 5695.2, 60 sec: 5610.1, 300 sec: 5605.3). Total num frames: 294033408. Throughput: 0: 5773.5. Samples: 294032324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:24:21,520][25689] Avg episode reward: [(0, '-47.843')] [2022-07-09 14:24:21,612][26022] Updated weights on worker 0-0, policy_version 287144 (0.00085) [2022-07-09 14:24:23,465][26022] Updated weights on worker 0-0, policy_version 287154 (0.00086) [2022-07-09 14:24:25,139][26022] Updated weights on worker 0-0, policy_version 287164 (0.00089) [2022-07-09 14:24:26,538][25689] Fps is (10 sec: 5617.2, 60 sec: 5608.7, 300 sec: 5611.9). Total num frames: 294062080. Throughput: 0: 5919.6. Samples: 294066378. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:24:26,540][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 14:24:27,000][26022] Updated weights on worker 0-0, policy_version 287174 (0.00089) [2022-07-09 14:24:28,860][26022] Updated weights on worker 0-0, policy_version 287184 (0.00089) [2022-07-09 14:24:30,748][26022] Updated weights on worker 0-0, policy_version 287194 (0.00091) [2022-07-09 14:24:31,562][25689] Fps is (10 sec: 5608.0, 60 sec: 5594.5, 300 sec: 5609.1). Total num frames: 294089728. Throughput: 0: 5908.8. Samples: 294100116. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:24:31,563][25689] Avg episode reward: [(0, '-47.775')] [2022-07-09 14:24:32,543][26022] Updated weights on worker 0-0, policy_version 287204 (0.00096) [2022-07-09 14:24:34,155][26022] Updated weights on worker 0-0, policy_version 287214 (0.00054) [2022-07-09 14:24:35,921][26022] Updated weights on worker 0-0, policy_version 287224 (0.00081) [2022-07-09 14:24:36,675][25689] Fps is (10 sec: 5758.1, 60 sec: 5580.2, 300 sec: 5614.6). Total num frames: 294120448. Throughput: 0: 5066.7. Samples: 294117310. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:24:36,677][25689] Avg episode reward: [(0, '-46.743')] [2022-07-09 14:24:37,900][26022] Updated weights on worker 0-0, policy_version 287234 (0.00090) [2022-07-09 14:24:39,537][26022] Updated weights on worker 0-0, policy_version 287244 (0.00088) [2022-07-09 14:24:41,697][25689] Fps is (10 sec: 5759.0, 60 sec: 5633.4, 300 sec: 5615.6). Total num frames: 294148096. Throughput: 0: 5904.6. Samples: 294151466. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:24:41,698][25689] Avg episode reward: [(0, '-46.895')] [2022-07-09 14:24:41,700][26022] Updated weights on worker 0-0, policy_version 287254 (0.00249) [2022-07-09 14:24:43,300][26022] Updated weights on worker 0-0, policy_version 287264 (0.00087) [2022-07-09 14:24:45,163][26022] Updated weights on worker 0-0, policy_version 287274 (0.00092) [2022-07-09 14:24:46,717][25689] Fps is (10 sec: 5710.5, 60 sec: 5615.0, 300 sec: 5616.2). Total num frames: 294177792. Throughput: 0: 5909.9. Samples: 294185632. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:24:46,717][25689] Avg episode reward: [(0, '-47.803')] [2022-07-09 14:24:46,794][26022] Updated weights on worker 0-0, policy_version 287284 (0.00093) [2022-07-09 14:24:48,806][26022] Updated weights on worker 0-0, policy_version 287294 (0.00093) [2022-07-09 14:24:50,432][26022] Updated weights on worker 0-0, policy_version 287304 (0.00095) [2022-07-09 14:24:51,800][25689] Fps is (10 sec: 5676.0, 60 sec: 5625.3, 300 sec: 5613.4). Total num frames: 294205440. Throughput: 0: 5063.5. Samples: 294202592. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:24:51,801][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 14:24:52,321][26022] Updated weights on worker 0-0, policy_version 287314 (0.00093) [2022-07-09 14:24:54,090][26022] Updated weights on worker 0-0, policy_version 287324 (0.00097) [2022-07-09 14:24:56,010][26022] Updated weights on worker 0-0, policy_version 287334 (0.00474) [2022-07-09 14:24:56,906][25689] Fps is (10 sec: 5628.1, 60 sec: 5609.2, 300 sec: 5616.2). Total num frames: 294235136. Throughput: 0: 5901.8. Samples: 294236708. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:24:56,911][25689] Avg episode reward: [(0, '-47.448')] [2022-07-09 14:24:57,751][26022] Updated weights on worker 0-0, policy_version 287344 (0.00087) [2022-07-09 14:24:59,519][26022] Updated weights on worker 0-0, policy_version 287354 (0.00084) [2022-07-09 14:25:01,723][26022] Updated weights on worker 0-0, policy_version 287364 (0.00085) [2022-07-09 14:25:01,913][25689] Fps is (10 sec: 5468.1, 60 sec: 5578.1, 300 sec: 5613.0). Total num frames: 294260736. Throughput: 0: 5876.1. Samples: 294270254. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:01,913][25689] Avg episode reward: [(0, '-46.702')] [2022-07-09 14:25:03,503][26022] Updated weights on worker 0-0, policy_version 287374 (0.00085) [2022-07-09 14:25:04,505][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:25:04,521][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000287379_294276096.pth [2022-07-09 14:25:04,521][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000285400_292249600.pth [2022-07-09 14:25:05,423][26022] Updated weights on worker 0-0, policy_version 287384 (0.00085) [2022-07-09 14:25:06,936][25689] Fps is (10 sec: 5513.1, 60 sec: 5633.0, 300 sec: 5620.4). Total num frames: 294290432. Throughput: 0: 4965.4. Samples: 294286026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:06,936][25689] Avg episode reward: [(0, '-47.456')] [2022-07-09 14:25:07,034][26022] Updated weights on worker 0-0, policy_version 287394 (0.00087) [2022-07-09 14:25:08,988][26022] Updated weights on worker 0-0, policy_version 287404 (0.00083) [2022-07-09 14:25:10,640][26022] Updated weights on worker 0-0, policy_version 287414 (0.00092) [2022-07-09 14:25:11,955][25689] Fps is (10 sec: 5710.4, 60 sec: 5652.8, 300 sec: 5617.6). Total num frames: 294318080. Throughput: 0: 5834.5. Samples: 294320184. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:11,955][25689] Avg episode reward: [(0, '-47.219')] [2022-07-09 14:25:12,478][26022] Updated weights on worker 0-0, policy_version 287424 (0.00090) [2022-07-09 14:25:14,201][26022] Updated weights on worker 0-0, policy_version 287434 (0.00084) [2022-07-09 14:25:16,231][26022] Updated weights on worker 0-0, policy_version 287444 (0.00090) [2022-07-09 14:25:17,048][25689] Fps is (10 sec: 5569.7, 60 sec: 5632.9, 300 sec: 5619.9). Total num frames: 294346752. Throughput: 0: 5830.1. Samples: 294354136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:17,048][25689] Avg episode reward: [(0, '-47.652')] [2022-07-09 14:25:17,975][26022] Updated weights on worker 0-0, policy_version 287454 (0.00100) [2022-07-09 14:25:19,643][26022] Updated weights on worker 0-0, policy_version 287464 (0.00085) [2022-07-09 14:25:21,535][26022] Updated weights on worker 0-0, policy_version 287474 (0.00090) [2022-07-09 14:25:22,095][25689] Fps is (10 sec: 5756.1, 60 sec: 5663.0, 300 sec: 5623.0). Total num frames: 294376448. Throughput: 0: 5002.0. Samples: 294371206. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:22,095][25689] Avg episode reward: [(0, '-48.586')] [2022-07-09 14:25:23,375][26022] Updated weights on worker 0-0, policy_version 287484 (0.00084) [2022-07-09 14:25:25,114][26022] Updated weights on worker 0-0, policy_version 287494 (0.00087) [2022-07-09 14:25:27,007][26022] Updated weights on worker 0-0, policy_version 287504 (0.00087) [2022-07-09 14:25:27,106][25689] Fps is (10 sec: 5701.0, 60 sec: 5646.8, 300 sec: 5612.6). Total num frames: 294404096. Throughput: 0: 5915.9. Samples: 294405352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:27,108][25689] Avg episode reward: [(0, '-49.218')] [2022-07-09 14:25:28,813][26022] Updated weights on worker 0-0, policy_version 287514 (0.00097) [2022-07-09 14:25:30,468][26022] Updated weights on worker 0-0, policy_version 287524 (0.00091) [2022-07-09 14:25:32,146][25689] Fps is (10 sec: 5603.3, 60 sec: 5662.2, 300 sec: 5624.2). Total num frames: 294432768. Throughput: 0: 5912.4. Samples: 294439564. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:32,148][25689] Avg episode reward: [(0, '-49.250')] [2022-07-09 14:25:32,528][26022] Updated weights on worker 0-0, policy_version 287534 (0.00082) [2022-07-09 14:25:34,227][26022] Updated weights on worker 0-0, policy_version 287544 (0.00093) [2022-07-09 14:25:35,829][26022] Updated weights on worker 0-0, policy_version 287554 (0.00082) [2022-07-09 14:25:37,200][25689] Fps is (10 sec: 5782.5, 60 sec: 5650.8, 300 sec: 5623.9). Total num frames: 294462464. Throughput: 0: 5091.8. Samples: 294456748. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:37,202][25689] Avg episode reward: [(0, '-49.429')] [2022-07-09 14:25:37,815][26022] Updated weights on worker 0-0, policy_version 287564 (0.00091) [2022-07-09 14:25:39,552][26022] Updated weights on worker 0-0, policy_version 287574 (0.00086) [2022-07-09 14:25:41,480][26022] Updated weights on worker 0-0, policy_version 287584 (0.01045) [2022-07-09 14:25:42,215][25689] Fps is (10 sec: 5695.4, 60 sec: 5651.5, 300 sec: 5620.8). Total num frames: 294490112. Throughput: 0: 5952.2. Samples: 294490962. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:42,215][25689] Avg episode reward: [(0, '-49.559')] [2022-07-09 14:25:43,052][26022] Updated weights on worker 0-0, policy_version 287594 (0.00087) [2022-07-09 14:25:44,947][26022] Updated weights on worker 0-0, policy_version 287604 (0.00079) [2022-07-09 14:25:46,965][26022] Updated weights on worker 0-0, policy_version 287614 (0.00089) [2022-07-09 14:25:47,217][25689] Fps is (10 sec: 5520.3, 60 sec: 5619.3, 300 sec: 5624.5). Total num frames: 294517760. Throughput: 0: 5948.2. Samples: 294524976. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:47,217][25689] Avg episode reward: [(0, '-48.433')] [2022-07-09 14:25:48,463][26022] Updated weights on worker 0-0, policy_version 287624 (0.00089) [2022-07-09 14:25:50,446][26022] Updated weights on worker 0-0, policy_version 287634 (0.00085) [2022-07-09 14:25:52,134][26022] Updated weights on worker 0-0, policy_version 287644 (0.00094) [2022-07-09 14:25:52,248][25689] Fps is (10 sec: 5715.3, 60 sec: 5658.0, 300 sec: 5622.8). Total num frames: 294547456. Throughput: 0: 5108.5. Samples: 294542254. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:52,249][25689] Avg episode reward: [(0, '-48.673')] [2022-07-09 14:25:54,094][26022] Updated weights on worker 0-0, policy_version 287654 (0.00093) [2022-07-09 14:25:55,870][26022] Updated weights on worker 0-0, policy_version 287664 (0.00080) [2022-07-09 14:25:57,295][25689] Fps is (10 sec: 5791.7, 60 sec: 5646.6, 300 sec: 5627.0). Total num frames: 294576128. Throughput: 0: 5957.5. Samples: 294576462. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:25:57,295][25689] Avg episode reward: [(0, '-48.018')] [2022-07-09 14:25:57,447][26022] Updated weights on worker 0-0, policy_version 287674 (0.00086) [2022-07-09 14:25:59,340][26022] Updated weights on worker 0-0, policy_version 287684 (0.00084) [2022-07-09 14:26:01,163][26022] Updated weights on worker 0-0, policy_version 287694 (0.00078) [2022-07-09 14:26:02,378][25689] Fps is (10 sec: 5357.6, 60 sec: 5639.5, 300 sec: 5622.5). Total num frames: 294601728. Throughput: 0: 5869.2. Samples: 294609304. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:02,378][25689] Avg episode reward: [(0, '-48.741')] [2022-07-09 14:26:03,091][26022] Updated weights on worker 0-0, policy_version 287704 (0.00086) [2022-07-09 14:26:05,073][26022] Updated weights on worker 0-0, policy_version 287714 (0.00091) [2022-07-09 14:26:06,743][26022] Updated weights on worker 0-0, policy_version 287724 (0.01026) [2022-07-09 14:26:07,411][25689] Fps is (10 sec: 5466.2, 60 sec: 5638.6, 300 sec: 5622.1). Total num frames: 294631424. Throughput: 0: 5014.2. Samples: 294626232. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:07,411][25689] Avg episode reward: [(0, '-48.255')] [2022-07-09 14:26:08,723][26022] Updated weights on worker 0-0, policy_version 287734 (0.00088) [2022-07-09 14:26:10,292][26022] Updated weights on worker 0-0, policy_version 287744 (0.00087) [2022-07-09 14:26:12,014][26022] Updated weights on worker 0-0, policy_version 287754 (0.00090) [2022-07-09 14:26:12,431][25689] Fps is (10 sec: 6009.5, 60 sec: 5689.2, 300 sec: 5636.6). Total num frames: 294662144. Throughput: 0: 5884.6. Samples: 294661024. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:12,432][25689] Avg episode reward: [(0, '-48.482')] [2022-07-09 14:26:14,044][26022] Updated weights on worker 0-0, policy_version 287764 (0.00093) [2022-07-09 14:26:15,744][26022] Updated weights on worker 0-0, policy_version 287774 (0.00092) [2022-07-09 14:26:17,517][25689] Fps is (10 sec: 5674.2, 60 sec: 5656.0, 300 sec: 5629.3). Total num frames: 294688768. Throughput: 0: 5862.5. Samples: 294695014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:17,517][25689] Avg episode reward: [(0, '-48.653')] [2022-07-09 14:26:17,596][26022] Updated weights on worker 0-0, policy_version 287784 (0.00092) [2022-07-09 14:26:19,375][26022] Updated weights on worker 0-0, policy_version 287794 (0.00087) [2022-07-09 14:26:21,141][26022] Updated weights on worker 0-0, policy_version 287804 (0.00090) [2022-07-09 14:26:22,547][25689] Fps is (10 sec: 5567.5, 60 sec: 5657.6, 300 sec: 5632.9). Total num frames: 294718464. Throughput: 0: 5095.1. Samples: 294712068. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:22,548][25689] Avg episode reward: [(0, '-49.256')] [2022-07-09 14:26:22,853][26022] Updated weights on worker 0-0, policy_version 287814 (0.00087) [2022-07-09 14:26:24,776][26022] Updated weights on worker 0-0, policy_version 287824 (0.00089) [2022-07-09 14:26:26,406][26022] Updated weights on worker 0-0, policy_version 287834 (0.00087) [2022-07-09 14:26:27,560][25689] Fps is (10 sec: 5913.8, 60 sec: 5691.4, 300 sec: 5632.9). Total num frames: 294748160. Throughput: 0: 5986.3. Samples: 294746850. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:27,560][25689] Avg episode reward: [(0, '-50.086')] [2022-07-09 14:26:28,291][26022] Updated weights on worker 0-0, policy_version 287844 (0.00088) [2022-07-09 14:26:30,083][26022] Updated weights on worker 0-0, policy_version 287854 (0.00088) [2022-07-09 14:26:31,858][26022] Updated weights on worker 0-0, policy_version 287864 (0.00103) [2022-07-09 14:26:32,583][25689] Fps is (10 sec: 5815.6, 60 sec: 5692.9, 300 sec: 5630.9). Total num frames: 294776832. Throughput: 0: 5964.4. Samples: 294781220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:32,584][25689] Avg episode reward: [(0, '-50.039')] [2022-07-09 14:26:33,633][26022] Updated weights on worker 0-0, policy_version 287874 (0.00086) [2022-07-09 14:26:35,366][26022] Updated weights on worker 0-0, policy_version 287884 (0.00087) [2022-07-09 14:26:37,268][26022] Updated weights on worker 0-0, policy_version 287894 (0.00089) [2022-07-09 14:26:37,651][25689] Fps is (10 sec: 5784.0, 60 sec: 5691.6, 300 sec: 5638.0). Total num frames: 294806528. Throughput: 0: 5124.0. Samples: 294798182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:37,652][25689] Avg episode reward: [(0, '-50.492')] [2022-07-09 14:26:38,932][26022] Updated weights on worker 0-0, policy_version 287904 (0.00089) [2022-07-09 14:26:40,581][26022] Updated weights on worker 0-0, policy_version 287914 (0.00085) [2022-07-09 14:26:42,574][26022] Updated weights on worker 0-0, policy_version 287924 (0.00086) [2022-07-09 14:26:42,679][25689] Fps is (10 sec: 5680.0, 60 sec: 5690.3, 300 sec: 5635.1). Total num frames: 294834176. Throughput: 0: 6005.7. Samples: 294832976. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:42,680][25689] Avg episode reward: [(0, '-51.066')] [2022-07-09 14:26:44,262][26022] Updated weights on worker 0-0, policy_version 287934 (0.00084) [2022-07-09 14:26:46,139][26022] Updated weights on worker 0-0, policy_version 287944 (0.00081) [2022-07-09 14:26:47,685][25689] Fps is (10 sec: 5714.9, 60 sec: 5723.9, 300 sec: 5638.6). Total num frames: 294863872. Throughput: 0: 5998.3. Samples: 294867568. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:47,686][25689] Avg episode reward: [(0, '-50.811')] [2022-07-09 14:26:47,883][26022] Updated weights on worker 0-0, policy_version 287954 (0.00085) [2022-07-09 14:26:49,460][26022] Updated weights on worker 0-0, policy_version 287964 (0.00080) [2022-07-09 14:26:51,615][26022] Updated weights on worker 0-0, policy_version 287974 (0.00081) [2022-07-09 14:26:52,715][25689] Fps is (10 sec: 5918.1, 60 sec: 5724.0, 300 sec: 5643.3). Total num frames: 294893568. Throughput: 0: 5150.5. Samples: 294884908. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 14:26:52,716][25689] Avg episode reward: [(0, '-50.356')] [2022-07-09 14:26:52,972][26022] Updated weights on worker 0-0, policy_version 287984 (0.00084) [2022-07-09 14:26:55,093][26022] Updated weights on worker 0-0, policy_version 287994 (0.00091) [2022-07-09 14:26:56,692][26022] Updated weights on worker 0-0, policy_version 288004 (0.00090) [2022-07-09 14:26:57,821][25689] Fps is (10 sec: 5556.3, 60 sec: 5684.5, 300 sec: 5634.5). Total num frames: 294920192. Throughput: 0: 6017.5. Samples: 294919558. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:26:57,822][25689] Avg episode reward: [(0, '-50.845')] [2022-07-09 14:26:58,678][26022] Updated weights on worker 0-0, policy_version 288014 (0.00086) [2022-07-09 14:27:00,607][26022] Updated weights on worker 0-0, policy_version 288024 (0.00083) [2022-07-09 14:27:02,243][26022] Updated weights on worker 0-0, policy_version 288034 (0.00085) [2022-07-09 14:27:02,843][25689] Fps is (10 sec: 5459.6, 60 sec: 5741.1, 300 sec: 5649.3). Total num frames: 294948864. Throughput: 0: 6012.6. Samples: 294954214. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:02,847][25689] Avg episode reward: [(0, '-50.168')] [2022-07-09 14:27:04,360][26022] Updated weights on worker 0-0, policy_version 288044 (0.00086) [2022-07-09 14:27:04,606][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:27:04,630][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000288046_294959104.pth [2022-07-09 14:27:04,631][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000286059_292924416.pth [2022-07-09 14:27:05,984][26022] Updated weights on worker 0-0, policy_version 288054 (0.00088) [2022-07-09 14:27:07,783][26022] Updated weights on worker 0-0, policy_version 288064 (0.00090) [2022-07-09 14:27:07,871][25689] Fps is (10 sec: 5706.3, 60 sec: 5724.6, 300 sec: 5642.5). Total num frames: 294977536. Throughput: 0: 5900.3. Samples: 294986668. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:07,871][25689] Avg episode reward: [(0, '-49.553')] [2022-07-09 14:27:09,720][26022] Updated weights on worker 0-0, policy_version 288074 (0.00529) [2022-07-09 14:27:11,361][26022] Updated weights on worker 0-0, policy_version 288084 (0.00088) [2022-07-09 14:27:12,878][25689] Fps is (10 sec: 5714.5, 60 sec: 5692.0, 300 sec: 5650.3). Total num frames: 295006208. Throughput: 0: 5905.4. Samples: 295003978. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:12,879][25689] Avg episode reward: [(0, '-48.852')] [2022-07-09 14:27:13,135][26022] Updated weights on worker 0-0, policy_version 288094 (0.00083) [2022-07-09 14:27:14,979][26022] Updated weights on worker 0-0, policy_version 288104 (0.00084) [2022-07-09 14:27:16,496][26022] Updated weights on worker 0-0, policy_version 288114 (0.00088) [2022-07-09 14:27:17,915][25689] Fps is (10 sec: 5709.3, 60 sec: 5730.5, 300 sec: 5647.8). Total num frames: 295034880. Throughput: 0: 5923.7. Samples: 295038584. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:17,915][25689] Avg episode reward: [(0, '-49.563')] [2022-07-09 14:27:18,486][26022] Updated weights on worker 0-0, policy_version 288124 (0.00078) [2022-07-09 14:27:20,173][26022] Updated weights on worker 0-0, policy_version 288134 (0.00089) [2022-07-09 14:27:21,996][26022] Updated weights on worker 0-0, policy_version 288144 (0.00084) [2022-07-09 14:27:22,925][25689] Fps is (10 sec: 5707.3, 60 sec: 5715.4, 300 sec: 5648.4). Total num frames: 295063552. Throughput: 0: 5938.1. Samples: 295073464. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:22,926][25689] Avg episode reward: [(0, '-50.328')] [2022-07-09 14:27:23,681][26022] Updated weights on worker 0-0, policy_version 288154 (0.00083) [2022-07-09 14:27:25,497][26022] Updated weights on worker 0-0, policy_version 288164 (0.00086) [2022-07-09 14:27:27,263][26022] Updated weights on worker 0-0, policy_version 288174 (0.00081) [2022-07-09 14:27:27,933][25689] Fps is (10 sec: 5723.8, 60 sec: 5698.9, 300 sec: 5652.3). Total num frames: 295092224. Throughput: 0: 5192.7. Samples: 295090846. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:27,934][25689] Avg episode reward: [(0, '-50.386')] [2022-07-09 14:27:29,005][26022] Updated weights on worker 0-0, policy_version 288184 (0.00091) [2022-07-09 14:27:30,980][26022] Updated weights on worker 0-0, policy_version 288194 (0.00087) [2022-07-09 14:27:32,640][26022] Updated weights on worker 0-0, policy_version 288204 (0.00090) [2022-07-09 14:27:32,949][25689] Fps is (10 sec: 5823.0, 60 sec: 5716.6, 300 sec: 5656.8). Total num frames: 295121920. Throughput: 0: 6039.3. Samples: 295125194. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:32,950][25689] Avg episode reward: [(0, '-50.490')] [2022-07-09 14:27:34,634][26022] Updated weights on worker 0-0, policy_version 288214 (0.00087) [2022-07-09 14:27:36,185][26022] Updated weights on worker 0-0, policy_version 288224 (0.00091) [2022-07-09 14:27:38,075][25689] Fps is (10 sec: 5856.0, 60 sec: 5711.1, 300 sec: 5659.2). Total num frames: 295151616. Throughput: 0: 6014.0. Samples: 295159830. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:38,076][25689] Avg episode reward: [(0, '-50.193')] [2022-07-09 14:27:38,078][26022] Updated weights on worker 0-0, policy_version 288234 (0.00094) [2022-07-09 14:27:39,817][26022] Updated weights on worker 0-0, policy_version 288244 (0.00080) [2022-07-09 14:27:41,487][26022] Updated weights on worker 0-0, policy_version 288254 (0.00077) [2022-07-09 14:27:43,124][25689] Fps is (10 sec: 5836.9, 60 sec: 5743.0, 300 sec: 5661.7). Total num frames: 295181312. Throughput: 0: 5132.4. Samples: 295177134. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:43,125][25689] Avg episode reward: [(0, '-49.757')] [2022-07-09 14:27:43,286][26022] Updated weights on worker 0-0, policy_version 288264 (0.00089) [2022-07-09 14:27:45,175][26022] Updated weights on worker 0-0, policy_version 288274 (0.00079) [2022-07-09 14:27:46,828][26022] Updated weights on worker 0-0, policy_version 288284 (0.00096) [2022-07-09 14:27:48,199][25689] Fps is (10 sec: 5664.4, 60 sec: 5702.7, 300 sec: 5657.5). Total num frames: 295208960. Throughput: 0: 5964.7. Samples: 295211724. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:48,199][25689] Avg episode reward: [(0, '-48.279')] [2022-07-09 14:27:48,743][26022] Updated weights on worker 0-0, policy_version 288294 (0.00098) [2022-07-09 14:27:50,314][26022] Updated weights on worker 0-0, policy_version 288304 (0.00090) [2022-07-09 14:27:52,202][26022] Updated weights on worker 0-0, policy_version 288314 (0.00084) [2022-07-09 14:27:53,203][25689] Fps is (10 sec: 5588.0, 60 sec: 5688.1, 300 sec: 5658.6). Total num frames: 295237632. Throughput: 0: 5977.2. Samples: 295246256. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:53,204][25689] Avg episode reward: [(0, '-48.296')] [2022-07-09 14:27:54,015][26022] Updated weights on worker 0-0, policy_version 288324 (0.00077) [2022-07-09 14:27:55,746][26022] Updated weights on worker 0-0, policy_version 288334 (0.00088) [2022-07-09 14:27:57,551][26022] Updated weights on worker 0-0, policy_version 288344 (0.00085) [2022-07-09 14:27:58,273][25689] Fps is (10 sec: 5895.2, 60 sec: 5759.3, 300 sec: 5664.5). Total num frames: 295268352. Throughput: 0: 5145.4. Samples: 295263758. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:27:58,274][25689] Avg episode reward: [(0, '-47.807')] [2022-07-09 14:27:59,428][26022] Updated weights on worker 0-0, policy_version 288354 (0.00091) [2022-07-09 14:28:01,002][26022] Updated weights on worker 0-0, policy_version 288364 (0.00083) [2022-07-09 14:28:03,294][25689] Fps is (10 sec: 5581.3, 60 sec: 5708.6, 300 sec: 5660.7). Total num frames: 295293952. Throughput: 0: 5995.8. Samples: 295298066. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:03,296][25689] Avg episode reward: [(0, '-48.519')] [2022-07-09 14:28:03,346][26022] Updated weights on worker 0-0, policy_version 288374 (0.00088) [2022-07-09 14:28:05,037][26022] Updated weights on worker 0-0, policy_version 288384 (0.00087) [2022-07-09 14:28:06,850][26022] Updated weights on worker 0-0, policy_version 288394 (0.00099) [2022-07-09 14:28:08,308][25689] Fps is (10 sec: 5510.5, 60 sec: 5726.8, 300 sec: 5668.0). Total num frames: 295323648. Throughput: 0: 5927.5. Samples: 295330922. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:08,310][25689] Avg episode reward: [(0, '-48.723')] [2022-07-09 14:28:08,464][26022] Updated weights on worker 0-0, policy_version 288404 (0.00085) [2022-07-09 14:28:10,312][26022] Updated weights on worker 0-0, policy_version 288414 (0.00048) [2022-07-09 14:28:12,372][26022] Updated weights on worker 0-0, policy_version 288424 (0.00094) [2022-07-09 14:28:13,320][25689] Fps is (10 sec: 5924.0, 60 sec: 5743.3, 300 sec: 5672.9). Total num frames: 295353344. Throughput: 0: 5067.8. Samples: 295348202. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:13,321][25689] Avg episode reward: [(0, '-48.547')] [2022-07-09 14:28:14,003][26022] Updated weights on worker 0-0, policy_version 288434 (0.00088) [2022-07-09 14:28:15,541][26022] Updated weights on worker 0-0, policy_version 288444 (0.01202) [2022-07-09 14:28:17,609][26022] Updated weights on worker 0-0, policy_version 288454 (0.00081) [2022-07-09 14:28:18,403][25689] Fps is (10 sec: 5680.5, 60 sec: 5722.0, 300 sec: 5672.8). Total num frames: 295380992. Throughput: 0: 5899.4. Samples: 295382512. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:18,404][25689] Avg episode reward: [(0, '-47.875')] [2022-07-09 14:28:19,347][26022] Updated weights on worker 0-0, policy_version 288464 (0.00090) [2022-07-09 14:28:21,235][26022] Updated weights on worker 0-0, policy_version 288474 (0.00094) [2022-07-09 14:28:22,954][26022] Updated weights on worker 0-0, policy_version 288484 (0.00081) [2022-07-09 14:28:23,431][25689] Fps is (10 sec: 5570.1, 60 sec: 5720.4, 300 sec: 5672.3). Total num frames: 295409664. Throughput: 0: 5915.5. Samples: 295417184. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:23,431][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 14:28:24,599][26022] Updated weights on worker 0-0, policy_version 288494 (0.00083) [2022-07-09 14:28:26,512][26022] Updated weights on worker 0-0, policy_version 288504 (0.00103) [2022-07-09 14:28:28,251][26022] Updated weights on worker 0-0, policy_version 288514 (0.00083) [2022-07-09 14:28:28,439][25689] Fps is (10 sec: 5714.0, 60 sec: 5720.3, 300 sec: 5673.2). Total num frames: 295438336. Throughput: 0: 5129.6. Samples: 295434184. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:28,439][25689] Avg episode reward: [(0, '-47.161')] [2022-07-09 14:28:30,094][26022] Updated weights on worker 0-0, policy_version 288524 (0.00088) [2022-07-09 14:28:31,964][26022] Updated weights on worker 0-0, policy_version 288534 (0.00093) [2022-07-09 14:28:33,459][25689] Fps is (10 sec: 5820.5, 60 sec: 5720.0, 300 sec: 5668.6). Total num frames: 295468032. Throughput: 0: 5967.2. Samples: 295468376. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:33,459][25689] Avg episode reward: [(0, '-46.113')] [2022-07-09 14:28:33,566][26022] Updated weights on worker 0-0, policy_version 288544 (0.00092) [2022-07-09 14:28:35,671][26022] Updated weights on worker 0-0, policy_version 288554 (0.00082) [2022-07-09 14:28:37,250][26022] Updated weights on worker 0-0, policy_version 288564 (0.00092) [2022-07-09 14:28:38,528][25689] Fps is (10 sec: 5785.0, 60 sec: 5708.4, 300 sec: 5682.0). Total num frames: 295496704. Throughput: 0: 5966.7. Samples: 295502592. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:38,529][25689] Avg episode reward: [(0, '-46.781')] [2022-07-09 14:28:39,187][26022] Updated weights on worker 0-0, policy_version 288574 (0.00093) [2022-07-09 14:28:40,879][26022] Updated weights on worker 0-0, policy_version 288584 (0.00091) [2022-07-09 14:28:42,724][26022] Updated weights on worker 0-0, policy_version 288594 (0.00081) [2022-07-09 14:28:43,573][25689] Fps is (10 sec: 5669.4, 60 sec: 5691.8, 300 sec: 5674.3). Total num frames: 295525376. Throughput: 0: 5090.8. Samples: 295519726. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:43,574][25689] Avg episode reward: [(0, '-47.844')] [2022-07-09 14:28:44,380][26022] Updated weights on worker 0-0, policy_version 288604 (0.00086) [2022-07-09 14:28:46,222][26022] Updated weights on worker 0-0, policy_version 288614 (0.00090) [2022-07-09 14:28:47,979][26022] Updated weights on worker 0-0, policy_version 288624 (0.00106) [2022-07-09 14:28:48,631][25689] Fps is (10 sec: 5574.9, 60 sec: 5693.4, 300 sec: 5676.9). Total num frames: 295553024. Throughput: 0: 5931.0. Samples: 295553942. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:48,631][25689] Avg episode reward: [(0, '-47.484')] [2022-07-09 14:28:49,963][26022] Updated weights on worker 0-0, policy_version 288634 (0.00088) [2022-07-09 14:28:51,613][26022] Updated weights on worker 0-0, policy_version 288644 (0.00093) [2022-07-09 14:28:53,578][26022] Updated weights on worker 0-0, policy_version 288654 (0.00086) [2022-07-09 14:28:53,659][25689] Fps is (10 sec: 5584.3, 60 sec: 5691.2, 300 sec: 5671.6). Total num frames: 295581696. Throughput: 0: 5917.4. Samples: 295587908. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:53,659][25689] Avg episode reward: [(0, '-48.584')] [2022-07-09 14:28:55,318][26022] Updated weights on worker 0-0, policy_version 288664 (0.00099) [2022-07-09 14:28:57,147][26022] Updated weights on worker 0-0, policy_version 288674 (0.00093) [2022-07-09 14:28:58,684][25689] Fps is (10 sec: 5805.7, 60 sec: 5678.5, 300 sec: 5678.7). Total num frames: 295611392. Throughput: 0: 5090.1. Samples: 295605188. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:28:58,685][25689] Avg episode reward: [(0, '-49.186')] [2022-07-09 14:28:58,808][26022] Updated weights on worker 0-0, policy_version 288684 (0.00084) [2022-07-09 14:29:00,578][26022] Updated weights on worker 0-0, policy_version 288694 (0.00090) [2022-07-09 14:29:02,747][26022] Updated weights on worker 0-0, policy_version 288704 (0.00089) [2022-07-09 14:29:03,716][25689] Fps is (10 sec: 5599.9, 60 sec: 5694.3, 300 sec: 5679.4). Total num frames: 295638016. Throughput: 0: 5888.2. Samples: 295638330. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:29:03,717][25689] Avg episode reward: [(0, '-49.337')] [2022-07-09 14:29:04,549][26022] Updated weights on worker 0-0, policy_version 288714 (0.00089) [2022-07-09 14:29:04,664][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:29:04,682][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000288715_295644160.pth [2022-07-09 14:29:04,683][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000286719_293600256.pth [2022-07-09 14:29:06,255][26022] Updated weights on worker 0-0, policy_version 288724 (0.00085) [2022-07-09 14:29:08,248][26022] Updated weights on worker 0-0, policy_version 288734 (0.00911) [2022-07-09 14:29:08,735][25689] Fps is (10 sec: 5501.7, 60 sec: 5677.0, 300 sec: 5686.9). Total num frames: 295666688. Throughput: 0: 5896.4. Samples: 295672484. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:29:08,735][25689] Avg episode reward: [(0, '-48.500')] [2022-07-09 14:29:09,827][26022] Updated weights on worker 0-0, policy_version 288744 (0.00088) [2022-07-09 14:29:11,644][26022] Updated weights on worker 0-0, policy_version 288754 (0.00086) [2022-07-09 14:29:13,330][26022] Updated weights on worker 0-0, policy_version 288764 (0.00084) [2022-07-09 14:29:13,737][25689] Fps is (10 sec: 5722.3, 60 sec: 5660.9, 300 sec: 5684.6). Total num frames: 295695360. Throughput: 0: 5067.3. Samples: 295689652. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:29:13,738][25689] Avg episode reward: [(0, '-49.062')] [2022-07-09 14:29:15,368][26022] Updated weights on worker 0-0, policy_version 288774 (0.00090) [2022-07-09 14:29:17,013][26022] Updated weights on worker 0-0, policy_version 288784 (0.00091) [2022-07-09 14:29:18,829][25689] Fps is (10 sec: 5579.6, 60 sec: 5660.1, 300 sec: 5683.0). Total num frames: 295723008. Throughput: 0: 5893.6. Samples: 295723912. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2022-07-09 14:29:18,829][25689] Avg episode reward: [(0, '-48.597')] [2022-07-09 14:29:18,929][26022] Updated weights on worker 0-0, policy_version 288794 (0.00090) [2022-07-09 14:29:20,532][26022] Updated weights on worker 0-0, policy_version 288804 (0.00091) [2022-07-09 14:29:22,503][26022] Updated weights on worker 0-0, policy_version 288814 (0.00092) [2022-07-09 14:29:23,846][25689] Fps is (10 sec: 5875.2, 60 sec: 5711.9, 300 sec: 5693.3). Total num frames: 295754752. Throughput: 0: 5948.3. Samples: 295758070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:23,848][25689] Avg episode reward: [(0, '-48.606')] [2022-07-09 14:29:24,105][26022] Updated weights on worker 0-0, policy_version 288824 (0.00090) [2022-07-09 14:29:26,055][26022] Updated weights on worker 0-0, policy_version 288834 (0.00097) [2022-07-09 14:29:27,807][26022] Updated weights on worker 0-0, policy_version 288844 (0.00080) [2022-07-09 14:29:28,852][25689] Fps is (10 sec: 5823.2, 60 sec: 5678.2, 300 sec: 5690.2). Total num frames: 295781376. Throughput: 0: 5109.3. Samples: 295775270. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:28,853][25689] Avg episode reward: [(0, '-48.266')] [2022-07-09 14:29:29,556][26022] Updated weights on worker 0-0, policy_version 288854 (0.00087) [2022-07-09 14:29:31,621][26022] Updated weights on worker 0-0, policy_version 288864 (0.00081) [2022-07-09 14:29:33,042][26022] Updated weights on worker 0-0, policy_version 288874 (0.00087) [2022-07-09 14:29:33,870][25689] Fps is (10 sec: 5618.3, 60 sec: 5678.4, 300 sec: 5688.6). Total num frames: 295811072. Throughput: 0: 5970.3. Samples: 295809854. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:33,871][25689] Avg episode reward: [(0, '-48.705')] [2022-07-09 14:29:34,982][26022] Updated weights on worker 0-0, policy_version 288884 (0.00091) [2022-07-09 14:29:36,507][26022] Updated weights on worker 0-0, policy_version 288894 (0.00087) [2022-07-09 14:29:38,553][26022] Updated weights on worker 0-0, policy_version 288904 (0.00097) [2022-07-09 14:29:38,939][25689] Fps is (10 sec: 5888.0, 60 sec: 5695.4, 300 sec: 5694.6). Total num frames: 295840768. Throughput: 0: 6012.7. Samples: 295844830. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:38,940][25689] Avg episode reward: [(0, '-48.615')] [2022-07-09 14:29:40,228][26022] Updated weights on worker 0-0, policy_version 288914 (0.00982) [2022-07-09 14:29:42,032][26022] Updated weights on worker 0-0, policy_version 288924 (0.00436) [2022-07-09 14:29:43,715][26022] Updated weights on worker 0-0, policy_version 288934 (0.00088) [2022-07-09 14:29:43,991][25689] Fps is (10 sec: 5767.3, 60 sec: 5694.8, 300 sec: 5690.5). Total num frames: 295869440. Throughput: 0: 5174.9. Samples: 295862318. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:43,992][25689] Avg episode reward: [(0, '-48.128')] [2022-07-09 14:29:45,520][26022] Updated weights on worker 0-0, policy_version 288944 (0.00085) [2022-07-09 14:29:47,273][26022] Updated weights on worker 0-0, policy_version 288954 (0.00088) [2022-07-09 14:29:48,999][25689] Fps is (10 sec: 5700.5, 60 sec: 5716.4, 300 sec: 5695.4). Total num frames: 295898112. Throughput: 0: 6039.3. Samples: 295896940. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:48,999][25689] Avg episode reward: [(0, '-47.542')] [2022-07-09 14:29:49,070][26022] Updated weights on worker 0-0, policy_version 288964 (0.00089) [2022-07-09 14:29:50,781][26022] Updated weights on worker 0-0, policy_version 288974 (0.00083) [2022-07-09 14:29:52,575][26022] Updated weights on worker 0-0, policy_version 288984 (0.00083) [2022-07-09 14:29:54,013][25689] Fps is (10 sec: 5721.6, 60 sec: 5717.7, 300 sec: 5693.7). Total num frames: 295926784. Throughput: 0: 6043.9. Samples: 295931596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:54,014][25689] Avg episode reward: [(0, '-47.155')] [2022-07-09 14:29:54,533][26022] Updated weights on worker 0-0, policy_version 288994 (0.00085) [2022-07-09 14:29:56,088][26022] Updated weights on worker 0-0, policy_version 289004 (0.00090) [2022-07-09 14:29:57,905][26022] Updated weights on worker 0-0, policy_version 289014 (0.00082) [2022-07-09 14:29:59,096][25689] Fps is (10 sec: 5780.5, 60 sec: 5712.3, 300 sec: 5706.1). Total num frames: 295956480. Throughput: 0: 6022.8. Samples: 295966230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:29:59,097][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 14:29:59,706][26022] Updated weights on worker 0-0, policy_version 289024 (0.00095) [2022-07-09 14:30:01,804][26022] Updated weights on worker 0-0, policy_version 289034 (0.00095) [2022-07-09 14:30:03,646][26022] Updated weights on worker 0-0, policy_version 289044 (0.00089) [2022-07-09 14:30:04,144][25689] Fps is (10 sec: 5559.1, 60 sec: 5710.7, 300 sec: 5695.2). Total num frames: 295983104. Throughput: 0: 5919.2. Samples: 295981610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:04,145][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 14:30:05,431][26022] Updated weights on worker 0-0, policy_version 289054 (0.00082) [2022-07-09 14:30:07,120][26022] Updated weights on worker 0-0, policy_version 289064 (0.00086) [2022-07-09 14:30:08,857][26022] Updated weights on worker 0-0, policy_version 289074 (0.00090) [2022-07-09 14:30:09,188][25689] Fps is (10 sec: 5682.0, 60 sec: 5742.2, 300 sec: 5705.1). Total num frames: 296013824. Throughput: 0: 5918.3. Samples: 296016428. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:09,189][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 14:30:10,858][26022] Updated weights on worker 0-0, policy_version 289084 (0.00084) [2022-07-09 14:30:12,348][26022] Updated weights on worker 0-0, policy_version 289094 (0.00091) [2022-07-09 14:30:14,219][25689] Fps is (10 sec: 5793.8, 60 sec: 5722.6, 300 sec: 5702.8). Total num frames: 296041472. Throughput: 0: 5925.8. Samples: 296051328. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:14,219][25689] Avg episode reward: [(0, '-47.571')] [2022-07-09 14:30:14,299][26022] Updated weights on worker 0-0, policy_version 289104 (0.00083) [2022-07-09 14:30:15,971][26022] Updated weights on worker 0-0, policy_version 289114 (0.00088) [2022-07-09 14:30:17,977][26022] Updated weights on worker 0-0, policy_version 289124 (0.00094) [2022-07-09 14:30:19,347][25689] Fps is (10 sec: 5644.8, 60 sec: 5753.0, 300 sec: 5701.3). Total num frames: 296071168. Throughput: 0: 5067.1. Samples: 296068842. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:19,347][25689] Avg episode reward: [(0, '-47.959')] [2022-07-09 14:30:19,591][26022] Updated weights on worker 0-0, policy_version 289134 (0.00546) [2022-07-09 14:30:21,506][26022] Updated weights on worker 0-0, policy_version 289144 (0.00084) [2022-07-09 14:30:23,047][26022] Updated weights on worker 0-0, policy_version 289154 (0.00082) [2022-07-09 14:30:24,357][25689] Fps is (10 sec: 5757.1, 60 sec: 5702.9, 300 sec: 5704.8). Total num frames: 296099840. Throughput: 0: 6008.7. Samples: 296103062. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:24,358][25689] Avg episode reward: [(0, '-48.064')] [2022-07-09 14:30:25,007][26022] Updated weights on worker 0-0, policy_version 289164 (0.00089) [2022-07-09 14:30:26,833][26022] Updated weights on worker 0-0, policy_version 289174 (0.00333) [2022-07-09 14:30:28,350][26022] Updated weights on worker 0-0, policy_version 289184 (0.00084) [2022-07-09 14:30:29,417][25689] Fps is (10 sec: 5795.9, 60 sec: 5748.5, 300 sec: 5707.8). Total num frames: 296129536. Throughput: 0: 6008.0. Samples: 296137966. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:29,418][25689] Avg episode reward: [(0, '-48.281')] [2022-07-09 14:30:30,307][26022] Updated weights on worker 0-0, policy_version 289194 (0.00081) [2022-07-09 14:30:31,890][26022] Updated weights on worker 0-0, policy_version 289204 (0.00222) [2022-07-09 14:30:33,858][26022] Updated weights on worker 0-0, policy_version 289214 (0.00082) [2022-07-09 14:30:34,477][25689] Fps is (10 sec: 5868.9, 60 sec: 5744.6, 300 sec: 5707.7). Total num frames: 296159232. Throughput: 0: 5125.9. Samples: 296155170. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:34,477][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 14:30:35,738][26022] Updated weights on worker 0-0, policy_version 289224 (0.00092) [2022-07-09 14:30:37,335][26022] Updated weights on worker 0-0, policy_version 289234 (0.00084) [2022-07-09 14:30:39,419][26022] Updated weights on worker 0-0, policy_version 289244 (0.00085) [2022-07-09 14:30:39,555][25689] Fps is (10 sec: 5656.3, 60 sec: 5709.9, 300 sec: 5706.5). Total num frames: 296186880. Throughput: 0: 5954.2. Samples: 296189168. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:39,557][25689] Avg episode reward: [(0, '-47.984')] [2022-07-09 14:30:40,954][26022] Updated weights on worker 0-0, policy_version 289254 (0.00078) [2022-07-09 14:30:42,925][26022] Updated weights on worker 0-0, policy_version 289264 (0.00083) [2022-07-09 14:30:44,529][26022] Updated weights on worker 0-0, policy_version 289274 (0.00078) [2022-07-09 14:30:44,625][25689] Fps is (10 sec: 5650.9, 60 sec: 5725.1, 300 sec: 5712.1). Total num frames: 296216576. Throughput: 0: 5956.1. Samples: 296223778. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:44,625][25689] Avg episode reward: [(0, '-48.002')] [2022-07-09 14:30:46,358][26022] Updated weights on worker 0-0, policy_version 289284 (0.00088) [2022-07-09 14:30:48,140][26022] Updated weights on worker 0-0, policy_version 289294 (0.00083) [2022-07-09 14:30:49,631][25689] Fps is (10 sec: 5894.5, 60 sec: 5742.1, 300 sec: 5712.5). Total num frames: 296246272. Throughput: 0: 5108.6. Samples: 296241230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:49,632][25689] Avg episode reward: [(0, '-47.805')] [2022-07-09 14:30:49,791][26022] Updated weights on worker 0-0, policy_version 289304 (0.00088) [2022-07-09 14:30:51,810][26022] Updated weights on worker 0-0, policy_version 289314 (0.00098) [2022-07-09 14:30:53,459][26022] Updated weights on worker 0-0, policy_version 289324 (0.00085) [2022-07-09 14:30:54,641][25689] Fps is (10 sec: 5725.2, 60 sec: 5725.7, 300 sec: 5709.8). Total num frames: 296273920. Throughput: 0: 5949.6. Samples: 296275136. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:54,641][25689] Avg episode reward: [(0, '-48.940')] [2022-07-09 14:30:55,267][26022] Updated weights on worker 0-0, policy_version 289334 (0.00091) [2022-07-09 14:30:57,003][26022] Updated weights on worker 0-0, policy_version 289344 (0.00090) [2022-07-09 14:30:58,918][26022] Updated weights on worker 0-0, policy_version 289354 (0.00082) [2022-07-09 14:30:59,739][25689] Fps is (10 sec: 5673.4, 60 sec: 5724.3, 300 sec: 5723.3). Total num frames: 296303616. Throughput: 0: 5975.0. Samples: 296309764. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:30:59,739][25689] Avg episode reward: [(0, '-48.906')] [2022-07-09 14:31:00,571][26022] Updated weights on worker 0-0, policy_version 289364 (0.00086) [2022-07-09 14:31:02,711][26022] Updated weights on worker 0-0, policy_version 289374 (0.00082) [2022-07-09 14:31:04,612][26022] Updated weights on worker 0-0, policy_version 289384 (0.00093) [2022-07-09 14:31:04,733][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:31:04,741][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000289385_296330240.pth [2022-07-09 14:31:04,742][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000287379_294276096.pth [2022-07-09 14:31:04,746][25689] Fps is (10 sec: 5573.1, 60 sec: 5728.1, 300 sec: 5713.5). Total num frames: 296330240. Throughput: 0: 5104.0. Samples: 296326480. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:04,747][25689] Avg episode reward: [(0, '-48.834')] [2022-07-09 14:31:06,364][26022] Updated weights on worker 0-0, policy_version 289394 (0.00096) [2022-07-09 14:31:08,258][26022] Updated weights on worker 0-0, policy_version 289404 (0.00088) [2022-07-09 14:31:09,760][25689] Fps is (10 sec: 5517.8, 60 sec: 5697.2, 300 sec: 5706.7). Total num frames: 296358912. Throughput: 0: 5878.2. Samples: 296359552. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:09,761][25689] Avg episode reward: [(0, '-48.663')] [2022-07-09 14:31:09,816][26022] Updated weights on worker 0-0, policy_version 289414 (0.00100) [2022-07-09 14:31:11,763][26022] Updated weights on worker 0-0, policy_version 289424 (0.00081) [2022-07-09 14:31:13,663][26022] Updated weights on worker 0-0, policy_version 289434 (0.00093) [2022-07-09 14:31:14,784][25689] Fps is (10 sec: 5712.8, 60 sec: 5714.7, 300 sec: 5714.8). Total num frames: 296387584. Throughput: 0: 5911.5. Samples: 296394214. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:14,785][25689] Avg episode reward: [(0, '-48.443')] [2022-07-09 14:31:15,119][26022] Updated weights on worker 0-0, policy_version 289444 (0.00094) [2022-07-09 14:31:17,208][26022] Updated weights on worker 0-0, policy_version 289454 (0.00103) [2022-07-09 14:31:18,884][26022] Updated weights on worker 0-0, policy_version 289464 (0.00090) [2022-07-09 14:31:19,846][25689] Fps is (10 sec: 5685.5, 60 sec: 5704.0, 300 sec: 5710.7). Total num frames: 296416256. Throughput: 0: 5057.5. Samples: 296411458. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:19,847][25689] Avg episode reward: [(0, '-48.919')] [2022-07-09 14:31:20,474][26022] Updated weights on worker 0-0, policy_version 289474 (0.00090) [2022-07-09 14:31:22,485][26022] Updated weights on worker 0-0, policy_version 289484 (0.00087) [2022-07-09 14:31:23,865][26022] Updated weights on worker 0-0, policy_version 289494 (0.00093) [2022-07-09 14:31:24,876][25689] Fps is (10 sec: 5783.6, 60 sec: 5719.1, 300 sec: 5710.4). Total num frames: 296445952. Throughput: 0: 5943.6. Samples: 296446124. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:24,877][25689] Avg episode reward: [(0, '-48.381')] [2022-07-09 14:31:26,110][26022] Updated weights on worker 0-0, policy_version 289504 (0.00086) [2022-07-09 14:31:27,669][26022] Updated weights on worker 0-0, policy_version 289514 (0.00087) [2022-07-09 14:31:29,491][26022] Updated weights on worker 0-0, policy_version 289524 (0.00080) [2022-07-09 14:31:29,884][25689] Fps is (10 sec: 5916.8, 60 sec: 5724.0, 300 sec: 5714.1). Total num frames: 296475648. Throughput: 0: 6025.2. Samples: 296480802. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:29,885][25689] Avg episode reward: [(0, '-48.872')] [2022-07-09 14:31:31,399][26022] Updated weights on worker 0-0, policy_version 289534 (0.00087) [2022-07-09 14:31:32,992][26022] Updated weights on worker 0-0, policy_version 289544 (0.00086) [2022-07-09 14:31:34,888][25689] Fps is (10 sec: 5625.6, 60 sec: 5678.5, 300 sec: 5705.0). Total num frames: 296502272. Throughput: 0: 5164.3. Samples: 296498034. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:34,888][25689] Avg episode reward: [(0, '-48.579')] [2022-07-09 14:31:34,987][26022] Updated weights on worker 0-0, policy_version 289554 (0.00092) [2022-07-09 14:31:36,468][26022] Updated weights on worker 0-0, policy_version 289564 (0.00988) [2022-07-09 14:31:38,460][26022] Updated weights on worker 0-0, policy_version 289574 (0.00091) [2022-07-09 14:31:39,955][25689] Fps is (10 sec: 5592.4, 60 sec: 5713.4, 300 sec: 5711.2). Total num frames: 296531968. Throughput: 0: 6028.8. Samples: 296532690. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:39,956][25689] Avg episode reward: [(0, '-49.560')] [2022-07-09 14:31:40,231][26022] Updated weights on worker 0-0, policy_version 289584 (0.00088) [2022-07-09 14:31:41,986][26022] Updated weights on worker 0-0, policy_version 289594 (0.00091) [2022-07-09 14:31:43,737][26022] Updated weights on worker 0-0, policy_version 289604 (0.00085) [2022-07-09 14:31:44,958][25689] Fps is (10 sec: 5897.7, 60 sec: 5719.7, 300 sec: 5711.2). Total num frames: 296561664. Throughput: 0: 6041.2. Samples: 296567442. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 14:31:44,958][25689] Avg episode reward: [(0, '-48.962')] [2022-07-09 14:31:45,512][26022] Updated weights on worker 0-0, policy_version 289614 (0.00092) [2022-07-09 14:31:47,127][26022] Updated weights on worker 0-0, policy_version 289624 (0.00084) [2022-07-09 14:31:48,946][26022] Updated weights on worker 0-0, policy_version 289634 (0.00078) [2022-07-09 14:31:49,968][25689] Fps is (10 sec: 5829.1, 60 sec: 5702.4, 300 sec: 5708.2). Total num frames: 296590336. Throughput: 0: 5184.6. Samples: 296584930. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:31:49,970][25689] Avg episode reward: [(0, '-48.854')] [2022-07-09 14:31:50,753][26022] Updated weights on worker 0-0, policy_version 289644 (0.00081) [2022-07-09 14:31:52,619][26022] Updated weights on worker 0-0, policy_version 289654 (0.00095) [2022-07-09 14:31:54,315][26022] Updated weights on worker 0-0, policy_version 289664 (0.00099) [2022-07-09 14:31:54,991][25689] Fps is (10 sec: 5919.3, 60 sec: 5752.0, 300 sec: 5723.5). Total num frames: 296621056. Throughput: 0: 6046.9. Samples: 296619600. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:31:54,993][25689] Avg episode reward: [(0, '-48.379')] [2022-07-09 14:31:56,062][26022] Updated weights on worker 0-0, policy_version 289674 (0.00083) [2022-07-09 14:31:57,861][26022] Updated weights on worker 0-0, policy_version 289684 (0.00084) [2022-07-09 14:31:59,599][26022] Updated weights on worker 0-0, policy_version 289694 (0.00098) [2022-07-09 14:32:00,060][25689] Fps is (10 sec: 5783.9, 60 sec: 5720.9, 300 sec: 5719.2). Total num frames: 296648704. Throughput: 0: 6032.6. Samples: 296653974. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:00,060][25689] Avg episode reward: [(0, '-48.443')] [2022-07-09 14:32:01,399][26022] Updated weights on worker 0-0, policy_version 289704 (0.00095) [2022-07-09 14:32:03,601][26022] Updated weights on worker 0-0, policy_version 289714 (0.00088) [2022-07-09 14:32:05,078][25689] Fps is (10 sec: 5482.0, 60 sec: 5736.8, 300 sec: 5715.9). Total num frames: 296676352. Throughput: 0: 5081.5. Samples: 296669684. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:05,080][25689] Avg episode reward: [(0, '-48.546')] [2022-07-09 14:32:05,137][26022] Updated weights on worker 0-0, policy_version 289724 (0.00083) [2022-07-09 14:32:07,222][26022] Updated weights on worker 0-0, policy_version 289734 (0.00087) [2022-07-09 14:32:08,854][26022] Updated weights on worker 0-0, policy_version 289744 (0.00056) [2022-07-09 14:32:10,086][25689] Fps is (10 sec: 5515.2, 60 sec: 5720.4, 300 sec: 5712.5). Total num frames: 296704000. Throughput: 0: 5903.2. Samples: 296703688. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:10,087][25689] Avg episode reward: [(0, '-48.341')] [2022-07-09 14:32:10,847][26022] Updated weights on worker 0-0, policy_version 289754 (0.00081) [2022-07-09 14:32:12,479][26022] Updated weights on worker 0-0, policy_version 289764 (0.00087) [2022-07-09 14:32:14,359][26022] Updated weights on worker 0-0, policy_version 289774 (0.00079) [2022-07-09 14:32:15,103][25689] Fps is (10 sec: 5618.3, 60 sec: 5721.1, 300 sec: 5712.9). Total num frames: 296732672. Throughput: 0: 5892.1. Samples: 296738098. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:15,103][25689] Avg episode reward: [(0, '-48.238')] [2022-07-09 14:32:16,117][26022] Updated weights on worker 0-0, policy_version 289784 (0.00087) [2022-07-09 14:32:17,745][26022] Updated weights on worker 0-0, policy_version 289794 (0.00086) [2022-07-09 14:32:19,663][26022] Updated weights on worker 0-0, policy_version 289804 (0.00089) [2022-07-09 14:32:20,216][25689] Fps is (10 sec: 5762.0, 60 sec: 5733.2, 300 sec: 5714.3). Total num frames: 296762368. Throughput: 0: 5035.9. Samples: 296755478. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:20,216][25689] Avg episode reward: [(0, '-48.987')] [2022-07-09 14:32:21,499][26022] Updated weights on worker 0-0, policy_version 289814 (0.00088) [2022-07-09 14:32:23,229][26022] Updated weights on worker 0-0, policy_version 289824 (0.00399) [2022-07-09 14:32:25,030][26022] Updated weights on worker 0-0, policy_version 289834 (0.00051) [2022-07-09 14:32:25,239][25689] Fps is (10 sec: 5758.5, 60 sec: 5716.9, 300 sec: 5714.0). Total num frames: 296791040. Throughput: 0: 5959.9. Samples: 296789838. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:25,239][25689] Avg episode reward: [(0, '-47.912')] [2022-07-09 14:32:26,780][26022] Updated weights on worker 0-0, policy_version 289844 (0.00077) [2022-07-09 14:32:28,560][26022] Updated weights on worker 0-0, policy_version 289854 (0.00079) [2022-07-09 14:32:30,262][25689] Fps is (10 sec: 5707.8, 60 sec: 5698.5, 300 sec: 5710.5). Total num frames: 296819712. Throughput: 0: 5997.9. Samples: 296824704. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:30,263][25689] Avg episode reward: [(0, '-48.786')] [2022-07-09 14:32:30,323][26022] Updated weights on worker 0-0, policy_version 289864 (0.00088) [2022-07-09 14:32:32,219][26022] Updated weights on worker 0-0, policy_version 289874 (0.00079) [2022-07-09 14:32:33,841][26022] Updated weights on worker 0-0, policy_version 289884 (0.00085) [2022-07-09 14:32:35,276][25689] Fps is (10 sec: 5713.3, 60 sec: 5731.5, 300 sec: 5709.2). Total num frames: 296848384. Throughput: 0: 5147.2. Samples: 296841932. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:35,276][25689] Avg episode reward: [(0, '-48.217')] [2022-07-09 14:32:35,679][26022] Updated weights on worker 0-0, policy_version 289894 (0.00088) [2022-07-09 14:32:37,226][26022] Updated weights on worker 0-0, policy_version 289904 (0.00091) [2022-07-09 14:32:39,432][26022] Updated weights on worker 0-0, policy_version 289914 (0.00087) [2022-07-09 14:32:40,351][25689] Fps is (10 sec: 5887.1, 60 sec: 5747.7, 300 sec: 5712.1). Total num frames: 296879104. Throughput: 0: 6009.5. Samples: 296876480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:40,351][25689] Avg episode reward: [(0, '-48.450')] [2022-07-09 14:32:40,701][26022] Updated weights on worker 0-0, policy_version 289924 (0.00084) [2022-07-09 14:32:42,779][26022] Updated weights on worker 0-0, policy_version 289934 (0.00084) [2022-07-09 14:32:44,488][26022] Updated weights on worker 0-0, policy_version 289944 (0.00093) [2022-07-09 14:32:45,375][25689] Fps is (10 sec: 5779.5, 60 sec: 5711.8, 300 sec: 5713.1). Total num frames: 296906752. Throughput: 0: 6030.9. Samples: 296911276. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:45,375][25689] Avg episode reward: [(0, '-48.254')] [2022-07-09 14:32:46,154][26022] Updated weights on worker 0-0, policy_version 289954 (0.00088) [2022-07-09 14:32:47,971][26022] Updated weights on worker 0-0, policy_version 289964 (0.00089) [2022-07-09 14:32:50,110][26022] Updated weights on worker 0-0, policy_version 289974 (0.00085) [2022-07-09 14:32:50,388][25689] Fps is (10 sec: 5509.1, 60 sec: 5694.6, 300 sec: 5709.5). Total num frames: 296934400. Throughput: 0: 5154.2. Samples: 296928436. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:50,388][25689] Avg episode reward: [(0, '-49.145')] [2022-07-09 14:32:51,457][26022] Updated weights on worker 0-0, policy_version 289984 (0.00084) [2022-07-09 14:32:53,614][26022] Updated weights on worker 0-0, policy_version 289994 (0.00090) [2022-07-09 14:32:54,811][26022] Updated weights on worker 0-0, policy_version 290004 (0.00085) [2022-07-09 14:32:55,402][25689] Fps is (10 sec: 5923.2, 60 sec: 5712.4, 300 sec: 5714.0). Total num frames: 296966144. Throughput: 0: 6025.3. Samples: 296963198. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:32:55,402][25689] Avg episode reward: [(0, '-49.344')] [2022-07-09 14:32:57,074][26022] Updated weights on worker 0-0, policy_version 290014 (0.00086) [2022-07-09 14:32:58,742][26022] Updated weights on worker 0-0, policy_version 290024 (0.00089) [2022-07-09 14:33:00,456][25689] Fps is (10 sec: 6001.0, 60 sec: 5730.7, 300 sec: 5723.7). Total num frames: 296994816. Throughput: 0: 6020.0. Samples: 296997512. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:00,456][25689] Avg episode reward: [(0, '-49.351')] [2022-07-09 14:33:00,459][26022] Updated weights on worker 0-0, policy_version 290034 (0.00087) [2022-07-09 14:33:02,816][26022] Updated weights on worker 0-0, policy_version 290044 (0.00096) [2022-07-09 14:33:04,299][26022] Updated weights on worker 0-0, policy_version 290054 (0.00092) [2022-07-09 14:33:04,867][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:33:04,882][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000290056_297017344.pth [2022-07-09 14:33:04,882][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000288046_294959104.pth [2022-07-09 14:33:05,527][25689] Fps is (10 sec: 5258.7, 60 sec: 5674.9, 300 sec: 5705.4). Total num frames: 297019392. Throughput: 0: 5888.0. Samples: 297029936. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:05,528][25689] Avg episode reward: [(0, '-48.494')] [2022-07-09 14:33:06,333][26022] Updated weights on worker 0-0, policy_version 290064 (0.00080) [2022-07-09 14:33:08,229][26022] Updated weights on worker 0-0, policy_version 290074 (0.00085) [2022-07-09 14:33:09,772][26022] Updated weights on worker 0-0, policy_version 290084 (0.00087) [2022-07-09 14:33:10,608][25689] Fps is (10 sec: 5547.7, 60 sec: 5735.7, 300 sec: 5710.9). Total num frames: 297051136. Throughput: 0: 5871.8. Samples: 297047162. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:10,608][25689] Avg episode reward: [(0, '-49.657')] [2022-07-09 14:33:11,859][26022] Updated weights on worker 0-0, policy_version 290094 (0.00082) [2022-07-09 14:33:13,283][26022] Updated weights on worker 0-0, policy_version 290104 (0.00079) [2022-07-09 14:33:15,338][26022] Updated weights on worker 0-0, policy_version 290114 (0.00087) [2022-07-09 14:33:15,648][25689] Fps is (10 sec: 5969.8, 60 sec: 5733.5, 300 sec: 5715.2). Total num frames: 297079808. Throughput: 0: 5861.0. Samples: 297081860. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:15,648][25689] Avg episode reward: [(0, '-49.393')] [2022-07-09 14:33:17,074][26022] Updated weights on worker 0-0, policy_version 290124 (0.00087) [2022-07-09 14:33:18,625][26022] Updated weights on worker 0-0, policy_version 290134 (0.00085) [2022-07-09 14:33:20,643][26022] Updated weights on worker 0-0, policy_version 290144 (0.00086) [2022-07-09 14:33:20,702][25689] Fps is (10 sec: 5579.2, 60 sec: 5705.2, 300 sec: 5711.3). Total num frames: 297107456. Throughput: 0: 5876.5. Samples: 297116492. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:20,703][25689] Avg episode reward: [(0, '-48.657')] [2022-07-09 14:33:22,162][26022] Updated weights on worker 0-0, policy_version 290154 (0.00088) [2022-07-09 14:33:24,095][26022] Updated weights on worker 0-0, policy_version 290164 (0.00055) [2022-07-09 14:33:25,742][25689] Fps is (10 sec: 5680.7, 60 sec: 5720.5, 300 sec: 5714.1). Total num frames: 297137152. Throughput: 0: 5141.8. Samples: 297133876. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:25,743][25689] Avg episode reward: [(0, '-48.694')] [2022-07-09 14:33:25,824][26022] Updated weights on worker 0-0, policy_version 290174 (0.00103) [2022-07-09 14:33:27,620][26022] Updated weights on worker 0-0, policy_version 290184 (0.00092) [2022-07-09 14:33:29,413][26022] Updated weights on worker 0-0, policy_version 290194 (0.00300) [2022-07-09 14:33:30,753][25689] Fps is (10 sec: 5807.5, 60 sec: 5721.8, 300 sec: 5710.8). Total num frames: 297165824. Throughput: 0: 6015.4. Samples: 297168344. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:30,753][25689] Avg episode reward: [(0, '-48.504')] [2022-07-09 14:33:31,257][26022] Updated weights on worker 0-0, policy_version 290204 (0.00085) [2022-07-09 14:33:32,943][26022] Updated weights on worker 0-0, policy_version 290214 (0.00660) [2022-07-09 14:33:34,811][26022] Updated weights on worker 0-0, policy_version 290224 (0.00104) [2022-07-09 14:33:35,836][25689] Fps is (10 sec: 5680.9, 60 sec: 5715.1, 300 sec: 5710.5). Total num frames: 297194496. Throughput: 0: 5984.1. Samples: 297202672. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:35,838][25689] Avg episode reward: [(0, '-47.662')] [2022-07-09 14:33:36,576][26022] Updated weights on worker 0-0, policy_version 290234 (0.00082) [2022-07-09 14:33:38,399][26022] Updated weights on worker 0-0, policy_version 290244 (0.00094) [2022-07-09 14:33:40,115][26022] Updated weights on worker 0-0, policy_version 290254 (0.00481) [2022-07-09 14:33:40,917][25689] Fps is (10 sec: 5843.2, 60 sec: 5714.6, 300 sec: 5716.8). Total num frames: 297225216. Throughput: 0: 5118.0. Samples: 297219952. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:40,917][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 14:33:41,990][26022] Updated weights on worker 0-0, policy_version 290264 (0.00081) [2022-07-09 14:33:43,675][26022] Updated weights on worker 0-0, policy_version 290274 (0.00085) [2022-07-09 14:33:45,482][26022] Updated weights on worker 0-0, policy_version 290284 (0.00086) [2022-07-09 14:33:45,926][25689] Fps is (10 sec: 5785.1, 60 sec: 5716.0, 300 sec: 5717.7). Total num frames: 297252864. Throughput: 0: 5989.4. Samples: 297254762. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:45,926][25689] Avg episode reward: [(0, '-47.154')] [2022-07-09 14:33:47,192][26022] Updated weights on worker 0-0, policy_version 290294 (0.00090) [2022-07-09 14:33:49,145][26022] Updated weights on worker 0-0, policy_version 290304 (0.00089) [2022-07-09 14:33:50,689][26022] Updated weights on worker 0-0, policy_version 290314 (0.00103) [2022-07-09 14:33:50,972][25689] Fps is (10 sec: 5703.0, 60 sec: 5746.7, 300 sec: 5720.8). Total num frames: 297282560. Throughput: 0: 5993.3. Samples: 297289524. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:50,973][25689] Avg episode reward: [(0, '-46.746')] [2022-07-09 14:33:52,706][26022] Updated weights on worker 0-0, policy_version 290324 (0.00092) [2022-07-09 14:33:54,248][26022] Updated weights on worker 0-0, policy_version 290334 (0.00092) [2022-07-09 14:33:55,986][25689] Fps is (10 sec: 5700.1, 60 sec: 5679.1, 300 sec: 5714.1). Total num frames: 297310208. Throughput: 0: 5144.9. Samples: 297306340. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:33:55,987][25689] Avg episode reward: [(0, '-46.769')] [2022-07-09 14:33:56,291][26022] Updated weights on worker 0-0, policy_version 290344 (0.00095) [2022-07-09 14:33:58,122][26022] Updated weights on worker 0-0, policy_version 290354 (0.00088) [2022-07-09 14:33:59,684][26022] Updated weights on worker 0-0, policy_version 290364 (0.00080) [2022-07-09 14:34:01,043][25689] Fps is (10 sec: 5592.5, 60 sec: 5678.8, 300 sec: 5720.5). Total num frames: 297338880. Throughput: 0: 5995.6. Samples: 297340618. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:34:01,043][25689] Avg episode reward: [(0, '-47.308')] [2022-07-09 14:34:01,845][26022] Updated weights on worker 0-0, policy_version 290374 (0.00099) [2022-07-09 14:34:03,978][26022] Updated weights on worker 0-0, policy_version 290384 (0.00082) [2022-07-09 14:34:05,534][26022] Updated weights on worker 0-0, policy_version 290394 (0.00082) [2022-07-09 14:34:06,052][25689] Fps is (10 sec: 5595.2, 60 sec: 5735.5, 300 sec: 5717.3). Total num frames: 297366528. Throughput: 0: 5875.2. Samples: 297373006. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:34:06,052][25689] Avg episode reward: [(0, '-47.024')] [2022-07-09 14:34:07,458][26022] Updated weights on worker 0-0, policy_version 290404 (0.00087) [2022-07-09 14:34:08,912][26022] Updated weights on worker 0-0, policy_version 290414 (0.00089) [2022-07-09 14:34:10,965][26022] Updated weights on worker 0-0, policy_version 290424 (0.00084) [2022-07-09 14:34:11,057][25689] Fps is (10 sec: 5623.7, 60 sec: 5691.7, 300 sec: 5717.2). Total num frames: 297395200. Throughput: 0: 5009.3. Samples: 297390138. Policy #0 lag: (min: 0.0, avg: 8.1, max: 17.0) [2022-07-09 14:34:11,058][25689] Avg episode reward: [(0, '-47.858')] [2022-07-09 14:34:12,599][26022] Updated weights on worker 0-0, policy_version 290434 (0.00092) [2022-07-09 14:34:14,418][26022] Updated weights on worker 0-0, policy_version 290444 (0.00087) [2022-07-09 14:34:16,095][25689] Fps is (10 sec: 5709.8, 60 sec: 5692.0, 300 sec: 5721.7). Total num frames: 297423872. Throughput: 0: 5883.2. Samples: 297424644. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:16,095][25689] Avg episode reward: [(0, '-48.805')] [2022-07-09 14:34:16,161][26022] Updated weights on worker 0-0, policy_version 290454 (0.00081) [2022-07-09 14:34:17,886][26022] Updated weights on worker 0-0, policy_version 290464 (0.00083) [2022-07-09 14:34:19,752][26022] Updated weights on worker 0-0, policy_version 290474 (0.00091) [2022-07-09 14:34:21,141][25689] Fps is (10 sec: 5686.7, 60 sec: 5709.7, 300 sec: 5710.8). Total num frames: 297452544. Throughput: 0: 5911.7. Samples: 297459436. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:21,142][25689] Avg episode reward: [(0, '-48.520')] [2022-07-09 14:34:21,427][26022] Updated weights on worker 0-0, policy_version 290484 (0.00090) [2022-07-09 14:34:23,354][26022] Updated weights on worker 0-0, policy_version 290494 (0.00534) [2022-07-09 14:34:25,048][26022] Updated weights on worker 0-0, policy_version 290504 (0.00090) [2022-07-09 14:34:26,149][25689] Fps is (10 sec: 5703.6, 60 sec: 5695.8, 300 sec: 5717.7). Total num frames: 297481216. Throughput: 0: 5154.3. Samples: 297476592. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:26,149][25689] Avg episode reward: [(0, '-48.598')] [2022-07-09 14:34:26,951][26022] Updated weights on worker 0-0, policy_version 290514 (0.00086) [2022-07-09 14:34:28,692][26022] Updated weights on worker 0-0, policy_version 290524 (0.00090) [2022-07-09 14:34:30,613][26022] Updated weights on worker 0-0, policy_version 290534 (0.00082) [2022-07-09 14:34:31,165][25689] Fps is (10 sec: 5822.7, 60 sec: 5712.2, 300 sec: 5717.7). Total num frames: 297510912. Throughput: 0: 6005.1. Samples: 297510888. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:31,166][25689] Avg episode reward: [(0, '-48.493')] [2022-07-09 14:34:32,288][26022] Updated weights on worker 0-0, policy_version 290544 (0.00092) [2022-07-09 14:34:34,118][26022] Updated weights on worker 0-0, policy_version 290554 (0.00089) [2022-07-09 14:34:35,995][26022] Updated weights on worker 0-0, policy_version 290564 (0.00086) [2022-07-09 14:34:36,199][25689] Fps is (10 sec: 5705.9, 60 sec: 5700.0, 300 sec: 5711.5). Total num frames: 297538560. Throughput: 0: 5986.4. Samples: 297544994. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:36,199][25689] Avg episode reward: [(0, '-48.321')] [2022-07-09 14:34:37,803][26022] Updated weights on worker 0-0, policy_version 290574 (0.00083) [2022-07-09 14:34:39,616][26022] Updated weights on worker 0-0, policy_version 290584 (0.00086) [2022-07-09 14:34:41,236][25689] Fps is (10 sec: 5592.7, 60 sec: 5670.1, 300 sec: 5711.8). Total num frames: 297567232. Throughput: 0: 5107.8. Samples: 297562074. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:41,236][25689] Avg episode reward: [(0, '-48.826')] [2022-07-09 14:34:41,456][26022] Updated weights on worker 0-0, policy_version 290594 (0.00089) [2022-07-09 14:34:43,164][26022] Updated weights on worker 0-0, policy_version 290604 (0.00093) [2022-07-09 14:34:45,025][26022] Updated weights on worker 0-0, policy_version 290614 (0.00078) [2022-07-09 14:34:46,248][25689] Fps is (10 sec: 5706.4, 60 sec: 5686.8, 300 sec: 5711.7). Total num frames: 297595904. Throughput: 0: 5971.9. Samples: 297596622. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:46,248][25689] Avg episode reward: [(0, '-47.771')] [2022-07-09 14:34:46,794][26022] Updated weights on worker 0-0, policy_version 290624 (0.00089) [2022-07-09 14:34:48,319][26022] Updated weights on worker 0-0, policy_version 290634 (0.00090) [2022-07-09 14:34:50,244][26022] Updated weights on worker 0-0, policy_version 290644 (0.00086) [2022-07-09 14:34:51,271][25689] Fps is (10 sec: 5714.4, 60 sec: 5672.0, 300 sec: 5711.5). Total num frames: 297624576. Throughput: 0: 5991.3. Samples: 297631344. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:51,271][25689] Avg episode reward: [(0, '-48.344')] [2022-07-09 14:34:52,065][26022] Updated weights on worker 0-0, policy_version 290654 (0.00079) [2022-07-09 14:34:53,715][26022] Updated weights on worker 0-0, policy_version 290664 (0.00090) [2022-07-09 14:34:55,535][26022] Updated weights on worker 0-0, policy_version 290674 (0.00090) [2022-07-09 14:34:56,276][25689] Fps is (10 sec: 5718.4, 60 sec: 5689.8, 300 sec: 5709.6). Total num frames: 297653248. Throughput: 0: 5162.1. Samples: 297648634. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:34:56,276][25689] Avg episode reward: [(0, '-48.799')] [2022-07-09 14:34:57,511][26022] Updated weights on worker 0-0, policy_version 290684 (0.00083) [2022-07-09 14:34:59,391][26022] Updated weights on worker 0-0, policy_version 290694 (0.00088) [2022-07-09 14:35:00,949][26022] Updated weights on worker 0-0, policy_version 290704 (0.00083) [2022-07-09 14:35:01,349][25689] Fps is (10 sec: 5791.3, 60 sec: 5705.2, 300 sec: 5719.4). Total num frames: 297682944. Throughput: 0: 6010.4. Samples: 297682964. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:01,353][25689] Avg episode reward: [(0, '-49.344')] [2022-07-09 14:35:03,142][26022] Updated weights on worker 0-0, policy_version 290714 (0.00095) [2022-07-09 14:35:04,830][26022] Updated weights on worker 0-0, policy_version 290724 (0.00080) [2022-07-09 14:35:04,934][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:35:04,945][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000290725_297702400.pth [2022-07-09 14:35:04,946][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000288715_295644160.pth [2022-07-09 14:35:06,371][25689] Fps is (10 sec: 5579.0, 60 sec: 5687.1, 300 sec: 5706.1). Total num frames: 297709568. Throughput: 0: 5909.0. Samples: 297715530. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:06,373][25689] Avg episode reward: [(0, '-49.174')] [2022-07-09 14:35:06,852][26022] Updated weights on worker 0-0, policy_version 290734 (0.00096) [2022-07-09 14:35:08,266][26022] Updated weights on worker 0-0, policy_version 290744 (0.00093) [2022-07-09 14:35:10,318][26022] Updated weights on worker 0-0, policy_version 290754 (0.00085) [2022-07-09 14:35:11,374][25689] Fps is (10 sec: 5618.3, 60 sec: 5704.3, 300 sec: 5713.5). Total num frames: 297739264. Throughput: 0: 5051.7. Samples: 297732898. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:11,375][25689] Avg episode reward: [(0, '-49.350')] [2022-07-09 14:35:11,781][26022] Updated weights on worker 0-0, policy_version 290764 (0.00095) [2022-07-09 14:35:13,831][26022] Updated weights on worker 0-0, policy_version 290774 (0.00103) [2022-07-09 14:35:15,568][26022] Updated weights on worker 0-0, policy_version 290784 (0.00097) [2022-07-09 14:35:16,387][25689] Fps is (10 sec: 5827.6, 60 sec: 5706.6, 300 sec: 5712.3). Total num frames: 297767936. Throughput: 0: 5913.7. Samples: 297767564. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:16,388][25689] Avg episode reward: [(0, '-48.804')] [2022-07-09 14:35:17,391][26022] Updated weights on worker 0-0, policy_version 290794 (0.00087) [2022-07-09 14:35:19,211][26022] Updated weights on worker 0-0, policy_version 290804 (0.00092) [2022-07-09 14:35:20,875][26022] Updated weights on worker 0-0, policy_version 290814 (0.00087) [2022-07-09 14:35:21,478][25689] Fps is (10 sec: 5675.3, 60 sec: 5702.4, 300 sec: 5710.7). Total num frames: 297796608. Throughput: 0: 5924.6. Samples: 297802216. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:21,487][25689] Avg episode reward: [(0, '-48.955')] [2022-07-09 14:35:22,628][26022] Updated weights on worker 0-0, policy_version 290824 (0.00095) [2022-07-09 14:35:24,371][26022] Updated weights on worker 0-0, policy_version 290834 (0.00083) [2022-07-09 14:35:26,131][26022] Updated weights on worker 0-0, policy_version 290844 (0.00081) [2022-07-09 14:35:26,509][25689] Fps is (10 sec: 5766.0, 60 sec: 5717.1, 300 sec: 5711.3). Total num frames: 297826304. Throughput: 0: 5156.7. Samples: 297819378. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:26,511][25689] Avg episode reward: [(0, '-48.772')] [2022-07-09 14:35:27,827][26022] Updated weights on worker 0-0, policy_version 290854 (0.00085) [2022-07-09 14:35:29,735][26022] Updated weights on worker 0-0, policy_version 290864 (0.00079) [2022-07-09 14:35:31,529][25689] Fps is (10 sec: 5705.1, 60 sec: 5682.9, 300 sec: 5705.1). Total num frames: 297853952. Throughput: 0: 5999.7. Samples: 297853824. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:31,530][25689] Avg episode reward: [(0, '-48.500')] [2022-07-09 14:35:31,651][26022] Updated weights on worker 0-0, policy_version 290874 (0.00091) [2022-07-09 14:35:33,456][26022] Updated weights on worker 0-0, policy_version 290884 (0.00081) [2022-07-09 14:35:35,020][26022] Updated weights on worker 0-0, policy_version 290894 (0.00081) [2022-07-09 14:35:36,568][25689] Fps is (10 sec: 5599.5, 60 sec: 5699.4, 300 sec: 5709.3). Total num frames: 297882624. Throughput: 0: 5978.6. Samples: 297888218. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:36,569][25689] Avg episode reward: [(0, '-49.335')] [2022-07-09 14:35:37,030][26022] Updated weights on worker 0-0, policy_version 290904 (0.00090) [2022-07-09 14:35:38,503][26022] Updated weights on worker 0-0, policy_version 290914 (0.00084) [2022-07-09 14:35:40,720][26022] Updated weights on worker 0-0, policy_version 290924 (0.00083) [2022-07-09 14:35:41,684][25689] Fps is (10 sec: 5848.4, 60 sec: 5725.7, 300 sec: 5711.9). Total num frames: 297913344. Throughput: 0: 5101.1. Samples: 297905292. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:41,685][25689] Avg episode reward: [(0, '-49.939')] [2022-07-09 14:35:42,123][26022] Updated weights on worker 0-0, policy_version 290934 (0.00090) [2022-07-09 14:35:44,070][26022] Updated weights on worker 0-0, policy_version 290944 (0.00085) [2022-07-09 14:35:46,058][26022] Updated weights on worker 0-0, policy_version 290954 (0.00090) [2022-07-09 14:35:46,785][25689] Fps is (10 sec: 5612.3, 60 sec: 5683.5, 300 sec: 5699.7). Total num frames: 297939968. Throughput: 0: 5939.8. Samples: 297939812. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:46,787][25689] Avg episode reward: [(0, '-49.043')] [2022-07-09 14:35:47,510][26022] Updated weights on worker 0-0, policy_version 290964 (0.00088) [2022-07-09 14:35:49,684][26022] Updated weights on worker 0-0, policy_version 290974 (0.00085) [2022-07-09 14:35:51,076][26022] Updated weights on worker 0-0, policy_version 290984 (0.00092) [2022-07-09 14:35:51,821][25689] Fps is (10 sec: 5556.1, 60 sec: 5699.2, 300 sec: 5706.1). Total num frames: 297969664. Throughput: 0: 5920.0. Samples: 297973952. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:51,822][25689] Avg episode reward: [(0, '-48.994')] [2022-07-09 14:35:53,300][26022] Updated weights on worker 0-0, policy_version 290994 (0.00083) [2022-07-09 14:35:54,933][26022] Updated weights on worker 0-0, policy_version 291004 (0.00097) [2022-07-09 14:35:56,744][26022] Updated weights on worker 0-0, policy_version 291014 (0.00086) [2022-07-09 14:35:56,837][25689] Fps is (10 sec: 5908.8, 60 sec: 5715.1, 300 sec: 5707.7). Total num frames: 297999360. Throughput: 0: 5909.5. Samples: 298007998. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:35:56,837][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 14:35:58,663][26022] Updated weights on worker 0-0, policy_version 291024 (0.00087) [2022-07-09 14:36:00,141][26022] Updated weights on worker 0-0, policy_version 291034 (0.00098) [2022-07-09 14:36:01,893][25689] Fps is (10 sec: 5693.3, 60 sec: 5682.9, 300 sec: 5710.2). Total num frames: 298027008. Throughput: 0: 5930.2. Samples: 298025134. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:01,894][25689] Avg episode reward: [(0, '-48.603')] [2022-07-09 14:36:02,503][26022] Updated weights on worker 0-0, policy_version 291044 (0.00115) [2022-07-09 14:36:04,289][26022] Updated weights on worker 0-0, policy_version 291054 (0.00619) [2022-07-09 14:36:06,235][26022] Updated weights on worker 0-0, policy_version 291064 (0.00083) [2022-07-09 14:36:06,905][25689] Fps is (10 sec: 5390.2, 60 sec: 5683.8, 300 sec: 5703.3). Total num frames: 298053632. Throughput: 0: 5826.3. Samples: 298057038. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:06,906][25689] Avg episode reward: [(0, '-48.453')] [2022-07-09 14:36:07,808][26022] Updated weights on worker 0-0, policy_version 291074 (0.00085) [2022-07-09 14:36:09,784][26022] Updated weights on worker 0-0, policy_version 291084 (0.00085) [2022-07-09 14:36:11,454][26022] Updated weights on worker 0-0, policy_version 291094 (0.00087) [2022-07-09 14:36:11,919][25689] Fps is (10 sec: 5413.3, 60 sec: 5648.9, 300 sec: 5700.1). Total num frames: 298081280. Throughput: 0: 5845.4. Samples: 298091432. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:11,919][25689] Avg episode reward: [(0, '-49.123')] [2022-07-09 14:36:13,303][26022] Updated weights on worker 0-0, policy_version 291104 (0.00084) [2022-07-09 14:36:15,202][26022] Updated weights on worker 0-0, policy_version 291114 (0.00087) [2022-07-09 14:36:16,803][26022] Updated weights on worker 0-0, policy_version 291124 (0.00088) [2022-07-09 14:36:16,973][25689] Fps is (10 sec: 5695.9, 60 sec: 5662.0, 300 sec: 5703.7). Total num frames: 298110976. Throughput: 0: 5001.3. Samples: 298108706. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:16,973][25689] Avg episode reward: [(0, '-49.562')] [2022-07-09 14:36:18,711][26022] Updated weights on worker 0-0, policy_version 291134 (0.00089) [2022-07-09 14:36:20,427][26022] Updated weights on worker 0-0, policy_version 291144 (0.00487) [2022-07-09 14:36:22,040][25689] Fps is (10 sec: 5868.4, 60 sec: 5681.2, 300 sec: 5703.0). Total num frames: 298140672. Throughput: 0: 5858.6. Samples: 298143164. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:22,040][25689] Avg episode reward: [(0, '-49.556')] [2022-07-09 14:36:22,317][26022] Updated weights on worker 0-0, policy_version 291154 (0.00090) [2022-07-09 14:36:24,056][26022] Updated weights on worker 0-0, policy_version 291164 (0.00087) [2022-07-09 14:36:25,591][26022] Updated weights on worker 0-0, policy_version 291174 (0.00085) [2022-07-09 14:36:27,058][25689] Fps is (10 sec: 5686.0, 60 sec: 5648.6, 300 sec: 5695.9). Total num frames: 298168320. Throughput: 0: 5995.8. Samples: 298177870. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:27,059][25689] Avg episode reward: [(0, '-49.048')] [2022-07-09 14:36:27,635][26022] Updated weights on worker 0-0, policy_version 291184 (0.00095) [2022-07-09 14:36:29,535][26022] Updated weights on worker 0-0, policy_version 291194 (0.00089) [2022-07-09 14:36:31,172][26022] Updated weights on worker 0-0, policy_version 291204 (0.00089) [2022-07-09 14:36:32,141][25689] Fps is (10 sec: 5778.2, 60 sec: 5693.4, 300 sec: 5708.2). Total num frames: 298199040. Throughput: 0: 5126.0. Samples: 298195090. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:32,142][25689] Avg episode reward: [(0, '-48.677')] [2022-07-09 14:36:33,045][26022] Updated weights on worker 0-0, policy_version 291214 (0.00090) [2022-07-09 14:36:34,571][26022] Updated weights on worker 0-0, policy_version 291224 (0.00083) [2022-07-09 14:36:36,535][26022] Updated weights on worker 0-0, policy_version 291234 (0.00085) [2022-07-09 14:36:37,145][25689] Fps is (10 sec: 5786.2, 60 sec: 5679.7, 300 sec: 5702.5). Total num frames: 298226688. Throughput: 0: 6009.2. Samples: 298229928. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2022-07-09 14:36:37,146][25689] Avg episode reward: [(0, '-47.325')] [2022-07-09 14:36:38,367][26022] Updated weights on worker 0-0, policy_version 291244 (0.00090) [2022-07-09 14:36:39,749][26022] Updated weights on worker 0-0, policy_version 291254 (0.00083) [2022-07-09 14:36:41,819][26022] Updated weights on worker 0-0, policy_version 291264 (0.00082) [2022-07-09 14:36:42,190][25689] Fps is (10 sec: 5706.1, 60 sec: 5669.5, 300 sec: 5701.7). Total num frames: 298256384. Throughput: 0: 6017.9. Samples: 298264430. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:36:42,191][25689] Avg episode reward: [(0, '-46.549')] [2022-07-09 14:36:43,378][26022] Updated weights on worker 0-0, policy_version 291274 (0.00087) [2022-07-09 14:36:45,294][26022] Updated weights on worker 0-0, policy_version 291284 (0.00082) [2022-07-09 14:36:47,059][26022] Updated weights on worker 0-0, policy_version 291294 (0.00088) [2022-07-09 14:36:47,270][25689] Fps is (10 sec: 5866.3, 60 sec: 5722.3, 300 sec: 5703.8). Total num frames: 298286080. Throughput: 0: 5150.2. Samples: 298281964. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:36:47,270][25689] Avg episode reward: [(0, '-45.461')] [2022-07-09 14:36:48,803][26022] Updated weights on worker 0-0, policy_version 291304 (0.00088) [2022-07-09 14:36:50,646][26022] Updated weights on worker 0-0, policy_version 291314 (0.00094) [2022-07-09 14:36:52,354][25689] Fps is (10 sec: 5742.8, 60 sec: 5700.8, 300 sec: 5695.7). Total num frames: 298314752. Throughput: 0: 6013.9. Samples: 298316650. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:36:52,355][25689] Avg episode reward: [(0, '-45.913')] [2022-07-09 14:36:52,404][26022] Updated weights on worker 0-0, policy_version 291324 (0.00080) [2022-07-09 14:36:54,158][26022] Updated weights on worker 0-0, policy_version 291334 (0.00093) [2022-07-09 14:36:56,052][26022] Updated weights on worker 0-0, policy_version 291344 (0.00081) [2022-07-09 14:36:57,377][25689] Fps is (10 sec: 5774.6, 60 sec: 5700.1, 300 sec: 5703.5). Total num frames: 298344448. Throughput: 0: 5998.2. Samples: 298351284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:36:57,379][25689] Avg episode reward: [(0, '-45.990')] [2022-07-09 14:36:57,731][26022] Updated weights on worker 0-0, policy_version 291354 (0.00095) [2022-07-09 14:36:59,642][26022] Updated weights on worker 0-0, policy_version 291364 (0.00097) [2022-07-09 14:37:01,412][26022] Updated weights on worker 0-0, policy_version 291374 (0.00082) [2022-07-09 14:37:02,444][25689] Fps is (10 sec: 5378.8, 60 sec: 5648.4, 300 sec: 5692.2). Total num frames: 298369024. Throughput: 0: 5129.6. Samples: 298368324. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:02,445][25689] Avg episode reward: [(0, '-45.683')] [2022-07-09 14:37:03,471][26022] Updated weights on worker 0-0, policy_version 291384 (0.00087) [2022-07-09 14:37:05,008][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:37:05,028][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000291393_298386432.pth [2022-07-09 14:37:05,028][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000289385_296330240.pth [2022-07-09 14:37:05,402][26022] Updated weights on worker 0-0, policy_version 291394 (0.00083) [2022-07-09 14:37:07,003][26022] Updated weights on worker 0-0, policy_version 291404 (0.00083) [2022-07-09 14:37:07,460][25689] Fps is (10 sec: 5484.4, 60 sec: 5715.7, 300 sec: 5702.4). Total num frames: 298399744. Throughput: 0: 5886.9. Samples: 298400822. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:07,460][25689] Avg episode reward: [(0, '-46.345')] [2022-07-09 14:37:08,791][26022] Updated weights on worker 0-0, policy_version 291414 (0.00106) [2022-07-09 14:37:10,525][26022] Updated weights on worker 0-0, policy_version 291424 (0.00085) [2022-07-09 14:37:12,407][26022] Updated weights on worker 0-0, policy_version 291434 (0.00098) [2022-07-09 14:37:12,493][25689] Fps is (10 sec: 5910.3, 60 sec: 5730.8, 300 sec: 5702.1). Total num frames: 298428416. Throughput: 0: 5895.9. Samples: 298435386. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:12,494][25689] Avg episode reward: [(0, '-46.120')] [2022-07-09 14:37:14,153][26022] Updated weights on worker 0-0, policy_version 291444 (0.00093) [2022-07-09 14:37:16,011][26022] Updated weights on worker 0-0, policy_version 291454 (0.00084) [2022-07-09 14:37:17,511][25689] Fps is (10 sec: 5705.3, 60 sec: 5717.3, 300 sec: 5700.5). Total num frames: 298457088. Throughput: 0: 5025.8. Samples: 298452470. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:17,512][25689] Avg episode reward: [(0, '-46.174')] [2022-07-09 14:37:17,691][26022] Updated weights on worker 0-0, policy_version 291464 (0.00083) [2022-07-09 14:37:19,647][26022] Updated weights on worker 0-0, policy_version 291474 (0.00082) [2022-07-09 14:37:21,475][26022] Updated weights on worker 0-0, policy_version 291484 (0.00089) [2022-07-09 14:37:22,617][25689] Fps is (10 sec: 5664.2, 60 sec: 5696.6, 300 sec: 5698.9). Total num frames: 298485760. Throughput: 0: 5875.9. Samples: 298486858. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:22,618][25689] Avg episode reward: [(0, '-46.249')] [2022-07-09 14:37:23,071][26022] Updated weights on worker 0-0, policy_version 291494 (0.00088) [2022-07-09 14:37:24,861][26022] Updated weights on worker 0-0, policy_version 291504 (0.00084) [2022-07-09 14:37:26,735][26022] Updated weights on worker 0-0, policy_version 291514 (0.00090) [2022-07-09 14:37:27,637][25689] Fps is (10 sec: 5662.9, 60 sec: 5713.4, 300 sec: 5698.9). Total num frames: 298514432. Throughput: 0: 5968.9. Samples: 298521258. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:27,637][25689] Avg episode reward: [(0, '-45.958')] [2022-07-09 14:37:28,666][26022] Updated weights on worker 0-0, policy_version 291524 (0.00088) [2022-07-09 14:37:30,366][26022] Updated weights on worker 0-0, policy_version 291534 (0.00097) [2022-07-09 14:37:32,017][26022] Updated weights on worker 0-0, policy_version 291544 (0.00085) [2022-07-09 14:37:32,706][25689] Fps is (10 sec: 5785.1, 60 sec: 5697.8, 300 sec: 5701.3). Total num frames: 298544128. Throughput: 0: 5091.8. Samples: 298538308. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:32,707][25689] Avg episode reward: [(0, '-46.679')] [2022-07-09 14:37:33,834][26022] Updated weights on worker 0-0, policy_version 291554 (0.00089) [2022-07-09 14:37:35,766][26022] Updated weights on worker 0-0, policy_version 291564 (0.00087) [2022-07-09 14:37:37,512][26022] Updated weights on worker 0-0, policy_version 291574 (0.00095) [2022-07-09 14:37:37,735][25689] Fps is (10 sec: 5780.4, 60 sec: 5712.4, 300 sec: 5695.3). Total num frames: 298572800. Throughput: 0: 5949.3. Samples: 298572788. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:37,735][25689] Avg episode reward: [(0, '-46.928')] [2022-07-09 14:37:39,319][26022] Updated weights on worker 0-0, policy_version 291584 (0.00086) [2022-07-09 14:37:41,021][26022] Updated weights on worker 0-0, policy_version 291594 (0.00080) [2022-07-09 14:37:42,682][26022] Updated weights on worker 0-0, policy_version 291604 (0.00520) [2022-07-09 14:37:42,849][25689] Fps is (10 sec: 5754.6, 60 sec: 5705.9, 300 sec: 5700.5). Total num frames: 298602496. Throughput: 0: 5973.6. Samples: 298607716. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:42,850][25689] Avg episode reward: [(0, '-47.341')] [2022-07-09 14:37:44,491][26022] Updated weights on worker 0-0, policy_version 291614 (0.00087) [2022-07-09 14:37:46,335][26022] Updated weights on worker 0-0, policy_version 291624 (0.00084) [2022-07-09 14:37:47,863][25689] Fps is (10 sec: 5864.0, 60 sec: 5712.1, 300 sec: 5707.3). Total num frames: 298632192. Throughput: 0: 5137.8. Samples: 298625174. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:47,863][25689] Avg episode reward: [(0, '-47.592')] [2022-07-09 14:37:47,917][26022] Updated weights on worker 0-0, policy_version 291634 (0.00091) [2022-07-09 14:37:49,748][26022] Updated weights on worker 0-0, policy_version 291644 (0.00093) [2022-07-09 14:37:51,511][26022] Updated weights on worker 0-0, policy_version 291654 (0.00084) [2022-07-09 14:37:52,875][25689] Fps is (10 sec: 5821.9, 60 sec: 5718.9, 300 sec: 5697.1). Total num frames: 298660864. Throughput: 0: 6037.8. Samples: 298660080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:52,875][25689] Avg episode reward: [(0, '-47.689')] [2022-07-09 14:37:53,433][26022] Updated weights on worker 0-0, policy_version 291664 (0.00086) [2022-07-09 14:37:55,064][26022] Updated weights on worker 0-0, policy_version 291674 (0.00095) [2022-07-09 14:37:56,887][26022] Updated weights on worker 0-0, policy_version 291684 (0.00091) [2022-07-09 14:37:57,879][25689] Fps is (10 sec: 5725.3, 60 sec: 5703.8, 300 sec: 5698.0). Total num frames: 298689536. Throughput: 0: 6060.5. Samples: 298694872. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:37:57,879][25689] Avg episode reward: [(0, '-47.399')] [2022-07-09 14:37:58,628][26022] Updated weights on worker 0-0, policy_version 291694 (0.00092) [2022-07-09 14:38:00,407][26022] Updated weights on worker 0-0, policy_version 291704 (0.00085) [2022-07-09 14:38:02,611][26022] Updated weights on worker 0-0, policy_version 291714 (0.00086) [2022-07-09 14:38:02,986][25689] Fps is (10 sec: 5468.7, 60 sec: 5733.8, 300 sec: 5704.2). Total num frames: 298716160. Throughput: 0: 5181.6. Samples: 298712058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:02,987][25689] Avg episode reward: [(0, '-46.562')] [2022-07-09 14:38:04,328][26022] Updated weights on worker 0-0, policy_version 291724 (0.00090) [2022-07-09 14:38:06,268][26022] Updated weights on worker 0-0, policy_version 291734 (0.00087) [2022-07-09 14:38:07,927][26022] Updated weights on worker 0-0, policy_version 291744 (0.00086) [2022-07-09 14:38:08,003][25689] Fps is (10 sec: 5663.7, 60 sec: 5733.6, 300 sec: 5702.0). Total num frames: 298746880. Throughput: 0: 5921.9. Samples: 298744446. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:08,004][25689] Avg episode reward: [(0, '-45.990')] [2022-07-09 14:38:09,849][26022] Updated weights on worker 0-0, policy_version 291754 (0.00094) [2022-07-09 14:38:11,378][26022] Updated weights on worker 0-0, policy_version 291764 (0.00092) [2022-07-09 14:38:13,039][25689] Fps is (10 sec: 5602.1, 60 sec: 5682.6, 300 sec: 5691.7). Total num frames: 298772480. Throughput: 0: 5878.8. Samples: 298778624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:13,040][25689] Avg episode reward: [(0, '-45.953')] [2022-07-09 14:38:13,393][26022] Updated weights on worker 0-0, policy_version 291774 (0.00094) [2022-07-09 14:38:14,876][26022] Updated weights on worker 0-0, policy_version 291784 (0.00092) [2022-07-09 14:38:16,908][26022] Updated weights on worker 0-0, policy_version 291794 (0.00089) [2022-07-09 14:38:18,064][25689] Fps is (10 sec: 5598.3, 60 sec: 5715.9, 300 sec: 5702.6). Total num frames: 298803200. Throughput: 0: 5009.0. Samples: 298795978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:18,064][25689] Avg episode reward: [(0, '-45.308')] [2022-07-09 14:38:18,598][26022] Updated weights on worker 0-0, policy_version 291804 (0.00083) [2022-07-09 14:38:20,550][26022] Updated weights on worker 0-0, policy_version 291814 (0.00088) [2022-07-09 14:38:22,168][26022] Updated weights on worker 0-0, policy_version 291824 (0.00097) [2022-07-09 14:38:23,116][25689] Fps is (10 sec: 5792.4, 60 sec: 5704.0, 300 sec: 5695.5). Total num frames: 298830848. Throughput: 0: 5889.5. Samples: 298830614. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:23,117][25689] Avg episode reward: [(0, '-46.165')] [2022-07-09 14:38:24,182][26022] Updated weights on worker 0-0, policy_version 291834 (0.00087) [2022-07-09 14:38:25,620][26022] Updated weights on worker 0-0, policy_version 291844 (0.00091) [2022-07-09 14:38:27,659][26022] Updated weights on worker 0-0, policy_version 291854 (0.00087) [2022-07-09 14:38:28,139][25689] Fps is (10 sec: 5691.7, 60 sec: 5720.7, 300 sec: 5698.7). Total num frames: 298860544. Throughput: 0: 5984.5. Samples: 298864946. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:28,139][25689] Avg episode reward: [(0, '-46.376')] [2022-07-09 14:38:29,503][26022] Updated weights on worker 0-0, policy_version 291864 (0.00090) [2022-07-09 14:38:31,345][26022] Updated weights on worker 0-0, policy_version 291874 (0.00083) [2022-07-09 14:38:33,006][26022] Updated weights on worker 0-0, policy_version 291884 (0.00083) [2022-07-09 14:38:33,175][25689] Fps is (10 sec: 5904.5, 60 sec: 5723.8, 300 sec: 5703.1). Total num frames: 298890240. Throughput: 0: 5994.0. Samples: 298899316. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:33,177][25689] Avg episode reward: [(0, '-46.744')] [2022-07-09 14:38:34,803][26022] Updated weights on worker 0-0, policy_version 291894 (0.00085) [2022-07-09 14:38:36,301][26022] Updated weights on worker 0-0, policy_version 291904 (0.00082) [2022-07-09 14:38:38,181][25689] Fps is (10 sec: 5710.5, 60 sec: 5709.0, 300 sec: 5694.2). Total num frames: 298917888. Throughput: 0: 6003.0. Samples: 298916740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:38,181][25689] Avg episode reward: [(0, '-47.740')] [2022-07-09 14:38:38,682][26022] Updated weights on worker 0-0, policy_version 291914 (0.00091) [2022-07-09 14:38:40,014][26022] Updated weights on worker 0-0, policy_version 291924 (0.00087) [2022-07-09 14:38:41,906][26022] Updated weights on worker 0-0, policy_version 291934 (0.00084) [2022-07-09 14:38:43,298][25689] Fps is (10 sec: 5665.0, 60 sec: 5708.8, 300 sec: 5699.0). Total num frames: 298947584. Throughput: 0: 5977.0. Samples: 298951238. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:43,298][25689] Avg episode reward: [(0, '-48.053')] [2022-07-09 14:38:43,675][26022] Updated weights on worker 0-0, policy_version 291944 (0.00084) [2022-07-09 14:38:45,421][26022] Updated weights on worker 0-0, policy_version 291954 (0.00087) [2022-07-09 14:38:47,214][26022] Updated weights on worker 0-0, policy_version 291964 (0.00094) [2022-07-09 14:38:48,313][25689] Fps is (10 sec: 5962.7, 60 sec: 5725.6, 300 sec: 5703.0). Total num frames: 298978304. Throughput: 0: 6009.2. Samples: 298986176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:48,314][25689] Avg episode reward: [(0, '-48.008')] [2022-07-09 14:38:49,051][26022] Updated weights on worker 0-0, policy_version 291974 (0.00100) [2022-07-09 14:38:50,614][26022] Updated weights on worker 0-0, policy_version 291984 (0.00090) [2022-07-09 14:38:52,721][26022] Updated weights on worker 0-0, policy_version 291994 (0.00086) [2022-07-09 14:38:53,323][25689] Fps is (10 sec: 5924.2, 60 sec: 5725.8, 300 sec: 5706.5). Total num frames: 299006976. Throughput: 0: 5151.9. Samples: 299003116. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:53,325][25689] Avg episode reward: [(0, '-47.608')] [2022-07-09 14:38:54,282][26022] Updated weights on worker 0-0, policy_version 292004 (0.00090) [2022-07-09 14:38:56,179][26022] Updated weights on worker 0-0, policy_version 292014 (0.00090) [2022-07-09 14:38:57,922][26022] Updated weights on worker 0-0, policy_version 292024 (0.00086) [2022-07-09 14:38:58,334][25689] Fps is (10 sec: 5722.1, 60 sec: 5725.0, 300 sec: 5707.4). Total num frames: 299035648. Throughput: 0: 6008.9. Samples: 299037842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:38:58,336][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 14:38:59,634][26022] Updated weights on worker 0-0, policy_version 292034 (0.00092) [2022-07-09 14:39:01,725][26022] Updated weights on worker 0-0, policy_version 292044 (0.00085) [2022-07-09 14:39:03,403][25689] Fps is (10 sec: 5384.1, 60 sec: 5711.8, 300 sec: 5699.4). Total num frames: 299061248. Throughput: 0: 5919.8. Samples: 299070258. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 14:39:03,405][25689] Avg episode reward: [(0, '-48.121')] [2022-07-09 14:39:03,787][26022] Updated weights on worker 0-0, policy_version 292054 (0.00081) [2022-07-09 14:39:05,119][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:39:05,133][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000292063_299072512.pth [2022-07-09 14:39:05,133][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000290056_297017344.pth [2022-07-09 14:39:05,370][26022] Updated weights on worker 0-0, policy_version 292064 (0.00088) [2022-07-09 14:39:07,316][26022] Updated weights on worker 0-0, policy_version 292074 (0.00089) [2022-07-09 14:39:08,418][25689] Fps is (10 sec: 5483.6, 60 sec: 5695.0, 300 sec: 5702.6). Total num frames: 299090944. Throughput: 0: 5040.0. Samples: 299087508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:08,420][25689] Avg episode reward: [(0, '-47.870')] [2022-07-09 14:39:08,827][26022] Updated weights on worker 0-0, policy_version 292084 (0.00094) [2022-07-09 14:39:10,786][26022] Updated weights on worker 0-0, policy_version 292094 (0.00087) [2022-07-09 14:39:12,497][26022] Updated weights on worker 0-0, policy_version 292104 (0.00080) [2022-07-09 14:39:13,424][25689] Fps is (10 sec: 5824.2, 60 sec: 5748.7, 300 sec: 5703.2). Total num frames: 299119616. Throughput: 0: 5925.2. Samples: 299122222. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:13,425][25689] Avg episode reward: [(0, '-48.187')] [2022-07-09 14:39:14,245][26022] Updated weights on worker 0-0, policy_version 292114 (0.00085) [2022-07-09 14:39:16,056][26022] Updated weights on worker 0-0, policy_version 292124 (0.00090) [2022-07-09 14:39:17,747][26022] Updated weights on worker 0-0, policy_version 292134 (0.00084) [2022-07-09 14:39:18,525][25689] Fps is (10 sec: 5774.9, 60 sec: 5724.5, 300 sec: 5705.6). Total num frames: 299149312. Throughput: 0: 5915.4. Samples: 299157280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:18,529][25689] Avg episode reward: [(0, '-48.276')] [2022-07-09 14:39:19,610][26022] Updated weights on worker 0-0, policy_version 292144 (0.00090) [2022-07-09 14:39:21,285][26022] Updated weights on worker 0-0, policy_version 292154 (0.00612) [2022-07-09 14:39:23,023][26022] Updated weights on worker 0-0, policy_version 292164 (0.00095) [2022-07-09 14:39:23,601][25689] Fps is (10 sec: 5937.1, 60 sec: 5773.1, 300 sec: 5711.2). Total num frames: 299180032. Throughput: 0: 5167.5. Samples: 299174630. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:23,601][25689] Avg episode reward: [(0, '-47.967')] [2022-07-09 14:39:24,868][26022] Updated weights on worker 0-0, policy_version 292174 (0.00091) [2022-07-09 14:39:26,590][26022] Updated weights on worker 0-0, policy_version 292184 (0.00092) [2022-07-09 14:39:28,463][26022] Updated weights on worker 0-0, policy_version 292194 (0.00087) [2022-07-09 14:39:28,641][25689] Fps is (10 sec: 5769.8, 60 sec: 5737.5, 300 sec: 5703.9). Total num frames: 299207680. Throughput: 0: 6029.6. Samples: 299209446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:28,642][25689] Avg episode reward: [(0, '-47.001')] [2022-07-09 14:39:30,231][26022] Updated weights on worker 0-0, policy_version 292204 (0.00097) [2022-07-09 14:39:31,901][26022] Updated weights on worker 0-0, policy_version 292214 (0.00085) [2022-07-09 14:39:33,687][25689] Fps is (10 sec: 5583.4, 60 sec: 5719.7, 300 sec: 5707.1). Total num frames: 299236352. Throughput: 0: 6007.2. Samples: 299243946. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:33,690][25689] Avg episode reward: [(0, '-47.400')] [2022-07-09 14:39:33,826][26022] Updated weights on worker 0-0, policy_version 292224 (0.00085) [2022-07-09 14:39:35,407][26022] Updated weights on worker 0-0, policy_version 292234 (0.00093) [2022-07-09 14:39:37,370][26022] Updated weights on worker 0-0, policy_version 292244 (0.00052) [2022-07-09 14:39:38,704][25689] Fps is (10 sec: 5800.6, 60 sec: 5752.5, 300 sec: 5710.9). Total num frames: 299266048. Throughput: 0: 5156.4. Samples: 299261328. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:38,706][25689] Avg episode reward: [(0, '-46.793')] [2022-07-09 14:39:39,059][26022] Updated weights on worker 0-0, policy_version 292254 (0.00086) [2022-07-09 14:39:40,604][26022] Updated weights on worker 0-0, policy_version 292264 (0.00087) [2022-07-09 14:39:42,696][26022] Updated weights on worker 0-0, policy_version 292274 (0.00084) [2022-07-09 14:39:43,809][25689] Fps is (10 sec: 5969.0, 60 sec: 5770.5, 300 sec: 5716.0). Total num frames: 299296768. Throughput: 0: 6009.5. Samples: 299296074. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:43,810][25689] Avg episode reward: [(0, '-46.657')] [2022-07-09 14:39:44,294][26022] Updated weights on worker 0-0, policy_version 292284 (0.00079) [2022-07-09 14:39:46,255][26022] Updated weights on worker 0-0, policy_version 292294 (0.00084) [2022-07-09 14:39:47,912][26022] Updated weights on worker 0-0, policy_version 292304 (0.00087) [2022-07-09 14:39:48,866][25689] Fps is (10 sec: 5744.0, 60 sec: 5715.9, 300 sec: 5711.9). Total num frames: 299324416. Throughput: 0: 6002.9. Samples: 299330850. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:48,866][25689] Avg episode reward: [(0, '-46.756')] [2022-07-09 14:39:49,731][26022] Updated weights on worker 0-0, policy_version 292314 (0.00086) [2022-07-09 14:39:51,413][26022] Updated weights on worker 0-0, policy_version 292324 (0.00091) [2022-07-09 14:39:53,375][26022] Updated weights on worker 0-0, policy_version 292334 (0.00080) [2022-07-09 14:39:53,897][25689] Fps is (10 sec: 5583.1, 60 sec: 5713.9, 300 sec: 5711.4). Total num frames: 299353088. Throughput: 0: 5163.0. Samples: 299348286. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:53,897][25689] Avg episode reward: [(0, '-47.038')] [2022-07-09 14:39:54,857][26022] Updated weights on worker 0-0, policy_version 292344 (0.00085) [2022-07-09 14:39:56,815][26022] Updated weights on worker 0-0, policy_version 292354 (0.00091) [2022-07-09 14:39:58,456][26022] Updated weights on worker 0-0, policy_version 292364 (0.00091) [2022-07-09 14:39:58,982][25689] Fps is (10 sec: 5769.6, 60 sec: 5723.8, 300 sec: 5711.2). Total num frames: 299382784. Throughput: 0: 6011.0. Samples: 299383220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:39:58,983][25689] Avg episode reward: [(0, '-46.261')] [2022-07-09 14:40:00,283][26022] Updated weights on worker 0-0, policy_version 292374 (0.00083) [2022-07-09 14:40:02,615][26022] Updated weights on worker 0-0, policy_version 292384 (0.00082) [2022-07-09 14:40:04,057][25689] Fps is (10 sec: 5644.0, 60 sec: 5757.0, 300 sec: 5713.6). Total num frames: 299410432. Throughput: 0: 5882.6. Samples: 299415184. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:04,057][25689] Avg episode reward: [(0, '-47.499')] [2022-07-09 14:40:04,217][26022] Updated weights on worker 0-0, policy_version 292394 (0.00092) [2022-07-09 14:40:05,986][26022] Updated weights on worker 0-0, policy_version 292404 (0.00083) [2022-07-09 14:40:07,917][26022] Updated weights on worker 0-0, policy_version 292414 (0.00086) [2022-07-09 14:40:09,072][25689] Fps is (10 sec: 5683.1, 60 sec: 5757.0, 300 sec: 5713.4). Total num frames: 299440128. Throughput: 0: 5039.6. Samples: 299432684. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:09,073][25689] Avg episode reward: [(0, '-47.279')] [2022-07-09 14:40:09,460][26022] Updated weights on worker 0-0, policy_version 292424 (0.00080) [2022-07-09 14:40:11,386][26022] Updated weights on worker 0-0, policy_version 292434 (0.00083) [2022-07-09 14:40:12,959][26022] Updated weights on worker 0-0, policy_version 292444 (0.00097) [2022-07-09 14:40:14,084][25689] Fps is (10 sec: 5616.5, 60 sec: 5722.7, 300 sec: 5706.5). Total num frames: 299466752. Throughput: 0: 5897.6. Samples: 299467346. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:14,085][25689] Avg episode reward: [(0, '-46.491')] [2022-07-09 14:40:14,913][26022] Updated weights on worker 0-0, policy_version 292454 (0.00087) [2022-07-09 14:40:16,749][26022] Updated weights on worker 0-0, policy_version 292464 (0.00086) [2022-07-09 14:40:18,419][26022] Updated weights on worker 0-0, policy_version 292474 (0.00079) [2022-07-09 14:40:19,098][25689] Fps is (10 sec: 5719.5, 60 sec: 5747.8, 300 sec: 5714.9). Total num frames: 299497472. Throughput: 0: 5902.1. Samples: 299501948. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:19,099][25689] Avg episode reward: [(0, '-46.492')] [2022-07-09 14:40:20,333][26022] Updated weights on worker 0-0, policy_version 292484 (0.00086) [2022-07-09 14:40:22,002][26022] Updated weights on worker 0-0, policy_version 292494 (0.00085) [2022-07-09 14:40:23,870][26022] Updated weights on worker 0-0, policy_version 292504 (0.00089) [2022-07-09 14:40:24,183][25689] Fps is (10 sec: 5779.8, 60 sec: 5696.2, 300 sec: 5707.0). Total num frames: 299525120. Throughput: 0: 5170.3. Samples: 299519242. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:24,183][25689] Avg episode reward: [(0, '-46.899')] [2022-07-09 14:40:25,526][26022] Updated weights on worker 0-0, policy_version 292514 (0.00090) [2022-07-09 14:40:27,311][26022] Updated weights on worker 0-0, policy_version 292524 (0.00084) [2022-07-09 14:40:29,163][26022] Updated weights on worker 0-0, policy_version 292534 (0.00081) [2022-07-09 14:40:29,215][25689] Fps is (10 sec: 5668.4, 60 sec: 5730.9, 300 sec: 5713.6). Total num frames: 299554816. Throughput: 0: 6004.3. Samples: 299553626. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:29,215][25689] Avg episode reward: [(0, '-46.927')] [2022-07-09 14:40:30,914][26022] Updated weights on worker 0-0, policy_version 292544 (0.00092) [2022-07-09 14:40:32,831][26022] Updated weights on worker 0-0, policy_version 292554 (0.00085) [2022-07-09 14:40:34,217][25689] Fps is (10 sec: 5817.1, 60 sec: 5735.0, 300 sec: 5714.3). Total num frames: 299583488. Throughput: 0: 5980.0. Samples: 299587738. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:34,219][25689] Avg episode reward: [(0, '-46.886')] [2022-07-09 14:40:34,745][26022] Updated weights on worker 0-0, policy_version 292564 (0.00088) [2022-07-09 14:40:36,259][26022] Updated weights on worker 0-0, policy_version 292574 (0.00086) [2022-07-09 14:40:38,205][26022] Updated weights on worker 0-0, policy_version 292584 (0.00086) [2022-07-09 14:40:39,227][25689] Fps is (10 sec: 5829.7, 60 sec: 5735.6, 300 sec: 5712.9). Total num frames: 299613184. Throughput: 0: 5129.1. Samples: 299605190. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:39,227][25689] Avg episode reward: [(0, '-46.767')] [2022-07-09 14:40:39,756][26022] Updated weights on worker 0-0, policy_version 292594 (0.00078) [2022-07-09 14:40:41,868][26022] Updated weights on worker 0-0, policy_version 292604 (0.00088) [2022-07-09 14:40:43,436][26022] Updated weights on worker 0-0, policy_version 292614 (0.00089) [2022-07-09 14:40:44,322][25689] Fps is (10 sec: 5674.7, 60 sec: 5685.8, 300 sec: 5716.5). Total num frames: 299640832. Throughput: 0: 5986.3. Samples: 299639800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:44,323][25689] Avg episode reward: [(0, '-46.292')] [2022-07-09 14:40:45,288][26022] Updated weights on worker 0-0, policy_version 292624 (0.00093) [2022-07-09 14:40:47,056][26022] Updated weights on worker 0-0, policy_version 292634 (0.00086) [2022-07-09 14:40:48,743][26022] Updated weights on worker 0-0, policy_version 292644 (0.00086) [2022-07-09 14:40:49,352][25689] Fps is (10 sec: 5562.6, 60 sec: 5705.3, 300 sec: 5713.2). Total num frames: 299669504. Throughput: 0: 6022.4. Samples: 299674898. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:49,353][25689] Avg episode reward: [(0, '-46.160')] [2022-07-09 14:40:50,426][26022] Updated weights on worker 0-0, policy_version 292654 (0.00083) [2022-07-09 14:40:52,647][26022] Updated weights on worker 0-0, policy_version 292664 (0.00103) [2022-07-09 14:40:53,865][26022] Updated weights on worker 0-0, policy_version 292674 (0.00087) [2022-07-09 14:40:54,385][25689] Fps is (10 sec: 6003.6, 60 sec: 5755.8, 300 sec: 5719.7). Total num frames: 299701248. Throughput: 0: 5178.7. Samples: 299692184. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:54,386][25689] Avg episode reward: [(0, '-45.770')] [2022-07-09 14:40:56,158][26022] Updated weights on worker 0-0, policy_version 292684 (0.00091) [2022-07-09 14:40:57,439][26022] Updated weights on worker 0-0, policy_version 292694 (0.00084) [2022-07-09 14:40:59,456][25689] Fps is (10 sec: 5776.7, 60 sec: 5706.4, 300 sec: 5716.0). Total num frames: 299727872. Throughput: 0: 5996.2. Samples: 299726486. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:40:59,456][25689] Avg episode reward: [(0, '-45.997')] [2022-07-09 14:40:59,519][26022] Updated weights on worker 0-0, policy_version 292704 (0.00086) [2022-07-09 14:41:01,218][26022] Updated weights on worker 0-0, policy_version 292714 (0.00085) [2022-07-09 14:41:03,469][26022] Updated weights on worker 0-0, policy_version 292724 (0.00087) [2022-07-09 14:41:04,537][25689] Fps is (10 sec: 5245.5, 60 sec: 5688.9, 300 sec: 5714.7). Total num frames: 299754496. Throughput: 0: 5898.8. Samples: 299759040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:41:04,537][25689] Avg episode reward: [(0, '-45.470')] [2022-07-09 14:41:05,280][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:41:05,293][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000292734_299759616.pth [2022-07-09 14:41:05,293][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000290725_297702400.pth [2022-07-09 14:41:05,295][26022] Updated weights on worker 0-0, policy_version 292734 (0.00083) [2022-07-09 14:41:07,235][26022] Updated weights on worker 0-0, policy_version 292744 (0.00089) [2022-07-09 14:41:08,587][26022] Updated weights on worker 0-0, policy_version 292754 (0.00083) [2022-07-09 14:41:09,589][25689] Fps is (10 sec: 5557.7, 60 sec: 5685.4, 300 sec: 5720.8). Total num frames: 299784192. Throughput: 0: 5851.3. Samples: 299793314. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:41:09,590][25689] Avg episode reward: [(0, '-46.525')] [2022-07-09 14:41:10,879][26022] Updated weights on worker 0-0, policy_version 292764 (0.00096) [2022-07-09 14:41:12,183][26022] Updated weights on worker 0-0, policy_version 292774 (0.00094) [2022-07-09 14:41:14,404][26022] Updated weights on worker 0-0, policy_version 292784 (0.00093) [2022-07-09 14:41:14,648][25689] Fps is (10 sec: 5671.4, 60 sec: 5698.0, 300 sec: 5713.8). Total num frames: 299811840. Throughput: 0: 5826.5. Samples: 299810242. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:41:14,648][25689] Avg episode reward: [(0, '-46.765')] [2022-07-09 14:41:15,820][26022] Updated weights on worker 0-0, policy_version 292794 (0.00081) [2022-07-09 14:41:17,965][26022] Updated weights on worker 0-0, policy_version 292804 (0.00048) [2022-07-09 14:41:19,400][26022] Updated weights on worker 0-0, policy_version 292814 (0.00093) [2022-07-09 14:41:19,714][25689] Fps is (10 sec: 5765.0, 60 sec: 5693.1, 300 sec: 5717.3). Total num frames: 299842560. Throughput: 0: 5833.9. Samples: 299844670. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:41:19,714][25689] Avg episode reward: [(0, '-46.279')] [2022-07-09 14:41:21,571][26022] Updated weights on worker 0-0, policy_version 292824 (0.00090) [2022-07-09 14:41:23,050][26022] Updated weights on worker 0-0, policy_version 292834 (0.00089) [2022-07-09 14:41:24,751][25689] Fps is (10 sec: 5675.8, 60 sec: 5680.6, 300 sec: 5713.5). Total num frames: 299869184. Throughput: 0: 5933.1. Samples: 299878974. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:41:24,751][25689] Avg episode reward: [(0, '-45.726')] [2022-07-09 14:41:25,292][26022] Updated weights on worker 0-0, policy_version 292844 (0.00085) [2022-07-09 14:41:26,708][26022] Updated weights on worker 0-0, policy_version 292854 (0.00086) [2022-07-09 14:41:28,678][26022] Updated weights on worker 0-0, policy_version 292864 (0.00091) [2022-07-09 14:41:29,786][25689] Fps is (10 sec: 5693.4, 60 sec: 5697.2, 300 sec: 5714.4). Total num frames: 299899904. Throughput: 0: 5090.0. Samples: 299896114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-09 14:41:29,786][25689] Avg episode reward: [(0, '-45.569')] [2022-07-09 14:41:30,309][26022] Updated weights on worker 0-0, policy_version 292874 (0.00092) [2022-07-09 14:41:32,158][26022] Updated weights on worker 0-0, policy_version 292884 (0.00088) [2022-07-09 14:41:33,894][26022] Updated weights on worker 0-0, policy_version 292894 (0.00085) [2022-07-09 14:41:34,805][25689] Fps is (10 sec: 5805.3, 60 sec: 5678.7, 300 sec: 5714.1). Total num frames: 299927552. Throughput: 0: 5956.4. Samples: 299930308. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:41:34,807][25689] Avg episode reward: [(0, '-45.579')] [2022-07-09 14:41:35,781][26022] Updated weights on worker 0-0, policy_version 292904 (0.01116) [2022-07-09 14:41:37,438][26022] Updated weights on worker 0-0, policy_version 292914 (0.00087) [2022-07-09 14:41:39,431][26022] Updated weights on worker 0-0, policy_version 292924 (0.00087) [2022-07-09 14:41:39,808][25689] Fps is (10 sec: 5517.2, 60 sec: 5645.5, 300 sec: 5708.0). Total num frames: 299955200. Throughput: 0: 5961.8. Samples: 299964470. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:41:39,809][25689] Avg episode reward: [(0, '-45.461')] [2022-07-09 14:41:41,078][26022] Updated weights on worker 0-0, policy_version 292934 (0.00083) [2022-07-09 14:41:42,944][26022] Updated weights on worker 0-0, policy_version 292944 (0.00092) [2022-07-09 14:41:44,545][26022] Updated weights on worker 0-0, policy_version 292954 (0.00091) [2022-07-09 14:41:44,854][25689] Fps is (10 sec: 5808.3, 60 sec: 5700.9, 300 sec: 5712.1). Total num frames: 299985920. Throughput: 0: 5108.5. Samples: 299981672. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:41:44,855][25689] Avg episode reward: [(0, '-45.468')] [2022-07-09 14:41:46,679][26022] Updated weights on worker 0-0, policy_version 292964 (0.00094) [2022-07-09 14:41:48,190][26022] Updated weights on worker 0-0, policy_version 292974 (0.00094) [2022-07-09 14:41:49,868][25689] Fps is (10 sec: 5904.1, 60 sec: 5702.4, 300 sec: 5713.5). Total num frames: 300014592. Throughput: 0: 5980.5. Samples: 300016214. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:41:49,869][25689] Avg episode reward: [(0, '-46.565')] [2022-07-09 14:41:50,011][26022] Updated weights on worker 0-0, policy_version 292984 (0.00086) [2022-07-09 14:41:51,737][26022] Updated weights on worker 0-0, policy_version 292994 (0.00052) [2022-07-09 14:41:53,654][26022] Updated weights on worker 0-0, policy_version 293004 (0.00088) [2022-07-09 14:41:54,873][25689] Fps is (10 sec: 5723.4, 60 sec: 5654.3, 300 sec: 5710.4). Total num frames: 300043264. Throughput: 0: 6015.0. Samples: 300051018. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:41:54,874][25689] Avg episode reward: [(0, '-46.780')] [2022-07-09 14:41:55,396][26022] Updated weights on worker 0-0, policy_version 293014 (0.00089) [2022-07-09 14:41:57,124][26022] Updated weights on worker 0-0, policy_version 293024 (0.00086) [2022-07-09 14:41:59,016][26022] Updated weights on worker 0-0, policy_version 293034 (0.00088) [2022-07-09 14:41:59,880][25689] Fps is (10 sec: 5727.6, 60 sec: 5694.1, 300 sec: 5725.3). Total num frames: 300071936. Throughput: 0: 5174.9. Samples: 300068338. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:41:59,881][25689] Avg episode reward: [(0, '-46.581')] [2022-07-09 14:42:00,778][26022] Updated weights on worker 0-0, policy_version 293044 (0.00077) [2022-07-09 14:42:02,878][26022] Updated weights on worker 0-0, policy_version 293054 (0.00085) [2022-07-09 14:42:04,688][26022] Updated weights on worker 0-0, policy_version 293064 (0.00084) [2022-07-09 14:42:04,933][25689] Fps is (10 sec: 5496.9, 60 sec: 5696.8, 300 sec: 5710.8). Total num frames: 300098560. Throughput: 0: 5918.1. Samples: 300100500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:04,933][25689] Avg episode reward: [(0, '-47.428')] [2022-07-09 14:42:06,403][26022] Updated weights on worker 0-0, policy_version 293074 (0.00100) [2022-07-09 14:42:08,257][26022] Updated weights on worker 0-0, policy_version 293084 (0.00090) [2022-07-09 14:42:09,941][25689] Fps is (10 sec: 5394.0, 60 sec: 5667.0, 300 sec: 5707.9). Total num frames: 300126208. Throughput: 0: 5903.6. Samples: 300134720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:09,942][25689] Avg episode reward: [(0, '-48.022')] [2022-07-09 14:42:10,055][26022] Updated weights on worker 0-0, policy_version 293094 (0.00086) [2022-07-09 14:42:11,900][26022] Updated weights on worker 0-0, policy_version 293104 (0.00092) [2022-07-09 14:42:13,719][26022] Updated weights on worker 0-0, policy_version 293114 (0.00090) [2022-07-09 14:42:14,950][25689] Fps is (10 sec: 5724.7, 60 sec: 5705.7, 300 sec: 5711.5). Total num frames: 300155904. Throughput: 0: 5028.7. Samples: 300151976. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:14,950][25689] Avg episode reward: [(0, '-47.751')] [2022-07-09 14:42:15,379][26022] Updated weights on worker 0-0, policy_version 293124 (0.00092) [2022-07-09 14:42:17,240][26022] Updated weights on worker 0-0, policy_version 293134 (0.00084) [2022-07-09 14:42:19,199][26022] Updated weights on worker 0-0, policy_version 293144 (0.00095) [2022-07-09 14:42:19,965][25689] Fps is (10 sec: 5823.2, 60 sec: 5676.5, 300 sec: 5713.2). Total num frames: 300184576. Throughput: 0: 5863.6. Samples: 300186108. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:19,965][25689] Avg episode reward: [(0, '-48.587')] [2022-07-09 14:42:20,755][26022] Updated weights on worker 0-0, policy_version 293154 (0.00088) [2022-07-09 14:42:22,725][26022] Updated weights on worker 0-0, policy_version 293164 (0.00083) [2022-07-09 14:42:24,354][26022] Updated weights on worker 0-0, policy_version 293174 (0.00085) [2022-07-09 14:42:25,027][25689] Fps is (10 sec: 5690.6, 60 sec: 5708.2, 300 sec: 5712.4). Total num frames: 300213248. Throughput: 0: 5976.8. Samples: 300220598. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:25,027][25689] Avg episode reward: [(0, '-48.935')] [2022-07-09 14:42:26,196][26022] Updated weights on worker 0-0, policy_version 293184 (0.00087) [2022-07-09 14:42:27,835][26022] Updated weights on worker 0-0, policy_version 293194 (0.00089) [2022-07-09 14:42:29,887][26022] Updated weights on worker 0-0, policy_version 293204 (0.00083) [2022-07-09 14:42:30,030][25689] Fps is (10 sec: 5595.6, 60 sec: 5660.2, 300 sec: 5706.8). Total num frames: 300240896. Throughput: 0: 5122.3. Samples: 300237618. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:30,030][25689] Avg episode reward: [(0, '-48.357')] [2022-07-09 14:42:31,517][26022] Updated weights on worker 0-0, policy_version 293214 (0.00083) [2022-07-09 14:42:33,521][26022] Updated weights on worker 0-0, policy_version 293224 (0.00086) [2022-07-09 14:42:35,035][25689] Fps is (10 sec: 5831.8, 60 sec: 5712.5, 300 sec: 5714.1). Total num frames: 300271616. Throughput: 0: 5993.5. Samples: 300272358. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:35,036][25689] Avg episode reward: [(0, '-48.066')] [2022-07-09 14:42:35,041][26022] Updated weights on worker 0-0, policy_version 293234 (0.00621) [2022-07-09 14:42:36,895][26022] Updated weights on worker 0-0, policy_version 293244 (0.00089) [2022-07-09 14:42:38,503][26022] Updated weights on worker 0-0, policy_version 293254 (0.00087) [2022-07-09 14:42:40,063][25689] Fps is (10 sec: 5817.5, 60 sec: 5710.2, 300 sec: 5708.9). Total num frames: 300299264. Throughput: 0: 6005.2. Samples: 300306802. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:40,063][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 14:42:40,522][26022] Updated weights on worker 0-0, policy_version 293264 (0.00087) [2022-07-09 14:42:42,145][26022] Updated weights on worker 0-0, policy_version 293274 (0.00084) [2022-07-09 14:42:44,052][26022] Updated weights on worker 0-0, policy_version 293284 (0.00094) [2022-07-09 14:42:45,129][25689] Fps is (10 sec: 5579.7, 60 sec: 5674.3, 300 sec: 5704.5). Total num frames: 300327936. Throughput: 0: 5137.2. Samples: 300323868. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:45,129][25689] Avg episode reward: [(0, '-47.113')] [2022-07-09 14:42:45,868][26022] Updated weights on worker 0-0, policy_version 293294 (0.00084) [2022-07-09 14:42:47,414][26022] Updated weights on worker 0-0, policy_version 293304 (0.00089) [2022-07-09 14:42:49,479][26022] Updated weights on worker 0-0, policy_version 293314 (0.00092) [2022-07-09 14:42:50,154][25689] Fps is (10 sec: 5783.7, 60 sec: 5690.1, 300 sec: 5707.6). Total num frames: 300357632. Throughput: 0: 6009.2. Samples: 300358552. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:50,155][25689] Avg episode reward: [(0, '-45.981')] [2022-07-09 14:42:51,207][26022] Updated weights on worker 0-0, policy_version 293324 (0.00089) [2022-07-09 14:42:53,024][26022] Updated weights on worker 0-0, policy_version 293334 (0.00091) [2022-07-09 14:42:54,899][26022] Updated weights on worker 0-0, policy_version 293344 (0.00083) [2022-07-09 14:42:55,176][25689] Fps is (10 sec: 5707.5, 60 sec: 5671.7, 300 sec: 5703.9). Total num frames: 300385280. Throughput: 0: 5976.9. Samples: 300392738. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:42:55,176][25689] Avg episode reward: [(0, '-45.649')] [2022-07-09 14:42:56,550][26022] Updated weights on worker 0-0, policy_version 293354 (0.00097) [2022-07-09 14:42:58,483][26022] Updated weights on worker 0-0, policy_version 293364 (0.00088) [2022-07-09 14:42:59,958][26022] Updated weights on worker 0-0, policy_version 293374 (0.00088) [2022-07-09 14:43:00,220][25689] Fps is (10 sec: 5900.4, 60 sec: 5719.0, 300 sec: 5722.3). Total num frames: 300417024. Throughput: 0: 5122.3. Samples: 300410054. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:00,221][25689] Avg episode reward: [(0, '-46.465')] [2022-07-09 14:43:02,409][26022] Updated weights on worker 0-0, policy_version 293384 (0.00102) [2022-07-09 14:43:03,883][26022] Updated weights on worker 0-0, policy_version 293394 (0.00093) [2022-07-09 14:43:05,339][25689] Fps is (10 sec: 5541.8, 60 sec: 5678.9, 300 sec: 5699.7). Total num frames: 300441600. Throughput: 0: 5867.7. Samples: 300442454. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:05,339][25689] Avg episode reward: [(0, '-46.489')] [2022-07-09 14:43:05,437][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:43:05,455][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000293401_300442624.pth [2022-07-09 14:43:05,456][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000291393_298386432.pth [2022-07-09 14:43:05,924][26022] Updated weights on worker 0-0, policy_version 293404 (0.00087) [2022-07-09 14:43:07,623][26022] Updated weights on worker 0-0, policy_version 293414 (0.00090) [2022-07-09 14:43:09,460][26022] Updated weights on worker 0-0, policy_version 293424 (0.00087) [2022-07-09 14:43:10,376][25689] Fps is (10 sec: 5343.9, 60 sec: 5710.1, 300 sec: 5713.4). Total num frames: 300471296. Throughput: 0: 5841.4. Samples: 300476674. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:10,377][25689] Avg episode reward: [(0, '-47.309')] [2022-07-09 14:43:11,244][26022] Updated weights on worker 0-0, policy_version 293434 (0.00093) [2022-07-09 14:43:13,086][26022] Updated weights on worker 0-0, policy_version 293444 (0.00050) [2022-07-09 14:43:14,738][26022] Updated weights on worker 0-0, policy_version 293454 (0.00084) [2022-07-09 14:43:15,383][25689] Fps is (10 sec: 5811.2, 60 sec: 5693.3, 300 sec: 5706.9). Total num frames: 300499968. Throughput: 0: 5013.4. Samples: 300494042. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:15,383][25689] Avg episode reward: [(0, '-46.846')] [2022-07-09 14:43:16,427][26022] Updated weights on worker 0-0, policy_version 293464 (0.00082) [2022-07-09 14:43:18,279][26022] Updated weights on worker 0-0, policy_version 293474 (0.00086) [2022-07-09 14:43:20,101][26022] Updated weights on worker 0-0, policy_version 293484 (0.00080) [2022-07-09 14:43:20,388][25689] Fps is (10 sec: 5829.8, 60 sec: 5711.2, 300 sec: 5714.7). Total num frames: 300529664. Throughput: 0: 5909.2. Samples: 300529230. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:20,389][25689] Avg episode reward: [(0, '-46.708')] [2022-07-09 14:43:21,721][26022] Updated weights on worker 0-0, policy_version 293494 (0.00092) [2022-07-09 14:43:23,585][26022] Updated weights on worker 0-0, policy_version 293504 (0.00083) [2022-07-09 14:43:25,271][26022] Updated weights on worker 0-0, policy_version 293514 (0.00089) [2022-07-09 14:43:25,434][25689] Fps is (10 sec: 5807.0, 60 sec: 5712.7, 300 sec: 5710.8). Total num frames: 300558336. Throughput: 0: 6068.1. Samples: 300564394. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:25,435][25689] Avg episode reward: [(0, '-47.504')] [2022-07-09 14:43:27,267][26022] Updated weights on worker 0-0, policy_version 293524 (0.00092) [2022-07-09 14:43:28,838][26022] Updated weights on worker 0-0, policy_version 293534 (0.00872) [2022-07-09 14:43:30,471][25689] Fps is (10 sec: 5687.4, 60 sec: 5726.4, 300 sec: 5707.3). Total num frames: 300587008. Throughput: 0: 5222.5. Samples: 300581618. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:30,471][25689] Avg episode reward: [(0, '-47.450')] [2022-07-09 14:43:30,720][26022] Updated weights on worker 0-0, policy_version 293544 (0.00085) [2022-07-09 14:43:32,414][26022] Updated weights on worker 0-0, policy_version 293554 (0.00085) [2022-07-09 14:43:34,363][26022] Updated weights on worker 0-0, policy_version 293564 (0.00086) [2022-07-09 14:43:35,473][25689] Fps is (10 sec: 5814.1, 60 sec: 5709.8, 300 sec: 5714.3). Total num frames: 300616704. Throughput: 0: 6070.0. Samples: 300615990. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:35,474][25689] Avg episode reward: [(0, '-47.505')] [2022-07-09 14:43:35,935][26022] Updated weights on worker 0-0, policy_version 293574 (0.00092) [2022-07-09 14:43:37,815][26022] Updated weights on worker 0-0, policy_version 293584 (0.00092) [2022-07-09 14:43:39,619][26022] Updated weights on worker 0-0, policy_version 293594 (0.00090) [2022-07-09 14:43:40,497][25689] Fps is (10 sec: 5719.3, 60 sec: 5710.1, 300 sec: 5709.2). Total num frames: 300644352. Throughput: 0: 6026.7. Samples: 300650422. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:40,498][25689] Avg episode reward: [(0, '-46.812')] [2022-07-09 14:43:41,463][26022] Updated weights on worker 0-0, policy_version 293604 (0.00088) [2022-07-09 14:43:43,131][26022] Updated weights on worker 0-0, policy_version 293614 (0.00084) [2022-07-09 14:43:44,953][26022] Updated weights on worker 0-0, policy_version 293624 (0.00085) [2022-07-09 14:43:45,551][25689] Fps is (10 sec: 5791.9, 60 sec: 5745.2, 300 sec: 5708.5). Total num frames: 300675072. Throughput: 0: 5128.4. Samples: 300667560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:45,551][25689] Avg episode reward: [(0, '-47.833')] [2022-07-09 14:43:46,768][26022] Updated weights on worker 0-0, policy_version 293634 (0.00086) [2022-07-09 14:43:48,357][26022] Updated weights on worker 0-0, policy_version 293644 (0.00084) [2022-07-09 14:43:50,326][26022] Updated weights on worker 0-0, policy_version 293654 (0.00089) [2022-07-09 14:43:50,568][25689] Fps is (10 sec: 5795.8, 60 sec: 5712.1, 300 sec: 5704.9). Total num frames: 300702720. Throughput: 0: 5999.2. Samples: 300702186. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:50,568][25689] Avg episode reward: [(0, '-48.541')] [2022-07-09 14:43:52,074][26022] Updated weights on worker 0-0, policy_version 293664 (0.00085) [2022-07-09 14:43:53,875][26022] Updated weights on worker 0-0, policy_version 293674 (0.00088) [2022-07-09 14:43:55,572][25689] Fps is (10 sec: 5517.6, 60 sec: 5713.7, 300 sec: 5701.6). Total num frames: 300730368. Throughput: 0: 5995.7. Samples: 300736500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2022-07-09 14:43:55,573][25689] Avg episode reward: [(0, '-48.417')] [2022-07-09 14:43:55,684][26022] Updated weights on worker 0-0, policy_version 293684 (0.00084) [2022-07-09 14:43:57,261][26022] Updated weights on worker 0-0, policy_version 293694 (0.00088) [2022-07-09 14:43:59,211][26022] Updated weights on worker 0-0, policy_version 293704 (0.00085) [2022-07-09 14:44:00,611][25689] Fps is (10 sec: 5812.0, 60 sec: 5697.3, 300 sec: 5719.4). Total num frames: 300761088. Throughput: 0: 5138.6. Samples: 300753778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:00,611][25689] Avg episode reward: [(0, '-48.548')] [2022-07-09 14:44:01,228][26022] Updated weights on worker 0-0, policy_version 293714 (0.00090) [2022-07-09 14:44:02,993][26022] Updated weights on worker 0-0, policy_version 293724 (0.00093) [2022-07-09 14:44:05,033][26022] Updated weights on worker 0-0, policy_version 293734 (0.00089) [2022-07-09 14:44:05,648][25689] Fps is (10 sec: 5589.8, 60 sec: 5722.0, 300 sec: 5705.2). Total num frames: 300786688. Throughput: 0: 5908.6. Samples: 300786306. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:05,649][25689] Avg episode reward: [(0, '-48.693')] [2022-07-09 14:44:06,503][26022] Updated weights on worker 0-0, policy_version 293744 (0.00092) [2022-07-09 14:44:08,575][26022] Updated weights on worker 0-0, policy_version 293754 (0.00099) [2022-07-09 14:44:10,238][26022] Updated weights on worker 0-0, policy_version 293764 (0.00093) [2022-07-09 14:44:10,667][25689] Fps is (10 sec: 5397.0, 60 sec: 5706.8, 300 sec: 5704.9). Total num frames: 300815360. Throughput: 0: 5893.1. Samples: 300820630. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:10,668][25689] Avg episode reward: [(0, '-49.273')] [2022-07-09 14:44:12,104][26022] Updated weights on worker 0-0, policy_version 293774 (0.00087) [2022-07-09 14:44:13,957][26022] Updated weights on worker 0-0, policy_version 293784 (0.00087) [2022-07-09 14:44:15,626][26022] Updated weights on worker 0-0, policy_version 293794 (0.00087) [2022-07-09 14:44:15,699][25689] Fps is (10 sec: 5807.1, 60 sec: 5721.3, 300 sec: 5706.2). Total num frames: 300845056. Throughput: 0: 5035.1. Samples: 300837840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:15,699][25689] Avg episode reward: [(0, '-48.884')] [2022-07-09 14:44:17,536][26022] Updated weights on worker 0-0, policy_version 293804 (0.00093) [2022-07-09 14:44:19,222][26022] Updated weights on worker 0-0, policy_version 293814 (0.00096) [2022-07-09 14:44:20,715][25689] Fps is (10 sec: 5707.1, 60 sec: 5686.4, 300 sec: 5697.0). Total num frames: 300872704. Throughput: 0: 5887.2. Samples: 300872134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:20,716][25689] Avg episode reward: [(0, '-48.151')] [2022-07-09 14:44:21,196][26022] Updated weights on worker 0-0, policy_version 293824 (0.00086) [2022-07-09 14:44:22,796][26022] Updated weights on worker 0-0, policy_version 293834 (0.00085) [2022-07-09 14:44:24,682][26022] Updated weights on worker 0-0, policy_version 293844 (0.00092) [2022-07-09 14:44:25,825][25689] Fps is (10 sec: 5763.9, 60 sec: 5714.2, 300 sec: 5706.0). Total num frames: 300903424. Throughput: 0: 5949.1. Samples: 300906344. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:25,826][25689] Avg episode reward: [(0, '-48.163')] [2022-07-09 14:44:26,557][26022] Updated weights on worker 0-0, policy_version 293854 (0.00086) [2022-07-09 14:44:28,235][26022] Updated weights on worker 0-0, policy_version 293864 (0.00092) [2022-07-09 14:44:30,188][26022] Updated weights on worker 0-0, policy_version 293874 (0.00090) [2022-07-09 14:44:30,838][25689] Fps is (10 sec: 5664.5, 60 sec: 5682.6, 300 sec: 5699.8). Total num frames: 300930048. Throughput: 0: 5946.6. Samples: 300940580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:30,838][25689] Avg episode reward: [(0, '-48.689')] [2022-07-09 14:44:31,824][26022] Updated weights on worker 0-0, policy_version 293884 (0.00085) [2022-07-09 14:44:33,753][26022] Updated weights on worker 0-0, policy_version 293894 (0.00086) [2022-07-09 14:44:35,587][26022] Updated weights on worker 0-0, policy_version 293904 (0.00083) [2022-07-09 14:44:35,883][25689] Fps is (10 sec: 5396.1, 60 sec: 5644.6, 300 sec: 5692.4). Total num frames: 300957696. Throughput: 0: 5929.6. Samples: 300957522. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:35,883][25689] Avg episode reward: [(0, '-48.069')] [2022-07-09 14:44:37,365][26022] Updated weights on worker 0-0, policy_version 293914 (0.00089) [2022-07-09 14:44:39,333][26022] Updated weights on worker 0-0, policy_version 293924 (0.00088) [2022-07-09 14:44:40,880][26022] Updated weights on worker 0-0, policy_version 293934 (0.00085) [2022-07-09 14:44:40,931][25689] Fps is (10 sec: 5782.7, 60 sec: 5693.2, 300 sec: 5693.4). Total num frames: 300988416. Throughput: 0: 5908.0. Samples: 300991576. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:40,933][25689] Avg episode reward: [(0, '-46.945')] [2022-07-09 14:44:42,704][26022] Updated weights on worker 0-0, policy_version 293944 (0.00085) [2022-07-09 14:44:44,595][26022] Updated weights on worker 0-0, policy_version 293954 (0.00089) [2022-07-09 14:44:46,079][25689] Fps is (10 sec: 5825.1, 60 sec: 5650.5, 300 sec: 5695.1). Total num frames: 301017088. Throughput: 0: 5906.1. Samples: 301025962. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:46,079][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 14:44:46,244][26022] Updated weights on worker 0-0, policy_version 293964 (0.00087) [2022-07-09 14:44:48,158][26022] Updated weights on worker 0-0, policy_version 293974 (0.00096) [2022-07-09 14:44:49,811][26022] Updated weights on worker 0-0, policy_version 293984 (0.00083) [2022-07-09 14:44:51,095][25689] Fps is (10 sec: 5743.1, 60 sec: 5684.5, 300 sec: 5698.9). Total num frames: 301046784. Throughput: 0: 5067.2. Samples: 301043226. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:51,095][25689] Avg episode reward: [(0, '-46.564')] [2022-07-09 14:44:51,821][26022] Updated weights on worker 0-0, policy_version 293994 (0.00094) [2022-07-09 14:44:53,434][26022] Updated weights on worker 0-0, policy_version 294004 (0.00086) [2022-07-09 14:44:55,154][26022] Updated weights on worker 0-0, policy_version 294014 (0.00082) [2022-07-09 14:44:56,111][25689] Fps is (10 sec: 5818.2, 60 sec: 5700.3, 300 sec: 5696.8). Total num frames: 301075456. Throughput: 0: 5944.5. Samples: 301077766. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:44:56,111][25689] Avg episode reward: [(0, '-46.227')] [2022-07-09 14:44:57,108][26022] Updated weights on worker 0-0, policy_version 294024 (0.00089) [2022-07-09 14:44:58,572][26022] Updated weights on worker 0-0, policy_version 294034 (0.00087) [2022-07-09 14:45:00,712][26022] Updated weights on worker 0-0, policy_version 294044 (0.00093) [2022-07-09 14:45:01,146][25689] Fps is (10 sec: 5705.4, 60 sec: 5666.8, 300 sec: 5701.0). Total num frames: 301104128. Throughput: 0: 5967.2. Samples: 301112198. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:01,146][25689] Avg episode reward: [(0, '-46.169')] [2022-07-09 14:45:02,775][26022] Updated weights on worker 0-0, policy_version 294054 (0.00093) [2022-07-09 14:45:04,564][26022] Updated weights on worker 0-0, policy_version 294064 (0.00094) [2022-07-09 14:45:05,463][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:45:05,479][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000294070_301127680.pth [2022-07-09 14:45:05,483][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000292063_299072512.pth [2022-07-09 14:45:06,267][25689] Fps is (10 sec: 5444.3, 60 sec: 5675.8, 300 sec: 5688.6). Total num frames: 301130752. Throughput: 0: 5008.5. Samples: 301127076. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:06,268][25689] Avg episode reward: [(0, '-45.730')] [2022-07-09 14:45:06,468][26022] Updated weights on worker 0-0, policy_version 294074 (0.00082) [2022-07-09 14:45:08,095][26022] Updated weights on worker 0-0, policy_version 294084 (0.00084) [2022-07-09 14:45:09,886][26022] Updated weights on worker 0-0, policy_version 294094 (0.00090) [2022-07-09 14:45:11,302][25689] Fps is (10 sec: 5545.5, 60 sec: 5691.2, 300 sec: 5698.5). Total num frames: 301160448. Throughput: 0: 5861.5. Samples: 301161668. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:11,302][25689] Avg episode reward: [(0, '-47.417')] [2022-07-09 14:45:11,708][26022] Updated weights on worker 0-0, policy_version 294104 (0.00095) [2022-07-09 14:45:13,460][26022] Updated weights on worker 0-0, policy_version 294114 (0.00091) [2022-07-09 14:45:15,492][26022] Updated weights on worker 0-0, policy_version 294124 (0.00092) [2022-07-09 14:45:16,314][25689] Fps is (10 sec: 5708.0, 60 sec: 5659.4, 300 sec: 5688.2). Total num frames: 301188096. Throughput: 0: 5850.5. Samples: 301195964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:16,314][25689] Avg episode reward: [(0, '-48.572')] [2022-07-09 14:45:16,864][26022] Updated weights on worker 0-0, policy_version 294134 (0.00092) [2022-07-09 14:45:19,109][26022] Updated weights on worker 0-0, policy_version 294144 (0.00087) [2022-07-09 14:45:20,579][26022] Updated weights on worker 0-0, policy_version 294154 (0.00813) [2022-07-09 14:45:21,336][25689] Fps is (10 sec: 5612.9, 60 sec: 5675.6, 300 sec: 5692.9). Total num frames: 301216768. Throughput: 0: 4992.9. Samples: 301213004. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:21,336][25689] Avg episode reward: [(0, '-49.290')] [2022-07-09 14:45:22,579][26022] Updated weights on worker 0-0, policy_version 294164 (0.00081) [2022-07-09 14:45:24,199][26022] Updated weights on worker 0-0, policy_version 294174 (0.00055) [2022-07-09 14:45:26,168][26022] Updated weights on worker 0-0, policy_version 294184 (0.00099) [2022-07-09 14:45:26,463][25689] Fps is (10 sec: 5650.0, 60 sec: 5640.3, 300 sec: 5687.6). Total num frames: 301245440. Throughput: 0: 5953.9. Samples: 301247318. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:26,463][25689] Avg episode reward: [(0, '-48.894')] [2022-07-09 14:45:27,971][26022] Updated weights on worker 0-0, policy_version 294194 (0.00091) [2022-07-09 14:45:29,657][26022] Updated weights on worker 0-0, policy_version 294204 (0.00090) [2022-07-09 14:45:31,472][25689] Fps is (10 sec: 5657.2, 60 sec: 5674.4, 300 sec: 5687.4). Total num frames: 301274112. Throughput: 0: 5941.4. Samples: 301281508. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:31,473][25689] Avg episode reward: [(0, '-49.255')] [2022-07-09 14:45:31,557][26022] Updated weights on worker 0-0, policy_version 294214 (0.00089) [2022-07-09 14:45:33,207][26022] Updated weights on worker 0-0, policy_version 294224 (0.00082) [2022-07-09 14:45:35,095][26022] Updated weights on worker 0-0, policy_version 294234 (0.00088) [2022-07-09 14:45:36,477][25689] Fps is (10 sec: 5828.7, 60 sec: 5712.0, 300 sec: 5687.5). Total num frames: 301303808. Throughput: 0: 5079.8. Samples: 301298388. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:36,477][25689] Avg episode reward: [(0, '-48.537')] [2022-07-09 14:45:36,979][26022] Updated weights on worker 0-0, policy_version 294244 (0.00090) [2022-07-09 14:45:38,959][26022] Updated weights on worker 0-0, policy_version 294254 (0.00094) [2022-07-09 14:45:40,659][26022] Updated weights on worker 0-0, policy_version 294264 (0.00086) [2022-07-09 14:45:41,522][25689] Fps is (10 sec: 5706.0, 60 sec: 5661.6, 300 sec: 5688.5). Total num frames: 301331456. Throughput: 0: 5908.9. Samples: 301332282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:41,522][25689] Avg episode reward: [(0, '-47.640')] [2022-07-09 14:45:42,478][26022] Updated weights on worker 0-0, policy_version 294274 (0.00078) [2022-07-09 14:45:44,014][26022] Updated weights on worker 0-0, policy_version 294284 (0.00086) [2022-07-09 14:45:46,061][26022] Updated weights on worker 0-0, policy_version 294294 (0.00087) [2022-07-09 14:45:46,578][25689] Fps is (10 sec: 5575.3, 60 sec: 5670.1, 300 sec: 5688.0). Total num frames: 301360128. Throughput: 0: 5910.6. Samples: 301366212. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:46,579][25689] Avg episode reward: [(0, '-46.617')] [2022-07-09 14:45:47,791][26022] Updated weights on worker 0-0, policy_version 294304 (0.00093) [2022-07-09 14:45:49,593][26022] Updated weights on worker 0-0, policy_version 294314 (0.00087) [2022-07-09 14:45:51,347][26022] Updated weights on worker 0-0, policy_version 294324 (0.00096) [2022-07-09 14:45:51,675][25689] Fps is (10 sec: 5647.9, 60 sec: 5645.7, 300 sec: 5676.5). Total num frames: 301388800. Throughput: 0: 5041.7. Samples: 301383364. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:51,675][25689] Avg episode reward: [(0, '-45.821')] [2022-07-09 14:45:53,236][26022] Updated weights on worker 0-0, policy_version 294334 (0.00091) [2022-07-09 14:45:54,958][26022] Updated weights on worker 0-0, policy_version 294344 (0.00095) [2022-07-09 14:45:56,687][25689] Fps is (10 sec: 5672.8, 60 sec: 5646.0, 300 sec: 5684.5). Total num frames: 301417472. Throughput: 0: 5881.1. Samples: 301417246. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:45:56,688][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 14:45:56,831][26022] Updated weights on worker 0-0, policy_version 294354 (0.00081) [2022-07-09 14:45:58,660][26022] Updated weights on worker 0-0, policy_version 294364 (0.00093) [2022-07-09 14:46:00,433][26022] Updated weights on worker 0-0, policy_version 294374 (0.00084) [2022-07-09 14:46:01,716][25689] Fps is (10 sec: 5506.7, 60 sec: 5612.7, 300 sec: 5685.4). Total num frames: 301444096. Throughput: 0: 5886.5. Samples: 301451158. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:46:01,721][25689] Avg episode reward: [(0, '-46.075')] [2022-07-09 14:46:02,622][26022] Updated weights on worker 0-0, policy_version 294384 (0.00107) [2022-07-09 14:46:04,519][26022] Updated weights on worker 0-0, policy_version 294394 (0.00090) [2022-07-09 14:46:06,194][26022] Updated weights on worker 0-0, policy_version 294404 (0.00104) [2022-07-09 14:46:06,787][25689] Fps is (10 sec: 5475.1, 60 sec: 5651.3, 300 sec: 5681.7). Total num frames: 301472768. Throughput: 0: 4936.2. Samples: 301465964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:46:06,787][25689] Avg episode reward: [(0, '-46.099')] [2022-07-09 14:46:08,308][26022] Updated weights on worker 0-0, policy_version 294414 (0.00087) [2022-07-09 14:46:09,898][26022] Updated weights on worker 0-0, policy_version 294424 (0.00095) [2022-07-09 14:46:11,751][26022] Updated weights on worker 0-0, policy_version 294434 (0.00087) [2022-07-09 14:46:11,813][25689] Fps is (10 sec: 5578.2, 60 sec: 5618.3, 300 sec: 5682.3). Total num frames: 301500416. Throughput: 0: 5789.2. Samples: 301499946. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:46:11,813][25689] Avg episode reward: [(0, '-45.997')] [2022-07-09 14:46:13,508][26022] Updated weights on worker 0-0, policy_version 294444 (0.00091) [2022-07-09 14:46:15,318][26022] Updated weights on worker 0-0, policy_version 294454 (0.00091) [2022-07-09 14:46:16,822][25689] Fps is (10 sec: 5510.3, 60 sec: 5618.6, 300 sec: 5673.0). Total num frames: 301528064. Throughput: 0: 5790.1. Samples: 301533828. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 14:46:16,822][25689] Avg episode reward: [(0, '-46.436')] [2022-07-09 14:46:17,258][26022] Updated weights on worker 0-0, policy_version 294464 (0.00532) [2022-07-09 14:46:18,907][26022] Updated weights on worker 0-0, policy_version 294474 (0.00088) [2022-07-09 14:46:20,907][26022] Updated weights on worker 0-0, policy_version 294484 (0.00093) [2022-07-09 14:46:21,823][25689] Fps is (10 sec: 5626.1, 60 sec: 5620.4, 300 sec: 5680.6). Total num frames: 301556736. Throughput: 0: 4947.8. Samples: 301550644. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:21,824][25689] Avg episode reward: [(0, '-47.112')] [2022-07-09 14:46:22,856][26022] Updated weights on worker 0-0, policy_version 294494 (0.00092) [2022-07-09 14:46:24,355][26022] Updated weights on worker 0-0, policy_version 294504 (0.00093) [2022-07-09 14:46:26,439][26022] Updated weights on worker 0-0, policy_version 294514 (0.00087) [2022-07-09 14:46:26,916][25689] Fps is (10 sec: 5579.6, 60 sec: 5606.8, 300 sec: 5669.2). Total num frames: 301584384. Throughput: 0: 5876.5. Samples: 301584252. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:26,916][25689] Avg episode reward: [(0, '-47.353')] [2022-07-09 14:46:28,028][26022] Updated weights on worker 0-0, policy_version 294524 (0.00085) [2022-07-09 14:46:30,250][26022] Updated weights on worker 0-0, policy_version 294534 (0.00087) [2022-07-09 14:46:31,893][26022] Updated weights on worker 0-0, policy_version 294544 (0.00087) [2022-07-09 14:46:31,930][25689] Fps is (10 sec: 5572.3, 60 sec: 5606.2, 300 sec: 5672.7). Total num frames: 301613056. Throughput: 0: 5858.8. Samples: 301617810. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:31,931][25689] Avg episode reward: [(0, '-47.228')] [2022-07-09 14:46:33,658][26022] Updated weights on worker 0-0, policy_version 294554 (0.00086) [2022-07-09 14:46:35,466][26022] Updated weights on worker 0-0, policy_version 294564 (0.00092) [2022-07-09 14:46:36,940][25689] Fps is (10 sec: 5720.7, 60 sec: 5588.9, 300 sec: 5676.0). Total num frames: 301641728. Throughput: 0: 5032.8. Samples: 301635078. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:36,940][25689] Avg episode reward: [(0, '-47.504')] [2022-07-09 14:46:37,363][26022] Updated weights on worker 0-0, policy_version 294574 (0.00088) [2022-07-09 14:46:39,040][26022] Updated weights on worker 0-0, policy_version 294584 (0.00094) [2022-07-09 14:46:40,895][26022] Updated weights on worker 0-0, policy_version 294594 (0.00089) [2022-07-09 14:46:41,970][25689] Fps is (10 sec: 5711.8, 60 sec: 5607.2, 300 sec: 5669.4). Total num frames: 301670400. Throughput: 0: 5886.2. Samples: 301669232. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:41,979][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 14:46:42,764][26022] Updated weights on worker 0-0, policy_version 294604 (0.00098) [2022-07-09 14:46:44,662][26022] Updated weights on worker 0-0, policy_version 294614 (0.00096) [2022-07-09 14:46:46,348][26022] Updated weights on worker 0-0, policy_version 294624 (0.00091) [2022-07-09 14:46:47,067][25689] Fps is (10 sec: 5561.2, 60 sec: 5586.5, 300 sec: 5664.4). Total num frames: 301698048. Throughput: 0: 5895.6. Samples: 301703056. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:47,067][25689] Avg episode reward: [(0, '-46.368')] [2022-07-09 14:46:48,263][26022] Updated weights on worker 0-0, policy_version 294634 (0.00090) [2022-07-09 14:46:49,963][26022] Updated weights on worker 0-0, policy_version 294644 (0.00084) [2022-07-09 14:46:51,868][26022] Updated weights on worker 0-0, policy_version 294654 (0.00093) [2022-07-09 14:46:52,087][25689] Fps is (10 sec: 5566.8, 60 sec: 5593.6, 300 sec: 5664.1). Total num frames: 301726720. Throughput: 0: 5083.6. Samples: 301720280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:52,088][25689] Avg episode reward: [(0, '-46.443')] [2022-07-09 14:46:53,599][26022] Updated weights on worker 0-0, policy_version 294664 (0.00087) [2022-07-09 14:46:55,340][26022] Updated weights on worker 0-0, policy_version 294674 (0.00087) [2022-07-09 14:46:57,116][25689] Fps is (10 sec: 5706.2, 60 sec: 5592.0, 300 sec: 5663.7). Total num frames: 301755392. Throughput: 0: 5917.4. Samples: 301754470. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:46:57,117][25689] Avg episode reward: [(0, '-46.463')] [2022-07-09 14:46:57,271][26022] Updated weights on worker 0-0, policy_version 294684 (0.00094) [2022-07-09 14:46:58,807][26022] Updated weights on worker 0-0, policy_version 294694 (0.00084) [2022-07-09 14:47:00,895][26022] Updated weights on worker 0-0, policy_version 294704 (0.00093) [2022-07-09 14:47:02,134][25689] Fps is (10 sec: 5708.0, 60 sec: 5627.0, 300 sec: 5671.3). Total num frames: 301784064. Throughput: 0: 5920.9. Samples: 301788618. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:02,136][25689] Avg episode reward: [(0, '-45.819')] [2022-07-09 14:47:02,987][26022] Updated weights on worker 0-0, policy_version 294714 (0.00863) [2022-07-09 14:47:04,791][26022] Updated weights on worker 0-0, policy_version 294724 (0.00430) [2022-07-09 14:47:05,771][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:47:05,777][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000294730_301803520.pth [2022-07-09 14:47:05,778][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000292734_299759616.pth [2022-07-09 14:47:06,741][26022] Updated weights on worker 0-0, policy_version 294734 (0.00092) [2022-07-09 14:47:07,258][25689] Fps is (10 sec: 5452.5, 60 sec: 5588.1, 300 sec: 5665.6). Total num frames: 301810688. Throughput: 0: 5804.2. Samples: 301820246. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:07,258][25689] Avg episode reward: [(0, '-45.643')] [2022-07-09 14:47:08,282][26022] Updated weights on worker 0-0, policy_version 294744 (0.00093) [2022-07-09 14:47:10,302][26022] Updated weights on worker 0-0, policy_version 294754 (0.00085) [2022-07-09 14:47:12,011][26022] Updated weights on worker 0-0, policy_version 294764 (0.00084) [2022-07-09 14:47:12,282][25689] Fps is (10 sec: 5448.8, 60 sec: 5605.3, 300 sec: 5661.9). Total num frames: 301839360. Throughput: 0: 5796.0. Samples: 301837328. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:12,282][25689] Avg episode reward: [(0, '-46.742')] [2022-07-09 14:47:13,884][26022] Updated weights on worker 0-0, policy_version 294774 (0.00089) [2022-07-09 14:47:15,665][26022] Updated weights on worker 0-0, policy_version 294784 (0.00089) [2022-07-09 14:47:17,290][25689] Fps is (10 sec: 5614.1, 60 sec: 5605.4, 300 sec: 5658.6). Total num frames: 301867008. Throughput: 0: 5790.6. Samples: 301871286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:17,291][25689] Avg episode reward: [(0, '-46.929')] [2022-07-09 14:47:17,471][26022] Updated weights on worker 0-0, policy_version 294794 (0.00087) [2022-07-09 14:47:19,195][26022] Updated weights on worker 0-0, policy_version 294804 (0.00088) [2022-07-09 14:47:21,015][26022] Updated weights on worker 0-0, policy_version 294814 (0.00086) [2022-07-09 14:47:22,317][25689] Fps is (10 sec: 5816.3, 60 sec: 5636.8, 300 sec: 5666.1). Total num frames: 301897728. Throughput: 0: 5781.2. Samples: 301905302. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:22,318][25689] Avg episode reward: [(0, '-46.634')] [2022-07-09 14:47:22,955][26022] Updated weights on worker 0-0, policy_version 294824 (0.00087) [2022-07-09 14:47:24,620][26022] Updated weights on worker 0-0, policy_version 294834 (0.00099) [2022-07-09 14:47:26,490][26022] Updated weights on worker 0-0, policy_version 294844 (0.00090) [2022-07-09 14:47:27,426][25689] Fps is (10 sec: 5758.0, 60 sec: 5635.2, 300 sec: 5664.1). Total num frames: 301925376. Throughput: 0: 5061.7. Samples: 301922334. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:27,427][25689] Avg episode reward: [(0, '-46.363')] [2022-07-09 14:47:28,292][26022] Updated weights on worker 0-0, policy_version 294854 (0.00088) [2022-07-09 14:47:30,092][26022] Updated weights on worker 0-0, policy_version 294864 (0.00094) [2022-07-09 14:47:31,764][26022] Updated weights on worker 0-0, policy_version 294874 (0.00086) [2022-07-09 14:47:32,438][25689] Fps is (10 sec: 5564.5, 60 sec: 5635.5, 300 sec: 5657.1). Total num frames: 301954048. Throughput: 0: 5909.9. Samples: 301956450. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:32,439][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 14:47:33,687][26022] Updated weights on worker 0-0, policy_version 294884 (0.00089) [2022-07-09 14:47:35,479][26022] Updated weights on worker 0-0, policy_version 294894 (0.00082) [2022-07-09 14:47:37,311][26022] Updated weights on worker 0-0, policy_version 294904 (0.00089) [2022-07-09 14:47:37,451][25689] Fps is (10 sec: 5720.5, 60 sec: 5635.2, 300 sec: 5660.8). Total num frames: 301982720. Throughput: 0: 5920.7. Samples: 301990654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:37,451][25689] Avg episode reward: [(0, '-47.545')] [2022-07-09 14:47:39,063][26022] Updated weights on worker 0-0, policy_version 294914 (0.00093) [2022-07-09 14:47:40,775][26022] Updated weights on worker 0-0, policy_version 294924 (0.00083) [2022-07-09 14:47:42,477][25689] Fps is (10 sec: 5610.1, 60 sec: 5618.6, 300 sec: 5658.1). Total num frames: 302010368. Throughput: 0: 5083.7. Samples: 302007792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:42,478][25689] Avg episode reward: [(0, '-47.472')] [2022-07-09 14:47:42,876][26022] Updated weights on worker 0-0, policy_version 294934 (0.00093) [2022-07-09 14:47:44,486][26022] Updated weights on worker 0-0, policy_version 294944 (0.00091) [2022-07-09 14:47:46,263][26022] Updated weights on worker 0-0, policy_version 294954 (0.00087) [2022-07-09 14:47:47,596][25689] Fps is (10 sec: 5652.4, 60 sec: 5650.5, 300 sec: 5656.3). Total num frames: 302040064. Throughput: 0: 5925.2. Samples: 302041842. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:47,596][25689] Avg episode reward: [(0, '-47.543')] [2022-07-09 14:47:48,132][26022] Updated weights on worker 0-0, policy_version 294964 (0.00081) [2022-07-09 14:47:49,887][26022] Updated weights on worker 0-0, policy_version 294974 (0.00095) [2022-07-09 14:47:51,740][26022] Updated weights on worker 0-0, policy_version 294984 (0.00086) [2022-07-09 14:47:52,618][25689] Fps is (10 sec: 5655.0, 60 sec: 5633.4, 300 sec: 5656.3). Total num frames: 302067712. Throughput: 0: 5917.2. Samples: 302075856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:52,618][25689] Avg episode reward: [(0, '-47.303')] [2022-07-09 14:47:53,555][26022] Updated weights on worker 0-0, policy_version 294994 (0.00082) [2022-07-09 14:47:55,318][26022] Updated weights on worker 0-0, policy_version 295004 (0.00086) [2022-07-09 14:47:57,227][26022] Updated weights on worker 0-0, policy_version 295014 (0.00103) [2022-07-09 14:47:57,638][25689] Fps is (10 sec: 5608.4, 60 sec: 5634.2, 300 sec: 5646.5). Total num frames: 302096384. Throughput: 0: 5063.1. Samples: 302092864. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:47:57,638][25689] Avg episode reward: [(0, '-47.999')] [2022-07-09 14:47:58,971][26022] Updated weights on worker 0-0, policy_version 295024 (0.00084) [2022-07-09 14:48:00,749][26022] Updated weights on worker 0-0, policy_version 295034 (0.00090) [2022-07-09 14:48:02,703][25689] Fps is (10 sec: 5482.9, 60 sec: 5596.0, 300 sec: 5654.4). Total num frames: 302123008. Throughput: 0: 5873.9. Samples: 302126596. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:02,703][25689] Avg episode reward: [(0, '-47.836')] [2022-07-09 14:48:02,984][26022] Updated weights on worker 0-0, policy_version 295044 (0.00086) [2022-07-09 14:48:04,728][26022] Updated weights on worker 0-0, policy_version 295054 (0.00092) [2022-07-09 14:48:06,910][26022] Updated weights on worker 0-0, policy_version 295064 (0.00086) [2022-07-09 14:48:07,751][25689] Fps is (10 sec: 5366.3, 60 sec: 5619.9, 300 sec: 5647.3). Total num frames: 302150656. Throughput: 0: 5760.6. Samples: 302157950. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:07,752][25689] Avg episode reward: [(0, '-47.973')] [2022-07-09 14:48:08,463][26022] Updated weights on worker 0-0, policy_version 295074 (0.00087) [2022-07-09 14:48:10,359][26022] Updated weights on worker 0-0, policy_version 295084 (0.00093) [2022-07-09 14:48:12,256][26022] Updated weights on worker 0-0, policy_version 295094 (0.00093) [2022-07-09 14:48:12,766][25689] Fps is (10 sec: 5596.7, 60 sec: 5620.8, 300 sec: 5647.1). Total num frames: 302179328. Throughput: 0: 4929.8. Samples: 302175182. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:12,767][25689] Avg episode reward: [(0, '-46.560')] [2022-07-09 14:48:14,054][26022] Updated weights on worker 0-0, policy_version 295104 (0.00086) [2022-07-09 14:48:15,746][26022] Updated weights on worker 0-0, policy_version 295114 (0.00086) [2022-07-09 14:48:17,768][25689] Fps is (10 sec: 5520.7, 60 sec: 5604.4, 300 sec: 5636.9). Total num frames: 302205952. Throughput: 0: 5762.9. Samples: 302208870. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:17,769][25689] Avg episode reward: [(0, '-46.955')] [2022-07-09 14:48:17,831][26022] Updated weights on worker 0-0, policy_version 295124 (0.00098) [2022-07-09 14:48:19,266][26022] Updated weights on worker 0-0, policy_version 295134 (0.00093) [2022-07-09 14:48:21,307][26022] Updated weights on worker 0-0, policy_version 295144 (0.00088) [2022-07-09 14:48:22,775][25689] Fps is (10 sec: 5627.1, 60 sec: 5589.4, 300 sec: 5641.0). Total num frames: 302235648. Throughput: 0: 5791.9. Samples: 302242850. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:22,775][25689] Avg episode reward: [(0, '-45.700')] [2022-07-09 14:48:23,053][26022] Updated weights on worker 0-0, policy_version 295154 (0.00088) [2022-07-09 14:48:24,971][26022] Updated weights on worker 0-0, policy_version 295164 (0.00084) [2022-07-09 14:48:26,786][26022] Updated weights on worker 0-0, policy_version 295174 (0.00091) [2022-07-09 14:48:27,858][25689] Fps is (10 sec: 5784.7, 60 sec: 5608.7, 300 sec: 5640.2). Total num frames: 302264320. Throughput: 0: 5072.9. Samples: 302259948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:27,858][25689] Avg episode reward: [(0, '-45.319')] [2022-07-09 14:48:28,466][26022] Updated weights on worker 0-0, policy_version 295184 (0.00088) [2022-07-09 14:48:30,303][26022] Updated weights on worker 0-0, policy_version 295194 (0.00087) [2022-07-09 14:48:32,161][26022] Updated weights on worker 0-0, policy_version 295204 (0.00093) [2022-07-09 14:48:32,859][25689] Fps is (10 sec: 5686.2, 60 sec: 5609.7, 300 sec: 5636.7). Total num frames: 302292992. Throughput: 0: 5912.0. Samples: 302293974. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:32,864][25689] Avg episode reward: [(0, '-44.070')] [2022-07-09 14:48:33,830][26022] Updated weights on worker 0-0, policy_version 295214 (0.00099) [2022-07-09 14:48:35,724][26022] Updated weights on worker 0-0, policy_version 295224 (0.00094) [2022-07-09 14:48:37,485][26022] Updated weights on worker 0-0, policy_version 295234 (0.00082) [2022-07-09 14:48:37,889][25689] Fps is (10 sec: 5614.5, 60 sec: 5591.2, 300 sec: 5636.6). Total num frames: 302320640. Throughput: 0: 5930.3. Samples: 302328196. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:37,890][25689] Avg episode reward: [(0, '-45.256')] [2022-07-09 14:48:39,257][26022] Updated weights on worker 0-0, policy_version 295244 (0.00090) [2022-07-09 14:48:41,151][26022] Updated weights on worker 0-0, policy_version 295254 (0.00090) [2022-07-09 14:48:42,891][25689] Fps is (10 sec: 5614.5, 60 sec: 5610.4, 300 sec: 5630.7). Total num frames: 302349312. Throughput: 0: 5097.9. Samples: 302345400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 14:48:42,891][25689] Avg episode reward: [(0, '-46.590')] [2022-07-09 14:48:42,933][26022] Updated weights on worker 0-0, policy_version 295264 (0.00091) [2022-07-09 14:48:44,688][26022] Updated weights on worker 0-0, policy_version 295274 (0.00091) [2022-07-09 14:48:46,568][26022] Updated weights on worker 0-0, policy_version 295284 (0.00094) [2022-07-09 14:48:47,957][25689] Fps is (10 sec: 5797.1, 60 sec: 5615.2, 300 sec: 5636.7). Total num frames: 302379008. Throughput: 0: 5945.9. Samples: 302379458. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:48:47,958][25689] Avg episode reward: [(0, '-47.835')] [2022-07-09 14:48:48,339][26022] Updated weights on worker 0-0, policy_version 295294 (0.00093) [2022-07-09 14:48:50,089][26022] Updated weights on worker 0-0, policy_version 295304 (0.00097) [2022-07-09 14:48:51,979][26022] Updated weights on worker 0-0, policy_version 295314 (0.00085) [2022-07-09 14:48:53,026][25689] Fps is (10 sec: 5657.9, 60 sec: 5610.9, 300 sec: 5635.5). Total num frames: 302406656. Throughput: 0: 5908.9. Samples: 302413136. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:48:53,026][25689] Avg episode reward: [(0, '-49.134')] [2022-07-09 14:48:53,722][26022] Updated weights on worker 0-0, policy_version 295324 (0.00082) [2022-07-09 14:48:55,832][26022] Updated weights on worker 0-0, policy_version 295334 (0.00089) [2022-07-09 14:48:57,482][26022] Updated weights on worker 0-0, policy_version 295344 (0.00413) [2022-07-09 14:48:58,054][25689] Fps is (10 sec: 5578.2, 60 sec: 5610.2, 300 sec: 5628.8). Total num frames: 302435328. Throughput: 0: 5053.2. Samples: 302430094. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:48:58,054][25689] Avg episode reward: [(0, '-49.092')] [2022-07-09 14:48:59,371][26022] Updated weights on worker 0-0, policy_version 295354 (0.00089) [2022-07-09 14:49:00,936][26022] Updated weights on worker 0-0, policy_version 295364 (0.00090) [2022-07-09 14:49:03,056][25689] Fps is (10 sec: 5410.7, 60 sec: 5599.0, 300 sec: 5629.4). Total num frames: 302460928. Throughput: 0: 5876.4. Samples: 302463902. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:03,058][25689] Avg episode reward: [(0, '-49.189')] [2022-07-09 14:49:03,338][26022] Updated weights on worker 0-0, policy_version 295374 (0.00095) [2022-07-09 14:49:05,112][26022] Updated weights on worker 0-0, policy_version 295384 (0.00091) [2022-07-09 14:49:05,872][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:49:05,882][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000295389_302478336.pth [2022-07-09 14:49:05,882][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000293401_300442624.pth [2022-07-09 14:49:06,779][26022] Updated weights on worker 0-0, policy_version 295394 (0.00092) [2022-07-09 14:49:08,127][25689] Fps is (10 sec: 5387.6, 60 sec: 5613.9, 300 sec: 5628.4). Total num frames: 302489600. Throughput: 0: 5791.2. Samples: 302496266. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:08,128][25689] Avg episode reward: [(0, '-48.161')] [2022-07-09 14:49:08,602][26022] Updated weights on worker 0-0, policy_version 295404 (0.00089) [2022-07-09 14:49:10,241][26022] Updated weights on worker 0-0, policy_version 295414 (0.00085) [2022-07-09 14:49:12,225][26022] Updated weights on worker 0-0, policy_version 295424 (0.00090) [2022-07-09 14:49:13,168][25689] Fps is (10 sec: 5772.2, 60 sec: 5628.4, 300 sec: 5628.3). Total num frames: 302519296. Throughput: 0: 4982.5. Samples: 302513496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:13,169][25689] Avg episode reward: [(0, '-47.323')] [2022-07-09 14:49:13,912][26022] Updated weights on worker 0-0, policy_version 295434 (0.00092) [2022-07-09 14:49:15,778][26022] Updated weights on worker 0-0, policy_version 295444 (0.00875) [2022-07-09 14:49:17,576][26022] Updated weights on worker 0-0, policy_version 295454 (0.00090) [2022-07-09 14:49:18,190][25689] Fps is (10 sec: 5800.8, 60 sec: 5660.5, 300 sec: 5631.6). Total num frames: 302547968. Throughput: 0: 5827.6. Samples: 302547436. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:18,190][25689] Avg episode reward: [(0, '-47.238')] [2022-07-09 14:49:19,303][26022] Updated weights on worker 0-0, policy_version 295464 (0.00091) [2022-07-09 14:49:21,223][26022] Updated weights on worker 0-0, policy_version 295474 (0.00092) [2022-07-09 14:49:23,002][26022] Updated weights on worker 0-0, policy_version 295484 (0.00092) [2022-07-09 14:49:23,196][25689] Fps is (10 sec: 5616.4, 60 sec: 5626.6, 300 sec: 5623.3). Total num frames: 302575616. Throughput: 0: 5831.4. Samples: 302581346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:23,198][25689] Avg episode reward: [(0, '-47.466')] [2022-07-09 14:49:24,793][26022] Updated weights on worker 0-0, policy_version 295494 (0.00078) [2022-07-09 14:49:26,752][26022] Updated weights on worker 0-0, policy_version 295504 (0.00090) [2022-07-09 14:49:28,249][25689] Fps is (10 sec: 5598.8, 60 sec: 5629.4, 300 sec: 5629.4). Total num frames: 302604288. Throughput: 0: 5067.7. Samples: 302598236. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:28,249][25689] Avg episode reward: [(0, '-48.344')] [2022-07-09 14:49:28,483][26022] Updated weights on worker 0-0, policy_version 295514 (0.00094) [2022-07-09 14:49:30,427][26022] Updated weights on worker 0-0, policy_version 295524 (0.00084) [2022-07-09 14:49:32,260][26022] Updated weights on worker 0-0, policy_version 295534 (0.00086) [2022-07-09 14:49:33,309][25689] Fps is (10 sec: 5670.4, 60 sec: 5624.0, 300 sec: 5632.5). Total num frames: 302632960. Throughput: 0: 5887.5. Samples: 302632074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:33,309][25689] Avg episode reward: [(0, '-48.404')] [2022-07-09 14:49:33,833][26022] Updated weights on worker 0-0, policy_version 295544 (0.00100) [2022-07-09 14:49:35,931][26022] Updated weights on worker 0-0, policy_version 295554 (0.00106) [2022-07-09 14:49:37,535][26022] Updated weights on worker 0-0, policy_version 295564 (0.00108) [2022-07-09 14:49:38,315][25689] Fps is (10 sec: 5696.8, 60 sec: 5643.2, 300 sec: 5626.5). Total num frames: 302661632. Throughput: 0: 5900.8. Samples: 302666190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:38,315][25689] Avg episode reward: [(0, '-48.012')] [2022-07-09 14:49:39,526][26022] Updated weights on worker 0-0, policy_version 295574 (0.00086) [2022-07-09 14:49:41,207][26022] Updated weights on worker 0-0, policy_version 295584 (0.00453) [2022-07-09 14:49:43,124][26022] Updated weights on worker 0-0, policy_version 295594 (0.00091) [2022-07-09 14:49:43,354][25689] Fps is (10 sec: 5606.6, 60 sec: 5622.7, 300 sec: 5625.1). Total num frames: 302689280. Throughput: 0: 5050.2. Samples: 302683148. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:43,359][25689] Avg episode reward: [(0, '-47.641')] [2022-07-09 14:49:44,936][26022] Updated weights on worker 0-0, policy_version 295604 (0.00057) [2022-07-09 14:49:46,740][26022] Updated weights on worker 0-0, policy_version 295614 (0.00096) [2022-07-09 14:49:48,403][26022] Updated weights on worker 0-0, policy_version 295624 (0.00084) [2022-07-09 14:49:48,500][25689] Fps is (10 sec: 5630.0, 60 sec: 5615.3, 300 sec: 5622.6). Total num frames: 302718976. Throughput: 0: 5861.4. Samples: 302716938. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:48,501][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 14:49:50,327][26022] Updated weights on worker 0-0, policy_version 295634 (0.00088) [2022-07-09 14:49:52,235][26022] Updated weights on worker 0-0, policy_version 295644 (0.00084) [2022-07-09 14:49:53,510][25689] Fps is (10 sec: 5747.4, 60 sec: 5637.7, 300 sec: 5622.7). Total num frames: 302747648. Throughput: 0: 5896.0. Samples: 302751178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:53,510][25689] Avg episode reward: [(0, '-47.862')] [2022-07-09 14:49:53,850][26022] Updated weights on worker 0-0, policy_version 295654 (0.00089) [2022-07-09 14:49:55,654][26022] Updated weights on worker 0-0, policy_version 295664 (0.00090) [2022-07-09 14:49:57,567][26022] Updated weights on worker 0-0, policy_version 295674 (0.00082) [2022-07-09 14:49:58,519][25689] Fps is (10 sec: 5723.6, 60 sec: 5639.5, 300 sec: 5623.2). Total num frames: 302776320. Throughput: 0: 5891.6. Samples: 302785226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:49:58,520][25689] Avg episode reward: [(0, '-47.028')] [2022-07-09 14:49:59,242][26022] Updated weights on worker 0-0, policy_version 295684 (0.00085) [2022-07-09 14:50:01,257][26022] Updated weights on worker 0-0, policy_version 295694 (0.00094) [2022-07-09 14:50:03,222][26022] Updated weights on worker 0-0, policy_version 295704 (0.00083) [2022-07-09 14:50:03,526][25689] Fps is (10 sec: 5418.6, 60 sec: 5639.1, 300 sec: 5621.9). Total num frames: 302801920. Throughput: 0: 5851.4. Samples: 302801180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:03,526][25689] Avg episode reward: [(0, '-45.990')] [2022-07-09 14:50:05,295][26022] Updated weights on worker 0-0, policy_version 295714 (0.00095) [2022-07-09 14:50:06,875][26022] Updated weights on worker 0-0, policy_version 295724 (0.00088) [2022-07-09 14:50:08,601][25689] Fps is (10 sec: 5383.1, 60 sec: 5638.7, 300 sec: 5617.7). Total num frames: 302830592. Throughput: 0: 5821.6. Samples: 302833956. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:08,601][25689] Avg episode reward: [(0, '-46.310')] [2022-07-09 14:50:08,774][26022] Updated weights on worker 0-0, policy_version 295734 (0.00095) [2022-07-09 14:50:10,486][26022] Updated weights on worker 0-0, policy_version 295744 (0.00087) [2022-07-09 14:50:12,207][26022] Updated weights on worker 0-0, policy_version 295754 (0.00088) [2022-07-09 14:50:13,629][25689] Fps is (10 sec: 5676.0, 60 sec: 5623.0, 300 sec: 5620.9). Total num frames: 302859264. Throughput: 0: 5811.1. Samples: 302868090. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:13,629][25689] Avg episode reward: [(0, '-45.797')] [2022-07-09 14:50:14,216][26022] Updated weights on worker 0-0, policy_version 295764 (0.00091) [2022-07-09 14:50:16,018][26022] Updated weights on worker 0-0, policy_version 295774 (0.00083) [2022-07-09 14:50:17,831][26022] Updated weights on worker 0-0, policy_version 295784 (0.00089) [2022-07-09 14:50:18,700][25689] Fps is (10 sec: 5678.2, 60 sec: 5618.3, 300 sec: 5619.9). Total num frames: 302887936. Throughput: 0: 4944.4. Samples: 302885008. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:18,700][25689] Avg episode reward: [(0, '-45.053')] [2022-07-09 14:50:19,547][26022] Updated weights on worker 0-0, policy_version 295794 (0.00084) [2022-07-09 14:50:21,467][26022] Updated weights on worker 0-0, policy_version 295804 (0.00059) [2022-07-09 14:50:23,268][26022] Updated weights on worker 0-0, policy_version 295814 (0.00115) [2022-07-09 14:50:23,726][25689] Fps is (10 sec: 5679.0, 60 sec: 5633.4, 300 sec: 5621.8). Total num frames: 302916608. Throughput: 0: 5815.6. Samples: 302918658. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:23,727][25689] Avg episode reward: [(0, '-45.234')] [2022-07-09 14:50:25,141][26022] Updated weights on worker 0-0, policy_version 295824 (0.00098) [2022-07-09 14:50:26,968][26022] Updated weights on worker 0-0, policy_version 295834 (0.00092) [2022-07-09 14:50:28,711][26022] Updated weights on worker 0-0, policy_version 295844 (0.00098) [2022-07-09 14:50:28,786][25689] Fps is (10 sec: 5583.9, 60 sec: 5615.8, 300 sec: 5617.4). Total num frames: 302944256. Throughput: 0: 5880.2. Samples: 302952648. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:28,787][25689] Avg episode reward: [(0, '-46.772')] [2022-07-09 14:50:30,555][26022] Updated weights on worker 0-0, policy_version 295854 (0.00603) [2022-07-09 14:50:32,474][26022] Updated weights on worker 0-0, policy_version 295864 (0.00090) [2022-07-09 14:50:33,804][25689] Fps is (10 sec: 5588.7, 60 sec: 5619.8, 300 sec: 5613.8). Total num frames: 302972928. Throughput: 0: 5023.1. Samples: 302969432. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:33,804][25689] Avg episode reward: [(0, '-46.767')] [2022-07-09 14:50:33,975][26022] Updated weights on worker 0-0, policy_version 295874 (0.00101) [2022-07-09 14:50:36,144][26022] Updated weights on worker 0-0, policy_version 295884 (0.00092) [2022-07-09 14:50:37,647][26022] Updated weights on worker 0-0, policy_version 295894 (0.00083) [2022-07-09 14:50:38,829][25689] Fps is (10 sec: 5607.7, 60 sec: 5601.0, 300 sec: 5614.1). Total num frames: 303000576. Throughput: 0: 5889.8. Samples: 303003566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:38,830][25689] Avg episode reward: [(0, '-47.135')] [2022-07-09 14:50:39,627][26022] Updated weights on worker 0-0, policy_version 295904 (0.00094) [2022-07-09 14:50:41,279][26022] Updated weights on worker 0-0, policy_version 295914 (0.00095) [2022-07-09 14:50:43,164][26022] Updated weights on worker 0-0, policy_version 295924 (0.00091) [2022-07-09 14:50:43,840][25689] Fps is (10 sec: 5509.7, 60 sec: 5603.7, 300 sec: 5611.5). Total num frames: 303028224. Throughput: 0: 5935.6. Samples: 303038044. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:43,841][25689] Avg episode reward: [(0, '-48.023')] [2022-07-09 14:50:44,757][26022] Updated weights on worker 0-0, policy_version 295934 (0.00725) [2022-07-09 14:50:46,801][26022] Updated weights on worker 0-0, policy_version 295944 (0.00093) [2022-07-09 14:50:48,337][26022] Updated weights on worker 0-0, policy_version 295954 (0.00093) [2022-07-09 14:50:48,895][25689] Fps is (10 sec: 5697.0, 60 sec: 5612.1, 300 sec: 5615.8). Total num frames: 303057920. Throughput: 0: 5092.8. Samples: 303055058. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:48,897][25689] Avg episode reward: [(0, '-47.989')] [2022-07-09 14:50:50,337][26022] Updated weights on worker 0-0, policy_version 295964 (0.00087) [2022-07-09 14:50:52,106][26022] Updated weights on worker 0-0, policy_version 295974 (0.00089) [2022-07-09 14:50:53,908][25689] Fps is (10 sec: 5797.4, 60 sec: 5611.8, 300 sec: 5615.8). Total num frames: 303086592. Throughput: 0: 5949.6. Samples: 303089044. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:53,908][25689] Avg episode reward: [(0, '-47.201')] [2022-07-09 14:50:54,061][26022] Updated weights on worker 0-0, policy_version 295984 (0.00088) [2022-07-09 14:50:55,668][26022] Updated weights on worker 0-0, policy_version 295994 (0.00087) [2022-07-09 14:50:57,721][26022] Updated weights on worker 0-0, policy_version 296004 (0.00085) [2022-07-09 14:50:58,932][25689] Fps is (10 sec: 5815.0, 60 sec: 5627.4, 300 sec: 5626.2). Total num frames: 303116288. Throughput: 0: 5947.1. Samples: 303123120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:50:58,933][25689] Avg episode reward: [(0, '-46.756')] [2022-07-09 14:50:59,412][26022] Updated weights on worker 0-0, policy_version 296014 (0.00080) [2022-07-09 14:51:01,334][26022] Updated weights on worker 0-0, policy_version 296024 (0.00089) [2022-07-09 14:51:03,441][26022] Updated weights on worker 0-0, policy_version 296034 (0.00086) [2022-07-09 14:51:04,029][25689] Fps is (10 sec: 5463.3, 60 sec: 5619.0, 300 sec: 5615.3). Total num frames: 303141888. Throughput: 0: 5043.4. Samples: 303139868. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:51:04,031][25689] Avg episode reward: [(0, '-46.663')] [2022-07-09 14:51:05,388][26022] Updated weights on worker 0-0, policy_version 296044 (0.00093) [2022-07-09 14:51:06,007][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:51:06,021][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000296048_303153152.pth [2022-07-09 14:51:06,021][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000294070_301127680.pth [2022-07-09 14:51:07,097][26022] Updated weights on worker 0-0, policy_version 296054 (0.00111) [2022-07-09 14:51:09,099][25689] Fps is (10 sec: 5237.4, 60 sec: 5602.5, 300 sec: 5614.5). Total num frames: 303169536. Throughput: 0: 5761.5. Samples: 303171466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 14:51:09,100][25689] Avg episode reward: [(0, '-45.545')] [2022-07-09 14:51:09,107][26022] Updated weights on worker 0-0, policy_version 296064 (0.00088) [2022-07-09 14:51:10,726][26022] Updated weights on worker 0-0, policy_version 296074 (0.00093) [2022-07-09 14:51:12,546][26022] Updated weights on worker 0-0, policy_version 296084 (0.00087) [2022-07-09 14:51:14,120][25689] Fps is (10 sec: 5683.1, 60 sec: 5620.1, 300 sec: 5621.2). Total num frames: 303199232. Throughput: 0: 5777.0. Samples: 303205806. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:14,121][25689] Avg episode reward: [(0, '-46.588')] [2022-07-09 14:51:14,222][26022] Updated weights on worker 0-0, policy_version 296094 (0.00087) [2022-07-09 14:51:16,124][26022] Updated weights on worker 0-0, policy_version 296104 (0.00091) [2022-07-09 14:51:17,663][26022] Updated weights on worker 0-0, policy_version 296114 (0.00087) [2022-07-09 14:51:19,127][25689] Fps is (10 sec: 5718.5, 60 sec: 5609.1, 300 sec: 5617.6). Total num frames: 303226880. Throughput: 0: 4951.0. Samples: 303223104. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:19,128][25689] Avg episode reward: [(0, '-46.373')] [2022-07-09 14:51:19,913][26022] Updated weights on worker 0-0, policy_version 296124 (0.00085) [2022-07-09 14:51:21,451][26022] Updated weights on worker 0-0, policy_version 296134 (0.00083) [2022-07-09 14:51:23,299][26022] Updated weights on worker 0-0, policy_version 296144 (0.00084) [2022-07-09 14:51:24,132][25689] Fps is (10 sec: 5625.3, 60 sec: 5611.2, 300 sec: 5622.7). Total num frames: 303255552. Throughput: 0: 5827.7. Samples: 303257018. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:24,132][25689] Avg episode reward: [(0, '-47.649')] [2022-07-09 14:51:25,082][26022] Updated weights on worker 0-0, policy_version 296154 (0.00086) [2022-07-09 14:51:26,920][26022] Updated weights on worker 0-0, policy_version 296164 (0.00095) [2022-07-09 14:51:28,630][26022] Updated weights on worker 0-0, policy_version 296174 (0.00093) [2022-07-09 14:51:29,194][25689] Fps is (10 sec: 5696.0, 60 sec: 5627.8, 300 sec: 5621.8). Total num frames: 303284224. Throughput: 0: 5920.2. Samples: 303290434. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:29,195][25689] Avg episode reward: [(0, '-47.638')] [2022-07-09 14:51:30,818][26022] Updated weights on worker 0-0, policy_version 296184 (0.00088) [2022-07-09 14:51:32,352][26022] Updated weights on worker 0-0, policy_version 296194 (0.00092) [2022-07-09 14:51:34,239][25689] Fps is (10 sec: 5471.1, 60 sec: 5591.5, 300 sec: 5614.3). Total num frames: 303310848. Throughput: 0: 5044.2. Samples: 303307290. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:34,239][25689] Avg episode reward: [(0, '-47.713')] [2022-07-09 14:51:34,390][26022] Updated weights on worker 0-0, policy_version 296204 (0.00099) [2022-07-09 14:51:35,970][26022] Updated weights on worker 0-0, policy_version 296214 (0.00090) [2022-07-09 14:51:38,019][26022] Updated weights on worker 0-0, policy_version 296224 (0.00088) [2022-07-09 14:51:39,270][25689] Fps is (10 sec: 5691.6, 60 sec: 5641.8, 300 sec: 5621.2). Total num frames: 303341568. Throughput: 0: 5852.1. Samples: 303340980. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:39,270][25689] Avg episode reward: [(0, '-48.548')] [2022-07-09 14:51:39,603][26022] Updated weights on worker 0-0, policy_version 296234 (0.00084) [2022-07-09 14:51:41,659][26022] Updated weights on worker 0-0, policy_version 296244 (0.00082) [2022-07-09 14:51:43,262][26022] Updated weights on worker 0-0, policy_version 296254 (0.00096) [2022-07-09 14:51:44,313][25689] Fps is (10 sec: 5691.9, 60 sec: 5621.8, 300 sec: 5618.7). Total num frames: 303368192. Throughput: 0: 5854.8. Samples: 303375178. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:44,314][25689] Avg episode reward: [(0, '-48.354')] [2022-07-09 14:51:45,265][26022] Updated weights on worker 0-0, policy_version 296264 (0.00087) [2022-07-09 14:51:46,868][26022] Updated weights on worker 0-0, policy_version 296274 (0.00088) [2022-07-09 14:51:49,068][26022] Updated weights on worker 0-0, policy_version 296284 (0.00094) [2022-07-09 14:51:49,375][25689] Fps is (10 sec: 5370.8, 60 sec: 5587.3, 300 sec: 5614.5). Total num frames: 303395840. Throughput: 0: 5041.5. Samples: 303392172. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:49,375][25689] Avg episode reward: [(0, '-48.607')] [2022-07-09 14:51:50,559][26022] Updated weights on worker 0-0, policy_version 296294 (0.00101) [2022-07-09 14:51:52,519][26022] Updated weights on worker 0-0, policy_version 296304 (0.00092) [2022-07-09 14:51:54,091][26022] Updated weights on worker 0-0, policy_version 296314 (0.00093) [2022-07-09 14:51:54,399][25689] Fps is (10 sec: 5787.4, 60 sec: 5620.2, 300 sec: 5621.5). Total num frames: 303426560. Throughput: 0: 5908.5. Samples: 303426406. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:54,399][25689] Avg episode reward: [(0, '-48.734')] [2022-07-09 14:51:56,171][26022] Updated weights on worker 0-0, policy_version 296324 (0.00090) [2022-07-09 14:51:57,681][26022] Updated weights on worker 0-0, policy_version 296334 (0.00098) [2022-07-09 14:51:59,405][25689] Fps is (10 sec: 5819.2, 60 sec: 5588.0, 300 sec: 5618.3). Total num frames: 303454208. Throughput: 0: 5937.8. Samples: 303460538. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:51:59,405][25689] Avg episode reward: [(0, '-49.312')] [2022-07-09 14:51:59,567][26022] Updated weights on worker 0-0, policy_version 296344 (0.00089) [2022-07-09 14:52:01,268][26022] Updated weights on worker 0-0, policy_version 296354 (0.00087) [2022-07-09 14:52:03,706][26022] Updated weights on worker 0-0, policy_version 296364 (0.00092) [2022-07-09 14:52:04,406][25689] Fps is (10 sec: 5320.8, 60 sec: 5596.9, 300 sec: 5617.1). Total num frames: 303479808. Throughput: 0: 5105.4. Samples: 303477760. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:04,407][25689] Avg episode reward: [(0, '-48.531')] [2022-07-09 14:52:05,176][26022] Updated weights on worker 0-0, policy_version 296374 (0.00090) [2022-07-09 14:52:07,527][26022] Updated weights on worker 0-0, policy_version 296384 (0.00086) [2022-07-09 14:52:08,914][26022] Updated weights on worker 0-0, policy_version 296394 (0.00094) [2022-07-09 14:52:09,535][25689] Fps is (10 sec: 5458.7, 60 sec: 5625.3, 300 sec: 5618.6). Total num frames: 303509504. Throughput: 0: 5808.9. Samples: 303509280. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:09,535][25689] Avg episode reward: [(0, '-48.096')] [2022-07-09 14:52:10,912][26022] Updated weights on worker 0-0, policy_version 296404 (0.00092) [2022-07-09 14:52:12,667][26022] Updated weights on worker 0-0, policy_version 296414 (0.00089) [2022-07-09 14:52:14,363][26022] Updated weights on worker 0-0, policy_version 296424 (0.00086) [2022-07-09 14:52:14,619][25689] Fps is (10 sec: 5915.5, 60 sec: 5636.3, 300 sec: 5627.5). Total num frames: 303540224. Throughput: 0: 5798.7. Samples: 303543660. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:14,620][25689] Avg episode reward: [(0, '-47.669')] [2022-07-09 14:52:16,282][26022] Updated weights on worker 0-0, policy_version 296434 (0.00085) [2022-07-09 14:52:17,970][26022] Updated weights on worker 0-0, policy_version 296444 (0.00100) [2022-07-09 14:52:19,657][25689] Fps is (10 sec: 5564.2, 60 sec: 5599.7, 300 sec: 5610.1). Total num frames: 303565824. Throughput: 0: 4948.2. Samples: 303560748. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:19,658][25689] Avg episode reward: [(0, '-47.208')] [2022-07-09 14:52:19,961][26022] Updated weights on worker 0-0, policy_version 296454 (0.00089) [2022-07-09 14:52:21,482][26022] Updated weights on worker 0-0, policy_version 296464 (0.00084) [2022-07-09 14:52:23,497][26022] Updated weights on worker 0-0, policy_version 296474 (0.00093) [2022-07-09 14:52:24,723][25689] Fps is (10 sec: 5574.4, 60 sec: 5627.8, 300 sec: 5621.2). Total num frames: 303596544. Throughput: 0: 5774.9. Samples: 303595086. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:24,723][25689] Avg episode reward: [(0, '-47.472')] [2022-07-09 14:52:25,171][26022] Updated weights on worker 0-0, policy_version 296484 (0.00093) [2022-07-09 14:52:27,104][26022] Updated weights on worker 0-0, policy_version 296494 (0.00091) [2022-07-09 14:52:28,929][26022] Updated weights on worker 0-0, policy_version 296504 (0.00085) [2022-07-09 14:52:29,840][25689] Fps is (10 sec: 5731.6, 60 sec: 5605.8, 300 sec: 5615.8). Total num frames: 303624192. Throughput: 0: 5894.7. Samples: 303628974. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:29,841][25689] Avg episode reward: [(0, '-46.788')] [2022-07-09 14:52:30,621][26022] Updated weights on worker 0-0, policy_version 296514 (0.00091) [2022-07-09 14:52:32,631][26022] Updated weights on worker 0-0, policy_version 296524 (0.00091) [2022-07-09 14:52:34,180][26022] Updated weights on worker 0-0, policy_version 296534 (0.00084) [2022-07-09 14:52:34,842][25689] Fps is (10 sec: 5667.0, 60 sec: 5660.4, 300 sec: 5619.4). Total num frames: 303653888. Throughput: 0: 5921.2. Samples: 303663400. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:34,842][25689] Avg episode reward: [(0, '-47.051')] [2022-07-09 14:52:36,295][26022] Updated weights on worker 0-0, policy_version 296544 (0.00093) [2022-07-09 14:52:37,793][26022] Updated weights on worker 0-0, policy_version 296554 (0.00086) [2022-07-09 14:52:39,714][26022] Updated weights on worker 0-0, policy_version 296564 (0.00090) [2022-07-09 14:52:39,847][25689] Fps is (10 sec: 5833.0, 60 sec: 5629.1, 300 sec: 5623.3). Total num frames: 303682560. Throughput: 0: 5940.0. Samples: 303680678. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:39,847][25689] Avg episode reward: [(0, '-47.906')] [2022-07-09 14:52:41,356][26022] Updated weights on worker 0-0, policy_version 296574 (0.00108) [2022-07-09 14:52:43,164][26022] Updated weights on worker 0-0, policy_version 296584 (0.00082) [2022-07-09 14:52:44,898][25689] Fps is (10 sec: 5702.2, 60 sec: 5662.1, 300 sec: 5621.1). Total num frames: 303711232. Throughput: 0: 5935.3. Samples: 303714834. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:44,899][25689] Avg episode reward: [(0, '-48.013')] [2022-07-09 14:52:44,988][26022] Updated weights on worker 0-0, policy_version 296594 (0.00094) [2022-07-09 14:52:46,807][26022] Updated weights on worker 0-0, policy_version 296604 (0.00099) [2022-07-09 14:52:48,587][26022] Updated weights on worker 0-0, policy_version 296614 (0.00086) [2022-07-09 14:52:49,963][25689] Fps is (10 sec: 5668.5, 60 sec: 5678.7, 300 sec: 5623.7). Total num frames: 303739904. Throughput: 0: 5950.3. Samples: 303748712. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:49,963][25689] Avg episode reward: [(0, '-48.400')] [2022-07-09 14:52:50,404][26022] Updated weights on worker 0-0, policy_version 296624 (0.00084) [2022-07-09 14:52:52,266][26022] Updated weights on worker 0-0, policy_version 296634 (0.00089) [2022-07-09 14:52:54,186][26022] Updated weights on worker 0-0, policy_version 296644 (0.00095) [2022-07-09 14:52:55,002][25689] Fps is (10 sec: 5675.4, 60 sec: 5643.5, 300 sec: 5623.4). Total num frames: 303768576. Throughput: 0: 5075.6. Samples: 303765728. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:52:55,003][25689] Avg episode reward: [(0, '-48.971')] [2022-07-09 14:52:55,946][26022] Updated weights on worker 0-0, policy_version 296654 (0.00092) [2022-07-09 14:52:57,688][26022] Updated weights on worker 0-0, policy_version 296664 (0.00091) [2022-07-09 14:52:59,531][26022] Updated weights on worker 0-0, policy_version 296674 (0.00081) [2022-07-09 14:53:00,034][25689] Fps is (10 sec: 5693.9, 60 sec: 5658.0, 300 sec: 5630.9). Total num frames: 303797248. Throughput: 0: 5892.8. Samples: 303799638. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:00,036][25689] Avg episode reward: [(0, '-48.309')] [2022-07-09 14:53:01,447][26022] Updated weights on worker 0-0, policy_version 296684 (0.00078) [2022-07-09 14:53:03,538][26022] Updated weights on worker 0-0, policy_version 296694 (0.00086) [2022-07-09 14:53:05,055][25689] Fps is (10 sec: 5500.3, 60 sec: 5673.0, 300 sec: 5628.0). Total num frames: 303823872. Throughput: 0: 5790.7. Samples: 303831560. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:05,057][25689] Avg episode reward: [(0, '-47.540')] [2022-07-09 14:53:05,319][26022] Updated weights on worker 0-0, policy_version 296704 (0.00085) [2022-07-09 14:53:06,333][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:53:06,341][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000296708_303828992.pth [2022-07-09 14:53:06,342][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000294730_301803520.pth [2022-07-09 14:53:07,063][26022] Updated weights on worker 0-0, policy_version 296714 (0.00400) [2022-07-09 14:53:09,114][26022] Updated weights on worker 0-0, policy_version 296724 (0.00093) [2022-07-09 14:53:10,178][25689] Fps is (10 sec: 5249.0, 60 sec: 5622.9, 300 sec: 5619.0). Total num frames: 303850496. Throughput: 0: 4934.7. Samples: 303848470. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:10,180][25689] Avg episode reward: [(0, '-47.516')] [2022-07-09 14:53:10,679][26022] Updated weights on worker 0-0, policy_version 296734 (0.00080) [2022-07-09 14:53:12,729][26022] Updated weights on worker 0-0, policy_version 296744 (0.00090) [2022-07-09 14:53:14,255][26022] Updated weights on worker 0-0, policy_version 296754 (0.00084) [2022-07-09 14:53:15,280][25689] Fps is (10 sec: 5608.4, 60 sec: 5621.3, 300 sec: 5630.9). Total num frames: 303881216. Throughput: 0: 5752.1. Samples: 303882368. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:15,281][25689] Avg episode reward: [(0, '-47.451')] [2022-07-09 14:53:16,302][26022] Updated weights on worker 0-0, policy_version 296764 (0.00094) [2022-07-09 14:53:17,884][26022] Updated weights on worker 0-0, policy_version 296774 (0.00081) [2022-07-09 14:53:19,931][26022] Updated weights on worker 0-0, policy_version 296784 (0.00104) [2022-07-09 14:53:20,314][25689] Fps is (10 sec: 5758.5, 60 sec: 5655.3, 300 sec: 5623.5). Total num frames: 303908864. Throughput: 0: 5765.0. Samples: 303916554. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:20,315][25689] Avg episode reward: [(0, '-47.418')] [2022-07-09 14:53:21,538][26022] Updated weights on worker 0-0, policy_version 296794 (0.00086) [2022-07-09 14:53:23,405][26022] Updated weights on worker 0-0, policy_version 296804 (0.00090) [2022-07-09 14:53:25,146][26022] Updated weights on worker 0-0, policy_version 296814 (0.00091) [2022-07-09 14:53:25,370][25689] Fps is (10 sec: 5683.4, 60 sec: 5639.4, 300 sec: 5627.5). Total num frames: 303938560. Throughput: 0: 5026.0. Samples: 303933660. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:25,371][25689] Avg episode reward: [(0, '-47.216')] [2022-07-09 14:53:27,121][26022] Updated weights on worker 0-0, policy_version 296824 (0.00084) [2022-07-09 14:53:28,829][26022] Updated weights on worker 0-0, policy_version 296834 (0.00085) [2022-07-09 14:53:30,450][25689] Fps is (10 sec: 5758.7, 60 sec: 5659.8, 300 sec: 5626.0). Total num frames: 303967232. Throughput: 0: 5865.7. Samples: 303967376. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:30,452][25689] Avg episode reward: [(0, '-48.004')] [2022-07-09 14:53:30,627][26022] Updated weights on worker 0-0, policy_version 296844 (0.00091) [2022-07-09 14:53:32,424][26022] Updated weights on worker 0-0, policy_version 296854 (0.00083) [2022-07-09 14:53:34,372][26022] Updated weights on worker 0-0, policy_version 296864 (0.00085) [2022-07-09 14:53:35,454][25689] Fps is (10 sec: 5483.2, 60 sec: 5608.8, 300 sec: 5623.0). Total num frames: 303993856. Throughput: 0: 5897.4. Samples: 304001344. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:35,455][25689] Avg episode reward: [(0, '-48.684')] [2022-07-09 14:53:36,065][26022] Updated weights on worker 0-0, policy_version 296874 (0.00088) [2022-07-09 14:53:38,138][26022] Updated weights on worker 0-0, policy_version 296884 (0.00083) [2022-07-09 14:53:39,776][26022] Updated weights on worker 0-0, policy_version 296894 (0.00089) [2022-07-09 14:53:40,465][25689] Fps is (10 sec: 5521.5, 60 sec: 5608.4, 300 sec: 5622.9). Total num frames: 304022528. Throughput: 0: 5057.3. Samples: 304018460. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2022-07-09 14:53:40,465][25689] Avg episode reward: [(0, '-48.303')] [2022-07-09 14:53:41,567][26022] Updated weights on worker 0-0, policy_version 296904 (0.00095) [2022-07-09 14:53:43,307][26022] Updated weights on worker 0-0, policy_version 296914 (0.00085) [2022-07-09 14:53:45,327][26022] Updated weights on worker 0-0, policy_version 296924 (0.00093) [2022-07-09 14:53:45,489][25689] Fps is (10 sec: 5714.7, 60 sec: 5610.9, 300 sec: 5620.2). Total num frames: 304051200. Throughput: 0: 5884.8. Samples: 304052056. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:53:45,489][25689] Avg episode reward: [(0, '-48.168')] [2022-07-09 14:53:46,871][26022] Updated weights on worker 0-0, policy_version 296934 (0.00090) [2022-07-09 14:53:49,091][26022] Updated weights on worker 0-0, policy_version 296944 (0.00080) [2022-07-09 14:53:50,280][26022] Updated weights on worker 0-0, policy_version 296954 (0.00082) [2022-07-09 14:53:50,529][25689] Fps is (10 sec: 6003.0, 60 sec: 5663.9, 300 sec: 5634.5). Total num frames: 304082944. Throughput: 0: 5929.5. Samples: 304086434. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:53:50,530][25689] Avg episode reward: [(0, '-48.163')] [2022-07-09 14:53:52,793][26022] Updated weights on worker 0-0, policy_version 296964 (0.00092) [2022-07-09 14:53:53,893][26022] Updated weights on worker 0-0, policy_version 296974 (0.00085) [2022-07-09 14:53:55,557][25689] Fps is (10 sec: 5390.7, 60 sec: 5563.5, 300 sec: 5613.9). Total num frames: 304105472. Throughput: 0: 5053.1. Samples: 304102922. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:53:55,557][25689] Avg episode reward: [(0, '-48.610')] [2022-07-09 14:53:56,246][26022] Updated weights on worker 0-0, policy_version 296984 (0.00086) [2022-07-09 14:53:57,927][26022] Updated weights on worker 0-0, policy_version 296994 (0.00085) [2022-07-09 14:53:59,787][26022] Updated weights on worker 0-0, policy_version 297004 (0.00085) [2022-07-09 14:54:00,558][25689] Fps is (10 sec: 5411.6, 60 sec: 5617.1, 300 sec: 5634.6). Total num frames: 304137216. Throughput: 0: 5906.2. Samples: 304137134. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:00,558][25689] Avg episode reward: [(0, '-49.342')] [2022-07-09 14:54:01,599][26022] Updated weights on worker 0-0, policy_version 297014 (0.00092) [2022-07-09 14:54:03,778][26022] Updated weights on worker 0-0, policy_version 297024 (0.00089) [2022-07-09 14:54:05,379][26022] Updated weights on worker 0-0, policy_version 297034 (0.00095) [2022-07-09 14:54:05,571][25689] Fps is (10 sec: 5726.0, 60 sec: 5600.9, 300 sec: 5625.3). Total num frames: 304162816. Throughput: 0: 5832.0. Samples: 304169176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:05,572][25689] Avg episode reward: [(0, '-48.988')] [2022-07-09 14:54:07,393][26022] Updated weights on worker 0-0, policy_version 297044 (0.00092) [2022-07-09 14:54:09,112][26022] Updated weights on worker 0-0, policy_version 297054 (0.00083) [2022-07-09 14:54:10,716][25689] Fps is (10 sec: 5342.7, 60 sec: 5632.7, 300 sec: 5619.9). Total num frames: 304191488. Throughput: 0: 4937.9. Samples: 304186114. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:10,716][25689] Avg episode reward: [(0, '-49.739')] [2022-07-09 14:54:10,921][26022] Updated weights on worker 0-0, policy_version 297064 (0.00091) [2022-07-09 14:54:12,676][26022] Updated weights on worker 0-0, policy_version 297074 (0.00090) [2022-07-09 14:54:14,655][26022] Updated weights on worker 0-0, policy_version 297084 (0.00088) [2022-07-09 14:54:15,756][25689] Fps is (10 sec: 5630.1, 60 sec: 5604.6, 300 sec: 5619.5). Total num frames: 304220160. Throughput: 0: 5806.5. Samples: 304220212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:15,757][25689] Avg episode reward: [(0, '-48.727')] [2022-07-09 14:54:16,259][26022] Updated weights on worker 0-0, policy_version 297094 (0.00093) [2022-07-09 14:54:18,130][26022] Updated weights on worker 0-0, policy_version 297104 (0.00092) [2022-07-09 14:54:19,840][26022] Updated weights on worker 0-0, policy_version 297114 (0.00082) [2022-07-09 14:54:20,767][25689] Fps is (10 sec: 5705.4, 60 sec: 5623.7, 300 sec: 5622.9). Total num frames: 304248832. Throughput: 0: 5813.6. Samples: 304254620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:20,767][25689] Avg episode reward: [(0, '-48.014')] [2022-07-09 14:54:21,656][26022] Updated weights on worker 0-0, policy_version 297124 (0.00093) [2022-07-09 14:54:23,614][26022] Updated weights on worker 0-0, policy_version 297134 (0.00088) [2022-07-09 14:54:25,388][26022] Updated weights on worker 0-0, policy_version 297144 (0.00085) [2022-07-09 14:54:25,790][25689] Fps is (10 sec: 5715.4, 60 sec: 5609.8, 300 sec: 5623.5). Total num frames: 304277504. Throughput: 0: 5059.9. Samples: 304271480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:25,790][25689] Avg episode reward: [(0, '-47.562')] [2022-07-09 14:54:27,167][26022] Updated weights on worker 0-0, policy_version 297154 (0.00087) [2022-07-09 14:54:29,087][26022] Updated weights on worker 0-0, policy_version 297164 (0.00088) [2022-07-09 14:54:30,684][26022] Updated weights on worker 0-0, policy_version 297174 (0.00085) [2022-07-09 14:54:30,883][25689] Fps is (10 sec: 5769.9, 60 sec: 5625.6, 300 sec: 5626.3). Total num frames: 304307200. Throughput: 0: 5917.6. Samples: 304305452. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:30,883][25689] Avg episode reward: [(0, '-47.873')] [2022-07-09 14:54:32,776][26022] Updated weights on worker 0-0, policy_version 297184 (0.00092) [2022-07-09 14:54:34,395][26022] Updated weights on worker 0-0, policy_version 297194 (0.00087) [2022-07-09 14:54:35,885][25689] Fps is (10 sec: 5680.6, 60 sec: 5642.8, 300 sec: 5622.9). Total num frames: 304334848. Throughput: 0: 5937.8. Samples: 304339728. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:35,886][25689] Avg episode reward: [(0, '-48.315')] [2022-07-09 14:54:36,252][26022] Updated weights on worker 0-0, policy_version 297204 (0.00096) [2022-07-09 14:54:38,063][26022] Updated weights on worker 0-0, policy_version 297214 (0.00086) [2022-07-09 14:54:39,848][26022] Updated weights on worker 0-0, policy_version 297224 (0.00079) [2022-07-09 14:54:40,947][25689] Fps is (10 sec: 5494.1, 60 sec: 5620.9, 300 sec: 5622.5). Total num frames: 304362496. Throughput: 0: 5055.3. Samples: 304356638. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:40,949][25689] Avg episode reward: [(0, '-47.441')] [2022-07-09 14:54:41,532][26022] Updated weights on worker 0-0, policy_version 297234 (0.00094) [2022-07-09 14:54:43,352][26022] Updated weights on worker 0-0, policy_version 297244 (0.00091) [2022-07-09 14:54:45,204][26022] Updated weights on worker 0-0, policy_version 297254 (0.00084) [2022-07-09 14:54:45,950][25689] Fps is (10 sec: 5697.3, 60 sec: 5639.9, 300 sec: 5625.2). Total num frames: 304392192. Throughput: 0: 5932.8. Samples: 304391084. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:45,951][25689] Avg episode reward: [(0, '-48.054')] [2022-07-09 14:54:46,993][26022] Updated weights on worker 0-0, policy_version 297264 (0.00092) [2022-07-09 14:54:48,752][26022] Updated weights on worker 0-0, policy_version 297274 (0.00691) [2022-07-09 14:54:50,380][26022] Updated weights on worker 0-0, policy_version 297284 (0.00078) [2022-07-09 14:54:51,025][25689] Fps is (10 sec: 5893.6, 60 sec: 5602.8, 300 sec: 5627.4). Total num frames: 304421888. Throughput: 0: 5945.6. Samples: 304425208. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:51,025][25689] Avg episode reward: [(0, '-48.747')] [2022-07-09 14:54:52,276][26022] Updated weights on worker 0-0, policy_version 297294 (0.00085) [2022-07-09 14:54:54,086][26022] Updated weights on worker 0-0, policy_version 297304 (0.00089) [2022-07-09 14:54:56,022][26022] Updated weights on worker 0-0, policy_version 297314 (0.00091) [2022-07-09 14:54:56,043][25689] Fps is (10 sec: 5681.6, 60 sec: 5688.3, 300 sec: 5623.8). Total num frames: 304449536. Throughput: 0: 5074.0. Samples: 304442010. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:54:56,043][25689] Avg episode reward: [(0, '-47.466')] [2022-07-09 14:54:57,815][26022] Updated weights on worker 0-0, policy_version 297324 (0.00090) [2022-07-09 14:54:59,786][26022] Updated weights on worker 0-0, policy_version 297334 (0.00080) [2022-07-09 14:55:01,054][25689] Fps is (10 sec: 5615.7, 60 sec: 5636.6, 300 sec: 5634.1). Total num frames: 304478208. Throughput: 0: 5935.3. Samples: 304475976. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:01,054][25689] Avg episode reward: [(0, '-47.350')] [2022-07-09 14:55:01,484][26022] Updated weights on worker 0-0, policy_version 297344 (0.00096) [2022-07-09 14:55:03,619][26022] Updated weights on worker 0-0, policy_version 297354 (0.00088) [2022-07-09 14:55:05,350][26022] Updated weights on worker 0-0, policy_version 297364 (0.00099) [2022-07-09 14:55:06,068][25689] Fps is (10 sec: 5413.4, 60 sec: 5636.5, 300 sec: 5624.9). Total num frames: 304503808. Throughput: 0: 5814.2. Samples: 304508056. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:06,068][25689] Avg episode reward: [(0, '-47.779')] [2022-07-09 14:55:06,557][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:55:06,576][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000297369_304505856.pth [2022-07-09 14:55:06,576][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000295389_302478336.pth [2022-07-09 14:55:07,337][26022] Updated weights on worker 0-0, policy_version 297374 (0.00086) [2022-07-09 14:55:08,996][26022] Updated weights on worker 0-0, policy_version 297384 (0.00099) [2022-07-09 14:55:11,080][26022] Updated weights on worker 0-0, policy_version 297394 (0.00089) [2022-07-09 14:55:11,108][25689] Fps is (10 sec: 5295.7, 60 sec: 5629.3, 300 sec: 5621.2). Total num frames: 304531456. Throughput: 0: 4963.5. Samples: 304524894. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:11,109][25689] Avg episode reward: [(0, '-47.417')] [2022-07-09 14:55:12,653][26022] Updated weights on worker 0-0, policy_version 297404 (0.00081) [2022-07-09 14:55:14,612][26022] Updated weights on worker 0-0, policy_version 297414 (0.00085) [2022-07-09 14:55:16,098][26022] Updated weights on worker 0-0, policy_version 297424 (0.00088) [2022-07-09 14:55:16,113][25689] Fps is (10 sec: 5810.6, 60 sec: 5666.6, 300 sec: 5629.3). Total num frames: 304562176. Throughput: 0: 5835.6. Samples: 304559134. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:16,114][25689] Avg episode reward: [(0, '-46.940')] [2022-07-09 14:55:18,236][26022] Updated weights on worker 0-0, policy_version 297434 (0.00089) [2022-07-09 14:55:19,708][26022] Updated weights on worker 0-0, policy_version 297444 (0.00087) [2022-07-09 14:55:21,127][25689] Fps is (10 sec: 5621.7, 60 sec: 5615.4, 300 sec: 5619.3). Total num frames: 304587776. Throughput: 0: 5848.0. Samples: 304593364. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:21,127][25689] Avg episode reward: [(0, '-46.271')] [2022-07-09 14:55:21,778][26022] Updated weights on worker 0-0, policy_version 297454 (0.00087) [2022-07-09 14:55:23,425][26022] Updated weights on worker 0-0, policy_version 297464 (0.00093) [2022-07-09 14:55:25,160][26022] Updated weights on worker 0-0, policy_version 297474 (0.00088) [2022-07-09 14:55:26,136][25689] Fps is (10 sec: 5516.8, 60 sec: 5633.6, 300 sec: 5627.1). Total num frames: 304617472. Throughput: 0: 5942.9. Samples: 304627320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:26,137][25689] Avg episode reward: [(0, '-47.171')] [2022-07-09 14:55:27,029][26022] Updated weights on worker 0-0, policy_version 297484 (0.00094) [2022-07-09 14:55:28,879][26022] Updated weights on worker 0-0, policy_version 297494 (0.00087) [2022-07-09 14:55:30,766][26022] Updated weights on worker 0-0, policy_version 297504 (0.00108) [2022-07-09 14:55:31,268][25689] Fps is (10 sec: 5755.5, 60 sec: 5613.0, 300 sec: 5624.9). Total num frames: 304646144. Throughput: 0: 5923.9. Samples: 304644318. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:31,269][25689] Avg episode reward: [(0, '-46.236')] [2022-07-09 14:55:32,589][26022] Updated weights on worker 0-0, policy_version 297514 (0.00084) [2022-07-09 14:55:34,064][26022] Updated weights on worker 0-0, policy_version 297524 (0.00092) [2022-07-09 14:55:36,098][26022] Updated weights on worker 0-0, policy_version 297534 (0.00087) [2022-07-09 14:55:36,292][25689] Fps is (10 sec: 5646.6, 60 sec: 5627.9, 300 sec: 5628.4). Total num frames: 304674816. Throughput: 0: 5915.3. Samples: 304678498. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:36,292][25689] Avg episode reward: [(0, '-46.029')] [2022-07-09 14:55:37,799][26022] Updated weights on worker 0-0, policy_version 297544 (0.00088) [2022-07-09 14:55:39,774][26022] Updated weights on worker 0-0, policy_version 297554 (0.00086) [2022-07-09 14:55:41,370][25689] Fps is (10 sec: 5676.6, 60 sec: 5643.4, 300 sec: 5630.6). Total num frames: 304703488. Throughput: 0: 5871.7. Samples: 304712226. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:41,370][25689] Avg episode reward: [(0, '-46.168')] [2022-07-09 14:55:41,694][26022] Updated weights on worker 0-0, policy_version 297564 (0.00085) [2022-07-09 14:55:43,265][26022] Updated weights on worker 0-0, policy_version 297574 (0.00088) [2022-07-09 14:55:45,316][26022] Updated weights on worker 0-0, policy_version 297584 (0.00089) [2022-07-09 14:55:46,388][25689] Fps is (10 sec: 5882.7, 60 sec: 5658.9, 300 sec: 5634.7). Total num frames: 304734208. Throughput: 0: 5042.1. Samples: 304729430. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:46,389][25689] Avg episode reward: [(0, '-46.298')] [2022-07-09 14:55:46,773][26022] Updated weights on worker 0-0, policy_version 297594 (0.00086) [2022-07-09 14:55:48,886][26022] Updated weights on worker 0-0, policy_version 297604 (0.00085) [2022-07-09 14:55:50,642][26022] Updated weights on worker 0-0, policy_version 297614 (0.00092) [2022-07-09 14:55:51,545][25689] Fps is (10 sec: 5635.9, 60 sec: 5600.5, 300 sec: 5625.1). Total num frames: 304760832. Throughput: 0: 5880.0. Samples: 304763544. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:51,545][25689] Avg episode reward: [(0, '-45.640')] [2022-07-09 14:55:52,256][26022] Updated weights on worker 0-0, policy_version 297624 (0.00102) [2022-07-09 14:55:54,259][26022] Updated weights on worker 0-0, policy_version 297634 (0.00092) [2022-07-09 14:55:56,068][26022] Updated weights on worker 0-0, policy_version 297644 (0.00086) [2022-07-09 14:55:56,559][25689] Fps is (10 sec: 5537.4, 60 sec: 5634.7, 300 sec: 5625.3). Total num frames: 304790528. Throughput: 0: 5870.3. Samples: 304797470. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:55:56,560][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 14:55:57,896][26022] Updated weights on worker 0-0, policy_version 297654 (0.00084) [2022-07-09 14:55:59,652][26022] Updated weights on worker 0-0, policy_version 297664 (0.00096) [2022-07-09 14:56:01,519][26022] Updated weights on worker 0-0, policy_version 297674 (0.00096) [2022-07-09 14:56:01,619][25689] Fps is (10 sec: 5692.4, 60 sec: 5613.3, 300 sec: 5632.9). Total num frames: 304818176. Throughput: 0: 5065.7. Samples: 304814800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:56:01,620][25689] Avg episode reward: [(0, '-45.861')] [2022-07-09 14:56:03,739][26022] Updated weights on worker 0-0, policy_version 297684 (0.00094) [2022-07-09 14:56:05,470][26022] Updated weights on worker 0-0, policy_version 297694 (0.00087) [2022-07-09 14:56:06,669][25689] Fps is (10 sec: 5367.8, 60 sec: 5626.8, 300 sec: 5629.8). Total num frames: 304844800. Throughput: 0: 5785.2. Samples: 304846762. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 14:56:06,671][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 14:56:07,259][26022] Updated weights on worker 0-0, policy_version 297704 (0.00407) [2022-07-09 14:56:08,967][26022] Updated weights on worker 0-0, policy_version 297714 (0.00085) [2022-07-09 14:56:11,134][26022] Updated weights on worker 0-0, policy_version 297724 (0.00087) [2022-07-09 14:56:11,723][25689] Fps is (10 sec: 5573.6, 60 sec: 5659.3, 300 sec: 5629.2). Total num frames: 304874496. Throughput: 0: 5803.6. Samples: 304880652. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:11,724][25689] Avg episode reward: [(0, '-46.757')] [2022-07-09 14:56:12,638][26022] Updated weights on worker 0-0, policy_version 297734 (0.00095) [2022-07-09 14:56:14,621][26022] Updated weights on worker 0-0, policy_version 297744 (0.00091) [2022-07-09 14:56:16,030][26022] Updated weights on worker 0-0, policy_version 297754 (0.00095) [2022-07-09 14:56:16,731][25689] Fps is (10 sec: 5699.4, 60 sec: 5608.4, 300 sec: 5629.2). Total num frames: 304902144. Throughput: 0: 4973.3. Samples: 304897792. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:16,731][25689] Avg episode reward: [(0, '-47.418')] [2022-07-09 14:56:18,119][26022] Updated weights on worker 0-0, policy_version 297764 (0.00086) [2022-07-09 14:56:19,972][26022] Updated weights on worker 0-0, policy_version 297774 (0.00089) [2022-07-09 14:56:21,706][26022] Updated weights on worker 0-0, policy_version 297784 (0.00092) [2022-07-09 14:56:21,742][25689] Fps is (10 sec: 5621.4, 60 sec: 5659.3, 300 sec: 5629.0). Total num frames: 304930816. Throughput: 0: 5811.7. Samples: 304931754. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:21,743][25689] Avg episode reward: [(0, '-47.158')] [2022-07-09 14:56:23,551][26022] Updated weights on worker 0-0, policy_version 297794 (0.00090) [2022-07-09 14:56:25,231][26022] Updated weights on worker 0-0, policy_version 297804 (0.00097) [2022-07-09 14:56:26,745][25689] Fps is (10 sec: 5623.7, 60 sec: 5626.1, 300 sec: 5626.7). Total num frames: 304958464. Throughput: 0: 5926.5. Samples: 304965744. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:26,746][25689] Avg episode reward: [(0, '-47.700')] [2022-07-09 14:56:27,119][26022] Updated weights on worker 0-0, policy_version 297814 (0.00083) [2022-07-09 14:56:29,141][26022] Updated weights on worker 0-0, policy_version 297824 (0.00080) [2022-07-09 14:56:30,659][26022] Updated weights on worker 0-0, policy_version 297834 (0.00086) [2022-07-09 14:56:31,860][25689] Fps is (10 sec: 5566.4, 60 sec: 5627.7, 300 sec: 5632.3). Total num frames: 304987136. Throughput: 0: 5063.6. Samples: 304982616. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:31,861][25689] Avg episode reward: [(0, '-47.279')] [2022-07-09 14:56:32,576][26022] Updated weights on worker 0-0, policy_version 297844 (0.00085) [2022-07-09 14:56:34,488][26022] Updated weights on worker 0-0, policy_version 297854 (0.00085) [2022-07-09 14:56:36,299][26022] Updated weights on worker 0-0, policy_version 297864 (0.00096) [2022-07-09 14:56:36,892][25689] Fps is (10 sec: 5752.4, 60 sec: 5643.8, 300 sec: 5628.8). Total num frames: 305016832. Throughput: 0: 5886.6. Samples: 305016474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:36,893][25689] Avg episode reward: [(0, '-47.899')] [2022-07-09 14:56:38,035][26022] Updated weights on worker 0-0, policy_version 297874 (0.00093) [2022-07-09 14:56:39,930][26022] Updated weights on worker 0-0, policy_version 297884 (0.00100) [2022-07-09 14:56:41,651][26022] Updated weights on worker 0-0, policy_version 297894 (0.00095) [2022-07-09 14:56:41,949][25689] Fps is (10 sec: 5785.6, 60 sec: 5645.8, 300 sec: 5635.5). Total num frames: 305045504. Throughput: 0: 5869.2. Samples: 305050348. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:41,949][25689] Avg episode reward: [(0, '-47.875')] [2022-07-09 14:56:43,667][26022] Updated weights on worker 0-0, policy_version 297904 (0.00087) [2022-07-09 14:56:45,134][26022] Updated weights on worker 0-0, policy_version 297914 (0.00085) [2022-07-09 14:56:46,994][25689] Fps is (10 sec: 5474.1, 60 sec: 5575.7, 300 sec: 5632.3). Total num frames: 305072128. Throughput: 0: 5010.3. Samples: 305067194. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:46,995][25689] Avg episode reward: [(0, '-48.102')] [2022-07-09 14:56:47,217][26022] Updated weights on worker 0-0, policy_version 297924 (0.00092) [2022-07-09 14:56:49,007][26022] Updated weights on worker 0-0, policy_version 297934 (0.00088) [2022-07-09 14:56:50,919][26022] Updated weights on worker 0-0, policy_version 297944 (0.00093) [2022-07-09 14:56:52,104][25689] Fps is (10 sec: 5646.8, 60 sec: 5647.7, 300 sec: 5630.7). Total num frames: 305102848. Throughput: 0: 5844.4. Samples: 305100928. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:52,106][25689] Avg episode reward: [(0, '-47.655')] [2022-07-09 14:56:52,566][26022] Updated weights on worker 0-0, policy_version 297954 (0.00080) [2022-07-09 14:56:54,437][26022] Updated weights on worker 0-0, policy_version 297964 (0.00091) [2022-07-09 14:56:56,284][26022] Updated weights on worker 0-0, policy_version 297974 (0.00088) [2022-07-09 14:56:57,158][25689] Fps is (10 sec: 5641.7, 60 sec: 5593.2, 300 sec: 5626.4). Total num frames: 305129472. Throughput: 0: 5838.9. Samples: 305134804. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:56:57,159][25689] Avg episode reward: [(0, '-47.573')] [2022-07-09 14:56:58,001][26022] Updated weights on worker 0-0, policy_version 297984 (0.00086) [2022-07-09 14:56:59,746][26022] Updated weights on worker 0-0, policy_version 297994 (0.00085) [2022-07-09 14:57:01,795][26022] Updated weights on worker 0-0, policy_version 298004 (0.00090) [2022-07-09 14:57:02,187][25689] Fps is (10 sec: 5382.6, 60 sec: 5596.1, 300 sec: 5632.7). Total num frames: 305157120. Throughput: 0: 5021.0. Samples: 305151964. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:02,189][25689] Avg episode reward: [(0, '-48.122')] [2022-07-09 14:57:03,823][26022] Updated weights on worker 0-0, policy_version 298014 (0.00086) [2022-07-09 14:57:05,720][26022] Updated weights on worker 0-0, policy_version 298024 (0.00084) [2022-07-09 14:57:06,797][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:57:06,814][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000298030_305182720.pth [2022-07-09 14:57:06,814][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000296048_303153152.pth [2022-07-09 14:57:07,200][25689] Fps is (10 sec: 5506.6, 60 sec: 5616.5, 300 sec: 5628.0). Total num frames: 305184768. Throughput: 0: 5776.3. Samples: 305183910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:07,201][25689] Avg episode reward: [(0, '-47.301')] [2022-07-09 14:57:07,496][26022] Updated weights on worker 0-0, policy_version 298034 (0.00091) [2022-07-09 14:57:09,330][26022] Updated weights on worker 0-0, policy_version 298044 (0.00088) [2022-07-09 14:57:11,259][26022] Updated weights on worker 0-0, policy_version 298054 (0.00086) [2022-07-09 14:57:12,283][25689] Fps is (10 sec: 5578.4, 60 sec: 5596.9, 300 sec: 5621.2). Total num frames: 305213440. Throughput: 0: 5797.2. Samples: 305217910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:12,284][25689] Avg episode reward: [(0, '-46.596')] [2022-07-09 14:57:12,918][26022] Updated weights on worker 0-0, policy_version 298064 (0.00090) [2022-07-09 14:57:14,757][26022] Updated weights on worker 0-0, policy_version 298074 (0.00084) [2022-07-09 14:57:16,587][26022] Updated weights on worker 0-0, policy_version 298084 (0.00092) [2022-07-09 14:57:17,286][25689] Fps is (10 sec: 5685.6, 60 sec: 5614.2, 300 sec: 5632.1). Total num frames: 305242112. Throughput: 0: 4968.2. Samples: 305234802. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:17,286][25689] Avg episode reward: [(0, '-47.577')] [2022-07-09 14:57:18,355][26022] Updated weights on worker 0-0, policy_version 298094 (0.00090) [2022-07-09 14:57:20,307][26022] Updated weights on worker 0-0, policy_version 298104 (0.00085) [2022-07-09 14:57:22,075][26022] Updated weights on worker 0-0, policy_version 298114 (0.00085) [2022-07-09 14:57:22,288][25689] Fps is (10 sec: 5731.6, 60 sec: 5615.1, 300 sec: 5626.5). Total num frames: 305270784. Throughput: 0: 5807.9. Samples: 305268708. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:22,290][25689] Avg episode reward: [(0, '-47.270')] [2022-07-09 14:57:23,796][26022] Updated weights on worker 0-0, policy_version 298124 (0.00091) [2022-07-09 14:57:25,537][26022] Updated weights on worker 0-0, policy_version 298134 (0.00094) [2022-07-09 14:57:27,303][25689] Fps is (10 sec: 5622.4, 60 sec: 5614.0, 300 sec: 5628.4). Total num frames: 305298432. Throughput: 0: 5895.4. Samples: 305302426. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:27,304][25689] Avg episode reward: [(0, '-46.886')] [2022-07-09 14:57:27,589][26022] Updated weights on worker 0-0, policy_version 298144 (0.00091) [2022-07-09 14:57:29,109][26022] Updated weights on worker 0-0, policy_version 298154 (0.00083) [2022-07-09 14:57:31,288][26022] Updated weights on worker 0-0, policy_version 298164 (0.00090) [2022-07-09 14:57:32,446][25689] Fps is (10 sec: 5544.2, 60 sec: 5611.3, 300 sec: 5622.3). Total num frames: 305327104. Throughput: 0: 5861.3. Samples: 305336094. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:32,447][25689] Avg episode reward: [(0, '-45.480')] [2022-07-09 14:57:32,729][26022] Updated weights on worker 0-0, policy_version 298174 (0.00087) [2022-07-09 14:57:34,895][26022] Updated weights on worker 0-0, policy_version 298184 (0.00082) [2022-07-09 14:57:36,355][26022] Updated weights on worker 0-0, policy_version 298194 (0.00083) [2022-07-09 14:57:37,507][25689] Fps is (10 sec: 5519.2, 60 sec: 5574.9, 300 sec: 5617.8). Total num frames: 305354752. Throughput: 0: 5851.4. Samples: 305353126. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:37,508][25689] Avg episode reward: [(0, '-45.182')] [2022-07-09 14:57:38,396][26022] Updated weights on worker 0-0, policy_version 298204 (0.00092) [2022-07-09 14:57:40,188][26022] Updated weights on worker 0-0, policy_version 298214 (0.00086) [2022-07-09 14:57:41,827][26022] Updated weights on worker 0-0, policy_version 298224 (0.00089) [2022-07-09 14:57:42,515][25689] Fps is (10 sec: 5593.3, 60 sec: 5579.3, 300 sec: 5618.6). Total num frames: 305383424. Throughput: 0: 5868.0. Samples: 305387404. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:42,516][25689] Avg episode reward: [(0, '-46.135')] [2022-07-09 14:57:43,739][26022] Updated weights on worker 0-0, policy_version 298234 (0.00082) [2022-07-09 14:57:45,460][26022] Updated weights on worker 0-0, policy_version 298244 (0.00090) [2022-07-09 14:57:47,547][25689] Fps is (10 sec: 5711.6, 60 sec: 5614.4, 300 sec: 5619.2). Total num frames: 305412096. Throughput: 0: 5879.5. Samples: 305421452. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:47,547][25689] Avg episode reward: [(0, '-46.465')] [2022-07-09 14:57:47,548][26022] Updated weights on worker 0-0, policy_version 298254 (0.00086) [2022-07-09 14:57:49,334][26022] Updated weights on worker 0-0, policy_version 298264 (0.00076) [2022-07-09 14:57:50,858][26022] Updated weights on worker 0-0, policy_version 298274 (0.00050) [2022-07-09 14:57:52,625][25689] Fps is (10 sec: 5874.8, 60 sec: 5617.3, 300 sec: 5625.4). Total num frames: 305442816. Throughput: 0: 5085.4. Samples: 305438712. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:52,625][25689] Avg episode reward: [(0, '-47.915')] [2022-07-09 14:57:52,635][26022] Updated weights on worker 0-0, policy_version 298284 (0.00086) [2022-07-09 14:57:54,588][26022] Updated weights on worker 0-0, policy_version 298294 (0.00090) [2022-07-09 14:57:56,431][26022] Updated weights on worker 0-0, policy_version 298304 (0.00087) [2022-07-09 14:57:57,713][25689] Fps is (10 sec: 5741.3, 60 sec: 5631.1, 300 sec: 5620.9). Total num frames: 305470464. Throughput: 0: 5919.3. Samples: 305472732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:57:57,714][25689] Avg episode reward: [(0, '-47.992')] [2022-07-09 14:57:58,179][26022] Updated weights on worker 0-0, policy_version 298314 (0.00087) [2022-07-09 14:57:59,846][26022] Updated weights on worker 0-0, policy_version 298324 (0.00089) [2022-07-09 14:58:02,024][26022] Updated weights on worker 0-0, policy_version 298334 (0.00087) [2022-07-09 14:58:02,729][25689] Fps is (10 sec: 5371.5, 60 sec: 5615.4, 300 sec: 5621.0). Total num frames: 305497088. Throughput: 0: 5826.4. Samples: 305505176. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:02,729][25689] Avg episode reward: [(0, '-48.686')] [2022-07-09 14:58:03,942][26022] Updated weights on worker 0-0, policy_version 298344 (0.00092) [2022-07-09 14:58:05,734][26022] Updated weights on worker 0-0, policy_version 298354 (0.00088) [2022-07-09 14:58:07,430][26022] Updated weights on worker 0-0, policy_version 298364 (0.00089) [2022-07-09 14:58:07,772][25689] Fps is (10 sec: 5395.3, 60 sec: 5612.6, 300 sec: 5625.9). Total num frames: 305524736. Throughput: 0: 4981.9. Samples: 305522214. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:07,773][25689] Avg episode reward: [(0, '-49.430')] [2022-07-09 14:58:09,258][26022] Updated weights on worker 0-0, policy_version 298374 (0.00093) [2022-07-09 14:58:11,392][26022] Updated weights on worker 0-0, policy_version 298384 (0.00088) [2022-07-09 14:58:12,839][25689] Fps is (10 sec: 5772.8, 60 sec: 5647.9, 300 sec: 5626.6). Total num frames: 305555456. Throughput: 0: 5819.9. Samples: 305556358. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:12,840][25689] Avg episode reward: [(0, '-48.116')] [2022-07-09 14:58:12,847][26022] Updated weights on worker 0-0, policy_version 298394 (0.00092) [2022-07-09 14:58:14,932][26022] Updated weights on worker 0-0, policy_version 298404 (0.00084) [2022-07-09 14:58:16,430][26022] Updated weights on worker 0-0, policy_version 298414 (0.00094) [2022-07-09 14:58:17,907][25689] Fps is (10 sec: 5658.2, 60 sec: 5608.0, 300 sec: 5622.5). Total num frames: 305582080. Throughput: 0: 5828.7. Samples: 305590436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:17,908][25689] Avg episode reward: [(0, '-46.528')] [2022-07-09 14:58:18,466][26022] Updated weights on worker 0-0, policy_version 298424 (0.00084) [2022-07-09 14:58:19,977][26022] Updated weights on worker 0-0, policy_version 298434 (0.00093) [2022-07-09 14:58:22,015][26022] Updated weights on worker 0-0, policy_version 298444 (0.00088) [2022-07-09 14:58:22,969][25689] Fps is (10 sec: 5661.3, 60 sec: 5636.3, 300 sec: 5625.8). Total num frames: 305612800. Throughput: 0: 5052.6. Samples: 305607440. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:22,969][25689] Avg episode reward: [(0, '-45.258')] [2022-07-09 14:58:23,784][26022] Updated weights on worker 0-0, policy_version 298454 (0.00089) [2022-07-09 14:58:25,523][26022] Updated weights on worker 0-0, policy_version 298464 (0.00094) [2022-07-09 14:58:27,563][26022] Updated weights on worker 0-0, policy_version 298474 (0.00086) [2022-07-09 14:58:27,989][25689] Fps is (10 sec: 5687.6, 60 sec: 5618.9, 300 sec: 5620.1). Total num frames: 305639424. Throughput: 0: 5871.3. Samples: 305640914. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:27,990][25689] Avg episode reward: [(0, '-45.686')] [2022-07-09 14:58:29,263][26022] Updated weights on worker 0-0, policy_version 298484 (0.00093) [2022-07-09 14:58:31,070][26022] Updated weights on worker 0-0, policy_version 298494 (0.00085) [2022-07-09 14:58:32,856][26022] Updated weights on worker 0-0, policy_version 298504 (0.00091) [2022-07-09 14:58:33,053][25689] Fps is (10 sec: 5585.0, 60 sec: 5643.2, 300 sec: 5629.3). Total num frames: 305669120. Throughput: 0: 5875.9. Samples: 305675130. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-09 14:58:33,053][25689] Avg episode reward: [(0, '-45.793')] [2022-07-09 14:58:34,583][26022] Updated weights on worker 0-0, policy_version 298514 (0.00090) [2022-07-09 14:58:36,373][26022] Updated weights on worker 0-0, policy_version 298524 (0.00088) [2022-07-09 14:58:38,063][25689] Fps is (10 sec: 5794.3, 60 sec: 5664.9, 300 sec: 5629.3). Total num frames: 305697792. Throughput: 0: 5058.0. Samples: 305692384. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:58:38,063][25689] Avg episode reward: [(0, '-45.858')] [2022-07-09 14:58:38,161][26022] Updated weights on worker 0-0, policy_version 298534 (0.00082) [2022-07-09 14:58:39,881][26022] Updated weights on worker 0-0, policy_version 298544 (0.00092) [2022-07-09 14:58:41,799][26022] Updated weights on worker 0-0, policy_version 298554 (0.00081) [2022-07-09 14:58:43,084][25689] Fps is (10 sec: 5716.3, 60 sec: 5663.6, 300 sec: 5629.3). Total num frames: 305726464. Throughput: 0: 5933.7. Samples: 305726802. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:58:43,085][25689] Avg episode reward: [(0, '-45.602')] [2022-07-09 14:58:43,761][26022] Updated weights on worker 0-0, policy_version 298564 (0.00082) [2022-07-09 14:58:45,278][26022] Updated weights on worker 0-0, policy_version 298574 (0.00085) [2022-07-09 14:58:47,500][26022] Updated weights on worker 0-0, policy_version 298584 (0.00088) [2022-07-09 14:58:48,107][25689] Fps is (10 sec: 5709.0, 60 sec: 5664.4, 300 sec: 5619.3). Total num frames: 305755136. Throughput: 0: 5953.9. Samples: 305760694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:58:48,108][25689] Avg episode reward: [(0, '-45.864')] [2022-07-09 14:58:49,147][26022] Updated weights on worker 0-0, policy_version 298594 (0.00085) [2022-07-09 14:58:50,939][26022] Updated weights on worker 0-0, policy_version 298604 (0.00100) [2022-07-09 14:58:52,767][26022] Updated weights on worker 0-0, policy_version 298614 (0.00086) [2022-07-09 14:58:53,187][25689] Fps is (10 sec: 5676.4, 60 sec: 5630.5, 300 sec: 5639.0). Total num frames: 305783808. Throughput: 0: 5091.2. Samples: 305777636. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:58:53,187][25689] Avg episode reward: [(0, '-46.286')] [2022-07-09 14:58:54,615][26022] Updated weights on worker 0-0, policy_version 298624 (0.00095) [2022-07-09 14:58:56,333][26022] Updated weights on worker 0-0, policy_version 298634 (0.00823) [2022-07-09 14:58:58,225][25689] Fps is (10 sec: 5465.2, 60 sec: 5618.2, 300 sec: 5621.1). Total num frames: 305810432. Throughput: 0: 5902.7. Samples: 305811396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:58:58,226][25689] Avg episode reward: [(0, '-46.480')] [2022-07-09 14:58:58,282][26022] Updated weights on worker 0-0, policy_version 298644 (0.00088) [2022-07-09 14:58:59,926][26022] Updated weights on worker 0-0, policy_version 298654 (0.00081) [2022-07-09 14:59:02,357][26022] Updated weights on worker 0-0, policy_version 298664 (0.00092) [2022-07-09 14:59:03,243][25689] Fps is (10 sec: 5396.9, 60 sec: 5634.9, 300 sec: 5627.9). Total num frames: 305838080. Throughput: 0: 5780.5. Samples: 305843326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:03,243][25689] Avg episode reward: [(0, '-46.024')] [2022-07-09 14:59:04,040][26022] Updated weights on worker 0-0, policy_version 298674 (0.00084) [2022-07-09 14:59:05,760][26022] Updated weights on worker 0-0, policy_version 298684 (0.00087) [2022-07-09 14:59:06,827][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 14:59:06,838][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000298690_305858560.pth [2022-07-09 14:59:06,838][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000296708_303828992.pth [2022-07-09 14:59:07,786][26022] Updated weights on worker 0-0, policy_version 298694 (0.00092) [2022-07-09 14:59:08,260][25689] Fps is (10 sec: 5612.2, 60 sec: 5654.3, 300 sec: 5630.3). Total num frames: 305866752. Throughput: 0: 4944.2. Samples: 305860334. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:08,261][25689] Avg episode reward: [(0, '-46.025')] [2022-07-09 14:59:09,339][26022] Updated weights on worker 0-0, policy_version 298704 (0.00067) [2022-07-09 14:59:11,395][26022] Updated weights on worker 0-0, policy_version 298714 (0.00093) [2022-07-09 14:59:13,057][26022] Updated weights on worker 0-0, policy_version 298724 (0.00091) [2022-07-09 14:59:13,371][25689] Fps is (10 sec: 5661.5, 60 sec: 5616.3, 300 sec: 5629.0). Total num frames: 305895424. Throughput: 0: 5770.0. Samples: 305894102. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:13,372][25689] Avg episode reward: [(0, '-46.796')] [2022-07-09 14:59:14,851][26022] Updated weights on worker 0-0, policy_version 298734 (0.00091) [2022-07-09 14:59:16,810][26022] Updated weights on worker 0-0, policy_version 298744 (0.00087) [2022-07-09 14:59:18,411][25689] Fps is (10 sec: 5548.4, 60 sec: 5635.9, 300 sec: 5625.0). Total num frames: 305923072. Throughput: 0: 5782.0. Samples: 305928110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:18,411][25689] Avg episode reward: [(0, '-45.998')] [2022-07-09 14:59:18,473][26022] Updated weights on worker 0-0, policy_version 298754 (0.00079) [2022-07-09 14:59:20,303][26022] Updated weights on worker 0-0, policy_version 298764 (0.00088) [2022-07-09 14:59:22,225][26022] Updated weights on worker 0-0, policy_version 298774 (0.00062) [2022-07-09 14:59:23,474][25689] Fps is (10 sec: 5575.0, 60 sec: 5601.9, 300 sec: 5624.2). Total num frames: 305951744. Throughput: 0: 5038.9. Samples: 305945268. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:23,475][25689] Avg episode reward: [(0, '-47.002')] [2022-07-09 14:59:23,847][26022] Updated weights on worker 0-0, policy_version 298784 (0.00088) [2022-07-09 14:59:25,797][26022] Updated weights on worker 0-0, policy_version 298794 (0.00090) [2022-07-09 14:59:27,531][26022] Updated weights on worker 0-0, policy_version 298804 (0.00087) [2022-07-09 14:59:28,483][25689] Fps is (10 sec: 5693.4, 60 sec: 5636.8, 300 sec: 5622.4). Total num frames: 305980416. Throughput: 0: 5882.9. Samples: 305979302. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:28,484][25689] Avg episode reward: [(0, '-46.346')] [2022-07-09 14:59:29,395][26022] Updated weights on worker 0-0, policy_version 298814 (0.00093) [2022-07-09 14:59:31,117][26022] Updated weights on worker 0-0, policy_version 298824 (0.00103) [2022-07-09 14:59:33,067][26022] Updated weights on worker 0-0, policy_version 298834 (0.00070) [2022-07-09 14:59:33,591][25689] Fps is (10 sec: 5769.1, 60 sec: 5632.7, 300 sec: 5627.2). Total num frames: 306010112. Throughput: 0: 5890.8. Samples: 306013212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:33,593][25689] Avg episode reward: [(0, '-47.304')] [2022-07-09 14:59:34,603][26022] Updated weights on worker 0-0, policy_version 298844 (0.00086) [2022-07-09 14:59:36,571][26022] Updated weights on worker 0-0, policy_version 298854 (0.00086) [2022-07-09 14:59:38,107][26022] Updated weights on worker 0-0, policy_version 298864 (0.00072) [2022-07-09 14:59:38,597][25689] Fps is (10 sec: 5669.8, 60 sec: 5616.1, 300 sec: 5628.3). Total num frames: 306037760. Throughput: 0: 5914.5. Samples: 306047500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:38,597][25689] Avg episode reward: [(0, '-48.113')] [2022-07-09 14:59:40,304][26022] Updated weights on worker 0-0, policy_version 298874 (0.00093) [2022-07-09 14:59:41,781][26022] Updated weights on worker 0-0, policy_version 298884 (0.00083) [2022-07-09 14:59:43,617][25689] Fps is (10 sec: 5515.2, 60 sec: 5599.4, 300 sec: 5621.1). Total num frames: 306065408. Throughput: 0: 5918.9. Samples: 306064496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:43,619][25689] Avg episode reward: [(0, '-48.443')] [2022-07-09 14:59:43,842][26022] Updated weights on worker 0-0, policy_version 298894 (0.00090) [2022-07-09 14:59:45,627][26022] Updated weights on worker 0-0, policy_version 298904 (0.00088) [2022-07-09 14:59:47,355][26022] Updated weights on worker 0-0, policy_version 298914 (0.00092) [2022-07-09 14:59:48,635][25689] Fps is (10 sec: 5712.5, 60 sec: 5616.7, 300 sec: 5622.2). Total num frames: 306095104. Throughput: 0: 5919.1. Samples: 306098584. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:48,635][25689] Avg episode reward: [(0, '-47.524')] [2022-07-09 14:59:49,111][26022] Updated weights on worker 0-0, policy_version 298924 (0.00907) [2022-07-09 14:59:50,915][26022] Updated weights on worker 0-0, policy_version 298934 (0.00083) [2022-07-09 14:59:52,740][26022] Updated weights on worker 0-0, policy_version 298944 (0.00092) [2022-07-09 14:59:53,712][25689] Fps is (10 sec: 5781.7, 60 sec: 5616.9, 300 sec: 5624.5). Total num frames: 306123776. Throughput: 0: 5945.3. Samples: 306132840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:53,713][25689] Avg episode reward: [(0, '-46.510')] [2022-07-09 14:59:54,713][26022] Updated weights on worker 0-0, policy_version 298954 (0.00091) [2022-07-09 14:59:56,269][26022] Updated weights on worker 0-0, policy_version 298964 (0.00089) [2022-07-09 14:59:58,408][26022] Updated weights on worker 0-0, policy_version 298974 (0.00086) [2022-07-09 14:59:58,732][25689] Fps is (10 sec: 5577.8, 60 sec: 5635.6, 300 sec: 5620.9). Total num frames: 306151424. Throughput: 0: 5079.6. Samples: 306149780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 14:59:58,733][25689] Avg episode reward: [(0, '-46.378')] [2022-07-09 14:59:59,933][26022] Updated weights on worker 0-0, policy_version 298984 (0.00079) [2022-07-09 15:00:02,229][26022] Updated weights on worker 0-0, policy_version 298994 (0.00097) [2022-07-09 15:00:03,751][25689] Fps is (10 sec: 5406.5, 60 sec: 5618.6, 300 sec: 5624.2). Total num frames: 306178048. Throughput: 0: 5800.8. Samples: 306181286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:03,751][25689] Avg episode reward: [(0, '-45.102')] [2022-07-09 15:00:03,983][26022] Updated weights on worker 0-0, policy_version 299004 (0.00085) [2022-07-09 15:00:06,066][26022] Updated weights on worker 0-0, policy_version 299014 (0.00307) [2022-07-09 15:00:07,711][26022] Updated weights on worker 0-0, policy_version 299024 (0.00092) [2022-07-09 15:00:08,850][25689] Fps is (10 sec: 5363.8, 60 sec: 5594.0, 300 sec: 5623.1). Total num frames: 306205696. Throughput: 0: 5757.8. Samples: 306214980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:08,851][25689] Avg episode reward: [(0, '-44.983')] [2022-07-09 15:00:09,848][26022] Updated weights on worker 0-0, policy_version 299034 (0.00096) [2022-07-09 15:00:11,417][26022] Updated weights on worker 0-0, policy_version 299044 (0.00085) [2022-07-09 15:00:13,309][26022] Updated weights on worker 0-0, policy_version 299054 (0.00085) [2022-07-09 15:00:13,925][25689] Fps is (10 sec: 5636.0, 60 sec: 5614.3, 300 sec: 5618.4). Total num frames: 306235392. Throughput: 0: 4893.8. Samples: 306231758. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:13,926][25689] Avg episode reward: [(0, '-45.521')] [2022-07-09 15:00:15,055][26022] Updated weights on worker 0-0, policy_version 299064 (0.00103) [2022-07-09 15:00:16,916][26022] Updated weights on worker 0-0, policy_version 299074 (0.00093) [2022-07-09 15:00:18,806][26022] Updated weights on worker 0-0, policy_version 299084 (0.00105) [2022-07-09 15:00:18,936][25689] Fps is (10 sec: 5685.6, 60 sec: 5617.0, 300 sec: 5625.3). Total num frames: 306263040. Throughput: 0: 5736.9. Samples: 306265688. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:18,937][25689] Avg episode reward: [(0, '-46.601')] [2022-07-09 15:00:20,519][26022] Updated weights on worker 0-0, policy_version 299094 (0.00094) [2022-07-09 15:00:22,413][26022] Updated weights on worker 0-0, policy_version 299104 (0.00087) [2022-07-09 15:00:23,940][25689] Fps is (10 sec: 5521.3, 60 sec: 5605.5, 300 sec: 5618.5). Total num frames: 306290688. Throughput: 0: 5870.3. Samples: 306299806. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:23,941][25689] Avg episode reward: [(0, '-47.004')] [2022-07-09 15:00:24,185][26022] Updated weights on worker 0-0, policy_version 299114 (0.00086) [2022-07-09 15:00:25,892][26022] Updated weights on worker 0-0, policy_version 299124 (0.00090) [2022-07-09 15:00:27,796][26022] Updated weights on worker 0-0, policy_version 299134 (0.00090) [2022-07-09 15:00:28,989][25689] Fps is (10 sec: 5704.2, 60 sec: 5618.7, 300 sec: 5623.5). Total num frames: 306320384. Throughput: 0: 5047.9. Samples: 306316638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:28,990][25689] Avg episode reward: [(0, '-46.722')] [2022-07-09 15:00:29,524][26022] Updated weights on worker 0-0, policy_version 299144 (0.00086) [2022-07-09 15:00:31,328][26022] Updated weights on worker 0-0, policy_version 299154 (0.00089) [2022-07-09 15:00:33,308][26022] Updated weights on worker 0-0, policy_version 299164 (0.00096) [2022-07-09 15:00:34,035][25689] Fps is (10 sec: 5782.1, 60 sec: 5607.6, 300 sec: 5623.1). Total num frames: 306349056. Throughput: 0: 5889.6. Samples: 306350196. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:34,036][25689] Avg episode reward: [(0, '-46.968')] [2022-07-09 15:00:34,865][26022] Updated weights on worker 0-0, policy_version 299174 (0.00083) [2022-07-09 15:00:36,961][26022] Updated weights on worker 0-0, policy_version 299184 (0.00092) [2022-07-09 15:00:38,684][26022] Updated weights on worker 0-0, policy_version 299194 (0.00087) [2022-07-09 15:00:39,040][25689] Fps is (10 sec: 5501.8, 60 sec: 5590.7, 300 sec: 5617.6). Total num frames: 306375680. Throughput: 0: 5890.0. Samples: 306384098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:39,040][25689] Avg episode reward: [(0, '-46.942')] [2022-07-09 15:00:40,388][26022] Updated weights on worker 0-0, policy_version 299204 (0.00080) [2022-07-09 15:00:42,463][26022] Updated weights on worker 0-0, policy_version 299214 (0.00098) [2022-07-09 15:00:43,980][26022] Updated weights on worker 0-0, policy_version 299224 (0.00092) [2022-07-09 15:00:44,076][25689] Fps is (10 sec: 5609.0, 60 sec: 5623.1, 300 sec: 5613.8). Total num frames: 306405376. Throughput: 0: 5028.6. Samples: 306401060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:44,077][25689] Avg episode reward: [(0, '-46.471')] [2022-07-09 15:00:45,990][26022] Updated weights on worker 0-0, policy_version 299234 (0.00091) [2022-07-09 15:00:47,626][26022] Updated weights on worker 0-0, policy_version 299244 (0.00087) [2022-07-09 15:00:49,131][25689] Fps is (10 sec: 5682.7, 60 sec: 5585.8, 300 sec: 5619.2). Total num frames: 306433024. Throughput: 0: 5870.9. Samples: 306434888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:49,132][25689] Avg episode reward: [(0, '-46.536')] [2022-07-09 15:00:49,605][26022] Updated weights on worker 0-0, policy_version 299254 (0.00090) [2022-07-09 15:00:51,419][26022] Updated weights on worker 0-0, policy_version 299264 (0.00090) [2022-07-09 15:00:53,065][26022] Updated weights on worker 0-0, policy_version 299274 (0.00097) [2022-07-09 15:00:54,230][25689] Fps is (10 sec: 5546.5, 60 sec: 5583.8, 300 sec: 5614.1). Total num frames: 306461696. Throughput: 0: 5877.9. Samples: 306468902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:54,231][25689] Avg episode reward: [(0, '-45.471')] [2022-07-09 15:00:54,946][26022] Updated weights on worker 0-0, policy_version 299284 (0.00091) [2022-07-09 15:00:56,807][26022] Updated weights on worker 0-0, policy_version 299294 (0.00088) [2022-07-09 15:00:58,702][26022] Updated weights on worker 0-0, policy_version 299304 (0.00094) [2022-07-09 15:00:59,251][25689] Fps is (10 sec: 5666.1, 60 sec: 5600.6, 300 sec: 5618.3). Total num frames: 306490368. Throughput: 0: 5033.5. Samples: 306485834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 15:00:59,253][25689] Avg episode reward: [(0, '-46.594')] [2022-07-09 15:01:00,533][26022] Updated weights on worker 0-0, policy_version 299314 (0.00088) [2022-07-09 15:01:02,725][26022] Updated weights on worker 0-0, policy_version 299324 (0.00094) [2022-07-09 15:01:04,260][25689] Fps is (10 sec: 5513.4, 60 sec: 5601.6, 300 sec: 5619.1). Total num frames: 306516992. Throughput: 0: 5759.3. Samples: 306517302. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:04,260][25689] Avg episode reward: [(0, '-46.428')] [2022-07-09 15:01:04,342][26022] Updated weights on worker 0-0, policy_version 299334 (0.00090) [2022-07-09 15:01:06,266][26022] Updated weights on worker 0-0, policy_version 299344 (0.00091) [2022-07-09 15:01:06,863][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:01:06,878][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000299346_306530304.pth [2022-07-09 15:01:06,879][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000297369_304505856.pth [2022-07-09 15:01:07,904][26022] Updated weights on worker 0-0, policy_version 299354 (0.00091) [2022-07-09 15:01:09,293][25689] Fps is (10 sec: 5302.9, 60 sec: 5590.8, 300 sec: 5609.2). Total num frames: 306543616. Throughput: 0: 5766.5. Samples: 306551152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:09,293][25689] Avg episode reward: [(0, '-46.561')] [2022-07-09 15:01:10,056][26022] Updated weights on worker 0-0, policy_version 299364 (0.00093) [2022-07-09 15:01:11,725][26022] Updated weights on worker 0-0, policy_version 299374 (0.00092) [2022-07-09 15:01:13,602][26022] Updated weights on worker 0-0, policy_version 299384 (0.00382) [2022-07-09 15:01:14,402][25689] Fps is (10 sec: 5552.7, 60 sec: 5587.6, 300 sec: 5614.1). Total num frames: 306573312. Throughput: 0: 4911.1. Samples: 306567970. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:14,403][25689] Avg episode reward: [(0, '-46.861')] [2022-07-09 15:01:15,372][26022] Updated weights on worker 0-0, policy_version 299394 (0.00059) [2022-07-09 15:01:17,388][26022] Updated weights on worker 0-0, policy_version 299404 (0.00099) [2022-07-09 15:01:19,132][26022] Updated weights on worker 0-0, policy_version 299414 (0.00088) [2022-07-09 15:01:19,477][25689] Fps is (10 sec: 5630.5, 60 sec: 5581.7, 300 sec: 5609.5). Total num frames: 306600960. Throughput: 0: 5746.0. Samples: 306602050. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:19,478][25689] Avg episode reward: [(0, '-47.145')] [2022-07-09 15:01:20,852][26022] Updated weights on worker 0-0, policy_version 299424 (0.00081) [2022-07-09 15:01:22,739][26022] Updated weights on worker 0-0, policy_version 299434 (0.00102) [2022-07-09 15:01:24,428][26022] Updated weights on worker 0-0, policy_version 299444 (0.00092) [2022-07-09 15:01:24,518][25689] Fps is (10 sec: 5668.9, 60 sec: 5612.1, 300 sec: 5615.7). Total num frames: 306630656. Throughput: 0: 5853.5. Samples: 306635884. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:24,518][25689] Avg episode reward: [(0, '-47.028')] [2022-07-09 15:01:26,440][26022] Updated weights on worker 0-0, policy_version 299454 (0.00087) [2022-07-09 15:01:28,063][26022] Updated weights on worker 0-0, policy_version 299464 (0.00094) [2022-07-09 15:01:29,545][25689] Fps is (10 sec: 5594.2, 60 sec: 5563.4, 300 sec: 5610.5). Total num frames: 306657280. Throughput: 0: 5023.0. Samples: 306652878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:29,545][25689] Avg episode reward: [(0, '-46.713')] [2022-07-09 15:01:30,006][26022] Updated weights on worker 0-0, policy_version 299474 (0.00088) [2022-07-09 15:01:31,770][26022] Updated weights on worker 0-0, policy_version 299484 (0.00092) [2022-07-09 15:01:33,730][26022] Updated weights on worker 0-0, policy_version 299494 (0.00088) [2022-07-09 15:01:34,678][25689] Fps is (10 sec: 5644.0, 60 sec: 5589.2, 300 sec: 5612.0). Total num frames: 306688000. Throughput: 0: 5830.1. Samples: 306686180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:34,679][25689] Avg episode reward: [(0, '-47.084')] [2022-07-09 15:01:35,438][26022] Updated weights on worker 0-0, policy_version 299504 (0.00095) [2022-07-09 15:01:37,402][26022] Updated weights on worker 0-0, policy_version 299514 (0.00086) [2022-07-09 15:01:39,194][26022] Updated weights on worker 0-0, policy_version 299524 (0.00086) [2022-07-09 15:01:39,737][25689] Fps is (10 sec: 5626.4, 60 sec: 5584.2, 300 sec: 5605.1). Total num frames: 306714624. Throughput: 0: 5832.7. Samples: 306720218. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:39,737][25689] Avg episode reward: [(0, '-46.304')] [2022-07-09 15:01:40,907][26022] Updated weights on worker 0-0, policy_version 299534 (0.00083) [2022-07-09 15:01:42,816][26022] Updated weights on worker 0-0, policy_version 299544 (0.00086) [2022-07-09 15:01:44,588][26022] Updated weights on worker 0-0, policy_version 299554 (0.00096) [2022-07-09 15:01:44,740][25689] Fps is (10 sec: 5495.4, 60 sec: 5570.3, 300 sec: 5612.7). Total num frames: 306743296. Throughput: 0: 5012.4. Samples: 306737248. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:44,741][25689] Avg episode reward: [(0, '-45.289')] [2022-07-09 15:01:46,613][26022] Updated weights on worker 0-0, policy_version 299564 (0.00095) [2022-07-09 15:01:48,227][26022] Updated weights on worker 0-0, policy_version 299574 (0.00083) [2022-07-09 15:01:49,776][25689] Fps is (10 sec: 5711.9, 60 sec: 5588.9, 300 sec: 5607.2). Total num frames: 306771968. Throughput: 0: 5832.2. Samples: 306770872. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:49,777][25689] Avg episode reward: [(0, '-45.372')] [2022-07-09 15:01:50,016][26022] Updated weights on worker 0-0, policy_version 299584 (0.00091) [2022-07-09 15:01:51,857][26022] Updated weights on worker 0-0, policy_version 299594 (0.00082) [2022-07-09 15:01:53,749][26022] Updated weights on worker 0-0, policy_version 299604 (0.00087) [2022-07-09 15:01:54,880][25689] Fps is (10 sec: 5756.7, 60 sec: 5605.5, 300 sec: 5616.6). Total num frames: 306801664. Throughput: 0: 5858.5. Samples: 306804530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:54,880][25689] Avg episode reward: [(0, '-45.713')] [2022-07-09 15:01:55,535][26022] Updated weights on worker 0-0, policy_version 299614 (0.00534) [2022-07-09 15:01:57,445][26022] Updated weights on worker 0-0, policy_version 299624 (0.00087) [2022-07-09 15:01:59,039][26022] Updated weights on worker 0-0, policy_version 299634 (0.00092) [2022-07-09 15:01:59,963][25689] Fps is (10 sec: 5629.2, 60 sec: 5582.8, 300 sec: 5615.6). Total num frames: 306829312. Throughput: 0: 5856.0. Samples: 306838664. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:01:59,965][25689] Avg episode reward: [(0, '-45.720')] [2022-07-09 15:02:01,103][26022] Updated weights on worker 0-0, policy_version 299644 (0.00087) [2022-07-09 15:02:03,324][26022] Updated weights on worker 0-0, policy_version 299654 (0.00358) [2022-07-09 15:02:04,922][26022] Updated weights on worker 0-0, policy_version 299664 (0.00088) [2022-07-09 15:02:05,012][25689] Fps is (10 sec: 5356.5, 60 sec: 5579.1, 300 sec: 5611.5). Total num frames: 306855936. Throughput: 0: 5731.7. Samples: 306853436. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:05,012][25689] Avg episode reward: [(0, '-45.579')] [2022-07-09 15:02:06,867][26022] Updated weights on worker 0-0, policy_version 299674 (0.00090) [2022-07-09 15:02:08,435][26022] Updated weights on worker 0-0, policy_version 299684 (0.00097) [2022-07-09 15:02:10,101][25689] Fps is (10 sec: 5353.4, 60 sec: 5590.8, 300 sec: 5607.9). Total num frames: 306883584. Throughput: 0: 5724.9. Samples: 306887230. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:10,101][25689] Avg episode reward: [(0, '-45.486')] [2022-07-09 15:02:10,503][26022] Updated weights on worker 0-0, policy_version 299694 (0.00076) [2022-07-09 15:02:11,992][26022] Updated weights on worker 0-0, policy_version 299704 (0.00103) [2022-07-09 15:02:13,942][26022] Updated weights on worker 0-0, policy_version 299714 (0.00090) [2022-07-09 15:02:15,190][25689] Fps is (10 sec: 5634.0, 60 sec: 5592.7, 300 sec: 5609.7). Total num frames: 306913280. Throughput: 0: 5754.5. Samples: 306921406. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:15,190][25689] Avg episode reward: [(0, '-46.031')] [2022-07-09 15:02:15,812][26022] Updated weights on worker 0-0, policy_version 299724 (0.00091) [2022-07-09 15:02:17,717][26022] Updated weights on worker 0-0, policy_version 299734 (0.00104) [2022-07-09 15:02:19,487][26022] Updated weights on worker 0-0, policy_version 299744 (0.00096) [2022-07-09 15:02:20,227][25689] Fps is (10 sec: 5764.3, 60 sec: 5613.1, 300 sec: 5609.1). Total num frames: 306941952. Throughput: 0: 4924.1. Samples: 306938444. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:20,227][25689] Avg episode reward: [(0, '-45.790')] [2022-07-09 15:02:21,504][26022] Updated weights on worker 0-0, policy_version 299754 (0.00096) [2022-07-09 15:02:22,987][26022] Updated weights on worker 0-0, policy_version 299764 (0.00084) [2022-07-09 15:02:25,067][26022] Updated weights on worker 0-0, policy_version 299774 (0.00085) [2022-07-09 15:02:25,236][25689] Fps is (10 sec: 5606.2, 60 sec: 5582.3, 300 sec: 5609.2). Total num frames: 306969600. Throughput: 0: 5880.0. Samples: 306972356. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:25,237][25689] Avg episode reward: [(0, '-45.063')] [2022-07-09 15:02:26,592][26022] Updated weights on worker 0-0, policy_version 299784 (0.00083) [2022-07-09 15:02:28,523][26022] Updated weights on worker 0-0, policy_version 299794 (0.00088) [2022-07-09 15:02:30,230][26022] Updated weights on worker 0-0, policy_version 299804 (0.00095) [2022-07-09 15:02:30,293][25689] Fps is (10 sec: 5696.7, 60 sec: 5630.1, 300 sec: 5614.3). Total num frames: 306999296. Throughput: 0: 5897.9. Samples: 307006320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:30,293][25689] Avg episode reward: [(0, '-45.018')] [2022-07-09 15:02:32,112][26022] Updated weights on worker 0-0, policy_version 299814 (0.00090) [2022-07-09 15:02:34,119][26022] Updated weights on worker 0-0, policy_version 299824 (0.00085) [2022-07-09 15:02:35,402][25689] Fps is (10 sec: 5640.5, 60 sec: 5581.7, 300 sec: 5613.4). Total num frames: 307026944. Throughput: 0: 5025.2. Samples: 307022978. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:35,403][25689] Avg episode reward: [(0, '-44.867')] [2022-07-09 15:02:35,707][26022] Updated weights on worker 0-0, policy_version 299834 (0.00058) [2022-07-09 15:02:37,727][26022] Updated weights on worker 0-0, policy_version 299844 (0.00090) [2022-07-09 15:02:39,337][26022] Updated weights on worker 0-0, policy_version 299854 (0.00086) [2022-07-09 15:02:40,425][25689] Fps is (10 sec: 5457.4, 60 sec: 5601.8, 300 sec: 5609.6). Total num frames: 307054592. Throughput: 0: 5854.9. Samples: 307056704. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:40,427][25689] Avg episode reward: [(0, '-44.796')] [2022-07-09 15:02:41,211][26022] Updated weights on worker 0-0, policy_version 299864 (0.00085) [2022-07-09 15:02:43,124][26022] Updated weights on worker 0-0, policy_version 299874 (0.00092) [2022-07-09 15:02:44,940][26022] Updated weights on worker 0-0, policy_version 299884 (0.00091) [2022-07-09 15:02:45,450][25689] Fps is (10 sec: 5605.1, 60 sec: 5599.9, 300 sec: 5609.8). Total num frames: 307083264. Throughput: 0: 5845.9. Samples: 307090528. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:45,452][25689] Avg episode reward: [(0, '-44.966')] [2022-07-09 15:02:46,852][26022] Updated weights on worker 0-0, policy_version 299894 (0.00085) [2022-07-09 15:02:48,634][26022] Updated weights on worker 0-0, policy_version 299904 (0.00085) [2022-07-09 15:02:50,456][25689] Fps is (10 sec: 5716.6, 60 sec: 5602.6, 300 sec: 5604.2). Total num frames: 307111936. Throughput: 0: 5010.0. Samples: 307107340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:50,458][25689] Avg episode reward: [(0, '-46.030')] [2022-07-09 15:02:50,468][26022] Updated weights on worker 0-0, policy_version 299914 (0.00096) [2022-07-09 15:02:52,207][26022] Updated weights on worker 0-0, policy_version 299924 (0.00086) [2022-07-09 15:02:53,896][26022] Updated weights on worker 0-0, policy_version 299934 (0.00087) [2022-07-09 15:02:55,545][25689] Fps is (10 sec: 5680.8, 60 sec: 5587.1, 300 sec: 5607.7). Total num frames: 307140608. Throughput: 0: 5883.9. Samples: 307141494. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:02:55,546][25689] Avg episode reward: [(0, '-45.800')] [2022-07-09 15:02:55,845][26022] Updated weights on worker 0-0, policy_version 299944 (0.00088) [2022-07-09 15:02:57,634][26022] Updated weights on worker 0-0, policy_version 299954 (0.00093) [2022-07-09 15:02:59,438][26022] Updated weights on worker 0-0, policy_version 299964 (0.00090) [2022-07-09 15:03:00,562][25689] Fps is (10 sec: 5776.1, 60 sec: 5627.1, 300 sec: 5618.0). Total num frames: 307170304. Throughput: 0: 5924.9. Samples: 307176010. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:03:00,562][25689] Avg episode reward: [(0, '-46.241')] [2022-07-09 15:03:01,185][26022] Updated weights on worker 0-0, policy_version 299974 (0.00087) [2022-07-09 15:03:03,284][26022] Updated weights on worker 0-0, policy_version 299984 (0.00088) [2022-07-09 15:03:05,154][26022] Updated weights on worker 0-0, policy_version 299994 (0.00091) [2022-07-09 15:03:05,566][25689] Fps is (10 sec: 5415.8, 60 sec: 5597.4, 300 sec: 5608.4). Total num frames: 307194880. Throughput: 0: 4986.0. Samples: 307190824. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:03:05,566][25689] Avg episode reward: [(0, '-46.464')] [2022-07-09 15:03:06,991][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:03:07,004][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000300004_307204096.pth [2022-07-09 15:03:07,015][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000298030_305182720.pth [2022-07-09 15:03:07,018][26022] Updated weights on worker 0-0, policy_version 300004 (0.00088) [2022-07-09 15:03:08,853][26022] Updated weights on worker 0-0, policy_version 300014 (0.00085) [2022-07-09 15:03:10,583][25689] Fps is (10 sec: 5313.4, 60 sec: 5621.0, 300 sec: 5602.5). Total num frames: 307223552. Throughput: 0: 5819.1. Samples: 307224458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:03:10,585][25689] Avg episode reward: [(0, '-46.597')] [2022-07-09 15:03:10,697][26022] Updated weights on worker 0-0, policy_version 300024 (0.00084) [2022-07-09 15:03:12,372][26022] Updated weights on worker 0-0, policy_version 300034 (0.00086) [2022-07-09 15:03:14,202][26022] Updated weights on worker 0-0, policy_version 300044 (0.00091) [2022-07-09 15:03:15,690][25689] Fps is (10 sec: 5765.0, 60 sec: 5619.3, 300 sec: 5612.0). Total num frames: 307253248. Throughput: 0: 5811.3. Samples: 307258566. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:03:15,691][25689] Avg episode reward: [(0, '-47.504')] [2022-07-09 15:03:16,113][26022] Updated weights on worker 0-0, policy_version 300054 (0.00086) [2022-07-09 15:03:17,880][26022] Updated weights on worker 0-0, policy_version 300064 (0.00084) [2022-07-09 15:03:19,752][26022] Updated weights on worker 0-0, policy_version 300074 (0.00087) [2022-07-09 15:03:20,693][25689] Fps is (10 sec: 5773.2, 60 sec: 5622.4, 300 sec: 5606.3). Total num frames: 307281920. Throughput: 0: 4946.0. Samples: 307275580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:03:20,694][25689] Avg episode reward: [(0, '-46.809')] [2022-07-09 15:03:21,614][26022] Updated weights on worker 0-0, policy_version 300084 (0.00083) [2022-07-09 15:03:23,262][26022] Updated weights on worker 0-0, policy_version 300094 (0.00090) [2022-07-09 15:03:25,225][26022] Updated weights on worker 0-0, policy_version 300104 (0.00086) [2022-07-09 15:03:25,714][25689] Fps is (10 sec: 5618.6, 60 sec: 5621.3, 300 sec: 5609.7). Total num frames: 307309568. Throughput: 0: 5895.4. Samples: 307309606. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 15:03:25,715][25689] Avg episode reward: [(0, '-46.444')] [2022-07-09 15:03:26,975][26022] Updated weights on worker 0-0, policy_version 300114 (0.00087) [2022-07-09 15:03:28,679][26022] Updated weights on worker 0-0, policy_version 300124 (0.00096) [2022-07-09 15:03:30,540][26022] Updated weights on worker 0-0, policy_version 300134 (0.00091) [2022-07-09 15:03:30,734][25689] Fps is (10 sec: 5609.1, 60 sec: 5607.8, 300 sec: 5607.1). Total num frames: 307338240. Throughput: 0: 5908.1. Samples: 307343512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:03:30,734][25689] Avg episode reward: [(0, '-46.851')] [2022-07-09 15:03:32,493][26022] Updated weights on worker 0-0, policy_version 300144 (0.00088) [2022-07-09 15:03:34,289][26022] Updated weights on worker 0-0, policy_version 300154 (0.00091) [2022-07-09 15:03:35,850][25689] Fps is (10 sec: 5556.7, 60 sec: 5607.3, 300 sec: 5601.6). Total num frames: 307365888. Throughput: 0: 5046.3. Samples: 307360296. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:03:35,851][25689] Avg episode reward: [(0, '-46.288')] [2022-07-09 15:03:36,185][26022] Updated weights on worker 0-0, policy_version 300164 (0.00091) [2022-07-09 15:03:37,892][26022] Updated weights on worker 0-0, policy_version 300174 (0.00091) [2022-07-09 15:03:39,757][26022] Updated weights on worker 0-0, policy_version 300184 (0.00084) [2022-07-09 15:03:40,889][25689] Fps is (10 sec: 5545.8, 60 sec: 5622.6, 300 sec: 5601.3). Total num frames: 307394560. Throughput: 0: 5870.0. Samples: 307394132. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:03:40,890][25689] Avg episode reward: [(0, '-45.778')] [2022-07-09 15:03:41,419][26022] Updated weights on worker 0-0, policy_version 300194 (0.00098) [2022-07-09 15:03:43,314][26022] Updated weights on worker 0-0, policy_version 300204 (0.00089) [2022-07-09 15:03:45,081][26022] Updated weights on worker 0-0, policy_version 300214 (0.00078) [2022-07-09 15:03:45,897][25689] Fps is (10 sec: 5707.1, 60 sec: 5624.2, 300 sec: 5601.6). Total num frames: 307423232. Throughput: 0: 5896.6. Samples: 307428620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:03:45,898][25689] Avg episode reward: [(0, '-45.470')] [2022-07-09 15:03:46,919][26022] Updated weights on worker 0-0, policy_version 300224 (0.00095) [2022-07-09 15:03:48,715][26022] Updated weights on worker 0-0, policy_version 300234 (0.00094) [2022-07-09 15:03:50,458][26022] Updated weights on worker 0-0, policy_version 300244 (0.00084) [2022-07-09 15:03:50,936][25689] Fps is (10 sec: 5707.7, 60 sec: 5621.2, 300 sec: 5602.3). Total num frames: 307451904. Throughput: 0: 5052.6. Samples: 307445586. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:03:50,937][25689] Avg episode reward: [(0, '-46.601')] [2022-07-09 15:03:52,355][26022] Updated weights on worker 0-0, policy_version 300254 (0.00065) [2022-07-09 15:03:54,252][26022] Updated weights on worker 0-0, policy_version 300264 (0.00094) [2022-07-09 15:03:55,931][26022] Updated weights on worker 0-0, policy_version 300274 (0.00084) [2022-07-09 15:03:55,984][25689] Fps is (10 sec: 5685.4, 60 sec: 5625.0, 300 sec: 5609.0). Total num frames: 307480576. Throughput: 0: 5920.4. Samples: 307479498. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:03:55,984][25689] Avg episode reward: [(0, '-47.051')] [2022-07-09 15:03:57,775][26022] Updated weights on worker 0-0, policy_version 300284 (0.00055) [2022-07-09 15:03:59,539][26022] Updated weights on worker 0-0, policy_version 300294 (0.00082) [2022-07-09 15:04:00,995][25689] Fps is (10 sec: 5701.0, 60 sec: 5608.6, 300 sec: 5612.6). Total num frames: 307509248. Throughput: 0: 5960.3. Samples: 307513966. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:00,996][25689] Avg episode reward: [(0, '-45.887')] [2022-07-09 15:04:01,335][26022] Updated weights on worker 0-0, policy_version 300304 (0.00088) [2022-07-09 15:04:03,633][26022] Updated weights on worker 0-0, policy_version 300314 (0.00092) [2022-07-09 15:04:05,308][26022] Updated weights on worker 0-0, policy_version 300324 (0.00095) [2022-07-09 15:04:06,020][25689] Fps is (10 sec: 5509.6, 60 sec: 5640.5, 300 sec: 5605.6). Total num frames: 307535872. Throughput: 0: 4986.6. Samples: 307528962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:06,021][25689] Avg episode reward: [(0, '-46.355')] [2022-07-09 15:04:07,177][26022] Updated weights on worker 0-0, policy_version 300334 (0.00083) [2022-07-09 15:04:09,009][26022] Updated weights on worker 0-0, policy_version 300344 (0.00089) [2022-07-09 15:04:10,848][26022] Updated weights on worker 0-0, policy_version 300354 (0.00091) [2022-07-09 15:04:11,024][25689] Fps is (10 sec: 5411.4, 60 sec: 5624.8, 300 sec: 5604.2). Total num frames: 307563520. Throughput: 0: 5832.7. Samples: 307562752. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:11,024][25689] Avg episode reward: [(0, '-46.219')] [2022-07-09 15:04:12,687][26022] Updated weights on worker 0-0, policy_version 300364 (0.00088) [2022-07-09 15:04:14,276][26022] Updated weights on worker 0-0, policy_version 300374 (0.01354) [2022-07-09 15:04:16,141][25689] Fps is (10 sec: 5564.5, 60 sec: 5606.9, 300 sec: 5606.1). Total num frames: 307592192. Throughput: 0: 5815.6. Samples: 307596726. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:16,142][25689] Avg episode reward: [(0, '-45.795')] [2022-07-09 15:04:16,207][26022] Updated weights on worker 0-0, policy_version 300384 (0.00082) [2022-07-09 15:04:17,892][26022] Updated weights on worker 0-0, policy_version 300394 (0.00092) [2022-07-09 15:04:19,759][26022] Updated weights on worker 0-0, policy_version 300404 (0.00051) [2022-07-09 15:04:21,184][25689] Fps is (10 sec: 5745.1, 60 sec: 5620.2, 300 sec: 5610.0). Total num frames: 307621888. Throughput: 0: 4959.6. Samples: 307614094. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:21,184][25689] Avg episode reward: [(0, '-46.310')] [2022-07-09 15:04:21,589][26022] Updated weights on worker 0-0, policy_version 300414 (0.00085) [2022-07-09 15:04:23,282][26022] Updated weights on worker 0-0, policy_version 300424 (0.00095) [2022-07-09 15:04:25,294][26022] Updated weights on worker 0-0, policy_version 300434 (0.00086) [2022-07-09 15:04:26,206][25689] Fps is (10 sec: 5799.5, 60 sec: 5637.0, 300 sec: 5609.7). Total num frames: 307650560. Throughput: 0: 5906.0. Samples: 307648178. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:26,206][25689] Avg episode reward: [(0, '-47.062')] [2022-07-09 15:04:26,779][26022] Updated weights on worker 0-0, policy_version 300444 (0.00086) [2022-07-09 15:04:28,854][26022] Updated weights on worker 0-0, policy_version 300454 (0.00097) [2022-07-09 15:04:30,649][26022] Updated weights on worker 0-0, policy_version 300464 (0.00086) [2022-07-09 15:04:31,210][25689] Fps is (10 sec: 5617.1, 60 sec: 5621.5, 300 sec: 5604.8). Total num frames: 307678208. Throughput: 0: 5898.0. Samples: 307681810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:31,211][25689] Avg episode reward: [(0, '-47.922')] [2022-07-09 15:04:32,386][26022] Updated weights on worker 0-0, policy_version 300474 (0.00083) [2022-07-09 15:04:34,229][26022] Updated weights on worker 0-0, policy_version 300484 (0.00085) [2022-07-09 15:04:36,110][26022] Updated weights on worker 0-0, policy_version 300494 (0.00097) [2022-07-09 15:04:36,268][25689] Fps is (10 sec: 5495.4, 60 sec: 5626.9, 300 sec: 5603.8). Total num frames: 307705856. Throughput: 0: 5899.3. Samples: 307715458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:36,269][25689] Avg episode reward: [(0, '-48.287')] [2022-07-09 15:04:37,919][26022] Updated weights on worker 0-0, policy_version 300504 (0.00092) [2022-07-09 15:04:39,895][26022] Updated weights on worker 0-0, policy_version 300514 (0.00082) [2022-07-09 15:04:41,282][25689] Fps is (10 sec: 5591.6, 60 sec: 5629.3, 300 sec: 5607.4). Total num frames: 307734528. Throughput: 0: 5892.4. Samples: 307732524. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:41,283][25689] Avg episode reward: [(0, '-47.794')] [2022-07-09 15:04:41,476][26022] Updated weights on worker 0-0, policy_version 300524 (0.00091) [2022-07-09 15:04:43,250][26022] Updated weights on worker 0-0, policy_version 300534 (0.00093) [2022-07-09 15:04:45,122][26022] Updated weights on worker 0-0, policy_version 300544 (0.00263) [2022-07-09 15:04:46,304][25689] Fps is (10 sec: 5713.7, 60 sec: 5628.0, 300 sec: 5603.9). Total num frames: 307763200. Throughput: 0: 5916.3. Samples: 307767086. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:46,304][25689] Avg episode reward: [(0, '-47.521')] [2022-07-09 15:04:46,903][26022] Updated weights on worker 0-0, policy_version 300554 (0.00088) [2022-07-09 15:04:48,761][26022] Updated weights on worker 0-0, policy_version 300564 (0.00080) [2022-07-09 15:04:50,564][26022] Updated weights on worker 0-0, policy_version 300574 (0.00090) [2022-07-09 15:04:51,321][25689] Fps is (10 sec: 5610.4, 60 sec: 5613.1, 300 sec: 5601.6). Total num frames: 307790848. Throughput: 0: 5929.0. Samples: 307801046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:51,322][25689] Avg episode reward: [(0, '-46.947')] [2022-07-09 15:04:52,300][26022] Updated weights on worker 0-0, policy_version 300584 (0.00087) [2022-07-09 15:04:54,172][26022] Updated weights on worker 0-0, policy_version 300594 (0.00093) [2022-07-09 15:04:55,923][26022] Updated weights on worker 0-0, policy_version 300604 (0.00094) [2022-07-09 15:04:56,384][25689] Fps is (10 sec: 5587.1, 60 sec: 5611.6, 300 sec: 5604.2). Total num frames: 307819520. Throughput: 0: 5099.1. Samples: 307818034. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:04:56,385][25689] Avg episode reward: [(0, '-46.713')] [2022-07-09 15:04:57,850][26022] Updated weights on worker 0-0, policy_version 300614 (0.00090) [2022-07-09 15:04:59,659][26022] Updated weights on worker 0-0, policy_version 300624 (0.00094) [2022-07-09 15:05:01,319][26022] Updated weights on worker 0-0, policy_version 300634 (0.00090) [2022-07-09 15:05:01,418][25689] Fps is (10 sec: 5780.7, 60 sec: 5626.5, 300 sec: 5614.2). Total num frames: 307849216. Throughput: 0: 5942.2. Samples: 307852174. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:01,418][25689] Avg episode reward: [(0, '-46.507')] [2022-07-09 15:05:03,719][26022] Updated weights on worker 0-0, policy_version 300644 (0.00084) [2022-07-09 15:05:05,319][26022] Updated weights on worker 0-0, policy_version 300654 (0.00084) [2022-07-09 15:05:06,429][25689] Fps is (10 sec: 5403.0, 60 sec: 5593.9, 300 sec: 5605.6). Total num frames: 307873792. Throughput: 0: 5800.2. Samples: 307883816. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:06,430][25689] Avg episode reward: [(0, '-46.353')] [2022-07-09 15:05:07,202][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:05:07,210][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000300664_307879936.pth [2022-07-09 15:05:07,210][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000298690_305858560.pth [2022-07-09 15:05:07,216][26022] Updated weights on worker 0-0, policy_version 300664 (0.00089) [2022-07-09 15:05:08,970][26022] Updated weights on worker 0-0, policy_version 300674 (0.00111) [2022-07-09 15:05:10,741][26022] Updated weights on worker 0-0, policy_version 300684 (0.00091) [2022-07-09 15:05:11,443][25689] Fps is (10 sec: 5413.7, 60 sec: 5626.9, 300 sec: 5606.7). Total num frames: 307903488. Throughput: 0: 4969.0. Samples: 307901032. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:11,443][25689] Avg episode reward: [(0, '-46.879')] [2022-07-09 15:05:12,608][26022] Updated weights on worker 0-0, policy_version 300694 (0.00080) [2022-07-09 15:05:14,447][26022] Updated weights on worker 0-0, policy_version 300704 (0.00088) [2022-07-09 15:05:16,221][26022] Updated weights on worker 0-0, policy_version 300714 (0.00090) [2022-07-09 15:05:16,491][25689] Fps is (10 sec: 5902.6, 60 sec: 5650.2, 300 sec: 5612.9). Total num frames: 307933184. Throughput: 0: 5821.9. Samples: 307935094. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:16,492][25689] Avg episode reward: [(0, '-47.028')] [2022-07-09 15:05:18,185][26022] Updated weights on worker 0-0, policy_version 300724 (0.00085) [2022-07-09 15:05:19,735][26022] Updated weights on worker 0-0, policy_version 300734 (0.00087) [2022-07-09 15:05:21,505][25689] Fps is (10 sec: 5597.0, 60 sec: 5602.0, 300 sec: 5609.3). Total num frames: 307959808. Throughput: 0: 5813.3. Samples: 307968948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:21,506][25689] Avg episode reward: [(0, '-47.805')] [2022-07-09 15:05:21,659][26022] Updated weights on worker 0-0, policy_version 300744 (0.00084) [2022-07-09 15:05:23,573][26022] Updated weights on worker 0-0, policy_version 300754 (0.00084) [2022-07-09 15:05:25,238][26022] Updated weights on worker 0-0, policy_version 300764 (0.00094) [2022-07-09 15:05:26,519][25689] Fps is (10 sec: 5412.4, 60 sec: 5585.8, 300 sec: 5603.0). Total num frames: 307987456. Throughput: 0: 5089.0. Samples: 307986052. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:26,519][25689] Avg episode reward: [(0, '-47.134')] [2022-07-09 15:05:27,288][26022] Updated weights on worker 0-0, policy_version 300774 (0.00090) [2022-07-09 15:05:28,868][26022] Updated weights on worker 0-0, policy_version 300784 (0.01423) [2022-07-09 15:05:30,678][26022] Updated weights on worker 0-0, policy_version 300794 (0.00090) [2022-07-09 15:05:31,537][25689] Fps is (10 sec: 5716.1, 60 sec: 5618.4, 300 sec: 5607.0). Total num frames: 308017152. Throughput: 0: 5912.5. Samples: 308019840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:31,538][25689] Avg episode reward: [(0, '-46.768')] [2022-07-09 15:05:32,767][26022] Updated weights on worker 0-0, policy_version 300804 (0.00052) [2022-07-09 15:05:34,467][26022] Updated weights on worker 0-0, policy_version 300814 (0.00090) [2022-07-09 15:05:36,271][26022] Updated weights on worker 0-0, policy_version 300824 (0.00097) [2022-07-09 15:05:36,614][25689] Fps is (10 sec: 5781.6, 60 sec: 5633.6, 300 sec: 5612.5). Total num frames: 308045824. Throughput: 0: 5888.0. Samples: 308053578. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:36,615][25689] Avg episode reward: [(0, '-47.001')] [2022-07-09 15:05:38,019][26022] Updated weights on worker 0-0, policy_version 300834 (0.00092) [2022-07-09 15:05:39,908][26022] Updated weights on worker 0-0, policy_version 300844 (0.00091) [2022-07-09 15:05:41,648][25689] Fps is (10 sec: 5570.1, 60 sec: 5614.8, 300 sec: 5605.7). Total num frames: 308073472. Throughput: 0: 5035.4. Samples: 308070376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:41,650][25689] Avg episode reward: [(0, '-46.922')] [2022-07-09 15:05:41,716][26022] Updated weights on worker 0-0, policy_version 300854 (0.00128) [2022-07-09 15:05:43,619][26022] Updated weights on worker 0-0, policy_version 300864 (0.00092) [2022-07-09 15:05:45,479][26022] Updated weights on worker 0-0, policy_version 300874 (0.00093) [2022-07-09 15:05:46,728][25689] Fps is (10 sec: 5467.1, 60 sec: 5592.4, 300 sec: 5605.2). Total num frames: 308101120. Throughput: 0: 5830.8. Samples: 308103892. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:46,729][25689] Avg episode reward: [(0, '-47.420')] [2022-07-09 15:05:47,296][26022] Updated weights on worker 0-0, policy_version 300884 (0.00094) [2022-07-09 15:05:49,159][26022] Updated weights on worker 0-0, policy_version 300894 (0.00092) [2022-07-09 15:05:50,946][26022] Updated weights on worker 0-0, policy_version 300904 (0.00086) [2022-07-09 15:05:51,786][25689] Fps is (10 sec: 5555.6, 60 sec: 5605.6, 300 sec: 5606.0). Total num frames: 308129792. Throughput: 0: 5802.8. Samples: 308137338. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 15:05:51,786][25689] Avg episode reward: [(0, '-47.224')] [2022-07-09 15:05:52,944][26022] Updated weights on worker 0-0, policy_version 300914 (0.00087) [2022-07-09 15:05:54,555][26022] Updated weights on worker 0-0, policy_version 300924 (0.00082) [2022-07-09 15:05:56,444][26022] Updated weights on worker 0-0, policy_version 300934 (0.00091) [2022-07-09 15:05:56,885][25689] Fps is (10 sec: 5645.7, 60 sec: 5602.3, 300 sec: 5604.5). Total num frames: 308158464. Throughput: 0: 4954.4. Samples: 308154010. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:05:56,886][25689] Avg episode reward: [(0, '-47.387')] [2022-07-09 15:05:58,403][26022] Updated weights on worker 0-0, policy_version 300944 (0.00089) [2022-07-09 15:06:00,064][26022] Updated weights on worker 0-0, policy_version 300954 (0.00090) [2022-07-09 15:06:01,937][25689] Fps is (10 sec: 5548.4, 60 sec: 5566.8, 300 sec: 5607.2). Total num frames: 308186112. Throughput: 0: 5789.3. Samples: 308187830. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:01,937][25689] Avg episode reward: [(0, '-46.639')] [2022-07-09 15:06:01,978][26022] Updated weights on worker 0-0, policy_version 300964 (0.00087) [2022-07-09 15:06:04,004][26022] Updated weights on worker 0-0, policy_version 300974 (0.00096) [2022-07-09 15:06:05,962][26022] Updated weights on worker 0-0, policy_version 300984 (0.00086) [2022-07-09 15:06:06,985][25689] Fps is (10 sec: 5373.5, 60 sec: 5597.2, 300 sec: 5606.9). Total num frames: 308212736. Throughput: 0: 5707.2. Samples: 308219502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:06,986][25689] Avg episode reward: [(0, '-45.825')] [2022-07-09 15:06:07,820][26022] Updated weights on worker 0-0, policy_version 300994 (0.00091) [2022-07-09 15:06:09,780][26022] Updated weights on worker 0-0, policy_version 301004 (0.00089) [2022-07-09 15:06:11,600][26022] Updated weights on worker 0-0, policy_version 301014 (0.00082) [2022-07-09 15:06:12,026][25689] Fps is (10 sec: 5379.3, 60 sec: 5560.9, 300 sec: 5601.3). Total num frames: 308240384. Throughput: 0: 4878.1. Samples: 308236070. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:12,026][25689] Avg episode reward: [(0, '-45.484')] [2022-07-09 15:06:13,201][26022] Updated weights on worker 0-0, policy_version 301024 (0.00093) [2022-07-09 15:06:15,278][26022] Updated weights on worker 0-0, policy_version 301034 (0.00093) [2022-07-09 15:06:16,847][26022] Updated weights on worker 0-0, policy_version 301044 (0.00089) [2022-07-09 15:06:17,124][25689] Fps is (10 sec: 5656.0, 60 sec: 5556.3, 300 sec: 5607.7). Total num frames: 308270080. Throughput: 0: 5731.3. Samples: 308270002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:17,124][25689] Avg episode reward: [(0, '-44.105')] [2022-07-09 15:06:18,884][26022] Updated weights on worker 0-0, policy_version 301054 (0.00093) [2022-07-09 15:06:20,788][26022] Updated weights on worker 0-0, policy_version 301064 (0.00097) [2022-07-09 15:06:22,172][25689] Fps is (10 sec: 5550.6, 60 sec: 5553.2, 300 sec: 5597.3). Total num frames: 308296704. Throughput: 0: 5686.7. Samples: 308302904. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:22,173][25689] Avg episode reward: [(0, '-44.450')] [2022-07-09 15:06:22,488][26022] Updated weights on worker 0-0, policy_version 301074 (0.00082) [2022-07-09 15:06:24,366][26022] Updated weights on worker 0-0, policy_version 301084 (0.00091) [2022-07-09 15:06:26,171][26022] Updated weights on worker 0-0, policy_version 301094 (0.00061) [2022-07-09 15:06:27,179][25689] Fps is (10 sec: 5601.4, 60 sec: 5587.6, 300 sec: 5608.0). Total num frames: 308326400. Throughput: 0: 4969.4. Samples: 308319850. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:27,180][25689] Avg episode reward: [(0, '-44.885')] [2022-07-09 15:06:28,091][26022] Updated weights on worker 0-0, policy_version 301104 (0.00083) [2022-07-09 15:06:29,814][26022] Updated weights on worker 0-0, policy_version 301114 (0.00089) [2022-07-09 15:06:31,586][26022] Updated weights on worker 0-0, policy_version 301124 (0.00086) [2022-07-09 15:06:32,211][25689] Fps is (10 sec: 5610.5, 60 sec: 5535.7, 300 sec: 5596.1). Total num frames: 308353024. Throughput: 0: 5830.9. Samples: 308353766. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:32,211][25689] Avg episode reward: [(0, '-45.480')] [2022-07-09 15:06:33,347][26022] Updated weights on worker 0-0, policy_version 301134 (0.00086) [2022-07-09 15:06:35,307][26022] Updated weights on worker 0-0, policy_version 301144 (0.00090) [2022-07-09 15:06:37,217][26022] Updated weights on worker 0-0, policy_version 301154 (0.00083) [2022-07-09 15:06:37,287][25689] Fps is (10 sec: 5571.3, 60 sec: 5552.6, 300 sec: 5606.1). Total num frames: 308382720. Throughput: 0: 5832.2. Samples: 308387600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:37,288][25689] Avg episode reward: [(0, '-46.601')] [2022-07-09 15:06:38,855][26022] Updated weights on worker 0-0, policy_version 301164 (0.00098) [2022-07-09 15:06:40,889][26022] Updated weights on worker 0-0, policy_version 301174 (0.01004) [2022-07-09 15:06:42,308][25689] Fps is (10 sec: 5678.8, 60 sec: 5553.8, 300 sec: 5602.3). Total num frames: 308410368. Throughput: 0: 5039.0. Samples: 308404370. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:42,309][25689] Avg episode reward: [(0, '-46.391')] [2022-07-09 15:06:42,646][26022] Updated weights on worker 0-0, policy_version 301184 (0.00087) [2022-07-09 15:06:44,214][26022] Updated weights on worker 0-0, policy_version 301194 (0.00085) [2022-07-09 15:06:46,255][26022] Updated weights on worker 0-0, policy_version 301204 (0.00089) [2022-07-09 15:06:47,319][25689] Fps is (10 sec: 5716.4, 60 sec: 5594.0, 300 sec: 5606.2). Total num frames: 308440064. Throughput: 0: 5915.1. Samples: 308438984. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:47,320][25689] Avg episode reward: [(0, '-46.453')] [2022-07-09 15:06:47,567][26022] Updated weights on worker 0-0, policy_version 301214 (0.00085) [2022-07-09 15:06:49,728][26022] Updated weights on worker 0-0, policy_version 301224 (0.00084) [2022-07-09 15:06:51,272][26022] Updated weights on worker 0-0, policy_version 301234 (0.00086) [2022-07-09 15:06:52,327][25689] Fps is (10 sec: 5724.0, 60 sec: 5581.7, 300 sec: 5601.2). Total num frames: 308467712. Throughput: 0: 5956.8. Samples: 308473594. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:52,327][25689] Avg episode reward: [(0, '-46.844')] [2022-07-09 15:06:53,396][26022] Updated weights on worker 0-0, policy_version 301244 (0.00092) [2022-07-09 15:06:55,078][26022] Updated weights on worker 0-0, policy_version 301254 (0.00089) [2022-07-09 15:06:57,031][26022] Updated weights on worker 0-0, policy_version 301264 (0.00097) [2022-07-09 15:06:57,406][25689] Fps is (10 sec: 5583.3, 60 sec: 5583.5, 300 sec: 5604.7). Total num frames: 308496384. Throughput: 0: 5115.2. Samples: 308490512. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:06:57,407][25689] Avg episode reward: [(0, '-46.898')] [2022-07-09 15:06:58,548][26022] Updated weights on worker 0-0, policy_version 301274 (0.00089) [2022-07-09 15:07:00,634][26022] Updated weights on worker 0-0, policy_version 301284 (0.00086) [2022-07-09 15:07:02,414][25689] Fps is (10 sec: 5583.5, 60 sec: 5587.6, 300 sec: 5608.9). Total num frames: 308524032. Throughput: 0: 5968.5. Samples: 308524368. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:02,416][25689] Avg episode reward: [(0, '-47.482')] [2022-07-09 15:07:02,515][26022] Updated weights on worker 0-0, policy_version 301294 (0.00077) [2022-07-09 15:07:04,619][26022] Updated weights on worker 0-0, policy_version 301304 (0.00084) [2022-07-09 15:07:06,069][26022] Updated weights on worker 0-0, policy_version 301314 (0.00081) [2022-07-09 15:07:07,334][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:07:07,349][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000301319_308550656.pth [2022-07-09 15:07:07,349][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000299346_306530304.pth [2022-07-09 15:07:07,425][25689] Fps is (10 sec: 5417.0, 60 sec: 5591.0, 300 sec: 5606.9). Total num frames: 308550656. Throughput: 0: 5829.0. Samples: 308556182. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:07,426][25689] Avg episode reward: [(0, '-47.859')] [2022-07-09 15:07:07,999][26022] Updated weights on worker 0-0, policy_version 301324 (0.00084) [2022-07-09 15:07:09,962][26022] Updated weights on worker 0-0, policy_version 301334 (0.00087) [2022-07-09 15:07:11,731][26022] Updated weights on worker 0-0, policy_version 301344 (0.00093) [2022-07-09 15:07:12,455][25689] Fps is (10 sec: 5507.0, 60 sec: 5609.0, 300 sec: 5604.6). Total num frames: 308579328. Throughput: 0: 4931.3. Samples: 308572852. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:12,456][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 15:07:13,726][26022] Updated weights on worker 0-0, policy_version 301354 (0.00110) [2022-07-09 15:07:15,498][26022] Updated weights on worker 0-0, policy_version 301364 (0.00084) [2022-07-09 15:07:17,236][26022] Updated weights on worker 0-0, policy_version 301374 (0.00090) [2022-07-09 15:07:17,559][25689] Fps is (10 sec: 5658.8, 60 sec: 5591.5, 300 sec: 5603.3). Total num frames: 308608000. Throughput: 0: 5760.6. Samples: 308606602. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:17,561][25689] Avg episode reward: [(0, '-47.753')] [2022-07-09 15:07:19,106][26022] Updated weights on worker 0-0, policy_version 301384 (0.00086) [2022-07-09 15:07:20,829][26022] Updated weights on worker 0-0, policy_version 301394 (0.00102) [2022-07-09 15:07:22,541][26022] Updated weights on worker 0-0, policy_version 301404 (0.00081) [2022-07-09 15:07:22,639][25689] Fps is (10 sec: 5730.9, 60 sec: 5639.3, 300 sec: 5608.9). Total num frames: 308637696. Throughput: 0: 5760.1. Samples: 308640872. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:22,640][25689] Avg episode reward: [(0, '-46.710')] [2022-07-09 15:07:24,498][26022] Updated weights on worker 0-0, policy_version 301414 (0.00091) [2022-07-09 15:07:26,117][26022] Updated weights on worker 0-0, policy_version 301424 (0.00095) [2022-07-09 15:07:27,702][25689] Fps is (10 sec: 5653.4, 60 sec: 5600.2, 300 sec: 5601.9). Total num frames: 308665344. Throughput: 0: 5854.4. Samples: 308674890. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:27,702][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 15:07:28,293][26022] Updated weights on worker 0-0, policy_version 301434 (0.00094) [2022-07-09 15:07:29,897][26022] Updated weights on worker 0-0, policy_version 301444 (0.00089) [2022-07-09 15:07:31,739][26022] Updated weights on worker 0-0, policy_version 301454 (0.00087) [2022-07-09 15:07:32,721][25689] Fps is (10 sec: 5586.5, 60 sec: 5635.3, 300 sec: 5607.0). Total num frames: 308694016. Throughput: 0: 5873.1. Samples: 308691878. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:32,721][25689] Avg episode reward: [(0, '-45.880')] [2022-07-09 15:07:33,535][26022] Updated weights on worker 0-0, policy_version 301464 (0.00085) [2022-07-09 15:07:35,250][26022] Updated weights on worker 0-0, policy_version 301474 (0.00086) [2022-07-09 15:07:37,359][26022] Updated weights on worker 0-0, policy_version 301484 (0.00094) [2022-07-09 15:07:37,845][25689] Fps is (10 sec: 5754.3, 60 sec: 5630.8, 300 sec: 5612.0). Total num frames: 308723712. Throughput: 0: 5884.2. Samples: 308725972. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:37,846][25689] Avg episode reward: [(0, '-45.159')] [2022-07-09 15:07:38,940][26022] Updated weights on worker 0-0, policy_version 301494 (0.00100) [2022-07-09 15:07:40,825][26022] Updated weights on worker 0-0, policy_version 301504 (0.00087) [2022-07-09 15:07:42,581][26022] Updated weights on worker 0-0, policy_version 301514 (0.00088) [2022-07-09 15:07:42,872][25689] Fps is (10 sec: 5548.2, 60 sec: 5613.4, 300 sec: 5605.1). Total num frames: 308750336. Throughput: 0: 5894.1. Samples: 308760126. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:42,873][25689] Avg episode reward: [(0, '-45.476')] [2022-07-09 15:07:44,350][26022] Updated weights on worker 0-0, policy_version 301524 (0.00089) [2022-07-09 15:07:46,267][26022] Updated weights on worker 0-0, policy_version 301534 (0.00092) [2022-07-09 15:07:47,886][25689] Fps is (10 sec: 5609.0, 60 sec: 5613.1, 300 sec: 5608.4). Total num frames: 308780032. Throughput: 0: 5070.0. Samples: 308777224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:47,887][25689] Avg episode reward: [(0, '-46.397')] [2022-07-09 15:07:47,980][26022] Updated weights on worker 0-0, policy_version 301544 (0.00090) [2022-07-09 15:07:49,735][26022] Updated weights on worker 0-0, policy_version 301554 (0.00080) [2022-07-09 15:07:51,439][26022] Updated weights on worker 0-0, policy_version 301564 (0.00083) [2022-07-09 15:07:52,934][25689] Fps is (10 sec: 5800.9, 60 sec: 5626.3, 300 sec: 5609.1). Total num frames: 308808704. Throughput: 0: 5910.6. Samples: 308811348. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:52,935][25689] Avg episode reward: [(0, '-47.078')] [2022-07-09 15:07:53,399][26022] Updated weights on worker 0-0, policy_version 301574 (0.00089) [2022-07-09 15:07:55,281][26022] Updated weights on worker 0-0, policy_version 301584 (0.00096) [2022-07-09 15:07:56,984][26022] Updated weights on worker 0-0, policy_version 301594 (0.00115) [2022-07-09 15:07:58,012][25689] Fps is (10 sec: 5663.4, 60 sec: 5626.4, 300 sec: 5604.5). Total num frames: 308837376. Throughput: 0: 5919.2. Samples: 308845342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:07:58,012][25689] Avg episode reward: [(0, '-46.667')] [2022-07-09 15:07:58,856][26022] Updated weights on worker 0-0, policy_version 301604 (0.00118) [2022-07-09 15:08:00,576][26022] Updated weights on worker 0-0, policy_version 301614 (0.00089) [2022-07-09 15:08:02,779][26022] Updated weights on worker 0-0, policy_version 301624 (0.00085) [2022-07-09 15:08:03,076][25689] Fps is (10 sec: 5552.9, 60 sec: 5621.1, 300 sec: 5613.7). Total num frames: 308865024. Throughput: 0: 5064.7. Samples: 308862454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:08:03,077][25689] Avg episode reward: [(0, '-46.158')] [2022-07-09 15:08:04,644][26022] Updated weights on worker 0-0, policy_version 301634 (0.00093) [2022-07-09 15:08:06,492][26022] Updated weights on worker 0-0, policy_version 301644 (0.00107) [2022-07-09 15:08:08,081][25689] Fps is (10 sec: 5491.3, 60 sec: 5638.6, 300 sec: 5610.5). Total num frames: 308892672. Throughput: 0: 5802.9. Samples: 308894414. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:08:08,082][25689] Avg episode reward: [(0, '-46.693')] [2022-07-09 15:08:08,144][26022] Updated weights on worker 0-0, policy_version 301654 (0.00090) [2022-07-09 15:08:10,133][26022] Updated weights on worker 0-0, policy_version 301664 (0.00085) [2022-07-09 15:08:11,827][26022] Updated weights on worker 0-0, policy_version 301674 (0.00088) [2022-07-09 15:08:13,103][25689] Fps is (10 sec: 5514.7, 60 sec: 5622.4, 300 sec: 5605.2). Total num frames: 308920320. Throughput: 0: 5794.4. Samples: 308928218. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:08:13,105][25689] Avg episode reward: [(0, '-47.299')] [2022-07-09 15:08:13,737][26022] Updated weights on worker 0-0, policy_version 301684 (0.00084) [2022-07-09 15:08:15,415][26022] Updated weights on worker 0-0, policy_version 301694 (0.00092) [2022-07-09 15:08:17,238][26022] Updated weights on worker 0-0, policy_version 301704 (0.00085) [2022-07-09 15:08:18,187][25689] Fps is (10 sec: 5674.7, 60 sec: 5641.2, 300 sec: 5607.2). Total num frames: 308950016. Throughput: 0: 4945.2. Samples: 308945110. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:08:18,189][25689] Avg episode reward: [(0, '-46.598')] [2022-07-09 15:08:19,016][26022] Updated weights on worker 0-0, policy_version 301714 (0.00092) [2022-07-09 15:08:20,935][26022] Updated weights on worker 0-0, policy_version 301724 (0.00088) [2022-07-09 15:08:22,623][26022] Updated weights on worker 0-0, policy_version 301734 (0.00090) [2022-07-09 15:08:23,232][25689] Fps is (10 sec: 5762.5, 60 sec: 5627.6, 300 sec: 5610.1). Total num frames: 308978688. Throughput: 0: 5804.6. Samples: 308979452. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:23,234][25689] Avg episode reward: [(0, '-45.637')] [2022-07-09 15:08:24,487][26022] Updated weights on worker 0-0, policy_version 301744 (0.00094) [2022-07-09 15:08:26,137][26022] Updated weights on worker 0-0, policy_version 301754 (0.00084) [2022-07-09 15:08:28,165][26022] Updated weights on worker 0-0, policy_version 301764 (0.00090) [2022-07-09 15:08:28,257][25689] Fps is (10 sec: 5592.6, 60 sec: 5631.1, 300 sec: 5606.6). Total num frames: 309006336. Throughput: 0: 5915.0. Samples: 309013752. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:28,258][25689] Avg episode reward: [(0, '-45.907')] [2022-07-09 15:08:29,848][26022] Updated weights on worker 0-0, policy_version 301774 (0.00090) [2022-07-09 15:08:31,583][26022] Updated weights on worker 0-0, policy_version 301784 (0.00087) [2022-07-09 15:08:33,343][25689] Fps is (10 sec: 5570.2, 60 sec: 5624.9, 300 sec: 5610.6). Total num frames: 309035008. Throughput: 0: 5061.6. Samples: 309030664. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:33,344][25689] Avg episode reward: [(0, '-46.966')] [2022-07-09 15:08:33,502][26022] Updated weights on worker 0-0, policy_version 301794 (0.00091) [2022-07-09 15:08:35,307][26022] Updated weights on worker 0-0, policy_version 301804 (0.00101) [2022-07-09 15:08:37,195][26022] Updated weights on worker 0-0, policy_version 301814 (0.00091) [2022-07-09 15:08:38,431][25689] Fps is (10 sec: 5736.9, 60 sec: 5628.2, 300 sec: 5613.1). Total num frames: 309064704. Throughput: 0: 5889.5. Samples: 309064340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:38,432][25689] Avg episode reward: [(0, '-46.733')] [2022-07-09 15:08:38,983][26022] Updated weights on worker 0-0, policy_version 301824 (0.00081) [2022-07-09 15:08:40,925][26022] Updated weights on worker 0-0, policy_version 301834 (0.00089) [2022-07-09 15:08:42,662][26022] Updated weights on worker 0-0, policy_version 301844 (0.00091) [2022-07-09 15:08:43,485][25689] Fps is (10 sec: 5553.5, 60 sec: 5625.8, 300 sec: 5605.4). Total num frames: 309091328. Throughput: 0: 5860.6. Samples: 309098142. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:43,485][25689] Avg episode reward: [(0, '-46.408')] [2022-07-09 15:08:44,291][26022] Updated weights on worker 0-0, policy_version 301854 (0.00096) [2022-07-09 15:08:46,210][26022] Updated weights on worker 0-0, policy_version 301864 (0.00083) [2022-07-09 15:08:47,903][26022] Updated weights on worker 0-0, policy_version 301874 (0.00088) [2022-07-09 15:08:48,514][25689] Fps is (10 sec: 5585.9, 60 sec: 5624.4, 300 sec: 5609.0). Total num frames: 309121024. Throughput: 0: 5006.8. Samples: 309115174. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:48,516][25689] Avg episode reward: [(0, '-46.524')] [2022-07-09 15:08:49,866][26022] Updated weights on worker 0-0, policy_version 301884 (0.00079) [2022-07-09 15:08:51,818][26022] Updated weights on worker 0-0, policy_version 301894 (0.00080) [2022-07-09 15:08:53,518][26022] Updated weights on worker 0-0, policy_version 301904 (0.00087) [2022-07-09 15:08:53,542][25689] Fps is (10 sec: 5803.5, 60 sec: 5626.2, 300 sec: 5609.4). Total num frames: 309149696. Throughput: 0: 5850.7. Samples: 309148842. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:53,543][25689] Avg episode reward: [(0, '-47.183')] [2022-07-09 15:08:55,360][26022] Updated weights on worker 0-0, policy_version 301914 (0.00089) [2022-07-09 15:08:57,117][26022] Updated weights on worker 0-0, policy_version 301924 (0.00092) [2022-07-09 15:08:58,570][25689] Fps is (10 sec: 5600.4, 60 sec: 5613.9, 300 sec: 5605.6). Total num frames: 309177344. Throughput: 0: 5889.6. Samples: 309182950. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:08:58,571][25689] Avg episode reward: [(0, '-46.699')] [2022-07-09 15:08:58,964][26022] Updated weights on worker 0-0, policy_version 301934 (0.00090) [2022-07-09 15:09:00,883][26022] Updated weights on worker 0-0, policy_version 301944 (0.00087) [2022-07-09 15:09:02,818][26022] Updated weights on worker 0-0, policy_version 301954 (0.00089) [2022-07-09 15:09:03,580][25689] Fps is (10 sec: 5508.9, 60 sec: 5619.0, 300 sec: 5609.4). Total num frames: 309204992. Throughput: 0: 5083.9. Samples: 309200302. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:03,582][25689] Avg episode reward: [(0, '-46.587')] [2022-07-09 15:09:04,734][26022] Updated weights on worker 0-0, policy_version 301964 (0.00088) [2022-07-09 15:09:06,583][26022] Updated weights on worker 0-0, policy_version 301974 (0.00091) [2022-07-09 15:09:07,462][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:09:07,473][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000301979_309226496.pth [2022-07-09 15:09:07,473][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000300004_307204096.pth [2022-07-09 15:09:08,315][26022] Updated weights on worker 0-0, policy_version 301984 (0.00093) [2022-07-09 15:09:08,583][25689] Fps is (10 sec: 5522.7, 60 sec: 5619.2, 300 sec: 5609.4). Total num frames: 309232640. Throughput: 0: 5836.0. Samples: 309232296. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:08,583][25689] Avg episode reward: [(0, '-45.984')] [2022-07-09 15:09:10,177][26022] Updated weights on worker 0-0, policy_version 301994 (0.00086) [2022-07-09 15:09:12,128][26022] Updated weights on worker 0-0, policy_version 302004 (0.00089) [2022-07-09 15:09:13,590][25689] Fps is (10 sec: 5728.6, 60 sec: 5654.4, 300 sec: 5614.9). Total num frames: 309262336. Throughput: 0: 5850.9. Samples: 309266138. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:13,590][25689] Avg episode reward: [(0, '-45.989')] [2022-07-09 15:09:13,592][26022] Updated weights on worker 0-0, policy_version 302014 (0.00091) [2022-07-09 15:09:15,661][26022] Updated weights on worker 0-0, policy_version 302024 (0.00090) [2022-07-09 15:09:17,213][26022] Updated weights on worker 0-0, policy_version 302034 (0.00084) [2022-07-09 15:09:18,650][25689] Fps is (10 sec: 5492.3, 60 sec: 5588.8, 300 sec: 5600.8). Total num frames: 309287936. Throughput: 0: 4985.5. Samples: 309283060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:18,652][25689] Avg episode reward: [(0, '-46.180')] [2022-07-09 15:09:19,168][26022] Updated weights on worker 0-0, policy_version 302044 (0.00094) [2022-07-09 15:09:21,008][26022] Updated weights on worker 0-0, policy_version 302054 (0.00096) [2022-07-09 15:09:22,660][26022] Updated weights on worker 0-0, policy_version 302064 (0.00091) [2022-07-09 15:09:23,657][25689] Fps is (10 sec: 5492.5, 60 sec: 5609.3, 300 sec: 5604.5). Total num frames: 309317632. Throughput: 0: 5826.5. Samples: 309317284. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:23,659][25689] Avg episode reward: [(0, '-46.436')] [2022-07-09 15:09:24,686][26022] Updated weights on worker 0-0, policy_version 302074 (0.00087) [2022-07-09 15:09:26,351][26022] Updated weights on worker 0-0, policy_version 302084 (0.00086) [2022-07-09 15:09:28,186][26022] Updated weights on worker 0-0, policy_version 302094 (0.00092) [2022-07-09 15:09:28,680][25689] Fps is (10 sec: 5921.7, 60 sec: 5643.4, 300 sec: 5611.1). Total num frames: 309347328. Throughput: 0: 5910.1. Samples: 309351074. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:28,682][25689] Avg episode reward: [(0, '-46.425')] [2022-07-09 15:09:30,116][26022] Updated weights on worker 0-0, policy_version 302104 (0.00089) [2022-07-09 15:09:31,694][26022] Updated weights on worker 0-0, policy_version 302114 (0.00087) [2022-07-09 15:09:33,682][25689] Fps is (10 sec: 5617.9, 60 sec: 5617.3, 300 sec: 5608.7). Total num frames: 309373952. Throughput: 0: 5086.9. Samples: 309368346. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:33,683][25689] Avg episode reward: [(0, '-46.940')] [2022-07-09 15:09:33,708][26022] Updated weights on worker 0-0, policy_version 302124 (0.00091) [2022-07-09 15:09:35,412][26022] Updated weights on worker 0-0, policy_version 302134 (0.00095) [2022-07-09 15:09:37,319][26022] Updated weights on worker 0-0, policy_version 302144 (0.00094) [2022-07-09 15:09:38,739][25689] Fps is (10 sec: 5497.3, 60 sec: 5603.3, 300 sec: 5607.9). Total num frames: 309402624. Throughput: 0: 5930.9. Samples: 309402200. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:38,739][25689] Avg episode reward: [(0, '-47.111')] [2022-07-09 15:09:39,228][26022] Updated weights on worker 0-0, policy_version 302154 (0.00083) [2022-07-09 15:09:40,837][26022] Updated weights on worker 0-0, policy_version 302164 (0.00091) [2022-07-09 15:09:42,855][26022] Updated weights on worker 0-0, policy_version 302174 (0.00124) [2022-07-09 15:09:43,755][25689] Fps is (10 sec: 5896.4, 60 sec: 5674.7, 300 sec: 5614.9). Total num frames: 309433344. Throughput: 0: 5913.3. Samples: 309436128. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:43,757][25689] Avg episode reward: [(0, '-47.006')] [2022-07-09 15:09:44,659][26022] Updated weights on worker 0-0, policy_version 302184 (0.00093) [2022-07-09 15:09:46,361][26022] Updated weights on worker 0-0, policy_version 302194 (0.00089) [2022-07-09 15:09:48,327][26022] Updated weights on worker 0-0, policy_version 302204 (0.00088) [2022-07-09 15:09:48,795][25689] Fps is (10 sec: 5600.6, 60 sec: 5605.7, 300 sec: 5607.5). Total num frames: 309458944. Throughput: 0: 5068.6. Samples: 309453028. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:48,796][25689] Avg episode reward: [(0, '-47.117')] [2022-07-09 15:09:49,942][26022] Updated weights on worker 0-0, policy_version 302214 (0.00096) [2022-07-09 15:09:52,040][26022] Updated weights on worker 0-0, policy_version 302224 (0.00087) [2022-07-09 15:09:53,576][26022] Updated weights on worker 0-0, policy_version 302234 (0.00092) [2022-07-09 15:09:53,805][25689] Fps is (10 sec: 5502.1, 60 sec: 5624.4, 300 sec: 5612.0). Total num frames: 309488640. Throughput: 0: 5881.2. Samples: 309486690. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:53,806][25689] Avg episode reward: [(0, '-47.167')] [2022-07-09 15:09:55,579][26022] Updated weights on worker 0-0, policy_version 302244 (0.00089) [2022-07-09 15:09:57,222][26022] Updated weights on worker 0-0, policy_version 302254 (0.00092) [2022-07-09 15:09:58,857][25689] Fps is (10 sec: 5597.3, 60 sec: 5605.2, 300 sec: 5601.3). Total num frames: 309515264. Throughput: 0: 5882.2. Samples: 309520538. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:09:58,857][25689] Avg episode reward: [(0, '-47.614')] [2022-07-09 15:09:59,144][26022] Updated weights on worker 0-0, policy_version 302264 (0.00089) [2022-07-09 15:10:01,071][26022] Updated weights on worker 0-0, policy_version 302274 (0.00093) [2022-07-09 15:10:03,106][26022] Updated weights on worker 0-0, policy_version 302284 (0.00085) [2022-07-09 15:10:03,909][25689] Fps is (10 sec: 5472.5, 60 sec: 5618.2, 300 sec: 5614.3). Total num frames: 309543936. Throughput: 0: 5785.1. Samples: 309552722. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:03,910][25689] Avg episode reward: [(0, '-47.676')] [2022-07-09 15:10:04,946][26022] Updated weights on worker 0-0, policy_version 302294 (0.00091) [2022-07-09 15:10:06,639][26022] Updated weights on worker 0-0, policy_version 302304 (0.00081) [2022-07-09 15:10:08,504][26022] Updated weights on worker 0-0, policy_version 302314 (0.00101) [2022-07-09 15:10:08,921][25689] Fps is (10 sec: 5596.3, 60 sec: 5617.4, 300 sec: 5607.5). Total num frames: 309571584. Throughput: 0: 5815.9. Samples: 309570078. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:08,923][25689] Avg episode reward: [(0, '-47.910')] [2022-07-09 15:10:10,275][26022] Updated weights on worker 0-0, policy_version 302324 (0.00092) [2022-07-09 15:10:12,111][26022] Updated weights on worker 0-0, policy_version 302334 (0.00079) [2022-07-09 15:10:13,713][26022] Updated weights on worker 0-0, policy_version 302344 (0.00094) [2022-07-09 15:10:13,936][25689] Fps is (10 sec: 5616.7, 60 sec: 5599.7, 300 sec: 5604.6). Total num frames: 309600256. Throughput: 0: 5836.6. Samples: 309604190. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:13,937][25689] Avg episode reward: [(0, '-46.915')] [2022-07-09 15:10:15,809][26022] Updated weights on worker 0-0, policy_version 302354 (0.00084) [2022-07-09 15:10:17,407][26022] Updated weights on worker 0-0, policy_version 302364 (0.00091) [2022-07-09 15:10:19,081][25689] Fps is (10 sec: 5542.8, 60 sec: 5625.7, 300 sec: 5605.6). Total num frames: 309627904. Throughput: 0: 5819.6. Samples: 309638238. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:19,082][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 15:10:19,349][26022] Updated weights on worker 0-0, policy_version 302374 (0.00092) [2022-07-09 15:10:21,023][26022] Updated weights on worker 0-0, policy_version 302384 (0.00094) [2022-07-09 15:10:22,851][26022] Updated weights on worker 0-0, policy_version 302394 (0.00059) [2022-07-09 15:10:24,107][25689] Fps is (10 sec: 5739.0, 60 sec: 5640.9, 300 sec: 5615.7). Total num frames: 309658624. Throughput: 0: 5086.5. Samples: 309655458. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:24,107][25689] Avg episode reward: [(0, '-47.513')] [2022-07-09 15:10:24,807][26022] Updated weights on worker 0-0, policy_version 302404 (0.00084) [2022-07-09 15:10:26,515][26022] Updated weights on worker 0-0, policy_version 302414 (0.00091) [2022-07-09 15:10:28,308][26022] Updated weights on worker 0-0, policy_version 302424 (0.00093) [2022-07-09 15:10:29,136][25689] Fps is (10 sec: 5805.2, 60 sec: 5606.5, 300 sec: 5608.6). Total num frames: 309686272. Throughput: 0: 5898.5. Samples: 309689318. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:29,136][25689] Avg episode reward: [(0, '-47.298')] [2022-07-09 15:10:30,159][26022] Updated weights on worker 0-0, policy_version 302434 (0.00096) [2022-07-09 15:10:31,922][26022] Updated weights on worker 0-0, policy_version 302444 (0.00085) [2022-07-09 15:10:33,916][26022] Updated weights on worker 0-0, policy_version 302454 (0.00093) [2022-07-09 15:10:34,170][25689] Fps is (10 sec: 5494.7, 60 sec: 5620.4, 300 sec: 5606.0). Total num frames: 309713920. Throughput: 0: 5874.5. Samples: 309723054. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:34,171][25689] Avg episode reward: [(0, '-47.483')] [2022-07-09 15:10:35,639][26022] Updated weights on worker 0-0, policy_version 302464 (0.00093) [2022-07-09 15:10:37,450][26022] Updated weights on worker 0-0, policy_version 302474 (0.00095) [2022-07-09 15:10:39,215][26022] Updated weights on worker 0-0, policy_version 302484 (0.00097) [2022-07-09 15:10:39,311][25689] Fps is (10 sec: 5636.0, 60 sec: 5629.5, 300 sec: 5610.9). Total num frames: 309743616. Throughput: 0: 5033.7. Samples: 309740066. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:39,311][25689] Avg episode reward: [(0, '-47.430')] [2022-07-09 15:10:41,099][26022] Updated weights on worker 0-0, policy_version 302494 (0.00088) [2022-07-09 15:10:42,803][26022] Updated weights on worker 0-0, policy_version 302504 (0.00089) [2022-07-09 15:10:44,352][25689] Fps is (10 sec: 5631.9, 60 sec: 5576.5, 300 sec: 5611.6). Total num frames: 309771264. Throughput: 0: 5869.7. Samples: 309774292. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:10:44,353][25689] Avg episode reward: [(0, '-48.661')] [2022-07-09 15:10:44,824][26022] Updated weights on worker 0-0, policy_version 302514 (0.00091) [2022-07-09 15:10:46,283][26022] Updated weights on worker 0-0, policy_version 302524 (0.00084) [2022-07-09 15:10:48,234][26022] Updated weights on worker 0-0, policy_version 302534 (0.00081) [2022-07-09 15:10:49,395][25689] Fps is (10 sec: 5788.1, 60 sec: 5660.7, 300 sec: 5618.8). Total num frames: 309801984. Throughput: 0: 5879.3. Samples: 309808426. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:10:49,396][25689] Avg episode reward: [(0, '-48.809')] [2022-07-09 15:10:49,922][26022] Updated weights on worker 0-0, policy_version 302544 (0.00093) [2022-07-09 15:10:51,799][26022] Updated weights on worker 0-0, policy_version 302554 (0.00088) [2022-07-09 15:10:53,735][26022] Updated weights on worker 0-0, policy_version 302564 (0.00256) [2022-07-09 15:10:54,430][25689] Fps is (10 sec: 5689.9, 60 sec: 5607.7, 300 sec: 5613.1). Total num frames: 309828608. Throughput: 0: 5050.6. Samples: 309825380. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:10:54,431][25689] Avg episode reward: [(0, '-48.297')] [2022-07-09 15:10:55,373][26022] Updated weights on worker 0-0, policy_version 302574 (0.00087) [2022-07-09 15:10:57,328][26022] Updated weights on worker 0-0, policy_version 302584 (0.00091) [2022-07-09 15:10:58,900][26022] Updated weights on worker 0-0, policy_version 302594 (0.00101) [2022-07-09 15:10:59,535][25689] Fps is (10 sec: 5554.1, 60 sec: 5653.4, 300 sec: 5619.0). Total num frames: 309858304. Throughput: 0: 5920.5. Samples: 309859804. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:10:59,535][25689] Avg episode reward: [(0, '-47.487')] [2022-07-09 15:11:00,615][26022] Updated weights on worker 0-0, policy_version 302604 (0.00083) [2022-07-09 15:11:03,176][26022] Updated weights on worker 0-0, policy_version 302614 (0.00091) [2022-07-09 15:11:04,606][25689] Fps is (10 sec: 5635.6, 60 sec: 5634.8, 300 sec: 5622.0). Total num frames: 309885952. Throughput: 0: 5813.0. Samples: 309892026. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:04,608][25689] Avg episode reward: [(0, '-47.002')] [2022-07-09 15:11:04,629][26022] Updated weights on worker 0-0, policy_version 302624 (0.00098) [2022-07-09 15:11:06,619][26022] Updated weights on worker 0-0, policy_version 302634 (0.00093) [2022-07-09 15:11:07,565][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:11:07,585][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000302640_309903360.pth [2022-07-09 15:11:07,586][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000300664_307879936.pth [2022-07-09 15:11:08,300][26022] Updated weights on worker 0-0, policy_version 302644 (0.00091) [2022-07-09 15:11:09,622][25689] Fps is (10 sec: 5481.9, 60 sec: 5634.4, 300 sec: 5622.4). Total num frames: 309913600. Throughput: 0: 4988.7. Samples: 309909334. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:09,622][25689] Avg episode reward: [(0, '-45.993')] [2022-07-09 15:11:10,039][26022] Updated weights on worker 0-0, policy_version 302654 (0.00416) [2022-07-09 15:11:11,978][26022] Updated weights on worker 0-0, policy_version 302664 (0.00088) [2022-07-09 15:11:13,663][26022] Updated weights on worker 0-0, policy_version 302674 (0.00096) [2022-07-09 15:11:14,637][25689] Fps is (10 sec: 5614.3, 60 sec: 5634.4, 300 sec: 5620.6). Total num frames: 309942272. Throughput: 0: 5847.8. Samples: 309943546. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:14,638][25689] Avg episode reward: [(0, '-45.919')] [2022-07-09 15:11:15,510][26022] Updated weights on worker 0-0, policy_version 302684 (0.00098) [2022-07-09 15:11:17,367][26022] Updated weights on worker 0-0, policy_version 302694 (0.00084) [2022-07-09 15:11:19,007][26022] Updated weights on worker 0-0, policy_version 302704 (0.01038) [2022-07-09 15:11:19,718][25689] Fps is (10 sec: 5679.6, 60 sec: 5657.3, 300 sec: 5626.8). Total num frames: 309970944. Throughput: 0: 5835.5. Samples: 309977586. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:19,721][25689] Avg episode reward: [(0, '-45.841')] [2022-07-09 15:11:20,910][26022] Updated weights on worker 0-0, policy_version 302714 (0.00096) [2022-07-09 15:11:22,660][26022] Updated weights on worker 0-0, policy_version 302724 (0.00085) [2022-07-09 15:11:24,540][26022] Updated weights on worker 0-0, policy_version 302734 (0.00093) [2022-07-09 15:11:24,730][25689] Fps is (10 sec: 5681.7, 60 sec: 5624.8, 300 sec: 5623.3). Total num frames: 309999616. Throughput: 0: 5096.6. Samples: 309994592. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:24,732][25689] Avg episode reward: [(0, '-46.470')] [2022-07-09 15:11:26,542][26022] Updated weights on worker 0-0, policy_version 302744 (0.00087) [2022-07-09 15:11:28,194][26022] Updated weights on worker 0-0, policy_version 302754 (0.00095) [2022-07-09 15:11:29,744][25689] Fps is (10 sec: 5821.9, 60 sec: 5660.0, 300 sec: 5634.0). Total num frames: 310029312. Throughput: 0: 5912.8. Samples: 310028312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:29,744][25689] Avg episode reward: [(0, '-46.116')] [2022-07-09 15:11:30,119][26022] Updated weights on worker 0-0, policy_version 302764 (0.00096) [2022-07-09 15:11:31,819][26022] Updated weights on worker 0-0, policy_version 302774 (0.00087) [2022-07-09 15:11:33,683][26022] Updated weights on worker 0-0, policy_version 302784 (0.00089) [2022-07-09 15:11:34,791][25689] Fps is (10 sec: 5699.7, 60 sec: 5658.8, 300 sec: 5627.6). Total num frames: 310056960. Throughput: 0: 5907.9. Samples: 310062612. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:34,793][25689] Avg episode reward: [(0, '-46.296')] [2022-07-09 15:11:35,432][26022] Updated weights on worker 0-0, policy_version 302794 (0.00089) [2022-07-09 15:11:37,152][26022] Updated weights on worker 0-0, policy_version 302804 (0.00090) [2022-07-09 15:11:39,117][26022] Updated weights on worker 0-0, policy_version 302814 (0.00086) [2022-07-09 15:11:39,847][25689] Fps is (10 sec: 5574.3, 60 sec: 5649.7, 300 sec: 5630.4). Total num frames: 310085632. Throughput: 0: 5058.8. Samples: 310079414. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:39,848][25689] Avg episode reward: [(0, '-46.127')] [2022-07-09 15:11:40,891][26022] Updated weights on worker 0-0, policy_version 302824 (0.00093) [2022-07-09 15:11:42,669][26022] Updated weights on worker 0-0, policy_version 302834 (0.00089) [2022-07-09 15:11:44,535][26022] Updated weights on worker 0-0, policy_version 302844 (0.00086) [2022-07-09 15:11:44,857][25689] Fps is (10 sec: 5595.1, 60 sec: 5652.7, 300 sec: 5623.5). Total num frames: 310113280. Throughput: 0: 5899.5. Samples: 310113330. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:44,859][25689] Avg episode reward: [(0, '-46.195')] [2022-07-09 15:11:46,268][26022] Updated weights on worker 0-0, policy_version 302854 (0.00084) [2022-07-09 15:11:48,125][26022] Updated weights on worker 0-0, policy_version 302864 (0.00087) [2022-07-09 15:11:49,893][25689] Fps is (10 sec: 5504.5, 60 sec: 5602.5, 300 sec: 5623.0). Total num frames: 310140928. Throughput: 0: 5905.9. Samples: 310147310. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:49,894][25689] Avg episode reward: [(0, '-46.015')] [2022-07-09 15:11:50,161][26022] Updated weights on worker 0-0, policy_version 302874 (0.00083) [2022-07-09 15:11:51,658][26022] Updated weights on worker 0-0, policy_version 302884 (0.00085) [2022-07-09 15:11:53,555][26022] Updated weights on worker 0-0, policy_version 302894 (0.00085) [2022-07-09 15:11:54,937][25689] Fps is (10 sec: 5892.3, 60 sec: 5686.4, 300 sec: 5634.0). Total num frames: 310172672. Throughput: 0: 5055.6. Samples: 310164460. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:54,937][25689] Avg episode reward: [(0, '-46.614')] [2022-07-09 15:11:55,168][26022] Updated weights on worker 0-0, policy_version 302904 (0.00087) [2022-07-09 15:11:57,301][26022] Updated weights on worker 0-0, policy_version 302914 (0.00057) [2022-07-09 15:11:58,823][26022] Updated weights on worker 0-0, policy_version 302924 (0.00091) [2022-07-09 15:11:59,984][25689] Fps is (10 sec: 5784.3, 60 sec: 5641.0, 300 sec: 5629.8). Total num frames: 310199296. Throughput: 0: 5917.0. Samples: 310198562. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:11:59,984][25689] Avg episode reward: [(0, '-46.557')] [2022-07-09 15:12:00,709][26022] Updated weights on worker 0-0, policy_version 302934 (0.00086) [2022-07-09 15:12:02,914][26022] Updated weights on worker 0-0, policy_version 302944 (0.00092) [2022-07-09 15:12:04,787][26022] Updated weights on worker 0-0, policy_version 302954 (0.00087) [2022-07-09 15:12:04,995][25689] Fps is (10 sec: 5294.0, 60 sec: 5629.6, 300 sec: 5629.8). Total num frames: 310225920. Throughput: 0: 5813.3. Samples: 310230396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:04,997][25689] Avg episode reward: [(0, '-46.226')] [2022-07-09 15:12:06,520][26022] Updated weights on worker 0-0, policy_version 302964 (0.00095) [2022-07-09 15:12:08,537][26022] Updated weights on worker 0-0, policy_version 302974 (0.00088) [2022-07-09 15:12:10,024][25689] Fps is (10 sec: 5507.7, 60 sec: 5645.4, 300 sec: 5629.8). Total num frames: 310254592. Throughput: 0: 4970.9. Samples: 310247378. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:10,026][25689] Avg episode reward: [(0, '-45.898')] [2022-07-09 15:12:10,139][26022] Updated weights on worker 0-0, policy_version 302984 (0.00096) [2022-07-09 15:12:12,128][26022] Updated weights on worker 0-0, policy_version 302994 (0.00107) [2022-07-09 15:12:13,881][26022] Updated weights on worker 0-0, policy_version 303004 (0.00095) [2022-07-09 15:12:15,052][25689] Fps is (10 sec: 5498.0, 60 sec: 5610.3, 300 sec: 5624.4). Total num frames: 310281216. Throughput: 0: 5805.6. Samples: 310281244. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:15,054][25689] Avg episode reward: [(0, '-46.353')] [2022-07-09 15:12:15,627][26022] Updated weights on worker 0-0, policy_version 303014 (0.00095) [2022-07-09 15:12:17,778][26022] Updated weights on worker 0-0, policy_version 303024 (0.00092) [2022-07-09 15:12:19,210][26022] Updated weights on worker 0-0, policy_version 303034 (0.00090) [2022-07-09 15:12:20,189][25689] Fps is (10 sec: 5540.7, 60 sec: 5622.1, 300 sec: 5623.3). Total num frames: 310310912. Throughput: 0: 5744.8. Samples: 310314634. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:20,189][25689] Avg episode reward: [(0, '-46.569')] [2022-07-09 15:12:21,394][26022] Updated weights on worker 0-0, policy_version 303044 (0.00099) [2022-07-09 15:12:22,925][26022] Updated weights on worker 0-0, policy_version 303054 (0.00083) [2022-07-09 15:12:24,758][26022] Updated weights on worker 0-0, policy_version 303064 (0.00092) [2022-07-09 15:12:25,210][25689] Fps is (10 sec: 5645.2, 60 sec: 5604.2, 300 sec: 5624.1). Total num frames: 310338560. Throughput: 0: 5848.5. Samples: 310348626. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:25,211][25689] Avg episode reward: [(0, '-46.133')] [2022-07-09 15:12:26,504][26022] Updated weights on worker 0-0, policy_version 303074 (0.00088) [2022-07-09 15:12:28,666][26022] Updated weights on worker 0-0, policy_version 303084 (0.00084) [2022-07-09 15:12:30,211][25689] Fps is (10 sec: 5619.2, 60 sec: 5588.5, 300 sec: 5624.4). Total num frames: 310367232. Throughput: 0: 5852.4. Samples: 310365524. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:30,212][25689] Avg episode reward: [(0, '-46.125')] [2022-07-09 15:12:30,258][26022] Updated weights on worker 0-0, policy_version 303094 (0.00091) [2022-07-09 15:12:32,128][26022] Updated weights on worker 0-0, policy_version 303104 (0.00090) [2022-07-09 15:12:33,874][26022] Updated weights on worker 0-0, policy_version 303114 (0.00087) [2022-07-09 15:12:35,231][25689] Fps is (10 sec: 5620.1, 60 sec: 5591.0, 300 sec: 5619.5). Total num frames: 310394880. Throughput: 0: 5869.4. Samples: 310399682. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:35,232][25689] Avg episode reward: [(0, '-45.693')] [2022-07-09 15:12:35,715][26022] Updated weights on worker 0-0, policy_version 303124 (0.00094) [2022-07-09 15:12:37,611][26022] Updated weights on worker 0-0, policy_version 303134 (0.00086) [2022-07-09 15:12:39,336][26022] Updated weights on worker 0-0, policy_version 303144 (0.00094) [2022-07-09 15:12:40,283][25689] Fps is (10 sec: 5591.9, 60 sec: 5591.4, 300 sec: 5625.9). Total num frames: 310423552. Throughput: 0: 5920.6. Samples: 310433604. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:40,284][25689] Avg episode reward: [(0, '-46.291')] [2022-07-09 15:12:41,273][26022] Updated weights on worker 0-0, policy_version 303154 (0.00081) [2022-07-09 15:12:42,894][26022] Updated weights on worker 0-0, policy_version 303164 (0.00084) [2022-07-09 15:12:44,651][26022] Updated weights on worker 0-0, policy_version 303174 (0.00086) [2022-07-09 15:12:45,339][25689] Fps is (10 sec: 5876.1, 60 sec: 5637.9, 300 sec: 5628.6). Total num frames: 310454272. Throughput: 0: 5073.6. Samples: 310450750. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:45,339][25689] Avg episode reward: [(0, '-46.699')] [2022-07-09 15:12:46,592][26022] Updated weights on worker 0-0, policy_version 303184 (0.00087) [2022-07-09 15:12:48,242][26022] Updated weights on worker 0-0, policy_version 303194 (0.00081) [2022-07-09 15:12:50,172][26022] Updated weights on worker 0-0, policy_version 303204 (0.00090) [2022-07-09 15:12:50,368][25689] Fps is (10 sec: 5787.4, 60 sec: 5638.5, 300 sec: 5625.5). Total num frames: 310481920. Throughput: 0: 5926.1. Samples: 310484976. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:50,369][25689] Avg episode reward: [(0, '-46.430')] [2022-07-09 15:12:52,092][26022] Updated weights on worker 0-0, policy_version 303214 (0.00087) [2022-07-09 15:12:53,876][26022] Updated weights on worker 0-0, policy_version 303224 (0.00086) [2022-07-09 15:12:55,391][25689] Fps is (10 sec: 5500.9, 60 sec: 5572.7, 300 sec: 5623.1). Total num frames: 310509568. Throughput: 0: 5915.1. Samples: 310518928. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:12:55,391][25689] Avg episode reward: [(0, '-46.949')] [2022-07-09 15:12:55,743][26022] Updated weights on worker 0-0, policy_version 303234 (0.00092) [2022-07-09 15:12:57,300][26022] Updated weights on worker 0-0, policy_version 303244 (0.00093) [2022-07-09 15:12:59,214][26022] Updated weights on worker 0-0, policy_version 303254 (0.00092) [2022-07-09 15:13:00,452][25689] Fps is (10 sec: 5686.7, 60 sec: 5622.3, 300 sec: 5630.0). Total num frames: 310539264. Throughput: 0: 5066.9. Samples: 310535800. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:13:00,454][25689] Avg episode reward: [(0, '-46.922')] [2022-07-09 15:13:00,911][26022] Updated weights on worker 0-0, policy_version 303264 (0.00095) [2022-07-09 15:13:03,296][26022] Updated weights on worker 0-0, policy_version 303274 (0.00094) [2022-07-09 15:13:05,028][26022] Updated weights on worker 0-0, policy_version 303284 (0.00088) [2022-07-09 15:13:05,466][25689] Fps is (10 sec: 5590.0, 60 sec: 5622.0, 300 sec: 5626.4). Total num frames: 310565888. Throughput: 0: 5816.8. Samples: 310567826. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:13:05,468][25689] Avg episode reward: [(0, '-48.270')] [2022-07-09 15:13:06,738][26022] Updated weights on worker 0-0, policy_version 303294 (0.00094) [2022-07-09 15:13:07,761][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:13:07,769][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000303299_310578176.pth [2022-07-09 15:13:07,770][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000301319_308550656.pth [2022-07-09 15:13:08,579][26022] Updated weights on worker 0-0, policy_version 303304 (0.00092) [2022-07-09 15:13:10,305][26022] Updated weights on worker 0-0, policy_version 303314 (0.00092) [2022-07-09 15:13:10,486][25689] Fps is (10 sec: 5511.1, 60 sec: 5622.8, 300 sec: 5629.9). Total num frames: 310594560. Throughput: 0: 5814.1. Samples: 310601940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-09 15:13:10,486][25689] Avg episode reward: [(0, '-47.986')] [2022-07-09 15:13:12,260][26022] Updated weights on worker 0-0, policy_version 303324 (0.00091) [2022-07-09 15:13:13,945][26022] Updated weights on worker 0-0, policy_version 303334 (0.00082) [2022-07-09 15:13:15,503][25689] Fps is (10 sec: 5611.3, 60 sec: 5640.8, 300 sec: 5624.3). Total num frames: 310622208. Throughput: 0: 4984.0. Samples: 310619166. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:15,504][25689] Avg episode reward: [(0, '-47.886')] [2022-07-09 15:13:15,804][26022] Updated weights on worker 0-0, policy_version 303344 (0.00089) [2022-07-09 15:13:17,687][26022] Updated weights on worker 0-0, policy_version 303354 (0.00093) [2022-07-09 15:13:19,340][26022] Updated weights on worker 0-0, policy_version 303364 (0.00097) [2022-07-09 15:13:20,617][25689] Fps is (10 sec: 5457.9, 60 sec: 5609.0, 300 sec: 5619.5). Total num frames: 310649856. Throughput: 0: 5823.7. Samples: 310653236. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:20,618][25689] Avg episode reward: [(0, '-47.114')] [2022-07-09 15:13:21,199][26022] Updated weights on worker 0-0, policy_version 303374 (0.00090) [2022-07-09 15:13:23,096][26022] Updated weights on worker 0-0, policy_version 303384 (0.00088) [2022-07-09 15:13:24,716][26022] Updated weights on worker 0-0, policy_version 303394 (0.00091) [2022-07-09 15:13:25,630][25689] Fps is (10 sec: 5763.6, 60 sec: 5660.7, 300 sec: 5630.1). Total num frames: 310680576. Throughput: 0: 5938.9. Samples: 310687578. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:25,630][25689] Avg episode reward: [(0, '-46.680')] [2022-07-09 15:13:26,719][26022] Updated weights on worker 0-0, policy_version 303404 (0.00086) [2022-07-09 15:13:28,256][26022] Updated weights on worker 0-0, policy_version 303414 (0.00084) [2022-07-09 15:13:30,414][26022] Updated weights on worker 0-0, policy_version 303424 (0.00087) [2022-07-09 15:13:30,690][25689] Fps is (10 sec: 5794.7, 60 sec: 5638.2, 300 sec: 5627.2). Total num frames: 310708224. Throughput: 0: 5080.1. Samples: 310704578. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:30,691][25689] Avg episode reward: [(0, '-45.991')] [2022-07-09 15:13:31,973][26022] Updated weights on worker 0-0, policy_version 303434 (0.00092) [2022-07-09 15:13:33,865][26022] Updated weights on worker 0-0, policy_version 303444 (0.00090) [2022-07-09 15:13:35,673][26022] Updated weights on worker 0-0, policy_version 303454 (0.00088) [2022-07-09 15:13:35,713][25689] Fps is (10 sec: 5585.4, 60 sec: 5654.9, 300 sec: 5624.9). Total num frames: 310736896. Throughput: 0: 5908.8. Samples: 310738584. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:35,714][25689] Avg episode reward: [(0, '-45.387')] [2022-07-09 15:13:37,698][26022] Updated weights on worker 0-0, policy_version 303464 (0.00101) [2022-07-09 15:13:39,153][26022] Updated weights on worker 0-0, policy_version 303474 (0.00091) [2022-07-09 15:13:40,847][25689] Fps is (10 sec: 5544.9, 60 sec: 5630.3, 300 sec: 5626.9). Total num frames: 310764544. Throughput: 0: 5879.7. Samples: 310772180. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:40,847][25689] Avg episode reward: [(0, '-44.963')] [2022-07-09 15:13:41,257][26022] Updated weights on worker 0-0, policy_version 303484 (0.00088) [2022-07-09 15:13:42,747][26022] Updated weights on worker 0-0, policy_version 303494 (0.00100) [2022-07-09 15:13:44,835][26022] Updated weights on worker 0-0, policy_version 303504 (0.00089) [2022-07-09 15:13:45,861][25689] Fps is (10 sec: 5751.5, 60 sec: 5634.1, 300 sec: 5630.6). Total num frames: 310795264. Throughput: 0: 5029.9. Samples: 310789340. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:45,862][25689] Avg episode reward: [(0, '-45.925')] [2022-07-09 15:13:46,363][26022] Updated weights on worker 0-0, policy_version 303514 (0.00083) [2022-07-09 15:13:48,400][26022] Updated weights on worker 0-0, policy_version 303524 (0.00093) [2022-07-09 15:13:50,160][26022] Updated weights on worker 0-0, policy_version 303534 (0.00084) [2022-07-09 15:13:50,869][25689] Fps is (10 sec: 5824.0, 60 sec: 5636.2, 300 sec: 5627.5). Total num frames: 310822912. Throughput: 0: 5889.5. Samples: 310823422. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:50,869][25689] Avg episode reward: [(0, '-45.626')] [2022-07-09 15:13:52,188][26022] Updated weights on worker 0-0, policy_version 303544 (0.00086) [2022-07-09 15:13:53,903][26022] Updated weights on worker 0-0, policy_version 303554 (0.00088) [2022-07-09 15:13:55,616][26022] Updated weights on worker 0-0, policy_version 303564 (0.00085) [2022-07-09 15:13:55,906][25689] Fps is (10 sec: 5402.9, 60 sec: 5617.9, 300 sec: 5623.9). Total num frames: 310849536. Throughput: 0: 5883.9. Samples: 310857400. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:13:55,907][25689] Avg episode reward: [(0, '-45.876')] [2022-07-09 15:13:57,345][26022] Updated weights on worker 0-0, policy_version 303574 (0.00094) [2022-07-09 15:13:59,364][26022] Updated weights on worker 0-0, policy_version 303584 (0.00098) [2022-07-09 15:14:00,868][26022] Updated weights on worker 0-0, policy_version 303594 (0.00092) [2022-07-09 15:14:00,967][25689] Fps is (10 sec: 5678.8, 60 sec: 5634.9, 300 sec: 5633.3). Total num frames: 310880256. Throughput: 0: 5070.0. Samples: 310874188. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:00,967][25689] Avg episode reward: [(0, '-46.922')] [2022-07-09 15:14:03,150][26022] Updated weights on worker 0-0, policy_version 303604 (0.00097) [2022-07-09 15:14:04,956][26022] Updated weights on worker 0-0, policy_version 303614 (0.00095) [2022-07-09 15:14:06,002][25689] Fps is (10 sec: 5679.9, 60 sec: 5632.8, 300 sec: 5629.2). Total num frames: 310906880. Throughput: 0: 5818.0. Samples: 310906520. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:06,003][25689] Avg episode reward: [(0, '-47.039')] [2022-07-09 15:14:06,743][26022] Updated weights on worker 0-0, policy_version 303624 (0.00089) [2022-07-09 15:14:08,422][26022] Updated weights on worker 0-0, policy_version 303634 (0.00089) [2022-07-09 15:14:10,179][26022] Updated weights on worker 0-0, policy_version 303644 (0.00087) [2022-07-09 15:14:11,069][25689] Fps is (10 sec: 5270.7, 60 sec: 5594.7, 300 sec: 5617.8). Total num frames: 310933504. Throughput: 0: 5825.8. Samples: 310941106. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:11,083][25689] Avg episode reward: [(0, '-46.741')] [2022-07-09 15:14:11,862][26022] Updated weights on worker 0-0, policy_version 303654 (0.00085) [2022-07-09 15:14:14,124][26022] Updated weights on worker 0-0, policy_version 303664 (0.00089) [2022-07-09 15:14:15,531][26022] Updated weights on worker 0-0, policy_version 303674 (0.00087) [2022-07-09 15:14:16,137][25689] Fps is (10 sec: 5657.9, 60 sec: 5640.6, 300 sec: 5634.8). Total num frames: 310964224. Throughput: 0: 4986.4. Samples: 310958280. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:16,138][25689] Avg episode reward: [(0, '-46.935')] [2022-07-09 15:14:17,691][26022] Updated weights on worker 0-0, policy_version 303684 (0.00093) [2022-07-09 15:14:19,253][26022] Updated weights on worker 0-0, policy_version 303694 (0.00091) [2022-07-09 15:14:21,127][26022] Updated weights on worker 0-0, policy_version 303704 (0.00085) [2022-07-09 15:14:21,222][25689] Fps is (10 sec: 5850.1, 60 sec: 5660.3, 300 sec: 5629.9). Total num frames: 310992896. Throughput: 0: 5831.8. Samples: 310992310. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:21,222][25689] Avg episode reward: [(0, '-47.814')] [2022-07-09 15:14:22,795][26022] Updated weights on worker 0-0, policy_version 303714 (0.00089) [2022-07-09 15:14:24,776][26022] Updated weights on worker 0-0, policy_version 303724 (0.00084) [2022-07-09 15:14:26,282][25689] Fps is (10 sec: 5753.5, 60 sec: 5639.0, 300 sec: 5629.2). Total num frames: 311022592. Throughput: 0: 5915.3. Samples: 311026482. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:26,284][25689] Avg episode reward: [(0, '-47.439')] [2022-07-09 15:14:26,408][26022] Updated weights on worker 0-0, policy_version 303734 (0.00093) [2022-07-09 15:14:28,343][26022] Updated weights on worker 0-0, policy_version 303744 (0.00092) [2022-07-09 15:14:29,865][26022] Updated weights on worker 0-0, policy_version 303754 (0.00085) [2022-07-09 15:14:31,322][25689] Fps is (10 sec: 5575.9, 60 sec: 5623.9, 300 sec: 5628.5). Total num frames: 311049216. Throughput: 0: 5054.1. Samples: 311043466. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:31,324][25689] Avg episode reward: [(0, '-48.126')] [2022-07-09 15:14:32,012][26022] Updated weights on worker 0-0, policy_version 303764 (0.00085) [2022-07-09 15:14:33,507][26022] Updated weights on worker 0-0, policy_version 303774 (0.00094) [2022-07-09 15:14:35,485][26022] Updated weights on worker 0-0, policy_version 303784 (0.00085) [2022-07-09 15:14:36,358][25689] Fps is (10 sec: 5691.0, 60 sec: 5656.5, 300 sec: 5635.8). Total num frames: 311079936. Throughput: 0: 5907.7. Samples: 311077740. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:36,360][25689] Avg episode reward: [(0, '-47.547')] [2022-07-09 15:14:37,182][26022] Updated weights on worker 0-0, policy_version 303794 (0.00097) [2022-07-09 15:14:38,890][26022] Updated weights on worker 0-0, policy_version 303804 (0.00096) [2022-07-09 15:14:41,003][26022] Updated weights on worker 0-0, policy_version 303814 (0.00086) [2022-07-09 15:14:41,425][25689] Fps is (10 sec: 5777.4, 60 sec: 5662.8, 300 sec: 5624.5). Total num frames: 311107584. Throughput: 0: 5915.2. Samples: 311111818. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:41,427][25689] Avg episode reward: [(0, '-47.069')] [2022-07-09 15:14:42,517][26022] Updated weights on worker 0-0, policy_version 303824 (0.00082) [2022-07-09 15:14:44,564][26022] Updated weights on worker 0-0, policy_version 303834 (0.00089) [2022-07-09 15:14:46,294][26022] Updated weights on worker 0-0, policy_version 303844 (0.00092) [2022-07-09 15:14:46,515][25689] Fps is (10 sec: 5545.2, 60 sec: 5622.0, 300 sec: 5633.9). Total num frames: 311136256. Throughput: 0: 5913.0. Samples: 311146118. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:46,515][25689] Avg episode reward: [(0, '-47.134')] [2022-07-09 15:14:48,067][26022] Updated weights on worker 0-0, policy_version 303854 (0.00097) [2022-07-09 15:14:50,072][26022] Updated weights on worker 0-0, policy_version 303864 (0.00095) [2022-07-09 15:14:51,552][25689] Fps is (10 sec: 5763.5, 60 sec: 5652.9, 300 sec: 5633.3). Total num frames: 311165952. Throughput: 0: 5916.4. Samples: 311163156. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:51,553][25689] Avg episode reward: [(0, '-47.775')] [2022-07-09 15:14:51,751][26022] Updated weights on worker 0-0, policy_version 303874 (0.00412) [2022-07-09 15:14:53,491][26022] Updated weights on worker 0-0, policy_version 303884 (0.00076) [2022-07-09 15:14:55,157][26022] Updated weights on worker 0-0, policy_version 303894 (0.00095) [2022-07-09 15:14:56,617][25689] Fps is (10 sec: 5777.6, 60 sec: 5684.1, 300 sec: 5640.0). Total num frames: 311194624. Throughput: 0: 5914.6. Samples: 311197564. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:14:56,618][25689] Avg episode reward: [(0, '-47.635')] [2022-07-09 15:14:57,182][26022] Updated weights on worker 0-0, policy_version 303904 (0.00093) [2022-07-09 15:14:58,852][26022] Updated weights on worker 0-0, policy_version 303914 (0.00087) [2022-07-09 15:15:00,518][26022] Updated weights on worker 0-0, policy_version 303924 (0.00091) [2022-07-09 15:15:01,673][25689] Fps is (10 sec: 5666.0, 60 sec: 5650.8, 300 sec: 5639.9). Total num frames: 311223296. Throughput: 0: 5919.7. Samples: 311231680. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:01,674][25689] Avg episode reward: [(0, '-47.871')] [2022-07-09 15:15:02,956][26022] Updated weights on worker 0-0, policy_version 303934 (0.00058) [2022-07-09 15:15:04,462][26022] Updated weights on worker 0-0, policy_version 303944 (0.00091) [2022-07-09 15:15:06,459][26022] Updated weights on worker 0-0, policy_version 303954 (0.00091) [2022-07-09 15:15:06,728][25689] Fps is (10 sec: 5468.9, 60 sec: 5649.0, 300 sec: 5635.6). Total num frames: 311249920. Throughput: 0: 4975.2. Samples: 311246684. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:06,729][25689] Avg episode reward: [(0, '-47.597')] [2022-07-09 15:15:07,808][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:15:07,825][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000303962_311257088.pth [2022-07-09 15:15:07,826][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000301979_309226496.pth [2022-07-09 15:15:08,043][26022] Updated weights on worker 0-0, policy_version 303964 (0.00079) [2022-07-09 15:15:10,004][26022] Updated weights on worker 0-0, policy_version 303974 (0.00089) [2022-07-09 15:15:11,752][25689] Fps is (10 sec: 5486.4, 60 sec: 5686.8, 300 sec: 5635.5). Total num frames: 311278592. Throughput: 0: 5830.6. Samples: 311280932. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:11,752][25689] Avg episode reward: [(0, '-47.419')] [2022-07-09 15:15:11,798][26022] Updated weights on worker 0-0, policy_version 303984 (0.00107) [2022-07-09 15:15:13,498][26022] Updated weights on worker 0-0, policy_version 303994 (0.00088) [2022-07-09 15:15:15,397][26022] Updated weights on worker 0-0, policy_version 304004 (0.00090) [2022-07-09 15:15:16,783][25689] Fps is (10 sec: 5805.3, 60 sec: 5673.4, 300 sec: 5644.5). Total num frames: 311308288. Throughput: 0: 5841.4. Samples: 311315358. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:16,783][25689] Avg episode reward: [(0, '-46.914')] [2022-07-09 15:15:17,294][26022] Updated weights on worker 0-0, policy_version 304014 (0.00122) [2022-07-09 15:15:18,923][26022] Updated weights on worker 0-0, policy_version 304024 (0.00085) [2022-07-09 15:15:20,907][26022] Updated weights on worker 0-0, policy_version 304034 (0.00093) [2022-07-09 15:15:21,874][25689] Fps is (10 sec: 5664.9, 60 sec: 5655.8, 300 sec: 5633.0). Total num frames: 311335936. Throughput: 0: 4982.6. Samples: 311332336. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:21,875][25689] Avg episode reward: [(0, '-47.230')] [2022-07-09 15:15:22,425][26022] Updated weights on worker 0-0, policy_version 304044 (0.00084) [2022-07-09 15:15:24,526][26022] Updated weights on worker 0-0, policy_version 304054 (0.00093) [2022-07-09 15:15:26,160][26022] Updated weights on worker 0-0, policy_version 304064 (0.00085) [2022-07-09 15:15:26,879][25689] Fps is (10 sec: 5780.8, 60 sec: 5677.9, 300 sec: 5643.7). Total num frames: 311366656. Throughput: 0: 5957.7. Samples: 311366738. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:26,880][25689] Avg episode reward: [(0, '-46.763')] [2022-07-09 15:15:28,003][26022] Updated weights on worker 0-0, policy_version 304074 (0.00088) [2022-07-09 15:15:29,690][26022] Updated weights on worker 0-0, policy_version 304084 (0.00101) [2022-07-09 15:15:31,737][26022] Updated weights on worker 0-0, policy_version 304094 (0.00088) [2022-07-09 15:15:31,959][25689] Fps is (10 sec: 5686.4, 60 sec: 5674.2, 300 sec: 5639.4). Total num frames: 311393280. Throughput: 0: 5936.7. Samples: 311400896. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:31,959][25689] Avg episode reward: [(0, '-45.597')] [2022-07-09 15:15:33,291][26022] Updated weights on worker 0-0, policy_version 304104 (0.00088) [2022-07-09 15:15:35,236][26022] Updated weights on worker 0-0, policy_version 304114 (0.00088) [2022-07-09 15:15:36,943][26022] Updated weights on worker 0-0, policy_version 304124 (0.00095) [2022-07-09 15:15:37,047][25689] Fps is (10 sec: 5539.4, 60 sec: 5652.5, 300 sec: 5640.4). Total num frames: 311422976. Throughput: 0: 5075.6. Samples: 311418210. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:37,047][25689] Avg episode reward: [(0, '-46.329')] [2022-07-09 15:15:38,811][26022] Updated weights on worker 0-0, policy_version 304134 (0.00086) [2022-07-09 15:15:40,547][26022] Updated weights on worker 0-0, policy_version 304144 (0.00080) [2022-07-09 15:15:42,127][25689] Fps is (10 sec: 5740.4, 60 sec: 5668.1, 300 sec: 5643.1). Total num frames: 311451648. Throughput: 0: 5935.5. Samples: 311452544. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 15:15:42,127][25689] Avg episode reward: [(0, '-45.979')] [2022-07-09 15:15:42,390][26022] Updated weights on worker 0-0, policy_version 304154 (0.00439) [2022-07-09 15:15:44,049][26022] Updated weights on worker 0-0, policy_version 304164 (0.00092) [2022-07-09 15:15:45,899][26022] Updated weights on worker 0-0, policy_version 304174 (0.00089) [2022-07-09 15:15:47,130][25689] Fps is (10 sec: 5687.2, 60 sec: 5676.2, 300 sec: 5637.0). Total num frames: 311480320. Throughput: 0: 5937.9. Samples: 311486982. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:15:47,131][25689] Avg episode reward: [(0, '-45.260')] [2022-07-09 15:15:47,542][26022] Updated weights on worker 0-0, policy_version 304184 (0.00082) [2022-07-09 15:15:49,550][26022] Updated weights on worker 0-0, policy_version 304194 (0.00087) [2022-07-09 15:15:51,358][26022] Updated weights on worker 0-0, policy_version 304204 (0.00087) [2022-07-09 15:15:52,135][25689] Fps is (10 sec: 5730.0, 60 sec: 5662.4, 300 sec: 5644.5). Total num frames: 311508992. Throughput: 0: 5120.7. Samples: 311504210. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:15:52,135][25689] Avg episode reward: [(0, '-44.694')] [2022-07-09 15:15:53,065][26022] Updated weights on worker 0-0, policy_version 304214 (0.00096) [2022-07-09 15:15:54,962][26022] Updated weights on worker 0-0, policy_version 304224 (0.00098) [2022-07-09 15:15:56,868][26022] Updated weights on worker 0-0, policy_version 304234 (0.00091) [2022-07-09 15:15:57,147][25689] Fps is (10 sec: 5724.7, 60 sec: 5667.3, 300 sec: 5642.8). Total num frames: 311537664. Throughput: 0: 5949.3. Samples: 311537792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:15:57,148][25689] Avg episode reward: [(0, '-45.345')] [2022-07-09 15:15:58,662][26022] Updated weights on worker 0-0, policy_version 304244 (0.00088) [2022-07-09 15:16:00,326][26022] Updated weights on worker 0-0, policy_version 304254 (0.00093) [2022-07-09 15:16:02,270][25689] Fps is (10 sec: 5556.6, 60 sec: 5644.1, 300 sec: 5641.8). Total num frames: 311565312. Throughput: 0: 5931.9. Samples: 311572034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:02,271][25689] Avg episode reward: [(0, '-45.701')] [2022-07-09 15:16:02,534][26022] Updated weights on worker 0-0, policy_version 304264 (0.00091) [2022-07-09 15:16:04,222][26022] Updated weights on worker 0-0, policy_version 304274 (0.00087) [2022-07-09 15:16:06,091][26022] Updated weights on worker 0-0, policy_version 304284 (0.00093) [2022-07-09 15:16:07,350][25689] Fps is (10 sec: 5519.5, 60 sec: 5675.6, 300 sec: 5644.0). Total num frames: 311593984. Throughput: 0: 4938.5. Samples: 311586842. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:07,351][25689] Avg episode reward: [(0, '-46.038')] [2022-07-09 15:16:08,072][26022] Updated weights on worker 0-0, policy_version 304294 (0.00088) [2022-07-09 15:16:09,668][26022] Updated weights on worker 0-0, policy_version 304304 (0.00084) [2022-07-09 15:16:11,727][26022] Updated weights on worker 0-0, policy_version 304314 (0.00087) [2022-07-09 15:16:12,356][25689] Fps is (10 sec: 5482.6, 60 sec: 5643.4, 300 sec: 5637.3). Total num frames: 311620608. Throughput: 0: 5777.1. Samples: 311621032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:12,357][25689] Avg episode reward: [(0, '-45.782')] [2022-07-09 15:16:13,282][26022] Updated weights on worker 0-0, policy_version 304324 (0.00090) [2022-07-09 15:16:15,479][26022] Updated weights on worker 0-0, policy_version 304334 (0.00084) [2022-07-09 15:16:16,904][26022] Updated weights on worker 0-0, policy_version 304344 (0.00091) [2022-07-09 15:16:17,364][25689] Fps is (10 sec: 5624.5, 60 sec: 5645.6, 300 sec: 5642.2). Total num frames: 311650304. Throughput: 0: 5798.1. Samples: 311655012. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:17,364][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 15:16:18,905][26022] Updated weights on worker 0-0, policy_version 304354 (0.00083) [2022-07-09 15:16:20,600][26022] Updated weights on worker 0-0, policy_version 304364 (0.00088) [2022-07-09 15:16:22,410][25689] Fps is (10 sec: 5703.7, 60 sec: 5649.8, 300 sec: 5638.1). Total num frames: 311677952. Throughput: 0: 4958.3. Samples: 311671892. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:22,410][25689] Avg episode reward: [(0, '-46.087')] [2022-07-09 15:16:22,503][26022] Updated weights on worker 0-0, policy_version 304374 (0.00081) [2022-07-09 15:16:24,344][26022] Updated weights on worker 0-0, policy_version 304384 (0.00095) [2022-07-09 15:16:25,935][26022] Updated weights on worker 0-0, policy_version 304394 (0.00093) [2022-07-09 15:16:27,515][25689] Fps is (10 sec: 5548.2, 60 sec: 5606.8, 300 sec: 5632.9). Total num frames: 311706624. Throughput: 0: 5898.8. Samples: 311705788. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:27,515][25689] Avg episode reward: [(0, '-45.871')] [2022-07-09 15:16:28,002][26022] Updated weights on worker 0-0, policy_version 304404 (0.00092) [2022-07-09 15:16:29,761][26022] Updated weights on worker 0-0, policy_version 304414 (0.00093) [2022-07-09 15:16:31,648][26022] Updated weights on worker 0-0, policy_version 304424 (0.00090) [2022-07-09 15:16:32,524][25689] Fps is (10 sec: 5669.9, 60 sec: 5647.1, 300 sec: 5637.1). Total num frames: 311735296. Throughput: 0: 5899.7. Samples: 311740016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:32,525][25689] Avg episode reward: [(0, '-45.566')] [2022-07-09 15:16:33,367][26022] Updated weights on worker 0-0, policy_version 304434 (0.00086) [2022-07-09 15:16:35,052][26022] Updated weights on worker 0-0, policy_version 304444 (0.00091) [2022-07-09 15:16:36,871][26022] Updated weights on worker 0-0, policy_version 304454 (0.00096) [2022-07-09 15:16:37,553][25689] Fps is (10 sec: 5712.4, 60 sec: 5635.6, 300 sec: 5637.6). Total num frames: 311763968. Throughput: 0: 5055.1. Samples: 311757072. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:37,554][25689] Avg episode reward: [(0, '-45.802')] [2022-07-09 15:16:38,711][26022] Updated weights on worker 0-0, policy_version 304464 (0.00083) [2022-07-09 15:16:40,518][26022] Updated weights on worker 0-0, policy_version 304474 (0.00093) [2022-07-09 15:16:42,212][26022] Updated weights on worker 0-0, policy_version 304484 (0.00050) [2022-07-09 15:16:42,628][25689] Fps is (10 sec: 5776.6, 60 sec: 5653.1, 300 sec: 5643.2). Total num frames: 311793664. Throughput: 0: 5899.3. Samples: 311791164. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:42,633][25689] Avg episode reward: [(0, '-45.905')] [2022-07-09 15:16:44,075][26022] Updated weights on worker 0-0, policy_version 304494 (0.00087) [2022-07-09 15:16:45,866][26022] Updated weights on worker 0-0, policy_version 304504 (0.00099) [2022-07-09 15:16:47,645][25689] Fps is (10 sec: 5682.2, 60 sec: 5634.8, 300 sec: 5643.6). Total num frames: 311821312. Throughput: 0: 5943.9. Samples: 311825440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:47,646][25689] Avg episode reward: [(0, '-46.281')] [2022-07-09 15:16:47,704][26022] Updated weights on worker 0-0, policy_version 304514 (0.00091) [2022-07-09 15:16:49,701][26022] Updated weights on worker 0-0, policy_version 304524 (0.00088) [2022-07-09 15:16:51,308][26022] Updated weights on worker 0-0, policy_version 304534 (0.00085) [2022-07-09 15:16:52,664][25689] Fps is (10 sec: 5509.8, 60 sec: 5616.6, 300 sec: 5630.3). Total num frames: 311848960. Throughput: 0: 5079.0. Samples: 311842308. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:52,667][25689] Avg episode reward: [(0, '-46.494')] [2022-07-09 15:16:53,207][26022] Updated weights on worker 0-0, policy_version 304544 (0.00092) [2022-07-09 15:16:55,198][26022] Updated weights on worker 0-0, policy_version 304554 (0.00086) [2022-07-09 15:16:56,570][26022] Updated weights on worker 0-0, policy_version 304564 (0.00091) [2022-07-09 15:16:57,671][25689] Fps is (10 sec: 5617.2, 60 sec: 5617.0, 300 sec: 5637.9). Total num frames: 311877632. Throughput: 0: 5935.6. Samples: 311876486. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:16:57,672][25689] Avg episode reward: [(0, '-46.762')] [2022-07-09 15:16:58,828][26022] Updated weights on worker 0-0, policy_version 304574 (0.00086) [2022-07-09 15:17:00,448][26022] Updated weights on worker 0-0, policy_version 304584 (0.00083) [2022-07-09 15:17:02,539][26022] Updated weights on worker 0-0, policy_version 304594 (0.00101) [2022-07-09 15:17:02,773][25689] Fps is (10 sec: 5571.3, 60 sec: 5619.1, 300 sec: 5639.6). Total num frames: 311905280. Throughput: 0: 5904.0. Samples: 311910100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:02,775][25689] Avg episode reward: [(0, '-47.137')] [2022-07-09 15:17:04,684][26022] Updated weights on worker 0-0, policy_version 304604 (0.00090) [2022-07-09 15:17:05,905][26022] Updated weights on worker 0-0, policy_version 304614 (0.00099) [2022-07-09 15:17:07,822][25689] Fps is (10 sec: 5447.4, 60 sec: 5605.0, 300 sec: 5635.8). Total num frames: 311932928. Throughput: 0: 5807.9. Samples: 311942628. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:07,824][25689] Avg episode reward: [(0, '-47.107')] [2022-07-09 15:17:08,021][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:17:08,034][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000304623_311933952.pth [2022-07-09 15:17:08,034][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000302640_309903360.pth [2022-07-09 15:17:08,203][26022] Updated weights on worker 0-0, policy_version 304624 (0.00084) [2022-07-09 15:17:09,843][26022] Updated weights on worker 0-0, policy_version 304634 (0.00084) [2022-07-09 15:17:11,650][26022] Updated weights on worker 0-0, policy_version 304644 (0.00356) [2022-07-09 15:17:12,843][25689] Fps is (10 sec: 5592.8, 60 sec: 5637.5, 300 sec: 5642.8). Total num frames: 311961600. Throughput: 0: 5823.4. Samples: 311959818. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:12,845][25689] Avg episode reward: [(0, '-47.987')] [2022-07-09 15:17:13,365][26022] Updated weights on worker 0-0, policy_version 304654 (0.00085) [2022-07-09 15:17:15,166][26022] Updated weights on worker 0-0, policy_version 304664 (0.00087) [2022-07-09 15:17:17,097][26022] Updated weights on worker 0-0, policy_version 304674 (0.00089) [2022-07-09 15:17:17,878][25689] Fps is (10 sec: 5804.2, 60 sec: 5634.9, 300 sec: 5644.7). Total num frames: 311991296. Throughput: 0: 5817.0. Samples: 311994030. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:17,880][25689] Avg episode reward: [(0, '-47.341')] [2022-07-09 15:17:18,771][26022] Updated weights on worker 0-0, policy_version 304684 (0.00088) [2022-07-09 15:17:20,567][26022] Updated weights on worker 0-0, policy_version 304694 (0.00082) [2022-07-09 15:17:22,455][26022] Updated weights on worker 0-0, policy_version 304704 (0.00086) [2022-07-09 15:17:23,009][25689] Fps is (10 sec: 5640.6, 60 sec: 5627.0, 300 sec: 5642.7). Total num frames: 312018944. Throughput: 0: 5816.3. Samples: 312027800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:23,011][25689] Avg episode reward: [(0, '-47.571')] [2022-07-09 15:17:24,430][26022] Updated weights on worker 0-0, policy_version 304714 (0.00098) [2022-07-09 15:17:25,965][26022] Updated weights on worker 0-0, policy_version 304724 (0.00094) [2022-07-09 15:17:28,016][25689] Fps is (10 sec: 5555.3, 60 sec: 5636.1, 300 sec: 5642.6). Total num frames: 312047616. Throughput: 0: 5056.7. Samples: 312044744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:28,018][26022] Updated weights on worker 0-0, policy_version 304734 (0.00090) [2022-07-09 15:17:28,018][25689] Avg episode reward: [(0, '-47.533')] [2022-07-09 15:17:29,710][26022] Updated weights on worker 0-0, policy_version 304744 (0.00097) [2022-07-09 15:17:31,391][26022] Updated weights on worker 0-0, policy_version 304754 (0.00092) [2022-07-09 15:17:33,042][25689] Fps is (10 sec: 5613.8, 60 sec: 5617.6, 300 sec: 5642.5). Total num frames: 312075264. Throughput: 0: 5883.2. Samples: 312078652. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:33,042][25689] Avg episode reward: [(0, '-47.187')] [2022-07-09 15:17:33,302][26022] Updated weights on worker 0-0, policy_version 304764 (0.00104) [2022-07-09 15:17:35,156][26022] Updated weights on worker 0-0, policy_version 304774 (0.00095) [2022-07-09 15:17:36,902][26022] Updated weights on worker 0-0, policy_version 304784 (0.00093) [2022-07-09 15:17:38,057][25689] Fps is (10 sec: 5609.1, 60 sec: 5618.9, 300 sec: 5643.1). Total num frames: 312103936. Throughput: 0: 5892.3. Samples: 312112930. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:38,058][25689] Avg episode reward: [(0, '-47.330')] [2022-07-09 15:17:38,720][26022] Updated weights on worker 0-0, policy_version 304794 (0.00096) [2022-07-09 15:17:40,508][26022] Updated weights on worker 0-0, policy_version 304804 (0.00104) [2022-07-09 15:17:42,626][26022] Updated weights on worker 0-0, policy_version 304814 (0.00084) [2022-07-09 15:17:43,104][25689] Fps is (10 sec: 5800.7, 60 sec: 5621.5, 300 sec: 5639.9). Total num frames: 312133632. Throughput: 0: 5072.2. Samples: 312129724. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:43,106][25689] Avg episode reward: [(0, '-46.053')] [2022-07-09 15:17:43,994][26022] Updated weights on worker 0-0, policy_version 304824 (0.00091) [2022-07-09 15:17:46,027][26022] Updated weights on worker 0-0, policy_version 304834 (0.00084) [2022-07-09 15:17:47,803][26022] Updated weights on worker 0-0, policy_version 304844 (0.00086) [2022-07-09 15:17:48,155][25689] Fps is (10 sec: 5679.0, 60 sec: 5618.4, 300 sec: 5639.5). Total num frames: 312161280. Throughput: 0: 5921.8. Samples: 312164000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:48,155][25689] Avg episode reward: [(0, '-46.073')] [2022-07-09 15:17:49,558][26022] Updated weights on worker 0-0, policy_version 304854 (0.00056) [2022-07-09 15:17:51,542][26022] Updated weights on worker 0-0, policy_version 304864 (0.00096) [2022-07-09 15:17:53,013][26022] Updated weights on worker 0-0, policy_version 304874 (0.00090) [2022-07-09 15:17:53,164][25689] Fps is (10 sec: 5700.1, 60 sec: 5653.2, 300 sec: 5646.6). Total num frames: 312190976. Throughput: 0: 5929.6. Samples: 312197968. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:53,164][25689] Avg episode reward: [(0, '-45.881')] [2022-07-09 15:17:55,037][26022] Updated weights on worker 0-0, policy_version 304884 (0.00087) [2022-07-09 15:17:56,600][26022] Updated weights on worker 0-0, policy_version 304894 (0.00084) [2022-07-09 15:17:58,186][25689] Fps is (10 sec: 5716.3, 60 sec: 5634.8, 300 sec: 5640.5). Total num frames: 312218624. Throughput: 0: 5082.0. Samples: 312215226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:17:58,187][25689] Avg episode reward: [(0, '-45.822')] [2022-07-09 15:17:58,647][26022] Updated weights on worker 0-0, policy_version 304904 (0.00088) [2022-07-09 15:18:00,316][26022] Updated weights on worker 0-0, policy_version 304914 (0.00086) [2022-07-09 15:18:02,515][26022] Updated weights on worker 0-0, policy_version 304924 (0.00092) [2022-07-09 15:18:03,282][25689] Fps is (10 sec: 5465.2, 60 sec: 5635.4, 300 sec: 5642.4). Total num frames: 312246272. Throughput: 0: 5870.3. Samples: 312248174. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:18:03,282][25689] Avg episode reward: [(0, '-45.458')] [2022-07-09 15:18:04,400][26022] Updated weights on worker 0-0, policy_version 304934 (0.00091) [2022-07-09 15:18:06,071][26022] Updated weights on worker 0-0, policy_version 304944 (0.00085) [2022-07-09 15:18:08,101][26022] Updated weights on worker 0-0, policy_version 304954 (0.00086) [2022-07-09 15:18:08,308][25689] Fps is (10 sec: 5463.4, 60 sec: 5637.6, 300 sec: 5638.8). Total num frames: 312273920. Throughput: 0: 5827.9. Samples: 312281448. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 15:18:08,308][25689] Avg episode reward: [(0, '-44.738')] [2022-07-09 15:18:09,651][26022] Updated weights on worker 0-0, policy_version 304964 (0.00090) [2022-07-09 15:18:11,717][26022] Updated weights on worker 0-0, policy_version 304974 (0.00086) [2022-07-09 15:18:13,322][25689] Fps is (10 sec: 5711.6, 60 sec: 5655.1, 300 sec: 5645.7). Total num frames: 312303616. Throughput: 0: 4980.6. Samples: 312298366. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:13,325][25689] Avg episode reward: [(0, '-45.259')] [2022-07-09 15:18:13,329][26022] Updated weights on worker 0-0, policy_version 304984 (0.00087) [2022-07-09 15:18:15,290][26022] Updated weights on worker 0-0, policy_version 304994 (0.00080) [2022-07-09 15:18:16,950][26022] Updated weights on worker 0-0, policy_version 305004 (0.00099) [2022-07-09 15:18:18,380][25689] Fps is (10 sec: 5795.1, 60 sec: 5636.1, 300 sec: 5650.2). Total num frames: 312332288. Throughput: 0: 5807.1. Samples: 312332490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:18,380][25689] Avg episode reward: [(0, '-45.034')] [2022-07-09 15:18:18,802][26022] Updated weights on worker 0-0, policy_version 305014 (0.00092) [2022-07-09 15:18:20,627][26022] Updated weights on worker 0-0, policy_version 305024 (0.00091) [2022-07-09 15:18:22,300][26022] Updated weights on worker 0-0, policy_version 305034 (0.00085) [2022-07-09 15:18:23,507][25689] Fps is (10 sec: 5529.6, 60 sec: 5636.5, 300 sec: 5637.8). Total num frames: 312359936. Throughput: 0: 5854.8. Samples: 312366588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:23,509][25689] Avg episode reward: [(0, '-45.064')] [2022-07-09 15:18:24,263][26022] Updated weights on worker 0-0, policy_version 305044 (0.00087) [2022-07-09 15:18:26,115][26022] Updated weights on worker 0-0, policy_version 305054 (0.00095) [2022-07-09 15:18:27,996][26022] Updated weights on worker 0-0, policy_version 305064 (0.00082) [2022-07-09 15:18:28,524][25689] Fps is (10 sec: 5653.0, 60 sec: 5652.5, 300 sec: 5645.5). Total num frames: 312389632. Throughput: 0: 5042.8. Samples: 312383394. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:28,525][25689] Avg episode reward: [(0, '-45.324')] [2022-07-09 15:18:29,647][26022] Updated weights on worker 0-0, policy_version 305074 (0.00085) [2022-07-09 15:18:31,506][26022] Updated weights on worker 0-0, policy_version 305084 (0.00094) [2022-07-09 15:18:33,262][26022] Updated weights on worker 0-0, policy_version 305094 (0.00609) [2022-07-09 15:18:33,614][25689] Fps is (10 sec: 5673.7, 60 sec: 5646.4, 300 sec: 5640.8). Total num frames: 312417280. Throughput: 0: 5871.6. Samples: 312417514. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:33,614][25689] Avg episode reward: [(0, '-45.617')] [2022-07-09 15:18:35,006][26022] Updated weights on worker 0-0, policy_version 305104 (0.00092) [2022-07-09 15:18:36,941][26022] Updated weights on worker 0-0, policy_version 305114 (0.00094) [2022-07-09 15:18:38,657][25689] Fps is (10 sec: 5658.9, 60 sec: 5660.7, 300 sec: 5649.3). Total num frames: 312446976. Throughput: 0: 5880.6. Samples: 312451734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:38,658][25689] Avg episode reward: [(0, '-45.127')] [2022-07-09 15:18:38,667][26022] Updated weights on worker 0-0, policy_version 305124 (0.00397) [2022-07-09 15:18:40,459][26022] Updated weights on worker 0-0, policy_version 305134 (0.00089) [2022-07-09 15:18:42,175][26022] Updated weights on worker 0-0, policy_version 305144 (0.00090) [2022-07-09 15:18:43,764][25689] Fps is (10 sec: 5750.3, 60 sec: 5638.2, 300 sec: 5640.7). Total num frames: 312475648. Throughput: 0: 5049.0. Samples: 312468868. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:43,765][25689] Avg episode reward: [(0, '-45.044')] [2022-07-09 15:18:44,029][26022] Updated weights on worker 0-0, policy_version 305154 (0.00087) [2022-07-09 15:18:45,795][26022] Updated weights on worker 0-0, policy_version 305164 (0.00086) [2022-07-09 15:18:47,477][26022] Updated weights on worker 0-0, policy_version 305174 (0.00092) [2022-07-09 15:18:48,791][25689] Fps is (10 sec: 5759.8, 60 sec: 5674.3, 300 sec: 5647.2). Total num frames: 312505344. Throughput: 0: 5934.4. Samples: 312503666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:48,791][25689] Avg episode reward: [(0, '-45.729')] [2022-07-09 15:18:49,343][26022] Updated weights on worker 0-0, policy_version 305184 (0.00096) [2022-07-09 15:18:51,203][26022] Updated weights on worker 0-0, policy_version 305194 (0.00087) [2022-07-09 15:18:53,018][26022] Updated weights on worker 0-0, policy_version 305204 (0.00087) [2022-07-09 15:18:53,834][25689] Fps is (10 sec: 5694.8, 60 sec: 5637.3, 300 sec: 5650.6). Total num frames: 312532992. Throughput: 0: 5948.0. Samples: 312537782. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:53,834][25689] Avg episode reward: [(0, '-45.128')] [2022-07-09 15:18:54,861][26022] Updated weights on worker 0-0, policy_version 305214 (0.00084) [2022-07-09 15:18:56,567][26022] Updated weights on worker 0-0, policy_version 305224 (0.00067) [2022-07-09 15:18:58,465][26022] Updated weights on worker 0-0, policy_version 305234 (0.00090) [2022-07-09 15:18:58,865][25689] Fps is (10 sec: 5590.2, 60 sec: 5653.3, 300 sec: 5644.2). Total num frames: 312561664. Throughput: 0: 5090.1. Samples: 312554596. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:18:58,866][25689] Avg episode reward: [(0, '-45.522')] [2022-07-09 15:19:00,198][26022] Updated weights on worker 0-0, policy_version 305244 (0.00088) [2022-07-09 15:19:02,599][26022] Updated weights on worker 0-0, policy_version 305254 (0.00086) [2022-07-09 15:19:03,957][25689] Fps is (10 sec: 5664.5, 60 sec: 5670.6, 300 sec: 5650.0). Total num frames: 312590336. Throughput: 0: 5816.4. Samples: 312586318. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:03,958][25689] Avg episode reward: [(0, '-46.117')] [2022-07-09 15:19:03,969][26022] Updated weights on worker 0-0, policy_version 305264 (0.00086) [2022-07-09 15:19:06,238][26022] Updated weights on worker 0-0, policy_version 305274 (0.00084) [2022-07-09 15:19:07,654][26022] Updated weights on worker 0-0, policy_version 305284 (0.00085) [2022-07-09 15:19:08,170][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:19:08,181][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000305286_312612864.pth [2022-07-09 15:19:08,181][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000303299_310578176.pth [2022-07-09 15:19:09,032][25689] Fps is (10 sec: 5338.3, 60 sec: 5632.3, 300 sec: 5646.4). Total num frames: 312615936. Throughput: 0: 5774.7. Samples: 312620554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:09,033][25689] Avg episode reward: [(0, '-46.864')] [2022-07-09 15:19:09,574][26022] Updated weights on worker 0-0, policy_version 305294 (0.00057) [2022-07-09 15:19:11,292][26022] Updated weights on worker 0-0, policy_version 305304 (0.00093) [2022-07-09 15:19:13,259][26022] Updated weights on worker 0-0, policy_version 305314 (0.00086) [2022-07-09 15:19:14,064][25689] Fps is (10 sec: 5572.5, 60 sec: 5647.5, 300 sec: 5647.1). Total num frames: 312646656. Throughput: 0: 5785.9. Samples: 312654832. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:14,064][25689] Avg episode reward: [(0, '-47.070')] [2022-07-09 15:19:15,055][26022] Updated weights on worker 0-0, policy_version 305324 (0.00108) [2022-07-09 15:19:16,920][26022] Updated weights on worker 0-0, policy_version 305334 (0.00088) [2022-07-09 15:19:18,602][26022] Updated weights on worker 0-0, policy_version 305344 (0.00087) [2022-07-09 15:19:19,123][25689] Fps is (10 sec: 5987.0, 60 sec: 5664.2, 300 sec: 5651.0). Total num frames: 312676352. Throughput: 0: 5787.9. Samples: 312671846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:19,124][25689] Avg episode reward: [(0, '-47.548')] [2022-07-09 15:19:20,545][26022] Updated weights on worker 0-0, policy_version 305354 (0.00097) [2022-07-09 15:19:22,064][26022] Updated weights on worker 0-0, policy_version 305364 (0.00085) [2022-07-09 15:19:24,199][25689] Fps is (10 sec: 5556.8, 60 sec: 5652.1, 300 sec: 5640.4). Total num frames: 312702976. Throughput: 0: 5917.2. Samples: 312706094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:24,200][25689] Avg episode reward: [(0, '-47.737')] [2022-07-09 15:19:24,203][26022] Updated weights on worker 0-0, policy_version 305374 (0.00096) [2022-07-09 15:19:25,675][26022] Updated weights on worker 0-0, policy_version 305384 (0.00051) [2022-07-09 15:19:27,888][26022] Updated weights on worker 0-0, policy_version 305394 (0.00091) [2022-07-09 15:19:29,240][25689] Fps is (10 sec: 5465.6, 60 sec: 5633.0, 300 sec: 5647.3). Total num frames: 312731648. Throughput: 0: 5905.0. Samples: 312739884. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:29,241][25689] Avg episode reward: [(0, '-47.619')] [2022-07-09 15:19:29,394][26022] Updated weights on worker 0-0, policy_version 305404 (0.00092) [2022-07-09 15:19:31,399][26022] Updated weights on worker 0-0, policy_version 305414 (0.00085) [2022-07-09 15:19:32,996][26022] Updated weights on worker 0-0, policy_version 305424 (0.00089) [2022-07-09 15:19:34,258][25689] Fps is (10 sec: 5700.8, 60 sec: 5656.6, 300 sec: 5640.8). Total num frames: 312760320. Throughput: 0: 5064.3. Samples: 312757102. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:34,259][25689] Avg episode reward: [(0, '-46.499')] [2022-07-09 15:19:34,875][26022] Updated weights on worker 0-0, policy_version 305434 (0.00089) [2022-07-09 15:19:36,719][26022] Updated weights on worker 0-0, policy_version 305444 (0.00083) [2022-07-09 15:19:38,368][26022] Updated weights on worker 0-0, policy_version 305454 (0.00094) [2022-07-09 15:19:39,268][25689] Fps is (10 sec: 5718.7, 60 sec: 5642.8, 300 sec: 5645.3). Total num frames: 312788992. Throughput: 0: 5936.2. Samples: 312791428. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:39,268][25689] Avg episode reward: [(0, '-46.873')] [2022-07-09 15:19:40,231][26022] Updated weights on worker 0-0, policy_version 305464 (0.00094) [2022-07-09 15:19:42,166][26022] Updated weights on worker 0-0, policy_version 305474 (0.00092) [2022-07-09 15:19:43,775][26022] Updated weights on worker 0-0, policy_version 305484 (0.00094) [2022-07-09 15:19:44,365][25689] Fps is (10 sec: 5673.6, 60 sec: 5643.7, 300 sec: 5645.1). Total num frames: 312817664. Throughput: 0: 5909.0. Samples: 312825254. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:44,366][25689] Avg episode reward: [(0, '-46.501')] [2022-07-09 15:19:45,758][26022] Updated weights on worker 0-0, policy_version 305494 (0.00086) [2022-07-09 15:19:47,401][26022] Updated weights on worker 0-0, policy_version 305504 (0.00089) [2022-07-09 15:19:49,313][26022] Updated weights on worker 0-0, policy_version 305514 (0.00083) [2022-07-09 15:19:49,402][25689] Fps is (10 sec: 5658.4, 60 sec: 5625.9, 300 sec: 5641.7). Total num frames: 312846336. Throughput: 0: 5083.5. Samples: 312842376. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:49,403][25689] Avg episode reward: [(0, '-46.887')] [2022-07-09 15:19:51,063][26022] Updated weights on worker 0-0, policy_version 305524 (0.00096) [2022-07-09 15:19:52,899][26022] Updated weights on worker 0-0, policy_version 305534 (0.00084) [2022-07-09 15:19:54,419][25689] Fps is (10 sec: 5805.7, 60 sec: 5662.1, 300 sec: 5646.1). Total num frames: 312876032. Throughput: 0: 5939.6. Samples: 312876848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:54,419][25689] Avg episode reward: [(0, '-45.922')] [2022-07-09 15:19:54,552][26022] Updated weights on worker 0-0, policy_version 305544 (0.00095) [2022-07-09 15:19:56,695][26022] Updated weights on worker 0-0, policy_version 305554 (0.00083) [2022-07-09 15:19:58,211][26022] Updated weights on worker 0-0, policy_version 305564 (0.00084) [2022-07-09 15:19:59,456][25689] Fps is (10 sec: 5601.8, 60 sec: 5627.8, 300 sec: 5639.5). Total num frames: 312902656. Throughput: 0: 5915.3. Samples: 312910848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:19:59,457][25689] Avg episode reward: [(0, '-46.638')] [2022-07-09 15:20:00,218][26022] Updated weights on worker 0-0, policy_version 305574 (0.00093) [2022-07-09 15:20:01,665][26022] Updated weights on worker 0-0, policy_version 305584 (0.00087) [2022-07-09 15:20:04,139][26022] Updated weights on worker 0-0, policy_version 305594 (0.00086) [2022-07-09 15:20:04,555][25689] Fps is (10 sec: 5253.3, 60 sec: 5593.3, 300 sec: 5638.7). Total num frames: 312929280. Throughput: 0: 5066.5. Samples: 312927544. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:04,556][25689] Avg episode reward: [(0, '-46.149')] [2022-07-09 15:20:05,713][26022] Updated weights on worker 0-0, policy_version 305604 (0.00095) [2022-07-09 15:20:07,940][26022] Updated weights on worker 0-0, policy_version 305615 (0.00091) [2022-07-09 15:20:09,568][25689] Fps is (10 sec: 5569.5, 60 sec: 5666.7, 300 sec: 5642.4). Total num frames: 312958976. Throughput: 0: 5817.3. Samples: 312959688. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:09,569][25689] Avg episode reward: [(0, '-45.673')] [2022-07-09 15:20:09,608][26022] Updated weights on worker 0-0, policy_version 305625 (0.00088) [2022-07-09 15:20:11,515][26022] Updated weights on worker 0-0, policy_version 305635 (0.00091) [2022-07-09 15:20:13,152][26022] Updated weights on worker 0-0, policy_version 305645 (0.00092) [2022-07-09 15:20:14,587][25689] Fps is (10 sec: 5716.0, 60 sec: 5617.2, 300 sec: 5635.7). Total num frames: 312986624. Throughput: 0: 5799.4. Samples: 312993812. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:14,588][25689] Avg episode reward: [(0, '-45.262')] [2022-07-09 15:20:15,178][26022] Updated weights on worker 0-0, policy_version 305655 (0.00092) [2022-07-09 15:20:16,997][26022] Updated weights on worker 0-0, policy_version 305665 (0.00080) [2022-07-09 15:20:18,710][26022] Updated weights on worker 0-0, policy_version 305675 (0.00093) [2022-07-09 15:20:19,607][25689] Fps is (10 sec: 5814.1, 60 sec: 5637.7, 300 sec: 5647.4). Total num frames: 313017344. Throughput: 0: 4969.6. Samples: 313010992. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:19,608][25689] Avg episode reward: [(0, '-44.639')] [2022-07-09 15:20:20,457][26022] Updated weights on worker 0-0, policy_version 305685 (0.00094) [2022-07-09 15:20:22,248][26022] Updated weights on worker 0-0, policy_version 305695 (0.00089) [2022-07-09 15:20:24,272][26022] Updated weights on worker 0-0, policy_version 305705 (0.00086) [2022-07-09 15:20:24,676][25689] Fps is (10 sec: 5683.5, 60 sec: 5638.3, 300 sec: 5632.4). Total num frames: 313043968. Throughput: 0: 5835.2. Samples: 313044958. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:24,677][25689] Avg episode reward: [(0, '-45.870')] [2022-07-09 15:20:25,923][26022] Updated weights on worker 0-0, policy_version 305715 (0.00096) [2022-07-09 15:20:27,828][26022] Updated weights on worker 0-0, policy_version 305725 (0.00088) [2022-07-09 15:20:29,689][25689] Fps is (10 sec: 5382.9, 60 sec: 5624.0, 300 sec: 5637.1). Total num frames: 313071616. Throughput: 0: 5925.7. Samples: 313078920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:29,690][25689] Avg episode reward: [(0, '-45.610')] [2022-07-09 15:20:29,764][26022] Updated weights on worker 0-0, policy_version 305735 (0.00082) [2022-07-09 15:20:31,290][26022] Updated weights on worker 0-0, policy_version 305745 (0.00097) [2022-07-09 15:20:33,206][26022] Updated weights on worker 0-0, policy_version 305755 (0.00088) [2022-07-09 15:20:34,692][25689] Fps is (10 sec: 5827.6, 60 sec: 5659.3, 300 sec: 5642.1). Total num frames: 313102336. Throughput: 0: 5086.9. Samples: 313096086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-09 15:20:34,693][25689] Avg episode reward: [(0, '-46.340')] [2022-07-09 15:20:34,899][26022] Updated weights on worker 0-0, policy_version 305765 (0.00084) [2022-07-09 15:20:36,831][26022] Updated weights on worker 0-0, policy_version 305775 (0.00096) [2022-07-09 15:20:38,876][26022] Updated weights on worker 0-0, policy_version 305785 (0.00099) [2022-07-09 15:20:39,717][25689] Fps is (10 sec: 5820.8, 60 sec: 5641.0, 300 sec: 5639.8). Total num frames: 313129984. Throughput: 0: 5922.0. Samples: 313130082. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:20:39,717][25689] Avg episode reward: [(0, '-46.152')] [2022-07-09 15:20:40,265][26022] Updated weights on worker 0-0, policy_version 305795 (0.00090) [2022-07-09 15:20:42,332][26022] Updated weights on worker 0-0, policy_version 305805 (0.00086) [2022-07-09 15:20:43,861][26022] Updated weights on worker 0-0, policy_version 305815 (0.00087) [2022-07-09 15:20:44,764][25689] Fps is (10 sec: 5592.1, 60 sec: 5645.7, 300 sec: 5638.9). Total num frames: 313158656. Throughput: 0: 5942.6. Samples: 313164328. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:20:44,764][25689] Avg episode reward: [(0, '-46.924')] [2022-07-09 15:20:45,862][26022] Updated weights on worker 0-0, policy_version 305825 (0.00085) [2022-07-09 15:20:47,539][26022] Updated weights on worker 0-0, policy_version 305835 (0.00086) [2022-07-09 15:20:49,322][26022] Updated weights on worker 0-0, policy_version 305845 (0.00082) [2022-07-09 15:20:49,773][25689] Fps is (10 sec: 5600.6, 60 sec: 5631.3, 300 sec: 5635.4). Total num frames: 313186304. Throughput: 0: 5108.2. Samples: 313181510. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:20:49,773][25689] Avg episode reward: [(0, '-47.273')] [2022-07-09 15:20:51,144][26022] Updated weights on worker 0-0, policy_version 305855 (0.00080) [2022-07-09 15:20:53,128][26022] Updated weights on worker 0-0, policy_version 305865 (0.00093) [2022-07-09 15:20:54,796][25689] Fps is (10 sec: 5614.0, 60 sec: 5613.8, 300 sec: 5635.2). Total num frames: 313214976. Throughput: 0: 5943.8. Samples: 313215576. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:20:54,796][25689] Avg episode reward: [(0, '-47.572')] [2022-07-09 15:20:54,803][26022] Updated weights on worker 0-0, policy_version 305875 (0.00088) [2022-07-09 15:20:56,671][26022] Updated weights on worker 0-0, policy_version 305885 (0.00418) [2022-07-09 15:20:58,322][26022] Updated weights on worker 0-0, policy_version 305895 (0.00091) [2022-07-09 15:20:59,806][25689] Fps is (10 sec: 5715.3, 60 sec: 5650.2, 300 sec: 5640.8). Total num frames: 313243648. Throughput: 0: 5931.3. Samples: 313249238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:20:59,807][25689] Avg episode reward: [(0, '-46.371')] [2022-07-09 15:21:00,329][26022] Updated weights on worker 0-0, policy_version 305905 (0.00091) [2022-07-09 15:21:02,566][26022] Updated weights on worker 0-0, policy_version 305915 (0.00083) [2022-07-09 15:21:04,334][26022] Updated weights on worker 0-0, policy_version 305925 (0.00087) [2022-07-09 15:21:04,934][25689] Fps is (10 sec: 5352.9, 60 sec: 5630.5, 300 sec: 5629.6). Total num frames: 313269248. Throughput: 0: 4998.0. Samples: 313265142. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:04,935][25689] Avg episode reward: [(0, '-46.257')] [2022-07-09 15:21:06,028][26022] Updated weights on worker 0-0, policy_version 305935 (0.00086) [2022-07-09 15:21:07,963][26022] Updated weights on worker 0-0, policy_version 305945 (0.00090) [2022-07-09 15:21:08,232][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:21:08,246][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000305946_313288704.pth [2022-07-09 15:21:08,246][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000303962_311257088.pth [2022-07-09 15:21:09,648][26022] Updated weights on worker 0-0, policy_version 305955 (0.00092) [2022-07-09 15:21:09,960][25689] Fps is (10 sec: 5546.9, 60 sec: 5646.4, 300 sec: 5642.9). Total num frames: 313299968. Throughput: 0: 5796.5. Samples: 313298522. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:09,962][25689] Avg episode reward: [(0, '-47.288')] [2022-07-09 15:21:11,632][26022] Updated weights on worker 0-0, policy_version 305965 (0.00087) [2022-07-09 15:21:13,107][26022] Updated weights on worker 0-0, policy_version 305975 (0.00086) [2022-07-09 15:21:14,981][25689] Fps is (10 sec: 5707.6, 60 sec: 5629.1, 300 sec: 5632.4). Total num frames: 313326592. Throughput: 0: 5798.2. Samples: 313332618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:14,982][25689] Avg episode reward: [(0, '-47.530')] [2022-07-09 15:21:15,275][26022] Updated weights on worker 0-0, policy_version 305985 (0.00086) [2022-07-09 15:21:16,805][26022] Updated weights on worker 0-0, policy_version 305995 (0.00871) [2022-07-09 15:21:18,790][26022] Updated weights on worker 0-0, policy_version 306005 (0.00084) [2022-07-09 15:21:19,995][25689] Fps is (10 sec: 5611.9, 60 sec: 5612.8, 300 sec: 5639.8). Total num frames: 313356288. Throughput: 0: 4981.2. Samples: 313349804. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:19,996][25689] Avg episode reward: [(0, '-47.444')] [2022-07-09 15:21:20,565][26022] Updated weights on worker 0-0, policy_version 306015 (0.00088) [2022-07-09 15:21:22,259][26022] Updated weights on worker 0-0, policy_version 306025 (0.00087) [2022-07-09 15:21:24,023][26022] Updated weights on worker 0-0, policy_version 306035 (0.00090) [2022-07-09 15:21:25,103][25689] Fps is (10 sec: 5665.6, 60 sec: 5626.1, 300 sec: 5636.4). Total num frames: 313383936. Throughput: 0: 5890.7. Samples: 313383948. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:25,104][25689] Avg episode reward: [(0, '-46.570')] [2022-07-09 15:21:25,928][26022] Updated weights on worker 0-0, policy_version 306045 (0.00092) [2022-07-09 15:21:27,622][26022] Updated weights on worker 0-0, policy_version 306055 (0.00084) [2022-07-09 15:21:29,545][26022] Updated weights on worker 0-0, policy_version 306065 (0.00085) [2022-07-09 15:21:30,142][25689] Fps is (10 sec: 5752.4, 60 sec: 5674.5, 300 sec: 5642.7). Total num frames: 313414656. Throughput: 0: 5928.1. Samples: 313418166. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:30,143][25689] Avg episode reward: [(0, '-47.410')] [2022-07-09 15:21:31,418][26022] Updated weights on worker 0-0, policy_version 306075 (0.00093) [2022-07-09 15:21:33,079][26022] Updated weights on worker 0-0, policy_version 306085 (0.00085) [2022-07-09 15:21:35,004][26022] Updated weights on worker 0-0, policy_version 306095 (0.00091) [2022-07-09 15:21:35,171][25689] Fps is (10 sec: 5695.9, 60 sec: 5604.4, 300 sec: 5635.8). Total num frames: 313441280. Throughput: 0: 5072.5. Samples: 313435028. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:35,171][25689] Avg episode reward: [(0, '-46.325')] [2022-07-09 15:21:36,607][26022] Updated weights on worker 0-0, policy_version 306105 (0.00091) [2022-07-09 15:21:38,663][26022] Updated weights on worker 0-0, policy_version 306115 (0.00098) [2022-07-09 15:21:40,183][25689] Fps is (10 sec: 5608.8, 60 sec: 5639.3, 300 sec: 5637.0). Total num frames: 313470976. Throughput: 0: 5906.0. Samples: 313469034. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:40,184][25689] Avg episode reward: [(0, '-45.846')] [2022-07-09 15:21:40,414][26022] Updated weights on worker 0-0, policy_version 306125 (0.00093) [2022-07-09 15:21:42,122][26022] Updated weights on worker 0-0, policy_version 306135 (0.00089) [2022-07-09 15:21:44,016][26022] Updated weights on worker 0-0, policy_version 306145 (0.00091) [2022-07-09 15:21:45,223][25689] Fps is (10 sec: 5908.1, 60 sec: 5656.9, 300 sec: 5643.4). Total num frames: 313500672. Throughput: 0: 5918.1. Samples: 313503022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:45,224][25689] Avg episode reward: [(0, '-45.782')] [2022-07-09 15:21:45,633][26022] Updated weights on worker 0-0, policy_version 306155 (0.00086) [2022-07-09 15:21:47,752][26022] Updated weights on worker 0-0, policy_version 306165 (0.00086) [2022-07-09 15:21:49,344][26022] Updated weights on worker 0-0, policy_version 306175 (0.00091) [2022-07-09 15:21:50,324][25689] Fps is (10 sec: 5553.8, 60 sec: 5631.4, 300 sec: 5638.4). Total num frames: 313527296. Throughput: 0: 5893.7. Samples: 313537114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:50,325][25689] Avg episode reward: [(0, '-46.422')] [2022-07-09 15:21:51,077][26022] Updated weights on worker 0-0, policy_version 306185 (0.00090) [2022-07-09 15:21:53,104][26022] Updated weights on worker 0-0, policy_version 306195 (0.00092) [2022-07-09 15:21:54,638][26022] Updated weights on worker 0-0, policy_version 306205 (0.00085) [2022-07-09 15:21:55,331][25689] Fps is (10 sec: 5470.5, 60 sec: 5632.9, 300 sec: 5638.4). Total num frames: 313555968. Throughput: 0: 5905.5. Samples: 313554086. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:21:55,332][25689] Avg episode reward: [(0, '-46.322')] [2022-07-09 15:21:56,707][26022] Updated weights on worker 0-0, policy_version 306215 (0.00080) [2022-07-09 15:21:58,496][26022] Updated weights on worker 0-0, policy_version 306225 (0.00090) [2022-07-09 15:22:00,167][26022] Updated weights on worker 0-0, policy_version 306235 (0.00086) [2022-07-09 15:22:00,402][25689] Fps is (10 sec: 5791.5, 60 sec: 5644.2, 300 sec: 5645.9). Total num frames: 313585664. Throughput: 0: 5885.4. Samples: 313588030. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:00,404][25689] Avg episode reward: [(0, '-45.780')] [2022-07-09 15:22:02,569][26022] Updated weights on worker 0-0, policy_version 306245 (0.00092) [2022-07-09 15:22:04,252][26022] Updated weights on worker 0-0, policy_version 306255 (0.00090) [2022-07-09 15:22:05,534][25689] Fps is (10 sec: 5419.7, 60 sec: 5643.9, 300 sec: 5637.5). Total num frames: 313611264. Throughput: 0: 5740.6. Samples: 313619618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:05,534][25689] Avg episode reward: [(0, '-45.792')] [2022-07-09 15:22:06,250][26022] Updated weights on worker 0-0, policy_version 306265 (0.00094) [2022-07-09 15:22:08,008][26022] Updated weights on worker 0-0, policy_version 306275 (0.00090) [2022-07-09 15:22:09,730][26022] Updated weights on worker 0-0, policy_version 306285 (0.00091) [2022-07-09 15:22:10,594][25689] Fps is (10 sec: 5325.4, 60 sec: 5606.9, 300 sec: 5636.7). Total num frames: 313639936. Throughput: 0: 4907.2. Samples: 313636580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:10,595][25689] Avg episode reward: [(0, '-45.753')] [2022-07-09 15:22:11,672][26022] Updated weights on worker 0-0, policy_version 306295 (0.00092) [2022-07-09 15:22:13,262][26022] Updated weights on worker 0-0, policy_version 306305 (0.00081) [2022-07-09 15:22:15,322][26022] Updated weights on worker 0-0, policy_version 306315 (0.00610) [2022-07-09 15:22:15,611][25689] Fps is (10 sec: 5690.6, 60 sec: 5641.1, 300 sec: 5633.6). Total num frames: 313668608. Throughput: 0: 5748.8. Samples: 313670670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:15,611][25689] Avg episode reward: [(0, '-45.911')] [2022-07-09 15:22:16,966][26022] Updated weights on worker 0-0, policy_version 306325 (0.00091) [2022-07-09 15:22:18,902][26022] Updated weights on worker 0-0, policy_version 306335 (0.00088) [2022-07-09 15:22:20,349][26022] Updated weights on worker 0-0, policy_version 306345 (0.00084) [2022-07-09 15:22:20,630][25689] Fps is (10 sec: 5815.5, 60 sec: 5640.6, 300 sec: 5642.6). Total num frames: 313698304. Throughput: 0: 5792.4. Samples: 313705198. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:20,632][25689] Avg episode reward: [(0, '-45.459')] [2022-07-09 15:22:22,311][26022] Updated weights on worker 0-0, policy_version 306355 (0.00086) [2022-07-09 15:22:24,131][26022] Updated weights on worker 0-0, policy_version 306365 (0.00084) [2022-07-09 15:22:25,707][25689] Fps is (10 sec: 5680.0, 60 sec: 5643.5, 300 sec: 5637.9). Total num frames: 313725952. Throughput: 0: 5092.1. Samples: 313722340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:25,707][25689] Avg episode reward: [(0, '-45.955')] [2022-07-09 15:22:25,983][26022] Updated weights on worker 0-0, policy_version 306375 (0.00100) [2022-07-09 15:22:27,811][26022] Updated weights on worker 0-0, policy_version 306385 (0.00085) [2022-07-09 15:22:29,612][26022] Updated weights on worker 0-0, policy_version 306395 (0.00090) [2022-07-09 15:22:30,733][25689] Fps is (10 sec: 5473.5, 60 sec: 5594.0, 300 sec: 5637.8). Total num frames: 313753600. Throughput: 0: 5940.8. Samples: 313756222. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:30,733][25689] Avg episode reward: [(0, '-46.816')] [2022-07-09 15:22:31,352][26022] Updated weights on worker 0-0, policy_version 306405 (0.00099) [2022-07-09 15:22:33,134][26022] Updated weights on worker 0-0, policy_version 306415 (0.00078) [2022-07-09 15:22:34,947][26022] Updated weights on worker 0-0, policy_version 306425 (0.00088) [2022-07-09 15:22:35,737][25689] Fps is (10 sec: 5615.2, 60 sec: 5630.1, 300 sec: 5638.1). Total num frames: 313782272. Throughput: 0: 5937.0. Samples: 313790156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:35,737][25689] Avg episode reward: [(0, '-46.879')] [2022-07-09 15:22:36,944][26022] Updated weights on worker 0-0, policy_version 306435 (0.00087) [2022-07-09 15:22:38,646][26022] Updated weights on worker 0-0, policy_version 306445 (0.00088) [2022-07-09 15:22:40,624][26022] Updated weights on worker 0-0, policy_version 306455 (0.00088) [2022-07-09 15:22:40,772][25689] Fps is (10 sec: 5814.0, 60 sec: 5628.0, 300 sec: 5638.3). Total num frames: 313811968. Throughput: 0: 5062.5. Samples: 313807166. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:40,772][25689] Avg episode reward: [(0, '-47.235')] [2022-07-09 15:22:42,275][26022] Updated weights on worker 0-0, policy_version 306465 (0.00085) [2022-07-09 15:22:43,987][26022] Updated weights on worker 0-0, policy_version 306475 (0.00094) [2022-07-09 15:22:45,865][25689] Fps is (10 sec: 5661.7, 60 sec: 5589.3, 300 sec: 5637.5). Total num frames: 313839616. Throughput: 0: 5903.9. Samples: 313841352. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:45,865][25689] Avg episode reward: [(0, '-47.181')] [2022-07-09 15:22:45,922][26022] Updated weights on worker 0-0, policy_version 306485 (0.00089) [2022-07-09 15:22:47,526][26022] Updated weights on worker 0-0, policy_version 306495 (0.00090) [2022-07-09 15:22:49,468][26022] Updated weights on worker 0-0, policy_version 306505 (0.00094) [2022-07-09 15:22:50,878][25689] Fps is (10 sec: 5674.1, 60 sec: 5648.1, 300 sec: 5637.4). Total num frames: 313869312. Throughput: 0: 5914.2. Samples: 313875366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:50,879][25689] Avg episode reward: [(0, '-46.437')] [2022-07-09 15:22:51,126][26022] Updated weights on worker 0-0, policy_version 306515 (0.00087) [2022-07-09 15:22:53,150][26022] Updated weights on worker 0-0, policy_version 306525 (0.00082) [2022-07-09 15:22:54,881][26022] Updated weights on worker 0-0, policy_version 306535 (0.00085) [2022-07-09 15:22:55,883][25689] Fps is (10 sec: 5724.3, 60 sec: 5631.4, 300 sec: 5637.7). Total num frames: 313896960. Throughput: 0: 5076.6. Samples: 313892430. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:22:55,883][25689] Avg episode reward: [(0, '-46.523')] [2022-07-09 15:22:56,721][26022] Updated weights on worker 0-0, policy_version 306545 (0.00103) [2022-07-09 15:22:58,497][26022] Updated weights on worker 0-0, policy_version 306555 (0.00086) [2022-07-09 15:23:00,273][26022] Updated weights on worker 0-0, policy_version 306565 (0.00090) [2022-07-09 15:23:00,885][25689] Fps is (10 sec: 5525.6, 60 sec: 5604.0, 300 sec: 5639.5). Total num frames: 313924608. Throughput: 0: 5944.0. Samples: 313926720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 15:23:00,886][25689] Avg episode reward: [(0, '-45.189')] [2022-07-09 15:23:02,585][26022] Updated weights on worker 0-0, policy_version 306575 (0.00082) [2022-07-09 15:23:04,249][26022] Updated weights on worker 0-0, policy_version 306585 (0.00091) [2022-07-09 15:23:05,974][25689] Fps is (10 sec: 5479.4, 60 sec: 5641.8, 300 sec: 5638.3). Total num frames: 313952256. Throughput: 0: 5806.6. Samples: 313958118. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:05,975][25689] Avg episode reward: [(0, '-45.039')] [2022-07-09 15:23:06,164][26022] Updated weights on worker 0-0, policy_version 306595 (0.00093) [2022-07-09 15:23:08,083][26022] Updated weights on worker 0-0, policy_version 306605 (0.00087) [2022-07-09 15:23:08,417][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:23:08,440][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000306607_313965568.pth [2022-07-09 15:23:08,441][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000304623_311933952.pth [2022-07-09 15:23:09,631][26022] Updated weights on worker 0-0, policy_version 306615 (0.00061) [2022-07-09 15:23:10,982][25689] Fps is (10 sec: 5578.0, 60 sec: 5646.6, 300 sec: 5635.0). Total num frames: 313980928. Throughput: 0: 4957.4. Samples: 313975030. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:10,984][25689] Avg episode reward: [(0, '-44.132')] [2022-07-09 15:23:11,576][26022] Updated weights on worker 0-0, policy_version 306625 (0.00055) [2022-07-09 15:23:13,249][26022] Updated weights on worker 0-0, policy_version 306635 (0.00089) [2022-07-09 15:23:15,243][26022] Updated weights on worker 0-0, policy_version 306645 (0.00079) [2022-07-09 15:23:16,076][25689] Fps is (10 sec: 5676.5, 60 sec: 5639.4, 300 sec: 5634.3). Total num frames: 314009600. Throughput: 0: 5792.6. Samples: 314009404. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:16,078][25689] Avg episode reward: [(0, '-44.116')] [2022-07-09 15:23:17,021][26022] Updated weights on worker 0-0, policy_version 306655 (0.00081) [2022-07-09 15:23:18,674][26022] Updated weights on worker 0-0, policy_version 306665 (0.00087) [2022-07-09 15:23:20,698][26022] Updated weights on worker 0-0, policy_version 306675 (0.00087) [2022-07-09 15:23:21,118][25689] Fps is (10 sec: 5556.4, 60 sec: 5603.5, 300 sec: 5635.9). Total num frames: 314037248. Throughput: 0: 5788.8. Samples: 314043844. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:21,120][25689] Avg episode reward: [(0, '-44.324')] [2022-07-09 15:23:22,081][26022] Updated weights on worker 0-0, policy_version 306685 (0.00086) [2022-07-09 15:23:24,135][26022] Updated weights on worker 0-0, policy_version 306695 (0.00093) [2022-07-09 15:23:25,811][26022] Updated weights on worker 0-0, policy_version 306705 (0.00088) [2022-07-09 15:23:26,181][25689] Fps is (10 sec: 5776.5, 60 sec: 5655.6, 300 sec: 5638.5). Total num frames: 314067968. Throughput: 0: 5091.6. Samples: 314061000. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:26,181][25689] Avg episode reward: [(0, '-44.661')] [2022-07-09 15:23:27,694][26022] Updated weights on worker 0-0, policy_version 306715 (0.00475) [2022-07-09 15:23:29,293][26022] Updated weights on worker 0-0, policy_version 306725 (0.00097) [2022-07-09 15:23:31,273][25689] Fps is (10 sec: 5647.2, 60 sec: 5632.5, 300 sec: 5635.0). Total num frames: 314094592. Throughput: 0: 5931.9. Samples: 314095392. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:31,273][25689] Avg episode reward: [(0, '-45.704')] [2022-07-09 15:23:31,537][26022] Updated weights on worker 0-0, policy_version 306735 (0.00088) [2022-07-09 15:23:32,863][26022] Updated weights on worker 0-0, policy_version 306745 (0.00085) [2022-07-09 15:23:34,925][26022] Updated weights on worker 0-0, policy_version 306755 (0.00084) [2022-07-09 15:23:36,359][25689] Fps is (10 sec: 5633.8, 60 sec: 5658.6, 300 sec: 5637.6). Total num frames: 314125312. Throughput: 0: 5917.8. Samples: 314129434. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:36,361][25689] Avg episode reward: [(0, '-46.445')] [2022-07-09 15:23:36,431][26022] Updated weights on worker 0-0, policy_version 306765 (0.00089) [2022-07-09 15:23:38,581][26022] Updated weights on worker 0-0, policy_version 306775 (0.00106) [2022-07-09 15:23:40,103][26022] Updated weights on worker 0-0, policy_version 306785 (0.00093) [2022-07-09 15:23:41,369][25689] Fps is (10 sec: 5781.2, 60 sec: 5627.2, 300 sec: 5636.0). Total num frames: 314152960. Throughput: 0: 5065.6. Samples: 314146424. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:41,369][25689] Avg episode reward: [(0, '-46.799')] [2022-07-09 15:23:42,183][26022] Updated weights on worker 0-0, policy_version 306795 (0.00091) [2022-07-09 15:23:43,917][26022] Updated weights on worker 0-0, policy_version 306805 (0.00079) [2022-07-09 15:23:45,705][26022] Updated weights on worker 0-0, policy_version 306815 (0.00085) [2022-07-09 15:23:46,421][25689] Fps is (10 sec: 5699.2, 60 sec: 5664.8, 300 sec: 5635.6). Total num frames: 314182656. Throughput: 0: 5894.6. Samples: 314180308. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:46,421][25689] Avg episode reward: [(0, '-47.236')] [2022-07-09 15:23:47,372][26022] Updated weights on worker 0-0, policy_version 306825 (0.00091) [2022-07-09 15:23:49,297][26022] Updated weights on worker 0-0, policy_version 306835 (0.00086) [2022-07-09 15:23:51,013][26022] Updated weights on worker 0-0, policy_version 306845 (0.00090) [2022-07-09 15:23:51,429][25689] Fps is (10 sec: 5700.1, 60 sec: 5631.5, 300 sec: 5636.2). Total num frames: 314210304. Throughput: 0: 5910.7. Samples: 314214530. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:51,429][25689] Avg episode reward: [(0, '-47.531')] [2022-07-09 15:23:52,972][26022] Updated weights on worker 0-0, policy_version 306855 (0.00093) [2022-07-09 15:23:54,871][26022] Updated weights on worker 0-0, policy_version 306865 (0.00085) [2022-07-09 15:23:56,449][25689] Fps is (10 sec: 5616.4, 60 sec: 5647.0, 300 sec: 5636.4). Total num frames: 314238976. Throughput: 0: 5916.5. Samples: 314248294. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:23:56,449][25689] Avg episode reward: [(0, '-47.601')] [2022-07-09 15:23:56,457][26022] Updated weights on worker 0-0, policy_version 306875 (0.00079) [2022-07-09 15:23:58,521][26022] Updated weights on worker 0-0, policy_version 306885 (0.00086) [2022-07-09 15:24:00,126][26022] Updated weights on worker 0-0, policy_version 306895 (0.00113) [2022-07-09 15:24:01,471][25689] Fps is (10 sec: 5608.6, 60 sec: 5645.2, 300 sec: 5634.3). Total num frames: 314266624. Throughput: 0: 5922.3. Samples: 314265474. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:01,471][25689] Avg episode reward: [(0, '-48.121')] [2022-07-09 15:24:02,422][26022] Updated weights on worker 0-0, policy_version 306905 (0.00086) [2022-07-09 15:24:04,418][26022] Updated weights on worker 0-0, policy_version 306915 (0.00086) [2022-07-09 15:24:05,948][26022] Updated weights on worker 0-0, policy_version 306925 (0.00087) [2022-07-09 15:24:06,611][25689] Fps is (10 sec: 5340.3, 60 sec: 5623.5, 300 sec: 5636.5). Total num frames: 314293248. Throughput: 0: 5793.3. Samples: 314297278. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:06,612][25689] Avg episode reward: [(0, '-47.585')] [2022-07-09 15:24:07,923][26022] Updated weights on worker 0-0, policy_version 306935 (0.00090) [2022-07-09 15:24:09,569][26022] Updated weights on worker 0-0, policy_version 306945 (0.00088) [2022-07-09 15:24:11,325][26022] Updated weights on worker 0-0, policy_version 306955 (0.00086) [2022-07-09 15:24:11,646][25689] Fps is (10 sec: 5635.5, 60 sec: 5654.8, 300 sec: 5636.5). Total num frames: 314323968. Throughput: 0: 5786.8. Samples: 314331522. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:11,647][25689] Avg episode reward: [(0, '-47.198')] [2022-07-09 15:24:13,454][26022] Updated weights on worker 0-0, policy_version 306965 (0.00090) [2022-07-09 15:24:14,891][26022] Updated weights on worker 0-0, policy_version 306975 (0.00089) [2022-07-09 15:24:16,675][25689] Fps is (10 sec: 5800.1, 60 sec: 5644.0, 300 sec: 5630.2). Total num frames: 314351616. Throughput: 0: 4960.3. Samples: 314348622. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:16,675][25689] Avg episode reward: [(0, '-47.449')] [2022-07-09 15:24:16,804][26022] Updated weights on worker 0-0, policy_version 306985 (0.00087) [2022-07-09 15:24:18,488][26022] Updated weights on worker 0-0, policy_version 306995 (0.00088) [2022-07-09 15:24:20,372][26022] Updated weights on worker 0-0, policy_version 307005 (0.00091) [2022-07-09 15:24:21,682][25689] Fps is (10 sec: 5612.0, 60 sec: 5664.1, 300 sec: 5638.4). Total num frames: 314380288. Throughput: 0: 5817.5. Samples: 314383052. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:21,682][25689] Avg episode reward: [(0, '-47.351')] [2022-07-09 15:24:22,251][26022] Updated weights on worker 0-0, policy_version 307015 (0.00082) [2022-07-09 15:24:24,095][26022] Updated weights on worker 0-0, policy_version 307025 (0.00089) [2022-07-09 15:24:25,660][26022] Updated weights on worker 0-0, policy_version 307035 (0.00090) [2022-07-09 15:24:26,791][25689] Fps is (10 sec: 5668.6, 60 sec: 5626.0, 300 sec: 5637.1). Total num frames: 314408960. Throughput: 0: 5944.5. Samples: 314417236. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:26,791][25689] Avg episode reward: [(0, '-46.353')] [2022-07-09 15:24:27,639][26022] Updated weights on worker 0-0, policy_version 307045 (0.00090) [2022-07-09 15:24:29,350][26022] Updated weights on worker 0-0, policy_version 307055 (0.00095) [2022-07-09 15:24:31,191][26022] Updated weights on worker 0-0, policy_version 307065 (0.00092) [2022-07-09 15:24:31,807][25689] Fps is (10 sec: 5663.3, 60 sec: 5666.8, 300 sec: 5637.1). Total num frames: 314437632. Throughput: 0: 5098.4. Samples: 314434312. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:31,808][25689] Avg episode reward: [(0, '-46.444')] [2022-07-09 15:24:33,004][26022] Updated weights on worker 0-0, policy_version 307075 (0.00091) [2022-07-09 15:24:34,776][26022] Updated weights on worker 0-0, policy_version 307085 (0.00095) [2022-07-09 15:24:36,735][26022] Updated weights on worker 0-0, policy_version 307095 (0.00085) [2022-07-09 15:24:36,830][25689] Fps is (10 sec: 5610.3, 60 sec: 5622.1, 300 sec: 5633.4). Total num frames: 314465280. Throughput: 0: 5930.2. Samples: 314468144. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:36,830][25689] Avg episode reward: [(0, '-46.592')] [2022-07-09 15:24:38,630][26022] Updated weights on worker 0-0, policy_version 307106 (0.00099) [2022-07-09 15:24:40,347][26022] Updated weights on worker 0-0, policy_version 307116 (0.00090) [2022-07-09 15:24:41,875][25689] Fps is (10 sec: 5594.3, 60 sec: 5635.7, 300 sec: 5634.4). Total num frames: 314493952. Throughput: 0: 5897.5. Samples: 314502140. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:41,875][25689] Avg episode reward: [(0, '-45.859')] [2022-07-09 15:24:42,470][26022] Updated weights on worker 0-0, policy_version 307126 (0.00098) [2022-07-09 15:24:43,909][26022] Updated weights on worker 0-0, policy_version 307136 (0.00090) [2022-07-09 15:24:45,839][26022] Updated weights on worker 0-0, policy_version 307146 (0.00086) [2022-07-09 15:24:46,924][25689] Fps is (10 sec: 5883.6, 60 sec: 5652.9, 300 sec: 5641.0). Total num frames: 314524672. Throughput: 0: 5063.7. Samples: 314519184. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:46,925][25689] Avg episode reward: [(0, '-45.825')] [2022-07-09 15:24:47,756][26022] Updated weights on worker 0-0, policy_version 307156 (0.00087) [2022-07-09 15:24:49,575][26022] Updated weights on worker 0-0, policy_version 307166 (0.00091) [2022-07-09 15:24:51,254][26022] Updated weights on worker 0-0, policy_version 307176 (0.00092) [2022-07-09 15:24:52,007][25689] Fps is (10 sec: 5558.7, 60 sec: 5612.1, 300 sec: 5626.0). Total num frames: 314550272. Throughput: 0: 5894.2. Samples: 314553370. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:52,007][25689] Avg episode reward: [(0, '-45.064')] [2022-07-09 15:24:53,115][26022] Updated weights on worker 0-0, policy_version 307186 (0.00078) [2022-07-09 15:24:54,845][26022] Updated weights on worker 0-0, policy_version 307196 (0.00092) [2022-07-09 15:24:56,780][26022] Updated weights on worker 0-0, policy_version 307206 (0.00092) [2022-07-09 15:24:57,027][25689] Fps is (10 sec: 5574.3, 60 sec: 5645.8, 300 sec: 5640.1). Total num frames: 314580992. Throughput: 0: 5913.2. Samples: 314587578. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:24:57,028][25689] Avg episode reward: [(0, '-45.782')] [2022-07-09 15:24:58,406][26022] Updated weights on worker 0-0, policy_version 307216 (0.00094) [2022-07-09 15:25:00,319][26022] Updated weights on worker 0-0, policy_version 307226 (0.00087) [2022-07-09 15:25:02,094][25689] Fps is (10 sec: 5684.8, 60 sec: 5624.8, 300 sec: 5640.7). Total num frames: 314607616. Throughput: 0: 5075.7. Samples: 314604766. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:02,094][25689] Avg episode reward: [(0, '-45.333')] [2022-07-09 15:25:02,377][26022] Updated weights on worker 0-0, policy_version 307236 (0.00091) [2022-07-09 15:25:04,364][26022] Updated weights on worker 0-0, policy_version 307246 (0.00088) [2022-07-09 15:25:06,044][26022] Updated weights on worker 0-0, policy_version 307256 (0.00092) [2022-07-09 15:25:07,210][25689] Fps is (10 sec: 5329.9, 60 sec: 5644.0, 300 sec: 5631.9). Total num frames: 314635264. Throughput: 0: 5797.6. Samples: 314636794. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:07,211][25689] Avg episode reward: [(0, '-44.796')] [2022-07-09 15:25:07,958][26022] Updated weights on worker 0-0, policy_version 307266 (0.00089) [2022-07-09 15:25:08,518][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:25:08,532][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000307270_314644480.pth [2022-07-09 15:25:08,533][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000305286_312612864.pth [2022-07-09 15:25:09,639][26022] Updated weights on worker 0-0, policy_version 307276 (0.00085) [2022-07-09 15:25:11,491][26022] Updated weights on worker 0-0, policy_version 307286 (0.00087) [2022-07-09 15:25:12,227][25689] Fps is (10 sec: 5659.2, 60 sec: 5628.7, 300 sec: 5638.8). Total num frames: 314664960. Throughput: 0: 5814.2. Samples: 314670934. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:12,227][25689] Avg episode reward: [(0, '-44.981')] [2022-07-09 15:25:13,215][26022] Updated weights on worker 0-0, policy_version 307296 (0.00095) [2022-07-09 15:25:15,153][26022] Updated weights on worker 0-0, policy_version 307306 (0.00088) [2022-07-09 15:25:16,736][26022] Updated weights on worker 0-0, policy_version 307316 (0.00087) [2022-07-09 15:25:17,275][25689] Fps is (10 sec: 5799.3, 60 sec: 5643.8, 300 sec: 5631.4). Total num frames: 314693632. Throughput: 0: 4967.1. Samples: 314688148. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:17,275][25689] Avg episode reward: [(0, '-46.444')] [2022-07-09 15:25:18,651][26022] Updated weights on worker 0-0, policy_version 307326 (0.00094) [2022-07-09 15:25:20,322][26022] Updated weights on worker 0-0, policy_version 307336 (0.00084) [2022-07-09 15:25:22,277][26022] Updated weights on worker 0-0, policy_version 307346 (0.00084) [2022-07-09 15:25:22,375][25689] Fps is (10 sec: 5650.7, 60 sec: 5635.2, 300 sec: 5637.7). Total num frames: 314722304. Throughput: 0: 5795.9. Samples: 314722312. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:22,375][25689] Avg episode reward: [(0, '-47.370')] [2022-07-09 15:25:23,914][26022] Updated weights on worker 0-0, policy_version 307356 (0.00090) [2022-07-09 15:25:26,049][26022] Updated weights on worker 0-0, policy_version 307366 (0.00084) [2022-07-09 15:25:27,445][25689] Fps is (10 sec: 5739.0, 60 sec: 5655.7, 300 sec: 5643.5). Total num frames: 314752000. Throughput: 0: 5903.7. Samples: 314756256. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:27,445][25689] Avg episode reward: [(0, '-47.629')] [2022-07-09 15:25:27,534][26022] Updated weights on worker 0-0, policy_version 307376 (0.00086) [2022-07-09 15:25:29,586][26022] Updated weights on worker 0-0, policy_version 307386 (0.00086) [2022-07-09 15:25:31,298][26022] Updated weights on worker 0-0, policy_version 307396 (0.00083) [2022-07-09 15:25:32,460][25689] Fps is (10 sec: 5787.6, 60 sec: 5655.8, 300 sec: 5636.4). Total num frames: 314780672. Throughput: 0: 5057.1. Samples: 314773254. Policy #0 lag: (min: 0.0, avg: 7.6, max: 19.0) [2022-07-09 15:25:32,460][25689] Avg episode reward: [(0, '-47.010')] [2022-07-09 15:25:33,159][26022] Updated weights on worker 0-0, policy_version 307406 (0.00100) [2022-07-09 15:25:34,935][26022] Updated weights on worker 0-0, policy_version 307416 (0.00088) [2022-07-09 15:25:36,742][26022] Updated weights on worker 0-0, policy_version 307426 (0.00084) [2022-07-09 15:25:37,503][25689] Fps is (10 sec: 5497.9, 60 sec: 5637.0, 300 sec: 5632.6). Total num frames: 314807296. Throughput: 0: 5889.3. Samples: 314807278. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:25:37,503][25689] Avg episode reward: [(0, '-47.578')] [2022-07-09 15:25:38,299][26022] Updated weights on worker 0-0, policy_version 307436 (0.00080) [2022-07-09 15:25:40,323][26022] Updated weights on worker 0-0, policy_version 307446 (0.00092) [2022-07-09 15:25:41,981][26022] Updated weights on worker 0-0, policy_version 307456 (0.00089) [2022-07-09 15:25:42,511][25689] Fps is (10 sec: 5603.4, 60 sec: 5657.4, 300 sec: 5636.8). Total num frames: 314836992. Throughput: 0: 5932.5. Samples: 314841768. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:25:42,511][25689] Avg episode reward: [(0, '-46.574')] [2022-07-09 15:25:43,983][26022] Updated weights on worker 0-0, policy_version 307466 (0.00099) [2022-07-09 15:25:45,740][26022] Updated weights on worker 0-0, policy_version 307476 (0.00334) [2022-07-09 15:25:47,422][26022] Updated weights on worker 0-0, policy_version 307486 (0.00884) [2022-07-09 15:25:47,638][25689] Fps is (10 sec: 5860.0, 60 sec: 5633.2, 300 sec: 5641.4). Total num frames: 314866688. Throughput: 0: 5085.2. Samples: 314858940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:25:47,641][25689] Avg episode reward: [(0, '-45.338')] [2022-07-09 15:25:49,259][26022] Updated weights on worker 0-0, policy_version 307496 (0.00092) [2022-07-09 15:25:50,981][26022] Updated weights on worker 0-0, policy_version 307506 (0.00087) [2022-07-09 15:25:52,670][25689] Fps is (10 sec: 5745.5, 60 sec: 5688.6, 300 sec: 5641.3). Total num frames: 314895360. Throughput: 0: 5942.4. Samples: 314893350. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:25:52,670][25689] Avg episode reward: [(0, '-45.012')] [2022-07-09 15:25:52,828][26022] Updated weights on worker 0-0, policy_version 307516 (0.00088) [2022-07-09 15:25:54,614][26022] Updated weights on worker 0-0, policy_version 307526 (0.00090) [2022-07-09 15:25:56,371][26022] Updated weights on worker 0-0, policy_version 307536 (0.00106) [2022-07-09 15:25:57,682][25689] Fps is (10 sec: 5607.1, 60 sec: 5638.7, 300 sec: 5637.8). Total num frames: 314923008. Throughput: 0: 5961.0. Samples: 314927568. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:25:57,683][25689] Avg episode reward: [(0, '-43.978')] [2022-07-09 15:25:58,246][26022] Updated weights on worker 0-0, policy_version 307546 (0.00093) [2022-07-09 15:26:00,020][26022] Updated weights on worker 0-0, policy_version 307556 (0.00083) [2022-07-09 15:26:01,704][26022] Updated weights on worker 0-0, policy_version 307566 (0.00090) [2022-07-09 15:26:02,697][25689] Fps is (10 sec: 5412.4, 60 sec: 5643.5, 300 sec: 5643.4). Total num frames: 314949632. Throughput: 0: 5833.4. Samples: 314959522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:02,697][25689] Avg episode reward: [(0, '-44.700')] [2022-07-09 15:26:04,044][26022] Updated weights on worker 0-0, policy_version 307576 (0.00092) [2022-07-09 15:26:05,889][26022] Updated weights on worker 0-0, policy_version 307586 (0.00086) [2022-07-09 15:26:07,743][25689] Fps is (10 sec: 5394.4, 60 sec: 5650.1, 300 sec: 5632.7). Total num frames: 314977280. Throughput: 0: 5831.4. Samples: 314976182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:07,744][25689] Avg episode reward: [(0, '-44.581')] [2022-07-09 15:26:07,774][26022] Updated weights on worker 0-0, policy_version 307596 (0.00091) [2022-07-09 15:26:09,443][26022] Updated weights on worker 0-0, policy_version 307606 (0.00084) [2022-07-09 15:26:11,415][26022] Updated weights on worker 0-0, policy_version 307616 (0.00098) [2022-07-09 15:26:12,755][25689] Fps is (10 sec: 5701.1, 60 sec: 5650.5, 300 sec: 5643.2). Total num frames: 315006976. Throughput: 0: 5834.6. Samples: 315010542. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:12,757][25689] Avg episode reward: [(0, '-44.747')] [2022-07-09 15:26:12,980][26022] Updated weights on worker 0-0, policy_version 307626 (0.00086) [2022-07-09 15:26:15,022][26022] Updated weights on worker 0-0, policy_version 307636 (0.00088) [2022-07-09 15:26:16,678][26022] Updated weights on worker 0-0, policy_version 307646 (0.00089) [2022-07-09 15:26:17,766][25689] Fps is (10 sec: 5721.4, 60 sec: 5637.1, 300 sec: 5636.3). Total num frames: 315034624. Throughput: 0: 5838.9. Samples: 315044834. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:17,766][25689] Avg episode reward: [(0, '-45.182')] [2022-07-09 15:26:18,477][26022] Updated weights on worker 0-0, policy_version 307656 (0.00086) [2022-07-09 15:26:20,360][26022] Updated weights on worker 0-0, policy_version 307666 (0.00085) [2022-07-09 15:26:22,008][26022] Updated weights on worker 0-0, policy_version 307676 (0.00086) [2022-07-09 15:26:22,770][25689] Fps is (10 sec: 5725.8, 60 sec: 5662.9, 300 sec: 5645.2). Total num frames: 315064320. Throughput: 0: 5102.0. Samples: 315061936. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:22,771][25689] Avg episode reward: [(0, '-46.002')] [2022-07-09 15:26:23,917][26022] Updated weights on worker 0-0, policy_version 307686 (0.00082) [2022-07-09 15:26:25,766][26022] Updated weights on worker 0-0, policy_version 307696 (0.00093) [2022-07-09 15:26:27,451][26022] Updated weights on worker 0-0, policy_version 307706 (0.00086) [2022-07-09 15:26:27,834][25689] Fps is (10 sec: 5695.5, 60 sec: 5629.6, 300 sec: 5634.4). Total num frames: 315091968. Throughput: 0: 5963.5. Samples: 315095994. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:27,835][25689] Avg episode reward: [(0, '-45.876')] [2022-07-09 15:26:29,259][26022] Updated weights on worker 0-0, policy_version 307716 (0.00084) [2022-07-09 15:26:31,320][26022] Updated weights on worker 0-0, policy_version 307726 (0.00084) [2022-07-09 15:26:32,860][25689] Fps is (10 sec: 5581.9, 60 sec: 5628.6, 300 sec: 5641.3). Total num frames: 315120640. Throughput: 0: 5935.3. Samples: 315129870. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:32,861][25689] Avg episode reward: [(0, '-46.318')] [2022-07-09 15:26:32,933][26022] Updated weights on worker 0-0, policy_version 307736 (0.00093) [2022-07-09 15:26:34,896][26022] Updated weights on worker 0-0, policy_version 307746 (0.00086) [2022-07-09 15:26:36,487][26022] Updated weights on worker 0-0, policy_version 307756 (0.00107) [2022-07-09 15:26:37,868][25689] Fps is (10 sec: 5715.3, 60 sec: 5665.8, 300 sec: 5638.0). Total num frames: 315149312. Throughput: 0: 5063.8. Samples: 315146626. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:37,868][25689] Avg episode reward: [(0, '-46.443')] [2022-07-09 15:26:38,420][26022] Updated weights on worker 0-0, policy_version 307766 (0.00094) [2022-07-09 15:26:40,198][26022] Updated weights on worker 0-0, policy_version 307776 (0.00092) [2022-07-09 15:26:42,066][26022] Updated weights on worker 0-0, policy_version 307786 (0.00088) [2022-07-09 15:26:42,957][25689] Fps is (10 sec: 5679.3, 60 sec: 5641.2, 300 sec: 5633.6). Total num frames: 315177984. Throughput: 0: 5879.2. Samples: 315180618. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:42,958][25689] Avg episode reward: [(0, '-46.925')] [2022-07-09 15:26:43,699][26022] Updated weights on worker 0-0, policy_version 307796 (0.00089) [2022-07-09 15:26:45,739][26022] Updated weights on worker 0-0, policy_version 307806 (0.00097) [2022-07-09 15:26:47,451][26022] Updated weights on worker 0-0, policy_version 307816 (0.00108) [2022-07-09 15:26:48,095][25689] Fps is (10 sec: 5607.1, 60 sec: 5623.3, 300 sec: 5639.8). Total num frames: 315206656. Throughput: 0: 5860.3. Samples: 315214726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:48,096][25689] Avg episode reward: [(0, '-46.439')] [2022-07-09 15:26:49,304][26022] Updated weights on worker 0-0, policy_version 307826 (0.00089) [2022-07-09 15:26:51,109][26022] Updated weights on worker 0-0, policy_version 307836 (0.00567) [2022-07-09 15:26:52,689][26022] Updated weights on worker 0-0, policy_version 307846 (0.00085) [2022-07-09 15:26:53,099][25689] Fps is (10 sec: 5654.3, 60 sec: 5625.9, 300 sec: 5639.8). Total num frames: 315235328. Throughput: 0: 5045.3. Samples: 315231980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:53,100][25689] Avg episode reward: [(0, '-47.118')] [2022-07-09 15:26:54,749][26022] Updated weights on worker 0-0, policy_version 307856 (0.00093) [2022-07-09 15:26:56,462][26022] Updated weights on worker 0-0, policy_version 307866 (0.00088) [2022-07-09 15:26:58,147][25689] Fps is (10 sec: 5806.3, 60 sec: 5656.5, 300 sec: 5640.3). Total num frames: 315265024. Throughput: 0: 5906.1. Samples: 315266398. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:26:58,148][25689] Avg episode reward: [(0, '-46.878')] [2022-07-09 15:26:58,154][26022] Updated weights on worker 0-0, policy_version 307876 (0.00429) [2022-07-09 15:27:00,258][26022] Updated weights on worker 0-0, policy_version 307886 (0.00091) [2022-07-09 15:27:01,583][26022] Updated weights on worker 0-0, policy_version 307896 (0.00085) [2022-07-09 15:27:03,164][25689] Fps is (10 sec: 5494.0, 60 sec: 5639.3, 300 sec: 5642.4). Total num frames: 315290624. Throughput: 0: 5837.0. Samples: 315298562. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:03,164][25689] Avg episode reward: [(0, '-46.685')] [2022-07-09 15:27:04,015][26022] Updated weights on worker 0-0, policy_version 307906 (0.00098) [2022-07-09 15:27:05,999][26022] Updated weights on worker 0-0, policy_version 307916 (0.00086) [2022-07-09 15:27:07,591][26022] Updated weights on worker 0-0, policy_version 307926 (0.00079) [2022-07-09 15:27:08,207][25689] Fps is (10 sec: 5395.0, 60 sec: 5656.5, 300 sec: 5642.8). Total num frames: 315319296. Throughput: 0: 4993.1. Samples: 315315148. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:08,207][25689] Avg episode reward: [(0, '-46.168')] [2022-07-09 15:27:08,791][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:27:08,810][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000307932_315322368.pth [2022-07-09 15:27:08,811][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000305946_313288704.pth [2022-07-09 15:27:09,582][26022] Updated weights on worker 0-0, policy_version 307936 (0.00085) [2022-07-09 15:27:11,406][26022] Updated weights on worker 0-0, policy_version 307946 (0.00075) [2022-07-09 15:27:13,120][26022] Updated weights on worker 0-0, policy_version 307956 (0.00089) [2022-07-09 15:27:13,215][25689] Fps is (10 sec: 5603.4, 60 sec: 5623.1, 300 sec: 5639.5). Total num frames: 315346944. Throughput: 0: 5837.8. Samples: 315349412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:13,215][25689] Avg episode reward: [(0, '-45.886')] [2022-07-09 15:27:14,903][26022] Updated weights on worker 0-0, policy_version 307966 (0.00085) [2022-07-09 15:27:16,607][26022] Updated weights on worker 0-0, policy_version 307976 (0.00086) [2022-07-09 15:27:18,262][25689] Fps is (10 sec: 5499.3, 60 sec: 5619.7, 300 sec: 5632.1). Total num frames: 315374592. Throughput: 0: 5821.6. Samples: 315383498. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:18,262][25689] Avg episode reward: [(0, '-46.767')] [2022-07-09 15:27:18,551][26022] Updated weights on worker 0-0, policy_version 307986 (0.00072) [2022-07-09 15:27:20,355][26022] Updated weights on worker 0-0, policy_version 307996 (0.00088) [2022-07-09 15:27:22,068][26022] Updated weights on worker 0-0, policy_version 308006 (0.00083) [2022-07-09 15:27:23,286][25689] Fps is (10 sec: 5693.7, 60 sec: 5617.8, 300 sec: 5640.0). Total num frames: 315404288. Throughput: 0: 5064.7. Samples: 315400478. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:23,287][25689] Avg episode reward: [(0, '-45.814')] [2022-07-09 15:27:23,980][26022] Updated weights on worker 0-0, policy_version 308016 (0.00087) [2022-07-09 15:27:25,828][26022] Updated weights on worker 0-0, policy_version 308026 (0.00091) [2022-07-09 15:27:27,431][26022] Updated weights on worker 0-0, policy_version 308036 (0.00096) [2022-07-09 15:27:28,373][25689] Fps is (10 sec: 5772.6, 60 sec: 5632.6, 300 sec: 5642.2). Total num frames: 315432960. Throughput: 0: 5911.0. Samples: 315434352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:28,374][25689] Avg episode reward: [(0, '-46.225')] [2022-07-09 15:27:29,511][26022] Updated weights on worker 0-0, policy_version 308046 (0.00090) [2022-07-09 15:27:30,971][26022] Updated weights on worker 0-0, policy_version 308056 (0.00088) [2022-07-09 15:27:33,064][26022] Updated weights on worker 0-0, policy_version 308066 (0.00089) [2022-07-09 15:27:33,451][25689] Fps is (10 sec: 5742.3, 60 sec: 5644.7, 300 sec: 5644.3). Total num frames: 315462656. Throughput: 0: 5890.6. Samples: 315468614. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:33,451][25689] Avg episode reward: [(0, '-46.094')] [2022-07-09 15:27:34,714][26022] Updated weights on worker 0-0, policy_version 308076 (0.00098) [2022-07-09 15:27:36,588][26022] Updated weights on worker 0-0, policy_version 308086 (0.00089) [2022-07-09 15:27:38,436][26022] Updated weights on worker 0-0, policy_version 308096 (0.00089) [2022-07-09 15:27:38,529][25689] Fps is (10 sec: 5646.4, 60 sec: 5621.2, 300 sec: 5636.6). Total num frames: 315490304. Throughput: 0: 5017.1. Samples: 315485180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:38,530][25689] Avg episode reward: [(0, '-46.584')] [2022-07-09 15:27:40,279][26022] Updated weights on worker 0-0, policy_version 308106 (0.00085) [2022-07-09 15:27:42,015][26022] Updated weights on worker 0-0, policy_version 308116 (0.00094) [2022-07-09 15:27:43,535][25689] Fps is (10 sec: 5584.8, 60 sec: 5629.0, 300 sec: 5641.7). Total num frames: 315518976. Throughput: 0: 5869.5. Samples: 315519330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:43,536][25689] Avg episode reward: [(0, '-46.498')] [2022-07-09 15:27:43,827][26022] Updated weights on worker 0-0, policy_version 308126 (0.00097) [2022-07-09 15:27:45,573][26022] Updated weights on worker 0-0, policy_version 308136 (0.00087) [2022-07-09 15:27:47,614][26022] Updated weights on worker 0-0, policy_version 308146 (0.00086) [2022-07-09 15:27:48,591][25689] Fps is (10 sec: 5597.5, 60 sec: 5619.7, 300 sec: 5634.0). Total num frames: 315546624. Throughput: 0: 5886.9. Samples: 315553370. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:48,591][25689] Avg episode reward: [(0, '-46.468')] [2022-07-09 15:27:49,245][26022] Updated weights on worker 0-0, policy_version 308156 (0.00085) [2022-07-09 15:27:51,043][26022] Updated weights on worker 0-0, policy_version 308166 (0.00086) [2022-07-09 15:27:52,827][26022] Updated weights on worker 0-0, policy_version 308176 (0.00085) [2022-07-09 15:27:53,601][25689] Fps is (10 sec: 5595.3, 60 sec: 5619.1, 300 sec: 5637.3). Total num frames: 315575296. Throughput: 0: 5058.8. Samples: 315570550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:53,601][25689] Avg episode reward: [(0, '-46.871')] [2022-07-09 15:27:54,440][26022] Updated weights on worker 0-0, policy_version 308186 (0.00087) [2022-07-09 15:27:56,513][26022] Updated weights on worker 0-0, policy_version 308196 (0.00089) [2022-07-09 15:27:58,115][26022] Updated weights on worker 0-0, policy_version 308206 (0.01305) [2022-07-09 15:27:58,607][25689] Fps is (10 sec: 5725.2, 60 sec: 5606.1, 300 sec: 5640.7). Total num frames: 315603968. Throughput: 0: 5966.3. Samples: 315604968. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 15:27:58,607][25689] Avg episode reward: [(0, '-46.698')] [2022-07-09 15:28:00,050][26022] Updated weights on worker 0-0, policy_version 308216 (0.00088) [2022-07-09 15:28:02,387][26022] Updated weights on worker 0-0, policy_version 308226 (0.00085) [2022-07-09 15:28:03,612][25689] Fps is (10 sec: 5523.6, 60 sec: 5624.1, 300 sec: 5638.9). Total num frames: 315630592. Throughput: 0: 5858.9. Samples: 315636956. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:03,614][25689] Avg episode reward: [(0, '-46.765')] [2022-07-09 15:28:03,965][26022] Updated weights on worker 0-0, policy_version 308236 (0.00090) [2022-07-09 15:28:05,975][26022] Updated weights on worker 0-0, policy_version 308246 (0.00093) [2022-07-09 15:28:07,730][26022] Updated weights on worker 0-0, policy_version 308256 (0.00087) [2022-07-09 15:28:08,715][25689] Fps is (10 sec: 5369.2, 60 sec: 5601.7, 300 sec: 5633.6). Total num frames: 315658240. Throughput: 0: 4973.3. Samples: 315653452. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:08,715][25689] Avg episode reward: [(0, '-47.109')] [2022-07-09 15:28:09,422][26022] Updated weights on worker 0-0, policy_version 308266 (0.00089) [2022-07-09 15:28:11,311][26022] Updated weights on worker 0-0, policy_version 308276 (0.00089) [2022-07-09 15:28:13,329][26022] Updated weights on worker 0-0, policy_version 308286 (0.00091) [2022-07-09 15:28:13,719][25689] Fps is (10 sec: 5673.5, 60 sec: 5635.8, 300 sec: 5638.8). Total num frames: 315687936. Throughput: 0: 5809.8. Samples: 315687434. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:13,720][25689] Avg episode reward: [(0, '-46.613')] [2022-07-09 15:28:15,019][26022] Updated weights on worker 0-0, policy_version 308296 (0.00083) [2022-07-09 15:28:16,849][26022] Updated weights on worker 0-0, policy_version 308306 (0.00080) [2022-07-09 15:28:18,562][26022] Updated weights on worker 0-0, policy_version 308316 (0.00086) [2022-07-09 15:28:18,731][25689] Fps is (10 sec: 5827.4, 60 sec: 5656.1, 300 sec: 5642.8). Total num frames: 315716608. Throughput: 0: 5796.0. Samples: 315721608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:18,731][25689] Avg episode reward: [(0, '-47.245')] [2022-07-09 15:28:20,601][26022] Updated weights on worker 0-0, policy_version 308326 (0.00081) [2022-07-09 15:28:22,050][26022] Updated weights on worker 0-0, policy_version 308336 (0.00092) [2022-07-09 15:28:23,743][25689] Fps is (10 sec: 5618.5, 60 sec: 5623.3, 300 sec: 5633.4). Total num frames: 315744256. Throughput: 0: 5036.6. Samples: 315738350. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:23,744][25689] Avg episode reward: [(0, '-48.090')] [2022-07-09 15:28:24,178][26022] Updated weights on worker 0-0, policy_version 308346 (0.00083) [2022-07-09 15:28:25,754][26022] Updated weights on worker 0-0, policy_version 308356 (0.00088) [2022-07-09 15:28:27,727][26022] Updated weights on worker 0-0, policy_version 308366 (0.00096) [2022-07-09 15:28:28,795][25689] Fps is (10 sec: 5698.1, 60 sec: 5643.6, 300 sec: 5644.5). Total num frames: 315773952. Throughput: 0: 5914.1. Samples: 315772206. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:28,795][25689] Avg episode reward: [(0, '-47.773')] [2022-07-09 15:28:29,452][26022] Updated weights on worker 0-0, policy_version 308376 (0.00613) [2022-07-09 15:28:31,176][26022] Updated weights on worker 0-0, policy_version 308386 (0.00082) [2022-07-09 15:28:33,129][26022] Updated weights on worker 0-0, policy_version 308396 (0.00085) [2022-07-09 15:28:33,847][25689] Fps is (10 sec: 5574.3, 60 sec: 5595.1, 300 sec: 5631.4). Total num frames: 315800576. Throughput: 0: 5926.0. Samples: 315806710. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:33,847][25689] Avg episode reward: [(0, '-48.063')] [2022-07-09 15:28:34,721][26022] Updated weights on worker 0-0, policy_version 308406 (0.00084) [2022-07-09 15:28:36,648][26022] Updated weights on worker 0-0, policy_version 308416 (0.00095) [2022-07-09 15:28:38,351][26022] Updated weights on worker 0-0, policy_version 308426 (0.00098) [2022-07-09 15:28:38,849][25689] Fps is (10 sec: 5499.5, 60 sec: 5619.1, 300 sec: 5634.9). Total num frames: 315829248. Throughput: 0: 5072.4. Samples: 315823658. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:38,851][25689] Avg episode reward: [(0, '-48.343')] [2022-07-09 15:28:40,377][26022] Updated weights on worker 0-0, policy_version 308436 (0.00081) [2022-07-09 15:28:42,212][26022] Updated weights on worker 0-0, policy_version 308446 (0.00091) [2022-07-09 15:28:43,843][26022] Updated weights on worker 0-0, policy_version 308456 (0.00084) [2022-07-09 15:28:43,855][25689] Fps is (10 sec: 5831.8, 60 sec: 5636.1, 300 sec: 5635.8). Total num frames: 315858944. Throughput: 0: 5917.5. Samples: 315857364. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:43,856][25689] Avg episode reward: [(0, '-47.216')] [2022-07-09 15:28:45,887][26022] Updated weights on worker 0-0, policy_version 308466 (0.00091) [2022-07-09 15:28:47,644][26022] Updated weights on worker 0-0, policy_version 308476 (0.00091) [2022-07-09 15:28:48,961][25689] Fps is (10 sec: 5671.4, 60 sec: 5631.4, 300 sec: 5634.0). Total num frames: 315886592. Throughput: 0: 5923.8. Samples: 315891666. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:48,961][25689] Avg episode reward: [(0, '-46.834')] [2022-07-09 15:28:49,322][26022] Updated weights on worker 0-0, policy_version 308486 (0.00085) [2022-07-09 15:28:51,196][26022] Updated weights on worker 0-0, policy_version 308496 (0.00092) [2022-07-09 15:28:53,015][26022] Updated weights on worker 0-0, policy_version 308506 (0.00084) [2022-07-09 15:28:53,975][25689] Fps is (10 sec: 5464.4, 60 sec: 5614.1, 300 sec: 5630.6). Total num frames: 315914240. Throughput: 0: 5061.0. Samples: 315908576. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:53,975][25689] Avg episode reward: [(0, '-47.014')] [2022-07-09 15:28:54,678][26022] Updated weights on worker 0-0, policy_version 308516 (0.00083) [2022-07-09 15:28:56,696][26022] Updated weights on worker 0-0, policy_version 308526 (0.00084) [2022-07-09 15:28:58,269][26022] Updated weights on worker 0-0, policy_version 308536 (0.00093) [2022-07-09 15:28:58,981][25689] Fps is (10 sec: 5723.0, 60 sec: 5631.0, 300 sec: 5637.8). Total num frames: 315943936. Throughput: 0: 5898.6. Samples: 315942404. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:28:58,981][25689] Avg episode reward: [(0, '-46.954')] [2022-07-09 15:29:00,366][26022] Updated weights on worker 0-0, policy_version 308546 (0.00087) [2022-07-09 15:29:02,468][26022] Updated weights on worker 0-0, policy_version 308556 (0.00092) [2022-07-09 15:29:03,994][25689] Fps is (10 sec: 5519.1, 60 sec: 5613.3, 300 sec: 5636.8). Total num frames: 315969536. Throughput: 0: 5812.6. Samples: 315974420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:03,994][25689] Avg episode reward: [(0, '-46.310')] [2022-07-09 15:29:04,229][26022] Updated weights on worker 0-0, policy_version 308566 (0.00085) [2022-07-09 15:29:06,128][26022] Updated weights on worker 0-0, policy_version 308576 (0.00087) [2022-07-09 15:29:07,759][26022] Updated weights on worker 0-0, policy_version 308586 (0.00088) [2022-07-09 15:29:08,835][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:29:08,848][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000308591_315997184.pth [2022-07-09 15:29:08,849][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000306607_313965568.pth [2022-07-09 15:29:09,085][25689] Fps is (10 sec: 5370.9, 60 sec: 5631.3, 300 sec: 5628.8). Total num frames: 315998208. Throughput: 0: 4957.7. Samples: 315991438. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:09,088][25689] Avg episode reward: [(0, '-46.554')] [2022-07-09 15:29:09,680][26022] Updated weights on worker 0-0, policy_version 308596 (0.00086) [2022-07-09 15:29:11,695][26022] Updated weights on worker 0-0, policy_version 308606 (0.00078) [2022-07-09 15:29:13,388][26022] Updated weights on worker 0-0, policy_version 308616 (0.00085) [2022-07-09 15:29:14,098][25689] Fps is (10 sec: 5675.5, 60 sec: 5613.7, 300 sec: 5632.6). Total num frames: 316026880. Throughput: 0: 5806.3. Samples: 316025416. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:14,098][25689] Avg episode reward: [(0, '-47.279')] [2022-07-09 15:29:15,226][26022] Updated weights on worker 0-0, policy_version 308626 (0.00089) [2022-07-09 15:29:16,807][26022] Updated weights on worker 0-0, policy_version 308636 (0.00092) [2022-07-09 15:29:18,924][26022] Updated weights on worker 0-0, policy_version 308646 (0.00090) [2022-07-09 15:29:19,102][25689] Fps is (10 sec: 5622.4, 60 sec: 5597.3, 300 sec: 5629.2). Total num frames: 316054528. Throughput: 0: 5815.5. Samples: 316059422. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:19,103][25689] Avg episode reward: [(0, '-47.196')] [2022-07-09 15:29:20,469][26022] Updated weights on worker 0-0, policy_version 308656 (0.00087) [2022-07-09 15:29:22,347][26022] Updated weights on worker 0-0, policy_version 308666 (0.00090) [2022-07-09 15:29:23,978][26022] Updated weights on worker 0-0, policy_version 308676 (0.00084) [2022-07-09 15:29:24,122][25689] Fps is (10 sec: 5720.2, 60 sec: 5630.6, 300 sec: 5634.3). Total num frames: 316084224. Throughput: 0: 5071.1. Samples: 316076494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:24,123][25689] Avg episode reward: [(0, '-48.336')] [2022-07-09 15:29:25,919][26022] Updated weights on worker 0-0, policy_version 308686 (0.00085) [2022-07-09 15:29:27,877][26022] Updated weights on worker 0-0, policy_version 308696 (0.00091) [2022-07-09 15:29:29,210][25689] Fps is (10 sec: 5774.6, 60 sec: 5610.2, 300 sec: 5633.0). Total num frames: 316112896. Throughput: 0: 5917.4. Samples: 316110524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:29,211][25689] Avg episode reward: [(0, '-48.476')] [2022-07-09 15:29:29,572][26022] Updated weights on worker 0-0, policy_version 308706 (0.00093) [2022-07-09 15:29:31,485][26022] Updated weights on worker 0-0, policy_version 308716 (0.00083) [2022-07-09 15:29:33,119][26022] Updated weights on worker 0-0, policy_version 308726 (0.00087) [2022-07-09 15:29:34,228][25689] Fps is (10 sec: 5573.2, 60 sec: 5630.4, 300 sec: 5633.1). Total num frames: 316140544. Throughput: 0: 5918.5. Samples: 316144556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:34,228][25689] Avg episode reward: [(0, '-48.906')] [2022-07-09 15:29:34,991][26022] Updated weights on worker 0-0, policy_version 308736 (0.00090) [2022-07-09 15:29:37,031][26022] Updated weights on worker 0-0, policy_version 308746 (0.00089) [2022-07-09 15:29:38,753][26022] Updated weights on worker 0-0, policy_version 308756 (0.00087) [2022-07-09 15:29:39,249][25689] Fps is (10 sec: 5712.0, 60 sec: 5645.6, 300 sec: 5637.0). Total num frames: 316170240. Throughput: 0: 5899.8. Samples: 316178284. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:39,250][25689] Avg episode reward: [(0, '-48.250')] [2022-07-09 15:29:40,367][26022] Updated weights on worker 0-0, policy_version 308766 (0.00090) [2022-07-09 15:29:42,306][26022] Updated weights on worker 0-0, policy_version 308776 (0.00087) [2022-07-09 15:29:43,936][26022] Updated weights on worker 0-0, policy_version 308786 (0.00089) [2022-07-09 15:29:44,273][25689] Fps is (10 sec: 5606.5, 60 sec: 5593.1, 300 sec: 5623.7). Total num frames: 316196864. Throughput: 0: 5898.7. Samples: 316195358. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:44,274][25689] Avg episode reward: [(0, '-47.572')] [2022-07-09 15:29:45,855][26022] Updated weights on worker 0-0, policy_version 308796 (0.00106) [2022-07-09 15:29:47,828][26022] Updated weights on worker 0-0, policy_version 308806 (0.00086) [2022-07-09 15:29:49,377][25689] Fps is (10 sec: 5561.3, 60 sec: 5627.1, 300 sec: 5637.0). Total num frames: 316226560. Throughput: 0: 5904.3. Samples: 316229592. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:49,378][25689] Avg episode reward: [(0, '-47.176')] [2022-07-09 15:29:49,493][26022] Updated weights on worker 0-0, policy_version 308816 (0.00081) [2022-07-09 15:29:51,411][26022] Updated weights on worker 0-0, policy_version 308826 (0.00083) [2022-07-09 15:29:53,111][26022] Updated weights on worker 0-0, policy_version 308836 (0.00093) [2022-07-09 15:29:54,412][25689] Fps is (10 sec: 5757.0, 60 sec: 5642.1, 300 sec: 5629.9). Total num frames: 316255232. Throughput: 0: 5931.4. Samples: 316264276. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:54,413][25689] Avg episode reward: [(0, '-47.708')] [2022-07-09 15:29:54,758][26022] Updated weights on worker 0-0, policy_version 308846 (0.00090) [2022-07-09 15:29:56,860][26022] Updated weights on worker 0-0, policy_version 308856 (0.00088) [2022-07-09 15:29:58,317][26022] Updated weights on worker 0-0, policy_version 308866 (0.00089) [2022-07-09 15:29:59,439][25689] Fps is (10 sec: 5699.0, 60 sec: 5623.2, 300 sec: 5637.5). Total num frames: 316283904. Throughput: 0: 5093.6. Samples: 316281120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:29:59,439][25689] Avg episode reward: [(0, '-47.337')] [2022-07-09 15:30:00,339][26022] Updated weights on worker 0-0, policy_version 308876 (0.00091) [2022-07-09 15:30:02,126][26022] Updated weights on worker 0-0, policy_version 308886 (0.00086) [2022-07-09 15:30:04,246][26022] Updated weights on worker 0-0, policy_version 308896 (0.00086) [2022-07-09 15:30:04,441][25689] Fps is (10 sec: 5513.8, 60 sec: 5641.2, 300 sec: 5636.2). Total num frames: 316310528. Throughput: 0: 5818.1. Samples: 316312694. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:30:04,441][25689] Avg episode reward: [(0, '-48.179')] [2022-07-09 15:30:06,003][26022] Updated weights on worker 0-0, policy_version 308906 (0.00092) [2022-07-09 15:30:07,946][26022] Updated weights on worker 0-0, policy_version 308916 (0.00109) [2022-07-09 15:30:09,501][25689] Fps is (10 sec: 5291.6, 60 sec: 5610.2, 300 sec: 5625.1). Total num frames: 316337152. Throughput: 0: 5788.4. Samples: 316346082. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:30:09,502][25689] Avg episode reward: [(0, '-48.905')] [2022-07-09 15:30:09,732][26022] Updated weights on worker 0-0, policy_version 308926 (0.00085) [2022-07-09 15:30:11,564][26022] Updated weights on worker 0-0, policy_version 308936 (0.00084) [2022-07-09 15:30:13,333][26022] Updated weights on worker 0-0, policy_version 308946 (0.00094) [2022-07-09 15:30:14,510][25689] Fps is (10 sec: 5593.1, 60 sec: 5627.4, 300 sec: 5629.3). Total num frames: 316366848. Throughput: 0: 4918.3. Samples: 316363126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:30:14,511][25689] Avg episode reward: [(0, '-48.360')] [2022-07-09 15:30:15,187][26022] Updated weights on worker 0-0, policy_version 308956 (0.00086) [2022-07-09 15:30:16,933][26022] Updated weights on worker 0-0, policy_version 308966 (0.00083) [2022-07-09 15:30:18,685][26022] Updated weights on worker 0-0, policy_version 308976 (0.00087) [2022-07-09 15:30:19,541][25689] Fps is (10 sec: 5813.7, 60 sec: 5641.9, 300 sec: 5630.6). Total num frames: 316395520. Throughput: 0: 5813.2. Samples: 316397980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:30:19,542][25689] Avg episode reward: [(0, '-48.784')] [2022-07-09 15:30:20,487][26022] Updated weights on worker 0-0, policy_version 308986 (0.00089) [2022-07-09 15:30:22,428][26022] Updated weights on worker 0-0, policy_version 308996 (0.00098) [2022-07-09 15:30:24,228][26022] Updated weights on worker 0-0, policy_version 309006 (0.00087) [2022-07-09 15:30:24,557][25689] Fps is (10 sec: 5809.7, 60 sec: 5642.3, 300 sec: 5631.6). Total num frames: 316425216. Throughput: 0: 5942.9. Samples: 316432242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 15:30:24,557][25689] Avg episode reward: [(0, '-48.557')] [2022-07-09 15:30:26,036][26022] Updated weights on worker 0-0, policy_version 309016 (0.00092) [2022-07-09 15:30:27,719][26022] Updated weights on worker 0-0, policy_version 309026 (0.00084) [2022-07-09 15:30:29,632][25689] Fps is (10 sec: 5581.3, 60 sec: 5609.6, 300 sec: 5623.6). Total num frames: 316451840. Throughput: 0: 5121.4. Samples: 316449178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:29,633][25689] Avg episode reward: [(0, '-48.066')] [2022-07-09 15:30:29,684][26022] Updated weights on worker 0-0, policy_version 309036 (0.00086) [2022-07-09 15:30:31,326][26022] Updated weights on worker 0-0, policy_version 309046 (0.00083) [2022-07-09 15:30:33,270][26022] Updated weights on worker 0-0, policy_version 309056 (0.00088) [2022-07-09 15:30:34,663][25689] Fps is (10 sec: 5573.0, 60 sec: 5642.3, 300 sec: 5634.1). Total num frames: 316481536. Throughput: 0: 5962.6. Samples: 316483286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:34,663][25689] Avg episode reward: [(0, '-47.679')] [2022-07-09 15:30:34,964][26022] Updated weights on worker 0-0, policy_version 309066 (0.00086) [2022-07-09 15:30:36,743][26022] Updated weights on worker 0-0, policy_version 309076 (0.00052) [2022-07-09 15:30:38,489][26022] Updated weights on worker 0-0, policy_version 309086 (0.00104) [2022-07-09 15:30:39,678][25689] Fps is (10 sec: 5809.8, 60 sec: 5625.9, 300 sec: 5630.5). Total num frames: 316510208. Throughput: 0: 5926.5. Samples: 316517324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:39,679][25689] Avg episode reward: [(0, '-47.574')] [2022-07-09 15:30:40,290][26022] Updated weights on worker 0-0, policy_version 309096 (0.00091) [2022-07-09 15:30:42,032][26022] Updated weights on worker 0-0, policy_version 309106 (0.00085) [2022-07-09 15:30:44,014][26022] Updated weights on worker 0-0, policy_version 309116 (0.00090) [2022-07-09 15:30:44,696][25689] Fps is (10 sec: 5613.4, 60 sec: 5643.5, 300 sec: 5625.7). Total num frames: 316537856. Throughput: 0: 5077.1. Samples: 316534490. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:44,698][25689] Avg episode reward: [(0, '-47.181')] [2022-07-09 15:30:45,545][26022] Updated weights on worker 0-0, policy_version 309126 (0.00086) [2022-07-09 15:30:47,630][26022] Updated weights on worker 0-0, policy_version 309136 (0.00084) [2022-07-09 15:30:49,225][26022] Updated weights on worker 0-0, policy_version 309146 (0.00084) [2022-07-09 15:30:49,784][25689] Fps is (10 sec: 5674.6, 60 sec: 5644.9, 300 sec: 5628.1). Total num frames: 316567552. Throughput: 0: 5941.9. Samples: 316568918. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:49,786][25689] Avg episode reward: [(0, '-46.782')] [2022-07-09 15:30:51,110][26022] Updated weights on worker 0-0, policy_version 309156 (0.00090) [2022-07-09 15:30:52,905][26022] Updated weights on worker 0-0, policy_version 309166 (0.00096) [2022-07-09 15:30:54,634][26022] Updated weights on worker 0-0, policy_version 309176 (0.00096) [2022-07-09 15:30:54,806][25689] Fps is (10 sec: 5773.0, 60 sec: 5646.1, 300 sec: 5631.4). Total num frames: 316596224. Throughput: 0: 5949.0. Samples: 316603120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:54,808][25689] Avg episode reward: [(0, '-45.757')] [2022-07-09 15:30:56,511][26022] Updated weights on worker 0-0, policy_version 309186 (0.00089) [2022-07-09 15:30:58,154][26022] Updated weights on worker 0-0, policy_version 309196 (0.00082) [2022-07-09 15:30:59,822][25689] Fps is (10 sec: 5712.6, 60 sec: 5647.1, 300 sec: 5638.2). Total num frames: 316624896. Throughput: 0: 5118.9. Samples: 316620438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:30:59,824][25689] Avg episode reward: [(0, '-44.959')] [2022-07-09 15:31:00,047][26022] Updated weights on worker 0-0, policy_version 309206 (0.00084) [2022-07-09 15:31:01,835][26022] Updated weights on worker 0-0, policy_version 309216 (0.00082) [2022-07-09 15:31:04,091][26022] Updated weights on worker 0-0, policy_version 309226 (0.00085) [2022-07-09 15:31:04,833][25689] Fps is (10 sec: 5617.1, 60 sec: 5663.2, 300 sec: 5638.9). Total num frames: 316652544. Throughput: 0: 5860.7. Samples: 316652508. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:04,835][25689] Avg episode reward: [(0, '-44.249')] [2022-07-09 15:31:06,114][26022] Updated weights on worker 0-0, policy_version 309236 (0.00101) [2022-07-09 15:31:07,631][26022] Updated weights on worker 0-0, policy_version 309246 (0.00095) [2022-07-09 15:31:08,922][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:31:08,941][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000309252_316674048.pth [2022-07-09 15:31:08,942][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000307270_314644480.pth [2022-07-09 15:31:09,505][26022] Updated weights on worker 0-0, policy_version 309256 (0.00113) [2022-07-09 15:31:09,951][25689] Fps is (10 sec: 5358.4, 60 sec: 5657.9, 300 sec: 5626.6). Total num frames: 316679168. Throughput: 0: 5827.4. Samples: 316686438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:09,952][25689] Avg episode reward: [(0, '-44.440')] [2022-07-09 15:31:11,150][26022] Updated weights on worker 0-0, policy_version 309266 (0.00087) [2022-07-09 15:31:13,174][26022] Updated weights on worker 0-0, policy_version 309276 (0.00085) [2022-07-09 15:31:14,958][25689] Fps is (10 sec: 5461.6, 60 sec: 5641.1, 300 sec: 5630.1). Total num frames: 316707840. Throughput: 0: 4969.8. Samples: 316703268. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:14,958][25689] Avg episode reward: [(0, '-43.361')] [2022-07-09 15:31:15,023][26022] Updated weights on worker 0-0, policy_version 309286 (0.00088) [2022-07-09 15:31:16,623][26022] Updated weights on worker 0-0, policy_version 309296 (0.00083) [2022-07-09 15:31:18,423][26022] Updated weights on worker 0-0, policy_version 309306 (0.00090) [2022-07-09 15:31:20,016][25689] Fps is (10 sec: 5900.7, 60 sec: 5672.5, 300 sec: 5632.5). Total num frames: 316738560. Throughput: 0: 5811.5. Samples: 316737794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:20,016][25689] Avg episode reward: [(0, '-44.496')] [2022-07-09 15:31:20,264][26022] Updated weights on worker 0-0, policy_version 309316 (0.00087) [2022-07-09 15:31:22,236][26022] Updated weights on worker 0-0, policy_version 309326 (0.00086) [2022-07-09 15:31:24,028][26022] Updated weights on worker 0-0, policy_version 309336 (0.00085) [2022-07-09 15:31:25,043][25689] Fps is (10 sec: 5686.0, 60 sec: 5620.6, 300 sec: 5629.8). Total num frames: 316765184. Throughput: 0: 5884.8. Samples: 316771438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:25,043][25689] Avg episode reward: [(0, '-45.414')] [2022-07-09 15:31:25,612][26022] Updated weights on worker 0-0, policy_version 309346 (0.00088) [2022-07-09 15:31:27,597][26022] Updated weights on worker 0-0, policy_version 309356 (0.00085) [2022-07-09 15:31:29,524][26022] Updated weights on worker 0-0, policy_version 309366 (0.00617) [2022-07-09 15:31:30,120][25689] Fps is (10 sec: 5574.2, 60 sec: 5671.3, 300 sec: 5632.3). Total num frames: 316794880. Throughput: 0: 5043.6. Samples: 316788162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:30,120][25689] Avg episode reward: [(0, '-46.161')] [2022-07-09 15:31:31,190][26022] Updated weights on worker 0-0, policy_version 309376 (0.00091) [2022-07-09 15:31:32,959][26022] Updated weights on worker 0-0, policy_version 309386 (0.00093) [2022-07-09 15:31:34,715][26022] Updated weights on worker 0-0, policy_version 309396 (0.00084) [2022-07-09 15:31:35,195][25689] Fps is (10 sec: 5749.0, 60 sec: 5650.1, 300 sec: 5631.0). Total num frames: 316823552. Throughput: 0: 5883.5. Samples: 316822338. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:35,196][25689] Avg episode reward: [(0, '-46.775')] [2022-07-09 15:31:36,650][26022] Updated weights on worker 0-0, policy_version 309406 (0.00084) [2022-07-09 15:31:38,411][26022] Updated weights on worker 0-0, policy_version 309416 (0.00089) [2022-07-09 15:31:40,258][25689] Fps is (10 sec: 5555.2, 60 sec: 5628.9, 300 sec: 5628.0). Total num frames: 316851200. Throughput: 0: 5868.3. Samples: 316856580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:40,258][25689] Avg episode reward: [(0, '-47.406')] [2022-07-09 15:31:40,304][26022] Updated weights on worker 0-0, policy_version 309426 (0.00089) [2022-07-09 15:31:41,995][26022] Updated weights on worker 0-0, policy_version 309436 (0.00092) [2022-07-09 15:31:43,861][26022] Updated weights on worker 0-0, policy_version 309446 (0.00088) [2022-07-09 15:31:45,279][25689] Fps is (10 sec: 5686.9, 60 sec: 5662.3, 300 sec: 5633.7). Total num frames: 316880896. Throughput: 0: 5043.1. Samples: 316873492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:45,280][25689] Avg episode reward: [(0, '-47.553')] [2022-07-09 15:31:45,772][26022] Updated weights on worker 0-0, policy_version 309456 (0.00562) [2022-07-09 15:31:47,472][26022] Updated weights on worker 0-0, policy_version 309466 (0.00090) [2022-07-09 15:31:49,303][26022] Updated weights on worker 0-0, policy_version 309476 (0.00084) [2022-07-09 15:31:50,371][25689] Fps is (10 sec: 5771.4, 60 sec: 5645.0, 300 sec: 5632.0). Total num frames: 316909568. Throughput: 0: 5903.4. Samples: 316907714. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:50,372][25689] Avg episode reward: [(0, '-47.061')] [2022-07-09 15:31:51,012][26022] Updated weights on worker 0-0, policy_version 309486 (0.00086) [2022-07-09 15:31:52,949][26022] Updated weights on worker 0-0, policy_version 309496 (0.00088) [2022-07-09 15:31:54,655][26022] Updated weights on worker 0-0, policy_version 309506 (0.00899) [2022-07-09 15:31:55,380][25689] Fps is (10 sec: 5677.1, 60 sec: 5646.3, 300 sec: 5629.3). Total num frames: 316938240. Throughput: 0: 5934.2. Samples: 316942116. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:31:55,380][25689] Avg episode reward: [(0, '-46.961')] [2022-07-09 15:31:56,536][26022] Updated weights on worker 0-0, policy_version 309516 (0.00099) [2022-07-09 15:31:57,985][26022] Updated weights on worker 0-0, policy_version 309526 (0.00084) [2022-07-09 15:32:00,077][26022] Updated weights on worker 0-0, policy_version 309536 (0.00054) [2022-07-09 15:32:00,382][25689] Fps is (10 sec: 5626.0, 60 sec: 5630.7, 300 sec: 5636.5). Total num frames: 316965888. Throughput: 0: 5956.6. Samples: 316976450. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:00,382][25689] Avg episode reward: [(0, '-46.251')] [2022-07-09 15:32:01,623][26022] Updated weights on worker 0-0, policy_version 309546 (0.00087) [2022-07-09 15:32:04,089][26022] Updated weights on worker 0-0, policy_version 309556 (0.00087) [2022-07-09 15:32:05,483][25689] Fps is (10 sec: 5473.1, 60 sec: 5622.3, 300 sec: 5631.9). Total num frames: 316993536. Throughput: 0: 5840.0. Samples: 316991484. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:05,484][25689] Avg episode reward: [(0, '-45.872')] [2022-07-09 15:32:05,727][26022] Updated weights on worker 0-0, policy_version 309566 (0.00088) [2022-07-09 15:32:07,704][26022] Updated weights on worker 0-0, policy_version 309576 (0.00082) [2022-07-09 15:32:09,421][26022] Updated weights on worker 0-0, policy_version 309586 (0.00090) [2022-07-09 15:32:10,629][25689] Fps is (10 sec: 5496.1, 60 sec: 5653.4, 300 sec: 5632.8). Total num frames: 317022208. Throughput: 0: 5808.0. Samples: 317025372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:10,630][25689] Avg episode reward: [(0, '-45.949')] [2022-07-09 15:32:11,316][26022] Updated weights on worker 0-0, policy_version 309596 (0.00087) [2022-07-09 15:32:13,120][26022] Updated weights on worker 0-0, policy_version 309606 (0.00094) [2022-07-09 15:32:14,871][26022] Updated weights on worker 0-0, policy_version 309616 (0.00093) [2022-07-09 15:32:15,681][25689] Fps is (10 sec: 5723.5, 60 sec: 5666.1, 300 sec: 5639.6). Total num frames: 317051904. Throughput: 0: 5773.7. Samples: 317059328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:15,682][25689] Avg episode reward: [(0, '-46.250')] [2022-07-09 15:32:16,645][26022] Updated weights on worker 0-0, policy_version 309626 (0.00091) [2022-07-09 15:32:18,499][26022] Updated weights on worker 0-0, policy_version 309636 (0.00097) [2022-07-09 15:32:20,182][26022] Updated weights on worker 0-0, policy_version 309646 (0.00093) [2022-07-09 15:32:20,768][25689] Fps is (10 sec: 5655.8, 60 sec: 5612.8, 300 sec: 5631.5). Total num frames: 317079552. Throughput: 0: 4900.5. Samples: 317076352. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:20,770][25689] Avg episode reward: [(0, '-46.001')] [2022-07-09 15:32:22,009][26022] Updated weights on worker 0-0, policy_version 309656 (0.00087) [2022-07-09 15:32:23,767][26022] Updated weights on worker 0-0, policy_version 309666 (0.00090) [2022-07-09 15:32:25,651][26022] Updated weights on worker 0-0, policy_version 309676 (0.00052) [2022-07-09 15:32:25,793][25689] Fps is (10 sec: 5569.5, 60 sec: 5646.7, 300 sec: 5632.7). Total num frames: 317108224. Throughput: 0: 5856.7. Samples: 317110434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:25,794][25689] Avg episode reward: [(0, '-46.644')] [2022-07-09 15:32:27,462][26022] Updated weights on worker 0-0, policy_version 309686 (0.00095) [2022-07-09 15:32:29,255][26022] Updated weights on worker 0-0, policy_version 309696 (0.00093) [2022-07-09 15:32:30,867][25689] Fps is (10 sec: 5678.4, 60 sec: 5630.2, 300 sec: 5629.3). Total num frames: 317136896. Throughput: 0: 5879.0. Samples: 317144350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:30,867][25689] Avg episode reward: [(0, '-46.925')] [2022-07-09 15:32:30,994][26022] Updated weights on worker 0-0, policy_version 309706 (0.00085) [2022-07-09 15:32:33,083][26022] Updated weights on worker 0-0, policy_version 309716 (0.00085) [2022-07-09 15:32:34,618][26022] Updated weights on worker 0-0, policy_version 309726 (0.00091) [2022-07-09 15:32:35,908][25689] Fps is (10 sec: 5669.3, 60 sec: 5633.4, 300 sec: 5633.4). Total num frames: 317165568. Throughput: 0: 5046.2. Samples: 317161398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:35,909][25689] Avg episode reward: [(0, '-46.913')] [2022-07-09 15:32:36,602][26022] Updated weights on worker 0-0, policy_version 309736 (0.00088) [2022-07-09 15:32:38,384][26022] Updated weights on worker 0-0, policy_version 309746 (0.00096) [2022-07-09 15:32:40,039][26022] Updated weights on worker 0-0, policy_version 309756 (0.00054) [2022-07-09 15:32:40,918][25689] Fps is (10 sec: 5704.8, 60 sec: 5655.1, 300 sec: 5633.4). Total num frames: 317194240. Throughput: 0: 5918.3. Samples: 317195608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:40,923][25689] Avg episode reward: [(0, '-48.240')] [2022-07-09 15:32:42,167][26022] Updated weights on worker 0-0, policy_version 309766 (0.00087) [2022-07-09 15:32:43,665][26022] Updated weights on worker 0-0, policy_version 309776 (0.00090) [2022-07-09 15:32:45,757][26022] Updated weights on worker 0-0, policy_version 309786 (0.00084) [2022-07-09 15:32:46,005][25689] Fps is (10 sec: 5679.2, 60 sec: 5632.1, 300 sec: 5636.2). Total num frames: 317222912. Throughput: 0: 5877.6. Samples: 317229232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:46,006][25689] Avg episode reward: [(0, '-47.304')] [2022-07-09 15:32:47,440][26022] Updated weights on worker 0-0, policy_version 309796 (0.00089) [2022-07-09 15:32:49,248][26022] Updated weights on worker 0-0, policy_version 309806 (0.00086) [2022-07-09 15:32:51,091][25689] Fps is (10 sec: 5435.8, 60 sec: 5599.0, 300 sec: 5627.9). Total num frames: 317249536. Throughput: 0: 5027.1. Samples: 317246020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:51,091][25689] Avg episode reward: [(0, '-47.681')] [2022-07-09 15:32:51,267][26022] Updated weights on worker 0-0, policy_version 309816 (0.00086) [2022-07-09 15:32:52,577][26022] Updated weights on worker 0-0, policy_version 309826 (0.00088) [2022-07-09 15:32:54,791][26022] Updated weights on worker 0-0, policy_version 309836 (0.00086) [2022-07-09 15:32:56,119][25689] Fps is (10 sec: 5770.9, 60 sec: 5647.7, 300 sec: 5637.8). Total num frames: 317281280. Throughput: 0: 5866.6. Samples: 317279970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 15:32:56,120][25689] Avg episode reward: [(0, '-48.038')] [2022-07-09 15:32:56,634][26022] Updated weights on worker 0-0, policy_version 309846 (0.00082) [2022-07-09 15:32:58,372][26022] Updated weights on worker 0-0, policy_version 309856 (0.00083) [2022-07-09 15:33:00,232][26022] Updated weights on worker 0-0, policy_version 309866 (0.00085) [2022-07-09 15:33:01,186][25689] Fps is (10 sec: 5781.6, 60 sec: 5624.8, 300 sec: 5636.6). Total num frames: 317307904. Throughput: 0: 5828.0. Samples: 317313730. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:01,187][25689] Avg episode reward: [(0, '-48.146')] [2022-07-09 15:33:02,264][26022] Updated weights on worker 0-0, policy_version 309876 (0.00086) [2022-07-09 15:33:04,142][26022] Updated weights on worker 0-0, policy_version 309886 (0.00088) [2022-07-09 15:33:05,982][26022] Updated weights on worker 0-0, policy_version 309896 (0.00090) [2022-07-09 15:33:06,224][25689] Fps is (10 sec: 5168.6, 60 sec: 5597.1, 300 sec: 5631.0). Total num frames: 317333504. Throughput: 0: 4923.6. Samples: 317328778. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:06,224][25689] Avg episode reward: [(0, '-48.588')] [2022-07-09 15:33:07,765][26022] Updated weights on worker 0-0, policy_version 309906 (0.00448) [2022-07-09 15:33:09,026][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:33:09,038][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000309912_317349888.pth [2022-07-09 15:33:09,039][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000307932_315322368.pth [2022-07-09 15:33:09,605][26022] Updated weights on worker 0-0, policy_version 309916 (0.00079) [2022-07-09 15:33:11,308][25689] Fps is (10 sec: 5463.2, 60 sec: 5619.6, 300 sec: 5629.5). Total num frames: 317363200. Throughput: 0: 5781.3. Samples: 317362900. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:11,312][25689] Avg episode reward: [(0, '-48.886')] [2022-07-09 15:33:11,339][26022] Updated weights on worker 0-0, policy_version 309926 (0.00088) [2022-07-09 15:33:13,170][26022] Updated weights on worker 0-0, policy_version 309936 (0.00089) [2022-07-09 15:33:14,975][26022] Updated weights on worker 0-0, policy_version 309946 (0.00090) [2022-07-09 15:33:16,395][25689] Fps is (10 sec: 5839.1, 60 sec: 5616.4, 300 sec: 5631.5). Total num frames: 317392896. Throughput: 0: 5792.4. Samples: 317397414. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:16,395][25689] Avg episode reward: [(0, '-48.214')] [2022-07-09 15:33:16,792][26022] Updated weights on worker 0-0, policy_version 309956 (0.00083) [2022-07-09 15:33:18,565][26022] Updated weights on worker 0-0, policy_version 309966 (0.00091) [2022-07-09 15:33:20,377][26022] Updated weights on worker 0-0, policy_version 309976 (0.00087) [2022-07-09 15:33:21,412][25689] Fps is (10 sec: 5776.8, 60 sec: 5639.7, 300 sec: 5634.8). Total num frames: 317421568. Throughput: 0: 4988.7. Samples: 317414628. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:21,412][25689] Avg episode reward: [(0, '-48.398')] [2022-07-09 15:33:21,947][26022] Updated weights on worker 0-0, policy_version 309986 (0.00083) [2022-07-09 15:33:23,927][26022] Updated weights on worker 0-0, policy_version 309996 (0.00093) [2022-07-09 15:33:25,622][26022] Updated weights on worker 0-0, policy_version 310006 (0.00091) [2022-07-09 15:33:26,456][25689] Fps is (10 sec: 5699.4, 60 sec: 5638.0, 300 sec: 5631.5). Total num frames: 317450240. Throughput: 0: 5940.8. Samples: 317448978. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:26,458][25689] Avg episode reward: [(0, '-47.707')] [2022-07-09 15:33:27,483][26022] Updated weights on worker 0-0, policy_version 310016 (0.00082) [2022-07-09 15:33:29,302][26022] Updated weights on worker 0-0, policy_version 310026 (0.00086) [2022-07-09 15:33:30,995][26022] Updated weights on worker 0-0, policy_version 310036 (0.00085) [2022-07-09 15:33:31,537][25689] Fps is (10 sec: 5562.2, 60 sec: 5620.4, 300 sec: 5634.4). Total num frames: 317477888. Throughput: 0: 5944.0. Samples: 317483144. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:31,539][25689] Avg episode reward: [(0, '-47.806')] [2022-07-09 15:33:32,893][26022] Updated weights on worker 0-0, policy_version 310046 (0.00083) [2022-07-09 15:33:34,587][26022] Updated weights on worker 0-0, policy_version 310056 (0.00090) [2022-07-09 15:33:36,419][26022] Updated weights on worker 0-0, policy_version 310066 (0.00092) [2022-07-09 15:33:36,579][25689] Fps is (10 sec: 5665.1, 60 sec: 5637.3, 300 sec: 5637.1). Total num frames: 317507584. Throughput: 0: 5104.5. Samples: 317500442. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:36,579][25689] Avg episode reward: [(0, '-46.338')] [2022-07-09 15:33:38,103][26022] Updated weights on worker 0-0, policy_version 310076 (0.00096) [2022-07-09 15:33:40,094][26022] Updated weights on worker 0-0, policy_version 310086 (0.00086) [2022-07-09 15:33:41,587][25689] Fps is (10 sec: 5807.5, 60 sec: 5637.4, 300 sec: 5633.6). Total num frames: 317536256. Throughput: 0: 5964.9. Samples: 317534974. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:41,589][25689] Avg episode reward: [(0, '-46.486')] [2022-07-09 15:33:41,645][26022] Updated weights on worker 0-0, policy_version 310096 (0.00088) [2022-07-09 15:33:43,645][26022] Updated weights on worker 0-0, policy_version 310106 (0.00083) [2022-07-09 15:33:45,315][26022] Updated weights on worker 0-0, policy_version 310116 (0.00079) [2022-07-09 15:33:46,599][25689] Fps is (10 sec: 5824.9, 60 sec: 5661.3, 300 sec: 5642.3). Total num frames: 317565952. Throughput: 0: 5961.2. Samples: 317569054. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:46,601][25689] Avg episode reward: [(0, '-46.287')] [2022-07-09 15:33:47,282][26022] Updated weights on worker 0-0, policy_version 310126 (0.00087) [2022-07-09 15:33:48,983][26022] Updated weights on worker 0-0, policy_version 310136 (0.00094) [2022-07-09 15:33:50,859][26022] Updated weights on worker 0-0, policy_version 310146 (0.00089) [2022-07-09 15:33:51,663][25689] Fps is (10 sec: 5691.5, 60 sec: 5680.3, 300 sec: 5641.3). Total num frames: 317593600. Throughput: 0: 5962.6. Samples: 317603146. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:51,663][25689] Avg episode reward: [(0, '-47.053')] [2022-07-09 15:33:52,489][26022] Updated weights on worker 0-0, policy_version 310156 (0.00085) [2022-07-09 15:33:54,532][26022] Updated weights on worker 0-0, policy_version 310166 (0.00093) [2022-07-09 15:33:56,096][26022] Updated weights on worker 0-0, policy_version 310176 (0.00098) [2022-07-09 15:33:56,672][25689] Fps is (10 sec: 5591.2, 60 sec: 5631.3, 300 sec: 5637.8). Total num frames: 317622272. Throughput: 0: 5961.4. Samples: 317620228. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:33:56,673][25689] Avg episode reward: [(0, '-46.656')] [2022-07-09 15:33:58,191][26022] Updated weights on worker 0-0, policy_version 310186 (0.00088) [2022-07-09 15:33:59,892][26022] Updated weights on worker 0-0, policy_version 310196 (0.00083) [2022-07-09 15:34:01,695][25689] Fps is (10 sec: 5512.0, 60 sec: 5635.4, 300 sec: 5641.1). Total num frames: 317648896. Throughput: 0: 5954.8. Samples: 317654708. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:01,695][25689] Avg episode reward: [(0, '-45.369')] [2022-07-09 15:34:02,052][26022] Updated weights on worker 0-0, policy_version 310206 (0.00091) [2022-07-09 15:34:03,833][26022] Updated weights on worker 0-0, policy_version 310216 (0.00106) [2022-07-09 15:34:05,728][26022] Updated weights on worker 0-0, policy_version 310226 (0.00085) [2022-07-09 15:34:06,715][25689] Fps is (10 sec: 5404.0, 60 sec: 5670.9, 300 sec: 5639.0). Total num frames: 317676544. Throughput: 0: 5839.2. Samples: 317686516. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:06,716][25689] Avg episode reward: [(0, '-45.508')] [2022-07-09 15:34:07,432][26022] Updated weights on worker 0-0, policy_version 310236 (0.00101) [2022-07-09 15:34:09,291][26022] Updated weights on worker 0-0, policy_version 310246 (0.00088) [2022-07-09 15:34:11,103][26022] Updated weights on worker 0-0, policy_version 310256 (0.00088) [2022-07-09 15:34:11,782][25689] Fps is (10 sec: 5684.7, 60 sec: 5672.5, 300 sec: 5641.4). Total num frames: 317706240. Throughput: 0: 4974.2. Samples: 317703224. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:11,783][25689] Avg episode reward: [(0, '-46.050')] [2022-07-09 15:34:12,739][26022] Updated weights on worker 0-0, policy_version 310266 (0.00104) [2022-07-09 15:34:14,636][26022] Updated weights on worker 0-0, policy_version 310276 (0.00093) [2022-07-09 15:34:16,602][26022] Updated weights on worker 0-0, policy_version 310286 (0.00086) [2022-07-09 15:34:16,809][25689] Fps is (10 sec: 5681.1, 60 sec: 5644.3, 300 sec: 5641.0). Total num frames: 317733888. Throughput: 0: 5810.1. Samples: 317737224. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:16,810][25689] Avg episode reward: [(0, '-45.158')] [2022-07-09 15:34:18,331][26022] Updated weights on worker 0-0, policy_version 310296 (0.00093) [2022-07-09 15:34:20,103][26022] Updated weights on worker 0-0, policy_version 310306 (0.00088) [2022-07-09 15:34:21,850][25689] Fps is (10 sec: 5594.0, 60 sec: 5642.0, 300 sec: 5637.2). Total num frames: 317762560. Throughput: 0: 5798.0. Samples: 317771568. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:21,850][25689] Avg episode reward: [(0, '-44.540')] [2022-07-09 15:34:21,988][26022] Updated weights on worker 0-0, policy_version 310316 (0.00092) [2022-07-09 15:34:23,675][26022] Updated weights on worker 0-0, policy_version 310326 (0.00091) [2022-07-09 15:34:25,534][26022] Updated weights on worker 0-0, policy_version 310336 (0.00083) [2022-07-09 15:34:26,863][25689] Fps is (10 sec: 5805.6, 60 sec: 5661.9, 300 sec: 5642.0). Total num frames: 317792256. Throughput: 0: 5059.5. Samples: 317788452. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:26,863][25689] Avg episode reward: [(0, '-45.416')] [2022-07-09 15:34:27,143][26022] Updated weights on worker 0-0, policy_version 310346 (0.00088) [2022-07-09 15:34:29,243][26022] Updated weights on worker 0-0, policy_version 310356 (0.00088) [2022-07-09 15:34:30,957][26022] Updated weights on worker 0-0, policy_version 310366 (0.00090) [2022-07-09 15:34:31,991][25689] Fps is (10 sec: 5553.5, 60 sec: 5640.5, 300 sec: 5636.5). Total num frames: 317818880. Throughput: 0: 5911.7. Samples: 317822694. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:31,992][25689] Avg episode reward: [(0, '-45.997')] [2022-07-09 15:34:32,644][26022] Updated weights on worker 0-0, policy_version 310376 (0.00092) [2022-07-09 15:34:34,523][26022] Updated weights on worker 0-0, policy_version 310386 (0.00094) [2022-07-09 15:34:36,358][26022] Updated weights on worker 0-0, policy_version 310396 (0.00084) [2022-07-09 15:34:37,036][25689] Fps is (10 sec: 5636.8, 60 sec: 5657.2, 300 sec: 5639.5). Total num frames: 317849600. Throughput: 0: 5916.9. Samples: 317856906. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:37,036][25689] Avg episode reward: [(0, '-45.742')] [2022-07-09 15:34:38,161][26022] Updated weights on worker 0-0, policy_version 310406 (0.00086) [2022-07-09 15:34:39,842][26022] Updated weights on worker 0-0, policy_version 310416 (0.00089) [2022-07-09 15:34:41,789][26022] Updated weights on worker 0-0, policy_version 310426 (0.00089) [2022-07-09 15:34:42,076][25689] Fps is (10 sec: 5788.0, 60 sec: 5637.3, 300 sec: 5642.6). Total num frames: 317877248. Throughput: 0: 5067.4. Samples: 317874060. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:42,076][25689] Avg episode reward: [(0, '-46.228')] [2022-07-09 15:34:43,550][26022] Updated weights on worker 0-0, policy_version 310436 (0.00094) [2022-07-09 15:34:45,384][26022] Updated weights on worker 0-0, policy_version 310446 (0.00096) [2022-07-09 15:34:47,111][25689] Fps is (10 sec: 5691.8, 60 sec: 5635.2, 300 sec: 5643.9). Total num frames: 317906944. Throughput: 0: 5912.7. Samples: 317908172. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:47,111][25689] Avg episode reward: [(0, '-46.664')] [2022-07-09 15:34:47,115][26022] Updated weights on worker 0-0, policy_version 310456 (0.00089) [2022-07-09 15:34:49,048][26022] Updated weights on worker 0-0, policy_version 310466 (0.00088) [2022-07-09 15:34:50,800][26022] Updated weights on worker 0-0, policy_version 310476 (0.00088) [2022-07-09 15:34:52,248][25689] Fps is (10 sec: 5738.0, 60 sec: 5645.2, 300 sec: 5642.0). Total num frames: 317935616. Throughput: 0: 5899.1. Samples: 317942190. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:52,248][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 15:34:52,609][26022] Updated weights on worker 0-0, policy_version 310486 (0.00087) [2022-07-09 15:34:54,463][26022] Updated weights on worker 0-0, policy_version 310496 (0.00525) [2022-07-09 15:34:56,195][26022] Updated weights on worker 0-0, policy_version 310506 (0.00087) [2022-07-09 15:34:57,285][25689] Fps is (10 sec: 5535.2, 60 sec: 5625.7, 300 sec: 5638.3). Total num frames: 317963264. Throughput: 0: 5047.6. Samples: 317959118. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:34:57,286][25689] Avg episode reward: [(0, '-47.318')] [2022-07-09 15:34:58,048][26022] Updated weights on worker 0-0, policy_version 310516 (0.00083) [2022-07-09 15:34:59,927][26022] Updated weights on worker 0-0, policy_version 310526 (0.00095) [2022-07-09 15:35:01,976][26022] Updated weights on worker 0-0, policy_version 310536 (0.00081) [2022-07-09 15:35:02,291][25689] Fps is (10 sec: 5505.7, 60 sec: 5644.2, 300 sec: 5641.7). Total num frames: 317990912. Throughput: 0: 5898.7. Samples: 317993306. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:35:02,292][25689] Avg episode reward: [(0, '-46.398')] [2022-07-09 15:35:03,932][26022] Updated weights on worker 0-0, policy_version 310546 (0.00091) [2022-07-09 15:35:05,667][26022] Updated weights on worker 0-0, policy_version 310556 (0.00089) [2022-07-09 15:35:07,326][25689] Fps is (10 sec: 5405.2, 60 sec: 5625.9, 300 sec: 5642.2). Total num frames: 318017536. Throughput: 0: 5783.1. Samples: 318025080. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:35:07,327][25689] Avg episode reward: [(0, '-45.938')] [2022-07-09 15:35:07,411][26022] Updated weights on worker 0-0, policy_version 310566 (0.00085) [2022-07-09 15:35:09,187][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:35:09,201][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000310575_318028800.pth [2022-07-09 15:35:09,201][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000308591_315997184.pth [2022-07-09 15:35:09,287][26022] Updated weights on worker 0-0, policy_version 310576 (0.00088) [2022-07-09 15:35:10,888][26022] Updated weights on worker 0-0, policy_version 310586 (0.00086) [2022-07-09 15:35:12,471][25689] Fps is (10 sec: 5532.4, 60 sec: 5618.7, 300 sec: 5639.6). Total num frames: 318047232. Throughput: 0: 4934.6. Samples: 318041984. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:35:12,472][25689] Avg episode reward: [(0, '-45.014')] [2022-07-09 15:35:13,013][26022] Updated weights on worker 0-0, policy_version 310596 (0.00087) [2022-07-09 15:35:14,567][26022] Updated weights on worker 0-0, policy_version 310606 (0.00087) [2022-07-09 15:35:16,651][26022] Updated weights on worker 0-0, policy_version 310616 (0.00087) [2022-07-09 15:35:17,538][25689] Fps is (10 sec: 5815.7, 60 sec: 5648.6, 300 sec: 5642.4). Total num frames: 318076928. Throughput: 0: 5758.3. Samples: 318075740. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:35:17,539][25689] Avg episode reward: [(0, '-44.944')] [2022-07-09 15:35:18,436][26022] Updated weights on worker 0-0, policy_version 310626 (0.00092) [2022-07-09 15:35:20,153][26022] Updated weights on worker 0-0, policy_version 310636 (0.00058) [2022-07-09 15:35:22,221][26022] Updated weights on worker 0-0, policy_version 310646 (0.00092) [2022-07-09 15:35:22,643][25689] Fps is (10 sec: 5536.9, 60 sec: 5609.1, 300 sec: 5630.4). Total num frames: 318103552. Throughput: 0: 5704.5. Samples: 318109402. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 15:35:22,643][25689] Avg episode reward: [(0, '-44.315')] [2022-07-09 15:35:23,675][26022] Updated weights on worker 0-0, policy_version 310656 (0.00084) [2022-07-09 15:35:25,964][26022] Updated weights on worker 0-0, policy_version 310666 (0.00090) [2022-07-09 15:35:27,271][26022] Updated weights on worker 0-0, policy_version 310676 (0.00488) [2022-07-09 15:35:27,733][25689] Fps is (10 sec: 5524.7, 60 sec: 5602.0, 300 sec: 5640.4). Total num frames: 318133248. Throughput: 0: 4957.7. Samples: 318126248. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:27,733][25689] Avg episode reward: [(0, '-44.318')] [2022-07-09 15:35:29,500][26022] Updated weights on worker 0-0, policy_version 310686 (0.00092) [2022-07-09 15:35:31,082][26022] Updated weights on worker 0-0, policy_version 310696 (0.00083) [2022-07-09 15:35:32,837][25689] Fps is (10 sec: 5625.2, 60 sec: 5621.0, 300 sec: 5632.1). Total num frames: 318160896. Throughput: 0: 5806.2. Samples: 318160232. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:32,837][25689] Avg episode reward: [(0, '-44.326')] [2022-07-09 15:35:32,923][26022] Updated weights on worker 0-0, policy_version 310706 (0.00086) [2022-07-09 15:35:34,683][26022] Updated weights on worker 0-0, policy_version 310716 (0.00090) [2022-07-09 15:35:36,386][26022] Updated weights on worker 0-0, policy_version 310726 (0.00084) [2022-07-09 15:35:37,887][25689] Fps is (10 sec: 5747.8, 60 sec: 5620.5, 300 sec: 5638.4). Total num frames: 318191616. Throughput: 0: 5850.6. Samples: 318194794. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:37,888][25689] Avg episode reward: [(0, '-44.238')] [2022-07-09 15:35:38,471][26022] Updated weights on worker 0-0, policy_version 310736 (0.00087) [2022-07-09 15:35:40,055][26022] Updated weights on worker 0-0, policy_version 310746 (0.00082) [2022-07-09 15:35:41,997][26022] Updated weights on worker 0-0, policy_version 310756 (0.00092) [2022-07-09 15:35:42,958][25689] Fps is (10 sec: 5969.5, 60 sec: 5651.3, 300 sec: 5644.2). Total num frames: 318221312. Throughput: 0: 5886.8. Samples: 318228992. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:42,958][25689] Avg episode reward: [(0, '-45.100')] [2022-07-09 15:35:43,630][26022] Updated weights on worker 0-0, policy_version 310766 (0.00089) [2022-07-09 15:35:45,430][26022] Updated weights on worker 0-0, policy_version 310776 (0.00081) [2022-07-09 15:35:47,426][26022] Updated weights on worker 0-0, policy_version 310786 (0.00089) [2022-07-09 15:35:47,963][25689] Fps is (10 sec: 5590.0, 60 sec: 5603.6, 300 sec: 5635.5). Total num frames: 318247936. Throughput: 0: 5922.0. Samples: 318246050. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:47,963][25689] Avg episode reward: [(0, '-44.300')] [2022-07-09 15:35:49,093][26022] Updated weights on worker 0-0, policy_version 310796 (0.00090) [2022-07-09 15:35:50,895][26022] Updated weights on worker 0-0, policy_version 310806 (0.00096) [2022-07-09 15:35:52,999][26022] Updated weights on worker 0-0, policy_version 310816 (0.00090) [2022-07-09 15:35:53,023][25689] Fps is (10 sec: 5391.8, 60 sec: 5593.8, 300 sec: 5631.3). Total num frames: 318275584. Throughput: 0: 5926.6. Samples: 318279868. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:53,024][25689] Avg episode reward: [(0, '-44.417')] [2022-07-09 15:35:54,438][26022] Updated weights on worker 0-0, policy_version 310826 (0.00093) [2022-07-09 15:35:56,543][26022] Updated weights on worker 0-0, policy_version 310836 (0.00083) [2022-07-09 15:35:58,043][25689] Fps is (10 sec: 5688.9, 60 sec: 5629.2, 300 sec: 5634.7). Total num frames: 318305280. Throughput: 0: 5895.5. Samples: 318313620. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:35:58,044][25689] Avg episode reward: [(0, '-44.685')] [2022-07-09 15:35:58,210][26022] Updated weights on worker 0-0, policy_version 310846 (0.00090) [2022-07-09 15:35:59,972][26022] Updated weights on worker 0-0, policy_version 310856 (0.00092) [2022-07-09 15:36:01,937][26022] Updated weights on worker 0-0, policy_version 310866 (0.00091) [2022-07-09 15:36:03,068][25689] Fps is (10 sec: 5607.0, 60 sec: 5610.6, 300 sec: 5631.0). Total num frames: 318331904. Throughput: 0: 5059.9. Samples: 318330748. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:03,069][25689] Avg episode reward: [(0, '-45.290')] [2022-07-09 15:36:03,933][26022] Updated weights on worker 0-0, policy_version 310876 (0.00090) [2022-07-09 15:36:05,863][26022] Updated weights on worker 0-0, policy_version 310886 (0.00084) [2022-07-09 15:36:07,727][26022] Updated weights on worker 0-0, policy_version 310896 (0.00087) [2022-07-09 15:36:08,095][25689] Fps is (10 sec: 5296.9, 60 sec: 5611.3, 300 sec: 5632.7). Total num frames: 318358528. Throughput: 0: 5800.5. Samples: 318362830. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:08,096][25689] Avg episode reward: [(0, '-45.116')] [2022-07-09 15:36:09,445][26022] Updated weights on worker 0-0, policy_version 310906 (0.00084) [2022-07-09 15:36:11,296][26022] Updated weights on worker 0-0, policy_version 310916 (0.00089) [2022-07-09 15:36:13,031][26022] Updated weights on worker 0-0, policy_version 310926 (0.00090) [2022-07-09 15:36:13,165][25689] Fps is (10 sec: 5679.2, 60 sec: 5635.1, 300 sec: 5638.4). Total num frames: 318389248. Throughput: 0: 5816.7. Samples: 318397028. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:13,166][25689] Avg episode reward: [(0, '-45.200')] [2022-07-09 15:36:14,725][26022] Updated weights on worker 0-0, policy_version 310936 (0.00092) [2022-07-09 15:36:16,573][26022] Updated weights on worker 0-0, policy_version 310946 (0.00087) [2022-07-09 15:36:18,168][25689] Fps is (10 sec: 5896.2, 60 sec: 5624.2, 300 sec: 5632.6). Total num frames: 318417920. Throughput: 0: 5003.8. Samples: 318414324. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:18,169][25689] Avg episode reward: [(0, '-44.958')] [2022-07-09 15:36:18,471][26022] Updated weights on worker 0-0, policy_version 310956 (0.00090) [2022-07-09 15:36:20,195][26022] Updated weights on worker 0-0, policy_version 310966 (0.00089) [2022-07-09 15:36:22,271][26022] Updated weights on worker 0-0, policy_version 310976 (0.00085) [2022-07-09 15:36:23,241][25689] Fps is (10 sec: 5589.7, 60 sec: 5644.0, 300 sec: 5635.1). Total num frames: 318445568. Throughput: 0: 5835.8. Samples: 318448474. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:23,241][25689] Avg episode reward: [(0, '-46.092')] [2022-07-09 15:36:23,508][26022] Updated weights on worker 0-0, policy_version 310986 (0.00094) [2022-07-09 15:36:25,763][26022] Updated weights on worker 0-0, policy_version 310996 (0.00087) [2022-07-09 15:36:27,203][26022] Updated weights on worker 0-0, policy_version 311006 (0.00083) [2022-07-09 15:36:28,251][25689] Fps is (10 sec: 5484.2, 60 sec: 5617.6, 300 sec: 5629.5). Total num frames: 318473216. Throughput: 0: 5942.1. Samples: 318482598. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:28,252][25689] Avg episode reward: [(0, '-45.932')] [2022-07-09 15:36:29,227][26022] Updated weights on worker 0-0, policy_version 311016 (0.00088) [2022-07-09 15:36:30,892][26022] Updated weights on worker 0-0, policy_version 311026 (0.00087) [2022-07-09 15:36:32,939][26022] Updated weights on worker 0-0, policy_version 311036 (0.00087) [2022-07-09 15:36:33,335][25689] Fps is (10 sec: 5782.5, 60 sec: 5670.3, 300 sec: 5636.2). Total num frames: 318503936. Throughput: 0: 5075.6. Samples: 318499402. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:33,338][25689] Avg episode reward: [(0, '-45.559')] [2022-07-09 15:36:34,587][26022] Updated weights on worker 0-0, policy_version 311046 (0.00081) [2022-07-09 15:36:36,545][26022] Updated weights on worker 0-0, policy_version 311056 (0.00086) [2022-07-09 15:36:37,999][26022] Updated weights on worker 0-0, policy_version 311066 (0.00093) [2022-07-09 15:36:38,363][25689] Fps is (10 sec: 5873.2, 60 sec: 5638.5, 300 sec: 5640.3). Total num frames: 318532608. Throughput: 0: 5902.6. Samples: 318533528. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:38,364][25689] Avg episode reward: [(0, '-46.716')] [2022-07-09 15:36:40,221][26022] Updated weights on worker 0-0, policy_version 311076 (0.01105) [2022-07-09 15:36:41,769][26022] Updated weights on worker 0-0, policy_version 311086 (0.00089) [2022-07-09 15:36:43,395][25689] Fps is (10 sec: 5496.6, 60 sec: 5591.3, 300 sec: 5629.8). Total num frames: 318559232. Throughput: 0: 5913.1. Samples: 318567646. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:43,395][25689] Avg episode reward: [(0, '-48.125')] [2022-07-09 15:36:43,778][26022] Updated weights on worker 0-0, policy_version 311096 (0.00096) [2022-07-09 15:36:45,454][26022] Updated weights on worker 0-0, policy_version 311106 (0.00098) [2022-07-09 15:36:47,323][26022] Updated weights on worker 0-0, policy_version 311116 (0.00091) [2022-07-09 15:36:48,403][25689] Fps is (10 sec: 5507.6, 60 sec: 5624.8, 300 sec: 5631.4). Total num frames: 318587904. Throughput: 0: 5069.6. Samples: 318584762. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:48,404][25689] Avg episode reward: [(0, '-47.143')] [2022-07-09 15:36:49,203][26022] Updated weights on worker 0-0, policy_version 311126 (0.00090) [2022-07-09 15:36:51,123][26022] Updated weights on worker 0-0, policy_version 311136 (0.00088) [2022-07-09 15:36:52,518][26022] Updated weights on worker 0-0, policy_version 311146 (0.00080) [2022-07-09 15:36:53,529][25689] Fps is (10 sec: 5759.4, 60 sec: 5652.6, 300 sec: 5632.6). Total num frames: 318617600. Throughput: 0: 5900.1. Samples: 318618552. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:53,530][25689] Avg episode reward: [(0, '-47.439')] [2022-07-09 15:36:54,673][26022] Updated weights on worker 0-0, policy_version 311156 (0.00087) [2022-07-09 15:36:56,102][26022] Updated weights on worker 0-0, policy_version 311166 (0.00096) [2022-07-09 15:36:58,261][26022] Updated weights on worker 0-0, policy_version 311176 (0.00090) [2022-07-09 15:36:58,557][25689] Fps is (10 sec: 5748.7, 60 sec: 5634.9, 300 sec: 5635.6). Total num frames: 318646272. Throughput: 0: 5909.1. Samples: 318652852. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:36:58,557][25689] Avg episode reward: [(0, '-47.360')] [2022-07-09 15:36:59,866][26022] Updated weights on worker 0-0, policy_version 311186 (0.00080) [2022-07-09 15:37:01,732][26022] Updated weights on worker 0-0, policy_version 311196 (0.00092) [2022-07-09 15:37:03,590][25689] Fps is (10 sec: 5496.1, 60 sec: 5634.2, 300 sec: 5633.4). Total num frames: 318672896. Throughput: 0: 5069.1. Samples: 318670018. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:03,591][25689] Avg episode reward: [(0, '-47.346')] [2022-07-09 15:37:04,013][26022] Updated weights on worker 0-0, policy_version 311206 (0.00093) [2022-07-09 15:37:05,721][26022] Updated weights on worker 0-0, policy_version 311216 (0.00091) [2022-07-09 15:37:07,343][26022] Updated weights on worker 0-0, policy_version 311226 (0.00089) [2022-07-09 15:37:08,626][25689] Fps is (10 sec: 5491.4, 60 sec: 5667.2, 300 sec: 5635.5). Total num frames: 318701568. Throughput: 0: 5800.3. Samples: 318702060. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:08,627][25689] Avg episode reward: [(0, '-46.367')] [2022-07-09 15:37:09,251][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:37:09,267][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000311235_318704640.pth [2022-07-09 15:37:09,267][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000309252_316674048.pth [2022-07-09 15:37:09,346][26022] Updated weights on worker 0-0, policy_version 311236 (0.00083) [2022-07-09 15:37:11,015][26022] Updated weights on worker 0-0, policy_version 311246 (0.00084) [2022-07-09 15:37:12,993][26022] Updated weights on worker 0-0, policy_version 311256 (0.00099) [2022-07-09 15:37:13,687][25689] Fps is (10 sec: 5679.7, 60 sec: 5634.2, 300 sec: 5631.9). Total num frames: 318730240. Throughput: 0: 5834.9. Samples: 318736168. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:13,687][25689] Avg episode reward: [(0, '-45.318')] [2022-07-09 15:37:14,682][26022] Updated weights on worker 0-0, policy_version 311266 (0.00092) [2022-07-09 15:37:16,558][26022] Updated weights on worker 0-0, policy_version 311276 (0.00087) [2022-07-09 15:37:18,435][26022] Updated weights on worker 0-0, policy_version 311286 (0.00097) [2022-07-09 15:37:18,691][25689] Fps is (10 sec: 5595.9, 60 sec: 5617.2, 300 sec: 5633.5). Total num frames: 318757888. Throughput: 0: 4989.2. Samples: 318753304. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:18,691][25689] Avg episode reward: [(0, '-45.228')] [2022-07-09 15:37:20,144][26022] Updated weights on worker 0-0, policy_version 311296 (0.00087) [2022-07-09 15:37:21,940][26022] Updated weights on worker 0-0, policy_version 311306 (0.00090) [2022-07-09 15:37:23,549][26022] Updated weights on worker 0-0, policy_version 311316 (0.00082) [2022-07-09 15:37:23,695][25689] Fps is (10 sec: 5729.5, 60 sec: 5657.4, 300 sec: 5637.3). Total num frames: 318787584. Throughput: 0: 5833.5. Samples: 318787300. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:23,696][25689] Avg episode reward: [(0, '-45.423')] [2022-07-09 15:37:25,914][26022] Updated weights on worker 0-0, policy_version 311326 (0.00082) [2022-07-09 15:37:27,331][26022] Updated weights on worker 0-0, policy_version 311336 (0.00085) [2022-07-09 15:37:28,699][25689] Fps is (10 sec: 5627.2, 60 sec: 5641.1, 300 sec: 5631.7). Total num frames: 318814208. Throughput: 0: 5936.5. Samples: 318821224. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:28,701][25689] Avg episode reward: [(0, '-45.403')] [2022-07-09 15:37:29,396][26022] Updated weights on worker 0-0, policy_version 311346 (0.00090) [2022-07-09 15:37:31,024][26022] Updated weights on worker 0-0, policy_version 311356 (0.00087) [2022-07-09 15:37:32,906][26022] Updated weights on worker 0-0, policy_version 311366 (0.00090) [2022-07-09 15:37:33,738][25689] Fps is (10 sec: 5607.8, 60 sec: 5628.3, 300 sec: 5635.2). Total num frames: 318843904. Throughput: 0: 5085.6. Samples: 318838142. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:33,739][25689] Avg episode reward: [(0, '-46.035')] [2022-07-09 15:37:34,480][26022] Updated weights on worker 0-0, policy_version 311376 (0.00090) [2022-07-09 15:37:36,257][26022] Updated weights on worker 0-0, policy_version 311386 (0.00595) [2022-07-09 15:37:38,211][26022] Updated weights on worker 0-0, policy_version 311396 (0.00082) [2022-07-09 15:37:38,744][25689] Fps is (10 sec: 5810.9, 60 sec: 5630.5, 300 sec: 5635.3). Total num frames: 318872576. Throughput: 0: 5939.8. Samples: 318872414. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:38,744][25689] Avg episode reward: [(0, '-46.163')] [2022-07-09 15:37:40,130][26022] Updated weights on worker 0-0, policy_version 311406 (0.00094) [2022-07-09 15:37:41,770][26022] Updated weights on worker 0-0, policy_version 311416 (0.00087) [2022-07-09 15:37:43,647][26022] Updated weights on worker 0-0, policy_version 311426 (0.00088) [2022-07-09 15:37:43,755][25689] Fps is (10 sec: 5622.8, 60 sec: 5649.3, 300 sec: 5633.3). Total num frames: 318900224. Throughput: 0: 5937.7. Samples: 318906406. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:43,755][25689] Avg episode reward: [(0, '-46.513')] [2022-07-09 15:37:45,274][26022] Updated weights on worker 0-0, policy_version 311436 (0.00094) [2022-07-09 15:37:47,285][26022] Updated weights on worker 0-0, policy_version 311446 (0.00091) [2022-07-09 15:37:48,757][25689] Fps is (10 sec: 5624.6, 60 sec: 5649.9, 300 sec: 5641.8). Total num frames: 318928896. Throughput: 0: 5098.5. Samples: 318923488. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-09 15:37:48,757][25689] Avg episode reward: [(0, '-46.915')] [2022-07-09 15:37:49,074][26022] Updated weights on worker 0-0, policy_version 311456 (0.00090) [2022-07-09 15:37:51,038][26022] Updated weights on worker 0-0, policy_version 311466 (0.00082) [2022-07-09 15:37:52,681][26022] Updated weights on worker 0-0, policy_version 311476 (0.00093) [2022-07-09 15:37:53,798][25689] Fps is (10 sec: 5709.9, 60 sec: 5640.9, 300 sec: 5631.2). Total num frames: 318957568. Throughput: 0: 5933.6. Samples: 318957166. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:37:53,798][25689] Avg episode reward: [(0, '-46.590')] [2022-07-09 15:37:54,699][26022] Updated weights on worker 0-0, policy_version 311486 (0.00082) [2022-07-09 15:37:56,271][26022] Updated weights on worker 0-0, policy_version 311496 (0.00070) [2022-07-09 15:37:58,324][26022] Updated weights on worker 0-0, policy_version 311506 (0.00087) [2022-07-09 15:37:58,835][25689] Fps is (10 sec: 5588.5, 60 sec: 5623.0, 300 sec: 5635.2). Total num frames: 318985216. Throughput: 0: 5900.3. Samples: 318990956. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:37:58,835][25689] Avg episode reward: [(0, '-46.938')] [2022-07-09 15:37:59,931][26022] Updated weights on worker 0-0, policy_version 311516 (0.00093) [2022-07-09 15:38:02,187][26022] Updated weights on worker 0-0, policy_version 311526 (0.00123) [2022-07-09 15:38:03,844][25689] Fps is (10 sec: 5401.9, 60 sec: 5625.2, 300 sec: 5639.2). Total num frames: 319011840. Throughput: 0: 5043.0. Samples: 319007718. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:03,845][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 15:38:04,003][26022] Updated weights on worker 0-0, policy_version 311536 (0.00084) [2022-07-09 15:38:06,073][26022] Updated weights on worker 0-0, policy_version 311546 (0.00084) [2022-07-09 15:38:07,628][26022] Updated weights on worker 0-0, policy_version 311556 (0.00093) [2022-07-09 15:38:08,901][25689] Fps is (10 sec: 5391.3, 60 sec: 5606.3, 300 sec: 5632.8). Total num frames: 319039488. Throughput: 0: 5760.0. Samples: 319039520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:08,903][25689] Avg episode reward: [(0, '-47.474')] [2022-07-09 15:38:09,679][26022] Updated weights on worker 0-0, policy_version 311566 (0.00083) [2022-07-09 15:38:11,249][26022] Updated weights on worker 0-0, policy_version 311577 (0.00086) [2022-07-09 15:38:13,364][26022] Updated weights on worker 0-0, policy_version 311587 (0.00085) [2022-07-09 15:38:13,942][25689] Fps is (10 sec: 5679.2, 60 sec: 5625.2, 300 sec: 5633.7). Total num frames: 319069184. Throughput: 0: 5779.1. Samples: 319073580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:13,942][25689] Avg episode reward: [(0, '-47.690')] [2022-07-09 15:38:15,053][26022] Updated weights on worker 0-0, policy_version 311597 (0.00085) [2022-07-09 15:38:16,735][26022] Updated weights on worker 0-0, policy_version 311607 (0.00084) [2022-07-09 15:38:18,764][26022] Updated weights on worker 0-0, policy_version 311617 (0.00092) [2022-07-09 15:38:18,961][25689] Fps is (10 sec: 5700.5, 60 sec: 5623.7, 300 sec: 5630.2). Total num frames: 319096832. Throughput: 0: 4963.7. Samples: 319090856. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:18,962][25689] Avg episode reward: [(0, '-47.809')] [2022-07-09 15:38:20,408][26022] Updated weights on worker 0-0, policy_version 311627 (0.00092) [2022-07-09 15:38:22,139][26022] Updated weights on worker 0-0, policy_version 311637 (0.00086) [2022-07-09 15:38:23,968][25689] Fps is (10 sec: 5617.1, 60 sec: 5606.5, 300 sec: 5630.9). Total num frames: 319125504. Throughput: 0: 5843.9. Samples: 319125320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:23,969][25689] Avg episode reward: [(0, '-47.906')] [2022-07-09 15:38:24,204][26022] Updated weights on worker 0-0, policy_version 311647 (0.00091) [2022-07-09 15:38:25,600][26022] Updated weights on worker 0-0, policy_version 311657 (0.00090) [2022-07-09 15:38:27,976][26022] Updated weights on worker 0-0, policy_version 311667 (0.00078) [2022-07-09 15:38:28,972][25689] Fps is (10 sec: 5830.3, 60 sec: 5657.5, 300 sec: 5639.3). Total num frames: 319155200. Throughput: 0: 5965.2. Samples: 319159246. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:28,973][25689] Avg episode reward: [(0, '-47.497')] [2022-07-09 15:38:29,339][26022] Updated weights on worker 0-0, policy_version 311677 (0.00089) [2022-07-09 15:38:31,457][26022] Updated weights on worker 0-0, policy_version 311687 (0.00100) [2022-07-09 15:38:32,969][26022] Updated weights on worker 0-0, policy_version 311697 (0.00093) [2022-07-09 15:38:34,093][25689] Fps is (10 sec: 5562.6, 60 sec: 5598.9, 300 sec: 5627.4). Total num frames: 319181824. Throughput: 0: 5092.0. Samples: 319176190. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:34,093][25689] Avg episode reward: [(0, '-46.948')] [2022-07-09 15:38:34,961][26022] Updated weights on worker 0-0, policy_version 311707 (0.00091) [2022-07-09 15:38:36,695][26022] Updated weights on worker 0-0, policy_version 311717 (0.00083) [2022-07-09 15:38:38,594][26022] Updated weights on worker 0-0, policy_version 311727 (0.00093) [2022-07-09 15:38:39,159][25689] Fps is (10 sec: 5528.5, 60 sec: 5610.2, 300 sec: 5629.8). Total num frames: 319211520. Throughput: 0: 5894.1. Samples: 319209908. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:39,160][25689] Avg episode reward: [(0, '-46.805')] [2022-07-09 15:38:40,311][26022] Updated weights on worker 0-0, policy_version 311737 (0.00091) [2022-07-09 15:38:42,235][26022] Updated weights on worker 0-0, policy_version 311747 (0.00088) [2022-07-09 15:38:43,783][26022] Updated weights on worker 0-0, policy_version 311757 (0.00088) [2022-07-09 15:38:44,188][25689] Fps is (10 sec: 5883.0, 60 sec: 5642.4, 300 sec: 5629.5). Total num frames: 319241216. Throughput: 0: 5871.0. Samples: 319244032. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:44,189][25689] Avg episode reward: [(0, '-45.819')] [2022-07-09 15:38:45,820][26022] Updated weights on worker 0-0, policy_version 311767 (0.00112) [2022-07-09 15:38:47,321][26022] Updated weights on worker 0-0, policy_version 311777 (0.00085) [2022-07-09 15:38:49,287][25689] Fps is (10 sec: 5561.1, 60 sec: 5599.6, 300 sec: 5625.4). Total num frames: 319267840. Throughput: 0: 5863.6. Samples: 319278364. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:49,288][25689] Avg episode reward: [(0, '-45.702')] [2022-07-09 15:38:49,343][26022] Updated weights on worker 0-0, policy_version 311787 (0.00093) [2022-07-09 15:38:51,208][26022] Updated weights on worker 0-0, policy_version 311797 (0.00087) [2022-07-09 15:38:52,968][26022] Updated weights on worker 0-0, policy_version 311807 (0.00089) [2022-07-09 15:38:54,350][25689] Fps is (10 sec: 5542.3, 60 sec: 5614.4, 300 sec: 5627.8). Total num frames: 319297536. Throughput: 0: 5861.0. Samples: 319294918. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:54,351][25689] Avg episode reward: [(0, '-46.104')] [2022-07-09 15:38:54,767][26022] Updated weights on worker 0-0, policy_version 311817 (0.00085) [2022-07-09 15:38:56,432][26022] Updated weights on worker 0-0, policy_version 311827 (0.00094) [2022-07-09 15:38:58,391][26022] Updated weights on worker 0-0, policy_version 311837 (0.00085) [2022-07-09 15:38:59,396][25689] Fps is (10 sec: 5773.6, 60 sec: 5630.5, 300 sec: 5634.2). Total num frames: 319326208. Throughput: 0: 5900.4. Samples: 319329314. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:38:59,397][25689] Avg episode reward: [(0, '-46.874')] [2022-07-09 15:39:00,230][26022] Updated weights on worker 0-0, policy_version 311847 (0.00087) [2022-07-09 15:39:02,464][26022] Updated weights on worker 0-0, policy_version 311857 (0.00086) [2022-07-09 15:39:04,091][26022] Updated weights on worker 0-0, policy_version 311867 (0.00089) [2022-07-09 15:39:04,426][25689] Fps is (10 sec: 5589.6, 60 sec: 5645.5, 300 sec: 5634.0). Total num frames: 319353856. Throughput: 0: 5788.0. Samples: 319361168. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:04,427][25689] Avg episode reward: [(0, '-47.192')] [2022-07-09 15:39:06,035][26022] Updated weights on worker 0-0, policy_version 311877 (0.00087) [2022-07-09 15:39:07,732][26022] Updated weights on worker 0-0, policy_version 311887 (0.00090) [2022-07-09 15:39:09,292][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:39:09,301][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000311895_319380480.pth [2022-07-09 15:39:09,301][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000309912_317349888.pth [2022-07-09 15:39:09,446][25689] Fps is (10 sec: 5400.5, 60 sec: 5632.1, 300 sec: 5624.6). Total num frames: 319380480. Throughput: 0: 4955.7. Samples: 319378266. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:09,446][25689] Avg episode reward: [(0, '-47.796')] [2022-07-09 15:39:09,595][26022] Updated weights on worker 0-0, policy_version 311897 (0.00087) [2022-07-09 15:39:11,399][26022] Updated weights on worker 0-0, policy_version 311907 (0.00086) [2022-07-09 15:39:13,331][26022] Updated weights on worker 0-0, policy_version 311917 (0.00084) [2022-07-09 15:39:14,550][25689] Fps is (10 sec: 5563.0, 60 sec: 5626.1, 300 sec: 5630.0). Total num frames: 319410176. Throughput: 0: 5799.6. Samples: 319412070. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:14,551][25689] Avg episode reward: [(0, '-47.975')] [2022-07-09 15:39:15,088][26022] Updated weights on worker 0-0, policy_version 311927 (0.00094) [2022-07-09 15:39:16,947][26022] Updated weights on worker 0-0, policy_version 311937 (0.00084) [2022-07-09 15:39:18,635][26022] Updated weights on worker 0-0, policy_version 311947 (0.00081) [2022-07-09 15:39:19,587][25689] Fps is (10 sec: 5755.7, 60 sec: 5641.4, 300 sec: 5630.1). Total num frames: 319438848. Throughput: 0: 5803.7. Samples: 319446494. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:19,587][25689] Avg episode reward: [(0, '-47.416')] [2022-07-09 15:39:20,587][26022] Updated weights on worker 0-0, policy_version 311957 (0.00087) [2022-07-09 15:39:22,095][26022] Updated weights on worker 0-0, policy_version 311967 (0.00086) [2022-07-09 15:39:24,231][26022] Updated weights on worker 0-0, policy_version 311977 (0.00088) [2022-07-09 15:39:24,599][25689] Fps is (10 sec: 5706.6, 60 sec: 5641.0, 300 sec: 5626.7). Total num frames: 319467520. Throughput: 0: 5073.9. Samples: 319463522. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:24,599][25689] Avg episode reward: [(0, '-47.466')] [2022-07-09 15:39:25,709][26022] Updated weights on worker 0-0, policy_version 311987 (0.00081) [2022-07-09 15:39:27,810][26022] Updated weights on worker 0-0, policy_version 311997 (0.00098) [2022-07-09 15:39:29,607][25689] Fps is (10 sec: 5518.5, 60 sec: 5589.9, 300 sec: 5629.0). Total num frames: 319494144. Throughput: 0: 5911.7. Samples: 319497452. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:29,607][25689] Avg episode reward: [(0, '-47.761')] [2022-07-09 15:39:29,695][26022] Updated weights on worker 0-0, policy_version 312007 (0.00087) [2022-07-09 15:39:31,428][26022] Updated weights on worker 0-0, policy_version 312017 (0.00085) [2022-07-09 15:39:33,375][26022] Updated weights on worker 0-0, policy_version 312027 (0.00091) [2022-07-09 15:39:34,647][25689] Fps is (10 sec: 5605.0, 60 sec: 5648.1, 300 sec: 5625.6). Total num frames: 319523840. Throughput: 0: 5921.6. Samples: 319531076. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:34,647][25689] Avg episode reward: [(0, '-47.876')] [2022-07-09 15:39:34,919][26022] Updated weights on worker 0-0, policy_version 312037 (0.00086) [2022-07-09 15:39:36,885][26022] Updated weights on worker 0-0, policy_version 312047 (0.00088) [2022-07-09 15:39:38,552][26022] Updated weights on worker 0-0, policy_version 312057 (0.00093) [2022-07-09 15:39:39,649][25689] Fps is (10 sec: 5710.4, 60 sec: 5620.3, 300 sec: 5626.3). Total num frames: 319551488. Throughput: 0: 5065.3. Samples: 319548116. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:39,649][25689] Avg episode reward: [(0, '-46.720')] [2022-07-09 15:39:40,491][26022] Updated weights on worker 0-0, policy_version 312067 (0.00090) [2022-07-09 15:39:42,202][26022] Updated weights on worker 0-0, policy_version 312077 (0.00094) [2022-07-09 15:39:44,089][26022] Updated weights on worker 0-0, policy_version 312087 (0.00056) [2022-07-09 15:39:44,678][25689] Fps is (10 sec: 5614.7, 60 sec: 5603.4, 300 sec: 5623.0). Total num frames: 319580160. Throughput: 0: 5919.6. Samples: 319582382. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:44,678][25689] Avg episode reward: [(0, '-47.227')] [2022-07-09 15:39:45,781][26022] Updated weights on worker 0-0, policy_version 312097 (0.00082) [2022-07-09 15:39:47,762][26022] Updated weights on worker 0-0, policy_version 312107 (0.00084) [2022-07-09 15:39:49,393][26022] Updated weights on worker 0-0, policy_version 312117 (0.00085) [2022-07-09 15:39:49,683][25689] Fps is (10 sec: 5714.7, 60 sec: 5645.9, 300 sec: 5625.5). Total num frames: 319608832. Throughput: 0: 5931.5. Samples: 319616536. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:49,684][25689] Avg episode reward: [(0, '-47.907')] [2022-07-09 15:39:51,397][26022] Updated weights on worker 0-0, policy_version 312127 (0.00087) [2022-07-09 15:39:52,980][26022] Updated weights on worker 0-0, policy_version 312137 (0.00089) [2022-07-09 15:39:54,795][25689] Fps is (10 sec: 5769.0, 60 sec: 5641.4, 300 sec: 5631.0). Total num frames: 319638528. Throughput: 0: 5078.2. Samples: 319633394. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:54,796][25689] Avg episode reward: [(0, '-47.757')] [2022-07-09 15:39:54,803][26022] Updated weights on worker 0-0, policy_version 312147 (0.00089) [2022-07-09 15:39:56,726][26022] Updated weights on worker 0-0, policy_version 312157 (0.00088) [2022-07-09 15:39:58,521][26022] Updated weights on worker 0-0, policy_version 312167 (0.00094) [2022-07-09 15:39:59,866][25689] Fps is (10 sec: 5732.0, 60 sec: 5639.0, 300 sec: 5633.2). Total num frames: 319667200. Throughput: 0: 5919.5. Samples: 319667792. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:39:59,867][25689] Avg episode reward: [(0, '-47.081')] [2022-07-09 15:40:00,254][26022] Updated weights on worker 0-0, policy_version 312177 (0.00088) [2022-07-09 15:40:02,402][26022] Updated weights on worker 0-0, policy_version 312187 (0.00086) [2022-07-09 15:40:04,123][26022] Updated weights on worker 0-0, policy_version 312197 (0.00096) [2022-07-09 15:40:04,896][25689] Fps is (10 sec: 5474.4, 60 sec: 5622.1, 300 sec: 5633.3). Total num frames: 319693824. Throughput: 0: 5817.1. Samples: 319699994. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:40:04,897][25689] Avg episode reward: [(0, '-47.646')] [2022-07-09 15:40:06,024][26022] Updated weights on worker 0-0, policy_version 312207 (0.00086) [2022-07-09 15:40:07,875][26022] Updated weights on worker 0-0, policy_version 312217 (0.00088) [2022-07-09 15:40:09,571][26022] Updated weights on worker 0-0, policy_version 312227 (0.00093) [2022-07-09 15:40:09,928][25689] Fps is (10 sec: 5495.4, 60 sec: 5654.8, 300 sec: 5631.9). Total num frames: 319722496. Throughput: 0: 4978.8. Samples: 319717332. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:40:09,930][25689] Avg episode reward: [(0, '-47.739')] [2022-07-09 15:40:11,279][26022] Updated weights on worker 0-0, policy_version 312237 (0.00089) [2022-07-09 15:40:13,234][26022] Updated weights on worker 0-0, policy_version 312247 (0.00089) [2022-07-09 15:40:14,912][26022] Updated weights on worker 0-0, policy_version 312257 (0.00097) [2022-07-09 15:40:14,978][25689] Fps is (10 sec: 5789.5, 60 sec: 5659.9, 300 sec: 5632.3). Total num frames: 319752192. Throughput: 0: 5862.8. Samples: 319751720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:40:14,978][25689] Avg episode reward: [(0, '-47.672')] [2022-07-09 15:40:16,765][26022] Updated weights on worker 0-0, policy_version 312267 (0.00350) [2022-07-09 15:40:18,440][26022] Updated weights on worker 0-0, policy_version 312277 (0.00098) [2022-07-09 15:40:20,071][25689] Fps is (10 sec: 5653.9, 60 sec: 5637.7, 300 sec: 5635.9). Total num frames: 319779840. Throughput: 0: 5871.2. Samples: 319786418. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-09 15:40:20,071][25689] Avg episode reward: [(0, '-47.410')] [2022-07-09 15:40:20,331][26022] Updated weights on worker 0-0, policy_version 312287 (0.00078) [2022-07-09 15:40:22,107][26022] Updated weights on worker 0-0, policy_version 312297 (0.00082) [2022-07-09 15:40:23,986][26022] Updated weights on worker 0-0, policy_version 312307 (0.00085) [2022-07-09 15:40:25,088][25689] Fps is (10 sec: 5773.0, 60 sec: 5671.1, 300 sec: 5640.8). Total num frames: 319810560. Throughput: 0: 5126.3. Samples: 319803504. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:25,089][25689] Avg episode reward: [(0, '-47.606')] [2022-07-09 15:40:25,538][26022] Updated weights on worker 0-0, policy_version 312317 (0.00088) [2022-07-09 15:40:27,395][26022] Updated weights on worker 0-0, policy_version 312327 (0.00086) [2022-07-09 15:40:29,373][26022] Updated weights on worker 0-0, policy_version 312337 (0.00093) [2022-07-09 15:40:30,115][25689] Fps is (10 sec: 5607.3, 60 sec: 5652.4, 300 sec: 5635.3). Total num frames: 319836160. Throughput: 0: 5952.4. Samples: 319837490. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:30,115][25689] Avg episode reward: [(0, '-46.338')] [2022-07-09 15:40:31,085][26022] Updated weights on worker 0-0, policy_version 312347 (0.00080) [2022-07-09 15:40:33,005][26022] Updated weights on worker 0-0, policy_version 312357 (0.00085) [2022-07-09 15:40:34,610][26022] Updated weights on worker 0-0, policy_version 312367 (0.00083) [2022-07-09 15:40:35,232][25689] Fps is (10 sec: 5653.2, 60 sec: 5679.0, 300 sec: 5637.5). Total num frames: 319867904. Throughput: 0: 5924.1. Samples: 319871708. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:35,232][25689] Avg episode reward: [(0, '-46.057')] [2022-07-09 15:40:36,455][26022] Updated weights on worker 0-0, policy_version 312377 (0.00090) [2022-07-09 15:40:38,176][26022] Updated weights on worker 0-0, policy_version 312387 (0.00092) [2022-07-09 15:40:40,240][25689] Fps is (10 sec: 5663.4, 60 sec: 5644.6, 300 sec: 5624.9). Total num frames: 319893504. Throughput: 0: 5086.5. Samples: 319889010. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:40,241][25689] Avg episode reward: [(0, '-46.327')] [2022-07-09 15:40:40,251][26022] Updated weights on worker 0-0, policy_version 312397 (0.00089) [2022-07-09 15:40:41,874][26022] Updated weights on worker 0-0, policy_version 312407 (0.00088) [2022-07-09 15:40:43,837][26022] Updated weights on worker 0-0, policy_version 312417 (0.00082) [2022-07-09 15:40:45,264][25689] Fps is (10 sec: 5511.9, 60 sec: 5662.0, 300 sec: 5634.9). Total num frames: 319923200. Throughput: 0: 5914.1. Samples: 319922826. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:45,264][25689] Avg episode reward: [(0, '-45.632')] [2022-07-09 15:40:45,525][26022] Updated weights on worker 0-0, policy_version 312427 (0.00093) [2022-07-09 15:40:47,495][26022] Updated weights on worker 0-0, policy_version 312437 (0.00123) [2022-07-09 15:40:49,099][26022] Updated weights on worker 0-0, policy_version 312447 (0.00079) [2022-07-09 15:40:50,333][25689] Fps is (10 sec: 5783.5, 60 sec: 5656.1, 300 sec: 5638.2). Total num frames: 319951872. Throughput: 0: 5904.6. Samples: 319956866. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:50,333][25689] Avg episode reward: [(0, '-44.567')] [2022-07-09 15:40:50,930][26022] Updated weights on worker 0-0, policy_version 312457 (0.00099) [2022-07-09 15:40:52,854][26022] Updated weights on worker 0-0, policy_version 312467 (0.00083) [2022-07-09 15:40:54,691][26022] Updated weights on worker 0-0, policy_version 312477 (0.00084) [2022-07-09 15:40:55,400][25689] Fps is (10 sec: 5758.3, 60 sec: 5660.3, 300 sec: 5637.3). Total num frames: 319981568. Throughput: 0: 5056.0. Samples: 319973680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:40:55,401][25689] Avg episode reward: [(0, '-44.159')] [2022-07-09 15:40:56,496][26022] Updated weights on worker 0-0, policy_version 312487 (0.00091) [2022-07-09 15:40:58,145][26022] Updated weights on worker 0-0, policy_version 312497 (0.00087) [2022-07-09 15:41:00,011][26022] Updated weights on worker 0-0, policy_version 312507 (0.00086) [2022-07-09 15:41:00,426][25689] Fps is (10 sec: 5681.1, 60 sec: 5647.5, 300 sec: 5640.7). Total num frames: 320009216. Throughput: 0: 5882.3. Samples: 320007750. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:00,427][25689] Avg episode reward: [(0, '-44.753')] [2022-07-09 15:41:01,640][26022] Updated weights on worker 0-0, policy_version 312517 (0.00085) [2022-07-09 15:41:04,108][26022] Updated weights on worker 0-0, policy_version 312527 (0.00725) [2022-07-09 15:41:05,455][25689] Fps is (10 sec: 5397.8, 60 sec: 5647.7, 300 sec: 5640.7). Total num frames: 320035840. Throughput: 0: 5792.9. Samples: 320039790. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:05,455][25689] Avg episode reward: [(0, '-44.726')] [2022-07-09 15:41:05,764][26022] Updated weights on worker 0-0, policy_version 312537 (0.00089) [2022-07-09 15:41:07,743][26022] Updated weights on worker 0-0, policy_version 312547 (0.00083) [2022-07-09 15:41:09,488][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:41:09,498][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000312557_320058368.pth [2022-07-09 15:41:09,499][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000310575_318028800.pth [2022-07-09 15:41:09,509][26022] Updated weights on worker 0-0, policy_version 312557 (0.00095) [2022-07-09 15:41:10,468][25689] Fps is (10 sec: 5404.5, 60 sec: 5632.5, 300 sec: 5631.4). Total num frames: 320063488. Throughput: 0: 4957.0. Samples: 320056680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:10,470][25689] Avg episode reward: [(0, '-44.612')] [2022-07-09 15:41:11,225][26022] Updated weights on worker 0-0, policy_version 312567 (0.00088) [2022-07-09 15:41:12,909][26022] Updated weights on worker 0-0, policy_version 312577 (0.00086) [2022-07-09 15:41:14,790][26022] Updated weights on worker 0-0, policy_version 312587 (0.00084) [2022-07-09 15:41:15,514][25689] Fps is (10 sec: 5700.5, 60 sec: 5632.8, 300 sec: 5634.1). Total num frames: 320093184. Throughput: 0: 5826.8. Samples: 320090880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:15,515][25689] Avg episode reward: [(0, '-45.592')] [2022-07-09 15:41:16,794][26022] Updated weights on worker 0-0, policy_version 312597 (0.00086) [2022-07-09 15:41:18,240][26022] Updated weights on worker 0-0, policy_version 312607 (0.00078) [2022-07-09 15:41:20,473][26022] Updated weights on worker 0-0, policy_version 312617 (0.00095) [2022-07-09 15:41:20,517][25689] Fps is (10 sec: 5604.7, 60 sec: 5624.3, 300 sec: 5631.9). Total num frames: 320119808. Throughput: 0: 5836.7. Samples: 320125014. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:20,518][25689] Avg episode reward: [(0, '-45.641')] [2022-07-09 15:41:21,835][26022] Updated weights on worker 0-0, policy_version 312627 (0.00089) [2022-07-09 15:41:23,962][26022] Updated weights on worker 0-0, policy_version 312637 (0.00086) [2022-07-09 15:41:25,521][25689] Fps is (10 sec: 5627.9, 60 sec: 5608.6, 300 sec: 5638.9). Total num frames: 320149504. Throughput: 0: 5934.1. Samples: 320158868. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:25,522][25689] Avg episode reward: [(0, '-46.481')] [2022-07-09 15:41:25,620][26022] Updated weights on worker 0-0, policy_version 312647 (0.00084) [2022-07-09 15:41:27,721][26022] Updated weights on worker 0-0, policy_version 312657 (0.00091) [2022-07-09 15:41:29,293][26022] Updated weights on worker 0-0, policy_version 312667 (0.00080) [2022-07-09 15:41:30,563][25689] Fps is (10 sec: 5810.4, 60 sec: 5658.0, 300 sec: 5632.9). Total num frames: 320178176. Throughput: 0: 5925.2. Samples: 320175742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:30,563][25689] Avg episode reward: [(0, '-47.017')] [2022-07-09 15:41:31,414][26022] Updated weights on worker 0-0, policy_version 312677 (0.00091) [2022-07-09 15:41:32,720][26022] Updated weights on worker 0-0, policy_version 312687 (0.00088) [2022-07-09 15:41:35,023][26022] Updated weights on worker 0-0, policy_version 312697 (0.00087) [2022-07-09 15:41:35,672][25689] Fps is (10 sec: 5548.3, 60 sec: 5591.0, 300 sec: 5627.9). Total num frames: 320205824. Throughput: 0: 5895.2. Samples: 320209716. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:35,673][25689] Avg episode reward: [(0, '-47.857')] [2022-07-09 15:41:36,346][26022] Updated weights on worker 0-0, policy_version 312707 (0.00090) [2022-07-09 15:41:38,554][26022] Updated weights on worker 0-0, policy_version 312717 (0.00083) [2022-07-09 15:41:40,239][26022] Updated weights on worker 0-0, policy_version 312727 (0.00084) [2022-07-09 15:41:40,684][25689] Fps is (10 sec: 5564.7, 60 sec: 5641.5, 300 sec: 5635.1). Total num frames: 320234496. Throughput: 0: 5888.8. Samples: 320243770. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:40,684][25689] Avg episode reward: [(0, '-48.427')] [2022-07-09 15:41:42,031][26022] Updated weights on worker 0-0, policy_version 312737 (0.00084) [2022-07-09 15:41:43,623][26022] Updated weights on worker 0-0, policy_version 312747 (0.00084) [2022-07-09 15:41:45,619][26022] Updated weights on worker 0-0, policy_version 312757 (0.00084) [2022-07-09 15:41:45,709][25689] Fps is (10 sec: 5713.4, 60 sec: 5624.4, 300 sec: 5634.8). Total num frames: 320263168. Throughput: 0: 5064.8. Samples: 320261114. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:45,710][25689] Avg episode reward: [(0, '-48.119')] [2022-07-09 15:41:47,310][26022] Updated weights on worker 0-0, policy_version 312767 (0.00090) [2022-07-09 15:41:49,128][26022] Updated weights on worker 0-0, policy_version 312777 (0.00088) [2022-07-09 15:41:50,767][25689] Fps is (10 sec: 5788.6, 60 sec: 5642.4, 300 sec: 5636.1). Total num frames: 320292864. Throughput: 0: 5921.8. Samples: 320295386. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:50,767][25689] Avg episode reward: [(0, '-48.522')] [2022-07-09 15:41:50,940][26022] Updated weights on worker 0-0, policy_version 312787 (0.00085) [2022-07-09 15:41:52,753][26022] Updated weights on worker 0-0, policy_version 312797 (0.00091) [2022-07-09 15:41:54,523][26022] Updated weights on worker 0-0, policy_version 312807 (0.00085) [2022-07-09 15:41:55,801][25689] Fps is (10 sec: 5682.4, 60 sec: 5611.6, 300 sec: 5632.5). Total num frames: 320320512. Throughput: 0: 5943.4. Samples: 320329346. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:41:55,802][25689] Avg episode reward: [(0, '-48.088')] [2022-07-09 15:41:56,340][26022] Updated weights on worker 0-0, policy_version 312817 (0.00089) [2022-07-09 15:41:58,152][26022] Updated weights on worker 0-0, policy_version 312827 (0.00087) [2022-07-09 15:41:59,999][26022] Updated weights on worker 0-0, policy_version 312837 (0.00087) [2022-07-09 15:42:00,806][25689] Fps is (10 sec: 5610.1, 60 sec: 5630.5, 300 sec: 5640.0). Total num frames: 320349184. Throughput: 0: 5105.2. Samples: 320346498. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:00,807][25689] Avg episode reward: [(0, '-47.207')] [2022-07-09 15:42:01,704][26022] Updated weights on worker 0-0, policy_version 312847 (0.00085) [2022-07-09 15:42:03,908][26022] Updated weights on worker 0-0, policy_version 312857 (0.00098) [2022-07-09 15:42:05,772][26022] Updated weights on worker 0-0, policy_version 312867 (0.00096) [2022-07-09 15:42:05,813][25689] Fps is (10 sec: 5523.0, 60 sec: 5632.5, 300 sec: 5633.6). Total num frames: 320375808. Throughput: 0: 5848.9. Samples: 320378698. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:05,813][25689] Avg episode reward: [(0, '-45.546')] [2022-07-09 15:42:07,371][26022] Updated weights on worker 0-0, policy_version 312877 (0.00088) [2022-07-09 15:42:09,428][26022] Updated weights on worker 0-0, policy_version 312887 (0.00091) [2022-07-09 15:42:10,835][25689] Fps is (10 sec: 5615.8, 60 sec: 5665.7, 300 sec: 5637.8). Total num frames: 320405504. Throughput: 0: 5856.8. Samples: 320412920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:10,835][25689] Avg episode reward: [(0, '-45.510')] [2022-07-09 15:42:11,084][26022] Updated weights on worker 0-0, policy_version 312897 (0.00081) [2022-07-09 15:42:13,082][26022] Updated weights on worker 0-0, policy_version 312907 (0.00082) [2022-07-09 15:42:14,935][26022] Updated weights on worker 0-0, policy_version 312917 (0.00087) [2022-07-09 15:42:15,927][25689] Fps is (10 sec: 5771.0, 60 sec: 5644.4, 300 sec: 5639.6). Total num frames: 320434176. Throughput: 0: 4994.0. Samples: 320429856. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:15,928][25689] Avg episode reward: [(0, '-46.070')] [2022-07-09 15:42:16,403][26022] Updated weights on worker 0-0, policy_version 312927 (0.00097) [2022-07-09 15:42:18,520][26022] Updated weights on worker 0-0, policy_version 312937 (0.00613) [2022-07-09 15:42:19,966][26022] Updated weights on worker 0-0, policy_version 312947 (0.00090) [2022-07-09 15:42:21,002][25689] Fps is (10 sec: 5539.3, 60 sec: 5654.6, 300 sec: 5631.4). Total num frames: 320461824. Throughput: 0: 5832.8. Samples: 320464298. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:21,004][25689] Avg episode reward: [(0, '-45.224')] [2022-07-09 15:42:22,091][26022] Updated weights on worker 0-0, policy_version 312957 (0.00059) [2022-07-09 15:42:23,498][26022] Updated weights on worker 0-0, policy_version 312967 (0.00093) [2022-07-09 15:42:25,537][26022] Updated weights on worker 0-0, policy_version 312977 (0.00086) [2022-07-09 15:42:26,029][25689] Fps is (10 sec: 5575.0, 60 sec: 5635.5, 300 sec: 5637.8). Total num frames: 320490496. Throughput: 0: 5927.2. Samples: 320498524. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:26,030][25689] Avg episode reward: [(0, '-45.257')] [2022-07-09 15:42:27,250][26022] Updated weights on worker 0-0, policy_version 312987 (0.00096) [2022-07-09 15:42:29,280][26022] Updated weights on worker 0-0, policy_version 312997 (0.00094) [2022-07-09 15:42:30,884][26022] Updated weights on worker 0-0, policy_version 313007 (0.00088) [2022-07-09 15:42:31,063][25689] Fps is (10 sec: 5801.5, 60 sec: 5653.1, 300 sec: 5637.9). Total num frames: 320520192. Throughput: 0: 5074.4. Samples: 320515566. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:31,063][25689] Avg episode reward: [(0, '-45.694')] [2022-07-09 15:42:32,932][26022] Updated weights on worker 0-0, policy_version 313017 (0.00090) [2022-07-09 15:42:34,375][26022] Updated weights on worker 0-0, policy_version 313027 (0.00085) [2022-07-09 15:42:36,176][25689] Fps is (10 sec: 5550.6, 60 sec: 5635.9, 300 sec: 5629.0). Total num frames: 320546816. Throughput: 0: 5912.0. Samples: 320549568. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:36,176][25689] Avg episode reward: [(0, '-46.238')] [2022-07-09 15:42:36,523][26022] Updated weights on worker 0-0, policy_version 313037 (0.00091) [2022-07-09 15:42:37,906][26022] Updated weights on worker 0-0, policy_version 313047 (0.00084) [2022-07-09 15:42:39,878][26022] Updated weights on worker 0-0, policy_version 313057 (0.00091) [2022-07-09 15:42:41,195][25689] Fps is (10 sec: 5558.7, 60 sec: 5652.1, 300 sec: 5635.7). Total num frames: 320576512. Throughput: 0: 5915.6. Samples: 320583750. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:41,195][25689] Avg episode reward: [(0, '-45.070')] [2022-07-09 15:42:41,797][26022] Updated weights on worker 0-0, policy_version 313067 (0.00085) [2022-07-09 15:42:43,612][26022] Updated weights on worker 0-0, policy_version 313077 (0.00090) [2022-07-09 15:42:45,459][26022] Updated weights on worker 0-0, policy_version 313087 (0.00090) [2022-07-09 15:42:46,235][25689] Fps is (10 sec: 5904.2, 60 sec: 5667.7, 300 sec: 5638.4). Total num frames: 320606208. Throughput: 0: 5057.7. Samples: 320600720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 15:42:46,235][25689] Avg episode reward: [(0, '-44.820')] [2022-07-09 15:42:47,338][26022] Updated weights on worker 0-0, policy_version 313097 (0.00084) [2022-07-09 15:42:48,917][26022] Updated weights on worker 0-0, policy_version 313107 (0.00546) [2022-07-09 15:42:50,936][26022] Updated weights on worker 0-0, policy_version 313117 (0.00087) [2022-07-09 15:42:51,302][25689] Fps is (10 sec: 5673.3, 60 sec: 5632.9, 300 sec: 5634.5). Total num frames: 320633856. Throughput: 0: 5894.2. Samples: 320634862. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:42:51,303][25689] Avg episode reward: [(0, '-44.414')] [2022-07-09 15:42:52,531][26022] Updated weights on worker 0-0, policy_version 313127 (0.00093) [2022-07-09 15:42:54,351][26022] Updated weights on worker 0-0, policy_version 313137 (0.00094) [2022-07-09 15:42:56,252][26022] Updated weights on worker 0-0, policy_version 313147 (0.00084) [2022-07-09 15:42:56,349][25689] Fps is (10 sec: 5568.3, 60 sec: 5648.6, 300 sec: 5637.7). Total num frames: 320662528. Throughput: 0: 5923.3. Samples: 320669062. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:42:56,350][25689] Avg episode reward: [(0, '-44.560')] [2022-07-09 15:42:57,755][26022] Updated weights on worker 0-0, policy_version 313157 (0.00086) [2022-07-09 15:42:59,997][26022] Updated weights on worker 0-0, policy_version 313167 (0.00090) [2022-07-09 15:43:01,351][26022] Updated weights on worker 0-0, policy_version 313177 (0.00086) [2022-07-09 15:43:01,449][25689] Fps is (10 sec: 5853.5, 60 sec: 5673.6, 300 sec: 5649.8). Total num frames: 320693248. Throughput: 0: 5064.4. Samples: 320686324. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:01,449][25689] Avg episode reward: [(0, '-44.300')] [2022-07-09 15:43:03,758][26022] Updated weights on worker 0-0, policy_version 313187 (0.00083) [2022-07-09 15:43:05,539][26022] Updated weights on worker 0-0, policy_version 313197 (0.00102) [2022-07-09 15:43:06,470][25689] Fps is (10 sec: 5463.5, 60 sec: 5638.5, 300 sec: 5640.2). Total num frames: 320717824. Throughput: 0: 5811.8. Samples: 320718324. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:06,471][25689] Avg episode reward: [(0, '-44.346')] [2022-07-09 15:43:07,378][26022] Updated weights on worker 0-0, policy_version 313207 (0.00086) [2022-07-09 15:43:09,066][26022] Updated weights on worker 0-0, policy_version 313217 (0.00085) [2022-07-09 15:43:09,588][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:43:09,599][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000313220_320737280.pth [2022-07-09 15:43:09,600][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000311235_318704640.pth [2022-07-09 15:43:11,106][26022] Updated weights on worker 0-0, policy_version 313227 (0.00084) [2022-07-09 15:43:11,548][25689] Fps is (10 sec: 5373.7, 60 sec: 5633.3, 300 sec: 5639.4). Total num frames: 320747520. Throughput: 0: 5813.3. Samples: 320752558. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:11,549][25689] Avg episode reward: [(0, '-44.280')] [2022-07-09 15:43:12,674][26022] Updated weights on worker 0-0, policy_version 313237 (0.00082) [2022-07-09 15:43:14,547][26022] Updated weights on worker 0-0, policy_version 313247 (0.00098) [2022-07-09 15:43:16,310][26022] Updated weights on worker 0-0, policy_version 313257 (0.00094) [2022-07-09 15:43:16,637][25689] Fps is (10 sec: 5741.2, 60 sec: 5633.6, 300 sec: 5641.6). Total num frames: 320776192. Throughput: 0: 4951.7. Samples: 320769520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:16,637][25689] Avg episode reward: [(0, '-46.131')] [2022-07-09 15:43:18,229][26022] Updated weights on worker 0-0, policy_version 313267 (0.00084) [2022-07-09 15:43:20,046][26022] Updated weights on worker 0-0, policy_version 313277 (0.00087) [2022-07-09 15:43:21,674][25689] Fps is (10 sec: 5663.1, 60 sec: 5654.0, 300 sec: 5641.0). Total num frames: 320804864. Throughput: 0: 5797.4. Samples: 320803578. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:21,675][25689] Avg episode reward: [(0, '-47.338')] [2022-07-09 15:43:21,699][26022] Updated weights on worker 0-0, policy_version 313287 (0.00087) [2022-07-09 15:43:23,630][26022] Updated weights on worker 0-0, policy_version 313297 (0.00099) [2022-07-09 15:43:25,541][26022] Updated weights on worker 0-0, policy_version 313307 (0.00089) [2022-07-09 15:43:26,691][25689] Fps is (10 sec: 5601.7, 60 sec: 5638.1, 300 sec: 5633.9). Total num frames: 320832512. Throughput: 0: 5893.7. Samples: 320837498. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:26,691][25689] Avg episode reward: [(0, '-46.561')] [2022-07-09 15:43:27,192][26022] Updated weights on worker 0-0, policy_version 313317 (0.00089) [2022-07-09 15:43:29,086][26022] Updated weights on worker 0-0, policy_version 313327 (0.00088) [2022-07-09 15:43:30,879][26022] Updated weights on worker 0-0, policy_version 313337 (0.00084) [2022-07-09 15:43:31,703][25689] Fps is (10 sec: 5718.1, 60 sec: 5640.1, 300 sec: 5646.3). Total num frames: 320862208. Throughput: 0: 5046.0. Samples: 320854256. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:31,704][25689] Avg episode reward: [(0, '-47.544')] [2022-07-09 15:43:32,781][26022] Updated weights on worker 0-0, policy_version 313347 (0.00085) [2022-07-09 15:43:34,581][26022] Updated weights on worker 0-0, policy_version 313357 (0.00084) [2022-07-09 15:43:36,259][26022] Updated weights on worker 0-0, policy_version 313367 (0.00423) [2022-07-09 15:43:36,757][25689] Fps is (10 sec: 5798.6, 60 sec: 5679.4, 300 sec: 5643.1). Total num frames: 320890880. Throughput: 0: 5904.9. Samples: 320888326. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:36,757][25689] Avg episode reward: [(0, '-46.993')] [2022-07-09 15:43:38,103][26022] Updated weights on worker 0-0, policy_version 313377 (0.00088) [2022-07-09 15:43:39,895][26022] Updated weights on worker 0-0, policy_version 313387 (0.00084) [2022-07-09 15:43:41,727][26022] Updated weights on worker 0-0, policy_version 313397 (0.00085) [2022-07-09 15:43:41,799][25689] Fps is (10 sec: 5578.3, 60 sec: 5643.4, 300 sec: 5635.9). Total num frames: 320918528. Throughput: 0: 5915.7. Samples: 320922630. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:41,799][25689] Avg episode reward: [(0, '-45.650')] [2022-07-09 15:43:43,360][26022] Updated weights on worker 0-0, policy_version 313407 (0.00080) [2022-07-09 15:43:45,184][26022] Updated weights on worker 0-0, policy_version 313417 (0.00087) [2022-07-09 15:43:46,803][25689] Fps is (10 sec: 5707.7, 60 sec: 5646.7, 300 sec: 5648.0). Total num frames: 320948224. Throughput: 0: 5953.1. Samples: 320957232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:46,805][25689] Avg episode reward: [(0, '-45.401')] [2022-07-09 15:43:46,975][26022] Updated weights on worker 0-0, policy_version 313427 (0.00098) [2022-07-09 15:43:48,880][26022] Updated weights on worker 0-0, policy_version 313437 (0.00090) [2022-07-09 15:43:50,571][26022] Updated weights on worker 0-0, policy_version 313447 (0.00089) [2022-07-09 15:43:51,810][25689] Fps is (10 sec: 5625.9, 60 sec: 5635.5, 300 sec: 5638.8). Total num frames: 320974848. Throughput: 0: 5967.5. Samples: 320974244. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:51,811][25689] Avg episode reward: [(0, '-44.664')] [2022-07-09 15:43:52,494][26022] Updated weights on worker 0-0, policy_version 313457 (0.00086) [2022-07-09 15:43:54,173][26022] Updated weights on worker 0-0, policy_version 313467 (0.00085) [2022-07-09 15:43:56,143][26022] Updated weights on worker 0-0, policy_version 313477 (0.00098) [2022-07-09 15:43:56,857][25689] Fps is (10 sec: 5499.9, 60 sec: 5635.5, 300 sec: 5638.8). Total num frames: 321003520. Throughput: 0: 5956.5. Samples: 321008056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:43:56,858][25689] Avg episode reward: [(0, '-44.534')] [2022-07-09 15:43:57,943][26022] Updated weights on worker 0-0, policy_version 313487 (0.00086) [2022-07-09 15:43:59,849][26022] Updated weights on worker 0-0, policy_version 313497 (0.00082) [2022-07-09 15:44:01,841][26022] Updated weights on worker 0-0, policy_version 313507 (0.00092) [2022-07-09 15:44:01,863][25689] Fps is (10 sec: 5602.1, 60 sec: 5593.4, 300 sec: 5639.2). Total num frames: 321031168. Throughput: 0: 5872.2. Samples: 321040450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:01,864][25689] Avg episode reward: [(0, '-44.152')] [2022-07-09 15:44:03,907][26022] Updated weights on worker 0-0, policy_version 313517 (0.00084) [2022-07-09 15:44:05,412][26022] Updated weights on worker 0-0, policy_version 313527 (0.00086) [2022-07-09 15:44:06,867][25689] Fps is (10 sec: 5422.2, 60 sec: 5628.9, 300 sec: 5639.5). Total num frames: 321057792. Throughput: 0: 4978.3. Samples: 321057112. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:06,867][25689] Avg episode reward: [(0, '-43.799')] [2022-07-09 15:44:07,368][26022] Updated weights on worker 0-0, policy_version 313537 (0.00098) [2022-07-09 15:44:09,010][26022] Updated weights on worker 0-0, policy_version 313547 (0.00094) [2022-07-09 15:44:10,911][26022] Updated weights on worker 0-0, policy_version 313557 (0.00083) [2022-07-09 15:44:11,870][25689] Fps is (10 sec: 5627.9, 60 sec: 5635.9, 300 sec: 5641.4). Total num frames: 321087488. Throughput: 0: 5831.7. Samples: 321091232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:11,871][25689] Avg episode reward: [(0, '-44.879')] [2022-07-09 15:44:12,937][26022] Updated weights on worker 0-0, policy_version 313567 (0.00082) [2022-07-09 15:44:14,383][26022] Updated weights on worker 0-0, policy_version 313577 (0.00088) [2022-07-09 15:44:16,383][26022] Updated weights on worker 0-0, policy_version 313587 (0.00084) [2022-07-09 15:44:16,948][25689] Fps is (10 sec: 5891.5, 60 sec: 5653.9, 300 sec: 5644.1). Total num frames: 321117184. Throughput: 0: 5843.6. Samples: 321125456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:16,948][25689] Avg episode reward: [(0, '-45.024')] [2022-07-09 15:44:18,117][26022] Updated weights on worker 0-0, policy_version 313597 (0.00088) [2022-07-09 15:44:19,972][26022] Updated weights on worker 0-0, policy_version 313607 (0.00092) [2022-07-09 15:44:21,832][26022] Updated weights on worker 0-0, policy_version 313617 (0.00090) [2022-07-09 15:44:21,955][25689] Fps is (10 sec: 5686.5, 60 sec: 5639.8, 300 sec: 5640.8). Total num frames: 321144832. Throughput: 0: 5088.7. Samples: 321142690. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:21,955][25689] Avg episode reward: [(0, '-45.428')] [2022-07-09 15:44:23,532][26022] Updated weights on worker 0-0, policy_version 313627 (0.00091) [2022-07-09 15:44:25,309][26022] Updated weights on worker 0-0, policy_version 313637 (0.00092) [2022-07-09 15:44:26,991][25689] Fps is (10 sec: 5607.7, 60 sec: 5654.9, 300 sec: 5647.1). Total num frames: 321173504. Throughput: 0: 5945.9. Samples: 321176768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:26,991][25689] Avg episode reward: [(0, '-45.228')] [2022-07-09 15:44:27,221][26022] Updated weights on worker 0-0, policy_version 313647 (0.00085) [2022-07-09 15:44:28,967][26022] Updated weights on worker 0-0, policy_version 313657 (0.00088) [2022-07-09 15:44:30,877][26022] Updated weights on worker 0-0, policy_version 313667 (0.00092) [2022-07-09 15:44:32,016][25689] Fps is (10 sec: 5597.9, 60 sec: 5619.8, 300 sec: 5640.5). Total num frames: 321201152. Throughput: 0: 5934.3. Samples: 321210780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:32,016][25689] Avg episode reward: [(0, '-45.993')] [2022-07-09 15:44:32,474][26022] Updated weights on worker 0-0, policy_version 313677 (0.00088) [2022-07-09 15:44:34,499][26022] Updated weights on worker 0-0, policy_version 313687 (0.00087) [2022-07-09 15:44:36,103][26022] Updated weights on worker 0-0, policy_version 313697 (0.00086) [2022-07-09 15:44:37,079][25689] Fps is (10 sec: 5582.8, 60 sec: 5618.9, 300 sec: 5642.8). Total num frames: 321229824. Throughput: 0: 5084.6. Samples: 321227816. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:37,080][25689] Avg episode reward: [(0, '-45.766')] [2022-07-09 15:44:38,062][26022] Updated weights on worker 0-0, policy_version 313707 (0.00084) [2022-07-09 15:44:39,663][26022] Updated weights on worker 0-0, policy_version 313717 (0.00094) [2022-07-09 15:44:41,567][26022] Updated weights on worker 0-0, policy_version 313727 (0.00092) [2022-07-09 15:44:42,145][25689] Fps is (10 sec: 5762.4, 60 sec: 5650.6, 300 sec: 5645.5). Total num frames: 321259520. Throughput: 0: 5915.3. Samples: 321262120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:42,145][25689] Avg episode reward: [(0, '-45.757')] [2022-07-09 15:44:43,515][26022] Updated weights on worker 0-0, policy_version 313737 (0.00093) [2022-07-09 15:44:45,151][26022] Updated weights on worker 0-0, policy_version 313747 (0.00096) [2022-07-09 15:44:46,952][26022] Updated weights on worker 0-0, policy_version 313757 (0.00093) [2022-07-09 15:44:47,149][25689] Fps is (10 sec: 5694.4, 60 sec: 5616.7, 300 sec: 5642.1). Total num frames: 321287168. Throughput: 0: 5922.2. Samples: 321296150. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:47,150][25689] Avg episode reward: [(0, '-46.323')] [2022-07-09 15:44:48,853][26022] Updated weights on worker 0-0, policy_version 313767 (0.00091) [2022-07-09 15:44:50,691][26022] Updated weights on worker 0-0, policy_version 313777 (0.00502) [2022-07-09 15:44:52,198][25689] Fps is (10 sec: 5602.0, 60 sec: 5646.6, 300 sec: 5639.9). Total num frames: 321315840. Throughput: 0: 5081.1. Samples: 321313330. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:52,199][25689] Avg episode reward: [(0, '-47.487')] [2022-07-09 15:44:52,330][26022] Updated weights on worker 0-0, policy_version 313787 (0.00092) [2022-07-09 15:44:54,436][26022] Updated weights on worker 0-0, policy_version 313797 (0.00080) [2022-07-09 15:44:56,044][26022] Updated weights on worker 0-0, policy_version 313807 (0.00094) [2022-07-09 15:44:57,235][25689] Fps is (10 sec: 5787.0, 60 sec: 5664.6, 300 sec: 5643.9). Total num frames: 321345536. Throughput: 0: 5936.0. Samples: 321347464. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:44:57,240][25689] Avg episode reward: [(0, '-47.067')] [2022-07-09 15:44:57,986][26022] Updated weights on worker 0-0, policy_version 313817 (0.00091) [2022-07-09 15:44:59,686][26022] Updated weights on worker 0-0, policy_version 313827 (0.00088) [2022-07-09 15:45:01,819][26022] Updated weights on worker 0-0, policy_version 313837 (0.00088) [2022-07-09 15:45:02,282][25689] Fps is (10 sec: 5483.8, 60 sec: 5626.9, 300 sec: 5640.2). Total num frames: 321371136. Throughput: 0: 5809.0. Samples: 321379096. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:45:02,283][25689] Avg episode reward: [(0, '-47.964')] [2022-07-09 15:45:03,686][26022] Updated weights on worker 0-0, policy_version 313847 (0.00084) [2022-07-09 15:45:05,465][26022] Updated weights on worker 0-0, policy_version 313857 (0.00088) [2022-07-09 15:45:07,254][26022] Updated weights on worker 0-0, policy_version 313867 (0.00086) [2022-07-09 15:45:07,325][25689] Fps is (10 sec: 5480.3, 60 sec: 5674.0, 300 sec: 5643.4). Total num frames: 321400832. Throughput: 0: 4960.5. Samples: 321396232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:45:07,326][25689] Avg episode reward: [(0, '-48.219')] [2022-07-09 15:45:08,975][26022] Updated weights on worker 0-0, policy_version 313877 (0.00086) [2022-07-09 15:45:09,885][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:45:09,894][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000313881_321414144.pth [2022-07-09 15:45:09,894][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000311895_319380480.pth [2022-07-09 15:45:10,909][26022] Updated weights on worker 0-0, policy_version 313887 (0.00086) [2022-07-09 15:45:12,365][25689] Fps is (10 sec: 5687.2, 60 sec: 5636.8, 300 sec: 5636.7). Total num frames: 321428480. Throughput: 0: 5821.8. Samples: 321430734. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:45:12,365][25689] Avg episode reward: [(0, '-47.834')] [2022-07-09 15:45:12,592][26022] Updated weights on worker 0-0, policy_version 313897 (0.00090) [2022-07-09 15:45:14,456][26022] Updated weights on worker 0-0, policy_version 313907 (0.00083) [2022-07-09 15:45:15,993][26022] Updated weights on worker 0-0, policy_version 313917 (0.00093) [2022-07-09 15:45:17,414][25689] Fps is (10 sec: 5582.6, 60 sec: 5622.5, 300 sec: 5641.0). Total num frames: 321457152. Throughput: 0: 5833.9. Samples: 321465182. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 15:45:17,414][25689] Avg episode reward: [(0, '-47.027')] [2022-07-09 15:45:18,058][26022] Updated weights on worker 0-0, policy_version 313927 (0.00085) [2022-07-09 15:45:19,723][26022] Updated weights on worker 0-0, policy_version 313937 (0.00103) [2022-07-09 15:45:21,629][26022] Updated weights on worker 0-0, policy_version 313947 (0.00083) [2022-07-09 15:45:22,447][25689] Fps is (10 sec: 5789.1, 60 sec: 5653.9, 300 sec: 5637.3). Total num frames: 321486848. Throughput: 0: 5120.9. Samples: 321482360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:22,448][25689] Avg episode reward: [(0, '-46.906')] [2022-07-09 15:45:23,196][26022] Updated weights on worker 0-0, policy_version 313957 (0.00087) [2022-07-09 15:45:25,195][26022] Updated weights on worker 0-0, policy_version 313967 (0.00088) [2022-07-09 15:45:27,067][26022] Updated weights on worker 0-0, policy_version 313977 (0.00084) [2022-07-09 15:45:27,460][25689] Fps is (10 sec: 5809.7, 60 sec: 5656.0, 300 sec: 5647.8). Total num frames: 321515520. Throughput: 0: 5976.1. Samples: 321516562. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:27,461][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 15:45:28,680][26022] Updated weights on worker 0-0, policy_version 313987 (0.00090) [2022-07-09 15:45:30,546][26022] Updated weights on worker 0-0, policy_version 313997 (0.00095) [2022-07-09 15:45:32,477][25689] Fps is (10 sec: 5513.0, 60 sec: 5639.8, 300 sec: 5632.5). Total num frames: 321542144. Throughput: 0: 5948.3. Samples: 321550368. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:32,478][25689] Avg episode reward: [(0, '-45.851')] [2022-07-09 15:45:32,480][26022] Updated weights on worker 0-0, policy_version 314007 (0.00090) [2022-07-09 15:45:34,182][26022] Updated weights on worker 0-0, policy_version 314017 (0.00085) [2022-07-09 15:45:36,159][26022] Updated weights on worker 0-0, policy_version 314027 (0.00086) [2022-07-09 15:45:37,551][25689] Fps is (10 sec: 5683.1, 60 sec: 5672.8, 300 sec: 5648.5). Total num frames: 321572864. Throughput: 0: 5078.0. Samples: 321567438. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:37,557][25689] Avg episode reward: [(0, '-45.268')] [2022-07-09 15:45:37,723][26022] Updated weights on worker 0-0, policy_version 314037 (0.00080) [2022-07-09 15:45:39,608][26022] Updated weights on worker 0-0, policy_version 314047 (0.00089) [2022-07-09 15:45:41,512][26022] Updated weights on worker 0-0, policy_version 314057 (0.00092) [2022-07-09 15:45:42,581][25689] Fps is (10 sec: 5776.5, 60 sec: 5642.1, 300 sec: 5641.5). Total num frames: 321600512. Throughput: 0: 5939.0. Samples: 321601938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:42,582][25689] Avg episode reward: [(0, '-45.548')] [2022-07-09 15:45:43,096][26022] Updated weights on worker 0-0, policy_version 314067 (0.00089) [2022-07-09 15:45:44,837][26022] Updated weights on worker 0-0, policy_version 314077 (0.00084) [2022-07-09 15:45:46,780][26022] Updated weights on worker 0-0, policy_version 314087 (0.00087) [2022-07-09 15:45:47,612][25689] Fps is (10 sec: 5495.9, 60 sec: 5639.7, 300 sec: 5638.8). Total num frames: 321628160. Throughput: 0: 5950.3. Samples: 321636470. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:47,613][25689] Avg episode reward: [(0, '-46.941')] [2022-07-09 15:45:48,690][26022] Updated weights on worker 0-0, policy_version 314098 (0.00093) [2022-07-09 15:45:50,695][26022] Updated weights on worker 0-0, policy_version 314108 (0.00099) [2022-07-09 15:45:52,240][26022] Updated weights on worker 0-0, policy_version 314118 (0.00086) [2022-07-09 15:45:52,621][25689] Fps is (10 sec: 5813.8, 60 sec: 5677.3, 300 sec: 5643.3). Total num frames: 321658880. Throughput: 0: 5126.9. Samples: 321653644. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:52,621][25689] Avg episode reward: [(0, '-46.398')] [2022-07-09 15:45:54,166][26022] Updated weights on worker 0-0, policy_version 314128 (0.00086) [2022-07-09 15:45:55,928][26022] Updated weights on worker 0-0, policy_version 314138 (0.00081) [2022-07-09 15:45:57,707][25689] Fps is (10 sec: 5781.5, 60 sec: 5638.8, 300 sec: 5642.2). Total num frames: 321686528. Throughput: 0: 5967.5. Samples: 321687724. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:45:57,708][25689] Avg episode reward: [(0, '-46.951')] [2022-07-09 15:45:57,773][26022] Updated weights on worker 0-0, policy_version 314148 (0.00082) [2022-07-09 15:45:59,568][26022] Updated weights on worker 0-0, policy_version 314158 (0.00087) [2022-07-09 15:46:01,426][26022] Updated weights on worker 0-0, policy_version 314168 (0.00087) [2022-07-09 15:46:02,756][25689] Fps is (10 sec: 5354.7, 60 sec: 5655.5, 300 sec: 5641.8). Total num frames: 321713152. Throughput: 0: 5919.5. Samples: 321721364. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:02,757][25689] Avg episode reward: [(0, '-47.171')] [2022-07-09 15:46:03,416][26022] Updated weights on worker 0-0, policy_version 314178 (0.00081) [2022-07-09 15:46:05,314][26022] Updated weights on worker 0-0, policy_version 314188 (0.00089) [2022-07-09 15:46:07,180][26022] Updated weights on worker 0-0, policy_version 314198 (0.00084) [2022-07-09 15:46:07,758][25689] Fps is (10 sec: 5501.6, 60 sec: 5642.5, 300 sec: 5645.5). Total num frames: 321741824. Throughput: 0: 4978.3. Samples: 321736766. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:07,760][25689] Avg episode reward: [(0, '-47.936')] [2022-07-09 15:46:09,099][26022] Updated weights on worker 0-0, policy_version 314208 (0.00088) [2022-07-09 15:46:10,841][26022] Updated weights on worker 0-0, policy_version 314218 (0.00082) [2022-07-09 15:46:12,527][26022] Updated weights on worker 0-0, policy_version 314228 (0.00085) [2022-07-09 15:46:12,787][25689] Fps is (10 sec: 5717.2, 60 sec: 5660.5, 300 sec: 5642.3). Total num frames: 321770496. Throughput: 0: 5802.1. Samples: 321770646. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:12,787][25689] Avg episode reward: [(0, '-46.957')] [2022-07-09 15:46:14,394][26022] Updated weights on worker 0-0, policy_version 314238 (0.00889) [2022-07-09 15:46:15,967][26022] Updated weights on worker 0-0, policy_version 314248 (0.00092) [2022-07-09 15:46:17,842][25689] Fps is (10 sec: 5585.0, 60 sec: 5642.9, 300 sec: 5644.8). Total num frames: 321798144. Throughput: 0: 5824.2. Samples: 321804994. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:17,843][25689] Avg episode reward: [(0, '-45.895')] [2022-07-09 15:46:17,967][26022] Updated weights on worker 0-0, policy_version 314258 (0.00085) [2022-07-09 15:46:19,868][26022] Updated weights on worker 0-0, policy_version 314268 (0.00086) [2022-07-09 15:46:21,502][26022] Updated weights on worker 0-0, policy_version 314278 (0.00095) [2022-07-09 15:46:22,846][25689] Fps is (10 sec: 5598.8, 60 sec: 5628.7, 300 sec: 5641.4). Total num frames: 321826816. Throughput: 0: 5012.1. Samples: 321822054. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:22,846][25689] Avg episode reward: [(0, '-46.557')] [2022-07-09 15:46:23,487][26022] Updated weights on worker 0-0, policy_version 314288 (0.00093) [2022-07-09 15:46:25,078][26022] Updated weights on worker 0-0, policy_version 314298 (0.00099) [2022-07-09 15:46:27,007][26022] Updated weights on worker 0-0, policy_version 314308 (0.00085) [2022-07-09 15:46:27,856][25689] Fps is (10 sec: 5726.9, 60 sec: 5629.0, 300 sec: 5642.0). Total num frames: 321855488. Throughput: 0: 5932.7. Samples: 321855996. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:27,856][25689] Avg episode reward: [(0, '-46.144')] [2022-07-09 15:46:28,864][26022] Updated weights on worker 0-0, policy_version 314318 (0.00092) [2022-07-09 15:46:30,750][26022] Updated weights on worker 0-0, policy_version 314328 (0.00092) [2022-07-09 15:46:32,503][26022] Updated weights on worker 0-0, policy_version 314338 (0.00084) [2022-07-09 15:46:32,884][25689] Fps is (10 sec: 5814.5, 60 sec: 5678.8, 300 sec: 5650.4). Total num frames: 321885184. Throughput: 0: 5931.5. Samples: 321889854. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:32,885][25689] Avg episode reward: [(0, '-46.222')] [2022-07-09 15:46:34,465][26022] Updated weights on worker 0-0, policy_version 314348 (0.00086) [2022-07-09 15:46:35,987][26022] Updated weights on worker 0-0, policy_version 314358 (0.00090) [2022-07-09 15:46:37,987][25689] Fps is (10 sec: 5559.0, 60 sec: 5608.3, 300 sec: 5641.8). Total num frames: 321911808. Throughput: 0: 5050.2. Samples: 321906728. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:37,987][25689] Avg episode reward: [(0, '-46.426')] [2022-07-09 15:46:38,139][26022] Updated weights on worker 0-0, policy_version 314368 (0.00083) [2022-07-09 15:46:39,667][26022] Updated weights on worker 0-0, policy_version 314378 (0.00090) [2022-07-09 15:46:41,421][26022] Updated weights on worker 0-0, policy_version 314388 (0.00089) [2022-07-09 15:46:43,035][25689] Fps is (10 sec: 5447.5, 60 sec: 5623.6, 300 sec: 5641.3). Total num frames: 321940480. Throughput: 0: 5899.5. Samples: 321941158. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:43,035][25689] Avg episode reward: [(0, '-46.866')] [2022-07-09 15:46:43,326][26022] Updated weights on worker 0-0, policy_version 314398 (0.00087) [2022-07-09 15:46:45,056][26022] Updated weights on worker 0-0, policy_version 314408 (0.00094) [2022-07-09 15:46:46,822][26022] Updated weights on worker 0-0, policy_version 314418 (0.00085) [2022-07-09 15:46:48,060][25689] Fps is (10 sec: 5794.4, 60 sec: 5658.0, 300 sec: 5641.9). Total num frames: 321970176. Throughput: 0: 5902.3. Samples: 321975248. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:48,061][25689] Avg episode reward: [(0, '-47.082')] [2022-07-09 15:46:48,622][26022] Updated weights on worker 0-0, policy_version 314428 (0.00088) [2022-07-09 15:46:50,540][26022] Updated weights on worker 0-0, policy_version 314438 (0.00630) [2022-07-09 15:46:52,196][26022] Updated weights on worker 0-0, policy_version 314448 (0.00093) [2022-07-09 15:46:53,147][25689] Fps is (10 sec: 5873.2, 60 sec: 5633.8, 300 sec: 5647.8). Total num frames: 321999872. Throughput: 0: 5057.0. Samples: 321992324. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:53,148][25689] Avg episode reward: [(0, '-47.110')] [2022-07-09 15:46:54,039][26022] Updated weights on worker 0-0, policy_version 314458 (0.00083) [2022-07-09 15:46:55,826][26022] Updated weights on worker 0-0, policy_version 314468 (0.00087) [2022-07-09 15:46:57,634][26022] Updated weights on worker 0-0, policy_version 314478 (0.00089) [2022-07-09 15:46:58,191][25689] Fps is (10 sec: 5761.4, 60 sec: 5654.7, 300 sec: 5647.1). Total num frames: 322028544. Throughput: 0: 5936.7. Samples: 322026674. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:46:58,191][25689] Avg episode reward: [(0, '-47.141')] [2022-07-09 15:46:59,228][26022] Updated weights on worker 0-0, policy_version 314488 (0.00087) [2022-07-09 15:47:01,296][26022] Updated weights on worker 0-0, policy_version 314498 (0.00090) [2022-07-09 15:47:03,205][25689] Fps is (10 sec: 5497.6, 60 sec: 5657.9, 300 sec: 5646.9). Total num frames: 322055168. Throughput: 0: 5861.6. Samples: 322059390. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:03,206][25689] Avg episode reward: [(0, '-47.703')] [2022-07-09 15:47:03,410][26022] Updated weights on worker 0-0, policy_version 314508 (0.00097) [2022-07-09 15:47:05,206][26022] Updated weights on worker 0-0, policy_version 314518 (0.00089) [2022-07-09 15:47:07,161][26022] Updated weights on worker 0-0, policy_version 314528 (0.00084) [2022-07-09 15:47:08,227][25689] Fps is (10 sec: 5305.8, 60 sec: 5622.2, 300 sec: 5636.6). Total num frames: 322081792. Throughput: 0: 4979.0. Samples: 322075658. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:08,227][25689] Avg episode reward: [(0, '-47.734')] [2022-07-09 15:47:08,685][26022] Updated weights on worker 0-0, policy_version 314538 (0.00084) [2022-07-09 15:47:10,189][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:47:10,199][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000314545_322094080.pth [2022-07-09 15:47:10,200][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000312557_320058368.pth [2022-07-09 15:47:10,838][26022] Updated weights on worker 0-0, policy_version 314548 (0.00099) [2022-07-09 15:47:12,455][26022] Updated weights on worker 0-0, policy_version 314558 (0.00082) [2022-07-09 15:47:13,247][25689] Fps is (10 sec: 5608.8, 60 sec: 5639.9, 300 sec: 5641.4). Total num frames: 322111488. Throughput: 0: 5855.5. Samples: 322110018. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:13,247][25689] Avg episode reward: [(0, '-46.312')] [2022-07-09 15:47:14,335][26022] Updated weights on worker 0-0, policy_version 314568 (0.00088) [2022-07-09 15:47:16,248][26022] Updated weights on worker 0-0, policy_version 314578 (0.00088) [2022-07-09 15:47:17,733][26022] Updated weights on worker 0-0, policy_version 314588 (0.00086) [2022-07-09 15:47:18,303][25689] Fps is (10 sec: 5792.5, 60 sec: 5656.8, 300 sec: 5645.2). Total num frames: 322140160. Throughput: 0: 5834.4. Samples: 322144018. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:18,304][25689] Avg episode reward: [(0, '-46.251')] [2022-07-09 15:47:19,955][26022] Updated weights on worker 0-0, policy_version 314598 (0.00087) [2022-07-09 15:47:21,474][26022] Updated weights on worker 0-0, policy_version 314608 (0.00092) [2022-07-09 15:47:23,308][25689] Fps is (10 sec: 5597.6, 60 sec: 5639.8, 300 sec: 5642.2). Total num frames: 322167808. Throughput: 0: 5908.8. Samples: 322178174. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:23,310][25689] Avg episode reward: [(0, '-46.283')] [2022-07-09 15:47:23,319][26022] Updated weights on worker 0-0, policy_version 314618 (0.00092) [2022-07-09 15:47:24,983][26022] Updated weights on worker 0-0, policy_version 314628 (0.00094) [2022-07-09 15:47:26,895][26022] Updated weights on worker 0-0, policy_version 314638 (0.00088) [2022-07-09 15:47:28,322][25689] Fps is (10 sec: 5621.6, 60 sec: 5639.4, 300 sec: 5639.2). Total num frames: 322196480. Throughput: 0: 5950.4. Samples: 322195232. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:28,322][25689] Avg episode reward: [(0, '-45.174')] [2022-07-09 15:47:28,819][26022] Updated weights on worker 0-0, policy_version 314648 (0.00082) [2022-07-09 15:47:30,411][26022] Updated weights on worker 0-0, policy_version 314658 (0.00099) [2022-07-09 15:47:32,219][26022] Updated weights on worker 0-0, policy_version 314668 (0.00080) [2022-07-09 15:47:33,343][25689] Fps is (10 sec: 5816.6, 60 sec: 5640.1, 300 sec: 5651.2). Total num frames: 322226176. Throughput: 0: 5934.3. Samples: 322229274. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:33,343][25689] Avg episode reward: [(0, '-44.247')] [2022-07-09 15:47:34,107][26022] Updated weights on worker 0-0, policy_version 314678 (0.00078) [2022-07-09 15:47:35,977][26022] Updated weights on worker 0-0, policy_version 314688 (0.00089) [2022-07-09 15:47:37,726][26022] Updated weights on worker 0-0, policy_version 314698 (0.00092) [2022-07-09 15:47:38,439][25689] Fps is (10 sec: 5668.1, 60 sec: 5657.7, 300 sec: 5642.9). Total num frames: 322253824. Throughput: 0: 5911.6. Samples: 322263052. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:38,439][25689] Avg episode reward: [(0, '-45.091')] [2022-07-09 15:47:39,628][26022] Updated weights on worker 0-0, policy_version 314708 (0.00094) [2022-07-09 15:47:41,304][26022] Updated weights on worker 0-0, policy_version 314718 (0.00097) [2022-07-09 15:47:43,411][26022] Updated weights on worker 0-0, policy_version 314728 (0.00092) [2022-07-09 15:47:43,448][25689] Fps is (10 sec: 5472.3, 60 sec: 5644.4, 300 sec: 5636.6). Total num frames: 322281472. Throughput: 0: 5058.0. Samples: 322280040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-09 15:47:43,448][25689] Avg episode reward: [(0, '-46.164')] [2022-07-09 15:47:45,015][26022] Updated weights on worker 0-0, policy_version 314738 (0.00088) [2022-07-09 15:47:46,979][26022] Updated weights on worker 0-0, policy_version 314748 (0.00089) [2022-07-09 15:47:48,463][25689] Fps is (10 sec: 5720.6, 60 sec: 5645.3, 300 sec: 5644.5). Total num frames: 322311168. Throughput: 0: 5896.6. Samples: 322313996. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:47:48,463][25689] Avg episode reward: [(0, '-47.006')] [2022-07-09 15:47:48,687][26022] Updated weights on worker 0-0, policy_version 314758 (0.00093) [2022-07-09 15:47:50,531][26022] Updated weights on worker 0-0, policy_version 314768 (0.00090) [2022-07-09 15:47:52,339][26022] Updated weights on worker 0-0, policy_version 314778 (0.00089) [2022-07-09 15:47:53,527][25689] Fps is (10 sec: 5689.4, 60 sec: 5613.5, 300 sec: 5640.7). Total num frames: 322338816. Throughput: 0: 5874.3. Samples: 322347840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:47:53,527][25689] Avg episode reward: [(0, '-47.511')] [2022-07-09 15:47:54,094][26022] Updated weights on worker 0-0, policy_version 314788 (0.00087) [2022-07-09 15:47:55,994][26022] Updated weights on worker 0-0, policy_version 314798 (0.00824) [2022-07-09 15:47:57,776][26022] Updated weights on worker 0-0, policy_version 314808 (0.00083) [2022-07-09 15:47:58,618][25689] Fps is (10 sec: 5545.9, 60 sec: 5609.1, 300 sec: 5634.0). Total num frames: 322367488. Throughput: 0: 5051.8. Samples: 322364994. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:47:58,619][25689] Avg episode reward: [(0, '-48.172')] [2022-07-09 15:47:59,528][26022] Updated weights on worker 0-0, policy_version 314818 (0.00085) [2022-07-09 15:48:01,471][26022] Updated weights on worker 0-0, policy_version 314828 (0.00091) [2022-07-09 15:48:03,542][26022] Updated weights on worker 0-0, policy_version 314838 (0.00090) [2022-07-09 15:48:03,639][25689] Fps is (10 sec: 5569.4, 60 sec: 5625.5, 300 sec: 5644.3). Total num frames: 322395136. Throughput: 0: 5774.7. Samples: 322396640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:03,640][25689] Avg episode reward: [(0, '-48.203')] [2022-07-09 15:48:05,451][26022] Updated weights on worker 0-0, policy_version 314848 (0.00089) [2022-07-09 15:48:07,082][26022] Updated weights on worker 0-0, policy_version 314858 (0.00068) [2022-07-09 15:48:08,651][25689] Fps is (10 sec: 5511.7, 60 sec: 5643.3, 300 sec: 5638.7). Total num frames: 322422784. Throughput: 0: 5786.4. Samples: 322430810. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:08,651][25689] Avg episode reward: [(0, '-48.287')] [2022-07-09 15:48:08,943][26022] Updated weights on worker 0-0, policy_version 314868 (0.00088) [2022-07-09 15:48:10,678][26022] Updated weights on worker 0-0, policy_version 314878 (0.00089) [2022-07-09 15:48:12,459][26022] Updated weights on worker 0-0, policy_version 314888 (0.00093) [2022-07-09 15:48:13,658][25689] Fps is (10 sec: 5519.2, 60 sec: 5610.6, 300 sec: 5636.8). Total num frames: 322450432. Throughput: 0: 4976.4. Samples: 322448022. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:13,659][25689] Avg episode reward: [(0, '-47.533')] [2022-07-09 15:48:14,386][26022] Updated weights on worker 0-0, policy_version 314898 (0.00092) [2022-07-09 15:48:16,225][26022] Updated weights on worker 0-0, policy_version 314908 (0.00085) [2022-07-09 15:48:18,058][26022] Updated weights on worker 0-0, policy_version 314918 (0.00090) [2022-07-09 15:48:18,708][25689] Fps is (10 sec: 5701.6, 60 sec: 5628.1, 300 sec: 5640.0). Total num frames: 322480128. Throughput: 0: 5841.1. Samples: 322482342. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:18,709][25689] Avg episode reward: [(0, '-47.105')] [2022-07-09 15:48:19,830][26022] Updated weights on worker 0-0, policy_version 314928 (0.00097) [2022-07-09 15:48:21,602][26022] Updated weights on worker 0-0, policy_version 314938 (0.00086) [2022-07-09 15:48:23,486][26022] Updated weights on worker 0-0, policy_version 314948 (0.00093) [2022-07-09 15:48:23,746][25689] Fps is (10 sec: 5684.5, 60 sec: 5625.1, 300 sec: 5639.6). Total num frames: 322507776. Throughput: 0: 5962.1. Samples: 322516518. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:23,746][25689] Avg episode reward: [(0, '-46.653')] [2022-07-09 15:48:25,123][26022] Updated weights on worker 0-0, policy_version 314958 (0.00086) [2022-07-09 15:48:27,210][26022] Updated weights on worker 0-0, policy_version 314968 (0.00092) [2022-07-09 15:48:28,750][25689] Fps is (10 sec: 5608.5, 60 sec: 5626.0, 300 sec: 5636.3). Total num frames: 322536448. Throughput: 0: 5100.6. Samples: 322533330. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:28,752][25689] Avg episode reward: [(0, '-46.141')] [2022-07-09 15:48:28,791][26022] Updated weights on worker 0-0, policy_version 314978 (0.00092) [2022-07-09 15:48:30,791][26022] Updated weights on worker 0-0, policy_version 314988 (0.00087) [2022-07-09 15:48:32,478][26022] Updated weights on worker 0-0, policy_version 314998 (0.00087) [2022-07-09 15:48:33,758][25689] Fps is (10 sec: 5727.2, 60 sec: 5610.2, 300 sec: 5637.1). Total num frames: 322565120. Throughput: 0: 5913.2. Samples: 322566880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:33,759][25689] Avg episode reward: [(0, '-46.022')] [2022-07-09 15:48:34,282][26022] Updated weights on worker 0-0, policy_version 315008 (0.00096) [2022-07-09 15:48:36,293][26022] Updated weights on worker 0-0, policy_version 315018 (0.00087) [2022-07-09 15:48:37,856][26022] Updated weights on worker 0-0, policy_version 315028 (0.00091) [2022-07-09 15:48:38,823][25689] Fps is (10 sec: 5489.8, 60 sec: 5596.2, 300 sec: 5633.3). Total num frames: 322591744. Throughput: 0: 5877.5. Samples: 322600564. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:38,824][25689] Avg episode reward: [(0, '-46.372')] [2022-07-09 15:48:39,891][26022] Updated weights on worker 0-0, policy_version 315038 (0.00093) [2022-07-09 15:48:41,711][26022] Updated weights on worker 0-0, policy_version 315048 (0.00088) [2022-07-09 15:48:43,235][26022] Updated weights on worker 0-0, policy_version 315058 (0.00088) [2022-07-09 15:48:43,855][25689] Fps is (10 sec: 5578.3, 60 sec: 5627.9, 300 sec: 5632.8). Total num frames: 322621440. Throughput: 0: 5029.1. Samples: 322617648. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:43,855][25689] Avg episode reward: [(0, '-46.443')] [2022-07-09 15:48:45,363][26022] Updated weights on worker 0-0, policy_version 315068 (0.00085) [2022-07-09 15:48:46,926][26022] Updated weights on worker 0-0, policy_version 315078 (0.00092) [2022-07-09 15:48:48,795][26022] Updated weights on worker 0-0, policy_version 315088 (0.00085) [2022-07-09 15:48:48,878][25689] Fps is (10 sec: 5804.8, 60 sec: 5610.3, 300 sec: 5639.3). Total num frames: 322650112. Throughput: 0: 5892.6. Samples: 322651934. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:48,878][25689] Avg episode reward: [(0, '-46.673')] [2022-07-09 15:48:50,710][26022] Updated weights on worker 0-0, policy_version 315098 (0.00096) [2022-07-09 15:48:52,456][26022] Updated weights on worker 0-0, policy_version 315108 (0.00092) [2022-07-09 15:48:53,905][25689] Fps is (10 sec: 5603.6, 60 sec: 5613.6, 300 sec: 5636.3). Total num frames: 322677760. Throughput: 0: 5878.8. Samples: 322685320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:53,907][25689] Avg episode reward: [(0, '-47.164')] [2022-07-09 15:48:54,455][26022] Updated weights on worker 0-0, policy_version 315118 (0.00094) [2022-07-09 15:48:56,235][26022] Updated weights on worker 0-0, policy_version 315128 (0.00093) [2022-07-09 15:48:57,890][26022] Updated weights on worker 0-0, policy_version 315138 (0.00096) [2022-07-09 15:48:58,982][25689] Fps is (10 sec: 5675.2, 60 sec: 5632.0, 300 sec: 5641.8). Total num frames: 322707456. Throughput: 0: 5043.0. Samples: 322702228. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:48:58,983][25689] Avg episode reward: [(0, '-46.615')] [2022-07-09 15:48:59,741][26022] Updated weights on worker 0-0, policy_version 315148 (0.00529) [2022-07-09 15:49:01,435][26022] Updated weights on worker 0-0, policy_version 315158 (0.00088) [2022-07-09 15:49:03,756][26022] Updated weights on worker 0-0, policy_version 315168 (0.00086) [2022-07-09 15:49:03,992][25689] Fps is (10 sec: 5583.5, 60 sec: 5616.0, 300 sec: 5641.7). Total num frames: 322734080. Throughput: 0: 5797.1. Samples: 322734388. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:03,993][25689] Avg episode reward: [(0, '-45.691')] [2022-07-09 15:49:05,357][26022] Updated weights on worker 0-0, policy_version 315178 (0.00086) [2022-07-09 15:49:07,366][26022] Updated weights on worker 0-0, policy_version 315188 (0.00087) [2022-07-09 15:49:09,018][25689] Fps is (10 sec: 5407.7, 60 sec: 5614.7, 300 sec: 5634.4). Total num frames: 322761728. Throughput: 0: 5777.6. Samples: 322768296. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:09,018][25689] Avg episode reward: [(0, '-45.833')] [2022-07-09 15:49:09,212][26022] Updated weights on worker 0-0, policy_version 315198 (0.00089) [2022-07-09 15:49:10,365][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:49:10,378][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000315204_322768896.pth [2022-07-09 15:49:10,381][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000313220_320737280.pth [2022-07-09 15:49:10,988][26022] Updated weights on worker 0-0, policy_version 315208 (0.00094) [2022-07-09 15:49:12,702][26022] Updated weights on worker 0-0, policy_version 315218 (0.00086) [2022-07-09 15:49:14,030][25689] Fps is (10 sec: 5508.7, 60 sec: 5614.3, 300 sec: 5628.7). Total num frames: 322789376. Throughput: 0: 4969.4. Samples: 322785328. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:14,031][25689] Avg episode reward: [(0, '-45.413')] [2022-07-09 15:49:14,475][26022] Updated weights on worker 0-0, policy_version 315228 (0.00084) [2022-07-09 15:49:16,311][26022] Updated weights on worker 0-0, policy_version 315238 (0.00093) [2022-07-09 15:49:18,123][26022] Updated weights on worker 0-0, policy_version 315248 (0.00089) [2022-07-09 15:49:19,152][25689] Fps is (10 sec: 5658.9, 60 sec: 5607.6, 300 sec: 5633.4). Total num frames: 322819072. Throughput: 0: 5826.5. Samples: 322819746. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:19,152][25689] Avg episode reward: [(0, '-44.842')] [2022-07-09 15:49:19,799][26022] Updated weights on worker 0-0, policy_version 315258 (0.00086) [2022-07-09 15:49:21,689][26022] Updated weights on worker 0-0, policy_version 315268 (0.00087) [2022-07-09 15:49:23,540][26022] Updated weights on worker 0-0, policy_version 315278 (0.00082) [2022-07-09 15:49:24,157][25689] Fps is (10 sec: 5864.6, 60 sec: 5644.5, 300 sec: 5637.5). Total num frames: 322848768. Throughput: 0: 5948.1. Samples: 322854332. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:24,158][25689] Avg episode reward: [(0, '-44.622')] [2022-07-09 15:49:25,144][26022] Updated weights on worker 0-0, policy_version 315288 (0.00089) [2022-07-09 15:49:26,998][26022] Updated weights on worker 0-0, policy_version 315298 (0.00093) [2022-07-09 15:49:28,744][26022] Updated weights on worker 0-0, policy_version 315308 (0.00086) [2022-07-09 15:49:29,203][25689] Fps is (10 sec: 5705.2, 60 sec: 5623.7, 300 sec: 5637.1). Total num frames: 322876416. Throughput: 0: 5116.3. Samples: 322871566. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:29,203][25689] Avg episode reward: [(0, '-46.534')] [2022-07-09 15:49:30,675][26022] Updated weights on worker 0-0, policy_version 315318 (0.00094) [2022-07-09 15:49:32,451][26022] Updated weights on worker 0-0, policy_version 315328 (0.00048) [2022-07-09 15:49:34,247][25689] Fps is (10 sec: 5581.8, 60 sec: 5620.3, 300 sec: 5637.4). Total num frames: 322905088. Throughput: 0: 5954.7. Samples: 322905716. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:34,248][25689] Avg episode reward: [(0, '-46.347')] [2022-07-09 15:49:34,262][26022] Updated weights on worker 0-0, policy_version 315338 (0.00079) [2022-07-09 15:49:36,021][26022] Updated weights on worker 0-0, policy_version 315348 (0.00093) [2022-07-09 15:49:37,903][26022] Updated weights on worker 0-0, policy_version 315358 (0.00090) [2022-07-09 15:49:39,377][25689] Fps is (10 sec: 5736.8, 60 sec: 5665.0, 300 sec: 5636.2). Total num frames: 322934784. Throughput: 0: 5936.4. Samples: 322939812. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:39,379][25689] Avg episode reward: [(0, '-45.942')] [2022-07-09 15:49:39,659][26022] Updated weights on worker 0-0, policy_version 315368 (0.00098) [2022-07-09 15:49:41,429][26022] Updated weights on worker 0-0, policy_version 315378 (0.00088) [2022-07-09 15:49:43,135][26022] Updated weights on worker 0-0, policy_version 315388 (0.00087) [2022-07-09 15:49:44,469][25689] Fps is (10 sec: 5810.4, 60 sec: 5659.4, 300 sec: 5641.4). Total num frames: 322964480. Throughput: 0: 5060.1. Samples: 322957110. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:44,470][25689] Avg episode reward: [(0, '-46.412')] [2022-07-09 15:49:45,020][26022] Updated weights on worker 0-0, policy_version 315398 (0.00088) [2022-07-09 15:49:46,763][26022] Updated weights on worker 0-0, policy_version 315408 (0.00092) [2022-07-09 15:49:48,538][26022] Updated weights on worker 0-0, policy_version 315418 (0.00088) [2022-07-09 15:49:49,495][25689] Fps is (10 sec: 5768.9, 60 sec: 5659.1, 300 sec: 5641.9). Total num frames: 322993152. Throughput: 0: 5917.1. Samples: 322991636. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:49,495][25689] Avg episode reward: [(0, '-46.733')] [2022-07-09 15:49:50,380][26022] Updated weights on worker 0-0, policy_version 315428 (0.00087) [2022-07-09 15:49:52,059][26022] Updated weights on worker 0-0, policy_version 315438 (0.00091) [2022-07-09 15:49:53,859][26022] Updated weights on worker 0-0, policy_version 315448 (0.00095) [2022-07-09 15:49:54,517][25689] Fps is (10 sec: 5707.2, 60 sec: 5676.6, 300 sec: 5638.7). Total num frames: 323021824. Throughput: 0: 5936.8. Samples: 323026052. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:54,517][25689] Avg episode reward: [(0, '-47.197')] [2022-07-09 15:49:55,637][26022] Updated weights on worker 0-0, policy_version 315458 (0.00092) [2022-07-09 15:49:57,477][26022] Updated weights on worker 0-0, policy_version 315468 (0.00084) [2022-07-09 15:49:59,524][26022] Updated weights on worker 0-0, policy_version 315478 (0.00092) [2022-07-09 15:49:59,591][25689] Fps is (10 sec: 5578.3, 60 sec: 5643.0, 300 sec: 5645.1). Total num frames: 323049472. Throughput: 0: 5952.1. Samples: 323060128. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:49:59,591][25689] Avg episode reward: [(0, '-45.947')] [2022-07-09 15:50:01,142][26022] Updated weights on worker 0-0, policy_version 315488 (0.00085) [2022-07-09 15:50:03,449][26022] Updated weights on worker 0-0, policy_version 315498 (0.00097) [2022-07-09 15:50:04,598][25689] Fps is (10 sec: 5484.9, 60 sec: 5660.2, 300 sec: 5638.9). Total num frames: 323077120. Throughput: 0: 5856.3. Samples: 323074992. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:50:04,599][25689] Avg episode reward: [(0, '-45.313')] [2022-07-09 15:50:05,110][26022] Updated weights on worker 0-0, policy_version 315508 (0.01198) [2022-07-09 15:50:06,912][26022] Updated weights on worker 0-0, policy_version 315518 (0.00086) [2022-07-09 15:50:08,946][26022] Updated weights on worker 0-0, policy_version 315528 (0.00085) [2022-07-09 15:50:09,688][25689] Fps is (10 sec: 5577.8, 60 sec: 5671.1, 300 sec: 5641.4). Total num frames: 323105792. Throughput: 0: 5815.5. Samples: 323109070. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:50:09,689][25689] Avg episode reward: [(0, '-45.436')] [2022-07-09 15:50:10,374][26022] Updated weights on worker 0-0, policy_version 315538 (0.00083) [2022-07-09 15:50:12,540][26022] Updated weights on worker 0-0, policy_version 315548 (0.00083) [2022-07-09 15:50:14,004][26022] Updated weights on worker 0-0, policy_version 315558 (0.00085) [2022-07-09 15:50:14,709][25689] Fps is (10 sec: 5570.2, 60 sec: 5670.2, 300 sec: 5638.5). Total num frames: 323133440. Throughput: 0: 5819.6. Samples: 323143564. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 15:50:14,710][25689] Avg episode reward: [(0, '-45.422')] [2022-07-09 15:50:15,914][26022] Updated weights on worker 0-0, policy_version 315568 (0.00088) [2022-07-09 15:50:17,696][26022] Updated weights on worker 0-0, policy_version 315578 (0.00091) [2022-07-09 15:50:19,493][26022] Updated weights on worker 0-0, policy_version 315588 (0.00085) [2022-07-09 15:50:19,780][25689] Fps is (10 sec: 5682.4, 60 sec: 5675.0, 300 sec: 5637.8). Total num frames: 323163136. Throughput: 0: 4981.5. Samples: 323160698. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:19,780][25689] Avg episode reward: [(0, '-45.153')] [2022-07-09 15:50:21,246][26022] Updated weights on worker 0-0, policy_version 315598 (0.00082) [2022-07-09 15:50:23,273][26022] Updated weights on worker 0-0, policy_version 315608 (0.00093) [2022-07-09 15:50:24,814][25689] Fps is (10 sec: 5776.2, 60 sec: 5655.4, 300 sec: 5637.4). Total num frames: 323191808. Throughput: 0: 5933.7. Samples: 323194946. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:24,815][25689] Avg episode reward: [(0, '-45.622')] [2022-07-09 15:50:24,899][26022] Updated weights on worker 0-0, policy_version 315618 (0.00088) [2022-07-09 15:50:26,733][26022] Updated weights on worker 0-0, policy_version 315628 (0.00086) [2022-07-09 15:50:28,356][26022] Updated weights on worker 0-0, policy_version 315638 (0.00085) [2022-07-09 15:50:29,836][25689] Fps is (10 sec: 5600.4, 60 sec: 5657.6, 300 sec: 5640.7). Total num frames: 323219456. Throughput: 0: 5948.3. Samples: 323228916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:29,837][25689] Avg episode reward: [(0, '-45.723')] [2022-07-09 15:50:30,317][26022] Updated weights on worker 0-0, policy_version 315648 (0.00095) [2022-07-09 15:50:32,234][26022] Updated weights on worker 0-0, policy_version 315658 (0.00084) [2022-07-09 15:50:33,879][26022] Updated weights on worker 0-0, policy_version 315668 (0.00083) [2022-07-09 15:50:34,867][25689] Fps is (10 sec: 5704.4, 60 sec: 5675.8, 300 sec: 5638.1). Total num frames: 323249152. Throughput: 0: 5073.5. Samples: 323245834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:34,867][25689] Avg episode reward: [(0, '-45.178')] [2022-07-09 15:50:35,990][26022] Updated weights on worker 0-0, policy_version 315678 (0.00090) [2022-07-09 15:50:37,547][26022] Updated weights on worker 0-0, policy_version 315688 (0.00092) [2022-07-09 15:50:39,612][26022] Updated weights on worker 0-0, policy_version 315698 (0.00094) [2022-07-09 15:50:39,944][25689] Fps is (10 sec: 5774.4, 60 sec: 5663.8, 300 sec: 5640.6). Total num frames: 323277824. Throughput: 0: 5906.8. Samples: 323279806. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:39,945][25689] Avg episode reward: [(0, '-45.416')] [2022-07-09 15:50:41,148][26022] Updated weights on worker 0-0, policy_version 315708 (0.00084) [2022-07-09 15:50:42,938][26022] Updated weights on worker 0-0, policy_version 315718 (0.00089) [2022-07-09 15:50:44,713][26022] Updated weights on worker 0-0, policy_version 315728 (0.00086) [2022-07-09 15:50:45,011][25689] Fps is (10 sec: 5652.9, 60 sec: 5649.3, 300 sec: 5643.4). Total num frames: 323306496. Throughput: 0: 5907.6. Samples: 323314260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:45,011][25689] Avg episode reward: [(0, '-45.652')] [2022-07-09 15:50:46,620][26022] Updated weights on worker 0-0, policy_version 315738 (0.00056) [2022-07-09 15:50:48,367][26022] Updated weights on worker 0-0, policy_version 315748 (0.00086) [2022-07-09 15:50:50,054][25689] Fps is (10 sec: 5570.8, 60 sec: 5630.7, 300 sec: 5632.4). Total num frames: 323334144. Throughput: 0: 5078.3. Samples: 323331588. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:50,054][25689] Avg episode reward: [(0, '-45.798')] [2022-07-09 15:50:50,246][26022] Updated weights on worker 0-0, policy_version 315758 (0.00086) [2022-07-09 15:50:51,869][26022] Updated weights on worker 0-0, policy_version 315768 (0.00090) [2022-07-09 15:50:53,756][26022] Updated weights on worker 0-0, policy_version 315778 (0.00084) [2022-07-09 15:50:55,077][25689] Fps is (10 sec: 5696.8, 60 sec: 5647.5, 300 sec: 5640.5). Total num frames: 323363840. Throughput: 0: 5940.1. Samples: 323365884. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:50:55,078][25689] Avg episode reward: [(0, '-46.197')] [2022-07-09 15:50:55,373][26022] Updated weights on worker 0-0, policy_version 315788 (0.00082) [2022-07-09 15:50:57,306][26022] Updated weights on worker 0-0, policy_version 315798 (0.00089) [2022-07-09 15:50:59,048][26022] Updated weights on worker 0-0, policy_version 315808 (0.00091) [2022-07-09 15:51:00,138][25689] Fps is (10 sec: 5889.4, 60 sec: 5682.5, 300 sec: 5650.6). Total num frames: 323393536. Throughput: 0: 5969.9. Samples: 323400364. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:00,140][25689] Avg episode reward: [(0, '-46.466')] [2022-07-09 15:51:00,811][26022] Updated weights on worker 0-0, policy_version 315818 (0.00092) [2022-07-09 15:51:03,137][26022] Updated weights on worker 0-0, policy_version 315828 (0.00080) [2022-07-09 15:51:04,779][26022] Updated weights on worker 0-0, policy_version 315838 (0.00094) [2022-07-09 15:51:05,152][25689] Fps is (10 sec: 5488.5, 60 sec: 5648.2, 300 sec: 5640.0). Total num frames: 323419136. Throughput: 0: 5018.6. Samples: 323415338. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:05,152][25689] Avg episode reward: [(0, '-46.602')] [2022-07-09 15:51:06,567][26022] Updated weights on worker 0-0, policy_version 315848 (0.00089) [2022-07-09 15:51:08,509][26022] Updated weights on worker 0-0, policy_version 315858 (0.00084) [2022-07-09 15:51:10,024][26022] Updated weights on worker 0-0, policy_version 315868 (0.00091) [2022-07-09 15:51:10,252][25689] Fps is (10 sec: 5467.5, 60 sec: 5664.1, 300 sec: 5642.1). Total num frames: 323448832. Throughput: 0: 5822.6. Samples: 323449194. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:10,254][25689] Avg episode reward: [(0, '-45.987')] [2022-07-09 15:51:10,567][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:51:10,586][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000315870_323450880.pth [2022-07-09 15:51:10,586][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000313881_321414144.pth [2022-07-09 15:51:12,165][26022] Updated weights on worker 0-0, policy_version 315878 (0.00085) [2022-07-09 15:51:13,739][26022] Updated weights on worker 0-0, policy_version 315888 (0.00087) [2022-07-09 15:51:15,307][25689] Fps is (10 sec: 5747.4, 60 sec: 5677.8, 300 sec: 5645.6). Total num frames: 323477504. Throughput: 0: 5824.6. Samples: 323483718. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:15,308][25689] Avg episode reward: [(0, '-45.857')] [2022-07-09 15:51:15,667][26022] Updated weights on worker 0-0, policy_version 315898 (0.00093) [2022-07-09 15:51:17,312][26022] Updated weights on worker 0-0, policy_version 315908 (0.00084) [2022-07-09 15:51:19,265][26022] Updated weights on worker 0-0, policy_version 315918 (0.00081) [2022-07-09 15:51:20,369][25689] Fps is (10 sec: 5567.1, 60 sec: 5644.8, 300 sec: 5641.0). Total num frames: 323505152. Throughput: 0: 4960.4. Samples: 323500714. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:20,369][25689] Avg episode reward: [(0, '-45.479')] [2022-07-09 15:51:20,998][26022] Updated weights on worker 0-0, policy_version 315928 (0.00092) [2022-07-09 15:51:22,772][26022] Updated weights on worker 0-0, policy_version 315938 (0.00098) [2022-07-09 15:51:24,535][26022] Updated weights on worker 0-0, policy_version 315948 (0.00097) [2022-07-09 15:51:25,395][25689] Fps is (10 sec: 5786.1, 60 sec: 5679.4, 300 sec: 5647.6). Total num frames: 323535872. Throughput: 0: 5920.8. Samples: 323535194. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:25,395][25689] Avg episode reward: [(0, '-45.882')] [2022-07-09 15:51:26,411][26022] Updated weights on worker 0-0, policy_version 315958 (0.00087) [2022-07-09 15:51:28,024][26022] Updated weights on worker 0-0, policy_version 315968 (0.00089) [2022-07-09 15:51:30,082][26022] Updated weights on worker 0-0, policy_version 315978 (0.00085) [2022-07-09 15:51:30,459][25689] Fps is (10 sec: 5784.7, 60 sec: 5675.5, 300 sec: 5640.1). Total num frames: 323563520. Throughput: 0: 5950.5. Samples: 323569434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:30,459][25689] Avg episode reward: [(0, '-45.877')] [2022-07-09 15:51:31,721][26022] Updated weights on worker 0-0, policy_version 315988 (0.00087) [2022-07-09 15:51:33,443][26022] Updated weights on worker 0-0, policy_version 315998 (0.00088) [2022-07-09 15:51:35,469][25689] Fps is (10 sec: 5590.4, 60 sec: 5660.5, 300 sec: 5648.7). Total num frames: 323592192. Throughput: 0: 5103.7. Samples: 323586618. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:35,459][26022] Updated weights on worker 0-0, policy_version 316008 (0.00083) [2022-07-09 15:51:35,470][25689] Avg episode reward: [(0, '-45.354')] [2022-07-09 15:51:36,911][26022] Updated weights on worker 0-0, policy_version 316018 (0.00097) [2022-07-09 15:51:39,103][26022] Updated weights on worker 0-0, policy_version 316028 (0.00085) [2022-07-09 15:51:40,513][25689] Fps is (10 sec: 5907.3, 60 sec: 5697.5, 300 sec: 5655.7). Total num frames: 323622912. Throughput: 0: 5957.9. Samples: 323620730. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:40,513][25689] Avg episode reward: [(0, '-45.582')] [2022-07-09 15:51:40,518][26022] Updated weights on worker 0-0, policy_version 316038 (0.00086) [2022-07-09 15:51:42,450][26022] Updated weights on worker 0-0, policy_version 316048 (0.00082) [2022-07-09 15:51:44,273][26022] Updated weights on worker 0-0, policy_version 316058 (0.00095) [2022-07-09 15:51:45,523][25689] Fps is (10 sec: 5805.7, 60 sec: 5685.9, 300 sec: 5649.1). Total num frames: 323650560. Throughput: 0: 5990.5. Samples: 323655770. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:45,523][25689] Avg episode reward: [(0, '-46.919')] [2022-07-09 15:51:45,961][26022] Updated weights on worker 0-0, policy_version 316068 (0.00091) [2022-07-09 15:51:47,795][26022] Updated weights on worker 0-0, policy_version 316078 (0.00078) [2022-07-09 15:51:49,651][26022] Updated weights on worker 0-0, policy_version 316088 (0.00086) [2022-07-09 15:51:50,526][25689] Fps is (10 sec: 5624.2, 60 sec: 5706.5, 300 sec: 5647.2). Total num frames: 323679232. Throughput: 0: 5166.0. Samples: 323673104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:50,527][25689] Avg episode reward: [(0, '-47.387')] [2022-07-09 15:51:51,193][26022] Updated weights on worker 0-0, policy_version 316098 (0.00086) [2022-07-09 15:51:53,171][26022] Updated weights on worker 0-0, policy_version 316108 (0.00089) [2022-07-09 15:51:54,811][26022] Updated weights on worker 0-0, policy_version 316118 (0.00081) [2022-07-09 15:51:55,545][25689] Fps is (10 sec: 5721.6, 60 sec: 5690.0, 300 sec: 5647.7). Total num frames: 323707904. Throughput: 0: 6049.0. Samples: 323708058. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:51:55,546][25689] Avg episode reward: [(0, '-47.798')] [2022-07-09 15:51:56,625][26022] Updated weights on worker 0-0, policy_version 316128 (0.00090) [2022-07-09 15:51:58,515][26022] Updated weights on worker 0-0, policy_version 316138 (0.00095) [2022-07-09 15:51:59,958][26022] Updated weights on worker 0-0, policy_version 316148 (0.00085) [2022-07-09 15:52:00,614][25689] Fps is (10 sec: 5887.5, 60 sec: 5706.2, 300 sec: 5660.4). Total num frames: 323738624. Throughput: 0: 6070.9. Samples: 323742766. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:00,615][25689] Avg episode reward: [(0, '-48.412')] [2022-07-09 15:52:01,991][26022] Updated weights on worker 0-0, policy_version 316158 (0.00074) [2022-07-09 15:52:04,112][26022] Updated weights on worker 0-0, policy_version 316168 (0.00088) [2022-07-09 15:52:05,669][25689] Fps is (10 sec: 5664.2, 60 sec: 5719.2, 300 sec: 5659.8). Total num frames: 323765248. Throughput: 0: 5066.2. Samples: 323757834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:05,670][25689] Avg episode reward: [(0, '-48.485')] [2022-07-09 15:52:05,752][26022] Updated weights on worker 0-0, policy_version 316178 (0.00094) [2022-07-09 15:52:07,754][26022] Updated weights on worker 0-0, policy_version 316188 (0.00088) [2022-07-09 15:52:09,378][26022] Updated weights on worker 0-0, policy_version 316198 (0.00091) [2022-07-09 15:52:10,727][25689] Fps is (10 sec: 5265.6, 60 sec: 5672.4, 300 sec: 5648.7). Total num frames: 323791872. Throughput: 0: 5875.8. Samples: 323791798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:10,728][25689] Avg episode reward: [(0, '-48.119')] [2022-07-09 15:52:11,268][26022] Updated weights on worker 0-0, policy_version 316208 (0.00086) [2022-07-09 15:52:13,181][26022] Updated weights on worker 0-0, policy_version 316218 (0.00051) [2022-07-09 15:52:14,776][26022] Updated weights on worker 0-0, policy_version 316228 (0.00088) [2022-07-09 15:52:15,802][25689] Fps is (10 sec: 5557.9, 60 sec: 5687.5, 300 sec: 5651.8). Total num frames: 323821568. Throughput: 0: 5847.8. Samples: 323826518. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:15,803][25689] Avg episode reward: [(0, '-47.746')] [2022-07-09 15:52:16,674][26022] Updated weights on worker 0-0, policy_version 316238 (0.00090) [2022-07-09 15:52:18,386][26022] Updated weights on worker 0-0, policy_version 316248 (0.00091) [2022-07-09 15:52:20,123][26022] Updated weights on worker 0-0, policy_version 316258 (0.00085) [2022-07-09 15:52:20,843][25689] Fps is (10 sec: 5870.8, 60 sec: 5723.2, 300 sec: 5658.0). Total num frames: 323851264. Throughput: 0: 4998.8. Samples: 323843888. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:20,845][25689] Avg episode reward: [(0, '-48.251')] [2022-07-09 15:52:22,049][26022] Updated weights on worker 0-0, policy_version 316268 (0.00089) [2022-07-09 15:52:23,699][26022] Updated weights on worker 0-0, policy_version 316278 (0.00090) [2022-07-09 15:52:25,662][26022] Updated weights on worker 0-0, policy_version 316288 (0.00082) [2022-07-09 15:52:25,930][25689] Fps is (10 sec: 5864.2, 60 sec: 5700.6, 300 sec: 5660.1). Total num frames: 323880960. Throughput: 0: 5940.2. Samples: 323878190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:25,931][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 15:52:27,571][26022] Updated weights on worker 0-0, policy_version 316298 (0.00094) [2022-07-09 15:52:29,030][26022] Updated weights on worker 0-0, policy_version 316308 (0.00090) [2022-07-09 15:52:30,987][25689] Fps is (10 sec: 5653.0, 60 sec: 5701.2, 300 sec: 5652.5). Total num frames: 323908608. Throughput: 0: 5954.9. Samples: 323912448. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:30,988][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 15:52:31,006][26022] Updated weights on worker 0-0, policy_version 316318 (0.00086) [2022-07-09 15:52:32,786][26022] Updated weights on worker 0-0, policy_version 316328 (0.00085) [2022-07-09 15:52:34,455][26022] Updated weights on worker 0-0, policy_version 316338 (0.00095) [2022-07-09 15:52:36,015][25689] Fps is (10 sec: 5584.5, 60 sec: 5699.6, 300 sec: 5657.2). Total num frames: 323937280. Throughput: 0: 5955.0. Samples: 323946886. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:36,017][25689] Avg episode reward: [(0, '-46.879')] [2022-07-09 15:52:36,507][26022] Updated weights on worker 0-0, policy_version 316348 (0.00079) [2022-07-09 15:52:38,111][26022] Updated weights on worker 0-0, policy_version 316358 (0.00093) [2022-07-09 15:52:40,056][26022] Updated weights on worker 0-0, policy_version 316368 (0.00092) [2022-07-09 15:52:41,098][25689] Fps is (10 sec: 5874.5, 60 sec: 5695.9, 300 sec: 5666.1). Total num frames: 323968000. Throughput: 0: 5914.8. Samples: 323963688. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-09 15:52:41,098][25689] Avg episode reward: [(0, '-46.368')] [2022-07-09 15:52:41,876][26022] Updated weights on worker 0-0, policy_version 316378 (0.00085) [2022-07-09 15:52:43,451][26022] Updated weights on worker 0-0, policy_version 316388 (0.00087) [2022-07-09 15:52:45,455][26022] Updated weights on worker 0-0, policy_version 316398 (0.00084) [2022-07-09 15:52:46,133][25689] Fps is (10 sec: 5869.8, 60 sec: 5710.4, 300 sec: 5662.3). Total num frames: 323996672. Throughput: 0: 5954.0. Samples: 323998480. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:52:46,134][25689] Avg episode reward: [(0, '-46.702')] [2022-07-09 15:52:47,139][26022] Updated weights on worker 0-0, policy_version 316408 (0.00086) [2022-07-09 15:52:48,821][26022] Updated weights on worker 0-0, policy_version 316418 (0.00089) [2022-07-09 15:52:50,730][26022] Updated weights on worker 0-0, policy_version 316428 (0.00094) [2022-07-09 15:52:51,136][25689] Fps is (10 sec: 5508.7, 60 sec: 5676.8, 300 sec: 5660.0). Total num frames: 324023296. Throughput: 0: 5992.0. Samples: 324033176. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:52:51,136][25689] Avg episode reward: [(0, '-47.196')] [2022-07-09 15:52:52,276][26022] Updated weights on worker 0-0, policy_version 316438 (0.00082) [2022-07-09 15:52:54,321][26022] Updated weights on worker 0-0, policy_version 316448 (0.00095) [2022-07-09 15:52:55,959][26022] Updated weights on worker 0-0, policy_version 316458 (0.00091) [2022-07-09 15:52:56,171][25689] Fps is (10 sec: 5611.1, 60 sec: 5692.1, 300 sec: 5664.6). Total num frames: 324052992. Throughput: 0: 5126.8. Samples: 324050218. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:52:56,171][25689] Avg episode reward: [(0, '-47.201')] [2022-07-09 15:52:57,800][26022] Updated weights on worker 0-0, policy_version 316468 (0.00090) [2022-07-09 15:52:59,610][26022] Updated weights on worker 0-0, policy_version 316478 (0.00091) [2022-07-09 15:53:01,223][25689] Fps is (10 sec: 5989.0, 60 sec: 5693.7, 300 sec: 5674.3). Total num frames: 324083712. Throughput: 0: 6013.7. Samples: 324084718. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:01,224][25689] Avg episode reward: [(0, '-46.902')] [2022-07-09 15:53:01,226][26022] Updated weights on worker 0-0, policy_version 316488 (0.00085) [2022-07-09 15:53:03,393][26022] Updated weights on worker 0-0, policy_version 316498 (0.00052) [2022-07-09 15:53:05,394][26022] Updated weights on worker 0-0, policy_version 316508 (0.00092) [2022-07-09 15:53:06,267][25689] Fps is (10 sec: 5679.8, 60 sec: 5694.7, 300 sec: 5670.2). Total num frames: 324110336. Throughput: 0: 5911.2. Samples: 324117492. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:06,267][25689] Avg episode reward: [(0, '-47.049')] [2022-07-09 15:53:06,937][26022] Updated weights on worker 0-0, policy_version 316518 (0.00080) [2022-07-09 15:53:08,902][26022] Updated weights on worker 0-0, policy_version 316528 (0.00086) [2022-07-09 15:53:10,533][26022] Updated weights on worker 0-0, policy_version 316538 (0.00087) [2022-07-09 15:53:10,770][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:53:10,790][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000316539_324135936.pth [2022-07-09 15:53:10,791][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000314545_322094080.pth [2022-07-09 15:53:11,291][25689] Fps is (10 sec: 5492.3, 60 sec: 5731.7, 300 sec: 5673.3). Total num frames: 324139008. Throughput: 0: 5038.7. Samples: 324134734. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:11,292][25689] Avg episode reward: [(0, '-46.163')] [2022-07-09 15:53:12,368][26022] Updated weights on worker 0-0, policy_version 316548 (0.00092) [2022-07-09 15:53:14,074][26022] Updated weights on worker 0-0, policy_version 316558 (0.00089) [2022-07-09 15:53:16,095][26022] Updated weights on worker 0-0, policy_version 316568 (0.00084) [2022-07-09 15:53:16,311][25689] Fps is (10 sec: 5607.4, 60 sec: 5703.1, 300 sec: 5667.1). Total num frames: 324166656. Throughput: 0: 5915.5. Samples: 324169356. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:16,311][25689] Avg episode reward: [(0, '-45.102')] [2022-07-09 15:53:17,605][26022] Updated weights on worker 0-0, policy_version 316578 (0.00089) [2022-07-09 15:53:19,628][26022] Updated weights on worker 0-0, policy_version 316588 (0.00095) [2022-07-09 15:53:21,221][26022] Updated weights on worker 0-0, policy_version 316598 (0.00094) [2022-07-09 15:53:21,358][25689] Fps is (10 sec: 5798.0, 60 sec: 5719.5, 300 sec: 5677.2). Total num frames: 324197376. Throughput: 0: 5917.8. Samples: 324203872. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:21,359][25689] Avg episode reward: [(0, '-45.243')] [2022-07-09 15:53:23,050][26022] Updated weights on worker 0-0, policy_version 316608 (0.00089) [2022-07-09 15:53:24,787][26022] Updated weights on worker 0-0, policy_version 316618 (0.00089) [2022-07-09 15:53:26,369][25689] Fps is (10 sec: 5904.7, 60 sec: 5709.7, 300 sec: 5677.1). Total num frames: 324226048. Throughput: 0: 5154.6. Samples: 324221112. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:26,371][25689] Avg episode reward: [(0, '-45.046')] [2022-07-09 15:53:26,515][26022] Updated weights on worker 0-0, policy_version 316628 (0.00094) [2022-07-09 15:53:28,434][26022] Updated weights on worker 0-0, policy_version 316638 (0.00086) [2022-07-09 15:53:30,222][26022] Updated weights on worker 0-0, policy_version 316648 (0.00083) [2022-07-09 15:53:31,382][25689] Fps is (10 sec: 5720.6, 60 sec: 5730.8, 300 sec: 5677.0). Total num frames: 324254720. Throughput: 0: 6014.1. Samples: 324255564. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:31,384][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 15:53:31,931][26022] Updated weights on worker 0-0, policy_version 316658 (0.00093) [2022-07-09 15:53:33,753][26022] Updated weights on worker 0-0, policy_version 316668 (0.00082) [2022-07-09 15:53:35,554][26022] Updated weights on worker 0-0, policy_version 316678 (0.00088) [2022-07-09 15:53:36,397][25689] Fps is (10 sec: 5616.3, 60 sec: 5715.1, 300 sec: 5681.3). Total num frames: 324282368. Throughput: 0: 6017.4. Samples: 324290224. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:36,399][25689] Avg episode reward: [(0, '-45.892')] [2022-07-09 15:53:37,527][26022] Updated weights on worker 0-0, policy_version 316688 (0.00093) [2022-07-09 15:53:39,007][26022] Updated weights on worker 0-0, policy_version 316698 (0.00081) [2022-07-09 15:53:41,029][26022] Updated weights on worker 0-0, policy_version 316708 (0.00079) [2022-07-09 15:53:41,452][25689] Fps is (10 sec: 5796.4, 60 sec: 5717.7, 300 sec: 5684.3). Total num frames: 324313088. Throughput: 0: 5144.4. Samples: 324307244. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:41,453][25689] Avg episode reward: [(0, '-46.803')] [2022-07-09 15:53:42,691][26022] Updated weights on worker 0-0, policy_version 316718 (0.00084) [2022-07-09 15:53:44,450][26022] Updated weights on worker 0-0, policy_version 316728 (0.00092) [2022-07-09 15:53:46,183][26022] Updated weights on worker 0-0, policy_version 316738 (0.00100) [2022-07-09 15:53:46,464][25689] Fps is (10 sec: 5798.2, 60 sec: 5703.0, 300 sec: 5681.1). Total num frames: 324340736. Throughput: 0: 6014.2. Samples: 324341964. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:46,464][25689] Avg episode reward: [(0, '-46.085')] [2022-07-09 15:53:48,015][26022] Updated weights on worker 0-0, policy_version 316748 (0.00675) [2022-07-09 15:53:49,779][26022] Updated weights on worker 0-0, policy_version 316758 (0.00086) [2022-07-09 15:53:51,483][25689] Fps is (10 sec: 5614.7, 60 sec: 5735.3, 300 sec: 5684.7). Total num frames: 324369408. Throughput: 0: 6017.4. Samples: 324376516. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:51,483][25689] Avg episode reward: [(0, '-46.185')] [2022-07-09 15:53:51,566][26022] Updated weights on worker 0-0, policy_version 316768 (0.00086) [2022-07-09 15:53:53,476][26022] Updated weights on worker 0-0, policy_version 316778 (0.00087) [2022-07-09 15:53:55,077][26022] Updated weights on worker 0-0, policy_version 316788 (0.00083) [2022-07-09 15:53:56,490][25689] Fps is (10 sec: 5719.3, 60 sec: 5721.0, 300 sec: 5682.6). Total num frames: 324398080. Throughput: 0: 5144.7. Samples: 324393596. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:53:56,492][25689] Avg episode reward: [(0, '-46.344')] [2022-07-09 15:53:57,172][26022] Updated weights on worker 0-0, policy_version 316798 (0.00087) [2022-07-09 15:53:58,693][26022] Updated weights on worker 0-0, policy_version 316808 (0.00084) [2022-07-09 15:54:00,746][26022] Updated weights on worker 0-0, policy_version 316818 (0.00085) [2022-07-09 15:54:01,537][25689] Fps is (10 sec: 5805.5, 60 sec: 5704.6, 300 sec: 5692.2). Total num frames: 324427776. Throughput: 0: 6007.2. Samples: 324427896. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:01,537][25689] Avg episode reward: [(0, '-46.104')] [2022-07-09 15:54:02,544][26022] Updated weights on worker 0-0, policy_version 316828 (0.00090) [2022-07-09 15:54:04,643][26022] Updated weights on worker 0-0, policy_version 316838 (0.00098) [2022-07-09 15:54:06,334][26022] Updated weights on worker 0-0, policy_version 316848 (0.00094) [2022-07-09 15:54:06,591][25689] Fps is (10 sec: 5474.3, 60 sec: 5686.6, 300 sec: 5684.8). Total num frames: 324453376. Throughput: 0: 5868.2. Samples: 324460076. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:06,592][25689] Avg episode reward: [(0, '-46.093')] [2022-07-09 15:54:08,034][26022] Updated weights on worker 0-0, policy_version 316858 (0.00098) [2022-07-09 15:54:09,984][26022] Updated weights on worker 0-0, policy_version 316868 (0.00260) [2022-07-09 15:54:11,605][25689] Fps is (10 sec: 5390.4, 60 sec: 5687.6, 300 sec: 5688.2). Total num frames: 324482048. Throughput: 0: 5003.6. Samples: 324477200. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:11,605][25689] Avg episode reward: [(0, '-46.844')] [2022-07-09 15:54:11,908][26022] Updated weights on worker 0-0, policy_version 316878 (0.00086) [2022-07-09 15:54:13,421][26022] Updated weights on worker 0-0, policy_version 316888 (0.00087) [2022-07-09 15:54:15,509][26022] Updated weights on worker 0-0, policy_version 316898 (0.00088) [2022-07-09 15:54:16,625][25689] Fps is (10 sec: 5817.0, 60 sec: 5721.5, 300 sec: 5690.2). Total num frames: 324511744. Throughput: 0: 5857.8. Samples: 324511544. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:16,626][25689] Avg episode reward: [(0, '-46.749')] [2022-07-09 15:54:17,042][26022] Updated weights on worker 0-0, policy_version 316908 (0.00087) [2022-07-09 15:54:18,934][26022] Updated weights on worker 0-0, policy_version 316918 (0.00129) [2022-07-09 15:54:20,553][26022] Updated weights on worker 0-0, policy_version 316928 (0.00104) [2022-07-09 15:54:21,754][25689] Fps is (10 sec: 5650.0, 60 sec: 5662.9, 300 sec: 5680.9). Total num frames: 324539392. Throughput: 0: 5840.4. Samples: 324545976. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:21,755][25689] Avg episode reward: [(0, '-46.984')] [2022-07-09 15:54:22,552][26022] Updated weights on worker 0-0, policy_version 316938 (0.00093) [2022-07-09 15:54:24,371][26022] Updated weights on worker 0-0, policy_version 316948 (0.00082) [2022-07-09 15:54:25,993][26022] Updated weights on worker 0-0, policy_version 316958 (0.00092) [2022-07-09 15:54:26,780][25689] Fps is (10 sec: 5546.1, 60 sec: 5661.5, 300 sec: 5684.7). Total num frames: 324568064. Throughput: 0: 5103.0. Samples: 324563100. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:26,781][25689] Avg episode reward: [(0, '-46.956')] [2022-07-09 15:54:27,699][26022] Updated weights on worker 0-0, policy_version 316968 (0.00083) [2022-07-09 15:54:29,532][26022] Updated weights on worker 0-0, policy_version 316978 (0.00095) [2022-07-09 15:54:31,244][26022] Updated weights on worker 0-0, policy_version 316988 (0.00085) [2022-07-09 15:54:31,819][25689] Fps is (10 sec: 5901.3, 60 sec: 5693.0, 300 sec: 5691.7). Total num frames: 324598784. Throughput: 0: 5975.2. Samples: 324597980. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:31,819][25689] Avg episode reward: [(0, '-46.894')] [2022-07-09 15:54:33,083][26022] Updated weights on worker 0-0, policy_version 316998 (0.00088) [2022-07-09 15:54:34,807][26022] Updated weights on worker 0-0, policy_version 317008 (0.00080) [2022-07-09 15:54:36,523][26022] Updated weights on worker 0-0, policy_version 317018 (0.00086) [2022-07-09 15:54:36,847][25689] Fps is (10 sec: 5899.9, 60 sec: 5708.7, 300 sec: 5690.2). Total num frames: 324627456. Throughput: 0: 6003.0. Samples: 324632932. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:36,847][25689] Avg episode reward: [(0, '-46.145')] [2022-07-09 15:54:38,366][26022] Updated weights on worker 0-0, policy_version 317028 (0.00090) [2022-07-09 15:54:40,143][26022] Updated weights on worker 0-0, policy_version 317038 (0.00087) [2022-07-09 15:54:41,874][26022] Updated weights on worker 0-0, policy_version 317048 (0.00081) [2022-07-09 15:54:41,970][25689] Fps is (10 sec: 5749.5, 60 sec: 5685.3, 300 sec: 5689.6). Total num frames: 324657152. Throughput: 0: 5170.9. Samples: 324650510. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:41,971][25689] Avg episode reward: [(0, '-46.002')] [2022-07-09 15:54:43,604][26022] Updated weights on worker 0-0, policy_version 317058 (0.00085) [2022-07-09 15:54:45,324][26022] Updated weights on worker 0-0, policy_version 317068 (0.00084) [2022-07-09 15:54:46,987][25689] Fps is (10 sec: 5957.8, 60 sec: 5735.6, 300 sec: 5696.7). Total num frames: 324687872. Throughput: 0: 6069.1. Samples: 324685740. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:46,988][25689] Avg episode reward: [(0, '-45.835')] [2022-07-09 15:54:46,990][26022] Updated weights on worker 0-0, policy_version 317078 (0.00084) [2022-07-09 15:54:49,043][26022] Updated weights on worker 0-0, policy_version 317088 (0.00089) [2022-07-09 15:54:50,497][26022] Updated weights on worker 0-0, policy_version 317098 (0.00090) [2022-07-09 15:54:52,036][25689] Fps is (10 sec: 5697.0, 60 sec: 5698.9, 300 sec: 5689.3). Total num frames: 324714496. Throughput: 0: 6063.9. Samples: 324720576. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:52,037][25689] Avg episode reward: [(0, '-45.417')] [2022-07-09 15:54:52,536][26022] Updated weights on worker 0-0, policy_version 317108 (0.00081) [2022-07-09 15:54:53,879][26022] Updated weights on worker 0-0, policy_version 317118 (0.00086) [2022-07-09 15:54:55,823][26022] Updated weights on worker 0-0, policy_version 317128 (0.00089) [2022-07-09 15:54:57,131][25689] Fps is (10 sec: 5754.1, 60 sec: 5741.4, 300 sec: 5702.7). Total num frames: 324746240. Throughput: 0: 6043.5. Samples: 324755520. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:54:57,133][25689] Avg episode reward: [(0, '-45.429')] [2022-07-09 15:54:57,699][26022] Updated weights on worker 0-0, policy_version 317138 (0.00085) [2022-07-09 15:54:59,586][26022] Updated weights on worker 0-0, policy_version 317148 (0.00091) [2022-07-09 15:55:01,099][26022] Updated weights on worker 0-0, policy_version 317158 (0.00096) [2022-07-09 15:55:02,256][25689] Fps is (10 sec: 5711.0, 60 sec: 5683.3, 300 sec: 5697.0). Total num frames: 324772864. Throughput: 0: 6033.9. Samples: 324772914. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:55:02,257][25689] Avg episode reward: [(0, '-45.072')] [2022-07-09 15:55:03,411][26022] Updated weights on worker 0-0, policy_version 317168 (0.00081) [2022-07-09 15:55:05,177][26022] Updated weights on worker 0-0, policy_version 317178 (0.00092) [2022-07-09 15:55:06,883][26022] Updated weights on worker 0-0, policy_version 317188 (0.00081) [2022-07-09 15:55:07,310][25689] Fps is (10 sec: 5633.5, 60 sec: 5767.7, 300 sec: 5704.5). Total num frames: 324803584. Throughput: 0: 5883.1. Samples: 324805302. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:55:07,311][25689] Avg episode reward: [(0, '-45.233')] [2022-07-09 15:55:08,861][26022] Updated weights on worker 0-0, policy_version 317198 (0.00093) [2022-07-09 15:55:10,458][26022] Updated weights on worker 0-0, policy_version 317208 (0.00088) [2022-07-09 15:55:10,872][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:55:10,882][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000317211_324824064.pth [2022-07-09 15:55:10,894][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000315204_322768896.pth [2022-07-09 15:55:12,326][25689] Fps is (10 sec: 5796.6, 60 sec: 5750.6, 300 sec: 5704.6). Total num frames: 324831232. Throughput: 0: 5887.4. Samples: 324840030. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-09 15:55:12,327][25689] Avg episode reward: [(0, '-45.400')] [2022-07-09 15:55:12,330][26022] Updated weights on worker 0-0, policy_version 317218 (0.00092) [2022-07-09 15:55:13,969][26022] Updated weights on worker 0-0, policy_version 317228 (0.00104) [2022-07-09 15:55:15,638][26022] Updated weights on worker 0-0, policy_version 317238 (0.00084) [2022-07-09 15:55:17,401][25689] Fps is (10 sec: 5581.5, 60 sec: 5728.6, 300 sec: 5701.1). Total num frames: 324859904. Throughput: 0: 5029.1. Samples: 324857458. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:17,401][25689] Avg episode reward: [(0, '-45.991')] [2022-07-09 15:55:17,532][26022] Updated weights on worker 0-0, policy_version 317248 (0.00093) [2022-07-09 15:55:19,404][26022] Updated weights on worker 0-0, policy_version 317258 (0.00095) [2022-07-09 15:55:21,296][26022] Updated weights on worker 0-0, policy_version 317268 (0.00089) [2022-07-09 15:55:22,533][25689] Fps is (10 sec: 5617.8, 60 sec: 5745.1, 300 sec: 5699.2). Total num frames: 324888576. Throughput: 0: 5869.3. Samples: 324891926. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:22,534][25689] Avg episode reward: [(0, '-45.642')] [2022-07-09 15:55:22,883][26022] Updated weights on worker 0-0, policy_version 317278 (0.00082) [2022-07-09 15:55:24,520][26022] Updated weights on worker 0-0, policy_version 317288 (0.00088) [2022-07-09 15:55:26,382][26022] Updated weights on worker 0-0, policy_version 317298 (0.00084) [2022-07-09 15:55:27,552][25689] Fps is (10 sec: 5951.4, 60 sec: 5796.3, 300 sec: 5713.0). Total num frames: 324920320. Throughput: 0: 6000.9. Samples: 324926772. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:27,553][25689] Avg episode reward: [(0, '-45.115')] [2022-07-09 15:55:28,223][26022] Updated weights on worker 0-0, policy_version 317308 (0.00092) [2022-07-09 15:55:29,982][26022] Updated weights on worker 0-0, policy_version 317318 (0.00081) [2022-07-09 15:55:31,946][26022] Updated weights on worker 0-0, policy_version 317328 (0.00078) [2022-07-09 15:55:32,594][25689] Fps is (10 sec: 5903.5, 60 sec: 5745.5, 300 sec: 5705.9). Total num frames: 324947968. Throughput: 0: 5135.5. Samples: 324944116. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:32,594][25689] Avg episode reward: [(0, '-45.103')] [2022-07-09 15:55:33,480][26022] Updated weights on worker 0-0, policy_version 317338 (0.00087) [2022-07-09 15:55:35,388][26022] Updated weights on worker 0-0, policy_version 317348 (0.00092) [2022-07-09 15:55:37,111][26022] Updated weights on worker 0-0, policy_version 317358 (0.00094) [2022-07-09 15:55:37,615][25689] Fps is (10 sec: 5698.8, 60 sec: 5763.0, 300 sec: 5710.5). Total num frames: 324977664. Throughput: 0: 6007.4. Samples: 324978892. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:37,615][25689] Avg episode reward: [(0, '-44.774')] [2022-07-09 15:55:38,954][26022] Updated weights on worker 0-0, policy_version 317368 (0.00081) [2022-07-09 15:55:40,687][26022] Updated weights on worker 0-0, policy_version 317378 (0.00094) [2022-07-09 15:55:42,541][26022] Updated weights on worker 0-0, policy_version 317388 (0.00088) [2022-07-09 15:55:42,695][25689] Fps is (10 sec: 5879.6, 60 sec: 5767.1, 300 sec: 5713.6). Total num frames: 325007360. Throughput: 0: 6025.2. Samples: 325013406. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:42,696][25689] Avg episode reward: [(0, '-45.226')] [2022-07-09 15:55:44,178][26022] Updated weights on worker 0-0, policy_version 317398 (0.00089) [2022-07-09 15:55:46,042][26022] Updated weights on worker 0-0, policy_version 317408 (0.00091) [2022-07-09 15:55:47,554][26022] Updated weights on worker 0-0, policy_version 317418 (0.00085) [2022-07-09 15:55:47,718][25689] Fps is (10 sec: 5776.9, 60 sec: 5732.8, 300 sec: 5717.5). Total num frames: 325036032. Throughput: 0: 5157.0. Samples: 325030768. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:47,719][25689] Avg episode reward: [(0, '-44.512')] [2022-07-09 15:55:49,614][26022] Updated weights on worker 0-0, policy_version 317428 (0.00088) [2022-07-09 15:55:51,171][26022] Updated weights on worker 0-0, policy_version 317438 (0.00086) [2022-07-09 15:55:52,744][25689] Fps is (10 sec: 5502.9, 60 sec: 5735.0, 300 sec: 5707.1). Total num frames: 325062656. Throughput: 0: 6023.8. Samples: 325065496. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:52,744][25689] Avg episode reward: [(0, '-44.391')] [2022-07-09 15:55:53,063][26022] Updated weights on worker 0-0, policy_version 317448 (0.00102) [2022-07-09 15:55:54,955][26022] Updated weights on worker 0-0, policy_version 317458 (0.00087) [2022-07-09 15:55:56,595][26022] Updated weights on worker 0-0, policy_version 317468 (0.00088) [2022-07-09 15:55:57,767][25689] Fps is (10 sec: 5604.8, 60 sec: 5708.0, 300 sec: 5707.8). Total num frames: 325092352. Throughput: 0: 6006.4. Samples: 325099934. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:55:57,767][25689] Avg episode reward: [(0, '-43.967')] [2022-07-09 15:55:58,660][26022] Updated weights on worker 0-0, policy_version 317478 (0.00088) [2022-07-09 15:56:00,160][26022] Updated weights on worker 0-0, policy_version 317488 (0.00112) [2022-07-09 15:56:02,397][26022] Updated weights on worker 0-0, policy_version 317498 (0.00085) [2022-07-09 15:56:02,921][25689] Fps is (10 sec: 5735.2, 60 sec: 5739.1, 300 sec: 5715.5). Total num frames: 325121024. Throughput: 0: 5134.0. Samples: 325117244. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:02,922][25689] Avg episode reward: [(0, '-44.148')] [2022-07-09 15:56:04,121][26022] Updated weights on worker 0-0, policy_version 317508 (0.00106) [2022-07-09 15:56:05,803][26022] Updated weights on worker 0-0, policy_version 317518 (0.00087) [2022-07-09 15:56:07,728][26022] Updated weights on worker 0-0, policy_version 317528 (0.00083) [2022-07-09 15:56:07,973][25689] Fps is (10 sec: 5618.4, 60 sec: 5705.5, 300 sec: 5712.9). Total num frames: 325149696. Throughput: 0: 5875.0. Samples: 325149766. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:07,974][25689] Avg episode reward: [(0, '-43.646')] [2022-07-09 15:56:09,484][26022] Updated weights on worker 0-0, policy_version 317538 (0.00086) [2022-07-09 15:56:11,200][26022] Updated weights on worker 0-0, policy_version 317548 (0.00093) [2022-07-09 15:56:13,002][25689] Fps is (10 sec: 5688.2, 60 sec: 5721.1, 300 sec: 5713.4). Total num frames: 325178368. Throughput: 0: 5865.1. Samples: 325184314. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:13,003][25689] Avg episode reward: [(0, '-43.796')] [2022-07-09 15:56:13,179][26022] Updated weights on worker 0-0, policy_version 317558 (0.00090) [2022-07-09 15:56:14,771][26022] Updated weights on worker 0-0, policy_version 317568 (0.00086) [2022-07-09 15:56:16,551][26022] Updated weights on worker 0-0, policy_version 317578 (0.00090) [2022-07-09 15:56:18,008][25689] Fps is (10 sec: 5816.5, 60 sec: 5744.5, 300 sec: 5721.4). Total num frames: 325208064. Throughput: 0: 5033.1. Samples: 325201810. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:18,009][25689] Avg episode reward: [(0, '-43.735')] [2022-07-09 15:56:18,537][26022] Updated weights on worker 0-0, policy_version 317588 (0.00082) [2022-07-09 15:56:20,080][26022] Updated weights on worker 0-0, policy_version 317598 (0.00088) [2022-07-09 15:56:21,979][26022] Updated weights on worker 0-0, policy_version 317608 (0.00089) [2022-07-09 15:56:23,055][25689] Fps is (10 sec: 5806.4, 60 sec: 5752.7, 300 sec: 5714.1). Total num frames: 325236736. Throughput: 0: 5922.0. Samples: 325236476. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:23,055][25689] Avg episode reward: [(0, '-44.586')] [2022-07-09 15:56:23,541][26022] Updated weights on worker 0-0, policy_version 317618 (0.00087) [2022-07-09 15:56:25,278][26022] Updated weights on worker 0-0, policy_version 317628 (0.00085) [2022-07-09 15:56:27,396][26022] Updated weights on worker 0-0, policy_version 317638 (0.00091) [2022-07-09 15:56:28,071][25689] Fps is (10 sec: 5800.6, 60 sec: 5719.1, 300 sec: 5721.9). Total num frames: 325266432. Throughput: 0: 6038.6. Samples: 325271126. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:28,071][25689] Avg episode reward: [(0, '-44.584')] [2022-07-09 15:56:28,913][26022] Updated weights on worker 0-0, policy_version 317648 (0.00084) [2022-07-09 15:56:30,784][26022] Updated weights on worker 0-0, policy_version 317658 (0.00086) [2022-07-09 15:56:32,631][26022] Updated weights on worker 0-0, policy_version 317668 (0.00087) [2022-07-09 15:56:33,082][25689] Fps is (10 sec: 5718.9, 60 sec: 5722.0, 300 sec: 5718.5). Total num frames: 325294080. Throughput: 0: 5188.8. Samples: 325288502. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:33,082][25689] Avg episode reward: [(0, '-44.301')] [2022-07-09 15:56:34,211][26022] Updated weights on worker 0-0, policy_version 317678 (0.00086) [2022-07-09 15:56:36,211][26022] Updated weights on worker 0-0, policy_version 317688 (0.00638) [2022-07-09 15:56:37,843][26022] Updated weights on worker 0-0, policy_version 317698 (0.00478) [2022-07-09 15:56:38,095][25689] Fps is (10 sec: 5618.5, 60 sec: 5705.8, 300 sec: 5712.2). Total num frames: 325322752. Throughput: 0: 6045.0. Samples: 325323232. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:38,095][25689] Avg episode reward: [(0, '-43.672')] [2022-07-09 15:56:39,659][26022] Updated weights on worker 0-0, policy_version 317708 (0.00086) [2022-07-09 15:56:41,576][26022] Updated weights on worker 0-0, policy_version 317718 (0.00115) [2022-07-09 15:56:43,104][26022] Updated weights on worker 0-0, policy_version 317728 (0.00070) [2022-07-09 15:56:43,207][25689] Fps is (10 sec: 5865.9, 60 sec: 5719.8, 300 sec: 5720.5). Total num frames: 325353472. Throughput: 0: 6000.3. Samples: 325357394. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:43,207][25689] Avg episode reward: [(0, '-43.642')] [2022-07-09 15:56:44,930][26022] Updated weights on worker 0-0, policy_version 317738 (0.00091) [2022-07-09 15:56:46,776][26022] Updated weights on worker 0-0, policy_version 317748 (0.00084) [2022-07-09 15:56:48,230][25689] Fps is (10 sec: 5860.0, 60 sec: 5719.8, 300 sec: 5720.2). Total num frames: 325382144. Throughput: 0: 5132.0. Samples: 325374584. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:48,231][25689] Avg episode reward: [(0, '-44.659')] [2022-07-09 15:56:48,395][26022] Updated weights on worker 0-0, policy_version 317758 (0.00098) [2022-07-09 15:56:50,649][26022] Updated weights on worker 0-0, policy_version 317768 (0.00090) [2022-07-09 15:56:52,055][26022] Updated weights on worker 0-0, policy_version 317778 (0.00100) [2022-07-09 15:56:53,261][25689] Fps is (10 sec: 5601.3, 60 sec: 5736.1, 300 sec: 5716.5). Total num frames: 325409792. Throughput: 0: 5989.7. Samples: 325409372. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:53,262][25689] Avg episode reward: [(0, '-44.597')] [2022-07-09 15:56:54,029][26022] Updated weights on worker 0-0, policy_version 317788 (0.00099) [2022-07-09 15:56:55,505][26022] Updated weights on worker 0-0, policy_version 317798 (0.00083) [2022-07-09 15:56:57,283][26022] Updated weights on worker 0-0, policy_version 317808 (0.00092) [2022-07-09 15:56:58,285][25689] Fps is (10 sec: 5703.2, 60 sec: 5736.1, 300 sec: 5713.9). Total num frames: 325439488. Throughput: 0: 5981.8. Samples: 325444004. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:56:58,285][25689] Avg episode reward: [(0, '-44.957')] [2022-07-09 15:56:59,163][26022] Updated weights on worker 0-0, policy_version 317818 (0.00089) [2022-07-09 15:57:00,963][26022] Updated weights on worker 0-0, policy_version 317828 (0.00084) [2022-07-09 15:57:03,074][26022] Updated weights on worker 0-0, policy_version 317838 (0.00086) [2022-07-09 15:57:03,331][25689] Fps is (10 sec: 5796.5, 60 sec: 5746.3, 300 sec: 5721.0). Total num frames: 325468160. Throughput: 0: 5168.8. Samples: 325461412. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:03,331][25689] Avg episode reward: [(0, '-45.164')] [2022-07-09 15:57:05,048][26022] Updated weights on worker 0-0, policy_version 317848 (0.00086) [2022-07-09 15:57:06,552][26022] Updated weights on worker 0-0, policy_version 317858 (0.00089) [2022-07-09 15:57:08,332][25689] Fps is (10 sec: 5503.6, 60 sec: 5717.3, 300 sec: 5722.1). Total num frames: 325494784. Throughput: 0: 5955.1. Samples: 325494294. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:08,332][25689] Avg episode reward: [(0, '-45.607')] [2022-07-09 15:57:08,605][26022] Updated weights on worker 0-0, policy_version 317868 (0.00102) [2022-07-09 15:57:10,145][26022] Updated weights on worker 0-0, policy_version 317878 (0.00051) [2022-07-09 15:57:11,182][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:57:11,194][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000317883_325512192.pth [2022-07-09 15:57:11,194][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000315870_323450880.pth [2022-07-09 15:57:12,010][26022] Updated weights on worker 0-0, policy_version 317888 (0.00083) [2022-07-09 15:57:13,339][25689] Fps is (10 sec: 5627.6, 60 sec: 5736.4, 300 sec: 5723.4). Total num frames: 325524480. Throughput: 0: 5963.7. Samples: 325529106. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:13,339][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 15:57:13,703][26022] Updated weights on worker 0-0, policy_version 317898 (0.00092) [2022-07-09 15:57:15,563][26022] Updated weights on worker 0-0, policy_version 317908 (0.00090) [2022-07-09 15:57:17,366][26022] Updated weights on worker 0-0, policy_version 317918 (0.00086) [2022-07-09 15:57:18,349][25689] Fps is (10 sec: 5929.0, 60 sec: 5735.9, 300 sec: 5724.0). Total num frames: 325554176. Throughput: 0: 5094.1. Samples: 325546214. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:18,351][25689] Avg episode reward: [(0, '-44.462')] [2022-07-09 15:57:19,051][26022] Updated weights on worker 0-0, policy_version 317928 (0.00375) [2022-07-09 15:57:20,891][26022] Updated weights on worker 0-0, policy_version 317938 (0.00089) [2022-07-09 15:57:22,711][26022] Updated weights on worker 0-0, policy_version 317948 (0.00089) [2022-07-09 15:57:23,409][25689] Fps is (10 sec: 5694.5, 60 sec: 5717.7, 300 sec: 5717.6). Total num frames: 325581824. Throughput: 0: 5964.0. Samples: 325581156. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:23,411][25689] Avg episode reward: [(0, '-44.731')] [2022-07-09 15:57:24,223][26022] Updated weights on worker 0-0, policy_version 317958 (0.00084) [2022-07-09 15:57:26,264][26022] Updated weights on worker 0-0, policy_version 317968 (0.00094) [2022-07-09 15:57:27,887][26022] Updated weights on worker 0-0, policy_version 317978 (0.00094) [2022-07-09 15:57:28,417][25689] Fps is (10 sec: 5695.6, 60 sec: 5718.4, 300 sec: 5725.4). Total num frames: 325611520. Throughput: 0: 6044.4. Samples: 325615698. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:28,419][25689] Avg episode reward: [(0, '-44.490')] [2022-07-09 15:57:29,819][26022] Updated weights on worker 0-0, policy_version 317988 (0.00089) [2022-07-09 15:57:31,521][26022] Updated weights on worker 0-0, policy_version 317998 (0.00090) [2022-07-09 15:57:33,209][26022] Updated weights on worker 0-0, policy_version 318008 (0.00081) [2022-07-09 15:57:33,426][25689] Fps is (10 sec: 5826.5, 60 sec: 5735.6, 300 sec: 5725.8). Total num frames: 325640192. Throughput: 0: 5176.9. Samples: 325633098. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:33,428][25689] Avg episode reward: [(0, '-45.170')] [2022-07-09 15:57:35,015][26022] Updated weights on worker 0-0, policy_version 318018 (0.00088) [2022-07-09 15:57:36,931][26022] Updated weights on worker 0-0, policy_version 318028 (0.00090) [2022-07-09 15:57:38,434][25689] Fps is (10 sec: 5827.0, 60 sec: 5753.1, 300 sec: 5723.8). Total num frames: 325669888. Throughput: 0: 6045.8. Samples: 325667644. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:38,435][25689] Avg episode reward: [(0, '-45.312')] [2022-07-09 15:57:38,643][26022] Updated weights on worker 0-0, policy_version 318038 (0.00087) [2022-07-09 15:57:40,556][26022] Updated weights on worker 0-0, policy_version 318048 (0.00090) [2022-07-09 15:57:41,960][26022] Updated weights on worker 0-0, policy_version 318058 (0.00088) [2022-07-09 15:57:43,491][25689] Fps is (10 sec: 5697.7, 60 sec: 5707.4, 300 sec: 5719.9). Total num frames: 325697536. Throughput: 0: 6036.9. Samples: 325702390. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2022-07-09 15:57:43,491][25689] Avg episode reward: [(0, '-45.573')] [2022-07-09 15:57:44,160][26022] Updated weights on worker 0-0, policy_version 318068 (0.00095) [2022-07-09 15:57:45,709][26022] Updated weights on worker 0-0, policy_version 318078 (0.00089) [2022-07-09 15:57:47,513][26022] Updated weights on worker 0-0, policy_version 318088 (0.00082) [2022-07-09 15:57:48,493][25689] Fps is (10 sec: 5700.8, 60 sec: 5726.4, 300 sec: 5730.3). Total num frames: 325727232. Throughput: 0: 5170.3. Samples: 325719496. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:57:48,493][25689] Avg episode reward: [(0, '-45.851')] [2022-07-09 15:57:49,354][26022] Updated weights on worker 0-0, policy_version 318098 (0.00089) [2022-07-09 15:57:51,095][26022] Updated weights on worker 0-0, policy_version 318108 (0.00088) [2022-07-09 15:57:52,904][26022] Updated weights on worker 0-0, policy_version 318118 (0.00094) [2022-07-09 15:57:53,499][25689] Fps is (10 sec: 5831.8, 60 sec: 5745.7, 300 sec: 5727.4). Total num frames: 325755904. Throughput: 0: 6031.6. Samples: 325754170. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:57:53,500][25689] Avg episode reward: [(0, '-46.382')] [2022-07-09 15:57:54,601][26022] Updated weights on worker 0-0, policy_version 318128 (0.00091) [2022-07-09 15:57:56,362][26022] Updated weights on worker 0-0, policy_version 318138 (0.00094) [2022-07-09 15:57:58,120][26022] Updated weights on worker 0-0, policy_version 318148 (0.00088) [2022-07-09 15:57:58,505][25689] Fps is (10 sec: 5727.4, 60 sec: 5730.4, 300 sec: 5721.4). Total num frames: 325784576. Throughput: 0: 6041.0. Samples: 325788894. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:57:58,506][25689] Avg episode reward: [(0, '-46.051')] [2022-07-09 15:57:59,816][26022] Updated weights on worker 0-0, policy_version 318158 (0.00092) [2022-07-09 15:58:01,711][26022] Updated weights on worker 0-0, policy_version 318168 (0.00085) [2022-07-09 15:58:03,628][25689] Fps is (10 sec: 5560.6, 60 sec: 5706.2, 300 sec: 5723.3). Total num frames: 325812224. Throughput: 0: 5893.1. Samples: 325821060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:03,628][25689] Avg episode reward: [(0, '-46.320')] [2022-07-09 15:58:04,083][26022] Updated weights on worker 0-0, policy_version 318178 (0.00083) [2022-07-09 15:58:05,904][26022] Updated weights on worker 0-0, policy_version 318188 (0.00090) [2022-07-09 15:58:07,467][26022] Updated weights on worker 0-0, policy_version 318198 (0.00085) [2022-07-09 15:58:08,649][25689] Fps is (10 sec: 5451.2, 60 sec: 5721.2, 300 sec: 5719.9). Total num frames: 325839872. Throughput: 0: 5884.0. Samples: 325838094. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:08,649][25689] Avg episode reward: [(0, '-46.603')] [2022-07-09 15:58:09,335][26022] Updated weights on worker 0-0, policy_version 318208 (0.00094) [2022-07-09 15:58:11,055][26022] Updated weights on worker 0-0, policy_version 318218 (0.00583) [2022-07-09 15:58:12,976][26022] Updated weights on worker 0-0, policy_version 318228 (0.00088) [2022-07-09 15:58:13,716][25689] Fps is (10 sec: 5785.6, 60 sec: 5732.4, 300 sec: 5729.3). Total num frames: 325870592. Throughput: 0: 5869.3. Samples: 325872830. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:13,717][25689] Avg episode reward: [(0, '-46.596')] [2022-07-09 15:58:14,697][26022] Updated weights on worker 0-0, policy_version 318238 (0.00086) [2022-07-09 15:58:16,507][26022] Updated weights on worker 0-0, policy_version 318248 (0.00095) [2022-07-09 15:58:18,155][26022] Updated weights on worker 0-0, policy_version 318258 (0.00082) [2022-07-09 15:58:18,750][25689] Fps is (10 sec: 5778.5, 60 sec: 5696.3, 300 sec: 5719.3). Total num frames: 325898240. Throughput: 0: 5854.4. Samples: 325907414. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:18,750][25689] Avg episode reward: [(0, '-46.577')] [2022-07-09 15:58:19,939][26022] Updated weights on worker 0-0, policy_version 318268 (0.00088) [2022-07-09 15:58:21,786][26022] Updated weights on worker 0-0, policy_version 318278 (0.00091) [2022-07-09 15:58:23,527][26022] Updated weights on worker 0-0, policy_version 318288 (0.00083) [2022-07-09 15:58:23,856][25689] Fps is (10 sec: 5655.6, 60 sec: 5725.9, 300 sec: 5720.9). Total num frames: 325927936. Throughput: 0: 5130.9. Samples: 325924848. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:23,856][25689] Avg episode reward: [(0, '-46.177')] [2022-07-09 15:58:25,083][26022] Updated weights on worker 0-0, policy_version 318298 (0.00084) [2022-07-09 15:58:27,124][26022] Updated weights on worker 0-0, policy_version 318308 (0.00086) [2022-07-09 15:58:28,638][26022] Updated weights on worker 0-0, policy_version 318318 (0.00087) [2022-07-09 15:58:28,916][25689] Fps is (10 sec: 5943.0, 60 sec: 5737.9, 300 sec: 5726.9). Total num frames: 325958656. Throughput: 0: 6004.4. Samples: 325959784. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:28,916][25689] Avg episode reward: [(0, '-46.201')] [2022-07-09 15:58:30,635][26022] Updated weights on worker 0-0, policy_version 318328 (0.00089) [2022-07-09 15:58:32,283][26022] Updated weights on worker 0-0, policy_version 318338 (0.00083) [2022-07-09 15:58:33,926][25689] Fps is (10 sec: 5796.0, 60 sec: 5720.9, 300 sec: 5727.0). Total num frames: 325986304. Throughput: 0: 6013.0. Samples: 325994352. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:33,927][25689] Avg episode reward: [(0, '-45.165')] [2022-07-09 15:58:34,134][26022] Updated weights on worker 0-0, policy_version 318348 (0.00088) [2022-07-09 15:58:36,126][26022] Updated weights on worker 0-0, policy_version 318358 (0.00085) [2022-07-09 15:58:37,732][26022] Updated weights on worker 0-0, policy_version 318368 (0.00088) [2022-07-09 15:58:38,929][25689] Fps is (10 sec: 5624.6, 60 sec: 5704.4, 300 sec: 5721.1). Total num frames: 326014976. Throughput: 0: 5162.1. Samples: 326011580. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:38,929][25689] Avg episode reward: [(0, '-45.632')] [2022-07-09 15:58:39,475][26022] Updated weights on worker 0-0, policy_version 318378 (0.00079) [2022-07-09 15:58:41,184][26022] Updated weights on worker 0-0, policy_version 318388 (0.00101) [2022-07-09 15:58:43,055][26022] Updated weights on worker 0-0, policy_version 318398 (0.00089) [2022-07-09 15:58:44,011][25689] Fps is (10 sec: 5787.9, 60 sec: 5735.9, 300 sec: 5726.6). Total num frames: 326044672. Throughput: 0: 6020.6. Samples: 326046192. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:44,011][25689] Avg episode reward: [(0, '-45.564')] [2022-07-09 15:58:44,857][26022] Updated weights on worker 0-0, policy_version 318408 (0.00089) [2022-07-09 15:58:46,607][26022] Updated weights on worker 0-0, policy_version 318418 (0.00098) [2022-07-09 15:58:48,492][26022] Updated weights on worker 0-0, policy_version 318428 (0.00088) [2022-07-09 15:58:49,023][25689] Fps is (10 sec: 5884.0, 60 sec: 5735.0, 300 sec: 5730.2). Total num frames: 326074368. Throughput: 0: 6015.7. Samples: 326080742. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:49,023][25689] Avg episode reward: [(0, '-45.865')] [2022-07-09 15:58:50,056][26022] Updated weights on worker 0-0, policy_version 318438 (0.00088) [2022-07-09 15:58:52,102][26022] Updated weights on worker 0-0, policy_version 318448 (0.00084) [2022-07-09 15:58:53,776][26022] Updated weights on worker 0-0, policy_version 318458 (0.00086) [2022-07-09 15:58:54,046][25689] Fps is (10 sec: 5714.2, 60 sec: 5716.5, 300 sec: 5726.4). Total num frames: 326102016. Throughput: 0: 5134.6. Samples: 326097658. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:54,046][25689] Avg episode reward: [(0, '-45.945')] [2022-07-09 15:58:55,639][26022] Updated weights on worker 0-0, policy_version 318468 (0.00081) [2022-07-09 15:58:57,266][26022] Updated weights on worker 0-0, policy_version 318478 (0.00081) [2022-07-09 15:58:59,086][25689] Fps is (10 sec: 5494.9, 60 sec: 5696.3, 300 sec: 5719.7). Total num frames: 326129664. Throughput: 0: 5997.5. Samples: 326132472. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:58:59,086][25689] Avg episode reward: [(0, '-46.407')] [2022-07-09 15:58:59,196][26022] Updated weights on worker 0-0, policy_version 318488 (0.00088) [2022-07-09 15:59:00,881][26022] Updated weights on worker 0-0, policy_version 318498 (0.00086) [2022-07-09 15:59:02,948][26022] Updated weights on worker 0-0, policy_version 318508 (0.00085) [2022-07-09 15:59:04,210][25689] Fps is (10 sec: 5541.1, 60 sec: 5713.1, 300 sec: 5728.7). Total num frames: 326158336. Throughput: 0: 5891.9. Samples: 326165204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:04,211][25689] Avg episode reward: [(0, '-47.077')] [2022-07-09 15:59:04,804][26022] Updated weights on worker 0-0, policy_version 318518 (0.00092) [2022-07-09 15:59:06,519][26022] Updated weights on worker 0-0, policy_version 318528 (0.00081) [2022-07-09 15:59:08,318][26022] Updated weights on worker 0-0, policy_version 318538 (0.00086) [2022-07-09 15:59:09,234][25689] Fps is (10 sec: 5751.4, 60 sec: 5746.6, 300 sec: 5731.9). Total num frames: 326188032. Throughput: 0: 5038.0. Samples: 326182568. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:09,235][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 15:59:10,038][26022] Updated weights on worker 0-0, policy_version 318548 (0.00085) [2022-07-09 15:59:11,228][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 15:59:11,237][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000318554_326199296.pth [2022-07-09 15:59:11,237][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000316539_324135936.pth [2022-07-09 15:59:11,801][26022] Updated weights on worker 0-0, policy_version 318558 (0.00079) [2022-07-09 15:59:13,654][26022] Updated weights on worker 0-0, policy_version 318568 (0.00087) [2022-07-09 15:59:14,258][25689] Fps is (10 sec: 5809.1, 60 sec: 5717.0, 300 sec: 5728.4). Total num frames: 326216704. Throughput: 0: 5927.4. Samples: 326217462. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:14,258][25689] Avg episode reward: [(0, '-47.395')] [2022-07-09 15:59:15,439][26022] Updated weights on worker 0-0, policy_version 318578 (0.00093) [2022-07-09 15:59:17,269][26022] Updated weights on worker 0-0, policy_version 318588 (0.00084) [2022-07-09 15:59:18,999][26022] Updated weights on worker 0-0, policy_version 318598 (0.00085) [2022-07-09 15:59:19,324][25689] Fps is (10 sec: 5683.2, 60 sec: 5730.7, 300 sec: 5733.1). Total num frames: 326245376. Throughput: 0: 5903.0. Samples: 326251940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:19,325][25689] Avg episode reward: [(0, '-47.061')] [2022-07-09 15:59:20,810][26022] Updated weights on worker 0-0, policy_version 318608 (0.00088) [2022-07-09 15:59:22,672][26022] Updated weights on worker 0-0, policy_version 318618 (0.00086) [2022-07-09 15:59:24,319][26022] Updated weights on worker 0-0, policy_version 318628 (0.00087) [2022-07-09 15:59:24,415][25689] Fps is (10 sec: 5847.4, 60 sec: 5749.1, 300 sec: 5738.7). Total num frames: 326276096. Throughput: 0: 5139.8. Samples: 326269052. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:24,415][25689] Avg episode reward: [(0, '-46.984')] [2022-07-09 15:59:26,017][26022] Updated weights on worker 0-0, policy_version 318638 (0.00089) [2022-07-09 15:59:28,033][26022] Updated weights on worker 0-0, policy_version 318648 (0.00094) [2022-07-09 15:59:29,468][25689] Fps is (10 sec: 5855.0, 60 sec: 5715.9, 300 sec: 5731.5). Total num frames: 326304768. Throughput: 0: 5987.5. Samples: 326303720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:29,469][25689] Avg episode reward: [(0, '-47.609')] [2022-07-09 15:59:29,515][26022] Updated weights on worker 0-0, policy_version 318658 (0.00089) [2022-07-09 15:59:31,411][26022] Updated weights on worker 0-0, policy_version 318668 (0.00081) [2022-07-09 15:59:33,164][26022] Updated weights on worker 0-0, policy_version 318678 (0.00082) [2022-07-09 15:59:34,511][25689] Fps is (10 sec: 5578.6, 60 sec: 5712.9, 300 sec: 5727.8). Total num frames: 326332416. Throughput: 0: 5979.5. Samples: 326338564. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:34,511][25689] Avg episode reward: [(0, '-48.039')] [2022-07-09 15:59:34,959][26022] Updated weights on worker 0-0, policy_version 318688 (0.00090) [2022-07-09 15:59:36,727][26022] Updated weights on worker 0-0, policy_version 318698 (0.00094) [2022-07-09 15:59:38,660][26022] Updated weights on worker 0-0, policy_version 318708 (0.00090) [2022-07-09 15:59:39,527][25689] Fps is (10 sec: 5802.9, 60 sec: 5745.4, 300 sec: 5733.3). Total num frames: 326363136. Throughput: 0: 5140.1. Samples: 326355782. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:39,527][25689] Avg episode reward: [(0, '-47.443')] [2022-07-09 15:59:40,439][26022] Updated weights on worker 0-0, policy_version 318718 (0.00084) [2022-07-09 15:59:42,170][26022] Updated weights on worker 0-0, policy_version 318728 (0.00085) [2022-07-09 15:59:43,879][26022] Updated weights on worker 0-0, policy_version 318738 (0.00086) [2022-07-09 15:59:44,617][25689] Fps is (10 sec: 5674.1, 60 sec: 5693.9, 300 sec: 5718.1). Total num frames: 326389760. Throughput: 0: 5998.0. Samples: 326390224. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:44,617][25689] Avg episode reward: [(0, '-46.097')] [2022-07-09 15:59:45,654][26022] Updated weights on worker 0-0, policy_version 318748 (0.00093) [2022-07-09 15:59:47,627][26022] Updated weights on worker 0-0, policy_version 318758 (0.00093) [2022-07-09 15:59:49,150][26022] Updated weights on worker 0-0, policy_version 318768 (0.00089) [2022-07-09 15:59:49,632][25689] Fps is (10 sec: 5674.7, 60 sec: 5710.6, 300 sec: 5732.6). Total num frames: 326420480. Throughput: 0: 5997.7. Samples: 326424654. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:49,632][25689] Avg episode reward: [(0, '-46.221')] [2022-07-09 15:59:51,031][26022] Updated weights on worker 0-0, policy_version 318778 (0.00065) [2022-07-09 15:59:52,712][26022] Updated weights on worker 0-0, policy_version 318788 (0.00092) [2022-07-09 15:59:54,521][26022] Updated weights on worker 0-0, policy_version 318798 (0.00085) [2022-07-09 15:59:54,637][25689] Fps is (10 sec: 6029.2, 60 sec: 5746.0, 300 sec: 5727.4). Total num frames: 326450176. Throughput: 0: 5157.8. Samples: 326442374. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:54,638][25689] Avg episode reward: [(0, '-45.087')] [2022-07-09 15:59:56,259][26022] Updated weights on worker 0-0, policy_version 318808 (0.00087) [2022-07-09 15:59:57,847][26022] Updated weights on worker 0-0, policy_version 318818 (0.00086) [2022-07-09 15:59:59,639][25689] Fps is (10 sec: 5730.2, 60 sec: 5749.6, 300 sec: 5733.2). Total num frames: 326477824. Throughput: 0: 6047.7. Samples: 326477416. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 15:59:59,640][25689] Avg episode reward: [(0, '-44.327')] [2022-07-09 15:59:59,758][26022] Updated weights on worker 0-0, policy_version 318828 (0.00087) [2022-07-09 16:00:01,376][26022] Updated weights on worker 0-0, policy_version 318838 (0.00083) [2022-07-09 16:00:03,761][26022] Updated weights on worker 0-0, policy_version 318848 (0.00084) [2022-07-09 16:00:04,757][25689] Fps is (10 sec: 5565.4, 60 sec: 5750.2, 300 sec: 5725.1). Total num frames: 326506496. Throughput: 0: 5946.9. Samples: 326509996. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 16:00:04,758][25689] Avg episode reward: [(0, '-44.443')] [2022-07-09 16:00:05,424][26022] Updated weights on worker 0-0, policy_version 318858 (0.00087) [2022-07-09 16:00:07,318][26022] Updated weights on worker 0-0, policy_version 318868 (0.00091) [2022-07-09 16:00:09,033][26022] Updated weights on worker 0-0, policy_version 318878 (0.00084) [2022-07-09 16:00:09,764][25689] Fps is (10 sec: 5461.5, 60 sec: 5701.1, 300 sec: 5721.8). Total num frames: 326533120. Throughput: 0: 5905.4. Samples: 326543542. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 16:00:09,764][25689] Avg episode reward: [(0, '-44.911')] [2022-07-09 16:00:10,944][26022] Updated weights on worker 0-0, policy_version 318888 (0.00086) [2022-07-09 16:00:12,903][26022] Updated weights on worker 0-0, policy_version 318898 (0.00091) [2022-07-09 16:00:14,648][26022] Updated weights on worker 0-0, policy_version 318908 (0.00091) [2022-07-09 16:00:14,792][25689] Fps is (10 sec: 5510.5, 60 sec: 5700.7, 300 sec: 5722.7). Total num frames: 326561792. Throughput: 0: 5856.9. Samples: 326560416. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:14,795][25689] Avg episode reward: [(0, '-45.539')] [2022-07-09 16:00:16,340][26022] Updated weights on worker 0-0, policy_version 318918 (0.00094) [2022-07-09 16:00:18,403][26022] Updated weights on worker 0-0, policy_version 318928 (0.00090) [2022-07-09 16:00:19,843][25689] Fps is (10 sec: 5791.0, 60 sec: 5719.1, 300 sec: 5727.7). Total num frames: 326591488. Throughput: 0: 5768.7. Samples: 326593966. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:19,844][25689] Avg episode reward: [(0, '-45.420')] [2022-07-09 16:00:20,097][26022] Updated weights on worker 0-0, policy_version 318938 (0.00092) [2022-07-09 16:00:22,001][26022] Updated weights on worker 0-0, policy_version 318948 (0.00089) [2022-07-09 16:00:23,665][26022] Updated weights on worker 0-0, policy_version 318958 (0.00087) [2022-07-09 16:00:24,901][25689] Fps is (10 sec: 5672.2, 60 sec: 5671.3, 300 sec: 5713.2). Total num frames: 326619136. Throughput: 0: 5872.9. Samples: 326628302. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:24,903][25689] Avg episode reward: [(0, '-46.315')] [2022-07-09 16:00:25,489][26022] Updated weights on worker 0-0, policy_version 318968 (0.00079) [2022-07-09 16:00:27,422][26022] Updated weights on worker 0-0, policy_version 318978 (0.00089) [2022-07-09 16:00:29,039][26022] Updated weights on worker 0-0, policy_version 318988 (0.00086) [2022-07-09 16:00:29,943][25689] Fps is (10 sec: 5677.5, 60 sec: 5689.4, 300 sec: 5720.1). Total num frames: 326648832. Throughput: 0: 5053.6. Samples: 326645520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:29,945][25689] Avg episode reward: [(0, '-46.983')] [2022-07-09 16:00:30,856][26022] Updated weights on worker 0-0, policy_version 318998 (0.00092) [2022-07-09 16:00:32,669][26022] Updated weights on worker 0-0, policy_version 319008 (0.00081) [2022-07-09 16:00:34,379][26022] Updated weights on worker 0-0, policy_version 319018 (0.00083) [2022-07-09 16:00:34,951][25689] Fps is (10 sec: 5808.0, 60 sec: 5709.6, 300 sec: 5716.9). Total num frames: 326677504. Throughput: 0: 5937.0. Samples: 326680102. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:34,953][25689] Avg episode reward: [(0, '-47.985')] [2022-07-09 16:00:36,370][26022] Updated weights on worker 0-0, policy_version 319028 (0.00101) [2022-07-09 16:00:37,739][26022] Updated weights on worker 0-0, policy_version 319038 (0.00085) [2022-07-09 16:00:39,821][26022] Updated weights on worker 0-0, policy_version 319048 (0.00091) [2022-07-09 16:00:39,966][25689] Fps is (10 sec: 5721.4, 60 sec: 5675.8, 300 sec: 5714.7). Total num frames: 326706176. Throughput: 0: 6003.7. Samples: 326714778. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:39,967][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 16:00:41,454][26022] Updated weights on worker 0-0, policy_version 319058 (0.00086) [2022-07-09 16:00:43,401][26022] Updated weights on worker 0-0, policy_version 319068 (0.00086) [2022-07-09 16:00:45,056][25689] Fps is (10 sec: 5573.7, 60 sec: 5692.8, 300 sec: 5710.0). Total num frames: 326733824. Throughput: 0: 5141.6. Samples: 326731928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:45,056][25689] Avg episode reward: [(0, '-47.470')] [2022-07-09 16:00:45,122][26022] Updated weights on worker 0-0, policy_version 319078 (0.00092) [2022-07-09 16:00:46,896][26022] Updated weights on worker 0-0, policy_version 319088 (0.00091) [2022-07-09 16:00:48,646][26022] Updated weights on worker 0-0, policy_version 319098 (0.00089) [2022-07-09 16:00:50,100][25689] Fps is (10 sec: 5658.5, 60 sec: 5673.1, 300 sec: 5720.0). Total num frames: 326763520. Throughput: 0: 6005.9. Samples: 326766580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:50,101][25689] Avg episode reward: [(0, '-47.952')] [2022-07-09 16:00:50,445][26022] Updated weights on worker 0-0, policy_version 319108 (0.00088) [2022-07-09 16:00:52,336][26022] Updated weights on worker 0-0, policy_version 319118 (0.00079) [2022-07-09 16:00:54,101][26022] Updated weights on worker 0-0, policy_version 319128 (0.00088) [2022-07-09 16:00:55,121][25689] Fps is (10 sec: 5900.5, 60 sec: 5671.6, 300 sec: 5720.0). Total num frames: 326793216. Throughput: 0: 6016.2. Samples: 326801450. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:00:55,122][25689] Avg episode reward: [(0, '-47.872')] [2022-07-09 16:00:55,733][26022] Updated weights on worker 0-0, policy_version 319138 (0.00087) [2022-07-09 16:00:57,619][26022] Updated weights on worker 0-0, policy_version 319148 (0.00084) [2022-07-09 16:00:59,267][26022] Updated weights on worker 0-0, policy_version 319158 (0.00083) [2022-07-09 16:01:00,126][25689] Fps is (10 sec: 5821.4, 60 sec: 5688.2, 300 sec: 5722.9). Total num frames: 326821888. Throughput: 0: 5167.0. Samples: 326818948. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:00,128][25689] Avg episode reward: [(0, '-47.456')] [2022-07-09 16:01:01,071][26022] Updated weights on worker 0-0, policy_version 319168 (0.00084) [2022-07-09 16:01:03,286][26022] Updated weights on worker 0-0, policy_version 319178 (0.00087) [2022-07-09 16:01:04,923][26022] Updated weights on worker 0-0, policy_version 319188 (0.00100) [2022-07-09 16:01:05,212][25689] Fps is (10 sec: 5581.1, 60 sec: 5674.3, 300 sec: 5718.8). Total num frames: 326849536. Throughput: 0: 5932.8. Samples: 326851514. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:05,213][25689] Avg episode reward: [(0, '-47.295')] [2022-07-09 16:01:06,746][26022] Updated weights on worker 0-0, policy_version 319198 (0.00082) [2022-07-09 16:01:08,634][26022] Updated weights on worker 0-0, policy_version 319208 (0.00085) [2022-07-09 16:01:10,215][25689] Fps is (10 sec: 5582.3, 60 sec: 5708.5, 300 sec: 5719.3). Total num frames: 326878208. Throughput: 0: 5950.1. Samples: 326886268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:10,223][25689] Avg episode reward: [(0, '-48.219')] [2022-07-09 16:01:10,315][26022] Updated weights on worker 0-0, policy_version 319218 (0.00089) [2022-07-09 16:01:11,479][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:01:11,493][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000319224_326885376.pth [2022-07-09 16:01:11,493][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000317211_324824064.pth [2022-07-09 16:01:11,976][26022] Updated weights on worker 0-0, policy_version 319228 (0.00087) [2022-07-09 16:01:13,788][26022] Updated weights on worker 0-0, policy_version 319238 (0.00091) [2022-07-09 16:01:15,235][25689] Fps is (10 sec: 5823.9, 60 sec: 5726.3, 300 sec: 5719.0). Total num frames: 326907904. Throughput: 0: 5086.5. Samples: 326903756. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:15,235][25689] Avg episode reward: [(0, '-48.071')] [2022-07-09 16:01:15,480][26022] Updated weights on worker 0-0, policy_version 319248 (0.00088) [2022-07-09 16:01:17,370][26022] Updated weights on worker 0-0, policy_version 319258 (0.00088) [2022-07-09 16:01:18,938][26022] Updated weights on worker 0-0, policy_version 319268 (0.00086) [2022-07-09 16:01:20,262][25689] Fps is (10 sec: 5707.8, 60 sec: 5694.6, 300 sec: 5715.9). Total num frames: 326935552. Throughput: 0: 5944.3. Samples: 326938640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:20,263][25689] Avg episode reward: [(0, '-47.545')] [2022-07-09 16:01:20,909][26022] Updated weights on worker 0-0, policy_version 319278 (0.00081) [2022-07-09 16:01:22,715][26022] Updated weights on worker 0-0, policy_version 319288 (0.00082) [2022-07-09 16:01:24,452][26022] Updated weights on worker 0-0, policy_version 319298 (0.00089) [2022-07-09 16:01:25,323][25689] Fps is (10 sec: 5785.9, 60 sec: 5745.3, 300 sec: 5718.5). Total num frames: 326966272. Throughput: 0: 6042.0. Samples: 326973020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:25,323][25689] Avg episode reward: [(0, '-47.051')] [2022-07-09 16:01:26,303][26022] Updated weights on worker 0-0, policy_version 319308 (0.00083) [2022-07-09 16:01:28,003][26022] Updated weights on worker 0-0, policy_version 319318 (0.00102) [2022-07-09 16:01:29,846][26022] Updated weights on worker 0-0, policy_version 319328 (0.00111) [2022-07-09 16:01:30,327][25689] Fps is (10 sec: 5799.4, 60 sec: 5715.0, 300 sec: 5718.7). Total num frames: 326993920. Throughput: 0: 5165.4. Samples: 326990150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:30,327][25689] Avg episode reward: [(0, '-46.417')] [2022-07-09 16:01:31,670][26022] Updated weights on worker 0-0, policy_version 319338 (0.00093) [2022-07-09 16:01:33,184][26022] Updated weights on worker 0-0, policy_version 319348 (0.00088) [2022-07-09 16:01:35,181][26022] Updated weights on worker 0-0, policy_version 319358 (0.00510) [2022-07-09 16:01:35,337][25689] Fps is (10 sec: 5726.2, 60 sec: 5731.7, 300 sec: 5722.1). Total num frames: 327023616. Throughput: 0: 6030.0. Samples: 327024974. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:35,338][25689] Avg episode reward: [(0, '-45.460')] [2022-07-09 16:01:36,737][26022] Updated weights on worker 0-0, policy_version 319368 (0.00094) [2022-07-09 16:01:38,597][26022] Updated weights on worker 0-0, policy_version 319378 (0.00086) [2022-07-09 16:01:40,255][26022] Updated weights on worker 0-0, policy_version 319388 (0.00090) [2022-07-09 16:01:40,346][25689] Fps is (10 sec: 5927.7, 60 sec: 5749.2, 300 sec: 5720.7). Total num frames: 327053312. Throughput: 0: 6032.3. Samples: 327059792. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:40,346][25689] Avg episode reward: [(0, '-45.437')] [2022-07-09 16:01:42,215][26022] Updated weights on worker 0-0, policy_version 319398 (0.00086) [2022-07-09 16:01:43,998][26022] Updated weights on worker 0-0, policy_version 319408 (0.00086) [2022-07-09 16:01:45,412][25689] Fps is (10 sec: 5793.4, 60 sec: 5768.4, 300 sec: 5719.9). Total num frames: 327081984. Throughput: 0: 5181.7. Samples: 327077116. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:45,412][25689] Avg episode reward: [(0, '-45.095')] [2022-07-09 16:01:45,677][26022] Updated weights on worker 0-0, policy_version 319418 (0.00091) [2022-07-09 16:01:47,396][26022] Updated weights on worker 0-0, policy_version 319428 (0.00085) [2022-07-09 16:01:49,144][26022] Updated weights on worker 0-0, policy_version 319438 (0.00087) [2022-07-09 16:01:50,427][25689] Fps is (10 sec: 5688.2, 60 sec: 5754.2, 300 sec: 5723.6). Total num frames: 327110656. Throughput: 0: 6053.6. Samples: 327111832. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:50,428][25689] Avg episode reward: [(0, '-45.814')] [2022-07-09 16:01:51,229][26022] Updated weights on worker 0-0, policy_version 319448 (0.00086) [2022-07-09 16:01:52,810][26022] Updated weights on worker 0-0, policy_version 319458 (0.00096) [2022-07-09 16:01:54,766][26022] Updated weights on worker 0-0, policy_version 319468 (0.00089) [2022-07-09 16:01:55,439][25689] Fps is (10 sec: 5719.1, 60 sec: 5738.2, 300 sec: 5720.4). Total num frames: 327139328. Throughput: 0: 6063.9. Samples: 327146868. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:01:55,439][25689] Avg episode reward: [(0, '-45.780')] [2022-07-09 16:01:56,458][26022] Updated weights on worker 0-0, policy_version 319478 (0.00081) [2022-07-09 16:01:58,089][26022] Updated weights on worker 0-0, policy_version 319488 (0.00091) [2022-07-09 16:01:59,937][26022] Updated weights on worker 0-0, policy_version 319498 (0.00081) [2022-07-09 16:02:00,444][25689] Fps is (10 sec: 5826.8, 60 sec: 5755.1, 300 sec: 5724.6). Total num frames: 327169024. Throughput: 0: 5195.1. Samples: 327164206. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:00,445][25689] Avg episode reward: [(0, '-45.945')] [2022-07-09 16:02:01,651][26022] Updated weights on worker 0-0, policy_version 319508 (0.00092) [2022-07-09 16:02:03,745][26022] Updated weights on worker 0-0, policy_version 319518 (0.00092) [2022-07-09 16:02:05,534][25689] Fps is (10 sec: 5579.0, 60 sec: 5737.8, 300 sec: 5722.9). Total num frames: 327195648. Throughput: 0: 5930.9. Samples: 327196456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:05,534][25689] Avg episode reward: [(0, '-46.601')] [2022-07-09 16:02:05,771][26022] Updated weights on worker 0-0, policy_version 319528 (0.00085) [2022-07-09 16:02:07,329][26022] Updated weights on worker 0-0, policy_version 319538 (0.01104) [2022-07-09 16:02:09,395][26022] Updated weights on worker 0-0, policy_version 319548 (0.00089) [2022-07-09 16:02:10,611][25689] Fps is (10 sec: 5539.8, 60 sec: 5747.7, 300 sec: 5721.6). Total num frames: 327225344. Throughput: 0: 5919.3. Samples: 327231306. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:10,611][25689] Avg episode reward: [(0, '-46.516')] [2022-07-09 16:02:10,926][26022] Updated weights on worker 0-0, policy_version 319558 (0.00096) [2022-07-09 16:02:12,695][26022] Updated weights on worker 0-0, policy_version 319568 (0.00082) [2022-07-09 16:02:14,378][26022] Updated weights on worker 0-0, policy_version 319578 (0.00087) [2022-07-09 16:02:15,679][25689] Fps is (10 sec: 5854.0, 60 sec: 5743.0, 300 sec: 5720.5). Total num frames: 327255040. Throughput: 0: 5039.5. Samples: 327248870. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:15,680][25689] Avg episode reward: [(0, '-47.136')] [2022-07-09 16:02:16,219][26022] Updated weights on worker 0-0, policy_version 319588 (0.00085) [2022-07-09 16:02:18,059][26022] Updated weights on worker 0-0, policy_version 319598 (0.00095) [2022-07-09 16:02:19,860][26022] Updated weights on worker 0-0, policy_version 319608 (0.00086) [2022-07-09 16:02:20,715][25689] Fps is (10 sec: 5675.4, 60 sec: 5742.3, 300 sec: 5720.9). Total num frames: 327282688. Throughput: 0: 5892.8. Samples: 327283656. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:20,719][25689] Avg episode reward: [(0, '-46.941')] [2022-07-09 16:02:21,536][26022] Updated weights on worker 0-0, policy_version 319618 (0.00088) [2022-07-09 16:02:23,262][26022] Updated weights on worker 0-0, policy_version 319628 (0.00562) [2022-07-09 16:02:25,027][26022] Updated weights on worker 0-0, policy_version 319638 (0.00083) [2022-07-09 16:02:25,816][25689] Fps is (10 sec: 5758.4, 60 sec: 5738.5, 300 sec: 5722.6). Total num frames: 327313408. Throughput: 0: 6005.0. Samples: 327318248. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:25,816][25689] Avg episode reward: [(0, '-47.164')] [2022-07-09 16:02:27,004][26022] Updated weights on worker 0-0, policy_version 319648 (0.00090) [2022-07-09 16:02:28,646][26022] Updated weights on worker 0-0, policy_version 319658 (0.00085) [2022-07-09 16:02:30,387][26022] Updated weights on worker 0-0, policy_version 319668 (0.00090) [2022-07-09 16:02:30,827][25689] Fps is (10 sec: 5873.5, 60 sec: 5754.7, 300 sec: 5722.5). Total num frames: 327342080. Throughput: 0: 5159.4. Samples: 327335608. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:30,827][25689] Avg episode reward: [(0, '-46.952')] [2022-07-09 16:02:32,103][26022] Updated weights on worker 0-0, policy_version 319678 (0.00091) [2022-07-09 16:02:33,943][26022] Updated weights on worker 0-0, policy_version 319688 (0.00087) [2022-07-09 16:02:35,794][26022] Updated weights on worker 0-0, policy_version 319698 (0.00092) [2022-07-09 16:02:35,850][25689] Fps is (10 sec: 5816.9, 60 sec: 5753.5, 300 sec: 5722.3). Total num frames: 327371776. Throughput: 0: 6009.8. Samples: 327370090. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:35,850][25689] Avg episode reward: [(0, '-46.271')] [2022-07-09 16:02:37,361][26022] Updated weights on worker 0-0, policy_version 319708 (0.00081) [2022-07-09 16:02:39,392][26022] Updated weights on worker 0-0, policy_version 319718 (0.00089) [2022-07-09 16:02:40,917][25689] Fps is (10 sec: 5784.7, 60 sec: 5731.1, 300 sec: 5725.5). Total num frames: 327400448. Throughput: 0: 5999.1. Samples: 327404850. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 16:02:40,917][25689] Avg episode reward: [(0, '-45.913')] [2022-07-09 16:02:41,005][26022] Updated weights on worker 0-0, policy_version 319728 (0.00086) [2022-07-09 16:02:42,965][26022] Updated weights on worker 0-0, policy_version 319738 (0.00086) [2022-07-09 16:02:44,752][26022] Updated weights on worker 0-0, policy_version 319748 (0.00087) [2022-07-09 16:02:46,051][25689] Fps is (10 sec: 5521.0, 60 sec: 5707.8, 300 sec: 5716.1). Total num frames: 327428096. Throughput: 0: 5121.4. Samples: 327421880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:02:46,051][25689] Avg episode reward: [(0, '-45.110')] [2022-07-09 16:02:46,526][26022] Updated weights on worker 0-0, policy_version 319758 (0.00093) [2022-07-09 16:02:48,434][26022] Updated weights on worker 0-0, policy_version 319768 (0.00094) [2022-07-09 16:02:50,059][26022] Updated weights on worker 0-0, policy_version 319778 (0.00096) [2022-07-09 16:02:51,087][25689] Fps is (10 sec: 5638.5, 60 sec: 5722.7, 300 sec: 5718.9). Total num frames: 327457792. Throughput: 0: 5956.1. Samples: 327456280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:02:51,087][25689] Avg episode reward: [(0, '-45.716')] [2022-07-09 16:02:52,071][26022] Updated weights on worker 0-0, policy_version 319788 (0.00088) [2022-07-09 16:02:53,540][26022] Updated weights on worker 0-0, policy_version 319798 (0.00093) [2022-07-09 16:02:55,541][26022] Updated weights on worker 0-0, policy_version 319808 (0.00092) [2022-07-09 16:02:56,099][25689] Fps is (10 sec: 5706.9, 60 sec: 5705.8, 300 sec: 5715.4). Total num frames: 327485440. Throughput: 0: 5954.7. Samples: 327490668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:02:56,100][25689] Avg episode reward: [(0, '-46.291')] [2022-07-09 16:02:57,185][26022] Updated weights on worker 0-0, policy_version 319818 (0.00087) [2022-07-09 16:02:59,104][26022] Updated weights on worker 0-0, policy_version 319828 (0.00086) [2022-07-09 16:03:00,727][26022] Updated weights on worker 0-0, policy_version 319838 (0.00088) [2022-07-09 16:03:01,108][25689] Fps is (10 sec: 5722.3, 60 sec: 5705.4, 300 sec: 5724.5). Total num frames: 327515136. Throughput: 0: 5120.1. Samples: 327508232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:01,109][25689] Avg episode reward: [(0, '-46.343')] [2022-07-09 16:03:03,107][26022] Updated weights on worker 0-0, policy_version 319848 (0.00085) [2022-07-09 16:03:04,698][26022] Updated weights on worker 0-0, policy_version 319858 (0.00086) [2022-07-09 16:03:06,208][25689] Fps is (10 sec: 5571.2, 60 sec: 5704.4, 300 sec: 5719.5). Total num frames: 327541760. Throughput: 0: 5877.5. Samples: 327540354. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:06,209][25689] Avg episode reward: [(0, '-45.972')] [2022-07-09 16:03:06,535][26022] Updated weights on worker 0-0, policy_version 319868 (0.00088) [2022-07-09 16:03:08,233][26022] Updated weights on worker 0-0, policy_version 319878 (0.00087) [2022-07-09 16:03:09,908][26022] Updated weights on worker 0-0, policy_version 319888 (0.00081) [2022-07-09 16:03:11,219][25689] Fps is (10 sec: 5671.3, 60 sec: 5727.5, 300 sec: 5720.6). Total num frames: 327572480. Throughput: 0: 5909.7. Samples: 327575256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:11,220][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 16:03:11,721][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:03:11,746][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000319897_327574528.pth [2022-07-09 16:03:11,747][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000317883_325512192.pth [2022-07-09 16:03:11,863][26022] Updated weights on worker 0-0, policy_version 319898 (0.00117) [2022-07-09 16:03:13,587][26022] Updated weights on worker 0-0, policy_version 319908 (0.00085) [2022-07-09 16:03:15,284][26022] Updated weights on worker 0-0, policy_version 319918 (0.00086) [2022-07-09 16:03:16,287][25689] Fps is (10 sec: 5892.4, 60 sec: 5710.7, 300 sec: 5723.4). Total num frames: 327601152. Throughput: 0: 5909.9. Samples: 327609980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:16,288][25689] Avg episode reward: [(0, '-46.383')] [2022-07-09 16:03:17,075][26022] Updated weights on worker 0-0, policy_version 319928 (0.00087) [2022-07-09 16:03:18,855][26022] Updated weights on worker 0-0, policy_version 319938 (0.00097) [2022-07-09 16:03:20,674][26022] Updated weights on worker 0-0, policy_version 319948 (0.00084) [2022-07-09 16:03:21,304][25689] Fps is (10 sec: 5788.1, 60 sec: 5746.3, 300 sec: 5725.1). Total num frames: 327630848. Throughput: 0: 5904.3. Samples: 327627472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:21,304][25689] Avg episode reward: [(0, '-46.802')] [2022-07-09 16:03:22,500][26022] Updated weights on worker 0-0, policy_version 319958 (0.00087) [2022-07-09 16:03:24,105][26022] Updated weights on worker 0-0, policy_version 319968 (0.00094) [2022-07-09 16:03:26,140][26022] Updated weights on worker 0-0, policy_version 319978 (0.00084) [2022-07-09 16:03:26,396][25689] Fps is (10 sec: 5774.2, 60 sec: 5713.3, 300 sec: 5717.6). Total num frames: 327659520. Throughput: 0: 6017.5. Samples: 327661834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:26,396][25689] Avg episode reward: [(0, '-45.887')] [2022-07-09 16:03:27,673][26022] Updated weights on worker 0-0, policy_version 319988 (0.00085) [2022-07-09 16:03:29,583][26022] Updated weights on worker 0-0, policy_version 319998 (0.00089) [2022-07-09 16:03:31,406][25689] Fps is (10 sec: 5574.8, 60 sec: 5696.5, 300 sec: 5717.6). Total num frames: 327687168. Throughput: 0: 5997.9. Samples: 327696334. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:31,407][25689] Avg episode reward: [(0, '-46.029')] [2022-07-09 16:03:31,467][26022] Updated weights on worker 0-0, policy_version 320008 (0.00087) [2022-07-09 16:03:33,089][26022] Updated weights on worker 0-0, policy_version 320018 (0.00087) [2022-07-09 16:03:34,918][26022] Updated weights on worker 0-0, policy_version 320028 (0.00092) [2022-07-09 16:03:36,467][25689] Fps is (10 sec: 5795.4, 60 sec: 5709.8, 300 sec: 5723.3). Total num frames: 327717888. Throughput: 0: 5141.6. Samples: 327713736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:36,469][25689] Avg episode reward: [(0, '-45.871')] [2022-07-09 16:03:36,599][26022] Updated weights on worker 0-0, policy_version 320038 (0.00091) [2022-07-09 16:03:38,346][26022] Updated weights on worker 0-0, policy_version 320048 (0.00095) [2022-07-09 16:03:40,319][26022] Updated weights on worker 0-0, policy_version 320058 (0.00090) [2022-07-09 16:03:41,483][25689] Fps is (10 sec: 5893.7, 60 sec: 5714.6, 300 sec: 5721.2). Total num frames: 327746560. Throughput: 0: 5992.3. Samples: 327748396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:41,485][25689] Avg episode reward: [(0, '-45.225')] [2022-07-09 16:03:41,948][26022] Updated weights on worker 0-0, policy_version 320068 (0.00083) [2022-07-09 16:03:43,812][26022] Updated weights on worker 0-0, policy_version 320078 (0.00091) [2022-07-09 16:03:45,646][26022] Updated weights on worker 0-0, policy_version 320088 (0.00087) [2022-07-09 16:03:46,606][25689] Fps is (10 sec: 5757.1, 60 sec: 5749.5, 300 sec: 5719.0). Total num frames: 327776256. Throughput: 0: 5988.0. Samples: 327782850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:46,606][25689] Avg episode reward: [(0, '-45.129')] [2022-07-09 16:03:47,357][26022] Updated weights on worker 0-0, policy_version 320098 (0.00082) [2022-07-09 16:03:49,223][26022] Updated weights on worker 0-0, policy_version 320108 (0.00078) [2022-07-09 16:03:50,915][26022] Updated weights on worker 0-0, policy_version 320118 (0.00087) [2022-07-09 16:03:51,648][25689] Fps is (10 sec: 5641.4, 60 sec: 5715.1, 300 sec: 5718.7). Total num frames: 327803904. Throughput: 0: 5136.9. Samples: 327800314. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:51,650][25689] Avg episode reward: [(0, '-45.098')] [2022-07-09 16:03:52,708][26022] Updated weights on worker 0-0, policy_version 320128 (0.00089) [2022-07-09 16:03:54,652][26022] Updated weights on worker 0-0, policy_version 320138 (0.00096) [2022-07-09 16:03:56,280][26022] Updated weights on worker 0-0, policy_version 320148 (0.00084) [2022-07-09 16:03:56,740][25689] Fps is (10 sec: 5658.3, 60 sec: 5741.3, 300 sec: 5724.5). Total num frames: 327833600. Throughput: 0: 5970.4. Samples: 327834774. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:03:56,741][25689] Avg episode reward: [(0, '-44.640')] [2022-07-09 16:03:57,957][26022] Updated weights on worker 0-0, policy_version 320158 (0.00088) [2022-07-09 16:03:59,873][26022] Updated weights on worker 0-0, policy_version 320168 (0.00081) [2022-07-09 16:04:01,755][25689] Fps is (10 sec: 5673.9, 60 sec: 5707.0, 300 sec: 5723.2). Total num frames: 327861248. Throughput: 0: 5957.9. Samples: 327869170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:01,756][25689] Avg episode reward: [(0, '-45.039')] [2022-07-09 16:04:01,976][26022] Updated weights on worker 0-0, policy_version 320178 (0.00092) [2022-07-09 16:04:03,793][26022] Updated weights on worker 0-0, policy_version 320188 (0.00085) [2022-07-09 16:04:05,517][26022] Updated weights on worker 0-0, policy_version 320198 (0.00084) [2022-07-09 16:04:06,844][25689] Fps is (10 sec: 5573.8, 60 sec: 5741.8, 300 sec: 5718.5). Total num frames: 327889920. Throughput: 0: 5010.7. Samples: 327884262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:06,847][25689] Avg episode reward: [(0, '-45.344')] [2022-07-09 16:04:07,400][26022] Updated weights on worker 0-0, policy_version 320208 (0.00094) [2022-07-09 16:04:09,293][26022] Updated weights on worker 0-0, policy_version 320218 (0.00091) [2022-07-09 16:04:10,935][26022] Updated weights on worker 0-0, policy_version 320228 (0.00086) [2022-07-09 16:04:11,859][25689] Fps is (10 sec: 5574.1, 60 sec: 5690.8, 300 sec: 5715.3). Total num frames: 327917568. Throughput: 0: 5873.7. Samples: 327919024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:11,860][25689] Avg episode reward: [(0, '-45.644')] [2022-07-09 16:04:12,759][26022] Updated weights on worker 0-0, policy_version 320238 (0.00785) [2022-07-09 16:04:14,493][26022] Updated weights on worker 0-0, policy_version 320248 (0.01209) [2022-07-09 16:04:16,218][26022] Updated weights on worker 0-0, policy_version 320258 (0.00092) [2022-07-09 16:04:16,862][25689] Fps is (10 sec: 5826.4, 60 sec: 5730.7, 300 sec: 5723.3). Total num frames: 327948288. Throughput: 0: 5914.7. Samples: 327953790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:16,864][25689] Avg episode reward: [(0, '-45.529')] [2022-07-09 16:04:18,138][26022] Updated weights on worker 0-0, policy_version 320268 (0.00086) [2022-07-09 16:04:19,698][26022] Updated weights on worker 0-0, policy_version 320278 (0.00091) [2022-07-09 16:04:21,602][26022] Updated weights on worker 0-0, policy_version 320288 (0.00083) [2022-07-09 16:04:21,873][25689] Fps is (10 sec: 5930.7, 60 sec: 5714.3, 300 sec: 5718.0). Total num frames: 327976960. Throughput: 0: 5067.2. Samples: 327971112. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:21,873][25689] Avg episode reward: [(0, '-45.251')] [2022-07-09 16:04:23,438][26022] Updated weights on worker 0-0, policy_version 320298 (0.00095) [2022-07-09 16:04:25,028][26022] Updated weights on worker 0-0, policy_version 320308 (0.00080) [2022-07-09 16:04:26,910][25689] Fps is (10 sec: 5605.2, 60 sec: 5702.6, 300 sec: 5714.9). Total num frames: 328004608. Throughput: 0: 6056.3. Samples: 328005784. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:26,910][25689] Avg episode reward: [(0, '-45.675')] [2022-07-09 16:04:27,074][26022] Updated weights on worker 0-0, policy_version 320318 (0.00088) [2022-07-09 16:04:28,474][26022] Updated weights on worker 0-0, policy_version 320328 (0.00087) [2022-07-09 16:04:30,553][26022] Updated weights on worker 0-0, policy_version 320338 (0.00092) [2022-07-09 16:04:31,914][25689] Fps is (10 sec: 5813.0, 60 sec: 5754.0, 300 sec: 5725.9). Total num frames: 328035328. Throughput: 0: 6046.8. Samples: 328040294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:31,914][25689] Avg episode reward: [(0, '-45.211')] [2022-07-09 16:04:32,329][26022] Updated weights on worker 0-0, policy_version 320348 (0.00086) [2022-07-09 16:04:34,119][26022] Updated weights on worker 0-0, policy_version 320358 (0.00080) [2022-07-09 16:04:35,758][26022] Updated weights on worker 0-0, policy_version 320368 (0.00091) [2022-07-09 16:04:36,925][25689] Fps is (10 sec: 5827.8, 60 sec: 5707.9, 300 sec: 5715.7). Total num frames: 328062976. Throughput: 0: 5177.8. Samples: 328057672. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:36,927][25689] Avg episode reward: [(0, '-45.053')] [2022-07-09 16:04:37,575][26022] Updated weights on worker 0-0, policy_version 320378 (0.00085) [2022-07-09 16:04:39,236][26022] Updated weights on worker 0-0, policy_version 320388 (0.00808) [2022-07-09 16:04:41,237][26022] Updated weights on worker 0-0, policy_version 320398 (0.00086) [2022-07-09 16:04:41,950][25689] Fps is (10 sec: 5713.8, 60 sec: 5724.0, 300 sec: 5727.3). Total num frames: 328092672. Throughput: 0: 6046.7. Samples: 328092512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:41,951][25689] Avg episode reward: [(0, '-45.877')] [2022-07-09 16:04:42,679][26022] Updated weights on worker 0-0, policy_version 320408 (0.00090) [2022-07-09 16:04:44,819][26022] Updated weights on worker 0-0, policy_version 320418 (0.00093) [2022-07-09 16:04:46,331][26022] Updated weights on worker 0-0, policy_version 320428 (0.00512) [2022-07-09 16:04:47,045][25689] Fps is (10 sec: 5767.8, 60 sec: 5709.7, 300 sec: 5718.9). Total num frames: 328121344. Throughput: 0: 6009.2. Samples: 328126780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:47,045][25689] Avg episode reward: [(0, '-45.027')] [2022-07-09 16:04:48,255][26022] Updated weights on worker 0-0, policy_version 320438 (0.00086) [2022-07-09 16:04:49,971][26022] Updated weights on worker 0-0, policy_version 320448 (0.00084) [2022-07-09 16:04:51,766][26022] Updated weights on worker 0-0, policy_version 320458 (0.00088) [2022-07-09 16:04:52,135][25689] Fps is (10 sec: 5630.5, 60 sec: 5722.1, 300 sec: 5713.8). Total num frames: 328150016. Throughput: 0: 5126.6. Samples: 328143958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:52,136][25689] Avg episode reward: [(0, '-45.981')] [2022-07-09 16:04:53,708][26022] Updated weights on worker 0-0, policy_version 320468 (0.00084) [2022-07-09 16:04:55,447][26022] Updated weights on worker 0-0, policy_version 320478 (0.00085) [2022-07-09 16:04:57,036][26022] Updated weights on worker 0-0, policy_version 320488 (0.00084) [2022-07-09 16:04:57,143][25689] Fps is (10 sec: 5780.3, 60 sec: 5730.1, 300 sec: 5720.6). Total num frames: 328179712. Throughput: 0: 5979.9. Samples: 328178570. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:04:57,143][25689] Avg episode reward: [(0, '-45.909')] [2022-07-09 16:04:59,095][26022] Updated weights on worker 0-0, policy_version 320498 (0.00095) [2022-07-09 16:05:00,533][26022] Updated weights on worker 0-0, policy_version 320508 (0.00091) [2022-07-09 16:05:02,155][25689] Fps is (10 sec: 5416.2, 60 sec: 5679.4, 300 sec: 5708.8). Total num frames: 328204288. Throughput: 0: 5964.3. Samples: 328213020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:05:02,157][25689] Avg episode reward: [(0, '-45.374')] [2022-07-09 16:05:02,859][26022] Updated weights on worker 0-0, policy_version 320518 (0.00089) [2022-07-09 16:05:04,938][26022] Updated weights on worker 0-0, policy_version 320528 (0.00086) [2022-07-09 16:05:06,403][26022] Updated weights on worker 0-0, policy_version 320538 (0.00091) [2022-07-09 16:05:07,219][25689] Fps is (10 sec: 5589.4, 60 sec: 5732.8, 300 sec: 5724.9). Total num frames: 328236032. Throughput: 0: 5017.8. Samples: 328228008. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 16:05:07,219][25689] Avg episode reward: [(0, '-45.716')] [2022-07-09 16:05:08,401][26022] Updated weights on worker 0-0, policy_version 320548 (0.00096) [2022-07-09 16:05:09,852][26022] Updated weights on worker 0-0, policy_version 320558 (0.00100) [2022-07-09 16:05:11,801][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:05:11,813][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000320568_328261632.pth [2022-07-09 16:05:11,813][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000318554_326199296.pth [2022-07-09 16:05:11,824][26022] Updated weights on worker 0-0, policy_version 320568 (0.00086) [2022-07-09 16:05:12,229][25689] Fps is (10 sec: 5794.1, 60 sec: 5716.2, 300 sec: 5718.4). Total num frames: 328262656. Throughput: 0: 5910.2. Samples: 328262718. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:12,229][25689] Avg episode reward: [(0, '-47.055')] [2022-07-09 16:05:13,546][26022] Updated weights on worker 0-0, policy_version 320578 (0.00093) [2022-07-09 16:05:15,547][26022] Updated weights on worker 0-0, policy_version 320588 (0.00634) [2022-07-09 16:05:17,142][26022] Updated weights on worker 0-0, policy_version 320598 (0.00087) [2022-07-09 16:05:17,243][25689] Fps is (10 sec: 5618.7, 60 sec: 5698.3, 300 sec: 5719.1). Total num frames: 328292352. Throughput: 0: 5910.8. Samples: 328297376. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:17,243][25689] Avg episode reward: [(0, '-46.767')] [2022-07-09 16:05:18,830][26022] Updated weights on worker 0-0, policy_version 320608 (0.00089) [2022-07-09 16:05:20,719][26022] Updated weights on worker 0-0, policy_version 320618 (0.00086) [2022-07-09 16:05:22,253][25689] Fps is (10 sec: 5924.8, 60 sec: 5715.3, 300 sec: 5726.9). Total num frames: 328322048. Throughput: 0: 5067.5. Samples: 328314866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:22,254][25689] Avg episode reward: [(0, '-46.076')] [2022-07-09 16:05:22,406][26022] Updated weights on worker 0-0, policy_version 320628 (0.00092) [2022-07-09 16:05:24,331][26022] Updated weights on worker 0-0, policy_version 320638 (0.00084) [2022-07-09 16:05:26,107][26022] Updated weights on worker 0-0, policy_version 320648 (0.00084) [2022-07-09 16:05:27,389][25689] Fps is (10 sec: 5752.5, 60 sec: 5722.8, 300 sec: 5721.7). Total num frames: 328350720. Throughput: 0: 5997.3. Samples: 328348976. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:27,390][25689] Avg episode reward: [(0, '-46.186')] [2022-07-09 16:05:27,812][26022] Updated weights on worker 0-0, policy_version 320658 (0.00080) [2022-07-09 16:05:29,697][26022] Updated weights on worker 0-0, policy_version 320668 (0.00087) [2022-07-09 16:05:31,291][26022] Updated weights on worker 0-0, policy_version 320678 (0.00081) [2022-07-09 16:05:32,418][25689] Fps is (10 sec: 5742.1, 60 sec: 5703.5, 300 sec: 5724.7). Total num frames: 328380416. Throughput: 0: 6004.1. Samples: 328383936. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:32,419][25689] Avg episode reward: [(0, '-46.840')] [2022-07-09 16:05:33,107][26022] Updated weights on worker 0-0, policy_version 320688 (0.00086) [2022-07-09 16:05:34,874][26022] Updated weights on worker 0-0, policy_version 320698 (0.00109) [2022-07-09 16:05:36,617][26022] Updated weights on worker 0-0, policy_version 320708 (0.00093) [2022-07-09 16:05:37,428][25689] Fps is (10 sec: 5814.1, 60 sec: 5720.6, 300 sec: 5724.8). Total num frames: 328409088. Throughput: 0: 6022.8. Samples: 328418950. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:37,429][25689] Avg episode reward: [(0, '-46.986')] [2022-07-09 16:05:38,306][26022] Updated weights on worker 0-0, policy_version 320718 (0.00088) [2022-07-09 16:05:40,226][26022] Updated weights on worker 0-0, policy_version 320728 (0.00085) [2022-07-09 16:05:42,015][26022] Updated weights on worker 0-0, policy_version 320738 (0.00085) [2022-07-09 16:05:42,456][25689] Fps is (10 sec: 5814.5, 60 sec: 5720.3, 300 sec: 5732.9). Total num frames: 328438784. Throughput: 0: 6000.9. Samples: 328436104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:42,458][25689] Avg episode reward: [(0, '-46.958')] [2022-07-09 16:05:43,782][26022] Updated weights on worker 0-0, policy_version 320748 (0.00083) [2022-07-09 16:05:45,483][26022] Updated weights on worker 0-0, policy_version 320758 (0.00087) [2022-07-09 16:05:47,328][26022] Updated weights on worker 0-0, policy_version 320768 (0.00087) [2022-07-09 16:05:47,547][25689] Fps is (10 sec: 5666.9, 60 sec: 5703.7, 300 sec: 5725.1). Total num frames: 328466432. Throughput: 0: 6047.7. Samples: 328470886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:47,549][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 16:05:48,948][26022] Updated weights on worker 0-0, policy_version 320778 (0.00087) [2022-07-09 16:05:50,983][26022] Updated weights on worker 0-0, policy_version 320788 (0.00082) [2022-07-09 16:05:52,447][26022] Updated weights on worker 0-0, policy_version 320798 (0.00087) [2022-07-09 16:05:52,552][25689] Fps is (10 sec: 5781.6, 60 sec: 5745.7, 300 sec: 5728.9). Total num frames: 328497152. Throughput: 0: 6036.5. Samples: 328505474. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:52,553][25689] Avg episode reward: [(0, '-47.390')] [2022-07-09 16:05:54,526][26022] Updated weights on worker 0-0, policy_version 320808 (0.00085) [2022-07-09 16:05:56,210][26022] Updated weights on worker 0-0, policy_version 320818 (0.00094) [2022-07-09 16:05:57,590][25689] Fps is (10 sec: 5811.8, 60 sec: 5708.9, 300 sec: 5724.8). Total num frames: 328524800. Throughput: 0: 5140.3. Samples: 328522590. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:05:57,592][25689] Avg episode reward: [(0, '-46.550')] [2022-07-09 16:05:57,867][26022] Updated weights on worker 0-0, policy_version 320828 (0.00092) [2022-07-09 16:05:59,804][26022] Updated weights on worker 0-0, policy_version 320838 (0.00087) [2022-07-09 16:06:01,895][26022] Updated weights on worker 0-0, policy_version 320848 (0.00083) [2022-07-09 16:06:02,601][25689] Fps is (10 sec: 5502.7, 60 sec: 5759.9, 300 sec: 5726.2). Total num frames: 328552448. Throughput: 0: 5978.5. Samples: 328556536. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:02,601][25689] Avg episode reward: [(0, '-46.355')] [2022-07-09 16:06:03,655][26022] Updated weights on worker 0-0, policy_version 320858 (0.00089) [2022-07-09 16:06:05,383][26022] Updated weights on worker 0-0, policy_version 320868 (0.00082) [2022-07-09 16:06:07,268][26022] Updated weights on worker 0-0, policy_version 320878 (0.00089) [2022-07-09 16:06:07,677][25689] Fps is (10 sec: 5685.2, 60 sec: 5724.8, 300 sec: 5728.3). Total num frames: 328582144. Throughput: 0: 5925.5. Samples: 328590164. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:07,677][25689] Avg episode reward: [(0, '-46.448')] [2022-07-09 16:06:09,054][26022] Updated weights on worker 0-0, policy_version 320888 (0.00085) [2022-07-09 16:06:10,652][26022] Updated weights on worker 0-0, policy_version 320898 (0.00093) [2022-07-09 16:06:12,325][26022] Updated weights on worker 0-0, policy_version 320908 (0.00084) [2022-07-09 16:06:12,719][25689] Fps is (10 sec: 5869.7, 60 sec: 5772.6, 300 sec: 5727.8). Total num frames: 328611840. Throughput: 0: 5060.3. Samples: 328607526. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:12,720][25689] Avg episode reward: [(0, '-45.661')] [2022-07-09 16:06:14,204][26022] Updated weights on worker 0-0, policy_version 320918 (0.00085) [2022-07-09 16:06:15,934][26022] Updated weights on worker 0-0, policy_version 320928 (0.00088) [2022-07-09 16:06:17,732][25689] Fps is (10 sec: 5702.7, 60 sec: 5738.7, 300 sec: 5728.1). Total num frames: 328639488. Throughput: 0: 5957.2. Samples: 328642580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:17,733][25689] Avg episode reward: [(0, '-45.637')] [2022-07-09 16:06:17,801][26022] Updated weights on worker 0-0, policy_version 320938 (0.00093) [2022-07-09 16:06:19,355][26022] Updated weights on worker 0-0, policy_version 320948 (0.00090) [2022-07-09 16:06:21,362][26022] Updated weights on worker 0-0, policy_version 320958 (0.00089) [2022-07-09 16:06:22,768][25689] Fps is (10 sec: 5706.4, 60 sec: 5736.3, 300 sec: 5725.1). Total num frames: 328669184. Throughput: 0: 5988.1. Samples: 328677300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:22,769][25689] Avg episode reward: [(0, '-45.318')] [2022-07-09 16:06:23,283][26022] Updated weights on worker 0-0, policy_version 320968 (0.00088) [2022-07-09 16:06:24,717][26022] Updated weights on worker 0-0, policy_version 320978 (0.00053) [2022-07-09 16:06:26,695][26022] Updated weights on worker 0-0, policy_version 320988 (0.00095) [2022-07-09 16:06:27,818][25689] Fps is (10 sec: 5787.5, 60 sec: 5744.6, 300 sec: 5727.7). Total num frames: 328697856. Throughput: 0: 5176.2. Samples: 328694416. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:27,818][25689] Avg episode reward: [(0, '-46.739')] [2022-07-09 16:06:28,251][26022] Updated weights on worker 0-0, policy_version 320998 (0.00091) [2022-07-09 16:06:30,367][26022] Updated weights on worker 0-0, policy_version 321008 (0.00085) [2022-07-09 16:06:32,276][26022] Updated weights on worker 0-0, policy_version 321018 (0.00090) [2022-07-09 16:06:32,832][25689] Fps is (10 sec: 5698.0, 60 sec: 5729.0, 300 sec: 5724.2). Total num frames: 328726528. Throughput: 0: 6022.9. Samples: 328728664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:32,832][25689] Avg episode reward: [(0, '-46.814')] [2022-07-09 16:06:33,809][26022] Updated weights on worker 0-0, policy_version 321028 (0.00085) [2022-07-09 16:06:35,731][26022] Updated weights on worker 0-0, policy_version 321038 (0.00088) [2022-07-09 16:06:37,337][26022] Updated weights on worker 0-0, policy_version 321048 (0.00088) [2022-07-09 16:06:37,855][25689] Fps is (10 sec: 5611.0, 60 sec: 5710.8, 300 sec: 5717.0). Total num frames: 328754176. Throughput: 0: 5986.2. Samples: 328763038. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:37,856][25689] Avg episode reward: [(0, '-47.399')] [2022-07-09 16:06:39,328][26022] Updated weights on worker 0-0, policy_version 321058 (0.00086) [2022-07-09 16:06:40,865][26022] Updated weights on worker 0-0, policy_version 321068 (0.00087) [2022-07-09 16:06:42,859][25689] Fps is (10 sec: 5718.7, 60 sec: 5713.1, 300 sec: 5721.6). Total num frames: 328783872. Throughput: 0: 5126.7. Samples: 328780302. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:42,860][25689] Avg episode reward: [(0, '-47.702')] [2022-07-09 16:06:42,863][26022] Updated weights on worker 0-0, policy_version 321078 (0.00090) [2022-07-09 16:06:44,498][26022] Updated weights on worker 0-0, policy_version 321088 (0.00090) [2022-07-09 16:06:46,620][26022] Updated weights on worker 0-0, policy_version 321098 (0.00086) [2022-07-09 16:06:47,929][25689] Fps is (10 sec: 5895.4, 60 sec: 5749.0, 300 sec: 5724.0). Total num frames: 328813568. Throughput: 0: 5983.4. Samples: 328814752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:47,930][25689] Avg episode reward: [(0, '-47.445')] [2022-07-09 16:06:48,014][26022] Updated weights on worker 0-0, policy_version 321108 (0.00090) [2022-07-09 16:06:50,058][26022] Updated weights on worker 0-0, policy_version 321118 (0.00709) [2022-07-09 16:06:51,471][26022] Updated weights on worker 0-0, policy_version 321128 (0.00087) [2022-07-09 16:06:52,942][25689] Fps is (10 sec: 5687.4, 60 sec: 5697.3, 300 sec: 5720.6). Total num frames: 328841216. Throughput: 0: 6010.6. Samples: 328849538. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:52,943][25689] Avg episode reward: [(0, '-46.628')] [2022-07-09 16:06:53,575][26022] Updated weights on worker 0-0, policy_version 321138 (0.00087) [2022-07-09 16:06:55,260][26022] Updated weights on worker 0-0, policy_version 321148 (0.00092) [2022-07-09 16:06:57,104][26022] Updated weights on worker 0-0, policy_version 321158 (0.01202) [2022-07-09 16:06:58,012][25689] Fps is (10 sec: 5687.6, 60 sec: 5728.3, 300 sec: 5719.3). Total num frames: 328870912. Throughput: 0: 5147.3. Samples: 328866790. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:06:58,012][25689] Avg episode reward: [(0, '-46.231')] [2022-07-09 16:06:58,958][26022] Updated weights on worker 0-0, policy_version 321168 (0.00084) [2022-07-09 16:07:00,783][26022] Updated weights on worker 0-0, policy_version 321178 (0.00092) [2022-07-09 16:07:02,792][26022] Updated weights on worker 0-0, policy_version 321188 (0.00094) [2022-07-09 16:07:03,048][25689] Fps is (10 sec: 5471.4, 60 sec: 5691.9, 300 sec: 5716.9). Total num frames: 328896512. Throughput: 0: 5966.3. Samples: 328900756. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:03,049][25689] Avg episode reward: [(0, '-46.303')] [2022-07-09 16:07:04,860][26022] Updated weights on worker 0-0, policy_version 321198 (0.00090) [2022-07-09 16:07:06,611][26022] Updated weights on worker 0-0, policy_version 321208 (0.00089) [2022-07-09 16:07:08,177][25689] Fps is (10 sec: 5338.8, 60 sec: 5670.0, 300 sec: 5712.5). Total num frames: 328925184. Throughput: 0: 5819.5. Samples: 328932584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:08,178][25689] Avg episode reward: [(0, '-47.054')] [2022-07-09 16:07:08,386][26022] Updated weights on worker 0-0, policy_version 321218 (0.00096) [2022-07-09 16:07:10,040][26022] Updated weights on worker 0-0, policy_version 321228 (0.00087) [2022-07-09 16:07:11,901][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:07:11,913][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000321238_328947712.pth [2022-07-09 16:07:11,913][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000319224_326885376.pth [2022-07-09 16:07:11,924][26022] Updated weights on worker 0-0, policy_version 321238 (0.00086) [2022-07-09 16:07:13,218][25689] Fps is (10 sec: 5840.4, 60 sec: 5687.1, 300 sec: 5716.5). Total num frames: 328955904. Throughput: 0: 4948.6. Samples: 328949874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:13,220][25689] Avg episode reward: [(0, '-47.047')] [2022-07-09 16:07:13,493][26022] Updated weights on worker 0-0, policy_version 321248 (0.00084) [2022-07-09 16:07:15,779][26022] Updated weights on worker 0-0, policy_version 321258 (0.00089) [2022-07-09 16:07:17,041][26022] Updated weights on worker 0-0, policy_version 321268 (0.00085) [2022-07-09 16:07:18,278][25689] Fps is (10 sec: 5778.8, 60 sec: 5682.8, 300 sec: 5716.0). Total num frames: 328983552. Throughput: 0: 5803.3. Samples: 328984400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:18,278][25689] Avg episode reward: [(0, '-47.496')] [2022-07-09 16:07:19,168][26022] Updated weights on worker 0-0, policy_version 321278 (0.00087) [2022-07-09 16:07:20,685][26022] Updated weights on worker 0-0, policy_version 321288 (0.00089) [2022-07-09 16:07:22,435][26022] Updated weights on worker 0-0, policy_version 321298 (0.00087) [2022-07-09 16:07:23,304][25689] Fps is (10 sec: 5685.1, 60 sec: 5683.6, 300 sec: 5714.0). Total num frames: 329013248. Throughput: 0: 5846.4. Samples: 329019180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:23,305][25689] Avg episode reward: [(0, '-48.205')] [2022-07-09 16:07:24,291][26022] Updated weights on worker 0-0, policy_version 321308 (0.00086) [2022-07-09 16:07:26,210][26022] Updated weights on worker 0-0, policy_version 321318 (0.00088) [2022-07-09 16:07:27,883][26022] Updated weights on worker 0-0, policy_version 321328 (0.00086) [2022-07-09 16:07:28,373][25689] Fps is (10 sec: 5883.1, 60 sec: 5698.7, 300 sec: 5716.3). Total num frames: 329042944. Throughput: 0: 5148.7. Samples: 329036566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:28,374][25689] Avg episode reward: [(0, '-48.308')] [2022-07-09 16:07:29,820][26022] Updated weights on worker 0-0, policy_version 321338 (0.00092) [2022-07-09 16:07:31,286][26022] Updated weights on worker 0-0, policy_version 321348 (0.01230) [2022-07-09 16:07:33,247][26022] Updated weights on worker 0-0, policy_version 321358 (0.00083) [2022-07-09 16:07:33,386][25689] Fps is (10 sec: 5789.4, 60 sec: 5698.8, 300 sec: 5713.0). Total num frames: 329071616. Throughput: 0: 6000.8. Samples: 329070902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:33,391][25689] Avg episode reward: [(0, '-48.726')] [2022-07-09 16:07:34,842][26022] Updated weights on worker 0-0, policy_version 321368 (0.00091) [2022-07-09 16:07:36,660][26022] Updated weights on worker 0-0, policy_version 321378 (0.00483) [2022-07-09 16:07:38,391][26022] Updated weights on worker 0-0, policy_version 321388 (0.00092) [2022-07-09 16:07:38,402][25689] Fps is (10 sec: 5819.9, 60 sec: 5733.3, 300 sec: 5717.5). Total num frames: 329101312. Throughput: 0: 6042.3. Samples: 329105998. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-09 16:07:38,403][25689] Avg episode reward: [(0, '-47.736')] [2022-07-09 16:07:40,380][26022] Updated weights on worker 0-0, policy_version 321398 (0.00091) [2022-07-09 16:07:41,807][26022] Updated weights on worker 0-0, policy_version 321408 (0.00084) [2022-07-09 16:07:43,424][25689] Fps is (10 sec: 5713.0, 60 sec: 5697.9, 300 sec: 5719.6). Total num frames: 329128960. Throughput: 0: 5178.3. Samples: 329123364. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:07:43,424][25689] Avg episode reward: [(0, '-47.031')] [2022-07-09 16:07:43,943][26022] Updated weights on worker 0-0, policy_version 321418 (0.00089) [2022-07-09 16:07:45,466][26022] Updated weights on worker 0-0, policy_version 321428 (0.00095) [2022-07-09 16:07:47,506][26022] Updated weights on worker 0-0, policy_version 321438 (0.00084) [2022-07-09 16:07:48,496][25689] Fps is (10 sec: 5782.4, 60 sec: 5714.5, 300 sec: 5722.4). Total num frames: 329159680. Throughput: 0: 6023.8. Samples: 329157784. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:07:48,497][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 16:07:49,142][26022] Updated weights on worker 0-0, policy_version 321448 (0.00077) [2022-07-09 16:07:50,949][26022] Updated weights on worker 0-0, policy_version 321458 (0.00082) [2022-07-09 16:07:52,543][26022] Updated weights on worker 0-0, policy_version 321468 (0.00089) [2022-07-09 16:07:53,509][25689] Fps is (10 sec: 5888.8, 60 sec: 5731.4, 300 sec: 5725.8). Total num frames: 329188352. Throughput: 0: 6062.1. Samples: 329192890. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:07:53,510][25689] Avg episode reward: [(0, '-45.931')] [2022-07-09 16:07:54,424][26022] Updated weights on worker 0-0, policy_version 321478 (0.00087) [2022-07-09 16:07:56,253][26022] Updated weights on worker 0-0, policy_version 321488 (0.00086) [2022-07-09 16:07:58,031][26022] Updated weights on worker 0-0, policy_version 321498 (0.00080) [2022-07-09 16:07:58,538][25689] Fps is (10 sec: 5608.4, 60 sec: 5701.4, 300 sec: 5718.5). Total num frames: 329216000. Throughput: 0: 5177.0. Samples: 329210242. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:07:58,539][25689] Avg episode reward: [(0, '-44.849')] [2022-07-09 16:07:59,717][26022] Updated weights on worker 0-0, policy_version 321508 (0.00086) [2022-07-09 16:08:01,560][26022] Updated weights on worker 0-0, policy_version 321518 (0.00088) [2022-07-09 16:08:03,551][25689] Fps is (10 sec: 5506.9, 60 sec: 5737.6, 300 sec: 5723.7). Total num frames: 329243648. Throughput: 0: 6026.2. Samples: 329244652. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:03,551][25689] Avg episode reward: [(0, '-45.246')] [2022-07-09 16:08:03,660][26022] Updated weights on worker 0-0, policy_version 321528 (0.00090) [2022-07-09 16:08:05,495][26022] Updated weights on worker 0-0, policy_version 321538 (0.00084) [2022-07-09 16:08:07,118][26022] Updated weights on worker 0-0, policy_version 321548 (0.00087) [2022-07-09 16:08:08,650][25689] Fps is (10 sec: 5468.7, 60 sec: 5723.5, 300 sec: 5711.6). Total num frames: 329271296. Throughput: 0: 5921.0. Samples: 329277112. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:08,650][25689] Avg episode reward: [(0, '-45.695')] [2022-07-09 16:08:09,023][26022] Updated weights on worker 0-0, policy_version 321558 (0.00096) [2022-07-09 16:08:10,847][26022] Updated weights on worker 0-0, policy_version 321568 (0.00081) [2022-07-09 16:08:12,618][26022] Updated weights on worker 0-0, policy_version 321578 (0.00090) [2022-07-09 16:08:13,672][25689] Fps is (10 sec: 5665.7, 60 sec: 5708.2, 300 sec: 5716.0). Total num frames: 329300992. Throughput: 0: 5030.2. Samples: 329294312. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:13,673][25689] Avg episode reward: [(0, '-46.301')] [2022-07-09 16:08:14,306][26022] Updated weights on worker 0-0, policy_version 321588 (0.00091) [2022-07-09 16:08:16,153][26022] Updated weights on worker 0-0, policy_version 321598 (0.00095) [2022-07-09 16:08:17,868][26022] Updated weights on worker 0-0, policy_version 321608 (0.00080) [2022-07-09 16:08:18,674][25689] Fps is (10 sec: 5822.5, 60 sec: 5730.7, 300 sec: 5712.8). Total num frames: 329329664. Throughput: 0: 5894.7. Samples: 329328938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:18,675][25689] Avg episode reward: [(0, '-46.652')] [2022-07-09 16:08:19,697][26022] Updated weights on worker 0-0, policy_version 321618 (0.00088) [2022-07-09 16:08:21,474][26022] Updated weights on worker 0-0, policy_version 321628 (0.00088) [2022-07-09 16:08:23,336][26022] Updated weights on worker 0-0, policy_version 321638 (0.00087) [2022-07-09 16:08:23,698][25689] Fps is (10 sec: 5822.0, 60 sec: 5731.0, 300 sec: 5717.6). Total num frames: 329359360. Throughput: 0: 5907.8. Samples: 329363676. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:23,698][25689] Avg episode reward: [(0, '-47.351')] [2022-07-09 16:08:25,007][26022] Updated weights on worker 0-0, policy_version 321648 (0.00085) [2022-07-09 16:08:27,011][26022] Updated weights on worker 0-0, policy_version 321658 (0.00084) [2022-07-09 16:08:28,630][26022] Updated weights on worker 0-0, policy_version 321668 (0.00085) [2022-07-09 16:08:28,771][25689] Fps is (10 sec: 5882.5, 60 sec: 5730.6, 300 sec: 5723.2). Total num frames: 329389056. Throughput: 0: 5162.1. Samples: 329380978. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:28,771][25689] Avg episode reward: [(0, '-47.517')] [2022-07-09 16:08:30,687][26022] Updated weights on worker 0-0, policy_version 321678 (0.00088) [2022-07-09 16:08:32,318][26022] Updated weights on worker 0-0, policy_version 321688 (0.00088) [2022-07-09 16:08:33,850][25689] Fps is (10 sec: 5648.5, 60 sec: 5707.4, 300 sec: 5712.6). Total num frames: 329416704. Throughput: 0: 5988.8. Samples: 329415152. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:33,850][25689] Avg episode reward: [(0, '-47.694')] [2022-07-09 16:08:34,049][26022] Updated weights on worker 0-0, policy_version 321698 (0.00080) [2022-07-09 16:08:35,763][26022] Updated weights on worker 0-0, policy_version 321708 (0.00468) [2022-07-09 16:08:37,680][26022] Updated weights on worker 0-0, policy_version 321718 (0.00094) [2022-07-09 16:08:38,949][25689] Fps is (10 sec: 5734.5, 60 sec: 5716.4, 300 sec: 5717.8). Total num frames: 329447424. Throughput: 0: 5978.0. Samples: 329450142. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:38,950][25689] Avg episode reward: [(0, '-47.457')] [2022-07-09 16:08:39,202][26022] Updated weights on worker 0-0, policy_version 321728 (0.00083) [2022-07-09 16:08:41,263][26022] Updated weights on worker 0-0, policy_version 321738 (0.00081) [2022-07-09 16:08:42,770][26022] Updated weights on worker 0-0, policy_version 321748 (0.00091) [2022-07-09 16:08:44,039][25689] Fps is (10 sec: 5728.2, 60 sec: 5710.0, 300 sec: 5711.6). Total num frames: 329475072. Throughput: 0: 5958.2. Samples: 329484876. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:44,040][25689] Avg episode reward: [(0, '-47.199')] [2022-07-09 16:08:44,661][26022] Updated weights on worker 0-0, policy_version 321758 (0.00092) [2022-07-09 16:08:46,340][26022] Updated weights on worker 0-0, policy_version 321768 (0.00089) [2022-07-09 16:08:48,248][26022] Updated weights on worker 0-0, policy_version 321778 (0.00099) [2022-07-09 16:08:49,078][25689] Fps is (10 sec: 5762.8, 60 sec: 5713.2, 300 sec: 5722.0). Total num frames: 329505792. Throughput: 0: 5966.9. Samples: 329502146. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:49,078][25689] Avg episode reward: [(0, '-47.363')] [2022-07-09 16:08:49,949][26022] Updated weights on worker 0-0, policy_version 321788 (0.00082) [2022-07-09 16:08:51,786][26022] Updated weights on worker 0-0, policy_version 321798 (0.00087) [2022-07-09 16:08:53,630][26022] Updated weights on worker 0-0, policy_version 321808 (0.00089) [2022-07-09 16:08:54,095][25689] Fps is (10 sec: 5804.5, 60 sec: 5695.9, 300 sec: 5716.5). Total num frames: 329533440. Throughput: 0: 5990.9. Samples: 329536440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:54,096][25689] Avg episode reward: [(0, '-48.030')] [2022-07-09 16:08:55,268][26022] Updated weights on worker 0-0, policy_version 321818 (0.00080) [2022-07-09 16:08:57,126][26022] Updated weights on worker 0-0, policy_version 321828 (0.00092) [2022-07-09 16:08:59,086][26022] Updated weights on worker 0-0, policy_version 321838 (0.00088) [2022-07-09 16:08:59,181][25689] Fps is (10 sec: 5574.1, 60 sec: 5707.4, 300 sec: 5718.6). Total num frames: 329562112. Throughput: 0: 5961.2. Samples: 329570750. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:08:59,182][25689] Avg episode reward: [(0, '-47.734')] [2022-07-09 16:09:00,660][26022] Updated weights on worker 0-0, policy_version 321848 (0.00085) [2022-07-09 16:09:03,274][26022] Updated weights on worker 0-0, policy_version 321858 (0.00093) [2022-07-09 16:09:04,191][25689] Fps is (10 sec: 5680.1, 60 sec: 5724.6, 300 sec: 5720.1). Total num frames: 329590784. Throughput: 0: 5070.4. Samples: 329587052. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:04,191][25689] Avg episode reward: [(0, '-47.981')] [2022-07-09 16:09:04,564][26022] Updated weights on worker 0-0, policy_version 321868 (0.00089) [2022-07-09 16:09:06,678][26022] Updated weights on worker 0-0, policy_version 321878 (0.00096) [2022-07-09 16:09:08,363][26022] Updated weights on worker 0-0, policy_version 321888 (0.00108) [2022-07-09 16:09:09,279][25689] Fps is (10 sec: 5476.1, 60 sec: 5708.6, 300 sec: 5715.3). Total num frames: 329617408. Throughput: 0: 5825.3. Samples: 329619826. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:09,280][25689] Avg episode reward: [(0, '-47.653')] [2022-07-09 16:09:10,073][26022] Updated weights on worker 0-0, policy_version 321898 (0.00092) [2022-07-09 16:09:11,946][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:09:11,958][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000321908_329633792.pth [2022-07-09 16:09:11,958][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000319897_327574528.pth [2022-07-09 16:09:11,963][26022] Updated weights on worker 0-0, policy_version 321908 (0.00090) [2022-07-09 16:09:13,700][26022] Updated weights on worker 0-0, policy_version 321918 (0.00089) [2022-07-09 16:09:14,357][25689] Fps is (10 sec: 5539.7, 60 sec: 5703.4, 300 sec: 5710.4). Total num frames: 329647104. Throughput: 0: 5825.5. Samples: 329654478. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:14,358][25689] Avg episode reward: [(0, '-47.093')] [2022-07-09 16:09:15,478][26022] Updated weights on worker 0-0, policy_version 321928 (0.00086) [2022-07-09 16:09:17,462][26022] Updated weights on worker 0-0, policy_version 321938 (0.00087) [2022-07-09 16:09:19,141][26022] Updated weights on worker 0-0, policy_version 321948 (0.00086) [2022-07-09 16:09:19,359][25689] Fps is (10 sec: 5892.4, 60 sec: 5720.4, 300 sec: 5714.0). Total num frames: 329676800. Throughput: 0: 5009.8. Samples: 329671834. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:19,359][25689] Avg episode reward: [(0, '-47.218')] [2022-07-09 16:09:20,812][26022] Updated weights on worker 0-0, policy_version 321958 (0.00087) [2022-07-09 16:09:22,677][26022] Updated weights on worker 0-0, policy_version 321968 (0.00083) [2022-07-09 16:09:24,117][26022] Updated weights on worker 0-0, policy_version 321978 (0.00080) [2022-07-09 16:09:24,382][25689] Fps is (10 sec: 5822.1, 60 sec: 5703.4, 300 sec: 5717.7). Total num frames: 329705472. Throughput: 0: 5926.7. Samples: 329706722. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:24,383][25689] Avg episode reward: [(0, '-46.897')] [2022-07-09 16:09:26,196][26022] Updated weights on worker 0-0, policy_version 321988 (0.00091) [2022-07-09 16:09:27,979][26022] Updated weights on worker 0-0, policy_version 321998 (0.00093) [2022-07-09 16:09:29,464][25689] Fps is (10 sec: 5674.7, 60 sec: 5685.8, 300 sec: 5709.3). Total num frames: 329734144. Throughput: 0: 6002.2. Samples: 329740978. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:29,464][25689] Avg episode reward: [(0, '-45.740')] [2022-07-09 16:09:29,650][26022] Updated weights on worker 0-0, policy_version 322008 (0.00091) [2022-07-09 16:09:31,497][26022] Updated weights on worker 0-0, policy_version 322018 (0.00085) [2022-07-09 16:09:33,246][26022] Updated weights on worker 0-0, policy_version 322028 (0.00086) [2022-07-09 16:09:34,465][25689] Fps is (10 sec: 5687.3, 60 sec: 5709.9, 300 sec: 5713.0). Total num frames: 329762816. Throughput: 0: 5167.4. Samples: 329758384. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:34,466][25689] Avg episode reward: [(0, '-46.175')] [2022-07-09 16:09:34,991][26022] Updated weights on worker 0-0, policy_version 322038 (0.00084) [2022-07-09 16:09:36,851][26022] Updated weights on worker 0-0, policy_version 322048 (0.00095) [2022-07-09 16:09:38,556][26022] Updated weights on worker 0-0, policy_version 322058 (0.00084) [2022-07-09 16:09:39,483][25689] Fps is (10 sec: 5825.8, 60 sec: 5700.8, 300 sec: 5713.1). Total num frames: 329792512. Throughput: 0: 6030.1. Samples: 329793184. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:39,483][25689] Avg episode reward: [(0, '-46.358')] [2022-07-09 16:09:40,497][26022] Updated weights on worker 0-0, policy_version 322068 (0.00091) [2022-07-09 16:09:42,247][26022] Updated weights on worker 0-0, policy_version 322078 (0.00087) [2022-07-09 16:09:43,920][26022] Updated weights on worker 0-0, policy_version 322088 (0.00985) [2022-07-09 16:09:44,530][25689] Fps is (10 sec: 5799.3, 60 sec: 5721.7, 300 sec: 5714.0). Total num frames: 329821184. Throughput: 0: 6006.1. Samples: 329827730. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:44,531][25689] Avg episode reward: [(0, '-46.520')] [2022-07-09 16:09:45,818][26022] Updated weights on worker 0-0, policy_version 322098 (0.00086) [2022-07-09 16:09:47,494][26022] Updated weights on worker 0-0, policy_version 322108 (0.00083) [2022-07-09 16:09:49,498][26022] Updated weights on worker 0-0, policy_version 322118 (0.00086) [2022-07-09 16:09:49,605][25689] Fps is (10 sec: 5766.1, 60 sec: 5701.3, 300 sec: 5717.8). Total num frames: 329850880. Throughput: 0: 5159.1. Samples: 329844890. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:49,606][25689] Avg episode reward: [(0, '-46.376')] [2022-07-09 16:09:51,022][26022] Updated weights on worker 0-0, policy_version 322128 (0.00081) [2022-07-09 16:09:52,720][26022] Updated weights on worker 0-0, policy_version 322138 (0.00092) [2022-07-09 16:09:54,659][25689] Fps is (10 sec: 5661.6, 60 sec: 5697.9, 300 sec: 5710.0). Total num frames: 329878528. Throughput: 0: 6018.1. Samples: 329879910. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:54,659][25689] Avg episode reward: [(0, '-46.426')] [2022-07-09 16:09:54,700][26022] Updated weights on worker 0-0, policy_version 322148 (0.00097) [2022-07-09 16:09:56,435][26022] Updated weights on worker 0-0, policy_version 322158 (0.00084) [2022-07-09 16:09:58,247][26022] Updated weights on worker 0-0, policy_version 322168 (0.00088) [2022-07-09 16:09:59,666][25689] Fps is (10 sec: 5699.7, 60 sec: 5722.3, 300 sec: 5727.3). Total num frames: 329908224. Throughput: 0: 6015.7. Samples: 329914602. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:09:59,668][25689] Avg episode reward: [(0, '-46.690')] [2022-07-09 16:09:59,915][26022] Updated weights on worker 0-0, policy_version 322178 (0.00093) [2022-07-09 16:10:02,036][26022] Updated weights on worker 0-0, policy_version 322188 (0.00092) [2022-07-09 16:10:03,798][26022] Updated weights on worker 0-0, policy_version 322198 (0.00078) [2022-07-09 16:10:04,693][25689] Fps is (10 sec: 5714.5, 60 sec: 5703.7, 300 sec: 5714.2). Total num frames: 329935872. Throughput: 0: 5061.0. Samples: 329929776. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:10:04,694][25689] Avg episode reward: [(0, '-46.602')] [2022-07-09 16:10:05,521][26022] Updated weights on worker 0-0, policy_version 322208 (0.00054) [2022-07-09 16:10:07,497][26022] Updated weights on worker 0-0, policy_version 322218 (0.00086) [2022-07-09 16:10:09,171][26022] Updated weights on worker 0-0, policy_version 322228 (0.00071) [2022-07-09 16:10:09,784][25689] Fps is (10 sec: 5566.7, 60 sec: 5737.4, 300 sec: 5719.6). Total num frames: 329964544. Throughput: 0: 5923.6. Samples: 329964418. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 16:10:09,784][25689] Avg episode reward: [(0, '-45.852')] [2022-07-09 16:10:11,005][26022] Updated weights on worker 0-0, policy_version 322238 (0.00087) [2022-07-09 16:10:12,624][26022] Updated weights on worker 0-0, policy_version 322248 (0.00090) [2022-07-09 16:10:14,497][26022] Updated weights on worker 0-0, policy_version 322258 (0.00081) [2022-07-09 16:10:14,865][25689] Fps is (10 sec: 5738.3, 60 sec: 5737.1, 300 sec: 5718.3). Total num frames: 329994240. Throughput: 0: 5914.8. Samples: 329999426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:14,866][25689] Avg episode reward: [(0, '-45.897')] [2022-07-09 16:10:16,035][26022] Updated weights on worker 0-0, policy_version 322268 (0.00081) [2022-07-09 16:10:17,966][26022] Updated weights on worker 0-0, policy_version 322278 (0.00084) [2022-07-09 16:10:19,604][26022] Updated weights on worker 0-0, policy_version 322288 (0.00086) [2022-07-09 16:10:19,875][25689] Fps is (10 sec: 5885.2, 60 sec: 5736.2, 300 sec: 5718.3). Total num frames: 330023936. Throughput: 0: 5069.8. Samples: 330017058. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:19,876][25689] Avg episode reward: [(0, '-46.008')] [2022-07-09 16:10:21,457][26022] Updated weights on worker 0-0, policy_version 322298 (0.00083) [2022-07-09 16:10:23,264][26022] Updated weights on worker 0-0, policy_version 322308 (0.00090) [2022-07-09 16:10:24,948][25689] Fps is (10 sec: 5890.4, 60 sec: 5748.5, 300 sec: 5722.9). Total num frames: 330053632. Throughput: 0: 6023.2. Samples: 330051772. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:24,948][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 16:10:24,966][26022] Updated weights on worker 0-0, policy_version 322318 (0.00151) [2022-07-09 16:10:26,824][26022] Updated weights on worker 0-0, policy_version 322328 (0.00091) [2022-07-09 16:10:28,491][26022] Updated weights on worker 0-0, policy_version 322338 (0.00085) [2022-07-09 16:10:30,010][25689] Fps is (10 sec: 5658.1, 60 sec: 5733.4, 300 sec: 5715.4). Total num frames: 330081280. Throughput: 0: 6034.7. Samples: 330086480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:30,011][25689] Avg episode reward: [(0, '-45.172')] [2022-07-09 16:10:30,377][26022] Updated weights on worker 0-0, policy_version 322348 (0.00086) [2022-07-09 16:10:32,138][26022] Updated weights on worker 0-0, policy_version 322358 (0.00086) [2022-07-09 16:10:33,887][26022] Updated weights on worker 0-0, policy_version 322368 (0.00104) [2022-07-09 16:10:35,098][25689] Fps is (10 sec: 5649.4, 60 sec: 5742.1, 300 sec: 5717.4). Total num frames: 330110976. Throughput: 0: 6011.5. Samples: 330121058. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:35,099][25689] Avg episode reward: [(0, '-44.956')] [2022-07-09 16:10:35,727][26022] Updated weights on worker 0-0, policy_version 322378 (0.00089) [2022-07-09 16:10:37,371][26022] Updated weights on worker 0-0, policy_version 322388 (0.00081) [2022-07-09 16:10:39,174][26022] Updated weights on worker 0-0, policy_version 322398 (0.00089) [2022-07-09 16:10:40,128][25689] Fps is (10 sec: 5971.1, 60 sec: 5757.8, 300 sec: 5720.8). Total num frames: 330141696. Throughput: 0: 5994.6. Samples: 330138466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:40,129][25689] Avg episode reward: [(0, '-44.836')] [2022-07-09 16:10:41,107][26022] Updated weights on worker 0-0, policy_version 322408 (0.00085) [2022-07-09 16:10:42,588][26022] Updated weights on worker 0-0, policy_version 322418 (0.00084) [2022-07-09 16:10:44,626][26022] Updated weights on worker 0-0, policy_version 322428 (0.00082) [2022-07-09 16:10:45,223][25689] Fps is (10 sec: 5866.2, 60 sec: 5753.3, 300 sec: 5724.1). Total num frames: 330170368. Throughput: 0: 6012.4. Samples: 330173674. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:45,224][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 16:10:46,154][26022] Updated weights on worker 0-0, policy_version 322438 (0.00085) [2022-07-09 16:10:48,080][26022] Updated weights on worker 0-0, policy_version 322448 (0.00084) [2022-07-09 16:10:49,697][26022] Updated weights on worker 0-0, policy_version 322458 (0.00100) [2022-07-09 16:10:50,283][25689] Fps is (10 sec: 5647.2, 60 sec: 5737.9, 300 sec: 5716.2). Total num frames: 330199040. Throughput: 0: 6012.0. Samples: 330208358. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:50,283][25689] Avg episode reward: [(0, '-45.336')] [2022-07-09 16:10:51,527][26022] Updated weights on worker 0-0, policy_version 322468 (0.00087) [2022-07-09 16:10:53,361][26022] Updated weights on worker 0-0, policy_version 322478 (0.00085) [2022-07-09 16:10:55,070][26022] Updated weights on worker 0-0, policy_version 322488 (0.00091) [2022-07-09 16:10:55,287][25689] Fps is (10 sec: 5799.8, 60 sec: 5776.3, 300 sec: 5723.7). Total num frames: 330228736. Throughput: 0: 5176.2. Samples: 330225556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:10:55,287][25689] Avg episode reward: [(0, '-46.325')] [2022-07-09 16:10:56,832][26022] Updated weights on worker 0-0, policy_version 322498 (0.00091) [2022-07-09 16:10:58,721][26022] Updated weights on worker 0-0, policy_version 322508 (0.00092) [2022-07-09 16:11:00,291][25689] Fps is (10 sec: 5832.0, 60 sec: 5759.8, 300 sec: 5727.3). Total num frames: 330257408. Throughput: 0: 6040.4. Samples: 330260256. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:00,291][25689] Avg episode reward: [(0, '-46.164')] [2022-07-09 16:11:00,323][26022] Updated weights on worker 0-0, policy_version 322518 (0.00081) [2022-07-09 16:11:02,619][26022] Updated weights on worker 0-0, policy_version 322528 (0.00107) [2022-07-09 16:11:04,271][26022] Updated weights on worker 0-0, policy_version 322538 (0.00086) [2022-07-09 16:11:05,307][25689] Fps is (10 sec: 5518.5, 60 sec: 5744.0, 300 sec: 5718.1). Total num frames: 330284032. Throughput: 0: 5934.3. Samples: 330292858. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:05,307][25689] Avg episode reward: [(0, '-46.205')] [2022-07-09 16:11:06,175][26022] Updated weights on worker 0-0, policy_version 322548 (0.00087) [2022-07-09 16:11:07,783][26022] Updated weights on worker 0-0, policy_version 322558 (0.00089) [2022-07-09 16:11:09,654][26022] Updated weights on worker 0-0, policy_version 322568 (0.00098) [2022-07-09 16:11:10,351][25689] Fps is (10 sec: 5598.5, 60 sec: 5765.2, 300 sec: 5718.1). Total num frames: 330313728. Throughput: 0: 5079.6. Samples: 330310296. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:10,352][25689] Avg episode reward: [(0, '-46.102')] [2022-07-09 16:11:11,452][26022] Updated weights on worker 0-0, policy_version 322578 (0.00082) [2022-07-09 16:11:12,175][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:11:12,186][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000322582_330323968.pth [2022-07-09 16:11:12,186][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000320568_328261632.pth [2022-07-09 16:11:13,201][26022] Updated weights on worker 0-0, policy_version 322588 (0.00085) [2022-07-09 16:11:15,051][26022] Updated weights on worker 0-0, policy_version 322598 (0.00087) [2022-07-09 16:11:15,386][25689] Fps is (10 sec: 5689.5, 60 sec: 5735.8, 300 sec: 5717.7). Total num frames: 330341376. Throughput: 0: 5945.9. Samples: 330345062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:15,387][25689] Avg episode reward: [(0, '-44.980')] [2022-07-09 16:11:16,688][26022] Updated weights on worker 0-0, policy_version 322608 (0.00092) [2022-07-09 16:11:18,511][26022] Updated weights on worker 0-0, policy_version 322618 (0.00091) [2022-07-09 16:11:20,347][26022] Updated weights on worker 0-0, policy_version 322628 (0.00089) [2022-07-09 16:11:20,398][25689] Fps is (10 sec: 5707.1, 60 sec: 5735.6, 300 sec: 5718.1). Total num frames: 330371072. Throughput: 0: 5945.5. Samples: 330379806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:20,399][25689] Avg episode reward: [(0, '-45.939')] [2022-07-09 16:11:22,075][26022] Updated weights on worker 0-0, policy_version 322638 (0.00081) [2022-07-09 16:11:23,823][26022] Updated weights on worker 0-0, policy_version 322648 (0.00082) [2022-07-09 16:11:25,407][25689] Fps is (10 sec: 5926.7, 60 sec: 5741.7, 300 sec: 5722.4). Total num frames: 330400768. Throughput: 0: 5194.6. Samples: 330397270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:25,408][25689] Avg episode reward: [(0, '-44.885')] [2022-07-09 16:11:25,512][26022] Updated weights on worker 0-0, policy_version 322658 (0.00083) [2022-07-09 16:11:27,350][26022] Updated weights on worker 0-0, policy_version 322668 (0.00083) [2022-07-09 16:11:28,964][26022] Updated weights on worker 0-0, policy_version 322678 (0.00083) [2022-07-09 16:11:30,524][25689] Fps is (10 sec: 5764.3, 60 sec: 5753.4, 300 sec: 5720.4). Total num frames: 330429440. Throughput: 0: 6035.8. Samples: 330432060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:30,525][25689] Avg episode reward: [(0, '-45.695')] [2022-07-09 16:11:30,750][26022] Updated weights on worker 0-0, policy_version 322688 (0.00094) [2022-07-09 16:11:32,525][26022] Updated weights on worker 0-0, policy_version 322698 (0.00081) [2022-07-09 16:11:34,442][26022] Updated weights on worker 0-0, policy_version 322708 (0.00086) [2022-07-09 16:11:35,539][25689] Fps is (10 sec: 5760.8, 60 sec: 5760.4, 300 sec: 5727.4). Total num frames: 330459136. Throughput: 0: 6038.0. Samples: 330466746. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:35,539][25689] Avg episode reward: [(0, '-45.726')] [2022-07-09 16:11:36,291][26022] Updated weights on worker 0-0, policy_version 322718 (0.00083) [2022-07-09 16:11:37,906][26022] Updated weights on worker 0-0, policy_version 322728 (0.00084) [2022-07-09 16:11:39,552][26022] Updated weights on worker 0-0, policy_version 322738 (0.00085) [2022-07-09 16:11:40,544][25689] Fps is (10 sec: 5825.5, 60 sec: 5728.9, 300 sec: 5724.0). Total num frames: 330487808. Throughput: 0: 5187.4. Samples: 330484308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:40,544][25689] Avg episode reward: [(0, '-47.021')] [2022-07-09 16:11:41,418][26022] Updated weights on worker 0-0, policy_version 322748 (0.00092) [2022-07-09 16:11:43,223][26022] Updated weights on worker 0-0, policy_version 322758 (0.00093) [2022-07-09 16:11:44,905][26022] Updated weights on worker 0-0, policy_version 322768 (0.00084) [2022-07-09 16:11:45,548][25689] Fps is (10 sec: 5831.1, 60 sec: 5754.4, 300 sec: 5725.2). Total num frames: 330517504. Throughput: 0: 6073.7. Samples: 330519604. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:45,549][25689] Avg episode reward: [(0, '-47.117')] [2022-07-09 16:11:46,790][26022] Updated weights on worker 0-0, policy_version 322778 (0.00090) [2022-07-09 16:11:48,415][26022] Updated weights on worker 0-0, policy_version 322789 (0.00081) [2022-07-09 16:11:50,359][26022] Updated weights on worker 0-0, policy_version 322799 (0.00096) [2022-07-09 16:11:50,596][25689] Fps is (10 sec: 5908.4, 60 sec: 5772.5, 300 sec: 5731.4). Total num frames: 330547200. Throughput: 0: 6107.8. Samples: 330554654. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:50,596][25689] Avg episode reward: [(0, '-47.517')] [2022-07-09 16:11:52,075][26022] Updated weights on worker 0-0, policy_version 322809 (0.00083) [2022-07-09 16:11:54,092][26022] Updated weights on worker 0-0, policy_version 322819 (0.00083) [2022-07-09 16:11:55,632][25689] Fps is (10 sec: 5889.8, 60 sec: 5769.4, 300 sec: 5732.1). Total num frames: 330576896. Throughput: 0: 5225.0. Samples: 330571736. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:11:55,633][25689] Avg episode reward: [(0, '-47.897')] [2022-07-09 16:11:55,635][26022] Updated weights on worker 0-0, policy_version 322829 (0.00082) [2022-07-09 16:11:57,630][26022] Updated weights on worker 0-0, policy_version 322839 (0.00094) [2022-07-09 16:11:59,198][26022] Updated weights on worker 0-0, policy_version 322849 (0.00078) [2022-07-09 16:12:00,662][25689] Fps is (10 sec: 5697.0, 60 sec: 5750.1, 300 sec: 5739.1). Total num frames: 330604544. Throughput: 0: 6062.9. Samples: 330606280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:00,662][25689] Avg episode reward: [(0, '-48.345')] [2022-07-09 16:12:01,179][26022] Updated weights on worker 0-0, policy_version 322859 (0.00090) [2022-07-09 16:12:03,140][26022] Updated weights on worker 0-0, policy_version 322869 (0.00081) [2022-07-09 16:12:05,158][26022] Updated weights on worker 0-0, policy_version 322879 (0.00082) [2022-07-09 16:12:05,718][25689] Fps is (10 sec: 5381.0, 60 sec: 5746.2, 300 sec: 5733.6). Total num frames: 330631168. Throughput: 0: 5889.5. Samples: 330638394. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:05,719][25689] Avg episode reward: [(0, '-47.757')] [2022-07-09 16:12:06,849][26022] Updated weights on worker 0-0, policy_version 322889 (0.00093) [2022-07-09 16:12:08,582][26022] Updated weights on worker 0-0, policy_version 322899 (0.00093) [2022-07-09 16:12:10,351][26022] Updated weights on worker 0-0, policy_version 322909 (0.00083) [2022-07-09 16:12:10,829][25689] Fps is (10 sec: 5539.4, 60 sec: 5739.9, 300 sec: 5728.8). Total num frames: 330660864. Throughput: 0: 4996.9. Samples: 330655756. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:10,829][25689] Avg episode reward: [(0, '-47.156')] [2022-07-09 16:12:12,236][26022] Updated weights on worker 0-0, policy_version 322919 (0.00085) [2022-07-09 16:12:13,743][26022] Updated weights on worker 0-0, policy_version 322929 (0.00089) [2022-07-09 16:12:15,679][26022] Updated weights on worker 0-0, policy_version 322939 (0.00093) [2022-07-09 16:12:15,845][25689] Fps is (10 sec: 5864.7, 60 sec: 5775.5, 300 sec: 5736.5). Total num frames: 330690560. Throughput: 0: 5878.7. Samples: 330690562. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:15,846][25689] Avg episode reward: [(0, '-46.716')] [2022-07-09 16:12:17,218][26022] Updated weights on worker 0-0, policy_version 322949 (0.00085) [2022-07-09 16:12:19,199][26022] Updated weights on worker 0-0, policy_version 322959 (0.00088) [2022-07-09 16:12:20,859][25689] Fps is (10 sec: 5717.2, 60 sec: 5741.6, 300 sec: 5729.9). Total num frames: 330718208. Throughput: 0: 5896.9. Samples: 330725380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:20,859][25689] Avg episode reward: [(0, '-46.941')] [2022-07-09 16:12:21,169][26022] Updated weights on worker 0-0, policy_version 322969 (0.00085) [2022-07-09 16:12:22,651][26022] Updated weights on worker 0-0, policy_version 322979 (0.00084) [2022-07-09 16:12:24,704][26022] Updated weights on worker 0-0, policy_version 322989 (0.00092) [2022-07-09 16:12:25,869][25689] Fps is (10 sec: 5822.9, 60 sec: 5758.3, 300 sec: 5734.5). Total num frames: 330748928. Throughput: 0: 5183.9. Samples: 330742854. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:25,870][25689] Avg episode reward: [(0, '-46.217')] [2022-07-09 16:12:26,292][26022] Updated weights on worker 0-0, policy_version 322999 (0.00090) [2022-07-09 16:12:28,075][26022] Updated weights on worker 0-0, policy_version 323009 (0.00085) [2022-07-09 16:12:30,250][26022] Updated weights on worker 0-0, policy_version 323019 (0.00091) [2022-07-09 16:12:30,924][25689] Fps is (10 sec: 5900.7, 60 sec: 5764.3, 300 sec: 5733.7). Total num frames: 330777600. Throughput: 0: 6050.2. Samples: 330777336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:30,925][25689] Avg episode reward: [(0, '-46.568')] [2022-07-09 16:12:31,515][26022] Updated weights on worker 0-0, policy_version 323029 (0.00081) [2022-07-09 16:12:33,576][26022] Updated weights on worker 0-0, policy_version 323039 (0.00083) [2022-07-09 16:12:35,131][26022] Updated weights on worker 0-0, policy_version 323049 (0.00080) [2022-07-09 16:12:35,951][25689] Fps is (10 sec: 5688.0, 60 sec: 5746.1, 300 sec: 5730.0). Total num frames: 330806272. Throughput: 0: 6040.7. Samples: 330812012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:35,951][25689] Avg episode reward: [(0, '-47.836')] [2022-07-09 16:12:36,905][26022] Updated weights on worker 0-0, policy_version 323059 (0.00087) [2022-07-09 16:12:38,882][26022] Updated weights on worker 0-0, policy_version 323069 (0.00094) [2022-07-09 16:12:40,545][26022] Updated weights on worker 0-0, policy_version 323079 (0.00089) [2022-07-09 16:12:40,973][25689] Fps is (10 sec: 5604.7, 60 sec: 5727.6, 300 sec: 5730.0). Total num frames: 330833920. Throughput: 0: 5164.9. Samples: 330829266. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 16:12:40,973][25689] Avg episode reward: [(0, '-47.856')] [2022-07-09 16:12:42,287][26022] Updated weights on worker 0-0, policy_version 323089 (0.00100) [2022-07-09 16:12:44,239][26022] Updated weights on worker 0-0, policy_version 323099 (0.00085) [2022-07-09 16:12:45,807][26022] Updated weights on worker 0-0, policy_version 323109 (0.00084) [2022-07-09 16:12:45,978][25689] Fps is (10 sec: 5820.8, 60 sec: 5744.5, 300 sec: 5731.3). Total num frames: 330864640. Throughput: 0: 6022.8. Samples: 330863966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:12:45,979][25689] Avg episode reward: [(0, '-47.782')] [2022-07-09 16:12:47,638][26022] Updated weights on worker 0-0, policy_version 323119 (0.00082) [2022-07-09 16:12:49,325][26022] Updated weights on worker 0-0, policy_version 323129 (0.00096) [2022-07-09 16:12:51,084][25689] Fps is (10 sec: 5873.7, 60 sec: 5722.0, 300 sec: 5729.5). Total num frames: 330893312. Throughput: 0: 6011.0. Samples: 330898520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:12:51,085][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 16:12:51,273][26022] Updated weights on worker 0-0, policy_version 323139 (0.00572) [2022-07-09 16:12:52,974][26022] Updated weights on worker 0-0, policy_version 323149 (0.00086) [2022-07-09 16:12:54,836][26022] Updated weights on worker 0-0, policy_version 323159 (0.00093) [2022-07-09 16:12:56,102][25689] Fps is (10 sec: 5563.2, 60 sec: 5689.9, 300 sec: 5729.7). Total num frames: 330920960. Throughput: 0: 5153.8. Samples: 330915868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:12:56,103][25689] Avg episode reward: [(0, '-47.693')] [2022-07-09 16:12:56,491][26022] Updated weights on worker 0-0, policy_version 323169 (0.00085) [2022-07-09 16:12:58,271][26022] Updated weights on worker 0-0, policy_version 323179 (0.00085) [2022-07-09 16:12:59,926][26022] Updated weights on worker 0-0, policy_version 323189 (0.00090) [2022-07-09 16:13:01,108][25689] Fps is (10 sec: 5823.0, 60 sec: 5742.9, 300 sec: 5740.2). Total num frames: 330951680. Throughput: 0: 6041.4. Samples: 330950910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:01,108][25689] Avg episode reward: [(0, '-47.774')] [2022-07-09 16:13:02,267][26022] Updated weights on worker 0-0, policy_version 323199 (0.00095) [2022-07-09 16:13:03,957][26022] Updated weights on worker 0-0, policy_version 323209 (0.00089) [2022-07-09 16:13:05,819][26022] Updated weights on worker 0-0, policy_version 323219 (0.00086) [2022-07-09 16:13:06,180][25689] Fps is (10 sec: 5689.8, 60 sec: 5741.4, 300 sec: 5737.3). Total num frames: 330978304. Throughput: 0: 5894.9. Samples: 330983056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:06,181][25689] Avg episode reward: [(0, '-45.776')] [2022-07-09 16:13:07,584][26022] Updated weights on worker 0-0, policy_version 323229 (0.00081) [2022-07-09 16:13:09,523][26022] Updated weights on worker 0-0, policy_version 323239 (0.00098) [2022-07-09 16:13:11,159][26022] Updated weights on worker 0-0, policy_version 323249 (0.00087) [2022-07-09 16:13:11,252][25689] Fps is (10 sec: 5450.8, 60 sec: 5728.1, 300 sec: 5732.9). Total num frames: 331006976. Throughput: 0: 5054.2. Samples: 331000454. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:11,253][25689] Avg episode reward: [(0, '-46.325')] [2022-07-09 16:13:12,193][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:13:12,202][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000323255_331013120.pth [2022-07-09 16:13:12,203][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000321238_328947712.pth [2022-07-09 16:13:13,024][26022] Updated weights on worker 0-0, policy_version 323259 (0.00099) [2022-07-09 16:13:14,746][26022] Updated weights on worker 0-0, policy_version 323269 (0.00081) [2022-07-09 16:13:16,330][25689] Fps is (10 sec: 5750.5, 60 sec: 5722.3, 300 sec: 5734.8). Total num frames: 331036672. Throughput: 0: 5884.2. Samples: 331034896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:16,331][25689] Avg episode reward: [(0, '-46.334')] [2022-07-09 16:13:16,371][26022] Updated weights on worker 0-0, policy_version 323279 (0.00082) [2022-07-09 16:13:18,320][26022] Updated weights on worker 0-0, policy_version 323289 (0.00090) [2022-07-09 16:13:19,953][26022] Updated weights on worker 0-0, policy_version 323299 (0.00081) [2022-07-09 16:13:21,367][25689] Fps is (10 sec: 5871.7, 60 sec: 5753.9, 300 sec: 5734.6). Total num frames: 331066368. Throughput: 0: 5856.5. Samples: 331069560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:21,368][25689] Avg episode reward: [(0, '-46.606')] [2022-07-09 16:13:21,906][26022] Updated weights on worker 0-0, policy_version 323309 (0.00091) [2022-07-09 16:13:23,415][26022] Updated weights on worker 0-0, policy_version 323319 (0.00088) [2022-07-09 16:13:25,507][26022] Updated weights on worker 0-0, policy_version 323329 (0.00099) [2022-07-09 16:13:26,393][25689] Fps is (10 sec: 5800.2, 60 sec: 5718.6, 300 sec: 5732.0). Total num frames: 331095040. Throughput: 0: 5985.3. Samples: 331104038. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:26,394][25689] Avg episode reward: [(0, '-46.010')] [2022-07-09 16:13:26,991][26022] Updated weights on worker 0-0, policy_version 323339 (0.00081) [2022-07-09 16:13:28,852][26022] Updated weights on worker 0-0, policy_version 323349 (0.00090) [2022-07-09 16:13:30,746][26022] Updated weights on worker 0-0, policy_version 323359 (0.00091) [2022-07-09 16:13:31,439][25689] Fps is (10 sec: 5591.9, 60 sec: 5702.5, 300 sec: 5732.7). Total num frames: 331122688. Throughput: 0: 5986.6. Samples: 331121304. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:31,440][25689] Avg episode reward: [(0, '-47.535')] [2022-07-09 16:13:32,465][26022] Updated weights on worker 0-0, policy_version 323369 (0.00087) [2022-07-09 16:13:34,289][26022] Updated weights on worker 0-0, policy_version 323379 (0.00092) [2022-07-09 16:13:35,836][26022] Updated weights on worker 0-0, policy_version 323389 (0.00083) [2022-07-09 16:13:36,517][25689] Fps is (10 sec: 5664.6, 60 sec: 5714.6, 300 sec: 5729.7). Total num frames: 331152384. Throughput: 0: 6014.6. Samples: 331156308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:36,517][25689] Avg episode reward: [(0, '-48.091')] [2022-07-09 16:13:37,648][26022] Updated weights on worker 0-0, policy_version 323399 (0.00078) [2022-07-09 16:13:39,695][26022] Updated weights on worker 0-0, policy_version 323409 (0.00079) [2022-07-09 16:13:41,212][26022] Updated weights on worker 0-0, policy_version 323419 (0.00088) [2022-07-09 16:13:41,605][25689] Fps is (10 sec: 5842.0, 60 sec: 5742.1, 300 sec: 5736.6). Total num frames: 331182080. Throughput: 0: 6017.9. Samples: 331191350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:41,606][25689] Avg episode reward: [(0, '-49.113')] [2022-07-09 16:13:43,067][26022] Updated weights on worker 0-0, policy_version 323429 (0.00083) [2022-07-09 16:13:44,667][26022] Updated weights on worker 0-0, policy_version 323439 (0.00089) [2022-07-09 16:13:46,510][26022] Updated weights on worker 0-0, policy_version 323449 (0.00094) [2022-07-09 16:13:46,675][25689] Fps is (10 sec: 5846.6, 60 sec: 5719.2, 300 sec: 5732.5). Total num frames: 331211776. Throughput: 0: 5169.5. Samples: 331208884. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:46,675][25689] Avg episode reward: [(0, '-49.237')] [2022-07-09 16:13:48,210][26022] Updated weights on worker 0-0, policy_version 323459 (0.00087) [2022-07-09 16:13:50,168][26022] Updated weights on worker 0-0, policy_version 323469 (0.00088) [2022-07-09 16:13:51,701][25689] Fps is (10 sec: 5882.9, 60 sec: 5743.6, 300 sec: 5739.2). Total num frames: 331241472. Throughput: 0: 6032.1. Samples: 331243526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:51,702][25689] Avg episode reward: [(0, '-50.093')] [2022-07-09 16:13:51,811][26022] Updated weights on worker 0-0, policy_version 323479 (0.00093) [2022-07-09 16:13:53,626][26022] Updated weights on worker 0-0, policy_version 323489 (0.00082) [2022-07-09 16:13:55,295][26022] Updated weights on worker 0-0, policy_version 323499 (0.00083) [2022-07-09 16:13:56,732][25689] Fps is (10 sec: 5803.6, 60 sec: 5759.2, 300 sec: 5740.3). Total num frames: 331270144. Throughput: 0: 6041.1. Samples: 331278432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:13:56,733][25689] Avg episode reward: [(0, '-48.881')] [2022-07-09 16:13:57,152][26022] Updated weights on worker 0-0, policy_version 323509 (0.00087) [2022-07-09 16:13:59,050][26022] Updated weights on worker 0-0, policy_version 323519 (0.00085) [2022-07-09 16:14:00,714][26022] Updated weights on worker 0-0, policy_version 323529 (0.00085) [2022-07-09 16:14:01,786][25689] Fps is (10 sec: 5482.9, 60 sec: 5687.2, 300 sec: 5732.6). Total num frames: 331296768. Throughput: 0: 5174.1. Samples: 331295768. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:01,787][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 16:14:02,712][26022] Updated weights on worker 0-0, policy_version 323539 (0.00089) [2022-07-09 16:14:04,867][26022] Updated weights on worker 0-0, policy_version 323549 (0.00083) [2022-07-09 16:14:06,255][26022] Updated weights on worker 0-0, policy_version 323559 (0.00097) [2022-07-09 16:14:06,828][25689] Fps is (10 sec: 5680.1, 60 sec: 5757.6, 300 sec: 5747.2). Total num frames: 331327488. Throughput: 0: 5932.3. Samples: 331328436. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:06,828][25689] Avg episode reward: [(0, '-45.868')] [2022-07-09 16:14:08,485][26022] Updated weights on worker 0-0, policy_version 323569 (0.00087) [2022-07-09 16:14:09,818][26022] Updated weights on worker 0-0, policy_version 323579 (0.00087) [2022-07-09 16:14:11,941][25689] Fps is (10 sec: 5647.0, 60 sec: 5720.0, 300 sec: 5736.2). Total num frames: 331354112. Throughput: 0: 5879.1. Samples: 331362518. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:11,942][25689] Avg episode reward: [(0, '-45.752')] [2022-07-09 16:14:12,003][26022] Updated weights on worker 0-0, policy_version 323589 (0.00098) [2022-07-09 16:14:13,603][26022] Updated weights on worker 0-0, policy_version 323599 (0.00097) [2022-07-09 16:14:15,351][26022] Updated weights on worker 0-0, policy_version 323609 (0.00092) [2022-07-09 16:14:16,954][25689] Fps is (10 sec: 5562.0, 60 sec: 5726.1, 300 sec: 5736.0). Total num frames: 331383808. Throughput: 0: 5018.7. Samples: 331379918. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:16,954][25689] Avg episode reward: [(0, '-45.165')] [2022-07-09 16:14:17,323][26022] Updated weights on worker 0-0, policy_version 323619 (0.00084) [2022-07-09 16:14:18,935][26022] Updated weights on worker 0-0, policy_version 323629 (0.00084) [2022-07-09 16:14:20,675][26022] Updated weights on worker 0-0, policy_version 323639 (0.00097) [2022-07-09 16:14:22,021][25689] Fps is (10 sec: 5790.5, 60 sec: 5706.4, 300 sec: 5735.2). Total num frames: 331412480. Throughput: 0: 5878.5. Samples: 331414718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:22,031][25689] Avg episode reward: [(0, '-44.146')] [2022-07-09 16:14:22,594][26022] Updated weights on worker 0-0, policy_version 323649 (0.00102) [2022-07-09 16:14:24,062][26022] Updated weights on worker 0-0, policy_version 323659 (0.00087) [2022-07-09 16:14:26,211][26022] Updated weights on worker 0-0, policy_version 323669 (0.00106) [2022-07-09 16:14:27,041][25689] Fps is (10 sec: 5786.3, 60 sec: 5723.8, 300 sec: 5739.8). Total num frames: 331442176. Throughput: 0: 5967.2. Samples: 331449052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:27,041][25689] Avg episode reward: [(0, '-44.511')] [2022-07-09 16:14:27,811][26022] Updated weights on worker 0-0, policy_version 323679 (0.00097) [2022-07-09 16:14:29,711][26022] Updated weights on worker 0-0, policy_version 323689 (0.00091) [2022-07-09 16:14:31,651][26022] Updated weights on worker 0-0, policy_version 323699 (0.00082) [2022-07-09 16:14:32,156][25689] Fps is (10 sec: 5759.2, 60 sec: 5734.2, 300 sec: 5737.6). Total num frames: 331470848. Throughput: 0: 5122.4. Samples: 331466066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:32,156][25689] Avg episode reward: [(0, '-45.396')] [2022-07-09 16:14:33,224][26022] Updated weights on worker 0-0, policy_version 323709 (0.00084) [2022-07-09 16:14:35,222][26022] Updated weights on worker 0-0, policy_version 323719 (0.00085) [2022-07-09 16:14:36,669][26022] Updated weights on worker 0-0, policy_version 323729 (0.00092) [2022-07-09 16:14:37,172][25689] Fps is (10 sec: 5761.3, 60 sec: 5740.0, 300 sec: 5737.6). Total num frames: 331500544. Throughput: 0: 5968.2. Samples: 331500584. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:37,173][25689] Avg episode reward: [(0, '-46.239')] [2022-07-09 16:14:38,908][26022] Updated weights on worker 0-0, policy_version 323739 (0.00085) [2022-07-09 16:14:40,340][26022] Updated weights on worker 0-0, policy_version 323749 (0.00085) [2022-07-09 16:14:42,253][25689] Fps is (10 sec: 5679.2, 60 sec: 5707.0, 300 sec: 5733.5). Total num frames: 331528192. Throughput: 0: 5943.3. Samples: 331534962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:42,253][25689] Avg episode reward: [(0, '-47.397')] [2022-07-09 16:14:42,305][26022] Updated weights on worker 0-0, policy_version 323759 (0.00084) [2022-07-09 16:14:43,890][26022] Updated weights on worker 0-0, policy_version 323769 (0.00091) [2022-07-09 16:14:45,859][26022] Updated weights on worker 0-0, policy_version 323779 (0.00081) [2022-07-09 16:14:47,268][25689] Fps is (10 sec: 5781.4, 60 sec: 5729.0, 300 sec: 5738.1). Total num frames: 331558912. Throughput: 0: 5094.4. Samples: 331552096. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:47,268][25689] Avg episode reward: [(0, '-47.185')] [2022-07-09 16:14:47,570][26022] Updated weights on worker 0-0, policy_version 323789 (0.00087) [2022-07-09 16:14:49,376][26022] Updated weights on worker 0-0, policy_version 323799 (0.00091) [2022-07-09 16:14:50,992][26022] Updated weights on worker 0-0, policy_version 323809 (0.00099) [2022-07-09 16:14:52,325][25689] Fps is (10 sec: 5896.8, 60 sec: 5709.2, 300 sec: 5741.5). Total num frames: 331587584. Throughput: 0: 6003.2. Samples: 331587146. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:52,325][25689] Avg episode reward: [(0, '-46.915')] [2022-07-09 16:14:52,807][26022] Updated weights on worker 0-0, policy_version 323819 (0.00075) [2022-07-09 16:14:54,570][26022] Updated weights on worker 0-0, policy_version 323829 (0.00086) [2022-07-09 16:14:56,394][26022] Updated weights on worker 0-0, policy_version 323839 (0.00085) [2022-07-09 16:14:57,373][25689] Fps is (10 sec: 5573.5, 60 sec: 5690.7, 300 sec: 5733.9). Total num frames: 331615232. Throughput: 0: 6001.9. Samples: 331621828. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:14:57,374][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 16:14:58,252][26022] Updated weights on worker 0-0, policy_version 323849 (0.00086) [2022-07-09 16:15:00,018][26022] Updated weights on worker 0-0, policy_version 323859 (0.00084) [2022-07-09 16:15:02,089][26022] Updated weights on worker 0-0, policy_version 323869 (0.00096) [2022-07-09 16:15:02,393][25689] Fps is (10 sec: 5593.8, 60 sec: 5727.7, 300 sec: 5737.4). Total num frames: 331643904. Throughput: 0: 5178.8. Samples: 331639264. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:15:02,394][25689] Avg episode reward: [(0, '-46.593')] [2022-07-09 16:15:03,789][26022] Updated weights on worker 0-0, policy_version 323879 (0.00095) [2022-07-09 16:15:05,733][26022] Updated weights on worker 0-0, policy_version 323889 (0.00085) [2022-07-09 16:15:07,445][25689] Fps is (10 sec: 5591.6, 60 sec: 5676.0, 300 sec: 5734.7). Total num frames: 331671552. Throughput: 0: 5925.5. Samples: 331671656. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 16:15:07,447][25689] Avg episode reward: [(0, '-46.105')] [2022-07-09 16:15:07,551][26022] Updated weights on worker 0-0, policy_version 323899 (0.00094) [2022-07-09 16:15:09,324][26022] Updated weights on worker 0-0, policy_version 323909 (0.00098) [2022-07-09 16:15:10,964][26022] Updated weights on worker 0-0, policy_version 323919 (0.00084) [2022-07-09 16:15:12,210][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:15:12,237][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000323925_331699200.pth [2022-07-09 16:15:12,238][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000321908_329633792.pth [2022-07-09 16:15:12,509][25689] Fps is (10 sec: 5567.5, 60 sec: 5714.5, 300 sec: 5731.6). Total num frames: 331700224. Throughput: 0: 5884.2. Samples: 331705912. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:12,509][25689] Avg episode reward: [(0, '-45.786')] [2022-07-09 16:15:12,882][26022] Updated weights on worker 0-0, policy_version 323929 (0.01131) [2022-07-09 16:15:14,543][26022] Updated weights on worker 0-0, policy_version 323939 (0.00087) [2022-07-09 16:15:16,240][26022] Updated weights on worker 0-0, policy_version 323949 (0.00087) [2022-07-09 16:15:17,530][25689] Fps is (10 sec: 5787.6, 60 sec: 5713.7, 300 sec: 5731.4). Total num frames: 331729920. Throughput: 0: 5036.6. Samples: 331723352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:17,531][25689] Avg episode reward: [(0, '-46.841')] [2022-07-09 16:15:18,054][26022] Updated weights on worker 0-0, policy_version 323959 (0.00089) [2022-07-09 16:15:19,753][26022] Updated weights on worker 0-0, policy_version 323969 (0.00096) [2022-07-09 16:15:21,776][26022] Updated weights on worker 0-0, policy_version 323979 (0.00084) [2022-07-09 16:15:22,539][25689] Fps is (10 sec: 5819.4, 60 sec: 5719.2, 300 sec: 5729.2). Total num frames: 331758592. Throughput: 0: 5911.6. Samples: 331758356. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:22,539][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 16:15:23,322][26022] Updated weights on worker 0-0, policy_version 323989 (0.00087) [2022-07-09 16:15:25,117][26022] Updated weights on worker 0-0, policy_version 323999 (0.00086) [2022-07-09 16:15:26,955][26022] Updated weights on worker 0-0, policy_version 324009 (0.00083) [2022-07-09 16:15:27,563][25689] Fps is (10 sec: 5817.3, 60 sec: 5718.7, 300 sec: 5736.8). Total num frames: 331788288. Throughput: 0: 6031.6. Samples: 331793002. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:27,564][25689] Avg episode reward: [(0, '-46.687')] [2022-07-09 16:15:28,728][26022] Updated weights on worker 0-0, policy_version 324019 (0.00086) [2022-07-09 16:15:30,515][26022] Updated weights on worker 0-0, policy_version 324029 (0.00083) [2022-07-09 16:15:32,226][26022] Updated weights on worker 0-0, policy_version 324039 (0.00089) [2022-07-09 16:15:32,626][25689] Fps is (10 sec: 5786.2, 60 sec: 5723.7, 300 sec: 5733.9). Total num frames: 331816960. Throughput: 0: 5192.1. Samples: 331810362. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:32,627][25689] Avg episode reward: [(0, '-46.819')] [2022-07-09 16:15:34,064][26022] Updated weights on worker 0-0, policy_version 324049 (0.00083) [2022-07-09 16:15:35,883][26022] Updated weights on worker 0-0, policy_version 324059 (0.00084) [2022-07-09 16:15:37,534][26022] Updated weights on worker 0-0, policy_version 324069 (0.00086) [2022-07-09 16:15:37,654][25689] Fps is (10 sec: 5885.7, 60 sec: 5739.5, 300 sec: 5733.9). Total num frames: 331847680. Throughput: 0: 6051.6. Samples: 331845136. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:37,655][25689] Avg episode reward: [(0, '-47.025')] [2022-07-09 16:15:39,324][26022] Updated weights on worker 0-0, policy_version 324079 (0.00088) [2022-07-09 16:15:41,114][26022] Updated weights on worker 0-0, policy_version 324089 (0.00095) [2022-07-09 16:15:42,686][25689] Fps is (10 sec: 5801.8, 60 sec: 5744.1, 300 sec: 5731.7). Total num frames: 331875328. Throughput: 0: 6018.4. Samples: 331879612. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:42,687][25689] Avg episode reward: [(0, '-47.104')] [2022-07-09 16:15:43,063][26022] Updated weights on worker 0-0, policy_version 324099 (0.00082) [2022-07-09 16:15:44,784][26022] Updated weights on worker 0-0, policy_version 324109 (0.00082) [2022-07-09 16:15:46,439][26022] Updated weights on worker 0-0, policy_version 324119 (0.00083) [2022-07-09 16:15:47,717][25689] Fps is (10 sec: 5597.2, 60 sec: 5708.8, 300 sec: 5732.2). Total num frames: 331904000. Throughput: 0: 5162.0. Samples: 331897032. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:47,717][25689] Avg episode reward: [(0, '-46.629')] [2022-07-09 16:15:48,371][26022] Updated weights on worker 0-0, policy_version 324129 (0.00097) [2022-07-09 16:15:49,982][26022] Updated weights on worker 0-0, policy_version 324139 (0.00096) [2022-07-09 16:15:51,754][26022] Updated weights on worker 0-0, policy_version 324149 (0.00086) [2022-07-09 16:15:52,830][25689] Fps is (10 sec: 5855.0, 60 sec: 5737.3, 300 sec: 5733.6). Total num frames: 331934720. Throughput: 0: 6009.6. Samples: 331931780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:52,831][25689] Avg episode reward: [(0, '-46.171')] [2022-07-09 16:15:53,565][26022] Updated weights on worker 0-0, policy_version 324159 (0.00618) [2022-07-09 16:15:55,288][26022] Updated weights on worker 0-0, policy_version 324169 (0.00086) [2022-07-09 16:15:57,157][26022] Updated weights on worker 0-0, policy_version 324179 (0.00083) [2022-07-09 16:15:57,843][25689] Fps is (10 sec: 5764.0, 60 sec: 5740.6, 300 sec: 5730.0). Total num frames: 331962368. Throughput: 0: 6014.6. Samples: 331966562. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:15:57,844][25689] Avg episode reward: [(0, '-47.893')] [2022-07-09 16:15:58,706][26022] Updated weights on worker 0-0, policy_version 324189 (0.00090) [2022-07-09 16:16:00,722][26022] Updated weights on worker 0-0, policy_version 324199 (0.00089) [2022-07-09 16:16:02,848][25689] Fps is (10 sec: 5519.8, 60 sec: 5725.1, 300 sec: 5733.6). Total num frames: 331990016. Throughput: 0: 5942.6. Samples: 331999424. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:02,849][25689] Avg episode reward: [(0, '-48.214')] [2022-07-09 16:16:02,851][26022] Updated weights on worker 0-0, policy_version 324209 (0.00091) [2022-07-09 16:16:04,547][26022] Updated weights on worker 0-0, policy_version 324219 (0.00058) [2022-07-09 16:16:06,212][26022] Updated weights on worker 0-0, policy_version 324229 (0.00084) [2022-07-09 16:16:07,867][25689] Fps is (10 sec: 5618.5, 60 sec: 5745.2, 300 sec: 5730.6). Total num frames: 332018688. Throughput: 0: 5949.0. Samples: 332016906. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:07,868][25689] Avg episode reward: [(0, '-48.188')] [2022-07-09 16:16:08,136][26022] Updated weights on worker 0-0, policy_version 324239 (0.00063) [2022-07-09 16:16:09,977][26022] Updated weights on worker 0-0, policy_version 324249 (0.00090) [2022-07-09 16:16:11,549][26022] Updated weights on worker 0-0, policy_version 324259 (0.00087) [2022-07-09 16:16:12,919][25689] Fps is (10 sec: 5694.0, 60 sec: 5746.3, 300 sec: 5733.8). Total num frames: 332047360. Throughput: 0: 5950.9. Samples: 332051326. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:12,919][25689] Avg episode reward: [(0, '-47.138')] [2022-07-09 16:16:13,319][26022] Updated weights on worker 0-0, policy_version 324269 (0.00081) [2022-07-09 16:16:15,131][26022] Updated weights on worker 0-0, policy_version 324279 (0.00080) [2022-07-09 16:16:16,957][26022] Updated weights on worker 0-0, policy_version 324289 (0.00086) [2022-07-09 16:16:17,939][25689] Fps is (10 sec: 5794.8, 60 sec: 5746.4, 300 sec: 5733.6). Total num frames: 332077056. Throughput: 0: 5953.2. Samples: 332086200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:17,940][25689] Avg episode reward: [(0, '-47.029')] [2022-07-09 16:16:18,674][26022] Updated weights on worker 0-0, policy_version 324299 (0.00085) [2022-07-09 16:16:20,388][26022] Updated weights on worker 0-0, policy_version 324309 (0.00085) [2022-07-09 16:16:22,206][26022] Updated weights on worker 0-0, policy_version 324319 (0.00080) [2022-07-09 16:16:23,020][25689] Fps is (10 sec: 5879.6, 60 sec: 5756.5, 300 sec: 5732.2). Total num frames: 332106752. Throughput: 0: 5169.1. Samples: 332103696. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:23,022][25689] Avg episode reward: [(0, '-46.918')] [2022-07-09 16:16:24,100][26022] Updated weights on worker 0-0, policy_version 324329 (0.00096) [2022-07-09 16:16:25,618][26022] Updated weights on worker 0-0, policy_version 324339 (0.00093) [2022-07-09 16:16:27,551][26022] Updated weights on worker 0-0, policy_version 324349 (0.00080) [2022-07-09 16:16:28,092][25689] Fps is (10 sec: 5849.6, 60 sec: 5752.0, 300 sec: 5736.5). Total num frames: 332136448. Throughput: 0: 6017.9. Samples: 332138622. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:28,093][25689] Avg episode reward: [(0, '-46.991')] [2022-07-09 16:16:29,255][26022] Updated weights on worker 0-0, policy_version 324359 (0.00091) [2022-07-09 16:16:30,910][26022] Updated weights on worker 0-0, policy_version 324369 (0.00084) [2022-07-09 16:16:32,726][26022] Updated weights on worker 0-0, policy_version 324379 (0.00090) [2022-07-09 16:16:33,180][25689] Fps is (10 sec: 5846.0, 60 sec: 5766.5, 300 sec: 5735.1). Total num frames: 332166144. Throughput: 0: 6034.7. Samples: 332173592. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:33,180][25689] Avg episode reward: [(0, '-47.159')] [2022-07-09 16:16:34,698][26022] Updated weights on worker 0-0, policy_version 324389 (0.00088) [2022-07-09 16:16:36,192][26022] Updated weights on worker 0-0, policy_version 324399 (0.00084) [2022-07-09 16:16:38,052][26022] Updated weights on worker 0-0, policy_version 324409 (0.00084) [2022-07-09 16:16:38,214][25689] Fps is (10 sec: 5867.9, 60 sec: 5749.0, 300 sec: 5738.0). Total num frames: 332195840. Throughput: 0: 5175.0. Samples: 332191122. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:38,214][25689] Avg episode reward: [(0, '-48.149')] [2022-07-09 16:16:40,013][26022] Updated weights on worker 0-0, policy_version 324419 (0.00087) [2022-07-09 16:16:41,536][26022] Updated weights on worker 0-0, policy_version 324429 (0.00089) [2022-07-09 16:16:43,286][25689] Fps is (10 sec: 5674.3, 60 sec: 5745.3, 300 sec: 5729.8). Total num frames: 332223488. Throughput: 0: 6018.7. Samples: 332225666. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:43,286][25689] Avg episode reward: [(0, '-48.273')] [2022-07-09 16:16:43,433][26022] Updated weights on worker 0-0, policy_version 324439 (0.00093) [2022-07-09 16:16:45,244][26022] Updated weights on worker 0-0, policy_version 324449 (0.00089) [2022-07-09 16:16:46,995][26022] Updated weights on worker 0-0, policy_version 324459 (0.00085) [2022-07-09 16:16:48,299][25689] Fps is (10 sec: 5686.1, 60 sec: 5763.8, 300 sec: 5730.5). Total num frames: 332253184. Throughput: 0: 6031.8. Samples: 332260504. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:48,299][25689] Avg episode reward: [(0, '-48.313')] [2022-07-09 16:16:48,934][26022] Updated weights on worker 0-0, policy_version 324469 (0.00083) [2022-07-09 16:16:50,515][26022] Updated weights on worker 0-0, policy_version 324479 (0.00090) [2022-07-09 16:16:52,518][26022] Updated weights on worker 0-0, policy_version 324489 (0.00090) [2022-07-09 16:16:53,358][25689] Fps is (10 sec: 5794.7, 60 sec: 5735.1, 300 sec: 5726.6). Total num frames: 332281856. Throughput: 0: 5147.9. Samples: 332277470. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:53,359][25689] Avg episode reward: [(0, '-47.866')] [2022-07-09 16:16:54,220][26022] Updated weights on worker 0-0, policy_version 324499 (0.00087) [2022-07-09 16:16:55,886][26022] Updated weights on worker 0-0, policy_version 324509 (0.00084) [2022-07-09 16:16:57,577][26022] Updated weights on worker 0-0, policy_version 324519 (0.00086) [2022-07-09 16:16:58,379][25689] Fps is (10 sec: 5689.0, 60 sec: 5751.3, 300 sec: 5730.2). Total num frames: 332310528. Throughput: 0: 5994.2. Samples: 332311994. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:16:58,379][25689] Avg episode reward: [(0, '-47.237')] [2022-07-09 16:16:59,447][26022] Updated weights on worker 0-0, policy_version 324529 (0.00094) [2022-07-09 16:17:01,242][26022] Updated weights on worker 0-0, policy_version 324539 (0.00086) [2022-07-09 16:17:03,394][25689] Fps is (10 sec: 5407.9, 60 sec: 5716.5, 300 sec: 5727.6). Total num frames: 332336128. Throughput: 0: 5910.3. Samples: 332344512. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:03,395][25689] Avg episode reward: [(0, '-46.381')] [2022-07-09 16:17:03,603][26022] Updated weights on worker 0-0, policy_version 324549 (0.00085) [2022-07-09 16:17:05,097][26022] Updated weights on worker 0-0, policy_version 324559 (0.00086) [2022-07-09 16:17:06,955][26022] Updated weights on worker 0-0, policy_version 324569 (0.00095) [2022-07-09 16:17:08,399][25689] Fps is (10 sec: 5722.9, 60 sec: 5768.6, 300 sec: 5736.5). Total num frames: 332367872. Throughput: 0: 5039.0. Samples: 332361788. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:08,399][25689] Avg episode reward: [(0, '-46.243')] [2022-07-09 16:17:08,619][26022] Updated weights on worker 0-0, policy_version 324579 (0.00086) [2022-07-09 16:17:10,695][26022] Updated weights on worker 0-0, policy_version 324589 (0.00084) [2022-07-09 16:17:12,318][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:17:12,328][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000324599_332389376.pth [2022-07-09 16:17:12,329][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000322582_330323968.pth [2022-07-09 16:17:12,333][26022] Updated weights on worker 0-0, policy_version 324599 (0.00095) [2022-07-09 16:17:13,548][25689] Fps is (10 sec: 5849.2, 60 sec: 5742.5, 300 sec: 5727.1). Total num frames: 332395520. Throughput: 0: 5885.7. Samples: 332396300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:13,548][25689] Avg episode reward: [(0, '-47.219')] [2022-07-09 16:17:14,199][26022] Updated weights on worker 0-0, policy_version 324609 (0.00093) [2022-07-09 16:17:15,714][26022] Updated weights on worker 0-0, policy_version 324619 (0.00088) [2022-07-09 16:17:17,820][26022] Updated weights on worker 0-0, policy_version 324629 (0.00087) [2022-07-09 16:17:18,588][25689] Fps is (10 sec: 5728.5, 60 sec: 5757.5, 300 sec: 5736.9). Total num frames: 332426240. Throughput: 0: 5897.5. Samples: 332431178. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:18,588][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 16:17:19,175][26022] Updated weights on worker 0-0, policy_version 324639 (0.00091) [2022-07-09 16:17:21,230][26022] Updated weights on worker 0-0, policy_version 324649 (0.00083) [2022-07-09 16:17:22,681][26022] Updated weights on worker 0-0, policy_version 324659 (0.00088) [2022-07-09 16:17:23,593][25689] Fps is (10 sec: 5912.7, 60 sec: 5747.8, 300 sec: 5730.1). Total num frames: 332454912. Throughput: 0: 5152.0. Samples: 332448576. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:23,593][25689] Avg episode reward: [(0, '-47.489')] [2022-07-09 16:17:24,843][26022] Updated weights on worker 0-0, policy_version 324669 (0.00088) [2022-07-09 16:17:26,303][26022] Updated weights on worker 0-0, policy_version 324679 (0.00089) [2022-07-09 16:17:28,325][26022] Updated weights on worker 0-0, policy_version 324689 (0.00082) [2022-07-09 16:17:28,607][25689] Fps is (10 sec: 5621.2, 60 sec: 5719.5, 300 sec: 5727.4). Total num frames: 332482560. Throughput: 0: 6017.5. Samples: 332483390. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:28,608][25689] Avg episode reward: [(0, '-46.574')] [2022-07-09 16:17:29,836][26022] Updated weights on worker 0-0, policy_version 324699 (0.00084) [2022-07-09 16:17:31,939][26022] Updated weights on worker 0-0, policy_version 324709 (0.00082) [2022-07-09 16:17:33,546][26022] Updated weights on worker 0-0, policy_version 324719 (0.00084) [2022-07-09 16:17:33,712][25689] Fps is (10 sec: 5666.9, 60 sec: 5717.8, 300 sec: 5729.4). Total num frames: 332512256. Throughput: 0: 6010.1. Samples: 332517488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:33,712][25689] Avg episode reward: [(0, '-46.647')] [2022-07-09 16:17:35,427][26022] Updated weights on worker 0-0, policy_version 324729 (0.00085) [2022-07-09 16:17:37,140][26022] Updated weights on worker 0-0, policy_version 324739 (0.00096) [2022-07-09 16:17:38,738][25689] Fps is (10 sec: 5862.4, 60 sec: 5718.6, 300 sec: 5736.2). Total num frames: 332541952. Throughput: 0: 5145.6. Samples: 332534862. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-09 16:17:38,740][25689] Avg episode reward: [(0, '-46.513')] [2022-07-09 16:17:38,779][26022] Updated weights on worker 0-0, policy_version 324749 (0.00089) [2022-07-09 16:17:40,798][26022] Updated weights on worker 0-0, policy_version 324759 (0.00085) [2022-07-09 16:17:42,655][26022] Updated weights on worker 0-0, policy_version 324769 (0.00082) [2022-07-09 16:17:43,760][25689] Fps is (10 sec: 5707.0, 60 sec: 5723.3, 300 sec: 5725.5). Total num frames: 332569600. Throughput: 0: 5990.7. Samples: 332569392. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:17:43,762][25689] Avg episode reward: [(0, '-46.326')] [2022-07-09 16:17:44,332][26022] Updated weights on worker 0-0, policy_version 324779 (0.00081) [2022-07-09 16:17:46,103][26022] Updated weights on worker 0-0, policy_version 324789 (0.00085) [2022-07-09 16:17:47,900][26022] Updated weights on worker 0-0, policy_version 324799 (0.00087) [2022-07-09 16:17:48,770][25689] Fps is (10 sec: 5716.3, 60 sec: 5723.6, 300 sec: 5730.8). Total num frames: 332599296. Throughput: 0: 5992.2. Samples: 332604208. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:17:48,770][25689] Avg episode reward: [(0, '-46.866')] [2022-07-09 16:17:49,586][26022] Updated weights on worker 0-0, policy_version 324809 (0.00086) [2022-07-09 16:17:51,517][26022] Updated weights on worker 0-0, policy_version 324819 (0.00100) [2022-07-09 16:17:53,195][26022] Updated weights on worker 0-0, policy_version 324829 (0.00085) [2022-07-09 16:17:53,815][25689] Fps is (10 sec: 5906.7, 60 sec: 5741.9, 300 sec: 5737.2). Total num frames: 332628992. Throughput: 0: 5175.7. Samples: 332621536. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:17:53,815][25689] Avg episode reward: [(0, '-46.585')] [2022-07-09 16:17:54,989][26022] Updated weights on worker 0-0, policy_version 324839 (0.00085) [2022-07-09 16:17:56,623][26022] Updated weights on worker 0-0, policy_version 324849 (0.00085) [2022-07-09 16:17:58,562][26022] Updated weights on worker 0-0, policy_version 324859 (0.00087) [2022-07-09 16:17:58,843][25689] Fps is (10 sec: 5692.8, 60 sec: 5724.3, 300 sec: 5726.4). Total num frames: 332656640. Throughput: 0: 6038.6. Samples: 332656266. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:17:58,843][25689] Avg episode reward: [(0, '-46.823')] [2022-07-09 16:18:00,135][26022] Updated weights on worker 0-0, policy_version 324869 (0.00082) [2022-07-09 16:18:02,575][26022] Updated weights on worker 0-0, policy_version 324879 (0.00090) [2022-07-09 16:18:03,846][25689] Fps is (10 sec: 5512.6, 60 sec: 5759.3, 300 sec: 5731.2). Total num frames: 332684288. Throughput: 0: 5943.1. Samples: 332688764. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:03,846][25689] Avg episode reward: [(0, '-47.420')] [2022-07-09 16:18:04,065][26022] Updated weights on worker 0-0, policy_version 324889 (0.00088) [2022-07-09 16:18:06,106][26022] Updated weights on worker 0-0, policy_version 324899 (0.00085) [2022-07-09 16:18:07,509][26022] Updated weights on worker 0-0, policy_version 324909 (0.00086) [2022-07-09 16:18:08,850][25689] Fps is (10 sec: 5525.5, 60 sec: 5691.6, 300 sec: 5729.1). Total num frames: 332711936. Throughput: 0: 5073.7. Samples: 332706090. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:08,851][25689] Avg episode reward: [(0, '-47.690')] [2022-07-09 16:18:09,584][26022] Updated weights on worker 0-0, policy_version 324919 (0.00084) [2022-07-09 16:18:11,259][26022] Updated weights on worker 0-0, policy_version 324929 (0.00092) [2022-07-09 16:18:13,053][26022] Updated weights on worker 0-0, policy_version 324939 (0.00447) [2022-07-09 16:18:13,920][25689] Fps is (10 sec: 5590.4, 60 sec: 5716.0, 300 sec: 5725.8). Total num frames: 332740608. Throughput: 0: 5927.5. Samples: 332740710. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:13,921][25689] Avg episode reward: [(0, '-47.221')] [2022-07-09 16:18:14,772][26022] Updated weights on worker 0-0, policy_version 324949 (0.00089) [2022-07-09 16:18:16,711][26022] Updated weights on worker 0-0, policy_version 324959 (0.00086) [2022-07-09 16:18:18,342][26022] Updated weights on worker 0-0, policy_version 324969 (0.00088) [2022-07-09 16:18:18,924][25689] Fps is (10 sec: 5895.5, 60 sec: 5719.4, 300 sec: 5729.9). Total num frames: 332771328. Throughput: 0: 5915.0. Samples: 332775048. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:18,925][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 16:18:20,344][26022] Updated weights on worker 0-0, policy_version 324979 (0.00091) [2022-07-09 16:18:21,848][26022] Updated weights on worker 0-0, policy_version 324989 (0.00051) [2022-07-09 16:18:23,887][26022] Updated weights on worker 0-0, policy_version 324999 (0.00087) [2022-07-09 16:18:23,979][25689] Fps is (10 sec: 5802.8, 60 sec: 5697.7, 300 sec: 5725.9). Total num frames: 332798976. Throughput: 0: 5148.4. Samples: 332792416. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:23,979][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 16:18:25,533][26022] Updated weights on worker 0-0, policy_version 325009 (0.00080) [2022-07-09 16:18:27,338][26022] Updated weights on worker 0-0, policy_version 325019 (0.00091) [2022-07-09 16:18:29,029][25689] Fps is (10 sec: 5573.8, 60 sec: 5711.3, 300 sec: 5729.2). Total num frames: 332827648. Throughput: 0: 5988.1. Samples: 332826920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:29,029][25689] Avg episode reward: [(0, '-48.611')] [2022-07-09 16:18:29,183][26022] Updated weights on worker 0-0, policy_version 325029 (0.00083) [2022-07-09 16:18:30,936][26022] Updated weights on worker 0-0, policy_version 325039 (0.00097) [2022-07-09 16:18:32,872][26022] Updated weights on worker 0-0, policy_version 325049 (0.00100) [2022-07-09 16:18:34,086][25689] Fps is (10 sec: 5774.6, 60 sec: 5715.8, 300 sec: 5729.6). Total num frames: 332857344. Throughput: 0: 5963.8. Samples: 332860978. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:34,087][25689] Avg episode reward: [(0, '-49.039')] [2022-07-09 16:18:34,391][26022] Updated weights on worker 0-0, policy_version 325059 (0.00091) [2022-07-09 16:18:36,531][26022] Updated weights on worker 0-0, policy_version 325069 (0.00093) [2022-07-09 16:18:37,950][26022] Updated weights on worker 0-0, policy_version 325079 (0.00094) [2022-07-09 16:18:39,110][25689] Fps is (10 sec: 5789.8, 60 sec: 5699.1, 300 sec: 5727.4). Total num frames: 332886016. Throughput: 0: 5969.0. Samples: 332895536. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:39,110][25689] Avg episode reward: [(0, '-49.836')] [2022-07-09 16:18:40,062][26022] Updated weights on worker 0-0, policy_version 325089 (0.00082) [2022-07-09 16:18:41,519][26022] Updated weights on worker 0-0, policy_version 325099 (0.00084) [2022-07-09 16:18:43,578][26022] Updated weights on worker 0-0, policy_version 325109 (0.00078) [2022-07-09 16:18:44,142][25689] Fps is (10 sec: 5702.8, 60 sec: 5715.1, 300 sec: 5724.7). Total num frames: 332914688. Throughput: 0: 5965.6. Samples: 332912700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:44,142][25689] Avg episode reward: [(0, '-49.108')] [2022-07-09 16:18:45,295][26022] Updated weights on worker 0-0, policy_version 325119 (0.00088) [2022-07-09 16:18:47,011][26022] Updated weights on worker 0-0, policy_version 325129 (0.00086) [2022-07-09 16:18:48,732][26022] Updated weights on worker 0-0, policy_version 325139 (0.00089) [2022-07-09 16:18:49,156][25689] Fps is (10 sec: 5809.7, 60 sec: 5714.6, 300 sec: 5724.9). Total num frames: 332944384. Throughput: 0: 5991.3. Samples: 332947512. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:49,157][25689] Avg episode reward: [(0, '-48.923')] [2022-07-09 16:18:50,514][26022] Updated weights on worker 0-0, policy_version 325149 (0.00084) [2022-07-09 16:18:52,476][26022] Updated weights on worker 0-0, policy_version 325159 (0.00085) [2022-07-09 16:18:54,105][26022] Updated weights on worker 0-0, policy_version 325169 (0.00090) [2022-07-09 16:18:54,212][25689] Fps is (10 sec: 5897.6, 60 sec: 5713.6, 300 sec: 5727.9). Total num frames: 332974080. Throughput: 0: 6014.3. Samples: 332982022. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:54,213][25689] Avg episode reward: [(0, '-48.161')] [2022-07-09 16:18:55,928][26022] Updated weights on worker 0-0, policy_version 325179 (0.00094) [2022-07-09 16:18:57,620][26022] Updated weights on worker 0-0, policy_version 325189 (0.00083) [2022-07-09 16:18:59,242][25689] Fps is (10 sec: 5685.9, 60 sec: 5713.5, 300 sec: 5731.8). Total num frames: 333001728. Throughput: 0: 5152.4. Samples: 332999264. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:18:59,242][25689] Avg episode reward: [(0, '-47.730')] [2022-07-09 16:18:59,383][26022] Updated weights on worker 0-0, policy_version 325199 (0.00091) [2022-07-09 16:19:01,063][26022] Updated weights on worker 0-0, policy_version 325209 (0.00096) [2022-07-09 16:19:03,370][26022] Updated weights on worker 0-0, policy_version 325219 (0.00093) [2022-07-09 16:19:04,277][25689] Fps is (10 sec: 5493.9, 60 sec: 5710.4, 300 sec: 5721.6). Total num frames: 333029376. Throughput: 0: 5934.1. Samples: 333032186. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:04,278][25689] Avg episode reward: [(0, '-46.678')] [2022-07-09 16:19:05,039][26022] Updated weights on worker 0-0, policy_version 325229 (0.00085) [2022-07-09 16:19:06,871][26022] Updated weights on worker 0-0, policy_version 325239 (0.00087) [2022-07-09 16:19:08,506][26022] Updated weights on worker 0-0, policy_version 325249 (0.00097) [2022-07-09 16:19:09,311][25689] Fps is (10 sec: 5695.0, 60 sec: 5741.5, 300 sec: 5733.5). Total num frames: 333059072. Throughput: 0: 5937.1. Samples: 333067170. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:09,311][25689] Avg episode reward: [(0, '-46.124')] [2022-07-09 16:19:10,403][26022] Updated weights on worker 0-0, policy_version 325259 (0.00082) [2022-07-09 16:19:12,133][26022] Updated weights on worker 0-0, policy_version 325269 (0.00091) [2022-07-09 16:19:12,694][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:19:12,709][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000325271_333077504.pth [2022-07-09 16:19:12,711][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000323255_331013120.pth [2022-07-09 16:19:13,943][26022] Updated weights on worker 0-0, policy_version 325279 (0.00084) [2022-07-09 16:19:14,387][25689] Fps is (10 sec: 5773.7, 60 sec: 5741.0, 300 sec: 5728.8). Total num frames: 333087744. Throughput: 0: 5075.4. Samples: 333084414. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:14,387][25689] Avg episode reward: [(0, '-46.702')] [2022-07-09 16:19:15,696][26022] Updated weights on worker 0-0, policy_version 325289 (0.00092) [2022-07-09 16:19:17,526][26022] Updated weights on worker 0-0, policy_version 325299 (0.00086) [2022-07-09 16:19:19,372][26022] Updated weights on worker 0-0, policy_version 325309 (0.00096) [2022-07-09 16:19:19,417][25689] Fps is (10 sec: 5674.4, 60 sec: 5704.7, 300 sec: 5729.5). Total num frames: 333116416. Throughput: 0: 5938.1. Samples: 333119064. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:19,417][25689] Avg episode reward: [(0, '-46.476')] [2022-07-09 16:19:21,002][26022] Updated weights on worker 0-0, policy_version 325319 (0.00091) [2022-07-09 16:19:22,972][26022] Updated weights on worker 0-0, policy_version 325329 (0.00061) [2022-07-09 16:19:24,431][25689] Fps is (10 sec: 5811.3, 60 sec: 5742.4, 300 sec: 5729.6). Total num frames: 333146112. Throughput: 0: 6017.8. Samples: 333153464. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:24,431][25689] Avg episode reward: [(0, '-48.057')] [2022-07-09 16:19:24,463][26022] Updated weights on worker 0-0, policy_version 325339 (0.00087) [2022-07-09 16:19:26,436][26022] Updated weights on worker 0-0, policy_version 325349 (0.00086) [2022-07-09 16:19:28,140][26022] Updated weights on worker 0-0, policy_version 325359 (0.00088) [2022-07-09 16:19:29,470][25689] Fps is (10 sec: 5703.8, 60 sec: 5726.4, 300 sec: 5727.6). Total num frames: 333173760. Throughput: 0: 5131.0. Samples: 333170608. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:29,472][25689] Avg episode reward: [(0, '-47.472')] [2022-07-09 16:19:30,047][26022] Updated weights on worker 0-0, policy_version 325369 (0.00085) [2022-07-09 16:19:31,904][26022] Updated weights on worker 0-0, policy_version 325379 (0.00093) [2022-07-09 16:19:33,666][26022] Updated weights on worker 0-0, policy_version 325389 (0.00089) [2022-07-09 16:19:34,515][25689] Fps is (10 sec: 5584.7, 60 sec: 5710.7, 300 sec: 5723.6). Total num frames: 333202432. Throughput: 0: 5986.1. Samples: 333204906. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:34,516][25689] Avg episode reward: [(0, '-48.297')] [2022-07-09 16:19:35,499][26022] Updated weights on worker 0-0, policy_version 325399 (0.00086) [2022-07-09 16:19:37,227][26022] Updated weights on worker 0-0, policy_version 325409 (0.00089) [2022-07-09 16:19:38,955][26022] Updated weights on worker 0-0, policy_version 325419 (0.00081) [2022-07-09 16:19:39,529][25689] Fps is (10 sec: 5700.7, 60 sec: 5711.5, 300 sec: 5728.4). Total num frames: 333231104. Throughput: 0: 5996.1. Samples: 333239662. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:39,530][25689] Avg episode reward: [(0, '-48.203')] [2022-07-09 16:19:40,726][26022] Updated weights on worker 0-0, policy_version 325429 (0.00091) [2022-07-09 16:19:42,363][26022] Updated weights on worker 0-0, policy_version 325439 (0.00101) [2022-07-09 16:19:44,304][26022] Updated weights on worker 0-0, policy_version 325449 (0.00080) [2022-07-09 16:19:44,555][25689] Fps is (10 sec: 5915.9, 60 sec: 5746.1, 300 sec: 5728.2). Total num frames: 333261824. Throughput: 0: 5152.7. Samples: 333257158. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:44,556][25689] Avg episode reward: [(0, '-48.714')] [2022-07-09 16:19:45,963][26022] Updated weights on worker 0-0, policy_version 325459 (0.00088) [2022-07-09 16:19:47,677][26022] Updated weights on worker 0-0, policy_version 325469 (0.00085) [2022-07-09 16:19:49,569][25689] Fps is (10 sec: 5711.8, 60 sec: 5695.3, 300 sec: 5722.1). Total num frames: 333288448. Throughput: 0: 6015.3. Samples: 333291508. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:49,569][25689] Avg episode reward: [(0, '-48.324')] [2022-07-09 16:19:49,721][26022] Updated weights on worker 0-0, policy_version 325479 (0.00086) [2022-07-09 16:19:51,131][26022] Updated weights on worker 0-0, policy_version 325489 (0.00097) [2022-07-09 16:19:53,347][26022] Updated weights on worker 0-0, policy_version 325499 (0.00080) [2022-07-09 16:19:54,591][26022] Updated weights on worker 0-0, policy_version 325509 (0.00085) [2022-07-09 16:19:54,669][25689] Fps is (10 sec: 5871.9, 60 sec: 5741.9, 300 sec: 5738.3). Total num frames: 333321216. Throughput: 0: 6013.0. Samples: 333326092. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:54,670][25689] Avg episode reward: [(0, '-48.681')] [2022-07-09 16:19:56,984][26022] Updated weights on worker 0-0, policy_version 325519 (0.00084) [2022-07-09 16:19:58,327][26022] Updated weights on worker 0-0, policy_version 325529 (0.00090) [2022-07-09 16:19:59,709][25689] Fps is (10 sec: 5755.9, 60 sec: 5707.0, 300 sec: 5727.6). Total num frames: 333346816. Throughput: 0: 5132.9. Samples: 333343244. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:19:59,710][25689] Avg episode reward: [(0, '-49.067')] [2022-07-09 16:20:00,475][26022] Updated weights on worker 0-0, policy_version 325539 (0.00090) [2022-07-09 16:20:02,455][26022] Updated weights on worker 0-0, policy_version 325549 (0.00091) [2022-07-09 16:20:04,210][26022] Updated weights on worker 0-0, policy_version 325559 (0.00097) [2022-07-09 16:20:04,755][25689] Fps is (10 sec: 5381.3, 60 sec: 5723.0, 300 sec: 5731.2). Total num frames: 333375488. Throughput: 0: 5864.1. Samples: 333375612. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:20:04,755][25689] Avg episode reward: [(0, '-49.287')] [2022-07-09 16:20:06,186][26022] Updated weights on worker 0-0, policy_version 325569 (0.00089) [2022-07-09 16:20:07,885][26022] Updated weights on worker 0-0, policy_version 325579 (0.00089) [2022-07-09 16:20:09,621][26022] Updated weights on worker 0-0, policy_version 325589 (0.00088) [2022-07-09 16:20:09,758][25689] Fps is (10 sec: 5706.4, 60 sec: 5708.9, 300 sec: 5732.3). Total num frames: 333404160. Throughput: 0: 5875.7. Samples: 333410136. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-09 16:20:09,759][25689] Avg episode reward: [(0, '-48.907')] [2022-07-09 16:20:11,559][26022] Updated weights on worker 0-0, policy_version 325599 (0.00083) [2022-07-09 16:20:13,043][26022] Updated weights on worker 0-0, policy_version 325609 (0.00081) [2022-07-09 16:20:14,826][25689] Fps is (10 sec: 5592.0, 60 sec: 5692.7, 300 sec: 5724.6). Total num frames: 333431808. Throughput: 0: 5024.1. Samples: 333427362. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:14,827][25689] Avg episode reward: [(0, '-48.476')] [2022-07-09 16:20:15,034][26022] Updated weights on worker 0-0, policy_version 325619 (0.00091) [2022-07-09 16:20:16,738][26022] Updated weights on worker 0-0, policy_version 325629 (0.00085) [2022-07-09 16:20:18,517][26022] Updated weights on worker 0-0, policy_version 325639 (0.00087) [2022-07-09 16:20:19,888][25689] Fps is (10 sec: 5661.1, 60 sec: 5706.6, 300 sec: 5727.0). Total num frames: 333461504. Throughput: 0: 5882.5. Samples: 333461944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:19,888][25689] Avg episode reward: [(0, '-48.093')] [2022-07-09 16:20:20,254][26022] Updated weights on worker 0-0, policy_version 325649 (0.00087) [2022-07-09 16:20:21,926][26022] Updated weights on worker 0-0, policy_version 325659 (0.00086) [2022-07-09 16:20:23,771][26022] Updated weights on worker 0-0, policy_version 325669 (0.00081) [2022-07-09 16:20:24,971][25689] Fps is (10 sec: 5753.4, 60 sec: 5683.2, 300 sec: 5722.4). Total num frames: 333490176. Throughput: 0: 5995.5. Samples: 333496820. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:24,972][25689] Avg episode reward: [(0, '-48.580')] [2022-07-09 16:20:25,674][26022] Updated weights on worker 0-0, policy_version 325679 (0.00085) [2022-07-09 16:20:27,366][26022] Updated weights on worker 0-0, policy_version 325689 (0.00086) [2022-07-09 16:20:29,096][26022] Updated weights on worker 0-0, policy_version 325699 (0.00095) [2022-07-09 16:20:30,008][25689] Fps is (10 sec: 5767.5, 60 sec: 5717.3, 300 sec: 5726.3). Total num frames: 333519872. Throughput: 0: 5126.4. Samples: 333513944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:30,009][25689] Avg episode reward: [(0, '-46.959')] [2022-07-09 16:20:30,924][26022] Updated weights on worker 0-0, policy_version 325709 (0.00823) [2022-07-09 16:20:32,706][26022] Updated weights on worker 0-0, policy_version 325719 (0.00096) [2022-07-09 16:20:34,544][26022] Updated weights on worker 0-0, policy_version 325729 (0.00088) [2022-07-09 16:20:35,088][25689] Fps is (10 sec: 5870.5, 60 sec: 5730.9, 300 sec: 5721.9). Total num frames: 333549568. Throughput: 0: 5974.4. Samples: 333548414. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:35,089][25689] Avg episode reward: [(0, '-47.175')] [2022-07-09 16:20:36,287][26022] Updated weights on worker 0-0, policy_version 325739 (0.00083) [2022-07-09 16:20:37,891][26022] Updated weights on worker 0-0, policy_version 325749 (0.00085) [2022-07-09 16:20:39,797][26022] Updated weights on worker 0-0, policy_version 325759 (0.00089) [2022-07-09 16:20:40,105][25689] Fps is (10 sec: 5780.7, 60 sec: 5730.6, 300 sec: 5725.6). Total num frames: 333578240. Throughput: 0: 6005.6. Samples: 333583360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:40,106][25689] Avg episode reward: [(0, '-47.460')] [2022-07-09 16:20:41,656][26022] Updated weights on worker 0-0, policy_version 325769 (0.00084) [2022-07-09 16:20:43,416][26022] Updated weights on worker 0-0, policy_version 325779 (0.00090) [2022-07-09 16:20:45,137][25689] Fps is (10 sec: 5808.4, 60 sec: 5713.1, 300 sec: 5729.0). Total num frames: 333607936. Throughput: 0: 5147.7. Samples: 333600628. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:45,138][25689] Avg episode reward: [(0, '-47.856')] [2022-07-09 16:20:45,142][26022] Updated weights on worker 0-0, policy_version 325789 (0.00089) [2022-07-09 16:20:47,055][26022] Updated weights on worker 0-0, policy_version 325799 (0.00084) [2022-07-09 16:20:48,899][26022] Updated weights on worker 0-0, policy_version 325809 (0.00079) [2022-07-09 16:20:50,165][25689] Fps is (10 sec: 5801.9, 60 sec: 5745.5, 300 sec: 5723.8). Total num frames: 333636608. Throughput: 0: 6004.9. Samples: 333634986. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:50,166][25689] Avg episode reward: [(0, '-47.606')] [2022-07-09 16:20:50,660][26022] Updated weights on worker 0-0, policy_version 325819 (0.00613) [2022-07-09 16:20:52,443][26022] Updated weights on worker 0-0, policy_version 325829 (0.00081) [2022-07-09 16:20:54,239][26022] Updated weights on worker 0-0, policy_version 325839 (0.00935) [2022-07-09 16:20:55,300][25689] Fps is (10 sec: 5642.5, 60 sec: 5674.8, 300 sec: 5724.9). Total num frames: 333665280. Throughput: 0: 5985.7. Samples: 333669396. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:20:55,301][25689] Avg episode reward: [(0, '-47.344')] [2022-07-09 16:20:55,930][26022] Updated weights on worker 0-0, policy_version 325849 (0.00098) [2022-07-09 16:20:57,852][26022] Updated weights on worker 0-0, policy_version 325859 (0.00093) [2022-07-09 16:20:59,299][26022] Updated weights on worker 0-0, policy_version 325869 (0.00086) [2022-07-09 16:21:00,354][25689] Fps is (10 sec: 5628.0, 60 sec: 5724.1, 300 sec: 5727.4). Total num frames: 333693952. Throughput: 0: 5950.6. Samples: 333703856. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:00,355][25689] Avg episode reward: [(0, '-48.276')] [2022-07-09 16:21:01,798][26022] Updated weights on worker 0-0, policy_version 325879 (0.00099) [2022-07-09 16:21:03,337][26022] Updated weights on worker 0-0, policy_version 325889 (0.00090) [2022-07-09 16:21:05,247][26022] Updated weights on worker 0-0, policy_version 325899 (0.00093) [2022-07-09 16:21:05,441][25689] Fps is (10 sec: 5553.8, 60 sec: 5703.3, 300 sec: 5722.6). Total num frames: 333721600. Throughput: 0: 5818.5. Samples: 333718766. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:05,445][25689] Avg episode reward: [(0, '-48.003')] [2022-07-09 16:21:06,903][26022] Updated weights on worker 0-0, policy_version 325909 (0.00329) [2022-07-09 16:21:08,828][26022] Updated weights on worker 0-0, policy_version 325919 (0.00094) [2022-07-09 16:21:10,500][25689] Fps is (10 sec: 5551.0, 60 sec: 5698.1, 300 sec: 5722.5). Total num frames: 333750272. Throughput: 0: 5807.0. Samples: 333753072. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:10,501][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 16:21:10,806][26022] Updated weights on worker 0-0, policy_version 325929 (0.00086) [2022-07-09 16:21:12,433][26022] Updated weights on worker 0-0, policy_version 325939 (0.00093) [2022-07-09 16:21:12,852][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:21:12,861][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000325941_333763584.pth [2022-07-09 16:21:12,861][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000323925_331699200.pth [2022-07-09 16:21:14,426][26022] Updated weights on worker 0-0, policy_version 325949 (0.00083) [2022-07-09 16:21:15,560][25689] Fps is (10 sec: 5667.1, 60 sec: 5715.7, 300 sec: 5718.3). Total num frames: 333778944. Throughput: 0: 5819.8. Samples: 333787304. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:15,560][25689] Avg episode reward: [(0, '-47.807')] [2022-07-09 16:21:16,131][26022] Updated weights on worker 0-0, policy_version 325959 (0.00089) [2022-07-09 16:21:17,981][26022] Updated weights on worker 0-0, policy_version 325969 (0.00085) [2022-07-09 16:21:19,687][26022] Updated weights on worker 0-0, policy_version 325979 (0.00093) [2022-07-09 16:21:20,569][25689] Fps is (10 sec: 5593.5, 60 sec: 5686.9, 300 sec: 5712.8). Total num frames: 333806592. Throughput: 0: 4968.4. Samples: 333804292. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:20,570][25689] Avg episode reward: [(0, '-47.271')] [2022-07-09 16:21:21,548][26022] Updated weights on worker 0-0, policy_version 325989 (0.00087) [2022-07-09 16:21:23,314][26022] Updated weights on worker 0-0, policy_version 325999 (0.00087) [2022-07-09 16:21:24,926][26022] Updated weights on worker 0-0, policy_version 326009 (0.00088) [2022-07-09 16:21:25,574][25689] Fps is (10 sec: 5726.2, 60 sec: 5711.2, 300 sec: 5714.1). Total num frames: 333836288. Throughput: 0: 5978.1. Samples: 333839124. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:25,574][25689] Avg episode reward: [(0, '-47.845')] [2022-07-09 16:21:26,964][26022] Updated weights on worker 0-0, policy_version 326019 (0.00087) [2022-07-09 16:21:28,456][26022] Updated weights on worker 0-0, policy_version 326029 (0.00086) [2022-07-09 16:21:30,347][26022] Updated weights on worker 0-0, policy_version 326039 (0.00087) [2022-07-09 16:21:30,577][25689] Fps is (10 sec: 5730.3, 60 sec: 5680.6, 300 sec: 5708.8). Total num frames: 333863936. Throughput: 0: 6004.8. Samples: 333873626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:30,577][25689] Avg episode reward: [(0, '-47.429')] [2022-07-09 16:21:32,017][26022] Updated weights on worker 0-0, policy_version 326049 (0.00087) [2022-07-09 16:21:33,844][26022] Updated weights on worker 0-0, policy_version 326059 (0.00086) [2022-07-09 16:21:35,631][25689] Fps is (10 sec: 5702.1, 60 sec: 5683.0, 300 sec: 5708.5). Total num frames: 333893632. Throughput: 0: 5154.8. Samples: 333890766. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:35,631][25689] Avg episode reward: [(0, '-47.619')] [2022-07-09 16:21:35,645][26022] Updated weights on worker 0-0, policy_version 326069 (0.00086) [2022-07-09 16:21:37,406][26022] Updated weights on worker 0-0, policy_version 326079 (0.00088) [2022-07-09 16:21:39,199][26022] Updated weights on worker 0-0, policy_version 326089 (0.00093) [2022-07-09 16:21:40,633][25689] Fps is (10 sec: 5803.7, 60 sec: 5684.3, 300 sec: 5713.2). Total num frames: 333922304. Throughput: 0: 6035.8. Samples: 333925396. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:40,634][25689] Avg episode reward: [(0, '-48.197')] [2022-07-09 16:21:41,248][26022] Updated weights on worker 0-0, policy_version 326099 (0.00090) [2022-07-09 16:21:42,797][26022] Updated weights on worker 0-0, policy_version 326109 (0.00091) [2022-07-09 16:21:44,692][26022] Updated weights on worker 0-0, policy_version 326119 (0.00094) [2022-07-09 16:21:45,648][25689] Fps is (10 sec: 5929.0, 60 sec: 5702.9, 300 sec: 5716.7). Total num frames: 333953024. Throughput: 0: 6029.6. Samples: 333960162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:45,648][25689] Avg episode reward: [(0, '-48.321')] [2022-07-09 16:21:46,247][26022] Updated weights on worker 0-0, policy_version 326129 (0.00094) [2022-07-09 16:21:48,390][26022] Updated weights on worker 0-0, policy_version 326139 (0.00094) [2022-07-09 16:21:49,903][26022] Updated weights on worker 0-0, policy_version 326149 (0.00093) [2022-07-09 16:21:50,653][25689] Fps is (10 sec: 5723.0, 60 sec: 5671.2, 300 sec: 5710.8). Total num frames: 333979648. Throughput: 0: 5160.1. Samples: 333977224. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:50,654][25689] Avg episode reward: [(0, '-48.011')] [2022-07-09 16:21:51,836][26022] Updated weights on worker 0-0, policy_version 326159 (0.00086) [2022-07-09 16:21:53,663][26022] Updated weights on worker 0-0, policy_version 326169 (0.00088) [2022-07-09 16:21:55,315][26022] Updated weights on worker 0-0, policy_version 326179 (0.00090) [2022-07-09 16:21:55,694][25689] Fps is (10 sec: 5707.8, 60 sec: 5713.9, 300 sec: 5717.3). Total num frames: 334010368. Throughput: 0: 6031.5. Samples: 334011780. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:21:55,695][25689] Avg episode reward: [(0, '-47.948')] [2022-07-09 16:21:57,273][26022] Updated weights on worker 0-0, policy_version 326189 (0.00060) [2022-07-09 16:21:58,846][26022] Updated weights on worker 0-0, policy_version 326199 (0.00088) [2022-07-09 16:22:00,717][25689] Fps is (10 sec: 5698.0, 60 sec: 5683.0, 300 sec: 5720.6). Total num frames: 334036992. Throughput: 0: 6004.3. Samples: 334045986. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:00,718][25689] Avg episode reward: [(0, '-47.471')] [2022-07-09 16:22:00,836][26022] Updated weights on worker 0-0, policy_version 326209 (0.00086) [2022-07-09 16:22:02,905][26022] Updated weights on worker 0-0, policy_version 326219 (0.00092) [2022-07-09 16:22:04,707][26022] Updated weights on worker 0-0, policy_version 326229 (0.00082) [2022-07-09 16:22:05,730][25689] Fps is (10 sec: 5306.0, 60 sec: 5672.9, 300 sec: 5703.2). Total num frames: 334063616. Throughput: 0: 5018.8. Samples: 334060950. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:05,732][25689] Avg episode reward: [(0, '-47.411')] [2022-07-09 16:22:06,550][26022] Updated weights on worker 0-0, policy_version 326239 (0.00093) [2022-07-09 16:22:08,416][26022] Updated weights on worker 0-0, policy_version 326249 (0.00086) [2022-07-09 16:22:10,002][26022] Updated weights on worker 0-0, policy_version 326259 (0.00086) [2022-07-09 16:22:10,778][25689] Fps is (10 sec: 5597.9, 60 sec: 5691.0, 300 sec: 5712.0). Total num frames: 334093312. Throughput: 0: 5879.9. Samples: 334095556. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:10,780][25689] Avg episode reward: [(0, '-47.644')] [2022-07-09 16:22:11,895][26022] Updated weights on worker 0-0, policy_version 326269 (0.00087) [2022-07-09 16:22:13,576][26022] Updated weights on worker 0-0, policy_version 326279 (0.00085) [2022-07-09 16:22:15,419][26022] Updated weights on worker 0-0, policy_version 326289 (0.00093) [2022-07-09 16:22:15,839][25689] Fps is (10 sec: 5672.7, 60 sec: 5673.9, 300 sec: 5701.3). Total num frames: 334120960. Throughput: 0: 5851.9. Samples: 334129662. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:15,839][25689] Avg episode reward: [(0, '-47.529')] [2022-07-09 16:22:17,529][26022] Updated weights on worker 0-0, policy_version 326299 (0.00093) [2022-07-09 16:22:19,018][26022] Updated weights on worker 0-0, policy_version 326309 (0.00084) [2022-07-09 16:22:20,853][25689] Fps is (10 sec: 5488.6, 60 sec: 5673.4, 300 sec: 5697.7). Total num frames: 334148608. Throughput: 0: 4998.4. Samples: 334146632. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:20,853][25689] Avg episode reward: [(0, '-47.044')] [2022-07-09 16:22:20,953][26022] Updated weights on worker 0-0, policy_version 326319 (0.00088) [2022-07-09 16:22:22,646][26022] Updated weights on worker 0-0, policy_version 326329 (0.00089) [2022-07-09 16:22:24,511][26022] Updated weights on worker 0-0, policy_version 326339 (0.00094) [2022-07-09 16:22:25,862][25689] Fps is (10 sec: 5823.1, 60 sec: 5690.0, 300 sec: 5708.1). Total num frames: 334179328. Throughput: 0: 5960.9. Samples: 334180956. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:25,864][25689] Avg episode reward: [(0, '-46.796')] [2022-07-09 16:22:26,356][26022] Updated weights on worker 0-0, policy_version 326349 (0.00090) [2022-07-09 16:22:27,981][26022] Updated weights on worker 0-0, policy_version 326359 (0.00082) [2022-07-09 16:22:29,996][26022] Updated weights on worker 0-0, policy_version 326369 (0.00091) [2022-07-09 16:22:30,866][25689] Fps is (10 sec: 5931.7, 60 sec: 5706.9, 300 sec: 5706.6). Total num frames: 334208000. Throughput: 0: 5966.8. Samples: 334215414. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:30,866][25689] Avg episode reward: [(0, '-46.717')] [2022-07-09 16:22:31,697][26022] Updated weights on worker 0-0, policy_version 326379 (0.00089) [2022-07-09 16:22:33,339][26022] Updated weights on worker 0-0, policy_version 326389 (0.00093) [2022-07-09 16:22:35,321][26022] Updated weights on worker 0-0, policy_version 326399 (0.00090) [2022-07-09 16:22:35,950][25689] Fps is (10 sec: 5684.7, 60 sec: 5687.1, 300 sec: 5702.1). Total num frames: 334236672. Throughput: 0: 5113.5. Samples: 334232500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:35,950][25689] Avg episode reward: [(0, '-45.980')] [2022-07-09 16:22:36,859][26022] Updated weights on worker 0-0, policy_version 326409 (0.00086) [2022-07-09 16:22:38,871][26022] Updated weights on worker 0-0, policy_version 326419 (0.00084) [2022-07-09 16:22:40,766][26022] Updated weights on worker 0-0, policy_version 326430 (0.00091) [2022-07-09 16:22:41,058][25689] Fps is (10 sec: 5626.4, 60 sec: 5677.2, 300 sec: 5703.9). Total num frames: 334265344. Throughput: 0: 5973.2. Samples: 334267318. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 16:22:41,058][25689] Avg episode reward: [(0, '-45.893')] [2022-07-09 16:22:42,435][26022] Updated weights on worker 0-0, policy_version 326440 (0.00087) [2022-07-09 16:22:44,332][26022] Updated weights on worker 0-0, policy_version 326450 (0.00087) [2022-07-09 16:22:46,056][26022] Updated weights on worker 0-0, policy_version 326460 (0.00095) [2022-07-09 16:22:46,150][25689] Fps is (10 sec: 5722.1, 60 sec: 5653.0, 300 sec: 5702.3). Total num frames: 334295040. Throughput: 0: 5953.6. Samples: 334301742. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:22:46,151][25689] Avg episode reward: [(0, '-46.451')] [2022-07-09 16:22:47,836][26022] Updated weights on worker 0-0, policy_version 326470 (0.00101) [2022-07-09 16:22:49,683][26022] Updated weights on worker 0-0, policy_version 326480 (0.00053) [2022-07-09 16:22:51,239][25689] Fps is (10 sec: 5833.3, 60 sec: 5695.9, 300 sec: 5701.4). Total num frames: 334324736. Throughput: 0: 5068.7. Samples: 334318682. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:22:51,240][25689] Avg episode reward: [(0, '-47.173')] [2022-07-09 16:22:51,464][26022] Updated weights on worker 0-0, policy_version 326490 (0.00085) [2022-07-09 16:22:53,257][26022] Updated weights on worker 0-0, policy_version 326500 (0.00086) [2022-07-09 16:22:55,118][26022] Updated weights on worker 0-0, policy_version 326510 (0.00092) [2022-07-09 16:22:56,298][25689] Fps is (10 sec: 5751.7, 60 sec: 5660.4, 300 sec: 5704.3). Total num frames: 334353408. Throughput: 0: 5936.5. Samples: 334353298. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:22:56,298][25689] Avg episode reward: [(0, '-47.242')] [2022-07-09 16:22:56,820][26022] Updated weights on worker 0-0, policy_version 326520 (0.00101) [2022-07-09 16:22:58,542][26022] Updated weights on worker 0-0, policy_version 326530 (0.00093) [2022-07-09 16:23:00,547][26022] Updated weights on worker 0-0, policy_version 326540 (0.00081) [2022-07-09 16:23:01,320][25689] Fps is (10 sec: 5688.5, 60 sec: 5694.3, 300 sec: 5707.4). Total num frames: 334382080. Throughput: 0: 5946.5. Samples: 334387808. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:01,320][25689] Avg episode reward: [(0, '-47.400')] [2022-07-09 16:23:01,991][26022] Updated weights on worker 0-0, policy_version 326550 (0.00083) [2022-07-09 16:23:04,400][26022] Updated weights on worker 0-0, policy_version 326560 (0.00090) [2022-07-09 16:23:05,995][26022] Updated weights on worker 0-0, policy_version 326570 (0.00096) [2022-07-09 16:23:06,335][25689] Fps is (10 sec: 5407.3, 60 sec: 5677.2, 300 sec: 5700.3). Total num frames: 334407680. Throughput: 0: 5008.2. Samples: 334402832. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:06,335][25689] Avg episode reward: [(0, '-48.003')] [2022-07-09 16:23:07,863][26022] Updated weights on worker 0-0, policy_version 326580 (0.00094) [2022-07-09 16:23:09,841][26022] Updated weights on worker 0-0, policy_version 326590 (0.00081) [2022-07-09 16:23:11,345][25689] Fps is (10 sec: 5516.0, 60 sec: 5680.8, 300 sec: 5704.9). Total num frames: 334437376. Throughput: 0: 5897.3. Samples: 334437250. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:11,345][25689] Avg episode reward: [(0, '-47.704')] [2022-07-09 16:23:11,493][26022] Updated weights on worker 0-0, policy_version 326600 (0.00086) [2022-07-09 16:23:12,940][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:23:12,955][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000326608_334446592.pth [2022-07-09 16:23:12,956][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000324599_332389376.pth [2022-07-09 16:23:13,304][26022] Updated weights on worker 0-0, policy_version 326610 (0.00098) [2022-07-09 16:23:14,960][26022] Updated weights on worker 0-0, policy_version 326620 (0.00091) [2022-07-09 16:23:16,386][25689] Fps is (10 sec: 5806.9, 60 sec: 5699.4, 300 sec: 5697.3). Total num frames: 334466048. Throughput: 0: 5896.6. Samples: 334471750. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:16,387][25689] Avg episode reward: [(0, '-47.611')] [2022-07-09 16:23:16,749][26022] Updated weights on worker 0-0, policy_version 326630 (0.00090) [2022-07-09 16:23:18,821][26022] Updated weights on worker 0-0, policy_version 326640 (0.00095) [2022-07-09 16:23:20,461][26022] Updated weights on worker 0-0, policy_version 326650 (0.00081) [2022-07-09 16:23:21,395][25689] Fps is (10 sec: 5502.0, 60 sec: 5683.1, 300 sec: 5694.7). Total num frames: 334492672. Throughput: 0: 5026.8. Samples: 334488720. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:21,395][25689] Avg episode reward: [(0, '-47.282')] [2022-07-09 16:23:22,236][26022] Updated weights on worker 0-0, policy_version 326660 (0.00084) [2022-07-09 16:23:24,056][26022] Updated weights on worker 0-0, policy_version 326670 (0.00087) [2022-07-09 16:23:25,736][26022] Updated weights on worker 0-0, policy_version 326680 (0.00093) [2022-07-09 16:23:26,399][25689] Fps is (10 sec: 5727.0, 60 sec: 5683.5, 300 sec: 5702.5). Total num frames: 334523392. Throughput: 0: 5991.5. Samples: 334523048. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:26,400][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 16:23:27,809][26022] Updated weights on worker 0-0, policy_version 326690 (0.00083) [2022-07-09 16:23:29,471][26022] Updated weights on worker 0-0, policy_version 326700 (0.00091) [2022-07-09 16:23:31,252][26022] Updated weights on worker 0-0, policy_version 326710 (0.00090) [2022-07-09 16:23:31,422][25689] Fps is (10 sec: 5820.8, 60 sec: 5664.8, 300 sec: 5696.2). Total num frames: 334551040. Throughput: 0: 5968.0. Samples: 334557074. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:31,423][25689] Avg episode reward: [(0, '-47.064')] [2022-07-09 16:23:33,176][26022] Updated weights on worker 0-0, policy_version 326720 (0.00088) [2022-07-09 16:23:34,885][26022] Updated weights on worker 0-0, policy_version 326730 (0.00089) [2022-07-09 16:23:36,512][25689] Fps is (10 sec: 5569.1, 60 sec: 5664.2, 300 sec: 5695.0). Total num frames: 334579712. Throughput: 0: 5076.3. Samples: 334573916. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:36,513][25689] Avg episode reward: [(0, '-47.149')] [2022-07-09 16:23:36,787][26022] Updated weights on worker 0-0, policy_version 326740 (0.00098) [2022-07-09 16:23:38,465][26022] Updated weights on worker 0-0, policy_version 326750 (0.00087) [2022-07-09 16:23:40,351][26022] Updated weights on worker 0-0, policy_version 326760 (0.00089) [2022-07-09 16:23:41,587][25689] Fps is (10 sec: 5742.4, 60 sec: 5684.3, 300 sec: 5697.6). Total num frames: 334609408. Throughput: 0: 5906.8. Samples: 334607990. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:41,588][25689] Avg episode reward: [(0, '-46.917')] [2022-07-09 16:23:41,940][26022] Updated weights on worker 0-0, policy_version 326770 (0.00089) [2022-07-09 16:23:43,979][26022] Updated weights on worker 0-0, policy_version 326780 (0.00088) [2022-07-09 16:23:45,787][26022] Updated weights on worker 0-0, policy_version 326790 (0.00088) [2022-07-09 16:23:46,612][25689] Fps is (10 sec: 5576.3, 60 sec: 5639.8, 300 sec: 5687.1). Total num frames: 334636032. Throughput: 0: 5887.1. Samples: 334642044. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:46,613][25689] Avg episode reward: [(0, '-47.301')] [2022-07-09 16:23:47,455][26022] Updated weights on worker 0-0, policy_version 326800 (0.00084) [2022-07-09 16:23:49,429][26022] Updated weights on worker 0-0, policy_version 326810 (0.00057) [2022-07-09 16:23:51,306][26022] Updated weights on worker 0-0, policy_version 326820 (0.00082) [2022-07-09 16:23:51,630][25689] Fps is (10 sec: 5505.5, 60 sec: 5629.4, 300 sec: 5684.3). Total num frames: 334664704. Throughput: 0: 5037.7. Samples: 334658876. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:51,631][25689] Avg episode reward: [(0, '-47.264')] [2022-07-09 16:23:52,995][26022] Updated weights on worker 0-0, policy_version 326830 (0.00086) [2022-07-09 16:23:55,057][26022] Updated weights on worker 0-0, policy_version 326840 (0.00090) [2022-07-09 16:23:56,622][26022] Updated weights on worker 0-0, policy_version 326850 (0.00095) [2022-07-09 16:23:56,732][25689] Fps is (10 sec: 5767.9, 60 sec: 5642.4, 300 sec: 5689.8). Total num frames: 334694400. Throughput: 0: 5861.0. Samples: 334692422. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:23:56,732][25689] Avg episode reward: [(0, '-47.416')] [2022-07-09 16:23:58,381][26022] Updated weights on worker 0-0, policy_version 326860 (0.00089) [2022-07-09 16:24:00,167][26022] Updated weights on worker 0-0, policy_version 326870 (0.00102) [2022-07-09 16:24:01,736][25689] Fps is (10 sec: 5775.7, 60 sec: 5644.0, 300 sec: 5693.9). Total num frames: 334723072. Throughput: 0: 5892.9. Samples: 334726730. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:01,737][25689] Avg episode reward: [(0, '-47.369')] [2022-07-09 16:24:02,480][26022] Updated weights on worker 0-0, policy_version 326880 (0.00086) [2022-07-09 16:24:04,289][26022] Updated weights on worker 0-0, policy_version 326890 (0.00093) [2022-07-09 16:24:06,056][26022] Updated weights on worker 0-0, policy_version 326900 (0.00082) [2022-07-09 16:24:06,744][25689] Fps is (10 sec: 5522.9, 60 sec: 5661.7, 300 sec: 5684.0). Total num frames: 334749696. Throughput: 0: 5800.0. Samples: 334758808. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:06,744][25689] Avg episode reward: [(0, '-47.719')] [2022-07-09 16:24:07,865][26022] Updated weights on worker 0-0, policy_version 326910 (0.00077) [2022-07-09 16:24:09,711][26022] Updated weights on worker 0-0, policy_version 326920 (0.00089) [2022-07-09 16:24:11,299][26022] Updated weights on worker 0-0, policy_version 326930 (0.00077) [2022-07-09 16:24:11,780][25689] Fps is (10 sec: 5403.5, 60 sec: 5625.3, 300 sec: 5681.4). Total num frames: 334777344. Throughput: 0: 5813.3. Samples: 334776012. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:11,781][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 16:24:13,300][26022] Updated weights on worker 0-0, policy_version 326940 (0.00095) [2022-07-09 16:24:15,044][26022] Updated weights on worker 0-0, policy_version 326950 (0.00083) [2022-07-09 16:24:16,743][26022] Updated weights on worker 0-0, policy_version 326960 (0.00094) [2022-07-09 16:24:16,834][25689] Fps is (10 sec: 5683.0, 60 sec: 5641.1, 300 sec: 5684.3). Total num frames: 334807040. Throughput: 0: 5878.5. Samples: 334810596. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:16,835][25689] Avg episode reward: [(0, '-47.937')] [2022-07-09 16:24:18,597][26022] Updated weights on worker 0-0, policy_version 326970 (0.00084) [2022-07-09 16:24:20,350][26022] Updated weights on worker 0-0, policy_version 326980 (0.00095) [2022-07-09 16:24:21,869][25689] Fps is (10 sec: 5684.0, 60 sec: 5655.6, 300 sec: 5677.1). Total num frames: 334834688. Throughput: 0: 5859.8. Samples: 334844702. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:21,869][25689] Avg episode reward: [(0, '-47.094')] [2022-07-09 16:24:22,123][26022] Updated weights on worker 0-0, policy_version 326990 (0.00087) [2022-07-09 16:24:23,851][26022] Updated weights on worker 0-0, policy_version 327000 (0.00095) [2022-07-09 16:24:25,609][26022] Updated weights on worker 0-0, policy_version 327010 (0.00085) [2022-07-09 16:24:26,946][25689] Fps is (10 sec: 5570.1, 60 sec: 5615.0, 300 sec: 5679.8). Total num frames: 334863360. Throughput: 0: 5096.0. Samples: 334861756. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:26,946][25689] Avg episode reward: [(0, '-47.512')] [2022-07-09 16:24:27,757][26022] Updated weights on worker 0-0, policy_version 327020 (0.00092) [2022-07-09 16:24:29,255][26022] Updated weights on worker 0-0, policy_version 327030 (0.00087) [2022-07-09 16:24:31,261][26022] Updated weights on worker 0-0, policy_version 327040 (0.00079) [2022-07-09 16:24:31,965][25689] Fps is (10 sec: 5781.1, 60 sec: 5649.2, 300 sec: 5683.7). Total num frames: 334893056. Throughput: 0: 5932.0. Samples: 334895750. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:31,966][25689] Avg episode reward: [(0, '-47.236')] [2022-07-09 16:24:33,058][26022] Updated weights on worker 0-0, policy_version 327050 (0.00087) [2022-07-09 16:24:34,870][26022] Updated weights on worker 0-0, policy_version 327060 (0.00098) [2022-07-09 16:24:36,871][26022] Updated weights on worker 0-0, policy_version 327070 (0.00096) [2022-07-09 16:24:37,057][25689] Fps is (10 sec: 5671.0, 60 sec: 5632.0, 300 sec: 5678.8). Total num frames: 334920704. Throughput: 0: 5884.9. Samples: 334929606. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:37,058][25689] Avg episode reward: [(0, '-47.901')] [2022-07-09 16:24:38,481][26022] Updated weights on worker 0-0, policy_version 327080 (0.00083) [2022-07-09 16:24:40,393][26022] Updated weights on worker 0-0, policy_version 327090 (0.00089) [2022-07-09 16:24:42,116][25689] Fps is (10 sec: 5548.4, 60 sec: 5616.6, 300 sec: 5671.3). Total num frames: 334949376. Throughput: 0: 5031.4. Samples: 334946572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:42,116][25689] Avg episode reward: [(0, '-47.534')] [2022-07-09 16:24:42,141][26022] Updated weights on worker 0-0, policy_version 327100 (0.00085) [2022-07-09 16:24:43,927][26022] Updated weights on worker 0-0, policy_version 327110 (0.00087) [2022-07-09 16:24:45,817][26022] Updated weights on worker 0-0, policy_version 327120 (0.00088) [2022-07-09 16:24:47,128][25689] Fps is (10 sec: 5694.4, 60 sec: 5651.7, 300 sec: 5678.2). Total num frames: 334978048. Throughput: 0: 5888.8. Samples: 334980604. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:47,129][25689] Avg episode reward: [(0, '-47.356')] [2022-07-09 16:24:47,410][26022] Updated weights on worker 0-0, policy_version 327130 (0.00095) [2022-07-09 16:24:49,427][26022] Updated weights on worker 0-0, policy_version 327140 (0.00085) [2022-07-09 16:24:50,966][26022] Updated weights on worker 0-0, policy_version 327150 (0.00091) [2022-07-09 16:24:52,156][25689] Fps is (10 sec: 5609.3, 60 sec: 5633.8, 300 sec: 5662.4). Total num frames: 335005696. Throughput: 0: 5894.6. Samples: 335014768. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:52,157][25689] Avg episode reward: [(0, '-49.076')] [2022-07-09 16:24:53,195][26022] Updated weights on worker 0-0, policy_version 327160 (0.00086) [2022-07-09 16:24:54,595][26022] Updated weights on worker 0-0, policy_version 327170 (0.00088) [2022-07-09 16:24:56,627][26022] Updated weights on worker 0-0, policy_version 327180 (0.00052) [2022-07-09 16:24:57,203][25689] Fps is (10 sec: 5895.0, 60 sec: 5672.8, 300 sec: 5682.9). Total num frames: 335037440. Throughput: 0: 5076.4. Samples: 335031872. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:24:57,203][25689] Avg episode reward: [(0, '-48.923')] [2022-07-09 16:24:58,413][26022] Updated weights on worker 0-0, policy_version 327190 (0.00091) [2022-07-09 16:25:00,025][26022] Updated weights on worker 0-0, policy_version 327200 (0.00087) [2022-07-09 16:25:02,267][25689] Fps is (10 sec: 5570.1, 60 sec: 5599.5, 300 sec: 5668.8). Total num frames: 335062016. Throughput: 0: 5938.6. Samples: 335066244. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:25:02,268][25689] Avg episode reward: [(0, '-47.599')] [2022-07-09 16:25:02,299][26022] Updated weights on worker 0-0, policy_version 327210 (0.00083) [2022-07-09 16:25:04,119][26022] Updated weights on worker 0-0, policy_version 327220 (0.00102) [2022-07-09 16:25:05,971][26022] Updated weights on worker 0-0, policy_version 327230 (0.00094) [2022-07-09 16:25:07,287][25689] Fps is (10 sec: 5280.4, 60 sec: 5632.3, 300 sec: 5668.5). Total num frames: 335090688. Throughput: 0: 5828.4. Samples: 335098098. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-09 16:25:07,288][25689] Avg episode reward: [(0, '-48.854')] [2022-07-09 16:25:07,775][26022] Updated weights on worker 0-0, policy_version 327240 (0.00087) [2022-07-09 16:25:09,476][26022] Updated weights on worker 0-0, policy_version 327250 (0.00096) [2022-07-09 16:25:11,180][26022] Updated weights on worker 0-0, policy_version 327260 (0.00094) [2022-07-09 16:25:12,315][25689] Fps is (10 sec: 5707.1, 60 sec: 5649.9, 300 sec: 5672.7). Total num frames: 335119360. Throughput: 0: 4990.1. Samples: 335115364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:12,317][25689] Avg episode reward: [(0, '-48.611')] [2022-07-09 16:25:12,967][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:25:12,980][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000327269_335123456.pth [2022-07-09 16:25:12,981][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000325271_333077504.pth [2022-07-09 16:25:13,236][26022] Updated weights on worker 0-0, policy_version 327270 (0.00087) [2022-07-09 16:25:14,845][26022] Updated weights on worker 0-0, policy_version 327280 (0.00092) [2022-07-09 16:25:16,712][26022] Updated weights on worker 0-0, policy_version 327290 (0.00089) [2022-07-09 16:25:17,436][25689] Fps is (10 sec: 5649.8, 60 sec: 5626.7, 300 sec: 5668.1). Total num frames: 335148032. Throughput: 0: 5815.0. Samples: 335149532. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:17,437][25689] Avg episode reward: [(0, '-48.518')] [2022-07-09 16:25:18,593][26022] Updated weights on worker 0-0, policy_version 327300 (0.00090) [2022-07-09 16:25:20,256][26022] Updated weights on worker 0-0, policy_version 327310 (0.00092) [2022-07-09 16:25:22,084][26022] Updated weights on worker 0-0, policy_version 327320 (0.00085) [2022-07-09 16:25:22,471][25689] Fps is (10 sec: 5747.3, 60 sec: 5660.5, 300 sec: 5672.5). Total num frames: 335177728. Throughput: 0: 5817.0. Samples: 335183768. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:22,471][25689] Avg episode reward: [(0, '-47.639')] [2022-07-09 16:25:23,972][26022] Updated weights on worker 0-0, policy_version 327330 (0.00087) [2022-07-09 16:25:25,619][26022] Updated weights on worker 0-0, policy_version 327340 (0.00089) [2022-07-09 16:25:27,511][25689] Fps is (10 sec: 5692.2, 60 sec: 5647.1, 300 sec: 5665.5). Total num frames: 335205376. Throughput: 0: 5092.5. Samples: 335201088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:27,511][25689] Avg episode reward: [(0, '-47.610')] [2022-07-09 16:25:27,518][26022] Updated weights on worker 0-0, policy_version 327350 (0.00089) [2022-07-09 16:25:29,147][26022] Updated weights on worker 0-0, policy_version 327360 (0.00085) [2022-07-09 16:25:31,135][26022] Updated weights on worker 0-0, policy_version 327370 (0.00088) [2022-07-09 16:25:32,523][25689] Fps is (10 sec: 5602.9, 60 sec: 5630.9, 300 sec: 5663.4). Total num frames: 335234048. Throughput: 0: 5937.2. Samples: 335235342. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:32,523][25689] Avg episode reward: [(0, '-47.933')] [2022-07-09 16:25:32,764][26022] Updated weights on worker 0-0, policy_version 327380 (0.00088) [2022-07-09 16:25:34,492][26022] Updated weights on worker 0-0, policy_version 327390 (0.00088) [2022-07-09 16:25:36,539][26022] Updated weights on worker 0-0, policy_version 327400 (0.00085) [2022-07-09 16:25:37,609][25689] Fps is (10 sec: 5982.5, 60 sec: 5699.0, 300 sec: 5672.4). Total num frames: 335265792. Throughput: 0: 5952.0. Samples: 335269602. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:37,610][25689] Avg episode reward: [(0, '-48.049')] [2022-07-09 16:25:38,024][26022] Updated weights on worker 0-0, policy_version 327410 (0.00089) [2022-07-09 16:25:40,010][26022] Updated weights on worker 0-0, policy_version 327420 (0.00088) [2022-07-09 16:25:41,737][26022] Updated weights on worker 0-0, policy_version 327430 (0.00088) [2022-07-09 16:25:42,657][25689] Fps is (10 sec: 5759.7, 60 sec: 5666.3, 300 sec: 5661.7). Total num frames: 335292416. Throughput: 0: 5107.5. Samples: 335286866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:42,659][25689] Avg episode reward: [(0, '-47.575')] [2022-07-09 16:25:43,604][26022] Updated weights on worker 0-0, policy_version 327440 (0.00090) [2022-07-09 16:25:45,384][26022] Updated weights on worker 0-0, policy_version 327450 (0.00265) [2022-07-09 16:25:47,016][26022] Updated weights on worker 0-0, policy_version 327460 (0.00101) [2022-07-09 16:25:47,676][25689] Fps is (10 sec: 5594.6, 60 sec: 5682.4, 300 sec: 5665.4). Total num frames: 335322112. Throughput: 0: 5968.3. Samples: 335321442. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:47,677][25689] Avg episode reward: [(0, '-47.235')] [2022-07-09 16:25:48,888][26022] Updated weights on worker 0-0, policy_version 327470 (0.00092) [2022-07-09 16:25:50,670][26022] Updated weights on worker 0-0, policy_version 327480 (0.00089) [2022-07-09 16:25:52,491][26022] Updated weights on worker 0-0, policy_version 327490 (0.00087) [2022-07-09 16:25:52,716][25689] Fps is (10 sec: 5802.2, 60 sec: 5698.3, 300 sec: 5667.2). Total num frames: 335350784. Throughput: 0: 5956.9. Samples: 335355632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:52,717][25689] Avg episode reward: [(0, '-46.901')] [2022-07-09 16:25:54,344][26022] Updated weights on worker 0-0, policy_version 327500 (0.00092) [2022-07-09 16:25:55,988][26022] Updated weights on worker 0-0, policy_version 327510 (0.00092) [2022-07-09 16:25:57,853][25689] Fps is (10 sec: 5634.6, 60 sec: 5639.1, 300 sec: 5665.6). Total num frames: 335379456. Throughput: 0: 5100.7. Samples: 335372860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:25:57,855][25689] Avg episode reward: [(0, '-47.062')] [2022-07-09 16:25:57,907][26022] Updated weights on worker 0-0, policy_version 327520 (0.00088) [2022-07-09 16:25:59,676][26022] Updated weights on worker 0-0, policy_version 327530 (0.00092) [2022-07-09 16:26:01,643][26022] Updated weights on worker 0-0, policy_version 327540 (0.00100) [2022-07-09 16:26:02,876][25689] Fps is (10 sec: 5442.6, 60 sec: 5676.8, 300 sec: 5663.4). Total num frames: 335406080. Throughput: 0: 5955.1. Samples: 335407276. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:02,877][25689] Avg episode reward: [(0, '-47.132')] [2022-07-09 16:26:03,637][26022] Updated weights on worker 0-0, policy_version 327550 (0.00099) [2022-07-09 16:26:05,463][26022] Updated weights on worker 0-0, policy_version 327560 (0.00087) [2022-07-09 16:26:07,275][26022] Updated weights on worker 0-0, policy_version 327570 (0.00085) [2022-07-09 16:26:07,896][25689] Fps is (10 sec: 5404.3, 60 sec: 5659.9, 300 sec: 5660.7). Total num frames: 335433728. Throughput: 0: 5815.6. Samples: 335439030. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:07,897][25689] Avg episode reward: [(0, '-47.371')] [2022-07-09 16:26:09,052][26022] Updated weights on worker 0-0, policy_version 327580 (0.00085) [2022-07-09 16:26:11,025][26022] Updated weights on worker 0-0, policy_version 327590 (0.00091) [2022-07-09 16:26:12,605][26022] Updated weights on worker 0-0, policy_version 327600 (0.00093) [2022-07-09 16:26:12,900][25689] Fps is (10 sec: 5720.6, 60 sec: 5679.0, 300 sec: 5665.2). Total num frames: 335463424. Throughput: 0: 4978.4. Samples: 335456116. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:12,902][25689] Avg episode reward: [(0, '-47.656')] [2022-07-09 16:26:14,586][26022] Updated weights on worker 0-0, policy_version 327610 (0.00086) [2022-07-09 16:26:16,346][26022] Updated weights on worker 0-0, policy_version 327620 (0.00091) [2022-07-09 16:26:17,966][25689] Fps is (10 sec: 5796.0, 60 sec: 5684.2, 300 sec: 5667.5). Total num frames: 335492096. Throughput: 0: 5844.8. Samples: 335490414. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:17,968][25689] Avg episode reward: [(0, '-48.036')] [2022-07-09 16:26:18,156][26022] Updated weights on worker 0-0, policy_version 327630 (0.00093) [2022-07-09 16:26:19,891][26022] Updated weights on worker 0-0, policy_version 327640 (0.00085) [2022-07-09 16:26:21,644][26022] Updated weights on worker 0-0, policy_version 327650 (0.00095) [2022-07-09 16:26:22,984][25689] Fps is (10 sec: 5585.2, 60 sec: 5651.9, 300 sec: 5660.4). Total num frames: 335519744. Throughput: 0: 5828.6. Samples: 335524476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:22,985][25689] Avg episode reward: [(0, '-48.376')] [2022-07-09 16:26:23,588][26022] Updated weights on worker 0-0, policy_version 327660 (0.00096) [2022-07-09 16:26:25,319][26022] Updated weights on worker 0-0, policy_version 327670 (0.00091) [2022-07-09 16:26:26,984][26022] Updated weights on worker 0-0, policy_version 327680 (0.00106) [2022-07-09 16:26:28,007][25689] Fps is (10 sec: 5711.2, 60 sec: 5687.4, 300 sec: 5666.9). Total num frames: 335549440. Throughput: 0: 5102.3. Samples: 335541642. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:28,008][25689] Avg episode reward: [(0, '-48.181')] [2022-07-09 16:26:28,992][26022] Updated weights on worker 0-0, policy_version 327690 (0.00088) [2022-07-09 16:26:30,639][26022] Updated weights on worker 0-0, policy_version 327700 (0.00079) [2022-07-09 16:26:32,505][26022] Updated weights on worker 0-0, policy_version 327710 (0.00089) [2022-07-09 16:26:33,041][25689] Fps is (10 sec: 5702.1, 60 sec: 5668.4, 300 sec: 5660.4). Total num frames: 335577088. Throughput: 0: 5940.6. Samples: 335575762. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:33,042][25689] Avg episode reward: [(0, '-47.175')] [2022-07-09 16:26:34,203][26022] Updated weights on worker 0-0, policy_version 327720 (0.00082) [2022-07-09 16:26:36,147][26022] Updated weights on worker 0-0, policy_version 327730 (0.00089) [2022-07-09 16:26:37,836][26022] Updated weights on worker 0-0, policy_version 327740 (0.00093) [2022-07-09 16:26:38,098][25689] Fps is (10 sec: 5682.6, 60 sec: 5637.3, 300 sec: 5662.8). Total num frames: 335606784. Throughput: 0: 5929.3. Samples: 335609780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:38,098][25689] Avg episode reward: [(0, '-47.783')] [2022-07-09 16:26:39,809][26022] Updated weights on worker 0-0, policy_version 327750 (0.00082) [2022-07-09 16:26:41,595][26022] Updated weights on worker 0-0, policy_version 327760 (0.00093) [2022-07-09 16:26:43,199][25689] Fps is (10 sec: 5645.1, 60 sec: 5649.2, 300 sec: 5650.8). Total num frames: 335634432. Throughput: 0: 5914.2. Samples: 335644030. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:43,199][25689] Avg episode reward: [(0, '-47.648')] [2022-07-09 16:26:43,318][26022] Updated weights on worker 0-0, policy_version 327770 (0.00091) [2022-07-09 16:26:45,138][26022] Updated weights on worker 0-0, policy_version 327780 (0.00087) [2022-07-09 16:26:46,943][26022] Updated weights on worker 0-0, policy_version 327790 (0.00080) [2022-07-09 16:26:48,212][25689] Fps is (10 sec: 5568.5, 60 sec: 5632.9, 300 sec: 5657.6). Total num frames: 335663104. Throughput: 0: 5917.6. Samples: 335661208. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:48,212][25689] Avg episode reward: [(0, '-48.370')] [2022-07-09 16:26:48,762][26022] Updated weights on worker 0-0, policy_version 327800 (0.00094) [2022-07-09 16:26:50,483][26022] Updated weights on worker 0-0, policy_version 327810 (0.00088) [2022-07-09 16:26:52,397][26022] Updated weights on worker 0-0, policy_version 327820 (0.00093) [2022-07-09 16:26:53,225][25689] Fps is (10 sec: 5719.3, 60 sec: 5635.4, 300 sec: 5651.2). Total num frames: 335691776. Throughput: 0: 5938.0. Samples: 335695618. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:53,226][25689] Avg episode reward: [(0, '-47.894')] [2022-07-09 16:26:54,131][26022] Updated weights on worker 0-0, policy_version 327830 (0.00092) [2022-07-09 16:26:55,928][26022] Updated weights on worker 0-0, policy_version 327840 (0.00094) [2022-07-09 16:26:57,706][26022] Updated weights on worker 0-0, policy_version 327850 (0.00088) [2022-07-09 16:26:58,285][25689] Fps is (10 sec: 5692.9, 60 sec: 5642.6, 300 sec: 5657.4). Total num frames: 335720448. Throughput: 0: 5934.0. Samples: 335729568. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:26:58,285][25689] Avg episode reward: [(0, '-48.462')] [2022-07-09 16:26:59,435][26022] Updated weights on worker 0-0, policy_version 327860 (0.00093) [2022-07-09 16:27:01,564][26022] Updated weights on worker 0-0, policy_version 327870 (0.00084) [2022-07-09 16:27:03,297][25689] Fps is (10 sec: 5490.1, 60 sec: 5643.6, 300 sec: 5657.4). Total num frames: 335747072. Throughput: 0: 5101.6. Samples: 335746564. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:03,300][25689] Avg episode reward: [(0, '-48.638')] [2022-07-09 16:27:03,516][26022] Updated weights on worker 0-0, policy_version 327880 (0.00090) [2022-07-09 16:27:05,402][26022] Updated weights on worker 0-0, policy_version 327890 (0.00087) [2022-07-09 16:27:06,991][26022] Updated weights on worker 0-0, policy_version 327900 (0.00087) [2022-07-09 16:27:08,319][25689] Fps is (10 sec: 5408.8, 60 sec: 5643.4, 300 sec: 5651.0). Total num frames: 335774720. Throughput: 0: 5838.1. Samples: 335778594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:08,320][25689] Avg episode reward: [(0, '-48.450')] [2022-07-09 16:27:09,042][26022] Updated weights on worker 0-0, policy_version 327910 (0.00088) [2022-07-09 16:27:10,683][26022] Updated weights on worker 0-0, policy_version 327920 (0.00085) [2022-07-09 16:27:12,606][26022] Updated weights on worker 0-0, policy_version 327930 (0.00091) [2022-07-09 16:27:13,115][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:27:13,128][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000327934_335804416.pth [2022-07-09 16:27:13,129][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000325941_333763584.pth [2022-07-09 16:27:13,322][25689] Fps is (10 sec: 5720.1, 60 sec: 5643.5, 300 sec: 5659.0). Total num frames: 335804416. Throughput: 0: 5826.8. Samples: 335812720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:13,324][25689] Avg episode reward: [(0, '-47.841')] [2022-07-09 16:27:14,356][26022] Updated weights on worker 0-0, policy_version 327940 (0.00092) [2022-07-09 16:27:16,159][26022] Updated weights on worker 0-0, policy_version 327950 (0.00088) [2022-07-09 16:27:18,039][26022] Updated weights on worker 0-0, policy_version 327960 (0.00084) [2022-07-09 16:27:18,405][25689] Fps is (10 sec: 5685.5, 60 sec: 5625.0, 300 sec: 5657.7). Total num frames: 335832064. Throughput: 0: 4975.7. Samples: 335829680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:18,407][25689] Avg episode reward: [(0, '-48.362')] [2022-07-09 16:27:19,623][26022] Updated weights on worker 0-0, policy_version 327970 (0.00085) [2022-07-09 16:27:21,689][26022] Updated weights on worker 0-0, policy_version 327980 (0.00097) [2022-07-09 16:27:23,339][26022] Updated weights on worker 0-0, policy_version 327990 (0.00091) [2022-07-09 16:27:23,422][25689] Fps is (10 sec: 5677.9, 60 sec: 5659.0, 300 sec: 5654.1). Total num frames: 335861760. Throughput: 0: 5838.7. Samples: 335864064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:23,422][25689] Avg episode reward: [(0, '-48.006')] [2022-07-09 16:27:25,145][26022] Updated weights on worker 0-0, policy_version 328000 (0.00085) [2022-07-09 16:27:26,967][26022] Updated weights on worker 0-0, policy_version 328010 (0.00088) [2022-07-09 16:27:28,442][25689] Fps is (10 sec: 5917.0, 60 sec: 5659.2, 300 sec: 5657.2). Total num frames: 335891456. Throughput: 0: 5958.5. Samples: 335898500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:28,443][25689] Avg episode reward: [(0, '-48.168')] [2022-07-09 16:27:28,689][26022] Updated weights on worker 0-0, policy_version 328020 (0.00095) [2022-07-09 16:27:30,537][26022] Updated weights on worker 0-0, policy_version 328030 (0.00080) [2022-07-09 16:27:32,115][26022] Updated weights on worker 0-0, policy_version 328040 (0.00085) [2022-07-09 16:27:33,466][25689] Fps is (10 sec: 5709.3, 60 sec: 5660.2, 300 sec: 5654.9). Total num frames: 335919104. Throughput: 0: 5118.7. Samples: 335915828. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:33,467][25689] Avg episode reward: [(0, '-48.128')] [2022-07-09 16:27:33,995][26022] Updated weights on worker 0-0, policy_version 328050 (0.00088) [2022-07-09 16:27:35,904][26022] Updated weights on worker 0-0, policy_version 328060 (0.00090) [2022-07-09 16:27:37,697][26022] Updated weights on worker 0-0, policy_version 328070 (0.00107) [2022-07-09 16:27:38,524][25689] Fps is (10 sec: 5586.6, 60 sec: 5643.2, 300 sec: 5655.9). Total num frames: 335947776. Throughput: 0: 5963.9. Samples: 335949666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-09 16:27:38,524][25689] Avg episode reward: [(0, '-47.595')] [2022-07-09 16:27:39,664][26022] Updated weights on worker 0-0, policy_version 328080 (0.00080) [2022-07-09 16:27:41,209][26022] Updated weights on worker 0-0, policy_version 328090 (0.00086) [2022-07-09 16:27:43,112][26022] Updated weights on worker 0-0, policy_version 328100 (0.00084) [2022-07-09 16:27:43,555][25689] Fps is (10 sec: 5785.1, 60 sec: 5683.6, 300 sec: 5657.0). Total num frames: 335977472. Throughput: 0: 5940.4. Samples: 335983664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:27:43,556][25689] Avg episode reward: [(0, '-47.114')] [2022-07-09 16:27:45,220][26022] Updated weights on worker 0-0, policy_version 328110 (0.00088) [2022-07-09 16:27:46,719][26022] Updated weights on worker 0-0, policy_version 328120 (0.00082) [2022-07-09 16:27:48,629][25689] Fps is (10 sec: 5573.1, 60 sec: 5643.9, 300 sec: 5647.0). Total num frames: 336004096. Throughput: 0: 5069.5. Samples: 336000838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:27:48,630][25689] Avg episode reward: [(0, '-46.426')] [2022-07-09 16:27:48,656][26022] Updated weights on worker 0-0, policy_version 328130 (0.00091) [2022-07-09 16:27:50,264][26022] Updated weights on worker 0-0, policy_version 328140 (0.00108) [2022-07-09 16:27:52,041][26022] Updated weights on worker 0-0, policy_version 328150 (0.00085) [2022-07-09 16:27:53,633][25689] Fps is (10 sec: 5588.4, 60 sec: 5661.8, 300 sec: 5651.5). Total num frames: 336033792. Throughput: 0: 5910.3. Samples: 336035024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:27:53,634][25689] Avg episode reward: [(0, '-45.759')] [2022-07-09 16:27:53,878][26022] Updated weights on worker 0-0, policy_version 328160 (0.00089) [2022-07-09 16:27:55,693][26022] Updated weights on worker 0-0, policy_version 328170 (0.00092) [2022-07-09 16:27:57,458][26022] Updated weights on worker 0-0, policy_version 328180 (0.00089) [2022-07-09 16:27:58,684][25689] Fps is (10 sec: 5703.1, 60 sec: 5645.6, 300 sec: 5647.5). Total num frames: 336061440. Throughput: 0: 5921.4. Samples: 336069044. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:27:58,684][25689] Avg episode reward: [(0, '-45.738')] [2022-07-09 16:27:59,495][26022] Updated weights on worker 0-0, policy_version 328190 (0.00090) [2022-07-09 16:28:01,147][26022] Updated weights on worker 0-0, policy_version 328200 (0.00092) [2022-07-09 16:28:03,338][26022] Updated weights on worker 0-0, policy_version 328210 (0.00087) [2022-07-09 16:28:03,710][25689] Fps is (10 sec: 5487.5, 60 sec: 5661.4, 300 sec: 5654.2). Total num frames: 336089088. Throughput: 0: 5085.6. Samples: 336086162. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:03,710][25689] Avg episode reward: [(0, '-46.566')] [2022-07-09 16:28:05,211][26022] Updated weights on worker 0-0, policy_version 328220 (0.00087) [2022-07-09 16:28:06,950][26022] Updated weights on worker 0-0, policy_version 328230 (0.00091) [2022-07-09 16:28:08,715][25689] Fps is (10 sec: 5512.7, 60 sec: 5662.9, 300 sec: 5647.4). Total num frames: 336116736. Throughput: 0: 5844.1. Samples: 336118220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:08,715][25689] Avg episode reward: [(0, '-46.587')] [2022-07-09 16:28:08,963][26022] Updated weights on worker 0-0, policy_version 328240 (0.00083) [2022-07-09 16:28:10,508][26022] Updated weights on worker 0-0, policy_version 328250 (0.00092) [2022-07-09 16:28:12,346][26022] Updated weights on worker 0-0, policy_version 328260 (0.00087) [2022-07-09 16:28:13,724][25689] Fps is (10 sec: 5623.8, 60 sec: 5645.4, 300 sec: 5648.0). Total num frames: 336145408. Throughput: 0: 5839.2. Samples: 336152340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:13,725][25689] Avg episode reward: [(0, '-46.996')] [2022-07-09 16:28:14,066][26022] Updated weights on worker 0-0, policy_version 328270 (0.00086) [2022-07-09 16:28:15,927][26022] Updated weights on worker 0-0, policy_version 328280 (0.00091) [2022-07-09 16:28:17,763][26022] Updated weights on worker 0-0, policy_version 328290 (0.00096) [2022-07-09 16:28:18,784][25689] Fps is (10 sec: 5694.7, 60 sec: 5664.4, 300 sec: 5653.9). Total num frames: 336174080. Throughput: 0: 4989.9. Samples: 336169342. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:18,785][25689] Avg episode reward: [(0, '-48.135')] [2022-07-09 16:28:19,507][26022] Updated weights on worker 0-0, policy_version 328300 (0.00086) [2022-07-09 16:28:21,338][26022] Updated weights on worker 0-0, policy_version 328310 (0.00089) [2022-07-09 16:28:22,982][26022] Updated weights on worker 0-0, policy_version 328320 (0.00069) [2022-07-09 16:28:23,802][25689] Fps is (10 sec: 5588.8, 60 sec: 5630.5, 300 sec: 5643.3). Total num frames: 336201728. Throughput: 0: 5842.9. Samples: 336203558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:23,803][25689] Avg episode reward: [(0, '-48.615')] [2022-07-09 16:28:24,944][26022] Updated weights on worker 0-0, policy_version 328330 (0.00091) [2022-07-09 16:28:26,767][26022] Updated weights on worker 0-0, policy_version 328340 (0.00093) [2022-07-09 16:28:28,521][26022] Updated weights on worker 0-0, policy_version 328350 (0.00086) [2022-07-09 16:28:28,828][25689] Fps is (10 sec: 5709.5, 60 sec: 5630.0, 300 sec: 5650.1). Total num frames: 336231424. Throughput: 0: 5943.5. Samples: 336237764. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:28,829][25689] Avg episode reward: [(0, '-47.971')] [2022-07-09 16:28:30,304][26022] Updated weights on worker 0-0, policy_version 328360 (0.00090) [2022-07-09 16:28:32,308][26022] Updated weights on worker 0-0, policy_version 328370 (0.00094) [2022-07-09 16:28:33,930][25689] Fps is (10 sec: 5762.8, 60 sec: 5639.6, 300 sec: 5649.9). Total num frames: 336260096. Throughput: 0: 5060.9. Samples: 336254600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:33,930][25689] Avg episode reward: [(0, '-48.481')] [2022-07-09 16:28:34,098][26022] Updated weights on worker 0-0, policy_version 328380 (0.00091) [2022-07-09 16:28:35,915][26022] Updated weights on worker 0-0, policy_version 328390 (0.00094) [2022-07-09 16:28:37,579][26022] Updated weights on worker 0-0, policy_version 328400 (0.00092) [2022-07-09 16:28:39,034][25689] Fps is (10 sec: 5618.7, 60 sec: 5635.3, 300 sec: 5645.9). Total num frames: 336288768. Throughput: 0: 5896.8. Samples: 336288750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:39,036][25689] Avg episode reward: [(0, '-48.951')] [2022-07-09 16:28:39,410][26022] Updated weights on worker 0-0, policy_version 328410 (0.00085) [2022-07-09 16:28:41,183][26022] Updated weights on worker 0-0, policy_version 328420 (0.00086) [2022-07-09 16:28:42,910][26022] Updated weights on worker 0-0, policy_version 328430 (0.00092) [2022-07-09 16:28:44,045][25689] Fps is (10 sec: 5770.3, 60 sec: 5637.1, 300 sec: 5656.5). Total num frames: 336318464. Throughput: 0: 5909.4. Samples: 336323186. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:44,047][25689] Avg episode reward: [(0, '-49.383')] [2022-07-09 16:28:44,852][26022] Updated weights on worker 0-0, policy_version 328440 (0.00086) [2022-07-09 16:28:46,541][26022] Updated weights on worker 0-0, policy_version 328450 (0.00082) [2022-07-09 16:28:48,450][26022] Updated weights on worker 0-0, policy_version 328460 (0.00118) [2022-07-09 16:28:49,100][25689] Fps is (10 sec: 5798.2, 60 sec: 5672.8, 300 sec: 5655.8). Total num frames: 336347136. Throughput: 0: 5059.6. Samples: 336340334. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:49,101][25689] Avg episode reward: [(0, '-49.707')] [2022-07-09 16:28:50,292][26022] Updated weights on worker 0-0, policy_version 328470 (0.00092) [2022-07-09 16:28:51,901][26022] Updated weights on worker 0-0, policy_version 328480 (0.00083) [2022-07-09 16:28:53,891][26022] Updated weights on worker 0-0, policy_version 328490 (0.00091) [2022-07-09 16:28:54,188][25689] Fps is (10 sec: 5552.5, 60 sec: 5631.1, 300 sec: 5649.2). Total num frames: 336374784. Throughput: 0: 5892.9. Samples: 336373980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:54,190][25689] Avg episode reward: [(0, '-49.766')] [2022-07-09 16:28:55,607][26022] Updated weights on worker 0-0, policy_version 328500 (0.00084) [2022-07-09 16:28:57,602][26022] Updated weights on worker 0-0, policy_version 328510 (0.00084) [2022-07-09 16:28:59,264][25689] Fps is (10 sec: 5641.9, 60 sec: 5662.6, 300 sec: 5651.3). Total num frames: 336404480. Throughput: 0: 5894.0. Samples: 336407988. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:28:59,265][25689] Avg episode reward: [(0, '-49.696')] [2022-07-09 16:28:59,271][26022] Updated weights on worker 0-0, policy_version 328520 (0.00091) [2022-07-09 16:29:01,239][26022] Updated weights on worker 0-0, policy_version 328530 (0.00086) [2022-07-09 16:29:03,265][26022] Updated weights on worker 0-0, policy_version 328540 (0.00085) [2022-07-09 16:29:04,279][25689] Fps is (10 sec: 5479.7, 60 sec: 5629.7, 300 sec: 5647.7). Total num frames: 336430080. Throughput: 0: 5036.1. Samples: 336425088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:04,280][25689] Avg episode reward: [(0, '-49.664')] [2022-07-09 16:29:05,294][26022] Updated weights on worker 0-0, policy_version 328550 (0.00085) [2022-07-09 16:29:06,744][26022] Updated weights on worker 0-0, policy_version 328560 (0.00089) [2022-07-09 16:29:09,022][26022] Updated weights on worker 0-0, policy_version 328570 (0.00082) [2022-07-09 16:29:09,314][25689] Fps is (10 sec: 5400.4, 60 sec: 5643.9, 300 sec: 5651.1). Total num frames: 336458752. Throughput: 0: 5761.2. Samples: 336456788. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:09,314][25689] Avg episode reward: [(0, '-48.209')] [2022-07-09 16:29:10,464][26022] Updated weights on worker 0-0, policy_version 328580 (0.00087) [2022-07-09 16:29:12,385][26022] Updated weights on worker 0-0, policy_version 328590 (0.00086) [2022-07-09 16:29:13,298][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:29:13,312][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000328595_336481280.pth [2022-07-09 16:29:13,312][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000326608_334446592.pth [2022-07-09 16:29:14,172][26022] Updated weights on worker 0-0, policy_version 328600 (0.00086) [2022-07-09 16:29:14,321][25689] Fps is (10 sec: 5608.5, 60 sec: 5627.2, 300 sec: 5645.2). Total num frames: 336486400. Throughput: 0: 5797.0. Samples: 336490692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:14,322][25689] Avg episode reward: [(0, '-48.213')] [2022-07-09 16:29:16,036][26022] Updated weights on worker 0-0, policy_version 328610 (0.00088) [2022-07-09 16:29:17,934][26022] Updated weights on worker 0-0, policy_version 328620 (0.00093) [2022-07-09 16:29:19,377][25689] Fps is (10 sec: 5596.4, 60 sec: 5627.6, 300 sec: 5648.2). Total num frames: 336515072. Throughput: 0: 5795.6. Samples: 336524556. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:19,378][25689] Avg episode reward: [(0, '-47.247')] [2022-07-09 16:29:19,768][26022] Updated weights on worker 0-0, policy_version 328630 (0.00090) [2022-07-09 16:29:21,323][26022] Updated weights on worker 0-0, policy_version 328640 (0.00094) [2022-07-09 16:29:23,559][26022] Updated weights on worker 0-0, policy_version 328650 (0.00085) [2022-07-09 16:29:24,407][25689] Fps is (10 sec: 5787.4, 60 sec: 5660.2, 300 sec: 5652.5). Total num frames: 336544768. Throughput: 0: 5786.7. Samples: 336541558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:24,407][25689] Avg episode reward: [(0, '-47.248')] [2022-07-09 16:29:24,867][26022] Updated weights on worker 0-0, policy_version 328660 (0.00084) [2022-07-09 16:29:27,207][26022] Updated weights on worker 0-0, policy_version 328670 (0.00087) [2022-07-09 16:29:28,460][26022] Updated weights on worker 0-0, policy_version 328680 (0.00086) [2022-07-09 16:29:29,420][25689] Fps is (10 sec: 5506.0, 60 sec: 5593.8, 300 sec: 5638.9). Total num frames: 336570368. Throughput: 0: 5898.7. Samples: 336575390. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:29,422][25689] Avg episode reward: [(0, '-46.265')] [2022-07-09 16:29:30,647][26022] Updated weights on worker 0-0, policy_version 328690 (0.00087) [2022-07-09 16:29:32,463][26022] Updated weights on worker 0-0, policy_version 328700 (0.00084) [2022-07-09 16:29:34,247][26022] Updated weights on worker 0-0, policy_version 328710 (0.00085) [2022-07-09 16:29:34,423][25689] Fps is (10 sec: 5520.9, 60 sec: 5620.0, 300 sec: 5647.5). Total num frames: 336600064. Throughput: 0: 5923.0. Samples: 336609752. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:34,423][25689] Avg episode reward: [(0, '-47.011')] [2022-07-09 16:29:35,790][26022] Updated weights on worker 0-0, policy_version 328720 (0.00085) [2022-07-09 16:29:37,934][26022] Updated weights on worker 0-0, policy_version 328730 (0.00092) [2022-07-09 16:29:39,418][26022] Updated weights on worker 0-0, policy_version 328740 (0.00082) [2022-07-09 16:29:39,475][25689] Fps is (10 sec: 5907.0, 60 sec: 5641.7, 300 sec: 5651.0). Total num frames: 336629760. Throughput: 0: 5088.7. Samples: 336626824. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:39,475][25689] Avg episode reward: [(0, '-47.048')] [2022-07-09 16:29:41,448][26022] Updated weights on worker 0-0, policy_version 328750 (0.00086) [2022-07-09 16:29:42,948][26022] Updated weights on worker 0-0, policy_version 328760 (0.00080) [2022-07-09 16:29:44,486][25689] Fps is (10 sec: 5596.2, 60 sec: 5590.9, 300 sec: 5644.1). Total num frames: 336656384. Throughput: 0: 5938.6. Samples: 336660804. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:44,487][25689] Avg episode reward: [(0, '-46.908')] [2022-07-09 16:29:45,014][26022] Updated weights on worker 0-0, policy_version 328770 (0.00093) [2022-07-09 16:29:46,943][26022] Updated weights on worker 0-0, policy_version 328780 (0.00086) [2022-07-09 16:29:48,651][26022] Updated weights on worker 0-0, policy_version 328790 (0.00102) [2022-07-09 16:29:49,498][25689] Fps is (10 sec: 5618.8, 60 sec: 5611.9, 300 sec: 5651.4). Total num frames: 336686080. Throughput: 0: 5932.3. Samples: 336694498. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:49,499][25689] Avg episode reward: [(0, '-47.155')] [2022-07-09 16:29:50,466][26022] Updated weights on worker 0-0, policy_version 328800 (0.00085) [2022-07-09 16:29:52,320][26022] Updated weights on worker 0-0, policy_version 328810 (0.00087) [2022-07-09 16:29:54,003][26022] Updated weights on worker 0-0, policy_version 328820 (0.00087) [2022-07-09 16:29:54,506][25689] Fps is (10 sec: 5722.9, 60 sec: 5619.3, 300 sec: 5638.3). Total num frames: 336713728. Throughput: 0: 5070.6. Samples: 336711590. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:54,507][25689] Avg episode reward: [(0, '-48.880')] [2022-07-09 16:29:56,055][26022] Updated weights on worker 0-0, policy_version 328830 (0.00078) [2022-07-09 16:29:57,644][26022] Updated weights on worker 0-0, policy_version 328840 (0.00094) [2022-07-09 16:29:59,603][25689] Fps is (10 sec: 5573.8, 60 sec: 5600.4, 300 sec: 5651.5). Total num frames: 336742400. Throughput: 0: 5897.5. Samples: 336745528. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:29:59,603][25689] Avg episode reward: [(0, '-48.203')] [2022-07-09 16:29:59,604][26022] Updated weights on worker 0-0, policy_version 328850 (0.00120) [2022-07-09 16:30:01,364][26022] Updated weights on worker 0-0, policy_version 328860 (0.00085) [2022-07-09 16:30:03,550][26022] Updated weights on worker 0-0, policy_version 328870 (0.00094) [2022-07-09 16:30:04,654][25689] Fps is (10 sec: 5449.1, 60 sec: 5614.0, 300 sec: 5644.0). Total num frames: 336769024. Throughput: 0: 5791.3. Samples: 336777602. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:04,655][25689] Avg episode reward: [(0, '-47.523')] [2022-07-09 16:30:05,141][26022] Updated weights on worker 0-0, policy_version 328880 (0.00096) [2022-07-09 16:30:07,147][26022] Updated weights on worker 0-0, policy_version 328890 (0.00080) [2022-07-09 16:30:08,819][26022] Updated weights on worker 0-0, policy_version 328900 (0.00088) [2022-07-09 16:30:09,684][25689] Fps is (10 sec: 5383.7, 60 sec: 5597.5, 300 sec: 5640.5). Total num frames: 336796672. Throughput: 0: 4955.3. Samples: 336794522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:09,685][25689] Avg episode reward: [(0, '-48.643')] [2022-07-09 16:30:10,874][26022] Updated weights on worker 0-0, policy_version 328910 (0.00089) [2022-07-09 16:30:12,451][26022] Updated weights on worker 0-0, policy_version 328920 (0.00088) [2022-07-09 16:30:14,379][26022] Updated weights on worker 0-0, policy_version 328930 (0.00084) [2022-07-09 16:30:14,687][25689] Fps is (10 sec: 5613.7, 60 sec: 5614.9, 300 sec: 5642.8). Total num frames: 336825344. Throughput: 0: 5801.2. Samples: 336828660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:14,689][25689] Avg episode reward: [(0, '-49.149')] [2022-07-09 16:30:15,968][26022] Updated weights on worker 0-0, policy_version 328940 (0.00078) [2022-07-09 16:30:18,052][26022] Updated weights on worker 0-0, policy_version 328950 (0.00095) [2022-07-09 16:30:19,707][26022] Updated weights on worker 0-0, policy_version 328960 (0.00081) [2022-07-09 16:30:19,769][25689] Fps is (10 sec: 5787.6, 60 sec: 5629.4, 300 sec: 5641.9). Total num frames: 336855040. Throughput: 0: 5805.9. Samples: 336862610. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:19,769][25689] Avg episode reward: [(0, '-48.127')] [2022-07-09 16:30:21,635][26022] Updated weights on worker 0-0, policy_version 328970 (0.00086) [2022-07-09 16:30:23,359][26022] Updated weights on worker 0-0, policy_version 328980 (0.00093) [2022-07-09 16:30:24,818][25689] Fps is (10 sec: 5761.5, 60 sec: 5610.6, 300 sec: 5645.1). Total num frames: 336883712. Throughput: 0: 5055.0. Samples: 336879528. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:24,818][25689] Avg episode reward: [(0, '-48.349')] [2022-07-09 16:30:25,041][26022] Updated weights on worker 0-0, policy_version 328990 (0.00082) [2022-07-09 16:30:27,037][26022] Updated weights on worker 0-0, policy_version 329000 (0.00089) [2022-07-09 16:30:28,769][26022] Updated weights on worker 0-0, policy_version 329010 (0.00095) [2022-07-09 16:30:29,847][25689] Fps is (10 sec: 5588.2, 60 sec: 5643.1, 300 sec: 5641.4). Total num frames: 336911360. Throughput: 0: 5907.8. Samples: 336913644. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:29,847][25689] Avg episode reward: [(0, '-48.458')] [2022-07-09 16:30:30,555][26022] Updated weights on worker 0-0, policy_version 329020 (0.00084) [2022-07-09 16:30:32,465][26022] Updated weights on worker 0-0, policy_version 329030 (0.00052) [2022-07-09 16:30:34,048][26022] Updated weights on worker 0-0, policy_version 329040 (0.00096) [2022-07-09 16:30:34,863][25689] Fps is (10 sec: 5606.6, 60 sec: 5624.9, 300 sec: 5632.4). Total num frames: 336940032. Throughput: 0: 5907.2. Samples: 336947844. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:34,863][25689] Avg episode reward: [(0, '-49.625')] [2022-07-09 16:30:36,033][26022] Updated weights on worker 0-0, policy_version 329050 (0.00079) [2022-07-09 16:30:37,935][26022] Updated weights on worker 0-0, policy_version 329060 (0.00092) [2022-07-09 16:30:39,526][26022] Updated weights on worker 0-0, policy_version 329070 (0.00087) [2022-07-09 16:30:39,966][25689] Fps is (10 sec: 5768.3, 60 sec: 5620.1, 300 sec: 5641.6). Total num frames: 336969728. Throughput: 0: 5072.3. Samples: 336965056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:39,966][25689] Avg episode reward: [(0, '-48.370')] [2022-07-09 16:30:41,472][26022] Updated weights on worker 0-0, policy_version 329080 (0.00080) [2022-07-09 16:30:43,149][26022] Updated weights on worker 0-0, policy_version 329090 (0.00088) [2022-07-09 16:30:44,908][26022] Updated weights on worker 0-0, policy_version 329100 (0.00085) [2022-07-09 16:30:45,041][25689] Fps is (10 sec: 5734.4, 60 sec: 5648.0, 300 sec: 5637.1). Total num frames: 336998400. Throughput: 0: 5929.7. Samples: 336999448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:45,042][25689] Avg episode reward: [(0, '-48.073')] [2022-07-09 16:30:46,712][26022] Updated weights on worker 0-0, policy_version 329110 (0.00093) [2022-07-09 16:30:48,359][26022] Updated weights on worker 0-0, policy_version 329120 (0.00092) [2022-07-09 16:30:50,126][25689] Fps is (10 sec: 5643.7, 60 sec: 5624.3, 300 sec: 5636.3). Total num frames: 337027072. Throughput: 0: 5928.7. Samples: 337033874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:50,127][25689] Avg episode reward: [(0, '-49.313')] [2022-07-09 16:30:50,295][26022] Updated weights on worker 0-0, policy_version 329130 (0.00086) [2022-07-09 16:30:52,157][26022] Updated weights on worker 0-0, policy_version 329140 (0.00092) [2022-07-09 16:30:53,808][26022] Updated weights on worker 0-0, policy_version 329150 (0.00092) [2022-07-09 16:30:55,185][25689] Fps is (10 sec: 5753.8, 60 sec: 5653.4, 300 sec: 5641.2). Total num frames: 337056768. Throughput: 0: 5072.8. Samples: 337050938. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:30:55,186][25689] Avg episode reward: [(0, '-49.336')] [2022-07-09 16:30:55,730][26022] Updated weights on worker 0-0, policy_version 329160 (0.00091) [2022-07-09 16:30:57,532][26022] Updated weights on worker 0-0, policy_version 329170 (0.00096) [2022-07-09 16:30:59,335][26022] Updated weights on worker 0-0, policy_version 329180 (0.00094) [2022-07-09 16:31:00,287][25689] Fps is (10 sec: 5845.5, 60 sec: 5669.8, 300 sec: 5650.0). Total num frames: 337086464. Throughput: 0: 5921.4. Samples: 337085384. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:00,287][25689] Avg episode reward: [(0, '-48.203')] [2022-07-09 16:31:00,968][26022] Updated weights on worker 0-0, policy_version 329190 (0.00085) [2022-07-09 16:31:03,258][26022] Updated weights on worker 0-0, policy_version 329200 (0.00085) [2022-07-09 16:31:05,000][26022] Updated weights on worker 0-0, policy_version 329210 (0.00090) [2022-07-09 16:31:05,297][25689] Fps is (10 sec: 5569.7, 60 sec: 5673.6, 300 sec: 5646.8). Total num frames: 337113088. Throughput: 0: 5825.6. Samples: 337117452. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:05,298][25689] Avg episode reward: [(0, '-47.888')] [2022-07-09 16:31:06,886][26022] Updated weights on worker 0-0, policy_version 329220 (0.00086) [2022-07-09 16:31:08,656][26022] Updated weights on worker 0-0, policy_version 329230 (0.00087) [2022-07-09 16:31:10,323][25689] Fps is (10 sec: 5407.7, 60 sec: 5673.9, 300 sec: 5639.5). Total num frames: 337140736. Throughput: 0: 4982.1. Samples: 337134494. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:10,323][25689] Avg episode reward: [(0, '-48.107')] [2022-07-09 16:31:10,458][26022] Updated weights on worker 0-0, policy_version 329240 (0.00089) [2022-07-09 16:31:12,119][26022] Updated weights on worker 0-0, policy_version 329250 (0.00081) [2022-07-09 16:31:13,577][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:31:13,595][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000329257_337159168.pth [2022-07-09 16:31:13,595][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000327269_335123456.pth [2022-07-09 16:31:14,134][26022] Updated weights on worker 0-0, policy_version 329260 (0.00081) [2022-07-09 16:31:15,346][25689] Fps is (10 sec: 5706.4, 60 sec: 5688.9, 300 sec: 5643.7). Total num frames: 337170432. Throughput: 0: 5852.0. Samples: 337168920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:15,352][25689] Avg episode reward: [(0, '-47.806')] [2022-07-09 16:31:15,755][26022] Updated weights on worker 0-0, policy_version 329270 (0.00085) [2022-07-09 16:31:17,739][26022] Updated weights on worker 0-0, policy_version 329280 (0.00098) [2022-07-09 16:31:19,380][26022] Updated weights on worker 0-0, policy_version 329290 (0.00090) [2022-07-09 16:31:20,387][25689] Fps is (10 sec: 5697.9, 60 sec: 5659.0, 300 sec: 5643.3). Total num frames: 337198080. Throughput: 0: 5851.1. Samples: 337202992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:20,388][25689] Avg episode reward: [(0, '-47.707')] [2022-07-09 16:31:21,160][26022] Updated weights on worker 0-0, policy_version 329300 (0.00090) [2022-07-09 16:31:22,978][26022] Updated weights on worker 0-0, policy_version 329310 (0.00088) [2022-07-09 16:31:24,976][26022] Updated weights on worker 0-0, policy_version 329320 (0.00084) [2022-07-09 16:31:25,419][25689] Fps is (10 sec: 5591.6, 60 sec: 5660.6, 300 sec: 5639.7). Total num frames: 337226752. Throughput: 0: 5102.8. Samples: 337220126. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:25,419][25689] Avg episode reward: [(0, '-47.188')] [2022-07-09 16:31:26,544][26022] Updated weights on worker 0-0, policy_version 329330 (0.00097) [2022-07-09 16:31:28,507][26022] Updated weights on worker 0-0, policy_version 329340 (0.00086) [2022-07-09 16:31:30,259][26022] Updated weights on worker 0-0, policy_version 329350 (0.00090) [2022-07-09 16:31:30,439][25689] Fps is (10 sec: 5602.9, 60 sec: 5661.5, 300 sec: 5640.0). Total num frames: 337254400. Throughput: 0: 5940.5. Samples: 337253994. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:30,439][25689] Avg episode reward: [(0, '-49.185')] [2022-07-09 16:31:32,075][26022] Updated weights on worker 0-0, policy_version 329360 (0.00086) [2022-07-09 16:31:33,992][26022] Updated weights on worker 0-0, policy_version 329370 (0.00093) [2022-07-09 16:31:35,442][25689] Fps is (10 sec: 5516.7, 60 sec: 5645.8, 300 sec: 5634.1). Total num frames: 337282048. Throughput: 0: 5919.1. Samples: 337287868. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:35,443][25689] Avg episode reward: [(0, '-48.896')] [2022-07-09 16:31:35,756][26022] Updated weights on worker 0-0, policy_version 329380 (0.00078) [2022-07-09 16:31:37,551][26022] Updated weights on worker 0-0, policy_version 329390 (0.00088) [2022-07-09 16:31:39,306][26022] Updated weights on worker 0-0, policy_version 329400 (0.00088) [2022-07-09 16:31:40,488][25689] Fps is (10 sec: 5502.4, 60 sec: 5617.2, 300 sec: 5635.1). Total num frames: 337309696. Throughput: 0: 5065.4. Samples: 337304814. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:40,489][25689] Avg episode reward: [(0, '-49.280')] [2022-07-09 16:31:41,171][26022] Updated weights on worker 0-0, policy_version 329410 (0.00085) [2022-07-09 16:31:42,789][26022] Updated weights on worker 0-0, policy_version 329420 (0.00088) [2022-07-09 16:31:44,804][26022] Updated weights on worker 0-0, policy_version 329430 (0.00093) [2022-07-09 16:31:45,494][25689] Fps is (10 sec: 5704.8, 60 sec: 5640.7, 300 sec: 5638.7). Total num frames: 337339392. Throughput: 0: 5920.7. Samples: 337338984. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:45,494][25689] Avg episode reward: [(0, '-48.442')] [2022-07-09 16:31:46,427][26022] Updated weights on worker 0-0, policy_version 329440 (0.00086) [2022-07-09 16:31:48,494][26022] Updated weights on worker 0-0, policy_version 329450 (0.00088) [2022-07-09 16:31:50,062][26022] Updated weights on worker 0-0, policy_version 329460 (0.00091) [2022-07-09 16:31:50,497][25689] Fps is (10 sec: 5933.8, 60 sec: 5665.2, 300 sec: 5642.3). Total num frames: 337369088. Throughput: 0: 5944.0. Samples: 337373222. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:50,498][25689] Avg episode reward: [(0, '-48.939')] [2022-07-09 16:31:52,092][26022] Updated weights on worker 0-0, policy_version 329470 (0.00088) [2022-07-09 16:31:53,502][26022] Updated weights on worker 0-0, policy_version 329480 (0.00051) [2022-07-09 16:31:55,523][25689] Fps is (10 sec: 5615.8, 60 sec: 5617.5, 300 sec: 5636.1). Total num frames: 337395712. Throughput: 0: 5103.9. Samples: 337390358. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:31:55,523][25689] Avg episode reward: [(0, '-48.790')] [2022-07-09 16:31:55,741][26022] Updated weights on worker 0-0, policy_version 329490 (0.00084) [2022-07-09 16:31:57,315][26022] Updated weights on worker 0-0, policy_version 329500 (0.00093) [2022-07-09 16:31:59,317][26022] Updated weights on worker 0-0, policy_version 329510 (0.00081) [2022-07-09 16:32:00,612][25689] Fps is (10 sec: 5568.5, 60 sec: 5618.6, 300 sec: 5645.0). Total num frames: 337425408. Throughput: 0: 5939.0. Samples: 337424326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:00,612][25689] Avg episode reward: [(0, '-48.442')] [2022-07-09 16:32:01,095][26022] Updated weights on worker 0-0, policy_version 329520 (0.00086) [2022-07-09 16:32:03,251][26022] Updated weights on worker 0-0, policy_version 329530 (0.00087) [2022-07-09 16:32:05,146][26022] Updated weights on worker 0-0, policy_version 329540 (0.00088) [2022-07-09 16:32:05,633][25689] Fps is (10 sec: 5469.1, 60 sec: 5600.6, 300 sec: 5638.1). Total num frames: 337451008. Throughput: 0: 5817.3. Samples: 337456142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:05,634][25689] Avg episode reward: [(0, '-48.274')] [2022-07-09 16:32:06,843][26022] Updated weights on worker 0-0, policy_version 329550 (0.00087) [2022-07-09 16:32:08,759][26022] Updated weights on worker 0-0, policy_version 329560 (0.00083) [2022-07-09 16:32:10,536][26022] Updated weights on worker 0-0, policy_version 329570 (0.00088) [2022-07-09 16:32:10,644][25689] Fps is (10 sec: 5511.8, 60 sec: 5636.0, 300 sec: 5638.0). Total num frames: 337480704. Throughput: 0: 4963.2. Samples: 337473212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:10,644][25689] Avg episode reward: [(0, '-48.950')] [2022-07-09 16:32:12,164][26022] Updated weights on worker 0-0, policy_version 329580 (0.00084) [2022-07-09 16:32:14,155][26022] Updated weights on worker 0-0, policy_version 329590 (0.00082) [2022-07-09 16:32:15,661][25689] Fps is (10 sec: 5820.7, 60 sec: 5619.6, 300 sec: 5642.6). Total num frames: 337509376. Throughput: 0: 5811.6. Samples: 337507394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:15,662][25689] Avg episode reward: [(0, '-48.608')] [2022-07-09 16:32:15,739][26022] Updated weights on worker 0-0, policy_version 329600 (0.00091) [2022-07-09 16:32:17,777][26022] Updated weights on worker 0-0, policy_version 329610 (0.00091) [2022-07-09 16:32:19,270][26022] Updated weights on worker 0-0, policy_version 329620 (0.00090) [2022-07-09 16:32:20,788][25689] Fps is (10 sec: 5552.2, 60 sec: 5611.6, 300 sec: 5633.7). Total num frames: 337537024. Throughput: 0: 5819.9. Samples: 337541748. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:20,788][25689] Avg episode reward: [(0, '-48.876')] [2022-07-09 16:32:21,431][26022] Updated weights on worker 0-0, policy_version 329630 (0.00084) [2022-07-09 16:32:22,860][26022] Updated weights on worker 0-0, policy_version 329640 (0.00090) [2022-07-09 16:32:24,957][26022] Updated weights on worker 0-0, policy_version 329650 (0.00098) [2022-07-09 16:32:25,830][25689] Fps is (10 sec: 5638.9, 60 sec: 5627.5, 300 sec: 5633.3). Total num frames: 337566720. Throughput: 0: 5931.6. Samples: 337575942. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:25,831][25689] Avg episode reward: [(0, '-49.786')] [2022-07-09 16:32:26,785][26022] Updated weights on worker 0-0, policy_version 329660 (0.00093) [2022-07-09 16:32:28,506][26022] Updated weights on worker 0-0, policy_version 329670 (0.00098) [2022-07-09 16:32:30,238][26022] Updated weights on worker 0-0, policy_version 329680 (0.00095) [2022-07-09 16:32:30,917][25689] Fps is (10 sec: 5863.4, 60 sec: 5655.2, 300 sec: 5638.9). Total num frames: 337596416. Throughput: 0: 5909.0. Samples: 337593006. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:30,917][25689] Avg episode reward: [(0, '-48.409')] [2022-07-09 16:32:32,222][26022] Updated weights on worker 0-0, policy_version 329690 (0.00083) [2022-07-09 16:32:33,644][26022] Updated weights on worker 0-0, policy_version 329700 (0.00088) [2022-07-09 16:32:35,912][26022] Updated weights on worker 0-0, policy_version 329710 (0.00093) [2022-07-09 16:32:36,003][25689] Fps is (10 sec: 5536.4, 60 sec: 5630.6, 300 sec: 5631.5). Total num frames: 337623040. Throughput: 0: 5877.5. Samples: 337626954. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:36,003][25689] Avg episode reward: [(0, '-48.268')] [2022-07-09 16:32:37,250][26022] Updated weights on worker 0-0, policy_version 329720 (0.00087) [2022-07-09 16:32:39,429][26022] Updated weights on worker 0-0, policy_version 329730 (0.00094) [2022-07-09 16:32:41,043][26022] Updated weights on worker 0-0, policy_version 329740 (0.00094) [2022-07-09 16:32:41,059][25689] Fps is (10 sec: 5654.3, 60 sec: 5680.4, 300 sec: 5634.5). Total num frames: 337653760. Throughput: 0: 5884.0. Samples: 337661022. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 16:32:41,059][25689] Avg episode reward: [(0, '-48.851')] [2022-07-09 16:32:43,046][26022] Updated weights on worker 0-0, policy_version 329750 (0.00082) [2022-07-09 16:32:44,871][26022] Updated weights on worker 0-0, policy_version 329760 (0.00100) [2022-07-09 16:32:46,101][25689] Fps is (10 sec: 5780.4, 60 sec: 5643.2, 300 sec: 5638.6). Total num frames: 337681408. Throughput: 0: 5038.2. Samples: 337678068. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:32:46,101][25689] Avg episode reward: [(0, '-48.499')] [2022-07-09 16:32:46,556][26022] Updated weights on worker 0-0, policy_version 329770 (0.00086) [2022-07-09 16:32:48,309][26022] Updated weights on worker 0-0, policy_version 329780 (0.00091) [2022-07-09 16:32:50,067][26022] Updated weights on worker 0-0, policy_version 329790 (0.00091) [2022-07-09 16:32:51,122][25689] Fps is (10 sec: 5495.1, 60 sec: 5607.8, 300 sec: 5631.4). Total num frames: 337709056. Throughput: 0: 5894.4. Samples: 337712100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:32:51,122][25689] Avg episode reward: [(0, '-48.551')] [2022-07-09 16:32:51,951][26022] Updated weights on worker 0-0, policy_version 329800 (0.00089) [2022-07-09 16:32:53,745][26022] Updated weights on worker 0-0, policy_version 329810 (0.00773) [2022-07-09 16:32:55,475][26022] Updated weights on worker 0-0, policy_version 329820 (0.00081) [2022-07-09 16:32:56,137][25689] Fps is (10 sec: 5713.5, 60 sec: 5659.4, 300 sec: 5638.9). Total num frames: 337738752. Throughput: 0: 5936.9. Samples: 337746488. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:32:56,138][25689] Avg episode reward: [(0, '-48.168')] [2022-07-09 16:32:57,413][26022] Updated weights on worker 0-0, policy_version 329830 (0.00089) [2022-07-09 16:32:59,085][26022] Updated weights on worker 0-0, policy_version 329840 (0.00087) [2022-07-09 16:33:00,991][26022] Updated weights on worker 0-0, policy_version 329850 (0.00088) [2022-07-09 16:33:01,257][25689] Fps is (10 sec: 5758.9, 60 sec: 5639.6, 300 sec: 5640.6). Total num frames: 337767424. Throughput: 0: 5078.6. Samples: 337763600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:01,257][25689] Avg episode reward: [(0, '-48.918')] [2022-07-09 16:33:03,190][26022] Updated weights on worker 0-0, policy_version 329860 (0.00092) [2022-07-09 16:33:04,819][26022] Updated weights on worker 0-0, policy_version 329870 (0.00086) [2022-07-09 16:33:06,346][25689] Fps is (10 sec: 5416.7, 60 sec: 5650.3, 300 sec: 5635.5). Total num frames: 337794048. Throughput: 0: 5794.3. Samples: 337795374. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:06,348][25689] Avg episode reward: [(0, '-48.363')] [2022-07-09 16:33:06,874][26022] Updated weights on worker 0-0, policy_version 329880 (0.00086) [2022-07-09 16:33:08,415][26022] Updated weights on worker 0-0, policy_version 329890 (0.00093) [2022-07-09 16:33:10,477][26022] Updated weights on worker 0-0, policy_version 329900 (0.00083) [2022-07-09 16:33:11,365][25689] Fps is (10 sec: 5571.9, 60 sec: 5649.5, 300 sec: 5638.8). Total num frames: 337823744. Throughput: 0: 5794.6. Samples: 337829400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:11,366][25689] Avg episode reward: [(0, '-48.379')] [2022-07-09 16:33:11,959][26022] Updated weights on worker 0-0, policy_version 329910 (0.00083) [2022-07-09 16:33:13,697][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:33:13,706][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000329918_337836032.pth [2022-07-09 16:33:13,706][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000327934_335804416.pth [2022-07-09 16:33:13,914][26022] Updated weights on worker 0-0, policy_version 329920 (0.00092) [2022-07-09 16:33:15,722][26022] Updated weights on worker 0-0, policy_version 329930 (0.00090) [2022-07-09 16:33:16,403][25689] Fps is (10 sec: 5701.7, 60 sec: 5630.6, 300 sec: 5635.8). Total num frames: 337851392. Throughput: 0: 4948.5. Samples: 337846774. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:16,403][25689] Avg episode reward: [(0, '-48.439')] [2022-07-09 16:33:17,501][26022] Updated weights on worker 0-0, policy_version 329940 (0.00496) [2022-07-09 16:33:19,343][26022] Updated weights on worker 0-0, policy_version 329950 (0.00091) [2022-07-09 16:33:21,170][26022] Updated weights on worker 0-0, policy_version 329960 (0.00091) [2022-07-09 16:33:21,451][25689] Fps is (10 sec: 5583.8, 60 sec: 5654.8, 300 sec: 5638.6). Total num frames: 337880064. Throughput: 0: 5818.0. Samples: 337881090. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:21,453][25689] Avg episode reward: [(0, '-47.812')] [2022-07-09 16:33:22,879][26022] Updated weights on worker 0-0, policy_version 329970 (0.00082) [2022-07-09 16:33:24,794][26022] Updated weights on worker 0-0, policy_version 329980 (0.00087) [2022-07-09 16:33:26,290][26022] Updated weights on worker 0-0, policy_version 329990 (0.00083) [2022-07-09 16:33:26,470][25689] Fps is (10 sec: 5798.1, 60 sec: 5657.0, 300 sec: 5638.8). Total num frames: 337909760. Throughput: 0: 5955.9. Samples: 337915232. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:26,471][25689] Avg episode reward: [(0, '-47.327')] [2022-07-09 16:33:28,287][26022] Updated weights on worker 0-0, policy_version 330000 (0.00089) [2022-07-09 16:33:30,120][26022] Updated weights on worker 0-0, policy_version 330010 (0.00091) [2022-07-09 16:33:31,552][25689] Fps is (10 sec: 5575.9, 60 sec: 5606.8, 300 sec: 5632.3). Total num frames: 337936384. Throughput: 0: 5093.6. Samples: 337932224. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:31,552][25689] Avg episode reward: [(0, '-46.355')] [2022-07-09 16:33:32,012][26022] Updated weights on worker 0-0, policy_version 330020 (0.00096) [2022-07-09 16:33:33,719][26022] Updated weights on worker 0-0, policy_version 330030 (0.00086) [2022-07-09 16:33:35,717][26022] Updated weights on worker 0-0, policy_version 330040 (0.00084) [2022-07-09 16:33:36,588][25689] Fps is (10 sec: 5566.2, 60 sec: 5662.1, 300 sec: 5637.0). Total num frames: 337966080. Throughput: 0: 5912.1. Samples: 337966110. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:36,588][25689] Avg episode reward: [(0, '-46.386')] [2022-07-09 16:33:37,154][26022] Updated weights on worker 0-0, policy_version 330050 (0.00087) [2022-07-09 16:33:39,277][26022] Updated weights on worker 0-0, policy_version 330060 (0.00090) [2022-07-09 16:33:40,937][26022] Updated weights on worker 0-0, policy_version 330070 (0.00096) [2022-07-09 16:33:41,651][25689] Fps is (10 sec: 5779.6, 60 sec: 5627.7, 300 sec: 5632.6). Total num frames: 337994752. Throughput: 0: 5901.9. Samples: 338000306. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:41,651][25689] Avg episode reward: [(0, '-46.259')] [2022-07-09 16:33:42,826][26022] Updated weights on worker 0-0, policy_version 330080 (0.00093) [2022-07-09 16:33:44,664][26022] Updated weights on worker 0-0, policy_version 330090 (0.00088) [2022-07-09 16:33:46,333][26022] Updated weights on worker 0-0, policy_version 330100 (0.00083) [2022-07-09 16:33:46,682][25689] Fps is (10 sec: 5681.0, 60 sec: 5645.6, 300 sec: 5633.0). Total num frames: 338023424. Throughput: 0: 5057.1. Samples: 338017448. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:46,683][25689] Avg episode reward: [(0, '-46.006')] [2022-07-09 16:33:48,224][26022] Updated weights on worker 0-0, policy_version 330110 (0.00092) [2022-07-09 16:33:49,932][26022] Updated weights on worker 0-0, policy_version 330120 (0.00080) [2022-07-09 16:33:51,712][25689] Fps is (10 sec: 5699.2, 60 sec: 5661.6, 300 sec: 5637.6). Total num frames: 338052096. Throughput: 0: 5925.0. Samples: 338051676. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:51,713][25689] Avg episode reward: [(0, '-46.791')] [2022-07-09 16:33:51,943][26022] Updated weights on worker 0-0, policy_version 330130 (0.00090) [2022-07-09 16:33:53,636][26022] Updated weights on worker 0-0, policy_version 330140 (0.00084) [2022-07-09 16:33:55,574][26022] Updated weights on worker 0-0, policy_version 330150 (0.00085) [2022-07-09 16:33:56,720][25689] Fps is (10 sec: 5712.5, 60 sec: 5645.4, 300 sec: 5635.4). Total num frames: 338080768. Throughput: 0: 5936.2. Samples: 338085620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:33:56,721][25689] Avg episode reward: [(0, '-47.277')] [2022-07-09 16:33:57,218][26022] Updated weights on worker 0-0, policy_version 330160 (0.00086) [2022-07-09 16:33:59,242][26022] Updated weights on worker 0-0, policy_version 330170 (0.00084) [2022-07-09 16:34:00,847][26022] Updated weights on worker 0-0, policy_version 330180 (0.00093) [2022-07-09 16:34:01,769][25689] Fps is (10 sec: 5600.4, 60 sec: 5635.1, 300 sec: 5641.7). Total num frames: 338108416. Throughput: 0: 5075.1. Samples: 338102408. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:01,770][25689] Avg episode reward: [(0, '-47.270')] [2022-07-09 16:34:03,291][26022] Updated weights on worker 0-0, policy_version 330190 (0.00094) [2022-07-09 16:34:04,823][26022] Updated weights on worker 0-0, policy_version 330200 (0.00089) [2022-07-09 16:34:06,772][25689] Fps is (10 sec: 5195.3, 60 sec: 5609.2, 300 sec: 5628.5). Total num frames: 338132992. Throughput: 0: 5792.0. Samples: 338133812. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:06,773][25689] Avg episode reward: [(0, '-47.171')] [2022-07-09 16:34:06,970][26022] Updated weights on worker 0-0, policy_version 330210 (0.00093) [2022-07-09 16:34:08,525][26022] Updated weights on worker 0-0, policy_version 330220 (0.00094) [2022-07-09 16:34:10,532][26022] Updated weights on worker 0-0, policy_version 330230 (0.00099) [2022-07-09 16:34:11,793][25689] Fps is (10 sec: 5516.1, 60 sec: 5625.9, 300 sec: 5638.6). Total num frames: 338163712. Throughput: 0: 5776.7. Samples: 338167678. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:11,794][25689] Avg episode reward: [(0, '-46.987')] [2022-07-09 16:34:12,162][26022] Updated weights on worker 0-0, policy_version 330240 (0.00090) [2022-07-09 16:34:14,069][26022] Updated weights on worker 0-0, policy_version 330250 (0.00094) [2022-07-09 16:34:15,851][26022] Updated weights on worker 0-0, policy_version 330260 (0.00087) [2022-07-09 16:34:16,810][25689] Fps is (10 sec: 5712.9, 60 sec: 5611.0, 300 sec: 5632.4). Total num frames: 338190336. Throughput: 0: 4939.3. Samples: 338184850. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:16,811][25689] Avg episode reward: [(0, '-47.428')] [2022-07-09 16:34:17,615][26022] Updated weights on worker 0-0, policy_version 330270 (0.00086) [2022-07-09 16:34:19,430][26022] Updated weights on worker 0-0, policy_version 330280 (0.00364) [2022-07-09 16:34:21,366][26022] Updated weights on worker 0-0, policy_version 330290 (0.00090) [2022-07-09 16:34:21,846][25689] Fps is (10 sec: 5500.5, 60 sec: 5612.1, 300 sec: 5628.8). Total num frames: 338219008. Throughput: 0: 5804.8. Samples: 338218954. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:21,847][25689] Avg episode reward: [(0, '-47.499')] [2022-07-09 16:34:22,969][26022] Updated weights on worker 0-0, policy_version 330300 (0.00090) [2022-07-09 16:34:24,875][26022] Updated weights on worker 0-0, policy_version 330310 (0.00084) [2022-07-09 16:34:26,608][26022] Updated weights on worker 0-0, policy_version 330320 (0.00090) [2022-07-09 16:34:26,848][25689] Fps is (10 sec: 5814.7, 60 sec: 5613.7, 300 sec: 5642.8). Total num frames: 338248704. Throughput: 0: 5946.2. Samples: 338253186. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:26,849][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 16:34:28,465][26022] Updated weights on worker 0-0, policy_version 330330 (0.00085) [2022-07-09 16:34:30,336][26022] Updated weights on worker 0-0, policy_version 330340 (0.00099) [2022-07-09 16:34:31,855][25689] Fps is (10 sec: 5729.4, 60 sec: 5637.6, 300 sec: 5635.9). Total num frames: 338276352. Throughput: 0: 5107.1. Samples: 338270134. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:31,856][25689] Avg episode reward: [(0, '-47.409')] [2022-07-09 16:34:32,130][26022] Updated weights on worker 0-0, policy_version 330350 (0.00093) [2022-07-09 16:34:34,116][26022] Updated weights on worker 0-0, policy_version 330360 (0.00087) [2022-07-09 16:34:35,762][26022] Updated weights on worker 0-0, policy_version 330370 (0.00086) [2022-07-09 16:34:36,868][25689] Fps is (10 sec: 5723.1, 60 sec: 5639.8, 300 sec: 5636.6). Total num frames: 338306048. Throughput: 0: 5943.4. Samples: 338304062. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:36,868][25689] Avg episode reward: [(0, '-48.560')] [2022-07-09 16:34:37,751][26022] Updated weights on worker 0-0, policy_version 330380 (0.00093) [2022-07-09 16:34:39,342][26022] Updated weights on worker 0-0, policy_version 330390 (0.00085) [2022-07-09 16:34:41,256][26022] Updated weights on worker 0-0, policy_version 330400 (0.00095) [2022-07-09 16:34:41,970][25689] Fps is (10 sec: 5568.0, 60 sec: 5602.2, 300 sec: 5634.9). Total num frames: 338332672. Throughput: 0: 5929.5. Samples: 338338276. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:41,970][25689] Avg episode reward: [(0, '-47.757')] [2022-07-09 16:34:42,873][26022] Updated weights on worker 0-0, policy_version 330410 (0.00542) [2022-07-09 16:34:44,987][26022] Updated weights on worker 0-0, policy_version 330420 (0.00094) [2022-07-09 16:34:46,505][26022] Updated weights on worker 0-0, policy_version 330430 (0.00091) [2022-07-09 16:34:46,973][25689] Fps is (10 sec: 5573.7, 60 sec: 5621.8, 300 sec: 5635.1). Total num frames: 338362368. Throughput: 0: 5079.5. Samples: 338355408. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:46,973][25689] Avg episode reward: [(0, '-47.486')] [2022-07-09 16:34:48,392][26022] Updated weights on worker 0-0, policy_version 330440 (0.00088) [2022-07-09 16:34:50,129][26022] Updated weights on worker 0-0, policy_version 330450 (0.00095) [2022-07-09 16:34:51,986][25689] Fps is (10 sec: 5827.5, 60 sec: 5623.4, 300 sec: 5638.4). Total num frames: 338391040. Throughput: 0: 5939.6. Samples: 338389704. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:51,988][25689] Avg episode reward: [(0, '-47.287')] [2022-07-09 16:34:51,990][26022] Updated weights on worker 0-0, policy_version 330460 (0.00095) [2022-07-09 16:34:53,682][26022] Updated weights on worker 0-0, policy_version 330470 (0.00084) [2022-07-09 16:34:55,664][26022] Updated weights on worker 0-0, policy_version 330480 (0.00088) [2022-07-09 16:34:57,014][25689] Fps is (10 sec: 5710.7, 60 sec: 5621.5, 300 sec: 5639.7). Total num frames: 338419712. Throughput: 0: 5950.4. Samples: 338423940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:34:57,015][25689] Avg episode reward: [(0, '-47.574')] [2022-07-09 16:34:57,282][26022] Updated weights on worker 0-0, policy_version 330490 (0.00089) [2022-07-09 16:34:59,211][26022] Updated weights on worker 0-0, policy_version 330500 (0.00089) [2022-07-09 16:35:01,097][26022] Updated weights on worker 0-0, policy_version 330510 (0.00091) [2022-07-09 16:35:02,115][25689] Fps is (10 sec: 5358.2, 60 sec: 5582.8, 300 sec: 5635.3). Total num frames: 338445312. Throughput: 0: 5844.6. Samples: 338456014. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:35:02,115][25689] Avg episode reward: [(0, '-47.046')] [2022-07-09 16:35:03,090][26022] Updated weights on worker 0-0, policy_version 330520 (0.00099) [2022-07-09 16:35:05,279][26022] Updated weights on worker 0-0, policy_version 330530 (0.00094) [2022-07-09 16:35:06,679][26022] Updated weights on worker 0-0, policy_version 330540 (0.00085) [2022-07-09 16:35:07,129][25689] Fps is (10 sec: 5567.7, 60 sec: 5683.5, 300 sec: 5645.9). Total num frames: 338476032. Throughput: 0: 5811.2. Samples: 338472544. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:35:07,130][25689] Avg episode reward: [(0, '-47.050')] [2022-07-09 16:35:08,742][26022] Updated weights on worker 0-0, policy_version 330550 (0.00094) [2022-07-09 16:35:10,276][26022] Updated weights on worker 0-0, policy_version 330560 (0.00087) [2022-07-09 16:35:12,152][25689] Fps is (10 sec: 5815.3, 60 sec: 5632.5, 300 sec: 5642.1). Total num frames: 338503680. Throughput: 0: 5799.4. Samples: 338506654. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-09 16:35:12,153][25689] Avg episode reward: [(0, '-47.476')] [2022-07-09 16:35:12,154][26022] Updated weights on worker 0-0, policy_version 330570 (0.00088) [2022-07-09 16:35:13,934][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:35:13,946][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000330579_338512896.pth [2022-07-09 16:35:13,946][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000328595_336481280.pth [2022-07-09 16:35:14,098][26022] Updated weights on worker 0-0, policy_version 330580 (0.00086) [2022-07-09 16:35:15,711][26022] Updated weights on worker 0-0, policy_version 330590 (0.00085) [2022-07-09 16:35:17,156][25689] Fps is (10 sec: 5515.1, 60 sec: 5650.6, 300 sec: 5636.7). Total num frames: 338531328. Throughput: 0: 5806.0. Samples: 338540882. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:17,156][25689] Avg episode reward: [(0, '-48.027')] [2022-07-09 16:35:17,622][26022] Updated weights on worker 0-0, policy_version 330600 (0.00095) [2022-07-09 16:35:19,693][26022] Updated weights on worker 0-0, policy_version 330610 (0.00089) [2022-07-09 16:35:21,138][26022] Updated weights on worker 0-0, policy_version 330620 (0.00087) [2022-07-09 16:35:22,310][25689] Fps is (10 sec: 5544.3, 60 sec: 5639.6, 300 sec: 5634.7). Total num frames: 338560000. Throughput: 0: 5042.9. Samples: 338557856. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:22,310][25689] Avg episode reward: [(0, '-47.586')] [2022-07-09 16:35:23,272][26022] Updated weights on worker 0-0, policy_version 330630 (0.00090) [2022-07-09 16:35:24,761][26022] Updated weights on worker 0-0, policy_version 330640 (0.00092) [2022-07-09 16:35:26,832][26022] Updated weights on worker 0-0, policy_version 330650 (0.00091) [2022-07-09 16:35:27,354][25689] Fps is (10 sec: 5723.1, 60 sec: 5635.6, 300 sec: 5641.3). Total num frames: 338589696. Throughput: 0: 5899.4. Samples: 338591860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:27,355][25689] Avg episode reward: [(0, '-47.335')] [2022-07-09 16:35:28,562][26022] Updated weights on worker 0-0, policy_version 330660 (0.00083) [2022-07-09 16:35:30,268][26022] Updated weights on worker 0-0, policy_version 330670 (0.00084) [2022-07-09 16:35:32,287][26022] Updated weights on worker 0-0, policy_version 330680 (0.00081) [2022-07-09 16:35:32,375][25689] Fps is (10 sec: 5595.8, 60 sec: 5617.5, 300 sec: 5634.4). Total num frames: 338616320. Throughput: 0: 5909.9. Samples: 338626170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:32,375][25689] Avg episode reward: [(0, '-46.832')] [2022-07-09 16:35:33,720][26022] Updated weights on worker 0-0, policy_version 330690 (0.00092) [2022-07-09 16:35:35,871][26022] Updated weights on worker 0-0, policy_version 330700 (0.00093) [2022-07-09 16:35:37,397][25689] Fps is (10 sec: 5608.2, 60 sec: 5616.6, 300 sec: 5635.9). Total num frames: 338646016. Throughput: 0: 5047.2. Samples: 338643042. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:37,397][25689] Avg episode reward: [(0, '-46.086')] [2022-07-09 16:35:37,476][26022] Updated weights on worker 0-0, policy_version 330710 (0.00092) [2022-07-09 16:35:39,372][26022] Updated weights on worker 0-0, policy_version 330720 (0.00187) [2022-07-09 16:35:41,189][26022] Updated weights on worker 0-0, policy_version 330730 (0.00086) [2022-07-09 16:35:42,500][25689] Fps is (10 sec: 5764.3, 60 sec: 5650.3, 300 sec: 5635.4). Total num frames: 338674688. Throughput: 0: 5903.5. Samples: 338677050. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:42,501][25689] Avg episode reward: [(0, '-46.006')] [2022-07-09 16:35:43,119][26022] Updated weights on worker 0-0, policy_version 330740 (0.00085) [2022-07-09 16:35:44,784][26022] Updated weights on worker 0-0, policy_version 330750 (0.00090) [2022-07-09 16:35:46,647][26022] Updated weights on worker 0-0, policy_version 330760 (0.00096) [2022-07-09 16:35:47,521][25689] Fps is (10 sec: 5663.8, 60 sec: 5631.7, 300 sec: 5636.6). Total num frames: 338703360. Throughput: 0: 5918.6. Samples: 338711220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:47,522][25689] Avg episode reward: [(0, '-46.379')] [2022-07-09 16:35:48,313][26022] Updated weights on worker 0-0, policy_version 330770 (0.00084) [2022-07-09 16:35:50,171][26022] Updated weights on worker 0-0, policy_version 330780 (0.00088) [2022-07-09 16:35:51,948][26022] Updated weights on worker 0-0, policy_version 330790 (0.00085) [2022-07-09 16:35:52,529][25689] Fps is (10 sec: 5615.7, 60 sec: 5615.3, 300 sec: 5630.7). Total num frames: 338731008. Throughput: 0: 5064.4. Samples: 338728242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:52,530][25689] Avg episode reward: [(0, '-45.576')] [2022-07-09 16:35:53,712][26022] Updated weights on worker 0-0, policy_version 330800 (0.00083) [2022-07-09 16:35:55,578][26022] Updated weights on worker 0-0, policy_version 330810 (0.00083) [2022-07-09 16:35:57,378][26022] Updated weights on worker 0-0, policy_version 330820 (0.00087) [2022-07-09 16:35:57,538][25689] Fps is (10 sec: 5622.7, 60 sec: 5617.1, 300 sec: 5629.0). Total num frames: 338759680. Throughput: 0: 5934.4. Samples: 338762568. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:35:57,539][25689] Avg episode reward: [(0, '-45.439')] [2022-07-09 16:35:59,210][26022] Updated weights on worker 0-0, policy_version 330830 (0.00099) [2022-07-09 16:36:01,041][26022] Updated weights on worker 0-0, policy_version 330840 (0.00087) [2022-07-09 16:36:02,589][25689] Fps is (10 sec: 5497.0, 60 sec: 5638.7, 300 sec: 5628.2). Total num frames: 338786304. Throughput: 0: 5829.0. Samples: 338794144. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:02,589][25689] Avg episode reward: [(0, '-45.034')] [2022-07-09 16:36:03,122][26022] Updated weights on worker 0-0, policy_version 330850 (0.00088) [2022-07-09 16:36:05,138][26022] Updated weights on worker 0-0, policy_version 330860 (0.00097) [2022-07-09 16:36:06,753][26022] Updated weights on worker 0-0, policy_version 330870 (0.00085) [2022-07-09 16:36:07,606][25689] Fps is (10 sec: 5390.2, 60 sec: 5587.6, 300 sec: 5628.4). Total num frames: 338813952. Throughput: 0: 4961.7. Samples: 338810876. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:07,607][25689] Avg episode reward: [(0, '-45.657')] [2022-07-09 16:36:08,860][26022] Updated weights on worker 0-0, policy_version 330880 (0.00091) [2022-07-09 16:36:10,512][26022] Updated weights on worker 0-0, policy_version 330890 (0.00091) [2022-07-09 16:36:12,423][26022] Updated weights on worker 0-0, policy_version 330900 (0.00097) [2022-07-09 16:36:12,613][25689] Fps is (10 sec: 5618.4, 60 sec: 5606.0, 300 sec: 5625.2). Total num frames: 338842624. Throughput: 0: 5813.6. Samples: 338844998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:12,613][25689] Avg episode reward: [(0, '-44.950')] [2022-07-09 16:36:14,079][26022] Updated weights on worker 0-0, policy_version 330910 (0.00087) [2022-07-09 16:36:15,922][26022] Updated weights on worker 0-0, policy_version 330920 (0.00085) [2022-07-09 16:36:17,631][25689] Fps is (10 sec: 5720.0, 60 sec: 5621.6, 300 sec: 5629.1). Total num frames: 338871296. Throughput: 0: 5818.3. Samples: 338879480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:17,632][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 16:36:17,686][26022] Updated weights on worker 0-0, policy_version 330930 (0.00093) [2022-07-09 16:36:19,534][26022] Updated weights on worker 0-0, policy_version 330940 (0.00090) [2022-07-09 16:36:21,261][26022] Updated weights on worker 0-0, policy_version 330950 (0.00081) [2022-07-09 16:36:22,766][25689] Fps is (10 sec: 5748.7, 60 sec: 5640.3, 300 sec: 5630.6). Total num frames: 338900992. Throughput: 0: 5073.7. Samples: 338896518. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:22,767][25689] Avg episode reward: [(0, '-46.771')] [2022-07-09 16:36:23,129][26022] Updated weights on worker 0-0, policy_version 330960 (0.00082) [2022-07-09 16:36:24,802][26022] Updated weights on worker 0-0, policy_version 330970 (0.00094) [2022-07-09 16:36:26,776][26022] Updated weights on worker 0-0, policy_version 330980 (0.00086) [2022-07-09 16:36:27,803][25689] Fps is (10 sec: 5637.9, 60 sec: 5607.2, 300 sec: 5630.3). Total num frames: 338928640. Throughput: 0: 5924.8. Samples: 338930534. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:27,803][25689] Avg episode reward: [(0, '-46.763')] [2022-07-09 16:36:28,470][26022] Updated weights on worker 0-0, policy_version 330990 (0.00089) [2022-07-09 16:36:30,406][26022] Updated weights on worker 0-0, policy_version 331000 (0.00089) [2022-07-09 16:36:32,124][26022] Updated weights on worker 0-0, policy_version 331010 (0.00087) [2022-07-09 16:36:32,821][25689] Fps is (10 sec: 5601.1, 60 sec: 5641.2, 300 sec: 5633.4). Total num frames: 338957312. Throughput: 0: 5909.7. Samples: 338964424. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:32,827][25689] Avg episode reward: [(0, '-47.288')] [2022-07-09 16:36:34,059][26022] Updated weights on worker 0-0, policy_version 331020 (0.00093) [2022-07-09 16:36:35,874][26022] Updated weights on worker 0-0, policy_version 331030 (0.00088) [2022-07-09 16:36:37,579][26022] Updated weights on worker 0-0, policy_version 331040 (0.00090) [2022-07-09 16:36:37,867][25689] Fps is (10 sec: 5697.9, 60 sec: 5622.1, 300 sec: 5636.9). Total num frames: 338985984. Throughput: 0: 5046.0. Samples: 338981586. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:37,867][25689] Avg episode reward: [(0, '-46.831')] [2022-07-09 16:36:39,116][26022] Updated weights on worker 0-0, policy_version 331050 (0.00084) [2022-07-09 16:36:41,208][26022] Updated weights on worker 0-0, policy_version 331060 (0.00089) [2022-07-09 16:36:42,983][25689] Fps is (10 sec: 5642.9, 60 sec: 5620.9, 300 sec: 5631.3). Total num frames: 339014656. Throughput: 0: 5895.2. Samples: 339015704. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:42,984][25689] Avg episode reward: [(0, '-47.493')] [2022-07-09 16:36:43,003][26022] Updated weights on worker 0-0, policy_version 331070 (0.00306) [2022-07-09 16:36:44,723][26022] Updated weights on worker 0-0, policy_version 331080 (0.00093) [2022-07-09 16:36:46,743][26022] Updated weights on worker 0-0, policy_version 331090 (0.00094) [2022-07-09 16:36:48,005][25689] Fps is (10 sec: 5656.2, 60 sec: 5620.8, 300 sec: 5627.6). Total num frames: 339043328. Throughput: 0: 5901.2. Samples: 339049752. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:48,006][25689] Avg episode reward: [(0, '-46.525')] [2022-07-09 16:36:48,335][26022] Updated weights on worker 0-0, policy_version 331100 (0.00062) [2022-07-09 16:36:50,053][26022] Updated weights on worker 0-0, policy_version 331110 (0.00092) [2022-07-09 16:36:51,949][26022] Updated weights on worker 0-0, policy_version 331120 (0.00081) [2022-07-09 16:36:53,012][25689] Fps is (10 sec: 5820.0, 60 sec: 5654.8, 300 sec: 5638.2). Total num frames: 339073024. Throughput: 0: 5083.7. Samples: 339067070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:53,013][25689] Avg episode reward: [(0, '-46.654')] [2022-07-09 16:36:53,876][26022] Updated weights on worker 0-0, policy_version 331130 (0.00090) [2022-07-09 16:36:55,693][26022] Updated weights on worker 0-0, policy_version 331140 (0.00092) [2022-07-09 16:36:57,526][26022] Updated weights on worker 0-0, policy_version 331150 (0.00091) [2022-07-09 16:36:58,016][25689] Fps is (10 sec: 5830.5, 60 sec: 5655.2, 300 sec: 5636.4). Total num frames: 339101696. Throughput: 0: 5919.9. Samples: 339100866. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:36:58,016][25689] Avg episode reward: [(0, '-46.139')] [2022-07-09 16:36:59,258][26022] Updated weights on worker 0-0, policy_version 331160 (0.00086) [2022-07-09 16:37:00,915][26022] Updated weights on worker 0-0, policy_version 331170 (0.00092) [2022-07-09 16:37:03,083][25689] Fps is (10 sec: 5389.2, 60 sec: 5636.8, 300 sec: 5635.5). Total num frames: 339127296. Throughput: 0: 5828.6. Samples: 339132856. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:03,083][25689] Avg episode reward: [(0, '-46.165')] [2022-07-09 16:37:03,431][26022] Updated weights on worker 0-0, policy_version 331180 (0.00089) [2022-07-09 16:37:04,987][26022] Updated weights on worker 0-0, policy_version 331190 (0.00085) [2022-07-09 16:37:06,971][26022] Updated weights on worker 0-0, policy_version 331200 (0.00083) [2022-07-09 16:37:08,148][25689] Fps is (10 sec: 5255.1, 60 sec: 5632.3, 300 sec: 5627.6). Total num frames: 339154944. Throughput: 0: 4961.0. Samples: 339149684. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:08,149][25689] Avg episode reward: [(0, '-46.920')] [2022-07-09 16:37:08,856][26022] Updated weights on worker 0-0, policy_version 331210 (0.00109) [2022-07-09 16:37:10,558][26022] Updated weights on worker 0-0, policy_version 331220 (0.00078) [2022-07-09 16:37:12,323][26022] Updated weights on worker 0-0, policy_version 331230 (0.00092) [2022-07-09 16:37:13,161][25689] Fps is (10 sec: 5690.2, 60 sec: 5648.7, 300 sec: 5631.1). Total num frames: 339184640. Throughput: 0: 5779.1. Samples: 339183508. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:13,161][25689] Avg episode reward: [(0, '-46.715')] [2022-07-09 16:37:14,088][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:37:14,097][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000331240_339189760.pth [2022-07-09 16:37:14,097][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000329257_337159168.pth [2022-07-09 16:37:14,099][26022] Updated weights on worker 0-0, policy_version 331240 (0.00088) [2022-07-09 16:37:15,747][26022] Updated weights on worker 0-0, policy_version 331250 (0.00085) [2022-07-09 16:37:17,649][26022] Updated weights on worker 0-0, policy_version 331260 (0.00086) [2022-07-09 16:37:18,193][25689] Fps is (10 sec: 5708.6, 60 sec: 5630.5, 300 sec: 5632.9). Total num frames: 339212288. Throughput: 0: 5798.2. Samples: 339217860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:18,194][25689] Avg episode reward: [(0, '-45.968')] [2022-07-09 16:37:19,545][26022] Updated weights on worker 0-0, policy_version 331270 (0.00086) [2022-07-09 16:37:21,423][26022] Updated weights on worker 0-0, policy_version 331280 (0.00084) [2022-07-09 16:37:22,981][26022] Updated weights on worker 0-0, policy_version 331290 (0.00093) [2022-07-09 16:37:23,288][25689] Fps is (10 sec: 5662.1, 60 sec: 5634.2, 300 sec: 5631.9). Total num frames: 339241984. Throughput: 0: 5055.5. Samples: 339235004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:23,289][25689] Avg episode reward: [(0, '-45.773')] [2022-07-09 16:37:24,930][26022] Updated weights on worker 0-0, policy_version 331300 (0.00101) [2022-07-09 16:37:26,703][26022] Updated weights on worker 0-0, policy_version 331310 (0.00106) [2022-07-09 16:37:28,336][25689] Fps is (10 sec: 5653.9, 60 sec: 5633.1, 300 sec: 5625.8). Total num frames: 339269632. Throughput: 0: 5906.1. Samples: 339268914. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:28,343][25689] Avg episode reward: [(0, '-46.662')] [2022-07-09 16:37:28,637][26022] Updated weights on worker 0-0, policy_version 331320 (0.00092) [2022-07-09 16:37:30,432][26022] Updated weights on worker 0-0, policy_version 331330 (0.00088) [2022-07-09 16:37:32,015][26022] Updated weights on worker 0-0, policy_version 331340 (0.00087) [2022-07-09 16:37:33,363][25689] Fps is (10 sec: 5590.3, 60 sec: 5632.4, 300 sec: 5633.8). Total num frames: 339298304. Throughput: 0: 5920.5. Samples: 339303116. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:33,363][25689] Avg episode reward: [(0, '-45.671')] [2022-07-09 16:37:34,082][26022] Updated weights on worker 0-0, policy_version 331350 (0.00051) [2022-07-09 16:37:35,556][26022] Updated weights on worker 0-0, policy_version 331360 (0.00082) [2022-07-09 16:37:37,568][26022] Updated weights on worker 0-0, policy_version 331370 (0.00089) [2022-07-09 16:37:38,388][25689] Fps is (10 sec: 5806.2, 60 sec: 5651.1, 300 sec: 5630.9). Total num frames: 339328000. Throughput: 0: 5072.6. Samples: 339320302. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:38,389][25689] Avg episode reward: [(0, '-45.948')] [2022-07-09 16:37:39,511][26022] Updated weights on worker 0-0, policy_version 331380 (0.00098) [2022-07-09 16:37:41,154][26022] Updated weights on worker 0-0, policy_version 331390 (0.00089) [2022-07-09 16:37:43,018][26022] Updated weights on worker 0-0, policy_version 331400 (0.00089) [2022-07-09 16:37:43,466][25689] Fps is (10 sec: 5675.7, 60 sec: 5637.8, 300 sec: 5630.2). Total num frames: 339355648. Throughput: 0: 5900.7. Samples: 339354068. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 16:37:43,467][25689] Avg episode reward: [(0, '-45.671')] [2022-07-09 16:37:44,594][26022] Updated weights on worker 0-0, policy_version 331410 (0.00085) [2022-07-09 16:37:46,567][26022] Updated weights on worker 0-0, policy_version 331420 (0.00092) [2022-07-09 16:37:48,263][26022] Updated weights on worker 0-0, policy_version 331430 (0.00093) [2022-07-09 16:37:48,524][25689] Fps is (10 sec: 5556.9, 60 sec: 5634.5, 300 sec: 5633.0). Total num frames: 339384320. Throughput: 0: 5911.3. Samples: 339388250. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:37:48,524][25689] Avg episode reward: [(0, '-46.584')] [2022-07-09 16:37:50,204][26022] Updated weights on worker 0-0, policy_version 331440 (0.00085) [2022-07-09 16:37:51,945][26022] Updated weights on worker 0-0, policy_version 331450 (0.00093) [2022-07-09 16:37:53,543][25689] Fps is (10 sec: 5690.8, 60 sec: 5616.5, 300 sec: 5629.5). Total num frames: 339412992. Throughput: 0: 5927.8. Samples: 339422740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:37:53,543][25689] Avg episode reward: [(0, '-46.935')] [2022-07-09 16:37:53,820][26022] Updated weights on worker 0-0, policy_version 331460 (0.00089) [2022-07-09 16:37:55,503][26022] Updated weights on worker 0-0, policy_version 331470 (0.00089) [2022-07-09 16:37:57,419][26022] Updated weights on worker 0-0, policy_version 331480 (0.00087) [2022-07-09 16:37:58,576][25689] Fps is (10 sec: 5704.3, 60 sec: 5613.7, 300 sec: 5631.1). Total num frames: 339441664. Throughput: 0: 5913.6. Samples: 339439686. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:37:58,577][25689] Avg episode reward: [(0, '-46.827')] [2022-07-09 16:37:59,206][26022] Updated weights on worker 0-0, policy_version 331490 (0.00099) [2022-07-09 16:38:01,003][26022] Updated weights on worker 0-0, policy_version 331500 (0.00088) [2022-07-09 16:38:03,258][26022] Updated weights on worker 0-0, policy_version 331510 (0.00094) [2022-07-09 16:38:03,666][25689] Fps is (10 sec: 5563.3, 60 sec: 5645.4, 300 sec: 5634.5). Total num frames: 339469312. Throughput: 0: 5831.9. Samples: 339471872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:03,667][25689] Avg episode reward: [(0, '-46.680')] [2022-07-09 16:38:05,004][26022] Updated weights on worker 0-0, policy_version 331520 (0.00085) [2022-07-09 16:38:06,852][26022] Updated weights on worker 0-0, policy_version 331530 (0.00095) [2022-07-09 16:38:08,579][26022] Updated weights on worker 0-0, policy_version 331540 (0.00084) [2022-07-09 16:38:08,737][25689] Fps is (10 sec: 5543.2, 60 sec: 5661.8, 300 sec: 5630.1). Total num frames: 339497984. Throughput: 0: 5794.5. Samples: 339505374. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:08,737][25689] Avg episode reward: [(0, '-47.397')] [2022-07-09 16:38:10,476][26022] Updated weights on worker 0-0, policy_version 331550 (0.00083) [2022-07-09 16:38:12,118][26022] Updated weights on worker 0-0, policy_version 331560 (0.00081) [2022-07-09 16:38:13,837][25689] Fps is (10 sec: 5437.0, 60 sec: 5603.0, 300 sec: 5625.5). Total num frames: 339524608. Throughput: 0: 4919.8. Samples: 339522580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:13,837][25689] Avg episode reward: [(0, '-47.673')] [2022-07-09 16:38:14,174][26022] Updated weights on worker 0-0, policy_version 331570 (0.00086) [2022-07-09 16:38:15,784][26022] Updated weights on worker 0-0, policy_version 331580 (0.00086) [2022-07-09 16:38:17,724][26022] Updated weights on worker 0-0, policy_version 331590 (0.00087) [2022-07-09 16:38:18,876][25689] Fps is (10 sec: 5655.5, 60 sec: 5653.0, 300 sec: 5632.6). Total num frames: 339555328. Throughput: 0: 5755.7. Samples: 339556524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:18,877][25689] Avg episode reward: [(0, '-47.751')] [2022-07-09 16:38:19,369][26022] Updated weights on worker 0-0, policy_version 331600 (0.00087) [2022-07-09 16:38:21,282][26022] Updated weights on worker 0-0, policy_version 331610 (0.00083) [2022-07-09 16:38:23,091][26022] Updated weights on worker 0-0, policy_version 331620 (0.00099) [2022-07-09 16:38:23,925][25689] Fps is (10 sec: 5785.6, 60 sec: 5623.5, 300 sec: 5625.1). Total num frames: 339582976. Throughput: 0: 5861.8. Samples: 339590624. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:23,926][25689] Avg episode reward: [(0, '-47.451')] [2022-07-09 16:38:24,891][26022] Updated weights on worker 0-0, policy_version 331630 (0.00088) [2022-07-09 16:38:26,707][26022] Updated weights on worker 0-0, policy_version 331640 (0.00089) [2022-07-09 16:38:28,533][26022] Updated weights on worker 0-0, policy_version 331650 (0.00082) [2022-07-09 16:38:28,930][25689] Fps is (10 sec: 5703.7, 60 sec: 5661.3, 300 sec: 5636.9). Total num frames: 339612672. Throughput: 0: 5067.4. Samples: 339607702. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:28,930][25689] Avg episode reward: [(0, '-48.828')] [2022-07-09 16:38:30,332][26022] Updated weights on worker 0-0, policy_version 331660 (0.00085) [2022-07-09 16:38:32,139][26022] Updated weights on worker 0-0, policy_version 331670 (0.00083) [2022-07-09 16:38:33,863][26022] Updated weights on worker 0-0, policy_version 331680 (0.00082) [2022-07-09 16:38:33,952][25689] Fps is (10 sec: 5718.9, 60 sec: 5644.8, 300 sec: 5630.3). Total num frames: 339640320. Throughput: 0: 5944.7. Samples: 339642160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:33,953][25689] Avg episode reward: [(0, '-48.669')] [2022-07-09 16:38:35,760][26022] Updated weights on worker 0-0, policy_version 331690 (0.00087) [2022-07-09 16:38:37,599][26022] Updated weights on worker 0-0, policy_version 331700 (0.00106) [2022-07-09 16:38:39,037][25689] Fps is (10 sec: 5572.3, 60 sec: 5622.4, 300 sec: 5629.8). Total num frames: 339668992. Throughput: 0: 5928.2. Samples: 339676042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:39,038][25689] Avg episode reward: [(0, '-49.004')] [2022-07-09 16:38:39,424][26022] Updated weights on worker 0-0, policy_version 331710 (0.00085) [2022-07-09 16:38:41,068][26022] Updated weights on worker 0-0, policy_version 331720 (0.00087) [2022-07-09 16:38:42,971][26022] Updated weights on worker 0-0, policy_version 331730 (0.00093) [2022-07-09 16:38:44,149][25689] Fps is (10 sec: 5724.3, 60 sec: 5653.0, 300 sec: 5631.8). Total num frames: 339698688. Throughput: 0: 5065.5. Samples: 339693068. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:44,149][25689] Avg episode reward: [(0, '-48.913')] [2022-07-09 16:38:44,802][26022] Updated weights on worker 0-0, policy_version 331740 (0.00098) [2022-07-09 16:38:46,479][26022] Updated weights on worker 0-0, policy_version 331750 (0.00098) [2022-07-09 16:38:48,446][26022] Updated weights on worker 0-0, policy_version 331760 (0.00079) [2022-07-09 16:38:49,198][25689] Fps is (10 sec: 5744.3, 60 sec: 5653.8, 300 sec: 5631.4). Total num frames: 339727360. Throughput: 0: 5907.5. Samples: 339727436. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:49,200][25689] Avg episode reward: [(0, '-48.474')] [2022-07-09 16:38:49,950][26022] Updated weights on worker 0-0, policy_version 331770 (0.00093) [2022-07-09 16:38:52,011][26022] Updated weights on worker 0-0, policy_version 331780 (0.00080) [2022-07-09 16:38:53,699][26022] Updated weights on worker 0-0, policy_version 331790 (0.00082) [2022-07-09 16:38:54,226][25689] Fps is (10 sec: 5588.7, 60 sec: 5636.1, 300 sec: 5627.6). Total num frames: 339755008. Throughput: 0: 5881.5. Samples: 339761402. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:54,227][25689] Avg episode reward: [(0, '-48.874')] [2022-07-09 16:38:55,586][26022] Updated weights on worker 0-0, policy_version 331800 (0.00092) [2022-07-09 16:38:57,452][26022] Updated weights on worker 0-0, policy_version 331810 (0.00087) [2022-07-09 16:38:59,142][26022] Updated weights on worker 0-0, policy_version 331820 (0.00092) [2022-07-09 16:38:59,235][25689] Fps is (10 sec: 5611.5, 60 sec: 5638.4, 300 sec: 5631.8). Total num frames: 339783680. Throughput: 0: 5067.2. Samples: 339778388. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:38:59,236][25689] Avg episode reward: [(0, '-48.069')] [2022-07-09 16:39:01,006][26022] Updated weights on worker 0-0, policy_version 331830 (0.00083) [2022-07-09 16:39:03,144][26022] Updated weights on worker 0-0, policy_version 331840 (0.00088) [2022-07-09 16:39:04,319][25689] Fps is (10 sec: 5580.2, 60 sec: 5638.9, 300 sec: 5640.6). Total num frames: 339811328. Throughput: 0: 5829.9. Samples: 339810660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:04,320][25689] Avg episode reward: [(0, '-46.822')] [2022-07-09 16:39:05,014][26022] Updated weights on worker 0-0, policy_version 331850 (0.00091) [2022-07-09 16:39:06,788][26022] Updated weights on worker 0-0, policy_version 331860 (0.00085) [2022-07-09 16:39:08,560][26022] Updated weights on worker 0-0, policy_version 331870 (0.00087) [2022-07-09 16:39:09,415][25689] Fps is (10 sec: 5532.2, 60 sec: 5636.5, 300 sec: 5632.3). Total num frames: 339840000. Throughput: 0: 5809.7. Samples: 339844890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:09,416][25689] Avg episode reward: [(0, '-46.762')] [2022-07-09 16:39:10,312][26022] Updated weights on worker 0-0, policy_version 331880 (0.00092) [2022-07-09 16:39:12,062][26022] Updated weights on worker 0-0, policy_version 331890 (0.00092) [2022-07-09 16:39:13,943][26022] Updated weights on worker 0-0, policy_version 331900 (0.00584) [2022-07-09 16:39:14,173][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:39:14,189][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000331902_339867648.pth [2022-07-09 16:39:14,190][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000329918_337836032.pth [2022-07-09 16:39:14,436][25689] Fps is (10 sec: 5667.9, 60 sec: 5677.6, 300 sec: 5639.1). Total num frames: 339868672. Throughput: 0: 4984.5. Samples: 339862140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:14,437][25689] Avg episode reward: [(0, '-47.281')] [2022-07-09 16:39:15,627][26022] Updated weights on worker 0-0, policy_version 331910 (0.00093) [2022-07-09 16:39:17,530][26022] Updated weights on worker 0-0, policy_version 331920 (0.00091) [2022-07-09 16:39:19,261][26022] Updated weights on worker 0-0, policy_version 331930 (0.00083) [2022-07-09 16:39:19,453][25689] Fps is (10 sec: 5610.6, 60 sec: 5629.1, 300 sec: 5636.0). Total num frames: 339896320. Throughput: 0: 5843.0. Samples: 339896526. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:19,454][25689] Avg episode reward: [(0, '-47.303')] [2022-07-09 16:39:21,119][26022] Updated weights on worker 0-0, policy_version 331940 (0.00087) [2022-07-09 16:39:22,773][26022] Updated weights on worker 0-0, policy_version 331950 (0.00093) [2022-07-09 16:39:24,534][25689] Fps is (10 sec: 5679.2, 60 sec: 5659.9, 300 sec: 5634.5). Total num frames: 339926016. Throughput: 0: 5938.3. Samples: 339930700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:24,534][25689] Avg episode reward: [(0, '-46.858')] [2022-07-09 16:39:24,756][26022] Updated weights on worker 0-0, policy_version 331960 (0.00086) [2022-07-09 16:39:26,383][26022] Updated weights on worker 0-0, policy_version 331970 (0.00089) [2022-07-09 16:39:28,279][26022] Updated weights on worker 0-0, policy_version 331980 (0.00092) [2022-07-09 16:39:29,603][25689] Fps is (10 sec: 5750.9, 60 sec: 5637.1, 300 sec: 5636.8). Total num frames: 339954688. Throughput: 0: 5096.8. Samples: 339947782. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:29,603][25689] Avg episode reward: [(0, '-46.803')] [2022-07-09 16:39:29,834][26022] Updated weights on worker 0-0, policy_version 331990 (0.00082) [2022-07-09 16:39:31,913][26022] Updated weights on worker 0-0, policy_version 332000 (0.00090) [2022-07-09 16:39:33,493][26022] Updated weights on worker 0-0, policy_version 332010 (0.00091) [2022-07-09 16:39:34,606][25689] Fps is (10 sec: 5591.2, 60 sec: 5638.8, 300 sec: 5630.1). Total num frames: 339982336. Throughput: 0: 5945.9. Samples: 339982070. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:34,607][25689] Avg episode reward: [(0, '-47.673')] [2022-07-09 16:39:35,483][26022] Updated weights on worker 0-0, policy_version 332020 (0.00085) [2022-07-09 16:39:37,109][26022] Updated weights on worker 0-0, policy_version 332030 (0.00084) [2022-07-09 16:39:38,919][26022] Updated weights on worker 0-0, policy_version 332040 (0.00086) [2022-07-09 16:39:39,620][25689] Fps is (10 sec: 5724.6, 60 sec: 5662.3, 300 sec: 5642.1). Total num frames: 340012032. Throughput: 0: 5938.6. Samples: 340016286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:39,620][25689] Avg episode reward: [(0, '-47.852')] [2022-07-09 16:39:40,876][26022] Updated weights on worker 0-0, policy_version 332050 (0.00090) [2022-07-09 16:39:42,448][26022] Updated weights on worker 0-0, policy_version 332060 (0.00091) [2022-07-09 16:39:44,506][26022] Updated weights on worker 0-0, policy_version 332070 (0.00081) [2022-07-09 16:39:44,691][25689] Fps is (10 sec: 5787.9, 60 sec: 5649.2, 300 sec: 5637.3). Total num frames: 340040704. Throughput: 0: 5945.0. Samples: 340050536. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:44,691][25689] Avg episode reward: [(0, '-47.574')] [2022-07-09 16:39:46,071][26022] Updated weights on worker 0-0, policy_version 332080 (0.00088) [2022-07-09 16:39:47,939][26022] Updated weights on worker 0-0, policy_version 332090 (0.00086) [2022-07-09 16:39:49,709][25689] Fps is (10 sec: 5683.9, 60 sec: 5652.2, 300 sec: 5637.2). Total num frames: 340069376. Throughput: 0: 5962.4. Samples: 340067662. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:49,709][25689] Avg episode reward: [(0, '-47.960')] [2022-07-09 16:39:49,874][26022] Updated weights on worker 0-0, policy_version 332100 (0.00087) [2022-07-09 16:39:51,534][26022] Updated weights on worker 0-0, policy_version 332110 (0.00093) [2022-07-09 16:39:53,375][26022] Updated weights on worker 0-0, policy_version 332120 (0.00095) [2022-07-09 16:39:54,757][25689] Fps is (10 sec: 5798.6, 60 sec: 5684.1, 300 sec: 5640.3). Total num frames: 340099072. Throughput: 0: 5956.6. Samples: 340102096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:54,758][25689] Avg episode reward: [(0, '-47.935')] [2022-07-09 16:39:55,063][26022] Updated weights on worker 0-0, policy_version 332130 (0.00350) [2022-07-09 16:39:57,095][26022] Updated weights on worker 0-0, policy_version 332140 (0.00102) [2022-07-09 16:39:58,840][26022] Updated weights on worker 0-0, policy_version 332150 (0.00085) [2022-07-09 16:39:59,820][25689] Fps is (10 sec: 5772.8, 60 sec: 5679.0, 300 sec: 5651.3). Total num frames: 340127744. Throughput: 0: 5932.2. Samples: 340136116. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:39:59,821][25689] Avg episode reward: [(0, '-48.798')] [2022-07-09 16:40:00,723][26022] Updated weights on worker 0-0, policy_version 332160 (0.00071) [2022-07-09 16:40:02,728][26022] Updated weights on worker 0-0, policy_version 332170 (0.00087) [2022-07-09 16:40:04,747][26022] Updated weights on worker 0-0, policy_version 332180 (0.00088) [2022-07-09 16:40:04,899][25689] Fps is (10 sec: 5250.2, 60 sec: 5628.8, 300 sec: 5629.5). Total num frames: 340152320. Throughput: 0: 4957.4. Samples: 340150716. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:40:04,900][25689] Avg episode reward: [(0, '-47.757')] [2022-07-09 16:40:06,467][26022] Updated weights on worker 0-0, policy_version 332190 (0.00090) [2022-07-09 16:40:08,307][26022] Updated weights on worker 0-0, policy_version 332200 (0.00085) [2022-07-09 16:40:09,916][25689] Fps is (10 sec: 5274.1, 60 sec: 5636.2, 300 sec: 5633.0). Total num frames: 340180992. Throughput: 0: 5792.0. Samples: 340184702. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:40:09,918][25689] Avg episode reward: [(0, '-47.703')] [2022-07-09 16:40:10,228][26022] Updated weights on worker 0-0, policy_version 332210 (0.00096) [2022-07-09 16:40:11,686][26022] Updated weights on worker 0-0, policy_version 332220 (0.00095) [2022-07-09 16:40:13,964][26022] Updated weights on worker 0-0, policy_version 332230 (0.00095) [2022-07-09 16:40:14,935][25689] Fps is (10 sec: 5816.0, 60 sec: 5653.3, 300 sec: 5639.6). Total num frames: 340210688. Throughput: 0: 5770.4. Samples: 340218530. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-09 16:40:14,935][25689] Avg episode reward: [(0, '-48.315')] [2022-07-09 16:40:15,454][26022] Updated weights on worker 0-0, policy_version 332240 (0.00084) [2022-07-09 16:40:17,384][26022] Updated weights on worker 0-0, policy_version 332250 (0.00085) [2022-07-09 16:40:19,076][26022] Updated weights on worker 0-0, policy_version 332260 (0.00085) [2022-07-09 16:40:19,968][25689] Fps is (10 sec: 5704.6, 60 sec: 5651.8, 300 sec: 5638.5). Total num frames: 340238336. Throughput: 0: 4939.5. Samples: 340235638. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:19,970][25689] Avg episode reward: [(0, '-47.899')] [2022-07-09 16:40:21,060][26022] Updated weights on worker 0-0, policy_version 332270 (0.00087) [2022-07-09 16:40:22,652][26022] Updated weights on worker 0-0, policy_version 332280 (0.00088) [2022-07-09 16:40:24,754][26022] Updated weights on worker 0-0, policy_version 332290 (0.00086) [2022-07-09 16:40:25,048][25689] Fps is (10 sec: 5568.8, 60 sec: 5634.9, 300 sec: 5634.4). Total num frames: 340267008. Throughput: 0: 5914.0. Samples: 340269878. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:25,049][25689] Avg episode reward: [(0, '-47.423')] [2022-07-09 16:40:26,200][26022] Updated weights on worker 0-0, policy_version 332300 (0.00091) [2022-07-09 16:40:28,319][26022] Updated weights on worker 0-0, policy_version 332310 (0.00098) [2022-07-09 16:40:30,056][25689] Fps is (10 sec: 5684.5, 60 sec: 5640.6, 300 sec: 5641.5). Total num frames: 340295680. Throughput: 0: 5914.8. Samples: 340303826. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:30,056][25689] Avg episode reward: [(0, '-47.529')] [2022-07-09 16:40:30,062][26022] Updated weights on worker 0-0, policy_version 332320 (0.00092) [2022-07-09 16:40:31,850][26022] Updated weights on worker 0-0, policy_version 332330 (0.00093) [2022-07-09 16:40:33,500][26022] Updated weights on worker 0-0, policy_version 332340 (0.00089) [2022-07-09 16:40:35,075][25689] Fps is (10 sec: 5616.9, 60 sec: 5639.2, 300 sec: 5634.6). Total num frames: 340323328. Throughput: 0: 5081.0. Samples: 340320864. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:35,076][25689] Avg episode reward: [(0, '-48.253')] [2022-07-09 16:40:35,434][26022] Updated weights on worker 0-0, policy_version 332350 (0.00086) [2022-07-09 16:40:37,133][26022] Updated weights on worker 0-0, policy_version 332360 (0.00083) [2022-07-09 16:40:39,230][26022] Updated weights on worker 0-0, policy_version 332370 (0.00092) [2022-07-09 16:40:40,100][25689] Fps is (10 sec: 5811.1, 60 sec: 5655.0, 300 sec: 5643.0). Total num frames: 340354048. Throughput: 0: 5920.7. Samples: 340354834. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:40,100][25689] Avg episode reward: [(0, '-46.936')] [2022-07-09 16:40:40,738][26022] Updated weights on worker 0-0, policy_version 332380 (0.00085) [2022-07-09 16:40:42,807][26022] Updated weights on worker 0-0, policy_version 332390 (0.00106) [2022-07-09 16:40:44,483][26022] Updated weights on worker 0-0, policy_version 332400 (0.00088) [2022-07-09 16:40:45,164][25689] Fps is (10 sec: 5582.2, 60 sec: 5604.9, 300 sec: 5631.9). Total num frames: 340379648. Throughput: 0: 5904.6. Samples: 340388656. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:45,165][25689] Avg episode reward: [(0, '-47.016')] [2022-07-09 16:40:46,210][26022] Updated weights on worker 0-0, policy_version 332410 (0.00087) [2022-07-09 16:40:48,235][26022] Updated weights on worker 0-0, policy_version 332420 (0.00052) [2022-07-09 16:40:49,796][26022] Updated weights on worker 0-0, policy_version 332430 (0.00088) [2022-07-09 16:40:50,179][25689] Fps is (10 sec: 5486.3, 60 sec: 5622.1, 300 sec: 5638.6). Total num frames: 340409344. Throughput: 0: 5065.4. Samples: 340405758. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:50,179][25689] Avg episode reward: [(0, '-46.384')] [2022-07-09 16:40:51,831][26022] Updated weights on worker 0-0, policy_version 332440 (0.00076) [2022-07-09 16:40:53,507][26022] Updated weights on worker 0-0, policy_version 332450 (0.00088) [2022-07-09 16:40:55,222][26022] Updated weights on worker 0-0, policy_version 332460 (0.00087) [2022-07-09 16:40:55,223][25689] Fps is (10 sec: 5802.8, 60 sec: 5605.5, 300 sec: 5638.0). Total num frames: 340438016. Throughput: 0: 5908.0. Samples: 340439896. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:40:55,223][25689] Avg episode reward: [(0, '-46.382')] [2022-07-09 16:40:57,009][26022] Updated weights on worker 0-0, policy_version 332470 (0.00084) [2022-07-09 16:40:58,947][26022] Updated weights on worker 0-0, policy_version 332480 (0.00086) [2022-07-09 16:41:00,237][25689] Fps is (10 sec: 5701.1, 60 sec: 5610.0, 300 sec: 5645.5). Total num frames: 340466688. Throughput: 0: 5915.4. Samples: 340473954. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:00,238][25689] Avg episode reward: [(0, '-47.023')] [2022-07-09 16:41:00,856][26022] Updated weights on worker 0-0, policy_version 332490 (0.00106) [2022-07-09 16:41:02,914][26022] Updated weights on worker 0-0, policy_version 332500 (0.00091) [2022-07-09 16:41:04,734][26022] Updated weights on worker 0-0, policy_version 332510 (0.00089) [2022-07-09 16:41:05,376][25689] Fps is (10 sec: 5345.1, 60 sec: 5621.4, 300 sec: 5636.4). Total num frames: 340492288. Throughput: 0: 4950.5. Samples: 340488716. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:05,377][25689] Avg episode reward: [(0, '-46.337')] [2022-07-09 16:41:06,506][26022] Updated weights on worker 0-0, policy_version 332520 (0.00086) [2022-07-09 16:41:08,452][26022] Updated weights on worker 0-0, policy_version 332530 (0.00093) [2022-07-09 16:41:10,233][26022] Updated weights on worker 0-0, policy_version 332540 (0.00091) [2022-07-09 16:41:10,398][25689] Fps is (10 sec: 5341.5, 60 sec: 5621.0, 300 sec: 5636.1). Total num frames: 340520960. Throughput: 0: 5782.0. Samples: 340522664. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:10,398][25689] Avg episode reward: [(0, '-46.469')] [2022-07-09 16:41:12,048][26022] Updated weights on worker 0-0, policy_version 332550 (0.00096) [2022-07-09 16:41:14,011][26022] Updated weights on worker 0-0, policy_version 332560 (0.00461) [2022-07-09 16:41:14,259][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:41:14,269][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000332562_340543488.pth [2022-07-09 16:41:14,269][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000330579_338512896.pth [2022-07-09 16:41:15,408][25689] Fps is (10 sec: 5716.2, 60 sec: 5604.8, 300 sec: 5636.2). Total num frames: 340549632. Throughput: 0: 5756.8. Samples: 340556100. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:15,409][25689] Avg episode reward: [(0, '-46.026')] [2022-07-09 16:41:15,708][26022] Updated weights on worker 0-0, policy_version 332570 (0.00085) [2022-07-09 16:41:17,716][26022] Updated weights on worker 0-0, policy_version 332580 (0.00252) [2022-07-09 16:41:19,499][26022] Updated weights on worker 0-0, policy_version 332590 (0.00084) [2022-07-09 16:41:20,441][25689] Fps is (10 sec: 5607.4, 60 sec: 5604.8, 300 sec: 5631.3). Total num frames: 340577280. Throughput: 0: 4910.2. Samples: 340573160. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:20,442][25689] Avg episode reward: [(0, '-46.506')] [2022-07-09 16:41:21,312][26022] Updated weights on worker 0-0, policy_version 332600 (0.00089) [2022-07-09 16:41:22,991][26022] Updated weights on worker 0-0, policy_version 332610 (0.00095) [2022-07-09 16:41:24,769][26022] Updated weights on worker 0-0, policy_version 332620 (0.00088) [2022-07-09 16:41:25,501][25689] Fps is (10 sec: 5681.8, 60 sec: 5623.7, 300 sec: 5637.7). Total num frames: 340606976. Throughput: 0: 5884.2. Samples: 340607132. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:25,501][25689] Avg episode reward: [(0, '-46.290')] [2022-07-09 16:41:26,662][26022] Updated weights on worker 0-0, policy_version 332630 (0.00090) [2022-07-09 16:41:28,687][26022] Updated weights on worker 0-0, policy_version 332640 (0.00089) [2022-07-09 16:41:30,305][26022] Updated weights on worker 0-0, policy_version 332650 (0.00087) [2022-07-09 16:41:30,512][25689] Fps is (10 sec: 5694.0, 60 sec: 5606.4, 300 sec: 5634.4). Total num frames: 340634624. Throughput: 0: 5873.7. Samples: 340640812. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:30,513][25689] Avg episode reward: [(0, '-46.209')] [2022-07-09 16:41:32,046][26022] Updated weights on worker 0-0, policy_version 332660 (0.00083) [2022-07-09 16:41:33,862][26022] Updated weights on worker 0-0, policy_version 332670 (0.00083) [2022-07-09 16:41:35,522][25689] Fps is (10 sec: 5517.6, 60 sec: 5607.3, 300 sec: 5631.6). Total num frames: 340662272. Throughput: 0: 5060.8. Samples: 340657894. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:35,523][25689] Avg episode reward: [(0, '-46.211')] [2022-07-09 16:41:35,737][26022] Updated weights on worker 0-0, policy_version 332680 (0.00089) [2022-07-09 16:41:37,414][26022] Updated weights on worker 0-0, policy_version 332690 (0.00092) [2022-07-09 16:41:39,219][26022] Updated weights on worker 0-0, policy_version 332700 (0.00083) [2022-07-09 16:41:40,536][25689] Fps is (10 sec: 5721.0, 60 sec: 5591.4, 300 sec: 5637.0). Total num frames: 340691968. Throughput: 0: 5926.3. Samples: 340692246. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:40,536][25689] Avg episode reward: [(0, '-47.671')] [2022-07-09 16:41:40,994][26022] Updated weights on worker 0-0, policy_version 332710 (0.00088) [2022-07-09 16:41:42,775][26022] Updated weights on worker 0-0, policy_version 332720 (0.00098) [2022-07-09 16:41:44,813][26022] Updated weights on worker 0-0, policy_version 332730 (0.00087) [2022-07-09 16:41:45,658][25689] Fps is (10 sec: 5859.7, 60 sec: 5653.7, 300 sec: 5638.6). Total num frames: 340721664. Throughput: 0: 5916.0. Samples: 340726382. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:45,658][25689] Avg episode reward: [(0, '-47.610')] [2022-07-09 16:41:46,438][26022] Updated weights on worker 0-0, policy_version 332740 (0.00098) [2022-07-09 16:41:48,292][26022] Updated weights on worker 0-0, policy_version 332750 (0.00098) [2022-07-09 16:41:50,098][26022] Updated weights on worker 0-0, policy_version 332760 (0.00086) [2022-07-09 16:41:50,683][25689] Fps is (10 sec: 5651.0, 60 sec: 5618.9, 300 sec: 5631.3). Total num frames: 340749312. Throughput: 0: 5093.5. Samples: 340743552. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:50,684][25689] Avg episode reward: [(0, '-47.363')] [2022-07-09 16:41:51,794][26022] Updated weights on worker 0-0, policy_version 332770 (0.00084) [2022-07-09 16:41:53,763][26022] Updated weights on worker 0-0, policy_version 332780 (0.00089) [2022-07-09 16:41:55,499][26022] Updated weights on worker 0-0, policy_version 332790 (0.00093) [2022-07-09 16:41:55,694][25689] Fps is (10 sec: 5713.9, 60 sec: 5638.9, 300 sec: 5634.6). Total num frames: 340779008. Throughput: 0: 5950.7. Samples: 340777930. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:41:55,695][25689] Avg episode reward: [(0, '-46.738')] [2022-07-09 16:41:57,419][26022] Updated weights on worker 0-0, policy_version 332800 (0.00087) [2022-07-09 16:41:58,999][26022] Updated weights on worker 0-0, policy_version 332810 (0.00096) [2022-07-09 16:42:00,723][25689] Fps is (10 sec: 5610.0, 60 sec: 5603.7, 300 sec: 5638.8). Total num frames: 340805632. Throughput: 0: 5937.8. Samples: 340812112. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:00,723][25689] Avg episode reward: [(0, '-47.981')] [2022-07-09 16:42:01,093][26022] Updated weights on worker 0-0, policy_version 332820 (0.00100) [2022-07-09 16:42:02,977][26022] Updated weights on worker 0-0, policy_version 332830 (0.00050) [2022-07-09 16:42:05,068][26022] Updated weights on worker 0-0, policy_version 332840 (0.00090) [2022-07-09 16:42:05,776][25689] Fps is (10 sec: 5383.2, 60 sec: 5645.6, 300 sec: 5639.1). Total num frames: 340833280. Throughput: 0: 4999.2. Samples: 340826954. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:05,776][25689] Avg episode reward: [(0, '-47.544')] [2022-07-09 16:42:06,506][26022] Updated weights on worker 0-0, policy_version 332850 (0.00093) [2022-07-09 16:42:08,599][26022] Updated weights on worker 0-0, policy_version 332860 (0.00085) [2022-07-09 16:42:10,065][26022] Updated weights on worker 0-0, policy_version 332870 (0.00091) [2022-07-09 16:42:10,803][25689] Fps is (10 sec: 5485.6, 60 sec: 5628.1, 300 sec: 5631.9). Total num frames: 340860928. Throughput: 0: 5843.3. Samples: 340861116. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:10,804][25689] Avg episode reward: [(0, '-47.273')] [2022-07-09 16:42:12,054][26022] Updated weights on worker 0-0, policy_version 332880 (0.00088) [2022-07-09 16:42:13,799][26022] Updated weights on worker 0-0, policy_version 332890 (0.00085) [2022-07-09 16:42:15,690][26022] Updated weights on worker 0-0, policy_version 332900 (0.00086) [2022-07-09 16:42:15,873][25689] Fps is (10 sec: 5577.8, 60 sec: 5622.6, 300 sec: 5634.6). Total num frames: 340889600. Throughput: 0: 5809.8. Samples: 340895166. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:15,874][25689] Avg episode reward: [(0, '-47.732')] [2022-07-09 16:42:17,435][26022] Updated weights on worker 0-0, policy_version 332910 (0.00082) [2022-07-09 16:42:19,235][26022] Updated weights on worker 0-0, policy_version 332920 (0.00087) [2022-07-09 16:42:20,916][25689] Fps is (10 sec: 5771.7, 60 sec: 5655.6, 300 sec: 5635.6). Total num frames: 340919296. Throughput: 0: 4967.0. Samples: 340912410. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:20,916][25689] Avg episode reward: [(0, '-47.802')] [2022-07-09 16:42:20,937][26022] Updated weights on worker 0-0, policy_version 332930 (0.00077) [2022-07-09 16:42:23,028][26022] Updated weights on worker 0-0, policy_version 332940 (0.00090) [2022-07-09 16:42:24,599][26022] Updated weights on worker 0-0, policy_version 332950 (0.00090) [2022-07-09 16:42:25,959][25689] Fps is (10 sec: 5584.1, 60 sec: 5606.3, 300 sec: 5632.3). Total num frames: 340945920. Throughput: 0: 5919.8. Samples: 340946434. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:25,959][25689] Avg episode reward: [(0, '-46.804')] [2022-07-09 16:42:26,645][26022] Updated weights on worker 0-0, policy_version 332960 (0.00094) [2022-07-09 16:42:28,121][26022] Updated weights on worker 0-0, policy_version 332970 (0.00087) [2022-07-09 16:42:30,125][26022] Updated weights on worker 0-0, policy_version 332980 (0.00061) [2022-07-09 16:42:30,969][25689] Fps is (10 sec: 5602.1, 60 sec: 5640.3, 300 sec: 5636.0). Total num frames: 340975616. Throughput: 0: 5912.5. Samples: 340980348. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:30,969][25689] Avg episode reward: [(0, '-46.679')] [2022-07-09 16:42:31,763][26022] Updated weights on worker 0-0, policy_version 332990 (0.00464) [2022-07-09 16:42:33,945][26022] Updated weights on worker 0-0, policy_version 333000 (0.00084) [2022-07-09 16:42:35,518][26022] Updated weights on worker 0-0, policy_version 333010 (0.00091) [2022-07-09 16:42:35,986][25689] Fps is (10 sec: 5923.0, 60 sec: 5673.5, 300 sec: 5636.2). Total num frames: 341005312. Throughput: 0: 5086.5. Samples: 340997470. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:35,988][25689] Avg episode reward: [(0, '-46.080')] [2022-07-09 16:42:37,635][26022] Updated weights on worker 0-0, policy_version 333020 (0.00088) [2022-07-09 16:42:38,899][26022] Updated weights on worker 0-0, policy_version 333030 (0.00085) [2022-07-09 16:42:40,993][25689] Fps is (10 sec: 5720.4, 60 sec: 5640.2, 300 sec: 5637.5). Total num frames: 341032960. Throughput: 0: 5957.0. Samples: 341032014. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:40,994][25689] Avg episode reward: [(0, '-45.724')] [2022-07-09 16:42:41,006][26022] Updated weights on worker 0-0, policy_version 333040 (0.00085) [2022-07-09 16:42:42,318][26022] Updated weights on worker 0-0, policy_version 333050 (0.00084) [2022-07-09 16:42:44,600][26022] Updated weights on worker 0-0, policy_version 333060 (0.00094) [2022-07-09 16:42:46,073][25689] Fps is (10 sec: 5684.6, 60 sec: 5644.1, 300 sec: 5640.5). Total num frames: 341062656. Throughput: 0: 5957.5. Samples: 341066268. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 16:42:46,074][25689] Avg episode reward: [(0, '-45.709')] [2022-07-09 16:42:46,093][26022] Updated weights on worker 0-0, policy_version 333070 (0.00088) [2022-07-09 16:42:48,036][26022] Updated weights on worker 0-0, policy_version 333080 (0.00088) [2022-07-09 16:42:49,987][26022] Updated weights on worker 0-0, policy_version 333090 (0.00089) [2022-07-09 16:42:51,091][25689] Fps is (10 sec: 5780.2, 60 sec: 5661.8, 300 sec: 5640.5). Total num frames: 341091328. Throughput: 0: 5111.9. Samples: 341083212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:42:51,092][25689] Avg episode reward: [(0, '-45.369')] [2022-07-09 16:42:51,663][26022] Updated weights on worker 0-0, policy_version 333100 (0.00094) [2022-07-09 16:42:53,393][26022] Updated weights on worker 0-0, policy_version 333110 (0.00089) [2022-07-09 16:42:55,177][26022] Updated weights on worker 0-0, policy_version 333120 (0.00089) [2022-07-09 16:42:56,157][25689] Fps is (10 sec: 5585.4, 60 sec: 5622.8, 300 sec: 5636.5). Total num frames: 341118976. Throughput: 0: 5951.9. Samples: 341117526. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:42:56,157][25689] Avg episode reward: [(0, '-46.556')] [2022-07-09 16:42:57,015][26022] Updated weights on worker 0-0, policy_version 333130 (0.00085) [2022-07-09 16:42:59,108][26022] Updated weights on worker 0-0, policy_version 333140 (0.00088) [2022-07-09 16:43:00,581][26022] Updated weights on worker 0-0, policy_version 333150 (0.00089) [2022-07-09 16:43:01,211][25689] Fps is (10 sec: 5666.7, 60 sec: 5671.2, 300 sec: 5644.1). Total num frames: 341148672. Throughput: 0: 5910.3. Samples: 341151504. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:01,211][25689] Avg episode reward: [(0, '-46.873')] [2022-07-09 16:43:03,096][26022] Updated weights on worker 0-0, policy_version 333160 (0.00087) [2022-07-09 16:43:04,742][26022] Updated weights on worker 0-0, policy_version 333170 (0.00095) [2022-07-09 16:43:06,252][25689] Fps is (10 sec: 5376.1, 60 sec: 5621.5, 300 sec: 5630.9). Total num frames: 341173248. Throughput: 0: 5792.9. Samples: 341183160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:06,252][25689] Avg episode reward: [(0, '-47.135')] [2022-07-09 16:43:06,759][26022] Updated weights on worker 0-0, policy_version 333180 (0.00054) [2022-07-09 16:43:08,158][26022] Updated weights on worker 0-0, policy_version 333190 (0.00088) [2022-07-09 16:43:10,210][26022] Updated weights on worker 0-0, policy_version 333200 (0.00767) [2022-07-09 16:43:11,254][25689] Fps is (10 sec: 5505.7, 60 sec: 5674.7, 300 sec: 5646.5). Total num frames: 341203968. Throughput: 0: 5813.2. Samples: 341200422. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:11,255][25689] Avg episode reward: [(0, '-47.092')] [2022-07-09 16:43:11,735][26022] Updated weights on worker 0-0, policy_version 333210 (0.00079) [2022-07-09 16:43:13,862][26022] Updated weights on worker 0-0, policy_version 333220 (0.00091) [2022-07-09 16:43:14,349][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:43:14,362][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000333224_341221376.pth [2022-07-09 16:43:14,363][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000331240_339189760.pth [2022-07-09 16:43:15,540][26022] Updated weights on worker 0-0, policy_version 333230 (0.00087) [2022-07-09 16:43:16,257][25689] Fps is (10 sec: 5834.0, 60 sec: 5664.0, 300 sec: 5636.8). Total num frames: 341231616. Throughput: 0: 5833.9. Samples: 341234786. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:16,257][25689] Avg episode reward: [(0, '-47.579')] [2022-07-09 16:43:17,348][26022] Updated weights on worker 0-0, policy_version 333240 (0.00096) [2022-07-09 16:43:18,938][26022] Updated weights on worker 0-0, policy_version 333250 (0.00087) [2022-07-09 16:43:20,947][26022] Updated weights on worker 0-0, policy_version 333260 (0.00087) [2022-07-09 16:43:21,281][25689] Fps is (10 sec: 5514.5, 60 sec: 5631.8, 300 sec: 5637.3). Total num frames: 341259264. Throughput: 0: 5864.4. Samples: 341269206. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:21,282][25689] Avg episode reward: [(0, '-48.017')] [2022-07-09 16:43:22,551][26022] Updated weights on worker 0-0, policy_version 333270 (0.00089) [2022-07-09 16:43:24,480][26022] Updated weights on worker 0-0, policy_version 333280 (0.00080) [2022-07-09 16:43:26,166][26022] Updated weights on worker 0-0, policy_version 333290 (0.00081) [2022-07-09 16:43:26,350][25689] Fps is (10 sec: 5782.8, 60 sec: 5697.2, 300 sec: 5639.5). Total num frames: 341289984. Throughput: 0: 5134.7. Samples: 341286358. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:26,350][25689] Avg episode reward: [(0, '-47.453')] [2022-07-09 16:43:27,999][26022] Updated weights on worker 0-0, policy_version 333300 (0.00096) [2022-07-09 16:43:29,899][26022] Updated weights on worker 0-0, policy_version 333310 (0.00095) [2022-07-09 16:43:31,427][25689] Fps is (10 sec: 5753.2, 60 sec: 5657.1, 300 sec: 5638.5). Total num frames: 341317632. Throughput: 0: 5934.7. Samples: 341320140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:31,427][25689] Avg episode reward: [(0, '-47.901')] [2022-07-09 16:43:31,805][26022] Updated weights on worker 0-0, policy_version 333320 (0.00088) [2022-07-09 16:43:33,457][26022] Updated weights on worker 0-0, policy_version 333330 (0.00082) [2022-07-09 16:43:35,259][26022] Updated weights on worker 0-0, policy_version 333340 (0.00090) [2022-07-09 16:43:36,460][25689] Fps is (10 sec: 5570.8, 60 sec: 5638.7, 300 sec: 5639.5). Total num frames: 341346304. Throughput: 0: 5930.0. Samples: 341354590. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:36,461][25689] Avg episode reward: [(0, '-48.324')] [2022-07-09 16:43:37,058][26022] Updated weights on worker 0-0, policy_version 333350 (0.00091) [2022-07-09 16:43:38,952][26022] Updated weights on worker 0-0, policy_version 333360 (0.00087) [2022-07-09 16:43:40,729][26022] Updated weights on worker 0-0, policy_version 333370 (0.00066) [2022-07-09 16:43:41,510][25689] Fps is (10 sec: 5686.9, 60 sec: 5651.6, 300 sec: 5637.2). Total num frames: 341374976. Throughput: 0: 5060.6. Samples: 341371576. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:41,512][25689] Avg episode reward: [(0, '-48.607')] [2022-07-09 16:43:42,283][26022] Updated weights on worker 0-0, policy_version 333380 (0.00087) [2022-07-09 16:43:44,346][26022] Updated weights on worker 0-0, policy_version 333390 (0.00092) [2022-07-09 16:43:46,292][26022] Updated weights on worker 0-0, policy_version 333400 (0.00081) [2022-07-09 16:43:46,596][25689] Fps is (10 sec: 5657.4, 60 sec: 5634.1, 300 sec: 5636.5). Total num frames: 341403648. Throughput: 0: 5902.4. Samples: 341405858. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:46,598][25689] Avg episode reward: [(0, '-48.443')] [2022-07-09 16:43:47,783][26022] Updated weights on worker 0-0, policy_version 333410 (0.00082) [2022-07-09 16:43:49,840][26022] Updated weights on worker 0-0, policy_version 333420 (0.00088) [2022-07-09 16:43:51,428][26022] Updated weights on worker 0-0, policy_version 333430 (0.00089) [2022-07-09 16:43:51,675][25689] Fps is (10 sec: 5742.3, 60 sec: 5645.4, 300 sec: 5642.4). Total num frames: 341433344. Throughput: 0: 5895.6. Samples: 341439514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:51,675][25689] Avg episode reward: [(0, '-48.235')] [2022-07-09 16:43:53,372][26022] Updated weights on worker 0-0, policy_version 333440 (0.00089) [2022-07-09 16:43:55,000][26022] Updated weights on worker 0-0, policy_version 333450 (0.00094) [2022-07-09 16:43:56,689][25689] Fps is (10 sec: 5681.3, 60 sec: 5650.1, 300 sec: 5638.9). Total num frames: 341460992. Throughput: 0: 5039.5. Samples: 341456538. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:43:56,691][25689] Avg episode reward: [(0, '-47.226')] [2022-07-09 16:43:56,976][26022] Updated weights on worker 0-0, policy_version 333460 (0.00090) [2022-07-09 16:43:58,807][26022] Updated weights on worker 0-0, policy_version 333470 (0.00102) [2022-07-09 16:44:00,687][26022] Updated weights on worker 0-0, policy_version 333480 (0.00395) [2022-07-09 16:44:01,711][25689] Fps is (10 sec: 5509.8, 60 sec: 5619.3, 300 sec: 5640.1). Total num frames: 341488640. Throughput: 0: 5877.9. Samples: 341490314. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:01,712][25689] Avg episode reward: [(0, '-46.618')] [2022-07-09 16:44:02,815][26022] Updated weights on worker 0-0, policy_version 333490 (0.00095) [2022-07-09 16:44:04,723][26022] Updated weights on worker 0-0, policy_version 333500 (0.00091) [2022-07-09 16:44:06,384][26022] Updated weights on worker 0-0, policy_version 333510 (0.00085) [2022-07-09 16:44:06,850][25689] Fps is (10 sec: 5442.0, 60 sec: 5660.9, 300 sec: 5635.8). Total num frames: 341516288. Throughput: 0: 5732.3. Samples: 341521962. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:06,851][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 16:44:08,295][26022] Updated weights on worker 0-0, policy_version 333520 (0.00087) [2022-07-09 16:44:10,112][26022] Updated weights on worker 0-0, policy_version 333530 (0.00085) [2022-07-09 16:44:11,719][26022] Updated weights on worker 0-0, policy_version 333540 (0.00090) [2022-07-09 16:44:11,935][25689] Fps is (10 sec: 5508.7, 60 sec: 5619.5, 300 sec: 5634.6). Total num frames: 341544960. Throughput: 0: 4916.9. Samples: 341539130. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:11,935][25689] Avg episode reward: [(0, '-46.301')] [2022-07-09 16:44:13,784][26022] Updated weights on worker 0-0, policy_version 333550 (0.00090) [2022-07-09 16:44:15,307][26022] Updated weights on worker 0-0, policy_version 333560 (0.00088) [2022-07-09 16:44:16,951][25689] Fps is (10 sec: 5474.5, 60 sec: 5601.3, 300 sec: 5631.2). Total num frames: 341571584. Throughput: 0: 5764.4. Samples: 341573334. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:16,951][25689] Avg episode reward: [(0, '-46.772')] [2022-07-09 16:44:17,357][26022] Updated weights on worker 0-0, policy_version 333570 (0.00088) [2022-07-09 16:44:19,156][26022] Updated weights on worker 0-0, policy_version 333580 (0.00080) [2022-07-09 16:44:20,810][26022] Updated weights on worker 0-0, policy_version 333590 (0.00104) [2022-07-09 16:44:21,972][25689] Fps is (10 sec: 5712.8, 60 sec: 5652.2, 300 sec: 5635.7). Total num frames: 341602304. Throughput: 0: 5766.0. Samples: 341607142. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:21,973][25689] Avg episode reward: [(0, '-47.125')] [2022-07-09 16:44:22,969][26022] Updated weights on worker 0-0, policy_version 333600 (0.00091) [2022-07-09 16:44:24,491][26022] Updated weights on worker 0-0, policy_version 333610 (0.00088) [2022-07-09 16:44:26,384][26022] Updated weights on worker 0-0, policy_version 333620 (0.00080) [2022-07-09 16:44:27,029][25689] Fps is (10 sec: 5893.1, 60 sec: 5619.6, 300 sec: 5636.0). Total num frames: 341630976. Throughput: 0: 5069.1. Samples: 341624250. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:27,030][25689] Avg episode reward: [(0, '-47.944')] [2022-07-09 16:44:28,259][26022] Updated weights on worker 0-0, policy_version 333630 (0.00097) [2022-07-09 16:44:29,852][26022] Updated weights on worker 0-0, policy_version 333640 (0.00097) [2022-07-09 16:44:31,984][26022] Updated weights on worker 0-0, policy_version 333650 (0.00091) [2022-07-09 16:44:32,088][25689] Fps is (10 sec: 5466.3, 60 sec: 5604.4, 300 sec: 5631.5). Total num frames: 341657600. Throughput: 0: 5907.7. Samples: 341658192. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:32,089][25689] Avg episode reward: [(0, '-48.555')] [2022-07-09 16:44:33,562][26022] Updated weights on worker 0-0, policy_version 333660 (0.00088) [2022-07-09 16:44:35,473][26022] Updated weights on worker 0-0, policy_version 333670 (0.00085) [2022-07-09 16:44:37,105][25689] Fps is (10 sec: 5487.8, 60 sec: 5605.8, 300 sec: 5628.0). Total num frames: 341686272. Throughput: 0: 5886.8. Samples: 341691980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:37,106][25689] Avg episode reward: [(0, '-48.457')] [2022-07-09 16:44:37,520][26022] Updated weights on worker 0-0, policy_version 333680 (0.00096) [2022-07-09 16:44:39,016][26022] Updated weights on worker 0-0, policy_version 333690 (0.00083) [2022-07-09 16:44:41,085][26022] Updated weights on worker 0-0, policy_version 333700 (0.00086) [2022-07-09 16:44:42,119][25689] Fps is (10 sec: 5818.8, 60 sec: 5626.1, 300 sec: 5632.5). Total num frames: 341715968. Throughput: 0: 5050.8. Samples: 341708898. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:42,120][25689] Avg episode reward: [(0, '-48.468')] [2022-07-09 16:44:42,699][26022] Updated weights on worker 0-0, policy_version 333710 (0.00088) [2022-07-09 16:44:44,554][26022] Updated weights on worker 0-0, policy_version 333720 (0.00093) [2022-07-09 16:44:46,470][26022] Updated weights on worker 0-0, policy_version 333730 (0.00098) [2022-07-09 16:44:47,241][25689] Fps is (10 sec: 5556.4, 60 sec: 5588.9, 300 sec: 5623.6). Total num frames: 341742592. Throughput: 0: 5861.9. Samples: 341742732. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:47,242][25689] Avg episode reward: [(0, '-48.712')] [2022-07-09 16:44:48,075][26022] Updated weights on worker 0-0, policy_version 333740 (0.00081) [2022-07-09 16:44:50,123][26022] Updated weights on worker 0-0, policy_version 333750 (0.00089) [2022-07-09 16:44:51,794][26022] Updated weights on worker 0-0, policy_version 333760 (0.00090) [2022-07-09 16:44:52,243][25689] Fps is (10 sec: 5563.0, 60 sec: 5596.1, 300 sec: 5624.5). Total num frames: 341772288. Throughput: 0: 5891.4. Samples: 341776932. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:52,244][25689] Avg episode reward: [(0, '-48.554')] [2022-07-09 16:44:53,623][26022] Updated weights on worker 0-0, policy_version 333770 (0.00095) [2022-07-09 16:44:55,457][26022] Updated weights on worker 0-0, policy_version 333780 (0.00084) [2022-07-09 16:44:57,007][26022] Updated weights on worker 0-0, policy_version 333790 (0.00080) [2022-07-09 16:44:57,290][25689] Fps is (10 sec: 5910.5, 60 sec: 5626.9, 300 sec: 5628.3). Total num frames: 341801984. Throughput: 0: 5057.3. Samples: 341794060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:44:57,291][25689] Avg episode reward: [(0, '-48.013')] [2022-07-09 16:44:59,022][26022] Updated weights on worker 0-0, policy_version 333800 (0.00092) [2022-07-09 16:45:00,783][26022] Updated weights on worker 0-0, policy_version 333810 (0.00105) [2022-07-09 16:45:02,292][25689] Fps is (10 sec: 5502.8, 60 sec: 5594.9, 300 sec: 5633.2). Total num frames: 341827584. Throughput: 0: 5921.1. Samples: 341828344. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:45:02,292][25689] Avg episode reward: [(0, '-48.210')] [2022-07-09 16:45:02,991][26022] Updated weights on worker 0-0, policy_version 333820 (0.00084) [2022-07-09 16:45:04,834][26022] Updated weights on worker 0-0, policy_version 333830 (0.00086) [2022-07-09 16:45:06,608][26022] Updated weights on worker 0-0, policy_version 333840 (0.00093) [2022-07-09 16:45:07,356][25689] Fps is (10 sec: 5493.3, 60 sec: 5635.7, 300 sec: 5635.7). Total num frames: 341857280. Throughput: 0: 5848.4. Samples: 341860370. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:45:07,358][25689] Avg episode reward: [(0, '-47.634')] [2022-07-09 16:45:08,304][26022] Updated weights on worker 0-0, policy_version 333850 (0.00091) [2022-07-09 16:45:10,215][26022] Updated weights on worker 0-0, policy_version 333860 (0.00089) [2022-07-09 16:45:11,971][26022] Updated weights on worker 0-0, policy_version 333870 (0.00085) [2022-07-09 16:45:12,368][25689] Fps is (10 sec: 5691.2, 60 sec: 5625.5, 300 sec: 5629.0). Total num frames: 341884928. Throughput: 0: 5841.5. Samples: 341894490. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:45:12,368][25689] Avg episode reward: [(0, '-47.128')] [2022-07-09 16:45:13,879][26022] Updated weights on worker 0-0, policy_version 333880 (0.00086) [2022-07-09 16:45:14,403][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:45:14,415][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000333884_341897216.pth [2022-07-09 16:45:14,416][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000331902_339867648.pth [2022-07-09 16:45:15,794][26022] Updated weights on worker 0-0, policy_version 333890 (0.00097) [2022-07-09 16:45:17,353][26022] Updated weights on worker 0-0, policy_version 333900 (0.00093) [2022-07-09 16:45:17,379][25689] Fps is (10 sec: 5619.1, 60 sec: 5659.9, 300 sec: 5632.8). Total num frames: 341913600. Throughput: 0: 5838.0. Samples: 341911338. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 16:45:17,381][25689] Avg episode reward: [(0, '-46.966')] [2022-07-09 16:45:19,331][26022] Updated weights on worker 0-0, policy_version 333910 (0.00096) [2022-07-09 16:45:21,027][26022] Updated weights on worker 0-0, policy_version 333920 (0.00085) [2022-07-09 16:45:22,413][25689] Fps is (10 sec: 5504.7, 60 sec: 5590.9, 300 sec: 5626.8). Total num frames: 341940224. Throughput: 0: 5803.6. Samples: 341945120. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:22,415][25689] Avg episode reward: [(0, '-47.248')] [2022-07-09 16:45:22,834][26022] Updated weights on worker 0-0, policy_version 333930 (0.00097) [2022-07-09 16:45:24,900][26022] Updated weights on worker 0-0, policy_version 333940 (0.00086) [2022-07-09 16:45:26,334][26022] Updated weights on worker 0-0, policy_version 333950 (0.00093) [2022-07-09 16:45:27,546][25689] Fps is (10 sec: 5539.3, 60 sec: 5600.8, 300 sec: 5627.9). Total num frames: 341969920. Throughput: 0: 5882.5. Samples: 341979138. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:27,547][25689] Avg episode reward: [(0, '-47.011')] [2022-07-09 16:45:28,439][26022] Updated weights on worker 0-0, policy_version 333960 (0.00091) [2022-07-09 16:45:30,159][26022] Updated weights on worker 0-0, policy_version 333970 (0.00108) [2022-07-09 16:45:32,005][26022] Updated weights on worker 0-0, policy_version 333980 (0.00083) [2022-07-09 16:45:32,577][25689] Fps is (10 sec: 5742.5, 60 sec: 5637.3, 300 sec: 5631.1). Total num frames: 341998592. Throughput: 0: 5016.4. Samples: 341995868. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:32,578][25689] Avg episode reward: [(0, '-47.422')] [2022-07-09 16:45:33,711][26022] Updated weights on worker 0-0, policy_version 333990 (0.00102) [2022-07-09 16:45:35,556][26022] Updated weights on worker 0-0, policy_version 334000 (0.00087) [2022-07-09 16:45:37,233][26022] Updated weights on worker 0-0, policy_version 334010 (0.00084) [2022-07-09 16:45:37,583][25689] Fps is (10 sec: 5713.2, 60 sec: 5638.3, 300 sec: 5624.6). Total num frames: 342027264. Throughput: 0: 5877.6. Samples: 342030092. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:37,584][25689] Avg episode reward: [(0, '-47.633')] [2022-07-09 16:45:39,207][26022] Updated weights on worker 0-0, policy_version 334020 (0.00082) [2022-07-09 16:45:40,738][26022] Updated weights on worker 0-0, policy_version 334030 (0.00086) [2022-07-09 16:45:42,608][25689] Fps is (10 sec: 5614.8, 60 sec: 5603.4, 300 sec: 5632.2). Total num frames: 342054912. Throughput: 0: 5914.6. Samples: 342064564. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:42,608][25689] Avg episode reward: [(0, '-47.601')] [2022-07-09 16:45:42,881][26022] Updated weights on worker 0-0, policy_version 334040 (0.00092) [2022-07-09 16:45:44,358][26022] Updated weights on worker 0-0, policy_version 334050 (0.00084) [2022-07-09 16:45:46,436][26022] Updated weights on worker 0-0, policy_version 334060 (0.00092) [2022-07-09 16:45:47,733][25689] Fps is (10 sec: 5750.6, 60 sec: 5670.9, 300 sec: 5633.6). Total num frames: 342085632. Throughput: 0: 5071.4. Samples: 342081514. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:47,734][25689] Avg episode reward: [(0, '-47.201')] [2022-07-09 16:45:48,087][26022] Updated weights on worker 0-0, policy_version 334070 (0.00092) [2022-07-09 16:45:49,872][26022] Updated weights on worker 0-0, policy_version 334080 (0.00089) [2022-07-09 16:45:51,856][26022] Updated weights on worker 0-0, policy_version 334090 (0.00094) [2022-07-09 16:45:52,763][25689] Fps is (10 sec: 5747.5, 60 sec: 5634.4, 300 sec: 5630.4). Total num frames: 342113280. Throughput: 0: 5927.8. Samples: 342115526. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:52,764][25689] Avg episode reward: [(0, '-47.711')] [2022-07-09 16:45:53,443][26022] Updated weights on worker 0-0, policy_version 334100 (0.00093) [2022-07-09 16:45:55,467][26022] Updated weights on worker 0-0, policy_version 334110 (0.00084) [2022-07-09 16:45:56,991][26022] Updated weights on worker 0-0, policy_version 334120 (0.00090) [2022-07-09 16:45:57,816][25689] Fps is (10 sec: 5585.9, 60 sec: 5616.9, 300 sec: 5629.7). Total num frames: 342141952. Throughput: 0: 5910.4. Samples: 342149674. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:45:57,816][25689] Avg episode reward: [(0, '-48.294')] [2022-07-09 16:45:58,899][26022] Updated weights on worker 0-0, policy_version 334130 (0.00094) [2022-07-09 16:46:00,726][26022] Updated weights on worker 0-0, policy_version 334140 (0.00092) [2022-07-09 16:46:02,818][25689] Fps is (10 sec: 5499.6, 60 sec: 5633.8, 300 sec: 5635.7). Total num frames: 342168576. Throughput: 0: 5054.2. Samples: 342166710. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:02,818][25689] Avg episode reward: [(0, '-48.404')] [2022-07-09 16:46:03,097][26022] Updated weights on worker 0-0, policy_version 334150 (0.00092) [2022-07-09 16:46:04,747][26022] Updated weights on worker 0-0, policy_version 334160 (0.00103) [2022-07-09 16:46:06,654][26022] Updated weights on worker 0-0, policy_version 334170 (0.00512) [2022-07-09 16:46:07,914][25689] Fps is (10 sec: 5475.8, 60 sec: 5613.9, 300 sec: 5634.3). Total num frames: 342197248. Throughput: 0: 5803.8. Samples: 342198640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:07,915][25689] Avg episode reward: [(0, '-48.654')] [2022-07-09 16:46:08,354][26022] Updated weights on worker 0-0, policy_version 334180 (0.00090) [2022-07-09 16:46:10,226][26022] Updated weights on worker 0-0, policy_version 334190 (0.00089) [2022-07-09 16:46:12,193][26022] Updated weights on worker 0-0, policy_version 334200 (0.00086) [2022-07-09 16:46:12,936][25689] Fps is (10 sec: 5667.2, 60 sec: 5629.8, 300 sec: 5634.0). Total num frames: 342225920. Throughput: 0: 5818.6. Samples: 342232906. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:12,937][25689] Avg episode reward: [(0, '-48.510')] [2022-07-09 16:46:13,659][26022] Updated weights on worker 0-0, policy_version 334210 (0.00090) [2022-07-09 16:46:15,653][26022] Updated weights on worker 0-0, policy_version 334220 (0.00102) [2022-07-09 16:46:17,259][26022] Updated weights on worker 0-0, policy_version 334230 (0.00088) [2022-07-09 16:46:17,951][25689] Fps is (10 sec: 5713.5, 60 sec: 5629.5, 300 sec: 5637.8). Total num frames: 342254592. Throughput: 0: 4992.0. Samples: 342250188. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:17,951][25689] Avg episode reward: [(0, '-47.923')] [2022-07-09 16:46:19,218][26022] Updated weights on worker 0-0, policy_version 334240 (0.00087) [2022-07-09 16:46:21,067][26022] Updated weights on worker 0-0, policy_version 334250 (0.00089) [2022-07-09 16:46:22,763][26022] Updated weights on worker 0-0, policy_version 334260 (0.00087) [2022-07-09 16:46:22,976][25689] Fps is (10 sec: 5609.8, 60 sec: 5647.3, 300 sec: 5631.6). Total num frames: 342282240. Throughput: 0: 5838.7. Samples: 342284408. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:22,976][25689] Avg episode reward: [(0, '-47.684')] [2022-07-09 16:46:24,511][26022] Updated weights on worker 0-0, policy_version 334270 (0.00117) [2022-07-09 16:46:26,496][26022] Updated weights on worker 0-0, policy_version 334280 (0.00087) [2022-07-09 16:46:28,037][25689] Fps is (10 sec: 5685.3, 60 sec: 5654.0, 300 sec: 5637.6). Total num frames: 342311936. Throughput: 0: 5960.5. Samples: 342318584. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:28,039][25689] Avg episode reward: [(0, '-47.634')] [2022-07-09 16:46:28,151][26022] Updated weights on worker 0-0, policy_version 334290 (0.00097) [2022-07-09 16:46:29,957][26022] Updated weights on worker 0-0, policy_version 334300 (0.00087) [2022-07-09 16:46:31,575][26022] Updated weights on worker 0-0, policy_version 334310 (0.00094) [2022-07-09 16:46:33,052][25689] Fps is (10 sec: 5792.7, 60 sec: 5655.5, 300 sec: 5640.9). Total num frames: 342340608. Throughput: 0: 5115.0. Samples: 342335798. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:33,053][25689] Avg episode reward: [(0, '-47.918')] [2022-07-09 16:46:33,698][26022] Updated weights on worker 0-0, policy_version 334320 (0.00093) [2022-07-09 16:46:35,205][26022] Updated weights on worker 0-0, policy_version 334330 (0.00615) [2022-07-09 16:46:37,126][26022] Updated weights on worker 0-0, policy_version 334340 (0.00092) [2022-07-09 16:46:38,073][25689] Fps is (10 sec: 5611.9, 60 sec: 5637.2, 300 sec: 5633.9). Total num frames: 342368256. Throughput: 0: 5953.6. Samples: 342369988. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:38,075][25689] Avg episode reward: [(0, '-48.019')] [2022-07-09 16:46:38,735][26022] Updated weights on worker 0-0, policy_version 334350 (0.00086) [2022-07-09 16:46:40,769][26022] Updated weights on worker 0-0, policy_version 334360 (0.00085) [2022-07-09 16:46:42,464][26022] Updated weights on worker 0-0, policy_version 334370 (0.00085) [2022-07-09 16:46:43,118][25689] Fps is (10 sec: 5696.8, 60 sec: 5669.1, 300 sec: 5635.3). Total num frames: 342397952. Throughput: 0: 5947.9. Samples: 342404210. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:43,118][25689] Avg episode reward: [(0, '-49.238')] [2022-07-09 16:46:44,375][26022] Updated weights on worker 0-0, policy_version 334380 (0.00086) [2022-07-09 16:46:46,268][26022] Updated weights on worker 0-0, policy_version 334390 (0.01026) [2022-07-09 16:46:47,856][26022] Updated weights on worker 0-0, policy_version 334400 (0.00093) [2022-07-09 16:46:48,198][25689] Fps is (10 sec: 5764.7, 60 sec: 5639.5, 300 sec: 5637.8). Total num frames: 342426624. Throughput: 0: 5089.5. Samples: 342421192. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:48,199][25689] Avg episode reward: [(0, '-49.743')] [2022-07-09 16:46:49,919][26022] Updated weights on worker 0-0, policy_version 334410 (0.00086) [2022-07-09 16:46:51,481][26022] Updated weights on worker 0-0, policy_version 334420 (0.00085) [2022-07-09 16:46:53,232][25689] Fps is (10 sec: 5568.2, 60 sec: 5639.1, 300 sec: 5630.4). Total num frames: 342454272. Throughput: 0: 5921.4. Samples: 342455294. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:53,233][25689] Avg episode reward: [(0, '-49.163')] [2022-07-09 16:46:53,478][26022] Updated weights on worker 0-0, policy_version 334430 (0.00085) [2022-07-09 16:46:55,211][26022] Updated weights on worker 0-0, policy_version 334440 (0.00096) [2022-07-09 16:46:56,814][26022] Updated weights on worker 0-0, policy_version 334450 (0.00091) [2022-07-09 16:46:58,263][25689] Fps is (10 sec: 5595.6, 60 sec: 5641.1, 300 sec: 5637.3). Total num frames: 342482944. Throughput: 0: 5907.4. Samples: 342489260. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:46:58,264][25689] Avg episode reward: [(0, '-49.284')] [2022-07-09 16:46:59,005][26022] Updated weights on worker 0-0, policy_version 334460 (0.00085) [2022-07-09 16:47:00,350][26022] Updated weights on worker 0-0, policy_version 334470 (0.00087) [2022-07-09 16:47:02,856][26022] Updated weights on worker 0-0, policy_version 334480 (0.00091) [2022-07-09 16:47:03,359][25689] Fps is (10 sec: 5460.3, 60 sec: 5632.4, 300 sec: 5633.0). Total num frames: 342509568. Throughput: 0: 5049.1. Samples: 342506412. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:03,360][25689] Avg episode reward: [(0, '-48.418')] [2022-07-09 16:47:04,591][26022] Updated weights on worker 0-0, policy_version 334490 (0.00086) [2022-07-09 16:47:06,372][26022] Updated weights on worker 0-0, policy_version 334500 (0.00085) [2022-07-09 16:47:08,460][25689] Fps is (10 sec: 5422.9, 60 sec: 5632.0, 300 sec: 5635.1). Total num frames: 342538240. Throughput: 0: 5789.5. Samples: 342538498. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:08,460][25689] Avg episode reward: [(0, '-47.936')] [2022-07-09 16:47:08,495][26022] Updated weights on worker 0-0, policy_version 334511 (0.00088) [2022-07-09 16:47:10,022][26022] Updated weights on worker 0-0, policy_version 334521 (0.00098) [2022-07-09 16:47:12,002][26022] Updated weights on worker 0-0, policy_version 334531 (0.00091) [2022-07-09 16:47:13,481][25689] Fps is (10 sec: 5766.7, 60 sec: 5649.0, 300 sec: 5639.4). Total num frames: 342567936. Throughput: 0: 5805.8. Samples: 342572852. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:13,482][25689] Avg episode reward: [(0, '-47.587')] [2022-07-09 16:47:13,777][26022] Updated weights on worker 0-0, policy_version 334541 (0.00088) [2022-07-09 16:47:14,501][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:47:14,514][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000334545_342574080.pth [2022-07-09 16:47:14,515][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000332562_340543488.pth [2022-07-09 16:47:15,614][26022] Updated weights on worker 0-0, policy_version 334551 (0.00083) [2022-07-09 16:47:17,345][26022] Updated weights on worker 0-0, policy_version 334561 (0.00086) [2022-07-09 16:47:18,492][25689] Fps is (10 sec: 5818.2, 60 sec: 5649.3, 300 sec: 5636.6). Total num frames: 342596608. Throughput: 0: 4980.7. Samples: 342590014. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:18,492][25689] Avg episode reward: [(0, '-47.360')] [2022-07-09 16:47:19,027][26022] Updated weights on worker 0-0, policy_version 334571 (0.00093) [2022-07-09 16:47:21,066][26022] Updated weights on worker 0-0, policy_version 334581 (0.01130) [2022-07-09 16:47:22,583][26022] Updated weights on worker 0-0, policy_version 334591 (0.00093) [2022-07-09 16:47:23,505][25689] Fps is (10 sec: 5618.5, 60 sec: 5650.4, 300 sec: 5640.6). Total num frames: 342624256. Throughput: 0: 5827.8. Samples: 342623816. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:23,505][25689] Avg episode reward: [(0, '-47.591')] [2022-07-09 16:47:24,815][26022] Updated weights on worker 0-0, policy_version 334601 (0.00081) [2022-07-09 16:47:26,528][26022] Updated weights on worker 0-0, policy_version 334611 (0.00085) [2022-07-09 16:47:28,231][26022] Updated weights on worker 0-0, policy_version 334621 (0.00084) [2022-07-09 16:47:28,639][25689] Fps is (10 sec: 5550.3, 60 sec: 5626.8, 300 sec: 5634.8). Total num frames: 342652928. Throughput: 0: 5905.9. Samples: 342657674. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:28,639][25689] Avg episode reward: [(0, '-47.109')] [2022-07-09 16:47:29,928][26022] Updated weights on worker 0-0, policy_version 334631 (0.00090) [2022-07-09 16:47:31,916][26022] Updated weights on worker 0-0, policy_version 334641 (0.00095) [2022-07-09 16:47:33,671][25689] Fps is (10 sec: 5640.9, 60 sec: 5625.2, 300 sec: 5631.1). Total num frames: 342681600. Throughput: 0: 5891.8. Samples: 342691806. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:33,671][25689] Avg episode reward: [(0, '-47.613')] [2022-07-09 16:47:33,707][26022] Updated weights on worker 0-0, policy_version 334651 (0.00093) [2022-07-09 16:47:35,403][26022] Updated weights on worker 0-0, policy_version 334661 (0.00097) [2022-07-09 16:47:37,275][26022] Updated weights on worker 0-0, policy_version 334671 (0.00089) [2022-07-09 16:47:38,695][25689] Fps is (10 sec: 5702.6, 60 sec: 5641.8, 300 sec: 5634.2). Total num frames: 342710272. Throughput: 0: 5883.7. Samples: 342708882. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:38,695][25689] Avg episode reward: [(0, '-46.944')] [2022-07-09 16:47:39,076][26022] Updated weights on worker 0-0, policy_version 334681 (0.00088) [2022-07-09 16:47:41,075][26022] Updated weights on worker 0-0, policy_version 334691 (0.00092) [2022-07-09 16:47:42,577][26022] Updated weights on worker 0-0, policy_version 334701 (0.00089) [2022-07-09 16:47:43,723][25689] Fps is (10 sec: 5704.8, 60 sec: 5626.5, 300 sec: 5631.8). Total num frames: 342738944. Throughput: 0: 5890.6. Samples: 342742910. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:43,725][25689] Avg episode reward: [(0, '-47.121')] [2022-07-09 16:47:44,482][26022] Updated weights on worker 0-0, policy_version 334711 (0.00087) [2022-07-09 16:47:46,178][26022] Updated weights on worker 0-0, policy_version 334721 (0.00089) [2022-07-09 16:47:48,098][26022] Updated weights on worker 0-0, policy_version 334731 (0.00091) [2022-07-09 16:47:48,792][25689] Fps is (10 sec: 5577.7, 60 sec: 5610.6, 300 sec: 5627.3). Total num frames: 342766592. Throughput: 0: 5920.7. Samples: 342776996. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-09 16:47:48,793][25689] Avg episode reward: [(0, '-46.555')] [2022-07-09 16:47:49,920][26022] Updated weights on worker 0-0, policy_version 334741 (0.00096) [2022-07-09 16:47:51,755][26022] Updated weights on worker 0-0, policy_version 334751 (0.00080) [2022-07-09 16:47:53,464][26022] Updated weights on worker 0-0, policy_version 334761 (0.00086) [2022-07-09 16:47:53,867][25689] Fps is (10 sec: 5753.5, 60 sec: 5657.5, 300 sec: 5637.5). Total num frames: 342797312. Throughput: 0: 5058.4. Samples: 342793968. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:47:53,868][25689] Avg episode reward: [(0, '-46.016')] [2022-07-09 16:47:55,542][26022] Updated weights on worker 0-0, policy_version 334771 (0.00088) [2022-07-09 16:47:57,059][26022] Updated weights on worker 0-0, policy_version 334781 (0.00084) [2022-07-09 16:47:58,883][25689] Fps is (10 sec: 5784.1, 60 sec: 5641.9, 300 sec: 5631.3). Total num frames: 342824960. Throughput: 0: 5907.7. Samples: 342828150. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:47:58,885][25689] Avg episode reward: [(0, '-46.776')] [2022-07-09 16:47:59,026][26022] Updated weights on worker 0-0, policy_version 334791 (0.00099) [2022-07-09 16:48:00,764][26022] Updated weights on worker 0-0, policy_version 334801 (0.00086) [2022-07-09 16:48:02,978][26022] Updated weights on worker 0-0, policy_version 334811 (0.00094) [2022-07-09 16:48:03,896][25689] Fps is (10 sec: 5411.7, 60 sec: 5649.7, 300 sec: 5638.7). Total num frames: 342851584. Throughput: 0: 5814.5. Samples: 342860208. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:03,898][25689] Avg episode reward: [(0, '-45.771')] [2022-07-09 16:48:04,700][26022] Updated weights on worker 0-0, policy_version 334821 (0.00092) [2022-07-09 16:48:06,516][26022] Updated weights on worker 0-0, policy_version 334831 (0.00095) [2022-07-09 16:48:08,473][26022] Updated weights on worker 0-0, policy_version 334841 (0.00085) [2022-07-09 16:48:09,031][25689] Fps is (10 sec: 5449.2, 60 sec: 5646.5, 300 sec: 5629.3). Total num frames: 342880256. Throughput: 0: 4945.8. Samples: 342877092. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:09,039][25689] Avg episode reward: [(0, '-45.601')] [2022-07-09 16:48:10,298][26022] Updated weights on worker 0-0, policy_version 334851 (0.00091) [2022-07-09 16:48:12,057][26022] Updated weights on worker 0-0, policy_version 334861 (0.00089) [2022-07-09 16:48:13,985][26022] Updated weights on worker 0-0, policy_version 334871 (0.00096) [2022-07-09 16:48:14,077][25689] Fps is (10 sec: 5532.1, 60 sec: 5610.4, 300 sec: 5628.5). Total num frames: 342907904. Throughput: 0: 5783.1. Samples: 342910840. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:14,077][25689] Avg episode reward: [(0, '-45.470')] [2022-07-09 16:48:15,608][26022] Updated weights on worker 0-0, policy_version 334881 (0.00095) [2022-07-09 16:48:17,420][26022] Updated weights on worker 0-0, policy_version 334891 (0.00089) [2022-07-09 16:48:19,138][25689] Fps is (10 sec: 5774.7, 60 sec: 5639.5, 300 sec: 5638.1). Total num frames: 342938624. Throughput: 0: 5770.6. Samples: 342945034. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:19,139][25689] Avg episode reward: [(0, '-46.054')] [2022-07-09 16:48:19,142][26022] Updated weights on worker 0-0, policy_version 334901 (0.00083) [2022-07-09 16:48:21,244][26022] Updated weights on worker 0-0, policy_version 334911 (0.00090) [2022-07-09 16:48:22,851][26022] Updated weights on worker 0-0, policy_version 334921 (0.00093) [2022-07-09 16:48:24,170][25689] Fps is (10 sec: 5681.6, 60 sec: 5620.9, 300 sec: 5625.1). Total num frames: 342965248. Throughput: 0: 5006.6. Samples: 342961710. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:24,171][25689] Avg episode reward: [(0, '-45.699')] [2022-07-09 16:48:24,983][26022] Updated weights on worker 0-0, policy_version 334931 (0.00087) [2022-07-09 16:48:26,537][26022] Updated weights on worker 0-0, policy_version 334941 (0.00087) [2022-07-09 16:48:28,497][26022] Updated weights on worker 0-0, policy_version 334951 (0.00089) [2022-07-09 16:48:29,220][25689] Fps is (10 sec: 5586.3, 60 sec: 5645.6, 300 sec: 5632.5). Total num frames: 342994944. Throughput: 0: 5867.2. Samples: 342995548. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:29,221][25689] Avg episode reward: [(0, '-46.064')] [2022-07-09 16:48:30,174][26022] Updated weights on worker 0-0, policy_version 334961 (0.00748) [2022-07-09 16:48:32,065][26022] Updated weights on worker 0-0, policy_version 334971 (0.00088) [2022-07-09 16:48:33,688][26022] Updated weights on worker 0-0, policy_version 334981 (0.00094) [2022-07-09 16:48:34,242][25689] Fps is (10 sec: 5693.1, 60 sec: 5629.6, 300 sec: 5629.2). Total num frames: 343022592. Throughput: 0: 5875.3. Samples: 343029318. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:34,244][25689] Avg episode reward: [(0, '-46.229')] [2022-07-09 16:48:35,768][26022] Updated weights on worker 0-0, policy_version 334991 (0.00089) [2022-07-09 16:48:37,459][26022] Updated weights on worker 0-0, policy_version 335001 (0.00091) [2022-07-09 16:48:39,302][25689] Fps is (10 sec: 5586.1, 60 sec: 5626.2, 300 sec: 5629.1). Total num frames: 343051264. Throughput: 0: 5019.5. Samples: 343046248. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:39,303][25689] Avg episode reward: [(0, '-46.929')] [2022-07-09 16:48:39,303][26022] Updated weights on worker 0-0, policy_version 335011 (0.00093) [2022-07-09 16:48:40,925][26022] Updated weights on worker 0-0, policy_version 335021 (0.00083) [2022-07-09 16:48:43,037][26022] Updated weights on worker 0-0, policy_version 335031 (0.00086) [2022-07-09 16:48:44,355][25689] Fps is (10 sec: 5771.5, 60 sec: 5640.8, 300 sec: 5633.1). Total num frames: 343080960. Throughput: 0: 5879.8. Samples: 343080398. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:44,356][25689] Avg episode reward: [(0, '-46.699')] [2022-07-09 16:48:44,504][26022] Updated weights on worker 0-0, policy_version 335041 (0.00084) [2022-07-09 16:48:46,539][26022] Updated weights on worker 0-0, policy_version 335051 (0.00090) [2022-07-09 16:48:48,174][26022] Updated weights on worker 0-0, policy_version 335061 (0.00090) [2022-07-09 16:48:49,455][25689] Fps is (10 sec: 5648.0, 60 sec: 5637.9, 300 sec: 5625.8). Total num frames: 343108608. Throughput: 0: 5890.1. Samples: 343114734. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:49,457][25689] Avg episode reward: [(0, '-46.390')] [2022-07-09 16:48:50,048][26022] Updated weights on worker 0-0, policy_version 335071 (0.00092) [2022-07-09 16:48:52,101][26022] Updated weights on worker 0-0, policy_version 335081 (0.00089) [2022-07-09 16:48:53,680][26022] Updated weights on worker 0-0, policy_version 335091 (0.00095) [2022-07-09 16:48:54,459][25689] Fps is (10 sec: 5472.7, 60 sec: 5593.9, 300 sec: 5626.0). Total num frames: 343136256. Throughput: 0: 5043.6. Samples: 343131286. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:54,461][25689] Avg episode reward: [(0, '-46.306')] [2022-07-09 16:48:55,359][26022] Updated weights on worker 0-0, policy_version 335101 (0.00093) [2022-07-09 16:48:57,558][26022] Updated weights on worker 0-0, policy_version 335111 (0.00082) [2022-07-09 16:48:58,827][26022] Updated weights on worker 0-0, policy_version 335121 (0.00093) [2022-07-09 16:48:59,514][25689] Fps is (10 sec: 5802.7, 60 sec: 5641.0, 300 sec: 5635.7). Total num frames: 343166976. Throughput: 0: 5909.1. Samples: 343165680. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:48:59,515][25689] Avg episode reward: [(0, '-46.604')] [2022-07-09 16:49:01,136][26022] Updated weights on worker 0-0, policy_version 335131 (0.00097) [2022-07-09 16:49:02,775][26022] Updated weights on worker 0-0, policy_version 335141 (0.00082) [2022-07-09 16:49:04,541][25689] Fps is (10 sec: 5382.9, 60 sec: 5588.9, 300 sec: 5624.1). Total num frames: 343190528. Throughput: 0: 5812.3. Samples: 343197726. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:04,543][25689] Avg episode reward: [(0, '-46.612')] [2022-07-09 16:49:05,194][26022] Updated weights on worker 0-0, policy_version 335151 (0.00090) [2022-07-09 16:49:06,624][26022] Updated weights on worker 0-0, policy_version 335161 (0.00093) [2022-07-09 16:49:08,673][26022] Updated weights on worker 0-0, policy_version 335171 (0.00097) [2022-07-09 16:49:09,614][25689] Fps is (10 sec: 5373.7, 60 sec: 5628.5, 300 sec: 5631.2). Total num frames: 343221248. Throughput: 0: 4955.5. Samples: 343214628. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:09,614][25689] Avg episode reward: [(0, '-46.854')] [2022-07-09 16:49:10,323][26022] Updated weights on worker 0-0, policy_version 335181 (0.00089) [2022-07-09 16:49:12,070][26022] Updated weights on worker 0-0, policy_version 335191 (0.00092) [2022-07-09 16:49:13,899][26022] Updated weights on worker 0-0, policy_version 335201 (0.00096) [2022-07-09 16:49:14,596][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:49:14,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000335205_343249920.pth [2022-07-09 16:49:14,619][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000333224_341221376.pth [2022-07-09 16:49:14,635][25689] Fps is (10 sec: 5782.9, 60 sec: 5630.8, 300 sec: 5634.5). Total num frames: 343248896. Throughput: 0: 5832.0. Samples: 343248950. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:14,635][25689] Avg episode reward: [(0, '-46.504')] [2022-07-09 16:49:15,584][26022] Updated weights on worker 0-0, policy_version 335211 (0.00094) [2022-07-09 16:49:17,506][26022] Updated weights on worker 0-0, policy_version 335221 (0.00099) [2022-07-09 16:49:19,363][26022] Updated weights on worker 0-0, policy_version 335231 (0.00095) [2022-07-09 16:49:19,653][25689] Fps is (10 sec: 5609.8, 60 sec: 5601.0, 300 sec: 5627.7). Total num frames: 343277568. Throughput: 0: 5843.3. Samples: 343283360. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:19,654][25689] Avg episode reward: [(0, '-46.976')] [2022-07-09 16:49:21,058][26022] Updated weights on worker 0-0, policy_version 335241 (0.00080) [2022-07-09 16:49:22,900][26022] Updated weights on worker 0-0, policy_version 335251 (0.00107) [2022-07-09 16:49:24,686][25689] Fps is (10 sec: 5705.2, 60 sec: 5634.7, 300 sec: 5628.2). Total num frames: 343306240. Throughput: 0: 5083.0. Samples: 343300120. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:24,686][25689] Avg episode reward: [(0, '-46.759')] [2022-07-09 16:49:24,774][26022] Updated weights on worker 0-0, policy_version 335261 (0.00086) [2022-07-09 16:49:26,615][26022] Updated weights on worker 0-0, policy_version 335271 (0.00098) [2022-07-09 16:49:28,478][26022] Updated weights on worker 0-0, policy_version 335281 (0.00084) [2022-07-09 16:49:29,783][25689] Fps is (10 sec: 5761.9, 60 sec: 5630.4, 300 sec: 5637.8). Total num frames: 343335936. Throughput: 0: 5902.2. Samples: 343333672. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:29,784][25689] Avg episode reward: [(0, '-46.512')] [2022-07-09 16:49:30,354][26022] Updated weights on worker 0-0, policy_version 335291 (0.00094) [2022-07-09 16:49:32,036][26022] Updated weights on worker 0-0, policy_version 335301 (0.00094) [2022-07-09 16:49:34,013][26022] Updated weights on worker 0-0, policy_version 335311 (0.00084) [2022-07-09 16:49:34,875][25689] Fps is (10 sec: 5527.5, 60 sec: 5607.0, 300 sec: 5629.5). Total num frames: 343362560. Throughput: 0: 5871.8. Samples: 343367796. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:34,875][25689] Avg episode reward: [(0, '-46.172')] [2022-07-09 16:49:35,534][26022] Updated weights on worker 0-0, policy_version 335321 (0.00089) [2022-07-09 16:49:37,582][26022] Updated weights on worker 0-0, policy_version 335331 (0.00089) [2022-07-09 16:49:39,221][26022] Updated weights on worker 0-0, policy_version 335341 (0.00085) [2022-07-09 16:49:39,883][25689] Fps is (10 sec: 5576.3, 60 sec: 5628.7, 300 sec: 5629.6). Total num frames: 343392256. Throughput: 0: 5021.6. Samples: 343384946. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:39,884][25689] Avg episode reward: [(0, '-46.659')] [2022-07-09 16:49:41,011][26022] Updated weights on worker 0-0, policy_version 335351 (0.00080) [2022-07-09 16:49:43,000][26022] Updated weights on worker 0-0, policy_version 335361 (0.00086) [2022-07-09 16:49:44,540][26022] Updated weights on worker 0-0, policy_version 335371 (0.00089) [2022-07-09 16:49:44,933][25689] Fps is (10 sec: 5904.7, 60 sec: 5629.0, 300 sec: 5641.3). Total num frames: 343421952. Throughput: 0: 5880.8. Samples: 343419192. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:44,934][25689] Avg episode reward: [(0, '-47.033')] [2022-07-09 16:49:46,543][26022] Updated weights on worker 0-0, policy_version 335381 (0.00083) [2022-07-09 16:49:48,235][26022] Updated weights on worker 0-0, policy_version 335391 (0.00095) [2022-07-09 16:49:50,063][25689] Fps is (10 sec: 5632.8, 60 sec: 5626.2, 300 sec: 5632.0). Total num frames: 343449600. Throughput: 0: 5903.2. Samples: 343453390. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:50,064][25689] Avg episode reward: [(0, '-47.477')] [2022-07-09 16:49:50,169][26022] Updated weights on worker 0-0, policy_version 335401 (0.00090) [2022-07-09 16:49:51,959][26022] Updated weights on worker 0-0, policy_version 335411 (0.00082) [2022-07-09 16:49:53,712][26022] Updated weights on worker 0-0, policy_version 335421 (0.00083) [2022-07-09 16:49:55,092][25689] Fps is (10 sec: 5543.9, 60 sec: 5640.8, 300 sec: 5628.9). Total num frames: 343478272. Throughput: 0: 5915.4. Samples: 343487390. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:49:55,093][25689] Avg episode reward: [(0, '-48.128')] [2022-07-09 16:49:55,463][26022] Updated weights on worker 0-0, policy_version 335431 (0.00087) [2022-07-09 16:49:57,259][26022] Updated weights on worker 0-0, policy_version 335441 (0.00084) [2022-07-09 16:49:58,999][26022] Updated weights on worker 0-0, policy_version 335451 (0.00090) [2022-07-09 16:50:00,174][25689] Fps is (10 sec: 5671.8, 60 sec: 5604.5, 300 sec: 5637.7). Total num frames: 343506944. Throughput: 0: 5896.4. Samples: 343504588. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:50:00,174][25689] Avg episode reward: [(0, '-48.108')] [2022-07-09 16:50:00,991][26022] Updated weights on worker 0-0, policy_version 335461 (0.00094) [2022-07-09 16:50:02,833][26022] Updated weights on worker 0-0, policy_version 335471 (0.00086) [2022-07-09 16:50:04,915][26022] Updated weights on worker 0-0, policy_version 335481 (0.00095) [2022-07-09 16:50:05,183][25689] Fps is (10 sec: 5480.0, 60 sec: 5656.9, 300 sec: 5628.4). Total num frames: 343533568. Throughput: 0: 5807.9. Samples: 343536798. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:50:05,183][25689] Avg episode reward: [(0, '-48.122')] [2022-07-09 16:50:06,572][26022] Updated weights on worker 0-0, policy_version 335491 (0.00088) [2022-07-09 16:50:08,431][26022] Updated weights on worker 0-0, policy_version 335501 (0.00085) [2022-07-09 16:50:10,216][25689] Fps is (10 sec: 5506.5, 60 sec: 5626.7, 300 sec: 5631.4). Total num frames: 343562240. Throughput: 0: 5837.6. Samples: 343571030. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:50:10,216][25689] Avg episode reward: [(0, '-48.682')] [2022-07-09 16:50:10,294][26022] Updated weights on worker 0-0, policy_version 335511 (0.00086) [2022-07-09 16:50:11,946][26022] Updated weights on worker 0-0, policy_version 335521 (0.00096) [2022-07-09 16:50:13,895][26022] Updated weights on worker 0-0, policy_version 335531 (0.00093) [2022-07-09 16:50:15,230][25689] Fps is (10 sec: 5707.7, 60 sec: 5644.3, 300 sec: 5631.4). Total num frames: 343590912. Throughput: 0: 5005.5. Samples: 343588188. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:50:15,230][25689] Avg episode reward: [(0, '-48.651')] [2022-07-09 16:50:15,663][26022] Updated weights on worker 0-0, policy_version 335541 (0.00094) [2022-07-09 16:50:17,359][26022] Updated weights on worker 0-0, policy_version 335551 (0.00094) [2022-07-09 16:50:19,303][26022] Updated weights on worker 0-0, policy_version 335561 (0.00102) [2022-07-09 16:50:20,316][25689] Fps is (10 sec: 5880.4, 60 sec: 5671.8, 300 sec: 5644.2). Total num frames: 343621632. Throughput: 0: 5846.8. Samples: 343622354. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-09 16:50:20,316][25689] Avg episode reward: [(0, '-48.479')] [2022-07-09 16:50:20,995][26022] Updated weights on worker 0-0, policy_version 335571 (0.00078) [2022-07-09 16:50:22,782][26022] Updated weights on worker 0-0, policy_version 335581 (0.00089) [2022-07-09 16:50:24,684][26022] Updated weights on worker 0-0, policy_version 335591 (0.00090) [2022-07-09 16:50:25,328][25689] Fps is (10 sec: 5577.1, 60 sec: 5623.0, 300 sec: 5632.7). Total num frames: 343647232. Throughput: 0: 5930.9. Samples: 343656278. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:25,329][25689] Avg episode reward: [(0, '-48.365')] [2022-07-09 16:50:26,427][26022] Updated weights on worker 0-0, policy_version 335601 (0.00081) [2022-07-09 16:50:28,386][26022] Updated weights on worker 0-0, policy_version 335611 (0.00086) [2022-07-09 16:50:30,119][26022] Updated weights on worker 0-0, policy_version 335621 (0.00080) [2022-07-09 16:50:30,364][25689] Fps is (10 sec: 5503.1, 60 sec: 5628.7, 300 sec: 5636.0). Total num frames: 343676928. Throughput: 0: 5061.5. Samples: 343673010. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:30,365][25689] Avg episode reward: [(0, '-48.218')] [2022-07-09 16:50:32,142][26022] Updated weights on worker 0-0, policy_version 335631 (0.00086) [2022-07-09 16:50:33,826][26022] Updated weights on worker 0-0, policy_version 335641 (0.00091) [2022-07-09 16:50:35,371][25689] Fps is (10 sec: 5710.1, 60 sec: 5653.6, 300 sec: 5632.6). Total num frames: 343704576. Throughput: 0: 5881.5. Samples: 343706648. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:35,371][25689] Avg episode reward: [(0, '-48.220')] [2022-07-09 16:50:35,739][26022] Updated weights on worker 0-0, policy_version 335651 (0.00090) [2022-07-09 16:50:37,367][26022] Updated weights on worker 0-0, policy_version 335661 (0.00091) [2022-07-09 16:50:39,395][26022] Updated weights on worker 0-0, policy_version 335671 (0.00094) [2022-07-09 16:50:40,379][25689] Fps is (10 sec: 5623.6, 60 sec: 5636.6, 300 sec: 5636.3). Total num frames: 343733248. Throughput: 0: 5895.0. Samples: 343740626. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:40,379][25689] Avg episode reward: [(0, '-47.384')] [2022-07-09 16:50:41,103][26022] Updated weights on worker 0-0, policy_version 335681 (0.00100) [2022-07-09 16:50:42,928][26022] Updated weights on worker 0-0, policy_version 335691 (0.00096) [2022-07-09 16:50:44,580][26022] Updated weights on worker 0-0, policy_version 335701 (0.00104) [2022-07-09 16:50:45,479][25689] Fps is (10 sec: 5571.4, 60 sec: 5598.1, 300 sec: 5626.5). Total num frames: 343760896. Throughput: 0: 5024.4. Samples: 343757526. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:45,480][25689] Avg episode reward: [(0, '-47.349')] [2022-07-09 16:50:46,408][26022] Updated weights on worker 0-0, policy_version 335711 (0.00089) [2022-07-09 16:50:48,343][26022] Updated weights on worker 0-0, policy_version 335721 (0.00090) [2022-07-09 16:50:50,148][26022] Updated weights on worker 0-0, policy_version 335731 (0.00085) [2022-07-09 16:50:50,524][25689] Fps is (10 sec: 5652.3, 60 sec: 5639.9, 300 sec: 5633.1). Total num frames: 343790592. Throughput: 0: 5878.7. Samples: 343791526. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:50,527][25689] Avg episode reward: [(0, '-46.995')] [2022-07-09 16:50:51,971][26022] Updated weights on worker 0-0, policy_version 335741 (0.00099) [2022-07-09 16:50:53,787][26022] Updated weights on worker 0-0, policy_version 335752 (0.00087) [2022-07-09 16:50:55,573][25689] Fps is (10 sec: 5782.6, 60 sec: 5638.0, 300 sec: 5633.1). Total num frames: 343819264. Throughput: 0: 5884.6. Samples: 343825532. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:50:55,574][25689] Avg episode reward: [(0, '-46.828')] [2022-07-09 16:50:55,739][26022] Updated weights on worker 0-0, policy_version 335762 (0.00087) [2022-07-09 16:50:57,505][26022] Updated weights on worker 0-0, policy_version 335772 (0.00085) [2022-07-09 16:50:59,452][26022] Updated weights on worker 0-0, policy_version 335782 (0.00084) [2022-07-09 16:51:00,652][25689] Fps is (10 sec: 5662.3, 60 sec: 5638.3, 300 sec: 5638.6). Total num frames: 343847936. Throughput: 0: 5026.1. Samples: 343842518. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:00,652][25689] Avg episode reward: [(0, '-47.696')] [2022-07-09 16:51:00,991][26022] Updated weights on worker 0-0, policy_version 335792 (0.00085) [2022-07-09 16:51:03,288][26022] Updated weights on worker 0-0, policy_version 335802 (0.00089) [2022-07-09 16:51:05,054][26022] Updated weights on worker 0-0, policy_version 335812 (0.00088) [2022-07-09 16:51:05,698][25689] Fps is (10 sec: 5461.6, 60 sec: 5634.8, 300 sec: 5632.7). Total num frames: 343874560. Throughput: 0: 5792.9. Samples: 343874648. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:05,698][25689] Avg episode reward: [(0, '-46.501')] [2022-07-09 16:51:06,753][26022] Updated weights on worker 0-0, policy_version 335822 (0.00091) [2022-07-09 16:51:08,707][26022] Updated weights on worker 0-0, policy_version 335832 (0.00084) [2022-07-09 16:51:10,421][26022] Updated weights on worker 0-0, policy_version 335842 (0.00079) [2022-07-09 16:51:10,779][25689] Fps is (10 sec: 5460.2, 60 sec: 5630.4, 300 sec: 5631.5). Total num frames: 343903232. Throughput: 0: 5789.4. Samples: 343908788. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:10,779][25689] Avg episode reward: [(0, '-46.581')] [2022-07-09 16:51:12,340][26022] Updated weights on worker 0-0, policy_version 335852 (0.00098) [2022-07-09 16:51:13,839][26022] Updated weights on worker 0-0, policy_version 335862 (0.00091) [2022-07-09 16:51:14,852][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:51:14,866][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000335866_343926784.pth [2022-07-09 16:51:14,867][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000333884_341897216.pth [2022-07-09 16:51:14,867][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000335866_343926784.pth.milestone [2022-07-09 16:51:15,788][25689] Fps is (10 sec: 5683.3, 60 sec: 5630.8, 300 sec: 5631.6). Total num frames: 343931904. Throughput: 0: 4975.9. Samples: 343926118. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:15,788][25689] Avg episode reward: [(0, '-47.507')] [2022-07-09 16:51:15,821][26022] Updated weights on worker 0-0, policy_version 335872 (0.00090) [2022-07-09 16:51:17,516][26022] Updated weights on worker 0-0, policy_version 335882 (0.00090) [2022-07-09 16:51:19,466][26022] Updated weights on worker 0-0, policy_version 335892 (0.00089) [2022-07-09 16:51:20,812][25689] Fps is (10 sec: 5817.5, 60 sec: 5619.7, 300 sec: 5638.5). Total num frames: 343961600. Throughput: 0: 5846.0. Samples: 343960374. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:20,812][25689] Avg episode reward: [(0, '-47.935')] [2022-07-09 16:51:21,094][26022] Updated weights on worker 0-0, policy_version 335902 (0.00089) [2022-07-09 16:51:22,981][26022] Updated weights on worker 0-0, policy_version 335912 (0.00093) [2022-07-09 16:51:24,743][26022] Updated weights on worker 0-0, policy_version 335922 (0.00095) [2022-07-09 16:51:25,826][25689] Fps is (10 sec: 5610.4, 60 sec: 5636.4, 300 sec: 5629.1). Total num frames: 343988224. Throughput: 0: 5972.2. Samples: 343994860. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:25,827][25689] Avg episode reward: [(0, '-47.629')] [2022-07-09 16:51:26,608][26022] Updated weights on worker 0-0, policy_version 335932 (0.00085) [2022-07-09 16:51:28,379][26022] Updated weights on worker 0-0, policy_version 335942 (0.00088) [2022-07-09 16:51:30,129][26022] Updated weights on worker 0-0, policy_version 335952 (0.00092) [2022-07-09 16:51:30,927][25689] Fps is (10 sec: 5567.9, 60 sec: 5630.4, 300 sec: 5630.9). Total num frames: 344017920. Throughput: 0: 5103.4. Samples: 344011612. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:30,928][25689] Avg episode reward: [(0, '-48.564')] [2022-07-09 16:51:32,187][26022] Updated weights on worker 0-0, policy_version 335962 (0.00087) [2022-07-09 16:51:33,844][26022] Updated weights on worker 0-0, policy_version 335972 (0.00107) [2022-07-09 16:51:35,657][26022] Updated weights on worker 0-0, policy_version 335982 (0.00094) [2022-07-09 16:51:35,949][25689] Fps is (10 sec: 5867.0, 60 sec: 5662.7, 300 sec: 5637.8). Total num frames: 344047616. Throughput: 0: 5930.6. Samples: 344045688. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:35,951][25689] Avg episode reward: [(0, '-48.758')] [2022-07-09 16:51:37,474][26022] Updated weights on worker 0-0, policy_version 335992 (0.00085) [2022-07-09 16:51:39,123][26022] Updated weights on worker 0-0, policy_version 336002 (0.00086) [2022-07-09 16:51:40,961][25689] Fps is (10 sec: 5714.9, 60 sec: 5645.5, 300 sec: 5631.5). Total num frames: 344075264. Throughput: 0: 5941.9. Samples: 344080098. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:40,962][25689] Avg episode reward: [(0, '-48.314')] [2022-07-09 16:51:41,021][26022] Updated weights on worker 0-0, policy_version 336012 (0.00094) [2022-07-09 16:51:42,698][26022] Updated weights on worker 0-0, policy_version 336022 (0.00083) [2022-07-09 16:51:44,695][26022] Updated weights on worker 0-0, policy_version 336032 (0.00086) [2022-07-09 16:51:45,979][25689] Fps is (10 sec: 5717.3, 60 sec: 5687.0, 300 sec: 5636.1). Total num frames: 344104960. Throughput: 0: 5085.4. Samples: 344097346. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:45,980][25689] Avg episode reward: [(0, '-48.177')] [2022-07-09 16:51:46,271][26022] Updated weights on worker 0-0, policy_version 336042 (0.00089) [2022-07-09 16:51:48,093][26022] Updated weights on worker 0-0, policy_version 336052 (0.00085) [2022-07-09 16:51:49,735][26022] Updated weights on worker 0-0, policy_version 336062 (0.00096) [2022-07-09 16:51:51,100][25689] Fps is (10 sec: 5656.0, 60 sec: 5646.1, 300 sec: 5634.5). Total num frames: 344132608. Throughput: 0: 5966.5. Samples: 344131974. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:51,100][25689] Avg episode reward: [(0, '-48.550')] [2022-07-09 16:51:51,738][26022] Updated weights on worker 0-0, policy_version 336072 (0.00084) [2022-07-09 16:51:53,408][26022] Updated weights on worker 0-0, policy_version 336082 (0.00092) [2022-07-09 16:51:55,273][26022] Updated weights on worker 0-0, policy_version 336092 (0.00085) [2022-07-09 16:51:56,106][25689] Fps is (10 sec: 5561.6, 60 sec: 5650.1, 300 sec: 5635.0). Total num frames: 344161280. Throughput: 0: 5979.9. Samples: 344166222. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:51:56,106][25689] Avg episode reward: [(0, '-48.433')] [2022-07-09 16:51:57,156][26022] Updated weights on worker 0-0, policy_version 336102 (0.00093) [2022-07-09 16:51:58,980][26022] Updated weights on worker 0-0, policy_version 336112 (0.00089) [2022-07-09 16:52:00,921][26022] Updated weights on worker 0-0, policy_version 336122 (0.00092) [2022-07-09 16:52:01,115][25689] Fps is (10 sec: 5725.6, 60 sec: 5656.6, 300 sec: 5643.5). Total num frames: 344189952. Throughput: 0: 5103.8. Samples: 344182960. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:01,116][25689] Avg episode reward: [(0, '-47.817')] [2022-07-09 16:52:02,853][26022] Updated weights on worker 0-0, policy_version 336132 (0.00084) [2022-07-09 16:52:04,893][26022] Updated weights on worker 0-0, policy_version 336142 (0.00087) [2022-07-09 16:52:06,118][25689] Fps is (10 sec: 5522.6, 60 sec: 5660.6, 300 sec: 5638.5). Total num frames: 344216576. Throughput: 0: 5842.5. Samples: 344215010. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:06,119][25689] Avg episode reward: [(0, '-47.320')] [2022-07-09 16:52:06,599][26022] Updated weights on worker 0-0, policy_version 336152 (0.00091) [2022-07-09 16:52:08,615][26022] Updated weights on worker 0-0, policy_version 336162 (0.00087) [2022-07-09 16:52:10,280][26022] Updated weights on worker 0-0, policy_version 336172 (0.00097) [2022-07-09 16:52:11,180][25689] Fps is (10 sec: 5392.2, 60 sec: 5645.4, 300 sec: 5630.8). Total num frames: 344244224. Throughput: 0: 5816.3. Samples: 344248766. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:11,182][25689] Avg episode reward: [(0, '-46.880')] [2022-07-09 16:52:12,095][26022] Updated weights on worker 0-0, policy_version 336182 (0.00092) [2022-07-09 16:52:13,972][26022] Updated weights on worker 0-0, policy_version 336192 (0.00091) [2022-07-09 16:52:15,639][26022] Updated weights on worker 0-0, policy_version 336202 (0.00087) [2022-07-09 16:52:16,185][25689] Fps is (10 sec: 5696.3, 60 sec: 5662.8, 300 sec: 5634.4). Total num frames: 344273920. Throughput: 0: 5810.6. Samples: 344282896. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:16,186][25689] Avg episode reward: [(0, '-46.540')] [2022-07-09 16:52:17,572][26022] Updated weights on worker 0-0, policy_version 336212 (0.00084) [2022-07-09 16:52:19,231][26022] Updated weights on worker 0-0, policy_version 336222 (0.00092) [2022-07-09 16:52:21,055][26022] Updated weights on worker 0-0, policy_version 336232 (0.00668) [2022-07-09 16:52:21,196][25689] Fps is (10 sec: 5827.4, 60 sec: 5647.0, 300 sec: 5637.8). Total num frames: 344302592. Throughput: 0: 5822.8. Samples: 344299888. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:21,198][25689] Avg episode reward: [(0, '-46.501')] [2022-07-09 16:52:22,871][26022] Updated weights on worker 0-0, policy_version 336242 (0.00086) [2022-07-09 16:52:24,764][26022] Updated weights on worker 0-0, policy_version 336252 (0.00091) [2022-07-09 16:52:26,214][25689] Fps is (10 sec: 5615.5, 60 sec: 5663.6, 300 sec: 5636.6). Total num frames: 344330240. Throughput: 0: 5918.2. Samples: 344333944. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:26,215][25689] Avg episode reward: [(0, '-46.797')] [2022-07-09 16:52:26,510][26022] Updated weights on worker 0-0, policy_version 336262 (0.00088) [2022-07-09 16:52:28,485][26022] Updated weights on worker 0-0, policy_version 336272 (0.00089) [2022-07-09 16:52:30,171][26022] Updated weights on worker 0-0, policy_version 336282 (0.00600) [2022-07-09 16:52:31,284][25689] Fps is (10 sec: 5582.7, 60 sec: 5649.5, 300 sec: 5635.9). Total num frames: 344358912. Throughput: 0: 5919.5. Samples: 344367774. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:31,285][25689] Avg episode reward: [(0, '-46.518')] [2022-07-09 16:52:32,124][26022] Updated weights on worker 0-0, policy_version 336292 (0.00084) [2022-07-09 16:52:33,852][26022] Updated weights on worker 0-0, policy_version 336302 (0.00089) [2022-07-09 16:52:35,538][26022] Updated weights on worker 0-0, policy_version 336312 (0.00093) [2022-07-09 16:52:36,333][25689] Fps is (10 sec: 5667.0, 60 sec: 5630.1, 300 sec: 5635.4). Total num frames: 344387584. Throughput: 0: 5051.8. Samples: 344384682. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:36,334][25689] Avg episode reward: [(0, '-46.573')] [2022-07-09 16:52:37,340][26022] Updated weights on worker 0-0, policy_version 336322 (0.00084) [2022-07-09 16:52:39,246][26022] Updated weights on worker 0-0, policy_version 336332 (0.00093) [2022-07-09 16:52:40,964][26022] Updated weights on worker 0-0, policy_version 336342 (0.00084) [2022-07-09 16:52:41,337][25689] Fps is (10 sec: 5602.5, 60 sec: 5630.8, 300 sec: 5632.4). Total num frames: 344415232. Throughput: 0: 5903.5. Samples: 344418790. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:41,338][25689] Avg episode reward: [(0, '-47.637')] [2022-07-09 16:52:42,776][26022] Updated weights on worker 0-0, policy_version 336352 (0.00086) [2022-07-09 16:52:44,475][26022] Updated weights on worker 0-0, policy_version 336362 (0.00089) [2022-07-09 16:52:46,186][26022] Updated weights on worker 0-0, policy_version 336372 (0.00087) [2022-07-09 16:52:46,407][25689] Fps is (10 sec: 5692.3, 60 sec: 5626.0, 300 sec: 5639.3). Total num frames: 344444928. Throughput: 0: 5899.1. Samples: 344453064. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:46,408][25689] Avg episode reward: [(0, '-47.910')] [2022-07-09 16:52:48,114][26022] Updated weights on worker 0-0, policy_version 336382 (0.00084) [2022-07-09 16:52:49,855][26022] Updated weights on worker 0-0, policy_version 336392 (0.00091) [2022-07-09 16:52:51,458][25689] Fps is (10 sec: 5766.9, 60 sec: 5649.4, 300 sec: 5632.9). Total num frames: 344473600. Throughput: 0: 5082.5. Samples: 344470308. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:52:51,459][25689] Avg episode reward: [(0, '-47.969')] [2022-07-09 16:52:51,784][26022] Updated weights on worker 0-0, policy_version 336402 (0.00093) [2022-07-09 16:52:53,473][26022] Updated weights on worker 0-0, policy_version 336412 (0.00087) [2022-07-09 16:52:55,306][26022] Updated weights on worker 0-0, policy_version 336422 (0.00087) [2022-07-09 16:52:56,511][25689] Fps is (10 sec: 5675.8, 60 sec: 5645.1, 300 sec: 5635.6). Total num frames: 344502272. Throughput: 0: 5944.5. Samples: 344504626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:52:56,511][25689] Avg episode reward: [(0, '-47.806')] [2022-07-09 16:52:57,180][26022] Updated weights on worker 0-0, policy_version 336432 (0.00085) [2022-07-09 16:52:58,991][26022] Updated weights on worker 0-0, policy_version 336442 (0.00088) [2022-07-09 16:53:00,848][26022] Updated weights on worker 0-0, policy_version 336452 (0.00091) [2022-07-09 16:53:01,527][25689] Fps is (10 sec: 5695.2, 60 sec: 5644.4, 300 sec: 5642.4). Total num frames: 344530944. Throughput: 0: 5934.8. Samples: 344538614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:01,528][25689] Avg episode reward: [(0, '-47.533')] [2022-07-09 16:53:02,924][26022] Updated weights on worker 0-0, policy_version 336462 (0.00099) [2022-07-09 16:53:04,763][26022] Updated weights on worker 0-0, policy_version 336472 (0.00165) [2022-07-09 16:53:06,552][25689] Fps is (10 sec: 5404.9, 60 sec: 5625.5, 300 sec: 5634.2). Total num frames: 344556544. Throughput: 0: 4989.7. Samples: 344553580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:06,552][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 16:53:06,574][26022] Updated weights on worker 0-0, policy_version 336482 (0.00085) [2022-07-09 16:53:08,383][26022] Updated weights on worker 0-0, policy_version 336492 (0.00505) [2022-07-09 16:53:10,125][26022] Updated weights on worker 0-0, policy_version 336502 (0.00094) [2022-07-09 16:53:11,616][25689] Fps is (10 sec: 5379.3, 60 sec: 5642.2, 300 sec: 5637.3). Total num frames: 344585216. Throughput: 0: 5828.1. Samples: 344587790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:11,617][25689] Avg episode reward: [(0, '-47.546')] [2022-07-09 16:53:12,063][26022] Updated weights on worker 0-0, policy_version 336512 (0.00084) [2022-07-09 16:53:13,696][26022] Updated weights on worker 0-0, policy_version 336522 (0.00090) [2022-07-09 16:53:14,957][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:53:14,974][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000336528_344604672.pth [2022-07-09 16:53:14,975][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000334545_342574080.pth [2022-07-09 16:53:15,508][26022] Updated weights on worker 0-0, policy_version 336532 (0.00087) [2022-07-09 16:53:16,683][25689] Fps is (10 sec: 5862.3, 60 sec: 5653.3, 300 sec: 5637.2). Total num frames: 344615936. Throughput: 0: 5828.8. Samples: 344622206. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:16,685][25689] Avg episode reward: [(0, '-47.576')] [2022-07-09 16:53:17,226][26022] Updated weights on worker 0-0, policy_version 336542 (0.00087) [2022-07-09 16:53:19,206][26022] Updated weights on worker 0-0, policy_version 336552 (0.00085) [2022-07-09 16:53:20,916][26022] Updated weights on worker 0-0, policy_version 336562 (0.00058) [2022-07-09 16:53:21,719][25689] Fps is (10 sec: 5777.5, 60 sec: 5634.1, 300 sec: 5640.5). Total num frames: 344643584. Throughput: 0: 4994.7. Samples: 344639466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:21,719][25689] Avg episode reward: [(0, '-48.420')] [2022-07-09 16:53:22,795][26022] Updated weights on worker 0-0, policy_version 336572 (0.00092) [2022-07-09 16:53:24,459][26022] Updated weights on worker 0-0, policy_version 336582 (0.00087) [2022-07-09 16:53:26,585][26022] Updated weights on worker 0-0, policy_version 336592 (0.00097) [2022-07-09 16:53:26,749][25689] Fps is (10 sec: 5493.1, 60 sec: 5633.0, 300 sec: 5634.0). Total num frames: 344671232. Throughput: 0: 5927.3. Samples: 344673296. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:26,751][25689] Avg episode reward: [(0, '-47.969')] [2022-07-09 16:53:28,005][26022] Updated weights on worker 0-0, policy_version 336602 (0.00092) [2022-07-09 16:53:30,044][26022] Updated weights on worker 0-0, policy_version 336612 (0.00087) [2022-07-09 16:53:31,841][25689] Fps is (10 sec: 5564.1, 60 sec: 5631.0, 300 sec: 5636.1). Total num frames: 344699904. Throughput: 0: 5906.2. Samples: 344707242. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:31,842][25689] Avg episode reward: [(0, '-47.599')] [2022-07-09 16:53:31,877][26022] Updated weights on worker 0-0, policy_version 336622 (0.00086) [2022-07-09 16:53:33,643][26022] Updated weights on worker 0-0, policy_version 336632 (0.00087) [2022-07-09 16:53:35,466][26022] Updated weights on worker 0-0, policy_version 336642 (0.00086) [2022-07-09 16:53:36,864][25689] Fps is (10 sec: 5669.3, 60 sec: 5633.4, 300 sec: 5636.9). Total num frames: 344728576. Throughput: 0: 5053.5. Samples: 344724192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:36,866][25689] Avg episode reward: [(0, '-47.365')] [2022-07-09 16:53:37,258][26022] Updated weights on worker 0-0, policy_version 336652 (0.00084) [2022-07-09 16:53:38,996][26022] Updated weights on worker 0-0, policy_version 336662 (0.00086) [2022-07-09 16:53:41,061][26022] Updated weights on worker 0-0, policy_version 336672 (0.00091) [2022-07-09 16:53:41,878][25689] Fps is (10 sec: 5713.1, 60 sec: 5649.3, 300 sec: 5634.1). Total num frames: 344757248. Throughput: 0: 5886.1. Samples: 344758126. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:41,879][25689] Avg episode reward: [(0, '-46.383')] [2022-07-09 16:53:42,580][26022] Updated weights on worker 0-0, policy_version 336682 (0.00081) [2022-07-09 16:53:44,435][26022] Updated weights on worker 0-0, policy_version 336692 (0.00094) [2022-07-09 16:53:46,268][26022] Updated weights on worker 0-0, policy_version 336702 (0.00093) [2022-07-09 16:53:46,880][25689] Fps is (10 sec: 5725.3, 60 sec: 5638.8, 300 sec: 5639.4). Total num frames: 344785920. Throughput: 0: 5914.7. Samples: 344792362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:46,881][25689] Avg episode reward: [(0, '-44.894')] [2022-07-09 16:53:48,326][26022] Updated weights on worker 0-0, policy_version 336712 (0.00088) [2022-07-09 16:53:49,891][26022] Updated weights on worker 0-0, policy_version 336722 (0.00092) [2022-07-09 16:53:51,796][26022] Updated weights on worker 0-0, policy_version 336732 (0.00089) [2022-07-09 16:53:51,937][25689] Fps is (10 sec: 5599.4, 60 sec: 5621.3, 300 sec: 5638.5). Total num frames: 344813568. Throughput: 0: 5095.3. Samples: 344809634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:51,939][25689] Avg episode reward: [(0, '-44.869')] [2022-07-09 16:53:53,407][26022] Updated weights on worker 0-0, policy_version 336742 (0.00087) [2022-07-09 16:53:55,576][26022] Updated weights on worker 0-0, policy_version 336752 (0.00089) [2022-07-09 16:53:56,941][25689] Fps is (10 sec: 5699.8, 60 sec: 5642.8, 300 sec: 5636.0). Total num frames: 344843264. Throughput: 0: 5940.6. Samples: 344843458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:53:56,941][25689] Avg episode reward: [(0, '-46.746')] [2022-07-09 16:53:57,020][26022] Updated weights on worker 0-0, policy_version 336762 (0.00085) [2022-07-09 16:53:59,009][26022] Updated weights on worker 0-0, policy_version 336772 (0.00090) [2022-07-09 16:54:00,690][26022] Updated weights on worker 0-0, policy_version 336782 (0.00091) [2022-07-09 16:54:01,955][25689] Fps is (10 sec: 5519.4, 60 sec: 5592.1, 300 sec: 5643.1). Total num frames: 344868864. Throughput: 0: 5946.4. Samples: 344877510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:01,957][25689] Avg episode reward: [(0, '-46.862')] [2022-07-09 16:54:02,975][26022] Updated weights on worker 0-0, policy_version 336792 (0.00093) [2022-07-09 16:54:04,757][26022] Updated weights on worker 0-0, policy_version 336802 (0.00085) [2022-07-09 16:54:06,651][26022] Updated weights on worker 0-0, policy_version 336812 (0.00082) [2022-07-09 16:54:06,963][25689] Fps is (10 sec: 5313.1, 60 sec: 5627.6, 300 sec: 5634.0). Total num frames: 344896512. Throughput: 0: 4979.9. Samples: 344892372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:06,963][25689] Avg episode reward: [(0, '-46.942')] [2022-07-09 16:54:08,459][26022] Updated weights on worker 0-0, policy_version 336822 (0.00092) [2022-07-09 16:54:10,464][26022] Updated weights on worker 0-0, policy_version 336832 (0.00096) [2022-07-09 16:54:12,007][26022] Updated weights on worker 0-0, policy_version 336842 (0.00085) [2022-07-09 16:54:12,037][25689] Fps is (10 sec: 5789.1, 60 sec: 5660.5, 300 sec: 5643.3). Total num frames: 344927232. Throughput: 0: 5802.8. Samples: 344926276. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:12,038][25689] Avg episode reward: [(0, '-47.338')] [2022-07-09 16:54:13,916][26022] Updated weights on worker 0-0, policy_version 336852 (0.00086) [2022-07-09 16:54:15,543][26022] Updated weights on worker 0-0, policy_version 336862 (0.00087) [2022-07-09 16:54:17,092][25689] Fps is (10 sec: 5661.5, 60 sec: 5593.9, 300 sec: 5635.7). Total num frames: 344953856. Throughput: 0: 5800.8. Samples: 344960350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:17,093][25689] Avg episode reward: [(0, '-47.796')] [2022-07-09 16:54:17,440][26022] Updated weights on worker 0-0, policy_version 336872 (0.00092) [2022-07-09 16:54:19,276][26022] Updated weights on worker 0-0, policy_version 336882 (0.00089) [2022-07-09 16:54:20,998][26022] Updated weights on worker 0-0, policy_version 336892 (0.00089) [2022-07-09 16:54:22,119][25689] Fps is (10 sec: 5586.6, 60 sec: 5628.6, 300 sec: 5639.3). Total num frames: 344983552. Throughput: 0: 4962.9. Samples: 344977580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:22,119][25689] Avg episode reward: [(0, '-47.775')] [2022-07-09 16:54:22,733][26022] Updated weights on worker 0-0, policy_version 336902 (0.00086) [2022-07-09 16:54:24,747][26022] Updated weights on worker 0-0, policy_version 336912 (0.00088) [2022-07-09 16:54:26,251][26022] Updated weights on worker 0-0, policy_version 336922 (0.00092) [2022-07-09 16:54:27,134][25689] Fps is (10 sec: 5812.5, 60 sec: 5647.0, 300 sec: 5637.4). Total num frames: 345012224. Throughput: 0: 5930.1. Samples: 345011986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:27,134][25689] Avg episode reward: [(0, '-46.714')] [2022-07-09 16:54:28,425][26022] Updated weights on worker 0-0, policy_version 336932 (0.00272) [2022-07-09 16:54:29,755][26022] Updated weights on worker 0-0, policy_version 336942 (0.00099) [2022-07-09 16:54:32,028][26022] Updated weights on worker 0-0, policy_version 336952 (0.00089) [2022-07-09 16:54:32,257][25689] Fps is (10 sec: 5656.2, 60 sec: 5644.1, 300 sec: 5643.7). Total num frames: 345040896. Throughput: 0: 5914.4. Samples: 345045862. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:32,258][25689] Avg episode reward: [(0, '-46.577')] [2022-07-09 16:54:33,716][26022] Updated weights on worker 0-0, policy_version 336962 (0.00090) [2022-07-09 16:54:35,388][26022] Updated weights on worker 0-0, policy_version 336972 (0.00096) [2022-07-09 16:54:37,270][25689] Fps is (10 sec: 5556.3, 60 sec: 5628.1, 300 sec: 5636.7). Total num frames: 345068544. Throughput: 0: 5084.8. Samples: 345062952. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:37,271][25689] Avg episode reward: [(0, '-47.095')] [2022-07-09 16:54:37,302][26022] Updated weights on worker 0-0, policy_version 336982 (0.00057) [2022-07-09 16:54:39,035][26022] Updated weights on worker 0-0, policy_version 336992 (0.00086) [2022-07-09 16:54:40,860][26022] Updated weights on worker 0-0, policy_version 337002 (0.00086) [2022-07-09 16:54:42,292][25689] Fps is (10 sec: 5612.3, 60 sec: 5627.3, 300 sec: 5633.8). Total num frames: 345097216. Throughput: 0: 5930.1. Samples: 345097210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:42,293][25689] Avg episode reward: [(0, '-46.852')] [2022-07-09 16:54:42,793][26022] Updated weights on worker 0-0, policy_version 337012 (0.00086) [2022-07-09 16:54:44,443][26022] Updated weights on worker 0-0, policy_version 337022 (0.00095) [2022-07-09 16:54:46,534][26022] Updated weights on worker 0-0, policy_version 337032 (0.00090) [2022-07-09 16:54:47,307][25689] Fps is (10 sec: 5815.2, 60 sec: 5643.0, 300 sec: 5642.9). Total num frames: 345126912. Throughput: 0: 5900.6. Samples: 345131022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:47,308][25689] Avg episode reward: [(0, '-47.061')] [2022-07-09 16:54:48,110][26022] Updated weights on worker 0-0, policy_version 337042 (0.00065) [2022-07-09 16:54:50,087][26022] Updated weights on worker 0-0, policy_version 337052 (0.00085) [2022-07-09 16:54:51,614][26022] Updated weights on worker 0-0, policy_version 337062 (0.00079) [2022-07-09 16:54:52,412][25689] Fps is (10 sec: 5767.4, 60 sec: 5655.4, 300 sec: 5641.4). Total num frames: 345155584. Throughput: 0: 5066.8. Samples: 345147986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:52,413][25689] Avg episode reward: [(0, '-47.308')] [2022-07-09 16:54:53,590][26022] Updated weights on worker 0-0, policy_version 337072 (0.00091) [2022-07-09 16:54:55,292][26022] Updated weights on worker 0-0, policy_version 337082 (0.00093) [2022-07-09 16:54:57,240][26022] Updated weights on worker 0-0, policy_version 337092 (0.00087) [2022-07-09 16:54:57,440][25689] Fps is (10 sec: 5557.9, 60 sec: 5619.4, 300 sec: 5639.0). Total num frames: 345183232. Throughput: 0: 5910.9. Samples: 345182178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:54:57,441][25689] Avg episode reward: [(0, '-47.776')] [2022-07-09 16:54:58,816][26022] Updated weights on worker 0-0, policy_version 337102 (0.00086) [2022-07-09 16:55:00,966][26022] Updated weights on worker 0-0, policy_version 337112 (0.00092) [2022-07-09 16:55:02,462][25689] Fps is (10 sec: 5400.1, 60 sec: 5635.5, 300 sec: 5638.8). Total num frames: 345209856. Throughput: 0: 5861.2. Samples: 345215434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:55:02,463][25689] Avg episode reward: [(0, '-48.651')] [2022-07-09 16:55:02,810][26022] Updated weights on worker 0-0, policy_version 337122 (0.00088) [2022-07-09 16:55:04,818][26022] Updated weights on worker 0-0, policy_version 337132 (0.00092) [2022-07-09 16:55:06,771][26022] Updated weights on worker 0-0, policy_version 337142 (0.00081) [2022-07-09 16:55:07,486][25689] Fps is (10 sec: 5402.6, 60 sec: 5634.1, 300 sec: 5635.5). Total num frames: 345237504. Throughput: 0: 4947.6. Samples: 345230860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:55:07,488][25689] Avg episode reward: [(0, '-48.569')] [2022-07-09 16:55:08,391][26022] Updated weights on worker 0-0, policy_version 337152 (0.00087) [2022-07-09 16:55:10,409][26022] Updated weights on worker 0-0, policy_version 337162 (0.00087) [2022-07-09 16:55:11,948][26022] Updated weights on worker 0-0, policy_version 337172 (0.00086) [2022-07-09 16:55:12,587][25689] Fps is (10 sec: 5562.7, 60 sec: 5597.8, 300 sec: 5633.8). Total num frames: 345266176. Throughput: 0: 5781.3. Samples: 345264622. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:55:12,587][25689] Avg episode reward: [(0, '-48.240')] [2022-07-09 16:55:13,939][26022] Updated weights on worker 0-0, policy_version 337182 (0.00095) [2022-07-09 16:55:15,248][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:55:15,262][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000337189_345281536.pth [2022-07-09 16:55:15,262][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000335205_343249920.pth [2022-07-09 16:55:15,670][26022] Updated weights on worker 0-0, policy_version 337192 (0.00097) [2022-07-09 16:55:17,337][26022] Updated weights on worker 0-0, policy_version 337202 (0.00090) [2022-07-09 16:55:17,669][25689] Fps is (10 sec: 5631.5, 60 sec: 5629.1, 300 sec: 5627.0). Total num frames: 345294848. Throughput: 0: 5769.2. Samples: 345298880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:55:17,669][25689] Avg episode reward: [(0, '-48.338')] [2022-07-09 16:55:19,432][26022] Updated weights on worker 0-0, policy_version 337212 (0.00087) [2022-07-09 16:55:20,978][26022] Updated weights on worker 0-0, policy_version 337222 (0.00102) [2022-07-09 16:55:22,756][25689] Fps is (10 sec: 5639.1, 60 sec: 5606.6, 300 sec: 5635.9). Total num frames: 345323520. Throughput: 0: 5792.4. Samples: 345332982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 16:55:22,758][25689] Avg episode reward: [(0, '-48.123')] [2022-07-09 16:55:22,872][26022] Updated weights on worker 0-0, policy_version 337232 (0.00049) [2022-07-09 16:55:24,918][26022] Updated weights on worker 0-0, policy_version 337242 (0.00084) [2022-07-09 16:55:26,443][26022] Updated weights on worker 0-0, policy_version 337252 (0.00095) [2022-07-09 16:55:27,824][25689] Fps is (10 sec: 5747.6, 60 sec: 5618.6, 300 sec: 5635.3). Total num frames: 345353216. Throughput: 0: 5855.8. Samples: 345349954. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:27,824][25689] Avg episode reward: [(0, '-47.827')] [2022-07-09 16:55:28,680][26022] Updated weights on worker 0-0, policy_version 337262 (0.00084) [2022-07-09 16:55:29,932][26022] Updated weights on worker 0-0, policy_version 337272 (0.00086) [2022-07-09 16:55:32,262][26022] Updated weights on worker 0-0, policy_version 337282 (0.00103) [2022-07-09 16:55:32,858][25689] Fps is (10 sec: 5676.3, 60 sec: 5609.9, 300 sec: 5634.8). Total num frames: 345380864. Throughput: 0: 5860.9. Samples: 345383430. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:32,859][25689] Avg episode reward: [(0, '-47.994')] [2022-07-09 16:55:33,891][26022] Updated weights on worker 0-0, policy_version 337292 (0.00085) [2022-07-09 16:55:35,583][26022] Updated weights on worker 0-0, policy_version 337302 (0.00085) [2022-07-09 16:55:37,491][26022] Updated weights on worker 0-0, policy_version 337312 (0.00097) [2022-07-09 16:55:37,919][25689] Fps is (10 sec: 5477.4, 60 sec: 5605.5, 300 sec: 5630.4). Total num frames: 345408512. Throughput: 0: 5874.7. Samples: 345417844. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:37,920][25689] Avg episode reward: [(0, '-47.937')] [2022-07-09 16:55:39,134][26022] Updated weights on worker 0-0, policy_version 337322 (0.00082) [2022-07-09 16:55:41,103][26022] Updated weights on worker 0-0, policy_version 337332 (0.00090) [2022-07-09 16:55:42,926][25689] Fps is (10 sec: 5695.9, 60 sec: 5623.8, 300 sec: 5639.0). Total num frames: 345438208. Throughput: 0: 5047.1. Samples: 345434782. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:42,926][25689] Avg episode reward: [(0, '-47.888')] [2022-07-09 16:55:42,932][26022] Updated weights on worker 0-0, policy_version 337342 (0.00093) [2022-07-09 16:55:44,639][26022] Updated weights on worker 0-0, policy_version 337352 (0.00088) [2022-07-09 16:55:46,533][26022] Updated weights on worker 0-0, policy_version 337362 (0.00085) [2022-07-09 16:55:47,995][25689] Fps is (10 sec: 5691.1, 60 sec: 5585.0, 300 sec: 5631.7). Total num frames: 345465856. Throughput: 0: 5891.9. Samples: 345468802. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:47,996][25689] Avg episode reward: [(0, '-47.940')] [2022-07-09 16:55:48,266][26022] Updated weights on worker 0-0, policy_version 337372 (0.00086) [2022-07-09 16:55:50,018][26022] Updated weights on worker 0-0, policy_version 337382 (0.00087) [2022-07-09 16:55:51,771][26022] Updated weights on worker 0-0, policy_version 337392 (0.00084) [2022-07-09 16:55:53,091][25689] Fps is (10 sec: 5742.1, 60 sec: 5619.7, 300 sec: 5637.7). Total num frames: 345496576. Throughput: 0: 5909.4. Samples: 345502992. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:53,091][25689] Avg episode reward: [(0, '-47.679')] [2022-07-09 16:55:53,753][26022] Updated weights on worker 0-0, policy_version 337402 (0.00097) [2022-07-09 16:55:55,672][26022] Updated weights on worker 0-0, policy_version 337412 (0.00089) [2022-07-09 16:55:57,420][26022] Updated weights on worker 0-0, policy_version 337422 (0.00088) [2022-07-09 16:55:58,093][25689] Fps is (10 sec: 5780.0, 60 sec: 5622.0, 300 sec: 5635.7). Total num frames: 345524224. Throughput: 0: 5061.0. Samples: 345519946. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:55:58,094][25689] Avg episode reward: [(0, '-48.042')] [2022-07-09 16:55:59,142][26022] Updated weights on worker 0-0, policy_version 337432 (0.00085) [2022-07-09 16:56:00,948][26022] Updated weights on worker 0-0, policy_version 337442 (0.00087) [2022-07-09 16:56:02,971][26022] Updated weights on worker 0-0, policy_version 337452 (0.00086) [2022-07-09 16:56:03,168][25689] Fps is (10 sec: 5385.8, 60 sec: 5617.2, 300 sec: 5635.1). Total num frames: 345550848. Throughput: 0: 5809.7. Samples: 345552380. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:03,168][25689] Avg episode reward: [(0, '-48.483')] [2022-07-09 16:56:04,868][26022] Updated weights on worker 0-0, policy_version 337462 (0.00093) [2022-07-09 16:56:06,816][26022] Updated weights on worker 0-0, policy_version 337472 (0.00082) [2022-07-09 16:56:08,193][25689] Fps is (10 sec: 5576.3, 60 sec: 5650.7, 300 sec: 5639.6). Total num frames: 345580544. Throughput: 0: 5824.5. Samples: 345586446. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:08,194][25689] Avg episode reward: [(0, '-48.528')] [2022-07-09 16:56:08,455][26022] Updated weights on worker 0-0, policy_version 337482 (0.00087) [2022-07-09 16:56:10,483][26022] Updated weights on worker 0-0, policy_version 337492 (0.00082) [2022-07-09 16:56:12,001][26022] Updated weights on worker 0-0, policy_version 337502 (0.00089) [2022-07-09 16:56:13,289][25689] Fps is (10 sec: 5665.9, 60 sec: 5634.4, 300 sec: 5634.6). Total num frames: 345608192. Throughput: 0: 4983.4. Samples: 345603646. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:13,289][25689] Avg episode reward: [(0, '-48.247')] [2022-07-09 16:56:13,713][26022] Updated weights on worker 0-0, policy_version 337512 (0.00086) [2022-07-09 16:56:15,659][26022] Updated weights on worker 0-0, policy_version 337522 (0.00087) [2022-07-09 16:56:17,394][26022] Updated weights on worker 0-0, policy_version 337532 (0.00084) [2022-07-09 16:56:18,293][25689] Fps is (10 sec: 5576.4, 60 sec: 5641.6, 300 sec: 5631.5). Total num frames: 345636864. Throughput: 0: 5865.4. Samples: 345638424. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:18,293][25689] Avg episode reward: [(0, '-47.706')] [2022-07-09 16:56:19,178][26022] Updated weights on worker 0-0, policy_version 337542 (0.00089) [2022-07-09 16:56:21,058][26022] Updated weights on worker 0-0, policy_version 337552 (0.00086) [2022-07-09 16:56:22,635][26022] Updated weights on worker 0-0, policy_version 337562 (0.00086) [2022-07-09 16:56:23,319][25689] Fps is (10 sec: 5921.3, 60 sec: 5681.1, 300 sec: 5645.0). Total num frames: 345667584. Throughput: 0: 5971.4. Samples: 345672710. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:23,321][25689] Avg episode reward: [(0, '-47.985')] [2022-07-09 16:56:24,658][26022] Updated weights on worker 0-0, policy_version 337572 (0.00090) [2022-07-09 16:56:26,163][26022] Updated weights on worker 0-0, policy_version 337582 (0.00092) [2022-07-09 16:56:28,159][26022] Updated weights on worker 0-0, policy_version 337592 (0.00095) [2022-07-09 16:56:28,335][25689] Fps is (10 sec: 5710.6, 60 sec: 5635.2, 300 sec: 5636.3). Total num frames: 345694208. Throughput: 0: 5141.9. Samples: 345690010. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:28,337][25689] Avg episode reward: [(0, '-47.765')] [2022-07-09 16:56:29,739][26022] Updated weights on worker 0-0, policy_version 337602 (0.00085) [2022-07-09 16:56:31,756][26022] Updated weights on worker 0-0, policy_version 337612 (0.00090) [2022-07-09 16:56:33,401][25689] Fps is (10 sec: 5484.6, 60 sec: 5649.2, 300 sec: 5632.1). Total num frames: 345722880. Throughput: 0: 5994.5. Samples: 345724208. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:33,402][25689] Avg episode reward: [(0, '-47.762')] [2022-07-09 16:56:33,559][26022] Updated weights on worker 0-0, policy_version 337622 (0.00088) [2022-07-09 16:56:35,491][26022] Updated weights on worker 0-0, policy_version 337632 (0.00088) [2022-07-09 16:56:37,114][26022] Updated weights on worker 0-0, policy_version 337642 (0.00091) [2022-07-09 16:56:38,421][25689] Fps is (10 sec: 5786.8, 60 sec: 5686.8, 300 sec: 5638.8). Total num frames: 345752576. Throughput: 0: 5950.7. Samples: 345758200. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:38,422][25689] Avg episode reward: [(0, '-48.609')] [2022-07-09 16:56:39,215][26022] Updated weights on worker 0-0, policy_version 337652 (0.00086) [2022-07-09 16:56:40,808][26022] Updated weights on worker 0-0, policy_version 337662 (0.00092) [2022-07-09 16:56:42,770][26022] Updated weights on worker 0-0, policy_version 337672 (0.00093) [2022-07-09 16:56:43,443][25689] Fps is (10 sec: 5710.2, 60 sec: 5651.6, 300 sec: 5631.8). Total num frames: 345780224. Throughput: 0: 5090.4. Samples: 345775150. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:43,443][25689] Avg episode reward: [(0, '-48.463')] [2022-07-09 16:56:44,255][26022] Updated weights on worker 0-0, policy_version 337682 (0.00086) [2022-07-09 16:56:46,183][26022] Updated weights on worker 0-0, policy_version 337692 (0.00090) [2022-07-09 16:56:47,864][26022] Updated weights on worker 0-0, policy_version 337702 (0.00090) [2022-07-09 16:56:48,487][25689] Fps is (10 sec: 5594.9, 60 sec: 5670.8, 300 sec: 5636.7). Total num frames: 345808896. Throughput: 0: 5926.0. Samples: 345809436. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:48,488][25689] Avg episode reward: [(0, '-48.056')] [2022-07-09 16:56:49,775][26022] Updated weights on worker 0-0, policy_version 337712 (0.00087) [2022-07-09 16:56:51,443][26022] Updated weights on worker 0-0, policy_version 337722 (0.00606) [2022-07-09 16:56:53,176][26022] Updated weights on worker 0-0, policy_version 337732 (0.00092) [2022-07-09 16:56:53,523][25689] Fps is (10 sec: 5892.1, 60 sec: 5676.5, 300 sec: 5643.0). Total num frames: 345839616. Throughput: 0: 5963.9. Samples: 345844216. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:53,523][25689] Avg episode reward: [(0, '-48.372')] [2022-07-09 16:56:55,007][26022] Updated weights on worker 0-0, policy_version 337742 (0.00080) [2022-07-09 16:56:56,836][26022] Updated weights on worker 0-0, policy_version 337752 (0.00082) [2022-07-09 16:56:58,551][25689] Fps is (10 sec: 5800.2, 60 sec: 5674.1, 300 sec: 5639.2). Total num frames: 345867264. Throughput: 0: 5121.4. Samples: 345861290. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:56:58,551][25689] Avg episode reward: [(0, '-48.749')] [2022-07-09 16:56:58,651][26022] Updated weights on worker 0-0, policy_version 337762 (0.00091) [2022-07-09 16:57:00,514][26022] Updated weights on worker 0-0, policy_version 337772 (0.00095) [2022-07-09 16:57:02,590][26022] Updated weights on worker 0-0, policy_version 337782 (0.00089) [2022-07-09 16:57:03,581][25689] Fps is (10 sec: 5294.0, 60 sec: 5661.3, 300 sec: 5635.3). Total num frames: 345892864. Throughput: 0: 5987.6. Samples: 345895728. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:03,581][25689] Avg episode reward: [(0, '-47.958')] [2022-07-09 16:57:04,502][26022] Updated weights on worker 0-0, policy_version 337792 (0.00088) [2022-07-09 16:57:06,288][26022] Updated weights on worker 0-0, policy_version 337802 (0.00094) [2022-07-09 16:57:08,089][26022] Updated weights on worker 0-0, policy_version 337812 (0.00095) [2022-07-09 16:57:08,631][25689] Fps is (10 sec: 5485.4, 60 sec: 5659.0, 300 sec: 5642.4). Total num frames: 345922560. Throughput: 0: 5854.9. Samples: 345927376. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:08,632][25689] Avg episode reward: [(0, '-47.407')] [2022-07-09 16:57:10,045][26022] Updated weights on worker 0-0, policy_version 337822 (0.00093) [2022-07-09 16:57:11,639][26022] Updated weights on worker 0-0, policy_version 337832 (0.00083) [2022-07-09 16:57:13,730][25689] Fps is (10 sec: 5650.3, 60 sec: 5658.6, 300 sec: 5633.7). Total num frames: 345950208. Throughput: 0: 4951.0. Samples: 345944260. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:13,730][25689] Avg episode reward: [(0, '-46.455')] [2022-07-09 16:57:13,736][26022] Updated weights on worker 0-0, policy_version 337842 (0.00092) [2022-07-09 16:57:15,297][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:57:15,320][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000337852_345960448.pth [2022-07-09 16:57:15,321][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000335866_343926784.pth [2022-07-09 16:57:15,323][26022] Updated weights on worker 0-0, policy_version 337852 (0.00086) [2022-07-09 16:57:17,230][26022] Updated weights on worker 0-0, policy_version 337862 (0.00097) [2022-07-09 16:57:18,785][25689] Fps is (10 sec: 5647.3, 60 sec: 5670.8, 300 sec: 5636.3). Total num frames: 345979904. Throughput: 0: 5789.7. Samples: 345978444. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:18,787][25689] Avg episode reward: [(0, '-47.195')] [2022-07-09 16:57:18,898][26022] Updated weights on worker 0-0, policy_version 337872 (0.00083) [2022-07-09 16:57:20,555][26022] Updated weights on worker 0-0, policy_version 337882 (0.00088) [2022-07-09 16:57:22,519][26022] Updated weights on worker 0-0, policy_version 337892 (0.00093) [2022-07-09 16:57:23,820][25689] Fps is (10 sec: 5886.1, 60 sec: 5653.0, 300 sec: 5642.9). Total num frames: 346009600. Throughput: 0: 5780.2. Samples: 346012714. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:23,821][25689] Avg episode reward: [(0, '-47.005')] [2022-07-09 16:57:24,241][26022] Updated weights on worker 0-0, policy_version 337902 (0.00084) [2022-07-09 16:57:26,265][26022] Updated weights on worker 0-0, policy_version 337912 (0.00090) [2022-07-09 16:57:27,930][26022] Updated weights on worker 0-0, policy_version 337922 (0.00096) [2022-07-09 16:57:28,840][25689] Fps is (10 sec: 5601.6, 60 sec: 5652.7, 300 sec: 5637.0). Total num frames: 346036224. Throughput: 0: 5058.9. Samples: 346029608. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:28,840][25689] Avg episode reward: [(0, '-46.919')] [2022-07-09 16:57:29,799][26022] Updated weights on worker 0-0, policy_version 337932 (0.00089) [2022-07-09 16:57:31,640][26022] Updated weights on worker 0-0, policy_version 337942 (0.00081) [2022-07-09 16:57:33,230][26022] Updated weights on worker 0-0, policy_version 337952 (0.00097) [2022-07-09 16:57:33,906][25689] Fps is (10 sec: 5583.8, 60 sec: 5669.6, 300 sec: 5640.1). Total num frames: 346065920. Throughput: 0: 5925.8. Samples: 346063822. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:33,907][25689] Avg episode reward: [(0, '-47.635')] [2022-07-09 16:57:35,119][26022] Updated weights on worker 0-0, policy_version 337962 (0.00088) [2022-07-09 16:57:36,912][26022] Updated weights on worker 0-0, policy_version 337972 (0.00095) [2022-07-09 16:57:38,919][25689] Fps is (10 sec: 5587.8, 60 sec: 5619.5, 300 sec: 5636.5). Total num frames: 346092544. Throughput: 0: 5926.1. Samples: 346097756. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:38,919][25689] Avg episode reward: [(0, '-47.789')] [2022-07-09 16:57:38,959][26022] Updated weights on worker 0-0, policy_version 337982 (0.00088) [2022-07-09 16:57:40,614][26022] Updated weights on worker 0-0, policy_version 337992 (0.00086) [2022-07-09 16:57:42,530][26022] Updated weights on worker 0-0, policy_version 338002 (0.00087) [2022-07-09 16:57:43,951][25689] Fps is (10 sec: 5606.7, 60 sec: 5652.4, 300 sec: 5637.2). Total num frames: 346122240. Throughput: 0: 5918.9. Samples: 346131870. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:43,952][25689] Avg episode reward: [(0, '-47.848')] [2022-07-09 16:57:44,176][26022] Updated weights on worker 0-0, policy_version 338012 (0.00093) [2022-07-09 16:57:45,931][26022] Updated weights on worker 0-0, policy_version 338022 (0.00086) [2022-07-09 16:57:47,888][26022] Updated weights on worker 0-0, policy_version 338032 (0.00096) [2022-07-09 16:57:48,981][25689] Fps is (10 sec: 5800.4, 60 sec: 5653.7, 300 sec: 5637.6). Total num frames: 346150912. Throughput: 0: 5924.0. Samples: 346148928. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:48,982][25689] Avg episode reward: [(0, '-47.743')] [2022-07-09 16:57:49,704][26022] Updated weights on worker 0-0, policy_version 338042 (0.00086) [2022-07-09 16:57:51,421][26022] Updated weights on worker 0-0, policy_version 338052 (0.00086) [2022-07-09 16:57:53,288][26022] Updated weights on worker 0-0, policy_version 338062 (0.00091) [2022-07-09 16:57:54,023][25689] Fps is (10 sec: 5795.5, 60 sec: 5636.3, 300 sec: 5641.2). Total num frames: 346180608. Throughput: 0: 5931.8. Samples: 346183148. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2022-07-09 16:57:54,023][25689] Avg episode reward: [(0, '-47.540')] [2022-07-09 16:57:54,995][26022] Updated weights on worker 0-0, policy_version 338072 (0.00092) [2022-07-09 16:57:56,786][26022] Updated weights on worker 0-0, policy_version 338082 (0.00092) [2022-07-09 16:57:58,380][26022] Updated weights on worker 0-0, policy_version 338092 (0.00087) [2022-07-09 16:57:59,059][25689] Fps is (10 sec: 5690.5, 60 sec: 5635.5, 300 sec: 5637.4). Total num frames: 346208256. Throughput: 0: 5939.3. Samples: 346217374. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:57:59,059][25689] Avg episode reward: [(0, '-47.431')] [2022-07-09 16:58:00,370][26022] Updated weights on worker 0-0, policy_version 338102 (0.00088) [2022-07-09 16:58:02,593][26022] Updated weights on worker 0-0, policy_version 338112 (0.00089) [2022-07-09 16:58:04,063][25689] Fps is (10 sec: 5507.2, 60 sec: 5671.7, 300 sec: 5644.7). Total num frames: 346235904. Throughput: 0: 5063.4. Samples: 346233704. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:04,064][25689] Avg episode reward: [(0, '-47.717')] [2022-07-09 16:58:04,237][26022] Updated weights on worker 0-0, policy_version 338122 (0.00106) [2022-07-09 16:58:06,279][26022] Updated weights on worker 0-0, policy_version 338132 (0.00085) [2022-07-09 16:58:07,845][26022] Updated weights on worker 0-0, policy_version 338142 (0.00091) [2022-07-09 16:58:09,098][25689] Fps is (10 sec: 5406.0, 60 sec: 5622.4, 300 sec: 5638.4). Total num frames: 346262528. Throughput: 0: 5847.6. Samples: 346266560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:09,098][25689] Avg episode reward: [(0, '-47.068')] [2022-07-09 16:58:09,671][26022] Updated weights on worker 0-0, policy_version 338152 (0.00089) [2022-07-09 16:58:11,582][26022] Updated weights on worker 0-0, policy_version 338162 (0.00088) [2022-07-09 16:58:13,287][26022] Updated weights on worker 0-0, policy_version 338172 (0.00086) [2022-07-09 16:58:14,136][25689] Fps is (10 sec: 5591.1, 60 sec: 5661.9, 300 sec: 5635.5). Total num frames: 346292224. Throughput: 0: 5865.0. Samples: 346301114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:14,137][25689] Avg episode reward: [(0, '-46.683')] [2022-07-09 16:58:15,140][26022] Updated weights on worker 0-0, policy_version 338182 (0.00088) [2022-07-09 16:58:16,816][26022] Updated weights on worker 0-0, policy_version 338192 (0.00089) [2022-07-09 16:58:18,687][26022] Updated weights on worker 0-0, policy_version 338202 (0.00083) [2022-07-09 16:58:19,147][25689] Fps is (10 sec: 5808.4, 60 sec: 5649.2, 300 sec: 5639.4). Total num frames: 346320896. Throughput: 0: 5027.2. Samples: 346318360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:19,147][25689] Avg episode reward: [(0, '-46.224')] [2022-07-09 16:58:20,526][26022] Updated weights on worker 0-0, policy_version 338212 (0.00089) [2022-07-09 16:58:22,209][26022] Updated weights on worker 0-0, policy_version 338222 (0.00087) [2022-07-09 16:58:24,077][26022] Updated weights on worker 0-0, policy_version 338232 (0.00091) [2022-07-09 16:58:24,183][25689] Fps is (10 sec: 5708.0, 60 sec: 5632.1, 300 sec: 5642.7). Total num frames: 346349568. Throughput: 0: 5928.2. Samples: 346352976. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:24,183][25689] Avg episode reward: [(0, '-46.050')] [2022-07-09 16:58:25,773][26022] Updated weights on worker 0-0, policy_version 338242 (0.00927) [2022-07-09 16:58:27,641][26022] Updated weights on worker 0-0, policy_version 338252 (0.00092) [2022-07-09 16:58:29,197][25689] Fps is (10 sec: 5807.3, 60 sec: 5683.4, 300 sec: 5647.6). Total num frames: 346379264. Throughput: 0: 6014.5. Samples: 346387450. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:29,198][25689] Avg episode reward: [(0, '-45.496')] [2022-07-09 16:58:29,327][26022] Updated weights on worker 0-0, policy_version 338262 (0.00079) [2022-07-09 16:58:31,244][26022] Updated weights on worker 0-0, policy_version 338272 (0.00096) [2022-07-09 16:58:32,886][26022] Updated weights on worker 0-0, policy_version 338282 (0.00078) [2022-07-09 16:58:34,227][25689] Fps is (10 sec: 5709.3, 60 sec: 5653.0, 300 sec: 5644.1). Total num frames: 346406912. Throughput: 0: 5154.8. Samples: 346404676. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:34,228][25689] Avg episode reward: [(0, '-44.953')] [2022-07-09 16:58:34,830][26022] Updated weights on worker 0-0, policy_version 338292 (0.00085) [2022-07-09 16:58:36,603][26022] Updated weights on worker 0-0, policy_version 338302 (0.00084) [2022-07-09 16:58:38,313][26022] Updated weights on worker 0-0, policy_version 338312 (0.00091) [2022-07-09 16:58:39,235][25689] Fps is (10 sec: 5713.1, 60 sec: 5704.4, 300 sec: 5647.6). Total num frames: 346436608. Throughput: 0: 6006.5. Samples: 346439018. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:39,237][25689] Avg episode reward: [(0, '-45.048')] [2022-07-09 16:58:40,168][26022] Updated weights on worker 0-0, policy_version 338322 (0.00087) [2022-07-09 16:58:41,787][26022] Updated weights on worker 0-0, policy_version 338332 (0.00092) [2022-07-09 16:58:43,869][26022] Updated weights on worker 0-0, policy_version 338342 (0.00094) [2022-07-09 16:58:44,248][25689] Fps is (10 sec: 5824.5, 60 sec: 5689.2, 300 sec: 5647.4). Total num frames: 346465280. Throughput: 0: 6001.2. Samples: 346473390. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:44,248][25689] Avg episode reward: [(0, '-45.166')] [2022-07-09 16:58:45,570][26022] Updated weights on worker 0-0, policy_version 338352 (0.00095) [2022-07-09 16:58:47,383][26022] Updated weights on worker 0-0, policy_version 338362 (0.00098) [2022-07-09 16:58:49,166][26022] Updated weights on worker 0-0, policy_version 338372 (0.00086) [2022-07-09 16:58:49,251][25689] Fps is (10 sec: 5725.3, 60 sec: 5691.8, 300 sec: 5651.9). Total num frames: 346493952. Throughput: 0: 5125.3. Samples: 346490228. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:49,252][25689] Avg episode reward: [(0, '-44.830')] [2022-07-09 16:58:51,065][26022] Updated weights on worker 0-0, policy_version 338382 (0.00085) [2022-07-09 16:58:52,776][26022] Updated weights on worker 0-0, policy_version 338392 (0.00087) [2022-07-09 16:58:54,295][25689] Fps is (10 sec: 5503.8, 60 sec: 5640.6, 300 sec: 5640.8). Total num frames: 346520576. Throughput: 0: 5946.8. Samples: 346524016. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:54,299][25689] Avg episode reward: [(0, '-45.021')] [2022-07-09 16:58:54,711][26022] Updated weights on worker 0-0, policy_version 338402 (0.00095) [2022-07-09 16:58:56,330][26022] Updated weights on worker 0-0, policy_version 338412 (0.00088) [2022-07-09 16:58:58,211][26022] Updated weights on worker 0-0, policy_version 338422 (0.00086) [2022-07-09 16:58:59,303][25689] Fps is (10 sec: 5602.4, 60 sec: 5677.1, 300 sec: 5654.7). Total num frames: 346550272. Throughput: 0: 5951.0. Samples: 346558446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:58:59,304][25689] Avg episode reward: [(0, '-45.453')] [2022-07-09 16:58:59,926][26022] Updated weights on worker 0-0, policy_version 338432 (0.00089) [2022-07-09 16:59:02,178][26022] Updated weights on worker 0-0, policy_version 338442 (0.00095) [2022-07-09 16:59:04,012][26022] Updated weights on worker 0-0, policy_version 338452 (0.00086) [2022-07-09 16:59:04,332][25689] Fps is (10 sec: 5611.3, 60 sec: 5657.9, 300 sec: 5650.8). Total num frames: 346576896. Throughput: 0: 4994.5. Samples: 346573698. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:04,332][25689] Avg episode reward: [(0, '-45.570')] [2022-07-09 16:59:05,730][26022] Updated weights on worker 0-0, policy_version 338462 (0.00088) [2022-07-09 16:59:07,560][26022] Updated weights on worker 0-0, policy_version 338472 (0.00086) [2022-07-09 16:59:09,255][26022] Updated weights on worker 0-0, policy_version 338482 (0.00110) [2022-07-09 16:59:09,344][25689] Fps is (10 sec: 5507.3, 60 sec: 5694.0, 300 sec: 5645.1). Total num frames: 346605568. Throughput: 0: 5860.2. Samples: 346607978. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:09,346][25689] Avg episode reward: [(0, '-45.949')] [2022-07-09 16:59:11,034][26022] Updated weights on worker 0-0, policy_version 338492 (0.00091) [2022-07-09 16:59:12,930][26022] Updated weights on worker 0-0, policy_version 338502 (0.00096) [2022-07-09 16:59:14,397][25689] Fps is (10 sec: 5697.4, 60 sec: 5675.7, 300 sec: 5652.1). Total num frames: 346634240. Throughput: 0: 5870.6. Samples: 346642026. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:14,397][25689] Avg episode reward: [(0, '-47.731')] [2022-07-09 16:59:14,719][26022] Updated weights on worker 0-0, policy_version 338512 (0.00085) [2022-07-09 16:59:15,424][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 16:59:15,434][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000338516_346640384.pth [2022-07-09 16:59:15,435][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000336528_344604672.pth [2022-07-09 16:59:16,543][26022] Updated weights on worker 0-0, policy_version 338522 (0.00097) [2022-07-09 16:59:18,385][26022] Updated weights on worker 0-0, policy_version 338532 (0.00089) [2022-07-09 16:59:19,400][25689] Fps is (10 sec: 5804.3, 60 sec: 5693.3, 300 sec: 5652.5). Total num frames: 346663936. Throughput: 0: 5014.7. Samples: 346659224. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:19,400][25689] Avg episode reward: [(0, '-48.011')] [2022-07-09 16:59:20,059][26022] Updated weights on worker 0-0, policy_version 338542 (0.00090) [2022-07-09 16:59:22,071][26022] Updated weights on worker 0-0, policy_version 338552 (0.00089) [2022-07-09 16:59:23,581][26022] Updated weights on worker 0-0, policy_version 338562 (0.00088) [2022-07-09 16:59:24,410][25689] Fps is (10 sec: 5624.4, 60 sec: 5661.7, 300 sec: 5645.7). Total num frames: 346690560. Throughput: 0: 5962.9. Samples: 346693424. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:24,411][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 16:59:25,499][26022] Updated weights on worker 0-0, policy_version 338572 (0.00091) [2022-07-09 16:59:27,387][26022] Updated weights on worker 0-0, policy_version 338582 (0.00087) [2022-07-09 16:59:28,987][26022] Updated weights on worker 0-0, policy_version 338592 (0.00086) [2022-07-09 16:59:29,416][25689] Fps is (10 sec: 5623.1, 60 sec: 5662.6, 300 sec: 5651.4). Total num frames: 346720256. Throughput: 0: 5966.4. Samples: 346727734. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:29,416][25689] Avg episode reward: [(0, '-48.661')] [2022-07-09 16:59:30,711][26022] Updated weights on worker 0-0, policy_version 338602 (0.00088) [2022-07-09 16:59:32,591][26022] Updated weights on worker 0-0, policy_version 338612 (0.00086) [2022-07-09 16:59:34,459][25689] Fps is (10 sec: 5706.7, 60 sec: 5661.3, 300 sec: 5650.8). Total num frames: 346747904. Throughput: 0: 5117.7. Samples: 346744698. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:34,459][25689] Avg episode reward: [(0, '-48.800')] [2022-07-09 16:59:34,516][26022] Updated weights on worker 0-0, policy_version 338622 (0.00081) [2022-07-09 16:59:36,196][26022] Updated weights on worker 0-0, policy_version 338632 (0.00089) [2022-07-09 16:59:38,279][26022] Updated weights on worker 0-0, policy_version 338642 (0.00927) [2022-07-09 16:59:39,463][25689] Fps is (10 sec: 5605.3, 60 sec: 5644.6, 300 sec: 5651.2). Total num frames: 346776576. Throughput: 0: 5963.6. Samples: 346778874. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:39,464][25689] Avg episode reward: [(0, '-48.581')] [2022-07-09 16:59:39,930][26022] Updated weights on worker 0-0, policy_version 338652 (0.00095) [2022-07-09 16:59:41,694][26022] Updated weights on worker 0-0, policy_version 338662 (0.00078) [2022-07-09 16:59:43,720][26022] Updated weights on worker 0-0, policy_version 338672 (0.00095) [2022-07-09 16:59:44,474][25689] Fps is (10 sec: 5623.1, 60 sec: 5627.8, 300 sec: 5644.3). Total num frames: 346804224. Throughput: 0: 5962.4. Samples: 346813056. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:44,475][25689] Avg episode reward: [(0, '-48.317')] [2022-07-09 16:59:45,061][26022] Updated weights on worker 0-0, policy_version 338682 (0.00086) [2022-07-09 16:59:47,364][26022] Updated weights on worker 0-0, policy_version 338692 (0.00087) [2022-07-09 16:59:48,701][26022] Updated weights on worker 0-0, policy_version 338702 (0.00085) [2022-07-09 16:59:49,495][25689] Fps is (10 sec: 5614.3, 60 sec: 5626.2, 300 sec: 5645.9). Total num frames: 346832896. Throughput: 0: 5099.9. Samples: 346830136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:49,495][25689] Avg episode reward: [(0, '-47.560')] [2022-07-09 16:59:50,799][26022] Updated weights on worker 0-0, policy_version 338712 (0.00095) [2022-07-09 16:59:52,586][26022] Updated weights on worker 0-0, policy_version 338722 (0.00088) [2022-07-09 16:59:54,297][26022] Updated weights on worker 0-0, policy_version 338732 (0.00110) [2022-07-09 16:59:54,618][25689] Fps is (10 sec: 5754.3, 60 sec: 5669.7, 300 sec: 5651.0). Total num frames: 346862592. Throughput: 0: 5931.0. Samples: 346864262. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:54,618][25689] Avg episode reward: [(0, '-46.907')] [2022-07-09 16:59:56,205][26022] Updated weights on worker 0-0, policy_version 338742 (0.00085) [2022-07-09 16:59:58,043][26022] Updated weights on worker 0-0, policy_version 338752 (0.00084) [2022-07-09 16:59:59,667][25689] Fps is (10 sec: 5737.8, 60 sec: 5648.9, 300 sec: 5657.4). Total num frames: 346891264. Throughput: 0: 5901.8. Samples: 346898114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 16:59:59,668][25689] Avg episode reward: [(0, '-46.397')] [2022-07-09 16:59:59,762][26022] Updated weights on worker 0-0, policy_version 338762 (0.00089) [2022-07-09 17:00:02,075][26022] Updated weights on worker 0-0, policy_version 338772 (0.00092) [2022-07-09 17:00:03,670][26022] Updated weights on worker 0-0, policy_version 338782 (0.00096) [2022-07-09 17:00:04,670][25689] Fps is (10 sec: 5500.8, 60 sec: 5651.3, 300 sec: 5654.4). Total num frames: 346917888. Throughput: 0: 4954.9. Samples: 346913124. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 17:00:04,671][25689] Avg episode reward: [(0, '-46.290')] [2022-07-09 17:00:05,791][26022] Updated weights on worker 0-0, policy_version 338792 (0.00097) [2022-07-09 17:00:07,456][26022] Updated weights on worker 0-0, policy_version 338802 (0.00088) [2022-07-09 17:00:09,222][26022] Updated weights on worker 0-0, policy_version 338812 (0.00088) [2022-07-09 17:00:09,695][25689] Fps is (10 sec: 5412.3, 60 sec: 5633.2, 300 sec: 5652.4). Total num frames: 346945536. Throughput: 0: 5776.8. Samples: 346946828. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 17:00:09,695][25689] Avg episode reward: [(0, '-46.242')] [2022-07-09 17:00:11,230][26022] Updated weights on worker 0-0, policy_version 338822 (0.00094) [2022-07-09 17:00:12,656][26022] Updated weights on worker 0-0, policy_version 338832 (0.00087) [2022-07-09 17:00:14,747][25689] Fps is (10 sec: 5487.6, 60 sec: 5616.3, 300 sec: 5649.5). Total num frames: 346973184. Throughput: 0: 5794.5. Samples: 346980898. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 17:00:14,747][25689] Avg episode reward: [(0, '-45.470')] [2022-07-09 17:00:14,829][26022] Updated weights on worker 0-0, policy_version 338842 (0.00092) [2022-07-09 17:00:16,593][26022] Updated weights on worker 0-0, policy_version 338852 (0.00097) [2022-07-09 17:00:18,312][26022] Updated weights on worker 0-0, policy_version 338862 (0.00080) [2022-07-09 17:00:19,750][25689] Fps is (10 sec: 5703.0, 60 sec: 5616.3, 300 sec: 5654.5). Total num frames: 347002880. Throughput: 0: 4974.5. Samples: 346998014. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 17:00:19,750][25689] Avg episode reward: [(0, '-46.408')] [2022-07-09 17:00:20,092][26022] Updated weights on worker 0-0, policy_version 338872 (0.00090) [2022-07-09 17:00:21,890][26022] Updated weights on worker 0-0, policy_version 338882 (0.00100) [2022-07-09 17:00:23,654][26022] Updated weights on worker 0-0, policy_version 338892 (0.00084) [2022-07-09 17:00:24,754][25689] Fps is (10 sec: 5730.0, 60 sec: 5633.8, 300 sec: 5648.8). Total num frames: 347030528. Throughput: 0: 5936.8. Samples: 347032360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 17.0) [2022-07-09 17:00:24,756][25689] Avg episode reward: [(0, '-46.938')] [2022-07-09 17:00:25,480][26022] Updated weights on worker 0-0, policy_version 338902 (0.00084) [2022-07-09 17:00:27,259][26022] Updated weights on worker 0-0, policy_version 338912 (0.00086) [2022-07-09 17:00:29,111][26022] Updated weights on worker 0-0, policy_version 338922 (0.00059) [2022-07-09 17:00:29,771][25689] Fps is (10 sec: 5620.3, 60 sec: 5615.8, 300 sec: 5652.6). Total num frames: 347059200. Throughput: 0: 5945.0. Samples: 347066180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:00:29,771][25689] Avg episode reward: [(0, '-47.489')] [2022-07-09 17:00:30,871][26022] Updated weights on worker 0-0, policy_version 338932 (0.00090) [2022-07-09 17:00:32,706][26022] Updated weights on worker 0-0, policy_version 338942 (0.00092) [2022-07-09 17:00:34,555][26022] Updated weights on worker 0-0, policy_version 338952 (0.00086) [2022-07-09 17:00:34,892][25689] Fps is (10 sec: 5757.5, 60 sec: 5642.4, 300 sec: 5658.4). Total num frames: 347088896. Throughput: 0: 5080.3. Samples: 347083242. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:00:34,893][25689] Avg episode reward: [(0, '-47.311')] [2022-07-09 17:00:36,144][26022] Updated weights on worker 0-0, policy_version 338962 (0.00088) [2022-07-09 17:00:38,092][26022] Updated weights on worker 0-0, policy_version 338972 (0.00091) [2022-07-09 17:00:39,854][26022] Updated weights on worker 0-0, policy_version 338982 (0.00092) [2022-07-09 17:00:39,947][25689] Fps is (10 sec: 5735.4, 60 sec: 5637.7, 300 sec: 5654.0). Total num frames: 347117568. Throughput: 0: 5931.7. Samples: 347117820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:00:39,948][25689] Avg episode reward: [(0, '-47.964')] [2022-07-09 17:00:41,758][26022] Updated weights on worker 0-0, policy_version 338992 (0.00628) [2022-07-09 17:00:43,541][26022] Updated weights on worker 0-0, policy_version 339002 (0.00090) [2022-07-09 17:00:44,990][25689] Fps is (10 sec: 5678.7, 60 sec: 5651.7, 300 sec: 5658.0). Total num frames: 347146240. Throughput: 0: 5901.5. Samples: 347151782. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:00:44,991][25689] Avg episode reward: [(0, '-49.030')] [2022-07-09 17:00:45,476][26022] Updated weights on worker 0-0, policy_version 339012 (0.00091) [2022-07-09 17:00:47,087][26022] Updated weights on worker 0-0, policy_version 339022 (0.01275) [2022-07-09 17:00:49,107][26022] Updated weights on worker 0-0, policy_version 339032 (0.00091) [2022-07-09 17:00:50,016][25689] Fps is (10 sec: 5695.5, 60 sec: 5651.2, 300 sec: 5652.4). Total num frames: 347174912. Throughput: 0: 5072.7. Samples: 347168876. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:00:50,016][25689] Avg episode reward: [(0, '-49.118')] [2022-07-09 17:00:50,631][26022] Updated weights on worker 0-0, policy_version 339042 (0.01086) [2022-07-09 17:00:52,559][26022] Updated weights on worker 0-0, policy_version 339052 (0.00087) [2022-07-09 17:00:54,273][26022] Updated weights on worker 0-0, policy_version 339062 (0.00085) [2022-07-09 17:00:55,084][25689] Fps is (10 sec: 5681.3, 60 sec: 5639.4, 300 sec: 5654.6). Total num frames: 347203584. Throughput: 0: 5924.9. Samples: 347202874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:00:55,084][25689] Avg episode reward: [(0, '-48.062')] [2022-07-09 17:00:56,186][26022] Updated weights on worker 0-0, policy_version 339072 (0.00081) [2022-07-09 17:00:57,862][26022] Updated weights on worker 0-0, policy_version 339082 (0.00108) [2022-07-09 17:00:59,623][26022] Updated weights on worker 0-0, policy_version 339092 (0.00090) [2022-07-09 17:01:00,159][25689] Fps is (10 sec: 5653.7, 60 sec: 5637.0, 300 sec: 5661.5). Total num frames: 347232256. Throughput: 0: 5888.8. Samples: 347236838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:00,159][25689] Avg episode reward: [(0, '-47.970')] [2022-07-09 17:01:01,525][26022] Updated weights on worker 0-0, policy_version 339102 (0.00085) [2022-07-09 17:01:03,619][26022] Updated weights on worker 0-0, policy_version 339112 (0.00082) [2022-07-09 17:01:05,164][25689] Fps is (10 sec: 5384.1, 60 sec: 5619.9, 300 sec: 5648.1). Total num frames: 347257856. Throughput: 0: 5009.0. Samples: 347252832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:05,164][25689] Avg episode reward: [(0, '-47.662')] [2022-07-09 17:01:05,658][26022] Updated weights on worker 0-0, policy_version 339122 (0.00095) [2022-07-09 17:01:07,446][26022] Updated weights on worker 0-0, policy_version 339132 (0.00090) [2022-07-09 17:01:09,151][26022] Updated weights on worker 0-0, policy_version 339142 (0.00082) [2022-07-09 17:01:10,177][25689] Fps is (10 sec: 5417.2, 60 sec: 5637.9, 300 sec: 5653.1). Total num frames: 347286528. Throughput: 0: 5801.9. Samples: 347285850. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:10,177][25689] Avg episode reward: [(0, '-46.935')] [2022-07-09 17:01:10,883][26022] Updated weights on worker 0-0, policy_version 339152 (0.00086) [2022-07-09 17:01:12,783][26022] Updated weights on worker 0-0, policy_version 339162 (0.00096) [2022-07-09 17:01:14,735][26022] Updated weights on worker 0-0, policy_version 339172 (0.00090) [2022-07-09 17:01:15,325][25689] Fps is (10 sec: 5643.3, 60 sec: 5645.8, 300 sec: 5650.4). Total num frames: 347315200. Throughput: 0: 5781.3. Samples: 347319896. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:15,326][25689] Avg episode reward: [(0, '-46.785')] [2022-07-09 17:01:15,455][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:01:15,466][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000339177_347317248.pth [2022-07-09 17:01:15,466][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000337189_345281536.pth [2022-07-09 17:01:16,414][26022] Updated weights on worker 0-0, policy_version 339182 (0.00841) [2022-07-09 17:01:18,191][26022] Updated weights on worker 0-0, policy_version 339192 (0.00088) [2022-07-09 17:01:20,011][26022] Updated weights on worker 0-0, policy_version 339202 (0.00090) [2022-07-09 17:01:20,331][25689] Fps is (10 sec: 5748.3, 60 sec: 5645.6, 300 sec: 5647.3). Total num frames: 347344896. Throughput: 0: 5811.3. Samples: 347354066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:20,331][25689] Avg episode reward: [(0, '-47.369')] [2022-07-09 17:01:21,709][26022] Updated weights on worker 0-0, policy_version 339212 (0.00087) [2022-07-09 17:01:23,675][26022] Updated weights on worker 0-0, policy_version 339222 (0.00093) [2022-07-09 17:01:25,323][26022] Updated weights on worker 0-0, policy_version 339232 (0.00087) [2022-07-09 17:01:25,338][25689] Fps is (10 sec: 5727.1, 60 sec: 5645.4, 300 sec: 5650.9). Total num frames: 347372544. Throughput: 0: 5876.1. Samples: 347371376. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:25,338][25689] Avg episode reward: [(0, '-47.204')] [2022-07-09 17:01:27,153][26022] Updated weights on worker 0-0, policy_version 339242 (0.00078) [2022-07-09 17:01:28,900][26022] Updated weights on worker 0-0, policy_version 339252 (0.00091) [2022-07-09 17:01:30,386][25689] Fps is (10 sec: 5601.0, 60 sec: 5642.4, 300 sec: 5651.2). Total num frames: 347401216. Throughput: 0: 5917.8. Samples: 347405444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:30,387][25689] Avg episode reward: [(0, '-46.339')] [2022-07-09 17:01:30,808][26022] Updated weights on worker 0-0, policy_version 339262 (0.00094) [2022-07-09 17:01:32,666][26022] Updated weights on worker 0-0, policy_version 339272 (0.00085) [2022-07-09 17:01:34,194][26022] Updated weights on worker 0-0, policy_version 339282 (0.00082) [2022-07-09 17:01:35,458][25689] Fps is (10 sec: 5666.1, 60 sec: 5630.1, 300 sec: 5646.8). Total num frames: 347429888. Throughput: 0: 5951.8. Samples: 347439724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:35,459][25689] Avg episode reward: [(0, '-45.390')] [2022-07-09 17:01:36,280][26022] Updated weights on worker 0-0, policy_version 339292 (0.00087) [2022-07-09 17:01:37,899][26022] Updated weights on worker 0-0, policy_version 339302 (0.00086) [2022-07-09 17:01:39,695][26022] Updated weights on worker 0-0, policy_version 339312 (0.00092) [2022-07-09 17:01:40,471][25689] Fps is (10 sec: 5787.5, 60 sec: 5650.9, 300 sec: 5653.9). Total num frames: 347459584. Throughput: 0: 5112.0. Samples: 347457024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:40,472][25689] Avg episode reward: [(0, '-45.929')] [2022-07-09 17:01:41,631][26022] Updated weights on worker 0-0, policy_version 339322 (0.00094) [2022-07-09 17:01:43,286][26022] Updated weights on worker 0-0, policy_version 339332 (0.00083) [2022-07-09 17:01:45,218][26022] Updated weights on worker 0-0, policy_version 339342 (0.00091) [2022-07-09 17:01:45,506][25689] Fps is (10 sec: 5809.2, 60 sec: 5651.7, 300 sec: 5654.1). Total num frames: 347488256. Throughput: 0: 5926.0. Samples: 347490892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:45,506][25689] Avg episode reward: [(0, '-45.944')] [2022-07-09 17:01:47,101][26022] Updated weights on worker 0-0, policy_version 339352 (0.00091) [2022-07-09 17:01:48,768][26022] Updated weights on worker 0-0, policy_version 339362 (0.00091) [2022-07-09 17:01:50,510][25689] Fps is (10 sec: 5610.3, 60 sec: 5636.8, 300 sec: 5644.3). Total num frames: 347515904. Throughput: 0: 5931.2. Samples: 347524802. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:50,511][25689] Avg episode reward: [(0, '-45.780')] [2022-07-09 17:01:50,684][26022] Updated weights on worker 0-0, policy_version 339372 (0.00088) [2022-07-09 17:01:52,402][26022] Updated weights on worker 0-0, policy_version 339382 (0.00084) [2022-07-09 17:01:54,375][26022] Updated weights on worker 0-0, policy_version 339392 (0.00090) [2022-07-09 17:01:55,572][25689] Fps is (10 sec: 5594.8, 60 sec: 5637.3, 300 sec: 5647.1). Total num frames: 347544576. Throughput: 0: 5086.1. Samples: 347542026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:01:55,573][25689] Avg episode reward: [(0, '-45.684')] [2022-07-09 17:01:56,032][26022] Updated weights on worker 0-0, policy_version 339402 (0.00119) [2022-07-09 17:01:57,802][26022] Updated weights on worker 0-0, policy_version 339412 (0.00087) [2022-07-09 17:01:59,519][26022] Updated weights on worker 0-0, policy_version 339422 (0.00095) [2022-07-09 17:02:00,602][25689] Fps is (10 sec: 5682.1, 60 sec: 5641.5, 300 sec: 5657.5). Total num frames: 347573248. Throughput: 0: 5922.0. Samples: 347576238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:00,603][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 17:02:01,475][26022] Updated weights on worker 0-0, policy_version 339432 (0.00097) [2022-07-09 17:02:03,615][26022] Updated weights on worker 0-0, policy_version 339442 (0.00088) [2022-07-09 17:02:05,603][26022] Updated weights on worker 0-0, policy_version 339452 (0.00087) [2022-07-09 17:02:05,623][25689] Fps is (10 sec: 5399.9, 60 sec: 5640.1, 300 sec: 5644.2). Total num frames: 347598848. Throughput: 0: 5822.3. Samples: 347608020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:05,623][25689] Avg episode reward: [(0, '-46.544')] [2022-07-09 17:02:07,322][26022] Updated weights on worker 0-0, policy_version 339462 (0.00123) [2022-07-09 17:02:09,231][26022] Updated weights on worker 0-0, policy_version 339472 (0.00095) [2022-07-09 17:02:10,643][25689] Fps is (10 sec: 5303.3, 60 sec: 5622.5, 300 sec: 5645.7). Total num frames: 347626496. Throughput: 0: 4976.1. Samples: 347624984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:10,644][25689] Avg episode reward: [(0, '-46.563')] [2022-07-09 17:02:10,916][26022] Updated weights on worker 0-0, policy_version 339482 (0.00095) [2022-07-09 17:02:12,911][26022] Updated weights on worker 0-0, policy_version 339492 (0.00081) [2022-07-09 17:02:14,524][26022] Updated weights on worker 0-0, policy_version 339502 (0.00087) [2022-07-09 17:02:15,752][25689] Fps is (10 sec: 5661.5, 60 sec: 5643.1, 300 sec: 5644.7). Total num frames: 347656192. Throughput: 0: 5795.7. Samples: 347658980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:15,753][25689] Avg episode reward: [(0, '-46.428')] [2022-07-09 17:02:16,467][26022] Updated weights on worker 0-0, policy_version 339512 (0.00081) [2022-07-09 17:02:18,167][26022] Updated weights on worker 0-0, policy_version 339522 (0.00086) [2022-07-09 17:02:20,071][26022] Updated weights on worker 0-0, policy_version 339532 (0.00093) [2022-07-09 17:02:20,754][25689] Fps is (10 sec: 5873.9, 60 sec: 5643.4, 300 sec: 5645.3). Total num frames: 347685888. Throughput: 0: 5813.3. Samples: 347693386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:20,754][25689] Avg episode reward: [(0, '-46.169')] [2022-07-09 17:02:21,650][26022] Updated weights on worker 0-0, policy_version 339542 (0.00083) [2022-07-09 17:02:23,520][26022] Updated weights on worker 0-0, policy_version 339552 (0.00103) [2022-07-09 17:02:25,240][26022] Updated weights on worker 0-0, policy_version 339562 (0.00111) [2022-07-09 17:02:25,832][25689] Fps is (10 sec: 5688.5, 60 sec: 5636.7, 300 sec: 5647.7). Total num frames: 347713536. Throughput: 0: 5076.0. Samples: 347710604. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:25,833][25689] Avg episode reward: [(0, '-46.364')] [2022-07-09 17:02:27,082][26022] Updated weights on worker 0-0, policy_version 339572 (0.00091) [2022-07-09 17:02:28,974][26022] Updated weights on worker 0-0, policy_version 339582 (0.00087) [2022-07-09 17:02:30,647][26022] Updated weights on worker 0-0, policy_version 339592 (0.00089) [2022-07-09 17:02:30,881][25689] Fps is (10 sec: 5561.0, 60 sec: 5636.7, 300 sec: 5644.5). Total num frames: 347742208. Throughput: 0: 5920.3. Samples: 347744804. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:30,882][25689] Avg episode reward: [(0, '-45.910')] [2022-07-09 17:02:32,550][26022] Updated weights on worker 0-0, policy_version 339602 (0.00090) [2022-07-09 17:02:34,327][26022] Updated weights on worker 0-0, policy_version 339612 (0.00085) [2022-07-09 17:02:35,986][25689] Fps is (10 sec: 5647.5, 60 sec: 5633.6, 300 sec: 5649.7). Total num frames: 347770880. Throughput: 0: 5917.6. Samples: 347778720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:35,988][25689] Avg episode reward: [(0, '-47.011')] [2022-07-09 17:02:36,120][26022] Updated weights on worker 0-0, policy_version 339622 (0.00089) [2022-07-09 17:02:37,871][26022] Updated weights on worker 0-0, policy_version 339632 (0.00081) [2022-07-09 17:02:39,606][26022] Updated weights on worker 0-0, policy_version 339642 (0.00083) [2022-07-09 17:02:40,996][25689] Fps is (10 sec: 5669.6, 60 sec: 5617.1, 300 sec: 5646.7). Total num frames: 347799552. Throughput: 0: 5064.4. Samples: 347795898. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:40,996][25689] Avg episode reward: [(0, '-47.359')] [2022-07-09 17:02:41,624][26022] Updated weights on worker 0-0, policy_version 339652 (0.00085) [2022-07-09 17:02:43,235][26022] Updated weights on worker 0-0, policy_version 339662 (0.00080) [2022-07-09 17:02:45,304][26022] Updated weights on worker 0-0, policy_version 339672 (0.00091) [2022-07-09 17:02:46,055][25689] Fps is (10 sec: 5695.0, 60 sec: 5614.7, 300 sec: 5646.1). Total num frames: 347828224. Throughput: 0: 5900.8. Samples: 347829936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:46,056][25689] Avg episode reward: [(0, '-46.794')] [2022-07-09 17:02:46,885][26022] Updated weights on worker 0-0, policy_version 339682 (0.00080) [2022-07-09 17:02:48,849][26022] Updated weights on worker 0-0, policy_version 339692 (0.00069) [2022-07-09 17:02:50,511][26022] Updated weights on worker 0-0, policy_version 339702 (0.00096) [2022-07-09 17:02:51,135][25689] Fps is (10 sec: 5655.7, 60 sec: 5624.6, 300 sec: 5641.9). Total num frames: 347856896. Throughput: 0: 5873.0. Samples: 347863752. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:51,135][25689] Avg episode reward: [(0, '-46.791')] [2022-07-09 17:02:52,663][26022] Updated weights on worker 0-0, policy_version 339712 (0.00092) [2022-07-09 17:02:54,324][26022] Updated weights on worker 0-0, policy_version 339722 (0.00089) [2022-07-09 17:02:56,115][26022] Updated weights on worker 0-0, policy_version 339732 (0.00096) [2022-07-09 17:02:56,208][25689] Fps is (10 sec: 5648.0, 60 sec: 5623.6, 300 sec: 5644.7). Total num frames: 347885568. Throughput: 0: 5042.9. Samples: 347880698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:02:56,209][25689] Avg episode reward: [(0, '-47.607')] [2022-07-09 17:02:57,885][26022] Updated weights on worker 0-0, policy_version 339742 (0.00089) [2022-07-09 17:02:59,759][26022] Updated weights on worker 0-0, policy_version 339752 (0.00084) [2022-07-09 17:03:01,215][25689] Fps is (10 sec: 5689.0, 60 sec: 5625.8, 300 sec: 5648.1). Total num frames: 347914240. Throughput: 0: 5882.1. Samples: 347914828. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 17:03:01,215][25689] Avg episode reward: [(0, '-46.913')] [2022-07-09 17:03:01,421][26022] Updated weights on worker 0-0, policy_version 339762 (0.00080) [2022-07-09 17:03:03,639][26022] Updated weights on worker 0-0, policy_version 339772 (0.00083) [2022-07-09 17:03:05,355][26022] Updated weights on worker 0-0, policy_version 339782 (0.00093) [2022-07-09 17:03:06,234][25689] Fps is (10 sec: 5413.2, 60 sec: 5625.9, 300 sec: 5644.9). Total num frames: 347939840. Throughput: 0: 5803.0. Samples: 347947034. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:06,235][25689] Avg episode reward: [(0, '-46.009')] [2022-07-09 17:03:07,158][26022] Updated weights on worker 0-0, policy_version 339792 (0.00083) [2022-07-09 17:03:08,939][26022] Updated weights on worker 0-0, policy_version 339802 (0.00082) [2022-07-09 17:03:10,870][26022] Updated weights on worker 0-0, policy_version 339812 (0.00628) [2022-07-09 17:03:11,239][25689] Fps is (10 sec: 5515.9, 60 sec: 5661.0, 300 sec: 5645.6). Total num frames: 347969536. Throughput: 0: 4994.9. Samples: 347964174. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:11,240][25689] Avg episode reward: [(0, '-45.116')] [2022-07-09 17:03:12,689][26022] Updated weights on worker 0-0, policy_version 339822 (0.00086) [2022-07-09 17:03:14,438][26022] Updated weights on worker 0-0, policy_version 339832 (0.00089) [2022-07-09 17:03:15,649][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:03:15,664][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000339838_347994112.pth [2022-07-09 17:03:15,665][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000337852_345960448.pth [2022-07-09 17:03:16,254][26022] Updated weights on worker 0-0, policy_version 339842 (0.00090) [2022-07-09 17:03:16,295][25689] Fps is (10 sec: 5801.3, 60 sec: 5649.1, 300 sec: 5644.7). Total num frames: 347998208. Throughput: 0: 5857.4. Samples: 347998356. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:16,296][25689] Avg episode reward: [(0, '-45.761')] [2022-07-09 17:03:17,958][26022] Updated weights on worker 0-0, policy_version 339852 (0.00092) [2022-07-09 17:03:19,923][26022] Updated weights on worker 0-0, policy_version 339862 (0.00087) [2022-07-09 17:03:21,310][25689] Fps is (10 sec: 5694.0, 60 sec: 5631.0, 300 sec: 5645.1). Total num frames: 348026880. Throughput: 0: 5853.5. Samples: 348032458. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:21,311][25689] Avg episode reward: [(0, '-45.976')] [2022-07-09 17:03:21,679][26022] Updated weights on worker 0-0, policy_version 339872 (0.00084) [2022-07-09 17:03:23,363][26022] Updated weights on worker 0-0, policy_version 339882 (0.00087) [2022-07-09 17:03:25,165][26022] Updated weights on worker 0-0, policy_version 339892 (0.00050) [2022-07-09 17:03:26,324][25689] Fps is (10 sec: 5615.8, 60 sec: 5637.0, 300 sec: 5638.2). Total num frames: 348054528. Throughput: 0: 5110.8. Samples: 348049710. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:26,324][25689] Avg episode reward: [(0, '-46.827')] [2022-07-09 17:03:26,988][26022] Updated weights on worker 0-0, policy_version 339902 (0.00089) [2022-07-09 17:03:28,861][26022] Updated weights on worker 0-0, policy_version 339912 (0.00087) [2022-07-09 17:03:30,725][26022] Updated weights on worker 0-0, policy_version 339922 (0.00054) [2022-07-09 17:03:31,326][25689] Fps is (10 sec: 5623.2, 60 sec: 5641.4, 300 sec: 5642.2). Total num frames: 348083200. Throughput: 0: 5948.5. Samples: 348083660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:31,327][25689] Avg episode reward: [(0, '-47.266')] [2022-07-09 17:03:32,471][26022] Updated weights on worker 0-0, policy_version 339932 (0.00512) [2022-07-09 17:03:34,420][26022] Updated weights on worker 0-0, policy_version 339942 (0.00085) [2022-07-09 17:03:36,235][26022] Updated weights on worker 0-0, policy_version 339952 (0.00088) [2022-07-09 17:03:36,426][25689] Fps is (10 sec: 5676.8, 60 sec: 5641.9, 300 sec: 5637.0). Total num frames: 348111872. Throughput: 0: 5923.1. Samples: 348117590. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:36,426][25689] Avg episode reward: [(0, '-47.435')] [2022-07-09 17:03:37,809][26022] Updated weights on worker 0-0, policy_version 339962 (0.00089) [2022-07-09 17:03:39,693][26022] Updated weights on worker 0-0, policy_version 339972 (0.00086) [2022-07-09 17:03:41,421][26022] Updated weights on worker 0-0, policy_version 339982 (0.00106) [2022-07-09 17:03:41,440][25689] Fps is (10 sec: 5770.9, 60 sec: 5658.3, 300 sec: 5640.4). Total num frames: 348141568. Throughput: 0: 5082.8. Samples: 348134774. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:41,442][25689] Avg episode reward: [(0, '-46.588')] [2022-07-09 17:03:43,341][26022] Updated weights on worker 0-0, policy_version 339992 (0.00090) [2022-07-09 17:03:45,107][26022] Updated weights on worker 0-0, policy_version 340002 (0.00096) [2022-07-09 17:03:46,488][25689] Fps is (10 sec: 5597.0, 60 sec: 5625.6, 300 sec: 5632.7). Total num frames: 348168192. Throughput: 0: 5905.9. Samples: 348168796. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:46,490][25689] Avg episode reward: [(0, '-45.818')] [2022-07-09 17:03:47,135][26022] Updated weights on worker 0-0, policy_version 340012 (0.00090) [2022-07-09 17:03:48,655][26022] Updated weights on worker 0-0, policy_version 340022 (0.00089) [2022-07-09 17:03:50,724][26022] Updated weights on worker 0-0, policy_version 340032 (0.00088) [2022-07-09 17:03:51,495][25689] Fps is (10 sec: 5601.5, 60 sec: 5649.3, 300 sec: 5643.7). Total num frames: 348197888. Throughput: 0: 5912.8. Samples: 348202912. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:51,497][25689] Avg episode reward: [(0, '-45.444')] [2022-07-09 17:03:52,392][26022] Updated weights on worker 0-0, policy_version 340042 (0.00087) [2022-07-09 17:03:54,302][26022] Updated weights on worker 0-0, policy_version 340052 (0.00090) [2022-07-09 17:03:56,019][26022] Updated weights on worker 0-0, policy_version 340062 (0.00095) [2022-07-09 17:03:56,591][25689] Fps is (10 sec: 5676.2, 60 sec: 5630.2, 300 sec: 5635.2). Total num frames: 348225536. Throughput: 0: 5062.5. Samples: 348219676. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:03:56,592][25689] Avg episode reward: [(0, '-45.200')] [2022-07-09 17:03:57,671][26022] Updated weights on worker 0-0, policy_version 340072 (0.00088) [2022-07-09 17:03:59,627][26022] Updated weights on worker 0-0, policy_version 340082 (0.00085) [2022-07-09 17:04:01,259][26022] Updated weights on worker 0-0, policy_version 340092 (0.00096) [2022-07-09 17:04:01,603][25689] Fps is (10 sec: 5571.8, 60 sec: 5629.7, 300 sec: 5642.4). Total num frames: 348254208. Throughput: 0: 5917.3. Samples: 348254080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:01,603][25689] Avg episode reward: [(0, '-45.301')] [2022-07-09 17:04:03,667][26022] Updated weights on worker 0-0, policy_version 340102 (0.00080) [2022-07-09 17:04:05,239][26022] Updated weights on worker 0-0, policy_version 340112 (0.00086) [2022-07-09 17:04:06,629][25689] Fps is (10 sec: 5508.4, 60 sec: 5646.0, 300 sec: 5635.2). Total num frames: 348280832. Throughput: 0: 5840.3. Samples: 348286426. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:06,630][25689] Avg episode reward: [(0, '-44.958')] [2022-07-09 17:04:07,004][26022] Updated weights on worker 0-0, policy_version 340122 (0.00088) [2022-07-09 17:04:09,023][26022] Updated weights on worker 0-0, policy_version 340132 (0.00083) [2022-07-09 17:04:10,790][26022] Updated weights on worker 0-0, policy_version 340142 (0.00093) [2022-07-09 17:04:11,703][25689] Fps is (10 sec: 5576.3, 60 sec: 5639.7, 300 sec: 5638.3). Total num frames: 348310528. Throughput: 0: 4978.6. Samples: 348303518. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:11,703][25689] Avg episode reward: [(0, '-45.833')] [2022-07-09 17:04:12,630][26022] Updated weights on worker 0-0, policy_version 340152 (0.00098) [2022-07-09 17:04:14,553][26022] Updated weights on worker 0-0, policy_version 340162 (0.00086) [2022-07-09 17:04:15,923][26022] Updated weights on worker 0-0, policy_version 340172 (0.00083) [2022-07-09 17:04:16,758][25689] Fps is (10 sec: 5762.8, 60 sec: 5639.7, 300 sec: 5633.8). Total num frames: 348339200. Throughput: 0: 5840.9. Samples: 348337468. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:16,758][25689] Avg episode reward: [(0, '-47.453')] [2022-07-09 17:04:18,008][26022] Updated weights on worker 0-0, policy_version 340182 (0.00095) [2022-07-09 17:04:19,735][26022] Updated weights on worker 0-0, policy_version 340192 (0.00084) [2022-07-09 17:04:21,547][26022] Updated weights on worker 0-0, policy_version 340202 (0.00088) [2022-07-09 17:04:21,762][25689] Fps is (10 sec: 5700.7, 60 sec: 5640.8, 300 sec: 5640.8). Total num frames: 348367872. Throughput: 0: 5826.1. Samples: 348371528. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:21,762][25689] Avg episode reward: [(0, '-47.641')] [2022-07-09 17:04:23,617][26022] Updated weights on worker 0-0, policy_version 340212 (0.00088) [2022-07-09 17:04:25,241][26022] Updated weights on worker 0-0, policy_version 340222 (0.00081) [2022-07-09 17:04:26,777][25689] Fps is (10 sec: 5519.0, 60 sec: 5623.7, 300 sec: 5630.3). Total num frames: 348394496. Throughput: 0: 5074.4. Samples: 348388664. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:26,777][25689] Avg episode reward: [(0, '-47.193')] [2022-07-09 17:04:27,019][26022] Updated weights on worker 0-0, policy_version 340232 (0.00086) [2022-07-09 17:04:28,595][26022] Updated weights on worker 0-0, policy_version 340242 (0.00084) [2022-07-09 17:04:30,604][26022] Updated weights on worker 0-0, policy_version 340252 (0.00096) [2022-07-09 17:04:31,806][25689] Fps is (10 sec: 5709.2, 60 sec: 5655.1, 300 sec: 5640.9). Total num frames: 348425216. Throughput: 0: 5946.0. Samples: 348423052. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:31,806][25689] Avg episode reward: [(0, '-46.352')] [2022-07-09 17:04:32,366][26022] Updated weights on worker 0-0, policy_version 340262 (0.00086) [2022-07-09 17:04:34,174][26022] Updated weights on worker 0-0, policy_version 340272 (0.00087) [2022-07-09 17:04:36,045][26022] Updated weights on worker 0-0, policy_version 340282 (0.00082) [2022-07-09 17:04:36,917][25689] Fps is (10 sec: 5857.2, 60 sec: 5654.0, 300 sec: 5638.9). Total num frames: 348453888. Throughput: 0: 5943.8. Samples: 348457290. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:36,917][25689] Avg episode reward: [(0, '-46.357')] [2022-07-09 17:04:37,829][26022] Updated weights on worker 0-0, policy_version 340292 (0.00090) [2022-07-09 17:04:39,719][26022] Updated weights on worker 0-0, policy_version 340302 (0.00083) [2022-07-09 17:04:41,344][26022] Updated weights on worker 0-0, policy_version 340312 (0.00099) [2022-07-09 17:04:41,983][25689] Fps is (10 sec: 5533.8, 60 sec: 5615.3, 300 sec: 5637.9). Total num frames: 348481536. Throughput: 0: 5935.4. Samples: 348491552. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:41,984][25689] Avg episode reward: [(0, '-45.935')] [2022-07-09 17:04:43,065][26022] Updated weights on worker 0-0, policy_version 340322 (0.00094) [2022-07-09 17:04:44,938][26022] Updated weights on worker 0-0, policy_version 340332 (0.00089) [2022-07-09 17:04:46,599][26022] Updated weights on worker 0-0, policy_version 340342 (0.00075) [2022-07-09 17:04:46,998][25689] Fps is (10 sec: 5688.1, 60 sec: 5669.1, 300 sec: 5641.4). Total num frames: 348511232. Throughput: 0: 5924.4. Samples: 348508464. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:46,999][25689] Avg episode reward: [(0, '-45.463')] [2022-07-09 17:04:48,588][26022] Updated weights on worker 0-0, policy_version 340352 (0.00086) [2022-07-09 17:04:50,271][26022] Updated weights on worker 0-0, policy_version 340362 (0.00084) [2022-07-09 17:04:52,053][25689] Fps is (10 sec: 5796.5, 60 sec: 5647.7, 300 sec: 5639.3). Total num frames: 348539904. Throughput: 0: 5923.1. Samples: 348542978. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:52,053][25689] Avg episode reward: [(0, '-45.130')] [2022-07-09 17:04:52,111][26022] Updated weights on worker 0-0, policy_version 340372 (0.00098) [2022-07-09 17:04:53,823][26022] Updated weights on worker 0-0, policy_version 340382 (0.00085) [2022-07-09 17:04:55,781][26022] Updated weights on worker 0-0, policy_version 340392 (0.00086) [2022-07-09 17:04:57,195][25689] Fps is (10 sec: 5724.4, 60 sec: 5677.3, 300 sec: 5640.9). Total num frames: 348569600. Throughput: 0: 5905.8. Samples: 348577046. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:04:57,195][25689] Avg episode reward: [(0, '-44.493')] [2022-07-09 17:04:57,403][26022] Updated weights on worker 0-0, policy_version 340402 (0.00086) [2022-07-09 17:04:59,410][26022] Updated weights on worker 0-0, policy_version 340412 (0.00090) [2022-07-09 17:05:01,004][26022] Updated weights on worker 0-0, policy_version 340422 (0.00094) [2022-07-09 17:05:02,270][25689] Fps is (10 sec: 5412.2, 60 sec: 5620.7, 300 sec: 5636.1). Total num frames: 348595200. Throughput: 0: 5059.1. Samples: 348594184. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:02,270][25689] Avg episode reward: [(0, '-45.219')] [2022-07-09 17:05:03,392][26022] Updated weights on worker 0-0, policy_version 340432 (0.00086) [2022-07-09 17:05:04,922][26022] Updated weights on worker 0-0, policy_version 340442 (0.00078) [2022-07-09 17:05:06,968][26022] Updated weights on worker 0-0, policy_version 340452 (0.00097) [2022-07-09 17:05:07,321][25689] Fps is (10 sec: 5359.6, 60 sec: 5652.2, 300 sec: 5639.1). Total num frames: 348623872. Throughput: 0: 5796.3. Samples: 348626260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:07,321][25689] Avg episode reward: [(0, '-45.917')] [2022-07-09 17:05:08,732][26022] Updated weights on worker 0-0, policy_version 340462 (0.00093) [2022-07-09 17:05:10,695][26022] Updated weights on worker 0-0, policy_version 340472 (0.00092) [2022-07-09 17:05:12,229][26022] Updated weights on worker 0-0, policy_version 340482 (0.00079) [2022-07-09 17:05:12,327][25689] Fps is (10 sec: 5803.3, 60 sec: 5658.4, 300 sec: 5646.8). Total num frames: 348653568. Throughput: 0: 5787.4. Samples: 348660314. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:12,329][25689] Avg episode reward: [(0, '-45.500')] [2022-07-09 17:05:14,315][26022] Updated weights on worker 0-0, policy_version 340492 (0.00495) [2022-07-09 17:05:15,765][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:05:15,774][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000340501_348673024.pth [2022-07-09 17:05:15,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000338516_346640384.pth [2022-07-09 17:05:15,920][26022] Updated weights on worker 0-0, policy_version 340502 (0.00087) [2022-07-09 17:05:17,375][25689] Fps is (10 sec: 5703.6, 60 sec: 5642.2, 300 sec: 5639.1). Total num frames: 348681216. Throughput: 0: 4955.8. Samples: 348677054. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:17,375][25689] Avg episode reward: [(0, '-45.967')] [2022-07-09 17:05:17,853][26022] Updated weights on worker 0-0, policy_version 340512 (0.00085) [2022-07-09 17:05:19,588][26022] Updated weights on worker 0-0, policy_version 340522 (0.00090) [2022-07-09 17:05:21,327][26022] Updated weights on worker 0-0, policy_version 340532 (0.00100) [2022-07-09 17:05:22,400][25689] Fps is (10 sec: 5591.4, 60 sec: 5640.3, 300 sec: 5642.1). Total num frames: 348709888. Throughput: 0: 5820.1. Samples: 348711344. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:22,402][25689] Avg episode reward: [(0, '-46.271')] [2022-07-09 17:05:23,326][26022] Updated weights on worker 0-0, policy_version 340542 (0.00090) [2022-07-09 17:05:24,992][26022] Updated weights on worker 0-0, policy_version 340552 (0.00090) [2022-07-09 17:05:26,895][26022] Updated weights on worker 0-0, policy_version 340562 (0.00095) [2022-07-09 17:05:27,423][25689] Fps is (10 sec: 5707.0, 60 sec: 5673.3, 300 sec: 5642.0). Total num frames: 348738560. Throughput: 0: 5936.2. Samples: 348745588. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:27,423][25689] Avg episode reward: [(0, '-46.955')] [2022-07-09 17:05:28,438][26022] Updated weights on worker 0-0, policy_version 340572 (0.00092) [2022-07-09 17:05:30,476][26022] Updated weights on worker 0-0, policy_version 340582 (0.00088) [2022-07-09 17:05:32,106][26022] Updated weights on worker 0-0, policy_version 340592 (0.00085) [2022-07-09 17:05:32,437][25689] Fps is (10 sec: 5713.4, 60 sec: 5640.9, 300 sec: 5640.6). Total num frames: 348767232. Throughput: 0: 5097.3. Samples: 348762818. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2022-07-09 17:05:32,437][25689] Avg episode reward: [(0, '-47.085')] [2022-07-09 17:05:33,934][26022] Updated weights on worker 0-0, policy_version 340602 (0.00093) [2022-07-09 17:05:35,842][26022] Updated weights on worker 0-0, policy_version 340612 (0.00086) [2022-07-09 17:05:37,490][25689] Fps is (10 sec: 5695.9, 60 sec: 5646.3, 300 sec: 5640.7). Total num frames: 348795904. Throughput: 0: 5969.3. Samples: 348797130. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:05:37,491][25689] Avg episode reward: [(0, '-45.926')] [2022-07-09 17:05:37,708][26022] Updated weights on worker 0-0, policy_version 340622 (0.00085) [2022-07-09 17:05:39,230][26022] Updated weights on worker 0-0, policy_version 340632 (0.00092) [2022-07-09 17:05:41,238][26022] Updated weights on worker 0-0, policy_version 340642 (0.00093) [2022-07-09 17:05:42,521][25689] Fps is (10 sec: 5686.4, 60 sec: 5666.5, 300 sec: 5640.9). Total num frames: 348824576. Throughput: 0: 5968.1. Samples: 348831430. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:05:42,522][25689] Avg episode reward: [(0, '-46.910')] [2022-07-09 17:05:42,953][26022] Updated weights on worker 0-0, policy_version 340652 (0.00090) [2022-07-09 17:05:44,935][26022] Updated weights on worker 0-0, policy_version 340662 (0.00085) [2022-07-09 17:05:46,655][26022] Updated weights on worker 0-0, policy_version 340672 (0.00088) [2022-07-09 17:05:47,528][25689] Fps is (10 sec: 5611.0, 60 sec: 5633.5, 300 sec: 5637.8). Total num frames: 348852224. Throughput: 0: 5110.8. Samples: 348848340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:05:47,528][25689] Avg episode reward: [(0, '-46.211')] [2022-07-09 17:05:48,458][26022] Updated weights on worker 0-0, policy_version 340682 (0.00087) [2022-07-09 17:05:50,140][26022] Updated weights on worker 0-0, policy_version 340692 (0.00091) [2022-07-09 17:05:51,991][26022] Updated weights on worker 0-0, policy_version 340702 (0.00109) [2022-07-09 17:05:52,555][25689] Fps is (10 sec: 5612.7, 60 sec: 5636.0, 300 sec: 5638.6). Total num frames: 348880896. Throughput: 0: 5951.1. Samples: 348882546. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:05:52,556][25689] Avg episode reward: [(0, '-46.658')] [2022-07-09 17:05:53,916][26022] Updated weights on worker 0-0, policy_version 340712 (0.01517) [2022-07-09 17:05:55,684][26022] Updated weights on worker 0-0, policy_version 340722 (0.00091) [2022-07-09 17:05:57,322][26022] Updated weights on worker 0-0, policy_version 340732 (0.00083) [2022-07-09 17:05:57,609][25689] Fps is (10 sec: 5789.6, 60 sec: 5644.2, 300 sec: 5642.4). Total num frames: 348910592. Throughput: 0: 5923.3. Samples: 348916300. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:05:57,610][25689] Avg episode reward: [(0, '-46.525')] [2022-07-09 17:05:59,141][26022] Updated weights on worker 0-0, policy_version 340742 (0.00087) [2022-07-09 17:06:01,202][26022] Updated weights on worker 0-0, policy_version 340752 (0.00085) [2022-07-09 17:06:02,648][25689] Fps is (10 sec: 5478.9, 60 sec: 5647.6, 300 sec: 5641.8). Total num frames: 348936192. Throughput: 0: 5067.6. Samples: 348933424. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:02,649][25689] Avg episode reward: [(0, '-47.010')] [2022-07-09 17:06:03,250][26022] Updated weights on worker 0-0, policy_version 340762 (0.00093) [2022-07-09 17:06:05,043][26022] Updated weights on worker 0-0, policy_version 340772 (0.00082) [2022-07-09 17:06:06,717][26022] Updated weights on worker 0-0, policy_version 340782 (0.00087) [2022-07-09 17:06:07,678][25689] Fps is (10 sec: 5288.6, 60 sec: 5632.6, 300 sec: 5638.0). Total num frames: 348963840. Throughput: 0: 5806.1. Samples: 348965334. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:07,678][25689] Avg episode reward: [(0, '-46.751')] [2022-07-09 17:06:08,596][26022] Updated weights on worker 0-0, policy_version 340792 (0.00090) [2022-07-09 17:06:10,525][26022] Updated weights on worker 0-0, policy_version 340802 (0.00087) [2022-07-09 17:06:12,350][26022] Updated weights on worker 0-0, policy_version 340812 (0.00087) [2022-07-09 17:06:12,695][25689] Fps is (10 sec: 5707.7, 60 sec: 5631.6, 300 sec: 5643.9). Total num frames: 348993536. Throughput: 0: 5815.2. Samples: 348999660. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:12,695][25689] Avg episode reward: [(0, '-46.248')] [2022-07-09 17:06:13,932][26022] Updated weights on worker 0-0, policy_version 340822 (0.00091) [2022-07-09 17:06:15,899][26022] Updated weights on worker 0-0, policy_version 340832 (0.00083) [2022-07-09 17:06:17,679][26022] Updated weights on worker 0-0, policy_version 340842 (0.00095) [2022-07-09 17:06:17,747][25689] Fps is (10 sec: 5796.8, 60 sec: 5648.2, 300 sec: 5639.6). Total num frames: 349022208. Throughput: 0: 4984.9. Samples: 349016684. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:17,747][25689] Avg episode reward: [(0, '-47.717')] [2022-07-09 17:06:19,656][26022] Updated weights on worker 0-0, policy_version 340852 (0.00092) [2022-07-09 17:06:21,268][26022] Updated weights on worker 0-0, policy_version 340862 (0.00083) [2022-07-09 17:06:22,756][25689] Fps is (10 sec: 5699.7, 60 sec: 5649.7, 300 sec: 5643.0). Total num frames: 349050880. Throughput: 0: 5829.0. Samples: 349050632. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:22,756][25689] Avg episode reward: [(0, '-47.342')] [2022-07-09 17:06:23,078][26022] Updated weights on worker 0-0, policy_version 340872 (0.00086) [2022-07-09 17:06:24,860][26022] Updated weights on worker 0-0, policy_version 340882 (0.00088) [2022-07-09 17:06:26,919][26022] Updated weights on worker 0-0, policy_version 340892 (0.00083) [2022-07-09 17:06:27,768][25689] Fps is (10 sec: 5722.2, 60 sec: 5650.7, 300 sec: 5643.7). Total num frames: 349079552. Throughput: 0: 5965.1. Samples: 349085174. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:27,768][25689] Avg episode reward: [(0, '-48.118')] [2022-07-09 17:06:28,450][26022] Updated weights on worker 0-0, policy_version 340902 (0.00087) [2022-07-09 17:06:30,430][26022] Updated weights on worker 0-0, policy_version 340912 (0.00088) [2022-07-09 17:06:32,191][26022] Updated weights on worker 0-0, policy_version 340922 (0.00084) [2022-07-09 17:06:32,777][25689] Fps is (10 sec: 5722.1, 60 sec: 5651.1, 300 sec: 5644.9). Total num frames: 349108224. Throughput: 0: 5111.1. Samples: 349102302. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:32,778][25689] Avg episode reward: [(0, '-48.516')] [2022-07-09 17:06:33,987][26022] Updated weights on worker 0-0, policy_version 340932 (0.00081) [2022-07-09 17:06:35,748][26022] Updated weights on worker 0-0, policy_version 340942 (0.00084) [2022-07-09 17:06:37,568][26022] Updated weights on worker 0-0, policy_version 340952 (0.00098) [2022-07-09 17:06:37,831][25689] Fps is (10 sec: 5596.9, 60 sec: 5634.2, 300 sec: 5637.2). Total num frames: 349135872. Throughput: 0: 5954.9. Samples: 349136282. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:37,831][25689] Avg episode reward: [(0, '-47.788')] [2022-07-09 17:06:39,336][26022] Updated weights on worker 0-0, policy_version 340962 (0.00084) [2022-07-09 17:06:41,153][26022] Updated weights on worker 0-0, policy_version 340972 (0.00087) [2022-07-09 17:06:42,837][25689] Fps is (10 sec: 5598.5, 60 sec: 5636.5, 300 sec: 5637.8). Total num frames: 349164544. Throughput: 0: 5970.6. Samples: 349170530. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:42,837][25689] Avg episode reward: [(0, '-47.197')] [2022-07-09 17:06:42,964][26022] Updated weights on worker 0-0, policy_version 340982 (0.00088) [2022-07-09 17:06:44,775][26022] Updated weights on worker 0-0, policy_version 340992 (0.00089) [2022-07-09 17:06:46,604][26022] Updated weights on worker 0-0, policy_version 341002 (0.00088) [2022-07-09 17:06:47,865][25689] Fps is (10 sec: 5612.6, 60 sec: 5634.4, 300 sec: 5637.3). Total num frames: 349192192. Throughput: 0: 5094.4. Samples: 349187558. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:47,866][25689] Avg episode reward: [(0, '-46.456')] [2022-07-09 17:06:48,356][26022] Updated weights on worker 0-0, policy_version 341012 (0.00085) [2022-07-09 17:06:50,217][26022] Updated weights on worker 0-0, policy_version 341022 (0.00089) [2022-07-09 17:06:51,894][26022] Updated weights on worker 0-0, policy_version 341032 (0.00092) [2022-07-09 17:06:52,871][25689] Fps is (10 sec: 5715.2, 60 sec: 5653.5, 300 sec: 5641.8). Total num frames: 349221888. Throughput: 0: 5945.2. Samples: 349221762. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:52,871][25689] Avg episode reward: [(0, '-46.109')] [2022-07-09 17:06:53,733][26022] Updated weights on worker 0-0, policy_version 341042 (0.00083) [2022-07-09 17:06:55,574][26022] Updated weights on worker 0-0, policy_version 341052 (0.00088) [2022-07-09 17:06:57,351][26022] Updated weights on worker 0-0, policy_version 341062 (0.00086) [2022-07-09 17:06:57,943][25689] Fps is (10 sec: 5690.4, 60 sec: 5617.8, 300 sec: 5637.6). Total num frames: 349249536. Throughput: 0: 5948.1. Samples: 349255910. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:06:57,943][25689] Avg episode reward: [(0, '-45.562')] [2022-07-09 17:06:59,218][26022] Updated weights on worker 0-0, policy_version 341072 (0.00092) [2022-07-09 17:07:01,064][26022] Updated weights on worker 0-0, policy_version 341082 (0.00090) [2022-07-09 17:07:02,956][25689] Fps is (10 sec: 5381.4, 60 sec: 5637.2, 300 sec: 5641.2). Total num frames: 349276160. Throughput: 0: 5091.6. Samples: 349272966. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:02,957][25689] Avg episode reward: [(0, '-46.512')] [2022-07-09 17:07:03,105][26022] Updated weights on worker 0-0, policy_version 341092 (0.00091) [2022-07-09 17:07:05,040][26022] Updated weights on worker 0-0, policy_version 341102 (0.00088) [2022-07-09 17:07:06,588][26022] Updated weights on worker 0-0, policy_version 341112 (0.00084) [2022-07-09 17:07:07,986][25689] Fps is (10 sec: 5505.8, 60 sec: 5654.2, 300 sec: 5644.4). Total num frames: 349304832. Throughput: 0: 5840.3. Samples: 349305068. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:07,987][25689] Avg episode reward: [(0, '-46.743')] [2022-07-09 17:07:08,494][26022] Updated weights on worker 0-0, policy_version 341122 (0.00093) [2022-07-09 17:07:10,492][26022] Updated weights on worker 0-0, policy_version 341132 (0.00095) [2022-07-09 17:07:11,907][26022] Updated weights on worker 0-0, policy_version 341142 (0.00094) [2022-07-09 17:07:13,001][25689] Fps is (10 sec: 5810.7, 60 sec: 5654.4, 300 sec: 5646.2). Total num frames: 349334528. Throughput: 0: 5847.2. Samples: 349339466. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:13,001][25689] Avg episode reward: [(0, '-47.671')] [2022-07-09 17:07:14,072][26022] Updated weights on worker 0-0, policy_version 341152 (0.00090) [2022-07-09 17:07:15,568][26022] Updated weights on worker 0-0, policy_version 341162 (0.00097) [2022-07-09 17:07:15,810][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:07:15,822][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000341164_349351936.pth [2022-07-09 17:07:15,822][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000339177_347317248.pth [2022-07-09 17:07:17,591][26022] Updated weights on worker 0-0, policy_version 341172 (0.00084) [2022-07-09 17:07:18,110][25689] Fps is (10 sec: 5866.5, 60 sec: 5666.0, 300 sec: 5644.2). Total num frames: 349364224. Throughput: 0: 4986.4. Samples: 349356472. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:18,110][25689] Avg episode reward: [(0, '-47.913')] [2022-07-09 17:07:19,393][26022] Updated weights on worker 0-0, policy_version 341182 (0.00087) [2022-07-09 17:07:21,118][26022] Updated weights on worker 0-0, policy_version 341192 (0.00095) [2022-07-09 17:07:23,044][26022] Updated weights on worker 0-0, policy_version 341202 (0.00091) [2022-07-09 17:07:23,120][25689] Fps is (10 sec: 5565.5, 60 sec: 5631.9, 300 sec: 5642.1). Total num frames: 349390848. Throughput: 0: 5826.9. Samples: 349390462. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:23,120][25689] Avg episode reward: [(0, '-48.289')] [2022-07-09 17:07:24,802][26022] Updated weights on worker 0-0, policy_version 341212 (0.00082) [2022-07-09 17:07:26,536][26022] Updated weights on worker 0-0, policy_version 341222 (0.00085) [2022-07-09 17:07:28,179][25689] Fps is (10 sec: 5491.7, 60 sec: 5627.6, 300 sec: 5641.9). Total num frames: 349419520. Throughput: 0: 5935.0. Samples: 349424912. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:28,180][25689] Avg episode reward: [(0, '-48.092')] [2022-07-09 17:07:28,358][26022] Updated weights on worker 0-0, policy_version 341232 (0.00088) [2022-07-09 17:07:29,960][26022] Updated weights on worker 0-0, policy_version 341242 (0.00088) [2022-07-09 17:07:32,153][26022] Updated weights on worker 0-0, policy_version 341252 (0.00092) [2022-07-09 17:07:33,216][25689] Fps is (10 sec: 5882.6, 60 sec: 5658.9, 300 sec: 5650.0). Total num frames: 349450240. Throughput: 0: 5078.3. Samples: 349442126. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:33,218][25689] Avg episode reward: [(0, '-47.742')] [2022-07-09 17:07:33,723][26022] Updated weights on worker 0-0, policy_version 341262 (0.00088) [2022-07-09 17:07:35,639][26022] Updated weights on worker 0-0, policy_version 341272 (0.00095) [2022-07-09 17:07:37,376][26022] Updated weights on worker 0-0, policy_version 341282 (0.00090) [2022-07-09 17:07:38,267][25689] Fps is (10 sec: 5785.5, 60 sec: 5659.1, 300 sec: 5645.8). Total num frames: 349477888. Throughput: 0: 5939.2. Samples: 349476192. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:38,268][25689] Avg episode reward: [(0, '-47.127')] [2022-07-09 17:07:39,060][26022] Updated weights on worker 0-0, policy_version 341292 (0.00090) [2022-07-09 17:07:41,112][26022] Updated weights on worker 0-0, policy_version 341302 (0.00086) [2022-07-09 17:07:42,613][26022] Updated weights on worker 0-0, policy_version 341312 (0.00091) [2022-07-09 17:07:43,340][25689] Fps is (10 sec: 5562.8, 60 sec: 5652.8, 300 sec: 5645.5). Total num frames: 349506560. Throughput: 0: 5933.0. Samples: 349510430. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:43,341][25689] Avg episode reward: [(0, '-46.647')] [2022-07-09 17:07:44,496][26022] Updated weights on worker 0-0, policy_version 341322 (0.00078) [2022-07-09 17:07:46,253][26022] Updated weights on worker 0-0, policy_version 341332 (0.00092) [2022-07-09 17:07:48,088][26022] Updated weights on worker 0-0, policy_version 341342 (0.00079) [2022-07-09 17:07:48,439][25689] Fps is (10 sec: 5738.6, 60 sec: 5680.1, 300 sec: 5648.6). Total num frames: 349536256. Throughput: 0: 5908.6. Samples: 349544620. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:48,439][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 17:07:49,975][26022] Updated weights on worker 0-0, policy_version 341352 (0.00085) [2022-07-09 17:07:51,540][26022] Updated weights on worker 0-0, policy_version 341362 (0.00085) [2022-07-09 17:07:53,398][26022] Updated weights on worker 0-0, policy_version 341372 (0.00086) [2022-07-09 17:07:53,449][25689] Fps is (10 sec: 5774.0, 60 sec: 5662.7, 300 sec: 5649.8). Total num frames: 349564928. Throughput: 0: 5921.2. Samples: 349561930. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:53,453][25689] Avg episode reward: [(0, '-47.227')] [2022-07-09 17:07:55,150][26022] Updated weights on worker 0-0, policy_version 341382 (0.00097) [2022-07-09 17:07:57,135][26022] Updated weights on worker 0-0, policy_version 341392 (0.00086) [2022-07-09 17:07:58,529][25689] Fps is (10 sec: 5683.0, 60 sec: 5678.9, 300 sec: 5648.4). Total num frames: 349593600. Throughput: 0: 5927.4. Samples: 349596290. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:07:58,530][25689] Avg episode reward: [(0, '-47.058')] [2022-07-09 17:07:58,855][26022] Updated weights on worker 0-0, policy_version 341402 (0.00091) [2022-07-09 17:08:00,692][26022] Updated weights on worker 0-0, policy_version 341412 (0.00085) [2022-07-09 17:08:02,703][26022] Updated weights on worker 0-0, policy_version 341422 (0.00097) [2022-07-09 17:08:03,560][25689] Fps is (10 sec: 5367.6, 60 sec: 5660.3, 300 sec: 5648.2). Total num frames: 349619200. Throughput: 0: 5833.8. Samples: 349628388. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 17:08:03,561][25689] Avg episode reward: [(0, '-46.887')] [2022-07-09 17:08:04,584][26022] Updated weights on worker 0-0, policy_version 341432 (0.00093) [2022-07-09 17:08:06,416][26022] Updated weights on worker 0-0, policy_version 341442 (0.00091) [2022-07-09 17:08:08,279][26022] Updated weights on worker 0-0, policy_version 341452 (0.00082) [2022-07-09 17:08:08,594][25689] Fps is (10 sec: 5392.2, 60 sec: 5660.0, 300 sec: 5644.2). Total num frames: 349647872. Throughput: 0: 5005.2. Samples: 349645502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:08,594][25689] Avg episode reward: [(0, '-47.888')] [2022-07-09 17:08:09,862][26022] Updated weights on worker 0-0, policy_version 341462 (0.00085) [2022-07-09 17:08:11,927][26022] Updated weights on worker 0-0, policy_version 341472 (0.00089) [2022-07-09 17:08:13,515][26022] Updated weights on worker 0-0, policy_version 341482 (0.00095) [2022-07-09 17:08:13,610][25689] Fps is (10 sec: 5807.8, 60 sec: 5659.8, 300 sec: 5648.4). Total num frames: 349677568. Throughput: 0: 5856.3. Samples: 349679998. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:13,611][25689] Avg episode reward: [(0, '-47.559')] [2022-07-09 17:08:15,555][26022] Updated weights on worker 0-0, policy_version 341492 (0.00087) [2022-07-09 17:08:17,106][26022] Updated weights on worker 0-0, policy_version 341502 (0.00094) [2022-07-09 17:08:18,707][25689] Fps is (10 sec: 5670.1, 60 sec: 5627.2, 300 sec: 5643.4). Total num frames: 349705216. Throughput: 0: 5815.7. Samples: 349713642. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:18,708][25689] Avg episode reward: [(0, '-46.602')] [2022-07-09 17:08:19,183][26022] Updated weights on worker 0-0, policy_version 341512 (0.00088) [2022-07-09 17:08:21,046][26022] Updated weights on worker 0-0, policy_version 341522 (0.00091) [2022-07-09 17:08:22,398][26022] Updated weights on worker 0-0, policy_version 341532 (0.00086) [2022-07-09 17:08:23,771][25689] Fps is (10 sec: 5543.1, 60 sec: 5656.0, 300 sec: 5645.9). Total num frames: 349733888. Throughput: 0: 5062.7. Samples: 349730708. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:23,771][25689] Avg episode reward: [(0, '-46.889')] [2022-07-09 17:08:24,523][26022] Updated weights on worker 0-0, policy_version 341542 (0.00083) [2022-07-09 17:08:26,443][26022] Updated weights on worker 0-0, policy_version 341552 (0.00095) [2022-07-09 17:08:28,264][26022] Updated weights on worker 0-0, policy_version 341562 (0.00092) [2022-07-09 17:08:28,794][25689] Fps is (10 sec: 5685.1, 60 sec: 5659.3, 300 sec: 5645.5). Total num frames: 349762560. Throughput: 0: 5875.0. Samples: 349764178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:28,795][25689] Avg episode reward: [(0, '-46.400')] [2022-07-09 17:08:30,135][26022] Updated weights on worker 0-0, policy_version 341572 (0.00086) [2022-07-09 17:08:31,686][26022] Updated weights on worker 0-0, policy_version 341582 (0.00085) [2022-07-09 17:08:33,563][26022] Updated weights on worker 0-0, policy_version 341592 (0.00095) [2022-07-09 17:08:33,804][25689] Fps is (10 sec: 5715.3, 60 sec: 5628.1, 300 sec: 5647.2). Total num frames: 349791232. Throughput: 0: 5854.9. Samples: 349798230. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:33,805][25689] Avg episode reward: [(0, '-47.075')] [2022-07-09 17:08:35,540][26022] Updated weights on worker 0-0, policy_version 341602 (0.00084) [2022-07-09 17:08:37,182][26022] Updated weights on worker 0-0, policy_version 341612 (0.00089) [2022-07-09 17:08:38,855][25689] Fps is (10 sec: 5496.1, 60 sec: 5611.2, 300 sec: 5636.2). Total num frames: 349817856. Throughput: 0: 5033.8. Samples: 349815062. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:38,857][25689] Avg episode reward: [(0, '-46.601')] [2022-07-09 17:08:39,232][26022] Updated weights on worker 0-0, policy_version 341622 (0.00088) [2022-07-09 17:08:40,843][26022] Updated weights on worker 0-0, policy_version 341632 (0.00089) [2022-07-09 17:08:42,678][26022] Updated weights on worker 0-0, policy_version 341642 (0.00540) [2022-07-09 17:08:43,872][25689] Fps is (10 sec: 5594.2, 60 sec: 5633.3, 300 sec: 5647.1). Total num frames: 349847552. Throughput: 0: 5901.4. Samples: 349849332. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:43,872][25689] Avg episode reward: [(0, '-45.389')] [2022-07-09 17:08:44,641][26022] Updated weights on worker 0-0, policy_version 341652 (0.00085) [2022-07-09 17:08:46,259][26022] Updated weights on worker 0-0, policy_version 341662 (0.00087) [2022-07-09 17:08:48,144][26022] Updated weights on worker 0-0, policy_version 341672 (0.00087) [2022-07-09 17:08:48,906][25689] Fps is (10 sec: 5807.0, 60 sec: 5622.3, 300 sec: 5643.1). Total num frames: 349876224. Throughput: 0: 5927.7. Samples: 349883398. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:48,907][25689] Avg episode reward: [(0, '-45.600')] [2022-07-09 17:08:50,072][26022] Updated weights on worker 0-0, policy_version 341682 (0.00086) [2022-07-09 17:08:51,752][26022] Updated weights on worker 0-0, policy_version 341692 (0.00092) [2022-07-09 17:08:53,626][26022] Updated weights on worker 0-0, policy_version 341702 (0.00092) [2022-07-09 17:08:53,931][25689] Fps is (10 sec: 5598.9, 60 sec: 5604.1, 300 sec: 5644.5). Total num frames: 349903872. Throughput: 0: 5057.2. Samples: 349900014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:53,932][25689] Avg episode reward: [(0, '-45.385')] [2022-07-09 17:08:55,219][26022] Updated weights on worker 0-0, policy_version 341712 (0.00094) [2022-07-09 17:08:57,260][26022] Updated weights on worker 0-0, policy_version 341722 (0.00094) [2022-07-09 17:08:58,965][25689] Fps is (10 sec: 5599.0, 60 sec: 5608.3, 300 sec: 5644.1). Total num frames: 349932544. Throughput: 0: 5926.7. Samples: 349934250. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:08:58,966][25689] Avg episode reward: [(0, '-46.353')] [2022-07-09 17:08:58,986][26022] Updated weights on worker 0-0, policy_version 341732 (0.00093) [2022-07-09 17:09:00,867][26022] Updated weights on worker 0-0, policy_version 341742 (0.00085) [2022-07-09 17:09:02,879][26022] Updated weights on worker 0-0, policy_version 341752 (0.00089) [2022-07-09 17:09:04,009][25689] Fps is (10 sec: 5588.4, 60 sec: 5641.0, 300 sec: 5647.2). Total num frames: 349960192. Throughput: 0: 5810.4. Samples: 349966336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:04,011][25689] Avg episode reward: [(0, '-47.159')] [2022-07-09 17:09:04,815][26022] Updated weights on worker 0-0, policy_version 341762 (0.00089) [2022-07-09 17:09:06,480][26022] Updated weights on worker 0-0, policy_version 341772 (0.00090) [2022-07-09 17:09:08,468][26022] Updated weights on worker 0-0, policy_version 341782 (0.00086) [2022-07-09 17:09:09,048][25689] Fps is (10 sec: 5585.6, 60 sec: 5640.5, 300 sec: 5644.4). Total num frames: 349988864. Throughput: 0: 4973.3. Samples: 349983570. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:09,049][25689] Avg episode reward: [(0, '-47.383')] [2022-07-09 17:09:09,879][26022] Updated weights on worker 0-0, policy_version 341792 (0.00079) [2022-07-09 17:09:11,971][26022] Updated weights on worker 0-0, policy_version 341802 (0.00081) [2022-07-09 17:09:13,691][26022] Updated weights on worker 0-0, policy_version 341812 (0.00093) [2022-07-09 17:09:14,120][25689] Fps is (10 sec: 5671.3, 60 sec: 5618.4, 300 sec: 5644.1). Total num frames: 350017536. Throughput: 0: 5846.4. Samples: 350018048. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:14,120][25689] Avg episode reward: [(0, '-48.087')] [2022-07-09 17:09:15,462][26022] Updated weights on worker 0-0, policy_version 341822 (0.00094) [2022-07-09 17:09:15,891][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:09:15,901][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000341824_350027776.pth [2022-07-09 17:09:15,907][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000339838_347994112.pth [2022-07-09 17:09:17,190][26022] Updated weights on worker 0-0, policy_version 341832 (0.00088) [2022-07-09 17:09:19,140][26022] Updated weights on worker 0-0, policy_version 341842 (0.00097) [2022-07-09 17:09:19,229][25689] Fps is (10 sec: 5632.5, 60 sec: 5634.2, 300 sec: 5642.1). Total num frames: 350046208. Throughput: 0: 5819.7. Samples: 350052180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:19,230][25689] Avg episode reward: [(0, '-48.409')] [2022-07-09 17:09:20,767][26022] Updated weights on worker 0-0, policy_version 341852 (0.00100) [2022-07-09 17:09:22,895][26022] Updated weights on worker 0-0, policy_version 341862 (0.00094) [2022-07-09 17:09:24,175][26022] Updated weights on worker 0-0, policy_version 341872 (0.00092) [2022-07-09 17:09:24,272][25689] Fps is (10 sec: 5850.1, 60 sec: 5669.9, 300 sec: 5655.3). Total num frames: 350076928. Throughput: 0: 5075.1. Samples: 350069176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:24,273][25689] Avg episode reward: [(0, '-48.536')] [2022-07-09 17:09:26,374][26022] Updated weights on worker 0-0, policy_version 341882 (0.00085) [2022-07-09 17:09:28,212][26022] Updated weights on worker 0-0, policy_version 341892 (0.00089) [2022-07-09 17:09:29,352][25689] Fps is (10 sec: 5664.9, 60 sec: 5630.8, 300 sec: 5640.6). Total num frames: 350103552. Throughput: 0: 5902.3. Samples: 350103408. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:29,353][25689] Avg episode reward: [(0, '-48.321')] [2022-07-09 17:09:29,990][26022] Updated weights on worker 0-0, policy_version 341902 (0.00099) [2022-07-09 17:09:31,654][26022] Updated weights on worker 0-0, policy_version 341912 (0.00086) [2022-07-09 17:09:33,664][26022] Updated weights on worker 0-0, policy_version 341922 (0.00092) [2022-07-09 17:09:34,432][25689] Fps is (10 sec: 5543.3, 60 sec: 5641.2, 300 sec: 5644.6). Total num frames: 350133248. Throughput: 0: 5875.6. Samples: 350137394. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:34,433][25689] Avg episode reward: [(0, '-47.559')] [2022-07-09 17:09:35,340][26022] Updated weights on worker 0-0, policy_version 341932 (0.00087) [2022-07-09 17:09:37,298][26022] Updated weights on worker 0-0, policy_version 341942 (0.00081) [2022-07-09 17:09:38,863][26022] Updated weights on worker 0-0, policy_version 341952 (0.00357) [2022-07-09 17:09:39,502][25689] Fps is (10 sec: 5649.5, 60 sec: 5656.3, 300 sec: 5644.5). Total num frames: 350160896. Throughput: 0: 5043.9. Samples: 350154434. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:39,503][25689] Avg episode reward: [(0, '-48.302')] [2022-07-09 17:09:40,831][26022] Updated weights on worker 0-0, policy_version 341962 (0.00087) [2022-07-09 17:09:42,454][26022] Updated weights on worker 0-0, policy_version 341972 (0.00084) [2022-07-09 17:09:44,500][26022] Updated weights on worker 0-0, policy_version 341982 (0.00090) [2022-07-09 17:09:44,553][25689] Fps is (10 sec: 5564.7, 60 sec: 5636.2, 300 sec: 5640.4). Total num frames: 350189568. Throughput: 0: 5865.0. Samples: 350188124. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:44,554][25689] Avg episode reward: [(0, '-48.087')] [2022-07-09 17:09:46,168][26022] Updated weights on worker 0-0, policy_version 341992 (0.00084) [2022-07-09 17:09:48,119][26022] Updated weights on worker 0-0, policy_version 342002 (0.00085) [2022-07-09 17:09:49,570][25689] Fps is (10 sec: 5797.3, 60 sec: 5654.8, 300 sec: 5644.6). Total num frames: 350219264. Throughput: 0: 5891.6. Samples: 350222526. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:49,571][25689] Avg episode reward: [(0, '-47.479')] [2022-07-09 17:09:49,616][26022] Updated weights on worker 0-0, policy_version 342012 (0.00088) [2022-07-09 17:09:51,859][26022] Updated weights on worker 0-0, policy_version 342022 (0.00093) [2022-07-09 17:09:53,167][26022] Updated weights on worker 0-0, policy_version 342032 (0.00106) [2022-07-09 17:09:54,601][25689] Fps is (10 sec: 5707.2, 60 sec: 5654.2, 300 sec: 5639.8). Total num frames: 350246912. Throughput: 0: 5925.5. Samples: 350256902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:54,601][25689] Avg episode reward: [(0, '-47.329')] [2022-07-09 17:09:55,389][26022] Updated weights on worker 0-0, policy_version 342042 (0.00086) [2022-07-09 17:09:57,007][26022] Updated weights on worker 0-0, policy_version 342052 (0.00091) [2022-07-09 17:09:58,877][26022] Updated weights on worker 0-0, policy_version 342062 (0.00094) [2022-07-09 17:09:59,653][25689] Fps is (10 sec: 5585.9, 60 sec: 5652.6, 300 sec: 5650.6). Total num frames: 350275584. Throughput: 0: 5928.2. Samples: 350273890. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:09:59,653][25689] Avg episode reward: [(0, '-47.669')] [2022-07-09 17:10:00,573][26022] Updated weights on worker 0-0, policy_version 342072 (0.00083) [2022-07-09 17:10:02,938][26022] Updated weights on worker 0-0, policy_version 342082 (0.00093) [2022-07-09 17:10:04,491][26022] Updated weights on worker 0-0, policy_version 342092 (0.00090) [2022-07-09 17:10:04,682][25689] Fps is (10 sec: 5688.1, 60 sec: 5670.7, 300 sec: 5651.0). Total num frames: 350304256. Throughput: 0: 5859.3. Samples: 350306066. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:04,683][25689] Avg episode reward: [(0, '-47.288')] [2022-07-09 17:10:06,496][26022] Updated weights on worker 0-0, policy_version 342102 (0.00091) [2022-07-09 17:10:08,039][26022] Updated weights on worker 0-0, policy_version 342112 (0.00091) [2022-07-09 17:10:09,761][25689] Fps is (10 sec: 5470.3, 60 sec: 5633.3, 300 sec: 5639.3). Total num frames: 350330880. Throughput: 0: 5831.1. Samples: 350340262. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:09,763][25689] Avg episode reward: [(0, '-47.640')] [2022-07-09 17:10:10,007][26022] Updated weights on worker 0-0, policy_version 342122 (0.00106) [2022-07-09 17:10:11,743][26022] Updated weights on worker 0-0, policy_version 342132 (0.00087) [2022-07-09 17:10:13,626][26022] Updated weights on worker 0-0, policy_version 342142 (0.00088) [2022-07-09 17:10:14,774][25689] Fps is (10 sec: 5581.0, 60 sec: 5655.7, 300 sec: 5646.8). Total num frames: 350360576. Throughput: 0: 4972.9. Samples: 350357220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:14,775][25689] Avg episode reward: [(0, '-47.720')] [2022-07-09 17:10:15,353][26022] Updated weights on worker 0-0, policy_version 342152 (0.00089) [2022-07-09 17:10:17,352][26022] Updated weights on worker 0-0, policy_version 342162 (0.00093) [2022-07-09 17:10:18,878][26022] Updated weights on worker 0-0, policy_version 342172 (0.00090) [2022-07-09 17:10:19,894][25689] Fps is (10 sec: 5760.3, 60 sec: 5654.7, 300 sec: 5645.0). Total num frames: 350389248. Throughput: 0: 5799.9. Samples: 350391286. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:19,894][25689] Avg episode reward: [(0, '-48.398')] [2022-07-09 17:10:21,019][26022] Updated weights on worker 0-0, policy_version 342182 (0.00092) [2022-07-09 17:10:22,564][26022] Updated weights on worker 0-0, policy_version 342192 (0.00087) [2022-07-09 17:10:24,547][26022] Updated weights on worker 0-0, policy_version 342202 (0.00093) [2022-07-09 17:10:24,932][25689] Fps is (10 sec: 5443.3, 60 sec: 5587.6, 300 sec: 5637.8). Total num frames: 350415872. Throughput: 0: 5875.0. Samples: 350425036. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:24,933][25689] Avg episode reward: [(0, '-48.007')] [2022-07-09 17:10:26,249][26022] Updated weights on worker 0-0, policy_version 342212 (0.00091) [2022-07-09 17:10:28,170][26022] Updated weights on worker 0-0, policy_version 342222 (0.00084) [2022-07-09 17:10:29,846][26022] Updated weights on worker 0-0, policy_version 342232 (0.00085) [2022-07-09 17:10:29,941][25689] Fps is (10 sec: 5605.8, 60 sec: 5644.9, 300 sec: 5641.4). Total num frames: 350445568. Throughput: 0: 5040.0. Samples: 350441966. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:29,941][25689] Avg episode reward: [(0, '-48.025')] [2022-07-09 17:10:31,646][26022] Updated weights on worker 0-0, policy_version 342242 (0.00090) [2022-07-09 17:10:33,642][26022] Updated weights on worker 0-0, policy_version 342252 (0.00088) [2022-07-09 17:10:34,963][25689] Fps is (10 sec: 5819.1, 60 sec: 5633.4, 300 sec: 5642.0). Total num frames: 350474240. Throughput: 0: 5888.0. Samples: 350476092. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 17:10:34,964][25689] Avg episode reward: [(0, '-47.568')] [2022-07-09 17:10:35,427][26022] Updated weights on worker 0-0, policy_version 342262 (0.00088) [2022-07-09 17:10:37,217][26022] Updated weights on worker 0-0, policy_version 342272 (0.00084) [2022-07-09 17:10:38,946][26022] Updated weights on worker 0-0, policy_version 342282 (0.00085) [2022-07-09 17:10:40,017][25689] Fps is (10 sec: 5691.0, 60 sec: 5651.8, 300 sec: 5641.5). Total num frames: 350502912. Throughput: 0: 5896.5. Samples: 350509940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:10:40,017][25689] Avg episode reward: [(0, '-47.378')] [2022-07-09 17:10:40,923][26022] Updated weights on worker 0-0, policy_version 342292 (0.00085) [2022-07-09 17:10:42,660][26022] Updated weights on worker 0-0, policy_version 342302 (0.00087) [2022-07-09 17:10:44,376][26022] Updated weights on worker 0-0, policy_version 342312 (0.00091) [2022-07-09 17:10:45,031][25689] Fps is (10 sec: 5492.3, 60 sec: 5621.4, 300 sec: 5638.0). Total num frames: 350529536. Throughput: 0: 5069.7. Samples: 350526928. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:10:45,031][25689] Avg episode reward: [(0, '-46.848')] [2022-07-09 17:10:46,140][26022] Updated weights on worker 0-0, policy_version 342322 (0.00090) [2022-07-09 17:10:48,136][26022] Updated weights on worker 0-0, policy_version 342332 (0.00085) [2022-07-09 17:10:49,927][26022] Updated weights on worker 0-0, policy_version 342342 (0.00093) [2022-07-09 17:10:50,051][25689] Fps is (10 sec: 5510.6, 60 sec: 5604.1, 300 sec: 5638.1). Total num frames: 350558208. Throughput: 0: 5929.8. Samples: 350561218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:10:50,052][25689] Avg episode reward: [(0, '-47.541')] [2022-07-09 17:10:51,704][26022] Updated weights on worker 0-0, policy_version 342352 (0.00084) [2022-07-09 17:10:53,551][26022] Updated weights on worker 0-0, policy_version 342362 (0.00093) [2022-07-09 17:10:55,064][25689] Fps is (10 sec: 5817.5, 60 sec: 5639.7, 300 sec: 5638.9). Total num frames: 350587904. Throughput: 0: 5915.5. Samples: 350595000. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:10:55,064][25689] Avg episode reward: [(0, '-46.796')] [2022-07-09 17:10:55,232][26022] Updated weights on worker 0-0, policy_version 342372 (0.00089) [2022-07-09 17:10:57,228][26022] Updated weights on worker 0-0, policy_version 342382 (0.00089) [2022-07-09 17:10:58,849][26022] Updated weights on worker 0-0, policy_version 342392 (0.00089) [2022-07-09 17:11:00,188][25689] Fps is (10 sec: 5758.3, 60 sec: 5633.0, 300 sec: 5647.6). Total num frames: 350616576. Throughput: 0: 5060.1. Samples: 350612006. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:00,188][25689] Avg episode reward: [(0, '-47.567')] [2022-07-09 17:11:00,834][26022] Updated weights on worker 0-0, policy_version 342402 (0.00084) [2022-07-09 17:11:02,656][26022] Updated weights on worker 0-0, policy_version 342412 (0.00096) [2022-07-09 17:11:04,695][26022] Updated weights on worker 0-0, policy_version 342422 (0.00088) [2022-07-09 17:11:05,195][25689] Fps is (10 sec: 5357.0, 60 sec: 5584.3, 300 sec: 5641.1). Total num frames: 350642176. Throughput: 0: 5806.9. Samples: 350644020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:05,195][25689] Avg episode reward: [(0, '-46.614')] [2022-07-09 17:11:06,442][26022] Updated weights on worker 0-0, policy_version 342432 (0.00088) [2022-07-09 17:11:08,326][26022] Updated weights on worker 0-0, policy_version 342442 (0.00090) [2022-07-09 17:11:10,152][26022] Updated weights on worker 0-0, policy_version 342452 (0.00092) [2022-07-09 17:11:10,207][25689] Fps is (10 sec: 5519.2, 60 sec: 5641.3, 300 sec: 5641.2). Total num frames: 350671872. Throughput: 0: 5801.5. Samples: 350678148. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:10,207][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 17:11:11,897][26022] Updated weights on worker 0-0, policy_version 342462 (0.00091) [2022-07-09 17:11:13,586][26022] Updated weights on worker 0-0, policy_version 342472 (0.00092) [2022-07-09 17:11:15,229][25689] Fps is (10 sec: 5612.9, 60 sec: 5589.6, 300 sec: 5634.9). Total num frames: 350698496. Throughput: 0: 4953.4. Samples: 350694884. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:15,229][25689] Avg episode reward: [(0, '-45.802')] [2022-07-09 17:11:15,659][26022] Updated weights on worker 0-0, policy_version 342482 (0.00088) [2022-07-09 17:11:15,979][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:11:15,997][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000342484_350703616.pth [2022-07-09 17:11:15,998][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000340501_348673024.pth [2022-07-09 17:11:17,268][26022] Updated weights on worker 0-0, policy_version 342492 (0.00088) [2022-07-09 17:11:19,216][26022] Updated weights on worker 0-0, policy_version 342502 (0.00088) [2022-07-09 17:11:20,316][25689] Fps is (10 sec: 5570.9, 60 sec: 5609.6, 300 sec: 5636.9). Total num frames: 350728192. Throughput: 0: 5811.3. Samples: 350728978. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:20,317][25689] Avg episode reward: [(0, '-46.265')] [2022-07-09 17:11:21,033][26022] Updated weights on worker 0-0, policy_version 342512 (0.00092) [2022-07-09 17:11:22,634][26022] Updated weights on worker 0-0, policy_version 342522 (0.00082) [2022-07-09 17:11:24,561][26022] Updated weights on worker 0-0, policy_version 342532 (0.00086) [2022-07-09 17:11:25,322][25689] Fps is (10 sec: 5782.7, 60 sec: 5646.5, 300 sec: 5637.0). Total num frames: 350756864. Throughput: 0: 5922.6. Samples: 350763228. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:25,323][25689] Avg episode reward: [(0, '-47.486')] [2022-07-09 17:11:26,412][26022] Updated weights on worker 0-0, policy_version 342542 (0.00087) [2022-07-09 17:11:28,104][26022] Updated weights on worker 0-0, policy_version 342552 (0.00088) [2022-07-09 17:11:30,103][26022] Updated weights on worker 0-0, policy_version 342562 (0.00095) [2022-07-09 17:11:30,358][25689] Fps is (10 sec: 5608.3, 60 sec: 5610.0, 300 sec: 5633.0). Total num frames: 350784512. Throughput: 0: 5069.5. Samples: 350780308. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:30,359][25689] Avg episode reward: [(0, '-47.749')] [2022-07-09 17:11:31,901][26022] Updated weights on worker 0-0, policy_version 342572 (0.00093) [2022-07-09 17:11:33,594][26022] Updated weights on worker 0-0, policy_version 342582 (0.00086) [2022-07-09 17:11:35,388][25689] Fps is (10 sec: 5595.5, 60 sec: 5609.3, 300 sec: 5636.9). Total num frames: 350813184. Throughput: 0: 5918.4. Samples: 350814192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:35,388][25689] Avg episode reward: [(0, '-48.263')] [2022-07-09 17:11:35,624][26022] Updated weights on worker 0-0, policy_version 342592 (0.00089) [2022-07-09 17:11:37,071][26022] Updated weights on worker 0-0, policy_version 342602 (0.00089) [2022-07-09 17:11:39,211][26022] Updated weights on worker 0-0, policy_version 342612 (0.00105) [2022-07-09 17:11:40,522][25689] Fps is (10 sec: 5642.3, 60 sec: 5601.9, 300 sec: 5634.5). Total num frames: 350841856. Throughput: 0: 5911.8. Samples: 350848428. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:40,522][25689] Avg episode reward: [(0, '-48.778')] [2022-07-09 17:11:40,939][26022] Updated weights on worker 0-0, policy_version 342622 (0.00083) [2022-07-09 17:11:42,364][26022] Updated weights on worker 0-0, policy_version 342632 (0.00086) [2022-07-09 17:11:44,311][26022] Updated weights on worker 0-0, policy_version 342642 (0.00092) [2022-07-09 17:11:45,555][25689] Fps is (10 sec: 5841.7, 60 sec: 5667.8, 300 sec: 5644.8). Total num frames: 350872576. Throughput: 0: 5063.9. Samples: 350865682. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:45,555][25689] Avg episode reward: [(0, '-48.235')] [2022-07-09 17:11:46,117][26022] Updated weights on worker 0-0, policy_version 342652 (0.00089) [2022-07-09 17:11:47,784][26022] Updated weights on worker 0-0, policy_version 342662 (0.00087) [2022-07-09 17:11:50,019][26022] Updated weights on worker 0-0, policy_version 342672 (0.00096) [2022-07-09 17:11:50,623][25689] Fps is (10 sec: 5778.5, 60 sec: 5646.5, 300 sec: 5636.7). Total num frames: 350900224. Throughput: 0: 5905.9. Samples: 350899986. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:50,623][25689] Avg episode reward: [(0, '-47.819')] [2022-07-09 17:11:51,355][26022] Updated weights on worker 0-0, policy_version 342682 (0.00089) [2022-07-09 17:11:53,512][26022] Updated weights on worker 0-0, policy_version 342692 (0.00085) [2022-07-09 17:11:55,271][26022] Updated weights on worker 0-0, policy_version 342702 (0.00095) [2022-07-09 17:11:55,635][25689] Fps is (10 sec: 5586.9, 60 sec: 5629.6, 300 sec: 5641.3). Total num frames: 350928896. Throughput: 0: 5923.9. Samples: 350934138. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:11:55,636][25689] Avg episode reward: [(0, '-47.150')] [2022-07-09 17:11:56,836][26022] Updated weights on worker 0-0, policy_version 342712 (0.00087) [2022-07-09 17:11:58,828][26022] Updated weights on worker 0-0, policy_version 342722 (0.00093) [2022-07-09 17:12:00,508][26022] Updated weights on worker 0-0, policy_version 342732 (0.00092) [2022-07-09 17:12:00,732][25689] Fps is (10 sec: 5672.4, 60 sec: 5632.1, 300 sec: 5646.5). Total num frames: 350957568. Throughput: 0: 5085.5. Samples: 350951208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:00,734][25689] Avg episode reward: [(0, '-47.363')] [2022-07-09 17:12:02,916][26022] Updated weights on worker 0-0, policy_version 342742 (0.00085) [2022-07-09 17:12:04,539][26022] Updated weights on worker 0-0, policy_version 342752 (0.00088) [2022-07-09 17:12:05,741][25689] Fps is (10 sec: 5370.4, 60 sec: 5631.9, 300 sec: 5636.6). Total num frames: 350983168. Throughput: 0: 5811.1. Samples: 350982988. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:05,742][25689] Avg episode reward: [(0, '-47.453')] [2022-07-09 17:12:06,334][26022] Updated weights on worker 0-0, policy_version 342762 (0.00085) [2022-07-09 17:12:08,231][26022] Updated weights on worker 0-0, policy_version 342772 (0.00096) [2022-07-09 17:12:09,977][26022] Updated weights on worker 0-0, policy_version 342782 (0.00084) [2022-07-09 17:12:10,759][25689] Fps is (10 sec: 5616.6, 60 sec: 5648.2, 300 sec: 5640.0). Total num frames: 351013888. Throughput: 0: 5821.3. Samples: 351017208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:10,761][25689] Avg episode reward: [(0, '-47.253')] [2022-07-09 17:12:11,871][26022] Updated weights on worker 0-0, policy_version 342792 (0.00563) [2022-07-09 17:12:13,556][26022] Updated weights on worker 0-0, policy_version 342802 (0.00100) [2022-07-09 17:12:15,309][26022] Updated weights on worker 0-0, policy_version 342812 (0.00104) [2022-07-09 17:12:15,802][25689] Fps is (10 sec: 5801.7, 60 sec: 5663.3, 300 sec: 5634.4). Total num frames: 351041536. Throughput: 0: 4979.4. Samples: 351034554. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:15,802][25689] Avg episode reward: [(0, '-47.038')] [2022-07-09 17:12:17,137][26022] Updated weights on worker 0-0, policy_version 342822 (0.00087) [2022-07-09 17:12:18,979][26022] Updated weights on worker 0-0, policy_version 342832 (0.00083) [2022-07-09 17:12:20,808][26022] Updated weights on worker 0-0, policy_version 342842 (0.00094) [2022-07-09 17:12:20,901][25689] Fps is (10 sec: 5654.5, 60 sec: 5662.2, 300 sec: 5643.0). Total num frames: 351071232. Throughput: 0: 5840.4. Samples: 351069000. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:20,901][25689] Avg episode reward: [(0, '-46.137')] [2022-07-09 17:12:22,580][26022] Updated weights on worker 0-0, policy_version 342852 (0.00087) [2022-07-09 17:12:24,267][26022] Updated weights on worker 0-0, policy_version 342862 (0.00086) [2022-07-09 17:12:25,961][25689] Fps is (10 sec: 5745.3, 60 sec: 5657.1, 300 sec: 5643.0). Total num frames: 351099904. Throughput: 0: 5930.1. Samples: 351102894. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:25,962][25689] Avg episode reward: [(0, '-46.393')] [2022-07-09 17:12:26,179][26022] Updated weights on worker 0-0, policy_version 342872 (0.00096) [2022-07-09 17:12:28,040][26022] Updated weights on worker 0-0, policy_version 342882 (0.00090) [2022-07-09 17:12:29,949][26022] Updated weights on worker 0-0, policy_version 342892 (0.00089) [2022-07-09 17:12:30,978][25689] Fps is (10 sec: 5690.5, 60 sec: 5675.8, 300 sec: 5636.5). Total num frames: 351128576. Throughput: 0: 5918.9. Samples: 351136878. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:30,979][25689] Avg episode reward: [(0, '-47.199')] [2022-07-09 17:12:31,618][26022] Updated weights on worker 0-0, policy_version 342902 (0.00086) [2022-07-09 17:12:33,264][26022] Updated weights on worker 0-0, policy_version 342912 (0.00091) [2022-07-09 17:12:35,343][26022] Updated weights on worker 0-0, policy_version 342922 (0.00086) [2022-07-09 17:12:36,030][25689] Fps is (10 sec: 5593.2, 60 sec: 5656.7, 300 sec: 5636.5). Total num frames: 351156224. Throughput: 0: 5905.1. Samples: 351154006. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:36,032][25689] Avg episode reward: [(0, '-47.259')] [2022-07-09 17:12:36,891][26022] Updated weights on worker 0-0, policy_version 342932 (0.00097) [2022-07-09 17:12:38,918][26022] Updated weights on worker 0-0, policy_version 342942 (0.00093) [2022-07-09 17:12:40,676][26022] Updated weights on worker 0-0, policy_version 342952 (0.00087) [2022-07-09 17:12:41,096][25689] Fps is (10 sec: 5565.8, 60 sec: 5663.1, 300 sec: 5636.6). Total num frames: 351184896. Throughput: 0: 5901.4. Samples: 351188184. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:41,098][25689] Avg episode reward: [(0, '-47.094')] [2022-07-09 17:12:42,384][26022] Updated weights on worker 0-0, policy_version 342962 (0.00086) [2022-07-09 17:12:44,225][26022] Updated weights on worker 0-0, policy_version 342972 (0.00094) [2022-07-09 17:12:45,633][26022] Updated weights on worker 0-0, policy_version 342982 (0.01098) [2022-07-09 17:12:46,108][25689] Fps is (10 sec: 5792.0, 60 sec: 5648.2, 300 sec: 5638.3). Total num frames: 351214592. Throughput: 0: 5951.9. Samples: 351222806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:46,109][25689] Avg episode reward: [(0, '-47.461')] [2022-07-09 17:12:47,789][26022] Updated weights on worker 0-0, policy_version 342992 (0.00086) [2022-07-09 17:12:49,521][26022] Updated weights on worker 0-0, policy_version 343002 (0.00080) [2022-07-09 17:12:51,131][25689] Fps is (10 sec: 5816.9, 60 sec: 5669.3, 300 sec: 5638.0). Total num frames: 351243264. Throughput: 0: 5130.5. Samples: 351240274. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:51,132][25689] Avg episode reward: [(0, '-47.257')] [2022-07-09 17:12:51,239][26022] Updated weights on worker 0-0, policy_version 343012 (0.00084) [2022-07-09 17:12:52,959][26022] Updated weights on worker 0-0, policy_version 343022 (0.00083) [2022-07-09 17:12:54,834][26022] Updated weights on worker 0-0, policy_version 343032 (0.00081) [2022-07-09 17:12:56,142][25689] Fps is (10 sec: 5714.9, 60 sec: 5669.5, 300 sec: 5639.3). Total num frames: 351271936. Throughput: 0: 5989.2. Samples: 351274458. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:12:56,143][25689] Avg episode reward: [(0, '-46.822')] [2022-07-09 17:12:56,548][26022] Updated weights on worker 0-0, policy_version 343042 (0.00082) [2022-07-09 17:12:58,573][26022] Updated weights on worker 0-0, policy_version 343052 (0.00080) [2022-07-09 17:13:00,023][26022] Updated weights on worker 0-0, policy_version 343062 (0.00116) [2022-07-09 17:13:01,255][25689] Fps is (10 sec: 5664.2, 60 sec: 5668.0, 300 sec: 5648.1). Total num frames: 351300608. Throughput: 0: 5992.8. Samples: 351308988. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:13:01,255][25689] Avg episode reward: [(0, '-45.972')] [2022-07-09 17:13:02,336][26022] Updated weights on worker 0-0, policy_version 343072 (0.00088) [2022-07-09 17:13:04,371][26022] Updated weights on worker 0-0, policy_version 343082 (0.00094) [2022-07-09 17:13:05,655][26022] Updated weights on worker 0-0, policy_version 343092 (0.00088) [2022-07-09 17:13:06,297][25689] Fps is (10 sec: 5646.7, 60 sec: 5715.6, 300 sec: 5647.9). Total num frames: 351329280. Throughput: 0: 5013.1. Samples: 351324018. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 17:13:06,298][25689] Avg episode reward: [(0, '-45.914')] [2022-07-09 17:13:08,100][26022] Updated weights on worker 0-0, policy_version 343102 (0.00087) [2022-07-09 17:13:09,319][26022] Updated weights on worker 0-0, policy_version 343112 (0.00096) [2022-07-09 17:13:11,302][25689] Fps is (10 sec: 5503.5, 60 sec: 5649.2, 300 sec: 5637.8). Total num frames: 351355904. Throughput: 0: 5852.9. Samples: 351358334. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:11,303][25689] Avg episode reward: [(0, '-45.868')] [2022-07-09 17:13:11,352][26022] Updated weights on worker 0-0, policy_version 343122 (0.00089) [2022-07-09 17:13:13,051][26022] Updated weights on worker 0-0, policy_version 343132 (0.00084) [2022-07-09 17:13:14,756][26022] Updated weights on worker 0-0, policy_version 343142 (0.00086) [2022-07-09 17:13:16,119][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:13:16,133][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000343148_351383552.pth [2022-07-09 17:13:16,134][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000341164_349351936.pth [2022-07-09 17:13:16,341][25689] Fps is (10 sec: 5505.3, 60 sec: 5666.4, 300 sec: 5642.4). Total num frames: 351384576. Throughput: 0: 5847.4. Samples: 351392572. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:16,345][25689] Avg episode reward: [(0, '-45.829')] [2022-07-09 17:13:16,859][26022] Updated weights on worker 0-0, policy_version 343152 (0.00088) [2022-07-09 17:13:18,365][26022] Updated weights on worker 0-0, policy_version 343162 (0.00090) [2022-07-09 17:13:20,337][26022] Updated weights on worker 0-0, policy_version 343172 (0.01073) [2022-07-09 17:13:21,466][25689] Fps is (10 sec: 5843.5, 60 sec: 5680.9, 300 sec: 5648.1). Total num frames: 351415296. Throughput: 0: 4984.1. Samples: 351409722. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:21,466][25689] Avg episode reward: [(0, '-46.124')] [2022-07-09 17:13:22,066][26022] Updated weights on worker 0-0, policy_version 343182 (0.00086) [2022-07-09 17:13:23,662][26022] Updated weights on worker 0-0, policy_version 343192 (0.00086) [2022-07-09 17:13:26,053][26022] Updated weights on worker 0-0, policy_version 343202 (0.00094) [2022-07-09 17:13:26,506][25689] Fps is (10 sec: 5742.4, 60 sec: 5665.9, 300 sec: 5644.3). Total num frames: 351442944. Throughput: 0: 5926.4. Samples: 351443782. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:26,506][25689] Avg episode reward: [(0, '-46.169')] [2022-07-09 17:13:27,519][26022] Updated weights on worker 0-0, policy_version 343212 (0.00085) [2022-07-09 17:13:29,396][26022] Updated weights on worker 0-0, policy_version 343222 (0.00091) [2022-07-09 17:13:31,189][26022] Updated weights on worker 0-0, policy_version 343232 (0.00090) [2022-07-09 17:13:31,531][25689] Fps is (10 sec: 5392.2, 60 sec: 5631.3, 300 sec: 5637.2). Total num frames: 351469568. Throughput: 0: 5900.5. Samples: 351477694. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:31,531][25689] Avg episode reward: [(0, '-46.875')] [2022-07-09 17:13:32,829][26022] Updated weights on worker 0-0, policy_version 343242 (0.00090) [2022-07-09 17:13:35,021][26022] Updated weights on worker 0-0, policy_version 343252 (0.00092) [2022-07-09 17:13:36,430][26022] Updated weights on worker 0-0, policy_version 343262 (0.00095) [2022-07-09 17:13:36,535][25689] Fps is (10 sec: 5717.6, 60 sec: 5686.6, 300 sec: 5651.8). Total num frames: 351500288. Throughput: 0: 5059.5. Samples: 351494742. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:36,536][25689] Avg episode reward: [(0, '-46.702')] [2022-07-09 17:13:38,418][26022] Updated weights on worker 0-0, policy_version 343272 (0.00091) [2022-07-09 17:13:40,518][26022] Updated weights on worker 0-0, policy_version 343282 (0.00091) [2022-07-09 17:13:41,668][25689] Fps is (10 sec: 5959.6, 60 sec: 5697.2, 300 sec: 5649.6). Total num frames: 351529984. Throughput: 0: 5893.2. Samples: 351528780. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:41,669][25689] Avg episode reward: [(0, '-46.090')] [2022-07-09 17:13:41,884][26022] Updated weights on worker 0-0, policy_version 343292 (0.00088) [2022-07-09 17:13:44,082][26022] Updated weights on worker 0-0, policy_version 343302 (0.00085) [2022-07-09 17:13:45,399][26022] Updated weights on worker 0-0, policy_version 343312 (0.00090) [2022-07-09 17:13:46,710][25689] Fps is (10 sec: 5434.7, 60 sec: 5626.8, 300 sec: 5639.2). Total num frames: 351555584. Throughput: 0: 5904.3. Samples: 351563072. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:46,710][25689] Avg episode reward: [(0, '-45.803')] [2022-07-09 17:13:47,505][26022] Updated weights on worker 0-0, policy_version 343322 (0.00090) [2022-07-09 17:13:49,043][26022] Updated weights on worker 0-0, policy_version 343332 (0.00090) [2022-07-09 17:13:51,049][26022] Updated weights on worker 0-0, policy_version 343342 (0.00093) [2022-07-09 17:13:51,787][25689] Fps is (10 sec: 5566.0, 60 sec: 5655.5, 300 sec: 5648.5). Total num frames: 351586304. Throughput: 0: 5056.2. Samples: 351580114. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:51,787][25689] Avg episode reward: [(0, '-46.146')] [2022-07-09 17:13:52,850][26022] Updated weights on worker 0-0, policy_version 343352 (0.00092) [2022-07-09 17:13:54,665][26022] Updated weights on worker 0-0, policy_version 343362 (0.00088) [2022-07-09 17:13:56,423][26022] Updated weights on worker 0-0, policy_version 343372 (0.00086) [2022-07-09 17:13:56,837][25689] Fps is (10 sec: 5864.6, 60 sec: 5651.9, 300 sec: 5648.2). Total num frames: 351614976. Throughput: 0: 5888.1. Samples: 351614280. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:13:56,837][25689] Avg episode reward: [(0, '-46.157')] [2022-07-09 17:13:58,239][26022] Updated weights on worker 0-0, policy_version 343382 (0.00086) [2022-07-09 17:13:59,907][26022] Updated weights on worker 0-0, policy_version 343392 (0.00086) [2022-07-09 17:14:01,883][25689] Fps is (10 sec: 5578.5, 60 sec: 5641.2, 300 sec: 5648.1). Total num frames: 351642624. Throughput: 0: 5942.9. Samples: 351648912. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:01,883][25689] Avg episode reward: [(0, '-46.360')] [2022-07-09 17:14:02,138][26022] Updated weights on worker 0-0, policy_version 343402 (0.00096) [2022-07-09 17:14:03,951][26022] Updated weights on worker 0-0, policy_version 343412 (0.00088) [2022-07-09 17:14:05,756][26022] Updated weights on worker 0-0, policy_version 343422 (0.00086) [2022-07-09 17:14:06,983][25689] Fps is (10 sec: 5450.1, 60 sec: 5619.0, 300 sec: 5643.6). Total num frames: 351670272. Throughput: 0: 4970.8. Samples: 351663846. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:06,983][25689] Avg episode reward: [(0, '-46.437')] [2022-07-09 17:14:07,583][26022] Updated weights on worker 0-0, policy_version 343432 (0.00087) [2022-07-09 17:14:09,361][26022] Updated weights on worker 0-0, policy_version 343442 (0.00088) [2022-07-09 17:14:10,865][26022] Updated weights on worker 0-0, policy_version 343452 (0.00090) [2022-07-09 17:14:12,055][25689] Fps is (10 sec: 5637.5, 60 sec: 5663.4, 300 sec: 5647.0). Total num frames: 351699968. Throughput: 0: 5827.3. Samples: 351698220. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:12,055][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 17:14:12,990][26022] Updated weights on worker 0-0, policy_version 343462 (0.00099) [2022-07-09 17:14:14,466][26022] Updated weights on worker 0-0, policy_version 343472 (0.00079) [2022-07-09 17:14:16,371][26022] Updated weights on worker 0-0, policy_version 343482 (0.00086) [2022-07-09 17:14:17,088][25689] Fps is (10 sec: 5776.2, 60 sec: 5663.9, 300 sec: 5648.4). Total num frames: 351728640. Throughput: 0: 5854.1. Samples: 351732830. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:17,088][25689] Avg episode reward: [(0, '-46.617')] [2022-07-09 17:14:18,361][26022] Updated weights on worker 0-0, policy_version 343492 (0.00085) [2022-07-09 17:14:19,799][26022] Updated weights on worker 0-0, policy_version 343502 (0.00088) [2022-07-09 17:14:21,782][26022] Updated weights on worker 0-0, policy_version 343512 (0.00085) [2022-07-09 17:14:22,203][25689] Fps is (10 sec: 5852.5, 60 sec: 5664.8, 300 sec: 5647.1). Total num frames: 351759360. Throughput: 0: 5839.1. Samples: 351767562. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:22,204][25689] Avg episode reward: [(0, '-47.092')] [2022-07-09 17:14:23,588][26022] Updated weights on worker 0-0, policy_version 343522 (0.00087) [2022-07-09 17:14:25,200][26022] Updated weights on worker 0-0, policy_version 343532 (0.00085) [2022-07-09 17:14:27,208][25689] Fps is (10 sec: 5564.9, 60 sec: 5634.3, 300 sec: 5645.0). Total num frames: 351784960. Throughput: 0: 5970.7. Samples: 351784606. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:27,209][25689] Avg episode reward: [(0, '-47.130')] [2022-07-09 17:14:27,416][26022] Updated weights on worker 0-0, policy_version 343542 (0.00093) [2022-07-09 17:14:28,682][26022] Updated weights on worker 0-0, policy_version 343552 (0.00085) [2022-07-09 17:14:31,000][26022] Updated weights on worker 0-0, policy_version 343562 (0.00091) [2022-07-09 17:14:32,218][25689] Fps is (10 sec: 5623.4, 60 sec: 5703.2, 300 sec: 5649.8). Total num frames: 351815680. Throughput: 0: 5964.6. Samples: 351818486. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:32,219][25689] Avg episode reward: [(0, '-46.986')] [2022-07-09 17:14:32,518][26022] Updated weights on worker 0-0, policy_version 343572 (0.00088) [2022-07-09 17:14:34,386][26022] Updated weights on worker 0-0, policy_version 343582 (0.00086) [2022-07-09 17:14:36,505][26022] Updated weights on worker 0-0, policy_version 343592 (0.00090) [2022-07-09 17:14:37,282][25689] Fps is (10 sec: 5895.8, 60 sec: 5663.9, 300 sec: 5653.4). Total num frames: 351844352. Throughput: 0: 5928.8. Samples: 351852558. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:37,282][25689] Avg episode reward: [(0, '-46.575')] [2022-07-09 17:14:38,057][26022] Updated weights on worker 0-0, policy_version 343602 (0.00084) [2022-07-09 17:14:39,834][26022] Updated weights on worker 0-0, policy_version 343612 (0.00084) [2022-07-09 17:14:41,690][26022] Updated weights on worker 0-0, policy_version 343622 (0.00093) [2022-07-09 17:14:42,380][25689] Fps is (10 sec: 5542.0, 60 sec: 5633.4, 300 sec: 5649.0). Total num frames: 351872000. Throughput: 0: 5062.1. Samples: 351869704. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:42,381][25689] Avg episode reward: [(0, '-46.891')] [2022-07-09 17:14:43,292][26022] Updated weights on worker 0-0, policy_version 343632 (0.00087) [2022-07-09 17:14:45,287][26022] Updated weights on worker 0-0, policy_version 343642 (0.00097) [2022-07-09 17:14:46,986][26022] Updated weights on worker 0-0, policy_version 343652 (0.00086) [2022-07-09 17:14:47,387][25689] Fps is (10 sec: 5573.5, 60 sec: 5687.3, 300 sec: 5645.8). Total num frames: 351900672. Throughput: 0: 5910.5. Samples: 351903872. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:47,387][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 17:14:48,707][26022] Updated weights on worker 0-0, policy_version 343662 (0.00086) [2022-07-09 17:14:50,644][26022] Updated weights on worker 0-0, policy_version 343672 (0.00086) [2022-07-09 17:14:52,123][26022] Updated weights on worker 0-0, policy_version 343682 (0.00081) [2022-07-09 17:14:52,415][25689] Fps is (10 sec: 5816.6, 60 sec: 5675.0, 300 sec: 5652.7). Total num frames: 351930368. Throughput: 0: 5964.5. Samples: 351938952. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:52,416][25689] Avg episode reward: [(0, '-47.077')] [2022-07-09 17:14:54,231][26022] Updated weights on worker 0-0, policy_version 343692 (0.00081) [2022-07-09 17:14:55,819][26022] Updated weights on worker 0-0, policy_version 343702 (0.00086) [2022-07-09 17:14:57,473][25689] Fps is (10 sec: 5888.6, 60 sec: 5691.2, 300 sec: 5656.0). Total num frames: 351960064. Throughput: 0: 5134.4. Samples: 351956226. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:14:57,473][25689] Avg episode reward: [(0, '-47.210')] [2022-07-09 17:14:57,737][26022] Updated weights on worker 0-0, policy_version 343712 (0.00091) [2022-07-09 17:14:59,503][26022] Updated weights on worker 0-0, policy_version 343722 (0.00082) [2022-07-09 17:15:01,425][26022] Updated weights on worker 0-0, policy_version 343732 (0.00090) [2022-07-09 17:15:02,610][25689] Fps is (10 sec: 5424.2, 60 sec: 5648.9, 300 sec: 5643.7). Total num frames: 351985664. Throughput: 0: 5953.4. Samples: 351990136. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:02,610][25689] Avg episode reward: [(0, '-46.544')] [2022-07-09 17:15:03,230][26022] Updated weights on worker 0-0, policy_version 343742 (0.00091) [2022-07-09 17:15:05,361][26022] Updated weights on worker 0-0, policy_version 343752 (0.00092) [2022-07-09 17:15:06,995][26022] Updated weights on worker 0-0, policy_version 343762 (0.00089) [2022-07-09 17:15:07,645][25689] Fps is (10 sec: 5335.4, 60 sec: 5671.8, 300 sec: 5651.4). Total num frames: 352014336. Throughput: 0: 5842.3. Samples: 352022226. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:07,645][25689] Avg episode reward: [(0, '-45.548')] [2022-07-09 17:15:08,847][26022] Updated weights on worker 0-0, policy_version 343772 (0.00099) [2022-07-09 17:15:10,849][26022] Updated weights on worker 0-0, policy_version 343782 (0.00088) [2022-07-09 17:15:12,259][26022] Updated weights on worker 0-0, policy_version 343792 (0.00092) [2022-07-09 17:15:12,672][25689] Fps is (10 sec: 5800.5, 60 sec: 5676.0, 300 sec: 5651.1). Total num frames: 352044032. Throughput: 0: 4948.4. Samples: 352039192. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:12,673][25689] Avg episode reward: [(0, '-45.303')] [2022-07-09 17:15:14,446][26022] Updated weights on worker 0-0, policy_version 343802 (0.00084) [2022-07-09 17:15:16,015][26022] Updated weights on worker 0-0, policy_version 343812 (0.00087) [2022-07-09 17:15:16,258][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:15:16,270][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000343813_352064512.pth [2022-07-09 17:15:16,271][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000341824_350027776.pth [2022-07-09 17:15:17,696][25689] Fps is (10 sec: 5704.9, 60 sec: 5659.9, 300 sec: 5649.5). Total num frames: 352071680. Throughput: 0: 5802.7. Samples: 352073578. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:17,697][25689] Avg episode reward: [(0, '-44.987')] [2022-07-09 17:15:17,839][26022] Updated weights on worker 0-0, policy_version 343822 (0.00091) [2022-07-09 17:15:19,565][26022] Updated weights on worker 0-0, policy_version 343832 (0.00088) [2022-07-09 17:15:21,329][26022] Updated weights on worker 0-0, policy_version 343842 (0.00092) [2022-07-09 17:15:22,762][25689] Fps is (10 sec: 5683.4, 60 sec: 5647.6, 300 sec: 5659.3). Total num frames: 352101376. Throughput: 0: 5840.4. Samples: 352107834. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:22,764][25689] Avg episode reward: [(0, '-44.116')] [2022-07-09 17:15:23,378][26022] Updated weights on worker 0-0, policy_version 343852 (0.00089) [2022-07-09 17:15:24,931][26022] Updated weights on worker 0-0, policy_version 343862 (0.00091) [2022-07-09 17:15:26,655][26022] Updated weights on worker 0-0, policy_version 343872 (0.00078) [2022-07-09 17:15:27,770][25689] Fps is (10 sec: 5692.5, 60 sec: 5681.2, 300 sec: 5652.4). Total num frames: 352129024. Throughput: 0: 5112.0. Samples: 352125108. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:27,772][25689] Avg episode reward: [(0, '-44.514')] [2022-07-09 17:15:28,572][26022] Updated weights on worker 0-0, policy_version 343882 (0.00082) [2022-07-09 17:15:30,305][26022] Updated weights on worker 0-0, policy_version 343892 (0.00083) [2022-07-09 17:15:32,306][26022] Updated weights on worker 0-0, policy_version 343902 (0.00099) [2022-07-09 17:15:32,792][25689] Fps is (10 sec: 5717.1, 60 sec: 5663.1, 300 sec: 5655.9). Total num frames: 352158720. Throughput: 0: 5976.7. Samples: 352159444. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:32,794][25689] Avg episode reward: [(0, '-45.242')] [2022-07-09 17:15:33,995][26022] Updated weights on worker 0-0, policy_version 343912 (0.00086) [2022-07-09 17:15:35,734][26022] Updated weights on worker 0-0, policy_version 343922 (0.00085) [2022-07-09 17:15:37,710][26022] Updated weights on worker 0-0, policy_version 343932 (0.00090) [2022-07-09 17:15:37,798][25689] Fps is (10 sec: 5718.7, 60 sec: 5651.7, 300 sec: 5653.3). Total num frames: 352186368. Throughput: 0: 5974.5. Samples: 352193672. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:37,798][25689] Avg episode reward: [(0, '-46.168')] [2022-07-09 17:15:39,331][26022] Updated weights on worker 0-0, policy_version 343942 (0.00081) [2022-07-09 17:15:41,219][26022] Updated weights on worker 0-0, policy_version 343952 (0.00470) [2022-07-09 17:15:42,846][25689] Fps is (10 sec: 5805.7, 60 sec: 5707.2, 300 sec: 5666.4). Total num frames: 352217088. Throughput: 0: 5126.9. Samples: 352210802. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-09 17:15:42,846][25689] Avg episode reward: [(0, '-46.073')] [2022-07-09 17:15:42,848][26022] Updated weights on worker 0-0, policy_version 343962 (0.00086) [2022-07-09 17:15:44,697][26022] Updated weights on worker 0-0, policy_version 343972 (0.00091) [2022-07-09 17:15:46,523][26022] Updated weights on worker 0-0, policy_version 343982 (0.00088) [2022-07-09 17:15:47,908][25689] Fps is (10 sec: 5874.2, 60 sec: 5701.9, 300 sec: 5665.6). Total num frames: 352245760. Throughput: 0: 5971.8. Samples: 352245368. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:15:47,913][25689] Avg episode reward: [(0, '-46.141')] [2022-07-09 17:15:48,201][26022] Updated weights on worker 0-0, policy_version 343992 (0.00089) [2022-07-09 17:15:50,192][26022] Updated weights on worker 0-0, policy_version 344002 (0.00082) [2022-07-09 17:15:51,919][26022] Updated weights on worker 0-0, policy_version 344012 (0.00087) [2022-07-09 17:15:52,921][25689] Fps is (10 sec: 5589.9, 60 sec: 5669.5, 300 sec: 5658.8). Total num frames: 352273408. Throughput: 0: 5986.0. Samples: 352279936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:15:52,922][25689] Avg episode reward: [(0, '-46.303')] [2022-07-09 17:15:53,534][26022] Updated weights on worker 0-0, policy_version 344022 (0.00088) [2022-07-09 17:15:55,595][26022] Updated weights on worker 0-0, policy_version 344032 (0.00083) [2022-07-09 17:15:57,115][26022] Updated weights on worker 0-0, policy_version 344042 (0.00097) [2022-07-09 17:15:57,961][25689] Fps is (10 sec: 5704.4, 60 sec: 5671.2, 300 sec: 5663.8). Total num frames: 352303104. Throughput: 0: 5132.3. Samples: 352297156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:15:57,961][25689] Avg episode reward: [(0, '-46.697')] [2022-07-09 17:15:59,184][26022] Updated weights on worker 0-0, policy_version 344052 (0.00086) [2022-07-09 17:16:00,690][26022] Updated weights on worker 0-0, policy_version 344062 (0.00087) [2022-07-09 17:16:03,006][25689] Fps is (10 sec: 5483.1, 60 sec: 5679.8, 300 sec: 5663.1). Total num frames: 352328704. Throughput: 0: 5971.3. Samples: 352331184. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:03,007][25689] Avg episode reward: [(0, '-47.518')] [2022-07-09 17:16:03,094][26022] Updated weights on worker 0-0, policy_version 344072 (0.00085) [2022-07-09 17:16:04,573][26022] Updated weights on worker 0-0, policy_version 344082 (0.00090) [2022-07-09 17:16:06,550][26022] Updated weights on worker 0-0, policy_version 344092 (0.00088) [2022-07-09 17:16:08,022][25689] Fps is (10 sec: 5394.3, 60 sec: 5681.6, 300 sec: 5659.6). Total num frames: 352357376. Throughput: 0: 5878.2. Samples: 352363600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:08,022][25689] Avg episode reward: [(0, '-47.925')] [2022-07-09 17:16:08,380][26022] Updated weights on worker 0-0, policy_version 344102 (0.00091) [2022-07-09 17:16:10,180][26022] Updated weights on worker 0-0, policy_version 344112 (0.00095) [2022-07-09 17:16:11,938][26022] Updated weights on worker 0-0, policy_version 344122 (0.00088) [2022-07-09 17:16:13,029][25689] Fps is (10 sec: 5721.2, 60 sec: 5666.6, 300 sec: 5666.7). Total num frames: 352386048. Throughput: 0: 5015.2. Samples: 352380782. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:13,030][25689] Avg episode reward: [(0, '-48.089')] [2022-07-09 17:16:13,857][26022] Updated weights on worker 0-0, policy_version 344132 (0.00088) [2022-07-09 17:16:15,548][26022] Updated weights on worker 0-0, policy_version 344142 (0.00089) [2022-07-09 17:16:17,452][26022] Updated weights on worker 0-0, policy_version 344152 (0.00085) [2022-07-09 17:16:18,041][25689] Fps is (10 sec: 5723.2, 60 sec: 5684.7, 300 sec: 5664.7). Total num frames: 352414720. Throughput: 0: 5854.1. Samples: 352414710. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:18,042][25689] Avg episode reward: [(0, '-47.719')] [2022-07-09 17:16:19,168][26022] Updated weights on worker 0-0, policy_version 344162 (0.00086) [2022-07-09 17:16:21,182][26022] Updated weights on worker 0-0, policy_version 344172 (0.00083) [2022-07-09 17:16:23,033][26022] Updated weights on worker 0-0, policy_version 344182 (0.00095) [2022-07-09 17:16:23,169][25689] Fps is (10 sec: 5554.3, 60 sec: 5644.9, 300 sec: 5659.0). Total num frames: 352442368. Throughput: 0: 5818.7. Samples: 352448506. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:23,169][25689] Avg episode reward: [(0, '-47.558')] [2022-07-09 17:16:24,636][26022] Updated weights on worker 0-0, policy_version 344192 (0.00091) [2022-07-09 17:16:26,446][26022] Updated weights on worker 0-0, policy_version 344202 (0.00118) [2022-07-09 17:16:28,179][25689] Fps is (10 sec: 5555.6, 60 sec: 5661.7, 300 sec: 5662.9). Total num frames: 352471040. Throughput: 0: 5051.9. Samples: 352465432. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:28,179][25689] Avg episode reward: [(0, '-47.002')] [2022-07-09 17:16:28,368][26022] Updated weights on worker 0-0, policy_version 344212 (0.00092) [2022-07-09 17:16:30,413][26022] Updated weights on worker 0-0, policy_version 344222 (0.00082) [2022-07-09 17:16:31,993][26022] Updated weights on worker 0-0, policy_version 344232 (0.00081) [2022-07-09 17:16:33,219][25689] Fps is (10 sec: 5808.0, 60 sec: 5660.0, 300 sec: 5666.1). Total num frames: 352500736. Throughput: 0: 5872.8. Samples: 352499352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:33,220][25689] Avg episode reward: [(0, '-47.329')] [2022-07-09 17:16:33,794][26022] Updated weights on worker 0-0, policy_version 344242 (0.00094) [2022-07-09 17:16:35,320][26022] Updated weights on worker 0-0, policy_version 344252 (0.00087) [2022-07-09 17:16:37,487][26022] Updated weights on worker 0-0, policy_version 344262 (0.00086) [2022-07-09 17:16:38,227][25689] Fps is (10 sec: 5706.9, 60 sec: 5659.8, 300 sec: 5665.1). Total num frames: 352528384. Throughput: 0: 5910.9. Samples: 352534026. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:38,228][25689] Avg episode reward: [(0, '-46.626')] [2022-07-09 17:16:38,925][26022] Updated weights on worker 0-0, policy_version 344272 (0.00092) [2022-07-09 17:16:41,156][26022] Updated weights on worker 0-0, policy_version 344282 (0.00088) [2022-07-09 17:16:42,678][26022] Updated weights on worker 0-0, policy_version 344292 (0.00078) [2022-07-09 17:16:43,270][25689] Fps is (10 sec: 5807.1, 60 sec: 5660.3, 300 sec: 5664.9). Total num frames: 352559104. Throughput: 0: 5105.2. Samples: 352551124. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:43,270][25689] Avg episode reward: [(0, '-46.912')] [2022-07-09 17:16:44,444][26022] Updated weights on worker 0-0, policy_version 344302 (0.00091) [2022-07-09 17:16:46,246][26022] Updated weights on worker 0-0, policy_version 344312 (0.00086) [2022-07-09 17:16:48,189][26022] Updated weights on worker 0-0, policy_version 344322 (0.00093) [2022-07-09 17:16:48,289][25689] Fps is (10 sec: 5698.8, 60 sec: 5630.4, 300 sec: 5662.4). Total num frames: 352585728. Throughput: 0: 5972.8. Samples: 352585548. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:48,290][25689] Avg episode reward: [(0, '-47.233')] [2022-07-09 17:16:49,688][26022] Updated weights on worker 0-0, policy_version 344332 (0.00087) [2022-07-09 17:16:51,669][26022] Updated weights on worker 0-0, policy_version 344342 (0.00056) [2022-07-09 17:16:53,231][26022] Updated weights on worker 0-0, policy_version 344352 (0.00087) [2022-07-09 17:16:53,339][25689] Fps is (10 sec: 5695.2, 60 sec: 5677.8, 300 sec: 5668.6). Total num frames: 352616448. Throughput: 0: 6015.5. Samples: 352620384. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:53,339][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 17:16:55,203][26022] Updated weights on worker 0-0, policy_version 344362 (0.00090) [2022-07-09 17:16:57,013][26022] Updated weights on worker 0-0, policy_version 344372 (0.00088) [2022-07-09 17:16:58,360][25689] Fps is (10 sec: 5897.7, 60 sec: 5662.6, 300 sec: 5670.0). Total num frames: 352645120. Throughput: 0: 5143.9. Samples: 352637586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:16:58,360][25689] Avg episode reward: [(0, '-48.149')] [2022-07-09 17:16:58,617][26022] Updated weights on worker 0-0, policy_version 344382 (0.00092) [2022-07-09 17:17:00,559][26022] Updated weights on worker 0-0, policy_version 344392 (0.00083) [2022-07-09 17:17:02,296][26022] Updated weights on worker 0-0, policy_version 344402 (0.00088) [2022-07-09 17:17:03,446][25689] Fps is (10 sec: 5369.4, 60 sec: 5658.7, 300 sec: 5668.5). Total num frames: 352670720. Throughput: 0: 5976.2. Samples: 352671704. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:03,447][25689] Avg episode reward: [(0, '-47.655')] [2022-07-09 17:17:04,404][26022] Updated weights on worker 0-0, policy_version 344412 (0.00083) [2022-07-09 17:17:06,268][26022] Updated weights on worker 0-0, policy_version 344422 (0.00087) [2022-07-09 17:17:08,188][26022] Updated weights on worker 0-0, policy_version 344432 (0.00095) [2022-07-09 17:17:08,448][25689] Fps is (10 sec: 5481.1, 60 sec: 5677.0, 300 sec: 5665.4). Total num frames: 352700416. Throughput: 0: 5866.4. Samples: 352703808. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:08,449][25689] Avg episode reward: [(0, '-47.378')] [2022-07-09 17:17:09,936][26022] Updated weights on worker 0-0, policy_version 344442 (0.00081) [2022-07-09 17:17:11,639][26022] Updated weights on worker 0-0, policy_version 344452 (0.00088) [2022-07-09 17:17:13,428][26022] Updated weights on worker 0-0, policy_version 344462 (0.00086) [2022-07-09 17:17:13,480][25689] Fps is (10 sec: 5817.0, 60 sec: 5674.6, 300 sec: 5669.0). Total num frames: 352729088. Throughput: 0: 5843.6. Samples: 352738084. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:13,481][25689] Avg episode reward: [(0, '-46.659')] [2022-07-09 17:17:15,315][26022] Updated weights on worker 0-0, policy_version 344472 (0.00083) [2022-07-09 17:17:16,537][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:17:16,547][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000344479_352746496.pth [2022-07-09 17:17:16,548][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000342484_350703616.pth [2022-07-09 17:17:17,049][26022] Updated weights on worker 0-0, policy_version 344482 (0.00089) [2022-07-09 17:17:18,515][25689] Fps is (10 sec: 5696.5, 60 sec: 5672.6, 300 sec: 5666.8). Total num frames: 352757760. Throughput: 0: 5844.4. Samples: 352755380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:18,515][25689] Avg episode reward: [(0, '-46.479')] [2022-07-09 17:17:18,855][26022] Updated weights on worker 0-0, policy_version 344492 (0.00109) [2022-07-09 17:17:20,543][26022] Updated weights on worker 0-0, policy_version 344502 (0.00093) [2022-07-09 17:17:22,351][26022] Updated weights on worker 0-0, policy_version 344512 (0.00090) [2022-07-09 17:17:23,588][25689] Fps is (10 sec: 5673.5, 60 sec: 5694.7, 300 sec: 5666.6). Total num frames: 352786432. Throughput: 0: 5866.8. Samples: 352789870. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:23,588][25689] Avg episode reward: [(0, '-45.379')] [2022-07-09 17:17:24,166][26022] Updated weights on worker 0-0, policy_version 344522 (0.01007) [2022-07-09 17:17:25,930][26022] Updated weights on worker 0-0, policy_version 344532 (0.00086) [2022-07-09 17:17:27,728][26022] Updated weights on worker 0-0, policy_version 344542 (0.00081) [2022-07-09 17:17:28,593][25689] Fps is (10 sec: 5588.5, 60 sec: 5678.2, 300 sec: 5663.4). Total num frames: 352814080. Throughput: 0: 5957.4. Samples: 352823816. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:28,593][25689] Avg episode reward: [(0, '-45.498')] [2022-07-09 17:17:29,735][26022] Updated weights on worker 0-0, policy_version 344552 (0.00089) [2022-07-09 17:17:31,506][26022] Updated weights on worker 0-0, policy_version 344562 (0.00088) [2022-07-09 17:17:33,289][26022] Updated weights on worker 0-0, policy_version 344572 (0.00100) [2022-07-09 17:17:33,616][25689] Fps is (10 sec: 5616.3, 60 sec: 5662.8, 300 sec: 5667.4). Total num frames: 352842752. Throughput: 0: 5095.9. Samples: 352840692. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:33,616][25689] Avg episode reward: [(0, '-45.753')] [2022-07-09 17:17:35,122][26022] Updated weights on worker 0-0, policy_version 344582 (0.00084) [2022-07-09 17:17:36,794][26022] Updated weights on worker 0-0, policy_version 344592 (0.00088) [2022-07-09 17:17:38,638][25689] Fps is (10 sec: 5810.6, 60 sec: 5695.4, 300 sec: 5671.7). Total num frames: 352872448. Throughput: 0: 5941.6. Samples: 352874944. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:38,638][25689] Avg episode reward: [(0, '-45.444')] [2022-07-09 17:17:38,643][26022] Updated weights on worker 0-0, policy_version 344602 (0.00088) [2022-07-09 17:17:40,484][26022] Updated weights on worker 0-0, policy_version 344612 (0.00079) [2022-07-09 17:17:42,310][26022] Updated weights on worker 0-0, policy_version 344622 (0.00092) [2022-07-09 17:17:43,758][25689] Fps is (10 sec: 5653.8, 60 sec: 5637.3, 300 sec: 5662.7). Total num frames: 352900096. Throughput: 0: 5916.1. Samples: 352909202. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:43,759][25689] Avg episode reward: [(0, '-46.218')] [2022-07-09 17:17:44,054][26022] Updated weights on worker 0-0, policy_version 344632 (0.00086) [2022-07-09 17:17:45,838][26022] Updated weights on worker 0-0, policy_version 344642 (0.00094) [2022-07-09 17:17:47,664][26022] Updated weights on worker 0-0, policy_version 344652 (0.00085) [2022-07-09 17:17:48,779][25689] Fps is (10 sec: 5654.6, 60 sec: 5688.0, 300 sec: 5666.2). Total num frames: 352929792. Throughput: 0: 5069.7. Samples: 352926154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:48,779][25689] Avg episode reward: [(0, '-46.358')] [2022-07-09 17:17:49,534][26022] Updated weights on worker 0-0, policy_version 344662 (0.00394) [2022-07-09 17:17:51,244][26022] Updated weights on worker 0-0, policy_version 344672 (0.00419) [2022-07-09 17:17:53,108][26022] Updated weights on worker 0-0, policy_version 344682 (0.00082) [2022-07-09 17:17:53,842][25689] Fps is (10 sec: 5788.7, 60 sec: 5652.9, 300 sec: 5665.2). Total num frames: 352958464. Throughput: 0: 5922.1. Samples: 352960472. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:53,842][25689] Avg episode reward: [(0, '-46.510')] [2022-07-09 17:17:54,803][26022] Updated weights on worker 0-0, policy_version 344692 (0.00090) [2022-07-09 17:17:56,760][26022] Updated weights on worker 0-0, policy_version 344702 (0.00084) [2022-07-09 17:17:58,354][26022] Updated weights on worker 0-0, policy_version 344712 (0.00090) [2022-07-09 17:17:58,887][25689] Fps is (10 sec: 5673.1, 60 sec: 5650.6, 300 sec: 5666.5). Total num frames: 352987136. Throughput: 0: 5925.8. Samples: 352994936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:17:58,889][25689] Avg episode reward: [(0, '-46.391')] [2022-07-09 17:18:00,094][26022] Updated weights on worker 0-0, policy_version 344722 (0.00093) [2022-07-09 17:18:02,447][26022] Updated weights on worker 0-0, policy_version 344732 (0.00093) [2022-07-09 17:18:03,925][25689] Fps is (10 sec: 5585.1, 60 sec: 5689.0, 300 sec: 5663.1). Total num frames: 353014784. Throughput: 0: 5089.5. Samples: 353011842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:18:03,926][25689] Avg episode reward: [(0, '-47.319')] [2022-07-09 17:18:03,968][26022] Updated weights on worker 0-0, policy_version 344742 (0.00091) [2022-07-09 17:18:06,109][26022] Updated weights on worker 0-0, policy_version 344752 (0.00084) [2022-07-09 17:18:07,819][26022] Updated weights on worker 0-0, policy_version 344762 (0.00090) [2022-07-09 17:18:08,943][25689] Fps is (10 sec: 5397.1, 60 sec: 5636.7, 300 sec: 5662.9). Total num frames: 353041408. Throughput: 0: 5829.4. Samples: 353043698. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:18:08,943][25689] Avg episode reward: [(0, '-47.583')] [2022-07-09 17:18:09,640][26022] Updated weights on worker 0-0, policy_version 344772 (0.00090) [2022-07-09 17:18:11,317][26022] Updated weights on worker 0-0, policy_version 344782 (0.00093) [2022-07-09 17:18:13,162][26022] Updated weights on worker 0-0, policy_version 344792 (0.00091) [2022-07-09 17:18:13,966][25689] Fps is (10 sec: 5609.5, 60 sec: 5654.6, 300 sec: 5666.6). Total num frames: 353071104. Throughput: 0: 5837.5. Samples: 353077946. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 17:18:13,966][25689] Avg episode reward: [(0, '-48.014')] [2022-07-09 17:18:15,102][26022] Updated weights on worker 0-0, policy_version 344802 (0.00081) [2022-07-09 17:18:16,770][26022] Updated weights on worker 0-0, policy_version 344812 (0.00088) [2022-07-09 17:18:18,797][26022] Updated weights on worker 0-0, policy_version 344822 (0.00086) [2022-07-09 17:18:18,979][25689] Fps is (10 sec: 5713.8, 60 sec: 5639.6, 300 sec: 5658.4). Total num frames: 353098752. Throughput: 0: 4984.7. Samples: 353095088. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:18,979][25689] Avg episode reward: [(0, '-47.312')] [2022-07-09 17:18:20,453][26022] Updated weights on worker 0-0, policy_version 344832 (0.00089) [2022-07-09 17:18:22,449][26022] Updated weights on worker 0-0, policy_version 344842 (0.00085) [2022-07-09 17:18:24,056][25689] Fps is (10 sec: 5581.6, 60 sec: 5639.2, 300 sec: 5661.2). Total num frames: 353127424. Throughput: 0: 5825.5. Samples: 353129112. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:24,056][25689] Avg episode reward: [(0, '-47.703')] [2022-07-09 17:18:24,091][26022] Updated weights on worker 0-0, policy_version 344852 (0.00092) [2022-07-09 17:18:26,020][26022] Updated weights on worker 0-0, policy_version 344862 (0.00099) [2022-07-09 17:18:27,621][26022] Updated weights on worker 0-0, policy_version 344872 (0.00091) [2022-07-09 17:18:29,098][25689] Fps is (10 sec: 5667.0, 60 sec: 5652.7, 300 sec: 5667.7). Total num frames: 353156096. Throughput: 0: 5907.8. Samples: 353162770. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:29,098][25689] Avg episode reward: [(0, '-47.936')] [2022-07-09 17:18:29,510][26022] Updated weights on worker 0-0, policy_version 344882 (0.00088) [2022-07-09 17:18:31,365][26022] Updated weights on worker 0-0, policy_version 344892 (0.00088) [2022-07-09 17:18:33,203][26022] Updated weights on worker 0-0, policy_version 344902 (0.00086) [2022-07-09 17:18:34,112][25689] Fps is (10 sec: 5600.3, 60 sec: 5636.5, 300 sec: 5657.2). Total num frames: 353183744. Throughput: 0: 5048.6. Samples: 353179660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:34,113][25689] Avg episode reward: [(0, '-47.488')] [2022-07-09 17:18:35,027][26022] Updated weights on worker 0-0, policy_version 344912 (0.00094) [2022-07-09 17:18:36,724][26022] Updated weights on worker 0-0, policy_version 344922 (0.00086) [2022-07-09 17:18:38,531][26022] Updated weights on worker 0-0, policy_version 344932 (0.00096) [2022-07-09 17:18:39,124][25689] Fps is (10 sec: 5719.1, 60 sec: 5637.5, 300 sec: 5659.5). Total num frames: 353213440. Throughput: 0: 5889.0. Samples: 353213726. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:39,125][25689] Avg episode reward: [(0, '-46.579')] [2022-07-09 17:18:40,583][26022] Updated weights on worker 0-0, policy_version 344942 (0.00096) [2022-07-09 17:18:42,168][26022] Updated weights on worker 0-0, policy_version 344952 (0.00087) [2022-07-09 17:18:44,176][25689] Fps is (10 sec: 5697.9, 60 sec: 5643.9, 300 sec: 5666.2). Total num frames: 353241088. Throughput: 0: 5893.8. Samples: 353247700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:44,178][25689] Avg episode reward: [(0, '-46.222')] [2022-07-09 17:18:44,188][26022] Updated weights on worker 0-0, policy_version 344962 (0.01072) [2022-07-09 17:18:45,792][26022] Updated weights on worker 0-0, policy_version 344972 (0.00089) [2022-07-09 17:18:47,732][26022] Updated weights on worker 0-0, policy_version 344982 (0.00085) [2022-07-09 17:18:49,255][25689] Fps is (10 sec: 5559.3, 60 sec: 5621.5, 300 sec: 5659.3). Total num frames: 353269760. Throughput: 0: 5056.4. Samples: 353264694. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:49,257][25689] Avg episode reward: [(0, '-45.498')] [2022-07-09 17:18:49,412][26022] Updated weights on worker 0-0, policy_version 344992 (0.00079) [2022-07-09 17:18:51,138][26022] Updated weights on worker 0-0, policy_version 345002 (0.00089) [2022-07-09 17:18:53,051][26022] Updated weights on worker 0-0, policy_version 345012 (0.00083) [2022-07-09 17:18:54,303][25689] Fps is (10 sec: 5763.6, 60 sec: 5639.8, 300 sec: 5662.8). Total num frames: 353299456. Throughput: 0: 5902.9. Samples: 353298846. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:54,305][25689] Avg episode reward: [(0, '-45.967')] [2022-07-09 17:18:54,827][26022] Updated weights on worker 0-0, policy_version 345022 (0.00088) [2022-07-09 17:18:56,605][26022] Updated weights on worker 0-0, policy_version 345032 (0.00093) [2022-07-09 17:18:58,504][26022] Updated weights on worker 0-0, policy_version 345042 (0.00089) [2022-07-09 17:18:59,344][25689] Fps is (10 sec: 5785.6, 60 sec: 5640.3, 300 sec: 5666.3). Total num frames: 353328128. Throughput: 0: 5896.9. Samples: 353332958. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:18:59,344][25689] Avg episode reward: [(0, '-45.675')] [2022-07-09 17:19:00,179][26022] Updated weights on worker 0-0, policy_version 345052 (0.00403) [2022-07-09 17:19:02,520][26022] Updated weights on worker 0-0, policy_version 345062 (0.00080) [2022-07-09 17:19:04,269][26022] Updated weights on worker 0-0, policy_version 345072 (0.00112) [2022-07-09 17:19:04,411][25689] Fps is (10 sec: 5470.6, 60 sec: 5620.6, 300 sec: 5663.5). Total num frames: 353354752. Throughput: 0: 5055.6. Samples: 353350000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:04,412][25689] Avg episode reward: [(0, '-45.969')] [2022-07-09 17:19:05,975][26022] Updated weights on worker 0-0, policy_version 345082 (0.00110) [2022-07-09 17:19:07,971][26022] Updated weights on worker 0-0, policy_version 345092 (0.00093) [2022-07-09 17:19:09,458][25689] Fps is (10 sec: 5466.9, 60 sec: 5651.7, 300 sec: 5660.5). Total num frames: 353383424. Throughput: 0: 5810.0. Samples: 353382074. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:09,459][25689] Avg episode reward: [(0, '-45.867')] [2022-07-09 17:19:09,571][26022] Updated weights on worker 0-0, policy_version 345102 (0.00086) [2022-07-09 17:19:11,511][26022] Updated weights on worker 0-0, policy_version 345112 (0.00087) [2022-07-09 17:19:13,027][26022] Updated weights on worker 0-0, policy_version 345122 (0.00082) [2022-07-09 17:19:14,475][25689] Fps is (10 sec: 5698.2, 60 sec: 5635.4, 300 sec: 5660.8). Total num frames: 353412096. Throughput: 0: 5832.2. Samples: 353416490. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:14,477][25689] Avg episode reward: [(0, '-46.376')] [2022-07-09 17:19:15,071][26022] Updated weights on worker 0-0, policy_version 345132 (0.00093) [2022-07-09 17:19:16,724][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:19:16,737][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000345142_353425408.pth [2022-07-09 17:19:16,737][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000343148_351383552.pth [2022-07-09 17:19:16,742][26022] Updated weights on worker 0-0, policy_version 345142 (0.00091) [2022-07-09 17:19:18,628][26022] Updated weights on worker 0-0, policy_version 345152 (0.00090) [2022-07-09 17:19:19,501][25689] Fps is (10 sec: 5710.2, 60 sec: 5651.1, 300 sec: 5655.6). Total num frames: 353440768. Throughput: 0: 4994.9. Samples: 353433640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:19,501][25689] Avg episode reward: [(0, '-47.008')] [2022-07-09 17:19:20,354][26022] Updated weights on worker 0-0, policy_version 345162 (0.00084) [2022-07-09 17:19:22,107][26022] Updated weights on worker 0-0, policy_version 345172 (0.00075) [2022-07-09 17:19:23,901][26022] Updated weights on worker 0-0, policy_version 345182 (0.00091) [2022-07-09 17:19:24,621][25689] Fps is (10 sec: 5651.7, 60 sec: 5647.1, 300 sec: 5663.8). Total num frames: 353469440. Throughput: 0: 5836.6. Samples: 353467956. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:24,622][25689] Avg episode reward: [(0, '-47.195')] [2022-07-09 17:19:25,858][26022] Updated weights on worker 0-0, policy_version 345192 (0.00088) [2022-07-09 17:19:27,517][26022] Updated weights on worker 0-0, policy_version 345202 (0.00093) [2022-07-09 17:19:29,434][26022] Updated weights on worker 0-0, policy_version 345212 (0.00064) [2022-07-09 17:19:29,630][25689] Fps is (10 sec: 5661.5, 60 sec: 5650.2, 300 sec: 5656.9). Total num frames: 353498112. Throughput: 0: 5943.4. Samples: 353501960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:29,630][25689] Avg episode reward: [(0, '-47.648')] [2022-07-09 17:19:31,407][26022] Updated weights on worker 0-0, policy_version 345222 (0.00082) [2022-07-09 17:19:33,039][26022] Updated weights on worker 0-0, policy_version 345232 (0.00070) [2022-07-09 17:19:34,644][25689] Fps is (10 sec: 5619.4, 60 sec: 5650.2, 300 sec: 5654.4). Total num frames: 353525760. Throughput: 0: 5918.9. Samples: 353535866. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:34,644][25689] Avg episode reward: [(0, '-48.311')] [2022-07-09 17:19:34,933][26022] Updated weights on worker 0-0, policy_version 345242 (0.00090) [2022-07-09 17:19:36,534][26022] Updated weights on worker 0-0, policy_version 345252 (0.00088) [2022-07-09 17:19:38,526][26022] Updated weights on worker 0-0, policy_version 345262 (0.00090) [2022-07-09 17:19:39,661][25689] Fps is (10 sec: 5614.2, 60 sec: 5632.8, 300 sec: 5659.4). Total num frames: 353554432. Throughput: 0: 5924.3. Samples: 353553076. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:39,662][25689] Avg episode reward: [(0, '-48.540')] [2022-07-09 17:19:40,120][26022] Updated weights on worker 0-0, policy_version 345272 (0.00081) [2022-07-09 17:19:42,130][26022] Updated weights on worker 0-0, policy_version 345282 (0.00091) [2022-07-09 17:19:43,705][26022] Updated weights on worker 0-0, policy_version 345292 (0.00088) [2022-07-09 17:19:44,708][25689] Fps is (10 sec: 5697.5, 60 sec: 5650.2, 300 sec: 5658.6). Total num frames: 353583104. Throughput: 0: 5929.6. Samples: 353587064. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:44,709][25689] Avg episode reward: [(0, '-48.113')] [2022-07-09 17:19:45,583][26022] Updated weights on worker 0-0, policy_version 345302 (0.00093) [2022-07-09 17:19:47,571][26022] Updated weights on worker 0-0, policy_version 345312 (0.00089) [2022-07-09 17:19:49,204][26022] Updated weights on worker 0-0, policy_version 345322 (0.00090) [2022-07-09 17:19:49,755][25689] Fps is (10 sec: 5680.9, 60 sec: 5653.1, 300 sec: 5654.8). Total num frames: 353611776. Throughput: 0: 5928.2. Samples: 353621270. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:49,756][25689] Avg episode reward: [(0, '-48.231')] [2022-07-09 17:19:51,183][26022] Updated weights on worker 0-0, policy_version 345332 (0.00093) [2022-07-09 17:19:52,816][26022] Updated weights on worker 0-0, policy_version 345342 (0.00087) [2022-07-09 17:19:54,779][26022] Updated weights on worker 0-0, policy_version 345352 (0.00091) [2022-07-09 17:19:54,786][25689] Fps is (10 sec: 5588.8, 60 sec: 5621.0, 300 sec: 5648.5). Total num frames: 353639424. Throughput: 0: 5099.4. Samples: 353638578. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:54,786][25689] Avg episode reward: [(0, '-47.727')] [2022-07-09 17:19:56,185][26022] Updated weights on worker 0-0, policy_version 345362 (0.00084) [2022-07-09 17:19:58,103][26022] Updated weights on worker 0-0, policy_version 345372 (0.00093) [2022-07-09 17:19:59,799][25689] Fps is (10 sec: 5811.6, 60 sec: 5657.4, 300 sec: 5668.0). Total num frames: 353670144. Throughput: 0: 5965.1. Samples: 353673198. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:19:59,799][25689] Avg episode reward: [(0, '-46.998')] [2022-07-09 17:19:59,886][26022] Updated weights on worker 0-0, policy_version 345382 (0.00086) [2022-07-09 17:20:01,868][26022] Updated weights on worker 0-0, policy_version 345392 (0.00096) [2022-07-09 17:20:04,110][26022] Updated weights on worker 0-0, policy_version 345402 (0.00090) [2022-07-09 17:20:04,932][25689] Fps is (10 sec: 5550.9, 60 sec: 5634.3, 300 sec: 5655.8). Total num frames: 353695744. Throughput: 0: 5842.9. Samples: 353705228. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:04,932][25689] Avg episode reward: [(0, '-46.720')] [2022-07-09 17:20:05,631][26022] Updated weights on worker 0-0, policy_version 345412 (0.00090) [2022-07-09 17:20:07,419][26022] Updated weights on worker 0-0, policy_version 345422 (0.00091) [2022-07-09 17:20:09,543][26022] Updated weights on worker 0-0, policy_version 345432 (0.00094) [2022-07-09 17:20:09,965][25689] Fps is (10 sec: 5338.6, 60 sec: 5635.6, 300 sec: 5652.3). Total num frames: 353724416. Throughput: 0: 4999.3. Samples: 353722302. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:09,965][25689] Avg episode reward: [(0, '-47.344')] [2022-07-09 17:20:10,999][26022] Updated weights on worker 0-0, policy_version 345442 (0.00092) [2022-07-09 17:20:13,098][26022] Updated weights on worker 0-0, policy_version 345452 (0.00090) [2022-07-09 17:20:14,624][26022] Updated weights on worker 0-0, policy_version 345462 (0.00091) [2022-07-09 17:20:15,055][25689] Fps is (10 sec: 5765.7, 60 sec: 5645.7, 300 sec: 5657.9). Total num frames: 353754112. Throughput: 0: 5811.6. Samples: 353756376. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:15,055][25689] Avg episode reward: [(0, '-46.950')] [2022-07-09 17:20:16,572][26022] Updated weights on worker 0-0, policy_version 345472 (0.00106) [2022-07-09 17:20:18,361][26022] Updated weights on worker 0-0, policy_version 345482 (0.00085) [2022-07-09 17:20:20,101][25689] Fps is (10 sec: 5758.1, 60 sec: 5643.8, 300 sec: 5654.8). Total num frames: 353782784. Throughput: 0: 5787.2. Samples: 353790692. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:20,103][25689] Avg episode reward: [(0, '-46.630')] [2022-07-09 17:20:20,104][26022] Updated weights on worker 0-0, policy_version 345492 (0.00088) [2022-07-09 17:20:21,836][26022] Updated weights on worker 0-0, policy_version 345502 (0.00083) [2022-07-09 17:20:23,870][26022] Updated weights on worker 0-0, policy_version 345512 (0.00096) [2022-07-09 17:20:25,169][25689] Fps is (10 sec: 5669.8, 60 sec: 5648.7, 300 sec: 5657.2). Total num frames: 353811456. Throughput: 0: 5067.8. Samples: 353807786. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:25,171][25689] Avg episode reward: [(0, '-46.736')] [2022-07-09 17:20:25,510][26022] Updated weights on worker 0-0, policy_version 345522 (0.00092) [2022-07-09 17:20:27,599][26022] Updated weights on worker 0-0, policy_version 345532 (0.00084) [2022-07-09 17:20:28,983][26022] Updated weights on worker 0-0, policy_version 345542 (0.00091) [2022-07-09 17:20:30,194][25689] Fps is (10 sec: 5783.3, 60 sec: 5664.1, 300 sec: 5657.1). Total num frames: 353841152. Throughput: 0: 5897.9. Samples: 353841610. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:30,195][25689] Avg episode reward: [(0, '-46.791')] [2022-07-09 17:20:31,249][26022] Updated weights on worker 0-0, policy_version 345552 (0.00089) [2022-07-09 17:20:32,803][26022] Updated weights on worker 0-0, policy_version 345562 (0.00089) [2022-07-09 17:20:34,697][26022] Updated weights on worker 0-0, policy_version 345572 (0.00088) [2022-07-09 17:20:35,281][25689] Fps is (10 sec: 5569.8, 60 sec: 5640.4, 300 sec: 5652.1). Total num frames: 353867776. Throughput: 0: 5876.2. Samples: 353875226. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:35,282][25689] Avg episode reward: [(0, '-47.606')] [2022-07-09 17:20:36,462][26022] Updated weights on worker 0-0, policy_version 345582 (0.00081) [2022-07-09 17:20:38,524][26022] Updated weights on worker 0-0, policy_version 345592 (0.00240) [2022-07-09 17:20:40,120][26022] Updated weights on worker 0-0, policy_version 345602 (0.00074) [2022-07-09 17:20:40,346][25689] Fps is (10 sec: 5547.4, 60 sec: 5652.8, 300 sec: 5648.3). Total num frames: 353897472. Throughput: 0: 5009.2. Samples: 353892102. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:40,347][25689] Avg episode reward: [(0, '-47.914')] [2022-07-09 17:20:42,046][26022] Updated weights on worker 0-0, policy_version 345612 (0.00084) [2022-07-09 17:20:43,585][26022] Updated weights on worker 0-0, policy_version 345622 (0.01145) [2022-07-09 17:20:45,398][25689] Fps is (10 sec: 5668.1, 60 sec: 5635.5, 300 sec: 5645.1). Total num frames: 353925120. Throughput: 0: 5856.5. Samples: 353926256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 17:20:45,398][25689] Avg episode reward: [(0, '-48.290')] [2022-07-09 17:20:45,695][26022] Updated weights on worker 0-0, policy_version 345632 (0.00093) [2022-07-09 17:20:47,278][26022] Updated weights on worker 0-0, policy_version 345642 (0.00088) [2022-07-09 17:20:49,140][26022] Updated weights on worker 0-0, policy_version 345652 (0.00094) [2022-07-09 17:20:50,401][25689] Fps is (10 sec: 5703.3, 60 sec: 5656.5, 300 sec: 5652.2). Total num frames: 353954816. Throughput: 0: 5882.6. Samples: 353960480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:20:50,411][25689] Avg episode reward: [(0, '-49.202')] [2022-07-09 17:20:51,059][26022] Updated weights on worker 0-0, policy_version 345662 (0.00088) [2022-07-09 17:20:52,564][26022] Updated weights on worker 0-0, policy_version 345672 (0.00087) [2022-07-09 17:20:54,759][26022] Updated weights on worker 0-0, policy_version 345682 (0.00092) [2022-07-09 17:20:55,455][25689] Fps is (10 sec: 5803.7, 60 sec: 5671.2, 300 sec: 5648.5). Total num frames: 353983488. Throughput: 0: 5061.2. Samples: 353977332. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:20:55,455][25689] Avg episode reward: [(0, '-49.619')] [2022-07-09 17:20:56,392][26022] Updated weights on worker 0-0, policy_version 345692 (0.00080) [2022-07-09 17:20:58,062][26022] Updated weights on worker 0-0, policy_version 345702 (0.00092) [2022-07-09 17:21:00,051][26022] Updated weights on worker 0-0, policy_version 345712 (0.00106) [2022-07-09 17:21:00,484][25689] Fps is (10 sec: 5484.0, 60 sec: 5602.2, 300 sec: 5652.2). Total num frames: 354010112. Throughput: 0: 5938.4. Samples: 354011688. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:00,484][25689] Avg episode reward: [(0, '-48.473')] [2022-07-09 17:21:01,763][26022] Updated weights on worker 0-0, policy_version 345722 (0.00089) [2022-07-09 17:21:04,265][26022] Updated weights on worker 0-0, policy_version 345732 (0.00083) [2022-07-09 17:21:05,621][25689] Fps is (10 sec: 5439.3, 60 sec: 5652.4, 300 sec: 5649.9). Total num frames: 354038784. Throughput: 0: 5812.6. Samples: 354043806. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:05,621][25689] Avg episode reward: [(0, '-46.990')] [2022-07-09 17:21:05,626][26022] Updated weights on worker 0-0, policy_version 345742 (0.00089) [2022-07-09 17:21:07,695][26022] Updated weights on worker 0-0, policy_version 345752 (0.00090) [2022-07-09 17:21:09,446][26022] Updated weights on worker 0-0, policy_version 345762 (0.00089) [2022-07-09 17:21:10,647][25689] Fps is (10 sec: 5541.7, 60 sec: 5636.2, 300 sec: 5646.1). Total num frames: 354066432. Throughput: 0: 4947.9. Samples: 354060656. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:10,647][25689] Avg episode reward: [(0, '-45.997')] [2022-07-09 17:21:11,371][26022] Updated weights on worker 0-0, policy_version 345772 (0.00089) [2022-07-09 17:21:12,954][26022] Updated weights on worker 0-0, policy_version 345782 (0.00085) [2022-07-09 17:21:14,923][26022] Updated weights on worker 0-0, policy_version 345792 (0.00086) [2022-07-09 17:21:15,658][25689] Fps is (10 sec: 5611.0, 60 sec: 5626.6, 300 sec: 5646.1). Total num frames: 354095104. Throughput: 0: 5813.5. Samples: 354094784. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:15,659][25689] Avg episode reward: [(0, '-45.596')] [2022-07-09 17:21:16,347][26022] Updated weights on worker 0-0, policy_version 345802 (0.00084) [2022-07-09 17:21:16,852][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:21:16,873][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000345804_354103296.pth [2022-07-09 17:21:16,874][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000343813_352064512.pth [2022-07-09 17:21:18,525][26022] Updated weights on worker 0-0, policy_version 345812 (0.00692) [2022-07-09 17:21:20,048][26022] Updated weights on worker 0-0, policy_version 345822 (0.00085) [2022-07-09 17:21:20,762][25689] Fps is (10 sec: 5669.1, 60 sec: 5621.3, 300 sec: 5650.0). Total num frames: 354123776. Throughput: 0: 5794.5. Samples: 354129190. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:20,762][25689] Avg episode reward: [(0, '-45.361')] [2022-07-09 17:21:21,984][26022] Updated weights on worker 0-0, policy_version 345832 (0.00090) [2022-07-09 17:21:23,804][26022] Updated weights on worker 0-0, policy_version 345842 (0.00088) [2022-07-09 17:21:25,562][26022] Updated weights on worker 0-0, policy_version 345852 (0.00091) [2022-07-09 17:21:25,850][25689] Fps is (10 sec: 5727.2, 60 sec: 5636.3, 300 sec: 5652.0). Total num frames: 354153472. Throughput: 0: 5889.7. Samples: 354162948. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:25,850][25689] Avg episode reward: [(0, '-45.487')] [2022-07-09 17:21:27,517][26022] Updated weights on worker 0-0, policy_version 345862 (0.00088) [2022-07-09 17:21:29,271][26022] Updated weights on worker 0-0, policy_version 345872 (0.00094) [2022-07-09 17:21:30,932][25689] Fps is (10 sec: 5739.0, 60 sec: 5614.1, 300 sec: 5647.7). Total num frames: 354182144. Throughput: 0: 5884.7. Samples: 354180032. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:30,933][25689] Avg episode reward: [(0, '-45.324')] [2022-07-09 17:21:30,970][26022] Updated weights on worker 0-0, policy_version 345882 (0.00099) [2022-07-09 17:21:33,012][26022] Updated weights on worker 0-0, policy_version 345892 (0.00090) [2022-07-09 17:21:34,558][26022] Updated weights on worker 0-0, policy_version 345902 (0.00084) [2022-07-09 17:21:35,959][25689] Fps is (10 sec: 5672.3, 60 sec: 5653.4, 300 sec: 5650.8). Total num frames: 354210816. Throughput: 0: 5883.3. Samples: 354214220. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:35,959][25689] Avg episode reward: [(0, '-46.355')] [2022-07-09 17:21:36,528][26022] Updated weights on worker 0-0, policy_version 345912 (0.00093) [2022-07-09 17:21:38,186][26022] Updated weights on worker 0-0, policy_version 345922 (0.00093) [2022-07-09 17:21:40,176][26022] Updated weights on worker 0-0, policy_version 345932 (0.00090) [2022-07-09 17:21:40,986][25689] Fps is (10 sec: 5703.8, 60 sec: 5640.1, 300 sec: 5644.2). Total num frames: 354239488. Throughput: 0: 5874.4. Samples: 354247994. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:40,987][25689] Avg episode reward: [(0, '-46.920')] [2022-07-09 17:21:41,923][26022] Updated weights on worker 0-0, policy_version 345942 (0.00085) [2022-07-09 17:21:43,809][26022] Updated weights on worker 0-0, policy_version 345952 (0.00095) [2022-07-09 17:21:45,414][26022] Updated weights on worker 0-0, policy_version 345962 (0.00089) [2022-07-09 17:21:46,082][25689] Fps is (10 sec: 5563.7, 60 sec: 5636.0, 300 sec: 5646.2). Total num frames: 354267136. Throughput: 0: 5034.9. Samples: 354264816. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:46,082][25689] Avg episode reward: [(0, '-46.890')] [2022-07-09 17:21:47,493][26022] Updated weights on worker 0-0, policy_version 345972 (0.00092) [2022-07-09 17:21:49,126][26022] Updated weights on worker 0-0, policy_version 345982 (0.00089) [2022-07-09 17:21:51,091][25689] Fps is (10 sec: 5472.2, 60 sec: 5601.7, 300 sec: 5636.7). Total num frames: 354294784. Throughput: 0: 5893.7. Samples: 354298840. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:51,091][25689] Avg episode reward: [(0, '-46.213')] [2022-07-09 17:21:51,184][26022] Updated weights on worker 0-0, policy_version 345992 (0.00094) [2022-07-09 17:21:52,688][26022] Updated weights on worker 0-0, policy_version 346002 (0.00089) [2022-07-09 17:21:54,539][26022] Updated weights on worker 0-0, policy_version 346012 (0.00090) [2022-07-09 17:21:56,109][25689] Fps is (10 sec: 5718.6, 60 sec: 5621.9, 300 sec: 5640.2). Total num frames: 354324480. Throughput: 0: 5894.7. Samples: 354333002. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:21:56,110][25689] Avg episode reward: [(0, '-47.259')] [2022-07-09 17:21:56,296][26022] Updated weights on worker 0-0, policy_version 346022 (0.00084) [2022-07-09 17:21:58,131][26022] Updated weights on worker 0-0, policy_version 346032 (0.00079) [2022-07-09 17:22:00,004][26022] Updated weights on worker 0-0, policy_version 346042 (0.00085) [2022-07-09 17:22:01,134][25689] Fps is (10 sec: 5812.0, 60 sec: 5656.1, 300 sec: 5651.7). Total num frames: 354353152. Throughput: 0: 5067.9. Samples: 354350100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:01,134][25689] Avg episode reward: [(0, '-46.766')] [2022-07-09 17:22:02,376][26022] Updated weights on worker 0-0, policy_version 346052 (0.00093) [2022-07-09 17:22:03,931][26022] Updated weights on worker 0-0, policy_version 346062 (0.00088) [2022-07-09 17:22:06,057][26022] Updated weights on worker 0-0, policy_version 346072 (0.00109) [2022-07-09 17:22:06,217][25689] Fps is (10 sec: 5369.3, 60 sec: 5610.4, 300 sec: 5636.4). Total num frames: 354378752. Throughput: 0: 5827.9. Samples: 354382164. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:06,218][25689] Avg episode reward: [(0, '-46.616')] [2022-07-09 17:22:07,459][26022] Updated weights on worker 0-0, policy_version 346082 (0.00093) [2022-07-09 17:22:09,551][26022] Updated weights on worker 0-0, policy_version 346092 (0.00088) [2022-07-09 17:22:11,219][26022] Updated weights on worker 0-0, policy_version 346102 (0.00083) [2022-07-09 17:22:11,236][25689] Fps is (10 sec: 5473.6, 60 sec: 5644.8, 300 sec: 5640.1). Total num frames: 354408448. Throughput: 0: 5821.5. Samples: 354416114. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:11,236][25689] Avg episode reward: [(0, '-46.833')] [2022-07-09 17:22:13,115][26022] Updated weights on worker 0-0, policy_version 346112 (0.00093) [2022-07-09 17:22:14,868][26022] Updated weights on worker 0-0, policy_version 346122 (0.00094) [2022-07-09 17:22:16,247][25689] Fps is (10 sec: 5819.7, 60 sec: 5644.9, 300 sec: 5640.5). Total num frames: 354437120. Throughput: 0: 4971.9. Samples: 354433122. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:16,247][25689] Avg episode reward: [(0, '-45.885')] [2022-07-09 17:22:16,636][26022] Updated weights on worker 0-0, policy_version 346132 (0.00082) [2022-07-09 17:22:18,316][26022] Updated weights on worker 0-0, policy_version 346142 (0.00086) [2022-07-09 17:22:20,352][26022] Updated weights on worker 0-0, policy_version 346152 (0.00089) [2022-07-09 17:22:21,266][25689] Fps is (10 sec: 5615.0, 60 sec: 5635.8, 300 sec: 5638.1). Total num frames: 354464768. Throughput: 0: 5820.8. Samples: 354467288. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:21,267][25689] Avg episode reward: [(0, '-47.438')] [2022-07-09 17:22:22,047][26022] Updated weights on worker 0-0, policy_version 346162 (0.00092) [2022-07-09 17:22:23,929][26022] Updated weights on worker 0-0, policy_version 346172 (0.00090) [2022-07-09 17:22:25,485][26022] Updated weights on worker 0-0, policy_version 346182 (0.00089) [2022-07-09 17:22:26,359][25689] Fps is (10 sec: 5569.6, 60 sec: 5618.5, 300 sec: 5639.9). Total num frames: 354493440. Throughput: 0: 5912.2. Samples: 354501244. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:26,359][25689] Avg episode reward: [(0, '-47.327')] [2022-07-09 17:22:27,470][26022] Updated weights on worker 0-0, policy_version 346192 (0.00095) [2022-07-09 17:22:29,307][26022] Updated weights on worker 0-0, policy_version 346202 (0.00442) [2022-07-09 17:22:31,094][26022] Updated weights on worker 0-0, policy_version 346212 (0.00088) [2022-07-09 17:22:31,367][25689] Fps is (10 sec: 5676.9, 60 sec: 5625.4, 300 sec: 5640.1). Total num frames: 354522112. Throughput: 0: 5058.9. Samples: 354517958. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:31,368][25689] Avg episode reward: [(0, '-47.109')] [2022-07-09 17:22:32,750][26022] Updated weights on worker 0-0, policy_version 346222 (0.00095) [2022-07-09 17:22:34,860][26022] Updated weights on worker 0-0, policy_version 346232 (0.00083) [2022-07-09 17:22:36,383][25689] Fps is (10 sec: 5720.2, 60 sec: 5626.3, 300 sec: 5636.8). Total num frames: 354550784. Throughput: 0: 5912.1. Samples: 354552172. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:36,384][25689] Avg episode reward: [(0, '-46.870')] [2022-07-09 17:22:36,508][26022] Updated weights on worker 0-0, policy_version 346242 (0.00095) [2022-07-09 17:22:38,431][26022] Updated weights on worker 0-0, policy_version 346252 (0.00095) [2022-07-09 17:22:40,100][26022] Updated weights on worker 0-0, policy_version 346262 (0.00091) [2022-07-09 17:22:41,408][25689] Fps is (10 sec: 5609.5, 60 sec: 5609.7, 300 sec: 5638.6). Total num frames: 354578432. Throughput: 0: 5901.0. Samples: 354586142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:41,409][25689] Avg episode reward: [(0, '-46.810')] [2022-07-09 17:22:42,108][26022] Updated weights on worker 0-0, policy_version 346272 (0.00088) [2022-07-09 17:22:43,780][26022] Updated weights on worker 0-0, policy_version 346282 (0.00085) [2022-07-09 17:22:45,666][26022] Updated weights on worker 0-0, policy_version 346292 (0.00088) [2022-07-09 17:22:46,473][25689] Fps is (10 sec: 5683.5, 60 sec: 5646.4, 300 sec: 5637.8). Total num frames: 354608128. Throughput: 0: 5065.1. Samples: 354603126. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:46,474][25689] Avg episode reward: [(0, '-46.394')] [2022-07-09 17:22:47,462][26022] Updated weights on worker 0-0, policy_version 346302 (0.00086) [2022-07-09 17:22:49,244][26022] Updated weights on worker 0-0, policy_version 346312 (0.00087) [2022-07-09 17:22:51,026][26022] Updated weights on worker 0-0, policy_version 346322 (0.00086) [2022-07-09 17:22:51,478][25689] Fps is (10 sec: 5796.2, 60 sec: 5663.7, 300 sec: 5638.9). Total num frames: 354636800. Throughput: 0: 5939.1. Samples: 354637396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:51,478][25689] Avg episode reward: [(0, '-47.164')] [2022-07-09 17:22:52,828][26022] Updated weights on worker 0-0, policy_version 346332 (0.00087) [2022-07-09 17:22:54,504][26022] Updated weights on worker 0-0, policy_version 346342 (0.00085) [2022-07-09 17:22:56,478][26022] Updated weights on worker 0-0, policy_version 346352 (0.00083) [2022-07-09 17:22:56,552][25689] Fps is (10 sec: 5587.8, 60 sec: 5624.6, 300 sec: 5634.9). Total num frames: 354664448. Throughput: 0: 5924.4. Samples: 354671658. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:22:56,553][25689] Avg episode reward: [(0, '-46.991')] [2022-07-09 17:22:58,155][26022] Updated weights on worker 0-0, policy_version 346362 (0.00082) [2022-07-09 17:22:59,942][26022] Updated weights on worker 0-0, policy_version 346372 (0.00086) [2022-07-09 17:23:01,584][25689] Fps is (10 sec: 5572.6, 60 sec: 5623.9, 300 sec: 5638.5). Total num frames: 354693120. Throughput: 0: 5082.0. Samples: 354688680. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:23:01,585][25689] Avg episode reward: [(0, '-47.247')] [2022-07-09 17:23:02,223][26022] Updated weights on worker 0-0, policy_version 346382 (0.00902) [2022-07-09 17:23:03,753][26022] Updated weights on worker 0-0, policy_version 346392 (0.00089) [2022-07-09 17:23:05,831][26022] Updated weights on worker 0-0, policy_version 346402 (0.00097) [2022-07-09 17:23:06,659][25689] Fps is (10 sec: 5572.5, 60 sec: 5658.6, 300 sec: 5640.8). Total num frames: 354720768. Throughput: 0: 5825.8. Samples: 354720724. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:23:06,659][25689] Avg episode reward: [(0, '-46.917')] [2022-07-09 17:23:07,480][26022] Updated weights on worker 0-0, policy_version 346412 (0.00091) [2022-07-09 17:23:09,395][26022] Updated weights on worker 0-0, policy_version 346422 (0.00087) [2022-07-09 17:23:11,393][26022] Updated weights on worker 0-0, policy_version 346432 (0.00086) [2022-07-09 17:23:11,663][25689] Fps is (10 sec: 5385.0, 60 sec: 5609.2, 300 sec: 5630.9). Total num frames: 354747392. Throughput: 0: 5801.7. Samples: 354754502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:23:11,663][25689] Avg episode reward: [(0, '-48.414')] [2022-07-09 17:23:13,067][26022] Updated weights on worker 0-0, policy_version 346442 (0.00081) [2022-07-09 17:23:14,999][26022] Updated weights on worker 0-0, policy_version 346452 (0.00097) [2022-07-09 17:23:16,532][26022] Updated weights on worker 0-0, policy_version 346462 (0.00081) [2022-07-09 17:23:16,693][25689] Fps is (10 sec: 5613.1, 60 sec: 5624.3, 300 sec: 5637.4). Total num frames: 354777088. Throughput: 0: 4965.4. Samples: 354771664. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:23:16,693][25689] Avg episode reward: [(0, '-48.157')] [2022-07-09 17:23:16,972][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:23:16,987][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000346464_354779136.pth [2022-07-09 17:23:16,988][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000344479_352746496.pth [2022-07-09 17:23:18,420][26022] Updated weights on worker 0-0, policy_version 346472 (0.00090) [2022-07-09 17:23:20,220][26022] Updated weights on worker 0-0, policy_version 346482 (0.00090) [2022-07-09 17:23:21,723][25689] Fps is (10 sec: 5699.7, 60 sec: 5623.3, 300 sec: 5634.8). Total num frames: 354804736. Throughput: 0: 5810.3. Samples: 354805694. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 17:23:21,724][25689] Avg episode reward: [(0, '-48.009')] [2022-07-09 17:23:22,155][26022] Updated weights on worker 0-0, policy_version 346492 (0.00086) [2022-07-09 17:23:23,834][26022] Updated weights on worker 0-0, policy_version 346502 (0.00551) [2022-07-09 17:23:25,675][26022] Updated weights on worker 0-0, policy_version 346512 (0.00084) [2022-07-09 17:23:26,827][25689] Fps is (10 sec: 5557.2, 60 sec: 5622.2, 300 sec: 5633.7). Total num frames: 354833408. Throughput: 0: 5902.3. Samples: 354839764. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:26,828][25689] Avg episode reward: [(0, '-48.292')] [2022-07-09 17:23:27,276][26022] Updated weights on worker 0-0, policy_version 346522 (0.00085) [2022-07-09 17:23:29,456][26022] Updated weights on worker 0-0, policy_version 346532 (0.00086) [2022-07-09 17:23:30,952][26022] Updated weights on worker 0-0, policy_version 346542 (0.00091) [2022-07-09 17:23:31,890][25689] Fps is (10 sec: 5640.3, 60 sec: 5617.2, 300 sec: 5636.2). Total num frames: 354862080. Throughput: 0: 5056.7. Samples: 354856788. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:31,891][25689] Avg episode reward: [(0, '-48.307')] [2022-07-09 17:23:33,052][26022] Updated weights on worker 0-0, policy_version 346552 (0.00093) [2022-07-09 17:23:34,772][26022] Updated weights on worker 0-0, policy_version 346562 (0.00089) [2022-07-09 17:23:36,500][26022] Updated weights on worker 0-0, policy_version 346572 (0.00089) [2022-07-09 17:23:36,902][25689] Fps is (10 sec: 5793.6, 60 sec: 5634.5, 300 sec: 5636.2). Total num frames: 354891776. Throughput: 0: 5896.9. Samples: 354890836. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:36,903][25689] Avg episode reward: [(0, '-47.639')] [2022-07-09 17:23:38,246][26022] Updated weights on worker 0-0, policy_version 346582 (0.00085) [2022-07-09 17:23:40,285][26022] Updated weights on worker 0-0, policy_version 346592 (0.00836) [2022-07-09 17:23:41,997][25689] Fps is (10 sec: 5673.8, 60 sec: 5627.9, 300 sec: 5635.4). Total num frames: 354919424. Throughput: 0: 5863.3. Samples: 354924566. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:41,999][25689] Avg episode reward: [(0, '-47.054')] [2022-07-09 17:23:42,062][26022] Updated weights on worker 0-0, policy_version 346602 (0.00085) [2022-07-09 17:23:43,910][26022] Updated weights on worker 0-0, policy_version 346612 (0.00091) [2022-07-09 17:23:45,533][26022] Updated weights on worker 0-0, policy_version 346622 (0.00085) [2022-07-09 17:23:47,092][25689] Fps is (10 sec: 5526.9, 60 sec: 5608.3, 300 sec: 5635.1). Total num frames: 354948096. Throughput: 0: 5025.0. Samples: 354941594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:47,093][25689] Avg episode reward: [(0, '-47.387')] [2022-07-09 17:23:47,566][26022] Updated weights on worker 0-0, policy_version 346632 (0.00080) [2022-07-09 17:23:49,300][26022] Updated weights on worker 0-0, policy_version 346642 (0.00087) [2022-07-09 17:23:51,184][26022] Updated weights on worker 0-0, policy_version 346652 (0.00097) [2022-07-09 17:23:52,114][25689] Fps is (10 sec: 5769.6, 60 sec: 5623.6, 300 sec: 5635.6). Total num frames: 354977792. Throughput: 0: 5886.3. Samples: 354975828. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:52,114][25689] Avg episode reward: [(0, '-48.156')] [2022-07-09 17:23:52,932][26022] Updated weights on worker 0-0, policy_version 346662 (0.00085) [2022-07-09 17:23:54,668][26022] Updated weights on worker 0-0, policy_version 346672 (0.00086) [2022-07-09 17:23:56,527][26022] Updated weights on worker 0-0, policy_version 346682 (0.00095) [2022-07-09 17:23:57,116][25689] Fps is (10 sec: 5720.7, 60 sec: 5630.3, 300 sec: 5632.9). Total num frames: 355005440. Throughput: 0: 5883.3. Samples: 355009762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:23:57,117][25689] Avg episode reward: [(0, '-47.754')] [2022-07-09 17:23:58,199][26022] Updated weights on worker 0-0, policy_version 346692 (0.00089) [2022-07-09 17:23:59,984][26022] Updated weights on worker 0-0, policy_version 346702 (0.00092) [2022-07-09 17:24:02,115][26022] Updated weights on worker 0-0, policy_version 346712 (0.00095) [2022-07-09 17:24:02,191][25689] Fps is (10 sec: 5487.4, 60 sec: 5609.4, 300 sec: 5636.2). Total num frames: 355033088. Throughput: 0: 5815.1. Samples: 355041994. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:02,191][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 17:24:04,059][26022] Updated weights on worker 0-0, policy_version 346722 (0.00495) [2022-07-09 17:24:06,007][26022] Updated weights on worker 0-0, policy_version 346732 (0.00091) [2022-07-09 17:24:07,244][25689] Fps is (10 sec: 5459.9, 60 sec: 5611.4, 300 sec: 5632.6). Total num frames: 355060736. Throughput: 0: 5832.8. Samples: 355059134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:07,244][25689] Avg episode reward: [(0, '-49.095')] [2022-07-09 17:24:07,559][26022] Updated weights on worker 0-0, policy_version 346742 (0.00081) [2022-07-09 17:24:09,538][26022] Updated weights on worker 0-0, policy_version 346752 (0.00082) [2022-07-09 17:24:11,168][26022] Updated weights on worker 0-0, policy_version 346762 (0.00089) [2022-07-09 17:24:12,274][25689] Fps is (10 sec: 5585.3, 60 sec: 5642.7, 300 sec: 5632.4). Total num frames: 355089408. Throughput: 0: 5808.5. Samples: 355092930. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:12,275][25689] Avg episode reward: [(0, '-49.409')] [2022-07-09 17:24:13,328][26022] Updated weights on worker 0-0, policy_version 346772 (0.00091) [2022-07-09 17:24:14,840][26022] Updated weights on worker 0-0, policy_version 346782 (0.00101) [2022-07-09 17:24:16,759][26022] Updated weights on worker 0-0, policy_version 346792 (0.00093) [2022-07-09 17:24:17,289][25689] Fps is (10 sec: 5708.4, 60 sec: 5627.2, 300 sec: 5632.6). Total num frames: 355118080. Throughput: 0: 5823.8. Samples: 355127246. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:17,290][25689] Avg episode reward: [(0, '-48.795')] [2022-07-09 17:24:18,555][26022] Updated weights on worker 0-0, policy_version 346802 (0.00085) [2022-07-09 17:24:20,311][26022] Updated weights on worker 0-0, policy_version 346812 (0.00113) [2022-07-09 17:24:22,174][26022] Updated weights on worker 0-0, policy_version 346822 (0.00085) [2022-07-09 17:24:22,339][25689] Fps is (10 sec: 5697.7, 60 sec: 5642.4, 300 sec: 5633.9). Total num frames: 355146752. Throughput: 0: 5075.1. Samples: 355144248. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:22,340][25689] Avg episode reward: [(0, '-47.779')] [2022-07-09 17:24:24,046][26022] Updated weights on worker 0-0, policy_version 346832 (0.00080) [2022-07-09 17:24:25,843][26022] Updated weights on worker 0-0, policy_version 346842 (0.00084) [2022-07-09 17:24:27,411][25689] Fps is (10 sec: 5564.0, 60 sec: 5628.4, 300 sec: 5629.2). Total num frames: 355174400. Throughput: 0: 5897.6. Samples: 355178076. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:27,412][25689] Avg episode reward: [(0, '-48.158')] [2022-07-09 17:24:27,542][26022] Updated weights on worker 0-0, policy_version 346852 (0.00110) [2022-07-09 17:24:29,461][26022] Updated weights on worker 0-0, policy_version 346862 (0.00088) [2022-07-09 17:24:30,999][26022] Updated weights on worker 0-0, policy_version 346872 (0.00089) [2022-07-09 17:24:32,421][25689] Fps is (10 sec: 5585.8, 60 sec: 5633.3, 300 sec: 5632.8). Total num frames: 355203072. Throughput: 0: 5929.8. Samples: 355212400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:32,422][25689] Avg episode reward: [(0, '-47.926')] [2022-07-09 17:24:33,196][26022] Updated weights on worker 0-0, policy_version 346882 (0.00093) [2022-07-09 17:24:34,770][26022] Updated weights on worker 0-0, policy_version 346892 (0.00084) [2022-07-09 17:24:36,708][26022] Updated weights on worker 0-0, policy_version 346902 (0.00086) [2022-07-09 17:24:37,425][25689] Fps is (10 sec: 5931.1, 60 sec: 5651.0, 300 sec: 5639.9). Total num frames: 355233792. Throughput: 0: 5080.6. Samples: 355229548. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:37,426][25689] Avg episode reward: [(0, '-47.430')] [2022-07-09 17:24:38,500][26022] Updated weights on worker 0-0, policy_version 346912 (0.00089) [2022-07-09 17:24:40,148][26022] Updated weights on worker 0-0, policy_version 346922 (0.00092) [2022-07-09 17:24:41,951][26022] Updated weights on worker 0-0, policy_version 346932 (0.00090) [2022-07-09 17:24:42,447][25689] Fps is (10 sec: 5617.5, 60 sec: 5624.0, 300 sec: 5630.1). Total num frames: 355259392. Throughput: 0: 5947.2. Samples: 355263838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:42,448][25689] Avg episode reward: [(0, '-47.717')] [2022-07-09 17:24:43,683][26022] Updated weights on worker 0-0, policy_version 346942 (0.00083) [2022-07-09 17:24:45,562][26022] Updated weights on worker 0-0, policy_version 346952 (0.00087) [2022-07-09 17:24:47,371][26022] Updated weights on worker 0-0, policy_version 346962 (0.00101) [2022-07-09 17:24:47,513][25689] Fps is (10 sec: 5481.3, 60 sec: 5643.6, 300 sec: 5633.1). Total num frames: 355289088. Throughput: 0: 5972.5. Samples: 355298134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:47,514][25689] Avg episode reward: [(0, '-47.796')] [2022-07-09 17:24:49,354][26022] Updated weights on worker 0-0, policy_version 346972 (0.00085) [2022-07-09 17:24:51,068][26022] Updated weights on worker 0-0, policy_version 346982 (0.00084) [2022-07-09 17:24:52,541][25689] Fps is (10 sec: 5782.5, 60 sec: 5626.1, 300 sec: 5636.6). Total num frames: 355317760. Throughput: 0: 5103.1. Samples: 355315072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:52,542][25689] Avg episode reward: [(0, '-48.218')] [2022-07-09 17:24:52,970][26022] Updated weights on worker 0-0, policy_version 346992 (0.00092) [2022-07-09 17:24:54,483][26022] Updated weights on worker 0-0, policy_version 347002 (0.00079) [2022-07-09 17:24:56,418][26022] Updated weights on worker 0-0, policy_version 347012 (0.00095) [2022-07-09 17:24:57,543][25689] Fps is (10 sec: 5819.5, 60 sec: 5660.0, 300 sec: 5633.4). Total num frames: 355347456. Throughput: 0: 5960.9. Samples: 355349470. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:24:57,544][25689] Avg episode reward: [(0, '-48.485')] [2022-07-09 17:24:57,980][26022] Updated weights on worker 0-0, policy_version 347022 (0.00084) [2022-07-09 17:24:59,919][26022] Updated weights on worker 0-0, policy_version 347032 (0.00091) [2022-07-09 17:25:01,652][26022] Updated weights on worker 0-0, policy_version 347042 (0.00091) [2022-07-09 17:25:02,598][25689] Fps is (10 sec: 5498.6, 60 sec: 5628.0, 300 sec: 5634.9). Total num frames: 355373056. Throughput: 0: 5913.7. Samples: 355383000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:02,598][25689] Avg episode reward: [(0, '-47.548')] [2022-07-09 17:25:03,950][26022] Updated weights on worker 0-0, policy_version 347052 (0.00096) [2022-07-09 17:25:05,743][26022] Updated weights on worker 0-0, policy_version 347062 (0.00082) [2022-07-09 17:25:07,385][26022] Updated weights on worker 0-0, policy_version 347072 (0.00081) [2022-07-09 17:25:07,747][25689] Fps is (10 sec: 5519.3, 60 sec: 5669.8, 300 sec: 5639.6). Total num frames: 355403776. Throughput: 0: 4979.4. Samples: 355398886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:07,748][25689] Avg episode reward: [(0, '-47.977')] [2022-07-09 17:25:09,361][26022] Updated weights on worker 0-0, policy_version 347082 (0.00085) [2022-07-09 17:25:11,032][26022] Updated weights on worker 0-0, policy_version 347092 (0.00080) [2022-07-09 17:25:12,751][25689] Fps is (10 sec: 5647.7, 60 sec: 5638.4, 300 sec: 5630.9). Total num frames: 355430400. Throughput: 0: 5817.6. Samples: 355432646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:12,752][25689] Avg episode reward: [(0, '-48.440')] [2022-07-09 17:25:13,028][26022] Updated weights on worker 0-0, policy_version 347102 (0.00088) [2022-07-09 17:25:14,786][26022] Updated weights on worker 0-0, policy_version 347112 (0.00080) [2022-07-09 17:25:16,554][26022] Updated weights on worker 0-0, policy_version 347122 (0.00090) [2022-07-09 17:25:17,195][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:25:17,210][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000347126_355457024.pth [2022-07-09 17:25:17,211][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000345142_353425408.pth [2022-07-09 17:25:17,844][25689] Fps is (10 sec: 5578.1, 60 sec: 5648.1, 300 sec: 5633.4). Total num frames: 355460096. Throughput: 0: 5790.7. Samples: 355467026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:17,846][25689] Avg episode reward: [(0, '-47.567')] [2022-07-09 17:25:18,483][26022] Updated weights on worker 0-0, policy_version 347132 (0.00082) [2022-07-09 17:25:19,963][26022] Updated weights on worker 0-0, policy_version 347142 (0.00085) [2022-07-09 17:25:22,106][26022] Updated weights on worker 0-0, policy_version 347152 (0.00089) [2022-07-09 17:25:22,917][25689] Fps is (10 sec: 5842.2, 60 sec: 5662.7, 300 sec: 5636.8). Total num frames: 355489792. Throughput: 0: 4976.9. Samples: 355484128. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:22,918][25689] Avg episode reward: [(0, '-47.109')] [2022-07-09 17:25:23,591][26022] Updated weights on worker 0-0, policy_version 347162 (0.00087) [2022-07-09 17:25:25,641][26022] Updated weights on worker 0-0, policy_version 347172 (0.00086) [2022-07-09 17:25:27,371][26022] Updated weights on worker 0-0, policy_version 347182 (0.00090) [2022-07-09 17:25:28,063][25689] Fps is (10 sec: 5511.3, 60 sec: 5639.1, 300 sec: 5624.2). Total num frames: 355516416. Throughput: 0: 5871.8. Samples: 355518174. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:28,063][25689] Avg episode reward: [(0, '-47.512')] [2022-07-09 17:25:29,070][26022] Updated weights on worker 0-0, policy_version 347192 (0.00087) [2022-07-09 17:25:30,937][26022] Updated weights on worker 0-0, policy_version 347202 (0.00088) [2022-07-09 17:25:32,770][26022] Updated weights on worker 0-0, policy_version 347212 (0.00087) [2022-07-09 17:25:33,066][25689] Fps is (10 sec: 5549.3, 60 sec: 5656.6, 300 sec: 5636.1). Total num frames: 355546112. Throughput: 0: 5881.4. Samples: 355552126. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:33,067][25689] Avg episode reward: [(0, '-47.429')] [2022-07-09 17:25:34,452][26022] Updated weights on worker 0-0, policy_version 347222 (0.00088) [2022-07-09 17:25:36,332][26022] Updated weights on worker 0-0, policy_version 347232 (0.00089) [2022-07-09 17:25:38,071][26022] Updated weights on worker 0-0, policy_version 347242 (0.00099) [2022-07-09 17:25:38,161][25689] Fps is (10 sec: 5881.5, 60 sec: 5631.2, 300 sec: 5635.5). Total num frames: 355575808. Throughput: 0: 5035.6. Samples: 355569334. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:38,161][25689] Avg episode reward: [(0, '-46.781')] [2022-07-09 17:25:40,114][26022] Updated weights on worker 0-0, policy_version 347252 (0.00094) [2022-07-09 17:25:41,763][26022] Updated weights on worker 0-0, policy_version 347262 (0.00078) [2022-07-09 17:25:43,260][25689] Fps is (10 sec: 5625.4, 60 sec: 5657.8, 300 sec: 5634.6). Total num frames: 355603456. Throughput: 0: 5858.0. Samples: 355603296. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:43,260][25689] Avg episode reward: [(0, '-46.624')] [2022-07-09 17:25:43,630][26022] Updated weights on worker 0-0, policy_version 347272 (0.00099) [2022-07-09 17:25:45,446][26022] Updated weights on worker 0-0, policy_version 347282 (0.00085) [2022-07-09 17:25:47,399][26022] Updated weights on worker 0-0, policy_version 347292 (0.00093) [2022-07-09 17:25:48,318][25689] Fps is (10 sec: 5645.5, 60 sec: 5658.5, 300 sec: 5633.6). Total num frames: 355633152. Throughput: 0: 5873.1. Samples: 355637138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:48,319][25689] Avg episode reward: [(0, '-47.093')] [2022-07-09 17:25:49,092][26022] Updated weights on worker 0-0, policy_version 347302 (0.00092) [2022-07-09 17:25:51,076][26022] Updated weights on worker 0-0, policy_version 347312 (0.00089) [2022-07-09 17:25:52,723][26022] Updated weights on worker 0-0, policy_version 347322 (0.00093) [2022-07-09 17:25:53,391][25689] Fps is (10 sec: 5660.1, 60 sec: 5637.5, 300 sec: 5629.8). Total num frames: 355660800. Throughput: 0: 5023.9. Samples: 355654238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:53,392][25689] Avg episode reward: [(0, '-47.245')] [2022-07-09 17:25:54,473][26022] Updated weights on worker 0-0, policy_version 347332 (0.00091) [2022-07-09 17:25:56,309][26022] Updated weights on worker 0-0, policy_version 347342 (0.00085) [2022-07-09 17:25:58,012][26022] Updated weights on worker 0-0, policy_version 347352 (0.00086) [2022-07-09 17:25:58,434][25689] Fps is (10 sec: 5668.7, 60 sec: 5633.7, 300 sec: 5639.8). Total num frames: 355690496. Throughput: 0: 5877.2. Samples: 355688484. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:25:58,435][25689] Avg episode reward: [(0, '-46.468')] [2022-07-09 17:26:00,052][26022] Updated weights on worker 0-0, policy_version 347362 (0.00547) [2022-07-09 17:26:01,700][26022] Updated weights on worker 0-0, policy_version 347372 (0.00082) [2022-07-09 17:26:03,490][25689] Fps is (10 sec: 5576.7, 60 sec: 5650.3, 300 sec: 5634.5). Total num frames: 355717120. Throughput: 0: 5797.7. Samples: 355720586. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:03,491][25689] Avg episode reward: [(0, '-47.242')] [2022-07-09 17:26:03,890][26022] Updated weights on worker 0-0, policy_version 347382 (0.00097) [2022-07-09 17:26:05,494][26022] Updated weights on worker 0-0, policy_version 347392 (0.00086) [2022-07-09 17:26:07,403][26022] Updated weights on worker 0-0, policy_version 347402 (0.00085) [2022-07-09 17:26:08,553][25689] Fps is (10 sec: 5565.7, 60 sec: 5641.5, 300 sec: 5640.6). Total num frames: 355746816. Throughput: 0: 5831.6. Samples: 355755140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:08,554][25689] Avg episode reward: [(0, '-46.734')] [2022-07-09 17:26:09,098][26022] Updated weights on worker 0-0, policy_version 347412 (0.00086) [2022-07-09 17:26:10,956][26022] Updated weights on worker 0-0, policy_version 347422 (0.00089) [2022-07-09 17:26:12,885][26022] Updated weights on worker 0-0, policy_version 347432 (0.00087) [2022-07-09 17:26:13,585][25689] Fps is (10 sec: 5680.5, 60 sec: 5655.8, 300 sec: 5636.8). Total num frames: 355774464. Throughput: 0: 5840.2. Samples: 355772176. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:13,586][25689] Avg episode reward: [(0, '-46.401')] [2022-07-09 17:26:14,533][26022] Updated weights on worker 0-0, policy_version 347442 (0.00089) [2022-07-09 17:26:16,496][26022] Updated weights on worker 0-0, policy_version 347452 (0.00088) [2022-07-09 17:26:17,956][26022] Updated weights on worker 0-0, policy_version 347462 (0.00089) [2022-07-09 17:26:18,620][25689] Fps is (10 sec: 5594.5, 60 sec: 5644.3, 300 sec: 5638.1). Total num frames: 355803136. Throughput: 0: 5848.3. Samples: 355806540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:18,621][25689] Avg episode reward: [(0, '-46.558')] [2022-07-09 17:26:20,049][26022] Updated weights on worker 0-0, policy_version 347472 (0.00091) [2022-07-09 17:26:21,622][26022] Updated weights on worker 0-0, policy_version 347482 (0.00089) [2022-07-09 17:26:23,543][26022] Updated weights on worker 0-0, policy_version 347492 (0.00087) [2022-07-09 17:26:23,662][25689] Fps is (10 sec: 5792.2, 60 sec: 5647.2, 300 sec: 5639.0). Total num frames: 355832832. Throughput: 0: 5968.7. Samples: 355840986. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:23,663][25689] Avg episode reward: [(0, '-46.533')] [2022-07-09 17:26:25,334][26022] Updated weights on worker 0-0, policy_version 347502 (0.00085) [2022-07-09 17:26:27,343][26022] Updated weights on worker 0-0, policy_version 347512 (0.00311) [2022-07-09 17:26:28,780][25689] Fps is (10 sec: 5745.3, 60 sec: 5683.6, 300 sec: 5638.3). Total num frames: 355861504. Throughput: 0: 5082.6. Samples: 355857942. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:28,781][25689] Avg episode reward: [(0, '-46.483')] [2022-07-09 17:26:28,804][26022] Updated weights on worker 0-0, policy_version 347522 (0.00091) [2022-07-09 17:26:31,097][26022] Updated weights on worker 0-0, policy_version 347532 (0.00092) [2022-07-09 17:26:32,230][26022] Updated weights on worker 0-0, policy_version 347542 (0.00093) [2022-07-09 17:26:33,807][25689] Fps is (10 sec: 5450.8, 60 sec: 5630.7, 300 sec: 5631.4). Total num frames: 355888128. Throughput: 0: 5929.1. Samples: 355892072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:33,809][25689] Avg episode reward: [(0, '-46.421')] [2022-07-09 17:26:34,543][26022] Updated weights on worker 0-0, policy_version 347552 (0.00081) [2022-07-09 17:26:36,215][26022] Updated weights on worker 0-0, policy_version 347562 (0.00097) [2022-07-09 17:26:37,902][26022] Updated weights on worker 0-0, policy_version 347572 (0.00109) [2022-07-09 17:26:38,867][25689] Fps is (10 sec: 5583.1, 60 sec: 5633.9, 300 sec: 5634.2). Total num frames: 355917824. Throughput: 0: 5921.7. Samples: 355926436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:38,868][25689] Avg episode reward: [(0, '-46.722')] [2022-07-09 17:26:39,659][26022] Updated weights on worker 0-0, policy_version 347582 (0.00088) [2022-07-09 17:26:41,617][26022] Updated weights on worker 0-0, policy_version 347592 (0.00093) [2022-07-09 17:26:43,398][26022] Updated weights on worker 0-0, policy_version 347602 (0.00087) [2022-07-09 17:26:43,924][25689] Fps is (10 sec: 5870.5, 60 sec: 5671.6, 300 sec: 5641.9). Total num frames: 355947520. Throughput: 0: 5051.8. Samples: 355943344. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:43,925][25689] Avg episode reward: [(0, '-47.183')] [2022-07-09 17:26:45,435][26022] Updated weights on worker 0-0, policy_version 347612 (0.00089) [2022-07-09 17:26:46,894][26022] Updated weights on worker 0-0, policy_version 347622 (0.00091) [2022-07-09 17:26:49,027][25689] Fps is (10 sec: 5644.3, 60 sec: 5633.7, 300 sec: 5640.1). Total num frames: 355975168. Throughput: 0: 5880.5. Samples: 355977006. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:49,027][25689] Avg episode reward: [(0, '-46.545')] [2022-07-09 17:26:49,029][26022] Updated weights on worker 0-0, policy_version 347632 (0.00089) [2022-07-09 17:26:50,519][26022] Updated weights on worker 0-0, policy_version 347642 (0.00084) [2022-07-09 17:26:52,487][26022] Updated weights on worker 0-0, policy_version 347652 (0.00087) [2022-07-09 17:26:54,035][25689] Fps is (10 sec: 5671.7, 60 sec: 5673.6, 300 sec: 5640.3). Total num frames: 356004864. Throughput: 0: 5907.9. Samples: 356011574. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:54,035][25689] Avg episode reward: [(0, '-46.487')] [2022-07-09 17:26:54,380][26022] Updated weights on worker 0-0, policy_version 347662 (0.00089) [2022-07-09 17:26:56,079][26022] Updated weights on worker 0-0, policy_version 347672 (0.00096) [2022-07-09 17:26:57,842][26022] Updated weights on worker 0-0, policy_version 347682 (0.00087) [2022-07-09 17:26:59,046][25689] Fps is (10 sec: 5723.5, 60 sec: 5642.7, 300 sec: 5637.1). Total num frames: 356032512. Throughput: 0: 5070.7. Samples: 356028756. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:26:59,046][25689] Avg episode reward: [(0, '-47.045')] [2022-07-09 17:26:59,725][26022] Updated weights on worker 0-0, policy_version 347692 (0.00086) [2022-07-09 17:27:01,375][26022] Updated weights on worker 0-0, policy_version 347702 (0.00090) [2022-07-09 17:27:03,624][26022] Updated weights on worker 0-0, policy_version 347712 (0.00084) [2022-07-09 17:27:04,058][25689] Fps is (10 sec: 5414.7, 60 sec: 5646.9, 300 sec: 5641.9). Total num frames: 356059136. Throughput: 0: 5822.6. Samples: 356060574. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:04,058][25689] Avg episode reward: [(0, '-47.241')] [2022-07-09 17:27:05,368][26022] Updated weights on worker 0-0, policy_version 347722 (0.00081) [2022-07-09 17:27:07,259][26022] Updated weights on worker 0-0, policy_version 347732 (0.00094) [2022-07-09 17:27:09,056][26022] Updated weights on worker 0-0, policy_version 347742 (0.00104) [2022-07-09 17:27:09,127][25689] Fps is (10 sec: 5485.4, 60 sec: 5629.4, 300 sec: 5637.5). Total num frames: 356087808. Throughput: 0: 5864.9. Samples: 356094888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:09,127][25689] Avg episode reward: [(0, '-47.932')] [2022-07-09 17:27:10,718][26022] Updated weights on worker 0-0, policy_version 347752 (0.00082) [2022-07-09 17:27:12,617][26022] Updated weights on worker 0-0, policy_version 347762 (0.00085) [2022-07-09 17:27:14,173][25689] Fps is (10 sec: 5669.3, 60 sec: 5645.0, 300 sec: 5636.8). Total num frames: 356116480. Throughput: 0: 4986.6. Samples: 356111994. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:14,173][25689] Avg episode reward: [(0, '-47.122')] [2022-07-09 17:27:14,434][26022] Updated weights on worker 0-0, policy_version 347772 (0.00091) [2022-07-09 17:27:16,135][26022] Updated weights on worker 0-0, policy_version 347782 (0.00089) [2022-07-09 17:27:17,344][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:27:17,361][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000347789_356135936.pth [2022-07-09 17:27:17,362][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000345804_354103296.pth [2022-07-09 17:27:18,109][26022] Updated weights on worker 0-0, policy_version 347792 (0.00094) [2022-07-09 17:27:19,199][25689] Fps is (10 sec: 5795.0, 60 sec: 5662.8, 300 sec: 5643.6). Total num frames: 356146176. Throughput: 0: 5838.9. Samples: 356146426. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:19,199][25689] Avg episode reward: [(0, '-47.141')] [2022-07-09 17:27:19,756][26022] Updated weights on worker 0-0, policy_version 347802 (0.00088) [2022-07-09 17:27:21,576][26022] Updated weights on worker 0-0, policy_version 347812 (0.00085) [2022-07-09 17:27:23,259][26022] Updated weights on worker 0-0, policy_version 347822 (0.00085) [2022-07-09 17:27:24,223][25689] Fps is (10 sec: 5807.2, 60 sec: 5647.5, 300 sec: 5644.9). Total num frames: 356174848. Throughput: 0: 5968.2. Samples: 356180928. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:24,224][25689] Avg episode reward: [(0, '-46.250')] [2022-07-09 17:27:25,216][26022] Updated weights on worker 0-0, policy_version 347832 (0.00078) [2022-07-09 17:27:27,019][26022] Updated weights on worker 0-0, policy_version 347842 (0.00091) [2022-07-09 17:27:28,653][26022] Updated weights on worker 0-0, policy_version 347852 (0.00086) [2022-07-09 17:27:29,336][25689] Fps is (10 sec: 5656.6, 60 sec: 5647.9, 300 sec: 5642.9). Total num frames: 356203520. Throughput: 0: 5113.9. Samples: 356198238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:29,337][25689] Avg episode reward: [(0, '-46.603')] [2022-07-09 17:27:30,459][26022] Updated weights on worker 0-0, policy_version 347862 (0.00085) [2022-07-09 17:27:32,398][26022] Updated weights on worker 0-0, policy_version 347872 (0.00083) [2022-07-09 17:27:34,215][26022] Updated weights on worker 0-0, policy_version 347882 (0.00093) [2022-07-09 17:27:34,351][25689] Fps is (10 sec: 5661.9, 60 sec: 5682.8, 300 sec: 5642.9). Total num frames: 356232192. Throughput: 0: 5971.5. Samples: 356232492. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:34,352][25689] Avg episode reward: [(0, '-46.342')] [2022-07-09 17:27:35,825][26022] Updated weights on worker 0-0, policy_version 347892 (0.00408) [2022-07-09 17:27:37,864][26022] Updated weights on worker 0-0, policy_version 347902 (0.00091) [2022-07-09 17:27:39,428][25689] Fps is (10 sec: 5682.1, 60 sec: 5664.4, 300 sec: 5645.4). Total num frames: 356260864. Throughput: 0: 5947.0. Samples: 356266732. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:39,429][25689] Avg episode reward: [(0, '-45.574')] [2022-07-09 17:27:39,587][26022] Updated weights on worker 0-0, policy_version 347912 (0.00101) [2022-07-09 17:27:41,283][26022] Updated weights on worker 0-0, policy_version 347922 (0.00084) [2022-07-09 17:27:43,316][26022] Updated weights on worker 0-0, policy_version 347932 (0.00095) [2022-07-09 17:27:44,457][25689] Fps is (10 sec: 5674.4, 60 sec: 5650.1, 300 sec: 5642.6). Total num frames: 356289536. Throughput: 0: 5091.4. Samples: 356283946. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:44,458][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 17:27:44,816][26022] Updated weights on worker 0-0, policy_version 347942 (0.00088) [2022-07-09 17:27:47,055][26022] Updated weights on worker 0-0, policy_version 347952 (0.00094) [2022-07-09 17:27:48,479][26022] Updated weights on worker 0-0, policy_version 347962 (0.00085) [2022-07-09 17:27:49,564][25689] Fps is (10 sec: 5657.9, 60 sec: 5666.7, 300 sec: 5640.7). Total num frames: 356318208. Throughput: 0: 5908.5. Samples: 356317750. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:49,564][25689] Avg episode reward: [(0, '-46.897')] [2022-07-09 17:27:50,552][26022] Updated weights on worker 0-0, policy_version 347972 (0.00092) [2022-07-09 17:27:52,244][26022] Updated weights on worker 0-0, policy_version 347982 (0.00092) [2022-07-09 17:27:54,235][26022] Updated weights on worker 0-0, policy_version 347992 (0.00089) [2022-07-09 17:27:54,589][25689] Fps is (10 sec: 5558.6, 60 sec: 5631.1, 300 sec: 5641.6). Total num frames: 356345856. Throughput: 0: 5904.4. Samples: 356351984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:54,590][25689] Avg episode reward: [(0, '-47.530')] [2022-07-09 17:27:55,574][26022] Updated weights on worker 0-0, policy_version 348002 (0.00092) [2022-07-09 17:27:57,777][26022] Updated weights on worker 0-0, policy_version 348012 (0.00081) [2022-07-09 17:27:59,305][26022] Updated weights on worker 0-0, policy_version 348022 (0.00084) [2022-07-09 17:27:59,616][25689] Fps is (10 sec: 5704.7, 60 sec: 5663.6, 300 sec: 5645.2). Total num frames: 356375552. Throughput: 0: 5071.4. Samples: 356369110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:27:59,616][25689] Avg episode reward: [(0, '-47.187')] [2022-07-09 17:28:01,177][26022] Updated weights on worker 0-0, policy_version 348032 (0.00086) [2022-07-09 17:28:03,326][26022] Updated weights on worker 0-0, policy_version 348042 (0.00086) [2022-07-09 17:28:04,629][25689] Fps is (10 sec: 5609.5, 60 sec: 5663.4, 300 sec: 5642.9). Total num frames: 356402176. Throughput: 0: 5812.0. Samples: 356401186. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:28:04,631][25689] Avg episode reward: [(0, '-47.758')] [2022-07-09 17:28:05,108][26022] Updated weights on worker 0-0, policy_version 348052 (0.00093) [2022-07-09 17:28:07,111][26022] Updated weights on worker 0-0, policy_version 348062 (0.00094) [2022-07-09 17:28:08,775][26022] Updated weights on worker 0-0, policy_version 348072 (0.00089) [2022-07-09 17:28:09,671][25689] Fps is (10 sec: 5397.2, 60 sec: 5649.0, 300 sec: 5645.6). Total num frames: 356429824. Throughput: 0: 5852.2. Samples: 356435424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:28:09,672][25689] Avg episode reward: [(0, '-48.390')] [2022-07-09 17:28:10,594][26022] Updated weights on worker 0-0, policy_version 348082 (0.00091) [2022-07-09 17:28:12,339][26022] Updated weights on worker 0-0, policy_version 348092 (0.00084) [2022-07-09 17:28:14,023][26022] Updated weights on worker 0-0, policy_version 348102 (0.00081) [2022-07-09 17:28:14,713][25689] Fps is (10 sec: 5686.6, 60 sec: 5666.2, 300 sec: 5645.4). Total num frames: 356459520. Throughput: 0: 4994.6. Samples: 356452496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:28:14,719][25689] Avg episode reward: [(0, '-47.828')] [2022-07-09 17:28:16,091][26022] Updated weights on worker 0-0, policy_version 348112 (0.00098) [2022-07-09 17:28:17,770][26022] Updated weights on worker 0-0, policy_version 348122 (0.00091) [2022-07-09 17:28:19,543][26022] Updated weights on worker 0-0, policy_version 348132 (0.00092) [2022-07-09 17:28:19,723][25689] Fps is (10 sec: 5806.5, 60 sec: 5650.8, 300 sec: 5649.2). Total num frames: 356488192. Throughput: 0: 5849.7. Samples: 356486736. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:28:19,724][25689] Avg episode reward: [(0, '-47.544')] [2022-07-09 17:28:21,352][26022] Updated weights on worker 0-0, policy_version 348142 (0.00088) [2022-07-09 17:28:23,069][26022] Updated weights on worker 0-0, policy_version 348152 (0.00088) [2022-07-09 17:28:24,739][25689] Fps is (10 sec: 5515.6, 60 sec: 5617.9, 300 sec: 5644.0). Total num frames: 356514816. Throughput: 0: 5945.3. Samples: 356520742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 17:28:24,739][25689] Avg episode reward: [(0, '-46.971')] [2022-07-09 17:28:24,982][26022] Updated weights on worker 0-0, policy_version 348162 (0.00082) [2022-07-09 17:28:26,745][26022] Updated weights on worker 0-0, policy_version 348172 (0.00092) [2022-07-09 17:28:28,692][26022] Updated weights on worker 0-0, policy_version 348182 (0.00084) [2022-07-09 17:28:29,829][25689] Fps is (10 sec: 5573.3, 60 sec: 5636.9, 300 sec: 5646.9). Total num frames: 356544512. Throughput: 0: 5908.6. Samples: 356554528. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:28:29,829][25689] Avg episode reward: [(0, '-47.329')] [2022-07-09 17:28:30,381][26022] Updated weights on worker 0-0, policy_version 348192 (0.00086) [2022-07-09 17:28:32,284][26022] Updated weights on worker 0-0, policy_version 348202 (0.00087) [2022-07-09 17:28:34,014][26022] Updated weights on worker 0-0, policy_version 348212 (0.00814) [2022-07-09 17:28:34,876][25689] Fps is (10 sec: 5757.8, 60 sec: 5633.9, 300 sec: 5642.8). Total num frames: 356573184. Throughput: 0: 5904.1. Samples: 356571538. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:28:34,877][25689] Avg episode reward: [(0, '-48.184')] [2022-07-09 17:28:35,787][26022] Updated weights on worker 0-0, policy_version 348222 (0.00095) [2022-07-09 17:28:37,569][26022] Updated weights on worker 0-0, policy_version 348232 (0.00087) [2022-07-09 17:28:39,498][26022] Updated weights on worker 0-0, policy_version 348242 (0.00087) [2022-07-09 17:28:39,904][25689] Fps is (10 sec: 5793.4, 60 sec: 5655.4, 300 sec: 5651.0). Total num frames: 356602880. Throughput: 0: 5909.0. Samples: 356605982. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:28:39,904][25689] Avg episode reward: [(0, '-47.620')] [2022-07-09 17:28:41,145][26022] Updated weights on worker 0-0, policy_version 348252 (0.00092) [2022-07-09 17:28:43,123][26022] Updated weights on worker 0-0, policy_version 348262 (0.00089) [2022-07-09 17:28:44,679][26022] Updated weights on worker 0-0, policy_version 348272 (0.00086) [2022-07-09 17:28:44,978][25689] Fps is (10 sec: 5777.7, 60 sec: 5651.2, 300 sec: 5651.3). Total num frames: 356631552. Throughput: 0: 5905.5. Samples: 356640266. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:28:44,979][25689] Avg episode reward: [(0, '-47.537')] [2022-07-09 17:28:46,636][26022] Updated weights on worker 0-0, policy_version 348282 (0.00090) [2022-07-09 17:28:48,380][26022] Updated weights on worker 0-0, policy_version 348292 (0.00085) [2022-07-09 17:28:49,982][26022] Updated weights on worker 0-0, policy_version 348302 (0.00087) [2022-07-09 17:28:50,079][25689] Fps is (10 sec: 5736.4, 60 sec: 5668.7, 300 sec: 5649.8). Total num frames: 356661248. Throughput: 0: 5083.0. Samples: 356657458. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:28:50,079][25689] Avg episode reward: [(0, '-46.980')] [2022-07-09 17:28:52,055][26022] Updated weights on worker 0-0, policy_version 348312 (0.00086) [2022-07-09 17:28:53,771][26022] Updated weights on worker 0-0, policy_version 348322 (0.00090) [2022-07-09 17:28:55,090][25689] Fps is (10 sec: 5569.9, 60 sec: 5653.1, 300 sec: 5646.2). Total num frames: 356687872. Throughput: 0: 5922.5. Samples: 356691254. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:28:55,090][25689] Avg episode reward: [(0, '-46.924')] [2022-07-09 17:28:55,658][26022] Updated weights on worker 0-0, policy_version 348332 (0.00085) [2022-07-09 17:28:57,471][26022] Updated weights on worker 0-0, policy_version 348342 (0.00090) [2022-07-09 17:28:59,164][26022] Updated weights on worker 0-0, policy_version 348352 (0.00085) [2022-07-09 17:29:00,104][25689] Fps is (10 sec: 5719.7, 60 sec: 5671.1, 300 sec: 5657.7). Total num frames: 356718592. Throughput: 0: 5916.3. Samples: 356725494. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:00,106][25689] Avg episode reward: [(0, '-45.143')] [2022-07-09 17:29:00,925][26022] Updated weights on worker 0-0, policy_version 348362 (0.00082) [2022-07-09 17:29:03,195][26022] Updated weights on worker 0-0, policy_version 348372 (0.00093) [2022-07-09 17:29:05,096][26022] Updated weights on worker 0-0, policy_version 348382 (0.00094) [2022-07-09 17:29:05,194][25689] Fps is (10 sec: 5472.6, 60 sec: 5630.2, 300 sec: 5646.7). Total num frames: 356743168. Throughput: 0: 4963.6. Samples: 356740608. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:05,195][25689] Avg episode reward: [(0, '-44.199')] [2022-07-09 17:29:06,716][26022] Updated weights on worker 0-0, policy_version 348392 (0.00090) [2022-07-09 17:29:08,572][26022] Updated weights on worker 0-0, policy_version 348402 (0.00094) [2022-07-09 17:29:10,183][26022] Updated weights on worker 0-0, policy_version 348412 (0.00091) [2022-07-09 17:29:10,254][25689] Fps is (10 sec: 5448.2, 60 sec: 5679.3, 300 sec: 5653.0). Total num frames: 356773888. Throughput: 0: 5823.6. Samples: 356774950. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:10,254][25689] Avg episode reward: [(0, '-45.777')] [2022-07-09 17:29:12,250][26022] Updated weights on worker 0-0, policy_version 348422 (0.00097) [2022-07-09 17:29:13,852][26022] Updated weights on worker 0-0, policy_version 348432 (0.00091) [2022-07-09 17:29:15,272][25689] Fps is (10 sec: 5792.0, 60 sec: 5647.7, 300 sec: 5649.5). Total num frames: 356801536. Throughput: 0: 5840.9. Samples: 356809132. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:15,273][25689] Avg episode reward: [(0, '-46.516')] [2022-07-09 17:29:15,734][26022] Updated weights on worker 0-0, policy_version 348442 (0.00086) [2022-07-09 17:29:17,331][26022] Updated weights on worker 0-0, policy_version 348452 (0.00087) [2022-07-09 17:29:17,664][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:29:17,677][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000348453_356815872.pth [2022-07-09 17:29:17,678][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000346464_354779136.pth [2022-07-09 17:29:19,325][26022] Updated weights on worker 0-0, policy_version 348462 (0.00085) [2022-07-09 17:29:20,287][25689] Fps is (10 sec: 5613.3, 60 sec: 5647.2, 300 sec: 5650.2). Total num frames: 356830208. Throughput: 0: 5001.3. Samples: 356826434. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:20,289][25689] Avg episode reward: [(0, '-46.730')] [2022-07-09 17:29:21,027][26022] Updated weights on worker 0-0, policy_version 348472 (0.00085) [2022-07-09 17:29:22,943][26022] Updated weights on worker 0-0, policy_version 348482 (0.00086) [2022-07-09 17:29:24,679][26022] Updated weights on worker 0-0, policy_version 348492 (0.00086) [2022-07-09 17:29:25,289][25689] Fps is (10 sec: 5826.4, 60 sec: 5699.2, 300 sec: 5658.4). Total num frames: 356859904. Throughput: 0: 5985.0. Samples: 356860878. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:25,291][25689] Avg episode reward: [(0, '-46.335')] [2022-07-09 17:29:26,533][26022] Updated weights on worker 0-0, policy_version 348502 (0.00088) [2022-07-09 17:29:28,255][26022] Updated weights on worker 0-0, policy_version 348512 (0.00089) [2022-07-09 17:29:30,124][26022] Updated weights on worker 0-0, policy_version 348522 (0.00090) [2022-07-09 17:29:30,361][25689] Fps is (10 sec: 5692.2, 60 sec: 5667.1, 300 sec: 5653.8). Total num frames: 356887552. Throughput: 0: 5958.8. Samples: 356894766. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:30,362][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 17:29:31,869][26022] Updated weights on worker 0-0, policy_version 348532 (0.00088) [2022-07-09 17:29:33,747][26022] Updated weights on worker 0-0, policy_version 348542 (0.00082) [2022-07-09 17:29:35,420][25689] Fps is (10 sec: 5458.2, 60 sec: 5649.0, 300 sec: 5642.4). Total num frames: 356915200. Throughput: 0: 5090.4. Samples: 356911696. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:35,421][25689] Avg episode reward: [(0, '-47.348')] [2022-07-09 17:29:35,546][26022] Updated weights on worker 0-0, policy_version 348552 (0.00081) [2022-07-09 17:29:37,368][26022] Updated weights on worker 0-0, policy_version 348562 (0.00088) [2022-07-09 17:29:39,205][26022] Updated weights on worker 0-0, policy_version 348572 (0.00082) [2022-07-09 17:29:40,435][25689] Fps is (10 sec: 5590.5, 60 sec: 5633.3, 300 sec: 5652.8). Total num frames: 356943872. Throughput: 0: 5933.7. Samples: 356945988. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:40,436][25689] Avg episode reward: [(0, '-46.669')] [2022-07-09 17:29:40,786][26022] Updated weights on worker 0-0, policy_version 348582 (0.00090) [2022-07-09 17:29:42,931][26022] Updated weights on worker 0-0, policy_version 348592 (0.00087) [2022-07-09 17:29:44,366][26022] Updated weights on worker 0-0, policy_version 348602 (0.00094) [2022-07-09 17:29:45,450][25689] Fps is (10 sec: 5717.1, 60 sec: 5638.8, 300 sec: 5650.4). Total num frames: 356972544. Throughput: 0: 5906.5. Samples: 356979960. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:45,451][25689] Avg episode reward: [(0, '-46.008')] [2022-07-09 17:29:46,457][26022] Updated weights on worker 0-0, policy_version 348612 (0.00095) [2022-07-09 17:29:47,996][26022] Updated weights on worker 0-0, policy_version 348622 (0.00079) [2022-07-09 17:29:50,139][26022] Updated weights on worker 0-0, policy_version 348632 (0.00088) [2022-07-09 17:29:50,588][25689] Fps is (10 sec: 5648.1, 60 sec: 5618.4, 300 sec: 5648.3). Total num frames: 357001216. Throughput: 0: 5053.2. Samples: 356996978. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:50,591][25689] Avg episode reward: [(0, '-44.833')] [2022-07-09 17:29:51,684][26022] Updated weights on worker 0-0, policy_version 348642 (0.00095) [2022-07-09 17:29:53,791][26022] Updated weights on worker 0-0, policy_version 348652 (0.00089) [2022-07-09 17:29:55,266][26022] Updated weights on worker 0-0, policy_version 348662 (0.00089) [2022-07-09 17:29:55,643][25689] Fps is (10 sec: 5726.6, 60 sec: 5665.1, 300 sec: 5647.3). Total num frames: 357030912. Throughput: 0: 5890.6. Samples: 357030820. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:29:55,643][25689] Avg episode reward: [(0, '-45.665')] [2022-07-09 17:29:57,193][26022] Updated weights on worker 0-0, policy_version 348672 (0.00090) [2022-07-09 17:29:59,011][26022] Updated weights on worker 0-0, policy_version 348682 (0.00085) [2022-07-09 17:30:00,658][25689] Fps is (10 sec: 5796.6, 60 sec: 5631.3, 300 sec: 5658.4). Total num frames: 357059584. Throughput: 0: 5897.6. Samples: 357065250. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:00,658][25689] Avg episode reward: [(0, '-46.636')] [2022-07-09 17:30:00,763][26022] Updated weights on worker 0-0, policy_version 348692 (0.00089) [2022-07-09 17:30:02,856][26022] Updated weights on worker 0-0, policy_version 348702 (0.00093) [2022-07-09 17:30:04,719][26022] Updated weights on worker 0-0, policy_version 348712 (0.00093) [2022-07-09 17:30:05,705][25689] Fps is (10 sec: 5495.3, 60 sec: 5669.0, 300 sec: 5646.5). Total num frames: 357086208. Throughput: 0: 4967.4. Samples: 357080574. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:05,707][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 17:30:06,290][26022] Updated weights on worker 0-0, policy_version 348722 (0.00085) [2022-07-09 17:30:08,306][26022] Updated weights on worker 0-0, policy_version 348732 (0.00087) [2022-07-09 17:30:09,706][26022] Updated weights on worker 0-0, policy_version 348742 (0.00091) [2022-07-09 17:30:10,839][25689] Fps is (10 sec: 5531.7, 60 sec: 5645.2, 300 sec: 5654.4). Total num frames: 357115904. Throughput: 0: 5848.3. Samples: 357115410. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:10,839][25689] Avg episode reward: [(0, '-46.118')] [2022-07-09 17:30:11,840][26022] Updated weights on worker 0-0, policy_version 348752 (0.00091) [2022-07-09 17:30:13,466][26022] Updated weights on worker 0-0, policy_version 348762 (0.00507) [2022-07-09 17:30:15,407][26022] Updated weights on worker 0-0, policy_version 348772 (0.00093) [2022-07-09 17:30:15,883][25689] Fps is (10 sec: 5634.3, 60 sec: 5642.8, 300 sec: 5648.4). Total num frames: 357143552. Throughput: 0: 5868.9. Samples: 357149606. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:15,883][25689] Avg episode reward: [(0, '-46.740')] [2022-07-09 17:30:17,256][26022] Updated weights on worker 0-0, policy_version 348782 (0.00090) [2022-07-09 17:30:19,146][26022] Updated weights on worker 0-0, policy_version 348792 (0.00093) [2022-07-09 17:30:20,754][26022] Updated weights on worker 0-0, policy_version 348802 (0.00093) [2022-07-09 17:30:20,964][25689] Fps is (10 sec: 5865.9, 60 sec: 5687.3, 300 sec: 5655.2). Total num frames: 357175296. Throughput: 0: 4993.4. Samples: 357166648. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:20,964][25689] Avg episode reward: [(0, '-46.834')] [2022-07-09 17:30:22,633][26022] Updated weights on worker 0-0, policy_version 348812 (0.00087) [2022-07-09 17:30:24,298][26022] Updated weights on worker 0-0, policy_version 348822 (0.00083) [2022-07-09 17:30:26,010][25689] Fps is (10 sec: 5763.8, 60 sec: 5632.7, 300 sec: 5657.1). Total num frames: 357201920. Throughput: 0: 5920.2. Samples: 357200780. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:26,010][25689] Avg episode reward: [(0, '-45.701')] [2022-07-09 17:30:26,330][26022] Updated weights on worker 0-0, policy_version 348832 (0.00081) [2022-07-09 17:30:28,022][26022] Updated weights on worker 0-0, policy_version 348842 (0.00103) [2022-07-09 17:30:29,900][26022] Updated weights on worker 0-0, policy_version 348852 (0.00095) [2022-07-09 17:30:31,120][25689] Fps is (10 sec: 5545.3, 60 sec: 5662.8, 300 sec: 5655.0). Total num frames: 357231616. Throughput: 0: 5890.7. Samples: 357234880. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:31,122][25689] Avg episode reward: [(0, '-46.274')] [2022-07-09 17:30:31,687][26022] Updated weights on worker 0-0, policy_version 348862 (0.00096) [2022-07-09 17:30:33,506][26022] Updated weights on worker 0-0, policy_version 348872 (0.00093) [2022-07-09 17:30:35,112][26022] Updated weights on worker 0-0, policy_version 348882 (0.00089) [2022-07-09 17:30:36,160][25689] Fps is (10 sec: 5649.7, 60 sec: 5664.6, 300 sec: 5649.2). Total num frames: 357259264. Throughput: 0: 5900.1. Samples: 357269240. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:36,161][25689] Avg episode reward: [(0, '-46.274')] [2022-07-09 17:30:36,975][26022] Updated weights on worker 0-0, policy_version 348892 (0.00089) [2022-07-09 17:30:38,787][26022] Updated weights on worker 0-0, policy_version 348902 (0.00084) [2022-07-09 17:30:40,576][26022] Updated weights on worker 0-0, policy_version 348912 (0.00089) [2022-07-09 17:30:41,246][25689] Fps is (10 sec: 5764.1, 60 sec: 5691.6, 300 sec: 5659.7). Total num frames: 357289984. Throughput: 0: 5906.4. Samples: 357286444. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:41,247][25689] Avg episode reward: [(0, '-45.603')] [2022-07-09 17:30:42,250][26022] Updated weights on worker 0-0, policy_version 348922 (0.00086) [2022-07-09 17:30:44,108][26022] Updated weights on worker 0-0, policy_version 348932 (0.00094) [2022-07-09 17:30:45,842][26022] Updated weights on worker 0-0, policy_version 348942 (0.00081) [2022-07-09 17:30:46,266][25689] Fps is (10 sec: 5876.8, 60 sec: 5691.2, 300 sec: 5657.1). Total num frames: 357318656. Throughput: 0: 5937.5. Samples: 357321052. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:46,266][25689] Avg episode reward: [(0, '-44.801')] [2022-07-09 17:30:47,715][26022] Updated weights on worker 0-0, policy_version 348952 (0.00085) [2022-07-09 17:30:49,560][26022] Updated weights on worker 0-0, policy_version 348962 (0.00088) [2022-07-09 17:30:51,303][26022] Updated weights on worker 0-0, policy_version 348972 (0.00096) [2022-07-09 17:30:51,322][25689] Fps is (10 sec: 5691.2, 60 sec: 5698.8, 300 sec: 5660.8). Total num frames: 357347328. Throughput: 0: 5955.8. Samples: 357355200. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:51,323][25689] Avg episode reward: [(0, '-45.700')] [2022-07-09 17:30:53,211][26022] Updated weights on worker 0-0, policy_version 348982 (0.00102) [2022-07-09 17:30:54,803][26022] Updated weights on worker 0-0, policy_version 348992 (0.00089) [2022-07-09 17:30:56,325][25689] Fps is (10 sec: 5598.8, 60 sec: 5669.9, 300 sec: 5654.7). Total num frames: 357374976. Throughput: 0: 5117.4. Samples: 357372436. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:30:56,326][25689] Avg episode reward: [(0, '-45.265')] [2022-07-09 17:30:56,768][26022] Updated weights on worker 0-0, policy_version 349002 (0.00093) [2022-07-09 17:30:58,472][26022] Updated weights on worker 0-0, policy_version 349012 (0.00087) [2022-07-09 17:31:00,295][26022] Updated weights on worker 0-0, policy_version 349022 (0.00081) [2022-07-09 17:31:01,384][25689] Fps is (10 sec: 5699.0, 60 sec: 5682.6, 300 sec: 5664.9). Total num frames: 357404672. Throughput: 0: 5965.0. Samples: 357406566. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-09 17:31:01,385][25689] Avg episode reward: [(0, '-44.914')] [2022-07-09 17:31:02,499][26022] Updated weights on worker 0-0, policy_version 349032 (0.01039) [2022-07-09 17:31:04,272][26022] Updated weights on worker 0-0, policy_version 349042 (0.00085) [2022-07-09 17:31:06,259][26022] Updated weights on worker 0-0, policy_version 349052 (0.00085) [2022-07-09 17:31:06,453][25689] Fps is (10 sec: 5459.7, 60 sec: 5663.8, 300 sec: 5651.1). Total num frames: 357430272. Throughput: 0: 5824.0. Samples: 357438622. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:06,454][25689] Avg episode reward: [(0, '-45.736')] [2022-07-09 17:31:07,851][26022] Updated weights on worker 0-0, policy_version 349062 (0.00082) [2022-07-09 17:31:09,817][26022] Updated weights on worker 0-0, policy_version 349072 (0.00082) [2022-07-09 17:31:11,432][26022] Updated weights on worker 0-0, policy_version 349082 (0.00087) [2022-07-09 17:31:11,530][25689] Fps is (10 sec: 5450.4, 60 sec: 5669.1, 300 sec: 5657.1). Total num frames: 357459968. Throughput: 0: 4967.9. Samples: 357455588. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:11,530][25689] Avg episode reward: [(0, '-45.959')] [2022-07-09 17:31:13,458][26022] Updated weights on worker 0-0, policy_version 349092 (0.00082) [2022-07-09 17:31:15,003][26022] Updated weights on worker 0-0, policy_version 349102 (0.00083) [2022-07-09 17:31:16,546][25689] Fps is (10 sec: 5681.8, 60 sec: 5671.8, 300 sec: 5654.0). Total num frames: 357487616. Throughput: 0: 5816.4. Samples: 357490046. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:16,546][25689] Avg episode reward: [(0, '-46.668')] [2022-07-09 17:31:16,883][26022] Updated weights on worker 0-0, policy_version 349112 (0.00091) [2022-07-09 17:31:17,936][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:31:17,946][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000349118_357496832.pth [2022-07-09 17:31:17,947][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000347126_355457024.pth [2022-07-09 17:31:18,613][26022] Updated weights on worker 0-0, policy_version 349122 (0.00093) [2022-07-09 17:31:20,507][26022] Updated weights on worker 0-0, policy_version 349132 (0.00084) [2022-07-09 17:31:21,550][25689] Fps is (10 sec: 5722.5, 60 sec: 5645.1, 300 sec: 5654.7). Total num frames: 357517312. Throughput: 0: 5838.4. Samples: 357524304. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:21,551][25689] Avg episode reward: [(0, '-46.782')] [2022-07-09 17:31:22,283][26022] Updated weights on worker 0-0, policy_version 349142 (0.00087) [2022-07-09 17:31:24,147][26022] Updated weights on worker 0-0, policy_version 349152 (0.00104) [2022-07-09 17:31:25,873][26022] Updated weights on worker 0-0, policy_version 349162 (0.00089) [2022-07-09 17:31:26,610][25689] Fps is (10 sec: 5799.4, 60 sec: 5677.6, 300 sec: 5655.8). Total num frames: 357545984. Throughput: 0: 5097.0. Samples: 357541362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:26,611][25689] Avg episode reward: [(0, '-47.616')] [2022-07-09 17:31:27,871][26022] Updated weights on worker 0-0, policy_version 349172 (0.00090) [2022-07-09 17:31:29,478][26022] Updated weights on worker 0-0, policy_version 349182 (0.00087) [2022-07-09 17:31:31,421][26022] Updated weights on worker 0-0, policy_version 349192 (0.00452) [2022-07-09 17:31:31,651][25689] Fps is (10 sec: 5576.1, 60 sec: 5650.3, 300 sec: 5659.0). Total num frames: 357573632. Throughput: 0: 5957.4. Samples: 357575458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:31,651][25689] Avg episode reward: [(0, '-47.052')] [2022-07-09 17:31:33,131][26022] Updated weights on worker 0-0, policy_version 349202 (0.00090) [2022-07-09 17:31:34,914][26022] Updated weights on worker 0-0, policy_version 349212 (0.00096) [2022-07-09 17:31:36,655][25689] Fps is (10 sec: 5606.5, 60 sec: 5670.5, 300 sec: 5656.6). Total num frames: 357602304. Throughput: 0: 5945.8. Samples: 357609616. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:36,656][25689] Avg episode reward: [(0, '-46.908')] [2022-07-09 17:31:36,684][26022] Updated weights on worker 0-0, policy_version 349222 (0.00089) [2022-07-09 17:31:38,599][26022] Updated weights on worker 0-0, policy_version 349232 (0.00082) [2022-07-09 17:31:40,106][26022] Updated weights on worker 0-0, policy_version 349242 (0.00090) [2022-07-09 17:31:41,675][25689] Fps is (10 sec: 5720.7, 60 sec: 5642.9, 300 sec: 5653.9). Total num frames: 357630976. Throughput: 0: 5100.0. Samples: 357626938. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:41,675][25689] Avg episode reward: [(0, '-46.352')] [2022-07-09 17:31:42,116][26022] Updated weights on worker 0-0, policy_version 349252 (0.00096) [2022-07-09 17:31:43,864][26022] Updated weights on worker 0-0, policy_version 349262 (0.00085) [2022-07-09 17:31:45,511][26022] Updated weights on worker 0-0, policy_version 349272 (0.00088) [2022-07-09 17:31:46,677][25689] Fps is (10 sec: 5824.3, 60 sec: 5661.5, 300 sec: 5662.7). Total num frames: 357660672. Throughput: 0: 5975.4. Samples: 357661270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:46,677][25689] Avg episode reward: [(0, '-46.573')] [2022-07-09 17:31:47,639][26022] Updated weights on worker 0-0, policy_version 349282 (0.00083) [2022-07-09 17:31:49,246][26022] Updated weights on worker 0-0, policy_version 349292 (0.00114) [2022-07-09 17:31:51,175][26022] Updated weights on worker 0-0, policy_version 349302 (0.00092) [2022-07-09 17:31:51,717][25689] Fps is (10 sec: 5812.2, 60 sec: 5663.0, 300 sec: 5658.6). Total num frames: 357689344. Throughput: 0: 5959.4. Samples: 357695042. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:51,718][25689] Avg episode reward: [(0, '-46.725')] [2022-07-09 17:31:52,887][26022] Updated weights on worker 0-0, policy_version 349312 (0.00090) [2022-07-09 17:31:54,727][26022] Updated weights on worker 0-0, policy_version 349322 (0.00091) [2022-07-09 17:31:56,506][26022] Updated weights on worker 0-0, policy_version 349332 (0.00093) [2022-07-09 17:31:56,725][25689] Fps is (10 sec: 5604.8, 60 sec: 5662.5, 300 sec: 5658.7). Total num frames: 357716992. Throughput: 0: 5102.1. Samples: 357712016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:31:56,726][25689] Avg episode reward: [(0, '-46.917')] [2022-07-09 17:31:58,290][26022] Updated weights on worker 0-0, policy_version 349342 (0.00092) [2022-07-09 17:31:59,989][26022] Updated weights on worker 0-0, policy_version 349352 (0.00084) [2022-07-09 17:32:01,728][25689] Fps is (10 sec: 5523.5, 60 sec: 5633.8, 300 sec: 5662.3). Total num frames: 357744640. Throughput: 0: 5947.6. Samples: 357746208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:01,729][25689] Avg episode reward: [(0, '-48.163')] [2022-07-09 17:32:02,421][26022] Updated weights on worker 0-0, policy_version 349362 (0.00092) [2022-07-09 17:32:03,898][26022] Updated weights on worker 0-0, policy_version 349372 (0.00084) [2022-07-09 17:32:06,008][26022] Updated weights on worker 0-0, policy_version 349382 (0.00083) [2022-07-09 17:32:06,734][25689] Fps is (10 sec: 5422.6, 60 sec: 5656.7, 300 sec: 5656.6). Total num frames: 357771264. Throughput: 0: 5851.5. Samples: 357778634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:06,734][25689] Avg episode reward: [(0, '-48.209')] [2022-07-09 17:32:07,733][26022] Updated weights on worker 0-0, policy_version 349392 (0.00086) [2022-07-09 17:32:09,349][26022] Updated weights on worker 0-0, policy_version 349402 (0.00081) [2022-07-09 17:32:11,284][26022] Updated weights on worker 0-0, policy_version 349412 (0.00093) [2022-07-09 17:32:11,781][25689] Fps is (10 sec: 5602.5, 60 sec: 5659.5, 300 sec: 5660.1). Total num frames: 357800960. Throughput: 0: 5028.6. Samples: 357795934. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:11,781][25689] Avg episode reward: [(0, '-47.696')] [2022-07-09 17:32:13,019][26022] Updated weights on worker 0-0, policy_version 349422 (0.00085) [2022-07-09 17:32:14,735][26022] Updated weights on worker 0-0, policy_version 349432 (0.00084) [2022-07-09 17:32:16,803][25689] Fps is (10 sec: 5593.5, 60 sec: 5642.0, 300 sec: 5649.8). Total num frames: 357827584. Throughput: 0: 5887.6. Samples: 357830222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:16,803][25689] Avg episode reward: [(0, '-46.995')] [2022-07-09 17:32:16,899][26022] Updated weights on worker 0-0, policy_version 349442 (0.00086) [2022-07-09 17:32:18,315][26022] Updated weights on worker 0-0, policy_version 349452 (0.00086) [2022-07-09 17:32:20,227][26022] Updated weights on worker 0-0, policy_version 349462 (0.00091) [2022-07-09 17:32:21,826][25689] Fps is (10 sec: 5708.7, 60 sec: 5657.2, 300 sec: 5656.7). Total num frames: 357858304. Throughput: 0: 5880.7. Samples: 357864396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:21,826][25689] Avg episode reward: [(0, '-47.242')] [2022-07-09 17:32:21,859][26022] Updated weights on worker 0-0, policy_version 349472 (0.00091) [2022-07-09 17:32:23,897][26022] Updated weights on worker 0-0, policy_version 349482 (0.00078) [2022-07-09 17:32:25,631][26022] Updated weights on worker 0-0, policy_version 349492 (0.00089) [2022-07-09 17:32:26,829][25689] Fps is (10 sec: 5923.5, 60 sec: 5662.5, 300 sec: 5658.8). Total num frames: 357886976. Throughput: 0: 5121.7. Samples: 357881556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:26,830][25689] Avg episode reward: [(0, '-46.729')] [2022-07-09 17:32:27,666][26022] Updated weights on worker 0-0, policy_version 349502 (0.00093) [2022-07-09 17:32:29,111][26022] Updated weights on worker 0-0, policy_version 349512 (0.00087) [2022-07-09 17:32:31,247][26022] Updated weights on worker 0-0, policy_version 349522 (0.00085) [2022-07-09 17:32:31,909][25689] Fps is (10 sec: 5687.1, 60 sec: 5675.8, 300 sec: 5657.6). Total num frames: 357915648. Throughput: 0: 5944.1. Samples: 357915578. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:31,909][25689] Avg episode reward: [(0, '-47.075')] [2022-07-09 17:32:32,672][26022] Updated weights on worker 0-0, policy_version 349532 (0.00091) [2022-07-09 17:32:34,635][26022] Updated weights on worker 0-0, policy_version 349542 (0.00093) [2022-07-09 17:32:36,351][26022] Updated weights on worker 0-0, policy_version 349552 (0.00085) [2022-07-09 17:32:36,914][25689] Fps is (10 sec: 5584.5, 60 sec: 5658.8, 300 sec: 5655.5). Total num frames: 357943296. Throughput: 0: 5961.0. Samples: 357950106. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:36,915][25689] Avg episode reward: [(0, '-46.393')] [2022-07-09 17:32:38,249][26022] Updated weights on worker 0-0, policy_version 349562 (0.00087) [2022-07-09 17:32:40,073][26022] Updated weights on worker 0-0, policy_version 349572 (0.00083) [2022-07-09 17:32:41,958][25689] Fps is (10 sec: 5502.8, 60 sec: 5639.5, 300 sec: 5651.8). Total num frames: 357970944. Throughput: 0: 5103.5. Samples: 357967140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:41,958][25689] Avg episode reward: [(0, '-46.291')] [2022-07-09 17:32:41,961][26022] Updated weights on worker 0-0, policy_version 349582 (0.00086) [2022-07-09 17:32:43,493][26022] Updated weights on worker 0-0, policy_version 349592 (0.00087) [2022-07-09 17:32:45,360][26022] Updated weights on worker 0-0, policy_version 349602 (0.00098) [2022-07-09 17:32:46,963][25689] Fps is (10 sec: 5808.2, 60 sec: 5656.2, 300 sec: 5660.6). Total num frames: 358001664. Throughput: 0: 5953.9. Samples: 358001434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:46,965][25689] Avg episode reward: [(0, '-45.866')] [2022-07-09 17:32:47,072][26022] Updated weights on worker 0-0, policy_version 349612 (0.00089) [2022-07-09 17:32:49,150][26022] Updated weights on worker 0-0, policy_version 349622 (0.00087) [2022-07-09 17:32:50,728][26022] Updated weights on worker 0-0, policy_version 349632 (0.00081) [2022-07-09 17:32:52,061][25689] Fps is (10 sec: 5675.6, 60 sec: 5616.8, 300 sec: 5655.8). Total num frames: 358028288. Throughput: 0: 5944.9. Samples: 358035382. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:52,062][25689] Avg episode reward: [(0, '-45.037')] [2022-07-09 17:32:52,822][26022] Updated weights on worker 0-0, policy_version 349642 (0.00094) [2022-07-09 17:32:54,096][26022] Updated weights on worker 0-0, policy_version 349652 (0.00095) [2022-07-09 17:32:56,189][26022] Updated weights on worker 0-0, policy_version 349662 (0.00087) [2022-07-09 17:32:57,079][25689] Fps is (10 sec: 5567.8, 60 sec: 5649.9, 300 sec: 5655.9). Total num frames: 358057984. Throughput: 0: 5077.3. Samples: 358052488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:32:57,080][25689] Avg episode reward: [(0, '-44.926')] [2022-07-09 17:32:57,852][26022] Updated weights on worker 0-0, policy_version 349672 (0.00086) [2022-07-09 17:32:59,857][26022] Updated weights on worker 0-0, policy_version 349682 (0.00084) [2022-07-09 17:33:01,648][26022] Updated weights on worker 0-0, policy_version 349692 (0.00080) [2022-07-09 17:33:02,163][25689] Fps is (10 sec: 5676.8, 60 sec: 5642.3, 300 sec: 5658.0). Total num frames: 358085632. Throughput: 0: 5923.4. Samples: 358086822. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:02,163][25689] Avg episode reward: [(0, '-45.151')] [2022-07-09 17:33:03,724][26022] Updated weights on worker 0-0, policy_version 349702 (0.00083) [2022-07-09 17:33:05,487][26022] Updated weights on worker 0-0, policy_version 349712 (0.00097) [2022-07-09 17:33:07,168][25689] Fps is (10 sec: 5582.1, 60 sec: 5676.3, 300 sec: 5662.2). Total num frames: 358114304. Throughput: 0: 5819.4. Samples: 358119012. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:07,170][25689] Avg episode reward: [(0, '-45.033')] [2022-07-09 17:33:07,253][26022] Updated weights on worker 0-0, policy_version 349722 (0.00090) [2022-07-09 17:33:09,163][26022] Updated weights on worker 0-0, policy_version 349732 (0.00093) [2022-07-09 17:33:10,919][26022] Updated weights on worker 0-0, policy_version 349742 (0.00093) [2022-07-09 17:33:12,288][25689] Fps is (10 sec: 5562.1, 60 sec: 5635.5, 300 sec: 5653.8). Total num frames: 358141952. Throughput: 0: 4976.2. Samples: 358136036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:12,289][25689] Avg episode reward: [(0, '-45.731')] [2022-07-09 17:33:12,786][26022] Updated weights on worker 0-0, policy_version 349752 (0.00089) [2022-07-09 17:33:14,458][26022] Updated weights on worker 0-0, policy_version 349762 (0.00085) [2022-07-09 17:33:16,359][26022] Updated weights on worker 0-0, policy_version 349772 (0.00084) [2022-07-09 17:33:17,307][25689] Fps is (10 sec: 5656.1, 60 sec: 5686.6, 300 sec: 5657.1). Total num frames: 358171648. Throughput: 0: 5812.9. Samples: 358170070. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:17,307][25689] Avg episode reward: [(0, '-45.610')] [2022-07-09 17:33:18,053][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:33:18,062][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000349781_358175744.pth [2022-07-09 17:33:18,064][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000347789_356135936.pth [2022-07-09 17:33:18,231][26022] Updated weights on worker 0-0, policy_version 349782 (0.00096) [2022-07-09 17:33:19,743][26022] Updated weights on worker 0-0, policy_version 349792 (0.00086) [2022-07-09 17:33:21,796][26022] Updated weights on worker 0-0, policy_version 349802 (0.00095) [2022-07-09 17:33:22,325][25689] Fps is (10 sec: 5713.7, 60 sec: 5636.3, 300 sec: 5660.5). Total num frames: 358199296. Throughput: 0: 5830.3. Samples: 358204372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:22,325][25689] Avg episode reward: [(0, '-45.824')] [2022-07-09 17:33:23,537][26022] Updated weights on worker 0-0, policy_version 349812 (0.00092) [2022-07-09 17:33:25,409][26022] Updated weights on worker 0-0, policy_version 349822 (0.00084) [2022-07-09 17:33:27,350][25689] Fps is (10 sec: 5505.7, 60 sec: 5617.4, 300 sec: 5654.8). Total num frames: 358226944. Throughput: 0: 5074.1. Samples: 358221416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:27,350][25689] Avg episode reward: [(0, '-46.776')] [2022-07-09 17:33:27,395][26022] Updated weights on worker 0-0, policy_version 349832 (0.00077) [2022-07-09 17:33:29,086][26022] Updated weights on worker 0-0, policy_version 349842 (0.00086) [2022-07-09 17:33:30,797][26022] Updated weights on worker 0-0, policy_version 349852 (0.00090) [2022-07-09 17:33:32,396][25689] Fps is (10 sec: 5694.1, 60 sec: 5637.5, 300 sec: 5658.3). Total num frames: 358256640. Throughput: 0: 5927.2. Samples: 358255216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 17:33:32,396][25689] Avg episode reward: [(0, '-46.920')] [2022-07-09 17:33:32,718][26022] Updated weights on worker 0-0, policy_version 349862 (0.00091) [2022-07-09 17:33:34,411][26022] Updated weights on worker 0-0, policy_version 349872 (0.00085) [2022-07-09 17:33:36,249][26022] Updated weights on worker 0-0, policy_version 349882 (0.00616) [2022-07-09 17:33:37,460][25689] Fps is (10 sec: 5773.0, 60 sec: 5648.8, 300 sec: 5654.2). Total num frames: 358285312. Throughput: 0: 5915.3. Samples: 358289286. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:33:37,461][25689] Avg episode reward: [(0, '-46.119')] [2022-07-09 17:33:38,062][26022] Updated weights on worker 0-0, policy_version 349892 (0.00052) [2022-07-09 17:33:39,902][26022] Updated weights on worker 0-0, policy_version 349902 (0.00085) [2022-07-09 17:33:41,835][26022] Updated weights on worker 0-0, policy_version 349912 (0.00087) [2022-07-09 17:33:42,560][25689] Fps is (10 sec: 5641.8, 60 sec: 5660.5, 300 sec: 5653.7). Total num frames: 358313984. Throughput: 0: 5877.9. Samples: 358323312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:33:42,560][25689] Avg episode reward: [(0, '-46.665')] [2022-07-09 17:33:43,387][26022] Updated weights on worker 0-0, policy_version 349922 (0.00084) [2022-07-09 17:33:45,487][26022] Updated weights on worker 0-0, policy_version 349932 (0.00088) [2022-07-09 17:33:47,071][26022] Updated weights on worker 0-0, policy_version 349942 (0.00098) [2022-07-09 17:33:47,578][25689] Fps is (10 sec: 5769.1, 60 sec: 5642.5, 300 sec: 5655.3). Total num frames: 358343680. Throughput: 0: 5876.7. Samples: 358340288. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:33:47,578][25689] Avg episode reward: [(0, '-46.668')] [2022-07-09 17:33:48,874][26022] Updated weights on worker 0-0, policy_version 349952 (0.00086) [2022-07-09 17:33:50,696][26022] Updated weights on worker 0-0, policy_version 349962 (0.00087) [2022-07-09 17:33:52,633][25689] Fps is (10 sec: 5591.0, 60 sec: 5646.5, 300 sec: 5654.4). Total num frames: 358370304. Throughput: 0: 5869.7. Samples: 358374004. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:33:52,635][25689] Avg episode reward: [(0, '-47.201')] [2022-07-09 17:33:52,639][26022] Updated weights on worker 0-0, policy_version 349972 (0.00090) [2022-07-09 17:33:54,356][26022] Updated weights on worker 0-0, policy_version 349982 (0.00086) [2022-07-09 17:33:56,275][26022] Updated weights on worker 0-0, policy_version 349992 (0.00089) [2022-07-09 17:33:57,662][25689] Fps is (10 sec: 5585.1, 60 sec: 5645.4, 300 sec: 5650.7). Total num frames: 358400000. Throughput: 0: 5882.9. Samples: 358408128. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:33:57,663][25689] Avg episode reward: [(0, '-46.619')] [2022-07-09 17:33:57,946][26022] Updated weights on worker 0-0, policy_version 350002 (0.00088) [2022-07-09 17:33:59,838][26022] Updated weights on worker 0-0, policy_version 350012 (0.00085) [2022-07-09 17:34:01,964][26022] Updated weights on worker 0-0, policy_version 350022 (0.00097) [2022-07-09 17:34:02,676][25689] Fps is (10 sec: 5506.1, 60 sec: 5618.1, 300 sec: 5655.6). Total num frames: 358425600. Throughput: 0: 5068.4. Samples: 358425268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:02,677][25689] Avg episode reward: [(0, '-46.505')] [2022-07-09 17:34:03,759][26022] Updated weights on worker 0-0, policy_version 350032 (0.00086) [2022-07-09 17:34:05,510][26022] Updated weights on worker 0-0, policy_version 350042 (0.00091) [2022-07-09 17:34:07,320][26022] Updated weights on worker 0-0, policy_version 350052 (0.00105) [2022-07-09 17:34:07,690][25689] Fps is (10 sec: 5513.8, 60 sec: 5634.2, 300 sec: 5653.0). Total num frames: 358455296. Throughput: 0: 5836.2. Samples: 358457668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:07,691][25689] Avg episode reward: [(0, '-46.610')] [2022-07-09 17:34:09,234][26022] Updated weights on worker 0-0, policy_version 350062 (0.00088) [2022-07-09 17:34:10,847][26022] Updated weights on worker 0-0, policy_version 350072 (0.00548) [2022-07-09 17:34:12,630][26022] Updated weights on worker 0-0, policy_version 350082 (0.00085) [2022-07-09 17:34:12,817][25689] Fps is (10 sec: 5755.3, 60 sec: 5650.5, 300 sec: 5654.4). Total num frames: 358483968. Throughput: 0: 5829.7. Samples: 358491672. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:12,819][25689] Avg episode reward: [(0, '-47.026')] [2022-07-09 17:34:14,593][26022] Updated weights on worker 0-0, policy_version 350092 (0.00086) [2022-07-09 17:34:16,364][26022] Updated weights on worker 0-0, policy_version 350102 (0.00614) [2022-07-09 17:34:17,893][25689] Fps is (10 sec: 5620.3, 60 sec: 5628.2, 300 sec: 5653.2). Total num frames: 358512640. Throughput: 0: 4977.0. Samples: 358508822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:17,894][25689] Avg episode reward: [(0, '-47.130')] [2022-07-09 17:34:18,081][26022] Updated weights on worker 0-0, policy_version 350112 (0.00091) [2022-07-09 17:34:20,026][26022] Updated weights on worker 0-0, policy_version 350122 (0.00093) [2022-07-09 17:34:21,581][26022] Updated weights on worker 0-0, policy_version 350132 (0.00096) [2022-07-09 17:34:22,948][25689] Fps is (10 sec: 5559.5, 60 sec: 5624.8, 300 sec: 5645.3). Total num frames: 358540288. Throughput: 0: 5821.0. Samples: 358543272. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:22,948][25689] Avg episode reward: [(0, '-47.449')] [2022-07-09 17:34:23,520][26022] Updated weights on worker 0-0, policy_version 350142 (0.00091) [2022-07-09 17:34:25,090][26022] Updated weights on worker 0-0, policy_version 350152 (0.00102) [2022-07-09 17:34:27,074][26022] Updated weights on worker 0-0, policy_version 350162 (0.00088) [2022-07-09 17:34:27,963][25689] Fps is (10 sec: 5796.4, 60 sec: 5676.4, 300 sec: 5656.7). Total num frames: 358571008. Throughput: 0: 5937.6. Samples: 358578040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:27,963][25689] Avg episode reward: [(0, '-47.770')] [2022-07-09 17:34:28,962][26022] Updated weights on worker 0-0, policy_version 350172 (0.00086) [2022-07-09 17:34:30,514][26022] Updated weights on worker 0-0, policy_version 350182 (0.00092) [2022-07-09 17:34:32,584][26022] Updated weights on worker 0-0, policy_version 350192 (0.00087) [2022-07-09 17:34:33,075][25689] Fps is (10 sec: 5965.8, 60 sec: 5670.2, 300 sec: 5662.6). Total num frames: 358600704. Throughput: 0: 5118.2. Samples: 358595354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:33,075][25689] Avg episode reward: [(0, '-48.377')] [2022-07-09 17:34:34,012][26022] Updated weights on worker 0-0, policy_version 350202 (0.00086) [2022-07-09 17:34:35,926][26022] Updated weights on worker 0-0, policy_version 350212 (0.00087) [2022-07-09 17:34:37,807][26022] Updated weights on worker 0-0, policy_version 350222 (0.00085) [2022-07-09 17:34:38,099][25689] Fps is (10 sec: 5657.6, 60 sec: 5657.1, 300 sec: 5659.0). Total num frames: 358628352. Throughput: 0: 5991.7. Samples: 358629888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:38,099][25689] Avg episode reward: [(0, '-48.648')] [2022-07-09 17:34:39,564][26022] Updated weights on worker 0-0, policy_version 350232 (0.00082) [2022-07-09 17:34:41,415][26022] Updated weights on worker 0-0, policy_version 350242 (0.00084) [2022-07-09 17:34:43,023][26022] Updated weights on worker 0-0, policy_version 350252 (0.00086) [2022-07-09 17:34:43,115][25689] Fps is (10 sec: 5711.3, 60 sec: 5681.8, 300 sec: 5662.4). Total num frames: 358658048. Throughput: 0: 5994.6. Samples: 358664168. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:43,116][25689] Avg episode reward: [(0, '-48.915')] [2022-07-09 17:34:44,936][26022] Updated weights on worker 0-0, policy_version 350262 (0.00087) [2022-07-09 17:34:46,628][26022] Updated weights on worker 0-0, policy_version 350272 (0.00087) [2022-07-09 17:34:48,148][25689] Fps is (10 sec: 5706.3, 60 sec: 5646.6, 300 sec: 5661.0). Total num frames: 358685696. Throughput: 0: 5125.9. Samples: 358681506. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:48,148][25689] Avg episode reward: [(0, '-48.332')] [2022-07-09 17:34:48,447][26022] Updated weights on worker 0-0, policy_version 350282 (0.00083) [2022-07-09 17:34:50,191][26022] Updated weights on worker 0-0, policy_version 350292 (0.00085) [2022-07-09 17:34:52,177][26022] Updated weights on worker 0-0, policy_version 350302 (0.00103) [2022-07-09 17:34:53,208][25689] Fps is (10 sec: 5681.8, 60 sec: 5696.9, 300 sec: 5660.9). Total num frames: 358715392. Throughput: 0: 5983.9. Samples: 358715828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:53,208][25689] Avg episode reward: [(0, '-48.137')] [2022-07-09 17:34:53,730][26022] Updated weights on worker 0-0, policy_version 350312 (0.00086) [2022-07-09 17:34:55,754][26022] Updated weights on worker 0-0, policy_version 350322 (0.00080) [2022-07-09 17:34:57,391][26022] Updated weights on worker 0-0, policy_version 350332 (0.00086) [2022-07-09 17:34:58,210][25689] Fps is (10 sec: 5699.3, 60 sec: 5665.6, 300 sec: 5657.7). Total num frames: 358743040. Throughput: 0: 5965.0. Samples: 358749850. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:34:58,210][25689] Avg episode reward: [(0, '-46.994')] [2022-07-09 17:34:59,467][26022] Updated weights on worker 0-0, policy_version 350342 (0.00090) [2022-07-09 17:35:01,023][26022] Updated weights on worker 0-0, policy_version 350352 (0.00080) [2022-07-09 17:35:03,215][25689] Fps is (10 sec: 5423.5, 60 sec: 5683.4, 300 sec: 5658.5). Total num frames: 358769664. Throughput: 0: 5122.9. Samples: 358767136. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:03,215][25689] Avg episode reward: [(0, '-46.303')] [2022-07-09 17:35:03,389][26022] Updated weights on worker 0-0, policy_version 350362 (0.00088) [2022-07-09 17:35:04,882][26022] Updated weights on worker 0-0, policy_version 350372 (0.00081) [2022-07-09 17:35:06,843][26022] Updated weights on worker 0-0, policy_version 350382 (0.00085) [2022-07-09 17:35:08,228][25689] Fps is (10 sec: 5621.7, 60 sec: 5683.5, 300 sec: 5660.8). Total num frames: 358799360. Throughput: 0: 5890.7. Samples: 358799790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:08,229][25689] Avg episode reward: [(0, '-46.427')] [2022-07-09 17:35:08,437][26022] Updated weights on worker 0-0, policy_version 350392 (0.00085) [2022-07-09 17:35:10,337][26022] Updated weights on worker 0-0, policy_version 350402 (0.00089) [2022-07-09 17:35:12,183][26022] Updated weights on worker 0-0, policy_version 350412 (0.00085) [2022-07-09 17:35:13,347][25689] Fps is (10 sec: 5760.7, 60 sec: 5684.3, 300 sec: 5662.8). Total num frames: 358828032. Throughput: 0: 5881.8. Samples: 358834280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:13,347][25689] Avg episode reward: [(0, '-46.474')] [2022-07-09 17:35:13,829][26022] Updated weights on worker 0-0, policy_version 350422 (0.00082) [2022-07-09 17:35:15,679][26022] Updated weights on worker 0-0, policy_version 350432 (0.00084) [2022-07-09 17:35:17,250][26022] Updated weights on worker 0-0, policy_version 350442 (0.00089) [2022-07-09 17:35:18,150][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:35:18,162][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000350446_358856704.pth [2022-07-09 17:35:18,163][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000348453_356815872.pth [2022-07-09 17:35:18,421][25689] Fps is (10 sec: 5726.0, 60 sec: 5701.3, 300 sec: 5656.0). Total num frames: 358857728. Throughput: 0: 5040.0. Samples: 358851718. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:18,422][25689] Avg episode reward: [(0, '-47.413')] [2022-07-09 17:35:19,255][26022] Updated weights on worker 0-0, policy_version 350452 (0.00096) [2022-07-09 17:35:20,943][26022] Updated weights on worker 0-0, policy_version 350462 (0.00086) [2022-07-09 17:35:22,674][26022] Updated weights on worker 0-0, policy_version 350472 (0.00088) [2022-07-09 17:35:23,425][25689] Fps is (10 sec: 5791.4, 60 sec: 5723.0, 300 sec: 5663.7). Total num frames: 358886400. Throughput: 0: 5890.4. Samples: 358886184. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:23,426][25689] Avg episode reward: [(0, '-48.134')] [2022-07-09 17:35:24,483][26022] Updated weights on worker 0-0, policy_version 350482 (0.00418) [2022-07-09 17:35:26,327][26022] Updated weights on worker 0-0, policy_version 350492 (0.00087) [2022-07-09 17:35:28,071][26022] Updated weights on worker 0-0, policy_version 350502 (0.00398) [2022-07-09 17:35:28,497][25689] Fps is (10 sec: 5793.1, 60 sec: 5700.7, 300 sec: 5664.5). Total num frames: 358916096. Throughput: 0: 5959.5. Samples: 358920582. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:28,497][25689] Avg episode reward: [(0, '-47.877')] [2022-07-09 17:35:30,026][26022] Updated weights on worker 0-0, policy_version 350512 (0.00087) [2022-07-09 17:35:31,428][26022] Updated weights on worker 0-0, policy_version 350522 (0.00091) [2022-07-09 17:35:33,588][25689] Fps is (10 sec: 5642.3, 60 sec: 5668.8, 300 sec: 5663.5). Total num frames: 358943744. Throughput: 0: 5106.1. Samples: 358937636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:33,589][25689] Avg episode reward: [(0, '-48.326')] [2022-07-09 17:35:33,748][26022] Updated weights on worker 0-0, policy_version 350532 (0.00082) [2022-07-09 17:35:35,119][26022] Updated weights on worker 0-0, policy_version 350542 (0.00083) [2022-07-09 17:35:37,328][26022] Updated weights on worker 0-0, policy_version 350552 (0.00081) [2022-07-09 17:35:38,677][25689] Fps is (10 sec: 5632.8, 60 sec: 5696.5, 300 sec: 5660.0). Total num frames: 358973440. Throughput: 0: 5937.7. Samples: 358971990. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:38,678][25689] Avg episode reward: [(0, '-47.579')] [2022-07-09 17:35:38,813][26022] Updated weights on worker 0-0, policy_version 350562 (0.00097) [2022-07-09 17:35:40,805][26022] Updated weights on worker 0-0, policy_version 350572 (0.00082) [2022-07-09 17:35:42,312][26022] Updated weights on worker 0-0, policy_version 350582 (0.00080) [2022-07-09 17:35:43,699][25689] Fps is (10 sec: 5874.3, 60 sec: 5696.1, 300 sec: 5663.4). Total num frames: 359003136. Throughput: 0: 5933.7. Samples: 359006482. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:43,699][25689] Avg episode reward: [(0, '-47.878')] [2022-07-09 17:35:44,416][26022] Updated weights on worker 0-0, policy_version 350592 (0.00082) [2022-07-09 17:35:45,958][26022] Updated weights on worker 0-0, policy_version 350602 (0.00087) [2022-07-09 17:35:47,819][26022] Updated weights on worker 0-0, policy_version 350612 (0.00082) [2022-07-09 17:35:48,787][25689] Fps is (10 sec: 5672.3, 60 sec: 5690.9, 300 sec: 5659.4). Total num frames: 359030784. Throughput: 0: 5933.6. Samples: 359040974. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:48,787][25689] Avg episode reward: [(0, '-46.992')] [2022-07-09 17:35:49,665][26022] Updated weights on worker 0-0, policy_version 350622 (0.00087) [2022-07-09 17:35:51,390][26022] Updated weights on worker 0-0, policy_version 350632 (0.00088) [2022-07-09 17:35:53,179][26022] Updated weights on worker 0-0, policy_version 350642 (0.00098) [2022-07-09 17:35:53,884][25689] Fps is (10 sec: 5730.6, 60 sec: 5704.3, 300 sec: 5667.9). Total num frames: 359061504. Throughput: 0: 5940.8. Samples: 359058210. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:53,884][25689] Avg episode reward: [(0, '-47.066')] [2022-07-09 17:35:55,107][26022] Updated weights on worker 0-0, policy_version 350652 (0.00087) [2022-07-09 17:35:56,639][26022] Updated weights on worker 0-0, policy_version 350662 (0.00084) [2022-07-09 17:35:58,616][26022] Updated weights on worker 0-0, policy_version 350672 (0.00823) [2022-07-09 17:35:58,938][25689] Fps is (10 sec: 5749.8, 60 sec: 5699.4, 300 sec: 5661.1). Total num frames: 359089152. Throughput: 0: 5952.1. Samples: 359092584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:35:58,938][25689] Avg episode reward: [(0, '-47.100')] [2022-07-09 17:36:00,259][26022] Updated weights on worker 0-0, policy_version 350682 (0.00084) [2022-07-09 17:36:02,552][26022] Updated weights on worker 0-0, policy_version 350692 (0.00087) [2022-07-09 17:36:03,968][25689] Fps is (10 sec: 5382.0, 60 sec: 5697.0, 300 sec: 5665.3). Total num frames: 359115776. Throughput: 0: 5845.0. Samples: 359124956. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 17:36:03,968][25689] Avg episode reward: [(0, '-47.073')] [2022-07-09 17:36:04,320][26022] Updated weights on worker 0-0, policy_version 350702 (0.00086) [2022-07-09 17:36:05,850][26022] Updated weights on worker 0-0, policy_version 350712 (0.00087) [2022-07-09 17:36:07,816][26022] Updated weights on worker 0-0, policy_version 350722 (0.00087) [2022-07-09 17:36:08,970][25689] Fps is (10 sec: 5716.4, 60 sec: 5715.0, 300 sec: 5670.1). Total num frames: 359146496. Throughput: 0: 5009.1. Samples: 359142072. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:08,970][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 17:36:09,768][26022] Updated weights on worker 0-0, policy_version 350732 (0.00090) [2022-07-09 17:36:11,403][26022] Updated weights on worker 0-0, policy_version 350742 (0.00089) [2022-07-09 17:36:13,362][26022] Updated weights on worker 0-0, policy_version 350752 (0.00091) [2022-07-09 17:36:14,057][25689] Fps is (10 sec: 5785.3, 60 sec: 5701.0, 300 sec: 5668.8). Total num frames: 359174144. Throughput: 0: 5868.2. Samples: 359176588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:14,057][25689] Avg episode reward: [(0, '-46.975')] [2022-07-09 17:36:14,952][26022] Updated weights on worker 0-0, policy_version 350762 (0.00082) [2022-07-09 17:36:16,798][26022] Updated weights on worker 0-0, policy_version 350772 (0.00087) [2022-07-09 17:36:18,427][26022] Updated weights on worker 0-0, policy_version 350782 (0.00083) [2022-07-09 17:36:19,083][25689] Fps is (10 sec: 5670.2, 60 sec: 5705.6, 300 sec: 5668.4). Total num frames: 359203840. Throughput: 0: 5897.5. Samples: 359211388. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:19,083][25689] Avg episode reward: [(0, '-47.021')] [2022-07-09 17:36:20,339][26022] Updated weights on worker 0-0, policy_version 350792 (0.00098) [2022-07-09 17:36:21,920][26022] Updated weights on worker 0-0, policy_version 350802 (0.00091) [2022-07-09 17:36:24,012][26022] Updated weights on worker 0-0, policy_version 350812 (0.00089) [2022-07-09 17:36:24,099][25689] Fps is (10 sec: 5710.7, 60 sec: 5687.6, 300 sec: 5665.8). Total num frames: 359231488. Throughput: 0: 5168.2. Samples: 359228994. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:24,099][25689] Avg episode reward: [(0, '-46.841')] [2022-07-09 17:36:25,418][26022] Updated weights on worker 0-0, policy_version 350822 (0.00090) [2022-07-09 17:36:27,453][26022] Updated weights on worker 0-0, policy_version 350832 (0.00081) [2022-07-09 17:36:28,965][26022] Updated weights on worker 0-0, policy_version 350842 (0.00088) [2022-07-09 17:36:29,104][25689] Fps is (10 sec: 5926.6, 60 sec: 5727.6, 300 sec: 5680.2). Total num frames: 359263232. Throughput: 0: 6024.9. Samples: 359263380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:29,105][25689] Avg episode reward: [(0, '-46.315')] [2022-07-09 17:36:30,958][26022] Updated weights on worker 0-0, policy_version 350852 (0.00049) [2022-07-09 17:36:32,660][26022] Updated weights on worker 0-0, policy_version 350862 (0.00089) [2022-07-09 17:36:34,153][25689] Fps is (10 sec: 5805.2, 60 sec: 5714.7, 300 sec: 5672.5). Total num frames: 359289856. Throughput: 0: 6030.6. Samples: 359297778. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:34,153][25689] Avg episode reward: [(0, '-46.741')] [2022-07-09 17:36:34,710][26022] Updated weights on worker 0-0, policy_version 350872 (0.00097) [2022-07-09 17:36:36,152][26022] Updated weights on worker 0-0, policy_version 350882 (0.00091) [2022-07-09 17:36:38,158][26022] Updated weights on worker 0-0, policy_version 350892 (0.00092) [2022-07-09 17:36:39,159][25689] Fps is (10 sec: 5703.0, 60 sec: 5739.5, 300 sec: 5679.6). Total num frames: 359320576. Throughput: 0: 5175.9. Samples: 359315302. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:39,160][25689] Avg episode reward: [(0, '-46.240')] [2022-07-09 17:36:39,726][26022] Updated weights on worker 0-0, policy_version 350902 (0.00085) [2022-07-09 17:36:41,726][26022] Updated weights on worker 0-0, policy_version 350912 (0.00085) [2022-07-09 17:36:43,391][26022] Updated weights on worker 0-0, policy_version 350922 (0.00424) [2022-07-09 17:36:44,211][25689] Fps is (10 sec: 5803.2, 60 sec: 5702.8, 300 sec: 5671.8). Total num frames: 359348224. Throughput: 0: 6008.1. Samples: 359349830. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:44,213][25689] Avg episode reward: [(0, '-46.187')] [2022-07-09 17:36:45,330][26022] Updated weights on worker 0-0, policy_version 350932 (0.00085) [2022-07-09 17:36:46,900][26022] Updated weights on worker 0-0, policy_version 350942 (0.00091) [2022-07-09 17:36:48,774][26022] Updated weights on worker 0-0, policy_version 350952 (0.00094) [2022-07-09 17:36:49,242][25689] Fps is (10 sec: 5585.9, 60 sec: 5725.1, 300 sec: 5672.0). Total num frames: 359376896. Throughput: 0: 5998.7. Samples: 359384180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:49,242][25689] Avg episode reward: [(0, '-46.584')] [2022-07-09 17:36:50,567][26022] Updated weights on worker 0-0, policy_version 350962 (0.00092) [2022-07-09 17:36:52,462][26022] Updated weights on worker 0-0, policy_version 350972 (0.00082) [2022-07-09 17:36:54,132][26022] Updated weights on worker 0-0, policy_version 350982 (0.00096) [2022-07-09 17:36:54,343][25689] Fps is (10 sec: 5659.4, 60 sec: 5690.8, 300 sec: 5673.6). Total num frames: 359405568. Throughput: 0: 5128.5. Samples: 359401324. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:54,344][25689] Avg episode reward: [(0, '-46.354')] [2022-07-09 17:36:55,980][26022] Updated weights on worker 0-0, policy_version 350992 (0.00090) [2022-07-09 17:36:57,709][26022] Updated weights on worker 0-0, policy_version 351002 (0.00086) [2022-07-09 17:36:59,406][25689] Fps is (10 sec: 5742.5, 60 sec: 5723.9, 300 sec: 5679.4). Total num frames: 359435264. Throughput: 0: 5939.0. Samples: 359435548. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:36:59,406][25689] Avg episode reward: [(0, '-46.139')] [2022-07-09 17:36:59,527][26022] Updated weights on worker 0-0, policy_version 351012 (0.00094) [2022-07-09 17:37:01,397][26022] Updated weights on worker 0-0, policy_version 351022 (0.00087) [2022-07-09 17:37:03,544][26022] Updated weights on worker 0-0, policy_version 351032 (0.00086) [2022-07-09 17:37:04,441][25689] Fps is (10 sec: 5374.8, 60 sec: 5689.6, 300 sec: 5671.9). Total num frames: 359459840. Throughput: 0: 5817.7. Samples: 359467522. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:04,441][25689] Avg episode reward: [(0, '-46.265')] [2022-07-09 17:37:05,339][26022] Updated weights on worker 0-0, policy_version 351042 (0.00618) [2022-07-09 17:37:07,266][26022] Updated weights on worker 0-0, policy_version 351052 (0.00087) [2022-07-09 17:37:08,830][26022] Updated weights on worker 0-0, policy_version 351062 (0.00089) [2022-07-09 17:37:09,475][25689] Fps is (10 sec: 5491.8, 60 sec: 5686.5, 300 sec: 5675.6). Total num frames: 359490560. Throughput: 0: 4969.8. Samples: 359484734. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:09,475][25689] Avg episode reward: [(0, '-46.570')] [2022-07-09 17:37:10,835][26022] Updated weights on worker 0-0, policy_version 351072 (0.00089) [2022-07-09 17:37:12,512][26022] Updated weights on worker 0-0, policy_version 351082 (0.00092) [2022-07-09 17:37:14,472][26022] Updated weights on worker 0-0, policy_version 351092 (0.00092) [2022-07-09 17:37:14,525][25689] Fps is (10 sec: 5788.2, 60 sec: 5690.0, 300 sec: 5678.5). Total num frames: 359518208. Throughput: 0: 5815.8. Samples: 359518696. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:14,525][25689] Avg episode reward: [(0, '-47.571')] [2022-07-09 17:37:16,126][26022] Updated weights on worker 0-0, policy_version 351102 (0.00093) [2022-07-09 17:37:18,070][26022] Updated weights on worker 0-0, policy_version 351112 (0.00083) [2022-07-09 17:37:18,303][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:37:18,334][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000351113_359539712.pth [2022-07-09 17:37:18,334][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000349118_357496832.pth [2022-07-09 17:37:19,529][25689] Fps is (10 sec: 5601.7, 60 sec: 5675.1, 300 sec: 5672.0). Total num frames: 359546880. Throughput: 0: 5845.3. Samples: 359553172. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:19,529][25689] Avg episode reward: [(0, '-48.056')] [2022-07-09 17:37:19,619][26022] Updated weights on worker 0-0, policy_version 351122 (0.00085) [2022-07-09 17:37:21,706][26022] Updated weights on worker 0-0, policy_version 351132 (0.00087) [2022-07-09 17:37:23,181][26022] Updated weights on worker 0-0, policy_version 351142 (0.00090) [2022-07-09 17:37:24,534][25689] Fps is (10 sec: 5728.9, 60 sec: 5693.0, 300 sec: 5672.0). Total num frames: 359575552. Throughput: 0: 5121.7. Samples: 359570436. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:24,535][25689] Avg episode reward: [(0, '-49.002')] [2022-07-09 17:37:25,462][26022] Updated weights on worker 0-0, policy_version 351152 (0.00093) [2022-07-09 17:37:26,835][26022] Updated weights on worker 0-0, policy_version 351162 (0.00079) [2022-07-09 17:37:28,911][26022] Updated weights on worker 0-0, policy_version 351172 (0.00079) [2022-07-09 17:37:29,547][25689] Fps is (10 sec: 5724.1, 60 sec: 5641.6, 300 sec: 5673.3). Total num frames: 359604224. Throughput: 0: 5959.9. Samples: 359604362. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:29,548][25689] Avg episode reward: [(0, '-48.882')] [2022-07-09 17:37:30,493][26022] Updated weights on worker 0-0, policy_version 351182 (0.00091) [2022-07-09 17:37:32,398][26022] Updated weights on worker 0-0, policy_version 351192 (0.00089) [2022-07-09 17:37:34,180][26022] Updated weights on worker 0-0, policy_version 351202 (0.00093) [2022-07-09 17:37:34,680][25689] Fps is (10 sec: 5652.3, 60 sec: 5667.6, 300 sec: 5674.3). Total num frames: 359632896. Throughput: 0: 5929.5. Samples: 359638204. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:34,680][25689] Avg episode reward: [(0, '-48.406')] [2022-07-09 17:37:36,044][26022] Updated weights on worker 0-0, policy_version 351212 (0.00083) [2022-07-09 17:37:37,709][26022] Updated weights on worker 0-0, policy_version 351222 (0.00088) [2022-07-09 17:37:39,697][25689] Fps is (10 sec: 5649.5, 60 sec: 5632.7, 300 sec: 5678.2). Total num frames: 359661568. Throughput: 0: 5071.2. Samples: 359655450. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:39,698][25689] Avg episode reward: [(0, '-48.286')] [2022-07-09 17:37:39,702][26022] Updated weights on worker 0-0, policy_version 351232 (0.00087) [2022-07-09 17:37:41,374][26022] Updated weights on worker 0-0, policy_version 351242 (0.00094) [2022-07-09 17:37:43,269][26022] Updated weights on worker 0-0, policy_version 351252 (0.00089) [2022-07-09 17:37:44,738][25689] Fps is (10 sec: 5802.8, 60 sec: 5667.5, 300 sec: 5674.1). Total num frames: 359691264. Throughput: 0: 5899.7. Samples: 359689632. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:44,739][25689] Avg episode reward: [(0, '-47.358')] [2022-07-09 17:37:44,847][26022] Updated weights on worker 0-0, policy_version 351262 (0.00089) [2022-07-09 17:37:46,828][26022] Updated weights on worker 0-0, policy_version 351272 (0.00086) [2022-07-09 17:37:48,598][26022] Updated weights on worker 0-0, policy_version 351282 (0.00086) [2022-07-09 17:37:49,751][25689] Fps is (10 sec: 5601.8, 60 sec: 5635.4, 300 sec: 5675.7). Total num frames: 359717888. Throughput: 0: 5909.4. Samples: 359723756. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:49,752][25689] Avg episode reward: [(0, '-46.983')] [2022-07-09 17:37:50,499][26022] Updated weights on worker 0-0, policy_version 351292 (0.00085) [2022-07-09 17:37:52,335][26022] Updated weights on worker 0-0, policy_version 351302 (0.00085) [2022-07-09 17:37:54,226][26022] Updated weights on worker 0-0, policy_version 351312 (0.00094) [2022-07-09 17:37:54,827][25689] Fps is (10 sec: 5481.1, 60 sec: 5637.8, 300 sec: 5671.2). Total num frames: 359746560. Throughput: 0: 5096.0. Samples: 359740874. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:54,827][25689] Avg episode reward: [(0, '-47.008')] [2022-07-09 17:37:55,847][26022] Updated weights on worker 0-0, policy_version 351322 (0.00097) [2022-07-09 17:37:57,944][26022] Updated weights on worker 0-0, policy_version 351332 (0.00086) [2022-07-09 17:37:59,324][26022] Updated weights on worker 0-0, policy_version 351342 (0.00089) [2022-07-09 17:37:59,861][25689] Fps is (10 sec: 5773.6, 60 sec: 5640.5, 300 sec: 5679.0). Total num frames: 359776256. Throughput: 0: 5916.5. Samples: 359774748. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:37:59,861][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 17:38:01,410][26022] Updated weights on worker 0-0, policy_version 351352 (0.00085) [2022-07-09 17:38:03,304][26022] Updated weights on worker 0-0, policy_version 351362 (0.00087) [2022-07-09 17:38:04,943][25689] Fps is (10 sec: 5466.0, 60 sec: 5652.9, 300 sec: 5667.2). Total num frames: 359801856. Throughput: 0: 5796.5. Samples: 359806750. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:04,944][25689] Avg episode reward: [(0, '-47.219')] [2022-07-09 17:38:05,287][26022] Updated weights on worker 0-0, policy_version 351372 (0.00051) [2022-07-09 17:38:06,932][26022] Updated weights on worker 0-0, policy_version 351382 (0.00091) [2022-07-09 17:38:08,858][26022] Updated weights on worker 0-0, policy_version 351392 (0.00092) [2022-07-09 17:38:09,946][25689] Fps is (10 sec: 5482.8, 60 sec: 5638.9, 300 sec: 5676.3). Total num frames: 359831552. Throughput: 0: 5807.1. Samples: 359841030. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:09,947][25689] Avg episode reward: [(0, '-47.239')] [2022-07-09 17:38:10,664][26022] Updated weights on worker 0-0, policy_version 351402 (0.00093) [2022-07-09 17:38:12,578][26022] Updated weights on worker 0-0, policy_version 351412 (0.00086) [2022-07-09 17:38:14,094][26022] Updated weights on worker 0-0, policy_version 351422 (0.00079) [2022-07-09 17:38:14,985][25689] Fps is (10 sec: 5914.5, 60 sec: 5673.8, 300 sec: 5675.9). Total num frames: 359861248. Throughput: 0: 5822.1. Samples: 359858238. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:14,989][25689] Avg episode reward: [(0, '-48.303')] [2022-07-09 17:38:16,058][26022] Updated weights on worker 0-0, policy_version 351432 (0.00091) [2022-07-09 17:38:17,595][26022] Updated weights on worker 0-0, policy_version 351442 (0.00083) [2022-07-09 17:38:19,659][26022] Updated weights on worker 0-0, policy_version 351452 (0.00104) [2022-07-09 17:38:20,044][25689] Fps is (10 sec: 5678.9, 60 sec: 5651.8, 300 sec: 5675.2). Total num frames: 359888896. Throughput: 0: 5845.1. Samples: 359892720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:20,044][25689] Avg episode reward: [(0, '-47.595')] [2022-07-09 17:38:21,287][26022] Updated weights on worker 0-0, policy_version 351462 (0.00090) [2022-07-09 17:38:23,237][26022] Updated weights on worker 0-0, policy_version 351472 (0.00085) [2022-07-09 17:38:24,777][26022] Updated weights on worker 0-0, policy_version 351482 (0.00077) [2022-07-09 17:38:25,066][25689] Fps is (10 sec: 5688.5, 60 sec: 5667.2, 300 sec: 5682.1). Total num frames: 359918592. Throughput: 0: 5980.6. Samples: 359927094. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:25,066][25689] Avg episode reward: [(0, '-46.668')] [2022-07-09 17:38:26,832][26022] Updated weights on worker 0-0, policy_version 351492 (0.00087) [2022-07-09 17:38:28,452][26022] Updated weights on worker 0-0, policy_version 351502 (0.00088) [2022-07-09 17:38:30,091][25689] Fps is (10 sec: 5707.5, 60 sec: 5649.1, 300 sec: 5675.6). Total num frames: 359946240. Throughput: 0: 5109.1. Samples: 359943952. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:30,091][25689] Avg episode reward: [(0, '-45.967')] [2022-07-09 17:38:30,422][26022] Updated weights on worker 0-0, policy_version 351512 (0.00099) [2022-07-09 17:38:32,070][26022] Updated weights on worker 0-0, policy_version 351522 (0.00086) [2022-07-09 17:38:33,987][26022] Updated weights on worker 0-0, policy_version 351532 (0.00090) [2022-07-09 17:38:35,228][25689] Fps is (10 sec: 5643.1, 60 sec: 5665.6, 300 sec: 5677.7). Total num frames: 359975936. Throughput: 0: 5926.2. Samples: 359978200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:35,228][25689] Avg episode reward: [(0, '-45.898')] [2022-07-09 17:38:35,735][26022] Updated weights on worker 0-0, policy_version 351542 (0.00087) [2022-07-09 17:38:37,631][26022] Updated weights on worker 0-0, policy_version 351552 (0.00088) [2022-07-09 17:38:39,370][26022] Updated weights on worker 0-0, policy_version 351562 (0.00122) [2022-07-09 17:38:40,244][25689] Fps is (10 sec: 5748.6, 60 sec: 5665.7, 300 sec: 5679.2). Total num frames: 360004608. Throughput: 0: 5935.2. Samples: 360012614. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2022-07-09 17:38:40,245][25689] Avg episode reward: [(0, '-44.986')] [2022-07-09 17:38:41,134][26022] Updated weights on worker 0-0, policy_version 351572 (0.00088) [2022-07-09 17:38:43,014][26022] Updated weights on worker 0-0, policy_version 351582 (0.00838) [2022-07-09 17:38:44,756][26022] Updated weights on worker 0-0, policy_version 351592 (0.00090) [2022-07-09 17:38:45,261][25689] Fps is (10 sec: 5715.3, 60 sec: 5651.0, 300 sec: 5675.8). Total num frames: 360033280. Throughput: 0: 5078.0. Samples: 360029646. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:38:45,261][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 17:38:46,543][26022] Updated weights on worker 0-0, policy_version 351602 (0.00089) [2022-07-09 17:38:48,178][26022] Updated weights on worker 0-0, policy_version 351612 (0.00073) [2022-07-09 17:38:50,131][26022] Updated weights on worker 0-0, policy_version 351622 (0.00649) [2022-07-09 17:38:50,275][25689] Fps is (10 sec: 5717.0, 60 sec: 5684.8, 300 sec: 5683.5). Total num frames: 360061952. Throughput: 0: 5960.7. Samples: 360064262. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:38:50,275][25689] Avg episode reward: [(0, '-46.617')] [2022-07-09 17:38:51,808][26022] Updated weights on worker 0-0, policy_version 351632 (0.00088) [2022-07-09 17:38:53,747][26022] Updated weights on worker 0-0, policy_version 351642 (0.00083) [2022-07-09 17:38:55,331][25689] Fps is (10 sec: 5694.8, 60 sec: 5686.6, 300 sec: 5679.5). Total num frames: 360090624. Throughput: 0: 5965.3. Samples: 360098120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:38:55,332][25689] Avg episode reward: [(0, '-47.276')] [2022-07-09 17:38:55,600][26022] Updated weights on worker 0-0, policy_version 351652 (0.00090) [2022-07-09 17:38:57,361][26022] Updated weights on worker 0-0, policy_version 351662 (0.00085) [2022-07-09 17:38:59,240][26022] Updated weights on worker 0-0, policy_version 351672 (0.00092) [2022-07-09 17:39:00,339][25689] Fps is (10 sec: 5596.2, 60 sec: 5655.2, 300 sec: 5686.5). Total num frames: 360118272. Throughput: 0: 5097.0. Samples: 360115036. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:00,340][25689] Avg episode reward: [(0, '-46.703')] [2022-07-09 17:39:00,874][26022] Updated weights on worker 0-0, policy_version 351682 (0.00084) [2022-07-09 17:39:03,188][26022] Updated weights on worker 0-0, policy_version 351692 (0.00091) [2022-07-09 17:39:04,806][26022] Updated weights on worker 0-0, policy_version 351702 (0.00093) [2022-07-09 17:39:05,351][25689] Fps is (10 sec: 5518.2, 60 sec: 5695.7, 300 sec: 5679.7). Total num frames: 360145920. Throughput: 0: 5855.4. Samples: 360147282. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:05,353][25689] Avg episode reward: [(0, '-47.324')] [2022-07-09 17:39:06,687][26022] Updated weights on worker 0-0, policy_version 351712 (0.00090) [2022-07-09 17:39:08,366][26022] Updated weights on worker 0-0, policy_version 351722 (0.00094) [2022-07-09 17:39:10,358][26022] Updated weights on worker 0-0, policy_version 351732 (0.00096) [2022-07-09 17:39:10,371][25689] Fps is (10 sec: 5511.9, 60 sec: 5660.2, 300 sec: 5678.3). Total num frames: 360173568. Throughput: 0: 5850.8. Samples: 360181840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:10,371][25689] Avg episode reward: [(0, '-46.303')] [2022-07-09 17:39:12,048][26022] Updated weights on worker 0-0, policy_version 351742 (0.00092) [2022-07-09 17:39:14,010][26022] Updated weights on worker 0-0, policy_version 351752 (0.00332) [2022-07-09 17:39:15,407][25689] Fps is (10 sec: 5702.8, 60 sec: 5660.5, 300 sec: 5682.5). Total num frames: 360203264. Throughput: 0: 5018.6. Samples: 360198874. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:15,407][25689] Avg episode reward: [(0, '-45.272')] [2022-07-09 17:39:15,431][26022] Updated weights on worker 0-0, policy_version 351762 (0.00086) [2022-07-09 17:39:17,540][26022] Updated weights on worker 0-0, policy_version 351772 (0.00092) [2022-07-09 17:39:18,510][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:39:18,522][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000351779_360221696.pth [2022-07-09 17:39:18,523][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000349781_358175744.pth [2022-07-09 17:39:19,335][26022] Updated weights on worker 0-0, policy_version 351782 (0.00612) [2022-07-09 17:39:20,414][25689] Fps is (10 sec: 5709.6, 60 sec: 5665.3, 300 sec: 5683.4). Total num frames: 360230912. Throughput: 0: 5876.0. Samples: 360233000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:20,416][25689] Avg episode reward: [(0, '-44.349')] [2022-07-09 17:39:21,157][26022] Updated weights on worker 0-0, policy_version 351792 (0.00092) [2022-07-09 17:39:22,780][26022] Updated weights on worker 0-0, policy_version 351802 (0.00090) [2022-07-09 17:39:24,613][26022] Updated weights on worker 0-0, policy_version 351812 (0.00095) [2022-07-09 17:39:25,459][25689] Fps is (10 sec: 5602.9, 60 sec: 5646.2, 300 sec: 5676.0). Total num frames: 360259584. Throughput: 0: 5973.8. Samples: 360267400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:25,459][25689] Avg episode reward: [(0, '-43.821')] [2022-07-09 17:39:26,412][26022] Updated weights on worker 0-0, policy_version 351822 (0.00091) [2022-07-09 17:39:28,219][26022] Updated weights on worker 0-0, policy_version 351832 (0.00090) [2022-07-09 17:39:30,014][26022] Updated weights on worker 0-0, policy_version 351842 (0.00090) [2022-07-09 17:39:30,466][25689] Fps is (10 sec: 5704.9, 60 sec: 5664.9, 300 sec: 5674.5). Total num frames: 360288256. Throughput: 0: 5103.6. Samples: 360284398. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:30,466][25689] Avg episode reward: [(0, '-44.944')] [2022-07-09 17:39:31,745][26022] Updated weights on worker 0-0, policy_version 351852 (0.00616) [2022-07-09 17:39:33,757][26022] Updated weights on worker 0-0, policy_version 351862 (0.00072) [2022-07-09 17:39:35,402][26022] Updated weights on worker 0-0, policy_version 351872 (0.00087) [2022-07-09 17:39:35,565][25689] Fps is (10 sec: 5674.2, 60 sec: 5651.4, 300 sec: 5676.5). Total num frames: 360316928. Throughput: 0: 5933.1. Samples: 360318474. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:35,566][25689] Avg episode reward: [(0, '-44.950')] [2022-07-09 17:39:37,322][26022] Updated weights on worker 0-0, policy_version 351882 (0.00088) [2022-07-09 17:39:38,931][26022] Updated weights on worker 0-0, policy_version 351892 (0.00088) [2022-07-09 17:39:40,580][25689] Fps is (10 sec: 5771.1, 60 sec: 5668.6, 300 sec: 5676.6). Total num frames: 360346624. Throughput: 0: 5947.7. Samples: 360352938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:40,581][25689] Avg episode reward: [(0, '-46.175')] [2022-07-09 17:39:40,670][26022] Updated weights on worker 0-0, policy_version 351902 (0.00087) [2022-07-09 17:39:42,818][26022] Updated weights on worker 0-0, policy_version 351912 (0.00087) [2022-07-09 17:39:44,161][26022] Updated weights on worker 0-0, policy_version 351922 (0.00089) [2022-07-09 17:39:45,586][25689] Fps is (10 sec: 5722.5, 60 sec: 5652.6, 300 sec: 5677.1). Total num frames: 360374272. Throughput: 0: 5106.5. Samples: 360370178. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:45,589][25689] Avg episode reward: [(0, '-46.514')] [2022-07-09 17:39:46,380][26022] Updated weights on worker 0-0, policy_version 351932 (0.00086) [2022-07-09 17:39:47,859][26022] Updated weights on worker 0-0, policy_version 351942 (0.00088) [2022-07-09 17:39:49,719][26022] Updated weights on worker 0-0, policy_version 351952 (0.00103) [2022-07-09 17:39:50,596][25689] Fps is (10 sec: 5827.3, 60 sec: 5686.9, 300 sec: 5681.5). Total num frames: 360404992. Throughput: 0: 5971.5. Samples: 360404606. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:50,597][25689] Avg episode reward: [(0, '-47.236')] [2022-07-09 17:39:51,684][26022] Updated weights on worker 0-0, policy_version 351962 (0.00087) [2022-07-09 17:39:53,291][26022] Updated weights on worker 0-0, policy_version 351972 (0.00093) [2022-07-09 17:39:55,303][26022] Updated weights on worker 0-0, policy_version 351982 (0.00088) [2022-07-09 17:39:55,687][25689] Fps is (10 sec: 5676.8, 60 sec: 5649.6, 300 sec: 5676.3). Total num frames: 360431616. Throughput: 0: 5977.7. Samples: 360438758. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:39:55,688][25689] Avg episode reward: [(0, '-47.082')] [2022-07-09 17:39:57,038][26022] Updated weights on worker 0-0, policy_version 351992 (0.00088) [2022-07-09 17:39:58,735][26022] Updated weights on worker 0-0, policy_version 352002 (0.00091) [2022-07-09 17:40:00,641][26022] Updated weights on worker 0-0, policy_version 352012 (0.00091) [2022-07-09 17:40:00,706][25689] Fps is (10 sec: 5469.7, 60 sec: 5665.6, 300 sec: 5682.9). Total num frames: 360460288. Throughput: 0: 5107.2. Samples: 360455726. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:00,706][25689] Avg episode reward: [(0, '-47.084')] [2022-07-09 17:40:02,554][26022] Updated weights on worker 0-0, policy_version 352022 (0.00087) [2022-07-09 17:40:04,708][26022] Updated weights on worker 0-0, policy_version 352032 (0.00086) [2022-07-09 17:40:05,722][25689] Fps is (10 sec: 5612.5, 60 sec: 5665.3, 300 sec: 5676.0). Total num frames: 360487936. Throughput: 0: 5838.7. Samples: 360487746. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:05,722][25689] Avg episode reward: [(0, '-46.715')] [2022-07-09 17:40:06,351][26022] Updated weights on worker 0-0, policy_version 352042 (0.00088) [2022-07-09 17:40:08,191][26022] Updated weights on worker 0-0, policy_version 352052 (0.00088) [2022-07-09 17:40:09,967][26022] Updated weights on worker 0-0, policy_version 352062 (0.00086) [2022-07-09 17:40:10,746][25689] Fps is (10 sec: 5507.4, 60 sec: 5664.8, 300 sec: 5674.4). Total num frames: 360515584. Throughput: 0: 5829.4. Samples: 360522068. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:10,746][25689] Avg episode reward: [(0, '-47.181')] [2022-07-09 17:40:11,729][26022] Updated weights on worker 0-0, policy_version 352072 (0.00084) [2022-07-09 17:40:13,510][26022] Updated weights on worker 0-0, policy_version 352082 (0.00088) [2022-07-09 17:40:15,295][26022] Updated weights on worker 0-0, policy_version 352092 (0.00096) [2022-07-09 17:40:15,877][25689] Fps is (10 sec: 5444.9, 60 sec: 5622.0, 300 sec: 5666.4). Total num frames: 360543232. Throughput: 0: 4972.0. Samples: 360539144. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:15,878][25689] Avg episode reward: [(0, '-47.291')] [2022-07-09 17:40:17,158][26022] Updated weights on worker 0-0, policy_version 352102 (0.00094) [2022-07-09 17:40:18,986][26022] Updated weights on worker 0-0, policy_version 352112 (0.00085) [2022-07-09 17:40:20,643][26022] Updated weights on worker 0-0, policy_version 352122 (0.00087) [2022-07-09 17:40:20,902][25689] Fps is (10 sec: 5746.9, 60 sec: 5671.2, 300 sec: 5672.9). Total num frames: 360573952. Throughput: 0: 5832.4. Samples: 360573522. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:20,903][25689] Avg episode reward: [(0, '-47.702')] [2022-07-09 17:40:22,563][26022] Updated weights on worker 0-0, policy_version 352132 (0.00095) [2022-07-09 17:40:24,033][26022] Updated weights on worker 0-0, policy_version 352142 (0.00086) [2022-07-09 17:40:25,915][25689] Fps is (10 sec: 5917.1, 60 sec: 5674.2, 300 sec: 5670.6). Total num frames: 360602624. Throughput: 0: 5955.4. Samples: 360608004. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:25,916][25689] Avg episode reward: [(0, '-47.555')] [2022-07-09 17:40:26,198][26022] Updated weights on worker 0-0, policy_version 352152 (0.00091) [2022-07-09 17:40:27,747][26022] Updated weights on worker 0-0, policy_version 352162 (0.00090) [2022-07-09 17:40:29,784][26022] Updated weights on worker 0-0, policy_version 352172 (0.00083) [2022-07-09 17:40:31,001][25689] Fps is (10 sec: 5678.5, 60 sec: 5666.8, 300 sec: 5674.1). Total num frames: 360631296. Throughput: 0: 5072.4. Samples: 360624810. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:31,001][25689] Avg episode reward: [(0, '-48.219')] [2022-07-09 17:40:31,590][26022] Updated weights on worker 0-0, policy_version 352182 (0.00096) [2022-07-09 17:40:33,342][26022] Updated weights on worker 0-0, policy_version 352192 (0.00087) [2022-07-09 17:40:35,131][26022] Updated weights on worker 0-0, policy_version 352202 (0.00101) [2022-07-09 17:40:36,092][25689] Fps is (10 sec: 5634.5, 60 sec: 5667.5, 300 sec: 5670.6). Total num frames: 360659968. Throughput: 0: 5916.9. Samples: 360658752. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:36,093][25689] Avg episode reward: [(0, '-48.490')] [2022-07-09 17:40:36,898][26022] Updated weights on worker 0-0, policy_version 352212 (0.00109) [2022-07-09 17:40:38,688][26022] Updated weights on worker 0-0, policy_version 352222 (0.00092) [2022-07-09 17:40:40,587][26022] Updated weights on worker 0-0, policy_version 352232 (0.00090) [2022-07-09 17:40:41,135][25689] Fps is (10 sec: 5557.6, 60 sec: 5631.1, 300 sec: 5663.3). Total num frames: 360687616. Throughput: 0: 5902.6. Samples: 360692946. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:41,135][25689] Avg episode reward: [(0, '-48.136')] [2022-07-09 17:40:42,326][26022] Updated weights on worker 0-0, policy_version 352242 (0.00067) [2022-07-09 17:40:44,152][26022] Updated weights on worker 0-0, policy_version 352252 (0.00081) [2022-07-09 17:40:46,028][26022] Updated weights on worker 0-0, policy_version 352262 (0.00088) [2022-07-09 17:40:46,151][25689] Fps is (10 sec: 5701.0, 60 sec: 5664.0, 300 sec: 5671.6). Total num frames: 360717312. Throughput: 0: 5042.2. Samples: 360710038. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:46,152][25689] Avg episode reward: [(0, '-47.760')] [2022-07-09 17:40:47,818][26022] Updated weights on worker 0-0, policy_version 352272 (0.00086) [2022-07-09 17:40:49,478][26022] Updated weights on worker 0-0, policy_version 352282 (0.00084) [2022-07-09 17:40:51,163][25689] Fps is (10 sec: 5718.4, 60 sec: 5613.1, 300 sec: 5662.9). Total num frames: 360744960. Throughput: 0: 5937.7. Samples: 360744526. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:51,163][25689] Avg episode reward: [(0, '-48.117')] [2022-07-09 17:40:51,434][26022] Updated weights on worker 0-0, policy_version 352292 (0.00088) [2022-07-09 17:40:53,086][26022] Updated weights on worker 0-0, policy_version 352302 (0.00090) [2022-07-09 17:40:54,866][26022] Updated weights on worker 0-0, policy_version 352312 (0.00093) [2022-07-09 17:40:56,275][25689] Fps is (10 sec: 5664.2, 60 sec: 5661.9, 300 sec: 5668.7). Total num frames: 360774656. Throughput: 0: 5940.5. Samples: 360778646. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:40:56,278][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 17:40:56,953][26022] Updated weights on worker 0-0, policy_version 352322 (0.00087) [2022-07-09 17:40:58,469][26022] Updated weights on worker 0-0, policy_version 352332 (0.00080) [2022-07-09 17:41:00,394][26022] Updated weights on worker 0-0, policy_version 352342 (0.00086) [2022-07-09 17:41:01,319][25689] Fps is (10 sec: 5847.6, 60 sec: 5676.3, 300 sec: 5678.7). Total num frames: 360804352. Throughput: 0: 5938.6. Samples: 360812814. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:41:01,320][25689] Avg episode reward: [(0, '-47.943')] [2022-07-09 17:41:02,360][26022] Updated weights on worker 0-0, policy_version 352352 (0.00084) [2022-07-09 17:41:04,307][26022] Updated weights on worker 0-0, policy_version 352362 (0.00091) [2022-07-09 17:41:06,272][26022] Updated weights on worker 0-0, policy_version 352372 (0.00095) [2022-07-09 17:41:06,412][25689] Fps is (10 sec: 5353.9, 60 sec: 5618.6, 300 sec: 5656.3). Total num frames: 360828928. Throughput: 0: 5799.1. Samples: 360827534. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:41:06,412][25689] Avg episode reward: [(0, '-47.376')] [2022-07-09 17:41:07,840][26022] Updated weights on worker 0-0, policy_version 352382 (0.00087) [2022-07-09 17:41:09,943][26022] Updated weights on worker 0-0, policy_version 352392 (0.00093) [2022-07-09 17:41:11,429][25689] Fps is (10 sec: 5368.6, 60 sec: 5653.0, 300 sec: 5664.6). Total num frames: 360858624. Throughput: 0: 5778.7. Samples: 360861636. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-09 17:41:11,429][25689] Avg episode reward: [(0, '-47.424')] [2022-07-09 17:41:11,560][26022] Updated weights on worker 0-0, policy_version 352402 (0.00086) [2022-07-09 17:41:13,272][26022] Updated weights on worker 0-0, policy_version 352412 (0.00090) [2022-07-09 17:41:15,216][26022] Updated weights on worker 0-0, policy_version 352422 (0.00092) [2022-07-09 17:41:16,483][25689] Fps is (10 sec: 5795.5, 60 sec: 5677.1, 300 sec: 5660.6). Total num frames: 360887296. Throughput: 0: 5809.6. Samples: 360896048. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:16,485][25689] Avg episode reward: [(0, '-47.080')] [2022-07-09 17:41:16,837][26022] Updated weights on worker 0-0, policy_version 352432 (0.00437) [2022-07-09 17:41:18,550][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:41:18,571][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000352441_360899584.pth [2022-07-09 17:41:18,572][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000350446_358856704.pth [2022-07-09 17:41:18,765][26022] Updated weights on worker 0-0, policy_version 352442 (0.00086) [2022-07-09 17:41:20,585][26022] Updated weights on worker 0-0, policy_version 352452 (0.00086) [2022-07-09 17:41:21,486][25689] Fps is (10 sec: 5599.9, 60 sec: 5628.5, 300 sec: 5660.8). Total num frames: 360914944. Throughput: 0: 4978.1. Samples: 360913204. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:21,488][25689] Avg episode reward: [(0, '-47.548')] [2022-07-09 17:41:22,267][26022] Updated weights on worker 0-0, policy_version 352462 (0.00092) [2022-07-09 17:41:24,102][26022] Updated weights on worker 0-0, policy_version 352472 (0.00083) [2022-07-09 17:41:26,002][26022] Updated weights on worker 0-0, policy_version 352482 (0.00095) [2022-07-09 17:41:26,492][25689] Fps is (10 sec: 5729.4, 60 sec: 5646.0, 300 sec: 5653.9). Total num frames: 360944640. Throughput: 0: 5974.0. Samples: 360947492. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:26,492][25689] Avg episode reward: [(0, '-47.785')] [2022-07-09 17:41:27,719][26022] Updated weights on worker 0-0, policy_version 352492 (0.00100) [2022-07-09 17:41:29,519][26022] Updated weights on worker 0-0, policy_version 352502 (0.00087) [2022-07-09 17:41:31,003][26022] Updated weights on worker 0-0, policy_version 352512 (0.00088) [2022-07-09 17:41:31,507][25689] Fps is (10 sec: 5824.3, 60 sec: 5652.6, 300 sec: 5661.4). Total num frames: 360973312. Throughput: 0: 5997.9. Samples: 360982066. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:31,508][25689] Avg episode reward: [(0, '-48.212')] [2022-07-09 17:41:33,256][26022] Updated weights on worker 0-0, policy_version 352522 (0.00086) [2022-07-09 17:41:34,767][26022] Updated weights on worker 0-0, policy_version 352532 (0.00084) [2022-07-09 17:41:36,583][25689] Fps is (10 sec: 5682.6, 60 sec: 5654.0, 300 sec: 5653.2). Total num frames: 361001984. Throughput: 0: 5117.0. Samples: 360998900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:36,583][25689] Avg episode reward: [(0, '-49.321')] [2022-07-09 17:41:36,743][26022] Updated weights on worker 0-0, policy_version 352542 (0.00604) [2022-07-09 17:41:38,491][26022] Updated weights on worker 0-0, policy_version 352552 (0.00091) [2022-07-09 17:41:40,343][26022] Updated weights on worker 0-0, policy_version 352562 (0.00087) [2022-07-09 17:41:41,616][25689] Fps is (10 sec: 5774.1, 60 sec: 5688.8, 300 sec: 5660.5). Total num frames: 361031680. Throughput: 0: 5979.6. Samples: 361033574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:41,617][25689] Avg episode reward: [(0, '-49.349')] [2022-07-09 17:41:41,948][26022] Updated weights on worker 0-0, policy_version 352572 (0.00085) [2022-07-09 17:41:43,936][26022] Updated weights on worker 0-0, policy_version 352582 (0.00089) [2022-07-09 17:41:45,588][26022] Updated weights on worker 0-0, policy_version 352592 (0.00093) [2022-07-09 17:41:46,643][25689] Fps is (10 sec: 5598.0, 60 sec: 5636.9, 300 sec: 5653.7). Total num frames: 361058304. Throughput: 0: 5947.0. Samples: 361067334. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:46,644][25689] Avg episode reward: [(0, '-50.008')] [2022-07-09 17:41:47,359][26022] Updated weights on worker 0-0, policy_version 352602 (0.00084) [2022-07-09 17:41:49,244][26022] Updated weights on worker 0-0, policy_version 352612 (0.00084) [2022-07-09 17:41:50,967][26022] Updated weights on worker 0-0, policy_version 352622 (0.00109) [2022-07-09 17:41:51,653][25689] Fps is (10 sec: 5713.3, 60 sec: 5687.9, 300 sec: 5662.3). Total num frames: 361089024. Throughput: 0: 5080.1. Samples: 361084408. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:51,653][25689] Avg episode reward: [(0, '-49.672')] [2022-07-09 17:41:53,072][26022] Updated weights on worker 0-0, policy_version 352632 (0.00090) [2022-07-09 17:41:54,659][26022] Updated weights on worker 0-0, policy_version 352642 (0.00088) [2022-07-09 17:41:56,593][26022] Updated weights on worker 0-0, policy_version 352652 (0.00092) [2022-07-09 17:41:56,732][25689] Fps is (10 sec: 5785.8, 60 sec: 5657.2, 300 sec: 5655.1). Total num frames: 361116672. Throughput: 0: 5950.6. Samples: 361118798. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:41:56,732][25689] Avg episode reward: [(0, '-49.594')] [2022-07-09 17:41:58,148][26022] Updated weights on worker 0-0, policy_version 352662 (0.00817) [2022-07-09 17:42:00,195][26022] Updated weights on worker 0-0, policy_version 352672 (0.00087) [2022-07-09 17:42:01,759][25689] Fps is (10 sec: 5370.0, 60 sec: 5607.9, 300 sec: 5662.1). Total num frames: 361143296. Throughput: 0: 5873.1. Samples: 361151878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:01,760][25689] Avg episode reward: [(0, '-48.921')] [2022-07-09 17:42:02,225][26022] Updated weights on worker 0-0, policy_version 352682 (0.00092) [2022-07-09 17:42:04,218][26022] Updated weights on worker 0-0, policy_version 352692 (0.00085) [2022-07-09 17:42:05,998][26022] Updated weights on worker 0-0, policy_version 352702 (0.00090) [2022-07-09 17:42:06,779][25689] Fps is (10 sec: 5401.6, 60 sec: 5665.6, 300 sec: 5652.1). Total num frames: 361170944. Throughput: 0: 4974.7. Samples: 361167502. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:06,779][25689] Avg episode reward: [(0, '-48.033')] [2022-07-09 17:42:07,838][26022] Updated weights on worker 0-0, policy_version 352712 (0.00086) [2022-07-09 17:42:09,578][26022] Updated weights on worker 0-0, policy_version 352722 (0.00093) [2022-07-09 17:42:11,463][26022] Updated weights on worker 0-0, policy_version 352732 (0.00080) [2022-07-09 17:42:11,802][25689] Fps is (10 sec: 5506.0, 60 sec: 5631.1, 300 sec: 5652.6). Total num frames: 361198592. Throughput: 0: 5796.7. Samples: 361201208. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:11,803][25689] Avg episode reward: [(0, '-48.155')] [2022-07-09 17:42:13,319][26022] Updated weights on worker 0-0, policy_version 352742 (0.00085) [2022-07-09 17:42:15,098][26022] Updated weights on worker 0-0, policy_version 352752 (0.00088) [2022-07-09 17:42:16,904][25689] Fps is (10 sec: 5663.5, 60 sec: 5643.6, 300 sec: 5654.2). Total num frames: 361228288. Throughput: 0: 5762.4. Samples: 361235040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:16,905][25689] Avg episode reward: [(0, '-47.785')] [2022-07-09 17:42:16,906][26022] Updated weights on worker 0-0, policy_version 352762 (0.00083) [2022-07-09 17:42:18,710][26022] Updated weights on worker 0-0, policy_version 352772 (0.00111) [2022-07-09 17:42:20,343][26022] Updated weights on worker 0-0, policy_version 352782 (0.00088) [2022-07-09 17:42:21,992][25689] Fps is (10 sec: 5728.3, 60 sec: 5652.6, 300 sec: 5652.6). Total num frames: 361256960. Throughput: 0: 4960.0. Samples: 361252230. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:21,992][25689] Avg episode reward: [(0, '-48.238')] [2022-07-09 17:42:22,224][26022] Updated weights on worker 0-0, policy_version 352792 (0.00094) [2022-07-09 17:42:23,913][26022] Updated weights on worker 0-0, policy_version 352802 (0.00087) [2022-07-09 17:42:25,934][26022] Updated weights on worker 0-0, policy_version 352812 (0.00084) [2022-07-09 17:42:27,087][25689] Fps is (10 sec: 5531.0, 60 sec: 5610.5, 300 sec: 5647.6). Total num frames: 361284608. Throughput: 0: 5858.2. Samples: 361286470. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:27,087][25689] Avg episode reward: [(0, '-47.895')] [2022-07-09 17:42:27,699][26022] Updated weights on worker 0-0, policy_version 352822 (0.00087) [2022-07-09 17:42:29,651][26022] Updated weights on worker 0-0, policy_version 352832 (0.00094) [2022-07-09 17:42:31,108][26022] Updated weights on worker 0-0, policy_version 352842 (0.00179) [2022-07-09 17:42:32,127][25689] Fps is (10 sec: 5758.9, 60 sec: 5642.0, 300 sec: 5656.2). Total num frames: 361315328. Throughput: 0: 5870.8. Samples: 361320530. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:32,128][25689] Avg episode reward: [(0, '-47.574')] [2022-07-09 17:42:33,447][26022] Updated weights on worker 0-0, policy_version 352852 (0.00098) [2022-07-09 17:42:34,773][26022] Updated weights on worker 0-0, policy_version 352862 (0.00086) [2022-07-09 17:42:36,846][26022] Updated weights on worker 0-0, policy_version 352872 (0.00086) [2022-07-09 17:42:37,212][25689] Fps is (10 sec: 5764.7, 60 sec: 5624.2, 300 sec: 5651.5). Total num frames: 361342976. Throughput: 0: 5045.4. Samples: 361337500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:37,212][25689] Avg episode reward: [(0, '-47.774')] [2022-07-09 17:42:38,582][26022] Updated weights on worker 0-0, policy_version 352882 (0.00086) [2022-07-09 17:42:40,276][26022] Updated weights on worker 0-0, policy_version 352892 (0.00091) [2022-07-09 17:42:42,131][26022] Updated weights on worker 0-0, policy_version 352902 (0.00088) [2022-07-09 17:42:42,218][25689] Fps is (10 sec: 5580.9, 60 sec: 5609.8, 300 sec: 5648.7). Total num frames: 361371648. Throughput: 0: 5919.7. Samples: 361371966. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:42,219][25689] Avg episode reward: [(0, '-48.077')] [2022-07-09 17:42:43,913][26022] Updated weights on worker 0-0, policy_version 352912 (0.00089) [2022-07-09 17:42:45,552][26022] Updated weights on worker 0-0, policy_version 352922 (0.00091) [2022-07-09 17:42:47,262][25689] Fps is (10 sec: 5807.8, 60 sec: 5659.0, 300 sec: 5658.5). Total num frames: 361401344. Throughput: 0: 5939.3. Samples: 361406296. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:47,262][25689] Avg episode reward: [(0, '-47.849')] [2022-07-09 17:42:47,630][26022] Updated weights on worker 0-0, policy_version 352932 (0.00094) [2022-07-09 17:42:49,364][26022] Updated weights on worker 0-0, policy_version 352942 (0.00086) [2022-07-09 17:42:51,019][26022] Updated weights on worker 0-0, policy_version 352952 (0.00080) [2022-07-09 17:42:52,353][25689] Fps is (10 sec: 5759.3, 60 sec: 5617.7, 300 sec: 5658.2). Total num frames: 361430016. Throughput: 0: 5087.2. Samples: 361423424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:52,353][25689] Avg episode reward: [(0, '-47.719')] [2022-07-09 17:42:52,781][26022] Updated weights on worker 0-0, policy_version 352962 (0.00088) [2022-07-09 17:42:54,681][26022] Updated weights on worker 0-0, policy_version 352972 (0.00096) [2022-07-09 17:42:56,560][26022] Updated weights on worker 0-0, policy_version 352982 (0.00091) [2022-07-09 17:42:57,464][25689] Fps is (10 sec: 5721.2, 60 sec: 5648.4, 300 sec: 5656.7). Total num frames: 361459712. Throughput: 0: 5928.6. Samples: 361457566. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:42:57,464][25689] Avg episode reward: [(0, '-47.831')] [2022-07-09 17:42:58,221][26022] Updated weights on worker 0-0, policy_version 352992 (0.00089) [2022-07-09 17:42:59,968][26022] Updated weights on worker 0-0, policy_version 353002 (0.00087) [2022-07-09 17:43:02,232][26022] Updated weights on worker 0-0, policy_version 353012 (0.00090) [2022-07-09 17:43:02,535][25689] Fps is (10 sec: 5330.3, 60 sec: 5610.7, 300 sec: 5653.5). Total num frames: 361484288. Throughput: 0: 5914.9. Samples: 361492136. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:02,535][25689] Avg episode reward: [(0, '-47.875')] [2022-07-09 17:43:03,927][26022] Updated weights on worker 0-0, policy_version 353022 (0.00089) [2022-07-09 17:43:06,058][26022] Updated weights on worker 0-0, policy_version 353032 (0.00084) [2022-07-09 17:43:07,565][25689] Fps is (10 sec: 5474.2, 60 sec: 5660.2, 300 sec: 5656.4). Total num frames: 361515008. Throughput: 0: 4954.2. Samples: 361506884. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:07,566][25689] Avg episode reward: [(0, '-47.486')] [2022-07-09 17:43:07,569][26022] Updated weights on worker 0-0, policy_version 353042 (0.00085) [2022-07-09 17:43:09,427][26022] Updated weights on worker 0-0, policy_version 353052 (0.00084) [2022-07-09 17:43:11,031][26022] Updated weights on worker 0-0, policy_version 353062 (0.00096) [2022-07-09 17:43:12,575][25689] Fps is (10 sec: 5813.2, 60 sec: 5661.5, 300 sec: 5650.1). Total num frames: 361542656. Throughput: 0: 5835.6. Samples: 361541434. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:12,576][25689] Avg episode reward: [(0, '-47.365')] [2022-07-09 17:43:13,070][26022] Updated weights on worker 0-0, policy_version 353072 (0.00088) [2022-07-09 17:43:14,990][26022] Updated weights on worker 0-0, policy_version 353082 (0.00091) [2022-07-09 17:43:16,585][26022] Updated weights on worker 0-0, policy_version 353092 (0.00085) [2022-07-09 17:43:17,636][25689] Fps is (10 sec: 5592.2, 60 sec: 5648.4, 300 sec: 5653.5). Total num frames: 361571328. Throughput: 0: 5836.9. Samples: 361575310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:17,637][25689] Avg episode reward: [(0, '-47.238')] [2022-07-09 17:43:18,493][26022] Updated weights on worker 0-0, policy_version 353102 (0.00084) [2022-07-09 17:43:18,727][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:43:18,742][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000353103_361577472.pth [2022-07-09 17:43:18,742][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000351113_359539712.pth [2022-07-09 17:43:20,164][26022] Updated weights on worker 0-0, policy_version 353112 (0.00089) [2022-07-09 17:43:22,053][26022] Updated weights on worker 0-0, policy_version 353122 (0.00105) [2022-07-09 17:43:22,687][25689] Fps is (10 sec: 5772.5, 60 sec: 5668.7, 300 sec: 5652.9). Total num frames: 361601024. Throughput: 0: 5825.4. Samples: 361609530. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:22,687][25689] Avg episode reward: [(0, '-46.769')] [2022-07-09 17:43:23,955][26022] Updated weights on worker 0-0, policy_version 353132 (0.00085) [2022-07-09 17:43:25,369][26022] Updated weights on worker 0-0, policy_version 353142 (0.00048) [2022-07-09 17:43:27,552][26022] Updated weights on worker 0-0, policy_version 353152 (0.00095) [2022-07-09 17:43:27,717][25689] Fps is (10 sec: 5688.4, 60 sec: 5674.8, 300 sec: 5652.8). Total num frames: 361628672. Throughput: 0: 5943.9. Samples: 361626666. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:27,718][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 17:43:29,308][26022] Updated weights on worker 0-0, policy_version 353162 (0.00090) [2022-07-09 17:43:31,056][26022] Updated weights on worker 0-0, policy_version 353172 (0.00088) [2022-07-09 17:43:32,780][25689] Fps is (10 sec: 5478.8, 60 sec: 5622.0, 300 sec: 5647.4). Total num frames: 361656320. Throughput: 0: 5893.9. Samples: 361660516. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:32,780][25689] Avg episode reward: [(0, '-46.842')] [2022-07-09 17:43:33,107][26022] Updated weights on worker 0-0, policy_version 353182 (0.00090) [2022-07-09 17:43:34,609][26022] Updated weights on worker 0-0, policy_version 353192 (0.00096) [2022-07-09 17:43:36,515][26022] Updated weights on worker 0-0, policy_version 353202 (0.00093) [2022-07-09 17:43:37,904][25689] Fps is (10 sec: 5729.8, 60 sec: 5669.0, 300 sec: 5652.2). Total num frames: 361687040. Throughput: 0: 5897.2. Samples: 361694832. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:37,904][25689] Avg episode reward: [(0, '-46.670')] [2022-07-09 17:43:38,295][26022] Updated weights on worker 0-0, policy_version 353212 (0.00102) [2022-07-09 17:43:39,919][26022] Updated weights on worker 0-0, policy_version 353222 (0.00084) [2022-07-09 17:43:41,916][26022] Updated weights on worker 0-0, policy_version 353232 (0.00083) [2022-07-09 17:43:42,907][25689] Fps is (10 sec: 5965.4, 60 sec: 5686.2, 300 sec: 5655.9). Total num frames: 361716736. Throughput: 0: 5072.2. Samples: 361712092. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:42,908][25689] Avg episode reward: [(0, '-45.896')] [2022-07-09 17:43:43,503][26022] Updated weights on worker 0-0, policy_version 353242 (0.00095) [2022-07-09 17:43:45,455][26022] Updated weights on worker 0-0, policy_version 353252 (0.00435) [2022-07-09 17:43:47,037][26022] Updated weights on worker 0-0, policy_version 353262 (0.00079) [2022-07-09 17:43:47,979][25689] Fps is (10 sec: 5691.8, 60 sec: 5649.8, 300 sec: 5651.4). Total num frames: 361744384. Throughput: 0: 5915.4. Samples: 361746520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 17:43:47,979][25689] Avg episode reward: [(0, '-47.008')] [2022-07-09 17:43:48,886][26022] Updated weights on worker 0-0, policy_version 353272 (0.00094) [2022-07-09 17:43:50,848][26022] Updated weights on worker 0-0, policy_version 353282 (0.00090) [2022-07-09 17:43:52,596][26022] Updated weights on worker 0-0, policy_version 353292 (0.00091) [2022-07-09 17:43:52,993][25689] Fps is (10 sec: 5584.1, 60 sec: 5657.0, 300 sec: 5652.1). Total num frames: 361773056. Throughput: 0: 5941.5. Samples: 361780616. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:43:52,994][25689] Avg episode reward: [(0, '-46.853')] [2022-07-09 17:43:54,360][26022] Updated weights on worker 0-0, policy_version 353302 (0.00085) [2022-07-09 17:43:56,210][26022] Updated weights on worker 0-0, policy_version 353312 (0.00097) [2022-07-09 17:43:57,961][26022] Updated weights on worker 0-0, policy_version 353322 (0.00527) [2022-07-09 17:43:58,096][25689] Fps is (10 sec: 5769.3, 60 sec: 5657.8, 300 sec: 5657.2). Total num frames: 361802752. Throughput: 0: 5091.6. Samples: 361797640. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:43:58,096][25689] Avg episode reward: [(0, '-48.142')] [2022-07-09 17:44:00,000][26022] Updated weights on worker 0-0, policy_version 353332 (0.00086) [2022-07-09 17:44:01,516][26022] Updated weights on worker 0-0, policy_version 353342 (0.00104) [2022-07-09 17:44:03,115][25689] Fps is (10 sec: 5564.3, 60 sec: 5696.4, 300 sec: 5653.7). Total num frames: 361829376. Throughput: 0: 5930.0. Samples: 361831922. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:03,115][25689] Avg episode reward: [(0, '-49.203')] [2022-07-09 17:44:03,751][26022] Updated weights on worker 0-0, policy_version 353352 (0.00092) [2022-07-09 17:44:05,450][26022] Updated weights on worker 0-0, policy_version 353362 (0.00094) [2022-07-09 17:44:07,281][26022] Updated weights on worker 0-0, policy_version 353372 (0.00081) [2022-07-09 17:44:08,144][25689] Fps is (10 sec: 5401.2, 60 sec: 5645.8, 300 sec: 5653.5). Total num frames: 361857024. Throughput: 0: 5835.7. Samples: 361864198. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:08,144][25689] Avg episode reward: [(0, '-49.715')] [2022-07-09 17:44:08,979][26022] Updated weights on worker 0-0, policy_version 353382 (0.00085) [2022-07-09 17:44:10,972][26022] Updated weights on worker 0-0, policy_version 353392 (0.00362) [2022-07-09 17:44:12,699][26022] Updated weights on worker 0-0, policy_version 353402 (0.00090) [2022-07-09 17:44:13,148][25689] Fps is (10 sec: 5715.7, 60 sec: 5680.2, 300 sec: 5654.1). Total num frames: 361886720. Throughput: 0: 5000.7. Samples: 361881402. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:13,148][25689] Avg episode reward: [(0, '-49.284')] [2022-07-09 17:44:14,668][26022] Updated weights on worker 0-0, policy_version 353412 (0.00092) [2022-07-09 17:44:16,168][26022] Updated weights on worker 0-0, policy_version 353422 (0.00092) [2022-07-09 17:44:18,195][26022] Updated weights on worker 0-0, policy_version 353432 (0.00096) [2022-07-09 17:44:18,270][25689] Fps is (10 sec: 5662.9, 60 sec: 5657.6, 300 sec: 5651.9). Total num frames: 361914368. Throughput: 0: 5848.9. Samples: 361915638. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:18,270][25689] Avg episode reward: [(0, '-49.626')] [2022-07-09 17:44:19,920][26022] Updated weights on worker 0-0, policy_version 353442 (0.00087) [2022-07-09 17:44:21,701][26022] Updated weights on worker 0-0, policy_version 353452 (0.00088) [2022-07-09 17:44:23,324][25689] Fps is (10 sec: 5634.8, 60 sec: 5657.2, 300 sec: 5655.2). Total num frames: 361944064. Throughput: 0: 5850.8. Samples: 361950164. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:23,326][25689] Avg episode reward: [(0, '-49.235')] [2022-07-09 17:44:23,505][26022] Updated weights on worker 0-0, policy_version 353462 (0.00050) [2022-07-09 17:44:25,157][26022] Updated weights on worker 0-0, policy_version 353472 (0.00091) [2022-07-09 17:44:27,033][26022] Updated weights on worker 0-0, policy_version 353482 (0.00086) [2022-07-09 17:44:28,328][25689] Fps is (10 sec: 5803.1, 60 sec: 5676.6, 300 sec: 5655.2). Total num frames: 361972736. Throughput: 0: 5113.2. Samples: 361967404. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:28,329][25689] Avg episode reward: [(0, '-47.873')] [2022-07-09 17:44:28,920][26022] Updated weights on worker 0-0, policy_version 353492 (0.00089) [2022-07-09 17:44:30,597][26022] Updated weights on worker 0-0, policy_version 353502 (0.00086) [2022-07-09 17:44:32,592][26022] Updated weights on worker 0-0, policy_version 353512 (0.00095) [2022-07-09 17:44:33,346][25689] Fps is (10 sec: 5722.1, 60 sec: 5697.7, 300 sec: 5656.8). Total num frames: 362001408. Throughput: 0: 5936.6. Samples: 362001310. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:33,346][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 17:44:34,191][26022] Updated weights on worker 0-0, policy_version 353522 (0.00094) [2022-07-09 17:44:36,110][26022] Updated weights on worker 0-0, policy_version 353532 (0.00097) [2022-07-09 17:44:37,989][26022] Updated weights on worker 0-0, policy_version 353542 (0.00098) [2022-07-09 17:44:38,415][25689] Fps is (10 sec: 5583.2, 60 sec: 5652.1, 300 sec: 5648.9). Total num frames: 362029056. Throughput: 0: 5927.1. Samples: 362035042. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:38,416][25689] Avg episode reward: [(0, '-46.538')] [2022-07-09 17:44:39,818][26022] Updated weights on worker 0-0, policy_version 353552 (0.00093) [2022-07-09 17:44:41,509][26022] Updated weights on worker 0-0, policy_version 353562 (0.00092) [2022-07-09 17:44:43,159][26022] Updated weights on worker 0-0, policy_version 353572 (0.00086) [2022-07-09 17:44:43,482][25689] Fps is (10 sec: 5657.4, 60 sec: 5646.2, 300 sec: 5654.6). Total num frames: 362058752. Throughput: 0: 5061.6. Samples: 362052194. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:43,483][25689] Avg episode reward: [(0, '-47.028')] [2022-07-09 17:44:45,149][26022] Updated weights on worker 0-0, policy_version 353582 (0.00085) [2022-07-09 17:44:46,938][26022] Updated weights on worker 0-0, policy_version 353592 (0.00752) [2022-07-09 17:44:48,536][25689] Fps is (10 sec: 5767.4, 60 sec: 5664.7, 300 sec: 5646.9). Total num frames: 362087424. Throughput: 0: 5893.6. Samples: 362086500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:48,538][25689] Avg episode reward: [(0, '-47.119')] [2022-07-09 17:44:48,842][26022] Updated weights on worker 0-0, policy_version 353602 (0.00085) [2022-07-09 17:44:50,388][26022] Updated weights on worker 0-0, policy_version 353612 (0.00082) [2022-07-09 17:44:52,368][26022] Updated weights on worker 0-0, policy_version 353622 (0.00083) [2022-07-09 17:44:53,541][25689] Fps is (10 sec: 5701.1, 60 sec: 5665.7, 300 sec: 5655.4). Total num frames: 362116096. Throughput: 0: 5914.9. Samples: 362120758. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:53,542][25689] Avg episode reward: [(0, '-47.572')] [2022-07-09 17:44:54,123][26022] Updated weights on worker 0-0, policy_version 353632 (0.00087) [2022-07-09 17:44:55,944][26022] Updated weights on worker 0-0, policy_version 353642 (0.00086) [2022-07-09 17:44:57,813][26022] Updated weights on worker 0-0, policy_version 353652 (0.00084) [2022-07-09 17:44:58,610][25689] Fps is (10 sec: 5691.9, 60 sec: 5651.8, 300 sec: 5654.4). Total num frames: 362144768. Throughput: 0: 5098.4. Samples: 362138004. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:44:58,612][25689] Avg episode reward: [(0, '-48.037')] [2022-07-09 17:44:59,406][26022] Updated weights on worker 0-0, policy_version 353662 (0.00092) [2022-07-09 17:45:01,246][26022] Updated weights on worker 0-0, policy_version 353672 (0.00085) [2022-07-09 17:45:03,591][26022] Updated weights on worker 0-0, policy_version 353682 (0.00089) [2022-07-09 17:45:03,633][25689] Fps is (10 sec: 5377.4, 60 sec: 5634.5, 300 sec: 5647.4). Total num frames: 362170368. Throughput: 0: 5928.5. Samples: 362171660. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:03,635][25689] Avg episode reward: [(0, '-48.000')] [2022-07-09 17:45:05,124][26022] Updated weights on worker 0-0, policy_version 353692 (0.00089) [2022-07-09 17:45:07,053][26022] Updated weights on worker 0-0, policy_version 353702 (0.00083) [2022-07-09 17:45:08,599][26022] Updated weights on worker 0-0, policy_version 353712 (0.00096) [2022-07-09 17:45:08,649][25689] Fps is (10 sec: 5610.1, 60 sec: 5686.5, 300 sec: 5657.9). Total num frames: 362201088. Throughput: 0: 5872.0. Samples: 362204608. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:08,650][25689] Avg episode reward: [(0, '-46.512')] [2022-07-09 17:45:10,655][26022] Updated weights on worker 0-0, policy_version 353722 (0.00085) [2022-07-09 17:45:12,290][26022] Updated weights on worker 0-0, policy_version 353732 (0.00086) [2022-07-09 17:45:13,695][25689] Fps is (10 sec: 5597.5, 60 sec: 5614.9, 300 sec: 5652.6). Total num frames: 362226688. Throughput: 0: 5016.1. Samples: 362221854. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:13,696][25689] Avg episode reward: [(0, '-45.928')] [2022-07-09 17:45:14,099][26022] Updated weights on worker 0-0, policy_version 353742 (0.00086) [2022-07-09 17:45:15,838][26022] Updated weights on worker 0-0, policy_version 353752 (0.00076) [2022-07-09 17:45:17,676][26022] Updated weights on worker 0-0, policy_version 353762 (0.00252) [2022-07-09 17:45:18,794][25689] Fps is (10 sec: 5652.7, 60 sec: 5684.8, 300 sec: 5654.7). Total num frames: 362258432. Throughput: 0: 5857.9. Samples: 362256238. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:18,796][25689] Avg episode reward: [(0, '-45.977')] [2022-07-09 17:45:18,800][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:45:18,817][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000353769_362259456.pth [2022-07-09 17:45:18,818][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000351779_360221696.pth [2022-07-09 17:45:19,519][26022] Updated weights on worker 0-0, policy_version 353772 (0.00090) [2022-07-09 17:45:21,208][26022] Updated weights on worker 0-0, policy_version 353782 (0.00085) [2022-07-09 17:45:23,182][26022] Updated weights on worker 0-0, policy_version 353792 (0.00084) [2022-07-09 17:45:23,803][25689] Fps is (10 sec: 5976.8, 60 sec: 5672.1, 300 sec: 5654.7). Total num frames: 362287104. Throughput: 0: 5902.2. Samples: 362290708. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:23,804][25689] Avg episode reward: [(0, '-45.469')] [2022-07-09 17:45:24,711][26022] Updated weights on worker 0-0, policy_version 353802 (0.00080) [2022-07-09 17:45:26,603][26022] Updated weights on worker 0-0, policy_version 353812 (0.00094) [2022-07-09 17:45:28,455][26022] Updated weights on worker 0-0, policy_version 353822 (0.00085) [2022-07-09 17:45:28,827][25689] Fps is (10 sec: 5817.7, 60 sec: 5687.1, 300 sec: 5659.4). Total num frames: 362316800. Throughput: 0: 5985.8. Samples: 362325384. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:28,827][25689] Avg episode reward: [(0, '-45.530')] [2022-07-09 17:45:30,196][26022] Updated weights on worker 0-0, policy_version 353832 (0.00087) [2022-07-09 17:45:31,935][26022] Updated weights on worker 0-0, policy_version 353842 (0.00081) [2022-07-09 17:45:33,723][26022] Updated weights on worker 0-0, policy_version 353852 (0.00084) [2022-07-09 17:45:33,847][25689] Fps is (10 sec: 5811.5, 60 sec: 5686.9, 300 sec: 5660.7). Total num frames: 362345472. Throughput: 0: 5987.9. Samples: 362342522. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:33,847][25689] Avg episode reward: [(0, '-45.412')] [2022-07-09 17:45:35,647][26022] Updated weights on worker 0-0, policy_version 353862 (0.00094) [2022-07-09 17:45:37,217][26022] Updated weights on worker 0-0, policy_version 353872 (0.00087) [2022-07-09 17:45:38,934][25689] Fps is (10 sec: 5572.2, 60 sec: 5685.3, 300 sec: 5659.9). Total num frames: 362373120. Throughput: 0: 5988.3. Samples: 362376842. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:38,935][25689] Avg episode reward: [(0, '-46.228')] [2022-07-09 17:45:39,167][26022] Updated weights on worker 0-0, policy_version 353882 (0.00092) [2022-07-09 17:45:40,863][26022] Updated weights on worker 0-0, policy_version 353892 (0.00083) [2022-07-09 17:45:42,601][26022] Updated weights on worker 0-0, policy_version 353902 (0.00084) [2022-07-09 17:45:43,965][25689] Fps is (10 sec: 5565.9, 60 sec: 5671.6, 300 sec: 5656.1). Total num frames: 362401792. Throughput: 0: 5981.1. Samples: 362411300. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:43,966][25689] Avg episode reward: [(0, '-47.137')] [2022-07-09 17:45:44,487][26022] Updated weights on worker 0-0, policy_version 353912 (0.00085) [2022-07-09 17:45:46,309][26022] Updated weights on worker 0-0, policy_version 353922 (0.00086) [2022-07-09 17:45:48,027][26022] Updated weights on worker 0-0, policy_version 353932 (0.00083) [2022-07-09 17:45:48,981][25689] Fps is (10 sec: 5707.3, 60 sec: 5675.2, 300 sec: 5659.5). Total num frames: 362430464. Throughput: 0: 5101.9. Samples: 362428212. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:48,982][25689] Avg episode reward: [(0, '-46.315')] [2022-07-09 17:45:49,891][26022] Updated weights on worker 0-0, policy_version 353942 (0.00100) [2022-07-09 17:45:51,623][26022] Updated weights on worker 0-0, policy_version 353952 (0.00081) [2022-07-09 17:45:53,586][26022] Updated weights on worker 0-0, policy_version 353962 (0.00094) [2022-07-09 17:45:53,993][25689] Fps is (10 sec: 5718.5, 60 sec: 5674.5, 300 sec: 5658.0). Total num frames: 362459136. Throughput: 0: 5945.0. Samples: 362462294. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:53,995][25689] Avg episode reward: [(0, '-47.345')] [2022-07-09 17:45:55,274][26022] Updated weights on worker 0-0, policy_version 353972 (0.00084) [2022-07-09 17:45:57,107][26022] Updated weights on worker 0-0, policy_version 353982 (0.00084) [2022-07-09 17:45:58,890][26022] Updated weights on worker 0-0, policy_version 353992 (0.00092) [2022-07-09 17:45:59,065][25689] Fps is (10 sec: 5788.7, 60 sec: 5691.3, 300 sec: 5657.5). Total num frames: 362488832. Throughput: 0: 5952.8. Samples: 362496676. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:45:59,067][25689] Avg episode reward: [(0, '-47.752')] [2022-07-09 17:46:00,755][26022] Updated weights on worker 0-0, policy_version 354002 (0.00084) [2022-07-09 17:46:02,976][26022] Updated weights on worker 0-0, policy_version 354012 (0.00089) [2022-07-09 17:46:04,139][25689] Fps is (10 sec: 5550.9, 60 sec: 5703.4, 300 sec: 5664.7). Total num frames: 362515456. Throughput: 0: 4990.6. Samples: 362511980. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:46:04,140][25689] Avg episode reward: [(0, '-47.243')] [2022-07-09 17:46:04,593][26022] Updated weights on worker 0-0, policy_version 354022 (0.00050) [2022-07-09 17:46:06,418][26022] Updated weights on worker 0-0, policy_version 354032 (0.00099) [2022-07-09 17:46:08,338][26022] Updated weights on worker 0-0, policy_version 354042 (0.00087) [2022-07-09 17:46:09,175][25689] Fps is (10 sec: 5367.9, 60 sec: 5650.8, 300 sec: 5657.4). Total num frames: 362543104. Throughput: 0: 5814.3. Samples: 362545624. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:46:09,177][25689] Avg episode reward: [(0, '-46.405')] [2022-07-09 17:46:10,111][26022] Updated weights on worker 0-0, policy_version 354052 (0.00087) [2022-07-09 17:46:11,867][26022] Updated weights on worker 0-0, policy_version 354062 (0.00095) [2022-07-09 17:46:13,593][26022] Updated weights on worker 0-0, policy_version 354072 (0.00088) [2022-07-09 17:46:14,197][25689] Fps is (10 sec: 5599.5, 60 sec: 5703.7, 300 sec: 5658.0). Total num frames: 362571776. Throughput: 0: 5839.7. Samples: 362580280. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:46:14,199][25689] Avg episode reward: [(0, '-46.833')] [2022-07-09 17:46:15,491][26022] Updated weights on worker 0-0, policy_version 354082 (0.00085) [2022-07-09 17:46:17,094][26022] Updated weights on worker 0-0, policy_version 354092 (0.00093) [2022-07-09 17:46:19,116][26022] Updated weights on worker 0-0, policy_version 354102 (0.00089) [2022-07-09 17:46:19,263][25689] Fps is (10 sec: 5785.9, 60 sec: 5673.0, 300 sec: 5663.7). Total num frames: 362601472. Throughput: 0: 4987.4. Samples: 362597414. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-09 17:46:19,265][25689] Avg episode reward: [(0, '-46.721')] [2022-07-09 17:46:20,762][26022] Updated weights on worker 0-0, policy_version 354112 (0.00087) [2022-07-09 17:46:22,610][26022] Updated weights on worker 0-0, policy_version 354122 (0.00088) [2022-07-09 17:46:24,146][26022] Updated weights on worker 0-0, policy_version 354132 (0.00085) [2022-07-09 17:46:24,291][25689] Fps is (10 sec: 5884.2, 60 sec: 5688.2, 300 sec: 5663.3). Total num frames: 362631168. Throughput: 0: 5934.7. Samples: 362631574. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:24,292][25689] Avg episode reward: [(0, '-46.898')] [2022-07-09 17:46:26,278][26022] Updated weights on worker 0-0, policy_version 354142 (0.00087) [2022-07-09 17:46:28,130][26022] Updated weights on worker 0-0, policy_version 354152 (0.00091) [2022-07-09 17:46:29,327][25689] Fps is (10 sec: 5698.0, 60 sec: 5653.1, 300 sec: 5659.5). Total num frames: 362658816. Throughput: 0: 5968.7. Samples: 362665904. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:29,328][25689] Avg episode reward: [(0, '-47.461')] [2022-07-09 17:46:29,712][26022] Updated weights on worker 0-0, policy_version 354162 (0.00083) [2022-07-09 17:46:31,758][26022] Updated weights on worker 0-0, policy_version 354172 (0.00087) [2022-07-09 17:46:33,377][26022] Updated weights on worker 0-0, policy_version 354182 (0.00086) [2022-07-09 17:46:34,357][25689] Fps is (10 sec: 5493.5, 60 sec: 5635.3, 300 sec: 5656.9). Total num frames: 362686464. Throughput: 0: 5088.4. Samples: 362682858. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:34,357][25689] Avg episode reward: [(0, '-47.528')] [2022-07-09 17:46:35,218][26022] Updated weights on worker 0-0, policy_version 354192 (0.00090) [2022-07-09 17:46:37,029][26022] Updated weights on worker 0-0, policy_version 354202 (0.00097) [2022-07-09 17:46:38,703][26022] Updated weights on worker 0-0, policy_version 354212 (0.00086) [2022-07-09 17:46:39,423][25689] Fps is (10 sec: 5578.6, 60 sec: 5654.2, 300 sec: 5652.9). Total num frames: 362715136. Throughput: 0: 5927.1. Samples: 362716902. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:39,424][25689] Avg episode reward: [(0, '-47.013')] [2022-07-09 17:46:40,751][26022] Updated weights on worker 0-0, policy_version 354222 (0.00097) [2022-07-09 17:46:42,584][26022] Updated weights on worker 0-0, policy_version 354232 (0.00084) [2022-07-09 17:46:44,175][26022] Updated weights on worker 0-0, policy_version 354242 (0.00082) [2022-07-09 17:46:44,457][25689] Fps is (10 sec: 5880.1, 60 sec: 5687.8, 300 sec: 5666.5). Total num frames: 362745856. Throughput: 0: 5939.3. Samples: 362751348. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:44,459][25689] Avg episode reward: [(0, '-47.256')] [2022-07-09 17:46:46,106][26022] Updated weights on worker 0-0, policy_version 354252 (0.00081) [2022-07-09 17:46:47,662][26022] Updated weights on worker 0-0, policy_version 354262 (0.00086) [2022-07-09 17:46:49,463][25689] Fps is (10 sec: 5711.1, 60 sec: 5654.8, 300 sec: 5652.8). Total num frames: 362772480. Throughput: 0: 5102.3. Samples: 362768646. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:49,465][25689] Avg episode reward: [(0, '-48.093')] [2022-07-09 17:46:49,604][26022] Updated weights on worker 0-0, policy_version 354272 (0.00091) [2022-07-09 17:46:51,376][26022] Updated weights on worker 0-0, policy_version 354282 (0.00051) [2022-07-09 17:46:53,296][26022] Updated weights on worker 0-0, policy_version 354292 (0.00108) [2022-07-09 17:46:54,487][25689] Fps is (10 sec: 5615.4, 60 sec: 5670.7, 300 sec: 5660.7). Total num frames: 362802176. Throughput: 0: 5955.5. Samples: 362802742. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:54,487][25689] Avg episode reward: [(0, '-48.749')] [2022-07-09 17:46:55,205][26022] Updated weights on worker 0-0, policy_version 354302 (0.00092) [2022-07-09 17:46:56,737][26022] Updated weights on worker 0-0, policy_version 354312 (0.00092) [2022-07-09 17:46:58,615][26022] Updated weights on worker 0-0, policy_version 354322 (0.00088) [2022-07-09 17:46:59,594][25689] Fps is (10 sec: 5761.7, 60 sec: 5650.4, 300 sec: 5666.1). Total num frames: 362830848. Throughput: 0: 5955.8. Samples: 362837038. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:46:59,594][25689] Avg episode reward: [(0, '-48.366')] [2022-07-09 17:47:00,289][26022] Updated weights on worker 0-0, policy_version 354332 (0.00091) [2022-07-09 17:47:02,724][26022] Updated weights on worker 0-0, policy_version 354342 (0.00093) [2022-07-09 17:47:04,288][26022] Updated weights on worker 0-0, policy_version 354352 (0.00126) [2022-07-09 17:47:04,651][25689] Fps is (10 sec: 5440.3, 60 sec: 5652.1, 300 sec: 5661.9). Total num frames: 362857472. Throughput: 0: 4987.3. Samples: 362852060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:04,651][25689] Avg episode reward: [(0, '-48.693')] [2022-07-09 17:47:06,226][26022] Updated weights on worker 0-0, policy_version 354362 (0.00085) [2022-07-09 17:47:07,812][26022] Updated weights on worker 0-0, policy_version 354372 (0.00091) [2022-07-09 17:47:09,726][25689] Fps is (10 sec: 5457.6, 60 sec: 5665.3, 300 sec: 5664.4). Total num frames: 362886144. Throughput: 0: 5809.3. Samples: 362886356. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:09,726][25689] Avg episode reward: [(0, '-49.139')] [2022-07-09 17:47:09,929][26022] Updated weights on worker 0-0, policy_version 354382 (0.00087) [2022-07-09 17:47:11,658][26022] Updated weights on worker 0-0, policy_version 354392 (0.00086) [2022-07-09 17:47:13,483][26022] Updated weights on worker 0-0, policy_version 354402 (0.00086) [2022-07-09 17:47:14,734][25689] Fps is (10 sec: 5687.0, 60 sec: 5666.6, 300 sec: 5662.7). Total num frames: 362914816. Throughput: 0: 5809.6. Samples: 362920372. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:14,735][25689] Avg episode reward: [(0, '-48.516')] [2022-07-09 17:47:15,143][26022] Updated weights on worker 0-0, policy_version 354412 (0.00092) [2022-07-09 17:47:16,897][26022] Updated weights on worker 0-0, policy_version 354422 (0.00087) [2022-07-09 17:47:18,965][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:47:18,976][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000354432_362938368.pth [2022-07-09 17:47:18,977][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000352441_360899584.pth [2022-07-09 17:47:18,980][26022] Updated weights on worker 0-0, policy_version 354432 (0.00087) [2022-07-09 17:47:19,793][25689] Fps is (10 sec: 5696.3, 60 sec: 5650.4, 300 sec: 5663.3). Total num frames: 362943488. Throughput: 0: 4972.0. Samples: 362937468. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:19,793][25689] Avg episode reward: [(0, '-47.809')] [2022-07-09 17:47:20,656][26022] Updated weights on worker 0-0, policy_version 354442 (0.00088) [2022-07-09 17:47:22,499][26022] Updated weights on worker 0-0, policy_version 354452 (0.00086) [2022-07-09 17:47:24,352][26022] Updated weights on worker 0-0, policy_version 354462 (0.00092) [2022-07-09 17:47:24,802][25689] Fps is (10 sec: 5695.5, 60 sec: 5635.2, 300 sec: 5668.3). Total num frames: 362972160. Throughput: 0: 5924.6. Samples: 362971450. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:24,803][25689] Avg episode reward: [(0, '-45.961')] [2022-07-09 17:47:25,991][26022] Updated weights on worker 0-0, policy_version 354472 (0.00094) [2022-07-09 17:47:28,012][26022] Updated weights on worker 0-0, policy_version 354482 (0.00087) [2022-07-09 17:47:29,710][26022] Updated weights on worker 0-0, policy_version 354492 (0.00093) [2022-07-09 17:47:29,807][25689] Fps is (10 sec: 5624.1, 60 sec: 5638.1, 300 sec: 5658.7). Total num frames: 362999808. Throughput: 0: 5933.6. Samples: 363005508. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:29,807][25689] Avg episode reward: [(0, '-46.308')] [2022-07-09 17:47:31,427][26022] Updated weights on worker 0-0, policy_version 354502 (0.00089) [2022-07-09 17:47:33,444][26022] Updated weights on worker 0-0, policy_version 354512 (0.00098) [2022-07-09 17:47:34,859][25689] Fps is (10 sec: 5702.3, 60 sec: 5669.9, 300 sec: 5666.2). Total num frames: 363029504. Throughput: 0: 5076.3. Samples: 363022530. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:34,859][25689] Avg episode reward: [(0, '-46.257')] [2022-07-09 17:47:35,036][26022] Updated weights on worker 0-0, policy_version 354522 (0.00089) [2022-07-09 17:47:36,975][26022] Updated weights on worker 0-0, policy_version 354532 (0.00094) [2022-07-09 17:47:38,756][26022] Updated weights on worker 0-0, policy_version 354542 (0.00096) [2022-07-09 17:47:39,924][25689] Fps is (10 sec: 5667.8, 60 sec: 5653.0, 300 sec: 5661.6). Total num frames: 363057152. Throughput: 0: 5933.9. Samples: 363056926. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:39,925][25689] Avg episode reward: [(0, '-46.326')] [2022-07-09 17:47:40,357][26022] Updated weights on worker 0-0, policy_version 354552 (0.00065) [2022-07-09 17:47:42,570][26022] Updated weights on worker 0-0, policy_version 354562 (0.00091) [2022-07-09 17:47:43,919][26022] Updated weights on worker 0-0, policy_version 354572 (0.00094) [2022-07-09 17:47:45,012][25689] Fps is (10 sec: 5647.6, 60 sec: 5631.1, 300 sec: 5660.8). Total num frames: 363086848. Throughput: 0: 5922.0. Samples: 363091134. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:45,013][25689] Avg episode reward: [(0, '-46.340')] [2022-07-09 17:47:46,008][26022] Updated weights on worker 0-0, policy_version 354582 (0.00083) [2022-07-09 17:47:47,523][26022] Updated weights on worker 0-0, policy_version 354592 (0.00086) [2022-07-09 17:47:49,481][26022] Updated weights on worker 0-0, policy_version 354602 (0.00050) [2022-07-09 17:47:50,069][25689] Fps is (10 sec: 5753.3, 60 sec: 5660.2, 300 sec: 5661.4). Total num frames: 363115520. Throughput: 0: 5076.1. Samples: 363108364. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:50,070][25689] Avg episode reward: [(0, '-46.216')] [2022-07-09 17:47:51,102][26022] Updated weights on worker 0-0, policy_version 354612 (0.00087) [2022-07-09 17:47:53,099][26022] Updated weights on worker 0-0, policy_version 354622 (0.00082) [2022-07-09 17:47:54,763][26022] Updated weights on worker 0-0, policy_version 354632 (0.00080) [2022-07-09 17:47:55,111][25689] Fps is (10 sec: 5678.5, 60 sec: 5641.6, 300 sec: 5659.3). Total num frames: 363144192. Throughput: 0: 5937.4. Samples: 363142774. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:47:55,111][25689] Avg episode reward: [(0, '-46.888')] [2022-07-09 17:47:56,669][26022] Updated weights on worker 0-0, policy_version 354642 (0.00083) [2022-07-09 17:47:58,417][26022] Updated weights on worker 0-0, policy_version 354652 (0.00091) [2022-07-09 17:48:00,228][25689] Fps is (10 sec: 5644.7, 60 sec: 5640.6, 300 sec: 5672.2). Total num frames: 363172864. Throughput: 0: 5925.2. Samples: 363177232. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:00,229][25689] Avg episode reward: [(0, '-47.372')] [2022-07-09 17:48:00,270][26022] Updated weights on worker 0-0, policy_version 354662 (0.00089) [2022-07-09 17:48:01,897][26022] Updated weights on worker 0-0, policy_version 354672 (0.00090) [2022-07-09 17:48:04,140][26022] Updated weights on worker 0-0, policy_version 354682 (0.00086) [2022-07-09 17:48:05,318][25689] Fps is (10 sec: 5517.9, 60 sec: 5654.5, 300 sec: 5660.8). Total num frames: 363200512. Throughput: 0: 5818.9. Samples: 363209288. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:05,320][25689] Avg episode reward: [(0, '-47.631')] [2022-07-09 17:48:05,979][26022] Updated weights on worker 0-0, policy_version 354692 (0.00090) [2022-07-09 17:48:07,886][26022] Updated weights on worker 0-0, policy_version 354702 (0.00085) [2022-07-09 17:48:09,494][26022] Updated weights on worker 0-0, policy_version 354712 (0.00085) [2022-07-09 17:48:10,352][25689] Fps is (10 sec: 5563.2, 60 sec: 5658.3, 300 sec: 5663.7). Total num frames: 363229184. Throughput: 0: 5829.2. Samples: 363226594. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:10,352][25689] Avg episode reward: [(0, '-47.142')] [2022-07-09 17:48:11,411][26022] Updated weights on worker 0-0, policy_version 354722 (0.00086) [2022-07-09 17:48:13,169][26022] Updated weights on worker 0-0, policy_version 354732 (0.00092) [2022-07-09 17:48:14,961][26022] Updated weights on worker 0-0, policy_version 354742 (0.00091) [2022-07-09 17:48:15,395][25689] Fps is (10 sec: 5690.3, 60 sec: 5655.0, 300 sec: 5664.1). Total num frames: 363257856. Throughput: 0: 5822.6. Samples: 363260882. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:15,397][25689] Avg episode reward: [(0, '-47.972')] [2022-07-09 17:48:16,677][26022] Updated weights on worker 0-0, policy_version 354752 (0.00087) [2022-07-09 17:48:18,582][26022] Updated weights on worker 0-0, policy_version 354762 (0.00086) [2022-07-09 17:48:20,303][26022] Updated weights on worker 0-0, policy_version 354772 (0.00085) [2022-07-09 17:48:20,512][25689] Fps is (10 sec: 5744.7, 60 sec: 5666.4, 300 sec: 5662.8). Total num frames: 363287552. Throughput: 0: 5805.8. Samples: 363294996. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:20,513][25689] Avg episode reward: [(0, '-47.478')] [2022-07-09 17:48:22,156][26022] Updated weights on worker 0-0, policy_version 354782 (0.00058) [2022-07-09 17:48:23,820][26022] Updated weights on worker 0-0, policy_version 354792 (0.00093) [2022-07-09 17:48:25,516][25689] Fps is (10 sec: 5767.0, 60 sec: 5667.0, 300 sec: 5666.8). Total num frames: 363316224. Throughput: 0: 5099.1. Samples: 363312282. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:25,517][25689] Avg episode reward: [(0, '-47.896')] [2022-07-09 17:48:25,918][26022] Updated weights on worker 0-0, policy_version 354802 (0.00108) [2022-07-09 17:48:27,395][26022] Updated weights on worker 0-0, policy_version 354812 (0.00086) [2022-07-09 17:48:29,391][26022] Updated weights on worker 0-0, policy_version 354822 (0.00088) [2022-07-09 17:48:30,573][25689] Fps is (10 sec: 5699.5, 60 sec: 5678.9, 300 sec: 5670.3). Total num frames: 363344896. Throughput: 0: 5940.6. Samples: 363346722. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:30,574][25689] Avg episode reward: [(0, '-48.229')] [2022-07-09 17:48:31,048][26022] Updated weights on worker 0-0, policy_version 354832 (0.00087) [2022-07-09 17:48:32,846][26022] Updated weights on worker 0-0, policy_version 354842 (0.00084) [2022-07-09 17:48:34,750][26022] Updated weights on worker 0-0, policy_version 354852 (0.00413) [2022-07-09 17:48:35,604][25689] Fps is (10 sec: 5785.9, 60 sec: 5680.9, 300 sec: 5668.6). Total num frames: 363374592. Throughput: 0: 5946.8. Samples: 363381060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:35,604][25689] Avg episode reward: [(0, '-47.390')] [2022-07-09 17:48:36,402][26022] Updated weights on worker 0-0, policy_version 354862 (0.00087) [2022-07-09 17:48:38,320][26022] Updated weights on worker 0-0, policy_version 354872 (0.00087) [2022-07-09 17:48:40,233][26022] Updated weights on worker 0-0, policy_version 354882 (0.00085) [2022-07-09 17:48:40,691][25689] Fps is (10 sec: 5667.7, 60 sec: 5678.9, 300 sec: 5660.2). Total num frames: 363402240. Throughput: 0: 5108.7. Samples: 363398084. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:40,692][25689] Avg episode reward: [(0, '-46.849')] [2022-07-09 17:48:41,709][26022] Updated weights on worker 0-0, policy_version 354892 (0.00091) [2022-07-09 17:48:43,645][26022] Updated weights on worker 0-0, policy_version 354902 (0.00093) [2022-07-09 17:48:45,219][26022] Updated weights on worker 0-0, policy_version 354912 (0.00088) [2022-07-09 17:48:45,712][25689] Fps is (10 sec: 5571.7, 60 sec: 5668.3, 300 sec: 5664.6). Total num frames: 363430912. Throughput: 0: 5951.8. Samples: 363432486. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:45,713][25689] Avg episode reward: [(0, '-46.649')] [2022-07-09 17:48:47,312][26022] Updated weights on worker 0-0, policy_version 354922 (0.00085) [2022-07-09 17:48:49,122][26022] Updated weights on worker 0-0, policy_version 354932 (0.00091) [2022-07-09 17:48:50,723][25689] Fps is (10 sec: 5716.3, 60 sec: 5672.6, 300 sec: 5664.6). Total num frames: 363459584. Throughput: 0: 5949.5. Samples: 363466602. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:50,723][25689] Avg episode reward: [(0, '-46.873')] [2022-07-09 17:48:50,888][26022] Updated weights on worker 0-0, policy_version 354942 (0.00088) [2022-07-09 17:48:52,506][26022] Updated weights on worker 0-0, policy_version 354952 (0.00087) [2022-07-09 17:48:54,572][26022] Updated weights on worker 0-0, policy_version 354962 (0.00101) [2022-07-09 17:48:55,808][25689] Fps is (10 sec: 5781.2, 60 sec: 5685.4, 300 sec: 5665.0). Total num frames: 363489280. Throughput: 0: 5064.8. Samples: 363483392. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2022-07-09 17:48:55,809][25689] Avg episode reward: [(0, '-46.677')] [2022-07-09 17:48:56,150][26022] Updated weights on worker 0-0, policy_version 354972 (0.00097) [2022-07-09 17:48:58,122][26022] Updated weights on worker 0-0, policy_version 354982 (0.00093) [2022-07-09 17:48:59,853][26022] Updated weights on worker 0-0, policy_version 354992 (0.00092) [2022-07-09 17:49:00,908][25689] Fps is (10 sec: 5629.9, 60 sec: 5670.1, 300 sec: 5666.9). Total num frames: 363516928. Throughput: 0: 5929.9. Samples: 363517970. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:00,909][25689] Avg episode reward: [(0, '-46.948')] [2022-07-09 17:49:01,820][26022] Updated weights on worker 0-0, policy_version 355002 (0.00098) [2022-07-09 17:49:03,791][26022] Updated weights on worker 0-0, policy_version 355012 (0.00088) [2022-07-09 17:49:05,655][26022] Updated weights on worker 0-0, policy_version 355022 (0.00092) [2022-07-09 17:49:05,910][25689] Fps is (10 sec: 5372.5, 60 sec: 5661.4, 300 sec: 5663.9). Total num frames: 363543552. Throughput: 0: 5834.0. Samples: 363550320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:05,911][25689] Avg episode reward: [(0, '-47.588')] [2022-07-09 17:49:07,329][26022] Updated weights on worker 0-0, policy_version 355032 (0.00086) [2022-07-09 17:49:09,068][26022] Updated weights on worker 0-0, policy_version 355042 (0.00091) [2022-07-09 17:49:10,875][26022] Updated weights on worker 0-0, policy_version 355052 (0.00089) [2022-07-09 17:49:10,939][25689] Fps is (10 sec: 5615.0, 60 sec: 5678.8, 300 sec: 5663.5). Total num frames: 363573248. Throughput: 0: 4997.4. Samples: 363567628. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:10,939][25689] Avg episode reward: [(0, '-48.013')] [2022-07-09 17:49:12,740][26022] Updated weights on worker 0-0, policy_version 355062 (0.00086) [2022-07-09 17:49:14,386][26022] Updated weights on worker 0-0, policy_version 355072 (0.00090) [2022-07-09 17:49:15,943][25689] Fps is (10 sec: 5817.6, 60 sec: 5682.5, 300 sec: 5669.1). Total num frames: 363601920. Throughput: 0: 5884.6. Samples: 363601878. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:15,944][25689] Avg episode reward: [(0, '-48.897')] [2022-07-09 17:49:16,370][26022] Updated weights on worker 0-0, policy_version 355082 (0.00087) [2022-07-09 17:49:17,896][26022] Updated weights on worker 0-0, policy_version 355092 (0.00090) [2022-07-09 17:49:18,981][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:49:18,998][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000355097_363619328.pth [2022-07-09 17:49:18,998][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000353103_361577472.pth [2022-07-09 17:49:19,917][26022] Updated weights on worker 0-0, policy_version 355102 (0.00083) [2022-07-09 17:49:21,069][25689] Fps is (10 sec: 5761.5, 60 sec: 5681.6, 300 sec: 5667.8). Total num frames: 363631616. Throughput: 0: 5865.1. Samples: 363636216. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:21,072][25689] Avg episode reward: [(0, '-48.095')] [2022-07-09 17:49:21,697][26022] Updated weights on worker 0-0, policy_version 355112 (0.00092) [2022-07-09 17:49:23,471][26022] Updated weights on worker 0-0, policy_version 355122 (0.00077) [2022-07-09 17:49:25,296][26022] Updated weights on worker 0-0, policy_version 355132 (0.00079) [2022-07-09 17:49:26,087][25689] Fps is (10 sec: 5653.1, 60 sec: 5663.4, 300 sec: 5664.1). Total num frames: 363659264. Throughput: 0: 5109.5. Samples: 363653414. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:26,088][25689] Avg episode reward: [(0, '-47.955')] [2022-07-09 17:49:27,098][26022] Updated weights on worker 0-0, policy_version 355142 (0.00104) [2022-07-09 17:49:28,696][26022] Updated weights on worker 0-0, policy_version 355152 (0.00093) [2022-07-09 17:49:30,664][26022] Updated weights on worker 0-0, policy_version 355162 (0.00093) [2022-07-09 17:49:31,150][25689] Fps is (10 sec: 5586.7, 60 sec: 5662.9, 300 sec: 5663.2). Total num frames: 363687936. Throughput: 0: 5954.9. Samples: 363687986. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:31,152][25689] Avg episode reward: [(0, '-47.971')] [2022-07-09 17:49:32,446][26022] Updated weights on worker 0-0, policy_version 355172 (0.00086) [2022-07-09 17:49:34,304][26022] Updated weights on worker 0-0, policy_version 355182 (0.00091) [2022-07-09 17:49:36,111][26022] Updated weights on worker 0-0, policy_version 355192 (0.00086) [2022-07-09 17:49:36,174][25689] Fps is (10 sec: 5685.0, 60 sec: 5646.6, 300 sec: 5667.5). Total num frames: 363716608. Throughput: 0: 5944.0. Samples: 363722130. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:36,176][25689] Avg episode reward: [(0, '-48.167')] [2022-07-09 17:49:37,738][26022] Updated weights on worker 0-0, policy_version 355202 (0.00087) [2022-07-09 17:49:39,551][26022] Updated weights on worker 0-0, policy_version 355212 (0.00090) [2022-07-09 17:49:41,233][25689] Fps is (10 sec: 5789.2, 60 sec: 5683.1, 300 sec: 5667.7). Total num frames: 363746304. Throughput: 0: 5110.9. Samples: 363739268. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:41,235][25689] Avg episode reward: [(0, '-47.480')] [2022-07-09 17:49:41,389][26022] Updated weights on worker 0-0, policy_version 355222 (0.00085) [2022-07-09 17:49:43,282][26022] Updated weights on worker 0-0, policy_version 355232 (0.00088) [2022-07-09 17:49:44,926][26022] Updated weights on worker 0-0, policy_version 355242 (0.00082) [2022-07-09 17:49:46,323][25689] Fps is (10 sec: 5751.2, 60 sec: 5676.6, 300 sec: 5667.0). Total num frames: 363774976. Throughput: 0: 5932.0. Samples: 363773452. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:46,324][25689] Avg episode reward: [(0, '-47.700')] [2022-07-09 17:49:46,719][26022] Updated weights on worker 0-0, policy_version 355252 (0.00082) [2022-07-09 17:49:48,499][26022] Updated weights on worker 0-0, policy_version 355262 (0.00087) [2022-07-09 17:49:50,434][26022] Updated weights on worker 0-0, policy_version 355272 (0.00087) [2022-07-09 17:49:51,337][25689] Fps is (10 sec: 5675.0, 60 sec: 5676.3, 300 sec: 5666.8). Total num frames: 363803648. Throughput: 0: 5953.0. Samples: 363808158. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:51,338][25689] Avg episode reward: [(0, '-47.944')] [2022-07-09 17:49:52,081][26022] Updated weights on worker 0-0, policy_version 355282 (0.00086) [2022-07-09 17:49:53,993][26022] Updated weights on worker 0-0, policy_version 355292 (0.00092) [2022-07-09 17:49:55,629][26022] Updated weights on worker 0-0, policy_version 355302 (0.00092) [2022-07-09 17:49:56,389][25689] Fps is (10 sec: 5696.5, 60 sec: 5662.5, 300 sec: 5667.1). Total num frames: 363832320. Throughput: 0: 5108.2. Samples: 363825394. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:49:56,390][25689] Avg episode reward: [(0, '-47.718')] [2022-07-09 17:49:57,643][26022] Updated weights on worker 0-0, policy_version 355312 (0.00088) [2022-07-09 17:49:59,312][26022] Updated weights on worker 0-0, policy_version 355322 (0.00094) [2022-07-09 17:50:01,108][26022] Updated weights on worker 0-0, policy_version 355332 (0.00086) [2022-07-09 17:50:01,460][25689] Fps is (10 sec: 5664.6, 60 sec: 5682.1, 300 sec: 5676.5). Total num frames: 363860992. Throughput: 0: 5933.1. Samples: 363859282. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:01,462][25689] Avg episode reward: [(0, '-48.401')] [2022-07-09 17:50:03,300][26022] Updated weights on worker 0-0, policy_version 355342 (0.00089) [2022-07-09 17:50:05,039][26022] Updated weights on worker 0-0, policy_version 355352 (0.00090) [2022-07-09 17:50:06,506][25689] Fps is (10 sec: 5364.7, 60 sec: 5661.1, 300 sec: 5658.8). Total num frames: 363886592. Throughput: 0: 5834.5. Samples: 363891210. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:06,508][25689] Avg episode reward: [(0, '-48.480')] [2022-07-09 17:50:07,083][26022] Updated weights on worker 0-0, policy_version 355362 (0.00092) [2022-07-09 17:50:08,680][26022] Updated weights on worker 0-0, policy_version 355372 (0.00082) [2022-07-09 17:50:10,653][26022] Updated weights on worker 0-0, policy_version 355382 (0.00084) [2022-07-09 17:50:11,540][25689] Fps is (10 sec: 5486.1, 60 sec: 5660.6, 300 sec: 5672.7). Total num frames: 363916288. Throughput: 0: 5793.5. Samples: 363925202. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:11,541][25689] Avg episode reward: [(0, '-48.714')] [2022-07-09 17:50:12,445][26022] Updated weights on worker 0-0, policy_version 355392 (0.00088) [2022-07-09 17:50:14,194][26022] Updated weights on worker 0-0, policy_version 355402 (0.00094) [2022-07-09 17:50:15,985][26022] Updated weights on worker 0-0, policy_version 355412 (0.00085) [2022-07-09 17:50:16,606][25689] Fps is (10 sec: 5778.8, 60 sec: 5654.8, 300 sec: 5663.1). Total num frames: 363944960. Throughput: 0: 5789.1. Samples: 363942432. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:16,607][25689] Avg episode reward: [(0, '-48.305')] [2022-07-09 17:50:17,695][26022] Updated weights on worker 0-0, policy_version 355422 (0.00082) [2022-07-09 17:50:19,539][26022] Updated weights on worker 0-0, policy_version 355432 (0.00091) [2022-07-09 17:50:21,408][26022] Updated weights on worker 0-0, policy_version 355442 (0.00085) [2022-07-09 17:50:21,687][25689] Fps is (10 sec: 5651.4, 60 sec: 5642.2, 300 sec: 5661.7). Total num frames: 363973632. Throughput: 0: 5801.4. Samples: 363976622. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:21,687][25689] Avg episode reward: [(0, '-48.193')] [2022-07-09 17:50:23,098][26022] Updated weights on worker 0-0, policy_version 355452 (0.00084) [2022-07-09 17:50:24,995][26022] Updated weights on worker 0-0, policy_version 355462 (0.00095) [2022-07-09 17:50:26,693][25689] Fps is (10 sec: 5684.9, 60 sec: 5660.1, 300 sec: 5658.6). Total num frames: 364002304. Throughput: 0: 5929.0. Samples: 364010902. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:26,694][25689] Avg episode reward: [(0, '-47.208')] [2022-07-09 17:50:26,767][26022] Updated weights on worker 0-0, policy_version 355472 (0.00083) [2022-07-09 17:50:28,618][26022] Updated weights on worker 0-0, policy_version 355482 (0.00082) [2022-07-09 17:50:30,318][26022] Updated weights on worker 0-0, policy_version 355492 (0.00103) [2022-07-09 17:50:31,704][25689] Fps is (10 sec: 5826.7, 60 sec: 5682.0, 300 sec: 5662.2). Total num frames: 364032000. Throughput: 0: 5104.2. Samples: 364028124. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:31,706][25689] Avg episode reward: [(0, '-47.293')] [2022-07-09 17:50:32,099][26022] Updated weights on worker 0-0, policy_version 355502 (0.00514) [2022-07-09 17:50:33,976][26022] Updated weights on worker 0-0, policy_version 355512 (0.00090) [2022-07-09 17:50:35,879][26022] Updated weights on worker 0-0, policy_version 355522 (0.00086) [2022-07-09 17:50:36,735][25689] Fps is (10 sec: 5608.8, 60 sec: 5647.5, 300 sec: 5659.8). Total num frames: 364058624. Throughput: 0: 5966.0. Samples: 364062518. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:36,735][25689] Avg episode reward: [(0, '-46.647')] [2022-07-09 17:50:37,443][26022] Updated weights on worker 0-0, policy_version 355532 (0.00084) [2022-07-09 17:50:39,186][26022] Updated weights on worker 0-0, policy_version 355542 (0.00058) [2022-07-09 17:50:41,050][26022] Updated weights on worker 0-0, policy_version 355552 (0.00095) [2022-07-09 17:50:41,772][25689] Fps is (10 sec: 5593.8, 60 sec: 5649.4, 300 sec: 5663.2). Total num frames: 364088320. Throughput: 0: 5980.6. Samples: 364096746. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:41,773][25689] Avg episode reward: [(0, '-46.837')] [2022-07-09 17:50:42,870][26022] Updated weights on worker 0-0, policy_version 355562 (0.00083) [2022-07-09 17:50:44,742][26022] Updated weights on worker 0-0, policy_version 355572 (0.00359) [2022-07-09 17:50:46,412][26022] Updated weights on worker 0-0, policy_version 355582 (0.00089) [2022-07-09 17:50:46,774][25689] Fps is (10 sec: 5813.8, 60 sec: 5657.7, 300 sec: 5663.4). Total num frames: 364116992. Throughput: 0: 5130.7. Samples: 364113932. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:46,778][25689] Avg episode reward: [(0, '-47.128')] [2022-07-09 17:50:48,264][26022] Updated weights on worker 0-0, policy_version 355592 (0.00094) [2022-07-09 17:50:50,074][26022] Updated weights on worker 0-0, policy_version 355602 (0.00089) [2022-07-09 17:50:51,747][26022] Updated weights on worker 0-0, policy_version 355612 (0.00085) [2022-07-09 17:50:51,783][25689] Fps is (10 sec: 5830.8, 60 sec: 5675.2, 300 sec: 5666.9). Total num frames: 364146688. Throughput: 0: 5991.3. Samples: 364148422. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:51,783][25689] Avg episode reward: [(0, '-47.707')] [2022-07-09 17:50:53,778][26022] Updated weights on worker 0-0, policy_version 355622 (0.00083) [2022-07-09 17:50:55,358][26022] Updated weights on worker 0-0, policy_version 355632 (0.00091) [2022-07-09 17:50:56,799][25689] Fps is (10 sec: 5720.4, 60 sec: 5661.6, 300 sec: 5661.1). Total num frames: 364174336. Throughput: 0: 5990.1. Samples: 364182704. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:50:56,799][25689] Avg episode reward: [(0, '-47.944')] [2022-07-09 17:50:57,260][26022] Updated weights on worker 0-0, policy_version 355642 (0.00093) [2022-07-09 17:50:58,987][26022] Updated weights on worker 0-0, policy_version 355652 (0.00083) [2022-07-09 17:51:00,704][26022] Updated weights on worker 0-0, policy_version 355662 (0.00096) [2022-07-09 17:51:01,839][25689] Fps is (10 sec: 5498.8, 60 sec: 5647.6, 300 sec: 5665.2). Total num frames: 364201984. Throughput: 0: 5135.0. Samples: 364199786. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:01,839][25689] Avg episode reward: [(0, '-47.413')] [2022-07-09 17:51:03,215][26022] Updated weights on worker 0-0, policy_version 355672 (0.00091) [2022-07-09 17:51:04,614][26022] Updated weights on worker 0-0, policy_version 355682 (0.00511) [2022-07-09 17:51:06,680][26022] Updated weights on worker 0-0, policy_version 355692 (0.00081) [2022-07-09 17:51:06,862][25689] Fps is (10 sec: 5495.0, 60 sec: 5683.6, 300 sec: 5665.4). Total num frames: 364229632. Throughput: 0: 5877.3. Samples: 364231992. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:06,862][25689] Avg episode reward: [(0, '-47.861')] [2022-07-09 17:51:08,213][26022] Updated weights on worker 0-0, policy_version 355702 (0.00089) [2022-07-09 17:51:10,187][26022] Updated weights on worker 0-0, policy_version 355712 (0.00088) [2022-07-09 17:51:11,854][26022] Updated weights on worker 0-0, policy_version 355722 (0.00087) [2022-07-09 17:51:11,866][25689] Fps is (10 sec: 5718.6, 60 sec: 5686.4, 300 sec: 5669.2). Total num frames: 364259328. Throughput: 0: 5858.1. Samples: 364266074. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:11,867][25689] Avg episode reward: [(0, '-48.204')] [2022-07-09 17:51:13,756][26022] Updated weights on worker 0-0, policy_version 355732 (0.00089) [2022-07-09 17:51:15,537][26022] Updated weights on worker 0-0, policy_version 355742 (0.00084) [2022-07-09 17:51:16,874][25689] Fps is (10 sec: 5624.8, 60 sec: 5657.9, 300 sec: 5660.0). Total num frames: 364285952. Throughput: 0: 5009.3. Samples: 364283270. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:16,875][25689] Avg episode reward: [(0, '-48.232')] [2022-07-09 17:51:17,434][26022] Updated weights on worker 0-0, policy_version 355752 (0.00091) [2022-07-09 17:51:19,006][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:51:19,014][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000355761_364299264.pth [2022-07-09 17:51:19,014][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000353769_362259456.pth [2022-07-09 17:51:19,067][26022] Updated weights on worker 0-0, policy_version 355762 (0.00084) [2022-07-09 17:51:21,113][26022] Updated weights on worker 0-0, policy_version 355772 (0.00087) [2022-07-09 17:51:22,003][25689] Fps is (10 sec: 5556.2, 60 sec: 5670.4, 300 sec: 5658.1). Total num frames: 364315648. Throughput: 0: 5832.1. Samples: 364317384. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:22,003][25689] Avg episode reward: [(0, '-48.345')] [2022-07-09 17:51:22,738][26022] Updated weights on worker 0-0, policy_version 355782 (0.00086) [2022-07-09 17:51:24,823][26022] Updated weights on worker 0-0, policy_version 355792 (0.00088) [2022-07-09 17:51:26,319][26022] Updated weights on worker 0-0, policy_version 355802 (0.00092) [2022-07-09 17:51:27,007][25689] Fps is (10 sec: 5760.6, 60 sec: 5670.6, 300 sec: 5662.1). Total num frames: 364344320. Throughput: 0: 5920.2. Samples: 364351254. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:27,008][25689] Avg episode reward: [(0, '-48.906')] [2022-07-09 17:51:28,425][26022] Updated weights on worker 0-0, policy_version 355812 (0.00092) [2022-07-09 17:51:30,103][26022] Updated weights on worker 0-0, policy_version 355822 (0.00089) [2022-07-09 17:51:32,076][25689] Fps is (10 sec: 5591.0, 60 sec: 5631.2, 300 sec: 5661.4). Total num frames: 364371968. Throughput: 0: 5055.6. Samples: 364368246. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 17:51:32,078][25689] Avg episode reward: [(0, '-48.965')] [2022-07-09 17:51:32,080][26022] Updated weights on worker 0-0, policy_version 355832 (0.00085) [2022-07-09 17:51:33,824][26022] Updated weights on worker 0-0, policy_version 355842 (0.00087) [2022-07-09 17:51:35,444][26022] Updated weights on worker 0-0, policy_version 355852 (0.00086) [2022-07-09 17:51:37,103][25689] Fps is (10 sec: 5578.3, 60 sec: 5665.5, 300 sec: 5662.1). Total num frames: 364400640. Throughput: 0: 5876.4. Samples: 364402142. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:51:37,104][25689] Avg episode reward: [(0, '-48.670')] [2022-07-09 17:51:37,292][26022] Updated weights on worker 0-0, policy_version 355862 (0.00086) [2022-07-09 17:51:39,285][26022] Updated weights on worker 0-0, policy_version 355872 (0.00080) [2022-07-09 17:51:40,910][26022] Updated weights on worker 0-0, policy_version 355882 (0.00091) [2022-07-09 17:51:42,229][25689] Fps is (10 sec: 5748.7, 60 sec: 5657.2, 300 sec: 5656.9). Total num frames: 364430336. Throughput: 0: 5880.2. Samples: 364436320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:51:42,230][25689] Avg episode reward: [(0, '-48.283')] [2022-07-09 17:51:42,851][26022] Updated weights on worker 0-0, policy_version 355892 (0.00094) [2022-07-09 17:51:44,436][26022] Updated weights on worker 0-0, policy_version 355902 (0.00092) [2022-07-09 17:51:46,473][26022] Updated weights on worker 0-0, policy_version 355912 (0.00087) [2022-07-09 17:51:47,269][25689] Fps is (10 sec: 5641.0, 60 sec: 5636.8, 300 sec: 5659.7). Total num frames: 364457984. Throughput: 0: 5042.4. Samples: 364453422. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:51:47,270][25689] Avg episode reward: [(0, '-48.193')] [2022-07-09 17:51:48,090][26022] Updated weights on worker 0-0, policy_version 355922 (0.00094) [2022-07-09 17:51:50,115][26022] Updated weights on worker 0-0, policy_version 355932 (0.00088) [2022-07-09 17:51:51,599][26022] Updated weights on worker 0-0, policy_version 355942 (0.00090) [2022-07-09 17:51:52,284][25689] Fps is (10 sec: 5601.3, 60 sec: 5619.2, 300 sec: 5656.5). Total num frames: 364486656. Throughput: 0: 5890.2. Samples: 364487276. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:51:52,286][25689] Avg episode reward: [(0, '-48.784')] [2022-07-09 17:51:53,659][26022] Updated weights on worker 0-0, policy_version 355952 (0.00089) [2022-07-09 17:51:55,269][26022] Updated weights on worker 0-0, policy_version 355962 (0.00083) [2022-07-09 17:51:57,239][26022] Updated weights on worker 0-0, policy_version 355972 (0.00089) [2022-07-09 17:51:57,328][25689] Fps is (10 sec: 5700.5, 60 sec: 5633.5, 300 sec: 5657.7). Total num frames: 364515328. Throughput: 0: 5903.9. Samples: 364521548. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:51:57,329][25689] Avg episode reward: [(0, '-48.526')] [2022-07-09 17:51:58,783][26022] Updated weights on worker 0-0, policy_version 355982 (0.00086) [2022-07-09 17:52:00,804][26022] Updated weights on worker 0-0, policy_version 355992 (0.00089) [2022-07-09 17:52:02,416][25689] Fps is (10 sec: 5558.7, 60 sec: 5629.0, 300 sec: 5660.5). Total num frames: 364542976. Throughput: 0: 5059.4. Samples: 364538452. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:02,417][25689] Avg episode reward: [(0, '-49.067')] [2022-07-09 17:52:02,876][26022] Updated weights on worker 0-0, policy_version 356002 (0.00092) [2022-07-09 17:52:04,923][26022] Updated weights on worker 0-0, policy_version 356012 (0.00096) [2022-07-09 17:52:06,463][26022] Updated weights on worker 0-0, policy_version 356022 (0.00092) [2022-07-09 17:52:07,494][25689] Fps is (10 sec: 5439.4, 60 sec: 5624.0, 300 sec: 5657.0). Total num frames: 364570624. Throughput: 0: 5792.7. Samples: 364570580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:07,495][25689] Avg episode reward: [(0, '-49.112')] [2022-07-09 17:52:08,387][26022] Updated weights on worker 0-0, policy_version 356032 (0.00085) [2022-07-09 17:52:10,018][26022] Updated weights on worker 0-0, policy_version 356042 (0.00079) [2022-07-09 17:52:12,243][26022] Updated weights on worker 0-0, policy_version 356052 (0.00096) [2022-07-09 17:52:12,504][25689] Fps is (10 sec: 5684.5, 60 sec: 5623.5, 300 sec: 5660.4). Total num frames: 364600320. Throughput: 0: 5816.8. Samples: 364604890. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:12,504][25689] Avg episode reward: [(0, '-48.097')] [2022-07-09 17:52:13,587][26022] Updated weights on worker 0-0, policy_version 356062 (0.00090) [2022-07-09 17:52:15,662][26022] Updated weights on worker 0-0, policy_version 356072 (0.00092) [2022-07-09 17:52:17,306][26022] Updated weights on worker 0-0, policy_version 356082 (0.00093) [2022-07-09 17:52:17,565][25689] Fps is (10 sec: 5795.6, 60 sec: 5652.3, 300 sec: 5660.4). Total num frames: 364628992. Throughput: 0: 4978.1. Samples: 364622290. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:17,566][25689] Avg episode reward: [(0, '-47.467')] [2022-07-09 17:52:19,217][26022] Updated weights on worker 0-0, policy_version 356092 (0.00092) [2022-07-09 17:52:20,931][26022] Updated weights on worker 0-0, policy_version 356102 (0.00083) [2022-07-09 17:52:22,617][25689] Fps is (10 sec: 5568.9, 60 sec: 5625.6, 300 sec: 5656.1). Total num frames: 364656640. Throughput: 0: 5834.0. Samples: 364656306. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:22,623][25689] Avg episode reward: [(0, '-47.158')] [2022-07-09 17:52:22,893][26022] Updated weights on worker 0-0, policy_version 356112 (0.00093) [2022-07-09 17:52:24,447][26022] Updated weights on worker 0-0, policy_version 356122 (0.00089) [2022-07-09 17:52:26,571][26022] Updated weights on worker 0-0, policy_version 356132 (0.00096) [2022-07-09 17:52:27,703][25689] Fps is (10 sec: 5656.4, 60 sec: 5634.9, 300 sec: 5661.5). Total num frames: 364686336. Throughput: 0: 5940.7. Samples: 364690636. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:27,703][25689] Avg episode reward: [(0, '-46.484')] [2022-07-09 17:52:28,015][26022] Updated weights on worker 0-0, policy_version 356142 (0.00084) [2022-07-09 17:52:30,071][26022] Updated weights on worker 0-0, policy_version 356152 (0.00097) [2022-07-09 17:52:31,687][26022] Updated weights on worker 0-0, policy_version 356162 (0.00087) [2022-07-09 17:52:32,739][25689] Fps is (10 sec: 5665.5, 60 sec: 5638.0, 300 sec: 5654.9). Total num frames: 364713984. Throughput: 0: 5080.1. Samples: 364707686. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:32,739][25689] Avg episode reward: [(0, '-46.565')] [2022-07-09 17:52:33,647][26022] Updated weights on worker 0-0, policy_version 356172 (0.00087) [2022-07-09 17:52:35,444][26022] Updated weights on worker 0-0, policy_version 356182 (0.00089) [2022-07-09 17:52:37,342][26022] Updated weights on worker 0-0, policy_version 356192 (0.00091) [2022-07-09 17:52:37,766][25689] Fps is (10 sec: 5596.7, 60 sec: 5638.0, 300 sec: 5659.1). Total num frames: 364742656. Throughput: 0: 5897.0. Samples: 364741416. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:37,766][25689] Avg episode reward: [(0, '-46.599')] [2022-07-09 17:52:38,956][26022] Updated weights on worker 0-0, policy_version 356202 (0.00087) [2022-07-09 17:52:41,023][26022] Updated weights on worker 0-0, policy_version 356212 (0.00089) [2022-07-09 17:52:42,542][26022] Updated weights on worker 0-0, policy_version 356222 (0.00087) [2022-07-09 17:52:42,816][25689] Fps is (10 sec: 5792.1, 60 sec: 5645.1, 300 sec: 5659.8). Total num frames: 364772352. Throughput: 0: 5916.9. Samples: 364775820. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:42,816][25689] Avg episode reward: [(0, '-46.651')] [2022-07-09 17:52:44,510][26022] Updated weights on worker 0-0, policy_version 356232 (0.00094) [2022-07-09 17:52:46,016][26022] Updated weights on worker 0-0, policy_version 356242 (0.00092) [2022-07-09 17:52:47,823][25689] Fps is (10 sec: 5702.1, 60 sec: 5648.1, 300 sec: 5657.3). Total num frames: 364800000. Throughput: 0: 5941.3. Samples: 364810174. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:47,823][25689] Avg episode reward: [(0, '-46.884')] [2022-07-09 17:52:48,039][26022] Updated weights on worker 0-0, policy_version 356252 (0.00086) [2022-07-09 17:52:49,791][26022] Updated weights on worker 0-0, policy_version 356262 (0.00091) [2022-07-09 17:52:51,527][26022] Updated weights on worker 0-0, policy_version 356272 (0.00087) [2022-07-09 17:52:52,844][25689] Fps is (10 sec: 5616.5, 60 sec: 5647.6, 300 sec: 5657.7). Total num frames: 364828672. Throughput: 0: 5959.8. Samples: 364827506. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:52,845][25689] Avg episode reward: [(0, '-46.413')] [2022-07-09 17:52:53,362][26022] Updated weights on worker 0-0, policy_version 356282 (0.00088) [2022-07-09 17:52:55,202][26022] Updated weights on worker 0-0, policy_version 356292 (0.00092) [2022-07-09 17:52:56,929][26022] Updated weights on worker 0-0, policy_version 356302 (0.00087) [2022-07-09 17:52:57,879][25689] Fps is (10 sec: 5702.2, 60 sec: 5648.4, 300 sec: 5659.2). Total num frames: 364857344. Throughput: 0: 5991.9. Samples: 364861932. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:52:57,881][25689] Avg episode reward: [(0, '-47.014')] [2022-07-09 17:52:58,863][26022] Updated weights on worker 0-0, policy_version 356312 (0.00084) [2022-07-09 17:53:00,543][26022] Updated weights on worker 0-0, policy_version 356322 (0.00092) [2022-07-09 17:53:02,714][26022] Updated weights on worker 0-0, policy_version 356332 (0.00092) [2022-07-09 17:53:02,935][25689] Fps is (10 sec: 5581.1, 60 sec: 5651.4, 300 sec: 5659.9). Total num frames: 364884992. Throughput: 0: 5868.1. Samples: 364893878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:02,935][25689] Avg episode reward: [(0, '-46.978')] [2022-07-09 17:53:04,718][26022] Updated weights on worker 0-0, policy_version 356342 (0.00089) [2022-07-09 17:53:06,259][26022] Updated weights on worker 0-0, policy_version 356352 (0.00102) [2022-07-09 17:53:07,946][25689] Fps is (10 sec: 5493.1, 60 sec: 5657.7, 300 sec: 5656.9). Total num frames: 364912640. Throughput: 0: 5017.4. Samples: 364911138. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:07,947][25689] Avg episode reward: [(0, '-46.827')] [2022-07-09 17:53:08,160][26022] Updated weights on worker 0-0, policy_version 356362 (0.00092) [2022-07-09 17:53:10,071][26022] Updated weights on worker 0-0, policy_version 356372 (0.00083) [2022-07-09 17:53:11,527][26022] Updated weights on worker 0-0, policy_version 356382 (0.00083) [2022-07-09 17:53:12,975][25689] Fps is (10 sec: 5609.6, 60 sec: 5638.9, 300 sec: 5657.2). Total num frames: 364941312. Throughput: 0: 5858.3. Samples: 364945440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:12,976][25689] Avg episode reward: [(0, '-47.199')] [2022-07-09 17:53:13,652][26022] Updated weights on worker 0-0, policy_version 356392 (0.00086) [2022-07-09 17:53:15,126][26022] Updated weights on worker 0-0, policy_version 356402 (0.00090) [2022-07-09 17:53:17,272][26022] Updated weights on worker 0-0, policy_version 356412 (0.00094) [2022-07-09 17:53:17,996][25689] Fps is (10 sec: 5807.8, 60 sec: 5659.7, 300 sec: 5659.0). Total num frames: 364971008. Throughput: 0: 5831.4. Samples: 364979240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:17,996][25689] Avg episode reward: [(0, '-46.615')] [2022-07-09 17:53:18,835][26022] Updated weights on worker 0-0, policy_version 356422 (0.00086) [2022-07-09 17:53:19,217][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:53:19,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000356423_364977152.pth [2022-07-09 17:53:19,228][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000354432_362938368.pth [2022-07-09 17:53:20,743][26022] Updated weights on worker 0-0, policy_version 356432 (0.00088) [2022-07-09 17:53:22,536][26022] Updated weights on worker 0-0, policy_version 356442 (0.00086) [2022-07-09 17:53:23,049][25689] Fps is (10 sec: 5692.3, 60 sec: 5659.6, 300 sec: 5654.6). Total num frames: 364998656. Throughput: 0: 5087.2. Samples: 364996202. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:23,049][25689] Avg episode reward: [(0, '-46.877')] [2022-07-09 17:53:24,319][26022] Updated weights on worker 0-0, policy_version 356452 (0.00086) [2022-07-09 17:53:26,116][26022] Updated weights on worker 0-0, policy_version 356462 (0.00099) [2022-07-09 17:53:27,931][26022] Updated weights on worker 0-0, policy_version 356472 (0.00091) [2022-07-09 17:53:28,051][25689] Fps is (10 sec: 5600.9, 60 sec: 5650.4, 300 sec: 5655.7). Total num frames: 365027328. Throughput: 0: 5933.4. Samples: 365030432. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:28,051][25689] Avg episode reward: [(0, '-46.615')] [2022-07-09 17:53:29,839][26022] Updated weights on worker 0-0, policy_version 356482 (0.00086) [2022-07-09 17:53:31,617][26022] Updated weights on worker 0-0, policy_version 356492 (0.00094) [2022-07-09 17:53:33,061][25689] Fps is (10 sec: 5727.6, 60 sec: 5669.8, 300 sec: 5652.6). Total num frames: 365056000. Throughput: 0: 5930.5. Samples: 365064560. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:33,061][25689] Avg episode reward: [(0, '-48.087')] [2022-07-09 17:53:33,311][26022] Updated weights on worker 0-0, policy_version 356502 (0.00102) [2022-07-09 17:53:35,068][26022] Updated weights on worker 0-0, policy_version 356512 (0.00084) [2022-07-09 17:53:37,129][26022] Updated weights on worker 0-0, policy_version 356522 (0.00086) [2022-07-09 17:53:38,079][25689] Fps is (10 sec: 5718.6, 60 sec: 5670.7, 300 sec: 5657.4). Total num frames: 365084672. Throughput: 0: 5112.0. Samples: 365081904. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:38,079][25689] Avg episode reward: [(0, '-48.519')] [2022-07-09 17:53:38,747][26022] Updated weights on worker 0-0, policy_version 356532 (0.00084) [2022-07-09 17:53:40,613][26022] Updated weights on worker 0-0, policy_version 356542 (0.01097) [2022-07-09 17:53:42,212][26022] Updated weights on worker 0-0, policy_version 356552 (0.00087) [2022-07-09 17:53:43,130][25689] Fps is (10 sec: 5491.3, 60 sec: 5619.6, 300 sec: 5649.9). Total num frames: 365111296. Throughput: 0: 5964.4. Samples: 365115976. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:43,131][25689] Avg episode reward: [(0, '-48.544')] [2022-07-09 17:53:44,084][26022] Updated weights on worker 0-0, policy_version 356562 (0.00083) [2022-07-09 17:53:46,143][26022] Updated weights on worker 0-0, policy_version 356572 (0.00084) [2022-07-09 17:53:47,655][26022] Updated weights on worker 0-0, policy_version 356582 (0.00084) [2022-07-09 17:53:48,134][25689] Fps is (10 sec: 5702.9, 60 sec: 5670.9, 300 sec: 5656.9). Total num frames: 365142016. Throughput: 0: 5952.0. Samples: 365149966. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:48,134][25689] Avg episode reward: [(0, '-48.895')] [2022-07-09 17:53:49,584][26022] Updated weights on worker 0-0, policy_version 356592 (0.00097) [2022-07-09 17:53:51,242][26022] Updated weights on worker 0-0, policy_version 356602 (0.00100) [2022-07-09 17:53:53,137][25689] Fps is (10 sec: 5833.0, 60 sec: 5655.5, 300 sec: 5651.6). Total num frames: 365169664. Throughput: 0: 5105.5. Samples: 365167060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:53,137][25689] Avg episode reward: [(0, '-49.098')] [2022-07-09 17:53:53,186][26022] Updated weights on worker 0-0, policy_version 356612 (0.00089) [2022-07-09 17:53:54,901][26022] Updated weights on worker 0-0, policy_version 356622 (0.00082) [2022-07-09 17:53:56,736][26022] Updated weights on worker 0-0, policy_version 356632 (0.00090) [2022-07-09 17:53:58,163][25689] Fps is (10 sec: 5513.7, 60 sec: 5639.5, 300 sec: 5653.0). Total num frames: 365197312. Throughput: 0: 5929.3. Samples: 365200988. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:53:58,163][25689] Avg episode reward: [(0, '-48.760')] [2022-07-09 17:53:58,552][26022] Updated weights on worker 0-0, policy_version 356642 (0.00084) [2022-07-09 17:54:00,497][26022] Updated weights on worker 0-0, policy_version 356652 (0.00090) [2022-07-09 17:54:02,654][26022] Updated weights on worker 0-0, policy_version 356662 (0.00095) [2022-07-09 17:54:03,204][25689] Fps is (10 sec: 5391.0, 60 sec: 5623.8, 300 sec: 5652.3). Total num frames: 365223936. Throughput: 0: 5825.7. Samples: 365232920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 17:54:03,205][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 17:54:04,579][26022] Updated weights on worker 0-0, policy_version 356672 (0.00089) [2022-07-09 17:54:06,473][26022] Updated weights on worker 0-0, policy_version 356682 (0.01048) [2022-07-09 17:54:08,044][26022] Updated weights on worker 0-0, policy_version 356692 (0.00087) [2022-07-09 17:54:08,207][25689] Fps is (10 sec: 5606.9, 60 sec: 5658.5, 300 sec: 5652.7). Total num frames: 365253632. Throughput: 0: 4982.4. Samples: 365249980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:08,209][25689] Avg episode reward: [(0, '-47.530')] [2022-07-09 17:54:09,975][26022] Updated weights on worker 0-0, policy_version 356702 (0.00085) [2022-07-09 17:54:11,677][26022] Updated weights on worker 0-0, policy_version 356712 (0.00086) [2022-07-09 17:54:13,217][25689] Fps is (10 sec: 5624.9, 60 sec: 5626.4, 300 sec: 5645.8). Total num frames: 365280256. Throughput: 0: 5827.8. Samples: 365284082. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:13,218][25689] Avg episode reward: [(0, '-46.853')] [2022-07-09 17:54:13,473][26022] Updated weights on worker 0-0, policy_version 356722 (0.00086) [2022-07-09 17:54:15,349][26022] Updated weights on worker 0-0, policy_version 356732 (0.00095) [2022-07-09 17:54:17,092][26022] Updated weights on worker 0-0, policy_version 356742 (0.00087) [2022-07-09 17:54:18,239][25689] Fps is (10 sec: 5512.3, 60 sec: 5609.3, 300 sec: 5644.3). Total num frames: 365308928. Throughput: 0: 5837.1. Samples: 365318174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:18,239][25689] Avg episode reward: [(0, '-46.844')] [2022-07-09 17:54:18,969][26022] Updated weights on worker 0-0, policy_version 356752 (0.00092) [2022-07-09 17:54:20,916][26022] Updated weights on worker 0-0, policy_version 356762 (0.00086) [2022-07-09 17:54:22,369][26022] Updated weights on worker 0-0, policy_version 356772 (0.00084) [2022-07-09 17:54:23,285][25689] Fps is (10 sec: 5797.0, 60 sec: 5643.9, 300 sec: 5650.6). Total num frames: 365338624. Throughput: 0: 5086.8. Samples: 365335070. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:23,286][25689] Avg episode reward: [(0, '-46.631')] [2022-07-09 17:54:24,381][26022] Updated weights on worker 0-0, policy_version 356782 (0.00051) [2022-07-09 17:54:26,227][26022] Updated weights on worker 0-0, policy_version 356792 (0.00084) [2022-07-09 17:54:27,895][26022] Updated weights on worker 0-0, policy_version 356802 (0.00087) [2022-07-09 17:54:28,290][25689] Fps is (10 sec: 5806.8, 60 sec: 5643.6, 300 sec: 5651.7). Total num frames: 365367296. Throughput: 0: 5946.6. Samples: 365369406. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:28,291][25689] Avg episode reward: [(0, '-46.855')] [2022-07-09 17:54:29,655][26022] Updated weights on worker 0-0, policy_version 356812 (0.00094) [2022-07-09 17:54:31,417][26022] Updated weights on worker 0-0, policy_version 356822 (0.00085) [2022-07-09 17:54:33,258][26022] Updated weights on worker 0-0, policy_version 356832 (0.00084) [2022-07-09 17:54:33,331][25689] Fps is (10 sec: 5708.5, 60 sec: 5640.7, 300 sec: 5651.4). Total num frames: 365395968. Throughput: 0: 5948.2. Samples: 365403724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:33,331][25689] Avg episode reward: [(0, '-47.752')] [2022-07-09 17:54:34,954][26022] Updated weights on worker 0-0, policy_version 356842 (0.00082) [2022-07-09 17:54:36,768][26022] Updated weights on worker 0-0, policy_version 356852 (0.00090) [2022-07-09 17:54:38,339][25689] Fps is (10 sec: 5706.9, 60 sec: 5641.7, 300 sec: 5648.9). Total num frames: 365424640. Throughput: 0: 5119.9. Samples: 365421084. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:38,339][25689] Avg episode reward: [(0, '-48.497')] [2022-07-09 17:54:38,731][26022] Updated weights on worker 0-0, policy_version 356862 (0.00087) [2022-07-09 17:54:40,470][26022] Updated weights on worker 0-0, policy_version 356872 (0.00088) [2022-07-09 17:54:42,350][26022] Updated weights on worker 0-0, policy_version 356882 (0.00092) [2022-07-09 17:54:43,421][25689] Fps is (10 sec: 5683.0, 60 sec: 5672.8, 300 sec: 5649.1). Total num frames: 365453312. Throughput: 0: 5975.6. Samples: 365455392. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:43,422][25689] Avg episode reward: [(0, '-48.335')] [2022-07-09 17:54:43,985][26022] Updated weights on worker 0-0, policy_version 356892 (0.00090) [2022-07-09 17:54:45,806][26022] Updated weights on worker 0-0, policy_version 356902 (0.00087) [2022-07-09 17:54:47,593][26022] Updated weights on worker 0-0, policy_version 356912 (0.00085) [2022-07-09 17:54:48,493][25689] Fps is (10 sec: 5748.1, 60 sec: 5649.4, 300 sec: 5651.4). Total num frames: 365483008. Throughput: 0: 5972.6. Samples: 365490068. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:48,494][25689] Avg episode reward: [(0, '-48.962')] [2022-07-09 17:54:49,613][26022] Updated weights on worker 0-0, policy_version 356922 (0.00099) [2022-07-09 17:54:51,189][26022] Updated weights on worker 0-0, policy_version 356932 (0.00076) [2022-07-09 17:54:53,093][26022] Updated weights on worker 0-0, policy_version 356942 (0.00093) [2022-07-09 17:54:53,509][25689] Fps is (10 sec: 5684.3, 60 sec: 5648.2, 300 sec: 5648.7). Total num frames: 365510656. Throughput: 0: 5112.0. Samples: 365506876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:53,510][25689] Avg episode reward: [(0, '-48.980')] [2022-07-09 17:54:54,595][26022] Updated weights on worker 0-0, policy_version 356952 (0.00082) [2022-07-09 17:54:56,695][26022] Updated weights on worker 0-0, policy_version 356962 (0.00083) [2022-07-09 17:54:58,311][26022] Updated weights on worker 0-0, policy_version 356972 (0.00095) [2022-07-09 17:54:58,515][25689] Fps is (10 sec: 5619.4, 60 sec: 5666.9, 300 sec: 5649.9). Total num frames: 365539328. Throughput: 0: 5958.9. Samples: 365541316. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:54:58,516][25689] Avg episode reward: [(0, '-48.876')] [2022-07-09 17:55:00,187][26022] Updated weights on worker 0-0, policy_version 356982 (0.00087) [2022-07-09 17:55:02,422][26022] Updated weights on worker 0-0, policy_version 356992 (0.00085) [2022-07-09 17:55:03,561][25689] Fps is (10 sec: 5602.9, 60 sec: 5683.5, 300 sec: 5656.8). Total num frames: 365566976. Throughput: 0: 5849.7. Samples: 365573206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:03,562][25689] Avg episode reward: [(0, '-47.590')] [2022-07-09 17:55:04,192][26022] Updated weights on worker 0-0, policy_version 357002 (0.00085) [2022-07-09 17:55:06,096][26022] Updated weights on worker 0-0, policy_version 357012 (0.00108) [2022-07-09 17:55:07,637][26022] Updated weights on worker 0-0, policy_version 357022 (0.00085) [2022-07-09 17:55:08,580][25689] Fps is (10 sec: 5494.0, 60 sec: 5648.1, 300 sec: 5650.2). Total num frames: 365594624. Throughput: 0: 4987.4. Samples: 365590252. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:08,581][25689] Avg episode reward: [(0, '-47.442')] [2022-07-09 17:55:09,574][26022] Updated weights on worker 0-0, policy_version 357032 (0.00087) [2022-07-09 17:55:11,284][26022] Updated weights on worker 0-0, policy_version 357042 (0.00089) [2022-07-09 17:55:13,185][26022] Updated weights on worker 0-0, policy_version 357052 (0.00093) [2022-07-09 17:55:13,591][25689] Fps is (10 sec: 5717.6, 60 sec: 5698.9, 300 sec: 5654.7). Total num frames: 365624320. Throughput: 0: 5855.0. Samples: 365624452. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:13,591][25689] Avg episode reward: [(0, '-47.920')] [2022-07-09 17:55:14,899][26022] Updated weights on worker 0-0, policy_version 357062 (0.00089) [2022-07-09 17:55:16,639][26022] Updated weights on worker 0-0, policy_version 357072 (0.00085) [2022-07-09 17:55:18,623][25689] Fps is (10 sec: 5608.1, 60 sec: 5664.0, 300 sec: 5648.7). Total num frames: 365650944. Throughput: 0: 5851.6. Samples: 365658976. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:18,623][25689] Avg episode reward: [(0, '-47.111')] [2022-07-09 17:55:18,631][26022] Updated weights on worker 0-0, policy_version 357082 (0.00124) [2022-07-09 17:55:19,239][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:55:19,260][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000357086_365656064.pth [2022-07-09 17:55:19,260][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000355097_363619328.pth [2022-07-09 17:55:20,313][26022] Updated weights on worker 0-0, policy_version 357092 (0.00088) [2022-07-09 17:55:22,195][26022] Updated weights on worker 0-0, policy_version 357102 (0.01187) [2022-07-09 17:55:23,710][25689] Fps is (10 sec: 5565.5, 60 sec: 5660.2, 300 sec: 5650.6). Total num frames: 365680640. Throughput: 0: 5090.6. Samples: 365675776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:23,716][25689] Avg episode reward: [(0, '-47.795')] [2022-07-09 17:55:24,123][26022] Updated weights on worker 0-0, policy_version 357112 (0.00086) [2022-07-09 17:55:25,603][26022] Updated weights on worker 0-0, policy_version 357122 (0.00093) [2022-07-09 17:55:27,573][26022] Updated weights on worker 0-0, policy_version 357132 (0.00090) [2022-07-09 17:55:28,788][25689] Fps is (10 sec: 5843.0, 60 sec: 5670.3, 300 sec: 5649.4). Total num frames: 365710336. Throughput: 0: 5936.6. Samples: 365710214. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:28,788][25689] Avg episode reward: [(0, '-48.271')] [2022-07-09 17:55:29,423][26022] Updated weights on worker 0-0, policy_version 357142 (0.00086) [2022-07-09 17:55:31,075][26022] Updated weights on worker 0-0, policy_version 357152 (0.00085) [2022-07-09 17:55:33,042][26022] Updated weights on worker 0-0, policy_version 357162 (0.00087) [2022-07-09 17:55:33,807][25689] Fps is (10 sec: 5679.5, 60 sec: 5655.4, 300 sec: 5653.0). Total num frames: 365737984. Throughput: 0: 5949.0. Samples: 365744718. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:33,808][25689] Avg episode reward: [(0, '-49.176')] [2022-07-09 17:55:34,395][26022] Updated weights on worker 0-0, policy_version 357172 (0.00087) [2022-07-09 17:55:36,520][26022] Updated weights on worker 0-0, policy_version 357182 (0.00090) [2022-07-09 17:55:38,229][26022] Updated weights on worker 0-0, policy_version 357192 (0.00088) [2022-07-09 17:55:38,814][25689] Fps is (10 sec: 5514.8, 60 sec: 5638.5, 300 sec: 5646.7). Total num frames: 365765632. Throughput: 0: 5095.6. Samples: 365761864. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:38,815][25689] Avg episode reward: [(0, '-48.194')] [2022-07-09 17:55:40,143][26022] Updated weights on worker 0-0, policy_version 357202 (0.00095) [2022-07-09 17:55:42,008][26022] Updated weights on worker 0-0, policy_version 357212 (0.00093) [2022-07-09 17:55:43,698][26022] Updated weights on worker 0-0, policy_version 357222 (0.00093) [2022-07-09 17:55:43,935][25689] Fps is (10 sec: 5763.1, 60 sec: 5668.8, 300 sec: 5651.4). Total num frames: 365796352. Throughput: 0: 5960.4. Samples: 365796322. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:43,935][25689] Avg episode reward: [(0, '-48.657')] [2022-07-09 17:55:45,438][26022] Updated weights on worker 0-0, policy_version 357232 (0.00090) [2022-07-09 17:55:47,249][26022] Updated weights on worker 0-0, policy_version 357242 (0.00087) [2022-07-09 17:55:48,867][26022] Updated weights on worker 0-0, policy_version 357252 (0.00087) [2022-07-09 17:55:48,963][25689] Fps is (10 sec: 5953.2, 60 sec: 5672.9, 300 sec: 5651.0). Total num frames: 365826048. Throughput: 0: 5984.2. Samples: 365830948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:48,963][25689] Avg episode reward: [(0, '-48.988')] [2022-07-09 17:55:50,967][26022] Updated weights on worker 0-0, policy_version 357262 (0.00094) [2022-07-09 17:55:52,517][26022] Updated weights on worker 0-0, policy_version 357272 (0.00086) [2022-07-09 17:55:54,009][25689] Fps is (10 sec: 5692.0, 60 sec: 5670.1, 300 sec: 5650.4). Total num frames: 365853696. Throughput: 0: 5109.8. Samples: 365847948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:54,010][25689] Avg episode reward: [(0, '-48.012')] [2022-07-09 17:55:54,442][26022] Updated weights on worker 0-0, policy_version 357282 (0.00088) [2022-07-09 17:55:56,155][26022] Updated weights on worker 0-0, policy_version 357292 (0.00082) [2022-07-09 17:55:57,844][26022] Updated weights on worker 0-0, policy_version 357302 (0.00090) [2022-07-09 17:55:59,107][25689] Fps is (10 sec: 5652.9, 60 sec: 5678.4, 300 sec: 5656.2). Total num frames: 365883392. Throughput: 0: 5943.3. Samples: 365882470. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:55:59,108][25689] Avg episode reward: [(0, '-47.705')] [2022-07-09 17:55:59,691][26022] Updated weights on worker 0-0, policy_version 357312 (0.00088) [2022-07-09 17:56:01,748][26022] Updated weights on worker 0-0, policy_version 357322 (0.00086) [2022-07-09 17:56:03,682][26022] Updated weights on worker 0-0, policy_version 357332 (0.00091) [2022-07-09 17:56:04,165][25689] Fps is (10 sec: 5646.1, 60 sec: 5677.2, 300 sec: 5655.5). Total num frames: 365911040. Throughput: 0: 5858.3. Samples: 365914840. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:04,166][25689] Avg episode reward: [(0, '-47.354')] [2022-07-09 17:56:05,520][26022] Updated weights on worker 0-0, policy_version 357342 (0.00083) [2022-07-09 17:56:07,166][26022] Updated weights on worker 0-0, policy_version 357352 (0.00086) [2022-07-09 17:56:08,967][26022] Updated weights on worker 0-0, policy_version 357362 (0.00093) [2022-07-09 17:56:09,202][25689] Fps is (10 sec: 5579.0, 60 sec: 5692.5, 300 sec: 5651.5). Total num frames: 365939712. Throughput: 0: 4991.5. Samples: 365931972. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:09,203][25689] Avg episode reward: [(0, '-48.689')] [2022-07-09 17:56:10,875][26022] Updated weights on worker 0-0, policy_version 357372 (0.00088) [2022-07-09 17:56:12,520][26022] Updated weights on worker 0-0, policy_version 357382 (0.00045) [2022-07-09 17:56:14,214][25689] Fps is (10 sec: 5604.9, 60 sec: 5658.6, 300 sec: 5654.8). Total num frames: 365967360. Throughput: 0: 5845.2. Samples: 365966048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:14,214][25689] Avg episode reward: [(0, '-48.260')] [2022-07-09 17:56:14,592][26022] Updated weights on worker 0-0, policy_version 357392 (0.00082) [2022-07-09 17:56:15,941][26022] Updated weights on worker 0-0, policy_version 357402 (0.00082) [2022-07-09 17:56:18,002][26022] Updated weights on worker 0-0, policy_version 357412 (0.00089) [2022-07-09 17:56:19,234][25689] Fps is (10 sec: 5614.4, 60 sec: 5693.5, 300 sec: 5653.5). Total num frames: 365996032. Throughput: 0: 5867.6. Samples: 366000564. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:19,234][25689] Avg episode reward: [(0, '-48.326')] [2022-07-09 17:56:19,743][26022] Updated weights on worker 0-0, policy_version 357422 (0.00096) [2022-07-09 17:56:21,622][26022] Updated weights on worker 0-0, policy_version 357432 (0.00092) [2022-07-09 17:56:23,400][26022] Updated weights on worker 0-0, policy_version 357442 (0.00092) [2022-07-09 17:56:24,281][25689] Fps is (10 sec: 5696.3, 60 sec: 5680.4, 300 sec: 5652.6). Total num frames: 366024704. Throughput: 0: 5966.8. Samples: 366034864. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:24,281][25689] Avg episode reward: [(0, '-48.341')] [2022-07-09 17:56:25,191][26022] Updated weights on worker 0-0, policy_version 357452 (0.00090) [2022-07-09 17:56:26,836][26022] Updated weights on worker 0-0, policy_version 357462 (0.00085) [2022-07-09 17:56:28,772][26022] Updated weights on worker 0-0, policy_version 357472 (0.00089) [2022-07-09 17:56:29,289][25689] Fps is (10 sec: 5703.0, 60 sec: 5670.0, 300 sec: 5657.3). Total num frames: 366053376. Throughput: 0: 5972.0. Samples: 366051928. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:29,289][25689] Avg episode reward: [(0, '-48.059')] [2022-07-09 17:56:30,547][26022] Updated weights on worker 0-0, policy_version 357482 (0.00079) [2022-07-09 17:56:32,303][26022] Updated weights on worker 0-0, policy_version 357492 (0.00098) [2022-07-09 17:56:34,306][25689] Fps is (10 sec: 5617.9, 60 sec: 5670.2, 300 sec: 5654.0). Total num frames: 366081024. Throughput: 0: 5994.6. Samples: 366086492. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:34,308][25689] Avg episode reward: [(0, '-46.888')] [2022-07-09 17:56:34,314][26022] Updated weights on worker 0-0, policy_version 357502 (0.00082) [2022-07-09 17:56:35,629][26022] Updated weights on worker 0-0, policy_version 357512 (0.00093) [2022-07-09 17:56:37,720][26022] Updated weights on worker 0-0, policy_version 357522 (0.00096) [2022-07-09 17:56:39,321][25689] Fps is (10 sec: 5818.0, 60 sec: 5720.2, 300 sec: 5659.6). Total num frames: 366111744. Throughput: 0: 5984.4. Samples: 366120776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 17:56:39,321][25689] Avg episode reward: [(0, '-46.283')] [2022-07-09 17:56:39,685][26022] Updated weights on worker 0-0, policy_version 357532 (0.00087) [2022-07-09 17:56:41,216][26022] Updated weights on worker 0-0, policy_version 357542 (0.00088) [2022-07-09 17:56:43,078][26022] Updated weights on worker 0-0, policy_version 357552 (0.00091) [2022-07-09 17:56:44,373][25689] Fps is (10 sec: 5899.6, 60 sec: 5692.8, 300 sec: 5662.8). Total num frames: 366140416. Throughput: 0: 5134.7. Samples: 366138032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:56:44,374][25689] Avg episode reward: [(0, '-46.308')] [2022-07-09 17:56:44,889][26022] Updated weights on worker 0-0, policy_version 357562 (0.00084) [2022-07-09 17:56:46,545][26022] Updated weights on worker 0-0, policy_version 357572 (0.00093) [2022-07-09 17:56:48,419][26022] Updated weights on worker 0-0, policy_version 357582 (0.00087) [2022-07-09 17:56:49,406][25689] Fps is (10 sec: 5686.1, 60 sec: 5675.4, 300 sec: 5662.4). Total num frames: 366169088. Throughput: 0: 6002.2. Samples: 366172676. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:56:49,406][25689] Avg episode reward: [(0, '-45.846')] [2022-07-09 17:56:49,973][26022] Updated weights on worker 0-0, policy_version 357592 (0.00106) [2022-07-09 17:56:52,145][26022] Updated weights on worker 0-0, policy_version 357602 (0.00092) [2022-07-09 17:56:53,882][26022] Updated weights on worker 0-0, policy_version 357612 (0.00092) [2022-07-09 17:56:54,423][25689] Fps is (10 sec: 5603.7, 60 sec: 5678.1, 300 sec: 5659.5). Total num frames: 366196736. Throughput: 0: 5978.2. Samples: 366206758. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:56:54,424][25689] Avg episode reward: [(0, '-47.043')] [2022-07-09 17:56:55,568][26022] Updated weights on worker 0-0, policy_version 357622 (0.00090) [2022-07-09 17:56:57,488][26022] Updated weights on worker 0-0, policy_version 357632 (0.00084) [2022-07-09 17:56:59,126][26022] Updated weights on worker 0-0, policy_version 357642 (0.00086) [2022-07-09 17:56:59,439][25689] Fps is (10 sec: 5715.3, 60 sec: 5685.9, 300 sec: 5667.8). Total num frames: 366226432. Throughput: 0: 5121.7. Samples: 366223816. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:56:59,440][25689] Avg episode reward: [(0, '-47.503')] [2022-07-09 17:57:00,938][26022] Updated weights on worker 0-0, policy_version 357652 (0.00091) [2022-07-09 17:57:03,356][26022] Updated weights on worker 0-0, policy_version 357662 (0.00087) [2022-07-09 17:57:04,528][25689] Fps is (10 sec: 5675.0, 60 sec: 5683.0, 300 sec: 5667.5). Total num frames: 366254080. Throughput: 0: 5859.3. Samples: 366256126. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:04,529][25689] Avg episode reward: [(0, '-48.148')] [2022-07-09 17:57:04,835][26022] Updated weights on worker 0-0, policy_version 357672 (0.00104) [2022-07-09 17:57:06,940][26022] Updated weights on worker 0-0, policy_version 357682 (0.00090) [2022-07-09 17:57:08,599][26022] Updated weights on worker 0-0, policy_version 357692 (0.00050) [2022-07-09 17:57:09,589][25689] Fps is (10 sec: 5347.1, 60 sec: 5646.8, 300 sec: 5656.2). Total num frames: 366280704. Throughput: 0: 5821.7. Samples: 366290178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:09,590][25689] Avg episode reward: [(0, '-48.241')] [2022-07-09 17:57:10,451][26022] Updated weights on worker 0-0, policy_version 357702 (0.00082) [2022-07-09 17:57:12,170][26022] Updated weights on worker 0-0, policy_version 357712 (0.00088) [2022-07-09 17:57:13,973][26022] Updated weights on worker 0-0, policy_version 357722 (0.00092) [2022-07-09 17:57:14,614][25689] Fps is (10 sec: 5482.5, 60 sec: 5662.5, 300 sec: 5656.9). Total num frames: 366309376. Throughput: 0: 4975.2. Samples: 366307210. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:14,615][25689] Avg episode reward: [(0, '-48.689')] [2022-07-09 17:57:15,773][26022] Updated weights on worker 0-0, policy_version 357732 (0.00091) [2022-07-09 17:57:17,755][26022] Updated weights on worker 0-0, policy_version 357742 (0.00090) [2022-07-09 17:57:19,296][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:57:19,315][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000357752_366338048.pth [2022-07-09 17:57:19,316][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000355761_364299264.pth [2022-07-09 17:57:19,327][26022] Updated weights on worker 0-0, policy_version 357752 (0.00092) [2022-07-09 17:57:19,637][25689] Fps is (10 sec: 5910.8, 60 sec: 5696.1, 300 sec: 5667.8). Total num frames: 366340096. Throughput: 0: 5829.4. Samples: 366341558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:19,638][25689] Avg episode reward: [(0, '-48.508')] [2022-07-09 17:57:21,506][26022] Updated weights on worker 0-0, policy_version 357762 (0.00089) [2022-07-09 17:57:23,022][26022] Updated weights on worker 0-0, policy_version 357772 (0.00086) [2022-07-09 17:57:24,677][25689] Fps is (10 sec: 5698.3, 60 sec: 5662.8, 300 sec: 5658.3). Total num frames: 366366720. Throughput: 0: 5932.1. Samples: 366375652. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:24,678][25689] Avg episode reward: [(0, '-48.627')] [2022-07-09 17:57:24,986][26022] Updated weights on worker 0-0, policy_version 357782 (0.00090) [2022-07-09 17:57:26,627][26022] Updated weights on worker 0-0, policy_version 357792 (0.00083) [2022-07-09 17:57:28,523][26022] Updated weights on worker 0-0, policy_version 357802 (0.00084) [2022-07-09 17:57:29,697][25689] Fps is (10 sec: 5496.9, 60 sec: 5661.8, 300 sec: 5662.1). Total num frames: 366395392. Throughput: 0: 5094.3. Samples: 366392612. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:29,697][25689] Avg episode reward: [(0, '-48.753')] [2022-07-09 17:57:30,223][26022] Updated weights on worker 0-0, policy_version 357812 (0.00092) [2022-07-09 17:57:32,049][26022] Updated weights on worker 0-0, policy_version 357822 (0.00089) [2022-07-09 17:57:33,785][26022] Updated weights on worker 0-0, policy_version 357832 (0.00083) [2022-07-09 17:57:34,714][25689] Fps is (10 sec: 5815.3, 60 sec: 5695.6, 300 sec: 5665.7). Total num frames: 366425088. Throughput: 0: 5954.5. Samples: 366426898. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:34,715][25689] Avg episode reward: [(0, '-48.646')] [2022-07-09 17:57:35,868][26022] Updated weights on worker 0-0, policy_version 357842 (0.00090) [2022-07-09 17:57:37,282][26022] Updated weights on worker 0-0, policy_version 357852 (0.00077) [2022-07-09 17:57:39,399][26022] Updated weights on worker 0-0, policy_version 357862 (0.00084) [2022-07-09 17:57:39,737][25689] Fps is (10 sec: 5813.6, 60 sec: 5661.0, 300 sec: 5662.8). Total num frames: 366453760. Throughput: 0: 5938.9. Samples: 366460926. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:39,737][25689] Avg episode reward: [(0, '-48.293')] [2022-07-09 17:57:41,036][26022] Updated weights on worker 0-0, policy_version 357872 (0.00088) [2022-07-09 17:57:42,834][26022] Updated weights on worker 0-0, policy_version 357882 (0.00088) [2022-07-09 17:57:44,834][25689] Fps is (10 sec: 5565.4, 60 sec: 5639.9, 300 sec: 5661.1). Total num frames: 366481408. Throughput: 0: 5081.6. Samples: 366478080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:44,836][25689] Avg episode reward: [(0, '-48.635')] [2022-07-09 17:57:44,841][26022] Updated weights on worker 0-0, policy_version 357892 (0.00094) [2022-07-09 17:57:46,529][26022] Updated weights on worker 0-0, policy_version 357902 (0.00084) [2022-07-09 17:57:48,297][26022] Updated weights on worker 0-0, policy_version 357912 (0.00087) [2022-07-09 17:57:49,879][25689] Fps is (10 sec: 5553.1, 60 sec: 5638.8, 300 sec: 5660.6). Total num frames: 366510080. Throughput: 0: 5923.5. Samples: 366512160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:49,879][25689] Avg episode reward: [(0, '-48.346')] [2022-07-09 17:57:50,145][26022] Updated weights on worker 0-0, policy_version 357922 (0.00085) [2022-07-09 17:57:51,679][26022] Updated weights on worker 0-0, policy_version 357932 (0.00090) [2022-07-09 17:57:53,875][26022] Updated weights on worker 0-0, policy_version 357942 (0.00086) [2022-07-09 17:57:54,902][25689] Fps is (10 sec: 5797.4, 60 sec: 5672.1, 300 sec: 5664.3). Total num frames: 366539776. Throughput: 0: 5931.7. Samples: 366546644. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:54,902][25689] Avg episode reward: [(0, '-47.729')] [2022-07-09 17:57:55,333][26022] Updated weights on worker 0-0, policy_version 357952 (0.00090) [2022-07-09 17:57:57,382][26022] Updated weights on worker 0-0, policy_version 357962 (0.00082) [2022-07-09 17:57:58,958][26022] Updated weights on worker 0-0, policy_version 357972 (0.00088) [2022-07-09 17:57:59,918][25689] Fps is (10 sec: 5813.9, 60 sec: 5655.1, 300 sec: 5668.5). Total num frames: 366568448. Throughput: 0: 5105.9. Samples: 366563970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:57:59,919][25689] Avg episode reward: [(0, '-47.377')] [2022-07-09 17:58:00,849][26022] Updated weights on worker 0-0, policy_version 357982 (0.00109) [2022-07-09 17:58:02,891][26022] Updated weights on worker 0-0, policy_version 357992 (0.00095) [2022-07-09 17:58:04,916][26022] Updated weights on worker 0-0, policy_version 358002 (0.00096) [2022-07-09 17:58:05,004][25689] Fps is (10 sec: 5372.2, 60 sec: 5621.5, 300 sec: 5660.2). Total num frames: 366594048. Throughput: 0: 5848.5. Samples: 366596046. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:05,005][25689] Avg episode reward: [(0, '-48.034')] [2022-07-09 17:58:06,543][26022] Updated weights on worker 0-0, policy_version 358012 (0.00093) [2022-07-09 17:58:08,494][26022] Updated weights on worker 0-0, policy_version 358022 (0.00081) [2022-07-09 17:58:10,029][25689] Fps is (10 sec: 5469.2, 60 sec: 5675.8, 300 sec: 5663.7). Total num frames: 366623744. Throughput: 0: 5855.1. Samples: 366630140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:10,029][25689] Avg episode reward: [(0, '-49.083')] [2022-07-09 17:58:10,042][26022] Updated weights on worker 0-0, policy_version 358032 (0.00086) [2022-07-09 17:58:12,135][26022] Updated weights on worker 0-0, policy_version 358042 (0.00088) [2022-07-09 17:58:13,607][26022] Updated weights on worker 0-0, policy_version 358052 (0.00085) [2022-07-09 17:58:15,087][25689] Fps is (10 sec: 5687.4, 60 sec: 5655.7, 300 sec: 5656.1). Total num frames: 366651392. Throughput: 0: 4985.8. Samples: 366647282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:15,087][25689] Avg episode reward: [(0, '-49.335')] [2022-07-09 17:58:15,744][26022] Updated weights on worker 0-0, policy_version 358062 (0.00088) [2022-07-09 17:58:17,306][26022] Updated weights on worker 0-0, policy_version 358072 (0.00095) [2022-07-09 17:58:19,182][26022] Updated weights on worker 0-0, policy_version 358082 (0.00102) [2022-07-09 17:58:20,176][25689] Fps is (10 sec: 5651.2, 60 sec: 5632.6, 300 sec: 5662.3). Total num frames: 366681088. Throughput: 0: 5816.1. Samples: 366681792. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:20,176][25689] Avg episode reward: [(0, '-49.084')] [2022-07-09 17:58:20,717][26022] Updated weights on worker 0-0, policy_version 358092 (0.00087) [2022-07-09 17:58:22,943][26022] Updated weights on worker 0-0, policy_version 358102 (0.00087) [2022-07-09 17:58:24,534][26022] Updated weights on worker 0-0, policy_version 358112 (0.00090) [2022-07-09 17:58:25,292][25689] Fps is (10 sec: 5819.6, 60 sec: 5676.2, 300 sec: 5663.6). Total num frames: 366710784. Throughput: 0: 5903.2. Samples: 366715812. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:25,293][25689] Avg episode reward: [(0, '-49.672')] [2022-07-09 17:58:26,483][26022] Updated weights on worker 0-0, policy_version 358122 (0.00088) [2022-07-09 17:58:27,973][26022] Updated weights on worker 0-0, policy_version 358132 (0.00090) [2022-07-09 17:58:29,943][26022] Updated weights on worker 0-0, policy_version 358142 (0.00089) [2022-07-09 17:58:30,318][25689] Fps is (10 sec: 5654.0, 60 sec: 5658.7, 300 sec: 5659.8). Total num frames: 366738432. Throughput: 0: 5062.0. Samples: 366732846. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:30,319][25689] Avg episode reward: [(0, '-49.587')] [2022-07-09 17:58:31,741][26022] Updated weights on worker 0-0, policy_version 358152 (0.00092) [2022-07-09 17:58:33,686][26022] Updated weights on worker 0-0, policy_version 358162 (0.00081) [2022-07-09 17:58:35,173][26022] Updated weights on worker 0-0, policy_version 358172 (0.00093) [2022-07-09 17:58:35,329][25689] Fps is (10 sec: 5713.3, 60 sec: 5659.3, 300 sec: 5663.4). Total num frames: 366768128. Throughput: 0: 5921.9. Samples: 366767156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:35,330][25689] Avg episode reward: [(0, '-49.756')] [2022-07-09 17:58:37,252][26022] Updated weights on worker 0-0, policy_version 358182 (0.00091) [2022-07-09 17:58:38,769][26022] Updated weights on worker 0-0, policy_version 358192 (0.00095) [2022-07-09 17:58:40,349][25689] Fps is (10 sec: 5716.6, 60 sec: 5642.7, 300 sec: 5667.5). Total num frames: 366795776. Throughput: 0: 5930.3. Samples: 366801426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:40,350][25689] Avg episode reward: [(0, '-49.108')] [2022-07-09 17:58:40,901][26022] Updated weights on worker 0-0, policy_version 358202 (0.00094) [2022-07-09 17:58:42,563][26022] Updated weights on worker 0-0, policy_version 358212 (0.00086) [2022-07-09 17:58:44,383][26022] Updated weights on worker 0-0, policy_version 358222 (0.00088) [2022-07-09 17:58:45,412][25689] Fps is (10 sec: 5586.0, 60 sec: 5662.8, 300 sec: 5659.5). Total num frames: 366824448. Throughput: 0: 5103.9. Samples: 366818498. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:45,412][25689] Avg episode reward: [(0, '-47.908')] [2022-07-09 17:58:46,100][26022] Updated weights on worker 0-0, policy_version 358232 (0.00084) [2022-07-09 17:58:48,002][26022] Updated weights on worker 0-0, policy_version 358242 (0.00095) [2022-07-09 17:58:49,617][26022] Updated weights on worker 0-0, policy_version 358252 (0.00091) [2022-07-09 17:58:50,416][25689] Fps is (10 sec: 5797.8, 60 sec: 5683.5, 300 sec: 5666.3). Total num frames: 366854144. Throughput: 0: 5963.1. Samples: 366852694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:50,417][25689] Avg episode reward: [(0, '-48.285')] [2022-07-09 17:58:51,799][26022] Updated weights on worker 0-0, policy_version 358262 (0.00093) [2022-07-09 17:58:53,405][26022] Updated weights on worker 0-0, policy_version 358272 (0.00094) [2022-07-09 17:58:55,438][25689] Fps is (10 sec: 5515.0, 60 sec: 5615.9, 300 sec: 5659.5). Total num frames: 366879744. Throughput: 0: 5927.8. Samples: 366886356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:58:55,439][25689] Avg episode reward: [(0, '-47.465')] [2022-07-09 17:58:55,458][26022] Updated weights on worker 0-0, policy_version 358282 (0.00090) [2022-07-09 17:58:57,078][26022] Updated weights on worker 0-0, policy_version 358292 (0.00092) [2022-07-09 17:58:58,749][26022] Updated weights on worker 0-0, policy_version 358302 (0.00082) [2022-07-09 17:59:00,441][25689] Fps is (10 sec: 5413.7, 60 sec: 5617.2, 300 sec: 5667.1). Total num frames: 366908416. Throughput: 0: 5078.9. Samples: 366903472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:59:00,443][25689] Avg episode reward: [(0, '-47.644')] [2022-07-09 17:59:00,746][26022] Updated weights on worker 0-0, policy_version 358312 (0.00086) [2022-07-09 17:59:02,656][26022] Updated weights on worker 0-0, policy_version 358322 (0.00087) [2022-07-09 17:59:04,656][26022] Updated weights on worker 0-0, policy_version 358332 (0.00085) [2022-07-09 17:59:05,482][25689] Fps is (10 sec: 5709.1, 60 sec: 5672.1, 300 sec: 5663.0). Total num frames: 366937088. Throughput: 0: 5835.8. Samples: 366935626. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:59:05,483][25689] Avg episode reward: [(0, '-47.671')] [2022-07-09 17:59:06,371][26022] Updated weights on worker 0-0, policy_version 358342 (0.00090) [2022-07-09 17:59:08,315][26022] Updated weights on worker 0-0, policy_version 358352 (0.00085) [2022-07-09 17:59:09,984][26022] Updated weights on worker 0-0, policy_version 358362 (0.00088) [2022-07-09 17:59:10,493][25689] Fps is (10 sec: 5602.8, 60 sec: 5639.5, 300 sec: 5666.4). Total num frames: 366964736. Throughput: 0: 5838.3. Samples: 366969910. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:59:10,494][25689] Avg episode reward: [(0, '-47.635')] [2022-07-09 17:59:11,683][26022] Updated weights on worker 0-0, policy_version 358372 (0.00089) [2022-07-09 17:59:13,503][26022] Updated weights on worker 0-0, policy_version 358382 (0.00081) [2022-07-09 17:59:15,513][25689] Fps is (10 sec: 5512.7, 60 sec: 5643.1, 300 sec: 5663.0). Total num frames: 366992384. Throughput: 0: 5017.4. Samples: 366987080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 17:59:15,521][25689] Avg episode reward: [(0, '-48.089')] [2022-07-09 17:59:15,530][26022] Updated weights on worker 0-0, policy_version 358392 (0.00099) [2022-07-09 17:59:17,136][26022] Updated weights on worker 0-0, policy_version 358402 (0.00091) [2022-07-09 17:59:18,992][26022] Updated weights on worker 0-0, policy_version 358412 (0.00100) [2022-07-09 17:59:19,428][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 17:59:19,442][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000358415_367016960.pth [2022-07-09 17:59:19,442][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000356423_364977152.pth [2022-07-09 17:59:20,542][25689] Fps is (10 sec: 5706.6, 60 sec: 5648.7, 300 sec: 5663.3). Total num frames: 367022080. Throughput: 0: 5873.5. Samples: 367021534. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:20,544][25689] Avg episode reward: [(0, '-48.534')] [2022-07-09 17:59:20,982][26022] Updated weights on worker 0-0, policy_version 358422 (0.00080) [2022-07-09 17:59:22,530][26022] Updated weights on worker 0-0, policy_version 358432 (0.00095) [2022-07-09 17:59:24,616][26022] Updated weights on worker 0-0, policy_version 358442 (0.00093) [2022-07-09 17:59:25,610][25689] Fps is (10 sec: 5882.4, 60 sec: 5653.3, 300 sec: 5665.6). Total num frames: 367051776. Throughput: 0: 5964.2. Samples: 367055668. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:25,610][25689] Avg episode reward: [(0, '-47.791')] [2022-07-09 17:59:26,123][26022] Updated weights on worker 0-0, policy_version 358452 (0.00086) [2022-07-09 17:59:27,952][26022] Updated weights on worker 0-0, policy_version 358462 (0.00089) [2022-07-09 17:59:29,737][26022] Updated weights on worker 0-0, policy_version 358472 (0.00094) [2022-07-09 17:59:30,648][25689] Fps is (10 sec: 5775.8, 60 sec: 5669.1, 300 sec: 5665.6). Total num frames: 367080448. Throughput: 0: 5109.7. Samples: 367072892. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:30,648][25689] Avg episode reward: [(0, '-47.996')] [2022-07-09 17:59:31,644][26022] Updated weights on worker 0-0, policy_version 358482 (0.00090) [2022-07-09 17:59:33,311][26022] Updated weights on worker 0-0, policy_version 358492 (0.00096) [2022-07-09 17:59:35,221][26022] Updated weights on worker 0-0, policy_version 358502 (0.00097) [2022-07-09 17:59:35,717][25689] Fps is (10 sec: 5673.7, 60 sec: 5646.8, 300 sec: 5664.4). Total num frames: 367109120. Throughput: 0: 5950.2. Samples: 367107294. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:35,717][25689] Avg episode reward: [(0, '-48.964')] [2022-07-09 17:59:36,694][26022] Updated weights on worker 0-0, policy_version 358512 (0.00087) [2022-07-09 17:59:38,683][26022] Updated weights on worker 0-0, policy_version 358522 (0.00092) [2022-07-09 17:59:40,513][26022] Updated weights on worker 0-0, policy_version 358532 (0.00086) [2022-07-09 17:59:40,737][25689] Fps is (10 sec: 5683.6, 60 sec: 5663.6, 300 sec: 5665.6). Total num frames: 367137792. Throughput: 0: 5953.3. Samples: 367141762. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:40,738][25689] Avg episode reward: [(0, '-48.758')] [2022-07-09 17:59:42,315][26022] Updated weights on worker 0-0, policy_version 358542 (0.00087) [2022-07-09 17:59:43,901][26022] Updated weights on worker 0-0, policy_version 358552 (0.00091) [2022-07-09 17:59:45,792][25689] Fps is (10 sec: 5691.7, 60 sec: 5664.4, 300 sec: 5662.5). Total num frames: 367166464. Throughput: 0: 5974.7. Samples: 367176250. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:45,792][25689] Avg episode reward: [(0, '-48.896')] [2022-07-09 17:59:45,975][26022] Updated weights on worker 0-0, policy_version 358562 (0.00084) [2022-07-09 17:59:47,586][26022] Updated weights on worker 0-0, policy_version 358572 (0.00089) [2022-07-09 17:59:49,503][26022] Updated weights on worker 0-0, policy_version 358582 (0.00088) [2022-07-09 17:59:50,815][25689] Fps is (10 sec: 5690.3, 60 sec: 5645.7, 300 sec: 5665.8). Total num frames: 367195136. Throughput: 0: 5979.1. Samples: 367193472. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:50,817][25689] Avg episode reward: [(0, '-48.761')] [2022-07-09 17:59:51,155][26022] Updated weights on worker 0-0, policy_version 358592 (0.00080) [2022-07-09 17:59:52,827][26022] Updated weights on worker 0-0, policy_version 358602 (0.00089) [2022-07-09 17:59:54,829][26022] Updated weights on worker 0-0, policy_version 358612 (0.00110) [2022-07-09 17:59:55,830][25689] Fps is (10 sec: 5712.5, 60 sec: 5697.2, 300 sec: 5665.7). Total num frames: 367223808. Throughput: 0: 5991.9. Samples: 367227812. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 17:59:55,832][25689] Avg episode reward: [(0, '-50.190')] [2022-07-09 17:59:56,446][26022] Updated weights on worker 0-0, policy_version 358622 (0.00426) [2022-07-09 17:59:58,330][26022] Updated weights on worker 0-0, policy_version 358632 (0.00094) [2022-07-09 18:00:00,178][26022] Updated weights on worker 0-0, policy_version 358642 (0.00092) [2022-07-09 18:00:00,863][25689] Fps is (10 sec: 5706.8, 60 sec: 5694.4, 300 sec: 5669.3). Total num frames: 367252480. Throughput: 0: 5984.3. Samples: 367262200. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:00,863][25689] Avg episode reward: [(0, '-49.202')] [2022-07-09 18:00:02,263][26022] Updated weights on worker 0-0, policy_version 358652 (0.00098) [2022-07-09 18:00:04,364][26022] Updated weights on worker 0-0, policy_version 358662 (0.00090) [2022-07-09 18:00:05,941][25689] Fps is (10 sec: 5570.3, 60 sec: 5674.0, 300 sec: 5668.2). Total num frames: 367280128. Throughput: 0: 5001.5. Samples: 367277026. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:05,942][25689] Avg episode reward: [(0, '-48.416')] [2022-07-09 18:00:05,942][26022] Updated weights on worker 0-0, policy_version 358672 (0.00089) [2022-07-09 18:00:07,915][26022] Updated weights on worker 0-0, policy_version 358682 (0.00084) [2022-07-09 18:00:09,632][26022] Updated weights on worker 0-0, policy_version 358692 (0.00083) [2022-07-09 18:00:10,945][25689] Fps is (10 sec: 5484.7, 60 sec: 5674.6, 300 sec: 5661.4). Total num frames: 367307776. Throughput: 0: 5842.5. Samples: 367311082. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:10,945][25689] Avg episode reward: [(0, '-47.510')] [2022-07-09 18:00:11,452][26022] Updated weights on worker 0-0, policy_version 358702 (0.00092) [2022-07-09 18:00:13,175][26022] Updated weights on worker 0-0, policy_version 358712 (0.00088) [2022-07-09 18:00:15,209][26022] Updated weights on worker 0-0, policy_version 358722 (0.00084) [2022-07-09 18:00:16,009][25689] Fps is (10 sec: 5695.6, 60 sec: 5704.3, 300 sec: 5671.2). Total num frames: 367337472. Throughput: 0: 5817.9. Samples: 367345212. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:16,010][25689] Avg episode reward: [(0, '-46.717')] [2022-07-09 18:00:16,730][26022] Updated weights on worker 0-0, policy_version 358732 (0.00096) [2022-07-09 18:00:18,784][26022] Updated weights on worker 0-0, policy_version 358742 (0.00086) [2022-07-09 18:00:20,251][26022] Updated weights on worker 0-0, policy_version 358752 (0.00096) [2022-07-09 18:00:21,044][25689] Fps is (10 sec: 5678.2, 60 sec: 5669.9, 300 sec: 5665.3). Total num frames: 367365120. Throughput: 0: 4954.9. Samples: 367362194. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:21,045][25689] Avg episode reward: [(0, '-45.348')] [2022-07-09 18:00:22,320][26022] Updated weights on worker 0-0, policy_version 358762 (0.00091) [2022-07-09 18:00:24,051][26022] Updated weights on worker 0-0, policy_version 358772 (0.00085) [2022-07-09 18:00:25,962][26022] Updated weights on worker 0-0, policy_version 358782 (0.00087) [2022-07-09 18:00:26,152][25689] Fps is (10 sec: 5552.7, 60 sec: 5649.2, 300 sec: 5661.3). Total num frames: 367393792. Throughput: 0: 5908.4. Samples: 367396442. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:26,154][25689] Avg episode reward: [(0, '-45.045')] [2022-07-09 18:00:27,680][26022] Updated weights on worker 0-0, policy_version 358792 (0.00397) [2022-07-09 18:00:29,478][26022] Updated weights on worker 0-0, policy_version 358802 (0.00080) [2022-07-09 18:00:31,195][25689] Fps is (10 sec: 5649.0, 60 sec: 5648.8, 300 sec: 5664.2). Total num frames: 367422464. Throughput: 0: 5895.6. Samples: 367430468. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:31,195][25689] Avg episode reward: [(0, '-45.824')] [2022-07-09 18:00:31,197][26022] Updated weights on worker 0-0, policy_version 358812 (0.00087) [2022-07-09 18:00:33,217][26022] Updated weights on worker 0-0, policy_version 358822 (0.00082) [2022-07-09 18:00:34,843][26022] Updated weights on worker 0-0, policy_version 358832 (0.00083) [2022-07-09 18:00:36,214][25689] Fps is (10 sec: 5597.4, 60 sec: 5636.5, 300 sec: 5664.0). Total num frames: 367450112. Throughput: 0: 5064.8. Samples: 367447544. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:36,214][25689] Avg episode reward: [(0, '-46.084')] [2022-07-09 18:00:36,828][26022] Updated weights on worker 0-0, policy_version 358842 (0.00088) [2022-07-09 18:00:38,295][26022] Updated weights on worker 0-0, policy_version 358852 (0.00084) [2022-07-09 18:00:40,357][26022] Updated weights on worker 0-0, policy_version 358862 (0.00089) [2022-07-09 18:00:41,220][25689] Fps is (10 sec: 5822.4, 60 sec: 5671.7, 300 sec: 5666.2). Total num frames: 367480832. Throughput: 0: 5936.7. Samples: 367481972. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:41,221][25689] Avg episode reward: [(0, '-46.267')] [2022-07-09 18:00:41,891][26022] Updated weights on worker 0-0, policy_version 358872 (0.00093) [2022-07-09 18:00:43,923][26022] Updated weights on worker 0-0, policy_version 358882 (0.00089) [2022-07-09 18:00:45,666][26022] Updated weights on worker 0-0, policy_version 358892 (0.00093) [2022-07-09 18:00:46,310][25689] Fps is (10 sec: 5882.6, 60 sec: 5668.4, 300 sec: 5661.6). Total num frames: 367509504. Throughput: 0: 5940.2. Samples: 367516184. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:46,310][25689] Avg episode reward: [(0, '-46.757')] [2022-07-09 18:00:47,359][26022] Updated weights on worker 0-0, policy_version 358902 (0.00090) [2022-07-09 18:00:49,290][26022] Updated weights on worker 0-0, policy_version 358912 (0.00060) [2022-07-09 18:00:50,838][26022] Updated weights on worker 0-0, policy_version 358922 (0.00083) [2022-07-09 18:00:51,351][25689] Fps is (10 sec: 5559.2, 60 sec: 5649.8, 300 sec: 5661.7). Total num frames: 367537152. Throughput: 0: 5104.7. Samples: 367533356. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:51,351][25689] Avg episode reward: [(0, '-47.125')] [2022-07-09 18:00:52,890][26022] Updated weights on worker 0-0, policy_version 358932 (0.00096) [2022-07-09 18:00:54,658][26022] Updated weights on worker 0-0, policy_version 358942 (0.00084) [2022-07-09 18:00:56,371][25689] Fps is (10 sec: 5598.0, 60 sec: 5649.4, 300 sec: 5659.7). Total num frames: 367565824. Throughput: 0: 5956.3. Samples: 367567604. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:00:56,371][25689] Avg episode reward: [(0, '-47.662')] [2022-07-09 18:00:56,418][26022] Updated weights on worker 0-0, policy_version 358952 (0.00080) [2022-07-09 18:00:58,272][26022] Updated weights on worker 0-0, policy_version 358962 (0.00087) [2022-07-09 18:00:59,955][26022] Updated weights on worker 0-0, policy_version 358972 (0.00085) [2022-07-09 18:01:01,389][25689] Fps is (10 sec: 5712.5, 60 sec: 5650.7, 300 sec: 5663.9). Total num frames: 367594496. Throughput: 0: 5953.5. Samples: 367602050. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:01,389][25689] Avg episode reward: [(0, '-47.795')] [2022-07-09 18:01:01,983][26022] Updated weights on worker 0-0, policy_version 358982 (0.00099) [2022-07-09 18:01:04,037][26022] Updated weights on worker 0-0, policy_version 358992 (0.00086) [2022-07-09 18:01:05,583][26022] Updated weights on worker 0-0, policy_version 359002 (0.00097) [2022-07-09 18:01:06,510][25689] Fps is (10 sec: 5554.8, 60 sec: 5646.7, 300 sec: 5658.9). Total num frames: 367622144. Throughput: 0: 4996.3. Samples: 367617108. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:06,510][25689] Avg episode reward: [(0, '-48.812')] [2022-07-09 18:01:07,523][26022] Updated weights on worker 0-0, policy_version 359012 (0.00088) [2022-07-09 18:01:09,319][26022] Updated weights on worker 0-0, policy_version 359022 (0.00088) [2022-07-09 18:01:11,223][26022] Updated weights on worker 0-0, policy_version 359032 (0.00086) [2022-07-09 18:01:11,518][25689] Fps is (10 sec: 5560.5, 60 sec: 5663.3, 300 sec: 5662.4). Total num frames: 367650816. Throughput: 0: 5857.9. Samples: 367651492. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:11,518][25689] Avg episode reward: [(0, '-48.660')] [2022-07-09 18:01:12,899][26022] Updated weights on worker 0-0, policy_version 359042 (0.00090) [2022-07-09 18:01:14,780][26022] Updated weights on worker 0-0, policy_version 359052 (0.00086) [2022-07-09 18:01:16,419][26022] Updated weights on worker 0-0, policy_version 359062 (0.00083) [2022-07-09 18:01:16,574][25689] Fps is (10 sec: 5697.6, 60 sec: 5647.1, 300 sec: 5661.7). Total num frames: 367679488. Throughput: 0: 5852.8. Samples: 367685852. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:16,575][25689] Avg episode reward: [(0, '-48.602')] [2022-07-09 18:01:18,286][26022] Updated weights on worker 0-0, policy_version 359072 (0.00095) [2022-07-09 18:01:19,786][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:01:19,799][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000359080_367697920.pth [2022-07-09 18:01:19,801][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000357086_365656064.pth [2022-07-09 18:01:20,078][26022] Updated weights on worker 0-0, policy_version 359082 (0.00096) [2022-07-09 18:01:21,608][25689] Fps is (10 sec: 5581.7, 60 sec: 5647.2, 300 sec: 5658.5). Total num frames: 367707136. Throughput: 0: 4994.3. Samples: 367703028. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:21,609][25689] Avg episode reward: [(0, '-48.082')] [2022-07-09 18:01:21,817][26022] Updated weights on worker 0-0, policy_version 359092 (0.00091) [2022-07-09 18:01:23,687][26022] Updated weights on worker 0-0, policy_version 359102 (0.00082) [2022-07-09 18:01:25,434][26022] Updated weights on worker 0-0, policy_version 359112 (0.00090) [2022-07-09 18:01:26,683][25689] Fps is (10 sec: 5672.7, 60 sec: 5667.2, 300 sec: 5660.7). Total num frames: 367736832. Throughput: 0: 5950.1. Samples: 367737142. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:26,684][25689] Avg episode reward: [(0, '-48.380')] [2022-07-09 18:01:27,199][26022] Updated weights on worker 0-0, policy_version 359122 (0.00084) [2022-07-09 18:01:29,219][26022] Updated weights on worker 0-0, policy_version 359132 (0.00088) [2022-07-09 18:01:30,740][26022] Updated weights on worker 0-0, policy_version 359142 (0.00087) [2022-07-09 18:01:31,689][25689] Fps is (10 sec: 5688.0, 60 sec: 5653.7, 300 sec: 5660.9). Total num frames: 367764480. Throughput: 0: 5923.8. Samples: 367770986. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:31,690][25689] Avg episode reward: [(0, '-48.067')] [2022-07-09 18:01:32,855][26022] Updated weights on worker 0-0, policy_version 359152 (0.00091) [2022-07-09 18:01:34,465][26022] Updated weights on worker 0-0, policy_version 359162 (0.00084) [2022-07-09 18:01:36,116][26022] Updated weights on worker 0-0, policy_version 359172 (0.00092) [2022-07-09 18:01:36,705][25689] Fps is (10 sec: 5823.8, 60 sec: 5704.7, 300 sec: 5660.9). Total num frames: 367795200. Throughput: 0: 5093.5. Samples: 367788390. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:36,706][25689] Avg episode reward: [(0, '-47.518')] [2022-07-09 18:01:38,194][26022] Updated weights on worker 0-0, policy_version 359182 (0.00091) [2022-07-09 18:01:39,815][26022] Updated weights on worker 0-0, policy_version 359192 (0.00085) [2022-07-09 18:01:41,661][26022] Updated weights on worker 0-0, policy_version 359202 (0.00089) [2022-07-09 18:01:41,727][25689] Fps is (10 sec: 5917.0, 60 sec: 5669.4, 300 sec: 5661.5). Total num frames: 367823872. Throughput: 0: 5952.1. Samples: 367822780. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:41,727][25689] Avg episode reward: [(0, '-47.835')] [2022-07-09 18:01:43,480][26022] Updated weights on worker 0-0, policy_version 359212 (0.00091) [2022-07-09 18:01:45,120][26022] Updated weights on worker 0-0, policy_version 359222 (0.00089) [2022-07-09 18:01:46,786][25689] Fps is (10 sec: 5485.4, 60 sec: 5638.5, 300 sec: 5654.1). Total num frames: 367850496. Throughput: 0: 5963.2. Samples: 367857020. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 18:01:46,786][25689] Avg episode reward: [(0, '-47.128')] [2022-07-09 18:01:47,095][26022] Updated weights on worker 0-0, policy_version 359232 (0.00081) [2022-07-09 18:01:48,852][26022] Updated weights on worker 0-0, policy_version 359242 (0.00086) [2022-07-09 18:01:50,582][26022] Updated weights on worker 0-0, policy_version 359252 (0.00089) [2022-07-09 18:01:51,803][25689] Fps is (10 sec: 5691.3, 60 sec: 5691.5, 300 sec: 5664.4). Total num frames: 367881216. Throughput: 0: 5121.7. Samples: 367874000. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:01:51,803][25689] Avg episode reward: [(0, '-46.790')] [2022-07-09 18:01:52,407][26022] Updated weights on worker 0-0, policy_version 359262 (0.00090) [2022-07-09 18:01:54,209][26022] Updated weights on worker 0-0, policy_version 359272 (0.00086) [2022-07-09 18:01:56,091][26022] Updated weights on worker 0-0, policy_version 359282 (0.00096) [2022-07-09 18:01:56,810][25689] Fps is (10 sec: 5823.0, 60 sec: 5675.8, 300 sec: 5657.7). Total num frames: 367908864. Throughput: 0: 5964.4. Samples: 367908300. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:01:56,810][25689] Avg episode reward: [(0, '-46.927')] [2022-07-09 18:01:57,803][26022] Updated weights on worker 0-0, policy_version 359292 (0.00090) [2022-07-09 18:01:59,684][26022] Updated weights on worker 0-0, policy_version 359302 (0.00094) [2022-07-09 18:02:01,589][26022] Updated weights on worker 0-0, policy_version 359312 (0.00092) [2022-07-09 18:02:01,817][25689] Fps is (10 sec: 5521.5, 60 sec: 5659.9, 300 sec: 5659.2). Total num frames: 367936512. Throughput: 0: 5939.5. Samples: 367942106. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:01,818][25689] Avg episode reward: [(0, '-46.565')] [2022-07-09 18:02:03,500][26022] Updated weights on worker 0-0, policy_version 359322 (0.00093) [2022-07-09 18:02:05,440][26022] Updated weights on worker 0-0, policy_version 359332 (0.00089) [2022-07-09 18:02:06,874][25689] Fps is (10 sec: 5494.2, 60 sec: 5665.9, 300 sec: 5662.8). Total num frames: 367964160. Throughput: 0: 4974.8. Samples: 367956954. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:06,874][25689] Avg episode reward: [(0, '-46.277')] [2022-07-09 18:02:07,302][26022] Updated weights on worker 0-0, policy_version 359342 (0.00086) [2022-07-09 18:02:08,891][26022] Updated weights on worker 0-0, policy_version 359352 (0.00495) [2022-07-09 18:02:11,034][26022] Updated weights on worker 0-0, policy_version 359362 (0.00086) [2022-07-09 18:02:11,951][25689] Fps is (10 sec: 5557.6, 60 sec: 5659.4, 300 sec: 5661.8). Total num frames: 367992832. Throughput: 0: 5811.1. Samples: 367991086. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:11,952][25689] Avg episode reward: [(0, '-46.573')] [2022-07-09 18:02:12,512][26022] Updated weights on worker 0-0, policy_version 359372 (0.00085) [2022-07-09 18:02:14,384][26022] Updated weights on worker 0-0, policy_version 359382 (0.00080) [2022-07-09 18:02:16,240][26022] Updated weights on worker 0-0, policy_version 359392 (0.00092) [2022-07-09 18:02:16,955][25689] Fps is (10 sec: 5688.1, 60 sec: 5664.3, 300 sec: 5655.2). Total num frames: 368021504. Throughput: 0: 5830.8. Samples: 368025766. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:16,956][25689] Avg episode reward: [(0, '-46.932')] [2022-07-09 18:02:17,927][26022] Updated weights on worker 0-0, policy_version 359402 (0.00088) [2022-07-09 18:02:19,723][26022] Updated weights on worker 0-0, policy_version 359412 (0.00085) [2022-07-09 18:02:21,555][26022] Updated weights on worker 0-0, policy_version 359422 (0.00091) [2022-07-09 18:02:21,967][25689] Fps is (10 sec: 5725.4, 60 sec: 5683.3, 300 sec: 5662.7). Total num frames: 368050176. Throughput: 0: 5010.2. Samples: 368043060. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:21,967][25689] Avg episode reward: [(0, '-46.418')] [2022-07-09 18:02:23,340][26022] Updated weights on worker 0-0, policy_version 359432 (0.00086) [2022-07-09 18:02:25,156][26022] Updated weights on worker 0-0, policy_version 359442 (0.00083) [2022-07-09 18:02:26,999][26022] Updated weights on worker 0-0, policy_version 359452 (0.00097) [2022-07-09 18:02:27,046][25689] Fps is (10 sec: 5784.5, 60 sec: 5683.0, 300 sec: 5665.0). Total num frames: 368079872. Throughput: 0: 5972.2. Samples: 368077424. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:27,046][25689] Avg episode reward: [(0, '-46.605')] [2022-07-09 18:02:28,583][26022] Updated weights on worker 0-0, policy_version 359462 (0.00088) [2022-07-09 18:02:30,604][26022] Updated weights on worker 0-0, policy_version 359472 (0.00084) [2022-07-09 18:02:32,127][25689] Fps is (10 sec: 5744.6, 60 sec: 5692.8, 300 sec: 5660.3). Total num frames: 368108544. Throughput: 0: 5975.4. Samples: 368111646. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:32,128][25689] Avg episode reward: [(0, '-47.172')] [2022-07-09 18:02:32,319][26022] Updated weights on worker 0-0, policy_version 359482 (0.00093) [2022-07-09 18:02:34,123][26022] Updated weights on worker 0-0, policy_version 359492 (0.00097) [2022-07-09 18:02:35,782][26022] Updated weights on worker 0-0, policy_version 359502 (0.00085) [2022-07-09 18:02:37,201][25689] Fps is (10 sec: 5747.3, 60 sec: 5670.4, 300 sec: 5662.8). Total num frames: 368138240. Throughput: 0: 5952.9. Samples: 368146290. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:37,202][25689] Avg episode reward: [(0, '-46.741')] [2022-07-09 18:02:37,566][26022] Updated weights on worker 0-0, policy_version 359512 (0.00090) [2022-07-09 18:02:39,553][26022] Updated weights on worker 0-0, policy_version 359522 (0.00094) [2022-07-09 18:02:41,124][26022] Updated weights on worker 0-0, policy_version 359532 (0.00089) [2022-07-09 18:02:42,221][25689] Fps is (10 sec: 5681.4, 60 sec: 5653.7, 300 sec: 5664.3). Total num frames: 368165888. Throughput: 0: 5946.4. Samples: 368163498. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:42,222][25689] Avg episode reward: [(0, '-46.323')] [2022-07-09 18:02:43,033][26022] Updated weights on worker 0-0, policy_version 359542 (0.00087) [2022-07-09 18:02:44,760][26022] Updated weights on worker 0-0, policy_version 359552 (0.00082) [2022-07-09 18:02:46,542][26022] Updated weights on worker 0-0, policy_version 359562 (0.00088) [2022-07-09 18:02:47,286][25689] Fps is (10 sec: 5584.8, 60 sec: 5687.0, 300 sec: 5663.9). Total num frames: 368194560. Throughput: 0: 5951.7. Samples: 368197888. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:47,290][25689] Avg episode reward: [(0, '-46.219')] [2022-07-09 18:02:48,306][26022] Updated weights on worker 0-0, policy_version 359572 (0.00086) [2022-07-09 18:02:50,128][26022] Updated weights on worker 0-0, policy_version 359582 (0.00094) [2022-07-09 18:02:51,912][26022] Updated weights on worker 0-0, policy_version 359592 (0.00089) [2022-07-09 18:02:52,298][25689] Fps is (10 sec: 5893.8, 60 sec: 5687.5, 300 sec: 5667.5). Total num frames: 368225280. Throughput: 0: 5981.3. Samples: 368232290. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:52,298][25689] Avg episode reward: [(0, '-46.436')] [2022-07-09 18:02:53,882][26022] Updated weights on worker 0-0, policy_version 359602 (0.00083) [2022-07-09 18:02:55,431][26022] Updated weights on worker 0-0, policy_version 359612 (0.00081) [2022-07-09 18:02:57,348][25689] Fps is (10 sec: 5698.9, 60 sec: 5666.4, 300 sec: 5660.0). Total num frames: 368251904. Throughput: 0: 5115.6. Samples: 368249354. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:02:57,349][25689] Avg episode reward: [(0, '-46.089')] [2022-07-09 18:02:57,395][26022] Updated weights on worker 0-0, policy_version 359622 (0.00088) [2022-07-09 18:02:59,138][26022] Updated weights on worker 0-0, policy_version 359632 (0.00104) [2022-07-09 18:03:00,976][26022] Updated weights on worker 0-0, policy_version 359642 (0.00093) [2022-07-09 18:03:02,378][25689] Fps is (10 sec: 5282.2, 60 sec: 5647.4, 300 sec: 5664.5). Total num frames: 368278528. Throughput: 0: 5952.8. Samples: 368283492. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:02,379][25689] Avg episode reward: [(0, '-46.278')] [2022-07-09 18:03:03,178][26022] Updated weights on worker 0-0, policy_version 359652 (0.00090) [2022-07-09 18:03:04,923][26022] Updated weights on worker 0-0, policy_version 359662 (0.00084) [2022-07-09 18:03:06,820][26022] Updated weights on worker 0-0, policy_version 359672 (0.00088) [2022-07-09 18:03:07,477][25689] Fps is (10 sec: 5661.8, 60 sec: 5694.3, 300 sec: 5666.6). Total num frames: 368309248. Throughput: 0: 5840.5. Samples: 368315812. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:07,477][25689] Avg episode reward: [(0, '-46.242')] [2022-07-09 18:03:08,583][26022] Updated weights on worker 0-0, policy_version 359682 (0.00082) [2022-07-09 18:03:10,428][26022] Updated weights on worker 0-0, policy_version 359692 (0.00085) [2022-07-09 18:03:12,195][26022] Updated weights on worker 0-0, policy_version 359702 (0.00112) [2022-07-09 18:03:12,526][25689] Fps is (10 sec: 5651.0, 60 sec: 5663.1, 300 sec: 5663.3). Total num frames: 368335872. Throughput: 0: 4985.2. Samples: 368333132. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:12,527][25689] Avg episode reward: [(0, '-46.634')] [2022-07-09 18:03:13,809][26022] Updated weights on worker 0-0, policy_version 359712 (0.00084) [2022-07-09 18:03:15,683][26022] Updated weights on worker 0-0, policy_version 359722 (0.00084) [2022-07-09 18:03:17,386][26022] Updated weights on worker 0-0, policy_version 359732 (0.00085) [2022-07-09 18:03:17,537][25689] Fps is (10 sec: 5597.9, 60 sec: 5679.3, 300 sec: 5664.8). Total num frames: 368365568. Throughput: 0: 5859.9. Samples: 368367660. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:17,539][25689] Avg episode reward: [(0, '-44.894')] [2022-07-09 18:03:19,335][26022] Updated weights on worker 0-0, policy_version 359742 (0.00086) [2022-07-09 18:03:19,841][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:03:19,862][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000359745_368378880.pth [2022-07-09 18:03:19,863][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000357752_366338048.pth [2022-07-09 18:03:20,954][26022] Updated weights on worker 0-0, policy_version 359752 (0.00087) [2022-07-09 18:03:22,600][25689] Fps is (10 sec: 5692.5, 60 sec: 5657.7, 300 sec: 5658.9). Total num frames: 368393216. Throughput: 0: 5860.1. Samples: 368401990. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:22,600][25689] Avg episode reward: [(0, '-45.782')] [2022-07-09 18:03:22,872][26022] Updated weights on worker 0-0, policy_version 359762 (0.00089) [2022-07-09 18:03:24,547][26022] Updated weights on worker 0-0, policy_version 359772 (0.00086) [2022-07-09 18:03:26,484][26022] Updated weights on worker 0-0, policy_version 359782 (0.00095) [2022-07-09 18:03:27,651][25689] Fps is (10 sec: 5771.5, 60 sec: 5677.2, 300 sec: 5668.8). Total num frames: 368423936. Throughput: 0: 5124.6. Samples: 368419196. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:27,651][25689] Avg episode reward: [(0, '-45.540')] [2022-07-09 18:03:28,197][26022] Updated weights on worker 0-0, policy_version 359792 (0.00090) [2022-07-09 18:03:29,991][26022] Updated weights on worker 0-0, policy_version 359802 (0.00083) [2022-07-09 18:03:31,910][26022] Updated weights on worker 0-0, policy_version 359812 (0.00082) [2022-07-09 18:03:32,718][25689] Fps is (10 sec: 5768.3, 60 sec: 5661.6, 300 sec: 5660.8). Total num frames: 368451584. Throughput: 0: 5926.4. Samples: 368452800. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:32,719][25689] Avg episode reward: [(0, '-46.015')] [2022-07-09 18:03:33,882][26022] Updated weights on worker 0-0, policy_version 359822 (0.00125) [2022-07-09 18:03:35,383][26022] Updated weights on worker 0-0, policy_version 359832 (0.00080) [2022-07-09 18:03:37,475][26022] Updated weights on worker 0-0, policy_version 359842 (0.00084) [2022-07-09 18:03:37,743][25689] Fps is (10 sec: 5479.2, 60 sec: 5632.4, 300 sec: 5660.7). Total num frames: 368479232. Throughput: 0: 5923.0. Samples: 368487336. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:37,743][25689] Avg episode reward: [(0, '-45.976')] [2022-07-09 18:03:39,021][26022] Updated weights on worker 0-0, policy_version 359852 (0.00987) [2022-07-09 18:03:41,098][26022] Updated weights on worker 0-0, policy_version 359862 (0.00092) [2022-07-09 18:03:42,574][26022] Updated weights on worker 0-0, policy_version 359872 (0.00088) [2022-07-09 18:03:42,772][25689] Fps is (10 sec: 5703.6, 60 sec: 5665.3, 300 sec: 5664.8). Total num frames: 368508928. Throughput: 0: 5078.5. Samples: 368504436. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:42,773][25689] Avg episode reward: [(0, '-46.617')] [2022-07-09 18:03:44,479][26022] Updated weights on worker 0-0, policy_version 359882 (0.00092) [2022-07-09 18:03:46,295][26022] Updated weights on worker 0-0, policy_version 359892 (0.00093) [2022-07-09 18:03:47,888][25689] Fps is (10 sec: 5853.8, 60 sec: 5677.4, 300 sec: 5662.7). Total num frames: 368538624. Throughput: 0: 5894.3. Samples: 368538484. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:47,889][25689] Avg episode reward: [(0, '-47.262')] [2022-07-09 18:03:47,980][26022] Updated weights on worker 0-0, policy_version 359902 (0.00087) [2022-07-09 18:03:49,990][26022] Updated weights on worker 0-0, policy_version 359912 (0.00079) [2022-07-09 18:03:51,498][26022] Updated weights on worker 0-0, policy_version 359922 (0.00086) [2022-07-09 18:03:52,898][25689] Fps is (10 sec: 5562.2, 60 sec: 5610.0, 300 sec: 5666.3). Total num frames: 368565248. Throughput: 0: 5945.4. Samples: 368572776. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:52,899][25689] Avg episode reward: [(0, '-46.344')] [2022-07-09 18:03:53,474][26022] Updated weights on worker 0-0, policy_version 359932 (0.00082) [2022-07-09 18:03:55,303][26022] Updated weights on worker 0-0, policy_version 359942 (0.00090) [2022-07-09 18:03:57,067][26022] Updated weights on worker 0-0, policy_version 359952 (0.00093) [2022-07-09 18:03:57,914][25689] Fps is (10 sec: 5821.7, 60 sec: 5697.7, 300 sec: 5676.4). Total num frames: 368596992. Throughput: 0: 5096.7. Samples: 368590146. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:03:57,915][25689] Avg episode reward: [(0, '-45.769')] [2022-07-09 18:03:58,915][26022] Updated weights on worker 0-0, policy_version 359962 (0.00082) [2022-07-09 18:04:00,647][26022] Updated weights on worker 0-0, policy_version 359972 (0.00097) [2022-07-09 18:04:02,704][26022] Updated weights on worker 0-0, policy_version 359982 (0.00091) [2022-07-09 18:04:02,959][25689] Fps is (10 sec: 5699.6, 60 sec: 5679.5, 300 sec: 5666.0). Total num frames: 368622592. Throughput: 0: 5945.7. Samples: 368624460. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:04:02,959][25689] Avg episode reward: [(0, '-45.810')] [2022-07-09 18:04:04,588][26022] Updated weights on worker 0-0, policy_version 359992 (0.00094) [2022-07-09 18:04:06,309][26022] Updated weights on worker 0-0, policy_version 360002 (0.00082) [2022-07-09 18:04:07,990][25689] Fps is (10 sec: 5386.3, 60 sec: 5651.9, 300 sec: 5669.1). Total num frames: 368651264. Throughput: 0: 5879.6. Samples: 368656676. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:04:07,991][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 18:04:08,150][26022] Updated weights on worker 0-0, policy_version 360012 (0.00085) [2022-07-09 18:04:09,963][26022] Updated weights on worker 0-0, policy_version 360022 (0.00082) [2022-07-09 18:04:11,697][26022] Updated weights on worker 0-0, policy_version 360032 (0.00087) [2022-07-09 18:04:13,016][25689] Fps is (10 sec: 5701.9, 60 sec: 5688.0, 300 sec: 5672.4). Total num frames: 368679936. Throughput: 0: 5025.6. Samples: 368673882. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:04:13,017][25689] Avg episode reward: [(0, '-45.523')] [2022-07-09 18:04:13,527][26022] Updated weights on worker 0-0, policy_version 360042 (0.00084) [2022-07-09 18:04:15,374][26022] Updated weights on worker 0-0, policy_version 360052 (0.00087) [2022-07-09 18:04:16,971][26022] Updated weights on worker 0-0, policy_version 360062 (0.00085) [2022-07-09 18:04:18,115][25689] Fps is (10 sec: 5663.8, 60 sec: 5662.8, 300 sec: 5667.6). Total num frames: 368708608. Throughput: 0: 5854.6. Samples: 368708412. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:04:18,117][25689] Avg episode reward: [(0, '-45.538')] [2022-07-09 18:04:18,899][26022] Updated weights on worker 0-0, policy_version 360072 (0.00090) [2022-07-09 18:04:20,518][26022] Updated weights on worker 0-0, policy_version 360082 (0.00082) [2022-07-09 18:04:22,590][26022] Updated weights on worker 0-0, policy_version 360092 (0.00087) [2022-07-09 18:04:23,139][25689] Fps is (10 sec: 5766.0, 60 sec: 5700.3, 300 sec: 5668.5). Total num frames: 368738304. Throughput: 0: 5868.3. Samples: 368742882. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-09 18:04:23,140][25689] Avg episode reward: [(0, '-45.472')] [2022-07-09 18:04:24,230][26022] Updated weights on worker 0-0, policy_version 360102 (0.00092) [2022-07-09 18:04:25,991][26022] Updated weights on worker 0-0, policy_version 360112 (0.00093) [2022-07-09 18:04:27,894][26022] Updated weights on worker 0-0, policy_version 360122 (0.00083) [2022-07-09 18:04:28,239][25689] Fps is (10 sec: 5765.3, 60 sec: 5661.8, 300 sec: 5667.3). Total num frames: 368766976. Throughput: 0: 5102.8. Samples: 368760000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:28,240][25689] Avg episode reward: [(0, '-45.799')] [2022-07-09 18:04:29,573][26022] Updated weights on worker 0-0, policy_version 360132 (0.00092) [2022-07-09 18:04:31,511][26022] Updated weights on worker 0-0, policy_version 360142 (0.00089) [2022-07-09 18:04:33,161][26022] Updated weights on worker 0-0, policy_version 360152 (0.00081) [2022-07-09 18:04:33,260][25689] Fps is (10 sec: 5665.3, 60 sec: 5683.1, 300 sec: 5668.2). Total num frames: 368795648. Throughput: 0: 5926.9. Samples: 368793870. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:33,261][25689] Avg episode reward: [(0, '-45.786')] [2022-07-09 18:04:35,156][26022] Updated weights on worker 0-0, policy_version 360162 (0.00085) [2022-07-09 18:04:36,776][26022] Updated weights on worker 0-0, policy_version 360172 (0.00086) [2022-07-09 18:04:38,268][25689] Fps is (10 sec: 5616.0, 60 sec: 5684.7, 300 sec: 5665.0). Total num frames: 368823296. Throughput: 0: 5936.4. Samples: 368828046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:38,268][25689] Avg episode reward: [(0, '-45.963')] [2022-07-09 18:04:38,772][26022] Updated weights on worker 0-0, policy_version 360182 (0.01141) [2022-07-09 18:04:40,400][26022] Updated weights on worker 0-0, policy_version 360192 (0.00086) [2022-07-09 18:04:42,335][26022] Updated weights on worker 0-0, policy_version 360202 (0.00087) [2022-07-09 18:04:43,295][25689] Fps is (10 sec: 5714.8, 60 sec: 5684.9, 300 sec: 5668.9). Total num frames: 368852992. Throughput: 0: 5078.8. Samples: 368845252. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:43,295][25689] Avg episode reward: [(0, '-46.330')] [2022-07-09 18:04:44,099][26022] Updated weights on worker 0-0, policy_version 360212 (0.00080) [2022-07-09 18:04:45,883][26022] Updated weights on worker 0-0, policy_version 360222 (0.00112) [2022-07-09 18:04:47,633][26022] Updated weights on worker 0-0, policy_version 360232 (0.00093) [2022-07-09 18:04:48,358][25689] Fps is (10 sec: 5784.2, 60 sec: 5672.9, 300 sec: 5668.2). Total num frames: 368881664. Throughput: 0: 5941.7. Samples: 368879544. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:48,359][25689] Avg episode reward: [(0, '-45.864')] [2022-07-09 18:04:49,403][26022] Updated weights on worker 0-0, policy_version 360242 (0.00085) [2022-07-09 18:04:51,338][26022] Updated weights on worker 0-0, policy_version 360252 (0.00088) [2022-07-09 18:04:53,045][26022] Updated weights on worker 0-0, policy_version 360262 (0.00095) [2022-07-09 18:04:53,365][25689] Fps is (10 sec: 5694.5, 60 sec: 5707.0, 300 sec: 5668.3). Total num frames: 368910336. Throughput: 0: 5974.7. Samples: 368913988. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:53,365][25689] Avg episode reward: [(0, '-46.279')] [2022-07-09 18:04:54,695][26022] Updated weights on worker 0-0, policy_version 360272 (0.00081) [2022-07-09 18:04:56,297][26022] Updated weights on worker 0-0, policy_version 360282 (0.00087) [2022-07-09 18:04:58,381][25689] Fps is (10 sec: 5619.4, 60 sec: 5639.4, 300 sec: 5665.2). Total num frames: 368937984. Throughput: 0: 5983.5. Samples: 368948394. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:04:58,381][25689] Avg episode reward: [(0, '-45.785')] [2022-07-09 18:04:58,637][26022] Updated weights on worker 0-0, policy_version 360292 (0.00088) [2022-07-09 18:04:59,983][26022] Updated weights on worker 0-0, policy_version 360302 (0.00086) [2022-07-09 18:05:02,554][26022] Updated weights on worker 0-0, policy_version 360312 (0.00091) [2022-07-09 18:05:03,447][25689] Fps is (10 sec: 5586.1, 60 sec: 5688.1, 300 sec: 5668.9). Total num frames: 368966656. Throughput: 0: 5939.1. Samples: 368964938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:03,447][25689] Avg episode reward: [(0, '-44.731')] [2022-07-09 18:05:03,973][26022] Updated weights on worker 0-0, policy_version 360322 (0.00087) [2022-07-09 18:05:06,091][26022] Updated weights on worker 0-0, policy_version 360332 (0.00096) [2022-07-09 18:05:07,782][26022] Updated weights on worker 0-0, policy_version 360342 (0.00808) [2022-07-09 18:05:08,503][25689] Fps is (10 sec: 5462.9, 60 sec: 5652.0, 300 sec: 5664.5). Total num frames: 368993280. Throughput: 0: 5838.4. Samples: 368997156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:08,503][25689] Avg episode reward: [(0, '-44.849')] [2022-07-09 18:05:09,631][26022] Updated weights on worker 0-0, policy_version 360352 (0.00092) [2022-07-09 18:05:11,418][26022] Updated weights on worker 0-0, policy_version 360362 (0.00089) [2022-07-09 18:05:13,253][26022] Updated weights on worker 0-0, policy_version 360372 (0.00086) [2022-07-09 18:05:13,511][25689] Fps is (10 sec: 5596.3, 60 sec: 5670.6, 300 sec: 5665.5). Total num frames: 369022976. Throughput: 0: 5828.0. Samples: 369031398. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:13,512][25689] Avg episode reward: [(0, '-45.301')] [2022-07-09 18:05:14,909][26022] Updated weights on worker 0-0, policy_version 360382 (0.00085) [2022-07-09 18:05:16,891][26022] Updated weights on worker 0-0, policy_version 360392 (0.00082) [2022-07-09 18:05:18,507][26022] Updated weights on worker 0-0, policy_version 360402 (0.00059) [2022-07-09 18:05:18,524][25689] Fps is (10 sec: 5824.7, 60 sec: 5678.7, 300 sec: 5669.4). Total num frames: 369051648. Throughput: 0: 4976.0. Samples: 369048624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:18,525][25689] Avg episode reward: [(0, '-45.111')] [2022-07-09 18:05:19,874][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:05:19,886][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000360409_369058816.pth [2022-07-09 18:05:19,886][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000358415_367016960.pth [2022-07-09 18:05:20,129][26022] Updated weights on worker 0-0, policy_version 360412 (0.00081) [2022-07-09 18:05:22,170][26022] Updated weights on worker 0-0, policy_version 360422 (0.00099) [2022-07-09 18:05:23,543][25689] Fps is (10 sec: 5716.1, 60 sec: 5662.1, 300 sec: 5671.1). Total num frames: 369080320. Throughput: 0: 5878.3. Samples: 369083066. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:23,544][25689] Avg episode reward: [(0, '-45.065')] [2022-07-09 18:05:23,938][26022] Updated weights on worker 0-0, policy_version 360432 (0.00090) [2022-07-09 18:05:25,743][26022] Updated weights on worker 0-0, policy_version 360442 (0.00101) [2022-07-09 18:05:27,554][26022] Updated weights on worker 0-0, policy_version 360452 (0.00094) [2022-07-09 18:05:28,613][25689] Fps is (10 sec: 5582.2, 60 sec: 5648.0, 300 sec: 5667.1). Total num frames: 369107968. Throughput: 0: 5989.5. Samples: 369117604. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:28,613][25689] Avg episode reward: [(0, '-46.310')] [2022-07-09 18:05:29,158][26022] Updated weights on worker 0-0, policy_version 360462 (0.00093) [2022-07-09 18:05:31,204][26022] Updated weights on worker 0-0, policy_version 360472 (0.00099) [2022-07-09 18:05:32,968][26022] Updated weights on worker 0-0, policy_version 360482 (0.00090) [2022-07-09 18:05:33,642][25689] Fps is (10 sec: 5576.5, 60 sec: 5647.3, 300 sec: 5670.4). Total num frames: 369136640. Throughput: 0: 5130.2. Samples: 369134676. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:33,644][25689] Avg episode reward: [(0, '-46.572')] [2022-07-09 18:05:34,620][26022] Updated weights on worker 0-0, policy_version 360492 (0.00083) [2022-07-09 18:05:36,457][26022] Updated weights on worker 0-0, policy_version 360502 (0.00086) [2022-07-09 18:05:38,223][26022] Updated weights on worker 0-0, policy_version 360512 (0.00093) [2022-07-09 18:05:38,701][25689] Fps is (10 sec: 5786.0, 60 sec: 5676.4, 300 sec: 5665.9). Total num frames: 369166336. Throughput: 0: 5965.3. Samples: 369168986. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:38,701][25689] Avg episode reward: [(0, '-47.567')] [2022-07-09 18:05:40,038][26022] Updated weights on worker 0-0, policy_version 360522 (0.00090) [2022-07-09 18:05:41,949][26022] Updated weights on worker 0-0, policy_version 360532 (0.00094) [2022-07-09 18:05:43,576][26022] Updated weights on worker 0-0, policy_version 360542 (0.00091) [2022-07-09 18:05:43,710][25689] Fps is (10 sec: 5899.3, 60 sec: 5678.0, 300 sec: 5670.9). Total num frames: 369196032. Throughput: 0: 5956.6. Samples: 369203194. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:43,712][25689] Avg episode reward: [(0, '-47.902')] [2022-07-09 18:05:45,503][26022] Updated weights on worker 0-0, policy_version 360552 (0.00086) [2022-07-09 18:05:47,167][26022] Updated weights on worker 0-0, policy_version 360562 (0.00092) [2022-07-09 18:05:48,739][25689] Fps is (10 sec: 5712.6, 60 sec: 5664.4, 300 sec: 5671.1). Total num frames: 369223680. Throughput: 0: 5103.2. Samples: 369220310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:48,739][25689] Avg episode reward: [(0, '-48.467')] [2022-07-09 18:05:48,949][26022] Updated weights on worker 0-0, policy_version 360572 (0.00079) [2022-07-09 18:05:51,041][26022] Updated weights on worker 0-0, policy_version 360582 (0.01447) [2022-07-09 18:05:52,499][26022] Updated weights on worker 0-0, policy_version 360592 (0.00087) [2022-07-09 18:05:53,742][25689] Fps is (10 sec: 5614.0, 60 sec: 5664.7, 300 sec: 5671.5). Total num frames: 369252352. Throughput: 0: 5965.3. Samples: 369254576. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:53,742][25689] Avg episode reward: [(0, '-47.409')] [2022-07-09 18:05:54,486][26022] Updated weights on worker 0-0, policy_version 360602 (0.00085) [2022-07-09 18:05:56,155][26022] Updated weights on worker 0-0, policy_version 360612 (0.00083) [2022-07-09 18:05:57,931][26022] Updated weights on worker 0-0, policy_version 360622 (0.00080) [2022-07-09 18:05:58,779][25689] Fps is (10 sec: 5711.2, 60 sec: 5679.6, 300 sec: 5671.1). Total num frames: 369281024. Throughput: 0: 5981.6. Samples: 369289088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:05:58,780][25689] Avg episode reward: [(0, '-47.522')] [2022-07-09 18:05:59,886][26022] Updated weights on worker 0-0, policy_version 360632 (0.00092) [2022-07-09 18:06:01,484][26022] Updated weights on worker 0-0, policy_version 360642 (0.00085) [2022-07-09 18:06:03,797][25689] Fps is (10 sec: 5397.4, 60 sec: 5633.3, 300 sec: 5666.2). Total num frames: 369306624. Throughput: 0: 5132.5. Samples: 369306292. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:03,798][25689] Avg episode reward: [(0, '-48.435')] [2022-07-09 18:06:03,809][26022] Updated weights on worker 0-0, policy_version 360652 (0.00081) [2022-07-09 18:06:05,679][26022] Updated weights on worker 0-0, policy_version 360662 (0.00088) [2022-07-09 18:06:07,348][26022] Updated weights on worker 0-0, policy_version 360672 (0.00089) [2022-07-09 18:06:08,868][25689] Fps is (10 sec: 5480.9, 60 sec: 5682.7, 300 sec: 5668.4). Total num frames: 369336320. Throughput: 0: 5866.5. Samples: 369338400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:08,869][25689] Avg episode reward: [(0, '-48.444')] [2022-07-09 18:06:09,155][26022] Updated weights on worker 0-0, policy_version 360682 (0.00081) [2022-07-09 18:06:10,915][26022] Updated weights on worker 0-0, policy_version 360692 (0.00085) [2022-07-09 18:06:12,473][26022] Updated weights on worker 0-0, policy_version 360702 (0.00083) [2022-07-09 18:06:13,944][25689] Fps is (10 sec: 5752.2, 60 sec: 5659.4, 300 sec: 5668.0). Total num frames: 369364992. Throughput: 0: 5872.5. Samples: 369373214. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:13,945][25689] Avg episode reward: [(0, '-47.458')] [2022-07-09 18:06:14,701][26022] Updated weights on worker 0-0, policy_version 360712 (0.00091) [2022-07-09 18:06:16,106][26022] Updated weights on worker 0-0, policy_version 360722 (0.00081) [2022-07-09 18:06:18,086][26022] Updated weights on worker 0-0, policy_version 360732 (0.00093) [2022-07-09 18:06:18,969][25689] Fps is (10 sec: 5778.4, 60 sec: 5675.2, 300 sec: 5675.1). Total num frames: 369394688. Throughput: 0: 5012.2. Samples: 369390284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:18,970][25689] Avg episode reward: [(0, '-47.422')] [2022-07-09 18:06:19,809][26022] Updated weights on worker 0-0, policy_version 360742 (0.00089) [2022-07-09 18:06:21,513][26022] Updated weights on worker 0-0, policy_version 360752 (0.00085) [2022-07-09 18:06:23,398][26022] Updated weights on worker 0-0, policy_version 360762 (0.00086) [2022-07-09 18:06:24,068][25689] Fps is (10 sec: 5866.3, 60 sec: 5684.6, 300 sec: 5674.6). Total num frames: 369424384. Throughput: 0: 5844.7. Samples: 369424772. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:24,069][25689] Avg episode reward: [(0, '-47.872')] [2022-07-09 18:06:25,293][26022] Updated weights on worker 0-0, policy_version 360772 (0.00096) [2022-07-09 18:06:26,882][26022] Updated weights on worker 0-0, policy_version 360782 (0.00111) [2022-07-09 18:06:28,995][26022] Updated weights on worker 0-0, policy_version 360792 (0.00097) [2022-07-09 18:06:29,130][25689] Fps is (10 sec: 5643.8, 60 sec: 5685.4, 300 sec: 5673.6). Total num frames: 369452032. Throughput: 0: 5933.7. Samples: 369458626. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:29,130][25689] Avg episode reward: [(0, '-46.726')] [2022-07-09 18:06:30,512][26022] Updated weights on worker 0-0, policy_version 360802 (0.00092) [2022-07-09 18:06:32,409][26022] Updated weights on worker 0-0, policy_version 360812 (0.00091) [2022-07-09 18:06:34,205][25689] Fps is (10 sec: 5555.9, 60 sec: 5681.1, 300 sec: 5665.6). Total num frames: 369480704. Throughput: 0: 5055.1. Samples: 369475636. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:34,206][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 18:06:34,285][26022] Updated weights on worker 0-0, policy_version 360822 (0.00088) [2022-07-09 18:06:35,966][26022] Updated weights on worker 0-0, policy_version 360832 (0.00079) [2022-07-09 18:06:37,955][26022] Updated weights on worker 0-0, policy_version 360842 (0.00085) [2022-07-09 18:06:39,221][25689] Fps is (10 sec: 5682.4, 60 sec: 5668.2, 300 sec: 5665.7). Total num frames: 369509376. Throughput: 0: 5900.3. Samples: 369509776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:39,222][25689] Avg episode reward: [(0, '-47.605')] [2022-07-09 18:06:39,592][26022] Updated weights on worker 0-0, policy_version 360852 (0.00085) [2022-07-09 18:06:41,477][26022] Updated weights on worker 0-0, policy_version 360862 (0.00084) [2022-07-09 18:06:43,356][26022] Updated weights on worker 0-0, policy_version 360872 (0.00099) [2022-07-09 18:06:44,236][25689] Fps is (10 sec: 5717.0, 60 sec: 5650.7, 300 sec: 5673.4). Total num frames: 369538048. Throughput: 0: 5926.2. Samples: 369544288. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:44,236][25689] Avg episode reward: [(0, '-47.545')] [2022-07-09 18:06:45,111][26022] Updated weights on worker 0-0, policy_version 360882 (0.00086) [2022-07-09 18:06:46,827][26022] Updated weights on worker 0-0, policy_version 360892 (0.00086) [2022-07-09 18:06:48,651][26022] Updated weights on worker 0-0, policy_version 360902 (0.00086) [2022-07-09 18:06:49,366][25689] Fps is (10 sec: 5753.3, 60 sec: 5675.0, 300 sec: 5667.8). Total num frames: 369567744. Throughput: 0: 5076.7. Samples: 369561360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:49,368][25689] Avg episode reward: [(0, '-47.252')] [2022-07-09 18:06:50,424][26022] Updated weights on worker 0-0, policy_version 360912 (0.00094) [2022-07-09 18:06:52,333][26022] Updated weights on worker 0-0, policy_version 360922 (0.00113) [2022-07-09 18:06:53,947][26022] Updated weights on worker 0-0, policy_version 360932 (0.00085) [2022-07-09 18:06:54,407][25689] Fps is (10 sec: 5638.1, 60 sec: 5654.7, 300 sec: 5667.1). Total num frames: 369595392. Throughput: 0: 5936.7. Samples: 369595566. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:54,407][25689] Avg episode reward: [(0, '-47.397')] [2022-07-09 18:06:55,767][26022] Updated weights on worker 0-0, policy_version 360942 (0.00080) [2022-07-09 18:06:57,551][26022] Updated weights on worker 0-0, policy_version 360952 (0.00084) [2022-07-09 18:06:59,352][26022] Updated weights on worker 0-0, policy_version 360962 (0.00089) [2022-07-09 18:06:59,448][25689] Fps is (10 sec: 5687.8, 60 sec: 5671.2, 300 sec: 5673.4). Total num frames: 369625088. Throughput: 0: 5947.8. Samples: 369630082. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:06:59,449][25689] Avg episode reward: [(0, '-48.187')] [2022-07-09 18:07:01,146][26022] Updated weights on worker 0-0, policy_version 360972 (0.01683) [2022-07-09 18:07:03,531][26022] Updated weights on worker 0-0, policy_version 360982 (0.00093) [2022-07-09 18:07:04,518][25689] Fps is (10 sec: 5570.1, 60 sec: 5683.2, 300 sec: 5669.7). Total num frames: 369651712. Throughput: 0: 5068.9. Samples: 369647096. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:04,523][25689] Avg episode reward: [(0, '-47.836')] [2022-07-09 18:07:05,092][26022] Updated weights on worker 0-0, policy_version 360992 (0.00091) [2022-07-09 18:07:07,116][26022] Updated weights on worker 0-0, policy_version 361002 (0.00102) [2022-07-09 18:07:08,763][26022] Updated weights on worker 0-0, policy_version 361012 (0.00095) [2022-07-09 18:07:09,617][25689] Fps is (10 sec: 5437.7, 60 sec: 5663.7, 300 sec: 5669.3). Total num frames: 369680384. Throughput: 0: 5798.7. Samples: 369678790. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:09,618][25689] Avg episode reward: [(0, '-47.891')] [2022-07-09 18:07:10,647][26022] Updated weights on worker 0-0, policy_version 361022 (0.00086) [2022-07-09 18:07:12,500][26022] Updated weights on worker 0-0, policy_version 361032 (0.00100) [2022-07-09 18:07:14,080][26022] Updated weights on worker 0-0, policy_version 361042 (0.00082) [2022-07-09 18:07:14,648][25689] Fps is (10 sec: 5762.0, 60 sec: 5684.8, 300 sec: 5672.2). Total num frames: 369710080. Throughput: 0: 5814.8. Samples: 369713262. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:14,648][25689] Avg episode reward: [(0, '-48.840')] [2022-07-09 18:07:16,043][26022] Updated weights on worker 0-0, policy_version 361052 (0.00084) [2022-07-09 18:07:17,774][26022] Updated weights on worker 0-0, policy_version 361062 (0.00080) [2022-07-09 18:07:19,532][26022] Updated weights on worker 0-0, policy_version 361072 (0.00090) [2022-07-09 18:07:19,673][25689] Fps is (10 sec: 5804.6, 60 sec: 5667.9, 300 sec: 5672.0). Total num frames: 369738752. Throughput: 0: 5818.3. Samples: 369747754. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:19,673][25689] Avg episode reward: [(0, '-48.840')] [2022-07-09 18:07:19,890][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:07:19,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000361076_369741824.pth [2022-07-09 18:07:19,904][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000359080_367697920.pth [2022-07-09 18:07:21,376][26022] Updated weights on worker 0-0, policy_version 361082 (0.00053) [2022-07-09 18:07:23,006][26022] Updated weights on worker 0-0, policy_version 361092 (0.00059) [2022-07-09 18:07:24,708][25689] Fps is (10 sec: 5598.5, 60 sec: 5640.2, 300 sec: 5665.9). Total num frames: 369766400. Throughput: 0: 5838.4. Samples: 369764972. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:24,708][25689] Avg episode reward: [(0, '-48.162')] [2022-07-09 18:07:24,996][26022] Updated weights on worker 0-0, policy_version 361102 (0.00085) [2022-07-09 18:07:26,769][26022] Updated weights on worker 0-0, policy_version 361112 (0.00088) [2022-07-09 18:07:28,526][26022] Updated weights on worker 0-0, policy_version 361122 (0.00083) [2022-07-09 18:07:29,816][25689] Fps is (10 sec: 5552.2, 60 sec: 5652.6, 300 sec: 5665.4). Total num frames: 369795072. Throughput: 0: 5948.1. Samples: 369798936. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:29,817][25689] Avg episode reward: [(0, '-48.588')] [2022-07-09 18:07:30,553][26022] Updated weights on worker 0-0, policy_version 361132 (0.00056) [2022-07-09 18:07:32,049][26022] Updated weights on worker 0-0, policy_version 361142 (0.00094) [2022-07-09 18:07:34,139][26022] Updated weights on worker 0-0, policy_version 361152 (0.00087) [2022-07-09 18:07:34,859][25689] Fps is (10 sec: 5749.5, 60 sec: 5672.6, 300 sec: 5666.0). Total num frames: 369824768. Throughput: 0: 5920.3. Samples: 369832922. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:34,860][25689] Avg episode reward: [(0, '-49.259')] [2022-07-09 18:07:35,781][26022] Updated weights on worker 0-0, policy_version 361162 (0.00085) [2022-07-09 18:07:37,615][26022] Updated weights on worker 0-0, policy_version 361172 (0.00083) [2022-07-09 18:07:39,655][26022] Updated weights on worker 0-0, policy_version 361182 (0.00084) [2022-07-09 18:07:39,886][25689] Fps is (10 sec: 5593.1, 60 sec: 5637.8, 300 sec: 5662.4). Total num frames: 369851392. Throughput: 0: 5059.0. Samples: 369850014. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:39,887][25689] Avg episode reward: [(0, '-49.257')] [2022-07-09 18:07:41,158][26022] Updated weights on worker 0-0, policy_version 361192 (0.00094) [2022-07-09 18:07:43,108][26022] Updated weights on worker 0-0, policy_version 361202 (0.00064) [2022-07-09 18:07:44,771][26022] Updated weights on worker 0-0, policy_version 361212 (0.00082) [2022-07-09 18:07:44,904][25689] Fps is (10 sec: 5607.0, 60 sec: 5654.4, 300 sec: 5666.8). Total num frames: 369881088. Throughput: 0: 5910.1. Samples: 369884334. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:44,904][25689] Avg episode reward: [(0, '-49.116')] [2022-07-09 18:07:46,705][26022] Updated weights on worker 0-0, policy_version 361222 (0.00088) [2022-07-09 18:07:48,443][26022] Updated weights on worker 0-0, policy_version 361232 (0.00084) [2022-07-09 18:07:50,042][25689] Fps is (10 sec: 5747.4, 60 sec: 5636.8, 300 sec: 5657.5). Total num frames: 369909760. Throughput: 0: 5914.8. Samples: 369918564. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:50,042][25689] Avg episode reward: [(0, '-48.405')] [2022-07-09 18:07:50,264][26022] Updated weights on worker 0-0, policy_version 361242 (0.00086) [2022-07-09 18:07:51,897][26022] Updated weights on worker 0-0, policy_version 361252 (0.00087) [2022-07-09 18:07:53,784][26022] Updated weights on worker 0-0, policy_version 361262 (0.00091) [2022-07-09 18:07:55,060][25689] Fps is (10 sec: 5746.9, 60 sec: 5672.6, 300 sec: 5668.4). Total num frames: 369939456. Throughput: 0: 5090.6. Samples: 369935758. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:07:55,061][25689] Avg episode reward: [(0, '-48.312')] [2022-07-09 18:07:55,538][26022] Updated weights on worker 0-0, policy_version 361272 (0.00087) [2022-07-09 18:07:57,403][26022] Updated weights on worker 0-0, policy_version 361282 (0.00084) [2022-07-09 18:07:59,265][26022] Updated weights on worker 0-0, policy_version 361292 (0.00086) [2022-07-09 18:08:00,073][25689] Fps is (10 sec: 5818.3, 60 sec: 5658.4, 300 sec: 5675.6). Total num frames: 369968128. Throughput: 0: 5933.8. Samples: 369969802. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:00,074][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 18:08:01,120][26022] Updated weights on worker 0-0, policy_version 361302 (0.00104) [2022-07-09 18:08:03,256][26022] Updated weights on worker 0-0, policy_version 361312 (0.00080) [2022-07-09 18:08:05,099][25689] Fps is (10 sec: 5304.5, 60 sec: 5628.7, 300 sec: 5656.4). Total num frames: 369992704. Throughput: 0: 5802.3. Samples: 370001510. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:05,099][25689] Avg episode reward: [(0, '-47.732')] [2022-07-09 18:08:05,264][26022] Updated weights on worker 0-0, policy_version 361322 (0.00095) [2022-07-09 18:08:06,933][26022] Updated weights on worker 0-0, policy_version 361332 (0.00088) [2022-07-09 18:08:08,941][26022] Updated weights on worker 0-0, policy_version 361342 (0.00090) [2022-07-09 18:08:10,179][25689] Fps is (10 sec: 5370.5, 60 sec: 5647.4, 300 sec: 5666.1). Total num frames: 370022400. Throughput: 0: 4941.1. Samples: 370018064. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:10,179][25689] Avg episode reward: [(0, '-47.471')] [2022-07-09 18:08:10,454][26022] Updated weights on worker 0-0, policy_version 361352 (0.00085) [2022-07-09 18:08:12,626][26022] Updated weights on worker 0-0, policy_version 361362 (0.00090) [2022-07-09 18:08:14,179][26022] Updated weights on worker 0-0, policy_version 361372 (0.00084) [2022-07-09 18:08:15,186][25689] Fps is (10 sec: 5786.0, 60 sec: 5632.6, 300 sec: 5662.7). Total num frames: 370051072. Throughput: 0: 5784.8. Samples: 370052184. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:15,188][25689] Avg episode reward: [(0, '-47.786')] [2022-07-09 18:08:16,128][26022] Updated weights on worker 0-0, policy_version 361382 (0.00091) [2022-07-09 18:08:17,747][26022] Updated weights on worker 0-0, policy_version 361392 (0.00082) [2022-07-09 18:08:19,578][26022] Updated weights on worker 0-0, policy_version 361402 (0.00092) [2022-07-09 18:08:20,218][25689] Fps is (10 sec: 5508.1, 60 sec: 5598.2, 300 sec: 5659.9). Total num frames: 370077696. Throughput: 0: 5786.5. Samples: 370086368. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:20,218][25689] Avg episode reward: [(0, '-48.383')] [2022-07-09 18:08:21,287][26022] Updated weights on worker 0-0, policy_version 361412 (0.00089) [2022-07-09 18:08:23,239][26022] Updated weights on worker 0-0, policy_version 361422 (0.00094) [2022-07-09 18:08:24,964][26022] Updated weights on worker 0-0, policy_version 361432 (0.00095) [2022-07-09 18:08:25,229][25689] Fps is (10 sec: 5608.0, 60 sec: 5634.2, 300 sec: 5657.2). Total num frames: 370107392. Throughput: 0: 5071.1. Samples: 370103594. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:25,231][25689] Avg episode reward: [(0, '-48.793')] [2022-07-09 18:08:26,785][26022] Updated weights on worker 0-0, policy_version 361442 (0.00091) [2022-07-09 18:08:28,438][26022] Updated weights on worker 0-0, policy_version 361452 (0.00089) [2022-07-09 18:08:30,319][25689] Fps is (10 sec: 5880.1, 60 sec: 5653.0, 300 sec: 5663.7). Total num frames: 370137088. Throughput: 0: 5947.3. Samples: 370137840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:30,319][25689] Avg episode reward: [(0, '-48.127')] [2022-07-09 18:08:30,320][26022] Updated weights on worker 0-0, policy_version 361462 (0.00083) [2022-07-09 18:08:32,239][26022] Updated weights on worker 0-0, policy_version 361472 (0.00053) [2022-07-09 18:08:33,894][26022] Updated weights on worker 0-0, policy_version 361482 (0.00083) [2022-07-09 18:08:35,368][25689] Fps is (10 sec: 5655.6, 60 sec: 5618.5, 300 sec: 5663.2). Total num frames: 370164736. Throughput: 0: 5929.5. Samples: 370171854. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:35,369][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 18:08:35,875][26022] Updated weights on worker 0-0, policy_version 361492 (0.00095) [2022-07-09 18:08:37,578][26022] Updated weights on worker 0-0, policy_version 361502 (0.00088) [2022-07-09 18:08:39,533][26022] Updated weights on worker 0-0, policy_version 361512 (0.00086) [2022-07-09 18:08:40,416][25689] Fps is (10 sec: 5577.5, 60 sec: 5650.3, 300 sec: 5659.4). Total num frames: 370193408. Throughput: 0: 5069.9. Samples: 370188772. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:40,417][25689] Avg episode reward: [(0, '-47.150')] [2022-07-09 18:08:41,231][26022] Updated weights on worker 0-0, policy_version 361522 (0.00083) [2022-07-09 18:08:43,134][26022] Updated weights on worker 0-0, policy_version 361532 (0.00095) [2022-07-09 18:08:44,822][26022] Updated weights on worker 0-0, policy_version 361542 (0.00084) [2022-07-09 18:08:45,489][25689] Fps is (10 sec: 5565.1, 60 sec: 5611.5, 300 sec: 5653.3). Total num frames: 370221056. Throughput: 0: 5883.1. Samples: 370222782. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:45,489][25689] Avg episode reward: [(0, '-47.027')] [2022-07-09 18:08:46,702][26022] Updated weights on worker 0-0, policy_version 361552 (0.00086) [2022-07-09 18:08:48,619][26022] Updated weights on worker 0-0, policy_version 361562 (0.00097) [2022-07-09 18:08:50,324][26022] Updated weights on worker 0-0, policy_version 361572 (0.00090) [2022-07-09 18:08:50,541][25689] Fps is (10 sec: 5663.9, 60 sec: 5636.3, 300 sec: 5662.8). Total num frames: 370250752. Throughput: 0: 5875.7. Samples: 370256660. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:50,541][25689] Avg episode reward: [(0, '-46.733')] [2022-07-09 18:08:52,295][26022] Updated weights on worker 0-0, policy_version 361582 (0.00095) [2022-07-09 18:08:53,939][26022] Updated weights on worker 0-0, policy_version 361592 (0.00099) [2022-07-09 18:08:55,547][25689] Fps is (10 sec: 5701.2, 60 sec: 5603.6, 300 sec: 5649.3). Total num frames: 370278400. Throughput: 0: 5037.5. Samples: 370273502. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:08:55,547][25689] Avg episode reward: [(0, '-45.762')] [2022-07-09 18:08:55,815][26022] Updated weights on worker 0-0, policy_version 361602 (0.00091) [2022-07-09 18:08:57,507][26022] Updated weights on worker 0-0, policy_version 361612 (0.00087) [2022-07-09 18:08:59,408][26022] Updated weights on worker 0-0, policy_version 361622 (0.00088) [2022-07-09 18:09:00,598][25689] Fps is (10 sec: 5599.9, 60 sec: 5600.1, 300 sec: 5659.5). Total num frames: 370307072. Throughput: 0: 5904.4. Samples: 370307934. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:00,599][25689] Avg episode reward: [(0, '-46.013')] [2022-07-09 18:09:01,055][26022] Updated weights on worker 0-0, policy_version 361632 (0.00086) [2022-07-09 18:09:03,458][26022] Updated weights on worker 0-0, policy_version 361642 (0.00091) [2022-07-09 18:09:05,019][26022] Updated weights on worker 0-0, policy_version 361652 (0.00087) [2022-07-09 18:09:05,613][25689] Fps is (10 sec: 5493.5, 60 sec: 5634.9, 300 sec: 5652.9). Total num frames: 370333696. Throughput: 0: 5826.2. Samples: 370340028. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:05,613][25689] Avg episode reward: [(0, '-45.715')] [2022-07-09 18:09:07,068][26022] Updated weights on worker 0-0, policy_version 361662 (0.00092) [2022-07-09 18:09:08,770][26022] Updated weights on worker 0-0, policy_version 361672 (0.00091) [2022-07-09 18:09:10,619][26022] Updated weights on worker 0-0, policy_version 361682 (0.00095) [2022-07-09 18:09:10,708][25689] Fps is (10 sec: 5469.4, 60 sec: 5616.6, 300 sec: 5651.6). Total num frames: 370362368. Throughput: 0: 4968.1. Samples: 370356854. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:10,709][25689] Avg episode reward: [(0, '-45.752')] [2022-07-09 18:09:12,102][26022] Updated weights on worker 0-0, policy_version 361692 (0.00081) [2022-07-09 18:09:14,098][26022] Updated weights on worker 0-0, policy_version 361702 (0.00083) [2022-07-09 18:09:15,741][25689] Fps is (10 sec: 5762.7, 60 sec: 5631.2, 300 sec: 5656.3). Total num frames: 370392064. Throughput: 0: 5838.4. Samples: 370391404. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:15,743][25689] Avg episode reward: [(0, '-45.579')] [2022-07-09 18:09:15,925][26022] Updated weights on worker 0-0, policy_version 361712 (0.00090) [2022-07-09 18:09:17,813][26022] Updated weights on worker 0-0, policy_version 361722 (0.00086) [2022-07-09 18:09:19,525][26022] Updated weights on worker 0-0, policy_version 361732 (0.00088) [2022-07-09 18:09:20,142][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:09:20,154][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000361736_370417664.pth [2022-07-09 18:09:20,154][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000359745_368378880.pth [2022-07-09 18:09:20,825][25689] Fps is (10 sec: 5769.5, 60 sec: 5660.1, 300 sec: 5651.7). Total num frames: 370420736. Throughput: 0: 5820.2. Samples: 370425658. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:20,825][25689] Avg episode reward: [(0, '-46.404')] [2022-07-09 18:09:21,325][26022] Updated weights on worker 0-0, policy_version 361742 (0.00090) [2022-07-09 18:09:23,056][26022] Updated weights on worker 0-0, policy_version 361752 (0.00089) [2022-07-09 18:09:24,973][26022] Updated weights on worker 0-0, policy_version 361762 (0.00091) [2022-07-09 18:09:25,847][25689] Fps is (10 sec: 5674.4, 60 sec: 5642.2, 300 sec: 5653.2). Total num frames: 370449408. Throughput: 0: 5081.2. Samples: 370442842. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:25,847][25689] Avg episode reward: [(0, '-47.139')] [2022-07-09 18:09:26,651][26022] Updated weights on worker 0-0, policy_version 361772 (0.00086) [2022-07-09 18:09:28,355][26022] Updated weights on worker 0-0, policy_version 361782 (0.00092) [2022-07-09 18:09:30,252][26022] Updated weights on worker 0-0, policy_version 361792 (0.00097) [2022-07-09 18:09:30,908][25689] Fps is (10 sec: 5687.1, 60 sec: 5627.9, 300 sec: 5652.4). Total num frames: 370478080. Throughput: 0: 5938.2. Samples: 370476806. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-09 18:09:30,909][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 18:09:31,997][26022] Updated weights on worker 0-0, policy_version 361802 (0.00087) [2022-07-09 18:09:33,899][26022] Updated weights on worker 0-0, policy_version 361812 (0.00094) [2022-07-09 18:09:35,513][26022] Updated weights on worker 0-0, policy_version 361822 (0.00097) [2022-07-09 18:09:36,007][25689] Fps is (10 sec: 5845.7, 60 sec: 5674.0, 300 sec: 5661.0). Total num frames: 370508800. Throughput: 0: 5909.9. Samples: 370511172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:09:36,009][25689] Avg episode reward: [(0, '-47.584')] [2022-07-09 18:09:37,511][26022] Updated weights on worker 0-0, policy_version 361832 (0.00827) [2022-07-09 18:09:39,036][26022] Updated weights on worker 0-0, policy_version 361842 (0.00102) [2022-07-09 18:09:41,017][25689] Fps is (10 sec: 5571.3, 60 sec: 5626.9, 300 sec: 5647.6). Total num frames: 370534400. Throughput: 0: 5911.0. Samples: 370545014. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:09:41,018][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 18:09:41,191][26022] Updated weights on worker 0-0, policy_version 361852 (0.00085) [2022-07-09 18:09:42,775][26022] Updated weights on worker 0-0, policy_version 361862 (0.00088) [2022-07-09 18:09:44,603][26022] Updated weights on worker 0-0, policy_version 361872 (0.00094) [2022-07-09 18:09:46,029][25689] Fps is (10 sec: 5517.1, 60 sec: 5666.3, 300 sec: 5652.0). Total num frames: 370564096. Throughput: 0: 5911.1. Samples: 370562144. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:09:46,030][25689] Avg episode reward: [(0, '-47.327')] [2022-07-09 18:09:46,274][26022] Updated weights on worker 0-0, policy_version 361882 (0.00085) [2022-07-09 18:09:48,289][26022] Updated weights on worker 0-0, policy_version 361892 (0.00090) [2022-07-09 18:09:50,230][26022] Updated weights on worker 0-0, policy_version 361902 (0.00099) [2022-07-09 18:09:51,124][25689] Fps is (10 sec: 5775.1, 60 sec: 5645.4, 300 sec: 5650.3). Total num frames: 370592768. Throughput: 0: 5903.5. Samples: 370596150. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:09:51,124][25689] Avg episode reward: [(0, '-47.266')] [2022-07-09 18:09:51,958][26022] Updated weights on worker 0-0, policy_version 361912 (0.00085) [2022-07-09 18:09:53,782][26022] Updated weights on worker 0-0, policy_version 361922 (0.00053) [2022-07-09 18:09:55,561][26022] Updated weights on worker 0-0, policy_version 361932 (0.00098) [2022-07-09 18:09:56,212][25689] Fps is (10 sec: 5631.6, 60 sec: 5654.7, 300 sec: 5652.4). Total num frames: 370621440. Throughput: 0: 5897.6. Samples: 370630334. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:09:56,212][25689] Avg episode reward: [(0, '-47.617')] [2022-07-09 18:09:57,343][26022] Updated weights on worker 0-0, policy_version 361942 (0.00086) [2022-07-09 18:09:59,232][26022] Updated weights on worker 0-0, policy_version 361952 (0.00091) [2022-07-09 18:10:00,923][26022] Updated weights on worker 0-0, policy_version 361962 (0.00095) [2022-07-09 18:10:01,240][25689] Fps is (10 sec: 5668.7, 60 sec: 5656.8, 300 sec: 5653.1). Total num frames: 370650112. Throughput: 0: 5062.6. Samples: 370647394. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:01,240][25689] Avg episode reward: [(0, '-47.810')] [2022-07-09 18:10:03,083][26022] Updated weights on worker 0-0, policy_version 361972 (0.00086) [2022-07-09 18:10:05,059][26022] Updated weights on worker 0-0, policy_version 361982 (0.00086) [2022-07-09 18:10:06,319][25689] Fps is (10 sec: 5471.3, 60 sec: 5650.8, 300 sec: 5652.7). Total num frames: 370676736. Throughput: 0: 5775.3. Samples: 370679320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:06,319][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 18:10:06,735][26022] Updated weights on worker 0-0, policy_version 361992 (0.00091) [2022-07-09 18:10:08,624][26022] Updated weights on worker 0-0, policy_version 362002 (0.00104) [2022-07-09 18:10:10,514][26022] Updated weights on worker 0-0, policy_version 362012 (0.00088) [2022-07-09 18:10:11,384][25689] Fps is (10 sec: 5350.2, 60 sec: 5636.8, 300 sec: 5644.7). Total num frames: 370704384. Throughput: 0: 5754.2. Samples: 370712730. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:11,384][25689] Avg episode reward: [(0, '-47.346')] [2022-07-09 18:10:12,394][26022] Updated weights on worker 0-0, policy_version 362022 (0.00091) [2022-07-09 18:10:14,142][26022] Updated weights on worker 0-0, policy_version 362032 (0.00092) [2022-07-09 18:10:15,834][26022] Updated weights on worker 0-0, policy_version 362042 (0.00082) [2022-07-09 18:10:16,411][25689] Fps is (10 sec: 5681.6, 60 sec: 5637.3, 300 sec: 5647.9). Total num frames: 370734080. Throughput: 0: 4915.5. Samples: 370729624. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:16,412][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 18:10:17,687][26022] Updated weights on worker 0-0, policy_version 362052 (0.00088) [2022-07-09 18:10:19,514][26022] Updated weights on worker 0-0, policy_version 362062 (0.00082) [2022-07-09 18:10:21,290][26022] Updated weights on worker 0-0, policy_version 362072 (0.00091) [2022-07-09 18:10:21,480][25689] Fps is (10 sec: 5679.9, 60 sec: 5621.8, 300 sec: 5643.5). Total num frames: 370761728. Throughput: 0: 5759.3. Samples: 370763962. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:21,480][25689] Avg episode reward: [(0, '-48.142')] [2022-07-09 18:10:23,005][26022] Updated weights on worker 0-0, policy_version 362082 (0.00091) [2022-07-09 18:10:24,893][26022] Updated weights on worker 0-0, policy_version 362092 (0.00095) [2022-07-09 18:10:26,545][25689] Fps is (10 sec: 5658.6, 60 sec: 5634.7, 300 sec: 5650.5). Total num frames: 370791424. Throughput: 0: 5873.5. Samples: 370798122. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:26,546][25689] Avg episode reward: [(0, '-48.392')] [2022-07-09 18:10:26,730][26022] Updated weights on worker 0-0, policy_version 362102 (0.00086) [2022-07-09 18:10:28,475][26022] Updated weights on worker 0-0, policy_version 362112 (0.00382) [2022-07-09 18:10:30,363][26022] Updated weights on worker 0-0, policy_version 362122 (0.00090) [2022-07-09 18:10:31,670][25689] Fps is (10 sec: 5728.1, 60 sec: 5628.8, 300 sec: 5648.7). Total num frames: 370820096. Throughput: 0: 5043.3. Samples: 370815038. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:31,670][25689] Avg episode reward: [(0, '-49.008')] [2022-07-09 18:10:32,123][26022] Updated weights on worker 0-0, policy_version 362132 (0.00094) [2022-07-09 18:10:33,967][26022] Updated weights on worker 0-0, policy_version 362142 (0.00093) [2022-07-09 18:10:35,680][26022] Updated weights on worker 0-0, policy_version 362152 (0.00087) [2022-07-09 18:10:36,698][25689] Fps is (10 sec: 5547.5, 60 sec: 5584.7, 300 sec: 5642.4). Total num frames: 370847744. Throughput: 0: 5900.5. Samples: 370849324. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:36,699][25689] Avg episode reward: [(0, '-48.833')] [2022-07-09 18:10:37,553][26022] Updated weights on worker 0-0, policy_version 362162 (0.00081) [2022-07-09 18:10:39,254][26022] Updated weights on worker 0-0, policy_version 362172 (0.00925) [2022-07-09 18:10:41,159][26022] Updated weights on worker 0-0, policy_version 362182 (0.00096) [2022-07-09 18:10:41,729][25689] Fps is (10 sec: 5598.5, 60 sec: 5633.4, 300 sec: 5638.5). Total num frames: 370876416. Throughput: 0: 5891.0. Samples: 370883254. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:41,730][25689] Avg episode reward: [(0, '-48.773')] [2022-07-09 18:10:42,929][26022] Updated weights on worker 0-0, policy_version 362192 (0.00093) [2022-07-09 18:10:44,699][26022] Updated weights on worker 0-0, policy_version 362202 (0.00088) [2022-07-09 18:10:46,487][26022] Updated weights on worker 0-0, policy_version 362212 (0.00086) [2022-07-09 18:10:46,770][25689] Fps is (10 sec: 5795.0, 60 sec: 5630.8, 300 sec: 5645.2). Total num frames: 370906112. Throughput: 0: 5054.6. Samples: 370900348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:46,770][25689] Avg episode reward: [(0, '-48.188')] [2022-07-09 18:10:48,551][26022] Updated weights on worker 0-0, policy_version 362222 (0.00086) [2022-07-09 18:10:50,045][26022] Updated weights on worker 0-0, policy_version 362232 (0.00081) [2022-07-09 18:10:51,843][25689] Fps is (10 sec: 5771.1, 60 sec: 5632.7, 300 sec: 5643.8). Total num frames: 370934784. Throughput: 0: 5918.3. Samples: 370934432. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:51,844][25689] Avg episode reward: [(0, '-46.763')] [2022-07-09 18:10:52,095][26022] Updated weights on worker 0-0, policy_version 362242 (0.00106) [2022-07-09 18:10:53,740][26022] Updated weights on worker 0-0, policy_version 362252 (0.00087) [2022-07-09 18:10:55,496][26022] Updated weights on worker 0-0, policy_version 362262 (0.00091) [2022-07-09 18:10:56,853][25689] Fps is (10 sec: 5585.2, 60 sec: 5623.1, 300 sec: 5640.9). Total num frames: 370962432. Throughput: 0: 5922.4. Samples: 370968694. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:10:56,854][25689] Avg episode reward: [(0, '-46.540')] [2022-07-09 18:10:57,468][26022] Updated weights on worker 0-0, policy_version 362272 (0.00092) [2022-07-09 18:10:59,165][26022] Updated weights on worker 0-0, policy_version 362282 (0.00090) [2022-07-09 18:11:00,798][26022] Updated weights on worker 0-0, policy_version 362292 (0.00084) [2022-07-09 18:11:01,861][25689] Fps is (10 sec: 5724.3, 60 sec: 5641.9, 300 sec: 5654.8). Total num frames: 370992128. Throughput: 0: 5103.8. Samples: 370986000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:01,861][25689] Avg episode reward: [(0, '-46.060')] [2022-07-09 18:11:02,992][26022] Updated weights on worker 0-0, policy_version 362302 (0.00084) [2022-07-09 18:11:04,833][26022] Updated weights on worker 0-0, policy_version 362312 (0.00087) [2022-07-09 18:11:06,696][26022] Updated weights on worker 0-0, policy_version 362322 (0.00088) [2022-07-09 18:11:06,875][25689] Fps is (10 sec: 5619.8, 60 sec: 5647.9, 300 sec: 5645.6). Total num frames: 371018752. Throughput: 0: 5866.8. Samples: 371018302. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:06,876][25689] Avg episode reward: [(0, '-46.816')] [2022-07-09 18:11:08,552][26022] Updated weights on worker 0-0, policy_version 362332 (0.00084) [2022-07-09 18:11:10,248][26022] Updated weights on worker 0-0, policy_version 362342 (0.00094) [2022-07-09 18:11:11,944][25689] Fps is (10 sec: 5484.1, 60 sec: 5664.5, 300 sec: 5645.8). Total num frames: 371047424. Throughput: 0: 5853.9. Samples: 371052098. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:11,944][25689] Avg episode reward: [(0, '-47.310')] [2022-07-09 18:11:12,193][26022] Updated weights on worker 0-0, policy_version 362352 (0.00085) [2022-07-09 18:11:13,816][26022] Updated weights on worker 0-0, policy_version 362362 (0.00088) [2022-07-09 18:11:15,721][26022] Updated weights on worker 0-0, policy_version 362372 (0.00086) [2022-07-09 18:11:17,001][25689] Fps is (10 sec: 5663.1, 60 sec: 5644.8, 300 sec: 5641.7). Total num frames: 371076096. Throughput: 0: 4989.2. Samples: 371069214. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:17,002][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 18:11:17,571][26022] Updated weights on worker 0-0, policy_version 362382 (0.00087) [2022-07-09 18:11:19,358][26022] Updated weights on worker 0-0, policy_version 362392 (0.00094) [2022-07-09 18:11:20,215][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:11:20,241][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000362397_371094528.pth [2022-07-09 18:11:20,242][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000360409_369058816.pth [2022-07-09 18:11:21,211][26022] Updated weights on worker 0-0, policy_version 362402 (0.00088) [2022-07-09 18:11:22,071][25689] Fps is (10 sec: 5662.5, 60 sec: 5661.6, 300 sec: 5638.8). Total num frames: 371104768. Throughput: 0: 5813.1. Samples: 371103482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:22,071][25689] Avg episode reward: [(0, '-46.493')] [2022-07-09 18:11:22,879][26022] Updated weights on worker 0-0, policy_version 362412 (0.00091) [2022-07-09 18:11:24,802][26022] Updated weights on worker 0-0, policy_version 362422 (0.00087) [2022-07-09 18:11:26,539][26022] Updated weights on worker 0-0, policy_version 362432 (0.00082) [2022-07-09 18:11:27,123][25689] Fps is (10 sec: 5564.1, 60 sec: 5629.0, 300 sec: 5639.0). Total num frames: 371132416. Throughput: 0: 5895.1. Samples: 371137666. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:27,124][25689] Avg episode reward: [(0, '-46.609')] [2022-07-09 18:11:28,295][26022] Updated weights on worker 0-0, policy_version 362442 (0.00083) [2022-07-09 18:11:30,251][26022] Updated weights on worker 0-0, policy_version 362452 (0.00090) [2022-07-09 18:11:31,937][26022] Updated weights on worker 0-0, policy_version 362462 (0.00087) [2022-07-09 18:11:32,163][25689] Fps is (10 sec: 5681.8, 60 sec: 5653.7, 300 sec: 5643.1). Total num frames: 371162112. Throughput: 0: 5076.9. Samples: 371154750. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:32,164][25689] Avg episode reward: [(0, '-46.968')] [2022-07-09 18:11:33,722][26022] Updated weights on worker 0-0, policy_version 362472 (0.00091) [2022-07-09 18:11:35,494][26022] Updated weights on worker 0-0, policy_version 362482 (0.00093) [2022-07-09 18:11:37,210][25689] Fps is (10 sec: 5888.3, 60 sec: 5685.9, 300 sec: 5646.0). Total num frames: 371191808. Throughput: 0: 5930.7. Samples: 371189064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:37,211][25689] Avg episode reward: [(0, '-46.887')] [2022-07-09 18:11:37,220][26022] Updated weights on worker 0-0, policy_version 362492 (0.00050) [2022-07-09 18:11:39,203][26022] Updated weights on worker 0-0, policy_version 362502 (0.00091) [2022-07-09 18:11:40,776][26022] Updated weights on worker 0-0, policy_version 362512 (0.00086) [2022-07-09 18:11:42,221][25689] Fps is (10 sec: 5701.3, 60 sec: 5670.8, 300 sec: 5642.6). Total num frames: 371219456. Throughput: 0: 5965.9. Samples: 371223698. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:42,222][25689] Avg episode reward: [(0, '-46.603')] [2022-07-09 18:11:42,754][26022] Updated weights on worker 0-0, policy_version 362522 (0.00082) [2022-07-09 18:11:44,467][26022] Updated weights on worker 0-0, policy_version 362532 (0.00090) [2022-07-09 18:11:46,206][26022] Updated weights on worker 0-0, policy_version 362542 (0.00087) [2022-07-09 18:11:47,233][25689] Fps is (10 sec: 5721.2, 60 sec: 5673.6, 300 sec: 5644.8). Total num frames: 371249152. Throughput: 0: 5976.2. Samples: 371257844. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:47,233][25689] Avg episode reward: [(0, '-47.651')] [2022-07-09 18:11:48,071][26022] Updated weights on worker 0-0, policy_version 362552 (0.00097) [2022-07-09 18:11:49,866][26022] Updated weights on worker 0-0, policy_version 362562 (0.00089) [2022-07-09 18:11:51,721][26022] Updated weights on worker 0-0, policy_version 362572 (0.00091) [2022-07-09 18:11:52,297][25689] Fps is (10 sec: 5691.3, 60 sec: 5657.5, 300 sec: 5644.4). Total num frames: 371276800. Throughput: 0: 5974.2. Samples: 371275034. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:52,298][25689] Avg episode reward: [(0, '-47.680')] [2022-07-09 18:11:53,367][26022] Updated weights on worker 0-0, policy_version 362582 (0.00085) [2022-07-09 18:11:55,324][26022] Updated weights on worker 0-0, policy_version 362592 (0.00088) [2022-07-09 18:11:56,939][26022] Updated weights on worker 0-0, policy_version 362602 (0.00088) [2022-07-09 18:11:57,303][25689] Fps is (10 sec: 5694.6, 60 sec: 5691.8, 300 sec: 5645.1). Total num frames: 371306496. Throughput: 0: 5984.6. Samples: 371309312. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:11:57,303][25689] Avg episode reward: [(0, '-47.079')] [2022-07-09 18:11:58,845][26022] Updated weights on worker 0-0, policy_version 362612 (0.00088) [2022-07-09 18:12:00,544][26022] Updated weights on worker 0-0, policy_version 362622 (0.00084) [2022-07-09 18:12:02,314][25689] Fps is (10 sec: 5520.3, 60 sec: 5623.7, 300 sec: 5642.7). Total num frames: 371332096. Throughput: 0: 5896.9. Samples: 371342182. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:12:02,314][25689] Avg episode reward: [(0, '-46.414')] [2022-07-09 18:12:02,822][26022] Updated weights on worker 0-0, policy_version 362632 (0.00091) [2022-07-09 18:12:04,643][26022] Updated weights on worker 0-0, policy_version 362642 (0.00100) [2022-07-09 18:12:06,399][26022] Updated weights on worker 0-0, policy_version 362652 (0.00090) [2022-07-09 18:12:07,316][25689] Fps is (10 sec: 5419.8, 60 sec: 5658.7, 300 sec: 5644.6). Total num frames: 371360768. Throughput: 0: 5029.3. Samples: 371358850. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 18:12:07,317][25689] Avg episode reward: [(0, '-45.356')] [2022-07-09 18:12:08,029][26022] Updated weights on worker 0-0, policy_version 362662 (0.00089) [2022-07-09 18:12:10,037][26022] Updated weights on worker 0-0, policy_version 362672 (0.00188) [2022-07-09 18:12:11,591][26022] Updated weights on worker 0-0, policy_version 362682 (0.00528) [2022-07-09 18:12:12,371][25689] Fps is (10 sec: 5803.9, 60 sec: 5676.9, 300 sec: 5644.1). Total num frames: 371390464. Throughput: 0: 5892.2. Samples: 371393312. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:12,371][25689] Avg episode reward: [(0, '-45.141')] [2022-07-09 18:12:13,611][26022] Updated weights on worker 0-0, policy_version 362692 (0.00112) [2022-07-09 18:12:15,231][26022] Updated weights on worker 0-0, policy_version 362702 (0.00087) [2022-07-09 18:12:17,129][26022] Updated weights on worker 0-0, policy_version 362712 (0.00086) [2022-07-09 18:12:17,379][25689] Fps is (10 sec: 5699.0, 60 sec: 5664.6, 300 sec: 5641.0). Total num frames: 371418112. Throughput: 0: 5884.1. Samples: 371427440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:17,379][25689] Avg episode reward: [(0, '-44.214')] [2022-07-09 18:12:18,887][26022] Updated weights on worker 0-0, policy_version 362722 (0.00089) [2022-07-09 18:12:20,657][26022] Updated weights on worker 0-0, policy_version 362732 (0.00047) [2022-07-09 18:12:22,435][25689] Fps is (10 sec: 5596.2, 60 sec: 5665.9, 300 sec: 5644.0). Total num frames: 371446784. Throughput: 0: 5087.5. Samples: 371444546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:22,435][25689] Avg episode reward: [(0, '-45.081')] [2022-07-09 18:12:22,495][26022] Updated weights on worker 0-0, policy_version 362742 (0.00091) [2022-07-09 18:12:24,298][26022] Updated weights on worker 0-0, policy_version 362752 (0.00080) [2022-07-09 18:12:26,082][26022] Updated weights on worker 0-0, policy_version 362762 (0.00084) [2022-07-09 18:12:27,459][25689] Fps is (10 sec: 5688.8, 60 sec: 5685.5, 300 sec: 5645.6). Total num frames: 371475456. Throughput: 0: 5963.6. Samples: 371478970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:27,459][25689] Avg episode reward: [(0, '-45.576')] [2022-07-09 18:12:27,890][26022] Updated weights on worker 0-0, policy_version 362772 (0.00124) [2022-07-09 18:12:29,646][26022] Updated weights on worker 0-0, policy_version 362782 (0.00087) [2022-07-09 18:12:31,398][26022] Updated weights on worker 0-0, policy_version 362792 (0.00081) [2022-07-09 18:12:32,520][25689] Fps is (10 sec: 5787.7, 60 sec: 5683.6, 300 sec: 5645.3). Total num frames: 371505152. Throughput: 0: 5939.7. Samples: 371512990. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:32,520][25689] Avg episode reward: [(0, '-46.018')] [2022-07-09 18:12:33,315][26022] Updated weights on worker 0-0, policy_version 362802 (0.00088) [2022-07-09 18:12:35,259][26022] Updated weights on worker 0-0, policy_version 362812 (0.00087) [2022-07-09 18:12:36,759][26022] Updated weights on worker 0-0, policy_version 362822 (0.00510) [2022-07-09 18:12:37,535][25689] Fps is (10 sec: 5691.1, 60 sec: 5652.6, 300 sec: 5649.0). Total num frames: 371532800. Throughput: 0: 5097.3. Samples: 371530182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:37,535][25689] Avg episode reward: [(0, '-47.226')] [2022-07-09 18:12:38,699][26022] Updated weights on worker 0-0, policy_version 362832 (0.00084) [2022-07-09 18:12:40,379][26022] Updated weights on worker 0-0, policy_version 362842 (0.00089) [2022-07-09 18:12:42,129][26022] Updated weights on worker 0-0, policy_version 362852 (0.00094) [2022-07-09 18:12:42,578][25689] Fps is (10 sec: 5599.2, 60 sec: 5666.6, 300 sec: 5645.0). Total num frames: 371561472. Throughput: 0: 5968.4. Samples: 371564770. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:42,579][25689] Avg episode reward: [(0, '-47.304')] [2022-07-09 18:12:44,095][26022] Updated weights on worker 0-0, policy_version 362862 (0.00084) [2022-07-09 18:12:45,711][26022] Updated weights on worker 0-0, policy_version 362872 (0.00101) [2022-07-09 18:12:47,530][26022] Updated weights on worker 0-0, policy_version 362882 (0.00096) [2022-07-09 18:12:47,599][25689] Fps is (10 sec: 5799.7, 60 sec: 5665.7, 300 sec: 5650.7). Total num frames: 371591168. Throughput: 0: 5962.3. Samples: 371599052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:47,599][25689] Avg episode reward: [(0, '-47.553')] [2022-07-09 18:12:49,371][26022] Updated weights on worker 0-0, policy_version 362892 (0.00083) [2022-07-09 18:12:51,047][26022] Updated weights on worker 0-0, policy_version 362902 (0.00088) [2022-07-09 18:12:52,658][25689] Fps is (10 sec: 5688.8, 60 sec: 5666.2, 300 sec: 5643.0). Total num frames: 371618816. Throughput: 0: 5117.2. Samples: 371616044. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:52,659][25689] Avg episode reward: [(0, '-46.610')] [2022-07-09 18:12:53,071][26022] Updated weights on worker 0-0, policy_version 362912 (0.00082) [2022-07-09 18:12:54,611][26022] Updated weights on worker 0-0, policy_version 362922 (0.00082) [2022-07-09 18:12:56,679][26022] Updated weights on worker 0-0, policy_version 362932 (0.00088) [2022-07-09 18:12:57,660][25689] Fps is (10 sec: 5699.6, 60 sec: 5666.5, 300 sec: 5646.7). Total num frames: 371648512. Throughput: 0: 5980.1. Samples: 371650532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:12:57,660][25689] Avg episode reward: [(0, '-46.932')] [2022-07-09 18:12:58,234][26022] Updated weights on worker 0-0, policy_version 362942 (0.00084) [2022-07-09 18:13:00,203][26022] Updated weights on worker 0-0, policy_version 362952 (0.00087) [2022-07-09 18:13:02,171][26022] Updated weights on worker 0-0, policy_version 362962 (0.00095) [2022-07-09 18:13:02,676][25689] Fps is (10 sec: 5622.3, 60 sec: 5683.1, 300 sec: 5653.8). Total num frames: 371675136. Throughput: 0: 5867.4. Samples: 371682690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:02,676][25689] Avg episode reward: [(0, '-47.081')] [2022-07-09 18:13:04,234][26022] Updated weights on worker 0-0, policy_version 362972 (0.00110) [2022-07-09 18:13:05,751][26022] Updated weights on worker 0-0, policy_version 362982 (0.00094) [2022-07-09 18:13:07,776][25689] Fps is (10 sec: 5364.7, 60 sec: 5656.9, 300 sec: 5646.5). Total num frames: 371702784. Throughput: 0: 4997.6. Samples: 371699892. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:07,777][25689] Avg episode reward: [(0, '-46.583')] [2022-07-09 18:13:07,804][26022] Updated weights on worker 0-0, policy_version 362992 (0.00087) [2022-07-09 18:13:09,450][26022] Updated weights on worker 0-0, policy_version 363002 (0.00096) [2022-07-09 18:13:11,511][26022] Updated weights on worker 0-0, policy_version 363012 (0.00094) [2022-07-09 18:13:12,857][25689] Fps is (10 sec: 5632.4, 60 sec: 5654.5, 300 sec: 5648.6). Total num frames: 371732480. Throughput: 0: 5828.5. Samples: 371733770. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:12,857][25689] Avg episode reward: [(0, '-46.980')] [2022-07-09 18:13:13,396][26022] Updated weights on worker 0-0, policy_version 363022 (0.00090) [2022-07-09 18:13:15,092][26022] Updated weights on worker 0-0, policy_version 363032 (0.00098) [2022-07-09 18:13:16,947][26022] Updated weights on worker 0-0, policy_version 363042 (0.00089) [2022-07-09 18:13:17,909][25689] Fps is (10 sec: 5760.4, 60 sec: 5667.2, 300 sec: 5655.0). Total num frames: 371761152. Throughput: 0: 5774.2. Samples: 371767456. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:17,910][25689] Avg episode reward: [(0, '-47.030')] [2022-07-09 18:13:18,741][26022] Updated weights on worker 0-0, policy_version 363052 (0.00087) [2022-07-09 18:13:20,454][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:13:20,468][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000363062_371775488.pth [2022-07-09 18:13:20,469][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000361076_369741824.pth [2022-07-09 18:13:20,473][26022] Updated weights on worker 0-0, policy_version 363062 (0.00083) [2022-07-09 18:13:22,357][26022] Updated weights on worker 0-0, policy_version 363072 (0.00097) [2022-07-09 18:13:22,988][25689] Fps is (10 sec: 5761.1, 60 sec: 5682.0, 300 sec: 5653.8). Total num frames: 371790848. Throughput: 0: 5012.2. Samples: 371784500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:22,988][25689] Avg episode reward: [(0, '-46.655')] [2022-07-09 18:13:24,039][26022] Updated weights on worker 0-0, policy_version 363082 (0.00082) [2022-07-09 18:13:25,745][26022] Updated weights on worker 0-0, policy_version 363092 (0.00094) [2022-07-09 18:13:27,629][26022] Updated weights on worker 0-0, policy_version 363102 (0.00092) [2022-07-09 18:13:28,010][25689] Fps is (10 sec: 5677.0, 60 sec: 5665.2, 300 sec: 5648.2). Total num frames: 371818496. Throughput: 0: 5887.6. Samples: 371819022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:28,011][25689] Avg episode reward: [(0, '-45.670')] [2022-07-09 18:13:29,316][26022] Updated weights on worker 0-0, policy_version 363112 (0.00091) [2022-07-09 18:13:31,163][26022] Updated weights on worker 0-0, policy_version 363122 (0.00415) [2022-07-09 18:13:33,038][26022] Updated weights on worker 0-0, policy_version 363132 (0.00082) [2022-07-09 18:13:33,049][25689] Fps is (10 sec: 5597.7, 60 sec: 5650.4, 300 sec: 5651.8). Total num frames: 371847168. Throughput: 0: 5927.5. Samples: 371853462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:33,050][25689] Avg episode reward: [(0, '-45.956')] [2022-07-09 18:13:34,832][26022] Updated weights on worker 0-0, policy_version 363142 (0.00091) [2022-07-09 18:13:36,548][26022] Updated weights on worker 0-0, policy_version 363152 (0.00085) [2022-07-09 18:13:38,059][25689] Fps is (10 sec: 5604.9, 60 sec: 5650.9, 300 sec: 5649.1). Total num frames: 371874816. Throughput: 0: 5114.5. Samples: 371870510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:38,059][25689] Avg episode reward: [(0, '-46.309')] [2022-07-09 18:13:38,541][26022] Updated weights on worker 0-0, policy_version 363162 (0.00085) [2022-07-09 18:13:40,136][26022] Updated weights on worker 0-0, policy_version 363172 (0.00086) [2022-07-09 18:13:42,208][26022] Updated weights on worker 0-0, policy_version 363182 (0.00088) [2022-07-09 18:13:43,090][25689] Fps is (10 sec: 5813.4, 60 sec: 5685.9, 300 sec: 5660.2). Total num frames: 371905536. Throughput: 0: 5982.4. Samples: 371904756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:43,090][25689] Avg episode reward: [(0, '-45.072')] [2022-07-09 18:13:43,817][26022] Updated weights on worker 0-0, policy_version 363192 (0.00093) [2022-07-09 18:13:45,625][26022] Updated weights on worker 0-0, policy_version 363202 (0.00081) [2022-07-09 18:13:47,410][26022] Updated weights on worker 0-0, policy_version 363212 (0.00093) [2022-07-09 18:13:48,093][25689] Fps is (10 sec: 5714.8, 60 sec: 5636.8, 300 sec: 5650.8). Total num frames: 371932160. Throughput: 0: 5957.7. Samples: 371938668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:48,094][25689] Avg episode reward: [(0, '-45.277')] [2022-07-09 18:13:49,355][26022] Updated weights on worker 0-0, policy_version 363222 (0.00574) [2022-07-09 18:13:51,016][26022] Updated weights on worker 0-0, policy_version 363232 (0.00093) [2022-07-09 18:13:53,027][26022] Updated weights on worker 0-0, policy_version 363242 (0.00089) [2022-07-09 18:13:53,167][25689] Fps is (10 sec: 5385.6, 60 sec: 5635.5, 300 sec: 5649.5). Total num frames: 371959808. Throughput: 0: 5083.5. Samples: 371955728. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:53,167][25689] Avg episode reward: [(0, '-46.251')] [2022-07-09 18:13:54,520][26022] Updated weights on worker 0-0, policy_version 363252 (0.00085) [2022-07-09 18:13:56,569][26022] Updated weights on worker 0-0, policy_version 363262 (0.00083) [2022-07-09 18:13:58,185][25689] Fps is (10 sec: 5682.3, 60 sec: 5633.9, 300 sec: 5653.6). Total num frames: 371989504. Throughput: 0: 5918.0. Samples: 371989616. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:13:58,185][25689] Avg episode reward: [(0, '-46.338')] [2022-07-09 18:13:58,208][26022] Updated weights on worker 0-0, policy_version 363272 (0.00088) [2022-07-09 18:14:00,128][26022] Updated weights on worker 0-0, policy_version 363282 (0.00092) [2022-07-09 18:14:02,289][26022] Updated weights on worker 0-0, policy_version 363292 (0.00092) [2022-07-09 18:14:03,194][25689] Fps is (10 sec: 5514.7, 60 sec: 5617.6, 300 sec: 5650.2). Total num frames: 372015104. Throughput: 0: 5806.5. Samples: 372021492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:03,196][25689] Avg episode reward: [(0, '-45.720')] [2022-07-09 18:14:04,042][26022] Updated weights on worker 0-0, policy_version 363302 (0.00096) [2022-07-09 18:14:05,987][26022] Updated weights on worker 0-0, policy_version 363312 (0.00086) [2022-07-09 18:14:07,627][26022] Updated weights on worker 0-0, policy_version 363322 (0.00087) [2022-07-09 18:14:08,219][25689] Fps is (10 sec: 5510.9, 60 sec: 5658.6, 300 sec: 5655.0). Total num frames: 372044800. Throughput: 0: 4960.9. Samples: 372038510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:08,219][25689] Avg episode reward: [(0, '-45.492')] [2022-07-09 18:14:09,554][26022] Updated weights on worker 0-0, policy_version 363332 (0.00088) [2022-07-09 18:14:11,420][26022] Updated weights on worker 0-0, policy_version 363342 (0.00097) [2022-07-09 18:14:13,118][26022] Updated weights on worker 0-0, policy_version 363352 (0.00086) [2022-07-09 18:14:13,305][25689] Fps is (10 sec: 5671.3, 60 sec: 5624.1, 300 sec: 5647.1). Total num frames: 372072448. Throughput: 0: 5799.1. Samples: 372072512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:13,306][25689] Avg episode reward: [(0, '-46.266')] [2022-07-09 18:14:15,103][26022] Updated weights on worker 0-0, policy_version 363362 (0.00104) [2022-07-09 18:14:16,843][26022] Updated weights on worker 0-0, policy_version 363372 (0.00094) [2022-07-09 18:14:18,355][25689] Fps is (10 sec: 5556.5, 60 sec: 5624.4, 300 sec: 5647.8). Total num frames: 372101120. Throughput: 0: 5794.3. Samples: 372106486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:18,356][25689] Avg episode reward: [(0, '-45.408')] [2022-07-09 18:14:18,631][26022] Updated weights on worker 0-0, policy_version 363382 (0.00084) [2022-07-09 18:14:20,299][26022] Updated weights on worker 0-0, policy_version 363392 (0.00084) [2022-07-09 18:14:22,144][26022] Updated weights on worker 0-0, policy_version 363402 (0.00090) [2022-07-09 18:14:23,372][25689] Fps is (10 sec: 5696.4, 60 sec: 5613.2, 300 sec: 5647.9). Total num frames: 372129792. Throughput: 0: 5068.4. Samples: 372123762. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:23,372][25689] Avg episode reward: [(0, '-44.636')] [2022-07-09 18:14:24,015][26022] Updated weights on worker 0-0, policy_version 363412 (0.00088) [2022-07-09 18:14:25,884][26022] Updated weights on worker 0-0, policy_version 363422 (0.00092) [2022-07-09 18:14:27,619][26022] Updated weights on worker 0-0, policy_version 363432 (0.00089) [2022-07-09 18:14:28,383][25689] Fps is (10 sec: 5718.2, 60 sec: 5631.2, 300 sec: 5648.8). Total num frames: 372158464. Throughput: 0: 5909.1. Samples: 372157662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:28,383][25689] Avg episode reward: [(0, '-45.189')] [2022-07-09 18:14:29,423][26022] Updated weights on worker 0-0, policy_version 363442 (0.00097) [2022-07-09 18:14:31,326][26022] Updated weights on worker 0-0, policy_version 363452 (0.00087) [2022-07-09 18:14:33,133][26022] Updated weights on worker 0-0, policy_version 363462 (0.00091) [2022-07-09 18:14:33,446][25689] Fps is (10 sec: 5692.1, 60 sec: 5628.9, 300 sec: 5642.6). Total num frames: 372187136. Throughput: 0: 5910.1. Samples: 372191548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:33,447][25689] Avg episode reward: [(0, '-46.315')] [2022-07-09 18:14:34,908][26022] Updated weights on worker 0-0, policy_version 363472 (0.00090) [2022-07-09 18:14:36,526][26022] Updated weights on worker 0-0, policy_version 363482 (0.00085) [2022-07-09 18:14:38,454][25689] Fps is (10 sec: 5592.1, 60 sec: 5629.0, 300 sec: 5649.5). Total num frames: 372214784. Throughput: 0: 5080.7. Samples: 372208606. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:38,455][25689] Avg episode reward: [(0, '-45.737')] [2022-07-09 18:14:38,749][26022] Updated weights on worker 0-0, policy_version 363492 (0.00086) [2022-07-09 18:14:40,161][26022] Updated weights on worker 0-0, policy_version 363502 (0.00082) [2022-07-09 18:14:42,158][26022] Updated weights on worker 0-0, policy_version 363512 (0.00080) [2022-07-09 18:14:43,523][25689] Fps is (10 sec: 5792.5, 60 sec: 5625.5, 300 sec: 5651.9). Total num frames: 372245504. Throughput: 0: 5907.1. Samples: 372242794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 18:14:43,523][25689] Avg episode reward: [(0, '-45.659')] [2022-07-09 18:14:43,646][26022] Updated weights on worker 0-0, policy_version 363522 (0.00086) [2022-07-09 18:14:45,947][26022] Updated weights on worker 0-0, policy_version 363532 (0.00097) [2022-07-09 18:14:47,503][26022] Updated weights on worker 0-0, policy_version 363542 (0.00086) [2022-07-09 18:14:48,533][25689] Fps is (10 sec: 5588.1, 60 sec: 5608.0, 300 sec: 5643.2). Total num frames: 372271104. Throughput: 0: 5910.2. Samples: 372276752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:14:48,534][25689] Avg episode reward: [(0, '-46.364')] [2022-07-09 18:14:49,502][26022] Updated weights on worker 0-0, policy_version 363552 (0.00091) [2022-07-09 18:14:50,997][26022] Updated weights on worker 0-0, policy_version 363562 (0.00088) [2022-07-09 18:14:53,146][26022] Updated weights on worker 0-0, policy_version 363572 (0.00092) [2022-07-09 18:14:53,619][25689] Fps is (10 sec: 5476.9, 60 sec: 5640.7, 300 sec: 5646.7). Total num frames: 372300800. Throughput: 0: 5904.7. Samples: 372310660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:14:53,619][25689] Avg episode reward: [(0, '-46.751')] [2022-07-09 18:14:54,668][26022] Updated weights on worker 0-0, policy_version 363582 (0.00086) [2022-07-09 18:14:56,732][26022] Updated weights on worker 0-0, policy_version 363592 (0.00081) [2022-07-09 18:14:58,345][26022] Updated weights on worker 0-0, policy_version 363602 (0.00091) [2022-07-09 18:14:58,681][25689] Fps is (10 sec: 5852.1, 60 sec: 5636.5, 300 sec: 5649.5). Total num frames: 372330496. Throughput: 0: 5884.7. Samples: 372327638. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:14:58,682][25689] Avg episode reward: [(0, '-46.817')] [2022-07-09 18:15:00,121][26022] Updated weights on worker 0-0, policy_version 363612 (0.00080) [2022-07-09 18:15:02,259][26022] Updated weights on worker 0-0, policy_version 363622 (0.00088) [2022-07-09 18:15:03,688][25689] Fps is (10 sec: 5491.6, 60 sec: 5636.8, 300 sec: 5647.4). Total num frames: 372356096. Throughput: 0: 5811.8. Samples: 372359990. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:03,690][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 18:15:03,979][26022] Updated weights on worker 0-0, policy_version 363632 (0.00089) [2022-07-09 18:15:05,981][26022] Updated weights on worker 0-0, policy_version 363642 (0.00087) [2022-07-09 18:15:07,625][26022] Updated weights on worker 0-0, policy_version 363652 (0.00084) [2022-07-09 18:15:08,706][25689] Fps is (10 sec: 5311.7, 60 sec: 5603.6, 300 sec: 5648.3). Total num frames: 372383744. Throughput: 0: 5831.6. Samples: 372394394. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:08,707][25689] Avg episode reward: [(0, '-47.868')] [2022-07-09 18:15:09,452][26022] Updated weights on worker 0-0, policy_version 363662 (0.00086) [2022-07-09 18:15:11,314][26022] Updated weights on worker 0-0, policy_version 363672 (0.00071) [2022-07-09 18:15:12,856][26022] Updated weights on worker 0-0, policy_version 363682 (0.00095) [2022-07-09 18:15:13,775][25689] Fps is (10 sec: 5684.8, 60 sec: 5639.0, 300 sec: 5647.5). Total num frames: 372413440. Throughput: 0: 5006.2. Samples: 372411568. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:13,781][25689] Avg episode reward: [(0, '-48.544')] [2022-07-09 18:15:14,870][26022] Updated weights on worker 0-0, policy_version 363692 (0.00094) [2022-07-09 18:15:16,709][26022] Updated weights on worker 0-0, policy_version 363702 (0.00095) [2022-07-09 18:15:18,470][26022] Updated weights on worker 0-0, policy_version 363712 (0.00093) [2022-07-09 18:15:18,803][25689] Fps is (10 sec: 5882.1, 60 sec: 5658.0, 300 sec: 5655.1). Total num frames: 372443136. Throughput: 0: 5866.6. Samples: 372445684. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:18,805][25689] Avg episode reward: [(0, '-47.883')] [2022-07-09 18:15:20,469][26022] Updated weights on worker 0-0, policy_version 363722 (0.00096) [2022-07-09 18:15:20,635][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:15:20,649][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000363723_372452352.pth [2022-07-09 18:15:20,650][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000361736_370417664.pth [2022-07-09 18:15:21,822][26022] Updated weights on worker 0-0, policy_version 363732 (0.00081) [2022-07-09 18:15:23,823][25689] Fps is (10 sec: 5605.3, 60 sec: 5623.9, 300 sec: 5645.7). Total num frames: 372469760. Throughput: 0: 5950.1. Samples: 372479794. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:23,825][25689] Avg episode reward: [(0, '-47.894')] [2022-07-09 18:15:24,050][26022] Updated weights on worker 0-0, policy_version 363742 (0.00097) [2022-07-09 18:15:25,745][26022] Updated weights on worker 0-0, policy_version 363752 (0.00097) [2022-07-09 18:15:27,534][26022] Updated weights on worker 0-0, policy_version 363762 (0.00084) [2022-07-09 18:15:28,896][25689] Fps is (10 sec: 5580.3, 60 sec: 5635.0, 300 sec: 5650.1). Total num frames: 372499456. Throughput: 0: 5073.9. Samples: 372496834. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:28,897][25689] Avg episode reward: [(0, '-47.499')] [2022-07-09 18:15:29,455][26022] Updated weights on worker 0-0, policy_version 363772 (0.01142) [2022-07-09 18:15:31,113][26022] Updated weights on worker 0-0, policy_version 363782 (0.00081) [2022-07-09 18:15:33,061][26022] Updated weights on worker 0-0, policy_version 363792 (0.00087) [2022-07-09 18:15:33,945][25689] Fps is (10 sec: 5867.3, 60 sec: 5653.2, 300 sec: 5656.6). Total num frames: 372529152. Throughput: 0: 5910.6. Samples: 372530786. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:33,946][25689] Avg episode reward: [(0, '-47.499')] [2022-07-09 18:15:34,923][26022] Updated weights on worker 0-0, policy_version 363802 (0.00088) [2022-07-09 18:15:36,567][26022] Updated weights on worker 0-0, policy_version 363812 (0.00090) [2022-07-09 18:15:38,339][26022] Updated weights on worker 0-0, policy_version 363822 (0.00089) [2022-07-09 18:15:38,968][25689] Fps is (10 sec: 5591.6, 60 sec: 5634.9, 300 sec: 5649.9). Total num frames: 372555776. Throughput: 0: 5916.9. Samples: 372564998. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:38,970][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 18:15:40,011][26022] Updated weights on worker 0-0, policy_version 363832 (0.00087) [2022-07-09 18:15:42,125][26022] Updated weights on worker 0-0, policy_version 363842 (0.00088) [2022-07-09 18:15:43,894][26022] Updated weights on worker 0-0, policy_version 363852 (0.00091) [2022-07-09 18:15:43,982][25689] Fps is (10 sec: 5509.3, 60 sec: 5606.1, 300 sec: 5646.9). Total num frames: 372584448. Throughput: 0: 5065.0. Samples: 372581902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:43,983][25689] Avg episode reward: [(0, '-47.225')] [2022-07-09 18:15:45,860][26022] Updated weights on worker 0-0, policy_version 363862 (0.00096) [2022-07-09 18:15:47,502][26022] Updated weights on worker 0-0, policy_version 363872 (0.00090) [2022-07-09 18:15:49,008][25689] Fps is (10 sec: 5711.4, 60 sec: 5655.4, 300 sec: 5647.8). Total num frames: 372613120. Throughput: 0: 5906.9. Samples: 372615636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:49,009][25689] Avg episode reward: [(0, '-46.893')] [2022-07-09 18:15:49,253][26022] Updated weights on worker 0-0, policy_version 363882 (0.00098) [2022-07-09 18:15:51,182][26022] Updated weights on worker 0-0, policy_version 363892 (0.00085) [2022-07-09 18:15:52,901][26022] Updated weights on worker 0-0, policy_version 363902 (0.00080) [2022-07-09 18:15:54,103][25689] Fps is (10 sec: 5666.0, 60 sec: 5637.7, 300 sec: 5649.7). Total num frames: 372641792. Throughput: 0: 5895.1. Samples: 372649618. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:54,103][25689] Avg episode reward: [(0, '-46.597')] [2022-07-09 18:15:54,955][26022] Updated weights on worker 0-0, policy_version 363912 (0.00098) [2022-07-09 18:15:56,327][26022] Updated weights on worker 0-0, policy_version 363922 (0.00086) [2022-07-09 18:15:58,391][26022] Updated weights on worker 0-0, policy_version 363932 (0.00087) [2022-07-09 18:15:59,118][25689] Fps is (10 sec: 5773.5, 60 sec: 5642.2, 300 sec: 5649.5). Total num frames: 372671488. Throughput: 0: 5039.3. Samples: 372666538. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:15:59,123][25689] Avg episode reward: [(0, '-46.395')] [2022-07-09 18:16:00,089][26022] Updated weights on worker 0-0, policy_version 363942 (0.00081) [2022-07-09 18:16:02,067][26022] Updated weights on worker 0-0, policy_version 363952 (0.00089) [2022-07-09 18:16:04,127][25689] Fps is (10 sec: 5414.2, 60 sec: 5625.0, 300 sec: 5642.7). Total num frames: 372696064. Throughput: 0: 5787.6. Samples: 372698492. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:04,127][25689] Avg episode reward: [(0, '-45.949')] [2022-07-09 18:16:04,244][26022] Updated weights on worker 0-0, policy_version 363962 (0.00088) [2022-07-09 18:16:06,010][26022] Updated weights on worker 0-0, policy_version 363972 (0.00090) [2022-07-09 18:16:07,852][26022] Updated weights on worker 0-0, policy_version 363982 (0.00089) [2022-07-09 18:16:09,128][25689] Fps is (10 sec: 5319.1, 60 sec: 5643.5, 300 sec: 5644.0). Total num frames: 372724736. Throughput: 0: 5803.6. Samples: 372732406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:09,129][25689] Avg episode reward: [(0, '-46.313')] [2022-07-09 18:16:09,714][26022] Updated weights on worker 0-0, policy_version 363992 (0.00085) [2022-07-09 18:16:11,367][26022] Updated weights on worker 0-0, policy_version 364002 (0.00093) [2022-07-09 18:16:13,271][26022] Updated weights on worker 0-0, policy_version 364012 (0.00084) [2022-07-09 18:16:14,277][25689] Fps is (10 sec: 5649.4, 60 sec: 5619.1, 300 sec: 5642.3). Total num frames: 372753408. Throughput: 0: 4943.8. Samples: 372749356. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:14,278][25689] Avg episode reward: [(0, '-46.214')] [2022-07-09 18:16:14,874][26022] Updated weights on worker 0-0, policy_version 364022 (0.00089) [2022-07-09 18:16:16,803][26022] Updated weights on worker 0-0, policy_version 364032 (0.00085) [2022-07-09 18:16:18,751][26022] Updated weights on worker 0-0, policy_version 364042 (0.00096) [2022-07-09 18:16:19,331][25689] Fps is (10 sec: 5620.6, 60 sec: 5599.8, 300 sec: 5642.6). Total num frames: 372782080. Throughput: 0: 5767.6. Samples: 372783120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:19,331][25689] Avg episode reward: [(0, '-46.555')] [2022-07-09 18:16:20,526][26022] Updated weights on worker 0-0, policy_version 364052 (0.00083) [2022-07-09 18:16:22,294][26022] Updated weights on worker 0-0, policy_version 364062 (0.00087) [2022-07-09 18:16:23,911][26022] Updated weights on worker 0-0, policy_version 364072 (0.00085) [2022-07-09 18:16:24,353][25689] Fps is (10 sec: 5691.5, 60 sec: 5633.5, 300 sec: 5646.6). Total num frames: 372810752. Throughput: 0: 5866.2. Samples: 372817140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:24,353][25689] Avg episode reward: [(0, '-47.199')] [2022-07-09 18:16:26,034][26022] Updated weights on worker 0-0, policy_version 364082 (0.00094) [2022-07-09 18:16:27,751][26022] Updated weights on worker 0-0, policy_version 364092 (0.00099) [2022-07-09 18:16:29,416][25689] Fps is (10 sec: 5584.4, 60 sec: 5600.5, 300 sec: 5639.3). Total num frames: 372838400. Throughput: 0: 5010.0. Samples: 372834048. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:29,417][25689] Avg episode reward: [(0, '-47.210')] [2022-07-09 18:16:29,700][26022] Updated weights on worker 0-0, policy_version 364102 (0.00100) [2022-07-09 18:16:31,258][26022] Updated weights on worker 0-0, policy_version 364112 (0.00087) [2022-07-09 18:16:33,186][26022] Updated weights on worker 0-0, policy_version 364122 (0.00085) [2022-07-09 18:16:34,522][25689] Fps is (10 sec: 5638.8, 60 sec: 5595.3, 300 sec: 5638.1). Total num frames: 372868096. Throughput: 0: 5848.5. Samples: 372867758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:34,523][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 18:16:35,094][26022] Updated weights on worker 0-0, policy_version 364132 (0.00086) [2022-07-09 18:16:36,802][26022] Updated weights on worker 0-0, policy_version 364142 (0.00082) [2022-07-09 18:16:38,775][26022] Updated weights on worker 0-0, policy_version 364152 (0.00092) [2022-07-09 18:16:39,528][25689] Fps is (10 sec: 5772.3, 60 sec: 5630.7, 300 sec: 5641.7). Total num frames: 372896768. Throughput: 0: 5881.2. Samples: 372901902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:39,528][25689] Avg episode reward: [(0, '-47.677')] [2022-07-09 18:16:40,193][26022] Updated weights on worker 0-0, policy_version 364162 (0.00085) [2022-07-09 18:16:42,287][26022] Updated weights on worker 0-0, policy_version 364172 (0.00085) [2022-07-09 18:16:43,995][26022] Updated weights on worker 0-0, policy_version 364182 (0.00614) [2022-07-09 18:16:44,541][25689] Fps is (10 sec: 5519.1, 60 sec: 5596.9, 300 sec: 5631.3). Total num frames: 372923392. Throughput: 0: 5050.8. Samples: 372919108. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:44,542][25689] Avg episode reward: [(0, '-49.126')] [2022-07-09 18:16:45,859][26022] Updated weights on worker 0-0, policy_version 364192 (0.00087) [2022-07-09 18:16:47,787][26022] Updated weights on worker 0-0, policy_version 364202 (0.00088) [2022-07-09 18:16:49,476][26022] Updated weights on worker 0-0, policy_version 364212 (0.00091) [2022-07-09 18:16:49,561][25689] Fps is (10 sec: 5613.6, 60 sec: 5614.4, 300 sec: 5639.1). Total num frames: 372953088. Throughput: 0: 5900.1. Samples: 372952902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:49,561][25689] Avg episode reward: [(0, '-48.635')] [2022-07-09 18:16:51,451][26022] Updated weights on worker 0-0, policy_version 364222 (0.00096) [2022-07-09 18:16:53,027][26022] Updated weights on worker 0-0, policy_version 364232 (0.00402) [2022-07-09 18:16:54,628][25689] Fps is (10 sec: 5684.8, 60 sec: 5600.0, 300 sec: 5631.0). Total num frames: 372980736. Throughput: 0: 5911.8. Samples: 372986622. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:54,629][25689] Avg episode reward: [(0, '-48.764')] [2022-07-09 18:16:54,977][26022] Updated weights on worker 0-0, policy_version 364242 (0.00087) [2022-07-09 18:16:56,659][26022] Updated weights on worker 0-0, policy_version 364252 (0.00094) [2022-07-09 18:16:58,728][26022] Updated weights on worker 0-0, policy_version 364262 (0.00083) [2022-07-09 18:16:59,635][25689] Fps is (10 sec: 5692.2, 60 sec: 5600.8, 300 sec: 5644.9). Total num frames: 373010432. Throughput: 0: 5064.0. Samples: 373003724. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:16:59,635][25689] Avg episode reward: [(0, '-48.245')] [2022-07-09 18:17:00,314][26022] Updated weights on worker 0-0, policy_version 364272 (0.00080) [2022-07-09 18:17:02,578][26022] Updated weights on worker 0-0, policy_version 364282 (0.00092) [2022-07-09 18:17:04,291][26022] Updated weights on worker 0-0, policy_version 364292 (0.00086) [2022-07-09 18:17:04,647][25689] Fps is (10 sec: 5519.4, 60 sec: 5617.5, 300 sec: 5634.4). Total num frames: 373036032. Throughput: 0: 5796.5. Samples: 373035650. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:17:04,647][25689] Avg episode reward: [(0, '-47.574')] [2022-07-09 18:17:06,118][26022] Updated weights on worker 0-0, policy_version 364302 (0.00092) [2022-07-09 18:17:07,871][26022] Updated weights on worker 0-0, policy_version 364312 (0.00086) [2022-07-09 18:17:09,678][25689] Fps is (10 sec: 5301.8, 60 sec: 5597.8, 300 sec: 5627.9). Total num frames: 373063680. Throughput: 0: 5787.5. Samples: 373069332. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:17:09,679][25689] Avg episode reward: [(0, '-46.183')] [2022-07-09 18:17:10,022][26022] Updated weights on worker 0-0, policy_version 364322 (0.00437) [2022-07-09 18:17:11,496][26022] Updated weights on worker 0-0, policy_version 364332 (0.00103) [2022-07-09 18:17:13,326][26022] Updated weights on worker 0-0, policy_version 364342 (0.00090) [2022-07-09 18:17:14,736][25689] Fps is (10 sec: 5683.6, 60 sec: 5623.1, 300 sec: 5633.8). Total num frames: 373093376. Throughput: 0: 4965.1. Samples: 373086460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:17:14,737][25689] Avg episode reward: [(0, '-46.288')] [2022-07-09 18:17:15,153][26022] Updated weights on worker 0-0, policy_version 364352 (0.00090) [2022-07-09 18:17:16,720][26022] Updated weights on worker 0-0, policy_version 364362 (0.00083) [2022-07-09 18:17:19,024][26022] Updated weights on worker 0-0, policy_version 364372 (0.00083) [2022-07-09 18:17:19,759][25689] Fps is (10 sec: 5790.1, 60 sec: 5626.0, 300 sec: 5634.5). Total num frames: 373122048. Throughput: 0: 5817.3. Samples: 373120792. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 18:17:19,759][25689] Avg episode reward: [(0, '-46.319')] [2022-07-09 18:17:20,396][26022] Updated weights on worker 0-0, policy_version 364382 (0.00090) [2022-07-09 18:17:20,841][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:17:20,854][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000364383_373128192.pth [2022-07-09 18:17:20,855][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000362397_371094528.pth [2022-07-09 18:17:22,429][26022] Updated weights on worker 0-0, policy_version 364392 (0.00087) [2022-07-09 18:17:24,265][26022] Updated weights on worker 0-0, policy_version 364402 (0.00086) [2022-07-09 18:17:24,790][25689] Fps is (10 sec: 5602.0, 60 sec: 5608.2, 300 sec: 5630.9). Total num frames: 373149696. Throughput: 0: 5941.5. Samples: 373155330. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:24,791][25689] Avg episode reward: [(0, '-45.443')] [2022-07-09 18:17:25,866][26022] Updated weights on worker 0-0, policy_version 364412 (0.00084) [2022-07-09 18:17:27,770][26022] Updated weights on worker 0-0, policy_version 364422 (0.00091) [2022-07-09 18:17:29,274][26022] Updated weights on worker 0-0, policy_version 364432 (0.00088) [2022-07-09 18:17:29,806][25689] Fps is (10 sec: 5707.1, 60 sec: 5646.5, 300 sec: 5631.7). Total num frames: 373179392. Throughput: 0: 5119.3. Samples: 373172376. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:29,807][25689] Avg episode reward: [(0, '-44.930')] [2022-07-09 18:17:31,551][26022] Updated weights on worker 0-0, policy_version 364442 (0.00083) [2022-07-09 18:17:33,148][26022] Updated weights on worker 0-0, policy_version 364452 (0.00091) [2022-07-09 18:17:34,870][25689] Fps is (10 sec: 5790.4, 60 sec: 5633.5, 300 sec: 5634.3). Total num frames: 373208064. Throughput: 0: 5966.1. Samples: 373206580. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:34,870][25689] Avg episode reward: [(0, '-45.776')] [2022-07-09 18:17:35,054][26022] Updated weights on worker 0-0, policy_version 364462 (0.00085) [2022-07-09 18:17:36,570][26022] Updated weights on worker 0-0, policy_version 364472 (0.00094) [2022-07-09 18:17:38,699][26022] Updated weights on worker 0-0, policy_version 364482 (0.00089) [2022-07-09 18:17:39,892][25689] Fps is (10 sec: 5685.9, 60 sec: 5632.0, 300 sec: 5634.7). Total num frames: 373236736. Throughput: 0: 5957.4. Samples: 373240734. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:39,892][25689] Avg episode reward: [(0, '-45.502')] [2022-07-09 18:17:40,426][26022] Updated weights on worker 0-0, policy_version 364492 (0.00091) [2022-07-09 18:17:42,374][26022] Updated weights on worker 0-0, policy_version 364502 (0.00073) [2022-07-09 18:17:43,939][26022] Updated weights on worker 0-0, policy_version 364512 (0.00087) [2022-07-09 18:17:44,914][25689] Fps is (10 sec: 5607.0, 60 sec: 5648.1, 300 sec: 5627.7). Total num frames: 373264384. Throughput: 0: 5095.7. Samples: 373257878. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:44,915][25689] Avg episode reward: [(0, '-45.661')] [2022-07-09 18:17:45,602][26022] Updated weights on worker 0-0, policy_version 364522 (0.00085) [2022-07-09 18:17:47,725][26022] Updated weights on worker 0-0, policy_version 364532 (0.00088) [2022-07-09 18:17:49,544][26022] Updated weights on worker 0-0, policy_version 364542 (0.00091) [2022-07-09 18:17:49,932][25689] Fps is (10 sec: 5609.5, 60 sec: 5631.3, 300 sec: 5632.0). Total num frames: 373293056. Throughput: 0: 5926.6. Samples: 373291652. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:49,932][25689] Avg episode reward: [(0, '-45.574')] [2022-07-09 18:17:51,168][26022] Updated weights on worker 0-0, policy_version 364552 (0.00086) [2022-07-09 18:17:53,263][26022] Updated weights on worker 0-0, policy_version 364562 (0.00085) [2022-07-09 18:17:54,779][26022] Updated weights on worker 0-0, policy_version 364572 (0.00086) [2022-07-09 18:17:55,059][25689] Fps is (10 sec: 5753.3, 60 sec: 5659.6, 300 sec: 5629.6). Total num frames: 373322752. Throughput: 0: 5929.1. Samples: 373326286. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:17:55,060][25689] Avg episode reward: [(0, '-45.455')] [2022-07-09 18:17:56,755][26022] Updated weights on worker 0-0, policy_version 364582 (0.00094) [2022-07-09 18:17:58,332][26022] Updated weights on worker 0-0, policy_version 364592 (0.00095) [2022-07-09 18:18:00,082][25689] Fps is (10 sec: 5750.7, 60 sec: 5641.2, 300 sec: 5636.4). Total num frames: 373351424. Throughput: 0: 5924.4. Samples: 373360346. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:00,082][25689] Avg episode reward: [(0, '-45.724')] [2022-07-09 18:18:00,298][26022] Updated weights on worker 0-0, policy_version 364602 (0.00085) [2022-07-09 18:18:02,202][26022] Updated weights on worker 0-0, policy_version 364612 (0.00094) [2022-07-09 18:18:04,282][26022] Updated weights on worker 0-0, policy_version 364622 (0.00085) [2022-07-09 18:18:05,085][25689] Fps is (10 sec: 5515.4, 60 sec: 5658.9, 300 sec: 5634.8). Total num frames: 373378048. Throughput: 0: 5832.4. Samples: 373375522. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:05,088][25689] Avg episode reward: [(0, '-46.238')] [2022-07-09 18:18:05,850][26022] Updated weights on worker 0-0, policy_version 364632 (0.00099) [2022-07-09 18:18:07,764][26022] Updated weights on worker 0-0, policy_version 364642 (0.00088) [2022-07-09 18:18:09,542][26022] Updated weights on worker 0-0, policy_version 364652 (0.00084) [2022-07-09 18:18:10,113][25689] Fps is (10 sec: 5410.1, 60 sec: 5659.2, 300 sec: 5628.9). Total num frames: 373405696. Throughput: 0: 5855.5. Samples: 373409824. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:10,114][25689] Avg episode reward: [(0, '-45.728')] [2022-07-09 18:18:11,421][26022] Updated weights on worker 0-0, policy_version 364662 (0.00087) [2022-07-09 18:18:13,235][26022] Updated weights on worker 0-0, policy_version 364672 (0.00085) [2022-07-09 18:18:14,785][26022] Updated weights on worker 0-0, policy_version 364682 (0.00081) [2022-07-09 18:18:15,241][25689] Fps is (10 sec: 5646.7, 60 sec: 5652.7, 300 sec: 5630.9). Total num frames: 373435392. Throughput: 0: 5827.4. Samples: 373443890. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:15,242][25689] Avg episode reward: [(0, '-46.357')] [2022-07-09 18:18:16,830][26022] Updated weights on worker 0-0, policy_version 364692 (0.00087) [2022-07-09 18:18:18,585][26022] Updated weights on worker 0-0, policy_version 364702 (0.00085) [2022-07-09 18:18:20,263][25689] Fps is (10 sec: 5548.8, 60 sec: 5618.9, 300 sec: 5621.6). Total num frames: 373462016. Throughput: 0: 4979.7. Samples: 373460844. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:20,265][25689] Avg episode reward: [(0, '-46.574')] [2022-07-09 18:18:20,558][26022] Updated weights on worker 0-0, policy_version 364712 (0.00083) [2022-07-09 18:18:22,115][26022] Updated weights on worker 0-0, policy_version 364722 (0.00086) [2022-07-09 18:18:24,155][26022] Updated weights on worker 0-0, policy_version 364732 (0.00092) [2022-07-09 18:18:25,280][25689] Fps is (10 sec: 5712.1, 60 sec: 5671.0, 300 sec: 5632.1). Total num frames: 373492736. Throughput: 0: 5923.6. Samples: 373495146. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:25,280][25689] Avg episode reward: [(0, '-47.074')] [2022-07-09 18:18:25,569][26022] Updated weights on worker 0-0, policy_version 364742 (0.00083) [2022-07-09 18:18:27,856][26022] Updated weights on worker 0-0, policy_version 364752 (0.00095) [2022-07-09 18:18:29,324][26022] Updated weights on worker 0-0, policy_version 364762 (0.00095) [2022-07-09 18:18:30,294][25689] Fps is (10 sec: 5818.7, 60 sec: 5637.4, 300 sec: 5629.1). Total num frames: 373520384. Throughput: 0: 5903.5. Samples: 373528962. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:30,295][25689] Avg episode reward: [(0, '-47.508')] [2022-07-09 18:18:31,449][26022] Updated weights on worker 0-0, policy_version 364772 (0.00079) [2022-07-09 18:18:32,898][26022] Updated weights on worker 0-0, policy_version 364782 (0.00093) [2022-07-09 18:18:35,123][26022] Updated weights on worker 0-0, policy_version 364792 (0.00090) [2022-07-09 18:18:35,374][25689] Fps is (10 sec: 5579.2, 60 sec: 5635.8, 300 sec: 5631.2). Total num frames: 373549056. Throughput: 0: 5068.5. Samples: 373545938. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:35,375][25689] Avg episode reward: [(0, '-46.332')] [2022-07-09 18:18:36,449][26022] Updated weights on worker 0-0, policy_version 364802 (0.00108) [2022-07-09 18:18:38,620][26022] Updated weights on worker 0-0, policy_version 364812 (0.00087) [2022-07-09 18:18:40,070][26022] Updated weights on worker 0-0, policy_version 364822 (0.00092) [2022-07-09 18:18:40,432][25689] Fps is (10 sec: 5858.7, 60 sec: 5666.3, 300 sec: 5630.7). Total num frames: 373579776. Throughput: 0: 5926.3. Samples: 373580368. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:40,433][25689] Avg episode reward: [(0, '-47.013')] [2022-07-09 18:18:42,166][26022] Updated weights on worker 0-0, policy_version 364832 (0.00090) [2022-07-09 18:18:43,570][26022] Updated weights on worker 0-0, policy_version 364842 (0.00086) [2022-07-09 18:18:45,461][25689] Fps is (10 sec: 5685.0, 60 sec: 5648.8, 300 sec: 5630.2). Total num frames: 373606400. Throughput: 0: 5931.2. Samples: 373614846. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:45,466][25689] Avg episode reward: [(0, '-47.039')] [2022-07-09 18:18:45,729][26022] Updated weights on worker 0-0, policy_version 364852 (0.00085) [2022-07-09 18:18:47,314][26022] Updated weights on worker 0-0, policy_version 364862 (0.00088) [2022-07-09 18:18:49,219][26022] Updated weights on worker 0-0, policy_version 364872 (0.00087) [2022-07-09 18:18:50,524][25689] Fps is (10 sec: 5580.7, 60 sec: 5661.5, 300 sec: 5637.3). Total num frames: 373636096. Throughput: 0: 5096.0. Samples: 373632050. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:50,525][25689] Avg episode reward: [(0, '-46.745')] [2022-07-09 18:18:51,128][26022] Updated weights on worker 0-0, policy_version 364882 (0.00095) [2022-07-09 18:18:52,782][26022] Updated weights on worker 0-0, policy_version 364892 (0.00083) [2022-07-09 18:18:54,707][26022] Updated weights on worker 0-0, policy_version 364902 (0.00092) [2022-07-09 18:18:55,609][25689] Fps is (10 sec: 5751.9, 60 sec: 5648.5, 300 sec: 5632.6). Total num frames: 373664768. Throughput: 0: 5917.4. Samples: 373665674. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:18:55,611][25689] Avg episode reward: [(0, '-46.495')] [2022-07-09 18:18:56,342][26022] Updated weights on worker 0-0, policy_version 364912 (0.00104) [2022-07-09 18:18:58,379][26022] Updated weights on worker 0-0, policy_version 364922 (0.00097) [2022-07-09 18:19:00,228][26022] Updated weights on worker 0-0, policy_version 364932 (0.00084) [2022-07-09 18:19:00,618][25689] Fps is (10 sec: 5579.2, 60 sec: 5632.8, 300 sec: 5639.5). Total num frames: 373692416. Throughput: 0: 5924.6. Samples: 373699966. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:00,620][25689] Avg episode reward: [(0, '-47.153')] [2022-07-09 18:19:02,236][26022] Updated weights on worker 0-0, policy_version 364942 (0.00085) [2022-07-09 18:19:04,125][26022] Updated weights on worker 0-0, policy_version 364952 (0.00085) [2022-07-09 18:19:05,665][25689] Fps is (10 sec: 5498.5, 60 sec: 5645.7, 300 sec: 5632.1). Total num frames: 373720064. Throughput: 0: 4952.4. Samples: 373714906. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:05,667][25689] Avg episode reward: [(0, '-47.527')] [2022-07-09 18:19:05,810][26022] Updated weights on worker 0-0, policy_version 364962 (0.00086) [2022-07-09 18:19:07,679][26022] Updated weights on worker 0-0, policy_version 364972 (0.00085) [2022-07-09 18:19:09,547][26022] Updated weights on worker 0-0, policy_version 364982 (0.00086) [2022-07-09 18:19:10,680][25689] Fps is (10 sec: 5495.6, 60 sec: 5646.9, 300 sec: 5633.5). Total num frames: 373747712. Throughput: 0: 5807.9. Samples: 373749118. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:10,682][25689] Avg episode reward: [(0, '-47.410')] [2022-07-09 18:19:11,089][26022] Updated weights on worker 0-0, policy_version 364992 (0.00091) [2022-07-09 18:19:13,197][26022] Updated weights on worker 0-0, policy_version 365002 (0.00084) [2022-07-09 18:19:14,710][26022] Updated weights on worker 0-0, policy_version 365012 (0.00087) [2022-07-09 18:19:15,781][25689] Fps is (10 sec: 5668.9, 60 sec: 5649.4, 300 sec: 5636.0). Total num frames: 373777408. Throughput: 0: 5830.6. Samples: 373783290. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:15,782][25689] Avg episode reward: [(0, '-46.470')] [2022-07-09 18:19:16,755][26022] Updated weights on worker 0-0, policy_version 365022 (0.00079) [2022-07-09 18:19:18,279][26022] Updated weights on worker 0-0, policy_version 365032 (0.00083) [2022-07-09 18:19:20,388][26022] Updated weights on worker 0-0, policy_version 365042 (0.00121) [2022-07-09 18:19:20,806][25689] Fps is (10 sec: 5561.8, 60 sec: 5649.1, 300 sec: 5628.9). Total num frames: 373804032. Throughput: 0: 4967.6. Samples: 373800252. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:20,808][25689] Avg episode reward: [(0, '-46.644')] [2022-07-09 18:19:20,956][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:19:20,968][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000365045_373806080.pth [2022-07-09 18:19:20,969][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000363062_371775488.pth [2022-07-09 18:19:22,087][26022] Updated weights on worker 0-0, policy_version 365052 (0.00088) [2022-07-09 18:19:23,960][26022] Updated weights on worker 0-0, policy_version 365062 (0.00108) [2022-07-09 18:19:25,615][26022] Updated weights on worker 0-0, policy_version 365072 (0.00087) [2022-07-09 18:19:25,809][25689] Fps is (10 sec: 5616.4, 60 sec: 5633.5, 300 sec: 5632.5). Total num frames: 373833728. Throughput: 0: 5913.7. Samples: 373834028. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:25,811][25689] Avg episode reward: [(0, '-45.450')] [2022-07-09 18:19:27,780][26022] Updated weights on worker 0-0, policy_version 365082 (0.00088) [2022-07-09 18:19:29,337][26022] Updated weights on worker 0-0, policy_version 365092 (0.00086) [2022-07-09 18:19:30,821][25689] Fps is (10 sec: 5726.0, 60 sec: 5633.7, 300 sec: 5630.1). Total num frames: 373861376. Throughput: 0: 5892.6. Samples: 373867800. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:30,823][25689] Avg episode reward: [(0, '-45.654')] [2022-07-09 18:19:31,215][26022] Updated weights on worker 0-0, policy_version 365102 (0.00085) [2022-07-09 18:19:32,954][26022] Updated weights on worker 0-0, policy_version 365112 (0.00082) [2022-07-09 18:19:34,808][26022] Updated weights on worker 0-0, policy_version 365122 (0.00089) [2022-07-09 18:19:35,875][25689] Fps is (10 sec: 5696.6, 60 sec: 5653.1, 300 sec: 5636.1). Total num frames: 373891072. Throughput: 0: 5059.1. Samples: 373884948. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:35,876][25689] Avg episode reward: [(0, '-45.653')] [2022-07-09 18:19:36,687][26022] Updated weights on worker 0-0, policy_version 365132 (0.00083) [2022-07-09 18:19:38,184][26022] Updated weights on worker 0-0, policy_version 365142 (0.00086) [2022-07-09 18:19:40,158][26022] Updated weights on worker 0-0, policy_version 365152 (0.00099) [2022-07-09 18:19:40,903][25689] Fps is (10 sec: 5789.1, 60 sec: 5621.9, 300 sec: 5630.0). Total num frames: 373919744. Throughput: 0: 5932.7. Samples: 373919482. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:40,904][25689] Avg episode reward: [(0, '-45.439')] [2022-07-09 18:19:42,014][26022] Updated weights on worker 0-0, policy_version 365162 (0.00093) [2022-07-09 18:19:43,711][26022] Updated weights on worker 0-0, policy_version 365172 (0.00087) [2022-07-09 18:19:45,476][26022] Updated weights on worker 0-0, policy_version 365182 (0.00089) [2022-07-09 18:19:45,953][25689] Fps is (10 sec: 5791.8, 60 sec: 5670.9, 300 sec: 5643.0). Total num frames: 373949440. Throughput: 0: 5952.9. Samples: 373953942. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:45,953][25689] Avg episode reward: [(0, '-45.609')] [2022-07-09 18:19:47,314][26022] Updated weights on worker 0-0, policy_version 365192 (0.01165) [2022-07-09 18:19:48,938][26022] Updated weights on worker 0-0, policy_version 365202 (0.00093) [2022-07-09 18:19:51,018][25689] Fps is (10 sec: 5669.5, 60 sec: 5636.8, 300 sec: 5636.5). Total num frames: 373977088. Throughput: 0: 5109.7. Samples: 373971000. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:51,020][25689] Avg episode reward: [(0, '-46.326')] [2022-07-09 18:19:51,036][26022] Updated weights on worker 0-0, policy_version 365212 (0.00089) [2022-07-09 18:19:52,696][26022] Updated weights on worker 0-0, policy_version 365222 (0.00076) [2022-07-09 18:19:54,403][26022] Updated weights on worker 0-0, policy_version 365232 (0.00079) [2022-07-09 18:19:56,127][25689] Fps is (10 sec: 5636.1, 60 sec: 5651.4, 300 sec: 5635.6). Total num frames: 374006784. Throughput: 0: 5941.3. Samples: 374005272. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-09 18:19:56,128][25689] Avg episode reward: [(0, '-45.577')] [2022-07-09 18:19:56,145][26022] Updated weights on worker 0-0, policy_version 365242 (0.00090) [2022-07-09 18:19:57,941][26022] Updated weights on worker 0-0, policy_version 365252 (0.00077) [2022-07-09 18:19:59,879][26022] Updated weights on worker 0-0, policy_version 365262 (0.00080) [2022-07-09 18:20:01,173][25689] Fps is (10 sec: 5747.5, 60 sec: 5664.9, 300 sec: 5645.2). Total num frames: 374035456. Throughput: 0: 5931.5. Samples: 374039712. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:01,174][25689] Avg episode reward: [(0, '-45.761')] [2022-07-09 18:20:01,915][26022] Updated weights on worker 0-0, policy_version 365272 (0.00086) [2022-07-09 18:20:03,765][26022] Updated weights on worker 0-0, policy_version 365282 (0.00089) [2022-07-09 18:20:05,671][26022] Updated weights on worker 0-0, policy_version 365292 (0.00085) [2022-07-09 18:20:06,204][25689] Fps is (10 sec: 5487.4, 60 sec: 5649.5, 300 sec: 5641.5). Total num frames: 374062080. Throughput: 0: 4977.9. Samples: 374054752. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:06,206][25689] Avg episode reward: [(0, '-46.720')] [2022-07-09 18:20:07,372][26022] Updated weights on worker 0-0, policy_version 365302 (0.00086) [2022-07-09 18:20:09,271][26022] Updated weights on worker 0-0, policy_version 365312 (0.00084) [2022-07-09 18:20:11,009][26022] Updated weights on worker 0-0, policy_version 365322 (0.00093) [2022-07-09 18:20:11,305][25689] Fps is (10 sec: 5356.8, 60 sec: 5641.5, 300 sec: 5634.0). Total num frames: 374089728. Throughput: 0: 5816.8. Samples: 374089004. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:11,305][25689] Avg episode reward: [(0, '-47.834')] [2022-07-09 18:20:12,816][26022] Updated weights on worker 0-0, policy_version 365332 (0.00085) [2022-07-09 18:20:14,608][26022] Updated weights on worker 0-0, policy_version 365342 (0.00093) [2022-07-09 18:20:16,319][26022] Updated weights on worker 0-0, policy_version 365352 (0.00099) [2022-07-09 18:20:16,403][25689] Fps is (10 sec: 5722.8, 60 sec: 5658.6, 300 sec: 5636.1). Total num frames: 374120448. Throughput: 0: 5805.9. Samples: 374122994. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:16,404][25689] Avg episode reward: [(0, '-46.740')] [2022-07-09 18:20:18,297][26022] Updated weights on worker 0-0, policy_version 365362 (0.00096) [2022-07-09 18:20:19,971][26022] Updated weights on worker 0-0, policy_version 365372 (0.00084) [2022-07-09 18:20:21,472][25689] Fps is (10 sec: 5740.8, 60 sec: 5671.5, 300 sec: 5638.6). Total num frames: 374148096. Throughput: 0: 5771.1. Samples: 374156858. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:21,472][25689] Avg episode reward: [(0, '-47.146')] [2022-07-09 18:20:21,779][26022] Updated weights on worker 0-0, policy_version 365382 (0.00086) [2022-07-09 18:20:23,705][26022] Updated weights on worker 0-0, policy_version 365392 (0.00095) [2022-07-09 18:20:25,551][26022] Updated weights on worker 0-0, policy_version 365402 (0.00088) [2022-07-09 18:20:26,537][25689] Fps is (10 sec: 5456.6, 60 sec: 5631.9, 300 sec: 5631.9). Total num frames: 374175744. Throughput: 0: 5851.9. Samples: 374173740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:26,538][25689] Avg episode reward: [(0, '-47.396')] [2022-07-09 18:20:27,256][26022] Updated weights on worker 0-0, policy_version 365412 (0.00087) [2022-07-09 18:20:29,314][26022] Updated weights on worker 0-0, policy_version 365422 (0.00087) [2022-07-09 18:20:30,948][26022] Updated weights on worker 0-0, policy_version 365432 (0.00086) [2022-07-09 18:20:31,573][25689] Fps is (10 sec: 5677.1, 60 sec: 5663.4, 300 sec: 5632.2). Total num frames: 374205440. Throughput: 0: 5870.5. Samples: 374207988. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:31,573][25689] Avg episode reward: [(0, '-46.799')] [2022-07-09 18:20:32,926][26022] Updated weights on worker 0-0, policy_version 365442 (0.00083) [2022-07-09 18:20:34,603][26022] Updated weights on worker 0-0, policy_version 365452 (0.00085) [2022-07-09 18:20:36,610][26022] Updated weights on worker 0-0, policy_version 365462 (0.00088) [2022-07-09 18:20:36,667][25689] Fps is (10 sec: 5661.0, 60 sec: 5626.0, 300 sec: 5634.3). Total num frames: 374233088. Throughput: 0: 5891.3. Samples: 374242372. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:36,667][25689] Avg episode reward: [(0, '-46.352')] [2022-07-09 18:20:38,094][26022] Updated weights on worker 0-0, policy_version 365472 (0.00086) [2022-07-09 18:20:40,222][26022] Updated weights on worker 0-0, policy_version 365482 (0.00090) [2022-07-09 18:20:41,734][25689] Fps is (10 sec: 5643.5, 60 sec: 5639.3, 300 sec: 5636.7). Total num frames: 374262784. Throughput: 0: 5059.4. Samples: 374259372. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:41,734][25689] Avg episode reward: [(0, '-46.094')] [2022-07-09 18:20:41,753][26022] Updated weights on worker 0-0, policy_version 365492 (0.00080) [2022-07-09 18:20:43,583][26022] Updated weights on worker 0-0, policy_version 365502 (0.00089) [2022-07-09 18:20:45,430][26022] Updated weights on worker 0-0, policy_version 365512 (0.00089) [2022-07-09 18:20:46,808][25689] Fps is (10 sec: 5856.6, 60 sec: 5637.0, 300 sec: 5639.2). Total num frames: 374292480. Throughput: 0: 5903.9. Samples: 374293416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:46,810][25689] Avg episode reward: [(0, '-45.793')] [2022-07-09 18:20:46,984][26022] Updated weights on worker 0-0, policy_version 365522 (0.00088) [2022-07-09 18:20:49,278][26022] Updated weights on worker 0-0, policy_version 365532 (0.00090) [2022-07-09 18:20:50,608][26022] Updated weights on worker 0-0, policy_version 365542 (0.00624) [2022-07-09 18:20:51,827][25689] Fps is (10 sec: 5580.0, 60 sec: 5624.4, 300 sec: 5633.8). Total num frames: 374319104. Throughput: 0: 5894.0. Samples: 374327366. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:51,828][25689] Avg episode reward: [(0, '-46.444')] [2022-07-09 18:20:52,763][26022] Updated weights on worker 0-0, policy_version 365552 (0.00085) [2022-07-09 18:20:54,228][26022] Updated weights on worker 0-0, policy_version 365562 (0.00082) [2022-07-09 18:20:56,259][26022] Updated weights on worker 0-0, policy_version 365572 (0.00091) [2022-07-09 18:20:56,890][25689] Fps is (10 sec: 5687.5, 60 sec: 5645.5, 300 sec: 5636.3). Total num frames: 374349824. Throughput: 0: 5022.1. Samples: 374343936. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:20:56,891][25689] Avg episode reward: [(0, '-46.980')] [2022-07-09 18:20:58,111][26022] Updated weights on worker 0-0, policy_version 365582 (0.00103) [2022-07-09 18:20:59,933][26022] Updated weights on worker 0-0, policy_version 365592 (0.00082) [2022-07-09 18:21:01,720][26022] Updated weights on worker 0-0, policy_version 365602 (0.00095) [2022-07-09 18:21:01,911][25689] Fps is (10 sec: 5788.2, 60 sec: 5631.0, 300 sec: 5646.4). Total num frames: 374377472. Throughput: 0: 5898.4. Samples: 374378384. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:01,911][25689] Avg episode reward: [(0, '-48.021')] [2022-07-09 18:21:03,939][26022] Updated weights on worker 0-0, policy_version 365612 (0.00089) [2022-07-09 18:21:05,481][26022] Updated weights on worker 0-0, policy_version 365622 (0.00087) [2022-07-09 18:21:06,941][25689] Fps is (10 sec: 5297.7, 60 sec: 5614.2, 300 sec: 5635.5). Total num frames: 374403072. Throughput: 0: 5828.3. Samples: 374410762. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:06,942][25689] Avg episode reward: [(0, '-48.337')] [2022-07-09 18:21:07,388][26022] Updated weights on worker 0-0, policy_version 365632 (0.00091) [2022-07-09 18:21:09,080][26022] Updated weights on worker 0-0, policy_version 365642 (0.00088) [2022-07-09 18:21:11,052][26022] Updated weights on worker 0-0, policy_version 365652 (0.00088) [2022-07-09 18:21:11,958][25689] Fps is (10 sec: 5503.7, 60 sec: 5655.8, 300 sec: 5641.5). Total num frames: 374432768. Throughput: 0: 4993.7. Samples: 374427894. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:11,959][25689] Avg episode reward: [(0, '-48.555')] [2022-07-09 18:21:12,855][26022] Updated weights on worker 0-0, policy_version 365662 (0.00229) [2022-07-09 18:21:14,731][26022] Updated weights on worker 0-0, policy_version 365672 (0.00090) [2022-07-09 18:21:16,266][26022] Updated weights on worker 0-0, policy_version 365682 (0.00084) [2022-07-09 18:21:17,060][25689] Fps is (10 sec: 5768.4, 60 sec: 5621.7, 300 sec: 5640.6). Total num frames: 374461440. Throughput: 0: 5843.5. Samples: 374461798. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:17,061][25689] Avg episode reward: [(0, '-49.324')] [2022-07-09 18:21:18,340][26022] Updated weights on worker 0-0, policy_version 365692 (0.00090) [2022-07-09 18:21:20,032][26022] Updated weights on worker 0-0, policy_version 365702 (0.00090) [2022-07-09 18:21:21,137][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:21:21,150][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000365707_374483968.pth [2022-07-09 18:21:21,151][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000363723_372452352.pth [2022-07-09 18:21:21,841][26022] Updated weights on worker 0-0, policy_version 365712 (0.00089) [2022-07-09 18:21:22,152][25689] Fps is (10 sec: 5625.4, 60 sec: 5636.4, 300 sec: 5639.2). Total num frames: 374490112. Throughput: 0: 5798.7. Samples: 374495754. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:22,152][25689] Avg episode reward: [(0, '-47.949')] [2022-07-09 18:21:23,635][26022] Updated weights on worker 0-0, policy_version 365722 (0.00076) [2022-07-09 18:21:25,527][26022] Updated weights on worker 0-0, policy_version 365732 (0.00086) [2022-07-09 18:21:27,194][25689] Fps is (10 sec: 5557.5, 60 sec: 5638.5, 300 sec: 5639.6). Total num frames: 374517760. Throughput: 0: 5036.3. Samples: 374512760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:27,195][25689] Avg episode reward: [(0, '-48.533')] [2022-07-09 18:21:27,365][26022] Updated weights on worker 0-0, policy_version 365742 (0.00092) [2022-07-09 18:21:29,203][26022] Updated weights on worker 0-0, policy_version 365752 (0.00089) [2022-07-09 18:21:30,843][26022] Updated weights on worker 0-0, policy_version 365762 (0.00088) [2022-07-09 18:21:32,198][25689] Fps is (10 sec: 5606.4, 60 sec: 5624.6, 300 sec: 5638.1). Total num frames: 374546432. Throughput: 0: 5872.8. Samples: 374546756. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:32,198][25689] Avg episode reward: [(0, '-48.490')] [2022-07-09 18:21:32,730][26022] Updated weights on worker 0-0, policy_version 365772 (0.00088) [2022-07-09 18:21:34,495][26022] Updated weights on worker 0-0, policy_version 365782 (0.00087) [2022-07-09 18:21:36,243][26022] Updated weights on worker 0-0, policy_version 365792 (0.00092) [2022-07-09 18:21:37,311][25689] Fps is (10 sec: 5769.6, 60 sec: 5656.6, 300 sec: 5639.5). Total num frames: 374576128. Throughput: 0: 5885.0. Samples: 374580972. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:37,312][25689] Avg episode reward: [(0, '-49.144')] [2022-07-09 18:21:38,019][26022] Updated weights on worker 0-0, policy_version 365802 (0.00091) [2022-07-09 18:21:39,925][26022] Updated weights on worker 0-0, policy_version 365812 (0.00094) [2022-07-09 18:21:41,689][26022] Updated weights on worker 0-0, policy_version 365822 (0.00091) [2022-07-09 18:21:42,332][25689] Fps is (10 sec: 5759.4, 60 sec: 5644.0, 300 sec: 5646.3). Total num frames: 374604800. Throughput: 0: 5066.0. Samples: 374597986. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:42,333][25689] Avg episode reward: [(0, '-48.778')] [2022-07-09 18:21:43,556][26022] Updated weights on worker 0-0, policy_version 365832 (0.00088) [2022-07-09 18:21:45,151][26022] Updated weights on worker 0-0, policy_version 365842 (0.00100) [2022-07-09 18:21:47,253][26022] Updated weights on worker 0-0, policy_version 365852 (0.00086) [2022-07-09 18:21:47,353][25689] Fps is (10 sec: 5608.1, 60 sec: 5615.1, 300 sec: 5639.4). Total num frames: 374632448. Throughput: 0: 5924.8. Samples: 374632198. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:47,355][25689] Avg episode reward: [(0, '-48.555')] [2022-07-09 18:21:48,780][26022] Updated weights on worker 0-0, policy_version 365862 (0.00087) [2022-07-09 18:21:50,764][26022] Updated weights on worker 0-0, policy_version 365872 (0.00084) [2022-07-09 18:21:52,302][26022] Updated weights on worker 0-0, policy_version 365882 (0.00090) [2022-07-09 18:21:52,396][25689] Fps is (10 sec: 5799.4, 60 sec: 5680.5, 300 sec: 5650.1). Total num frames: 374663168. Throughput: 0: 5934.0. Samples: 374666616. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:52,398][25689] Avg episode reward: [(0, '-48.511')] [2022-07-09 18:21:54,340][26022] Updated weights on worker 0-0, policy_version 365892 (0.00092) [2022-07-09 18:21:55,855][26022] Updated weights on worker 0-0, policy_version 365902 (0.00268) [2022-07-09 18:21:57,469][25689] Fps is (10 sec: 5668.6, 60 sec: 5612.0, 300 sec: 5638.6). Total num frames: 374689792. Throughput: 0: 5098.2. Samples: 374683746. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:21:57,470][25689] Avg episode reward: [(0, '-48.169')] [2022-07-09 18:21:58,037][26022] Updated weights on worker 0-0, policy_version 365912 (0.00079) [2022-07-09 18:21:59,579][26022] Updated weights on worker 0-0, policy_version 365922 (0.00086) [2022-07-09 18:22:01,751][26022] Updated weights on worker 0-0, policy_version 365932 (0.00098) [2022-07-09 18:22:02,483][25689] Fps is (10 sec: 5380.4, 60 sec: 5612.6, 300 sec: 5645.4). Total num frames: 374717440. Throughput: 0: 5936.9. Samples: 374717622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:22:02,484][25689] Avg episode reward: [(0, '-47.838')] [2022-07-09 18:22:03,569][26022] Updated weights on worker 0-0, policy_version 365942 (0.00087) [2022-07-09 18:22:05,511][26022] Updated weights on worker 0-0, policy_version 365952 (0.00083) [2022-07-09 18:22:07,332][26022] Updated weights on worker 0-0, policy_version 365962 (0.00097) [2022-07-09 18:22:07,520][25689] Fps is (10 sec: 5501.6, 60 sec: 5645.8, 300 sec: 5645.3). Total num frames: 374745088. Throughput: 0: 5830.0. Samples: 374749772. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:22:07,522][25689] Avg episode reward: [(0, '-47.878')] [2022-07-09 18:22:09,214][26022] Updated weights on worker 0-0, policy_version 365972 (0.00088) [2022-07-09 18:22:10,972][26022] Updated weights on worker 0-0, policy_version 365982 (0.00092) [2022-07-09 18:22:12,526][25689] Fps is (10 sec: 5607.8, 60 sec: 5629.9, 300 sec: 5642.8). Total num frames: 374773760. Throughput: 0: 5821.5. Samples: 374783804. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:22:12,528][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 18:22:12,926][26022] Updated weights on worker 0-0, policy_version 365992 (0.00086) [2022-07-09 18:22:14,422][26022] Updated weights on worker 0-0, policy_version 366002 (0.00099) [2022-07-09 18:22:16,556][26022] Updated weights on worker 0-0, policy_version 366012 (0.00095) [2022-07-09 18:22:17,586][25689] Fps is (10 sec: 5799.0, 60 sec: 5650.8, 300 sec: 5645.6). Total num frames: 374803456. Throughput: 0: 5817.8. Samples: 374800778. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:22:17,586][25689] Avg episode reward: [(0, '-47.619')] [2022-07-09 18:22:17,868][26022] Updated weights on worker 0-0, policy_version 366022 (0.00089) [2022-07-09 18:22:20,243][26022] Updated weights on worker 0-0, policy_version 366032 (0.00062) [2022-07-09 18:22:21,575][26022] Updated weights on worker 0-0, policy_version 366042 (0.00094) [2022-07-09 18:22:22,603][25689] Fps is (10 sec: 5691.0, 60 sec: 5640.8, 300 sec: 5645.9). Total num frames: 374831104. Throughput: 0: 5837.0. Samples: 374835058. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:22:22,603][25689] Avg episode reward: [(0, '-47.097')] [2022-07-09 18:22:23,656][26022] Updated weights on worker 0-0, policy_version 366052 (0.00094) [2022-07-09 18:22:25,219][26022] Updated weights on worker 0-0, policy_version 366062 (0.00143) [2022-07-09 18:22:27,402][26022] Updated weights on worker 0-0, policy_version 366072 (0.00092) [2022-07-09 18:22:27,631][25689] Fps is (10 sec: 5606.7, 60 sec: 5659.1, 300 sec: 5642.2). Total num frames: 374859776. Throughput: 0: 5923.6. Samples: 374868898. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 18:22:27,631][25689] Avg episode reward: [(0, '-46.983')] [2022-07-09 18:22:29,019][26022] Updated weights on worker 0-0, policy_version 366082 (0.00072) [2022-07-09 18:22:31,090][26022] Updated weights on worker 0-0, policy_version 366092 (0.00109) [2022-07-09 18:22:32,583][26022] Updated weights on worker 0-0, policy_version 366102 (0.00083) [2022-07-09 18:22:32,638][25689] Fps is (10 sec: 5714.2, 60 sec: 5658.7, 300 sec: 5643.3). Total num frames: 374888448. Throughput: 0: 5079.0. Samples: 374885950. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:22:32,639][25689] Avg episode reward: [(0, '-46.949')] [2022-07-09 18:22:34,512][26022] Updated weights on worker 0-0, policy_version 366112 (0.00090) [2022-07-09 18:22:36,129][26022] Updated weights on worker 0-0, policy_version 366122 (0.00089) [2022-07-09 18:22:37,747][25689] Fps is (10 sec: 5567.2, 60 sec: 5625.2, 300 sec: 5638.2). Total num frames: 374916096. Throughput: 0: 5920.9. Samples: 374920152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:22:37,748][25689] Avg episode reward: [(0, '-46.993')] [2022-07-09 18:22:38,147][26022] Updated weights on worker 0-0, policy_version 366132 (0.00097) [2022-07-09 18:22:39,832][26022] Updated weights on worker 0-0, policy_version 366142 (0.00085) [2022-07-09 18:22:41,572][26022] Updated weights on worker 0-0, policy_version 366152 (0.00090) [2022-07-09 18:22:42,824][25689] Fps is (10 sec: 5730.3, 60 sec: 5653.9, 300 sec: 5647.5). Total num frames: 374946816. Throughput: 0: 5907.7. Samples: 374954518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:22:42,824][25689] Avg episode reward: [(0, '-47.187')] [2022-07-09 18:22:43,460][26022] Updated weights on worker 0-0, policy_version 366162 (0.00089) [2022-07-09 18:22:45,201][26022] Updated weights on worker 0-0, policy_version 366172 (0.00082) [2022-07-09 18:22:46,973][26022] Updated weights on worker 0-0, policy_version 366182 (0.00082) [2022-07-09 18:22:47,857][25689] Fps is (10 sec: 5874.7, 60 sec: 5669.7, 300 sec: 5647.2). Total num frames: 374975488. Throughput: 0: 5074.0. Samples: 374971526. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:22:47,857][25689] Avg episode reward: [(0, '-47.334')] [2022-07-09 18:22:48,851][26022] Updated weights on worker 0-0, policy_version 366192 (0.00085) [2022-07-09 18:22:50,406][26022] Updated weights on worker 0-0, policy_version 366202 (0.00104) [2022-07-09 18:22:52,389][26022] Updated weights on worker 0-0, policy_version 366212 (0.00093) [2022-07-09 18:22:52,866][25689] Fps is (10 sec: 5506.6, 60 sec: 5605.2, 300 sec: 5639.1). Total num frames: 375002112. Throughput: 0: 5940.6. Samples: 375006114. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:22:52,866][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 18:22:53,910][26022] Updated weights on worker 0-0, policy_version 366222 (0.00088) [2022-07-09 18:22:56,118][26022] Updated weights on worker 0-0, policy_version 366232 (0.00087) [2022-07-09 18:22:57,533][26022] Updated weights on worker 0-0, policy_version 366242 (0.00082) [2022-07-09 18:22:57,916][25689] Fps is (10 sec: 5598.7, 60 sec: 5658.1, 300 sec: 5642.0). Total num frames: 375031808. Throughput: 0: 5963.4. Samples: 375040430. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:22:57,917][25689] Avg episode reward: [(0, '-47.941')] [2022-07-09 18:22:59,572][26022] Updated weights on worker 0-0, policy_version 366252 (0.00084) [2022-07-09 18:23:01,355][26022] Updated weights on worker 0-0, policy_version 366262 (0.00086) [2022-07-09 18:23:02,919][25689] Fps is (10 sec: 5602.3, 60 sec: 5642.3, 300 sec: 5642.0). Total num frames: 375058432. Throughput: 0: 5136.4. Samples: 375057734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:02,920][25689] Avg episode reward: [(0, '-48.558')] [2022-07-09 18:23:03,502][26022] Updated weights on worker 0-0, policy_version 366272 (0.00084) [2022-07-09 18:23:05,275][26022] Updated weights on worker 0-0, policy_version 366282 (0.00476) [2022-07-09 18:23:07,070][26022] Updated weights on worker 0-0, policy_version 366292 (0.00089) [2022-07-09 18:23:07,922][25689] Fps is (10 sec: 5526.6, 60 sec: 5662.4, 300 sec: 5645.9). Total num frames: 375087104. Throughput: 0: 5894.1. Samples: 375089790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:07,923][25689] Avg episode reward: [(0, '-48.394')] [2022-07-09 18:23:08,731][26022] Updated weights on worker 0-0, policy_version 366302 (0.00084) [2022-07-09 18:23:10,715][26022] Updated weights on worker 0-0, policy_version 366312 (0.00085) [2022-07-09 18:23:12,527][26022] Updated weights on worker 0-0, policy_version 366322 (0.00085) [2022-07-09 18:23:12,962][25689] Fps is (10 sec: 5709.8, 60 sec: 5659.2, 300 sec: 5644.2). Total num frames: 375115776. Throughput: 0: 5860.6. Samples: 375123890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:12,964][25689] Avg episode reward: [(0, '-48.123')] [2022-07-09 18:23:14,308][26022] Updated weights on worker 0-0, policy_version 366332 (0.00089) [2022-07-09 18:23:16,088][26022] Updated weights on worker 0-0, policy_version 366342 (0.00092) [2022-07-09 18:23:18,018][25689] Fps is (10 sec: 5679.5, 60 sec: 5642.5, 300 sec: 5650.4). Total num frames: 375144448. Throughput: 0: 4989.5. Samples: 375140728. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:18,019][25689] Avg episode reward: [(0, '-48.155')] [2022-07-09 18:23:18,024][26022] Updated weights on worker 0-0, policy_version 366352 (0.00063) [2022-07-09 18:23:19,794][26022] Updated weights on worker 0-0, policy_version 366362 (0.00088) [2022-07-09 18:23:21,167][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:23:21,183][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000366370_375162880.pth [2022-07-09 18:23:21,184][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000364383_373128192.pth [2022-07-09 18:23:21,620][26022] Updated weights on worker 0-0, policy_version 366372 (0.00086) [2022-07-09 18:23:23,103][25689] Fps is (10 sec: 5654.4, 60 sec: 5653.1, 300 sec: 5642.2). Total num frames: 375173120. Throughput: 0: 5796.1. Samples: 375174726. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:23,104][25689] Avg episode reward: [(0, '-48.156')] [2022-07-09 18:23:23,295][26022] Updated weights on worker 0-0, policy_version 366382 (0.00084) [2022-07-09 18:23:25,138][26022] Updated weights on worker 0-0, policy_version 366392 (0.00081) [2022-07-09 18:23:27,009][26022] Updated weights on worker 0-0, policy_version 366402 (0.00081) [2022-07-09 18:23:28,111][25689] Fps is (10 sec: 5783.0, 60 sec: 5671.9, 300 sec: 5649.2). Total num frames: 375202816. Throughput: 0: 5885.8. Samples: 375208622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:28,112][25689] Avg episode reward: [(0, '-48.278')] [2022-07-09 18:23:28,871][26022] Updated weights on worker 0-0, policy_version 366412 (0.00095) [2022-07-09 18:23:30,700][26022] Updated weights on worker 0-0, policy_version 366422 (0.00096) [2022-07-09 18:23:32,592][26022] Updated weights on worker 0-0, policy_version 366432 (0.00093) [2022-07-09 18:23:33,114][25689] Fps is (10 sec: 5625.9, 60 sec: 5638.4, 300 sec: 5643.8). Total num frames: 375229440. Throughput: 0: 5047.1. Samples: 375225602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:33,115][25689] Avg episode reward: [(0, '-47.427')] [2022-07-09 18:23:34,075][26022] Updated weights on worker 0-0, policy_version 366442 (0.00083) [2022-07-09 18:23:36,222][26022] Updated weights on worker 0-0, policy_version 366452 (0.00082) [2022-07-09 18:23:37,803][26022] Updated weights on worker 0-0, policy_version 366462 (0.00089) [2022-07-09 18:23:38,208][25689] Fps is (10 sec: 5578.3, 60 sec: 5673.8, 300 sec: 5639.7). Total num frames: 375259136. Throughput: 0: 5880.7. Samples: 375259454. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:38,209][25689] Avg episode reward: [(0, '-47.952')] [2022-07-09 18:23:39,772][26022] Updated weights on worker 0-0, policy_version 366472 (0.00090) [2022-07-09 18:23:41,540][26022] Updated weights on worker 0-0, policy_version 366482 (0.00085) [2022-07-09 18:23:43,225][25689] Fps is (10 sec: 5671.4, 60 sec: 5628.5, 300 sec: 5643.3). Total num frames: 375286784. Throughput: 0: 5899.0. Samples: 375293426. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:43,227][25689] Avg episode reward: [(0, '-48.776')] [2022-07-09 18:23:43,414][26022] Updated weights on worker 0-0, policy_version 366492 (0.00094) [2022-07-09 18:23:45,167][26022] Updated weights on worker 0-0, policy_version 366502 (0.00088) [2022-07-09 18:23:46,982][26022] Updated weights on worker 0-0, policy_version 366512 (0.00090) [2022-07-09 18:23:48,259][25689] Fps is (10 sec: 5501.1, 60 sec: 5611.4, 300 sec: 5637.0). Total num frames: 375314432. Throughput: 0: 5058.1. Samples: 375310532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:48,260][25689] Avg episode reward: [(0, '-49.107')] [2022-07-09 18:23:48,874][26022] Updated weights on worker 0-0, policy_version 366522 (0.00087) [2022-07-09 18:23:50,691][26022] Updated weights on worker 0-0, policy_version 366532 (0.01339) [2022-07-09 18:23:52,479][26022] Updated weights on worker 0-0, policy_version 366542 (0.00082) [2022-07-09 18:23:53,264][25689] Fps is (10 sec: 5712.3, 60 sec: 5662.7, 300 sec: 5642.0). Total num frames: 375344128. Throughput: 0: 5887.0. Samples: 375344222. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:53,264][25689] Avg episode reward: [(0, '-48.794')] [2022-07-09 18:23:54,319][26022] Updated weights on worker 0-0, policy_version 366552 (0.00096) [2022-07-09 18:23:56,057][26022] Updated weights on worker 0-0, policy_version 366562 (0.00092) [2022-07-09 18:23:57,841][26022] Updated weights on worker 0-0, policy_version 366572 (0.00052) [2022-07-09 18:23:58,318][25689] Fps is (10 sec: 5700.7, 60 sec: 5628.4, 300 sec: 5641.1). Total num frames: 375371776. Throughput: 0: 5904.5. Samples: 375378198. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:23:58,319][25689] Avg episode reward: [(0, '-48.567')] [2022-07-09 18:23:59,726][26022] Updated weights on worker 0-0, policy_version 366582 (0.00084) [2022-07-09 18:24:01,780][26022] Updated weights on worker 0-0, policy_version 366592 (0.00096) [2022-07-09 18:24:03,338][25689] Fps is (10 sec: 5285.5, 60 sec: 5609.8, 300 sec: 5634.8). Total num frames: 375397376. Throughput: 0: 5058.3. Samples: 375395164. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:03,340][25689] Avg episode reward: [(0, '-48.619')] [2022-07-09 18:24:03,670][26022] Updated weights on worker 0-0, policy_version 366602 (0.00092) [2022-07-09 18:24:05,524][26022] Updated weights on worker 0-0, policy_version 366612 (0.00091) [2022-07-09 18:24:07,247][26022] Updated weights on worker 0-0, policy_version 366622 (0.00056) [2022-07-09 18:24:08,373][25689] Fps is (10 sec: 5397.9, 60 sec: 5606.9, 300 sec: 5637.8). Total num frames: 375426048. Throughput: 0: 5802.4. Samples: 375427238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:08,373][25689] Avg episode reward: [(0, '-47.607')] [2022-07-09 18:24:09,216][26022] Updated weights on worker 0-0, policy_version 366632 (0.00089) [2022-07-09 18:24:10,768][26022] Updated weights on worker 0-0, policy_version 366642 (0.00091) [2022-07-09 18:24:12,538][26022] Updated weights on worker 0-0, policy_version 366652 (0.00087) [2022-07-09 18:24:13,390][25689] Fps is (10 sec: 5705.0, 60 sec: 5609.1, 300 sec: 5636.0). Total num frames: 375454720. Throughput: 0: 5834.9. Samples: 375461654. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:13,390][25689] Avg episode reward: [(0, '-46.656')] [2022-07-09 18:24:14,467][26022] Updated weights on worker 0-0, policy_version 366662 (0.00087) [2022-07-09 18:24:16,215][26022] Updated weights on worker 0-0, policy_version 366672 (0.00081) [2022-07-09 18:24:17,978][26022] Updated weights on worker 0-0, policy_version 366682 (0.00092) [2022-07-09 18:24:18,471][25689] Fps is (10 sec: 5780.2, 60 sec: 5623.7, 300 sec: 5645.2). Total num frames: 375484416. Throughput: 0: 4983.8. Samples: 375478632. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:18,471][25689] Avg episode reward: [(0, '-47.234')] [2022-07-09 18:24:19,743][26022] Updated weights on worker 0-0, policy_version 366692 (0.00086) [2022-07-09 18:24:21,585][26022] Updated weights on worker 0-0, policy_version 366702 (0.00092) [2022-07-09 18:24:23,473][25689] Fps is (10 sec: 5687.2, 60 sec: 5614.5, 300 sec: 5638.4). Total num frames: 375512064. Throughput: 0: 5849.3. Samples: 375512936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:23,474][25689] Avg episode reward: [(0, '-46.949')] [2022-07-09 18:24:23,551][26022] Updated weights on worker 0-0, policy_version 366712 (0.00083) [2022-07-09 18:24:25,164][26022] Updated weights on worker 0-0, policy_version 366722 (0.00089) [2022-07-09 18:24:27,044][26022] Updated weights on worker 0-0, policy_version 366732 (0.00092) [2022-07-09 18:24:28,479][25689] Fps is (10 sec: 5729.8, 60 sec: 5614.7, 300 sec: 5645.4). Total num frames: 375541760. Throughput: 0: 5970.9. Samples: 375547288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:28,486][25689] Avg episode reward: [(0, '-47.028')] [2022-07-09 18:24:28,582][26022] Updated weights on worker 0-0, policy_version 366742 (0.00083) [2022-07-09 18:24:30,707][26022] Updated weights on worker 0-0, policy_version 366752 (0.00094) [2022-07-09 18:24:32,254][26022] Updated weights on worker 0-0, policy_version 366762 (0.00079) [2022-07-09 18:24:33,492][25689] Fps is (10 sec: 5723.2, 60 sec: 5630.7, 300 sec: 5639.2). Total num frames: 375569408. Throughput: 0: 5110.3. Samples: 375564384. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:33,493][25689] Avg episode reward: [(0, '-46.915')] [2022-07-09 18:24:34,271][26022] Updated weights on worker 0-0, policy_version 366772 (0.00087) [2022-07-09 18:24:36,044][26022] Updated weights on worker 0-0, policy_version 366782 (0.00093) [2022-07-09 18:24:37,781][26022] Updated weights on worker 0-0, policy_version 366792 (0.00088) [2022-07-09 18:24:38,555][25689] Fps is (10 sec: 5792.6, 60 sec: 5650.5, 300 sec: 5645.5). Total num frames: 375600128. Throughput: 0: 5972.1. Samples: 375598578. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:38,555][25689] Avg episode reward: [(0, '-46.855')] [2022-07-09 18:24:39,792][26022] Updated weights on worker 0-0, policy_version 366802 (0.00088) [2022-07-09 18:24:41,503][26022] Updated weights on worker 0-0, policy_version 366812 (0.00088) [2022-07-09 18:24:43,281][26022] Updated weights on worker 0-0, policy_version 366822 (0.00091) [2022-07-09 18:24:43,594][25689] Fps is (10 sec: 5676.6, 60 sec: 5631.5, 300 sec: 5635.3). Total num frames: 375626752. Throughput: 0: 5968.6. Samples: 375633032. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:43,595][25689] Avg episode reward: [(0, '-46.734')] [2022-07-09 18:24:45,071][26022] Updated weights on worker 0-0, policy_version 366832 (0.00085) [2022-07-09 18:24:46,786][26022] Updated weights on worker 0-0, policy_version 366842 (0.00096) [2022-07-09 18:24:48,667][25689] Fps is (10 sec: 5468.4, 60 sec: 5644.8, 300 sec: 5638.6). Total num frames: 375655424. Throughput: 0: 5090.5. Samples: 375650056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:48,668][25689] Avg episode reward: [(0, '-47.094')] [2022-07-09 18:24:48,699][26022] Updated weights on worker 0-0, policy_version 366852 (0.00097) [2022-07-09 18:24:50,347][26022] Updated weights on worker 0-0, policy_version 366862 (0.00092) [2022-07-09 18:24:52,343][26022] Updated weights on worker 0-0, policy_version 366872 (0.00090) [2022-07-09 18:24:53,713][25689] Fps is (10 sec: 5768.1, 60 sec: 5640.9, 300 sec: 5639.8). Total num frames: 375685120. Throughput: 0: 5925.7. Samples: 375684208. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:53,714][25689] Avg episode reward: [(0, '-46.628')] [2022-07-09 18:24:54,122][26022] Updated weights on worker 0-0, policy_version 366882 (0.00082) [2022-07-09 18:24:56,001][26022] Updated weights on worker 0-0, policy_version 366892 (0.00093) [2022-07-09 18:24:57,920][26022] Updated weights on worker 0-0, policy_version 366902 (0.00078) [2022-07-09 18:24:58,766][25689] Fps is (10 sec: 5779.8, 60 sec: 5658.1, 300 sec: 5639.7). Total num frames: 375713792. Throughput: 0: 5912.0. Samples: 375718064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:24:58,766][25689] Avg episode reward: [(0, '-46.541')] [2022-07-09 18:24:59,357][26022] Updated weights on worker 0-0, policy_version 366912 (0.00087) [2022-07-09 18:25:01,764][26022] Updated weights on worker 0-0, policy_version 366922 (0.00087) [2022-07-09 18:25:03,554][26022] Updated weights on worker 0-0, policy_version 366932 (0.00084) [2022-07-09 18:25:03,773][25689] Fps is (10 sec: 5293.2, 60 sec: 5642.3, 300 sec: 5633.3). Total num frames: 375738368. Throughput: 0: 5055.6. Samples: 375735048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-09 18:25:03,773][25689] Avg episode reward: [(0, '-46.594')] [2022-07-09 18:25:05,208][26022] Updated weights on worker 0-0, policy_version 366942 (0.00085) [2022-07-09 18:25:07,298][26022] Updated weights on worker 0-0, policy_version 366952 (0.00093) [2022-07-09 18:25:08,633][26022] Updated weights on worker 0-0, policy_version 366962 (0.00092) [2022-07-09 18:25:08,790][25689] Fps is (10 sec: 5720.4, 60 sec: 5711.7, 300 sec: 5652.1). Total num frames: 375771136. Throughput: 0: 5831.9. Samples: 375767412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:08,791][25689] Avg episode reward: [(0, '-47.191')] [2022-07-09 18:25:10,843][26022] Updated weights on worker 0-0, policy_version 366972 (0.00087) [2022-07-09 18:25:12,222][26022] Updated weights on worker 0-0, policy_version 366982 (0.00085) [2022-07-09 18:25:13,858][25689] Fps is (10 sec: 5787.7, 60 sec: 5656.1, 300 sec: 5635.4). Total num frames: 375796736. Throughput: 0: 5833.9. Samples: 375801730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:13,858][25689] Avg episode reward: [(0, '-47.958')] [2022-07-09 18:25:14,318][26022] Updated weights on worker 0-0, policy_version 366992 (0.00095) [2022-07-09 18:25:16,097][26022] Updated weights on worker 0-0, policy_version 367002 (0.00086) [2022-07-09 18:25:17,807][26022] Updated weights on worker 0-0, policy_version 367012 (0.00085) [2022-07-09 18:25:18,915][25689] Fps is (10 sec: 5360.3, 60 sec: 5641.4, 300 sec: 5639.1). Total num frames: 375825408. Throughput: 0: 5002.7. Samples: 375818864. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:18,916][25689] Avg episode reward: [(0, '-46.980')] [2022-07-09 18:25:19,647][26022] Updated weights on worker 0-0, policy_version 367022 (0.00085) [2022-07-09 18:25:21,471][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:25:21,481][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000367032_375840768.pth [2022-07-09 18:25:21,481][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000365045_373806080.pth [2022-07-09 18:25:21,491][26022] Updated weights on worker 0-0, policy_version 367032 (0.00087) [2022-07-09 18:25:23,104][26022] Updated weights on worker 0-0, policy_version 367042 (0.00091) [2022-07-09 18:25:23,942][25689] Fps is (10 sec: 5686.3, 60 sec: 5656.0, 300 sec: 5643.3). Total num frames: 375854080. Throughput: 0: 5837.9. Samples: 375852796. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:23,943][25689] Avg episode reward: [(0, '-47.786')] [2022-07-09 18:25:25,216][26022] Updated weights on worker 0-0, policy_version 367052 (0.00090) [2022-07-09 18:25:26,870][26022] Updated weights on worker 0-0, policy_version 367062 (0.00085) [2022-07-09 18:25:28,803][26022] Updated weights on worker 0-0, policy_version 367072 (0.00088) [2022-07-09 18:25:29,020][25689] Fps is (10 sec: 5775.8, 60 sec: 5649.3, 300 sec: 5642.5). Total num frames: 375883776. Throughput: 0: 5910.6. Samples: 375886986. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:29,021][25689] Avg episode reward: [(0, '-47.927')] [2022-07-09 18:25:30,549][26022] Updated weights on worker 0-0, policy_version 367082 (0.00085) [2022-07-09 18:25:32,315][26022] Updated weights on worker 0-0, policy_version 367092 (0.00085) [2022-07-09 18:25:34,061][25689] Fps is (10 sec: 5768.3, 60 sec: 5663.6, 300 sec: 5646.9). Total num frames: 375912448. Throughput: 0: 5914.9. Samples: 375921230. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:34,062][25689] Avg episode reward: [(0, '-47.191')] [2022-07-09 18:25:34,062][26022] Updated weights on worker 0-0, policy_version 367102 (0.00077) [2022-07-09 18:25:35,861][26022] Updated weights on worker 0-0, policy_version 367112 (0.00095) [2022-07-09 18:25:37,610][26022] Updated weights on worker 0-0, policy_version 367122 (0.00085) [2022-07-09 18:25:39,179][25689] Fps is (10 sec: 5544.1, 60 sec: 5607.8, 300 sec: 5639.1). Total num frames: 375940096. Throughput: 0: 5892.6. Samples: 375938274. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:39,180][25689] Avg episode reward: [(0, '-46.539')] [2022-07-09 18:25:39,471][26022] Updated weights on worker 0-0, policy_version 367132 (0.00087) [2022-07-09 18:25:41,304][26022] Updated weights on worker 0-0, policy_version 367142 (0.00088) [2022-07-09 18:25:43,052][26022] Updated weights on worker 0-0, policy_version 367152 (0.00102) [2022-07-09 18:25:44,218][25689] Fps is (10 sec: 5645.7, 60 sec: 5658.5, 300 sec: 5639.7). Total num frames: 375969792. Throughput: 0: 5905.5. Samples: 375972534. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:44,219][25689] Avg episode reward: [(0, '-46.808')] [2022-07-09 18:25:44,817][26022] Updated weights on worker 0-0, policy_version 367162 (0.00090) [2022-07-09 18:25:46,652][26022] Updated weights on worker 0-0, policy_version 367172 (0.00083) [2022-07-09 18:25:48,396][26022] Updated weights on worker 0-0, policy_version 367182 (0.00085) [2022-07-09 18:25:49,227][25689] Fps is (10 sec: 5707.1, 60 sec: 5647.6, 300 sec: 5643.4). Total num frames: 375997440. Throughput: 0: 5928.0. Samples: 376006770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:49,228][25689] Avg episode reward: [(0, '-46.516')] [2022-07-09 18:25:50,276][26022] Updated weights on worker 0-0, policy_version 367192 (0.00085) [2022-07-09 18:25:51,985][26022] Updated weights on worker 0-0, policy_version 367202 (0.00088) [2022-07-09 18:25:53,948][26022] Updated weights on worker 0-0, policy_version 367212 (0.00088) [2022-07-09 18:25:54,235][25689] Fps is (10 sec: 5725.0, 60 sec: 5651.2, 300 sec: 5641.0). Total num frames: 376027136. Throughput: 0: 5087.0. Samples: 376023852. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:54,235][25689] Avg episode reward: [(0, '-46.959')] [2022-07-09 18:25:55,419][26022] Updated weights on worker 0-0, policy_version 367222 (0.00087) [2022-07-09 18:25:57,412][26022] Updated weights on worker 0-0, policy_version 367232 (0.00614) [2022-07-09 18:25:59,207][26022] Updated weights on worker 0-0, policy_version 367242 (0.00084) [2022-07-09 18:25:59,354][25689] Fps is (10 sec: 5763.5, 60 sec: 5644.9, 300 sec: 5642.5). Total num frames: 376055808. Throughput: 0: 5947.9. Samples: 376058272. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:25:59,355][25689] Avg episode reward: [(0, '-47.415')] [2022-07-09 18:26:00,910][26022] Updated weights on worker 0-0, policy_version 367252 (0.00086) [2022-07-09 18:26:03,309][26022] Updated weights on worker 0-0, policy_version 367262 (0.00083) [2022-07-09 18:26:04,374][25689] Fps is (10 sec: 5453.7, 60 sec: 5677.5, 300 sec: 5646.2). Total num frames: 376082432. Throughput: 0: 5833.7. Samples: 376090116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:04,374][25689] Avg episode reward: [(0, '-47.897')] [2022-07-09 18:26:04,912][26022] Updated weights on worker 0-0, policy_version 367272 (0.00081) [2022-07-09 18:26:06,756][26022] Updated weights on worker 0-0, policy_version 367282 (0.00089) [2022-07-09 18:26:08,579][26022] Updated weights on worker 0-0, policy_version 367292 (0.00088) [2022-07-09 18:26:09,382][25689] Fps is (10 sec: 5514.2, 60 sec: 5610.8, 300 sec: 5642.9). Total num frames: 376111104. Throughput: 0: 4993.0. Samples: 376107406. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:09,383][25689] Avg episode reward: [(0, '-48.834')] [2022-07-09 18:26:10,226][26022] Updated weights on worker 0-0, policy_version 367302 (0.00082) [2022-07-09 18:26:12,326][26022] Updated weights on worker 0-0, policy_version 367312 (0.00082) [2022-07-09 18:26:13,858][26022] Updated weights on worker 0-0, policy_version 367322 (0.00083) [2022-07-09 18:26:14,395][25689] Fps is (10 sec: 5722.4, 60 sec: 5666.6, 300 sec: 5644.6). Total num frames: 376139776. Throughput: 0: 5847.8. Samples: 376141744. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:14,395][25689] Avg episode reward: [(0, '-49.252')] [2022-07-09 18:26:15,807][26022] Updated weights on worker 0-0, policy_version 367332 (0.00097) [2022-07-09 18:26:17,527][26022] Updated weights on worker 0-0, policy_version 367342 (0.00093) [2022-07-09 18:26:19,474][25689] Fps is (10 sec: 5682.2, 60 sec: 5664.5, 300 sec: 5644.8). Total num frames: 376168448. Throughput: 0: 5834.6. Samples: 376175664. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:19,475][25689] Avg episode reward: [(0, '-48.372')] [2022-07-09 18:26:19,479][26022] Updated weights on worker 0-0, policy_version 367352 (0.00092) [2022-07-09 18:26:21,187][26022] Updated weights on worker 0-0, policy_version 367362 (0.00088) [2022-07-09 18:26:23,035][26022] Updated weights on worker 0-0, policy_version 367372 (0.00086) [2022-07-09 18:26:24,547][25689] Fps is (10 sec: 5648.3, 60 sec: 5660.3, 300 sec: 5647.7). Total num frames: 376197120. Throughput: 0: 5091.9. Samples: 376192836. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:24,548][25689] Avg episode reward: [(0, '-47.622')] [2022-07-09 18:26:24,851][26022] Updated weights on worker 0-0, policy_version 367382 (0.00084) [2022-07-09 18:26:26,696][26022] Updated weights on worker 0-0, policy_version 367392 (0.00091) [2022-07-09 18:26:28,342][26022] Updated weights on worker 0-0, policy_version 367402 (0.00092) [2022-07-09 18:26:29,588][25689] Fps is (10 sec: 5568.6, 60 sec: 5630.0, 300 sec: 5643.5). Total num frames: 376224768. Throughput: 0: 5912.8. Samples: 376226878. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:29,588][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 18:26:30,397][26022] Updated weights on worker 0-0, policy_version 367412 (0.00080) [2022-07-09 18:26:31,852][26022] Updated weights on worker 0-0, policy_version 367422 (0.00085) [2022-07-09 18:26:33,939][26022] Updated weights on worker 0-0, policy_version 367432 (0.00087) [2022-07-09 18:26:34,604][25689] Fps is (10 sec: 5600.1, 60 sec: 5632.2, 300 sec: 5641.9). Total num frames: 376253440. Throughput: 0: 5903.0. Samples: 376261040. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:34,605][25689] Avg episode reward: [(0, '-46.323')] [2022-07-09 18:26:35,612][26022] Updated weights on worker 0-0, policy_version 367442 (0.00128) [2022-07-09 18:26:37,481][26022] Updated weights on worker 0-0, policy_version 367452 (0.00086) [2022-07-09 18:26:39,211][26022] Updated weights on worker 0-0, policy_version 367462 (0.00097) [2022-07-09 18:26:39,683][25689] Fps is (10 sec: 5782.0, 60 sec: 5669.7, 300 sec: 5644.3). Total num frames: 376283136. Throughput: 0: 5077.8. Samples: 376278280. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:39,683][25689] Avg episode reward: [(0, '-45.895')] [2022-07-09 18:26:41,076][26022] Updated weights on worker 0-0, policy_version 367472 (0.00081) [2022-07-09 18:26:42,876][26022] Updated weights on worker 0-0, policy_version 367482 (0.00091) [2022-07-09 18:26:44,541][26022] Updated weights on worker 0-0, policy_version 367492 (0.00093) [2022-07-09 18:26:44,684][25689] Fps is (10 sec: 5892.0, 60 sec: 5673.3, 300 sec: 5651.6). Total num frames: 376312832. Throughput: 0: 5930.2. Samples: 376312252. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:44,685][25689] Avg episode reward: [(0, '-44.888')] [2022-07-09 18:26:46,517][26022] Updated weights on worker 0-0, policy_version 367502 (0.00087) [2022-07-09 18:26:48,173][26022] Updated weights on worker 0-0, policy_version 367512 (0.00089) [2022-07-09 18:26:49,721][25689] Fps is (10 sec: 5712.5, 60 sec: 5670.6, 300 sec: 5641.3). Total num frames: 376340480. Throughput: 0: 5956.6. Samples: 376346804. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:49,722][25689] Avg episode reward: [(0, '-45.101')] [2022-07-09 18:26:49,942][26022] Updated weights on worker 0-0, policy_version 367522 (0.00082) [2022-07-09 18:26:51,814][26022] Updated weights on worker 0-0, policy_version 367532 (0.00090) [2022-07-09 18:26:53,479][26022] Updated weights on worker 0-0, policy_version 367542 (0.00084) [2022-07-09 18:26:54,727][25689] Fps is (10 sec: 5505.8, 60 sec: 5636.9, 300 sec: 5646.1). Total num frames: 376368128. Throughput: 0: 5121.9. Samples: 376364110. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:54,729][25689] Avg episode reward: [(0, '-45.644')] [2022-07-09 18:26:55,508][26022] Updated weights on worker 0-0, policy_version 367552 (0.00091) [2022-07-09 18:26:57,337][26022] Updated weights on worker 0-0, policy_version 367562 (0.00534) [2022-07-09 18:26:59,114][26022] Updated weights on worker 0-0, policy_version 367572 (0.00083) [2022-07-09 18:26:59,777][25689] Fps is (10 sec: 5804.1, 60 sec: 5677.3, 300 sec: 5655.7). Total num frames: 376398848. Throughput: 0: 5963.0. Samples: 376398106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:26:59,778][25689] Avg episode reward: [(0, '-45.221')] [2022-07-09 18:27:00,954][26022] Updated weights on worker 0-0, policy_version 367582 (0.00104) [2022-07-09 18:27:02,961][26022] Updated weights on worker 0-0, policy_version 367592 (0.00077) [2022-07-09 18:27:04,768][26022] Updated weights on worker 0-0, policy_version 367602 (0.00090) [2022-07-09 18:27:04,784][25689] Fps is (10 sec: 5600.5, 60 sec: 5661.6, 300 sec: 5649.4). Total num frames: 376424448. Throughput: 0: 5859.8. Samples: 376430030. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:04,784][25689] Avg episode reward: [(0, '-45.550')] [2022-07-09 18:27:06,725][26022] Updated weights on worker 0-0, policy_version 367612 (0.00085) [2022-07-09 18:27:08,247][26022] Updated weights on worker 0-0, policy_version 367622 (0.00094) [2022-07-09 18:27:09,800][25689] Fps is (10 sec: 5210.8, 60 sec: 5627.0, 300 sec: 5642.3). Total num frames: 376451072. Throughput: 0: 5001.3. Samples: 376447224. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:09,805][25689] Avg episode reward: [(0, '-45.821')] [2022-07-09 18:27:10,344][26022] Updated weights on worker 0-0, policy_version 367632 (0.00087) [2022-07-09 18:27:11,810][26022] Updated weights on worker 0-0, policy_version 367642 (0.00091) [2022-07-09 18:27:13,860][26022] Updated weights on worker 0-0, policy_version 367652 (0.00092) [2022-07-09 18:27:14,833][25689] Fps is (10 sec: 5808.4, 60 sec: 5675.9, 300 sec: 5649.7). Total num frames: 376482816. Throughput: 0: 5842.9. Samples: 376481582. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:14,834][25689] Avg episode reward: [(0, '-46.450')] [2022-07-09 18:27:15,533][26022] Updated weights on worker 0-0, policy_version 367662 (0.00087) [2022-07-09 18:27:17,319][26022] Updated weights on worker 0-0, policy_version 367672 (0.00084) [2022-07-09 18:27:19,124][26022] Updated weights on worker 0-0, policy_version 367682 (0.00092) [2022-07-09 18:27:19,943][25689] Fps is (10 sec: 5754.2, 60 sec: 5639.1, 300 sec: 5644.5). Total num frames: 376509440. Throughput: 0: 5830.1. Samples: 376515674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:19,944][25689] Avg episode reward: [(0, '-46.080')] [2022-07-09 18:27:21,124][26022] Updated weights on worker 0-0, policy_version 367692 (0.00084) [2022-07-09 18:27:21,489][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:27:21,505][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000367695_376519680.pth [2022-07-09 18:27:21,510][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000365707_374483968.pth [2022-07-09 18:27:22,834][26022] Updated weights on worker 0-0, policy_version 367702 (0.00095) [2022-07-09 18:27:24,691][26022] Updated weights on worker 0-0, policy_version 367712 (0.00090) [2022-07-09 18:27:24,971][25689] Fps is (10 sec: 5454.2, 60 sec: 5643.4, 300 sec: 5644.5). Total num frames: 376538112. Throughput: 0: 5076.9. Samples: 376532518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:24,971][25689] Avg episode reward: [(0, '-46.927')] [2022-07-09 18:27:26,452][26022] Updated weights on worker 0-0, policy_version 367722 (0.00086) [2022-07-09 18:27:28,378][26022] Updated weights on worker 0-0, policy_version 367732 (0.00104) [2022-07-09 18:27:30,027][25689] Fps is (10 sec: 5686.9, 60 sec: 5658.9, 300 sec: 5643.6). Total num frames: 376566784. Throughput: 0: 5897.0. Samples: 376566502. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:30,030][25689] Avg episode reward: [(0, '-47.142')] [2022-07-09 18:27:30,047][26022] Updated weights on worker 0-0, policy_version 367742 (0.00086) [2022-07-09 18:27:31,971][26022] Updated weights on worker 0-0, policy_version 367752 (0.00098) [2022-07-09 18:27:33,733][26022] Updated weights on worker 0-0, policy_version 367762 (0.00615) [2022-07-09 18:27:35,094][25689] Fps is (10 sec: 5664.6, 60 sec: 5654.1, 300 sec: 5647.8). Total num frames: 376595456. Throughput: 0: 5866.7. Samples: 376600450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:35,095][25689] Avg episode reward: [(0, '-47.965')] [2022-07-09 18:27:35,578][26022] Updated weights on worker 0-0, policy_version 367772 (0.00094) [2022-07-09 18:27:37,279][26022] Updated weights on worker 0-0, policy_version 367782 (0.00087) [2022-07-09 18:27:39,225][26022] Updated weights on worker 0-0, policy_version 367792 (0.00083) [2022-07-09 18:27:40,176][25689] Fps is (10 sec: 5650.0, 60 sec: 5636.9, 300 sec: 5640.8). Total num frames: 376624128. Throughput: 0: 5025.3. Samples: 376617350. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 18:27:40,176][25689] Avg episode reward: [(0, '-47.987')] [2022-07-09 18:27:40,865][26022] Updated weights on worker 0-0, policy_version 367802 (0.00085) [2022-07-09 18:27:42,765][26022] Updated weights on worker 0-0, policy_version 367812 (0.00088) [2022-07-09 18:27:44,728][26022] Updated weights on worker 0-0, policy_version 367822 (0.00085) [2022-07-09 18:27:45,205][25689] Fps is (10 sec: 5671.3, 60 sec: 5617.4, 300 sec: 5640.9). Total num frames: 376652800. Throughput: 0: 5878.1. Samples: 376651458. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:27:45,206][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 18:27:46,231][26022] Updated weights on worker 0-0, policy_version 367832 (0.00084) [2022-07-09 18:27:48,266][26022] Updated weights on worker 0-0, policy_version 367842 (0.00083) [2022-07-09 18:27:50,118][26022] Updated weights on worker 0-0, policy_version 367852 (0.00093) [2022-07-09 18:27:50,219][25689] Fps is (10 sec: 5607.8, 60 sec: 5619.5, 300 sec: 5644.2). Total num frames: 376680448. Throughput: 0: 5888.5. Samples: 376685406. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:27:50,220][25689] Avg episode reward: [(0, '-48.849')] [2022-07-09 18:27:51,802][26022] Updated weights on worker 0-0, policy_version 367862 (0.00085) [2022-07-09 18:27:53,835][26022] Updated weights on worker 0-0, policy_version 367872 (0.00094) [2022-07-09 18:27:55,244][25689] Fps is (10 sec: 5712.3, 60 sec: 5651.6, 300 sec: 5644.7). Total num frames: 376710144. Throughput: 0: 5912.1. Samples: 376719578. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:27:55,244][25689] Avg episode reward: [(0, '-47.592')] [2022-07-09 18:27:55,278][26022] Updated weights on worker 0-0, policy_version 367882 (0.00102) [2022-07-09 18:27:57,342][26022] Updated weights on worker 0-0, policy_version 367892 (0.00079) [2022-07-09 18:27:58,787][26022] Updated weights on worker 0-0, policy_version 367902 (0.00091) [2022-07-09 18:28:00,319][25689] Fps is (10 sec: 5677.9, 60 sec: 5598.6, 300 sec: 5646.8). Total num frames: 376737792. Throughput: 0: 5919.7. Samples: 376736588. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:00,319][25689] Avg episode reward: [(0, '-47.129')] [2022-07-09 18:28:00,776][26022] Updated weights on worker 0-0, policy_version 367912 (0.00095) [2022-07-09 18:28:02,939][26022] Updated weights on worker 0-0, policy_version 367922 (0.00860) [2022-07-09 18:28:04,668][26022] Updated weights on worker 0-0, policy_version 367932 (0.00079) [2022-07-09 18:28:05,343][25689] Fps is (10 sec: 5475.5, 60 sec: 5630.7, 300 sec: 5643.0). Total num frames: 376765440. Throughput: 0: 5817.5. Samples: 376768608. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:05,343][25689] Avg episode reward: [(0, '-46.857')] [2022-07-09 18:28:06,539][26022] Updated weights on worker 0-0, policy_version 367942 (0.00086) [2022-07-09 18:28:08,363][26022] Updated weights on worker 0-0, policy_version 367952 (0.00094) [2022-07-09 18:28:10,124][26022] Updated weights on worker 0-0, policy_version 367962 (0.00086) [2022-07-09 18:28:10,348][25689] Fps is (10 sec: 5615.5, 60 sec: 5665.5, 300 sec: 5643.6). Total num frames: 376794112. Throughput: 0: 5842.3. Samples: 376803006. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:10,349][25689] Avg episode reward: [(0, '-47.840')] [2022-07-09 18:28:12,117][26022] Updated weights on worker 0-0, policy_version 367972 (0.00088) [2022-07-09 18:28:13,782][26022] Updated weights on worker 0-0, policy_version 367982 (0.00084) [2022-07-09 18:28:15,364][25689] Fps is (10 sec: 5619.7, 60 sec: 5599.4, 300 sec: 5640.9). Total num frames: 376821760. Throughput: 0: 4995.4. Samples: 376820088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:15,366][25689] Avg episode reward: [(0, '-46.851')] [2022-07-09 18:28:15,589][26022] Updated weights on worker 0-0, policy_version 367992 (0.00083) [2022-07-09 18:28:17,180][26022] Updated weights on worker 0-0, policy_version 368002 (0.00091) [2022-07-09 18:28:19,171][26022] Updated weights on worker 0-0, policy_version 368012 (0.00083) [2022-07-09 18:28:20,483][25689] Fps is (10 sec: 5658.3, 60 sec: 5649.5, 300 sec: 5643.8). Total num frames: 376851456. Throughput: 0: 5837.6. Samples: 376854298. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:20,484][25689] Avg episode reward: [(0, '-46.755')] [2022-07-09 18:28:21,053][26022] Updated weights on worker 0-0, policy_version 368022 (0.00082) [2022-07-09 18:28:22,841][26022] Updated weights on worker 0-0, policy_version 368032 (0.00085) [2022-07-09 18:28:24,465][26022] Updated weights on worker 0-0, policy_version 368042 (0.00088) [2022-07-09 18:28:25,538][25689] Fps is (10 sec: 5737.4, 60 sec: 5646.9, 300 sec: 5639.4). Total num frames: 376880128. Throughput: 0: 5938.7. Samples: 376888540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:25,538][25689] Avg episode reward: [(0, '-47.517')] [2022-07-09 18:28:26,367][26022] Updated weights on worker 0-0, policy_version 368052 (0.00083) [2022-07-09 18:28:28,148][26022] Updated weights on worker 0-0, policy_version 368062 (0.00096) [2022-07-09 18:28:30,048][26022] Updated weights on worker 0-0, policy_version 368072 (0.00102) [2022-07-09 18:28:30,563][25689] Fps is (10 sec: 5688.8, 60 sec: 5649.8, 300 sec: 5645.9). Total num frames: 376908800. Throughput: 0: 5062.4. Samples: 376905342. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:30,563][25689] Avg episode reward: [(0, '-46.844')] [2022-07-09 18:28:31,853][26022] Updated weights on worker 0-0, policy_version 368082 (0.00083) [2022-07-09 18:28:33,605][26022] Updated weights on worker 0-0, policy_version 368092 (0.00085) [2022-07-09 18:28:35,448][26022] Updated weights on worker 0-0, policy_version 368102 (0.00091) [2022-07-09 18:28:35,567][25689] Fps is (10 sec: 5717.9, 60 sec: 5655.7, 300 sec: 5644.1). Total num frames: 376937472. Throughput: 0: 5897.7. Samples: 376939234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:35,567][25689] Avg episode reward: [(0, '-46.228')] [2022-07-09 18:28:37,420][26022] Updated weights on worker 0-0, policy_version 368112 (0.00080) [2022-07-09 18:28:39,080][26022] Updated weights on worker 0-0, policy_version 368122 (0.00091) [2022-07-09 18:28:40,680][25689] Fps is (10 sec: 5566.7, 60 sec: 5635.9, 300 sec: 5642.3). Total num frames: 376965120. Throughput: 0: 5870.8. Samples: 376972874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:40,681][25689] Avg episode reward: [(0, '-46.306')] [2022-07-09 18:28:41,019][26022] Updated weights on worker 0-0, policy_version 368132 (0.00054) [2022-07-09 18:28:42,695][26022] Updated weights on worker 0-0, policy_version 368142 (0.00093) [2022-07-09 18:28:44,516][26022] Updated weights on worker 0-0, policy_version 368152 (0.00075) [2022-07-09 18:28:45,713][25689] Fps is (10 sec: 5550.9, 60 sec: 5635.5, 300 sec: 5645.8). Total num frames: 376993792. Throughput: 0: 5023.4. Samples: 376989888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:45,713][25689] Avg episode reward: [(0, '-45.843')] [2022-07-09 18:28:46,333][26022] Updated weights on worker 0-0, policy_version 368162 (0.00094) [2022-07-09 18:28:48,354][26022] Updated weights on worker 0-0, policy_version 368172 (0.00087) [2022-07-09 18:28:50,108][26022] Updated weights on worker 0-0, policy_version 368182 (0.00091) [2022-07-09 18:28:50,738][25689] Fps is (10 sec: 5498.0, 60 sec: 5617.6, 300 sec: 5635.1). Total num frames: 377020416. Throughput: 0: 5872.7. Samples: 377023822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:50,738][25689] Avg episode reward: [(0, '-45.731')] [2022-07-09 18:28:52,039][26022] Updated weights on worker 0-0, policy_version 368192 (0.00085) [2022-07-09 18:28:53,789][26022] Updated weights on worker 0-0, policy_version 368202 (0.00080) [2022-07-09 18:28:55,556][26022] Updated weights on worker 0-0, policy_version 368212 (0.00091) [2022-07-09 18:28:55,797][25689] Fps is (10 sec: 5686.8, 60 sec: 5631.3, 300 sec: 5645.3). Total num frames: 377051136. Throughput: 0: 5836.4. Samples: 377057304. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:28:55,797][25689] Avg episode reward: [(0, '-46.061')] [2022-07-09 18:28:57,315][26022] Updated weights on worker 0-0, policy_version 368222 (0.00086) [2022-07-09 18:28:59,065][26022] Updated weights on worker 0-0, policy_version 368232 (0.00091) [2022-07-09 18:29:00,870][25689] Fps is (10 sec: 5659.4, 60 sec: 5614.5, 300 sec: 5647.7). Total num frames: 377077760. Throughput: 0: 5027.2. Samples: 377074374. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:00,871][25689] Avg episode reward: [(0, '-47.752')] [2022-07-09 18:29:01,304][26022] Updated weights on worker 0-0, policy_version 368242 (0.00082) [2022-07-09 18:29:03,214][26022] Updated weights on worker 0-0, policy_version 368252 (0.00096) [2022-07-09 18:29:05,182][26022] Updated weights on worker 0-0, policy_version 368262 (0.00088) [2022-07-09 18:29:05,894][25689] Fps is (10 sec: 5171.9, 60 sec: 5580.7, 300 sec: 5637.6). Total num frames: 377103360. Throughput: 0: 5752.7. Samples: 377105986. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:05,896][25689] Avg episode reward: [(0, '-47.751')] [2022-07-09 18:29:06,806][26022] Updated weights on worker 0-0, policy_version 368272 (0.00089) [2022-07-09 18:29:08,751][26022] Updated weights on worker 0-0, policy_version 368282 (0.00088) [2022-07-09 18:29:10,373][26022] Updated weights on worker 0-0, policy_version 368292 (0.00085) [2022-07-09 18:29:10,921][25689] Fps is (10 sec: 5705.2, 60 sec: 5629.4, 300 sec: 5647.7). Total num frames: 377135104. Throughput: 0: 5765.7. Samples: 377140196. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:10,922][25689] Avg episode reward: [(0, '-48.271')] [2022-07-09 18:29:12,362][26022] Updated weights on worker 0-0, policy_version 368302 (0.00086) [2022-07-09 18:29:13,822][26022] Updated weights on worker 0-0, policy_version 368312 (0.00055) [2022-07-09 18:29:15,952][25689] Fps is (10 sec: 5701.5, 60 sec: 5594.3, 300 sec: 5634.9). Total num frames: 377160704. Throughput: 0: 4966.7. Samples: 377157410. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:15,954][25689] Avg episode reward: [(0, '-49.151')] [2022-07-09 18:29:16,126][26022] Updated weights on worker 0-0, policy_version 368322 (0.00090) [2022-07-09 18:29:17,539][26022] Updated weights on worker 0-0, policy_version 368332 (0.00083) [2022-07-09 18:29:19,403][26022] Updated weights on worker 0-0, policy_version 368342 (0.00055) [2022-07-09 18:29:20,999][25689] Fps is (10 sec: 5487.0, 60 sec: 5600.9, 300 sec: 5641.0). Total num frames: 377190400. Throughput: 0: 5822.5. Samples: 377191574. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:21,001][25689] Avg episode reward: [(0, '-49.504')] [2022-07-09 18:29:21,191][26022] Updated weights on worker 0-0, policy_version 368352 (0.00094) [2022-07-09 18:29:21,604][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:29:21,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000368354_377194496.pth [2022-07-09 18:29:21,623][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000366370_375162880.pth [2022-07-09 18:29:23,136][26022] Updated weights on worker 0-0, policy_version 368362 (0.00087) [2022-07-09 18:29:24,960][26022] Updated weights on worker 0-0, policy_version 368372 (0.00088) [2022-07-09 18:29:26,010][25689] Fps is (10 sec: 5803.0, 60 sec: 5604.9, 300 sec: 5637.4). Total num frames: 377219072. Throughput: 0: 5964.3. Samples: 377225964. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:26,011][25689] Avg episode reward: [(0, '-48.644')] [2022-07-09 18:29:26,553][26022] Updated weights on worker 0-0, policy_version 368382 (0.00084) [2022-07-09 18:29:28,452][26022] Updated weights on worker 0-0, policy_version 368392 (0.00092) [2022-07-09 18:29:30,161][26022] Updated weights on worker 0-0, policy_version 368402 (0.00089) [2022-07-09 18:29:31,045][25689] Fps is (10 sec: 5707.9, 60 sec: 5604.0, 300 sec: 5640.5). Total num frames: 377247744. Throughput: 0: 5106.5. Samples: 377242960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:31,046][25689] Avg episode reward: [(0, '-48.400')] [2022-07-09 18:29:31,978][26022] Updated weights on worker 0-0, policy_version 368412 (0.00088) [2022-07-09 18:29:33,664][26022] Updated weights on worker 0-0, policy_version 368422 (0.00085) [2022-07-09 18:29:35,595][26022] Updated weights on worker 0-0, policy_version 368432 (0.00086) [2022-07-09 18:29:36,070][25689] Fps is (10 sec: 5700.0, 60 sec: 5602.0, 300 sec: 5634.3). Total num frames: 377276416. Throughput: 0: 5957.6. Samples: 377277268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:36,071][25689] Avg episode reward: [(0, '-47.408')] [2022-07-09 18:29:37,409][26022] Updated weights on worker 0-0, policy_version 368442 (0.00093) [2022-07-09 18:29:39,165][26022] Updated weights on worker 0-0, policy_version 368452 (0.00088) [2022-07-09 18:29:40,871][26022] Updated weights on worker 0-0, policy_version 368462 (0.00403) [2022-07-09 18:29:41,137][25689] Fps is (10 sec: 5682.4, 60 sec: 5623.3, 300 sec: 5640.7). Total num frames: 377305088. Throughput: 0: 5950.0. Samples: 377311396. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:41,138][25689] Avg episode reward: [(0, '-47.045')] [2022-07-09 18:29:42,800][26022] Updated weights on worker 0-0, policy_version 368472 (0.00083) [2022-07-09 18:29:44,676][26022] Updated weights on worker 0-0, policy_version 368482 (0.00084) [2022-07-09 18:29:46,178][25689] Fps is (10 sec: 5673.5, 60 sec: 5622.6, 300 sec: 5641.3). Total num frames: 377333760. Throughput: 0: 5083.0. Samples: 377328476. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:46,178][25689] Avg episode reward: [(0, '-45.409')] [2022-07-09 18:29:46,402][26022] Updated weights on worker 0-0, policy_version 368492 (0.00084) [2022-07-09 18:29:48,256][26022] Updated weights on worker 0-0, policy_version 368502 (0.00089) [2022-07-09 18:29:49,965][26022] Updated weights on worker 0-0, policy_version 368512 (0.00093) [2022-07-09 18:29:51,253][25689] Fps is (10 sec: 5668.7, 60 sec: 5651.7, 300 sec: 5637.3). Total num frames: 377362432. Throughput: 0: 5919.4. Samples: 377362576. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:51,253][25689] Avg episode reward: [(0, '-45.508')] [2022-07-09 18:29:51,784][26022] Updated weights on worker 0-0, policy_version 368522 (0.00088) [2022-07-09 18:29:53,726][26022] Updated weights on worker 0-0, policy_version 368532 (0.00081) [2022-07-09 18:29:55,237][26022] Updated weights on worker 0-0, policy_version 368542 (0.00088) [2022-07-09 18:29:56,304][25689] Fps is (10 sec: 5764.3, 60 sec: 5635.6, 300 sec: 5640.7). Total num frames: 377392128. Throughput: 0: 5896.5. Samples: 377396572. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:29:56,304][25689] Avg episode reward: [(0, '-44.929')] [2022-07-09 18:29:57,333][26022] Updated weights on worker 0-0, policy_version 368552 (0.00088) [2022-07-09 18:29:58,950][26022] Updated weights on worker 0-0, policy_version 368562 (0.00091) [2022-07-09 18:30:00,919][26022] Updated weights on worker 0-0, policy_version 368572 (0.00094) [2022-07-09 18:30:01,395][25689] Fps is (10 sec: 5654.0, 60 sec: 5650.8, 300 sec: 5649.5). Total num frames: 377419776. Throughput: 0: 5037.7. Samples: 377413450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:30:01,396][25689] Avg episode reward: [(0, '-45.084')] [2022-07-09 18:30:03,202][26022] Updated weights on worker 0-0, policy_version 368582 (0.00087) [2022-07-09 18:30:04,995][26022] Updated weights on worker 0-0, policy_version 368592 (0.00094) [2022-07-09 18:30:06,451][25689] Fps is (10 sec: 5348.6, 60 sec: 5664.8, 300 sec: 5628.1). Total num frames: 377446400. Throughput: 0: 5752.4. Samples: 377445094. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:30:06,451][25689] Avg episode reward: [(0, '-45.435')] [2022-07-09 18:30:06,874][26022] Updated weights on worker 0-0, policy_version 368602 (0.00086) [2022-07-09 18:30:08,502][26022] Updated weights on worker 0-0, policy_version 368612 (0.00080) [2022-07-09 18:30:10,420][26022] Updated weights on worker 0-0, policy_version 368622 (0.00086) [2022-07-09 18:30:11,463][25689] Fps is (10 sec: 5492.5, 60 sec: 5615.5, 300 sec: 5639.5). Total num frames: 377475072. Throughput: 0: 5770.2. Samples: 377479190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:30:11,464][25689] Avg episode reward: [(0, '-46.375')] [2022-07-09 18:30:12,248][26022] Updated weights on worker 0-0, policy_version 368632 (0.00092) [2022-07-09 18:30:13,916][26022] Updated weights on worker 0-0, policy_version 368642 (0.00087) [2022-07-09 18:30:15,895][26022] Updated weights on worker 0-0, policy_version 368652 (0.00087) [2022-07-09 18:30:16,470][25689] Fps is (10 sec: 5621.3, 60 sec: 5651.5, 300 sec: 5637.0). Total num frames: 377502720. Throughput: 0: 4953.0. Samples: 377496454. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-09 18:30:16,470][25689] Avg episode reward: [(0, '-46.540')] [2022-07-09 18:30:17,583][26022] Updated weights on worker 0-0, policy_version 368662 (0.00089) [2022-07-09 18:30:19,495][26022] Updated weights on worker 0-0, policy_version 368672 (0.00093) [2022-07-09 18:30:21,268][26022] Updated weights on worker 0-0, policy_version 368682 (0.00092) [2022-07-09 18:30:21,571][25689] Fps is (10 sec: 5673.1, 60 sec: 5646.5, 300 sec: 5639.0). Total num frames: 377532416. Throughput: 0: 5796.1. Samples: 377530390. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:21,571][25689] Avg episode reward: [(0, '-45.829')] [2022-07-09 18:30:23,079][26022] Updated weights on worker 0-0, policy_version 368692 (0.00082) [2022-07-09 18:30:24,835][26022] Updated weights on worker 0-0, policy_version 368702 (0.00086) [2022-07-09 18:30:26,584][25689] Fps is (10 sec: 5771.0, 60 sec: 5646.3, 300 sec: 5636.8). Total num frames: 377561088. Throughput: 0: 5923.6. Samples: 377564354. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:26,584][25689] Avg episode reward: [(0, '-46.655')] [2022-07-09 18:30:26,585][26022] Updated weights on worker 0-0, policy_version 368712 (0.00085) [2022-07-09 18:30:28,412][26022] Updated weights on worker 0-0, policy_version 368722 (0.00085) [2022-07-09 18:30:30,423][26022] Updated weights on worker 0-0, policy_version 368732 (0.00089) [2022-07-09 18:30:31,616][25689] Fps is (10 sec: 5606.6, 60 sec: 5629.7, 300 sec: 5633.5). Total num frames: 377588736. Throughput: 0: 5906.7. Samples: 377598230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:31,617][25689] Avg episode reward: [(0, '-46.417')] [2022-07-09 18:30:32,147][26022] Updated weights on worker 0-0, policy_version 368742 (0.00093) [2022-07-09 18:30:34,105][26022] Updated weights on worker 0-0, policy_version 368752 (0.00085) [2022-07-09 18:30:35,616][26022] Updated weights on worker 0-0, policy_version 368762 (0.00058) [2022-07-09 18:30:36,643][25689] Fps is (10 sec: 5700.5, 60 sec: 5646.4, 300 sec: 5642.1). Total num frames: 377618432. Throughput: 0: 5888.2. Samples: 377615240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:36,644][25689] Avg episode reward: [(0, '-46.320')] [2022-07-09 18:30:37,674][26022] Updated weights on worker 0-0, policy_version 368772 (0.00087) [2022-07-09 18:30:39,227][26022] Updated weights on worker 0-0, policy_version 368782 (0.00091) [2022-07-09 18:30:41,433][26022] Updated weights on worker 0-0, policy_version 368792 (0.00084) [2022-07-09 18:30:41,709][25689] Fps is (10 sec: 5580.4, 60 sec: 5612.7, 300 sec: 5631.3). Total num frames: 377645056. Throughput: 0: 5898.0. Samples: 377649162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:41,709][25689] Avg episode reward: [(0, '-47.140')] [2022-07-09 18:30:42,825][26022] Updated weights on worker 0-0, policy_version 368802 (0.00088) [2022-07-09 18:30:44,906][26022] Updated weights on worker 0-0, policy_version 368812 (0.00086) [2022-07-09 18:30:46,354][26022] Updated weights on worker 0-0, policy_version 368822 (0.00085) [2022-07-09 18:30:46,733][25689] Fps is (10 sec: 5683.4, 60 sec: 5648.0, 300 sec: 5641.3). Total num frames: 377675776. Throughput: 0: 5908.9. Samples: 377683414. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:46,734][25689] Avg episode reward: [(0, '-47.477')] [2022-07-09 18:30:48,502][26022] Updated weights on worker 0-0, policy_version 368832 (0.00086) [2022-07-09 18:30:49,982][26022] Updated weights on worker 0-0, policy_version 368842 (0.00093) [2022-07-09 18:30:51,739][25689] Fps is (10 sec: 5717.0, 60 sec: 5620.6, 300 sec: 5631.1). Total num frames: 377702400. Throughput: 0: 5090.5. Samples: 377700666. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:51,739][25689] Avg episode reward: [(0, '-47.545')] [2022-07-09 18:30:52,020][26022] Updated weights on worker 0-0, policy_version 368852 (0.00088) [2022-07-09 18:30:53,618][26022] Updated weights on worker 0-0, policy_version 368862 (0.00091) [2022-07-09 18:30:55,640][26022] Updated weights on worker 0-0, policy_version 368872 (0.00085) [2022-07-09 18:30:56,745][25689] Fps is (10 sec: 5625.1, 60 sec: 5624.7, 300 sec: 5636.6). Total num frames: 377732096. Throughput: 0: 5946.9. Samples: 377734784. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:30:56,746][25689] Avg episode reward: [(0, '-47.409')] [2022-07-09 18:30:57,315][26022] Updated weights on worker 0-0, policy_version 368882 (0.00084) [2022-07-09 18:30:59,198][26022] Updated weights on worker 0-0, policy_version 368892 (0.00082) [2022-07-09 18:31:00,807][26022] Updated weights on worker 0-0, policy_version 368902 (0.00086) [2022-07-09 18:31:01,811][25689] Fps is (10 sec: 5795.2, 60 sec: 5644.1, 300 sec: 5642.7). Total num frames: 377760768. Throughput: 0: 5956.8. Samples: 377768906. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:01,811][25689] Avg episode reward: [(0, '-48.350')] [2022-07-09 18:31:03,336][26022] Updated weights on worker 0-0, policy_version 368912 (0.00082) [2022-07-09 18:31:04,766][26022] Updated weights on worker 0-0, policy_version 368922 (0.00091) [2022-07-09 18:31:06,789][26022] Updated weights on worker 0-0, policy_version 368932 (0.00096) [2022-07-09 18:31:06,881][25689] Fps is (10 sec: 5354.6, 60 sec: 5625.8, 300 sec: 5631.2). Total num frames: 377786368. Throughput: 0: 4994.8. Samples: 377784048. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:06,881][25689] Avg episode reward: [(0, '-48.224')] [2022-07-09 18:31:08,703][26022] Updated weights on worker 0-0, policy_version 368942 (0.00084) [2022-07-09 18:31:10,532][26022] Updated weights on worker 0-0, policy_version 368952 (0.00090) [2022-07-09 18:31:11,904][25689] Fps is (10 sec: 5376.7, 60 sec: 5624.7, 300 sec: 5631.0). Total num frames: 377815040. Throughput: 0: 5799.5. Samples: 377817616. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:11,905][25689] Avg episode reward: [(0, '-47.482')] [2022-07-09 18:31:12,199][26022] Updated weights on worker 0-0, policy_version 368962 (0.00086) [2022-07-09 18:31:14,097][26022] Updated weights on worker 0-0, policy_version 368972 (0.00086) [2022-07-09 18:31:15,664][26022] Updated weights on worker 0-0, policy_version 368982 (0.00091) [2022-07-09 18:31:16,983][25689] Fps is (10 sec: 5676.5, 60 sec: 5635.0, 300 sec: 5631.0). Total num frames: 377843712. Throughput: 0: 5789.5. Samples: 377851948. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:16,983][25689] Avg episode reward: [(0, '-48.154')] [2022-07-09 18:31:17,908][26022] Updated weights on worker 0-0, policy_version 368992 (0.00086) [2022-07-09 18:31:19,381][26022] Updated weights on worker 0-0, policy_version 369002 (0.00085) [2022-07-09 18:31:21,209][26022] Updated weights on worker 0-0, policy_version 369012 (0.00088) [2022-07-09 18:31:21,658][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:31:21,667][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000369015_377871360.pth [2022-07-09 18:31:21,668][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000367032_375840768.pth [2022-07-09 18:31:22,047][25689] Fps is (10 sec: 5654.0, 60 sec: 5621.5, 300 sec: 5631.1). Total num frames: 377872384. Throughput: 0: 4942.8. Samples: 377868928. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:22,047][25689] Avg episode reward: [(0, '-48.258')] [2022-07-09 18:31:23,032][26022] Updated weights on worker 0-0, policy_version 369022 (0.00084) [2022-07-09 18:31:24,935][26022] Updated weights on worker 0-0, policy_version 369032 (0.00089) [2022-07-09 18:31:26,755][26022] Updated weights on worker 0-0, policy_version 369042 (0.00085) [2022-07-09 18:31:27,064][25689] Fps is (10 sec: 5688.4, 60 sec: 5621.2, 300 sec: 5635.0). Total num frames: 377901056. Throughput: 0: 5915.3. Samples: 377903436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:27,064][25689] Avg episode reward: [(0, '-47.438')] [2022-07-09 18:31:28,402][26022] Updated weights on worker 0-0, policy_version 369052 (0.00077) [2022-07-09 18:31:30,291][26022] Updated weights on worker 0-0, policy_version 369062 (0.00094) [2022-07-09 18:31:32,021][26022] Updated weights on worker 0-0, policy_version 369072 (0.00106) [2022-07-09 18:31:32,067][25689] Fps is (10 sec: 5723.0, 60 sec: 5640.8, 300 sec: 5635.3). Total num frames: 377929728. Throughput: 0: 5943.3. Samples: 377937446. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:32,067][25689] Avg episode reward: [(0, '-47.932')] [2022-07-09 18:31:33,795][26022] Updated weights on worker 0-0, policy_version 369082 (0.00095) [2022-07-09 18:31:35,639][26022] Updated weights on worker 0-0, policy_version 369092 (0.00114) [2022-07-09 18:31:37,068][25689] Fps is (10 sec: 5732.0, 60 sec: 5626.3, 300 sec: 5633.3). Total num frames: 377958400. Throughput: 0: 5113.4. Samples: 377954652. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:37,068][25689] Avg episode reward: [(0, '-47.723')] [2022-07-09 18:31:37,446][26022] Updated weights on worker 0-0, policy_version 369102 (0.00087) [2022-07-09 18:31:39,342][26022] Updated weights on worker 0-0, policy_version 369112 (0.00088) [2022-07-09 18:31:40,997][26022] Updated weights on worker 0-0, policy_version 369122 (0.00086) [2022-07-09 18:31:42,112][25689] Fps is (10 sec: 5606.8, 60 sec: 5645.3, 300 sec: 5625.6). Total num frames: 377986048. Throughput: 0: 5956.9. Samples: 377988454. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:42,113][25689] Avg episode reward: [(0, '-47.414')] [2022-07-09 18:31:43,007][26022] Updated weights on worker 0-0, policy_version 369132 (0.00089) [2022-07-09 18:31:44,661][26022] Updated weights on worker 0-0, policy_version 369142 (0.00088) [2022-07-09 18:31:46,403][26022] Updated weights on worker 0-0, policy_version 369152 (0.00092) [2022-07-09 18:31:47,142][25689] Fps is (10 sec: 5692.1, 60 sec: 5627.8, 300 sec: 5632.6). Total num frames: 378015744. Throughput: 0: 5939.6. Samples: 378022694. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:47,143][25689] Avg episode reward: [(0, '-46.793')] [2022-07-09 18:31:48,372][26022] Updated weights on worker 0-0, policy_version 369162 (0.00090) [2022-07-09 18:31:49,994][26022] Updated weights on worker 0-0, policy_version 369172 (0.00109) [2022-07-09 18:31:51,983][26022] Updated weights on worker 0-0, policy_version 369182 (0.00082) [2022-07-09 18:31:52,150][25689] Fps is (10 sec: 5712.5, 60 sec: 5644.5, 300 sec: 5632.6). Total num frames: 378043392. Throughput: 0: 5102.7. Samples: 378039926. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:52,151][25689] Avg episode reward: [(0, '-46.441')] [2022-07-09 18:31:53,457][26022] Updated weights on worker 0-0, policy_version 369192 (0.00088) [2022-07-09 18:31:55,396][26022] Updated weights on worker 0-0, policy_version 369202 (0.00091) [2022-07-09 18:31:57,153][25689] Fps is (10 sec: 5625.7, 60 sec: 5627.8, 300 sec: 5626.6). Total num frames: 378072064. Throughput: 0: 5951.7. Samples: 378074196. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:31:57,154][25689] Avg episode reward: [(0, '-47.939')] [2022-07-09 18:31:57,415][26022] Updated weights on worker 0-0, policy_version 369212 (0.00094) [2022-07-09 18:31:58,915][26022] Updated weights on worker 0-0, policy_version 369222 (0.00095) [2022-07-09 18:32:00,959][26022] Updated weights on worker 0-0, policy_version 369232 (0.00084) [2022-07-09 18:32:02,203][25689] Fps is (10 sec: 5602.5, 60 sec: 5612.4, 300 sec: 5632.7). Total num frames: 378099712. Throughput: 0: 5938.3. Samples: 378107760. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:02,205][25689] Avg episode reward: [(0, '-47.583')] [2022-07-09 18:32:02,817][26022] Updated weights on worker 0-0, policy_version 369242 (0.00092) [2022-07-09 18:32:04,992][26022] Updated weights on worker 0-0, policy_version 369252 (0.00090) [2022-07-09 18:32:06,374][26022] Updated weights on worker 0-0, policy_version 369262 (0.00088) [2022-07-09 18:32:07,232][25689] Fps is (10 sec: 5384.7, 60 sec: 5633.1, 300 sec: 5632.4). Total num frames: 378126336. Throughput: 0: 5002.9. Samples: 378123202. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:07,233][25689] Avg episode reward: [(0, '-47.439')] [2022-07-09 18:32:08,426][26022] Updated weights on worker 0-0, policy_version 369272 (0.00090) [2022-07-09 18:32:10,309][26022] Updated weights on worker 0-0, policy_version 369282 (0.00087) [2022-07-09 18:32:12,031][26022] Updated weights on worker 0-0, policy_version 369292 (0.00082) [2022-07-09 18:32:12,238][25689] Fps is (10 sec: 5612.5, 60 sec: 5651.8, 300 sec: 5626.1). Total num frames: 378156032. Throughput: 0: 5846.7. Samples: 378157372. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:12,239][25689] Avg episode reward: [(0, '-46.963')] [2022-07-09 18:32:13,916][26022] Updated weights on worker 0-0, policy_version 369302 (0.00101) [2022-07-09 18:32:15,567][26022] Updated weights on worker 0-0, policy_version 369312 (0.00088) [2022-07-09 18:32:17,241][25689] Fps is (10 sec: 5729.5, 60 sec: 5641.9, 300 sec: 5631.5). Total num frames: 378183680. Throughput: 0: 5856.1. Samples: 378191830. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:17,241][25689] Avg episode reward: [(0, '-48.282')] [2022-07-09 18:32:17,457][26022] Updated weights on worker 0-0, policy_version 369322 (0.00088) [2022-07-09 18:32:19,231][26022] Updated weights on worker 0-0, policy_version 369332 (0.00087) [2022-07-09 18:32:21,019][26022] Updated weights on worker 0-0, policy_version 369342 (0.00079) [2022-07-09 18:32:22,326][25689] Fps is (10 sec: 5684.4, 60 sec: 5656.9, 300 sec: 5633.9). Total num frames: 378213376. Throughput: 0: 5017.2. Samples: 378208720. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:22,326][25689] Avg episode reward: [(0, '-48.366')] [2022-07-09 18:32:22,865][26022] Updated weights on worker 0-0, policy_version 369352 (0.00092) [2022-07-09 18:32:24,659][26022] Updated weights on worker 0-0, policy_version 369362 (0.00086) [2022-07-09 18:32:26,579][26022] Updated weights on worker 0-0, policy_version 369372 (0.00091) [2022-07-09 18:32:27,359][25689] Fps is (10 sec: 5667.3, 60 sec: 5638.3, 300 sec: 5630.9). Total num frames: 378241024. Throughput: 0: 5941.7. Samples: 378242790. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:27,360][25689] Avg episode reward: [(0, '-48.507')] [2022-07-09 18:32:28,206][26022] Updated weights on worker 0-0, policy_version 369382 (0.00093) [2022-07-09 18:32:30,188][26022] Updated weights on worker 0-0, policy_version 369392 (0.00090) [2022-07-09 18:32:31,814][26022] Updated weights on worker 0-0, policy_version 369402 (0.00093) [2022-07-09 18:32:32,373][25689] Fps is (10 sec: 5707.3, 60 sec: 5654.3, 300 sec: 5635.3). Total num frames: 378270720. Throughput: 0: 5947.6. Samples: 378277130. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:32,374][25689] Avg episode reward: [(0, '-48.833')] [2022-07-09 18:32:33,764][26022] Updated weights on worker 0-0, policy_version 369412 (0.00088) [2022-07-09 18:32:35,541][26022] Updated weights on worker 0-0, policy_version 369422 (0.00088) [2022-07-09 18:32:37,228][26022] Updated weights on worker 0-0, policy_version 369432 (0.00107) [2022-07-09 18:32:37,400][25689] Fps is (10 sec: 5813.0, 60 sec: 5651.9, 300 sec: 5636.4). Total num frames: 378299392. Throughput: 0: 5082.8. Samples: 378294296. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:37,401][25689] Avg episode reward: [(0, '-49.859')] [2022-07-09 18:32:39,149][26022] Updated weights on worker 0-0, policy_version 369442 (0.00083) [2022-07-09 18:32:40,848][26022] Updated weights on worker 0-0, policy_version 369452 (0.00093) [2022-07-09 18:32:42,452][25689] Fps is (10 sec: 5587.9, 60 sec: 5651.2, 300 sec: 5632.5). Total num frames: 378327040. Throughput: 0: 5951.6. Samples: 378328504. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:42,453][25689] Avg episode reward: [(0, '-50.716')] [2022-07-09 18:32:42,653][26022] Updated weights on worker 0-0, policy_version 369462 (0.00095) [2022-07-09 18:32:44,588][26022] Updated weights on worker 0-0, policy_version 369472 (0.00090) [2022-07-09 18:32:46,220][26022] Updated weights on worker 0-0, policy_version 369482 (0.00085) [2022-07-09 18:32:47,456][25689] Fps is (10 sec: 5600.9, 60 sec: 5636.7, 300 sec: 5636.1). Total num frames: 378355712. Throughput: 0: 5954.9. Samples: 378362462. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:47,456][25689] Avg episode reward: [(0, '-49.356')] [2022-07-09 18:32:48,184][26022] Updated weights on worker 0-0, policy_version 369492 (0.00082) [2022-07-09 18:32:49,770][26022] Updated weights on worker 0-0, policy_version 369502 (0.00094) [2022-07-09 18:32:51,558][26022] Updated weights on worker 0-0, policy_version 369512 (0.00107) [2022-07-09 18:32:52,469][25689] Fps is (10 sec: 5724.6, 60 sec: 5653.1, 300 sec: 5632.9). Total num frames: 378384384. Throughput: 0: 5104.6. Samples: 378379712. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-09 18:32:52,470][25689] Avg episode reward: [(0, '-49.400')] [2022-07-09 18:32:53,344][26022] Updated weights on worker 0-0, policy_version 369522 (0.00086) [2022-07-09 18:32:55,307][26022] Updated weights on worker 0-0, policy_version 369532 (0.00084) [2022-07-09 18:32:57,006][26022] Updated weights on worker 0-0, policy_version 369542 (0.00084) [2022-07-09 18:32:57,482][25689] Fps is (10 sec: 5719.3, 60 sec: 5652.2, 300 sec: 5637.5). Total num frames: 378413056. Throughput: 0: 5961.0. Samples: 378414004. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:32:57,483][25689] Avg episode reward: [(0, '-48.861')] [2022-07-09 18:32:58,814][26022] Updated weights on worker 0-0, policy_version 369552 (0.00089) [2022-07-09 18:33:00,632][26022] Updated weights on worker 0-0, policy_version 369562 (0.00093) [2022-07-09 18:33:02,535][25689] Fps is (10 sec: 5493.7, 60 sec: 5635.0, 300 sec: 5633.5). Total num frames: 378439680. Throughput: 0: 5871.4. Samples: 378446416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:02,535][25689] Avg episode reward: [(0, '-48.287')] [2022-07-09 18:33:02,927][26022] Updated weights on worker 0-0, policy_version 369572 (0.00099) [2022-07-09 18:33:04,583][26022] Updated weights on worker 0-0, policy_version 369582 (0.00093) [2022-07-09 18:33:06,520][26022] Updated weights on worker 0-0, policy_version 369592 (0.00091) [2022-07-09 18:33:07,536][25689] Fps is (10 sec: 5499.9, 60 sec: 5671.5, 300 sec: 5633.6). Total num frames: 378468352. Throughput: 0: 5010.7. Samples: 378463078. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:07,537][25689] Avg episode reward: [(0, '-47.628')] [2022-07-09 18:33:08,195][26022] Updated weights on worker 0-0, policy_version 369602 (0.00099) [2022-07-09 18:33:10,155][26022] Updated weights on worker 0-0, policy_version 369612 (0.00109) [2022-07-09 18:33:11,843][26022] Updated weights on worker 0-0, policy_version 369622 (0.00093) [2022-07-09 18:33:12,540][25689] Fps is (10 sec: 5731.3, 60 sec: 5654.7, 300 sec: 5637.3). Total num frames: 378497024. Throughput: 0: 5862.4. Samples: 378497374. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:12,540][25689] Avg episode reward: [(0, '-48.212')] [2022-07-09 18:33:13,579][26022] Updated weights on worker 0-0, policy_version 369632 (0.00087) [2022-07-09 18:33:15,596][26022] Updated weights on worker 0-0, policy_version 369642 (0.00085) [2022-07-09 18:33:17,225][26022] Updated weights on worker 0-0, policy_version 369652 (0.00086) [2022-07-09 18:33:17,544][25689] Fps is (10 sec: 5729.9, 60 sec: 5671.6, 300 sec: 5636.0). Total num frames: 378525696. Throughput: 0: 5875.1. Samples: 378531870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:17,545][25689] Avg episode reward: [(0, '-48.094')] [2022-07-09 18:33:18,936][26022] Updated weights on worker 0-0, policy_version 369662 (0.00091) [2022-07-09 18:33:20,674][26022] Updated weights on worker 0-0, policy_version 369672 (0.00090) [2022-07-09 18:33:21,770][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:33:21,784][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000369677_378549248.pth [2022-07-09 18:33:21,785][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000367695_376519680.pth [2022-07-09 18:33:22,544][26022] Updated weights on worker 0-0, policy_version 369682 (0.00095) [2022-07-09 18:33:22,680][25689] Fps is (10 sec: 5655.2, 60 sec: 5649.8, 300 sec: 5634.5). Total num frames: 378554368. Throughput: 0: 5096.1. Samples: 378549078. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:22,680][25689] Avg episode reward: [(0, '-46.571')] [2022-07-09 18:33:24,441][26022] Updated weights on worker 0-0, policy_version 369692 (0.00411) [2022-07-09 18:33:26,178][26022] Updated weights on worker 0-0, policy_version 369702 (0.00081) [2022-07-09 18:33:27,761][25689] Fps is (10 sec: 5712.8, 60 sec: 5679.3, 300 sec: 5636.9). Total num frames: 378584064. Throughput: 0: 5939.3. Samples: 378583200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:27,762][25689] Avg episode reward: [(0, '-46.261')] [2022-07-09 18:33:27,936][26022] Updated weights on worker 0-0, policy_version 369712 (0.00090) [2022-07-09 18:33:29,794][26022] Updated weights on worker 0-0, policy_version 369722 (0.00085) [2022-07-09 18:33:31,691][26022] Updated weights on worker 0-0, policy_version 369732 (0.00114) [2022-07-09 18:33:32,810][25689] Fps is (10 sec: 5660.8, 60 sec: 5642.1, 300 sec: 5632.6). Total num frames: 378611712. Throughput: 0: 5925.2. Samples: 378617478. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:32,811][25689] Avg episode reward: [(0, '-46.013')] [2022-07-09 18:33:33,363][26022] Updated weights on worker 0-0, policy_version 369742 (0.00091) [2022-07-09 18:33:35,004][26022] Updated weights on worker 0-0, policy_version 369752 (0.00085) [2022-07-09 18:33:36,954][26022] Updated weights on worker 0-0, policy_version 369762 (0.00089) [2022-07-09 18:33:37,828][25689] Fps is (10 sec: 5696.4, 60 sec: 5659.9, 300 sec: 5641.3). Total num frames: 378641408. Throughput: 0: 5064.1. Samples: 378634590. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:37,829][25689] Avg episode reward: [(0, '-45.164')] [2022-07-09 18:33:38,780][26022] Updated weights on worker 0-0, policy_version 369772 (0.00086) [2022-07-09 18:33:40,555][26022] Updated weights on worker 0-0, policy_version 369782 (0.00091) [2022-07-09 18:33:42,483][26022] Updated weights on worker 0-0, policy_version 369792 (0.00094) [2022-07-09 18:33:42,947][25689] Fps is (10 sec: 5657.1, 60 sec: 5653.7, 300 sec: 5636.2). Total num frames: 378669056. Throughput: 0: 5898.6. Samples: 378668624. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:42,947][25689] Avg episode reward: [(0, '-45.467')] [2022-07-09 18:33:44,112][26022] Updated weights on worker 0-0, policy_version 369802 (0.00089) [2022-07-09 18:33:45,931][26022] Updated weights on worker 0-0, policy_version 369812 (0.00085) [2022-07-09 18:33:47,739][26022] Updated weights on worker 0-0, policy_version 369822 (0.00084) [2022-07-09 18:33:47,967][25689] Fps is (10 sec: 5656.1, 60 sec: 5669.1, 300 sec: 5646.6). Total num frames: 378698752. Throughput: 0: 5921.7. Samples: 378702850. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:47,967][25689] Avg episode reward: [(0, '-46.326')] [2022-07-09 18:33:49,486][26022] Updated weights on worker 0-0, policy_version 369832 (0.00090) [2022-07-09 18:33:51,513][26022] Updated weights on worker 0-0, policy_version 369842 (0.00091) [2022-07-09 18:33:52,995][25689] Fps is (10 sec: 5706.9, 60 sec: 5650.8, 300 sec: 5636.9). Total num frames: 378726400. Throughput: 0: 5078.5. Samples: 378719986. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:52,995][25689] Avg episode reward: [(0, '-47.172')] [2022-07-09 18:33:53,195][26022] Updated weights on worker 0-0, policy_version 369852 (0.00737) [2022-07-09 18:33:55,031][26022] Updated weights on worker 0-0, policy_version 369862 (0.00086) [2022-07-09 18:33:57,024][26022] Updated weights on worker 0-0, policy_version 369872 (0.00091) [2022-07-09 18:33:58,018][25689] Fps is (10 sec: 5603.1, 60 sec: 5649.8, 300 sec: 5644.7). Total num frames: 378755072. Throughput: 0: 5886.4. Samples: 378753436. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:33:58,019][25689] Avg episode reward: [(0, '-47.179')] [2022-07-09 18:33:58,565][26022] Updated weights on worker 0-0, policy_version 369882 (0.00080) [2022-07-09 18:34:00,728][26022] Updated weights on worker 0-0, policy_version 369892 (0.00088) [2022-07-09 18:34:02,705][26022] Updated weights on worker 0-0, policy_version 369902 (0.00096) [2022-07-09 18:34:03,138][25689] Fps is (10 sec: 5451.6, 60 sec: 5643.5, 300 sec: 5646.3). Total num frames: 378781696. Throughput: 0: 5793.6. Samples: 378785604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:03,138][25689] Avg episode reward: [(0, '-47.139')] [2022-07-09 18:34:04,644][26022] Updated weights on worker 0-0, policy_version 369912 (0.00097) [2022-07-09 18:34:06,235][26022] Updated weights on worker 0-0, policy_version 369922 (0.00096) [2022-07-09 18:34:07,974][26022] Updated weights on worker 0-0, policy_version 369932 (0.00086) [2022-07-09 18:34:08,241][25689] Fps is (10 sec: 5509.4, 60 sec: 5651.0, 300 sec: 5638.0). Total num frames: 378811392. Throughput: 0: 5772.7. Samples: 378819886. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:08,241][25689] Avg episode reward: [(0, '-47.491')] [2022-07-09 18:34:09,988][26022] Updated weights on worker 0-0, policy_version 369942 (0.00089) [2022-07-09 18:34:11,701][26022] Updated weights on worker 0-0, policy_version 369952 (0.00086) [2022-07-09 18:34:13,254][25689] Fps is (10 sec: 5668.8, 60 sec: 5633.3, 300 sec: 5645.2). Total num frames: 378839040. Throughput: 0: 5758.6. Samples: 378836648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:13,254][25689] Avg episode reward: [(0, '-47.580')] [2022-07-09 18:34:13,574][26022] Updated weights on worker 0-0, policy_version 369962 (0.00092) [2022-07-09 18:34:15,366][26022] Updated weights on worker 0-0, policy_version 369972 (0.00084) [2022-07-09 18:34:17,059][26022] Updated weights on worker 0-0, policy_version 369982 (0.00092) [2022-07-09 18:34:18,280][25689] Fps is (10 sec: 5712.2, 60 sec: 5648.1, 300 sec: 5645.6). Total num frames: 378868736. Throughput: 0: 5810.9. Samples: 378871174. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:18,280][25689] Avg episode reward: [(0, '-48.057')] [2022-07-09 18:34:18,943][26022] Updated weights on worker 0-0, policy_version 369992 (0.00086) [2022-07-09 18:34:20,703][26022] Updated weights on worker 0-0, policy_version 370002 (0.00088) [2022-07-09 18:34:22,414][26022] Updated weights on worker 0-0, policy_version 370012 (0.00087) [2022-07-09 18:34:23,368][25689] Fps is (10 sec: 5771.0, 60 sec: 5652.5, 300 sec: 5644.2). Total num frames: 378897408. Throughput: 0: 5931.8. Samples: 378905602. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:23,368][25689] Avg episode reward: [(0, '-47.551')] [2022-07-09 18:34:24,402][26022] Updated weights on worker 0-0, policy_version 370022 (0.00092) [2022-07-09 18:34:26,071][26022] Updated weights on worker 0-0, policy_version 370032 (0.00093) [2022-07-09 18:34:27,836][26022] Updated weights on worker 0-0, policy_version 370042 (0.00089) [2022-07-09 18:34:28,380][25689] Fps is (10 sec: 5576.0, 60 sec: 5625.2, 300 sec: 5641.2). Total num frames: 378925056. Throughput: 0: 5097.9. Samples: 378922554. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:28,382][25689] Avg episode reward: [(0, '-48.051')] [2022-07-09 18:34:29,526][26022] Updated weights on worker 0-0, policy_version 370052 (0.00088) [2022-07-09 18:34:31,446][26022] Updated weights on worker 0-0, policy_version 370062 (0.00080) [2022-07-09 18:34:33,393][25689] Fps is (10 sec: 5515.4, 60 sec: 5628.5, 300 sec: 5637.9). Total num frames: 378952704. Throughput: 0: 5968.1. Samples: 378956844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:33,394][25689] Avg episode reward: [(0, '-47.261')] [2022-07-09 18:34:33,494][26022] Updated weights on worker 0-0, policy_version 370072 (0.00089) [2022-07-09 18:34:34,911][26022] Updated weights on worker 0-0, policy_version 370082 (0.00090) [2022-07-09 18:34:36,978][26022] Updated weights on worker 0-0, policy_version 370092 (0.00091) [2022-07-09 18:34:38,364][26022] Updated weights on worker 0-0, policy_version 370102 (0.00094) [2022-07-09 18:34:38,458][25689] Fps is (10 sec: 5893.5, 60 sec: 5658.0, 300 sec: 5648.3). Total num frames: 378984448. Throughput: 0: 5960.8. Samples: 378991452. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:38,459][25689] Avg episode reward: [(0, '-46.626')] [2022-07-09 18:34:40,421][26022] Updated weights on worker 0-0, policy_version 370112 (0.00093) [2022-07-09 18:34:42,137][26022] Updated weights on worker 0-0, policy_version 370122 (0.00087) [2022-07-09 18:34:43,579][25689] Fps is (10 sec: 5831.2, 60 sec: 5657.7, 300 sec: 5643.3). Total num frames: 379012096. Throughput: 0: 5090.4. Samples: 379008484. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:43,579][25689] Avg episode reward: [(0, '-47.281')] [2022-07-09 18:34:43,974][26022] Updated weights on worker 0-0, policy_version 370132 (0.00087) [2022-07-09 18:34:45,701][26022] Updated weights on worker 0-0, policy_version 370142 (0.00085) [2022-07-09 18:34:47,491][26022] Updated weights on worker 0-0, policy_version 370152 (0.00080) [2022-07-09 18:34:48,626][25689] Fps is (10 sec: 5539.0, 60 sec: 5638.3, 300 sec: 5643.9). Total num frames: 379040768. Throughput: 0: 5961.2. Samples: 379043242. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:48,626][25689] Avg episode reward: [(0, '-46.607')] [2022-07-09 18:34:49,301][26022] Updated weights on worker 0-0, policy_version 370162 (0.00066) [2022-07-09 18:34:50,972][26022] Updated weights on worker 0-0, policy_version 370172 (0.00086) [2022-07-09 18:34:52,845][26022] Updated weights on worker 0-0, policy_version 370182 (0.00084) [2022-07-09 18:34:53,630][25689] Fps is (10 sec: 5908.6, 60 sec: 5691.2, 300 sec: 5648.2). Total num frames: 379071488. Throughput: 0: 5983.4. Samples: 379077930. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:53,631][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 18:34:54,677][26022] Updated weights on worker 0-0, policy_version 370192 (0.00051) [2022-07-09 18:34:56,299][26022] Updated weights on worker 0-0, policy_version 370202 (0.00095) [2022-07-09 18:34:58,164][26022] Updated weights on worker 0-0, policy_version 370212 (0.00081) [2022-07-09 18:34:58,647][25689] Fps is (10 sec: 5926.7, 60 sec: 5691.8, 300 sec: 5653.1). Total num frames: 379100160. Throughput: 0: 5136.3. Samples: 379095148. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:34:58,647][25689] Avg episode reward: [(0, '-46.106')] [2022-07-09 18:35:00,119][26022] Updated weights on worker 0-0, policy_version 370222 (0.00088) [2022-07-09 18:35:01,764][26022] Updated weights on worker 0-0, policy_version 370232 (0.00084) [2022-07-09 18:35:03,675][25689] Fps is (10 sec: 5403.1, 60 sec: 5683.5, 300 sec: 5650.1). Total num frames: 379125760. Throughput: 0: 6001.8. Samples: 379129098. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:35:03,676][25689] Avg episode reward: [(0, '-46.005')] [2022-07-09 18:35:04,016][26022] Updated weights on worker 0-0, policy_version 370242 (0.00085) [2022-07-09 18:35:05,431][26022] Updated weights on worker 0-0, policy_version 370252 (0.00092) [2022-07-09 18:35:07,594][26022] Updated weights on worker 0-0, policy_version 370262 (0.00081) [2022-07-09 18:35:08,698][25689] Fps is (10 sec: 5501.3, 60 sec: 5691.0, 300 sec: 5653.4). Total num frames: 379155456. Throughput: 0: 5932.4. Samples: 379162320. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:35:08,699][25689] Avg episode reward: [(0, '-45.511')] [2022-07-09 18:35:09,058][26022] Updated weights on worker 0-0, policy_version 370272 (0.00084) [2022-07-09 18:35:11,204][26022] Updated weights on worker 0-0, policy_version 370282 (0.00092) [2022-07-09 18:35:12,988][26022] Updated weights on worker 0-0, policy_version 370292 (0.00091) [2022-07-09 18:35:13,736][25689] Fps is (10 sec: 5598.0, 60 sec: 5671.8, 300 sec: 5649.3). Total num frames: 379182080. Throughput: 0: 5035.1. Samples: 379179162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:35:13,736][25689] Avg episode reward: [(0, '-46.267')] [2022-07-09 18:35:14,671][26022] Updated weights on worker 0-0, policy_version 370302 (0.00090) [2022-07-09 18:35:16,571][26022] Updated weights on worker 0-0, policy_version 370312 (0.00092) [2022-07-09 18:35:18,147][26022] Updated weights on worker 0-0, policy_version 370322 (0.00085) [2022-07-09 18:35:18,739][25689] Fps is (10 sec: 5711.4, 60 sec: 5690.9, 300 sec: 5654.7). Total num frames: 379212800. Throughput: 0: 5904.6. Samples: 379213782. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:35:18,739][25689] Avg episode reward: [(0, '-47.338')] [2022-07-09 18:35:20,191][26022] Updated weights on worker 0-0, policy_version 370332 (0.00091) [2022-07-09 18:35:21,633][26022] Updated weights on worker 0-0, policy_version 370342 (0.00086) [2022-07-09 18:35:22,028][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:35:22,044][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000370343_379231232.pth [2022-07-09 18:35:22,044][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000368354_377194496.pth [2022-07-09 18:35:23,743][26022] Updated weights on worker 0-0, policy_version 370352 (0.00081) [2022-07-09 18:35:23,799][25689] Fps is (10 sec: 5800.3, 60 sec: 5676.6, 300 sec: 5650.3). Total num frames: 379240448. Throughput: 0: 5926.1. Samples: 379248352. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:35:23,799][25689] Avg episode reward: [(0, '-47.245')] [2022-07-09 18:35:25,406][26022] Updated weights on worker 0-0, policy_version 370362 (0.00095) [2022-07-09 18:35:27,234][26022] Updated weights on worker 0-0, policy_version 370372 (0.00089) [2022-07-09 18:35:28,817][25689] Fps is (10 sec: 5689.9, 60 sec: 5709.9, 300 sec: 5657.5). Total num frames: 379270144. Throughput: 0: 5130.9. Samples: 379265546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 18:35:28,819][25689] Avg episode reward: [(0, '-46.591')] [2022-07-09 18:35:28,858][26022] Updated weights on worker 0-0, policy_version 370382 (0.00085) [2022-07-09 18:35:30,875][26022] Updated weights on worker 0-0, policy_version 370392 (0.00084) [2022-07-09 18:35:32,568][26022] Updated weights on worker 0-0, policy_version 370402 (0.00091) [2022-07-09 18:35:33,837][25689] Fps is (10 sec: 5712.4, 60 sec: 5709.2, 300 sec: 5650.7). Total num frames: 379297792. Throughput: 0: 6004.3. Samples: 379299856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:35:33,838][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 18:35:34,347][26022] Updated weights on worker 0-0, policy_version 370412 (0.00081) [2022-07-09 18:35:36,159][26022] Updated weights on worker 0-0, policy_version 370422 (0.00087) [2022-07-09 18:35:37,872][26022] Updated weights on worker 0-0, policy_version 370432 (0.00085) [2022-07-09 18:35:38,855][25689] Fps is (10 sec: 5610.7, 60 sec: 5662.8, 300 sec: 5658.5). Total num frames: 379326464. Throughput: 0: 5978.1. Samples: 379334038. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:35:38,856][25689] Avg episode reward: [(0, '-46.521')] [2022-07-09 18:35:39,818][26022] Updated weights on worker 0-0, policy_version 370442 (0.00090) [2022-07-09 18:35:41,576][26022] Updated weights on worker 0-0, policy_version 370452 (0.00088) [2022-07-09 18:35:43,414][26022] Updated weights on worker 0-0, policy_version 370462 (0.00087) [2022-07-09 18:35:43,904][25689] Fps is (10 sec: 5798.0, 60 sec: 5703.5, 300 sec: 5654.6). Total num frames: 379356160. Throughput: 0: 5107.5. Samples: 379351040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:35:43,905][25689] Avg episode reward: [(0, '-46.378')] [2022-07-09 18:35:45,313][26022] Updated weights on worker 0-0, policy_version 370472 (0.00085) [2022-07-09 18:35:46,989][26022] Updated weights on worker 0-0, policy_version 370482 (0.00086) [2022-07-09 18:35:48,838][26022] Updated weights on worker 0-0, policy_version 370492 (0.00082) [2022-07-09 18:35:48,911][25689] Fps is (10 sec: 5702.2, 60 sec: 5690.2, 300 sec: 5658.0). Total num frames: 379383808. Throughput: 0: 5946.2. Samples: 379385032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:35:48,913][25689] Avg episode reward: [(0, '-46.701')] [2022-07-09 18:35:50,528][26022] Updated weights on worker 0-0, policy_version 370502 (0.00090) [2022-07-09 18:35:52,251][26022] Updated weights on worker 0-0, policy_version 370512 (0.00082) [2022-07-09 18:35:53,931][25689] Fps is (10 sec: 5514.5, 60 sec: 5637.9, 300 sec: 5650.9). Total num frames: 379411456. Throughput: 0: 5956.5. Samples: 379419548. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:35:53,933][25689] Avg episode reward: [(0, '-46.966')] [2022-07-09 18:35:54,168][26022] Updated weights on worker 0-0, policy_version 370522 (0.00092) [2022-07-09 18:35:55,927][26022] Updated weights on worker 0-0, policy_version 370532 (0.00071) [2022-07-09 18:35:57,755][26022] Updated weights on worker 0-0, policy_version 370542 (0.00090) [2022-07-09 18:35:58,945][25689] Fps is (10 sec: 5817.3, 60 sec: 5672.1, 300 sec: 5658.8). Total num frames: 379442176. Throughput: 0: 5107.3. Samples: 379436644. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:35:58,945][25689] Avg episode reward: [(0, '-46.773')] [2022-07-09 18:35:59,308][26022] Updated weights on worker 0-0, policy_version 370552 (0.00086) [2022-07-09 18:36:01,267][26022] Updated weights on worker 0-0, policy_version 370562 (0.00087) [2022-07-09 18:36:03,568][26022] Updated weights on worker 0-0, policy_version 370572 (0.00080) [2022-07-09 18:36:03,973][25689] Fps is (10 sec: 5608.4, 60 sec: 5672.1, 300 sec: 5659.5). Total num frames: 379467776. Throughput: 0: 5898.3. Samples: 379469416. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:03,974][25689] Avg episode reward: [(0, '-47.187')] [2022-07-09 18:36:05,223][26022] Updated weights on worker 0-0, policy_version 370582 (0.00093) [2022-07-09 18:36:07,102][26022] Updated weights on worker 0-0, policy_version 370592 (0.00087) [2022-07-09 18:36:08,818][26022] Updated weights on worker 0-0, policy_version 370602 (0.00093) [2022-07-09 18:36:09,009][25689] Fps is (10 sec: 5494.3, 60 sec: 5670.9, 300 sec: 5662.8). Total num frames: 379497472. Throughput: 0: 5902.6. Samples: 379503662. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:09,010][25689] Avg episode reward: [(0, '-47.120')] [2022-07-09 18:36:10,579][26022] Updated weights on worker 0-0, policy_version 370612 (0.00093) [2022-07-09 18:36:12,303][26022] Updated weights on worker 0-0, policy_version 370622 (0.00097) [2022-07-09 18:36:14,016][25689] Fps is (10 sec: 5812.2, 60 sec: 5707.8, 300 sec: 5664.1). Total num frames: 379526144. Throughput: 0: 5054.2. Samples: 379521058. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:14,016][25689] Avg episode reward: [(0, '-46.848')] [2022-07-09 18:36:14,038][26022] Updated weights on worker 0-0, policy_version 370632 (0.00088) [2022-07-09 18:36:16,156][26022] Updated weights on worker 0-0, policy_version 370642 (0.00086) [2022-07-09 18:36:17,662][26022] Updated weights on worker 0-0, policy_version 370652 (0.00089) [2022-07-09 18:36:19,037][25689] Fps is (10 sec: 5616.2, 60 sec: 5655.1, 300 sec: 5661.5). Total num frames: 379553792. Throughput: 0: 5899.4. Samples: 379555176. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:19,038][25689] Avg episode reward: [(0, '-46.568')] [2022-07-09 18:36:19,543][26022] Updated weights on worker 0-0, policy_version 370662 (0.00084) [2022-07-09 18:36:21,419][26022] Updated weights on worker 0-0, policy_version 370672 (0.00086) [2022-07-09 18:36:22,994][26022] Updated weights on worker 0-0, policy_version 370682 (0.00088) [2022-07-09 18:36:24,120][25689] Fps is (10 sec: 5574.1, 60 sec: 5669.9, 300 sec: 5660.2). Total num frames: 379582464. Throughput: 0: 5952.6. Samples: 379589338. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:24,120][25689] Avg episode reward: [(0, '-46.471')] [2022-07-09 18:36:25,032][26022] Updated weights on worker 0-0, policy_version 370692 (0.00093) [2022-07-09 18:36:26,841][26022] Updated weights on worker 0-0, policy_version 370702 (0.00054) [2022-07-09 18:36:28,646][26022] Updated weights on worker 0-0, policy_version 370712 (0.00089) [2022-07-09 18:36:29,130][25689] Fps is (10 sec: 5681.8, 60 sec: 5653.7, 300 sec: 5660.1). Total num frames: 379611136. Throughput: 0: 5095.1. Samples: 379606178. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:29,130][25689] Avg episode reward: [(0, '-46.340')] [2022-07-09 18:36:30,306][26022] Updated weights on worker 0-0, policy_version 370722 (0.00089) [2022-07-09 18:36:32,215][26022] Updated weights on worker 0-0, policy_version 370732 (0.00087) [2022-07-09 18:36:33,842][26022] Updated weights on worker 0-0, policy_version 370742 (0.00088) [2022-07-09 18:36:34,169][25689] Fps is (10 sec: 5807.9, 60 sec: 5685.8, 300 sec: 5662.8). Total num frames: 379640832. Throughput: 0: 5927.4. Samples: 379640516. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:34,170][25689] Avg episode reward: [(0, '-45.927')] [2022-07-09 18:36:35,922][26022] Updated weights on worker 0-0, policy_version 370752 (0.00093) [2022-07-09 18:36:37,607][26022] Updated weights on worker 0-0, policy_version 370762 (0.00089) [2022-07-09 18:36:39,175][25689] Fps is (10 sec: 5606.8, 60 sec: 5653.1, 300 sec: 5660.1). Total num frames: 379667456. Throughput: 0: 5937.5. Samples: 379674742. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:39,176][25689] Avg episode reward: [(0, '-45.759')] [2022-07-09 18:36:39,435][26022] Updated weights on worker 0-0, policy_version 370772 (0.00094) [2022-07-09 18:36:41,346][26022] Updated weights on worker 0-0, policy_version 370782 (0.00091) [2022-07-09 18:36:42,928][26022] Updated weights on worker 0-0, policy_version 370792 (0.00086) [2022-07-09 18:36:44,235][25689] Fps is (10 sec: 5697.0, 60 sec: 5669.0, 300 sec: 5663.0). Total num frames: 379698176. Throughput: 0: 5096.5. Samples: 379691856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:44,236][25689] Avg episode reward: [(0, '-46.112')] [2022-07-09 18:36:44,820][26022] Updated weights on worker 0-0, policy_version 370802 (0.00093) [2022-07-09 18:36:46,663][26022] Updated weights on worker 0-0, policy_version 370812 (0.00078) [2022-07-09 18:36:48,533][26022] Updated weights on worker 0-0, policy_version 370822 (0.00263) [2022-07-09 18:36:49,305][25689] Fps is (10 sec: 5761.6, 60 sec: 5663.1, 300 sec: 5661.8). Total num frames: 379725824. Throughput: 0: 5926.3. Samples: 379725744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:49,307][25689] Avg episode reward: [(0, '-45.557')] [2022-07-09 18:36:50,224][26022] Updated weights on worker 0-0, policy_version 370832 (0.00090) [2022-07-09 18:36:52,120][26022] Updated weights on worker 0-0, policy_version 370842 (0.00095) [2022-07-09 18:36:53,725][26022] Updated weights on worker 0-0, policy_version 370852 (0.00085) [2022-07-09 18:36:54,338][25689] Fps is (10 sec: 5676.0, 60 sec: 5695.8, 300 sec: 5664.7). Total num frames: 379755520. Throughput: 0: 5930.3. Samples: 379760122. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:54,338][25689] Avg episode reward: [(0, '-45.602')] [2022-07-09 18:36:55,756][26022] Updated weights on worker 0-0, policy_version 370862 (0.00079) [2022-07-09 18:36:57,350][26022] Updated weights on worker 0-0, policy_version 370872 (0.00101) [2022-07-09 18:36:59,189][26022] Updated weights on worker 0-0, policy_version 370882 (0.00090) [2022-07-09 18:36:59,355][25689] Fps is (10 sec: 5807.9, 60 sec: 5661.6, 300 sec: 5668.7). Total num frames: 379784192. Throughput: 0: 5082.1. Samples: 379777300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:36:59,355][25689] Avg episode reward: [(0, '-46.128')] [2022-07-09 18:37:01,098][26022] Updated weights on worker 0-0, policy_version 370892 (0.00085) [2022-07-09 18:37:03,271][26022] Updated weights on worker 0-0, policy_version 370902 (0.00099) [2022-07-09 18:37:04,471][25689] Fps is (10 sec: 5355.7, 60 sec: 5653.3, 300 sec: 5663.6). Total num frames: 379809792. Throughput: 0: 5786.8. Samples: 379808960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:04,473][25689] Avg episode reward: [(0, '-46.748')] [2022-07-09 18:37:05,112][26022] Updated weights on worker 0-0, policy_version 370912 (0.00089) [2022-07-09 18:37:06,891][26022] Updated weights on worker 0-0, policy_version 370922 (0.00079) [2022-07-09 18:37:08,651][26022] Updated weights on worker 0-0, policy_version 370932 (0.00053) [2022-07-09 18:37:09,475][25689] Fps is (10 sec: 5363.0, 60 sec: 5639.4, 300 sec: 5660.2). Total num frames: 379838464. Throughput: 0: 5830.6. Samples: 379843344. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:09,475][25689] Avg episode reward: [(0, '-46.823')] [2022-07-09 18:37:10,560][26022] Updated weights on worker 0-0, policy_version 370942 (0.00092) [2022-07-09 18:37:12,249][26022] Updated weights on worker 0-0, policy_version 370952 (0.00085) [2022-07-09 18:37:14,054][26022] Updated weights on worker 0-0, policy_version 370962 (0.00437) [2022-07-09 18:37:14,554][25689] Fps is (10 sec: 5789.1, 60 sec: 5649.6, 300 sec: 5665.7). Total num frames: 379868160. Throughput: 0: 4970.9. Samples: 379860614. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:14,555][25689] Avg episode reward: [(0, '-48.036')] [2022-07-09 18:37:15,735][26022] Updated weights on worker 0-0, policy_version 370972 (0.00084) [2022-07-09 18:37:17,563][26022] Updated weights on worker 0-0, policy_version 370982 (0.00091) [2022-07-09 18:37:19,414][26022] Updated weights on worker 0-0, policy_version 370992 (0.00092) [2022-07-09 18:37:19,582][25689] Fps is (10 sec: 5775.0, 60 sec: 5665.9, 300 sec: 5663.3). Total num frames: 379896832. Throughput: 0: 5823.9. Samples: 379895098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:19,582][25689] Avg episode reward: [(0, '-48.282')] [2022-07-09 18:37:21,110][26022] Updated weights on worker 0-0, policy_version 371002 (0.00502) [2022-07-09 18:37:22,186][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:37:22,198][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000371008_379912192.pth [2022-07-09 18:37:22,199][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000369015_377871360.pth [2022-07-09 18:37:23,000][26022] Updated weights on worker 0-0, policy_version 371012 (0.00111) [2022-07-09 18:37:24,639][25689] Fps is (10 sec: 5787.3, 60 sec: 5685.1, 300 sec: 5669.8). Total num frames: 379926528. Throughput: 0: 5976.5. Samples: 379929494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:24,640][25689] Avg episode reward: [(0, '-47.631')] [2022-07-09 18:37:24,647][26022] Updated weights on worker 0-0, policy_version 371022 (0.00088) [2022-07-09 18:37:26,401][26022] Updated weights on worker 0-0, policy_version 371032 (0.00449) [2022-07-09 18:37:28,281][26022] Updated weights on worker 0-0, policy_version 371042 (0.00084) [2022-07-09 18:37:29,674][25689] Fps is (10 sec: 5783.6, 60 sec: 5682.9, 300 sec: 5665.9). Total num frames: 379955200. Throughput: 0: 5965.4. Samples: 379963840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:29,674][25689] Avg episode reward: [(0, '-47.151')] [2022-07-09 18:37:29,987][26022] Updated weights on worker 0-0, policy_version 371052 (0.00084) [2022-07-09 18:37:32,031][26022] Updated weights on worker 0-0, policy_version 371062 (0.00081) [2022-07-09 18:37:33,638][26022] Updated weights on worker 0-0, policy_version 371072 (0.00088) [2022-07-09 18:37:34,684][25689] Fps is (10 sec: 5708.9, 60 sec: 5668.7, 300 sec: 5666.2). Total num frames: 379983872. Throughput: 0: 5978.1. Samples: 379980954. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:34,685][25689] Avg episode reward: [(0, '-46.972')] [2022-07-09 18:37:35,380][26022] Updated weights on worker 0-0, policy_version 371082 (0.00083) [2022-07-09 18:37:37,170][26022] Updated weights on worker 0-0, policy_version 371092 (0.00092) [2022-07-09 18:37:39,088][26022] Updated weights on worker 0-0, policy_version 371102 (0.00091) [2022-07-09 18:37:39,711][25689] Fps is (10 sec: 5712.8, 60 sec: 5700.5, 300 sec: 5670.1). Total num frames: 380012544. Throughput: 0: 5989.4. Samples: 380015664. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:39,712][25689] Avg episode reward: [(0, '-46.805')] [2022-07-09 18:37:40,691][26022] Updated weights on worker 0-0, policy_version 371112 (0.00085) [2022-07-09 18:37:42,532][26022] Updated weights on worker 0-0, policy_version 371122 (0.00096) [2022-07-09 18:37:44,495][26022] Updated weights on worker 0-0, policy_version 371132 (0.00097) [2022-07-09 18:37:44,749][25689] Fps is (10 sec: 5595.5, 60 sec: 5651.8, 300 sec: 5666.1). Total num frames: 380040192. Throughput: 0: 5970.3. Samples: 380049556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:44,750][25689] Avg episode reward: [(0, '-46.800')] [2022-07-09 18:37:46,146][26022] Updated weights on worker 0-0, policy_version 371142 (0.00086) [2022-07-09 18:37:48,174][26022] Updated weights on worker 0-0, policy_version 371152 (0.00085) [2022-07-09 18:37:49,629][26022] Updated weights on worker 0-0, policy_version 371162 (0.00092) [2022-07-09 18:37:49,783][25689] Fps is (10 sec: 5795.6, 60 sec: 5706.1, 300 sec: 5672.6). Total num frames: 380070912. Throughput: 0: 5137.4. Samples: 380067148. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:49,783][25689] Avg episode reward: [(0, '-46.712')] [2022-07-09 18:37:51,701][26022] Updated weights on worker 0-0, policy_version 371172 (0.00085) [2022-07-09 18:37:53,200][26022] Updated weights on worker 0-0, policy_version 371182 (0.00087) [2022-07-09 18:37:54,796][25689] Fps is (10 sec: 5707.5, 60 sec: 5657.0, 300 sec: 5665.7). Total num frames: 380097536. Throughput: 0: 6012.4. Samples: 380101878. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:54,798][25689] Avg episode reward: [(0, '-46.913')] [2022-07-09 18:37:55,186][26022] Updated weights on worker 0-0, policy_version 371192 (0.00083) [2022-07-09 18:37:56,762][26022] Updated weights on worker 0-0, policy_version 371202 (0.00082) [2022-07-09 18:37:58,631][26022] Updated weights on worker 0-0, policy_version 371212 (0.00083) [2022-07-09 18:37:59,814][25689] Fps is (10 sec: 5716.4, 60 sec: 5690.8, 300 sec: 5680.1). Total num frames: 380128256. Throughput: 0: 6016.4. Samples: 380136610. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:37:59,815][25689] Avg episode reward: [(0, '-47.342')] [2022-07-09 18:38:00,291][26022] Updated weights on worker 0-0, policy_version 371222 (0.00080) [2022-07-09 18:38:02,479][26022] Updated weights on worker 0-0, policy_version 371232 (0.00088) [2022-07-09 18:38:04,262][26022] Updated weights on worker 0-0, policy_version 371242 (0.00082) [2022-07-09 18:38:04,878][25689] Fps is (10 sec: 5586.3, 60 sec: 5695.8, 300 sec: 5668.6). Total num frames: 380153856. Throughput: 0: 5078.7. Samples: 380151784. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 18:38:04,880][25689] Avg episode reward: [(0, '-46.535')] [2022-07-09 18:38:06,023][26022] Updated weights on worker 0-0, policy_version 371252 (0.00085) [2022-07-09 18:38:07,971][26022] Updated weights on worker 0-0, policy_version 371262 (0.00083) [2022-07-09 18:38:09,478][26022] Updated weights on worker 0-0, policy_version 371272 (0.00085) [2022-07-09 18:38:09,939][25689] Fps is (10 sec: 5461.3, 60 sec: 5707.3, 300 sec: 5670.9). Total num frames: 380183552. Throughput: 0: 5890.3. Samples: 380185876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:09,941][25689] Avg episode reward: [(0, '-46.371')] [2022-07-09 18:38:11,638][26022] Updated weights on worker 0-0, policy_version 371282 (0.00080) [2022-07-09 18:38:13,183][26022] Updated weights on worker 0-0, policy_version 371292 (0.00087) [2022-07-09 18:38:14,976][25689] Fps is (10 sec: 5780.4, 60 sec: 5694.4, 300 sec: 5670.3). Total num frames: 380212224. Throughput: 0: 5861.7. Samples: 380220164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:14,978][25689] Avg episode reward: [(0, '-46.872')] [2022-07-09 18:38:15,038][26022] Updated weights on worker 0-0, policy_version 371302 (0.00085) [2022-07-09 18:38:17,015][26022] Updated weights on worker 0-0, policy_version 371312 (0.00081) [2022-07-09 18:38:18,621][26022] Updated weights on worker 0-0, policy_version 371322 (0.00088) [2022-07-09 18:38:20,029][25689] Fps is (10 sec: 5581.8, 60 sec: 5675.0, 300 sec: 5668.4). Total num frames: 380239872. Throughput: 0: 4975.8. Samples: 380237196. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:20,031][25689] Avg episode reward: [(0, '-46.445')] [2022-07-09 18:38:20,724][26022] Updated weights on worker 0-0, policy_version 371332 (0.00087) [2022-07-09 18:38:22,367][26022] Updated weights on worker 0-0, policy_version 371342 (0.00090) [2022-07-09 18:38:24,359][26022] Updated weights on worker 0-0, policy_version 371352 (0.00100) [2022-07-09 18:38:25,117][25689] Fps is (10 sec: 5654.3, 60 sec: 5672.1, 300 sec: 5668.3). Total num frames: 380269568. Throughput: 0: 5901.8. Samples: 380271232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:25,118][25689] Avg episode reward: [(0, '-47.362')] [2022-07-09 18:38:25,934][26022] Updated weights on worker 0-0, policy_version 371362 (0.00090) [2022-07-09 18:38:27,787][26022] Updated weights on worker 0-0, policy_version 371372 (0.00088) [2022-07-09 18:38:29,368][26022] Updated weights on worker 0-0, policy_version 371382 (0.00093) [2022-07-09 18:38:30,130][25689] Fps is (10 sec: 5677.2, 60 sec: 5657.3, 300 sec: 5669.0). Total num frames: 380297216. Throughput: 0: 5924.9. Samples: 380305504. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:30,130][25689] Avg episode reward: [(0, '-46.889')] [2022-07-09 18:38:31,498][26022] Updated weights on worker 0-0, policy_version 371392 (0.00096) [2022-07-09 18:38:33,224][26022] Updated weights on worker 0-0, policy_version 371402 (0.00093) [2022-07-09 18:38:35,033][26022] Updated weights on worker 0-0, policy_version 371412 (0.00087) [2022-07-09 18:38:35,151][25689] Fps is (10 sec: 5715.1, 60 sec: 5673.2, 300 sec: 5668.9). Total num frames: 380326912. Throughput: 0: 5075.0. Samples: 380322554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:35,152][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 18:38:36,655][26022] Updated weights on worker 0-0, policy_version 371422 (0.00089) [2022-07-09 18:38:38,404][26022] Updated weights on worker 0-0, policy_version 371432 (0.00095) [2022-07-09 18:38:40,171][25689] Fps is (10 sec: 5813.0, 60 sec: 5673.9, 300 sec: 5674.3). Total num frames: 380355584. Throughput: 0: 5946.7. Samples: 380356972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:40,171][25689] Avg episode reward: [(0, '-48.357')] [2022-07-09 18:38:40,502][26022] Updated weights on worker 0-0, policy_version 371442 (0.00090) [2022-07-09 18:38:42,036][26022] Updated weights on worker 0-0, policy_version 371452 (0.00088) [2022-07-09 18:38:43,831][26022] Updated weights on worker 0-0, policy_version 371462 (0.00089) [2022-07-09 18:38:45,220][25689] Fps is (10 sec: 5695.4, 60 sec: 5689.8, 300 sec: 5670.3). Total num frames: 380384256. Throughput: 0: 5968.5. Samples: 380391212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:45,220][25689] Avg episode reward: [(0, '-47.790')] [2022-07-09 18:38:45,769][26022] Updated weights on worker 0-0, policy_version 371472 (0.00092) [2022-07-09 18:38:47,438][26022] Updated weights on worker 0-0, policy_version 371482 (0.00087) [2022-07-09 18:38:49,254][26022] Updated weights on worker 0-0, policy_version 371492 (0.00096) [2022-07-09 18:38:50,255][25689] Fps is (10 sec: 5686.2, 60 sec: 5655.7, 300 sec: 5673.6). Total num frames: 380412928. Throughput: 0: 5108.6. Samples: 380408318. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:50,256][25689] Avg episode reward: [(0, '-47.689')] [2022-07-09 18:38:51,080][26022] Updated weights on worker 0-0, policy_version 371502 (0.00097) [2022-07-09 18:38:52,847][26022] Updated weights on worker 0-0, policy_version 371512 (0.00095) [2022-07-09 18:38:54,574][26022] Updated weights on worker 0-0, policy_version 371522 (0.00089) [2022-07-09 18:38:55,295][25689] Fps is (10 sec: 5691.7, 60 sec: 5687.2, 300 sec: 5673.3). Total num frames: 380441600. Throughput: 0: 5963.7. Samples: 380442684. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:38:55,295][25689] Avg episode reward: [(0, '-47.747')] [2022-07-09 18:38:56,544][26022] Updated weights on worker 0-0, policy_version 371532 (0.00084) [2022-07-09 18:38:58,014][26022] Updated weights on worker 0-0, policy_version 371542 (0.00090) [2022-07-09 18:39:00,020][26022] Updated weights on worker 0-0, policy_version 371552 (0.00090) [2022-07-09 18:39:00,315][25689] Fps is (10 sec: 5700.6, 60 sec: 5653.1, 300 sec: 5682.1). Total num frames: 380470272. Throughput: 0: 5953.4. Samples: 380476898. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:00,315][25689] Avg episode reward: [(0, '-48.937')] [2022-07-09 18:39:02,368][26022] Updated weights on worker 0-0, policy_version 371562 (0.00091) [2022-07-09 18:39:04,030][26022] Updated weights on worker 0-0, policy_version 371572 (0.00094) [2022-07-09 18:39:05,426][25689] Fps is (10 sec: 5356.8, 60 sec: 5648.7, 300 sec: 5668.1). Total num frames: 380495872. Throughput: 0: 4970.5. Samples: 380491648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:05,427][25689] Avg episode reward: [(0, '-48.656')] [2022-07-09 18:39:05,820][26022] Updated weights on worker 0-0, policy_version 371582 (0.00091) [2022-07-09 18:39:07,649][26022] Updated weights on worker 0-0, policy_version 371592 (0.00088) [2022-07-09 18:39:09,303][26022] Updated weights on worker 0-0, policy_version 371602 (0.00092) [2022-07-09 18:39:10,440][25689] Fps is (10 sec: 5562.3, 60 sec: 5670.0, 300 sec: 5678.4). Total num frames: 380526592. Throughput: 0: 5839.0. Samples: 380526176. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:10,440][25689] Avg episode reward: [(0, '-47.743')] [2022-07-09 18:39:11,295][26022] Updated weights on worker 0-0, policy_version 371612 (0.00085) [2022-07-09 18:39:12,971][26022] Updated weights on worker 0-0, policy_version 371622 (0.00091) [2022-07-09 18:39:14,787][26022] Updated weights on worker 0-0, policy_version 371632 (0.00080) [2022-07-09 18:39:15,478][25689] Fps is (10 sec: 5806.6, 60 sec: 5652.9, 300 sec: 5671.3). Total num frames: 380554240. Throughput: 0: 5838.5. Samples: 380560524. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:15,479][25689] Avg episode reward: [(0, '-47.644')] [2022-07-09 18:39:16,570][26022] Updated weights on worker 0-0, policy_version 371642 (0.00081) [2022-07-09 18:39:18,547][26022] Updated weights on worker 0-0, policy_version 371652 (0.00087) [2022-07-09 18:39:20,176][26022] Updated weights on worker 0-0, policy_version 371662 (0.00092) [2022-07-09 18:39:20,485][25689] Fps is (10 sec: 5708.9, 60 sec: 5691.2, 300 sec: 5676.3). Total num frames: 380583936. Throughput: 0: 4989.7. Samples: 380577540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:20,485][25689] Avg episode reward: [(0, '-47.858')] [2022-07-09 18:39:21,922][26022] Updated weights on worker 0-0, policy_version 371672 (0.00093) [2022-07-09 18:39:22,379][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:39:22,392][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000371674_380594176.pth [2022-07-09 18:39:22,393][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000369677_378549248.pth [2022-07-09 18:39:23,855][26022] Updated weights on worker 0-0, policy_version 371682 (0.00100) [2022-07-09 18:39:25,545][25689] Fps is (10 sec: 5696.2, 60 sec: 5659.9, 300 sec: 5675.4). Total num frames: 380611584. Throughput: 0: 5966.9. Samples: 380611696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:25,547][25689] Avg episode reward: [(0, '-46.813')] [2022-07-09 18:39:25,581][26022] Updated weights on worker 0-0, policy_version 371692 (0.00081) [2022-07-09 18:39:27,535][26022] Updated weights on worker 0-0, policy_version 371702 (0.00093) [2022-07-09 18:39:29,166][26022] Updated weights on worker 0-0, policy_version 371712 (0.00086) [2022-07-09 18:39:30,549][25689] Fps is (10 sec: 5494.4, 60 sec: 5660.7, 300 sec: 5675.6). Total num frames: 380639232. Throughput: 0: 5953.9. Samples: 380645902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:30,549][25689] Avg episode reward: [(0, '-47.100')] [2022-07-09 18:39:30,975][26022] Updated weights on worker 0-0, policy_version 371722 (0.00090) [2022-07-09 18:39:32,613][26022] Updated weights on worker 0-0, policy_version 371732 (0.00092) [2022-07-09 18:39:34,679][26022] Updated weights on worker 0-0, policy_version 371742 (0.00083) [2022-07-09 18:39:35,635][25689] Fps is (10 sec: 5784.9, 60 sec: 5671.6, 300 sec: 5671.7). Total num frames: 380669952. Throughput: 0: 5091.8. Samples: 380663158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:35,635][25689] Avg episode reward: [(0, '-47.871')] [2022-07-09 18:39:36,330][26022] Updated weights on worker 0-0, policy_version 371752 (0.00093) [2022-07-09 18:39:38,099][26022] Updated weights on worker 0-0, policy_version 371762 (0.00079) [2022-07-09 18:39:39,865][26022] Updated weights on worker 0-0, policy_version 371772 (0.00089) [2022-07-09 18:39:40,642][25689] Fps is (10 sec: 5782.9, 60 sec: 5655.9, 300 sec: 5673.9). Total num frames: 380697600. Throughput: 0: 5945.5. Samples: 380697384. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:40,642][25689] Avg episode reward: [(0, '-48.675')] [2022-07-09 18:39:41,778][26022] Updated weights on worker 0-0, policy_version 371782 (0.00090) [2022-07-09 18:39:43,406][26022] Updated weights on worker 0-0, policy_version 371792 (0.00083) [2022-07-09 18:39:45,275][26022] Updated weights on worker 0-0, policy_version 371802 (0.00085) [2022-07-09 18:39:45,757][25689] Fps is (10 sec: 5664.9, 60 sec: 5666.5, 300 sec: 5676.0). Total num frames: 380727296. Throughput: 0: 5950.3. Samples: 380731966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:45,758][25689] Avg episode reward: [(0, '-48.025')] [2022-07-09 18:39:46,967][26022] Updated weights on worker 0-0, policy_version 371812 (0.00092) [2022-07-09 18:39:48,899][26022] Updated weights on worker 0-0, policy_version 371822 (0.00082) [2022-07-09 18:39:50,557][26022] Updated weights on worker 0-0, policy_version 371832 (0.00094) [2022-07-09 18:39:50,839][25689] Fps is (10 sec: 5824.6, 60 sec: 5679.2, 300 sec: 5671.1). Total num frames: 380756992. Throughput: 0: 5945.7. Samples: 380766540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:50,839][25689] Avg episode reward: [(0, '-48.732')] [2022-07-09 18:39:52,330][26022] Updated weights on worker 0-0, policy_version 371842 (0.00089) [2022-07-09 18:39:54,237][26022] Updated weights on worker 0-0, policy_version 371852 (0.00086) [2022-07-09 18:39:55,856][25689] Fps is (10 sec: 5779.9, 60 sec: 5681.3, 300 sec: 5671.1). Total num frames: 380785664. Throughput: 0: 5969.5. Samples: 380783868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:39:55,856][25689] Avg episode reward: [(0, '-48.881')] [2022-07-09 18:39:56,020][26022] Updated weights on worker 0-0, policy_version 371862 (0.00092) [2022-07-09 18:39:57,723][26022] Updated weights on worker 0-0, policy_version 371872 (0.00081) [2022-07-09 18:39:59,534][26022] Updated weights on worker 0-0, policy_version 371882 (0.00090) [2022-07-09 18:40:00,951][25689] Fps is (10 sec: 5772.0, 60 sec: 5691.1, 300 sec: 5683.6). Total num frames: 380815360. Throughput: 0: 5963.0. Samples: 380818486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:00,951][25689] Avg episode reward: [(0, '-47.929')] [2022-07-09 18:40:01,176][26022] Updated weights on worker 0-0, policy_version 371892 (0.00084) [2022-07-09 18:40:03,500][26022] Updated weights on worker 0-0, policy_version 371902 (0.00090) [2022-07-09 18:40:05,194][26022] Updated weights on worker 0-0, policy_version 371912 (0.00093) [2022-07-09 18:40:06,038][25689] Fps is (10 sec: 5531.2, 60 sec: 5710.3, 300 sec: 5672.1). Total num frames: 380841984. Throughput: 0: 5865.6. Samples: 380850924. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:06,039][25689] Avg episode reward: [(0, '-47.434')] [2022-07-09 18:40:07,007][26022] Updated weights on worker 0-0, policy_version 371922 (0.00089) [2022-07-09 18:40:08,735][26022] Updated weights on worker 0-0, policy_version 371932 (0.00087) [2022-07-09 18:40:10,603][26022] Updated weights on worker 0-0, policy_version 371942 (0.00084) [2022-07-09 18:40:11,041][25689] Fps is (10 sec: 5480.1, 60 sec: 5677.5, 300 sec: 5679.6). Total num frames: 380870656. Throughput: 0: 5032.2. Samples: 380868204. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:11,042][25689] Avg episode reward: [(0, '-47.931')] [2022-07-09 18:40:12,392][26022] Updated weights on worker 0-0, policy_version 371952 (0.00086) [2022-07-09 18:40:14,243][26022] Updated weights on worker 0-0, policy_version 371962 (0.00109) [2022-07-09 18:40:15,936][26022] Updated weights on worker 0-0, policy_version 371972 (0.00095) [2022-07-09 18:40:16,147][25689] Fps is (10 sec: 5672.6, 60 sec: 5688.0, 300 sec: 5670.7). Total num frames: 380899328. Throughput: 0: 5833.7. Samples: 380902240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:16,148][25689] Avg episode reward: [(0, '-47.169')] [2022-07-09 18:40:17,679][26022] Updated weights on worker 0-0, policy_version 371982 (0.00092) [2022-07-09 18:40:19,774][26022] Updated weights on worker 0-0, policy_version 371992 (0.00082) [2022-07-09 18:40:21,187][25689] Fps is (10 sec: 5651.9, 60 sec: 5668.0, 300 sec: 5674.6). Total num frames: 380928000. Throughput: 0: 5819.7. Samples: 380936256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:21,188][25689] Avg episode reward: [(0, '-47.075')] [2022-07-09 18:40:21,469][26022] Updated weights on worker 0-0, policy_version 372002 (0.00090) [2022-07-09 18:40:23,338][26022] Updated weights on worker 0-0, policy_version 372012 (0.00086) [2022-07-09 18:40:24,980][26022] Updated weights on worker 0-0, policy_version 372022 (0.00083) [2022-07-09 18:40:26,287][25689] Fps is (10 sec: 5756.1, 60 sec: 5698.0, 300 sec: 5673.0). Total num frames: 380957696. Throughput: 0: 5060.4. Samples: 380953394. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:26,288][25689] Avg episode reward: [(0, '-47.561')] [2022-07-09 18:40:26,897][26022] Updated weights on worker 0-0, policy_version 372032 (0.00088) [2022-07-09 18:40:28,640][26022] Updated weights on worker 0-0, policy_version 372042 (0.00089) [2022-07-09 18:40:30,527][26022] Updated weights on worker 0-0, policy_version 372052 (0.00089) [2022-07-09 18:40:31,311][25689] Fps is (10 sec: 5563.3, 60 sec: 5679.3, 300 sec: 5669.5). Total num frames: 380984320. Throughput: 0: 5882.1. Samples: 380987432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:31,311][25689] Avg episode reward: [(0, '-47.136')] [2022-07-09 18:40:32,192][26022] Updated weights on worker 0-0, policy_version 372062 (0.00086) [2022-07-09 18:40:33,974][26022] Updated weights on worker 0-0, policy_version 372072 (0.00101) [2022-07-09 18:40:35,841][26022] Updated weights on worker 0-0, policy_version 372082 (0.00089) [2022-07-09 18:40:36,322][25689] Fps is (10 sec: 5714.3, 60 sec: 5686.3, 300 sec: 5676.5). Total num frames: 381015040. Throughput: 0: 5912.8. Samples: 381021532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:36,323][25689] Avg episode reward: [(0, '-48.209')] [2022-07-09 18:40:37,779][26022] Updated weights on worker 0-0, policy_version 372092 (0.00088) [2022-07-09 18:40:39,375][26022] Updated weights on worker 0-0, policy_version 372102 (0.00085) [2022-07-09 18:40:41,226][26022] Updated weights on worker 0-0, policy_version 372112 (0.00084) [2022-07-09 18:40:41,333][25689] Fps is (10 sec: 5926.2, 60 sec: 5702.9, 300 sec: 5673.8). Total num frames: 381043712. Throughput: 0: 5091.9. Samples: 381038836. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 18:40:41,333][25689] Avg episode reward: [(0, '-46.213')] [2022-07-09 18:40:42,951][26022] Updated weights on worker 0-0, policy_version 372122 (0.00093) [2022-07-09 18:40:44,700][26022] Updated weights on worker 0-0, policy_version 372132 (0.00085) [2022-07-09 18:40:46,371][25689] Fps is (10 sec: 5605.1, 60 sec: 5676.4, 300 sec: 5673.2). Total num frames: 381071360. Throughput: 0: 5962.1. Samples: 381073132. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:40:46,371][25689] Avg episode reward: [(0, '-46.227')] [2022-07-09 18:40:46,653][26022] Updated weights on worker 0-0, policy_version 372142 (0.00083) [2022-07-09 18:40:48,361][26022] Updated weights on worker 0-0, policy_version 372152 (0.00093) [2022-07-09 18:40:50,323][26022] Updated weights on worker 0-0, policy_version 372162 (0.00081) [2022-07-09 18:40:51,407][25689] Fps is (10 sec: 5692.3, 60 sec: 5680.6, 300 sec: 5679.8). Total num frames: 381101056. Throughput: 0: 5953.0. Samples: 381107064. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:40:51,407][25689] Avg episode reward: [(0, '-46.071')] [2022-07-09 18:40:52,022][26022] Updated weights on worker 0-0, policy_version 372172 (0.00087) [2022-07-09 18:40:53,849][26022] Updated weights on worker 0-0, policy_version 372182 (0.00091) [2022-07-09 18:40:55,816][26022] Updated weights on worker 0-0, policy_version 372192 (0.00084) [2022-07-09 18:40:56,414][25689] Fps is (10 sec: 5709.8, 60 sec: 5664.7, 300 sec: 5669.6). Total num frames: 381128704. Throughput: 0: 5114.2. Samples: 381124282. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:40:56,414][25689] Avg episode reward: [(0, '-47.665')] [2022-07-09 18:40:57,423][26022] Updated weights on worker 0-0, policy_version 372202 (0.00087) [2022-07-09 18:40:59,328][26022] Updated weights on worker 0-0, policy_version 372212 (0.00084) [2022-07-09 18:41:00,993][26022] Updated weights on worker 0-0, policy_version 372222 (0.00084) [2022-07-09 18:41:01,431][25689] Fps is (10 sec: 5618.4, 60 sec: 5655.0, 300 sec: 5680.1). Total num frames: 381157376. Throughput: 0: 5948.5. Samples: 381158390. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:01,431][25689] Avg episode reward: [(0, '-46.964')] [2022-07-09 18:41:03,187][26022] Updated weights on worker 0-0, policy_version 372232 (0.00083) [2022-07-09 18:41:04,946][26022] Updated weights on worker 0-0, policy_version 372242 (0.00084) [2022-07-09 18:41:06,513][25689] Fps is (10 sec: 5475.2, 60 sec: 5655.5, 300 sec: 5668.9). Total num frames: 381184000. Throughput: 0: 5842.9. Samples: 381190822. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:06,513][25689] Avg episode reward: [(0, '-47.511')] [2022-07-09 18:41:06,883][26022] Updated weights on worker 0-0, policy_version 372252 (0.00084) [2022-07-09 18:41:08,419][26022] Updated weights on worker 0-0, policy_version 372262 (0.00092) [2022-07-09 18:41:10,556][26022] Updated weights on worker 0-0, policy_version 372272 (0.00092) [2022-07-09 18:41:11,557][25689] Fps is (10 sec: 5460.9, 60 sec: 5651.7, 300 sec: 5668.2). Total num frames: 381212672. Throughput: 0: 4995.9. Samples: 381207732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:11,558][25689] Avg episode reward: [(0, '-48.041')] [2022-07-09 18:41:12,026][26022] Updated weights on worker 0-0, policy_version 372282 (0.00089) [2022-07-09 18:41:14,055][26022] Updated weights on worker 0-0, policy_version 372292 (0.00091) [2022-07-09 18:41:15,585][26022] Updated weights on worker 0-0, policy_version 372302 (0.00095) [2022-07-09 18:41:16,564][25689] Fps is (10 sec: 5705.2, 60 sec: 5660.9, 300 sec: 5671.9). Total num frames: 381241344. Throughput: 0: 5842.9. Samples: 381242020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:16,565][25689] Avg episode reward: [(0, '-48.437')] [2022-07-09 18:41:17,650][26022] Updated weights on worker 0-0, policy_version 372312 (0.00087) [2022-07-09 18:41:19,344][26022] Updated weights on worker 0-0, policy_version 372322 (0.00090) [2022-07-09 18:41:21,349][26022] Updated weights on worker 0-0, policy_version 372332 (0.00082) [2022-07-09 18:41:21,583][25689] Fps is (10 sec: 5617.1, 60 sec: 5645.9, 300 sec: 5669.7). Total num frames: 381268992. Throughput: 0: 5845.5. Samples: 381276192. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:21,584][25689] Avg episode reward: [(0, '-48.656')] [2022-07-09 18:41:22,480][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:41:22,488][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000372340_381276160.pth [2022-07-09 18:41:22,489][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000370343_379231232.pth [2022-07-09 18:41:22,862][26022] Updated weights on worker 0-0, policy_version 372342 (0.00091) [2022-07-09 18:41:24,948][26022] Updated weights on worker 0-0, policy_version 372352 (0.00084) [2022-07-09 18:41:26,471][26022] Updated weights on worker 0-0, policy_version 372362 (0.00085) [2022-07-09 18:41:26,642][25689] Fps is (10 sec: 5791.8, 60 sec: 5666.8, 300 sec: 5675.7). Total num frames: 381299712. Throughput: 0: 5090.4. Samples: 381293286. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:26,643][25689] Avg episode reward: [(0, '-46.608')] [2022-07-09 18:41:28,476][26022] Updated weights on worker 0-0, policy_version 372372 (0.00088) [2022-07-09 18:41:30,123][26022] Updated weights on worker 0-0, policy_version 372382 (0.00093) [2022-07-09 18:41:31,703][25689] Fps is (10 sec: 5666.7, 60 sec: 5663.3, 300 sec: 5664.9). Total num frames: 381326336. Throughput: 0: 5942.5. Samples: 381327450. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:31,704][25689] Avg episode reward: [(0, '-47.018')] [2022-07-09 18:41:32,073][26022] Updated weights on worker 0-0, policy_version 372392 (0.00095) [2022-07-09 18:41:33,751][26022] Updated weights on worker 0-0, policy_version 372402 (0.00093) [2022-07-09 18:41:35,599][26022] Updated weights on worker 0-0, policy_version 372412 (0.00087) [2022-07-09 18:41:36,736][25689] Fps is (10 sec: 5680.7, 60 sec: 5661.2, 300 sec: 5678.1). Total num frames: 381357056. Throughput: 0: 5938.2. Samples: 381361808. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:36,738][25689] Avg episode reward: [(0, '-48.388')] [2022-07-09 18:41:37,229][26022] Updated weights on worker 0-0, policy_version 372422 (0.00344) [2022-07-09 18:41:39,295][26022] Updated weights on worker 0-0, policy_version 372432 (0.00089) [2022-07-09 18:41:40,932][26022] Updated weights on worker 0-0, policy_version 372442 (0.00093) [2022-07-09 18:41:41,765][25689] Fps is (10 sec: 5902.8, 60 sec: 5659.5, 300 sec: 5671.9). Total num frames: 381385728. Throughput: 0: 5105.8. Samples: 381379232. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:41,766][25689] Avg episode reward: [(0, '-48.325')] [2022-07-09 18:41:42,672][26022] Updated weights on worker 0-0, policy_version 372452 (0.00087) [2022-07-09 18:41:44,619][26022] Updated weights on worker 0-0, policy_version 372462 (0.00078) [2022-07-09 18:41:46,527][26022] Updated weights on worker 0-0, policy_version 372472 (0.00079) [2022-07-09 18:41:46,829][25689] Fps is (10 sec: 5580.2, 60 sec: 5657.0, 300 sec: 5672.0). Total num frames: 381413376. Throughput: 0: 5964.1. Samples: 381413686. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:46,830][25689] Avg episode reward: [(0, '-48.898')] [2022-07-09 18:41:47,965][26022] Updated weights on worker 0-0, policy_version 372482 (0.00086) [2022-07-09 18:41:50,052][26022] Updated weights on worker 0-0, policy_version 372492 (0.00095) [2022-07-09 18:41:51,573][26022] Updated weights on worker 0-0, policy_version 372502 (0.00095) [2022-07-09 18:41:51,836][25689] Fps is (10 sec: 5693.6, 60 sec: 5659.8, 300 sec: 5672.5). Total num frames: 381443072. Throughput: 0: 5991.0. Samples: 381448068. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:51,837][25689] Avg episode reward: [(0, '-48.692')] [2022-07-09 18:41:53,586][26022] Updated weights on worker 0-0, policy_version 372512 (0.00089) [2022-07-09 18:41:55,174][26022] Updated weights on worker 0-0, policy_version 372522 (0.00089) [2022-07-09 18:41:56,853][25689] Fps is (10 sec: 5721.0, 60 sec: 5658.9, 300 sec: 5669.0). Total num frames: 381470720. Throughput: 0: 5138.8. Samples: 381465182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:41:56,853][25689] Avg episode reward: [(0, '-49.002')] [2022-07-09 18:41:56,991][26022] Updated weights on worker 0-0, policy_version 372532 (0.00085) [2022-07-09 18:41:58,725][26022] Updated weights on worker 0-0, policy_version 372542 (0.00094) [2022-07-09 18:42:00,572][26022] Updated weights on worker 0-0, policy_version 372552 (0.00083) [2022-07-09 18:42:01,867][25689] Fps is (10 sec: 5512.8, 60 sec: 5642.2, 300 sec: 5677.9). Total num frames: 381498368. Throughput: 0: 5988.0. Samples: 381499604. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:01,867][25689] Avg episode reward: [(0, '-47.175')] [2022-07-09 18:42:02,887][26022] Updated weights on worker 0-0, policy_version 372562 (0.00093) [2022-07-09 18:42:04,657][26022] Updated weights on worker 0-0, policy_version 372572 (0.00090) [2022-07-09 18:42:06,352][26022] Updated weights on worker 0-0, policy_version 372582 (0.00089) [2022-07-09 18:42:07,007][25689] Fps is (10 sec: 5546.5, 60 sec: 5670.7, 300 sec: 5675.3). Total num frames: 381527040. Throughput: 0: 5836.6. Samples: 381531454. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:07,007][25689] Avg episode reward: [(0, '-47.552')] [2022-07-09 18:42:08,194][26022] Updated weights on worker 0-0, policy_version 372592 (0.00083) [2022-07-09 18:42:09,989][26022] Updated weights on worker 0-0, policy_version 372602 (0.00086) [2022-07-09 18:42:11,915][26022] Updated weights on worker 0-0, policy_version 372612 (0.00088) [2022-07-09 18:42:12,027][25689] Fps is (10 sec: 5543.4, 60 sec: 5656.0, 300 sec: 5669.5). Total num frames: 381554688. Throughput: 0: 4971.6. Samples: 381548446. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:12,028][25689] Avg episode reward: [(0, '-47.527')] [2022-07-09 18:42:13,544][26022] Updated weights on worker 0-0, policy_version 372622 (0.00089) [2022-07-09 18:42:15,519][26022] Updated weights on worker 0-0, policy_version 372632 (0.00091) [2022-07-09 18:42:17,033][25689] Fps is (10 sec: 5617.1, 60 sec: 5656.0, 300 sec: 5669.9). Total num frames: 381583360. Throughput: 0: 5825.4. Samples: 381582742. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:17,035][25689] Avg episode reward: [(0, '-46.676')] [2022-07-09 18:42:17,276][26022] Updated weights on worker 0-0, policy_version 372642 (0.00095) [2022-07-09 18:42:19,119][26022] Updated weights on worker 0-0, policy_version 372652 (0.00089) [2022-07-09 18:42:20,841][26022] Updated weights on worker 0-0, policy_version 372662 (0.00086) [2022-07-09 18:42:22,056][25689] Fps is (10 sec: 5717.8, 60 sec: 5672.7, 300 sec: 5667.2). Total num frames: 381612032. Throughput: 0: 5793.9. Samples: 381616576. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:22,057][25689] Avg episode reward: [(0, '-46.550')] [2022-07-09 18:42:22,586][26022] Updated weights on worker 0-0, policy_version 372672 (0.00081) [2022-07-09 18:42:24,465][26022] Updated weights on worker 0-0, policy_version 372682 (0.00084) [2022-07-09 18:42:26,303][26022] Updated weights on worker 0-0, policy_version 372692 (0.00085) [2022-07-09 18:42:27,159][25689] Fps is (10 sec: 5562.2, 60 sec: 5617.7, 300 sec: 5662.4). Total num frames: 381639680. Throughput: 0: 5065.4. Samples: 381633532. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:27,159][25689] Avg episode reward: [(0, '-46.653')] [2022-07-09 18:42:28,177][26022] Updated weights on worker 0-0, policy_version 372702 (0.00086) [2022-07-09 18:42:29,835][26022] Updated weights on worker 0-0, policy_version 372712 (0.00088) [2022-07-09 18:42:31,670][26022] Updated weights on worker 0-0, policy_version 372722 (0.00098) [2022-07-09 18:42:32,197][25689] Fps is (10 sec: 5755.6, 60 sec: 5687.6, 300 sec: 5668.8). Total num frames: 381670400. Throughput: 0: 5913.4. Samples: 381667720. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:32,197][25689] Avg episode reward: [(0, '-45.996')] [2022-07-09 18:42:33,808][26022] Updated weights on worker 0-0, policy_version 372732 (0.00090) [2022-07-09 18:42:35,110][26022] Updated weights on worker 0-0, policy_version 372742 (0.00098) [2022-07-09 18:42:37,219][25689] Fps is (10 sec: 5802.1, 60 sec: 5637.9, 300 sec: 5665.4). Total num frames: 381698048. Throughput: 0: 5908.0. Samples: 381701996. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:37,219][25689] Avg episode reward: [(0, '-46.092')] [2022-07-09 18:42:37,221][26022] Updated weights on worker 0-0, policy_version 372752 (0.00084) [2022-07-09 18:42:38,748][26022] Updated weights on worker 0-0, policy_version 372762 (0.00084) [2022-07-09 18:42:40,876][26022] Updated weights on worker 0-0, policy_version 372772 (0.00091) [2022-07-09 18:42:42,283][25689] Fps is (10 sec: 5584.0, 60 sec: 5634.5, 300 sec: 5668.4). Total num frames: 381726720. Throughput: 0: 5916.3. Samples: 381736246. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:42,287][25689] Avg episode reward: [(0, '-46.137')] [2022-07-09 18:42:42,503][26022] Updated weights on worker 0-0, policy_version 372782 (0.00082) [2022-07-09 18:42:44,401][26022] Updated weights on worker 0-0, policy_version 372792 (0.00084) [2022-07-09 18:42:46,162][26022] Updated weights on worker 0-0, policy_version 372802 (0.00085) [2022-07-09 18:42:47,353][25689] Fps is (10 sec: 5759.2, 60 sec: 5667.8, 300 sec: 5664.2). Total num frames: 381756416. Throughput: 0: 5941.2. Samples: 381753512. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:47,354][25689] Avg episode reward: [(0, '-46.411')] [2022-07-09 18:42:48,029][26022] Updated weights on worker 0-0, policy_version 372812 (0.00090) [2022-07-09 18:42:49,704][26022] Updated weights on worker 0-0, policy_version 372822 (0.00487) [2022-07-09 18:42:51,519][26022] Updated weights on worker 0-0, policy_version 372832 (0.00094) [2022-07-09 18:42:52,364][25689] Fps is (10 sec: 5688.4, 60 sec: 5633.7, 300 sec: 5667.7). Total num frames: 381784064. Throughput: 0: 5948.0. Samples: 381787674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:52,364][25689] Avg episode reward: [(0, '-46.746')] [2022-07-09 18:42:53,222][26022] Updated weights on worker 0-0, policy_version 372842 (0.00087) [2022-07-09 18:42:55,154][26022] Updated weights on worker 0-0, policy_version 372852 (0.00086) [2022-07-09 18:42:57,148][26022] Updated weights on worker 0-0, policy_version 372862 (0.00093) [2022-07-09 18:42:57,367][25689] Fps is (10 sec: 5624.7, 60 sec: 5651.9, 300 sec: 5661.1). Total num frames: 381812736. Throughput: 0: 5946.2. Samples: 381821800. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:42:57,367][25689] Avg episode reward: [(0, '-46.794')] [2022-07-09 18:42:58,671][26022] Updated weights on worker 0-0, policy_version 372872 (0.00088) [2022-07-09 18:43:00,706][26022] Updated weights on worker 0-0, policy_version 372882 (0.00089) [2022-07-09 18:43:02,403][25689] Fps is (10 sec: 5609.7, 60 sec: 5649.7, 300 sec: 5668.5). Total num frames: 381840384. Throughput: 0: 5101.4. Samples: 381838890. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:43:02,404][25689] Avg episode reward: [(0, '-48.210')] [2022-07-09 18:43:02,604][26022] Updated weights on worker 0-0, policy_version 372892 (0.00090) [2022-07-09 18:43:04,592][26022] Updated weights on worker 0-0, policy_version 372902 (0.00090) [2022-07-09 18:43:06,320][26022] Updated weights on worker 0-0, policy_version 372912 (0.00087) [2022-07-09 18:43:07,529][25689] Fps is (10 sec: 5340.6, 60 sec: 5617.3, 300 sec: 5657.0). Total num frames: 381867008. Throughput: 0: 5826.9. Samples: 381871070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:43:07,530][25689] Avg episode reward: [(0, '-47.750')] [2022-07-09 18:43:08,170][26022] Updated weights on worker 0-0, policy_version 372922 (0.00084) [2022-07-09 18:43:09,792][26022] Updated weights on worker 0-0, policy_version 372932 (0.00091) [2022-07-09 18:43:11,702][26022] Updated weights on worker 0-0, policy_version 372942 (0.00079) [2022-07-09 18:43:12,563][25689] Fps is (10 sec: 5644.2, 60 sec: 5666.7, 300 sec: 5663.9). Total num frames: 381897728. Throughput: 0: 5845.1. Samples: 381905742. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:43:12,564][25689] Avg episode reward: [(0, '-48.032')] [2022-07-09 18:43:13,506][26022] Updated weights on worker 0-0, policy_version 372952 (0.00092) [2022-07-09 18:43:15,208][26022] Updated weights on worker 0-0, policy_version 372962 (0.00088) [2022-07-09 18:43:16,985][26022] Updated weights on worker 0-0, policy_version 372972 (0.00083) [2022-07-09 18:43:17,624][25689] Fps is (10 sec: 5984.8, 60 sec: 5678.5, 300 sec: 5670.6). Total num frames: 381927424. Throughput: 0: 4991.1. Samples: 381922906. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 18:43:17,625][25689] Avg episode reward: [(0, '-47.681')] [2022-07-09 18:43:18,820][26022] Updated weights on worker 0-0, policy_version 372982 (0.00086) [2022-07-09 18:43:20,559][26022] Updated weights on worker 0-0, policy_version 372992 (0.00088) [2022-07-09 18:43:22,303][26022] Updated weights on worker 0-0, policy_version 373002 (0.00092) [2022-07-09 18:43:22,606][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:43:22,619][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000373003_381955072.pth [2022-07-09 18:43:22,620][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000371008_379912192.pth [2022-07-09 18:43:22,723][25689] Fps is (10 sec: 5644.1, 60 sec: 5654.4, 300 sec: 5663.5). Total num frames: 381955072. Throughput: 0: 5827.4. Samples: 381957302. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:22,724][25689] Avg episode reward: [(0, '-48.120')] [2022-07-09 18:43:24,039][26022] Updated weights on worker 0-0, policy_version 373012 (0.00085) [2022-07-09 18:43:26,204][26022] Updated weights on worker 0-0, policy_version 373022 (0.00089) [2022-07-09 18:43:27,742][26022] Updated weights on worker 0-0, policy_version 373032 (0.00090) [2022-07-09 18:43:27,787][25689] Fps is (10 sec: 5742.7, 60 sec: 5708.7, 300 sec: 5672.9). Total num frames: 381985792. Throughput: 0: 5942.7. Samples: 381991462. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:27,788][25689] Avg episode reward: [(0, '-46.960')] [2022-07-09 18:43:29,615][26022] Updated weights on worker 0-0, policy_version 373042 (0.00091) [2022-07-09 18:43:31,267][26022] Updated weights on worker 0-0, policy_version 373052 (0.00085) [2022-07-09 18:43:32,888][25689] Fps is (10 sec: 5842.8, 60 sec: 5669.1, 300 sec: 5667.9). Total num frames: 382014464. Throughput: 0: 5048.7. Samples: 382008356. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:32,889][25689] Avg episode reward: [(0, '-47.332')] [2022-07-09 18:43:33,112][26022] Updated weights on worker 0-0, policy_version 373062 (0.00080) [2022-07-09 18:43:35,035][26022] Updated weights on worker 0-0, policy_version 373072 (0.00094) [2022-07-09 18:43:36,734][26022] Updated weights on worker 0-0, policy_version 373082 (0.00091) [2022-07-09 18:43:37,912][25689] Fps is (10 sec: 5562.6, 60 sec: 5668.9, 300 sec: 5664.4). Total num frames: 382042112. Throughput: 0: 5914.0. Samples: 382042892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:37,913][25689] Avg episode reward: [(0, '-47.767')] [2022-07-09 18:43:38,503][26022] Updated weights on worker 0-0, policy_version 373092 (0.00097) [2022-07-09 18:43:40,163][26022] Updated weights on worker 0-0, policy_version 373102 (0.00092) [2022-07-09 18:43:42,088][26022] Updated weights on worker 0-0, policy_version 373112 (0.00095) [2022-07-09 18:43:42,939][25689] Fps is (10 sec: 5603.7, 60 sec: 5672.4, 300 sec: 5664.8). Total num frames: 382070784. Throughput: 0: 5927.7. Samples: 382077134. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:42,939][25689] Avg episode reward: [(0, '-47.657')] [2022-07-09 18:43:43,971][26022] Updated weights on worker 0-0, policy_version 373122 (0.00092) [2022-07-09 18:43:45,844][26022] Updated weights on worker 0-0, policy_version 373132 (0.00094) [2022-07-09 18:43:47,491][26022] Updated weights on worker 0-0, policy_version 373142 (0.00099) [2022-07-09 18:43:48,027][25689] Fps is (10 sec: 5770.2, 60 sec: 5670.7, 300 sec: 5667.3). Total num frames: 382100480. Throughput: 0: 5076.4. Samples: 382094208. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:48,028][25689] Avg episode reward: [(0, '-48.227')] [2022-07-09 18:43:49,470][26022] Updated weights on worker 0-0, policy_version 373152 (0.00084) [2022-07-09 18:43:50,968][26022] Updated weights on worker 0-0, policy_version 373162 (0.00087) [2022-07-09 18:43:53,084][25689] Fps is (10 sec: 5551.2, 60 sec: 5649.5, 300 sec: 5660.1). Total num frames: 382127104. Throughput: 0: 5917.0. Samples: 382127856. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:53,085][25689] Avg episode reward: [(0, '-47.562')] [2022-07-09 18:43:53,173][26022] Updated weights on worker 0-0, policy_version 373172 (0.00084) [2022-07-09 18:43:54,984][26022] Updated weights on worker 0-0, policy_version 373182 (0.00081) [2022-07-09 18:43:56,582][26022] Updated weights on worker 0-0, policy_version 373192 (0.00098) [2022-07-09 18:43:58,105][25689] Fps is (10 sec: 5588.7, 60 sec: 5664.7, 300 sec: 5663.5). Total num frames: 382156800. Throughput: 0: 5904.5. Samples: 382162120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:43:58,107][25689] Avg episode reward: [(0, '-48.579')] [2022-07-09 18:43:58,482][26022] Updated weights on worker 0-0, policy_version 373202 (0.00090) [2022-07-09 18:44:00,152][26022] Updated weights on worker 0-0, policy_version 373212 (0.00091) [2022-07-09 18:44:02,259][26022] Updated weights on worker 0-0, policy_version 373222 (0.00086) [2022-07-09 18:44:03,118][25689] Fps is (10 sec: 5511.1, 60 sec: 5633.2, 300 sec: 5665.4). Total num frames: 382182400. Throughput: 0: 5057.3. Samples: 382179186. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:03,120][25689] Avg episode reward: [(0, '-48.872')] [2022-07-09 18:44:04,179][26022] Updated weights on worker 0-0, policy_version 373232 (0.00085) [2022-07-09 18:44:06,069][26022] Updated weights on worker 0-0, policy_version 373242 (0.00087) [2022-07-09 18:44:07,866][26022] Updated weights on worker 0-0, policy_version 373252 (0.00087) [2022-07-09 18:44:08,171][25689] Fps is (10 sec: 5493.2, 60 sec: 5690.5, 300 sec: 5661.2). Total num frames: 382212096. Throughput: 0: 5799.1. Samples: 382211024. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:08,172][25689] Avg episode reward: [(0, '-48.506')] [2022-07-09 18:44:09,903][26022] Updated weights on worker 0-0, policy_version 373262 (0.00083) [2022-07-09 18:44:11,314][26022] Updated weights on worker 0-0, policy_version 373272 (0.00094) [2022-07-09 18:44:13,185][25689] Fps is (10 sec: 5492.8, 60 sec: 5608.0, 300 sec: 5654.7). Total num frames: 382237696. Throughput: 0: 5819.4. Samples: 382244830. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:13,185][25689] Avg episode reward: [(0, '-48.661')] [2022-07-09 18:44:13,432][26022] Updated weights on worker 0-0, policy_version 373282 (0.00083) [2022-07-09 18:44:15,034][26022] Updated weights on worker 0-0, policy_version 373292 (0.00069) [2022-07-09 18:44:17,115][26022] Updated weights on worker 0-0, policy_version 373302 (0.00089) [2022-07-09 18:44:18,195][25689] Fps is (10 sec: 5516.7, 60 sec: 5612.7, 300 sec: 5654.7). Total num frames: 382267392. Throughput: 0: 4953.1. Samples: 382261626. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:18,197][25689] Avg episode reward: [(0, '-48.386')] [2022-07-09 18:44:18,805][26022] Updated weights on worker 0-0, policy_version 373312 (0.00087) [2022-07-09 18:44:20,580][26022] Updated weights on worker 0-0, policy_version 373322 (0.00082) [2022-07-09 18:44:22,680][26022] Updated weights on worker 0-0, policy_version 373332 (0.00081) [2022-07-09 18:44:23,217][25689] Fps is (10 sec: 5818.0, 60 sec: 5636.8, 300 sec: 5658.8). Total num frames: 382296064. Throughput: 0: 5781.6. Samples: 382295392. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:23,218][25689] Avg episode reward: [(0, '-48.182')] [2022-07-09 18:44:24,136][26022] Updated weights on worker 0-0, policy_version 373342 (0.00049) [2022-07-09 18:44:26,236][26022] Updated weights on worker 0-0, policy_version 373352 (0.00085) [2022-07-09 18:44:27,806][26022] Updated weights on worker 0-0, policy_version 373362 (0.00082) [2022-07-09 18:44:28,307][25689] Fps is (10 sec: 5670.8, 60 sec: 5600.6, 300 sec: 5660.7). Total num frames: 382324736. Throughput: 0: 5893.6. Samples: 382329694. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:28,308][25689] Avg episode reward: [(0, '-47.912')] [2022-07-09 18:44:29,734][26022] Updated weights on worker 0-0, policy_version 373372 (0.00091) [2022-07-09 18:44:31,549][26022] Updated weights on worker 0-0, policy_version 373382 (0.00095) [2022-07-09 18:44:33,320][25689] Fps is (10 sec: 5676.0, 60 sec: 5608.7, 300 sec: 5655.2). Total num frames: 382353408. Throughput: 0: 5067.9. Samples: 382346872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:33,320][25689] Avg episode reward: [(0, '-48.007')] [2022-07-09 18:44:33,323][26022] Updated weights on worker 0-0, policy_version 373392 (0.00087) [2022-07-09 18:44:35,061][26022] Updated weights on worker 0-0, policy_version 373402 (0.00086) [2022-07-09 18:44:36,815][26022] Updated weights on worker 0-0, policy_version 373412 (0.00095) [2022-07-09 18:44:38,352][25689] Fps is (10 sec: 5708.3, 60 sec: 5624.9, 300 sec: 5658.1). Total num frames: 382382080. Throughput: 0: 5923.4. Samples: 382381028. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:38,354][25689] Avg episode reward: [(0, '-48.863')] [2022-07-09 18:44:38,555][26022] Updated weights on worker 0-0, policy_version 373422 (0.00092) [2022-07-09 18:44:40,499][26022] Updated weights on worker 0-0, policy_version 373432 (0.00084) [2022-07-09 18:44:42,155][26022] Updated weights on worker 0-0, policy_version 373442 (0.00086) [2022-07-09 18:44:43,356][25689] Fps is (10 sec: 5713.8, 60 sec: 5627.0, 300 sec: 5656.8). Total num frames: 382410752. Throughput: 0: 5952.8. Samples: 382415274. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:43,356][25689] Avg episode reward: [(0, '-48.136')] [2022-07-09 18:44:44,113][26022] Updated weights on worker 0-0, policy_version 373452 (0.00084) [2022-07-09 18:44:45,632][26022] Updated weights on worker 0-0, policy_version 373462 (0.00089) [2022-07-09 18:44:47,559][26022] Updated weights on worker 0-0, policy_version 373472 (0.00093) [2022-07-09 18:44:48,408][25689] Fps is (10 sec: 5804.8, 60 sec: 5630.5, 300 sec: 5657.4). Total num frames: 382440448. Throughput: 0: 5112.9. Samples: 382432468. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:48,408][25689] Avg episode reward: [(0, '-49.604')] [2022-07-09 18:44:49,230][26022] Updated weights on worker 0-0, policy_version 373482 (0.00092) [2022-07-09 18:44:51,203][26022] Updated weights on worker 0-0, policy_version 373492 (0.00080) [2022-07-09 18:44:52,801][26022] Updated weights on worker 0-0, policy_version 373502 (0.00927) [2022-07-09 18:44:53,468][25689] Fps is (10 sec: 5670.7, 60 sec: 5647.1, 300 sec: 5653.1). Total num frames: 382468096. Throughput: 0: 5950.0. Samples: 382466754. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:53,469][25689] Avg episode reward: [(0, '-49.147')] [2022-07-09 18:44:54,739][26022] Updated weights on worker 0-0, policy_version 373512 (0.00086) [2022-07-09 18:44:56,440][26022] Updated weights on worker 0-0, policy_version 373522 (0.00095) [2022-07-09 18:44:58,265][26022] Updated weights on worker 0-0, policy_version 373532 (0.00088) [2022-07-09 18:44:58,471][25689] Fps is (10 sec: 5596.8, 60 sec: 5631.8, 300 sec: 5651.4). Total num frames: 382496768. Throughput: 0: 5965.0. Samples: 382501032. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:44:58,471][25689] Avg episode reward: [(0, '-48.995')] [2022-07-09 18:44:59,996][26022] Updated weights on worker 0-0, policy_version 373542 (0.00088) [2022-07-09 18:45:02,172][26022] Updated weights on worker 0-0, policy_version 373552 (0.00096) [2022-07-09 18:45:03,486][25689] Fps is (10 sec: 5519.7, 60 sec: 5648.5, 300 sec: 5652.8). Total num frames: 382523392. Throughput: 0: 5864.9. Samples: 382533336. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:03,486][25689] Avg episode reward: [(0, '-48.419')] [2022-07-09 18:45:04,024][26022] Updated weights on worker 0-0, policy_version 373562 (0.00081) [2022-07-09 18:45:06,015][26022] Updated weights on worker 0-0, policy_version 373572 (0.00084) [2022-07-09 18:45:07,556][26022] Updated weights on worker 0-0, policy_version 373582 (0.00089) [2022-07-09 18:45:08,595][25689] Fps is (10 sec: 5562.6, 60 sec: 5643.3, 300 sec: 5654.2). Total num frames: 382553088. Throughput: 0: 5845.9. Samples: 382550484. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:08,596][25689] Avg episode reward: [(0, '-48.056')] [2022-07-09 18:45:09,522][26022] Updated weights on worker 0-0, policy_version 373592 (0.00087) [2022-07-09 18:45:11,263][26022] Updated weights on worker 0-0, policy_version 373602 (0.00084) [2022-07-09 18:45:13,106][26022] Updated weights on worker 0-0, policy_version 373612 (0.00083) [2022-07-09 18:45:13,638][25689] Fps is (10 sec: 5749.4, 60 sec: 5691.4, 300 sec: 5655.4). Total num frames: 382581760. Throughput: 0: 5856.2. Samples: 382584874. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:13,639][25689] Avg episode reward: [(0, '-47.937')] [2022-07-09 18:45:14,866][26022] Updated weights on worker 0-0, policy_version 373622 (0.00086) [2022-07-09 18:45:16,613][26022] Updated weights on worker 0-0, policy_version 373632 (0.00087) [2022-07-09 18:45:18,446][26022] Updated weights on worker 0-0, policy_version 373642 (0.00102) [2022-07-09 18:45:18,641][25689] Fps is (10 sec: 5708.5, 60 sec: 5675.1, 300 sec: 5656.1). Total num frames: 382610432. Throughput: 0: 5872.5. Samples: 382619480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:18,641][25689] Avg episode reward: [(0, '-46.220')] [2022-07-09 18:45:20,037][26022] Updated weights on worker 0-0, policy_version 373652 (0.00085) [2022-07-09 18:45:22,031][26022] Updated weights on worker 0-0, policy_version 373662 (0.00089) [2022-07-09 18:45:22,688][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:45:22,701][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000373666_382633984.pth [2022-07-09 18:45:22,701][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000371674_380594176.pth [2022-07-09 18:45:23,655][25689] Fps is (10 sec: 5827.0, 60 sec: 5692.9, 300 sec: 5657.8). Total num frames: 382640128. Throughput: 0: 5130.4. Samples: 382636810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:23,655][25689] Avg episode reward: [(0, '-45.990')] [2022-07-09 18:45:23,661][26022] Updated weights on worker 0-0, policy_version 373672 (0.00081) [2022-07-09 18:45:25,654][26022] Updated weights on worker 0-0, policy_version 373682 (0.00097) [2022-07-09 18:45:27,303][26022] Updated weights on worker 0-0, policy_version 373692 (0.00092) [2022-07-09 18:45:28,767][25689] Fps is (10 sec: 5663.0, 60 sec: 5673.8, 300 sec: 5659.5). Total num frames: 382667776. Throughput: 0: 5977.1. Samples: 382671052. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:28,767][25689] Avg episode reward: [(0, '-46.466')] [2022-07-09 18:45:29,187][26022] Updated weights on worker 0-0, policy_version 373702 (0.00085) [2022-07-09 18:45:30,944][26022] Updated weights on worker 0-0, policy_version 373712 (0.00084) [2022-07-09 18:45:32,623][26022] Updated weights on worker 0-0, policy_version 373722 (0.00094) [2022-07-09 18:45:33,855][25689] Fps is (10 sec: 5621.9, 60 sec: 5683.7, 300 sec: 5654.6). Total num frames: 382697472. Throughput: 0: 5961.5. Samples: 382705396. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:33,856][25689] Avg episode reward: [(0, '-46.295')] [2022-07-09 18:45:34,693][26022] Updated weights on worker 0-0, policy_version 373732 (0.00089) [2022-07-09 18:45:36,303][26022] Updated weights on worker 0-0, policy_version 373742 (0.00084) [2022-07-09 18:45:38,239][26022] Updated weights on worker 0-0, policy_version 373752 (0.00090) [2022-07-09 18:45:38,975][25689] Fps is (10 sec: 5818.2, 60 sec: 5692.4, 300 sec: 5656.0). Total num frames: 382727168. Throughput: 0: 5065.8. Samples: 382722488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:38,975][25689] Avg episode reward: [(0, '-47.013')] [2022-07-09 18:45:39,862][26022] Updated weights on worker 0-0, policy_version 373762 (0.00097) [2022-07-09 18:45:41,657][26022] Updated weights on worker 0-0, policy_version 373772 (0.00082) [2022-07-09 18:45:43,528][26022] Updated weights on worker 0-0, policy_version 373782 (0.00087) [2022-07-09 18:45:43,983][25689] Fps is (10 sec: 5662.1, 60 sec: 5675.1, 300 sec: 5656.5). Total num frames: 382754816. Throughput: 0: 5906.9. Samples: 382756884. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:43,983][25689] Avg episode reward: [(0, '-47.279')] [2022-07-09 18:45:45,226][26022] Updated weights on worker 0-0, policy_version 373792 (0.00089) [2022-07-09 18:45:47,071][26022] Updated weights on worker 0-0, policy_version 373802 (0.00086) [2022-07-09 18:45:48,807][26022] Updated weights on worker 0-0, policy_version 373812 (0.00090) [2022-07-09 18:45:49,011][25689] Fps is (10 sec: 5611.5, 60 sec: 5660.4, 300 sec: 5653.2). Total num frames: 382783488. Throughput: 0: 5934.1. Samples: 382791184. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:49,012][25689] Avg episode reward: [(0, '-49.127')] [2022-07-09 18:45:50,725][26022] Updated weights on worker 0-0, policy_version 373822 (0.00084) [2022-07-09 18:45:52,441][26022] Updated weights on worker 0-0, policy_version 373832 (0.00370) [2022-07-09 18:45:54,023][25689] Fps is (10 sec: 5711.6, 60 sec: 5681.9, 300 sec: 5656.6). Total num frames: 382812160. Throughput: 0: 5114.5. Samples: 382808544. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 18:45:54,023][25689] Avg episode reward: [(0, '-48.727')] [2022-07-09 18:45:54,158][26022] Updated weights on worker 0-0, policy_version 373842 (0.00095) [2022-07-09 18:45:56,119][26022] Updated weights on worker 0-0, policy_version 373852 (0.00132) [2022-07-09 18:45:57,901][26022] Updated weights on worker 0-0, policy_version 373862 (0.00082) [2022-07-09 18:45:59,038][25689] Fps is (10 sec: 5718.8, 60 sec: 5680.6, 300 sec: 5656.6). Total num frames: 382840832. Throughput: 0: 5996.0. Samples: 382842790. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:45:59,039][25689] Avg episode reward: [(0, '-47.976')] [2022-07-09 18:45:59,547][26022] Updated weights on worker 0-0, policy_version 373872 (0.00088) [2022-07-09 18:46:01,375][26022] Updated weights on worker 0-0, policy_version 373882 (0.00087) [2022-07-09 18:46:03,440][26022] Updated weights on worker 0-0, policy_version 373892 (0.00090) [2022-07-09 18:46:04,074][25689] Fps is (10 sec: 5399.2, 60 sec: 5661.8, 300 sec: 5654.1). Total num frames: 382866432. Throughput: 0: 5864.4. Samples: 382874710. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:04,075][25689] Avg episode reward: [(0, '-48.281')] [2022-07-09 18:46:05,541][26022] Updated weights on worker 0-0, policy_version 373902 (0.00091) [2022-07-09 18:46:07,256][26022] Updated weights on worker 0-0, policy_version 373912 (0.00084) [2022-07-09 18:46:09,023][26022] Updated weights on worker 0-0, policy_version 373922 (0.00087) [2022-07-09 18:46:09,107][25689] Fps is (10 sec: 5492.1, 60 sec: 5669.0, 300 sec: 5657.7). Total num frames: 382896128. Throughput: 0: 5003.5. Samples: 382891732. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:09,107][25689] Avg episode reward: [(0, '-47.828')] [2022-07-09 18:46:10,748][26022] Updated weights on worker 0-0, policy_version 373932 (0.00090) [2022-07-09 18:46:12,603][26022] Updated weights on worker 0-0, policy_version 373942 (0.00084) [2022-07-09 18:46:14,121][25689] Fps is (10 sec: 5809.8, 60 sec: 5671.7, 300 sec: 5657.6). Total num frames: 382924800. Throughput: 0: 5842.3. Samples: 382925964. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:14,122][25689] Avg episode reward: [(0, '-48.862')] [2022-07-09 18:46:14,358][26022] Updated weights on worker 0-0, policy_version 373952 (0.00087) [2022-07-09 18:46:16,191][26022] Updated weights on worker 0-0, policy_version 373962 (0.00089) [2022-07-09 18:46:18,081][26022] Updated weights on worker 0-0, policy_version 373972 (0.00089) [2022-07-09 18:46:19,122][25689] Fps is (10 sec: 5725.5, 60 sec: 5671.8, 300 sec: 5661.4). Total num frames: 382953472. Throughput: 0: 5863.8. Samples: 382960558. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:19,123][25689] Avg episode reward: [(0, '-48.675')] [2022-07-09 18:46:19,699][26022] Updated weights on worker 0-0, policy_version 373982 (0.00080) [2022-07-09 18:46:21,763][26022] Updated weights on worker 0-0, policy_version 373992 (0.00357) [2022-07-09 18:46:23,270][26022] Updated weights on worker 0-0, policy_version 374002 (0.00096) [2022-07-09 18:46:24,184][25689] Fps is (10 sec: 5698.3, 60 sec: 5650.4, 300 sec: 5654.4). Total num frames: 382982144. Throughput: 0: 5117.3. Samples: 382977618. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:24,185][25689] Avg episode reward: [(0, '-48.262')] [2022-07-09 18:46:25,366][26022] Updated weights on worker 0-0, policy_version 374012 (0.00097) [2022-07-09 18:46:26,945][26022] Updated weights on worker 0-0, policy_version 374022 (0.00083) [2022-07-09 18:46:28,870][26022] Updated weights on worker 0-0, policy_version 374032 (0.00096) [2022-07-09 18:46:29,245][25689] Fps is (10 sec: 5766.1, 60 sec: 5689.1, 300 sec: 5664.8). Total num frames: 383011840. Throughput: 0: 5960.7. Samples: 383011770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:29,245][25689] Avg episode reward: [(0, '-48.100')] [2022-07-09 18:46:30,727][26022] Updated weights on worker 0-0, policy_version 374042 (0.00080) [2022-07-09 18:46:32,406][26022] Updated weights on worker 0-0, policy_version 374052 (0.00087) [2022-07-09 18:46:34,286][25689] Fps is (10 sec: 5575.2, 60 sec: 5642.6, 300 sec: 5650.8). Total num frames: 383038464. Throughput: 0: 5954.1. Samples: 383046030. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:34,287][25689] Avg episode reward: [(0, '-48.023')] [2022-07-09 18:46:34,331][26022] Updated weights on worker 0-0, policy_version 374062 (0.00091) [2022-07-09 18:46:36,072][26022] Updated weights on worker 0-0, policy_version 374072 (0.00087) [2022-07-09 18:46:37,834][26022] Updated weights on worker 0-0, policy_version 374082 (0.00084) [2022-07-09 18:46:39,303][25689] Fps is (10 sec: 5497.8, 60 sec: 5635.3, 300 sec: 5651.1). Total num frames: 383067136. Throughput: 0: 5071.8. Samples: 383062910. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:39,303][25689] Avg episode reward: [(0, '-47.820')] [2022-07-09 18:46:39,642][26022] Updated weights on worker 0-0, policy_version 374092 (0.00085) [2022-07-09 18:46:41,487][26022] Updated weights on worker 0-0, policy_version 374102 (0.00084) [2022-07-09 18:46:43,180][26022] Updated weights on worker 0-0, policy_version 374112 (0.00090) [2022-07-09 18:46:44,317][25689] Fps is (10 sec: 5818.8, 60 sec: 5668.7, 300 sec: 5658.9). Total num frames: 383096832. Throughput: 0: 5932.9. Samples: 383097064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:44,319][25689] Avg episode reward: [(0, '-47.017')] [2022-07-09 18:46:45,162][26022] Updated weights on worker 0-0, policy_version 374122 (0.00085) [2022-07-09 18:46:46,701][26022] Updated weights on worker 0-0, policy_version 374132 (0.00087) [2022-07-09 18:46:48,770][26022] Updated weights on worker 0-0, policy_version 374142 (0.00095) [2022-07-09 18:46:49,461][25689] Fps is (10 sec: 5745.7, 60 sec: 5657.8, 300 sec: 5652.9). Total num frames: 383125504. Throughput: 0: 5902.2. Samples: 383131090. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:49,462][25689] Avg episode reward: [(0, '-47.602')] [2022-07-09 18:46:50,414][26022] Updated weights on worker 0-0, policy_version 374152 (0.00088) [2022-07-09 18:46:52,226][26022] Updated weights on worker 0-0, policy_version 374162 (0.00085) [2022-07-09 18:46:54,011][26022] Updated weights on worker 0-0, policy_version 374172 (0.00106) [2022-07-09 18:46:54,466][25689] Fps is (10 sec: 5751.0, 60 sec: 5675.4, 300 sec: 5660.0). Total num frames: 383155200. Throughput: 0: 5070.0. Samples: 383148342. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:54,467][25689] Avg episode reward: [(0, '-47.817')] [2022-07-09 18:46:55,804][26022] Updated weights on worker 0-0, policy_version 374182 (0.00092) [2022-07-09 18:46:57,535][26022] Updated weights on worker 0-0, policy_version 374192 (0.00081) [2022-07-09 18:46:59,470][25689] Fps is (10 sec: 5627.3, 60 sec: 5642.7, 300 sec: 5656.7). Total num frames: 383181824. Throughput: 0: 5930.5. Samples: 383182510. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:46:59,471][25689] Avg episode reward: [(0, '-47.883')] [2022-07-09 18:46:59,542][26022] Updated weights on worker 0-0, policy_version 374202 (0.00092) [2022-07-09 18:47:01,099][26022] Updated weights on worker 0-0, policy_version 374212 (0.00095) [2022-07-09 18:47:03,425][26022] Updated weights on worker 0-0, policy_version 374222 (0.00393) [2022-07-09 18:47:04,494][25689] Fps is (10 sec: 5310.0, 60 sec: 5660.7, 300 sec: 5652.0). Total num frames: 383208448. Throughput: 0: 5831.9. Samples: 383214734. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:04,496][25689] Avg episode reward: [(0, '-48.288')] [2022-07-09 18:47:05,021][26022] Updated weights on worker 0-0, policy_version 374232 (0.00084) [2022-07-09 18:47:07,017][26022] Updated weights on worker 0-0, policy_version 374242 (0.00081) [2022-07-09 18:47:08,847][26022] Updated weights on worker 0-0, policy_version 374252 (0.00083) [2022-07-09 18:47:09,602][25689] Fps is (10 sec: 5558.3, 60 sec: 5653.6, 300 sec: 5657.2). Total num frames: 383238144. Throughput: 0: 4998.2. Samples: 383231758. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:09,603][25689] Avg episode reward: [(0, '-48.487')] [2022-07-09 18:47:10,758][26022] Updated weights on worker 0-0, policy_version 374262 (0.00090) [2022-07-09 18:47:12,432][26022] Updated weights on worker 0-0, policy_version 374272 (0.00083) [2022-07-09 18:47:14,207][26022] Updated weights on worker 0-0, policy_version 374282 (0.00088) [2022-07-09 18:47:14,666][25689] Fps is (10 sec: 5838.7, 60 sec: 5665.8, 300 sec: 5659.6). Total num frames: 383267840. Throughput: 0: 5828.5. Samples: 383266078. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:14,667][25689] Avg episode reward: [(0, '-48.030')] [2022-07-09 18:47:16,102][26022] Updated weights on worker 0-0, policy_version 374292 (0.00086) [2022-07-09 18:47:17,570][26022] Updated weights on worker 0-0, policy_version 374302 (0.00092) [2022-07-09 18:47:19,613][26022] Updated weights on worker 0-0, policy_version 374312 (0.00093) [2022-07-09 18:47:19,671][25689] Fps is (10 sec: 5695.1, 60 sec: 5648.6, 300 sec: 5656.5). Total num frames: 383295488. Throughput: 0: 5834.9. Samples: 383300384. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:19,673][25689] Avg episode reward: [(0, '-48.235')] [2022-07-09 18:47:21,285][26022] Updated weights on worker 0-0, policy_version 374322 (0.00081) [2022-07-09 18:47:22,742][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:47:22,761][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000374330_383313920.pth [2022-07-09 18:47:22,761][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000372340_381276160.pth [2022-07-09 18:47:23,263][26022] Updated weights on worker 0-0, policy_version 374332 (0.00091) [2022-07-09 18:47:24,699][25689] Fps is (10 sec: 5613.9, 60 sec: 5651.8, 300 sec: 5661.4). Total num frames: 383324160. Throughput: 0: 5076.3. Samples: 383317298. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:24,700][25689] Avg episode reward: [(0, '-48.499')] [2022-07-09 18:47:25,028][26022] Updated weights on worker 0-0, policy_version 374342 (0.00086) [2022-07-09 18:47:26,784][26022] Updated weights on worker 0-0, policy_version 374352 (0.00093) [2022-07-09 18:47:28,709][26022] Updated weights on worker 0-0, policy_version 374362 (0.00114) [2022-07-09 18:47:29,759][25689] Fps is (10 sec: 5786.3, 60 sec: 5651.9, 300 sec: 5657.5). Total num frames: 383353856. Throughput: 0: 5932.4. Samples: 383351334. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:29,760][25689] Avg episode reward: [(0, '-47.811')] [2022-07-09 18:47:30,403][26022] Updated weights on worker 0-0, policy_version 374372 (0.00094) [2022-07-09 18:47:32,421][26022] Updated weights on worker 0-0, policy_version 374382 (0.00085) [2022-07-09 18:47:33,876][26022] Updated weights on worker 0-0, policy_version 374392 (0.00086) [2022-07-09 18:47:34,835][25689] Fps is (10 sec: 5657.4, 60 sec: 5665.5, 300 sec: 5656.5). Total num frames: 383381504. Throughput: 0: 5921.2. Samples: 383385500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:34,835][25689] Avg episode reward: [(0, '-47.275')] [2022-07-09 18:47:35,890][26022] Updated weights on worker 0-0, policy_version 374402 (0.00091) [2022-07-09 18:47:37,498][26022] Updated weights on worker 0-0, policy_version 374412 (0.00098) [2022-07-09 18:47:39,497][26022] Updated weights on worker 0-0, policy_version 374422 (0.00087) [2022-07-09 18:47:39,911][25689] Fps is (10 sec: 5547.5, 60 sec: 5660.0, 300 sec: 5656.2). Total num frames: 383410176. Throughput: 0: 5894.3. Samples: 383419682. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:39,912][25689] Avg episode reward: [(0, '-46.167')] [2022-07-09 18:47:41,042][26022] Updated weights on worker 0-0, policy_version 374432 (0.00062) [2022-07-09 18:47:43,070][26022] Updated weights on worker 0-0, policy_version 374442 (0.00098) [2022-07-09 18:47:44,798][26022] Updated weights on worker 0-0, policy_version 374452 (0.00081) [2022-07-09 18:47:44,994][25689] Fps is (10 sec: 5745.1, 60 sec: 5653.5, 300 sec: 5656.0). Total num frames: 383439872. Throughput: 0: 5899.1. Samples: 383437024. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:44,995][25689] Avg episode reward: [(0, '-46.413')] [2022-07-09 18:47:46,646][26022] Updated weights on worker 0-0, policy_version 374462 (0.00088) [2022-07-09 18:47:48,429][26022] Updated weights on worker 0-0, policy_version 374472 (0.00092) [2022-07-09 18:47:50,058][25689] Fps is (10 sec: 5752.2, 60 sec: 5661.0, 300 sec: 5658.4). Total num frames: 383468544. Throughput: 0: 5915.6. Samples: 383471418. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:50,058][25689] Avg episode reward: [(0, '-45.611')] [2022-07-09 18:47:50,265][26022] Updated weights on worker 0-0, policy_version 374482 (0.00087) [2022-07-09 18:47:51,951][26022] Updated weights on worker 0-0, policy_version 374492 (0.00090) [2022-07-09 18:47:53,847][26022] Updated weights on worker 0-0, policy_version 374502 (0.00087) [2022-07-09 18:47:55,114][25689] Fps is (10 sec: 5666.4, 60 sec: 5639.4, 300 sec: 5657.4). Total num frames: 383497216. Throughput: 0: 5918.3. Samples: 383505520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:47:55,116][25689] Avg episode reward: [(0, '-45.537')] [2022-07-09 18:47:55,428][26022] Updated weights on worker 0-0, policy_version 374512 (0.00087) [2022-07-09 18:47:57,627][26022] Updated weights on worker 0-0, policy_version 374522 (0.00096) [2022-07-09 18:47:59,037][26022] Updated weights on worker 0-0, policy_version 374532 (0.00087) [2022-07-09 18:48:00,150][25689] Fps is (10 sec: 5580.4, 60 sec: 5653.2, 300 sec: 5657.4). Total num frames: 383524864. Throughput: 0: 5081.5. Samples: 383522530. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:00,152][25689] Avg episode reward: [(0, '-46.565')] [2022-07-09 18:48:01,025][26022] Updated weights on worker 0-0, policy_version 374542 (0.00086) [2022-07-09 18:48:03,204][26022] Updated weights on worker 0-0, policy_version 374552 (0.00075) [2022-07-09 18:48:04,874][26022] Updated weights on worker 0-0, policy_version 374562 (0.00086) [2022-07-09 18:48:05,154][25689] Fps is (10 sec: 5609.6, 60 sec: 5688.9, 300 sec: 5666.6). Total num frames: 383553536. Throughput: 0: 5830.6. Samples: 383554568. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:05,155][25689] Avg episode reward: [(0, '-47.681')] [2022-07-09 18:48:06,833][26022] Updated weights on worker 0-0, policy_version 374572 (0.00084) [2022-07-09 18:48:08,427][26022] Updated weights on worker 0-0, policy_version 374582 (0.00098) [2022-07-09 18:48:10,251][25689] Fps is (10 sec: 5576.0, 60 sec: 5656.2, 300 sec: 5655.1). Total num frames: 383581184. Throughput: 0: 5808.0. Samples: 383588698. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:10,251][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 18:48:10,275][26022] Updated weights on worker 0-0, policy_version 374592 (0.00094) [2022-07-09 18:48:12,243][26022] Updated weights on worker 0-0, policy_version 374602 (0.00611) [2022-07-09 18:48:13,919][26022] Updated weights on worker 0-0, policy_version 374612 (0.00384) [2022-07-09 18:48:15,278][25689] Fps is (10 sec: 5563.1, 60 sec: 5642.7, 300 sec: 5652.3). Total num frames: 383609856. Throughput: 0: 4980.7. Samples: 383605950. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:15,279][25689] Avg episode reward: [(0, '-47.888')] [2022-07-09 18:48:15,685][26022] Updated weights on worker 0-0, policy_version 374622 (0.00086) [2022-07-09 18:48:17,359][26022] Updated weights on worker 0-0, policy_version 374632 (0.00089) [2022-07-09 18:48:19,135][26022] Updated weights on worker 0-0, policy_version 374642 (0.00097) [2022-07-09 18:48:20,291][25689] Fps is (10 sec: 5711.6, 60 sec: 5658.9, 300 sec: 5657.4). Total num frames: 383638528. Throughput: 0: 5862.0. Samples: 383640592. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:20,291][25689] Avg episode reward: [(0, '-47.769')] [2022-07-09 18:48:21,215][26022] Updated weights on worker 0-0, policy_version 374652 (0.00085) [2022-07-09 18:48:22,819][26022] Updated weights on worker 0-0, policy_version 374662 (0.00094) [2022-07-09 18:48:24,535][26022] Updated weights on worker 0-0, policy_version 374672 (0.00083) [2022-07-09 18:48:25,311][25689] Fps is (10 sec: 5715.8, 60 sec: 5659.6, 300 sec: 5651.3). Total num frames: 383667200. Throughput: 0: 5990.9. Samples: 383675322. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:25,312][25689] Avg episode reward: [(0, '-48.398')] [2022-07-09 18:48:26,396][26022] Updated weights on worker 0-0, policy_version 374682 (0.00092) [2022-07-09 18:48:28,116][26022] Updated weights on worker 0-0, policy_version 374692 (0.00100) [2022-07-09 18:48:30,148][26022] Updated weights on worker 0-0, policy_version 374702 (0.00097) [2022-07-09 18:48:30,390][25689] Fps is (10 sec: 5677.9, 60 sec: 5640.9, 300 sec: 5651.7). Total num frames: 383695872. Throughput: 0: 5132.9. Samples: 383692072. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 18:48:30,392][25689] Avg episode reward: [(0, '-48.069')] [2022-07-09 18:48:31,867][26022] Updated weights on worker 0-0, policy_version 374712 (0.00080) [2022-07-09 18:48:33,633][26022] Updated weights on worker 0-0, policy_version 374722 (0.00089) [2022-07-09 18:48:35,422][25689] Fps is (10 sec: 5671.3, 60 sec: 5661.9, 300 sec: 5655.0). Total num frames: 383724544. Throughput: 0: 5974.3. Samples: 383726294. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:48:35,424][25689] Avg episode reward: [(0, '-48.069')] [2022-07-09 18:48:35,616][26022] Updated weights on worker 0-0, policy_version 374732 (0.00091) [2022-07-09 18:48:37,107][26022] Updated weights on worker 0-0, policy_version 374742 (0.00085) [2022-07-09 18:48:39,116][26022] Updated weights on worker 0-0, policy_version 374752 (0.00100) [2022-07-09 18:48:40,475][25689] Fps is (10 sec: 5788.0, 60 sec: 5681.0, 300 sec: 5658.0). Total num frames: 383754240. Throughput: 0: 5962.8. Samples: 383760944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:48:40,475][25689] Avg episode reward: [(0, '-47.644')] [2022-07-09 18:48:40,738][26022] Updated weights on worker 0-0, policy_version 374762 (0.00087) [2022-07-09 18:48:42,481][26022] Updated weights on worker 0-0, policy_version 374772 (0.00083) [2022-07-09 18:48:44,537][26022] Updated weights on worker 0-0, policy_version 374782 (0.00086) [2022-07-09 18:48:45,499][25689] Fps is (10 sec: 5893.9, 60 sec: 5686.6, 300 sec: 5659.2). Total num frames: 383783936. Throughput: 0: 5083.2. Samples: 383777942. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:48:45,499][25689] Avg episode reward: [(0, '-48.264')] [2022-07-09 18:48:45,967][26022] Updated weights on worker 0-0, policy_version 374792 (0.00087) [2022-07-09 18:48:48,023][26022] Updated weights on worker 0-0, policy_version 374802 (0.00087) [2022-07-09 18:48:49,906][26022] Updated weights on worker 0-0, policy_version 374812 (0.00087) [2022-07-09 18:48:50,621][25689] Fps is (10 sec: 5550.8, 60 sec: 5647.3, 300 sec: 5658.0). Total num frames: 383810560. Throughput: 0: 5933.9. Samples: 383812118. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:48:50,621][25689] Avg episode reward: [(0, '-48.532')] [2022-07-09 18:48:51,559][26022] Updated weights on worker 0-0, policy_version 374822 (0.00086) [2022-07-09 18:48:53,711][26022] Updated weights on worker 0-0, policy_version 374832 (0.00085) [2022-07-09 18:48:55,127][26022] Updated weights on worker 0-0, policy_version 374842 (0.00090) [2022-07-09 18:48:55,641][25689] Fps is (10 sec: 5452.3, 60 sec: 5650.7, 300 sec: 5654.6). Total num frames: 383839232. Throughput: 0: 5915.0. Samples: 383845886. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:48:55,641][25689] Avg episode reward: [(0, '-48.579')] [2022-07-09 18:48:56,970][26022] Updated weights on worker 0-0, policy_version 374852 (0.00088) [2022-07-09 18:48:58,848][26022] Updated weights on worker 0-0, policy_version 374862 (0.00082) [2022-07-09 18:49:00,470][26022] Updated weights on worker 0-0, policy_version 374872 (0.00095) [2022-07-09 18:49:00,707][25689] Fps is (10 sec: 5888.3, 60 sec: 5698.6, 300 sec: 5670.7). Total num frames: 383869952. Throughput: 0: 5049.3. Samples: 383863104. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:00,708][25689] Avg episode reward: [(0, '-47.738')] [2022-07-09 18:49:03,019][26022] Updated weights on worker 0-0, policy_version 374882 (0.00094) [2022-07-09 18:49:04,488][26022] Updated weights on worker 0-0, policy_version 374892 (0.00093) [2022-07-09 18:49:05,756][25689] Fps is (10 sec: 5466.4, 60 sec: 5626.8, 300 sec: 5653.6). Total num frames: 383894528. Throughput: 0: 5787.5. Samples: 383895182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:05,757][25689] Avg episode reward: [(0, '-47.528')] [2022-07-09 18:49:06,471][26022] Updated weights on worker 0-0, policy_version 374902 (0.00094) [2022-07-09 18:49:08,148][26022] Updated weights on worker 0-0, policy_version 374912 (0.00092) [2022-07-09 18:49:10,002][26022] Updated weights on worker 0-0, policy_version 374922 (0.00091) [2022-07-09 18:49:10,829][25689] Fps is (10 sec: 5260.8, 60 sec: 5645.9, 300 sec: 5662.8). Total num frames: 383923200. Throughput: 0: 5797.3. Samples: 383929270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:10,829][25689] Avg episode reward: [(0, '-48.165')] [2022-07-09 18:49:11,841][26022] Updated weights on worker 0-0, policy_version 374932 (0.00087) [2022-07-09 18:49:13,565][26022] Updated weights on worker 0-0, policy_version 374942 (0.00095) [2022-07-09 18:49:15,347][26022] Updated weights on worker 0-0, policy_version 374952 (0.00086) [2022-07-09 18:49:15,868][25689] Fps is (10 sec: 5772.2, 60 sec: 5661.7, 300 sec: 5662.3). Total num frames: 383952896. Throughput: 0: 4966.0. Samples: 383946336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:15,869][25689] Avg episode reward: [(0, '-47.400')] [2022-07-09 18:49:17,214][26022] Updated weights on worker 0-0, policy_version 374962 (0.00088) [2022-07-09 18:49:18,948][26022] Updated weights on worker 0-0, policy_version 374972 (0.00093) [2022-07-09 18:49:20,878][25689] Fps is (10 sec: 5706.1, 60 sec: 5645.0, 300 sec: 5659.0). Total num frames: 383980544. Throughput: 0: 5822.9. Samples: 383980558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:20,879][25689] Avg episode reward: [(0, '-47.425')] [2022-07-09 18:49:21,001][26022] Updated weights on worker 0-0, policy_version 374982 (0.00085) [2022-07-09 18:49:22,677][26022] Updated weights on worker 0-0, policy_version 374992 (0.00086) [2022-07-09 18:49:22,762][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:49:22,780][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000374993_383992832.pth [2022-07-09 18:49:22,780][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000373003_381955072.pth [2022-07-09 18:49:24,453][26022] Updated weights on worker 0-0, policy_version 375002 (0.00087) [2022-07-09 18:49:25,902][25689] Fps is (10 sec: 5714.7, 60 sec: 5661.5, 300 sec: 5663.7). Total num frames: 384010240. Throughput: 0: 5945.5. Samples: 384014962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:25,903][25689] Avg episode reward: [(0, '-47.723')] [2022-07-09 18:49:26,240][26022] Updated weights on worker 0-0, policy_version 375012 (0.00123) [2022-07-09 18:49:27,887][26022] Updated weights on worker 0-0, policy_version 375022 (0.00478) [2022-07-09 18:49:30,048][26022] Updated weights on worker 0-0, policy_version 375032 (0.00086) [2022-07-09 18:49:31,027][25689] Fps is (10 sec: 5751.5, 60 sec: 5657.3, 300 sec: 5661.6). Total num frames: 384038912. Throughput: 0: 5077.3. Samples: 384031820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:31,027][25689] Avg episode reward: [(0, '-48.399')] [2022-07-09 18:49:31,611][26022] Updated weights on worker 0-0, policy_version 375042 (0.00091) [2022-07-09 18:49:33,408][26022] Updated weights on worker 0-0, policy_version 375052 (0.00084) [2022-07-09 18:49:35,137][26022] Updated weights on worker 0-0, policy_version 375062 (0.00627) [2022-07-09 18:49:36,039][25689] Fps is (10 sec: 5455.1, 60 sec: 5625.3, 300 sec: 5655.1). Total num frames: 384065536. Throughput: 0: 5919.0. Samples: 384065728. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:36,040][25689] Avg episode reward: [(0, '-49.221')] [2022-07-09 18:49:37,022][26022] Updated weights on worker 0-0, policy_version 375072 (0.00090) [2022-07-09 18:49:38,852][26022] Updated weights on worker 0-0, policy_version 375082 (0.00087) [2022-07-09 18:49:40,652][26022] Updated weights on worker 0-0, policy_version 375092 (0.00093) [2022-07-09 18:49:41,081][25689] Fps is (10 sec: 5703.2, 60 sec: 5643.2, 300 sec: 5661.3). Total num frames: 384096256. Throughput: 0: 5914.4. Samples: 384100046. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:41,082][25689] Avg episode reward: [(0, '-48.684')] [2022-07-09 18:49:42,370][26022] Updated weights on worker 0-0, policy_version 375102 (0.00083) [2022-07-09 18:49:44,260][26022] Updated weights on worker 0-0, policy_version 375112 (0.00090) [2022-07-09 18:49:45,976][26022] Updated weights on worker 0-0, policy_version 375122 (0.00086) [2022-07-09 18:49:46,102][25689] Fps is (10 sec: 5902.1, 60 sec: 5626.6, 300 sec: 5658.4). Total num frames: 384124928. Throughput: 0: 5923.7. Samples: 384134618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:46,104][25689] Avg episode reward: [(0, '-48.229')] [2022-07-09 18:49:47,719][26022] Updated weights on worker 0-0, policy_version 375132 (0.00089) [2022-07-09 18:49:49,479][26022] Updated weights on worker 0-0, policy_version 375142 (0.00081) [2022-07-09 18:49:51,162][25689] Fps is (10 sec: 5790.4, 60 sec: 5683.2, 300 sec: 5665.3). Total num frames: 384154624. Throughput: 0: 5967.4. Samples: 384151972. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:51,162][25689] Avg episode reward: [(0, '-47.569')] [2022-07-09 18:49:51,293][26022] Updated weights on worker 0-0, policy_version 375152 (0.00082) [2022-07-09 18:49:53,194][26022] Updated weights on worker 0-0, policy_version 375162 (0.00086) [2022-07-09 18:49:54,725][26022] Updated weights on worker 0-0, policy_version 375172 (0.00082) [2022-07-09 18:49:56,178][25689] Fps is (10 sec: 5691.0, 60 sec: 5666.6, 300 sec: 5661.6). Total num frames: 384182272. Throughput: 0: 6007.6. Samples: 384186716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:49:56,179][25689] Avg episode reward: [(0, '-47.235')] [2022-07-09 18:49:56,839][26022] Updated weights on worker 0-0, policy_version 375182 (0.00085) [2022-07-09 18:49:58,415][26022] Updated weights on worker 0-0, policy_version 375192 (0.00067) [2022-07-09 18:50:00,325][26022] Updated weights on worker 0-0, policy_version 375202 (0.00089) [2022-07-09 18:50:01,188][25689] Fps is (10 sec: 5719.1, 60 sec: 5654.9, 300 sec: 5672.0). Total num frames: 384211968. Throughput: 0: 6029.9. Samples: 384221290. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:01,189][25689] Avg episode reward: [(0, '-46.050')] [2022-07-09 18:50:01,869][26022] Updated weights on worker 0-0, policy_version 375212 (0.00085) [2022-07-09 18:50:04,189][26022] Updated weights on worker 0-0, policy_version 375222 (0.00081) [2022-07-09 18:50:05,862][26022] Updated weights on worker 0-0, policy_version 375232 (0.00085) [2022-07-09 18:50:06,205][25689] Fps is (10 sec: 5616.9, 60 sec: 5691.8, 300 sec: 5663.5). Total num frames: 384238592. Throughput: 0: 5062.0. Samples: 384236380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:06,206][25689] Avg episode reward: [(0, '-45.828')] [2022-07-09 18:50:07,599][26022] Updated weights on worker 0-0, policy_version 375242 (0.00085) [2022-07-09 18:50:09,435][26022] Updated weights on worker 0-0, policy_version 375252 (0.00082) [2022-07-09 18:50:11,246][25689] Fps is (10 sec: 5498.0, 60 sec: 5694.8, 300 sec: 5663.5). Total num frames: 384267264. Throughput: 0: 5918.2. Samples: 384270836. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:11,246][25689] Avg episode reward: [(0, '-46.336')] [2022-07-09 18:50:11,386][26022] Updated weights on worker 0-0, policy_version 375262 (0.00090) [2022-07-09 18:50:13,057][26022] Updated weights on worker 0-0, policy_version 375272 (0.00082) [2022-07-09 18:50:14,778][26022] Updated weights on worker 0-0, policy_version 375282 (0.00085) [2022-07-09 18:50:16,251][25689] Fps is (10 sec: 5708.1, 60 sec: 5681.0, 300 sec: 5663.5). Total num frames: 384295936. Throughput: 0: 5910.6. Samples: 384305360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:16,252][25689] Avg episode reward: [(0, '-46.366')] [2022-07-09 18:50:16,599][26022] Updated weights on worker 0-0, policy_version 375292 (0.00095) [2022-07-09 18:50:18,188][26022] Updated weights on worker 0-0, policy_version 375302 (0.00089) [2022-07-09 18:50:20,223][26022] Updated weights on worker 0-0, policy_version 375312 (0.00089) [2022-07-09 18:50:21,280][25689] Fps is (10 sec: 5919.2, 60 sec: 5730.2, 300 sec: 5666.6). Total num frames: 384326656. Throughput: 0: 5050.1. Samples: 384322752. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:21,280][25689] Avg episode reward: [(0, '-47.111')] [2022-07-09 18:50:22,097][26022] Updated weights on worker 0-0, policy_version 375322 (0.00095) [2022-07-09 18:50:23,687][26022] Updated weights on worker 0-0, policy_version 375332 (0.00084) [2022-07-09 18:50:25,717][26022] Updated weights on worker 0-0, policy_version 375342 (0.00091) [2022-07-09 18:50:26,310][25689] Fps is (10 sec: 5700.7, 60 sec: 5678.7, 300 sec: 5664.7). Total num frames: 384353280. Throughput: 0: 5994.1. Samples: 384356892. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:26,311][25689] Avg episode reward: [(0, '-47.314')] [2022-07-09 18:50:27,238][26022] Updated weights on worker 0-0, policy_version 375352 (0.00091) [2022-07-09 18:50:29,347][26022] Updated weights on worker 0-0, policy_version 375362 (0.00101) [2022-07-09 18:50:31,130][26022] Updated weights on worker 0-0, policy_version 375372 (0.00096) [2022-07-09 18:50:31,379][25689] Fps is (10 sec: 5475.3, 60 sec: 5684.0, 300 sec: 5661.7). Total num frames: 384381952. Throughput: 0: 5954.3. Samples: 384390714. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:31,379][25689] Avg episode reward: [(0, '-46.753')] [2022-07-09 18:50:32,751][26022] Updated weights on worker 0-0, policy_version 375382 (0.00099) [2022-07-09 18:50:34,762][26022] Updated weights on worker 0-0, policy_version 375392 (0.00093) [2022-07-09 18:50:36,387][25689] Fps is (10 sec: 5589.4, 60 sec: 5701.4, 300 sec: 5656.9). Total num frames: 384409600. Throughput: 0: 5076.9. Samples: 384407586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:36,389][25689] Avg episode reward: [(0, '-46.689')] [2022-07-09 18:50:36,775][26022] Updated weights on worker 0-0, policy_version 375402 (0.00084) [2022-07-09 18:50:38,271][26022] Updated weights on worker 0-0, policy_version 375412 (0.00089) [2022-07-09 18:50:40,301][26022] Updated weights on worker 0-0, policy_version 375422 (0.00085) [2022-07-09 18:50:41,410][25689] Fps is (10 sec: 5716.5, 60 sec: 5686.2, 300 sec: 5663.5). Total num frames: 384439296. Throughput: 0: 5902.7. Samples: 384441576. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:41,412][25689] Avg episode reward: [(0, '-46.147')] [2022-07-09 18:50:41,827][26022] Updated weights on worker 0-0, policy_version 375432 (0.00085) [2022-07-09 18:50:43,710][26022] Updated weights on worker 0-0, policy_version 375442 (0.00089) [2022-07-09 18:50:45,438][26022] Updated weights on worker 0-0, policy_version 375452 (0.00095) [2022-07-09 18:50:46,413][25689] Fps is (10 sec: 5719.5, 60 sec: 5670.9, 300 sec: 5660.5). Total num frames: 384466944. Throughput: 0: 5939.3. Samples: 384476286. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:46,414][25689] Avg episode reward: [(0, '-45.871')] [2022-07-09 18:50:47,166][26022] Updated weights on worker 0-0, policy_version 375462 (0.00086) [2022-07-09 18:50:48,955][26022] Updated weights on worker 0-0, policy_version 375472 (0.00081) [2022-07-09 18:50:50,928][26022] Updated weights on worker 0-0, policy_version 375482 (0.00091) [2022-07-09 18:50:51,453][25689] Fps is (10 sec: 5608.0, 60 sec: 5655.7, 300 sec: 5660.0). Total num frames: 384495616. Throughput: 0: 5119.0. Samples: 384493472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:51,453][25689] Avg episode reward: [(0, '-45.971')] [2022-07-09 18:50:52,640][26022] Updated weights on worker 0-0, policy_version 375492 (0.00090) [2022-07-09 18:50:54,642][26022] Updated weights on worker 0-0, policy_version 375502 (0.00621) [2022-07-09 18:50:56,069][26022] Updated weights on worker 0-0, policy_version 375512 (0.00085) [2022-07-09 18:50:56,455][25689] Fps is (10 sec: 5914.1, 60 sec: 5708.1, 300 sec: 5667.1). Total num frames: 384526336. Throughput: 0: 5974.4. Samples: 384527484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:50:56,456][25689] Avg episode reward: [(0, '-46.621')] [2022-07-09 18:50:58,214][26022] Updated weights on worker 0-0, policy_version 375522 (0.00086) [2022-07-09 18:50:59,761][26022] Updated weights on worker 0-0, policy_version 375532 (0.00087) [2022-07-09 18:51:01,465][25689] Fps is (10 sec: 5727.6, 60 sec: 5657.1, 300 sec: 5671.1). Total num frames: 384552960. Throughput: 0: 5995.1. Samples: 384561806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:51:01,465][25689] Avg episode reward: [(0, '-47.811')] [2022-07-09 18:51:01,673][26022] Updated weights on worker 0-0, policy_version 375542 (0.00098) [2022-07-09 18:51:03,871][26022] Updated weights on worker 0-0, policy_version 375552 (0.00090) [2022-07-09 18:51:05,665][26022] Updated weights on worker 0-0, policy_version 375562 (0.00086) [2022-07-09 18:51:06,470][25689] Fps is (10 sec: 5316.8, 60 sec: 5658.2, 300 sec: 5661.3). Total num frames: 384579584. Throughput: 0: 5002.7. Samples: 384576628. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 18:51:06,471][25689] Avg episode reward: [(0, '-48.441')] [2022-07-09 18:51:07,323][26022] Updated weights on worker 0-0, policy_version 375572 (0.00089) [2022-07-09 18:51:09,360][26022] Updated weights on worker 0-0, policy_version 375582 (0.00096) [2022-07-09 18:51:10,779][26022] Updated weights on worker 0-0, policy_version 375592 (0.00081) [2022-07-09 18:51:11,524][25689] Fps is (10 sec: 5496.8, 60 sec: 5656.9, 300 sec: 5660.5). Total num frames: 384608256. Throughput: 0: 5858.8. Samples: 384611068. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:11,525][25689] Avg episode reward: [(0, '-48.806')] [2022-07-09 18:51:12,858][26022] Updated weights on worker 0-0, policy_version 375602 (0.00083) [2022-07-09 18:51:14,545][26022] Updated weights on worker 0-0, policy_version 375612 (0.00088) [2022-07-09 18:51:16,247][26022] Updated weights on worker 0-0, policy_version 375622 (0.00086) [2022-07-09 18:51:16,544][25689] Fps is (10 sec: 5794.1, 60 sec: 5672.6, 300 sec: 5663.6). Total num frames: 384637952. Throughput: 0: 5888.3. Samples: 384645772. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:16,544][25689] Avg episode reward: [(0, '-48.584')] [2022-07-09 18:51:18,180][26022] Updated weights on worker 0-0, policy_version 375632 (0.00089) [2022-07-09 18:51:19,895][26022] Updated weights on worker 0-0, policy_version 375642 (0.00086) [2022-07-09 18:51:21,548][25689] Fps is (10 sec: 5822.8, 60 sec: 5640.9, 300 sec: 5664.7). Total num frames: 384666624. Throughput: 0: 5032.8. Samples: 384662884. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:21,549][25689] Avg episode reward: [(0, '-47.770')] [2022-07-09 18:51:21,718][26022] Updated weights on worker 0-0, policy_version 375652 (0.00544) [2022-07-09 18:51:23,014][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:51:23,024][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000375659_384674816.pth [2022-07-09 18:51:23,024][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000373666_382633984.pth [2022-07-09 18:51:23,754][26022] Updated weights on worker 0-0, policy_version 375662 (0.00090) [2022-07-09 18:51:25,078][26022] Updated weights on worker 0-0, policy_version 375672 (0.00085) [2022-07-09 18:51:26,550][25689] Fps is (10 sec: 5628.3, 60 sec: 5660.6, 300 sec: 5658.9). Total num frames: 384694272. Throughput: 0: 6017.2. Samples: 384697454. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:26,551][25689] Avg episode reward: [(0, '-47.402')] [2022-07-09 18:51:27,241][26022] Updated weights on worker 0-0, policy_version 375682 (0.00084) [2022-07-09 18:51:28,626][26022] Updated weights on worker 0-0, policy_version 375692 (0.00078) [2022-07-09 18:51:30,598][26022] Updated weights on worker 0-0, policy_version 375702 (0.00082) [2022-07-09 18:51:31,607][25689] Fps is (10 sec: 5802.6, 60 sec: 5695.6, 300 sec: 5672.4). Total num frames: 384724992. Throughput: 0: 6018.5. Samples: 384731938. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:31,608][25689] Avg episode reward: [(0, '-47.334')] [2022-07-09 18:51:32,408][26022] Updated weights on worker 0-0, policy_version 375712 (0.00093) [2022-07-09 18:51:34,082][26022] Updated weights on worker 0-0, policy_version 375722 (0.00082) [2022-07-09 18:51:36,247][26022] Updated weights on worker 0-0, policy_version 375732 (0.00858) [2022-07-09 18:51:36,616][25689] Fps is (10 sec: 5798.5, 60 sec: 5695.5, 300 sec: 5669.1). Total num frames: 384752640. Throughput: 0: 5138.3. Samples: 384748912. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:36,617][25689] Avg episode reward: [(0, '-47.807')] [2022-07-09 18:51:37,744][26022] Updated weights on worker 0-0, policy_version 375742 (0.00094) [2022-07-09 18:51:39,620][26022] Updated weights on worker 0-0, policy_version 375752 (0.00089) [2022-07-09 18:51:41,415][26022] Updated weights on worker 0-0, policy_version 375762 (0.00073) [2022-07-09 18:51:41,628][25689] Fps is (10 sec: 5620.5, 60 sec: 5679.6, 300 sec: 5665.7). Total num frames: 384781312. Throughput: 0: 5987.0. Samples: 384783100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:41,628][25689] Avg episode reward: [(0, '-47.190')] [2022-07-09 18:51:43,146][26022] Updated weights on worker 0-0, policy_version 375772 (0.00094) [2022-07-09 18:51:44,947][26022] Updated weights on worker 0-0, policy_version 375782 (0.00082) [2022-07-09 18:51:46,656][25689] Fps is (10 sec: 5711.7, 60 sec: 5694.2, 300 sec: 5667.9). Total num frames: 384809984. Throughput: 0: 5979.2. Samples: 384817670. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:46,657][25689] Avg episode reward: [(0, '-47.148')] [2022-07-09 18:51:46,767][26022] Updated weights on worker 0-0, policy_version 375792 (0.00081) [2022-07-09 18:51:48,634][26022] Updated weights on worker 0-0, policy_version 375802 (0.00081) [2022-07-09 18:51:50,377][26022] Updated weights on worker 0-0, policy_version 375812 (0.00087) [2022-07-09 18:51:51,795][25689] Fps is (10 sec: 5640.3, 60 sec: 5684.9, 300 sec: 5661.9). Total num frames: 384838656. Throughput: 0: 5081.7. Samples: 384834524. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:51,795][25689] Avg episode reward: [(0, '-47.449')] [2022-07-09 18:51:52,187][26022] Updated weights on worker 0-0, policy_version 375822 (0.00089) [2022-07-09 18:51:54,033][26022] Updated weights on worker 0-0, policy_version 375832 (0.00089) [2022-07-09 18:51:55,692][26022] Updated weights on worker 0-0, policy_version 375842 (0.00078) [2022-07-09 18:51:56,863][25689] Fps is (10 sec: 5618.1, 60 sec: 5644.8, 300 sec: 5667.6). Total num frames: 384867328. Throughput: 0: 5917.4. Samples: 384868720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:51:56,864][25689] Avg episode reward: [(0, '-46.985')] [2022-07-09 18:51:57,561][26022] Updated weights on worker 0-0, policy_version 375852 (0.00095) [2022-07-09 18:51:59,375][26022] Updated weights on worker 0-0, policy_version 375862 (0.00085) [2022-07-09 18:52:01,188][26022] Updated weights on worker 0-0, policy_version 375872 (0.00095) [2022-07-09 18:52:01,915][25689] Fps is (10 sec: 5767.7, 60 sec: 5691.7, 300 sec: 5677.4). Total num frames: 384897024. Throughput: 0: 5931.2. Samples: 384903424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:01,915][25689] Avg episode reward: [(0, '-46.819')] [2022-07-09 18:52:03,326][26022] Updated weights on worker 0-0, policy_version 375882 (0.00088) [2022-07-09 18:52:04,979][26022] Updated weights on worker 0-0, policy_version 375892 (0.00085) [2022-07-09 18:52:06,991][25689] Fps is (10 sec: 5560.8, 60 sec: 5685.0, 300 sec: 5667.7). Total num frames: 384923648. Throughput: 0: 4956.5. Samples: 384918460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:06,993][25689] Avg episode reward: [(0, '-46.925')] [2022-07-09 18:52:07,001][26022] Updated weights on worker 0-0, policy_version 375902 (0.00085) [2022-07-09 18:52:08,491][26022] Updated weights on worker 0-0, policy_version 375912 (0.00083) [2022-07-09 18:52:10,562][26022] Updated weights on worker 0-0, policy_version 375922 (0.00086) [2022-07-09 18:52:12,120][25689] Fps is (10 sec: 5618.8, 60 sec: 5711.8, 300 sec: 5669.9). Total num frames: 384954368. Throughput: 0: 5829.9. Samples: 384953020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:12,121][25689] Avg episode reward: [(0, '-48.253')] [2022-07-09 18:52:12,122][26022] Updated weights on worker 0-0, policy_version 375932 (0.00088) [2022-07-09 18:52:14,281][26022] Updated weights on worker 0-0, policy_version 375942 (0.00087) [2022-07-09 18:52:15,779][26022] Updated weights on worker 0-0, policy_version 375952 (0.00090) [2022-07-09 18:52:17,144][25689] Fps is (10 sec: 5748.8, 60 sec: 5677.6, 300 sec: 5669.5). Total num frames: 384982016. Throughput: 0: 5845.2. Samples: 384987268. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:17,145][25689] Avg episode reward: [(0, '-48.462')] [2022-07-09 18:52:17,687][26022] Updated weights on worker 0-0, policy_version 375962 (0.00085) [2022-07-09 18:52:19,417][26022] Updated weights on worker 0-0, policy_version 375972 (0.00091) [2022-07-09 18:52:21,123][26022] Updated weights on worker 0-0, policy_version 375982 (0.00088) [2022-07-09 18:52:22,180][25689] Fps is (10 sec: 5598.5, 60 sec: 5674.7, 300 sec: 5669.4). Total num frames: 385010688. Throughput: 0: 4995.8. Samples: 385004666. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:22,181][25689] Avg episode reward: [(0, '-48.434')] [2022-07-09 18:52:22,928][26022] Updated weights on worker 0-0, policy_version 375992 (0.00085) [2022-07-09 18:52:24,642][26022] Updated weights on worker 0-0, policy_version 376002 (0.00101) [2022-07-09 18:52:26,656][26022] Updated weights on worker 0-0, policy_version 376012 (0.00078) [2022-07-09 18:52:27,228][25689] Fps is (10 sec: 5788.2, 60 sec: 5704.1, 300 sec: 5669.6). Total num frames: 385040384. Throughput: 0: 5973.6. Samples: 385039348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:27,229][25689] Avg episode reward: [(0, '-48.825')] [2022-07-09 18:52:28,332][26022] Updated weights on worker 0-0, policy_version 376022 (0.00084) [2022-07-09 18:52:30,028][26022] Updated weights on worker 0-0, policy_version 376032 (0.00081) [2022-07-09 18:52:31,945][26022] Updated weights on worker 0-0, policy_version 376042 (0.00215) [2022-07-09 18:52:32,315][25689] Fps is (10 sec: 5759.0, 60 sec: 5667.5, 300 sec: 5672.8). Total num frames: 385069056. Throughput: 0: 5977.1. Samples: 385073728. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:32,317][25689] Avg episode reward: [(0, '-48.066')] [2022-07-09 18:52:33,691][26022] Updated weights on worker 0-0, policy_version 376052 (0.00088) [2022-07-09 18:52:35,552][26022] Updated weights on worker 0-0, policy_version 376062 (0.00612) [2022-07-09 18:52:37,282][26022] Updated weights on worker 0-0, policy_version 376072 (0.00085) [2022-07-09 18:52:37,378][25689] Fps is (10 sec: 5649.8, 60 sec: 5679.4, 300 sec: 5673.1). Total num frames: 385097728. Throughput: 0: 5107.2. Samples: 385090606. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:37,379][25689] Avg episode reward: [(0, '-47.097')] [2022-07-09 18:52:39,032][26022] Updated weights on worker 0-0, policy_version 376082 (0.00089) [2022-07-09 18:52:40,805][26022] Updated weights on worker 0-0, policy_version 376092 (0.00081) [2022-07-09 18:52:42,457][25689] Fps is (10 sec: 5755.2, 60 sec: 5689.9, 300 sec: 5673.2). Total num frames: 385127424. Throughput: 0: 5947.0. Samples: 385125252. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:42,458][25689] Avg episode reward: [(0, '-47.395')] [2022-07-09 18:52:42,627][26022] Updated weights on worker 0-0, policy_version 376102 (0.00084) [2022-07-09 18:52:44,428][26022] Updated weights on worker 0-0, policy_version 376112 (0.00091) [2022-07-09 18:52:46,274][26022] Updated weights on worker 0-0, policy_version 376122 (0.00092) [2022-07-09 18:52:47,483][25689] Fps is (10 sec: 5675.1, 60 sec: 5673.3, 300 sec: 5670.4). Total num frames: 385155072. Throughput: 0: 5940.3. Samples: 385159664. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:47,483][25689] Avg episode reward: [(0, '-47.230')] [2022-07-09 18:52:47,798][26022] Updated weights on worker 0-0, policy_version 376132 (0.00085) [2022-07-09 18:52:49,935][26022] Updated weights on worker 0-0, policy_version 376142 (0.00087) [2022-07-09 18:52:51,610][26022] Updated weights on worker 0-0, policy_version 376152 (0.00094) [2022-07-09 18:52:52,536][25689] Fps is (10 sec: 5689.3, 60 sec: 5698.1, 300 sec: 5673.9). Total num frames: 385184768. Throughput: 0: 5945.6. Samples: 385193954. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:52,537][25689] Avg episode reward: [(0, '-46.735')] [2022-07-09 18:52:53,570][26022] Updated weights on worker 0-0, policy_version 376162 (0.00055) [2022-07-09 18:52:55,085][26022] Updated weights on worker 0-0, policy_version 376172 (0.00091) [2022-07-09 18:52:57,115][26022] Updated weights on worker 0-0, policy_version 376182 (0.00091) [2022-07-09 18:52:57,546][25689] Fps is (10 sec: 5698.1, 60 sec: 5686.7, 300 sec: 5674.4). Total num frames: 385212416. Throughput: 0: 5968.8. Samples: 385210986. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:52:57,547][25689] Avg episode reward: [(0, '-47.490')] [2022-07-09 18:52:58,688][26022] Updated weights on worker 0-0, policy_version 376192 (0.00088) [2022-07-09 18:53:00,640][26022] Updated weights on worker 0-0, policy_version 376202 (0.00090) [2022-07-09 18:53:02,535][26022] Updated weights on worker 0-0, policy_version 376212 (0.00089) [2022-07-09 18:53:02,562][25689] Fps is (10 sec: 5617.7, 60 sec: 5673.2, 300 sec: 5674.2). Total num frames: 385241088. Throughput: 0: 5951.5. Samples: 385244906. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:02,564][25689] Avg episode reward: [(0, '-47.705')] [2022-07-09 18:53:04,612][26022] Updated weights on worker 0-0, policy_version 376222 (0.00095) [2022-07-09 18:53:06,375][26022] Updated weights on worker 0-0, policy_version 376232 (0.00086) [2022-07-09 18:53:07,586][25689] Fps is (10 sec: 5507.9, 60 sec: 5678.1, 300 sec: 5672.1). Total num frames: 385267712. Throughput: 0: 5854.4. Samples: 385277356. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:07,586][25689] Avg episode reward: [(0, '-47.723')] [2022-07-09 18:53:08,216][26022] Updated weights on worker 0-0, policy_version 376242 (0.00087) [2022-07-09 18:53:09,941][26022] Updated weights on worker 0-0, policy_version 376252 (0.00084) [2022-07-09 18:53:11,787][26022] Updated weights on worker 0-0, policy_version 376262 (0.00082) [2022-07-09 18:53:12,645][25689] Fps is (10 sec: 5483.7, 60 sec: 5650.8, 300 sec: 5671.5). Total num frames: 385296384. Throughput: 0: 4997.9. Samples: 385294458. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:12,646][25689] Avg episode reward: [(0, '-47.234')] [2022-07-09 18:53:13,635][26022] Updated weights on worker 0-0, policy_version 376272 (0.00079) [2022-07-09 18:53:15,409][26022] Updated weights on worker 0-0, policy_version 376282 (0.00092) [2022-07-09 18:53:16,949][26022] Updated weights on worker 0-0, policy_version 376292 (0.00090) [2022-07-09 18:53:17,657][25689] Fps is (10 sec: 5795.7, 60 sec: 5685.8, 300 sec: 5675.0). Total num frames: 385326080. Throughput: 0: 5858.0. Samples: 385328792. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:17,657][25689] Avg episode reward: [(0, '-48.192')] [2022-07-09 18:53:19,043][26022] Updated weights on worker 0-0, policy_version 376302 (0.00091) [2022-07-09 18:53:20,554][26022] Updated weights on worker 0-0, policy_version 376312 (0.00085) [2022-07-09 18:53:22,603][26022] Updated weights on worker 0-0, policy_version 376322 (0.00088) [2022-07-09 18:53:22,748][25689] Fps is (10 sec: 5675.9, 60 sec: 5663.7, 300 sec: 5670.2). Total num frames: 385353728. Throughput: 0: 5859.8. Samples: 385363196. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:22,750][25689] Avg episode reward: [(0, '-47.366')] [2022-07-09 18:53:23,332][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:53:23,346][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000376326_385357824.pth [2022-07-09 18:53:23,346][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000374330_383313920.pth [2022-07-09 18:53:24,273][26022] Updated weights on worker 0-0, policy_version 376332 (0.00087) [2022-07-09 18:53:26,200][26022] Updated weights on worker 0-0, policy_version 376342 (0.00086) [2022-07-09 18:53:27,790][25689] Fps is (10 sec: 5658.7, 60 sec: 5664.3, 300 sec: 5674.3). Total num frames: 385383424. Throughput: 0: 5104.5. Samples: 385380490. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:27,791][25689] Avg episode reward: [(0, '-47.445')] [2022-07-09 18:53:27,813][26022] Updated weights on worker 0-0, policy_version 376352 (0.00087) [2022-07-09 18:53:29,694][26022] Updated weights on worker 0-0, policy_version 376362 (0.00080) [2022-07-09 18:53:31,440][26022] Updated weights on worker 0-0, policy_version 376372 (0.00090) [2022-07-09 18:53:32,859][25689] Fps is (10 sec: 5671.8, 60 sec: 5649.1, 300 sec: 5670.2). Total num frames: 385411072. Throughput: 0: 5934.4. Samples: 385414412. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:32,861][25689] Avg episode reward: [(0, '-47.553')] [2022-07-09 18:53:33,371][26022] Updated weights on worker 0-0, policy_version 376382 (0.00087) [2022-07-09 18:53:35,240][26022] Updated weights on worker 0-0, policy_version 376392 (0.00090) [2022-07-09 18:53:36,733][26022] Updated weights on worker 0-0, policy_version 376402 (0.00085) [2022-07-09 18:53:37,879][25689] Fps is (10 sec: 5582.8, 60 sec: 5653.1, 300 sec: 5667.4). Total num frames: 385439744. Throughput: 0: 5928.9. Samples: 385448686. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:37,879][25689] Avg episode reward: [(0, '-47.650')] [2022-07-09 18:53:38,680][26022] Updated weights on worker 0-0, policy_version 376412 (0.00085) [2022-07-09 18:53:40,542][26022] Updated weights on worker 0-0, policy_version 376422 (0.00083) [2022-07-09 18:53:42,291][26022] Updated weights on worker 0-0, policy_version 376432 (0.00091) [2022-07-09 18:53:42,885][25689] Fps is (10 sec: 5923.5, 60 sec: 5676.8, 300 sec: 5671.2). Total num frames: 385470464. Throughput: 0: 5099.2. Samples: 385465878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 18:53:42,886][25689] Avg episode reward: [(0, '-47.650')] [2022-07-09 18:53:44,091][26022] Updated weights on worker 0-0, policy_version 376442 (0.00082) [2022-07-09 18:53:45,735][26022] Updated weights on worker 0-0, policy_version 376452 (0.00081) [2022-07-09 18:53:47,697][26022] Updated weights on worker 0-0, policy_version 376462 (0.00098) [2022-07-09 18:53:47,933][25689] Fps is (10 sec: 5805.4, 60 sec: 5674.8, 300 sec: 5676.0). Total num frames: 385498112. Throughput: 0: 5953.9. Samples: 385500416. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:53:47,933][25689] Avg episode reward: [(0, '-47.492')] [2022-07-09 18:53:49,523][26022] Updated weights on worker 0-0, policy_version 376472 (0.00088) [2022-07-09 18:53:51,377][26022] Updated weights on worker 0-0, policy_version 376482 (0.00087) [2022-07-09 18:53:53,001][25689] Fps is (10 sec: 5567.3, 60 sec: 5656.4, 300 sec: 5675.1). Total num frames: 385526784. Throughput: 0: 5933.6. Samples: 385533930. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:53:53,002][25689] Avg episode reward: [(0, '-47.663')] [2022-07-09 18:53:53,097][26022] Updated weights on worker 0-0, policy_version 376492 (0.00097) [2022-07-09 18:53:54,974][26022] Updated weights on worker 0-0, policy_version 376502 (0.00084) [2022-07-09 18:53:56,754][26022] Updated weights on worker 0-0, policy_version 376512 (0.00086) [2022-07-09 18:53:58,059][25689] Fps is (10 sec: 5663.2, 60 sec: 5669.0, 300 sec: 5668.4). Total num frames: 385555456. Throughput: 0: 5075.2. Samples: 385551102. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:53:58,059][25689] Avg episode reward: [(0, '-48.553')] [2022-07-09 18:53:58,630][26022] Updated weights on worker 0-0, policy_version 376522 (0.00850) [2022-07-09 18:54:00,258][26022] Updated weights on worker 0-0, policy_version 376532 (0.00084) [2022-07-09 18:54:02,361][26022] Updated weights on worker 0-0, policy_version 376542 (0.00081) [2022-07-09 18:54:03,117][25689] Fps is (10 sec: 5466.3, 60 sec: 5631.1, 300 sec: 5675.1). Total num frames: 385582080. Throughput: 0: 5918.6. Samples: 385585622. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:03,122][25689] Avg episode reward: [(0, '-48.020')] [2022-07-09 18:54:04,130][26022] Updated weights on worker 0-0, policy_version 376552 (0.00101) [2022-07-09 18:54:05,989][26022] Updated weights on worker 0-0, policy_version 376562 (0.00090) [2022-07-09 18:54:07,882][26022] Updated weights on worker 0-0, policy_version 376572 (0.00087) [2022-07-09 18:54:08,149][25689] Fps is (10 sec: 5581.3, 60 sec: 5681.0, 300 sec: 5679.3). Total num frames: 385611776. Throughput: 0: 5811.5. Samples: 385617904. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:08,150][25689] Avg episode reward: [(0, '-48.057')] [2022-07-09 18:54:09,563][26022] Updated weights on worker 0-0, policy_version 376582 (0.00084) [2022-07-09 18:54:11,165][26022] Updated weights on worker 0-0, policy_version 376592 (0.00080) [2022-07-09 18:54:13,199][26022] Updated weights on worker 0-0, policy_version 376602 (0.00098) [2022-07-09 18:54:13,229][25689] Fps is (10 sec: 5772.1, 60 sec: 5679.1, 300 sec: 5675.1). Total num frames: 385640448. Throughput: 0: 5001.0. Samples: 385635090. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:13,230][25689] Avg episode reward: [(0, '-47.796')] [2022-07-09 18:54:14,919][26022] Updated weights on worker 0-0, policy_version 376612 (0.00091) [2022-07-09 18:54:16,860][26022] Updated weights on worker 0-0, policy_version 376622 (0.00089) [2022-07-09 18:54:18,232][25689] Fps is (10 sec: 5687.6, 60 sec: 5663.1, 300 sec: 5678.7). Total num frames: 385669120. Throughput: 0: 5874.3. Samples: 385669604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:18,232][25689] Avg episode reward: [(0, '-47.314')] [2022-07-09 18:54:18,485][26022] Updated weights on worker 0-0, policy_version 376632 (0.00088) [2022-07-09 18:54:20,319][26022] Updated weights on worker 0-0, policy_version 376642 (0.00093) [2022-07-09 18:54:22,102][26022] Updated weights on worker 0-0, policy_version 376652 (0.00089) [2022-07-09 18:54:23,245][25689] Fps is (10 sec: 5725.1, 60 sec: 5687.3, 300 sec: 5675.4). Total num frames: 385697792. Throughput: 0: 5890.0. Samples: 385704178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:23,246][25689] Avg episode reward: [(0, '-47.515')] [2022-07-09 18:54:23,890][26022] Updated weights on worker 0-0, policy_version 376662 (0.00095) [2022-07-09 18:54:25,586][26022] Updated weights on worker 0-0, policy_version 376672 (0.00087) [2022-07-09 18:54:27,589][26022] Updated weights on worker 0-0, policy_version 376682 (0.00094) [2022-07-09 18:54:28,281][25689] Fps is (10 sec: 5706.0, 60 sec: 5670.9, 300 sec: 5677.1). Total num frames: 385726464. Throughput: 0: 5147.7. Samples: 385721538. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:28,283][25689] Avg episode reward: [(0, '-47.790')] [2022-07-09 18:54:29,114][26022] Updated weights on worker 0-0, policy_version 376692 (0.00092) [2022-07-09 18:54:31,153][26022] Updated weights on worker 0-0, policy_version 376702 (0.00086) [2022-07-09 18:54:32,998][26022] Updated weights on worker 0-0, policy_version 376712 (0.00088) [2022-07-09 18:54:33,328][25689] Fps is (10 sec: 5687.6, 60 sec: 5689.9, 300 sec: 5683.4). Total num frames: 385755136. Throughput: 0: 5999.1. Samples: 385755662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:33,329][25689] Avg episode reward: [(0, '-46.927')] [2022-07-09 18:54:34,542][26022] Updated weights on worker 0-0, policy_version 376722 (0.00086) [2022-07-09 18:54:36,707][26022] Updated weights on worker 0-0, policy_version 376732 (0.00892) [2022-07-09 18:54:38,211][26022] Updated weights on worker 0-0, policy_version 376742 (0.00093) [2022-07-09 18:54:38,339][25689] Fps is (10 sec: 5701.3, 60 sec: 5690.7, 300 sec: 5677.1). Total num frames: 385783808. Throughput: 0: 5954.8. Samples: 385789342. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:38,341][25689] Avg episode reward: [(0, '-48.114')] [2022-07-09 18:54:40,107][26022] Updated weights on worker 0-0, policy_version 376752 (0.00088) [2022-07-09 18:54:41,961][26022] Updated weights on worker 0-0, policy_version 376762 (0.00094) [2022-07-09 18:54:43,351][25689] Fps is (10 sec: 5619.2, 60 sec: 5639.5, 300 sec: 5673.8). Total num frames: 385811456. Throughput: 0: 5089.5. Samples: 385806502. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:43,351][25689] Avg episode reward: [(0, '-48.000')] [2022-07-09 18:54:43,599][26022] Updated weights on worker 0-0, policy_version 376772 (0.00087) [2022-07-09 18:54:45,443][26022] Updated weights on worker 0-0, policy_version 376782 (0.00082) [2022-07-09 18:54:47,236][26022] Updated weights on worker 0-0, policy_version 376792 (0.00090) [2022-07-09 18:54:48,355][25689] Fps is (10 sec: 5725.6, 60 sec: 5677.4, 300 sec: 5674.9). Total num frames: 385841152. Throughput: 0: 5948.4. Samples: 385840944. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:48,356][25689] Avg episode reward: [(0, '-48.142')] [2022-07-09 18:54:49,114][26022] Updated weights on worker 0-0, policy_version 376802 (0.00089) [2022-07-09 18:54:50,885][26022] Updated weights on worker 0-0, policy_version 376812 (0.00088) [2022-07-09 18:54:52,576][26022] Updated weights on worker 0-0, policy_version 376822 (0.00085) [2022-07-09 18:54:53,395][25689] Fps is (10 sec: 5811.3, 60 sec: 5680.1, 300 sec: 5677.9). Total num frames: 385869824. Throughput: 0: 5969.0. Samples: 385875440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:53,395][25689] Avg episode reward: [(0, '-47.501')] [2022-07-09 18:54:54,405][26022] Updated weights on worker 0-0, policy_version 376832 (0.00086) [2022-07-09 18:54:56,180][26022] Updated weights on worker 0-0, policy_version 376842 (0.00088) [2022-07-09 18:54:58,055][26022] Updated weights on worker 0-0, policy_version 376852 (0.00086) [2022-07-09 18:54:58,408][25689] Fps is (10 sec: 5805.9, 60 sec: 5701.2, 300 sec: 5677.8). Total num frames: 385899520. Throughput: 0: 5141.0. Samples: 385892514. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:54:58,409][25689] Avg episode reward: [(0, '-48.763')] [2022-07-09 18:54:59,934][26022] Updated weights on worker 0-0, policy_version 376862 (0.00093) [2022-07-09 18:55:01,684][26022] Updated weights on worker 0-0, policy_version 376872 (0.00091) [2022-07-09 18:55:03,455][25689] Fps is (10 sec: 5394.7, 60 sec: 5668.4, 300 sec: 5670.3). Total num frames: 385924096. Throughput: 0: 5982.3. Samples: 385926772. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:03,455][25689] Avg episode reward: [(0, '-49.032')] [2022-07-09 18:55:03,831][26022] Updated weights on worker 0-0, policy_version 376882 (0.00080) [2022-07-09 18:55:05,718][26022] Updated weights on worker 0-0, policy_version 376892 (0.00092) [2022-07-09 18:55:07,279][26022] Updated weights on worker 0-0, policy_version 376902 (0.00096) [2022-07-09 18:55:08,487][25689] Fps is (10 sec: 5181.4, 60 sec: 5634.4, 300 sec: 5667.0). Total num frames: 385951744. Throughput: 0: 5840.4. Samples: 385958524. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:08,488][25689] Avg episode reward: [(0, '-48.042')] [2022-07-09 18:55:09,519][26022] Updated weights on worker 0-0, policy_version 376912 (0.00083) [2022-07-09 18:55:10,893][26022] Updated weights on worker 0-0, policy_version 376922 (0.00087) [2022-07-09 18:55:13,005][26022] Updated weights on worker 0-0, policy_version 376932 (0.00090) [2022-07-09 18:55:13,592][25689] Fps is (10 sec: 5858.8, 60 sec: 5683.0, 300 sec: 5675.5). Total num frames: 385983488. Throughput: 0: 4951.0. Samples: 385975438. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:13,593][25689] Avg episode reward: [(0, '-49.168')] [2022-07-09 18:55:14,781][26022] Updated weights on worker 0-0, policy_version 376942 (0.00095) [2022-07-09 18:55:16,463][26022] Updated weights on worker 0-0, policy_version 376952 (0.00089) [2022-07-09 18:55:18,186][26022] Updated weights on worker 0-0, policy_version 376962 (0.00093) [2022-07-09 18:55:18,612][25689] Fps is (10 sec: 5764.8, 60 sec: 5647.4, 300 sec: 5661.9). Total num frames: 386010112. Throughput: 0: 5818.8. Samples: 386010076. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:18,612][25689] Avg episode reward: [(0, '-48.612')] [2022-07-09 18:55:19,985][26022] Updated weights on worker 0-0, policy_version 376972 (0.00094) [2022-07-09 18:55:21,583][26022] Updated weights on worker 0-0, policy_version 376982 (0.00087) [2022-07-09 18:55:23,617][25689] Fps is (10 sec: 5515.8, 60 sec: 5648.2, 300 sec: 5669.2). Total num frames: 386038784. Throughput: 0: 5849.3. Samples: 386044708. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:23,618][25689] Avg episode reward: [(0, '-48.462')] [2022-07-09 18:55:23,768][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:55:23,780][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000376992_386039808.pth [2022-07-09 18:55:23,781][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000374993_383992832.pth [2022-07-09 18:55:23,785][26022] Updated weights on worker 0-0, policy_version 376992 (0.00086) [2022-07-09 18:55:25,255][26022] Updated weights on worker 0-0, policy_version 377002 (0.00077) [2022-07-09 18:55:27,157][26022] Updated weights on worker 0-0, policy_version 377012 (0.00089) [2022-07-09 18:55:28,633][25689] Fps is (10 sec: 5824.8, 60 sec: 5667.1, 300 sec: 5673.7). Total num frames: 386068480. Throughput: 0: 5123.7. Samples: 386061744. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:28,633][25689] Avg episode reward: [(0, '-47.302')] [2022-07-09 18:55:28,859][26022] Updated weights on worker 0-0, policy_version 377022 (0.00093) [2022-07-09 18:55:30,783][26022] Updated weights on worker 0-0, policy_version 377032 (0.00091) [2022-07-09 18:55:32,633][26022] Updated weights on worker 0-0, policy_version 377042 (0.00090) [2022-07-09 18:55:33,675][25689] Fps is (10 sec: 5599.6, 60 sec: 5633.5, 300 sec: 5669.6). Total num frames: 386095104. Throughput: 0: 5987.6. Samples: 386095686. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:33,675][25689] Avg episode reward: [(0, '-47.401')] [2022-07-09 18:55:34,382][26022] Updated weights on worker 0-0, policy_version 377052 (0.00094) [2022-07-09 18:55:36,284][26022] Updated weights on worker 0-0, policy_version 377062 (0.00091) [2022-07-09 18:55:38,022][26022] Updated weights on worker 0-0, policy_version 377072 (0.00086) [2022-07-09 18:55:38,713][25689] Fps is (10 sec: 5587.3, 60 sec: 5648.1, 300 sec: 5669.3). Total num frames: 386124800. Throughput: 0: 5932.3. Samples: 386129318. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:38,715][25689] Avg episode reward: [(0, '-47.223')] [2022-07-09 18:55:39,911][26022] Updated weights on worker 0-0, policy_version 377082 (0.00090) [2022-07-09 18:55:41,748][26022] Updated weights on worker 0-0, policy_version 377092 (0.00088) [2022-07-09 18:55:43,378][26022] Updated weights on worker 0-0, policy_version 377102 (0.00086) [2022-07-09 18:55:43,748][25689] Fps is (10 sec: 5794.2, 60 sec: 5662.7, 300 sec: 5672.1). Total num frames: 386153472. Throughput: 0: 5042.5. Samples: 386146220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:43,750][25689] Avg episode reward: [(0, '-47.664')] [2022-07-09 18:55:45,451][26022] Updated weights on worker 0-0, policy_version 377112 (0.00089) [2022-07-09 18:55:47,111][26022] Updated weights on worker 0-0, policy_version 377122 (0.00094) [2022-07-09 18:55:48,766][25689] Fps is (10 sec: 5602.1, 60 sec: 5627.6, 300 sec: 5669.1). Total num frames: 386181120. Throughput: 0: 5882.7. Samples: 386180184. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:48,768][25689] Avg episode reward: [(0, '-47.911')] [2022-07-09 18:55:48,971][26022] Updated weights on worker 0-0, policy_version 377132 (0.00083) [2022-07-09 18:55:50,883][26022] Updated weights on worker 0-0, policy_version 377142 (0.00085) [2022-07-09 18:55:52,695][26022] Updated weights on worker 0-0, policy_version 377152 (0.00086) [2022-07-09 18:55:53,884][25689] Fps is (10 sec: 5556.5, 60 sec: 5620.2, 300 sec: 5660.0). Total num frames: 386209792. Throughput: 0: 5859.8. Samples: 386214110. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:53,885][25689] Avg episode reward: [(0, '-48.532')] [2022-07-09 18:55:54,607][26022] Updated weights on worker 0-0, policy_version 377162 (0.00096) [2022-07-09 18:55:56,182][26022] Updated weights on worker 0-0, policy_version 377172 (0.00085) [2022-07-09 18:55:58,143][26022] Updated weights on worker 0-0, policy_version 377182 (0.00087) [2022-07-09 18:55:58,903][25689] Fps is (10 sec: 5758.1, 60 sec: 5619.8, 300 sec: 5670.2). Total num frames: 386239488. Throughput: 0: 5884.3. Samples: 386248124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:55:58,903][25689] Avg episode reward: [(0, '-48.954')] [2022-07-09 18:55:59,971][26022] Updated weights on worker 0-0, policy_version 377192 (0.00091) [2022-07-09 18:56:01,927][26022] Updated weights on worker 0-0, policy_version 377202 (0.00083) [2022-07-09 18:56:03,699][26022] Updated weights on worker 0-0, policy_version 377212 (0.00079) [2022-07-09 18:56:03,929][25689] Fps is (10 sec: 5607.0, 60 sec: 5655.6, 300 sec: 5669.8). Total num frames: 386266112. Throughput: 0: 5861.5. Samples: 386264508. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:56:03,929][25689] Avg episode reward: [(0, '-49.053')] [2022-07-09 18:56:05,519][26022] Updated weights on worker 0-0, policy_version 377222 (0.00086) [2022-07-09 18:56:07,188][26022] Updated weights on worker 0-0, policy_version 377232 (0.00086) [2022-07-09 18:56:08,967][25689] Fps is (10 sec: 5392.8, 60 sec: 5655.1, 300 sec: 5666.7). Total num frames: 386293760. Throughput: 0: 5824.4. Samples: 386297840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:56:08,967][25689] Avg episode reward: [(0, '-48.842')] [2022-07-09 18:56:09,304][26022] Updated weights on worker 0-0, policy_version 377242 (0.00087) [2022-07-09 18:56:10,919][26022] Updated weights on worker 0-0, policy_version 377252 (0.00081) [2022-07-09 18:56:12,750][26022] Updated weights on worker 0-0, policy_version 377262 (0.00084) [2022-07-09 18:56:14,050][25689] Fps is (10 sec: 5564.4, 60 sec: 5606.2, 300 sec: 5662.0). Total num frames: 386322432. Throughput: 0: 5835.8. Samples: 386331794. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:56:14,051][25689] Avg episode reward: [(0, '-48.136')] [2022-07-09 18:56:14,652][26022] Updated weights on worker 0-0, policy_version 377272 (0.00086) [2022-07-09 18:56:16,229][26022] Updated weights on worker 0-0, policy_version 377282 (0.00060) [2022-07-09 18:56:18,228][26022] Updated weights on worker 0-0, policy_version 377292 (0.00094) [2022-07-09 18:56:19,081][25689] Fps is (10 sec: 5871.9, 60 sec: 5673.0, 300 sec: 5668.4). Total num frames: 386353152. Throughput: 0: 5002.9. Samples: 386349074. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:56:19,082][25689] Avg episode reward: [(0, '-47.286')] [2022-07-09 18:56:20,111][26022] Updated weights on worker 0-0, policy_version 377302 (0.00095) [2022-07-09 18:56:21,723][26022] Updated weights on worker 0-0, policy_version 377312 (0.00091) [2022-07-09 18:56:23,584][26022] Updated weights on worker 0-0, policy_version 377322 (0.00081) [2022-07-09 18:56:24,105][25689] Fps is (10 sec: 5703.0, 60 sec: 5637.3, 300 sec: 5664.5). Total num frames: 386379776. Throughput: 0: 5876.0. Samples: 386383066. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-09 18:56:24,106][25689] Avg episode reward: [(0, '-47.319')] [2022-07-09 18:56:25,158][26022] Updated weights on worker 0-0, policy_version 377332 (0.00092) [2022-07-09 18:56:27,430][26022] Updated weights on worker 0-0, policy_version 377342 (0.00092) [2022-07-09 18:56:28,846][26022] Updated weights on worker 0-0, policy_version 377352 (0.00096) [2022-07-09 18:56:29,117][25689] Fps is (10 sec: 5510.1, 60 sec: 5620.8, 300 sec: 5658.5). Total num frames: 386408448. Throughput: 0: 5932.3. Samples: 386417376. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:29,117][25689] Avg episode reward: [(0, '-46.429')] [2022-07-09 18:56:30,798][26022] Updated weights on worker 0-0, policy_version 377362 (0.00093) [2022-07-09 18:56:32,585][26022] Updated weights on worker 0-0, policy_version 377372 (0.00088) [2022-07-09 18:56:34,163][25689] Fps is (10 sec: 5803.7, 60 sec: 5671.2, 300 sec: 5664.7). Total num frames: 386438144. Throughput: 0: 5096.6. Samples: 386434298. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:34,163][25689] Avg episode reward: [(0, '-45.615')] [2022-07-09 18:56:34,411][26022] Updated weights on worker 0-0, policy_version 377382 (0.00087) [2022-07-09 18:56:36,286][26022] Updated weights on worker 0-0, policy_version 377392 (0.00089) [2022-07-09 18:56:37,914][26022] Updated weights on worker 0-0, policy_version 377402 (0.00091) [2022-07-09 18:56:39,175][25689] Fps is (10 sec: 5701.2, 60 sec: 5639.8, 300 sec: 5661.2). Total num frames: 386465792. Throughput: 0: 5957.3. Samples: 386468778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:39,175][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 18:56:39,700][26022] Updated weights on worker 0-0, policy_version 377412 (0.00087) [2022-07-09 18:56:41,585][26022] Updated weights on worker 0-0, policy_version 377422 (0.00086) [2022-07-09 18:56:43,223][26022] Updated weights on worker 0-0, policy_version 377432 (0.00084) [2022-07-09 18:56:44,179][25689] Fps is (10 sec: 5724.7, 60 sec: 5659.6, 300 sec: 5665.1). Total num frames: 386495488. Throughput: 0: 5991.3. Samples: 386503336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:44,186][25689] Avg episode reward: [(0, '-47.591')] [2022-07-09 18:56:45,075][26022] Updated weights on worker 0-0, policy_version 377442 (0.00087) [2022-07-09 18:56:46,786][26022] Updated weights on worker 0-0, policy_version 377452 (0.00082) [2022-07-09 18:56:48,528][26022] Updated weights on worker 0-0, policy_version 377462 (0.00090) [2022-07-09 18:56:49,191][25689] Fps is (10 sec: 5827.6, 60 sec: 5677.2, 300 sec: 5667.5). Total num frames: 386524160. Throughput: 0: 5138.2. Samples: 386520520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:49,192][25689] Avg episode reward: [(0, '-47.834')] [2022-07-09 18:56:50,331][26022] Updated weights on worker 0-0, policy_version 377472 (0.00083) [2022-07-09 18:56:52,181][26022] Updated weights on worker 0-0, policy_version 377482 (0.00088) [2022-07-09 18:56:54,013][26022] Updated weights on worker 0-0, policy_version 377492 (0.00095) [2022-07-09 18:56:54,293][25689] Fps is (10 sec: 5771.1, 60 sec: 5695.6, 300 sec: 5670.3). Total num frames: 386553856. Throughput: 0: 6008.4. Samples: 386555252. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:54,295][25689] Avg episode reward: [(0, '-48.112')] [2022-07-09 18:56:55,632][26022] Updated weights on worker 0-0, policy_version 377502 (0.00090) [2022-07-09 18:56:57,440][26022] Updated weights on worker 0-0, policy_version 377512 (0.00094) [2022-07-09 18:56:59,300][25689] Fps is (10 sec: 5672.3, 60 sec: 5662.8, 300 sec: 5664.3). Total num frames: 386581504. Throughput: 0: 6007.6. Samples: 386589684. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:56:59,302][25689] Avg episode reward: [(0, '-47.852')] [2022-07-09 18:56:59,444][26022] Updated weights on worker 0-0, policy_version 377522 (0.00090) [2022-07-09 18:57:00,948][26022] Updated weights on worker 0-0, policy_version 377532 (0.00285) [2022-07-09 18:57:03,379][26022] Updated weights on worker 0-0, policy_version 377542 (0.00094) [2022-07-09 18:57:04,348][25689] Fps is (10 sec: 5397.6, 60 sec: 5660.7, 300 sec: 5664.8). Total num frames: 386608128. Throughput: 0: 5032.0. Samples: 386604824. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:04,350][25689] Avg episode reward: [(0, '-49.274')] [2022-07-09 18:57:04,898][26022] Updated weights on worker 0-0, policy_version 377552 (0.00087) [2022-07-09 18:57:06,998][26022] Updated weights on worker 0-0, policy_version 377562 (0.00082) [2022-07-09 18:57:08,487][26022] Updated weights on worker 0-0, policy_version 377572 (0.00089) [2022-07-09 18:57:09,356][25689] Fps is (10 sec: 5498.7, 60 sec: 5680.4, 300 sec: 5660.3). Total num frames: 386636800. Throughput: 0: 5870.8. Samples: 386638908. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:09,358][25689] Avg episode reward: [(0, '-48.429')] [2022-07-09 18:57:10,680][26022] Updated weights on worker 0-0, policy_version 377582 (0.00622) [2022-07-09 18:57:12,220][26022] Updated weights on worker 0-0, policy_version 377592 (0.00082) [2022-07-09 18:57:14,127][26022] Updated weights on worker 0-0, policy_version 377602 (0.00087) [2022-07-09 18:57:14,420][25689] Fps is (10 sec: 5795.0, 60 sec: 5699.3, 300 sec: 5666.4). Total num frames: 386666496. Throughput: 0: 5855.0. Samples: 386673096. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:14,422][25689] Avg episode reward: [(0, '-48.887')] [2022-07-09 18:57:15,939][26022] Updated weights on worker 0-0, policy_version 377612 (0.00084) [2022-07-09 18:57:17,535][26022] Updated weights on worker 0-0, policy_version 377622 (0.00091) [2022-07-09 18:57:19,448][25689] Fps is (10 sec: 5682.4, 60 sec: 5648.7, 300 sec: 5663.1). Total num frames: 386694144. Throughput: 0: 4997.6. Samples: 386690374. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:19,449][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 18:57:19,480][26022] Updated weights on worker 0-0, policy_version 377632 (0.00087) [2022-07-09 18:57:21,179][26022] Updated weights on worker 0-0, policy_version 377642 (0.00349) [2022-07-09 18:57:22,898][26022] Updated weights on worker 0-0, policy_version 377652 (0.00085) [2022-07-09 18:57:24,125][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:57:24,140][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000377658_386721792.pth [2022-07-09 18:57:24,140][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000375659_384674816.pth [2022-07-09 18:57:24,491][25689] Fps is (10 sec: 5592.7, 60 sec: 5680.9, 300 sec: 5659.8). Total num frames: 386722816. Throughput: 0: 5958.0. Samples: 386724832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:24,491][25689] Avg episode reward: [(0, '-49.154')] [2022-07-09 18:57:24,817][26022] Updated weights on worker 0-0, policy_version 377662 (0.00084) [2022-07-09 18:57:26,392][26022] Updated weights on worker 0-0, policy_version 377672 (0.00085) [2022-07-09 18:57:28,363][26022] Updated weights on worker 0-0, policy_version 377682 (0.00100) [2022-07-09 18:57:29,506][25689] Fps is (10 sec: 5802.9, 60 sec: 5697.4, 300 sec: 5664.6). Total num frames: 386752512. Throughput: 0: 5976.8. Samples: 386759340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:29,507][25689] Avg episode reward: [(0, '-48.949')] [2022-07-09 18:57:30,095][26022] Updated weights on worker 0-0, policy_version 377692 (0.00096) [2022-07-09 18:57:31,996][26022] Updated weights on worker 0-0, policy_version 377702 (0.00086) [2022-07-09 18:57:33,687][26022] Updated weights on worker 0-0, policy_version 377712 (0.00078) [2022-07-09 18:57:34,570][25689] Fps is (10 sec: 5790.8, 60 sec: 5678.8, 300 sec: 5664.5). Total num frames: 386781184. Throughput: 0: 5132.6. Samples: 386776518. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:34,571][25689] Avg episode reward: [(0, '-48.629')] [2022-07-09 18:57:35,592][26022] Updated weights on worker 0-0, policy_version 377722 (0.00092) [2022-07-09 18:57:37,133][26022] Updated weights on worker 0-0, policy_version 377732 (0.00080) [2022-07-09 18:57:39,306][26022] Updated weights on worker 0-0, policy_version 377742 (0.00087) [2022-07-09 18:57:39,572][25689] Fps is (10 sec: 5697.0, 60 sec: 5696.7, 300 sec: 5662.6). Total num frames: 386809856. Throughput: 0: 5994.2. Samples: 386811002. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:39,572][25689] Avg episode reward: [(0, '-47.940')] [2022-07-09 18:57:40,857][26022] Updated weights on worker 0-0, policy_version 377752 (0.00090) [2022-07-09 18:57:42,723][26022] Updated weights on worker 0-0, policy_version 377762 (0.00057) [2022-07-09 18:57:44,308][26022] Updated weights on worker 0-0, policy_version 377772 (0.00090) [2022-07-09 18:57:44,586][25689] Fps is (10 sec: 5827.4, 60 sec: 5695.8, 300 sec: 5669.7). Total num frames: 386839552. Throughput: 0: 6004.9. Samples: 386845504. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:44,587][25689] Avg episode reward: [(0, '-47.300')] [2022-07-09 18:57:46,254][26022] Updated weights on worker 0-0, policy_version 377782 (0.00083) [2022-07-09 18:57:47,968][26022] Updated weights on worker 0-0, policy_version 377792 (0.00083) [2022-07-09 18:57:49,623][25689] Fps is (10 sec: 5705.2, 60 sec: 5676.4, 300 sec: 5663.1). Total num frames: 386867200. Throughput: 0: 5139.2. Samples: 386862724. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:49,624][25689] Avg episode reward: [(0, '-48.113')] [2022-07-09 18:57:49,783][26022] Updated weights on worker 0-0, policy_version 377802 (0.00087) [2022-07-09 18:57:51,348][26022] Updated weights on worker 0-0, policy_version 377812 (0.00083) [2022-07-09 18:57:53,339][26022] Updated weights on worker 0-0, policy_version 377822 (0.00078) [2022-07-09 18:57:54,710][25689] Fps is (10 sec: 5765.3, 60 sec: 5694.8, 300 sec: 5672.0). Total num frames: 386897920. Throughput: 0: 6012.9. Samples: 386897618. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:54,711][25689] Avg episode reward: [(0, '-48.378')] [2022-07-09 18:57:55,237][26022] Updated weights on worker 0-0, policy_version 377832 (0.00086) [2022-07-09 18:57:56,986][26022] Updated weights on worker 0-0, policy_version 377842 (0.00091) [2022-07-09 18:57:58,820][26022] Updated weights on worker 0-0, policy_version 377852 (0.00089) [2022-07-09 18:57:59,739][25689] Fps is (10 sec: 5871.1, 60 sec: 5709.7, 300 sec: 5671.7). Total num frames: 386926592. Throughput: 0: 5980.6. Samples: 386931614. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:57:59,739][25689] Avg episode reward: [(0, '-48.872')] [2022-07-09 18:58:00,377][26022] Updated weights on worker 0-0, policy_version 377862 (0.00083) [2022-07-09 18:58:02,608][26022] Updated weights on worker 0-0, policy_version 377872 (0.00092) [2022-07-09 18:58:04,559][26022] Updated weights on worker 0-0, policy_version 377882 (0.00087) [2022-07-09 18:58:04,750][25689] Fps is (10 sec: 5405.4, 60 sec: 5696.2, 300 sec: 5668.5). Total num frames: 386952192. Throughput: 0: 5115.4. Samples: 386948652. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:04,751][25689] Avg episode reward: [(0, '-49.577')] [2022-07-09 18:58:06,123][26022] Updated weights on worker 0-0, policy_version 377892 (0.00086) [2022-07-09 18:58:08,123][26022] Updated weights on worker 0-0, policy_version 377902 (0.00059) [2022-07-09 18:58:09,753][25689] Fps is (10 sec: 5521.8, 60 sec: 5713.7, 300 sec: 5673.0). Total num frames: 386981888. Throughput: 0: 5894.9. Samples: 386981388. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:09,755][25689] Avg episode reward: [(0, '-49.591')] [2022-07-09 18:58:09,756][26022] Updated weights on worker 0-0, policy_version 377912 (0.00100) [2022-07-09 18:58:11,715][26022] Updated weights on worker 0-0, policy_version 377922 (0.00083) [2022-07-09 18:58:13,372][26022] Updated weights on worker 0-0, policy_version 377932 (0.00093) [2022-07-09 18:58:14,811][25689] Fps is (10 sec: 5598.0, 60 sec: 5663.4, 300 sec: 5661.8). Total num frames: 387008512. Throughput: 0: 5848.2. Samples: 387015172. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:14,812][25689] Avg episode reward: [(0, '-49.573')] [2022-07-09 18:58:15,532][26022] Updated weights on worker 0-0, policy_version 377942 (0.00090) [2022-07-09 18:58:16,932][26022] Updated weights on worker 0-0, policy_version 377952 (0.00092) [2022-07-09 18:58:18,991][26022] Updated weights on worker 0-0, policy_version 377962 (0.00092) [2022-07-09 18:58:19,912][25689] Fps is (10 sec: 5644.7, 60 sec: 5707.3, 300 sec: 5672.0). Total num frames: 387039232. Throughput: 0: 4993.4. Samples: 387032346. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:19,912][25689] Avg episode reward: [(0, '-47.996')] [2022-07-09 18:58:20,424][26022] Updated weights on worker 0-0, policy_version 377972 (0.00093) [2022-07-09 18:58:22,462][26022] Updated weights on worker 0-0, policy_version 377982 (0.00089) [2022-07-09 18:58:24,329][26022] Updated weights on worker 0-0, policy_version 377992 (0.00088) [2022-07-09 18:58:24,917][25689] Fps is (10 sec: 5674.5, 60 sec: 5677.0, 300 sec: 5662.3). Total num frames: 387065856. Throughput: 0: 5849.3. Samples: 387066612. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:24,918][25689] Avg episode reward: [(0, '-48.230')] [2022-07-09 18:58:25,998][26022] Updated weights on worker 0-0, policy_version 378002 (0.00096) [2022-07-09 18:58:27,931][26022] Updated weights on worker 0-0, policy_version 378012 (0.00096) [2022-07-09 18:58:29,579][26022] Updated weights on worker 0-0, policy_version 378022 (0.00085) [2022-07-09 18:58:29,929][25689] Fps is (10 sec: 5622.2, 60 sec: 5677.3, 300 sec: 5670.3). Total num frames: 387095552. Throughput: 0: 5903.5. Samples: 387100500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:29,930][25689] Avg episode reward: [(0, '-48.129')] [2022-07-09 18:58:31,556][26022] Updated weights on worker 0-0, policy_version 378032 (0.00079) [2022-07-09 18:58:33,542][26022] Updated weights on worker 0-0, policy_version 378042 (0.00210) [2022-07-09 18:58:35,009][25689] Fps is (10 sec: 5783.5, 60 sec: 5675.8, 300 sec: 5669.2). Total num frames: 387124224. Throughput: 0: 5058.8. Samples: 387117348. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:35,010][25689] Avg episode reward: [(0, '-48.112')] [2022-07-09 18:58:35,072][26022] Updated weights on worker 0-0, policy_version 378052 (0.00093) [2022-07-09 18:58:37,085][26022] Updated weights on worker 0-0, policy_version 378062 (0.00085) [2022-07-09 18:58:38,682][26022] Updated weights on worker 0-0, policy_version 378072 (0.00087) [2022-07-09 18:58:40,030][25689] Fps is (10 sec: 5677.1, 60 sec: 5674.0, 300 sec: 5662.0). Total num frames: 387152896. Throughput: 0: 5935.9. Samples: 387151766. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:40,031][25689] Avg episode reward: [(0, '-47.540')] [2022-07-09 18:58:40,661][26022] Updated weights on worker 0-0, policy_version 378082 (0.00090) [2022-07-09 18:58:42,382][26022] Updated weights on worker 0-0, policy_version 378092 (0.00088) [2022-07-09 18:58:44,349][26022] Updated weights on worker 0-0, policy_version 378102 (0.00089) [2022-07-09 18:58:45,049][25689] Fps is (10 sec: 5609.4, 60 sec: 5639.7, 300 sec: 5662.5). Total num frames: 387180544. Throughput: 0: 5919.9. Samples: 387185794. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:45,051][25689] Avg episode reward: [(0, '-47.686')] [2022-07-09 18:58:45,751][26022] Updated weights on worker 0-0, policy_version 378112 (0.00083) [2022-07-09 18:58:47,987][26022] Updated weights on worker 0-0, policy_version 378122 (0.00084) [2022-07-09 18:58:49,254][26022] Updated weights on worker 0-0, policy_version 378132 (0.00087) [2022-07-09 18:58:50,062][25689] Fps is (10 sec: 5716.5, 60 sec: 5675.8, 300 sec: 5667.1). Total num frames: 387210240. Throughput: 0: 5091.8. Samples: 387203010. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:50,063][25689] Avg episode reward: [(0, '-47.055')] [2022-07-09 18:58:51,376][26022] Updated weights on worker 0-0, policy_version 378142 (0.00087) [2022-07-09 18:58:53,082][26022] Updated weights on worker 0-0, policy_version 378152 (0.00092) [2022-07-09 18:58:54,971][26022] Updated weights on worker 0-0, policy_version 378162 (0.00101) [2022-07-09 18:58:55,130][25689] Fps is (10 sec: 5790.0, 60 sec: 5643.7, 300 sec: 5666.8). Total num frames: 387238912. Throughput: 0: 5960.0. Samples: 387237270. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:58:55,132][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 18:58:56,530][26022] Updated weights on worker 0-0, policy_version 378172 (0.00816) [2022-07-09 18:58:58,616][26022] Updated weights on worker 0-0, policy_version 378182 (0.00089) [2022-07-09 18:59:00,002][26022] Updated weights on worker 0-0, policy_version 378192 (0.00086) [2022-07-09 18:59:00,224][25689] Fps is (10 sec: 5844.3, 60 sec: 5671.5, 300 sec: 5679.9). Total num frames: 387269632. Throughput: 0: 5939.2. Samples: 387271700. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 18:59:00,224][25689] Avg episode reward: [(0, '-46.418')] [2022-07-09 18:59:02,644][26022] Updated weights on worker 0-0, policy_version 378202 (0.00091) [2022-07-09 18:59:04,104][26022] Updated weights on worker 0-0, policy_version 378212 (0.00092) [2022-07-09 18:59:05,296][25689] Fps is (10 sec: 5439.5, 60 sec: 5648.9, 300 sec: 5662.0). Total num frames: 387294208. Throughput: 0: 5844.0. Samples: 387304114. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:05,297][25689] Avg episode reward: [(0, '-45.746')] [2022-07-09 18:59:06,185][26022] Updated weights on worker 0-0, policy_version 378222 (0.00098) [2022-07-09 18:59:07,802][26022] Updated weights on worker 0-0, policy_version 378232 (0.00059) [2022-07-09 18:59:09,770][26022] Updated weights on worker 0-0, policy_version 378242 (0.00086) [2022-07-09 18:59:10,310][25689] Fps is (10 sec: 5482.4, 60 sec: 5664.7, 300 sec: 5670.1). Total num frames: 387324928. Throughput: 0: 5842.3. Samples: 387321308. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:10,314][25689] Avg episode reward: [(0, '-46.226')] [2022-07-09 18:59:11,199][26022] Updated weights on worker 0-0, policy_version 378252 (0.00099) [2022-07-09 18:59:13,235][26022] Updated weights on worker 0-0, policy_version 378262 (0.00092) [2022-07-09 18:59:14,898][26022] Updated weights on worker 0-0, policy_version 378272 (0.00083) [2022-07-09 18:59:15,389][25689] Fps is (10 sec: 5783.2, 60 sec: 5679.7, 300 sec: 5665.2). Total num frames: 387352576. Throughput: 0: 5840.9. Samples: 387355596. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:15,391][25689] Avg episode reward: [(0, '-46.579')] [2022-07-09 18:59:16,835][26022] Updated weights on worker 0-0, policy_version 378282 (0.00094) [2022-07-09 18:59:18,591][26022] Updated weights on worker 0-0, policy_version 378292 (0.00087) [2022-07-09 18:59:20,303][26022] Updated weights on worker 0-0, policy_version 378302 (0.00449) [2022-07-09 18:59:20,403][25689] Fps is (10 sec: 5580.3, 60 sec: 5654.0, 300 sec: 5665.2). Total num frames: 387381248. Throughput: 0: 5861.3. Samples: 387389974. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:20,404][25689] Avg episode reward: [(0, '-46.817')] [2022-07-09 18:59:22,175][26022] Updated weights on worker 0-0, policy_version 378312 (0.00093) [2022-07-09 18:59:23,903][26022] Updated weights on worker 0-0, policy_version 378322 (0.00082) [2022-07-09 18:59:24,349][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 18:59:24,362][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000378324_387403776.pth [2022-07-09 18:59:24,363][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000376326_385357824.pth [2022-07-09 18:59:25,408][25689] Fps is (10 sec: 5723.3, 60 sec: 5687.8, 300 sec: 5665.8). Total num frames: 387409920. Throughput: 0: 5116.2. Samples: 387407012. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:25,411][25689] Avg episode reward: [(0, '-47.487')] [2022-07-09 18:59:25,756][26022] Updated weights on worker 0-0, policy_version 378332 (0.00090) [2022-07-09 18:59:27,527][26022] Updated weights on worker 0-0, policy_version 378342 (0.00092) [2022-07-09 18:59:29,363][26022] Updated weights on worker 0-0, policy_version 378352 (0.00104) [2022-07-09 18:59:30,448][25689] Fps is (10 sec: 5504.7, 60 sec: 5634.5, 300 sec: 5659.0). Total num frames: 387436544. Throughput: 0: 5970.0. Samples: 387441530. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:30,450][25689] Avg episode reward: [(0, '-47.507')] [2022-07-09 18:59:30,993][26022] Updated weights on worker 0-0, policy_version 378362 (0.00088) [2022-07-09 18:59:32,836][26022] Updated weights on worker 0-0, policy_version 378372 (0.00087) [2022-07-09 18:59:34,495][26022] Updated weights on worker 0-0, policy_version 378382 (0.00087) [2022-07-09 18:59:35,489][25689] Fps is (10 sec: 5789.7, 60 sec: 5688.9, 300 sec: 5668.8). Total num frames: 387468288. Throughput: 0: 5994.1. Samples: 387476082. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:35,492][25689] Avg episode reward: [(0, '-46.726')] [2022-07-09 18:59:36,576][26022] Updated weights on worker 0-0, policy_version 378392 (0.00086) [2022-07-09 18:59:38,164][26022] Updated weights on worker 0-0, policy_version 378402 (0.00092) [2022-07-09 18:59:40,131][26022] Updated weights on worker 0-0, policy_version 378412 (0.00091) [2022-07-09 18:59:40,505][25689] Fps is (10 sec: 5905.5, 60 sec: 5672.5, 300 sec: 5668.7). Total num frames: 387495936. Throughput: 0: 5128.2. Samples: 387493062. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:40,507][25689] Avg episode reward: [(0, '-47.590')] [2022-07-09 18:59:41,709][26022] Updated weights on worker 0-0, policy_version 378422 (0.00083) [2022-07-09 18:59:43,751][26022] Updated weights on worker 0-0, policy_version 378432 (0.00098) [2022-07-09 18:59:45,467][26022] Updated weights on worker 0-0, policy_version 378442 (0.00100) [2022-07-09 18:59:45,606][25689] Fps is (10 sec: 5668.4, 60 sec: 5698.6, 300 sec: 5666.9). Total num frames: 387525632. Throughput: 0: 5939.2. Samples: 387526972. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:45,606][25689] Avg episode reward: [(0, '-46.463')] [2022-07-09 18:59:47,314][26022] Updated weights on worker 0-0, policy_version 378452 (0.00086) [2022-07-09 18:59:49,042][26022] Updated weights on worker 0-0, policy_version 378462 (0.00092) [2022-07-09 18:59:50,613][25689] Fps is (10 sec: 5673.3, 60 sec: 5665.3, 300 sec: 5664.0). Total num frames: 387553280. Throughput: 0: 5925.3. Samples: 387561012. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:50,615][25689] Avg episode reward: [(0, '-46.147')] [2022-07-09 18:59:50,928][26022] Updated weights on worker 0-0, policy_version 378472 (0.00092) [2022-07-09 18:59:52,701][26022] Updated weights on worker 0-0, policy_version 378482 (0.00100) [2022-07-09 18:59:54,567][26022] Updated weights on worker 0-0, policy_version 378492 (0.00059) [2022-07-09 18:59:55,684][25689] Fps is (10 sec: 5588.2, 60 sec: 5665.0, 300 sec: 5659.5). Total num frames: 387581952. Throughput: 0: 5049.8. Samples: 387578062. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 18:59:55,685][25689] Avg episode reward: [(0, '-45.812')] [2022-07-09 18:59:56,449][26022] Updated weights on worker 0-0, policy_version 378502 (0.00087) [2022-07-09 18:59:58,196][26022] Updated weights on worker 0-0, policy_version 378512 (0.00085) [2022-07-09 18:59:59,962][26022] Updated weights on worker 0-0, policy_version 378522 (0.00084) [2022-07-09 19:00:00,746][25689] Fps is (10 sec: 5760.1, 60 sec: 5651.1, 300 sec: 5676.4). Total num frames: 387611648. Throughput: 0: 5879.8. Samples: 387612074. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:00,746][25689] Avg episode reward: [(0, '-46.609')] [2022-07-09 19:00:02,210][26022] Updated weights on worker 0-0, policy_version 378532 (0.00097) [2022-07-09 19:00:03,783][26022] Updated weights on worker 0-0, policy_version 378542 (0.00084) [2022-07-09 19:00:05,636][26022] Updated weights on worker 0-0, policy_version 378552 (0.00094) [2022-07-09 19:00:05,774][25689] Fps is (10 sec: 5480.6, 60 sec: 5672.2, 300 sec: 5669.6). Total num frames: 387637248. Throughput: 0: 5822.4. Samples: 387644396. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:05,774][25689] Avg episode reward: [(0, '-47.097')] [2022-07-09 19:00:07,534][26022] Updated weights on worker 0-0, policy_version 378562 (0.00087) [2022-07-09 19:00:09,156][26022] Updated weights on worker 0-0, policy_version 378572 (0.00089) [2022-07-09 19:00:10,804][25689] Fps is (10 sec: 5294.2, 60 sec: 5619.9, 300 sec: 5657.3). Total num frames: 387664896. Throughput: 0: 4971.1. Samples: 387661384. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:10,804][25689] Avg episode reward: [(0, '-47.216')] [2022-07-09 19:00:11,264][26022] Updated weights on worker 0-0, policy_version 378582 (0.00087) [2022-07-09 19:00:12,814][26022] Updated weights on worker 0-0, policy_version 378592 (0.00099) [2022-07-09 19:00:14,844][26022] Updated weights on worker 0-0, policy_version 378602 (0.00090) [2022-07-09 19:00:15,877][25689] Fps is (10 sec: 5777.2, 60 sec: 5671.2, 300 sec: 5670.0). Total num frames: 387695616. Throughput: 0: 5796.1. Samples: 387695100. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:15,877][25689] Avg episode reward: [(0, '-47.651')] [2022-07-09 19:00:16,766][26022] Updated weights on worker 0-0, policy_version 378612 (0.00082) [2022-07-09 19:00:18,426][26022] Updated weights on worker 0-0, policy_version 378622 (0.00079) [2022-07-09 19:00:20,343][26022] Updated weights on worker 0-0, policy_version 378632 (0.00260) [2022-07-09 19:00:20,941][25689] Fps is (10 sec: 5656.7, 60 sec: 5632.7, 300 sec: 5662.0). Total num frames: 387722240. Throughput: 0: 5808.9. Samples: 387729386. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:20,941][25689] Avg episode reward: [(0, '-48.232')] [2022-07-09 19:00:21,943][26022] Updated weights on worker 0-0, policy_version 378642 (0.00089) [2022-07-09 19:00:23,778][26022] Updated weights on worker 0-0, policy_version 378652 (0.00090) [2022-07-09 19:00:25,639][26022] Updated weights on worker 0-0, policy_version 378662 (0.00089) [2022-07-09 19:00:25,955][25689] Fps is (10 sec: 5588.4, 60 sec: 5648.8, 300 sec: 5662.1). Total num frames: 387751936. Throughput: 0: 5053.5. Samples: 387746382. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:25,955][25689] Avg episode reward: [(0, '-48.167')] [2022-07-09 19:00:27,333][26022] Updated weights on worker 0-0, policy_version 378672 (0.00084) [2022-07-09 19:00:29,236][26022] Updated weights on worker 0-0, policy_version 378682 (0.00086) [2022-07-09 19:00:30,879][26022] Updated weights on worker 0-0, policy_version 378692 (0.00091) [2022-07-09 19:00:31,045][25689] Fps is (10 sec: 5878.2, 60 sec: 5694.8, 300 sec: 5671.5). Total num frames: 387781632. Throughput: 0: 5883.5. Samples: 387780472. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:31,045][25689] Avg episode reward: [(0, '-48.449')] [2022-07-09 19:00:32,714][26022] Updated weights on worker 0-0, policy_version 378702 (0.00079) [2022-07-09 19:00:34,555][26022] Updated weights on worker 0-0, policy_version 378712 (0.00097) [2022-07-09 19:00:36,107][25689] Fps is (10 sec: 5547.8, 60 sec: 5608.4, 300 sec: 5660.7). Total num frames: 387808256. Throughput: 0: 5914.6. Samples: 387814752. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:36,107][25689] Avg episode reward: [(0, '-48.364')] [2022-07-09 19:00:36,540][26022] Updated weights on worker 0-0, policy_version 378722 (0.00090) [2022-07-09 19:00:38,191][26022] Updated weights on worker 0-0, policy_version 378732 (0.00096) [2022-07-09 19:00:40,018][26022] Updated weights on worker 0-0, policy_version 378742 (0.00085) [2022-07-09 19:00:41,116][25689] Fps is (10 sec: 5693.7, 60 sec: 5659.7, 300 sec: 5668.1). Total num frames: 387838976. Throughput: 0: 5082.6. Samples: 387831928. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:41,117][25689] Avg episode reward: [(0, '-47.868')] [2022-07-09 19:00:41,655][26022] Updated weights on worker 0-0, policy_version 378752 (0.00093) [2022-07-09 19:00:43,656][26022] Updated weights on worker 0-0, policy_version 378762 (0.00089) [2022-07-09 19:00:45,553][26022] Updated weights on worker 0-0, policy_version 378772 (0.00090) [2022-07-09 19:00:46,124][25689] Fps is (10 sec: 5724.5, 60 sec: 5617.6, 300 sec: 5664.8). Total num frames: 387865600. Throughput: 0: 5942.5. Samples: 387866236. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:46,124][25689] Avg episode reward: [(0, '-47.986')] [2022-07-09 19:00:47,120][26022] Updated weights on worker 0-0, policy_version 378782 (0.00086) [2022-07-09 19:00:48,943][26022] Updated weights on worker 0-0, policy_version 378792 (0.00086) [2022-07-09 19:00:50,939][26022] Updated weights on worker 0-0, policy_version 378802 (0.00085) [2022-07-09 19:00:51,131][25689] Fps is (10 sec: 5623.7, 60 sec: 5651.5, 300 sec: 5670.4). Total num frames: 387895296. Throughput: 0: 5965.1. Samples: 387900288. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:51,131][25689] Avg episode reward: [(0, '-47.559')] [2022-07-09 19:00:52,464][26022] Updated weights on worker 0-0, policy_version 378812 (0.00092) [2022-07-09 19:00:54,319][26022] Updated weights on worker 0-0, policy_version 378822 (0.00086) [2022-07-09 19:00:55,886][26022] Updated weights on worker 0-0, policy_version 378832 (0.00085) [2022-07-09 19:00:56,250][25689] Fps is (10 sec: 5764.0, 60 sec: 5647.0, 300 sec: 5665.0). Total num frames: 387923968. Throughput: 0: 5094.0. Samples: 387917364. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:00:56,251][25689] Avg episode reward: [(0, '-47.802')] [2022-07-09 19:00:58,039][26022] Updated weights on worker 0-0, policy_version 378842 (0.00080) [2022-07-09 19:00:59,763][26022] Updated weights on worker 0-0, policy_version 378852 (0.00089) [2022-07-09 19:01:01,304][25689] Fps is (10 sec: 5737.2, 60 sec: 5647.7, 300 sec: 5674.8). Total num frames: 387953664. Throughput: 0: 5926.7. Samples: 387951578. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:01,305][25689] Avg episode reward: [(0, '-47.481')] [2022-07-09 19:01:01,412][26022] Updated weights on worker 0-0, policy_version 378862 (0.00080) [2022-07-09 19:01:03,807][26022] Updated weights on worker 0-0, policy_version 378872 (0.00084) [2022-07-09 19:01:05,394][26022] Updated weights on worker 0-0, policy_version 378882 (0.00086) [2022-07-09 19:01:06,378][25689] Fps is (10 sec: 5459.8, 60 sec: 5643.4, 300 sec: 5667.2). Total num frames: 387979264. Throughput: 0: 5814.1. Samples: 387983996. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:06,379][25689] Avg episode reward: [(0, '-47.673')] [2022-07-09 19:01:07,198][26022] Updated weights on worker 0-0, policy_version 378892 (0.00091) [2022-07-09 19:01:09,160][26022] Updated weights on worker 0-0, policy_version 378902 (0.00085) [2022-07-09 19:01:10,987][26022] Updated weights on worker 0-0, policy_version 378912 (0.00084) [2022-07-09 19:01:11,428][25689] Fps is (10 sec: 5360.6, 60 sec: 5658.4, 300 sec: 5667.9). Total num frames: 388007936. Throughput: 0: 5801.8. Samples: 388018052. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:11,429][25689] Avg episode reward: [(0, '-47.416')] [2022-07-09 19:01:12,806][26022] Updated weights on worker 0-0, policy_version 378922 (0.00086) [2022-07-09 19:01:14,716][26022] Updated weights on worker 0-0, policy_version 378932 (0.00091) [2022-07-09 19:01:16,178][26022] Updated weights on worker 0-0, policy_version 378942 (0.00089) [2022-07-09 19:01:16,479][25689] Fps is (10 sec: 5778.6, 60 sec: 5643.6, 300 sec: 5664.1). Total num frames: 388037632. Throughput: 0: 5810.1. Samples: 388034894. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:16,479][25689] Avg episode reward: [(0, '-47.344')] [2022-07-09 19:01:18,339][26022] Updated weights on worker 0-0, policy_version 378952 (0.00089) [2022-07-09 19:01:19,822][26022] Updated weights on worker 0-0, policy_version 378962 (0.00086) [2022-07-09 19:01:21,494][25689] Fps is (10 sec: 5696.9, 60 sec: 5665.1, 300 sec: 5667.7). Total num frames: 388065280. Throughput: 0: 5817.5. Samples: 388069034. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:21,495][25689] Avg episode reward: [(0, '-47.374')] [2022-07-09 19:01:21,727][26022] Updated weights on worker 0-0, policy_version 378972 (0.00088) [2022-07-09 19:01:23,572][26022] Updated weights on worker 0-0, policy_version 378982 (0.00093) [2022-07-09 19:01:24,538][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:01:24,554][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000378987_388082688.pth [2022-07-09 19:01:24,560][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000376992_386039808.pth [2022-07-09 19:01:25,238][26022] Updated weights on worker 0-0, policy_version 378992 (0.00089) [2022-07-09 19:01:26,523][25689] Fps is (10 sec: 5505.5, 60 sec: 5629.9, 300 sec: 5663.9). Total num frames: 388092928. Throughput: 0: 5929.8. Samples: 388103448. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:26,523][25689] Avg episode reward: [(0, '-46.995')] [2022-07-09 19:01:27,239][26022] Updated weights on worker 0-0, policy_version 379002 (0.00084) [2022-07-09 19:01:28,964][26022] Updated weights on worker 0-0, policy_version 379012 (0.00087) [2022-07-09 19:01:30,606][26022] Updated weights on worker 0-0, policy_version 379022 (0.00091) [2022-07-09 19:01:31,544][25689] Fps is (10 sec: 5807.8, 60 sec: 5653.2, 300 sec: 5667.8). Total num frames: 388123648. Throughput: 0: 5097.3. Samples: 388120588. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:31,545][25689] Avg episode reward: [(0, '-46.841')] [2022-07-09 19:01:32,601][26022] Updated weights on worker 0-0, policy_version 379032 (0.00087) [2022-07-09 19:01:34,229][26022] Updated weights on worker 0-0, policy_version 379042 (0.00082) [2022-07-09 19:01:36,241][26022] Updated weights on worker 0-0, policy_version 379052 (0.00086) [2022-07-09 19:01:36,688][25689] Fps is (10 sec: 5742.0, 60 sec: 5662.5, 300 sec: 5665.3). Total num frames: 388151296. Throughput: 0: 5926.1. Samples: 388154654. Policy #0 lag: (min: 0.0, avg: 7.5, max: 18.0) [2022-07-09 19:01:36,688][25689] Avg episode reward: [(0, '-46.500')] [2022-07-09 19:01:37,920][26022] Updated weights on worker 0-0, policy_version 379062 (0.00088) [2022-07-09 19:01:39,739][26022] Updated weights on worker 0-0, policy_version 379072 (0.00086) [2022-07-09 19:01:41,476][26022] Updated weights on worker 0-0, policy_version 379082 (0.00089) [2022-07-09 19:01:41,777][25689] Fps is (10 sec: 5704.3, 60 sec: 5655.1, 300 sec: 5667.1). Total num frames: 388182016. Throughput: 0: 5903.2. Samples: 388188764. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:01:41,777][25689] Avg episode reward: [(0, '-46.499')] [2022-07-09 19:01:43,226][26022] Updated weights on worker 0-0, policy_version 379092 (0.00084) [2022-07-09 19:01:45,093][26022] Updated weights on worker 0-0, policy_version 379102 (0.00094) [2022-07-09 19:01:46,784][25689] Fps is (10 sec: 5679.5, 60 sec: 5655.1, 300 sec: 5660.3). Total num frames: 388208640. Throughput: 0: 5060.0. Samples: 388205974. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:01:46,785][25689] Avg episode reward: [(0, '-46.986')] [2022-07-09 19:01:47,161][26022] Updated weights on worker 0-0, policy_version 379112 (0.00087) [2022-07-09 19:01:48,740][26022] Updated weights on worker 0-0, policy_version 379122 (0.00087) [2022-07-09 19:01:50,729][26022] Updated weights on worker 0-0, policy_version 379132 (0.00091) [2022-07-09 19:01:51,825][25689] Fps is (10 sec: 5604.6, 60 sec: 5651.9, 300 sec: 5661.5). Total num frames: 388238336. Throughput: 0: 5890.1. Samples: 388240046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:01:51,826][25689] Avg episode reward: [(0, '-47.271')] [2022-07-09 19:01:52,351][26022] Updated weights on worker 0-0, policy_version 379142 (0.00088) [2022-07-09 19:01:54,097][26022] Updated weights on worker 0-0, policy_version 379152 (0.00088) [2022-07-09 19:01:56,057][26022] Updated weights on worker 0-0, policy_version 379162 (0.00085) [2022-07-09 19:01:56,909][25689] Fps is (10 sec: 5765.1, 60 sec: 5655.3, 300 sec: 5663.5). Total num frames: 388267008. Throughput: 0: 5931.1. Samples: 388274588. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:01:56,909][25689] Avg episode reward: [(0, '-46.949')] [2022-07-09 19:01:57,508][26022] Updated weights on worker 0-0, policy_version 379172 (0.00089) [2022-07-09 19:01:59,519][26022] Updated weights on worker 0-0, policy_version 379182 (0.00895) [2022-07-09 19:02:01,412][26022] Updated weights on worker 0-0, policy_version 379192 (0.00083) [2022-07-09 19:02:01,946][25689] Fps is (10 sec: 5362.6, 60 sec: 5589.3, 300 sec: 5660.3). Total num frames: 388292608. Throughput: 0: 5106.6. Samples: 388291760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:01,946][25689] Avg episode reward: [(0, '-46.821')] [2022-07-09 19:02:03,363][26022] Updated weights on worker 0-0, policy_version 379202 (0.00079) [2022-07-09 19:02:05,438][26022] Updated weights on worker 0-0, policy_version 379212 (0.00088) [2022-07-09 19:02:06,810][26022] Updated weights on worker 0-0, policy_version 379222 (0.00100) [2022-07-09 19:02:06,969][25689] Fps is (10 sec: 5700.1, 60 sec: 5695.4, 300 sec: 5670.3). Total num frames: 388324352. Throughput: 0: 5854.7. Samples: 388324148. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:06,969][25689] Avg episode reward: [(0, '-46.839')] [2022-07-09 19:02:08,881][26022] Updated weights on worker 0-0, policy_version 379232 (0.00095) [2022-07-09 19:02:10,700][26022] Updated weights on worker 0-0, policy_version 379242 (0.00087) [2022-07-09 19:02:12,012][25689] Fps is (10 sec: 5900.5, 60 sec: 5679.2, 300 sec: 5663.8). Total num frames: 388352000. Throughput: 0: 5865.9. Samples: 388358456. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:12,013][25689] Avg episode reward: [(0, '-46.126')] [2022-07-09 19:02:12,314][26022] Updated weights on worker 0-0, policy_version 379252 (0.00084) [2022-07-09 19:02:14,448][26022] Updated weights on worker 0-0, policy_version 379262 (0.00079) [2022-07-09 19:02:15,644][26022] Updated weights on worker 0-0, policy_version 379272 (0.00092) [2022-07-09 19:02:17,062][25689] Fps is (10 sec: 5478.5, 60 sec: 5645.4, 300 sec: 5663.4). Total num frames: 388379648. Throughput: 0: 5016.6. Samples: 388375690. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:17,063][25689] Avg episode reward: [(0, '-46.943')] [2022-07-09 19:02:17,833][26022] Updated weights on worker 0-0, policy_version 379282 (0.00093) [2022-07-09 19:02:19,685][26022] Updated weights on worker 0-0, policy_version 379292 (0.00089) [2022-07-09 19:02:21,215][26022] Updated weights on worker 0-0, policy_version 379302 (0.00082) [2022-07-09 19:02:22,078][25689] Fps is (10 sec: 5696.4, 60 sec: 5679.1, 300 sec: 5667.3). Total num frames: 388409344. Throughput: 0: 5870.0. Samples: 388409938. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:22,091][25689] Avg episode reward: [(0, '-47.151')] [2022-07-09 19:02:23,310][26022] Updated weights on worker 0-0, policy_version 379312 (0.00091) [2022-07-09 19:02:24,983][26022] Updated weights on worker 0-0, policy_version 379322 (0.00090) [2022-07-09 19:02:26,595][26022] Updated weights on worker 0-0, policy_version 379332 (0.00097) [2022-07-09 19:02:27,123][25689] Fps is (10 sec: 5801.8, 60 sec: 5694.5, 300 sec: 5663.3). Total num frames: 388438016. Throughput: 0: 5969.6. Samples: 388444458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:27,123][25689] Avg episode reward: [(0, '-47.364')] [2022-07-09 19:02:28,572][26022] Updated weights on worker 0-0, policy_version 379342 (0.00079) [2022-07-09 19:02:30,160][26022] Updated weights on worker 0-0, policy_version 379352 (0.00086) [2022-07-09 19:02:32,128][25689] Fps is (10 sec: 5604.1, 60 sec: 5645.3, 300 sec: 5661.0). Total num frames: 388465664. Throughput: 0: 5124.8. Samples: 388461550. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:32,129][25689] Avg episode reward: [(0, '-47.248')] [2022-07-09 19:02:32,163][26022] Updated weights on worker 0-0, policy_version 379362 (0.00095) [2022-07-09 19:02:33,749][26022] Updated weights on worker 0-0, policy_version 379372 (0.00088) [2022-07-09 19:02:35,740][26022] Updated weights on worker 0-0, policy_version 379382 (0.00090) [2022-07-09 19:02:37,241][25689] Fps is (10 sec: 5667.3, 60 sec: 5682.0, 300 sec: 5662.3). Total num frames: 388495360. Throughput: 0: 5956.0. Samples: 388495874. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:37,241][25689] Avg episode reward: [(0, '-46.901')] [2022-07-09 19:02:37,494][26022] Updated weights on worker 0-0, policy_version 379392 (0.00088) [2022-07-09 19:02:39,312][26022] Updated weights on worker 0-0, policy_version 379402 (0.00092) [2022-07-09 19:02:40,996][26022] Updated weights on worker 0-0, policy_version 379412 (0.00081) [2022-07-09 19:02:42,301][25689] Fps is (10 sec: 5737.3, 60 sec: 5650.8, 300 sec: 5658.0). Total num frames: 388524032. Throughput: 0: 5952.3. Samples: 388530312. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:42,302][25689] Avg episode reward: [(0, '-47.152')] [2022-07-09 19:02:42,855][26022] Updated weights on worker 0-0, policy_version 379422 (0.00086) [2022-07-09 19:02:44,500][26022] Updated weights on worker 0-0, policy_version 379432 (0.00619) [2022-07-09 19:02:46,418][26022] Updated weights on worker 0-0, policy_version 379442 (0.00085) [2022-07-09 19:02:47,305][25689] Fps is (10 sec: 5799.7, 60 sec: 5702.0, 300 sec: 5665.5). Total num frames: 388553728. Throughput: 0: 5109.3. Samples: 388547576. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:47,306][25689] Avg episode reward: [(0, '-47.394')] [2022-07-09 19:02:48,201][26022] Updated weights on worker 0-0, policy_version 379452 (0.00082) [2022-07-09 19:02:49,887][26022] Updated weights on worker 0-0, policy_version 379462 (0.00080) [2022-07-09 19:02:52,004][26022] Updated weights on worker 0-0, policy_version 379472 (0.00088) [2022-07-09 19:02:52,325][25689] Fps is (10 sec: 5618.7, 60 sec: 5653.2, 300 sec: 5653.0). Total num frames: 388580352. Throughput: 0: 5963.0. Samples: 388581986. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:52,326][25689] Avg episode reward: [(0, '-46.540')] [2022-07-09 19:02:53,339][26022] Updated weights on worker 0-0, policy_version 379482 (0.00083) [2022-07-09 19:02:55,444][26022] Updated weights on worker 0-0, policy_version 379492 (0.00099) [2022-07-09 19:02:57,012][26022] Updated weights on worker 0-0, policy_version 379502 (0.00086) [2022-07-09 19:02:57,414][25689] Fps is (10 sec: 5571.0, 60 sec: 5669.6, 300 sec: 5655.3). Total num frames: 388610048. Throughput: 0: 5977.3. Samples: 388616456. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:02:57,415][25689] Avg episode reward: [(0, '-47.091')] [2022-07-09 19:02:58,839][26022] Updated weights on worker 0-0, policy_version 379512 (0.00088) [2022-07-09 19:03:00,924][26022] Updated weights on worker 0-0, policy_version 379522 (0.00082) [2022-07-09 19:03:02,503][25689] Fps is (10 sec: 5735.0, 60 sec: 5715.5, 300 sec: 5664.2). Total num frames: 388638720. Throughput: 0: 5102.3. Samples: 388633384. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:02,503][25689] Avg episode reward: [(0, '-47.517')] [2022-07-09 19:03:02,798][26022] Updated weights on worker 0-0, policy_version 379532 (0.00086) [2022-07-09 19:03:04,728][26022] Updated weights on worker 0-0, policy_version 379542 (0.00085) [2022-07-09 19:03:06,557][26022] Updated weights on worker 0-0, policy_version 379552 (0.00083) [2022-07-09 19:03:07,581][25689] Fps is (10 sec: 5640.1, 60 sec: 5659.6, 300 sec: 5659.3). Total num frames: 388667392. Throughput: 0: 5829.5. Samples: 388665778. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:07,582][25689] Avg episode reward: [(0, '-48.293')] [2022-07-09 19:03:08,174][26022] Updated weights on worker 0-0, policy_version 379562 (0.00082) [2022-07-09 19:03:09,946][26022] Updated weights on worker 0-0, policy_version 379572 (0.00086) [2022-07-09 19:03:11,800][26022] Updated weights on worker 0-0, policy_version 379582 (0.00091) [2022-07-09 19:03:12,622][25689] Fps is (10 sec: 5565.6, 60 sec: 5659.8, 300 sec: 5663.1). Total num frames: 388695040. Throughput: 0: 5814.3. Samples: 388699998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:12,623][25689] Avg episode reward: [(0, '-48.007')] [2022-07-09 19:03:13,471][26022] Updated weights on worker 0-0, policy_version 379592 (0.00087) [2022-07-09 19:03:15,475][26022] Updated weights on worker 0-0, policy_version 379602 (0.00085) [2022-07-09 19:03:17,054][26022] Updated weights on worker 0-0, policy_version 379612 (0.00087) [2022-07-09 19:03:17,779][25689] Fps is (10 sec: 5623.2, 60 sec: 5683.6, 300 sec: 5658.6). Total num frames: 388724736. Throughput: 0: 5783.4. Samples: 388734234. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:17,780][25689] Avg episode reward: [(0, '-49.146')] [2022-07-09 19:03:18,987][26022] Updated weights on worker 0-0, policy_version 379622 (0.00090) [2022-07-09 19:03:21,056][26022] Updated weights on worker 0-0, policy_version 379632 (0.00632) [2022-07-09 19:03:22,605][26022] Updated weights on worker 0-0, policy_version 379642 (0.00092) [2022-07-09 19:03:22,836][25689] Fps is (10 sec: 5814.5, 60 sec: 5679.7, 300 sec: 5667.9). Total num frames: 388754432. Throughput: 0: 5791.0. Samples: 388751136. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:22,837][25689] Avg episode reward: [(0, '-49.156')] [2022-07-09 19:03:24,539][26022] Updated weights on worker 0-0, policy_version 379652 (0.00086) [2022-07-09 19:03:24,633][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:03:24,645][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000379653_388764672.pth [2022-07-09 19:03:24,646][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000377658_386721792.pth [2022-07-09 19:03:26,149][26022] Updated weights on worker 0-0, policy_version 379662 (0.00086) [2022-07-09 19:03:27,918][25689] Fps is (10 sec: 5655.7, 60 sec: 5659.3, 300 sec: 5659.7). Total num frames: 388782080. Throughput: 0: 5872.2. Samples: 388785200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:27,919][25689] Avg episode reward: [(0, '-49.316')] [2022-07-09 19:03:28,119][26022] Updated weights on worker 0-0, policy_version 379672 (0.00090) [2022-07-09 19:03:29,874][26022] Updated weights on worker 0-0, policy_version 379682 (0.00083) [2022-07-09 19:03:31,631][26022] Updated weights on worker 0-0, policy_version 379692 (0.00093) [2022-07-09 19:03:32,944][25689] Fps is (10 sec: 5673.3, 60 sec: 5691.1, 300 sec: 5664.2). Total num frames: 388811776. Throughput: 0: 5879.9. Samples: 388819490. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:32,945][25689] Avg episode reward: [(0, '-48.546')] [2022-07-09 19:03:33,419][26022] Updated weights on worker 0-0, policy_version 379702 (0.00092) [2022-07-09 19:03:35,443][26022] Updated weights on worker 0-0, policy_version 379712 (0.00087) [2022-07-09 19:03:37,144][26022] Updated weights on worker 0-0, policy_version 379722 (0.00092) [2022-07-09 19:03:38,033][25689] Fps is (10 sec: 5669.2, 60 sec: 5659.6, 300 sec: 5659.4). Total num frames: 388839424. Throughput: 0: 5038.6. Samples: 388836286. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:38,034][25689] Avg episode reward: [(0, '-47.872')] [2022-07-09 19:03:38,942][26022] Updated weights on worker 0-0, policy_version 379732 (0.00086) [2022-07-09 19:03:40,640][26022] Updated weights on worker 0-0, policy_version 379742 (0.00087) [2022-07-09 19:03:42,680][26022] Updated weights on worker 0-0, policy_version 379752 (0.00093) [2022-07-09 19:03:43,081][25689] Fps is (10 sec: 5657.0, 60 sec: 5677.7, 300 sec: 5665.8). Total num frames: 388869120. Throughput: 0: 5881.1. Samples: 388870196. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:43,081][25689] Avg episode reward: [(0, '-48.143')] [2022-07-09 19:03:44,356][26022] Updated weights on worker 0-0, policy_version 379762 (0.00084) [2022-07-09 19:03:46,012][26022] Updated weights on worker 0-0, policy_version 379772 (0.00102) [2022-07-09 19:03:47,870][26022] Updated weights on worker 0-0, policy_version 379782 (0.00089) [2022-07-09 19:03:48,099][25689] Fps is (10 sec: 5697.3, 60 sec: 5642.7, 300 sec: 5658.8). Total num frames: 388896768. Throughput: 0: 5919.2. Samples: 388904650. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:48,099][25689] Avg episode reward: [(0, '-47.227')] [2022-07-09 19:03:49,619][26022] Updated weights on worker 0-0, policy_version 379792 (0.00074) [2022-07-09 19:03:51,630][26022] Updated weights on worker 0-0, policy_version 379802 (0.00099) [2022-07-09 19:03:53,127][25689] Fps is (10 sec: 5606.3, 60 sec: 5675.6, 300 sec: 5659.6). Total num frames: 388925440. Throughput: 0: 5063.5. Samples: 388921686. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:53,127][25689] Avg episode reward: [(0, '-47.734')] [2022-07-09 19:03:53,246][26022] Updated weights on worker 0-0, policy_version 379812 (0.00088) [2022-07-09 19:03:55,264][26022] Updated weights on worker 0-0, policy_version 379822 (0.00090) [2022-07-09 19:03:56,900][26022] Updated weights on worker 0-0, policy_version 379832 (0.00084) [2022-07-09 19:03:58,221][25689] Fps is (10 sec: 5664.9, 60 sec: 5658.3, 300 sec: 5652.7). Total num frames: 388954112. Throughput: 0: 5901.9. Samples: 388955434. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:03:58,222][25689] Avg episode reward: [(0, '-47.697')] [2022-07-09 19:03:58,882][26022] Updated weights on worker 0-0, policy_version 379842 (0.00093) [2022-07-09 19:04:00,609][26022] Updated weights on worker 0-0, policy_version 379852 (0.00085) [2022-07-09 19:04:02,692][26022] Updated weights on worker 0-0, policy_version 379862 (0.00091) [2022-07-09 19:04:03,234][25689] Fps is (10 sec: 5572.3, 60 sec: 5648.5, 300 sec: 5664.1). Total num frames: 388981760. Throughput: 0: 5826.5. Samples: 388987618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:04:03,234][25689] Avg episode reward: [(0, '-48.076')] [2022-07-09 19:04:04,603][26022] Updated weights on worker 0-0, policy_version 379872 (0.00091) [2022-07-09 19:04:06,356][26022] Updated weights on worker 0-0, policy_version 379882 (0.00093) [2022-07-09 19:04:08,182][26022] Updated weights on worker 0-0, policy_version 379892 (0.00087) [2022-07-09 19:04:08,253][25689] Fps is (10 sec: 5614.0, 60 sec: 5654.0, 300 sec: 5657.1). Total num frames: 389010432. Throughput: 0: 4967.0. Samples: 389004760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:04:08,254][25689] Avg episode reward: [(0, '-46.779')] [2022-07-09 19:04:09,768][26022] Updated weights on worker 0-0, policy_version 379902 (0.00080) [2022-07-09 19:04:11,803][26022] Updated weights on worker 0-0, policy_version 379912 (0.00088) [2022-07-09 19:04:13,274][25689] Fps is (10 sec: 5609.3, 60 sec: 5655.8, 300 sec: 5658.2). Total num frames: 389038080. Throughput: 0: 5833.7. Samples: 389039222. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:04:13,276][25689] Avg episode reward: [(0, '-46.697')] [2022-07-09 19:04:13,395][26022] Updated weights on worker 0-0, policy_version 379922 (0.00086) [2022-07-09 19:04:15,333][26022] Updated weights on worker 0-0, policy_version 379932 (0.00311) [2022-07-09 19:04:17,128][26022] Updated weights on worker 0-0, policy_version 379942 (0.00092) [2022-07-09 19:04:18,369][25689] Fps is (10 sec: 5567.9, 60 sec: 5644.8, 300 sec: 5656.7). Total num frames: 389066752. Throughput: 0: 5845.8. Samples: 389073210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:18,370][25689] Avg episode reward: [(0, '-47.276')] [2022-07-09 19:04:19,011][26022] Updated weights on worker 0-0, policy_version 379952 (0.00099) [2022-07-09 19:04:20,750][26022] Updated weights on worker 0-0, policy_version 379962 (0.00119) [2022-07-09 19:04:22,726][26022] Updated weights on worker 0-0, policy_version 379972 (0.00095) [2022-07-09 19:04:23,372][25689] Fps is (10 sec: 5679.0, 60 sec: 5632.9, 300 sec: 5656.7). Total num frames: 389095424. Throughput: 0: 5086.0. Samples: 389090040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:23,372][25689] Avg episode reward: [(0, '-46.239')] [2022-07-09 19:04:24,440][26022] Updated weights on worker 0-0, policy_version 379982 (0.00084) [2022-07-09 19:04:26,233][26022] Updated weights on worker 0-0, policy_version 379992 (0.00614) [2022-07-09 19:04:28,000][26022] Updated weights on worker 0-0, policy_version 380002 (0.00389) [2022-07-09 19:04:28,389][25689] Fps is (10 sec: 5722.6, 60 sec: 5655.9, 300 sec: 5664.0). Total num frames: 389124096. Throughput: 0: 5949.9. Samples: 389124566. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:28,390][25689] Avg episode reward: [(0, '-46.701')] [2022-07-09 19:04:29,718][26022] Updated weights on worker 0-0, policy_version 380012 (0.00088) [2022-07-09 19:04:31,607][26022] Updated weights on worker 0-0, policy_version 380022 (0.00085) [2022-07-09 19:04:33,414][25689] Fps is (10 sec: 5608.6, 60 sec: 5622.1, 300 sec: 5650.6). Total num frames: 389151744. Throughput: 0: 5923.5. Samples: 389158518. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:33,415][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 19:04:33,422][26022] Updated weights on worker 0-0, policy_version 380032 (0.00076) [2022-07-09 19:04:35,052][26022] Updated weights on worker 0-0, policy_version 380042 (0.00092) [2022-07-09 19:04:36,959][26022] Updated weights on worker 0-0, policy_version 380052 (0.00087) [2022-07-09 19:04:38,507][25689] Fps is (10 sec: 5869.9, 60 sec: 5689.5, 300 sec: 5662.9). Total num frames: 389183488. Throughput: 0: 5086.6. Samples: 389175648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:38,508][25689] Avg episode reward: [(0, '-47.859')] [2022-07-09 19:04:38,509][26022] Updated weights on worker 0-0, policy_version 380062 (0.00090) [2022-07-09 19:04:40,559][26022] Updated weights on worker 0-0, policy_version 380072 (0.00087) [2022-07-09 19:04:42,491][26022] Updated weights on worker 0-0, policy_version 380082 (0.00087) [2022-07-09 19:04:43,543][25689] Fps is (10 sec: 5762.3, 60 sec: 5639.8, 300 sec: 5653.8). Total num frames: 389210112. Throughput: 0: 5955.6. Samples: 389210170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:43,545][25689] Avg episode reward: [(0, '-48.233')] [2022-07-09 19:04:44,066][26022] Updated weights on worker 0-0, policy_version 380092 (0.00085) [2022-07-09 19:04:45,962][26022] Updated weights on worker 0-0, policy_version 380102 (0.00087) [2022-07-09 19:04:47,630][26022] Updated weights on worker 0-0, policy_version 380112 (0.00096) [2022-07-09 19:04:48,617][25689] Fps is (10 sec: 5469.7, 60 sec: 5651.5, 300 sec: 5656.0). Total num frames: 389238784. Throughput: 0: 5903.9. Samples: 389243988. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:48,619][25689] Avg episode reward: [(0, '-48.468')] [2022-07-09 19:04:49,576][26022] Updated weights on worker 0-0, policy_version 380122 (0.00089) [2022-07-09 19:04:51,416][26022] Updated weights on worker 0-0, policy_version 380132 (0.00089) [2022-07-09 19:04:53,345][26022] Updated weights on worker 0-0, policy_version 380142 (0.00084) [2022-07-09 19:04:53,634][25689] Fps is (10 sec: 5581.4, 60 sec: 5635.6, 300 sec: 5653.6). Total num frames: 389266432. Throughput: 0: 5045.6. Samples: 389260540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:53,635][25689] Avg episode reward: [(0, '-48.526')] [2022-07-09 19:04:55,126][26022] Updated weights on worker 0-0, policy_version 380152 (0.00086) [2022-07-09 19:04:57,112][26022] Updated weights on worker 0-0, policy_version 380162 (0.00094) [2022-07-09 19:04:58,568][26022] Updated weights on worker 0-0, policy_version 380172 (0.00088) [2022-07-09 19:04:58,712][25689] Fps is (10 sec: 5781.6, 60 sec: 5671.0, 300 sec: 5656.7). Total num frames: 389297152. Throughput: 0: 5873.4. Samples: 389294320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:04:58,713][25689] Avg episode reward: [(0, '-48.393')] [2022-07-09 19:05:00,761][26022] Updated weights on worker 0-0, policy_version 380182 (0.00090) [2022-07-09 19:05:02,419][26022] Updated weights on worker 0-0, policy_version 380192 (0.00084) [2022-07-09 19:05:03,765][25689] Fps is (10 sec: 5457.8, 60 sec: 5616.4, 300 sec: 5652.8). Total num frames: 389321728. Throughput: 0: 5726.6. Samples: 389325976. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:03,766][25689] Avg episode reward: [(0, '-48.278')] [2022-07-09 19:05:04,647][26022] Updated weights on worker 0-0, policy_version 380202 (0.00088) [2022-07-09 19:05:06,136][26022] Updated weights on worker 0-0, policy_version 380212 (0.00090) [2022-07-09 19:05:08,235][26022] Updated weights on worker 0-0, policy_version 380222 (0.00093) [2022-07-09 19:05:08,807][25689] Fps is (10 sec: 5173.5, 60 sec: 5597.5, 300 sec: 5652.6). Total num frames: 389349376. Throughput: 0: 5765.2. Samples: 389360388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:08,807][25689] Avg episode reward: [(0, '-47.638')] [2022-07-09 19:05:09,821][26022] Updated weights on worker 0-0, policy_version 380232 (0.00088) [2022-07-09 19:05:11,878][26022] Updated weights on worker 0-0, policy_version 380242 (0.00089) [2022-07-09 19:05:13,344][26022] Updated weights on worker 0-0, policy_version 380252 (0.00085) [2022-07-09 19:05:13,906][25689] Fps is (10 sec: 5755.7, 60 sec: 5640.9, 300 sec: 5652.1). Total num frames: 389380096. Throughput: 0: 5774.7. Samples: 389377608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:13,907][25689] Avg episode reward: [(0, '-47.500')] [2022-07-09 19:05:15,427][26022] Updated weights on worker 0-0, policy_version 380262 (0.00081) [2022-07-09 19:05:17,105][26022] Updated weights on worker 0-0, policy_version 380272 (0.00086) [2022-07-09 19:05:18,951][25689] Fps is (10 sec: 5754.0, 60 sec: 5628.6, 300 sec: 5655.9). Total num frames: 389407744. Throughput: 0: 5789.5. Samples: 389411492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:18,951][25689] Avg episode reward: [(0, '-47.206')] [2022-07-09 19:05:18,965][26022] Updated weights on worker 0-0, policy_version 380282 (0.00109) [2022-07-09 19:05:20,940][26022] Updated weights on worker 0-0, policy_version 380292 (0.00094) [2022-07-09 19:05:22,767][26022] Updated weights on worker 0-0, policy_version 380302 (0.00366) [2022-07-09 19:05:23,954][25689] Fps is (10 sec: 5503.4, 60 sec: 5611.7, 300 sec: 5649.2). Total num frames: 389435392. Throughput: 0: 5906.6. Samples: 389445220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:23,954][25689] Avg episode reward: [(0, '-48.118')] [2022-07-09 19:05:24,422][26022] Updated weights on worker 0-0, policy_version 380312 (0.00087) [2022-07-09 19:05:24,724][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:05:24,738][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000380314_389441536.pth [2022-07-09 19:05:24,738][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000378324_387403776.pth [2022-07-09 19:05:26,508][26022] Updated weights on worker 0-0, policy_version 380322 (0.00086) [2022-07-09 19:05:27,866][26022] Updated weights on worker 0-0, policy_version 380332 (0.00085) [2022-07-09 19:05:28,972][25689] Fps is (10 sec: 5722.2, 60 sec: 5628.5, 300 sec: 5650.6). Total num frames: 389465088. Throughput: 0: 5050.3. Samples: 389462228. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:28,972][25689] Avg episode reward: [(0, '-47.568')] [2022-07-09 19:05:30,072][26022] Updated weights on worker 0-0, policy_version 380342 (0.00092) [2022-07-09 19:05:31,598][26022] Updated weights on worker 0-0, policy_version 380352 (0.00088) [2022-07-09 19:05:33,634][26022] Updated weights on worker 0-0, policy_version 380362 (0.00092) [2022-07-09 19:05:33,980][25689] Fps is (10 sec: 5719.2, 60 sec: 5630.0, 300 sec: 5655.0). Total num frames: 389492736. Throughput: 0: 5918.3. Samples: 389496412. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:33,981][25689] Avg episode reward: [(0, '-47.037')] [2022-07-09 19:05:35,125][26022] Updated weights on worker 0-0, policy_version 380372 (0.00051) [2022-07-09 19:05:37,223][26022] Updated weights on worker 0-0, policy_version 380382 (0.00092) [2022-07-09 19:05:38,822][26022] Updated weights on worker 0-0, policy_version 380392 (0.00090) [2022-07-09 19:05:39,107][25689] Fps is (10 sec: 5658.0, 60 sec: 5593.2, 300 sec: 5649.4). Total num frames: 389522432. Throughput: 0: 5904.1. Samples: 389530496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:39,108][25689] Avg episode reward: [(0, '-45.874')] [2022-07-09 19:05:40,684][26022] Updated weights on worker 0-0, policy_version 380402 (0.00088) [2022-07-09 19:05:42,235][26022] Updated weights on worker 0-0, policy_version 380412 (0.00106) [2022-07-09 19:05:44,122][25689] Fps is (10 sec: 5654.4, 60 sec: 5612.0, 300 sec: 5652.7). Total num frames: 389550080. Throughput: 0: 5088.5. Samples: 389547846. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:44,122][25689] Avg episode reward: [(0, '-45.780')] [2022-07-09 19:05:44,287][26022] Updated weights on worker 0-0, policy_version 380422 (0.00095) [2022-07-09 19:05:46,139][26022] Updated weights on worker 0-0, policy_version 380432 (0.00072) [2022-07-09 19:05:47,938][26022] Updated weights on worker 0-0, policy_version 380442 (0.00090) [2022-07-09 19:05:49,128][25689] Fps is (10 sec: 5620.3, 60 sec: 5618.3, 300 sec: 5649.2). Total num frames: 389578752. Throughput: 0: 5933.9. Samples: 389581830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:49,128][25689] Avg episode reward: [(0, '-45.499')] [2022-07-09 19:05:49,615][26022] Updated weights on worker 0-0, policy_version 380452 (0.00089) [2022-07-09 19:05:51,644][26022] Updated weights on worker 0-0, policy_version 380462 (0.00090) [2022-07-09 19:05:53,382][26022] Updated weights on worker 0-0, policy_version 380472 (0.00090) [2022-07-09 19:05:54,134][25689] Fps is (10 sec: 5727.4, 60 sec: 5636.2, 300 sec: 5651.4). Total num frames: 389607424. Throughput: 0: 5924.1. Samples: 389615804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:54,134][25689] Avg episode reward: [(0, '-46.469')] [2022-07-09 19:05:55,348][26022] Updated weights on worker 0-0, policy_version 380482 (0.00615) [2022-07-09 19:05:56,825][26022] Updated weights on worker 0-0, policy_version 380492 (0.00084) [2022-07-09 19:05:58,953][26022] Updated weights on worker 0-0, policy_version 380502 (0.00086) [2022-07-09 19:05:59,232][25689] Fps is (10 sec: 5675.1, 60 sec: 5600.5, 300 sec: 5647.1). Total num frames: 389636096. Throughput: 0: 5087.5. Samples: 389632884. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:05:59,233][25689] Avg episode reward: [(0, '-47.091')] [2022-07-09 19:06:00,462][26022] Updated weights on worker 0-0, policy_version 380512 (0.00089) [2022-07-09 19:06:02,795][26022] Updated weights on worker 0-0, policy_version 380522 (0.00095) [2022-07-09 19:06:04,309][25689] Fps is (10 sec: 5535.2, 60 sec: 5649.1, 300 sec: 5653.9). Total num frames: 389663744. Throughput: 0: 5793.7. Samples: 389664804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:04,309][25689] Avg episode reward: [(0, '-47.821')] [2022-07-09 19:06:04,436][26022] Updated weights on worker 0-0, policy_version 380532 (0.00086) [2022-07-09 19:06:06,385][26022] Updated weights on worker 0-0, policy_version 380542 (0.00053) [2022-07-09 19:06:08,280][26022] Updated weights on worker 0-0, policy_version 380552 (0.00088) [2022-07-09 19:06:09,323][25689] Fps is (10 sec: 5479.8, 60 sec: 5651.6, 300 sec: 5651.2). Total num frames: 389691392. Throughput: 0: 5801.9. Samples: 389699002. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:09,325][25689] Avg episode reward: [(0, '-48.240')] [2022-07-09 19:06:09,925][26022] Updated weights on worker 0-0, policy_version 380562 (0.00086) [2022-07-09 19:06:11,759][26022] Updated weights on worker 0-0, policy_version 380572 (0.00089) [2022-07-09 19:06:13,608][26022] Updated weights on worker 0-0, policy_version 380582 (0.00087) [2022-07-09 19:06:14,359][25689] Fps is (10 sec: 5501.9, 60 sec: 5606.7, 300 sec: 5644.6). Total num frames: 389719040. Throughput: 0: 4939.5. Samples: 389715706. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:14,360][25689] Avg episode reward: [(0, '-48.079')] [2022-07-09 19:06:15,275][26022] Updated weights on worker 0-0, policy_version 380592 (0.00082) [2022-07-09 19:06:17,265][26022] Updated weights on worker 0-0, policy_version 380602 (0.00086) [2022-07-09 19:06:18,775][26022] Updated weights on worker 0-0, policy_version 380612 (0.00083) [2022-07-09 19:06:19,475][25689] Fps is (10 sec: 5648.6, 60 sec: 5634.0, 300 sec: 5649.5). Total num frames: 389748736. Throughput: 0: 5779.8. Samples: 389749884. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:19,476][25689] Avg episode reward: [(0, '-47.461')] [2022-07-09 19:06:20,855][26022] Updated weights on worker 0-0, policy_version 380622 (0.00104) [2022-07-09 19:06:22,410][26022] Updated weights on worker 0-0, policy_version 380632 (0.00090) [2022-07-09 19:06:24,458][26022] Updated weights on worker 0-0, policy_version 380642 (0.00080) [2022-07-09 19:06:24,559][25689] Fps is (10 sec: 5722.5, 60 sec: 5643.4, 300 sec: 5651.9). Total num frames: 389777408. Throughput: 0: 5877.0. Samples: 389783814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:24,559][25689] Avg episode reward: [(0, '-46.399')] [2022-07-09 19:06:26,279][26022] Updated weights on worker 0-0, policy_version 380652 (0.00078) [2022-07-09 19:06:27,990][26022] Updated weights on worker 0-0, policy_version 380662 (0.00092) [2022-07-09 19:06:29,620][25689] Fps is (10 sec: 5652.7, 60 sec: 5622.5, 300 sec: 5644.3). Total num frames: 389806080. Throughput: 0: 5023.7. Samples: 389800966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:29,620][25689] Avg episode reward: [(0, '-46.927')] [2022-07-09 19:06:29,739][26022] Updated weights on worker 0-0, policy_version 380672 (0.00083) [2022-07-09 19:06:31,615][26022] Updated weights on worker 0-0, policy_version 380682 (0.00088) [2022-07-09 19:06:33,371][26022] Updated weights on worker 0-0, policy_version 380692 (0.00099) [2022-07-09 19:06:34,643][25689] Fps is (10 sec: 5787.9, 60 sec: 5654.9, 300 sec: 5653.5). Total num frames: 389835776. Throughput: 0: 5902.4. Samples: 389835432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:34,644][25689] Avg episode reward: [(0, '-46.322')] [2022-07-09 19:06:35,283][26022] Updated weights on worker 0-0, policy_version 380702 (0.00094) [2022-07-09 19:06:37,010][26022] Updated weights on worker 0-0, policy_version 380712 (0.00100) [2022-07-09 19:06:38,991][26022] Updated weights on worker 0-0, policy_version 380722 (0.00093) [2022-07-09 19:06:39,709][25689] Fps is (10 sec: 5683.8, 60 sec: 5626.8, 300 sec: 5643.6). Total num frames: 389863424. Throughput: 0: 5915.7. Samples: 389869580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:39,709][25689] Avg episode reward: [(0, '-46.611')] [2022-07-09 19:06:40,538][26022] Updated weights on worker 0-0, policy_version 380732 (0.00088) [2022-07-09 19:06:42,624][26022] Updated weights on worker 0-0, policy_version 380742 (0.00097) [2022-07-09 19:06:43,942][26022] Updated weights on worker 0-0, policy_version 380752 (0.00085) [2022-07-09 19:06:44,719][25689] Fps is (10 sec: 5589.7, 60 sec: 5644.1, 300 sec: 5650.4). Total num frames: 389892096. Throughput: 0: 5100.8. Samples: 389886646. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:44,719][25689] Avg episode reward: [(0, '-46.920')] [2022-07-09 19:06:46,064][26022] Updated weights on worker 0-0, policy_version 380762 (0.00090) [2022-07-09 19:06:47,801][26022] Updated weights on worker 0-0, policy_version 380772 (0.00086) [2022-07-09 19:06:49,680][26022] Updated weights on worker 0-0, policy_version 380782 (0.00098) [2022-07-09 19:06:49,736][25689] Fps is (10 sec: 5718.6, 60 sec: 5643.0, 300 sec: 5647.4). Total num frames: 389920768. Throughput: 0: 5955.3. Samples: 389920766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-09 19:06:49,737][25689] Avg episode reward: [(0, '-46.953')] [2022-07-09 19:06:51,592][26022] Updated weights on worker 0-0, policy_version 380792 (0.00089) [2022-07-09 19:06:53,300][26022] Updated weights on worker 0-0, policy_version 380802 (0.00083) [2022-07-09 19:06:54,767][25689] Fps is (10 sec: 5503.2, 60 sec: 5607.0, 300 sec: 5641.6). Total num frames: 389947392. Throughput: 0: 5927.6. Samples: 389954716. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:06:54,767][25689] Avg episode reward: [(0, '-47.617')] [2022-07-09 19:06:55,056][26022] Updated weights on worker 0-0, policy_version 380812 (0.00085) [2022-07-09 19:06:56,893][26022] Updated weights on worker 0-0, policy_version 380822 (0.00084) [2022-07-09 19:06:58,712][26022] Updated weights on worker 0-0, policy_version 380832 (0.00089) [2022-07-09 19:06:59,859][25689] Fps is (10 sec: 5664.7, 60 sec: 5641.3, 300 sec: 5657.7). Total num frames: 389978112. Throughput: 0: 5067.2. Samples: 389971688. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:06:59,860][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 19:07:00,479][26022] Updated weights on worker 0-0, policy_version 380842 (0.00081) [2022-07-09 19:07:02,733][26022] Updated weights on worker 0-0, policy_version 380852 (0.00091) [2022-07-09 19:07:04,487][26022] Updated weights on worker 0-0, policy_version 380862 (0.00090) [2022-07-09 19:07:04,903][25689] Fps is (10 sec: 5657.4, 60 sec: 5627.5, 300 sec: 5640.1). Total num frames: 390004736. Throughput: 0: 5813.4. Samples: 390003984. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:04,903][25689] Avg episode reward: [(0, '-47.150')] [2022-07-09 19:07:06,200][26022] Updated weights on worker 0-0, policy_version 380872 (0.00094) [2022-07-09 19:07:08,200][26022] Updated weights on worker 0-0, policy_version 380882 (0.00089) [2022-07-09 19:07:09,799][26022] Updated weights on worker 0-0, policy_version 380892 (0.00087) [2022-07-09 19:07:09,907][25689] Fps is (10 sec: 5605.5, 60 sec: 5662.3, 300 sec: 5647.7). Total num frames: 390034432. Throughput: 0: 5830.2. Samples: 390038362. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:09,907][25689] Avg episode reward: [(0, '-46.729')] [2022-07-09 19:07:11,844][26022] Updated weights on worker 0-0, policy_version 380902 (0.00085) [2022-07-09 19:07:13,355][26022] Updated weights on worker 0-0, policy_version 380912 (0.00084) [2022-07-09 19:07:14,978][25689] Fps is (10 sec: 5691.9, 60 sec: 5659.0, 300 sec: 5647.4). Total num frames: 390062080. Throughput: 0: 4979.0. Samples: 390055346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:14,978][25689] Avg episode reward: [(0, '-46.338')] [2022-07-09 19:07:15,341][26022] Updated weights on worker 0-0, policy_version 380922 (0.00082) [2022-07-09 19:07:17,056][26022] Updated weights on worker 0-0, policy_version 380932 (0.00088) [2022-07-09 19:07:18,853][26022] Updated weights on worker 0-0, policy_version 380942 (0.00088) [2022-07-09 19:07:20,031][25689] Fps is (10 sec: 5562.9, 60 sec: 5648.0, 300 sec: 5643.2). Total num frames: 390090752. Throughput: 0: 5852.6. Samples: 390089744. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:20,031][25689] Avg episode reward: [(0, '-46.465')] [2022-07-09 19:07:20,615][26022] Updated weights on worker 0-0, policy_version 380952 (0.00086) [2022-07-09 19:07:22,423][26022] Updated weights on worker 0-0, policy_version 380962 (0.00085) [2022-07-09 19:07:24,122][26022] Updated weights on worker 0-0, policy_version 380972 (0.00083) [2022-07-09 19:07:25,040][25689] Fps is (10 sec: 5698.9, 60 sec: 5654.9, 300 sec: 5643.9). Total num frames: 390119424. Throughput: 0: 5947.9. Samples: 390123758. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:25,040][25689] Avg episode reward: [(0, '-46.820')] [2022-07-09 19:07:25,056][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:07:25,063][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000380977_390120448.pth [2022-07-09 19:07:25,063][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000378987_388082688.pth [2022-07-09 19:07:26,153][26022] Updated weights on worker 0-0, policy_version 380982 (0.00092) [2022-07-09 19:07:27,719][26022] Updated weights on worker 0-0, policy_version 380992 (0.00087) [2022-07-09 19:07:29,777][26022] Updated weights on worker 0-0, policy_version 381002 (0.00083) [2022-07-09 19:07:30,111][25689] Fps is (10 sec: 5587.4, 60 sec: 5637.1, 300 sec: 5642.6). Total num frames: 390147072. Throughput: 0: 5067.5. Samples: 390140746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:30,111][25689] Avg episode reward: [(0, '-45.981')] [2022-07-09 19:07:31,379][26022] Updated weights on worker 0-0, policy_version 381012 (0.00078) [2022-07-09 19:07:33,130][26022] Updated weights on worker 0-0, policy_version 381022 (0.00090) [2022-07-09 19:07:35,068][26022] Updated weights on worker 0-0, policy_version 381032 (0.00087) [2022-07-09 19:07:35,145][25689] Fps is (10 sec: 5674.9, 60 sec: 5636.1, 300 sec: 5644.1). Total num frames: 390176768. Throughput: 0: 5938.2. Samples: 390175102. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:35,145][25689] Avg episode reward: [(0, '-46.647')] [2022-07-09 19:07:36,843][26022] Updated weights on worker 0-0, policy_version 381042 (0.00078) [2022-07-09 19:07:38,615][26022] Updated weights on worker 0-0, policy_version 381052 (0.00089) [2022-07-09 19:07:40,277][25689] Fps is (10 sec: 5841.8, 60 sec: 5663.7, 300 sec: 5646.2). Total num frames: 390206464. Throughput: 0: 5891.7. Samples: 390209030. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:40,278][25689] Avg episode reward: [(0, '-46.803')] [2022-07-09 19:07:40,533][26022] Updated weights on worker 0-0, policy_version 381062 (0.00101) [2022-07-09 19:07:42,159][26022] Updated weights on worker 0-0, policy_version 381072 (0.00085) [2022-07-09 19:07:44,163][26022] Updated weights on worker 0-0, policy_version 381082 (0.00088) [2022-07-09 19:07:45,302][25689] Fps is (10 sec: 5645.6, 60 sec: 5645.4, 300 sec: 5638.9). Total num frames: 390234112. Throughput: 0: 5889.2. Samples: 390243086. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:45,303][25689] Avg episode reward: [(0, '-46.193')] [2022-07-09 19:07:45,856][26022] Updated weights on worker 0-0, policy_version 381092 (0.00101) [2022-07-09 19:07:47,620][26022] Updated weights on worker 0-0, policy_version 381102 (0.00091) [2022-07-09 19:07:49,608][26022] Updated weights on worker 0-0, policy_version 381112 (0.00090) [2022-07-09 19:07:50,310][25689] Fps is (10 sec: 5613.6, 60 sec: 5646.3, 300 sec: 5646.0). Total num frames: 390262784. Throughput: 0: 5920.9. Samples: 390260346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:50,311][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 19:07:51,358][26022] Updated weights on worker 0-0, policy_version 381122 (0.00086) [2022-07-09 19:07:53,261][26022] Updated weights on worker 0-0, policy_version 381132 (0.00086) [2022-07-09 19:07:54,904][26022] Updated weights on worker 0-0, policy_version 381142 (0.00093) [2022-07-09 19:07:55,329][25689] Fps is (10 sec: 5718.9, 60 sec: 5681.1, 300 sec: 5643.9). Total num frames: 390291456. Throughput: 0: 5902.3. Samples: 390294236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:07:55,330][25689] Avg episode reward: [(0, '-46.470')] [2022-07-09 19:07:56,986][26022] Updated weights on worker 0-0, policy_version 381152 (0.00082) [2022-07-09 19:07:58,483][26022] Updated weights on worker 0-0, policy_version 381162 (0.00087) [2022-07-09 19:08:00,442][25689] Fps is (10 sec: 5558.7, 60 sec: 5628.5, 300 sec: 5640.0). Total num frames: 390319104. Throughput: 0: 5907.6. Samples: 390328156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:00,443][25689] Avg episode reward: [(0, '-47.414')] [2022-07-09 19:08:00,615][26022] Updated weights on worker 0-0, policy_version 381172 (0.00085) [2022-07-09 19:08:02,312][26022] Updated weights on worker 0-0, policy_version 381182 (0.00091) [2022-07-09 19:08:04,585][26022] Updated weights on worker 0-0, policy_version 381192 (0.00090) [2022-07-09 19:08:05,451][25689] Fps is (10 sec: 5462.8, 60 sec: 5648.6, 300 sec: 5637.9). Total num frames: 390346752. Throughput: 0: 4970.0. Samples: 390343228. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:05,452][25689] Avg episode reward: [(0, '-46.778')] [2022-07-09 19:08:06,011][26022] Updated weights on worker 0-0, policy_version 381202 (0.00091) [2022-07-09 19:08:08,010][26022] Updated weights on worker 0-0, policy_version 381212 (0.00629) [2022-07-09 19:08:09,790][26022] Updated weights on worker 0-0, policy_version 381222 (0.00092) [2022-07-09 19:08:10,467][25689] Fps is (10 sec: 5618.1, 60 sec: 5630.6, 300 sec: 5641.8). Total num frames: 390375424. Throughput: 0: 5813.2. Samples: 390377520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:10,467][25689] Avg episode reward: [(0, '-46.678')] [2022-07-09 19:08:11,572][26022] Updated weights on worker 0-0, policy_version 381232 (0.00087) [2022-07-09 19:08:13,384][26022] Updated weights on worker 0-0, policy_version 381242 (0.00085) [2022-07-09 19:08:15,085][26022] Updated weights on worker 0-0, policy_version 381252 (0.00077) [2022-07-09 19:08:15,470][25689] Fps is (10 sec: 5621.3, 60 sec: 5636.9, 300 sec: 5637.8). Total num frames: 390403072. Throughput: 0: 5847.9. Samples: 390412020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:15,471][25689] Avg episode reward: [(0, '-47.028')] [2022-07-09 19:08:16,952][26022] Updated weights on worker 0-0, policy_version 381262 (0.00093) [2022-07-09 19:08:18,799][26022] Updated weights on worker 0-0, policy_version 381272 (0.00083) [2022-07-09 19:08:20,468][26022] Updated weights on worker 0-0, policy_version 381282 (0.00085) [2022-07-09 19:08:20,532][25689] Fps is (10 sec: 5697.3, 60 sec: 5653.0, 300 sec: 5637.8). Total num frames: 390432768. Throughput: 0: 5023.8. Samples: 390429084. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:20,533][25689] Avg episode reward: [(0, '-47.147')] [2022-07-09 19:08:22,366][26022] Updated weights on worker 0-0, policy_version 381292 (0.00087) [2022-07-09 19:08:24,261][26022] Updated weights on worker 0-0, policy_version 381302 (0.00090) [2022-07-09 19:08:25,565][25689] Fps is (10 sec: 5782.2, 60 sec: 5650.8, 300 sec: 5642.1). Total num frames: 390461440. Throughput: 0: 5952.8. Samples: 390462960. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:25,565][25689] Avg episode reward: [(0, '-47.666')] [2022-07-09 19:08:26,009][26022] Updated weights on worker 0-0, policy_version 381312 (0.00094) [2022-07-09 19:08:27,816][26022] Updated weights on worker 0-0, policy_version 381322 (0.00088) [2022-07-09 19:08:29,481][26022] Updated weights on worker 0-0, policy_version 381332 (0.00084) [2022-07-09 19:08:30,581][25689] Fps is (10 sec: 5502.8, 60 sec: 5639.0, 300 sec: 5632.0). Total num frames: 390488064. Throughput: 0: 5938.7. Samples: 390496970. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:30,581][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 19:08:31,411][26022] Updated weights on worker 0-0, policy_version 381342 (0.00084) [2022-07-09 19:08:33,152][26022] Updated weights on worker 0-0, policy_version 381352 (0.00090) [2022-07-09 19:08:34,890][26022] Updated weights on worker 0-0, policy_version 381362 (0.00090) [2022-07-09 19:08:35,584][25689] Fps is (10 sec: 5621.4, 60 sec: 5641.9, 300 sec: 5640.5). Total num frames: 390517760. Throughput: 0: 5071.9. Samples: 390514034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:35,584][25689] Avg episode reward: [(0, '-46.834')] [2022-07-09 19:08:36,924][26022] Updated weights on worker 0-0, policy_version 381372 (0.00089) [2022-07-09 19:08:38,540][26022] Updated weights on worker 0-0, policy_version 381382 (0.00083) [2022-07-09 19:08:40,551][26022] Updated weights on worker 0-0, policy_version 381392 (0.00098) [2022-07-09 19:08:40,686][25689] Fps is (10 sec: 5775.9, 60 sec: 5627.8, 300 sec: 5636.0). Total num frames: 390546432. Throughput: 0: 5910.1. Samples: 390548198. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:40,687][25689] Avg episode reward: [(0, '-46.410')] [2022-07-09 19:08:42,100][26022] Updated weights on worker 0-0, policy_version 381402 (0.00089) [2022-07-09 19:08:43,935][26022] Updated weights on worker 0-0, policy_version 381412 (0.00086) [2022-07-09 19:08:45,724][25689] Fps is (10 sec: 5655.1, 60 sec: 5643.5, 300 sec: 5639.1). Total num frames: 390575104. Throughput: 0: 5919.2. Samples: 390582286. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:45,724][25689] Avg episode reward: [(0, '-46.501')] [2022-07-09 19:08:45,873][26022] Updated weights on worker 0-0, policy_version 381422 (0.00088) [2022-07-09 19:08:47,732][26022] Updated weights on worker 0-0, policy_version 381432 (0.00085) [2022-07-09 19:08:49,274][26022] Updated weights on worker 0-0, policy_version 381442 (0.00091) [2022-07-09 19:08:50,733][25689] Fps is (10 sec: 5605.9, 60 sec: 5626.5, 300 sec: 5636.0). Total num frames: 390602752. Throughput: 0: 5085.0. Samples: 390599444. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:50,733][25689] Avg episode reward: [(0, '-46.230')] [2022-07-09 19:08:51,387][26022] Updated weights on worker 0-0, policy_version 381452 (0.00080) [2022-07-09 19:08:53,071][26022] Updated weights on worker 0-0, policy_version 381462 (0.00087) [2022-07-09 19:08:54,856][26022] Updated weights on worker 0-0, policy_version 381472 (0.00097) [2022-07-09 19:08:55,763][25689] Fps is (10 sec: 5814.3, 60 sec: 5659.4, 300 sec: 5644.1). Total num frames: 390633472. Throughput: 0: 5907.3. Samples: 390633236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:08:55,763][25689] Avg episode reward: [(0, '-46.708')] [2022-07-09 19:08:56,909][26022] Updated weights on worker 0-0, policy_version 381482 (0.00089) [2022-07-09 19:08:58,395][26022] Updated weights on worker 0-0, policy_version 381492 (0.00095) [2022-07-09 19:09:00,360][26022] Updated weights on worker 0-0, policy_version 381502 (0.00208) [2022-07-09 19:09:00,863][25689] Fps is (10 sec: 5761.9, 60 sec: 5660.6, 300 sec: 5642.5). Total num frames: 390661120. Throughput: 0: 5898.0. Samples: 390667200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:00,863][25689] Avg episode reward: [(0, '-47.155')] [2022-07-09 19:09:02,628][26022] Updated weights on worker 0-0, policy_version 381512 (0.00090) [2022-07-09 19:09:04,101][26022] Updated weights on worker 0-0, policy_version 381522 (0.00090) [2022-07-09 19:09:05,922][25689] Fps is (10 sec: 5241.4, 60 sec: 5622.0, 300 sec: 5631.4). Total num frames: 390686720. Throughput: 0: 4934.7. Samples: 390681956. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:05,923][25689] Avg episode reward: [(0, '-47.273')] [2022-07-09 19:09:06,270][26022] Updated weights on worker 0-0, policy_version 381532 (0.00088) [2022-07-09 19:09:07,652][26022] Updated weights on worker 0-0, policy_version 381542 (0.00105) [2022-07-09 19:09:09,846][26022] Updated weights on worker 0-0, policy_version 381552 (0.00090) [2022-07-09 19:09:10,943][25689] Fps is (10 sec: 5485.9, 60 sec: 5638.5, 300 sec: 5638.3). Total num frames: 390716416. Throughput: 0: 5779.0. Samples: 390716236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:10,944][25689] Avg episode reward: [(0, '-47.336')] [2022-07-09 19:09:11,236][26022] Updated weights on worker 0-0, policy_version 381562 (0.00098) [2022-07-09 19:09:13,224][26022] Updated weights on worker 0-0, policy_version 381572 (0.00632) [2022-07-09 19:09:14,979][26022] Updated weights on worker 0-0, policy_version 381582 (0.00088) [2022-07-09 19:09:16,015][25689] Fps is (10 sec: 5681.7, 60 sec: 5632.1, 300 sec: 5635.3). Total num frames: 390744064. Throughput: 0: 5784.2. Samples: 390750378. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:16,016][25689] Avg episode reward: [(0, '-47.743')] [2022-07-09 19:09:16,787][26022] Updated weights on worker 0-0, policy_version 381592 (0.00093) [2022-07-09 19:09:18,915][26022] Updated weights on worker 0-0, policy_version 381602 (0.00084) [2022-07-09 19:09:20,432][26022] Updated weights on worker 0-0, policy_version 381612 (0.00539) [2022-07-09 19:09:21,137][25689] Fps is (10 sec: 5524.6, 60 sec: 5609.6, 300 sec: 5633.0). Total num frames: 390772736. Throughput: 0: 4933.0. Samples: 390767212. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:21,139][25689] Avg episode reward: [(0, '-47.959')] [2022-07-09 19:09:22,283][26022] Updated weights on worker 0-0, policy_version 381622 (0.00098) [2022-07-09 19:09:24,290][26022] Updated weights on worker 0-0, policy_version 381632 (0.00099) [2022-07-09 19:09:25,153][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:09:25,168][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000381637_390796288.pth [2022-07-09 19:09:25,169][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000379653_388764672.pth [2022-07-09 19:09:25,932][26022] Updated weights on worker 0-0, policy_version 381642 (0.00094) [2022-07-09 19:09:26,167][25689] Fps is (10 sec: 5749.5, 60 sec: 5626.8, 300 sec: 5636.2). Total num frames: 390802432. Throughput: 0: 5883.1. Samples: 390801058. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:26,167][25689] Avg episode reward: [(0, '-48.788')] [2022-07-09 19:09:27,850][26022] Updated weights on worker 0-0, policy_version 381652 (0.00088) [2022-07-09 19:09:29,533][26022] Updated weights on worker 0-0, policy_version 381662 (0.00092) [2022-07-09 19:09:31,196][25689] Fps is (10 sec: 5700.9, 60 sec: 5642.4, 300 sec: 5636.1). Total num frames: 390830080. Throughput: 0: 5867.3. Samples: 390835068. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-09 19:09:31,196][25689] Avg episode reward: [(0, '-48.902')] [2022-07-09 19:09:31,281][26022] Updated weights on worker 0-0, policy_version 381672 (0.00092) [2022-07-09 19:09:33,216][26022] Updated weights on worker 0-0, policy_version 381682 (0.00091) [2022-07-09 19:09:35,056][26022] Updated weights on worker 0-0, policy_version 381692 (0.00088) [2022-07-09 19:09:36,199][25689] Fps is (10 sec: 5614.1, 60 sec: 5625.6, 300 sec: 5627.5). Total num frames: 390858752. Throughput: 0: 5043.3. Samples: 390852172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:09:36,199][25689] Avg episode reward: [(0, '-49.711')] [2022-07-09 19:09:36,763][26022] Updated weights on worker 0-0, policy_version 381702 (0.00085) [2022-07-09 19:09:38,543][26022] Updated weights on worker 0-0, policy_version 381712 (0.00086) [2022-07-09 19:09:40,654][26022] Updated weights on worker 0-0, policy_version 381722 (0.00084) [2022-07-09 19:09:41,265][25689] Fps is (10 sec: 5695.1, 60 sec: 5628.9, 300 sec: 5633.8). Total num frames: 390887424. Throughput: 0: 5914.8. Samples: 390886264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:09:41,266][25689] Avg episode reward: [(0, '-48.408')] [2022-07-09 19:09:42,111][26022] Updated weights on worker 0-0, policy_version 381732 (0.00060) [2022-07-09 19:09:44,128][26022] Updated weights on worker 0-0, policy_version 381742 (0.00089) [2022-07-09 19:09:45,747][26022] Updated weights on worker 0-0, policy_version 381752 (0.00093) [2022-07-09 19:09:46,297][25689] Fps is (10 sec: 5678.7, 60 sec: 5629.5, 300 sec: 5634.6). Total num frames: 390916096. Throughput: 0: 5944.0. Samples: 390920710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:09:46,297][25689] Avg episode reward: [(0, '-48.708')] [2022-07-09 19:09:47,687][26022] Updated weights on worker 0-0, policy_version 381762 (0.00091) [2022-07-09 19:09:49,430][26022] Updated weights on worker 0-0, policy_version 381772 (0.00082) [2022-07-09 19:09:51,090][26022] Updated weights on worker 0-0, policy_version 381782 (0.00101) [2022-07-09 19:09:51,356][25689] Fps is (10 sec: 5784.2, 60 sec: 5658.6, 300 sec: 5640.7). Total num frames: 390945792. Throughput: 0: 5941.9. Samples: 390954856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:09:51,358][25689] Avg episode reward: [(0, '-49.029')] [2022-07-09 19:09:53,096][26022] Updated weights on worker 0-0, policy_version 381792 (0.00340) [2022-07-09 19:09:54,739][26022] Updated weights on worker 0-0, policy_version 381802 (0.00118) [2022-07-09 19:09:56,444][25689] Fps is (10 sec: 5650.8, 60 sec: 5602.5, 300 sec: 5630.2). Total num frames: 390973440. Throughput: 0: 5918.2. Samples: 390971992. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:09:56,452][25689] Avg episode reward: [(0, '-47.906')] [2022-07-09 19:09:56,698][26022] Updated weights on worker 0-0, policy_version 381812 (0.00095) [2022-07-09 19:09:58,482][26022] Updated weights on worker 0-0, policy_version 381822 (0.00087) [2022-07-09 19:10:00,091][26022] Updated weights on worker 0-0, policy_version 381832 (0.00099) [2022-07-09 19:10:01,587][25689] Fps is (10 sec: 5504.6, 60 sec: 5615.4, 300 sec: 5642.2). Total num frames: 391002112. Throughput: 0: 5888.2. Samples: 391005926. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:01,588][25689] Avg episode reward: [(0, '-46.570')] [2022-07-09 19:10:02,545][26022] Updated weights on worker 0-0, policy_version 381842 (0.00088) [2022-07-09 19:10:04,174][26022] Updated weights on worker 0-0, policy_version 381852 (0.00089) [2022-07-09 19:10:06,303][26022] Updated weights on worker 0-0, policy_version 381862 (0.00082) [2022-07-09 19:10:06,643][25689] Fps is (10 sec: 5522.4, 60 sec: 5649.4, 300 sec: 5642.0). Total num frames: 391029760. Throughput: 0: 5740.8. Samples: 391037514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:06,643][25689] Avg episode reward: [(0, '-46.743')] [2022-07-09 19:10:07,827][26022] Updated weights on worker 0-0, policy_version 381872 (0.00078) [2022-07-09 19:10:09,821][26022] Updated weights on worker 0-0, policy_version 381882 (0.00092) [2022-07-09 19:10:11,600][26022] Updated weights on worker 0-0, policy_version 381892 (0.00097) [2022-07-09 19:10:11,697][25689] Fps is (10 sec: 5469.8, 60 sec: 5612.7, 300 sec: 5632.5). Total num frames: 391057408. Throughput: 0: 4896.5. Samples: 391054454. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:11,697][25689] Avg episode reward: [(0, '-46.193')] [2022-07-09 19:10:13,231][26022] Updated weights on worker 0-0, policy_version 381902 (0.00094) [2022-07-09 19:10:15,164][26022] Updated weights on worker 0-0, policy_version 381912 (0.00091) [2022-07-09 19:10:16,704][25689] Fps is (10 sec: 5597.9, 60 sec: 5635.5, 300 sec: 5636.7). Total num frames: 391086080. Throughput: 0: 5759.6. Samples: 391088678. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:16,704][25689] Avg episode reward: [(0, '-45.871')] [2022-07-09 19:10:16,988][26022] Updated weights on worker 0-0, policy_version 381922 (0.00080) [2022-07-09 19:10:18,590][26022] Updated weights on worker 0-0, policy_version 381932 (0.00089) [2022-07-09 19:10:20,674][26022] Updated weights on worker 0-0, policy_version 381942 (0.00087) [2022-07-09 19:10:21,779][25689] Fps is (10 sec: 5789.3, 60 sec: 5656.8, 300 sec: 5642.2). Total num frames: 391115776. Throughput: 0: 5804.1. Samples: 391123118. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:21,779][25689] Avg episode reward: [(0, '-45.579')] [2022-07-09 19:10:22,083][26022] Updated weights on worker 0-0, policy_version 381952 (0.00093) [2022-07-09 19:10:24,107][26022] Updated weights on worker 0-0, policy_version 381962 (0.00082) [2022-07-09 19:10:26,071][26022] Updated weights on worker 0-0, policy_version 381972 (0.00083) [2022-07-09 19:10:26,792][25689] Fps is (10 sec: 5582.7, 60 sec: 5607.7, 300 sec: 5632.0). Total num frames: 391142400. Throughput: 0: 5080.2. Samples: 391139876. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:26,793][25689] Avg episode reward: [(0, '-46.490')] [2022-07-09 19:10:27,924][26022] Updated weights on worker 0-0, policy_version 381982 (0.00091) [2022-07-09 19:10:29,776][26022] Updated weights on worker 0-0, policy_version 381992 (0.00086) [2022-07-09 19:10:31,262][26022] Updated weights on worker 0-0, policy_version 382002 (0.00087) [2022-07-09 19:10:31,811][25689] Fps is (10 sec: 5716.1, 60 sec: 5659.3, 300 sec: 5642.1). Total num frames: 391173120. Throughput: 0: 5933.5. Samples: 391173800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:31,811][25689] Avg episode reward: [(0, '-46.604')] [2022-07-09 19:10:33,391][26022] Updated weights on worker 0-0, policy_version 382012 (0.00090) [2022-07-09 19:10:35,060][26022] Updated weights on worker 0-0, policy_version 382022 (0.00106) [2022-07-09 19:10:36,819][25689] Fps is (10 sec: 5719.2, 60 sec: 5625.1, 300 sec: 5634.0). Total num frames: 391199744. Throughput: 0: 5937.1. Samples: 391208102. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:36,819][25689] Avg episode reward: [(0, '-46.181')] [2022-07-09 19:10:36,928][26022] Updated weights on worker 0-0, policy_version 382032 (0.00092) [2022-07-09 19:10:38,469][26022] Updated weights on worker 0-0, policy_version 382042 (0.00086) [2022-07-09 19:10:40,532][26022] Updated weights on worker 0-0, policy_version 382052 (0.00086) [2022-07-09 19:10:41,873][25689] Fps is (10 sec: 5597.1, 60 sec: 5643.1, 300 sec: 5640.1). Total num frames: 391229440. Throughput: 0: 5075.3. Samples: 391225104. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:41,874][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 19:10:42,248][26022] Updated weights on worker 0-0, policy_version 382062 (0.00086) [2022-07-09 19:10:44,071][26022] Updated weights on worker 0-0, policy_version 382072 (0.00089) [2022-07-09 19:10:45,626][26022] Updated weights on worker 0-0, policy_version 382082 (0.00087) [2022-07-09 19:10:46,935][25689] Fps is (10 sec: 5769.9, 60 sec: 5640.3, 300 sec: 5639.1). Total num frames: 391258112. Throughput: 0: 5944.2. Samples: 391259608. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:46,935][25689] Avg episode reward: [(0, '-47.218')] [2022-07-09 19:10:47,582][26022] Updated weights on worker 0-0, policy_version 382092 (0.00086) [2022-07-09 19:10:49,347][26022] Updated weights on worker 0-0, policy_version 382102 (0.00088) [2022-07-09 19:10:51,075][26022] Updated weights on worker 0-0, policy_version 382112 (0.00099) [2022-07-09 19:10:51,944][25689] Fps is (10 sec: 5694.1, 60 sec: 5628.0, 300 sec: 5639.0). Total num frames: 391286784. Throughput: 0: 5967.8. Samples: 391293950. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:51,944][25689] Avg episode reward: [(0, '-47.025')] [2022-07-09 19:10:52,948][26022] Updated weights on worker 0-0, policy_version 382122 (0.00083) [2022-07-09 19:10:54,619][26022] Updated weights on worker 0-0, policy_version 382132 (0.00096) [2022-07-09 19:10:56,373][26022] Updated weights on worker 0-0, policy_version 382142 (0.00090) [2022-07-09 19:10:56,960][25689] Fps is (10 sec: 5719.7, 60 sec: 5651.7, 300 sec: 5640.6). Total num frames: 391315456. Throughput: 0: 5113.1. Samples: 391311088. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:10:56,961][25689] Avg episode reward: [(0, '-45.807')] [2022-07-09 19:10:58,341][26022] Updated weights on worker 0-0, policy_version 382152 (0.00106) [2022-07-09 19:11:00,029][26022] Updated weights on worker 0-0, policy_version 382162 (0.00092) [2022-07-09 19:11:02,054][25689] Fps is (10 sec: 5469.3, 60 sec: 5622.4, 300 sec: 5636.8). Total num frames: 391342080. Throughput: 0: 5942.2. Samples: 391345022. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:02,055][25689] Avg episode reward: [(0, '-46.122')] [2022-07-09 19:11:02,281][26022] Updated weights on worker 0-0, policy_version 382172 (0.00089) [2022-07-09 19:11:04,112][26022] Updated weights on worker 0-0, policy_version 382182 (0.00366) [2022-07-09 19:11:05,837][26022] Updated weights on worker 0-0, policy_version 382192 (0.00086) [2022-07-09 19:11:07,103][25689] Fps is (10 sec: 5451.9, 60 sec: 5640.0, 300 sec: 5639.6). Total num frames: 391370752. Throughput: 0: 5827.0. Samples: 391377128. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:07,104][25689] Avg episode reward: [(0, '-46.351')] [2022-07-09 19:11:07,810][26022] Updated weights on worker 0-0, policy_version 382202 (0.00089) [2022-07-09 19:11:09,651][26022] Updated weights on worker 0-0, policy_version 382212 (0.00088) [2022-07-09 19:11:11,264][26022] Updated weights on worker 0-0, policy_version 382222 (0.00087) [2022-07-09 19:11:12,144][25689] Fps is (10 sec: 5683.2, 60 sec: 5658.1, 300 sec: 5642.9). Total num frames: 391399424. Throughput: 0: 4962.7. Samples: 391394198. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:12,145][25689] Avg episode reward: [(0, '-46.055')] [2022-07-09 19:11:13,108][26022] Updated weights on worker 0-0, policy_version 382232 (0.00085) [2022-07-09 19:11:14,899][26022] Updated weights on worker 0-0, policy_version 382242 (0.00100) [2022-07-09 19:11:16,597][26022] Updated weights on worker 0-0, policy_version 382252 (0.00055) [2022-07-09 19:11:17,151][25689] Fps is (10 sec: 5808.9, 60 sec: 5675.1, 300 sec: 5645.0). Total num frames: 391429120. Throughput: 0: 5833.7. Samples: 391428874. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:17,151][25689] Avg episode reward: [(0, '-46.067')] [2022-07-09 19:11:18,547][26022] Updated weights on worker 0-0, policy_version 382262 (0.00085) [2022-07-09 19:11:20,031][26022] Updated weights on worker 0-0, policy_version 382272 (0.00092) [2022-07-09 19:11:22,104][26022] Updated weights on worker 0-0, policy_version 382282 (0.00089) [2022-07-09 19:11:22,231][25689] Fps is (10 sec: 5685.1, 60 sec: 5640.7, 300 sec: 5641.6). Total num frames: 391456768. Throughput: 0: 5854.8. Samples: 391463152. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:22,231][25689] Avg episode reward: [(0, '-46.278')] [2022-07-09 19:11:23,693][26022] Updated weights on worker 0-0, policy_version 382292 (0.00096) [2022-07-09 19:11:25,181][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:11:25,196][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000382299_391474176.pth [2022-07-09 19:11:25,196][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000380314_389441536.pth [2022-07-09 19:11:25,573][26022] Updated weights on worker 0-0, policy_version 382302 (0.00084) [2022-07-09 19:11:27,246][25689] Fps is (10 sec: 5578.8, 60 sec: 5674.4, 300 sec: 5642.5). Total num frames: 391485440. Throughput: 0: 5115.4. Samples: 391480170. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:27,247][25689] Avg episode reward: [(0, '-46.432')] [2022-07-09 19:11:27,563][26022] Updated weights on worker 0-0, policy_version 382312 (0.00089) [2022-07-09 19:11:29,078][26022] Updated weights on worker 0-0, policy_version 382322 (0.00092) [2022-07-09 19:11:31,013][26022] Updated weights on worker 0-0, policy_version 382332 (0.00086) [2022-07-09 19:11:32,287][25689] Fps is (10 sec: 5804.3, 60 sec: 5655.4, 300 sec: 5642.2). Total num frames: 391515136. Throughput: 0: 5962.5. Samples: 391514298. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:32,287][25689] Avg episode reward: [(0, '-45.540')] [2022-07-09 19:11:32,571][26022] Updated weights on worker 0-0, policy_version 382342 (0.00087) [2022-07-09 19:11:34,559][26022] Updated weights on worker 0-0, policy_version 382352 (0.00097) [2022-07-09 19:11:36,431][26022] Updated weights on worker 0-0, policy_version 382362 (0.00088) [2022-07-09 19:11:37,294][25689] Fps is (10 sec: 5707.0, 60 sec: 5672.4, 300 sec: 5643.3). Total num frames: 391542784. Throughput: 0: 5952.9. Samples: 391548786. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:37,295][25689] Avg episode reward: [(0, '-44.875')] [2022-07-09 19:11:38,179][26022] Updated weights on worker 0-0, policy_version 382372 (0.00092) [2022-07-09 19:11:39,907][26022] Updated weights on worker 0-0, policy_version 382382 (0.00083) [2022-07-09 19:11:41,916][26022] Updated weights on worker 0-0, policy_version 382392 (0.00102) [2022-07-09 19:11:42,401][25689] Fps is (10 sec: 5669.7, 60 sec: 5667.5, 300 sec: 5644.9). Total num frames: 391572480. Throughput: 0: 5095.1. Samples: 391565920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:42,401][25689] Avg episode reward: [(0, '-44.886')] [2022-07-09 19:11:43,500][26022] Updated weights on worker 0-0, policy_version 382402 (0.00081) [2022-07-09 19:11:45,476][26022] Updated weights on worker 0-0, policy_version 382412 (0.00094) [2022-07-09 19:11:46,997][26022] Updated weights on worker 0-0, policy_version 382422 (0.00085) [2022-07-09 19:11:47,506][25689] Fps is (10 sec: 5816.3, 60 sec: 5680.4, 300 sec: 5646.7). Total num frames: 391602176. Throughput: 0: 5922.8. Samples: 391600162. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:47,506][25689] Avg episode reward: [(0, '-45.510')] [2022-07-09 19:11:48,960][26022] Updated weights on worker 0-0, policy_version 382432 (0.00089) [2022-07-09 19:11:50,595][26022] Updated weights on worker 0-0, policy_version 382442 (0.00082) [2022-07-09 19:11:52,396][26022] Updated weights on worker 0-0, policy_version 382452 (0.00086) [2022-07-09 19:11:52,538][25689] Fps is (10 sec: 5757.9, 60 sec: 5678.2, 300 sec: 5653.5). Total num frames: 391630848. Throughput: 0: 5952.0. Samples: 391634832. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:52,538][25689] Avg episode reward: [(0, '-46.527')] [2022-07-09 19:11:54,295][26022] Updated weights on worker 0-0, policy_version 382462 (0.00093) [2022-07-09 19:11:56,076][26022] Updated weights on worker 0-0, policy_version 382472 (0.00095) [2022-07-09 19:11:57,569][25689] Fps is (10 sec: 5698.2, 60 sec: 5676.8, 300 sec: 5647.8). Total num frames: 391659520. Throughput: 0: 5932.9. Samples: 391669074. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:11:57,570][25689] Avg episode reward: [(0, '-47.173')] [2022-07-09 19:11:57,958][26022] Updated weights on worker 0-0, policy_version 382482 (0.00086) [2022-07-09 19:11:59,766][26022] Updated weights on worker 0-0, policy_version 382492 (0.00075) [2022-07-09 19:12:01,631][26022] Updated weights on worker 0-0, policy_version 382502 (0.00096) [2022-07-09 19:12:02,614][25689] Fps is (10 sec: 5386.2, 60 sec: 5664.5, 300 sec: 5644.3). Total num frames: 391685120. Throughput: 0: 5941.3. Samples: 391686012. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:12:02,615][25689] Avg episode reward: [(0, '-47.310')] [2022-07-09 19:12:03,706][26022] Updated weights on worker 0-0, policy_version 382512 (0.00504) [2022-07-09 19:12:05,552][26022] Updated weights on worker 0-0, policy_version 382522 (0.00085) [2022-07-09 19:12:07,137][26022] Updated weights on worker 0-0, policy_version 382532 (0.00087) [2022-07-09 19:12:07,682][25689] Fps is (10 sec: 5467.6, 60 sec: 5679.5, 300 sec: 5643.1). Total num frames: 391714816. Throughput: 0: 5837.7. Samples: 391717948. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 19:12:07,683][25689] Avg episode reward: [(0, '-48.365')] [2022-07-09 19:12:09,172][26022] Updated weights on worker 0-0, policy_version 382542 (0.00089) [2022-07-09 19:12:10,891][26022] Updated weights on worker 0-0, policy_version 382552 (0.00093) [2022-07-09 19:12:12,535][26022] Updated weights on worker 0-0, policy_version 382562 (0.00090) [2022-07-09 19:12:12,697][25689] Fps is (10 sec: 5789.0, 60 sec: 5682.1, 300 sec: 5647.6). Total num frames: 391743488. Throughput: 0: 5825.4. Samples: 391752266. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:12,697][25689] Avg episode reward: [(0, '-48.624')] [2022-07-09 19:12:14,436][26022] Updated weights on worker 0-0, policy_version 382572 (0.00087) [2022-07-09 19:12:16,227][26022] Updated weights on worker 0-0, policy_version 382582 (0.00085) [2022-07-09 19:12:17,779][25689] Fps is (10 sec: 5679.4, 60 sec: 5658.1, 300 sec: 5647.1). Total num frames: 391772160. Throughput: 0: 4968.3. Samples: 391769484. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:17,780][25689] Avg episode reward: [(0, '-48.548')] [2022-07-09 19:12:17,963][26022] Updated weights on worker 0-0, policy_version 382592 (0.00094) [2022-07-09 19:12:20,034][26022] Updated weights on worker 0-0, policy_version 382602 (0.00087) [2022-07-09 19:12:21,537][26022] Updated weights on worker 0-0, policy_version 382612 (0.00098) [2022-07-09 19:12:22,872][25689] Fps is (10 sec: 5635.8, 60 sec: 5673.8, 300 sec: 5645.5). Total num frames: 391800832. Throughput: 0: 5809.7. Samples: 391803702. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:22,872][25689] Avg episode reward: [(0, '-48.514')] [2022-07-09 19:12:23,511][26022] Updated weights on worker 0-0, policy_version 382622 (0.00088) [2022-07-09 19:12:25,110][26022] Updated weights on worker 0-0, policy_version 382632 (0.00087) [2022-07-09 19:12:27,137][26022] Updated weights on worker 0-0, policy_version 382642 (0.00094) [2022-07-09 19:12:27,899][25689] Fps is (10 sec: 5767.6, 60 sec: 5689.5, 300 sec: 5653.2). Total num frames: 391830528. Throughput: 0: 5935.5. Samples: 391837944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:27,900][25689] Avg episode reward: [(0, '-48.019')] [2022-07-09 19:12:28,776][26022] Updated weights on worker 0-0, policy_version 382652 (0.00091) [2022-07-09 19:12:30,823][26022] Updated weights on worker 0-0, policy_version 382662 (0.00090) [2022-07-09 19:12:32,473][26022] Updated weights on worker 0-0, policy_version 382672 (0.00086) [2022-07-09 19:12:32,904][25689] Fps is (10 sec: 5817.9, 60 sec: 5676.0, 300 sec: 5650.3). Total num frames: 391859200. Throughput: 0: 5073.7. Samples: 391854792. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:32,906][25689] Avg episode reward: [(0, '-47.603')] [2022-07-09 19:12:34,454][26022] Updated weights on worker 0-0, policy_version 382682 (0.00107) [2022-07-09 19:12:36,027][26022] Updated weights on worker 0-0, policy_version 382692 (0.00088) [2022-07-09 19:12:37,925][25689] Fps is (10 sec: 5515.2, 60 sec: 5657.8, 300 sec: 5642.1). Total num frames: 391885824. Throughput: 0: 5939.1. Samples: 391889134. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:37,927][25689] Avg episode reward: [(0, '-46.642')] [2022-07-09 19:12:37,939][26022] Updated weights on worker 0-0, policy_version 382702 (0.00218) [2022-07-09 19:12:39,640][26022] Updated weights on worker 0-0, policy_version 382712 (0.00088) [2022-07-09 19:12:41,473][26022] Updated weights on worker 0-0, policy_version 382722 (0.00086) [2022-07-09 19:12:42,986][25689] Fps is (10 sec: 5586.2, 60 sec: 5662.1, 300 sec: 5648.3). Total num frames: 391915520. Throughput: 0: 5966.5. Samples: 391923714. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:42,986][25689] Avg episode reward: [(0, '-46.809')] [2022-07-09 19:12:43,189][26022] Updated weights on worker 0-0, policy_version 382732 (0.00090) [2022-07-09 19:12:44,854][26022] Updated weights on worker 0-0, policy_version 382742 (0.00108) [2022-07-09 19:12:46,791][26022] Updated weights on worker 0-0, policy_version 382752 (0.00107) [2022-07-09 19:12:47,999][25689] Fps is (10 sec: 5896.0, 60 sec: 5670.7, 300 sec: 5651.7). Total num frames: 391945216. Throughput: 0: 5129.1. Samples: 391941036. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:48,000][25689] Avg episode reward: [(0, '-47.016')] [2022-07-09 19:12:48,598][26022] Updated weights on worker 0-0, policy_version 382762 (0.00084) [2022-07-09 19:12:50,272][26022] Updated weights on worker 0-0, policy_version 382772 (0.00087) [2022-07-09 19:12:52,245][26022] Updated weights on worker 0-0, policy_version 382782 (0.00091) [2022-07-09 19:12:53,005][25689] Fps is (10 sec: 5825.9, 60 sec: 5673.2, 300 sec: 5651.9). Total num frames: 391973888. Throughput: 0: 6012.4. Samples: 391975646. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:53,005][25689] Avg episode reward: [(0, '-47.117')] [2022-07-09 19:12:53,728][26022] Updated weights on worker 0-0, policy_version 382792 (0.00087) [2022-07-09 19:12:55,554][26022] Updated weights on worker 0-0, policy_version 382802 (0.00083) [2022-07-09 19:12:57,433][26022] Updated weights on worker 0-0, policy_version 382812 (0.00088) [2022-07-09 19:12:58,007][25689] Fps is (10 sec: 5627.2, 60 sec: 5658.9, 300 sec: 5654.0). Total num frames: 392001536. Throughput: 0: 6043.1. Samples: 392010490. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:12:58,009][25689] Avg episode reward: [(0, '-46.663')] [2022-07-09 19:12:59,352][26022] Updated weights on worker 0-0, policy_version 382822 (0.00088) [2022-07-09 19:13:01,047][26022] Updated weights on worker 0-0, policy_version 382832 (0.00093) [2022-07-09 19:13:03,045][25689] Fps is (10 sec: 5405.2, 60 sec: 5676.5, 300 sec: 5650.0). Total num frames: 392028160. Throughput: 0: 5176.5. Samples: 392027550. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:03,051][25689] Avg episode reward: [(0, '-46.878')] [2022-07-09 19:13:03,333][26022] Updated weights on worker 0-0, policy_version 382842 (0.00087) [2022-07-09 19:13:04,945][26022] Updated weights on worker 0-0, policy_version 382852 (0.00086) [2022-07-09 19:13:06,927][26022] Updated weights on worker 0-0, policy_version 382862 (0.00091) [2022-07-09 19:13:08,063][25689] Fps is (10 sec: 5600.7, 60 sec: 5681.3, 300 sec: 5653.4). Total num frames: 392057856. Throughput: 0: 5899.2. Samples: 392059400. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:08,064][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 19:13:08,477][26022] Updated weights on worker 0-0, policy_version 382872 (0.00089) [2022-07-09 19:13:10,705][26022] Updated weights on worker 0-0, policy_version 382882 (0.00087) [2022-07-09 19:13:12,056][26022] Updated weights on worker 0-0, policy_version 382892 (0.00084) [2022-07-09 19:13:13,080][25689] Fps is (10 sec: 5714.4, 60 sec: 5664.0, 300 sec: 5653.1). Total num frames: 392085504. Throughput: 0: 5872.9. Samples: 392093548. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:13,081][25689] Avg episode reward: [(0, '-46.254')] [2022-07-09 19:13:14,104][26022] Updated weights on worker 0-0, policy_version 382902 (0.00098) [2022-07-09 19:13:15,800][26022] Updated weights on worker 0-0, policy_version 382912 (0.00086) [2022-07-09 19:13:17,594][26022] Updated weights on worker 0-0, policy_version 382922 (0.00090) [2022-07-09 19:13:18,110][25689] Fps is (10 sec: 5605.8, 60 sec: 5669.0, 300 sec: 5650.3). Total num frames: 392114176. Throughput: 0: 4978.9. Samples: 392110578. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:18,112][25689] Avg episode reward: [(0, '-46.645')] [2022-07-09 19:13:19,546][26022] Updated weights on worker 0-0, policy_version 382932 (0.00087) [2022-07-09 19:13:21,307][26022] Updated weights on worker 0-0, policy_version 382942 (0.00092) [2022-07-09 19:13:22,990][26022] Updated weights on worker 0-0, policy_version 382952 (0.00098) [2022-07-09 19:13:23,159][25689] Fps is (10 sec: 5689.5, 60 sec: 5673.1, 300 sec: 5650.0). Total num frames: 392142848. Throughput: 0: 5830.8. Samples: 392144830. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:23,160][25689] Avg episode reward: [(0, '-46.843')] [2022-07-09 19:13:25,022][26022] Updated weights on worker 0-0, policy_version 382962 (0.00087) [2022-07-09 19:13:25,387][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:13:25,396][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000382964_392155136.pth [2022-07-09 19:13:25,396][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000380977_390120448.pth [2022-07-09 19:13:26,510][26022] Updated weights on worker 0-0, policy_version 382972 (0.00263) [2022-07-09 19:13:28,171][25689] Fps is (10 sec: 5597.9, 60 sec: 5640.6, 300 sec: 5653.5). Total num frames: 392170496. Throughput: 0: 5934.7. Samples: 392178732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:28,171][25689] Avg episode reward: [(0, '-46.718')] [2022-07-09 19:13:28,672][26022] Updated weights on worker 0-0, policy_version 382982 (0.00085) [2022-07-09 19:13:30,122][26022] Updated weights on worker 0-0, policy_version 382992 (0.00094) [2022-07-09 19:13:32,115][26022] Updated weights on worker 0-0, policy_version 383002 (0.00085) [2022-07-09 19:13:33,209][25689] Fps is (10 sec: 5705.9, 60 sec: 5654.4, 300 sec: 5652.9). Total num frames: 392200192. Throughput: 0: 5081.6. Samples: 392195834. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:33,211][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 19:13:33,889][26022] Updated weights on worker 0-0, policy_version 383012 (0.00096) [2022-07-09 19:13:35,671][26022] Updated weights on worker 0-0, policy_version 383022 (0.00090) [2022-07-09 19:13:37,667][26022] Updated weights on worker 0-0, policy_version 383032 (0.00086) [2022-07-09 19:13:38,226][25689] Fps is (10 sec: 5804.7, 60 sec: 5688.8, 300 sec: 5654.5). Total num frames: 392228864. Throughput: 0: 5925.2. Samples: 392229770. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:38,226][25689] Avg episode reward: [(0, '-46.810')] [2022-07-09 19:13:39,297][26022] Updated weights on worker 0-0, policy_version 383042 (0.00093) [2022-07-09 19:13:41,175][26022] Updated weights on worker 0-0, policy_version 383052 (0.00082) [2022-07-09 19:13:42,918][26022] Updated weights on worker 0-0, policy_version 383062 (0.00090) [2022-07-09 19:13:43,274][25689] Fps is (10 sec: 5595.8, 60 sec: 5656.1, 300 sec: 5650.8). Total num frames: 392256512. Throughput: 0: 5924.7. Samples: 392264004. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:43,274][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 19:13:44,635][26022] Updated weights on worker 0-0, policy_version 383072 (0.00091) [2022-07-09 19:13:46,535][26022] Updated weights on worker 0-0, policy_version 383082 (0.00092) [2022-07-09 19:13:48,291][25689] Fps is (10 sec: 5595.8, 60 sec: 5638.7, 300 sec: 5654.1). Total num frames: 392285184. Throughput: 0: 5088.1. Samples: 392281104. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:48,291][25689] Avg episode reward: [(0, '-47.115')] [2022-07-09 19:13:48,450][26022] Updated weights on worker 0-0, policy_version 383092 (0.00086) [2022-07-09 19:13:50,053][26022] Updated weights on worker 0-0, policy_version 383102 (0.00084) [2022-07-09 19:13:52,038][26022] Updated weights on worker 0-0, policy_version 383112 (0.00086) [2022-07-09 19:13:53,305][25689] Fps is (10 sec: 5614.4, 60 sec: 5620.9, 300 sec: 5644.1). Total num frames: 392312832. Throughput: 0: 5935.7. Samples: 392315118. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:53,306][25689] Avg episode reward: [(0, '-47.588')] [2022-07-09 19:13:53,740][26022] Updated weights on worker 0-0, policy_version 383122 (0.00098) [2022-07-09 19:13:55,711][26022] Updated weights on worker 0-0, policy_version 383132 (0.00087) [2022-07-09 19:13:57,525][26022] Updated weights on worker 0-0, policy_version 383142 (0.00093) [2022-07-09 19:13:58,329][25689] Fps is (10 sec: 5814.5, 60 sec: 5669.8, 300 sec: 5655.9). Total num frames: 392343552. Throughput: 0: 5937.9. Samples: 392349140. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:13:58,330][25689] Avg episode reward: [(0, '-47.223')] [2022-07-09 19:13:59,341][26022] Updated weights on worker 0-0, policy_version 383152 (0.00086) [2022-07-09 19:14:00,991][26022] Updated weights on worker 0-0, policy_version 383162 (0.00061) [2022-07-09 19:14:03,259][26022] Updated weights on worker 0-0, policy_version 383172 (0.00092) [2022-07-09 19:14:03,376][25689] Fps is (10 sec: 5491.0, 60 sec: 5635.1, 300 sec: 5652.7). Total num frames: 392368128. Throughput: 0: 5094.5. Samples: 392366412. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:03,376][25689] Avg episode reward: [(0, '-47.130')] [2022-07-09 19:14:04,903][26022] Updated weights on worker 0-0, policy_version 383182 (0.00079) [2022-07-09 19:14:06,946][26022] Updated weights on worker 0-0, policy_version 383192 (0.00086) [2022-07-09 19:14:08,382][25689] Fps is (10 sec: 5297.0, 60 sec: 5619.2, 300 sec: 5649.5). Total num frames: 392396800. Throughput: 0: 5834.1. Samples: 392398316. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:08,382][25689] Avg episode reward: [(0, '-47.702')] [2022-07-09 19:14:08,519][26022] Updated weights on worker 0-0, policy_version 383202 (0.00092) [2022-07-09 19:14:10,328][26022] Updated weights on worker 0-0, policy_version 383212 (0.00084) [2022-07-09 19:14:12,132][26022] Updated weights on worker 0-0, policy_version 383222 (0.00093) [2022-07-09 19:14:13,388][25689] Fps is (10 sec: 5727.4, 60 sec: 5637.2, 300 sec: 5654.2). Total num frames: 392425472. Throughput: 0: 5845.4. Samples: 392432510. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:13,388][25689] Avg episode reward: [(0, '-46.909')] [2022-07-09 19:14:14,187][26022] Updated weights on worker 0-0, policy_version 383232 (0.00090) [2022-07-09 19:14:15,809][26022] Updated weights on worker 0-0, policy_version 383242 (0.00093) [2022-07-09 19:14:17,660][26022] Updated weights on worker 0-0, policy_version 383252 (0.00086) [2022-07-09 19:14:18,419][25689] Fps is (10 sec: 5713.2, 60 sec: 5637.1, 300 sec: 5655.9). Total num frames: 392454144. Throughput: 0: 4995.3. Samples: 392449494. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:18,419][25689] Avg episode reward: [(0, '-46.214')] [2022-07-09 19:14:19,462][26022] Updated weights on worker 0-0, policy_version 383262 (0.00091) [2022-07-09 19:14:21,179][26022] Updated weights on worker 0-0, policy_version 383272 (0.00082) [2022-07-09 19:14:23,174][26022] Updated weights on worker 0-0, policy_version 383282 (0.00084) [2022-07-09 19:14:23,521][25689] Fps is (10 sec: 5557.8, 60 sec: 5615.2, 300 sec: 5647.7). Total num frames: 392481792. Throughput: 0: 5818.7. Samples: 392483634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:23,522][25689] Avg episode reward: [(0, '-46.628')] [2022-07-09 19:14:24,831][26022] Updated weights on worker 0-0, policy_version 383292 (0.00086) [2022-07-09 19:14:26,751][26022] Updated weights on worker 0-0, policy_version 383302 (0.00082) [2022-07-09 19:14:28,375][26022] Updated weights on worker 0-0, policy_version 383312 (0.00083) [2022-07-09 19:14:28,539][25689] Fps is (10 sec: 5767.6, 60 sec: 5665.5, 300 sec: 5658.2). Total num frames: 392512512. Throughput: 0: 5928.8. Samples: 392517824. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:28,539][25689] Avg episode reward: [(0, '-46.230')] [2022-07-09 19:14:30,084][26022] Updated weights on worker 0-0, policy_version 383322 (0.00091) [2022-07-09 19:14:32,071][26022] Updated weights on worker 0-0, policy_version 383332 (0.00091) [2022-07-09 19:14:33,571][25689] Fps is (10 sec: 5910.1, 60 sec: 5649.2, 300 sec: 5657.7). Total num frames: 392541184. Throughput: 0: 5086.1. Samples: 392535162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:33,571][25689] Avg episode reward: [(0, '-46.449')] [2022-07-09 19:14:33,720][26022] Updated weights on worker 0-0, policy_version 383342 (0.00095) [2022-07-09 19:14:35,592][26022] Updated weights on worker 0-0, policy_version 383352 (0.00081) [2022-07-09 19:14:37,390][26022] Updated weights on worker 0-0, policy_version 383362 (0.00094) [2022-07-09 19:14:38,579][25689] Fps is (10 sec: 5507.5, 60 sec: 5616.0, 300 sec: 5651.9). Total num frames: 392567808. Throughput: 0: 5944.5. Samples: 392569336. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:38,580][25689] Avg episode reward: [(0, '-46.311')] [2022-07-09 19:14:39,108][26022] Updated weights on worker 0-0, policy_version 383372 (0.00083) [2022-07-09 19:14:41,025][26022] Updated weights on worker 0-0, policy_version 383382 (0.00087) [2022-07-09 19:14:42,875][26022] Updated weights on worker 0-0, policy_version 383392 (0.00087) [2022-07-09 19:14:43,655][25689] Fps is (10 sec: 5686.7, 60 sec: 5664.3, 300 sec: 5657.9). Total num frames: 392598528. Throughput: 0: 5947.9. Samples: 392603384. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-09 19:14:43,655][25689] Avg episode reward: [(0, '-47.542')] [2022-07-09 19:14:44,579][26022] Updated weights on worker 0-0, policy_version 383402 (0.00094) [2022-07-09 19:14:46,548][26022] Updated weights on worker 0-0, policy_version 383412 (0.00085) [2022-07-09 19:14:48,155][26022] Updated weights on worker 0-0, policy_version 383422 (0.00084) [2022-07-09 19:14:48,690][25689] Fps is (10 sec: 5772.6, 60 sec: 5645.6, 300 sec: 5651.5). Total num frames: 392626176. Throughput: 0: 5100.9. Samples: 392620614. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:14:48,691][25689] Avg episode reward: [(0, '-47.511')] [2022-07-09 19:14:49,904][26022] Updated weights on worker 0-0, policy_version 383432 (0.00088) [2022-07-09 19:14:51,705][26022] Updated weights on worker 0-0, policy_version 383442 (0.00091) [2022-07-09 19:14:53,501][26022] Updated weights on worker 0-0, policy_version 383452 (0.00081) [2022-07-09 19:14:53,710][25689] Fps is (10 sec: 5703.0, 60 sec: 5679.1, 300 sec: 5659.7). Total num frames: 392655872. Throughput: 0: 5955.6. Samples: 392655100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:14:53,710][25689] Avg episode reward: [(0, '-48.313')] [2022-07-09 19:14:55,353][26022] Updated weights on worker 0-0, policy_version 383462 (0.00084) [2022-07-09 19:14:57,180][26022] Updated weights on worker 0-0, policy_version 383472 (0.00091) [2022-07-09 19:14:58,712][25689] Fps is (10 sec: 5824.0, 60 sec: 5647.2, 300 sec: 5662.4). Total num frames: 392684544. Throughput: 0: 5964.2. Samples: 392689414. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:14:58,713][25689] Avg episode reward: [(0, '-48.067')] [2022-07-09 19:14:58,842][26022] Updated weights on worker 0-0, policy_version 383482 (0.00085) [2022-07-09 19:15:00,842][26022] Updated weights on worker 0-0, policy_version 383492 (0.00096) [2022-07-09 19:15:02,678][26022] Updated weights on worker 0-0, policy_version 383502 (0.00084) [2022-07-09 19:15:03,815][25689] Fps is (10 sec: 5370.7, 60 sec: 5658.8, 300 sec: 5654.6). Total num frames: 392710144. Throughput: 0: 5115.1. Samples: 392706506. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:03,815][25689] Avg episode reward: [(0, '-48.153')] [2022-07-09 19:15:04,883][26022] Updated weights on worker 0-0, policy_version 383512 (0.00091) [2022-07-09 19:15:06,425][26022] Updated weights on worker 0-0, policy_version 383522 (0.00087) [2022-07-09 19:15:08,342][26022] Updated weights on worker 0-0, policy_version 383532 (0.00096) [2022-07-09 19:15:08,818][25689] Fps is (10 sec: 5471.9, 60 sec: 5676.1, 300 sec: 5662.4). Total num frames: 392739840. Throughput: 0: 5864.0. Samples: 392738642. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:08,818][25689] Avg episode reward: [(0, '-47.846')] [2022-07-09 19:15:09,891][26022] Updated weights on worker 0-0, policy_version 383542 (0.00095) [2022-07-09 19:15:11,823][26022] Updated weights on worker 0-0, policy_version 383552 (0.00097) [2022-07-09 19:15:13,511][26022] Updated weights on worker 0-0, policy_version 383562 (0.00090) [2022-07-09 19:15:13,830][25689] Fps is (10 sec: 5930.3, 60 sec: 5692.5, 300 sec: 5665.8). Total num frames: 392769536. Throughput: 0: 5859.8. Samples: 392773000. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:13,830][25689] Avg episode reward: [(0, '-47.063')] [2022-07-09 19:15:15,423][26022] Updated weights on worker 0-0, policy_version 383572 (0.00087) [2022-07-09 19:15:17,147][26022] Updated weights on worker 0-0, policy_version 383582 (0.00087) [2022-07-09 19:15:18,832][25689] Fps is (10 sec: 5521.5, 60 sec: 5644.3, 300 sec: 5653.4). Total num frames: 392795136. Throughput: 0: 5001.3. Samples: 392790040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:18,833][25689] Avg episode reward: [(0, '-47.447')] [2022-07-09 19:15:19,160][26022] Updated weights on worker 0-0, policy_version 383592 (0.00095) [2022-07-09 19:15:20,924][26022] Updated weights on worker 0-0, policy_version 383602 (0.00082) [2022-07-09 19:15:22,622][26022] Updated weights on worker 0-0, policy_version 383612 (0.00083) [2022-07-09 19:15:23,882][25689] Fps is (10 sec: 5500.9, 60 sec: 5683.2, 300 sec: 5663.0). Total num frames: 392824832. Throughput: 0: 5856.9. Samples: 392824038. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:23,882][25689] Avg episode reward: [(0, '-46.911')] [2022-07-09 19:15:24,693][26022] Updated weights on worker 0-0, policy_version 383622 (0.00090) [2022-07-09 19:15:25,399][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:15:25,410][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000383627_392834048.pth [2022-07-09 19:15:25,411][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000381637_390796288.pth [2022-07-09 19:15:26,361][26022] Updated weights on worker 0-0, policy_version 383632 (0.00078) [2022-07-09 19:15:28,179][26022] Updated weights on worker 0-0, policy_version 383642 (0.00090) [2022-07-09 19:15:28,895][25689] Fps is (10 sec: 5698.9, 60 sec: 5632.7, 300 sec: 5652.8). Total num frames: 392852480. Throughput: 0: 5940.0. Samples: 392857898. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:28,895][25689] Avg episode reward: [(0, '-47.587')] [2022-07-09 19:15:29,903][26022] Updated weights on worker 0-0, policy_version 383652 (0.00087) [2022-07-09 19:15:31,692][26022] Updated weights on worker 0-0, policy_version 383662 (0.00103) [2022-07-09 19:15:33,633][26022] Updated weights on worker 0-0, policy_version 383672 (0.00094) [2022-07-09 19:15:33,911][25689] Fps is (10 sec: 5615.7, 60 sec: 5634.2, 300 sec: 5659.5). Total num frames: 392881152. Throughput: 0: 5073.9. Samples: 392874890. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:33,912][25689] Avg episode reward: [(0, '-47.368')] [2022-07-09 19:15:35,355][26022] Updated weights on worker 0-0, policy_version 383682 (0.00085) [2022-07-09 19:15:36,998][26022] Updated weights on worker 0-0, policy_version 383692 (0.00079) [2022-07-09 19:15:38,938][25689] Fps is (10 sec: 5709.6, 60 sec: 5666.3, 300 sec: 5656.6). Total num frames: 392909824. Throughput: 0: 5916.2. Samples: 392908990. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:38,939][25689] Avg episode reward: [(0, '-48.267')] [2022-07-09 19:15:39,114][26022] Updated weights on worker 0-0, policy_version 383702 (0.00086) [2022-07-09 19:15:40,677][26022] Updated weights on worker 0-0, policy_version 383712 (0.00091) [2022-07-09 19:15:42,501][26022] Updated weights on worker 0-0, policy_version 383722 (0.00086) [2022-07-09 19:15:44,017][25689] Fps is (10 sec: 5775.5, 60 sec: 5649.1, 300 sec: 5659.7). Total num frames: 392939520. Throughput: 0: 5939.0. Samples: 392943620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:44,019][25689] Avg episode reward: [(0, '-48.328')] [2022-07-09 19:15:44,319][26022] Updated weights on worker 0-0, policy_version 383732 (0.00101) [2022-07-09 19:15:46,058][26022] Updated weights on worker 0-0, policy_version 383742 (0.00082) [2022-07-09 19:15:47,969][26022] Updated weights on worker 0-0, policy_version 383752 (0.00081) [2022-07-09 19:15:49,037][25689] Fps is (10 sec: 5880.8, 60 sec: 5684.5, 300 sec: 5663.0). Total num frames: 392969216. Throughput: 0: 5962.3. Samples: 392977996. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:49,038][25689] Avg episode reward: [(0, '-48.190')] [2022-07-09 19:15:49,751][26022] Updated weights on worker 0-0, policy_version 383762 (0.00081) [2022-07-09 19:15:51,543][26022] Updated weights on worker 0-0, policy_version 383772 (0.00088) [2022-07-09 19:15:53,232][26022] Updated weights on worker 0-0, policy_version 383782 (0.00048) [2022-07-09 19:15:54,082][25689] Fps is (10 sec: 5697.2, 60 sec: 5648.1, 300 sec: 5659.0). Total num frames: 392996864. Throughput: 0: 5962.8. Samples: 392995168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:54,083][25689] Avg episode reward: [(0, '-48.489')] [2022-07-09 19:15:55,137][26022] Updated weights on worker 0-0, policy_version 383792 (0.00094) [2022-07-09 19:15:56,898][26022] Updated weights on worker 0-0, policy_version 383802 (0.00094) [2022-07-09 19:15:58,785][26022] Updated weights on worker 0-0, policy_version 383812 (0.00145) [2022-07-09 19:15:59,119][25689] Fps is (10 sec: 5586.5, 60 sec: 5644.9, 300 sec: 5667.0). Total num frames: 393025536. Throughput: 0: 5966.9. Samples: 393029406. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:15:59,119][25689] Avg episode reward: [(0, '-47.839')] [2022-07-09 19:16:00,359][26022] Updated weights on worker 0-0, policy_version 383822 (0.00085) [2022-07-09 19:16:02,647][26022] Updated weights on worker 0-0, policy_version 383832 (0.00094) [2022-07-09 19:16:04,155][25689] Fps is (10 sec: 5591.1, 60 sec: 5685.1, 300 sec: 5663.8). Total num frames: 393053184. Throughput: 0: 5841.9. Samples: 393061266. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:04,156][25689] Avg episode reward: [(0, '-46.819')] [2022-07-09 19:16:04,316][26022] Updated weights on worker 0-0, policy_version 383842 (0.00089) [2022-07-09 19:16:06,462][26022] Updated weights on worker 0-0, policy_version 383852 (0.00094) [2022-07-09 19:16:08,134][26022] Updated weights on worker 0-0, policy_version 383862 (0.00087) [2022-07-09 19:16:09,218][25689] Fps is (10 sec: 5374.0, 60 sec: 5628.6, 300 sec: 5656.5). Total num frames: 393079808. Throughput: 0: 4957.2. Samples: 393078036. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:09,220][25689] Avg episode reward: [(0, '-46.643')] [2022-07-09 19:16:09,923][26022] Updated weights on worker 0-0, policy_version 383872 (0.00094) [2022-07-09 19:16:11,772][26022] Updated weights on worker 0-0, policy_version 383882 (0.00088) [2022-07-09 19:16:13,503][26022] Updated weights on worker 0-0, policy_version 383892 (0.00098) [2022-07-09 19:16:14,254][25689] Fps is (10 sec: 5475.6, 60 sec: 5609.4, 300 sec: 5652.4). Total num frames: 393108480. Throughput: 0: 5806.3. Samples: 393112292. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:14,255][25689] Avg episode reward: [(0, '-46.426')] [2022-07-09 19:16:15,256][26022] Updated weights on worker 0-0, policy_version 383902 (0.00088) [2022-07-09 19:16:17,108][26022] Updated weights on worker 0-0, policy_version 383912 (0.00085) [2022-07-09 19:16:18,833][26022] Updated weights on worker 0-0, policy_version 383922 (0.00090) [2022-07-09 19:16:19,265][25689] Fps is (10 sec: 5911.4, 60 sec: 5693.3, 300 sec: 5664.1). Total num frames: 393139200. Throughput: 0: 5828.8. Samples: 393146834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:19,267][25689] Avg episode reward: [(0, '-46.125')] [2022-07-09 19:16:20,870][26022] Updated weights on worker 0-0, policy_version 383932 (0.00082) [2022-07-09 19:16:22,287][26022] Updated weights on worker 0-0, policy_version 383942 (0.00087) [2022-07-09 19:16:24,381][25689] Fps is (10 sec: 5662.6, 60 sec: 5636.3, 300 sec: 5655.3). Total num frames: 393165824. Throughput: 0: 5075.0. Samples: 393163910. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:24,382][25689] Avg episode reward: [(0, '-46.082')] [2022-07-09 19:16:24,464][26022] Updated weights on worker 0-0, policy_version 383952 (0.00085) [2022-07-09 19:16:25,830][26022] Updated weights on worker 0-0, policy_version 383962 (0.00090) [2022-07-09 19:16:28,103][26022] Updated weights on worker 0-0, policy_version 383972 (0.00085) [2022-07-09 19:16:29,382][25689] Fps is (10 sec: 5667.9, 60 sec: 5688.2, 300 sec: 5659.5). Total num frames: 393196544. Throughput: 0: 5946.4. Samples: 393197942. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:29,383][25689] Avg episode reward: [(0, '-46.006')] [2022-07-09 19:16:29,625][26022] Updated weights on worker 0-0, policy_version 383982 (0.00089) [2022-07-09 19:16:31,611][26022] Updated weights on worker 0-0, policy_version 383992 (0.00084) [2022-07-09 19:16:33,329][26022] Updated weights on worker 0-0, policy_version 384002 (0.00091) [2022-07-09 19:16:34,404][25689] Fps is (10 sec: 5721.1, 60 sec: 5653.8, 300 sec: 5655.8). Total num frames: 393223168. Throughput: 0: 5936.2. Samples: 393231906. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:34,405][25689] Avg episode reward: [(0, '-46.540')] [2022-07-09 19:16:35,118][26022] Updated weights on worker 0-0, policy_version 384012 (0.00086) [2022-07-09 19:16:36,936][26022] Updated weights on worker 0-0, policy_version 384022 (0.00091) [2022-07-09 19:16:38,979][26022] Updated weights on worker 0-0, policy_version 384032 (0.00088) [2022-07-09 19:16:39,413][25689] Fps is (10 sec: 5512.7, 60 sec: 5655.5, 300 sec: 5654.2). Total num frames: 393251840. Throughput: 0: 5048.7. Samples: 393248556. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:39,413][25689] Avg episode reward: [(0, '-46.058')] [2022-07-09 19:16:40,527][26022] Updated weights on worker 0-0, policy_version 384042 (0.00089) [2022-07-09 19:16:42,466][26022] Updated weights on worker 0-0, policy_version 384052 (0.00307) [2022-07-09 19:16:44,141][26022] Updated weights on worker 0-0, policy_version 384062 (0.00091) [2022-07-09 19:16:44,525][25689] Fps is (10 sec: 5766.9, 60 sec: 5652.4, 300 sec: 5654.0). Total num frames: 393281536. Throughput: 0: 5902.4. Samples: 393282810. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:44,526][25689] Avg episode reward: [(0, '-46.355')] [2022-07-09 19:16:46,028][26022] Updated weights on worker 0-0, policy_version 384072 (0.00087) [2022-07-09 19:16:47,670][26022] Updated weights on worker 0-0, policy_version 384082 (0.00092) [2022-07-09 19:16:49,397][26022] Updated weights on worker 0-0, policy_version 384092 (0.00086) [2022-07-09 19:16:49,527][25689] Fps is (10 sec: 5770.9, 60 sec: 5637.2, 300 sec: 5654.6). Total num frames: 393310208. Throughput: 0: 5919.1. Samples: 393317182. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:49,530][25689] Avg episode reward: [(0, '-46.515')] [2022-07-09 19:16:51,257][26022] Updated weights on worker 0-0, policy_version 384102 (0.00078) [2022-07-09 19:16:53,148][26022] Updated weights on worker 0-0, policy_version 384112 (0.00092) [2022-07-09 19:16:54,568][25689] Fps is (10 sec: 5709.8, 60 sec: 5654.5, 300 sec: 5654.4). Total num frames: 393338880. Throughput: 0: 5083.4. Samples: 393334408. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:54,569][25689] Avg episode reward: [(0, '-47.370')] [2022-07-09 19:16:54,871][26022] Updated weights on worker 0-0, policy_version 384122 (0.00094) [2022-07-09 19:16:56,759][26022] Updated weights on worker 0-0, policy_version 384132 (0.00089) [2022-07-09 19:16:58,267][26022] Updated weights on worker 0-0, policy_version 384142 (0.00095) [2022-07-09 19:16:59,600][25689] Fps is (10 sec: 5591.3, 60 sec: 5638.0, 300 sec: 5661.6). Total num frames: 393366528. Throughput: 0: 5971.7. Samples: 393369106. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:16:59,600][25689] Avg episode reward: [(0, '-47.199')] [2022-07-09 19:17:00,259][26022] Updated weights on worker 0-0, policy_version 384152 (0.00092) [2022-07-09 19:17:01,875][26022] Updated weights on worker 0-0, policy_version 384162 (0.00081) [2022-07-09 19:17:04,267][26022] Updated weights on worker 0-0, policy_version 384172 (0.00088) [2022-07-09 19:17:04,662][25689] Fps is (10 sec: 5478.3, 60 sec: 5635.6, 300 sec: 5654.8). Total num frames: 393394176. Throughput: 0: 5886.3. Samples: 393401340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:17:04,662][25689] Avg episode reward: [(0, '-47.387')] [2022-07-09 19:17:05,916][26022] Updated weights on worker 0-0, policy_version 384182 (0.00081) [2022-07-09 19:17:07,934][26022] Updated weights on worker 0-0, policy_version 384192 (0.00081) [2022-07-09 19:17:09,615][26022] Updated weights on worker 0-0, policy_version 384202 (0.00092) [2022-07-09 19:17:09,706][25689] Fps is (10 sec: 5572.6, 60 sec: 5671.2, 300 sec: 5654.2). Total num frames: 393422848. Throughput: 0: 5005.3. Samples: 393418186. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:17:09,707][25689] Avg episode reward: [(0, '-47.911')] [2022-07-09 19:17:11,551][26022] Updated weights on worker 0-0, policy_version 384212 (0.00084) [2022-07-09 19:17:13,121][26022] Updated weights on worker 0-0, policy_version 384222 (0.00086) [2022-07-09 19:17:14,717][25689] Fps is (10 sec: 5703.1, 60 sec: 5673.6, 300 sec: 5655.6). Total num frames: 393451520. Throughput: 0: 5859.5. Samples: 393452468. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:17:14,717][25689] Avg episode reward: [(0, '-47.713')] [2022-07-09 19:17:15,115][26022] Updated weights on worker 0-0, policy_version 384232 (0.00058) [2022-07-09 19:17:16,848][26022] Updated weights on worker 0-0, policy_version 384242 (0.00087) [2022-07-09 19:17:18,695][26022] Updated weights on worker 0-0, policy_version 384252 (0.00087) [2022-07-09 19:17:19,734][25689] Fps is (10 sec: 5616.7, 60 sec: 5622.2, 300 sec: 5653.6). Total num frames: 393479168. Throughput: 0: 5837.5. Samples: 393486638. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:17:19,734][25689] Avg episode reward: [(0, '-46.781')] [2022-07-09 19:17:20,354][26022] Updated weights on worker 0-0, policy_version 384262 (0.00085) [2022-07-09 19:17:22,320][26022] Updated weights on worker 0-0, policy_version 384272 (0.00086) [2022-07-09 19:17:24,095][26022] Updated weights on worker 0-0, policy_version 384282 (0.00090) [2022-07-09 19:17:24,817][25689] Fps is (10 sec: 5575.8, 60 sec: 5659.1, 300 sec: 5649.1). Total num frames: 393507840. Throughput: 0: 5070.8. Samples: 393503546. Policy #0 lag: (min: 0.0, avg: 8.5, max: 17.0) [2022-07-09 19:17:24,818][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 19:17:25,538][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:17:25,562][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000384290_393512960.pth [2022-07-09 19:17:25,563][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000382299_391474176.pth [2022-07-09 19:17:25,845][26022] Updated weights on worker 0-0, policy_version 384292 (0.00106) [2022-07-09 19:17:27,683][26022] Updated weights on worker 0-0, policy_version 384302 (0.00093) [2022-07-09 19:17:29,591][26022] Updated weights on worker 0-0, policy_version 384312 (0.00088) [2022-07-09 19:17:29,832][25689] Fps is (10 sec: 5577.4, 60 sec: 5607.1, 300 sec: 5645.4). Total num frames: 393535488. Throughput: 0: 5915.9. Samples: 393537244. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:29,832][25689] Avg episode reward: [(0, '-46.833')] [2022-07-09 19:17:31,378][26022] Updated weights on worker 0-0, policy_version 384322 (0.00084) [2022-07-09 19:17:33,205][26022] Updated weights on worker 0-0, policy_version 384332 (0.00086) [2022-07-09 19:17:34,833][26022] Updated weights on worker 0-0, policy_version 384342 (0.00082) [2022-07-09 19:17:34,834][25689] Fps is (10 sec: 5724.9, 60 sec: 5659.7, 300 sec: 5656.2). Total num frames: 393565184. Throughput: 0: 5920.9. Samples: 393571580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:34,834][25689] Avg episode reward: [(0, '-46.726')] [2022-07-09 19:17:37,024][26022] Updated weights on worker 0-0, policy_version 384352 (0.00095) [2022-07-09 19:17:38,376][26022] Updated weights on worker 0-0, policy_version 384362 (0.00092) [2022-07-09 19:17:39,859][25689] Fps is (10 sec: 5821.0, 60 sec: 5658.2, 300 sec: 5653.4). Total num frames: 393593856. Throughput: 0: 5058.3. Samples: 393588436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:39,859][25689] Avg episode reward: [(0, '-47.365')] [2022-07-09 19:17:40,577][26022] Updated weights on worker 0-0, policy_version 384372 (0.00086) [2022-07-09 19:17:42,087][26022] Updated weights on worker 0-0, policy_version 384382 (0.00085) [2022-07-09 19:17:43,852][26022] Updated weights on worker 0-0, policy_version 384392 (0.00097) [2022-07-09 19:17:44,917][25689] Fps is (10 sec: 5687.0, 60 sec: 5646.3, 300 sec: 5649.1). Total num frames: 393622528. Throughput: 0: 5933.5. Samples: 393622810. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:44,918][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 19:17:45,815][26022] Updated weights on worker 0-0, policy_version 384402 (0.00081) [2022-07-09 19:17:47,526][26022] Updated weights on worker 0-0, policy_version 384412 (0.00085) [2022-07-09 19:17:49,382][26022] Updated weights on worker 0-0, policy_version 384422 (0.00090) [2022-07-09 19:17:49,923][25689] Fps is (10 sec: 5799.4, 60 sec: 5662.9, 300 sec: 5652.5). Total num frames: 393652224. Throughput: 0: 5973.7. Samples: 393657266. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:49,924][25689] Avg episode reward: [(0, '-48.147')] [2022-07-09 19:17:51,321][26022] Updated weights on worker 0-0, policy_version 384432 (0.00095) [2022-07-09 19:17:52,856][26022] Updated weights on worker 0-0, policy_version 384442 (0.00099) [2022-07-09 19:17:54,926][25689] Fps is (10 sec: 5525.0, 60 sec: 5615.6, 300 sec: 5645.6). Total num frames: 393677824. Throughput: 0: 5108.6. Samples: 393674220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:54,926][25689] Avg episode reward: [(0, '-47.193')] [2022-07-09 19:17:54,935][26022] Updated weights on worker 0-0, policy_version 384452 (0.00102) [2022-07-09 19:17:56,471][26022] Updated weights on worker 0-0, policy_version 384462 (0.00095) [2022-07-09 19:17:58,381][26022] Updated weights on worker 0-0, policy_version 384472 (0.00092) [2022-07-09 19:17:59,934][25689] Fps is (10 sec: 5523.9, 60 sec: 5651.8, 300 sec: 5656.5). Total num frames: 393707520. Throughput: 0: 5971.7. Samples: 393708318. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:17:59,934][25689] Avg episode reward: [(0, '-47.292')] [2022-07-09 19:18:00,312][26022] Updated weights on worker 0-0, policy_version 384482 (0.00089) [2022-07-09 19:18:01,871][26022] Updated weights on worker 0-0, policy_version 384492 (0.00086) [2022-07-09 19:18:04,228][26022] Updated weights on worker 0-0, policy_version 384502 (0.00091) [2022-07-09 19:18:05,012][25689] Fps is (10 sec: 5786.7, 60 sec: 5667.2, 300 sec: 5651.9). Total num frames: 393736192. Throughput: 0: 5855.6. Samples: 393740478. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:05,013][25689] Avg episode reward: [(0, '-46.992')] [2022-07-09 19:18:05,899][26022] Updated weights on worker 0-0, policy_version 384512 (0.00082) [2022-07-09 19:18:07,731][26022] Updated weights on worker 0-0, policy_version 384522 (0.00100) [2022-07-09 19:18:09,448][26022] Updated weights on worker 0-0, policy_version 384532 (0.00079) [2022-07-09 19:18:10,077][25689] Fps is (10 sec: 5552.1, 60 sec: 5648.3, 300 sec: 5651.0). Total num frames: 393763840. Throughput: 0: 4976.3. Samples: 393757560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:10,078][25689] Avg episode reward: [(0, '-47.081')] [2022-07-09 19:18:11,382][26022] Updated weights on worker 0-0, policy_version 384542 (0.00087) [2022-07-09 19:18:12,952][26022] Updated weights on worker 0-0, policy_version 384552 (0.00091) [2022-07-09 19:18:15,019][26022] Updated weights on worker 0-0, policy_version 384562 (0.00978) [2022-07-09 19:18:15,105][25689] Fps is (10 sec: 5478.6, 60 sec: 5629.7, 300 sec: 5647.6). Total num frames: 393791488. Throughput: 0: 5836.8. Samples: 393792004. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:15,106][25689] Avg episode reward: [(0, '-47.792')] [2022-07-09 19:18:16,659][26022] Updated weights on worker 0-0, policy_version 384572 (0.00094) [2022-07-09 19:18:18,442][26022] Updated weights on worker 0-0, policy_version 384582 (0.00081) [2022-07-09 19:18:20,101][26022] Updated weights on worker 0-0, policy_version 384592 (0.00089) [2022-07-09 19:18:20,140][25689] Fps is (10 sec: 5800.2, 60 sec: 5678.8, 300 sec: 5654.8). Total num frames: 393822208. Throughput: 0: 5852.6. Samples: 393826582. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:20,141][25689] Avg episode reward: [(0, '-47.286')] [2022-07-09 19:18:22,090][26022] Updated weights on worker 0-0, policy_version 384602 (0.00094) [2022-07-09 19:18:23,647][26022] Updated weights on worker 0-0, policy_version 384612 (0.00088) [2022-07-09 19:18:25,266][25689] Fps is (10 sec: 5744.5, 60 sec: 5658.0, 300 sec: 5652.6). Total num frames: 393849856. Throughput: 0: 5104.4. Samples: 393843862. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:25,268][25689] Avg episode reward: [(0, '-46.982')] [2022-07-09 19:18:25,666][26022] Updated weights on worker 0-0, policy_version 384622 (0.00085) [2022-07-09 19:18:27,220][26022] Updated weights on worker 0-0, policy_version 384632 (0.00093) [2022-07-09 19:18:29,314][26022] Updated weights on worker 0-0, policy_version 384642 (0.00090) [2022-07-09 19:18:30,330][25689] Fps is (10 sec: 5627.6, 60 sec: 5687.1, 300 sec: 5652.1). Total num frames: 393879552. Throughput: 0: 5942.4. Samples: 393877910. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:30,332][25689] Avg episode reward: [(0, '-47.233')] [2022-07-09 19:18:30,908][26022] Updated weights on worker 0-0, policy_version 384652 (0.00108) [2022-07-09 19:18:32,967][26022] Updated weights on worker 0-0, policy_version 384662 (0.00098) [2022-07-09 19:18:34,507][26022] Updated weights on worker 0-0, policy_version 384672 (0.00083) [2022-07-09 19:18:35,431][25689] Fps is (10 sec: 5742.0, 60 sec: 5661.0, 300 sec: 5650.5). Total num frames: 393908224. Throughput: 0: 5906.8. Samples: 393912064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:35,433][25689] Avg episode reward: [(0, '-47.211')] [2022-07-09 19:18:36,291][26022] Updated weights on worker 0-0, policy_version 384682 (0.00084) [2022-07-09 19:18:38,436][26022] Updated weights on worker 0-0, policy_version 384692 (0.00095) [2022-07-09 19:18:39,982][26022] Updated weights on worker 0-0, policy_version 384702 (0.00088) [2022-07-09 19:18:40,488][25689] Fps is (10 sec: 5544.5, 60 sec: 5641.1, 300 sec: 5650.3). Total num frames: 393935872. Throughput: 0: 5861.3. Samples: 393945846. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:40,489][25689] Avg episode reward: [(0, '-47.027')] [2022-07-09 19:18:41,915][26022] Updated weights on worker 0-0, policy_version 384712 (0.00084) [2022-07-09 19:18:43,783][26022] Updated weights on worker 0-0, policy_version 384722 (0.00105) [2022-07-09 19:18:45,286][26022] Updated weights on worker 0-0, policy_version 384732 (0.00090) [2022-07-09 19:18:45,567][25689] Fps is (10 sec: 5758.7, 60 sec: 5673.0, 300 sec: 5656.0). Total num frames: 393966592. Throughput: 0: 5861.3. Samples: 393962850. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:45,567][25689] Avg episode reward: [(0, '-46.850')] [2022-07-09 19:18:47,497][26022] Updated weights on worker 0-0, policy_version 384742 (0.00090) [2022-07-09 19:18:48,818][26022] Updated weights on worker 0-0, policy_version 384752 (0.00089) [2022-07-09 19:18:50,591][25689] Fps is (10 sec: 5675.9, 60 sec: 5620.6, 300 sec: 5652.4). Total num frames: 393993216. Throughput: 0: 5890.5. Samples: 393997256. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:50,592][25689] Avg episode reward: [(0, '-47.590')] [2022-07-09 19:18:51,017][26022] Updated weights on worker 0-0, policy_version 384763 (0.00095) [2022-07-09 19:18:52,852][26022] Updated weights on worker 0-0, policy_version 384773 (0.00083) [2022-07-09 19:18:54,551][26022] Updated weights on worker 0-0, policy_version 384783 (0.00088) [2022-07-09 19:18:55,639][25689] Fps is (10 sec: 5591.7, 60 sec: 5683.9, 300 sec: 5648.5). Total num frames: 394022912. Throughput: 0: 5895.6. Samples: 394031200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:18:55,647][25689] Avg episode reward: [(0, '-47.476')] [2022-07-09 19:18:56,637][26022] Updated weights on worker 0-0, policy_version 384793 (0.00048) [2022-07-09 19:18:58,216][26022] Updated weights on worker 0-0, policy_version 384803 (0.00084) [2022-07-09 19:18:59,950][26022] Updated weights on worker 0-0, policy_version 384813 (0.00084) [2022-07-09 19:19:00,652][25689] Fps is (10 sec: 5801.5, 60 sec: 5666.6, 300 sec: 5662.9). Total num frames: 394051584. Throughput: 0: 5079.3. Samples: 394048264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:00,652][25689] Avg episode reward: [(0, '-47.787')] [2022-07-09 19:19:02,273][26022] Updated weights on worker 0-0, policy_version 384823 (0.00083) [2022-07-09 19:19:03,982][26022] Updated weights on worker 0-0, policy_version 384833 (0.00087) [2022-07-09 19:19:05,730][25689] Fps is (10 sec: 5378.0, 60 sec: 5616.0, 300 sec: 5651.2). Total num frames: 394077184. Throughput: 0: 5827.0. Samples: 394080344. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:05,731][25689] Avg episode reward: [(0, '-48.520')] [2022-07-09 19:19:05,810][26022] Updated weights on worker 0-0, policy_version 384843 (0.00098) [2022-07-09 19:19:07,527][26022] Updated weights on worker 0-0, policy_version 384853 (0.00093) [2022-07-09 19:19:09,673][26022] Updated weights on worker 0-0, policy_version 384863 (0.00091) [2022-07-09 19:19:10,743][25689] Fps is (10 sec: 5479.9, 60 sec: 5654.6, 300 sec: 5654.6). Total num frames: 394106880. Throughput: 0: 5810.2. Samples: 394114340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:10,743][25689] Avg episode reward: [(0, '-48.876')] [2022-07-09 19:19:11,254][26022] Updated weights on worker 0-0, policy_version 384873 (0.00083) [2022-07-09 19:19:13,258][26022] Updated weights on worker 0-0, policy_version 384883 (0.00090) [2022-07-09 19:19:14,919][26022] Updated weights on worker 0-0, policy_version 384893 (0.00081) [2022-07-09 19:19:15,785][25689] Fps is (10 sec: 5703.0, 60 sec: 5653.3, 300 sec: 5650.9). Total num frames: 394134528. Throughput: 0: 4978.1. Samples: 394131492. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:15,786][25689] Avg episode reward: [(0, '-48.572')] [2022-07-09 19:19:16,648][26022] Updated weights on worker 0-0, policy_version 384903 (0.00085) [2022-07-09 19:19:18,611][26022] Updated weights on worker 0-0, policy_version 384913 (0.00091) [2022-07-09 19:19:20,131][26022] Updated weights on worker 0-0, policy_version 384923 (0.00084) [2022-07-09 19:19:20,827][25689] Fps is (10 sec: 5584.9, 60 sec: 5618.9, 300 sec: 5655.5). Total num frames: 394163200. Throughput: 0: 5828.6. Samples: 394165856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:20,828][25689] Avg episode reward: [(0, '-48.336')] [2022-07-09 19:19:21,952][26022] Updated weights on worker 0-0, policy_version 384933 (0.00083) [2022-07-09 19:19:23,859][26022] Updated weights on worker 0-0, policy_version 384943 (0.00092) [2022-07-09 19:19:25,626][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:19:25,640][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000384953_394191872.pth [2022-07-09 19:19:25,640][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000382964_392155136.pth [2022-07-09 19:19:25,654][26022] Updated weights on worker 0-0, policy_version 384953 (0.00332) [2022-07-09 19:19:25,883][25689] Fps is (10 sec: 5881.6, 60 sec: 5676.0, 300 sec: 5654.7). Total num frames: 394193920. Throughput: 0: 5953.9. Samples: 394200332. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:25,884][25689] Avg episode reward: [(0, '-48.341')] [2022-07-09 19:19:27,527][26022] Updated weights on worker 0-0, policy_version 384963 (0.00087) [2022-07-09 19:19:28,981][26022] Updated weights on worker 0-0, policy_version 384973 (0.00087) [2022-07-09 19:19:30,955][25689] Fps is (10 sec: 5661.8, 60 sec: 5624.6, 300 sec: 5647.1). Total num frames: 394220544. Throughput: 0: 5097.8. Samples: 394217380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:30,956][25689] Avg episode reward: [(0, '-48.141')] [2022-07-09 19:19:31,182][26022] Updated weights on worker 0-0, policy_version 384983 (0.00086) [2022-07-09 19:19:32,785][26022] Updated weights on worker 0-0, policy_version 384993 (0.00088) [2022-07-09 19:19:34,631][26022] Updated weights on worker 0-0, policy_version 385003 (0.00458) [2022-07-09 19:19:35,967][25689] Fps is (10 sec: 5686.8, 60 sec: 5666.7, 300 sec: 5660.8). Total num frames: 394251264. Throughput: 0: 5968.0. Samples: 394251936. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:35,969][25689] Avg episode reward: [(0, '-47.233')] [2022-07-09 19:19:36,239][26022] Updated weights on worker 0-0, policy_version 385013 (0.00092) [2022-07-09 19:19:38,269][26022] Updated weights on worker 0-0, policy_version 385023 (0.00090) [2022-07-09 19:19:39,956][26022] Updated weights on worker 0-0, policy_version 385033 (0.00087) [2022-07-09 19:19:40,995][25689] Fps is (10 sec: 5813.9, 60 sec: 5669.4, 300 sec: 5651.4). Total num frames: 394278912. Throughput: 0: 5960.7. Samples: 394286070. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:40,995][25689] Avg episode reward: [(0, '-47.921')] [2022-07-09 19:19:41,862][26022] Updated weights on worker 0-0, policy_version 385043 (0.00092) [2022-07-09 19:19:43,479][26022] Updated weights on worker 0-0, policy_version 385053 (0.00085) [2022-07-09 19:19:45,341][26022] Updated weights on worker 0-0, policy_version 385063 (0.00092) [2022-07-09 19:19:46,133][25689] Fps is (10 sec: 5540.0, 60 sec: 5630.0, 300 sec: 5652.9). Total num frames: 394307584. Throughput: 0: 5077.5. Samples: 394303150. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:46,134][25689] Avg episode reward: [(0, '-48.377')] [2022-07-09 19:19:47,207][26022] Updated weights on worker 0-0, policy_version 385073 (0.00090) [2022-07-09 19:19:49,082][26022] Updated weights on worker 0-0, policy_version 385083 (0.00088) [2022-07-09 19:19:50,586][26022] Updated weights on worker 0-0, policy_version 385093 (0.00879) [2022-07-09 19:19:51,140][25689] Fps is (10 sec: 5753.2, 60 sec: 5682.4, 300 sec: 5653.1). Total num frames: 394337280. Throughput: 0: 5965.5. Samples: 394337792. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:51,141][25689] Avg episode reward: [(0, '-48.008')] [2022-07-09 19:19:52,590][26022] Updated weights on worker 0-0, policy_version 385103 (0.00089) [2022-07-09 19:19:54,161][26022] Updated weights on worker 0-0, policy_version 385113 (0.00089) [2022-07-09 19:19:56,149][25689] Fps is (10 sec: 5725.4, 60 sec: 5652.2, 300 sec: 5649.5). Total num frames: 394364928. Throughput: 0: 5954.9. Samples: 394372116. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:19:56,149][25689] Avg episode reward: [(0, '-46.668')] [2022-07-09 19:19:56,263][26022] Updated weights on worker 0-0, policy_version 385123 (0.00083) [2022-07-09 19:19:57,773][26022] Updated weights on worker 0-0, policy_version 385133 (0.00087) [2022-07-09 19:19:59,596][26022] Updated weights on worker 0-0, policy_version 385143 (0.00085) [2022-07-09 19:20:01,175][25689] Fps is (10 sec: 5816.7, 60 sec: 5684.9, 300 sec: 5668.2). Total num frames: 394395648. Throughput: 0: 5115.9. Samples: 394389308. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 19:20:01,175][25689] Avg episode reward: [(0, '-47.111')] [2022-07-09 19:20:01,299][26022] Updated weights on worker 0-0, policy_version 385153 (0.00081) [2022-07-09 19:20:03,722][26022] Updated weights on worker 0-0, policy_version 385163 (0.00074) [2022-07-09 19:20:05,474][26022] Updated weights on worker 0-0, policy_version 385173 (0.00082) [2022-07-09 19:20:06,303][25689] Fps is (10 sec: 5445.4, 60 sec: 5663.2, 300 sec: 5648.6). Total num frames: 394420224. Throughput: 0: 5857.0. Samples: 394421286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:06,304][25689] Avg episode reward: [(0, '-46.915')] [2022-07-09 19:20:07,245][26022] Updated weights on worker 0-0, policy_version 385183 (0.00089) [2022-07-09 19:20:09,106][26022] Updated weights on worker 0-0, policy_version 385193 (0.00087) [2022-07-09 19:20:10,875][26022] Updated weights on worker 0-0, policy_version 385203 (0.00091) [2022-07-09 19:20:11,388][25689] Fps is (10 sec: 5313.9, 60 sec: 5656.5, 300 sec: 5647.2). Total num frames: 394449920. Throughput: 0: 5787.6. Samples: 394454978. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:11,389][25689] Avg episode reward: [(0, '-48.133')] [2022-07-09 19:20:12,979][26022] Updated weights on worker 0-0, policy_version 385213 (0.00091) [2022-07-09 19:20:14,495][26022] Updated weights on worker 0-0, policy_version 385223 (0.00092) [2022-07-09 19:20:16,428][25689] Fps is (10 sec: 5664.0, 60 sec: 5656.8, 300 sec: 5653.4). Total num frames: 394477568. Throughput: 0: 4919.4. Samples: 394471876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:16,428][25689] Avg episode reward: [(0, '-48.078')] [2022-07-09 19:20:16,444][26022] Updated weights on worker 0-0, policy_version 385233 (0.00085) [2022-07-09 19:20:18,199][26022] Updated weights on worker 0-0, policy_version 385243 (0.00083) [2022-07-09 19:20:20,023][26022] Updated weights on worker 0-0, policy_version 385253 (0.00096) [2022-07-09 19:20:21,510][25689] Fps is (10 sec: 5563.9, 60 sec: 5653.0, 300 sec: 5649.4). Total num frames: 394506240. Throughput: 0: 5718.8. Samples: 394505602. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:21,511][25689] Avg episode reward: [(0, '-48.805')] [2022-07-09 19:20:22,064][26022] Updated weights on worker 0-0, policy_version 385263 (0.00091) [2022-07-09 19:20:23,764][26022] Updated weights on worker 0-0, policy_version 385273 (0.00095) [2022-07-09 19:20:25,414][26022] Updated weights on worker 0-0, policy_version 385283 (0.00095) [2022-07-09 19:20:26,608][25689] Fps is (10 sec: 5632.9, 60 sec: 5615.4, 300 sec: 5651.2). Total num frames: 394534912. Throughput: 0: 5810.9. Samples: 394539274. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:26,608][25689] Avg episode reward: [(0, '-48.764')] [2022-07-09 19:20:27,360][26022] Updated weights on worker 0-0, policy_version 385293 (0.00094) [2022-07-09 19:20:29,112][26022] Updated weights on worker 0-0, policy_version 385303 (0.00086) [2022-07-09 19:20:31,168][26022] Updated weights on worker 0-0, policy_version 385313 (0.00087) [2022-07-09 19:20:31,622][25689] Fps is (10 sec: 5670.8, 60 sec: 5654.5, 300 sec: 5651.2). Total num frames: 394563584. Throughput: 0: 5844.8. Samples: 394573244. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:31,623][25689] Avg episode reward: [(0, '-48.686')] [2022-07-09 19:20:32,635][26022] Updated weights on worker 0-0, policy_version 385323 (0.00092) [2022-07-09 19:20:34,537][26022] Updated weights on worker 0-0, policy_version 385333 (0.00092) [2022-07-09 19:20:36,531][26022] Updated weights on worker 0-0, policy_version 385343 (0.00083) [2022-07-09 19:20:36,630][25689] Fps is (10 sec: 5619.5, 60 sec: 5604.2, 300 sec: 5648.1). Total num frames: 394591232. Throughput: 0: 5872.5. Samples: 394590514. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:36,631][25689] Avg episode reward: [(0, '-47.847')] [2022-07-09 19:20:38,191][26022] Updated weights on worker 0-0, policy_version 385353 (0.00086) [2022-07-09 19:20:40,149][26022] Updated weights on worker 0-0, policy_version 385363 (0.00088) [2022-07-09 19:20:41,627][26022] Updated weights on worker 0-0, policy_version 385373 (0.00083) [2022-07-09 19:20:41,725][25689] Fps is (10 sec: 5777.2, 60 sec: 5648.6, 300 sec: 5651.3). Total num frames: 394621952. Throughput: 0: 5904.5. Samples: 394624962. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:41,726][25689] Avg episode reward: [(0, '-46.792')] [2022-07-09 19:20:43,678][26022] Updated weights on worker 0-0, policy_version 385383 (0.00100) [2022-07-09 19:20:45,412][26022] Updated weights on worker 0-0, policy_version 385393 (0.01060) [2022-07-09 19:20:46,762][25689] Fps is (10 sec: 5861.7, 60 sec: 5658.0, 300 sec: 5647.5). Total num frames: 394650624. Throughput: 0: 5958.6. Samples: 394659366. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:46,763][25689] Avg episode reward: [(0, '-46.669')] [2022-07-09 19:20:47,070][26022] Updated weights on worker 0-0, policy_version 385403 (0.00101) [2022-07-09 19:20:48,963][26022] Updated weights on worker 0-0, policy_version 385413 (0.00085) [2022-07-09 19:20:50,595][26022] Updated weights on worker 0-0, policy_version 385423 (0.00090) [2022-07-09 19:20:51,843][25689] Fps is (10 sec: 5566.6, 60 sec: 5617.4, 300 sec: 5646.8). Total num frames: 394678272. Throughput: 0: 5115.4. Samples: 394676680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:51,843][25689] Avg episode reward: [(0, '-46.521')] [2022-07-09 19:20:52,537][26022] Updated weights on worker 0-0, policy_version 385433 (0.00087) [2022-07-09 19:20:54,079][26022] Updated weights on worker 0-0, policy_version 385443 (0.00088) [2022-07-09 19:20:56,136][26022] Updated weights on worker 0-0, policy_version 385453 (0.00090) [2022-07-09 19:20:56,866][25689] Fps is (10 sec: 5675.6, 60 sec: 5649.8, 300 sec: 5650.5). Total num frames: 394707968. Throughput: 0: 5941.1. Samples: 394710738. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:20:56,868][25689] Avg episode reward: [(0, '-45.749')] [2022-07-09 19:20:57,845][26022] Updated weights on worker 0-0, policy_version 385463 (0.00089) [2022-07-09 19:20:59,888][26022] Updated weights on worker 0-0, policy_version 385473 (0.00094) [2022-07-09 19:21:01,575][26022] Updated weights on worker 0-0, policy_version 385483 (0.00089) [2022-07-09 19:21:01,893][25689] Fps is (10 sec: 5807.8, 60 sec: 5616.0, 300 sec: 5654.2). Total num frames: 394736640. Throughput: 0: 5935.8. Samples: 394744672. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:01,893][25689] Avg episode reward: [(0, '-45.958')] [2022-07-09 19:21:03,815][26022] Updated weights on worker 0-0, policy_version 385493 (0.00086) [2022-07-09 19:21:05,531][26022] Updated weights on worker 0-0, policy_version 385503 (0.00082) [2022-07-09 19:21:07,017][25689] Fps is (10 sec: 5346.3, 60 sec: 5633.2, 300 sec: 5649.5). Total num frames: 394762240. Throughput: 0: 4947.5. Samples: 394759578. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:07,018][25689] Avg episode reward: [(0, '-46.684')] [2022-07-09 19:21:07,351][26022] Updated weights on worker 0-0, policy_version 385513 (0.00080) [2022-07-09 19:21:09,208][26022] Updated weights on worker 0-0, policy_version 385523 (0.00088) [2022-07-09 19:21:10,960][26022] Updated weights on worker 0-0, policy_version 385533 (0.00090) [2022-07-09 19:21:12,056][25689] Fps is (10 sec: 5440.9, 60 sec: 5637.5, 300 sec: 5652.9). Total num frames: 394791936. Throughput: 0: 5799.2. Samples: 394793898. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:12,057][25689] Avg episode reward: [(0, '-47.704')] [2022-07-09 19:21:12,513][26022] Updated weights on worker 0-0, policy_version 385543 (0.00089) [2022-07-09 19:21:14,606][26022] Updated weights on worker 0-0, policy_version 385553 (0.00090) [2022-07-09 19:21:16,268][26022] Updated weights on worker 0-0, policy_version 385563 (0.00091) [2022-07-09 19:21:17,061][25689] Fps is (10 sec: 5709.4, 60 sec: 5640.7, 300 sec: 5642.7). Total num frames: 394819584. Throughput: 0: 5803.7. Samples: 394827944. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:17,063][25689] Avg episode reward: [(0, '-48.083')] [2022-07-09 19:21:18,063][26022] Updated weights on worker 0-0, policy_version 385573 (0.00081) [2022-07-09 19:21:20,064][26022] Updated weights on worker 0-0, policy_version 385583 (0.00087) [2022-07-09 19:21:21,621][26022] Updated weights on worker 0-0, policy_version 385593 (0.00085) [2022-07-09 19:21:22,098][25689] Fps is (10 sec: 5710.4, 60 sec: 5661.9, 300 sec: 5654.6). Total num frames: 394849280. Throughput: 0: 4966.6. Samples: 394845022. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:22,098][25689] Avg episode reward: [(0, '-48.198')] [2022-07-09 19:21:23,746][26022] Updated weights on worker 0-0, policy_version 385603 (0.00087) [2022-07-09 19:21:25,357][26022] Updated weights on worker 0-0, policy_version 385613 (0.00087) [2022-07-09 19:21:25,680][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:21:25,690][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000385615_394869760.pth [2022-07-09 19:21:25,690][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000383627_392834048.pth [2022-07-09 19:21:27,170][25689] Fps is (10 sec: 5672.6, 60 sec: 5647.3, 300 sec: 5642.9). Total num frames: 394876928. Throughput: 0: 5918.9. Samples: 394878860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:27,171][25689] Avg episode reward: [(0, '-49.541')] [2022-07-09 19:21:27,215][26022] Updated weights on worker 0-0, policy_version 385623 (0.00081) [2022-07-09 19:21:28,968][26022] Updated weights on worker 0-0, policy_version 385633 (0.00092) [2022-07-09 19:21:30,838][26022] Updated weights on worker 0-0, policy_version 385643 (0.00090) [2022-07-09 19:21:32,179][25689] Fps is (10 sec: 5586.7, 60 sec: 5647.9, 300 sec: 5650.0). Total num frames: 394905600. Throughput: 0: 5915.0. Samples: 394912926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:32,179][25689] Avg episode reward: [(0, '-48.680')] [2022-07-09 19:21:32,635][26022] Updated weights on worker 0-0, policy_version 385653 (0.00088) [2022-07-09 19:21:34,454][26022] Updated weights on worker 0-0, policy_version 385663 (0.00093) [2022-07-09 19:21:36,094][26022] Updated weights on worker 0-0, policy_version 385673 (0.00093) [2022-07-09 19:21:37,202][25689] Fps is (10 sec: 5715.9, 60 sec: 5663.3, 300 sec: 5649.7). Total num frames: 394934272. Throughput: 0: 5081.3. Samples: 394930288. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:37,203][25689] Avg episode reward: [(0, '-48.536')] [2022-07-09 19:21:38,186][26022] Updated weights on worker 0-0, policy_version 385683 (0.00089) [2022-07-09 19:21:39,702][26022] Updated weights on worker 0-0, policy_version 385693 (0.00083) [2022-07-09 19:21:41,612][26022] Updated weights on worker 0-0, policy_version 385703 (0.00087) [2022-07-09 19:21:42,213][25689] Fps is (10 sec: 5816.7, 60 sec: 5654.3, 300 sec: 5651.7). Total num frames: 394963968. Throughput: 0: 5952.4. Samples: 394964758. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:42,214][25689] Avg episode reward: [(0, '-48.146')] [2022-07-09 19:21:43,415][26022] Updated weights on worker 0-0, policy_version 385713 (0.00085) [2022-07-09 19:21:45,163][26022] Updated weights on worker 0-0, policy_version 385723 (0.00098) [2022-07-09 19:21:46,954][26022] Updated weights on worker 0-0, policy_version 385733 (0.00079) [2022-07-09 19:21:47,259][25689] Fps is (10 sec: 5702.3, 60 sec: 5636.6, 300 sec: 5647.4). Total num frames: 394991616. Throughput: 0: 5964.6. Samples: 394998680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:47,259][25689] Avg episode reward: [(0, '-47.351')] [2022-07-09 19:21:48,656][26022] Updated weights on worker 0-0, policy_version 385743 (0.00080) [2022-07-09 19:21:50,704][26022] Updated weights on worker 0-0, policy_version 385753 (0.00095) [2022-07-09 19:21:52,276][25689] Fps is (10 sec: 5698.6, 60 sec: 5676.3, 300 sec: 5651.3). Total num frames: 395021312. Throughput: 0: 5119.2. Samples: 395015808. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:52,277][25689] Avg episode reward: [(0, '-46.679')] [2022-07-09 19:21:52,288][26022] Updated weights on worker 0-0, policy_version 385763 (0.00088) [2022-07-09 19:21:54,156][26022] Updated weights on worker 0-0, policy_version 385773 (0.00082) [2022-07-09 19:21:55,943][26022] Updated weights on worker 0-0, policy_version 385783 (0.00087) [2022-07-09 19:21:57,297][25689] Fps is (10 sec: 5712.3, 60 sec: 5642.6, 300 sec: 5651.5). Total num frames: 395048960. Throughput: 0: 5967.2. Samples: 395050198. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:21:57,298][25689] Avg episode reward: [(0, '-45.873')] [2022-07-09 19:21:57,720][26022] Updated weights on worker 0-0, policy_version 385793 (0.00097) [2022-07-09 19:21:59,551][26022] Updated weights on worker 0-0, policy_version 385803 (0.00093) [2022-07-09 19:22:01,419][26022] Updated weights on worker 0-0, policy_version 385813 (0.00087) [2022-07-09 19:22:02,320][25689] Fps is (10 sec: 5505.4, 60 sec: 5626.1, 300 sec: 5652.2). Total num frames: 395076608. Throughput: 0: 5945.7. Samples: 395084306. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:02,322][25689] Avg episode reward: [(0, '-46.210')] [2022-07-09 19:22:03,464][26022] Updated weights on worker 0-0, policy_version 385823 (0.00085) [2022-07-09 19:22:05,454][26022] Updated weights on worker 0-0, policy_version 385833 (0.00094) [2022-07-09 19:22:07,167][26022] Updated weights on worker 0-0, policy_version 385843 (0.00094) [2022-07-09 19:22:07,413][25689] Fps is (10 sec: 5567.5, 60 sec: 5679.9, 300 sec: 5651.3). Total num frames: 395105280. Throughput: 0: 4996.0. Samples: 395099372. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:07,414][25689] Avg episode reward: [(0, '-45.070')] [2022-07-09 19:22:08,971][26022] Updated weights on worker 0-0, policy_version 385853 (0.00077) [2022-07-09 19:22:10,813][26022] Updated weights on worker 0-0, policy_version 385863 (0.00094) [2022-07-09 19:22:12,432][25689] Fps is (10 sec: 5569.8, 60 sec: 5647.8, 300 sec: 5647.7). Total num frames: 395132928. Throughput: 0: 5847.4. Samples: 395133666. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:12,433][25689] Avg episode reward: [(0, '-45.908')] [2022-07-09 19:22:12,444][26022] Updated weights on worker 0-0, policy_version 385873 (0.00088) [2022-07-09 19:22:14,445][26022] Updated weights on worker 0-0, policy_version 385883 (0.00538) [2022-07-09 19:22:16,161][26022] Updated weights on worker 0-0, policy_version 385893 (0.00091) [2022-07-09 19:22:17,472][25689] Fps is (10 sec: 5598.8, 60 sec: 5661.5, 300 sec: 5650.7). Total num frames: 395161600. Throughput: 0: 5839.9. Samples: 395168018. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:17,473][25689] Avg episode reward: [(0, '-46.339')] [2022-07-09 19:22:17,882][26022] Updated weights on worker 0-0, policy_version 385903 (0.00087) [2022-07-09 19:22:19,798][26022] Updated weights on worker 0-0, policy_version 385913 (0.00090) [2022-07-09 19:22:21,589][26022] Updated weights on worker 0-0, policy_version 385923 (0.00090) [2022-07-09 19:22:22,480][25689] Fps is (10 sec: 5808.9, 60 sec: 5664.2, 300 sec: 5655.6). Total num frames: 395191296. Throughput: 0: 5004.0. Samples: 395185184. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:22,480][25689] Avg episode reward: [(0, '-46.297')] [2022-07-09 19:22:23,218][26022] Updated weights on worker 0-0, policy_version 385933 (0.00088) [2022-07-09 19:22:25,012][26022] Updated weights on worker 0-0, policy_version 385943 (0.00052) [2022-07-09 19:22:26,846][26022] Updated weights on worker 0-0, policy_version 385953 (0.00089) [2022-07-09 19:22:27,593][25689] Fps is (10 sec: 5666.3, 60 sec: 5660.4, 300 sec: 5653.7). Total num frames: 395218944. Throughput: 0: 5950.0. Samples: 395219440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:27,593][25689] Avg episode reward: [(0, '-46.969')] [2022-07-09 19:22:28,726][26022] Updated weights on worker 0-0, policy_version 385963 (0.00090) [2022-07-09 19:22:30,390][26022] Updated weights on worker 0-0, policy_version 385973 (0.00084) [2022-07-09 19:22:32,297][26022] Updated weights on worker 0-0, policy_version 385983 (0.00085) [2022-07-09 19:22:32,653][25689] Fps is (10 sec: 5737.3, 60 sec: 5689.4, 300 sec: 5656.1). Total num frames: 395249664. Throughput: 0: 5921.6. Samples: 395253410. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:32,654][25689] Avg episode reward: [(0, '-47.521')] [2022-07-09 19:22:34,202][26022] Updated weights on worker 0-0, policy_version 385993 (0.00080) [2022-07-09 19:22:35,868][26022] Updated weights on worker 0-0, policy_version 386003 (0.00095) [2022-07-09 19:22:37,693][25689] Fps is (10 sec: 5677.1, 60 sec: 5654.0, 300 sec: 5648.9). Total num frames: 395276288. Throughput: 0: 5074.8. Samples: 395270636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-09 19:22:37,694][25689] Avg episode reward: [(0, '-47.590')] [2022-07-09 19:22:37,729][26022] Updated weights on worker 0-0, policy_version 386013 (0.00086) [2022-07-09 19:22:39,356][26022] Updated weights on worker 0-0, policy_version 386023 (0.00083) [2022-07-09 19:22:41,404][26022] Updated weights on worker 0-0, policy_version 386033 (0.00087) [2022-07-09 19:22:42,789][25689] Fps is (10 sec: 5556.7, 60 sec: 5646.1, 300 sec: 5651.6). Total num frames: 395305984. Throughput: 0: 5889.0. Samples: 395304784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:22:42,789][25689] Avg episode reward: [(0, '-46.830')] [2022-07-09 19:22:43,022][26022] Updated weights on worker 0-0, policy_version 386043 (0.00095) [2022-07-09 19:22:44,996][26022] Updated weights on worker 0-0, policy_version 386053 (0.00092) [2022-07-09 19:22:46,795][26022] Updated weights on worker 0-0, policy_version 386063 (0.00084) [2022-07-09 19:22:47,832][25689] Fps is (10 sec: 5757.0, 60 sec: 5663.2, 300 sec: 5647.5). Total num frames: 395334656. Throughput: 0: 5910.6. Samples: 395339068. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:22:47,833][25689] Avg episode reward: [(0, '-46.361')] [2022-07-09 19:22:48,541][26022] Updated weights on worker 0-0, policy_version 386073 (0.00085) [2022-07-09 19:22:50,103][26022] Updated weights on worker 0-0, policy_version 386083 (0.00087) [2022-07-09 19:22:52,156][26022] Updated weights on worker 0-0, policy_version 386093 (0.00085) [2022-07-09 19:22:52,859][25689] Fps is (10 sec: 5796.2, 60 sec: 5662.4, 300 sec: 5660.8). Total num frames: 395364352. Throughput: 0: 5101.7. Samples: 395356492. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:22:52,859][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 19:22:53,727][26022] Updated weights on worker 0-0, policy_version 386103 (0.00084) [2022-07-09 19:22:55,590][26022] Updated weights on worker 0-0, policy_version 386113 (0.00086) [2022-07-09 19:22:57,512][26022] Updated weights on worker 0-0, policy_version 386123 (0.00490) [2022-07-09 19:22:57,862][25689] Fps is (10 sec: 5717.5, 60 sec: 5664.1, 300 sec: 5654.0). Total num frames: 395392000. Throughput: 0: 5960.6. Samples: 395390850. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:22:57,864][25689] Avg episode reward: [(0, '-46.392')] [2022-07-09 19:22:59,059][26022] Updated weights on worker 0-0, policy_version 386133 (0.00098) [2022-07-09 19:23:00,963][26022] Updated weights on worker 0-0, policy_version 386143 (0.00086) [2022-07-09 19:23:02,872][25689] Fps is (10 sec: 5317.7, 60 sec: 5631.4, 300 sec: 5645.0). Total num frames: 395417600. Throughput: 0: 5907.4. Samples: 395423424. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:02,873][25689] Avg episode reward: [(0, '-45.788')] [2022-07-09 19:23:03,148][26022] Updated weights on worker 0-0, policy_version 386153 (0.00089) [2022-07-09 19:23:05,003][26022] Updated weights on worker 0-0, policy_version 386163 (0.00087) [2022-07-09 19:23:06,917][26022] Updated weights on worker 0-0, policy_version 386173 (0.00092) [2022-07-09 19:23:07,915][25689] Fps is (10 sec: 5500.3, 60 sec: 5653.0, 300 sec: 5652.3). Total num frames: 395447296. Throughput: 0: 5007.8. Samples: 395439638. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:07,915][25689] Avg episode reward: [(0, '-45.386')] [2022-07-09 19:23:08,675][26022] Updated weights on worker 0-0, policy_version 386183 (0.00092) [2022-07-09 19:23:10,529][26022] Updated weights on worker 0-0, policy_version 386193 (0.00095) [2022-07-09 19:23:12,366][26022] Updated weights on worker 0-0, policy_version 386203 (0.00091) [2022-07-09 19:23:12,939][25689] Fps is (10 sec: 5798.2, 60 sec: 5669.4, 300 sec: 5655.8). Total num frames: 395475968. Throughput: 0: 5845.9. Samples: 395473878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:12,939][25689] Avg episode reward: [(0, '-46.031')] [2022-07-09 19:23:14,010][26022] Updated weights on worker 0-0, policy_version 386213 (0.00097) [2022-07-09 19:23:15,752][26022] Updated weights on worker 0-0, policy_version 386223 (0.00089) [2022-07-09 19:23:17,711][26022] Updated weights on worker 0-0, policy_version 386233 (0.00088) [2022-07-09 19:23:17,953][25689] Fps is (10 sec: 5610.3, 60 sec: 5654.9, 300 sec: 5645.9). Total num frames: 395503616. Throughput: 0: 5834.7. Samples: 395508080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:17,954][25689] Avg episode reward: [(0, '-45.730')] [2022-07-09 19:23:19,489][26022] Updated weights on worker 0-0, policy_version 386243 (0.00094) [2022-07-09 19:23:21,210][26022] Updated weights on worker 0-0, policy_version 386253 (0.00085) [2022-07-09 19:23:22,917][26022] Updated weights on worker 0-0, policy_version 386263 (0.00098) [2022-07-09 19:23:22,971][25689] Fps is (10 sec: 5715.9, 60 sec: 5654.0, 300 sec: 5654.8). Total num frames: 395533312. Throughput: 0: 5048.4. Samples: 395524892. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:22,972][25689] Avg episode reward: [(0, '-45.485')] [2022-07-09 19:23:24,865][26022] Updated weights on worker 0-0, policy_version 386273 (0.00089) [2022-07-09 19:23:25,793][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:23:25,810][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000386278_395548672.pth [2022-07-09 19:23:25,811][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000384290_393512960.pth [2022-07-09 19:23:26,743][26022] Updated weights on worker 0-0, policy_version 386283 (0.00085) [2022-07-09 19:23:28,056][25689] Fps is (10 sec: 5675.9, 60 sec: 5656.5, 300 sec: 5647.5). Total num frames: 395560960. Throughput: 0: 5924.7. Samples: 395558972. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:28,057][25689] Avg episode reward: [(0, '-45.652')] [2022-07-09 19:23:28,512][26022] Updated weights on worker 0-0, policy_version 386293 (0.00086) [2022-07-09 19:23:30,330][26022] Updated weights on worker 0-0, policy_version 386303 (0.00088) [2022-07-09 19:23:32,066][26022] Updated weights on worker 0-0, policy_version 386313 (0.00086) [2022-07-09 19:23:33,083][25689] Fps is (10 sec: 5671.1, 60 sec: 5642.8, 300 sec: 5652.4). Total num frames: 395590656. Throughput: 0: 5921.7. Samples: 395593164. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:33,083][25689] Avg episode reward: [(0, '-45.956')] [2022-07-09 19:23:33,953][26022] Updated weights on worker 0-0, policy_version 386323 (0.00090) [2022-07-09 19:23:35,674][26022] Updated weights on worker 0-0, policy_version 386333 (0.00090) [2022-07-09 19:23:37,442][26022] Updated weights on worker 0-0, policy_version 386343 (0.00088) [2022-07-09 19:23:38,106][25689] Fps is (10 sec: 5706.3, 60 sec: 5661.4, 300 sec: 5653.0). Total num frames: 395618304. Throughput: 0: 5926.0. Samples: 395627502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:38,106][25689] Avg episode reward: [(0, '-46.266')] [2022-07-09 19:23:39,213][26022] Updated weights on worker 0-0, policy_version 386353 (0.00078) [2022-07-09 19:23:41,126][26022] Updated weights on worker 0-0, policy_version 386363 (0.00091) [2022-07-09 19:23:42,875][26022] Updated weights on worker 0-0, policy_version 386373 (0.00084) [2022-07-09 19:23:43,160][25689] Fps is (10 sec: 5690.3, 60 sec: 5665.2, 300 sec: 5650.0). Total num frames: 395648000. Throughput: 0: 5937.8. Samples: 395644770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:43,162][25689] Avg episode reward: [(0, '-45.462')] [2022-07-09 19:23:44,567][26022] Updated weights on worker 0-0, policy_version 386383 (0.00096) [2022-07-09 19:23:46,501][26022] Updated weights on worker 0-0, policy_version 386393 (0.00098) [2022-07-09 19:23:48,296][25689] Fps is (10 sec: 5728.0, 60 sec: 5656.5, 300 sec: 5654.8). Total num frames: 395676672. Throughput: 0: 5917.8. Samples: 395678744. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:48,297][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 19:23:48,296][26022] Updated weights on worker 0-0, policy_version 386403 (0.00086) [2022-07-09 19:23:50,094][26022] Updated weights on worker 0-0, policy_version 386413 (0.00086) [2022-07-09 19:23:52,077][26022] Updated weights on worker 0-0, policy_version 386423 (0.00090) [2022-07-09 19:23:53,308][25689] Fps is (10 sec: 5448.9, 60 sec: 5607.1, 300 sec: 5645.1). Total num frames: 395703296. Throughput: 0: 5895.8. Samples: 395712410. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:53,309][25689] Avg episode reward: [(0, '-46.015')] [2022-07-09 19:23:53,901][26022] Updated weights on worker 0-0, policy_version 386433 (0.00087) [2022-07-09 19:23:55,684][26022] Updated weights on worker 0-0, policy_version 386443 (0.00085) [2022-07-09 19:23:57,387][26022] Updated weights on worker 0-0, policy_version 386453 (0.00084) [2022-07-09 19:23:58,322][25689] Fps is (10 sec: 5719.6, 60 sec: 5656.9, 300 sec: 5652.0). Total num frames: 395734016. Throughput: 0: 5037.2. Samples: 395729332. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:23:58,322][25689] Avg episode reward: [(0, '-46.179')] [2022-07-09 19:23:59,308][26022] Updated weights on worker 0-0, policy_version 386463 (0.00086) [2022-07-09 19:24:00,962][26022] Updated weights on worker 0-0, policy_version 386473 (0.00091) [2022-07-09 19:24:03,278][26022] Updated weights on worker 0-0, policy_version 386483 (0.00080) [2022-07-09 19:24:03,377][25689] Fps is (10 sec: 5492.1, 60 sec: 5635.8, 300 sec: 5649.0). Total num frames: 395758592. Throughput: 0: 5768.9. Samples: 395761396. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:03,377][25689] Avg episode reward: [(0, '-46.327')] [2022-07-09 19:24:05,070][26022] Updated weights on worker 0-0, policy_version 386493 (0.00097) [2022-07-09 19:24:06,964][26022] Updated weights on worker 0-0, policy_version 386503 (0.00090) [2022-07-09 19:24:08,462][25689] Fps is (10 sec: 5251.0, 60 sec: 5614.9, 300 sec: 5644.2). Total num frames: 395787264. Throughput: 0: 5767.7. Samples: 395795058. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:08,463][25689] Avg episode reward: [(0, '-46.507')] [2022-07-09 19:24:08,783][26022] Updated weights on worker 0-0, policy_version 386513 (0.00111) [2022-07-09 19:24:10,490][26022] Updated weights on worker 0-0, policy_version 386523 (0.00091) [2022-07-09 19:24:12,418][26022] Updated weights on worker 0-0, policy_version 386533 (0.00093) [2022-07-09 19:24:13,482][25689] Fps is (10 sec: 5674.5, 60 sec: 5615.3, 300 sec: 5648.1). Total num frames: 395815936. Throughput: 0: 4943.6. Samples: 395812142. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:13,483][25689] Avg episode reward: [(0, '-45.904')] [2022-07-09 19:24:14,097][26022] Updated weights on worker 0-0, policy_version 386543 (0.00080) [2022-07-09 19:24:15,909][26022] Updated weights on worker 0-0, policy_version 386553 (0.00088) [2022-07-09 19:24:17,711][26022] Updated weights on worker 0-0, policy_version 386563 (0.00082) [2022-07-09 19:24:18,514][25689] Fps is (10 sec: 5806.5, 60 sec: 5647.5, 300 sec: 5651.7). Total num frames: 395845632. Throughput: 0: 5785.2. Samples: 395846152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:18,515][25689] Avg episode reward: [(0, '-46.371')] [2022-07-09 19:24:19,718][26022] Updated weights on worker 0-0, policy_version 386573 (0.00084) [2022-07-09 19:24:21,254][26022] Updated weights on worker 0-0, policy_version 386583 (0.00086) [2022-07-09 19:24:23,164][26022] Updated weights on worker 0-0, policy_version 386593 (0.00092) [2022-07-09 19:24:23,589][25689] Fps is (10 sec: 5775.0, 60 sec: 5625.2, 300 sec: 5644.4). Total num frames: 395874304. Throughput: 0: 5885.5. Samples: 395880358. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:23,590][25689] Avg episode reward: [(0, '-45.947')] [2022-07-09 19:24:24,828][26022] Updated weights on worker 0-0, policy_version 386603 (0.00096) [2022-07-09 19:24:26,671][26022] Updated weights on worker 0-0, policy_version 386613 (0.00084) [2022-07-09 19:24:28,448][26022] Updated weights on worker 0-0, policy_version 386623 (0.00087) [2022-07-09 19:24:28,688][25689] Fps is (10 sec: 5636.4, 60 sec: 5640.9, 300 sec: 5650.8). Total num frames: 395902976. Throughput: 0: 5063.3. Samples: 395897468. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:28,689][25689] Avg episode reward: [(0, '-46.291')] [2022-07-09 19:24:30,276][26022] Updated weights on worker 0-0, policy_version 386633 (0.00098) [2022-07-09 19:24:32,083][26022] Updated weights on worker 0-0, policy_version 386643 (0.00091) [2022-07-09 19:24:33,708][25689] Fps is (10 sec: 5666.9, 60 sec: 5624.6, 300 sec: 5643.8). Total num frames: 395931648. Throughput: 0: 5917.7. Samples: 395931834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:33,709][25689] Avg episode reward: [(0, '-45.284')] [2022-07-09 19:24:33,934][26022] Updated weights on worker 0-0, policy_version 386653 (0.00087) [2022-07-09 19:24:35,726][26022] Updated weights on worker 0-0, policy_version 386663 (0.00088) [2022-07-09 19:24:37,431][26022] Updated weights on worker 0-0, policy_version 386673 (0.00084) [2022-07-09 19:24:38,747][25689] Fps is (10 sec: 5598.9, 60 sec: 5623.1, 300 sec: 5643.6). Total num frames: 395959296. Throughput: 0: 5925.8. Samples: 395966048. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:38,748][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 19:24:39,405][26022] Updated weights on worker 0-0, policy_version 386683 (0.00092) [2022-07-09 19:24:41,164][26022] Updated weights on worker 0-0, policy_version 386693 (0.00083) [2022-07-09 19:24:42,891][26022] Updated weights on worker 0-0, policy_version 386703 (0.00088) [2022-07-09 19:24:43,752][25689] Fps is (10 sec: 5709.4, 60 sec: 5627.7, 300 sec: 5649.5). Total num frames: 395988992. Throughput: 0: 5103.6. Samples: 395983262. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:43,753][25689] Avg episode reward: [(0, '-45.592')] [2022-07-09 19:24:44,595][26022] Updated weights on worker 0-0, policy_version 386713 (0.00085) [2022-07-09 19:24:46,595][26022] Updated weights on worker 0-0, policy_version 386723 (0.00089) [2022-07-09 19:24:48,254][26022] Updated weights on worker 0-0, policy_version 386733 (0.00080) [2022-07-09 19:24:48,848][25689] Fps is (10 sec: 5778.6, 60 sec: 5631.4, 300 sec: 5644.4). Total num frames: 396017664. Throughput: 0: 5954.4. Samples: 396017508. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:48,848][25689] Avg episode reward: [(0, '-46.516')] [2022-07-09 19:24:50,060][26022] Updated weights on worker 0-0, policy_version 386743 (0.00095) [2022-07-09 19:24:51,779][26022] Updated weights on worker 0-0, policy_version 386753 (0.00096) [2022-07-09 19:24:53,687][26022] Updated weights on worker 0-0, policy_version 386763 (0.00097) [2022-07-09 19:24:53,864][25689] Fps is (10 sec: 5670.8, 60 sec: 5664.8, 300 sec: 5647.7). Total num frames: 396046336. Throughput: 0: 5958.8. Samples: 396051940. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:53,866][25689] Avg episode reward: [(0, '-46.821')] [2022-07-09 19:24:55,432][26022] Updated weights on worker 0-0, policy_version 386773 (0.00084) [2022-07-09 19:24:57,128][26022] Updated weights on worker 0-0, policy_version 386783 (0.00091) [2022-07-09 19:24:58,952][25689] Fps is (10 sec: 5776.6, 60 sec: 5641.0, 300 sec: 5643.1). Total num frames: 396076032. Throughput: 0: 5090.8. Samples: 396068906. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:24:58,964][25689] Avg episode reward: [(0, '-46.589')] [2022-07-09 19:24:58,954][26022] Updated weights on worker 0-0, policy_version 386793 (0.00085) [2022-07-09 19:25:00,779][26022] Updated weights on worker 0-0, policy_version 386803 (0.00094) [2022-07-09 19:25:03,009][26022] Updated weights on worker 0-0, policy_version 386813 (0.00093) [2022-07-09 19:25:03,971][25689] Fps is (10 sec: 5572.4, 60 sec: 5678.1, 300 sec: 5652.0). Total num frames: 396102656. Throughput: 0: 5832.6. Samples: 396101192. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:25:03,973][25689] Avg episode reward: [(0, '-47.147')] [2022-07-09 19:25:04,738][26022] Updated weights on worker 0-0, policy_version 386823 (0.00084) [2022-07-09 19:25:06,439][26022] Updated weights on worker 0-0, policy_version 386833 (0.00084) [2022-07-09 19:25:08,410][26022] Updated weights on worker 0-0, policy_version 386843 (0.00100) [2022-07-09 19:25:09,086][25689] Fps is (10 sec: 5355.7, 60 sec: 5658.5, 300 sec: 5644.6). Total num frames: 396130304. Throughput: 0: 5821.8. Samples: 396135328. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:25:09,086][25689] Avg episode reward: [(0, '-46.925')] [2022-07-09 19:25:10,123][26022] Updated weights on worker 0-0, policy_version 386853 (0.00079) [2022-07-09 19:25:11,995][26022] Updated weights on worker 0-0, policy_version 386863 (0.00086) [2022-07-09 19:25:13,784][26022] Updated weights on worker 0-0, policy_version 386873 (0.00093) [2022-07-09 19:25:14,118][25689] Fps is (10 sec: 5550.3, 60 sec: 5657.3, 300 sec: 5648.2). Total num frames: 396158976. Throughput: 0: 4963.3. Samples: 396152466. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:25:14,119][25689] Avg episode reward: [(0, '-45.981')] [2022-07-09 19:25:15,448][26022] Updated weights on worker 0-0, policy_version 386883 (0.00085) [2022-07-09 19:25:17,562][26022] Updated weights on worker 0-0, policy_version 386893 (0.00087) [2022-07-09 19:25:19,029][26022] Updated weights on worker 0-0, policy_version 386903 (0.00093) [2022-07-09 19:25:19,134][25689] Fps is (10 sec: 5808.5, 60 sec: 5658.9, 300 sec: 5652.9). Total num frames: 396188672. Throughput: 0: 5823.4. Samples: 396186436. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-09 19:25:19,135][25689] Avg episode reward: [(0, '-45.401')] [2022-07-09 19:25:21,179][26022] Updated weights on worker 0-0, policy_version 386913 (0.00093) [2022-07-09 19:25:22,636][26022] Updated weights on worker 0-0, policy_version 386923 (0.00093) [2022-07-09 19:25:24,150][25689] Fps is (10 sec: 5614.5, 60 sec: 5630.6, 300 sec: 5647.6). Total num frames: 396215296. Throughput: 0: 5915.3. Samples: 396220552. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:24,150][25689] Avg episode reward: [(0, '-45.483')] [2022-07-09 19:25:24,603][26022] Updated weights on worker 0-0, policy_version 386933 (0.00097) [2022-07-09 19:25:26,006][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:25:26,018][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000386941_396227584.pth [2022-07-09 19:25:26,019][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000384953_394191872.pth [2022-07-09 19:25:26,394][26022] Updated weights on worker 0-0, policy_version 386943 (0.00091) [2022-07-09 19:25:28,213][26022] Updated weights on worker 0-0, policy_version 386953 (0.00096) [2022-07-09 19:25:29,193][25689] Fps is (10 sec: 5599.4, 60 sec: 5652.7, 300 sec: 5650.5). Total num frames: 396244992. Throughput: 0: 5085.2. Samples: 396237576. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:29,193][25689] Avg episode reward: [(0, '-45.290')] [2022-07-09 19:25:30,134][26022] Updated weights on worker 0-0, policy_version 386963 (0.00085) [2022-07-09 19:25:31,708][26022] Updated weights on worker 0-0, policy_version 386973 (0.00090) [2022-07-09 19:25:33,676][26022] Updated weights on worker 0-0, policy_version 386983 (0.00087) [2022-07-09 19:25:34,217][25689] Fps is (10 sec: 5797.8, 60 sec: 5652.4, 300 sec: 5653.6). Total num frames: 396273664. Throughput: 0: 5923.0. Samples: 396271510. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:34,217][25689] Avg episode reward: [(0, '-44.854')] [2022-07-09 19:25:35,447][26022] Updated weights on worker 0-0, policy_version 386993 (0.00334) [2022-07-09 19:25:37,224][26022] Updated weights on worker 0-0, policy_version 387003 (0.00100) [2022-07-09 19:25:38,972][26022] Updated weights on worker 0-0, policy_version 387013 (0.00083) [2022-07-09 19:25:39,278][25689] Fps is (10 sec: 5584.4, 60 sec: 5650.3, 300 sec: 5643.9). Total num frames: 396301312. Throughput: 0: 5928.6. Samples: 396305858. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:39,278][25689] Avg episode reward: [(0, '-44.973')] [2022-07-09 19:25:40,861][26022] Updated weights on worker 0-0, policy_version 387023 (0.00087) [2022-07-09 19:25:42,664][26022] Updated weights on worker 0-0, policy_version 387033 (0.00086) [2022-07-09 19:25:44,291][25689] Fps is (10 sec: 5691.9, 60 sec: 5649.5, 300 sec: 5647.8). Total num frames: 396331008. Throughput: 0: 5921.0. Samples: 396339812. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:44,292][25689] Avg episode reward: [(0, '-46.451')] [2022-07-09 19:25:44,547][26022] Updated weights on worker 0-0, policy_version 387043 (0.00091) [2022-07-09 19:25:46,342][26022] Updated weights on worker 0-0, policy_version 387053 (0.00095) [2022-07-09 19:25:48,140][26022] Updated weights on worker 0-0, policy_version 387063 (0.00085) [2022-07-09 19:25:49,376][25689] Fps is (10 sec: 5779.8, 60 sec: 5650.5, 300 sec: 5651.2). Total num frames: 396359680. Throughput: 0: 5892.3. Samples: 396356506. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:49,377][25689] Avg episode reward: [(0, '-47.492')] [2022-07-09 19:25:50,001][26022] Updated weights on worker 0-0, policy_version 387073 (0.00091) [2022-07-09 19:25:51,790][26022] Updated weights on worker 0-0, policy_version 387083 (0.00095) [2022-07-09 19:25:53,592][26022] Updated weights on worker 0-0, policy_version 387093 (0.00094) [2022-07-09 19:25:54,400][25689] Fps is (10 sec: 5571.5, 60 sec: 5632.9, 300 sec: 5644.3). Total num frames: 396387328. Throughput: 0: 5894.0. Samples: 396390470. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:54,401][25689] Avg episode reward: [(0, '-46.509')] [2022-07-09 19:25:55,370][26022] Updated weights on worker 0-0, policy_version 387103 (0.00090) [2022-07-09 19:25:57,015][26022] Updated weights on worker 0-0, policy_version 387113 (0.00772) [2022-07-09 19:25:59,166][26022] Updated weights on worker 0-0, policy_version 387123 (0.00084) [2022-07-09 19:25:59,416][25689] Fps is (10 sec: 5507.6, 60 sec: 5605.7, 300 sec: 5641.0). Total num frames: 396414976. Throughput: 0: 5896.2. Samples: 396424600. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:25:59,417][25689] Avg episode reward: [(0, '-46.850')] [2022-07-09 19:26:00,736][26022] Updated weights on worker 0-0, policy_version 387133 (0.00093) [2022-07-09 19:26:03,046][26022] Updated weights on worker 0-0, policy_version 387143 (0.00090) [2022-07-09 19:26:04,419][25689] Fps is (10 sec: 5519.2, 60 sec: 5624.2, 300 sec: 5650.2). Total num frames: 396442624. Throughput: 0: 5006.5. Samples: 396440580. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:04,419][25689] Avg episode reward: [(0, '-46.202')] [2022-07-09 19:26:04,727][26022] Updated weights on worker 0-0, policy_version 387153 (0.00095) [2022-07-09 19:26:06,498][26022] Updated weights on worker 0-0, policy_version 387163 (0.00083) [2022-07-09 19:26:08,495][26022] Updated weights on worker 0-0, policy_version 387173 (0.00082) [2022-07-09 19:26:09,468][25689] Fps is (10 sec: 5602.8, 60 sec: 5647.2, 300 sec: 5646.6). Total num frames: 396471296. Throughput: 0: 5849.4. Samples: 396474032. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:09,469][25689] Avg episode reward: [(0, '-46.369')] [2022-07-09 19:26:10,104][26022] Updated weights on worker 0-0, policy_version 387183 (0.00094) [2022-07-09 19:26:12,044][26022] Updated weights on worker 0-0, policy_version 387193 (0.00094) [2022-07-09 19:26:13,761][26022] Updated weights on worker 0-0, policy_version 387203 (0.00091) [2022-07-09 19:26:14,479][25689] Fps is (10 sec: 5598.5, 60 sec: 5632.3, 300 sec: 5646.5). Total num frames: 396498944. Throughput: 0: 5872.5. Samples: 396508382. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:14,479][25689] Avg episode reward: [(0, '-45.698')] [2022-07-09 19:26:15,465][26022] Updated weights on worker 0-0, policy_version 387213 (0.00087) [2022-07-09 19:26:17,395][26022] Updated weights on worker 0-0, policy_version 387223 (0.00083) [2022-07-09 19:26:19,127][26022] Updated weights on worker 0-0, policy_version 387233 (0.00083) [2022-07-09 19:26:19,490][25689] Fps is (10 sec: 5721.8, 60 sec: 5632.7, 300 sec: 5646.9). Total num frames: 396528640. Throughput: 0: 5016.5. Samples: 396525302. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:19,491][25689] Avg episode reward: [(0, '-45.747')] [2022-07-09 19:26:21,099][26022] Updated weights on worker 0-0, policy_version 387243 (0.00076) [2022-07-09 19:26:22,628][26022] Updated weights on worker 0-0, policy_version 387253 (0.00087) [2022-07-09 19:26:24,503][25689] Fps is (10 sec: 5720.7, 60 sec: 5649.9, 300 sec: 5648.1). Total num frames: 396556288. Throughput: 0: 5938.8. Samples: 396559854. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:24,504][25689] Avg episode reward: [(0, '-46.779')] [2022-07-09 19:26:24,665][26022] Updated weights on worker 0-0, policy_version 387263 (0.00084) [2022-07-09 19:26:26,253][26022] Updated weights on worker 0-0, policy_version 387273 (0.00084) [2022-07-09 19:26:28,285][26022] Updated weights on worker 0-0, policy_version 387283 (0.00090) [2022-07-09 19:26:29,586][25689] Fps is (10 sec: 5680.1, 60 sec: 5646.2, 300 sec: 5650.1). Total num frames: 396585984. Throughput: 0: 5965.8. Samples: 396594050. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:29,587][25689] Avg episode reward: [(0, '-46.908')] [2022-07-09 19:26:29,963][26022] Updated weights on worker 0-0, policy_version 387293 (0.00081) [2022-07-09 19:26:31,739][26022] Updated weights on worker 0-0, policy_version 387303 (0.00950) [2022-07-09 19:26:33,514][26022] Updated weights on worker 0-0, policy_version 387313 (0.00082) [2022-07-09 19:26:34,634][25689] Fps is (10 sec: 5761.0, 60 sec: 5643.9, 300 sec: 5649.6). Total num frames: 396614656. Throughput: 0: 5110.5. Samples: 396611388. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:34,636][25689] Avg episode reward: [(0, '-46.982')] [2022-07-09 19:26:35,357][26022] Updated weights on worker 0-0, policy_version 387323 (0.00084) [2022-07-09 19:26:36,936][26022] Updated weights on worker 0-0, policy_version 387333 (0.00098) [2022-07-09 19:26:38,769][26022] Updated weights on worker 0-0, policy_version 387343 (0.00090) [2022-07-09 19:26:39,651][25689] Fps is (10 sec: 5697.6, 60 sec: 5665.1, 300 sec: 5646.1). Total num frames: 396643328. Throughput: 0: 5975.2. Samples: 396645764. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:39,651][25689] Avg episode reward: [(0, '-46.806')] [2022-07-09 19:26:40,520][26022] Updated weights on worker 0-0, policy_version 387353 (0.00094) [2022-07-09 19:26:42,495][26022] Updated weights on worker 0-0, policy_version 387363 (0.00083) [2022-07-09 19:26:44,110][26022] Updated weights on worker 0-0, policy_version 387373 (0.00097) [2022-07-09 19:26:44,667][25689] Fps is (10 sec: 5715.9, 60 sec: 5647.9, 300 sec: 5650.1). Total num frames: 396672000. Throughput: 0: 5969.8. Samples: 396680230. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:44,667][25689] Avg episode reward: [(0, '-46.549')] [2022-07-09 19:26:45,972][26022] Updated weights on worker 0-0, policy_version 387383 (0.00093) [2022-07-09 19:26:47,755][26022] Updated weights on worker 0-0, policy_version 387393 (0.00095) [2022-07-09 19:26:49,697][26022] Updated weights on worker 0-0, policy_version 387403 (0.00095) [2022-07-09 19:26:49,797][25689] Fps is (10 sec: 5652.0, 60 sec: 5643.7, 300 sec: 5644.5). Total num frames: 396700672. Throughput: 0: 5105.1. Samples: 396697226. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:49,797][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 19:26:51,611][26022] Updated weights on worker 0-0, policy_version 387413 (0.00089) [2022-07-09 19:26:53,243][26022] Updated weights on worker 0-0, policy_version 387423 (0.00090) [2022-07-09 19:26:54,836][25689] Fps is (10 sec: 5639.1, 60 sec: 5659.2, 300 sec: 5647.6). Total num frames: 396729344. Throughput: 0: 5918.6. Samples: 396730952. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:54,836][25689] Avg episode reward: [(0, '-46.323')] [2022-07-09 19:26:55,088][26022] Updated weights on worker 0-0, policy_version 387433 (0.00094) [2022-07-09 19:26:56,871][26022] Updated weights on worker 0-0, policy_version 387443 (0.00088) [2022-07-09 19:26:58,778][26022] Updated weights on worker 0-0, policy_version 387453 (0.00094) [2022-07-09 19:26:59,908][25689] Fps is (10 sec: 5671.2, 60 sec: 5670.9, 300 sec: 5650.1). Total num frames: 396758016. Throughput: 0: 5893.4. Samples: 396765148. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:26:59,910][25689] Avg episode reward: [(0, '-47.445')] [2022-07-09 19:27:00,478][26022] Updated weights on worker 0-0, policy_version 387463 (0.00096) [2022-07-09 19:27:02,573][26022] Updated weights on worker 0-0, policy_version 387473 (0.00080) [2022-07-09 19:27:04,479][26022] Updated weights on worker 0-0, policy_version 387483 (0.00420) [2022-07-09 19:27:04,927][25689] Fps is (10 sec: 5479.9, 60 sec: 5652.4, 300 sec: 5644.7). Total num frames: 396784640. Throughput: 0: 4932.7. Samples: 396780166. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:04,929][25689] Avg episode reward: [(0, '-47.424')] [2022-07-09 19:27:06,185][26022] Updated weights on worker 0-0, policy_version 387493 (0.00083) [2022-07-09 19:27:08,151][26022] Updated weights on worker 0-0, policy_version 387503 (0.00092) [2022-07-09 19:27:09,805][26022] Updated weights on worker 0-0, policy_version 387513 (0.00091) [2022-07-09 19:27:09,964][25689] Fps is (10 sec: 5600.7, 60 sec: 5670.5, 300 sec: 5651.2). Total num frames: 396814336. Throughput: 0: 5803.9. Samples: 396814276. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:09,964][25689] Avg episode reward: [(0, '-48.044')] [2022-07-09 19:27:11,748][26022] Updated weights on worker 0-0, policy_version 387523 (0.00083) [2022-07-09 19:27:13,362][26022] Updated weights on worker 0-0, policy_version 387533 (0.00092) [2022-07-09 19:27:14,975][25689] Fps is (10 sec: 5706.9, 60 sec: 5670.5, 300 sec: 5648.3). Total num frames: 396841984. Throughput: 0: 5844.5. Samples: 396848654. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:14,976][25689] Avg episode reward: [(0, '-47.550')] [2022-07-09 19:27:15,327][26022] Updated weights on worker 0-0, policy_version 387543 (0.00087) [2022-07-09 19:27:17,083][26022] Updated weights on worker 0-0, policy_version 387553 (0.00084) [2022-07-09 19:27:18,840][26022] Updated weights on worker 0-0, policy_version 387563 (0.00094) [2022-07-09 19:27:19,982][25689] Fps is (10 sec: 5519.5, 60 sec: 5637.0, 300 sec: 5641.4). Total num frames: 396869632. Throughput: 0: 5002.8. Samples: 396865576. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:19,983][25689] Avg episode reward: [(0, '-47.744')] [2022-07-09 19:27:20,645][26022] Updated weights on worker 0-0, policy_version 387573 (0.00090) [2022-07-09 19:27:22,518][26022] Updated weights on worker 0-0, policy_version 387583 (0.00086) [2022-07-09 19:27:24,207][26022] Updated weights on worker 0-0, policy_version 387593 (0.00093) [2022-07-09 19:27:24,995][25689] Fps is (10 sec: 5722.7, 60 sec: 5670.8, 300 sec: 5650.2). Total num frames: 396899328. Throughput: 0: 5952.0. Samples: 396899614. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:24,996][25689] Avg episode reward: [(0, '-47.120')] [2022-07-09 19:27:26,199][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:27:26,213][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000387602_396904448.pth [2022-07-09 19:27:26,213][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000385615_394869760.pth [2022-07-09 19:27:26,241][26022] Updated weights on worker 0-0, policy_version 387603 (0.00087) [2022-07-09 19:27:27,938][26022] Updated weights on worker 0-0, policy_version 387613 (0.00621) [2022-07-09 19:27:29,648][26022] Updated weights on worker 0-0, policy_version 387623 (0.00086) [2022-07-09 19:27:30,035][25689] Fps is (10 sec: 5704.4, 60 sec: 5641.0, 300 sec: 5640.3). Total num frames: 396926976. Throughput: 0: 5944.8. Samples: 396933594. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:30,035][25689] Avg episode reward: [(0, '-46.449')] [2022-07-09 19:27:31,407][26022] Updated weights on worker 0-0, policy_version 387633 (0.00086) [2022-07-09 19:27:33,514][26022] Updated weights on worker 0-0, policy_version 387643 (0.00082) [2022-07-09 19:27:35,007][26022] Updated weights on worker 0-0, policy_version 387653 (0.00088) [2022-07-09 19:27:35,051][25689] Fps is (10 sec: 5702.7, 60 sec: 5661.0, 300 sec: 5651.1). Total num frames: 396956672. Throughput: 0: 5089.1. Samples: 396950822. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:35,051][25689] Avg episode reward: [(0, '-46.179')] [2022-07-09 19:27:37,002][26022] Updated weights on worker 0-0, policy_version 387663 (0.00087) [2022-07-09 19:27:38,838][26022] Updated weights on worker 0-0, policy_version 387673 (0.00093) [2022-07-09 19:27:40,053][25689] Fps is (10 sec: 5723.8, 60 sec: 5645.3, 300 sec: 5645.9). Total num frames: 396984320. Throughput: 0: 5930.7. Samples: 396984612. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:40,054][25689] Avg episode reward: [(0, '-45.299')] [2022-07-09 19:27:40,556][26022] Updated weights on worker 0-0, policy_version 387683 (0.00082) [2022-07-09 19:27:42,391][26022] Updated weights on worker 0-0, policy_version 387693 (0.00104) [2022-07-09 19:27:44,118][26022] Updated weights on worker 0-0, policy_version 387703 (0.00083) [2022-07-09 19:27:45,064][25689] Fps is (10 sec: 5522.5, 60 sec: 5628.9, 300 sec: 5643.1). Total num frames: 397011968. Throughput: 0: 5946.7. Samples: 397018956. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:45,064][25689] Avg episode reward: [(0, '-44.361')] [2022-07-09 19:27:45,901][26022] Updated weights on worker 0-0, policy_version 387713 (0.00089) [2022-07-09 19:27:48,011][26022] Updated weights on worker 0-0, policy_version 387723 (0.00096) [2022-07-09 19:27:49,565][26022] Updated weights on worker 0-0, policy_version 387733 (0.00089) [2022-07-09 19:27:50,127][25689] Fps is (10 sec: 5692.5, 60 sec: 5652.1, 300 sec: 5642.4). Total num frames: 397041664. Throughput: 0: 5084.2. Samples: 397035746. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:50,127][25689] Avg episode reward: [(0, '-44.117')] [2022-07-09 19:27:51,578][26022] Updated weights on worker 0-0, policy_version 387743 (0.00085) [2022-07-09 19:27:53,253][26022] Updated weights on worker 0-0, policy_version 387753 (0.00091) [2022-07-09 19:27:55,128][25689] Fps is (10 sec: 5697.8, 60 sec: 5638.7, 300 sec: 5642.5). Total num frames: 397069312. Throughput: 0: 5911.5. Samples: 397069508. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-09 19:27:55,128][25689] Avg episode reward: [(0, '-44.191')] [2022-07-09 19:27:55,142][26022] Updated weights on worker 0-0, policy_version 387763 (0.00098) [2022-07-09 19:27:57,027][26022] Updated weights on worker 0-0, policy_version 387773 (0.00091) [2022-07-09 19:27:58,580][26022] Updated weights on worker 0-0, policy_version 387783 (0.00084) [2022-07-09 19:28:00,151][25689] Fps is (10 sec: 5618.5, 60 sec: 5643.3, 300 sec: 5652.6). Total num frames: 397097984. Throughput: 0: 5931.0. Samples: 397103810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:00,151][25689] Avg episode reward: [(0, '-44.378')] [2022-07-09 19:28:00,495][26022] Updated weights on worker 0-0, policy_version 387793 (0.00100) [2022-07-09 19:28:02,578][26022] Updated weights on worker 0-0, policy_version 387803 (0.00093) [2022-07-09 19:28:04,426][26022] Updated weights on worker 0-0, policy_version 387813 (0.00083) [2022-07-09 19:28:05,177][25689] Fps is (10 sec: 5400.6, 60 sec: 5625.6, 300 sec: 5639.1). Total num frames: 397123584. Throughput: 0: 4959.4. Samples: 397118706. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:05,178][25689] Avg episode reward: [(0, '-44.572')] [2022-07-09 19:28:06,438][26022] Updated weights on worker 0-0, policy_version 387823 (0.00093) [2022-07-09 19:28:07,944][26022] Updated weights on worker 0-0, policy_version 387833 (0.00091) [2022-07-09 19:28:09,928][26022] Updated weights on worker 0-0, policy_version 387843 (0.00083) [2022-07-09 19:28:10,287][25689] Fps is (10 sec: 5455.4, 60 sec: 5618.8, 300 sec: 5640.9). Total num frames: 397153280. Throughput: 0: 5793.5. Samples: 397152544. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:10,287][25689] Avg episode reward: [(0, '-44.529')] [2022-07-09 19:28:11,546][26022] Updated weights on worker 0-0, policy_version 387853 (0.00084) [2022-07-09 19:28:13,473][26022] Updated weights on worker 0-0, policy_version 387863 (0.00091) [2022-07-09 19:28:15,251][26022] Updated weights on worker 0-0, policy_version 387873 (0.00094) [2022-07-09 19:28:15,382][25689] Fps is (10 sec: 5719.6, 60 sec: 5627.9, 300 sec: 5642.8). Total num frames: 397181952. Throughput: 0: 5792.9. Samples: 397186838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:15,383][25689] Avg episode reward: [(0, '-45.718')] [2022-07-09 19:28:17,180][26022] Updated weights on worker 0-0, policy_version 387883 (0.00087) [2022-07-09 19:28:18,816][26022] Updated weights on worker 0-0, policy_version 387893 (0.00097) [2022-07-09 19:28:20,487][25689] Fps is (10 sec: 5521.6, 60 sec: 5618.9, 300 sec: 5634.3). Total num frames: 397209600. Throughput: 0: 4921.3. Samples: 397203888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:20,488][25689] Avg episode reward: [(0, '-45.753')] [2022-07-09 19:28:20,765][26022] Updated weights on worker 0-0, policy_version 387903 (0.00091) [2022-07-09 19:28:22,424][26022] Updated weights on worker 0-0, policy_version 387913 (0.00090) [2022-07-09 19:28:24,348][26022] Updated weights on worker 0-0, policy_version 387923 (0.00084) [2022-07-09 19:28:25,514][25689] Fps is (10 sec: 5659.6, 60 sec: 5617.5, 300 sec: 5642.3). Total num frames: 397239296. Throughput: 0: 5867.6. Samples: 397238036. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:25,516][25689] Avg episode reward: [(0, '-45.984')] [2022-07-09 19:28:26,184][26022] Updated weights on worker 0-0, policy_version 387933 (0.00083) [2022-07-09 19:28:27,976][26022] Updated weights on worker 0-0, policy_version 387943 (0.00084) [2022-07-09 19:28:29,749][26022] Updated weights on worker 0-0, policy_version 387953 (0.00094) [2022-07-09 19:28:30,582][25689] Fps is (10 sec: 5781.5, 60 sec: 5631.8, 300 sec: 5638.0). Total num frames: 397267968. Throughput: 0: 5878.6. Samples: 397271854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:30,583][25689] Avg episode reward: [(0, '-46.087')] [2022-07-09 19:28:31,662][26022] Updated weights on worker 0-0, policy_version 387963 (0.00083) [2022-07-09 19:28:33,285][26022] Updated weights on worker 0-0, policy_version 387973 (0.00092) [2022-07-09 19:28:35,363][26022] Updated weights on worker 0-0, policy_version 387983 (0.00084) [2022-07-09 19:28:35,662][25689] Fps is (10 sec: 5549.8, 60 sec: 5592.1, 300 sec: 5637.0). Total num frames: 397295616. Throughput: 0: 5860.6. Samples: 397305692. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:35,663][25689] Avg episode reward: [(0, '-46.716')] [2022-07-09 19:28:36,926][26022] Updated weights on worker 0-0, policy_version 387993 (0.00090) [2022-07-09 19:28:38,816][26022] Updated weights on worker 0-0, policy_version 388003 (0.00097) [2022-07-09 19:28:40,635][26022] Updated weights on worker 0-0, policy_version 388013 (0.00094) [2022-07-09 19:28:40,732][25689] Fps is (10 sec: 5649.9, 60 sec: 5619.6, 300 sec: 5636.7). Total num frames: 397325312. Throughput: 0: 5870.7. Samples: 397322742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:40,732][25689] Avg episode reward: [(0, '-46.497')] [2022-07-09 19:28:42,451][26022] Updated weights on worker 0-0, policy_version 388023 (0.00087) [2022-07-09 19:28:44,356][26022] Updated weights on worker 0-0, policy_version 388033 (0.00085) [2022-07-09 19:28:45,741][25689] Fps is (10 sec: 5689.5, 60 sec: 5619.7, 300 sec: 5635.6). Total num frames: 397352960. Throughput: 0: 5871.5. Samples: 397356798. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:45,742][25689] Avg episode reward: [(0, '-46.599')] [2022-07-09 19:28:46,143][26022] Updated weights on worker 0-0, policy_version 388043 (0.00095) [2022-07-09 19:28:47,878][26022] Updated weights on worker 0-0, policy_version 388053 (0.00094) [2022-07-09 19:28:49,811][26022] Updated weights on worker 0-0, policy_version 388063 (0.00103) [2022-07-09 19:28:50,817][25689] Fps is (10 sec: 5686.1, 60 sec: 5618.6, 300 sec: 5644.7). Total num frames: 397382656. Throughput: 0: 5861.9. Samples: 397390466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:50,817][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 19:28:51,505][26022] Updated weights on worker 0-0, policy_version 388073 (0.00091) [2022-07-09 19:28:53,456][26022] Updated weights on worker 0-0, policy_version 388083 (0.00087) [2022-07-09 19:28:55,303][26022] Updated weights on worker 0-0, policy_version 388093 (0.00085) [2022-07-09 19:28:55,883][25689] Fps is (10 sec: 5553.3, 60 sec: 5595.7, 300 sec: 5630.0). Total num frames: 397409280. Throughput: 0: 5025.0. Samples: 397407300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:28:55,883][25689] Avg episode reward: [(0, '-46.871')] [2022-07-09 19:28:56,740][26022] Updated weights on worker 0-0, policy_version 388103 (0.00087) [2022-07-09 19:28:58,963][26022] Updated weights on worker 0-0, policy_version 388113 (0.00085) [2022-07-09 19:29:00,711][26022] Updated weights on worker 0-0, policy_version 388123 (0.00082) [2022-07-09 19:29:00,903][25689] Fps is (10 sec: 5583.9, 60 sec: 5612.8, 300 sec: 5647.9). Total num frames: 397438976. Throughput: 0: 5883.5. Samples: 397441418. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:00,903][25689] Avg episode reward: [(0, '-46.629')] [2022-07-09 19:29:02,935][26022] Updated weights on worker 0-0, policy_version 388133 (0.00095) [2022-07-09 19:29:04,569][26022] Updated weights on worker 0-0, policy_version 388143 (0.00086) [2022-07-09 19:29:05,967][25689] Fps is (10 sec: 5585.2, 60 sec: 5626.2, 300 sec: 5641.4). Total num frames: 397465600. Throughput: 0: 5769.2. Samples: 397473482. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:05,967][25689] Avg episode reward: [(0, '-46.700')] [2022-07-09 19:29:06,350][26022] Updated weights on worker 0-0, policy_version 388153 (0.00092) [2022-07-09 19:29:08,220][26022] Updated weights on worker 0-0, policy_version 388163 (0.00098) [2022-07-09 19:29:09,900][26022] Updated weights on worker 0-0, policy_version 388173 (0.00093) [2022-07-09 19:29:11,072][25689] Fps is (10 sec: 5538.3, 60 sec: 5626.6, 300 sec: 5643.2). Total num frames: 397495296. Throughput: 0: 4933.2. Samples: 397490390. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:11,075][25689] Avg episode reward: [(0, '-46.221')] [2022-07-09 19:29:11,841][26022] Updated weights on worker 0-0, policy_version 388183 (0.00086) [2022-07-09 19:29:13,766][26022] Updated weights on worker 0-0, policy_version 388193 (0.00087) [2022-07-09 19:29:15,392][26022] Updated weights on worker 0-0, policy_version 388203 (0.00086) [2022-07-09 19:29:16,107][25689] Fps is (10 sec: 5553.8, 60 sec: 5598.4, 300 sec: 5632.8). Total num frames: 397521920. Throughput: 0: 5811.3. Samples: 397524832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:16,108][25689] Avg episode reward: [(0, '-46.592')] [2022-07-09 19:29:17,163][26022] Updated weights on worker 0-0, policy_version 388213 (0.00097) [2022-07-09 19:29:19,136][26022] Updated weights on worker 0-0, policy_version 388223 (0.00093) [2022-07-09 19:29:20,627][26022] Updated weights on worker 0-0, policy_version 388233 (0.00089) [2022-07-09 19:29:21,139][25689] Fps is (10 sec: 5594.6, 60 sec: 5639.0, 300 sec: 5637.1). Total num frames: 397551616. Throughput: 0: 5801.6. Samples: 397558818. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:21,139][25689] Avg episode reward: [(0, '-47.352')] [2022-07-09 19:29:22,732][26022] Updated weights on worker 0-0, policy_version 388243 (0.00084) [2022-07-09 19:29:24,316][26022] Updated weights on worker 0-0, policy_version 388253 (0.00091) [2022-07-09 19:29:26,168][25689] Fps is (10 sec: 5700.1, 60 sec: 5605.1, 300 sec: 5635.0). Total num frames: 397579264. Throughput: 0: 5077.3. Samples: 397576048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:26,168][25689] Avg episode reward: [(0, '-47.659')] [2022-07-09 19:29:26,292][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:29:26,307][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000388262_397580288.pth [2022-07-09 19:29:26,308][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000386278_395548672.pth [2022-07-09 19:29:26,410][26022] Updated weights on worker 0-0, policy_version 388263 (0.00095) [2022-07-09 19:29:27,955][26022] Updated weights on worker 0-0, policy_version 388273 (0.00092) [2022-07-09 19:29:29,920][26022] Updated weights on worker 0-0, policy_version 388283 (0.00093) [2022-07-09 19:29:31,233][25689] Fps is (10 sec: 5782.5, 60 sec: 5639.1, 300 sec: 5641.0). Total num frames: 397609984. Throughput: 0: 5919.7. Samples: 397609734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:31,233][25689] Avg episode reward: [(0, '-48.789')] [2022-07-09 19:29:31,583][26022] Updated weights on worker 0-0, policy_version 388293 (0.00084) [2022-07-09 19:29:33,688][26022] Updated weights on worker 0-0, policy_version 388303 (0.00105) [2022-07-09 19:29:35,205][26022] Updated weights on worker 0-0, policy_version 388313 (0.00086) [2022-07-09 19:29:36,333][25689] Fps is (10 sec: 5640.9, 60 sec: 5620.3, 300 sec: 5636.4). Total num frames: 397636608. Throughput: 0: 5868.9. Samples: 397643536. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:36,334][25689] Avg episode reward: [(0, '-48.774')] [2022-07-09 19:29:37,107][26022] Updated weights on worker 0-0, policy_version 388323 (0.00081) [2022-07-09 19:29:38,931][26022] Updated weights on worker 0-0, policy_version 388333 (0.00091) [2022-07-09 19:29:40,936][26022] Updated weights on worker 0-0, policy_version 388343 (0.00087) [2022-07-09 19:29:41,373][25689] Fps is (10 sec: 5554.1, 60 sec: 5623.1, 300 sec: 5635.7). Total num frames: 397666304. Throughput: 0: 5054.4. Samples: 397661090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:41,373][25689] Avg episode reward: [(0, '-48.587')] [2022-07-09 19:29:42,365][26022] Updated weights on worker 0-0, policy_version 388353 (0.00082) [2022-07-09 19:29:44,410][26022] Updated weights on worker 0-0, policy_version 388363 (0.00089) [2022-07-09 19:29:45,929][26022] Updated weights on worker 0-0, policy_version 388373 (0.00092) [2022-07-09 19:29:46,449][25689] Fps is (10 sec: 5871.0, 60 sec: 5650.6, 300 sec: 5639.5). Total num frames: 397696000. Throughput: 0: 5898.1. Samples: 397695672. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:46,450][25689] Avg episode reward: [(0, '-47.939')] [2022-07-09 19:29:47,919][26022] Updated weights on worker 0-0, policy_version 388383 (0.00084) [2022-07-09 19:29:49,658][26022] Updated weights on worker 0-0, policy_version 388393 (0.00083) [2022-07-09 19:29:51,335][26022] Updated weights on worker 0-0, policy_version 388403 (0.00086) [2022-07-09 19:29:51,490][25689] Fps is (10 sec: 5769.6, 60 sec: 5637.1, 300 sec: 5639.1). Total num frames: 397724672. Throughput: 0: 5929.9. Samples: 397729854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:51,490][25689] Avg episode reward: [(0, '-47.701')] [2022-07-09 19:29:53,242][26022] Updated weights on worker 0-0, policy_version 388413 (0.00084) [2022-07-09 19:29:54,905][26022] Updated weights on worker 0-0, policy_version 388423 (0.00085) [2022-07-09 19:29:56,506][25689] Fps is (10 sec: 5600.3, 60 sec: 5658.6, 300 sec: 5633.6). Total num frames: 397752320. Throughput: 0: 5133.5. Samples: 397747092. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:29:56,506][25689] Avg episode reward: [(0, '-46.179')] [2022-07-09 19:29:56,959][26022] Updated weights on worker 0-0, policy_version 388433 (0.00085) [2022-07-09 19:29:58,662][26022] Updated weights on worker 0-0, policy_version 388443 (0.00091) [2022-07-09 19:30:00,443][26022] Updated weights on worker 0-0, policy_version 388453 (0.00093) [2022-07-09 19:30:01,518][25689] Fps is (10 sec: 5616.1, 60 sec: 5642.4, 300 sec: 5640.6). Total num frames: 397780992. Throughput: 0: 5969.4. Samples: 397781342. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:01,527][25689] Avg episode reward: [(0, '-45.754')] [2022-07-09 19:30:02,589][26022] Updated weights on worker 0-0, policy_version 388463 (0.00085) [2022-07-09 19:30:04,375][26022] Updated weights on worker 0-0, policy_version 388473 (0.00500) [2022-07-09 19:30:06,103][26022] Updated weights on worker 0-0, policy_version 388483 (0.00088) [2022-07-09 19:30:06,544][25689] Fps is (10 sec: 5610.5, 60 sec: 5662.8, 300 sec: 5642.3). Total num frames: 397808640. Throughput: 0: 5877.4. Samples: 397813778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:06,545][25689] Avg episode reward: [(0, '-46.929')] [2022-07-09 19:30:07,971][26022] Updated weights on worker 0-0, policy_version 388493 (0.00086) [2022-07-09 19:30:09,621][26022] Updated weights on worker 0-0, policy_version 388503 (0.00087) [2022-07-09 19:30:11,505][26022] Updated weights on worker 0-0, policy_version 388513 (0.00086) [2022-07-09 19:30:11,640][25689] Fps is (10 sec: 5564.0, 60 sec: 5646.8, 300 sec: 5641.1). Total num frames: 397837312. Throughput: 0: 5016.5. Samples: 397830940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:11,641][25689] Avg episode reward: [(0, '-47.763')] [2022-07-09 19:30:13,433][26022] Updated weights on worker 0-0, policy_version 388523 (0.00090) [2022-07-09 19:30:15,007][26022] Updated weights on worker 0-0, policy_version 388533 (0.00089) [2022-07-09 19:30:16,682][25689] Fps is (10 sec: 5656.4, 60 sec: 5680.0, 300 sec: 5637.1). Total num frames: 397865984. Throughput: 0: 5862.0. Samples: 397865366. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:16,683][25689] Avg episode reward: [(0, '-47.601')] [2022-07-09 19:30:17,002][26022] Updated weights on worker 0-0, policy_version 388543 (0.00088) [2022-07-09 19:30:18,494][26022] Updated weights on worker 0-0, policy_version 388553 (0.00091) [2022-07-09 19:30:20,527][26022] Updated weights on worker 0-0, policy_version 388563 (0.00089) [2022-07-09 19:30:21,702][25689] Fps is (10 sec: 5800.7, 60 sec: 5681.1, 300 sec: 5647.3). Total num frames: 397895680. Throughput: 0: 5847.8. Samples: 397899376. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:21,703][25689] Avg episode reward: [(0, '-47.639')] [2022-07-09 19:30:22,556][26022] Updated weights on worker 0-0, policy_version 388573 (0.00087) [2022-07-09 19:30:24,063][26022] Updated weights on worker 0-0, policy_version 388583 (0.00087) [2022-07-09 19:30:26,060][26022] Updated weights on worker 0-0, policy_version 388593 (0.00092) [2022-07-09 19:30:26,711][25689] Fps is (10 sec: 5717.7, 60 sec: 5682.9, 300 sec: 5641.1). Total num frames: 397923328. Throughput: 0: 5086.5. Samples: 397916358. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:26,712][25689] Avg episode reward: [(0, '-48.827')] [2022-07-09 19:30:27,782][26022] Updated weights on worker 0-0, policy_version 388603 (0.00094) [2022-07-09 19:30:29,536][26022] Updated weights on worker 0-0, policy_version 388613 (0.00095) [2022-07-09 19:30:31,450][26022] Updated weights on worker 0-0, policy_version 388623 (0.00085) [2022-07-09 19:30:31,753][25689] Fps is (10 sec: 5603.3, 60 sec: 5651.2, 300 sec: 5640.8). Total num frames: 397952000. Throughput: 0: 5936.3. Samples: 397950340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:31,754][25689] Avg episode reward: [(0, '-47.896')] [2022-07-09 19:30:33,107][26022] Updated weights on worker 0-0, policy_version 388633 (0.00086) [2022-07-09 19:30:34,863][26022] Updated weights on worker 0-0, policy_version 388643 (0.00097) [2022-07-09 19:30:36,775][25689] Fps is (10 sec: 5596.5, 60 sec: 5675.6, 300 sec: 5641.5). Total num frames: 397979648. Throughput: 0: 5920.6. Samples: 397984326. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 19:30:36,775][25689] Avg episode reward: [(0, '-47.471')] [2022-07-09 19:30:36,980][26022] Updated weights on worker 0-0, policy_version 388653 (0.00055) [2022-07-09 19:30:38,701][26022] Updated weights on worker 0-0, policy_version 388663 (0.00087) [2022-07-09 19:30:40,554][26022] Updated weights on worker 0-0, policy_version 388673 (0.00084) [2022-07-09 19:30:41,788][25689] Fps is (10 sec: 5714.5, 60 sec: 5678.0, 300 sec: 5641.5). Total num frames: 398009344. Throughput: 0: 5074.1. Samples: 398001294. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:30:41,789][25689] Avg episode reward: [(0, '-46.932')] [2022-07-09 19:30:42,228][26022] Updated weights on worker 0-0, policy_version 388683 (0.00085) [2022-07-09 19:30:44,039][26022] Updated weights on worker 0-0, policy_version 388693 (0.00417) [2022-07-09 19:30:45,956][26022] Updated weights on worker 0-0, policy_version 388703 (0.00087) [2022-07-09 19:30:46,791][25689] Fps is (10 sec: 5724.9, 60 sec: 5651.0, 300 sec: 5639.6). Total num frames: 398036992. Throughput: 0: 5939.0. Samples: 398035614. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:30:46,792][25689] Avg episode reward: [(0, '-47.936')] [2022-07-09 19:30:47,563][26022] Updated weights on worker 0-0, policy_version 388713 (0.00094) [2022-07-09 19:30:49,684][26022] Updated weights on worker 0-0, policy_version 388723 (0.00088) [2022-07-09 19:30:51,259][26022] Updated weights on worker 0-0, policy_version 388733 (0.00088) [2022-07-09 19:30:51,838][25689] Fps is (10 sec: 5604.3, 60 sec: 5650.4, 300 sec: 5642.6). Total num frames: 398065664. Throughput: 0: 5945.1. Samples: 398069742. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:30:51,838][25689] Avg episode reward: [(0, '-48.045')] [2022-07-09 19:30:53,139][26022] Updated weights on worker 0-0, policy_version 388743 (0.00095) [2022-07-09 19:30:54,855][26022] Updated weights on worker 0-0, policy_version 388753 (0.00113) [2022-07-09 19:30:56,697][26022] Updated weights on worker 0-0, policy_version 388763 (0.00084) [2022-07-09 19:30:56,855][25689] Fps is (10 sec: 5596.5, 60 sec: 5650.3, 300 sec: 5642.6). Total num frames: 398093312. Throughput: 0: 5092.3. Samples: 398086580. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:30:56,855][25689] Avg episode reward: [(0, '-48.024')] [2022-07-09 19:30:58,481][26022] Updated weights on worker 0-0, policy_version 388773 (0.00067) [2022-07-09 19:31:00,396][26022] Updated weights on worker 0-0, policy_version 388783 (0.00092) [2022-07-09 19:31:01,858][25689] Fps is (10 sec: 5620.8, 60 sec: 5651.2, 300 sec: 5646.1). Total num frames: 398121984. Throughput: 0: 5944.0. Samples: 398120586. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:01,863][25689] Avg episode reward: [(0, '-47.831')] [2022-07-09 19:31:02,513][26022] Updated weights on worker 0-0, policy_version 388793 (0.00095) [2022-07-09 19:31:04,408][26022] Updated weights on worker 0-0, policy_version 388803 (0.00087) [2022-07-09 19:31:06,035][26022] Updated weights on worker 0-0, policy_version 388813 (0.00083) [2022-07-09 19:31:06,894][25689] Fps is (10 sec: 5304.3, 60 sec: 5599.4, 300 sec: 5632.6). Total num frames: 398146560. Throughput: 0: 5806.1. Samples: 398152330. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:06,894][25689] Avg episode reward: [(0, '-48.739')] [2022-07-09 19:31:07,933][26022] Updated weights on worker 0-0, policy_version 388823 (0.00084) [2022-07-09 19:31:09,836][26022] Updated weights on worker 0-0, policy_version 388833 (0.00859) [2022-07-09 19:31:11,777][26022] Updated weights on worker 0-0, policy_version 388843 (0.00086) [2022-07-09 19:31:11,934][25689] Fps is (10 sec: 5385.9, 60 sec: 5621.5, 300 sec: 5638.9). Total num frames: 398176256. Throughput: 0: 5802.6. Samples: 398186354. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:11,935][25689] Avg episode reward: [(0, '-48.847')] [2022-07-09 19:31:13,415][26022] Updated weights on worker 0-0, policy_version 388853 (0.00086) [2022-07-09 19:31:15,249][26022] Updated weights on worker 0-0, policy_version 388863 (0.00101) [2022-07-09 19:31:16,901][26022] Updated weights on worker 0-0, policy_version 388873 (0.00088) [2022-07-09 19:31:16,939][25689] Fps is (10 sec: 5912.5, 60 sec: 5642.0, 300 sec: 5639.0). Total num frames: 398205952. Throughput: 0: 5823.6. Samples: 398203540. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:16,939][25689] Avg episode reward: [(0, '-48.158')] [2022-07-09 19:31:18,783][26022] Updated weights on worker 0-0, policy_version 388883 (0.00093) [2022-07-09 19:31:20,395][26022] Updated weights on worker 0-0, policy_version 388893 (0.00086) [2022-07-09 19:31:21,952][25689] Fps is (10 sec: 5826.4, 60 sec: 5625.6, 300 sec: 5642.4). Total num frames: 398234624. Throughput: 0: 5843.7. Samples: 398238012. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:21,952][25689] Avg episode reward: [(0, '-47.242')] [2022-07-09 19:31:22,348][26022] Updated weights on worker 0-0, policy_version 388903 (0.00088) [2022-07-09 19:31:24,188][26022] Updated weights on worker 0-0, policy_version 388913 (0.00089) [2022-07-09 19:31:25,983][26022] Updated weights on worker 0-0, policy_version 388923 (0.00093) [2022-07-09 19:31:26,483][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:31:26,499][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000388926_398260224.pth [2022-07-09 19:31:26,499][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000386941_396227584.pth [2022-07-09 19:31:26,968][25689] Fps is (10 sec: 5615.8, 60 sec: 5625.0, 300 sec: 5636.8). Total num frames: 398262272. Throughput: 0: 5955.4. Samples: 398271878. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:26,968][25689] Avg episode reward: [(0, '-47.493')] [2022-07-09 19:31:27,821][26022] Updated weights on worker 0-0, policy_version 388933 (0.00090) [2022-07-09 19:31:29,426][26022] Updated weights on worker 0-0, policy_version 388943 (0.00107) [2022-07-09 19:31:31,365][26022] Updated weights on worker 0-0, policy_version 388953 (0.00084) [2022-07-09 19:31:32,030][25689] Fps is (10 sec: 5588.5, 60 sec: 5623.2, 300 sec: 5636.6). Total num frames: 398290944. Throughput: 0: 5111.0. Samples: 398289064. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:32,030][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 19:31:33,404][26022] Updated weights on worker 0-0, policy_version 388963 (0.00089) [2022-07-09 19:31:34,902][26022] Updated weights on worker 0-0, policy_version 388973 (0.00085) [2022-07-09 19:31:36,855][26022] Updated weights on worker 0-0, policy_version 388983 (0.00093) [2022-07-09 19:31:37,085][25689] Fps is (10 sec: 5667.7, 60 sec: 5637.0, 300 sec: 5635.8). Total num frames: 398319616. Throughput: 0: 5945.0. Samples: 398323312. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:37,086][25689] Avg episode reward: [(0, '-46.939')] [2022-07-09 19:31:38,263][26022] Updated weights on worker 0-0, policy_version 388993 (0.00085) [2022-07-09 19:31:40,479][26022] Updated weights on worker 0-0, policy_version 389003 (0.00087) [2022-07-09 19:31:42,016][26022] Updated weights on worker 0-0, policy_version 389013 (0.00091) [2022-07-09 19:31:42,116][25689] Fps is (10 sec: 5786.9, 60 sec: 5635.3, 300 sec: 5639.0). Total num frames: 398349312. Throughput: 0: 5928.8. Samples: 398357560. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:42,117][25689] Avg episode reward: [(0, '-47.585')] [2022-07-09 19:31:43,986][26022] Updated weights on worker 0-0, policy_version 389023 (0.00076) [2022-07-09 19:31:45,756][26022] Updated weights on worker 0-0, policy_version 389033 (0.00087) [2022-07-09 19:31:47,199][25689] Fps is (10 sec: 5771.2, 60 sec: 5644.9, 300 sec: 5639.9). Total num frames: 398377984. Throughput: 0: 5084.2. Samples: 398374738. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:47,199][25689] Avg episode reward: [(0, '-46.910')] [2022-07-09 19:31:47,681][26022] Updated weights on worker 0-0, policy_version 389043 (0.00091) [2022-07-09 19:31:49,280][26022] Updated weights on worker 0-0, policy_version 389053 (0.00087) [2022-07-09 19:31:51,219][26022] Updated weights on worker 0-0, policy_version 389063 (0.00084) [2022-07-09 19:31:52,259][25689] Fps is (10 sec: 5552.7, 60 sec: 5626.6, 300 sec: 5636.0). Total num frames: 398405632. Throughput: 0: 5928.5. Samples: 398408992. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:52,259][25689] Avg episode reward: [(0, '-47.114')] [2022-07-09 19:31:52,835][26022] Updated weights on worker 0-0, policy_version 389073 (0.00088) [2022-07-09 19:31:54,975][26022] Updated weights on worker 0-0, policy_version 389083 (0.00090) [2022-07-09 19:31:56,517][26022] Updated weights on worker 0-0, policy_version 389093 (0.00087) [2022-07-09 19:31:57,296][25689] Fps is (10 sec: 5577.6, 60 sec: 5641.7, 300 sec: 5636.7). Total num frames: 398434304. Throughput: 0: 5916.9. Samples: 398442900. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:31:57,297][25689] Avg episode reward: [(0, '-46.745')] [2022-07-09 19:31:58,308][26022] Updated weights on worker 0-0, policy_version 389103 (0.00087) [2022-07-09 19:32:00,187][26022] Updated weights on worker 0-0, policy_version 389113 (0.00084) [2022-07-09 19:32:02,374][25689] Fps is (10 sec: 5466.4, 60 sec: 5600.8, 300 sec: 5635.6). Total num frames: 398460928. Throughput: 0: 5053.0. Samples: 398459926. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:02,375][25689] Avg episode reward: [(0, '-46.389')] [2022-07-09 19:32:02,392][26022] Updated weights on worker 0-0, policy_version 389123 (0.00092) [2022-07-09 19:32:04,184][26022] Updated weights on worker 0-0, policy_version 389133 (0.00089) [2022-07-09 19:32:06,215][26022] Updated weights on worker 0-0, policy_version 389143 (0.00091) [2022-07-09 19:32:07,398][25689] Fps is (10 sec: 5575.1, 60 sec: 5686.5, 300 sec: 5635.8). Total num frames: 398490624. Throughput: 0: 5800.3. Samples: 398491902. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:07,399][25689] Avg episode reward: [(0, '-45.868')] [2022-07-09 19:32:07,796][26022] Updated weights on worker 0-0, policy_version 389153 (0.00085) [2022-07-09 19:32:09,711][26022] Updated weights on worker 0-0, policy_version 389163 (0.00111) [2022-07-09 19:32:11,282][26022] Updated weights on worker 0-0, policy_version 389173 (0.00086) [2022-07-09 19:32:12,460][25689] Fps is (10 sec: 5685.7, 60 sec: 5650.7, 300 sec: 5634.9). Total num frames: 398518272. Throughput: 0: 5798.3. Samples: 398526126. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:12,461][25689] Avg episode reward: [(0, '-45.491')] [2022-07-09 19:32:13,407][26022] Updated weights on worker 0-0, policy_version 389183 (0.00093) [2022-07-09 19:32:14,934][26022] Updated weights on worker 0-0, policy_version 389193 (0.00088) [2022-07-09 19:32:16,955][26022] Updated weights on worker 0-0, policy_version 389203 (0.00850) [2022-07-09 19:32:17,487][25689] Fps is (10 sec: 5582.7, 60 sec: 5631.8, 300 sec: 5637.9). Total num frames: 398546944. Throughput: 0: 4974.7. Samples: 398543340. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:17,487][25689] Avg episode reward: [(0, '-45.824')] [2022-07-09 19:32:18,339][26022] Updated weights on worker 0-0, policy_version 389213 (0.00087) [2022-07-09 19:32:20,501][26022] Updated weights on worker 0-0, policy_version 389223 (0.00084) [2022-07-09 19:32:22,231][26022] Updated weights on worker 0-0, policy_version 389233 (0.00083) [2022-07-09 19:32:22,503][25689] Fps is (10 sec: 5710.1, 60 sec: 5631.5, 300 sec: 5634.4). Total num frames: 398575616. Throughput: 0: 5839.1. Samples: 398577456. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:22,503][25689] Avg episode reward: [(0, '-46.413')] [2022-07-09 19:32:24,145][26022] Updated weights on worker 0-0, policy_version 389243 (0.00108) [2022-07-09 19:32:25,728][26022] Updated weights on worker 0-0, policy_version 389253 (0.00104) [2022-07-09 19:32:27,523][25689] Fps is (10 sec: 5713.9, 60 sec: 5648.0, 300 sec: 5638.3). Total num frames: 398604288. Throughput: 0: 5951.1. Samples: 398611664. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:27,523][25689] Avg episode reward: [(0, '-46.876')] [2022-07-09 19:32:27,656][26022] Updated weights on worker 0-0, policy_version 389263 (0.00091) [2022-07-09 19:32:29,467][26022] Updated weights on worker 0-0, policy_version 389273 (0.00073) [2022-07-09 19:32:31,088][26022] Updated weights on worker 0-0, policy_version 389283 (0.00093) [2022-07-09 19:32:32,646][25689] Fps is (10 sec: 5653.3, 60 sec: 5642.3, 300 sec: 5632.8). Total num frames: 398632960. Throughput: 0: 5092.5. Samples: 398628922. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:32,647][25689] Avg episode reward: [(0, '-46.728')] [2022-07-09 19:32:33,071][26022] Updated weights on worker 0-0, policy_version 389293 (0.00082) [2022-07-09 19:32:34,755][26022] Updated weights on worker 0-0, policy_version 389303 (0.00088) [2022-07-09 19:32:36,593][26022] Updated weights on worker 0-0, policy_version 389313 (0.00089) [2022-07-09 19:32:37,701][25689] Fps is (10 sec: 5734.5, 60 sec: 5659.3, 300 sec: 5638.7). Total num frames: 398662656. Throughput: 0: 5929.2. Samples: 398663194. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:37,702][25689] Avg episode reward: [(0, '-47.597')] [2022-07-09 19:32:38,334][26022] Updated weights on worker 0-0, policy_version 389323 (0.00093) [2022-07-09 19:32:40,084][26022] Updated weights on worker 0-0, policy_version 389333 (0.00084) [2022-07-09 19:32:41,829][26022] Updated weights on worker 0-0, policy_version 389343 (0.00094) [2022-07-09 19:32:42,762][25689] Fps is (10 sec: 5769.7, 60 sec: 5639.5, 300 sec: 5641.1). Total num frames: 398691328. Throughput: 0: 5923.9. Samples: 398697472. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:42,763][25689] Avg episode reward: [(0, '-47.369')] [2022-07-09 19:32:43,723][26022] Updated weights on worker 0-0, policy_version 389353 (0.00085) [2022-07-09 19:32:45,527][26022] Updated weights on worker 0-0, policy_version 389363 (0.00092) [2022-07-09 19:32:47,332][26022] Updated weights on worker 0-0, policy_version 389373 (0.00093) [2022-07-09 19:32:47,787][25689] Fps is (10 sec: 5685.7, 60 sec: 5645.0, 300 sec: 5638.4). Total num frames: 398720000. Throughput: 0: 5089.9. Samples: 398714806. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:47,787][25689] Avg episode reward: [(0, '-47.191')] [2022-07-09 19:32:49,055][26022] Updated weights on worker 0-0, policy_version 389383 (0.00088) [2022-07-09 19:32:50,730][26022] Updated weights on worker 0-0, policy_version 389393 (0.00085) [2022-07-09 19:32:52,741][26022] Updated weights on worker 0-0, policy_version 389403 (0.00087) [2022-07-09 19:32:52,832][25689] Fps is (10 sec: 5796.3, 60 sec: 5680.1, 300 sec: 5644.5). Total num frames: 398749696. Throughput: 0: 5953.4. Samples: 398749100. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:52,833][25689] Avg episode reward: [(0, '-46.137')] [2022-07-09 19:32:54,564][26022] Updated weights on worker 0-0, policy_version 389413 (0.00088) [2022-07-09 19:32:56,324][26022] Updated weights on worker 0-0, policy_version 389423 (0.00090) [2022-07-09 19:32:57,834][25689] Fps is (10 sec: 5809.3, 60 sec: 5683.5, 300 sec: 5644.9). Total num frames: 398778368. Throughput: 0: 5947.0. Samples: 398782924. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:32:57,835][25689] Avg episode reward: [(0, '-45.828')] [2022-07-09 19:32:58,246][26022] Updated weights on worker 0-0, policy_version 389433 (0.01232) [2022-07-09 19:32:59,947][26022] Updated weights on worker 0-0, policy_version 389443 (0.00089) [2022-07-09 19:33:01,647][26022] Updated weights on worker 0-0, policy_version 389453 (0.00101) [2022-07-09 19:33:02,844][25689] Fps is (10 sec: 5318.8, 60 sec: 5656.0, 300 sec: 5641.7). Total num frames: 398802944. Throughput: 0: 5111.6. Samples: 398800120. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:33:02,844][25689] Avg episode reward: [(0, '-45.771')] [2022-07-09 19:33:03,861][26022] Updated weights on worker 0-0, policy_version 389463 (0.00091) [2022-07-09 19:33:05,552][26022] Updated weights on worker 0-0, policy_version 389473 (0.00089) [2022-07-09 19:33:07,474][26022] Updated weights on worker 0-0, policy_version 389483 (0.00627) [2022-07-09 19:33:07,857][25689] Fps is (10 sec: 5414.5, 60 sec: 5657.0, 300 sec: 5643.6). Total num frames: 398832640. Throughput: 0: 5856.5. Samples: 398832350. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:33:07,858][25689] Avg episode reward: [(0, '-45.461')] [2022-07-09 19:33:09,436][26022] Updated weights on worker 0-0, policy_version 389493 (0.00085) [2022-07-09 19:33:11,083][26022] Updated weights on worker 0-0, policy_version 389503 (0.00095) [2022-07-09 19:33:12,762][26022] Updated weights on worker 0-0, policy_version 389513 (0.00093) [2022-07-09 19:33:12,959][25689] Fps is (10 sec: 5871.4, 60 sec: 5687.1, 300 sec: 5646.9). Total num frames: 398862336. Throughput: 0: 5835.5. Samples: 398866550. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-09 19:33:12,960][25689] Avg episode reward: [(0, '-45.126')] [2022-07-09 19:33:14,599][26022] Updated weights on worker 0-0, policy_version 389523 (0.00086) [2022-07-09 19:33:16,310][26022] Updated weights on worker 0-0, policy_version 389533 (0.00094) [2022-07-09 19:33:17,965][25689] Fps is (10 sec: 5673.3, 60 sec: 5672.1, 300 sec: 5648.8). Total num frames: 398889984. Throughput: 0: 5008.5. Samples: 398883750. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:17,967][25689] Avg episode reward: [(0, '-44.642')] [2022-07-09 19:33:18,240][26022] Updated weights on worker 0-0, policy_version 389543 (0.00091) [2022-07-09 19:33:20,027][26022] Updated weights on worker 0-0, policy_version 389553 (0.00090) [2022-07-09 19:33:21,810][26022] Updated weights on worker 0-0, policy_version 389563 (0.00088) [2022-07-09 19:33:22,996][25689] Fps is (10 sec: 5713.3, 60 sec: 5687.6, 300 sec: 5648.7). Total num frames: 398919680. Throughput: 0: 5861.9. Samples: 398918252. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:22,996][25689] Avg episode reward: [(0, '-45.543')] [2022-07-09 19:33:23,603][26022] Updated weights on worker 0-0, policy_version 389573 (0.00084) [2022-07-09 19:33:25,412][26022] Updated weights on worker 0-0, policy_version 389583 (0.00093) [2022-07-09 19:33:26,542][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:33:26,555][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000389589_398939136.pth [2022-07-09 19:33:26,556][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000387602_396904448.pth [2022-07-09 19:33:27,155][26022] Updated weights on worker 0-0, policy_version 389593 (0.00089) [2022-07-09 19:33:28,062][25689] Fps is (10 sec: 5679.6, 60 sec: 5666.4, 300 sec: 5645.3). Total num frames: 398947328. Throughput: 0: 5943.2. Samples: 398952428. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:28,062][25689] Avg episode reward: [(0, '-46.807')] [2022-07-09 19:33:29,007][26022] Updated weights on worker 0-0, policy_version 389603 (0.00085) [2022-07-09 19:33:30,838][26022] Updated weights on worker 0-0, policy_version 389613 (0.00083) [2022-07-09 19:33:32,899][26022] Updated weights on worker 0-0, policy_version 389623 (0.00081) [2022-07-09 19:33:33,108][25689] Fps is (10 sec: 5569.5, 60 sec: 5673.6, 300 sec: 5649.4). Total num frames: 398976000. Throughput: 0: 5096.3. Samples: 398969236. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:33,110][25689] Avg episode reward: [(0, '-47.178')] [2022-07-09 19:33:34,308][26022] Updated weights on worker 0-0, policy_version 389633 (0.00089) [2022-07-09 19:33:36,492][26022] Updated weights on worker 0-0, policy_version 389643 (0.00089) [2022-07-09 19:33:37,952][26022] Updated weights on worker 0-0, policy_version 389653 (0.00094) [2022-07-09 19:33:38,117][25689] Fps is (10 sec: 5702.9, 60 sec: 5661.0, 300 sec: 5647.1). Total num frames: 399004672. Throughput: 0: 5946.6. Samples: 399003584. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:38,117][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 19:33:39,951][26022] Updated weights on worker 0-0, policy_version 389663 (0.00098) [2022-07-09 19:33:41,543][26022] Updated weights on worker 0-0, policy_version 389673 (0.00083) [2022-07-09 19:33:43,132][25689] Fps is (10 sec: 5720.8, 60 sec: 5665.4, 300 sec: 5650.4). Total num frames: 399033344. Throughput: 0: 5947.3. Samples: 399038008. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:43,133][25689] Avg episode reward: [(0, '-47.302')] [2022-07-09 19:33:43,546][26022] Updated weights on worker 0-0, policy_version 389683 (0.00081) [2022-07-09 19:33:45,172][26022] Updated weights on worker 0-0, policy_version 389693 (0.00086) [2022-07-09 19:33:46,860][26022] Updated weights on worker 0-0, policy_version 389703 (0.00090) [2022-07-09 19:33:48,159][25689] Fps is (10 sec: 5710.4, 60 sec: 5665.1, 300 sec: 5647.9). Total num frames: 399062016. Throughput: 0: 5115.5. Samples: 399055236. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:48,161][25689] Avg episode reward: [(0, '-46.494')] [2022-07-09 19:33:48,778][26022] Updated weights on worker 0-0, policy_version 389713 (0.00091) [2022-07-09 19:33:50,673][26022] Updated weights on worker 0-0, policy_version 389723 (0.01109) [2022-07-09 19:33:52,339][26022] Updated weights on worker 0-0, policy_version 389733 (0.00093) [2022-07-09 19:33:53,231][25689] Fps is (10 sec: 5779.9, 60 sec: 5662.7, 300 sec: 5658.1). Total num frames: 399091712. Throughput: 0: 5968.3. Samples: 399089332. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:53,231][25689] Avg episode reward: [(0, '-46.945')] [2022-07-09 19:33:54,267][26022] Updated weights on worker 0-0, policy_version 389743 (0.00088) [2022-07-09 19:33:55,798][26022] Updated weights on worker 0-0, policy_version 389753 (0.00088) [2022-07-09 19:33:57,947][26022] Updated weights on worker 0-0, policy_version 389763 (0.00090) [2022-07-09 19:33:58,262][25689] Fps is (10 sec: 5473.0, 60 sec: 5609.0, 300 sec: 5644.1). Total num frames: 399117312. Throughput: 0: 5962.5. Samples: 399123702. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:33:58,263][25689] Avg episode reward: [(0, '-45.502')] [2022-07-09 19:33:59,387][26022] Updated weights on worker 0-0, policy_version 389773 (0.00086) [2022-07-09 19:34:01,912][26022] Updated weights on worker 0-0, policy_version 389783 (0.00087) [2022-07-09 19:34:03,266][25689] Fps is (10 sec: 5509.8, 60 sec: 5694.3, 300 sec: 5655.6). Total num frames: 399147008. Throughput: 0: 5106.7. Samples: 399140828. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:03,267][25689] Avg episode reward: [(0, '-45.410')] [2022-07-09 19:34:03,357][26022] Updated weights on worker 0-0, policy_version 389793 (0.00090) [2022-07-09 19:34:05,348][26022] Updated weights on worker 0-0, policy_version 389803 (0.00090) [2022-07-09 19:34:06,898][26022] Updated weights on worker 0-0, policy_version 389813 (0.00086) [2022-07-09 19:34:08,279][25689] Fps is (10 sec: 5724.6, 60 sec: 5660.4, 300 sec: 5650.5). Total num frames: 399174656. Throughput: 0: 5860.4. Samples: 399173152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:08,280][25689] Avg episode reward: [(0, '-45.064')] [2022-07-09 19:34:09,001][26022] Updated weights on worker 0-0, policy_version 389823 (0.00084) [2022-07-09 19:34:10,689][26022] Updated weights on worker 0-0, policy_version 389833 (0.00060) [2022-07-09 19:34:12,540][26022] Updated weights on worker 0-0, policy_version 389843 (0.00087) [2022-07-09 19:34:13,355][25689] Fps is (10 sec: 5582.7, 60 sec: 5646.0, 300 sec: 5656.6). Total num frames: 399203328. Throughput: 0: 5879.6. Samples: 399207654. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:13,355][25689] Avg episode reward: [(0, '-46.112')] [2022-07-09 19:34:14,140][26022] Updated weights on worker 0-0, policy_version 389853 (0.00103) [2022-07-09 19:34:16,101][26022] Updated weights on worker 0-0, policy_version 389863 (0.00085) [2022-07-09 19:34:17,742][26022] Updated weights on worker 0-0, policy_version 389873 (0.00098) [2022-07-09 19:34:18,378][25689] Fps is (10 sec: 5779.8, 60 sec: 5678.3, 300 sec: 5656.8). Total num frames: 399233024. Throughput: 0: 5876.9. Samples: 399241920. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:18,379][25689] Avg episode reward: [(0, '-46.420')] [2022-07-09 19:34:19,764][26022] Updated weights on worker 0-0, policy_version 389883 (0.00087) [2022-07-09 19:34:21,252][26022] Updated weights on worker 0-0, policy_version 389893 (0.00087) [2022-07-09 19:34:23,384][25689] Fps is (10 sec: 5615.8, 60 sec: 5629.8, 300 sec: 5653.7). Total num frames: 399259648. Throughput: 0: 5880.2. Samples: 399259120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:23,384][25689] Avg episode reward: [(0, '-46.528')] [2022-07-09 19:34:23,457][26022] Updated weights on worker 0-0, policy_version 389903 (0.00086) [2022-07-09 19:34:24,855][26022] Updated weights on worker 0-0, policy_version 389913 (0.00086) [2022-07-09 19:34:26,960][26022] Updated weights on worker 0-0, policy_version 389923 (0.00094) [2022-07-09 19:34:28,396][25689] Fps is (10 sec: 5621.7, 60 sec: 5668.6, 300 sec: 5651.3). Total num frames: 399289344. Throughput: 0: 5962.3. Samples: 399293096. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:28,397][25689] Avg episode reward: [(0, '-46.280')] [2022-07-09 19:34:28,586][26022] Updated weights on worker 0-0, policy_version 389933 (0.00088) [2022-07-09 19:34:30,573][26022] Updated weights on worker 0-0, policy_version 389943 (0.00086) [2022-07-09 19:34:32,252][26022] Updated weights on worker 0-0, policy_version 389953 (0.00090) [2022-07-09 19:34:33,469][25689] Fps is (10 sec: 5889.1, 60 sec: 5683.2, 300 sec: 5662.2). Total num frames: 399319040. Throughput: 0: 5940.1. Samples: 399327134. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:33,469][25689] Avg episode reward: [(0, '-46.775')] [2022-07-09 19:34:34,120][26022] Updated weights on worker 0-0, policy_version 389963 (0.00086) [2022-07-09 19:34:35,789][26022] Updated weights on worker 0-0, policy_version 389973 (0.00093) [2022-07-09 19:34:37,899][26022] Updated weights on worker 0-0, policy_version 389983 (0.00087) [2022-07-09 19:34:38,518][25689] Fps is (10 sec: 5564.1, 60 sec: 5645.4, 300 sec: 5651.6). Total num frames: 399345664. Throughput: 0: 5068.9. Samples: 399344010. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:38,519][25689] Avg episode reward: [(0, '-46.185')] [2022-07-09 19:34:39,553][26022] Updated weights on worker 0-0, policy_version 389993 (0.00083) [2022-07-09 19:34:41,568][26022] Updated weights on worker 0-0, policy_version 390003 (0.00089) [2022-07-09 19:34:43,130][26022] Updated weights on worker 0-0, policy_version 390013 (0.00085) [2022-07-09 19:34:43,559][25689] Fps is (10 sec: 5480.3, 60 sec: 5643.1, 300 sec: 5648.9). Total num frames: 399374336. Throughput: 0: 5911.8. Samples: 399378392. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:43,559][25689] Avg episode reward: [(0, '-46.046')] [2022-07-09 19:34:45,065][26022] Updated weights on worker 0-0, policy_version 390023 (0.00086) [2022-07-09 19:34:46,783][26022] Updated weights on worker 0-0, policy_version 390033 (0.00083) [2022-07-09 19:34:48,508][26022] Updated weights on worker 0-0, policy_version 390043 (0.00090) [2022-07-09 19:34:48,586][25689] Fps is (10 sec: 5797.5, 60 sec: 5659.9, 300 sec: 5652.6). Total num frames: 399404032. Throughput: 0: 5917.8. Samples: 399412576. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:48,587][25689] Avg episode reward: [(0, '-46.694')] [2022-07-09 19:34:50,303][26022] Updated weights on worker 0-0, policy_version 390053 (0.00088) [2022-07-09 19:34:52,263][26022] Updated weights on worker 0-0, policy_version 390063 (0.00079) [2022-07-09 19:34:53,635][25689] Fps is (10 sec: 5792.6, 60 sec: 5645.1, 300 sec: 5655.4). Total num frames: 399432704. Throughput: 0: 5080.5. Samples: 399429588. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:53,636][25689] Avg episode reward: [(0, '-46.717')] [2022-07-09 19:34:53,984][26022] Updated weights on worker 0-0, policy_version 390073 (0.00080) [2022-07-09 19:34:55,875][26022] Updated weights on worker 0-0, policy_version 390083 (0.00449) [2022-07-09 19:34:57,591][26022] Updated weights on worker 0-0, policy_version 390093 (0.00090) [2022-07-09 19:34:58,682][25689] Fps is (10 sec: 5680.2, 60 sec: 5694.6, 300 sec: 5654.7). Total num frames: 399461376. Throughput: 0: 5939.6. Samples: 399463772. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:34:58,683][25689] Avg episode reward: [(0, '-46.799')] [2022-07-09 19:34:59,639][26022] Updated weights on worker 0-0, policy_version 390103 (0.00091) [2022-07-09 19:35:00,999][26022] Updated weights on worker 0-0, policy_version 390113 (0.00094) [2022-07-09 19:35:03,559][26022] Updated weights on worker 0-0, policy_version 390123 (0.00105) [2022-07-09 19:35:03,696][25689] Fps is (10 sec: 5292.6, 60 sec: 5608.9, 300 sec: 5644.6). Total num frames: 399485952. Throughput: 0: 5828.5. Samples: 399495762. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:03,698][25689] Avg episode reward: [(0, '-48.166')] [2022-07-09 19:35:05,154][26022] Updated weights on worker 0-0, policy_version 390133 (0.00080) [2022-07-09 19:35:07,139][26022] Updated weights on worker 0-0, policy_version 390143 (0.00085) [2022-07-09 19:35:08,717][26022] Updated weights on worker 0-0, policy_version 390153 (0.00085) [2022-07-09 19:35:08,719][25689] Fps is (10 sec: 5407.3, 60 sec: 5641.9, 300 sec: 5649.5). Total num frames: 399515648. Throughput: 0: 4978.6. Samples: 399512806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:08,719][25689] Avg episode reward: [(0, '-47.490')] [2022-07-09 19:35:10,671][26022] Updated weights on worker 0-0, policy_version 390163 (0.00086) [2022-07-09 19:35:12,498][26022] Updated weights on worker 0-0, policy_version 390173 (0.00085) [2022-07-09 19:35:13,779][25689] Fps is (10 sec: 5788.4, 60 sec: 5643.3, 300 sec: 5649.1). Total num frames: 399544320. Throughput: 0: 5825.9. Samples: 399546948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:13,780][25689] Avg episode reward: [(0, '-48.108')] [2022-07-09 19:35:14,330][26022] Updated weights on worker 0-0, policy_version 390183 (0.00088) [2022-07-09 19:35:15,864][26022] Updated weights on worker 0-0, policy_version 390193 (0.00093) [2022-07-09 19:35:17,827][26022] Updated weights on worker 0-0, policy_version 390203 (0.00093) [2022-07-09 19:35:18,797][25689] Fps is (10 sec: 5689.5, 60 sec: 5626.8, 300 sec: 5645.7). Total num frames: 399572992. Throughput: 0: 5849.0. Samples: 399581428. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:18,798][25689] Avg episode reward: [(0, '-47.876')] [2022-07-09 19:35:19,546][26022] Updated weights on worker 0-0, policy_version 390213 (0.00082) [2022-07-09 19:35:21,266][26022] Updated weights on worker 0-0, policy_version 390223 (0.00081) [2022-07-09 19:35:23,112][26022] Updated weights on worker 0-0, policy_version 390233 (0.00091) [2022-07-09 19:35:23,813][25689] Fps is (10 sec: 5817.6, 60 sec: 5676.7, 300 sec: 5652.5). Total num frames: 399602688. Throughput: 0: 5114.6. Samples: 399598652. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:23,813][25689] Avg episode reward: [(0, '-48.104')] [2022-07-09 19:35:24,927][26022] Updated weights on worker 0-0, policy_version 390243 (0.00085) [2022-07-09 19:35:26,565][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:35:26,577][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000390252_399618048.pth [2022-07-09 19:35:26,578][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000388262_397580288.pth [2022-07-09 19:35:26,767][26022] Updated weights on worker 0-0, policy_version 390253 (0.00086) [2022-07-09 19:35:28,411][26022] Updated weights on worker 0-0, policy_version 390263 (0.00090) [2022-07-09 19:35:28,828][25689] Fps is (10 sec: 5717.1, 60 sec: 5642.6, 300 sec: 5649.6). Total num frames: 399630336. Throughput: 0: 5968.5. Samples: 399632828. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:28,828][25689] Avg episode reward: [(0, '-48.375')] [2022-07-09 19:35:30,428][26022] Updated weights on worker 0-0, policy_version 390273 (0.00085) [2022-07-09 19:35:32,219][26022] Updated weights on worker 0-0, policy_version 390283 (0.00092) [2022-07-09 19:35:33,892][25689] Fps is (10 sec: 5587.9, 60 sec: 5626.5, 300 sec: 5652.2). Total num frames: 399659008. Throughput: 0: 5953.0. Samples: 399666676. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:33,899][25689] Avg episode reward: [(0, '-48.016')] [2022-07-09 19:35:33,903][26022] Updated weights on worker 0-0, policy_version 390293 (0.00089) [2022-07-09 19:35:35,689][26022] Updated weights on worker 0-0, policy_version 390303 (0.00088) [2022-07-09 19:35:37,464][26022] Updated weights on worker 0-0, policy_version 390313 (0.00087) [2022-07-09 19:35:38,911][25689] Fps is (10 sec: 5788.5, 60 sec: 5680.2, 300 sec: 5652.1). Total num frames: 399688704. Throughput: 0: 5087.5. Samples: 399683756. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:38,912][25689] Avg episode reward: [(0, '-48.042')] [2022-07-09 19:35:39,198][26022] Updated weights on worker 0-0, policy_version 390323 (0.00085) [2022-07-09 19:35:41,179][26022] Updated weights on worker 0-0, policy_version 390333 (0.00081) [2022-07-09 19:35:43,141][26022] Updated weights on worker 0-0, policy_version 390343 (0.00087) [2022-07-09 19:35:43,962][25689] Fps is (10 sec: 5592.8, 60 sec: 5645.3, 300 sec: 5647.7). Total num frames: 399715328. Throughput: 0: 5923.8. Samples: 399718010. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:43,962][25689] Avg episode reward: [(0, '-47.615')] [2022-07-09 19:35:44,539][26022] Updated weights on worker 0-0, policy_version 390353 (0.00087) [2022-07-09 19:35:46,788][26022] Updated weights on worker 0-0, policy_version 390363 (0.00096) [2022-07-09 19:35:48,309][26022] Updated weights on worker 0-0, policy_version 390373 (0.00084) [2022-07-09 19:35:48,968][25689] Fps is (10 sec: 5498.3, 60 sec: 5630.3, 300 sec: 5648.5). Total num frames: 399744000. Throughput: 0: 5909.3. Samples: 399751844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:48,968][25689] Avg episode reward: [(0, '-47.588')] [2022-07-09 19:35:50,278][26022] Updated weights on worker 0-0, policy_version 390383 (0.00086) [2022-07-09 19:35:52,245][26022] Updated weights on worker 0-0, policy_version 390393 (0.00092) [2022-07-09 19:35:53,862][26022] Updated weights on worker 0-0, policy_version 390403 (0.00086) [2022-07-09 19:35:54,009][25689] Fps is (10 sec: 5707.4, 60 sec: 5631.1, 300 sec: 5651.5). Total num frames: 399772672. Throughput: 0: 5071.8. Samples: 399768706. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 19:35:54,010][25689] Avg episode reward: [(0, '-47.131')] [2022-07-09 19:35:55,875][26022] Updated weights on worker 0-0, policy_version 390413 (0.00085) [2022-07-09 19:35:57,431][26022] Updated weights on worker 0-0, policy_version 390423 (0.00084) [2022-07-09 19:35:59,046][25689] Fps is (10 sec: 5588.3, 60 sec: 5615.0, 300 sec: 5647.4). Total num frames: 399800320. Throughput: 0: 5901.6. Samples: 399802586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:35:59,047][25689] Avg episode reward: [(0, '-47.247')] [2022-07-09 19:35:59,345][26022] Updated weights on worker 0-0, policy_version 390433 (0.00091) [2022-07-09 19:36:01,311][26022] Updated weights on worker 0-0, policy_version 390443 (0.00084) [2022-07-09 19:36:03,398][26022] Updated weights on worker 0-0, policy_version 390453 (0.00086) [2022-07-09 19:36:04,066][25689] Fps is (10 sec: 5498.3, 60 sec: 5665.3, 300 sec: 5658.0). Total num frames: 399827968. Throughput: 0: 5782.3. Samples: 399834258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:04,068][25689] Avg episode reward: [(0, '-46.574')] [2022-07-09 19:36:05,252][26022] Updated weights on worker 0-0, policy_version 390463 (0.00089) [2022-07-09 19:36:06,918][26022] Updated weights on worker 0-0, policy_version 390473 (0.00096) [2022-07-09 19:36:08,947][26022] Updated weights on worker 0-0, policy_version 390483 (0.00087) [2022-07-09 19:36:09,078][25689] Fps is (10 sec: 5512.2, 60 sec: 5632.4, 300 sec: 5651.7). Total num frames: 399855616. Throughput: 0: 4944.6. Samples: 399851278. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:09,079][25689] Avg episode reward: [(0, '-46.143')] [2022-07-09 19:36:10,560][26022] Updated weights on worker 0-0, policy_version 390493 (0.00087) [2022-07-09 19:36:12,459][26022] Updated weights on worker 0-0, policy_version 390503 (0.00095) [2022-07-09 19:36:14,208][25689] Fps is (10 sec: 5552.8, 60 sec: 5625.9, 300 sec: 5645.8). Total num frames: 399884288. Throughput: 0: 5744.4. Samples: 399884740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:14,210][25689] Avg episode reward: [(0, '-46.709')] [2022-07-09 19:36:14,320][26022] Updated weights on worker 0-0, policy_version 390513 (0.00095) [2022-07-09 19:36:16,113][26022] Updated weights on worker 0-0, policy_version 390523 (0.00090) [2022-07-09 19:36:17,884][26022] Updated weights on worker 0-0, policy_version 390533 (0.00089) [2022-07-09 19:36:19,243][25689] Fps is (10 sec: 5641.2, 60 sec: 5624.4, 300 sec: 5645.4). Total num frames: 399912960. Throughput: 0: 5771.7. Samples: 399919156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:19,244][25689] Avg episode reward: [(0, '-46.268')] [2022-07-09 19:36:19,678][26022] Updated weights on worker 0-0, policy_version 390543 (0.00090) [2022-07-09 19:36:21,499][26022] Updated weights on worker 0-0, policy_version 390553 (0.00095) [2022-07-09 19:36:23,300][26022] Updated weights on worker 0-0, policy_version 390563 (0.00081) [2022-07-09 19:36:24,253][25689] Fps is (10 sec: 5810.9, 60 sec: 5624.8, 300 sec: 5652.4). Total num frames: 399942656. Throughput: 0: 5053.8. Samples: 399936280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:24,253][25689] Avg episode reward: [(0, '-45.626')] [2022-07-09 19:36:25,182][26022] Updated weights on worker 0-0, policy_version 390573 (0.00090) [2022-07-09 19:36:26,799][26022] Updated weights on worker 0-0, policy_version 390583 (0.00087) [2022-07-09 19:36:28,771][26022] Updated weights on worker 0-0, policy_version 390593 (0.00084) [2022-07-09 19:36:29,271][25689] Fps is (10 sec: 5718.6, 60 sec: 5624.6, 300 sec: 5649.8). Total num frames: 399970304. Throughput: 0: 5901.7. Samples: 399970450. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:29,273][25689] Avg episode reward: [(0, '-45.838')] [2022-07-09 19:36:30,655][26022] Updated weights on worker 0-0, policy_version 390603 (0.00083) [2022-07-09 19:36:32,472][26022] Updated weights on worker 0-0, policy_version 390613 (0.00084) [2022-07-09 19:36:34,207][26022] Updated weights on worker 0-0, policy_version 390623 (0.00096) [2022-07-09 19:36:34,393][25689] Fps is (10 sec: 5554.4, 60 sec: 5619.2, 300 sec: 5648.6). Total num frames: 399998976. Throughput: 0: 5917.9. Samples: 400004190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:34,394][25689] Avg episode reward: [(0, '-47.019')] [2022-07-09 19:36:35,969][26022] Updated weights on worker 0-0, policy_version 390633 (0.00088) [2022-07-09 19:36:37,564][26022] Updated weights on worker 0-0, policy_version 390643 (0.00094) [2022-07-09 19:36:39,406][25689] Fps is (10 sec: 5557.0, 60 sec: 5586.0, 300 sec: 5642.0). Total num frames: 400026624. Throughput: 0: 5065.0. Samples: 400021280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:39,407][25689] Avg episode reward: [(0, '-46.436')] [2022-07-09 19:36:39,575][26022] Updated weights on worker 0-0, policy_version 390653 (0.00086) [2022-07-09 19:36:41,308][26022] Updated weights on worker 0-0, policy_version 390663 (0.00089) [2022-07-09 19:36:43,155][26022] Updated weights on worker 0-0, policy_version 390673 (0.00091) [2022-07-09 19:36:44,490][25689] Fps is (10 sec: 5780.4, 60 sec: 5650.5, 300 sec: 5648.9). Total num frames: 400057344. Throughput: 0: 5891.3. Samples: 400055504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:44,492][25689] Avg episode reward: [(0, '-45.761')] [2022-07-09 19:36:44,792][26022] Updated weights on worker 0-0, policy_version 390683 (0.00090) [2022-07-09 19:36:46,699][26022] Updated weights on worker 0-0, policy_version 390693 (0.00088) [2022-07-09 19:36:48,526][26022] Updated weights on worker 0-0, policy_version 390703 (0.00091) [2022-07-09 19:36:49,505][25689] Fps is (10 sec: 5779.2, 60 sec: 5632.8, 300 sec: 5649.7). Total num frames: 400084992. Throughput: 0: 5886.3. Samples: 400089556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:49,507][25689] Avg episode reward: [(0, '-46.721')] [2022-07-09 19:36:50,380][26022] Updated weights on worker 0-0, policy_version 390713 (0.00097) [2022-07-09 19:36:52,112][26022] Updated weights on worker 0-0, policy_version 390723 (0.00085) [2022-07-09 19:36:53,973][26022] Updated weights on worker 0-0, policy_version 390733 (0.00082) [2022-07-09 19:36:54,589][25689] Fps is (10 sec: 5678.2, 60 sec: 5645.6, 300 sec: 5652.3). Total num frames: 400114688. Throughput: 0: 5078.6. Samples: 400106760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:54,590][25689] Avg episode reward: [(0, '-47.383')] [2022-07-09 19:36:55,681][26022] Updated weights on worker 0-0, policy_version 390743 (0.00090) [2022-07-09 19:36:57,489][26022] Updated weights on worker 0-0, policy_version 390753 (0.00089) [2022-07-09 19:36:59,197][26022] Updated weights on worker 0-0, policy_version 390763 (0.00091) [2022-07-09 19:36:59,614][25689] Fps is (10 sec: 5672.9, 60 sec: 5646.9, 300 sec: 5656.7). Total num frames: 400142336. Throughput: 0: 5935.5. Samples: 400141226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:36:59,615][25689] Avg episode reward: [(0, '-47.227')] [2022-07-09 19:37:01,155][26022] Updated weights on worker 0-0, policy_version 390773 (0.00434) [2022-07-09 19:37:03,321][26022] Updated weights on worker 0-0, policy_version 390783 (0.00088) [2022-07-09 19:37:04,662][25689] Fps is (10 sec: 5388.1, 60 sec: 5627.3, 300 sec: 5646.0). Total num frames: 400168960. Throughput: 0: 5821.9. Samples: 400172942. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:04,662][25689] Avg episode reward: [(0, '-45.775')] [2022-07-09 19:37:05,055][26022] Updated weights on worker 0-0, policy_version 390793 (0.00097) [2022-07-09 19:37:06,914][26022] Updated weights on worker 0-0, policy_version 390803 (0.00082) [2022-07-09 19:37:08,740][26022] Updated weights on worker 0-0, policy_version 390813 (0.00091) [2022-07-09 19:37:09,762][25689] Fps is (10 sec: 5347.8, 60 sec: 5619.1, 300 sec: 5645.2). Total num frames: 400196608. Throughput: 0: 5821.9. Samples: 400207490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:09,764][25689] Avg episode reward: [(0, '-46.626')] [2022-07-09 19:37:10,373][26022] Updated weights on worker 0-0, policy_version 390823 (0.00087) [2022-07-09 19:37:12,413][26022] Updated weights on worker 0-0, policy_version 390833 (0.00086) [2022-07-09 19:37:13,910][26022] Updated weights on worker 0-0, policy_version 390843 (0.00086) [2022-07-09 19:37:14,873][25689] Fps is (10 sec: 5716.3, 60 sec: 5654.7, 300 sec: 5650.5). Total num frames: 400227328. Throughput: 0: 5822.0. Samples: 400224850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:14,873][25689] Avg episode reward: [(0, '-45.564')] [2022-07-09 19:37:15,956][26022] Updated weights on worker 0-0, policy_version 390853 (0.00093) [2022-07-09 19:37:17,764][26022] Updated weights on worker 0-0, policy_version 390863 (0.00087) [2022-07-09 19:37:19,452][26022] Updated weights on worker 0-0, policy_version 390873 (0.00090) [2022-07-09 19:37:19,951][25689] Fps is (10 sec: 5929.4, 60 sec: 5667.5, 300 sec: 5652.8). Total num frames: 400257024. Throughput: 0: 5799.0. Samples: 400259164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:19,952][25689] Avg episode reward: [(0, '-46.547')] [2022-07-09 19:37:21,308][26022] Updated weights on worker 0-0, policy_version 390883 (0.00082) [2022-07-09 19:37:22,880][26022] Updated weights on worker 0-0, policy_version 390893 (0.00084) [2022-07-09 19:37:24,987][25689] Fps is (10 sec: 5669.4, 60 sec: 5631.3, 300 sec: 5649.0). Total num frames: 400284672. Throughput: 0: 5926.3. Samples: 400293396. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:24,988][25689] Avg episode reward: [(0, '-45.908')] [2022-07-09 19:37:24,994][26022] Updated weights on worker 0-0, policy_version 390903 (0.00620) [2022-07-09 19:37:26,605][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:37:26,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000390913_400294912.pth [2022-07-09 19:37:26,619][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000388926_398260224.pth [2022-07-09 19:37:26,623][26022] Updated weights on worker 0-0, policy_version 390913 (0.00084) [2022-07-09 19:37:28,541][26022] Updated weights on worker 0-0, policy_version 390923 (0.00089) [2022-07-09 19:37:30,075][25689] Fps is (10 sec: 5563.5, 60 sec: 5641.7, 300 sec: 5649.7). Total num frames: 400313344. Throughput: 0: 5062.1. Samples: 400310314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:30,076][25689] Avg episode reward: [(0, '-46.901')] [2022-07-09 19:37:30,217][26022] Updated weights on worker 0-0, policy_version 390933 (0.00083) [2022-07-09 19:37:32,163][26022] Updated weights on worker 0-0, policy_version 390943 (0.00092) [2022-07-09 19:37:33,902][26022] Updated weights on worker 0-0, policy_version 390953 (0.00092) [2022-07-09 19:37:35,196][25689] Fps is (10 sec: 5717.7, 60 sec: 5658.7, 300 sec: 5648.4). Total num frames: 400343040. Throughput: 0: 5871.8. Samples: 400344182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:35,196][25689] Avg episode reward: [(0, '-47.031')] [2022-07-09 19:37:35,830][26022] Updated weights on worker 0-0, policy_version 390963 (0.00090) [2022-07-09 19:37:37,400][26022] Updated weights on worker 0-0, policy_version 390973 (0.00088) [2022-07-09 19:37:39,311][26022] Updated weights on worker 0-0, policy_version 390983 (0.00093) [2022-07-09 19:37:40,215][25689] Fps is (10 sec: 5655.1, 60 sec: 5658.1, 300 sec: 5645.8). Total num frames: 400370688. Throughput: 0: 5890.7. Samples: 400378530. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:40,215][25689] Avg episode reward: [(0, '-46.866')] [2022-07-09 19:37:40,978][26022] Updated weights on worker 0-0, policy_version 390993 (0.00081) [2022-07-09 19:37:43,035][26022] Updated weights on worker 0-0, policy_version 391003 (0.00101) [2022-07-09 19:37:44,520][26022] Updated weights on worker 0-0, policy_version 391013 (0.00090) [2022-07-09 19:37:45,275][25689] Fps is (10 sec: 5689.4, 60 sec: 5643.5, 300 sec: 5648.6). Total num frames: 400400384. Throughput: 0: 5041.9. Samples: 400395684. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:45,275][25689] Avg episode reward: [(0, '-47.350')] [2022-07-09 19:37:46,408][26022] Updated weights on worker 0-0, policy_version 391023 (0.00997) [2022-07-09 19:37:48,234][26022] Updated weights on worker 0-0, policy_version 391033 (0.00092) [2022-07-09 19:37:49,781][26022] Updated weights on worker 0-0, policy_version 391043 (0.00093) [2022-07-09 19:37:50,351][25689] Fps is (10 sec: 5859.4, 60 sec: 5671.5, 300 sec: 5648.0). Total num frames: 400430080. Throughput: 0: 5912.4. Samples: 400430196. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:50,351][25689] Avg episode reward: [(0, '-46.014')] [2022-07-09 19:37:51,805][26022] Updated weights on worker 0-0, policy_version 391053 (0.00092) [2022-07-09 19:37:53,432][26022] Updated weights on worker 0-0, policy_version 391063 (0.00094) [2022-07-09 19:37:55,396][25689] Fps is (10 sec: 5665.7, 60 sec: 5641.5, 300 sec: 5643.7). Total num frames: 400457728. Throughput: 0: 5949.6. Samples: 400464366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:37:55,397][25689] Avg episode reward: [(0, '-46.988')] [2022-07-09 19:37:55,431][26022] Updated weights on worker 0-0, policy_version 391073 (0.00086) [2022-07-09 19:37:57,231][26022] Updated weights on worker 0-0, policy_version 391083 (0.00085) [2022-07-09 19:37:59,153][26022] Updated weights on worker 0-0, policy_version 391093 (0.00088) [2022-07-09 19:38:00,411][25689] Fps is (10 sec: 5700.3, 60 sec: 5676.1, 300 sec: 5660.8). Total num frames: 400487424. Throughput: 0: 5090.6. Samples: 400481340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:00,413][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 19:38:00,696][26022] Updated weights on worker 0-0, policy_version 391103 (0.00091) [2022-07-09 19:38:03,163][26022] Updated weights on worker 0-0, policy_version 391113 (0.00082) [2022-07-09 19:38:04,666][26022] Updated weights on worker 0-0, policy_version 391123 (0.00096) [2022-07-09 19:38:05,456][25689] Fps is (10 sec: 5496.5, 60 sec: 5659.5, 300 sec: 5646.5). Total num frames: 400513024. Throughput: 0: 5831.1. Samples: 400513364. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:05,457][25689] Avg episode reward: [(0, '-47.252')] [2022-07-09 19:38:06,710][26022] Updated weights on worker 0-0, policy_version 391133 (0.00094) [2022-07-09 19:38:08,074][26022] Updated weights on worker 0-0, policy_version 391143 (0.00085) [2022-07-09 19:38:10,204][26022] Updated weights on worker 0-0, policy_version 391153 (0.00089) [2022-07-09 19:38:10,468][25689] Fps is (10 sec: 5396.6, 60 sec: 5684.6, 300 sec: 5644.7). Total num frames: 400541696. Throughput: 0: 5844.6. Samples: 400547770. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:10,468][25689] Avg episode reward: [(0, '-47.540')] [2022-07-09 19:38:11,840][26022] Updated weights on worker 0-0, policy_version 391163 (0.00087) [2022-07-09 19:38:13,613][26022] Updated weights on worker 0-0, policy_version 391173 (0.00083) [2022-07-09 19:38:15,579][25689] Fps is (10 sec: 5665.1, 60 sec: 5650.9, 300 sec: 5646.2). Total num frames: 400570368. Throughput: 0: 4976.3. Samples: 400564800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:15,579][25689] Avg episode reward: [(0, '-47.227')] [2022-07-09 19:38:15,851][26022] Updated weights on worker 0-0, policy_version 391183 (0.00097) [2022-07-09 19:38:17,171][26022] Updated weights on worker 0-0, policy_version 391193 (0.00085) [2022-07-09 19:38:19,202][26022] Updated weights on worker 0-0, policy_version 391203 (0.00085) [2022-07-09 19:38:20,622][25689] Fps is (10 sec: 5848.8, 60 sec: 5671.0, 300 sec: 5649.4). Total num frames: 400601088. Throughput: 0: 5824.1. Samples: 400599052. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:20,623][25689] Avg episode reward: [(0, '-46.931')] [2022-07-09 19:38:20,758][26022] Updated weights on worker 0-0, policy_version 391213 (0.00082) [2022-07-09 19:38:22,702][26022] Updated weights on worker 0-0, policy_version 391223 (0.00091) [2022-07-09 19:38:24,496][26022] Updated weights on worker 0-0, policy_version 391233 (0.00087) [2022-07-09 19:38:25,695][25689] Fps is (10 sec: 5769.8, 60 sec: 5667.6, 300 sec: 5649.3). Total num frames: 400628736. Throughput: 0: 5929.7. Samples: 400633372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:25,696][25689] Avg episode reward: [(0, '-46.297')] [2022-07-09 19:38:26,310][26022] Updated weights on worker 0-0, policy_version 391243 (0.00095) [2022-07-09 19:38:28,148][26022] Updated weights on worker 0-0, policy_version 391253 (0.00077) [2022-07-09 19:38:29,918][26022] Updated weights on worker 0-0, policy_version 391263 (0.00095) [2022-07-09 19:38:30,707][25689] Fps is (10 sec: 5584.7, 60 sec: 5674.6, 300 sec: 5649.9). Total num frames: 400657408. Throughput: 0: 5065.0. Samples: 400650282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 19:38:30,708][25689] Avg episode reward: [(0, '-45.843')] [2022-07-09 19:38:31,812][26022] Updated weights on worker 0-0, policy_version 391273 (0.00083) [2022-07-09 19:38:33,734][26022] Updated weights on worker 0-0, policy_version 391283 (0.00094) [2022-07-09 19:38:35,445][26022] Updated weights on worker 0-0, policy_version 391293 (0.00087) [2022-07-09 19:38:35,827][25689] Fps is (10 sec: 5659.5, 60 sec: 5657.8, 300 sec: 5647.8). Total num frames: 400686080. Throughput: 0: 5899.8. Samples: 400684262. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:38:35,828][25689] Avg episode reward: [(0, '-44.316')] [2022-07-09 19:38:37,162][26022] Updated weights on worker 0-0, policy_version 391303 (0.00091) [2022-07-09 19:38:39,030][26022] Updated weights on worker 0-0, policy_version 391313 (0.00092) [2022-07-09 19:38:40,737][26022] Updated weights on worker 0-0, policy_version 391323 (0.00084) [2022-07-09 19:38:40,861][25689] Fps is (10 sec: 5647.5, 60 sec: 5673.3, 300 sec: 5647.4). Total num frames: 400714752. Throughput: 0: 5902.7. Samples: 400718516. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:38:40,861][25689] Avg episode reward: [(0, '-44.237')] [2022-07-09 19:38:42,698][26022] Updated weights on worker 0-0, policy_version 391333 (0.00095) [2022-07-09 19:38:44,305][26022] Updated weights on worker 0-0, policy_version 391343 (0.00081) [2022-07-09 19:38:45,865][25689] Fps is (10 sec: 5610.5, 60 sec: 5644.7, 300 sec: 5644.4). Total num frames: 400742400. Throughput: 0: 5917.4. Samples: 400752730. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:38:45,866][25689] Avg episode reward: [(0, '-44.232')] [2022-07-09 19:38:46,248][26022] Updated weights on worker 0-0, policy_version 391353 (0.00087) [2022-07-09 19:38:47,865][26022] Updated weights on worker 0-0, policy_version 391363 (0.00278) [2022-07-09 19:38:49,835][26022] Updated weights on worker 0-0, policy_version 391373 (0.00507) [2022-07-09 19:38:50,888][25689] Fps is (10 sec: 5616.5, 60 sec: 5632.8, 300 sec: 5641.9). Total num frames: 400771072. Throughput: 0: 5934.6. Samples: 400770050. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:38:50,889][25689] Avg episode reward: [(0, '-44.203')] [2022-07-09 19:38:51,596][26022] Updated weights on worker 0-0, policy_version 391383 (0.00089) [2022-07-09 19:38:53,366][26022] Updated weights on worker 0-0, policy_version 391393 (0.00087) [2022-07-09 19:38:55,255][26022] Updated weights on worker 0-0, policy_version 391403 (0.00092) [2022-07-09 19:38:56,003][25689] Fps is (10 sec: 5858.7, 60 sec: 5677.0, 300 sec: 5657.5). Total num frames: 400801792. Throughput: 0: 5937.4. Samples: 400804054. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:38:56,003][25689] Avg episode reward: [(0, '-45.766')] [2022-07-09 19:38:57,124][26022] Updated weights on worker 0-0, policy_version 391413 (0.00089) [2022-07-09 19:38:58,574][26022] Updated weights on worker 0-0, policy_version 391423 (0.00089) [2022-07-09 19:39:00,635][26022] Updated weights on worker 0-0, policy_version 391433 (0.00094) [2022-07-09 19:39:01,035][25689] Fps is (10 sec: 5752.6, 60 sec: 5641.6, 300 sec: 5650.1). Total num frames: 400829440. Throughput: 0: 5934.4. Samples: 400838236. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:01,035][25689] Avg episode reward: [(0, '-46.513')] [2022-07-09 19:39:02,679][26022] Updated weights on worker 0-0, policy_version 391443 (0.00101) [2022-07-09 19:39:04,619][26022] Updated weights on worker 0-0, policy_version 391453 (0.00089) [2022-07-09 19:39:06,064][25689] Fps is (10 sec: 5394.3, 60 sec: 5660.0, 300 sec: 5646.4). Total num frames: 400856064. Throughput: 0: 4960.4. Samples: 400852922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:06,066][25689] Avg episode reward: [(0, '-46.842')] [2022-07-09 19:39:06,594][26022] Updated weights on worker 0-0, policy_version 391463 (0.00092) [2022-07-09 19:39:07,932][26022] Updated weights on worker 0-0, policy_version 391473 (0.00086) [2022-07-09 19:39:10,144][26022] Updated weights on worker 0-0, policy_version 391483 (0.00083) [2022-07-09 19:39:11,076][25689] Fps is (10 sec: 5507.1, 60 sec: 5660.0, 300 sec: 5647.6). Total num frames: 400884736. Throughput: 0: 5816.9. Samples: 400887478. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:11,076][25689] Avg episode reward: [(0, '-46.214')] [2022-07-09 19:39:11,710][26022] Updated weights on worker 0-0, policy_version 391493 (0.00082) [2022-07-09 19:39:13,560][26022] Updated weights on worker 0-0, policy_version 391503 (0.00085) [2022-07-09 19:39:15,222][26022] Updated weights on worker 0-0, policy_version 391513 (0.00068) [2022-07-09 19:39:16,140][25689] Fps is (10 sec: 5690.9, 60 sec: 5664.3, 300 sec: 5643.4). Total num frames: 400913408. Throughput: 0: 5847.7. Samples: 400921812. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:16,142][25689] Avg episode reward: [(0, '-46.822')] [2022-07-09 19:39:17,151][26022] Updated weights on worker 0-0, policy_version 391523 (0.00099) [2022-07-09 19:39:18,890][26022] Updated weights on worker 0-0, policy_version 391533 (0.00099) [2022-07-09 19:39:20,944][26022] Updated weights on worker 0-0, policy_version 391543 (0.00090) [2022-07-09 19:39:21,202][25689] Fps is (10 sec: 5662.6, 60 sec: 5628.8, 300 sec: 5649.2). Total num frames: 400942080. Throughput: 0: 4992.6. Samples: 400938924. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:21,203][25689] Avg episode reward: [(0, '-46.732')] [2022-07-09 19:39:22,470][26022] Updated weights on worker 0-0, policy_version 391553 (0.00052) [2022-07-09 19:39:24,574][26022] Updated weights on worker 0-0, policy_version 391563 (0.00096) [2022-07-09 19:39:25,883][26022] Updated weights on worker 0-0, policy_version 391573 (0.00092) [2022-07-09 19:39:26,228][25689] Fps is (10 sec: 5786.2, 60 sec: 5667.0, 300 sec: 5648.9). Total num frames: 400971776. Throughput: 0: 5945.9. Samples: 400972814. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:26,232][25689] Avg episode reward: [(0, '-46.422')] [2022-07-09 19:39:26,856][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:39:26,871][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000391577_400974848.pth [2022-07-09 19:39:26,872][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000389589_398939136.pth [2022-07-09 19:39:26,872][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000391577_400974848.pth.milestone [2022-07-09 19:39:28,258][26022] Updated weights on worker 0-0, policy_version 391583 (0.00109) [2022-07-09 19:39:29,565][26022] Updated weights on worker 0-0, policy_version 391593 (0.00094) [2022-07-09 19:39:31,253][25689] Fps is (10 sec: 5603.4, 60 sec: 5631.9, 300 sec: 5639.5). Total num frames: 400998400. Throughput: 0: 5929.0. Samples: 401007112. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:31,256][25689] Avg episode reward: [(0, '-46.425')] [2022-07-09 19:39:31,786][26022] Updated weights on worker 0-0, policy_version 391603 (0.00091) [2022-07-09 19:39:33,149][26022] Updated weights on worker 0-0, policy_version 391613 (0.00078) [2022-07-09 19:39:35,355][26022] Updated weights on worker 0-0, policy_version 391623 (0.00088) [2022-07-09 19:39:36,365][25689] Fps is (10 sec: 5656.6, 60 sec: 5666.5, 300 sec: 5652.1). Total num frames: 401029120. Throughput: 0: 5046.7. Samples: 401023882. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:36,367][25689] Avg episode reward: [(0, '-47.095')] [2022-07-09 19:39:36,728][26022] Updated weights on worker 0-0, policy_version 391633 (0.00090) [2022-07-09 19:39:39,044][26022] Updated weights on worker 0-0, policy_version 391643 (0.00078) [2022-07-09 19:39:40,482][26022] Updated weights on worker 0-0, policy_version 391653 (0.00078) [2022-07-09 19:39:41,395][25689] Fps is (10 sec: 5654.4, 60 sec: 5633.0, 300 sec: 5645.4). Total num frames: 401055744. Throughput: 0: 5902.2. Samples: 401058104. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:41,395][25689] Avg episode reward: [(0, '-46.994')] [2022-07-09 19:39:42,379][26022] Updated weights on worker 0-0, policy_version 391663 (0.00082) [2022-07-09 19:39:43,904][26022] Updated weights on worker 0-0, policy_version 391673 (0.00095) [2022-07-09 19:39:46,146][26022] Updated weights on worker 0-0, policy_version 391683 (0.00086) [2022-07-09 19:39:46,403][25689] Fps is (10 sec: 5611.0, 60 sec: 5666.6, 300 sec: 5645.8). Total num frames: 401085440. Throughput: 0: 5932.7. Samples: 401092506. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:46,403][25689] Avg episode reward: [(0, '-47.373')] [2022-07-09 19:39:47,675][26022] Updated weights on worker 0-0, policy_version 391693 (0.00083) [2022-07-09 19:39:49,482][26022] Updated weights on worker 0-0, policy_version 391703 (0.00080) [2022-07-09 19:39:51,312][26022] Updated weights on worker 0-0, policy_version 391713 (0.00087) [2022-07-09 19:39:51,411][25689] Fps is (10 sec: 5827.6, 60 sec: 5668.0, 300 sec: 5646.6). Total num frames: 401114112. Throughput: 0: 5092.6. Samples: 401109764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:51,412][25689] Avg episode reward: [(0, '-47.115')] [2022-07-09 19:39:53,204][26022] Updated weights on worker 0-0, policy_version 391723 (0.00099) [2022-07-09 19:39:54,974][26022] Updated weights on worker 0-0, policy_version 391733 (0.00091) [2022-07-09 19:39:56,534][25689] Fps is (10 sec: 5660.3, 60 sec: 5633.4, 300 sec: 5645.1). Total num frames: 401142784. Throughput: 0: 5954.3. Samples: 401143970. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:39:56,536][25689] Avg episode reward: [(0, '-46.773')] [2022-07-09 19:39:56,704][26022] Updated weights on worker 0-0, policy_version 391743 (0.00095) [2022-07-09 19:39:58,348][26022] Updated weights on worker 0-0, policy_version 391753 (0.00080) [2022-07-09 19:40:00,580][26022] Updated weights on worker 0-0, policy_version 391763 (0.00083) [2022-07-09 19:40:01,540][25689] Fps is (10 sec: 5762.0, 60 sec: 5669.5, 300 sec: 5662.4). Total num frames: 401172480. Throughput: 0: 5965.3. Samples: 401178278. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:01,541][25689] Avg episode reward: [(0, '-46.907')] [2022-07-09 19:40:02,111][26022] Updated weights on worker 0-0, policy_version 391773 (0.00097) [2022-07-09 19:40:04,426][26022] Updated weights on worker 0-0, policy_version 391783 (0.00092) [2022-07-09 19:40:06,188][26022] Updated weights on worker 0-0, policy_version 391793 (0.00096) [2022-07-09 19:40:06,588][25689] Fps is (10 sec: 5499.6, 60 sec: 5650.9, 300 sec: 5648.2). Total num frames: 401198080. Throughput: 0: 4993.6. Samples: 401193304. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:06,590][25689] Avg episode reward: [(0, '-46.785')] [2022-07-09 19:40:07,901][26022] Updated weights on worker 0-0, policy_version 391803 (0.00090) [2022-07-09 19:40:09,792][26022] Updated weights on worker 0-0, policy_version 391813 (0.00094) [2022-07-09 19:40:11,596][26022] Updated weights on worker 0-0, policy_version 391823 (0.00092) [2022-07-09 19:40:11,598][25689] Fps is (10 sec: 5395.9, 60 sec: 5651.0, 300 sec: 5649.2). Total num frames: 401226752. Throughput: 0: 5804.1. Samples: 401226936. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:11,599][25689] Avg episode reward: [(0, '-47.234')] [2022-07-09 19:40:13,415][26022] Updated weights on worker 0-0, policy_version 391833 (0.00088) [2022-07-09 19:40:15,287][26022] Updated weights on worker 0-0, policy_version 391843 (0.00086) [2022-07-09 19:40:16,665][25689] Fps is (10 sec: 5690.5, 60 sec: 5650.8, 300 sec: 5648.3). Total num frames: 401255424. Throughput: 0: 5810.2. Samples: 401260940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:16,666][25689] Avg episode reward: [(0, '-47.159')] [2022-07-09 19:40:16,778][26022] Updated weights on worker 0-0, policy_version 391853 (0.00084) [2022-07-09 19:40:18,898][26022] Updated weights on worker 0-0, policy_version 391863 (0.00093) [2022-07-09 19:40:20,416][26022] Updated weights on worker 0-0, policy_version 391873 (0.00093) [2022-07-09 19:40:21,676][25689] Fps is (10 sec: 5487.2, 60 sec: 5621.8, 300 sec: 5638.0). Total num frames: 401282048. Throughput: 0: 4953.1. Samples: 401278010. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:21,676][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 19:40:22,502][26022] Updated weights on worker 0-0, policy_version 391883 (0.00091) [2022-07-09 19:40:24,169][26022] Updated weights on worker 0-0, policy_version 391893 (0.00086) [2022-07-09 19:40:25,920][26022] Updated weights on worker 0-0, policy_version 391903 (0.00102) [2022-07-09 19:40:26,692][25689] Fps is (10 sec: 5616.8, 60 sec: 5622.6, 300 sec: 5644.9). Total num frames: 401311744. Throughput: 0: 5920.8. Samples: 401312338. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:26,695][25689] Avg episode reward: [(0, '-46.748')] [2022-07-09 19:40:27,793][26022] Updated weights on worker 0-0, policy_version 391913 (0.00101) [2022-07-09 19:40:29,675][26022] Updated weights on worker 0-0, policy_version 391923 (0.00588) [2022-07-09 19:40:31,299][26022] Updated weights on worker 0-0, policy_version 391933 (0.00082) [2022-07-09 19:40:31,753][25689] Fps is (10 sec: 5995.0, 60 sec: 5687.0, 300 sec: 5651.8). Total num frames: 401342464. Throughput: 0: 5931.7. Samples: 401346488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:31,754][25689] Avg episode reward: [(0, '-46.342')] [2022-07-09 19:40:33,155][26022] Updated weights on worker 0-0, policy_version 391943 (0.00081) [2022-07-09 19:40:35,005][26022] Updated weights on worker 0-0, policy_version 391953 (0.00085) [2022-07-09 19:40:36,835][25689] Fps is (10 sec: 5653.9, 60 sec: 5622.1, 300 sec: 5640.3). Total num frames: 401369088. Throughput: 0: 5079.0. Samples: 401363382. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:36,835][25689] Avg episode reward: [(0, '-45.759')] [2022-07-09 19:40:36,851][26022] Updated weights on worker 0-0, policy_version 391963 (0.00085) [2022-07-09 19:40:38,728][26022] Updated weights on worker 0-0, policy_version 391973 (0.00086) [2022-07-09 19:40:40,624][26022] Updated weights on worker 0-0, policy_version 391983 (0.00107) [2022-07-09 19:40:41,844][25689] Fps is (10 sec: 5479.8, 60 sec: 5657.9, 300 sec: 5648.0). Total num frames: 401397760. Throughput: 0: 5903.8. Samples: 401397082. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:41,845][25689] Avg episode reward: [(0, '-46.653')] [2022-07-09 19:40:42,178][26022] Updated weights on worker 0-0, policy_version 391993 (0.00088) [2022-07-09 19:40:44,242][26022] Updated weights on worker 0-0, policy_version 392003 (0.00292) [2022-07-09 19:40:45,893][26022] Updated weights on worker 0-0, policy_version 392013 (0.00096) [2022-07-09 19:40:46,864][25689] Fps is (10 sec: 5819.6, 60 sec: 5656.7, 300 sec: 5651.1). Total num frames: 401427456. Throughput: 0: 5910.6. Samples: 401431566. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:46,865][25689] Avg episode reward: [(0, '-46.950')] [2022-07-09 19:40:47,827][26022] Updated weights on worker 0-0, policy_version 392023 (0.00088) [2022-07-09 19:40:49,424][26022] Updated weights on worker 0-0, policy_version 392033 (0.00088) [2022-07-09 19:40:51,513][26022] Updated weights on worker 0-0, policy_version 392043 (0.00085) [2022-07-09 19:40:51,871][25689] Fps is (10 sec: 5616.8, 60 sec: 5623.0, 300 sec: 5644.9). Total num frames: 401454080. Throughput: 0: 5074.7. Samples: 401448580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:51,872][25689] Avg episode reward: [(0, '-47.005')] [2022-07-09 19:40:52,987][26022] Updated weights on worker 0-0, policy_version 392053 (0.00086) [2022-07-09 19:40:55,042][26022] Updated weights on worker 0-0, policy_version 392063 (0.00085) [2022-07-09 19:40:56,555][26022] Updated weights on worker 0-0, policy_version 392073 (0.00080) [2022-07-09 19:40:56,924][25689] Fps is (10 sec: 5598.4, 60 sec: 5646.4, 300 sec: 5651.5). Total num frames: 401483776. Throughput: 0: 5950.6. Samples: 401482926. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:40:56,925][25689] Avg episode reward: [(0, '-46.908')] [2022-07-09 19:40:58,731][26022] Updated weights on worker 0-0, policy_version 392083 (0.00085) [2022-07-09 19:41:00,171][26022] Updated weights on worker 0-0, policy_version 392093 (0.00094) [2022-07-09 19:41:01,959][25689] Fps is (10 sec: 5684.7, 60 sec: 5610.0, 300 sec: 5651.2). Total num frames: 401511424. Throughput: 0: 5971.8. Samples: 401517202. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:41:01,959][25689] Avg episode reward: [(0, '-48.129')] [2022-07-09 19:41:02,580][26022] Updated weights on worker 0-0, policy_version 392103 (0.00085) [2022-07-09 19:41:04,205][26022] Updated weights on worker 0-0, policy_version 392113 (0.00087) [2022-07-09 19:41:06,181][26022] Updated weights on worker 0-0, policy_version 392123 (0.00087) [2022-07-09 19:41:07,004][25689] Fps is (10 sec: 5587.4, 60 sec: 5661.0, 300 sec: 5654.0). Total num frames: 401540096. Throughput: 0: 5001.9. Samples: 401532304. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:41:07,005][25689] Avg episode reward: [(0, '-46.928')] [2022-07-09 19:41:07,853][26022] Updated weights on worker 0-0, policy_version 392133 (0.00084) [2022-07-09 19:41:09,960][26022] Updated weights on worker 0-0, policy_version 392143 (0.00088) [2022-07-09 19:41:11,400][26022] Updated weights on worker 0-0, policy_version 392153 (0.00093) [2022-07-09 19:41:12,045][25689] Fps is (10 sec: 5482.4, 60 sec: 5624.3, 300 sec: 5648.8). Total num frames: 401566720. Throughput: 0: 5811.9. Samples: 401565826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 19:41:12,047][25689] Avg episode reward: [(0, '-46.725')] [2022-07-09 19:41:13,506][26022] Updated weights on worker 0-0, policy_version 392163 (0.00084) [2022-07-09 19:41:15,220][26022] Updated weights on worker 0-0, policy_version 392173 (0.00094) [2022-07-09 19:41:17,092][26022] Updated weights on worker 0-0, policy_version 392183 (0.00093) [2022-07-09 19:41:17,185][25689] Fps is (10 sec: 5431.5, 60 sec: 5617.5, 300 sec: 5646.8). Total num frames: 401595392. Throughput: 0: 5752.9. Samples: 401599482. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:17,185][25689] Avg episode reward: [(0, '-47.452')] [2022-07-09 19:41:19,138][26022] Updated weights on worker 0-0, policy_version 392193 (0.00090) [2022-07-09 19:41:20,489][26022] Updated weights on worker 0-0, policy_version 392203 (0.00088) [2022-07-09 19:41:22,215][25689] Fps is (10 sec: 5638.6, 60 sec: 5649.5, 300 sec: 5643.0). Total num frames: 401624064. Throughput: 0: 5740.9. Samples: 401633490. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:22,215][25689] Avg episode reward: [(0, '-47.089')] [2022-07-09 19:41:22,530][26022] Updated weights on worker 0-0, policy_version 392213 (0.00095) [2022-07-09 19:41:24,156][26022] Updated weights on worker 0-0, policy_version 392223 (0.00092) [2022-07-09 19:41:26,074][26022] Updated weights on worker 0-0, policy_version 392233 (0.00095) [2022-07-09 19:41:26,875][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:41:26,889][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000392237_401650688.pth [2022-07-09 19:41:26,889][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000390252_399618048.pth [2022-07-09 19:41:27,233][25689] Fps is (10 sec: 5706.7, 60 sec: 5632.4, 300 sec: 5646.5). Total num frames: 401652736. Throughput: 0: 5846.9. Samples: 401650582. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:27,234][25689] Avg episode reward: [(0, '-46.902')] [2022-07-09 19:41:27,963][26022] Updated weights on worker 0-0, policy_version 392243 (0.00097) [2022-07-09 19:41:29,828][26022] Updated weights on worker 0-0, policy_version 392253 (0.00084) [2022-07-09 19:41:31,536][26022] Updated weights on worker 0-0, policy_version 392263 (0.00083) [2022-07-09 19:41:32,243][25689] Fps is (10 sec: 5718.4, 60 sec: 5603.4, 300 sec: 5648.6). Total num frames: 401681408. Throughput: 0: 5880.6. Samples: 401684602. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:32,243][25689] Avg episode reward: [(0, '-47.292')] [2022-07-09 19:41:33,312][26022] Updated weights on worker 0-0, policy_version 392273 (0.00084) [2022-07-09 19:41:35,115][26022] Updated weights on worker 0-0, policy_version 392283 (0.00082) [2022-07-09 19:41:36,847][26022] Updated weights on worker 0-0, policy_version 392293 (0.00085) [2022-07-09 19:41:37,275][25689] Fps is (10 sec: 5710.4, 60 sec: 5641.8, 300 sec: 5651.7). Total num frames: 401710080. Throughput: 0: 5953.0. Samples: 401719082. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:37,276][25689] Avg episode reward: [(0, '-47.495')] [2022-07-09 19:41:38,938][26022] Updated weights on worker 0-0, policy_version 392303 (0.00087) [2022-07-09 19:41:40,316][26022] Updated weights on worker 0-0, policy_version 392313 (0.00082) [2022-07-09 19:41:42,302][25689] Fps is (10 sec: 5497.0, 60 sec: 5606.3, 300 sec: 5639.0). Total num frames: 401736704. Throughput: 0: 5104.7. Samples: 401736030. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:42,303][25689] Avg episode reward: [(0, '-46.935')] [2022-07-09 19:41:42,498][26022] Updated weights on worker 0-0, policy_version 392323 (0.00092) [2022-07-09 19:41:43,753][26022] Updated weights on worker 0-0, policy_version 392333 (0.00093) [2022-07-09 19:41:45,998][26022] Updated weights on worker 0-0, policy_version 392343 (0.00083) [2022-07-09 19:41:47,305][25689] Fps is (10 sec: 5819.7, 60 sec: 5641.8, 300 sec: 5653.0). Total num frames: 401768448. Throughput: 0: 5967.3. Samples: 401770354. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:47,305][25689] Avg episode reward: [(0, '-46.516')] [2022-07-09 19:41:47,415][26022] Updated weights on worker 0-0, policy_version 392353 (0.00088) [2022-07-09 19:41:49,472][26022] Updated weights on worker 0-0, policy_version 392363 (0.00090) [2022-07-09 19:41:51,452][26022] Updated weights on worker 0-0, policy_version 392373 (0.00090) [2022-07-09 19:41:52,326][25689] Fps is (10 sec: 5925.2, 60 sec: 5657.4, 300 sec: 5647.3). Total num frames: 401796096. Throughput: 0: 5987.4. Samples: 401804848. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:52,326][25689] Avg episode reward: [(0, '-46.680')] [2022-07-09 19:41:53,150][26022] Updated weights on worker 0-0, policy_version 392383 (0.00093) [2022-07-09 19:41:54,680][26022] Updated weights on worker 0-0, policy_version 392393 (0.00092) [2022-07-09 19:41:56,795][26022] Updated weights on worker 0-0, policy_version 392403 (0.00087) [2022-07-09 19:41:57,397][25689] Fps is (10 sec: 5478.9, 60 sec: 5621.8, 300 sec: 5646.4). Total num frames: 401823744. Throughput: 0: 5108.6. Samples: 401821876. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:41:57,398][25689] Avg episode reward: [(0, '-45.831')] [2022-07-09 19:41:58,161][26022] Updated weights on worker 0-0, policy_version 392413 (0.00095) [2022-07-09 19:42:00,420][26022] Updated weights on worker 0-0, policy_version 392423 (0.00086) [2022-07-09 19:42:02,280][26022] Updated weights on worker 0-0, policy_version 392433 (0.00094) [2022-07-09 19:42:02,419][25689] Fps is (10 sec: 5478.3, 60 sec: 5623.0, 300 sec: 5650.3). Total num frames: 401851392. Throughput: 0: 5961.2. Samples: 401855954. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:02,420][25689] Avg episode reward: [(0, '-45.793')] [2022-07-09 19:42:04,170][26022] Updated weights on worker 0-0, policy_version 392443 (0.00092) [2022-07-09 19:42:06,115][26022] Updated weights on worker 0-0, policy_version 392453 (0.00087) [2022-07-09 19:42:07,440][25689] Fps is (10 sec: 5607.8, 60 sec: 5625.2, 300 sec: 5655.3). Total num frames: 401880064. Throughput: 0: 5845.0. Samples: 401888048. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:07,441][25689] Avg episode reward: [(0, '-45.266')] [2022-07-09 19:42:07,689][26022] Updated weights on worker 0-0, policy_version 392463 (0.00086) [2022-07-09 19:42:09,679][26022] Updated weights on worker 0-0, policy_version 392473 (0.00089) [2022-07-09 19:42:11,455][26022] Updated weights on worker 0-0, policy_version 392483 (0.00086) [2022-07-09 19:42:12,448][25689] Fps is (10 sec: 5615.9, 60 sec: 5645.3, 300 sec: 5646.9). Total num frames: 401907712. Throughput: 0: 4970.0. Samples: 401904856. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:12,448][25689] Avg episode reward: [(0, '-45.801')] [2022-07-09 19:42:13,330][26022] Updated weights on worker 0-0, policy_version 392493 (0.00091) [2022-07-09 19:42:15,132][26022] Updated weights on worker 0-0, policy_version 392503 (0.00084) [2022-07-09 19:42:16,887][26022] Updated weights on worker 0-0, policy_version 392513 (0.00095) [2022-07-09 19:42:17,554][25689] Fps is (10 sec: 5670.1, 60 sec: 5665.4, 300 sec: 5646.4). Total num frames: 401937408. Throughput: 0: 5812.1. Samples: 401939028. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:17,554][25689] Avg episode reward: [(0, '-45.481')] [2022-07-09 19:42:18,647][26022] Updated weights on worker 0-0, policy_version 392523 (0.00087) [2022-07-09 19:42:20,561][26022] Updated weights on worker 0-0, policy_version 392533 (0.00087) [2022-07-09 19:42:22,403][26022] Updated weights on worker 0-0, policy_version 392543 (0.00082) [2022-07-09 19:42:22,612][25689] Fps is (10 sec: 5642.1, 60 sec: 5645.9, 300 sec: 5646.0). Total num frames: 401965056. Throughput: 0: 5793.3. Samples: 401972934. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:22,612][25689] Avg episode reward: [(0, '-46.285')] [2022-07-09 19:42:24,071][26022] Updated weights on worker 0-0, policy_version 392553 (0.00088) [2022-07-09 19:42:26,104][26022] Updated weights on worker 0-0, policy_version 392563 (0.00091) [2022-07-09 19:42:27,675][25689] Fps is (10 sec: 5564.4, 60 sec: 5641.6, 300 sec: 5646.4). Total num frames: 401993728. Throughput: 0: 5035.0. Samples: 401989932. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:27,677][25689] Avg episode reward: [(0, '-47.259')] [2022-07-09 19:42:27,684][26022] Updated weights on worker 0-0, policy_version 392573 (0.00089) [2022-07-09 19:42:29,716][26022] Updated weights on worker 0-0, policy_version 392583 (0.00086) [2022-07-09 19:42:31,576][26022] Updated weights on worker 0-0, policy_version 392593 (0.00091) [2022-07-09 19:42:32,682][25689] Fps is (10 sec: 5592.7, 60 sec: 5625.0, 300 sec: 5641.7). Total num frames: 402021376. Throughput: 0: 5863.7. Samples: 402023502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:32,682][25689] Avg episode reward: [(0, '-47.154')] [2022-07-09 19:42:33,219][26022] Updated weights on worker 0-0, policy_version 392603 (0.00092) [2022-07-09 19:42:35,175][26022] Updated weights on worker 0-0, policy_version 392613 (0.00087) [2022-07-09 19:42:37,080][26022] Updated weights on worker 0-0, policy_version 392623 (0.00089) [2022-07-09 19:42:37,817][25689] Fps is (10 sec: 5553.2, 60 sec: 5615.4, 300 sec: 5643.0). Total num frames: 402050048. Throughput: 0: 5847.6. Samples: 402057520. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:37,818][25689] Avg episode reward: [(0, '-47.160')] [2022-07-09 19:42:38,813][26022] Updated weights on worker 0-0, policy_version 392633 (0.00082) [2022-07-09 19:42:40,490][26022] Updated weights on worker 0-0, policy_version 392643 (0.00087) [2022-07-09 19:42:42,245][26022] Updated weights on worker 0-0, policy_version 392653 (0.00092) [2022-07-09 19:42:42,843][25689] Fps is (10 sec: 5643.6, 60 sec: 5649.4, 300 sec: 5640.2). Total num frames: 402078720. Throughput: 0: 5019.3. Samples: 402074480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:42,843][25689] Avg episode reward: [(0, '-47.526')] [2022-07-09 19:42:43,986][26022] Updated weights on worker 0-0, policy_version 392663 (0.00081) [2022-07-09 19:42:46,067][26022] Updated weights on worker 0-0, policy_version 392673 (0.00079) [2022-07-09 19:42:47,525][26022] Updated weights on worker 0-0, policy_version 392683 (0.00093) [2022-07-09 19:42:47,875][25689] Fps is (10 sec: 5803.3, 60 sec: 5612.8, 300 sec: 5641.0). Total num frames: 402108416. Throughput: 0: 5881.4. Samples: 402108734. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:47,875][25689] Avg episode reward: [(0, '-46.617')] [2022-07-09 19:42:49,532][26022] Updated weights on worker 0-0, policy_version 392693 (0.00085) [2022-07-09 19:42:51,414][26022] Updated weights on worker 0-0, policy_version 392703 (0.00077) [2022-07-09 19:42:52,946][25689] Fps is (10 sec: 5676.0, 60 sec: 5608.2, 300 sec: 5640.5). Total num frames: 402136064. Throughput: 0: 5901.1. Samples: 402143080. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:52,946][25689] Avg episode reward: [(0, '-45.719')] [2022-07-09 19:42:53,055][26022] Updated weights on worker 0-0, policy_version 392713 (0.00078) [2022-07-09 19:42:54,991][26022] Updated weights on worker 0-0, policy_version 392723 (0.00097) [2022-07-09 19:42:56,714][26022] Updated weights on worker 0-0, policy_version 392733 (0.00091) [2022-07-09 19:42:58,034][25689] Fps is (10 sec: 5644.4, 60 sec: 5640.4, 300 sec: 5639.1). Total num frames: 402165760. Throughput: 0: 5080.0. Samples: 402160224. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:42:58,035][25689] Avg episode reward: [(0, '-45.113')] [2022-07-09 19:42:58,529][26022] Updated weights on worker 0-0, policy_version 392743 (0.00086) [2022-07-09 19:43:00,314][26022] Updated weights on worker 0-0, policy_version 392753 (0.00087) [2022-07-09 19:43:02,461][26022] Updated weights on worker 0-0, policy_version 392763 (0.00092) [2022-07-09 19:43:03,039][25689] Fps is (10 sec: 5478.2, 60 sec: 5608.2, 300 sec: 5639.9). Total num frames: 402191360. Throughput: 0: 5926.7. Samples: 402194180. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:03,040][25689] Avg episode reward: [(0, '-44.583')] [2022-07-09 19:43:04,128][26022] Updated weights on worker 0-0, policy_version 392773 (0.00087) [2022-07-09 19:43:06,076][26022] Updated weights on worker 0-0, policy_version 392783 (0.00085) [2022-07-09 19:43:07,806][26022] Updated weights on worker 0-0, policy_version 392793 (0.00087) [2022-07-09 19:43:08,042][25689] Fps is (10 sec: 5627.9, 60 sec: 5643.7, 300 sec: 5646.9). Total num frames: 402222080. Throughput: 0: 5840.1. Samples: 402226510. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:08,042][25689] Avg episode reward: [(0, '-43.904')] [2022-07-09 19:43:09,624][26022] Updated weights on worker 0-0, policy_version 392803 (0.00081) [2022-07-09 19:43:11,312][26022] Updated weights on worker 0-0, policy_version 392813 (0.00091) [2022-07-09 19:43:13,109][25689] Fps is (10 sec: 5796.5, 60 sec: 5638.1, 300 sec: 5644.3). Total num frames: 402249728. Throughput: 0: 4982.1. Samples: 402243536. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:13,110][25689] Avg episode reward: [(0, '-43.867')] [2022-07-09 19:43:13,252][26022] Updated weights on worker 0-0, policy_version 392823 (0.00091) [2022-07-09 19:43:15,079][26022] Updated weights on worker 0-0, policy_version 392833 (0.00091) [2022-07-09 19:43:16,940][26022] Updated weights on worker 0-0, policy_version 392843 (0.00083) [2022-07-09 19:43:18,164][25689] Fps is (10 sec: 5563.8, 60 sec: 5626.0, 300 sec: 5637.2). Total num frames: 402278400. Throughput: 0: 5839.6. Samples: 402277774. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:18,165][25689] Avg episode reward: [(0, '-44.629')] [2022-07-09 19:43:18,606][26022] Updated weights on worker 0-0, policy_version 392853 (0.00084) [2022-07-09 19:43:20,373][26022] Updated weights on worker 0-0, policy_version 392863 (0.00093) [2022-07-09 19:43:22,218][26022] Updated weights on worker 0-0, policy_version 392873 (0.00101) [2022-07-09 19:43:23,206][25689] Fps is (10 sec: 5679.3, 60 sec: 5644.3, 300 sec: 5641.3). Total num frames: 402307072. Throughput: 0: 5854.2. Samples: 402312240. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:23,206][25689] Avg episode reward: [(0, '-44.271')] [2022-07-09 19:43:24,096][26022] Updated weights on worker 0-0, policy_version 392883 (0.00086) [2022-07-09 19:43:25,850][26022] Updated weights on worker 0-0, policy_version 392893 (0.00090) [2022-07-09 19:43:26,925][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:43:26,935][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000392899_402328576.pth [2022-07-09 19:43:26,939][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000390913_400294912.pth [2022-07-09 19:43:27,672][26022] Updated weights on worker 0-0, policy_version 392903 (0.00087) [2022-07-09 19:43:28,255][25689] Fps is (10 sec: 5682.6, 60 sec: 5645.7, 300 sec: 5640.5). Total num frames: 402335744. Throughput: 0: 5085.1. Samples: 402329298. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:28,256][25689] Avg episode reward: [(0, '-45.605')] [2022-07-09 19:43:29,398][26022] Updated weights on worker 0-0, policy_version 392913 (0.00093) [2022-07-09 19:43:31,252][26022] Updated weights on worker 0-0, policy_version 392923 (0.00086) [2022-07-09 19:43:33,044][26022] Updated weights on worker 0-0, policy_version 392933 (0.00088) [2022-07-09 19:43:33,269][25689] Fps is (10 sec: 5800.3, 60 sec: 5678.8, 300 sec: 5646.0). Total num frames: 402365440. Throughput: 0: 5948.2. Samples: 402363452. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:33,270][25689] Avg episode reward: [(0, '-46.371')] [2022-07-09 19:43:34,638][26022] Updated weights on worker 0-0, policy_version 392943 (0.00091) [2022-07-09 19:43:36,595][26022] Updated weights on worker 0-0, policy_version 392953 (0.00087) [2022-07-09 19:43:38,315][25689] Fps is (10 sec: 5700.6, 60 sec: 5670.3, 300 sec: 5642.3). Total num frames: 402393088. Throughput: 0: 5952.7. Samples: 402397724. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:38,315][25689] Avg episode reward: [(0, '-46.536')] [2022-07-09 19:43:38,558][26022] Updated weights on worker 0-0, policy_version 392963 (0.00089) [2022-07-09 19:43:40,150][26022] Updated weights on worker 0-0, policy_version 392973 (0.00086) [2022-07-09 19:43:42,212][26022] Updated weights on worker 0-0, policy_version 392983 (0.00089) [2022-07-09 19:43:43,416][25689] Fps is (10 sec: 5550.6, 60 sec: 5663.3, 300 sec: 5644.0). Total num frames: 402421760. Throughput: 0: 5071.5. Samples: 402414734. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:43,416][25689] Avg episode reward: [(0, '-46.577')] [2022-07-09 19:43:43,691][26022] Updated weights on worker 0-0, policy_version 392993 (0.00092) [2022-07-09 19:43:45,895][26022] Updated weights on worker 0-0, policy_version 393003 (0.00100) [2022-07-09 19:43:47,362][26022] Updated weights on worker 0-0, policy_version 393013 (0.00105) [2022-07-09 19:43:48,473][25689] Fps is (10 sec: 5645.0, 60 sec: 5644.0, 300 sec: 5643.3). Total num frames: 402450432. Throughput: 0: 5912.2. Samples: 402448828. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 19:43:48,474][25689] Avg episode reward: [(0, '-46.405')] [2022-07-09 19:43:49,223][26022] Updated weights on worker 0-0, policy_version 393023 (0.00087) [2022-07-09 19:43:50,959][26022] Updated weights on worker 0-0, policy_version 393033 (0.00086) [2022-07-09 19:43:52,682][26022] Updated weights on worker 0-0, policy_version 393043 (0.00090) [2022-07-09 19:43:53,539][25689] Fps is (10 sec: 5664.4, 60 sec: 5661.3, 300 sec: 5637.3). Total num frames: 402479104. Throughput: 0: 5892.9. Samples: 402482904. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:43:53,541][25689] Avg episode reward: [(0, '-46.588')] [2022-07-09 19:43:54,732][26022] Updated weights on worker 0-0, policy_version 393053 (0.00092) [2022-07-09 19:43:56,594][26022] Updated weights on worker 0-0, policy_version 393063 (0.00086) [2022-07-09 19:43:58,220][26022] Updated weights on worker 0-0, policy_version 393073 (0.00090) [2022-07-09 19:43:58,636][25689] Fps is (10 sec: 5743.0, 60 sec: 5660.5, 300 sec: 5643.0). Total num frames: 402508800. Throughput: 0: 5872.6. Samples: 402517064. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:43:58,637][25689] Avg episode reward: [(0, '-46.175')] [2022-07-09 19:44:00,225][26022] Updated weights on worker 0-0, policy_version 393083 (0.00086) [2022-07-09 19:44:02,152][26022] Updated weights on worker 0-0, policy_version 393093 (0.00083) [2022-07-09 19:44:03,681][25689] Fps is (10 sec: 5452.6, 60 sec: 5656.9, 300 sec: 5639.3). Total num frames: 402534400. Throughput: 0: 5849.7. Samples: 402533278. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:03,685][25689] Avg episode reward: [(0, '-46.691')] [2022-07-09 19:44:04,134][26022] Updated weights on worker 0-0, policy_version 393103 (0.00089) [2022-07-09 19:44:05,721][26022] Updated weights on worker 0-0, policy_version 393113 (0.00086) [2022-07-09 19:44:07,584][26022] Updated weights on worker 0-0, policy_version 393123 (0.00089) [2022-07-09 19:44:08,744][25689] Fps is (10 sec: 5470.9, 60 sec: 5634.4, 300 sec: 5641.7). Total num frames: 402564096. Throughput: 0: 5822.4. Samples: 402566852. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:08,744][25689] Avg episode reward: [(0, '-47.626')] [2022-07-09 19:44:09,320][26022] Updated weights on worker 0-0, policy_version 393133 (0.00084) [2022-07-09 19:44:11,163][26022] Updated weights on worker 0-0, policy_version 393143 (0.00074) [2022-07-09 19:44:12,886][26022] Updated weights on worker 0-0, policy_version 393153 (0.00084) [2022-07-09 19:44:13,755][25689] Fps is (10 sec: 5793.5, 60 sec: 5656.4, 300 sec: 5642.7). Total num frames: 402592768. Throughput: 0: 5842.4. Samples: 402601014. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:13,756][25689] Avg episode reward: [(0, '-48.095')] [2022-07-09 19:44:14,783][26022] Updated weights on worker 0-0, policy_version 393163 (0.00081) [2022-07-09 19:44:16,509][26022] Updated weights on worker 0-0, policy_version 393173 (0.00087) [2022-07-09 19:44:18,276][26022] Updated weights on worker 0-0, policy_version 393183 (0.00089) [2022-07-09 19:44:18,825][25689] Fps is (10 sec: 5688.4, 60 sec: 5655.1, 300 sec: 5642.6). Total num frames: 402621440. Throughput: 0: 5004.0. Samples: 402618086. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:18,825][25689] Avg episode reward: [(0, '-48.353')] [2022-07-09 19:44:20,135][26022] Updated weights on worker 0-0, policy_version 393193 (0.00086) [2022-07-09 19:44:21,969][26022] Updated weights on worker 0-0, policy_version 393203 (0.00089) [2022-07-09 19:44:23,850][25689] Fps is (10 sec: 5680.4, 60 sec: 5656.6, 300 sec: 5639.1). Total num frames: 402650112. Throughput: 0: 5887.2. Samples: 402652022. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:23,851][25689] Avg episode reward: [(0, '-48.059')] [2022-07-09 19:44:23,855][26022] Updated weights on worker 0-0, policy_version 393213 (0.00097) [2022-07-09 19:44:25,655][26022] Updated weights on worker 0-0, policy_version 393223 (0.00091) [2022-07-09 19:44:27,503][26022] Updated weights on worker 0-0, policy_version 393233 (0.00101) [2022-07-09 19:44:28,912][25689] Fps is (10 sec: 5684.7, 60 sec: 5655.4, 300 sec: 5645.3). Total num frames: 402678784. Throughput: 0: 5922.2. Samples: 402686294. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:28,913][25689] Avg episode reward: [(0, '-47.847')] [2022-07-09 19:44:29,371][26022] Updated weights on worker 0-0, policy_version 393243 (0.00090) [2022-07-09 19:44:31,023][26022] Updated weights on worker 0-0, policy_version 393253 (0.00089) [2022-07-09 19:44:32,946][26022] Updated weights on worker 0-0, policy_version 393263 (0.00088) [2022-07-09 19:44:33,980][25689] Fps is (10 sec: 5660.8, 60 sec: 5633.5, 300 sec: 5639.3). Total num frames: 402707456. Throughput: 0: 5058.2. Samples: 402703316. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:33,981][25689] Avg episode reward: [(0, '-47.496')] [2022-07-09 19:44:34,660][26022] Updated weights on worker 0-0, policy_version 393273 (0.00088) [2022-07-09 19:44:36,364][26022] Updated weights on worker 0-0, policy_version 393283 (0.00096) [2022-07-09 19:44:38,407][26022] Updated weights on worker 0-0, policy_version 393293 (0.00095) [2022-07-09 19:44:39,019][25689] Fps is (10 sec: 5673.5, 60 sec: 5651.0, 300 sec: 5646.0). Total num frames: 402736128. Throughput: 0: 5928.8. Samples: 402737818. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:39,020][25689] Avg episode reward: [(0, '-47.268')] [2022-07-09 19:44:39,828][26022] Updated weights on worker 0-0, policy_version 393303 (0.00087) [2022-07-09 19:44:41,936][26022] Updated weights on worker 0-0, policy_version 393313 (0.00096) [2022-07-09 19:44:43,509][26022] Updated weights on worker 0-0, policy_version 393323 (0.00093) [2022-07-09 19:44:44,028][25689] Fps is (10 sec: 5707.3, 60 sec: 5659.6, 300 sec: 5642.5). Total num frames: 402764800. Throughput: 0: 5954.3. Samples: 402772164. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:44,028][25689] Avg episode reward: [(0, '-46.676')] [2022-07-09 19:44:45,424][26022] Updated weights on worker 0-0, policy_version 393333 (0.00086) [2022-07-09 19:44:47,206][26022] Updated weights on worker 0-0, policy_version 393343 (0.00077) [2022-07-09 19:44:48,880][26022] Updated weights on worker 0-0, policy_version 393353 (0.00095) [2022-07-09 19:44:49,048][25689] Fps is (10 sec: 5718.0, 60 sec: 5663.1, 300 sec: 5642.3). Total num frames: 402793472. Throughput: 0: 5111.5. Samples: 402789220. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:49,049][25689] Avg episode reward: [(0, '-47.184')] [2022-07-09 19:44:50,831][26022] Updated weights on worker 0-0, policy_version 393363 (0.00085) [2022-07-09 19:44:52,487][26022] Updated weights on worker 0-0, policy_version 393373 (0.00093) [2022-07-09 19:44:54,062][25689] Fps is (10 sec: 5714.7, 60 sec: 5667.9, 300 sec: 5644.4). Total num frames: 402822144. Throughput: 0: 5992.5. Samples: 402823656. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:54,064][25689] Avg episode reward: [(0, '-46.978')] [2022-07-09 19:44:54,317][26022] Updated weights on worker 0-0, policy_version 393383 (0.00087) [2022-07-09 19:44:56,204][26022] Updated weights on worker 0-0, policy_version 393393 (0.00087) [2022-07-09 19:44:57,796][26022] Updated weights on worker 0-0, policy_version 393403 (0.00108) [2022-07-09 19:44:59,134][25689] Fps is (10 sec: 5685.8, 60 sec: 5653.4, 300 sec: 5639.7). Total num frames: 402850816. Throughput: 0: 5958.5. Samples: 402857668. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:44:59,134][25689] Avg episode reward: [(0, '-46.987')] [2022-07-09 19:44:59,831][26022] Updated weights on worker 0-0, policy_version 393413 (0.00107) [2022-07-09 19:45:01,545][26022] Updated weights on worker 0-0, policy_version 393423 (0.00086) [2022-07-09 19:45:03,762][26022] Updated weights on worker 0-0, policy_version 393433 (0.00086) [2022-07-09 19:45:04,158][25689] Fps is (10 sec: 5477.1, 60 sec: 5672.2, 300 sec: 5643.6). Total num frames: 402877440. Throughput: 0: 5092.9. Samples: 402874686. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:04,159][25689] Avg episode reward: [(0, '-47.152')] [2022-07-09 19:45:05,626][26022] Updated weights on worker 0-0, policy_version 393443 (0.00084) [2022-07-09 19:45:07,404][26022] Updated weights on worker 0-0, policy_version 393453 (0.00087) [2022-07-09 19:45:09,176][25689] Fps is (10 sec: 5404.5, 60 sec: 5642.6, 300 sec: 5640.0). Total num frames: 402905088. Throughput: 0: 5834.9. Samples: 402906662. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:09,177][25689] Avg episode reward: [(0, '-46.986')] [2022-07-09 19:45:09,210][26022] Updated weights on worker 0-0, policy_version 393463 (0.00089) [2022-07-09 19:45:10,887][26022] Updated weights on worker 0-0, policy_version 393473 (0.00098) [2022-07-09 19:45:12,899][26022] Updated weights on worker 0-0, policy_version 393483 (0.00088) [2022-07-09 19:45:14,196][25689] Fps is (10 sec: 5712.8, 60 sec: 5658.7, 300 sec: 5644.3). Total num frames: 402934784. Throughput: 0: 5819.8. Samples: 402940830. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:14,198][25689] Avg episode reward: [(0, '-47.243')] [2022-07-09 19:45:14,527][26022] Updated weights on worker 0-0, policy_version 393493 (0.00085) [2022-07-09 19:45:16,453][26022] Updated weights on worker 0-0, policy_version 393503 (0.00095) [2022-07-09 19:45:18,135][26022] Updated weights on worker 0-0, policy_version 393513 (0.00087) [2022-07-09 19:45:19,312][25689] Fps is (10 sec: 5657.4, 60 sec: 5637.4, 300 sec: 5645.8). Total num frames: 402962432. Throughput: 0: 4965.2. Samples: 402957856. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:19,312][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 19:45:19,920][26022] Updated weights on worker 0-0, policy_version 393523 (0.00093) [2022-07-09 19:45:21,754][26022] Updated weights on worker 0-0, policy_version 393533 (0.00086) [2022-07-09 19:45:23,333][26022] Updated weights on worker 0-0, policy_version 393543 (0.00103) [2022-07-09 19:45:24,352][25689] Fps is (10 sec: 5646.3, 60 sec: 5653.0, 300 sec: 5645.3). Total num frames: 402992128. Throughput: 0: 5834.2. Samples: 402992500. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:24,352][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 19:45:25,295][26022] Updated weights on worker 0-0, policy_version 393553 (0.00386) [2022-07-09 19:45:26,991][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:45:27,000][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000393563_403008512.pth [2022-07-09 19:45:27,000][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000391577_400974848.pth [2022-07-09 19:45:27,006][26022] Updated weights on worker 0-0, policy_version 393563 (0.00088) [2022-07-09 19:45:29,028][26022] Updated weights on worker 0-0, policy_version 393573 (0.00090) [2022-07-09 19:45:29,394][25689] Fps is (10 sec: 5890.8, 60 sec: 5671.8, 300 sec: 5642.2). Total num frames: 403021824. Throughput: 0: 5935.4. Samples: 403026666. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:29,394][25689] Avg episode reward: [(0, '-47.805')] [2022-07-09 19:45:30,616][26022] Updated weights on worker 0-0, policy_version 393583 (0.00088) [2022-07-09 19:45:32,371][26022] Updated weights on worker 0-0, policy_version 393593 (0.00097) [2022-07-09 19:45:34,408][25689] Fps is (10 sec: 5702.5, 60 sec: 5659.9, 300 sec: 5647.0). Total num frames: 403049472. Throughput: 0: 5083.6. Samples: 403043580. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:34,408][25689] Avg episode reward: [(0, '-48.051')] [2022-07-09 19:45:34,415][26022] Updated weights on worker 0-0, policy_version 393603 (0.00089) [2022-07-09 19:45:36,126][26022] Updated weights on worker 0-0, policy_version 393613 (0.00092) [2022-07-09 19:45:37,959][26022] Updated weights on worker 0-0, policy_version 393623 (0.00087) [2022-07-09 19:45:39,531][25689] Fps is (10 sec: 5556.0, 60 sec: 5652.1, 300 sec: 5644.8). Total num frames: 403078144. Throughput: 0: 5922.5. Samples: 403077602. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:39,535][25689] Avg episode reward: [(0, '-47.610')] [2022-07-09 19:45:39,897][26022] Updated weights on worker 0-0, policy_version 393633 (0.00090) [2022-07-09 19:45:41,527][26022] Updated weights on worker 0-0, policy_version 393643 (0.00082) [2022-07-09 19:45:43,424][26022] Updated weights on worker 0-0, policy_version 393653 (0.00086) [2022-07-09 19:45:44,592][25689] Fps is (10 sec: 5630.7, 60 sec: 5647.2, 300 sec: 5640.6). Total num frames: 403106816. Throughput: 0: 5884.9. Samples: 403111612. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:44,592][25689] Avg episode reward: [(0, '-47.007')] [2022-07-09 19:45:45,064][26022] Updated weights on worker 0-0, policy_version 393663 (0.00085) [2022-07-09 19:45:46,978][26022] Updated weights on worker 0-0, policy_version 393673 (0.00089) [2022-07-09 19:45:48,849][26022] Updated weights on worker 0-0, policy_version 393683 (0.00088) [2022-07-09 19:45:49,595][25689] Fps is (10 sec: 5799.5, 60 sec: 5665.7, 300 sec: 5651.0). Total num frames: 403136512. Throughput: 0: 5052.1. Samples: 403128724. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:49,595][25689] Avg episode reward: [(0, '-47.142')] [2022-07-09 19:45:50,627][26022] Updated weights on worker 0-0, policy_version 393693 (0.00086) [2022-07-09 19:45:52,446][26022] Updated weights on worker 0-0, policy_version 393703 (0.00090) [2022-07-09 19:45:54,134][26022] Updated weights on worker 0-0, policy_version 393713 (0.00084) [2022-07-09 19:45:54,652][25689] Fps is (10 sec: 5700.2, 60 sec: 5644.8, 300 sec: 5644.0). Total num frames: 403164160. Throughput: 0: 5907.6. Samples: 403163174. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:54,652][25689] Avg episode reward: [(0, '-46.771')] [2022-07-09 19:45:56,063][26022] Updated weights on worker 0-0, policy_version 393723 (0.00086) [2022-07-09 19:45:57,663][26022] Updated weights on worker 0-0, policy_version 393733 (0.00093) [2022-07-09 19:45:59,604][26022] Updated weights on worker 0-0, policy_version 393743 (0.00086) [2022-07-09 19:45:59,735][25689] Fps is (10 sec: 5655.4, 60 sec: 5660.6, 300 sec: 5650.0). Total num frames: 403193856. Throughput: 0: 5934.2. Samples: 403197496. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:45:59,735][25689] Avg episode reward: [(0, '-46.366')] [2022-07-09 19:46:01,247][26022] Updated weights on worker 0-0, policy_version 393753 (0.00087) [2022-07-09 19:46:03,549][26022] Updated weights on worker 0-0, policy_version 393763 (0.00085) [2022-07-09 19:46:04,746][25689] Fps is (10 sec: 5478.2, 60 sec: 5645.0, 300 sec: 5640.3). Total num frames: 403219456. Throughput: 0: 5859.7. Samples: 403229708. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:46:04,746][25689] Avg episode reward: [(0, '-45.230')] [2022-07-09 19:46:05,279][26022] Updated weights on worker 0-0, policy_version 393773 (0.00084) [2022-07-09 19:46:07,064][26022] Updated weights on worker 0-0, policy_version 393783 (0.00100) [2022-07-09 19:46:09,014][26022] Updated weights on worker 0-0, policy_version 393793 (0.00092) [2022-07-09 19:46:09,768][25689] Fps is (10 sec: 5409.3, 60 sec: 5661.5, 300 sec: 5647.6). Total num frames: 403248128. Throughput: 0: 5852.1. Samples: 403246778. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:46:09,768][25689] Avg episode reward: [(0, '-45.586')] [2022-07-09 19:46:10,726][26022] Updated weights on worker 0-0, policy_version 393803 (0.00092) [2022-07-09 19:46:12,445][26022] Updated weights on worker 0-0, policy_version 393813 (0.00094) [2022-07-09 19:46:14,626][26022] Updated weights on worker 0-0, policy_version 393823 (0.00085) [2022-07-09 19:46:14,776][25689] Fps is (10 sec: 5615.3, 60 sec: 5628.8, 300 sec: 5646.6). Total num frames: 403275776. Throughput: 0: 5820.7. Samples: 403280308. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:46:14,776][25689] Avg episode reward: [(0, '-45.849')] [2022-07-09 19:46:16,196][26022] Updated weights on worker 0-0, policy_version 393833 (0.00091) [2022-07-09 19:46:18,254][26022] Updated weights on worker 0-0, policy_version 393843 (0.00079) [2022-07-09 19:46:19,835][25689] Fps is (10 sec: 5696.3, 60 sec: 5667.9, 300 sec: 5649.5). Total num frames: 403305472. Throughput: 0: 5796.9. Samples: 403314014. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:46:19,835][25689] Avg episode reward: [(0, '-45.338')] [2022-07-09 19:46:19,838][26022] Updated weights on worker 0-0, policy_version 393853 (0.00086) [2022-07-09 19:46:21,844][26022] Updated weights on worker 0-0, policy_version 393863 (0.00089) [2022-07-09 19:46:23,527][26022] Updated weights on worker 0-0, policy_version 393873 (0.00090) [2022-07-09 19:46:24,895][25689] Fps is (10 sec: 5666.8, 60 sec: 5632.2, 300 sec: 5645.3). Total num frames: 403333120. Throughput: 0: 5032.9. Samples: 403331116. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:46:24,895][25689] Avg episode reward: [(0, '-45.417')] [2022-07-09 19:46:25,467][26022] Updated weights on worker 0-0, policy_version 393883 (0.00088) [2022-07-09 19:46:27,251][26022] Updated weights on worker 0-0, policy_version 393893 (0.00085) [2022-07-09 19:46:29,065][26022] Updated weights on worker 0-0, policy_version 393903 (0.00084) [2022-07-09 19:46:29,915][25689] Fps is (10 sec: 5485.4, 60 sec: 5600.4, 300 sec: 5641.6). Total num frames: 403360768. Throughput: 0: 5858.7. Samples: 403364816. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-09 19:46:29,916][25689] Avg episode reward: [(0, '-46.647')] [2022-07-09 19:46:30,857][26022] Updated weights on worker 0-0, policy_version 393913 (0.00088) [2022-07-09 19:46:32,750][26022] Updated weights on worker 0-0, policy_version 393923 (0.00088) [2022-07-09 19:46:34,643][26022] Updated weights on worker 0-0, policy_version 393933 (0.00092) [2022-07-09 19:46:34,926][25689] Fps is (10 sec: 5614.5, 60 sec: 5617.6, 300 sec: 5642.1). Total num frames: 403389440. Throughput: 0: 5869.4. Samples: 403398578. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:46:34,928][25689] Avg episode reward: [(0, '-46.754')] [2022-07-09 19:46:36,319][26022] Updated weights on worker 0-0, policy_version 393943 (0.00061) [2022-07-09 19:46:38,231][26022] Updated weights on worker 0-0, policy_version 393953 (0.00094) [2022-07-09 19:46:40,002][25689] Fps is (10 sec: 5685.0, 60 sec: 5622.0, 300 sec: 5648.0). Total num frames: 403418112. Throughput: 0: 5022.8. Samples: 403415312. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:46:40,004][25689] Avg episode reward: [(0, '-47.477')] [2022-07-09 19:46:40,008][26022] Updated weights on worker 0-0, policy_version 393963 (0.00094) [2022-07-09 19:46:41,664][26022] Updated weights on worker 0-0, policy_version 393973 (0.00097) [2022-07-09 19:46:43,733][26022] Updated weights on worker 0-0, policy_version 393983 (0.00091) [2022-07-09 19:46:45,042][25689] Fps is (10 sec: 5668.8, 60 sec: 5624.0, 300 sec: 5637.0). Total num frames: 403446784. Throughput: 0: 5864.1. Samples: 403449260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:46:45,042][25689] Avg episode reward: [(0, '-47.901')] [2022-07-09 19:46:45,247][26022] Updated weights on worker 0-0, policy_version 393993 (0.00083) [2022-07-09 19:46:47,478][26022] Updated weights on worker 0-0, policy_version 394003 (0.00440) [2022-07-09 19:46:48,917][26022] Updated weights on worker 0-0, policy_version 394013 (0.00206) [2022-07-09 19:46:50,136][25689] Fps is (10 sec: 5658.6, 60 sec: 5598.6, 300 sec: 5639.0). Total num frames: 403475456. Throughput: 0: 5873.4. Samples: 403483582. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:46:50,137][25689] Avg episode reward: [(0, '-47.714')] [2022-07-09 19:46:50,835][26022] Updated weights on worker 0-0, policy_version 394023 (0.00080) [2022-07-09 19:46:52,577][26022] Updated weights on worker 0-0, policy_version 394033 (0.00085) [2022-07-09 19:46:54,566][26022] Updated weights on worker 0-0, policy_version 394043 (0.00088) [2022-07-09 19:46:55,186][25689] Fps is (10 sec: 5652.7, 60 sec: 5616.1, 300 sec: 5642.9). Total num frames: 403504128. Throughput: 0: 5032.4. Samples: 403500536. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:46:55,188][25689] Avg episode reward: [(0, '-47.124')] [2022-07-09 19:46:56,168][26022] Updated weights on worker 0-0, policy_version 394053 (0.00096) [2022-07-09 19:46:58,039][26022] Updated weights on worker 0-0, policy_version 394063 (0.00087) [2022-07-09 19:46:59,839][26022] Updated weights on worker 0-0, policy_version 394073 (0.00087) [2022-07-09 19:47:00,241][25689] Fps is (10 sec: 5674.6, 60 sec: 5601.8, 300 sec: 5645.7). Total num frames: 403532800. Throughput: 0: 5890.4. Samples: 403534530. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:00,243][25689] Avg episode reward: [(0, '-47.288')] [2022-07-09 19:47:01,896][26022] Updated weights on worker 0-0, policy_version 394083 (0.00104) [2022-07-09 19:47:03,739][26022] Updated weights on worker 0-0, policy_version 394093 (0.00263) [2022-07-09 19:47:05,332][25689] Fps is (10 sec: 5450.3, 60 sec: 5611.3, 300 sec: 5637.5). Total num frames: 403559424. Throughput: 0: 5787.2. Samples: 403566682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:05,333][25689] Avg episode reward: [(0, '-47.663')] [2022-07-09 19:47:05,715][26022] Updated weights on worker 0-0, policy_version 394103 (0.00093) [2022-07-09 19:47:07,304][26022] Updated weights on worker 0-0, policy_version 394113 (0.00099) [2022-07-09 19:47:09,314][26022] Updated weights on worker 0-0, policy_version 394123 (0.00089) [2022-07-09 19:47:10,398][25689] Fps is (10 sec: 5444.5, 60 sec: 5607.3, 300 sec: 5639.8). Total num frames: 403588096. Throughput: 0: 4946.6. Samples: 403583808. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:10,398][25689] Avg episode reward: [(0, '-47.534')] [2022-07-09 19:47:10,913][26022] Updated weights on worker 0-0, policy_version 394133 (0.00094) [2022-07-09 19:47:12,870][26022] Updated weights on worker 0-0, policy_version 394143 (0.00090) [2022-07-09 19:47:14,604][26022] Updated weights on worker 0-0, policy_version 394153 (0.00086) [2022-07-09 19:47:15,428][25689] Fps is (10 sec: 5679.8, 60 sec: 5622.1, 300 sec: 5637.8). Total num frames: 403616768. Throughput: 0: 5803.9. Samples: 403618016. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:15,428][25689] Avg episode reward: [(0, '-47.833')] [2022-07-09 19:47:16,595][26022] Updated weights on worker 0-0, policy_version 394163 (0.00093) [2022-07-09 19:47:18,260][26022] Updated weights on worker 0-0, policy_version 394173 (0.00088) [2022-07-09 19:47:20,175][26022] Updated weights on worker 0-0, policy_version 394183 (0.00087) [2022-07-09 19:47:20,471][25689] Fps is (10 sec: 5692.9, 60 sec: 5606.7, 300 sec: 5641.5). Total num frames: 403645440. Throughput: 0: 5809.3. Samples: 403652048. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:20,471][25689] Avg episode reward: [(0, '-47.759')] [2022-07-09 19:47:21,854][26022] Updated weights on worker 0-0, policy_version 394193 (0.00091) [2022-07-09 19:47:23,687][26022] Updated weights on worker 0-0, policy_version 394203 (0.00083) [2022-07-09 19:47:25,420][26022] Updated weights on worker 0-0, policy_version 394213 (0.00091) [2022-07-09 19:47:25,474][25689] Fps is (10 sec: 5707.7, 60 sec: 5628.8, 300 sec: 5642.7). Total num frames: 403674112. Throughput: 0: 5077.4. Samples: 403668954. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:25,475][25689] Avg episode reward: [(0, '-48.009')] [2022-07-09 19:47:27,105][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:47:27,120][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000394222_403683328.pth [2022-07-09 19:47:27,121][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000392237_401650688.pth [2022-07-09 19:47:27,242][26022] Updated weights on worker 0-0, policy_version 394223 (0.00082) [2022-07-09 19:47:29,079][26022] Updated weights on worker 0-0, policy_version 394233 (0.00089) [2022-07-09 19:47:30,483][25689] Fps is (10 sec: 5625.0, 60 sec: 5629.9, 300 sec: 5642.6). Total num frames: 403701760. Throughput: 0: 5938.5. Samples: 403703084. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:30,484][25689] Avg episode reward: [(0, '-47.757')] [2022-07-09 19:47:31,060][26022] Updated weights on worker 0-0, policy_version 394243 (0.00085) [2022-07-09 19:47:32,611][26022] Updated weights on worker 0-0, policy_version 394253 (0.00091) [2022-07-09 19:47:34,478][26022] Updated weights on worker 0-0, policy_version 394263 (0.00085) [2022-07-09 19:47:35,492][25689] Fps is (10 sec: 5520.0, 60 sec: 5613.2, 300 sec: 5641.6). Total num frames: 403729408. Throughput: 0: 5952.0. Samples: 403737438. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:35,492][25689] Avg episode reward: [(0, '-48.134')] [2022-07-09 19:47:36,174][26022] Updated weights on worker 0-0, policy_version 394273 (0.00094) [2022-07-09 19:47:38,029][26022] Updated weights on worker 0-0, policy_version 394283 (0.00084) [2022-07-09 19:47:39,904][26022] Updated weights on worker 0-0, policy_version 394293 (0.00093) [2022-07-09 19:47:40,591][25689] Fps is (10 sec: 5774.7, 60 sec: 5644.9, 300 sec: 5647.1). Total num frames: 403760128. Throughput: 0: 5098.8. Samples: 403754636. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:40,591][25689] Avg episode reward: [(0, '-48.131')] [2022-07-09 19:47:41,414][26022] Updated weights on worker 0-0, policy_version 394303 (0.00090) [2022-07-09 19:47:43,463][26022] Updated weights on worker 0-0, policy_version 394313 (0.00080) [2022-07-09 19:47:45,200][26022] Updated weights on worker 0-0, policy_version 394323 (0.00101) [2022-07-09 19:47:45,615][25689] Fps is (10 sec: 5766.0, 60 sec: 5629.4, 300 sec: 5640.4). Total num frames: 403787776. Throughput: 0: 5964.0. Samples: 403789070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:45,615][25689] Avg episode reward: [(0, '-47.781')] [2022-07-09 19:47:47,052][26022] Updated weights on worker 0-0, policy_version 394333 (0.00086) [2022-07-09 19:47:49,015][26022] Updated weights on worker 0-0, policy_version 394343 (0.00088) [2022-07-09 19:47:50,547][26022] Updated weights on worker 0-0, policy_version 394353 (0.00086) [2022-07-09 19:47:50,623][25689] Fps is (10 sec: 5715.6, 60 sec: 5654.3, 300 sec: 5648.4). Total num frames: 403817472. Throughput: 0: 5947.3. Samples: 403822866. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:50,624][25689] Avg episode reward: [(0, '-48.016')] [2022-07-09 19:47:52,686][26022] Updated weights on worker 0-0, policy_version 394363 (0.00090) [2022-07-09 19:47:54,351][26022] Updated weights on worker 0-0, policy_version 394373 (0.00091) [2022-07-09 19:47:55,634][25689] Fps is (10 sec: 5723.3, 60 sec: 5641.1, 300 sec: 5643.0). Total num frames: 403845120. Throughput: 0: 5086.6. Samples: 403839894. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:47:55,635][25689] Avg episode reward: [(0, '-47.907')] [2022-07-09 19:47:56,103][26022] Updated weights on worker 0-0, policy_version 394383 (0.00095) [2022-07-09 19:47:58,034][26022] Updated weights on worker 0-0, policy_version 394393 (0.00088) [2022-07-09 19:47:59,678][26022] Updated weights on worker 0-0, policy_version 394403 (0.00087) [2022-07-09 19:48:00,725][25689] Fps is (10 sec: 5575.4, 60 sec: 5637.8, 300 sec: 5651.7). Total num frames: 403873792. Throughput: 0: 5919.6. Samples: 403873824. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:00,726][25689] Avg episode reward: [(0, '-47.681')] [2022-07-09 19:48:01,923][26022] Updated weights on worker 0-0, policy_version 394413 (0.00094) [2022-07-09 19:48:03,635][26022] Updated weights on worker 0-0, policy_version 394423 (0.00093) [2022-07-09 19:48:05,622][26022] Updated weights on worker 0-0, policy_version 394433 (0.00085) [2022-07-09 19:48:05,745][25689] Fps is (10 sec: 5367.9, 60 sec: 5627.4, 300 sec: 5634.2). Total num frames: 403899392. Throughput: 0: 5794.5. Samples: 403905714. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:05,745][25689] Avg episode reward: [(0, '-47.496')] [2022-07-09 19:48:07,290][26022] Updated weights on worker 0-0, policy_version 394443 (0.00082) [2022-07-09 19:48:09,128][26022] Updated weights on worker 0-0, policy_version 394453 (0.00082) [2022-07-09 19:48:10,811][25689] Fps is (10 sec: 5584.0, 60 sec: 5661.3, 300 sec: 5644.5). Total num frames: 403930112. Throughput: 0: 4958.4. Samples: 403922964. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:10,812][25689] Avg episode reward: [(0, '-47.345')] [2022-07-09 19:48:10,819][26022] Updated weights on worker 0-0, policy_version 394463 (0.00088) [2022-07-09 19:48:12,599][26022] Updated weights on worker 0-0, policy_version 394473 (0.00082) [2022-07-09 19:48:14,423][26022] Updated weights on worker 0-0, policy_version 394483 (0.00082) [2022-07-09 19:48:15,848][25689] Fps is (10 sec: 5777.2, 60 sec: 5643.7, 300 sec: 5641.4). Total num frames: 403957760. Throughput: 0: 5811.3. Samples: 403957362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:15,848][25689] Avg episode reward: [(0, '-46.572')] [2022-07-09 19:48:16,261][26022] Updated weights on worker 0-0, policy_version 394493 (0.00085) [2022-07-09 19:48:18,271][26022] Updated weights on worker 0-0, policy_version 394503 (0.00086) [2022-07-09 19:48:19,814][26022] Updated weights on worker 0-0, policy_version 394513 (0.00086) [2022-07-09 19:48:20,896][25689] Fps is (10 sec: 5584.6, 60 sec: 5643.2, 300 sec: 5641.3). Total num frames: 403986432. Throughput: 0: 5832.6. Samples: 403991474. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:20,897][25689] Avg episode reward: [(0, '-46.764')] [2022-07-09 19:48:21,742][26022] Updated weights on worker 0-0, policy_version 394523 (0.00707) [2022-07-09 19:48:23,510][26022] Updated weights on worker 0-0, policy_version 394533 (0.00089) [2022-07-09 19:48:25,275][26022] Updated weights on worker 0-0, policy_version 394543 (0.00096) [2022-07-09 19:48:25,910][25689] Fps is (10 sec: 5698.7, 60 sec: 5642.2, 300 sec: 5642.0). Total num frames: 404015104. Throughput: 0: 5108.7. Samples: 404008736. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:25,911][25689] Avg episode reward: [(0, '-48.096')] [2022-07-09 19:48:27,324][26022] Updated weights on worker 0-0, policy_version 394553 (0.00092) [2022-07-09 19:48:28,951][26022] Updated weights on worker 0-0, policy_version 394563 (0.00096) [2022-07-09 19:48:30,720][26022] Updated weights on worker 0-0, policy_version 394573 (0.00086) [2022-07-09 19:48:30,993][25689] Fps is (10 sec: 5780.5, 60 sec: 5669.1, 300 sec: 5640.7). Total num frames: 404044800. Throughput: 0: 5919.2. Samples: 404042426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:30,994][25689] Avg episode reward: [(0, '-48.186')] [2022-07-09 19:48:32,663][26022] Updated weights on worker 0-0, policy_version 394583 (0.00092) [2022-07-09 19:48:34,235][26022] Updated weights on worker 0-0, policy_version 394593 (0.00088) [2022-07-09 19:48:36,083][25689] Fps is (10 sec: 5536.2, 60 sec: 5644.6, 300 sec: 5636.4). Total num frames: 404071424. Throughput: 0: 5880.0. Samples: 404076348. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:36,084][25689] Avg episode reward: [(0, '-48.101')] [2022-07-09 19:48:36,450][26022] Updated weights on worker 0-0, policy_version 394603 (0.00091) [2022-07-09 19:48:37,795][26022] Updated weights on worker 0-0, policy_version 394613 (0.00090) [2022-07-09 19:48:39,803][26022] Updated weights on worker 0-0, policy_version 394623 (0.00092) [2022-07-09 19:48:41,231][25689] Fps is (10 sec: 5601.4, 60 sec: 5640.1, 300 sec: 5642.4). Total num frames: 404102144. Throughput: 0: 5870.7. Samples: 404110854. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:41,231][25689] Avg episode reward: [(0, '-48.647')] [2022-07-09 19:48:41,503][26022] Updated weights on worker 0-0, policy_version 394633 (0.00083) [2022-07-09 19:48:43,380][26022] Updated weights on worker 0-0, policy_version 394643 (0.00087) [2022-07-09 19:48:45,064][26022] Updated weights on worker 0-0, policy_version 394653 (0.00087) [2022-07-09 19:48:46,329][25689] Fps is (10 sec: 5797.1, 60 sec: 5650.1, 300 sec: 5641.6). Total num frames: 404130816. Throughput: 0: 5849.1. Samples: 404128164. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:46,329][25689] Avg episode reward: [(0, '-48.728')] [2022-07-09 19:48:47,094][26022] Updated weights on worker 0-0, policy_version 394663 (0.00088) [2022-07-09 19:48:48,779][26022] Updated weights on worker 0-0, policy_version 394673 (0.00089) [2022-07-09 19:48:50,778][26022] Updated weights on worker 0-0, policy_version 394683 (0.00087) [2022-07-09 19:48:51,332][25689] Fps is (10 sec: 5575.5, 60 sec: 5616.9, 300 sec: 5639.4). Total num frames: 404158464. Throughput: 0: 5885.0. Samples: 404162120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:51,333][25689] Avg episode reward: [(0, '-47.658')] [2022-07-09 19:48:52,314][26022] Updated weights on worker 0-0, policy_version 394693 (0.00090) [2022-07-09 19:48:54,368][26022] Updated weights on worker 0-0, policy_version 394703 (0.00089) [2022-07-09 19:48:55,983][26022] Updated weights on worker 0-0, policy_version 394713 (0.00100) [2022-07-09 19:48:56,400][25689] Fps is (10 sec: 5693.7, 60 sec: 5645.3, 300 sec: 5639.9). Total num frames: 404188160. Throughput: 0: 5877.1. Samples: 404195752. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:48:56,401][25689] Avg episode reward: [(0, '-47.285')] [2022-07-09 19:48:58,006][26022] Updated weights on worker 0-0, policy_version 394723 (0.00090) [2022-07-09 19:48:59,507][26022] Updated weights on worker 0-0, policy_version 394733 (0.00090) [2022-07-09 19:49:01,463][25689] Fps is (10 sec: 5559.4, 60 sec: 5614.2, 300 sec: 5643.0). Total num frames: 404214784. Throughput: 0: 5044.8. Samples: 404212920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:49:01,463][25689] Avg episode reward: [(0, '-47.075')] [2022-07-09 19:49:02,063][26022] Updated weights on worker 0-0, policy_version 394743 (0.00091) [2022-07-09 19:49:03,614][26022] Updated weights on worker 0-0, policy_version 394753 (0.00492) [2022-07-09 19:49:05,659][26022] Updated weights on worker 0-0, policy_version 394763 (0.00083) [2022-07-09 19:49:06,483][25689] Fps is (10 sec: 5484.3, 60 sec: 5664.7, 300 sec: 5640.4). Total num frames: 404243456. Throughput: 0: 5797.2. Samples: 404245002. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 19:49:06,483][25689] Avg episode reward: [(0, '-46.427')] [2022-07-09 19:49:07,147][26022] Updated weights on worker 0-0, policy_version 394773 (0.00086) [2022-07-09 19:49:09,112][26022] Updated weights on worker 0-0, policy_version 394783 (0.00092) [2022-07-09 19:49:10,858][26022] Updated weights on worker 0-0, policy_version 394793 (0.00091) [2022-07-09 19:49:11,528][25689] Fps is (10 sec: 5493.8, 60 sec: 5599.2, 300 sec: 5632.9). Total num frames: 404270080. Throughput: 0: 5792.7. Samples: 404279108. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:11,528][25689] Avg episode reward: [(0, '-46.526')] [2022-07-09 19:49:12,715][26022] Updated weights on worker 0-0, policy_version 394803 (0.00084) [2022-07-09 19:49:14,408][26022] Updated weights on worker 0-0, policy_version 394813 (0.00086) [2022-07-09 19:49:16,280][26022] Updated weights on worker 0-0, policy_version 394823 (0.00092) [2022-07-09 19:49:16,563][25689] Fps is (10 sec: 5587.3, 60 sec: 5633.1, 300 sec: 5637.0). Total num frames: 404299776. Throughput: 0: 4983.0. Samples: 404296222. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:16,563][25689] Avg episode reward: [(0, '-47.018')] [2022-07-09 19:49:18,201][26022] Updated weights on worker 0-0, policy_version 394833 (0.00085) [2022-07-09 19:49:19,955][26022] Updated weights on worker 0-0, policy_version 394843 (0.01090) [2022-07-09 19:49:21,592][26022] Updated weights on worker 0-0, policy_version 394853 (0.00091) [2022-07-09 19:49:21,623][25689] Fps is (10 sec: 5883.5, 60 sec: 5648.9, 300 sec: 5639.7). Total num frames: 404329472. Throughput: 0: 5804.2. Samples: 404329932. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:21,623][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 19:49:23,508][26022] Updated weights on worker 0-0, policy_version 394863 (0.00086) [2022-07-09 19:49:25,351][26022] Updated weights on worker 0-0, policy_version 394873 (0.00099) [2022-07-09 19:49:26,647][25689] Fps is (10 sec: 5585.1, 60 sec: 5614.3, 300 sec: 5633.6). Total num frames: 404356096. Throughput: 0: 5913.2. Samples: 404364236. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:26,647][25689] Avg episode reward: [(0, '-46.956')] [2022-07-09 19:49:27,154][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:49:27,165][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000394883_404360192.pth [2022-07-09 19:49:27,165][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000392899_402328576.pth [2022-07-09 19:49:27,171][26022] Updated weights on worker 0-0, policy_version 394883 (0.00089) [2022-07-09 19:49:29,064][26022] Updated weights on worker 0-0, policy_version 394893 (0.00091) [2022-07-09 19:49:30,667][26022] Updated weights on worker 0-0, policy_version 394903 (0.00082) [2022-07-09 19:49:31,664][25689] Fps is (10 sec: 5609.1, 60 sec: 5620.4, 300 sec: 5638.0). Total num frames: 404385792. Throughput: 0: 5075.1. Samples: 404381298. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:31,664][25689] Avg episode reward: [(0, '-46.255')] [2022-07-09 19:49:32,597][26022] Updated weights on worker 0-0, policy_version 394913 (0.00087) [2022-07-09 19:49:34,196][26022] Updated weights on worker 0-0, policy_version 394923 (0.00096) [2022-07-09 19:49:36,037][26022] Updated weights on worker 0-0, policy_version 394933 (0.00087) [2022-07-09 19:49:36,679][25689] Fps is (10 sec: 5818.2, 60 sec: 5661.2, 300 sec: 5638.4). Total num frames: 404414464. Throughput: 0: 5936.6. Samples: 404415642. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:36,679][25689] Avg episode reward: [(0, '-46.680')] [2022-07-09 19:49:37,821][26022] Updated weights on worker 0-0, policy_version 394943 (0.00087) [2022-07-09 19:49:39,639][26022] Updated weights on worker 0-0, policy_version 394953 (0.00099) [2022-07-09 19:49:41,641][26022] Updated weights on worker 0-0, policy_version 394963 (0.00086) [2022-07-09 19:49:41,727][25689] Fps is (10 sec: 5698.2, 60 sec: 5636.6, 300 sec: 5637.7). Total num frames: 404443136. Throughput: 0: 5965.2. Samples: 404449858. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:41,728][25689] Avg episode reward: [(0, '-46.119')] [2022-07-09 19:49:43,307][26022] Updated weights on worker 0-0, policy_version 394973 (0.00087) [2022-07-09 19:49:45,015][26022] Updated weights on worker 0-0, policy_version 394983 (0.00093) [2022-07-09 19:49:46,740][25689] Fps is (10 sec: 5699.4, 60 sec: 5644.5, 300 sec: 5637.8). Total num frames: 404471808. Throughput: 0: 5109.0. Samples: 404466894. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:46,741][25689] Avg episode reward: [(0, '-46.276')] [2022-07-09 19:49:46,802][26022] Updated weights on worker 0-0, policy_version 394993 (0.00093) [2022-07-09 19:49:48,656][26022] Updated weights on worker 0-0, policy_version 395003 (0.00092) [2022-07-09 19:49:50,499][26022] Updated weights on worker 0-0, policy_version 395013 (0.00083) [2022-07-09 19:49:51,747][25689] Fps is (10 sec: 5723.1, 60 sec: 5661.2, 300 sec: 5638.0). Total num frames: 404500480. Throughput: 0: 5962.2. Samples: 404501038. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:51,747][25689] Avg episode reward: [(0, '-46.583')] [2022-07-09 19:49:52,387][26022] Updated weights on worker 0-0, policy_version 395023 (0.00081) [2022-07-09 19:49:53,962][26022] Updated weights on worker 0-0, policy_version 395033 (0.00090) [2022-07-09 19:49:55,903][26022] Updated weights on worker 0-0, policy_version 395043 (0.00086) [2022-07-09 19:49:56,757][25689] Fps is (10 sec: 5622.7, 60 sec: 5632.7, 300 sec: 5635.7). Total num frames: 404528128. Throughput: 0: 5967.9. Samples: 404535466. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:49:56,757][25689] Avg episode reward: [(0, '-46.832')] [2022-07-09 19:49:57,582][26022] Updated weights on worker 0-0, policy_version 395053 (0.00085) [2022-07-09 19:49:59,498][26022] Updated weights on worker 0-0, policy_version 395063 (0.00082) [2022-07-09 19:50:01,268][26022] Updated weights on worker 0-0, policy_version 395073 (0.00095) [2022-07-09 19:50:01,801][25689] Fps is (10 sec: 5703.7, 60 sec: 5685.3, 300 sec: 5645.6). Total num frames: 404557824. Throughput: 0: 5114.0. Samples: 404552514. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:01,803][25689] Avg episode reward: [(0, '-47.368')] [2022-07-09 19:50:03,396][26022] Updated weights on worker 0-0, policy_version 395083 (0.00087) [2022-07-09 19:50:05,164][26022] Updated weights on worker 0-0, policy_version 395093 (0.00090) [2022-07-09 19:50:06,814][25689] Fps is (10 sec: 5396.3, 60 sec: 5618.1, 300 sec: 5635.4). Total num frames: 404582400. Throughput: 0: 5866.1. Samples: 404584648. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:06,816][25689] Avg episode reward: [(0, '-47.554')] [2022-07-09 19:50:07,120][26022] Updated weights on worker 0-0, policy_version 395103 (0.00082) [2022-07-09 19:50:08,719][26022] Updated weights on worker 0-0, policy_version 395113 (0.00085) [2022-07-09 19:50:10,611][26022] Updated weights on worker 0-0, policy_version 395123 (0.00089) [2022-07-09 19:50:11,839][25689] Fps is (10 sec: 5508.6, 60 sec: 5687.9, 300 sec: 5638.8). Total num frames: 404613120. Throughput: 0: 5874.3. Samples: 404619064. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:11,839][25689] Avg episode reward: [(0, '-47.128')] [2022-07-09 19:50:12,464][26022] Updated weights on worker 0-0, policy_version 395133 (0.00193) [2022-07-09 19:50:14,098][26022] Updated weights on worker 0-0, policy_version 395143 (0.00077) [2022-07-09 19:50:15,969][26022] Updated weights on worker 0-0, policy_version 395153 (0.00088) [2022-07-09 19:50:16,858][25689] Fps is (10 sec: 5811.1, 60 sec: 5655.4, 300 sec: 5640.6). Total num frames: 404640768. Throughput: 0: 5016.5. Samples: 404636304. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:16,860][25689] Avg episode reward: [(0, '-45.946')] [2022-07-09 19:50:17,623][26022] Updated weights on worker 0-0, policy_version 395163 (0.00083) [2022-07-09 19:50:19,733][26022] Updated weights on worker 0-0, policy_version 395173 (0.00083) [2022-07-09 19:50:21,379][26022] Updated weights on worker 0-0, policy_version 395183 (0.00097) [2022-07-09 19:50:21,889][25689] Fps is (10 sec: 5705.7, 60 sec: 5658.1, 300 sec: 5640.8). Total num frames: 404670464. Throughput: 0: 5871.8. Samples: 404670470. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:21,890][25689] Avg episode reward: [(0, '-46.168')] [2022-07-09 19:50:23,187][26022] Updated weights on worker 0-0, policy_version 395193 (0.00487) [2022-07-09 19:50:24,993][26022] Updated weights on worker 0-0, policy_version 395203 (0.00088) [2022-07-09 19:50:26,765][26022] Updated weights on worker 0-0, policy_version 395213 (0.00082) [2022-07-09 19:50:26,900][25689] Fps is (10 sec: 5812.3, 60 sec: 5693.3, 300 sec: 5637.9). Total num frames: 404699136. Throughput: 0: 5966.5. Samples: 404704494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:26,901][25689] Avg episode reward: [(0, '-46.696')] [2022-07-09 19:50:28,768][26022] Updated weights on worker 0-0, policy_version 395223 (0.00087) [2022-07-09 19:50:30,372][26022] Updated weights on worker 0-0, policy_version 395233 (0.00085) [2022-07-09 19:50:31,901][25689] Fps is (10 sec: 5625.1, 60 sec: 5660.8, 300 sec: 5638.1). Total num frames: 404726784. Throughput: 0: 5119.1. Samples: 404721766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:31,902][25689] Avg episode reward: [(0, '-46.609')] [2022-07-09 19:50:32,195][26022] Updated weights on worker 0-0, policy_version 395243 (0.00089) [2022-07-09 19:50:33,835][26022] Updated weights on worker 0-0, policy_version 395253 (0.00087) [2022-07-09 19:50:35,870][26022] Updated weights on worker 0-0, policy_version 395263 (0.00095) [2022-07-09 19:50:36,909][25689] Fps is (10 sec: 5627.2, 60 sec: 5661.5, 300 sec: 5640.3). Total num frames: 404755456. Throughput: 0: 5990.7. Samples: 404756422. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:36,909][25689] Avg episode reward: [(0, '-45.797')] [2022-07-09 19:50:37,387][26022] Updated weights on worker 0-0, policy_version 395273 (0.00081) [2022-07-09 19:50:39,255][26022] Updated weights on worker 0-0, policy_version 395283 (0.00083) [2022-07-09 19:50:41,007][26022] Updated weights on worker 0-0, policy_version 395293 (0.00083) [2022-07-09 19:50:41,967][25689] Fps is (10 sec: 5696.8, 60 sec: 5660.5, 300 sec: 5640.4). Total num frames: 404784128. Throughput: 0: 5985.3. Samples: 404790644. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:41,968][25689] Avg episode reward: [(0, '-46.334')] [2022-07-09 19:50:42,922][26022] Updated weights on worker 0-0, policy_version 395303 (0.00089) [2022-07-09 19:50:44,678][26022] Updated weights on worker 0-0, policy_version 395313 (0.00084) [2022-07-09 19:50:46,459][26022] Updated weights on worker 0-0, policy_version 395323 (0.00088) [2022-07-09 19:50:46,972][25689] Fps is (10 sec: 5698.0, 60 sec: 5661.3, 300 sec: 5636.9). Total num frames: 404812800. Throughput: 0: 5148.1. Samples: 404807828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:46,973][25689] Avg episode reward: [(0, '-46.346')] [2022-07-09 19:50:48,235][26022] Updated weights on worker 0-0, policy_version 395333 (0.00098) [2022-07-09 19:50:50,114][26022] Updated weights on worker 0-0, policy_version 395343 (0.00094) [2022-07-09 19:50:51,747][26022] Updated weights on worker 0-0, policy_version 395353 (0.00086) [2022-07-09 19:50:51,976][25689] Fps is (10 sec: 5729.7, 60 sec: 5661.6, 300 sec: 5641.4). Total num frames: 404841472. Throughput: 0: 6015.5. Samples: 404842522. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:51,976][25689] Avg episode reward: [(0, '-46.685')] [2022-07-09 19:50:53,724][26022] Updated weights on worker 0-0, policy_version 395363 (0.00076) [2022-07-09 19:50:55,451][26022] Updated weights on worker 0-0, policy_version 395373 (0.00098) [2022-07-09 19:50:56,989][25689] Fps is (10 sec: 5725.0, 60 sec: 5678.3, 300 sec: 5639.2). Total num frames: 404870144. Throughput: 0: 5993.9. Samples: 404876780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:50:56,989][25689] Avg episode reward: [(0, '-44.943')] [2022-07-09 19:50:57,432][26022] Updated weights on worker 0-0, policy_version 395383 (0.00058) [2022-07-09 19:50:59,047][26022] Updated weights on worker 0-0, policy_version 395393 (0.00094) [2022-07-09 19:51:00,967][26022] Updated weights on worker 0-0, policy_version 395403 (0.00099) [2022-07-09 19:51:02,039][25689] Fps is (10 sec: 5596.7, 60 sec: 5643.7, 300 sec: 5645.4). Total num frames: 404897792. Throughput: 0: 5132.9. Samples: 404893668. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:02,039][25689] Avg episode reward: [(0, '-45.301')] [2022-07-09 19:51:03,195][26022] Updated weights on worker 0-0, policy_version 395413 (0.00086) [2022-07-09 19:51:04,822][26022] Updated weights on worker 0-0, policy_version 395423 (0.00108) [2022-07-09 19:51:06,756][26022] Updated weights on worker 0-0, policy_version 395433 (0.00090) [2022-07-09 19:51:07,043][25689] Fps is (10 sec: 5500.0, 60 sec: 5695.6, 300 sec: 5642.3). Total num frames: 404925440. Throughput: 0: 5856.8. Samples: 404925374. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:07,043][25689] Avg episode reward: [(0, '-46.033')] [2022-07-09 19:51:08,620][26022] Updated weights on worker 0-0, policy_version 395443 (0.00089) [2022-07-09 19:51:10,179][26022] Updated weights on worker 0-0, policy_version 395453 (0.00086) [2022-07-09 19:51:12,052][25689] Fps is (10 sec: 5419.9, 60 sec: 5629.0, 300 sec: 5638.8). Total num frames: 404952064. Throughput: 0: 5823.9. Samples: 404959446. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:12,054][25689] Avg episode reward: [(0, '-46.258')] [2022-07-09 19:51:12,399][26022] Updated weights on worker 0-0, policy_version 395463 (0.00088) [2022-07-09 19:51:13,732][26022] Updated weights on worker 0-0, policy_version 395473 (0.00087) [2022-07-09 19:51:15,831][26022] Updated weights on worker 0-0, policy_version 395483 (0.00112) [2022-07-09 19:51:17,071][25689] Fps is (10 sec: 5718.4, 60 sec: 5680.1, 300 sec: 5643.0). Total num frames: 404982784. Throughput: 0: 4972.6. Samples: 404976638. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:17,071][25689] Avg episode reward: [(0, '-46.452')] [2022-07-09 19:51:17,380][26022] Updated weights on worker 0-0, policy_version 395493 (0.00061) [2022-07-09 19:51:19,445][26022] Updated weights on worker 0-0, policy_version 395503 (0.00082) [2022-07-09 19:51:21,324][26022] Updated weights on worker 0-0, policy_version 395513 (0.00094) [2022-07-09 19:51:22,115][25689] Fps is (10 sec: 5698.8, 60 sec: 5627.9, 300 sec: 5639.9). Total num frames: 405009408. Throughput: 0: 5817.8. Samples: 405010466. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:22,116][25689] Avg episode reward: [(0, '-46.393')] [2022-07-09 19:51:22,990][26022] Updated weights on worker 0-0, policy_version 395523 (0.00102) [2022-07-09 19:51:24,880][26022] Updated weights on worker 0-0, policy_version 395533 (0.00055) [2022-07-09 19:51:26,840][26022] Updated weights on worker 0-0, policy_version 395543 (0.00089) [2022-07-09 19:51:27,121][25689] Fps is (10 sec: 5501.9, 60 sec: 5628.3, 300 sec: 5643.6). Total num frames: 405038080. Throughput: 0: 5913.2. Samples: 405044102. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:27,123][25689] Avg episode reward: [(0, '-46.150')] [2022-07-09 19:51:27,351][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:51:27,361][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000395547_405040128.pth [2022-07-09 19:51:27,361][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000393563_403008512.pth [2022-07-09 19:51:28,474][26022] Updated weights on worker 0-0, policy_version 395553 (0.00089) [2022-07-09 19:51:30,325][26022] Updated weights on worker 0-0, policy_version 395563 (0.00088) [2022-07-09 19:51:31,948][26022] Updated weights on worker 0-0, policy_version 395573 (0.00090) [2022-07-09 19:51:32,128][25689] Fps is (10 sec: 5726.5, 60 sec: 5644.8, 300 sec: 5643.7). Total num frames: 405066752. Throughput: 0: 5070.8. Samples: 405061250. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:32,129][25689] Avg episode reward: [(0, '-45.546')] [2022-07-09 19:51:33,874][26022] Updated weights on worker 0-0, policy_version 395583 (0.00094) [2022-07-09 19:51:35,933][26022] Updated weights on worker 0-0, policy_version 395593 (0.00083) [2022-07-09 19:51:37,131][25689] Fps is (10 sec: 5728.6, 60 sec: 5645.2, 300 sec: 5645.1). Total num frames: 405095424. Throughput: 0: 5915.0. Samples: 405095296. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:37,132][25689] Avg episode reward: [(0, '-46.111')] [2022-07-09 19:51:37,419][26022] Updated weights on worker 0-0, policy_version 395603 (0.00051) [2022-07-09 19:51:39,435][26022] Updated weights on worker 0-0, policy_version 395613 (0.00385) [2022-07-09 19:51:41,024][26022] Updated weights on worker 0-0, policy_version 395623 (0.00088) [2022-07-09 19:51:42,174][25689] Fps is (10 sec: 5606.8, 60 sec: 5629.7, 300 sec: 5641.6). Total num frames: 405123072. Throughput: 0: 5927.6. Samples: 405129366. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:42,174][25689] Avg episode reward: [(0, '-46.171')] [2022-07-09 19:51:43,026][26022] Updated weights on worker 0-0, policy_version 395633 (0.00094) [2022-07-09 19:51:44,808][26022] Updated weights on worker 0-0, policy_version 395643 (0.00088) [2022-07-09 19:51:46,508][26022] Updated weights on worker 0-0, policy_version 395653 (0.00094) [2022-07-09 19:51:47,202][25689] Fps is (10 sec: 5694.2, 60 sec: 5644.6, 300 sec: 5646.3). Total num frames: 405152768. Throughput: 0: 5104.7. Samples: 405146608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-09 19:51:47,202][25689] Avg episode reward: [(0, '-45.166')] [2022-07-09 19:51:48,394][26022] Updated weights on worker 0-0, policy_version 395663 (0.00093) [2022-07-09 19:51:50,054][26022] Updated weights on worker 0-0, policy_version 395673 (0.00085) [2022-07-09 19:51:52,026][26022] Updated weights on worker 0-0, policy_version 395683 (0.00088) [2022-07-09 19:51:52,208][25689] Fps is (10 sec: 5714.9, 60 sec: 5627.3, 300 sec: 5643.7). Total num frames: 405180416. Throughput: 0: 5957.9. Samples: 405180880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:51:52,208][25689] Avg episode reward: [(0, '-46.231')] [2022-07-09 19:51:53,579][26022] Updated weights on worker 0-0, policy_version 395693 (0.00089) [2022-07-09 19:51:55,602][26022] Updated weights on worker 0-0, policy_version 395703 (0.00478) [2022-07-09 19:51:57,125][26022] Updated weights on worker 0-0, policy_version 395713 (0.00087) [2022-07-09 19:51:57,223][25689] Fps is (10 sec: 5722.3, 60 sec: 5644.1, 300 sec: 5647.9). Total num frames: 405210112. Throughput: 0: 5957.9. Samples: 405215002. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:51:57,223][25689] Avg episode reward: [(0, '-45.748')] [2022-07-09 19:51:59,210][26022] Updated weights on worker 0-0, policy_version 395723 (0.00085) [2022-07-09 19:52:00,834][26022] Updated weights on worker 0-0, policy_version 395733 (0.00094) [2022-07-09 19:52:02,271][25689] Fps is (10 sec: 5495.0, 60 sec: 5610.3, 300 sec: 5645.2). Total num frames: 405235712. Throughput: 0: 5111.9. Samples: 405232100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:02,271][25689] Avg episode reward: [(0, '-45.925')] [2022-07-09 19:52:03,265][26022] Updated weights on worker 0-0, policy_version 395743 (0.00093) [2022-07-09 19:52:04,703][26022] Updated weights on worker 0-0, policy_version 395753 (0.00096) [2022-07-09 19:52:06,758][26022] Updated weights on worker 0-0, policy_version 395763 (0.00088) [2022-07-09 19:52:07,287][25689] Fps is (10 sec: 5291.1, 60 sec: 5609.2, 300 sec: 5642.7). Total num frames: 405263360. Throughput: 0: 5844.8. Samples: 405264000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:07,287][25689] Avg episode reward: [(0, '-45.428')] [2022-07-09 19:52:08,635][26022] Updated weights on worker 0-0, policy_version 395773 (0.00085) [2022-07-09 19:52:10,381][26022] Updated weights on worker 0-0, policy_version 395783 (0.00081) [2022-07-09 19:52:12,101][26022] Updated weights on worker 0-0, policy_version 395793 (0.00090) [2022-07-09 19:52:12,295][25689] Fps is (10 sec: 5720.3, 60 sec: 5660.3, 300 sec: 5646.6). Total num frames: 405293056. Throughput: 0: 5833.3. Samples: 405298058. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:12,296][25689] Avg episode reward: [(0, '-45.726')] [2022-07-09 19:52:13,895][26022] Updated weights on worker 0-0, policy_version 395803 (0.00081) [2022-07-09 19:52:15,775][26022] Updated weights on worker 0-0, policy_version 395813 (0.00089) [2022-07-09 19:52:17,315][25689] Fps is (10 sec: 5718.3, 60 sec: 5609.2, 300 sec: 5643.6). Total num frames: 405320704. Throughput: 0: 4990.8. Samples: 405315276. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:17,315][25689] Avg episode reward: [(0, '-46.741')] [2022-07-09 19:52:17,643][26022] Updated weights on worker 0-0, policy_version 395823 (0.00095) [2022-07-09 19:52:19,311][26022] Updated weights on worker 0-0, policy_version 395833 (0.00084) [2022-07-09 19:52:21,131][26022] Updated weights on worker 0-0, policy_version 395843 (0.00088) [2022-07-09 19:52:22,433][25689] Fps is (10 sec: 5656.3, 60 sec: 5653.2, 300 sec: 5644.8). Total num frames: 405350400. Throughput: 0: 5813.8. Samples: 405349322. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:22,434][25689] Avg episode reward: [(0, '-46.538')] [2022-07-09 19:52:22,934][26022] Updated weights on worker 0-0, policy_version 395853 (0.00094) [2022-07-09 19:52:24,631][26022] Updated weights on worker 0-0, policy_version 395863 (0.01309) [2022-07-09 19:52:26,628][26022] Updated weights on worker 0-0, policy_version 395873 (0.00083) [2022-07-09 19:52:27,435][25689] Fps is (10 sec: 5666.3, 60 sec: 5636.6, 300 sec: 5645.0). Total num frames: 405378048. Throughput: 0: 5918.6. Samples: 405383250. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:27,435][25689] Avg episode reward: [(0, '-47.005')] [2022-07-09 19:52:28,265][26022] Updated weights on worker 0-0, policy_version 395883 (0.00086) [2022-07-09 19:52:30,276][26022] Updated weights on worker 0-0, policy_version 395893 (0.00085) [2022-07-09 19:52:31,984][26022] Updated weights on worker 0-0, policy_version 395903 (0.00090) [2022-07-09 19:52:32,451][25689] Fps is (10 sec: 5622.0, 60 sec: 5635.8, 300 sec: 5648.3). Total num frames: 405406720. Throughput: 0: 5075.4. Samples: 405400356. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:32,452][25689] Avg episode reward: [(0, '-47.496')] [2022-07-09 19:52:33,836][26022] Updated weights on worker 0-0, policy_version 395913 (0.00087) [2022-07-09 19:52:35,671][26022] Updated weights on worker 0-0, policy_version 395923 (0.00098) [2022-07-09 19:52:37,282][26022] Updated weights on worker 0-0, policy_version 395933 (0.00089) [2022-07-09 19:52:37,456][25689] Fps is (10 sec: 5722.4, 60 sec: 5635.6, 300 sec: 5643.2). Total num frames: 405435392. Throughput: 0: 5921.6. Samples: 405434544. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:37,456][25689] Avg episode reward: [(0, '-47.771')] [2022-07-09 19:52:39,131][26022] Updated weights on worker 0-0, policy_version 395943 (0.00097) [2022-07-09 19:52:40,787][26022] Updated weights on worker 0-0, policy_version 395953 (0.00091) [2022-07-09 19:52:42,568][25689] Fps is (10 sec: 5668.3, 60 sec: 5646.1, 300 sec: 5645.0). Total num frames: 405464064. Throughput: 0: 5936.6. Samples: 405468852. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:42,568][25689] Avg episode reward: [(0, '-47.547')] [2022-07-09 19:52:42,737][26022] Updated weights on worker 0-0, policy_version 395963 (0.00087) [2022-07-09 19:52:44,507][26022] Updated weights on worker 0-0, policy_version 395973 (0.00100) [2022-07-09 19:52:46,314][26022] Updated weights on worker 0-0, policy_version 395983 (0.00088) [2022-07-09 19:52:47,614][25689] Fps is (10 sec: 5645.3, 60 sec: 5627.5, 300 sec: 5640.8). Total num frames: 405492736. Throughput: 0: 5931.7. Samples: 405502944. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:47,614][25689] Avg episode reward: [(0, '-46.923')] [2022-07-09 19:52:48,187][26022] Updated weights on worker 0-0, policy_version 395993 (0.00086) [2022-07-09 19:52:49,986][26022] Updated weights on worker 0-0, policy_version 396003 (0.00093) [2022-07-09 19:52:51,680][26022] Updated weights on worker 0-0, policy_version 396013 (0.00084) [2022-07-09 19:52:52,649][25689] Fps is (10 sec: 5789.7, 60 sec: 5658.6, 300 sec: 5647.2). Total num frames: 405522432. Throughput: 0: 5918.5. Samples: 405519898. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:52,651][25689] Avg episode reward: [(0, '-46.886')] [2022-07-09 19:52:53,630][26022] Updated weights on worker 0-0, policy_version 396023 (0.00105) [2022-07-09 19:52:55,479][26022] Updated weights on worker 0-0, policy_version 396033 (0.00094) [2022-07-09 19:52:57,300][26022] Updated weights on worker 0-0, policy_version 396043 (0.00088) [2022-07-09 19:52:57,738][25689] Fps is (10 sec: 5664.1, 60 sec: 5617.9, 300 sec: 5643.8). Total num frames: 405550080. Throughput: 0: 5872.6. Samples: 405553652. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:52:57,738][25689] Avg episode reward: [(0, '-47.268')] [2022-07-09 19:52:59,055][26022] Updated weights on worker 0-0, policy_version 396053 (0.00087) [2022-07-09 19:53:00,816][26022] Updated weights on worker 0-0, policy_version 396063 (0.00083) [2022-07-09 19:53:02,800][25689] Fps is (10 sec: 5346.5, 60 sec: 5633.5, 300 sec: 5646.5). Total num frames: 405576704. Throughput: 0: 5774.7. Samples: 405585688. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:02,801][25689] Avg episode reward: [(0, '-47.091')] [2022-07-09 19:53:02,995][26022] Updated weights on worker 0-0, policy_version 396073 (0.00092) [2022-07-09 19:53:04,990][26022] Updated weights on worker 0-0, policy_version 396083 (0.00096) [2022-07-09 19:53:06,719][26022] Updated weights on worker 0-0, policy_version 396093 (0.00087) [2022-07-09 19:53:07,817][25689] Fps is (10 sec: 5486.2, 60 sec: 5650.3, 300 sec: 5640.5). Total num frames: 405605376. Throughput: 0: 4940.9. Samples: 405602766. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:07,819][25689] Avg episode reward: [(0, '-46.730')] [2022-07-09 19:53:08,492][26022] Updated weights on worker 0-0, policy_version 396103 (0.00086) [2022-07-09 19:53:10,222][26022] Updated weights on worker 0-0, policy_version 396113 (0.00085) [2022-07-09 19:53:12,041][26022] Updated weights on worker 0-0, policy_version 396123 (0.00087) [2022-07-09 19:53:12,831][25689] Fps is (10 sec: 5716.8, 60 sec: 5632.9, 300 sec: 5644.4). Total num frames: 405634048. Throughput: 0: 5807.0. Samples: 405637094. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:12,832][25689] Avg episode reward: [(0, '-46.163')] [2022-07-09 19:53:13,877][26022] Updated weights on worker 0-0, policy_version 396133 (0.00091) [2022-07-09 19:53:15,667][26022] Updated weights on worker 0-0, policy_version 396143 (0.00086) [2022-07-09 19:53:17,421][26022] Updated weights on worker 0-0, policy_version 396153 (0.00089) [2022-07-09 19:53:17,838][25689] Fps is (10 sec: 5824.5, 60 sec: 5667.9, 300 sec: 5648.6). Total num frames: 405663744. Throughput: 0: 5846.3. Samples: 405671164. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:17,839][25689] Avg episode reward: [(0, '-46.215')] [2022-07-09 19:53:19,383][26022] Updated weights on worker 0-0, policy_version 396163 (0.00086) [2022-07-09 19:53:20,863][26022] Updated weights on worker 0-0, policy_version 396173 (0.00068) [2022-07-09 19:53:22,888][25689] Fps is (10 sec: 5701.8, 60 sec: 5640.5, 300 sec: 5644.5). Total num frames: 405691392. Throughput: 0: 5114.9. Samples: 405688436. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:22,889][25689] Avg episode reward: [(0, '-46.066')] [2022-07-09 19:53:23,070][26022] Updated weights on worker 0-0, policy_version 396184 (0.00395) [2022-07-09 19:53:24,679][26022] Updated weights on worker 0-0, policy_version 396194 (0.00345) [2022-07-09 19:53:26,663][26022] Updated weights on worker 0-0, policy_version 396204 (0.00086) [2022-07-09 19:53:27,370][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:53:27,385][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000396208_405716992.pth [2022-07-09 19:53:27,386][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000394222_403683328.pth [2022-07-09 19:53:27,919][25689] Fps is (10 sec: 5586.9, 60 sec: 5654.7, 300 sec: 5642.0). Total num frames: 405720064. Throughput: 0: 5949.1. Samples: 405722354. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:27,919][25689] Avg episode reward: [(0, '-46.054')] [2022-07-09 19:53:28,522][26022] Updated weights on worker 0-0, policy_version 396214 (0.00083) [2022-07-09 19:53:30,159][26022] Updated weights on worker 0-0, policy_version 396224 (0.00085) [2022-07-09 19:53:32,008][26022] Updated weights on worker 0-0, policy_version 396234 (0.00091) [2022-07-09 19:53:32,939][25689] Fps is (10 sec: 5705.2, 60 sec: 5654.3, 300 sec: 5650.3). Total num frames: 405748736. Throughput: 0: 5958.7. Samples: 405756914. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:32,940][25689] Avg episode reward: [(0, '-45.522')] [2022-07-09 19:53:33,632][26022] Updated weights on worker 0-0, policy_version 396244 (0.00087) [2022-07-09 19:53:35,514][26022] Updated weights on worker 0-0, policy_version 396254 (0.00088) [2022-07-09 19:53:37,473][26022] Updated weights on worker 0-0, policy_version 396264 (0.00084) [2022-07-09 19:53:37,950][25689] Fps is (10 sec: 5614.4, 60 sec: 5636.8, 300 sec: 5642.5). Total num frames: 405776384. Throughput: 0: 5114.9. Samples: 405774036. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:37,951][25689] Avg episode reward: [(0, '-45.662')] [2022-07-09 19:53:39,139][26022] Updated weights on worker 0-0, policy_version 396274 (0.00106) [2022-07-09 19:53:41,019][26022] Updated weights on worker 0-0, policy_version 396284 (0.00079) [2022-07-09 19:53:42,795][26022] Updated weights on worker 0-0, policy_version 396294 (0.00089) [2022-07-09 19:53:43,037][25689] Fps is (10 sec: 5678.9, 60 sec: 5656.0, 300 sec: 5646.2). Total num frames: 405806080. Throughput: 0: 5939.9. Samples: 405808118. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:43,038][25689] Avg episode reward: [(0, '-46.318')] [2022-07-09 19:53:44,605][26022] Updated weights on worker 0-0, policy_version 396304 (0.00090) [2022-07-09 19:53:46,530][26022] Updated weights on worker 0-0, policy_version 396314 (0.00088) [2022-07-09 19:53:48,075][25689] Fps is (10 sec: 5663.9, 60 sec: 5639.9, 300 sec: 5645.5). Total num frames: 405833728. Throughput: 0: 5954.9. Samples: 405842380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:48,076][25689] Avg episode reward: [(0, '-46.182')] [2022-07-09 19:53:48,248][26022] Updated weights on worker 0-0, policy_version 396324 (0.00089) [2022-07-09 19:53:50,099][26022] Updated weights on worker 0-0, policy_version 396334 (0.00087) [2022-07-09 19:53:51,799][26022] Updated weights on worker 0-0, policy_version 396344 (0.00085) [2022-07-09 19:53:53,110][25689] Fps is (10 sec: 5591.3, 60 sec: 5623.0, 300 sec: 5642.7). Total num frames: 405862400. Throughput: 0: 5080.1. Samples: 405859382. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:53,110][25689] Avg episode reward: [(0, '-46.697')] [2022-07-09 19:53:53,760][26022] Updated weights on worker 0-0, policy_version 396354 (0.00084) [2022-07-09 19:53:55,482][26022] Updated weights on worker 0-0, policy_version 396364 (0.00086) [2022-07-09 19:53:57,528][26022] Updated weights on worker 0-0, policy_version 396374 (0.00091) [2022-07-09 19:53:58,117][25689] Fps is (10 sec: 5710.4, 60 sec: 5647.6, 300 sec: 5650.7). Total num frames: 405891072. Throughput: 0: 5896.7. Samples: 405892952. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:53:58,117][25689] Avg episode reward: [(0, '-46.875')] [2022-07-09 19:53:59,187][26022] Updated weights on worker 0-0, policy_version 396384 (0.00093) [2022-07-09 19:54:01,066][26022] Updated weights on worker 0-0, policy_version 396394 (0.00085) [2022-07-09 19:54:03,182][25689] Fps is (10 sec: 5489.9, 60 sec: 5647.3, 300 sec: 5642.9). Total num frames: 405917696. Throughput: 0: 5787.2. Samples: 405924702. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:54:03,182][25689] Avg episode reward: [(0, '-47.526')] [2022-07-09 19:54:03,192][26022] Updated weights on worker 0-0, policy_version 396404 (0.00087) [2022-07-09 19:54:04,903][26022] Updated weights on worker 0-0, policy_version 396414 (0.00086) [2022-07-09 19:54:06,615][26022] Updated weights on worker 0-0, policy_version 396424 (0.00090) [2022-07-09 19:54:08,225][25689] Fps is (10 sec: 5368.7, 60 sec: 5627.8, 300 sec: 5646.4). Total num frames: 405945344. Throughput: 0: 4939.8. Samples: 405941922. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:54:08,226][25689] Avg episode reward: [(0, '-47.040')] [2022-07-09 19:54:08,575][26022] Updated weights on worker 0-0, policy_version 396434 (0.00093) [2022-07-09 19:54:10,299][26022] Updated weights on worker 0-0, policy_version 396444 (0.00089) [2022-07-09 19:54:12,139][26022] Updated weights on worker 0-0, policy_version 396454 (0.00087) [2022-07-09 19:54:13,255][25689] Fps is (10 sec: 5794.5, 60 sec: 5660.3, 300 sec: 5649.9). Total num frames: 405976064. Throughput: 0: 5802.3. Samples: 405976272. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:54:13,255][25689] Avg episode reward: [(0, '-47.598')] [2022-07-09 19:54:13,841][26022] Updated weights on worker 0-0, policy_version 396464 (0.00092) [2022-07-09 19:54:15,666][26022] Updated weights on worker 0-0, policy_version 396474 (0.00083) [2022-07-09 19:54:17,636][26022] Updated weights on worker 0-0, policy_version 396484 (0.00090) [2022-07-09 19:54:18,261][25689] Fps is (10 sec: 5714.1, 60 sec: 5609.5, 300 sec: 5640.6). Total num frames: 406002688. Throughput: 0: 5846.5. Samples: 406010726. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:54:18,262][25689] Avg episode reward: [(0, '-47.629')] [2022-07-09 19:54:19,206][26022] Updated weights on worker 0-0, policy_version 396494 (0.00091) [2022-07-09 19:54:21,287][26022] Updated weights on worker 0-0, policy_version 396504 (0.00109) [2022-07-09 19:54:22,912][26022] Updated weights on worker 0-0, policy_version 396514 (0.00090) [2022-07-09 19:54:23,371][25689] Fps is (10 sec: 5567.5, 60 sec: 5637.8, 300 sec: 5649.3). Total num frames: 406032384. Throughput: 0: 5108.1. Samples: 406027830. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:54:23,371][25689] Avg episode reward: [(0, '-47.115')] [2022-07-09 19:54:24,735][26022] Updated weights on worker 0-0, policy_version 396524 (0.00081) [2022-07-09 19:54:26,612][26022] Updated weights on worker 0-0, policy_version 396534 (0.00087) [2022-07-09 19:54:28,377][25689] Fps is (10 sec: 5769.9, 60 sec: 5640.1, 300 sec: 5646.1). Total num frames: 406061056. Throughput: 0: 5944.7. Samples: 406061716. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-09 19:54:28,377][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 19:54:28,378][26022] Updated weights on worker 0-0, policy_version 396544 (0.00088) [2022-07-09 19:54:30,276][26022] Updated weights on worker 0-0, policy_version 396554 (0.00097) [2022-07-09 19:54:32,118][26022] Updated weights on worker 0-0, policy_version 396564 (0.00086) [2022-07-09 19:54:33,409][25689] Fps is (10 sec: 5610.8, 60 sec: 5622.1, 300 sec: 5642.3). Total num frames: 406088704. Throughput: 0: 5928.3. Samples: 406095750. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:54:33,409][25689] Avg episode reward: [(0, '-45.947')] [2022-07-09 19:54:33,818][26022] Updated weights on worker 0-0, policy_version 396574 (0.00084) [2022-07-09 19:54:35,775][26022] Updated weights on worker 0-0, policy_version 396584 (0.00098) [2022-07-09 19:54:37,366][26022] Updated weights on worker 0-0, policy_version 396594 (0.00091) [2022-07-09 19:54:38,468][25689] Fps is (10 sec: 5581.4, 60 sec: 5634.6, 300 sec: 5642.1). Total num frames: 406117376. Throughput: 0: 5047.7. Samples: 406112722. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:54:38,468][25689] Avg episode reward: [(0, '-46.442')] [2022-07-09 19:54:39,375][26022] Updated weights on worker 0-0, policy_version 396604 (0.00095) [2022-07-09 19:54:41,022][26022] Updated weights on worker 0-0, policy_version 396614 (0.00093) [2022-07-09 19:54:42,790][26022] Updated weights on worker 0-0, policy_version 396624 (0.00091) [2022-07-09 19:54:43,513][25689] Fps is (10 sec: 5776.5, 60 sec: 5638.5, 300 sec: 5644.9). Total num frames: 406147072. Throughput: 0: 5907.0. Samples: 406146810. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:54:43,514][25689] Avg episode reward: [(0, '-46.653')] [2022-07-09 19:54:44,767][26022] Updated weights on worker 0-0, policy_version 396634 (0.00096) [2022-07-09 19:54:46,419][26022] Updated weights on worker 0-0, policy_version 396644 (0.00085) [2022-07-09 19:54:48,350][26022] Updated weights on worker 0-0, policy_version 396654 (0.00084) [2022-07-09 19:54:48,534][25689] Fps is (10 sec: 5696.6, 60 sec: 5640.0, 300 sec: 5641.2). Total num frames: 406174720. Throughput: 0: 5903.2. Samples: 406180706. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:54:48,535][25689] Avg episode reward: [(0, '-47.884')] [2022-07-09 19:54:50,125][26022] Updated weights on worker 0-0, policy_version 396664 (0.00083) [2022-07-09 19:54:51,851][26022] Updated weights on worker 0-0, policy_version 396674 (0.00093) [2022-07-09 19:54:53,540][25689] Fps is (10 sec: 5616.7, 60 sec: 5642.7, 300 sec: 5644.7). Total num frames: 406203392. Throughput: 0: 5072.0. Samples: 406197858. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:54:53,541][25689] Avg episode reward: [(0, '-48.014')] [2022-07-09 19:54:53,697][26022] Updated weights on worker 0-0, policy_version 396684 (0.00093) [2022-07-09 19:54:55,662][26022] Updated weights on worker 0-0, policy_version 396694 (0.00107) [2022-07-09 19:54:57,279][26022] Updated weights on worker 0-0, policy_version 396704 (0.00083) [2022-07-09 19:54:58,580][25689] Fps is (10 sec: 5504.1, 60 sec: 5605.7, 300 sec: 5634.5). Total num frames: 406230016. Throughput: 0: 5902.8. Samples: 406231442. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:54:58,581][25689] Avg episode reward: [(0, '-48.642')] [2022-07-09 19:54:59,301][26022] Updated weights on worker 0-0, policy_version 396714 (0.00096) [2022-07-09 19:55:01,108][26022] Updated weights on worker 0-0, policy_version 396724 (0.00087) [2022-07-09 19:55:03,379][26022] Updated weights on worker 0-0, policy_version 396734 (0.00084) [2022-07-09 19:55:03,638][25689] Fps is (10 sec: 5374.6, 60 sec: 5623.4, 300 sec: 5644.0). Total num frames: 406257664. Throughput: 0: 5784.6. Samples: 406263226. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:03,639][25689] Avg episode reward: [(0, '-48.863')] [2022-07-09 19:55:05,004][26022] Updated weights on worker 0-0, policy_version 396744 (0.00095) [2022-07-09 19:55:06,754][26022] Updated weights on worker 0-0, policy_version 396754 (0.00089) [2022-07-09 19:55:08,656][25689] Fps is (10 sec: 5589.4, 60 sec: 5642.7, 300 sec: 5637.2). Total num frames: 406286336. Throughput: 0: 4947.1. Samples: 406280254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:08,658][25689] Avg episode reward: [(0, '-48.233')] [2022-07-09 19:55:08,667][26022] Updated weights on worker 0-0, policy_version 396764 (0.00095) [2022-07-09 19:55:10,503][26022] Updated weights on worker 0-0, policy_version 396774 (0.00089) [2022-07-09 19:55:12,268][26022] Updated weights on worker 0-0, policy_version 396784 (0.00093) [2022-07-09 19:55:13,747][25689] Fps is (10 sec: 5672.8, 60 sec: 5603.1, 300 sec: 5639.3). Total num frames: 406315008. Throughput: 0: 5778.8. Samples: 406314626. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:13,747][25689] Avg episode reward: [(0, '-48.124')] [2022-07-09 19:55:14,168][26022] Updated weights on worker 0-0, policy_version 396794 (0.01000) [2022-07-09 19:55:15,704][26022] Updated weights on worker 0-0, policy_version 396804 (0.00089) [2022-07-09 19:55:17,597][26022] Updated weights on worker 0-0, policy_version 396814 (0.00087) [2022-07-09 19:55:18,772][25689] Fps is (10 sec: 5770.2, 60 sec: 5652.2, 300 sec: 5639.4). Total num frames: 406344704. Throughput: 0: 5829.5. Samples: 406349150. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:18,774][25689] Avg episode reward: [(0, '-47.630')] [2022-07-09 19:55:19,342][26022] Updated weights on worker 0-0, policy_version 396824 (0.00104) [2022-07-09 19:55:21,275][26022] Updated weights on worker 0-0, policy_version 396834 (0.00094) [2022-07-09 19:55:22,890][26022] Updated weights on worker 0-0, policy_version 396844 (0.00100) [2022-07-09 19:55:23,828][25689] Fps is (10 sec: 5790.0, 60 sec: 5640.3, 300 sec: 5638.5). Total num frames: 406373376. Throughput: 0: 5101.1. Samples: 406366214. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:23,828][25689] Avg episode reward: [(0, '-47.522')] [2022-07-09 19:55:24,739][26022] Updated weights on worker 0-0, policy_version 396854 (0.00081) [2022-07-09 19:55:26,432][26022] Updated weights on worker 0-0, policy_version 396864 (0.00106) [2022-07-09 19:55:27,499][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:55:27,510][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000396869_406393856.pth [2022-07-09 19:55:27,511][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000394883_404360192.pth [2022-07-09 19:55:28,535][26022] Updated weights on worker 0-0, policy_version 396874 (0.00091) [2022-07-09 19:55:28,836][25689] Fps is (10 sec: 5697.7, 60 sec: 5640.0, 300 sec: 5641.9). Total num frames: 406402048. Throughput: 0: 5966.7. Samples: 406400662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:28,837][25689] Avg episode reward: [(0, '-47.381')] [2022-07-09 19:55:30,151][26022] Updated weights on worker 0-0, policy_version 396884 (0.00080) [2022-07-09 19:55:31,917][26022] Updated weights on worker 0-0, policy_version 396894 (0.00087) [2022-07-09 19:55:33,525][26022] Updated weights on worker 0-0, policy_version 396904 (0.00101) [2022-07-09 19:55:33,844][25689] Fps is (10 sec: 5725.0, 60 sec: 5659.2, 300 sec: 5641.8). Total num frames: 406430720. Throughput: 0: 6020.4. Samples: 406435620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:33,844][25689] Avg episode reward: [(0, '-48.207')] [2022-07-09 19:55:35,521][26022] Updated weights on worker 0-0, policy_version 396914 (0.00077) [2022-07-09 19:55:37,077][26022] Updated weights on worker 0-0, policy_version 396924 (0.00091) [2022-07-09 19:55:38,870][25689] Fps is (10 sec: 5511.2, 60 sec: 5628.4, 300 sec: 5635.6). Total num frames: 406457344. Throughput: 0: 5153.1. Samples: 406452716. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:38,870][25689] Avg episode reward: [(0, '-48.331')] [2022-07-09 19:55:39,107][26022] Updated weights on worker 0-0, policy_version 396934 (0.00058) [2022-07-09 19:55:40,709][26022] Updated weights on worker 0-0, policy_version 396944 (0.00094) [2022-07-09 19:55:42,747][26022] Updated weights on worker 0-0, policy_version 396954 (0.00097) [2022-07-09 19:55:43,962][25689] Fps is (10 sec: 5768.7, 60 sec: 5657.9, 300 sec: 5644.3). Total num frames: 406489088. Throughput: 0: 5991.0. Samples: 406486838. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:43,962][25689] Avg episode reward: [(0, '-48.356')] [2022-07-09 19:55:44,191][26022] Updated weights on worker 0-0, policy_version 396964 (0.00089) [2022-07-09 19:55:46,200][26022] Updated weights on worker 0-0, policy_version 396974 (0.00083) [2022-07-09 19:55:47,894][26022] Updated weights on worker 0-0, policy_version 396984 (0.00091) [2022-07-09 19:55:48,965][25689] Fps is (10 sec: 5882.9, 60 sec: 5659.6, 300 sec: 5640.8). Total num frames: 406516736. Throughput: 0: 5992.3. Samples: 406521282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:48,966][25689] Avg episode reward: [(0, '-48.064')] [2022-07-09 19:55:49,822][26022] Updated weights on worker 0-0, policy_version 396994 (0.00094) [2022-07-09 19:55:51,335][26022] Updated weights on worker 0-0, policy_version 397004 (0.00082) [2022-07-09 19:55:53,340][26022] Updated weights on worker 0-0, policy_version 397014 (0.00395) [2022-07-09 19:55:54,018][25689] Fps is (10 sec: 5702.4, 60 sec: 5672.2, 300 sec: 5643.5). Total num frames: 406546432. Throughput: 0: 5104.1. Samples: 406538592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:54,018][25689] Avg episode reward: [(0, '-47.873')] [2022-07-09 19:55:55,120][26022] Updated weights on worker 0-0, policy_version 397024 (0.00089) [2022-07-09 19:55:56,986][26022] Updated weights on worker 0-0, policy_version 397034 (0.00089) [2022-07-09 19:55:58,908][26022] Updated weights on worker 0-0, policy_version 397044 (0.00091) [2022-07-09 19:55:59,089][25689] Fps is (10 sec: 5664.1, 60 sec: 5686.1, 300 sec: 5643.1). Total num frames: 406574080. Throughput: 0: 5923.9. Samples: 406572496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:55:59,090][25689] Avg episode reward: [(0, '-47.251')] [2022-07-09 19:56:00,643][26022] Updated weights on worker 0-0, policy_version 397054 (0.00090) [2022-07-09 19:56:02,695][26022] Updated weights on worker 0-0, policy_version 397064 (0.00053) [2022-07-09 19:56:04,157][25689] Fps is (10 sec: 5453.9, 60 sec: 5685.2, 300 sec: 5641.9). Total num frames: 406601728. Throughput: 0: 5847.0. Samples: 406604918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:04,157][25689] Avg episode reward: [(0, '-47.784')] [2022-07-09 19:56:04,676][26022] Updated weights on worker 0-0, policy_version 397074 (0.00620) [2022-07-09 19:56:06,189][26022] Updated weights on worker 0-0, policy_version 397084 (0.00093) [2022-07-09 19:56:08,284][26022] Updated weights on worker 0-0, policy_version 397094 (0.00081) [2022-07-09 19:56:09,176][25689] Fps is (10 sec: 5583.9, 60 sec: 5685.2, 300 sec: 5648.6). Total num frames: 406630400. Throughput: 0: 5825.0. Samples: 406639006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:09,176][25689] Avg episode reward: [(0, '-47.220')] [2022-07-09 19:56:09,598][26022] Updated weights on worker 0-0, policy_version 397104 (0.00083) [2022-07-09 19:56:11,770][26022] Updated weights on worker 0-0, policy_version 397114 (0.00086) [2022-07-09 19:56:13,315][26022] Updated weights on worker 0-0, policy_version 397124 (0.00090) [2022-07-09 19:56:14,194][25689] Fps is (10 sec: 5611.4, 60 sec: 5675.1, 300 sec: 5638.3). Total num frames: 406658048. Throughput: 0: 5833.0. Samples: 406656276. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:14,194][25689] Avg episode reward: [(0, '-46.987')] [2022-07-09 19:56:15,174][26022] Updated weights on worker 0-0, policy_version 397134 (0.00085) [2022-07-09 19:56:17,122][26022] Updated weights on worker 0-0, policy_version 397144 (0.00087) [2022-07-09 19:56:18,783][26022] Updated weights on worker 0-0, policy_version 397154 (0.00084) [2022-07-09 19:56:19,203][25689] Fps is (10 sec: 5616.7, 60 sec: 5659.6, 300 sec: 5645.8). Total num frames: 406686720. Throughput: 0: 5870.0. Samples: 406690560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:19,203][25689] Avg episode reward: [(0, '-47.452')] [2022-07-09 19:56:20,696][26022] Updated weights on worker 0-0, policy_version 397164 (0.00092) [2022-07-09 19:56:22,770][26022] Updated weights on worker 0-0, policy_version 397174 (0.00098) [2022-07-09 19:56:24,303][25689] Fps is (10 sec: 5773.4, 60 sec: 5672.3, 300 sec: 5647.5). Total num frames: 406716416. Throughput: 0: 5935.7. Samples: 406724502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:24,304][25689] Avg episode reward: [(0, '-47.715')] [2022-07-09 19:56:24,306][26022] Updated weights on worker 0-0, policy_version 397184 (0.00082) [2022-07-09 19:56:26,351][26022] Updated weights on worker 0-0, policy_version 397194 (0.00084) [2022-07-09 19:56:27,916][26022] Updated weights on worker 0-0, policy_version 397204 (0.00085) [2022-07-09 19:56:29,379][25689] Fps is (10 sec: 5534.6, 60 sec: 5632.3, 300 sec: 5639.3). Total num frames: 406743040. Throughput: 0: 5058.6. Samples: 406741206. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:29,379][25689] Avg episode reward: [(0, '-47.299')] [2022-07-09 19:56:29,772][26022] Updated weights on worker 0-0, policy_version 397214 (0.00086) [2022-07-09 19:56:31,492][26022] Updated weights on worker 0-0, policy_version 397224 (0.00074) [2022-07-09 19:56:33,355][26022] Updated weights on worker 0-0, policy_version 397234 (0.00085) [2022-07-09 19:56:34,405][25689] Fps is (10 sec: 5676.6, 60 sec: 5664.3, 300 sec: 5645.8). Total num frames: 406773760. Throughput: 0: 5898.6. Samples: 406775496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:34,406][25689] Avg episode reward: [(0, '-46.828')] [2022-07-09 19:56:35,182][26022] Updated weights on worker 0-0, policy_version 397244 (0.00092) [2022-07-09 19:56:37,019][26022] Updated weights on worker 0-0, policy_version 397254 (0.00089) [2022-07-09 19:56:38,636][26022] Updated weights on worker 0-0, policy_version 397264 (0.00094) [2022-07-09 19:56:39,437][25689] Fps is (10 sec: 5803.2, 60 sec: 5680.7, 300 sec: 5646.0). Total num frames: 406801408. Throughput: 0: 5894.5. Samples: 406809828. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:39,437][25689] Avg episode reward: [(0, '-46.511')] [2022-07-09 19:56:40,540][26022] Updated weights on worker 0-0, policy_version 397274 (0.00086) [2022-07-09 19:56:42,331][26022] Updated weights on worker 0-0, policy_version 397284 (0.00514) [2022-07-09 19:56:44,200][26022] Updated weights on worker 0-0, policy_version 397294 (0.00089) [2022-07-09 19:56:44,520][25689] Fps is (10 sec: 5669.3, 60 sec: 5647.7, 300 sec: 5644.9). Total num frames: 406831104. Throughput: 0: 5061.3. Samples: 406826824. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:44,521][25689] Avg episode reward: [(0, '-46.693')] [2022-07-09 19:56:46,103][26022] Updated weights on worker 0-0, policy_version 397304 (0.00527) [2022-07-09 19:56:47,655][26022] Updated weights on worker 0-0, policy_version 397314 (0.00085) [2022-07-09 19:56:49,524][25689] Fps is (10 sec: 5684.8, 60 sec: 5647.7, 300 sec: 5645.0). Total num frames: 406858752. Throughput: 0: 5958.6. Samples: 406861242. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:49,524][25689] Avg episode reward: [(0, '-46.704')] [2022-07-09 19:56:49,531][26022] Updated weights on worker 0-0, policy_version 397324 (0.00083) [2022-07-09 19:56:51,326][26022] Updated weights on worker 0-0, policy_version 397334 (0.00087) [2022-07-09 19:56:53,095][26022] Updated weights on worker 0-0, policy_version 397344 (0.00084) [2022-07-09 19:56:54,539][25689] Fps is (10 sec: 5519.2, 60 sec: 5617.4, 300 sec: 5638.1). Total num frames: 406886400. Throughput: 0: 5968.8. Samples: 406895670. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:54,539][25689] Avg episode reward: [(0, '-46.148')] [2022-07-09 19:56:55,028][26022] Updated weights on worker 0-0, policy_version 397354 (0.00109) [2022-07-09 19:56:56,615][26022] Updated weights on worker 0-0, policy_version 397364 (0.00088) [2022-07-09 19:56:58,619][26022] Updated weights on worker 0-0, policy_version 397374 (0.00086) [2022-07-09 19:56:59,592][25689] Fps is (10 sec: 5797.2, 60 sec: 5669.8, 300 sec: 5655.2). Total num frames: 406917120. Throughput: 0: 5097.7. Samples: 406912574. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:56:59,593][25689] Avg episode reward: [(0, '-46.151')] [2022-07-09 19:57:00,363][26022] Updated weights on worker 0-0, policy_version 397384 (0.00094) [2022-07-09 19:57:02,455][26022] Updated weights on worker 0-0, policy_version 397394 (0.00095) [2022-07-09 19:57:04,551][26022] Updated weights on worker 0-0, policy_version 397404 (0.00959) [2022-07-09 19:57:04,680][25689] Fps is (10 sec: 5553.7, 60 sec: 5634.1, 300 sec: 5646.9). Total num frames: 406942720. Throughput: 0: 5839.5. Samples: 406944548. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-09 19:57:04,681][25689] Avg episode reward: [(0, '-46.716')] [2022-07-09 19:57:05,786][26022] Updated weights on worker 0-0, policy_version 397414 (0.00085) [2022-07-09 19:57:07,943][26022] Updated weights on worker 0-0, policy_version 397424 (0.00093) [2022-07-09 19:57:09,623][26022] Updated weights on worker 0-0, policy_version 397434 (0.00096) [2022-07-09 19:57:09,697][25689] Fps is (10 sec: 5472.4, 60 sec: 5651.2, 300 sec: 5646.8). Total num frames: 406972416. Throughput: 0: 5840.2. Samples: 406979054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:09,699][25689] Avg episode reward: [(0, '-46.589')] [2022-07-09 19:57:11,476][26022] Updated weights on worker 0-0, policy_version 397444 (0.00085) [2022-07-09 19:57:13,295][26022] Updated weights on worker 0-0, policy_version 397454 (0.00095) [2022-07-09 19:57:14,703][25689] Fps is (10 sec: 5823.1, 60 sec: 5669.2, 300 sec: 5650.5). Total num frames: 407001088. Throughput: 0: 4994.0. Samples: 406996372. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:14,704][25689] Avg episode reward: [(0, '-46.306')] [2022-07-09 19:57:14,893][26022] Updated weights on worker 0-0, policy_version 397464 (0.00093) [2022-07-09 19:57:16,810][26022] Updated weights on worker 0-0, policy_version 397474 (0.00099) [2022-07-09 19:57:18,616][26022] Updated weights on worker 0-0, policy_version 397484 (0.00089) [2022-07-09 19:57:19,710][25689] Fps is (10 sec: 5828.7, 60 sec: 5686.3, 300 sec: 5652.6). Total num frames: 407030784. Throughput: 0: 5885.5. Samples: 407030980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:19,712][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 19:57:20,349][26022] Updated weights on worker 0-0, policy_version 397494 (0.00086) [2022-07-09 19:57:22,119][26022] Updated weights on worker 0-0, policy_version 397504 (0.00079) [2022-07-09 19:57:23,980][26022] Updated weights on worker 0-0, policy_version 397514 (0.00092) [2022-07-09 19:57:24,831][25689] Fps is (10 sec: 5763.5, 60 sec: 5667.5, 300 sec: 5653.8). Total num frames: 407059456. Throughput: 0: 6005.8. Samples: 407065568. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:24,832][25689] Avg episode reward: [(0, '-46.606')] [2022-07-09 19:57:25,647][26022] Updated weights on worker 0-0, policy_version 397524 (0.00087) [2022-07-09 19:57:27,541][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:57:27,550][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000397533_407073792.pth [2022-07-09 19:57:27,550][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000395547_405040128.pth [2022-07-09 19:57:27,660][26022] Updated weights on worker 0-0, policy_version 397534 (0.00090) [2022-07-09 19:57:29,376][26022] Updated weights on worker 0-0, policy_version 397544 (0.00084) [2022-07-09 19:57:29,890][25689] Fps is (10 sec: 5532.7, 60 sec: 5685.9, 300 sec: 5649.5). Total num frames: 407087104. Throughput: 0: 5111.9. Samples: 407082280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:29,892][25689] Avg episode reward: [(0, '-46.825')] [2022-07-09 19:57:31,090][26022] Updated weights on worker 0-0, policy_version 397554 (0.00104) [2022-07-09 19:57:33,069][26022] Updated weights on worker 0-0, policy_version 397564 (0.00086) [2022-07-09 19:57:34,555][26022] Updated weights on worker 0-0, policy_version 397574 (0.00087) [2022-07-09 19:57:34,896][25689] Fps is (10 sec: 5697.4, 60 sec: 5671.0, 300 sec: 5652.9). Total num frames: 407116800. Throughput: 0: 5949.6. Samples: 407116508. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:34,897][25689] Avg episode reward: [(0, '-46.176')] [2022-07-09 19:57:36,770][26022] Updated weights on worker 0-0, policy_version 397584 (0.00109) [2022-07-09 19:57:38,139][26022] Updated weights on worker 0-0, policy_version 397594 (0.00087) [2022-07-09 19:57:39,897][25689] Fps is (10 sec: 5730.1, 60 sec: 5673.8, 300 sec: 5651.6). Total num frames: 407144448. Throughput: 0: 5930.0. Samples: 407150688. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:39,898][25689] Avg episode reward: [(0, '-46.304')] [2022-07-09 19:57:40,252][26022] Updated weights on worker 0-0, policy_version 397604 (0.00088) [2022-07-09 19:57:41,944][26022] Updated weights on worker 0-0, policy_version 397614 (0.00085) [2022-07-09 19:57:43,819][26022] Updated weights on worker 0-0, policy_version 397624 (0.00086) [2022-07-09 19:57:44,968][25689] Fps is (10 sec: 5693.0, 60 sec: 5674.9, 300 sec: 5654.6). Total num frames: 407174144. Throughput: 0: 5072.0. Samples: 407167706. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:44,969][25689] Avg episode reward: [(0, '-46.689')] [2022-07-09 19:57:45,488][26022] Updated weights on worker 0-0, policy_version 397634 (0.00093) [2022-07-09 19:57:47,507][26022] Updated weights on worker 0-0, policy_version 397644 (0.00086) [2022-07-09 19:57:49,048][26022] Updated weights on worker 0-0, policy_version 397654 (0.00090) [2022-07-09 19:57:50,067][25689] Fps is (10 sec: 5739.6, 60 sec: 5683.0, 300 sec: 5649.9). Total num frames: 407202816. Throughput: 0: 5941.9. Samples: 407202168. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:50,067][25689] Avg episode reward: [(0, '-46.458')] [2022-07-09 19:57:51,089][26022] Updated weights on worker 0-0, policy_version 397664 (0.00082) [2022-07-09 19:57:52,527][26022] Updated weights on worker 0-0, policy_version 397674 (0.00080) [2022-07-09 19:57:54,541][26022] Updated weights on worker 0-0, policy_version 397684 (0.00087) [2022-07-09 19:57:55,141][25689] Fps is (10 sec: 5737.5, 60 sec: 5711.2, 300 sec: 5657.1). Total num frames: 407232512. Throughput: 0: 5948.9. Samples: 407236948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:57:55,142][25689] Avg episode reward: [(0, '-46.346')] [2022-07-09 19:57:56,217][26022] Updated weights on worker 0-0, policy_version 397694 (0.00093) [2022-07-09 19:57:58,155][26022] Updated weights on worker 0-0, policy_version 397704 (0.00082) [2022-07-09 19:57:59,800][26022] Updated weights on worker 0-0, policy_version 397714 (0.00088) [2022-07-09 19:58:00,171][25689] Fps is (10 sec: 5776.7, 60 sec: 5679.6, 300 sec: 5664.6). Total num frames: 407261184. Throughput: 0: 5095.9. Samples: 407253998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:00,171][25689] Avg episode reward: [(0, '-46.007')] [2022-07-09 19:58:01,962][26022] Updated weights on worker 0-0, policy_version 397724 (0.00088) [2022-07-09 19:58:03,773][26022] Updated weights on worker 0-0, policy_version 397734 (0.00087) [2022-07-09 19:58:05,306][25689] Fps is (10 sec: 5440.2, 60 sec: 5692.1, 300 sec: 5655.4). Total num frames: 407287808. Throughput: 0: 5837.9. Samples: 407286434. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:05,306][25689] Avg episode reward: [(0, '-46.864')] [2022-07-09 19:58:05,410][26022] Updated weights on worker 0-0, policy_version 397744 (0.00082) [2022-07-09 19:58:07,367][26022] Updated weights on worker 0-0, policy_version 397754 (0.00089) [2022-07-09 19:58:09,131][26022] Updated weights on worker 0-0, policy_version 397764 (0.00094) [2022-07-09 19:58:10,350][25689] Fps is (10 sec: 5432.0, 60 sec: 5672.6, 300 sec: 5654.9). Total num frames: 407316480. Throughput: 0: 5857.3. Samples: 407320976. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:10,351][25689] Avg episode reward: [(0, '-47.017')] [2022-07-09 19:58:10,883][26022] Updated weights on worker 0-0, policy_version 397774 (0.00092) [2022-07-09 19:58:12,734][26022] Updated weights on worker 0-0, policy_version 397784 (0.00084) [2022-07-09 19:58:14,311][26022] Updated weights on worker 0-0, policy_version 397794 (0.00092) [2022-07-09 19:58:15,391][25689] Fps is (10 sec: 5889.2, 60 sec: 5703.2, 300 sec: 5657.7). Total num frames: 407347200. Throughput: 0: 5848.0. Samples: 407355366. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:15,391][25689] Avg episode reward: [(0, '-46.282')] [2022-07-09 19:58:16,423][26022] Updated weights on worker 0-0, policy_version 397804 (0.00090) [2022-07-09 19:58:17,875][26022] Updated weights on worker 0-0, policy_version 397814 (0.00091) [2022-07-09 19:58:19,907][26022] Updated weights on worker 0-0, policy_version 397824 (0.00084) [2022-07-09 19:58:20,447][25689] Fps is (10 sec: 5882.6, 60 sec: 5681.8, 300 sec: 5661.0). Total num frames: 407375872. Throughput: 0: 5857.1. Samples: 407372756. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:20,447][25689] Avg episode reward: [(0, '-46.669')] [2022-07-09 19:58:21,462][26022] Updated weights on worker 0-0, policy_version 397834 (0.00088) [2022-07-09 19:58:23,353][26022] Updated weights on worker 0-0, policy_version 397844 (0.00089) [2022-07-09 19:58:25,155][26022] Updated weights on worker 0-0, policy_version 397854 (0.00089) [2022-07-09 19:58:25,522][25689] Fps is (10 sec: 5659.9, 60 sec: 5685.9, 300 sec: 5660.1). Total num frames: 407404544. Throughput: 0: 5976.9. Samples: 407407266. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:25,523][25689] Avg episode reward: [(0, '-46.917')] [2022-07-09 19:58:26,970][26022] Updated weights on worker 0-0, policy_version 397864 (0.00093) [2022-07-09 19:58:28,797][26022] Updated weights on worker 0-0, policy_version 397874 (0.00085) [2022-07-09 19:58:30,541][25689] Fps is (10 sec: 5579.2, 60 sec: 5689.7, 300 sec: 5656.7). Total num frames: 407432192. Throughput: 0: 5959.7. Samples: 407441308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:30,543][25689] Avg episode reward: [(0, '-47.603')] [2022-07-09 19:58:30,717][26022] Updated weights on worker 0-0, policy_version 397884 (0.00087) [2022-07-09 19:58:32,359][26022] Updated weights on worker 0-0, policy_version 397894 (0.00086) [2022-07-09 19:58:34,444][26022] Updated weights on worker 0-0, policy_version 397904 (0.00081) [2022-07-09 19:58:35,556][25689] Fps is (10 sec: 5715.3, 60 sec: 5688.9, 300 sec: 5663.6). Total num frames: 407461888. Throughput: 0: 5107.3. Samples: 407458354. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:35,556][25689] Avg episode reward: [(0, '-46.238')] [2022-07-09 19:58:35,746][26022] Updated weights on worker 0-0, policy_version 397914 (0.00082) [2022-07-09 19:58:37,928][26022] Updated weights on worker 0-0, policy_version 397924 (0.00086) [2022-07-09 19:58:39,288][26022] Updated weights on worker 0-0, policy_version 397934 (0.00090) [2022-07-09 19:58:40,579][25689] Fps is (10 sec: 5712.6, 60 sec: 5686.8, 300 sec: 5657.9). Total num frames: 407489536. Throughput: 0: 5960.3. Samples: 407492754. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:40,580][25689] Avg episode reward: [(0, '-46.561')] [2022-07-09 19:58:41,492][26022] Updated weights on worker 0-0, policy_version 397944 (0.00098) [2022-07-09 19:58:43,143][26022] Updated weights on worker 0-0, policy_version 397954 (0.00082) [2022-07-09 19:58:44,873][26022] Updated weights on worker 0-0, policy_version 397964 (0.00090) [2022-07-09 19:58:45,647][25689] Fps is (10 sec: 5581.0, 60 sec: 5670.2, 300 sec: 5660.7). Total num frames: 407518208. Throughput: 0: 5955.4. Samples: 407527118. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:45,649][25689] Avg episode reward: [(0, '-45.895')] [2022-07-09 19:58:46,771][26022] Updated weights on worker 0-0, policy_version 397974 (0.00090) [2022-07-09 19:58:48,715][26022] Updated weights on worker 0-0, policy_version 397984 (0.00087) [2022-07-09 19:58:50,234][26022] Updated weights on worker 0-0, policy_version 397994 (0.00091) [2022-07-09 19:58:50,671][25689] Fps is (10 sec: 5885.4, 60 sec: 5711.0, 300 sec: 5667.8). Total num frames: 407548928. Throughput: 0: 5109.0. Samples: 407544150. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:50,673][25689] Avg episode reward: [(0, '-45.865')] [2022-07-09 19:58:52,376][26022] Updated weights on worker 0-0, policy_version 398004 (0.00086) [2022-07-09 19:58:53,708][26022] Updated weights on worker 0-0, policy_version 398014 (0.00085) [2022-07-09 19:58:55,714][25689] Fps is (10 sec: 5696.0, 60 sec: 5663.2, 300 sec: 5660.3). Total num frames: 407575552. Throughput: 0: 5958.8. Samples: 407578476. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:58:55,715][25689] Avg episode reward: [(0, '-46.049')] [2022-07-09 19:58:56,035][26022] Updated weights on worker 0-0, policy_version 398024 (0.00088) [2022-07-09 19:58:57,376][26022] Updated weights on worker 0-0, policy_version 398034 (0.00085) [2022-07-09 19:58:59,270][26022] Updated weights on worker 0-0, policy_version 398044 (0.00093) [2022-07-09 19:59:00,751][25689] Fps is (10 sec: 5689.1, 60 sec: 5696.4, 300 sec: 5674.6). Total num frames: 407606272. Throughput: 0: 5949.3. Samples: 407612758. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:00,751][25689] Avg episode reward: [(0, '-45.980')] [2022-07-09 19:59:01,053][26022] Updated weights on worker 0-0, policy_version 398054 (0.00093) [2022-07-09 19:59:03,519][26022] Updated weights on worker 0-0, policy_version 398064 (0.00088) [2022-07-09 19:59:05,103][26022] Updated weights on worker 0-0, policy_version 398074 (0.00091) [2022-07-09 19:59:05,815][25689] Fps is (10 sec: 5575.7, 60 sec: 5686.1, 300 sec: 5667.3). Total num frames: 407631872. Throughput: 0: 4978.1. Samples: 407627518. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:05,816][25689] Avg episode reward: [(0, '-46.292')] [2022-07-09 19:59:06,845][26022] Updated weights on worker 0-0, policy_version 398084 (0.00088) [2022-07-09 19:59:08,734][26022] Updated weights on worker 0-0, policy_version 398094 (0.00090) [2022-07-09 19:59:10,340][26022] Updated weights on worker 0-0, policy_version 398104 (0.00081) [2022-07-09 19:59:10,817][25689] Fps is (10 sec: 5391.6, 60 sec: 5690.2, 300 sec: 5660.9). Total num frames: 407660544. Throughput: 0: 5851.5. Samples: 407662032. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:10,817][25689] Avg episode reward: [(0, '-46.418')] [2022-07-09 19:59:12,387][26022] Updated weights on worker 0-0, policy_version 398114 (0.00095) [2022-07-09 19:59:13,891][26022] Updated weights on worker 0-0, policy_version 398124 (0.00080) [2022-07-09 19:59:15,839][25689] Fps is (10 sec: 5618.9, 60 sec: 5641.1, 300 sec: 5664.1). Total num frames: 407688192. Throughput: 0: 5851.9. Samples: 407696242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:15,839][25689] Avg episode reward: [(0, '-46.380')] [2022-07-09 19:59:15,961][26022] Updated weights on worker 0-0, policy_version 398134 (0.00092) [2022-07-09 19:59:17,516][26022] Updated weights on worker 0-0, policy_version 398144 (0.00085) [2022-07-09 19:59:19,315][26022] Updated weights on worker 0-0, policy_version 398154 (0.00091) [2022-07-09 19:59:20,878][25689] Fps is (10 sec: 5699.2, 60 sec: 5659.6, 300 sec: 5665.4). Total num frames: 407717888. Throughput: 0: 4997.3. Samples: 407713342. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:20,879][25689] Avg episode reward: [(0, '-46.545')] [2022-07-09 19:59:21,380][26022] Updated weights on worker 0-0, policy_version 398164 (0.00089) [2022-07-09 19:59:22,859][26022] Updated weights on worker 0-0, policy_version 398174 (0.00096) [2022-07-09 19:59:25,072][26022] Updated weights on worker 0-0, policy_version 398184 (0.00081) [2022-07-09 19:59:26,017][25689] Fps is (10 sec: 5734.6, 60 sec: 5653.6, 300 sec: 5662.9). Total num frames: 407746560. Throughput: 0: 5939.4. Samples: 407747502. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:26,018][25689] Avg episode reward: [(0, '-47.053')] [2022-07-09 19:59:26,639][26022] Updated weights on worker 0-0, policy_version 398194 (0.00085) [2022-07-09 19:59:27,651][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 19:59:27,663][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000398199_407755776.pth [2022-07-09 19:59:27,664][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000396208_405716992.pth [2022-07-09 19:59:28,426][26022] Updated weights on worker 0-0, policy_version 398204 (0.00093) [2022-07-09 19:59:30,215][26022] Updated weights on worker 0-0, policy_version 398214 (0.00112) [2022-07-09 19:59:31,062][25689] Fps is (10 sec: 5530.6, 60 sec: 5651.3, 300 sec: 5662.6). Total num frames: 407774208. Throughput: 0: 5897.0. Samples: 407781418. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:31,062][25689] Avg episode reward: [(0, '-47.110')] [2022-07-09 19:59:32,118][26022] Updated weights on worker 0-0, policy_version 398224 (0.00089) [2022-07-09 19:59:33,775][26022] Updated weights on worker 0-0, policy_version 398234 (0.00085) [2022-07-09 19:59:35,663][26022] Updated weights on worker 0-0, policy_version 398244 (0.00088) [2022-07-09 19:59:36,064][25689] Fps is (10 sec: 5809.4, 60 sec: 5669.3, 300 sec: 5670.6). Total num frames: 407804928. Throughput: 0: 5062.7. Samples: 407798636. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:36,065][25689] Avg episode reward: [(0, '-47.116')] [2022-07-09 19:59:37,327][26022] Updated weights on worker 0-0, policy_version 398254 (0.00088) [2022-07-09 19:59:39,216][26022] Updated weights on worker 0-0, policy_version 398264 (0.00088) [2022-07-09 19:59:41,001][26022] Updated weights on worker 0-0, policy_version 398274 (0.00091) [2022-07-09 19:59:41,071][25689] Fps is (10 sec: 5831.5, 60 sec: 5670.9, 300 sec: 5664.5). Total num frames: 407832576. Throughput: 0: 5932.6. Samples: 407833136. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:41,071][25689] Avg episode reward: [(0, '-47.368')] [2022-07-09 19:59:42,820][26022] Updated weights on worker 0-0, policy_version 398284 (0.00089) [2022-07-09 19:59:44,286][26022] Updated weights on worker 0-0, policy_version 398294 (0.00091) [2022-07-09 19:59:46,154][25689] Fps is (10 sec: 5581.7, 60 sec: 5669.4, 300 sec: 5666.7). Total num frames: 407861248. Throughput: 0: 5966.0. Samples: 407867642. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 19:59:46,155][25689] Avg episode reward: [(0, '-46.990')] [2022-07-09 19:59:46,519][26022] Updated weights on worker 0-0, policy_version 398304 (0.00096) [2022-07-09 19:59:48,002][26022] Updated weights on worker 0-0, policy_version 398314 (0.00050) [2022-07-09 19:59:50,015][26022] Updated weights on worker 0-0, policy_version 398324 (0.00083) [2022-07-09 19:59:51,184][25689] Fps is (10 sec: 5771.4, 60 sec: 5651.9, 300 sec: 5669.7). Total num frames: 407890944. Throughput: 0: 5143.8. Samples: 407884924. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 19:59:51,185][25689] Avg episode reward: [(0, '-46.943')] [2022-07-09 19:59:51,594][26022] Updated weights on worker 0-0, policy_version 398334 (0.00092) [2022-07-09 19:59:53,391][26022] Updated weights on worker 0-0, policy_version 398344 (0.00081) [2022-07-09 19:59:55,231][26022] Updated weights on worker 0-0, policy_version 398354 (0.00085) [2022-07-09 19:59:56,203][25689] Fps is (10 sec: 5808.5, 60 sec: 5688.1, 300 sec: 5677.0). Total num frames: 407919616. Throughput: 0: 5988.0. Samples: 407919226. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 19:59:56,204][25689] Avg episode reward: [(0, '-47.440')] [2022-07-09 19:59:57,176][26022] Updated weights on worker 0-0, policy_version 398364 (0.00087) [2022-07-09 19:59:58,827][26022] Updated weights on worker 0-0, policy_version 398374 (0.00090) [2022-07-09 20:00:00,687][26022] Updated weights on worker 0-0, policy_version 398384 (0.00092) [2022-07-09 20:00:01,207][25689] Fps is (10 sec: 5619.0, 60 sec: 5640.3, 300 sec: 5678.0). Total num frames: 407947264. Throughput: 0: 5976.5. Samples: 407953482. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:01,209][25689] Avg episode reward: [(0, '-46.794')] [2022-07-09 20:00:02,750][26022] Updated weights on worker 0-0, policy_version 398394 (0.00089) [2022-07-09 20:00:04,779][26022] Updated weights on worker 0-0, policy_version 398404 (0.00086) [2022-07-09 20:00:06,254][25689] Fps is (10 sec: 5501.6, 60 sec: 5675.9, 300 sec: 5674.0). Total num frames: 407974912. Throughput: 0: 5008.8. Samples: 407968316. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:06,256][25689] Avg episode reward: [(0, '-46.396')] [2022-07-09 20:00:06,319][26022] Updated weights on worker 0-0, policy_version 398414 (0.00093) [2022-07-09 20:00:08,392][26022] Updated weights on worker 0-0, policy_version 398424 (0.00079) [2022-07-09 20:00:09,940][26022] Updated weights on worker 0-0, policy_version 398434 (0.00168) [2022-07-09 20:00:11,330][25689] Fps is (10 sec: 5462.4, 60 sec: 5651.9, 300 sec: 5670.9). Total num frames: 408002560. Throughput: 0: 5845.4. Samples: 408002686. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:11,332][25689] Avg episode reward: [(0, '-45.952')] [2022-07-09 20:00:12,025][26022] Updated weights on worker 0-0, policy_version 398444 (0.00094) [2022-07-09 20:00:13,601][26022] Updated weights on worker 0-0, policy_version 398454 (0.00091) [2022-07-09 20:00:15,310][26022] Updated weights on worker 0-0, policy_version 398464 (0.00088) [2022-07-09 20:00:16,362][25689] Fps is (10 sec: 5571.6, 60 sec: 5667.9, 300 sec: 5667.3). Total num frames: 408031232. Throughput: 0: 5837.6. Samples: 408036908. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:16,364][25689] Avg episode reward: [(0, '-45.639')] [2022-07-09 20:00:17,273][26022] Updated weights on worker 0-0, policy_version 398474 (0.00337) [2022-07-09 20:00:18,717][26022] Updated weights on worker 0-0, policy_version 398484 (0.00085) [2022-07-09 20:00:20,787][26022] Updated weights on worker 0-0, policy_version 398494 (0.00090) [2022-07-09 20:00:21,366][25689] Fps is (10 sec: 5918.1, 60 sec: 5688.2, 300 sec: 5675.2). Total num frames: 408061952. Throughput: 0: 4996.4. Samples: 408054202. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:21,366][25689] Avg episode reward: [(0, '-45.215')] [2022-07-09 20:00:22,668][26022] Updated weights on worker 0-0, policy_version 398504 (0.00081) [2022-07-09 20:00:24,284][26022] Updated weights on worker 0-0, policy_version 398514 (0.00082) [2022-07-09 20:00:26,342][26022] Updated weights on worker 0-0, policy_version 398524 (0.00086) [2022-07-09 20:00:26,461][25689] Fps is (10 sec: 5779.5, 60 sec: 5675.3, 300 sec: 5670.1). Total num frames: 408089600. Throughput: 0: 5952.5. Samples: 408088602. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:26,462][25689] Avg episode reward: [(0, '-45.086')] [2022-07-09 20:00:27,946][26022] Updated weights on worker 0-0, policy_version 398534 (0.00088) [2022-07-09 20:00:29,951][26022] Updated weights on worker 0-0, policy_version 398544 (0.00095) [2022-07-09 20:00:31,494][25689] Fps is (10 sec: 5560.6, 60 sec: 5693.3, 300 sec: 5669.6). Total num frames: 408118272. Throughput: 0: 5952.9. Samples: 408122720. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:31,495][25689] Avg episode reward: [(0, '-45.063')] [2022-07-09 20:00:31,660][26022] Updated weights on worker 0-0, policy_version 398554 (0.00092) [2022-07-09 20:00:33,439][26022] Updated weights on worker 0-0, policy_version 398564 (0.00078) [2022-07-09 20:00:35,225][26022] Updated weights on worker 0-0, policy_version 398574 (0.00092) [2022-07-09 20:00:36,514][25689] Fps is (10 sec: 5704.5, 60 sec: 5657.9, 300 sec: 5676.6). Total num frames: 408146944. Throughput: 0: 5093.9. Samples: 408139558. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:36,514][25689] Avg episode reward: [(0, '-45.214')] [2022-07-09 20:00:37,219][26022] Updated weights on worker 0-0, policy_version 398584 (0.00085) [2022-07-09 20:00:38,656][26022] Updated weights on worker 0-0, policy_version 398594 (0.00087) [2022-07-09 20:00:40,543][26022] Updated weights on worker 0-0, policy_version 398604 (0.00091) [2022-07-09 20:00:41,568][25689] Fps is (10 sec: 5692.4, 60 sec: 5670.3, 300 sec: 5667.0). Total num frames: 408175616. Throughput: 0: 5932.4. Samples: 408174050. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:41,569][25689] Avg episode reward: [(0, '-46.379')] [2022-07-09 20:00:42,217][26022] Updated weights on worker 0-0, policy_version 398614 (0.00084) [2022-07-09 20:00:44,172][26022] Updated weights on worker 0-0, policy_version 398624 (0.00084) [2022-07-09 20:00:46,012][26022] Updated weights on worker 0-0, policy_version 398634 (0.00086) [2022-07-09 20:00:46,690][25689] Fps is (10 sec: 5735.6, 60 sec: 5683.6, 300 sec: 5671.6). Total num frames: 408205312. Throughput: 0: 5933.5. Samples: 408208630. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:46,691][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 20:00:47,638][26022] Updated weights on worker 0-0, policy_version 398644 (0.00084) [2022-07-09 20:00:49,539][26022] Updated weights on worker 0-0, policy_version 398654 (0.00095) [2022-07-09 20:00:51,263][26022] Updated weights on worker 0-0, policy_version 398664 (0.00105) [2022-07-09 20:00:51,752][25689] Fps is (10 sec: 5731.3, 60 sec: 5663.7, 300 sec: 5668.0). Total num frames: 408233984. Throughput: 0: 5940.3. Samples: 408243058. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:51,753][25689] Avg episode reward: [(0, '-45.972')] [2022-07-09 20:00:53,030][26022] Updated weights on worker 0-0, policy_version 398674 (0.00085) [2022-07-09 20:00:54,831][26022] Updated weights on worker 0-0, policy_version 398684 (0.00086) [2022-07-09 20:00:56,504][26022] Updated weights on worker 0-0, policy_version 398694 (0.00089) [2022-07-09 20:00:56,767][25689] Fps is (10 sec: 5792.4, 60 sec: 5681.0, 300 sec: 5675.9). Total num frames: 408263680. Throughput: 0: 5961.9. Samples: 408260306. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:00:56,768][25689] Avg episode reward: [(0, '-46.149')] [2022-07-09 20:00:58,487][26022] Updated weights on worker 0-0, policy_version 398704 (0.00086) [2022-07-09 20:01:00,149][26022] Updated weights on worker 0-0, policy_version 398714 (0.00082) [2022-07-09 20:01:01,823][25689] Fps is (10 sec: 5694.3, 60 sec: 5676.2, 300 sec: 5676.2). Total num frames: 408291328. Throughput: 0: 5960.5. Samples: 408294778. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:01,823][25689] Avg episode reward: [(0, '-47.094')] [2022-07-09 20:01:02,573][26022] Updated weights on worker 0-0, policy_version 398724 (0.00093) [2022-07-09 20:01:03,982][26022] Updated weights on worker 0-0, policy_version 398734 (0.00104) [2022-07-09 20:01:06,167][26022] Updated weights on worker 0-0, policy_version 398744 (0.00086) [2022-07-09 20:01:06,936][25689] Fps is (10 sec: 5437.9, 60 sec: 5670.0, 300 sec: 5670.9). Total num frames: 408318976. Throughput: 0: 5850.7. Samples: 408327078. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:06,936][25689] Avg episode reward: [(0, '-46.852')] [2022-07-09 20:01:07,629][26022] Updated weights on worker 0-0, policy_version 398754 (0.00449) [2022-07-09 20:01:09,652][26022] Updated weights on worker 0-0, policy_version 398764 (0.00085) [2022-07-09 20:01:11,122][26022] Updated weights on worker 0-0, policy_version 398774 (0.00086) [2022-07-09 20:01:11,938][25689] Fps is (10 sec: 5466.6, 60 sec: 5676.9, 300 sec: 5671.2). Total num frames: 408346624. Throughput: 0: 5009.3. Samples: 408344174. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:11,939][25689] Avg episode reward: [(0, '-48.242')] [2022-07-09 20:01:13,092][26022] Updated weights on worker 0-0, policy_version 398784 (0.00455) [2022-07-09 20:01:14,770][26022] Updated weights on worker 0-0, policy_version 398794 (0.00081) [2022-07-09 20:01:16,704][26022] Updated weights on worker 0-0, policy_version 398804 (0.00084) [2022-07-09 20:01:16,950][25689] Fps is (10 sec: 5726.0, 60 sec: 5695.6, 300 sec: 5674.6). Total num frames: 408376320. Throughput: 0: 5882.4. Samples: 408379032. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:16,951][25689] Avg episode reward: [(0, '-47.566')] [2022-07-09 20:01:18,483][26022] Updated weights on worker 0-0, policy_version 398814 (0.00096) [2022-07-09 20:01:20,182][26022] Updated weights on worker 0-0, policy_version 398824 (0.00088) [2022-07-09 20:01:21,945][26022] Updated weights on worker 0-0, policy_version 398834 (0.00089) [2022-07-09 20:01:21,975][25689] Fps is (10 sec: 5917.4, 60 sec: 5676.8, 300 sec: 5676.1). Total num frames: 408406016. Throughput: 0: 5896.2. Samples: 408413598. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:21,975][25689] Avg episode reward: [(0, '-47.973')] [2022-07-09 20:01:23,734][26022] Updated weights on worker 0-0, policy_version 398844 (0.00085) [2022-07-09 20:01:25,572][26022] Updated weights on worker 0-0, policy_version 398854 (0.00084) [2022-07-09 20:01:27,040][25689] Fps is (10 sec: 5886.6, 60 sec: 5713.5, 300 sec: 5686.6). Total num frames: 408435712. Throughput: 0: 5162.7. Samples: 408430870. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:27,040][25689] Avg episode reward: [(0, '-47.746')] [2022-07-09 20:01:27,247][26022] Updated weights on worker 0-0, policy_version 398864 (0.00081) [2022-07-09 20:01:27,737][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:01:27,750][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000398866_408438784.pth [2022-07-09 20:01:27,751][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000396869_406393856.pth [2022-07-09 20:01:29,178][26022] Updated weights on worker 0-0, policy_version 398874 (0.00086) [2022-07-09 20:01:30,968][26022] Updated weights on worker 0-0, policy_version 398884 (0.00085) [2022-07-09 20:01:32,056][25689] Fps is (10 sec: 5688.1, 60 sec: 5698.1, 300 sec: 5676.5). Total num frames: 408463360. Throughput: 0: 6010.6. Samples: 408465096. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:32,057][25689] Avg episode reward: [(0, '-47.598')] [2022-07-09 20:01:32,610][26022] Updated weights on worker 0-0, policy_version 398894 (0.00089) [2022-07-09 20:01:34,371][26022] Updated weights on worker 0-0, policy_version 398904 (0.00424) [2022-07-09 20:01:36,279][26022] Updated weights on worker 0-0, policy_version 398914 (0.00092) [2022-07-09 20:01:37,069][25689] Fps is (10 sec: 5615.3, 60 sec: 5698.7, 300 sec: 5680.2). Total num frames: 408492032. Throughput: 0: 6003.6. Samples: 408499820. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:37,070][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 20:01:37,890][26022] Updated weights on worker 0-0, policy_version 398924 (0.00091) [2022-07-09 20:01:39,847][26022] Updated weights on worker 0-0, policy_version 398934 (0.00087) [2022-07-09 20:01:41,547][26022] Updated weights on worker 0-0, policy_version 398944 (0.00095) [2022-07-09 20:01:42,097][25689] Fps is (10 sec: 5710.7, 60 sec: 5701.2, 300 sec: 5677.9). Total num frames: 408520704. Throughput: 0: 5139.4. Samples: 408517018. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:42,098][25689] Avg episode reward: [(0, '-47.811')] [2022-07-09 20:01:43,478][26022] Updated weights on worker 0-0, policy_version 398954 (0.00086) [2022-07-09 20:01:45,254][26022] Updated weights on worker 0-0, policy_version 398964 (0.00097) [2022-07-09 20:01:46,879][26022] Updated weights on worker 0-0, policy_version 398974 (0.00080) [2022-07-09 20:01:47,183][25689] Fps is (10 sec: 5771.0, 60 sec: 5704.6, 300 sec: 5683.2). Total num frames: 408550400. Throughput: 0: 5972.2. Samples: 408551172. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:47,184][25689] Avg episode reward: [(0, '-47.213')] [2022-07-09 20:01:48,860][26022] Updated weights on worker 0-0, policy_version 398984 (0.00092) [2022-07-09 20:01:50,600][26022] Updated weights on worker 0-0, policy_version 398994 (0.00089) [2022-07-09 20:01:52,257][25689] Fps is (10 sec: 5745.3, 60 sec: 5703.5, 300 sec: 5685.5). Total num frames: 408579072. Throughput: 0: 5961.4. Samples: 408585520. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:52,257][25689] Avg episode reward: [(0, '-46.565')] [2022-07-09 20:01:52,468][26022] Updated weights on worker 0-0, policy_version 399004 (0.00089) [2022-07-09 20:01:54,143][26022] Updated weights on worker 0-0, policy_version 399014 (0.00082) [2022-07-09 20:01:55,926][26022] Updated weights on worker 0-0, policy_version 399024 (0.00089) [2022-07-09 20:01:57,290][25689] Fps is (10 sec: 5572.8, 60 sec: 5668.0, 300 sec: 5675.6). Total num frames: 408606720. Throughput: 0: 5084.9. Samples: 408602638. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:01:57,290][25689] Avg episode reward: [(0, '-46.955')] [2022-07-09 20:01:57,865][26022] Updated weights on worker 0-0, policy_version 399034 (0.00081) [2022-07-09 20:01:59,520][26022] Updated weights on worker 0-0, policy_version 399044 (0.00091) [2022-07-09 20:02:01,363][26022] Updated weights on worker 0-0, policy_version 399054 (0.00079) [2022-07-09 20:02:02,324][25689] Fps is (10 sec: 5492.7, 60 sec: 5670.0, 300 sec: 5683.5). Total num frames: 408634368. Throughput: 0: 5924.6. Samples: 408636852. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:02:02,325][25689] Avg episode reward: [(0, '-47.641')] [2022-07-09 20:02:03,403][26022] Updated weights on worker 0-0, policy_version 399064 (0.00098) [2022-07-09 20:02:05,428][26022] Updated weights on worker 0-0, policy_version 399074 (0.00108) [2022-07-09 20:02:07,253][26022] Updated weights on worker 0-0, policy_version 399084 (0.00081) [2022-07-09 20:02:07,431][25689] Fps is (10 sec: 5654.3, 60 sec: 5704.3, 300 sec: 5681.7). Total num frames: 408664064. Throughput: 0: 5804.2. Samples: 408668696. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:02:07,432][25689] Avg episode reward: [(0, '-46.622')] [2022-07-09 20:02:09,104][26022] Updated weights on worker 0-0, policy_version 399094 (0.00107) [2022-07-09 20:02:10,700][26022] Updated weights on worker 0-0, policy_version 399104 (0.00090) [2022-07-09 20:02:12,438][25689] Fps is (10 sec: 5770.8, 60 sec: 5720.8, 300 sec: 5681.7). Total num frames: 408692736. Throughput: 0: 4989.1. Samples: 408686208. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:02:12,440][25689] Avg episode reward: [(0, '-45.996')] [2022-07-09 20:02:12,441][26022] Updated weights on worker 0-0, policy_version 399114 (0.00096) [2022-07-09 20:02:14,233][26022] Updated weights on worker 0-0, policy_version 399124 (0.00087) [2022-07-09 20:02:16,192][26022] Updated weights on worker 0-0, policy_version 399134 (0.00085) [2022-07-09 20:02:17,489][25689] Fps is (10 sec: 5599.9, 60 sec: 5683.4, 300 sec: 5674.0). Total num frames: 408720384. Throughput: 0: 5836.4. Samples: 408720528. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:02:17,489][25689] Avg episode reward: [(0, '-46.472')] [2022-07-09 20:02:17,893][26022] Updated weights on worker 0-0, policy_version 399144 (0.00084) [2022-07-09 20:02:19,606][26022] Updated weights on worker 0-0, policy_version 399154 (0.00083) [2022-07-09 20:02:21,438][26022] Updated weights on worker 0-0, policy_version 399164 (0.00096) [2022-07-09 20:02:22,520][25689] Fps is (10 sec: 5688.2, 60 sec: 5682.8, 300 sec: 5679.2). Total num frames: 408750080. Throughput: 0: 5852.6. Samples: 408755048. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:02:22,520][25689] Avg episode reward: [(0, '-46.474')] [2022-07-09 20:02:23,222][26022] Updated weights on worker 0-0, policy_version 399174 (0.00084) [2022-07-09 20:02:25,107][26022] Updated weights on worker 0-0, policy_version 399184 (0.00087) [2022-07-09 20:02:26,838][26022] Updated weights on worker 0-0, policy_version 399194 (0.00082) [2022-07-09 20:02:27,621][25689] Fps is (10 sec: 5760.3, 60 sec: 5662.4, 300 sec: 5681.8). Total num frames: 408778752. Throughput: 0: 5138.6. Samples: 408772444. Policy #0 lag: (min: 0.0, avg: 7.4, max: 19.0) [2022-07-09 20:02:27,622][25689] Avg episode reward: [(0, '-46.237')] [2022-07-09 20:02:28,572][26022] Updated weights on worker 0-0, policy_version 399204 (0.00083) [2022-07-09 20:02:30,417][26022] Updated weights on worker 0-0, policy_version 399214 (0.01138) [2022-07-09 20:02:32,114][26022] Updated weights on worker 0-0, policy_version 399224 (0.00083) [2022-07-09 20:02:32,623][25689] Fps is (10 sec: 5675.5, 60 sec: 5680.7, 300 sec: 5678.4). Total num frames: 408807424. Throughput: 0: 5985.6. Samples: 408807026. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:02:32,625][25689] Avg episode reward: [(0, '-47.087')] [2022-07-09 20:02:33,808][26022] Updated weights on worker 0-0, policy_version 399234 (0.00088) [2022-07-09 20:02:35,772][26022] Updated weights on worker 0-0, policy_version 399244 (0.00089) [2022-07-09 20:02:37,439][26022] Updated weights on worker 0-0, policy_version 399254 (0.00082) [2022-07-09 20:02:37,639][25689] Fps is (10 sec: 5724.2, 60 sec: 5680.5, 300 sec: 5681.6). Total num frames: 408836096. Throughput: 0: 6004.6. Samples: 408841524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:02:37,640][25689] Avg episode reward: [(0, '-47.096')] [2022-07-09 20:02:39,403][26022] Updated weights on worker 0-0, policy_version 399264 (0.00085) [2022-07-09 20:02:41,066][26022] Updated weights on worker 0-0, policy_version 399274 (0.00087) [2022-07-09 20:02:42,699][25689] Fps is (10 sec: 5691.3, 60 sec: 5677.5, 300 sec: 5678.4). Total num frames: 408864768. Throughput: 0: 5145.2. Samples: 408858876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:02:42,699][25689] Avg episode reward: [(0, '-47.370')] [2022-07-09 20:02:42,936][26022] Updated weights on worker 0-0, policy_version 399284 (0.00091) [2022-07-09 20:02:44,460][26022] Updated weights on worker 0-0, policy_version 399294 (0.00093) [2022-07-09 20:02:46,530][26022] Updated weights on worker 0-0, policy_version 399304 (0.00089) [2022-07-09 20:02:47,765][25689] Fps is (10 sec: 5865.5, 60 sec: 5696.3, 300 sec: 5685.9). Total num frames: 408895488. Throughput: 0: 6010.5. Samples: 408893516. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:02:47,765][25689] Avg episode reward: [(0, '-47.305')] [2022-07-09 20:02:47,943][26022] Updated weights on worker 0-0, policy_version 399314 (0.00092) [2022-07-09 20:02:49,921][26022] Updated weights on worker 0-0, policy_version 399324 (0.00088) [2022-07-09 20:02:51,711][26022] Updated weights on worker 0-0, policy_version 399334 (0.00099) [2022-07-09 20:02:52,853][25689] Fps is (10 sec: 5748.1, 60 sec: 5678.0, 300 sec: 5678.7). Total num frames: 408923136. Throughput: 0: 5985.3. Samples: 408928108. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:02:52,854][25689] Avg episode reward: [(0, '-46.698')] [2022-07-09 20:02:53,539][26022] Updated weights on worker 0-0, policy_version 399344 (0.00084) [2022-07-09 20:02:55,258][26022] Updated weights on worker 0-0, policy_version 399354 (0.00090) [2022-07-09 20:02:57,187][26022] Updated weights on worker 0-0, policy_version 399364 (0.00078) [2022-07-09 20:02:57,857][25689] Fps is (10 sec: 5682.2, 60 sec: 5714.5, 300 sec: 5682.7). Total num frames: 408952832. Throughput: 0: 5972.9. Samples: 408962282. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:02:57,857][25689] Avg episode reward: [(0, '-47.307')] [2022-07-09 20:02:58,872][26022] Updated weights on worker 0-0, policy_version 399374 (0.00088) [2022-07-09 20:03:00,677][26022] Updated weights on worker 0-0, policy_version 399384 (0.00090) [2022-07-09 20:03:02,887][25689] Fps is (10 sec: 5612.9, 60 sec: 5698.0, 300 sec: 5684.7). Total num frames: 408979456. Throughput: 0: 5965.6. Samples: 408979310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:02,889][25689] Avg episode reward: [(0, '-46.661')] [2022-07-09 20:03:02,890][26022] Updated weights on worker 0-0, policy_version 399394 (0.00091) [2022-07-09 20:03:04,629][26022] Updated weights on worker 0-0, policy_version 399404 (0.00081) [2022-07-09 20:03:06,579][26022] Updated weights on worker 0-0, policy_version 399414 (0.00082) [2022-07-09 20:03:07,942][25689] Fps is (10 sec: 5584.3, 60 sec: 5702.9, 300 sec: 5687.9). Total num frames: 409009152. Throughput: 0: 5842.6. Samples: 409011404. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:07,943][25689] Avg episode reward: [(0, '-46.548')] [2022-07-09 20:03:08,205][26022] Updated weights on worker 0-0, policy_version 399424 (0.00086) [2022-07-09 20:03:10,118][26022] Updated weights on worker 0-0, policy_version 399434 (0.00087) [2022-07-09 20:03:11,859][26022] Updated weights on worker 0-0, policy_version 399444 (0.00082) [2022-07-09 20:03:12,949][25689] Fps is (10 sec: 5699.4, 60 sec: 5686.0, 300 sec: 5678.2). Total num frames: 409036800. Throughput: 0: 5872.6. Samples: 409046120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:12,949][25689] Avg episode reward: [(0, '-46.339')] [2022-07-09 20:03:13,447][26022] Updated weights on worker 0-0, policy_version 399454 (0.00094) [2022-07-09 20:03:15,459][26022] Updated weights on worker 0-0, policy_version 399464 (0.00080) [2022-07-09 20:03:16,938][26022] Updated weights on worker 0-0, policy_version 399474 (0.00081) [2022-07-09 20:03:17,979][25689] Fps is (10 sec: 5611.4, 60 sec: 5704.8, 300 sec: 5678.7). Total num frames: 409065472. Throughput: 0: 5034.2. Samples: 409063582. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:17,979][25689] Avg episode reward: [(0, '-46.871')] [2022-07-09 20:03:18,891][26022] Updated weights on worker 0-0, policy_version 399484 (0.00085) [2022-07-09 20:03:20,727][26022] Updated weights on worker 0-0, policy_version 399494 (0.00084) [2022-07-09 20:03:22,491][26022] Updated weights on worker 0-0, policy_version 399504 (0.00081) [2022-07-09 20:03:23,066][25689] Fps is (10 sec: 5667.8, 60 sec: 5682.6, 300 sec: 5678.5). Total num frames: 409094144. Throughput: 0: 5882.8. Samples: 409098020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:23,068][25689] Avg episode reward: [(0, '-46.763')] [2022-07-09 20:03:24,268][26022] Updated weights on worker 0-0, policy_version 399514 (0.00085) [2022-07-09 20:03:26,160][26022] Updated weights on worker 0-0, policy_version 399524 (0.00089) [2022-07-09 20:03:27,865][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:03:27,880][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000399534_409122816.pth [2022-07-09 20:03:27,880][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000397533_407073792.pth [2022-07-09 20:03:27,884][26022] Updated weights on worker 0-0, policy_version 399534 (0.00084) [2022-07-09 20:03:28,206][25689] Fps is (10 sec: 5707.2, 60 sec: 5696.0, 300 sec: 5683.1). Total num frames: 409123840. Throughput: 0: 5953.5. Samples: 409132044. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:28,207][25689] Avg episode reward: [(0, '-47.083')] [2022-07-09 20:03:29,882][26022] Updated weights on worker 0-0, policy_version 399544 (0.00094) [2022-07-09 20:03:31,544][26022] Updated weights on worker 0-0, policy_version 399554 (0.00089) [2022-07-09 20:03:33,277][25689] Fps is (10 sec: 5716.2, 60 sec: 5689.5, 300 sec: 5678.5). Total num frames: 409152512. Throughput: 0: 5062.1. Samples: 409149026. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:33,278][25689] Avg episode reward: [(0, '-46.611')] [2022-07-09 20:03:33,315][26022] Updated weights on worker 0-0, policy_version 399564 (0.00091) [2022-07-09 20:03:35,173][26022] Updated weights on worker 0-0, policy_version 399574 (0.00087) [2022-07-09 20:03:37,003][26022] Updated weights on worker 0-0, policy_version 399584 (0.00102) [2022-07-09 20:03:38,342][25689] Fps is (10 sec: 5657.3, 60 sec: 5684.8, 300 sec: 5681.2). Total num frames: 409181184. Throughput: 0: 5866.4. Samples: 409183042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:38,343][25689] Avg episode reward: [(0, '-47.204')] [2022-07-09 20:03:38,823][26022] Updated weights on worker 0-0, policy_version 399594 (0.00095) [2022-07-09 20:03:40,636][26022] Updated weights on worker 0-0, policy_version 399604 (0.00089) [2022-07-09 20:03:42,368][26022] Updated weights on worker 0-0, policy_version 399614 (0.00095) [2022-07-09 20:03:43,359][25689] Fps is (10 sec: 5586.5, 60 sec: 5672.0, 300 sec: 5678.7). Total num frames: 409208832. Throughput: 0: 5876.7. Samples: 409217272. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:43,360][25689] Avg episode reward: [(0, '-47.041')] [2022-07-09 20:03:44,249][26022] Updated weights on worker 0-0, policy_version 399624 (0.00077) [2022-07-09 20:03:45,895][26022] Updated weights on worker 0-0, policy_version 399634 (0.00112) [2022-07-09 20:03:47,718][26022] Updated weights on worker 0-0, policy_version 399644 (0.00094) [2022-07-09 20:03:48,459][25689] Fps is (10 sec: 5668.3, 60 sec: 5652.0, 300 sec: 5673.8). Total num frames: 409238528. Throughput: 0: 5048.0. Samples: 409234280. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:48,459][25689] Avg episode reward: [(0, '-47.167')] [2022-07-09 20:03:49,601][26022] Updated weights on worker 0-0, policy_version 399654 (0.00086) [2022-07-09 20:03:51,372][26022] Updated weights on worker 0-0, policy_version 399664 (0.00085) [2022-07-09 20:03:53,301][26022] Updated weights on worker 0-0, policy_version 399674 (0.00086) [2022-07-09 20:03:53,465][25689] Fps is (10 sec: 5775.4, 60 sec: 5676.5, 300 sec: 5681.4). Total num frames: 409267200. Throughput: 0: 5916.8. Samples: 409268472. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:53,465][25689] Avg episode reward: [(0, '-47.120')] [2022-07-09 20:03:54,794][26022] Updated weights on worker 0-0, policy_version 399684 (0.00087) [2022-07-09 20:03:56,865][26022] Updated weights on worker 0-0, policy_version 399694 (0.00088) [2022-07-09 20:03:58,483][25689] Fps is (10 sec: 5618.5, 60 sec: 5641.4, 300 sec: 5671.5). Total num frames: 409294848. Throughput: 0: 5942.2. Samples: 409302722. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:03:58,484][25689] Avg episode reward: [(0, '-47.356')] [2022-07-09 20:03:58,736][26022] Updated weights on worker 0-0, policy_version 399704 (0.00084) [2022-07-09 20:04:00,247][26022] Updated weights on worker 0-0, policy_version 399714 (0.00084) [2022-07-09 20:04:02,627][26022] Updated weights on worker 0-0, policy_version 399724 (0.00084) [2022-07-09 20:04:03,513][25689] Fps is (10 sec: 5503.1, 60 sec: 5658.3, 300 sec: 5679.0). Total num frames: 409322496. Throughput: 0: 5090.7. Samples: 409319872. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:03,515][25689] Avg episode reward: [(0, '-47.087')] [2022-07-09 20:04:04,172][26022] Updated weights on worker 0-0, policy_version 399734 (0.00085) [2022-07-09 20:04:06,198][26022] Updated weights on worker 0-0, policy_version 399744 (0.00085) [2022-07-09 20:04:07,867][26022] Updated weights on worker 0-0, policy_version 399754 (0.00097) [2022-07-09 20:04:08,566][25689] Fps is (10 sec: 5585.7, 60 sec: 5641.6, 300 sec: 5678.0). Total num frames: 409351168. Throughput: 0: 5863.8. Samples: 409352182. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:08,566][25689] Avg episode reward: [(0, '-46.903')] [2022-07-09 20:04:09,857][26022] Updated weights on worker 0-0, policy_version 399764 (0.00093) [2022-07-09 20:04:11,471][26022] Updated weights on worker 0-0, policy_version 399774 (0.00082) [2022-07-09 20:04:13,299][26022] Updated weights on worker 0-0, policy_version 399784 (0.00088) [2022-07-09 20:04:13,568][25689] Fps is (10 sec: 5601.3, 60 sec: 5642.0, 300 sec: 5678.4). Total num frames: 409378816. Throughput: 0: 5859.0. Samples: 409386256. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:13,568][25689] Avg episode reward: [(0, '-47.360')] [2022-07-09 20:04:14,864][26022] Updated weights on worker 0-0, policy_version 399794 (0.00081) [2022-07-09 20:04:16,910][26022] Updated weights on worker 0-0, policy_version 399804 (0.00094) [2022-07-09 20:04:18,578][25689] Fps is (10 sec: 5829.7, 60 sec: 5677.7, 300 sec: 5682.4). Total num frames: 409409536. Throughput: 0: 5025.5. Samples: 409403710. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:18,579][25689] Avg episode reward: [(0, '-47.877')] [2022-07-09 20:04:18,579][26022] Updated weights on worker 0-0, policy_version 399814 (0.00090) [2022-07-09 20:04:20,481][26022] Updated weights on worker 0-0, policy_version 399824 (0.01120) [2022-07-09 20:04:22,083][26022] Updated weights on worker 0-0, policy_version 399834 (0.00086) [2022-07-09 20:04:23,597][25689] Fps is (10 sec: 5819.9, 60 sec: 5667.2, 300 sec: 5681.2). Total num frames: 409437184. Throughput: 0: 5882.8. Samples: 409438022. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:23,597][25689] Avg episode reward: [(0, '-47.242')] [2022-07-09 20:04:24,079][26022] Updated weights on worker 0-0, policy_version 399844 (0.00087) [2022-07-09 20:04:25,974][26022] Updated weights on worker 0-0, policy_version 399854 (0.00094) [2022-07-09 20:04:27,617][26022] Updated weights on worker 0-0, policy_version 399864 (0.00084) [2022-07-09 20:04:28,659][25689] Fps is (10 sec: 5586.7, 60 sec: 5657.5, 300 sec: 5684.3). Total num frames: 409465856. Throughput: 0: 5967.0. Samples: 409472080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:28,660][25689] Avg episode reward: [(0, '-46.459')] [2022-07-09 20:04:29,516][26022] Updated weights on worker 0-0, policy_version 399874 (0.00092) [2022-07-09 20:04:31,149][26022] Updated weights on worker 0-0, policy_version 399884 (0.00085) [2022-07-09 20:04:33,086][26022] Updated weights on worker 0-0, policy_version 399894 (0.00086) [2022-07-09 20:04:33,673][25689] Fps is (10 sec: 5589.4, 60 sec: 5645.9, 300 sec: 5673.8). Total num frames: 409493504. Throughput: 0: 5113.5. Samples: 409489064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:33,674][25689] Avg episode reward: [(0, '-45.993')] [2022-07-09 20:04:34,781][26022] Updated weights on worker 0-0, policy_version 399904 (0.00088) [2022-07-09 20:04:36,719][26022] Updated weights on worker 0-0, policy_version 399914 (0.00083) [2022-07-09 20:04:38,546][26022] Updated weights on worker 0-0, policy_version 399924 (0.00082) [2022-07-09 20:04:38,686][25689] Fps is (10 sec: 5616.7, 60 sec: 5650.8, 300 sec: 5677.1). Total num frames: 409522176. Throughput: 0: 5959.5. Samples: 409523546. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:38,687][25689] Avg episode reward: [(0, '-46.585')] [2022-07-09 20:04:40,359][26022] Updated weights on worker 0-0, policy_version 399934 (0.00092) [2022-07-09 20:04:42,026][26022] Updated weights on worker 0-0, policy_version 399944 (0.00086) [2022-07-09 20:04:43,697][25689] Fps is (10 sec: 5822.8, 60 sec: 5685.3, 300 sec: 5682.0). Total num frames: 409551872. Throughput: 0: 5951.5. Samples: 409557648. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:43,697][25689] Avg episode reward: [(0, '-45.968')] [2022-07-09 20:04:43,962][26022] Updated weights on worker 0-0, policy_version 399954 (0.00082) [2022-07-09 20:04:45,600][26022] Updated weights on worker 0-0, policy_version 399964 (0.00085) [2022-07-09 20:04:47,555][26022] Updated weights on worker 0-0, policy_version 399974 (0.00088) [2022-07-09 20:04:48,785][25689] Fps is (10 sec: 5779.5, 60 sec: 5669.4, 300 sec: 5677.4). Total num frames: 409580544. Throughput: 0: 5100.1. Samples: 409574726. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:48,786][25689] Avg episode reward: [(0, '-46.012')] [2022-07-09 20:04:49,190][26022] Updated weights on worker 0-0, policy_version 399984 (0.00091) [2022-07-09 20:04:51,152][26022] Updated weights on worker 0-0, policy_version 399994 (0.00096) [2022-07-09 20:04:52,885][26022] Updated weights on worker 0-0, policy_version 400004 (0.00094) [2022-07-09 20:04:53,794][25689] Fps is (10 sec: 5679.3, 60 sec: 5669.2, 300 sec: 5677.6). Total num frames: 409609216. Throughput: 0: 5966.4. Samples: 409609114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:53,795][25689] Avg episode reward: [(0, '-46.332')] [2022-07-09 20:04:54,650][26022] Updated weights on worker 0-0, policy_version 400014 (0.00094) [2022-07-09 20:04:56,496][26022] Updated weights on worker 0-0, policy_version 400024 (0.00084) [2022-07-09 20:04:58,319][26022] Updated weights on worker 0-0, policy_version 400034 (0.00093) [2022-07-09 20:04:58,855][25689] Fps is (10 sec: 5694.9, 60 sec: 5682.1, 300 sec: 5680.0). Total num frames: 409637888. Throughput: 0: 5953.6. Samples: 409643620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:04:58,855][25689] Avg episode reward: [(0, '-47.286')] [2022-07-09 20:04:59,945][26022] Updated weights on worker 0-0, policy_version 400044 (0.00087) [2022-07-09 20:05:01,924][26022] Updated weights on worker 0-0, policy_version 400054 (0.00097) [2022-07-09 20:05:03,946][25689] Fps is (10 sec: 5447.1, 60 sec: 5659.5, 300 sec: 5675.7). Total num frames: 409664512. Throughput: 0: 5092.0. Samples: 409660754. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:05:03,946][25689] Avg episode reward: [(0, '-47.630')] [2022-07-09 20:05:04,131][26022] Updated weights on worker 0-0, policy_version 400064 (0.00089) [2022-07-09 20:05:05,632][26022] Updated weights on worker 0-0, policy_version 400074 (0.00091) [2022-07-09 20:05:07,802][26022] Updated weights on worker 0-0, policy_version 400084 (0.00088) [2022-07-09 20:05:09,012][25689] Fps is (10 sec: 5443.8, 60 sec: 5658.1, 300 sec: 5679.3). Total num frames: 409693184. Throughput: 0: 5832.3. Samples: 409692696. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-09 20:05:09,013][25689] Avg episode reward: [(0, '-49.027')] [2022-07-09 20:05:09,416][26022] Updated weights on worker 0-0, policy_version 400094 (0.00079) [2022-07-09 20:05:11,230][26022] Updated weights on worker 0-0, policy_version 400104 (0.00083) [2022-07-09 20:05:13,090][26022] Updated weights on worker 0-0, policy_version 400114 (0.00090) [2022-07-09 20:05:14,029][25689] Fps is (10 sec: 5687.2, 60 sec: 5673.8, 300 sec: 5679.6). Total num frames: 409721856. Throughput: 0: 5822.6. Samples: 409726932. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:14,029][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 20:05:14,639][26022] Updated weights on worker 0-0, policy_version 400124 (0.00090) [2022-07-09 20:05:16,632][26022] Updated weights on worker 0-0, policy_version 400134 (0.00173) [2022-07-09 20:05:18,316][26022] Updated weights on worker 0-0, policy_version 400144 (0.00088) [2022-07-09 20:05:19,036][25689] Fps is (10 sec: 5619.0, 60 sec: 5623.3, 300 sec: 5669.2). Total num frames: 409749504. Throughput: 0: 4987.3. Samples: 409744268. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:19,036][25689] Avg episode reward: [(0, '-47.455')] [2022-07-09 20:05:20,190][26022] Updated weights on worker 0-0, policy_version 400154 (0.00087) [2022-07-09 20:05:22,025][26022] Updated weights on worker 0-0, policy_version 400164 (0.00087) [2022-07-09 20:05:23,525][26022] Updated weights on worker 0-0, policy_version 400174 (0.00050) [2022-07-09 20:05:24,080][25689] Fps is (10 sec: 5704.9, 60 sec: 5654.7, 300 sec: 5677.1). Total num frames: 409779200. Throughput: 0: 5859.3. Samples: 409778730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:24,081][25689] Avg episode reward: [(0, '-47.565')] [2022-07-09 20:05:25,545][26022] Updated weights on worker 0-0, policy_version 400184 (0.00085) [2022-07-09 20:05:27,176][26022] Updated weights on worker 0-0, policy_version 400194 (0.00083) [2022-07-09 20:05:27,978][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:05:28,005][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000400197_409801728.pth [2022-07-09 20:05:28,006][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000398199_407755776.pth [2022-07-09 20:05:29,000][26022] Updated weights on worker 0-0, policy_version 400204 (0.00094) [2022-07-09 20:05:29,203][25689] Fps is (10 sec: 5942.2, 60 sec: 5682.9, 300 sec: 5682.2). Total num frames: 409809920. Throughput: 0: 5962.9. Samples: 409813088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:29,203][25689] Avg episode reward: [(0, '-47.731')] [2022-07-09 20:05:30,991][26022] Updated weights on worker 0-0, policy_version 400214 (0.00083) [2022-07-09 20:05:32,765][26022] Updated weights on worker 0-0, policy_version 400224 (0.00618) [2022-07-09 20:05:34,204][25689] Fps is (10 sec: 5664.2, 60 sec: 5667.2, 300 sec: 5675.7). Total num frames: 409836544. Throughput: 0: 5960.1. Samples: 409847182. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:34,205][25689] Avg episode reward: [(0, '-47.329')] [2022-07-09 20:05:34,613][26022] Updated weights on worker 0-0, policy_version 400234 (0.00079) [2022-07-09 20:05:36,154][26022] Updated weights on worker 0-0, policy_version 400244 (0.00081) [2022-07-09 20:05:38,266][26022] Updated weights on worker 0-0, policy_version 400254 (0.00084) [2022-07-09 20:05:39,219][25689] Fps is (10 sec: 5623.1, 60 sec: 5684.0, 300 sec: 5679.9). Total num frames: 409866240. Throughput: 0: 5961.5. Samples: 409864590. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:39,219][25689] Avg episode reward: [(0, '-46.782')] [2022-07-09 20:05:39,740][26022] Updated weights on worker 0-0, policy_version 400264 (0.00094) [2022-07-09 20:05:41,760][26022] Updated weights on worker 0-0, policy_version 400274 (0.00085) [2022-07-09 20:05:43,430][26022] Updated weights on worker 0-0, policy_version 400284 (0.00097) [2022-07-09 20:05:44,243][25689] Fps is (10 sec: 5814.6, 60 sec: 5665.8, 300 sec: 5678.3). Total num frames: 409894912. Throughput: 0: 5949.4. Samples: 409898684. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:44,243][25689] Avg episode reward: [(0, '-46.778')] [2022-07-09 20:05:45,306][26022] Updated weights on worker 0-0, policy_version 400294 (0.00082) [2022-07-09 20:05:47,045][26022] Updated weights on worker 0-0, policy_version 400304 (0.00087) [2022-07-09 20:05:48,902][26022] Updated weights on worker 0-0, policy_version 400314 (0.00084) [2022-07-09 20:05:49,349][25689] Fps is (10 sec: 5559.6, 60 sec: 5647.2, 300 sec: 5674.0). Total num frames: 409922560. Throughput: 0: 5950.4. Samples: 409932966. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:49,349][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 20:05:50,663][26022] Updated weights on worker 0-0, policy_version 400324 (0.00085) [2022-07-09 20:05:52,467][26022] Updated weights on worker 0-0, policy_version 400334 (0.00089) [2022-07-09 20:05:54,316][26022] Updated weights on worker 0-0, policy_version 400344 (0.00089) [2022-07-09 20:05:54,355][25689] Fps is (10 sec: 5670.7, 60 sec: 5664.4, 300 sec: 5674.2). Total num frames: 409952256. Throughput: 0: 5116.3. Samples: 409950278. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:54,355][25689] Avg episode reward: [(0, '-46.829')] [2022-07-09 20:05:56,032][26022] Updated weights on worker 0-0, policy_version 400354 (0.01063) [2022-07-09 20:05:57,756][26022] Updated weights on worker 0-0, policy_version 400364 (0.00080) [2022-07-09 20:05:59,379][25689] Fps is (10 sec: 5921.5, 60 sec: 5684.8, 300 sec: 5681.7). Total num frames: 409981952. Throughput: 0: 5962.9. Samples: 409984802. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:05:59,379][25689] Avg episode reward: [(0, '-47.132')] [2022-07-09 20:05:59,495][26022] Updated weights on worker 0-0, policy_version 400374 (0.00084) [2022-07-09 20:06:01,388][26022] Updated weights on worker 0-0, policy_version 400384 (0.00092) [2022-07-09 20:06:03,614][26022] Updated weights on worker 0-0, policy_version 400394 (0.00093) [2022-07-09 20:06:04,414][25689] Fps is (10 sec: 5599.1, 60 sec: 5690.0, 300 sec: 5679.7). Total num frames: 410008576. Throughput: 0: 5858.7. Samples: 410016860. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:04,415][25689] Avg episode reward: [(0, '-46.938')] [2022-07-09 20:06:05,291][26022] Updated weights on worker 0-0, policy_version 400404 (0.00480) [2022-07-09 20:06:07,119][26022] Updated weights on worker 0-0, policy_version 400414 (0.00084) [2022-07-09 20:06:09,075][26022] Updated weights on worker 0-0, policy_version 400424 (0.00085) [2022-07-09 20:06:09,495][25689] Fps is (10 sec: 5466.3, 60 sec: 5688.7, 300 sec: 5681.7). Total num frames: 410037248. Throughput: 0: 5016.1. Samples: 410034020. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:09,496][25689] Avg episode reward: [(0, '-47.846')] [2022-07-09 20:06:10,755][26022] Updated weights on worker 0-0, policy_version 400434 (0.00095) [2022-07-09 20:06:12,475][26022] Updated weights on worker 0-0, policy_version 400444 (0.00072) [2022-07-09 20:06:14,342][26022] Updated weights on worker 0-0, policy_version 400454 (0.00087) [2022-07-09 20:06:14,506][25689] Fps is (10 sec: 5682.1, 60 sec: 5689.2, 300 sec: 5678.3). Total num frames: 410065920. Throughput: 0: 5856.1. Samples: 410068284. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:14,506][25689] Avg episode reward: [(0, '-48.197')] [2022-07-09 20:06:16,216][26022] Updated weights on worker 0-0, policy_version 400464 (0.00091) [2022-07-09 20:06:18,082][26022] Updated weights on worker 0-0, policy_version 400474 (0.00081) [2022-07-09 20:06:19,578][25689] Fps is (10 sec: 5585.5, 60 sec: 5683.0, 300 sec: 5670.5). Total num frames: 410093568. Throughput: 0: 5823.9. Samples: 410102440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:19,580][25689] Avg episode reward: [(0, '-47.525')] [2022-07-09 20:06:19,750][26022] Updated weights on worker 0-0, policy_version 400484 (0.00084) [2022-07-09 20:06:21,453][26022] Updated weights on worker 0-0, policy_version 400494 (0.00086) [2022-07-09 20:06:23,247][26022] Updated weights on worker 0-0, policy_version 400504 (0.00083) [2022-07-09 20:06:24,582][25689] Fps is (10 sec: 5691.2, 60 sec: 5686.9, 300 sec: 5671.6). Total num frames: 410123264. Throughput: 0: 5097.3. Samples: 410119662. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:24,587][25689] Avg episode reward: [(0, '-47.815')] [2022-07-09 20:06:24,927][26022] Updated weights on worker 0-0, policy_version 400514 (0.00095) [2022-07-09 20:06:26,833][26022] Updated weights on worker 0-0, policy_version 400524 (0.00089) [2022-07-09 20:06:28,891][26022] Updated weights on worker 0-0, policy_version 400534 (0.00104) [2022-07-09 20:06:29,719][25689] Fps is (10 sec: 5654.5, 60 sec: 5634.7, 300 sec: 5669.3). Total num frames: 410150912. Throughput: 0: 5921.4. Samples: 410153778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:29,721][25689] Avg episode reward: [(0, '-47.341')] [2022-07-09 20:06:30,334][26022] Updated weights on worker 0-0, policy_version 400544 (0.00085) [2022-07-09 20:06:32,615][26022] Updated weights on worker 0-0, policy_version 400554 (0.00089) [2022-07-09 20:06:33,887][26022] Updated weights on worker 0-0, policy_version 400564 (0.00087) [2022-07-09 20:06:34,742][25689] Fps is (10 sec: 5643.7, 60 sec: 5683.4, 300 sec: 5672.6). Total num frames: 410180608. Throughput: 0: 5910.1. Samples: 410187884. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:34,743][25689] Avg episode reward: [(0, '-47.415')] [2022-07-09 20:06:36,062][26022] Updated weights on worker 0-0, policy_version 400574 (0.00092) [2022-07-09 20:06:37,626][26022] Updated weights on worker 0-0, policy_version 400584 (0.00086) [2022-07-09 20:06:39,660][26022] Updated weights on worker 0-0, policy_version 400594 (0.00081) [2022-07-09 20:06:39,773][25689] Fps is (10 sec: 5805.3, 60 sec: 5665.0, 300 sec: 5672.5). Total num frames: 410209280. Throughput: 0: 5082.6. Samples: 410205086. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:39,774][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 20:06:41,260][26022] Updated weights on worker 0-0, policy_version 400604 (0.00090) [2022-07-09 20:06:43,154][26022] Updated weights on worker 0-0, policy_version 400614 (0.00085) [2022-07-09 20:06:44,814][25689] Fps is (10 sec: 5795.0, 60 sec: 5680.3, 300 sec: 5673.4). Total num frames: 410238976. Throughput: 0: 5912.2. Samples: 410239282. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:44,815][25689] Avg episode reward: [(0, '-46.473')] [2022-07-09 20:06:44,815][26022] Updated weights on worker 0-0, policy_version 400624 (0.00082) [2022-07-09 20:06:46,800][26022] Updated weights on worker 0-0, policy_version 400634 (0.00084) [2022-07-09 20:06:48,238][26022] Updated weights on worker 0-0, policy_version 400644 (0.00086) [2022-07-09 20:06:49,875][25689] Fps is (10 sec: 5575.6, 60 sec: 5667.7, 300 sec: 5666.8). Total num frames: 410265600. Throughput: 0: 5953.3. Samples: 410273770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:49,875][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 20:06:50,243][26022] Updated weights on worker 0-0, policy_version 400654 (0.00089) [2022-07-09 20:06:52,188][26022] Updated weights on worker 0-0, policy_version 400664 (0.00086) [2022-07-09 20:06:53,823][26022] Updated weights on worker 0-0, policy_version 400674 (0.00086) [2022-07-09 20:06:54,965][25689] Fps is (10 sec: 5649.3, 60 sec: 5676.7, 300 sec: 5676.0). Total num frames: 410296320. Throughput: 0: 5086.1. Samples: 410290740. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:54,967][25689] Avg episode reward: [(0, '-46.474')] [2022-07-09 20:06:55,788][26022] Updated weights on worker 0-0, policy_version 400684 (0.00078) [2022-07-09 20:06:57,271][26022] Updated weights on worker 0-0, policy_version 400694 (0.00092) [2022-07-09 20:06:59,223][26022] Updated weights on worker 0-0, policy_version 400704 (0.00089) [2022-07-09 20:06:59,983][25689] Fps is (10 sec: 5875.4, 60 sec: 5660.3, 300 sec: 5679.7). Total num frames: 410324992. Throughput: 0: 5937.0. Samples: 410325072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:06:59,984][25689] Avg episode reward: [(0, '-46.594')] [2022-07-09 20:07:01,131][26022] Updated weights on worker 0-0, policy_version 400714 (0.00091) [2022-07-09 20:07:03,066][26022] Updated weights on worker 0-0, policy_version 400724 (0.00087) [2022-07-09 20:07:05,008][25689] Fps is (10 sec: 5301.9, 60 sec: 5627.4, 300 sec: 5664.1). Total num frames: 410349568. Throughput: 0: 5823.3. Samples: 410356876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:05,009][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 20:07:05,171][26022] Updated weights on worker 0-0, policy_version 400734 (0.00085) [2022-07-09 20:07:06,711][26022] Updated weights on worker 0-0, policy_version 400744 (0.00097) [2022-07-09 20:07:08,621][26022] Updated weights on worker 0-0, policy_version 400754 (0.00088) [2022-07-09 20:07:10,116][25689] Fps is (10 sec: 5457.0, 60 sec: 5658.7, 300 sec: 5669.1). Total num frames: 410380288. Throughput: 0: 4949.4. Samples: 410373954. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:10,117][25689] Avg episode reward: [(0, '-46.378')] [2022-07-09 20:07:10,636][26022] Updated weights on worker 0-0, policy_version 400764 (0.00093) [2022-07-09 20:07:12,184][26022] Updated weights on worker 0-0, policy_version 400774 (0.00088) [2022-07-09 20:07:14,222][26022] Updated weights on worker 0-0, policy_version 400784 (0.00082) [2022-07-09 20:07:15,150][25689] Fps is (10 sec: 5755.3, 60 sec: 5639.7, 300 sec: 5669.4). Total num frames: 410407936. Throughput: 0: 5814.7. Samples: 410408108. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:15,151][25689] Avg episode reward: [(0, '-45.868')] [2022-07-09 20:07:15,755][26022] Updated weights on worker 0-0, policy_version 400794 (0.00088) [2022-07-09 20:07:17,811][26022] Updated weights on worker 0-0, policy_version 400804 (0.00080) [2022-07-09 20:07:19,542][26022] Updated weights on worker 0-0, policy_version 400814 (0.00098) [2022-07-09 20:07:20,251][25689] Fps is (10 sec: 5658.2, 60 sec: 5670.8, 300 sec: 5668.0). Total num frames: 410437632. Throughput: 0: 5793.9. Samples: 410442502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:20,251][25689] Avg episode reward: [(0, '-45.587')] [2022-07-09 20:07:21,248][26022] Updated weights on worker 0-0, policy_version 400824 (0.00087) [2022-07-09 20:07:23,073][26022] Updated weights on worker 0-0, policy_version 400834 (0.00085) [2022-07-09 20:07:24,850][26022] Updated weights on worker 0-0, policy_version 400844 (0.00094) [2022-07-09 20:07:25,261][25689] Fps is (10 sec: 5772.3, 60 sec: 5653.3, 300 sec: 5669.8). Total num frames: 410466304. Throughput: 0: 5082.4. Samples: 410459812. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:25,263][25689] Avg episode reward: [(0, '-46.072')] [2022-07-09 20:07:26,553][26022] Updated weights on worker 0-0, policy_version 400854 (0.00086) [2022-07-09 20:07:28,438][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:07:28,467][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000400863_410483712.pth [2022-07-09 20:07:28,468][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000398866_408438784.pth [2022-07-09 20:07:28,703][26022] Updated weights on worker 0-0, policy_version 400864 (0.00093) [2022-07-09 20:07:30,297][26022] Updated weights on worker 0-0, policy_version 400874 (0.00088) [2022-07-09 20:07:30,326][25689] Fps is (10 sec: 5691.5, 60 sec: 5677.0, 300 sec: 5668.6). Total num frames: 410494976. Throughput: 0: 5938.1. Samples: 410493964. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:30,327][25689] Avg episode reward: [(0, '-45.322')] [2022-07-09 20:07:32,173][26022] Updated weights on worker 0-0, policy_version 400884 (0.00079) [2022-07-09 20:07:33,677][26022] Updated weights on worker 0-0, policy_version 400894 (0.00089) [2022-07-09 20:07:35,334][25689] Fps is (10 sec: 5591.3, 60 sec: 5644.6, 300 sec: 5665.3). Total num frames: 410522624. Throughput: 0: 5953.5. Samples: 410528276. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:35,338][25689] Avg episode reward: [(0, '-45.624')] [2022-07-09 20:07:35,815][26022] Updated weights on worker 0-0, policy_version 400904 (0.00088) [2022-07-09 20:07:37,312][26022] Updated weights on worker 0-0, policy_version 400914 (0.00091) [2022-07-09 20:07:39,320][26022] Updated weights on worker 0-0, policy_version 400924 (0.00088) [2022-07-09 20:07:40,353][25689] Fps is (10 sec: 5718.9, 60 sec: 5662.6, 300 sec: 5669.5). Total num frames: 410552320. Throughput: 0: 5118.0. Samples: 410545386. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:40,354][25689] Avg episode reward: [(0, '-46.322')] [2022-07-09 20:07:41,084][26022] Updated weights on worker 0-0, policy_version 400934 (0.00093) [2022-07-09 20:07:42,791][26022] Updated weights on worker 0-0, policy_version 400944 (0.00081) [2022-07-09 20:07:44,503][26022] Updated weights on worker 0-0, policy_version 400954 (0.00088) [2022-07-09 20:07:45,381][25689] Fps is (10 sec: 5707.4, 60 sec: 5630.0, 300 sec: 5659.9). Total num frames: 410579968. Throughput: 0: 5947.8. Samples: 410579482. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:07:45,383][25689] Avg episode reward: [(0, '-46.599')] [2022-07-09 20:07:46,322][26022] Updated weights on worker 0-0, policy_version 400964 (0.00079) [2022-07-09 20:07:48,280][26022] Updated weights on worker 0-0, policy_version 400974 (0.00087) [2022-07-09 20:07:49,980][26022] Updated weights on worker 0-0, policy_version 400984 (0.00088) [2022-07-09 20:07:50,437][25689] Fps is (10 sec: 5687.1, 60 sec: 5681.2, 300 sec: 5667.4). Total num frames: 410609664. Throughput: 0: 5971.1. Samples: 410614044. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:07:50,437][25689] Avg episode reward: [(0, '-46.521')] [2022-07-09 20:07:51,691][26022] Updated weights on worker 0-0, policy_version 400994 (0.00090) [2022-07-09 20:07:53,639][26022] Updated weights on worker 0-0, policy_version 401004 (0.00093) [2022-07-09 20:07:55,333][26022] Updated weights on worker 0-0, policy_version 401014 (0.00088) [2022-07-09 20:07:55,446][25689] Fps is (10 sec: 5799.2, 60 sec: 5654.9, 300 sec: 5663.9). Total num frames: 410638336. Throughput: 0: 5117.9. Samples: 410631208. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:07:55,447][25689] Avg episode reward: [(0, '-47.179')] [2022-07-09 20:07:57,037][26022] Updated weights on worker 0-0, policy_version 401024 (0.00090) [2022-07-09 20:07:58,934][26022] Updated weights on worker 0-0, policy_version 401034 (0.00081) [2022-07-09 20:08:00,469][25689] Fps is (10 sec: 5818.2, 60 sec: 5671.5, 300 sec: 5674.3). Total num frames: 410668032. Throughput: 0: 5989.5. Samples: 410665866. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:00,469][25689] Avg episode reward: [(0, '-47.125')] [2022-07-09 20:08:00,698][26022] Updated weights on worker 0-0, policy_version 401044 (0.00088) [2022-07-09 20:08:02,894][26022] Updated weights on worker 0-0, policy_version 401054 (0.00094) [2022-07-09 20:08:04,735][26022] Updated weights on worker 0-0, policy_version 401064 (0.00094) [2022-07-09 20:08:05,494][25689] Fps is (10 sec: 5401.4, 60 sec: 5671.4, 300 sec: 5657.7). Total num frames: 410692608. Throughput: 0: 5890.6. Samples: 410697958. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:05,495][25689] Avg episode reward: [(0, '-46.651')] [2022-07-09 20:08:06,248][26022] Updated weights on worker 0-0, policy_version 401074 (0.00080) [2022-07-09 20:08:08,376][26022] Updated weights on worker 0-0, policy_version 401084 (0.00094) [2022-07-09 20:08:10,002][26022] Updated weights on worker 0-0, policy_version 401094 (0.00097) [2022-07-09 20:08:10,634][25689] Fps is (10 sec: 5439.7, 60 sec: 5668.4, 300 sec: 5665.5). Total num frames: 410723328. Throughput: 0: 5858.7. Samples: 410732374. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:10,634][25689] Avg episode reward: [(0, '-46.349')] [2022-07-09 20:08:11,948][26022] Updated weights on worker 0-0, policy_version 401104 (0.00091) [2022-07-09 20:08:13,600][26022] Updated weights on worker 0-0, policy_version 401114 (0.00098) [2022-07-09 20:08:15,272][26022] Updated weights on worker 0-0, policy_version 401124 (0.00101) [2022-07-09 20:08:15,659][25689] Fps is (10 sec: 5842.8, 60 sec: 5686.1, 300 sec: 5665.6). Total num frames: 410752000. Throughput: 0: 5852.9. Samples: 410749512. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:15,660][25689] Avg episode reward: [(0, '-46.463')] [2022-07-09 20:08:17,215][26022] Updated weights on worker 0-0, policy_version 401134 (0.00084) [2022-07-09 20:08:19,248][26022] Updated weights on worker 0-0, policy_version 401144 (0.00090) [2022-07-09 20:08:20,725][25689] Fps is (10 sec: 5784.3, 60 sec: 5689.5, 300 sec: 5669.4). Total num frames: 410781696. Throughput: 0: 5823.5. Samples: 410783826. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:20,725][25689] Avg episode reward: [(0, '-45.605')] [2022-07-09 20:08:20,726][26022] Updated weights on worker 0-0, policy_version 401154 (0.00086) [2022-07-09 20:08:22,840][26022] Updated weights on worker 0-0, policy_version 401164 (0.00087) [2022-07-09 20:08:24,204][26022] Updated weights on worker 0-0, policy_version 401174 (0.00090) [2022-07-09 20:08:25,750][25689] Fps is (10 sec: 5479.7, 60 sec: 5637.3, 300 sec: 5657.8). Total num frames: 410807296. Throughput: 0: 5932.9. Samples: 410818136. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:25,751][25689] Avg episode reward: [(0, '-45.777')] [2022-07-09 20:08:26,513][26022] Updated weights on worker 0-0, policy_version 401184 (0.00090) [2022-07-09 20:08:27,823][26022] Updated weights on worker 0-0, policy_version 401194 (0.00086) [2022-07-09 20:08:30,031][26022] Updated weights on worker 0-0, policy_version 401204 (0.00094) [2022-07-09 20:08:30,839][25689] Fps is (10 sec: 5669.6, 60 sec: 5685.8, 300 sec: 5667.8). Total num frames: 410839040. Throughput: 0: 5082.5. Samples: 410835066. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:30,839][25689] Avg episode reward: [(0, '-45.832')] [2022-07-09 20:08:31,582][26022] Updated weights on worker 0-0, policy_version 401214 (0.00089) [2022-07-09 20:08:33,460][26022] Updated weights on worker 0-0, policy_version 401224 (0.00094) [2022-07-09 20:08:35,244][26022] Updated weights on worker 0-0, policy_version 401234 (0.00085) [2022-07-09 20:08:35,869][25689] Fps is (10 sec: 5970.5, 60 sec: 5700.6, 300 sec: 5668.5). Total num frames: 410867712. Throughput: 0: 5910.6. Samples: 410868966. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:35,870][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 20:08:37,324][26022] Updated weights on worker 0-0, policy_version 401244 (0.00085) [2022-07-09 20:08:38,662][26022] Updated weights on worker 0-0, policy_version 401254 (0.00093) [2022-07-09 20:08:40,694][26022] Updated weights on worker 0-0, policy_version 401264 (0.00090) [2022-07-09 20:08:40,883][25689] Fps is (10 sec: 5607.5, 60 sec: 5667.3, 300 sec: 5668.5). Total num frames: 410895360. Throughput: 0: 5943.9. Samples: 410903644. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:40,884][25689] Avg episode reward: [(0, '-46.069')] [2022-07-09 20:08:42,262][26022] Updated weights on worker 0-0, policy_version 401274 (0.00125) [2022-07-09 20:08:44,244][26022] Updated weights on worker 0-0, policy_version 401284 (0.00103) [2022-07-09 20:08:45,903][25689] Fps is (10 sec: 5613.1, 60 sec: 5685.0, 300 sec: 5666.6). Total num frames: 410924032. Throughput: 0: 5084.8. Samples: 410920608. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:45,905][25689] Avg episode reward: [(0, '-46.002')] [2022-07-09 20:08:46,015][26022] Updated weights on worker 0-0, policy_version 401294 (0.00089) [2022-07-09 20:08:47,754][26022] Updated weights on worker 0-0, policy_version 401304 (0.00094) [2022-07-09 20:08:49,484][26022] Updated weights on worker 0-0, policy_version 401314 (0.00090) [2022-07-09 20:08:51,002][25689] Fps is (10 sec: 5666.7, 60 sec: 5663.9, 300 sec: 5664.9). Total num frames: 410952704. Throughput: 0: 5943.3. Samples: 410954902. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:51,003][25689] Avg episode reward: [(0, '-45.427')] [2022-07-09 20:08:51,600][26022] Updated weights on worker 0-0, policy_version 401324 (0.00086) [2022-07-09 20:08:53,159][26022] Updated weights on worker 0-0, policy_version 401334 (0.00081) [2022-07-09 20:08:55,044][26022] Updated weights on worker 0-0, policy_version 401344 (0.00099) [2022-07-09 20:08:56,008][25689] Fps is (10 sec: 5776.1, 60 sec: 5681.2, 300 sec: 5672.0). Total num frames: 410982400. Throughput: 0: 5965.9. Samples: 410989112. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:08:56,009][25689] Avg episode reward: [(0, '-45.170')] [2022-07-09 20:08:56,734][26022] Updated weights on worker 0-0, policy_version 401354 (0.00089) [2022-07-09 20:08:58,665][26022] Updated weights on worker 0-0, policy_version 401364 (0.00087) [2022-07-09 20:09:00,515][26022] Updated weights on worker 0-0, policy_version 401374 (0.00086) [2022-07-09 20:09:01,032][25689] Fps is (10 sec: 5615.0, 60 sec: 5630.3, 300 sec: 5668.6). Total num frames: 411009024. Throughput: 0: 5085.4. Samples: 411006114. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:01,033][25689] Avg episode reward: [(0, '-44.485')] [2022-07-09 20:09:02,046][26022] Updated weights on worker 0-0, policy_version 401384 (0.00086) [2022-07-09 20:09:04,453][26022] Updated weights on worker 0-0, policy_version 401394 (0.00090) [2022-07-09 20:09:06,041][25689] Fps is (10 sec: 5409.2, 60 sec: 5682.6, 300 sec: 5666.0). Total num frames: 411036672. Throughput: 0: 5850.8. Samples: 411038434. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:06,042][25689] Avg episode reward: [(0, '-44.268')] [2022-07-09 20:09:06,230][26022] Updated weights on worker 0-0, policy_version 401404 (0.00096) [2022-07-09 20:09:07,966][26022] Updated weights on worker 0-0, policy_version 401414 (0.00081) [2022-07-09 20:09:09,746][26022] Updated weights on worker 0-0, policy_version 401424 (0.00096) [2022-07-09 20:09:11,106][25689] Fps is (10 sec: 5692.6, 60 sec: 5672.7, 300 sec: 5671.7). Total num frames: 411066368. Throughput: 0: 5870.7. Samples: 411072924. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:11,106][25689] Avg episode reward: [(0, '-43.438')] [2022-07-09 20:09:11,356][26022] Updated weights on worker 0-0, policy_version 401434 (0.00091) [2022-07-09 20:09:13,208][26022] Updated weights on worker 0-0, policy_version 401444 (0.00085) [2022-07-09 20:09:15,091][26022] Updated weights on worker 0-0, policy_version 401454 (0.00088) [2022-07-09 20:09:16,113][25689] Fps is (10 sec: 5693.5, 60 sec: 5657.5, 300 sec: 5661.4). Total num frames: 411094016. Throughput: 0: 5022.8. Samples: 411090096. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:16,114][25689] Avg episode reward: [(0, '-43.782')] [2022-07-09 20:09:16,722][26022] Updated weights on worker 0-0, policy_version 401464 (0.00082) [2022-07-09 20:09:18,615][26022] Updated weights on worker 0-0, policy_version 401474 (0.00068) [2022-07-09 20:09:20,163][26022] Updated weights on worker 0-0, policy_version 401484 (0.00084) [2022-07-09 20:09:21,139][25689] Fps is (10 sec: 5715.5, 60 sec: 5661.2, 300 sec: 5668.2). Total num frames: 411123712. Throughput: 0: 5919.1. Samples: 411125126. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:21,139][25689] Avg episode reward: [(0, '-44.135')] [2022-07-09 20:09:22,134][26022] Updated weights on worker 0-0, policy_version 401494 (0.00084) [2022-07-09 20:09:23,726][26022] Updated weights on worker 0-0, policy_version 401504 (0.00055) [2022-07-09 20:09:25,718][26022] Updated weights on worker 0-0, policy_version 401514 (0.00097) [2022-07-09 20:09:26,157][25689] Fps is (10 sec: 5709.2, 60 sec: 5695.8, 300 sec: 5665.6). Total num frames: 411151360. Throughput: 0: 6026.8. Samples: 411159670. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:26,158][25689] Avg episode reward: [(0, '-45.126')] [2022-07-09 20:09:27,657][26022] Updated weights on worker 0-0, policy_version 401524 (0.00097) [2022-07-09 20:09:28,480][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:09:28,492][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000401529_411165696.pth [2022-07-09 20:09:28,492][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000399534_409122816.pth [2022-07-09 20:09:29,204][26022] Updated weights on worker 0-0, policy_version 401534 (0.00088) [2022-07-09 20:09:31,129][26022] Updated weights on worker 0-0, policy_version 401544 (0.00091) [2022-07-09 20:09:31,264][25689] Fps is (10 sec: 5663.7, 60 sec: 5660.2, 300 sec: 5670.7). Total num frames: 411181056. Throughput: 0: 5151.1. Samples: 411176760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:31,264][25689] Avg episode reward: [(0, '-46.434')] [2022-07-09 20:09:32,823][26022] Updated weights on worker 0-0, policy_version 401554 (0.00094) [2022-07-09 20:09:34,849][26022] Updated weights on worker 0-0, policy_version 401564 (0.00082) [2022-07-09 20:09:36,273][25689] Fps is (10 sec: 5871.5, 60 sec: 5679.2, 300 sec: 5674.2). Total num frames: 411210752. Throughput: 0: 5999.8. Samples: 411211050. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:36,273][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 20:09:36,417][26022] Updated weights on worker 0-0, policy_version 401574 (0.00092) [2022-07-09 20:09:38,238][26022] Updated weights on worker 0-0, policy_version 401584 (0.00093) [2022-07-09 20:09:39,980][26022] Updated weights on worker 0-0, policy_version 401594 (0.00088) [2022-07-09 20:09:41,318][25689] Fps is (10 sec: 5805.5, 60 sec: 5693.1, 300 sec: 5670.1). Total num frames: 411239424. Throughput: 0: 5969.2. Samples: 411245578. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:41,318][25689] Avg episode reward: [(0, '-47.520')] [2022-07-09 20:09:41,700][26022] Updated weights on worker 0-0, policy_version 401604 (0.00071) [2022-07-09 20:09:43,706][26022] Updated weights on worker 0-0, policy_version 401614 (0.00078) [2022-07-09 20:09:45,348][26022] Updated weights on worker 0-0, policy_version 401624 (0.00088) [2022-07-09 20:09:46,386][25689] Fps is (10 sec: 5670.3, 60 sec: 5688.6, 300 sec: 5670.5). Total num frames: 411268096. Throughput: 0: 5085.2. Samples: 411262536. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:46,386][25689] Avg episode reward: [(0, '-47.978')] [2022-07-09 20:09:47,138][26022] Updated weights on worker 0-0, policy_version 401634 (0.00086) [2022-07-09 20:09:48,994][26022] Updated weights on worker 0-0, policy_version 401644 (0.00089) [2022-07-09 20:09:50,942][26022] Updated weights on worker 0-0, policy_version 401654 (0.00085) [2022-07-09 20:09:51,462][25689] Fps is (10 sec: 5753.9, 60 sec: 5707.7, 300 sec: 5672.7). Total num frames: 411297792. Throughput: 0: 5933.6. Samples: 411296608. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:51,462][25689] Avg episode reward: [(0, '-47.587')] [2022-07-09 20:09:52,706][26022] Updated weights on worker 0-0, policy_version 401664 (0.00089) [2022-07-09 20:09:54,512][26022] Updated weights on worker 0-0, policy_version 401674 (0.00095) [2022-07-09 20:09:56,163][26022] Updated weights on worker 0-0, policy_version 401684 (0.00940) [2022-07-09 20:09:56,508][25689] Fps is (10 sec: 5766.1, 60 sec: 5687.0, 300 sec: 5673.0). Total num frames: 411326464. Throughput: 0: 5922.4. Samples: 411330894. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:09:56,509][25689] Avg episode reward: [(0, '-47.118')] [2022-07-09 20:09:57,922][26022] Updated weights on worker 0-0, policy_version 401694 (0.00089) [2022-07-09 20:09:59,643][26022] Updated weights on worker 0-0, policy_version 401704 (0.00093) [2022-07-09 20:10:01,524][25689] Fps is (10 sec: 5699.1, 60 sec: 5721.7, 300 sec: 5681.3). Total num frames: 411355136. Throughput: 0: 5075.0. Samples: 411348124. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:10:01,524][25689] Avg episode reward: [(0, '-46.326')] [2022-07-09 20:10:01,535][26022] Updated weights on worker 0-0, policy_version 401714 (0.00085) [2022-07-09 20:10:03,761][26022] Updated weights on worker 0-0, policy_version 401724 (0.00090) [2022-07-09 20:10:05,495][26022] Updated weights on worker 0-0, policy_version 401734 (0.00089) [2022-07-09 20:10:06,543][25689] Fps is (10 sec: 5408.4, 60 sec: 5686.8, 300 sec: 5671.9). Total num frames: 411380736. Throughput: 0: 5855.2. Samples: 411380562. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:10:06,544][25689] Avg episode reward: [(0, '-46.353')] [2022-07-09 20:10:07,290][26022] Updated weights on worker 0-0, policy_version 401744 (0.00084) [2022-07-09 20:10:09,039][26022] Updated weights on worker 0-0, policy_version 401754 (0.00086) [2022-07-09 20:10:10,806][26022] Updated weights on worker 0-0, policy_version 401764 (0.00089) [2022-07-09 20:10:11,635][25689] Fps is (10 sec: 5367.5, 60 sec: 5667.4, 300 sec: 5670.4). Total num frames: 411409408. Throughput: 0: 5864.5. Samples: 411414912. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:10:11,635][25689] Avg episode reward: [(0, '-46.193')] [2022-07-09 20:10:12,777][26022] Updated weights on worker 0-0, policy_version 401774 (0.00083) [2022-07-09 20:10:14,404][26022] Updated weights on worker 0-0, policy_version 401784 (0.00085) [2022-07-09 20:10:16,299][26022] Updated weights on worker 0-0, policy_version 401794 (0.00092) [2022-07-09 20:10:16,669][25689] Fps is (10 sec: 5663.3, 60 sec: 5681.8, 300 sec: 5673.3). Total num frames: 411438080. Throughput: 0: 5024.5. Samples: 411432190. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:10:16,669][25689] Avg episode reward: [(0, '-45.072')] [2022-07-09 20:10:17,915][26022] Updated weights on worker 0-0, policy_version 401804 (0.00084) [2022-07-09 20:10:19,858][26022] Updated weights on worker 0-0, policy_version 401814 (0.00087) [2022-07-09 20:10:21,624][26022] Updated weights on worker 0-0, policy_version 401824 (0.00982) [2022-07-09 20:10:21,713][25689] Fps is (10 sec: 5791.5, 60 sec: 5680.0, 300 sec: 5673.4). Total num frames: 411467776. Throughput: 0: 5865.1. Samples: 411466536. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:10:21,719][25689] Avg episode reward: [(0, '-45.441')] [2022-07-09 20:10:23,627][26022] Updated weights on worker 0-0, policy_version 401834 (0.00086) [2022-07-09 20:10:25,016][26022] Updated weights on worker 0-0, policy_version 401844 (0.00108) [2022-07-09 20:10:26,774][25689] Fps is (10 sec: 5674.9, 60 sec: 5676.1, 300 sec: 5664.2). Total num frames: 411495424. Throughput: 0: 5946.6. Samples: 411500864. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2022-07-09 20:10:26,775][25689] Avg episode reward: [(0, '-44.032')] [2022-07-09 20:10:26,995][26022] Updated weights on worker 0-0, policy_version 401854 (0.00085) [2022-07-09 20:10:28,789][26022] Updated weights on worker 0-0, policy_version 401864 (0.00086) [2022-07-09 20:10:30,683][26022] Updated weights on worker 0-0, policy_version 401874 (0.00086) [2022-07-09 20:10:31,830][25689] Fps is (10 sec: 5566.9, 60 sec: 5663.9, 300 sec: 5670.0). Total num frames: 411524096. Throughput: 0: 5103.3. Samples: 411517974. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:10:31,831][25689] Avg episode reward: [(0, '-44.891')] [2022-07-09 20:10:32,327][26022] Updated weights on worker 0-0, policy_version 401884 (0.00090) [2022-07-09 20:10:34,202][26022] Updated weights on worker 0-0, policy_version 401894 (0.00090) [2022-07-09 20:10:35,919][26022] Updated weights on worker 0-0, policy_version 401904 (0.00087) [2022-07-09 20:10:36,842][25689] Fps is (10 sec: 5898.8, 60 sec: 5680.5, 300 sec: 5673.5). Total num frames: 411554816. Throughput: 0: 5967.6. Samples: 411552576. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:10:36,843][25689] Avg episode reward: [(0, '-44.468')] [2022-07-09 20:10:38,022][26022] Updated weights on worker 0-0, policy_version 401914 (0.00095) [2022-07-09 20:10:39,441][26022] Updated weights on worker 0-0, policy_version 401924 (0.00082) [2022-07-09 20:10:41,416][26022] Updated weights on worker 0-0, policy_version 401934 (0.00087) [2022-07-09 20:10:41,848][25689] Fps is (10 sec: 5928.4, 60 sec: 5684.1, 300 sec: 5673.9). Total num frames: 411583488. Throughput: 0: 5985.7. Samples: 411587060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:10:41,849][25689] Avg episode reward: [(0, '-43.812')] [2022-07-09 20:10:43,113][26022] Updated weights on worker 0-0, policy_version 401944 (0.00088) [2022-07-09 20:10:44,836][26022] Updated weights on worker 0-0, policy_version 401954 (0.00092) [2022-07-09 20:10:46,698][26022] Updated weights on worker 0-0, policy_version 401964 (0.00096) [2022-07-09 20:10:46,858][25689] Fps is (10 sec: 5623.1, 60 sec: 5672.7, 300 sec: 5675.7). Total num frames: 411611136. Throughput: 0: 6005.3. Samples: 411621476. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:10:46,859][25689] Avg episode reward: [(0, '-44.526')] [2022-07-09 20:10:48,532][26022] Updated weights on worker 0-0, policy_version 401974 (0.00091) [2022-07-09 20:10:50,073][26022] Updated weights on worker 0-0, policy_version 401984 (0.00085) [2022-07-09 20:10:51,906][25689] Fps is (10 sec: 5599.7, 60 sec: 5658.4, 300 sec: 5671.5). Total num frames: 411639808. Throughput: 0: 6015.1. Samples: 411638732. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:10:51,906][25689] Avg episode reward: [(0, '-45.135')] [2022-07-09 20:10:52,175][26022] Updated weights on worker 0-0, policy_version 401994 (0.00085) [2022-07-09 20:10:53,724][26022] Updated weights on worker 0-0, policy_version 402004 (0.00088) [2022-07-09 20:10:55,735][26022] Updated weights on worker 0-0, policy_version 402014 (0.00086) [2022-07-09 20:10:56,917][25689] Fps is (10 sec: 5802.6, 60 sec: 5678.7, 300 sec: 5671.7). Total num frames: 411669504. Throughput: 0: 5998.4. Samples: 411672992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:10:56,917][25689] Avg episode reward: [(0, '-46.880')] [2022-07-09 20:10:57,347][26022] Updated weights on worker 0-0, policy_version 402024 (0.00081) [2022-07-09 20:10:58,951][26022] Updated weights on worker 0-0, policy_version 402034 (0.00080) [2022-07-09 20:11:00,904][26022] Updated weights on worker 0-0, policy_version 402044 (0.00081) [2022-07-09 20:11:01,938][25689] Fps is (10 sec: 5715.9, 60 sec: 5661.2, 300 sec: 5675.4). Total num frames: 411697152. Throughput: 0: 5947.1. Samples: 411706538. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:01,939][25689] Avg episode reward: [(0, '-46.584')] [2022-07-09 20:11:02,967][26022] Updated weights on worker 0-0, policy_version 402054 (0.00085) [2022-07-09 20:11:04,895][26022] Updated weights on worker 0-0, policy_version 402064 (0.00088) [2022-07-09 20:11:06,792][26022] Updated weights on worker 0-0, policy_version 402074 (0.00092) [2022-07-09 20:11:06,953][25689] Fps is (10 sec: 5509.8, 60 sec: 5695.5, 300 sec: 5673.2). Total num frames: 411724800. Throughput: 0: 5031.4. Samples: 411722582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:06,953][25689] Avg episode reward: [(0, '-46.571')] [2022-07-09 20:11:08,486][26022] Updated weights on worker 0-0, policy_version 402084 (0.00087) [2022-07-09 20:11:10,429][26022] Updated weights on worker 0-0, policy_version 402094 (0.00083) [2022-07-09 20:11:12,003][25689] Fps is (10 sec: 5595.8, 60 sec: 5699.4, 300 sec: 5672.5). Total num frames: 411753472. Throughput: 0: 5871.3. Samples: 411756728. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:12,005][25689] Avg episode reward: [(0, '-47.751')] [2022-07-09 20:11:12,068][26022] Updated weights on worker 0-0, policy_version 402104 (0.00086) [2022-07-09 20:11:13,929][26022] Updated weights on worker 0-0, policy_version 402114 (0.00082) [2022-07-09 20:11:15,683][26022] Updated weights on worker 0-0, policy_version 402124 (0.00085) [2022-07-09 20:11:17,007][25689] Fps is (10 sec: 5703.7, 60 sec: 5702.3, 300 sec: 5677.2). Total num frames: 411782144. Throughput: 0: 5895.0. Samples: 411791420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:17,008][25689] Avg episode reward: [(0, '-47.622')] [2022-07-09 20:11:17,565][26022] Updated weights on worker 0-0, policy_version 402134 (0.00080) [2022-07-09 20:11:19,219][26022] Updated weights on worker 0-0, policy_version 402144 (0.00086) [2022-07-09 20:11:21,181][26022] Updated weights on worker 0-0, policy_version 402154 (0.00083) [2022-07-09 20:11:22,036][25689] Fps is (10 sec: 5715.6, 60 sec: 5686.7, 300 sec: 5673.3). Total num frames: 411810816. Throughput: 0: 5067.6. Samples: 411808384. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:22,036][25689] Avg episode reward: [(0, '-47.415')] [2022-07-09 20:11:22,608][26022] Updated weights on worker 0-0, policy_version 402164 (0.00087) [2022-07-09 20:11:24,599][26022] Updated weights on worker 0-0, policy_version 402174 (0.00094) [2022-07-09 20:11:26,583][26022] Updated weights on worker 0-0, policy_version 402184 (0.00090) [2022-07-09 20:11:27,052][25689] Fps is (10 sec: 5708.6, 60 sec: 5707.9, 300 sec: 5679.1). Total num frames: 411839488. Throughput: 0: 5994.4. Samples: 411843064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:27,052][25689] Avg episode reward: [(0, '-46.400')] [2022-07-09 20:11:28,177][26022] Updated weights on worker 0-0, policy_version 402194 (0.00092) [2022-07-09 20:11:28,601][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:11:28,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000402196_411848704.pth [2022-07-09 20:11:28,619][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000400197_409801728.pth [2022-07-09 20:11:29,998][26022] Updated weights on worker 0-0, policy_version 402204 (0.00094) [2022-07-09 20:11:32,019][26022] Updated weights on worker 0-0, policy_version 402214 (0.00087) [2022-07-09 20:11:32,128][25689] Fps is (10 sec: 5580.8, 60 sec: 5689.1, 300 sec: 5671.2). Total num frames: 411867136. Throughput: 0: 5978.6. Samples: 411877046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:32,128][25689] Avg episode reward: [(0, '-46.167')] [2022-07-09 20:11:33,504][26022] Updated weights on worker 0-0, policy_version 402224 (0.00092) [2022-07-09 20:11:35,686][26022] Updated weights on worker 0-0, policy_version 402234 (0.00088) [2022-07-09 20:11:37,011][26022] Updated weights on worker 0-0, policy_version 402244 (0.00086) [2022-07-09 20:11:37,131][25689] Fps is (10 sec: 5791.0, 60 sec: 5689.9, 300 sec: 5678.6). Total num frames: 411897856. Throughput: 0: 5109.3. Samples: 411894242. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:37,131][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 20:11:39,149][26022] Updated weights on worker 0-0, policy_version 402254 (0.00089) [2022-07-09 20:11:40,769][26022] Updated weights on worker 0-0, policy_version 402264 (0.00085) [2022-07-09 20:11:42,155][25689] Fps is (10 sec: 5820.8, 60 sec: 5671.2, 300 sec: 5672.0). Total num frames: 411925504. Throughput: 0: 5981.0. Samples: 411928718. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:42,156][25689] Avg episode reward: [(0, '-45.651')] [2022-07-09 20:11:42,540][26022] Updated weights on worker 0-0, policy_version 402274 (0.00084) [2022-07-09 20:11:44,359][26022] Updated weights on worker 0-0, policy_version 402284 (0.00086) [2022-07-09 20:11:46,082][26022] Updated weights on worker 0-0, policy_version 402294 (0.00085) [2022-07-09 20:11:47,171][25689] Fps is (10 sec: 5711.4, 60 sec: 5704.6, 300 sec: 5683.2). Total num frames: 411955200. Throughput: 0: 5985.0. Samples: 411963478. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:47,172][25689] Avg episode reward: [(0, '-45.578')] [2022-07-09 20:11:48,080][26022] Updated weights on worker 0-0, policy_version 402304 (0.00086) [2022-07-09 20:11:49,639][26022] Updated weights on worker 0-0, policy_version 402314 (0.00093) [2022-07-09 20:11:51,582][26022] Updated weights on worker 0-0, policy_version 402324 (0.00082) [2022-07-09 20:11:52,268][25689] Fps is (10 sec: 5771.9, 60 sec: 5700.0, 300 sec: 5676.2). Total num frames: 411983872. Throughput: 0: 5134.5. Samples: 411980454. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:52,268][25689] Avg episode reward: [(0, '-45.738')] [2022-07-09 20:11:53,106][26022] Updated weights on worker 0-0, policy_version 402334 (0.00091) [2022-07-09 20:11:55,319][26022] Updated weights on worker 0-0, policy_version 402344 (0.00093) [2022-07-09 20:11:56,660][26022] Updated weights on worker 0-0, policy_version 402354 (0.00090) [2022-07-09 20:11:57,324][25689] Fps is (10 sec: 5648.2, 60 sec: 5678.8, 300 sec: 5675.5). Total num frames: 412012544. Throughput: 0: 5979.3. Samples: 412014980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:11:57,324][25689] Avg episode reward: [(0, '-45.161')] [2022-07-09 20:11:58,771][26022] Updated weights on worker 0-0, policy_version 402364 (0.00086) [2022-07-09 20:12:00,339][26022] Updated weights on worker 0-0, policy_version 402374 (0.00086) [2022-07-09 20:12:02,340][25689] Fps is (10 sec: 5490.1, 60 sec: 5662.4, 300 sec: 5682.5). Total num frames: 412039168. Throughput: 0: 5874.9. Samples: 412047300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:02,340][25689] Avg episode reward: [(0, '-46.474')] [2022-07-09 20:12:02,611][26022] Updated weights on worker 0-0, policy_version 402384 (0.00085) [2022-07-09 20:12:04,352][26022] Updated weights on worker 0-0, policy_version 402394 (0.00085) [2022-07-09 20:12:06,118][26022] Updated weights on worker 0-0, policy_version 402404 (0.00092) [2022-07-09 20:12:07,352][25689] Fps is (10 sec: 5514.2, 60 sec: 5679.6, 300 sec: 5677.5). Total num frames: 412067840. Throughput: 0: 4996.3. Samples: 412064306. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:07,353][25689] Avg episode reward: [(0, '-46.995')] [2022-07-09 20:12:08,003][26022] Updated weights on worker 0-0, policy_version 402414 (0.00080) [2022-07-09 20:12:09,726][26022] Updated weights on worker 0-0, policy_version 402424 (0.00085) [2022-07-09 20:12:11,545][26022] Updated weights on worker 0-0, policy_version 402434 (0.00085) [2022-07-09 20:12:12,441][25689] Fps is (10 sec: 5778.1, 60 sec: 5692.8, 300 sec: 5683.3). Total num frames: 412097536. Throughput: 0: 5857.2. Samples: 412098614. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:12,442][25689] Avg episode reward: [(0, '-46.452')] [2022-07-09 20:12:13,325][26022] Updated weights on worker 0-0, policy_version 402444 (0.00087) [2022-07-09 20:12:15,055][26022] Updated weights on worker 0-0, policy_version 402454 (0.00091) [2022-07-09 20:12:17,087][26022] Updated weights on worker 0-0, policy_version 402464 (0.00088) [2022-07-09 20:12:17,471][25689] Fps is (10 sec: 5667.0, 60 sec: 5673.5, 300 sec: 5677.8). Total num frames: 412125184. Throughput: 0: 5852.7. Samples: 412132894. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:17,472][25689] Avg episode reward: [(0, '-46.485')] [2022-07-09 20:12:18,843][26022] Updated weights on worker 0-0, policy_version 402474 (0.00083) [2022-07-09 20:12:20,627][26022] Updated weights on worker 0-0, policy_version 402484 (0.00088) [2022-07-09 20:12:22,424][26022] Updated weights on worker 0-0, policy_version 402494 (0.00089) [2022-07-09 20:12:22,512][25689] Fps is (10 sec: 5694.0, 60 sec: 5689.2, 300 sec: 5680.6). Total num frames: 412154880. Throughput: 0: 5072.3. Samples: 412149620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:22,513][25689] Avg episode reward: [(0, '-46.818')] [2022-07-09 20:12:24,096][26022] Updated weights on worker 0-0, policy_version 402504 (0.00086) [2022-07-09 20:12:25,961][26022] Updated weights on worker 0-0, policy_version 402514 (0.00091) [2022-07-09 20:12:27,517][25689] Fps is (10 sec: 5707.7, 60 sec: 5673.3, 300 sec: 5678.3). Total num frames: 412182528. Throughput: 0: 5938.5. Samples: 412184060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:27,519][25689] Avg episode reward: [(0, '-47.318')] [2022-07-09 20:12:27,883][26022] Updated weights on worker 0-0, policy_version 402524 (0.00088) [2022-07-09 20:12:29,499][26022] Updated weights on worker 0-0, policy_version 402534 (0.00093) [2022-07-09 20:12:31,512][26022] Updated weights on worker 0-0, policy_version 402544 (0.00091) [2022-07-09 20:12:32,591][25689] Fps is (10 sec: 5588.1, 60 sec: 5690.5, 300 sec: 5680.5). Total num frames: 412211200. Throughput: 0: 5937.5. Samples: 412218252. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:32,596][25689] Avg episode reward: [(0, '-45.977')] [2022-07-09 20:12:33,070][26022] Updated weights on worker 0-0, policy_version 402554 (0.00098) [2022-07-09 20:12:35,231][26022] Updated weights on worker 0-0, policy_version 402564 (0.00086) [2022-07-09 20:12:36,651][26022] Updated weights on worker 0-0, policy_version 402574 (0.00085) [2022-07-09 20:12:37,607][25689] Fps is (10 sec: 5683.7, 60 sec: 5655.4, 300 sec: 5677.1). Total num frames: 412239872. Throughput: 0: 5093.1. Samples: 412235448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:37,609][25689] Avg episode reward: [(0, '-45.520')] [2022-07-09 20:12:38,770][26022] Updated weights on worker 0-0, policy_version 402584 (0.00082) [2022-07-09 20:12:40,239][26022] Updated weights on worker 0-0, policy_version 402594 (0.00095) [2022-07-09 20:12:42,182][26022] Updated weights on worker 0-0, policy_version 402604 (0.00091) [2022-07-09 20:12:42,646][25689] Fps is (10 sec: 5702.9, 60 sec: 5670.9, 300 sec: 5680.4). Total num frames: 412268544. Throughput: 0: 5961.4. Samples: 412269646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:42,648][25689] Avg episode reward: [(0, '-46.104')] [2022-07-09 20:12:43,972][26022] Updated weights on worker 0-0, policy_version 402614 (0.00082) [2022-07-09 20:12:45,868][26022] Updated weights on worker 0-0, policy_version 402624 (0.00097) [2022-07-09 20:12:47,534][26022] Updated weights on worker 0-0, policy_version 402634 (0.00088) [2022-07-09 20:12:47,671][25689] Fps is (10 sec: 5799.4, 60 sec: 5670.1, 300 sec: 5681.0). Total num frames: 412298240. Throughput: 0: 5962.0. Samples: 412304216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:47,672][25689] Avg episode reward: [(0, '-46.312')] [2022-07-09 20:12:49,177][26022] Updated weights on worker 0-0, policy_version 402644 (0.00087) [2022-07-09 20:12:51,093][26022] Updated weights on worker 0-0, policy_version 402654 (0.00562) [2022-07-09 20:12:52,716][25689] Fps is (10 sec: 5694.6, 60 sec: 5658.0, 300 sec: 5676.8). Total num frames: 412325888. Throughput: 0: 5113.9. Samples: 412321170. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:52,716][25689] Avg episode reward: [(0, '-47.054')] [2022-07-09 20:12:53,054][26022] Updated weights on worker 0-0, policy_version 402664 (0.00090) [2022-07-09 20:12:54,665][26022] Updated weights on worker 0-0, policy_version 402674 (0.00094) [2022-07-09 20:12:56,471][26022] Updated weights on worker 0-0, policy_version 402684 (0.00084) [2022-07-09 20:12:57,722][25689] Fps is (10 sec: 5705.3, 60 sec: 5679.6, 300 sec: 5677.1). Total num frames: 412355584. Throughput: 0: 5981.0. Samples: 412355760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:12:57,723][25689] Avg episode reward: [(0, '-47.462')] [2022-07-09 20:12:58,264][26022] Updated weights on worker 0-0, policy_version 402694 (0.00089) [2022-07-09 20:12:59,950][26022] Updated weights on worker 0-0, policy_version 402704 (0.00089) [2022-07-09 20:13:02,328][26022] Updated weights on worker 0-0, policy_version 402714 (0.00090) [2022-07-09 20:13:02,748][25689] Fps is (10 sec: 5613.7, 60 sec: 5678.7, 300 sec: 5684.0). Total num frames: 412382208. Throughput: 0: 5876.0. Samples: 412387768. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:13:02,749][25689] Avg episode reward: [(0, '-48.074')] [2022-07-09 20:13:04,126][26022] Updated weights on worker 0-0, policy_version 402724 (0.00081) [2022-07-09 20:13:05,748][26022] Updated weights on worker 0-0, policy_version 402734 (0.00086) [2022-07-09 20:13:07,760][25689] Fps is (10 sec: 5304.9, 60 sec: 5644.8, 300 sec: 5672.7). Total num frames: 412408832. Throughput: 0: 5016.2. Samples: 412404984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 20:13:07,760][25689] Avg episode reward: [(0, '-47.290')] [2022-07-09 20:13:07,903][26022] Updated weights on worker 0-0, policy_version 402744 (0.00089) [2022-07-09 20:13:09,313][26022] Updated weights on worker 0-0, policy_version 402754 (0.00096) [2022-07-09 20:13:11,487][26022] Updated weights on worker 0-0, policy_version 402764 (0.00091) [2022-07-09 20:13:12,826][25689] Fps is (10 sec: 5690.1, 60 sec: 5663.9, 300 sec: 5678.8). Total num frames: 412439552. Throughput: 0: 5860.1. Samples: 412439020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:12,827][25689] Avg episode reward: [(0, '-48.716')] [2022-07-09 20:13:12,931][26022] Updated weights on worker 0-0, policy_version 402774 (0.00083) [2022-07-09 20:13:14,935][26022] Updated weights on worker 0-0, policy_version 402784 (0.00094) [2022-07-09 20:13:16,598][26022] Updated weights on worker 0-0, policy_version 402794 (0.00086) [2022-07-09 20:13:17,872][25689] Fps is (10 sec: 5873.5, 60 sec: 5679.4, 300 sec: 5675.7). Total num frames: 412468224. Throughput: 0: 5848.6. Samples: 412473606. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:17,872][25689] Avg episode reward: [(0, '-49.277')] [2022-07-09 20:13:18,281][26022] Updated weights on worker 0-0, policy_version 402804 (0.00087) [2022-07-09 20:13:20,164][26022] Updated weights on worker 0-0, policy_version 402814 (0.00088) [2022-07-09 20:13:22,102][26022] Updated weights on worker 0-0, policy_version 402824 (0.00090) [2022-07-09 20:13:22,954][25689] Fps is (10 sec: 5662.1, 60 sec: 5658.6, 300 sec: 5685.0). Total num frames: 412496896. Throughput: 0: 5092.0. Samples: 412490654. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:22,954][25689] Avg episode reward: [(0, '-48.947')] [2022-07-09 20:13:23,669][26022] Updated weights on worker 0-0, policy_version 402834 (0.00080) [2022-07-09 20:13:25,584][26022] Updated weights on worker 0-0, policy_version 402844 (0.00088) [2022-07-09 20:13:27,254][26022] Updated weights on worker 0-0, policy_version 402854 (0.00084) [2022-07-09 20:13:27,983][25689] Fps is (10 sec: 5671.3, 60 sec: 5673.3, 300 sec: 5675.8). Total num frames: 412525568. Throughput: 0: 5935.6. Samples: 412525022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:27,983][25689] Avg episode reward: [(0, '-49.699')] [2022-07-09 20:13:28,742][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:13:28,753][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000402861_412529664.pth [2022-07-09 20:13:28,753][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000400863_410483712.pth [2022-07-09 20:13:29,155][26022] Updated weights on worker 0-0, policy_version 402864 (0.00088) [2022-07-09 20:13:31,092][26022] Updated weights on worker 0-0, policy_version 402874 (0.00089) [2022-07-09 20:13:32,528][26022] Updated weights on worker 0-0, policy_version 402884 (0.00085) [2022-07-09 20:13:33,027][25689] Fps is (10 sec: 5692.6, 60 sec: 5676.0, 300 sec: 5675.5). Total num frames: 412554240. Throughput: 0: 5951.9. Samples: 412559254. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:33,028][25689] Avg episode reward: [(0, '-50.604')] [2022-07-09 20:13:34,631][26022] Updated weights on worker 0-0, policy_version 402894 (0.00087) [2022-07-09 20:13:36,183][26022] Updated weights on worker 0-0, policy_version 402904 (0.00089) [2022-07-09 20:13:38,055][25689] Fps is (10 sec: 5591.5, 60 sec: 5657.9, 300 sec: 5675.2). Total num frames: 412581888. Throughput: 0: 5092.3. Samples: 412576388. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:38,057][25689] Avg episode reward: [(0, '-49.575')] [2022-07-09 20:13:38,155][26022] Updated weights on worker 0-0, policy_version 402914 (0.00094) [2022-07-09 20:13:39,779][26022] Updated weights on worker 0-0, policy_version 402924 (0.00089) [2022-07-09 20:13:41,583][26022] Updated weights on worker 0-0, policy_version 402934 (0.00604) [2022-07-09 20:13:43,079][25689] Fps is (10 sec: 5704.9, 60 sec: 5676.4, 300 sec: 5678.6). Total num frames: 412611584. Throughput: 0: 5972.4. Samples: 412610850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:43,079][25689] Avg episode reward: [(0, '-48.806')] [2022-07-09 20:13:43,462][26022] Updated weights on worker 0-0, policy_version 402944 (0.00084) [2022-07-09 20:13:45,062][26022] Updated weights on worker 0-0, policy_version 402954 (0.00089) [2022-07-09 20:13:47,142][26022] Updated weights on worker 0-0, policy_version 402964 (0.00089) [2022-07-09 20:13:48,085][25689] Fps is (10 sec: 5921.7, 60 sec: 5678.2, 300 sec: 5683.9). Total num frames: 412641280. Throughput: 0: 5988.1. Samples: 412645394. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:48,087][25689] Avg episode reward: [(0, '-48.453')] [2022-07-09 20:13:48,952][26022] Updated weights on worker 0-0, policy_version 402974 (0.00091) [2022-07-09 20:13:50,486][26022] Updated weights on worker 0-0, policy_version 402984 (0.00086) [2022-07-09 20:13:52,577][26022] Updated weights on worker 0-0, policy_version 402994 (0.00079) [2022-07-09 20:13:53,215][25689] Fps is (10 sec: 5657.6, 60 sec: 5670.2, 300 sec: 5674.6). Total num frames: 412668928. Throughput: 0: 5942.0. Samples: 412679208. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:53,222][25689] Avg episode reward: [(0, '-48.310')] [2022-07-09 20:13:54,247][26022] Updated weights on worker 0-0, policy_version 403004 (0.00093) [2022-07-09 20:13:56,191][26022] Updated weights on worker 0-0, policy_version 403014 (0.00086) [2022-07-09 20:13:57,908][26022] Updated weights on worker 0-0, policy_version 403024 (0.00087) [2022-07-09 20:13:58,231][25689] Fps is (10 sec: 5550.9, 60 sec: 5652.3, 300 sec: 5681.6). Total num frames: 412697600. Throughput: 0: 5947.8. Samples: 412696388. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:13:58,233][25689] Avg episode reward: [(0, '-47.938')] [2022-07-09 20:13:59,600][26022] Updated weights on worker 0-0, policy_version 403034 (0.00083) [2022-07-09 20:14:01,543][26022] Updated weights on worker 0-0, policy_version 403044 (0.00088) [2022-07-09 20:14:03,254][25689] Fps is (10 sec: 5609.9, 60 sec: 5669.5, 300 sec: 5681.4). Total num frames: 412725248. Throughput: 0: 5914.6. Samples: 412730178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:03,256][25689] Avg episode reward: [(0, '-47.136')] [2022-07-09 20:14:03,547][26022] Updated weights on worker 0-0, policy_version 403054 (0.00084) [2022-07-09 20:14:05,427][26022] Updated weights on worker 0-0, policy_version 403064 (0.00085) [2022-07-09 20:14:07,028][26022] Updated weights on worker 0-0, policy_version 403074 (0.00094) [2022-07-09 20:14:08,278][25689] Fps is (10 sec: 5605.8, 60 sec: 5702.2, 300 sec: 5678.7). Total num frames: 412753920. Throughput: 0: 5859.5. Samples: 412763714. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:08,280][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 20:14:08,892][26022] Updated weights on worker 0-0, policy_version 403084 (0.00103) [2022-07-09 20:14:10,660][26022] Updated weights on worker 0-0, policy_version 403094 (0.00087) [2022-07-09 20:14:12,784][26022] Updated weights on worker 0-0, policy_version 403104 (0.00085) [2022-07-09 20:14:13,332][25689] Fps is (10 sec: 5689.8, 60 sec: 5669.5, 300 sec: 5681.2). Total num frames: 412782592. Throughput: 0: 5053.8. Samples: 412780880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:13,333][25689] Avg episode reward: [(0, '-47.386')] [2022-07-09 20:14:14,174][26022] Updated weights on worker 0-0, policy_version 403114 (0.00088) [2022-07-09 20:14:16,067][26022] Updated weights on worker 0-0, policy_version 403124 (0.00091) [2022-07-09 20:14:17,638][26022] Updated weights on worker 0-0, policy_version 403134 (0.00087) [2022-07-09 20:14:18,340][25689] Fps is (10 sec: 5800.7, 60 sec: 5690.0, 300 sec: 5681.6). Total num frames: 412812288. Throughput: 0: 5934.1. Samples: 412815716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:18,340][25689] Avg episode reward: [(0, '-47.135')] [2022-07-09 20:14:19,634][26022] Updated weights on worker 0-0, policy_version 403144 (0.00085) [2022-07-09 20:14:21,223][26022] Updated weights on worker 0-0, policy_version 403154 (0.00086) [2022-07-09 20:14:23,377][25689] Fps is (10 sec: 5709.1, 60 sec: 5677.3, 300 sec: 5681.2). Total num frames: 412839936. Throughput: 0: 5948.6. Samples: 412849880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:23,377][25689] Avg episode reward: [(0, '-47.094')] [2022-07-09 20:14:23,383][26022] Updated weights on worker 0-0, policy_version 403164 (0.00090) [2022-07-09 20:14:24,924][26022] Updated weights on worker 0-0, policy_version 403174 (0.00087) [2022-07-09 20:14:26,879][26022] Updated weights on worker 0-0, policy_version 403184 (0.00086) [2022-07-09 20:14:28,425][25689] Fps is (10 sec: 5686.0, 60 sec: 5692.5, 300 sec: 5682.4). Total num frames: 412869632. Throughput: 0: 5124.8. Samples: 412866964. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:28,426][25689] Avg episode reward: [(0, '-46.891')] [2022-07-09 20:14:28,507][26022] Updated weights on worker 0-0, policy_version 403194 (0.00085) [2022-07-09 20:14:30,546][26022] Updated weights on worker 0-0, policy_version 403204 (0.00087) [2022-07-09 20:14:32,111][26022] Updated weights on worker 0-0, policy_version 403214 (0.00085) [2022-07-09 20:14:33,513][25689] Fps is (10 sec: 5758.1, 60 sec: 5688.3, 300 sec: 5677.4). Total num frames: 412898304. Throughput: 0: 5962.0. Samples: 412901198. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:33,514][25689] Avg episode reward: [(0, '-47.542')] [2022-07-09 20:14:34,178][26022] Updated weights on worker 0-0, policy_version 403224 (0.00066) [2022-07-09 20:14:35,739][26022] Updated weights on worker 0-0, policy_version 403234 (0.00089) [2022-07-09 20:14:37,594][26022] Updated weights on worker 0-0, policy_version 403244 (0.00092) [2022-07-09 20:14:38,599][25689] Fps is (10 sec: 5736.9, 60 sec: 5716.7, 300 sec: 5680.1). Total num frames: 412928000. Throughput: 0: 5918.6. Samples: 412935622. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:38,600][25689] Avg episode reward: [(0, '-47.607')] [2022-07-09 20:14:39,464][26022] Updated weights on worker 0-0, policy_version 403254 (0.00087) [2022-07-09 20:14:41,113][26022] Updated weights on worker 0-0, policy_version 403264 (0.00095) [2022-07-09 20:14:43,083][26022] Updated weights on worker 0-0, policy_version 403274 (0.00092) [2022-07-09 20:14:43,635][25689] Fps is (10 sec: 5665.7, 60 sec: 5681.8, 300 sec: 5677.2). Total num frames: 412955648. Throughput: 0: 5075.7. Samples: 412952702. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:43,635][25689] Avg episode reward: [(0, '-47.709')] [2022-07-09 20:14:44,706][26022] Updated weights on worker 0-0, policy_version 403284 (0.00085) [2022-07-09 20:14:46,614][26022] Updated weights on worker 0-0, policy_version 403294 (0.00088) [2022-07-09 20:14:48,281][26022] Updated weights on worker 0-0, policy_version 403304 (0.00091) [2022-07-09 20:14:48,639][25689] Fps is (10 sec: 5711.8, 60 sec: 5682.0, 300 sec: 5678.6). Total num frames: 412985344. Throughput: 0: 5946.4. Samples: 412987162. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:48,639][25689] Avg episode reward: [(0, '-47.726')] [2022-07-09 20:14:50,136][26022] Updated weights on worker 0-0, policy_version 403314 (0.00099) [2022-07-09 20:14:51,894][26022] Updated weights on worker 0-0, policy_version 403324 (0.00093) [2022-07-09 20:14:53,759][25689] Fps is (10 sec: 5663.8, 60 sec: 5682.8, 300 sec: 5673.8). Total num frames: 413012992. Throughput: 0: 5946.0. Samples: 413021580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:53,760][25689] Avg episode reward: [(0, '-47.819')] [2022-07-09 20:14:53,839][26022] Updated weights on worker 0-0, policy_version 403334 (0.00099) [2022-07-09 20:14:55,378][26022] Updated weights on worker 0-0, policy_version 403344 (0.00092) [2022-07-09 20:14:57,298][26022] Updated weights on worker 0-0, policy_version 403354 (0.00087) [2022-07-09 20:14:58,774][25689] Fps is (10 sec: 5759.1, 60 sec: 5716.8, 300 sec: 5680.7). Total num frames: 413043712. Throughput: 0: 5101.0. Samples: 413038532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:14:58,774][25689] Avg episode reward: [(0, '-48.259')] [2022-07-09 20:14:58,915][26022] Updated weights on worker 0-0, policy_version 403364 (0.00088) [2022-07-09 20:15:00,808][26022] Updated weights on worker 0-0, policy_version 403374 (0.00086) [2022-07-09 20:15:02,910][26022] Updated weights on worker 0-0, policy_version 403384 (0.00084) [2022-07-09 20:15:03,776][25689] Fps is (10 sec: 5622.7, 60 sec: 5685.0, 300 sec: 5681.0). Total num frames: 413069312. Throughput: 0: 5874.5. Samples: 413071022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:03,776][25689] Avg episode reward: [(0, '-47.801')] [2022-07-09 20:15:04,633][26022] Updated weights on worker 0-0, policy_version 403394 (0.00085) [2022-07-09 20:15:06,706][26022] Updated weights on worker 0-0, policy_version 403404 (0.00090) [2022-07-09 20:15:08,290][26022] Updated weights on worker 0-0, policy_version 403414 (0.00097) [2022-07-09 20:15:08,839][25689] Fps is (10 sec: 5289.9, 60 sec: 5664.3, 300 sec: 5678.1). Total num frames: 413096960. Throughput: 0: 5851.0. Samples: 413105356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:08,840][25689] Avg episode reward: [(0, '-48.061')] [2022-07-09 20:15:10,144][26022] Updated weights on worker 0-0, policy_version 403424 (0.00085) [2022-07-09 20:15:11,896][26022] Updated weights on worker 0-0, policy_version 403434 (0.00091) [2022-07-09 20:15:13,835][26022] Updated weights on worker 0-0, policy_version 403444 (0.00083) [2022-07-09 20:15:13,866][25689] Fps is (10 sec: 5784.7, 60 sec: 5700.8, 300 sec: 5685.1). Total num frames: 413127680. Throughput: 0: 5023.3. Samples: 413122582. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:13,866][25689] Avg episode reward: [(0, '-47.218')] [2022-07-09 20:15:15,646][26022] Updated weights on worker 0-0, policy_version 403454 (0.00091) [2022-07-09 20:15:17,369][26022] Updated weights on worker 0-0, policy_version 403464 (0.00091) [2022-07-09 20:15:18,917][25689] Fps is (10 sec: 5792.0, 60 sec: 5662.9, 300 sec: 5678.1). Total num frames: 413155328. Throughput: 0: 5879.8. Samples: 413156970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:18,919][25689] Avg episode reward: [(0, '-47.212')] [2022-07-09 20:15:19,199][26022] Updated weights on worker 0-0, policy_version 403474 (0.00084) [2022-07-09 20:15:20,913][26022] Updated weights on worker 0-0, policy_version 403484 (0.00086) [2022-07-09 20:15:22,633][26022] Updated weights on worker 0-0, policy_version 403494 (0.00093) [2022-07-09 20:15:24,015][25689] Fps is (10 sec: 5650.0, 60 sec: 5690.9, 300 sec: 5684.3). Total num frames: 413185024. Throughput: 0: 5932.2. Samples: 413191086. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:24,017][25689] Avg episode reward: [(0, '-46.418')] [2022-07-09 20:15:24,551][26022] Updated weights on worker 0-0, policy_version 403504 (0.00087) [2022-07-09 20:15:26,271][26022] Updated weights on worker 0-0, policy_version 403514 (0.00088) [2022-07-09 20:15:27,982][26022] Updated weights on worker 0-0, policy_version 403524 (0.00095) [2022-07-09 20:15:28,818][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:15:28,830][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000403528_413212672.pth [2022-07-09 20:15:28,830][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000401529_411165696.pth [2022-07-09 20:15:29,031][25689] Fps is (10 sec: 5770.7, 60 sec: 5677.0, 300 sec: 5685.0). Total num frames: 413213696. Throughput: 0: 5094.2. Samples: 413208218. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:29,033][25689] Avg episode reward: [(0, '-46.313')] [2022-07-09 20:15:30,158][26022] Updated weights on worker 0-0, policy_version 403534 (0.00090) [2022-07-09 20:15:31,732][26022] Updated weights on worker 0-0, policy_version 403544 (0.00083) [2022-07-09 20:15:33,704][26022] Updated weights on worker 0-0, policy_version 403554 (0.00091) [2022-07-09 20:15:34,156][25689] Fps is (10 sec: 5655.1, 60 sec: 5673.7, 300 sec: 5676.0). Total num frames: 413242368. Throughput: 0: 5902.9. Samples: 413242350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:34,156][25689] Avg episode reward: [(0, '-46.196')] [2022-07-09 20:15:35,351][26022] Updated weights on worker 0-0, policy_version 403564 (0.00087) [2022-07-09 20:15:36,986][26022] Updated weights on worker 0-0, policy_version 403574 (0.00084) [2022-07-09 20:15:39,106][26022] Updated weights on worker 0-0, policy_version 403584 (0.00089) [2022-07-09 20:15:39,205][25689] Fps is (10 sec: 5535.8, 60 sec: 5643.3, 300 sec: 5671.7). Total num frames: 413270016. Throughput: 0: 5871.0. Samples: 413276082. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:39,206][25689] Avg episode reward: [(0, '-46.543')] [2022-07-09 20:15:40,802][26022] Updated weights on worker 0-0, policy_version 403594 (0.00089) [2022-07-09 20:15:42,600][26022] Updated weights on worker 0-0, policy_version 403604 (0.00087) [2022-07-09 20:15:44,212][25689] Fps is (10 sec: 5702.4, 60 sec: 5679.8, 300 sec: 5678.7). Total num frames: 413299712. Throughput: 0: 5061.4. Samples: 413293308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:44,212][25689] Avg episode reward: [(0, '-46.319')] [2022-07-09 20:15:44,447][26022] Updated weights on worker 0-0, policy_version 403614 (0.00079) [2022-07-09 20:15:46,036][26022] Updated weights on worker 0-0, policy_version 403624 (0.00086) [2022-07-09 20:15:47,891][26022] Updated weights on worker 0-0, policy_version 403634 (0.00085) [2022-07-09 20:15:49,239][25689] Fps is (10 sec: 5715.2, 60 sec: 5643.8, 300 sec: 5675.6). Total num frames: 413327360. Throughput: 0: 5927.4. Samples: 413327996. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 20:15:49,239][25689] Avg episode reward: [(0, '-46.471')] [2022-07-09 20:15:49,753][26022] Updated weights on worker 0-0, policy_version 403644 (0.00092) [2022-07-09 20:15:51,545][26022] Updated weights on worker 0-0, policy_version 403654 (0.00343) [2022-07-09 20:15:53,436][26022] Updated weights on worker 0-0, policy_version 403664 (0.00094) [2022-07-09 20:15:54,341][25689] Fps is (10 sec: 5661.1, 60 sec: 5679.3, 300 sec: 5673.9). Total num frames: 413357056. Throughput: 0: 5909.3. Samples: 413361632. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:15:54,342][25689] Avg episode reward: [(0, '-46.769')] [2022-07-09 20:15:55,415][26022] Updated weights on worker 0-0, policy_version 403674 (0.00095) [2022-07-09 20:15:56,953][26022] Updated weights on worker 0-0, policy_version 403684 (0.00097) [2022-07-09 20:15:58,763][26022] Updated weights on worker 0-0, policy_version 403694 (0.00089) [2022-07-09 20:15:59,381][25689] Fps is (10 sec: 5653.9, 60 sec: 5626.2, 300 sec: 5673.5). Total num frames: 413384704. Throughput: 0: 5090.2. Samples: 413378782. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:15:59,382][25689] Avg episode reward: [(0, '-46.255')] [2022-07-09 20:16:00,473][26022] Updated weights on worker 0-0, policy_version 403704 (0.00084) [2022-07-09 20:16:02,849][26022] Updated weights on worker 0-0, policy_version 403714 (0.00090) [2022-07-09 20:16:04,455][25689] Fps is (10 sec: 5467.2, 60 sec: 5653.3, 300 sec: 5672.4). Total num frames: 413412352. Throughput: 0: 5814.3. Samples: 413411010. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:04,456][25689] Avg episode reward: [(0, '-46.334')] [2022-07-09 20:16:04,579][26022] Updated weights on worker 0-0, policy_version 403724 (0.00079) [2022-07-09 20:16:06,507][26022] Updated weights on worker 0-0, policy_version 403734 (0.00097) [2022-07-09 20:16:08,172][26022] Updated weights on worker 0-0, policy_version 403744 (0.00105) [2022-07-09 20:16:09,487][25689] Fps is (10 sec: 5573.3, 60 sec: 5673.2, 300 sec: 5672.8). Total num frames: 413441024. Throughput: 0: 5787.0. Samples: 413445170. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:09,487][25689] Avg episode reward: [(0, '-45.955')] [2022-07-09 20:16:10,166][26022] Updated weights on worker 0-0, policy_version 403754 (0.00110) [2022-07-09 20:16:11,711][26022] Updated weights on worker 0-0, policy_version 403764 (0.00090) [2022-07-09 20:16:13,590][26022] Updated weights on worker 0-0, policy_version 403774 (0.00087) [2022-07-09 20:16:14,544][25689] Fps is (10 sec: 5785.3, 60 sec: 5653.4, 300 sec: 5675.2). Total num frames: 413470720. Throughput: 0: 5831.8. Samples: 413479452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:14,545][25689] Avg episode reward: [(0, '-46.242')] [2022-07-09 20:16:15,341][26022] Updated weights on worker 0-0, policy_version 403784 (0.00085) [2022-07-09 20:16:17,148][26022] Updated weights on worker 0-0, policy_version 403794 (0.00081) [2022-07-09 20:16:18,830][26022] Updated weights on worker 0-0, policy_version 403804 (0.00086) [2022-07-09 20:16:19,547][25689] Fps is (10 sec: 5700.2, 60 sec: 5658.0, 300 sec: 5672.2). Total num frames: 413498368. Throughput: 0: 5835.9. Samples: 413496466. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:19,547][25689] Avg episode reward: [(0, '-45.816')] [2022-07-09 20:16:20,669][26022] Updated weights on worker 0-0, policy_version 403814 (0.00092) [2022-07-09 20:16:22,588][26022] Updated weights on worker 0-0, policy_version 403824 (0.00093) [2022-07-09 20:16:24,461][26022] Updated weights on worker 0-0, policy_version 403834 (0.00083) [2022-07-09 20:16:24,581][25689] Fps is (10 sec: 5611.3, 60 sec: 5647.0, 300 sec: 5671.9). Total num frames: 413527040. Throughput: 0: 5952.9. Samples: 413530818. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:24,582][25689] Avg episode reward: [(0, '-46.189')] [2022-07-09 20:16:26,109][26022] Updated weights on worker 0-0, policy_version 403844 (0.00093) [2022-07-09 20:16:27,991][26022] Updated weights on worker 0-0, policy_version 403854 (0.00095) [2022-07-09 20:16:29,597][25689] Fps is (10 sec: 5705.7, 60 sec: 5647.0, 300 sec: 5676.5). Total num frames: 413555712. Throughput: 0: 5945.7. Samples: 413564740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:29,597][25689] Avg episode reward: [(0, '-46.693')] [2022-07-09 20:16:29,807][26022] Updated weights on worker 0-0, policy_version 403864 (0.00092) [2022-07-09 20:16:31,584][26022] Updated weights on worker 0-0, policy_version 403874 (0.00091) [2022-07-09 20:16:33,492][26022] Updated weights on worker 0-0, policy_version 403884 (0.00580) [2022-07-09 20:16:34,668][25689] Fps is (10 sec: 5685.1, 60 sec: 5652.0, 300 sec: 5668.3). Total num frames: 413584384. Throughput: 0: 5082.1. Samples: 413581722. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:34,668][25689] Avg episode reward: [(0, '-46.772')] [2022-07-09 20:16:35,153][26022] Updated weights on worker 0-0, policy_version 403894 (0.00091) [2022-07-09 20:16:37,104][26022] Updated weights on worker 0-0, policy_version 403904 (0.00084) [2022-07-09 20:16:38,868][26022] Updated weights on worker 0-0, policy_version 403914 (0.00086) [2022-07-09 20:16:39,753][25689] Fps is (10 sec: 5545.3, 60 sec: 5648.7, 300 sec: 5667.1). Total num frames: 413612032. Throughput: 0: 5892.7. Samples: 413615538. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:39,754][25689] Avg episode reward: [(0, '-47.211')] [2022-07-09 20:16:40,590][26022] Updated weights on worker 0-0, policy_version 403924 (0.00359) [2022-07-09 20:16:42,469][26022] Updated weights on worker 0-0, policy_version 403934 (0.00085) [2022-07-09 20:16:44,228][26022] Updated weights on worker 0-0, policy_version 403944 (0.00090) [2022-07-09 20:16:44,820][25689] Fps is (10 sec: 5648.5, 60 sec: 5643.1, 300 sec: 5666.2). Total num frames: 413641728. Throughput: 0: 5890.9. Samples: 413650042. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:44,820][25689] Avg episode reward: [(0, '-47.177')] [2022-07-09 20:16:46,015][26022] Updated weights on worker 0-0, policy_version 403954 (0.00090) [2022-07-09 20:16:47,745][26022] Updated weights on worker 0-0, policy_version 403964 (0.00092) [2022-07-09 20:16:49,551][26022] Updated weights on worker 0-0, policy_version 403974 (0.00087) [2022-07-09 20:16:49,862][25689] Fps is (10 sec: 5773.8, 60 sec: 5658.5, 300 sec: 5667.2). Total num frames: 413670400. Throughput: 0: 5054.4. Samples: 413667170. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:49,863][25689] Avg episode reward: [(0, '-47.205')] [2022-07-09 20:16:51,466][26022] Updated weights on worker 0-0, policy_version 403984 (0.00080) [2022-07-09 20:16:53,028][26022] Updated weights on worker 0-0, policy_version 403994 (0.00094) [2022-07-09 20:16:54,971][25689] Fps is (10 sec: 5548.1, 60 sec: 5624.1, 300 sec: 5662.8). Total num frames: 413698048. Throughput: 0: 5890.8. Samples: 413701324. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:54,972][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 20:16:55,209][26022] Updated weights on worker 0-0, policy_version 404004 (0.00087) [2022-07-09 20:16:56,823][26022] Updated weights on worker 0-0, policy_version 404014 (0.00082) [2022-07-09 20:16:58,691][26022] Updated weights on worker 0-0, policy_version 404024 (0.00069) [2022-07-09 20:16:59,983][25689] Fps is (10 sec: 5564.9, 60 sec: 5643.7, 300 sec: 5669.7). Total num frames: 413726720. Throughput: 0: 5930.1. Samples: 413735502. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:16:59,983][25689] Avg episode reward: [(0, '-47.260')] [2022-07-09 20:17:00,330][26022] Updated weights on worker 0-0, policy_version 404034 (0.00083) [2022-07-09 20:17:02,575][26022] Updated weights on worker 0-0, policy_version 404044 (0.00089) [2022-07-09 20:17:04,412][26022] Updated weights on worker 0-0, policy_version 404054 (0.00092) [2022-07-09 20:17:05,046][25689] Fps is (10 sec: 5691.6, 60 sec: 5661.6, 300 sec: 5668.7). Total num frames: 413755392. Throughput: 0: 4968.1. Samples: 413750528. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:05,047][25689] Avg episode reward: [(0, '-47.515')] [2022-07-09 20:17:06,242][26022] Updated weights on worker 0-0, policy_version 404064 (0.00089) [2022-07-09 20:17:08,016][26022] Updated weights on worker 0-0, policy_version 404074 (0.00088) [2022-07-09 20:17:09,900][26022] Updated weights on worker 0-0, policy_version 404084 (0.00089) [2022-07-09 20:17:10,084][25689] Fps is (10 sec: 5474.0, 60 sec: 5627.1, 300 sec: 5659.4). Total num frames: 413782016. Throughput: 0: 5782.0. Samples: 413784094. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:10,085][25689] Avg episode reward: [(0, '-47.899')] [2022-07-09 20:17:11,547][26022] Updated weights on worker 0-0, policy_version 404094 (0.00083) [2022-07-09 20:17:13,696][26022] Updated weights on worker 0-0, policy_version 404104 (0.00084) [2022-07-09 20:17:15,151][25689] Fps is (10 sec: 5573.7, 60 sec: 5626.3, 300 sec: 5665.6). Total num frames: 413811712. Throughput: 0: 5796.7. Samples: 413818300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:15,152][25689] Avg episode reward: [(0, '-47.926')] [2022-07-09 20:17:15,224][26022] Updated weights on worker 0-0, policy_version 404114 (0.00090) [2022-07-09 20:17:17,136][26022] Updated weights on worker 0-0, policy_version 404124 (0.00087) [2022-07-09 20:17:18,939][26022] Updated weights on worker 0-0, policy_version 404134 (0.00088) [2022-07-09 20:17:20,162][25689] Fps is (10 sec: 5791.8, 60 sec: 5642.4, 300 sec: 5662.7). Total num frames: 413840384. Throughput: 0: 4945.9. Samples: 413835306. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:20,164][25689] Avg episode reward: [(0, '-48.484')] [2022-07-09 20:17:20,750][26022] Updated weights on worker 0-0, policy_version 404144 (0.00098) [2022-07-09 20:17:22,503][26022] Updated weights on worker 0-0, policy_version 404154 (0.00095) [2022-07-09 20:17:24,412][26022] Updated weights on worker 0-0, policy_version 404164 (0.00086) [2022-07-09 20:17:25,168][25689] Fps is (10 sec: 5622.7, 60 sec: 5628.2, 300 sec: 5662.7). Total num frames: 413868032. Throughput: 0: 5917.1. Samples: 413869588. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:25,168][25689] Avg episode reward: [(0, '-48.215')] [2022-07-09 20:17:26,172][26022] Updated weights on worker 0-0, policy_version 404174 (0.00084) [2022-07-09 20:17:28,076][26022] Updated weights on worker 0-0, policy_version 404184 (0.00091) [2022-07-09 20:17:29,033][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:17:29,042][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000404190_413890560.pth [2022-07-09 20:17:29,047][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000402196_411848704.pth [2022-07-09 20:17:29,795][26022] Updated weights on worker 0-0, policy_version 404194 (0.00095) [2022-07-09 20:17:30,199][25689] Fps is (10 sec: 5713.5, 60 sec: 5643.7, 300 sec: 5666.9). Total num frames: 413897728. Throughput: 0: 5944.5. Samples: 413903664. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:30,199][25689] Avg episode reward: [(0, '-48.049')] [2022-07-09 20:17:31,568][26022] Updated weights on worker 0-0, policy_version 404204 (0.00089) [2022-07-09 20:17:33,353][26022] Updated weights on worker 0-0, policy_version 404214 (0.00086) [2022-07-09 20:17:35,083][26022] Updated weights on worker 0-0, policy_version 404224 (0.00090) [2022-07-09 20:17:35,300][25689] Fps is (10 sec: 5760.4, 60 sec: 5640.8, 300 sec: 5665.3). Total num frames: 413926400. Throughput: 0: 5089.6. Samples: 413920852. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:35,301][25689] Avg episode reward: [(0, '-48.043')] [2022-07-09 20:17:37,000][26022] Updated weights on worker 0-0, policy_version 404234 (0.00097) [2022-07-09 20:17:38,564][26022] Updated weights on worker 0-0, policy_version 404244 (0.00088) [2022-07-09 20:17:40,313][25689] Fps is (10 sec: 5568.1, 60 sec: 5647.5, 300 sec: 5662.4). Total num frames: 413954048. Throughput: 0: 5958.8. Samples: 413955384. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:40,314][25689] Avg episode reward: [(0, '-48.104')] [2022-07-09 20:17:40,509][26022] Updated weights on worker 0-0, policy_version 404254 (0.00092) [2022-07-09 20:17:42,109][26022] Updated weights on worker 0-0, policy_version 404264 (0.00090) [2022-07-09 20:17:44,150][26022] Updated weights on worker 0-0, policy_version 404274 (0.00086) [2022-07-09 20:17:45,331][25689] Fps is (10 sec: 5717.0, 60 sec: 5652.2, 300 sec: 5662.5). Total num frames: 413983744. Throughput: 0: 5957.7. Samples: 413989712. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:45,331][25689] Avg episode reward: [(0, '-47.876')] [2022-07-09 20:17:45,793][26022] Updated weights on worker 0-0, policy_version 404284 (0.00085) [2022-07-09 20:17:47,686][26022] Updated weights on worker 0-0, policy_version 404294 (0.00089) [2022-07-09 20:17:49,357][26022] Updated weights on worker 0-0, policy_version 404304 (0.00083) [2022-07-09 20:17:50,334][25689] Fps is (10 sec: 5824.7, 60 sec: 5655.8, 300 sec: 5666.7). Total num frames: 414012416. Throughput: 0: 5132.9. Samples: 414007018. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:50,335][25689] Avg episode reward: [(0, '-46.950')] [2022-07-09 20:17:51,330][26022] Updated weights on worker 0-0, policy_version 404314 (0.00097) [2022-07-09 20:17:52,908][26022] Updated weights on worker 0-0, policy_version 404324 (0.00095) [2022-07-09 20:17:54,870][26022] Updated weights on worker 0-0, policy_version 404334 (0.00098) [2022-07-09 20:17:55,400][25689] Fps is (10 sec: 5796.4, 60 sec: 5693.7, 300 sec: 5665.6). Total num frames: 414042112. Throughput: 0: 5999.1. Samples: 414041434. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:17:55,401][25689] Avg episode reward: [(0, '-47.074')] [2022-07-09 20:17:56,495][26022] Updated weights on worker 0-0, policy_version 404344 (0.00085) [2022-07-09 20:17:58,486][26022] Updated weights on worker 0-0, policy_version 404354 (0.00050) [2022-07-09 20:18:00,045][26022] Updated weights on worker 0-0, policy_version 404364 (0.00090) [2022-07-09 20:18:00,451][25689] Fps is (10 sec: 5668.2, 60 sec: 5673.1, 300 sec: 5668.6). Total num frames: 414069760. Throughput: 0: 5966.2. Samples: 414075528. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:00,451][25689] Avg episode reward: [(0, '-47.380')] [2022-07-09 20:18:01,839][26022] Updated weights on worker 0-0, policy_version 404374 (0.00497) [2022-07-09 20:18:04,111][26022] Updated weights on worker 0-0, policy_version 404384 (0.00091) [2022-07-09 20:18:05,488][25689] Fps is (10 sec: 5481.7, 60 sec: 5658.6, 300 sec: 5671.5). Total num frames: 414097408. Throughput: 0: 5010.1. Samples: 414090702. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:05,489][25689] Avg episode reward: [(0, '-47.066')] [2022-07-09 20:18:05,882][26022] Updated weights on worker 0-0, policy_version 404394 (0.00102) [2022-07-09 20:18:07,742][26022] Updated weights on worker 0-0, policy_version 404404 (0.00089) [2022-07-09 20:18:09,502][26022] Updated weights on worker 0-0, policy_version 404414 (0.00089) [2022-07-09 20:18:10,525][25689] Fps is (10 sec: 5488.9, 60 sec: 5675.6, 300 sec: 5661.8). Total num frames: 414125056. Throughput: 0: 5851.9. Samples: 414125174. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:10,526][25689] Avg episode reward: [(0, '-47.360')] [2022-07-09 20:18:11,267][26022] Updated weights on worker 0-0, policy_version 404424 (0.00087) [2022-07-09 20:18:13,175][26022] Updated weights on worker 0-0, policy_version 404434 (0.00087) [2022-07-09 20:18:14,828][26022] Updated weights on worker 0-0, policy_version 404444 (0.00090) [2022-07-09 20:18:15,659][25689] Fps is (10 sec: 5537.1, 60 sec: 5652.4, 300 sec: 5660.1). Total num frames: 414153728. Throughput: 0: 5824.8. Samples: 414159438. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:15,660][25689] Avg episode reward: [(0, '-47.764')] [2022-07-09 20:18:16,738][26022] Updated weights on worker 0-0, policy_version 404454 (0.00083) [2022-07-09 20:18:18,360][26022] Updated weights on worker 0-0, policy_version 404464 (0.00092) [2022-07-09 20:18:20,183][26022] Updated weights on worker 0-0, policy_version 404474 (0.00097) [2022-07-09 20:18:20,667][25689] Fps is (10 sec: 5755.3, 60 sec: 5669.6, 300 sec: 5664.9). Total num frames: 414183424. Throughput: 0: 5010.0. Samples: 414176810. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:20,668][25689] Avg episode reward: [(0, '-48.284')] [2022-07-09 20:18:21,914][26022] Updated weights on worker 0-0, policy_version 404484 (0.00083) [2022-07-09 20:18:23,971][26022] Updated weights on worker 0-0, policy_version 404494 (0.00077) [2022-07-09 20:18:25,423][26022] Updated weights on worker 0-0, policy_version 404504 (0.00093) [2022-07-09 20:18:25,691][25689] Fps is (10 sec: 5920.1, 60 sec: 5701.7, 300 sec: 5668.5). Total num frames: 414213120. Throughput: 0: 5976.9. Samples: 414211458. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:25,692][25689] Avg episode reward: [(0, '-47.756')] [2022-07-09 20:18:27,662][26022] Updated weights on worker 0-0, policy_version 404514 (0.00094) [2022-07-09 20:18:28,838][26022] Updated weights on worker 0-0, policy_version 404524 (0.00083) [2022-07-09 20:18:30,731][25689] Fps is (10 sec: 5596.2, 60 sec: 5650.2, 300 sec: 5661.7). Total num frames: 414239744. Throughput: 0: 5954.8. Samples: 414245494. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-09 20:18:30,731][25689] Avg episode reward: [(0, '-47.774')] [2022-07-09 20:18:31,319][26022] Updated weights on worker 0-0, policy_version 404534 (0.00083) [2022-07-09 20:18:32,509][26022] Updated weights on worker 0-0, policy_version 404544 (0.00095) [2022-07-09 20:18:34,567][26022] Updated weights on worker 0-0, policy_version 404554 (0.00086) [2022-07-09 20:18:35,821][25689] Fps is (10 sec: 5660.9, 60 sec: 5685.0, 300 sec: 5670.8). Total num frames: 414270464. Throughput: 0: 5117.8. Samples: 414262624. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:18:35,822][25689] Avg episode reward: [(0, '-48.200')] [2022-07-09 20:18:36,349][26022] Updated weights on worker 0-0, policy_version 404564 (0.00089) [2022-07-09 20:18:37,957][26022] Updated weights on worker 0-0, policy_version 404574 (0.00088) [2022-07-09 20:18:40,081][26022] Updated weights on worker 0-0, policy_version 404584 (0.00090) [2022-07-09 20:18:40,828][25689] Fps is (10 sec: 5780.8, 60 sec: 5685.7, 300 sec: 5664.3). Total num frames: 414298112. Throughput: 0: 5966.0. Samples: 414297090. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:18:40,828][25689] Avg episode reward: [(0, '-48.863')] [2022-07-09 20:18:41,723][26022] Updated weights on worker 0-0, policy_version 404594 (0.00081) [2022-07-09 20:18:43,585][26022] Updated weights on worker 0-0, policy_version 404604 (0.00097) [2022-07-09 20:18:45,274][26022] Updated weights on worker 0-0, policy_version 404614 (0.00089) [2022-07-09 20:18:45,838][25689] Fps is (10 sec: 5622.5, 60 sec: 5669.4, 300 sec: 5660.7). Total num frames: 414326784. Throughput: 0: 5959.2. Samples: 414331516. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:18:45,839][25689] Avg episode reward: [(0, '-48.570')] [2022-07-09 20:18:46,986][26022] Updated weights on worker 0-0, policy_version 404624 (0.00087) [2022-07-09 20:18:48,923][26022] Updated weights on worker 0-0, policy_version 404634 (0.00088) [2022-07-09 20:18:50,816][26022] Updated weights on worker 0-0, policy_version 404644 (0.00088) [2022-07-09 20:18:50,849][25689] Fps is (10 sec: 5722.5, 60 sec: 5668.7, 300 sec: 5666.5). Total num frames: 414355456. Throughput: 0: 5127.4. Samples: 414348646. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:18:50,850][25689] Avg episode reward: [(0, '-49.068')] [2022-07-09 20:18:52,479][26022] Updated weights on worker 0-0, policy_version 404654 (0.00086) [2022-07-09 20:18:54,478][26022] Updated weights on worker 0-0, policy_version 404664 (0.00086) [2022-07-09 20:18:55,973][25689] Fps is (10 sec: 5758.9, 60 sec: 5663.2, 300 sec: 5667.8). Total num frames: 414385152. Throughput: 0: 5942.5. Samples: 414382378. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:18:55,974][25689] Avg episode reward: [(0, '-50.305')] [2022-07-09 20:18:56,163][26022] Updated weights on worker 0-0, policy_version 404674 (0.00085) [2022-07-09 20:18:58,346][26022] Updated weights on worker 0-0, policy_version 404684 (0.00092) [2022-07-09 20:18:59,588][26022] Updated weights on worker 0-0, policy_version 404694 (0.00086) [2022-07-09 20:19:01,045][25689] Fps is (10 sec: 5623.8, 60 sec: 5661.3, 300 sec: 5666.9). Total num frames: 414412800. Throughput: 0: 5886.7. Samples: 414416104. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:01,047][25689] Avg episode reward: [(0, '-50.116')] [2022-07-09 20:19:01,812][26022] Updated weights on worker 0-0, policy_version 404704 (0.00090) [2022-07-09 20:19:03,732][26022] Updated weights on worker 0-0, policy_version 404714 (0.00095) [2022-07-09 20:19:05,707][26022] Updated weights on worker 0-0, policy_version 404724 (0.00086) [2022-07-09 20:19:06,079][25689] Fps is (10 sec: 5370.7, 60 sec: 5644.7, 300 sec: 5659.8). Total num frames: 414439424. Throughput: 0: 5766.9. Samples: 414448240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:06,079][25689] Avg episode reward: [(0, '-48.160')] [2022-07-09 20:19:07,458][26022] Updated weights on worker 0-0, policy_version 404734 (0.00090) [2022-07-09 20:19:09,124][26022] Updated weights on worker 0-0, policy_version 404744 (0.00091) [2022-07-09 20:19:10,963][26022] Updated weights on worker 0-0, policy_version 404754 (0.00111) [2022-07-09 20:19:11,120][25689] Fps is (10 sec: 5590.5, 60 sec: 5678.1, 300 sec: 5663.5). Total num frames: 414469120. Throughput: 0: 5757.2. Samples: 414465350. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:11,120][25689] Avg episode reward: [(0, '-48.057')] [2022-07-09 20:19:12,712][26022] Updated weights on worker 0-0, policy_version 404764 (0.00079) [2022-07-09 20:19:14,534][26022] Updated weights on worker 0-0, policy_version 404774 (0.00089) [2022-07-09 20:19:16,197][25689] Fps is (10 sec: 5768.3, 60 sec: 5683.4, 300 sec: 5658.8). Total num frames: 414497792. Throughput: 0: 5792.1. Samples: 414499518. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:16,198][25689] Avg episode reward: [(0, '-47.676')] [2022-07-09 20:19:16,303][26022] Updated weights on worker 0-0, policy_version 404784 (0.00085) [2022-07-09 20:19:18,197][26022] Updated weights on worker 0-0, policy_version 404794 (0.00090) [2022-07-09 20:19:19,867][26022] Updated weights on worker 0-0, policy_version 404804 (0.00088) [2022-07-09 20:19:21,242][25689] Fps is (10 sec: 5563.9, 60 sec: 5646.2, 300 sec: 5658.6). Total num frames: 414525440. Throughput: 0: 5828.0. Samples: 414533810. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:21,247][25689] Avg episode reward: [(0, '-47.229')] [2022-07-09 20:19:21,879][26022] Updated weights on worker 0-0, policy_version 404814 (0.00083) [2022-07-09 20:19:23,501][26022] Updated weights on worker 0-0, policy_version 404824 (0.00084) [2022-07-09 20:19:25,410][26022] Updated weights on worker 0-0, policy_version 404834 (0.00092) [2022-07-09 20:19:26,249][25689] Fps is (10 sec: 5603.0, 60 sec: 5630.9, 300 sec: 5656.0). Total num frames: 414554112. Throughput: 0: 5100.9. Samples: 414551128. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:26,250][25689] Avg episode reward: [(0, '-46.924')] [2022-07-09 20:19:27,171][26022] Updated weights on worker 0-0, policy_version 404844 (0.00088) [2022-07-09 20:19:29,012][26022] Updated weights on worker 0-0, policy_version 404854 (0.00084) [2022-07-09 20:19:29,144][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:19:29,158][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000404855_414571520.pth [2022-07-09 20:19:29,160][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000402861_412529664.pth [2022-07-09 20:19:30,883][26022] Updated weights on worker 0-0, policy_version 404864 (0.00084) [2022-07-09 20:19:31,267][25689] Fps is (10 sec: 5720.2, 60 sec: 5666.7, 300 sec: 5657.3). Total num frames: 414582784. Throughput: 0: 5939.1. Samples: 414585006. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:31,267][25689] Avg episode reward: [(0, '-47.388')] [2022-07-09 20:19:32,691][26022] Updated weights on worker 0-0, policy_version 404874 (0.00081) [2022-07-09 20:19:34,226][26022] Updated weights on worker 0-0, policy_version 404884 (0.00086) [2022-07-09 20:19:36,323][25689] Fps is (10 sec: 5692.3, 60 sec: 5636.1, 300 sec: 5654.4). Total num frames: 414611456. Throughput: 0: 5957.6. Samples: 414619418. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:36,324][25689] Avg episode reward: [(0, '-48.448')] [2022-07-09 20:19:36,326][26022] Updated weights on worker 0-0, policy_version 404894 (0.00089) [2022-07-09 20:19:37,936][26022] Updated weights on worker 0-0, policy_version 404904 (0.00087) [2022-07-09 20:19:39,879][26022] Updated weights on worker 0-0, policy_version 404914 (0.00087) [2022-07-09 20:19:41,399][25689] Fps is (10 sec: 5760.3, 60 sec: 5663.4, 300 sec: 5660.5). Total num frames: 414641152. Throughput: 0: 5097.1. Samples: 414636556. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:41,400][25689] Avg episode reward: [(0, '-48.324')] [2022-07-09 20:19:41,443][26022] Updated weights on worker 0-0, policy_version 404924 (0.00087) [2022-07-09 20:19:43,447][26022] Updated weights on worker 0-0, policy_version 404934 (0.00087) [2022-07-09 20:19:45,164][26022] Updated weights on worker 0-0, policy_version 404944 (0.00093) [2022-07-09 20:19:46,418][25689] Fps is (10 sec: 5781.8, 60 sec: 5662.6, 300 sec: 5656.8). Total num frames: 414669824. Throughput: 0: 5950.1. Samples: 414671136. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:46,419][25689] Avg episode reward: [(0, '-49.125')] [2022-07-09 20:19:47,011][26022] Updated weights on worker 0-0, policy_version 404954 (0.00091) [2022-07-09 20:19:48,712][26022] Updated weights on worker 0-0, policy_version 404964 (0.00101) [2022-07-09 20:19:50,548][26022] Updated weights on worker 0-0, policy_version 404974 (0.00087) [2022-07-09 20:19:51,450][25689] Fps is (10 sec: 5705.4, 60 sec: 5660.6, 300 sec: 5661.9). Total num frames: 414698496. Throughput: 0: 5975.3. Samples: 414705610. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:51,451][25689] Avg episode reward: [(0, '-48.912')] [2022-07-09 20:19:52,219][26022] Updated weights on worker 0-0, policy_version 404984 (0.00081) [2022-07-09 20:19:54,202][26022] Updated weights on worker 0-0, policy_version 404994 (0.00095) [2022-07-09 20:19:55,757][26022] Updated weights on worker 0-0, policy_version 405004 (0.00090) [2022-07-09 20:19:56,529][25689] Fps is (10 sec: 5772.8, 60 sec: 5664.9, 300 sec: 5657.3). Total num frames: 414728192. Throughput: 0: 5113.6. Samples: 414722744. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:19:56,530][25689] Avg episode reward: [(0, '-48.096')] [2022-07-09 20:19:57,977][26022] Updated weights on worker 0-0, policy_version 405014 (0.00082) [2022-07-09 20:19:59,523][26022] Updated weights on worker 0-0, policy_version 405024 (0.00091) [2022-07-09 20:20:01,283][26022] Updated weights on worker 0-0, policy_version 405034 (0.00084) [2022-07-09 20:20:01,600][25689] Fps is (10 sec: 5649.6, 60 sec: 5664.9, 300 sec: 5662.8). Total num frames: 414755840. Throughput: 0: 5953.0. Samples: 414756812. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:01,601][25689] Avg episode reward: [(0, '-48.651')] [2022-07-09 20:20:03,489][26022] Updated weights on worker 0-0, policy_version 405044 (0.00090) [2022-07-09 20:20:05,239][26022] Updated weights on worker 0-0, policy_version 405054 (0.00090) [2022-07-09 20:20:06,700][25689] Fps is (10 sec: 5336.0, 60 sec: 5658.7, 300 sec: 5658.7). Total num frames: 414782464. Throughput: 0: 5824.8. Samples: 414789274. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:06,701][25689] Avg episode reward: [(0, '-48.868')] [2022-07-09 20:20:06,931][26022] Updated weights on worker 0-0, policy_version 405064 (0.00083) [2022-07-09 20:20:08,787][26022] Updated weights on worker 0-0, policy_version 405074 (0.00092) [2022-07-09 20:20:10,651][26022] Updated weights on worker 0-0, policy_version 405084 (0.00394) [2022-07-09 20:20:11,709][25689] Fps is (10 sec: 5672.9, 60 sec: 5678.6, 300 sec: 5659.0). Total num frames: 414813184. Throughput: 0: 4985.6. Samples: 414806610. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:11,709][25689] Avg episode reward: [(0, '-48.770')] [2022-07-09 20:20:12,326][26022] Updated weights on worker 0-0, policy_version 405094 (0.00081) [2022-07-09 20:20:14,242][26022] Updated weights on worker 0-0, policy_version 405104 (0.00091) [2022-07-09 20:20:15,870][26022] Updated weights on worker 0-0, policy_version 405114 (0.00086) [2022-07-09 20:20:16,767][25689] Fps is (10 sec: 5798.0, 60 sec: 5663.6, 300 sec: 5658.9). Total num frames: 414840832. Throughput: 0: 5853.6. Samples: 414841208. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:16,769][25689] Avg episode reward: [(0, '-48.414')] [2022-07-09 20:20:17,607][26022] Updated weights on worker 0-0, policy_version 405124 (0.00084) [2022-07-09 20:20:19,433][26022] Updated weights on worker 0-0, policy_version 405134 (0.00092) [2022-07-09 20:20:21,216][26022] Updated weights on worker 0-0, policy_version 405144 (0.00080) [2022-07-09 20:20:21,788][25689] Fps is (10 sec: 5587.8, 60 sec: 5682.7, 300 sec: 5656.9). Total num frames: 414869504. Throughput: 0: 5900.6. Samples: 414875930. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:21,788][25689] Avg episode reward: [(0, '-48.330')] [2022-07-09 20:20:22,943][26022] Updated weights on worker 0-0, policy_version 405154 (0.00096) [2022-07-09 20:20:24,725][26022] Updated weights on worker 0-0, policy_version 405164 (0.00084) [2022-07-09 20:20:26,428][26022] Updated weights on worker 0-0, policy_version 405174 (0.00094) [2022-07-09 20:20:26,830][25689] Fps is (10 sec: 5901.9, 60 sec: 5713.2, 300 sec: 5663.3). Total num frames: 414900224. Throughput: 0: 5161.0. Samples: 414893164. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:26,831][25689] Avg episode reward: [(0, '-47.950')] [2022-07-09 20:20:28,714][26022] Updated weights on worker 0-0, policy_version 405184 (0.00091) [2022-07-09 20:20:30,330][26022] Updated weights on worker 0-0, policy_version 405194 (0.00096) [2022-07-09 20:20:31,867][25689] Fps is (10 sec: 5587.5, 60 sec: 5660.7, 300 sec: 5654.6). Total num frames: 414925824. Throughput: 0: 5958.9. Samples: 414926734. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:31,868][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 20:20:32,077][26022] Updated weights on worker 0-0, policy_version 405204 (0.00087) [2022-07-09 20:20:33,820][26022] Updated weights on worker 0-0, policy_version 405214 (0.00083) [2022-07-09 20:20:35,862][26022] Updated weights on worker 0-0, policy_version 405224 (0.00088) [2022-07-09 20:20:37,003][25689] Fps is (10 sec: 5435.7, 60 sec: 5670.2, 300 sec: 5659.9). Total num frames: 414955520. Throughput: 0: 5905.7. Samples: 414960714. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:37,004][25689] Avg episode reward: [(0, '-47.819')] [2022-07-09 20:20:37,615][26022] Updated weights on worker 0-0, policy_version 405234 (0.00089) [2022-07-09 20:20:39,346][26022] Updated weights on worker 0-0, policy_version 405244 (0.00084) [2022-07-09 20:20:41,156][26022] Updated weights on worker 0-0, policy_version 405254 (0.00090) [2022-07-09 20:20:42,029][25689] Fps is (10 sec: 5744.3, 60 sec: 5658.0, 300 sec: 5656.1). Total num frames: 414984192. Throughput: 0: 5028.0. Samples: 414977700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:42,029][25689] Avg episode reward: [(0, '-47.264')] [2022-07-09 20:20:42,811][26022] Updated weights on worker 0-0, policy_version 405264 (0.00081) [2022-07-09 20:20:44,892][26022] Updated weights on worker 0-0, policy_version 405274 (0.00085) [2022-07-09 20:20:46,439][26022] Updated weights on worker 0-0, policy_version 405284 (0.00089) [2022-07-09 20:20:47,047][25689] Fps is (10 sec: 5811.4, 60 sec: 5675.0, 300 sec: 5663.1). Total num frames: 415013888. Throughput: 0: 5895.4. Samples: 415012348. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:47,047][25689] Avg episode reward: [(0, '-47.693')] [2022-07-09 20:20:48,395][26022] Updated weights on worker 0-0, policy_version 405294 (0.00086) [2022-07-09 20:20:49,997][26022] Updated weights on worker 0-0, policy_version 405304 (0.00088) [2022-07-09 20:20:51,933][26022] Updated weights on worker 0-0, policy_version 405314 (0.00251) [2022-07-09 20:20:52,140][25689] Fps is (10 sec: 5772.3, 60 sec: 5669.2, 300 sec: 5659.9). Total num frames: 415042560. Throughput: 0: 5914.6. Samples: 415046640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:52,141][25689] Avg episode reward: [(0, '-46.888')] [2022-07-09 20:20:53,751][26022] Updated weights on worker 0-0, policy_version 405324 (0.00089) [2022-07-09 20:20:55,453][26022] Updated weights on worker 0-0, policy_version 405334 (0.00078) [2022-07-09 20:20:57,189][25689] Fps is (10 sec: 5754.9, 60 sec: 5672.0, 300 sec: 5666.6). Total num frames: 415072256. Throughput: 0: 5952.1. Samples: 415080864. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:20:57,190][25689] Avg episode reward: [(0, '-47.289')] [2022-07-09 20:20:57,194][26022] Updated weights on worker 0-0, policy_version 405344 (0.00086) [2022-07-09 20:20:59,174][26022] Updated weights on worker 0-0, policy_version 405354 (0.00083) [2022-07-09 20:21:00,852][26022] Updated weights on worker 0-0, policy_version 405364 (0.00100) [2022-07-09 20:21:02,195][25689] Fps is (10 sec: 5499.5, 60 sec: 5644.3, 300 sec: 5661.0). Total num frames: 415097856. Throughput: 0: 5971.8. Samples: 415098130. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:21:02,195][25689] Avg episode reward: [(0, '-47.245')] [2022-07-09 20:21:03,051][26022] Updated weights on worker 0-0, policy_version 405374 (0.00090) [2022-07-09 20:21:04,604][26022] Updated weights on worker 0-0, policy_version 405384 (0.00402) [2022-07-09 20:21:06,603][26022] Updated weights on worker 0-0, policy_version 405394 (0.00085) [2022-07-09 20:21:07,218][25689] Fps is (10 sec: 5309.2, 60 sec: 5668.4, 300 sec: 5657.7). Total num frames: 415125504. Throughput: 0: 5862.9. Samples: 415130612. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:21:07,219][25689] Avg episode reward: [(0, '-46.401')] [2022-07-09 20:21:08,241][26022] Updated weights on worker 0-0, policy_version 405404 (0.00083) [2022-07-09 20:21:10,244][26022] Updated weights on worker 0-0, policy_version 405414 (0.00089) [2022-07-09 20:21:11,929][26022] Updated weights on worker 0-0, policy_version 405424 (0.00087) [2022-07-09 20:21:12,235][25689] Fps is (10 sec: 5711.5, 60 sec: 5650.7, 300 sec: 5658.5). Total num frames: 415155200. Throughput: 0: 5891.1. Samples: 415165020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2022-07-09 20:21:12,236][25689] Avg episode reward: [(0, '-46.725')] [2022-07-09 20:21:13,747][26022] Updated weights on worker 0-0, policy_version 405434 (0.00099) [2022-07-09 20:21:15,582][26022] Updated weights on worker 0-0, policy_version 405444 (0.00079) [2022-07-09 20:21:17,254][26022] Updated weights on worker 0-0, policy_version 405454 (0.00095) [2022-07-09 20:21:17,335][25689] Fps is (10 sec: 5870.9, 60 sec: 5680.7, 300 sec: 5663.5). Total num frames: 415184896. Throughput: 0: 5019.6. Samples: 415181988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:17,335][25689] Avg episode reward: [(0, '-46.839')] [2022-07-09 20:21:19,230][26022] Updated weights on worker 0-0, policy_version 405464 (0.00089) [2022-07-09 20:21:21,051][26022] Updated weights on worker 0-0, policy_version 405474 (0.00094) [2022-07-09 20:21:22,387][25689] Fps is (10 sec: 5648.5, 60 sec: 5660.8, 300 sec: 5659.7). Total num frames: 415212544. Throughput: 0: 5858.7. Samples: 415216430. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:22,389][25689] Avg episode reward: [(0, '-46.891')] [2022-07-09 20:21:22,766][26022] Updated weights on worker 0-0, policy_version 405484 (0.00088) [2022-07-09 20:21:24,470][26022] Updated weights on worker 0-0, policy_version 405494 (0.00094) [2022-07-09 20:21:26,310][26022] Updated weights on worker 0-0, policy_version 405504 (0.00091) [2022-07-09 20:21:27,424][25689] Fps is (10 sec: 5683.5, 60 sec: 5644.4, 300 sec: 5662.8). Total num frames: 415242240. Throughput: 0: 5931.2. Samples: 415250458. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:27,425][25689] Avg episode reward: [(0, '-46.583')] [2022-07-09 20:21:28,054][26022] Updated weights on worker 0-0, policy_version 405514 (0.00084) [2022-07-09 20:21:29,197][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:21:29,210][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000405520_415252480.pth [2022-07-09 20:21:29,210][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000403528_413212672.pth [2022-07-09 20:21:30,174][26022] Updated weights on worker 0-0, policy_version 405524 (0.00092) [2022-07-09 20:21:31,892][26022] Updated weights on worker 0-0, policy_version 405534 (0.00085) [2022-07-09 20:21:32,446][25689] Fps is (10 sec: 5700.6, 60 sec: 5679.6, 300 sec: 5660.3). Total num frames: 415269888. Throughput: 0: 5067.3. Samples: 415267438. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:32,448][25689] Avg episode reward: [(0, '-46.504')] [2022-07-09 20:21:33,597][26022] Updated weights on worker 0-0, policy_version 405544 (0.00084) [2022-07-09 20:21:35,493][26022] Updated weights on worker 0-0, policy_version 405554 (0.00256) [2022-07-09 20:21:37,371][26022] Updated weights on worker 0-0, policy_version 405564 (0.00091) [2022-07-09 20:21:37,527][25689] Fps is (10 sec: 5574.7, 60 sec: 5667.8, 300 sec: 5663.8). Total num frames: 415298560. Throughput: 0: 5908.5. Samples: 415301296. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:37,528][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 20:21:39,135][26022] Updated weights on worker 0-0, policy_version 405574 (0.00080) [2022-07-09 20:21:40,759][26022] Updated weights on worker 0-0, policy_version 405584 (0.00086) [2022-07-09 20:21:42,600][25689] Fps is (10 sec: 5748.7, 60 sec: 5680.3, 300 sec: 5663.7). Total num frames: 415328256. Throughput: 0: 5901.4. Samples: 415335712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:42,600][25689] Avg episode reward: [(0, '-46.735')] [2022-07-09 20:21:42,607][26022] Updated weights on worker 0-0, policy_version 405594 (0.00573) [2022-07-09 20:21:44,428][26022] Updated weights on worker 0-0, policy_version 405604 (0.00087) [2022-07-09 20:21:46,143][26022] Updated weights on worker 0-0, policy_version 405614 (0.00092) [2022-07-09 20:21:47,622][25689] Fps is (10 sec: 5782.2, 60 sec: 5663.1, 300 sec: 5664.1). Total num frames: 415356928. Throughput: 0: 5069.7. Samples: 415352852. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:47,622][25689] Avg episode reward: [(0, '-46.755')] [2022-07-09 20:21:47,795][26022] Updated weights on worker 0-0, policy_version 405624 (0.00093) [2022-07-09 20:21:49,943][26022] Updated weights on worker 0-0, policy_version 405634 (0.00089) [2022-07-09 20:21:51,520][26022] Updated weights on worker 0-0, policy_version 405644 (0.00092) [2022-07-09 20:21:52,638][25689] Fps is (10 sec: 5610.6, 60 sec: 5653.4, 300 sec: 5665.8). Total num frames: 415384576. Throughput: 0: 5936.0. Samples: 415387294. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:52,638][25689] Avg episode reward: [(0, '-46.714')] [2022-07-09 20:21:53,439][26022] Updated weights on worker 0-0, policy_version 405654 (0.00086) [2022-07-09 20:21:55,027][26022] Updated weights on worker 0-0, policy_version 405664 (0.00089) [2022-07-09 20:21:56,947][26022] Updated weights on worker 0-0, policy_version 405674 (0.00088) [2022-07-09 20:21:57,710][25689] Fps is (10 sec: 5785.7, 60 sec: 5668.1, 300 sec: 5671.6). Total num frames: 415415296. Throughput: 0: 5960.6. Samples: 415421598. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:21:57,710][25689] Avg episode reward: [(0, '-47.370')] [2022-07-09 20:21:58,943][26022] Updated weights on worker 0-0, policy_version 405684 (0.00090) [2022-07-09 20:22:00,507][26022] Updated weights on worker 0-0, policy_version 405694 (0.00089) [2022-07-09 20:22:02,741][25689] Fps is (10 sec: 5473.0, 60 sec: 5648.8, 300 sec: 5658.4). Total num frames: 415439872. Throughput: 0: 5120.4. Samples: 415438846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:02,742][25689] Avg episode reward: [(0, '-48.263')] [2022-07-09 20:22:02,787][26022] Updated weights on worker 0-0, policy_version 405704 (0.00084) [2022-07-09 20:22:04,307][26022] Updated weights on worker 0-0, policy_version 405714 (0.00086) [2022-07-09 20:22:06,320][26022] Updated weights on worker 0-0, policy_version 405724 (0.00089) [2022-07-09 20:22:07,811][25689] Fps is (10 sec: 5372.8, 60 sec: 5678.3, 300 sec: 5668.1). Total num frames: 415469568. Throughput: 0: 5856.0. Samples: 415471082. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:07,812][25689] Avg episode reward: [(0, '-48.147')] [2022-07-09 20:22:08,127][26022] Updated weights on worker 0-0, policy_version 405734 (0.00086) [2022-07-09 20:22:09,893][26022] Updated weights on worker 0-0, policy_version 405744 (0.00090) [2022-07-09 20:22:11,800][26022] Updated weights on worker 0-0, policy_version 405754 (0.00081) [2022-07-09 20:22:12,828][25689] Fps is (10 sec: 5786.4, 60 sec: 5661.3, 300 sec: 5665.6). Total num frames: 415498240. Throughput: 0: 5870.3. Samples: 415505820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:12,829][25689] Avg episode reward: [(0, '-48.437')] [2022-07-09 20:22:13,367][26022] Updated weights on worker 0-0, policy_version 405764 (0.00087) [2022-07-09 20:22:15,058][26022] Updated weights on worker 0-0, policy_version 405774 (0.00084) [2022-07-09 20:22:16,988][26022] Updated weights on worker 0-0, policy_version 405784 (0.00082) [2022-07-09 20:22:17,886][25689] Fps is (10 sec: 5793.7, 60 sec: 5665.3, 300 sec: 5668.2). Total num frames: 415527936. Throughput: 0: 5027.1. Samples: 415523024. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:17,887][25689] Avg episode reward: [(0, '-48.091')] [2022-07-09 20:22:18,677][26022] Updated weights on worker 0-0, policy_version 405794 (0.00094) [2022-07-09 20:22:20,537][26022] Updated weights on worker 0-0, policy_version 405804 (0.00080) [2022-07-09 20:22:22,487][26022] Updated weights on worker 0-0, policy_version 405814 (0.00092) [2022-07-09 20:22:22,921][25689] Fps is (10 sec: 5783.0, 60 sec: 5683.8, 300 sec: 5671.0). Total num frames: 415556608. Throughput: 0: 5881.1. Samples: 415557528. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:22,922][25689] Avg episode reward: [(0, '-47.852')] [2022-07-09 20:22:24,112][26022] Updated weights on worker 0-0, policy_version 405824 (0.00090) [2022-07-09 20:22:25,914][26022] Updated weights on worker 0-0, policy_version 405834 (0.00093) [2022-07-09 20:22:27,661][26022] Updated weights on worker 0-0, policy_version 405844 (0.00086) [2022-07-09 20:22:27,938][25689] Fps is (10 sec: 5704.8, 60 sec: 5668.8, 300 sec: 5667.9). Total num frames: 415585280. Throughput: 0: 5993.9. Samples: 415591716. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:27,938][25689] Avg episode reward: [(0, '-47.870')] [2022-07-09 20:22:29,550][26022] Updated weights on worker 0-0, policy_version 405854 (0.00096) [2022-07-09 20:22:31,489][26022] Updated weights on worker 0-0, policy_version 405864 (0.00098) [2022-07-09 20:22:32,942][25689] Fps is (10 sec: 5620.4, 60 sec: 5670.5, 300 sec: 5666.3). Total num frames: 415612928. Throughput: 0: 5107.7. Samples: 415608554. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:32,943][25689] Avg episode reward: [(0, '-47.445')] [2022-07-09 20:22:33,163][26022] Updated weights on worker 0-0, policy_version 405874 (0.00084) [2022-07-09 20:22:35,032][26022] Updated weights on worker 0-0, policy_version 405884 (0.00082) [2022-07-09 20:22:36,822][26022] Updated weights on worker 0-0, policy_version 405894 (0.00095) [2022-07-09 20:22:38,021][25689] Fps is (10 sec: 5585.2, 60 sec: 5670.6, 300 sec: 5668.5). Total num frames: 415641600. Throughput: 0: 5934.7. Samples: 415642524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:38,023][25689] Avg episode reward: [(0, '-47.557')] [2022-07-09 20:22:38,550][26022] Updated weights on worker 0-0, policy_version 405904 (0.00086) [2022-07-09 20:22:40,356][26022] Updated weights on worker 0-0, policy_version 405914 (0.00086) [2022-07-09 20:22:42,269][26022] Updated weights on worker 0-0, policy_version 405924 (0.00084) [2022-07-09 20:22:43,074][25689] Fps is (10 sec: 5760.8, 60 sec: 5672.5, 300 sec: 5667.8). Total num frames: 415671296. Throughput: 0: 5937.3. Samples: 415677180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:43,076][25689] Avg episode reward: [(0, '-47.938')] [2022-07-09 20:22:43,629][26022] Updated weights on worker 0-0, policy_version 405934 (0.00082) [2022-07-09 20:22:45,772][26022] Updated weights on worker 0-0, policy_version 405944 (0.00086) [2022-07-09 20:22:47,436][26022] Updated weights on worker 0-0, policy_version 405954 (0.00092) [2022-07-09 20:22:48,121][25689] Fps is (10 sec: 5779.0, 60 sec: 5670.1, 300 sec: 5667.0). Total num frames: 415699968. Throughput: 0: 5089.1. Samples: 415694434. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:48,123][25689] Avg episode reward: [(0, '-48.129')] [2022-07-09 20:22:49,348][26022] Updated weights on worker 0-0, policy_version 405964 (0.00085) [2022-07-09 20:22:51,030][26022] Updated weights on worker 0-0, policy_version 405974 (0.00082) [2022-07-09 20:22:52,637][26022] Updated weights on worker 0-0, policy_version 405984 (0.00084) [2022-07-09 20:22:53,173][25689] Fps is (10 sec: 5779.5, 60 sec: 5700.6, 300 sec: 5667.2). Total num frames: 415729664. Throughput: 0: 5946.9. Samples: 415728866. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:53,174][25689] Avg episode reward: [(0, '-48.445')] [2022-07-09 20:22:54,901][26022] Updated weights on worker 0-0, policy_version 405994 (0.00092) [2022-07-09 20:22:56,387][26022] Updated weights on worker 0-0, policy_version 406004 (0.00091) [2022-07-09 20:22:58,196][26022] Updated weights on worker 0-0, policy_version 406014 (0.00089) [2022-07-09 20:22:58,228][25689] Fps is (10 sec: 5775.1, 60 sec: 5668.4, 300 sec: 5670.6). Total num frames: 415758336. Throughput: 0: 5968.9. Samples: 415763136. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:22:58,229][25689] Avg episode reward: [(0, '-48.290')] [2022-07-09 20:22:59,932][26022] Updated weights on worker 0-0, policy_version 406024 (0.00095) [2022-07-09 20:23:01,788][26022] Updated weights on worker 0-0, policy_version 406034 (0.00092) [2022-07-09 20:23:03,286][25689] Fps is (10 sec: 5366.4, 60 sec: 5682.8, 300 sec: 5663.3). Total num frames: 415783936. Throughput: 0: 5098.3. Samples: 415780228. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:03,287][25689] Avg episode reward: [(0, '-47.984')] [2022-07-09 20:23:03,924][26022] Updated weights on worker 0-0, policy_version 406044 (0.00098) [2022-07-09 20:23:05,899][26022] Updated weights on worker 0-0, policy_version 406054 (0.00103) [2022-07-09 20:23:07,486][26022] Updated weights on worker 0-0, policy_version 406064 (0.00081) [2022-07-09 20:23:08,311][25689] Fps is (10 sec: 5585.8, 60 sec: 5704.0, 300 sec: 5673.9). Total num frames: 415814656. Throughput: 0: 5846.6. Samples: 415812474. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:08,311][25689] Avg episode reward: [(0, '-48.506')] [2022-07-09 20:23:09,415][26022] Updated weights on worker 0-0, policy_version 406074 (0.00090) [2022-07-09 20:23:10,999][26022] Updated weights on worker 0-0, policy_version 406084 (0.00083) [2022-07-09 20:23:12,917][26022] Updated weights on worker 0-0, policy_version 406094 (0.00094) [2022-07-09 20:23:13,399][25689] Fps is (10 sec: 5670.4, 60 sec: 5663.4, 300 sec: 5667.9). Total num frames: 415841280. Throughput: 0: 5834.7. Samples: 415846880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:13,400][25689] Avg episode reward: [(0, '-47.721')] [2022-07-09 20:23:14,661][26022] Updated weights on worker 0-0, policy_version 406104 (0.00082) [2022-07-09 20:23:16,659][26022] Updated weights on worker 0-0, policy_version 406114 (0.00095) [2022-07-09 20:23:18,221][26022] Updated weights on worker 0-0, policy_version 406124 (0.00085) [2022-07-09 20:23:18,444][25689] Fps is (10 sec: 5659.0, 60 sec: 5681.5, 300 sec: 5670.6). Total num frames: 415872000. Throughput: 0: 4979.5. Samples: 415863802. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:18,445][25689] Avg episode reward: [(0, '-46.408')] [2022-07-09 20:23:20,310][26022] Updated weights on worker 0-0, policy_version 406134 (0.00085) [2022-07-09 20:23:21,954][26022] Updated weights on worker 0-0, policy_version 406144 (0.00051) [2022-07-09 20:23:23,524][25689] Fps is (10 sec: 5765.0, 60 sec: 5660.5, 300 sec: 5662.7). Total num frames: 415899648. Throughput: 0: 5831.1. Samples: 415898236. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:23,525][25689] Avg episode reward: [(0, '-46.580')] [2022-07-09 20:23:23,806][26022] Updated weights on worker 0-0, policy_version 406154 (0.00094) [2022-07-09 20:23:25,479][26022] Updated weights on worker 0-0, policy_version 406164 (0.00095) [2022-07-09 20:23:27,351][26022] Updated weights on worker 0-0, policy_version 406174 (0.00095) [2022-07-09 20:23:28,542][25689] Fps is (10 sec: 5678.8, 60 sec: 5677.2, 300 sec: 5673.4). Total num frames: 415929344. Throughput: 0: 5940.4. Samples: 415932656. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:28,543][25689] Avg episode reward: [(0, '-46.049')] [2022-07-09 20:23:29,192][26022] Updated weights on worker 0-0, policy_version 406184 (0.00088) [2022-07-09 20:23:29,361][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:23:29,371][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000406185_415933440.pth [2022-07-09 20:23:29,372][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000404190_413890560.pth [2022-07-09 20:23:30,843][26022] Updated weights on worker 0-0, policy_version 406194 (0.00088) [2022-07-09 20:23:32,700][26022] Updated weights on worker 0-0, policy_version 406204 (0.00093) [2022-07-09 20:23:33,550][25689] Fps is (10 sec: 5821.5, 60 sec: 5693.8, 300 sec: 5668.1). Total num frames: 415958016. Throughput: 0: 5939.2. Samples: 415966560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:33,551][25689] Avg episode reward: [(0, '-44.817')] [2022-07-09 20:23:34,689][26022] Updated weights on worker 0-0, policy_version 406214 (0.00093) [2022-07-09 20:23:36,301][26022] Updated weights on worker 0-0, policy_version 406224 (0.00092) [2022-07-09 20:23:38,164][26022] Updated weights on worker 0-0, policy_version 406234 (0.00088) [2022-07-09 20:23:38,617][25689] Fps is (10 sec: 5590.2, 60 sec: 5678.0, 300 sec: 5666.9). Total num frames: 415985664. Throughput: 0: 5948.1. Samples: 415983790. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:38,617][25689] Avg episode reward: [(0, '-45.460')] [2022-07-09 20:23:39,848][26022] Updated weights on worker 0-0, policy_version 406244 (0.00091) [2022-07-09 20:23:41,638][26022] Updated weights on worker 0-0, policy_version 406254 (0.00087) [2022-07-09 20:23:43,449][26022] Updated weights on worker 0-0, policy_version 406264 (0.00080) [2022-07-09 20:23:43,673][25689] Fps is (10 sec: 5664.8, 60 sec: 5677.7, 300 sec: 5669.5). Total num frames: 416015360. Throughput: 0: 5959.5. Samples: 416018314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:43,674][25689] Avg episode reward: [(0, '-46.085')] [2022-07-09 20:23:45,245][26022] Updated weights on worker 0-0, policy_version 406274 (0.00088) [2022-07-09 20:23:46,898][26022] Updated weights on worker 0-0, policy_version 406284 (0.00086) [2022-07-09 20:23:48,687][25689] Fps is (10 sec: 5796.3, 60 sec: 5680.9, 300 sec: 5669.4). Total num frames: 416044032. Throughput: 0: 5973.5. Samples: 416052990. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:48,687][25689] Avg episode reward: [(0, '-46.548')] [2022-07-09 20:23:48,843][26022] Updated weights on worker 0-0, policy_version 406294 (0.00081) [2022-07-09 20:23:50,411][26022] Updated weights on worker 0-0, policy_version 406304 (0.00090) [2022-07-09 20:23:52,555][26022] Updated weights on worker 0-0, policy_version 406314 (0.00087) [2022-07-09 20:23:53,693][25689] Fps is (10 sec: 5723.2, 60 sec: 5668.3, 300 sec: 5668.3). Total num frames: 416072704. Throughput: 0: 5123.1. Samples: 416069752. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 20:23:53,694][25689] Avg episode reward: [(0, '-46.534')] [2022-07-09 20:23:54,035][26022] Updated weights on worker 0-0, policy_version 406324 (0.00082) [2022-07-09 20:23:56,151][26022] Updated weights on worker 0-0, policy_version 406334 (0.00116) [2022-07-09 20:23:57,602][26022] Updated weights on worker 0-0, policy_version 406344 (0.00613) [2022-07-09 20:23:58,744][25689] Fps is (10 sec: 5701.7, 60 sec: 5668.6, 300 sec: 5672.1). Total num frames: 416101376. Throughput: 0: 5972.7. Samples: 416104004. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:23:58,745][25689] Avg episode reward: [(0, '-46.806')] [2022-07-09 20:23:59,736][26022] Updated weights on worker 0-0, policy_version 406354 (0.00085) [2022-07-09 20:24:01,276][26022] Updated weights on worker 0-0, policy_version 406364 (0.00094) [2022-07-09 20:24:03,744][26022] Updated weights on worker 0-0, policy_version 406374 (0.00088) [2022-07-09 20:24:03,826][25689] Fps is (10 sec: 5456.8, 60 sec: 5683.3, 300 sec: 5671.2). Total num frames: 416128000. Throughput: 0: 5844.4. Samples: 416136096. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:03,827][25689] Avg episode reward: [(0, '-47.315')] [2022-07-09 20:24:05,298][26022] Updated weights on worker 0-0, policy_version 406384 (0.00085) [2022-07-09 20:24:07,212][26022] Updated weights on worker 0-0, policy_version 406394 (0.00090) [2022-07-09 20:24:08,864][25689] Fps is (10 sec: 5463.9, 60 sec: 5648.2, 300 sec: 5667.8). Total num frames: 416156672. Throughput: 0: 4959.7. Samples: 416153064. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:08,865][25689] Avg episode reward: [(0, '-47.269')] [2022-07-09 20:24:08,900][26022] Updated weights on worker 0-0, policy_version 406404 (0.00084) [2022-07-09 20:24:10,624][26022] Updated weights on worker 0-0, policy_version 406414 (0.00082) [2022-07-09 20:24:12,647][26022] Updated weights on worker 0-0, policy_version 406424 (0.00085) [2022-07-09 20:24:13,875][25689] Fps is (10 sec: 5808.5, 60 sec: 5706.2, 300 sec: 5672.5). Total num frames: 416186368. Throughput: 0: 5848.8. Samples: 416187794. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:13,875][25689] Avg episode reward: [(0, '-47.592')] [2022-07-09 20:24:14,205][26022] Updated weights on worker 0-0, policy_version 406434 (0.00087) [2022-07-09 20:24:16,271][26022] Updated weights on worker 0-0, policy_version 406444 (0.00086) [2022-07-09 20:24:17,649][26022] Updated weights on worker 0-0, policy_version 406454 (0.00091) [2022-07-09 20:24:18,997][25689] Fps is (10 sec: 5659.0, 60 sec: 5648.2, 300 sec: 5671.0). Total num frames: 416214016. Throughput: 0: 5838.7. Samples: 416222258. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:18,998][25689] Avg episode reward: [(0, '-47.602')] [2022-07-09 20:24:19,648][26022] Updated weights on worker 0-0, policy_version 406464 (0.00087) [2022-07-09 20:24:21,293][26022] Updated weights on worker 0-0, policy_version 406474 (0.00085) [2022-07-09 20:24:23,198][26022] Updated weights on worker 0-0, policy_version 406484 (0.00086) [2022-07-09 20:24:24,011][25689] Fps is (10 sec: 5657.5, 60 sec: 5688.2, 300 sec: 5674.3). Total num frames: 416243712. Throughput: 0: 5120.1. Samples: 416239444. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:24,011][25689] Avg episode reward: [(0, '-47.223')] [2022-07-09 20:24:24,981][26022] Updated weights on worker 0-0, policy_version 406494 (0.00087) [2022-07-09 20:24:26,870][26022] Updated weights on worker 0-0, policy_version 406504 (0.00097) [2022-07-09 20:24:28,547][26022] Updated weights on worker 0-0, policy_version 406514 (0.00084) [2022-07-09 20:24:29,021][25689] Fps is (10 sec: 5823.0, 60 sec: 5672.0, 300 sec: 5674.5). Total num frames: 416272384. Throughput: 0: 5979.3. Samples: 416273588. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:29,022][25689] Avg episode reward: [(0, '-47.870')] [2022-07-09 20:24:30,634][26022] Updated weights on worker 0-0, policy_version 406524 (0.00102) [2022-07-09 20:24:32,198][26022] Updated weights on worker 0-0, policy_version 406534 (0.00088) [2022-07-09 20:24:34,091][25689] Fps is (10 sec: 5688.5, 60 sec: 5666.2, 300 sec: 5674.2). Total num frames: 416301056. Throughput: 0: 5934.0. Samples: 416307760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:34,092][25689] Avg episode reward: [(0, '-48.152')] [2022-07-09 20:24:34,096][26022] Updated weights on worker 0-0, policy_version 406544 (0.00092) [2022-07-09 20:24:35,893][26022] Updated weights on worker 0-0, policy_version 406554 (0.00090) [2022-07-09 20:24:37,724][26022] Updated weights on worker 0-0, policy_version 406564 (0.01033) [2022-07-09 20:24:39,180][25689] Fps is (10 sec: 5543.8, 60 sec: 5664.1, 300 sec: 5667.1). Total num frames: 416328704. Throughput: 0: 5072.6. Samples: 416324638. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:39,181][25689] Avg episode reward: [(0, '-48.440')] [2022-07-09 20:24:39,479][26022] Updated weights on worker 0-0, policy_version 406574 (0.00091) [2022-07-09 20:24:41,185][26022] Updated weights on worker 0-0, policy_version 406584 (0.00093) [2022-07-09 20:24:43,023][26022] Updated weights on worker 0-0, policy_version 406594 (0.00077) [2022-07-09 20:24:44,259][25689] Fps is (10 sec: 5740.9, 60 sec: 5679.0, 300 sec: 5672.8). Total num frames: 416359424. Throughput: 0: 5899.1. Samples: 416358888. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:44,259][25689] Avg episode reward: [(0, '-48.098')] [2022-07-09 20:24:44,850][26022] Updated weights on worker 0-0, policy_version 406604 (0.00087) [2022-07-09 20:24:46,546][26022] Updated weights on worker 0-0, policy_version 406614 (0.00084) [2022-07-09 20:24:48,498][26022] Updated weights on worker 0-0, policy_version 406624 (0.00082) [2022-07-09 20:24:49,285][25689] Fps is (10 sec: 5776.5, 60 sec: 5660.9, 300 sec: 5669.5). Total num frames: 416387072. Throughput: 0: 5935.3. Samples: 416393858. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:49,287][25689] Avg episode reward: [(0, '-48.283')] [2022-07-09 20:24:49,968][26022] Updated weights on worker 0-0, policy_version 406634 (0.00086) [2022-07-09 20:24:52,072][26022] Updated weights on worker 0-0, policy_version 406644 (0.00087) [2022-07-09 20:24:53,498][26022] Updated weights on worker 0-0, policy_version 406654 (0.00088) [2022-07-09 20:24:54,299][25689] Fps is (10 sec: 5711.2, 60 sec: 5677.0, 300 sec: 5670.7). Total num frames: 416416768. Throughput: 0: 5119.4. Samples: 416411212. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:54,300][25689] Avg episode reward: [(0, '-47.949')] [2022-07-09 20:24:55,491][26022] Updated weights on worker 0-0, policy_version 406664 (0.00092) [2022-07-09 20:24:57,246][26022] Updated weights on worker 0-0, policy_version 406674 (0.00086) [2022-07-09 20:24:59,066][26022] Updated weights on worker 0-0, policy_version 406684 (0.00081) [2022-07-09 20:24:59,411][25689] Fps is (10 sec: 5764.2, 60 sec: 5671.4, 300 sec: 5673.4). Total num frames: 416445440. Throughput: 0: 5980.6. Samples: 416445628. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:24:59,411][25689] Avg episode reward: [(0, '-48.384')] [2022-07-09 20:25:00,783][26022] Updated weights on worker 0-0, policy_version 406694 (0.00112) [2022-07-09 20:25:03,084][26022] Updated weights on worker 0-0, policy_version 406704 (0.00083) [2022-07-09 20:25:04,439][25689] Fps is (10 sec: 5453.6, 60 sec: 5676.4, 300 sec: 5674.8). Total num frames: 416472064. Throughput: 0: 5885.3. Samples: 416477654. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:04,439][25689] Avg episode reward: [(0, '-47.373')] [2022-07-09 20:25:04,774][26022] Updated weights on worker 0-0, policy_version 406714 (0.00088) [2022-07-09 20:25:06,676][26022] Updated weights on worker 0-0, policy_version 406724 (0.00081) [2022-07-09 20:25:08,292][26022] Updated weights on worker 0-0, policy_version 406734 (0.00085) [2022-07-09 20:25:09,480][25689] Fps is (10 sec: 5390.1, 60 sec: 5659.3, 300 sec: 5663.8). Total num frames: 416499712. Throughput: 0: 4987.7. Samples: 416494584. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:09,481][25689] Avg episode reward: [(0, '-47.732')] [2022-07-09 20:25:10,284][26022] Updated weights on worker 0-0, policy_version 406744 (0.00089) [2022-07-09 20:25:12,070][26022] Updated weights on worker 0-0, policy_version 406754 (0.00088) [2022-07-09 20:25:13,738][26022] Updated weights on worker 0-0, policy_version 406764 (0.00085) [2022-07-09 20:25:14,502][25689] Fps is (10 sec: 5698.2, 60 sec: 5658.2, 300 sec: 5671.4). Total num frames: 416529408. Throughput: 0: 5820.2. Samples: 416528798. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:14,504][25689] Avg episode reward: [(0, '-48.467')] [2022-07-09 20:25:15,650][26022] Updated weights on worker 0-0, policy_version 406774 (0.00092) [2022-07-09 20:25:17,335][26022] Updated weights on worker 0-0, policy_version 406784 (0.00086) [2022-07-09 20:25:19,238][26022] Updated weights on worker 0-0, policy_version 406794 (0.00086) [2022-07-09 20:25:19,542][25689] Fps is (10 sec: 5800.7, 60 sec: 5682.8, 300 sec: 5671.0). Total num frames: 416558080. Throughput: 0: 5833.6. Samples: 416563064. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:19,542][25689] Avg episode reward: [(0, '-48.008')] [2022-07-09 20:25:21,097][26022] Updated weights on worker 0-0, policy_version 406804 (0.00091) [2022-07-09 20:25:22,724][26022] Updated weights on worker 0-0, policy_version 406814 (0.00087) [2022-07-09 20:25:24,508][26022] Updated weights on worker 0-0, policy_version 406824 (0.00079) [2022-07-09 20:25:24,561][25689] Fps is (10 sec: 5802.9, 60 sec: 5682.3, 300 sec: 5668.1). Total num frames: 416587776. Throughput: 0: 5097.4. Samples: 416580224. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:24,562][25689] Avg episode reward: [(0, '-48.098')] [2022-07-09 20:25:26,464][26022] Updated weights on worker 0-0, policy_version 406834 (0.00086) [2022-07-09 20:25:28,397][26022] Updated weights on worker 0-0, policy_version 406844 (0.00085) [2022-07-09 20:25:29,578][25689] Fps is (10 sec: 5611.6, 60 sec: 5647.8, 300 sec: 5671.9). Total num frames: 416614400. Throughput: 0: 5973.9. Samples: 416614652. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:29,579][25689] Avg episode reward: [(0, '-47.677')] [2022-07-09 20:25:29,679][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:25:29,690][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000406851_416615424.pth [2022-07-09 20:25:29,691][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000404855_414571520.pth [2022-07-09 20:25:30,009][26022] Updated weights on worker 0-0, policy_version 406854 (0.00089) [2022-07-09 20:25:31,666][26022] Updated weights on worker 0-0, policy_version 406864 (0.00095) [2022-07-09 20:25:33,555][26022] Updated weights on worker 0-0, policy_version 406874 (0.00088) [2022-07-09 20:25:34,610][25689] Fps is (10 sec: 5706.0, 60 sec: 5685.2, 300 sec: 5677.3). Total num frames: 416645120. Throughput: 0: 5970.6. Samples: 416648856. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:34,611][25689] Avg episode reward: [(0, '-47.432')] [2022-07-09 20:25:35,427][26022] Updated weights on worker 0-0, policy_version 406884 (0.00096) [2022-07-09 20:25:37,258][26022] Updated weights on worker 0-0, policy_version 406894 (0.00085) [2022-07-09 20:25:39,212][26022] Updated weights on worker 0-0, policy_version 406904 (0.00086) [2022-07-09 20:25:39,687][25689] Fps is (10 sec: 5774.2, 60 sec: 5686.4, 300 sec: 5672.9). Total num frames: 416672768. Throughput: 0: 5094.3. Samples: 416665688. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:39,687][25689] Avg episode reward: [(0, '-46.890')] [2022-07-09 20:25:40,808][26022] Updated weights on worker 0-0, policy_version 406914 (0.00093) [2022-07-09 20:25:42,573][26022] Updated weights on worker 0-0, policy_version 406924 (0.00080) [2022-07-09 20:25:44,360][26022] Updated weights on worker 0-0, policy_version 406934 (0.00086) [2022-07-09 20:25:44,727][25689] Fps is (10 sec: 5567.2, 60 sec: 5656.1, 300 sec: 5669.0). Total num frames: 416701440. Throughput: 0: 5946.6. Samples: 416700144. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:44,729][25689] Avg episode reward: [(0, '-47.252')] [2022-07-09 20:25:46,039][26022] Updated weights on worker 0-0, policy_version 406944 (0.00091) [2022-07-09 20:25:48,034][26022] Updated weights on worker 0-0, policy_version 406954 (0.00084) [2022-07-09 20:25:49,742][25689] Fps is (10 sec: 5703.1, 60 sec: 5674.1, 300 sec: 5670.5). Total num frames: 416730112. Throughput: 0: 5929.2. Samples: 416734204. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:49,742][25689] Avg episode reward: [(0, '-46.596')] [2022-07-09 20:25:49,829][26022] Updated weights on worker 0-0, policy_version 406964 (0.00087) [2022-07-09 20:25:51,543][26022] Updated weights on worker 0-0, policy_version 406974 (0.00086) [2022-07-09 20:25:53,437][26022] Updated weights on worker 0-0, policy_version 406984 (0.00089) [2022-07-09 20:25:54,746][25689] Fps is (10 sec: 5825.3, 60 sec: 5675.0, 300 sec: 5671.3). Total num frames: 416759808. Throughput: 0: 5956.4. Samples: 416768794. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:54,747][25689] Avg episode reward: [(0, '-46.685')] [2022-07-09 20:25:55,000][26022] Updated weights on worker 0-0, policy_version 406994 (0.00109) [2022-07-09 20:25:56,947][26022] Updated weights on worker 0-0, policy_version 407004 (0.00083) [2022-07-09 20:25:58,704][26022] Updated weights on worker 0-0, policy_version 407014 (0.00091) [2022-07-09 20:25:59,835][25689] Fps is (10 sec: 5782.9, 60 sec: 5677.2, 300 sec: 5680.1). Total num frames: 416788480. Throughput: 0: 5964.3. Samples: 416785858. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:25:59,835][25689] Avg episode reward: [(0, '-46.488')] [2022-07-09 20:26:00,343][26022] Updated weights on worker 0-0, policy_version 407024 (0.00093) [2022-07-09 20:26:02,879][26022] Updated weights on worker 0-0, policy_version 407034 (0.00083) [2022-07-09 20:26:04,315][26022] Updated weights on worker 0-0, policy_version 407044 (0.00083) [2022-07-09 20:26:04,855][25689] Fps is (10 sec: 5369.0, 60 sec: 5661.0, 300 sec: 5673.3). Total num frames: 416814080. Throughput: 0: 5859.0. Samples: 416818076. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:04,856][25689] Avg episode reward: [(0, '-47.636')] [2022-07-09 20:26:06,204][26022] Updated weights on worker 0-0, policy_version 407054 (0.00090) [2022-07-09 20:26:08,197][26022] Updated weights on worker 0-0, policy_version 407064 (0.00087) [2022-07-09 20:26:09,782][26022] Updated weights on worker 0-0, policy_version 407074 (0.00112) [2022-07-09 20:26:09,867][25689] Fps is (10 sec: 5512.0, 60 sec: 5697.6, 300 sec: 5673.4). Total num frames: 416843776. Throughput: 0: 5873.7. Samples: 416852412. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:09,867][25689] Avg episode reward: [(0, '-47.563')] [2022-07-09 20:26:11,696][26022] Updated weights on worker 0-0, policy_version 407084 (0.00111) [2022-07-09 20:26:13,573][26022] Updated weights on worker 0-0, policy_version 407094 (0.00088) [2022-07-09 20:26:14,897][25689] Fps is (10 sec: 5812.3, 60 sec: 5679.9, 300 sec: 5671.3). Total num frames: 416872448. Throughput: 0: 5013.0. Samples: 416869810. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:14,898][25689] Avg episode reward: [(0, '-46.749')] [2022-07-09 20:26:15,117][26022] Updated weights on worker 0-0, policy_version 407104 (0.00095) [2022-07-09 20:26:17,119][26022] Updated weights on worker 0-0, policy_version 407114 (0.00089) [2022-07-09 20:26:18,696][26022] Updated weights on worker 0-0, policy_version 407124 (0.00096) [2022-07-09 20:26:19,952][25689] Fps is (10 sec: 5584.3, 60 sec: 5661.5, 300 sec: 5671.2). Total num frames: 416900096. Throughput: 0: 5883.3. Samples: 416904214. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:19,954][25689] Avg episode reward: [(0, '-48.581')] [2022-07-09 20:26:20,660][26022] Updated weights on worker 0-0, policy_version 407134 (0.00083) [2022-07-09 20:26:22,387][26022] Updated weights on worker 0-0, policy_version 407144 (0.00092) [2022-07-09 20:26:24,175][26022] Updated weights on worker 0-0, policy_version 407154 (0.00094) [2022-07-09 20:26:24,955][25689] Fps is (10 sec: 5701.4, 60 sec: 5663.0, 300 sec: 5671.9). Total num frames: 416929792. Throughput: 0: 5998.7. Samples: 416938650. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:24,955][25689] Avg episode reward: [(0, '-48.815')] [2022-07-09 20:26:26,053][26022] Updated weights on worker 0-0, policy_version 407164 (0.00091) [2022-07-09 20:26:27,623][26022] Updated weights on worker 0-0, policy_version 407174 (0.00098) [2022-07-09 20:26:29,588][26022] Updated weights on worker 0-0, policy_version 407184 (0.00091) [2022-07-09 20:26:29,971][25689] Fps is (10 sec: 5723.8, 60 sec: 5680.2, 300 sec: 5672.0). Total num frames: 416957440. Throughput: 0: 5133.1. Samples: 416955608. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:29,971][25689] Avg episode reward: [(0, '-48.531')] [2022-07-09 20:26:31,239][26022] Updated weights on worker 0-0, policy_version 407194 (0.00091) [2022-07-09 20:26:33,347][26022] Updated weights on worker 0-0, policy_version 407204 (0.00085) [2022-07-09 20:26:34,928][26022] Updated weights on worker 0-0, policy_version 407214 (0.00087) [2022-07-09 20:26:34,980][25689] Fps is (10 sec: 5720.1, 60 sec: 5665.4, 300 sec: 5676.8). Total num frames: 416987136. Throughput: 0: 5975.4. Samples: 416989814. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-09 20:26:34,981][25689] Avg episode reward: [(0, '-48.019')] [2022-07-09 20:26:37,031][26022] Updated weights on worker 0-0, policy_version 407224 (0.00084) [2022-07-09 20:26:38,504][26022] Updated weights on worker 0-0, policy_version 407234 (0.00084) [2022-07-09 20:26:40,012][25689] Fps is (10 sec: 5711.0, 60 sec: 5669.6, 300 sec: 5670.7). Total num frames: 417014784. Throughput: 0: 5971.6. Samples: 417024002. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:26:40,012][25689] Avg episode reward: [(0, '-48.791')] [2022-07-09 20:26:40,548][26022] Updated weights on worker 0-0, policy_version 407244 (0.00086) [2022-07-09 20:26:42,127][26022] Updated weights on worker 0-0, policy_version 407254 (0.00095) [2022-07-09 20:26:43,970][26022] Updated weights on worker 0-0, policy_version 407264 (0.00080) [2022-07-09 20:26:45,013][25689] Fps is (10 sec: 5715.4, 60 sec: 5690.2, 300 sec: 5674.5). Total num frames: 417044480. Throughput: 0: 5119.2. Samples: 417041332. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:26:45,014][25689] Avg episode reward: [(0, '-48.500')] [2022-07-09 20:26:45,651][26022] Updated weights on worker 0-0, policy_version 407274 (0.00093) [2022-07-09 20:26:47,608][26022] Updated weights on worker 0-0, policy_version 407284 (0.00093) [2022-07-09 20:26:49,143][26022] Updated weights on worker 0-0, policy_version 407294 (0.00091) [2022-07-09 20:26:50,039][25689] Fps is (10 sec: 5718.7, 60 sec: 5672.1, 300 sec: 5674.3). Total num frames: 417072128. Throughput: 0: 5998.6. Samples: 417075990. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:26:50,042][25689] Avg episode reward: [(0, '-47.366')] [2022-07-09 20:26:51,114][26022] Updated weights on worker 0-0, policy_version 407304 (0.00086) [2022-07-09 20:26:52,812][26022] Updated weights on worker 0-0, policy_version 407314 (0.00165) [2022-07-09 20:26:54,640][26022] Updated weights on worker 0-0, policy_version 407324 (0.00091) [2022-07-09 20:26:55,066][25689] Fps is (10 sec: 5704.0, 60 sec: 5670.0, 300 sec: 5671.7). Total num frames: 417101824. Throughput: 0: 6002.6. Samples: 417110384. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:26:55,067][25689] Avg episode reward: [(0, '-48.064')] [2022-07-09 20:26:56,473][26022] Updated weights on worker 0-0, policy_version 407334 (0.00087) [2022-07-09 20:26:58,195][26022] Updated weights on worker 0-0, policy_version 407344 (0.00093) [2022-07-09 20:26:59,940][26022] Updated weights on worker 0-0, policy_version 407354 (0.00085) [2022-07-09 20:27:00,110][25689] Fps is (10 sec: 5897.7, 60 sec: 5691.3, 300 sec: 5688.7). Total num frames: 417131520. Throughput: 0: 5147.4. Samples: 417127454. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:00,110][25689] Avg episode reward: [(0, '-48.585')] [2022-07-09 20:27:01,888][26022] Updated weights on worker 0-0, policy_version 407364 (0.00097) [2022-07-09 20:27:04,152][26022] Updated weights on worker 0-0, policy_version 407374 (0.00093) [2022-07-09 20:27:05,111][25689] Fps is (10 sec: 5505.0, 60 sec: 5693.0, 300 sec: 5676.3). Total num frames: 417157120. Throughput: 0: 5866.2. Samples: 417159230. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:05,112][25689] Avg episode reward: [(0, '-48.135')] [2022-07-09 20:27:05,819][26022] Updated weights on worker 0-0, policy_version 407384 (0.00086) [2022-07-09 20:27:07,756][26022] Updated weights on worker 0-0, policy_version 407394 (0.00079) [2022-07-09 20:27:09,475][26022] Updated weights on worker 0-0, policy_version 407404 (0.00089) [2022-07-09 20:27:10,126][25689] Fps is (10 sec: 5213.6, 60 sec: 5641.7, 300 sec: 5669.4). Total num frames: 417183744. Throughput: 0: 5842.5. Samples: 417193350. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:10,127][25689] Avg episode reward: [(0, '-48.388')] [2022-07-09 20:27:11,375][26022] Updated weights on worker 0-0, policy_version 407414 (0.00084) [2022-07-09 20:27:13,048][26022] Updated weights on worker 0-0, policy_version 407424 (0.00093) [2022-07-09 20:27:14,883][26022] Updated weights on worker 0-0, policy_version 407434 (0.00086) [2022-07-09 20:27:15,160][25689] Fps is (10 sec: 5604.7, 60 sec: 5658.4, 300 sec: 5669.9). Total num frames: 417213440. Throughput: 0: 4976.7. Samples: 417210384. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:15,161][25689] Avg episode reward: [(0, '-48.712')] [2022-07-09 20:27:16,551][26022] Updated weights on worker 0-0, policy_version 407444 (0.00085) [2022-07-09 20:27:18,445][26022] Updated weights on worker 0-0, policy_version 407454 (0.00420) [2022-07-09 20:27:20,234][25689] Fps is (10 sec: 5774.5, 60 sec: 5673.6, 300 sec: 5669.1). Total num frames: 417242112. Throughput: 0: 5816.9. Samples: 417244518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:20,235][25689] Avg episode reward: [(0, '-47.835')] [2022-07-09 20:27:20,280][26022] Updated weights on worker 0-0, policy_version 407464 (0.00090) [2022-07-09 20:27:22,139][26022] Updated weights on worker 0-0, policy_version 407474 (0.00093) [2022-07-09 20:27:23,867][26022] Updated weights on worker 0-0, policy_version 407484 (0.00092) [2022-07-09 20:27:25,269][25689] Fps is (10 sec: 5773.7, 60 sec: 5670.6, 300 sec: 5672.2). Total num frames: 417271808. Throughput: 0: 5945.8. Samples: 417279086. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:25,271][25689] Avg episode reward: [(0, '-47.527')] [2022-07-09 20:27:25,563][26022] Updated weights on worker 0-0, policy_version 407494 (0.00086) [2022-07-09 20:27:27,442][26022] Updated weights on worker 0-0, policy_version 407504 (0.00087) [2022-07-09 20:27:29,192][26022] Updated weights on worker 0-0, policy_version 407514 (0.00097) [2022-07-09 20:27:29,717][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:27:29,724][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000407517_417297408.pth [2022-07-09 20:27:29,725][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000405520_415252480.pth [2022-07-09 20:27:30,280][25689] Fps is (10 sec: 5708.2, 60 sec: 5671.0, 300 sec: 5672.1). Total num frames: 417299456. Throughput: 0: 5108.6. Samples: 417296308. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:30,282][25689] Avg episode reward: [(0, '-47.235')] [2022-07-09 20:27:31,062][26022] Updated weights on worker 0-0, policy_version 407524 (0.00089) [2022-07-09 20:27:32,879][26022] Updated weights on worker 0-0, policy_version 407534 (0.00094) [2022-07-09 20:27:34,581][26022] Updated weights on worker 0-0, policy_version 407544 (0.00090) [2022-07-09 20:27:35,364][25689] Fps is (10 sec: 5578.8, 60 sec: 5647.0, 300 sec: 5672.0). Total num frames: 417328128. Throughput: 0: 5949.0. Samples: 417330582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:35,366][25689] Avg episode reward: [(0, '-47.332')] [2022-07-09 20:27:36,400][26022] Updated weights on worker 0-0, policy_version 407554 (0.00087) [2022-07-09 20:27:38,209][26022] Updated weights on worker 0-0, policy_version 407564 (0.00092) [2022-07-09 20:27:40,118][26022] Updated weights on worker 0-0, policy_version 407574 (0.00092) [2022-07-09 20:27:40,477][25689] Fps is (10 sec: 5623.7, 60 sec: 5656.4, 300 sec: 5667.4). Total num frames: 417356800. Throughput: 0: 5914.4. Samples: 417364242. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:40,478][25689] Avg episode reward: [(0, '-47.268')] [2022-07-09 20:27:41,903][26022] Updated weights on worker 0-0, policy_version 407584 (0.00092) [2022-07-09 20:27:43,703][26022] Updated weights on worker 0-0, policy_version 407594 (0.00080) [2022-07-09 20:27:45,553][25689] Fps is (10 sec: 5628.0, 60 sec: 5632.5, 300 sec: 5666.9). Total num frames: 417385472. Throughput: 0: 5038.8. Samples: 417381296. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:45,556][25689] Avg episode reward: [(0, '-47.179')] [2022-07-09 20:27:45,676][26022] Updated weights on worker 0-0, policy_version 407604 (0.00088) [2022-07-09 20:27:47,275][26022] Updated weights on worker 0-0, policy_version 407614 (0.00095) [2022-07-09 20:27:49,142][26022] Updated weights on worker 0-0, policy_version 407624 (0.00086) [2022-07-09 20:27:50,572][25689] Fps is (10 sec: 5782.0, 60 sec: 5667.0, 300 sec: 5667.5). Total num frames: 417415168. Throughput: 0: 5870.9. Samples: 417415438. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:50,574][25689] Avg episode reward: [(0, '-47.842')] [2022-07-09 20:27:50,777][26022] Updated weights on worker 0-0, policy_version 407634 (0.00088) [2022-07-09 20:27:52,674][26022] Updated weights on worker 0-0, policy_version 407644 (0.00086) [2022-07-09 20:27:54,318][26022] Updated weights on worker 0-0, policy_version 407654 (0.00090) [2022-07-09 20:27:55,587][25689] Fps is (10 sec: 5817.3, 60 sec: 5651.2, 300 sec: 5668.3). Total num frames: 417443840. Throughput: 0: 5919.6. Samples: 417450292. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:27:55,587][25689] Avg episode reward: [(0, '-47.609')] [2022-07-09 20:27:56,408][26022] Updated weights on worker 0-0, policy_version 407664 (0.00088) [2022-07-09 20:27:57,976][26022] Updated weights on worker 0-0, policy_version 407674 (0.00087) [2022-07-09 20:27:59,803][26022] Updated weights on worker 0-0, policy_version 407684 (0.00097) [2022-07-09 20:28:00,650][25689] Fps is (10 sec: 5588.5, 60 sec: 5615.6, 300 sec: 5675.0). Total num frames: 417471488. Throughput: 0: 5114.5. Samples: 417467416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:00,650][25689] Avg episode reward: [(0, '-48.126')] [2022-07-09 20:28:01,422][26022] Updated weights on worker 0-0, policy_version 407694 (0.00086) [2022-07-09 20:28:03,925][26022] Updated weights on worker 0-0, policy_version 407704 (0.00087) [2022-07-09 20:28:05,347][26022] Updated weights on worker 0-0, policy_version 407714 (0.00083) [2022-07-09 20:28:05,679][25689] Fps is (10 sec: 5479.4, 60 sec: 5646.9, 300 sec: 5664.6). Total num frames: 417499136. Throughput: 0: 5880.7. Samples: 417499646. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:05,679][25689] Avg episode reward: [(0, '-48.843')] [2022-07-09 20:28:07,481][26022] Updated weights on worker 0-0, policy_version 407724 (0.00091) [2022-07-09 20:28:09,029][26022] Updated weights on worker 0-0, policy_version 407734 (0.00099) [2022-07-09 20:28:10,698][25689] Fps is (10 sec: 5502.9, 60 sec: 5663.4, 300 sec: 5669.4). Total num frames: 417526784. Throughput: 0: 5895.7. Samples: 417534096. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:10,699][25689] Avg episode reward: [(0, '-47.864')] [2022-07-09 20:28:11,115][26022] Updated weights on worker 0-0, policy_version 407744 (0.00085) [2022-07-09 20:28:12,483][26022] Updated weights on worker 0-0, policy_version 407754 (0.00089) [2022-07-09 20:28:14,717][26022] Updated weights on worker 0-0, policy_version 407764 (0.00083) [2022-07-09 20:28:15,706][25689] Fps is (10 sec: 5922.8, 60 sec: 5699.6, 300 sec: 5673.5). Total num frames: 417558528. Throughput: 0: 5020.9. Samples: 417551310. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:15,707][25689] Avg episode reward: [(0, '-47.806')] [2022-07-09 20:28:16,121][26022] Updated weights on worker 0-0, policy_version 407774 (0.00084) [2022-07-09 20:28:18,081][26022] Updated weights on worker 0-0, policy_version 407784 (0.00079) [2022-07-09 20:28:19,776][26022] Updated weights on worker 0-0, policy_version 407794 (0.00092) [2022-07-09 20:28:20,746][25689] Fps is (10 sec: 5911.0, 60 sec: 5685.9, 300 sec: 5674.3). Total num frames: 417586176. Throughput: 0: 5888.2. Samples: 417585746. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:20,746][25689] Avg episode reward: [(0, '-47.707')] [2022-07-09 20:28:21,622][26022] Updated weights on worker 0-0, policy_version 407804 (0.00067) [2022-07-09 20:28:23,354][26022] Updated weights on worker 0-0, policy_version 407814 (0.00084) [2022-07-09 20:28:25,484][26022] Updated weights on worker 0-0, policy_version 407824 (0.00088) [2022-07-09 20:28:25,846][25689] Fps is (10 sec: 5453.5, 60 sec: 5646.0, 300 sec: 5665.9). Total num frames: 417613824. Throughput: 0: 5965.2. Samples: 417619948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:25,846][25689] Avg episode reward: [(0, '-48.125')] [2022-07-09 20:28:26,793][26022] Updated weights on worker 0-0, policy_version 407834 (0.00084) [2022-07-09 20:28:29,148][26022] Updated weights on worker 0-0, policy_version 407844 (0.00090) [2022-07-09 20:28:30,498][26022] Updated weights on worker 0-0, policy_version 407854 (0.00087) [2022-07-09 20:28:30,855][25689] Fps is (10 sec: 5672.7, 60 sec: 5680.0, 300 sec: 5669.3). Total num frames: 417643520. Throughput: 0: 5096.7. Samples: 417636828. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:30,855][25689] Avg episode reward: [(0, '-47.924')] [2022-07-09 20:28:32,570][26022] Updated weights on worker 0-0, policy_version 407864 (0.00086) [2022-07-09 20:28:34,242][26022] Updated weights on worker 0-0, policy_version 407874 (0.00084) [2022-07-09 20:28:35,858][25689] Fps is (10 sec: 5727.7, 60 sec: 5670.7, 300 sec: 5670.5). Total num frames: 417671168. Throughput: 0: 5939.4. Samples: 417670996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:35,859][25689] Avg episode reward: [(0, '-47.947')] [2022-07-09 20:28:36,051][26022] Updated weights on worker 0-0, policy_version 407884 (0.00056) [2022-07-09 20:28:37,864][26022] Updated weights on worker 0-0, policy_version 407894 (0.00092) [2022-07-09 20:28:39,871][26022] Updated weights on worker 0-0, policy_version 407904 (0.00093) [2022-07-09 20:28:40,926][25689] Fps is (10 sec: 5592.3, 60 sec: 5674.9, 300 sec: 5666.8). Total num frames: 417699840. Throughput: 0: 5912.5. Samples: 417705060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:40,926][25689] Avg episode reward: [(0, '-48.097')] [2022-07-09 20:28:41,286][26022] Updated weights on worker 0-0, policy_version 407914 (0.00807) [2022-07-09 20:28:43,503][26022] Updated weights on worker 0-0, policy_version 407924 (0.00089) [2022-07-09 20:28:45,031][26022] Updated weights on worker 0-0, policy_version 407934 (0.00850) [2022-07-09 20:28:45,937][25689] Fps is (10 sec: 5689.0, 60 sec: 5681.0, 300 sec: 5666.9). Total num frames: 417728512. Throughput: 0: 5090.3. Samples: 417722220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:45,938][25689] Avg episode reward: [(0, '-48.678')] [2022-07-09 20:28:46,937][26022] Updated weights on worker 0-0, policy_version 407944 (0.00093) [2022-07-09 20:28:48,618][26022] Updated weights on worker 0-0, policy_version 407954 (0.00091) [2022-07-09 20:28:50,692][26022] Updated weights on worker 0-0, policy_version 407964 (0.00091) [2022-07-09 20:28:50,950][25689] Fps is (10 sec: 5516.0, 60 sec: 5630.6, 300 sec: 5659.8). Total num frames: 417755136. Throughput: 0: 5944.1. Samples: 417756278. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:50,951][25689] Avg episode reward: [(0, '-47.311')] [2022-07-09 20:28:52,283][26022] Updated weights on worker 0-0, policy_version 407974 (0.00089) [2022-07-09 20:28:54,436][26022] Updated weights on worker 0-0, policy_version 407984 (0.00991) [2022-07-09 20:28:55,789][26022] Updated weights on worker 0-0, policy_version 407994 (0.00083) [2022-07-09 20:28:55,994][25689] Fps is (10 sec: 5701.9, 60 sec: 5661.8, 300 sec: 5666.9). Total num frames: 417785856. Throughput: 0: 5913.8. Samples: 417790080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:28:55,995][25689] Avg episode reward: [(0, '-46.828')] [2022-07-09 20:28:58,052][26022] Updated weights on worker 0-0, policy_version 408004 (0.00085) [2022-07-09 20:28:59,412][26022] Updated weights on worker 0-0, policy_version 408014 (0.00086) [2022-07-09 20:29:01,118][25689] Fps is (10 sec: 5740.3, 60 sec: 5656.1, 300 sec: 5669.5). Total num frames: 417813504. Throughput: 0: 5059.4. Samples: 417807222. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:29:01,119][25689] Avg episode reward: [(0, '-46.356')] [2022-07-09 20:29:01,503][26022] Updated weights on worker 0-0, policy_version 408024 (0.00087) [2022-07-09 20:29:03,507][26022] Updated weights on worker 0-0, policy_version 408034 (0.00090) [2022-07-09 20:29:05,451][26022] Updated weights on worker 0-0, policy_version 408044 (0.00082) [2022-07-09 20:29:06,147][25689] Fps is (10 sec: 5446.6, 60 sec: 5656.1, 300 sec: 5666.3). Total num frames: 417841152. Throughput: 0: 5801.0. Samples: 417839454. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:29:06,147][25689] Avg episode reward: [(0, '-46.481')] [2022-07-09 20:29:07,002][26022] Updated weights on worker 0-0, policy_version 408054 (0.00086) [2022-07-09 20:29:09,052][26022] Updated weights on worker 0-0, policy_version 408064 (0.00092) [2022-07-09 20:29:10,767][26022] Updated weights on worker 0-0, policy_version 408074 (0.00075) [2022-07-09 20:29:11,158][25689] Fps is (10 sec: 5609.9, 60 sec: 5673.9, 300 sec: 5662.8). Total num frames: 417869824. Throughput: 0: 5796.5. Samples: 417873410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:29:11,158][25689] Avg episode reward: [(0, '-46.616')] [2022-07-09 20:29:12,643][26022] Updated weights on worker 0-0, policy_version 408084 (0.00079) [2022-07-09 20:29:14,245][26022] Updated weights on worker 0-0, policy_version 408094 (0.00109) [2022-07-09 20:29:16,186][25689] Fps is (10 sec: 5610.2, 60 sec: 5604.3, 300 sec: 5664.6). Total num frames: 417897472. Throughput: 0: 5821.9. Samples: 417907632. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-09 20:29:16,186][25689] Avg episode reward: [(0, '-46.580')] [2022-07-09 20:29:16,303][26022] Updated weights on worker 0-0, policy_version 408104 (0.00134) [2022-07-09 20:29:17,804][26022] Updated weights on worker 0-0, policy_version 408114 (0.00088) [2022-07-09 20:29:19,960][26022] Updated weights on worker 0-0, policy_version 408124 (0.00088) [2022-07-09 20:29:21,241][25689] Fps is (10 sec: 5686.8, 60 sec: 5636.6, 300 sec: 5663.8). Total num frames: 417927168. Throughput: 0: 5833.4. Samples: 417924608. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:21,242][25689] Avg episode reward: [(0, '-47.036')] [2022-07-09 20:29:21,472][26022] Updated weights on worker 0-0, policy_version 408134 (0.00091) [2022-07-09 20:29:23,319][26022] Updated weights on worker 0-0, policy_version 408144 (0.00079) [2022-07-09 20:29:25,269][26022] Updated weights on worker 0-0, policy_version 408154 (0.00085) [2022-07-09 20:29:26,243][25689] Fps is (10 sec: 5803.7, 60 sec: 5662.8, 300 sec: 5664.0). Total num frames: 417955840. Throughput: 0: 5936.8. Samples: 417958760. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:26,243][25689] Avg episode reward: [(0, '-47.556')] [2022-07-09 20:29:26,865][26022] Updated weights on worker 0-0, policy_version 408164 (0.00086) [2022-07-09 20:29:28,967][26022] Updated weights on worker 0-0, policy_version 408174 (0.00092) [2022-07-09 20:29:29,900][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:29:29,914][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000408180_417976320.pth [2022-07-09 20:29:29,915][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000406185_415933440.pth [2022-07-09 20:29:30,658][26022] Updated weights on worker 0-0, policy_version 408184 (0.00084) [2022-07-09 20:29:31,256][25689] Fps is (10 sec: 5521.5, 60 sec: 5611.5, 300 sec: 5658.2). Total num frames: 417982464. Throughput: 0: 5941.8. Samples: 417992830. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:31,257][25689] Avg episode reward: [(0, '-48.075')] [2022-07-09 20:29:32,423][26022] Updated weights on worker 0-0, policy_version 408194 (0.00086) [2022-07-09 20:29:34,299][26022] Updated weights on worker 0-0, policy_version 408204 (0.00090) [2022-07-09 20:29:35,917][26022] Updated weights on worker 0-0, policy_version 408214 (0.00086) [2022-07-09 20:29:36,261][25689] Fps is (10 sec: 5621.8, 60 sec: 5645.3, 300 sec: 5666.7). Total num frames: 418012160. Throughput: 0: 5095.5. Samples: 418009926. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:36,261][25689] Avg episode reward: [(0, '-47.874')] [2022-07-09 20:29:37,892][26022] Updated weights on worker 0-0, policy_version 408224 (0.00090) [2022-07-09 20:29:39,603][26022] Updated weights on worker 0-0, policy_version 408234 (0.00097) [2022-07-09 20:29:41,334][25689] Fps is (10 sec: 5791.5, 60 sec: 5644.8, 300 sec: 5659.9). Total num frames: 418040832. Throughput: 0: 5939.7. Samples: 418043954. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:41,335][25689] Avg episode reward: [(0, '-48.194')] [2022-07-09 20:29:41,457][26022] Updated weights on worker 0-0, policy_version 408244 (0.00091) [2022-07-09 20:29:43,347][26022] Updated weights on worker 0-0, policy_version 408254 (0.00087) [2022-07-09 20:29:45,071][26022] Updated weights on worker 0-0, policy_version 408264 (0.00087) [2022-07-09 20:29:46,379][25689] Fps is (10 sec: 5667.3, 60 sec: 5641.6, 300 sec: 5663.0). Total num frames: 418069504. Throughput: 0: 5933.1. Samples: 418078232. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:46,380][25689] Avg episode reward: [(0, '-48.377')] [2022-07-09 20:29:46,799][26022] Updated weights on worker 0-0, policy_version 408274 (0.00089) [2022-07-09 20:29:48,672][26022] Updated weights on worker 0-0, policy_version 408284 (0.00095) [2022-07-09 20:29:50,437][26022] Updated weights on worker 0-0, policy_version 408294 (0.00083) [2022-07-09 20:29:51,384][25689] Fps is (10 sec: 5604.3, 60 sec: 5659.4, 300 sec: 5656.2). Total num frames: 418097152. Throughput: 0: 5091.3. Samples: 418095306. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:51,384][25689] Avg episode reward: [(0, '-49.516')] [2022-07-09 20:29:52,104][26022] Updated weights on worker 0-0, policy_version 408304 (0.00086) [2022-07-09 20:29:54,180][26022] Updated weights on worker 0-0, policy_version 408314 (0.00089) [2022-07-09 20:29:55,654][26022] Updated weights on worker 0-0, policy_version 408324 (0.00083) [2022-07-09 20:29:56,409][25689] Fps is (10 sec: 5717.7, 60 sec: 5644.2, 300 sec: 5661.3). Total num frames: 418126848. Throughput: 0: 5945.6. Samples: 418129716. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:29:56,409][25689] Avg episode reward: [(0, '-49.574')] [2022-07-09 20:29:57,699][26022] Updated weights on worker 0-0, policy_version 408334 (0.00085) [2022-07-09 20:29:59,343][26022] Updated weights on worker 0-0, policy_version 408344 (0.00075) [2022-07-09 20:30:01,239][26022] Updated weights on worker 0-0, policy_version 408354 (0.00093) [2022-07-09 20:30:01,466][25689] Fps is (10 sec: 5890.9, 60 sec: 5684.4, 300 sec: 5671.1). Total num frames: 418156544. Throughput: 0: 5949.6. Samples: 418163728. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:01,466][25689] Avg episode reward: [(0, '-48.842')] [2022-07-09 20:30:03,582][26022] Updated weights on worker 0-0, policy_version 408364 (0.00093) [2022-07-09 20:30:05,148][26022] Updated weights on worker 0-0, policy_version 408374 (0.00092) [2022-07-09 20:30:06,470][25689] Fps is (10 sec: 5393.8, 60 sec: 5635.7, 300 sec: 5661.5). Total num frames: 418181120. Throughput: 0: 4991.8. Samples: 418178524. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:06,471][25689] Avg episode reward: [(0, '-48.508')] [2022-07-09 20:30:07,112][26022] Updated weights on worker 0-0, policy_version 408384 (0.01014) [2022-07-09 20:30:08,834][26022] Updated weights on worker 0-0, policy_version 408394 (0.00092) [2022-07-09 20:30:10,563][26022] Updated weights on worker 0-0, policy_version 408404 (0.00098) [2022-07-09 20:30:11,529][25689] Fps is (10 sec: 5393.0, 60 sec: 5648.2, 300 sec: 5660.8). Total num frames: 418210816. Throughput: 0: 5831.2. Samples: 418212778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:11,531][25689] Avg episode reward: [(0, '-47.140')] [2022-07-09 20:30:12,594][26022] Updated weights on worker 0-0, policy_version 408414 (0.00094) [2022-07-09 20:30:14,189][26022] Updated weights on worker 0-0, policy_version 408424 (0.00090) [2022-07-09 20:30:16,297][26022] Updated weights on worker 0-0, policy_version 408434 (0.00086) [2022-07-09 20:30:16,535][25689] Fps is (10 sec: 5799.6, 60 sec: 5667.3, 300 sec: 5661.4). Total num frames: 418239488. Throughput: 0: 5829.0. Samples: 418247030. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:16,535][25689] Avg episode reward: [(0, '-46.711')] [2022-07-09 20:30:17,695][26022] Updated weights on worker 0-0, policy_version 408444 (0.00085) [2022-07-09 20:30:19,598][26022] Updated weights on worker 0-0, policy_version 408454 (0.00090) [2022-07-09 20:30:21,418][26022] Updated weights on worker 0-0, policy_version 408464 (0.00087) [2022-07-09 20:30:21,636][25689] Fps is (10 sec: 5673.7, 60 sec: 5646.1, 300 sec: 5656.4). Total num frames: 418268160. Throughput: 0: 4972.8. Samples: 418264032. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:21,637][25689] Avg episode reward: [(0, '-46.771')] [2022-07-09 20:30:23,027][26022] Updated weights on worker 0-0, policy_version 408474 (0.00086) [2022-07-09 20:30:25,139][26022] Updated weights on worker 0-0, policy_version 408484 (0.00088) [2022-07-09 20:30:26,658][25689] Fps is (10 sec: 5664.5, 60 sec: 5644.1, 300 sec: 5663.2). Total num frames: 418296832. Throughput: 0: 5944.0. Samples: 418298520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:26,659][25689] Avg episode reward: [(0, '-47.099')] [2022-07-09 20:30:26,755][26022] Updated weights on worker 0-0, policy_version 408494 (0.00087) [2022-07-09 20:30:28,740][26022] Updated weights on worker 0-0, policy_version 408504 (0.00086) [2022-07-09 20:30:30,443][26022] Updated weights on worker 0-0, policy_version 408514 (0.00093) [2022-07-09 20:30:31,729][25689] Fps is (10 sec: 5681.7, 60 sec: 5672.6, 300 sec: 5655.6). Total num frames: 418325504. Throughput: 0: 5942.2. Samples: 418332810. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:31,730][25689] Avg episode reward: [(0, '-47.236')] [2022-07-09 20:30:32,106][26022] Updated weights on worker 0-0, policy_version 408524 (0.00089) [2022-07-09 20:30:34,069][26022] Updated weights on worker 0-0, policy_version 408534 (0.00091) [2022-07-09 20:30:35,687][26022] Updated weights on worker 0-0, policy_version 408544 (0.00087) [2022-07-09 20:30:36,757][25689] Fps is (10 sec: 5475.5, 60 sec: 5619.7, 300 sec: 5653.1). Total num frames: 418352128. Throughput: 0: 5080.9. Samples: 418349778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:36,757][25689] Avg episode reward: [(0, '-47.503')] [2022-07-09 20:30:37,627][26022] Updated weights on worker 0-0, policy_version 408554 (0.00093) [2022-07-09 20:30:39,336][26022] Updated weights on worker 0-0, policy_version 408564 (0.00113) [2022-07-09 20:30:41,220][26022] Updated weights on worker 0-0, policy_version 408574 (0.00090) [2022-07-09 20:30:41,845][25689] Fps is (10 sec: 5770.0, 60 sec: 5669.1, 300 sec: 5662.5). Total num frames: 418383872. Throughput: 0: 5932.4. Samples: 418383916. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:41,845][25689] Avg episode reward: [(0, '-47.969')] [2022-07-09 20:30:42,931][26022] Updated weights on worker 0-0, policy_version 408584 (0.00086) [2022-07-09 20:30:44,683][26022] Updated weights on worker 0-0, policy_version 408594 (0.00085) [2022-07-09 20:30:46,692][26022] Updated weights on worker 0-0, policy_version 408604 (0.00088) [2022-07-09 20:30:46,873][25689] Fps is (10 sec: 5769.9, 60 sec: 5636.8, 300 sec: 5655.4). Total num frames: 418410496. Throughput: 0: 5917.2. Samples: 418418134. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:46,873][25689] Avg episode reward: [(0, '-48.749')] [2022-07-09 20:30:48,418][26022] Updated weights on worker 0-0, policy_version 408614 (0.00085) [2022-07-09 20:30:50,387][26022] Updated weights on worker 0-0, policy_version 408624 (0.00122) [2022-07-09 20:30:51,903][25689] Fps is (10 sec: 5599.4, 60 sec: 5668.3, 300 sec: 5654.9). Total num frames: 418440192. Throughput: 0: 5072.5. Samples: 418435142. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:51,903][25689] Avg episode reward: [(0, '-48.706')] [2022-07-09 20:30:51,951][26022] Updated weights on worker 0-0, policy_version 408634 (0.00087) [2022-07-09 20:30:53,868][26022] Updated weights on worker 0-0, policy_version 408644 (0.00139) [2022-07-09 20:30:55,615][26022] Updated weights on worker 0-0, policy_version 408654 (0.00087) [2022-07-09 20:30:56,917][25689] Fps is (10 sec: 5811.2, 60 sec: 5652.4, 300 sec: 5656.3). Total num frames: 418468864. Throughput: 0: 5944.6. Samples: 418469622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:30:56,917][25689] Avg episode reward: [(0, '-48.300')] [2022-07-09 20:30:57,475][26022] Updated weights on worker 0-0, policy_version 408664 (0.00083) [2022-07-09 20:30:59,111][26022] Updated weights on worker 0-0, policy_version 408674 (0.00087) [2022-07-09 20:31:00,972][26022] Updated weights on worker 0-0, policy_version 408684 (0.00090) [2022-07-09 20:31:01,989][25689] Fps is (10 sec: 5583.7, 60 sec: 5617.1, 300 sec: 5662.2). Total num frames: 418496512. Throughput: 0: 5943.3. Samples: 418503642. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:01,990][25689] Avg episode reward: [(0, '-49.283')] [2022-07-09 20:31:03,009][26022] Updated weights on worker 0-0, policy_version 408694 (0.00104) [2022-07-09 20:31:05,062][26022] Updated weights on worker 0-0, policy_version 408704 (0.00085) [2022-07-09 20:31:06,443][26022] Updated weights on worker 0-0, policy_version 408714 (0.00088) [2022-07-09 20:31:07,046][25689] Fps is (10 sec: 5560.4, 60 sec: 5679.9, 300 sec: 5657.9). Total num frames: 418525184. Throughput: 0: 5009.4. Samples: 418519188. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:07,046][25689] Avg episode reward: [(0, '-48.886')] [2022-07-09 20:31:08,499][26022] Updated weights on worker 0-0, policy_version 408724 (0.00086) [2022-07-09 20:31:10,127][26022] Updated weights on worker 0-0, policy_version 408734 (0.00085) [2022-07-09 20:31:12,103][25689] Fps is (10 sec: 5669.9, 60 sec: 5663.2, 300 sec: 5657.4). Total num frames: 418553856. Throughput: 0: 5873.0. Samples: 418553778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:12,104][25689] Avg episode reward: [(0, '-48.283')] [2022-07-09 20:31:12,114][26022] Updated weights on worker 0-0, policy_version 408744 (0.00086) [2022-07-09 20:31:13,907][26022] Updated weights on worker 0-0, policy_version 408754 (0.00092) [2022-07-09 20:31:15,743][26022] Updated weights on worker 0-0, policy_version 408764 (0.00084) [2022-07-09 20:31:17,167][25689] Fps is (10 sec: 5767.1, 60 sec: 5674.6, 300 sec: 5664.1). Total num frames: 418583552. Throughput: 0: 5834.7. Samples: 418587772. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:17,167][25689] Avg episode reward: [(0, '-48.271')] [2022-07-09 20:31:17,441][26022] Updated weights on worker 0-0, policy_version 408774 (0.00083) [2022-07-09 20:31:19,183][26022] Updated weights on worker 0-0, policy_version 408784 (0.00094) [2022-07-09 20:31:21,026][26022] Updated weights on worker 0-0, policy_version 408794 (0.00087) [2022-07-09 20:31:22,227][25689] Fps is (10 sec: 5562.9, 60 sec: 5644.6, 300 sec: 5652.7). Total num frames: 418610176. Throughput: 0: 4997.8. Samples: 418604788. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:22,228][25689] Avg episode reward: [(0, '-47.905')] [2022-07-09 20:31:22,831][26022] Updated weights on worker 0-0, policy_version 408804 (0.00098) [2022-07-09 20:31:24,587][26022] Updated weights on worker 0-0, policy_version 408814 (0.00089) [2022-07-09 20:31:26,481][26022] Updated weights on worker 0-0, policy_version 408824 (0.00097) [2022-07-09 20:31:27,298][25689] Fps is (10 sec: 5558.7, 60 sec: 5657.0, 300 sec: 5658.5). Total num frames: 418639872. Throughput: 0: 5936.0. Samples: 418639408. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:27,299][25689] Avg episode reward: [(0, '-47.848')] [2022-07-09 20:31:28,088][26022] Updated weights on worker 0-0, policy_version 408834 (0.00103) [2022-07-09 20:31:30,009][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:31:30,024][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000408843_418655232.pth [2022-07-09 20:31:30,025][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000406851_416615424.pth [2022-07-09 20:31:30,095][26022] Updated weights on worker 0-0, policy_version 408844 (0.00072) [2022-07-09 20:31:31,708][26022] Updated weights on worker 0-0, policy_version 408854 (0.00081) [2022-07-09 20:31:32,307][25689] Fps is (10 sec: 5892.5, 60 sec: 5679.7, 300 sec: 5658.5). Total num frames: 418669568. Throughput: 0: 5934.4. Samples: 418673674. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:32,309][25689] Avg episode reward: [(0, '-46.987')] [2022-07-09 20:31:33,626][26022] Updated weights on worker 0-0, policy_version 408864 (0.00085) [2022-07-09 20:31:35,398][26022] Updated weights on worker 0-0, policy_version 408874 (0.00091) [2022-07-09 20:31:37,071][26022] Updated weights on worker 0-0, policy_version 408884 (0.00085) [2022-07-09 20:31:37,323][25689] Fps is (10 sec: 5822.6, 60 sec: 5714.6, 300 sec: 5662.3). Total num frames: 418698240. Throughput: 0: 5113.1. Samples: 418690832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:37,325][25689] Avg episode reward: [(0, '-47.617')] [2022-07-09 20:31:39,003][26022] Updated weights on worker 0-0, policy_version 408894 (0.00084) [2022-07-09 20:31:40,761][26022] Updated weights on worker 0-0, policy_version 408904 (0.00092) [2022-07-09 20:31:42,374][26022] Updated weights on worker 0-0, policy_version 408914 (0.00087) [2022-07-09 20:31:42,414][25689] Fps is (10 sec: 5774.9, 60 sec: 5680.5, 300 sec: 5660.6). Total num frames: 418727936. Throughput: 0: 5972.9. Samples: 418725360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:42,415][25689] Avg episode reward: [(0, '-47.719')] [2022-07-09 20:31:44,492][26022] Updated weights on worker 0-0, policy_version 408924 (0.00113) [2022-07-09 20:31:46,067][26022] Updated weights on worker 0-0, policy_version 408934 (0.00089) [2022-07-09 20:31:47,446][25689] Fps is (10 sec: 5664.5, 60 sec: 5697.0, 300 sec: 5660.4). Total num frames: 418755584. Throughput: 0: 5966.6. Samples: 418759622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:47,447][25689] Avg episode reward: [(0, '-47.694')] [2022-07-09 20:31:47,872][26022] Updated weights on worker 0-0, policy_version 408944 (0.00091) [2022-07-09 20:31:49,735][26022] Updated weights on worker 0-0, policy_version 408954 (0.00081) [2022-07-09 20:31:51,409][26022] Updated weights on worker 0-0, policy_version 408964 (0.00086) [2022-07-09 20:31:52,467][25689] Fps is (10 sec: 5602.0, 60 sec: 5681.0, 300 sec: 5657.1). Total num frames: 418784256. Throughput: 0: 5111.9. Samples: 418776734. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:52,468][25689] Avg episode reward: [(0, '-46.965')] [2022-07-09 20:31:53,340][26022] Updated weights on worker 0-0, policy_version 408974 (0.00089) [2022-07-09 20:31:55,027][26022] Updated weights on worker 0-0, policy_version 408984 (0.00074) [2022-07-09 20:31:56,871][26022] Updated weights on worker 0-0, policy_version 408994 (0.00085) [2022-07-09 20:31:57,524][25689] Fps is (10 sec: 5689.9, 60 sec: 5676.9, 300 sec: 5653.4). Total num frames: 418812928. Throughput: 0: 5956.0. Samples: 418811152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 20:31:57,525][25689] Avg episode reward: [(0, '-47.478')] [2022-07-09 20:31:58,842][26022] Updated weights on worker 0-0, policy_version 409004 (0.00087) [2022-07-09 20:32:00,373][26022] Updated weights on worker 0-0, policy_version 409014 (0.00098) [2022-07-09 20:32:02,599][25689] Fps is (10 sec: 5457.8, 60 sec: 5659.8, 300 sec: 5655.5). Total num frames: 418839552. Throughput: 0: 5962.6. Samples: 418845716. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:02,599][25689] Avg episode reward: [(0, '-46.475')] [2022-07-09 20:32:02,622][26022] Updated weights on worker 0-0, policy_version 409024 (0.00057) [2022-07-09 20:32:04,500][26022] Updated weights on worker 0-0, policy_version 409034 (0.01003) [2022-07-09 20:32:06,183][26022] Updated weights on worker 0-0, policy_version 409044 (0.00093) [2022-07-09 20:32:07,606][25689] Fps is (10 sec: 5383.4, 60 sec: 5647.6, 300 sec: 5659.1). Total num frames: 418867200. Throughput: 0: 5870.6. Samples: 418877970. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:07,606][25689] Avg episode reward: [(0, '-46.500')] [2022-07-09 20:32:08,029][26022] Updated weights on worker 0-0, policy_version 409054 (0.00086) [2022-07-09 20:32:09,755][26022] Updated weights on worker 0-0, policy_version 409064 (0.00083) [2022-07-09 20:32:11,563][26022] Updated weights on worker 0-0, policy_version 409074 (0.00083) [2022-07-09 20:32:12,706][25689] Fps is (10 sec: 5774.8, 60 sec: 5677.3, 300 sec: 5661.2). Total num frames: 418897920. Throughput: 0: 5854.1. Samples: 418895214. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:12,707][25689] Avg episode reward: [(0, '-46.026')] [2022-07-09 20:32:13,320][26022] Updated weights on worker 0-0, policy_version 409084 (0.00087) [2022-07-09 20:32:15,059][26022] Updated weights on worker 0-0, policy_version 409094 (0.00080) [2022-07-09 20:32:16,951][26022] Updated weights on worker 0-0, policy_version 409104 (0.00087) [2022-07-09 20:32:17,730][25689] Fps is (10 sec: 5866.1, 60 sec: 5664.1, 300 sec: 5662.2). Total num frames: 418926592. Throughput: 0: 5856.2. Samples: 418929482. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:17,731][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 20:32:18,747][26022] Updated weights on worker 0-0, policy_version 409114 (0.00090) [2022-07-09 20:32:20,582][26022] Updated weights on worker 0-0, policy_version 409124 (0.00093) [2022-07-09 20:32:22,331][26022] Updated weights on worker 0-0, policy_version 409134 (0.00092) [2022-07-09 20:32:22,856][25689] Fps is (10 sec: 5649.7, 60 sec: 5691.8, 300 sec: 5657.0). Total num frames: 418955264. Throughput: 0: 5823.3. Samples: 418963680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:22,857][25689] Avg episode reward: [(0, '-47.441')] [2022-07-09 20:32:24,069][26022] Updated weights on worker 0-0, policy_version 409144 (0.00086) [2022-07-09 20:32:25,984][26022] Updated weights on worker 0-0, policy_version 409154 (0.00090) [2022-07-09 20:32:27,739][26022] Updated weights on worker 0-0, policy_version 409164 (0.00086) [2022-07-09 20:32:27,866][25689] Fps is (10 sec: 5657.5, 60 sec: 5680.7, 300 sec: 5660.5). Total num frames: 418983936. Throughput: 0: 5067.5. Samples: 418980640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:27,867][25689] Avg episode reward: [(0, '-47.839')] [2022-07-09 20:32:29,537][26022] Updated weights on worker 0-0, policy_version 409174 (0.00093) [2022-07-09 20:32:31,334][26022] Updated weights on worker 0-0, policy_version 409184 (0.00610) [2022-07-09 20:32:32,935][25689] Fps is (10 sec: 5689.7, 60 sec: 5658.1, 300 sec: 5660.8). Total num frames: 419012608. Throughput: 0: 5904.1. Samples: 419014644. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:32,935][25689] Avg episode reward: [(0, '-47.397')] [2022-07-09 20:32:33,322][26022] Updated weights on worker 0-0, policy_version 409194 (0.00093) [2022-07-09 20:32:34,861][26022] Updated weights on worker 0-0, policy_version 409204 (0.00092) [2022-07-09 20:32:36,707][26022] Updated weights on worker 0-0, policy_version 409214 (0.00091) [2022-07-09 20:32:37,976][25689] Fps is (10 sec: 5773.5, 60 sec: 5672.7, 300 sec: 5665.6). Total num frames: 419042304. Throughput: 0: 5904.0. Samples: 419049012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:37,978][25689] Avg episode reward: [(0, '-47.516')] [2022-07-09 20:32:38,519][26022] Updated weights on worker 0-0, policy_version 409224 (0.00086) [2022-07-09 20:32:40,260][26022] Updated weights on worker 0-0, policy_version 409234 (0.00090) [2022-07-09 20:32:42,276][26022] Updated weights on worker 0-0, policy_version 409244 (0.00097) [2022-07-09 20:32:43,057][25689] Fps is (10 sec: 5665.2, 60 sec: 5639.8, 300 sec: 5662.1). Total num frames: 419069952. Throughput: 0: 5074.2. Samples: 419066182. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:43,059][25689] Avg episode reward: [(0, '-48.653')] [2022-07-09 20:32:43,906][26022] Updated weights on worker 0-0, policy_version 409254 (0.00091) [2022-07-09 20:32:45,827][26022] Updated weights on worker 0-0, policy_version 409264 (0.00088) [2022-07-09 20:32:47,464][26022] Updated weights on worker 0-0, policy_version 409274 (0.00084) [2022-07-09 20:32:48,078][25689] Fps is (10 sec: 5676.3, 60 sec: 5674.6, 300 sec: 5662.0). Total num frames: 419099648. Throughput: 0: 5922.2. Samples: 419100340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:48,080][25689] Avg episode reward: [(0, '-49.098')] [2022-07-09 20:32:49,411][26022] Updated weights on worker 0-0, policy_version 409284 (0.00088) [2022-07-09 20:32:51,037][26022] Updated weights on worker 0-0, policy_version 409294 (0.00089) [2022-07-09 20:32:52,991][26022] Updated weights on worker 0-0, policy_version 409304 (0.00080) [2022-07-09 20:32:53,161][25689] Fps is (10 sec: 5675.5, 60 sec: 5652.0, 300 sec: 5657.3). Total num frames: 419127296. Throughput: 0: 5920.4. Samples: 419134390. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:53,161][25689] Avg episode reward: [(0, '-48.896')] [2022-07-09 20:32:54,745][26022] Updated weights on worker 0-0, policy_version 409314 (0.00094) [2022-07-09 20:32:56,762][26022] Updated weights on worker 0-0, policy_version 409324 (0.00084) [2022-07-09 20:32:58,233][25689] Fps is (10 sec: 5646.8, 60 sec: 5667.4, 300 sec: 5664.0). Total num frames: 419156992. Throughput: 0: 5059.7. Samples: 419151508. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:32:58,235][25689] Avg episode reward: [(0, '-48.559')] [2022-07-09 20:32:58,364][26022] Updated weights on worker 0-0, policy_version 409334 (0.00101) [2022-07-09 20:33:00,188][26022] Updated weights on worker 0-0, policy_version 409344 (0.00059) [2022-07-09 20:33:02,061][26022] Updated weights on worker 0-0, policy_version 409354 (0.00085) [2022-07-09 20:33:03,284][25689] Fps is (10 sec: 5563.5, 60 sec: 5669.7, 300 sec: 5660.1). Total num frames: 419183616. Throughput: 0: 5911.2. Samples: 419185748. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:03,284][25689] Avg episode reward: [(0, '-48.187')] [2022-07-09 20:33:04,062][26022] Updated weights on worker 0-0, policy_version 409364 (0.00090) [2022-07-09 20:33:05,993][26022] Updated weights on worker 0-0, policy_version 409374 (0.00088) [2022-07-09 20:33:07,789][26022] Updated weights on worker 0-0, policy_version 409384 (0.00091) [2022-07-09 20:33:08,350][25689] Fps is (10 sec: 5465.8, 60 sec: 5681.0, 300 sec: 5662.7). Total num frames: 419212288. Throughput: 0: 5789.5. Samples: 419217704. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:08,350][25689] Avg episode reward: [(0, '-47.324')] [2022-07-09 20:33:09,732][26022] Updated weights on worker 0-0, policy_version 409394 (0.00092) [2022-07-09 20:33:11,455][26022] Updated weights on worker 0-0, policy_version 409404 (0.00084) [2022-07-09 20:33:13,068][26022] Updated weights on worker 0-0, policy_version 409414 (0.00102) [2022-07-09 20:33:13,394][25689] Fps is (10 sec: 5671.6, 60 sec: 5652.5, 300 sec: 5651.7). Total num frames: 419240960. Throughput: 0: 4966.6. Samples: 419234884. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:13,395][25689] Avg episode reward: [(0, '-46.981')] [2022-07-09 20:33:14,888][26022] Updated weights on worker 0-0, policy_version 409424 (0.00087) [2022-07-09 20:33:16,618][26022] Updated weights on worker 0-0, policy_version 409434 (0.00087) [2022-07-09 20:33:18,397][25689] Fps is (10 sec: 5707.2, 60 sec: 5654.4, 300 sec: 5655.8). Total num frames: 419269632. Throughput: 0: 5839.9. Samples: 419269266. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:18,398][25689] Avg episode reward: [(0, '-46.772')] [2022-07-09 20:33:18,477][26022] Updated weights on worker 0-0, policy_version 409444 (0.00091) [2022-07-09 20:33:20,372][26022] Updated weights on worker 0-0, policy_version 409454 (0.00089) [2022-07-09 20:33:22,025][26022] Updated weights on worker 0-0, policy_version 409464 (0.00086) [2022-07-09 20:33:23,509][25689] Fps is (10 sec: 5770.6, 60 sec: 5672.6, 300 sec: 5662.4). Total num frames: 419299328. Throughput: 0: 5836.6. Samples: 419303796. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:23,510][25689] Avg episode reward: [(0, '-46.995')] [2022-07-09 20:33:23,920][26022] Updated weights on worker 0-0, policy_version 409474 (0.00093) [2022-07-09 20:33:25,702][26022] Updated weights on worker 0-0, policy_version 409484 (0.00093) [2022-07-09 20:33:27,357][26022] Updated weights on worker 0-0, policy_version 409494 (0.00083) [2022-07-09 20:33:28,520][25689] Fps is (10 sec: 5563.9, 60 sec: 5638.8, 300 sec: 5652.1). Total num frames: 419325952. Throughput: 0: 5125.2. Samples: 419321080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:28,520][25689] Avg episode reward: [(0, '-46.998')] [2022-07-09 20:33:29,255][26022] Updated weights on worker 0-0, policy_version 409504 (0.00087) [2022-07-09 20:33:30,063][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:33:30,073][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000409508_419336192.pth [2022-07-09 20:33:30,073][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000407517_417297408.pth [2022-07-09 20:33:31,244][26022] Updated weights on worker 0-0, policy_version 409514 (0.00091) [2022-07-09 20:33:32,807][26022] Updated weights on worker 0-0, policy_version 409524 (0.00081) [2022-07-09 20:33:33,536][25689] Fps is (10 sec: 5718.9, 60 sec: 5677.5, 300 sec: 5662.2). Total num frames: 419356672. Throughput: 0: 5963.5. Samples: 419355000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:33,537][25689] Avg episode reward: [(0, '-46.906')] [2022-07-09 20:33:34,824][26022] Updated weights on worker 0-0, policy_version 409534 (0.00088) [2022-07-09 20:33:36,349][26022] Updated weights on worker 0-0, policy_version 409544 (0.00050) [2022-07-09 20:33:38,277][26022] Updated weights on worker 0-0, policy_version 409554 (0.00086) [2022-07-09 20:33:38,588][25689] Fps is (10 sec: 5797.5, 60 sec: 5642.7, 300 sec: 5659.0). Total num frames: 419384320. Throughput: 0: 5958.1. Samples: 419389560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:38,588][25689] Avg episode reward: [(0, '-47.338')] [2022-07-09 20:33:39,927][26022] Updated weights on worker 0-0, policy_version 409564 (0.00085) [2022-07-09 20:33:41,751][26022] Updated weights on worker 0-0, policy_version 409574 (0.00103) [2022-07-09 20:33:43,642][25689] Fps is (10 sec: 5674.6, 60 sec: 5679.0, 300 sec: 5661.7). Total num frames: 419414016. Throughput: 0: 5114.7. Samples: 419406768. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:43,642][25689] Avg episode reward: [(0, '-47.297')] [2022-07-09 20:33:43,647][26022] Updated weights on worker 0-0, policy_version 409584 (0.00087) [2022-07-09 20:33:45,488][26022] Updated weights on worker 0-0, policy_version 409594 (0.00098) [2022-07-09 20:33:47,199][26022] Updated weights on worker 0-0, policy_version 409604 (0.00099) [2022-07-09 20:33:48,720][25689] Fps is (10 sec: 5760.5, 60 sec: 5656.8, 300 sec: 5667.3). Total num frames: 419442688. Throughput: 0: 5915.8. Samples: 419440580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:48,721][25689] Avg episode reward: [(0, '-46.940')] [2022-07-09 20:33:49,175][26022] Updated weights on worker 0-0, policy_version 409614 (0.00099) [2022-07-09 20:33:50,851][26022] Updated weights on worker 0-0, policy_version 409624 (0.00084) [2022-07-09 20:33:52,873][26022] Updated weights on worker 0-0, policy_version 409634 (0.00095) [2022-07-09 20:33:53,756][25689] Fps is (10 sec: 5568.2, 60 sec: 5661.1, 300 sec: 5657.1). Total num frames: 419470336. Throughput: 0: 5896.4. Samples: 419474226. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:53,757][25689] Avg episode reward: [(0, '-46.577')] [2022-07-09 20:33:54,668][26022] Updated weights on worker 0-0, policy_version 409644 (0.00091) [2022-07-09 20:33:56,373][26022] Updated weights on worker 0-0, policy_version 409654 (0.00443) [2022-07-09 20:33:58,260][26022] Updated weights on worker 0-0, policy_version 409664 (0.00090) [2022-07-09 20:33:58,775][25689] Fps is (10 sec: 5600.9, 60 sec: 5649.2, 300 sec: 5662.6). Total num frames: 419499008. Throughput: 0: 5896.8. Samples: 419508604. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:33:58,776][25689] Avg episode reward: [(0, '-46.510')] [2022-07-09 20:34:00,008][26022] Updated weights on worker 0-0, policy_version 409674 (0.00086) [2022-07-09 20:34:02,050][26022] Updated weights on worker 0-0, policy_version 409684 (0.00428) [2022-07-09 20:34:03,852][25689] Fps is (10 sec: 5578.9, 60 sec: 5663.7, 300 sec: 5661.7). Total num frames: 419526656. Throughput: 0: 5850.0. Samples: 419524996. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:03,852][25689] Avg episode reward: [(0, '-47.108')] [2022-07-09 20:34:03,856][26022] Updated weights on worker 0-0, policy_version 409694 (0.00086) [2022-07-09 20:34:05,639][26022] Updated weights on worker 0-0, policy_version 409704 (0.00090) [2022-07-09 20:34:07,612][26022] Updated weights on worker 0-0, policy_version 409714 (0.00087) [2022-07-09 20:34:08,867][25689] Fps is (10 sec: 5479.5, 60 sec: 5651.6, 300 sec: 5658.1). Total num frames: 419554304. Throughput: 0: 5814.3. Samples: 419557720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:08,868][25689] Avg episode reward: [(0, '-46.855')] [2022-07-09 20:34:09,344][26022] Updated weights on worker 0-0, policy_version 409724 (0.00088) [2022-07-09 20:34:11,151][26022] Updated weights on worker 0-0, policy_version 409734 (0.00092) [2022-07-09 20:34:13,007][26022] Updated weights on worker 0-0, policy_version 409744 (0.00085) [2022-07-09 20:34:13,875][25689] Fps is (10 sec: 5517.0, 60 sec: 5638.1, 300 sec: 5658.5). Total num frames: 419581952. Throughput: 0: 5862.0. Samples: 419592158. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:13,875][25689] Avg episode reward: [(0, '-46.311')] [2022-07-09 20:34:14,658][26022] Updated weights on worker 0-0, policy_version 409754 (0.00085) [2022-07-09 20:34:16,752][26022] Updated weights on worker 0-0, policy_version 409764 (0.00091) [2022-07-09 20:34:18,359][26022] Updated weights on worker 0-0, policy_version 409774 (0.00083) [2022-07-09 20:34:18,891][25689] Fps is (10 sec: 5618.6, 60 sec: 5636.8, 300 sec: 5655.8). Total num frames: 419610624. Throughput: 0: 4986.5. Samples: 419608910. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:18,892][25689] Avg episode reward: [(0, '-46.751')] [2022-07-09 20:34:20,321][26022] Updated weights on worker 0-0, policy_version 409784 (0.00087) [2022-07-09 20:34:21,880][26022] Updated weights on worker 0-0, policy_version 409794 (0.00085) [2022-07-09 20:34:23,762][26022] Updated weights on worker 0-0, policy_version 409804 (0.00088) [2022-07-09 20:34:23,963][25689] Fps is (10 sec: 5785.6, 60 sec: 5640.5, 300 sec: 5657.9). Total num frames: 419640320. Throughput: 0: 5875.8. Samples: 419643166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:23,964][25689] Avg episode reward: [(0, '-47.101')] [2022-07-09 20:34:25,613][26022] Updated weights on worker 0-0, policy_version 409814 (0.00088) [2022-07-09 20:34:27,368][26022] Updated weights on worker 0-0, policy_version 409824 (0.00107) [2022-07-09 20:34:28,975][25689] Fps is (10 sec: 5787.8, 60 sec: 5674.2, 300 sec: 5664.8). Total num frames: 419668992. Throughput: 0: 5950.5. Samples: 419677376. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:28,976][25689] Avg episode reward: [(0, '-47.366')] [2022-07-09 20:34:29,021][26022] Updated weights on worker 0-0, policy_version 409834 (0.00086) [2022-07-09 20:34:31,305][26022] Updated weights on worker 0-0, policy_version 409844 (0.00086) [2022-07-09 20:34:32,619][26022] Updated weights on worker 0-0, policy_version 409854 (0.00094) [2022-07-09 20:34:34,007][25689] Fps is (10 sec: 5607.6, 60 sec: 5622.1, 300 sec: 5657.4). Total num frames: 419696640. Throughput: 0: 5080.2. Samples: 419694432. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 20:34:34,007][25689] Avg episode reward: [(0, '-46.805')] [2022-07-09 20:34:34,804][26022] Updated weights on worker 0-0, policy_version 409864 (0.00097) [2022-07-09 20:34:36,024][26022] Updated weights on worker 0-0, policy_version 409874 (0.00107) [2022-07-09 20:34:38,276][26022] Updated weights on worker 0-0, policy_version 409884 (0.00092) [2022-07-09 20:34:39,010][25689] Fps is (10 sec: 5817.0, 60 sec: 5677.4, 300 sec: 5665.7). Total num frames: 419727360. Throughput: 0: 5969.3. Samples: 419729004. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:34:39,010][25689] Avg episode reward: [(0, '-47.009')] [2022-07-09 20:34:39,803][26022] Updated weights on worker 0-0, policy_version 409894 (0.00088) [2022-07-09 20:34:41,724][26022] Updated weights on worker 0-0, policy_version 409904 (0.00089) [2022-07-09 20:34:43,527][26022] Updated weights on worker 0-0, policy_version 409914 (0.00091) [2022-07-09 20:34:44,131][25689] Fps is (10 sec: 5765.1, 60 sec: 5637.2, 300 sec: 5660.8). Total num frames: 419755008. Throughput: 0: 5948.0. Samples: 419763124. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:34:44,131][25689] Avg episode reward: [(0, '-48.045')] [2022-07-09 20:34:45,393][26022] Updated weights on worker 0-0, policy_version 409924 (0.00086) [2022-07-09 20:34:47,110][26022] Updated weights on worker 0-0, policy_version 409934 (0.00079) [2022-07-09 20:34:49,078][26022] Updated weights on worker 0-0, policy_version 409944 (0.00094) [2022-07-09 20:34:49,162][25689] Fps is (10 sec: 5446.3, 60 sec: 5624.7, 300 sec: 5660.3). Total num frames: 419782656. Throughput: 0: 5101.3. Samples: 419780356. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:34:49,163][25689] Avg episode reward: [(0, '-48.151')] [2022-07-09 20:34:50,587][26022] Updated weights on worker 0-0, policy_version 409954 (0.00089) [2022-07-09 20:34:52,641][26022] Updated weights on worker 0-0, policy_version 409964 (0.00094) [2022-07-09 20:34:54,177][25689] Fps is (10 sec: 5606.0, 60 sec: 5643.6, 300 sec: 5657.0). Total num frames: 419811328. Throughput: 0: 5940.6. Samples: 419814258. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:34:54,178][25689] Avg episode reward: [(0, '-47.441')] [2022-07-09 20:34:54,431][26022] Updated weights on worker 0-0, policy_version 409974 (0.00078) [2022-07-09 20:34:56,255][26022] Updated weights on worker 0-0, policy_version 409984 (0.00055) [2022-07-09 20:34:58,083][26022] Updated weights on worker 0-0, policy_version 409994 (0.00087) [2022-07-09 20:34:59,218][25689] Fps is (10 sec: 5702.6, 60 sec: 5641.6, 300 sec: 5653.9). Total num frames: 419840000. Throughput: 0: 5912.7. Samples: 419848492. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:34:59,219][25689] Avg episode reward: [(0, '-47.074')] [2022-07-09 20:34:59,707][26022] Updated weights on worker 0-0, policy_version 410004 (0.00087) [2022-07-09 20:35:01,657][26022] Updated weights on worker 0-0, policy_version 410014 (0.00099) [2022-07-09 20:35:03,843][26022] Updated weights on worker 0-0, policy_version 410024 (0.00091) [2022-07-09 20:35:04,282][25689] Fps is (10 sec: 5472.4, 60 sec: 5625.8, 300 sec: 5659.6). Total num frames: 419866624. Throughput: 0: 4983.0. Samples: 419863536. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:04,282][25689] Avg episode reward: [(0, '-47.872')] [2022-07-09 20:35:05,332][26022] Updated weights on worker 0-0, policy_version 410034 (0.00085) [2022-07-09 20:35:07,455][26022] Updated weights on worker 0-0, policy_version 410044 (0.00087) [2022-07-09 20:35:09,008][26022] Updated weights on worker 0-0, policy_version 410054 (0.00082) [2022-07-09 20:35:09,294][25689] Fps is (10 sec: 5487.8, 60 sec: 5643.0, 300 sec: 5657.1). Total num frames: 419895296. Throughput: 0: 5832.0. Samples: 419897766. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:09,296][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 20:35:10,856][26022] Updated weights on worker 0-0, policy_version 410064 (0.00091) [2022-07-09 20:35:12,579][26022] Updated weights on worker 0-0, policy_version 410074 (0.00095) [2022-07-09 20:35:14,301][25689] Fps is (10 sec: 5825.7, 60 sec: 5677.0, 300 sec: 5660.5). Total num frames: 419924992. Throughput: 0: 5864.8. Samples: 419932278. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:14,301][25689] Avg episode reward: [(0, '-47.232')] [2022-07-09 20:35:14,424][26022] Updated weights on worker 0-0, policy_version 410084 (0.00091) [2022-07-09 20:35:16,196][26022] Updated weights on worker 0-0, policy_version 410094 (0.00089) [2022-07-09 20:35:18,180][26022] Updated weights on worker 0-0, policy_version 410104 (0.00094) [2022-07-09 20:35:19,339][25689] Fps is (10 sec: 5709.1, 60 sec: 5658.0, 300 sec: 5658.3). Total num frames: 419952640. Throughput: 0: 5007.5. Samples: 419949246. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:19,339][25689] Avg episode reward: [(0, '-46.604')] [2022-07-09 20:35:19,847][26022] Updated weights on worker 0-0, policy_version 410114 (0.01039) [2022-07-09 20:35:21,935][26022] Updated weights on worker 0-0, policy_version 410124 (0.00080) [2022-07-09 20:35:23,400][26022] Updated weights on worker 0-0, policy_version 410134 (0.00087) [2022-07-09 20:35:24,465][25689] Fps is (10 sec: 5641.7, 60 sec: 5653.0, 300 sec: 5659.7). Total num frames: 419982336. Throughput: 0: 5937.2. Samples: 419983368. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:24,467][25689] Avg episode reward: [(0, '-47.117')] [2022-07-09 20:35:25,318][26022] Updated weights on worker 0-0, policy_version 410144 (0.00089) [2022-07-09 20:35:27,124][26022] Updated weights on worker 0-0, policy_version 410154 (0.00091) [2022-07-09 20:35:29,047][26022] Updated weights on worker 0-0, policy_version 410164 (0.00058) [2022-07-09 20:35:29,534][25689] Fps is (10 sec: 5724.9, 60 sec: 5647.7, 300 sec: 5659.8). Total num frames: 420011008. Throughput: 0: 5910.6. Samples: 420017396. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:29,535][25689] Avg episode reward: [(0, '-46.376')] [2022-07-09 20:35:30,212][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:35:30,224][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000410171_420015104.pth [2022-07-09 20:35:30,225][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000408180_417976320.pth [2022-07-09 20:35:30,892][26022] Updated weights on worker 0-0, policy_version 410174 (0.00089) [2022-07-09 20:35:32,649][26022] Updated weights on worker 0-0, policy_version 410184 (0.00085) [2022-07-09 20:35:34,251][26022] Updated weights on worker 0-0, policy_version 410194 (0.00094) [2022-07-09 20:35:34,571][25689] Fps is (10 sec: 5674.5, 60 sec: 5664.1, 300 sec: 5666.5). Total num frames: 420039680. Throughput: 0: 5031.9. Samples: 420034274. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:34,571][25689] Avg episode reward: [(0, '-46.573')] [2022-07-09 20:35:36,168][26022] Updated weights on worker 0-0, policy_version 410204 (0.00092) [2022-07-09 20:35:37,830][26022] Updated weights on worker 0-0, policy_version 410214 (0.00088) [2022-07-09 20:35:39,588][25689] Fps is (10 sec: 5601.9, 60 sec: 5612.0, 300 sec: 5654.1). Total num frames: 420067328. Throughput: 0: 5879.9. Samples: 420068310. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:39,588][25689] Avg episode reward: [(0, '-47.351')] [2022-07-09 20:35:39,928][26022] Updated weights on worker 0-0, policy_version 410224 (0.00088) [2022-07-09 20:35:41,667][26022] Updated weights on worker 0-0, policy_version 410234 (0.00084) [2022-07-09 20:35:43,420][26022] Updated weights on worker 0-0, policy_version 410245 (0.00090) [2022-07-09 20:35:44,667][25689] Fps is (10 sec: 5578.5, 60 sec: 5632.9, 300 sec: 5660.0). Total num frames: 420096000. Throughput: 0: 5897.6. Samples: 420102508. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:44,667][25689] Avg episode reward: [(0, '-46.745')] [2022-07-09 20:35:45,574][26022] Updated weights on worker 0-0, policy_version 410255 (0.00087) [2022-07-09 20:35:46,964][26022] Updated weights on worker 0-0, policy_version 410265 (0.00081) [2022-07-09 20:35:49,155][26022] Updated weights on worker 0-0, policy_version 410275 (0.00144) [2022-07-09 20:35:49,685][25689] Fps is (10 sec: 5780.5, 60 sec: 5667.9, 300 sec: 5660.2). Total num frames: 420125696. Throughput: 0: 5074.9. Samples: 420119662. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:49,686][25689] Avg episode reward: [(0, '-47.383')] [2022-07-09 20:35:50,656][26022] Updated weights on worker 0-0, policy_version 410285 (0.00091) [2022-07-09 20:35:52,588][26022] Updated weights on worker 0-0, policy_version 410295 (0.00082) [2022-07-09 20:35:54,175][26022] Updated weights on worker 0-0, policy_version 410305 (0.00088) [2022-07-09 20:35:54,691][25689] Fps is (10 sec: 5720.5, 60 sec: 5651.9, 300 sec: 5656.9). Total num frames: 420153344. Throughput: 0: 5942.9. Samples: 420153846. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:54,692][25689] Avg episode reward: [(0, '-47.152')] [2022-07-09 20:35:55,927][26022] Updated weights on worker 0-0, policy_version 410315 (0.00091) [2022-07-09 20:35:58,003][26022] Updated weights on worker 0-0, policy_version 410325 (0.00087) [2022-07-09 20:35:59,699][25689] Fps is (10 sec: 5624.1, 60 sec: 5654.9, 300 sec: 5661.6). Total num frames: 420182016. Throughput: 0: 5960.4. Samples: 420188182. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:35:59,700][25689] Avg episode reward: [(0, '-47.423')] [2022-07-09 20:35:59,724][26022] Updated weights on worker 0-0, policy_version 410335 (0.00091) [2022-07-09 20:36:01,972][26022] Updated weights on worker 0-0, policy_version 410345 (0.00095) [2022-07-09 20:36:03,695][26022] Updated weights on worker 0-0, policy_version 410355 (0.00089) [2022-07-09 20:36:04,802][25689] Fps is (10 sec: 5569.9, 60 sec: 5668.2, 300 sec: 5657.2). Total num frames: 420209664. Throughput: 0: 5007.1. Samples: 420203330. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:04,804][25689] Avg episode reward: [(0, '-47.169')] [2022-07-09 20:36:05,527][26022] Updated weights on worker 0-0, policy_version 410365 (0.00089) [2022-07-09 20:36:07,200][26022] Updated weights on worker 0-0, policy_version 410375 (0.00091) [2022-07-09 20:36:09,276][26022] Updated weights on worker 0-0, policy_version 410385 (0.00095) [2022-07-09 20:36:09,828][25689] Fps is (10 sec: 5560.5, 60 sec: 5666.9, 300 sec: 5657.8). Total num frames: 420238336. Throughput: 0: 5840.5. Samples: 420237304. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:09,830][25689] Avg episode reward: [(0, '-46.019')] [2022-07-09 20:36:10,768][26022] Updated weights on worker 0-0, policy_version 410395 (0.00086) [2022-07-09 20:36:12,788][26022] Updated weights on worker 0-0, policy_version 410405 (0.00090) [2022-07-09 20:36:14,419][26022] Updated weights on worker 0-0, policy_version 410415 (0.00083) [2022-07-09 20:36:14,866][25689] Fps is (10 sec: 5596.2, 60 sec: 5630.1, 300 sec: 5651.4). Total num frames: 420265984. Throughput: 0: 5834.8. Samples: 420271566. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:14,868][25689] Avg episode reward: [(0, '-46.316')] [2022-07-09 20:36:16,180][26022] Updated weights on worker 0-0, policy_version 410425 (0.00084) [2022-07-09 20:36:18,143][26022] Updated weights on worker 0-0, policy_version 410435 (0.00083) [2022-07-09 20:36:19,885][25689] Fps is (10 sec: 5600.2, 60 sec: 5648.9, 300 sec: 5659.1). Total num frames: 420294656. Throughput: 0: 4987.7. Samples: 420288860. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:19,885][25689] Avg episode reward: [(0, '-47.539')] [2022-07-09 20:36:19,917][26022] Updated weights on worker 0-0, policy_version 410445 (0.00051) [2022-07-09 20:36:21,761][26022] Updated weights on worker 0-0, policy_version 410455 (0.00083) [2022-07-09 20:36:23,692][26022] Updated weights on worker 0-0, policy_version 410465 (0.00084) [2022-07-09 20:36:24,960][25689] Fps is (10 sec: 5681.0, 60 sec: 5636.7, 300 sec: 5655.6). Total num frames: 420323328. Throughput: 0: 5919.5. Samples: 420322656. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:24,961][25689] Avg episode reward: [(0, '-46.960')] [2022-07-09 20:36:25,218][26022] Updated weights on worker 0-0, policy_version 410475 (0.00097) [2022-07-09 20:36:27,271][26022] Updated weights on worker 0-0, policy_version 410485 (0.00087) [2022-07-09 20:36:28,926][26022] Updated weights on worker 0-0, policy_version 410495 (0.00095) [2022-07-09 20:36:30,056][25689] Fps is (10 sec: 5738.5, 60 sec: 5651.1, 300 sec: 5653.9). Total num frames: 420353024. Throughput: 0: 5891.2. Samples: 420356472. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:30,057][25689] Avg episode reward: [(0, '-47.238')] [2022-07-09 20:36:30,975][26022] Updated weights on worker 0-0, policy_version 410505 (0.00094) [2022-07-09 20:36:32,676][26022] Updated weights on worker 0-0, policy_version 410515 (0.00090) [2022-07-09 20:36:34,366][26022] Updated weights on worker 0-0, policy_version 410525 (0.00088) [2022-07-09 20:36:35,071][25689] Fps is (10 sec: 5671.8, 60 sec: 5636.2, 300 sec: 5650.5). Total num frames: 420380672. Throughput: 0: 5045.5. Samples: 420373508. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:35,072][25689] Avg episode reward: [(0, '-47.621')] [2022-07-09 20:36:36,257][26022] Updated weights on worker 0-0, policy_version 410535 (0.00087) [2022-07-09 20:36:37,734][26022] Updated weights on worker 0-0, policy_version 410545 (0.00090) [2022-07-09 20:36:39,869][26022] Updated weights on worker 0-0, policy_version 410555 (0.00087) [2022-07-09 20:36:40,106][25689] Fps is (10 sec: 5705.6, 60 sec: 5668.3, 300 sec: 5651.6). Total num frames: 420410368. Throughput: 0: 5893.1. Samples: 420408030. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:40,107][25689] Avg episode reward: [(0, '-49.193')] [2022-07-09 20:36:41,670][26022] Updated weights on worker 0-0, policy_version 410565 (0.00097) [2022-07-09 20:36:43,388][26022] Updated weights on worker 0-0, policy_version 410575 (0.00093) [2022-07-09 20:36:45,173][25689] Fps is (10 sec: 5777.9, 60 sec: 5669.5, 300 sec: 5654.4). Total num frames: 420439040. Throughput: 0: 5900.9. Samples: 420441928. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:45,173][25689] Avg episode reward: [(0, '-49.070')] [2022-07-09 20:36:45,184][26022] Updated weights on worker 0-0, policy_version 410585 (0.00084) [2022-07-09 20:36:47,131][26022] Updated weights on worker 0-0, policy_version 410595 (0.00082) [2022-07-09 20:36:48,798][26022] Updated weights on worker 0-0, policy_version 410605 (0.00083) [2022-07-09 20:36:50,197][25689] Fps is (10 sec: 5581.6, 60 sec: 5635.2, 300 sec: 5650.9). Total num frames: 420466688. Throughput: 0: 5934.7. Samples: 420476002. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:50,197][25689] Avg episode reward: [(0, '-47.129')] [2022-07-09 20:36:50,535][26022] Updated weights on worker 0-0, policy_version 410615 (0.00085) [2022-07-09 20:36:52,419][26022] Updated weights on worker 0-0, policy_version 410625 (0.00083) [2022-07-09 20:36:54,207][26022] Updated weights on worker 0-0, policy_version 410635 (0.00094) [2022-07-09 20:36:55,230][25689] Fps is (10 sec: 5600.1, 60 sec: 5649.5, 300 sec: 5651.3). Total num frames: 420495360. Throughput: 0: 5948.0. Samples: 420493414. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:36:55,230][25689] Avg episode reward: [(0, '-46.855')] [2022-07-09 20:36:56,028][26022] Updated weights on worker 0-0, policy_version 410645 (0.00090) [2022-07-09 20:36:57,843][26022] Updated weights on worker 0-0, policy_version 410655 (0.00090) [2022-07-09 20:36:59,546][26022] Updated weights on worker 0-0, policy_version 410665 (0.00093) [2022-07-09 20:37:00,259][25689] Fps is (10 sec: 5800.9, 60 sec: 5664.5, 300 sec: 5662.5). Total num frames: 420525056. Throughput: 0: 5938.0. Samples: 420527694. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:37:00,259][25689] Avg episode reward: [(0, '-45.888')] [2022-07-09 20:37:01,587][26022] Updated weights on worker 0-0, policy_version 410675 (0.00083) [2022-07-09 20:37:03,487][26022] Updated weights on worker 0-0, policy_version 410685 (0.00090) [2022-07-09 20:37:05,298][25689] Fps is (10 sec: 5492.4, 60 sec: 5636.7, 300 sec: 5655.0). Total num frames: 420550656. Throughput: 0: 5863.9. Samples: 420559938. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:37:05,299][25689] Avg episode reward: [(0, '-45.517')] [2022-07-09 20:37:05,375][26022] Updated weights on worker 0-0, policy_version 410695 (0.00086) [2022-07-09 20:37:07,159][26022] Updated weights on worker 0-0, policy_version 410705 (0.00089) [2022-07-09 20:37:09,057][26022] Updated weights on worker 0-0, policy_version 410715 (0.00085) [2022-07-09 20:37:10,299][25689] Fps is (10 sec: 5507.1, 60 sec: 5655.8, 300 sec: 5653.5). Total num frames: 420580352. Throughput: 0: 5027.0. Samples: 420577058. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:37:10,300][25689] Avg episode reward: [(0, '-45.469')] [2022-07-09 20:37:10,690][26022] Updated weights on worker 0-0, policy_version 410725 (0.00093) [2022-07-09 20:37:12,543][26022] Updated weights on worker 0-0, policy_version 410735 (0.00081) [2022-07-09 20:37:14,183][26022] Updated weights on worker 0-0, policy_version 410745 (0.00203) [2022-07-09 20:37:15,363][25689] Fps is (10 sec: 5595.6, 60 sec: 5636.6, 300 sec: 5645.8). Total num frames: 420606976. Throughput: 0: 5866.6. Samples: 420611524. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:37:15,363][25689] Avg episode reward: [(0, '-46.323')] [2022-07-09 20:37:16,127][26022] Updated weights on worker 0-0, policy_version 410755 (0.00086) [2022-07-09 20:37:17,961][26022] Updated weights on worker 0-0, policy_version 410765 (0.00092) [2022-07-09 20:37:19,685][26022] Updated weights on worker 0-0, policy_version 410775 (0.00260) [2022-07-09 20:37:20,398][25689] Fps is (10 sec: 5779.9, 60 sec: 5685.8, 300 sec: 5657.9). Total num frames: 420638720. Throughput: 0: 5839.6. Samples: 420645298. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2022-07-09 20:37:20,398][25689] Avg episode reward: [(0, '-45.246')] [2022-07-09 20:37:21,647][26022] Updated weights on worker 0-0, policy_version 410785 (0.00088) [2022-07-09 20:37:23,303][26022] Updated weights on worker 0-0, policy_version 410795 (0.00093) [2022-07-09 20:37:25,176][26022] Updated weights on worker 0-0, policy_version 410805 (0.00094) [2022-07-09 20:37:25,517][25689] Fps is (10 sec: 5748.3, 60 sec: 5647.9, 300 sec: 5648.9). Total num frames: 420665344. Throughput: 0: 5063.7. Samples: 420662320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:25,517][25689] Avg episode reward: [(0, '-45.385')] [2022-07-09 20:37:27,002][26022] Updated weights on worker 0-0, policy_version 410815 (0.00088) [2022-07-09 20:37:28,782][26022] Updated weights on worker 0-0, policy_version 410825 (0.00094) [2022-07-09 20:37:30,327][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:37:30,342][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000410833_420692992.pth [2022-07-09 20:37:30,346][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000408843_418655232.pth [2022-07-09 20:37:30,562][25689] Fps is (10 sec: 5440.1, 60 sec: 5635.6, 300 sec: 5649.4). Total num frames: 420694016. Throughput: 0: 5891.0. Samples: 420696426. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:30,563][25689] Avg episode reward: [(0, '-45.504')] [2022-07-09 20:37:30,615][26022] Updated weights on worker 0-0, policy_version 410835 (0.00095) [2022-07-09 20:37:32,417][26022] Updated weights on worker 0-0, policy_version 410845 (0.00086) [2022-07-09 20:37:34,252][26022] Updated weights on worker 0-0, policy_version 410855 (0.00100) [2022-07-09 20:37:35,608][25689] Fps is (10 sec: 5783.7, 60 sec: 5666.6, 300 sec: 5649.3). Total num frames: 420723712. Throughput: 0: 5880.3. Samples: 420730574. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:35,609][25689] Avg episode reward: [(0, '-45.339')] [2022-07-09 20:37:36,221][26022] Updated weights on worker 0-0, policy_version 410865 (0.00090) [2022-07-09 20:37:37,731][26022] Updated weights on worker 0-0, policy_version 410875 (0.00092) [2022-07-09 20:37:39,775][26022] Updated weights on worker 0-0, policy_version 410885 (0.00086) [2022-07-09 20:37:40,681][25689] Fps is (10 sec: 5768.4, 60 sec: 5646.2, 300 sec: 5652.9). Total num frames: 420752384. Throughput: 0: 5048.1. Samples: 420747688. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:40,681][25689] Avg episode reward: [(0, '-45.309')] [2022-07-09 20:37:41,356][26022] Updated weights on worker 0-0, policy_version 410895 (0.00092) [2022-07-09 20:37:43,245][26022] Updated weights on worker 0-0, policy_version 410905 (0.00084) [2022-07-09 20:37:45,056][26022] Updated weights on worker 0-0, policy_version 410915 (0.00086) [2022-07-09 20:37:45,738][25689] Fps is (10 sec: 5661.2, 60 sec: 5647.1, 300 sec: 5648.8). Total num frames: 420781056. Throughput: 0: 5909.2. Samples: 420781812. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:45,740][25689] Avg episode reward: [(0, '-44.972')] [2022-07-09 20:37:46,576][26022] Updated weights on worker 0-0, policy_version 410925 (0.00086) [2022-07-09 20:37:48,635][26022] Updated weights on worker 0-0, policy_version 410935 (0.00086) [2022-07-09 20:37:50,391][26022] Updated weights on worker 0-0, policy_version 410945 (0.00088) [2022-07-09 20:37:50,784][25689] Fps is (10 sec: 5675.6, 60 sec: 5661.9, 300 sec: 5652.9). Total num frames: 420809728. Throughput: 0: 5909.0. Samples: 420815918. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:50,785][25689] Avg episode reward: [(0, '-45.676')] [2022-07-09 20:37:52,247][26022] Updated weights on worker 0-0, policy_version 410955 (0.00093) [2022-07-09 20:37:53,989][26022] Updated weights on worker 0-0, policy_version 410965 (0.00089) [2022-07-09 20:37:55,790][25689] Fps is (10 sec: 5602.9, 60 sec: 5647.6, 300 sec: 5647.3). Total num frames: 420837376. Throughput: 0: 5067.9. Samples: 420832852. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:37:55,791][25689] Avg episode reward: [(0, '-47.375')] [2022-07-09 20:37:55,808][26022] Updated weights on worker 0-0, policy_version 410975 (0.00086) [2022-07-09 20:37:57,665][26022] Updated weights on worker 0-0, policy_version 410985 (0.00092) [2022-07-09 20:37:59,399][26022] Updated weights on worker 0-0, policy_version 410995 (0.00090) [2022-07-09 20:38:00,826][25689] Fps is (10 sec: 5710.6, 60 sec: 5646.9, 300 sec: 5657.9). Total num frames: 420867072. Throughput: 0: 5925.9. Samples: 420867066. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:00,828][25689] Avg episode reward: [(0, '-47.246')] [2022-07-09 20:38:01,232][26022] Updated weights on worker 0-0, policy_version 411005 (0.00093) [2022-07-09 20:38:03,447][26022] Updated weights on worker 0-0, policy_version 411015 (0.00086) [2022-07-09 20:38:05,363][26022] Updated weights on worker 0-0, policy_version 411025 (0.00094) [2022-07-09 20:38:05,863][25689] Fps is (10 sec: 5387.5, 60 sec: 5630.1, 300 sec: 5644.7). Total num frames: 420891648. Throughput: 0: 5813.0. Samples: 420898802. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:05,864][25689] Avg episode reward: [(0, '-47.846')] [2022-07-09 20:38:06,926][26022] Updated weights on worker 0-0, policy_version 411035 (0.00092) [2022-07-09 20:38:08,903][26022] Updated weights on worker 0-0, policy_version 411045 (0.00094) [2022-07-09 20:38:10,480][26022] Updated weights on worker 0-0, policy_version 411055 (0.00104) [2022-07-09 20:38:10,876][25689] Fps is (10 sec: 5400.3, 60 sec: 5629.1, 300 sec: 5648.7). Total num frames: 420921344. Throughput: 0: 4973.1. Samples: 420915834. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:10,876][25689] Avg episode reward: [(0, '-47.630')] [2022-07-09 20:38:12,526][26022] Updated weights on worker 0-0, policy_version 411065 (0.00091) [2022-07-09 20:38:14,208][26022] Updated weights on worker 0-0, policy_version 411075 (0.00082) [2022-07-09 20:38:15,955][25689] Fps is (10 sec: 5885.3, 60 sec: 5678.3, 300 sec: 5650.7). Total num frames: 420951040. Throughput: 0: 5804.8. Samples: 420949906. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:15,955][25689] Avg episode reward: [(0, '-47.567')] [2022-07-09 20:38:15,965][26022] Updated weights on worker 0-0, policy_version 411085 (0.00081) [2022-07-09 20:38:17,757][26022] Updated weights on worker 0-0, policy_version 411095 (0.00087) [2022-07-09 20:38:19,884][26022] Updated weights on worker 0-0, policy_version 411105 (0.00091) [2022-07-09 20:38:20,985][25689] Fps is (10 sec: 5571.0, 60 sec: 5594.3, 300 sec: 5641.9). Total num frames: 420977664. Throughput: 0: 5811.6. Samples: 420984224. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:20,986][25689] Avg episode reward: [(0, '-46.393')] [2022-07-09 20:38:21,387][26022] Updated weights on worker 0-0, policy_version 411115 (0.00089) [2022-07-09 20:38:23,395][26022] Updated weights on worker 0-0, policy_version 411125 (0.00095) [2022-07-09 20:38:24,953][26022] Updated weights on worker 0-0, policy_version 411135 (0.00088) [2022-07-09 20:38:26,100][25689] Fps is (10 sec: 5450.4, 60 sec: 5628.5, 300 sec: 5646.8). Total num frames: 421006336. Throughput: 0: 5052.1. Samples: 421001040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:26,101][25689] Avg episode reward: [(0, '-45.161')] [2022-07-09 20:38:27,121][26022] Updated weights on worker 0-0, policy_version 411145 (0.00086) [2022-07-09 20:38:28,687][26022] Updated weights on worker 0-0, policy_version 411155 (0.00085) [2022-07-09 20:38:30,760][26022] Updated weights on worker 0-0, policy_version 411165 (0.00098) [2022-07-09 20:38:31,104][25689] Fps is (10 sec: 5667.2, 60 sec: 5632.4, 300 sec: 5640.2). Total num frames: 421035008. Throughput: 0: 5889.8. Samples: 421034974. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:31,104][25689] Avg episode reward: [(0, '-45.459')] [2022-07-09 20:38:32,380][26022] Updated weights on worker 0-0, policy_version 411175 (0.00089) [2022-07-09 20:38:34,285][26022] Updated weights on worker 0-0, policy_version 411185 (0.00093) [2022-07-09 20:38:35,911][26022] Updated weights on worker 0-0, policy_version 411195 (0.00088) [2022-07-09 20:38:36,185][25689] Fps is (10 sec: 5787.5, 60 sec: 5629.1, 300 sec: 5646.5). Total num frames: 421064704. Throughput: 0: 5898.6. Samples: 421069238. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:36,186][25689] Avg episode reward: [(0, '-44.789')] [2022-07-09 20:38:37,799][26022] Updated weights on worker 0-0, policy_version 411205 (0.00093) [2022-07-09 20:38:39,477][26022] Updated weights on worker 0-0, policy_version 411215 (0.00085) [2022-07-09 20:38:41,215][25689] Fps is (10 sec: 5671.6, 60 sec: 5616.2, 300 sec: 5640.1). Total num frames: 421092352. Throughput: 0: 5037.8. Samples: 421086136. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:41,215][25689] Avg episode reward: [(0, '-45.424')] [2022-07-09 20:38:41,391][26022] Updated weights on worker 0-0, policy_version 411225 (0.00087) [2022-07-09 20:38:43,310][26022] Updated weights on worker 0-0, policy_version 411235 (0.00096) [2022-07-09 20:38:45,103][26022] Updated weights on worker 0-0, policy_version 411245 (0.00083) [2022-07-09 20:38:46,277][25689] Fps is (10 sec: 5479.3, 60 sec: 5598.7, 300 sec: 5636.9). Total num frames: 421120000. Throughput: 0: 5899.0. Samples: 421120064. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:46,278][25689] Avg episode reward: [(0, '-45.343')] [2022-07-09 20:38:46,759][26022] Updated weights on worker 0-0, policy_version 411255 (0.00106) [2022-07-09 20:38:48,826][26022] Updated weights on worker 0-0, policy_version 411265 (0.00088) [2022-07-09 20:38:50,307][26022] Updated weights on worker 0-0, policy_version 411275 (0.00083) [2022-07-09 20:38:51,298][25689] Fps is (10 sec: 5788.3, 60 sec: 5634.9, 300 sec: 5647.6). Total num frames: 421150720. Throughput: 0: 5920.2. Samples: 421154530. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:51,299][25689] Avg episode reward: [(0, '-46.067')] [2022-07-09 20:38:52,575][26022] Updated weights on worker 0-0, policy_version 411285 (0.00085) [2022-07-09 20:38:53,912][26022] Updated weights on worker 0-0, policy_version 411295 (0.00087) [2022-07-09 20:38:55,972][26022] Updated weights on worker 0-0, policy_version 411305 (0.00091) [2022-07-09 20:38:56,303][25689] Fps is (10 sec: 5821.5, 60 sec: 5635.0, 300 sec: 5644.4). Total num frames: 421178368. Throughput: 0: 5058.6. Samples: 421171008. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:38:56,304][25689] Avg episode reward: [(0, '-46.491')] [2022-07-09 20:38:57,603][26022] Updated weights on worker 0-0, policy_version 411315 (0.00087) [2022-07-09 20:38:59,435][26022] Updated weights on worker 0-0, policy_version 411325 (0.00062) [2022-07-09 20:39:01,381][25689] Fps is (10 sec: 5484.4, 60 sec: 5597.3, 300 sec: 5644.4). Total num frames: 421206016. Throughput: 0: 5917.0. Samples: 421205460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:01,382][25689] Avg episode reward: [(0, '-46.440')] [2022-07-09 20:39:01,401][26022] Updated weights on worker 0-0, policy_version 411335 (0.00081) [2022-07-09 20:39:03,411][26022] Updated weights on worker 0-0, policy_version 411345 (0.00096) [2022-07-09 20:39:05,180][26022] Updated weights on worker 0-0, policy_version 411355 (0.00088) [2022-07-09 20:39:06,463][25689] Fps is (10 sec: 5543.4, 60 sec: 5660.7, 300 sec: 5646.5). Total num frames: 421234688. Throughput: 0: 5823.6. Samples: 421237620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:06,465][25689] Avg episode reward: [(0, '-46.498')] [2022-07-09 20:39:07,074][26022] Updated weights on worker 0-0, policy_version 411365 (0.00083) [2022-07-09 20:39:08,747][26022] Updated weights on worker 0-0, policy_version 411375 (0.00094) [2022-07-09 20:39:10,730][26022] Updated weights on worker 0-0, policy_version 411385 (0.00095) [2022-07-09 20:39:11,493][25689] Fps is (10 sec: 5670.7, 60 sec: 5642.2, 300 sec: 5649.5). Total num frames: 421263360. Throughput: 0: 5799.2. Samples: 421271644. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:11,494][25689] Avg episode reward: [(0, '-46.528')] [2022-07-09 20:39:12,508][26022] Updated weights on worker 0-0, policy_version 411395 (0.00087) [2022-07-09 20:39:14,228][26022] Updated weights on worker 0-0, policy_version 411405 (0.00094) [2022-07-09 20:39:16,202][26022] Updated weights on worker 0-0, policy_version 411415 (0.00094) [2022-07-09 20:39:16,499][25689] Fps is (10 sec: 5612.2, 60 sec: 5615.2, 300 sec: 5646.3). Total num frames: 421291008. Throughput: 0: 5828.6. Samples: 421288718. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:16,499][25689] Avg episode reward: [(0, '-46.612')] [2022-07-09 20:39:18,008][26022] Updated weights on worker 0-0, policy_version 411425 (0.00092) [2022-07-09 20:39:19,705][26022] Updated weights on worker 0-0, policy_version 411435 (0.01125) [2022-07-09 20:39:21,500][26022] Updated weights on worker 0-0, policy_version 411445 (0.00092) [2022-07-09 20:39:21,508][25689] Fps is (10 sec: 5623.8, 60 sec: 5651.1, 300 sec: 5644.1). Total num frames: 421319680. Throughput: 0: 5819.8. Samples: 421322594. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:21,508][25689] Avg episode reward: [(0, '-46.490')] [2022-07-09 20:39:23,427][26022] Updated weights on worker 0-0, policy_version 411455 (0.00093) [2022-07-09 20:39:25,226][26022] Updated weights on worker 0-0, policy_version 411465 (0.00087) [2022-07-09 20:39:26,639][25689] Fps is (10 sec: 5655.0, 60 sec: 5649.5, 300 sec: 5641.8). Total num frames: 421348352. Throughput: 0: 5903.2. Samples: 421356722. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:26,640][25689] Avg episode reward: [(0, '-47.275')] [2022-07-09 20:39:26,875][26022] Updated weights on worker 0-0, policy_version 411475 (0.00082) [2022-07-09 20:39:28,740][26022] Updated weights on worker 0-0, policy_version 411485 (0.00086) [2022-07-09 20:39:30,357][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:39:30,370][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000411494_421369856.pth [2022-07-09 20:39:30,379][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000409508_419336192.pth [2022-07-09 20:39:30,670][26022] Updated weights on worker 0-0, policy_version 411495 (0.00092) [2022-07-09 20:39:31,669][25689] Fps is (10 sec: 5643.6, 60 sec: 5647.1, 300 sec: 5645.3). Total num frames: 421377024. Throughput: 0: 5068.9. Samples: 421373912. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:31,670][25689] Avg episode reward: [(0, '-46.686')] [2022-07-09 20:39:32,250][26022] Updated weights on worker 0-0, policy_version 411505 (0.00090) [2022-07-09 20:39:34,149][26022] Updated weights on worker 0-0, policy_version 411515 (0.00086) [2022-07-09 20:39:35,795][26022] Updated weights on worker 0-0, policy_version 411525 (0.00089) [2022-07-09 20:39:36,745][25689] Fps is (10 sec: 5572.9, 60 sec: 5613.8, 300 sec: 5633.6). Total num frames: 421404672. Throughput: 0: 5905.4. Samples: 421408282. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:36,746][25689] Avg episode reward: [(0, '-46.944')] [2022-07-09 20:39:37,743][26022] Updated weights on worker 0-0, policy_version 411535 (0.00087) [2022-07-09 20:39:39,591][26022] Updated weights on worker 0-0, policy_version 411545 (0.00594) [2022-07-09 20:39:41,234][26022] Updated weights on worker 0-0, policy_version 411555 (0.00078) [2022-07-09 20:39:41,769][25689] Fps is (10 sec: 5778.9, 60 sec: 5665.0, 300 sec: 5645.7). Total num frames: 421435392. Throughput: 0: 5921.8. Samples: 421442576. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:41,770][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 20:39:43,079][26022] Updated weights on worker 0-0, policy_version 411565 (0.00090) [2022-07-09 20:39:44,927][26022] Updated weights on worker 0-0, policy_version 411575 (0.00081) [2022-07-09 20:39:46,634][26022] Updated weights on worker 0-0, policy_version 411585 (0.00081) [2022-07-09 20:39:46,834][25689] Fps is (10 sec: 5887.1, 60 sec: 5681.7, 300 sec: 5648.5). Total num frames: 421464064. Throughput: 0: 5101.4. Samples: 421459742. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:46,834][25689] Avg episode reward: [(0, '-46.399')] [2022-07-09 20:39:48,596][26022] Updated weights on worker 0-0, policy_version 411595 (0.00093) [2022-07-09 20:39:50,117][26022] Updated weights on worker 0-0, policy_version 411605 (0.00088) [2022-07-09 20:39:51,879][25689] Fps is (10 sec: 5672.2, 60 sec: 5645.7, 300 sec: 5648.0). Total num frames: 421492736. Throughput: 0: 5952.5. Samples: 421494212. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:51,879][25689] Avg episode reward: [(0, '-45.771')] [2022-07-09 20:39:52,121][26022] Updated weights on worker 0-0, policy_version 411615 (0.00093) [2022-07-09 20:39:53,843][26022] Updated weights on worker 0-0, policy_version 411625 (0.00087) [2022-07-09 20:39:55,490][26022] Updated weights on worker 0-0, policy_version 411635 (0.00093) [2022-07-09 20:39:56,916][25689] Fps is (10 sec: 5586.2, 60 sec: 5642.7, 300 sec: 5644.6). Total num frames: 421520384. Throughput: 0: 5966.2. Samples: 421528624. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:39:56,923][25689] Avg episode reward: [(0, '-45.692')] [2022-07-09 20:39:57,409][26022] Updated weights on worker 0-0, policy_version 411645 (0.00086) [2022-07-09 20:39:58,855][26022] Updated weights on worker 0-0, policy_version 411655 (0.00097) [2022-07-09 20:40:00,902][26022] Updated weights on worker 0-0, policy_version 411665 (0.00086) [2022-07-09 20:40:01,940][25689] Fps is (10 sec: 5597.8, 60 sec: 5664.6, 300 sec: 5652.2). Total num frames: 421549056. Throughput: 0: 5125.2. Samples: 421545956. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-09 20:40:01,940][25689] Avg episode reward: [(0, '-45.313')] [2022-07-09 20:40:03,296][26022] Updated weights on worker 0-0, policy_version 411675 (0.00091) [2022-07-09 20:40:04,823][26022] Updated weights on worker 0-0, policy_version 411685 (0.00088) [2022-07-09 20:40:06,699][26022] Updated weights on worker 0-0, policy_version 411695 (0.00096) [2022-07-09 20:40:07,029][25689] Fps is (10 sec: 5670.3, 60 sec: 5663.9, 300 sec: 5650.8). Total num frames: 421577728. Throughput: 0: 5862.3. Samples: 421578132. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:07,029][25689] Avg episode reward: [(0, '-45.443')] [2022-07-09 20:40:08,499][26022] Updated weights on worker 0-0, policy_version 411705 (0.00084) [2022-07-09 20:40:10,215][26022] Updated weights on worker 0-0, policy_version 411715 (0.00087) [2022-07-09 20:40:12,058][25689] Fps is (10 sec: 5566.3, 60 sec: 5647.1, 300 sec: 5643.5). Total num frames: 421605376. Throughput: 0: 5855.3. Samples: 421612368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:12,058][25689] Avg episode reward: [(0, '-45.863')] [2022-07-09 20:40:12,084][26022] Updated weights on worker 0-0, policy_version 411725 (0.00089) [2022-07-09 20:40:13,796][26022] Updated weights on worker 0-0, policy_version 411735 (0.00084) [2022-07-09 20:40:15,724][26022] Updated weights on worker 0-0, policy_version 411745 (0.00099) [2022-07-09 20:40:17,067][25689] Fps is (10 sec: 5610.7, 60 sec: 5663.7, 300 sec: 5647.5). Total num frames: 421634048. Throughput: 0: 5001.5. Samples: 421629410. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:17,068][25689] Avg episode reward: [(0, '-46.083')] [2022-07-09 20:40:17,379][26022] Updated weights on worker 0-0, policy_version 411755 (0.00083) [2022-07-09 20:40:19,384][26022] Updated weights on worker 0-0, policy_version 411765 (0.00085) [2022-07-09 20:40:21,103][26022] Updated weights on worker 0-0, policy_version 411775 (0.00088) [2022-07-09 20:40:22,129][25689] Fps is (10 sec: 5795.7, 60 sec: 5675.7, 300 sec: 5648.7). Total num frames: 421663744. Throughput: 0: 5842.8. Samples: 421663916. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:22,129][25689] Avg episode reward: [(0, '-46.700')] [2022-07-09 20:40:23,076][26022] Updated weights on worker 0-0, policy_version 411785 (0.00081) [2022-07-09 20:40:24,550][26022] Updated weights on worker 0-0, policy_version 411795 (0.00091) [2022-07-09 20:40:26,595][26022] Updated weights on worker 0-0, policy_version 411805 (0.00336) [2022-07-09 20:40:27,194][25689] Fps is (10 sec: 5763.8, 60 sec: 5681.9, 300 sec: 5648.8). Total num frames: 421692416. Throughput: 0: 5959.4. Samples: 421698302. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:27,194][25689] Avg episode reward: [(0, '-46.730')] [2022-07-09 20:40:28,179][26022] Updated weights on worker 0-0, policy_version 411815 (0.00087) [2022-07-09 20:40:30,138][26022] Updated weights on worker 0-0, policy_version 411825 (0.00085) [2022-07-09 20:40:31,822][26022] Updated weights on worker 0-0, policy_version 411835 (0.00089) [2022-07-09 20:40:32,248][25689] Fps is (10 sec: 5666.9, 60 sec: 5679.6, 300 sec: 5648.4). Total num frames: 421721088. Throughput: 0: 5089.2. Samples: 421715118. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:32,248][25689] Avg episode reward: [(0, '-46.498')] [2022-07-09 20:40:33,864][26022] Updated weights on worker 0-0, policy_version 411845 (0.00089) [2022-07-09 20:40:35,243][26022] Updated weights on worker 0-0, policy_version 411855 (0.00088) [2022-07-09 20:40:37,256][25689] Fps is (10 sec: 5596.9, 60 sec: 5686.0, 300 sec: 5648.6). Total num frames: 421748736. Throughput: 0: 5962.3. Samples: 421749786. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:37,257][25689] Avg episode reward: [(0, '-46.470')] [2022-07-09 20:40:37,515][26022] Updated weights on worker 0-0, policy_version 411865 (0.00088) [2022-07-09 20:40:38,793][26022] Updated weights on worker 0-0, policy_version 411875 (0.00082) [2022-07-09 20:40:40,756][26022] Updated weights on worker 0-0, policy_version 411885 (0.00084) [2022-07-09 20:40:42,296][25689] Fps is (10 sec: 5808.6, 60 sec: 5684.4, 300 sec: 5656.2). Total num frames: 421779456. Throughput: 0: 5982.6. Samples: 421784570. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:42,297][25689] Avg episode reward: [(0, '-45.983')] [2022-07-09 20:40:42,380][26022] Updated weights on worker 0-0, policy_version 411895 (0.00085) [2022-07-09 20:40:44,326][26022] Updated weights on worker 0-0, policy_version 411905 (0.00087) [2022-07-09 20:40:46,165][26022] Updated weights on worker 0-0, policy_version 411915 (0.00091) [2022-07-09 20:40:47,404][25689] Fps is (10 sec: 5852.8, 60 sec: 5680.4, 300 sec: 5651.1). Total num frames: 421808128. Throughput: 0: 5126.9. Samples: 421801918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:47,404][25689] Avg episode reward: [(0, '-46.656')] [2022-07-09 20:40:47,833][26022] Updated weights on worker 0-0, policy_version 411925 (0.00051) [2022-07-09 20:40:49,601][26022] Updated weights on worker 0-0, policy_version 411935 (0.00089) [2022-07-09 20:40:51,459][26022] Updated weights on worker 0-0, policy_version 411945 (0.00089) [2022-07-09 20:40:52,418][25689] Fps is (10 sec: 5665.3, 60 sec: 5683.3, 300 sec: 5654.4). Total num frames: 421836800. Throughput: 0: 6015.0. Samples: 421836444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:52,419][25689] Avg episode reward: [(0, '-46.854')] [2022-07-09 20:40:53,023][26022] Updated weights on worker 0-0, policy_version 411955 (0.00088) [2022-07-09 20:40:54,957][26022] Updated weights on worker 0-0, policy_version 411965 (0.00087) [2022-07-09 20:40:56,679][26022] Updated weights on worker 0-0, policy_version 411975 (0.00094) [2022-07-09 20:40:57,435][25689] Fps is (10 sec: 5716.5, 60 sec: 5702.1, 300 sec: 5654.2). Total num frames: 421865472. Throughput: 0: 5995.7. Samples: 421870774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:40:57,436][25689] Avg episode reward: [(0, '-46.840')] [2022-07-09 20:40:58,435][26022] Updated weights on worker 0-0, policy_version 411985 (0.00084) [2022-07-09 20:41:00,320][26022] Updated weights on worker 0-0, policy_version 411995 (0.00095) [2022-07-09 20:41:02,286][26022] Updated weights on worker 0-0, policy_version 412005 (0.00104) [2022-07-09 20:41:02,483][25689] Fps is (10 sec: 5595.9, 60 sec: 5683.0, 300 sec: 5655.2). Total num frames: 421893120. Throughput: 0: 5121.8. Samples: 421887960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:02,483][25689] Avg episode reward: [(0, '-47.569')] [2022-07-09 20:41:04,270][26022] Updated weights on worker 0-0, policy_version 412015 (0.00090) [2022-07-09 20:41:06,043][26022] Updated weights on worker 0-0, policy_version 412025 (0.00085) [2022-07-09 20:41:07,600][25689] Fps is (10 sec: 5540.4, 60 sec: 5680.3, 300 sec: 5653.5). Total num frames: 421921792. Throughput: 0: 5860.4. Samples: 421920278. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:07,601][25689] Avg episode reward: [(0, '-47.663')] [2022-07-09 20:41:07,949][26022] Updated weights on worker 0-0, policy_version 412035 (0.00091) [2022-07-09 20:41:09,691][26022] Updated weights on worker 0-0, policy_version 412045 (0.00087) [2022-07-09 20:41:11,513][26022] Updated weights on worker 0-0, policy_version 412055 (0.00092) [2022-07-09 20:41:12,697][25689] Fps is (10 sec: 5714.4, 60 sec: 5707.7, 300 sec: 5659.3). Total num frames: 421951488. Throughput: 0: 5829.1. Samples: 421954650. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:12,697][25689] Avg episode reward: [(0, '-47.498')] [2022-07-09 20:41:13,366][26022] Updated weights on worker 0-0, policy_version 412065 (0.00093) [2022-07-09 20:41:14,965][26022] Updated weights on worker 0-0, policy_version 412075 (0.00081) [2022-07-09 20:41:16,935][26022] Updated weights on worker 0-0, policy_version 412085 (0.00087) [2022-07-09 20:41:17,709][25689] Fps is (10 sec: 5774.4, 60 sec: 5707.5, 300 sec: 5659.4). Total num frames: 421980160. Throughput: 0: 5836.1. Samples: 421989090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:17,709][25689] Avg episode reward: [(0, '-46.430')] [2022-07-09 20:41:18,691][26022] Updated weights on worker 0-0, policy_version 412095 (0.00090) [2022-07-09 20:41:20,608][26022] Updated weights on worker 0-0, policy_version 412105 (0.00080) [2022-07-09 20:41:22,296][26022] Updated weights on worker 0-0, policy_version 412115 (0.00084) [2022-07-09 20:41:22,766][25689] Fps is (10 sec: 5695.2, 60 sec: 5691.1, 300 sec: 5659.8). Total num frames: 422008832. Throughput: 0: 5834.1. Samples: 422006292. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:22,766][25689] Avg episode reward: [(0, '-46.761')] [2022-07-09 20:41:24,013][26022] Updated weights on worker 0-0, policy_version 412125 (0.00086) [2022-07-09 20:41:25,683][26022] Updated weights on worker 0-0, policy_version 412135 (0.00095) [2022-07-09 20:41:27,863][25689] Fps is (10 sec: 5445.5, 60 sec: 5654.3, 300 sec: 5649.4). Total num frames: 422035456. Throughput: 0: 5942.2. Samples: 422040682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:27,864][25689] Avg episode reward: [(0, '-45.231')] [2022-07-09 20:41:27,867][26022] Updated weights on worker 0-0, policy_version 412145 (0.00092) [2022-07-09 20:41:29,272][26022] Updated weights on worker 0-0, policy_version 412155 (0.00091) [2022-07-09 20:41:30,682][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:41:30,695][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000412161_422052864.pth [2022-07-09 20:41:30,695][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000410171_420015104.pth [2022-07-09 20:41:31,327][26022] Updated weights on worker 0-0, policy_version 412165 (0.00082) [2022-07-09 20:41:32,814][26022] Updated weights on worker 0-0, policy_version 412175 (0.00092) [2022-07-09 20:41:32,916][25689] Fps is (10 sec: 5750.5, 60 sec: 5705.0, 300 sec: 5662.4). Total num frames: 422067200. Throughput: 0: 5952.7. Samples: 422075008. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:32,916][25689] Avg episode reward: [(0, '-45.192')] [2022-07-09 20:41:34,749][26022] Updated weights on worker 0-0, policy_version 412185 (0.00093) [2022-07-09 20:41:36,422][26022] Updated weights on worker 0-0, policy_version 412195 (0.00092) [2022-07-09 20:41:38,008][25689] Fps is (10 sec: 5955.2, 60 sec: 5714.0, 300 sec: 5657.9). Total num frames: 422095872. Throughput: 0: 5079.2. Samples: 422092194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:38,010][25689] Avg episode reward: [(0, '-45.611')] [2022-07-09 20:41:38,260][26022] Updated weights on worker 0-0, policy_version 412205 (0.00086) [2022-07-09 20:41:40,107][26022] Updated weights on worker 0-0, policy_version 412215 (0.00087) [2022-07-09 20:41:41,775][26022] Updated weights on worker 0-0, policy_version 412225 (0.00086) [2022-07-09 20:41:43,067][25689] Fps is (10 sec: 5649.1, 60 sec: 5678.6, 300 sec: 5658.1). Total num frames: 422124544. Throughput: 0: 5928.0. Samples: 422126636. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:43,067][25689] Avg episode reward: [(0, '-45.318')] [2022-07-09 20:41:43,808][26022] Updated weights on worker 0-0, policy_version 412235 (0.00081) [2022-07-09 20:41:45,529][26022] Updated weights on worker 0-0, policy_version 412245 (0.00085) [2022-07-09 20:41:47,290][26022] Updated weights on worker 0-0, policy_version 412255 (0.00090) [2022-07-09 20:41:48,125][25689] Fps is (10 sec: 5668.1, 60 sec: 5683.2, 300 sec: 5660.9). Total num frames: 422153216. Throughput: 0: 5940.1. Samples: 422161040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:48,125][25689] Avg episode reward: [(0, '-45.254')] [2022-07-09 20:41:49,039][26022] Updated weights on worker 0-0, policy_version 412265 (0.00713) [2022-07-09 20:41:51,031][26022] Updated weights on worker 0-0, policy_version 412275 (0.00091) [2022-07-09 20:41:52,679][26022] Updated weights on worker 0-0, policy_version 412285 (0.00080) [2022-07-09 20:41:53,133][25689] Fps is (10 sec: 5696.7, 60 sec: 5683.8, 300 sec: 5661.3). Total num frames: 422181888. Throughput: 0: 5101.2. Samples: 422178140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:53,133][25689] Avg episode reward: [(0, '-46.392')] [2022-07-09 20:41:54,484][26022] Updated weights on worker 0-0, policy_version 412295 (0.00087) [2022-07-09 20:41:56,272][26022] Updated weights on worker 0-0, policy_version 412305 (0.00051) [2022-07-09 20:41:57,959][26022] Updated weights on worker 0-0, policy_version 412315 (0.00087) [2022-07-09 20:41:58,140][25689] Fps is (10 sec: 5827.8, 60 sec: 5701.5, 300 sec: 5661.7). Total num frames: 422211584. Throughput: 0: 5991.0. Samples: 422212808. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:41:58,141][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 20:41:59,836][26022] Updated weights on worker 0-0, policy_version 412325 (0.00091) [2022-07-09 20:42:01,540][26022] Updated weights on worker 0-0, policy_version 412335 (0.00085) [2022-07-09 20:42:03,196][25689] Fps is (10 sec: 5494.7, 60 sec: 5667.0, 300 sec: 5661.4). Total num frames: 422237184. Throughput: 0: 5882.3. Samples: 422245044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:03,197][25689] Avg episode reward: [(0, '-47.108')] [2022-07-09 20:42:03,835][26022] Updated weights on worker 0-0, policy_version 412345 (0.00093) [2022-07-09 20:42:05,671][26022] Updated weights on worker 0-0, policy_version 412355 (0.00097) [2022-07-09 20:42:07,509][26022] Updated weights on worker 0-0, policy_version 412365 (0.00172) [2022-07-09 20:42:08,334][25689] Fps is (10 sec: 5424.8, 60 sec: 5682.0, 300 sec: 5658.8). Total num frames: 422266880. Throughput: 0: 5005.2. Samples: 422262188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:08,334][25689] Avg episode reward: [(0, '-46.357')] [2022-07-09 20:42:09,147][26022] Updated weights on worker 0-0, policy_version 412375 (0.00084) [2022-07-09 20:42:10,924][26022] Updated weights on worker 0-0, policy_version 412385 (0.00088) [2022-07-09 20:42:12,659][26022] Updated weights on worker 0-0, policy_version 412395 (0.00090) [2022-07-09 20:42:13,418][25689] Fps is (10 sec: 5810.5, 60 sec: 5683.2, 300 sec: 5668.7). Total num frames: 422296576. Throughput: 0: 5845.1. Samples: 422296708. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:13,418][25689] Avg episode reward: [(0, '-46.591')] [2022-07-09 20:42:14,691][26022] Updated weights on worker 0-0, policy_version 412405 (0.00092) [2022-07-09 20:42:16,121][26022] Updated weights on worker 0-0, policy_version 412415 (0.00088) [2022-07-09 20:42:18,179][26022] Updated weights on worker 0-0, policy_version 412425 (0.00088) [2022-07-09 20:42:18,460][25689] Fps is (10 sec: 5663.0, 60 sec: 5663.5, 300 sec: 5654.9). Total num frames: 422324224. Throughput: 0: 5819.8. Samples: 422331062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:18,460][25689] Avg episode reward: [(0, '-45.774')] [2022-07-09 20:42:20,059][26022] Updated weights on worker 0-0, policy_version 412435 (0.00084) [2022-07-09 20:42:21,600][26022] Updated weights on worker 0-0, policy_version 412445 (0.00085) [2022-07-09 20:42:23,432][26022] Updated weights on worker 0-0, policy_version 412455 (0.00101) [2022-07-09 20:42:23,490][25689] Fps is (10 sec: 5693.3, 60 sec: 5682.9, 300 sec: 5666.9). Total num frames: 422353920. Throughput: 0: 5091.4. Samples: 422348368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:23,490][25689] Avg episode reward: [(0, '-45.970')] [2022-07-09 20:42:25,166][26022] Updated weights on worker 0-0, policy_version 412465 (0.00082) [2022-07-09 20:42:27,156][26022] Updated weights on worker 0-0, policy_version 412475 (0.00084) [2022-07-09 20:42:28,539][25689] Fps is (10 sec: 5689.5, 60 sec: 5704.3, 300 sec: 5663.4). Total num frames: 422381568. Throughput: 0: 5958.1. Samples: 422382572. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:28,539][25689] Avg episode reward: [(0, '-45.960')] [2022-07-09 20:42:28,937][26022] Updated weights on worker 0-0, policy_version 412485 (0.00087) [2022-07-09 20:42:30,647][26022] Updated weights on worker 0-0, policy_version 412495 (0.00129) [2022-07-09 20:42:32,452][26022] Updated weights on worker 0-0, policy_version 412505 (0.00089) [2022-07-09 20:42:33,556][25689] Fps is (10 sec: 5696.5, 60 sec: 5673.8, 300 sec: 5663.9). Total num frames: 422411264. Throughput: 0: 5949.1. Samples: 422416516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:33,557][25689] Avg episode reward: [(0, '-45.804')] [2022-07-09 20:42:34,287][26022] Updated weights on worker 0-0, policy_version 412515 (0.00088) [2022-07-09 20:42:36,202][26022] Updated weights on worker 0-0, policy_version 412525 (0.00083) [2022-07-09 20:42:37,782][26022] Updated weights on worker 0-0, policy_version 412535 (0.00052) [2022-07-09 20:42:38,571][25689] Fps is (10 sec: 5817.7, 60 sec: 5681.0, 300 sec: 5665.0). Total num frames: 422439936. Throughput: 0: 5101.4. Samples: 422433660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:38,573][25689] Avg episode reward: [(0, '-46.408')] [2022-07-09 20:42:39,700][26022] Updated weights on worker 0-0, policy_version 412545 (0.00083) [2022-07-09 20:42:41,255][26022] Updated weights on worker 0-0, policy_version 412555 (0.00226) [2022-07-09 20:42:43,232][26022] Updated weights on worker 0-0, policy_version 412565 (0.00086) [2022-07-09 20:42:43,599][25689] Fps is (10 sec: 5812.2, 60 sec: 5700.9, 300 sec: 5669.0). Total num frames: 422469632. Throughput: 0: 5974.7. Samples: 422468512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:43,599][25689] Avg episode reward: [(0, '-46.578')] [2022-07-09 20:42:45,005][26022] Updated weights on worker 0-0, policy_version 412575 (0.00088) [2022-07-09 20:42:46,830][26022] Updated weights on worker 0-0, policy_version 412585 (0.00091) [2022-07-09 20:42:48,475][26022] Updated weights on worker 0-0, policy_version 412595 (0.00080) [2022-07-09 20:42:48,671][25689] Fps is (10 sec: 5779.5, 60 sec: 5699.6, 300 sec: 5668.5). Total num frames: 422498304. Throughput: 0: 5996.9. Samples: 422503302. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:48,672][25689] Avg episode reward: [(0, '-46.562')] [2022-07-09 20:42:50,257][26022] Updated weights on worker 0-0, policy_version 412605 (0.00090) [2022-07-09 20:42:51,875][26022] Updated weights on worker 0-0, policy_version 412615 (0.00087) [2022-07-09 20:42:53,678][25689] Fps is (10 sec: 5689.2, 60 sec: 5699.7, 300 sec: 5671.9). Total num frames: 422526976. Throughput: 0: 5179.0. Samples: 422520726. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:53,679][25689] Avg episode reward: [(0, '-45.809')] [2022-07-09 20:42:53,783][26022] Updated weights on worker 0-0, policy_version 412625 (0.00087) [2022-07-09 20:42:55,501][26022] Updated weights on worker 0-0, policy_version 412635 (0.00082) [2022-07-09 20:42:57,224][26022] Updated weights on worker 0-0, policy_version 412645 (0.00092) [2022-07-09 20:42:58,685][25689] Fps is (10 sec: 5726.3, 60 sec: 5682.8, 300 sec: 5669.1). Total num frames: 422555648. Throughput: 0: 6068.3. Samples: 422555714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:42:58,686][25689] Avg episode reward: [(0, '-45.983')] [2022-07-09 20:42:59,140][26022] Updated weights on worker 0-0, policy_version 412655 (0.00090) [2022-07-09 20:43:00,827][26022] Updated weights on worker 0-0, policy_version 412665 (0.00090) [2022-07-09 20:43:03,263][26022] Updated weights on worker 0-0, policy_version 412675 (0.00087) [2022-07-09 20:43:03,697][25689] Fps is (10 sec: 5723.6, 60 sec: 5737.7, 300 sec: 5683.3). Total num frames: 422584320. Throughput: 0: 5932.0. Samples: 422587734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:03,697][25689] Avg episode reward: [(0, '-46.722')] [2022-07-09 20:43:04,858][26022] Updated weights on worker 0-0, policy_version 412685 (0.00091) [2022-07-09 20:43:06,628][26022] Updated weights on worker 0-0, policy_version 412695 (0.00092) [2022-07-09 20:43:08,563][26022] Updated weights on worker 0-0, policy_version 412705 (0.00086) [2022-07-09 20:43:08,760][25689] Fps is (10 sec: 5488.3, 60 sec: 5693.9, 300 sec: 5672.0). Total num frames: 422610944. Throughput: 0: 5055.1. Samples: 422604856. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:08,760][25689] Avg episode reward: [(0, '-46.345')] [2022-07-09 20:43:10,263][26022] Updated weights on worker 0-0, policy_version 412715 (0.00090) [2022-07-09 20:43:12,047][26022] Updated weights on worker 0-0, policy_version 412725 (0.00083) [2022-07-09 20:43:13,779][25689] Fps is (10 sec: 5586.3, 60 sec: 5700.1, 300 sec: 5673.2). Total num frames: 422640640. Throughput: 0: 5901.0. Samples: 422639340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:13,784][26022] Updated weights on worker 0-0, policy_version 412735 (0.00084) [2022-07-09 20:43:13,779][25689] Avg episode reward: [(0, '-46.172')] [2022-07-09 20:43:15,627][26022] Updated weights on worker 0-0, policy_version 412745 (0.00101) [2022-07-09 20:43:17,395][26022] Updated weights on worker 0-0, policy_version 412755 (0.00093) [2022-07-09 20:43:18,799][25689] Fps is (10 sec: 5813.8, 60 sec: 5719.1, 300 sec: 5680.2). Total num frames: 422669312. Throughput: 0: 5862.0. Samples: 422673626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:18,800][25689] Avg episode reward: [(0, '-46.098')] [2022-07-09 20:43:19,334][26022] Updated weights on worker 0-0, policy_version 412765 (0.00090) [2022-07-09 20:43:21,039][26022] Updated weights on worker 0-0, policy_version 412775 (0.00086) [2022-07-09 20:43:22,787][26022] Updated weights on worker 0-0, policy_version 412785 (0.00076) [2022-07-09 20:43:23,881][25689] Fps is (10 sec: 5676.1, 60 sec: 5697.2, 300 sec: 5680.8). Total num frames: 422697984. Throughput: 0: 5110.3. Samples: 422690882. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:23,882][25689] Avg episode reward: [(0, '-46.568')] [2022-07-09 20:43:24,567][26022] Updated weights on worker 0-0, policy_version 412795 (0.00079) [2022-07-09 20:43:26,259][26022] Updated weights on worker 0-0, policy_version 412805 (0.00089) [2022-07-09 20:43:28,347][26022] Updated weights on worker 0-0, policy_version 412815 (0.00088) [2022-07-09 20:43:28,927][25689] Fps is (10 sec: 5662.0, 60 sec: 5714.5, 300 sec: 5680.0). Total num frames: 422726656. Throughput: 0: 5953.2. Samples: 422724916. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:28,929][25689] Avg episode reward: [(0, '-45.901')] [2022-07-09 20:43:30,056][26022] Updated weights on worker 0-0, policy_version 412825 (0.00092) [2022-07-09 20:43:30,719][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:43:30,734][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000412829_422736896.pth [2022-07-09 20:43:30,734][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000410833_420692992.pth [2022-07-09 20:43:31,784][26022] Updated weights on worker 0-0, policy_version 412835 (0.00092) [2022-07-09 20:43:33,555][26022] Updated weights on worker 0-0, policy_version 412845 (0.00084) [2022-07-09 20:43:34,028][25689] Fps is (10 sec: 5651.5, 60 sec: 5689.7, 300 sec: 5676.2). Total num frames: 422755328. Throughput: 0: 5898.2. Samples: 422758774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:34,030][25689] Avg episode reward: [(0, '-45.471')] [2022-07-09 20:43:35,451][26022] Updated weights on worker 0-0, policy_version 412855 (0.00055) [2022-07-09 20:43:37,215][26022] Updated weights on worker 0-0, policy_version 412865 (0.00081) [2022-07-09 20:43:39,095][25689] Fps is (10 sec: 5539.1, 60 sec: 5667.9, 300 sec: 5675.5). Total num frames: 422782976. Throughput: 0: 5874.2. Samples: 422792846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:39,096][25689] Avg episode reward: [(0, '-45.818')] [2022-07-09 20:43:39,106][26022] Updated weights on worker 0-0, policy_version 412875 (0.00087) [2022-07-09 20:43:41,054][26022] Updated weights on worker 0-0, policy_version 412885 (0.00091) [2022-07-09 20:43:42,775][26022] Updated weights on worker 0-0, policy_version 412895 (0.00091) [2022-07-09 20:43:44,110][25689] Fps is (10 sec: 5687.7, 60 sec: 5669.0, 300 sec: 5683.3). Total num frames: 422812672. Throughput: 0: 5875.4. Samples: 422809734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:44,111][25689] Avg episode reward: [(0, '-46.715')] [2022-07-09 20:43:44,533][26022] Updated weights on worker 0-0, policy_version 412905 (0.00088) [2022-07-09 20:43:46,372][26022] Updated weights on worker 0-0, policy_version 412915 (0.00085) [2022-07-09 20:43:48,076][26022] Updated weights on worker 0-0, policy_version 412925 (0.00087) [2022-07-09 20:43:49,234][25689] Fps is (10 sec: 5757.0, 60 sec: 5664.2, 300 sec: 5674.5). Total num frames: 422841344. Throughput: 0: 5866.6. Samples: 422844046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:49,236][25689] Avg episode reward: [(0, '-46.052')] [2022-07-09 20:43:49,933][26022] Updated weights on worker 0-0, policy_version 412935 (0.00084) [2022-07-09 20:43:51,783][26022] Updated weights on worker 0-0, policy_version 412945 (0.00087) [2022-07-09 20:43:53,582][26022] Updated weights on worker 0-0, policy_version 412955 (0.00086) [2022-07-09 20:43:54,264][25689] Fps is (10 sec: 5647.6, 60 sec: 5662.1, 300 sec: 5677.4). Total num frames: 422870016. Throughput: 0: 5896.7. Samples: 422878098. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:54,264][25689] Avg episode reward: [(0, '-46.011')] [2022-07-09 20:43:55,407][26022] Updated weights on worker 0-0, policy_version 412965 (0.00091) [2022-07-09 20:43:56,867][26022] Updated weights on worker 0-0, policy_version 412975 (0.00088) [2022-07-09 20:43:58,915][26022] Updated weights on worker 0-0, policy_version 412985 (0.00097) [2022-07-09 20:43:59,267][25689] Fps is (10 sec: 5715.3, 60 sec: 5662.4, 300 sec: 5682.3). Total num frames: 422898688. Throughput: 0: 5087.2. Samples: 422895464. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:43:59,268][25689] Avg episode reward: [(0, '-46.374')] [2022-07-09 20:44:00,676][26022] Updated weights on worker 0-0, policy_version 412995 (0.01256) [2022-07-09 20:44:02,705][26022] Updated weights on worker 0-0, policy_version 413005 (0.00087) [2022-07-09 20:44:04,303][25689] Fps is (10 sec: 5406.2, 60 sec: 5609.5, 300 sec: 5672.9). Total num frames: 422924288. Throughput: 0: 5850.9. Samples: 422927878. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:04,304][25689] Avg episode reward: [(0, '-45.816')] [2022-07-09 20:44:04,756][26022] Updated weights on worker 0-0, policy_version 413015 (0.00099) [2022-07-09 20:44:06,351][26022] Updated weights on worker 0-0, policy_version 413025 (0.00086) [2022-07-09 20:44:08,211][26022] Updated weights on worker 0-0, policy_version 413035 (0.00093) [2022-07-09 20:44:09,392][25689] Fps is (10 sec: 5461.2, 60 sec: 5657.7, 300 sec: 5675.2). Total num frames: 422953984. Throughput: 0: 5859.2. Samples: 422962160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:09,398][25689] Avg episode reward: [(0, '-45.713')] [2022-07-09 20:44:09,949][26022] Updated weights on worker 0-0, policy_version 413045 (0.00085) [2022-07-09 20:44:11,735][26022] Updated weights on worker 0-0, policy_version 413055 (0.00090) [2022-07-09 20:44:13,570][26022] Updated weights on worker 0-0, policy_version 413065 (0.00079) [2022-07-09 20:44:14,454][25689] Fps is (10 sec: 5850.6, 60 sec: 5653.7, 300 sec: 5681.0). Total num frames: 422983680. Throughput: 0: 5011.3. Samples: 422979278. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:14,455][25689] Avg episode reward: [(0, '-45.646')] [2022-07-09 20:44:15,260][26022] Updated weights on worker 0-0, policy_version 413075 (0.00095) [2022-07-09 20:44:17,031][26022] Updated weights on worker 0-0, policy_version 413085 (0.00099) [2022-07-09 20:44:19,006][26022] Updated weights on worker 0-0, policy_version 413095 (0.00093) [2022-07-09 20:44:19,525][25689] Fps is (10 sec: 5659.5, 60 sec: 5632.2, 300 sec: 5676.4). Total num frames: 423011328. Throughput: 0: 5832.6. Samples: 423013618. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:19,531][25689] Avg episode reward: [(0, '-45.583')] [2022-07-09 20:44:20,685][26022] Updated weights on worker 0-0, policy_version 413105 (0.00098) [2022-07-09 20:44:22,834][26022] Updated weights on worker 0-0, policy_version 413115 (0.00092) [2022-07-09 20:44:24,184][26022] Updated weights on worker 0-0, policy_version 413125 (0.00098) [2022-07-09 20:44:24,562][25689] Fps is (10 sec: 5673.2, 60 sec: 5653.2, 300 sec: 5681.6). Total num frames: 423041024. Throughput: 0: 5913.6. Samples: 423047684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:24,563][25689] Avg episode reward: [(0, '-45.167')] [2022-07-09 20:44:26,371][26022] Updated weights on worker 0-0, policy_version 413135 (0.00086) [2022-07-09 20:44:28,034][26022] Updated weights on worker 0-0, policy_version 413145 (0.00088) [2022-07-09 20:44:29,599][25689] Fps is (10 sec: 5793.9, 60 sec: 5654.1, 300 sec: 5681.4). Total num frames: 423069696. Throughput: 0: 5072.1. Samples: 423064650. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:29,599][25689] Avg episode reward: [(0, '-45.290')] [2022-07-09 20:44:29,786][26022] Updated weights on worker 0-0, policy_version 413155 (0.00089) [2022-07-09 20:44:31,578][26022] Updated weights on worker 0-0, policy_version 413165 (0.00090) [2022-07-09 20:44:33,464][26022] Updated weights on worker 0-0, policy_version 413175 (0.00090) [2022-07-09 20:44:34,668][25689] Fps is (10 sec: 5572.8, 60 sec: 5640.1, 300 sec: 5681.6). Total num frames: 423097344. Throughput: 0: 5906.7. Samples: 423098678. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:34,669][25689] Avg episode reward: [(0, '-45.605')] [2022-07-09 20:44:35,183][26022] Updated weights on worker 0-0, policy_version 413185 (0.00081) [2022-07-09 20:44:36,919][26022] Updated weights on worker 0-0, policy_version 413195 (0.00088) [2022-07-09 20:44:38,828][26022] Updated weights on worker 0-0, policy_version 413205 (0.00091) [2022-07-09 20:44:39,684][25689] Fps is (10 sec: 5584.4, 60 sec: 5661.8, 300 sec: 5674.9). Total num frames: 423126016. Throughput: 0: 5924.3. Samples: 423133048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:39,684][25689] Avg episode reward: [(0, '-46.294')] [2022-07-09 20:44:40,455][26022] Updated weights on worker 0-0, policy_version 413215 (0.00085) [2022-07-09 20:44:42,403][26022] Updated weights on worker 0-0, policy_version 413225 (0.00080) [2022-07-09 20:44:44,128][26022] Updated weights on worker 0-0, policy_version 413235 (0.00087) [2022-07-09 20:44:44,715][25689] Fps is (10 sec: 5708.0, 60 sec: 5643.4, 300 sec: 5675.5). Total num frames: 423154688. Throughput: 0: 5091.2. Samples: 423150284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:44,715][25689] Avg episode reward: [(0, '-46.955')] [2022-07-09 20:44:45,991][26022] Updated weights on worker 0-0, policy_version 413245 (0.00089) [2022-07-09 20:44:47,700][26022] Updated weights on worker 0-0, policy_version 413255 (0.00093) [2022-07-09 20:44:49,492][26022] Updated weights on worker 0-0, policy_version 413265 (0.00086) [2022-07-09 20:44:49,788][25689] Fps is (10 sec: 5776.5, 60 sec: 5665.0, 300 sec: 5678.4). Total num frames: 423184384. Throughput: 0: 5944.3. Samples: 423184664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:49,789][25689] Avg episode reward: [(0, '-47.385')] [2022-07-09 20:44:51,222][26022] Updated weights on worker 0-0, policy_version 413275 (0.00085) [2022-07-09 20:44:53,057][26022] Updated weights on worker 0-0, policy_version 413285 (0.00090) [2022-07-09 20:44:54,803][25689] Fps is (10 sec: 5785.6, 60 sec: 5666.4, 300 sec: 5682.3). Total num frames: 423213056. Throughput: 0: 5973.6. Samples: 423218954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:54,804][25689] Avg episode reward: [(0, '-47.898')] [2022-07-09 20:44:54,991][26022] Updated weights on worker 0-0, policy_version 413295 (0.00076) [2022-07-09 20:44:56,537][26022] Updated weights on worker 0-0, policy_version 413305 (0.00083) [2022-07-09 20:44:58,351][26022] Updated weights on worker 0-0, policy_version 413315 (0.00083) [2022-07-09 20:44:59,843][25689] Fps is (10 sec: 5703.5, 60 sec: 5663.0, 300 sec: 5682.0). Total num frames: 423241728. Throughput: 0: 5122.7. Samples: 423236314. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:44:59,843][25689] Avg episode reward: [(0, '-47.682')] [2022-07-09 20:45:00,168][26022] Updated weights on worker 0-0, policy_version 413325 (0.00091) [2022-07-09 20:45:02,547][26022] Updated weights on worker 0-0, policy_version 413335 (0.00083) [2022-07-09 20:45:04,258][26022] Updated weights on worker 0-0, policy_version 413345 (0.00084) [2022-07-09 20:45:04,865][25689] Fps is (10 sec: 5597.2, 60 sec: 5698.0, 300 sec: 5679.8). Total num frames: 423269376. Throughput: 0: 5863.2. Samples: 423268430. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:45:04,866][25689] Avg episode reward: [(0, '-46.699')] [2022-07-09 20:45:06,027][26022] Updated weights on worker 0-0, policy_version 413355 (0.00089) [2022-07-09 20:45:07,811][26022] Updated weights on worker 0-0, policy_version 413365 (0.00092) [2022-07-09 20:45:09,850][26022] Updated weights on worker 0-0, policy_version 413375 (0.00089) [2022-07-09 20:45:09,919][25689] Fps is (10 sec: 5386.1, 60 sec: 5650.7, 300 sec: 5675.9). Total num frames: 423296000. Throughput: 0: 5872.9. Samples: 423302888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:45:09,919][25689] Avg episode reward: [(0, '-46.287')] [2022-07-09 20:45:11,205][26022] Updated weights on worker 0-0, policy_version 413385 (0.00082) [2022-07-09 20:45:13,309][26022] Updated weights on worker 0-0, policy_version 413395 (0.00092) [2022-07-09 20:45:14,799][26022] Updated weights on worker 0-0, policy_version 413405 (0.00098) [2022-07-09 20:45:14,939][25689] Fps is (10 sec: 5692.6, 60 sec: 5671.5, 300 sec: 5682.6). Total num frames: 423326720. Throughput: 0: 5016.8. Samples: 423319970. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:45:14,939][25689] Avg episode reward: [(0, '-46.361')] [2022-07-09 20:45:16,909][26022] Updated weights on worker 0-0, policy_version 413415 (0.00087) [2022-07-09 20:45:18,422][26022] Updated weights on worker 0-0, policy_version 413425 (0.00086) [2022-07-09 20:45:19,967][25689] Fps is (10 sec: 5808.6, 60 sec: 5675.4, 300 sec: 5676.3). Total num frames: 423354368. Throughput: 0: 5850.6. Samples: 423354056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:45:19,968][25689] Avg episode reward: [(0, '-46.907')] [2022-07-09 20:45:20,504][26022] Updated weights on worker 0-0, policy_version 413435 (0.00084) [2022-07-09 20:45:22,106][26022] Updated weights on worker 0-0, policy_version 413445 (0.00085) [2022-07-09 20:45:24,061][26022] Updated weights on worker 0-0, policy_version 413455 (0.00085) [2022-07-09 20:45:24,985][25689] Fps is (10 sec: 5707.8, 60 sec: 5677.3, 300 sec: 5680.6). Total num frames: 423384064. Throughput: 0: 5982.8. Samples: 423388804. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 20:45:24,986][25689] Avg episode reward: [(0, '-46.297')] [2022-07-09 20:45:25,566][26022] Updated weights on worker 0-0, policy_version 413465 (0.00090) [2022-07-09 20:45:27,818][26022] Updated weights on worker 0-0, policy_version 413475 (0.00091) [2022-07-09 20:45:29,073][26022] Updated weights on worker 0-0, policy_version 413485 (0.00084) [2022-07-09 20:45:30,048][25689] Fps is (10 sec: 5688.8, 60 sec: 5657.9, 300 sec: 5677.0). Total num frames: 423411712. Throughput: 0: 5102.6. Samples: 423405596. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:45:30,048][25689] Avg episode reward: [(0, '-47.235')] [2022-07-09 20:45:30,840][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:45:30,864][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000413492_423415808.pth [2022-07-09 20:45:30,864][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000411494_421369856.pth [2022-07-09 20:45:31,284][26022] Updated weights on worker 0-0, policy_version 413495 (0.00092) [2022-07-09 20:45:32,745][26022] Updated weights on worker 0-0, policy_version 413505 (0.00089) [2022-07-09 20:45:34,697][26022] Updated weights on worker 0-0, policy_version 413515 (0.00086) [2022-07-09 20:45:35,055][25689] Fps is (10 sec: 5694.8, 60 sec: 5697.7, 300 sec: 5684.0). Total num frames: 423441408. Throughput: 0: 5955.4. Samples: 423439768. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:45:35,055][25689] Avg episode reward: [(0, '-47.375')] [2022-07-09 20:45:36,342][26022] Updated weights on worker 0-0, policy_version 413525 (0.00086) [2022-07-09 20:45:38,312][26022] Updated weights on worker 0-0, policy_version 413535 (0.00082) [2022-07-09 20:45:40,069][25689] Fps is (10 sec: 5722.2, 60 sec: 5680.9, 300 sec: 5674.1). Total num frames: 423469056. Throughput: 0: 5999.6. Samples: 423474656. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:45:40,069][25689] Avg episode reward: [(0, '-47.281')] [2022-07-09 20:45:40,096][26022] Updated weights on worker 0-0, policy_version 413545 (0.00094) [2022-07-09 20:45:41,850][26022] Updated weights on worker 0-0, policy_version 413555 (0.00096) [2022-07-09 20:45:43,455][26022] Updated weights on worker 0-0, policy_version 413565 (0.00095) [2022-07-09 20:45:45,070][25689] Fps is (10 sec: 5725.7, 60 sec: 5700.7, 300 sec: 5679.6). Total num frames: 423498752. Throughput: 0: 5131.9. Samples: 423491876. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:45:45,070][25689] Avg episode reward: [(0, '-47.207')] [2022-07-09 20:45:45,495][26022] Updated weights on worker 0-0, policy_version 413575 (0.00087) [2022-07-09 20:45:47,062][26022] Updated weights on worker 0-0, policy_version 413585 (0.00090) [2022-07-09 20:45:48,944][26022] Updated weights on worker 0-0, policy_version 413595 (0.00087) [2022-07-09 20:45:50,109][25689] Fps is (10 sec: 5813.6, 60 sec: 5687.0, 300 sec: 5679.1). Total num frames: 423527424. Throughput: 0: 6038.4. Samples: 423526734. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:45:50,109][25689] Avg episode reward: [(0, '-47.432')] [2022-07-09 20:45:50,626][26022] Updated weights on worker 0-0, policy_version 413605 (0.00088) [2022-07-09 20:45:52,498][26022] Updated weights on worker 0-0, policy_version 413615 (0.00091) [2022-07-09 20:45:54,191][26022] Updated weights on worker 0-0, policy_version 413625 (0.00090) [2022-07-09 20:45:55,131][25689] Fps is (10 sec: 5699.7, 60 sec: 5686.3, 300 sec: 5679.0). Total num frames: 423556096. Throughput: 0: 6045.9. Samples: 423561146. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:45:55,131][25689] Avg episode reward: [(0, '-47.868')] [2022-07-09 20:45:56,057][26022] Updated weights on worker 0-0, policy_version 413635 (0.00095) [2022-07-09 20:45:57,761][26022] Updated weights on worker 0-0, policy_version 413645 (0.00082) [2022-07-09 20:45:59,609][26022] Updated weights on worker 0-0, policy_version 413655 (0.00086) [2022-07-09 20:46:00,143][25689] Fps is (10 sec: 5816.7, 60 sec: 5705.8, 300 sec: 5686.6). Total num frames: 423585792. Throughput: 0: 5167.2. Samples: 423578384. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:00,145][25689] Avg episode reward: [(0, '-47.798')] [2022-07-09 20:46:01,536][26022] Updated weights on worker 0-0, policy_version 413665 (0.00085) [2022-07-09 20:46:03,605][26022] Updated weights on worker 0-0, policy_version 413675 (0.00092) [2022-07-09 20:46:05,155][25689] Fps is (10 sec: 5516.2, 60 sec: 5672.9, 300 sec: 5678.3). Total num frames: 423611392. Throughput: 0: 5912.0. Samples: 423610620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:05,155][25689] Avg episode reward: [(0, '-47.865')] [2022-07-09 20:46:05,411][26022] Updated weights on worker 0-0, policy_version 413685 (0.00089) [2022-07-09 20:46:06,990][26022] Updated weights on worker 0-0, policy_version 413695 (0.00092) [2022-07-09 20:46:09,236][26022] Updated weights on worker 0-0, policy_version 413705 (0.00090) [2022-07-09 20:46:10,202][25689] Fps is (10 sec: 5497.3, 60 sec: 5724.4, 300 sec: 5679.2). Total num frames: 423641088. Throughput: 0: 5864.7. Samples: 423644576. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:10,204][25689] Avg episode reward: [(0, '-47.789')] [2022-07-09 20:46:10,785][26022] Updated weights on worker 0-0, policy_version 413715 (0.00089) [2022-07-09 20:46:12,475][26022] Updated weights on worker 0-0, policy_version 413725 (0.00088) [2022-07-09 20:46:14,386][26022] Updated weights on worker 0-0, policy_version 413735 (0.00086) [2022-07-09 20:46:15,206][25689] Fps is (10 sec: 5705.5, 60 sec: 5675.0, 300 sec: 5675.9). Total num frames: 423668736. Throughput: 0: 5022.9. Samples: 423661980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:15,207][25689] Avg episode reward: [(0, '-47.784')] [2022-07-09 20:46:16,094][26022] Updated weights on worker 0-0, policy_version 413745 (0.00082) [2022-07-09 20:46:17,880][26022] Updated weights on worker 0-0, policy_version 413755 (0.00089) [2022-07-09 20:46:19,806][26022] Updated weights on worker 0-0, policy_version 413765 (0.00081) [2022-07-09 20:46:20,210][25689] Fps is (10 sec: 5627.7, 60 sec: 5694.3, 300 sec: 5677.0). Total num frames: 423697408. Throughput: 0: 5887.6. Samples: 423696528. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:20,211][25689] Avg episode reward: [(0, '-46.923')] [2022-07-09 20:46:21,298][26022] Updated weights on worker 0-0, policy_version 413775 (0.00082) [2022-07-09 20:46:23,349][26022] Updated weights on worker 0-0, policy_version 413785 (0.00089) [2022-07-09 20:46:24,933][26022] Updated weights on worker 0-0, policy_version 413795 (0.00096) [2022-07-09 20:46:25,230][25689] Fps is (10 sec: 5822.8, 60 sec: 5694.1, 300 sec: 5688.8). Total num frames: 423727104. Throughput: 0: 6014.2. Samples: 423731352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:25,230][25689] Avg episode reward: [(0, '-46.933')] [2022-07-09 20:46:26,769][26022] Updated weights on worker 0-0, policy_version 413805 (0.00081) [2022-07-09 20:46:28,552][26022] Updated weights on worker 0-0, policy_version 413815 (0.00097) [2022-07-09 20:46:30,273][25689] Fps is (10 sec: 5800.3, 60 sec: 5713.0, 300 sec: 5678.6). Total num frames: 423755776. Throughput: 0: 5180.6. Samples: 423748554. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:30,273][25689] Avg episode reward: [(0, '-47.268')] [2022-07-09 20:46:30,340][26022] Updated weights on worker 0-0, policy_version 413825 (0.00086) [2022-07-09 20:46:32,310][26022] Updated weights on worker 0-0, policy_version 413835 (0.00091) [2022-07-09 20:46:34,003][26022] Updated weights on worker 0-0, policy_version 413845 (0.00085) [2022-07-09 20:46:35,286][25689] Fps is (10 sec: 5702.1, 60 sec: 5695.4, 300 sec: 5680.1). Total num frames: 423784448. Throughput: 0: 6014.9. Samples: 423782762. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:35,287][25689] Avg episode reward: [(0, '-46.801')] [2022-07-09 20:46:35,844][26022] Updated weights on worker 0-0, policy_version 413855 (0.00088) [2022-07-09 20:46:37,410][26022] Updated weights on worker 0-0, policy_version 413865 (0.00082) [2022-07-09 20:46:39,270][26022] Updated weights on worker 0-0, policy_version 413875 (0.00087) [2022-07-09 20:46:40,307][25689] Fps is (10 sec: 5817.0, 60 sec: 5728.8, 300 sec: 5684.3). Total num frames: 423814144. Throughput: 0: 6023.8. Samples: 423817588. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:40,307][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 20:46:41,120][26022] Updated weights on worker 0-0, policy_version 413885 (0.00080) [2022-07-09 20:46:42,723][26022] Updated weights on worker 0-0, policy_version 413895 (0.00081) [2022-07-09 20:46:44,653][26022] Updated weights on worker 0-0, policy_version 413905 (0.00083) [2022-07-09 20:46:45,310][25689] Fps is (10 sec: 5720.9, 60 sec: 5694.6, 300 sec: 5681.9). Total num frames: 423841792. Throughput: 0: 5158.4. Samples: 423834934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:45,310][25689] Avg episode reward: [(0, '-47.652')] [2022-07-09 20:46:46,253][26022] Updated weights on worker 0-0, policy_version 413915 (0.00086) [2022-07-09 20:46:48,234][26022] Updated weights on worker 0-0, policy_version 413925 (0.00092) [2022-07-09 20:46:49,923][26022] Updated weights on worker 0-0, policy_version 413935 (0.00080) [2022-07-09 20:46:50,351][25689] Fps is (10 sec: 5709.3, 60 sec: 5711.4, 300 sec: 5684.7). Total num frames: 423871488. Throughput: 0: 6028.4. Samples: 423869592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:50,351][25689] Avg episode reward: [(0, '-47.860')] [2022-07-09 20:46:51,598][26022] Updated weights on worker 0-0, policy_version 413945 (0.00084) [2022-07-09 20:46:53,458][26022] Updated weights on worker 0-0, policy_version 413955 (0.00081) [2022-07-09 20:46:55,269][26022] Updated weights on worker 0-0, policy_version 413965 (0.00092) [2022-07-09 20:46:55,354][25689] Fps is (10 sec: 5913.1, 60 sec: 5730.2, 300 sec: 5684.8). Total num frames: 423901184. Throughput: 0: 6068.3. Samples: 423904538. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:46:55,354][25689] Avg episode reward: [(0, '-47.920')] [2022-07-09 20:46:56,956][26022] Updated weights on worker 0-0, policy_version 413975 (0.00093) [2022-07-09 20:46:58,645][26022] Updated weights on worker 0-0, policy_version 413985 (0.00094) [2022-07-09 20:47:00,362][25689] Fps is (10 sec: 5830.0, 60 sec: 5713.5, 300 sec: 5696.0). Total num frames: 423929856. Throughput: 0: 6057.7. Samples: 423939080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:00,363][25689] Avg episode reward: [(0, '-47.607')] [2022-07-09 20:47:00,471][26022] Updated weights on worker 0-0, policy_version 413995 (0.00082) [2022-07-09 20:47:02,684][26022] Updated weights on worker 0-0, policy_version 414005 (0.00091) [2022-07-09 20:47:04,343][26022] Updated weights on worker 0-0, policy_version 414015 (0.00081) [2022-07-09 20:47:05,396][25689] Fps is (10 sec: 5404.5, 60 sec: 5711.5, 300 sec: 5684.2). Total num frames: 423955456. Throughput: 0: 5922.3. Samples: 423953890. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:05,397][25689] Avg episode reward: [(0, '-47.666')] [2022-07-09 20:47:06,216][26022] Updated weights on worker 0-0, policy_version 414025 (0.00087) [2022-07-09 20:47:08,042][26022] Updated weights on worker 0-0, policy_version 414035 (0.00091) [2022-07-09 20:47:09,771][26022] Updated weights on worker 0-0, policy_version 414045 (0.00086) [2022-07-09 20:47:10,436][25689] Fps is (10 sec: 5489.1, 60 sec: 5712.1, 300 sec: 5685.1). Total num frames: 423985152. Throughput: 0: 5924.1. Samples: 423988580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:10,437][25689] Avg episode reward: [(0, '-48.101')] [2022-07-09 20:47:11,691][26022] Updated weights on worker 0-0, policy_version 414055 (0.00090) [2022-07-09 20:47:13,210][26022] Updated weights on worker 0-0, policy_version 414065 (0.00085) [2022-07-09 20:47:15,406][26022] Updated weights on worker 0-0, policy_version 414075 (0.00086) [2022-07-09 20:47:15,444][25689] Fps is (10 sec: 5707.1, 60 sec: 5711.7, 300 sec: 5685.7). Total num frames: 424012800. Throughput: 0: 5924.1. Samples: 424023552. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:15,444][25689] Avg episode reward: [(0, '-47.482')] [2022-07-09 20:47:16,784][26022] Updated weights on worker 0-0, policy_version 414085 (0.00082) [2022-07-09 20:47:18,689][26022] Updated weights on worker 0-0, policy_version 414095 (0.00089) [2022-07-09 20:47:20,227][26022] Updated weights on worker 0-0, policy_version 414105 (0.00087) [2022-07-09 20:47:20,459][25689] Fps is (10 sec: 5823.6, 60 sec: 5744.7, 300 sec: 5689.5). Total num frames: 424043520. Throughput: 0: 5079.2. Samples: 424041154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:20,459][25689] Avg episode reward: [(0, '-47.718')] [2022-07-09 20:47:22,260][26022] Updated weights on worker 0-0, policy_version 414115 (0.00082) [2022-07-09 20:47:23,985][26022] Updated weights on worker 0-0, policy_version 414125 (0.00085) [2022-07-09 20:47:25,470][25689] Fps is (10 sec: 6025.7, 60 sec: 5745.5, 300 sec: 5697.1). Total num frames: 424073216. Throughput: 0: 6096.7. Samples: 424076276. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:25,471][25689] Avg episode reward: [(0, '-46.897')] [2022-07-09 20:47:25,598][26022] Updated weights on worker 0-0, policy_version 414135 (0.00091) [2022-07-09 20:47:27,376][26022] Updated weights on worker 0-0, policy_version 414145 (0.00086) [2022-07-09 20:47:29,144][26022] Updated weights on worker 0-0, policy_version 414155 (0.00094) [2022-07-09 20:47:30,548][25689] Fps is (10 sec: 5784.9, 60 sec: 5742.2, 300 sec: 5692.5). Total num frames: 424101888. Throughput: 0: 6084.4. Samples: 424110950. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:30,549][25689] Avg episode reward: [(0, '-47.378')] [2022-07-09 20:47:30,893][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:47:30,907][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000414165_424104960.pth [2022-07-09 20:47:30,907][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000412161_422052864.pth [2022-07-09 20:47:30,913][26022] Updated weights on worker 0-0, policy_version 414165 (0.00092) [2022-07-09 20:47:32,860][26022] Updated weights on worker 0-0, policy_version 414175 (0.00096) [2022-07-09 20:47:34,463][26022] Updated weights on worker 0-0, policy_version 414185 (0.00090) [2022-07-09 20:47:35,556][25689] Fps is (10 sec: 5685.4, 60 sec: 5742.7, 300 sec: 5692.6). Total num frames: 424130560. Throughput: 0: 5198.1. Samples: 424128100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:35,557][25689] Avg episode reward: [(0, '-47.544')] [2022-07-09 20:47:36,395][26022] Updated weights on worker 0-0, policy_version 414195 (0.00088) [2022-07-09 20:47:38,053][26022] Updated weights on worker 0-0, policy_version 414205 (0.00086) [2022-07-09 20:47:39,940][26022] Updated weights on worker 0-0, policy_version 414215 (0.00091) [2022-07-09 20:47:40,576][25689] Fps is (10 sec: 5820.6, 60 sec: 5742.7, 300 sec: 5692.8). Total num frames: 424160256. Throughput: 0: 6050.7. Samples: 424162880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:40,577][25689] Avg episode reward: [(0, '-47.186')] [2022-07-09 20:47:41,655][26022] Updated weights on worker 0-0, policy_version 414225 (0.00094) [2022-07-09 20:47:43,450][26022] Updated weights on worker 0-0, policy_version 414235 (0.00087) [2022-07-09 20:47:45,146][26022] Updated weights on worker 0-0, policy_version 414245 (0.00079) [2022-07-09 20:47:45,584][25689] Fps is (10 sec: 5718.7, 60 sec: 5742.3, 300 sec: 5690.6). Total num frames: 424187904. Throughput: 0: 6032.2. Samples: 424197604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:45,584][25689] Avg episode reward: [(0, '-46.896')] [2022-07-09 20:47:47,005][26022] Updated weights on worker 0-0, policy_version 414255 (0.00089) [2022-07-09 20:47:48,642][26022] Updated weights on worker 0-0, policy_version 414265 (0.00088) [2022-07-09 20:47:50,565][26022] Updated weights on worker 0-0, policy_version 414275 (0.00089) [2022-07-09 20:47:50,648][25689] Fps is (10 sec: 5795.4, 60 sec: 5757.1, 300 sec: 5696.4). Total num frames: 424218624. Throughput: 0: 5171.4. Samples: 424214892. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:50,648][25689] Avg episode reward: [(0, '-47.130')] [2022-07-09 20:47:52,463][26022] Updated weights on worker 0-0, policy_version 414285 (0.00086) [2022-07-09 20:47:53,958][26022] Updated weights on worker 0-0, policy_version 414295 (0.00088) [2022-07-09 20:47:55,666][25689] Fps is (10 sec: 5890.8, 60 sec: 5738.7, 300 sec: 5696.1). Total num frames: 424247296. Throughput: 0: 6053.9. Samples: 424249842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:47:55,667][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 20:47:56,061][26022] Updated weights on worker 0-0, policy_version 414305 (0.00088) [2022-07-09 20:47:57,476][26022] Updated weights on worker 0-0, policy_version 414315 (0.00085) [2022-07-09 20:47:59,314][26022] Updated weights on worker 0-0, policy_version 414325 (0.00086) [2022-07-09 20:48:00,674][25689] Fps is (10 sec: 5719.2, 60 sec: 5738.7, 300 sec: 5696.2). Total num frames: 424275968. Throughput: 0: 6064.8. Samples: 424284770. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:48:00,676][25689] Avg episode reward: [(0, '-46.225')] [2022-07-09 20:48:00,912][26022] Updated weights on worker 0-0, policy_version 414335 (0.00086) [2022-07-09 20:48:03,042][26022] Updated weights on worker 0-0, policy_version 414345 (0.00085) [2022-07-09 20:48:04,983][26022] Updated weights on worker 0-0, policy_version 414355 (0.00092) [2022-07-09 20:48:05,682][25689] Fps is (10 sec: 5623.0, 60 sec: 5775.1, 300 sec: 5700.7). Total num frames: 424303616. Throughput: 0: 5095.5. Samples: 424300014. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-09 20:48:05,684][25689] Avg episode reward: [(0, '-46.339')] [2022-07-09 20:48:06,852][26022] Updated weights on worker 0-0, policy_version 414365 (0.00095) [2022-07-09 20:48:08,581][26022] Updated weights on worker 0-0, policy_version 414375 (0.00048) [2022-07-09 20:48:10,430][26022] Updated weights on worker 0-0, policy_version 414385 (0.00088) [2022-07-09 20:48:10,791][25689] Fps is (10 sec: 5465.7, 60 sec: 5734.6, 300 sec: 5692.1). Total num frames: 424331264. Throughput: 0: 5933.9. Samples: 424334420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:10,792][25689] Avg episode reward: [(0, '-45.814')] [2022-07-09 20:48:11,991][26022] Updated weights on worker 0-0, policy_version 414395 (0.00085) [2022-07-09 20:48:13,974][26022] Updated weights on worker 0-0, policy_version 414405 (0.00732) [2022-07-09 20:48:15,811][25689] Fps is (10 sec: 5560.3, 60 sec: 5750.4, 300 sec: 5692.1). Total num frames: 424359936. Throughput: 0: 5918.5. Samples: 424369070. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:15,811][25689] Avg episode reward: [(0, '-45.155')] [2022-07-09 20:48:15,815][26022] Updated weights on worker 0-0, policy_version 414415 (0.00088) [2022-07-09 20:48:17,369][26022] Updated weights on worker 0-0, policy_version 414425 (0.00082) [2022-07-09 20:48:19,475][26022] Updated weights on worker 0-0, policy_version 414435 (0.00089) [2022-07-09 20:48:20,875][25689] Fps is (10 sec: 5889.7, 60 sec: 5745.7, 300 sec: 5699.4). Total num frames: 424390656. Throughput: 0: 5019.2. Samples: 424386162. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:20,876][25689] Avg episode reward: [(0, '-46.107')] [2022-07-09 20:48:20,934][26022] Updated weights on worker 0-0, policy_version 414445 (0.00109) [2022-07-09 20:48:23,050][26022] Updated weights on worker 0-0, policy_version 414455 (0.00092) [2022-07-09 20:48:24,761][26022] Updated weights on worker 0-0, policy_version 414465 (0.00084) [2022-07-09 20:48:25,893][25689] Fps is (10 sec: 5789.4, 60 sec: 5711.2, 300 sec: 5696.4). Total num frames: 424418304. Throughput: 0: 5970.1. Samples: 424420674. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:25,893][25689] Avg episode reward: [(0, '-46.974')] [2022-07-09 20:48:26,493][26022] Updated weights on worker 0-0, policy_version 414475 (0.00084) [2022-07-09 20:48:28,298][26022] Updated weights on worker 0-0, policy_version 414485 (0.00108) [2022-07-09 20:48:30,323][26022] Updated weights on worker 0-0, policy_version 414495 (0.00089) [2022-07-09 20:48:30,968][25689] Fps is (10 sec: 5580.0, 60 sec: 5711.5, 300 sec: 5696.9). Total num frames: 424446976. Throughput: 0: 5948.6. Samples: 424454446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:30,969][25689] Avg episode reward: [(0, '-46.724')] [2022-07-09 20:48:31,772][26022] Updated weights on worker 0-0, policy_version 414505 (0.00090) [2022-07-09 20:48:33,973][26022] Updated weights on worker 0-0, policy_version 414515 (0.00096) [2022-07-09 20:48:35,435][26022] Updated weights on worker 0-0, policy_version 414525 (0.00083) [2022-07-09 20:48:36,055][25689] Fps is (10 sec: 5642.9, 60 sec: 5704.1, 300 sec: 5700.0). Total num frames: 424475648. Throughput: 0: 5052.7. Samples: 424471356. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:36,055][25689] Avg episode reward: [(0, '-46.988')] [2022-07-09 20:48:37,358][26022] Updated weights on worker 0-0, policy_version 414535 (0.00090) [2022-07-09 20:48:39,125][26022] Updated weights on worker 0-0, policy_version 414545 (0.00084) [2022-07-09 20:48:41,074][25689] Fps is (10 sec: 5572.9, 60 sec: 5670.3, 300 sec: 5693.0). Total num frames: 424503296. Throughput: 0: 5899.6. Samples: 424505330. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:41,075][25689] Avg episode reward: [(0, '-47.105')] [2022-07-09 20:48:41,075][26022] Updated weights on worker 0-0, policy_version 414555 (0.00080) [2022-07-09 20:48:42,868][26022] Updated weights on worker 0-0, policy_version 414565 (0.00085) [2022-07-09 20:48:44,503][26022] Updated weights on worker 0-0, policy_version 414575 (0.00087) [2022-07-09 20:48:46,087][25689] Fps is (10 sec: 5716.0, 60 sec: 5703.6, 300 sec: 5698.6). Total num frames: 424532992. Throughput: 0: 5881.3. Samples: 424539442. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:46,088][25689] Avg episode reward: [(0, '-48.154')] [2022-07-09 20:48:46,269][26022] Updated weights on worker 0-0, policy_version 414585 (0.00092) [2022-07-09 20:48:48,245][26022] Updated weights on worker 0-0, policy_version 414595 (0.00082) [2022-07-09 20:48:49,986][26022] Updated weights on worker 0-0, policy_version 414605 (0.00084) [2022-07-09 20:48:51,171][25689] Fps is (10 sec: 5578.3, 60 sec: 5634.1, 300 sec: 5690.7). Total num frames: 424559616. Throughput: 0: 5056.0. Samples: 424556590. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:51,173][25689] Avg episode reward: [(0, '-47.307')] [2022-07-09 20:48:51,749][26022] Updated weights on worker 0-0, policy_version 414615 (0.00093) [2022-07-09 20:48:53,464][26022] Updated weights on worker 0-0, policy_version 414625 (0.00085) [2022-07-09 20:48:55,235][26022] Updated weights on worker 0-0, policy_version 414635 (0.00085) [2022-07-09 20:48:56,191][25689] Fps is (10 sec: 5675.4, 60 sec: 5667.8, 300 sec: 5697.2). Total num frames: 424590336. Throughput: 0: 5921.4. Samples: 424590590. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:48:56,192][25689] Avg episode reward: [(0, '-48.219')] [2022-07-09 20:48:57,222][26022] Updated weights on worker 0-0, policy_version 414645 (0.00091) [2022-07-09 20:48:58,815][26022] Updated weights on worker 0-0, policy_version 414655 (0.00087) [2022-07-09 20:49:00,678][26022] Updated weights on worker 0-0, policy_version 414665 (0.00080) [2022-07-09 20:49:01,201][25689] Fps is (10 sec: 6023.1, 60 sec: 5684.5, 300 sec: 5711.5). Total num frames: 424620032. Throughput: 0: 5978.2. Samples: 424625654. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:01,204][25689] Avg episode reward: [(0, '-47.049')] [2022-07-09 20:49:02,942][26022] Updated weights on worker 0-0, policy_version 414675 (0.00091) [2022-07-09 20:49:04,651][26022] Updated weights on worker 0-0, policy_version 414685 (0.00617) [2022-07-09 20:49:06,215][25689] Fps is (10 sec: 5618.5, 60 sec: 5667.0, 300 sec: 5702.6). Total num frames: 424646656. Throughput: 0: 5049.4. Samples: 424641076. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:06,217][25689] Avg episode reward: [(0, '-47.125')] [2022-07-09 20:49:06,310][26022] Updated weights on worker 0-0, policy_version 414695 (0.00084) [2022-07-09 20:49:08,139][26022] Updated weights on worker 0-0, policy_version 414705 (0.00082) [2022-07-09 20:49:09,874][26022] Updated weights on worker 0-0, policy_version 414715 (0.00086) [2022-07-09 20:49:11,328][25689] Fps is (10 sec: 5460.7, 60 sec: 5683.6, 300 sec: 5698.2). Total num frames: 424675328. Throughput: 0: 5893.8. Samples: 424675390. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:11,328][25689] Avg episode reward: [(0, '-46.464')] [2022-07-09 20:49:11,864][26022] Updated weights on worker 0-0, policy_version 414725 (0.00084) [2022-07-09 20:49:13,549][26022] Updated weights on worker 0-0, policy_version 414735 (0.00083) [2022-07-09 20:49:15,296][26022] Updated weights on worker 0-0, policy_version 414745 (0.00095) [2022-07-09 20:49:16,393][25689] Fps is (10 sec: 5734.7, 60 sec: 5696.2, 300 sec: 5705.2). Total num frames: 424705024. Throughput: 0: 5921.6. Samples: 424710218. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:16,394][25689] Avg episode reward: [(0, '-46.450')] [2022-07-09 20:49:17,147][26022] Updated weights on worker 0-0, policy_version 414755 (0.00106) [2022-07-09 20:49:18,669][26022] Updated weights on worker 0-0, policy_version 414765 (0.00048) [2022-07-09 20:49:20,654][26022] Updated weights on worker 0-0, policy_version 414775 (0.00089) [2022-07-09 20:49:21,448][25689] Fps is (10 sec: 5868.5, 60 sec: 5680.2, 300 sec: 5704.9). Total num frames: 424734720. Throughput: 0: 5030.3. Samples: 424727500. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:21,449][25689] Avg episode reward: [(0, '-46.841')] [2022-07-09 20:49:22,506][26022] Updated weights on worker 0-0, policy_version 414785 (0.00085) [2022-07-09 20:49:24,033][26022] Updated weights on worker 0-0, policy_version 414795 (0.00089) [2022-07-09 20:49:26,012][26022] Updated weights on worker 0-0, policy_version 414805 (0.00090) [2022-07-09 20:49:26,525][25689] Fps is (10 sec: 5760.5, 60 sec: 5691.5, 300 sec: 5704.1). Total num frames: 424763392. Throughput: 0: 5967.1. Samples: 424762270. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:26,526][25689] Avg episode reward: [(0, '-46.166')] [2022-07-09 20:49:27,740][26022] Updated weights on worker 0-0, policy_version 414815 (0.00094) [2022-07-09 20:49:29,546][26022] Updated weights on worker 0-0, policy_version 414825 (0.00085) [2022-07-09 20:49:31,079][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:49:31,089][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000414834_424790016.pth [2022-07-09 20:49:31,089][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000412829_422736896.pth [2022-07-09 20:49:31,219][26022] Updated weights on worker 0-0, policy_version 414835 (0.00084) [2022-07-09 20:49:31,664][25689] Fps is (10 sec: 5613.1, 60 sec: 5685.5, 300 sec: 5706.2). Total num frames: 424792064. Throughput: 0: 5955.1. Samples: 424796496. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:31,665][25689] Avg episode reward: [(0, '-46.092')] [2022-07-09 20:49:33,173][26022] Updated weights on worker 0-0, policy_version 414845 (0.00093) [2022-07-09 20:49:34,812][26022] Updated weights on worker 0-0, policy_version 414855 (0.00085) [2022-07-09 20:49:36,702][25689] Fps is (10 sec: 5735.2, 60 sec: 5706.9, 300 sec: 5709.2). Total num frames: 424821760. Throughput: 0: 5913.8. Samples: 424830324. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:36,703][25689] Avg episode reward: [(0, '-45.914')] [2022-07-09 20:49:36,706][26022] Updated weights on worker 0-0, policy_version 414865 (0.00086) [2022-07-09 20:49:38,516][26022] Updated weights on worker 0-0, policy_version 414875 (0.00089) [2022-07-09 20:49:40,463][26022] Updated weights on worker 0-0, policy_version 414885 (0.00082) [2022-07-09 20:49:41,774][25689] Fps is (10 sec: 5874.8, 60 sec: 5735.8, 300 sec: 5711.9). Total num frames: 424851456. Throughput: 0: 5911.9. Samples: 424847664. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:41,774][25689] Avg episode reward: [(0, '-46.004')] [2022-07-09 20:49:41,963][26022] Updated weights on worker 0-0, policy_version 414895 (0.00089) [2022-07-09 20:49:44,055][26022] Updated weights on worker 0-0, policy_version 414905 (0.00085) [2022-07-09 20:49:45,470][26022] Updated weights on worker 0-0, policy_version 414915 (0.00084) [2022-07-09 20:49:46,799][25689] Fps is (10 sec: 5578.2, 60 sec: 5684.0, 300 sec: 5702.5). Total num frames: 424878080. Throughput: 0: 5930.8. Samples: 424882508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:46,799][25689] Avg episode reward: [(0, '-45.790')] [2022-07-09 20:49:47,503][26022] Updated weights on worker 0-0, policy_version 414925 (0.00079) [2022-07-09 20:49:49,140][26022] Updated weights on worker 0-0, policy_version 414935 (0.00081) [2022-07-09 20:49:50,960][26022] Updated weights on worker 0-0, policy_version 414945 (0.00088) [2022-07-09 20:49:51,840][25689] Fps is (10 sec: 5696.9, 60 sec: 5755.6, 300 sec: 5708.9). Total num frames: 424908800. Throughput: 0: 5984.4. Samples: 424917234. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:51,841][25689] Avg episode reward: [(0, '-45.772')] [2022-07-09 20:49:52,780][26022] Updated weights on worker 0-0, policy_version 414955 (0.00092) [2022-07-09 20:49:54,453][26022] Updated weights on worker 0-0, policy_version 414965 (0.00088) [2022-07-09 20:49:56,326][26022] Updated weights on worker 0-0, policy_version 414975 (0.00093) [2022-07-09 20:49:56,869][25689] Fps is (10 sec: 5897.9, 60 sec: 5721.0, 300 sec: 5709.1). Total num frames: 424937472. Throughput: 0: 5164.8. Samples: 424934480. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:49:56,869][25689] Avg episode reward: [(0, '-45.156')] [2022-07-09 20:49:57,865][26022] Updated weights on worker 0-0, policy_version 414985 (0.00092) [2022-07-09 20:49:59,757][26022] Updated weights on worker 0-0, policy_version 414995 (0.00080) [2022-07-09 20:50:01,874][25689] Fps is (10 sec: 5510.4, 60 sec: 5670.8, 300 sec: 5706.0). Total num frames: 424964096. Throughput: 0: 6058.7. Samples: 424969448. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:01,875][25689] Avg episode reward: [(0, '-45.104')] [2022-07-09 20:50:01,953][26022] Updated weights on worker 0-0, policy_version 415005 (0.00083) [2022-07-09 20:50:03,615][26022] Updated weights on worker 0-0, policy_version 415015 (0.00090) [2022-07-09 20:50:05,661][26022] Updated weights on worker 0-0, policy_version 415025 (0.00093) [2022-07-09 20:50:06,886][25689] Fps is (10 sec: 5519.9, 60 sec: 5704.7, 300 sec: 5713.7). Total num frames: 424992768. Throughput: 0: 5921.1. Samples: 425001448. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:06,887][25689] Avg episode reward: [(0, '-45.357')] [2022-07-09 20:50:07,254][26022] Updated weights on worker 0-0, policy_version 415035 (0.00086) [2022-07-09 20:50:09,386][26022] Updated weights on worker 0-0, policy_version 415045 (0.00087) [2022-07-09 20:50:10,800][26022] Updated weights on worker 0-0, policy_version 415055 (0.00085) [2022-07-09 20:50:12,016][25689] Fps is (10 sec: 5755.4, 60 sec: 5720.0, 300 sec: 5708.1). Total num frames: 425022464. Throughput: 0: 5026.9. Samples: 425018658. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:12,016][25689] Avg episode reward: [(0, '-46.299')] [2022-07-09 20:50:12,833][26022] Updated weights on worker 0-0, policy_version 415065 (0.00080) [2022-07-09 20:50:14,456][26022] Updated weights on worker 0-0, policy_version 415075 (0.00083) [2022-07-09 20:50:16,359][26022] Updated weights on worker 0-0, policy_version 415085 (0.00571) [2022-07-09 20:50:17,039][25689] Fps is (10 sec: 5749.1, 60 sec: 5707.1, 300 sec: 5711.7). Total num frames: 425051136. Throughput: 0: 5909.7. Samples: 425053680. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:17,039][25689] Avg episode reward: [(0, '-46.405')] [2022-07-09 20:50:17,987][26022] Updated weights on worker 0-0, policy_version 415095 (0.00091) [2022-07-09 20:50:19,867][26022] Updated weights on worker 0-0, policy_version 415105 (0.00093) [2022-07-09 20:50:21,427][26022] Updated weights on worker 0-0, policy_version 415115 (0.00090) [2022-07-09 20:50:22,125][25689] Fps is (10 sec: 5773.8, 60 sec: 5704.2, 300 sec: 5710.3). Total num frames: 425080832. Throughput: 0: 5871.8. Samples: 425088354. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:22,127][25689] Avg episode reward: [(0, '-45.881')] [2022-07-09 20:50:23,329][26022] Updated weights on worker 0-0, policy_version 415125 (0.00090) [2022-07-09 20:50:25,042][26022] Updated weights on worker 0-0, policy_version 415135 (0.00086) [2022-07-09 20:50:26,957][26022] Updated weights on worker 0-0, policy_version 415145 (0.00081) [2022-07-09 20:50:27,153][25689] Fps is (10 sec: 5770.6, 60 sec: 5708.7, 300 sec: 5714.4). Total num frames: 425109504. Throughput: 0: 5156.0. Samples: 425105946. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:27,155][25689] Avg episode reward: [(0, '-46.747')] [2022-07-09 20:50:28,535][26022] Updated weights on worker 0-0, policy_version 415155 (0.00094) [2022-07-09 20:50:30,420][26022] Updated weights on worker 0-0, policy_version 415165 (0.00096) [2022-07-09 20:50:31,986][26022] Updated weights on worker 0-0, policy_version 415175 (0.00106) [2022-07-09 20:50:32,285][25689] Fps is (10 sec: 5744.7, 60 sec: 5726.3, 300 sec: 5712.0). Total num frames: 425139200. Throughput: 0: 6012.2. Samples: 425140522. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:32,286][25689] Avg episode reward: [(0, '-47.189')] [2022-07-09 20:50:33,932][26022] Updated weights on worker 0-0, policy_version 415185 (0.00085) [2022-07-09 20:50:35,817][26022] Updated weights on worker 0-0, policy_version 415195 (0.00080) [2022-07-09 20:50:37,297][25689] Fps is (10 sec: 5956.1, 60 sec: 5745.7, 300 sec: 5722.4). Total num frames: 425169920. Throughput: 0: 6010.5. Samples: 425175442. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:37,297][25689] Avg episode reward: [(0, '-45.626')] [2022-07-09 20:50:37,300][26022] Updated weights on worker 0-0, policy_version 415205 (0.00085) [2022-07-09 20:50:39,399][26022] Updated weights on worker 0-0, policy_version 415215 (0.00091) [2022-07-09 20:50:40,943][26022] Updated weights on worker 0-0, policy_version 415225 (0.00083) [2022-07-09 20:50:42,308][25689] Fps is (10 sec: 5823.5, 60 sec: 5717.6, 300 sec: 5715.3). Total num frames: 425197568. Throughput: 0: 5163.5. Samples: 425192570. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:42,310][25689] Avg episode reward: [(0, '-45.854')] [2022-07-09 20:50:42,783][26022] Updated weights on worker 0-0, policy_version 415235 (0.00087) [2022-07-09 20:50:44,478][26022] Updated weights on worker 0-0, policy_version 415245 (0.00086) [2022-07-09 20:50:46,254][26022] Updated weights on worker 0-0, policy_version 415255 (0.00086) [2022-07-09 20:50:47,352][25689] Fps is (10 sec: 5601.2, 60 sec: 5749.7, 300 sec: 5715.2). Total num frames: 425226240. Throughput: 0: 6005.4. Samples: 425227246. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-09 20:50:47,354][25689] Avg episode reward: [(0, '-46.140')] [2022-07-09 20:50:48,143][26022] Updated weights on worker 0-0, policy_version 415265 (0.00085) [2022-07-09 20:50:49,953][26022] Updated weights on worker 0-0, policy_version 415275 (0.00086) [2022-07-09 20:50:51,711][26022] Updated weights on worker 0-0, policy_version 415285 (0.00095) [2022-07-09 20:50:52,397][25689] Fps is (10 sec: 5785.2, 60 sec: 5732.3, 300 sec: 5718.2). Total num frames: 425255936. Throughput: 0: 6025.7. Samples: 425261712. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:50:52,398][25689] Avg episode reward: [(0, '-46.480')] [2022-07-09 20:50:53,584][26022] Updated weights on worker 0-0, policy_version 415295 (0.00093) [2022-07-09 20:50:55,435][26022] Updated weights on worker 0-0, policy_version 415305 (0.00103) [2022-07-09 20:50:56,988][26022] Updated weights on worker 0-0, policy_version 415315 (0.00090) [2022-07-09 20:50:57,446][25689] Fps is (10 sec: 5782.5, 60 sec: 5730.5, 300 sec: 5714.1). Total num frames: 425284608. Throughput: 0: 5136.7. Samples: 425278940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:50:57,446][25689] Avg episode reward: [(0, '-45.924')] [2022-07-09 20:50:58,886][26022] Updated weights on worker 0-0, policy_version 415325 (0.00093) [2022-07-09 20:51:00,609][26022] Updated weights on worker 0-0, policy_version 415335 (0.00092) [2022-07-09 20:51:02,472][25689] Fps is (10 sec: 5387.1, 60 sec: 5711.7, 300 sec: 5713.8). Total num frames: 425310208. Throughput: 0: 5990.2. Samples: 425313354. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:02,474][25689] Avg episode reward: [(0, '-46.653')] [2022-07-09 20:51:02,835][26022] Updated weights on worker 0-0, policy_version 415345 (0.00083) [2022-07-09 20:51:04,360][26022] Updated weights on worker 0-0, policy_version 415355 (0.00084) [2022-07-09 20:51:06,518][26022] Updated weights on worker 0-0, policy_version 415365 (0.00089) [2022-07-09 20:51:07,490][25689] Fps is (10 sec: 5403.1, 60 sec: 5711.0, 300 sec: 5710.9). Total num frames: 425338880. Throughput: 0: 5893.2. Samples: 425345926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:07,491][25689] Avg episode reward: [(0, '-46.786')] [2022-07-09 20:51:07,916][26022] Updated weights on worker 0-0, policy_version 415375 (0.00085) [2022-07-09 20:51:10,256][26022] Updated weights on worker 0-0, policy_version 415385 (0.00089) [2022-07-09 20:51:11,407][26022] Updated weights on worker 0-0, policy_version 415395 (0.00090) [2022-07-09 20:51:12,634][25689] Fps is (10 sec: 5743.7, 60 sec: 5709.7, 300 sec: 5715.1). Total num frames: 425368576. Throughput: 0: 5003.1. Samples: 425362958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:12,634][25689] Avg episode reward: [(0, '-46.649')] [2022-07-09 20:51:13,643][26022] Updated weights on worker 0-0, policy_version 415405 (0.00094) [2022-07-09 20:51:15,038][26022] Updated weights on worker 0-0, policy_version 415415 (0.00091) [2022-07-09 20:51:17,074][26022] Updated weights on worker 0-0, policy_version 415425 (0.00078) [2022-07-09 20:51:17,668][25689] Fps is (10 sec: 5835.4, 60 sec: 5725.5, 300 sec: 5718.0). Total num frames: 425398272. Throughput: 0: 5865.1. Samples: 425397546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:17,669][25689] Avg episode reward: [(0, '-46.763')] [2022-07-09 20:51:18,799][26022] Updated weights on worker 0-0, policy_version 415435 (0.00085) [2022-07-09 20:51:20,491][26022] Updated weights on worker 0-0, policy_version 415445 (0.00078) [2022-07-09 20:51:22,404][26022] Updated weights on worker 0-0, policy_version 415455 (0.00093) [2022-07-09 20:51:22,698][25689] Fps is (10 sec: 6003.3, 60 sec: 5747.8, 300 sec: 5721.2). Total num frames: 425428992. Throughput: 0: 5875.6. Samples: 425432194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:22,698][25689] Avg episode reward: [(0, '-46.932')] [2022-07-09 20:51:24,306][26022] Updated weights on worker 0-0, policy_version 415465 (0.00091) [2022-07-09 20:51:25,826][26022] Updated weights on worker 0-0, policy_version 415475 (0.00089) [2022-07-09 20:51:27,752][25689] Fps is (10 sec: 5788.2, 60 sec: 5728.4, 300 sec: 5717.6). Total num frames: 425456640. Throughput: 0: 5125.4. Samples: 425449776. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:27,753][25689] Avg episode reward: [(0, '-46.851')] [2022-07-09 20:51:27,761][26022] Updated weights on worker 0-0, policy_version 415485 (0.00336) [2022-07-09 20:51:29,302][26022] Updated weights on worker 0-0, policy_version 415495 (0.00092) [2022-07-09 20:51:31,094][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:51:31,104][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000415504_425476096.pth [2022-07-09 20:51:31,104][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000413492_423415808.pth [2022-07-09 20:51:31,242][26022] Updated weights on worker 0-0, policy_version 415505 (0.00085) [2022-07-09 20:51:32,863][25689] Fps is (10 sec: 5540.3, 60 sec: 5713.5, 300 sec: 5715.7). Total num frames: 425485312. Throughput: 0: 5983.2. Samples: 425483994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:32,864][25689] Avg episode reward: [(0, '-46.024')] [2022-07-09 20:51:33,321][26022] Updated weights on worker 0-0, policy_version 415515 (0.00086) [2022-07-09 20:51:34,867][26022] Updated weights on worker 0-0, policy_version 415525 (0.00089) [2022-07-09 20:51:36,627][26022] Updated weights on worker 0-0, policy_version 415535 (0.00079) [2022-07-09 20:51:37,923][25689] Fps is (10 sec: 5839.2, 60 sec: 5708.9, 300 sec: 5718.4). Total num frames: 425516032. Throughput: 0: 5980.1. Samples: 425518674. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:37,925][25689] Avg episode reward: [(0, '-47.277')] [2022-07-09 20:51:38,310][26022] Updated weights on worker 0-0, policy_version 415545 (0.00085) [2022-07-09 20:51:40,122][26022] Updated weights on worker 0-0, policy_version 415555 (0.00080) [2022-07-09 20:51:41,963][26022] Updated weights on worker 0-0, policy_version 415565 (0.00086) [2022-07-09 20:51:42,978][25689] Fps is (10 sec: 5871.8, 60 sec: 5721.7, 300 sec: 5720.8). Total num frames: 425544704. Throughput: 0: 5986.7. Samples: 425553606. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:42,979][25689] Avg episode reward: [(0, '-47.422')] [2022-07-09 20:51:43,659][26022] Updated weights on worker 0-0, policy_version 415575 (0.00104) [2022-07-09 20:51:45,375][26022] Updated weights on worker 0-0, policy_version 415585 (0.00089) [2022-07-09 20:51:47,353][26022] Updated weights on worker 0-0, policy_version 415595 (0.00093) [2022-07-09 20:51:47,984][25689] Fps is (10 sec: 5699.9, 60 sec: 5725.3, 300 sec: 5718.1). Total num frames: 425573376. Throughput: 0: 6001.6. Samples: 425571200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:47,985][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 20:51:49,012][26022] Updated weights on worker 0-0, policy_version 415605 (0.00085) [2022-07-09 20:51:50,811][26022] Updated weights on worker 0-0, policy_version 415615 (0.00087) [2022-07-09 20:51:52,518][26022] Updated weights on worker 0-0, policy_version 415625 (0.00090) [2022-07-09 20:51:53,096][25689] Fps is (10 sec: 5769.0, 60 sec: 5719.0, 300 sec: 5716.0). Total num frames: 425603072. Throughput: 0: 6013.6. Samples: 425605664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:53,096][25689] Avg episode reward: [(0, '-47.549')] [2022-07-09 20:51:54,238][26022] Updated weights on worker 0-0, policy_version 415635 (0.00085) [2022-07-09 20:51:56,002][26022] Updated weights on worker 0-0, policy_version 415645 (0.00096) [2022-07-09 20:51:57,819][26022] Updated weights on worker 0-0, policy_version 415655 (0.00092) [2022-07-09 20:51:58,134][25689] Fps is (10 sec: 5851.4, 60 sec: 5736.9, 300 sec: 5718.9). Total num frames: 425632768. Throughput: 0: 6028.2. Samples: 425640508. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:51:58,135][25689] Avg episode reward: [(0, '-47.909')] [2022-07-09 20:51:59,646][26022] Updated weights on worker 0-0, policy_version 415665 (0.00095) [2022-07-09 20:52:01,443][26022] Updated weights on worker 0-0, policy_version 415675 (0.00081) [2022-07-09 20:52:03,154][25689] Fps is (10 sec: 5599.1, 60 sec: 5754.3, 300 sec: 5722.5). Total num frames: 425659392. Throughput: 0: 5169.5. Samples: 425657908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:03,155][25689] Avg episode reward: [(0, '-47.185')] [2022-07-09 20:52:03,363][26022] Updated weights on worker 0-0, policy_version 415685 (0.00084) [2022-07-09 20:52:05,419][26022] Updated weights on worker 0-0, policy_version 415695 (0.00075) [2022-07-09 20:52:06,828][26022] Updated weights on worker 0-0, policy_version 415705 (0.00091) [2022-07-09 20:52:08,172][25689] Fps is (10 sec: 5406.5, 60 sec: 5737.4, 300 sec: 5716.1). Total num frames: 425687040. Throughput: 0: 5905.1. Samples: 425690414. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:08,173][25689] Avg episode reward: [(0, '-47.339')] [2022-07-09 20:52:08,909][26022] Updated weights on worker 0-0, policy_version 415715 (0.00095) [2022-07-09 20:52:10,663][26022] Updated weights on worker 0-0, policy_version 415725 (0.00484) [2022-07-09 20:52:12,398][26022] Updated weights on worker 0-0, policy_version 415735 (0.00093) [2022-07-09 20:52:13,278][25689] Fps is (10 sec: 5765.5, 60 sec: 5757.9, 300 sec: 5724.5). Total num frames: 425717760. Throughput: 0: 5916.7. Samples: 425725076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:13,278][25689] Avg episode reward: [(0, '-46.982')] [2022-07-09 20:52:14,138][26022] Updated weights on worker 0-0, policy_version 415745 (0.00088) [2022-07-09 20:52:15,843][26022] Updated weights on worker 0-0, policy_version 415755 (0.00081) [2022-07-09 20:52:17,744][26022] Updated weights on worker 0-0, policy_version 415765 (0.00088) [2022-07-09 20:52:18,299][25689] Fps is (10 sec: 5763.9, 60 sec: 5725.4, 300 sec: 5714.1). Total num frames: 425745408. Throughput: 0: 5062.9. Samples: 425742600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:18,299][25689] Avg episode reward: [(0, '-47.397')] [2022-07-09 20:52:19,485][26022] Updated weights on worker 0-0, policy_version 415775 (0.00092) [2022-07-09 20:52:21,250][26022] Updated weights on worker 0-0, policy_version 415785 (0.00083) [2022-07-09 20:52:22,984][26022] Updated weights on worker 0-0, policy_version 415795 (0.00087) [2022-07-09 20:52:23,316][25689] Fps is (10 sec: 5712.7, 60 sec: 5709.7, 300 sec: 5714.0). Total num frames: 425775104. Throughput: 0: 5927.2. Samples: 425777410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:23,317][25689] Avg episode reward: [(0, '-46.695')] [2022-07-09 20:52:24,716][26022] Updated weights on worker 0-0, policy_version 415805 (0.00089) [2022-07-09 20:52:26,611][26022] Updated weights on worker 0-0, policy_version 415815 (0.00081) [2022-07-09 20:52:28,268][26022] Updated weights on worker 0-0, policy_version 415825 (0.00091) [2022-07-09 20:52:28,367][25689] Fps is (10 sec: 5899.3, 60 sec: 5743.8, 300 sec: 5718.0). Total num frames: 425804800. Throughput: 0: 6031.8. Samples: 425812222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:28,367][25689] Avg episode reward: [(0, '-46.043')] [2022-07-09 20:52:30,314][26022] Updated weights on worker 0-0, policy_version 415835 (0.00089) [2022-07-09 20:52:31,712][26022] Updated weights on worker 0-0, policy_version 415845 (0.00090) [2022-07-09 20:52:33,450][25689] Fps is (10 sec: 5759.8, 60 sec: 5746.5, 300 sec: 5716.5). Total num frames: 425833472. Throughput: 0: 5167.5. Samples: 425829314. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:33,450][25689] Avg episode reward: [(0, '-45.760')] [2022-07-09 20:52:33,915][26022] Updated weights on worker 0-0, policy_version 415855 (0.00622) [2022-07-09 20:52:35,428][26022] Updated weights on worker 0-0, policy_version 415865 (0.00084) [2022-07-09 20:52:37,379][26022] Updated weights on worker 0-0, policy_version 415875 (0.00089) [2022-07-09 20:52:38,526][25689] Fps is (10 sec: 5846.3, 60 sec: 5745.0, 300 sec: 5718.9). Total num frames: 425864192. Throughput: 0: 5987.7. Samples: 425863710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:38,526][25689] Avg episode reward: [(0, '-45.826')] [2022-07-09 20:52:38,967][26022] Updated weights on worker 0-0, policy_version 415885 (0.00098) [2022-07-09 20:52:40,846][26022] Updated weights on worker 0-0, policy_version 415895 (0.00084) [2022-07-09 20:52:42,445][26022] Updated weights on worker 0-0, policy_version 415905 (0.00089) [2022-07-09 20:52:43,549][25689] Fps is (10 sec: 5678.1, 60 sec: 5714.2, 300 sec: 5715.1). Total num frames: 425890816. Throughput: 0: 5976.3. Samples: 425898328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:43,549][25689] Avg episode reward: [(0, '-45.086')] [2022-07-09 20:52:44,367][26022] Updated weights on worker 0-0, policy_version 415915 (0.00086) [2022-07-09 20:52:46,162][26022] Updated weights on worker 0-0, policy_version 415925 (0.00090) [2022-07-09 20:52:47,905][26022] Updated weights on worker 0-0, policy_version 415935 (0.00088) [2022-07-09 20:52:48,558][25689] Fps is (10 sec: 5613.7, 60 sec: 5730.8, 300 sec: 5712.7). Total num frames: 425920512. Throughput: 0: 5129.2. Samples: 425915788. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:48,559][25689] Avg episode reward: [(0, '-44.716')] [2022-07-09 20:52:49,755][26022] Updated weights on worker 0-0, policy_version 415945 (0.00089) [2022-07-09 20:52:51,455][26022] Updated weights on worker 0-0, policy_version 415955 (0.00086) [2022-07-09 20:52:53,046][26022] Updated weights on worker 0-0, policy_version 415965 (0.00086) [2022-07-09 20:52:53,600][25689] Fps is (10 sec: 6010.7, 60 sec: 5754.3, 300 sec: 5719.2). Total num frames: 425951232. Throughput: 0: 6028.6. Samples: 425950794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:53,601][25689] Avg episode reward: [(0, '-44.831')] [2022-07-09 20:52:54,958][26022] Updated weights on worker 0-0, policy_version 415975 (0.00092) [2022-07-09 20:52:56,652][26022] Updated weights on worker 0-0, policy_version 415985 (0.00083) [2022-07-09 20:52:58,576][26022] Updated weights on worker 0-0, policy_version 415995 (0.00079) [2022-07-09 20:52:58,603][25689] Fps is (10 sec: 5810.8, 60 sec: 5723.8, 300 sec: 5715.8). Total num frames: 425978880. Throughput: 0: 6063.4. Samples: 425985446. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:52:58,603][25689] Avg episode reward: [(0, '-45.859')] [2022-07-09 20:53:00,423][26022] Updated weights on worker 0-0, policy_version 416005 (0.00086) [2022-07-09 20:53:02,470][26022] Updated weights on worker 0-0, policy_version 416015 (0.00091) [2022-07-09 20:53:03,607][25689] Fps is (10 sec: 5423.4, 60 sec: 5725.3, 300 sec: 5712.5). Total num frames: 426005504. Throughput: 0: 5205.4. Samples: 426002740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:53:03,609][25689] Avg episode reward: [(0, '-46.577')] [2022-07-09 20:53:04,303][26022] Updated weights on worker 0-0, policy_version 416025 (0.00087) [2022-07-09 20:53:05,890][26022] Updated weights on worker 0-0, policy_version 416035 (0.00097) [2022-07-09 20:53:07,735][26022] Updated weights on worker 0-0, policy_version 416045 (0.00090) [2022-07-09 20:53:08,627][25689] Fps is (10 sec: 5618.2, 60 sec: 5759.0, 300 sec: 5721.0). Total num frames: 426035200. Throughput: 0: 5962.3. Samples: 426035448. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:53:08,629][25689] Avg episode reward: [(0, '-46.305')] [2022-07-09 20:53:09,627][26022] Updated weights on worker 0-0, policy_version 416055 (0.00097) [2022-07-09 20:53:11,238][26022] Updated weights on worker 0-0, policy_version 416065 (0.00086) [2022-07-09 20:53:13,323][26022] Updated weights on worker 0-0, policy_version 416075 (0.00093) [2022-07-09 20:53:13,666][25689] Fps is (10 sec: 5802.8, 60 sec: 5731.5, 300 sec: 5720.7). Total num frames: 426063872. Throughput: 0: 5942.9. Samples: 426070042. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:53:13,667][25689] Avg episode reward: [(0, '-47.431')] [2022-07-09 20:53:14,722][26022] Updated weights on worker 0-0, policy_version 416085 (0.00082) [2022-07-09 20:53:16,575][26022] Updated weights on worker 0-0, policy_version 416095 (0.00085) [2022-07-09 20:53:18,381][26022] Updated weights on worker 0-0, policy_version 416105 (0.00084) [2022-07-09 20:53:18,729][25689] Fps is (10 sec: 5677.0, 60 sec: 5744.5, 300 sec: 5713.8). Total num frames: 426092544. Throughput: 0: 5079.5. Samples: 426087674. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:53:18,729][25689] Avg episode reward: [(0, '-47.656')] [2022-07-09 20:53:20,131][26022] Updated weights on worker 0-0, policy_version 416115 (0.00424) [2022-07-09 20:53:21,886][26022] Updated weights on worker 0-0, policy_version 416125 (0.00081) [2022-07-09 20:53:23,829][25689] Fps is (10 sec: 5642.3, 60 sec: 5719.6, 300 sec: 5715.7). Total num frames: 426121216. Throughput: 0: 5923.1. Samples: 426122516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:53:23,830][25689] Avg episode reward: [(0, '-46.447')] [2022-07-09 20:53:23,852][26022] Updated weights on worker 0-0, policy_version 416135 (0.00083) [2022-07-09 20:53:25,331][26022] Updated weights on worker 0-0, policy_version 416145 (0.00093) [2022-07-09 20:53:27,385][26022] Updated weights on worker 0-0, policy_version 416155 (0.00084) [2022-07-09 20:53:28,852][25689] Fps is (10 sec: 5866.9, 60 sec: 5739.2, 300 sec: 5723.6). Total num frames: 426151936. Throughput: 0: 6010.7. Samples: 426157010. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-09 20:53:28,852][25689] Avg episode reward: [(0, '-46.295')] [2022-07-09 20:53:29,048][26022] Updated weights on worker 0-0, policy_version 416165 (0.00087) [2022-07-09 20:53:30,724][26022] Updated weights on worker 0-0, policy_version 416175 (0.00089) [2022-07-09 20:53:31,290][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:53:31,305][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000416178_426166272.pth [2022-07-09 20:53:31,305][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000414165_424104960.pth [2022-07-09 20:53:32,912][26022] Updated weights on worker 0-0, policy_version 416185 (0.00095) [2022-07-09 20:53:33,922][25689] Fps is (10 sec: 5985.9, 60 sec: 5757.3, 300 sec: 5727.3). Total num frames: 426181632. Throughput: 0: 5138.8. Samples: 426174134. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:53:33,923][25689] Avg episode reward: [(0, '-47.113')] [2022-07-09 20:53:34,332][26022] Updated weights on worker 0-0, policy_version 416195 (0.00095) [2022-07-09 20:53:36,330][26022] Updated weights on worker 0-0, policy_version 416205 (0.00090) [2022-07-09 20:53:37,992][26022] Updated weights on worker 0-0, policy_version 416215 (0.00088) [2022-07-09 20:53:38,948][25689] Fps is (10 sec: 5679.5, 60 sec: 5711.2, 300 sec: 5727.2). Total num frames: 426209280. Throughput: 0: 5976.1. Samples: 426208510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:53:38,949][25689] Avg episode reward: [(0, '-46.271')] [2022-07-09 20:53:39,789][26022] Updated weights on worker 0-0, policy_version 416225 (0.00082) [2022-07-09 20:53:41,704][26022] Updated weights on worker 0-0, policy_version 416235 (0.00088) [2022-07-09 20:53:43,210][26022] Updated weights on worker 0-0, policy_version 416245 (0.00087) [2022-07-09 20:53:43,957][25689] Fps is (10 sec: 5612.3, 60 sec: 5746.5, 300 sec: 5723.8). Total num frames: 426237952. Throughput: 0: 5979.7. Samples: 426242876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:53:43,958][25689] Avg episode reward: [(0, '-45.629')] [2022-07-09 20:53:45,156][26022] Updated weights on worker 0-0, policy_version 416255 (0.00087) [2022-07-09 20:53:46,991][26022] Updated weights on worker 0-0, policy_version 416265 (0.00091) [2022-07-09 20:53:48,621][26022] Updated weights on worker 0-0, policy_version 416275 (0.00087) [2022-07-09 20:53:48,958][25689] Fps is (10 sec: 5728.6, 60 sec: 5730.3, 300 sec: 5732.3). Total num frames: 426266624. Throughput: 0: 5127.3. Samples: 426260104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:53:48,959][25689] Avg episode reward: [(0, '-45.960')] [2022-07-09 20:53:50,402][26022] Updated weights on worker 0-0, policy_version 416285 (0.00086) [2022-07-09 20:53:52,310][26022] Updated weights on worker 0-0, policy_version 416295 (0.00091) [2022-07-09 20:53:54,019][25689] Fps is (10 sec: 5699.1, 60 sec: 5694.6, 300 sec: 5724.6). Total num frames: 426295296. Throughput: 0: 5985.3. Samples: 426294420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:53:54,020][25689] Avg episode reward: [(0, '-46.535')] [2022-07-09 20:53:54,190][26022] Updated weights on worker 0-0, policy_version 416305 (0.00090) [2022-07-09 20:53:55,998][26022] Updated weights on worker 0-0, policy_version 416315 (0.00091) [2022-07-09 20:53:57,625][26022] Updated weights on worker 0-0, policy_version 416325 (0.00105) [2022-07-09 20:53:59,048][25689] Fps is (10 sec: 5784.9, 60 sec: 5726.0, 300 sec: 5724.3). Total num frames: 426324992. Throughput: 0: 5998.1. Samples: 426329070. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:53:59,049][25689] Avg episode reward: [(0, '-46.501')] [2022-07-09 20:53:59,347][26022] Updated weights on worker 0-0, policy_version 416335 (0.00086) [2022-07-09 20:54:01,162][26022] Updated weights on worker 0-0, policy_version 416345 (0.00085) [2022-07-09 20:54:03,264][26022] Updated weights on worker 0-0, policy_version 416355 (0.00083) [2022-07-09 20:54:04,050][25689] Fps is (10 sec: 5716.5, 60 sec: 5743.2, 300 sec: 5727.9). Total num frames: 426352640. Throughput: 0: 5167.2. Samples: 426346702. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:04,051][25689] Avg episode reward: [(0, '-46.078')] [2022-07-09 20:54:05,092][26022] Updated weights on worker 0-0, policy_version 416365 (0.00084) [2022-07-09 20:54:06,802][26022] Updated weights on worker 0-0, policy_version 416375 (0.00085) [2022-07-09 20:54:08,777][26022] Updated weights on worker 0-0, policy_version 416385 (0.00088) [2022-07-09 20:54:09,090][25689] Fps is (10 sec: 5404.6, 60 sec: 5690.5, 300 sec: 5722.5). Total num frames: 426379264. Throughput: 0: 5908.5. Samples: 426379052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:09,092][25689] Avg episode reward: [(0, '-47.787')] [2022-07-09 20:54:10,352][26022] Updated weights on worker 0-0, policy_version 416395 (0.00088) [2022-07-09 20:54:12,397][26022] Updated weights on worker 0-0, policy_version 416405 (0.00090) [2022-07-09 20:54:13,868][26022] Updated weights on worker 0-0, policy_version 416415 (0.00086) [2022-07-09 20:54:14,229][25689] Fps is (10 sec: 5633.6, 60 sec: 5714.9, 300 sec: 5724.5). Total num frames: 426409984. Throughput: 0: 5893.2. Samples: 426413522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:14,230][25689] Avg episode reward: [(0, '-47.330')] [2022-07-09 20:54:15,863][26022] Updated weights on worker 0-0, policy_version 416425 (0.00084) [2022-07-09 20:54:17,425][26022] Updated weights on worker 0-0, policy_version 416435 (0.00093) [2022-07-09 20:54:19,251][25689] Fps is (10 sec: 5744.6, 60 sec: 5701.8, 300 sec: 5718.3). Total num frames: 426437632. Throughput: 0: 5904.4. Samples: 426448354. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:19,251][25689] Avg episode reward: [(0, '-47.567')] [2022-07-09 20:54:19,352][26022] Updated weights on worker 0-0, policy_version 416445 (0.00092) [2022-07-09 20:54:21,099][26022] Updated weights on worker 0-0, policy_version 416455 (0.00092) [2022-07-09 20:54:22,759][26022] Updated weights on worker 0-0, policy_version 416465 (0.00085) [2022-07-09 20:54:24,262][25689] Fps is (10 sec: 5919.8, 60 sec: 5761.1, 300 sec: 5729.8). Total num frames: 426469376. Throughput: 0: 5889.3. Samples: 426465734. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:24,263][25689] Avg episode reward: [(0, '-46.818')] [2022-07-09 20:54:24,454][26022] Updated weights on worker 0-0, policy_version 416475 (0.00080) [2022-07-09 20:54:26,406][26022] Updated weights on worker 0-0, policy_version 416485 (0.00084) [2022-07-09 20:54:28,179][26022] Updated weights on worker 0-0, policy_version 416495 (0.00090) [2022-07-09 20:54:29,268][25689] Fps is (10 sec: 6031.2, 60 sec: 5728.8, 300 sec: 5732.4). Total num frames: 426498048. Throughput: 0: 6022.8. Samples: 426500580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:29,268][25689] Avg episode reward: [(0, '-47.352')] [2022-07-09 20:54:29,915][26022] Updated weights on worker 0-0, policy_version 416505 (0.00089) [2022-07-09 20:54:31,612][26022] Updated weights on worker 0-0, policy_version 416515 (0.00084) [2022-07-09 20:54:33,416][26022] Updated weights on worker 0-0, policy_version 416525 (0.00090) [2022-07-09 20:54:34,326][25689] Fps is (10 sec: 5596.1, 60 sec: 5696.0, 300 sec: 5725.1). Total num frames: 426525696. Throughput: 0: 6068.1. Samples: 426535472. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:34,327][25689] Avg episode reward: [(0, '-47.663')] [2022-07-09 20:54:35,090][26022] Updated weights on worker 0-0, policy_version 416535 (0.00097) [2022-07-09 20:54:37,099][26022] Updated weights on worker 0-0, policy_version 416545 (0.00087) [2022-07-09 20:54:38,725][26022] Updated weights on worker 0-0, policy_version 416555 (0.00087) [2022-07-09 20:54:39,375][25689] Fps is (10 sec: 5774.9, 60 sec: 5744.7, 300 sec: 5729.0). Total num frames: 426556416. Throughput: 0: 5189.1. Samples: 426552782. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:39,377][25689] Avg episode reward: [(0, '-46.803')] [2022-07-09 20:54:40,425][26022] Updated weights on worker 0-0, policy_version 416565 (0.00081) [2022-07-09 20:54:42,253][26022] Updated weights on worker 0-0, policy_version 416575 (0.00088) [2022-07-09 20:54:44,095][26022] Updated weights on worker 0-0, policy_version 416585 (0.00089) [2022-07-09 20:54:44,445][25689] Fps is (10 sec: 5869.5, 60 sec: 5738.9, 300 sec: 5735.0). Total num frames: 426585088. Throughput: 0: 6037.1. Samples: 426587580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:44,446][25689] Avg episode reward: [(0, '-46.632')] [2022-07-09 20:54:45,911][26022] Updated weights on worker 0-0, policy_version 416595 (0.00099) [2022-07-09 20:54:47,653][26022] Updated weights on worker 0-0, policy_version 416605 (0.00091) [2022-07-09 20:54:49,468][25689] Fps is (10 sec: 5681.7, 60 sec: 5736.9, 300 sec: 5728.5). Total num frames: 426613760. Throughput: 0: 6026.5. Samples: 426622316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:49,468][25689] Avg episode reward: [(0, '-46.533')] [2022-07-09 20:54:49,476][26022] Updated weights on worker 0-0, policy_version 416615 (0.00096) [2022-07-09 20:54:51,091][26022] Updated weights on worker 0-0, policy_version 416625 (0.00092) [2022-07-09 20:54:52,952][26022] Updated weights on worker 0-0, policy_version 416635 (0.00083) [2022-07-09 20:54:54,554][25689] Fps is (10 sec: 5773.7, 60 sec: 5751.4, 300 sec: 5730.8). Total num frames: 426643456. Throughput: 0: 5142.6. Samples: 426639496. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:54,555][25689] Avg episode reward: [(0, '-45.970')] [2022-07-09 20:54:54,792][26022] Updated weights on worker 0-0, policy_version 416645 (0.00091) [2022-07-09 20:54:56,488][26022] Updated weights on worker 0-0, policy_version 416655 (0.00085) [2022-07-09 20:54:58,500][26022] Updated weights on worker 0-0, policy_version 416665 (0.00086) [2022-07-09 20:54:59,614][25689] Fps is (10 sec: 5752.9, 60 sec: 5731.6, 300 sec: 5736.7). Total num frames: 426672128. Throughput: 0: 5989.8. Samples: 426674008. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:54:59,614][25689] Avg episode reward: [(0, '-45.674')] [2022-07-09 20:55:00,059][26022] Updated weights on worker 0-0, policy_version 416675 (0.00084) [2022-07-09 20:55:02,423][26022] Updated weights on worker 0-0, policy_version 416685 (0.00082) [2022-07-09 20:55:03,992][26022] Updated weights on worker 0-0, policy_version 416695 (0.00096) [2022-07-09 20:55:04,623][25689] Fps is (10 sec: 5390.4, 60 sec: 5697.1, 300 sec: 5726.4). Total num frames: 426697728. Throughput: 0: 5884.7. Samples: 426706318. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:04,623][25689] Avg episode reward: [(0, '-46.438')] [2022-07-09 20:55:05,755][26022] Updated weights on worker 0-0, policy_version 416705 (0.00085) [2022-07-09 20:55:07,695][26022] Updated weights on worker 0-0, policy_version 416715 (0.00090) [2022-07-09 20:55:09,243][26022] Updated weights on worker 0-0, policy_version 416725 (0.00087) [2022-07-09 20:55:09,636][25689] Fps is (10 sec: 5619.5, 60 sec: 5767.3, 300 sec: 5732.1). Total num frames: 426728448. Throughput: 0: 5025.6. Samples: 426723672. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:09,636][25689] Avg episode reward: [(0, '-46.420')] [2022-07-09 20:55:11,185][26022] Updated weights on worker 0-0, policy_version 416735 (0.00089) [2022-07-09 20:55:12,784][26022] Updated weights on worker 0-0, policy_version 416745 (0.00091) [2022-07-09 20:55:14,733][25689] Fps is (10 sec: 5772.9, 60 sec: 5720.5, 300 sec: 5727.2). Total num frames: 426756096. Throughput: 0: 5888.5. Samples: 426758320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:14,734][25689] Avg episode reward: [(0, '-46.533')] [2022-07-09 20:55:14,876][26022] Updated weights on worker 0-0, policy_version 416755 (0.00082) [2022-07-09 20:55:16,407][26022] Updated weights on worker 0-0, policy_version 416765 (0.00086) [2022-07-09 20:55:18,345][26022] Updated weights on worker 0-0, policy_version 416775 (0.00092) [2022-07-09 20:55:19,778][25689] Fps is (10 sec: 5654.2, 60 sec: 5752.1, 300 sec: 5728.0). Total num frames: 426785792. Throughput: 0: 5903.9. Samples: 426793054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:19,780][25689] Avg episode reward: [(0, '-46.111')] [2022-07-09 20:55:20,044][26022] Updated weights on worker 0-0, policy_version 416785 (0.00092) [2022-07-09 20:55:21,741][26022] Updated weights on worker 0-0, policy_version 416795 (0.00086) [2022-07-09 20:55:23,491][26022] Updated weights on worker 0-0, policy_version 416805 (0.00086) [2022-07-09 20:55:24,805][25689] Fps is (10 sec: 5795.0, 60 sec: 5699.9, 300 sec: 5728.0). Total num frames: 426814464. Throughput: 0: 5159.8. Samples: 426810458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:24,809][25689] Avg episode reward: [(0, '-45.823')] [2022-07-09 20:55:25,362][26022] Updated weights on worker 0-0, policy_version 416815 (0.00083) [2022-07-09 20:55:27,089][26022] Updated weights on worker 0-0, policy_version 416825 (0.00096) [2022-07-09 20:55:28,832][26022] Updated weights on worker 0-0, policy_version 416835 (0.00091) [2022-07-09 20:55:29,887][25689] Fps is (10 sec: 5874.9, 60 sec: 5726.5, 300 sec: 5732.4). Total num frames: 426845184. Throughput: 0: 5999.1. Samples: 426845162. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:29,888][25689] Avg episode reward: [(0, '-45.312')] [2022-07-09 20:55:30,670][26022] Updated weights on worker 0-0, policy_version 416845 (0.00103) [2022-07-09 20:55:31,347][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:55:31,362][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000416848_426852352.pth [2022-07-09 20:55:31,362][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000414834_424790016.pth [2022-07-09 20:55:32,448][26022] Updated weights on worker 0-0, policy_version 416855 (0.00079) [2022-07-09 20:55:34,057][26022] Updated weights on worker 0-0, policy_version 416865 (0.00086) [2022-07-09 20:55:34,946][25689] Fps is (10 sec: 5756.1, 60 sec: 5726.5, 300 sec: 5721.2). Total num frames: 426872832. Throughput: 0: 6001.2. Samples: 426879618. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:34,946][25689] Avg episode reward: [(0, '-44.726')] [2022-07-09 20:55:36,061][26022] Updated weights on worker 0-0, policy_version 416875 (0.00085) [2022-07-09 20:55:37,814][26022] Updated weights on worker 0-0, policy_version 416885 (0.00085) [2022-07-09 20:55:39,610][26022] Updated weights on worker 0-0, policy_version 416895 (0.00090) [2022-07-09 20:55:39,947][25689] Fps is (10 sec: 5700.4, 60 sec: 5714.1, 300 sec: 5728.3). Total num frames: 426902528. Throughput: 0: 5152.8. Samples: 426896982. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:39,948][25689] Avg episode reward: [(0, '-44.458')] [2022-07-09 20:55:41,269][26022] Updated weights on worker 0-0, policy_version 416905 (0.00108) [2022-07-09 20:55:43,172][26022] Updated weights on worker 0-0, policy_version 416915 (0.00083) [2022-07-09 20:55:44,756][26022] Updated weights on worker 0-0, policy_version 416925 (0.00094) [2022-07-09 20:55:44,952][25689] Fps is (10 sec: 5832.8, 60 sec: 5720.2, 300 sec: 5729.0). Total num frames: 426931200. Throughput: 0: 6027.2. Samples: 426931888. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:44,953][25689] Avg episode reward: [(0, '-44.906')] [2022-07-09 20:55:46,653][26022] Updated weights on worker 0-0, policy_version 416935 (0.00087) [2022-07-09 20:55:48,209][26022] Updated weights on worker 0-0, policy_version 416945 (0.00094) [2022-07-09 20:55:50,008][25689] Fps is (10 sec: 5699.5, 60 sec: 5717.1, 300 sec: 5725.4). Total num frames: 426959872. Throughput: 0: 6028.6. Samples: 426966462. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:50,009][25689] Avg episode reward: [(0, '-44.797')] [2022-07-09 20:55:50,319][26022] Updated weights on worker 0-0, policy_version 416955 (0.00082) [2022-07-09 20:55:51,845][26022] Updated weights on worker 0-0, policy_version 416965 (0.00079) [2022-07-09 20:55:53,804][26022] Updated weights on worker 0-0, policy_version 416975 (0.00089) [2022-07-09 20:55:55,037][25689] Fps is (10 sec: 5788.1, 60 sec: 5722.5, 300 sec: 5729.2). Total num frames: 426989568. Throughput: 0: 5173.5. Samples: 426983558. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:55:55,037][25689] Avg episode reward: [(0, '-44.840')] [2022-07-09 20:55:55,441][26022] Updated weights on worker 0-0, policy_version 416985 (0.00084) [2022-07-09 20:55:57,304][26022] Updated weights on worker 0-0, policy_version 416995 (0.00093) [2022-07-09 20:55:59,050][26022] Updated weights on worker 0-0, policy_version 417005 (0.00103) [2022-07-09 20:56:00,059][25689] Fps is (10 sec: 5807.5, 60 sec: 5726.1, 300 sec: 5739.6). Total num frames: 427018240. Throughput: 0: 6048.1. Samples: 427018620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:56:00,059][25689] Avg episode reward: [(0, '-44.759')] [2022-07-09 20:56:00,822][26022] Updated weights on worker 0-0, policy_version 417015 (0.00105) [2022-07-09 20:56:02,933][26022] Updated weights on worker 0-0, policy_version 417025 (0.00089) [2022-07-09 20:56:04,773][26022] Updated weights on worker 0-0, policy_version 417035 (0.00089) [2022-07-09 20:56:05,083][25689] Fps is (10 sec: 5504.0, 60 sec: 5741.5, 300 sec: 5732.6). Total num frames: 427044864. Throughput: 0: 5929.1. Samples: 427051244. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:56:05,084][25689] Avg episode reward: [(0, '-44.455')] [2022-07-09 20:56:06,445][26022] Updated weights on worker 0-0, policy_version 417045 (0.00087) [2022-07-09 20:56:08,292][26022] Updated weights on worker 0-0, policy_version 417055 (0.00085) [2022-07-09 20:56:10,087][26022] Updated weights on worker 0-0, policy_version 417065 (0.00088) [2022-07-09 20:56:10,098][25689] Fps is (10 sec: 5610.2, 60 sec: 5724.5, 300 sec: 5735.1). Total num frames: 427074560. Throughput: 0: 5088.5. Samples: 427068686. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-09 20:56:10,098][25689] Avg episode reward: [(0, '-45.060')] [2022-07-09 20:56:11,783][26022] Updated weights on worker 0-0, policy_version 417075 (0.00087) [2022-07-09 20:56:13,550][26022] Updated weights on worker 0-0, policy_version 417085 (0.00088) [2022-07-09 20:56:15,154][25689] Fps is (10 sec: 5795.6, 60 sec: 5745.3, 300 sec: 5731.2). Total num frames: 427103232. Throughput: 0: 5959.4. Samples: 427103448. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:15,156][25689] Avg episode reward: [(0, '-44.624')] [2022-07-09 20:56:15,402][26022] Updated weights on worker 0-0, policy_version 417095 (0.00087) [2022-07-09 20:56:17,098][26022] Updated weights on worker 0-0, policy_version 417105 (0.00086) [2022-07-09 20:56:18,878][26022] Updated weights on worker 0-0, policy_version 417115 (0.00084) [2022-07-09 20:56:20,171][25689] Fps is (10 sec: 5794.5, 60 sec: 5748.0, 300 sec: 5728.0). Total num frames: 427132928. Throughput: 0: 5928.6. Samples: 427137856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:20,171][25689] Avg episode reward: [(0, '-44.828')] [2022-07-09 20:56:20,638][26022] Updated weights on worker 0-0, policy_version 417125 (0.00086) [2022-07-09 20:56:22,550][26022] Updated weights on worker 0-0, policy_version 417135 (0.00086) [2022-07-09 20:56:24,283][26022] Updated weights on worker 0-0, policy_version 417145 (0.00087) [2022-07-09 20:56:25,177][25689] Fps is (10 sec: 5925.5, 60 sec: 5766.9, 300 sec: 5735.8). Total num frames: 427162624. Throughput: 0: 5180.2. Samples: 427155336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:25,178][25689] Avg episode reward: [(0, '-44.562')] [2022-07-09 20:56:26,087][26022] Updated weights on worker 0-0, policy_version 417155 (0.00095) [2022-07-09 20:56:27,735][26022] Updated weights on worker 0-0, policy_version 417165 (0.00080) [2022-07-09 20:56:29,468][26022] Updated weights on worker 0-0, policy_version 417175 (0.00086) [2022-07-09 20:56:30,187][25689] Fps is (10 sec: 5724.9, 60 sec: 5722.9, 300 sec: 5734.4). Total num frames: 427190272. Throughput: 0: 6049.3. Samples: 427190214. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:30,189][25689] Avg episode reward: [(0, '-44.789')] [2022-07-09 20:56:31,252][26022] Updated weights on worker 0-0, policy_version 417185 (0.00092) [2022-07-09 20:56:33,065][26022] Updated weights on worker 0-0, policy_version 417195 (0.00084) [2022-07-09 20:56:34,771][26022] Updated weights on worker 0-0, policy_version 417205 (0.00090) [2022-07-09 20:56:35,229][25689] Fps is (10 sec: 5806.8, 60 sec: 5775.4, 300 sec: 5734.7). Total num frames: 427220992. Throughput: 0: 6056.2. Samples: 427225024. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:35,231][25689] Avg episode reward: [(0, '-45.189')] [2022-07-09 20:56:36,755][26022] Updated weights on worker 0-0, policy_version 417215 (0.01224) [2022-07-09 20:56:38,109][26022] Updated weights on worker 0-0, policy_version 417225 (0.00083) [2022-07-09 20:56:40,235][25689] Fps is (10 sec: 5604.9, 60 sec: 5707.0, 300 sec: 5725.3). Total num frames: 427246592. Throughput: 0: 5205.2. Samples: 427242298. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:40,236][25689] Avg episode reward: [(0, '-45.646')] [2022-07-09 20:56:40,394][26022] Updated weights on worker 0-0, policy_version 417235 (0.00090) [2022-07-09 20:56:41,792][26022] Updated weights on worker 0-0, policy_version 417245 (0.00090) [2022-07-09 20:56:43,957][26022] Updated weights on worker 0-0, policy_version 417255 (0.00082) [2022-07-09 20:56:45,244][25689] Fps is (10 sec: 5521.4, 60 sec: 5723.7, 300 sec: 5728.7). Total num frames: 427276288. Throughput: 0: 6030.2. Samples: 427276340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:45,244][25689] Avg episode reward: [(0, '-46.348')] [2022-07-09 20:56:45,556][26022] Updated weights on worker 0-0, policy_version 417265 (0.00095) [2022-07-09 20:56:47,633][26022] Updated weights on worker 0-0, policy_version 417275 (0.00086) [2022-07-09 20:56:49,074][26022] Updated weights on worker 0-0, policy_version 417285 (0.00077) [2022-07-09 20:56:50,259][25689] Fps is (10 sec: 5721.1, 60 sec: 5710.6, 300 sec: 5723.7). Total num frames: 427303936. Throughput: 0: 5988.2. Samples: 427310406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:50,259][25689] Avg episode reward: [(0, '-46.529')] [2022-07-09 20:56:51,155][26022] Updated weights on worker 0-0, policy_version 417295 (0.00092) [2022-07-09 20:56:52,775][26022] Updated weights on worker 0-0, policy_version 417305 (0.00085) [2022-07-09 20:56:54,699][26022] Updated weights on worker 0-0, policy_version 417315 (0.00082) [2022-07-09 20:56:55,398][25689] Fps is (10 sec: 5748.0, 60 sec: 5717.0, 300 sec: 5725.2). Total num frames: 427334656. Throughput: 0: 5089.1. Samples: 427327668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:56:55,399][25689] Avg episode reward: [(0, '-47.463')] [2022-07-09 20:56:56,376][26022] Updated weights on worker 0-0, policy_version 417325 (0.00085) [2022-07-09 20:56:58,000][26022] Updated weights on worker 0-0, policy_version 417335 (0.00081) [2022-07-09 20:56:59,971][26022] Updated weights on worker 0-0, policy_version 417345 (0.00087) [2022-07-09 20:57:00,423][25689] Fps is (10 sec: 5742.5, 60 sec: 5699.8, 300 sec: 5728.5). Total num frames: 427362304. Throughput: 0: 5934.2. Samples: 427362094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:00,423][25689] Avg episode reward: [(0, '-47.803')] [2022-07-09 20:57:01,686][26022] Updated weights on worker 0-0, policy_version 417355 (0.00094) [2022-07-09 20:57:03,983][26022] Updated weights on worker 0-0, policy_version 417365 (0.00100) [2022-07-09 20:57:05,445][25689] Fps is (10 sec: 5503.9, 60 sec: 5717.0, 300 sec: 5728.5). Total num frames: 427389952. Throughput: 0: 5834.6. Samples: 427394206. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:05,447][25689] Avg episode reward: [(0, '-48.468')] [2022-07-09 20:57:05,678][26022] Updated weights on worker 0-0, policy_version 417375 (0.00085) [2022-07-09 20:57:07,496][26022] Updated weights on worker 0-0, policy_version 417385 (0.00086) [2022-07-09 20:57:09,380][26022] Updated weights on worker 0-0, policy_version 417395 (0.00085) [2022-07-09 20:57:10,523][25689] Fps is (10 sec: 5474.5, 60 sec: 5677.0, 300 sec: 5718.6). Total num frames: 427417600. Throughput: 0: 4981.4. Samples: 427411352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:10,525][25689] Avg episode reward: [(0, '-47.772')] [2022-07-09 20:57:11,216][26022] Updated weights on worker 0-0, policy_version 417405 (0.00085) [2022-07-09 20:57:12,855][26022] Updated weights on worker 0-0, policy_version 417415 (0.00105) [2022-07-09 20:57:14,748][26022] Updated weights on worker 0-0, policy_version 417425 (0.00116) [2022-07-09 20:57:15,656][25689] Fps is (10 sec: 5716.4, 60 sec: 5703.8, 300 sec: 5726.8). Total num frames: 427448320. Throughput: 0: 5828.5. Samples: 427445738. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:15,657][25689] Avg episode reward: [(0, '-46.959')] [2022-07-09 20:57:16,486][26022] Updated weights on worker 0-0, policy_version 417435 (0.00089) [2022-07-09 20:57:18,227][26022] Updated weights on worker 0-0, policy_version 417445 (0.00084) [2022-07-09 20:57:20,055][26022] Updated weights on worker 0-0, policy_version 417455 (0.00092) [2022-07-09 20:57:20,661][25689] Fps is (10 sec: 5858.8, 60 sec: 5687.9, 300 sec: 5723.6). Total num frames: 427476992. Throughput: 0: 5849.0. Samples: 427480464. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:20,662][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 20:57:21,805][26022] Updated weights on worker 0-0, policy_version 417465 (0.00086) [2022-07-09 20:57:23,706][26022] Updated weights on worker 0-0, policy_version 417475 (0.00086) [2022-07-09 20:57:25,199][26022] Updated weights on worker 0-0, policy_version 417485 (0.00094) [2022-07-09 20:57:25,748][25689] Fps is (10 sec: 5783.3, 60 sec: 5680.4, 300 sec: 5722.9). Total num frames: 427506688. Throughput: 0: 5955.9. Samples: 427515130. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:25,749][25689] Avg episode reward: [(0, '-46.147')] [2022-07-09 20:57:27,087][26022] Updated weights on worker 0-0, policy_version 417495 (0.00085) [2022-07-09 20:57:29,055][26022] Updated weights on worker 0-0, policy_version 417505 (0.00092) [2022-07-09 20:57:30,600][26022] Updated weights on worker 0-0, policy_version 417515 (0.00105) [2022-07-09 20:57:30,755][25689] Fps is (10 sec: 5782.4, 60 sec: 5697.6, 300 sec: 5724.4). Total num frames: 427535360. Throughput: 0: 5986.1. Samples: 427532458. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:30,755][25689] Avg episode reward: [(0, '-45.144')] [2022-07-09 20:57:31,369][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:57:31,377][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000417520_427540480.pth [2022-07-09 20:57:31,378][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000415504_425476096.pth [2022-07-09 20:57:32,524][26022] Updated weights on worker 0-0, policy_version 417525 (0.00098) [2022-07-09 20:57:34,086][26022] Updated weights on worker 0-0, policy_version 417535 (0.00059) [2022-07-09 20:57:35,802][25689] Fps is (10 sec: 5703.4, 60 sec: 5663.2, 300 sec: 5718.0). Total num frames: 427564032. Throughput: 0: 6021.2. Samples: 427567046. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:35,803][25689] Avg episode reward: [(0, '-44.931')] [2022-07-09 20:57:36,086][26022] Updated weights on worker 0-0, policy_version 417545 (0.00091) [2022-07-09 20:57:37,785][26022] Updated weights on worker 0-0, policy_version 417555 (0.00081) [2022-07-09 20:57:39,611][26022] Updated weights on worker 0-0, policy_version 417565 (0.00086) [2022-07-09 20:57:40,807][25689] Fps is (10 sec: 5704.6, 60 sec: 5714.1, 300 sec: 5725.3). Total num frames: 427592704. Throughput: 0: 6016.2. Samples: 427601668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:40,807][25689] Avg episode reward: [(0, '-45.448')] [2022-07-09 20:57:41,355][26022] Updated weights on worker 0-0, policy_version 417575 (0.00092) [2022-07-09 20:57:43,063][26022] Updated weights on worker 0-0, policy_version 417585 (0.00089) [2022-07-09 20:57:44,958][26022] Updated weights on worker 0-0, policy_version 417595 (0.00086) [2022-07-09 20:57:45,882][25689] Fps is (10 sec: 5892.4, 60 sec: 5724.7, 300 sec: 5727.5). Total num frames: 427623424. Throughput: 0: 5160.0. Samples: 427619020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:45,882][25689] Avg episode reward: [(0, '-46.308')] [2022-07-09 20:57:46,835][26022] Updated weights on worker 0-0, policy_version 417605 (0.00087) [2022-07-09 20:57:48,498][26022] Updated weights on worker 0-0, policy_version 417615 (0.00079) [2022-07-09 20:57:50,319][26022] Updated weights on worker 0-0, policy_version 417625 (0.00093) [2022-07-09 20:57:50,907][25689] Fps is (10 sec: 5779.1, 60 sec: 5723.8, 300 sec: 5717.5). Total num frames: 427651072. Throughput: 0: 6024.7. Samples: 427653868. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:50,908][25689] Avg episode reward: [(0, '-46.566')] [2022-07-09 20:57:51,999][26022] Updated weights on worker 0-0, policy_version 417635 (0.00089) [2022-07-09 20:57:53,900][26022] Updated weights on worker 0-0, policy_version 417645 (0.00079) [2022-07-09 20:57:55,689][26022] Updated weights on worker 0-0, policy_version 417655 (0.00086) [2022-07-09 20:57:55,979][25689] Fps is (10 sec: 5578.0, 60 sec: 5696.4, 300 sec: 5719.6). Total num frames: 427679744. Throughput: 0: 6019.3. Samples: 427688494. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:57:55,980][25689] Avg episode reward: [(0, '-46.328')] [2022-07-09 20:57:57,279][26022] Updated weights on worker 0-0, policy_version 417665 (0.00080) [2022-07-09 20:57:59,392][26022] Updated weights on worker 0-0, policy_version 417675 (0.00089) [2022-07-09 20:58:00,788][26022] Updated weights on worker 0-0, policy_version 417685 (0.00089) [2022-07-09 20:58:01,020][25689] Fps is (10 sec: 5973.7, 60 sec: 5762.3, 300 sec: 5736.1). Total num frames: 427711488. Throughput: 0: 5149.1. Samples: 427705750. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:01,022][25689] Avg episode reward: [(0, '-46.802')] [2022-07-09 20:58:03,234][26022] Updated weights on worker 0-0, policy_version 417695 (0.00086) [2022-07-09 20:58:04,808][26022] Updated weights on worker 0-0, policy_version 417705 (0.00085) [2022-07-09 20:58:06,033][25689] Fps is (10 sec: 5601.5, 60 sec: 5712.5, 300 sec: 5719.0). Total num frames: 427736064. Throughput: 0: 5903.1. Samples: 427737976. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:06,034][25689] Avg episode reward: [(0, '-47.620')] [2022-07-09 20:58:06,680][26022] Updated weights on worker 0-0, policy_version 417715 (0.00084) [2022-07-09 20:58:08,393][26022] Updated weights on worker 0-0, policy_version 417725 (0.00094) [2022-07-09 20:58:10,156][26022] Updated weights on worker 0-0, policy_version 417735 (0.00086) [2022-07-09 20:58:11,059][25689] Fps is (10 sec: 5406.2, 60 sec: 5751.3, 300 sec: 5722.7). Total num frames: 427765760. Throughput: 0: 5896.8. Samples: 427772704. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:11,061][25689] Avg episode reward: [(0, '-47.258')] [2022-07-09 20:58:11,919][26022] Updated weights on worker 0-0, policy_version 417745 (0.00085) [2022-07-09 20:58:13,717][26022] Updated weights on worker 0-0, policy_version 417755 (0.00086) [2022-07-09 20:58:15,427][26022] Updated weights on worker 0-0, policy_version 417765 (0.00083) [2022-07-09 20:58:16,118][25689] Fps is (10 sec: 5889.1, 60 sec: 5741.3, 300 sec: 5726.2). Total num frames: 427795456. Throughput: 0: 5051.0. Samples: 427790218. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:16,119][25689] Avg episode reward: [(0, '-47.462')] [2022-07-09 20:58:17,350][26022] Updated weights on worker 0-0, policy_version 417775 (0.00098) [2022-07-09 20:58:19,057][26022] Updated weights on worker 0-0, policy_version 417785 (0.00089) [2022-07-09 20:58:21,023][26022] Updated weights on worker 0-0, policy_version 417795 (0.00086) [2022-07-09 20:58:21,130][25689] Fps is (10 sec: 5795.5, 60 sec: 5740.7, 300 sec: 5727.9). Total num frames: 427824128. Throughput: 0: 5908.9. Samples: 427824578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:21,131][25689] Avg episode reward: [(0, '-48.475')] [2022-07-09 20:58:22,768][26022] Updated weights on worker 0-0, policy_version 417805 (0.00084) [2022-07-09 20:58:24,486][26022] Updated weights on worker 0-0, policy_version 417815 (0.00084) [2022-07-09 20:58:26,170][25689] Fps is (10 sec: 5704.9, 60 sec: 5728.3, 300 sec: 5720.7). Total num frames: 427852800. Throughput: 0: 6041.0. Samples: 427859622. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:26,170][25689] Avg episode reward: [(0, '-48.511')] [2022-07-09 20:58:26,172][26022] Updated weights on worker 0-0, policy_version 417825 (0.00095) [2022-07-09 20:58:27,866][26022] Updated weights on worker 0-0, policy_version 417835 (0.00085) [2022-07-09 20:58:29,661][26022] Updated weights on worker 0-0, policy_version 417845 (0.00088) [2022-07-09 20:58:31,202][25689] Fps is (10 sec: 5693.6, 60 sec: 5725.8, 300 sec: 5718.0). Total num frames: 427881472. Throughput: 0: 5164.6. Samples: 427876734. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:31,203][25689] Avg episode reward: [(0, '-47.733')] [2022-07-09 20:58:31,528][26022] Updated weights on worker 0-0, policy_version 417855 (0.00070) [2022-07-09 20:58:33,263][26022] Updated weights on worker 0-0, policy_version 417865 (0.00086) [2022-07-09 20:58:34,946][26022] Updated weights on worker 0-0, policy_version 417875 (0.00090) [2022-07-09 20:58:36,338][25689] Fps is (10 sec: 5740.4, 60 sec: 5734.4, 300 sec: 5722.8). Total num frames: 427911168. Throughput: 0: 5988.6. Samples: 427911304. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:36,338][25689] Avg episode reward: [(0, '-46.839')] [2022-07-09 20:58:37,059][26022] Updated weights on worker 0-0, policy_version 417885 (0.00087) [2022-07-09 20:58:38,300][26022] Updated weights on worker 0-0, policy_version 417895 (0.00095) [2022-07-09 20:58:40,382][26022] Updated weights on worker 0-0, policy_version 417905 (0.00085) [2022-07-09 20:58:41,366][25689] Fps is (10 sec: 5843.4, 60 sec: 5749.1, 300 sec: 5725.8). Total num frames: 427940864. Throughput: 0: 5999.5. Samples: 427945982. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:41,367][25689] Avg episode reward: [(0, '-46.975')] [2022-07-09 20:58:41,965][26022] Updated weights on worker 0-0, policy_version 417915 (0.00084) [2022-07-09 20:58:44,040][26022] Updated weights on worker 0-0, policy_version 417925 (0.00090) [2022-07-09 20:58:45,671][26022] Updated weights on worker 0-0, policy_version 417935 (0.00098) [2022-07-09 20:58:46,385][25689] Fps is (10 sec: 5707.3, 60 sec: 5703.6, 300 sec: 5722.0). Total num frames: 427968512. Throughput: 0: 5132.9. Samples: 427963386. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:46,386][25689] Avg episode reward: [(0, '-46.401')] [2022-07-09 20:58:47,550][26022] Updated weights on worker 0-0, policy_version 417945 (0.00093) [2022-07-09 20:58:49,175][26022] Updated weights on worker 0-0, policy_version 417955 (0.00084) [2022-07-09 20:58:51,152][26022] Updated weights on worker 0-0, policy_version 417965 (0.00093) [2022-07-09 20:58:51,444][25689] Fps is (10 sec: 5690.2, 60 sec: 5734.3, 300 sec: 5725.5). Total num frames: 427998208. Throughput: 0: 5985.7. Samples: 427997896. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:51,444][25689] Avg episode reward: [(0, '-46.282')] [2022-07-09 20:58:52,923][26022] Updated weights on worker 0-0, policy_version 417975 (0.00093) [2022-07-09 20:58:54,600][26022] Updated weights on worker 0-0, policy_version 417985 (0.00082) [2022-07-09 20:58:56,413][26022] Updated weights on worker 0-0, policy_version 417995 (0.00088) [2022-07-09 20:58:56,510][25689] Fps is (10 sec: 5764.8, 60 sec: 5734.8, 300 sec: 5721.4). Total num frames: 428026880. Throughput: 0: 5998.0. Samples: 428032298. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-09 20:58:56,511][25689] Avg episode reward: [(0, '-46.581')] [2022-07-09 20:58:58,022][26022] Updated weights on worker 0-0, policy_version 418005 (0.00092) [2022-07-09 20:58:59,979][26022] Updated weights on worker 0-0, policy_version 418015 (0.00089) [2022-07-09 20:59:01,550][25689] Fps is (10 sec: 5775.5, 60 sec: 5701.1, 300 sec: 5727.5). Total num frames: 428056576. Throughput: 0: 5142.2. Samples: 428049772. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:01,551][25689] Avg episode reward: [(0, '-46.358')] [2022-07-09 20:59:01,851][26022] Updated weights on worker 0-0, policy_version 418025 (0.00104) [2022-07-09 20:59:03,813][26022] Updated weights on worker 0-0, policy_version 418035 (0.00092) [2022-07-09 20:59:05,370][26022] Updated weights on worker 0-0, policy_version 418045 (0.00092) [2022-07-09 20:59:06,621][25689] Fps is (10 sec: 5671.7, 60 sec: 5746.4, 300 sec: 5730.4). Total num frames: 428084224. Throughput: 0: 5907.6. Samples: 428082928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:06,621][25689] Avg episode reward: [(0, '-45.869')] [2022-07-09 20:59:07,405][26022] Updated weights on worker 0-0, policy_version 418055 (0.00089) [2022-07-09 20:59:09,119][26022] Updated weights on worker 0-0, policy_version 418065 (0.00421) [2022-07-09 20:59:10,674][26022] Updated weights on worker 0-0, policy_version 418075 (0.00091) [2022-07-09 20:59:11,702][25689] Fps is (10 sec: 5446.9, 60 sec: 5707.4, 300 sec: 5721.2). Total num frames: 428111872. Throughput: 0: 5906.7. Samples: 428117554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:11,703][25689] Avg episode reward: [(0, '-44.888')] [2022-07-09 20:59:12,563][26022] Updated weights on worker 0-0, policy_version 418085 (0.00093) [2022-07-09 20:59:14,538][26022] Updated weights on worker 0-0, policy_version 418095 (0.00083) [2022-07-09 20:59:16,075][26022] Updated weights on worker 0-0, policy_version 418105 (0.00087) [2022-07-09 20:59:16,770][25689] Fps is (10 sec: 5851.6, 60 sec: 5740.3, 300 sec: 5734.0). Total num frames: 428143616. Throughput: 0: 5913.2. Samples: 428152100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:16,771][25689] Avg episode reward: [(0, '-45.442')] [2022-07-09 20:59:18,198][26022] Updated weights on worker 0-0, policy_version 418115 (0.00080) [2022-07-09 20:59:19,602][26022] Updated weights on worker 0-0, policy_version 418125 (0.00091) [2022-07-09 20:59:21,623][26022] Updated weights on worker 0-0, policy_version 418135 (0.00089) [2022-07-09 20:59:21,803][25689] Fps is (10 sec: 5879.5, 60 sec: 5721.4, 300 sec: 5719.8). Total num frames: 428171264. Throughput: 0: 5882.5. Samples: 428168912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:21,804][25689] Avg episode reward: [(0, '-45.273')] [2022-07-09 20:59:23,563][26022] Updated weights on worker 0-0, policy_version 418145 (0.00083) [2022-07-09 20:59:25,102][26022] Updated weights on worker 0-0, policy_version 418155 (0.00088) [2022-07-09 20:59:26,835][25689] Fps is (10 sec: 5596.1, 60 sec: 5722.2, 300 sec: 5719.4). Total num frames: 428199936. Throughput: 0: 5957.0. Samples: 428203342. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:26,835][25689] Avg episode reward: [(0, '-45.583')] [2022-07-09 20:59:26,987][26022] Updated weights on worker 0-0, policy_version 418165 (0.00100) [2022-07-09 20:59:28,648][26022] Updated weights on worker 0-0, policy_version 418175 (0.00087) [2022-07-09 20:59:30,616][26022] Updated weights on worker 0-0, policy_version 418185 (0.00086) [2022-07-09 20:59:31,383][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 20:59:31,400][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000418190_428226560.pth [2022-07-09 20:59:31,401][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000416178_426166272.pth [2022-07-09 20:59:31,904][25689] Fps is (10 sec: 5778.9, 60 sec: 5735.6, 300 sec: 5726.0). Total num frames: 428229632. Throughput: 0: 5946.7. Samples: 428237688. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:31,904][25689] Avg episode reward: [(0, '-45.817')] [2022-07-09 20:59:32,481][26022] Updated weights on worker 0-0, policy_version 418195 (0.00087) [2022-07-09 20:59:34,079][26022] Updated weights on worker 0-0, policy_version 418205 (0.00086) [2022-07-09 20:59:36,122][26022] Updated weights on worker 0-0, policy_version 418215 (0.00086) [2022-07-09 20:59:36,990][25689] Fps is (10 sec: 5646.4, 60 sec: 5706.5, 300 sec: 5715.0). Total num frames: 428257280. Throughput: 0: 5074.4. Samples: 428254704. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:36,991][25689] Avg episode reward: [(0, '-46.623')] [2022-07-09 20:59:37,759][26022] Updated weights on worker 0-0, policy_version 418225 (0.00083) [2022-07-09 20:59:39,512][26022] Updated weights on worker 0-0, policy_version 418235 (0.00082) [2022-07-09 20:59:41,260][26022] Updated weights on worker 0-0, policy_version 418245 (0.00088) [2022-07-09 20:59:42,061][25689] Fps is (10 sec: 5544.7, 60 sec: 5685.6, 300 sec: 5715.0). Total num frames: 428285952. Throughput: 0: 5924.1. Samples: 428288920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:42,062][25689] Avg episode reward: [(0, '-47.114')] [2022-07-09 20:59:43,185][26022] Updated weights on worker 0-0, policy_version 418255 (0.00100) [2022-07-09 20:59:44,926][26022] Updated weights on worker 0-0, policy_version 418265 (0.00093) [2022-07-09 20:59:46,776][26022] Updated weights on worker 0-0, policy_version 418275 (0.00087) [2022-07-09 20:59:47,121][25689] Fps is (10 sec: 5761.5, 60 sec: 5715.5, 300 sec: 5717.7). Total num frames: 428315648. Throughput: 0: 5931.7. Samples: 428323676. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:47,122][25689] Avg episode reward: [(0, '-46.357')] [2022-07-09 20:59:48,467][26022] Updated weights on worker 0-0, policy_version 418285 (0.00095) [2022-07-09 20:59:50,277][26022] Updated weights on worker 0-0, policy_version 418295 (0.00090) [2022-07-09 20:59:52,109][26022] Updated weights on worker 0-0, policy_version 418305 (0.00077) [2022-07-09 20:59:52,203][25689] Fps is (10 sec: 5755.4, 60 sec: 5696.5, 300 sec: 5714.3). Total num frames: 428344320. Throughput: 0: 5080.8. Samples: 428340824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:52,203][25689] Avg episode reward: [(0, '-46.613')] [2022-07-09 20:59:53,707][26022] Updated weights on worker 0-0, policy_version 418315 (0.00086) [2022-07-09 20:59:55,521][26022] Updated weights on worker 0-0, policy_version 418325 (0.00091) [2022-07-09 20:59:57,299][25689] Fps is (10 sec: 5734.6, 60 sec: 5710.5, 300 sec: 5717.1). Total num frames: 428374016. Throughput: 0: 5943.6. Samples: 428375412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 20:59:57,300][25689] Avg episode reward: [(0, '-46.625')] [2022-07-09 20:59:57,587][26022] Updated weights on worker 0-0, policy_version 418335 (0.00094) [2022-07-09 20:59:59,240][26022] Updated weights on worker 0-0, policy_version 418345 (0.00087) [2022-07-09 21:00:01,298][26022] Updated weights on worker 0-0, policy_version 418355 (0.00085) [2022-07-09 21:00:02,326][25689] Fps is (10 sec: 5563.6, 60 sec: 5661.2, 300 sec: 5720.2). Total num frames: 428400640. Throughput: 0: 5845.2. Samples: 428407370. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:02,327][25689] Avg episode reward: [(0, '-47.750')] [2022-07-09 21:00:03,181][26022] Updated weights on worker 0-0, policy_version 418365 (0.00087) [2022-07-09 21:00:04,830][26022] Updated weights on worker 0-0, policy_version 418375 (0.00093) [2022-07-09 21:00:06,870][26022] Updated weights on worker 0-0, policy_version 418385 (0.00091) [2022-07-09 21:00:07,348][25689] Fps is (10 sec: 5503.1, 60 sec: 5682.6, 300 sec: 5713.1). Total num frames: 428429312. Throughput: 0: 4980.6. Samples: 428424416. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:07,348][25689] Avg episode reward: [(0, '-46.827')] [2022-07-09 21:00:08,536][26022] Updated weights on worker 0-0, policy_version 418395 (0.00091) [2022-07-09 21:00:10,334][26022] Updated weights on worker 0-0, policy_version 418405 (0.00083) [2022-07-09 21:00:12,108][26022] Updated weights on worker 0-0, policy_version 418415 (0.00092) [2022-07-09 21:00:12,361][25689] Fps is (10 sec: 5714.4, 60 sec: 5705.9, 300 sec: 5718.2). Total num frames: 428457984. Throughput: 0: 5865.2. Samples: 428459054. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:12,361][25689] Avg episode reward: [(0, '-46.336')] [2022-07-09 21:00:13,842][26022] Updated weights on worker 0-0, policy_version 418425 (0.00083) [2022-07-09 21:00:15,751][26022] Updated weights on worker 0-0, policy_version 418435 (0.00082) [2022-07-09 21:00:17,524][25689] Fps is (10 sec: 5635.1, 60 sec: 5646.4, 300 sec: 5712.5). Total num frames: 428486656. Throughput: 0: 5850.6. Samples: 428493736. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:17,524][25689] Avg episode reward: [(0, '-46.636')] [2022-07-09 21:00:17,555][26022] Updated weights on worker 0-0, policy_version 418445 (0.00080) [2022-07-09 21:00:19,073][26022] Updated weights on worker 0-0, policy_version 418455 (0.00099) [2022-07-09 21:00:21,137][26022] Updated weights on worker 0-0, policy_version 418465 (0.00087) [2022-07-09 21:00:22,569][25689] Fps is (10 sec: 5718.0, 60 sec: 5679.1, 300 sec: 5715.6). Total num frames: 428516352. Throughput: 0: 5125.1. Samples: 428511118. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:22,570][25689] Avg episode reward: [(0, '-45.906')] [2022-07-09 21:00:22,690][26022] Updated weights on worker 0-0, policy_version 418475 (0.00095) [2022-07-09 21:00:24,625][26022] Updated weights on worker 0-0, policy_version 418485 (0.00091) [2022-07-09 21:00:26,160][26022] Updated weights on worker 0-0, policy_version 418495 (0.00089) [2022-07-09 21:00:27,575][25689] Fps is (10 sec: 5806.8, 60 sec: 5681.3, 300 sec: 5710.1). Total num frames: 428545024. Throughput: 0: 5999.1. Samples: 428545764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:27,576][25689] Avg episode reward: [(0, '-45.040')] [2022-07-09 21:00:28,084][26022] Updated weights on worker 0-0, policy_version 418505 (0.00088) [2022-07-09 21:00:29,871][26022] Updated weights on worker 0-0, policy_version 418515 (0.00057) [2022-07-09 21:00:31,927][26022] Updated weights on worker 0-0, policy_version 418525 (0.00496) [2022-07-09 21:00:32,584][25689] Fps is (10 sec: 5623.1, 60 sec: 5653.2, 300 sec: 5711.1). Total num frames: 428572672. Throughput: 0: 5979.1. Samples: 428579974. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:32,585][25689] Avg episode reward: [(0, '-44.545')] [2022-07-09 21:00:33,488][26022] Updated weights on worker 0-0, policy_version 418535 (0.00091) [2022-07-09 21:00:35,352][26022] Updated weights on worker 0-0, policy_version 418545 (0.00083) [2022-07-09 21:00:36,896][26022] Updated weights on worker 0-0, policy_version 418555 (0.00086) [2022-07-09 21:00:37,643][25689] Fps is (10 sec: 5696.2, 60 sec: 5689.6, 300 sec: 5710.0). Total num frames: 428602368. Throughput: 0: 5142.0. Samples: 428597186. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:37,643][25689] Avg episode reward: [(0, '-44.339')] [2022-07-09 21:00:39,015][26022] Updated weights on worker 0-0, policy_version 418565 (0.00090) [2022-07-09 21:00:40,579][26022] Updated weights on worker 0-0, policy_version 418575 (0.00095) [2022-07-09 21:00:42,530][26022] Updated weights on worker 0-0, policy_version 418585 (0.00086) [2022-07-09 21:00:42,720][25689] Fps is (10 sec: 5960.5, 60 sec: 5722.7, 300 sec: 5715.5). Total num frames: 428633088. Throughput: 0: 5959.5. Samples: 428631214. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:42,721][25689] Avg episode reward: [(0, '-44.803')] [2022-07-09 21:00:44,444][26022] Updated weights on worker 0-0, policy_version 418595 (0.00080) [2022-07-09 21:00:46,035][26022] Updated weights on worker 0-0, policy_version 418605 (0.00084) [2022-07-09 21:00:47,763][25689] Fps is (10 sec: 5767.2, 60 sec: 5690.6, 300 sec: 5712.3). Total num frames: 428660736. Throughput: 0: 5954.4. Samples: 428665970. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:47,764][25689] Avg episode reward: [(0, '-45.067')] [2022-07-09 21:00:47,858][26022] Updated weights on worker 0-0, policy_version 418615 (0.00097) [2022-07-09 21:00:49,580][26022] Updated weights on worker 0-0, policy_version 418625 (0.00088) [2022-07-09 21:00:51,269][26022] Updated weights on worker 0-0, policy_version 418635 (0.00090) [2022-07-09 21:00:52,817][25689] Fps is (10 sec: 5578.0, 60 sec: 5693.2, 300 sec: 5708.4). Total num frames: 428689408. Throughput: 0: 5105.8. Samples: 428683278. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:52,818][25689] Avg episode reward: [(0, '-45.532')] [2022-07-09 21:00:53,309][26022] Updated weights on worker 0-0, policy_version 418645 (0.00084) [2022-07-09 21:00:54,869][26022] Updated weights on worker 0-0, policy_version 418655 (0.00092) [2022-07-09 21:00:56,857][26022] Updated weights on worker 0-0, policy_version 418665 (0.00087) [2022-07-09 21:00:57,899][25689] Fps is (10 sec: 5758.8, 60 sec: 5694.6, 300 sec: 5710.7). Total num frames: 428719104. Throughput: 0: 5940.0. Samples: 428717508. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:00:57,899][25689] Avg episode reward: [(0, '-45.586')] [2022-07-09 21:00:58,401][26022] Updated weights on worker 0-0, policy_version 418675 (0.00082) [2022-07-09 21:01:00,508][26022] Updated weights on worker 0-0, policy_version 418685 (0.00081) [2022-07-09 21:01:01,968][26022] Updated weights on worker 0-0, policy_version 418695 (0.00079) [2022-07-09 21:01:02,914][25689] Fps is (10 sec: 5577.8, 60 sec: 5695.6, 300 sec: 5710.8). Total num frames: 428745728. Throughput: 0: 5973.5. Samples: 428751842. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:02,915][25689] Avg episode reward: [(0, '-45.447')] [2022-07-09 21:01:04,524][26022] Updated weights on worker 0-0, policy_version 418705 (0.00093) [2022-07-09 21:01:05,928][26022] Updated weights on worker 0-0, policy_version 418715 (0.00088) [2022-07-09 21:01:07,938][25689] Fps is (10 sec: 5507.9, 60 sec: 5695.4, 300 sec: 5707.2). Total num frames: 428774400. Throughput: 0: 5013.9. Samples: 428767124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:07,939][25689] Avg episode reward: [(0, '-45.508')] [2022-07-09 21:01:07,950][26022] Updated weights on worker 0-0, policy_version 418725 (0.00098) [2022-07-09 21:01:09,477][26022] Updated weights on worker 0-0, policy_version 418735 (0.00089) [2022-07-09 21:01:11,611][26022] Updated weights on worker 0-0, policy_version 418745 (0.00096) [2022-07-09 21:01:12,962][25689] Fps is (10 sec: 5809.2, 60 sec: 5711.3, 300 sec: 5711.3). Total num frames: 428804096. Throughput: 0: 5856.1. Samples: 428801246. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:12,963][25689] Avg episode reward: [(0, '-44.105')] [2022-07-09 21:01:13,019][26022] Updated weights on worker 0-0, policy_version 418755 (0.00082) [2022-07-09 21:01:15,258][26022] Updated weights on worker 0-0, policy_version 418765 (0.00079) [2022-07-09 21:01:16,821][26022] Updated weights on worker 0-0, policy_version 418775 (0.00085) [2022-07-09 21:01:18,103][25689] Fps is (10 sec: 5641.3, 60 sec: 5696.5, 300 sec: 5702.0). Total num frames: 428831744. Throughput: 0: 5841.5. Samples: 428835532. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:18,104][25689] Avg episode reward: [(0, '-44.596')] [2022-07-09 21:01:18,767][26022] Updated weights on worker 0-0, policy_version 418785 (0.00065) [2022-07-09 21:01:20,225][26022] Updated weights on worker 0-0, policy_version 418795 (0.00106) [2022-07-09 21:01:22,215][26022] Updated weights on worker 0-0, policy_version 418805 (0.00094) [2022-07-09 21:01:23,135][25689] Fps is (10 sec: 5536.2, 60 sec: 5680.8, 300 sec: 5698.1). Total num frames: 428860416. Throughput: 0: 5841.2. Samples: 428869954. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:23,136][25689] Avg episode reward: [(0, '-44.748')] [2022-07-09 21:01:23,877][26022] Updated weights on worker 0-0, policy_version 418815 (0.00086) [2022-07-09 21:01:25,839][26022] Updated weights on worker 0-0, policy_version 418825 (0.00084) [2022-07-09 21:01:27,762][26022] Updated weights on worker 0-0, policy_version 418835 (0.00093) [2022-07-09 21:01:28,152][25689] Fps is (10 sec: 5808.6, 60 sec: 5696.7, 300 sec: 5704.8). Total num frames: 428890112. Throughput: 0: 5932.5. Samples: 428887042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:28,152][25689] Avg episode reward: [(0, '-44.694')] [2022-07-09 21:01:29,386][26022] Updated weights on worker 0-0, policy_version 418845 (0.00092) [2022-07-09 21:01:31,200][26022] Updated weights on worker 0-0, policy_version 418855 (0.00085) [2022-07-09 21:01:31,419][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:01:31,432][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000418857_428909568.pth [2022-07-09 21:01:31,432][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000416848_426852352.pth [2022-07-09 21:01:33,054][26022] Updated weights on worker 0-0, policy_version 418865 (0.00090) [2022-07-09 21:01:33,164][25689] Fps is (10 sec: 5717.6, 60 sec: 5696.4, 300 sec: 5695.1). Total num frames: 428917760. Throughput: 0: 5932.0. Samples: 428921086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:33,165][25689] Avg episode reward: [(0, '-45.248')] [2022-07-09 21:01:34,747][26022] Updated weights on worker 0-0, policy_version 418875 (0.00090) [2022-07-09 21:01:36,761][26022] Updated weights on worker 0-0, policy_version 418885 (0.00088) [2022-07-09 21:01:38,271][25689] Fps is (10 sec: 5666.8, 60 sec: 5691.8, 300 sec: 5706.9). Total num frames: 428947456. Throughput: 0: 5942.9. Samples: 428955388. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 21:01:38,271][25689] Avg episode reward: [(0, '-45.484')] [2022-07-09 21:01:38,404][26022] Updated weights on worker 0-0, policy_version 418895 (0.00091) [2022-07-09 21:01:40,272][26022] Updated weights on worker 0-0, policy_version 418905 (0.00091) [2022-07-09 21:01:42,123][26022] Updated weights on worker 0-0, policy_version 418915 (0.00084) [2022-07-09 21:01:43,286][25689] Fps is (10 sec: 5665.6, 60 sec: 5647.1, 300 sec: 5699.9). Total num frames: 428975104. Throughput: 0: 5079.5. Samples: 428972310. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:01:43,286][25689] Avg episode reward: [(0, '-46.567')] [2022-07-09 21:01:43,727][26022] Updated weights on worker 0-0, policy_version 418925 (0.00082) [2022-07-09 21:01:45,594][26022] Updated weights on worker 0-0, policy_version 418935 (0.00090) [2022-07-09 21:01:47,432][26022] Updated weights on worker 0-0, policy_version 418945 (0.00085) [2022-07-09 21:01:48,319][25689] Fps is (10 sec: 5605.3, 60 sec: 5664.9, 300 sec: 5703.0). Total num frames: 429003776. Throughput: 0: 5943.1. Samples: 429006896. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:01:48,319][25689] Avg episode reward: [(0, '-46.142')] [2022-07-09 21:01:49,220][26022] Updated weights on worker 0-0, policy_version 418955 (0.00088) [2022-07-09 21:01:51,106][26022] Updated weights on worker 0-0, policy_version 418965 (0.00081) [2022-07-09 21:01:52,607][26022] Updated weights on worker 0-0, policy_version 418975 (0.00096) [2022-07-09 21:01:53,322][25689] Fps is (10 sec: 5815.6, 60 sec: 5686.5, 300 sec: 5702.2). Total num frames: 429033472. Throughput: 0: 5960.4. Samples: 429041236. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:01:53,323][25689] Avg episode reward: [(0, '-45.501')] [2022-07-09 21:01:54,708][26022] Updated weights on worker 0-0, policy_version 418985 (0.00668) [2022-07-09 21:01:56,438][26022] Updated weights on worker 0-0, policy_version 418995 (0.00086) [2022-07-09 21:01:58,331][26022] Updated weights on worker 0-0, policy_version 419005 (0.00095) [2022-07-09 21:01:58,425][25689] Fps is (10 sec: 5674.4, 60 sec: 5650.7, 300 sec: 5700.7). Total num frames: 429061120. Throughput: 0: 5088.7. Samples: 429057944. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:01:58,425][25689] Avg episode reward: [(0, '-45.633')] [2022-07-09 21:02:00,005][26022] Updated weights on worker 0-0, policy_version 419015 (0.00080) [2022-07-09 21:02:01,966][26022] Updated weights on worker 0-0, policy_version 419025 (0.00093) [2022-07-09 21:02:03,495][25689] Fps is (10 sec: 5334.7, 60 sec: 5645.6, 300 sec: 5696.3). Total num frames: 429087744. Throughput: 0: 5874.3. Samples: 429091028. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:03,496][25689] Avg episode reward: [(0, '-45.994')] [2022-07-09 21:02:03,859][26022] Updated weights on worker 0-0, policy_version 419035 (0.00379) [2022-07-09 21:02:05,913][26022] Updated weights on worker 0-0, policy_version 419045 (0.00085) [2022-07-09 21:02:07,352][26022] Updated weights on worker 0-0, policy_version 419055 (0.00077) [2022-07-09 21:02:08,556][25689] Fps is (10 sec: 5457.6, 60 sec: 5642.1, 300 sec: 5700.1). Total num frames: 429116416. Throughput: 0: 5810.6. Samples: 429124490. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:08,557][25689] Avg episode reward: [(0, '-45.656')] [2022-07-09 21:02:09,546][26022] Updated weights on worker 0-0, policy_version 419065 (0.00080) [2022-07-09 21:02:11,099][26022] Updated weights on worker 0-0, policy_version 419075 (0.00095) [2022-07-09 21:02:13,049][26022] Updated weights on worker 0-0, policy_version 419085 (0.00091) [2022-07-09 21:02:13,568][25689] Fps is (10 sec: 5693.3, 60 sec: 5626.4, 300 sec: 5695.5). Total num frames: 429145088. Throughput: 0: 4950.2. Samples: 429141456. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:13,568][25689] Avg episode reward: [(0, '-45.596')] [2022-07-09 21:02:14,656][26022] Updated weights on worker 0-0, policy_version 419095 (0.00090) [2022-07-09 21:02:16,567][26022] Updated weights on worker 0-0, policy_version 419105 (0.00085) [2022-07-09 21:02:18,123][26022] Updated weights on worker 0-0, policy_version 419115 (0.00083) [2022-07-09 21:02:18,698][25689] Fps is (10 sec: 5856.4, 60 sec: 5678.1, 300 sec: 5700.0). Total num frames: 429175808. Throughput: 0: 5827.0. Samples: 429176076. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:18,698][25689] Avg episode reward: [(0, '-47.197')] [2022-07-09 21:02:20,503][26022] Updated weights on worker 0-0, policy_version 419125 (0.00087) [2022-07-09 21:02:21,755][26022] Updated weights on worker 0-0, policy_version 419135 (0.00084) [2022-07-09 21:02:23,779][25689] Fps is (10 sec: 5716.1, 60 sec: 5656.6, 300 sec: 5693.3). Total num frames: 429203456. Throughput: 0: 5894.8. Samples: 429210594. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:23,779][25689] Avg episode reward: [(0, '-46.791')] [2022-07-09 21:02:23,816][26022] Updated weights on worker 0-0, policy_version 419145 (0.00091) [2022-07-09 21:02:25,338][26022] Updated weights on worker 0-0, policy_version 419155 (0.00103) [2022-07-09 21:02:27,331][26022] Updated weights on worker 0-0, policy_version 419165 (0.00093) [2022-07-09 21:02:28,833][25689] Fps is (10 sec: 5658.0, 60 sec: 5653.1, 300 sec: 5695.8). Total num frames: 429233152. Throughput: 0: 5081.0. Samples: 429227514. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:28,833][25689] Avg episode reward: [(0, '-47.080')] [2022-07-09 21:02:29,149][26022] Updated weights on worker 0-0, policy_version 419175 (0.00085) [2022-07-09 21:02:30,914][26022] Updated weights on worker 0-0, policy_version 419185 (0.00089) [2022-07-09 21:02:32,808][26022] Updated weights on worker 0-0, policy_version 419195 (0.00095) [2022-07-09 21:02:33,836][25689] Fps is (10 sec: 5905.3, 60 sec: 5687.8, 300 sec: 5700.1). Total num frames: 429262848. Throughput: 0: 5926.3. Samples: 429261574. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:33,837][25689] Avg episode reward: [(0, '-46.934')] [2022-07-09 21:02:34,607][26022] Updated weights on worker 0-0, policy_version 419205 (0.00084) [2022-07-09 21:02:36,285][26022] Updated weights on worker 0-0, policy_version 419215 (0.00093) [2022-07-09 21:02:38,294][26022] Updated weights on worker 0-0, policy_version 419225 (0.00085) [2022-07-09 21:02:38,887][25689] Fps is (10 sec: 5601.9, 60 sec: 5642.4, 300 sec: 5692.3). Total num frames: 429289472. Throughput: 0: 5928.9. Samples: 429295774. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:38,887][25689] Avg episode reward: [(0, '-46.380')] [2022-07-09 21:02:39,867][26022] Updated weights on worker 0-0, policy_version 419235 (0.00090) [2022-07-09 21:02:42,048][26022] Updated weights on worker 0-0, policy_version 419245 (0.00086) [2022-07-09 21:02:43,383][26022] Updated weights on worker 0-0, policy_version 419255 (0.00084) [2022-07-09 21:02:43,915][25689] Fps is (10 sec: 5486.5, 60 sec: 5658.0, 300 sec: 5686.4). Total num frames: 429318144. Throughput: 0: 5071.8. Samples: 429312720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:43,915][25689] Avg episode reward: [(0, '-45.792')] [2022-07-09 21:02:45,575][26022] Updated weights on worker 0-0, policy_version 419265 (0.00088) [2022-07-09 21:02:47,081][26022] Updated weights on worker 0-0, policy_version 419275 (0.00086) [2022-07-09 21:02:48,930][25689] Fps is (10 sec: 5709.8, 60 sec: 5659.7, 300 sec: 5690.0). Total num frames: 429346816. Throughput: 0: 5936.0. Samples: 429346808. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:48,930][25689] Avg episode reward: [(0, '-45.658')] [2022-07-09 21:02:49,053][26022] Updated weights on worker 0-0, policy_version 419285 (0.00091) [2022-07-09 21:02:50,701][26022] Updated weights on worker 0-0, policy_version 419295 (0.00081) [2022-07-09 21:02:52,582][26022] Updated weights on worker 0-0, policy_version 419305 (0.00089) [2022-07-09 21:02:53,970][25689] Fps is (10 sec: 5703.0, 60 sec: 5639.4, 300 sec: 5690.6). Total num frames: 429375488. Throughput: 0: 5934.3. Samples: 429381052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:53,970][25689] Avg episode reward: [(0, '-45.293')] [2022-07-09 21:02:54,333][26022] Updated weights on worker 0-0, policy_version 419315 (0.00091) [2022-07-09 21:02:56,257][26022] Updated weights on worker 0-0, policy_version 419325 (0.00087) [2022-07-09 21:02:57,790][26022] Updated weights on worker 0-0, policy_version 419335 (0.00090) [2022-07-09 21:02:59,057][25689] Fps is (10 sec: 5763.5, 60 sec: 5674.6, 300 sec: 5682.8). Total num frames: 429405184. Throughput: 0: 5061.1. Samples: 429397858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:02:59,058][25689] Avg episode reward: [(0, '-44.870')] [2022-07-09 21:03:00,046][26022] Updated weights on worker 0-0, policy_version 419345 (0.00083) [2022-07-09 21:03:01,544][26022] Updated weights on worker 0-0, policy_version 419355 (0.00082) [2022-07-09 21:03:03,873][26022] Updated weights on worker 0-0, policy_version 419365 (0.00091) [2022-07-09 21:03:04,064][25689] Fps is (10 sec: 5376.5, 60 sec: 5646.8, 300 sec: 5683.0). Total num frames: 429429760. Throughput: 0: 5828.4. Samples: 429430158. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:04,065][25689] Avg episode reward: [(0, '-44.984')] [2022-07-09 21:03:05,395][26022] Updated weights on worker 0-0, policy_version 419375 (0.00090) [2022-07-09 21:03:07,478][26022] Updated weights on worker 0-0, policy_version 419385 (0.00095) [2022-07-09 21:03:09,076][25689] Fps is (10 sec: 5417.0, 60 sec: 5668.3, 300 sec: 5683.2). Total num frames: 429459456. Throughput: 0: 5826.2. Samples: 429464182. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:09,077][25689] Avg episode reward: [(0, '-45.178')] [2022-07-09 21:03:09,119][26022] Updated weights on worker 0-0, policy_version 419395 (0.00093) [2022-07-09 21:03:11,143][26022] Updated weights on worker 0-0, policy_version 419405 (0.00089) [2022-07-09 21:03:12,718][26022] Updated weights on worker 0-0, policy_version 419415 (0.00085) [2022-07-09 21:03:14,079][25689] Fps is (10 sec: 5726.0, 60 sec: 5652.1, 300 sec: 5677.4). Total num frames: 429487104. Throughput: 0: 4985.0. Samples: 429481296. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:14,079][25689] Avg episode reward: [(0, '-46.350')] [2022-07-09 21:03:14,760][26022] Updated weights on worker 0-0, policy_version 419425 (0.00091) [2022-07-09 21:03:16,257][26022] Updated weights on worker 0-0, policy_version 419435 (0.00083) [2022-07-09 21:03:18,082][26022] Updated weights on worker 0-0, policy_version 419445 (0.00091) [2022-07-09 21:03:19,149][25689] Fps is (10 sec: 5591.0, 60 sec: 5623.9, 300 sec: 5676.3). Total num frames: 429515776. Throughput: 0: 5857.8. Samples: 429515554. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:19,149][25689] Avg episode reward: [(0, '-45.959')] [2022-07-09 21:03:19,930][26022] Updated weights on worker 0-0, policy_version 419455 (0.00090) [2022-07-09 21:03:21,735][26022] Updated weights on worker 0-0, policy_version 419465 (0.00089) [2022-07-09 21:03:23,680][26022] Updated weights on worker 0-0, policy_version 419475 (0.00086) [2022-07-09 21:03:24,243][25689] Fps is (10 sec: 5843.2, 60 sec: 5673.4, 300 sec: 5682.2). Total num frames: 429546496. Throughput: 0: 5923.8. Samples: 429549694. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:24,244][25689] Avg episode reward: [(0, '-45.625')] [2022-07-09 21:03:25,677][26022] Updated weights on worker 0-0, policy_version 419485 (0.00090) [2022-07-09 21:03:27,112][26022] Updated weights on worker 0-0, policy_version 419495 (0.00091) [2022-07-09 21:03:29,265][25689] Fps is (10 sec: 5567.6, 60 sec: 5608.7, 300 sec: 5672.0). Total num frames: 429572096. Throughput: 0: 5910.5. Samples: 429583508. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:29,265][25689] Avg episode reward: [(0, '-45.645')] [2022-07-09 21:03:29,387][26022] Updated weights on worker 0-0, policy_version 419505 (0.00097) [2022-07-09 21:03:30,737][26022] Updated weights on worker 0-0, policy_version 419515 (0.00232) [2022-07-09 21:03:31,679][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:03:31,696][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000419519_429587456.pth [2022-07-09 21:03:31,696][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000417520_427540480.pth [2022-07-09 21:03:32,847][26022] Updated weights on worker 0-0, policy_version 419525 (0.00099) [2022-07-09 21:03:34,267][25689] Fps is (10 sec: 5720.8, 60 sec: 5642.7, 300 sec: 5681.5). Total num frames: 429603840. Throughput: 0: 5906.2. Samples: 429600530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:34,267][25689] Avg episode reward: [(0, '-45.164')] [2022-07-09 21:03:34,270][26022] Updated weights on worker 0-0, policy_version 419535 (0.00099) [2022-07-09 21:03:36,503][26022] Updated weights on worker 0-0, policy_version 419545 (0.00097) [2022-07-09 21:03:38,100][26022] Updated weights on worker 0-0, policy_version 419555 (0.00095) [2022-07-09 21:03:39,335][25689] Fps is (10 sec: 5694.4, 60 sec: 5624.1, 300 sec: 5667.0). Total num frames: 429629440. Throughput: 0: 5889.8. Samples: 429634446. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:39,335][25689] Avg episode reward: [(0, '-44.978')] [2022-07-09 21:03:40,003][26022] Updated weights on worker 0-0, policy_version 419565 (0.00087) [2022-07-09 21:03:41,639][26022] Updated weights on worker 0-0, policy_version 419575 (0.00084) [2022-07-09 21:03:43,698][26022] Updated weights on worker 0-0, policy_version 419585 (0.00088) [2022-07-09 21:03:44,352][25689] Fps is (10 sec: 5584.4, 60 sec: 5659.0, 300 sec: 5677.3). Total num frames: 429660160. Throughput: 0: 5910.9. Samples: 429668556. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:44,353][25689] Avg episode reward: [(0, '-45.272')] [2022-07-09 21:03:45,364][26022] Updated weights on worker 0-0, policy_version 419595 (0.00096) [2022-07-09 21:03:47,103][26022] Updated weights on worker 0-0, policy_version 419605 (0.01244) [2022-07-09 21:03:48,917][26022] Updated weights on worker 0-0, policy_version 419615 (0.00182) [2022-07-09 21:03:49,377][25689] Fps is (10 sec: 5914.4, 60 sec: 5658.1, 300 sec: 5674.5). Total num frames: 429688832. Throughput: 0: 5084.7. Samples: 429685772. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:49,377][25689] Avg episode reward: [(0, '-45.946')] [2022-07-09 21:03:50,876][26022] Updated weights on worker 0-0, policy_version 419625 (0.00091) [2022-07-09 21:03:52,627][26022] Updated weights on worker 0-0, policy_version 419635 (0.00095) [2022-07-09 21:03:54,380][25689] Fps is (10 sec: 5514.0, 60 sec: 5627.6, 300 sec: 5668.8). Total num frames: 429715456. Throughput: 0: 5911.6. Samples: 429719432. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:54,382][25689] Avg episode reward: [(0, '-46.576')] [2022-07-09 21:03:54,624][26022] Updated weights on worker 0-0, policy_version 419645 (0.00096) [2022-07-09 21:03:56,268][26022] Updated weights on worker 0-0, policy_version 419655 (0.00094) [2022-07-09 21:03:58,189][26022] Updated weights on worker 0-0, policy_version 419665 (0.00090) [2022-07-09 21:03:59,456][25689] Fps is (10 sec: 5486.3, 60 sec: 5611.8, 300 sec: 5664.7). Total num frames: 429744128. Throughput: 0: 5896.8. Samples: 429753094. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:03:59,457][25689] Avg episode reward: [(0, '-46.868')] [2022-07-09 21:03:59,896][26022] Updated weights on worker 0-0, policy_version 419675 (0.00091) [2022-07-09 21:04:02,143][26022] Updated weights on worker 0-0, policy_version 419685 (0.00094) [2022-07-09 21:04:03,902][26022] Updated weights on worker 0-0, policy_version 419695 (0.00085) [2022-07-09 21:04:04,553][25689] Fps is (10 sec: 5536.1, 60 sec: 5654.1, 300 sec: 5664.2). Total num frames: 429771776. Throughput: 0: 4925.7. Samples: 429768062. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:04:04,554][25689] Avg episode reward: [(0, '-47.343')] [2022-07-09 21:04:05,873][26022] Updated weights on worker 0-0, policy_version 419705 (0.00066) [2022-07-09 21:04:07,465][26022] Updated weights on worker 0-0, policy_version 419715 (0.00082) [2022-07-09 21:04:09,294][26022] Updated weights on worker 0-0, policy_version 419725 (0.00085) [2022-07-09 21:04:09,586][25689] Fps is (10 sec: 5357.6, 60 sec: 5601.5, 300 sec: 5661.7). Total num frames: 429798400. Throughput: 0: 5776.7. Samples: 429802512. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:04:09,586][25689] Avg episode reward: [(0, '-46.063')] [2022-07-09 21:04:10,948][26022] Updated weights on worker 0-0, policy_version 419735 (0.00084) [2022-07-09 21:04:12,891][26022] Updated weights on worker 0-0, policy_version 419745 (0.00084) [2022-07-09 21:04:14,582][26022] Updated weights on worker 0-0, policy_version 419755 (0.00081) [2022-07-09 21:04:14,589][25689] Fps is (10 sec: 5714.1, 60 sec: 5652.2, 300 sec: 5659.5). Total num frames: 429829120. Throughput: 0: 5809.4. Samples: 429836832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:04:14,590][25689] Avg episode reward: [(0, '-45.953')] [2022-07-09 21:04:16,448][26022] Updated weights on worker 0-0, policy_version 419765 (0.00112) [2022-07-09 21:04:18,271][26022] Updated weights on worker 0-0, policy_version 419775 (0.00091) [2022-07-09 21:04:19,644][25689] Fps is (10 sec: 5904.8, 60 sec: 5653.6, 300 sec: 5662.5). Total num frames: 429857792. Throughput: 0: 4996.7. Samples: 429853968. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:04:19,645][25689] Avg episode reward: [(0, '-46.342')] [2022-07-09 21:04:19,867][26022] Updated weights on worker 0-0, policy_version 419785 (0.00088) [2022-07-09 21:04:21,827][26022] Updated weights on worker 0-0, policy_version 419795 (0.00108) [2022-07-09 21:04:23,501][26022] Updated weights on worker 0-0, policy_version 419805 (0.00091) [2022-07-09 21:04:24,653][25689] Fps is (10 sec: 5596.2, 60 sec: 5610.7, 300 sec: 5659.5). Total num frames: 429885440. Throughput: 0: 6001.2. Samples: 429888682. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:24,654][25689] Avg episode reward: [(0, '-45.950')] [2022-07-09 21:04:25,486][26022] Updated weights on worker 0-0, policy_version 419815 (0.00092) [2022-07-09 21:04:27,065][26022] Updated weights on worker 0-0, policy_version 419825 (0.00086) [2022-07-09 21:04:29,059][26022] Updated weights on worker 0-0, policy_version 419835 (0.00087) [2022-07-09 21:04:29,671][25689] Fps is (10 sec: 5719.1, 60 sec: 5678.9, 300 sec: 5660.5). Total num frames: 429915136. Throughput: 0: 5983.8. Samples: 429922696. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:29,671][25689] Avg episode reward: [(0, '-46.028')] [2022-07-09 21:04:30,730][26022] Updated weights on worker 0-0, policy_version 419845 (0.00089) [2022-07-09 21:04:32,603][26022] Updated weights on worker 0-0, policy_version 419855 (0.00083) [2022-07-09 21:04:34,306][26022] Updated weights on worker 0-0, policy_version 419865 (0.00087) [2022-07-09 21:04:34,674][25689] Fps is (10 sec: 5722.5, 60 sec: 5611.0, 300 sec: 5662.1). Total num frames: 429942784. Throughput: 0: 5121.1. Samples: 429939688. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:34,674][25689] Avg episode reward: [(0, '-45.637')] [2022-07-09 21:04:36,119][26022] Updated weights on worker 0-0, policy_version 419875 (0.00080) [2022-07-09 21:04:38,127][26022] Updated weights on worker 0-0, policy_version 419885 (0.00087) [2022-07-09 21:04:39,723][25689] Fps is (10 sec: 5602.9, 60 sec: 5663.7, 300 sec: 5662.5). Total num frames: 429971456. Throughput: 0: 5966.8. Samples: 429973774. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:39,723][25689] Avg episode reward: [(0, '-46.195')] [2022-07-09 21:04:39,783][26022] Updated weights on worker 0-0, policy_version 419895 (0.00089) [2022-07-09 21:04:41,501][26022] Updated weights on worker 0-0, policy_version 419905 (0.00086) [2022-07-09 21:04:43,503][26022] Updated weights on worker 0-0, policy_version 419915 (0.00090) [2022-07-09 21:04:44,745][25689] Fps is (10 sec: 5795.2, 60 sec: 5646.2, 300 sec: 5663.2). Total num frames: 430001152. Throughput: 0: 5937.8. Samples: 430007988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:44,746][25689] Avg episode reward: [(0, '-46.878')] [2022-07-09 21:04:44,959][26022] Updated weights on worker 0-0, policy_version 419925 (0.00085) [2022-07-09 21:04:47,019][26022] Updated weights on worker 0-0, policy_version 419935 (0.00086) [2022-07-09 21:04:48,679][26022] Updated weights on worker 0-0, policy_version 419945 (0.00078) [2022-07-09 21:04:49,751][25689] Fps is (10 sec: 5615.9, 60 sec: 5614.0, 300 sec: 5657.7). Total num frames: 430027776. Throughput: 0: 5104.3. Samples: 430025194. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:49,761][25689] Avg episode reward: [(0, '-46.108')] [2022-07-09 21:04:50,533][26022] Updated weights on worker 0-0, policy_version 419955 (0.00086) [2022-07-09 21:04:52,323][26022] Updated weights on worker 0-0, policy_version 419965 (0.00084) [2022-07-09 21:04:53,978][26022] Updated weights on worker 0-0, policy_version 419975 (0.00089) [2022-07-09 21:04:54,776][25689] Fps is (10 sec: 5614.9, 60 sec: 5662.9, 300 sec: 5659.1). Total num frames: 430057472. Throughput: 0: 5974.5. Samples: 430059790. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:54,777][25689] Avg episode reward: [(0, '-46.555')] [2022-07-09 21:04:55,943][26022] Updated weights on worker 0-0, policy_version 419985 (0.00095) [2022-07-09 21:04:57,844][26022] Updated weights on worker 0-0, policy_version 419995 (0.00088) [2022-07-09 21:04:59,574][26022] Updated weights on worker 0-0, policy_version 420005 (0.00093) [2022-07-09 21:04:59,815][25689] Fps is (10 sec: 5800.0, 60 sec: 5666.4, 300 sec: 5665.8). Total num frames: 430086144. Throughput: 0: 5967.7. Samples: 430093678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:04:59,817][25689] Avg episode reward: [(0, '-46.607')] [2022-07-09 21:05:01,799][26022] Updated weights on worker 0-0, policy_version 420015 (0.00094) [2022-07-09 21:05:03,384][26022] Updated weights on worker 0-0, policy_version 420025 (0.00096) [2022-07-09 21:05:04,831][25689] Fps is (10 sec: 5397.4, 60 sec: 5640.0, 300 sec: 5655.5). Total num frames: 430111744. Throughput: 0: 5008.2. Samples: 430108582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:04,832][25689] Avg episode reward: [(0, '-46.303')] [2022-07-09 21:05:05,624][26022] Updated weights on worker 0-0, policy_version 420035 (0.00095) [2022-07-09 21:05:07,058][26022] Updated weights on worker 0-0, policy_version 420045 (0.00088) [2022-07-09 21:05:09,048][26022] Updated weights on worker 0-0, policy_version 420055 (0.00085) [2022-07-09 21:05:09,835][25689] Fps is (10 sec: 5620.4, 60 sec: 5710.6, 300 sec: 5662.6). Total num frames: 430142464. Throughput: 0: 5850.5. Samples: 430142696. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:09,837][25689] Avg episode reward: [(0, '-46.241')] [2022-07-09 21:05:10,880][26022] Updated weights on worker 0-0, policy_version 420065 (0.00087) [2022-07-09 21:05:12,483][26022] Updated weights on worker 0-0, policy_version 420075 (0.00093) [2022-07-09 21:05:14,362][26022] Updated weights on worker 0-0, policy_version 420085 (0.00085) [2022-07-09 21:05:14,847][25689] Fps is (10 sec: 5827.4, 60 sec: 5658.8, 300 sec: 5662.0). Total num frames: 430170112. Throughput: 0: 5855.2. Samples: 430177312. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:14,848][25689] Avg episode reward: [(0, '-46.216')] [2022-07-09 21:05:15,953][26022] Updated weights on worker 0-0, policy_version 420095 (0.00083) [2022-07-09 21:05:17,791][26022] Updated weights on worker 0-0, policy_version 420105 (0.00085) [2022-07-09 21:05:19,643][26022] Updated weights on worker 0-0, policy_version 420115 (0.00101) [2022-07-09 21:05:19,899][25689] Fps is (10 sec: 5596.6, 60 sec: 5659.2, 300 sec: 5658.5). Total num frames: 430198784. Throughput: 0: 5029.3. Samples: 430194686. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:19,899][25689] Avg episode reward: [(0, '-46.669')] [2022-07-09 21:05:21,412][26022] Updated weights on worker 0-0, policy_version 420125 (0.00092) [2022-07-09 21:05:23,299][26022] Updated weights on worker 0-0, policy_version 420135 (0.00084) [2022-07-09 21:05:24,920][25689] Fps is (10 sec: 5693.1, 60 sec: 5675.0, 300 sec: 5658.2). Total num frames: 430227456. Throughput: 0: 5997.9. Samples: 430229072. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:24,920][25689] Avg episode reward: [(0, '-46.388')] [2022-07-09 21:05:24,927][26022] Updated weights on worker 0-0, policy_version 420145 (0.00087) [2022-07-09 21:05:26,889][26022] Updated weights on worker 0-0, policy_version 420155 (0.00088) [2022-07-09 21:05:28,627][26022] Updated weights on worker 0-0, policy_version 420165 (0.00098) [2022-07-09 21:05:29,961][25689] Fps is (10 sec: 5596.8, 60 sec: 5638.8, 300 sec: 5657.6). Total num frames: 430255104. Throughput: 0: 5979.1. Samples: 430263032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:29,962][25689] Avg episode reward: [(0, '-46.743')] [2022-07-09 21:05:30,514][26022] Updated weights on worker 0-0, policy_version 420175 (0.00091) [2022-07-09 21:05:31,808][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:05:31,827][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000420183_430267392.pth [2022-07-09 21:05:31,828][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000418190_428226560.pth [2022-07-09 21:05:32,188][26022] Updated weights on worker 0-0, policy_version 420185 (0.00087) [2022-07-09 21:05:33,980][26022] Updated weights on worker 0-0, policy_version 420195 (0.00087) [2022-07-09 21:05:34,977][25689] Fps is (10 sec: 5701.5, 60 sec: 5671.6, 300 sec: 5658.4). Total num frames: 430284800. Throughput: 0: 5109.0. Samples: 430280156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:34,978][25689] Avg episode reward: [(0, '-46.764')] [2022-07-09 21:05:35,794][26022] Updated weights on worker 0-0, policy_version 420205 (0.00084) [2022-07-09 21:05:37,608][26022] Updated weights on worker 0-0, policy_version 420215 (0.00094) [2022-07-09 21:05:39,375][26022] Updated weights on worker 0-0, policy_version 420225 (0.00090) [2022-07-09 21:05:40,044][25689] Fps is (10 sec: 5890.7, 60 sec: 5686.9, 300 sec: 5655.1). Total num frames: 430314496. Throughput: 0: 5948.8. Samples: 430314526. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:40,044][25689] Avg episode reward: [(0, '-46.169')] [2022-07-09 21:05:41,218][26022] Updated weights on worker 0-0, policy_version 420235 (0.00114) [2022-07-09 21:05:42,907][26022] Updated weights on worker 0-0, policy_version 420245 (0.00087) [2022-07-09 21:05:44,871][26022] Updated weights on worker 0-0, policy_version 420255 (0.00096) [2022-07-09 21:05:45,063][25689] Fps is (10 sec: 5685.5, 60 sec: 5653.2, 300 sec: 5655.6). Total num frames: 430342144. Throughput: 0: 5948.4. Samples: 430348894. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:45,064][25689] Avg episode reward: [(0, '-46.123')] [2022-07-09 21:05:46,495][26022] Updated weights on worker 0-0, policy_version 420265 (0.00087) [2022-07-09 21:05:48,407][26022] Updated weights on worker 0-0, policy_version 420275 (0.00084) [2022-07-09 21:05:50,074][25689] Fps is (10 sec: 5614.9, 60 sec: 5686.7, 300 sec: 5656.4). Total num frames: 430370816. Throughput: 0: 5122.2. Samples: 430366054. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:50,075][25689] Avg episode reward: [(0, '-45.474')] [2022-07-09 21:05:50,091][26022] Updated weights on worker 0-0, policy_version 420285 (0.00095) [2022-07-09 21:05:52,037][26022] Updated weights on worker 0-0, policy_version 420295 (0.00084) [2022-07-09 21:05:53,741][26022] Updated weights on worker 0-0, policy_version 420305 (0.00087) [2022-07-09 21:05:55,107][25689] Fps is (10 sec: 5607.8, 60 sec: 5652.0, 300 sec: 5650.5). Total num frames: 430398464. Throughput: 0: 5969.1. Samples: 430400310. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:05:55,107][25689] Avg episode reward: [(0, '-45.320')] [2022-07-09 21:05:55,526][26022] Updated weights on worker 0-0, policy_version 420315 (0.00094) [2022-07-09 21:05:57,515][26022] Updated weights on worker 0-0, policy_version 420325 (0.00099) [2022-07-09 21:05:59,067][26022] Updated weights on worker 0-0, policy_version 420335 (0.00087) [2022-07-09 21:06:00,151][25689] Fps is (10 sec: 5589.3, 60 sec: 5651.6, 300 sec: 5656.8). Total num frames: 430427136. Throughput: 0: 5949.4. Samples: 430434150. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:00,151][25689] Avg episode reward: [(0, '-45.262')] [2022-07-09 21:06:01,205][26022] Updated weights on worker 0-0, policy_version 420345 (0.00139) [2022-07-09 21:06:03,216][26022] Updated weights on worker 0-0, policy_version 420355 (0.00087) [2022-07-09 21:06:05,035][26022] Updated weights on worker 0-0, policy_version 420365 (0.00113) [2022-07-09 21:06:05,166][25689] Fps is (10 sec: 5598.9, 60 sec: 5685.6, 300 sec: 5653.5). Total num frames: 430454784. Throughput: 0: 4992.9. Samples: 430449262. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:05,167][25689] Avg episode reward: [(0, '-45.818')] [2022-07-09 21:06:06,956][26022] Updated weights on worker 0-0, policy_version 420375 (0.00093) [2022-07-09 21:06:08,459][26022] Updated weights on worker 0-0, policy_version 420385 (0.00086) [2022-07-09 21:06:10,191][25689] Fps is (10 sec: 5507.5, 60 sec: 5632.8, 300 sec: 5646.6). Total num frames: 430482432. Throughput: 0: 5821.2. Samples: 430483156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:10,192][25689] Avg episode reward: [(0, '-45.738')] [2022-07-09 21:06:10,532][26022] Updated weights on worker 0-0, policy_version 420395 (0.00091) [2022-07-09 21:06:12,163][26022] Updated weights on worker 0-0, policy_version 420405 (0.00092) [2022-07-09 21:06:14,011][26022] Updated weights on worker 0-0, policy_version 420415 (0.00095) [2022-07-09 21:06:15,225][25689] Fps is (10 sec: 5700.7, 60 sec: 5664.6, 300 sec: 5655.5). Total num frames: 430512128. Throughput: 0: 5831.6. Samples: 430517630. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:15,225][25689] Avg episode reward: [(0, '-45.471')] [2022-07-09 21:06:15,946][26022] Updated weights on worker 0-0, policy_version 420425 (0.00084) [2022-07-09 21:06:17,411][26022] Updated weights on worker 0-0, policy_version 420435 (0.00093) [2022-07-09 21:06:19,338][26022] Updated weights on worker 0-0, policy_version 420445 (0.00081) [2022-07-09 21:06:20,266][25689] Fps is (10 sec: 5895.1, 60 sec: 5682.6, 300 sec: 5658.8). Total num frames: 430541824. Throughput: 0: 5007.9. Samples: 430534884. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:20,266][25689] Avg episode reward: [(0, '-45.942')] [2022-07-09 21:06:21,136][26022] Updated weights on worker 0-0, policy_version 420455 (0.00370) [2022-07-09 21:06:22,819][26022] Updated weights on worker 0-0, policy_version 420465 (0.00082) [2022-07-09 21:06:24,802][26022] Updated weights on worker 0-0, policy_version 420475 (0.00088) [2022-07-09 21:06:25,291][25689] Fps is (10 sec: 5595.2, 60 sec: 5648.3, 300 sec: 5648.3). Total num frames: 430568448. Throughput: 0: 5961.8. Samples: 430569242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:25,291][25689] Avg episode reward: [(0, '-46.452')] [2022-07-09 21:06:26,619][26022] Updated weights on worker 0-0, policy_version 420485 (0.00082) [2022-07-09 21:06:28,246][26022] Updated weights on worker 0-0, policy_version 420495 (0.00083) [2022-07-09 21:06:30,219][26022] Updated weights on worker 0-0, policy_version 420505 (0.00103) [2022-07-09 21:06:30,322][25689] Fps is (10 sec: 5498.6, 60 sec: 5666.2, 300 sec: 5651.4). Total num frames: 430597120. Throughput: 0: 5960.7. Samples: 430603152. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:30,323][25689] Avg episode reward: [(0, '-45.696')] [2022-07-09 21:06:31,895][26022] Updated weights on worker 0-0, policy_version 420515 (0.00094) [2022-07-09 21:06:33,683][26022] Updated weights on worker 0-0, policy_version 420525 (0.00091) [2022-07-09 21:06:35,326][25689] Fps is (10 sec: 5714.2, 60 sec: 5650.4, 300 sec: 5649.9). Total num frames: 430625792. Throughput: 0: 5110.3. Samples: 430620354. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:35,328][25689] Avg episode reward: [(0, '-45.620')] [2022-07-09 21:06:35,637][26022] Updated weights on worker 0-0, policy_version 420535 (0.00094) [2022-07-09 21:06:37,290][26022] Updated weights on worker 0-0, policy_version 420545 (0.00086) [2022-07-09 21:06:39,379][26022] Updated weights on worker 0-0, policy_version 420555 (0.00494) [2022-07-09 21:06:40,359][25689] Fps is (10 sec: 5815.5, 60 sec: 5653.5, 300 sec: 5656.5). Total num frames: 430655488. Throughput: 0: 5940.2. Samples: 430654242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:40,359][25689] Avg episode reward: [(0, '-45.941')] [2022-07-09 21:06:41,119][26022] Updated weights on worker 0-0, policy_version 420565 (0.00084) [2022-07-09 21:06:42,881][26022] Updated weights on worker 0-0, policy_version 420575 (0.00088) [2022-07-09 21:06:44,646][26022] Updated weights on worker 0-0, policy_version 420585 (0.00090) [2022-07-09 21:06:45,383][25689] Fps is (10 sec: 5702.1, 60 sec: 5653.1, 300 sec: 5653.2). Total num frames: 430683136. Throughput: 0: 5929.2. Samples: 430688372. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:45,383][25689] Avg episode reward: [(0, '-45.971')] [2022-07-09 21:06:46,420][26022] Updated weights on worker 0-0, policy_version 420595 (0.00090) [2022-07-09 21:06:48,242][26022] Updated weights on worker 0-0, policy_version 420605 (0.01135) [2022-07-09 21:06:49,902][26022] Updated weights on worker 0-0, policy_version 420615 (0.00080) [2022-07-09 21:06:50,392][25689] Fps is (10 sec: 5613.5, 60 sec: 5653.3, 300 sec: 5649.6). Total num frames: 430711808. Throughput: 0: 5101.5. Samples: 430705538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:50,392][25689] Avg episode reward: [(0, '-45.267')] [2022-07-09 21:06:51,768][26022] Updated weights on worker 0-0, policy_version 420625 (0.00087) [2022-07-09 21:06:53,526][26022] Updated weights on worker 0-0, policy_version 420635 (0.00085) [2022-07-09 21:06:55,263][26022] Updated weights on worker 0-0, policy_version 420645 (0.00092) [2022-07-09 21:06:55,417][25689] Fps is (10 sec: 5714.5, 60 sec: 5670.9, 300 sec: 5654.5). Total num frames: 430740480. Throughput: 0: 5957.9. Samples: 430740056. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:06:55,419][25689] Avg episode reward: [(0, '-45.089')] [2022-07-09 21:06:57,283][26022] Updated weights on worker 0-0, policy_version 420655 (0.00087) [2022-07-09 21:06:58,993][26022] Updated weights on worker 0-0, policy_version 420665 (0.00084) [2022-07-09 21:07:00,467][25689] Fps is (10 sec: 5590.0, 60 sec: 5653.4, 300 sec: 5658.4). Total num frames: 430768128. Throughput: 0: 5976.9. Samples: 430774426. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:07:00,467][25689] Avg episode reward: [(0, '-45.483')] [2022-07-09 21:07:00,639][26022] Updated weights on worker 0-0, policy_version 420675 (0.00089) [2022-07-09 21:07:03,002][26022] Updated weights on worker 0-0, policy_version 420685 (0.00083) [2022-07-09 21:07:04,554][26022] Updated weights on worker 0-0, policy_version 420695 (0.00092) [2022-07-09 21:07:05,497][25689] Fps is (10 sec: 5587.7, 60 sec: 5669.0, 300 sec: 5659.0). Total num frames: 430796800. Throughput: 0: 5028.5. Samples: 430789516. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-09 21:07:05,497][25689] Avg episode reward: [(0, '-45.174')] [2022-07-09 21:07:06,792][26022] Updated weights on worker 0-0, policy_version 420705 (0.00094) [2022-07-09 21:07:08,110][26022] Updated weights on worker 0-0, policy_version 420715 (0.00088) [2022-07-09 21:07:10,233][26022] Updated weights on worker 0-0, policy_version 420725 (0.00090) [2022-07-09 21:07:10,549][25689] Fps is (10 sec: 5586.1, 60 sec: 5666.4, 300 sec: 5654.8). Total num frames: 430824448. Throughput: 0: 5860.5. Samples: 430823670. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:10,550][25689] Avg episode reward: [(0, '-45.083')] [2022-07-09 21:07:11,831][26022] Updated weights on worker 0-0, policy_version 420735 (0.00086) [2022-07-09 21:07:13,811][26022] Updated weights on worker 0-0, policy_version 420745 (0.00096) [2022-07-09 21:07:15,403][26022] Updated weights on worker 0-0, policy_version 420755 (0.00082) [2022-07-09 21:07:15,556][25689] Fps is (10 sec: 5598.8, 60 sec: 5652.0, 300 sec: 5650.2). Total num frames: 430853120. Throughput: 0: 5852.0. Samples: 430857906. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:15,556][25689] Avg episode reward: [(0, '-46.073')] [2022-07-09 21:07:17,360][26022] Updated weights on worker 0-0, policy_version 420765 (0.00090) [2022-07-09 21:07:19,002][26022] Updated weights on worker 0-0, policy_version 420775 (0.00081) [2022-07-09 21:07:20,632][25689] Fps is (10 sec: 5687.1, 60 sec: 5631.7, 300 sec: 5653.8). Total num frames: 430881792. Throughput: 0: 4987.3. Samples: 430874996. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:20,633][25689] Avg episode reward: [(0, '-45.535')] [2022-07-09 21:07:20,879][26022] Updated weights on worker 0-0, policy_version 420785 (0.00092) [2022-07-09 21:07:22,440][26022] Updated weights on worker 0-0, policy_version 420795 (0.00094) [2022-07-09 21:07:24,507][26022] Updated weights on worker 0-0, policy_version 420805 (0.00085) [2022-07-09 21:07:25,682][25689] Fps is (10 sec: 5764.3, 60 sec: 5680.3, 300 sec: 5653.8). Total num frames: 430911488. Throughput: 0: 5942.4. Samples: 430909464. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:25,683][25689] Avg episode reward: [(0, '-45.343')] [2022-07-09 21:07:26,327][26022] Updated weights on worker 0-0, policy_version 420815 (0.00089) [2022-07-09 21:07:27,936][26022] Updated weights on worker 0-0, policy_version 420825 (0.00092) [2022-07-09 21:07:29,918][26022] Updated weights on worker 0-0, policy_version 420835 (0.00083) [2022-07-09 21:07:30,758][25689] Fps is (10 sec: 5663.3, 60 sec: 5659.2, 300 sec: 5645.6). Total num frames: 430939136. Throughput: 0: 5921.0. Samples: 430943326. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:30,758][25689] Avg episode reward: [(0, '-45.796')] [2022-07-09 21:07:31,576][26022] Updated weights on worker 0-0, policy_version 420845 (0.00089) [2022-07-09 21:07:32,108][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:07:32,119][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000420847_430947328.pth [2022-07-09 21:07:32,119][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000418857_428909568.pth [2022-07-09 21:07:33,375][26022] Updated weights on worker 0-0, policy_version 420855 (0.00547) [2022-07-09 21:07:35,310][26022] Updated weights on worker 0-0, policy_version 420865 (0.00091) [2022-07-09 21:07:35,839][25689] Fps is (10 sec: 5544.7, 60 sec: 5651.9, 300 sec: 5651.9). Total num frames: 430967808. Throughput: 0: 5908.3. Samples: 430977746. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:35,841][25689] Avg episode reward: [(0, '-45.731')] [2022-07-09 21:07:37,000][26022] Updated weights on worker 0-0, policy_version 420875 (0.00082) [2022-07-09 21:07:38,885][26022] Updated weights on worker 0-0, policy_version 420885 (0.00088) [2022-07-09 21:07:40,784][26022] Updated weights on worker 0-0, policy_version 420895 (0.00082) [2022-07-09 21:07:40,987][25689] Fps is (10 sec: 5606.2, 60 sec: 5624.3, 300 sec: 5649.6). Total num frames: 430996480. Throughput: 0: 5882.9. Samples: 430994738. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:40,991][25689] Avg episode reward: [(0, '-46.126')] [2022-07-09 21:07:42,366][26022] Updated weights on worker 0-0, policy_version 420905 (0.00094) [2022-07-09 21:07:44,487][26022] Updated weights on worker 0-0, policy_version 420915 (0.00086) [2022-07-09 21:07:45,847][26022] Updated weights on worker 0-0, policy_version 420925 (0.00093) [2022-07-09 21:07:46,050][25689] Fps is (10 sec: 5916.7, 60 sec: 5688.2, 300 sec: 5659.0). Total num frames: 431028224. Throughput: 0: 5852.9. Samples: 431028680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:46,051][25689] Avg episode reward: [(0, '-47.238')] [2022-07-09 21:07:48,257][26022] Updated weights on worker 0-0, policy_version 420935 (0.00065) [2022-07-09 21:07:49,510][26022] Updated weights on worker 0-0, policy_version 420945 (0.00090) [2022-07-09 21:07:51,097][25689] Fps is (10 sec: 5671.4, 60 sec: 5634.0, 300 sec: 5648.5). Total num frames: 431053824. Throughput: 0: 5871.1. Samples: 431062742. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:51,099][25689] Avg episode reward: [(0, '-47.850')] [2022-07-09 21:07:51,581][26022] Updated weights on worker 0-0, policy_version 420955 (0.00086) [2022-07-09 21:07:53,165][26022] Updated weights on worker 0-0, policy_version 420965 (0.00087) [2022-07-09 21:07:55,129][26022] Updated weights on worker 0-0, policy_version 420975 (0.00457) [2022-07-09 21:07:56,136][25689] Fps is (10 sec: 5584.1, 60 sec: 5666.5, 300 sec: 5652.9). Total num frames: 431084544. Throughput: 0: 5037.2. Samples: 431079986. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:07:56,138][25689] Avg episode reward: [(0, '-47.668')] [2022-07-09 21:07:57,096][26022] Updated weights on worker 0-0, policy_version 420985 (0.00093) [2022-07-09 21:07:58,597][26022] Updated weights on worker 0-0, policy_version 420995 (0.00088) [2022-07-09 21:08:00,673][26022] Updated weights on worker 0-0, policy_version 421005 (0.00088) [2022-07-09 21:08:01,205][25689] Fps is (10 sec: 5875.7, 60 sec: 5681.5, 300 sec: 5665.5). Total num frames: 431113216. Throughput: 0: 5902.3. Samples: 431114076. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:01,206][25689] Avg episode reward: [(0, '-47.998')] [2022-07-09 21:08:02,585][26022] Updated weights on worker 0-0, policy_version 421015 (0.00093) [2022-07-09 21:08:04,451][26022] Updated weights on worker 0-0, policy_version 421025 (0.00092) [2022-07-09 21:08:06,217][25689] Fps is (10 sec: 5383.7, 60 sec: 5632.7, 300 sec: 5651.7). Total num frames: 431138816. Throughput: 0: 5819.5. Samples: 431146040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:06,217][25689] Avg episode reward: [(0, '-48.191')] [2022-07-09 21:08:06,440][26022] Updated weights on worker 0-0, policy_version 421035 (0.00094) [2022-07-09 21:08:07,939][26022] Updated weights on worker 0-0, policy_version 421045 (0.00086) [2022-07-09 21:08:09,896][26022] Updated weights on worker 0-0, policy_version 421055 (0.00105) [2022-07-09 21:08:11,272][25689] Fps is (10 sec: 5492.8, 60 sec: 5666.1, 300 sec: 5657.6). Total num frames: 431168512. Throughput: 0: 4972.5. Samples: 431163062. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:11,273][25689] Avg episode reward: [(0, '-47.770')] [2022-07-09 21:08:11,743][26022] Updated weights on worker 0-0, policy_version 421065 (0.00082) [2022-07-09 21:08:13,414][26022] Updated weights on worker 0-0, policy_version 421075 (0.00090) [2022-07-09 21:08:15,407][26022] Updated weights on worker 0-0, policy_version 421085 (0.00084) [2022-07-09 21:08:16,315][25689] Fps is (10 sec: 5678.9, 60 sec: 5645.9, 300 sec: 5654.7). Total num frames: 431196160. Throughput: 0: 5827.4. Samples: 431197574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:16,315][25689] Avg episode reward: [(0, '-48.393')] [2022-07-09 21:08:16,864][26022] Updated weights on worker 0-0, policy_version 421095 (0.00087) [2022-07-09 21:08:18,827][26022] Updated weights on worker 0-0, policy_version 421105 (0.00069) [2022-07-09 21:08:20,539][26022] Updated weights on worker 0-0, policy_version 421115 (0.00090) [2022-07-09 21:08:21,366][25689] Fps is (10 sec: 5580.1, 60 sec: 5648.3, 300 sec: 5648.6). Total num frames: 431224832. Throughput: 0: 5860.3. Samples: 431232220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:21,366][25689] Avg episode reward: [(0, '-48.484')] [2022-07-09 21:08:22,452][26022] Updated weights on worker 0-0, policy_version 421125 (0.00093) [2022-07-09 21:08:24,147][26022] Updated weights on worker 0-0, policy_version 421135 (0.00082) [2022-07-09 21:08:25,896][26022] Updated weights on worker 0-0, policy_version 421145 (0.00095) [2022-07-09 21:08:26,383][25689] Fps is (10 sec: 5797.4, 60 sec: 5651.3, 300 sec: 5662.5). Total num frames: 431254528. Throughput: 0: 5135.4. Samples: 431249600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:26,383][25689] Avg episode reward: [(0, '-47.766')] [2022-07-09 21:08:27,818][26022] Updated weights on worker 0-0, policy_version 421155 (0.00091) [2022-07-09 21:08:29,536][26022] Updated weights on worker 0-0, policy_version 421165 (0.00087) [2022-07-09 21:08:31,415][25689] Fps is (10 sec: 5706.3, 60 sec: 5655.4, 300 sec: 5648.2). Total num frames: 431282176. Throughput: 0: 5979.5. Samples: 431283502. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:31,415][25689] Avg episode reward: [(0, '-47.868')] [2022-07-09 21:08:31,466][26022] Updated weights on worker 0-0, policy_version 421175 (0.00078) [2022-07-09 21:08:33,117][26022] Updated weights on worker 0-0, policy_version 421185 (0.00090) [2022-07-09 21:08:34,912][26022] Updated weights on worker 0-0, policy_version 421195 (0.00102) [2022-07-09 21:08:36,424][25689] Fps is (10 sec: 5710.6, 60 sec: 5679.0, 300 sec: 5663.0). Total num frames: 431311872. Throughput: 0: 5969.6. Samples: 431317622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:36,426][25689] Avg episode reward: [(0, '-47.295')] [2022-07-09 21:08:36,646][26022] Updated weights on worker 0-0, policy_version 421205 (0.00087) [2022-07-09 21:08:38,636][26022] Updated weights on worker 0-0, policy_version 421215 (0.00091) [2022-07-09 21:08:40,588][26022] Updated weights on worker 0-0, policy_version 421225 (0.00092) [2022-07-09 21:08:41,466][25689] Fps is (10 sec: 5704.9, 60 sec: 5672.0, 300 sec: 5652.2). Total num frames: 431339520. Throughput: 0: 5098.8. Samples: 431334710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:41,468][25689] Avg episode reward: [(0, '-46.561')] [2022-07-09 21:08:42,119][26022] Updated weights on worker 0-0, policy_version 421235 (0.00084) [2022-07-09 21:08:44,057][26022] Updated weights on worker 0-0, policy_version 421245 (0.00087) [2022-07-09 21:08:45,606][26022] Updated weights on worker 0-0, policy_version 421255 (0.00074) [2022-07-09 21:08:46,482][25689] Fps is (10 sec: 5498.0, 60 sec: 5608.7, 300 sec: 5649.0). Total num frames: 431367168. Throughput: 0: 5951.9. Samples: 431369228. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:46,486][25689] Avg episode reward: [(0, '-46.274')] [2022-07-09 21:08:47,559][26022] Updated weights on worker 0-0, policy_version 421265 (0.00085) [2022-07-09 21:08:49,638][26022] Updated weights on worker 0-0, policy_version 421275 (0.00086) [2022-07-09 21:08:50,890][26022] Updated weights on worker 0-0, policy_version 421285 (0.00108) [2022-07-09 21:08:51,508][25689] Fps is (10 sec: 5812.3, 60 sec: 5695.4, 300 sec: 5662.3). Total num frames: 431397888. Throughput: 0: 5966.8. Samples: 431403398. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:51,509][25689] Avg episode reward: [(0, '-46.012')] [2022-07-09 21:08:53,229][26022] Updated weights on worker 0-0, policy_version 421295 (0.00090) [2022-07-09 21:08:54,772][26022] Updated weights on worker 0-0, policy_version 421305 (0.00092) [2022-07-09 21:08:56,515][25689] Fps is (10 sec: 5817.3, 60 sec: 5647.5, 300 sec: 5660.2). Total num frames: 431425536. Throughput: 0: 5106.4. Samples: 431420216. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:08:56,517][25689] Avg episode reward: [(0, '-45.931')] [2022-07-09 21:08:56,523][26022] Updated weights on worker 0-0, policy_version 421315 (0.00092) [2022-07-09 21:08:58,703][26022] Updated weights on worker 0-0, policy_version 421325 (0.00091) [2022-07-09 21:09:00,205][26022] Updated weights on worker 0-0, policy_version 421335 (0.00085) [2022-07-09 21:09:01,589][25689] Fps is (10 sec: 5282.4, 60 sec: 5596.3, 300 sec: 5653.7). Total num frames: 431451136. Throughput: 0: 5915.9. Samples: 431453752. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:01,589][25689] Avg episode reward: [(0, '-45.837')] [2022-07-09 21:09:02,662][26022] Updated weights on worker 0-0, policy_version 421345 (0.00092) [2022-07-09 21:09:04,387][26022] Updated weights on worker 0-0, policy_version 421355 (0.00096) [2022-07-09 21:09:06,156][26022] Updated weights on worker 0-0, policy_version 421365 (0.00087) [2022-07-09 21:09:06,607][25689] Fps is (10 sec: 5378.0, 60 sec: 5646.5, 300 sec: 5660.9). Total num frames: 431479808. Throughput: 0: 5760.8. Samples: 431485164. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:06,607][25689] Avg episode reward: [(0, '-46.051')] [2022-07-09 21:09:08,053][26022] Updated weights on worker 0-0, policy_version 421375 (0.00087) [2022-07-09 21:09:09,825][26022] Updated weights on worker 0-0, policy_version 421385 (0.00090) [2022-07-09 21:09:11,562][26022] Updated weights on worker 0-0, policy_version 421395 (0.00084) [2022-07-09 21:09:11,616][25689] Fps is (10 sec: 5718.8, 60 sec: 5633.9, 300 sec: 5653.9). Total num frames: 431508480. Throughput: 0: 4915.7. Samples: 431502240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:11,618][25689] Avg episode reward: [(0, '-44.934')] [2022-07-09 21:09:13,443][26022] Updated weights on worker 0-0, policy_version 421405 (0.00084) [2022-07-09 21:09:15,118][26022] Updated weights on worker 0-0, policy_version 421415 (0.00089) [2022-07-09 21:09:16,621][25689] Fps is (10 sec: 5521.9, 60 sec: 5620.4, 300 sec: 5648.0). Total num frames: 431535104. Throughput: 0: 5783.6. Samples: 431536498. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:16,623][25689] Avg episode reward: [(0, '-45.266')] [2022-07-09 21:09:17,157][26022] Updated weights on worker 0-0, policy_version 421425 (0.00093) [2022-07-09 21:09:18,660][26022] Updated weights on worker 0-0, policy_version 421435 (0.00084) [2022-07-09 21:09:20,799][26022] Updated weights on worker 0-0, policy_version 421445 (0.00084) [2022-07-09 21:09:21,674][25689] Fps is (10 sec: 5701.1, 60 sec: 5654.1, 300 sec: 5657.4). Total num frames: 431565824. Throughput: 0: 5824.5. Samples: 431570742. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:21,676][25689] Avg episode reward: [(0, '-45.835')] [2022-07-09 21:09:21,999][26022] Updated weights on worker 0-0, policy_version 421455 (0.00083) [2022-07-09 21:09:24,240][26022] Updated weights on worker 0-0, policy_version 421465 (0.00084) [2022-07-09 21:09:25,753][26022] Updated weights on worker 0-0, policy_version 421475 (0.00098) [2022-07-09 21:09:26,719][25689] Fps is (10 sec: 5780.0, 60 sec: 5617.6, 300 sec: 5650.0). Total num frames: 431593472. Throughput: 0: 5124.5. Samples: 431588230. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:26,721][25689] Avg episode reward: [(0, '-46.029')] [2022-07-09 21:09:27,663][26022] Updated weights on worker 0-0, policy_version 421485 (0.00093) [2022-07-09 21:09:29,724][26022] Updated weights on worker 0-0, policy_version 421495 (0.00084) [2022-07-09 21:09:31,147][26022] Updated weights on worker 0-0, policy_version 421505 (0.00081) [2022-07-09 21:09:31,751][25689] Fps is (10 sec: 5792.5, 60 sec: 5668.5, 300 sec: 5659.8). Total num frames: 431624192. Throughput: 0: 5978.8. Samples: 431622624. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:31,752][25689] Avg episode reward: [(0, '-45.779')] [2022-07-09 21:09:32,215][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:09:32,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000421510_431626240.pth [2022-07-09 21:09:32,226][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000419519_429587456.pth [2022-07-09 21:09:33,341][26022] Updated weights on worker 0-0, policy_version 421515 (0.00086) [2022-07-09 21:09:34,823][26022] Updated weights on worker 0-0, policy_version 421525 (0.00081) [2022-07-09 21:09:36,753][25689] Fps is (10 sec: 5715.3, 60 sec: 5618.3, 300 sec: 5653.8). Total num frames: 431650816. Throughput: 0: 5982.2. Samples: 431656930. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:36,753][25689] Avg episode reward: [(0, '-46.362')] [2022-07-09 21:09:36,781][26022] Updated weights on worker 0-0, policy_version 421535 (0.00090) [2022-07-09 21:09:38,569][26022] Updated weights on worker 0-0, policy_version 421545 (0.00080) [2022-07-09 21:09:40,276][26022] Updated weights on worker 0-0, policy_version 421555 (0.00092) [2022-07-09 21:09:41,850][25689] Fps is (10 sec: 5576.7, 60 sec: 5647.1, 300 sec: 5652.4). Total num frames: 431680512. Throughput: 0: 5122.6. Samples: 431674094. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:41,851][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 21:09:42,077][26022] Updated weights on worker 0-0, policy_version 421565 (0.00087) [2022-07-09 21:09:43,787][26022] Updated weights on worker 0-0, policy_version 421575 (0.00085) [2022-07-09 21:09:45,594][26022] Updated weights on worker 0-0, policy_version 421585 (0.00086) [2022-07-09 21:09:46,914][25689] Fps is (10 sec: 5844.8, 60 sec: 5676.5, 300 sec: 5661.7). Total num frames: 431710208. Throughput: 0: 5964.3. Samples: 431708678. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-09 21:09:46,915][25689] Avg episode reward: [(0, '-47.145')] [2022-07-09 21:09:47,308][26022] Updated weights on worker 0-0, policy_version 421595 (0.00090) [2022-07-09 21:09:49,174][26022] Updated weights on worker 0-0, policy_version 421605 (0.00089) [2022-07-09 21:09:50,955][26022] Updated weights on worker 0-0, policy_version 421615 (0.00089) [2022-07-09 21:09:51,917][25689] Fps is (10 sec: 5696.2, 60 sec: 5627.8, 300 sec: 5655.2). Total num frames: 431737856. Throughput: 0: 5956.2. Samples: 431742738. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:09:51,918][25689] Avg episode reward: [(0, '-46.910')] [2022-07-09 21:09:52,856][26022] Updated weights on worker 0-0, policy_version 421625 (0.00093) [2022-07-09 21:09:54,542][26022] Updated weights on worker 0-0, policy_version 421635 (0.00089) [2022-07-09 21:09:56,421][26022] Updated weights on worker 0-0, policy_version 421645 (0.00084) [2022-07-09 21:09:56,933][25689] Fps is (10 sec: 5621.5, 60 sec: 5644.0, 300 sec: 5655.6). Total num frames: 431766528. Throughput: 0: 5099.9. Samples: 431759848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:09:56,933][25689] Avg episode reward: [(0, '-46.958')] [2022-07-09 21:09:58,384][26022] Updated weights on worker 0-0, policy_version 421655 (0.00095) [2022-07-09 21:10:00,023][26022] Updated weights on worker 0-0, policy_version 421665 (0.00099) [2022-07-09 21:10:02,009][25689] Fps is (10 sec: 5479.2, 60 sec: 5660.6, 300 sec: 5657.9). Total num frames: 431793152. Throughput: 0: 5939.8. Samples: 431793834. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:02,010][25689] Avg episode reward: [(0, '-46.907')] [2022-07-09 21:10:02,246][26022] Updated weights on worker 0-0, policy_version 421675 (0.00090) [2022-07-09 21:10:04,115][26022] Updated weights on worker 0-0, policy_version 421685 (0.00089) [2022-07-09 21:10:05,808][26022] Updated weights on worker 0-0, policy_version 421695 (0.00083) [2022-07-09 21:10:07,029][25689] Fps is (10 sec: 5375.3, 60 sec: 5643.5, 300 sec: 5647.3). Total num frames: 431820800. Throughput: 0: 5815.9. Samples: 431825666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:07,030][25689] Avg episode reward: [(0, '-46.256')] [2022-07-09 21:10:07,647][26022] Updated weights on worker 0-0, policy_version 421705 (0.00094) [2022-07-09 21:10:09,546][26022] Updated weights on worker 0-0, policy_version 421715 (0.00088) [2022-07-09 21:10:11,274][26022] Updated weights on worker 0-0, policy_version 421725 (0.00084) [2022-07-09 21:10:12,051][25689] Fps is (10 sec: 5812.7, 60 sec: 5676.2, 300 sec: 5657.4). Total num frames: 431851520. Throughput: 0: 5842.7. Samples: 431860372. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:12,051][25689] Avg episode reward: [(0, '-45.943')] [2022-07-09 21:10:13,028][26022] Updated weights on worker 0-0, policy_version 421735 (0.00112) [2022-07-09 21:10:14,585][26022] Updated weights on worker 0-0, policy_version 421745 (0.00052) [2022-07-09 21:10:16,698][26022] Updated weights on worker 0-0, policy_version 421755 (0.00090) [2022-07-09 21:10:17,071][25689] Fps is (10 sec: 5710.7, 60 sec: 5674.8, 300 sec: 5651.1). Total num frames: 431878144. Throughput: 0: 5852.6. Samples: 431877708. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:17,071][25689] Avg episode reward: [(0, '-46.703')] [2022-07-09 21:10:18,200][26022] Updated weights on worker 0-0, policy_version 421765 (0.00092) [2022-07-09 21:10:20,251][26022] Updated weights on worker 0-0, policy_version 421775 (0.00099) [2022-07-09 21:10:21,749][26022] Updated weights on worker 0-0, policy_version 421785 (0.00086) [2022-07-09 21:10:22,185][25689] Fps is (10 sec: 5759.5, 60 sec: 5686.0, 300 sec: 5659.7). Total num frames: 431909888. Throughput: 0: 5862.7. Samples: 431912118. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:22,185][25689] Avg episode reward: [(0, '-46.714')] [2022-07-09 21:10:23,732][26022] Updated weights on worker 0-0, policy_version 421795 (0.00091) [2022-07-09 21:10:25,425][26022] Updated weights on worker 0-0, policy_version 421805 (0.00094) [2022-07-09 21:10:27,204][25689] Fps is (10 sec: 5861.1, 60 sec: 5688.4, 300 sec: 5660.1). Total num frames: 431937536. Throughput: 0: 6000.8. Samples: 431946730. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:27,206][25689] Avg episode reward: [(0, '-46.500')] [2022-07-09 21:10:27,224][26022] Updated weights on worker 0-0, policy_version 421815 (0.00090) [2022-07-09 21:10:28,867][26022] Updated weights on worker 0-0, policy_version 421825 (0.00083) [2022-07-09 21:10:30,816][26022] Updated weights on worker 0-0, policy_version 421835 (0.00082) [2022-07-09 21:10:32,216][25689] Fps is (10 sec: 5614.3, 60 sec: 5656.4, 300 sec: 5656.8). Total num frames: 431966208. Throughput: 0: 5149.4. Samples: 431964212. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:32,217][25689] Avg episode reward: [(0, '-46.543')] [2022-07-09 21:10:32,607][26022] Updated weights on worker 0-0, policy_version 421845 (0.00088) [2022-07-09 21:10:34,449][26022] Updated weights on worker 0-0, policy_version 421855 (0.00538) [2022-07-09 21:10:36,183][26022] Updated weights on worker 0-0, policy_version 421865 (0.00087) [2022-07-09 21:10:37,234][25689] Fps is (10 sec: 5819.2, 60 sec: 5705.6, 300 sec: 5657.7). Total num frames: 431995904. Throughput: 0: 5986.6. Samples: 431998418. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:37,235][25689] Avg episode reward: [(0, '-47.680')] [2022-07-09 21:10:38,007][26022] Updated weights on worker 0-0, policy_version 421875 (0.00091) [2022-07-09 21:10:39,811][26022] Updated weights on worker 0-0, policy_version 421885 (0.00051) [2022-07-09 21:10:41,639][26022] Updated weights on worker 0-0, policy_version 421895 (0.00087) [2022-07-09 21:10:42,324][25689] Fps is (10 sec: 5673.5, 60 sec: 5672.6, 300 sec: 5656.4). Total num frames: 432023552. Throughput: 0: 5964.3. Samples: 432032232. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:42,324][25689] Avg episode reward: [(0, '-47.456')] [2022-07-09 21:10:43,484][26022] Updated weights on worker 0-0, policy_version 421905 (0.00093) [2022-07-09 21:10:45,258][26022] Updated weights on worker 0-0, policy_version 421915 (0.00086) [2022-07-09 21:10:46,991][26022] Updated weights on worker 0-0, policy_version 421925 (0.00090) [2022-07-09 21:10:47,352][25689] Fps is (10 sec: 5667.6, 60 sec: 5675.9, 300 sec: 5659.5). Total num frames: 432053248. Throughput: 0: 5088.5. Samples: 432049254. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:47,353][25689] Avg episode reward: [(0, '-46.487')] [2022-07-09 21:10:48,935][26022] Updated weights on worker 0-0, policy_version 421935 (0.00086) [2022-07-09 21:10:50,576][26022] Updated weights on worker 0-0, policy_version 421945 (0.00093) [2022-07-09 21:10:52,376][25689] Fps is (10 sec: 5704.5, 60 sec: 5674.0, 300 sec: 5659.6). Total num frames: 432080896. Throughput: 0: 5922.2. Samples: 432083602. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:52,376][25689] Avg episode reward: [(0, '-46.950')] [2022-07-09 21:10:52,394][26022] Updated weights on worker 0-0, policy_version 421955 (0.00086) [2022-07-09 21:10:54,408][26022] Updated weights on worker 0-0, policy_version 421965 (0.00086) [2022-07-09 21:10:56,035][26022] Updated weights on worker 0-0, policy_version 421975 (0.00096) [2022-07-09 21:10:57,382][25689] Fps is (10 sec: 5615.6, 60 sec: 5674.9, 300 sec: 5660.4). Total num frames: 432109568. Throughput: 0: 5928.8. Samples: 432117866. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:10:57,382][25689] Avg episode reward: [(0, '-47.072')] [2022-07-09 21:10:57,903][26022] Updated weights on worker 0-0, policy_version 421985 (0.00088) [2022-07-09 21:10:59,687][26022] Updated weights on worker 0-0, policy_version 421995 (0.00089) [2022-07-09 21:11:01,337][26022] Updated weights on worker 0-0, policy_version 422005 (0.00088) [2022-07-09 21:11:02,420][25689] Fps is (10 sec: 5607.1, 60 sec: 5695.4, 300 sec: 5659.9). Total num frames: 432137216. Throughput: 0: 5115.7. Samples: 432135042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:02,421][25689] Avg episode reward: [(0, '-47.154')] [2022-07-09 21:11:03,736][26022] Updated weights on worker 0-0, policy_version 422015 (0.00089) [2022-07-09 21:11:05,232][26022] Updated weights on worker 0-0, policy_version 422025 (0.00086) [2022-07-09 21:11:07,359][26022] Updated weights on worker 0-0, policy_version 422035 (0.00059) [2022-07-09 21:11:07,439][25689] Fps is (10 sec: 5396.3, 60 sec: 5678.6, 300 sec: 5656.6). Total num frames: 432163840. Throughput: 0: 5858.0. Samples: 432166922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:07,439][25689] Avg episode reward: [(0, '-45.855')] [2022-07-09 21:11:08,723][26022] Updated weights on worker 0-0, policy_version 422045 (0.00054) [2022-07-09 21:11:10,902][26022] Updated weights on worker 0-0, policy_version 422055 (0.00090) [2022-07-09 21:11:12,459][25689] Fps is (10 sec: 5610.4, 60 sec: 5661.8, 300 sec: 5656.8). Total num frames: 432193536. Throughput: 0: 5862.0. Samples: 432201328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:12,459][25689] Avg episode reward: [(0, '-45.526')] [2022-07-09 21:11:12,505][26022] Updated weights on worker 0-0, policy_version 422065 (0.00089) [2022-07-09 21:11:14,396][26022] Updated weights on worker 0-0, policy_version 422075 (0.00093) [2022-07-09 21:11:16,038][26022] Updated weights on worker 0-0, policy_version 422085 (0.00088) [2022-07-09 21:11:17,463][25689] Fps is (10 sec: 5822.8, 60 sec: 5697.2, 300 sec: 5654.1). Total num frames: 432222208. Throughput: 0: 5014.4. Samples: 432218562. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:17,463][25689] Avg episode reward: [(0, '-45.617')] [2022-07-09 21:11:17,926][26022] Updated weights on worker 0-0, policy_version 422095 (0.00089) [2022-07-09 21:11:19,881][26022] Updated weights on worker 0-0, policy_version 422105 (0.00089) [2022-07-09 21:11:21,531][26022] Updated weights on worker 0-0, policy_version 422115 (0.00085) [2022-07-09 21:11:22,570][25689] Fps is (10 sec: 5570.3, 60 sec: 5630.0, 300 sec: 5656.0). Total num frames: 432249856. Throughput: 0: 5840.7. Samples: 432252726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:22,570][25689] Avg episode reward: [(0, '-45.379')] [2022-07-09 21:11:23,293][26022] Updated weights on worker 0-0, policy_version 422125 (0.00081) [2022-07-09 21:11:25,068][26022] Updated weights on worker 0-0, policy_version 422135 (0.00081) [2022-07-09 21:11:26,839][26022] Updated weights on worker 0-0, policy_version 422145 (0.00085) [2022-07-09 21:11:27,600][25689] Fps is (10 sec: 5656.7, 60 sec: 5662.9, 300 sec: 5659.5). Total num frames: 432279552. Throughput: 0: 5963.9. Samples: 432287162. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:27,601][25689] Avg episode reward: [(0, '-45.876')] [2022-07-09 21:11:28,728][26022] Updated weights on worker 0-0, policy_version 422155 (0.00088) [2022-07-09 21:11:30,432][26022] Updated weights on worker 0-0, policy_version 422165 (0.00089) [2022-07-09 21:11:32,279][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:11:32,286][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000422175_432307200.pth [2022-07-09 21:11:32,286][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000420183_430267392.pth [2022-07-09 21:11:32,292][26022] Updated weights on worker 0-0, policy_version 422175 (0.00088) [2022-07-09 21:11:32,606][25689] Fps is (10 sec: 5713.6, 60 sec: 5646.5, 300 sec: 5656.0). Total num frames: 432307200. Throughput: 0: 5111.6. Samples: 432304312. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:32,607][25689] Avg episode reward: [(0, '-46.486')] [2022-07-09 21:11:34,120][26022] Updated weights on worker 0-0, policy_version 422185 (0.00106) [2022-07-09 21:11:36,028][26022] Updated weights on worker 0-0, policy_version 422195 (0.00086) [2022-07-09 21:11:37,491][26022] Updated weights on worker 0-0, policy_version 422205 (0.00097) [2022-07-09 21:11:37,614][25689] Fps is (10 sec: 5828.7, 60 sec: 5664.4, 300 sec: 5659.9). Total num frames: 432337920. Throughput: 0: 5943.4. Samples: 432338328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:37,615][25689] Avg episode reward: [(0, '-47.004')] [2022-07-09 21:11:39,721][26022] Updated weights on worker 0-0, policy_version 422215 (0.00080) [2022-07-09 21:11:41,153][26022] Updated weights on worker 0-0, policy_version 422225 (0.00089) [2022-07-09 21:11:42,719][25689] Fps is (10 sec: 5670.4, 60 sec: 5646.0, 300 sec: 5654.9). Total num frames: 432364544. Throughput: 0: 5945.9. Samples: 432372532. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:42,719][25689] Avg episode reward: [(0, '-47.195')] [2022-07-09 21:11:43,247][26022] Updated weights on worker 0-0, policy_version 422235 (0.00431) [2022-07-09 21:11:45,006][26022] Updated weights on worker 0-0, policy_version 422245 (0.00084) [2022-07-09 21:11:46,639][26022] Updated weights on worker 0-0, policy_version 422255 (0.00093) [2022-07-09 21:11:47,751][25689] Fps is (10 sec: 5556.1, 60 sec: 5645.7, 300 sec: 5657.9). Total num frames: 432394240. Throughput: 0: 5089.7. Samples: 432389722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:47,751][25689] Avg episode reward: [(0, '-47.458')] [2022-07-09 21:11:48,667][26022] Updated weights on worker 0-0, policy_version 422265 (0.00087) [2022-07-09 21:11:50,151][26022] Updated weights on worker 0-0, policy_version 422275 (0.00094) [2022-07-09 21:11:52,113][26022] Updated weights on worker 0-0, policy_version 422285 (0.00091) [2022-07-09 21:11:52,762][25689] Fps is (10 sec: 5811.8, 60 sec: 5663.8, 300 sec: 5658.2). Total num frames: 432422912. Throughput: 0: 5938.4. Samples: 432424006. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:52,762][25689] Avg episode reward: [(0, '-47.786')] [2022-07-09 21:11:53,874][26022] Updated weights on worker 0-0, policy_version 422295 (0.00084) [2022-07-09 21:11:55,674][26022] Updated weights on worker 0-0, policy_version 422305 (0.00082) [2022-07-09 21:11:57,528][26022] Updated weights on worker 0-0, policy_version 422315 (0.00089) [2022-07-09 21:11:57,767][25689] Fps is (10 sec: 5725.2, 60 sec: 5663.9, 300 sec: 5662.5). Total num frames: 432451584. Throughput: 0: 5963.9. Samples: 432458516. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:11:57,767][25689] Avg episode reward: [(0, '-47.589')] [2022-07-09 21:11:59,328][26022] Updated weights on worker 0-0, policy_version 422325 (0.00085) [2022-07-09 21:12:01,051][26022] Updated weights on worker 0-0, policy_version 422335 (0.00094) [2022-07-09 21:12:02,854][25689] Fps is (10 sec: 5478.9, 60 sec: 5642.4, 300 sec: 5654.5). Total num frames: 432478208. Throughput: 0: 5113.8. Samples: 432475502. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:12:02,855][25689] Avg episode reward: [(0, '-47.513')] [2022-07-09 21:12:03,392][26022] Updated weights on worker 0-0, policy_version 422345 (0.00095) [2022-07-09 21:12:05,163][26022] Updated weights on worker 0-0, policy_version 422355 (0.00091) [2022-07-09 21:12:06,882][26022] Updated weights on worker 0-0, policy_version 422365 (0.00086) [2022-07-09 21:12:07,869][25689] Fps is (10 sec: 5473.7, 60 sec: 5676.7, 300 sec: 5658.7). Total num frames: 432506880. Throughput: 0: 5834.3. Samples: 432507096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:12:07,869][25689] Avg episode reward: [(0, '-47.509')] [2022-07-09 21:12:08,835][26022] Updated weights on worker 0-0, policy_version 422375 (0.00086) [2022-07-09 21:12:10,373][26022] Updated weights on worker 0-0, policy_version 422385 (0.00083) [2022-07-09 21:12:12,491][26022] Updated weights on worker 0-0, policy_version 422395 (0.00088) [2022-07-09 21:12:12,881][25689] Fps is (10 sec: 5718.9, 60 sec: 5660.4, 300 sec: 5658.6). Total num frames: 432535552. Throughput: 0: 5845.1. Samples: 432541608. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:12:12,882][25689] Avg episode reward: [(0, '-47.519')] [2022-07-09 21:12:13,943][26022] Updated weights on worker 0-0, policy_version 422405 (0.00083) [2022-07-09 21:12:16,000][26022] Updated weights on worker 0-0, policy_version 422415 (0.00088) [2022-07-09 21:12:17,549][26022] Updated weights on worker 0-0, policy_version 422425 (0.00090) [2022-07-09 21:12:17,931][25689] Fps is (10 sec: 5698.8, 60 sec: 5656.1, 300 sec: 5659.1). Total num frames: 432564224. Throughput: 0: 4975.8. Samples: 432558850. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:12:17,932][25689] Avg episode reward: [(0, '-47.091')] [2022-07-09 21:12:19,458][26022] Updated weights on worker 0-0, policy_version 422435 (0.00087) [2022-07-09 21:12:21,203][26022] Updated weights on worker 0-0, policy_version 422445 (0.00088) [2022-07-09 21:12:23,078][25689] Fps is (10 sec: 5623.6, 60 sec: 5669.2, 300 sec: 5653.8). Total num frames: 432592896. Throughput: 0: 5802.1. Samples: 432592844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:12:23,079][25689] Avg episode reward: [(0, '-46.205')] [2022-07-09 21:12:23,119][26022] Updated weights on worker 0-0, policy_version 422455 (0.00095) [2022-07-09 21:12:25,023][26022] Updated weights on worker 0-0, policy_version 422465 (0.00089) [2022-07-09 21:12:26,518][26022] Updated weights on worker 0-0, policy_version 422475 (0.00085) [2022-07-09 21:12:28,083][25689] Fps is (10 sec: 5648.7, 60 sec: 5654.8, 300 sec: 5658.6). Total num frames: 432621568. Throughput: 0: 5947.6. Samples: 432627322. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-09 21:12:28,083][25689] Avg episode reward: [(0, '-46.717')] [2022-07-09 21:12:28,704][26022] Updated weights on worker 0-0, policy_version 422485 (0.00089) [2022-07-09 21:12:30,212][26022] Updated weights on worker 0-0, policy_version 422495 (0.00087) [2022-07-09 21:12:32,140][26022] Updated weights on worker 0-0, policy_version 422505 (0.00084) [2022-07-09 21:12:33,087][25689] Fps is (10 sec: 5832.0, 60 sec: 5688.8, 300 sec: 5663.5). Total num frames: 432651264. Throughput: 0: 5073.8. Samples: 432644126. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:12:33,087][25689] Avg episode reward: [(0, '-46.904')] [2022-07-09 21:12:33,938][26022] Updated weights on worker 0-0, policy_version 422515 (0.00086) [2022-07-09 21:12:35,660][26022] Updated weights on worker 0-0, policy_version 422525 (0.00085) [2022-07-09 21:12:37,647][26022] Updated weights on worker 0-0, policy_version 422535 (0.00084) [2022-07-09 21:12:38,103][25689] Fps is (10 sec: 5723.0, 60 sec: 5637.3, 300 sec: 5662.5). Total num frames: 432678912. Throughput: 0: 5933.8. Samples: 432678544. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:12:38,103][25689] Avg episode reward: [(0, '-47.433')] [2022-07-09 21:12:39,228][26022] Updated weights on worker 0-0, policy_version 422545 (0.00082) [2022-07-09 21:12:41,112][26022] Updated weights on worker 0-0, policy_version 422555 (0.00092) [2022-07-09 21:12:42,647][26022] Updated weights on worker 0-0, policy_version 422565 (0.00090) [2022-07-09 21:12:43,191][25689] Fps is (10 sec: 5573.9, 60 sec: 5672.7, 300 sec: 5651.7). Total num frames: 432707584. Throughput: 0: 5946.2. Samples: 432712436. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:12:43,192][25689] Avg episode reward: [(0, '-46.495')] [2022-07-09 21:12:44,916][26022] Updated weights on worker 0-0, policy_version 422575 (0.00095) [2022-07-09 21:12:46,504][26022] Updated weights on worker 0-0, policy_version 422585 (0.00085) [2022-07-09 21:12:48,210][25689] Fps is (10 sec: 5673.7, 60 sec: 5657.0, 300 sec: 5662.6). Total num frames: 432736256. Throughput: 0: 5083.1. Samples: 432729630. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:12:48,210][25689] Avg episode reward: [(0, '-46.554')] [2022-07-09 21:12:48,272][26022] Updated weights on worker 0-0, policy_version 422595 (0.00092) [2022-07-09 21:12:50,031][26022] Updated weights on worker 0-0, policy_version 422605 (0.00082) [2022-07-09 21:12:52,077][26022] Updated weights on worker 0-0, policy_version 422615 (0.00086) [2022-07-09 21:12:53,213][25689] Fps is (10 sec: 5619.5, 60 sec: 5640.8, 300 sec: 5652.9). Total num frames: 432763904. Throughput: 0: 5951.3. Samples: 432763904. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:12:53,214][25689] Avg episode reward: [(0, '-47.214')] [2022-07-09 21:12:53,629][26022] Updated weights on worker 0-0, policy_version 422625 (0.00086) [2022-07-09 21:12:55,646][26022] Updated weights on worker 0-0, policy_version 422635 (0.00086) [2022-07-09 21:12:57,236][26022] Updated weights on worker 0-0, policy_version 422645 (0.00081) [2022-07-09 21:12:58,233][25689] Fps is (10 sec: 5619.1, 60 sec: 5639.4, 300 sec: 5653.9). Total num frames: 432792576. Throughput: 0: 5943.8. Samples: 432798192. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:12:58,234][25689] Avg episode reward: [(0, '-47.426')] [2022-07-09 21:12:59,196][26022] Updated weights on worker 0-0, policy_version 422655 (0.00092) [2022-07-09 21:13:00,785][26022] Updated weights on worker 0-0, policy_version 422665 (0.00092) [2022-07-09 21:13:03,119][26022] Updated weights on worker 0-0, policy_version 422675 (0.00089) [2022-07-09 21:13:03,360][25689] Fps is (10 sec: 5651.4, 60 sec: 5669.6, 300 sec: 5662.0). Total num frames: 432821248. Throughput: 0: 5093.1. Samples: 432815160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:03,361][25689] Avg episode reward: [(0, '-47.122')] [2022-07-09 21:13:04,828][26022] Updated weights on worker 0-0, policy_version 422685 (0.00087) [2022-07-09 21:13:06,694][26022] Updated weights on worker 0-0, policy_version 422695 (0.00084) [2022-07-09 21:13:08,379][25689] Fps is (10 sec: 5550.7, 60 sec: 5652.2, 300 sec: 5655.8). Total num frames: 432848896. Throughput: 0: 5827.7. Samples: 432847172. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:08,380][25689] Avg episode reward: [(0, '-47.240')] [2022-07-09 21:13:08,513][26022] Updated weights on worker 0-0, policy_version 422705 (0.00080) [2022-07-09 21:13:10,394][26022] Updated weights on worker 0-0, policy_version 422715 (0.00088) [2022-07-09 21:13:12,207][26022] Updated weights on worker 0-0, policy_version 422725 (0.00083) [2022-07-09 21:13:13,385][25689] Fps is (10 sec: 5617.7, 60 sec: 5652.8, 300 sec: 5660.0). Total num frames: 432877568. Throughput: 0: 5820.1. Samples: 432881310. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:13,387][25689] Avg episode reward: [(0, '-47.193')] [2022-07-09 21:13:14,036][26022] Updated weights on worker 0-0, policy_version 422735 (0.00088) [2022-07-09 21:13:15,479][26022] Updated weights on worker 0-0, policy_version 422745 (0.00048) [2022-07-09 21:13:17,522][26022] Updated weights on worker 0-0, policy_version 422755 (0.00087) [2022-07-09 21:13:18,389][25689] Fps is (10 sec: 5831.0, 60 sec: 5674.0, 300 sec: 5664.3). Total num frames: 432907264. Throughput: 0: 5840.8. Samples: 432915922. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:18,390][25689] Avg episode reward: [(0, '-46.918')] [2022-07-09 21:13:19,194][26022] Updated weights on worker 0-0, policy_version 422765 (0.00085) [2022-07-09 21:13:20,961][26022] Updated weights on worker 0-0, policy_version 422775 (0.00084) [2022-07-09 21:13:22,808][26022] Updated weights on worker 0-0, policy_version 422785 (0.00089) [2022-07-09 21:13:23,432][25689] Fps is (10 sec: 5605.8, 60 sec: 5649.9, 300 sec: 5653.5). Total num frames: 432933888. Throughput: 0: 5870.0. Samples: 432932984. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:23,434][25689] Avg episode reward: [(0, '-46.726')] [2022-07-09 21:13:24,598][26022] Updated weights on worker 0-0, policy_version 422795 (0.00091) [2022-07-09 21:13:26,454][26022] Updated weights on worker 0-0, policy_version 422805 (0.00091) [2022-07-09 21:13:28,240][26022] Updated weights on worker 0-0, policy_version 422815 (0.00087) [2022-07-09 21:13:28,451][25689] Fps is (10 sec: 5597.3, 60 sec: 5665.5, 300 sec: 5660.6). Total num frames: 432963584. Throughput: 0: 5969.6. Samples: 432966994. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:28,453][25689] Avg episode reward: [(0, '-46.566')] [2022-07-09 21:13:29,935][26022] Updated weights on worker 0-0, policy_version 422825 (0.00094) [2022-07-09 21:13:31,845][26022] Updated weights on worker 0-0, policy_version 422835 (0.00087) [2022-07-09 21:13:32,289][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:13:32,305][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000422838_432986112.pth [2022-07-09 21:13:32,305][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000420847_430947328.pth [2022-07-09 21:13:33,467][25689] Fps is (10 sec: 5816.4, 60 sec: 5647.4, 300 sec: 5657.0). Total num frames: 432992256. Throughput: 0: 5961.6. Samples: 433001030. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:33,469][25689] Avg episode reward: [(0, '-46.166')] [2022-07-09 21:13:33,496][26022] Updated weights on worker 0-0, policy_version 422845 (0.00089) [2022-07-09 21:13:35,359][26022] Updated weights on worker 0-0, policy_version 422855 (0.00081) [2022-07-09 21:13:37,352][26022] Updated weights on worker 0-0, policy_version 422865 (0.00089) [2022-07-09 21:13:38,481][25689] Fps is (10 sec: 5513.3, 60 sec: 5630.7, 300 sec: 5654.1). Total num frames: 433018880. Throughput: 0: 5085.9. Samples: 433018106. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:38,483][25689] Avg episode reward: [(0, '-46.906')] [2022-07-09 21:13:39,051][26022] Updated weights on worker 0-0, policy_version 422875 (0.00086) [2022-07-09 21:13:40,968][26022] Updated weights on worker 0-0, policy_version 422885 (0.00089) [2022-07-09 21:13:42,643][26022] Updated weights on worker 0-0, policy_version 422895 (0.00089) [2022-07-09 21:13:43,542][25689] Fps is (10 sec: 5590.3, 60 sec: 5650.1, 300 sec: 5660.2). Total num frames: 433048576. Throughput: 0: 5925.3. Samples: 433052140. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:43,544][25689] Avg episode reward: [(0, '-46.434')] [2022-07-09 21:13:44,345][26022] Updated weights on worker 0-0, policy_version 422905 (0.00087) [2022-07-09 21:13:46,178][26022] Updated weights on worker 0-0, policy_version 422915 (0.00091) [2022-07-09 21:13:48,158][26022] Updated weights on worker 0-0, policy_version 422925 (0.00091) [2022-07-09 21:13:48,571][25689] Fps is (10 sec: 5683.4, 60 sec: 5632.3, 300 sec: 5649.8). Total num frames: 433076224. Throughput: 0: 5926.3. Samples: 433086226. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:48,572][25689] Avg episode reward: [(0, '-47.285')] [2022-07-09 21:13:49,865][26022] Updated weights on worker 0-0, policy_version 422935 (0.00086) [2022-07-09 21:13:51,831][26022] Updated weights on worker 0-0, policy_version 422945 (0.00084) [2022-07-09 21:13:53,574][25689] Fps is (10 sec: 5613.9, 60 sec: 5649.2, 300 sec: 5653.3). Total num frames: 433104896. Throughput: 0: 5066.8. Samples: 433102908. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:53,576][25689] Avg episode reward: [(0, '-46.594')] [2022-07-09 21:13:53,775][26022] Updated weights on worker 0-0, policy_version 422955 (0.00083) [2022-07-09 21:13:55,255][26022] Updated weights on worker 0-0, policy_version 422965 (0.00082) [2022-07-09 21:13:57,443][26022] Updated weights on worker 0-0, policy_version 422975 (0.00086) [2022-07-09 21:13:58,607][25689] Fps is (10 sec: 5917.7, 60 sec: 5681.9, 300 sec: 5671.3). Total num frames: 433135616. Throughput: 0: 5931.4. Samples: 433137482. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:13:58,607][25689] Avg episode reward: [(0, '-46.306')] [2022-07-09 21:13:58,695][26022] Updated weights on worker 0-0, policy_version 422985 (0.00086) [2022-07-09 21:14:00,851][26022] Updated weights on worker 0-0, policy_version 422995 (0.00109) [2022-07-09 21:14:02,926][26022] Updated weights on worker 0-0, policy_version 423005 (0.00089) [2022-07-09 21:14:03,716][25689] Fps is (10 sec: 5553.1, 60 sec: 5632.7, 300 sec: 5659.2). Total num frames: 433161216. Throughput: 0: 5828.7. Samples: 433169730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:03,717][25689] Avg episode reward: [(0, '-45.871')] [2022-07-09 21:14:04,774][26022] Updated weights on worker 0-0, policy_version 423015 (0.00086) [2022-07-09 21:14:06,585][26022] Updated weights on worker 0-0, policy_version 423025 (0.00093) [2022-07-09 21:14:08,616][26022] Updated weights on worker 0-0, policy_version 423035 (0.00088) [2022-07-09 21:14:08,743][25689] Fps is (10 sec: 5152.6, 60 sec: 5615.1, 300 sec: 5652.0). Total num frames: 433187840. Throughput: 0: 4986.9. Samples: 433186820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:08,743][25689] Avg episode reward: [(0, '-45.984')] [2022-07-09 21:14:10,073][26022] Updated weights on worker 0-0, policy_version 423045 (0.00086) [2022-07-09 21:14:12,173][26022] Updated weights on worker 0-0, policy_version 423055 (0.00107) [2022-07-09 21:14:13,639][26022] Updated weights on worker 0-0, policy_version 423065 (0.00082) [2022-07-09 21:14:13,755][25689] Fps is (10 sec: 5814.4, 60 sec: 5665.4, 300 sec: 5669.1). Total num frames: 433219584. Throughput: 0: 5825.8. Samples: 433220476. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:13,755][25689] Avg episode reward: [(0, '-46.513')] [2022-07-09 21:14:15,672][26022] Updated weights on worker 0-0, policy_version 423075 (0.00094) [2022-07-09 21:14:17,371][26022] Updated weights on worker 0-0, policy_version 423085 (0.00092) [2022-07-09 21:14:18,769][25689] Fps is (10 sec: 5821.5, 60 sec: 5613.6, 300 sec: 5656.1). Total num frames: 433246208. Throughput: 0: 5792.2. Samples: 433254264. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:18,769][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 21:14:19,340][26022] Updated weights on worker 0-0, policy_version 423095 (0.00092) [2022-07-09 21:14:21,037][26022] Updated weights on worker 0-0, policy_version 423105 (0.00090) [2022-07-09 21:14:23,061][26022] Updated weights on worker 0-0, policy_version 423115 (0.00094) [2022-07-09 21:14:23,875][25689] Fps is (10 sec: 5464.1, 60 sec: 5641.6, 300 sec: 5658.3). Total num frames: 433274880. Throughput: 0: 5030.2. Samples: 433271130. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:23,875][25689] Avg episode reward: [(0, '-48.144')] [2022-07-09 21:14:24,674][26022] Updated weights on worker 0-0, policy_version 423125 (0.00082) [2022-07-09 21:14:26,568][26022] Updated weights on worker 0-0, policy_version 423135 (0.00085) [2022-07-09 21:14:28,321][26022] Updated weights on worker 0-0, policy_version 423145 (0.00086) [2022-07-09 21:14:28,881][25689] Fps is (10 sec: 5670.9, 60 sec: 5625.9, 300 sec: 5651.9). Total num frames: 433303552. Throughput: 0: 5878.7. Samples: 433305208. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:28,881][25689] Avg episode reward: [(0, '-48.324')] [2022-07-09 21:14:30,038][26022] Updated weights on worker 0-0, policy_version 423155 (0.00087) [2022-07-09 21:14:31,929][26022] Updated weights on worker 0-0, policy_version 423165 (0.00083) [2022-07-09 21:14:33,612][26022] Updated weights on worker 0-0, policy_version 423175 (0.00089) [2022-07-09 21:14:33,887][25689] Fps is (10 sec: 5727.3, 60 sec: 5626.8, 300 sec: 5658.7). Total num frames: 433332224. Throughput: 0: 5913.4. Samples: 433339526. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:33,888][25689] Avg episode reward: [(0, '-48.015')] [2022-07-09 21:14:35,525][26022] Updated weights on worker 0-0, policy_version 423185 (0.00081) [2022-07-09 21:14:37,351][26022] Updated weights on worker 0-0, policy_version 423195 (0.00085) [2022-07-09 21:14:38,956][25689] Fps is (10 sec: 5691.8, 60 sec: 5655.5, 300 sec: 5655.9). Total num frames: 433360896. Throughput: 0: 5073.6. Samples: 433356684. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:38,957][25689] Avg episode reward: [(0, '-47.349')] [2022-07-09 21:14:38,964][26022] Updated weights on worker 0-0, policy_version 423205 (0.00094) [2022-07-09 21:14:41,052][26022] Updated weights on worker 0-0, policy_version 423215 (0.00093) [2022-07-09 21:14:42,573][26022] Updated weights on worker 0-0, policy_version 423225 (0.00096) [2022-07-09 21:14:44,029][25689] Fps is (10 sec: 5654.2, 60 sec: 5637.4, 300 sec: 5652.2). Total num frames: 433389568. Throughput: 0: 5935.7. Samples: 433390760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:44,031][25689] Avg episode reward: [(0, '-47.585')] [2022-07-09 21:14:44,668][26022] Updated weights on worker 0-0, policy_version 423235 (0.00097) [2022-07-09 21:14:46,330][26022] Updated weights on worker 0-0, policy_version 423245 (0.00090) [2022-07-09 21:14:48,164][26022] Updated weights on worker 0-0, policy_version 423255 (0.00089) [2022-07-09 21:14:49,041][25689] Fps is (10 sec: 5685.9, 60 sec: 5655.9, 300 sec: 5655.5). Total num frames: 433418240. Throughput: 0: 5967.4. Samples: 433425512. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:49,041][25689] Avg episode reward: [(0, '-46.880')] [2022-07-09 21:14:49,763][26022] Updated weights on worker 0-0, policy_version 423265 (0.00089) [2022-07-09 21:14:51,528][26022] Updated weights on worker 0-0, policy_version 423275 (0.00086) [2022-07-09 21:14:53,490][26022] Updated weights on worker 0-0, policy_version 423285 (0.00721) [2022-07-09 21:14:54,065][25689] Fps is (10 sec: 5815.9, 60 sec: 5671.0, 300 sec: 5658.8). Total num frames: 433447936. Throughput: 0: 5109.5. Samples: 433442624. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:54,067][25689] Avg episode reward: [(0, '-45.717')] [2022-07-09 21:14:55,045][26022] Updated weights on worker 0-0, policy_version 423295 (0.00091) [2022-07-09 21:14:57,047][26022] Updated weights on worker 0-0, policy_version 423305 (0.00087) [2022-07-09 21:14:58,850][26022] Updated weights on worker 0-0, policy_version 423315 (0.00103) [2022-07-09 21:14:59,157][25689] Fps is (10 sec: 5668.5, 60 sec: 5614.6, 300 sec: 5661.9). Total num frames: 433475584. Throughput: 0: 5953.6. Samples: 433476956. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:14:59,159][25689] Avg episode reward: [(0, '-45.814')] [2022-07-09 21:15:00,500][26022] Updated weights on worker 0-0, policy_version 423325 (0.00086) [2022-07-09 21:15:02,925][26022] Updated weights on worker 0-0, policy_version 423335 (0.00096) [2022-07-09 21:15:04,248][25689] Fps is (10 sec: 5530.9, 60 sec: 5667.1, 300 sec: 5664.0). Total num frames: 433504256. Throughput: 0: 5847.2. Samples: 433508984. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:15:04,250][25689] Avg episode reward: [(0, '-45.807')] [2022-07-09 21:15:04,509][26022] Updated weights on worker 0-0, policy_version 423345 (0.00086) [2022-07-09 21:15:06,418][26022] Updated weights on worker 0-0, policy_version 423355 (0.00097) [2022-07-09 21:15:08,164][26022] Updated weights on worker 0-0, policy_version 423365 (0.00079) [2022-07-09 21:15:09,337][25689] Fps is (10 sec: 5532.4, 60 sec: 5678.1, 300 sec: 5652.4). Total num frames: 433531904. Throughput: 0: 4962.6. Samples: 433526234. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:15:09,338][25689] Avg episode reward: [(0, '-46.633')] [2022-07-09 21:15:09,876][26022] Updated weights on worker 0-0, policy_version 423375 (0.00096) [2022-07-09 21:15:11,881][26022] Updated weights on worker 0-0, policy_version 423385 (0.00091) [2022-07-09 21:15:13,523][26022] Updated weights on worker 0-0, policy_version 423395 (0.00099) [2022-07-09 21:15:14,366][25689] Fps is (10 sec: 5465.2, 60 sec: 5609.0, 300 sec: 5655.7). Total num frames: 433559552. Throughput: 0: 5786.5. Samples: 433560094. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 21:15:14,366][25689] Avg episode reward: [(0, '-46.728')] [2022-07-09 21:15:15,459][26022] Updated weights on worker 0-0, policy_version 423405 (0.00093) [2022-07-09 21:15:17,160][26022] Updated weights on worker 0-0, policy_version 423415 (0.00085) [2022-07-09 21:15:18,845][26022] Updated weights on worker 0-0, policy_version 423425 (0.00077) [2022-07-09 21:15:19,375][25689] Fps is (10 sec: 5815.0, 60 sec: 5677.1, 300 sec: 5654.2). Total num frames: 433590272. Throughput: 0: 5819.3. Samples: 433594608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:19,377][25689] Avg episode reward: [(0, '-47.612')] [2022-07-09 21:15:20,863][26022] Updated weights on worker 0-0, policy_version 423435 (0.00084) [2022-07-09 21:15:22,466][26022] Updated weights on worker 0-0, policy_version 423445 (0.00088) [2022-07-09 21:15:24,465][25689] Fps is (10 sec: 5779.3, 60 sec: 5661.6, 300 sec: 5652.9). Total num frames: 433617920. Throughput: 0: 5088.3. Samples: 433611856. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:24,466][26022] Updated weights on worker 0-0, policy_version 423455 (0.00087) [2022-07-09 21:15:24,467][25689] Avg episode reward: [(0, '-46.884')] [2022-07-09 21:15:25,924][26022] Updated weights on worker 0-0, policy_version 423465 (0.00094) [2022-07-09 21:15:27,895][26022] Updated weights on worker 0-0, policy_version 423475 (0.00086) [2022-07-09 21:15:29,494][25689] Fps is (10 sec: 5565.7, 60 sec: 5659.5, 300 sec: 5652.6). Total num frames: 433646592. Throughput: 0: 5936.3. Samples: 433645890. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:29,496][25689] Avg episode reward: [(0, '-46.384')] [2022-07-09 21:15:29,766][26022] Updated weights on worker 0-0, policy_version 423485 (0.00090) [2022-07-09 21:15:31,606][26022] Updated weights on worker 0-0, policy_version 423495 (0.00088) [2022-07-09 21:15:32,439][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:15:32,453][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000423499_433662976.pth [2022-07-09 21:15:32,453][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000421510_431626240.pth [2022-07-09 21:15:33,348][26022] Updated weights on worker 0-0, policy_version 423505 (0.00091) [2022-07-09 21:15:34,516][25689] Fps is (10 sec: 5705.1, 60 sec: 5658.0, 300 sec: 5649.0). Total num frames: 433675264. Throughput: 0: 5945.8. Samples: 433679906. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:34,517][25689] Avg episode reward: [(0, '-46.739')] [2022-07-09 21:15:35,108][26022] Updated weights on worker 0-0, policy_version 423515 (0.00093) [2022-07-09 21:15:36,945][26022] Updated weights on worker 0-0, policy_version 423525 (0.00086) [2022-07-09 21:15:38,749][26022] Updated weights on worker 0-0, policy_version 423535 (0.00091) [2022-07-09 21:15:39,550][25689] Fps is (10 sec: 5702.3, 60 sec: 5661.2, 300 sec: 5653.5). Total num frames: 433703936. Throughput: 0: 5076.8. Samples: 433697034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:39,551][25689] Avg episode reward: [(0, '-46.371')] [2022-07-09 21:15:40,735][26022] Updated weights on worker 0-0, policy_version 423545 (0.00409) [2022-07-09 21:15:42,372][26022] Updated weights on worker 0-0, policy_version 423555 (0.00100) [2022-07-09 21:15:44,327][26022] Updated weights on worker 0-0, policy_version 423565 (0.00082) [2022-07-09 21:15:44,674][25689] Fps is (10 sec: 5745.9, 60 sec: 5673.4, 300 sec: 5651.7). Total num frames: 433733632. Throughput: 0: 5890.2. Samples: 433730894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:44,677][25689] Avg episode reward: [(0, '-46.265')] [2022-07-09 21:15:45,899][26022] Updated weights on worker 0-0, policy_version 423575 (0.00086) [2022-07-09 21:15:47,745][26022] Updated weights on worker 0-0, policy_version 423585 (0.00088) [2022-07-09 21:15:49,683][25689] Fps is (10 sec: 5557.9, 60 sec: 5639.9, 300 sec: 5648.6). Total num frames: 433760256. Throughput: 0: 5893.1. Samples: 433764870. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:49,684][25689] Avg episode reward: [(0, '-46.310')] [2022-07-09 21:15:49,776][26022] Updated weights on worker 0-0, policy_version 423595 (0.00091) [2022-07-09 21:15:51,581][26022] Updated weights on worker 0-0, policy_version 423605 (0.00090) [2022-07-09 21:15:53,426][26022] Updated weights on worker 0-0, policy_version 423615 (0.00091) [2022-07-09 21:15:54,697][25689] Fps is (10 sec: 5517.2, 60 sec: 5623.9, 300 sec: 5648.4). Total num frames: 433788928. Throughput: 0: 5883.3. Samples: 433798634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:54,697][25689] Avg episode reward: [(0, '-46.338')] [2022-07-09 21:15:55,093][26022] Updated weights on worker 0-0, policy_version 423625 (0.00092) [2022-07-09 21:15:57,036][26022] Updated weights on worker 0-0, policy_version 423635 (0.00086) [2022-07-09 21:15:58,755][26022] Updated weights on worker 0-0, policy_version 423645 (0.00088) [2022-07-09 21:15:59,707][25689] Fps is (10 sec: 5618.9, 60 sec: 5631.6, 300 sec: 5649.0). Total num frames: 433816576. Throughput: 0: 5883.5. Samples: 433815626. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:15:59,707][25689] Avg episode reward: [(0, '-46.998')] [2022-07-09 21:16:00,459][26022] Updated weights on worker 0-0, policy_version 423655 (0.00087) [2022-07-09 21:16:02,822][26022] Updated weights on worker 0-0, policy_version 423665 (0.00083) [2022-07-09 21:16:04,655][26022] Updated weights on worker 0-0, policy_version 423675 (0.00084) [2022-07-09 21:16:04,848][25689] Fps is (10 sec: 5447.1, 60 sec: 5610.0, 300 sec: 5650.1). Total num frames: 433844224. Throughput: 0: 5774.9. Samples: 433847396. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:04,849][25689] Avg episode reward: [(0, '-46.220')] [2022-07-09 21:16:06,462][26022] Updated weights on worker 0-0, policy_version 423685 (0.00092) [2022-07-09 21:16:08,151][26022] Updated weights on worker 0-0, policy_version 423695 (0.00092) [2022-07-09 21:16:09,866][25689] Fps is (10 sec: 5442.9, 60 sec: 5616.6, 300 sec: 5643.2). Total num frames: 433871872. Throughput: 0: 5784.9. Samples: 433881626. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:09,867][25689] Avg episode reward: [(0, '-46.756')] [2022-07-09 21:16:10,072][26022] Updated weights on worker 0-0, policy_version 423705 (0.00091) [2022-07-09 21:16:11,884][26022] Updated weights on worker 0-0, policy_version 423715 (0.00098) [2022-07-09 21:16:13,724][26022] Updated weights on worker 0-0, policy_version 423725 (0.00087) [2022-07-09 21:16:14,882][25689] Fps is (10 sec: 5510.9, 60 sec: 5617.7, 300 sec: 5639.5). Total num frames: 433899520. Throughput: 0: 4943.4. Samples: 433898420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:14,883][25689] Avg episode reward: [(0, '-46.814')] [2022-07-09 21:16:15,408][26022] Updated weights on worker 0-0, policy_version 423735 (0.00092) [2022-07-09 21:16:17,267][26022] Updated weights on worker 0-0, policy_version 423745 (0.00089) [2022-07-09 21:16:18,924][26022] Updated weights on worker 0-0, policy_version 423755 (0.00089) [2022-07-09 21:16:19,918][25689] Fps is (10 sec: 5806.8, 60 sec: 5615.3, 300 sec: 5651.2). Total num frames: 433930240. Throughput: 0: 5789.0. Samples: 433932628. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:19,919][25689] Avg episode reward: [(0, '-46.897')] [2022-07-09 21:16:20,992][26022] Updated weights on worker 0-0, policy_version 423765 (0.00093) [2022-07-09 21:16:22,510][26022] Updated weights on worker 0-0, policy_version 423775 (0.00088) [2022-07-09 21:16:24,448][26022] Updated weights on worker 0-0, policy_version 423785 (0.00093) [2022-07-09 21:16:25,047][25689] Fps is (10 sec: 5742.4, 60 sec: 5611.7, 300 sec: 5642.5). Total num frames: 433957888. Throughput: 0: 5928.0. Samples: 433967132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:25,047][25689] Avg episode reward: [(0, '-47.439')] [2022-07-09 21:16:26,224][26022] Updated weights on worker 0-0, policy_version 423795 (0.00087) [2022-07-09 21:16:28,055][26022] Updated weights on worker 0-0, policy_version 423805 (0.00095) [2022-07-09 21:16:29,959][26022] Updated weights on worker 0-0, policy_version 423815 (0.00085) [2022-07-09 21:16:30,062][25689] Fps is (10 sec: 5551.8, 60 sec: 5613.0, 300 sec: 5645.7). Total num frames: 433986560. Throughput: 0: 5074.3. Samples: 433984108. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:30,064][25689] Avg episode reward: [(0, '-46.596')] [2022-07-09 21:16:31,604][26022] Updated weights on worker 0-0, policy_version 423825 (0.00092) [2022-07-09 21:16:33,596][26022] Updated weights on worker 0-0, policy_version 423835 (0.00092) [2022-07-09 21:16:35,071][25689] Fps is (10 sec: 5720.7, 60 sec: 5614.2, 300 sec: 5638.8). Total num frames: 434015232. Throughput: 0: 5927.2. Samples: 434018080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:35,072][25689] Avg episode reward: [(0, '-46.584')] [2022-07-09 21:16:35,278][26022] Updated weights on worker 0-0, policy_version 423845 (0.00085) [2022-07-09 21:16:37,224][26022] Updated weights on worker 0-0, policy_version 423855 (0.00089) [2022-07-09 21:16:38,840][26022] Updated weights on worker 0-0, policy_version 423865 (0.00082) [2022-07-09 21:16:40,118][25689] Fps is (10 sec: 5906.5, 60 sec: 5646.9, 300 sec: 5653.7). Total num frames: 434045952. Throughput: 0: 5923.8. Samples: 434052288. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:40,120][25689] Avg episode reward: [(0, '-46.030')] [2022-07-09 21:16:40,790][26022] Updated weights on worker 0-0, policy_version 423875 (0.00087) [2022-07-09 21:16:42,351][26022] Updated weights on worker 0-0, policy_version 423885 (0.00087) [2022-07-09 21:16:44,523][26022] Updated weights on worker 0-0, policy_version 423895 (0.00093) [2022-07-09 21:16:45,190][25689] Fps is (10 sec: 5767.8, 60 sec: 5617.8, 300 sec: 5646.1). Total num frames: 434073600. Throughput: 0: 5074.9. Samples: 434069358. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:45,191][25689] Avg episode reward: [(0, '-46.005')] [2022-07-09 21:16:46,065][26022] Updated weights on worker 0-0, policy_version 423905 (0.00091) [2022-07-09 21:16:48,001][26022] Updated weights on worker 0-0, policy_version 423915 (0.00083) [2022-07-09 21:16:49,484][26022] Updated weights on worker 0-0, policy_version 423925 (0.00083) [2022-07-09 21:16:50,195][25689] Fps is (10 sec: 5588.9, 60 sec: 5652.1, 300 sec: 5646.2). Total num frames: 434102272. Throughput: 0: 5943.0. Samples: 434103756. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:50,195][25689] Avg episode reward: [(0, '-46.203')] [2022-07-09 21:16:51,511][26022] Updated weights on worker 0-0, policy_version 423935 (0.00098) [2022-07-09 21:16:53,131][26022] Updated weights on worker 0-0, policy_version 423945 (0.00082) [2022-07-09 21:16:55,225][25689] Fps is (10 sec: 5510.3, 60 sec: 5616.7, 300 sec: 5638.8). Total num frames: 434128896. Throughput: 0: 5956.7. Samples: 434138136. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:16:55,226][25689] Avg episode reward: [(0, '-46.087')] [2022-07-09 21:16:55,280][26022] Updated weights on worker 0-0, policy_version 423955 (0.00095) [2022-07-09 21:16:56,556][26022] Updated weights on worker 0-0, policy_version 423965 (0.00098) [2022-07-09 21:16:58,714][26022] Updated weights on worker 0-0, policy_version 423975 (0.00086) [2022-07-09 21:17:00,140][26022] Updated weights on worker 0-0, policy_version 423985 (0.00079) [2022-07-09 21:17:00,241][25689] Fps is (10 sec: 5809.9, 60 sec: 5683.8, 300 sec: 5657.4). Total num frames: 434160640. Throughput: 0: 5121.0. Samples: 434155340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:00,243][25689] Avg episode reward: [(0, '-45.987')] [2022-07-09 21:17:02,391][26022] Updated weights on worker 0-0, policy_version 423995 (0.00087) [2022-07-09 21:17:04,599][26022] Updated weights on worker 0-0, policy_version 424005 (0.00086) [2022-07-09 21:17:05,281][25689] Fps is (10 sec: 5702.9, 60 sec: 5659.5, 300 sec: 5646.6). Total num frames: 434186240. Throughput: 0: 5885.0. Samples: 434187590. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:05,281][25689] Avg episode reward: [(0, '-46.829')] [2022-07-09 21:17:06,146][26022] Updated weights on worker 0-0, policy_version 424015 (0.00088) [2022-07-09 21:17:08,115][26022] Updated weights on worker 0-0, policy_version 424025 (0.00087) [2022-07-09 21:17:09,683][26022] Updated weights on worker 0-0, policy_version 424035 (0.00087) [2022-07-09 21:17:10,288][25689] Fps is (10 sec: 5402.0, 60 sec: 5677.4, 300 sec: 5646.7). Total num frames: 434214912. Throughput: 0: 5862.2. Samples: 434221546. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:10,288][25689] Avg episode reward: [(0, '-46.487')] [2022-07-09 21:17:11,650][26022] Updated weights on worker 0-0, policy_version 424045 (0.00084) [2022-07-09 21:17:13,345][26022] Updated weights on worker 0-0, policy_version 424055 (0.00092) [2022-07-09 21:17:15,175][26022] Updated weights on worker 0-0, policy_version 424065 (0.00092) [2022-07-09 21:17:15,332][25689] Fps is (10 sec: 5705.3, 60 sec: 5691.8, 300 sec: 5646.8). Total num frames: 434243584. Throughput: 0: 5003.1. Samples: 434238730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:15,332][25689] Avg episode reward: [(0, '-47.014')] [2022-07-09 21:17:16,823][26022] Updated weights on worker 0-0, policy_version 424075 (0.00086) [2022-07-09 21:17:18,677][26022] Updated weights on worker 0-0, policy_version 424085 (0.00092) [2022-07-09 21:17:20,356][25689] Fps is (10 sec: 5593.9, 60 sec: 5642.0, 300 sec: 5645.7). Total num frames: 434271232. Throughput: 0: 5858.8. Samples: 434273188. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:20,356][25689] Avg episode reward: [(0, '-46.607')] [2022-07-09 21:17:20,489][26022] Updated weights on worker 0-0, policy_version 424095 (0.00092) [2022-07-09 21:17:22,391][26022] Updated weights on worker 0-0, policy_version 424105 (0.00084) [2022-07-09 21:17:24,002][26022] Updated weights on worker 0-0, policy_version 424115 (0.00085) [2022-07-09 21:17:25,427][25689] Fps is (10 sec: 5680.4, 60 sec: 5681.4, 300 sec: 5647.9). Total num frames: 434300928. Throughput: 0: 5957.7. Samples: 434307614. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:25,427][25689] Avg episode reward: [(0, '-46.791')] [2022-07-09 21:17:26,048][26022] Updated weights on worker 0-0, policy_version 424125 (0.00095) [2022-07-09 21:17:27,806][26022] Updated weights on worker 0-0, policy_version 424135 (0.00087) [2022-07-09 21:17:29,517][26022] Updated weights on worker 0-0, policy_version 424145 (0.00089) [2022-07-09 21:17:30,428][25689] Fps is (10 sec: 5896.7, 60 sec: 5699.7, 300 sec: 5647.9). Total num frames: 434330624. Throughput: 0: 5113.0. Samples: 434324524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:30,429][25689] Avg episode reward: [(0, '-46.756')] [2022-07-09 21:17:31,154][26022] Updated weights on worker 0-0, policy_version 424155 (0.00083) [2022-07-09 21:17:32,541][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:17:32,556][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000424162_434341888.pth [2022-07-09 21:17:32,556][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000422175_432307200.pth [2022-07-09 21:17:32,995][26022] Updated weights on worker 0-0, policy_version 424165 (0.00078) [2022-07-09 21:17:35,060][26022] Updated weights on worker 0-0, policy_version 424175 (0.00081) [2022-07-09 21:17:35,436][25689] Fps is (10 sec: 5729.3, 60 sec: 5682.8, 300 sec: 5648.1). Total num frames: 434358272. Throughput: 0: 5976.8. Samples: 434358888. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:35,436][25689] Avg episode reward: [(0, '-47.337')] [2022-07-09 21:17:36,715][26022] Updated weights on worker 0-0, policy_version 424185 (0.00089) [2022-07-09 21:17:38,704][26022] Updated weights on worker 0-0, policy_version 424195 (0.00082) [2022-07-09 21:17:40,237][26022] Updated weights on worker 0-0, policy_version 424205 (0.00097) [2022-07-09 21:17:40,465][25689] Fps is (10 sec: 5611.0, 60 sec: 5650.5, 300 sec: 5649.2). Total num frames: 434386944. Throughput: 0: 5957.5. Samples: 434392990. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:40,466][25689] Avg episode reward: [(0, '-46.300')] [2022-07-09 21:17:42,195][26022] Updated weights on worker 0-0, policy_version 424215 (0.00112) [2022-07-09 21:17:43,814][26022] Updated weights on worker 0-0, policy_version 424225 (0.00077) [2022-07-09 21:17:45,514][25689] Fps is (10 sec: 5588.5, 60 sec: 5652.8, 300 sec: 5645.2). Total num frames: 434414592. Throughput: 0: 5089.5. Samples: 434409846. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:45,515][25689] Avg episode reward: [(0, '-46.676')] [2022-07-09 21:17:45,796][26022] Updated weights on worker 0-0, policy_version 424235 (0.00087) [2022-07-09 21:17:47,521][26022] Updated weights on worker 0-0, policy_version 424245 (0.00090) [2022-07-09 21:17:49,375][26022] Updated weights on worker 0-0, policy_version 424255 (0.00091) [2022-07-09 21:17:50,525][25689] Fps is (10 sec: 5700.5, 60 sec: 5669.1, 300 sec: 5651.9). Total num frames: 434444288. Throughput: 0: 5950.9. Samples: 434444118. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:50,525][25689] Avg episode reward: [(0, '-45.285')] [2022-07-09 21:17:51,234][26022] Updated weights on worker 0-0, policy_version 424265 (0.00087) [2022-07-09 21:17:52,918][26022] Updated weights on worker 0-0, policy_version 424275 (0.00086) [2022-07-09 21:17:54,718][26022] Updated weights on worker 0-0, policy_version 424285 (0.00087) [2022-07-09 21:17:55,530][25689] Fps is (10 sec: 5724.8, 60 sec: 5688.4, 300 sec: 5648.8). Total num frames: 434471936. Throughput: 0: 5955.9. Samples: 434478570. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 21:17:55,532][25689] Avg episode reward: [(0, '-45.716')] [2022-07-09 21:17:56,367][26022] Updated weights on worker 0-0, policy_version 424295 (0.00093) [2022-07-09 21:17:58,115][26022] Updated weights on worker 0-0, policy_version 424305 (0.00099) [2022-07-09 21:18:00,188][26022] Updated weights on worker 0-0, policy_version 424315 (0.00091) [2022-07-09 21:18:00,535][25689] Fps is (10 sec: 5728.4, 60 sec: 5655.5, 300 sec: 5654.5). Total num frames: 434501632. Throughput: 0: 5130.1. Samples: 434495950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:00,537][25689] Avg episode reward: [(0, '-45.495')] [2022-07-09 21:18:02,110][26022] Updated weights on worker 0-0, policy_version 424325 (0.00094) [2022-07-09 21:18:04,019][26022] Updated weights on worker 0-0, policy_version 424335 (0.00088) [2022-07-09 21:18:05,590][25689] Fps is (10 sec: 5598.7, 60 sec: 5671.1, 300 sec: 5650.4). Total num frames: 434528256. Throughput: 0: 5888.6. Samples: 434528066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:05,591][25689] Avg episode reward: [(0, '-45.694')] [2022-07-09 21:18:05,695][26022] Updated weights on worker 0-0, policy_version 424345 (0.00093) [2022-07-09 21:18:07,587][26022] Updated weights on worker 0-0, policy_version 424355 (0.00100) [2022-07-09 21:18:09,427][26022] Updated weights on worker 0-0, policy_version 424365 (0.00095) [2022-07-09 21:18:10,592][25689] Fps is (10 sec: 5396.5, 60 sec: 5654.5, 300 sec: 5647.0). Total num frames: 434555904. Throughput: 0: 5888.5. Samples: 434562284. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:10,594][25689] Avg episode reward: [(0, '-45.871')] [2022-07-09 21:18:11,012][26022] Updated weights on worker 0-0, policy_version 424375 (0.00091) [2022-07-09 21:18:12,985][26022] Updated weights on worker 0-0, policy_version 424385 (0.00097) [2022-07-09 21:18:14,700][26022] Updated weights on worker 0-0, policy_version 424395 (0.00085) [2022-07-09 21:18:15,598][25689] Fps is (10 sec: 5627.3, 60 sec: 5658.1, 300 sec: 5643.5). Total num frames: 434584576. Throughput: 0: 5021.4. Samples: 434579338. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:15,600][25689] Avg episode reward: [(0, '-46.489')] [2022-07-09 21:18:16,457][26022] Updated weights on worker 0-0, policy_version 424405 (0.00088) [2022-07-09 21:18:18,361][26022] Updated weights on worker 0-0, policy_version 424415 (0.00084) [2022-07-09 21:18:19,932][26022] Updated weights on worker 0-0, policy_version 424425 (0.00093) [2022-07-09 21:18:20,621][25689] Fps is (10 sec: 5820.1, 60 sec: 5692.2, 300 sec: 5654.3). Total num frames: 434614272. Throughput: 0: 5863.2. Samples: 434613716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:20,621][25689] Avg episode reward: [(0, '-46.333')] [2022-07-09 21:18:22,029][26022] Updated weights on worker 0-0, policy_version 424435 (0.00088) [2022-07-09 21:18:23,538][26022] Updated weights on worker 0-0, policy_version 424445 (0.00090) [2022-07-09 21:18:25,655][26022] Updated weights on worker 0-0, policy_version 424455 (0.00094) [2022-07-09 21:18:25,664][25689] Fps is (10 sec: 5696.8, 60 sec: 5660.8, 300 sec: 5646.9). Total num frames: 434641920. Throughput: 0: 5984.9. Samples: 434648210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:25,665][25689] Avg episode reward: [(0, '-46.682')] [2022-07-09 21:18:27,424][26022] Updated weights on worker 0-0, policy_version 424465 (0.00095) [2022-07-09 21:18:29,107][26022] Updated weights on worker 0-0, policy_version 424475 (0.00085) [2022-07-09 21:18:30,685][25689] Fps is (10 sec: 5494.3, 60 sec: 5625.0, 300 sec: 5643.4). Total num frames: 434669568. Throughput: 0: 5111.7. Samples: 434664992. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:30,685][25689] Avg episode reward: [(0, '-46.578')] [2022-07-09 21:18:31,175][26022] Updated weights on worker 0-0, policy_version 424485 (0.00094) [2022-07-09 21:18:32,795][26022] Updated weights on worker 0-0, policy_version 424495 (0.00087) [2022-07-09 21:18:34,607][26022] Updated weights on worker 0-0, policy_version 424505 (0.00081) [2022-07-09 21:18:35,719][25689] Fps is (10 sec: 5703.4, 60 sec: 5656.5, 300 sec: 5653.3). Total num frames: 434699264. Throughput: 0: 5944.3. Samples: 434698940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:35,719][25689] Avg episode reward: [(0, '-45.982')] [2022-07-09 21:18:36,518][26022] Updated weights on worker 0-0, policy_version 424515 (0.00082) [2022-07-09 21:18:38,143][26022] Updated weights on worker 0-0, policy_version 424525 (0.00083) [2022-07-09 21:18:40,044][26022] Updated weights on worker 0-0, policy_version 424535 (0.00086) [2022-07-09 21:18:40,754][25689] Fps is (10 sec: 5796.7, 60 sec: 5656.0, 300 sec: 5650.4). Total num frames: 434727936. Throughput: 0: 5939.3. Samples: 434733294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:40,755][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 21:18:41,703][26022] Updated weights on worker 0-0, policy_version 424545 (0.00096) [2022-07-09 21:18:43,640][26022] Updated weights on worker 0-0, policy_version 424555 (0.00091) [2022-07-09 21:18:45,346][26022] Updated weights on worker 0-0, policy_version 424565 (0.00086) [2022-07-09 21:18:45,830][25689] Fps is (10 sec: 5671.4, 60 sec: 5670.4, 300 sec: 5652.9). Total num frames: 434756608. Throughput: 0: 5066.4. Samples: 434750376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:45,830][25689] Avg episode reward: [(0, '-45.578')] [2022-07-09 21:18:47,278][26022] Updated weights on worker 0-0, policy_version 424575 (0.00088) [2022-07-09 21:18:48,845][26022] Updated weights on worker 0-0, policy_version 424585 (0.00094) [2022-07-09 21:18:50,861][25689] Fps is (10 sec: 5572.4, 60 sec: 5634.6, 300 sec: 5649.0). Total num frames: 434784256. Throughput: 0: 5917.0. Samples: 434784374. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:50,862][25689] Avg episode reward: [(0, '-45.274')] [2022-07-09 21:18:50,871][26022] Updated weights on worker 0-0, policy_version 424595 (0.00090) [2022-07-09 21:18:52,419][26022] Updated weights on worker 0-0, policy_version 424605 (0.00090) [2022-07-09 21:18:54,366][26022] Updated weights on worker 0-0, policy_version 424615 (0.00083) [2022-07-09 21:18:55,869][25689] Fps is (10 sec: 5712.0, 60 sec: 5668.3, 300 sec: 5646.0). Total num frames: 434813952. Throughput: 0: 5927.5. Samples: 434818380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:18:55,869][25689] Avg episode reward: [(0, '-44.736')] [2022-07-09 21:18:56,199][26022] Updated weights on worker 0-0, policy_version 424625 (0.00086) [2022-07-09 21:18:58,008][26022] Updated weights on worker 0-0, policy_version 424635 (0.00091) [2022-07-09 21:18:59,794][26022] Updated weights on worker 0-0, policy_version 424645 (0.00086) [2022-07-09 21:19:00,992][25689] Fps is (10 sec: 5761.4, 60 sec: 5640.3, 300 sec: 5656.1). Total num frames: 434842624. Throughput: 0: 5063.2. Samples: 434835762. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:00,993][25689] Avg episode reward: [(0, '-44.562')] [2022-07-09 21:19:01,899][26022] Updated weights on worker 0-0, policy_version 424655 (0.00086) [2022-07-09 21:19:03,693][26022] Updated weights on worker 0-0, policy_version 424665 (0.00095) [2022-07-09 21:19:05,839][26022] Updated weights on worker 0-0, policy_version 424675 (0.00087) [2022-07-09 21:19:06,071][25689] Fps is (10 sec: 5420.1, 60 sec: 5638.0, 300 sec: 5655.1). Total num frames: 434869248. Throughput: 0: 5804.1. Samples: 434867858. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:06,071][25689] Avg episode reward: [(0, '-45.127')] [2022-07-09 21:19:07,362][26022] Updated weights on worker 0-0, policy_version 424685 (0.00091) [2022-07-09 21:19:09,231][26022] Updated weights on worker 0-0, policy_version 424695 (0.00087) [2022-07-09 21:19:11,041][26022] Updated weights on worker 0-0, policy_version 424705 (0.00082) [2022-07-09 21:19:11,137][25689] Fps is (10 sec: 5551.3, 60 sec: 5665.8, 300 sec: 5647.1). Total num frames: 434898944. Throughput: 0: 5796.1. Samples: 434901898. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:11,138][25689] Avg episode reward: [(0, '-45.334')] [2022-07-09 21:19:12,838][26022] Updated weights on worker 0-0, policy_version 424715 (0.00088) [2022-07-09 21:19:14,561][26022] Updated weights on worker 0-0, policy_version 424725 (0.00090) [2022-07-09 21:19:16,178][25689] Fps is (10 sec: 5673.4, 60 sec: 5645.7, 300 sec: 5650.1). Total num frames: 434926592. Throughput: 0: 5804.9. Samples: 434936274. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:16,179][25689] Avg episode reward: [(0, '-45.344')] [2022-07-09 21:19:16,412][26022] Updated weights on worker 0-0, policy_version 424735 (0.00099) [2022-07-09 21:19:18,159][26022] Updated weights on worker 0-0, policy_version 424745 (0.00081) [2022-07-09 21:19:20,116][26022] Updated weights on worker 0-0, policy_version 424755 (0.00083) [2022-07-09 21:19:21,248][25689] Fps is (10 sec: 5671.7, 60 sec: 5641.3, 300 sec: 5654.2). Total num frames: 434956288. Throughput: 0: 5796.9. Samples: 434953184. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:21,249][25689] Avg episode reward: [(0, '-44.926')] [2022-07-09 21:19:21,550][26022] Updated weights on worker 0-0, policy_version 424765 (0.00087) [2022-07-09 21:19:23,655][26022] Updated weights on worker 0-0, policy_version 424775 (0.00095) [2022-07-09 21:19:25,130][26022] Updated weights on worker 0-0, policy_version 424785 (0.00093) [2022-07-09 21:19:26,398][25689] Fps is (10 sec: 5611.0, 60 sec: 5631.4, 300 sec: 5648.0). Total num frames: 434983936. Throughput: 0: 5880.9. Samples: 434987402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:26,400][25689] Avg episode reward: [(0, '-44.917')] [2022-07-09 21:19:27,197][26022] Updated weights on worker 0-0, policy_version 424795 (0.00089) [2022-07-09 21:19:28,991][26022] Updated weights on worker 0-0, policy_version 424805 (0.00088) [2022-07-09 21:19:30,791][26022] Updated weights on worker 0-0, policy_version 424815 (0.00085) [2022-07-09 21:19:31,474][25689] Fps is (10 sec: 5607.9, 60 sec: 5660.0, 300 sec: 5650.2). Total num frames: 435013632. Throughput: 0: 5879.6. Samples: 435021466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:31,474][25689] Avg episode reward: [(0, '-44.884')] [2022-07-09 21:19:32,564][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:19:32,578][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000424824_435019776.pth [2022-07-09 21:19:32,578][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000422838_432986112.pth [2022-07-09 21:19:32,723][26022] Updated weights on worker 0-0, policy_version 424825 (0.00088) [2022-07-09 21:19:34,204][26022] Updated weights on worker 0-0, policy_version 424835 (0.00086) [2022-07-09 21:19:36,150][26022] Updated weights on worker 0-0, policy_version 424845 (0.00092) [2022-07-09 21:19:36,496][25689] Fps is (10 sec: 5881.4, 60 sec: 5661.0, 300 sec: 5654.5). Total num frames: 435043328. Throughput: 0: 5046.3. Samples: 435038806. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:36,497][25689] Avg episode reward: [(0, '-45.144')] [2022-07-09 21:19:38,043][26022] Updated weights on worker 0-0, policy_version 424855 (0.00088) [2022-07-09 21:19:39,616][26022] Updated weights on worker 0-0, policy_version 424865 (0.00089) [2022-07-09 21:19:41,534][25689] Fps is (10 sec: 5700.1, 60 sec: 5644.0, 300 sec: 5651.7). Total num frames: 435070976. Throughput: 0: 5927.2. Samples: 435073422. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:41,535][25689] Avg episode reward: [(0, '-44.936')] [2022-07-09 21:19:41,607][26022] Updated weights on worker 0-0, policy_version 424875 (0.00082) [2022-07-09 21:19:43,141][26022] Updated weights on worker 0-0, policy_version 424885 (0.00088) [2022-07-09 21:19:45,212][26022] Updated weights on worker 0-0, policy_version 424895 (0.00088) [2022-07-09 21:19:46,608][25689] Fps is (10 sec: 5671.0, 60 sec: 5660.9, 300 sec: 5654.0). Total num frames: 435100672. Throughput: 0: 5944.5. Samples: 435107540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:46,609][25689] Avg episode reward: [(0, '-45.429')] [2022-07-09 21:19:46,887][26022] Updated weights on worker 0-0, policy_version 424905 (0.00091) [2022-07-09 21:19:48,805][26022] Updated weights on worker 0-0, policy_version 424915 (0.00091) [2022-07-09 21:19:50,283][26022] Updated weights on worker 0-0, policy_version 424925 (0.00085) [2022-07-09 21:19:51,671][25689] Fps is (10 sec: 5858.7, 60 sec: 5691.7, 300 sec: 5653.2). Total num frames: 435130368. Throughput: 0: 5112.2. Samples: 435124720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:51,673][25689] Avg episode reward: [(0, '-47.047')] [2022-07-09 21:19:52,515][26022] Updated weights on worker 0-0, policy_version 424935 (0.00086) [2022-07-09 21:19:54,075][26022] Updated weights on worker 0-0, policy_version 424945 (0.00091) [2022-07-09 21:19:55,982][26022] Updated weights on worker 0-0, policy_version 424955 (0.00084) [2022-07-09 21:19:56,718][25689] Fps is (10 sec: 5672.1, 60 sec: 5654.4, 300 sec: 5654.1). Total num frames: 435158016. Throughput: 0: 5937.9. Samples: 435158880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:19:56,720][25689] Avg episode reward: [(0, '-47.666')] [2022-07-09 21:19:57,586][26022] Updated weights on worker 0-0, policy_version 424965 (0.00083) [2022-07-09 21:19:59,548][26022] Updated weights on worker 0-0, policy_version 424975 (0.00081) [2022-07-09 21:20:01,199][26022] Updated weights on worker 0-0, policy_version 424985 (0.00084) [2022-07-09 21:20:01,809][25689] Fps is (10 sec: 5555.4, 60 sec: 5657.4, 300 sec: 5654.1). Total num frames: 435186688. Throughput: 0: 5904.0. Samples: 435193128. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:01,810][25689] Avg episode reward: [(0, '-48.335')] [2022-07-09 21:20:03,352][26022] Updated weights on worker 0-0, policy_version 424995 (0.00080) [2022-07-09 21:20:05,189][26022] Updated weights on worker 0-0, policy_version 425005 (0.00095) [2022-07-09 21:20:06,872][25689] Fps is (10 sec: 5546.6, 60 sec: 5675.7, 300 sec: 5654.6). Total num frames: 435214336. Throughput: 0: 4969.6. Samples: 435208244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:06,874][25689] Avg episode reward: [(0, '-48.814')] [2022-07-09 21:20:07,168][26022] Updated weights on worker 0-0, policy_version 425015 (0.00086) [2022-07-09 21:20:08,733][26022] Updated weights on worker 0-0, policy_version 425025 (0.00089) [2022-07-09 21:20:10,663][26022] Updated weights on worker 0-0, policy_version 425035 (0.00093) [2022-07-09 21:20:11,890][25689] Fps is (10 sec: 5586.5, 60 sec: 5663.3, 300 sec: 5658.2). Total num frames: 435243008. Throughput: 0: 5825.9. Samples: 435242516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:11,891][25689] Avg episode reward: [(0, '-48.484')] [2022-07-09 21:20:12,333][26022] Updated weights on worker 0-0, policy_version 425045 (0.00089) [2022-07-09 21:20:14,178][26022] Updated weights on worker 0-0, policy_version 425055 (0.00104) [2022-07-09 21:20:16,036][26022] Updated weights on worker 0-0, policy_version 425065 (0.00089) [2022-07-09 21:20:16,939][25689] Fps is (10 sec: 5594.6, 60 sec: 5662.6, 300 sec: 5647.1). Total num frames: 435270656. Throughput: 0: 5827.3. Samples: 435276712. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:16,939][25689] Avg episode reward: [(0, '-48.121')] [2022-07-09 21:20:17,675][26022] Updated weights on worker 0-0, policy_version 425075 (0.00085) [2022-07-09 21:20:19,597][26022] Updated weights on worker 0-0, policy_version 425085 (0.01151) [2022-07-09 21:20:21,591][26022] Updated weights on worker 0-0, policy_version 425095 (0.00079) [2022-07-09 21:20:21,949][25689] Fps is (10 sec: 5599.5, 60 sec: 5651.3, 300 sec: 5652.1). Total num frames: 435299328. Throughput: 0: 5000.4. Samples: 435293834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:21,950][25689] Avg episode reward: [(0, '-47.620')] [2022-07-09 21:20:23,130][26022] Updated weights on worker 0-0, policy_version 425105 (0.00088) [2022-07-09 21:20:25,108][26022] Updated weights on worker 0-0, policy_version 425115 (0.00082) [2022-07-09 21:20:26,574][26022] Updated weights on worker 0-0, policy_version 425125 (0.00093) [2022-07-09 21:20:27,060][25689] Fps is (10 sec: 5767.0, 60 sec: 5688.7, 300 sec: 5654.0). Total num frames: 435329024. Throughput: 0: 5936.4. Samples: 435328088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:27,062][25689] Avg episode reward: [(0, '-47.131')] [2022-07-09 21:20:28,499][26022] Updated weights on worker 0-0, policy_version 425135 (0.00083) [2022-07-09 21:20:30,420][26022] Updated weights on worker 0-0, policy_version 425145 (0.00090) [2022-07-09 21:20:32,140][25689] Fps is (10 sec: 5726.9, 60 sec: 5671.4, 300 sec: 5652.9). Total num frames: 435357696. Throughput: 0: 5912.4. Samples: 435362242. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:32,141][25689] Avg episode reward: [(0, '-46.955')] [2022-07-09 21:20:32,279][26022] Updated weights on worker 0-0, policy_version 425155 (0.00100) [2022-07-09 21:20:33,914][26022] Updated weights on worker 0-0, policy_version 425165 (0.00093) [2022-07-09 21:20:35,591][26022] Updated weights on worker 0-0, policy_version 425175 (0.00094) [2022-07-09 21:20:37,148][25689] Fps is (10 sec: 5785.6, 60 sec: 5672.8, 300 sec: 5656.8). Total num frames: 435387392. Throughput: 0: 5082.0. Samples: 435379416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 17.0) [2022-07-09 21:20:37,150][25689] Avg episode reward: [(0, '-47.196')] [2022-07-09 21:20:37,340][26022] Updated weights on worker 0-0, policy_version 425185 (0.00090) [2022-07-09 21:20:39,404][26022] Updated weights on worker 0-0, policy_version 425195 (0.00089) [2022-07-09 21:20:41,089][26022] Updated weights on worker 0-0, policy_version 425205 (0.00090) [2022-07-09 21:20:42,232][25689] Fps is (10 sec: 5682.5, 60 sec: 5668.5, 300 sec: 5650.7). Total num frames: 435415040. Throughput: 0: 5906.4. Samples: 435413636. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:20:42,232][25689] Avg episode reward: [(0, '-47.569')] [2022-07-09 21:20:43,145][26022] Updated weights on worker 0-0, policy_version 425215 (0.00090) [2022-07-09 21:20:44,623][26022] Updated weights on worker 0-0, policy_version 425225 (0.00087) [2022-07-09 21:20:46,565][26022] Updated weights on worker 0-0, policy_version 425235 (0.00100) [2022-07-09 21:20:47,284][25689] Fps is (10 sec: 5657.7, 60 sec: 5670.5, 300 sec: 5660.2). Total num frames: 435444736. Throughput: 0: 5928.1. Samples: 435447978. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:20:47,284][25689] Avg episode reward: [(0, '-47.937')] [2022-07-09 21:20:48,313][26022] Updated weights on worker 0-0, policy_version 425245 (0.00087) [2022-07-09 21:20:50,319][26022] Updated weights on worker 0-0, policy_version 425255 (0.00090) [2022-07-09 21:20:51,908][26022] Updated weights on worker 0-0, policy_version 425265 (0.00085) [2022-07-09 21:20:52,299][25689] Fps is (10 sec: 5696.3, 60 sec: 5641.3, 300 sec: 5656.7). Total num frames: 435472384. Throughput: 0: 5931.4. Samples: 435481808. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:20:52,299][25689] Avg episode reward: [(0, '-48.139')] [2022-07-09 21:20:53,892][26022] Updated weights on worker 0-0, policy_version 425275 (0.00104) [2022-07-09 21:20:55,432][26022] Updated weights on worker 0-0, policy_version 425285 (0.00088) [2022-07-09 21:20:57,302][25689] Fps is (10 sec: 5621.8, 60 sec: 5662.2, 300 sec: 5660.3). Total num frames: 435501056. Throughput: 0: 5939.5. Samples: 435499118. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:20:57,303][25689] Avg episode reward: [(0, '-47.847')] [2022-07-09 21:20:57,368][26022] Updated weights on worker 0-0, policy_version 425295 (0.00087) [2022-07-09 21:20:59,287][26022] Updated weights on worker 0-0, policy_version 425305 (0.00083) [2022-07-09 21:21:01,003][26022] Updated weights on worker 0-0, policy_version 425315 (0.00089) [2022-07-09 21:21:02,323][25689] Fps is (10 sec: 5516.4, 60 sec: 5635.0, 300 sec: 5659.2). Total num frames: 435527680. Throughput: 0: 5941.5. Samples: 435533006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:02,324][25689] Avg episode reward: [(0, '-47.022')] [2022-07-09 21:21:03,280][26022] Updated weights on worker 0-0, policy_version 425325 (0.00083) [2022-07-09 21:21:05,146][26022] Updated weights on worker 0-0, policy_version 425335 (0.00084) [2022-07-09 21:21:06,756][26022] Updated weights on worker 0-0, policy_version 425345 (0.00086) [2022-07-09 21:21:07,402][25689] Fps is (10 sec: 5474.7, 60 sec: 5650.3, 300 sec: 5661.4). Total num frames: 435556352. Throughput: 0: 5814.2. Samples: 435564950. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:07,403][25689] Avg episode reward: [(0, '-45.251')] [2022-07-09 21:21:08,808][26022] Updated weights on worker 0-0, policy_version 425355 (0.00100) [2022-07-09 21:21:10,237][26022] Updated weights on worker 0-0, policy_version 425365 (0.00088) [2022-07-09 21:21:12,369][26022] Updated weights on worker 0-0, policy_version 425375 (0.00086) [2022-07-09 21:21:12,455][25689] Fps is (10 sec: 5558.5, 60 sec: 5630.2, 300 sec: 5660.7). Total num frames: 435584000. Throughput: 0: 4961.5. Samples: 435581814. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:12,455][25689] Avg episode reward: [(0, '-45.359')] [2022-07-09 21:21:14,113][26022] Updated weights on worker 0-0, policy_version 425385 (0.00092) [2022-07-09 21:21:15,954][26022] Updated weights on worker 0-0, policy_version 425395 (0.00090) [2022-07-09 21:21:17,461][25689] Fps is (10 sec: 5599.0, 60 sec: 5651.1, 300 sec: 5654.4). Total num frames: 435612672. Throughput: 0: 5803.3. Samples: 435616108. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:17,462][25689] Avg episode reward: [(0, '-44.697')] [2022-07-09 21:21:17,723][26022] Updated weights on worker 0-0, policy_version 425405 (0.00088) [2022-07-09 21:21:19,519][26022] Updated weights on worker 0-0, policy_version 425415 (0.00086) [2022-07-09 21:21:21,324][26022] Updated weights on worker 0-0, policy_version 425425 (0.00091) [2022-07-09 21:21:22,465][25689] Fps is (10 sec: 5626.5, 60 sec: 5634.7, 300 sec: 5656.8). Total num frames: 435640320. Throughput: 0: 5798.7. Samples: 435649804. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:22,465][25689] Avg episode reward: [(0, '-45.028')] [2022-07-09 21:21:23,100][26022] Updated weights on worker 0-0, policy_version 425435 (0.00087) [2022-07-09 21:21:24,908][26022] Updated weights on worker 0-0, policy_version 425445 (0.00092) [2022-07-09 21:21:26,857][26022] Updated weights on worker 0-0, policy_version 425455 (0.00093) [2022-07-09 21:21:27,542][25689] Fps is (10 sec: 5587.1, 60 sec: 5621.0, 300 sec: 5655.6). Total num frames: 435668992. Throughput: 0: 5063.0. Samples: 435666916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:27,542][25689] Avg episode reward: [(0, '-45.577')] [2022-07-09 21:21:28,487][26022] Updated weights on worker 0-0, policy_version 425465 (0.00090) [2022-07-09 21:21:30,517][26022] Updated weights on worker 0-0, policy_version 425475 (0.00093) [2022-07-09 21:21:32,099][26022] Updated weights on worker 0-0, policy_version 425485 (0.00101) [2022-07-09 21:21:32,614][25689] Fps is (10 sec: 5650.3, 60 sec: 5621.8, 300 sec: 5654.4). Total num frames: 435697664. Throughput: 0: 5911.4. Samples: 435700980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:32,614][25689] Avg episode reward: [(0, '-45.519')] [2022-07-09 21:21:32,623][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:21:32,637][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000425487_435698688.pth [2022-07-09 21:21:32,637][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000423499_433662976.pth [2022-07-09 21:21:34,208][26022] Updated weights on worker 0-0, policy_version 425495 (0.00095) [2022-07-09 21:21:35,723][26022] Updated weights on worker 0-0, policy_version 425505 (0.00091) [2022-07-09 21:21:37,611][26022] Updated weights on worker 0-0, policy_version 425515 (0.00090) [2022-07-09 21:21:37,663][25689] Fps is (10 sec: 5766.8, 60 sec: 5618.0, 300 sec: 5650.9). Total num frames: 435727360. Throughput: 0: 5897.8. Samples: 435735254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:37,664][25689] Avg episode reward: [(0, '-45.738')] [2022-07-09 21:21:39,332][26022] Updated weights on worker 0-0, policy_version 425525 (0.00082) [2022-07-09 21:21:41,215][26022] Updated weights on worker 0-0, policy_version 425535 (0.00084) [2022-07-09 21:21:42,675][25689] Fps is (10 sec: 5801.1, 60 sec: 5641.5, 300 sec: 5655.5). Total num frames: 435756032. Throughput: 0: 5074.8. Samples: 435752368. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:42,676][25689] Avg episode reward: [(0, '-45.826')] [2022-07-09 21:21:42,944][26022] Updated weights on worker 0-0, policy_version 425545 (0.00087) [2022-07-09 21:21:44,741][26022] Updated weights on worker 0-0, policy_version 425555 (0.00101) [2022-07-09 21:21:46,598][26022] Updated weights on worker 0-0, policy_version 425565 (0.00085) [2022-07-09 21:21:47,780][25689] Fps is (10 sec: 5668.2, 60 sec: 5619.7, 300 sec: 5653.6). Total num frames: 435784704. Throughput: 0: 5931.6. Samples: 435786960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:47,780][25689] Avg episode reward: [(0, '-46.825')] [2022-07-09 21:21:48,173][26022] Updated weights on worker 0-0, policy_version 425575 (0.00088) [2022-07-09 21:21:50,165][26022] Updated weights on worker 0-0, policy_version 425585 (0.00087) [2022-07-09 21:21:52,072][26022] Updated weights on worker 0-0, policy_version 425595 (0.00086) [2022-07-09 21:21:52,790][25689] Fps is (10 sec: 5669.5, 60 sec: 5637.1, 300 sec: 5660.9). Total num frames: 435813376. Throughput: 0: 5944.4. Samples: 435820914. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:52,791][25689] Avg episode reward: [(0, '-46.772')] [2022-07-09 21:21:53,642][26022] Updated weights on worker 0-0, policy_version 425605 (0.00090) [2022-07-09 21:21:55,783][26022] Updated weights on worker 0-0, policy_version 425615 (0.00086) [2022-07-09 21:21:57,237][26022] Updated weights on worker 0-0, policy_version 425625 (0.00095) [2022-07-09 21:21:57,824][25689] Fps is (10 sec: 5811.1, 60 sec: 5651.1, 300 sec: 5653.7). Total num frames: 435843072. Throughput: 0: 5096.6. Samples: 435838006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:21:57,825][25689] Avg episode reward: [(0, '-47.153')] [2022-07-09 21:21:59,317][26022] Updated weights on worker 0-0, policy_version 425635 (0.00083) [2022-07-09 21:22:00,642][26022] Updated weights on worker 0-0, policy_version 425645 (0.00093) [2022-07-09 21:22:02,840][25689] Fps is (10 sec: 5399.9, 60 sec: 5617.7, 300 sec: 5650.7). Total num frames: 435867648. Throughput: 0: 5930.6. Samples: 435871958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:02,842][25689] Avg episode reward: [(0, '-47.035')] [2022-07-09 21:22:03,247][26022] Updated weights on worker 0-0, policy_version 425655 (0.00083) [2022-07-09 21:22:04,914][26022] Updated weights on worker 0-0, policy_version 425665 (0.00088) [2022-07-09 21:22:06,831][26022] Updated weights on worker 0-0, policy_version 425675 (0.00087) [2022-07-09 21:22:07,899][25689] Fps is (10 sec: 5386.8, 60 sec: 5636.6, 300 sec: 5653.1). Total num frames: 435897344. Throughput: 0: 5813.8. Samples: 435903928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:07,899][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 21:22:08,447][26022] Updated weights on worker 0-0, policy_version 425685 (0.00095) [2022-07-09 21:22:10,523][26022] Updated weights on worker 0-0, policy_version 425695 (0.00087) [2022-07-09 21:22:12,010][26022] Updated weights on worker 0-0, policy_version 425705 (0.00093) [2022-07-09 21:22:12,964][25689] Fps is (10 sec: 5765.6, 60 sec: 5652.3, 300 sec: 5652.7). Total num frames: 435926016. Throughput: 0: 4956.4. Samples: 435920904. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:12,964][25689] Avg episode reward: [(0, '-46.792')] [2022-07-09 21:22:14,203][26022] Updated weights on worker 0-0, policy_version 425715 (0.00091) [2022-07-09 21:22:15,434][26022] Updated weights on worker 0-0, policy_version 425725 (0.00089) [2022-07-09 21:22:17,755][26022] Updated weights on worker 0-0, policy_version 425735 (0.00086) [2022-07-09 21:22:17,991][25689] Fps is (10 sec: 5682.2, 60 sec: 5650.4, 300 sec: 5656.1). Total num frames: 435954688. Throughput: 0: 5809.1. Samples: 435955156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:17,991][25689] Avg episode reward: [(0, '-46.740')] [2022-07-09 21:22:19,517][26022] Updated weights on worker 0-0, policy_version 425745 (0.00094) [2022-07-09 21:22:21,222][26022] Updated weights on worker 0-0, policy_version 425755 (0.00066) [2022-07-09 21:22:22,925][26022] Updated weights on worker 0-0, policy_version 425765 (0.00093) [2022-07-09 21:22:23,011][25689] Fps is (10 sec: 5707.6, 60 sec: 5665.8, 300 sec: 5653.6). Total num frames: 435983360. Throughput: 0: 5810.5. Samples: 435989158. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:23,011][25689] Avg episode reward: [(0, '-45.807')] [2022-07-09 21:22:24,754][26022] Updated weights on worker 0-0, policy_version 425775 (0.00081) [2022-07-09 21:22:26,594][26022] Updated weights on worker 0-0, policy_version 425785 (0.00093) [2022-07-09 21:22:28,110][25689] Fps is (10 sec: 5666.6, 60 sec: 5663.7, 300 sec: 5648.3). Total num frames: 436012032. Throughput: 0: 5062.7. Samples: 436006252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:28,111][25689] Avg episode reward: [(0, '-45.628')] [2022-07-09 21:22:28,555][26022] Updated weights on worker 0-0, policy_version 425795 (0.00084) [2022-07-09 21:22:30,304][26022] Updated weights on worker 0-0, policy_version 425805 (0.00090) [2022-07-09 21:22:32,011][26022] Updated weights on worker 0-0, policy_version 425815 (0.00110) [2022-07-09 21:22:33,163][25689] Fps is (10 sec: 5648.4, 60 sec: 5665.5, 300 sec: 5650.9). Total num frames: 436040704. Throughput: 0: 5918.1. Samples: 436040444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:33,163][25689] Avg episode reward: [(0, '-45.784')] [2022-07-09 21:22:33,787][26022] Updated weights on worker 0-0, policy_version 425825 (0.00082) [2022-07-09 21:22:35,905][26022] Updated weights on worker 0-0, policy_version 425835 (0.00087) [2022-07-09 21:22:37,459][26022] Updated weights on worker 0-0, policy_version 425845 (0.00092) [2022-07-09 21:22:38,222][25689] Fps is (10 sec: 5670.9, 60 sec: 5647.6, 300 sec: 5650.4). Total num frames: 436069376. Throughput: 0: 5898.6. Samples: 436074494. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:38,223][25689] Avg episode reward: [(0, '-46.435')] [2022-07-09 21:22:39,410][26022] Updated weights on worker 0-0, policy_version 425855 (0.00082) [2022-07-09 21:22:41,093][26022] Updated weights on worker 0-0, policy_version 425865 (0.00083) [2022-07-09 21:22:42,760][26022] Updated weights on worker 0-0, policy_version 425875 (0.00082) [2022-07-09 21:22:43,258][25689] Fps is (10 sec: 5680.2, 60 sec: 5645.4, 300 sec: 5654.0). Total num frames: 436098048. Throughput: 0: 5062.5. Samples: 436091662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:43,259][25689] Avg episode reward: [(0, '-45.865')] [2022-07-09 21:22:44,709][26022] Updated weights on worker 0-0, policy_version 425885 (0.00087) [2022-07-09 21:22:46,523][26022] Updated weights on worker 0-0, policy_version 425895 (0.00089) [2022-07-09 21:22:48,327][26022] Updated weights on worker 0-0, policy_version 425905 (0.00087) [2022-07-09 21:22:48,331][25689] Fps is (10 sec: 5673.0, 60 sec: 5648.4, 300 sec: 5649.4). Total num frames: 436126720. Throughput: 0: 5912.9. Samples: 436125812. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:48,331][25689] Avg episode reward: [(0, '-45.874')] [2022-07-09 21:22:50,087][26022] Updated weights on worker 0-0, policy_version 425915 (0.00087) [2022-07-09 21:22:51,857][26022] Updated weights on worker 0-0, policy_version 425925 (0.00087) [2022-07-09 21:22:53,342][25689] Fps is (10 sec: 5585.1, 60 sec: 5631.3, 300 sec: 5649.3). Total num frames: 436154368. Throughput: 0: 5925.0. Samples: 436160006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:53,343][25689] Avg episode reward: [(0, '-46.156')] [2022-07-09 21:22:53,706][26022] Updated weights on worker 0-0, policy_version 425935 (0.00087) [2022-07-09 21:22:55,410][26022] Updated weights on worker 0-0, policy_version 425945 (0.00090) [2022-07-09 21:22:57,060][26022] Updated weights on worker 0-0, policy_version 425955 (0.00084) [2022-07-09 21:22:58,396][25689] Fps is (10 sec: 5798.6, 60 sec: 5646.4, 300 sec: 5651.8). Total num frames: 436185088. Throughput: 0: 5091.2. Samples: 436177204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:22:58,397][25689] Avg episode reward: [(0, '-45.450')] [2022-07-09 21:22:59,087][26022] Updated weights on worker 0-0, policy_version 425965 (0.00115) [2022-07-09 21:23:00,864][26022] Updated weights on worker 0-0, policy_version 425975 (0.00087) [2022-07-09 21:23:03,034][26022] Updated weights on worker 0-0, policy_version 425985 (0.00083) [2022-07-09 21:23:03,432][25689] Fps is (10 sec: 5582.1, 60 sec: 5661.5, 300 sec: 5648.7). Total num frames: 436210688. Throughput: 0: 5936.2. Samples: 436211414. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:23:03,433][25689] Avg episode reward: [(0, '-45.800')] [2022-07-09 21:23:04,682][26022] Updated weights on worker 0-0, policy_version 425995 (0.00094) [2022-07-09 21:23:06,555][26022] Updated weights on worker 0-0, policy_version 426005 (0.00090) [2022-07-09 21:23:08,335][26022] Updated weights on worker 0-0, policy_version 426015 (0.00118) [2022-07-09 21:23:08,587][25689] Fps is (10 sec: 5426.2, 60 sec: 5652.5, 300 sec: 5652.7). Total num frames: 436240384. Throughput: 0: 5831.4. Samples: 436243936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:23:08,589][25689] Avg episode reward: [(0, '-45.337')] [2022-07-09 21:23:10,284][26022] Updated weights on worker 0-0, policy_version 426025 (0.00092) [2022-07-09 21:23:11,966][26022] Updated weights on worker 0-0, policy_version 426035 (0.00086) [2022-07-09 21:23:13,646][25689] Fps is (10 sec: 5614.0, 60 sec: 5636.2, 300 sec: 5648.3). Total num frames: 436268032. Throughput: 0: 5807.1. Samples: 436277912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:23:13,646][25689] Avg episode reward: [(0, '-45.384')] [2022-07-09 21:23:13,959][26022] Updated weights on worker 0-0, policy_version 426045 (0.00087) [2022-07-09 21:23:15,613][26022] Updated weights on worker 0-0, policy_version 426055 (0.00089) [2022-07-09 21:23:17,408][26022] Updated weights on worker 0-0, policy_version 426065 (0.00091) [2022-07-09 21:23:18,701][25689] Fps is (10 sec: 5669.7, 60 sec: 5650.4, 300 sec: 5647.7). Total num frames: 436297728. Throughput: 0: 5805.0. Samples: 436295072. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:23:18,707][25689] Avg episode reward: [(0, '-46.422')] [2022-07-09 21:23:19,213][26022] Updated weights on worker 0-0, policy_version 426075 (0.00087) [2022-07-09 21:23:21,161][26022] Updated weights on worker 0-0, policy_version 426085 (0.00497) [2022-07-09 21:23:22,756][26022] Updated weights on worker 0-0, policy_version 426095 (0.00089) [2022-07-09 21:23:23,752][25689] Fps is (10 sec: 5876.8, 60 sec: 5664.4, 300 sec: 5654.4). Total num frames: 436327424. Throughput: 0: 5788.2. Samples: 436329034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:23:23,753][25689] Avg episode reward: [(0, '-46.712')] [2022-07-09 21:23:24,931][26022] Updated weights on worker 0-0, policy_version 426105 (0.00087) [2022-07-09 21:23:26,324][26022] Updated weights on worker 0-0, policy_version 426115 (0.00086) [2022-07-09 21:23:28,658][26022] Updated weights on worker 0-0, policy_version 426125 (0.00090) [2022-07-09 21:23:28,828][25689] Fps is (10 sec: 5460.4, 60 sec: 5616.0, 300 sec: 5646.5). Total num frames: 436353024. Throughput: 0: 5888.5. Samples: 436363124. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:28,828][25689] Avg episode reward: [(0, '-47.241')] [2022-07-09 21:23:29,927][26022] Updated weights on worker 0-0, policy_version 426135 (0.00100) [2022-07-09 21:23:32,039][26022] Updated weights on worker 0-0, policy_version 426145 (0.00096) [2022-07-09 21:23:32,657][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:23:32,675][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000426149_436376576.pth [2022-07-09 21:23:32,675][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000424162_434341888.pth [2022-07-09 21:23:33,531][26022] Updated weights on worker 0-0, policy_version 426155 (0.00087) [2022-07-09 21:23:33,829][25689] Fps is (10 sec: 5589.0, 60 sec: 5654.5, 300 sec: 5650.5). Total num frames: 436383744. Throughput: 0: 5061.7. Samples: 436380072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:33,830][25689] Avg episode reward: [(0, '-47.223')] [2022-07-09 21:23:35,580][26022] Updated weights on worker 0-0, policy_version 426165 (0.00095) [2022-07-09 21:23:37,361][26022] Updated weights on worker 0-0, policy_version 426175 (0.00083) [2022-07-09 21:23:38,837][25689] Fps is (10 sec: 5729.2, 60 sec: 5625.6, 300 sec: 5644.2). Total num frames: 436410368. Throughput: 0: 5909.0. Samples: 436414054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:38,837][25689] Avg episode reward: [(0, '-46.717')] [2022-07-09 21:23:39,025][26022] Updated weights on worker 0-0, policy_version 426185 (0.00086) [2022-07-09 21:23:40,984][26022] Updated weights on worker 0-0, policy_version 426195 (0.00107) [2022-07-09 21:23:42,687][26022] Updated weights on worker 0-0, policy_version 426205 (0.00094) [2022-07-09 21:23:43,841][25689] Fps is (10 sec: 5523.3, 60 sec: 5628.6, 300 sec: 5645.5). Total num frames: 436439040. Throughput: 0: 5927.3. Samples: 436448104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:43,841][25689] Avg episode reward: [(0, '-45.903')] [2022-07-09 21:23:44,527][26022] Updated weights on worker 0-0, policy_version 426215 (0.00083) [2022-07-09 21:23:46,446][26022] Updated weights on worker 0-0, policy_version 426225 (0.00047) [2022-07-09 21:23:48,229][26022] Updated weights on worker 0-0, policy_version 426235 (0.00089) [2022-07-09 21:23:48,887][25689] Fps is (10 sec: 5909.3, 60 sec: 5664.8, 300 sec: 5655.6). Total num frames: 436469760. Throughput: 0: 5086.7. Samples: 436465162. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:48,888][25689] Avg episode reward: [(0, '-45.715')] [2022-07-09 21:23:50,025][26022] Updated weights on worker 0-0, policy_version 426245 (0.00088) [2022-07-09 21:23:52,082][26022] Updated weights on worker 0-0, policy_version 426255 (0.00094) [2022-07-09 21:23:53,671][26022] Updated weights on worker 0-0, policy_version 426265 (0.00093) [2022-07-09 21:23:53,923][25689] Fps is (10 sec: 5687.7, 60 sec: 5645.6, 300 sec: 5644.7). Total num frames: 436496384. Throughput: 0: 5910.9. Samples: 436498844. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:53,923][25689] Avg episode reward: [(0, '-45.978')] [2022-07-09 21:23:55,408][26022] Updated weights on worker 0-0, policy_version 426275 (0.00087) [2022-07-09 21:23:57,276][26022] Updated weights on worker 0-0, policy_version 426285 (0.00088) [2022-07-09 21:23:58,878][26022] Updated weights on worker 0-0, policy_version 426295 (0.00083) [2022-07-09 21:23:58,940][25689] Fps is (10 sec: 5602.6, 60 sec: 5632.2, 300 sec: 5650.2). Total num frames: 436526080. Throughput: 0: 5917.5. Samples: 436533014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:23:58,941][25689] Avg episode reward: [(0, '-45.829')] [2022-07-09 21:24:01,023][26022] Updated weights on worker 0-0, policy_version 426305 (0.00089) [2022-07-09 21:24:03,002][26022] Updated weights on worker 0-0, policy_version 426315 (0.00096) [2022-07-09 21:24:03,944][25689] Fps is (10 sec: 5619.8, 60 sec: 5652.0, 300 sec: 5651.6). Total num frames: 436552704. Throughput: 0: 5007.6. Samples: 436548776. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:03,945][25689] Avg episode reward: [(0, '-46.138')] [2022-07-09 21:24:04,978][26022] Updated weights on worker 0-0, policy_version 426325 (0.00093) [2022-07-09 21:24:06,489][26022] Updated weights on worker 0-0, policy_version 426335 (0.00085) [2022-07-09 21:24:08,530][26022] Updated weights on worker 0-0, policy_version 426345 (0.00089) [2022-07-09 21:24:09,063][25689] Fps is (10 sec: 5361.3, 60 sec: 5621.5, 300 sec: 5643.7). Total num frames: 436580352. Throughput: 0: 5817.0. Samples: 436582522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:09,063][25689] Avg episode reward: [(0, '-46.178')] [2022-07-09 21:24:10,160][26022] Updated weights on worker 0-0, policy_version 426355 (0.00091) [2022-07-09 21:24:12,059][26022] Updated weights on worker 0-0, policy_version 426365 (0.00094) [2022-07-09 21:24:13,824][26022] Updated weights on worker 0-0, policy_version 426375 (0.00088) [2022-07-09 21:24:14,075][25689] Fps is (10 sec: 5660.6, 60 sec: 5659.8, 300 sec: 5651.1). Total num frames: 436610048. Throughput: 0: 5846.2. Samples: 436616658. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:14,076][25689] Avg episode reward: [(0, '-46.879')] [2022-07-09 21:24:15,586][26022] Updated weights on worker 0-0, policy_version 426385 (0.00085) [2022-07-09 21:24:17,363][26022] Updated weights on worker 0-0, policy_version 426395 (0.00077) [2022-07-09 21:24:19,099][25689] Fps is (10 sec: 5713.7, 60 sec: 5628.8, 300 sec: 5645.1). Total num frames: 436637696. Throughput: 0: 4992.5. Samples: 436633658. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:19,100][25689] Avg episode reward: [(0, '-46.545')] [2022-07-09 21:24:19,203][26022] Updated weights on worker 0-0, policy_version 426405 (0.00096) [2022-07-09 21:24:21,066][26022] Updated weights on worker 0-0, policy_version 426415 (0.00090) [2022-07-09 21:24:22,815][26022] Updated weights on worker 0-0, policy_version 426425 (0.00093) [2022-07-09 21:24:24,138][25689] Fps is (10 sec: 5393.1, 60 sec: 5579.0, 300 sec: 5643.8). Total num frames: 436664320. Throughput: 0: 5867.3. Samples: 436667258. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:24,139][25689] Avg episode reward: [(0, '-46.211')] [2022-07-09 21:24:24,764][26022] Updated weights on worker 0-0, policy_version 426435 (0.00084) [2022-07-09 21:24:26,486][26022] Updated weights on worker 0-0, policy_version 426445 (0.00092) [2022-07-09 21:24:28,537][26022] Updated weights on worker 0-0, policy_version 426455 (0.00098) [2022-07-09 21:24:29,201][25689] Fps is (10 sec: 5676.3, 60 sec: 5665.0, 300 sec: 5647.5). Total num frames: 436695040. Throughput: 0: 5869.3. Samples: 436700722. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:29,202][25689] Avg episode reward: [(0, '-46.112')] [2022-07-09 21:24:30,414][26022] Updated weights on worker 0-0, policy_version 426465 (0.00091) [2022-07-09 21:24:32,063][26022] Updated weights on worker 0-0, policy_version 426475 (0.00089) [2022-07-09 21:24:34,038][26022] Updated weights on worker 0-0, policy_version 426485 (0.00097) [2022-07-09 21:24:34,213][25689] Fps is (10 sec: 5691.9, 60 sec: 5596.2, 300 sec: 5637.4). Total num frames: 436721664. Throughput: 0: 5007.8. Samples: 436717504. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:34,213][25689] Avg episode reward: [(0, '-45.975')] [2022-07-09 21:24:35,592][26022] Updated weights on worker 0-0, policy_version 426495 (0.00084) [2022-07-09 21:24:37,636][26022] Updated weights on worker 0-0, policy_version 426505 (0.00089) [2022-07-09 21:24:39,211][26022] Updated weights on worker 0-0, policy_version 426515 (0.00095) [2022-07-09 21:24:39,220][25689] Fps is (10 sec: 5621.6, 60 sec: 5647.2, 300 sec: 5644.8). Total num frames: 436751360. Throughput: 0: 5857.2. Samples: 436751510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:39,220][25689] Avg episode reward: [(0, '-45.770')] [2022-07-09 21:24:41,332][26022] Updated weights on worker 0-0, policy_version 426525 (0.00108) [2022-07-09 21:24:42,813][26022] Updated weights on worker 0-0, policy_version 426535 (0.00083) [2022-07-09 21:24:44,229][25689] Fps is (10 sec: 5622.5, 60 sec: 5612.7, 300 sec: 5635.7). Total num frames: 436777984. Throughput: 0: 5881.5. Samples: 436785426. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:44,230][25689] Avg episode reward: [(0, '-45.766')] [2022-07-09 21:24:44,847][26022] Updated weights on worker 0-0, policy_version 426545 (0.00086) [2022-07-09 21:24:46,516][26022] Updated weights on worker 0-0, policy_version 426555 (0.00095) [2022-07-09 21:24:48,567][26022] Updated weights on worker 0-0, policy_version 426565 (0.00095) [2022-07-09 21:24:49,319][25689] Fps is (10 sec: 5576.6, 60 sec: 5591.8, 300 sec: 5635.2). Total num frames: 436807680. Throughput: 0: 5053.4. Samples: 436802386. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:49,320][25689] Avg episode reward: [(0, '-45.204')] [2022-07-09 21:24:50,105][26022] Updated weights on worker 0-0, policy_version 426575 (0.00093) [2022-07-09 21:24:52,161][26022] Updated weights on worker 0-0, policy_version 426585 (0.00081) [2022-07-09 21:24:53,756][26022] Updated weights on worker 0-0, policy_version 426595 (0.00092) [2022-07-09 21:24:54,377][25689] Fps is (10 sec: 5651.1, 60 sec: 5606.6, 300 sec: 5635.0). Total num frames: 436835328. Throughput: 0: 5884.0. Samples: 436836150. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:54,377][25689] Avg episode reward: [(0, '-45.470')] [2022-07-09 21:24:55,666][26022] Updated weights on worker 0-0, policy_version 426605 (0.00093) [2022-07-09 21:24:57,462][26022] Updated weights on worker 0-0, policy_version 426615 (0.00404) [2022-07-09 21:24:59,361][26022] Updated weights on worker 0-0, policy_version 426625 (0.00089) [2022-07-09 21:24:59,394][25689] Fps is (10 sec: 5590.0, 60 sec: 5589.7, 300 sec: 5636.4). Total num frames: 436864000. Throughput: 0: 5866.2. Samples: 436869858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:24:59,396][25689] Avg episode reward: [(0, '-45.857')] [2022-07-09 21:25:01,145][26022] Updated weights on worker 0-0, policy_version 426635 (0.00086) [2022-07-09 21:25:03,435][26022] Updated weights on worker 0-0, policy_version 426645 (0.00084) [2022-07-09 21:25:04,403][25689] Fps is (10 sec: 5413.2, 60 sec: 5572.4, 300 sec: 5630.6). Total num frames: 436889600. Throughput: 0: 5028.4. Samples: 436886864. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:04,405][25689] Avg episode reward: [(0, '-46.059')] [2022-07-09 21:25:05,205][26022] Updated weights on worker 0-0, policy_version 426655 (0.00093) [2022-07-09 21:25:06,932][26022] Updated weights on worker 0-0, policy_version 426665 (0.00090) [2022-07-09 21:25:08,769][26022] Updated weights on worker 0-0, policy_version 426675 (0.00088) [2022-07-09 21:25:09,489][25689] Fps is (10 sec: 5376.2, 60 sec: 5592.3, 300 sec: 5629.3). Total num frames: 436918272. Throughput: 0: 5768.2. Samples: 436918730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:09,490][25689] Avg episode reward: [(0, '-46.576')] [2022-07-09 21:25:10,493][26022] Updated weights on worker 0-0, policy_version 426685 (0.00090) [2022-07-09 21:25:12,447][26022] Updated weights on worker 0-0, policy_version 426695 (0.00085) [2022-07-09 21:25:14,059][26022] Updated weights on worker 0-0, policy_version 426705 (0.00088) [2022-07-09 21:25:14,572][25689] Fps is (10 sec: 5739.4, 60 sec: 5585.7, 300 sec: 5635.5). Total num frames: 436947968. Throughput: 0: 5770.3. Samples: 436952684. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:14,573][25689] Avg episode reward: [(0, '-46.410')] [2022-07-09 21:25:16,065][26022] Updated weights on worker 0-0, policy_version 426715 (0.00059) [2022-07-09 21:25:17,689][26022] Updated weights on worker 0-0, policy_version 426725 (0.00055) [2022-07-09 21:25:19,514][26022] Updated weights on worker 0-0, policy_version 426735 (0.00087) [2022-07-09 21:25:19,611][25689] Fps is (10 sec: 5766.4, 60 sec: 5601.2, 300 sec: 5634.9). Total num frames: 436976640. Throughput: 0: 4944.2. Samples: 436969816. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:19,612][25689] Avg episode reward: [(0, '-46.701')] [2022-07-09 21:25:21,505][26022] Updated weights on worker 0-0, policy_version 426745 (0.00124) [2022-07-09 21:25:23,235][26022] Updated weights on worker 0-0, policy_version 426755 (0.00093) [2022-07-09 21:25:24,680][25689] Fps is (10 sec: 5572.2, 60 sec: 5615.4, 300 sec: 5628.9). Total num frames: 437004288. Throughput: 0: 5768.0. Samples: 437003822. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:24,680][25689] Avg episode reward: [(0, '-46.140')] [2022-07-09 21:25:24,996][26022] Updated weights on worker 0-0, policy_version 426765 (0.00084) [2022-07-09 21:25:27,025][26022] Updated weights on worker 0-0, policy_version 426775 (0.00090) [2022-07-09 21:25:28,660][26022] Updated weights on worker 0-0, policy_version 426785 (0.00087) [2022-07-09 21:25:29,763][25689] Fps is (10 sec: 5547.7, 60 sec: 5579.7, 300 sec: 5628.8). Total num frames: 437032960. Throughput: 0: 5871.0. Samples: 437037760. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:29,765][25689] Avg episode reward: [(0, '-45.287')] [2022-07-09 21:25:30,455][26022] Updated weights on worker 0-0, policy_version 426795 (0.00097) [2022-07-09 21:25:32,353][26022] Updated weights on worker 0-0, policy_version 426805 (0.00086) [2022-07-09 21:25:32,732][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:25:32,753][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000426807_437050368.pth [2022-07-09 21:25:32,754][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000424824_435019776.pth [2022-07-09 21:25:34,038][26022] Updated weights on worker 0-0, policy_version 426815 (0.00087) [2022-07-09 21:25:34,786][25689] Fps is (10 sec: 5775.3, 60 sec: 5629.4, 300 sec: 5628.5). Total num frames: 437062656. Throughput: 0: 5061.8. Samples: 437055002. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:34,788][25689] Avg episode reward: [(0, '-45.895')] [2022-07-09 21:25:36,051][26022] Updated weights on worker 0-0, policy_version 426825 (0.00098) [2022-07-09 21:25:37,582][26022] Updated weights on worker 0-0, policy_version 426835 (0.00102) [2022-07-09 21:25:39,713][26022] Updated weights on worker 0-0, policy_version 426845 (0.00087) [2022-07-09 21:25:39,886][25689] Fps is (10 sec: 5664.9, 60 sec: 5587.0, 300 sec: 5628.2). Total num frames: 437090304. Throughput: 0: 5887.8. Samples: 437089190. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:39,888][25689] Avg episode reward: [(0, '-45.385')] [2022-07-09 21:25:41,050][26022] Updated weights on worker 0-0, policy_version 426855 (0.00089) [2022-07-09 21:25:43,036][26022] Updated weights on worker 0-0, policy_version 426865 (0.00101) [2022-07-09 21:25:44,917][25689] Fps is (10 sec: 5559.5, 60 sec: 5618.9, 300 sec: 5625.2). Total num frames: 437118976. Throughput: 0: 5909.9. Samples: 437123420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:44,917][25689] Avg episode reward: [(0, '-45.227')] [2022-07-09 21:25:45,067][26022] Updated weights on worker 0-0, policy_version 426875 (0.00095) [2022-07-09 21:25:46,647][26022] Updated weights on worker 0-0, policy_version 426885 (0.00089) [2022-07-09 21:25:48,536][26022] Updated weights on worker 0-0, policy_version 426895 (0.00091) [2022-07-09 21:25:50,030][25689] Fps is (10 sec: 5754.1, 60 sec: 5616.7, 300 sec: 5630.2). Total num frames: 437148672. Throughput: 0: 5906.6. Samples: 437157466. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:50,030][25689] Avg episode reward: [(0, '-45.557')] [2022-07-09 21:25:50,219][26022] Updated weights on worker 0-0, policy_version 426905 (0.00084) [2022-07-09 21:25:52,238][26022] Updated weights on worker 0-0, policy_version 426915 (0.00096) [2022-07-09 21:25:53,899][26022] Updated weights on worker 0-0, policy_version 426925 (0.00087) [2022-07-09 21:25:55,057][25689] Fps is (10 sec: 5755.8, 60 sec: 5636.4, 300 sec: 5629.8). Total num frames: 437177344. Throughput: 0: 5884.0. Samples: 437174276. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:25:55,058][25689] Avg episode reward: [(0, '-45.920')] [2022-07-09 21:25:55,734][26022] Updated weights on worker 0-0, policy_version 426935 (0.00089) [2022-07-09 21:25:57,537][26022] Updated weights on worker 0-0, policy_version 426945 (0.00091) [2022-07-09 21:25:59,411][26022] Updated weights on worker 0-0, policy_version 426955 (0.00098) [2022-07-09 21:26:00,058][25689] Fps is (10 sec: 5615.9, 60 sec: 5621.0, 300 sec: 5633.6). Total num frames: 437204992. Throughput: 0: 5913.7. Samples: 437208484. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:26:00,059][25689] Avg episode reward: [(0, '-45.511')] [2022-07-09 21:26:01,230][26022] Updated weights on worker 0-0, policy_version 426965 (0.00054) [2022-07-09 21:26:03,297][26022] Updated weights on worker 0-0, policy_version 426975 (0.00088) [2022-07-09 21:26:05,060][25689] Fps is (10 sec: 5425.9, 60 sec: 5638.5, 300 sec: 5628.2). Total num frames: 437231616. Throughput: 0: 5820.8. Samples: 437240670. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 21:26:05,060][25689] Avg episode reward: [(0, '-45.336')] [2022-07-09 21:26:05,095][26022] Updated weights on worker 0-0, policy_version 426985 (0.00089) [2022-07-09 21:26:06,929][26022] Updated weights on worker 0-0, policy_version 426995 (0.00094) [2022-07-09 21:26:08,793][26022] Updated weights on worker 0-0, policy_version 427005 (0.00096) [2022-07-09 21:26:10,158][25689] Fps is (10 sec: 5576.4, 60 sec: 5654.3, 300 sec: 5634.2). Total num frames: 437261312. Throughput: 0: 4977.8. Samples: 437257660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:10,159][25689] Avg episode reward: [(0, '-45.806')] [2022-07-09 21:26:10,455][26022] Updated weights on worker 0-0, policy_version 427015 (0.00088) [2022-07-09 21:26:12,204][26022] Updated weights on worker 0-0, policy_version 427025 (0.00091) [2022-07-09 21:26:14,051][26022] Updated weights on worker 0-0, policy_version 427035 (0.00090) [2022-07-09 21:26:15,213][25689] Fps is (10 sec: 5748.8, 60 sec: 5640.1, 300 sec: 5633.3). Total num frames: 437289984. Throughput: 0: 5828.3. Samples: 437291748. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:15,213][25689] Avg episode reward: [(0, '-46.673')] [2022-07-09 21:26:15,876][26022] Updated weights on worker 0-0, policy_version 427045 (0.00088) [2022-07-09 21:26:17,731][26022] Updated weights on worker 0-0, policy_version 427055 (0.00085) [2022-07-09 21:26:19,637][26022] Updated weights on worker 0-0, policy_version 427065 (0.00091) [2022-07-09 21:26:20,256][25689] Fps is (10 sec: 5577.2, 60 sec: 5622.8, 300 sec: 5632.5). Total num frames: 437317632. Throughput: 0: 5827.6. Samples: 437326188. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:20,257][25689] Avg episode reward: [(0, '-45.851')] [2022-07-09 21:26:21,252][26022] Updated weights on worker 0-0, policy_version 427075 (0.00086) [2022-07-09 21:26:23,236][26022] Updated weights on worker 0-0, policy_version 427085 (0.00089) [2022-07-09 21:26:24,703][26022] Updated weights on worker 0-0, policy_version 427095 (0.00085) [2022-07-09 21:26:25,318][25689] Fps is (10 sec: 5674.8, 60 sec: 5657.2, 300 sec: 5636.2). Total num frames: 437347328. Throughput: 0: 5072.0. Samples: 437343420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:25,318][25689] Avg episode reward: [(0, '-45.480')] [2022-07-09 21:26:26,560][26022] Updated weights on worker 0-0, policy_version 427105 (0.00095) [2022-07-09 21:26:28,583][26022] Updated weights on worker 0-0, policy_version 427115 (0.00091) [2022-07-09 21:26:30,273][26022] Updated weights on worker 0-0, policy_version 427125 (0.00092) [2022-07-09 21:26:30,358][25689] Fps is (10 sec: 5778.0, 60 sec: 5661.2, 300 sec: 5636.8). Total num frames: 437376000. Throughput: 0: 5917.8. Samples: 437377200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:30,359][25689] Avg episode reward: [(0, '-46.247')] [2022-07-09 21:26:32,274][26022] Updated weights on worker 0-0, policy_version 427135 (0.00085) [2022-07-09 21:26:33,887][26022] Updated weights on worker 0-0, policy_version 427145 (0.00084) [2022-07-09 21:26:35,458][25689] Fps is (10 sec: 5554.1, 60 sec: 5620.3, 300 sec: 5629.0). Total num frames: 437403648. Throughput: 0: 5900.3. Samples: 437411202. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:35,460][25689] Avg episode reward: [(0, '-46.865')] [2022-07-09 21:26:35,915][26022] Updated weights on worker 0-0, policy_version 427155 (0.00094) [2022-07-09 21:26:37,492][26022] Updated weights on worker 0-0, policy_version 427165 (0.00085) [2022-07-09 21:26:39,563][26022] Updated weights on worker 0-0, policy_version 427175 (0.00077) [2022-07-09 21:26:40,508][25689] Fps is (10 sec: 5649.6, 60 sec: 5658.7, 300 sec: 5631.7). Total num frames: 437433344. Throughput: 0: 5043.7. Samples: 437428332. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:40,510][25689] Avg episode reward: [(0, '-46.310')] [2022-07-09 21:26:41,094][26022] Updated weights on worker 0-0, policy_version 427185 (0.00090) [2022-07-09 21:26:42,926][26022] Updated weights on worker 0-0, policy_version 427195 (0.00091) [2022-07-09 21:26:44,767][26022] Updated weights on worker 0-0, policy_version 427205 (0.00087) [2022-07-09 21:26:45,522][25689] Fps is (10 sec: 5799.9, 60 sec: 5660.2, 300 sec: 5633.4). Total num frames: 437462016. Throughput: 0: 5896.3. Samples: 437462548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:45,524][25689] Avg episode reward: [(0, '-45.702')] [2022-07-09 21:26:46,615][26022] Updated weights on worker 0-0, policy_version 427215 (0.00094) [2022-07-09 21:26:48,380][26022] Updated weights on worker 0-0, policy_version 427225 (0.00083) [2022-07-09 21:26:50,267][26022] Updated weights on worker 0-0, policy_version 427235 (0.00088) [2022-07-09 21:26:50,644][25689] Fps is (10 sec: 5556.4, 60 sec: 5625.6, 300 sec: 5627.9). Total num frames: 437489664. Throughput: 0: 5882.3. Samples: 437496530. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:50,645][25689] Avg episode reward: [(0, '-46.342')] [2022-07-09 21:26:51,997][26022] Updated weights on worker 0-0, policy_version 427245 (0.00089) [2022-07-09 21:26:54,022][26022] Updated weights on worker 0-0, policy_version 427255 (0.00084) [2022-07-09 21:26:55,590][26022] Updated weights on worker 0-0, policy_version 427265 (0.00087) [2022-07-09 21:26:55,661][25689] Fps is (10 sec: 5756.8, 60 sec: 5660.4, 300 sec: 5631.6). Total num frames: 437520384. Throughput: 0: 5074.1. Samples: 437513712. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:26:55,662][25689] Avg episode reward: [(0, '-46.529')] [2022-07-09 21:26:57,550][26022] Updated weights on worker 0-0, policy_version 427275 (0.00096) [2022-07-09 21:26:59,192][26022] Updated weights on worker 0-0, policy_version 427285 (0.00091) [2022-07-09 21:27:00,684][25689] Fps is (10 sec: 5711.6, 60 sec: 5641.4, 300 sec: 5638.4). Total num frames: 437547008. Throughput: 0: 5912.1. Samples: 437547614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:00,686][25689] Avg episode reward: [(0, '-46.464')] [2022-07-09 21:27:01,216][26022] Updated weights on worker 0-0, policy_version 427295 (0.00091) [2022-07-09 21:27:03,180][26022] Updated weights on worker 0-0, policy_version 427305 (0.00086) [2022-07-09 21:27:05,001][26022] Updated weights on worker 0-0, policy_version 427315 (0.00086) [2022-07-09 21:27:05,769][25689] Fps is (10 sec: 5267.6, 60 sec: 5633.6, 300 sec: 5627.6). Total num frames: 437573632. Throughput: 0: 5787.7. Samples: 437579736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:05,770][25689] Avg episode reward: [(0, '-46.560')] [2022-07-09 21:27:06,780][26022] Updated weights on worker 0-0, policy_version 427325 (0.00086) [2022-07-09 21:27:08,604][26022] Updated weights on worker 0-0, policy_version 427335 (0.00087) [2022-07-09 21:27:10,335][26022] Updated weights on worker 0-0, policy_version 427345 (0.00092) [2022-07-09 21:27:10,844][25689] Fps is (10 sec: 5543.5, 60 sec: 5635.9, 300 sec: 5630.8). Total num frames: 437603328. Throughput: 0: 4956.5. Samples: 437596650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:10,844][25689] Avg episode reward: [(0, '-46.223')] [2022-07-09 21:27:12,312][26022] Updated weights on worker 0-0, policy_version 427355 (0.00088) [2022-07-09 21:27:13,924][26022] Updated weights on worker 0-0, policy_version 427365 (0.00097) [2022-07-09 21:27:15,879][25689] Fps is (10 sec: 5571.0, 60 sec: 5603.9, 300 sec: 5623.8). Total num frames: 437629952. Throughput: 0: 5786.3. Samples: 437630700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:15,879][25689] Avg episode reward: [(0, '-46.575')] [2022-07-09 21:27:15,964][26022] Updated weights on worker 0-0, policy_version 427375 (0.00086) [2022-07-09 21:27:17,640][26022] Updated weights on worker 0-0, policy_version 427385 (0.00095) [2022-07-09 21:27:19,360][26022] Updated weights on worker 0-0, policy_version 427395 (0.00103) [2022-07-09 21:27:20,881][25689] Fps is (10 sec: 5611.2, 60 sec: 5641.5, 300 sec: 5627.6). Total num frames: 437659648. Throughput: 0: 5783.3. Samples: 437664418. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:20,881][25689] Avg episode reward: [(0, '-47.262')] [2022-07-09 21:27:21,475][26022] Updated weights on worker 0-0, policy_version 427405 (0.00094) [2022-07-09 21:27:22,894][26022] Updated weights on worker 0-0, policy_version 427415 (0.00083) [2022-07-09 21:27:24,872][26022] Updated weights on worker 0-0, policy_version 427425 (0.00090) [2022-07-09 21:27:25,924][25689] Fps is (10 sec: 5708.7, 60 sec: 5609.5, 300 sec: 5625.2). Total num frames: 437687296. Throughput: 0: 5054.4. Samples: 437681604. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:25,924][25689] Avg episode reward: [(0, '-46.790')] [2022-07-09 21:27:26,540][26022] Updated weights on worker 0-0, policy_version 427435 (0.00080) [2022-07-09 21:27:28,524][26022] Updated weights on worker 0-0, policy_version 427445 (0.00085) [2022-07-09 21:27:30,403][26022] Updated weights on worker 0-0, policy_version 427455 (0.00100) [2022-07-09 21:27:30,991][25689] Fps is (10 sec: 5570.8, 60 sec: 5607.0, 300 sec: 5624.9). Total num frames: 437715968. Throughput: 0: 5880.4. Samples: 437715122. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:30,991][25689] Avg episode reward: [(0, '-46.982')] [2022-07-09 21:27:32,208][26022] Updated weights on worker 0-0, policy_version 427465 (0.00091) [2022-07-09 21:27:32,905][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:27:32,914][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000427469_437728256.pth [2022-07-09 21:27:32,930][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000425487_435698688.pth [2022-07-09 21:27:33,856][26022] Updated weights on worker 0-0, policy_version 427475 (0.00092) [2022-07-09 21:27:35,777][26022] Updated weights on worker 0-0, policy_version 427485 (0.00087) [2022-07-09 21:27:35,999][25689] Fps is (10 sec: 5691.7, 60 sec: 5632.4, 300 sec: 5625.9). Total num frames: 437744640. Throughput: 0: 5884.4. Samples: 437749094. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:35,999][25689] Avg episode reward: [(0, '-47.360')] [2022-07-09 21:27:37,593][26022] Updated weights on worker 0-0, policy_version 427495 (0.00095) [2022-07-09 21:27:39,519][26022] Updated weights on worker 0-0, policy_version 427505 (0.00087) [2022-07-09 21:27:41,042][25689] Fps is (10 sec: 5705.4, 60 sec: 5616.2, 300 sec: 5625.8). Total num frames: 437773312. Throughput: 0: 5054.7. Samples: 437766324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:41,042][25689] Avg episode reward: [(0, '-46.929')] [2022-07-09 21:27:41,205][26022] Updated weights on worker 0-0, policy_version 427515 (0.00082) [2022-07-09 21:27:42,883][26022] Updated weights on worker 0-0, policy_version 427525 (0.00090) [2022-07-09 21:27:44,866][26022] Updated weights on worker 0-0, policy_version 427535 (0.00057) [2022-07-09 21:27:46,051][25689] Fps is (10 sec: 5806.7, 60 sec: 5633.5, 300 sec: 5630.4). Total num frames: 437803008. Throughput: 0: 5923.8. Samples: 437800832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:46,051][25689] Avg episode reward: [(0, '-46.651')] [2022-07-09 21:27:46,516][26022] Updated weights on worker 0-0, policy_version 427545 (0.00094) [2022-07-09 21:27:48,384][26022] Updated weights on worker 0-0, policy_version 427555 (0.00084) [2022-07-09 21:27:50,020][26022] Updated weights on worker 0-0, policy_version 427565 (0.00093) [2022-07-09 21:27:51,098][25689] Fps is (10 sec: 5804.0, 60 sec: 5657.5, 300 sec: 5633.2). Total num frames: 437831680. Throughput: 0: 5971.8. Samples: 437835200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:51,099][25689] Avg episode reward: [(0, '-45.944')] [2022-07-09 21:27:51,968][26022] Updated weights on worker 0-0, policy_version 427575 (0.00081) [2022-07-09 21:27:53,678][26022] Updated weights on worker 0-0, policy_version 427585 (0.00093) [2022-07-09 21:27:55,471][26022] Updated weights on worker 0-0, policy_version 427595 (0.00091) [2022-07-09 21:27:56,127][25689] Fps is (10 sec: 5691.1, 60 sec: 5622.5, 300 sec: 5626.8). Total num frames: 437860352. Throughput: 0: 5981.7. Samples: 437869492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:27:56,127][25689] Avg episode reward: [(0, '-45.888')] [2022-07-09 21:27:57,247][26022] Updated weights on worker 0-0, policy_version 427605 (0.00098) [2022-07-09 21:27:59,008][26022] Updated weights on worker 0-0, policy_version 427615 (0.00089) [2022-07-09 21:28:00,830][26022] Updated weights on worker 0-0, policy_version 427625 (0.00093) [2022-07-09 21:28:01,139][25689] Fps is (10 sec: 5711.2, 60 sec: 5657.4, 300 sec: 5637.5). Total num frames: 437889024. Throughput: 0: 5990.6. Samples: 437886718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:01,139][25689] Avg episode reward: [(0, '-44.949')] [2022-07-09 21:28:03,086][26022] Updated weights on worker 0-0, policy_version 427635 (0.00083) [2022-07-09 21:28:05,071][26022] Updated weights on worker 0-0, policy_version 427645 (0.00084) [2022-07-09 21:28:06,151][25689] Fps is (10 sec: 5516.4, 60 sec: 5664.3, 300 sec: 5629.9). Total num frames: 437915648. Throughput: 0: 5875.8. Samples: 437918936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:06,151][25689] Avg episode reward: [(0, '-45.122')] [2022-07-09 21:28:06,538][26022] Updated weights on worker 0-0, policy_version 427655 (0.00094) [2022-07-09 21:28:08,543][26022] Updated weights on worker 0-0, policy_version 427665 (0.00092) [2022-07-09 21:28:10,264][26022] Updated weights on worker 0-0, policy_version 427675 (0.00097) [2022-07-09 21:28:11,196][25689] Fps is (10 sec: 5498.1, 60 sec: 5650.0, 300 sec: 5633.6). Total num frames: 437944320. Throughput: 0: 5875.4. Samples: 437953282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:11,198][25689] Avg episode reward: [(0, '-44.878')] [2022-07-09 21:28:12,013][26022] Updated weights on worker 0-0, policy_version 427685 (0.00091) [2022-07-09 21:28:13,879][26022] Updated weights on worker 0-0, policy_version 427695 (0.00098) [2022-07-09 21:28:15,617][26022] Updated weights on worker 0-0, policy_version 427705 (0.00084) [2022-07-09 21:28:16,223][25689] Fps is (10 sec: 5591.6, 60 sec: 5667.8, 300 sec: 5627.3). Total num frames: 437971968. Throughput: 0: 5016.7. Samples: 437970308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:16,223][25689] Avg episode reward: [(0, '-45.422')] [2022-07-09 21:28:17,338][26022] Updated weights on worker 0-0, policy_version 427715 (0.00087) [2022-07-09 21:28:19,483][26022] Updated weights on worker 0-0, policy_version 427725 (0.00085) [2022-07-09 21:28:21,008][26022] Updated weights on worker 0-0, policy_version 427735 (0.00088) [2022-07-09 21:28:21,231][25689] Fps is (10 sec: 5714.6, 60 sec: 5667.2, 300 sec: 5628.1). Total num frames: 438001664. Throughput: 0: 5858.0. Samples: 438004418. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:21,231][25689] Avg episode reward: [(0, '-45.402')] [2022-07-09 21:28:22,930][26022] Updated weights on worker 0-0, policy_version 427745 (0.00090) [2022-07-09 21:28:24,510][26022] Updated weights on worker 0-0, policy_version 427755 (0.00096) [2022-07-09 21:28:26,245][25689] Fps is (10 sec: 5619.8, 60 sec: 5653.0, 300 sec: 5632.7). Total num frames: 438028288. Throughput: 0: 5938.1. Samples: 438038256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:26,245][25689] Avg episode reward: [(0, '-45.525')] [2022-07-09 21:28:26,652][26022] Updated weights on worker 0-0, policy_version 427765 (0.00088) [2022-07-09 21:28:28,245][26022] Updated weights on worker 0-0, policy_version 427775 (0.00091) [2022-07-09 21:28:30,161][26022] Updated weights on worker 0-0, policy_version 427785 (0.01445) [2022-07-09 21:28:31,302][25689] Fps is (10 sec: 5592.0, 60 sec: 5670.8, 300 sec: 5628.2). Total num frames: 438057984. Throughput: 0: 5070.0. Samples: 438055220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:31,303][25689] Avg episode reward: [(0, '-44.599')] [2022-07-09 21:28:31,770][26022] Updated weights on worker 0-0, policy_version 427795 (0.00085) [2022-07-09 21:28:33,919][26022] Updated weights on worker 0-0, policy_version 427805 (0.00095) [2022-07-09 21:28:35,534][26022] Updated weights on worker 0-0, policy_version 427815 (0.00090) [2022-07-09 21:28:36,305][25689] Fps is (10 sec: 5700.0, 60 sec: 5654.4, 300 sec: 5631.7). Total num frames: 438085632. Throughput: 0: 5927.1. Samples: 438089336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:36,305][25689] Avg episode reward: [(0, '-44.905')] [2022-07-09 21:28:37,438][26022] Updated weights on worker 0-0, policy_version 427825 (0.00088) [2022-07-09 21:28:39,123][26022] Updated weights on worker 0-0, policy_version 427835 (0.00098) [2022-07-09 21:28:41,007][26022] Updated weights on worker 0-0, policy_version 427845 (0.00211) [2022-07-09 21:28:41,327][25689] Fps is (10 sec: 5618.2, 60 sec: 5656.3, 300 sec: 5631.4). Total num frames: 438114304. Throughput: 0: 5923.2. Samples: 438123450. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:41,327][25689] Avg episode reward: [(0, '-46.040')] [2022-07-09 21:28:42,796][26022] Updated weights on worker 0-0, policy_version 427855 (0.00090) [2022-07-09 21:28:44,667][26022] Updated weights on worker 0-0, policy_version 427865 (0.00087) [2022-07-09 21:28:46,351][25689] Fps is (10 sec: 5708.3, 60 sec: 5637.9, 300 sec: 5625.0). Total num frames: 438142976. Throughput: 0: 5096.3. Samples: 438140722. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:46,351][25689] Avg episode reward: [(0, '-46.440')] [2022-07-09 21:28:46,426][26022] Updated weights on worker 0-0, policy_version 427875 (0.00093) [2022-07-09 21:28:48,197][26022] Updated weights on worker 0-0, policy_version 427885 (0.00087) [2022-07-09 21:28:50,203][26022] Updated weights on worker 0-0, policy_version 427895 (0.00086) [2022-07-09 21:28:51,382][25689] Fps is (10 sec: 5702.9, 60 sec: 5639.5, 300 sec: 5631.9). Total num frames: 438171648. Throughput: 0: 5952.7. Samples: 438174748. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 21:28:51,382][25689] Avg episode reward: [(0, '-46.233')] [2022-07-09 21:28:51,919][26022] Updated weights on worker 0-0, policy_version 427905 (0.00090) [2022-07-09 21:28:53,765][26022] Updated weights on worker 0-0, policy_version 427915 (0.00095) [2022-07-09 21:28:55,551][26022] Updated weights on worker 0-0, policy_version 427925 (0.00092) [2022-07-09 21:28:56,390][25689] Fps is (10 sec: 5712.0, 60 sec: 5641.4, 300 sec: 5628.6). Total num frames: 438200320. Throughput: 0: 5911.4. Samples: 438208066. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:28:56,390][25689] Avg episode reward: [(0, '-48.103')] [2022-07-09 21:28:57,326][26022] Updated weights on worker 0-0, policy_version 427935 (0.00086) [2022-07-09 21:28:59,071][26022] Updated weights on worker 0-0, policy_version 427945 (0.00089) [2022-07-09 21:29:00,971][26022] Updated weights on worker 0-0, policy_version 427955 (0.00970) [2022-07-09 21:29:01,448][25689] Fps is (10 sec: 5493.4, 60 sec: 5603.1, 300 sec: 5627.6). Total num frames: 438226944. Throughput: 0: 5049.2. Samples: 438225046. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:01,449][25689] Avg episode reward: [(0, '-47.373')] [2022-07-09 21:29:03,330][26022] Updated weights on worker 0-0, policy_version 427965 (0.00092) [2022-07-09 21:29:04,996][26022] Updated weights on worker 0-0, policy_version 427975 (0.00092) [2022-07-09 21:29:06,458][25689] Fps is (10 sec: 5288.6, 60 sec: 5603.3, 300 sec: 5626.2). Total num frames: 438253568. Throughput: 0: 5775.4. Samples: 438256852. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:06,459][25689] Avg episode reward: [(0, '-46.882')] [2022-07-09 21:29:07,001][26022] Updated weights on worker 0-0, policy_version 427985 (0.00091) [2022-07-09 21:29:08,742][26022] Updated weights on worker 0-0, policy_version 427995 (0.00086) [2022-07-09 21:29:10,372][26022] Updated weights on worker 0-0, policy_version 428005 (0.00083) [2022-07-09 21:29:11,595][25689] Fps is (10 sec: 5550.4, 60 sec: 5611.8, 300 sec: 5623.9). Total num frames: 438283264. Throughput: 0: 5734.5. Samples: 438290658. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:11,595][25689] Avg episode reward: [(0, '-46.568')] [2022-07-09 21:29:12,557][26022] Updated weights on worker 0-0, policy_version 428015 (0.00086) [2022-07-09 21:29:14,015][26022] Updated weights on worker 0-0, policy_version 428025 (0.00085) [2022-07-09 21:29:16,035][26022] Updated weights on worker 0-0, policy_version 428035 (0.00096) [2022-07-09 21:29:16,634][25689] Fps is (10 sec: 5837.0, 60 sec: 5644.6, 300 sec: 5630.5). Total num frames: 438312960. Throughput: 0: 4922.9. Samples: 438307724. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:16,634][25689] Avg episode reward: [(0, '-45.907')] [2022-07-09 21:29:17,599][26022] Updated weights on worker 0-0, policy_version 428045 (0.00092) [2022-07-09 21:29:19,627][26022] Updated weights on worker 0-0, policy_version 428055 (0.00088) [2022-07-09 21:29:21,335][26022] Updated weights on worker 0-0, policy_version 428065 (0.00086) [2022-07-09 21:29:21,675][25689] Fps is (10 sec: 5688.9, 60 sec: 5607.6, 300 sec: 5633.9). Total num frames: 438340608. Throughput: 0: 5775.6. Samples: 438341868. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:21,675][25689] Avg episode reward: [(0, '-44.461')] [2022-07-09 21:29:23,200][26022] Updated weights on worker 0-0, policy_version 428075 (0.00087) [2022-07-09 21:29:24,847][26022] Updated weights on worker 0-0, policy_version 428085 (0.00093) [2022-07-09 21:29:26,682][25689] Fps is (10 sec: 5401.2, 60 sec: 5608.2, 300 sec: 5621.2). Total num frames: 438367232. Throughput: 0: 5884.1. Samples: 438375848. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:26,682][25689] Avg episode reward: [(0, '-43.821')] [2022-07-09 21:29:26,858][26022] Updated weights on worker 0-0, policy_version 428095 (0.00095) [2022-07-09 21:29:28,467][26022] Updated weights on worker 0-0, policy_version 428105 (0.00102) [2022-07-09 21:29:30,530][26022] Updated weights on worker 0-0, policy_version 428115 (0.00094) [2022-07-09 21:29:31,722][25689] Fps is (10 sec: 5605.6, 60 sec: 5609.8, 300 sec: 5631.0). Total num frames: 438396928. Throughput: 0: 5062.2. Samples: 438392546. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:31,722][25689] Avg episode reward: [(0, '-44.557')] [2022-07-09 21:29:32,251][26022] Updated weights on worker 0-0, policy_version 428125 (0.00092) [2022-07-09 21:29:33,050][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:29:33,070][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000428129_438404096.pth [2022-07-09 21:29:33,070][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000426149_436376576.pth [2022-07-09 21:29:34,107][26022] Updated weights on worker 0-0, policy_version 428135 (0.00079) [2022-07-09 21:29:35,825][26022] Updated weights on worker 0-0, policy_version 428145 (0.00086) [2022-07-09 21:29:36,726][25689] Fps is (10 sec: 5811.3, 60 sec: 5626.7, 300 sec: 5627.6). Total num frames: 438425600. Throughput: 0: 5925.6. Samples: 438426782. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:36,726][25689] Avg episode reward: [(0, '-45.247')] [2022-07-09 21:29:37,715][26022] Updated weights on worker 0-0, policy_version 428155 (0.00085) [2022-07-09 21:29:39,459][26022] Updated weights on worker 0-0, policy_version 428165 (0.00383) [2022-07-09 21:29:41,290][26022] Updated weights on worker 0-0, policy_version 428175 (0.00098) [2022-07-09 21:29:41,728][25689] Fps is (10 sec: 5526.0, 60 sec: 5594.5, 300 sec: 5627.7). Total num frames: 438452224. Throughput: 0: 5925.2. Samples: 438460690. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:41,729][25689] Avg episode reward: [(0, '-44.629')] [2022-07-09 21:29:43,045][26022] Updated weights on worker 0-0, policy_version 428185 (0.00086) [2022-07-09 21:29:44,817][26022] Updated weights on worker 0-0, policy_version 428195 (0.00093) [2022-07-09 21:29:46,762][25689] Fps is (10 sec: 5509.8, 60 sec: 5593.7, 300 sec: 5625.3). Total num frames: 438480896. Throughput: 0: 5079.5. Samples: 438477844. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:46,762][25689] Avg episode reward: [(0, '-45.946')] [2022-07-09 21:29:46,827][26022] Updated weights on worker 0-0, policy_version 428205 (0.00084) [2022-07-09 21:29:48,521][26022] Updated weights on worker 0-0, policy_version 428215 (0.00088) [2022-07-09 21:29:50,328][26022] Updated weights on worker 0-0, policy_version 428225 (0.00087) [2022-07-09 21:29:51,803][25689] Fps is (10 sec: 5793.8, 60 sec: 5609.7, 300 sec: 5632.5). Total num frames: 438510592. Throughput: 0: 5931.1. Samples: 438511646. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:51,804][25689] Avg episode reward: [(0, '-47.670')] [2022-07-09 21:29:52,175][26022] Updated weights on worker 0-0, policy_version 428235 (0.00095) [2022-07-09 21:29:53,871][26022] Updated weights on worker 0-0, policy_version 428245 (0.00086) [2022-07-09 21:29:55,696][26022] Updated weights on worker 0-0, policy_version 428255 (0.00088) [2022-07-09 21:29:56,903][25689] Fps is (10 sec: 5856.2, 60 sec: 5618.1, 300 sec: 5634.4). Total num frames: 438540288. Throughput: 0: 5904.2. Samples: 438545914. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:29:56,904][25689] Avg episode reward: [(0, '-48.142')] [2022-07-09 21:29:57,870][26022] Updated weights on worker 0-0, policy_version 428266 (0.00088) [2022-07-09 21:29:59,474][26022] Updated weights on worker 0-0, policy_version 428276 (0.00530) [2022-07-09 21:30:01,375][26022] Updated weights on worker 0-0, policy_version 428286 (0.00089) [2022-07-09 21:30:01,954][25689] Fps is (10 sec: 5548.1, 60 sec: 5618.7, 300 sec: 5637.1). Total num frames: 438566912. Throughput: 0: 5060.8. Samples: 438563048. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:01,959][25689] Avg episode reward: [(0, '-48.145')] [2022-07-09 21:30:03,436][26022] Updated weights on worker 0-0, policy_version 428296 (0.00091) [2022-07-09 21:30:05,311][26022] Updated weights on worker 0-0, policy_version 428306 (0.00090) [2022-07-09 21:30:07,043][25689] Fps is (10 sec: 5352.4, 60 sec: 5628.4, 300 sec: 5633.6). Total num frames: 438594560. Throughput: 0: 5805.8. Samples: 438595594. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:07,045][25689] Avg episode reward: [(0, '-48.325')] [2022-07-09 21:30:07,106][26022] Updated weights on worker 0-0, policy_version 428316 (0.00084) [2022-07-09 21:30:08,827][26022] Updated weights on worker 0-0, policy_version 428326 (0.00098) [2022-07-09 21:30:10,631][26022] Updated weights on worker 0-0, policy_version 428336 (0.00087) [2022-07-09 21:30:12,099][25689] Fps is (10 sec: 5753.5, 60 sec: 5652.8, 300 sec: 5637.5). Total num frames: 438625280. Throughput: 0: 5817.0. Samples: 438629708. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:12,099][25689] Avg episode reward: [(0, '-47.963')] [2022-07-09 21:30:12,517][26022] Updated weights on worker 0-0, policy_version 428346 (0.00085) [2022-07-09 21:30:14,447][26022] Updated weights on worker 0-0, policy_version 428356 (0.00090) [2022-07-09 21:30:16,028][26022] Updated weights on worker 0-0, policy_version 428366 (0.00096) [2022-07-09 21:30:17,130][25689] Fps is (10 sec: 5685.0, 60 sec: 5602.7, 300 sec: 5630.8). Total num frames: 438651904. Throughput: 0: 4982.3. Samples: 438646690. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:17,132][25689] Avg episode reward: [(0, '-47.889')] [2022-07-09 21:30:17,950][26022] Updated weights on worker 0-0, policy_version 428376 (0.00081) [2022-07-09 21:30:19,652][26022] Updated weights on worker 0-0, policy_version 428386 (0.00091) [2022-07-09 21:30:21,464][26022] Updated weights on worker 0-0, policy_version 428396 (0.00082) [2022-07-09 21:30:22,136][25689] Fps is (10 sec: 5509.1, 60 sec: 5622.9, 300 sec: 5635.4). Total num frames: 438680576. Throughput: 0: 5830.3. Samples: 438680716. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:22,138][25689] Avg episode reward: [(0, '-47.194')] [2022-07-09 21:30:23,385][26022] Updated weights on worker 0-0, policy_version 428406 (0.00087) [2022-07-09 21:30:25,009][26022] Updated weights on worker 0-0, policy_version 428416 (0.00085) [2022-07-09 21:30:26,892][26022] Updated weights on worker 0-0, policy_version 428426 (0.00097) [2022-07-09 21:30:27,171][25689] Fps is (10 sec: 5711.1, 60 sec: 5654.2, 300 sec: 5636.3). Total num frames: 438709248. Throughput: 0: 5930.8. Samples: 438714968. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:27,171][25689] Avg episode reward: [(0, '-47.324')] [2022-07-09 21:30:28,649][26022] Updated weights on worker 0-0, policy_version 428436 (0.00087) [2022-07-09 21:30:30,658][26022] Updated weights on worker 0-0, policy_version 428446 (0.00082) [2022-07-09 21:30:32,283][25689] Fps is (10 sec: 5651.4, 60 sec: 5630.6, 300 sec: 5631.2). Total num frames: 438737920. Throughput: 0: 5050.1. Samples: 438731640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:32,285][25689] Avg episode reward: [(0, '-47.269')] [2022-07-09 21:30:32,374][26022] Updated weights on worker 0-0, policy_version 428456 (0.00091) [2022-07-09 21:30:34,299][26022] Updated weights on worker 0-0, policy_version 428466 (0.00092) [2022-07-09 21:30:35,819][26022] Updated weights on worker 0-0, policy_version 428476 (0.00093) [2022-07-09 21:30:37,345][25689] Fps is (10 sec: 5535.7, 60 sec: 5608.3, 300 sec: 5631.9). Total num frames: 438765568. Throughput: 0: 5879.2. Samples: 438765536. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:37,345][25689] Avg episode reward: [(0, '-47.671')] [2022-07-09 21:30:37,936][26022] Updated weights on worker 0-0, policy_version 428486 (0.00094) [2022-07-09 21:30:39,579][26022] Updated weights on worker 0-0, policy_version 428496 (0.00083) [2022-07-09 21:30:41,583][26022] Updated weights on worker 0-0, policy_version 428506 (0.00080) [2022-07-09 21:30:42,419][25689] Fps is (10 sec: 5657.5, 60 sec: 5652.3, 300 sec: 5634.5). Total num frames: 438795264. Throughput: 0: 5873.5. Samples: 438799846. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:42,420][25689] Avg episode reward: [(0, '-48.341')] [2022-07-09 21:30:43,255][26022] Updated weights on worker 0-0, policy_version 428516 (0.00088) [2022-07-09 21:30:45,029][26022] Updated weights on worker 0-0, policy_version 428526 (0.00089) [2022-07-09 21:30:46,741][26022] Updated weights on worker 0-0, policy_version 428536 (0.00083) [2022-07-09 21:30:47,441][25689] Fps is (10 sec: 5679.4, 60 sec: 5636.4, 300 sec: 5629.4). Total num frames: 438822912. Throughput: 0: 5033.7. Samples: 438817004. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:47,443][25689] Avg episode reward: [(0, '-47.668')] [2022-07-09 21:30:48,581][26022] Updated weights on worker 0-0, policy_version 428546 (0.00087) [2022-07-09 21:30:50,433][26022] Updated weights on worker 0-0, policy_version 428556 (0.00083) [2022-07-09 21:30:52,236][26022] Updated weights on worker 0-0, policy_version 428566 (0.00090) [2022-07-09 21:30:52,531][25689] Fps is (10 sec: 5671.0, 60 sec: 5631.9, 300 sec: 5631.6). Total num frames: 438852608. Throughput: 0: 5903.2. Samples: 438851166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:52,531][25689] Avg episode reward: [(0, '-47.025')] [2022-07-09 21:30:54,033][26022] Updated weights on worker 0-0, policy_version 428576 (0.00087) [2022-07-09 21:30:55,802][26022] Updated weights on worker 0-0, policy_version 428586 (0.00079) [2022-07-09 21:30:57,499][26022] Updated weights on worker 0-0, policy_version 428596 (0.00095) [2022-07-09 21:30:57,559][25689] Fps is (10 sec: 5870.5, 60 sec: 5638.7, 300 sec: 5638.0). Total num frames: 438882304. Throughput: 0: 5938.2. Samples: 438885570. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:30:57,559][25689] Avg episode reward: [(0, '-47.136')] [2022-07-09 21:30:59,228][26022] Updated weights on worker 0-0, policy_version 428606 (0.00089) [2022-07-09 21:31:01,360][26022] Updated weights on worker 0-0, policy_version 428616 (0.00087) [2022-07-09 21:31:02,580][25689] Fps is (10 sec: 5400.4, 60 sec: 5607.6, 300 sec: 5630.8). Total num frames: 438906880. Throughput: 0: 5831.7. Samples: 438917420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:02,581][25689] Avg episode reward: [(0, '-46.267')] [2022-07-09 21:31:03,330][26022] Updated weights on worker 0-0, policy_version 428626 (0.00089) [2022-07-09 21:31:05,450][26022] Updated weights on worker 0-0, policy_version 428636 (0.00087) [2022-07-09 21:31:06,895][26022] Updated weights on worker 0-0, policy_version 428646 (0.00201) [2022-07-09 21:31:07,607][25689] Fps is (10 sec: 5503.1, 60 sec: 5664.1, 300 sec: 5635.6). Total num frames: 438937600. Throughput: 0: 5828.5. Samples: 438934538. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:07,607][25689] Avg episode reward: [(0, '-46.473')] [2022-07-09 21:31:08,858][26022] Updated weights on worker 0-0, policy_version 428656 (0.00092) [2022-07-09 21:31:10,497][26022] Updated weights on worker 0-0, policy_version 428666 (0.00086) [2022-07-09 21:31:12,405][26022] Updated weights on worker 0-0, policy_version 428676 (0.00089) [2022-07-09 21:31:12,736][25689] Fps is (10 sec: 5747.3, 60 sec: 5606.6, 300 sec: 5630.7). Total num frames: 438965248. Throughput: 0: 5817.0. Samples: 438968700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:12,737][25689] Avg episode reward: [(0, '-44.850')] [2022-07-09 21:31:14,448][26022] Updated weights on worker 0-0, policy_version 428686 (0.00096) [2022-07-09 21:31:15,882][26022] Updated weights on worker 0-0, policy_version 428696 (0.00085) [2022-07-09 21:31:17,784][25689] Fps is (10 sec: 5533.9, 60 sec: 5638.8, 300 sec: 5634.1). Total num frames: 438993920. Throughput: 0: 5801.1. Samples: 439002900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:17,784][25689] Avg episode reward: [(0, '-45.131')] [2022-07-09 21:31:17,936][26022] Updated weights on worker 0-0, policy_version 428706 (0.00094) [2022-07-09 21:31:19,615][26022] Updated weights on worker 0-0, policy_version 428716 (0.00087) [2022-07-09 21:31:21,352][26022] Updated weights on worker 0-0, policy_version 428726 (0.00088) [2022-07-09 21:31:22,847][25689] Fps is (10 sec: 5671.5, 60 sec: 5633.5, 300 sec: 5630.6). Total num frames: 439022592. Throughput: 0: 5060.1. Samples: 439019972. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:22,847][25689] Avg episode reward: [(0, '-44.465')] [2022-07-09 21:31:23,415][26022] Updated weights on worker 0-0, policy_version 428736 (0.00085) [2022-07-09 21:31:24,963][26022] Updated weights on worker 0-0, policy_version 428746 (0.00087) [2022-07-09 21:31:26,822][26022] Updated weights on worker 0-0, policy_version 428756 (0.00087) [2022-07-09 21:31:27,883][25689] Fps is (10 sec: 5779.7, 60 sec: 5650.3, 300 sec: 5634.1). Total num frames: 439052288. Throughput: 0: 5912.7. Samples: 439054424. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:27,883][25689] Avg episode reward: [(0, '-44.176')] [2022-07-09 21:31:28,587][26022] Updated weights on worker 0-0, policy_version 428766 (0.00093) [2022-07-09 21:31:30,251][26022] Updated weights on worker 0-0, policy_version 428776 (0.00089) [2022-07-09 21:31:32,171][26022] Updated weights on worker 0-0, policy_version 428786 (0.00086) [2022-07-09 21:31:32,960][25689] Fps is (10 sec: 5872.8, 60 sec: 5670.4, 300 sec: 5641.4). Total num frames: 439081984. Throughput: 0: 5924.4. Samples: 439088516. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-09 21:31:32,960][25689] Avg episode reward: [(0, '-44.169')] [2022-07-09 21:31:33,391][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:31:33,403][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000428793_439084032.pth [2022-07-09 21:31:33,403][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000426807_437050368.pth [2022-07-09 21:31:34,024][26022] Updated weights on worker 0-0, policy_version 428796 (0.00088) [2022-07-09 21:31:35,688][26022] Updated weights on worker 0-0, policy_version 428806 (0.00094) [2022-07-09 21:31:37,951][26022] Updated weights on worker 0-0, policy_version 428816 (0.00087) [2022-07-09 21:31:38,035][25689] Fps is (10 sec: 5547.6, 60 sec: 5652.3, 300 sec: 5630.6). Total num frames: 439108608. Throughput: 0: 5063.9. Samples: 439105448. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:31:38,035][25689] Avg episode reward: [(0, '-44.432')] [2022-07-09 21:31:39,267][26022] Updated weights on worker 0-0, policy_version 428826 (0.00088) [2022-07-09 21:31:41,333][26022] Updated weights on worker 0-0, policy_version 428836 (0.00082) [2022-07-09 21:31:43,039][25689] Fps is (10 sec: 5588.0, 60 sec: 5658.9, 300 sec: 5634.3). Total num frames: 439138304. Throughput: 0: 5918.4. Samples: 439139474. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:31:43,039][25689] Avg episode reward: [(0, '-43.919')] [2022-07-09 21:31:43,044][26022] Updated weights on worker 0-0, policy_version 428846 (0.00093) [2022-07-09 21:31:44,710][26022] Updated weights on worker 0-0, policy_version 428856 (0.00092) [2022-07-09 21:31:46,728][26022] Updated weights on worker 0-0, policy_version 428866 (0.00086) [2022-07-09 21:31:48,068][25689] Fps is (10 sec: 5817.6, 60 sec: 5675.2, 300 sec: 5639.5). Total num frames: 439166976. Throughput: 0: 5912.0. Samples: 439173758. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:31:48,068][25689] Avg episode reward: [(0, '-44.349')] [2022-07-09 21:31:48,542][26022] Updated weights on worker 0-0, policy_version 428876 (0.00085) [2022-07-09 21:31:50,201][26022] Updated weights on worker 0-0, policy_version 428886 (0.00091) [2022-07-09 21:31:52,199][26022] Updated weights on worker 0-0, policy_version 428896 (0.00089) [2022-07-09 21:31:53,150][25689] Fps is (10 sec: 5671.4, 60 sec: 5658.9, 300 sec: 5631.3). Total num frames: 439195648. Throughput: 0: 5063.2. Samples: 439190742. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:31:53,150][25689] Avg episode reward: [(0, '-44.587')] [2022-07-09 21:31:53,838][26022] Updated weights on worker 0-0, policy_version 428906 (0.00087) [2022-07-09 21:31:55,715][26022] Updated weights on worker 0-0, policy_version 428916 (0.00087) [2022-07-09 21:31:57,471][26022] Updated weights on worker 0-0, policy_version 428926 (0.00084) [2022-07-09 21:31:58,160][25689] Fps is (10 sec: 5580.8, 60 sec: 5626.8, 300 sec: 5635.0). Total num frames: 439223296. Throughput: 0: 5936.9. Samples: 439224926. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:31:58,160][25689] Avg episode reward: [(0, '-45.521')] [2022-07-09 21:31:59,079][26022] Updated weights on worker 0-0, policy_version 428936 (0.00086) [2022-07-09 21:32:01,130][26022] Updated weights on worker 0-0, policy_version 428946 (0.00086) [2022-07-09 21:32:03,163][25689] Fps is (10 sec: 5420.2, 60 sec: 5662.4, 300 sec: 5636.6). Total num frames: 439249920. Throughput: 0: 5831.8. Samples: 439256834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:03,163][25689] Avg episode reward: [(0, '-45.540')] [2022-07-09 21:32:03,183][26022] Updated weights on worker 0-0, policy_version 428956 (0.00099) [2022-07-09 21:32:05,044][26022] Updated weights on worker 0-0, policy_version 428966 (0.00090) [2022-07-09 21:32:07,016][26022] Updated weights on worker 0-0, policy_version 428976 (0.00084) [2022-07-09 21:32:08,195][25689] Fps is (10 sec: 5611.9, 60 sec: 5644.9, 300 sec: 5637.4). Total num frames: 439279616. Throughput: 0: 4978.5. Samples: 439273962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:08,196][25689] Avg episode reward: [(0, '-45.664')] [2022-07-09 21:32:08,497][26022] Updated weights on worker 0-0, policy_version 428986 (0.00086) [2022-07-09 21:32:10,510][26022] Updated weights on worker 0-0, policy_version 428996 (0.00098) [2022-07-09 21:32:12,025][26022] Updated weights on worker 0-0, policy_version 429006 (0.00092) [2022-07-09 21:32:13,253][25689] Fps is (10 sec: 5581.4, 60 sec: 5634.6, 300 sec: 5637.0). Total num frames: 439306240. Throughput: 0: 5847.6. Samples: 439308300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:13,254][25689] Avg episode reward: [(0, '-45.891')] [2022-07-09 21:32:13,954][26022] Updated weights on worker 0-0, policy_version 429016 (0.00083) [2022-07-09 21:32:15,877][26022] Updated weights on worker 0-0, policy_version 429026 (0.00093) [2022-07-09 21:32:17,610][26022] Updated weights on worker 0-0, policy_version 429036 (0.00087) [2022-07-09 21:32:18,297][25689] Fps is (10 sec: 5575.3, 60 sec: 5651.9, 300 sec: 5636.2). Total num frames: 439335936. Throughput: 0: 5844.2. Samples: 439342614. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:18,298][25689] Avg episode reward: [(0, '-45.864')] [2022-07-09 21:32:19,277][26022] Updated weights on worker 0-0, policy_version 429046 (0.00086) [2022-07-09 21:32:21,244][26022] Updated weights on worker 0-0, policy_version 429056 (0.00087) [2022-07-09 21:32:23,022][26022] Updated weights on worker 0-0, policy_version 429066 (0.00087) [2022-07-09 21:32:23,327][25689] Fps is (10 sec: 5997.5, 60 sec: 5688.9, 300 sec: 5646.8). Total num frames: 439366656. Throughput: 0: 5108.6. Samples: 439359844. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:23,327][25689] Avg episode reward: [(0, '-45.810')] [2022-07-09 21:32:24,832][26022] Updated weights on worker 0-0, policy_version 429076 (0.00082) [2022-07-09 21:32:26,407][26022] Updated weights on worker 0-0, policy_version 429086 (0.00085) [2022-07-09 21:32:28,351][25689] Fps is (10 sec: 5602.0, 60 sec: 5622.3, 300 sec: 5637.2). Total num frames: 439392256. Throughput: 0: 5949.8. Samples: 439393882. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:28,351][25689] Avg episode reward: [(0, '-46.765')] [2022-07-09 21:32:28,442][26022] Updated weights on worker 0-0, policy_version 429096 (0.00087) [2022-07-09 21:32:30,366][26022] Updated weights on worker 0-0, policy_version 429106 (0.00089) [2022-07-09 21:32:32,020][26022] Updated weights on worker 0-0, policy_version 429116 (0.00082) [2022-07-09 21:32:33,446][25689] Fps is (10 sec: 5565.5, 60 sec: 5637.5, 300 sec: 5642.5). Total num frames: 439422976. Throughput: 0: 5922.2. Samples: 439427886. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:33,447][25689] Avg episode reward: [(0, '-46.582')] [2022-07-09 21:32:33,859][26022] Updated weights on worker 0-0, policy_version 429126 (0.00098) [2022-07-09 21:32:35,598][26022] Updated weights on worker 0-0, policy_version 429136 (0.00393) [2022-07-09 21:32:37,516][26022] Updated weights on worker 0-0, policy_version 429146 (0.00083) [2022-07-09 21:32:38,513][25689] Fps is (10 sec: 5743.3, 60 sec: 5655.2, 300 sec: 5638.6). Total num frames: 439450624. Throughput: 0: 5057.2. Samples: 439444852. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:38,514][25689] Avg episode reward: [(0, '-46.539')] [2022-07-09 21:32:39,037][26022] Updated weights on worker 0-0, policy_version 429156 (0.00092) [2022-07-09 21:32:41,156][26022] Updated weights on worker 0-0, policy_version 429166 (0.00088) [2022-07-09 21:32:42,672][26022] Updated weights on worker 0-0, policy_version 429176 (0.00084) [2022-07-09 21:32:43,543][25689] Fps is (10 sec: 5578.2, 60 sec: 5635.9, 300 sec: 5634.7). Total num frames: 439479296. Throughput: 0: 5916.6. Samples: 439479454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:43,543][25689] Avg episode reward: [(0, '-45.524')] [2022-07-09 21:32:44,635][26022] Updated weights on worker 0-0, policy_version 429186 (0.00087) [2022-07-09 21:32:46,355][26022] Updated weights on worker 0-0, policy_version 429196 (0.00082) [2022-07-09 21:32:47,958][26022] Updated weights on worker 0-0, policy_version 429206 (0.00080) [2022-07-09 21:32:48,570][25689] Fps is (10 sec: 5804.0, 60 sec: 5653.0, 300 sec: 5638.6). Total num frames: 439508992. Throughput: 0: 5936.9. Samples: 439513922. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:48,570][25689] Avg episode reward: [(0, '-44.949')] [2022-07-09 21:32:49,971][26022] Updated weights on worker 0-0, policy_version 429216 (0.00082) [2022-07-09 21:32:51,724][26022] Updated weights on worker 0-0, policy_version 429226 (0.00087) [2022-07-09 21:32:53,571][26022] Updated weights on worker 0-0, policy_version 429236 (0.00086) [2022-07-09 21:32:53,655][25689] Fps is (10 sec: 5772.0, 60 sec: 5652.7, 300 sec: 5637.5). Total num frames: 439537664. Throughput: 0: 5101.2. Samples: 439530976. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:53,655][25689] Avg episode reward: [(0, '-45.555')] [2022-07-09 21:32:55,408][26022] Updated weights on worker 0-0, policy_version 429246 (0.00094) [2022-07-09 21:32:57,169][26022] Updated weights on worker 0-0, policy_version 429256 (0.00090) [2022-07-09 21:32:58,710][25689] Fps is (10 sec: 5655.4, 60 sec: 5665.4, 300 sec: 5636.7). Total num frames: 439566336. Throughput: 0: 5963.0. Samples: 439565282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:32:58,710][25689] Avg episode reward: [(0, '-44.549')] [2022-07-09 21:32:59,017][26022] Updated weights on worker 0-0, policy_version 429266 (0.00458) [2022-07-09 21:33:00,868][26022] Updated weights on worker 0-0, policy_version 429276 (0.00089) [2022-07-09 21:33:03,029][26022] Updated weights on worker 0-0, policy_version 429286 (0.00097) [2022-07-09 21:33:03,751][25689] Fps is (10 sec: 5476.8, 60 sec: 5661.8, 300 sec: 5636.1). Total num frames: 439592960. Throughput: 0: 5818.3. Samples: 439597034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:03,752][25689] Avg episode reward: [(0, '-43.974')] [2022-07-09 21:33:04,743][26022] Updated weights on worker 0-0, policy_version 429296 (0.00080) [2022-07-09 21:33:06,530][26022] Updated weights on worker 0-0, policy_version 429306 (0.00096) [2022-07-09 21:33:08,586][26022] Updated weights on worker 0-0, policy_version 429316 (0.00090) [2022-07-09 21:33:08,788][25689] Fps is (10 sec: 5385.1, 60 sec: 5627.6, 300 sec: 5632.8). Total num frames: 439620608. Throughput: 0: 4947.6. Samples: 439613952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:08,788][25689] Avg episode reward: [(0, '-44.710')] [2022-07-09 21:33:10,161][26022] Updated weights on worker 0-0, policy_version 429326 (0.00082) [2022-07-09 21:33:12,200][26022] Updated weights on worker 0-0, policy_version 429336 (0.00094) [2022-07-09 21:33:13,604][26022] Updated weights on worker 0-0, policy_version 429346 (0.00092) [2022-07-09 21:33:13,865][25689] Fps is (10 sec: 5771.1, 60 sec: 5693.4, 300 sec: 5642.2). Total num frames: 439651328. Throughput: 0: 5794.4. Samples: 439648084. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:13,866][25689] Avg episode reward: [(0, '-44.899')] [2022-07-09 21:33:15,885][26022] Updated weights on worker 0-0, policy_version 429356 (0.00052) [2022-07-09 21:33:17,337][26022] Updated weights on worker 0-0, policy_version 429366 (0.00087) [2022-07-09 21:33:18,890][25689] Fps is (10 sec: 5575.0, 60 sec: 5627.6, 300 sec: 5628.1). Total num frames: 439676928. Throughput: 0: 5796.0. Samples: 439682248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:18,891][25689] Avg episode reward: [(0, '-44.980')] [2022-07-09 21:33:19,379][26022] Updated weights on worker 0-0, policy_version 429376 (0.00114) [2022-07-09 21:33:21,067][26022] Updated weights on worker 0-0, policy_version 429386 (0.00086) [2022-07-09 21:33:23,003][26022] Updated weights on worker 0-0, policy_version 429396 (0.00094) [2022-07-09 21:33:23,908][25689] Fps is (10 sec: 5506.1, 60 sec: 5611.8, 300 sec: 5638.4). Total num frames: 439706624. Throughput: 0: 5903.2. Samples: 439716022. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:23,908][25689] Avg episode reward: [(0, '-44.785')] [2022-07-09 21:33:24,755][26022] Updated weights on worker 0-0, policy_version 429406 (0.00099) [2022-07-09 21:33:26,656][26022] Updated weights on worker 0-0, policy_version 429416 (0.00079) [2022-07-09 21:33:28,372][26022] Updated weights on worker 0-0, policy_version 429426 (0.00096) [2022-07-09 21:33:28,925][25689] Fps is (10 sec: 5714.3, 60 sec: 5646.2, 300 sec: 5632.2). Total num frames: 439734272. Throughput: 0: 5904.0. Samples: 439732844. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:28,926][25689] Avg episode reward: [(0, '-43.602')] [2022-07-09 21:33:30,266][26022] Updated weights on worker 0-0, policy_version 429436 (0.00079) [2022-07-09 21:33:32,086][26022] Updated weights on worker 0-0, policy_version 429446 (0.00056) [2022-07-09 21:33:33,499][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:33:33,516][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000429454_439760896.pth [2022-07-09 21:33:33,517][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000427469_437728256.pth [2022-07-09 21:33:33,978][25689] Fps is (10 sec: 5592.7, 60 sec: 5616.3, 300 sec: 5634.7). Total num frames: 439762944. Throughput: 0: 5880.6. Samples: 439766360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:33,979][25689] Avg episode reward: [(0, '-43.678')] [2022-07-09 21:33:33,983][26022] Updated weights on worker 0-0, policy_version 429456 (0.00090) [2022-07-09 21:33:35,698][26022] Updated weights on worker 0-0, policy_version 429466 (0.00094) [2022-07-09 21:33:37,711][26022] Updated weights on worker 0-0, policy_version 429476 (0.00084) [2022-07-09 21:33:39,054][25689] Fps is (10 sec: 5661.6, 60 sec: 5632.5, 300 sec: 5633.7). Total num frames: 439791616. Throughput: 0: 5852.7. Samples: 439800260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:39,054][25689] Avg episode reward: [(0, '-43.627')] [2022-07-09 21:33:39,271][26022] Updated weights on worker 0-0, policy_version 429486 (0.01260) [2022-07-09 21:33:41,378][26022] Updated weights on worker 0-0, policy_version 429496 (0.00083) [2022-07-09 21:33:42,795][26022] Updated weights on worker 0-0, policy_version 429506 (0.00396) [2022-07-09 21:33:44,069][25689] Fps is (10 sec: 5682.7, 60 sec: 5633.8, 300 sec: 5633.9). Total num frames: 439820288. Throughput: 0: 5037.1. Samples: 439817576. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:44,073][25689] Avg episode reward: [(0, '-43.980')] [2022-07-09 21:33:44,839][26022] Updated weights on worker 0-0, policy_version 429516 (0.00089) [2022-07-09 21:33:46,222][26022] Updated weights on worker 0-0, policy_version 429526 (0.00085) [2022-07-09 21:33:48,491][26022] Updated weights on worker 0-0, policy_version 429536 (0.00087) [2022-07-09 21:33:49,100][25689] Fps is (10 sec: 5605.9, 60 sec: 5599.6, 300 sec: 5630.4). Total num frames: 439847936. Throughput: 0: 5898.6. Samples: 439851848. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:49,101][25689] Avg episode reward: [(0, '-44.338')] [2022-07-09 21:33:49,912][26022] Updated weights on worker 0-0, policy_version 429546 (0.00092) [2022-07-09 21:33:52,221][26022] Updated weights on worker 0-0, policy_version 429556 (0.00093) [2022-07-09 21:33:53,631][26022] Updated weights on worker 0-0, policy_version 429566 (0.00084) [2022-07-09 21:33:54,206][25689] Fps is (10 sec: 5758.3, 60 sec: 5631.5, 300 sec: 5635.5). Total num frames: 439878656. Throughput: 0: 5889.1. Samples: 439885480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:54,206][25689] Avg episode reward: [(0, '-44.744')] [2022-07-09 21:33:55,943][26022] Updated weights on worker 0-0, policy_version 429576 (0.00090) [2022-07-09 21:33:57,165][26022] Updated weights on worker 0-0, policy_version 429586 (0.00088) [2022-07-09 21:33:59,213][25689] Fps is (10 sec: 5569.5, 60 sec: 5585.2, 300 sec: 5633.0). Total num frames: 439904256. Throughput: 0: 5069.5. Samples: 439902454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:33:59,213][25689] Avg episode reward: [(0, '-44.782')] [2022-07-09 21:33:59,344][26022] Updated weights on worker 0-0, policy_version 429596 (0.00086) [2022-07-09 21:34:00,707][26022] Updated weights on worker 0-0, policy_version 429606 (0.00085) [2022-07-09 21:34:03,172][26022] Updated weights on worker 0-0, policy_version 429616 (0.00079) [2022-07-09 21:34:04,233][25689] Fps is (10 sec: 5412.4, 60 sec: 5621.0, 300 sec: 5639.7). Total num frames: 439932928. Throughput: 0: 5818.9. Samples: 439934906. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:34:04,233][25689] Avg episode reward: [(0, '-45.042')] [2022-07-09 21:34:04,931][26022] Updated weights on worker 0-0, policy_version 429626 (0.00085) [2022-07-09 21:34:06,769][26022] Updated weights on worker 0-0, policy_version 429636 (0.00085) [2022-07-09 21:34:08,649][26022] Updated weights on worker 0-0, policy_version 429646 (0.00089) [2022-07-09 21:34:09,252][25689] Fps is (10 sec: 5609.9, 60 sec: 5622.6, 300 sec: 5635.0). Total num frames: 439960576. Throughput: 0: 5792.6. Samples: 439968578. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:34:09,253][25689] Avg episode reward: [(0, '-45.441')] [2022-07-09 21:34:10,440][26022] Updated weights on worker 0-0, policy_version 429656 (0.00091) [2022-07-09 21:34:12,221][26022] Updated weights on worker 0-0, policy_version 429666 (0.00095) [2022-07-09 21:34:14,035][26022] Updated weights on worker 0-0, policy_version 429676 (0.00092) [2022-07-09 21:34:14,309][25689] Fps is (10 sec: 5691.3, 60 sec: 5607.6, 300 sec: 5634.7). Total num frames: 439990272. Throughput: 0: 4990.9. Samples: 439985812. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:34:14,309][25689] Avg episode reward: [(0, '-44.977')] [2022-07-09 21:34:15,756][26022] Updated weights on worker 0-0, policy_version 429686 (0.00094) [2022-07-09 21:34:17,724][26022] Updated weights on worker 0-0, policy_version 429696 (0.00080) [2022-07-09 21:34:19,329][25689] Fps is (10 sec: 5690.6, 60 sec: 5641.9, 300 sec: 5635.1). Total num frames: 440017920. Throughput: 0: 5837.2. Samples: 440019876. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-09 21:34:19,331][25689] Avg episode reward: [(0, '-44.605')] [2022-07-09 21:34:19,432][26022] Updated weights on worker 0-0, policy_version 429706 (0.00081) [2022-07-09 21:34:21,243][26022] Updated weights on worker 0-0, policy_version 429716 (0.00085) [2022-07-09 21:34:22,992][26022] Updated weights on worker 0-0, policy_version 429726 (0.00084) [2022-07-09 21:34:24,355][25689] Fps is (10 sec: 5707.9, 60 sec: 5641.2, 300 sec: 5645.1). Total num frames: 440047616. Throughput: 0: 5919.7. Samples: 440054022. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:24,356][25689] Avg episode reward: [(0, '-44.883')] [2022-07-09 21:34:24,909][26022] Updated weights on worker 0-0, policy_version 429736 (0.00081) [2022-07-09 21:34:26,544][26022] Updated weights on worker 0-0, policy_version 429746 (0.00085) [2022-07-09 21:34:28,573][26022] Updated weights on worker 0-0, policy_version 429756 (0.00080) [2022-07-09 21:34:29,359][25689] Fps is (10 sec: 5615.3, 60 sec: 5625.5, 300 sec: 5635.4). Total num frames: 440074240. Throughput: 0: 5090.5. Samples: 440070930. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:29,359][25689] Avg episode reward: [(0, '-44.952')] [2022-07-09 21:34:30,161][26022] Updated weights on worker 0-0, policy_version 429766 (0.00084) [2022-07-09 21:34:32,283][26022] Updated weights on worker 0-0, policy_version 429776 (0.00085) [2022-07-09 21:34:33,751][26022] Updated weights on worker 0-0, policy_version 429786 (0.00381) [2022-07-09 21:34:34,471][25689] Fps is (10 sec: 5466.4, 60 sec: 5620.0, 300 sec: 5633.4). Total num frames: 440102912. Throughput: 0: 5897.7. Samples: 440104722. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:34,471][25689] Avg episode reward: [(0, '-44.520')] [2022-07-09 21:34:35,823][26022] Updated weights on worker 0-0, policy_version 429796 (0.00088) [2022-07-09 21:34:37,567][26022] Updated weights on worker 0-0, policy_version 429806 (0.00091) [2022-07-09 21:34:39,371][26022] Updated weights on worker 0-0, policy_version 429816 (0.00091) [2022-07-09 21:34:39,475][25689] Fps is (10 sec: 5668.6, 60 sec: 5626.7, 300 sec: 5640.2). Total num frames: 440131584. Throughput: 0: 5905.0. Samples: 440138838. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:39,475][25689] Avg episode reward: [(0, '-44.593')] [2022-07-09 21:34:41,204][26022] Updated weights on worker 0-0, policy_version 429826 (0.00096) [2022-07-09 21:34:42,938][26022] Updated weights on worker 0-0, policy_version 429836 (0.00089) [2022-07-09 21:34:44,518][25689] Fps is (10 sec: 5707.4, 60 sec: 5624.1, 300 sec: 5640.0). Total num frames: 440160256. Throughput: 0: 5055.2. Samples: 440155950. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:44,518][25689] Avg episode reward: [(0, '-44.988')] [2022-07-09 21:34:44,784][26022] Updated weights on worker 0-0, policy_version 429846 (0.00087) [2022-07-09 21:34:46,559][26022] Updated weights on worker 0-0, policy_version 429856 (0.00090) [2022-07-09 21:34:48,320][26022] Updated weights on worker 0-0, policy_version 429866 (0.00086) [2022-07-09 21:34:49,571][25689] Fps is (10 sec: 5781.0, 60 sec: 5655.9, 300 sec: 5639.8). Total num frames: 440189952. Throughput: 0: 5896.6. Samples: 440190116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:49,572][25689] Avg episode reward: [(0, '-45.200')] [2022-07-09 21:34:50,118][26022] Updated weights on worker 0-0, policy_version 429876 (0.00089) [2022-07-09 21:34:52,042][26022] Updated weights on worker 0-0, policy_version 429886 (0.00088) [2022-07-09 21:34:53,651][26022] Updated weights on worker 0-0, policy_version 429896 (0.00096) [2022-07-09 21:34:54,639][25689] Fps is (10 sec: 5666.0, 60 sec: 5608.6, 300 sec: 5633.6). Total num frames: 440217600. Throughput: 0: 5921.7. Samples: 440224150. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:54,639][25689] Avg episode reward: [(0, '-45.068')] [2022-07-09 21:34:55,751][26022] Updated weights on worker 0-0, policy_version 429906 (0.00098) [2022-07-09 21:34:57,474][26022] Updated weights on worker 0-0, policy_version 429916 (0.00098) [2022-07-09 21:34:59,247][26022] Updated weights on worker 0-0, policy_version 429926 (0.00087) [2022-07-09 21:34:59,729][25689] Fps is (10 sec: 5645.3, 60 sec: 5668.5, 300 sec: 5643.1). Total num frames: 440247296. Throughput: 0: 5048.5. Samples: 440241092. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:34:59,730][25689] Avg episode reward: [(0, '-45.486')] [2022-07-09 21:35:00,985][26022] Updated weights on worker 0-0, policy_version 429936 (0.00083) [2022-07-09 21:35:03,251][26022] Updated weights on worker 0-0, policy_version 429946 (0.00093) [2022-07-09 21:35:04,744][25689] Fps is (10 sec: 5573.6, 60 sec: 5635.2, 300 sec: 5641.1). Total num frames: 440273920. Throughput: 0: 5803.1. Samples: 440273322. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:04,744][25689] Avg episode reward: [(0, '-46.476')] [2022-07-09 21:35:05,001][26022] Updated weights on worker 0-0, policy_version 429956 (0.00092) [2022-07-09 21:35:07,021][26022] Updated weights on worker 0-0, policy_version 429966 (0.00083) [2022-07-09 21:35:08,390][26022] Updated weights on worker 0-0, policy_version 429976 (0.00096) [2022-07-09 21:35:09,748][25689] Fps is (10 sec: 5212.6, 60 sec: 5602.8, 300 sec: 5624.9). Total num frames: 440299520. Throughput: 0: 5810.4. Samples: 440307350. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:09,750][25689] Avg episode reward: [(0, '-46.487')] [2022-07-09 21:35:10,555][26022] Updated weights on worker 0-0, policy_version 429986 (0.00089) [2022-07-09 21:35:12,145][26022] Updated weights on worker 0-0, policy_version 429996 (0.00088) [2022-07-09 21:35:14,058][26022] Updated weights on worker 0-0, policy_version 430006 (0.00084) [2022-07-09 21:35:14,812][25689] Fps is (10 sec: 5695.3, 60 sec: 5635.9, 300 sec: 5641.4). Total num frames: 440331264. Throughput: 0: 4962.9. Samples: 440324270. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:14,813][25689] Avg episode reward: [(0, '-46.703')] [2022-07-09 21:35:15,816][26022] Updated weights on worker 0-0, policy_version 430016 (0.00092) [2022-07-09 21:35:17,524][26022] Updated weights on worker 0-0, policy_version 430026 (0.00091) [2022-07-09 21:35:19,506][26022] Updated weights on worker 0-0, policy_version 430036 (0.00088) [2022-07-09 21:35:19,816][25689] Fps is (10 sec: 5899.3, 60 sec: 5637.5, 300 sec: 5638.1). Total num frames: 440358912. Throughput: 0: 5845.2. Samples: 440358500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:19,816][25689] Avg episode reward: [(0, '-46.832')] [2022-07-09 21:35:21,217][26022] Updated weights on worker 0-0, policy_version 430046 (0.00085) [2022-07-09 21:35:23,001][26022] Updated weights on worker 0-0, policy_version 430056 (0.00081) [2022-07-09 21:35:24,758][26022] Updated weights on worker 0-0, policy_version 430066 (0.00089) [2022-07-09 21:35:24,829][25689] Fps is (10 sec: 5622.4, 60 sec: 5621.7, 300 sec: 5638.5). Total num frames: 440387584. Throughput: 0: 5953.0. Samples: 440392892. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:24,831][25689] Avg episode reward: [(0, '-46.244')] [2022-07-09 21:35:26,568][26022] Updated weights on worker 0-0, policy_version 430076 (0.00088) [2022-07-09 21:35:28,241][26022] Updated weights on worker 0-0, policy_version 430086 (0.00088) [2022-07-09 21:35:29,861][25689] Fps is (10 sec: 5708.5, 60 sec: 5652.9, 300 sec: 5640.0). Total num frames: 440416256. Throughput: 0: 5101.4. Samples: 440409954. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:29,861][25689] Avg episode reward: [(0, '-46.525')] [2022-07-09 21:35:30,350][26022] Updated weights on worker 0-0, policy_version 430096 (0.00093) [2022-07-09 21:35:32,047][26022] Updated weights on worker 0-0, policy_version 430106 (0.00096) [2022-07-09 21:35:33,545][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:35:33,554][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000430114_440436736.pth [2022-07-09 21:35:33,557][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000428129_438404096.pth [2022-07-09 21:35:33,762][26022] Updated weights on worker 0-0, policy_version 430116 (0.00082) [2022-07-09 21:35:34,893][25689] Fps is (10 sec: 5698.2, 60 sec: 5660.4, 300 sec: 5644.0). Total num frames: 440444928. Throughput: 0: 5967.1. Samples: 440444092. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:34,893][25689] Avg episode reward: [(0, '-46.112')] [2022-07-09 21:35:35,646][26022] Updated weights on worker 0-0, policy_version 430126 (0.00087) [2022-07-09 21:35:37,311][26022] Updated weights on worker 0-0, policy_version 430136 (0.00090) [2022-07-09 21:35:39,353][26022] Updated weights on worker 0-0, policy_version 430146 (0.00087) [2022-07-09 21:35:39,965][25689] Fps is (10 sec: 5675.2, 60 sec: 5654.1, 300 sec: 5640.6). Total num frames: 440473600. Throughput: 0: 5929.7. Samples: 440477982. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:39,967][25689] Avg episode reward: [(0, '-46.412')] [2022-07-09 21:35:41,019][26022] Updated weights on worker 0-0, policy_version 430156 (0.00056) [2022-07-09 21:35:42,733][26022] Updated weights on worker 0-0, policy_version 430166 (0.00083) [2022-07-09 21:35:44,614][26022] Updated weights on worker 0-0, policy_version 430176 (0.00091) [2022-07-09 21:35:44,998][25689] Fps is (10 sec: 5472.0, 60 sec: 5621.1, 300 sec: 5636.9). Total num frames: 440500224. Throughput: 0: 5072.9. Samples: 440495206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:44,999][25689] Avg episode reward: [(0, '-46.404')] [2022-07-09 21:35:46,285][26022] Updated weights on worker 0-0, policy_version 430186 (0.00088) [2022-07-09 21:35:48,350][26022] Updated weights on worker 0-0, policy_version 430196 (0.00090) [2022-07-09 21:35:50,003][25689] Fps is (10 sec: 5610.6, 60 sec: 5625.6, 300 sec: 5638.5). Total num frames: 440529920. Throughput: 0: 5920.3. Samples: 440529202. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:50,004][25689] Avg episode reward: [(0, '-47.185')] [2022-07-09 21:35:50,166][26022] Updated weights on worker 0-0, policy_version 430206 (0.00083) [2022-07-09 21:35:51,721][26022] Updated weights on worker 0-0, policy_version 430216 (0.00085) [2022-07-09 21:35:53,948][26022] Updated weights on worker 0-0, policy_version 430226 (0.00087) [2022-07-09 21:35:55,121][25689] Fps is (10 sec: 5866.9, 60 sec: 5654.8, 300 sec: 5636.8). Total num frames: 440559616. Throughput: 0: 5890.4. Samples: 440563246. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:35:55,122][25689] Avg episode reward: [(0, '-47.471')] [2022-07-09 21:35:55,340][26022] Updated weights on worker 0-0, policy_version 430236 (0.00104) [2022-07-09 21:35:57,487][26022] Updated weights on worker 0-0, policy_version 430246 (0.00089) [2022-07-09 21:35:59,111][26022] Updated weights on worker 0-0, policy_version 430256 (0.00094) [2022-07-09 21:36:00,139][25689] Fps is (10 sec: 5556.5, 60 sec: 5610.7, 300 sec: 5643.8). Total num frames: 440586240. Throughput: 0: 5065.3. Samples: 440580170. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:00,140][25689] Avg episode reward: [(0, '-47.286')] [2022-07-09 21:36:00,971][26022] Updated weights on worker 0-0, policy_version 430266 (0.00085) [2022-07-09 21:36:03,043][26022] Updated weights on worker 0-0, policy_version 430276 (0.00088) [2022-07-09 21:36:05,008][26022] Updated weights on worker 0-0, policy_version 430286 (0.00090) [2022-07-09 21:36:05,159][25689] Fps is (10 sec: 5304.9, 60 sec: 5610.2, 300 sec: 5630.1). Total num frames: 440612864. Throughput: 0: 5801.8. Samples: 440612176. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:05,160][25689] Avg episode reward: [(0, '-46.817')] [2022-07-09 21:36:06,575][26022] Updated weights on worker 0-0, policy_version 430296 (0.00551) [2022-07-09 21:36:08,575][26022] Updated weights on worker 0-0, policy_version 430306 (0.00093) [2022-07-09 21:36:10,191][25689] Fps is (10 sec: 5602.9, 60 sec: 5675.4, 300 sec: 5638.9). Total num frames: 440642560. Throughput: 0: 5795.2. Samples: 440646194. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:10,191][25689] Avg episode reward: [(0, '-46.235')] [2022-07-09 21:36:10,253][26022] Updated weights on worker 0-0, policy_version 430316 (0.00083) [2022-07-09 21:36:12,203][26022] Updated weights on worker 0-0, policy_version 430326 (0.00094) [2022-07-09 21:36:13,918][26022] Updated weights on worker 0-0, policy_version 430336 (0.00100) [2022-07-09 21:36:15,317][25689] Fps is (10 sec: 5745.9, 60 sec: 5618.8, 300 sec: 5637.4). Total num frames: 440671232. Throughput: 0: 5781.2. Samples: 440680002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:15,318][25689] Avg episode reward: [(0, '-46.701')] [2022-07-09 21:36:15,908][26022] Updated weights on worker 0-0, policy_version 430346 (0.00105) [2022-07-09 21:36:17,680][26022] Updated weights on worker 0-0, policy_version 430356 (0.00099) [2022-07-09 21:36:19,363][26022] Updated weights on worker 0-0, policy_version 430366 (0.00085) [2022-07-09 21:36:20,407][25689] Fps is (10 sec: 5613.3, 60 sec: 5627.8, 300 sec: 5636.9). Total num frames: 440699904. Throughput: 0: 5768.4. Samples: 440697082. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:20,407][25689] Avg episode reward: [(0, '-45.718')] [2022-07-09 21:36:21,179][26022] Updated weights on worker 0-0, policy_version 430376 (0.00087) [2022-07-09 21:36:23,008][26022] Updated weights on worker 0-0, policy_version 430386 (0.00081) [2022-07-09 21:36:24,819][26022] Updated weights on worker 0-0, policy_version 430396 (0.00087) [2022-07-09 21:36:25,447][25689] Fps is (10 sec: 5762.0, 60 sec: 5642.2, 300 sec: 5636.8). Total num frames: 440729600. Throughput: 0: 5873.3. Samples: 440731334. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:25,448][25689] Avg episode reward: [(0, '-45.348')] [2022-07-09 21:36:26,604][26022] Updated weights on worker 0-0, policy_version 430406 (0.00084) [2022-07-09 21:36:28,416][26022] Updated weights on worker 0-0, policy_version 430416 (0.00093) [2022-07-09 21:36:30,178][26022] Updated weights on worker 0-0, policy_version 430426 (0.00090) [2022-07-09 21:36:30,512][25689] Fps is (10 sec: 5573.0, 60 sec: 5605.3, 300 sec: 5626.7). Total num frames: 440756224. Throughput: 0: 5869.5. Samples: 440765472. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:30,513][25689] Avg episode reward: [(0, '-44.705')] [2022-07-09 21:36:32,135][26022] Updated weights on worker 0-0, policy_version 430436 (0.00093) [2022-07-09 21:36:34,020][26022] Updated weights on worker 0-0, policy_version 430446 (0.00087) [2022-07-09 21:36:35,542][26022] Updated weights on worker 0-0, policy_version 430456 (0.00089) [2022-07-09 21:36:35,628][25689] Fps is (10 sec: 5733.2, 60 sec: 5648.1, 300 sec: 5643.1). Total num frames: 440787968. Throughput: 0: 5044.1. Samples: 440782450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:35,628][25689] Avg episode reward: [(0, '-44.281')] [2022-07-09 21:36:37,496][26022] Updated weights on worker 0-0, policy_version 430466 (0.00084) [2022-07-09 21:36:39,145][26022] Updated weights on worker 0-0, policy_version 430476 (0.00095) [2022-07-09 21:36:40,675][25689] Fps is (10 sec: 5743.5, 60 sec: 5616.8, 300 sec: 5632.0). Total num frames: 440814592. Throughput: 0: 5897.2. Samples: 440816608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:40,675][25689] Avg episode reward: [(0, '-44.014')] [2022-07-09 21:36:40,943][26022] Updated weights on worker 0-0, policy_version 430486 (0.00422) [2022-07-09 21:36:42,786][26022] Updated weights on worker 0-0, policy_version 430496 (0.00087) [2022-07-09 21:36:44,666][26022] Updated weights on worker 0-0, policy_version 430506 (0.00080) [2022-07-09 21:36:45,734][25689] Fps is (10 sec: 5471.6, 60 sec: 5648.1, 300 sec: 5631.4). Total num frames: 440843264. Throughput: 0: 5902.6. Samples: 440851078. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:45,734][25689] Avg episode reward: [(0, '-43.612')] [2022-07-09 21:36:46,309][26022] Updated weights on worker 0-0, policy_version 430516 (0.00092) [2022-07-09 21:36:48,336][26022] Updated weights on worker 0-0, policy_version 430526 (0.00090) [2022-07-09 21:36:49,999][26022] Updated weights on worker 0-0, policy_version 430536 (0.00090) [2022-07-09 21:36:50,780][25689] Fps is (10 sec: 5775.9, 60 sec: 5644.2, 300 sec: 5635.5). Total num frames: 440872960. Throughput: 0: 5065.9. Samples: 440868146. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:50,781][25689] Avg episode reward: [(0, '-43.763')] [2022-07-09 21:36:52,015][26022] Updated weights on worker 0-0, policy_version 430546 (0.00079) [2022-07-09 21:36:53,586][26022] Updated weights on worker 0-0, policy_version 430556 (0.00088) [2022-07-09 21:36:55,720][26022] Updated weights on worker 0-0, policy_version 430566 (0.00092) [2022-07-09 21:36:55,814][25689] Fps is (10 sec: 5587.2, 60 sec: 5601.5, 300 sec: 5631.7). Total num frames: 440899584. Throughput: 0: 5921.6. Samples: 440901984. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:36:55,816][25689] Avg episode reward: [(0, '-43.397')] [2022-07-09 21:36:57,247][26022] Updated weights on worker 0-0, policy_version 430576 (0.00093) [2022-07-09 21:36:59,138][26022] Updated weights on worker 0-0, policy_version 430586 (0.00089) [2022-07-09 21:37:00,875][25689] Fps is (10 sec: 5578.9, 60 sec: 5648.0, 300 sec: 5640.9). Total num frames: 440929280. Throughput: 0: 5916.3. Samples: 440936120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-09 21:37:00,876][25689] Avg episode reward: [(0, '-43.711')] [2022-07-09 21:37:01,019][26022] Updated weights on worker 0-0, policy_version 430596 (0.00088) [2022-07-09 21:37:03,057][26022] Updated weights on worker 0-0, policy_version 430606 (0.00087) [2022-07-09 21:37:04,874][26022] Updated weights on worker 0-0, policy_version 430616 (0.00083) [2022-07-09 21:37:05,914][25689] Fps is (10 sec: 5677.8, 60 sec: 5663.2, 300 sec: 5633.9). Total num frames: 440956928. Throughput: 0: 4952.8. Samples: 440951024. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:05,915][25689] Avg episode reward: [(0, '-44.359')] [2022-07-09 21:37:06,573][26022] Updated weights on worker 0-0, policy_version 430626 (0.00090) [2022-07-09 21:37:08,366][26022] Updated weights on worker 0-0, policy_version 430636 (0.00088) [2022-07-09 21:37:10,203][26022] Updated weights on worker 0-0, policy_version 430646 (0.00088) [2022-07-09 21:37:10,931][25689] Fps is (10 sec: 5499.2, 60 sec: 5630.8, 300 sec: 5638.1). Total num frames: 440984576. Throughput: 0: 5819.6. Samples: 440985414. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:10,932][25689] Avg episode reward: [(0, '-42.967')] [2022-07-09 21:37:12,121][26022] Updated weights on worker 0-0, policy_version 430656 (0.00105) [2022-07-09 21:37:13,907][26022] Updated weights on worker 0-0, policy_version 430666 (0.00098) [2022-07-09 21:37:15,555][26022] Updated weights on worker 0-0, policy_version 430676 (0.00089) [2022-07-09 21:37:15,975][25689] Fps is (10 sec: 5699.4, 60 sec: 5655.3, 300 sec: 5638.1). Total num frames: 441014272. Throughput: 0: 5830.8. Samples: 441019540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:15,976][25689] Avg episode reward: [(0, '-43.385')] [2022-07-09 21:37:17,323][26022] Updated weights on worker 0-0, policy_version 430686 (0.00086) [2022-07-09 21:37:19,422][26022] Updated weights on worker 0-0, policy_version 430696 (0.00088) [2022-07-09 21:37:20,993][25689] Fps is (10 sec: 5698.8, 60 sec: 5645.1, 300 sec: 5628.0). Total num frames: 441041920. Throughput: 0: 4998.9. Samples: 441036686. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:20,994][25689] Avg episode reward: [(0, '-43.628')] [2022-07-09 21:37:21,174][26022] Updated weights on worker 0-0, policy_version 430706 (0.00085) [2022-07-09 21:37:22,930][26022] Updated weights on worker 0-0, policy_version 430716 (0.00092) [2022-07-09 21:37:24,465][26022] Updated weights on worker 0-0, policy_version 430726 (0.00090) [2022-07-09 21:37:26,007][25689] Fps is (10 sec: 5511.7, 60 sec: 5613.7, 300 sec: 5635.1). Total num frames: 441069568. Throughput: 0: 5963.9. Samples: 441070862. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:26,008][25689] Avg episode reward: [(0, '-44.550')] [2022-07-09 21:37:26,528][26022] Updated weights on worker 0-0, policy_version 430736 (0.00083) [2022-07-09 21:37:28,202][26022] Updated weights on worker 0-0, policy_version 430746 (0.00086) [2022-07-09 21:37:30,016][26022] Updated weights on worker 0-0, policy_version 430756 (0.00094) [2022-07-09 21:37:31,042][25689] Fps is (10 sec: 5910.6, 60 sec: 5701.2, 300 sec: 5639.7). Total num frames: 441101312. Throughput: 0: 5941.0. Samples: 441104892. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:31,042][25689] Avg episode reward: [(0, '-44.884')] [2022-07-09 21:37:31,998][26022] Updated weights on worker 0-0, policy_version 430766 (0.00082) [2022-07-09 21:37:33,621][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:37:33,636][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000430776_441114624.pth [2022-07-09 21:37:33,636][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000428793_439084032.pth [2022-07-09 21:37:33,646][26022] Updated weights on worker 0-0, policy_version 430776 (0.00086) [2022-07-09 21:37:35,481][26022] Updated weights on worker 0-0, policy_version 430786 (0.00084) [2022-07-09 21:37:36,102][25689] Fps is (10 sec: 5782.2, 60 sec: 5621.7, 300 sec: 5636.3). Total num frames: 441127936. Throughput: 0: 5084.6. Samples: 441121876. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:36,102][25689] Avg episode reward: [(0, '-45.678')] [2022-07-09 21:37:37,275][26022] Updated weights on worker 0-0, policy_version 430796 (0.00081) [2022-07-09 21:37:39,083][26022] Updated weights on worker 0-0, policy_version 430806 (0.00086) [2022-07-09 21:37:41,021][26022] Updated weights on worker 0-0, policy_version 430816 (0.00086) [2022-07-09 21:37:41,151][25689] Fps is (10 sec: 5469.5, 60 sec: 5655.4, 300 sec: 5636.0). Total num frames: 441156608. Throughput: 0: 5904.1. Samples: 441155702. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:41,152][25689] Avg episode reward: [(0, '-45.761')] [2022-07-09 21:37:42,663][26022] Updated weights on worker 0-0, policy_version 430826 (0.00093) [2022-07-09 21:37:44,626][26022] Updated weights on worker 0-0, policy_version 430836 (0.00086) [2022-07-09 21:37:46,155][26022] Updated weights on worker 0-0, policy_version 430846 (0.00094) [2022-07-09 21:37:46,189][25689] Fps is (10 sec: 5786.1, 60 sec: 5674.2, 300 sec: 5635.8). Total num frames: 441186304. Throughput: 0: 5911.0. Samples: 441190158. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:46,190][25689] Avg episode reward: [(0, '-45.624')] [2022-07-09 21:37:48,070][26022] Updated weights on worker 0-0, policy_version 430856 (0.00089) [2022-07-09 21:37:49,771][26022] Updated weights on worker 0-0, policy_version 430866 (0.00081) [2022-07-09 21:37:51,205][25689] Fps is (10 sec: 5704.0, 60 sec: 5643.3, 300 sec: 5633.6). Total num frames: 441213952. Throughput: 0: 5079.0. Samples: 441207300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:51,205][25689] Avg episode reward: [(0, '-44.847')] [2022-07-09 21:37:51,606][26022] Updated weights on worker 0-0, policy_version 430876 (0.00087) [2022-07-09 21:37:53,539][26022] Updated weights on worker 0-0, policy_version 430886 (0.00094) [2022-07-09 21:37:55,389][26022] Updated weights on worker 0-0, policy_version 430896 (0.00084) [2022-07-09 21:37:56,334][25689] Fps is (10 sec: 5652.8, 60 sec: 5685.1, 300 sec: 5635.7). Total num frames: 441243648. Throughput: 0: 5927.6. Samples: 441241804. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:37:56,334][25689] Avg episode reward: [(0, '-45.236')] [2022-07-09 21:37:56,935][26022] Updated weights on worker 0-0, policy_version 430906 (0.00080) [2022-07-09 21:37:58,899][26022] Updated weights on worker 0-0, policy_version 430916 (0.00086) [2022-07-09 21:38:00,449][26022] Updated weights on worker 0-0, policy_version 430926 (0.00088) [2022-07-09 21:38:01,360][25689] Fps is (10 sec: 5747.4, 60 sec: 5671.5, 300 sec: 5642.8). Total num frames: 441272320. Throughput: 0: 5970.3. Samples: 441276356. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:01,361][25689] Avg episode reward: [(0, '-44.793')] [2022-07-09 21:38:02,747][26022] Updated weights on worker 0-0, policy_version 430936 (0.00092) [2022-07-09 21:38:04,440][26022] Updated weights on worker 0-0, policy_version 430946 (0.00086) [2022-07-09 21:38:06,372][25689] Fps is (10 sec: 5406.6, 60 sec: 5640.1, 300 sec: 5636.4). Total num frames: 441297920. Throughput: 0: 5008.0. Samples: 441291232. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:06,372][25689] Avg episode reward: [(0, '-45.077')] [2022-07-09 21:38:06,394][26022] Updated weights on worker 0-0, policy_version 430956 (0.00093) [2022-07-09 21:38:08,167][26022] Updated weights on worker 0-0, policy_version 430966 (0.00086) [2022-07-09 21:38:10,170][26022] Updated weights on worker 0-0, policy_version 430976 (0.00090) [2022-07-09 21:38:11,383][25689] Fps is (10 sec: 5415.0, 60 sec: 5657.6, 300 sec: 5630.8). Total num frames: 441326592. Throughput: 0: 5838.2. Samples: 441325104. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:11,385][25689] Avg episode reward: [(0, '-45.405')] [2022-07-09 21:38:11,679][26022] Updated weights on worker 0-0, policy_version 430986 (0.00088) [2022-07-09 21:38:13,734][26022] Updated weights on worker 0-0, policy_version 430996 (0.00082) [2022-07-09 21:38:15,368][26022] Updated weights on worker 0-0, policy_version 431006 (0.00085) [2022-07-09 21:38:16,434][25689] Fps is (10 sec: 5699.3, 60 sec: 5640.1, 300 sec: 5640.6). Total num frames: 441355264. Throughput: 0: 5849.8. Samples: 441359384. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:16,434][25689] Avg episode reward: [(0, '-44.643')] [2022-07-09 21:38:17,200][26022] Updated weights on worker 0-0, policy_version 431016 (0.00083) [2022-07-09 21:38:18,823][26022] Updated weights on worker 0-0, policy_version 431026 (0.01052) [2022-07-09 21:38:20,799][26022] Updated weights on worker 0-0, policy_version 431036 (0.00081) [2022-07-09 21:38:21,449][25689] Fps is (10 sec: 5696.7, 60 sec: 5657.3, 300 sec: 5637.2). Total num frames: 441383936. Throughput: 0: 5835.2. Samples: 441393578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:21,449][25689] Avg episode reward: [(0, '-45.212')] [2022-07-09 21:38:22,599][26022] Updated weights on worker 0-0, policy_version 431046 (0.00084) [2022-07-09 21:38:24,336][26022] Updated weights on worker 0-0, policy_version 431056 (0.00092) [2022-07-09 21:38:26,157][26022] Updated weights on worker 0-0, policy_version 431066 (0.00573) [2022-07-09 21:38:26,467][25689] Fps is (10 sec: 5715.4, 60 sec: 5673.9, 300 sec: 5640.7). Total num frames: 441412608. Throughput: 0: 5965.7. Samples: 441411112. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:26,467][25689] Avg episode reward: [(0, '-45.148')] [2022-07-09 21:38:28,119][26022] Updated weights on worker 0-0, policy_version 431076 (0.00092) [2022-07-09 21:38:29,732][26022] Updated weights on worker 0-0, policy_version 431086 (0.00090) [2022-07-09 21:38:31,490][25689] Fps is (10 sec: 5507.2, 60 sec: 5590.3, 300 sec: 5634.4). Total num frames: 441439232. Throughput: 0: 5962.3. Samples: 441444986. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:31,490][25689] Avg episode reward: [(0, '-44.463')] [2022-07-09 21:38:31,749][26022] Updated weights on worker 0-0, policy_version 431096 (0.00085) [2022-07-09 21:38:33,281][26022] Updated weights on worker 0-0, policy_version 431106 (0.00086) [2022-07-09 21:38:35,400][26022] Updated weights on worker 0-0, policy_version 431116 (0.00090) [2022-07-09 21:38:36,563][25689] Fps is (10 sec: 5781.4, 60 sec: 5673.8, 300 sec: 5644.7). Total num frames: 441470976. Throughput: 0: 5930.2. Samples: 441478754. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:36,563][25689] Avg episode reward: [(0, '-43.467')] [2022-07-09 21:38:37,073][26022] Updated weights on worker 0-0, policy_version 431126 (0.00090) [2022-07-09 21:38:39,051][26022] Updated weights on worker 0-0, policy_version 431136 (0.00094) [2022-07-09 21:38:40,593][26022] Updated weights on worker 0-0, policy_version 431146 (0.00085) [2022-07-09 21:38:41,567][25689] Fps is (10 sec: 5690.6, 60 sec: 5627.2, 300 sec: 5634.6). Total num frames: 441496576. Throughput: 0: 5066.9. Samples: 441495512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:41,567][25689] Avg episode reward: [(0, '-43.717')] [2022-07-09 21:38:42,524][26022] Updated weights on worker 0-0, policy_version 431156 (0.00108) [2022-07-09 21:38:44,242][26022] Updated weights on worker 0-0, policy_version 431166 (0.00084) [2022-07-09 21:38:46,357][26022] Updated weights on worker 0-0, policy_version 431176 (0.00084) [2022-07-09 21:38:46,576][25689] Fps is (10 sec: 5522.4, 60 sec: 5629.9, 300 sec: 5641.9). Total num frames: 441526272. Throughput: 0: 5885.6. Samples: 441529466. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:46,576][25689] Avg episode reward: [(0, '-43.315')] [2022-07-09 21:38:48,033][26022] Updated weights on worker 0-0, policy_version 431186 (0.00330) [2022-07-09 21:38:49,808][26022] Updated weights on worker 0-0, policy_version 431196 (0.00086) [2022-07-09 21:38:51,473][26022] Updated weights on worker 0-0, policy_version 431206 (0.00090) [2022-07-09 21:38:51,590][25689] Fps is (10 sec: 5823.1, 60 sec: 5646.9, 300 sec: 5636.8). Total num frames: 441554944. Throughput: 0: 5876.8. Samples: 441563114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:51,593][25689] Avg episode reward: [(0, '-42.951')] [2022-07-09 21:38:53,473][26022] Updated weights on worker 0-0, policy_version 431216 (0.00087) [2022-07-09 21:38:55,178][26022] Updated weights on worker 0-0, policy_version 431226 (0.00091) [2022-07-09 21:38:56,631][25689] Fps is (10 sec: 5601.1, 60 sec: 5621.2, 300 sec: 5643.0). Total num frames: 441582592. Throughput: 0: 5046.9. Samples: 441580036. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:38:56,631][25689] Avg episode reward: [(0, '-43.235')] [2022-07-09 21:38:57,088][26022] Updated weights on worker 0-0, policy_version 431236 (0.00098) [2022-07-09 21:38:58,846][26022] Updated weights on worker 0-0, policy_version 431246 (0.00082) [2022-07-09 21:39:00,792][26022] Updated weights on worker 0-0, policy_version 431256 (0.00090) [2022-07-09 21:39:01,641][25689] Fps is (10 sec: 5501.5, 60 sec: 5605.7, 300 sec: 5639.8). Total num frames: 441610240. Throughput: 0: 5913.1. Samples: 441614216. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:01,642][25689] Avg episode reward: [(0, '-43.895')] [2022-07-09 21:39:02,779][26022] Updated weights on worker 0-0, policy_version 431266 (0.00088) [2022-07-09 21:39:04,660][26022] Updated weights on worker 0-0, policy_version 431276 (0.00093) [2022-07-09 21:39:06,371][26022] Updated weights on worker 0-0, policy_version 431286 (0.00089) [2022-07-09 21:39:06,652][25689] Fps is (10 sec: 5517.9, 60 sec: 5639.8, 300 sec: 5639.9). Total num frames: 441637888. Throughput: 0: 5830.2. Samples: 441646516. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:06,653][25689] Avg episode reward: [(0, '-44.488')] [2022-07-09 21:39:08,179][26022] Updated weights on worker 0-0, policy_version 431296 (0.00089) [2022-07-09 21:39:10,032][26022] Updated weights on worker 0-0, policy_version 431306 (0.00091) [2022-07-09 21:39:11,656][25689] Fps is (10 sec: 5521.3, 60 sec: 5623.4, 300 sec: 5634.0). Total num frames: 441665536. Throughput: 0: 4997.8. Samples: 441663400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:11,657][25689] Avg episode reward: [(0, '-45.582')] [2022-07-09 21:39:11,850][26022] Updated weights on worker 0-0, policy_version 431316 (0.00093) [2022-07-09 21:39:13,684][26022] Updated weights on worker 0-0, policy_version 431326 (0.00092) [2022-07-09 21:39:15,410][26022] Updated weights on worker 0-0, policy_version 431336 (0.00091) [2022-07-09 21:39:16,707][25689] Fps is (10 sec: 5601.3, 60 sec: 5623.4, 300 sec: 5636.9). Total num frames: 441694208. Throughput: 0: 5860.3. Samples: 441697688. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:16,708][25689] Avg episode reward: [(0, '-46.074')] [2022-07-09 21:39:17,331][26022] Updated weights on worker 0-0, policy_version 431346 (0.00092) [2022-07-09 21:39:19,184][26022] Updated weights on worker 0-0, policy_version 431356 (0.00092) [2022-07-09 21:39:20,723][26022] Updated weights on worker 0-0, policy_version 431366 (0.00091) [2022-07-09 21:39:21,732][25689] Fps is (10 sec: 5691.4, 60 sec: 5622.6, 300 sec: 5633.5). Total num frames: 441722880. Throughput: 0: 5838.1. Samples: 441731506. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:21,732][25689] Avg episode reward: [(0, '-45.039')] [2022-07-09 21:39:22,777][26022] Updated weights on worker 0-0, policy_version 431376 (0.00087) [2022-07-09 21:39:24,470][26022] Updated weights on worker 0-0, policy_version 431386 (0.00091) [2022-07-09 21:39:26,252][26022] Updated weights on worker 0-0, policy_version 431396 (0.00087) [2022-07-09 21:39:26,802][25689] Fps is (10 sec: 5781.6, 60 sec: 5634.6, 300 sec: 5642.5). Total num frames: 441752576. Throughput: 0: 5071.1. Samples: 441748700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:26,803][25689] Avg episode reward: [(0, '-45.039')] [2022-07-09 21:39:28,402][26022] Updated weights on worker 0-0, policy_version 431406 (0.00094) [2022-07-09 21:39:29,919][26022] Updated weights on worker 0-0, policy_version 431416 (0.00083) [2022-07-09 21:39:31,838][25689] Fps is (10 sec: 5674.4, 60 sec: 5650.4, 300 sec: 5640.5). Total num frames: 441780224. Throughput: 0: 5905.0. Samples: 441782572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:31,838][25689] Avg episode reward: [(0, '-44.596')] [2022-07-09 21:39:31,840][26022] Updated weights on worker 0-0, policy_version 431426 (0.00088) [2022-07-09 21:39:33,424][26022] Updated weights on worker 0-0, policy_version 431436 (0.00092) [2022-07-09 21:39:33,806][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:39:33,818][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000431438_441792512.pth [2022-07-09 21:39:33,819][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000429454_439760896.pth [2022-07-09 21:39:35,509][26022] Updated weights on worker 0-0, policy_version 431446 (0.00085) [2022-07-09 21:39:36,878][25689] Fps is (10 sec: 5488.0, 60 sec: 5585.6, 300 sec: 5636.4). Total num frames: 441807872. Throughput: 0: 5897.8. Samples: 441816654. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:36,879][25689] Avg episode reward: [(0, '-44.499')] [2022-07-09 21:39:37,155][26022] Updated weights on worker 0-0, policy_version 431456 (0.00100) [2022-07-09 21:39:39,123][26022] Updated weights on worker 0-0, policy_version 431466 (0.00094) [2022-07-09 21:39:40,761][26022] Updated weights on worker 0-0, policy_version 431476 (0.00092) [2022-07-09 21:39:41,926][25689] Fps is (10 sec: 5582.5, 60 sec: 5632.4, 300 sec: 5636.3). Total num frames: 441836544. Throughput: 0: 5051.2. Samples: 441833512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:41,927][25689] Avg episode reward: [(0, '-44.454')] [2022-07-09 21:39:42,539][26022] Updated weights on worker 0-0, policy_version 431486 (0.00097) [2022-07-09 21:39:44,415][26022] Updated weights on worker 0-0, policy_version 431496 (0.00085) [2022-07-09 21:39:46,393][26022] Updated weights on worker 0-0, policy_version 431506 (0.00095) [2022-07-09 21:39:46,967][25689] Fps is (10 sec: 5683.8, 60 sec: 5612.4, 300 sec: 5633.1). Total num frames: 441865216. Throughput: 0: 5895.6. Samples: 441867584. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-09 21:39:46,968][25689] Avg episode reward: [(0, '-45.068')] [2022-07-09 21:39:48,165][26022] Updated weights on worker 0-0, policy_version 431516 (0.00094) [2022-07-09 21:39:49,705][26022] Updated weights on worker 0-0, policy_version 431526 (0.00089) [2022-07-09 21:39:51,574][26022] Updated weights on worker 0-0, policy_version 431536 (0.00092) [2022-07-09 21:39:52,061][25689] Fps is (10 sec: 5759.2, 60 sec: 5622.0, 300 sec: 5639.5). Total num frames: 441894912. Throughput: 0: 5901.9. Samples: 441901928. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:39:52,062][25689] Avg episode reward: [(0, '-45.116')] [2022-07-09 21:39:53,420][26022] Updated weights on worker 0-0, policy_version 431546 (0.00086) [2022-07-09 21:39:55,264][26022] Updated weights on worker 0-0, policy_version 431556 (0.00091) [2022-07-09 21:39:56,896][26022] Updated weights on worker 0-0, policy_version 431566 (0.00098) [2022-07-09 21:39:57,161][25689] Fps is (10 sec: 5826.4, 60 sec: 5650.3, 300 sec: 5639.3). Total num frames: 441924608. Throughput: 0: 5056.0. Samples: 441919204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:39:57,162][25689] Avg episode reward: [(0, '-44.831')] [2022-07-09 21:39:58,883][26022] Updated weights on worker 0-0, policy_version 431576 (0.00087) [2022-07-09 21:40:00,479][26022] Updated weights on worker 0-0, policy_version 431586 (0.00085) [2022-07-09 21:40:02,207][25689] Fps is (10 sec: 5551.2, 60 sec: 5630.1, 300 sec: 5638.7). Total num frames: 441951232. Throughput: 0: 5909.3. Samples: 441953356. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:02,207][25689] Avg episode reward: [(0, '-45.515')] [2022-07-09 21:40:02,883][26022] Updated weights on worker 0-0, policy_version 431596 (0.00090) [2022-07-09 21:40:04,604][26022] Updated weights on worker 0-0, policy_version 431606 (0.00091) [2022-07-09 21:40:06,343][26022] Updated weights on worker 0-0, policy_version 431616 (0.00083) [2022-07-09 21:40:07,307][25689] Fps is (10 sec: 5248.3, 60 sec: 5605.0, 300 sec: 5640.3). Total num frames: 441977856. Throughput: 0: 5776.3. Samples: 441985070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:07,307][25689] Avg episode reward: [(0, '-44.786')] [2022-07-09 21:40:08,223][26022] Updated weights on worker 0-0, policy_version 431626 (0.00089) [2022-07-09 21:40:10,134][26022] Updated weights on worker 0-0, policy_version 431636 (0.00091) [2022-07-09 21:40:11,897][26022] Updated weights on worker 0-0, policy_version 431646 (0.00084) [2022-07-09 21:40:12,337][25689] Fps is (10 sec: 5559.8, 60 sec: 5636.4, 300 sec: 5634.1). Total num frames: 442007552. Throughput: 0: 4937.5. Samples: 442002038. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:12,337][25689] Avg episode reward: [(0, '-43.862')] [2022-07-09 21:40:13,648][26022] Updated weights on worker 0-0, policy_version 431656 (0.00082) [2022-07-09 21:40:15,488][26022] Updated weights on worker 0-0, policy_version 431666 (0.00087) [2022-07-09 21:40:17,314][26022] Updated weights on worker 0-0, policy_version 431676 (0.00637) [2022-07-09 21:40:17,403][25689] Fps is (10 sec: 5781.3, 60 sec: 5634.9, 300 sec: 5636.4). Total num frames: 442036224. Throughput: 0: 5763.6. Samples: 442035870. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:17,403][25689] Avg episode reward: [(0, '-44.296')] [2022-07-09 21:40:19,234][26022] Updated weights on worker 0-0, policy_version 431686 (0.00087) [2022-07-09 21:40:20,727][26022] Updated weights on worker 0-0, policy_version 431696 (0.00086) [2022-07-09 21:40:22,468][25689] Fps is (10 sec: 5660.1, 60 sec: 5631.2, 300 sec: 5635.4). Total num frames: 442064896. Throughput: 0: 5766.4. Samples: 442070190. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:22,468][25689] Avg episode reward: [(0, '-44.213')] [2022-07-09 21:40:22,821][26022] Updated weights on worker 0-0, policy_version 431706 (0.00090) [2022-07-09 21:40:24,315][26022] Updated weights on worker 0-0, policy_version 431716 (0.00086) [2022-07-09 21:40:26,311][26022] Updated weights on worker 0-0, policy_version 431726 (0.00086) [2022-07-09 21:40:27,496][25689] Fps is (10 sec: 5681.4, 60 sec: 5618.2, 300 sec: 5635.5). Total num frames: 442093568. Throughput: 0: 5063.0. Samples: 442087288. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:27,497][25689] Avg episode reward: [(0, '-44.292')] [2022-07-09 21:40:28,070][26022] Updated weights on worker 0-0, policy_version 431736 (0.00064) [2022-07-09 21:40:29,880][26022] Updated weights on worker 0-0, policy_version 431746 (0.00087) [2022-07-09 21:40:31,807][26022] Updated weights on worker 0-0, policy_version 431756 (0.00086) [2022-07-09 21:40:32,538][25689] Fps is (10 sec: 5694.3, 60 sec: 5634.5, 300 sec: 5635.3). Total num frames: 442122240. Throughput: 0: 5903.8. Samples: 442121304. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:32,539][25689] Avg episode reward: [(0, '-43.624')] [2022-07-09 21:40:33,583][26022] Updated weights on worker 0-0, policy_version 431766 (0.00091) [2022-07-09 21:40:35,212][26022] Updated weights on worker 0-0, policy_version 431776 (0.00091) [2022-07-09 21:40:37,184][26022] Updated weights on worker 0-0, policy_version 431786 (0.00089) [2022-07-09 21:40:37,609][25689] Fps is (10 sec: 5569.4, 60 sec: 5631.7, 300 sec: 5631.9). Total num frames: 442149888. Throughput: 0: 5916.9. Samples: 442155426. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:37,609][25689] Avg episode reward: [(0, '-43.158')] [2022-07-09 21:40:38,897][26022] Updated weights on worker 0-0, policy_version 431796 (0.00094) [2022-07-09 21:40:40,723][26022] Updated weights on worker 0-0, policy_version 431806 (0.00088) [2022-07-09 21:40:42,556][26022] Updated weights on worker 0-0, policy_version 431816 (0.00084) [2022-07-09 21:40:42,625][25689] Fps is (10 sec: 5685.2, 60 sec: 5651.5, 300 sec: 5642.5). Total num frames: 442179584. Throughput: 0: 5088.6. Samples: 442172760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:42,627][25689] Avg episode reward: [(0, '-44.121')] [2022-07-09 21:40:44,163][26022] Updated weights on worker 0-0, policy_version 431826 (0.00090) [2022-07-09 21:40:46,151][26022] Updated weights on worker 0-0, policy_version 431836 (0.00085) [2022-07-09 21:40:47,695][25689] Fps is (10 sec: 5888.1, 60 sec: 5665.7, 300 sec: 5641.3). Total num frames: 442209280. Throughput: 0: 5921.2. Samples: 442206892. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:47,696][25689] Avg episode reward: [(0, '-43.627')] [2022-07-09 21:40:47,751][26022] Updated weights on worker 0-0, policy_version 431846 (0.00083) [2022-07-09 21:40:49,777][26022] Updated weights on worker 0-0, policy_version 431856 (0.00087) [2022-07-09 21:40:51,474][26022] Updated weights on worker 0-0, policy_version 431866 (0.00083) [2022-07-09 21:40:52,726][25689] Fps is (10 sec: 5677.0, 60 sec: 5637.8, 300 sec: 5636.0). Total num frames: 442236928. Throughput: 0: 5935.0. Samples: 442241118. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:52,727][25689] Avg episode reward: [(0, '-44.070')] [2022-07-09 21:40:53,436][26022] Updated weights on worker 0-0, policy_version 431876 (0.00091) [2022-07-09 21:40:54,959][26022] Updated weights on worker 0-0, policy_version 431886 (0.00084) [2022-07-09 21:40:57,126][26022] Updated weights on worker 0-0, policy_version 431896 (0.00093) [2022-07-09 21:40:57,791][25689] Fps is (10 sec: 5578.9, 60 sec: 5624.2, 300 sec: 5642.0). Total num frames: 442265600. Throughput: 0: 5932.5. Samples: 442275156. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:40:57,791][25689] Avg episode reward: [(0, '-44.162')] [2022-07-09 21:40:58,520][26022] Updated weights on worker 0-0, policy_version 431906 (0.00091) [2022-07-09 21:41:00,679][26022] Updated weights on worker 0-0, policy_version 431916 (0.00089) [2022-07-09 21:41:02,528][26022] Updated weights on worker 0-0, policy_version 431926 (0.00084) [2022-07-09 21:41:02,810][25689] Fps is (10 sec: 5585.2, 60 sec: 5643.6, 300 sec: 5645.5). Total num frames: 442293248. Throughput: 0: 5929.6. Samples: 442292448. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:02,810][25689] Avg episode reward: [(0, '-45.436')] [2022-07-09 21:41:04,400][26022] Updated weights on worker 0-0, policy_version 431936 (0.00088) [2022-07-09 21:41:06,374][26022] Updated weights on worker 0-0, policy_version 431946 (0.00093) [2022-07-09 21:41:07,819][25689] Fps is (10 sec: 5514.2, 60 sec: 5669.0, 300 sec: 5639.0). Total num frames: 442320896. Throughput: 0: 5855.8. Samples: 442324730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:07,819][25689] Avg episode reward: [(0, '-46.403')] [2022-07-09 21:41:08,003][26022] Updated weights on worker 0-0, policy_version 431956 (0.00088) [2022-07-09 21:41:09,871][26022] Updated weights on worker 0-0, policy_version 431966 (0.00085) [2022-07-09 21:41:11,799][26022] Updated weights on worker 0-0, policy_version 431976 (0.00092) [2022-07-09 21:41:12,854][25689] Fps is (10 sec: 5606.8, 60 sec: 5651.5, 300 sec: 5640.8). Total num frames: 442349568. Throughput: 0: 5850.9. Samples: 442358890. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:12,855][25689] Avg episode reward: [(0, '-46.645')] [2022-07-09 21:41:13,547][26022] Updated weights on worker 0-0, policy_version 431986 (0.00086) [2022-07-09 21:41:15,380][26022] Updated weights on worker 0-0, policy_version 431996 (0.00086) [2022-07-09 21:41:17,083][26022] Updated weights on worker 0-0, policy_version 432006 (0.00093) [2022-07-09 21:41:17,903][25689] Fps is (10 sec: 5788.2, 60 sec: 5670.1, 300 sec: 5645.0). Total num frames: 442379264. Throughput: 0: 5008.1. Samples: 442375878. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:17,903][25689] Avg episode reward: [(0, '-46.413')] [2022-07-09 21:41:18,939][26022] Updated weights on worker 0-0, policy_version 432016 (0.00087) [2022-07-09 21:41:20,712][26022] Updated weights on worker 0-0, policy_version 432026 (0.00087) [2022-07-09 21:41:22,407][26022] Updated weights on worker 0-0, policy_version 432036 (0.00086) [2022-07-09 21:41:22,922][25689] Fps is (10 sec: 5695.8, 60 sec: 5657.4, 300 sec: 5638.5). Total num frames: 442406912. Throughput: 0: 5859.6. Samples: 442410300. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:22,927][25689] Avg episode reward: [(0, '-46.611')] [2022-07-09 21:41:24,130][26022] Updated weights on worker 0-0, policy_version 432046 (0.00092) [2022-07-09 21:41:26,015][26022] Updated weights on worker 0-0, policy_version 432056 (0.00066) [2022-07-09 21:41:27,750][26022] Updated weights on worker 0-0, policy_version 432066 (0.00095) [2022-07-09 21:41:27,931][25689] Fps is (10 sec: 5615.9, 60 sec: 5659.2, 300 sec: 5646.4). Total num frames: 442435584. Throughput: 0: 5965.6. Samples: 442444714. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:27,932][25689] Avg episode reward: [(0, '-46.302')] [2022-07-09 21:41:29,766][26022] Updated weights on worker 0-0, policy_version 432076 (0.00085) [2022-07-09 21:41:31,395][26022] Updated weights on worker 0-0, policy_version 432086 (0.00085) [2022-07-09 21:41:32,940][25689] Fps is (10 sec: 5724.1, 60 sec: 5662.3, 300 sec: 5638.1). Total num frames: 442464256. Throughput: 0: 5126.0. Samples: 442461850. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:32,942][25689] Avg episode reward: [(0, '-45.433')] [2022-07-09 21:41:33,292][26022] Updated weights on worker 0-0, policy_version 432096 (0.00090) [2022-07-09 21:41:33,889][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:41:33,895][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000432100_442470400.pth [2022-07-09 21:41:33,896][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000430114_440436736.pth [2022-07-09 21:41:34,923][26022] Updated weights on worker 0-0, policy_version 432106 (0.00089) [2022-07-09 21:41:36,710][26022] Updated weights on worker 0-0, policy_version 432116 (0.00088) [2022-07-09 21:41:38,005][25689] Fps is (10 sec: 5692.5, 60 sec: 5679.8, 300 sec: 5644.7). Total num frames: 442492928. Throughput: 0: 5990.5. Samples: 442496302. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:38,005][25689] Avg episode reward: [(0, '-44.805')] [2022-07-09 21:41:38,547][26022] Updated weights on worker 0-0, policy_version 432126 (0.00101) [2022-07-09 21:41:40,237][26022] Updated weights on worker 0-0, policy_version 432136 (0.00084) [2022-07-09 21:41:42,199][26022] Updated weights on worker 0-0, policy_version 432146 (0.00083) [2022-07-09 21:41:43,014][25689] Fps is (10 sec: 5794.3, 60 sec: 5680.5, 300 sec: 5649.1). Total num frames: 442522624. Throughput: 0: 5979.1. Samples: 442530428. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:43,014][25689] Avg episode reward: [(0, '-45.398')] [2022-07-09 21:41:43,916][26022] Updated weights on worker 0-0, policy_version 432156 (0.00093) [2022-07-09 21:41:45,729][26022] Updated weights on worker 0-0, policy_version 432166 (0.00090) [2022-07-09 21:41:47,671][26022] Updated weights on worker 0-0, policy_version 432176 (0.00096) [2022-07-09 21:41:48,074][25689] Fps is (10 sec: 5593.2, 60 sec: 5630.6, 300 sec: 5638.5). Total num frames: 442549248. Throughput: 0: 5088.1. Samples: 442547204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:48,075][25689] Avg episode reward: [(0, '-45.583')] [2022-07-09 21:41:49,327][26022] Updated weights on worker 0-0, policy_version 432186 (0.00082) [2022-07-09 21:41:51,174][26022] Updated weights on worker 0-0, policy_version 432196 (0.00085) [2022-07-09 21:41:52,975][26022] Updated weights on worker 0-0, policy_version 432206 (0.00090) [2022-07-09 21:41:53,106][25689] Fps is (10 sec: 5682.2, 60 sec: 5681.4, 300 sec: 5652.3). Total num frames: 442579968. Throughput: 0: 5962.3. Samples: 442582082. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:53,106][25689] Avg episode reward: [(0, '-45.965')] [2022-07-09 21:41:54,846][26022] Updated weights on worker 0-0, policy_version 432216 (0.00080) [2022-07-09 21:41:56,447][26022] Updated weights on worker 0-0, policy_version 432226 (0.00089) [2022-07-09 21:41:58,202][25689] Fps is (10 sec: 5864.5, 60 sec: 5678.4, 300 sec: 5648.2). Total num frames: 442608640. Throughput: 0: 5947.5. Samples: 442616422. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:41:58,202][25689] Avg episode reward: [(0, '-47.321')] [2022-07-09 21:41:58,507][26022] Updated weights on worker 0-0, policy_version 432236 (0.00085) [2022-07-09 21:42:00,033][26022] Updated weights on worker 0-0, policy_version 432246 (0.00088) [2022-07-09 21:42:02,257][26022] Updated weights on worker 0-0, policy_version 432256 (0.00088) [2022-07-09 21:42:03,212][25689] Fps is (10 sec: 5471.6, 60 sec: 5662.3, 300 sec: 5645.3). Total num frames: 442635264. Throughput: 0: 5107.4. Samples: 442633586. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:03,213][25689] Avg episode reward: [(0, '-47.316')] [2022-07-09 21:42:03,931][26022] Updated weights on worker 0-0, policy_version 432266 (0.00086) [2022-07-09 21:42:05,839][26022] Updated weights on worker 0-0, policy_version 432276 (0.00098) [2022-07-09 21:42:07,526][26022] Updated weights on worker 0-0, policy_version 432286 (0.00087) [2022-07-09 21:42:08,221][25689] Fps is (10 sec: 5620.9, 60 sec: 5696.2, 300 sec: 5652.3). Total num frames: 442664960. Throughput: 0: 5900.1. Samples: 442666074. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:08,222][25689] Avg episode reward: [(0, '-46.998')] [2022-07-09 21:42:09,456][26022] Updated weights on worker 0-0, policy_version 432296 (0.00089) [2022-07-09 21:42:11,044][26022] Updated weights on worker 0-0, policy_version 432306 (0.00085) [2022-07-09 21:42:13,201][26022] Updated weights on worker 0-0, policy_version 432316 (0.00088) [2022-07-09 21:42:13,259][25689] Fps is (10 sec: 5605.4, 60 sec: 5662.1, 300 sec: 5642.1). Total num frames: 442691584. Throughput: 0: 5878.0. Samples: 442700544. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:13,260][25689] Avg episode reward: [(0, '-46.821')] [2022-07-09 21:42:14,695][26022] Updated weights on worker 0-0, policy_version 432326 (0.00095) [2022-07-09 21:42:16,514][26022] Updated weights on worker 0-0, policy_version 432336 (0.00084) [2022-07-09 21:42:18,302][25689] Fps is (10 sec: 5587.2, 60 sec: 5662.6, 300 sec: 5648.5). Total num frames: 442721280. Throughput: 0: 5028.3. Samples: 442717490. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:18,302][25689] Avg episode reward: [(0, '-46.673')] [2022-07-09 21:42:18,351][26022] Updated weights on worker 0-0, policy_version 432346 (0.00089) [2022-07-09 21:42:20,170][26022] Updated weights on worker 0-0, policy_version 432356 (0.00082) [2022-07-09 21:42:22,134][26022] Updated weights on worker 0-0, policy_version 432366 (0.00094) [2022-07-09 21:42:23,311][25689] Fps is (10 sec: 5908.8, 60 sec: 5697.5, 300 sec: 5655.5). Total num frames: 442750976. Throughput: 0: 5877.9. Samples: 442751726. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:23,311][25689] Avg episode reward: [(0, '-47.304')] [2022-07-09 21:42:23,515][26022] Updated weights on worker 0-0, policy_version 432376 (0.00089) [2022-07-09 21:42:25,709][26022] Updated weights on worker 0-0, policy_version 432386 (0.00099) [2022-07-09 21:42:27,165][26022] Updated weights on worker 0-0, policy_version 432396 (0.00091) [2022-07-09 21:42:28,316][25689] Fps is (10 sec: 5726.3, 60 sec: 5680.9, 300 sec: 5642.3). Total num frames: 442778624. Throughput: 0: 5971.9. Samples: 442786076. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:28,316][25689] Avg episode reward: [(0, '-46.984')] [2022-07-09 21:42:29,138][26022] Updated weights on worker 0-0, policy_version 432406 (0.00087) [2022-07-09 21:42:31,015][26022] Updated weights on worker 0-0, policy_version 432416 (0.00091) [2022-07-09 21:42:32,604][26022] Updated weights on worker 0-0, policy_version 432426 (0.00091) [2022-07-09 21:42:33,326][25689] Fps is (10 sec: 5623.1, 60 sec: 5680.8, 300 sec: 5650.1). Total num frames: 442807296. Throughput: 0: 5105.8. Samples: 442803004. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-09 21:42:33,327][25689] Avg episode reward: [(0, '-46.218')] [2022-07-09 21:42:34,814][26022] Updated weights on worker 0-0, policy_version 432436 (0.00096) [2022-07-09 21:42:36,293][26022] Updated weights on worker 0-0, policy_version 432446 (0.00088) [2022-07-09 21:42:38,138][26022] Updated weights on worker 0-0, policy_version 432456 (0.00086) [2022-07-09 21:42:38,364][25689] Fps is (10 sec: 5706.7, 60 sec: 5683.3, 300 sec: 5650.4). Total num frames: 442835968. Throughput: 0: 5958.8. Samples: 442837040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:42:38,365][25689] Avg episode reward: [(0, '-45.977')] [2022-07-09 21:42:40,031][26022] Updated weights on worker 0-0, policy_version 432466 (0.00080) [2022-07-09 21:42:41,671][26022] Updated weights on worker 0-0, policy_version 432476 (0.00095) [2022-07-09 21:42:43,397][25689] Fps is (10 sec: 5694.2, 60 sec: 5664.1, 300 sec: 5647.0). Total num frames: 442864640. Throughput: 0: 5961.7. Samples: 442871476. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:42:43,398][25689] Avg episode reward: [(0, '-46.261')] [2022-07-09 21:42:43,710][26022] Updated weights on worker 0-0, policy_version 432486 (0.00087) [2022-07-09 21:42:45,240][26022] Updated weights on worker 0-0, policy_version 432496 (0.00090) [2022-07-09 21:42:47,144][26022] Updated weights on worker 0-0, policy_version 432506 (0.00228) [2022-07-09 21:42:48,404][25689] Fps is (10 sec: 5609.9, 60 sec: 5686.1, 300 sec: 5647.2). Total num frames: 442892288. Throughput: 0: 5102.0. Samples: 442888564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:42:48,405][25689] Avg episode reward: [(0, '-46.754')] [2022-07-09 21:42:49,006][26022] Updated weights on worker 0-0, policy_version 432516 (0.00087) [2022-07-09 21:42:50,682][26022] Updated weights on worker 0-0, policy_version 432526 (0.00081) [2022-07-09 21:42:52,569][26022] Updated weights on worker 0-0, policy_version 432536 (0.00094) [2022-07-09 21:42:53,412][25689] Fps is (10 sec: 5725.9, 60 sec: 5671.3, 300 sec: 5649.5). Total num frames: 442921984. Throughput: 0: 5972.8. Samples: 442922970. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:42:53,412][25689] Avg episode reward: [(0, '-45.413')] [2022-07-09 21:42:54,383][26022] Updated weights on worker 0-0, policy_version 432546 (0.00086) [2022-07-09 21:42:56,047][26022] Updated weights on worker 0-0, policy_version 432556 (0.00093) [2022-07-09 21:42:57,811][26022] Updated weights on worker 0-0, policy_version 432566 (0.00084) [2022-07-09 21:42:58,484][25689] Fps is (10 sec: 5689.0, 60 sec: 5656.6, 300 sec: 5645.2). Total num frames: 442949632. Throughput: 0: 5970.5. Samples: 442957162. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:42:58,484][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 21:42:59,681][26022] Updated weights on worker 0-0, policy_version 432576 (0.00090) [2022-07-09 21:43:01,847][26022] Updated weights on worker 0-0, policy_version 432586 (0.00092) [2022-07-09 21:43:03,493][25689] Fps is (10 sec: 5485.3, 60 sec: 5673.7, 300 sec: 5652.1). Total num frames: 442977280. Throughput: 0: 5116.4. Samples: 442974294. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:03,493][25689] Avg episode reward: [(0, '-45.723')] [2022-07-09 21:43:03,544][26022] Updated weights on worker 0-0, policy_version 432596 (0.00089) [2022-07-09 21:43:05,352][26022] Updated weights on worker 0-0, policy_version 432606 (0.00094) [2022-07-09 21:43:07,139][26022] Updated weights on worker 0-0, policy_version 432616 (0.00088) [2022-07-09 21:43:08,496][25689] Fps is (10 sec: 5625.0, 60 sec: 5657.3, 300 sec: 5652.3). Total num frames: 443005952. Throughput: 0: 5875.1. Samples: 443006608. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:08,496][25689] Avg episode reward: [(0, '-45.445')] [2022-07-09 21:43:09,315][26022] Updated weights on worker 0-0, policy_version 432626 (0.00091) [2022-07-09 21:43:10,788][26022] Updated weights on worker 0-0, policy_version 432636 (0.00088) [2022-07-09 21:43:12,571][26022] Updated weights on worker 0-0, policy_version 432646 (0.00094) [2022-07-09 21:43:13,520][25689] Fps is (10 sec: 5616.7, 60 sec: 5675.6, 300 sec: 5649.3). Total num frames: 443033600. Throughput: 0: 5857.2. Samples: 443040746. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:13,522][25689] Avg episode reward: [(0, '-44.568')] [2022-07-09 21:43:14,278][26022] Updated weights on worker 0-0, policy_version 432656 (0.00087) [2022-07-09 21:43:16,155][26022] Updated weights on worker 0-0, policy_version 432666 (0.00085) [2022-07-09 21:43:18,209][26022] Updated weights on worker 0-0, policy_version 432676 (0.00093) [2022-07-09 21:43:18,611][25689] Fps is (10 sec: 5466.9, 60 sec: 5637.1, 300 sec: 5644.4). Total num frames: 443061248. Throughput: 0: 4987.5. Samples: 443057546. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:18,611][25689] Avg episode reward: [(0, '-44.607')] [2022-07-09 21:43:19,754][26022] Updated weights on worker 0-0, policy_version 432686 (0.00099) [2022-07-09 21:43:21,683][26022] Updated weights on worker 0-0, policy_version 432696 (0.00083) [2022-07-09 21:43:23,188][26022] Updated weights on worker 0-0, policy_version 432706 (0.00091) [2022-07-09 21:43:23,632][25689] Fps is (10 sec: 5772.2, 60 sec: 5652.9, 300 sec: 5651.3). Total num frames: 443091968. Throughput: 0: 5848.3. Samples: 443092074. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:23,632][25689] Avg episode reward: [(0, '-45.111')] [2022-07-09 21:43:25,203][26022] Updated weights on worker 0-0, policy_version 432716 (0.00096) [2022-07-09 21:43:27,051][26022] Updated weights on worker 0-0, policy_version 432726 (0.00072) [2022-07-09 21:43:28,658][25689] Fps is (10 sec: 5911.3, 60 sec: 5667.9, 300 sec: 5658.1). Total num frames: 443120640. Throughput: 0: 5933.1. Samples: 443126232. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:28,658][25689] Avg episode reward: [(0, '-44.744')] [2022-07-09 21:43:28,867][26022] Updated weights on worker 0-0, policy_version 432736 (0.00090) [2022-07-09 21:43:30,838][26022] Updated weights on worker 0-0, policy_version 432746 (0.00093) [2022-07-09 21:43:32,411][26022] Updated weights on worker 0-0, policy_version 432756 (0.00083) [2022-07-09 21:43:33,687][25689] Fps is (10 sec: 5601.3, 60 sec: 5649.3, 300 sec: 5645.2). Total num frames: 443148288. Throughput: 0: 5087.7. Samples: 443143348. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:33,688][25689] Avg episode reward: [(0, '-44.625')] [2022-07-09 21:43:34,021][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:43:34,034][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000432765_443151360.pth [2022-07-09 21:43:34,035][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000430776_441114624.pth [2022-07-09 21:43:34,281][26022] Updated weights on worker 0-0, policy_version 432766 (0.00089) [2022-07-09 21:43:35,996][26022] Updated weights on worker 0-0, policy_version 432776 (0.00085) [2022-07-09 21:43:37,972][26022] Updated weights on worker 0-0, policy_version 432786 (0.00084) [2022-07-09 21:43:38,815][25689] Fps is (10 sec: 5746.7, 60 sec: 5674.7, 300 sec: 5660.0). Total num frames: 443179008. Throughput: 0: 5947.0. Samples: 443177700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:38,815][25689] Avg episode reward: [(0, '-44.326')] [2022-07-09 21:43:39,713][26022] Updated weights on worker 0-0, policy_version 432796 (0.00090) [2022-07-09 21:43:41,533][26022] Updated weights on worker 0-0, policy_version 432806 (0.00085) [2022-07-09 21:43:43,075][26022] Updated weights on worker 0-0, policy_version 432816 (0.00091) [2022-07-09 21:43:43,847][25689] Fps is (10 sec: 5845.7, 60 sec: 5674.8, 300 sec: 5656.1). Total num frames: 443207680. Throughput: 0: 5945.8. Samples: 443212268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:43,847][25689] Avg episode reward: [(0, '-44.063')] [2022-07-09 21:43:44,954][26022] Updated weights on worker 0-0, policy_version 432826 (0.00088) [2022-07-09 21:43:46,639][26022] Updated weights on worker 0-0, policy_version 432836 (0.00093) [2022-07-09 21:43:48,659][26022] Updated weights on worker 0-0, policy_version 432846 (0.00085) [2022-07-09 21:43:48,915][25689] Fps is (10 sec: 5575.9, 60 sec: 5669.0, 300 sec: 5651.7). Total num frames: 443235328. Throughput: 0: 5091.8. Samples: 443229380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:48,916][25689] Avg episode reward: [(0, '-43.794')] [2022-07-09 21:43:50,207][26022] Updated weights on worker 0-0, policy_version 432856 (0.00087) [2022-07-09 21:43:52,244][26022] Updated weights on worker 0-0, policy_version 432866 (0.00092) [2022-07-09 21:43:53,945][25689] Fps is (10 sec: 5577.3, 60 sec: 5650.1, 300 sec: 5655.3). Total num frames: 443264000. Throughput: 0: 5924.8. Samples: 443263372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:53,945][25689] Avg episode reward: [(0, '-44.523')] [2022-07-09 21:43:54,095][26022] Updated weights on worker 0-0, policy_version 432876 (0.00087) [2022-07-09 21:43:55,779][26022] Updated weights on worker 0-0, policy_version 432886 (0.00056) [2022-07-09 21:43:57,726][26022] Updated weights on worker 0-0, policy_version 432896 (0.00090) [2022-07-09 21:43:59,079][25689] Fps is (10 sec: 5743.0, 60 sec: 5678.1, 300 sec: 5659.9). Total num frames: 443293696. Throughput: 0: 5898.8. Samples: 443297234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:43:59,079][25689] Avg episode reward: [(0, '-44.524')] [2022-07-09 21:43:59,436][26022] Updated weights on worker 0-0, policy_version 432906 (0.00086) [2022-07-09 21:44:01,231][26022] Updated weights on worker 0-0, policy_version 432916 (0.00090) [2022-07-09 21:44:03,564][26022] Updated weights on worker 0-0, policy_version 432926 (0.00090) [2022-07-09 21:44:04,107][25689] Fps is (10 sec: 5542.2, 60 sec: 5659.4, 300 sec: 5656.1). Total num frames: 443320320. Throughput: 0: 5767.5. Samples: 443329118. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:04,107][25689] Avg episode reward: [(0, '-45.768')] [2022-07-09 21:44:05,176][26022] Updated weights on worker 0-0, policy_version 432936 (0.00094) [2022-07-09 21:44:06,951][26022] Updated weights on worker 0-0, policy_version 432946 (0.00093) [2022-07-09 21:44:09,051][26022] Updated weights on worker 0-0, policy_version 432956 (0.00089) [2022-07-09 21:44:09,135][25689] Fps is (10 sec: 5295.2, 60 sec: 5623.3, 300 sec: 5652.2). Total num frames: 443346944. Throughput: 0: 5781.5. Samples: 443346278. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:09,135][25689] Avg episode reward: [(0, '-46.158')] [2022-07-09 21:44:10,483][26022] Updated weights on worker 0-0, policy_version 432966 (0.00083) [2022-07-09 21:44:12,439][26022] Updated weights on worker 0-0, policy_version 432976 (0.00090) [2022-07-09 21:44:14,118][26022] Updated weights on worker 0-0, policy_version 432986 (0.00083) [2022-07-09 21:44:14,138][25689] Fps is (10 sec: 5716.4, 60 sec: 5675.9, 300 sec: 5660.0). Total num frames: 443377664. Throughput: 0: 5814.7. Samples: 443380792. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:14,139][25689] Avg episode reward: [(0, '-46.524')] [2022-07-09 21:44:15,947][26022] Updated weights on worker 0-0, policy_version 432996 (0.00084) [2022-07-09 21:44:17,793][26022] Updated weights on worker 0-0, policy_version 433006 (0.00085) [2022-07-09 21:44:19,262][25689] Fps is (10 sec: 5763.2, 60 sec: 5672.8, 300 sec: 5654.7). Total num frames: 443405312. Throughput: 0: 5825.2. Samples: 443414808. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:19,264][25689] Avg episode reward: [(0, '-45.147')] [2022-07-09 21:44:19,583][26022] Updated weights on worker 0-0, policy_version 433016 (0.00086) [2022-07-09 21:44:21,534][26022] Updated weights on worker 0-0, policy_version 433026 (0.00090) [2022-07-09 21:44:23,160][26022] Updated weights on worker 0-0, policy_version 433036 (0.00087) [2022-07-09 21:44:24,293][25689] Fps is (10 sec: 5545.9, 60 sec: 5638.1, 300 sec: 5652.0). Total num frames: 443433984. Throughput: 0: 5086.9. Samples: 443431806. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:24,294][25689] Avg episode reward: [(0, '-44.883')] [2022-07-09 21:44:25,064][26022] Updated weights on worker 0-0, policy_version 433046 (0.00089) [2022-07-09 21:44:26,883][26022] Updated weights on worker 0-0, policy_version 433056 (0.00100) [2022-07-09 21:44:28,829][26022] Updated weights on worker 0-0, policy_version 433066 (0.00090) [2022-07-09 21:44:29,330][25689] Fps is (10 sec: 5797.4, 60 sec: 5654.0, 300 sec: 5658.8). Total num frames: 443463680. Throughput: 0: 5917.8. Samples: 443465792. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:29,331][25689] Avg episode reward: [(0, '-45.579')] [2022-07-09 21:44:30,478][26022] Updated weights on worker 0-0, policy_version 433076 (0.00093) [2022-07-09 21:44:32,302][26022] Updated weights on worker 0-0, policy_version 433086 (0.00091) [2022-07-09 21:44:34,142][26022] Updated weights on worker 0-0, policy_version 433096 (0.00091) [2022-07-09 21:44:34,343][25689] Fps is (10 sec: 5705.9, 60 sec: 5655.4, 300 sec: 5659.3). Total num frames: 443491328. Throughput: 0: 5888.7. Samples: 443499774. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:34,345][25689] Avg episode reward: [(0, '-45.230')] [2022-07-09 21:44:36,091][26022] Updated weights on worker 0-0, policy_version 433106 (0.00091) [2022-07-09 21:44:37,596][26022] Updated weights on worker 0-0, policy_version 433116 (0.00091) [2022-07-09 21:44:39,421][25689] Fps is (10 sec: 5480.1, 60 sec: 5609.5, 300 sec: 5655.3). Total num frames: 443518976. Throughput: 0: 5061.1. Samples: 443516832. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:39,421][25689] Avg episode reward: [(0, '-44.826')] [2022-07-09 21:44:39,754][26022] Updated weights on worker 0-0, policy_version 433126 (0.00085) [2022-07-09 21:44:41,154][26022] Updated weights on worker 0-0, policy_version 433136 (0.00094) [2022-07-09 21:44:43,284][26022] Updated weights on worker 0-0, policy_version 433146 (0.00083) [2022-07-09 21:44:44,453][25689] Fps is (10 sec: 5773.2, 60 sec: 5643.2, 300 sec: 5662.4). Total num frames: 443549696. Throughput: 0: 5917.3. Samples: 443551098. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:44,454][25689] Avg episode reward: [(0, '-45.326')] [2022-07-09 21:44:44,784][26022] Updated weights on worker 0-0, policy_version 433156 (0.00087) [2022-07-09 21:44:46,729][26022] Updated weights on worker 0-0, policy_version 433166 (0.00089) [2022-07-09 21:44:48,498][26022] Updated weights on worker 0-0, policy_version 433176 (0.00086) [2022-07-09 21:44:49,475][25689] Fps is (10 sec: 5703.6, 60 sec: 5630.7, 300 sec: 5653.4). Total num frames: 443576320. Throughput: 0: 5925.1. Samples: 443585150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:49,475][25689] Avg episode reward: [(0, '-45.891')] [2022-07-09 21:44:50,224][26022] Updated weights on worker 0-0, policy_version 433186 (0.00085) [2022-07-09 21:44:52,117][26022] Updated weights on worker 0-0, policy_version 433196 (0.00092) [2022-07-09 21:44:53,855][26022] Updated weights on worker 0-0, policy_version 433206 (0.00096) [2022-07-09 21:44:54,569][25689] Fps is (10 sec: 5567.8, 60 sec: 5641.6, 300 sec: 5653.5). Total num frames: 443606016. Throughput: 0: 5075.6. Samples: 443602430. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:54,569][25689] Avg episode reward: [(0, '-44.777')] [2022-07-09 21:44:55,531][26022] Updated weights on worker 0-0, policy_version 433216 (0.00084) [2022-07-09 21:44:57,400][26022] Updated weights on worker 0-0, policy_version 433226 (0.00087) [2022-07-09 21:44:59,006][26022] Updated weights on worker 0-0, policy_version 433236 (0.00088) [2022-07-09 21:44:59,656][25689] Fps is (10 sec: 5833.3, 60 sec: 5645.9, 300 sec: 5663.1). Total num frames: 443635712. Throughput: 0: 5935.6. Samples: 443636942. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:44:59,657][25689] Avg episode reward: [(0, '-44.377')] [2022-07-09 21:45:01,186][26022] Updated weights on worker 0-0, policy_version 433246 (0.00083) [2022-07-09 21:45:03,145][26022] Updated weights on worker 0-0, policy_version 433256 (0.00089) [2022-07-09 21:45:04,668][25689] Fps is (10 sec: 5576.6, 60 sec: 5647.4, 300 sec: 5664.8). Total num frames: 443662336. Throughput: 0: 5840.2. Samples: 443669156. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:45:04,669][25689] Avg episode reward: [(0, '-42.917')] [2022-07-09 21:45:05,097][26022] Updated weights on worker 0-0, policy_version 433266 (0.00091) [2022-07-09 21:45:06,772][26022] Updated weights on worker 0-0, policy_version 433276 (0.00089) [2022-07-09 21:45:08,650][26022] Updated weights on worker 0-0, policy_version 433286 (0.00097) [2022-07-09 21:45:09,677][25689] Fps is (10 sec: 5415.8, 60 sec: 5666.1, 300 sec: 5658.3). Total num frames: 443689984. Throughput: 0: 5004.0. Samples: 443686244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:45:09,678][25689] Avg episode reward: [(0, '-43.268')] [2022-07-09 21:45:10,313][26022] Updated weights on worker 0-0, policy_version 433296 (0.00085) [2022-07-09 21:45:12,114][26022] Updated weights on worker 0-0, policy_version 433306 (0.00085) [2022-07-09 21:45:14,007][26022] Updated weights on worker 0-0, policy_version 433316 (0.00085) [2022-07-09 21:45:14,727][25689] Fps is (10 sec: 5701.0, 60 sec: 5644.9, 300 sec: 5662.0). Total num frames: 443719680. Throughput: 0: 5867.1. Samples: 443720698. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-09 21:45:14,728][25689] Avg episode reward: [(0, '-44.232')] [2022-07-09 21:45:15,863][26022] Updated weights on worker 0-0, policy_version 433326 (0.00086) [2022-07-09 21:45:17,599][26022] Updated weights on worker 0-0, policy_version 433336 (0.00091) [2022-07-09 21:45:19,580][26022] Updated weights on worker 0-0, policy_version 433346 (0.00094) [2022-07-09 21:45:19,787][25689] Fps is (10 sec: 5773.5, 60 sec: 5667.8, 300 sec: 5662.1). Total num frames: 443748352. Throughput: 0: 5837.3. Samples: 443754450. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:19,788][25689] Avg episode reward: [(0, '-45.047')] [2022-07-09 21:45:21,311][26022] Updated weights on worker 0-0, policy_version 433356 (0.00084) [2022-07-09 21:45:23,115][26022] Updated weights on worker 0-0, policy_version 433366 (0.00086) [2022-07-09 21:45:24,794][25689] Fps is (10 sec: 5696.0, 60 sec: 5670.0, 300 sec: 5662.5). Total num frames: 443777024. Throughput: 0: 5084.5. Samples: 443771486. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:24,795][25689] Avg episode reward: [(0, '-45.379')] [2022-07-09 21:45:24,799][26022] Updated weights on worker 0-0, policy_version 433376 (0.00092) [2022-07-09 21:45:26,715][26022] Updated weights on worker 0-0, policy_version 433386 (0.00105) [2022-07-09 21:45:28,430][26022] Updated weights on worker 0-0, policy_version 433396 (0.00092) [2022-07-09 21:45:29,882][25689] Fps is (10 sec: 5477.6, 60 sec: 5614.5, 300 sec: 5654.8). Total num frames: 443803648. Throughput: 0: 5921.8. Samples: 443805892. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:29,883][25689] Avg episode reward: [(0, '-45.361')] [2022-07-09 21:45:30,284][26022] Updated weights on worker 0-0, policy_version 433406 (0.00102) [2022-07-09 21:45:31,946][26022] Updated weights on worker 0-0, policy_version 433416 (0.00083) [2022-07-09 21:45:33,664][26022] Updated weights on worker 0-0, policy_version 433426 (0.00086) [2022-07-09 21:45:34,249][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:45:34,257][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000433429_443831296.pth [2022-07-09 21:45:34,258][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000431438_441792512.pth [2022-07-09 21:45:34,921][25689] Fps is (10 sec: 5763.5, 60 sec: 5679.7, 300 sec: 5669.1). Total num frames: 443835392. Throughput: 0: 5945.3. Samples: 443840760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:34,922][25689] Avg episode reward: [(0, '-45.661')] [2022-07-09 21:45:35,520][26022] Updated weights on worker 0-0, policy_version 433436 (0.00085) [2022-07-09 21:45:37,295][26022] Updated weights on worker 0-0, policy_version 433446 (0.00086) [2022-07-09 21:45:39,209][26022] Updated weights on worker 0-0, policy_version 433456 (0.00087) [2022-07-09 21:45:39,980][25689] Fps is (10 sec: 6084.6, 60 sec: 5715.3, 300 sec: 5668.3). Total num frames: 443865088. Throughput: 0: 5128.4. Samples: 443858006. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:39,981][25689] Avg episode reward: [(0, '-46.235')] [2022-07-09 21:45:40,749][26022] Updated weights on worker 0-0, policy_version 433466 (0.00086) [2022-07-09 21:45:42,563][26022] Updated weights on worker 0-0, policy_version 433476 (0.00094) [2022-07-09 21:45:44,404][26022] Updated weights on worker 0-0, policy_version 433486 (0.00087) [2022-07-09 21:45:45,067][25689] Fps is (10 sec: 5651.9, 60 sec: 5659.4, 300 sec: 5661.1). Total num frames: 443892736. Throughput: 0: 5991.8. Samples: 443892958. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:45,068][25689] Avg episode reward: [(0, '-46.100')] [2022-07-09 21:45:46,146][26022] Updated weights on worker 0-0, policy_version 433496 (0.00083) [2022-07-09 21:45:48,199][26022] Updated weights on worker 0-0, policy_version 433506 (0.00101) [2022-07-09 21:45:49,869][26022] Updated weights on worker 0-0, policy_version 433516 (0.00087) [2022-07-09 21:45:50,080][25689] Fps is (10 sec: 5677.3, 60 sec: 5710.9, 300 sec: 5668.3). Total num frames: 443922432. Throughput: 0: 6004.4. Samples: 443927170. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:50,081][25689] Avg episode reward: [(0, '-45.982')] [2022-07-09 21:45:51,579][26022] Updated weights on worker 0-0, policy_version 433526 (0.00086) [2022-07-09 21:45:53,282][26022] Updated weights on worker 0-0, policy_version 433536 (0.00086) [2022-07-09 21:45:55,131][25689] Fps is (10 sec: 5698.0, 60 sec: 5681.2, 300 sec: 5665.2). Total num frames: 443950080. Throughput: 0: 5126.8. Samples: 443944372. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:45:55,132][25689] Avg episode reward: [(0, '-45.645')] [2022-07-09 21:45:55,231][26022] Updated weights on worker 0-0, policy_version 433546 (0.00089) [2022-07-09 21:45:56,987][26022] Updated weights on worker 0-0, policy_version 433556 (0.00086) [2022-07-09 21:45:58,879][26022] Updated weights on worker 0-0, policy_version 433566 (0.00084) [2022-07-09 21:46:00,174][25689] Fps is (10 sec: 5681.1, 60 sec: 5685.4, 300 sec: 5671.6). Total num frames: 443979776. Throughput: 0: 5967.8. Samples: 443978522. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:00,175][25689] Avg episode reward: [(0, '-45.295')] [2022-07-09 21:46:00,372][26022] Updated weights on worker 0-0, policy_version 433576 (0.00085) [2022-07-09 21:46:02,867][26022] Updated weights on worker 0-0, policy_version 433586 (0.00088) [2022-07-09 21:46:04,335][26022] Updated weights on worker 0-0, policy_version 433596 (0.00088) [2022-07-09 21:46:05,196][25689] Fps is (10 sec: 5595.7, 60 sec: 5684.4, 300 sec: 5667.9). Total num frames: 444006400. Throughput: 0: 5854.3. Samples: 444010798. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:05,196][25689] Avg episode reward: [(0, '-45.201')] [2022-07-09 21:46:06,405][26022] Updated weights on worker 0-0, policy_version 433606 (0.00085) [2022-07-09 21:46:07,998][26022] Updated weights on worker 0-0, policy_version 433616 (0.00095) [2022-07-09 21:46:09,967][26022] Updated weights on worker 0-0, policy_version 433626 (0.00092) [2022-07-09 21:46:10,242][25689] Fps is (10 sec: 5492.0, 60 sec: 5697.8, 300 sec: 5667.7). Total num frames: 444035072. Throughput: 0: 5001.1. Samples: 444028004. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:10,243][25689] Avg episode reward: [(0, '-44.670')] [2022-07-09 21:46:11,638][26022] Updated weights on worker 0-0, policy_version 433636 (0.00081) [2022-07-09 21:46:13,419][26022] Updated weights on worker 0-0, policy_version 433646 (0.00095) [2022-07-09 21:46:15,258][25689] Fps is (10 sec: 5699.2, 60 sec: 5684.1, 300 sec: 5664.9). Total num frames: 444063744. Throughput: 0: 5870.0. Samples: 444062516. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:15,258][25689] Avg episode reward: [(0, '-44.979')] [2022-07-09 21:46:15,259][26022] Updated weights on worker 0-0, policy_version 433656 (0.00088) [2022-07-09 21:46:17,063][26022] Updated weights on worker 0-0, policy_version 433666 (0.00085) [2022-07-09 21:46:18,736][26022] Updated weights on worker 0-0, policy_version 433676 (0.00094) [2022-07-09 21:46:20,348][25689] Fps is (10 sec: 5674.4, 60 sec: 5681.3, 300 sec: 5667.0). Total num frames: 444092416. Throughput: 0: 5870.6. Samples: 444096958. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:20,349][25689] Avg episode reward: [(0, '-45.132')] [2022-07-09 21:46:20,865][26022] Updated weights on worker 0-0, policy_version 433686 (0.00090) [2022-07-09 21:46:22,520][26022] Updated weights on worker 0-0, policy_version 433696 (0.00089) [2022-07-09 21:46:24,224][26022] Updated weights on worker 0-0, policy_version 433706 (0.00083) [2022-07-09 21:46:25,408][25689] Fps is (10 sec: 5750.3, 60 sec: 5693.2, 300 sec: 5669.4). Total num frames: 444122112. Throughput: 0: 5110.6. Samples: 444114094. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:25,408][25689] Avg episode reward: [(0, '-45.900')] [2022-07-09 21:46:25,984][26022] Updated weights on worker 0-0, policy_version 433716 (0.00094) [2022-07-09 21:46:27,707][26022] Updated weights on worker 0-0, policy_version 433726 (0.00100) [2022-07-09 21:46:29,745][26022] Updated weights on worker 0-0, policy_version 433736 (0.00089) [2022-07-09 21:46:30,430][25689] Fps is (10 sec: 5687.8, 60 sec: 5716.3, 300 sec: 5665.8). Total num frames: 444149760. Throughput: 0: 5971.2. Samples: 444148550. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:30,431][25689] Avg episode reward: [(0, '-47.236')] [2022-07-09 21:46:31,448][26022] Updated weights on worker 0-0, policy_version 433746 (0.00094) [2022-07-09 21:46:33,053][26022] Updated weights on worker 0-0, policy_version 433756 (0.00081) [2022-07-09 21:46:35,016][26022] Updated weights on worker 0-0, policy_version 433766 (0.00093) [2022-07-09 21:46:35,511][25689] Fps is (10 sec: 5574.4, 60 sec: 5661.7, 300 sec: 5665.4). Total num frames: 444178432. Throughput: 0: 5943.0. Samples: 444182884. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:35,512][25689] Avg episode reward: [(0, '-47.100')] [2022-07-09 21:46:36,718][26022] Updated weights on worker 0-0, policy_version 433776 (0.00088) [2022-07-09 21:46:38,638][26022] Updated weights on worker 0-0, policy_version 433786 (0.00081) [2022-07-09 21:46:40,390][26022] Updated weights on worker 0-0, policy_version 433796 (0.00092) [2022-07-09 21:46:40,587][25689] Fps is (10 sec: 5645.8, 60 sec: 5643.1, 300 sec: 5660.7). Total num frames: 444207104. Throughput: 0: 5934.8. Samples: 444217072. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:40,588][25689] Avg episode reward: [(0, '-46.473')] [2022-07-09 21:46:42,247][26022] Updated weights on worker 0-0, policy_version 433806 (0.00106) [2022-07-09 21:46:44,058][26022] Updated weights on worker 0-0, policy_version 433816 (0.00087) [2022-07-09 21:46:45,695][25689] Fps is (10 sec: 5731.2, 60 sec: 5675.0, 300 sec: 5670.1). Total num frames: 444236800. Throughput: 0: 5912.5. Samples: 444234044. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:45,696][25689] Avg episode reward: [(0, '-46.635')] [2022-07-09 21:46:45,747][26022] Updated weights on worker 0-0, policy_version 433826 (0.00091) [2022-07-09 21:46:47,477][26022] Updated weights on worker 0-0, policy_version 433836 (0.00092) [2022-07-09 21:46:49,332][26022] Updated weights on worker 0-0, policy_version 433846 (0.00093) [2022-07-09 21:46:50,715][25689] Fps is (10 sec: 5763.2, 60 sec: 5657.5, 300 sec: 5663.5). Total num frames: 444265472. Throughput: 0: 5906.4. Samples: 444268360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:50,715][25689] Avg episode reward: [(0, '-45.202')] [2022-07-09 21:46:51,229][26022] Updated weights on worker 0-0, policy_version 433856 (0.00086) [2022-07-09 21:46:52,969][26022] Updated weights on worker 0-0, policy_version 433866 (0.00090) [2022-07-09 21:46:54,727][26022] Updated weights on worker 0-0, policy_version 433876 (0.00088) [2022-07-09 21:46:55,734][25689] Fps is (10 sec: 5712.3, 60 sec: 5677.3, 300 sec: 5664.9). Total num frames: 444294144. Throughput: 0: 5930.9. Samples: 444302824. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:46:55,735][25689] Avg episode reward: [(0, '-44.475')] [2022-07-09 21:46:56,343][26022] Updated weights on worker 0-0, policy_version 433886 (0.00088) [2022-07-09 21:46:58,403][26022] Updated weights on worker 0-0, policy_version 433896 (0.00086) [2022-07-09 21:47:00,084][26022] Updated weights on worker 0-0, policy_version 433906 (0.00090) [2022-07-09 21:47:00,767][25689] Fps is (10 sec: 5704.9, 60 sec: 5661.4, 300 sec: 5671.4). Total num frames: 444322816. Throughput: 0: 5092.5. Samples: 444319836. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:00,767][25689] Avg episode reward: [(0, '-43.855')] [2022-07-09 21:47:02,286][26022] Updated weights on worker 0-0, policy_version 433916 (0.00086) [2022-07-09 21:47:04,257][26022] Updated weights on worker 0-0, policy_version 433926 (0.01367) [2022-07-09 21:47:05,775][25689] Fps is (10 sec: 5506.8, 60 sec: 5662.6, 300 sec: 5661.1). Total num frames: 444349440. Throughput: 0: 5883.9. Samples: 444352192. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:05,776][25689] Avg episode reward: [(0, '-43.880')] [2022-07-09 21:47:05,803][26022] Updated weights on worker 0-0, policy_version 433936 (0.00089) [2022-07-09 21:47:07,855][26022] Updated weights on worker 0-0, policy_version 433946 (0.00093) [2022-07-09 21:47:09,275][26022] Updated weights on worker 0-0, policy_version 433956 (0.00082) [2022-07-09 21:47:10,785][25689] Fps is (10 sec: 5519.4, 60 sec: 5666.1, 300 sec: 5668.5). Total num frames: 444378112. Throughput: 0: 5911.2. Samples: 444387000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:10,786][25689] Avg episode reward: [(0, '-44.146')] [2022-07-09 21:47:11,200][26022] Updated weights on worker 0-0, policy_version 433966 (0.00086) [2022-07-09 21:47:13,032][26022] Updated weights on worker 0-0, policy_version 433976 (0.00092) [2022-07-09 21:47:14,728][26022] Updated weights on worker 0-0, policy_version 433986 (0.00091) [2022-07-09 21:47:15,801][25689] Fps is (10 sec: 5719.8, 60 sec: 5666.0, 300 sec: 5665.6). Total num frames: 444406784. Throughput: 0: 5051.8. Samples: 444404198. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:15,808][25689] Avg episode reward: [(0, '-44.809')] [2022-07-09 21:47:16,667][26022] Updated weights on worker 0-0, policy_version 433996 (0.00099) [2022-07-09 21:47:18,362][26022] Updated weights on worker 0-0, policy_version 434006 (0.00088) [2022-07-09 21:47:20,146][26022] Updated weights on worker 0-0, policy_version 434016 (0.00085) [2022-07-09 21:47:20,892][25689] Fps is (10 sec: 5876.6, 60 sec: 5699.9, 300 sec: 5667.5). Total num frames: 444437504. Throughput: 0: 5898.4. Samples: 444438540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:20,892][25689] Avg episode reward: [(0, '-44.813')] [2022-07-09 21:47:22,133][26022] Updated weights on worker 0-0, policy_version 434026 (0.00086) [2022-07-09 21:47:23,710][26022] Updated weights on worker 0-0, policy_version 434036 (0.00092) [2022-07-09 21:47:25,472][26022] Updated weights on worker 0-0, policy_version 434046 (0.00058) [2022-07-09 21:47:25,914][25689] Fps is (10 sec: 5670.1, 60 sec: 5652.6, 300 sec: 5663.7). Total num frames: 444464128. Throughput: 0: 5987.5. Samples: 444472774. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:25,915][25689] Avg episode reward: [(0, '-44.455')] [2022-07-09 21:47:27,531][26022] Updated weights on worker 0-0, policy_version 434056 (0.00085) [2022-07-09 21:47:29,084][26022] Updated weights on worker 0-0, policy_version 434066 (0.00089) [2022-07-09 21:47:30,822][26022] Updated weights on worker 0-0, policy_version 434076 (0.00135) [2022-07-09 21:47:30,950][25689] Fps is (10 sec: 5599.2, 60 sec: 5685.1, 300 sec: 5666.6). Total num frames: 444493824. Throughput: 0: 5097.0. Samples: 444489782. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:30,951][25689] Avg episode reward: [(0, '-44.164')] [2022-07-09 21:47:32,651][26022] Updated weights on worker 0-0, policy_version 434086 (0.00087) [2022-07-09 21:47:34,372][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:47:34,388][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000434095_444513280.pth [2022-07-09 21:47:34,389][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000432100_442470400.pth [2022-07-09 21:47:34,499][26022] Updated weights on worker 0-0, policy_version 434096 (0.00082) [2022-07-09 21:47:35,966][25689] Fps is (10 sec: 5908.9, 60 sec: 5708.2, 300 sec: 5670.5). Total num frames: 444523520. Throughput: 0: 5948.8. Samples: 444524154. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:35,966][25689] Avg episode reward: [(0, '-43.594')] [2022-07-09 21:47:36,371][26022] Updated weights on worker 0-0, policy_version 434106 (0.00086) [2022-07-09 21:47:38,330][26022] Updated weights on worker 0-0, policy_version 434116 (0.00089) [2022-07-09 21:47:39,769][26022] Updated weights on worker 0-0, policy_version 434126 (0.00718) [2022-07-09 21:47:41,114][25689] Fps is (10 sec: 5440.4, 60 sec: 5650.6, 300 sec: 5658.0). Total num frames: 444549120. Throughput: 0: 5915.6. Samples: 444558168. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:41,115][25689] Avg episode reward: [(0, '-43.064')] [2022-07-09 21:47:41,750][26022] Updated weights on worker 0-0, policy_version 434136 (0.00095) [2022-07-09 21:47:43,426][26022] Updated weights on worker 0-0, policy_version 434146 (0.00088) [2022-07-09 21:47:45,321][26022] Updated weights on worker 0-0, policy_version 434156 (0.00107) [2022-07-09 21:47:46,127][25689] Fps is (10 sec: 5643.5, 60 sec: 5693.5, 300 sec: 5671.6). Total num frames: 444580864. Throughput: 0: 5078.0. Samples: 444575412. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:46,127][25689] Avg episode reward: [(0, '-43.228')] [2022-07-09 21:47:47,221][26022] Updated weights on worker 0-0, policy_version 434166 (0.00081) [2022-07-09 21:47:48,819][26022] Updated weights on worker 0-0, policy_version 434176 (0.00089) [2022-07-09 21:47:50,770][26022] Updated weights on worker 0-0, policy_version 434186 (0.00089) [2022-07-09 21:47:51,138][25689] Fps is (10 sec: 5822.7, 60 sec: 5660.3, 300 sec: 5661.3). Total num frames: 444607488. Throughput: 0: 5932.9. Samples: 444609556. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:51,139][25689] Avg episode reward: [(0, '-43.283')] [2022-07-09 21:47:52,400][26022] Updated weights on worker 0-0, policy_version 434196 (0.00087) [2022-07-09 21:47:54,600][26022] Updated weights on worker 0-0, policy_version 434206 (0.00092) [2022-07-09 21:47:56,151][25689] Fps is (10 sec: 5618.3, 60 sec: 5677.9, 300 sec: 5669.2). Total num frames: 444637184. Throughput: 0: 5907.5. Samples: 444643400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:47:56,152][25689] Avg episode reward: [(0, '-43.683')] [2022-07-09 21:47:56,156][26022] Updated weights on worker 0-0, policy_version 434216 (0.00085) [2022-07-09 21:47:58,053][26022] Updated weights on worker 0-0, policy_version 434226 (0.00085) [2022-07-09 21:47:59,680][26022] Updated weights on worker 0-0, policy_version 434236 (0.00081) [2022-07-09 21:48:01,219][25689] Fps is (10 sec: 5688.5, 60 sec: 5657.6, 300 sec: 5668.1). Total num frames: 444664832. Throughput: 0: 5086.0. Samples: 444660420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-09 21:48:01,219][25689] Avg episode reward: [(0, '-44.726')] [2022-07-09 21:48:01,916][26022] Updated weights on worker 0-0, policy_version 434246 (0.00055) [2022-07-09 21:48:03,802][26022] Updated weights on worker 0-0, policy_version 434256 (0.00083) [2022-07-09 21:48:05,580][26022] Updated weights on worker 0-0, policy_version 434266 (0.00086) [2022-07-09 21:48:06,285][25689] Fps is (10 sec: 5456.7, 60 sec: 5669.2, 300 sec: 5663.5). Total num frames: 444692480. Throughput: 0: 5817.3. Samples: 444692678. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:06,285][25689] Avg episode reward: [(0, '-45.399')] [2022-07-09 21:48:07,381][26022] Updated weights on worker 0-0, policy_version 434276 (0.00081) [2022-07-09 21:48:09,175][26022] Updated weights on worker 0-0, policy_version 434286 (0.00101) [2022-07-09 21:48:10,934][26022] Updated weights on worker 0-0, policy_version 434296 (0.00088) [2022-07-09 21:48:11,314][25689] Fps is (10 sec: 5477.3, 60 sec: 5650.4, 300 sec: 5663.4). Total num frames: 444720128. Throughput: 0: 5806.7. Samples: 444726712. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:11,315][25689] Avg episode reward: [(0, '-45.791')] [2022-07-09 21:48:12,882][26022] Updated weights on worker 0-0, policy_version 434306 (0.00083) [2022-07-09 21:48:14,643][26022] Updated weights on worker 0-0, policy_version 434316 (0.00098) [2022-07-09 21:48:16,262][26022] Updated weights on worker 0-0, policy_version 434326 (0.00092) [2022-07-09 21:48:16,332][25689] Fps is (10 sec: 5707.5, 60 sec: 5667.2, 300 sec: 5671.7). Total num frames: 444749824. Throughput: 0: 4984.2. Samples: 444743984. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:16,332][25689] Avg episode reward: [(0, '-45.630')] [2022-07-09 21:48:18,180][26022] Updated weights on worker 0-0, policy_version 434336 (0.00089) [2022-07-09 21:48:19,897][26022] Updated weights on worker 0-0, policy_version 434346 (0.00094) [2022-07-09 21:48:21,391][25689] Fps is (10 sec: 5690.6, 60 sec: 5619.3, 300 sec: 5660.6). Total num frames: 444777472. Throughput: 0: 5851.9. Samples: 444778466. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:21,392][25689] Avg episode reward: [(0, '-45.360')] [2022-07-09 21:48:21,849][26022] Updated weights on worker 0-0, policy_version 434356 (0.00086) [2022-07-09 21:48:23,557][26022] Updated weights on worker 0-0, policy_version 434366 (0.00091) [2022-07-09 21:48:25,239][26022] Updated weights on worker 0-0, policy_version 434376 (0.00087) [2022-07-09 21:48:26,398][25689] Fps is (10 sec: 5696.5, 60 sec: 5671.6, 300 sec: 5664.4). Total num frames: 444807168. Throughput: 0: 5959.0. Samples: 444812536. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:26,400][25689] Avg episode reward: [(0, '-45.436')] [2022-07-09 21:48:27,251][26022] Updated weights on worker 0-0, policy_version 434386 (0.00090) [2022-07-09 21:48:28,869][26022] Updated weights on worker 0-0, policy_version 434396 (0.00307) [2022-07-09 21:48:30,880][26022] Updated weights on worker 0-0, policy_version 434406 (0.00087) [2022-07-09 21:48:31,466][25689] Fps is (10 sec: 5692.1, 60 sec: 5634.8, 300 sec: 5663.7). Total num frames: 444834816. Throughput: 0: 5093.5. Samples: 444829352. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:31,467][25689] Avg episode reward: [(0, '-44.813')] [2022-07-09 21:48:32,718][26022] Updated weights on worker 0-0, policy_version 434417 (0.00089) [2022-07-09 21:48:34,701][26022] Updated weights on worker 0-0, policy_version 434427 (0.00096) [2022-07-09 21:48:36,252][26022] Updated weights on worker 0-0, policy_version 434437 (0.00089) [2022-07-09 21:48:36,487][25689] Fps is (10 sec: 5582.8, 60 sec: 5617.4, 300 sec: 5658.8). Total num frames: 444863488. Throughput: 0: 5948.0. Samples: 444863864. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:36,487][25689] Avg episode reward: [(0, '-43.868')] [2022-07-09 21:48:38,252][26022] Updated weights on worker 0-0, policy_version 434447 (0.00087) [2022-07-09 21:48:39,947][26022] Updated weights on worker 0-0, policy_version 434457 (0.00087) [2022-07-09 21:48:41,536][25689] Fps is (10 sec: 5694.5, 60 sec: 5677.5, 300 sec: 5658.5). Total num frames: 444892160. Throughput: 0: 5933.8. Samples: 444897998. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:41,536][25689] Avg episode reward: [(0, '-44.642')] [2022-07-09 21:48:41,841][26022] Updated weights on worker 0-0, policy_version 434467 (0.00085) [2022-07-09 21:48:43,551][26022] Updated weights on worker 0-0, policy_version 434477 (0.00082) [2022-07-09 21:48:45,310][26022] Updated weights on worker 0-0, policy_version 434487 (0.00093) [2022-07-09 21:48:46,542][25689] Fps is (10 sec: 5804.9, 60 sec: 5644.2, 300 sec: 5666.6). Total num frames: 444921856. Throughput: 0: 5095.1. Samples: 444915168. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:46,542][25689] Avg episode reward: [(0, '-44.249')] [2022-07-09 21:48:47,105][26022] Updated weights on worker 0-0, policy_version 434497 (0.00084) [2022-07-09 21:48:48,716][26022] Updated weights on worker 0-0, policy_version 434507 (0.00087) [2022-07-09 21:48:50,655][26022] Updated weights on worker 0-0, policy_version 434517 (0.00098) [2022-07-09 21:48:51,570][25689] Fps is (10 sec: 5714.7, 60 sec: 5659.6, 300 sec: 5663.2). Total num frames: 444949504. Throughput: 0: 5987.9. Samples: 444949736. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:51,570][25689] Avg episode reward: [(0, '-43.639')] [2022-07-09 21:48:52,409][26022] Updated weights on worker 0-0, policy_version 434527 (0.00082) [2022-07-09 21:48:54,283][26022] Updated weights on worker 0-0, policy_version 434537 (0.00093) [2022-07-09 21:48:55,926][26022] Updated weights on worker 0-0, policy_version 434547 (0.00084) [2022-07-09 21:48:56,589][25689] Fps is (10 sec: 5605.5, 60 sec: 5642.0, 300 sec: 5661.9). Total num frames: 444978176. Throughput: 0: 5961.7. Samples: 444983708. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:48:56,589][25689] Avg episode reward: [(0, '-44.622')] [2022-07-09 21:48:57,756][26022] Updated weights on worker 0-0, policy_version 434557 (0.00087) [2022-07-09 21:48:59,511][26022] Updated weights on worker 0-0, policy_version 434567 (0.00091) [2022-07-09 21:49:01,314][26022] Updated weights on worker 0-0, policy_version 434577 (0.00087) [2022-07-09 21:49:01,708][25689] Fps is (10 sec: 5757.4, 60 sec: 5671.1, 300 sec: 5670.5). Total num frames: 445007872. Throughput: 0: 5106.0. Samples: 445000998. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:01,708][25689] Avg episode reward: [(0, '-44.344')] [2022-07-09 21:49:03,486][26022] Updated weights on worker 0-0, policy_version 434587 (0.00065) [2022-07-09 21:49:05,477][26022] Updated weights on worker 0-0, policy_version 434597 (0.00106) [2022-07-09 21:49:06,723][25689] Fps is (10 sec: 5557.4, 60 sec: 5658.9, 300 sec: 5670.8). Total num frames: 445034496. Throughput: 0: 5856.3. Samples: 445033356. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:06,724][25689] Avg episode reward: [(0, '-43.682')] [2022-07-09 21:49:07,042][26022] Updated weights on worker 0-0, policy_version 434607 (0.00086) [2022-07-09 21:49:09,048][26022] Updated weights on worker 0-0, policy_version 434617 (0.00056) [2022-07-09 21:49:10,758][26022] Updated weights on worker 0-0, policy_version 434627 (0.00084) [2022-07-09 21:49:11,799][25689] Fps is (10 sec: 5479.9, 60 sec: 5671.6, 300 sec: 5662.5). Total num frames: 445063168. Throughput: 0: 5825.1. Samples: 445067570. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:11,799][25689] Avg episode reward: [(0, '-43.556')] [2022-07-09 21:49:12,464][26022] Updated weights on worker 0-0, policy_version 434637 (0.00091) [2022-07-09 21:49:14,531][26022] Updated weights on worker 0-0, policy_version 434647 (0.00510) [2022-07-09 21:49:15,956][26022] Updated weights on worker 0-0, policy_version 434657 (0.00092) [2022-07-09 21:49:16,802][25689] Fps is (10 sec: 5689.7, 60 sec: 5656.0, 300 sec: 5668.3). Total num frames: 445091840. Throughput: 0: 4997.1. Samples: 445084714. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:16,802][25689] Avg episode reward: [(0, '-43.414')] [2022-07-09 21:49:17,995][26022] Updated weights on worker 0-0, policy_version 434667 (0.00091) [2022-07-09 21:49:19,675][26022] Updated weights on worker 0-0, policy_version 434677 (0.00086) [2022-07-09 21:49:21,591][26022] Updated weights on worker 0-0, policy_version 434687 (0.00099) [2022-07-09 21:49:21,883][25689] Fps is (10 sec: 5788.2, 60 sec: 5687.8, 300 sec: 5670.7). Total num frames: 445121536. Throughput: 0: 5847.8. Samples: 445118976. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:21,883][25689] Avg episode reward: [(0, '-43.070')] [2022-07-09 21:49:23,499][26022] Updated weights on worker 0-0, policy_version 434697 (0.00087) [2022-07-09 21:49:25,028][26022] Updated weights on worker 0-0, policy_version 434707 (0.00084) [2022-07-09 21:49:26,897][25689] Fps is (10 sec: 5680.2, 60 sec: 5653.3, 300 sec: 5664.3). Total num frames: 445149184. Throughput: 0: 5929.7. Samples: 445152982. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:26,898][25689] Avg episode reward: [(0, '-42.454')] [2022-07-09 21:49:27,057][26022] Updated weights on worker 0-0, policy_version 434717 (0.00049) [2022-07-09 21:49:28,798][26022] Updated weights on worker 0-0, policy_version 434727 (0.00093) [2022-07-09 21:49:30,595][26022] Updated weights on worker 0-0, policy_version 434737 (0.00096) [2022-07-09 21:49:31,903][25689] Fps is (10 sec: 5620.8, 60 sec: 5676.0, 300 sec: 5667.9). Total num frames: 445177856. Throughput: 0: 5931.9. Samples: 445186824. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:31,904][25689] Avg episode reward: [(0, '-43.043')] [2022-07-09 21:49:32,624][26022] Updated weights on worker 0-0, policy_version 434747 (0.00093) [2022-07-09 21:49:34,055][26022] Updated weights on worker 0-0, policy_version 434757 (0.00584) [2022-07-09 21:49:34,434][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:49:34,452][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000434759_445193216.pth [2022-07-09 21:49:34,453][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000432765_443151360.pth [2022-07-09 21:49:36,043][26022] Updated weights on worker 0-0, policy_version 434767 (0.00093) [2022-07-09 21:49:36,927][25689] Fps is (10 sec: 5513.4, 60 sec: 5641.8, 300 sec: 5665.5). Total num frames: 445204480. Throughput: 0: 5928.3. Samples: 445204020. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:36,927][25689] Avg episode reward: [(0, '-43.038')] [2022-07-09 21:49:37,719][26022] Updated weights on worker 0-0, policy_version 434777 (0.00091) [2022-07-09 21:49:39,775][26022] Updated weights on worker 0-0, policy_version 434787 (0.00092) [2022-07-09 21:49:41,399][26022] Updated weights on worker 0-0, policy_version 434797 (0.00095) [2022-07-09 21:49:42,062][25689] Fps is (10 sec: 5745.5, 60 sec: 5684.6, 300 sec: 5666.9). Total num frames: 445236224. Throughput: 0: 5903.0. Samples: 445238092. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:42,062][25689] Avg episode reward: [(0, '-43.610')] [2022-07-09 21:49:43,301][26022] Updated weights on worker 0-0, policy_version 434807 (0.00089) [2022-07-09 21:49:44,879][26022] Updated weights on worker 0-0, policy_version 434817 (0.00090) [2022-07-09 21:49:46,851][26022] Updated weights on worker 0-0, policy_version 434827 (0.00084) [2022-07-09 21:49:47,107][25689] Fps is (10 sec: 5733.2, 60 sec: 5630.1, 300 sec: 5666.5). Total num frames: 445262848. Throughput: 0: 5928.7. Samples: 445272802. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:47,108][25689] Avg episode reward: [(0, '-44.287')] [2022-07-09 21:49:48,471][26022] Updated weights on worker 0-0, policy_version 434837 (0.00095) [2022-07-09 21:49:50,463][26022] Updated weights on worker 0-0, policy_version 434847 (0.00091) [2022-07-09 21:49:52,127][25689] Fps is (10 sec: 5595.3, 60 sec: 5664.7, 300 sec: 5667.9). Total num frames: 445292544. Throughput: 0: 5099.3. Samples: 445289954. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:52,128][25689] Avg episode reward: [(0, '-44.878')] [2022-07-09 21:49:52,268][26022] Updated weights on worker 0-0, policy_version 434857 (0.00085) [2022-07-09 21:49:53,971][26022] Updated weights on worker 0-0, policy_version 434867 (0.00089) [2022-07-09 21:49:55,771][26022] Updated weights on worker 0-0, policy_version 434877 (0.00087) [2022-07-09 21:49:57,150][25689] Fps is (10 sec: 5812.1, 60 sec: 5664.4, 300 sec: 5665.7). Total num frames: 445321216. Throughput: 0: 5929.7. Samples: 445323938. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:49:57,150][25689] Avg episode reward: [(0, '-44.992')] [2022-07-09 21:49:57,698][26022] Updated weights on worker 0-0, policy_version 434887 (0.00078) [2022-07-09 21:49:59,500][26022] Updated weights on worker 0-0, policy_version 434897 (0.00091) [2022-07-09 21:50:01,302][26022] Updated weights on worker 0-0, policy_version 434907 (0.00085) [2022-07-09 21:50:02,235][25689] Fps is (10 sec: 5470.3, 60 sec: 5616.8, 300 sec: 5664.3). Total num frames: 445347840. Throughput: 0: 5910.0. Samples: 445357320. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:02,236][25689] Avg episode reward: [(0, '-44.791')] [2022-07-09 21:50:03,396][26022] Updated weights on worker 0-0, policy_version 434917 (0.00084) [2022-07-09 21:50:05,307][26022] Updated weights on worker 0-0, policy_version 434927 (0.00081) [2022-07-09 21:50:06,821][26022] Updated weights on worker 0-0, policy_version 434937 (0.00088) [2022-07-09 21:50:07,287][25689] Fps is (10 sec: 5555.8, 60 sec: 5664.1, 300 sec: 5670.4). Total num frames: 445377536. Throughput: 0: 4972.6. Samples: 445373148. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:07,287][25689] Avg episode reward: [(0, '-45.994')] [2022-07-09 21:50:08,734][26022] Updated weights on worker 0-0, policy_version 434947 (0.00080) [2022-07-09 21:50:10,543][26022] Updated weights on worker 0-0, policy_version 434957 (0.00092) [2022-07-09 21:50:12,368][25689] Fps is (10 sec: 5659.6, 60 sec: 5646.7, 300 sec: 5662.9). Total num frames: 445405184. Throughput: 0: 5805.8. Samples: 445407466. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:12,368][25689] Avg episode reward: [(0, '-44.962')] [2022-07-09 21:50:12,371][26022] Updated weights on worker 0-0, policy_version 434967 (0.00094) [2022-07-09 21:50:14,130][26022] Updated weights on worker 0-0, policy_version 434977 (0.00114) [2022-07-09 21:50:15,810][26022] Updated weights on worker 0-0, policy_version 434987 (0.00083) [2022-07-09 21:50:17,393][25689] Fps is (10 sec: 5572.8, 60 sec: 5644.6, 300 sec: 5663.5). Total num frames: 445433856. Throughput: 0: 5829.3. Samples: 445441942. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:17,395][25689] Avg episode reward: [(0, '-44.633')] [2022-07-09 21:50:17,607][26022] Updated weights on worker 0-0, policy_version 434997 (0.00098) [2022-07-09 21:50:19,702][26022] Updated weights on worker 0-0, policy_version 435007 (0.00090) [2022-07-09 21:50:21,175][26022] Updated weights on worker 0-0, policy_version 435017 (0.00087) [2022-07-09 21:50:22,522][25689] Fps is (10 sec: 5747.9, 60 sec: 5640.1, 300 sec: 5664.7). Total num frames: 445463552. Throughput: 0: 5008.1. Samples: 445458918. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:22,525][25689] Avg episode reward: [(0, '-43.788')] [2022-07-09 21:50:23,384][26022] Updated weights on worker 0-0, policy_version 435027 (0.00091) [2022-07-09 21:50:24,814][26022] Updated weights on worker 0-0, policy_version 435037 (0.00091) [2022-07-09 21:50:27,074][26022] Updated weights on worker 0-0, policy_version 435047 (0.00095) [2022-07-09 21:50:27,567][25689] Fps is (10 sec: 5535.8, 60 sec: 5620.5, 300 sec: 5665.5). Total num frames: 445490176. Throughput: 0: 5892.5. Samples: 445492648. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:27,567][25689] Avg episode reward: [(0, '-44.179')] [2022-07-09 21:50:28,423][26022] Updated weights on worker 0-0, policy_version 435057 (0.00087) [2022-07-09 21:50:30,750][26022] Updated weights on worker 0-0, policy_version 435067 (0.00096) [2022-07-09 21:50:32,142][26022] Updated weights on worker 0-0, policy_version 435077 (0.00087) [2022-07-09 21:50:32,581][25689] Fps is (10 sec: 5700.7, 60 sec: 5653.4, 300 sec: 5662.5). Total num frames: 445520896. Throughput: 0: 5887.4. Samples: 445526472. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:32,583][25689] Avg episode reward: [(0, '-44.421')] [2022-07-09 21:50:34,252][26022] Updated weights on worker 0-0, policy_version 435087 (0.00091) [2022-07-09 21:50:35,698][26022] Updated weights on worker 0-0, policy_version 435097 (0.00091) [2022-07-09 21:50:37,663][25689] Fps is (10 sec: 5781.3, 60 sec: 5664.9, 300 sec: 5655.2). Total num frames: 445548544. Throughput: 0: 5003.7. Samples: 445543366. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:37,665][25689] Avg episode reward: [(0, '-44.022')] [2022-07-09 21:50:37,894][26022] Updated weights on worker 0-0, policy_version 435107 (0.00086) [2022-07-09 21:50:39,577][26022] Updated weights on worker 0-0, policy_version 435117 (0.00095) [2022-07-09 21:50:41,417][26022] Updated weights on worker 0-0, policy_version 435127 (0.00086) [2022-07-09 21:50:42,721][25689] Fps is (10 sec: 5554.3, 60 sec: 5621.4, 300 sec: 5659.2). Total num frames: 445577216. Throughput: 0: 5855.3. Samples: 445577190. Policy #0 lag: (min: 0.0, avg: 8.0, max: 17.0) [2022-07-09 21:50:42,722][25689] Avg episode reward: [(0, '-45.211')] [2022-07-09 21:50:43,012][26022] Updated weights on worker 0-0, policy_version 435137 (0.00091) [2022-07-09 21:50:45,374][26022] Updated weights on worker 0-0, policy_version 435147 (0.00091) [2022-07-09 21:50:46,536][26022] Updated weights on worker 0-0, policy_version 435157 (0.00087) [2022-07-09 21:50:47,768][25689] Fps is (10 sec: 5674.6, 60 sec: 5655.0, 300 sec: 5655.1). Total num frames: 445605888. Throughput: 0: 5885.8. Samples: 445611550. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:50:47,769][25689] Avg episode reward: [(0, '-44.671')] [2022-07-09 21:50:48,676][26022] Updated weights on worker 0-0, policy_version 435167 (0.00107) [2022-07-09 21:50:50,253][26022] Updated weights on worker 0-0, policy_version 435177 (0.00080) [2022-07-09 21:50:52,020][26022] Updated weights on worker 0-0, policy_version 435187 (0.00092) [2022-07-09 21:50:52,806][25689] Fps is (10 sec: 5787.6, 60 sec: 5653.3, 300 sec: 5662.2). Total num frames: 445635584. Throughput: 0: 5062.3. Samples: 445628856. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:50:52,807][25689] Avg episode reward: [(0, '-44.336')] [2022-07-09 21:50:53,886][26022] Updated weights on worker 0-0, policy_version 435197 (0.00097) [2022-07-09 21:50:55,627][26022] Updated weights on worker 0-0, policy_version 435207 (0.00088) [2022-07-09 21:50:57,510][26022] Updated weights on worker 0-0, policy_version 435217 (0.00094) [2022-07-09 21:50:57,879][25689] Fps is (10 sec: 5671.6, 60 sec: 5631.8, 300 sec: 5654.8). Total num frames: 445663232. Throughput: 0: 5935.7. Samples: 445663362. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:50:57,879][25689] Avg episode reward: [(0, '-43.787')] [2022-07-09 21:50:59,498][26022] Updated weights on worker 0-0, policy_version 435227 (0.00093) [2022-07-09 21:51:01,128][26022] Updated weights on worker 0-0, policy_version 435237 (0.00620) [2022-07-09 21:51:02,932][25689] Fps is (10 sec: 5359.7, 60 sec: 5634.8, 300 sec: 5654.2). Total num frames: 445689856. Throughput: 0: 5869.4. Samples: 445695816. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:02,933][25689] Avg episode reward: [(0, '-43.633')] [2022-07-09 21:51:03,527][26022] Updated weights on worker 0-0, policy_version 435247 (0.00084) [2022-07-09 21:51:05,006][26022] Updated weights on worker 0-0, policy_version 435257 (0.00090) [2022-07-09 21:51:06,909][26022] Updated weights on worker 0-0, policy_version 435267 (0.00101) [2022-07-09 21:51:07,939][25689] Fps is (10 sec: 5598.2, 60 sec: 5639.0, 300 sec: 5658.4). Total num frames: 445719552. Throughput: 0: 5001.5. Samples: 445712436. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:07,940][25689] Avg episode reward: [(0, '-42.902')] [2022-07-09 21:51:08,799][26022] Updated weights on worker 0-0, policy_version 435277 (0.00091) [2022-07-09 21:51:10,343][26022] Updated weights on worker 0-0, policy_version 435287 (0.00086) [2022-07-09 21:51:12,259][26022] Updated weights on worker 0-0, policy_version 435297 (0.00088) [2022-07-09 21:51:12,954][25689] Fps is (10 sec: 5824.1, 60 sec: 5662.0, 300 sec: 5658.4). Total num frames: 445748224. Throughput: 0: 5864.5. Samples: 445747012. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:12,955][25689] Avg episode reward: [(0, '-42.949')] [2022-07-09 21:51:14,000][26022] Updated weights on worker 0-0, policy_version 435307 (0.00090) [2022-07-09 21:51:15,799][26022] Updated weights on worker 0-0, policy_version 435317 (0.00088) [2022-07-09 21:51:17,655][26022] Updated weights on worker 0-0, policy_version 435327 (0.00096) [2022-07-09 21:51:17,975][25689] Fps is (10 sec: 5612.1, 60 sec: 5645.5, 300 sec: 5656.3). Total num frames: 445775872. Throughput: 0: 5891.8. Samples: 445781762. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:17,975][25689] Avg episode reward: [(0, '-42.991')] [2022-07-09 21:51:19,153][26022] Updated weights on worker 0-0, policy_version 435337 (0.00084) [2022-07-09 21:51:21,166][26022] Updated weights on worker 0-0, policy_version 435347 (0.00084) [2022-07-09 21:51:22,854][26022] Updated weights on worker 0-0, policy_version 435357 (0.00089) [2022-07-09 21:51:23,031][25689] Fps is (10 sec: 5792.1, 60 sec: 5669.2, 300 sec: 5659.8). Total num frames: 445806592. Throughput: 0: 5128.7. Samples: 445798898. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:23,032][25689] Avg episode reward: [(0, '-43.932')] [2022-07-09 21:51:24,610][26022] Updated weights on worker 0-0, policy_version 435367 (0.00091) [2022-07-09 21:51:26,652][26022] Updated weights on worker 0-0, policy_version 435377 (0.00088) [2022-07-09 21:51:28,047][25689] Fps is (10 sec: 5896.6, 60 sec: 5705.8, 300 sec: 5663.3). Total num frames: 445835264. Throughput: 0: 6032.8. Samples: 445833742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:28,047][25689] Avg episode reward: [(0, '-44.116')] [2022-07-09 21:51:28,149][26022] Updated weights on worker 0-0, policy_version 435387 (0.00089) [2022-07-09 21:51:30,110][26022] Updated weights on worker 0-0, policy_version 435397 (0.00086) [2022-07-09 21:51:31,659][26022] Updated weights on worker 0-0, policy_version 435407 (0.00095) [2022-07-09 21:51:33,059][25689] Fps is (10 sec: 5616.5, 60 sec: 5655.3, 300 sec: 5661.2). Total num frames: 445862912. Throughput: 0: 6016.2. Samples: 445867966. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:33,059][25689] Avg episode reward: [(0, '-44.263')] [2022-07-09 21:51:33,637][26022] Updated weights on worker 0-0, policy_version 435417 (0.00091) [2022-07-09 21:51:34,652][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:51:34,665][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000435424_445874176.pth [2022-07-09 21:51:34,666][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000433429_443831296.pth [2022-07-09 21:51:35,443][26022] Updated weights on worker 0-0, policy_version 435427 (0.00084) [2022-07-09 21:51:37,255][26022] Updated weights on worker 0-0, policy_version 435437 (0.00089) [2022-07-09 21:51:38,095][25689] Fps is (10 sec: 5707.1, 60 sec: 5693.4, 300 sec: 5665.4). Total num frames: 445892608. Throughput: 0: 5134.4. Samples: 445885066. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:38,097][25689] Avg episode reward: [(0, '-44.468')] [2022-07-09 21:51:39,057][26022] Updated weights on worker 0-0, policy_version 435447 (0.00092) [2022-07-09 21:51:40,644][26022] Updated weights on worker 0-0, policy_version 435457 (0.00092) [2022-07-09 21:51:42,587][26022] Updated weights on worker 0-0, policy_version 435467 (0.00085) [2022-07-09 21:51:43,211][25689] Fps is (10 sec: 5749.4, 60 sec: 5688.0, 300 sec: 5661.8). Total num frames: 445921280. Throughput: 0: 5973.9. Samples: 445919448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:43,211][25689] Avg episode reward: [(0, '-44.089')] [2022-07-09 21:51:44,283][26022] Updated weights on worker 0-0, policy_version 435477 (0.00089) [2022-07-09 21:51:45,993][26022] Updated weights on worker 0-0, policy_version 435487 (0.00086) [2022-07-09 21:51:48,011][26022] Updated weights on worker 0-0, policy_version 435497 (0.00092) [2022-07-09 21:51:48,240][25689] Fps is (10 sec: 5551.5, 60 sec: 5672.8, 300 sec: 5658.2). Total num frames: 445948928. Throughput: 0: 5959.5. Samples: 445954082. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:48,240][25689] Avg episode reward: [(0, '-44.227')] [2022-07-09 21:51:49,486][26022] Updated weights on worker 0-0, policy_version 435507 (0.00087) [2022-07-09 21:51:51,684][26022] Updated weights on worker 0-0, policy_version 435517 (0.00091) [2022-07-09 21:51:53,087][26022] Updated weights on worker 0-0, policy_version 435527 (0.00086) [2022-07-09 21:51:53,245][25689] Fps is (10 sec: 5816.6, 60 sec: 5692.7, 300 sec: 5665.3). Total num frames: 445979648. Throughput: 0: 5960.3. Samples: 445988284. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:53,246][25689] Avg episode reward: [(0, '-43.348')] [2022-07-09 21:51:55,054][26022] Updated weights on worker 0-0, policy_version 435537 (0.00085) [2022-07-09 21:51:56,873][26022] Updated weights on worker 0-0, policy_version 435547 (0.00087) [2022-07-09 21:51:58,260][25689] Fps is (10 sec: 5824.9, 60 sec: 5698.2, 300 sec: 5662.2). Total num frames: 446007296. Throughput: 0: 5977.5. Samples: 446005606. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:51:58,261][25689] Avg episode reward: [(0, '-43.758')] [2022-07-09 21:51:58,694][26022] Updated weights on worker 0-0, policy_version 435557 (0.00083) [2022-07-09 21:52:00,438][26022] Updated weights on worker 0-0, policy_version 435567 (0.00083) [2022-07-09 21:52:02,748][26022] Updated weights on worker 0-0, policy_version 435577 (0.00082) [2022-07-09 21:52:03,315][25689] Fps is (10 sec: 5389.9, 60 sec: 5698.1, 300 sec: 5661.4). Total num frames: 446033920. Throughput: 0: 5876.9. Samples: 446037598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:03,317][25689] Avg episode reward: [(0, '-44.254')] [2022-07-09 21:52:04,412][26022] Updated weights on worker 0-0, policy_version 435587 (0.00085) [2022-07-09 21:52:06,533][26022] Updated weights on worker 0-0, policy_version 435597 (0.00092) [2022-07-09 21:52:07,960][26022] Updated weights on worker 0-0, policy_version 435607 (0.00087) [2022-07-09 21:52:08,319][25689] Fps is (10 sec: 5497.5, 60 sec: 5681.4, 300 sec: 5661.5). Total num frames: 446062592. Throughput: 0: 5862.2. Samples: 446071788. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:08,319][25689] Avg episode reward: [(0, '-44.706')] [2022-07-09 21:52:09,910][26022] Updated weights on worker 0-0, policy_version 435617 (0.00089) [2022-07-09 21:52:11,708][26022] Updated weights on worker 0-0, policy_version 435627 (0.00091) [2022-07-09 21:52:13,328][25689] Fps is (10 sec: 5624.3, 60 sec: 5664.9, 300 sec: 5658.2). Total num frames: 446090240. Throughput: 0: 5003.9. Samples: 446088778. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:13,329][25689] Avg episode reward: [(0, '-44.840')] [2022-07-09 21:52:13,502][26022] Updated weights on worker 0-0, policy_version 435637 (0.00088) [2022-07-09 21:52:15,218][26022] Updated weights on worker 0-0, policy_version 435647 (0.00087) [2022-07-09 21:52:17,096][26022] Updated weights on worker 0-0, policy_version 435657 (0.00092) [2022-07-09 21:52:18,365][25689] Fps is (10 sec: 5708.0, 60 sec: 5697.3, 300 sec: 5655.7). Total num frames: 446119936. Throughput: 0: 5856.5. Samples: 446123350. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:18,365][25689] Avg episode reward: [(0, '-46.198')] [2022-07-09 21:52:18,625][26022] Updated weights on worker 0-0, policy_version 435667 (0.00089) [2022-07-09 21:52:20,750][26022] Updated weights on worker 0-0, policy_version 435677 (0.00089) [2022-07-09 21:52:22,388][26022] Updated weights on worker 0-0, policy_version 435687 (0.00089) [2022-07-09 21:52:23,411][25689] Fps is (10 sec: 5687.4, 60 sec: 5647.4, 300 sec: 5658.7). Total num frames: 446147584. Throughput: 0: 5953.6. Samples: 446157244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:23,412][25689] Avg episode reward: [(0, '-46.690')] [2022-07-09 21:52:24,268][26022] Updated weights on worker 0-0, policy_version 435697 (0.00092) [2022-07-09 21:52:26,156][26022] Updated weights on worker 0-0, policy_version 435707 (0.00087) [2022-07-09 21:52:27,856][26022] Updated weights on worker 0-0, policy_version 435717 (0.00092) [2022-07-09 21:52:28,424][25689] Fps is (10 sec: 5598.8, 60 sec: 5647.7, 300 sec: 5655.7). Total num frames: 446176256. Throughput: 0: 5097.2. Samples: 446174270. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:28,425][25689] Avg episode reward: [(0, '-46.714')] [2022-07-09 21:52:29,699][26022] Updated weights on worker 0-0, policy_version 435727 (0.00106) [2022-07-09 21:52:31,496][26022] Updated weights on worker 0-0, policy_version 435737 (0.00090) [2022-07-09 21:52:33,219][26022] Updated weights on worker 0-0, policy_version 435747 (0.00093) [2022-07-09 21:52:33,453][25689] Fps is (10 sec: 5812.2, 60 sec: 5680.0, 300 sec: 5655.5). Total num frames: 446205952. Throughput: 0: 5936.0. Samples: 446208240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:33,454][25689] Avg episode reward: [(0, '-45.986')] [2022-07-09 21:52:35,223][26022] Updated weights on worker 0-0, policy_version 435757 (0.00091) [2022-07-09 21:52:36,679][26022] Updated weights on worker 0-0, policy_version 435767 (0.00089) [2022-07-09 21:52:38,456][25689] Fps is (10 sec: 5818.1, 60 sec: 5666.1, 300 sec: 5668.6). Total num frames: 446234624. Throughput: 0: 5940.5. Samples: 446242704. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:38,457][25689] Avg episode reward: [(0, '-45.753')] [2022-07-09 21:52:38,700][26022] Updated weights on worker 0-0, policy_version 435777 (0.00089) [2022-07-09 21:52:40,507][26022] Updated weights on worker 0-0, policy_version 435787 (0.00497) [2022-07-09 21:52:42,315][26022] Updated weights on worker 0-0, policy_version 435797 (0.00100) [2022-07-09 21:52:43,488][25689] Fps is (10 sec: 5510.3, 60 sec: 5640.0, 300 sec: 5651.0). Total num frames: 446261248. Throughput: 0: 5107.6. Samples: 446259790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:43,489][25689] Avg episode reward: [(0, '-45.692')] [2022-07-09 21:52:44,038][26022] Updated weights on worker 0-0, policy_version 435807 (0.00086) [2022-07-09 21:52:45,831][26022] Updated weights on worker 0-0, policy_version 435817 (0.00091) [2022-07-09 21:52:47,612][26022] Updated weights on worker 0-0, policy_version 435827 (0.00097) [2022-07-09 21:52:48,499][25689] Fps is (10 sec: 5608.1, 60 sec: 5675.7, 300 sec: 5661.3). Total num frames: 446290944. Throughput: 0: 5968.5. Samples: 446294088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:48,500][25689] Avg episode reward: [(0, '-44.619')] [2022-07-09 21:52:49,461][26022] Updated weights on worker 0-0, policy_version 435837 (0.00094) [2022-07-09 21:52:51,233][26022] Updated weights on worker 0-0, policy_version 435847 (0.00088) [2022-07-09 21:52:53,039][26022] Updated weights on worker 0-0, policy_version 435857 (0.00054) [2022-07-09 21:52:53,511][25689] Fps is (10 sec: 5823.4, 60 sec: 5641.1, 300 sec: 5657.9). Total num frames: 446319616. Throughput: 0: 6003.6. Samples: 446328660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:53,512][25689] Avg episode reward: [(0, '-43.841')] [2022-07-09 21:52:54,799][26022] Updated weights on worker 0-0, policy_version 435867 (0.00083) [2022-07-09 21:52:56,627][26022] Updated weights on worker 0-0, policy_version 435877 (0.00088) [2022-07-09 21:52:58,511][26022] Updated weights on worker 0-0, policy_version 435887 (0.01484) [2022-07-09 21:52:58,516][25689] Fps is (10 sec: 5724.7, 60 sec: 5659.1, 300 sec: 5662.5). Total num frames: 446348288. Throughput: 0: 5134.9. Samples: 446345708. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:52:58,527][25689] Avg episode reward: [(0, '-44.766')] [2022-07-09 21:53:00,216][26022] Updated weights on worker 0-0, policy_version 435897 (0.00091) [2022-07-09 21:53:02,459][26022] Updated weights on worker 0-0, policy_version 435907 (0.00083) [2022-07-09 21:53:03,599][25689] Fps is (10 sec: 5481.4, 60 sec: 5656.4, 300 sec: 5658.8). Total num frames: 446374912. Throughput: 0: 5856.3. Samples: 446377566. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:53:03,600][25689] Avg episode reward: [(0, '-45.062')] [2022-07-09 21:53:04,199][26022] Updated weights on worker 0-0, policy_version 435917 (0.00088) [2022-07-09 21:53:06,141][26022] Updated weights on worker 0-0, policy_version 435927 (0.00094) [2022-07-09 21:53:07,588][26022] Updated weights on worker 0-0, policy_version 435937 (0.00082) [2022-07-09 21:53:08,606][25689] Fps is (10 sec: 5480.5, 60 sec: 5656.1, 300 sec: 5662.6). Total num frames: 446403584. Throughput: 0: 5855.6. Samples: 446411824. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:53:08,606][25689] Avg episode reward: [(0, '-46.186')] [2022-07-09 21:53:09,608][26022] Updated weights on worker 0-0, policy_version 435947 (0.00091) [2022-07-09 21:53:11,323][26022] Updated weights on worker 0-0, policy_version 435957 (0.00094) [2022-07-09 21:53:13,228][26022] Updated weights on worker 0-0, policy_version 435967 (0.00091) [2022-07-09 21:53:13,614][25689] Fps is (10 sec: 5623.5, 60 sec: 5656.2, 300 sec: 5655.9). Total num frames: 446431232. Throughput: 0: 4987.0. Samples: 446428914. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:53:13,615][25689] Avg episode reward: [(0, '-46.055')] [2022-07-09 21:53:14,824][26022] Updated weights on worker 0-0, policy_version 435977 (0.00093) [2022-07-09 21:53:16,910][26022] Updated weights on worker 0-0, policy_version 435987 (0.00090) [2022-07-09 21:53:18,411][26022] Updated weights on worker 0-0, policy_version 435997 (0.00101) [2022-07-09 21:53:18,628][25689] Fps is (10 sec: 5823.8, 60 sec: 5675.4, 300 sec: 5667.1). Total num frames: 446461952. Throughput: 0: 5865.6. Samples: 446463678. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:53:18,629][25689] Avg episode reward: [(0, '-45.816')] [2022-07-09 21:53:20,446][26022] Updated weights on worker 0-0, policy_version 436007 (0.00087) [2022-07-09 21:53:21,988][26022] Updated weights on worker 0-0, policy_version 436017 (0.00086) [2022-07-09 21:53:23,707][25689] Fps is (10 sec: 5783.5, 60 sec: 5672.3, 300 sec: 5658.9). Total num frames: 446489600. Throughput: 0: 5996.4. Samples: 446498138. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:53:23,707][25689] Avg episode reward: [(0, '-46.049')] [2022-07-09 21:53:23,975][26022] Updated weights on worker 0-0, policy_version 436027 (0.00085) [2022-07-09 21:53:25,512][26022] Updated weights on worker 0-0, policy_version 436037 (0.00089) [2022-07-09 21:53:27,704][26022] Updated weights on worker 0-0, policy_version 436047 (0.00095) [2022-07-09 21:53:28,709][25689] Fps is (10 sec: 5688.5, 60 sec: 5690.4, 300 sec: 5667.0). Total num frames: 446519296. Throughput: 0: 5136.7. Samples: 446515088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-09 21:53:28,710][25689] Avg episode reward: [(0, '-45.960')] [2022-07-09 21:53:29,395][26022] Updated weights on worker 0-0, policy_version 436057 (0.00088) [2022-07-09 21:53:31,190][26022] Updated weights on worker 0-0, policy_version 436067 (0.00089) [2022-07-09 21:53:32,848][26022] Updated weights on worker 0-0, policy_version 436077 (0.00081) [2022-07-09 21:53:33,737][25689] Fps is (10 sec: 5717.1, 60 sec: 5656.5, 300 sec: 5663.4). Total num frames: 446546944. Throughput: 0: 5973.2. Samples: 446549110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:53:33,738][25689] Avg episode reward: [(0, '-44.542')] [2022-07-09 21:53:34,688][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:53:34,704][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000436086_446552064.pth [2022-07-09 21:53:34,705][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000434095_444513280.pth [2022-07-09 21:53:34,901][26022] Updated weights on worker 0-0, policy_version 436087 (0.00089) [2022-07-09 21:53:36,374][26022] Updated weights on worker 0-0, policy_version 436097 (0.00092) [2022-07-09 21:53:38,376][26022] Updated weights on worker 0-0, policy_version 436107 (0.00095) [2022-07-09 21:53:38,740][25689] Fps is (10 sec: 5512.5, 60 sec: 5639.5, 300 sec: 5660.9). Total num frames: 446574592. Throughput: 0: 5949.1. Samples: 446583324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:53:38,740][25689] Avg episode reward: [(0, '-44.209')] [2022-07-09 21:53:39,916][26022] Updated weights on worker 0-0, policy_version 436117 (0.00087) [2022-07-09 21:53:41,912][26022] Updated weights on worker 0-0, policy_version 436127 (0.00090) [2022-07-09 21:53:43,776][25689] Fps is (10 sec: 5712.1, 60 sec: 5690.1, 300 sec: 5660.3). Total num frames: 446604288. Throughput: 0: 5103.1. Samples: 446600554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:53:43,776][25689] Avg episode reward: [(0, '-44.927')] [2022-07-09 21:53:43,786][26022] Updated weights on worker 0-0, policy_version 436137 (0.00090) [2022-07-09 21:53:45,391][26022] Updated weights on worker 0-0, policy_version 436147 (0.00087) [2022-07-09 21:53:47,389][26022] Updated weights on worker 0-0, policy_version 436157 (0.00090) [2022-07-09 21:53:48,830][25689] Fps is (10 sec: 5784.6, 60 sec: 5669.0, 300 sec: 5663.2). Total num frames: 446632960. Throughput: 0: 5957.6. Samples: 446634962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:53:48,831][25689] Avg episode reward: [(0, '-45.137')] [2022-07-09 21:53:49,007][26022] Updated weights on worker 0-0, policy_version 436167 (0.00095) [2022-07-09 21:53:50,979][26022] Updated weights on worker 0-0, policy_version 436177 (0.00087) [2022-07-09 21:53:52,807][26022] Updated weights on worker 0-0, policy_version 436187 (0.00082) [2022-07-09 21:53:53,865][25689] Fps is (10 sec: 5683.6, 60 sec: 5666.8, 300 sec: 5662.9). Total num frames: 446661632. Throughput: 0: 5940.7. Samples: 446668686. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:53:53,866][25689] Avg episode reward: [(0, '-44.766')] [2022-07-09 21:53:54,502][26022] Updated weights on worker 0-0, policy_version 436197 (0.00089) [2022-07-09 21:53:56,394][26022] Updated weights on worker 0-0, policy_version 436207 (0.00087) [2022-07-09 21:53:57,947][26022] Updated weights on worker 0-0, policy_version 436217 (0.00097) [2022-07-09 21:53:58,883][25689] Fps is (10 sec: 5602.0, 60 sec: 5648.6, 300 sec: 5658.0). Total num frames: 446689280. Throughput: 0: 5083.6. Samples: 446685726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:53:58,884][25689] Avg episode reward: [(0, '-45.439')] [2022-07-09 21:54:00,093][26022] Updated weights on worker 0-0, policy_version 436227 (0.00084) [2022-07-09 21:54:01,898][26022] Updated weights on worker 0-0, policy_version 436237 (0.00092) [2022-07-09 21:54:04,032][25689] Fps is (10 sec: 5338.1, 60 sec: 5642.5, 300 sec: 5655.4). Total num frames: 446715904. Throughput: 0: 5786.7. Samples: 446717770. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:04,035][25689] Avg episode reward: [(0, '-45.918')] [2022-07-09 21:54:04,055][26022] Updated weights on worker 0-0, policy_version 436247 (0.00089) [2022-07-09 21:54:05,843][26022] Updated weights on worker 0-0, policy_version 436257 (0.00084) [2022-07-09 21:54:07,628][26022] Updated weights on worker 0-0, policy_version 436267 (0.00096) [2022-07-09 21:54:09,132][25689] Fps is (10 sec: 5495.2, 60 sec: 5650.7, 300 sec: 5658.4). Total num frames: 446745600. Throughput: 0: 5727.6. Samples: 446751246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:09,133][25689] Avg episode reward: [(0, '-46.002')] [2022-07-09 21:54:09,520][26022] Updated weights on worker 0-0, policy_version 436277 (0.00110) [2022-07-09 21:54:11,403][26022] Updated weights on worker 0-0, policy_version 436287 (0.00091) [2022-07-09 21:54:12,933][26022] Updated weights on worker 0-0, policy_version 436297 (0.00090) [2022-07-09 21:54:14,157][25689] Fps is (10 sec: 5764.9, 60 sec: 5666.1, 300 sec: 5658.0). Total num frames: 446774272. Throughput: 0: 4918.0. Samples: 446768480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:14,157][25689] Avg episode reward: [(0, '-46.981')] [2022-07-09 21:54:14,933][26022] Updated weights on worker 0-0, policy_version 436307 (0.00087) [2022-07-09 21:54:16,583][26022] Updated weights on worker 0-0, policy_version 436317 (0.00082) [2022-07-09 21:54:18,439][26022] Updated weights on worker 0-0, policy_version 436327 (0.00087) [2022-07-09 21:54:19,204][25689] Fps is (10 sec: 5693.5, 60 sec: 5629.2, 300 sec: 5655.2). Total num frames: 446802944. Throughput: 0: 5761.1. Samples: 446802796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:19,205][25689] Avg episode reward: [(0, '-46.618')] [2022-07-09 21:54:20,137][26022] Updated weights on worker 0-0, policy_version 436337 (0.00086) [2022-07-09 21:54:21,964][26022] Updated weights on worker 0-0, policy_version 436347 (0.00089) [2022-07-09 21:54:23,684][26022] Updated weights on worker 0-0, policy_version 436357 (0.00087) [2022-07-09 21:54:24,287][25689] Fps is (10 sec: 5661.0, 60 sec: 5645.7, 300 sec: 5657.3). Total num frames: 446831616. Throughput: 0: 5894.1. Samples: 446837152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:24,287][25689] Avg episode reward: [(0, '-45.782')] [2022-07-09 21:54:25,635][26022] Updated weights on worker 0-0, policy_version 436367 (0.00087) [2022-07-09 21:54:27,474][26022] Updated weights on worker 0-0, policy_version 436377 (0.00086) [2022-07-09 21:54:29,251][26022] Updated weights on worker 0-0, policy_version 436387 (0.00094) [2022-07-09 21:54:29,337][25689] Fps is (10 sec: 5659.1, 60 sec: 5624.3, 300 sec: 5656.5). Total num frames: 446860288. Throughput: 0: 5085.5. Samples: 446853998. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:29,338][25689] Avg episode reward: [(0, '-45.991')] [2022-07-09 21:54:31,067][26022] Updated weights on worker 0-0, policy_version 436397 (0.00084) [2022-07-09 21:54:32,908][26022] Updated weights on worker 0-0, policy_version 436407 (0.00090) [2022-07-09 21:54:34,384][25689] Fps is (10 sec: 5678.9, 60 sec: 5639.4, 300 sec: 5662.9). Total num frames: 446888960. Throughput: 0: 5912.7. Samples: 446888080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:34,385][25689] Avg episode reward: [(0, '-45.977')] [2022-07-09 21:54:34,621][26022] Updated weights on worker 0-0, policy_version 436417 (0.00090) [2022-07-09 21:54:36,363][26022] Updated weights on worker 0-0, policy_version 436427 (0.00082) [2022-07-09 21:54:38,270][26022] Updated weights on worker 0-0, policy_version 436437 (0.00095) [2022-07-09 21:54:39,393][25689] Fps is (10 sec: 5804.2, 60 sec: 5672.6, 300 sec: 5658.4). Total num frames: 446918656. Throughput: 0: 5916.6. Samples: 446922246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:39,394][25689] Avg episode reward: [(0, '-44.827')] [2022-07-09 21:54:40,315][26022] Updated weights on worker 0-0, policy_version 436447 (0.00865) [2022-07-09 21:54:41,932][26022] Updated weights on worker 0-0, policy_version 436457 (0.00084) [2022-07-09 21:54:43,865][26022] Updated weights on worker 0-0, policy_version 436467 (0.00084) [2022-07-09 21:54:44,433][25689] Fps is (10 sec: 5706.8, 60 sec: 5638.5, 300 sec: 5662.0). Total num frames: 446946304. Throughput: 0: 5065.5. Samples: 446939202. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:44,433][25689] Avg episode reward: [(0, '-44.373')] [2022-07-09 21:54:45,495][26022] Updated weights on worker 0-0, policy_version 436477 (0.00082) [2022-07-09 21:54:47,343][26022] Updated weights on worker 0-0, policy_version 436487 (0.00086) [2022-07-09 21:54:49,022][26022] Updated weights on worker 0-0, policy_version 436497 (0.00088) [2022-07-09 21:54:49,501][25689] Fps is (10 sec: 5673.0, 60 sec: 5654.1, 300 sec: 5661.1). Total num frames: 446976000. Throughput: 0: 5917.8. Samples: 446973326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:49,502][25689] Avg episode reward: [(0, '-43.690')] [2022-07-09 21:54:50,932][26022] Updated weights on worker 0-0, policy_version 436507 (0.00092) [2022-07-09 21:54:52,664][26022] Updated weights on worker 0-0, policy_version 436517 (0.00086) [2022-07-09 21:54:54,267][26022] Updated weights on worker 0-0, policy_version 436527 (0.00084) [2022-07-09 21:54:54,560][25689] Fps is (10 sec: 5662.5, 60 sec: 5635.1, 300 sec: 5656.9). Total num frames: 447003648. Throughput: 0: 5935.5. Samples: 447007830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:54,560][25689] Avg episode reward: [(0, '-43.935')] [2022-07-09 21:54:56,299][26022] Updated weights on worker 0-0, policy_version 436537 (0.00084) [2022-07-09 21:54:58,008][26022] Updated weights on worker 0-0, policy_version 436547 (0.00085) [2022-07-09 21:54:59,574][25689] Fps is (10 sec: 5693.1, 60 sec: 5669.2, 300 sec: 5668.6). Total num frames: 447033344. Throughput: 0: 5942.8. Samples: 447042176. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:54:59,574][25689] Avg episode reward: [(0, '-43.774')] [2022-07-09 21:54:59,814][26022] Updated weights on worker 0-0, policy_version 436557 (0.00086) [2022-07-09 21:55:01,813][26022] Updated weights on worker 0-0, policy_version 436567 (0.00085) [2022-07-09 21:55:03,729][26022] Updated weights on worker 0-0, policy_version 436577 (0.00086) [2022-07-09 21:55:04,655][25689] Fps is (10 sec: 5477.2, 60 sec: 5658.6, 300 sec: 5654.3). Total num frames: 447058944. Throughput: 0: 5819.5. Samples: 447056888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:04,656][25689] Avg episode reward: [(0, '-43.795')] [2022-07-09 21:55:05,665][26022] Updated weights on worker 0-0, policy_version 436587 (0.00082) [2022-07-09 21:55:07,501][26022] Updated weights on worker 0-0, policy_version 436597 (0.00949) [2022-07-09 21:55:09,177][26022] Updated weights on worker 0-0, policy_version 436607 (0.00088) [2022-07-09 21:55:09,694][25689] Fps is (10 sec: 5463.8, 60 sec: 5664.3, 300 sec: 5662.0). Total num frames: 447088640. Throughput: 0: 5828.4. Samples: 447091018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:09,695][25689] Avg episode reward: [(0, '-43.448')] [2022-07-09 21:55:11,235][26022] Updated weights on worker 0-0, policy_version 436617 (0.00083) [2022-07-09 21:55:12,755][26022] Updated weights on worker 0-0, policy_version 436627 (0.01389) [2022-07-09 21:55:14,612][26022] Updated weights on worker 0-0, policy_version 436637 (0.00087) [2022-07-09 21:55:14,717][25689] Fps is (10 sec: 5699.0, 60 sec: 5647.5, 300 sec: 5658.6). Total num frames: 447116288. Throughput: 0: 5841.8. Samples: 447125588. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:14,718][25689] Avg episode reward: [(0, '-44.401')] [2022-07-09 21:55:16,375][26022] Updated weights on worker 0-0, policy_version 436647 (0.00084) [2022-07-09 21:55:18,099][26022] Updated weights on worker 0-0, policy_version 436657 (0.00082) [2022-07-09 21:55:19,719][25689] Fps is (10 sec: 5618.3, 60 sec: 5651.8, 300 sec: 5657.6). Total num frames: 447144960. Throughput: 0: 4999.3. Samples: 447142886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:19,719][25689] Avg episode reward: [(0, '-45.022')] [2022-07-09 21:55:20,021][26022] Updated weights on worker 0-0, policy_version 436667 (0.00087) [2022-07-09 21:55:21,550][26022] Updated weights on worker 0-0, policy_version 436677 (0.00090) [2022-07-09 21:55:23,725][26022] Updated weights on worker 0-0, policy_version 436687 (0.00085) [2022-07-09 21:55:24,768][25689] Fps is (10 sec: 5807.7, 60 sec: 5671.9, 300 sec: 5667.8). Total num frames: 447174656. Throughput: 0: 5991.5. Samples: 447177390. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:24,768][25689] Avg episode reward: [(0, '-45.325')] [2022-07-09 21:55:25,255][26022] Updated weights on worker 0-0, policy_version 436697 (0.00093) [2022-07-09 21:55:27,221][26022] Updated weights on worker 0-0, policy_version 436707 (0.00084) [2022-07-09 21:55:29,038][26022] Updated weights on worker 0-0, policy_version 436717 (0.00090) [2022-07-09 21:55:29,770][25689] Fps is (10 sec: 5705.2, 60 sec: 5659.4, 300 sec: 5657.7). Total num frames: 447202304. Throughput: 0: 6004.1. Samples: 447211554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:29,771][25689] Avg episode reward: [(0, '-44.924')] [2022-07-09 21:55:30,756][26022] Updated weights on worker 0-0, policy_version 436727 (0.00088) [2022-07-09 21:55:32,463][26022] Updated weights on worker 0-0, policy_version 436737 (0.00088) [2022-07-09 21:55:34,503][26022] Updated weights on worker 0-0, policy_version 436747 (0.00089) [2022-07-09 21:55:34,722][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:55:34,737][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000436748_447229952.pth [2022-07-09 21:55:34,737][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000434759_445193216.pth [2022-07-09 21:55:34,795][25689] Fps is (10 sec: 5514.6, 60 sec: 5644.6, 300 sec: 5658.8). Total num frames: 447229952. Throughput: 0: 5118.4. Samples: 447228352. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:34,795][25689] Avg episode reward: [(0, '-45.002')] [2022-07-09 21:55:36,125][26022] Updated weights on worker 0-0, policy_version 436757 (0.00085) [2022-07-09 21:55:38,136][26022] Updated weights on worker 0-0, policy_version 436767 (0.00095) [2022-07-09 21:55:39,693][26022] Updated weights on worker 0-0, policy_version 436777 (0.00089) [2022-07-09 21:55:39,836][25689] Fps is (10 sec: 5696.6, 60 sec: 5641.5, 300 sec: 5662.6). Total num frames: 447259648. Throughput: 0: 5946.0. Samples: 447262504. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:39,837][25689] Avg episode reward: [(0, '-45.245')] [2022-07-09 21:55:41,723][26022] Updated weights on worker 0-0, policy_version 436787 (0.00092) [2022-07-09 21:55:43,320][26022] Updated weights on worker 0-0, policy_version 436797 (0.00093) [2022-07-09 21:55:44,902][25689] Fps is (10 sec: 5876.1, 60 sec: 5673.0, 300 sec: 5665.6). Total num frames: 447289344. Throughput: 0: 5927.8. Samples: 447296744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:44,903][25689] Avg episode reward: [(0, '-44.226')] [2022-07-09 21:55:45,037][26022] Updated weights on worker 0-0, policy_version 436807 (0.00093) [2022-07-09 21:55:46,910][26022] Updated weights on worker 0-0, policy_version 436817 (0.00096) [2022-07-09 21:55:48,822][26022] Updated weights on worker 0-0, policy_version 436827 (0.00104) [2022-07-09 21:55:49,934][25689] Fps is (10 sec: 5679.0, 60 sec: 5642.5, 300 sec: 5658.9). Total num frames: 447316992. Throughput: 0: 5083.1. Samples: 447314048. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:49,934][25689] Avg episode reward: [(0, '-44.740')] [2022-07-09 21:55:50,574][26022] Updated weights on worker 0-0, policy_version 436837 (0.00089) [2022-07-09 21:55:52,185][26022] Updated weights on worker 0-0, policy_version 436847 (0.00090) [2022-07-09 21:55:54,195][26022] Updated weights on worker 0-0, policy_version 436857 (0.00088) [2022-07-09 21:55:54,955][25689] Fps is (10 sec: 5704.5, 60 sec: 5680.0, 300 sec: 5666.8). Total num frames: 447346688. Throughput: 0: 5945.8. Samples: 447348218. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:55:54,955][25689] Avg episode reward: [(0, '-45.090')] [2022-07-09 21:55:56,011][26022] Updated weights on worker 0-0, policy_version 436867 (0.00097) [2022-07-09 21:55:57,758][26022] Updated weights on worker 0-0, policy_version 436877 (0.00091) [2022-07-09 21:55:59,503][26022] Updated weights on worker 0-0, policy_version 436887 (0.00090) [2022-07-09 21:56:00,007][25689] Fps is (10 sec: 5794.2, 60 sec: 5659.4, 300 sec: 5673.7). Total num frames: 447375360. Throughput: 0: 5956.6. Samples: 447382654. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:56:00,008][25689] Avg episode reward: [(0, '-44.743')] [2022-07-09 21:56:01,373][26022] Updated weights on worker 0-0, policy_version 436897 (0.00092) [2022-07-09 21:56:03,306][26022] Updated weights on worker 0-0, policy_version 436907 (0.00086) [2022-07-09 21:56:05,063][25689] Fps is (10 sec: 5368.8, 60 sec: 5661.8, 300 sec: 5658.9). Total num frames: 447400960. Throughput: 0: 4994.7. Samples: 447397444. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:56:05,064][25689] Avg episode reward: [(0, '-44.678')] [2022-07-09 21:56:05,323][26022] Updated weights on worker 0-0, policy_version 436917 (0.00091) [2022-07-09 21:56:07,065][26022] Updated weights on worker 0-0, policy_version 436927 (0.00084) [2022-07-09 21:56:08,911][26022] Updated weights on worker 0-0, policy_version 436937 (0.00085) [2022-07-09 21:56:10,067][25689] Fps is (10 sec: 5395.1, 60 sec: 5648.1, 300 sec: 5659.2). Total num frames: 447429632. Throughput: 0: 5851.7. Samples: 447431860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:56:10,067][25689] Avg episode reward: [(0, '-44.988')] [2022-07-09 21:56:10,771][26022] Updated weights on worker 0-0, policy_version 436947 (0.00082) [2022-07-09 21:56:12,312][26022] Updated weights on worker 0-0, policy_version 436957 (0.00083) [2022-07-09 21:56:14,371][26022] Updated weights on worker 0-0, policy_version 436967 (0.00088) [2022-07-09 21:56:15,071][25689] Fps is (10 sec: 5832.5, 60 sec: 5683.9, 300 sec: 5666.4). Total num frames: 447459328. Throughput: 0: 5882.7. Samples: 447466556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-09 21:56:15,071][25689] Avg episode reward: [(0, '-44.651')] [2022-07-09 21:56:15,844][26022] Updated weights on worker 0-0, policy_version 436977 (0.00090) [2022-07-09 21:56:17,767][26022] Updated weights on worker 0-0, policy_version 436987 (0.00081) [2022-07-09 21:56:19,550][26022] Updated weights on worker 0-0, policy_version 436997 (0.00083) [2022-07-09 21:56:20,102][25689] Fps is (10 sec: 5714.1, 60 sec: 5664.1, 300 sec: 5656.5). Total num frames: 447486976. Throughput: 0: 5025.6. Samples: 447483644. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:20,103][25689] Avg episode reward: [(0, '-44.515')] [2022-07-09 21:56:21,333][26022] Updated weights on worker 0-0, policy_version 437007 (0.00092) [2022-07-09 21:56:23,151][26022] Updated weights on worker 0-0, policy_version 437017 (0.00084) [2022-07-09 21:56:24,957][26022] Updated weights on worker 0-0, policy_version 437027 (0.01145) [2022-07-09 21:56:25,199][25689] Fps is (10 sec: 5560.5, 60 sec: 5642.6, 300 sec: 5655.0). Total num frames: 447515648. Throughput: 0: 5992.5. Samples: 447518110. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:25,200][25689] Avg episode reward: [(0, '-45.348')] [2022-07-09 21:56:26,697][26022] Updated weights on worker 0-0, policy_version 437037 (0.00093) [2022-07-09 21:56:28,612][26022] Updated weights on worker 0-0, policy_version 437047 (0.00088) [2022-07-09 21:56:30,239][25689] Fps is (10 sec: 5758.2, 60 sec: 5673.0, 300 sec: 5661.3). Total num frames: 447545344. Throughput: 0: 5960.0. Samples: 447552084. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:30,239][25689] Avg episode reward: [(0, '-45.434')] [2022-07-09 21:56:30,252][26022] Updated weights on worker 0-0, policy_version 437057 (0.00092) [2022-07-09 21:56:32,119][26022] Updated weights on worker 0-0, policy_version 437067 (0.00096) [2022-07-09 21:56:34,325][26022] Updated weights on worker 0-0, policy_version 437077 (0.00081) [2022-07-09 21:56:35,262][25689] Fps is (10 sec: 5698.5, 60 sec: 5673.1, 300 sec: 5654.7). Total num frames: 447572992. Throughput: 0: 5070.3. Samples: 447568934. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:35,263][25689] Avg episode reward: [(0, '-44.983')] [2022-07-09 21:56:35,591][26022] Updated weights on worker 0-0, policy_version 437087 (0.00085) [2022-07-09 21:56:37,715][26022] Updated weights on worker 0-0, policy_version 437097 (0.00079) [2022-07-09 21:56:39,200][26022] Updated weights on worker 0-0, policy_version 437107 (0.00088) [2022-07-09 21:56:40,280][25689] Fps is (10 sec: 5507.1, 60 sec: 5641.5, 300 sec: 5653.1). Total num frames: 447600640. Throughput: 0: 5926.0. Samples: 447603214. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:40,280][25689] Avg episode reward: [(0, '-44.876')] [2022-07-09 21:56:41,198][26022] Updated weights on worker 0-0, policy_version 437117 (0.00087) [2022-07-09 21:56:43,240][26022] Updated weights on worker 0-0, policy_version 437127 (0.00095) [2022-07-09 21:56:44,670][26022] Updated weights on worker 0-0, policy_version 437137 (0.00088) [2022-07-09 21:56:45,354][25689] Fps is (10 sec: 5885.2, 60 sec: 5674.6, 300 sec: 5666.0). Total num frames: 447632384. Throughput: 0: 5930.3. Samples: 447637634. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:45,360][25689] Avg episode reward: [(0, '-44.582')] [2022-07-09 21:56:46,730][26022] Updated weights on worker 0-0, policy_version 437147 (0.00083) [2022-07-09 21:56:48,144][26022] Updated weights on worker 0-0, policy_version 437157 (0.00086) [2022-07-09 21:56:50,071][26022] Updated weights on worker 0-0, policy_version 437167 (0.00092) [2022-07-09 21:56:50,362][25689] Fps is (10 sec: 5890.5, 60 sec: 5676.8, 300 sec: 5655.6). Total num frames: 447660032. Throughput: 0: 5109.1. Samples: 447654896. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:50,363][25689] Avg episode reward: [(0, '-44.392')] [2022-07-09 21:56:51,964][26022] Updated weights on worker 0-0, policy_version 437177 (0.00081) [2022-07-09 21:56:53,667][26022] Updated weights on worker 0-0, policy_version 437187 (0.00088) [2022-07-09 21:56:55,398][25689] Fps is (10 sec: 5607.5, 60 sec: 5658.4, 300 sec: 5658.7). Total num frames: 447688704. Throughput: 0: 5986.2. Samples: 447689470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:56:55,398][25689] Avg episode reward: [(0, '-44.576')] [2022-07-09 21:56:55,527][26022] Updated weights on worker 0-0, policy_version 437197 (0.00090) [2022-07-09 21:56:57,387][26022] Updated weights on worker 0-0, policy_version 437207 (0.00090) [2022-07-09 21:56:58,928][26022] Updated weights on worker 0-0, policy_version 437217 (0.00060) [2022-07-09 21:57:00,399][25689] Fps is (10 sec: 5611.3, 60 sec: 5646.3, 300 sec: 5663.1). Total num frames: 447716352. Throughput: 0: 6007.4. Samples: 447724080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:00,399][25689] Avg episode reward: [(0, '-44.278')] [2022-07-09 21:57:00,924][26022] Updated weights on worker 0-0, policy_version 437227 (0.00087) [2022-07-09 21:57:02,987][26022] Updated weights on worker 0-0, policy_version 437237 (0.00101) [2022-07-09 21:57:04,680][26022] Updated weights on worker 0-0, policy_version 437247 (0.00088) [2022-07-09 21:57:05,447][25689] Fps is (10 sec: 5604.5, 60 sec: 5698.0, 300 sec: 5662.3). Total num frames: 447745024. Throughput: 0: 5050.9. Samples: 447739120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:05,448][25689] Avg episode reward: [(0, '-44.821')] [2022-07-09 21:57:06,698][26022] Updated weights on worker 0-0, policy_version 437257 (0.00085) [2022-07-09 21:57:08,186][26022] Updated weights on worker 0-0, policy_version 437267 (0.00083) [2022-07-09 21:57:10,421][26022] Updated weights on worker 0-0, policy_version 437277 (0.00093) [2022-07-09 21:57:10,472][25689] Fps is (10 sec: 5489.6, 60 sec: 5662.0, 300 sec: 5658.6). Total num frames: 447771648. Throughput: 0: 5877.6. Samples: 447773094. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:10,472][25689] Avg episode reward: [(0, '-45.313')] [2022-07-09 21:57:11,821][26022] Updated weights on worker 0-0, policy_version 437287 (0.00085) [2022-07-09 21:57:13,926][26022] Updated weights on worker 0-0, policy_version 437297 (0.00088) [2022-07-09 21:57:15,408][26022] Updated weights on worker 0-0, policy_version 437307 (0.00088) [2022-07-09 21:57:15,485][25689] Fps is (10 sec: 5712.6, 60 sec: 5678.1, 300 sec: 5662.5). Total num frames: 447802368. Throughput: 0: 5877.2. Samples: 447807526. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:15,486][25689] Avg episode reward: [(0, '-44.590')] [2022-07-09 21:57:17,459][26022] Updated weights on worker 0-0, policy_version 437317 (0.00091) [2022-07-09 21:57:19,237][26022] Updated weights on worker 0-0, policy_version 437327 (0.00089) [2022-07-09 21:57:20,494][25689] Fps is (10 sec: 5823.8, 60 sec: 5680.2, 300 sec: 5663.2). Total num frames: 447830016. Throughput: 0: 5009.9. Samples: 447824756. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:20,495][25689] Avg episode reward: [(0, '-44.306')] [2022-07-09 21:57:21,014][26022] Updated weights on worker 0-0, policy_version 437337 (0.00091) [2022-07-09 21:57:22,600][26022] Updated weights on worker 0-0, policy_version 437347 (0.00086) [2022-07-09 21:57:24,656][26022] Updated weights on worker 0-0, policy_version 437357 (0.00095) [2022-07-09 21:57:25,647][25689] Fps is (10 sec: 5643.2, 60 sec: 5692.0, 300 sec: 5664.0). Total num frames: 447859712. Throughput: 0: 5938.9. Samples: 447859084. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:25,647][25689] Avg episode reward: [(0, '-44.276')] [2022-07-09 21:57:26,321][26022] Updated weights on worker 0-0, policy_version 437367 (0.00093) [2022-07-09 21:57:28,079][26022] Updated weights on worker 0-0, policy_version 437377 (0.00084) [2022-07-09 21:57:30,133][26022] Updated weights on worker 0-0, policy_version 437387 (0.00086) [2022-07-09 21:57:30,677][25689] Fps is (10 sec: 5631.7, 60 sec: 5659.0, 300 sec: 5657.1). Total num frames: 447887360. Throughput: 0: 5946.0. Samples: 447893232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:30,677][25689] Avg episode reward: [(0, '-43.971')] [2022-07-09 21:57:31,598][26022] Updated weights on worker 0-0, policy_version 437397 (0.00089) [2022-07-09 21:57:33,671][26022] Updated weights on worker 0-0, policy_version 437407 (0.00084) [2022-07-09 21:57:34,888][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:57:34,897][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000437415_447912960.pth [2022-07-09 21:57:34,897][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000435424_445874176.pth [2022-07-09 21:57:35,285][26022] Updated weights on worker 0-0, policy_version 437417 (0.00089) [2022-07-09 21:57:35,681][25689] Fps is (10 sec: 5612.8, 60 sec: 5677.7, 300 sec: 5657.0). Total num frames: 447916032. Throughput: 0: 5092.3. Samples: 447910368. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:35,681][25689] Avg episode reward: [(0, '-43.837')] [2022-07-09 21:57:37,031][26022] Updated weights on worker 0-0, policy_version 437427 (0.00087) [2022-07-09 21:57:39,048][26022] Updated weights on worker 0-0, policy_version 437437 (0.00081) [2022-07-09 21:57:40,594][26022] Updated weights on worker 0-0, policy_version 437447 (0.00085) [2022-07-09 21:57:40,700][25689] Fps is (10 sec: 5823.4, 60 sec: 5711.5, 300 sec: 5667.6). Total num frames: 447945728. Throughput: 0: 5934.5. Samples: 447944664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:40,700][25689] Avg episode reward: [(0, '-43.490')] [2022-07-09 21:57:42,515][26022] Updated weights on worker 0-0, policy_version 437457 (0.00090) [2022-07-09 21:57:44,108][26022] Updated weights on worker 0-0, policy_version 437467 (0.00089) [2022-07-09 21:57:45,742][25689] Fps is (10 sec: 5801.6, 60 sec: 5663.7, 300 sec: 5663.6). Total num frames: 447974400. Throughput: 0: 5992.9. Samples: 447979510. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:45,742][25689] Avg episode reward: [(0, '-44.054')] [2022-07-09 21:57:46,016][26022] Updated weights on worker 0-0, policy_version 437477 (0.00093) [2022-07-09 21:57:47,808][26022] Updated weights on worker 0-0, policy_version 437487 (0.00087) [2022-07-09 21:57:49,520][26022] Updated weights on worker 0-0, policy_version 437497 (0.00084) [2022-07-09 21:57:50,755][25689] Fps is (10 sec: 5702.7, 60 sec: 5680.1, 300 sec: 5663.6). Total num frames: 448003072. Throughput: 0: 5149.0. Samples: 447996616. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:50,756][25689] Avg episode reward: [(0, '-43.880')] [2022-07-09 21:57:51,341][26022] Updated weights on worker 0-0, policy_version 437507 (0.00084) [2022-07-09 21:57:53,192][26022] Updated weights on worker 0-0, policy_version 437517 (0.00088) [2022-07-09 21:57:55,060][26022] Updated weights on worker 0-0, policy_version 437527 (0.00611) [2022-07-09 21:57:55,771][25689] Fps is (10 sec: 5717.4, 60 sec: 5682.0, 300 sec: 5663.3). Total num frames: 448031744. Throughput: 0: 5997.5. Samples: 448030860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:57:55,772][25689] Avg episode reward: [(0, '-44.446')] [2022-07-09 21:57:56,790][26022] Updated weights on worker 0-0, policy_version 437537 (0.00111) [2022-07-09 21:57:58,631][26022] Updated weights on worker 0-0, policy_version 437547 (0.00089) [2022-07-09 21:58:00,499][26022] Updated weights on worker 0-0, policy_version 437557 (0.00090) [2022-07-09 21:58:00,796][25689] Fps is (10 sec: 5711.0, 60 sec: 5696.7, 300 sec: 5671.3). Total num frames: 448060416. Throughput: 0: 5995.1. Samples: 448065144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:00,797][25689] Avg episode reward: [(0, '-45.146')] [2022-07-09 21:58:02,670][26022] Updated weights on worker 0-0, policy_version 437567 (0.00106) [2022-07-09 21:58:04,397][26022] Updated weights on worker 0-0, policy_version 437577 (0.00083) [2022-07-09 21:58:05,879][25689] Fps is (10 sec: 5369.5, 60 sec: 5642.6, 300 sec: 5659.6). Total num frames: 448086016. Throughput: 0: 4999.6. Samples: 448080188. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:05,879][25689] Avg episode reward: [(0, '-44.789')] [2022-07-09 21:58:06,240][26022] Updated weights on worker 0-0, policy_version 437587 (0.00093) [2022-07-09 21:58:07,833][26022] Updated weights on worker 0-0, policy_version 437597 (0.00093) [2022-07-09 21:58:09,775][26022] Updated weights on worker 0-0, policy_version 437607 (0.00084) [2022-07-09 21:58:10,892][25689] Fps is (10 sec: 5578.6, 60 sec: 5711.5, 300 sec: 5669.8). Total num frames: 448116736. Throughput: 0: 5844.7. Samples: 448114310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:10,894][25689] Avg episode reward: [(0, '-44.730')] [2022-07-09 21:58:11,554][26022] Updated weights on worker 0-0, policy_version 437617 (0.00085) [2022-07-09 21:58:13,374][26022] Updated weights on worker 0-0, policy_version 437627 (0.00930) [2022-07-09 21:58:15,290][26022] Updated weights on worker 0-0, policy_version 437637 (0.00092) [2022-07-09 21:58:15,895][25689] Fps is (10 sec: 5827.4, 60 sec: 5661.6, 300 sec: 5659.7). Total num frames: 448144384. Throughput: 0: 5854.3. Samples: 448148670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:15,895][25689] Avg episode reward: [(0, '-44.800')] [2022-07-09 21:58:16,805][26022] Updated weights on worker 0-0, policy_version 437647 (0.00085) [2022-07-09 21:58:18,791][26022] Updated weights on worker 0-0, policy_version 437657 (0.00082) [2022-07-09 21:58:20,512][26022] Updated weights on worker 0-0, policy_version 437667 (0.00085) [2022-07-09 21:58:20,955][25689] Fps is (10 sec: 5596.8, 60 sec: 5673.8, 300 sec: 5663.5). Total num frames: 448173056. Throughput: 0: 4984.3. Samples: 448165620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:20,956][25689] Avg episode reward: [(0, '-45.252')] [2022-07-09 21:58:22,328][26022] Updated weights on worker 0-0, policy_version 437677 (0.00101) [2022-07-09 21:58:24,080][26022] Updated weights on worker 0-0, policy_version 437687 (0.00086) [2022-07-09 21:58:25,877][26022] Updated weights on worker 0-0, policy_version 437697 (0.00100) [2022-07-09 21:58:26,060][25689] Fps is (10 sec: 5641.1, 60 sec: 5661.3, 300 sec: 5658.1). Total num frames: 448201728. Throughput: 0: 5945.5. Samples: 448200176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:26,062][25689] Avg episode reward: [(0, '-45.178')] [2022-07-09 21:58:27,763][26022] Updated weights on worker 0-0, policy_version 437707 (0.00087) [2022-07-09 21:58:29,662][26022] Updated weights on worker 0-0, policy_version 437717 (0.00089) [2022-07-09 21:58:31,091][25689] Fps is (10 sec: 5657.1, 60 sec: 5678.1, 300 sec: 5661.4). Total num frames: 448230400. Throughput: 0: 5944.8. Samples: 448234392. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:31,093][25689] Avg episode reward: [(0, '-44.311')] [2022-07-09 21:58:31,213][26022] Updated weights on worker 0-0, policy_version 437727 (0.00089) [2022-07-09 21:58:33,155][26022] Updated weights on worker 0-0, policy_version 437737 (0.00085) [2022-07-09 21:58:34,969][26022] Updated weights on worker 0-0, policy_version 437747 (0.00086) [2022-07-09 21:58:36,171][25689] Fps is (10 sec: 5570.3, 60 sec: 5654.1, 300 sec: 5660.0). Total num frames: 448258048. Throughput: 0: 5893.5. Samples: 448268166. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:36,173][25689] Avg episode reward: [(0, '-45.726')] [2022-07-09 21:58:36,797][26022] Updated weights on worker 0-0, policy_version 437757 (0.00082) [2022-07-09 21:58:38,674][26022] Updated weights on worker 0-0, policy_version 437767 (0.00092) [2022-07-09 21:58:40,279][26022] Updated weights on worker 0-0, policy_version 437777 (0.00084) [2022-07-09 21:58:41,241][25689] Fps is (10 sec: 5650.1, 60 sec: 5649.3, 300 sec: 5659.3). Total num frames: 448287744. Throughput: 0: 5898.1. Samples: 448285268. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:41,242][25689] Avg episode reward: [(0, '-45.301')] [2022-07-09 21:58:42,354][26022] Updated weights on worker 0-0, policy_version 437787 (0.00084) [2022-07-09 21:58:43,878][26022] Updated weights on worker 0-0, policy_version 437797 (0.00088) [2022-07-09 21:58:45,852][26022] Updated weights on worker 0-0, policy_version 437807 (0.00087) [2022-07-09 21:58:46,309][25689] Fps is (10 sec: 5959.2, 60 sec: 5680.7, 300 sec: 5666.0). Total num frames: 448318464. Throughput: 0: 5892.4. Samples: 448319492. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:46,310][25689] Avg episode reward: [(0, '-45.297')] [2022-07-09 21:58:47,570][26022] Updated weights on worker 0-0, policy_version 437817 (0.00098) [2022-07-09 21:58:49,385][26022] Updated weights on worker 0-0, policy_version 437827 (0.00084) [2022-07-09 21:58:51,280][26022] Updated weights on worker 0-0, policy_version 437837 (0.00089) [2022-07-09 21:58:51,383][25689] Fps is (10 sec: 5654.0, 60 sec: 5641.3, 300 sec: 5658.3). Total num frames: 448345088. Throughput: 0: 5862.2. Samples: 448353344. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:51,383][25689] Avg episode reward: [(0, '-45.190')] [2022-07-09 21:58:52,948][26022] Updated weights on worker 0-0, policy_version 437847 (0.00086) [2022-07-09 21:58:54,917][26022] Updated weights on worker 0-0, policy_version 437857 (0.00086) [2022-07-09 21:58:56,445][25689] Fps is (10 sec: 5556.6, 60 sec: 5653.9, 300 sec: 5664.4). Total num frames: 448374784. Throughput: 0: 5044.8. Samples: 448370444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:58:56,445][25689] Avg episode reward: [(0, '-44.447')] [2022-07-09 21:58:56,703][26022] Updated weights on worker 0-0, policy_version 437867 (0.00085) [2022-07-09 21:58:58,464][26022] Updated weights on worker 0-0, policy_version 437877 (0.00092) [2022-07-09 21:59:00,173][26022] Updated weights on worker 0-0, policy_version 437887 (0.00082) [2022-07-09 21:59:01,448][25689] Fps is (10 sec: 5697.0, 60 sec: 5639.0, 300 sec: 5670.6). Total num frames: 448402432. Throughput: 0: 5909.1. Samples: 448404678. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-09 21:59:01,449][25689] Avg episode reward: [(0, '-44.866')] [2022-07-09 21:59:01,945][26022] Updated weights on worker 0-0, policy_version 437897 (0.00094) [2022-07-09 21:59:04,459][26022] Updated weights on worker 0-0, policy_version 437907 (0.00087) [2022-07-09 21:59:05,952][26022] Updated weights on worker 0-0, policy_version 437917 (0.00091) [2022-07-09 21:59:06,605][25689] Fps is (10 sec: 5442.5, 60 sec: 5665.8, 300 sec: 5662.6). Total num frames: 448430080. Throughput: 0: 5758.6. Samples: 448436368. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:06,605][25689] Avg episode reward: [(0, '-44.780')] [2022-07-09 21:59:08,043][26022] Updated weights on worker 0-0, policy_version 437927 (0.00094) [2022-07-09 21:59:09,508][26022] Updated weights on worker 0-0, policy_version 437937 (0.00093) [2022-07-09 21:59:11,652][25689] Fps is (10 sec: 5318.8, 60 sec: 5595.2, 300 sec: 5655.3). Total num frames: 448456704. Throughput: 0: 4938.0. Samples: 448453434. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:11,653][25689] Avg episode reward: [(0, '-44.889')] [2022-07-09 21:59:11,683][26022] Updated weights on worker 0-0, policy_version 437947 (0.00076) [2022-07-09 21:59:13,031][26022] Updated weights on worker 0-0, policy_version 437957 (0.00087) [2022-07-09 21:59:15,295][26022] Updated weights on worker 0-0, policy_version 437967 (0.00087) [2022-07-09 21:59:16,678][25689] Fps is (10 sec: 5794.6, 60 sec: 5660.5, 300 sec: 5666.1). Total num frames: 448488448. Throughput: 0: 5806.7. Samples: 448487930. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:16,678][25689] Avg episode reward: [(0, '-44.892')] [2022-07-09 21:59:16,680][26022] Updated weights on worker 0-0, policy_version 437977 (0.00117) [2022-07-09 21:59:18,641][26022] Updated weights on worker 0-0, policy_version 437987 (0.00084) [2022-07-09 21:59:20,314][26022] Updated weights on worker 0-0, policy_version 437997 (0.00109) [2022-07-09 21:59:21,696][25689] Fps is (10 sec: 5811.4, 60 sec: 5630.7, 300 sec: 5660.4). Total num frames: 448515072. Throughput: 0: 5810.2. Samples: 448522320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:21,696][25689] Avg episode reward: [(0, '-44.985')] [2022-07-09 21:59:22,125][26022] Updated weights on worker 0-0, policy_version 438007 (0.00096) [2022-07-09 21:59:23,963][26022] Updated weights on worker 0-0, policy_version 438017 (0.00086) [2022-07-09 21:59:25,763][26022] Updated weights on worker 0-0, policy_version 438027 (0.00942) [2022-07-09 21:59:26,766][25689] Fps is (10 sec: 5582.4, 60 sec: 5650.8, 300 sec: 5663.5). Total num frames: 448544768. Throughput: 0: 5113.9. Samples: 448539472. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:26,767][25689] Avg episode reward: [(0, '-44.179')] [2022-07-09 21:59:27,511][26022] Updated weights on worker 0-0, policy_version 438037 (0.00087) [2022-07-09 21:59:29,426][26022] Updated weights on worker 0-0, policy_version 438047 (0.00084) [2022-07-09 21:59:31,164][26022] Updated weights on worker 0-0, policy_version 438057 (0.00086) [2022-07-09 21:59:31,824][25689] Fps is (10 sec: 5762.7, 60 sec: 5648.3, 300 sec: 5663.3). Total num frames: 448573440. Throughput: 0: 5964.2. Samples: 448573744. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:31,825][25689] Avg episode reward: [(0, '-44.125')] [2022-07-09 21:59:33,226][26022] Updated weights on worker 0-0, policy_version 438067 (0.00093) [2022-07-09 21:59:34,639][26022] Updated weights on worker 0-0, policy_version 438077 (0.00087) [2022-07-09 21:59:35,105][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 21:59:35,119][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000438080_448593920.pth [2022-07-09 21:59:35,121][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000436086_446552064.pth [2022-07-09 21:59:36,872][25689] Fps is (10 sec: 5471.7, 60 sec: 5634.4, 300 sec: 5652.2). Total num frames: 448600064. Throughput: 0: 5935.8. Samples: 448607800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:36,873][25689] Avg episode reward: [(0, '-44.233')] [2022-07-09 21:59:36,885][26022] Updated weights on worker 0-0, policy_version 438087 (0.00089) [2022-07-09 21:59:38,239][26022] Updated weights on worker 0-0, policy_version 438097 (0.00082) [2022-07-09 21:59:40,386][26022] Updated weights on worker 0-0, policy_version 438107 (0.00083) [2022-07-09 21:59:41,871][26022] Updated weights on worker 0-0, policy_version 438117 (0.00085) [2022-07-09 21:59:41,946][25689] Fps is (10 sec: 5766.5, 60 sec: 5667.7, 300 sec: 5665.3). Total num frames: 448631808. Throughput: 0: 5065.1. Samples: 448624890. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:41,947][25689] Avg episode reward: [(0, '-43.430')] [2022-07-09 21:59:43,768][26022] Updated weights on worker 0-0, policy_version 438127 (0.00093) [2022-07-09 21:59:45,475][26022] Updated weights on worker 0-0, policy_version 438137 (0.00086) [2022-07-09 21:59:47,000][25689] Fps is (10 sec: 5864.1, 60 sec: 5618.5, 300 sec: 5658.7). Total num frames: 448659456. Throughput: 0: 5927.5. Samples: 448659408. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:47,001][25689] Avg episode reward: [(0, '-44.292')] [2022-07-09 21:59:47,327][26022] Updated weights on worker 0-0, policy_version 438147 (0.00086) [2022-07-09 21:59:48,954][26022] Updated weights on worker 0-0, policy_version 438157 (0.00081) [2022-07-09 21:59:51,063][26022] Updated weights on worker 0-0, policy_version 438167 (0.00109) [2022-07-09 21:59:52,001][25689] Fps is (10 sec: 5703.0, 60 sec: 5675.9, 300 sec: 5666.7). Total num frames: 448689152. Throughput: 0: 5953.1. Samples: 448693860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:52,002][25689] Avg episode reward: [(0, '-44.871')] [2022-07-09 21:59:52,615][26022] Updated weights on worker 0-0, policy_version 438177 (0.00084) [2022-07-09 21:59:54,584][26022] Updated weights on worker 0-0, policy_version 438187 (0.00089) [2022-07-09 21:59:56,242][26022] Updated weights on worker 0-0, policy_version 438197 (0.00091) [2022-07-09 21:59:57,029][25689] Fps is (10 sec: 5718.1, 60 sec: 5645.3, 300 sec: 5659.5). Total num frames: 448716800. Throughput: 0: 5118.5. Samples: 448710972. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 21:59:57,030][25689] Avg episode reward: [(0, '-45.208')] [2022-07-09 21:59:58,251][26022] Updated weights on worker 0-0, policy_version 438207 (0.00085) [2022-07-09 21:59:59,822][26022] Updated weights on worker 0-0, policy_version 438217 (0.00100) [2022-07-09 22:00:01,815][26022] Updated weights on worker 0-0, policy_version 438227 (0.00090) [2022-07-09 22:00:02,083][25689] Fps is (10 sec: 5586.6, 60 sec: 5657.5, 300 sec: 5670.4). Total num frames: 448745472. Throughput: 0: 5972.4. Samples: 448745152. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:02,083][25689] Avg episode reward: [(0, '-45.108')] [2022-07-09 22:00:03,993][26022] Updated weights on worker 0-0, policy_version 438237 (0.00081) [2022-07-09 22:00:05,699][26022] Updated weights on worker 0-0, policy_version 438247 (0.00096) [2022-07-09 22:00:07,182][25689] Fps is (10 sec: 5446.4, 60 sec: 5646.0, 300 sec: 5658.9). Total num frames: 448772096. Throughput: 0: 5820.6. Samples: 448776876. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:07,182][25689] Avg episode reward: [(0, '-45.038')] [2022-07-09 22:00:07,616][26022] Updated weights on worker 0-0, policy_version 438257 (0.00082) [2022-07-09 22:00:09,465][26022] Updated weights on worker 0-0, policy_version 438267 (0.00086) [2022-07-09 22:00:11,075][26022] Updated weights on worker 0-0, policy_version 438277 (0.00086) [2022-07-09 22:00:12,225][25689] Fps is (10 sec: 5553.2, 60 sec: 5697.1, 300 sec: 5665.4). Total num frames: 448801792. Throughput: 0: 5785.3. Samples: 448810858. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:12,225][25689] Avg episode reward: [(0, '-45.238')] [2022-07-09 22:00:13,212][26022] Updated weights on worker 0-0, policy_version 438287 (0.00089) [2022-07-09 22:00:14,663][26022] Updated weights on worker 0-0, policy_version 438297 (0.00087) [2022-07-09 22:00:16,787][26022] Updated weights on worker 0-0, policy_version 438307 (0.00082) [2022-07-09 22:00:17,250][25689] Fps is (10 sec: 5797.1, 60 sec: 5646.4, 300 sec: 5665.0). Total num frames: 448830464. Throughput: 0: 5784.6. Samples: 448827944. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:17,251][25689] Avg episode reward: [(0, '-45.200')] [2022-07-09 22:00:18,262][26022] Updated weights on worker 0-0, policy_version 438317 (0.00086) [2022-07-09 22:00:20,122][26022] Updated weights on worker 0-0, policy_version 438327 (0.00090) [2022-07-09 22:00:21,863][26022] Updated weights on worker 0-0, policy_version 438337 (0.00079) [2022-07-09 22:00:22,291][25689] Fps is (10 sec: 5798.5, 60 sec: 5695.0, 300 sec: 5665.1). Total num frames: 448860160. Throughput: 0: 5802.7. Samples: 448862412. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:22,291][25689] Avg episode reward: [(0, '-44.607')] [2022-07-09 22:00:23,997][26022] Updated weights on worker 0-0, policy_version 438347 (0.00080) [2022-07-09 22:00:25,332][26022] Updated weights on worker 0-0, policy_version 438357 (0.00099) [2022-07-09 22:00:27,401][25689] Fps is (10 sec: 5548.2, 60 sec: 5640.6, 300 sec: 5659.6). Total num frames: 448886784. Throughput: 0: 5905.9. Samples: 448896290. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:27,402][25689] Avg episode reward: [(0, '-44.509')] [2022-07-09 22:00:27,599][26022] Updated weights on worker 0-0, policy_version 438367 (0.00113) [2022-07-09 22:00:28,980][26022] Updated weights on worker 0-0, policy_version 438377 (0.00087) [2022-07-09 22:00:31,227][26022] Updated weights on worker 0-0, policy_version 438387 (0.00084) [2022-07-09 22:00:32,431][25689] Fps is (10 sec: 5553.9, 60 sec: 5660.1, 300 sec: 5666.4). Total num frames: 448916480. Throughput: 0: 5072.7. Samples: 448913360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:32,433][25689] Avg episode reward: [(0, '-45.317')] [2022-07-09 22:00:32,802][26022] Updated weights on worker 0-0, policy_version 438397 (0.00090) [2022-07-09 22:00:34,663][26022] Updated weights on worker 0-0, policy_version 438407 (0.00093) [2022-07-09 22:00:36,387][26022] Updated weights on worker 0-0, policy_version 438417 (0.00084) [2022-07-09 22:00:37,469][25689] Fps is (10 sec: 5797.4, 60 sec: 5694.8, 300 sec: 5663.0). Total num frames: 448945152. Throughput: 0: 5901.6. Samples: 448947268. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:37,470][25689] Avg episode reward: [(0, '-45.334')] [2022-07-09 22:00:38,363][26022] Updated weights on worker 0-0, policy_version 438427 (0.00092) [2022-07-09 22:00:39,976][26022] Updated weights on worker 0-0, policy_version 438437 (0.00087) [2022-07-09 22:00:41,996][26022] Updated weights on worker 0-0, policy_version 438447 (0.00089) [2022-07-09 22:00:42,502][25689] Fps is (10 sec: 5694.3, 60 sec: 5648.0, 300 sec: 5660.2). Total num frames: 448973824. Throughput: 0: 5868.6. Samples: 448981022. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:42,502][25689] Avg episode reward: [(0, '-44.860')] [2022-07-09 22:00:43,640][26022] Updated weights on worker 0-0, policy_version 438457 (0.00088) [2022-07-09 22:00:45,420][26022] Updated weights on worker 0-0, policy_version 438467 (0.00093) [2022-07-09 22:00:47,485][26022] Updated weights on worker 0-0, policy_version 438477 (0.00086) [2022-07-09 22:00:47,565][25689] Fps is (10 sec: 5578.7, 60 sec: 5647.1, 300 sec: 5659.6). Total num frames: 449001472. Throughput: 0: 5053.5. Samples: 448998186. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:47,566][25689] Avg episode reward: [(0, '-44.598')] [2022-07-09 22:00:48,915][26022] Updated weights on worker 0-0, policy_version 438487 (0.00085) [2022-07-09 22:00:51,095][26022] Updated weights on worker 0-0, policy_version 438497 (0.00085) [2022-07-09 22:00:52,612][25689] Fps is (10 sec: 5570.3, 60 sec: 5625.9, 300 sec: 5655.7). Total num frames: 449030144. Throughput: 0: 5891.1. Samples: 449032250. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:52,613][25689] Avg episode reward: [(0, '-44.749')] [2022-07-09 22:00:52,630][26022] Updated weights on worker 0-0, policy_version 438507 (0.00089) [2022-07-09 22:00:54,554][26022] Updated weights on worker 0-0, policy_version 438517 (0.00086) [2022-07-09 22:00:56,380][26022] Updated weights on worker 0-0, policy_version 438527 (0.00962) [2022-07-09 22:00:57,638][25689] Fps is (10 sec: 5692.6, 60 sec: 5643.0, 300 sec: 5656.2). Total num frames: 449058816. Throughput: 0: 5906.1. Samples: 449066388. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:00:57,639][25689] Avg episode reward: [(0, '-43.760')] [2022-07-09 22:00:57,951][26022] Updated weights on worker 0-0, policy_version 438537 (0.00081) [2022-07-09 22:01:00,093][26022] Updated weights on worker 0-0, policy_version 438547 (0.00620) [2022-07-09 22:01:01,854][26022] Updated weights on worker 0-0, policy_version 438557 (0.00078) [2022-07-09 22:01:02,717][25689] Fps is (10 sec: 5370.7, 60 sec: 5589.9, 300 sec: 5655.7). Total num frames: 449084416. Throughput: 0: 5070.7. Samples: 449083530. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:02,719][25689] Avg episode reward: [(0, '-43.624')] [2022-07-09 22:01:03,865][26022] Updated weights on worker 0-0, policy_version 438567 (0.00092) [2022-07-09 22:01:05,792][26022] Updated weights on worker 0-0, policy_version 438577 (0.00102) [2022-07-09 22:01:07,527][26022] Updated weights on worker 0-0, policy_version 438587 (0.00097) [2022-07-09 22:01:07,811][25689] Fps is (10 sec: 5435.8, 60 sec: 5641.1, 300 sec: 5657.5). Total num frames: 449114112. Throughput: 0: 5800.8. Samples: 449115628. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:07,811][25689] Avg episode reward: [(0, '-43.876')] [2022-07-09 22:01:09,388][26022] Updated weights on worker 0-0, policy_version 438597 (0.00097) [2022-07-09 22:01:11,148][26022] Updated weights on worker 0-0, policy_version 438607 (0.00090) [2022-07-09 22:01:12,815][25689] Fps is (10 sec: 5780.5, 60 sec: 5627.8, 300 sec: 5654.0). Total num frames: 449142784. Throughput: 0: 5803.6. Samples: 449149498. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:12,816][25689] Avg episode reward: [(0, '-44.484')] [2022-07-09 22:01:12,944][26022] Updated weights on worker 0-0, policy_version 438617 (0.00530) [2022-07-09 22:01:14,816][26022] Updated weights on worker 0-0, policy_version 438627 (0.00099) [2022-07-09 22:01:16,519][26022] Updated weights on worker 0-0, policy_version 438637 (0.00087) [2022-07-09 22:01:17,833][25689] Fps is (10 sec: 5619.6, 60 sec: 5611.6, 300 sec: 5654.3). Total num frames: 449170432. Throughput: 0: 4970.5. Samples: 449166766. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:17,833][25689] Avg episode reward: [(0, '-44.451')] [2022-07-09 22:01:18,283][26022] Updated weights on worker 0-0, policy_version 438647 (0.00086) [2022-07-09 22:01:20,161][26022] Updated weights on worker 0-0, policy_version 438657 (0.00086) [2022-07-09 22:01:21,843][26022] Updated weights on worker 0-0, policy_version 438667 (0.00093) [2022-07-09 22:01:22,845][25689] Fps is (10 sec: 5717.0, 60 sec: 5614.2, 300 sec: 5659.3). Total num frames: 449200128. Throughput: 0: 5854.6. Samples: 449201370. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:22,846][25689] Avg episode reward: [(0, '-45.160')] [2022-07-09 22:01:23,662][26022] Updated weights on worker 0-0, policy_version 438677 (0.00097) [2022-07-09 22:01:25,410][26022] Updated weights on worker 0-0, policy_version 438687 (0.00098) [2022-07-09 22:01:27,358][26022] Updated weights on worker 0-0, policy_version 438697 (0.00087) [2022-07-09 22:01:27,915][25689] Fps is (10 sec: 5789.4, 60 sec: 5651.9, 300 sec: 5655.3). Total num frames: 449228800. Throughput: 0: 5955.3. Samples: 449235354. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:27,915][25689] Avg episode reward: [(0, '-46.262')] [2022-07-09 22:01:29,136][26022] Updated weights on worker 0-0, policy_version 438707 (0.00091) [2022-07-09 22:01:30,992][26022] Updated weights on worker 0-0, policy_version 438717 (0.00085) [2022-07-09 22:01:32,745][26022] Updated weights on worker 0-0, policy_version 438727 (0.00081) [2022-07-09 22:01:32,936][25689] Fps is (10 sec: 5682.6, 60 sec: 5635.7, 300 sec: 5658.8). Total num frames: 449257472. Throughput: 0: 5118.4. Samples: 449252488. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:32,937][25689] Avg episode reward: [(0, '-46.685')] [2022-07-09 22:01:34,490][26022] Updated weights on worker 0-0, policy_version 438737 (0.00085) [2022-07-09 22:01:35,355][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:01:35,366][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000438742_449271808.pth [2022-07-09 22:01:35,367][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000436748_447229952.pth [2022-07-09 22:01:36,091][26022] Updated weights on worker 0-0, policy_version 438747 (0.00091) [2022-07-09 22:01:37,964][25689] Fps is (10 sec: 5706.2, 60 sec: 5636.7, 300 sec: 5662.0). Total num frames: 449286144. Throughput: 0: 5969.2. Samples: 449286934. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:37,965][25689] Avg episode reward: [(0, '-45.726')] [2022-07-09 22:01:38,170][26022] Updated weights on worker 0-0, policy_version 438757 (0.00095) [2022-07-09 22:01:39,887][26022] Updated weights on worker 0-0, policy_version 438767 (0.00058) [2022-07-09 22:01:41,668][26022] Updated weights on worker 0-0, policy_version 438777 (0.00090) [2022-07-09 22:01:42,987][25689] Fps is (10 sec: 5603.8, 60 sec: 5620.7, 300 sec: 5649.3). Total num frames: 449313792. Throughput: 0: 5936.5. Samples: 449320940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-09 22:01:42,987][25689] Avg episode reward: [(0, '-45.869')] [2022-07-09 22:01:43,559][26022] Updated weights on worker 0-0, policy_version 438787 (0.00092) [2022-07-09 22:01:45,239][26022] Updated weights on worker 0-0, policy_version 438797 (0.00089) [2022-07-09 22:01:47,063][26022] Updated weights on worker 0-0, policy_version 438807 (0.00087) [2022-07-09 22:01:48,070][25689] Fps is (10 sec: 5674.3, 60 sec: 5652.6, 300 sec: 5654.7). Total num frames: 449343488. Throughput: 0: 5102.1. Samples: 449338190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:01:48,071][25689] Avg episode reward: [(0, '-45.414')] [2022-07-09 22:01:48,985][26022] Updated weights on worker 0-0, policy_version 438817 (0.00089) [2022-07-09 22:01:50,643][26022] Updated weights on worker 0-0, policy_version 438827 (0.00087) [2022-07-09 22:01:52,535][26022] Updated weights on worker 0-0, policy_version 438837 (0.00099) [2022-07-09 22:01:53,113][25689] Fps is (10 sec: 5764.1, 60 sec: 5653.1, 300 sec: 5654.6). Total num frames: 449372160. Throughput: 0: 5938.1. Samples: 449372298. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:01:53,113][25689] Avg episode reward: [(0, '-44.949')] [2022-07-09 22:01:54,300][26022] Updated weights on worker 0-0, policy_version 438847 (0.00089) [2022-07-09 22:01:56,004][26022] Updated weights on worker 0-0, policy_version 438857 (0.00096) [2022-07-09 22:01:57,914][26022] Updated weights on worker 0-0, policy_version 438867 (0.00093) [2022-07-09 22:01:58,205][25689] Fps is (10 sec: 5657.8, 60 sec: 5646.9, 300 sec: 5656.3). Total num frames: 449400832. Throughput: 0: 5918.4. Samples: 449406730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:01:58,206][25689] Avg episode reward: [(0, '-45.440')] [2022-07-09 22:01:59,553][26022] Updated weights on worker 0-0, policy_version 438877 (0.00086) [2022-07-09 22:02:01,824][26022] Updated weights on worker 0-0, policy_version 438887 (0.00079) [2022-07-09 22:02:03,234][25689] Fps is (10 sec: 5564.6, 60 sec: 5685.5, 300 sec: 5653.2). Total num frames: 449428480. Throughput: 0: 5076.6. Samples: 449423728. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:03,236][25689] Avg episode reward: [(0, '-46.109')] [2022-07-09 22:02:03,594][26022] Updated weights on worker 0-0, policy_version 438897 (0.00087) [2022-07-09 22:02:05,187][26022] Updated weights on worker 0-0, policy_version 438907 (0.00089) [2022-07-09 22:02:07,302][26022] Updated weights on worker 0-0, policy_version 438917 (0.00377) [2022-07-09 22:02:08,345][25689] Fps is (10 sec: 5655.5, 60 sec: 5683.8, 300 sec: 5661.9). Total num frames: 449458176. Throughput: 0: 5809.4. Samples: 449455974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:08,347][25689] Avg episode reward: [(0, '-45.935')] [2022-07-09 22:02:08,962][26022] Updated weights on worker 0-0, policy_version 438927 (0.00091) [2022-07-09 22:02:10,831][26022] Updated weights on worker 0-0, policy_version 438937 (0.00624) [2022-07-09 22:02:12,699][26022] Updated weights on worker 0-0, policy_version 438947 (0.00094) [2022-07-09 22:02:13,366][25689] Fps is (10 sec: 5558.5, 60 sec: 5648.4, 300 sec: 5648.0). Total num frames: 449484800. Throughput: 0: 5827.2. Samples: 449490318. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:13,368][25689] Avg episode reward: [(0, '-45.103')] [2022-07-09 22:02:14,429][26022] Updated weights on worker 0-0, policy_version 438957 (0.00090) [2022-07-09 22:02:16,128][26022] Updated weights on worker 0-0, policy_version 438967 (0.00086) [2022-07-09 22:02:18,024][26022] Updated weights on worker 0-0, policy_version 438977 (0.00089) [2022-07-09 22:02:18,394][25689] Fps is (10 sec: 5502.5, 60 sec: 5664.3, 300 sec: 5651.1). Total num frames: 449513472. Throughput: 0: 4996.1. Samples: 449507594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:18,396][25689] Avg episode reward: [(0, '-45.280')] [2022-07-09 22:02:19,707][26022] Updated weights on worker 0-0, policy_version 438987 (0.00086) [2022-07-09 22:02:21,567][26022] Updated weights on worker 0-0, policy_version 438997 (0.00092) [2022-07-09 22:02:23,402][25689] Fps is (10 sec: 5714.0, 60 sec: 5647.9, 300 sec: 5650.4). Total num frames: 449542144. Throughput: 0: 5867.5. Samples: 449542064. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:23,402][25689] Avg episode reward: [(0, '-44.760')] [2022-07-09 22:02:23,553][26022] Updated weights on worker 0-0, policy_version 439007 (0.00090) [2022-07-09 22:02:25,095][26022] Updated weights on worker 0-0, policy_version 439017 (0.00083) [2022-07-09 22:02:26,900][26022] Updated weights on worker 0-0, policy_version 439027 (0.00085) [2022-07-09 22:02:28,491][25689] Fps is (10 sec: 5882.4, 60 sec: 5679.9, 300 sec: 5659.6). Total num frames: 449572864. Throughput: 0: 5966.0. Samples: 449576164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:28,491][25689] Avg episode reward: [(0, '-45.145')] [2022-07-09 22:02:28,792][26022] Updated weights on worker 0-0, policy_version 439037 (0.00086) [2022-07-09 22:02:30,677][26022] Updated weights on worker 0-0, policy_version 439047 (0.00099) [2022-07-09 22:02:32,497][26022] Updated weights on worker 0-0, policy_version 439057 (0.00088) [2022-07-09 22:02:33,536][25689] Fps is (10 sec: 5759.7, 60 sec: 5660.8, 300 sec: 5655.4). Total num frames: 449600512. Throughput: 0: 5094.9. Samples: 449593080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:33,536][25689] Avg episode reward: [(0, '-45.575')] [2022-07-09 22:02:34,043][26022] Updated weights on worker 0-0, policy_version 439067 (0.00082) [2022-07-09 22:02:36,141][26022] Updated weights on worker 0-0, policy_version 439077 (0.00092) [2022-07-09 22:02:38,056][26022] Updated weights on worker 0-0, policy_version 439087 (0.00089) [2022-07-09 22:02:38,549][25689] Fps is (10 sec: 5497.7, 60 sec: 5645.3, 300 sec: 5648.6). Total num frames: 449628160. Throughput: 0: 5922.9. Samples: 449626966. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:38,549][25689] Avg episode reward: [(0, '-45.571')] [2022-07-09 22:02:39,744][26022] Updated weights on worker 0-0, policy_version 439097 (0.00090) [2022-07-09 22:02:41,658][26022] Updated weights on worker 0-0, policy_version 439107 (0.00088) [2022-07-09 22:02:43,362][26022] Updated weights on worker 0-0, policy_version 439117 (0.00093) [2022-07-09 22:02:43,552][25689] Fps is (10 sec: 5724.9, 60 sec: 5680.9, 300 sec: 5652.8). Total num frames: 449657856. Throughput: 0: 5872.0. Samples: 449660386. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:43,553][25689] Avg episode reward: [(0, '-45.407')] [2022-07-09 22:02:45,422][26022] Updated weights on worker 0-0, policy_version 439127 (0.00094) [2022-07-09 22:02:47,061][26022] Updated weights on worker 0-0, policy_version 439137 (0.00089) [2022-07-09 22:02:48,609][25689] Fps is (10 sec: 5496.3, 60 sec: 5615.7, 300 sec: 5641.6). Total num frames: 449683456. Throughput: 0: 5834.2. Samples: 449693540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:48,610][25689] Avg episode reward: [(0, '-44.756')] [2022-07-09 22:02:49,099][26022] Updated weights on worker 0-0, policy_version 439147 (0.00086) [2022-07-09 22:02:50,757][26022] Updated weights on worker 0-0, policy_version 439157 (0.00094) [2022-07-09 22:02:52,629][26022] Updated weights on worker 0-0, policy_version 439167 (0.00084) [2022-07-09 22:02:53,703][25689] Fps is (10 sec: 5447.6, 60 sec: 5627.9, 300 sec: 5643.6). Total num frames: 449713152. Throughput: 0: 5832.2. Samples: 449710698. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:53,703][25689] Avg episode reward: [(0, '-44.649')] [2022-07-09 22:02:54,521][26022] Updated weights on worker 0-0, policy_version 439177 (0.00086) [2022-07-09 22:02:56,288][26022] Updated weights on worker 0-0, policy_version 439187 (0.00090) [2022-07-09 22:02:57,990][26022] Updated weights on worker 0-0, policy_version 439197 (0.00084) [2022-07-09 22:02:58,734][25689] Fps is (10 sec: 5764.7, 60 sec: 5633.5, 300 sec: 5643.5). Total num frames: 449741824. Throughput: 0: 5837.2. Samples: 449744794. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:02:58,735][25689] Avg episode reward: [(0, '-44.309')] [2022-07-09 22:02:59,775][26022] Updated weights on worker 0-0, policy_version 439207 (0.00087) [2022-07-09 22:03:01,590][26022] Updated weights on worker 0-0, policy_version 439217 (0.00104) [2022-07-09 22:03:03,772][25689] Fps is (10 sec: 5491.8, 60 sec: 5615.8, 300 sec: 5647.8). Total num frames: 449768448. Throughput: 0: 5772.0. Samples: 449777092. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:03,772][25689] Avg episode reward: [(0, '-44.029')] [2022-07-09 22:03:03,776][26022] Updated weights on worker 0-0, policy_version 439227 (0.00092) [2022-07-09 22:03:05,635][26022] Updated weights on worker 0-0, policy_version 439237 (0.00085) [2022-07-09 22:03:07,233][26022] Updated weights on worker 0-0, policy_version 439247 (0.00087) [2022-07-09 22:03:08,841][25689] Fps is (10 sec: 5370.0, 60 sec: 5585.8, 300 sec: 5636.4). Total num frames: 449796096. Throughput: 0: 4975.0. Samples: 449794192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:08,843][25689] Avg episode reward: [(0, '-44.116')] [2022-07-09 22:03:09,065][26022] Updated weights on worker 0-0, policy_version 439257 (0.00090) [2022-07-09 22:03:11,095][26022] Updated weights on worker 0-0, policy_version 439267 (0.00100) [2022-07-09 22:03:12,709][26022] Updated weights on worker 0-0, policy_version 439277 (0.00086) [2022-07-09 22:03:13,860][25689] Fps is (10 sec: 5785.8, 60 sec: 5653.8, 300 sec: 5646.4). Total num frames: 449826816. Throughput: 0: 5845.7. Samples: 449828530. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:13,860][25689] Avg episode reward: [(0, '-45.566')] [2022-07-09 22:03:14,599][26022] Updated weights on worker 0-0, policy_version 439287 (0.00088) [2022-07-09 22:03:16,179][26022] Updated weights on worker 0-0, policy_version 439297 (0.00083) [2022-07-09 22:03:17,900][26022] Updated weights on worker 0-0, policy_version 439307 (0.00103) [2022-07-09 22:03:18,867][25689] Fps is (10 sec: 5821.8, 60 sec: 5638.8, 300 sec: 5644.0). Total num frames: 449854464. Throughput: 0: 5868.7. Samples: 449862944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:18,867][25689] Avg episode reward: [(0, '-45.869')] [2022-07-09 22:03:19,921][26022] Updated weights on worker 0-0, policy_version 439317 (0.00087) [2022-07-09 22:03:21,587][26022] Updated weights on worker 0-0, policy_version 439327 (0.00086) [2022-07-09 22:03:23,374][26022] Updated weights on worker 0-0, policy_version 439337 (0.00087) [2022-07-09 22:03:23,905][25689] Fps is (10 sec: 5708.5, 60 sec: 5652.8, 300 sec: 5648.7). Total num frames: 449884160. Throughput: 0: 5121.9. Samples: 449880214. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:23,906][25689] Avg episode reward: [(0, '-44.879')] [2022-07-09 22:03:25,297][26022] Updated weights on worker 0-0, policy_version 439347 (0.00091) [2022-07-09 22:03:26,875][26022] Updated weights on worker 0-0, policy_version 439357 (0.00093) [2022-07-09 22:03:28,943][25689] Fps is (10 sec: 5589.3, 60 sec: 5589.8, 300 sec: 5641.7). Total num frames: 449910784. Throughput: 0: 5982.9. Samples: 449914462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:28,944][25689] Avg episode reward: [(0, '-44.847')] [2022-07-09 22:03:28,972][26022] Updated weights on worker 0-0, policy_version 439367 (0.00082) [2022-07-09 22:03:30,514][26022] Updated weights on worker 0-0, policy_version 439377 (0.00087) [2022-07-09 22:03:32,563][26022] Updated weights on worker 0-0, policy_version 439387 (0.00082) [2022-07-09 22:03:34,032][25689] Fps is (10 sec: 5662.5, 60 sec: 5636.5, 300 sec: 5651.8). Total num frames: 449941504. Throughput: 0: 5944.0. Samples: 449948436. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:34,033][25689] Avg episode reward: [(0, '-44.930')] [2022-07-09 22:03:34,191][26022] Updated weights on worker 0-0, policy_version 439397 (0.00090) [2022-07-09 22:03:35,559][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:03:35,572][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000439404_449949696.pth [2022-07-09 22:03:35,573][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000437415_447912960.pth [2022-07-09 22:03:35,984][26022] Updated weights on worker 0-0, policy_version 439407 (0.00089) [2022-07-09 22:03:37,902][26022] Updated weights on worker 0-0, policy_version 439417 (0.00084) [2022-07-09 22:03:39,042][25689] Fps is (10 sec: 5779.6, 60 sec: 5636.8, 300 sec: 5646.1). Total num frames: 449969152. Throughput: 0: 5090.4. Samples: 449965644. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:39,044][25689] Avg episode reward: [(0, '-44.570')] [2022-07-09 22:03:39,589][26022] Updated weights on worker 0-0, policy_version 439427 (0.00082) [2022-07-09 22:03:41,523][26022] Updated weights on worker 0-0, policy_version 439437 (0.00082) [2022-07-09 22:03:43,436][26022] Updated weights on worker 0-0, policy_version 439447 (0.00089) [2022-07-09 22:03:44,075][25689] Fps is (10 sec: 5506.0, 60 sec: 5600.3, 300 sec: 5636.4). Total num frames: 449996800. Throughput: 0: 5903.3. Samples: 449999282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:44,076][25689] Avg episode reward: [(0, '-43.527')] [2022-07-09 22:03:44,950][26022] Updated weights on worker 0-0, policy_version 439457 (0.00096) [2022-07-09 22:03:47,125][26022] Updated weights on worker 0-0, policy_version 439467 (0.00088) [2022-07-09 22:03:48,735][26022] Updated weights on worker 0-0, policy_version 439477 (0.00084) [2022-07-09 22:03:49,149][25689] Fps is (10 sec: 5673.9, 60 sec: 5666.4, 300 sec: 5646.7). Total num frames: 450026496. Throughput: 0: 5879.1. Samples: 450033252. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:49,149][25689] Avg episode reward: [(0, '-43.927')] [2022-07-09 22:03:50,720][26022] Updated weights on worker 0-0, policy_version 439487 (0.00089) [2022-07-09 22:03:52,354][26022] Updated weights on worker 0-0, policy_version 439497 (0.00082) [2022-07-09 22:03:54,223][25689] Fps is (10 sec: 5651.0, 60 sec: 5634.4, 300 sec: 5639.6). Total num frames: 450054144. Throughput: 0: 5030.4. Samples: 450050002. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:54,223][25689] Avg episode reward: [(0, '-44.712')] [2022-07-09 22:03:54,283][26022] Updated weights on worker 0-0, policy_version 439507 (0.00087) [2022-07-09 22:03:55,917][26022] Updated weights on worker 0-0, policy_version 439517 (0.00083) [2022-07-09 22:03:58,088][26022] Updated weights on worker 0-0, policy_version 439527 (0.00093) [2022-07-09 22:03:59,287][25689] Fps is (10 sec: 5655.9, 60 sec: 5648.2, 300 sec: 5645.4). Total num frames: 450083840. Throughput: 0: 5836.6. Samples: 450083808. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:03:59,288][25689] Avg episode reward: [(0, '-44.377')] [2022-07-09 22:03:59,684][26022] Updated weights on worker 0-0, policy_version 439537 (0.00093) [2022-07-09 22:04:01,644][26022] Updated weights on worker 0-0, policy_version 439547 (0.00093) [2022-07-09 22:04:03,587][26022] Updated weights on worker 0-0, policy_version 439557 (0.00085) [2022-07-09 22:04:04,377][25689] Fps is (10 sec: 5445.7, 60 sec: 5626.5, 300 sec: 5639.8). Total num frames: 450109440. Throughput: 0: 5734.1. Samples: 450115694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:04:04,377][25689] Avg episode reward: [(0, '-44.492')] [2022-07-09 22:04:05,563][26022] Updated weights on worker 0-0, policy_version 439567 (0.00884) [2022-07-09 22:04:07,356][26022] Updated weights on worker 0-0, policy_version 439577 (0.00081) [2022-07-09 22:04:09,149][26022] Updated weights on worker 0-0, policy_version 439587 (0.00090) [2022-07-09 22:04:09,481][25689] Fps is (10 sec: 5424.7, 60 sec: 5657.0, 300 sec: 5649.0). Total num frames: 450139136. Throughput: 0: 4885.9. Samples: 450132592. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:04:09,481][25689] Avg episode reward: [(0, '-44.225')] [2022-07-09 22:04:10,720][26022] Updated weights on worker 0-0, policy_version 439597 (0.00089) [2022-07-09 22:04:12,875][26022] Updated weights on worker 0-0, policy_version 439607 (0.00085) [2022-07-09 22:04:14,399][26022] Updated weights on worker 0-0, policy_version 439617 (0.00090) [2022-07-09 22:04:14,486][25689] Fps is (10 sec: 5774.0, 60 sec: 5624.5, 300 sec: 5639.1). Total num frames: 450167808. Throughput: 0: 5761.0. Samples: 450166736. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:04:14,486][25689] Avg episode reward: [(0, '-43.825')] [2022-07-09 22:04:16,508][26022] Updated weights on worker 0-0, policy_version 439627 (0.00080) [2022-07-09 22:04:18,140][26022] Updated weights on worker 0-0, policy_version 439637 (0.00096) [2022-07-09 22:04:19,528][25689] Fps is (10 sec: 5707.6, 60 sec: 5638.2, 300 sec: 5645.5). Total num frames: 450196480. Throughput: 0: 5796.5. Samples: 450201132. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:04:19,528][25689] Avg episode reward: [(0, '-44.314')] [2022-07-09 22:04:19,891][26022] Updated weights on worker 0-0, policy_version 439647 (0.00086) [2022-07-09 22:04:21,866][26022] Updated weights on worker 0-0, policy_version 439657 (0.00089) [2022-07-09 22:04:23,464][26022] Updated weights on worker 0-0, policy_version 439667 (0.00094) [2022-07-09 22:04:24,563][25689] Fps is (10 sec: 5588.6, 60 sec: 5604.7, 300 sec: 5639.3). Total num frames: 450224128. Throughput: 0: 5086.7. Samples: 450218376. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:04:24,564][25689] Avg episode reward: [(0, '-44.527')] [2022-07-09 22:04:25,292][26022] Updated weights on worker 0-0, policy_version 439677 (0.00106) [2022-07-09 22:04:27,068][26022] Updated weights on worker 0-0, policy_version 439687 (0.00092) [2022-07-09 22:04:28,943][26022] Updated weights on worker 0-0, policy_version 439697 (0.00086) [2022-07-09 22:04:29,652][25689] Fps is (10 sec: 5663.9, 60 sec: 5650.6, 300 sec: 5642.1). Total num frames: 450253824. Throughput: 0: 5948.7. Samples: 450252586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:04:29,653][25689] Avg episode reward: [(0, '-44.850')] [2022-07-09 22:04:30,589][26022] Updated weights on worker 0-0, policy_version 439707 (0.00087) [2022-07-09 22:04:32,617][26022] Updated weights on worker 0-0, policy_version 439717 (0.00093) [2022-07-09 22:04:34,275][26022] Updated weights on worker 0-0, policy_version 439727 (0.00079) [2022-07-09 22:04:34,700][25689] Fps is (10 sec: 5657.2, 60 sec: 5603.8, 300 sec: 5645.6). Total num frames: 450281472. Throughput: 0: 5942.7. Samples: 450286862. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:04:34,702][25689] Avg episode reward: [(0, '-45.852')] [2022-07-09 22:04:36,180][26022] Updated weights on worker 0-0, policy_version 439737 (0.00091) [2022-07-09 22:04:38,035][26022] Updated weights on worker 0-0, policy_version 439747 (0.00088) [2022-07-09 22:04:39,691][26022] Updated weights on worker 0-0, policy_version 439757 (0.00095) [2022-07-09 22:04:39,766][25689] Fps is (10 sec: 5770.9, 60 sec: 5649.1, 300 sec: 5642.3). Total num frames: 450312192. Throughput: 0: 5077.3. Samples: 450303890. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:04:39,767][25689] Avg episode reward: [(0, '-46.663')] [2022-07-09 22:04:41,635][26022] Updated weights on worker 0-0, policy_version 439767 (0.00087) [2022-07-09 22:04:43,170][26022] Updated weights on worker 0-0, policy_version 439777 (0.00083) [2022-07-09 22:04:44,768][25689] Fps is (10 sec: 5695.5, 60 sec: 5635.2, 300 sec: 5639.8). Total num frames: 450338816. Throughput: 0: 5922.5. Samples: 450338038. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:04:44,768][25689] Avg episode reward: [(0, '-45.963')] [2022-07-09 22:04:45,169][26022] Updated weights on worker 0-0, policy_version 439787 (0.00083) [2022-07-09 22:04:46,823][26022] Updated weights on worker 0-0, policy_version 439797 (0.00086) [2022-07-09 22:04:48,587][26022] Updated weights on worker 0-0, policy_version 439807 (0.00091) [2022-07-09 22:04:49,819][25689] Fps is (10 sec: 5500.5, 60 sec: 5620.4, 300 sec: 5635.4). Total num frames: 450367488. Throughput: 0: 5938.6. Samples: 450372350. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:04:49,820][25689] Avg episode reward: [(0, '-45.675')] [2022-07-09 22:04:50,473][26022] Updated weights on worker 0-0, policy_version 439817 (0.00093) [2022-07-09 22:04:52,163][26022] Updated weights on worker 0-0, policy_version 439827 (0.00092) [2022-07-09 22:04:54,109][26022] Updated weights on worker 0-0, policy_version 439837 (0.00090) [2022-07-09 22:04:54,839][25689] Fps is (10 sec: 5897.3, 60 sec: 5676.1, 300 sec: 5645.9). Total num frames: 450398208. Throughput: 0: 5105.9. Samples: 450389692. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:04:54,840][25689] Avg episode reward: [(0, '-46.177')] [2022-07-09 22:04:55,927][26022] Updated weights on worker 0-0, policy_version 439847 (0.00094) [2022-07-09 22:04:57,699][26022] Updated weights on worker 0-0, policy_version 439857 (0.00079) [2022-07-09 22:04:59,457][26022] Updated weights on worker 0-0, policy_version 439867 (0.00331) [2022-07-09 22:04:59,905][25689] Fps is (10 sec: 5685.9, 60 sec: 5625.3, 300 sec: 5638.8). Total num frames: 450424832. Throughput: 0: 5962.9. Samples: 450423972. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:04:59,905][25689] Avg episode reward: [(0, '-45.463')] [2022-07-09 22:05:01,228][26022] Updated weights on worker 0-0, policy_version 439877 (0.00088) [2022-07-09 22:05:03,378][26022] Updated weights on worker 0-0, policy_version 439887 (0.00094) [2022-07-09 22:05:04,910][25689] Fps is (10 sec: 5287.5, 60 sec: 5650.1, 300 sec: 5640.6). Total num frames: 450451456. Throughput: 0: 5853.8. Samples: 450455944. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:04,910][25689] Avg episode reward: [(0, '-45.525')] [2022-07-09 22:05:05,367][26022] Updated weights on worker 0-0, policy_version 439897 (0.00092) [2022-07-09 22:05:06,866][26022] Updated weights on worker 0-0, policy_version 439907 (0.00088) [2022-07-09 22:05:08,839][26022] Updated weights on worker 0-0, policy_version 439917 (0.00084) [2022-07-09 22:05:10,066][25689] Fps is (10 sec: 5643.2, 60 sec: 5662.1, 300 sec: 5641.9). Total num frames: 450482176. Throughput: 0: 5839.6. Samples: 450490584. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:10,067][25689] Avg episode reward: [(0, '-45.175')] [2022-07-09 22:05:10,615][26022] Updated weights on worker 0-0, policy_version 439927 (0.00079) [2022-07-09 22:05:12,249][26022] Updated weights on worker 0-0, policy_version 439937 (0.00088) [2022-07-09 22:05:14,164][26022] Updated weights on worker 0-0, policy_version 439947 (0.00087) [2022-07-09 22:05:15,099][25689] Fps is (10 sec: 5828.9, 60 sec: 5659.5, 300 sec: 5641.7). Total num frames: 450510848. Throughput: 0: 5834.9. Samples: 450507906. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:15,099][25689] Avg episode reward: [(0, '-44.548')] [2022-07-09 22:05:15,756][26022] Updated weights on worker 0-0, policy_version 439957 (0.00091) [2022-07-09 22:05:17,735][26022] Updated weights on worker 0-0, policy_version 439967 (0.00091) [2022-07-09 22:05:19,719][26022] Updated weights on worker 0-0, policy_version 439977 (0.00091) [2022-07-09 22:05:20,175][25689] Fps is (10 sec: 5672.9, 60 sec: 5656.3, 300 sec: 5637.6). Total num frames: 450539520. Throughput: 0: 5833.6. Samples: 450542220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:20,175][25689] Avg episode reward: [(0, '-45.066')] [2022-07-09 22:05:21,154][26022] Updated weights on worker 0-0, policy_version 439987 (0.00082) [2022-07-09 22:05:23,324][26022] Updated weights on worker 0-0, policy_version 439997 (0.00083) [2022-07-09 22:05:24,754][26022] Updated weights on worker 0-0, policy_version 440007 (0.00083) [2022-07-09 22:05:25,218][25689] Fps is (10 sec: 5768.2, 60 sec: 5689.4, 300 sec: 5649.2). Total num frames: 450569216. Throughput: 0: 5947.6. Samples: 450576730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:25,218][25689] Avg episode reward: [(0, '-45.133')] [2022-07-09 22:05:26,806][26022] Updated weights on worker 0-0, policy_version 440017 (0.00094) [2022-07-09 22:05:28,558][26022] Updated weights on worker 0-0, policy_version 440027 (0.00086) [2022-07-09 22:05:30,267][25689] Fps is (10 sec: 5682.1, 60 sec: 5659.4, 300 sec: 5642.0). Total num frames: 450596864. Throughput: 0: 5102.1. Samples: 450593648. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:30,267][25689] Avg episode reward: [(0, '-44.489')] [2022-07-09 22:05:30,419][26022] Updated weights on worker 0-0, policy_version 440037 (0.00609) [2022-07-09 22:05:32,054][26022] Updated weights on worker 0-0, policy_version 440047 (0.00079) [2022-07-09 22:05:33,847][26022] Updated weights on worker 0-0, policy_version 440057 (0.00090) [2022-07-09 22:05:35,278][25689] Fps is (10 sec: 5496.4, 60 sec: 5662.7, 300 sec: 5639.0). Total num frames: 450624512. Throughput: 0: 5950.4. Samples: 450627982. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:35,279][25689] Avg episode reward: [(0, '-43.627')] [2022-07-09 22:05:35,581][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:05:35,611][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000440067_450628608.pth [2022-07-09 22:05:35,611][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000438080_448593920.pth [2022-07-09 22:05:35,613][26022] Updated weights on worker 0-0, policy_version 440067 (0.00091) [2022-07-09 22:05:37,505][26022] Updated weights on worker 0-0, policy_version 440077 (0.00087) [2022-07-09 22:05:39,331][26022] Updated weights on worker 0-0, policy_version 440087 (0.00093) [2022-07-09 22:05:40,300][25689] Fps is (10 sec: 5715.1, 60 sec: 5650.0, 300 sec: 5642.7). Total num frames: 450654208. Throughput: 0: 5960.4. Samples: 450662178. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:40,301][25689] Avg episode reward: [(0, '-44.527')] [2022-07-09 22:05:41,061][26022] Updated weights on worker 0-0, policy_version 440097 (0.00261) [2022-07-09 22:05:42,792][26022] Updated weights on worker 0-0, policy_version 440107 (0.00090) [2022-07-09 22:05:44,803][26022] Updated weights on worker 0-0, policy_version 440117 (0.00084) [2022-07-09 22:05:45,337][25689] Fps is (10 sec: 5802.6, 60 sec: 5680.5, 300 sec: 5646.6). Total num frames: 450682880. Throughput: 0: 5100.3. Samples: 450679344. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:45,338][25689] Avg episode reward: [(0, '-45.036')] [2022-07-09 22:05:46,597][26022] Updated weights on worker 0-0, policy_version 440127 (0.00104) [2022-07-09 22:05:48,183][26022] Updated weights on worker 0-0, policy_version 440137 (0.00088) [2022-07-09 22:05:50,164][26022] Updated weights on worker 0-0, policy_version 440147 (0.00090) [2022-07-09 22:05:50,402][25689] Fps is (10 sec: 5676.9, 60 sec: 5679.3, 300 sec: 5646.3). Total num frames: 450711552. Throughput: 0: 5947.8. Samples: 450713408. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:50,402][25689] Avg episode reward: [(0, '-45.294')] [2022-07-09 22:05:51,879][26022] Updated weights on worker 0-0, policy_version 440157 (0.00102) [2022-07-09 22:05:53,696][26022] Updated weights on worker 0-0, policy_version 440167 (0.00091) [2022-07-09 22:05:55,436][25689] Fps is (10 sec: 5577.2, 60 sec: 5627.3, 300 sec: 5642.7). Total num frames: 450739200. Throughput: 0: 5915.0. Samples: 450747212. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:05:55,437][25689] Avg episode reward: [(0, '-44.770')] [2022-07-09 22:05:55,723][26022] Updated weights on worker 0-0, policy_version 440177 (0.00084) [2022-07-09 22:05:57,357][26022] Updated weights on worker 0-0, policy_version 440187 (0.00095) [2022-07-09 22:05:59,360][26022] Updated weights on worker 0-0, policy_version 440197 (0.00092) [2022-07-09 22:06:00,448][25689] Fps is (10 sec: 5708.0, 60 sec: 5682.9, 300 sec: 5657.7). Total num frames: 450768896. Throughput: 0: 5066.9. Samples: 450764264. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:00,448][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 22:06:01,131][26022] Updated weights on worker 0-0, policy_version 440207 (0.00093) [2022-07-09 22:06:03,376][26022] Updated weights on worker 0-0, policy_version 440217 (0.00092) [2022-07-09 22:06:05,007][26022] Updated weights on worker 0-0, policy_version 440227 (0.00092) [2022-07-09 22:06:05,469][25689] Fps is (10 sec: 5511.2, 60 sec: 5664.5, 300 sec: 5645.3). Total num frames: 450794496. Throughput: 0: 5802.7. Samples: 450796162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:05,469][25689] Avg episode reward: [(0, '-46.703')] [2022-07-09 22:06:06,780][26022] Updated weights on worker 0-0, policy_version 440237 (0.00088) [2022-07-09 22:06:08,658][26022] Updated weights on worker 0-0, policy_version 440247 (0.00084) [2022-07-09 22:06:10,399][26022] Updated weights on worker 0-0, policy_version 440257 (0.00084) [2022-07-09 22:06:10,546][25689] Fps is (10 sec: 5475.9, 60 sec: 5655.1, 300 sec: 5647.4). Total num frames: 450824192. Throughput: 0: 5788.6. Samples: 450830014. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:10,546][25689] Avg episode reward: [(0, '-46.315')] [2022-07-09 22:06:12,343][26022] Updated weights on worker 0-0, policy_version 440267 (0.00081) [2022-07-09 22:06:13,979][26022] Updated weights on worker 0-0, policy_version 440277 (0.00104) [2022-07-09 22:06:15,557][25689] Fps is (10 sec: 5684.1, 60 sec: 5640.1, 300 sec: 5647.5). Total num frames: 450851840. Throughput: 0: 4971.7. Samples: 450847252. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:15,558][25689] Avg episode reward: [(0, '-45.575')] [2022-07-09 22:06:15,897][26022] Updated weights on worker 0-0, policy_version 440287 (0.00084) [2022-07-09 22:06:17,693][26022] Updated weights on worker 0-0, policy_version 440297 (0.00096) [2022-07-09 22:06:19,372][26022] Updated weights on worker 0-0, policy_version 440307 (0.00085) [2022-07-09 22:06:20,573][25689] Fps is (10 sec: 5616.6, 60 sec: 5645.7, 300 sec: 5644.0). Total num frames: 450880512. Throughput: 0: 5816.1. Samples: 450881316. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:20,574][25689] Avg episode reward: [(0, '-45.805')] [2022-07-09 22:06:21,357][26022] Updated weights on worker 0-0, policy_version 440317 (0.00086) [2022-07-09 22:06:22,991][26022] Updated weights on worker 0-0, policy_version 440327 (0.00092) [2022-07-09 22:06:24,752][26022] Updated weights on worker 0-0, policy_version 440337 (0.00081) [2022-07-09 22:06:25,585][25689] Fps is (10 sec: 5718.5, 60 sec: 5631.7, 300 sec: 5645.1). Total num frames: 450909184. Throughput: 0: 5942.2. Samples: 450915698. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:25,586][25689] Avg episode reward: [(0, '-45.091')] [2022-07-09 22:06:26,855][26022] Updated weights on worker 0-0, policy_version 440347 (0.00098) [2022-07-09 22:06:28,370][26022] Updated weights on worker 0-0, policy_version 440357 (0.00096) [2022-07-09 22:06:30,234][26022] Updated weights on worker 0-0, policy_version 440367 (0.00086) [2022-07-09 22:06:30,626][25689] Fps is (10 sec: 5806.0, 60 sec: 5666.3, 300 sec: 5648.2). Total num frames: 450938880. Throughput: 0: 5110.8. Samples: 450932642. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:30,627][25689] Avg episode reward: [(0, '-44.881')] [2022-07-09 22:06:32,004][26022] Updated weights on worker 0-0, policy_version 440377 (0.00093) [2022-07-09 22:06:33,715][26022] Updated weights on worker 0-0, policy_version 440387 (0.00094) [2022-07-09 22:06:35,633][25689] Fps is (10 sec: 5605.0, 60 sec: 5649.8, 300 sec: 5641.7). Total num frames: 450965504. Throughput: 0: 5966.9. Samples: 450967042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:35,634][25689] Avg episode reward: [(0, '-45.702')] [2022-07-09 22:06:35,682][26022] Updated weights on worker 0-0, policy_version 440397 (0.00093) [2022-07-09 22:06:37,312][26022] Updated weights on worker 0-0, policy_version 440407 (0.00086) [2022-07-09 22:06:39,213][26022] Updated weights on worker 0-0, policy_version 440417 (0.00112) [2022-07-09 22:06:40,647][25689] Fps is (10 sec: 5518.3, 60 sec: 5633.6, 300 sec: 5645.3). Total num frames: 450994176. Throughput: 0: 5979.6. Samples: 451001348. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:40,647][25689] Avg episode reward: [(0, '-45.948')] [2022-07-09 22:06:40,874][26022] Updated weights on worker 0-0, policy_version 440427 (0.00084) [2022-07-09 22:06:42,692][26022] Updated weights on worker 0-0, policy_version 440437 (0.00099) [2022-07-09 22:06:44,700][26022] Updated weights on worker 0-0, policy_version 440447 (0.00088) [2022-07-09 22:06:45,651][25689] Fps is (10 sec: 5826.2, 60 sec: 5653.6, 300 sec: 5646.8). Total num frames: 451023872. Throughput: 0: 5122.1. Samples: 451018480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:45,653][25689] Avg episode reward: [(0, '-45.345')] [2022-07-09 22:06:46,436][26022] Updated weights on worker 0-0, policy_version 440457 (0.00092) [2022-07-09 22:06:48,259][26022] Updated weights on worker 0-0, policy_version 440467 (0.00093) [2022-07-09 22:06:50,098][26022] Updated weights on worker 0-0, policy_version 440477 (0.00088) [2022-07-09 22:06:50,691][25689] Fps is (10 sec: 5709.4, 60 sec: 5639.0, 300 sec: 5643.4). Total num frames: 451051520. Throughput: 0: 5969.3. Samples: 451052412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:50,691][25689] Avg episode reward: [(0, '-44.745')] [2022-07-09 22:06:51,763][26022] Updated weights on worker 0-0, policy_version 440487 (0.00427) [2022-07-09 22:06:53,727][26022] Updated weights on worker 0-0, policy_version 440497 (0.00086) [2022-07-09 22:06:55,465][26022] Updated weights on worker 0-0, policy_version 440507 (0.00079) [2022-07-09 22:06:55,703][25689] Fps is (10 sec: 5603.3, 60 sec: 5658.0, 300 sec: 5645.0). Total num frames: 451080192. Throughput: 0: 5962.6. Samples: 451086708. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:06:55,704][25689] Avg episode reward: [(0, '-44.735')] [2022-07-09 22:06:57,125][26022] Updated weights on worker 0-0, policy_version 440517 (0.00083) [2022-07-09 22:06:59,243][26022] Updated weights on worker 0-0, policy_version 440527 (0.00054) [2022-07-09 22:07:00,707][25689] Fps is (10 sec: 5725.2, 60 sec: 5641.8, 300 sec: 5648.9). Total num frames: 451108864. Throughput: 0: 5114.1. Samples: 451103936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:07:00,708][25689] Avg episode reward: [(0, '-44.649')] [2022-07-09 22:07:00,784][26022] Updated weights on worker 0-0, policy_version 440537 (0.00087) [2022-07-09 22:07:03,013][26022] Updated weights on worker 0-0, policy_version 440547 (0.00088) [2022-07-09 22:07:04,799][26022] Updated weights on worker 0-0, policy_version 440557 (0.00093) [2022-07-09 22:07:05,722][25689] Fps is (10 sec: 5416.5, 60 sec: 5642.3, 300 sec: 5636.9). Total num frames: 451134464. Throughput: 0: 5867.4. Samples: 451136244. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:07:05,724][25689] Avg episode reward: [(0, '-44.869')] [2022-07-09 22:07:06,413][26022] Updated weights on worker 0-0, policy_version 440567 (0.00084) [2022-07-09 22:07:08,515][26022] Updated weights on worker 0-0, policy_version 440577 (0.00094) [2022-07-09 22:07:09,962][26022] Updated weights on worker 0-0, policy_version 440587 (0.00086) [2022-07-09 22:07:10,780][25689] Fps is (10 sec: 5489.4, 60 sec: 5644.1, 300 sec: 5646.5). Total num frames: 451164160. Throughput: 0: 5876.5. Samples: 451170466. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:07:10,782][25689] Avg episode reward: [(0, '-44.549')] [2022-07-09 22:07:11,845][26022] Updated weights on worker 0-0, policy_version 440597 (0.00082) [2022-07-09 22:07:13,720][26022] Updated weights on worker 0-0, policy_version 440607 (0.00092) [2022-07-09 22:07:15,485][26022] Updated weights on worker 0-0, policy_version 440617 (0.00096) [2022-07-09 22:07:15,785][25689] Fps is (10 sec: 5800.5, 60 sec: 5661.7, 300 sec: 5647.0). Total num frames: 451192832. Throughput: 0: 5024.8. Samples: 451187618. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 22:07:15,787][25689] Avg episode reward: [(0, '-44.158')] [2022-07-09 22:07:17,378][26022] Updated weights on worker 0-0, policy_version 440627 (0.00091) [2022-07-09 22:07:19,146][26022] Updated weights on worker 0-0, policy_version 440637 (0.00092) [2022-07-09 22:07:20,788][25689] Fps is (10 sec: 5729.7, 60 sec: 5662.9, 300 sec: 5647.1). Total num frames: 451221504. Throughput: 0: 5876.3. Samples: 451221940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:20,789][25689] Avg episode reward: [(0, '-43.776')] [2022-07-09 22:07:20,842][26022] Updated weights on worker 0-0, policy_version 440647 (0.00086) [2022-07-09 22:07:22,880][26022] Updated weights on worker 0-0, policy_version 440657 (0.00088) [2022-07-09 22:07:24,563][26022] Updated weights on worker 0-0, policy_version 440667 (0.00092) [2022-07-09 22:07:25,790][25689] Fps is (10 sec: 5628.9, 60 sec: 5646.8, 300 sec: 5638.4). Total num frames: 451249152. Throughput: 0: 5977.2. Samples: 451256196. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:25,792][25689] Avg episode reward: [(0, '-43.138')] [2022-07-09 22:07:26,462][26022] Updated weights on worker 0-0, policy_version 440677 (0.00085) [2022-07-09 22:07:28,133][26022] Updated weights on worker 0-0, policy_version 440687 (0.00102) [2022-07-09 22:07:29,901][26022] Updated weights on worker 0-0, policy_version 440697 (0.00095) [2022-07-09 22:07:30,923][25689] Fps is (10 sec: 5658.3, 60 sec: 5638.3, 300 sec: 5643.6). Total num frames: 451278848. Throughput: 0: 5092.6. Samples: 451273044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:30,924][25689] Avg episode reward: [(0, '-43.264')] [2022-07-09 22:07:31,899][26022] Updated weights on worker 0-0, policy_version 440707 (0.00086) [2022-07-09 22:07:33,585][26022] Updated weights on worker 0-0, policy_version 440717 (0.00091) [2022-07-09 22:07:35,325][26022] Updated weights on worker 0-0, policy_version 440727 (0.00089) [2022-07-09 22:07:35,819][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:07:35,835][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000440729_451306496.pth [2022-07-09 22:07:35,835][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000438742_449271808.pth [2022-07-09 22:07:35,934][25689] Fps is (10 sec: 5754.0, 60 sec: 5671.8, 300 sec: 5647.1). Total num frames: 451307520. Throughput: 0: 5930.4. Samples: 451307112. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:35,935][25689] Avg episode reward: [(0, '-42.851')] [2022-07-09 22:07:37,332][26022] Updated weights on worker 0-0, policy_version 440737 (0.00087) [2022-07-09 22:07:39,084][26022] Updated weights on worker 0-0, policy_version 440747 (0.00089) [2022-07-09 22:07:40,821][26022] Updated weights on worker 0-0, policy_version 440757 (0.00089) [2022-07-09 22:07:40,952][25689] Fps is (10 sec: 5717.8, 60 sec: 5671.5, 300 sec: 5643.4). Total num frames: 451336192. Throughput: 0: 5909.4. Samples: 451341094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:40,952][25689] Avg episode reward: [(0, '-42.880')] [2022-07-09 22:07:42,580][26022] Updated weights on worker 0-0, policy_version 440767 (0.00090) [2022-07-09 22:07:44,151][26022] Updated weights on worker 0-0, policy_version 440777 (0.00082) [2022-07-09 22:07:46,000][25689] Fps is (10 sec: 5595.6, 60 sec: 5633.4, 300 sec: 5650.4). Total num frames: 451363840. Throughput: 0: 5053.8. Samples: 451358328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:46,001][25689] Avg episode reward: [(0, '-42.849')] [2022-07-09 22:07:46,298][26022] Updated weights on worker 0-0, policy_version 440787 (0.00090) [2022-07-09 22:07:47,872][26022] Updated weights on worker 0-0, policy_version 440797 (0.00084) [2022-07-09 22:07:49,880][26022] Updated weights on worker 0-0, policy_version 440807 (0.00094) [2022-07-09 22:07:51,076][25689] Fps is (10 sec: 5765.2, 60 sec: 5680.9, 300 sec: 5654.2). Total num frames: 451394560. Throughput: 0: 5940.3. Samples: 451392760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:51,077][25689] Avg episode reward: [(0, '-43.271')] [2022-07-09 22:07:51,487][26022] Updated weights on worker 0-0, policy_version 440817 (0.00088) [2022-07-09 22:07:53,358][26022] Updated weights on worker 0-0, policy_version 440827 (0.00085) [2022-07-09 22:07:55,082][26022] Updated weights on worker 0-0, policy_version 440837 (0.00088) [2022-07-09 22:07:56,139][25689] Fps is (10 sec: 5756.6, 60 sec: 5659.1, 300 sec: 5650.2). Total num frames: 451422208. Throughput: 0: 5950.8. Samples: 451427344. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:07:56,140][25689] Avg episode reward: [(0, '-43.500')] [2022-07-09 22:07:57,100][26022] Updated weights on worker 0-0, policy_version 440847 (0.00085) [2022-07-09 22:07:58,663][26022] Updated weights on worker 0-0, policy_version 440857 (0.00091) [2022-07-09 22:08:00,679][26022] Updated weights on worker 0-0, policy_version 440867 (0.00097) [2022-07-09 22:08:01,193][25689] Fps is (10 sec: 5566.5, 60 sec: 5654.4, 300 sec: 5656.7). Total num frames: 451450880. Throughput: 0: 5093.5. Samples: 451444196. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:01,194][25689] Avg episode reward: [(0, '-44.012')] [2022-07-09 22:08:02,651][26022] Updated weights on worker 0-0, policy_version 440877 (0.00089) [2022-07-09 22:08:04,554][26022] Updated weights on worker 0-0, policy_version 440887 (0.00084) [2022-07-09 22:08:06,196][25689] Fps is (10 sec: 5498.0, 60 sec: 5672.5, 300 sec: 5654.5). Total num frames: 451477504. Throughput: 0: 5838.3. Samples: 451476242. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:06,198][25689] Avg episode reward: [(0, '-43.790')] [2022-07-09 22:08:06,394][26022] Updated weights on worker 0-0, policy_version 440897 (0.00093) [2022-07-09 22:08:08,183][26022] Updated weights on worker 0-0, policy_version 440907 (0.00082) [2022-07-09 22:08:10,065][26022] Updated weights on worker 0-0, policy_version 440917 (0.00081) [2022-07-09 22:08:11,243][25689] Fps is (10 sec: 5502.5, 60 sec: 5656.6, 300 sec: 5647.1). Total num frames: 451506176. Throughput: 0: 5833.8. Samples: 451510410. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:11,244][25689] Avg episode reward: [(0, '-45.091')] [2022-07-09 22:08:11,714][26022] Updated weights on worker 0-0, policy_version 440927 (0.00094) [2022-07-09 22:08:13,630][26022] Updated weights on worker 0-0, policy_version 440937 (0.00101) [2022-07-09 22:08:15,268][26022] Updated weights on worker 0-0, policy_version 440947 (0.00087) [2022-07-09 22:08:16,268][25689] Fps is (10 sec: 5592.1, 60 sec: 5637.8, 300 sec: 5646.8). Total num frames: 451533824. Throughput: 0: 4984.6. Samples: 451527678. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:16,269][25689] Avg episode reward: [(0, '-46.946')] [2022-07-09 22:08:17,178][26022] Updated weights on worker 0-0, policy_version 440957 (0.00090) [2022-07-09 22:08:18,909][26022] Updated weights on worker 0-0, policy_version 440967 (0.00088) [2022-07-09 22:08:20,936][26022] Updated weights on worker 0-0, policy_version 440977 (0.00090) [2022-07-09 22:08:21,298][25689] Fps is (10 sec: 5601.2, 60 sec: 5635.3, 300 sec: 5643.5). Total num frames: 451562496. Throughput: 0: 5843.4. Samples: 451561672. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:21,299][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 22:08:22,490][26022] Updated weights on worker 0-0, policy_version 440987 (0.00086) [2022-07-09 22:08:24,399][26022] Updated weights on worker 0-0, policy_version 440997 (0.00090) [2022-07-09 22:08:25,883][26022] Updated weights on worker 0-0, policy_version 441007 (0.00096) [2022-07-09 22:08:26,327][25689] Fps is (10 sec: 5802.2, 60 sec: 5666.6, 300 sec: 5654.0). Total num frames: 451592192. Throughput: 0: 5962.9. Samples: 451596282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:26,328][25689] Avg episode reward: [(0, '-47.235')] [2022-07-09 22:08:27,971][26022] Updated weights on worker 0-0, policy_version 441017 (0.00066) [2022-07-09 22:08:29,853][26022] Updated weights on worker 0-0, policy_version 441027 (0.00089) [2022-07-09 22:08:31,383][25689] Fps is (10 sec: 5787.2, 60 sec: 5656.8, 300 sec: 5647.7). Total num frames: 451620864. Throughput: 0: 5104.1. Samples: 451613210. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:31,384][25689] Avg episode reward: [(0, '-46.170')] [2022-07-09 22:08:31,668][26022] Updated weights on worker 0-0, policy_version 441037 (0.00086) [2022-07-09 22:08:33,256][26022] Updated weights on worker 0-0, policy_version 441047 (0.00088) [2022-07-09 22:08:34,984][26022] Updated weights on worker 0-0, policy_version 441057 (0.00083) [2022-07-09 22:08:36,423][25689] Fps is (10 sec: 5680.0, 60 sec: 5654.2, 300 sec: 5650.6). Total num frames: 451649536. Throughput: 0: 5952.0. Samples: 451647644. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:36,425][25689] Avg episode reward: [(0, '-46.575')] [2022-07-09 22:08:36,863][26022] Updated weights on worker 0-0, policy_version 441067 (0.00094) [2022-07-09 22:08:38,706][26022] Updated weights on worker 0-0, policy_version 441077 (0.00095) [2022-07-09 22:08:40,481][26022] Updated weights on worker 0-0, policy_version 441087 (0.00084) [2022-07-09 22:08:41,446][25689] Fps is (10 sec: 5596.7, 60 sec: 5636.7, 300 sec: 5650.8). Total num frames: 451677184. Throughput: 0: 5958.8. Samples: 451681734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:41,447][25689] Avg episode reward: [(0, '-46.567')] [2022-07-09 22:08:42,184][26022] Updated weights on worker 0-0, policy_version 441097 (0.00092) [2022-07-09 22:08:44,103][26022] Updated weights on worker 0-0, policy_version 441107 (0.00087) [2022-07-09 22:08:45,883][26022] Updated weights on worker 0-0, policy_version 441117 (0.00086) [2022-07-09 22:08:46,475][25689] Fps is (10 sec: 5602.8, 60 sec: 5655.4, 300 sec: 5648.2). Total num frames: 451705856. Throughput: 0: 5953.8. Samples: 451716238. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:46,476][25689] Avg episode reward: [(0, '-45.323')] [2022-07-09 22:08:47,712][26022] Updated weights on worker 0-0, policy_version 441127 (0.00091) [2022-07-09 22:08:49,500][26022] Updated weights on worker 0-0, policy_version 441137 (0.00085) [2022-07-09 22:08:51,176][26022] Updated weights on worker 0-0, policy_version 441147 (0.00090) [2022-07-09 22:08:51,536][25689] Fps is (10 sec: 5683.5, 60 sec: 5623.0, 300 sec: 5651.9). Total num frames: 451734528. Throughput: 0: 5949.8. Samples: 451733114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:51,536][25689] Avg episode reward: [(0, '-45.187')] [2022-07-09 22:08:53,259][26022] Updated weights on worker 0-0, policy_version 441157 (0.00087) [2022-07-09 22:08:55,071][26022] Updated weights on worker 0-0, policy_version 441167 (0.00084) [2022-07-09 22:08:56,561][25689] Fps is (10 sec: 5685.4, 60 sec: 5643.4, 300 sec: 5649.2). Total num frames: 451763200. Throughput: 0: 5924.1. Samples: 451766946. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:08:56,562][25689] Avg episode reward: [(0, '-45.588')] [2022-07-09 22:08:56,765][26022] Updated weights on worker 0-0, policy_version 441177 (0.00348) [2022-07-09 22:08:58,611][26022] Updated weights on worker 0-0, policy_version 441187 (0.00096) [2022-07-09 22:09:00,424][26022] Updated weights on worker 0-0, policy_version 441197 (0.00093) [2022-07-09 22:09:01,591][25689] Fps is (10 sec: 5703.2, 60 sec: 5645.8, 300 sec: 5660.7). Total num frames: 451791872. Throughput: 0: 5906.4. Samples: 451800714. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:01,591][25689] Avg episode reward: [(0, '-45.569')] [2022-07-09 22:09:02,673][26022] Updated weights on worker 0-0, policy_version 441207 (0.00092) [2022-07-09 22:09:04,463][26022] Updated weights on worker 0-0, policy_version 441217 (0.00086) [2022-07-09 22:09:06,183][26022] Updated weights on worker 0-0, policy_version 441227 (0.00089) [2022-07-09 22:09:06,608][25689] Fps is (10 sec: 5402.0, 60 sec: 5627.5, 300 sec: 5648.5). Total num frames: 451817472. Throughput: 0: 4932.0. Samples: 451815532. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:06,609][25689] Avg episode reward: [(0, '-45.646')] [2022-07-09 22:09:08,092][26022] Updated weights on worker 0-0, policy_version 441237 (0.00092) [2022-07-09 22:09:09,898][26022] Updated weights on worker 0-0, policy_version 441247 (0.00080) [2022-07-09 22:09:11,467][26022] Updated weights on worker 0-0, policy_version 441257 (0.00087) [2022-07-09 22:09:11,742][25689] Fps is (10 sec: 5548.0, 60 sec: 5653.2, 300 sec: 5653.0). Total num frames: 451848192. Throughput: 0: 5762.4. Samples: 451849550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:11,743][25689] Avg episode reward: [(0, '-46.573')] [2022-07-09 22:09:13,611][26022] Updated weights on worker 0-0, policy_version 441267 (0.00087) [2022-07-09 22:09:15,068][26022] Updated weights on worker 0-0, policy_version 441277 (0.00086) [2022-07-09 22:09:16,790][25689] Fps is (10 sec: 5632.0, 60 sec: 5634.2, 300 sec: 5646.0). Total num frames: 451874816. Throughput: 0: 5790.9. Samples: 451884088. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:16,790][25689] Avg episode reward: [(0, '-45.931')] [2022-07-09 22:09:17,207][26022] Updated weights on worker 0-0, policy_version 441287 (0.00094) [2022-07-09 22:09:18,606][26022] Updated weights on worker 0-0, policy_version 441297 (0.00085) [2022-07-09 22:09:20,756][26022] Updated weights on worker 0-0, policy_version 441307 (0.00097) [2022-07-09 22:09:21,839][25689] Fps is (10 sec: 5679.3, 60 sec: 5666.2, 300 sec: 5656.0). Total num frames: 451905536. Throughput: 0: 4948.1. Samples: 451900906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:21,840][25689] Avg episode reward: [(0, '-46.561')] [2022-07-09 22:09:22,612][26022] Updated weights on worker 0-0, policy_version 441317 (0.00086) [2022-07-09 22:09:24,333][26022] Updated weights on worker 0-0, policy_version 441327 (0.00094) [2022-07-09 22:09:25,920][26022] Updated weights on worker 0-0, policy_version 441337 (0.00092) [2022-07-09 22:09:26,887][25689] Fps is (10 sec: 5679.2, 60 sec: 5613.8, 300 sec: 5646.5). Total num frames: 451932160. Throughput: 0: 5893.5. Samples: 451935046. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:26,888][25689] Avg episode reward: [(0, '-46.940')] [2022-07-09 22:09:27,971][26022] Updated weights on worker 0-0, policy_version 441347 (0.00096) [2022-07-09 22:09:29,743][26022] Updated weights on worker 0-0, policy_version 441357 (0.00057) [2022-07-09 22:09:31,607][26022] Updated weights on worker 0-0, policy_version 441367 (0.00074) [2022-07-09 22:09:32,001][25689] Fps is (10 sec: 5642.9, 60 sec: 5642.2, 300 sec: 5655.6). Total num frames: 451962880. Throughput: 0: 5890.5. Samples: 451968886. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:32,003][25689] Avg episode reward: [(0, '-46.871')] [2022-07-09 22:09:33,787][26022] Updated weights on worker 0-0, policy_version 441378 (0.00091) [2022-07-09 22:09:35,272][26022] Updated weights on worker 0-0, policy_version 441388 (0.00085) [2022-07-09 22:09:36,023][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:09:36,039][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000441392_451985408.pth [2022-07-09 22:09:36,043][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000439404_449949696.pth [2022-07-09 22:09:37,038][25689] Fps is (10 sec: 5649.0, 60 sec: 5608.6, 300 sec: 5642.4). Total num frames: 451989504. Throughput: 0: 5025.1. Samples: 451985838. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:37,039][25689] Avg episode reward: [(0, '-47.103')] [2022-07-09 22:09:37,296][26022] Updated weights on worker 0-0, policy_version 441398 (0.00088) [2022-07-09 22:09:38,976][26022] Updated weights on worker 0-0, policy_version 441408 (0.00092) [2022-07-09 22:09:40,948][26022] Updated weights on worker 0-0, policy_version 441418 (0.00089) [2022-07-09 22:09:42,070][25689] Fps is (10 sec: 5593.5, 60 sec: 5641.6, 300 sec: 5652.1). Total num frames: 452019200. Throughput: 0: 5882.6. Samples: 452019916. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:42,071][25689] Avg episode reward: [(0, '-46.336')] [2022-07-09 22:09:42,697][26022] Updated weights on worker 0-0, policy_version 441428 (0.00091) [2022-07-09 22:09:44,399][26022] Updated weights on worker 0-0, policy_version 441438 (0.00095) [2022-07-09 22:09:46,259][26022] Updated weights on worker 0-0, policy_version 441448 (0.00084) [2022-07-09 22:09:47,139][25689] Fps is (10 sec: 5677.3, 60 sec: 5621.0, 300 sec: 5648.3). Total num frames: 452046848. Throughput: 0: 5868.7. Samples: 452053898. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:47,141][25689] Avg episode reward: [(0, '-45.247')] [2022-07-09 22:09:48,140][26022] Updated weights on worker 0-0, policy_version 441458 (0.00088) [2022-07-09 22:09:49,864][26022] Updated weights on worker 0-0, policy_version 441468 (0.00085) [2022-07-09 22:09:51,710][26022] Updated weights on worker 0-0, policy_version 441478 (0.00096) [2022-07-09 22:09:52,205][25689] Fps is (10 sec: 5658.1, 60 sec: 5637.4, 300 sec: 5644.0). Total num frames: 452076544. Throughput: 0: 5041.9. Samples: 452070750. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:52,206][25689] Avg episode reward: [(0, '-45.304')] [2022-07-09 22:09:53,400][26022] Updated weights on worker 0-0, policy_version 441488 (0.00086) [2022-07-09 22:09:55,188][26022] Updated weights on worker 0-0, policy_version 441498 (0.00089) [2022-07-09 22:09:56,861][26022] Updated weights on worker 0-0, policy_version 441508 (0.00087) [2022-07-09 22:09:57,213][25689] Fps is (10 sec: 5793.8, 60 sec: 5639.0, 300 sec: 5652.0). Total num frames: 452105216. Throughput: 0: 5930.7. Samples: 452105490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:09:57,219][25689] Avg episode reward: [(0, '-44.910')] [2022-07-09 22:09:58,762][26022] Updated weights on worker 0-0, policy_version 441518 (0.00109) [2022-07-09 22:10:00,875][26022] Updated weights on worker 0-0, policy_version 441528 (0.00084) [2022-07-09 22:10:02,234][25689] Fps is (10 sec: 5513.4, 60 sec: 5606.0, 300 sec: 5651.7). Total num frames: 452131840. Throughput: 0: 5888.7. Samples: 452138658. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 22:10:02,235][25689] Avg episode reward: [(0, '-43.882')] [2022-07-09 22:10:02,866][26022] Updated weights on worker 0-0, policy_version 441538 (0.00096) [2022-07-09 22:10:04,590][26022] Updated weights on worker 0-0, policy_version 441548 (0.00094) [2022-07-09 22:10:06,243][26022] Updated weights on worker 0-0, policy_version 441558 (0.00089) [2022-07-09 22:10:07,260][25689] Fps is (10 sec: 5300.0, 60 sec: 5622.1, 300 sec: 5640.4). Total num frames: 452158464. Throughput: 0: 4986.7. Samples: 452154236. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:07,261][25689] Avg episode reward: [(0, '-43.814')] [2022-07-09 22:10:08,320][26022] Updated weights on worker 0-0, policy_version 441568 (0.00519) [2022-07-09 22:10:10,127][26022] Updated weights on worker 0-0, policy_version 441578 (0.00084) [2022-07-09 22:10:11,797][26022] Updated weights on worker 0-0, policy_version 441588 (0.00084) [2022-07-09 22:10:12,357][25689] Fps is (10 sec: 5665.2, 60 sec: 5625.6, 300 sec: 5646.1). Total num frames: 452189184. Throughput: 0: 5832.4. Samples: 452188282. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:12,357][25689] Avg episode reward: [(0, '-43.635')] [2022-07-09 22:10:13,587][26022] Updated weights on worker 0-0, policy_version 441598 (0.00085) [2022-07-09 22:10:15,378][26022] Updated weights on worker 0-0, policy_version 441608 (0.00089) [2022-07-09 22:10:17,336][26022] Updated weights on worker 0-0, policy_version 441618 (0.00087) [2022-07-09 22:10:17,407][25689] Fps is (10 sec: 5752.5, 60 sec: 5642.3, 300 sec: 5643.1). Total num frames: 452216832. Throughput: 0: 5804.4. Samples: 452222700. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:17,407][25689] Avg episode reward: [(0, '-45.041')] [2022-07-09 22:10:18,988][26022] Updated weights on worker 0-0, policy_version 441628 (0.00085) [2022-07-09 22:10:20,858][26022] Updated weights on worker 0-0, policy_version 441638 (0.00094) [2022-07-09 22:10:22,423][25689] Fps is (10 sec: 5696.7, 60 sec: 5628.5, 300 sec: 5643.6). Total num frames: 452246528. Throughput: 0: 4994.9. Samples: 452239496. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:22,424][25689] Avg episode reward: [(0, '-44.813')] [2022-07-09 22:10:22,586][26022] Updated weights on worker 0-0, policy_version 441648 (0.00597) [2022-07-09 22:10:24,510][26022] Updated weights on worker 0-0, policy_version 441658 (0.00087) [2022-07-09 22:10:26,186][26022] Updated weights on worker 0-0, policy_version 441668 (0.00091) [2022-07-09 22:10:27,441][25689] Fps is (10 sec: 5714.6, 60 sec: 5648.1, 300 sec: 5644.2). Total num frames: 452274176. Throughput: 0: 5932.0. Samples: 452273952. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:27,442][25689] Avg episode reward: [(0, '-46.069')] [2022-07-09 22:10:28,044][26022] Updated weights on worker 0-0, policy_version 441678 (0.00084) [2022-07-09 22:10:29,667][26022] Updated weights on worker 0-0, policy_version 441688 (0.00087) [2022-07-09 22:10:31,652][26022] Updated weights on worker 0-0, policy_version 441698 (0.00093) [2022-07-09 22:10:32,495][25689] Fps is (10 sec: 5591.5, 60 sec: 5619.9, 300 sec: 5646.8). Total num frames: 452302848. Throughput: 0: 5945.4. Samples: 452308014. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:32,496][25689] Avg episode reward: [(0, '-46.077')] [2022-07-09 22:10:33,372][26022] Updated weights on worker 0-0, policy_version 441708 (0.00096) [2022-07-09 22:10:35,340][26022] Updated weights on worker 0-0, policy_version 441718 (0.00091) [2022-07-09 22:10:36,969][26022] Updated weights on worker 0-0, policy_version 441728 (0.00084) [2022-07-09 22:10:37,561][25689] Fps is (10 sec: 5666.5, 60 sec: 5651.0, 300 sec: 5642.6). Total num frames: 452331520. Throughput: 0: 5069.8. Samples: 452324880. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:37,562][25689] Avg episode reward: [(0, '-46.912')] [2022-07-09 22:10:38,983][26022] Updated weights on worker 0-0, policy_version 441738 (0.00084) [2022-07-09 22:10:40,683][26022] Updated weights on worker 0-0, policy_version 441748 (0.00090) [2022-07-09 22:10:42,511][26022] Updated weights on worker 0-0, policy_version 441758 (0.00088) [2022-07-09 22:10:42,603][25689] Fps is (10 sec: 5673.0, 60 sec: 5633.1, 300 sec: 5642.5). Total num frames: 452360192. Throughput: 0: 5921.8. Samples: 452359002. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:42,604][25689] Avg episode reward: [(0, '-45.335')] [2022-07-09 22:10:44,304][26022] Updated weights on worker 0-0, policy_version 441768 (0.00085) [2022-07-09 22:10:46,034][26022] Updated weights on worker 0-0, policy_version 441778 (0.00093) [2022-07-09 22:10:47,661][25689] Fps is (10 sec: 5778.9, 60 sec: 5668.0, 300 sec: 5646.0). Total num frames: 452389888. Throughput: 0: 5910.8. Samples: 452393468. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:47,663][25689] Avg episode reward: [(0, '-45.487')] [2022-07-09 22:10:47,867][26022] Updated weights on worker 0-0, policy_version 441788 (0.00083) [2022-07-09 22:10:49,654][26022] Updated weights on worker 0-0, policy_version 441798 (0.00094) [2022-07-09 22:10:51,512][26022] Updated weights on worker 0-0, policy_version 441808 (0.00084) [2022-07-09 22:10:52,704][25689] Fps is (10 sec: 5677.2, 60 sec: 5636.3, 300 sec: 5645.9). Total num frames: 452417536. Throughput: 0: 5904.7. Samples: 452427342. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:52,705][25689] Avg episode reward: [(0, '-45.777')] [2022-07-09 22:10:53,336][26022] Updated weights on worker 0-0, policy_version 441818 (0.00087) [2022-07-09 22:10:55,014][26022] Updated weights on worker 0-0, policy_version 441828 (0.00082) [2022-07-09 22:10:56,904][26022] Updated weights on worker 0-0, policy_version 441838 (0.00088) [2022-07-09 22:10:57,718][25689] Fps is (10 sec: 5498.5, 60 sec: 5618.9, 300 sec: 5638.9). Total num frames: 452445184. Throughput: 0: 5936.5. Samples: 452444540. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:10:57,720][25689] Avg episode reward: [(0, '-45.381')] [2022-07-09 22:10:58,692][26022] Updated weights on worker 0-0, policy_version 441848 (0.00091) [2022-07-09 22:11:00,482][26022] Updated weights on worker 0-0, policy_version 441858 (0.00092) [2022-07-09 22:11:02,611][26022] Updated weights on worker 0-0, policy_version 441868 (0.00091) [2022-07-09 22:11:02,733][25689] Fps is (10 sec: 5513.6, 60 sec: 5636.4, 300 sec: 5645.9). Total num frames: 452472832. Throughput: 0: 5847.9. Samples: 452476718. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:02,735][25689] Avg episode reward: [(0, '-44.683')] [2022-07-09 22:11:04,503][26022] Updated weights on worker 0-0, policy_version 441878 (0.00082) [2022-07-09 22:11:06,271][26022] Updated weights on worker 0-0, policy_version 441888 (0.00086) [2022-07-09 22:11:07,736][25689] Fps is (10 sec: 5519.5, 60 sec: 5655.4, 300 sec: 5640.5). Total num frames: 452500480. Throughput: 0: 5835.0. Samples: 452510606. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:07,737][25689] Avg episode reward: [(0, '-44.483')] [2022-07-09 22:11:08,147][26022] Updated weights on worker 0-0, policy_version 441898 (0.00092) [2022-07-09 22:11:10,041][26022] Updated weights on worker 0-0, policy_version 441908 (0.00081) [2022-07-09 22:11:11,942][26022] Updated weights on worker 0-0, policy_version 441919 (0.00090) [2022-07-09 22:11:12,802][25689] Fps is (10 sec: 5593.7, 60 sec: 5624.4, 300 sec: 5642.9). Total num frames: 452529152. Throughput: 0: 4978.1. Samples: 452527388. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:12,802][25689] Avg episode reward: [(0, '-44.369')] [2022-07-09 22:11:13,685][26022] Updated weights on worker 0-0, policy_version 441929 (0.00087) [2022-07-09 22:11:15,571][26022] Updated weights on worker 0-0, policy_version 441939 (0.00084) [2022-07-09 22:11:17,472][26022] Updated weights on worker 0-0, policy_version 441949 (0.00111) [2022-07-09 22:11:17,833][25689] Fps is (10 sec: 5780.7, 60 sec: 5660.0, 300 sec: 5646.0). Total num frames: 452558848. Throughput: 0: 5810.6. Samples: 452561422. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:17,834][25689] Avg episode reward: [(0, '-44.618')] [2022-07-09 22:11:19,136][26022] Updated weights on worker 0-0, policy_version 441959 (0.00098) [2022-07-09 22:11:20,939][26022] Updated weights on worker 0-0, policy_version 441969 (0.00088) [2022-07-09 22:11:22,705][26022] Updated weights on worker 0-0, policy_version 441979 (0.00086) [2022-07-09 22:11:22,895][25689] Fps is (10 sec: 5681.4, 60 sec: 5621.9, 300 sec: 5641.6). Total num frames: 452586496. Throughput: 0: 5887.4. Samples: 452595418. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:22,895][25689] Avg episode reward: [(0, '-44.854')] [2022-07-09 22:11:24,548][26022] Updated weights on worker 0-0, policy_version 441989 (0.00080) [2022-07-09 22:11:26,401][26022] Updated weights on worker 0-0, policy_version 441999 (0.00087) [2022-07-09 22:11:27,902][25689] Fps is (10 sec: 5695.2, 60 sec: 5656.8, 300 sec: 5642.3). Total num frames: 452616192. Throughput: 0: 5060.1. Samples: 452612646. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:27,903][25689] Avg episode reward: [(0, '-45.547')] [2022-07-09 22:11:28,142][26022] Updated weights on worker 0-0, policy_version 442009 (0.00086) [2022-07-09 22:11:30,018][26022] Updated weights on worker 0-0, policy_version 442019 (0.00091) [2022-07-09 22:11:31,803][26022] Updated weights on worker 0-0, policy_version 442029 (0.00086) [2022-07-09 22:11:32,965][25689] Fps is (10 sec: 5592.9, 60 sec: 5622.1, 300 sec: 5641.2). Total num frames: 452642816. Throughput: 0: 5923.0. Samples: 452646814. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:32,965][25689] Avg episode reward: [(0, '-45.596')] [2022-07-09 22:11:33,455][26022] Updated weights on worker 0-0, policy_version 442039 (0.00083) [2022-07-09 22:11:35,636][26022] Updated weights on worker 0-0, policy_version 442049 (0.00090) [2022-07-09 22:11:36,128][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:11:36,142][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000442052_452661248.pth [2022-07-09 22:11:36,142][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000440067_450628608.pth [2022-07-09 22:11:36,980][26022] Updated weights on worker 0-0, policy_version 442059 (0.00085) [2022-07-09 22:11:37,973][25689] Fps is (10 sec: 5592.4, 60 sec: 5644.5, 300 sec: 5644.8). Total num frames: 452672512. Throughput: 0: 5958.4. Samples: 452681422. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:37,973][25689] Avg episode reward: [(0, '-46.068')] [2022-07-09 22:11:39,097][26022] Updated weights on worker 0-0, policy_version 442069 (0.00089) [2022-07-09 22:11:40,557][26022] Updated weights on worker 0-0, policy_version 442079 (0.00092) [2022-07-09 22:11:42,505][26022] Updated weights on worker 0-0, policy_version 442089 (0.00093) [2022-07-09 22:11:42,979][25689] Fps is (10 sec: 5828.6, 60 sec: 5647.9, 300 sec: 5641.3). Total num frames: 452701184. Throughput: 0: 5140.4. Samples: 452698658. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:42,979][25689] Avg episode reward: [(0, '-45.878')] [2022-07-09 22:11:44,362][26022] Updated weights on worker 0-0, policy_version 442099 (0.00096) [2022-07-09 22:11:46,065][26022] Updated weights on worker 0-0, policy_version 442109 (0.00089) [2022-07-09 22:11:47,863][26022] Updated weights on worker 0-0, policy_version 442119 (0.00086) [2022-07-09 22:11:48,080][25689] Fps is (10 sec: 5775.0, 60 sec: 5643.8, 300 sec: 5647.0). Total num frames: 452730880. Throughput: 0: 5965.4. Samples: 452733014. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:48,080][25689] Avg episode reward: [(0, '-45.901')] [2022-07-09 22:11:49,698][26022] Updated weights on worker 0-0, policy_version 442129 (0.00087) [2022-07-09 22:11:51,304][26022] Updated weights on worker 0-0, policy_version 442139 (0.00095) [2022-07-09 22:11:53,199][25689] Fps is (10 sec: 5710.8, 60 sec: 5653.6, 300 sec: 5645.0). Total num frames: 452759552. Throughput: 0: 5944.8. Samples: 452767106. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:53,200][25689] Avg episode reward: [(0, '-45.216')] [2022-07-09 22:11:53,370][26022] Updated weights on worker 0-0, policy_version 442149 (0.00085) [2022-07-09 22:11:54,836][26022] Updated weights on worker 0-0, policy_version 442159 (0.00090) [2022-07-09 22:11:56,876][26022] Updated weights on worker 0-0, policy_version 442169 (0.00090) [2022-07-09 22:11:58,219][25689] Fps is (10 sec: 5655.7, 60 sec: 5670.0, 300 sec: 5644.7). Total num frames: 452788224. Throughput: 0: 5089.5. Samples: 452784464. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:11:58,221][25689] Avg episode reward: [(0, '-46.494')] [2022-07-09 22:11:58,614][26022] Updated weights on worker 0-0, policy_version 442179 (0.00086) [2022-07-09 22:12:00,544][26022] Updated weights on worker 0-0, policy_version 442189 (0.00093) [2022-07-09 22:12:02,403][26022] Updated weights on worker 0-0, policy_version 442199 (0.00099) [2022-07-09 22:12:03,231][25689] Fps is (10 sec: 5512.3, 60 sec: 5653.4, 300 sec: 5648.2). Total num frames: 452814848. Throughput: 0: 5930.9. Samples: 452818770. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:03,233][25689] Avg episode reward: [(0, '-45.863')] [2022-07-09 22:12:04,424][26022] Updated weights on worker 0-0, policy_version 442209 (0.00195) [2022-07-09 22:12:06,009][26022] Updated weights on worker 0-0, policy_version 442219 (0.00086) [2022-07-09 22:12:08,051][26022] Updated weights on worker 0-0, policy_version 442229 (0.00083) [2022-07-09 22:12:08,244][25689] Fps is (10 sec: 5413.7, 60 sec: 5652.5, 300 sec: 5642.2). Total num frames: 452842496. Throughput: 0: 5855.4. Samples: 452851082. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:08,244][25689] Avg episode reward: [(0, '-47.016')] [2022-07-09 22:12:09,806][26022] Updated weights on worker 0-0, policy_version 442239 (0.00087) [2022-07-09 22:12:11,579][26022] Updated weights on worker 0-0, policy_version 442249 (0.00080) [2022-07-09 22:12:13,266][26022] Updated weights on worker 0-0, policy_version 442259 (0.00090) [2022-07-09 22:12:13,283][25689] Fps is (10 sec: 5806.6, 60 sec: 5688.8, 300 sec: 5648.4). Total num frames: 452873216. Throughput: 0: 5033.3. Samples: 452868192. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:13,283][25689] Avg episode reward: [(0, '-46.865')] [2022-07-09 22:12:15,282][26022] Updated weights on worker 0-0, policy_version 442269 (0.00086) [2022-07-09 22:12:16,778][26022] Updated weights on worker 0-0, policy_version 442279 (0.00084) [2022-07-09 22:12:18,293][25689] Fps is (10 sec: 5706.5, 60 sec: 5640.0, 300 sec: 5641.4). Total num frames: 452899840. Throughput: 0: 5898.7. Samples: 452902874. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:18,293][25689] Avg episode reward: [(0, '-46.114')] [2022-07-09 22:12:18,931][26022] Updated weights on worker 0-0, policy_version 442289 (0.00091) [2022-07-09 22:12:20,292][26022] Updated weights on worker 0-0, policy_version 442299 (0.00095) [2022-07-09 22:12:22,322][26022] Updated weights on worker 0-0, policy_version 442309 (0.00091) [2022-07-09 22:12:23,317][25689] Fps is (10 sec: 5714.7, 60 sec: 5694.3, 300 sec: 5651.3). Total num frames: 452930560. Throughput: 0: 5904.6. Samples: 452937372. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:23,318][25689] Avg episode reward: [(0, '-46.681')] [2022-07-09 22:12:23,938][26022] Updated weights on worker 0-0, policy_version 442319 (0.00081) [2022-07-09 22:12:25,902][26022] Updated weights on worker 0-0, policy_version 442329 (0.00093) [2022-07-09 22:12:27,576][26022] Updated weights on worker 0-0, policy_version 442339 (0.00087) [2022-07-09 22:12:28,354][25689] Fps is (10 sec: 5903.0, 60 sec: 5674.6, 300 sec: 5649.7). Total num frames: 452959232. Throughput: 0: 5137.1. Samples: 452954390. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:28,355][25689] Avg episode reward: [(0, '-45.452')] [2022-07-09 22:12:29,694][26022] Updated weights on worker 0-0, policy_version 442349 (0.00089) [2022-07-09 22:12:31,188][26022] Updated weights on worker 0-0, policy_version 442359 (0.00079) [2022-07-09 22:12:33,298][26022] Updated weights on worker 0-0, policy_version 442369 (0.00079) [2022-07-09 22:12:33,408][25689] Fps is (10 sec: 5480.0, 60 sec: 5675.4, 300 sec: 5642.0). Total num frames: 452985856. Throughput: 0: 5985.8. Samples: 452988656. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:33,408][25689] Avg episode reward: [(0, '-44.915')] [2022-07-09 22:12:34,587][26022] Updated weights on worker 0-0, policy_version 442379 (0.00087) [2022-07-09 22:12:36,811][26022] Updated weights on worker 0-0, policy_version 442389 (0.00083) [2022-07-09 22:12:38,420][25689] Fps is (10 sec: 5595.3, 60 sec: 5675.1, 300 sec: 5645.5). Total num frames: 453015552. Throughput: 0: 5952.0. Samples: 453022670. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:38,420][25689] Avg episode reward: [(0, '-45.262')] [2022-07-09 22:12:38,461][26022] Updated weights on worker 0-0, policy_version 442399 (0.00090) [2022-07-09 22:12:40,322][26022] Updated weights on worker 0-0, policy_version 442409 (0.00090) [2022-07-09 22:12:41,992][26022] Updated weights on worker 0-0, policy_version 442419 (0.00085) [2022-07-09 22:12:43,430][25689] Fps is (10 sec: 5823.9, 60 sec: 5674.7, 300 sec: 5649.7). Total num frames: 453044224. Throughput: 0: 5098.4. Samples: 453039914. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:43,430][25689] Avg episode reward: [(0, '-45.804')] [2022-07-09 22:12:43,740][26022] Updated weights on worker 0-0, policy_version 442429 (0.00093) [2022-07-09 22:12:45,776][26022] Updated weights on worker 0-0, policy_version 442439 (0.00355) [2022-07-09 22:12:47,464][26022] Updated weights on worker 0-0, policy_version 442449 (0.00083) [2022-07-09 22:12:48,444][25689] Fps is (10 sec: 5822.8, 60 sec: 5682.9, 300 sec: 5647.4). Total num frames: 453073920. Throughput: 0: 5979.3. Samples: 453074512. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-09 22:12:48,444][25689] Avg episode reward: [(0, '-46.534')] [2022-07-09 22:12:49,244][26022] Updated weights on worker 0-0, policy_version 442459 (0.00089) [2022-07-09 22:12:51,175][26022] Updated weights on worker 0-0, policy_version 442469 (0.00081) [2022-07-09 22:12:52,913][26022] Updated weights on worker 0-0, policy_version 442479 (0.00091) [2022-07-09 22:12:53,587][25689] Fps is (10 sec: 5746.3, 60 sec: 5680.6, 300 sec: 5649.3). Total num frames: 453102592. Throughput: 0: 5955.4. Samples: 453108834. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:12:53,588][25689] Avg episode reward: [(0, '-45.861')] [2022-07-09 22:12:54,687][26022] Updated weights on worker 0-0, policy_version 442489 (0.00080) [2022-07-09 22:12:56,466][26022] Updated weights on worker 0-0, policy_version 442499 (0.00089) [2022-07-09 22:12:58,390][26022] Updated weights on worker 0-0, policy_version 442509 (0.00096) [2022-07-09 22:12:58,604][25689] Fps is (10 sec: 5644.1, 60 sec: 5680.9, 300 sec: 5650.1). Total num frames: 453131264. Throughput: 0: 5111.8. Samples: 453125846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:12:58,604][25689] Avg episode reward: [(0, '-45.799')] [2022-07-09 22:13:00,119][26022] Updated weights on worker 0-0, policy_version 442519 (0.00097) [2022-07-09 22:13:01,857][26022] Updated weights on worker 0-0, policy_version 442529 (0.00084) [2022-07-09 22:13:03,610][25689] Fps is (10 sec: 5516.8, 60 sec: 5681.4, 300 sec: 5650.0). Total num frames: 453157888. Throughput: 0: 5917.6. Samples: 453159334. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:03,611][25689] Avg episode reward: [(0, '-45.560')] [2022-07-09 22:13:03,892][26022] Updated weights on worker 0-0, policy_version 442539 (0.00096) [2022-07-09 22:13:05,776][26022] Updated weights on worker 0-0, policy_version 442549 (0.00100) [2022-07-09 22:13:07,465][26022] Updated weights on worker 0-0, policy_version 442559 (0.00092) [2022-07-09 22:13:08,629][25689] Fps is (10 sec: 5413.5, 60 sec: 5680.9, 300 sec: 5647.1). Total num frames: 453185536. Throughput: 0: 5837.2. Samples: 453192338. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:08,629][25689] Avg episode reward: [(0, '-45.124')] [2022-07-09 22:13:09,508][26022] Updated weights on worker 0-0, policy_version 442569 (0.00092) [2022-07-09 22:13:11,063][26022] Updated weights on worker 0-0, policy_version 442579 (0.00085) [2022-07-09 22:13:13,019][26022] Updated weights on worker 0-0, policy_version 442589 (0.00057) [2022-07-09 22:13:13,675][25689] Fps is (10 sec: 5697.7, 60 sec: 5663.3, 300 sec: 5653.6). Total num frames: 453215232. Throughput: 0: 5021.3. Samples: 453209700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:13,675][25689] Avg episode reward: [(0, '-44.628')] [2022-07-09 22:13:14,409][26022] Updated weights on worker 0-0, policy_version 442599 (0.00087) [2022-07-09 22:13:16,663][26022] Updated weights on worker 0-0, policy_version 442609 (0.00090) [2022-07-09 22:13:18,085][26022] Updated weights on worker 0-0, policy_version 442619 (0.00093) [2022-07-09 22:13:18,681][25689] Fps is (10 sec: 5806.4, 60 sec: 5697.5, 300 sec: 5654.0). Total num frames: 453243904. Throughput: 0: 5905.6. Samples: 453244418. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:18,682][25689] Avg episode reward: [(0, '-44.081')] [2022-07-09 22:13:20,074][26022] Updated weights on worker 0-0, policy_version 442629 (0.00078) [2022-07-09 22:13:21,856][26022] Updated weights on worker 0-0, policy_version 442639 (0.00095) [2022-07-09 22:13:23,558][26022] Updated weights on worker 0-0, policy_version 442649 (0.00087) [2022-07-09 22:13:23,739][25689] Fps is (10 sec: 5799.7, 60 sec: 5677.4, 300 sec: 5653.5). Total num frames: 453273600. Throughput: 0: 5939.8. Samples: 453278892. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:23,739][25689] Avg episode reward: [(0, '-44.776')] [2022-07-09 22:13:25,494][26022] Updated weights on worker 0-0, policy_version 442659 (0.00086) [2022-07-09 22:13:27,059][26022] Updated weights on worker 0-0, policy_version 442669 (0.00082) [2022-07-09 22:13:28,781][25689] Fps is (10 sec: 5677.6, 60 sec: 5659.9, 300 sec: 5650.3). Total num frames: 453301248. Throughput: 0: 5140.9. Samples: 453295940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:28,782][25689] Avg episode reward: [(0, '-44.953')] [2022-07-09 22:13:29,139][26022] Updated weights on worker 0-0, policy_version 442679 (0.00089) [2022-07-09 22:13:30,812][26022] Updated weights on worker 0-0, policy_version 442689 (0.00095) [2022-07-09 22:13:32,781][26022] Updated weights on worker 0-0, policy_version 442699 (0.00085) [2022-07-09 22:13:33,866][25689] Fps is (10 sec: 5662.5, 60 sec: 5707.9, 300 sec: 5652.9). Total num frames: 453330944. Throughput: 0: 5954.7. Samples: 453329932. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:33,867][25689] Avg episode reward: [(0, '-44.824')] [2022-07-09 22:13:34,455][26022] Updated weights on worker 0-0, policy_version 442709 (0.00088) [2022-07-09 22:13:36,288][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:13:36,302][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000442719_453344256.pth [2022-07-09 22:13:36,302][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000440729_451306496.pth [2022-07-09 22:13:36,304][26022] Updated weights on worker 0-0, policy_version 442719 (0.00086) [2022-07-09 22:13:37,957][26022] Updated weights on worker 0-0, policy_version 442729 (0.00059) [2022-07-09 22:13:38,901][25689] Fps is (10 sec: 5666.8, 60 sec: 5671.9, 300 sec: 5652.7). Total num frames: 453358592. Throughput: 0: 5912.5. Samples: 453363964. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:38,902][25689] Avg episode reward: [(0, '-44.943')] [2022-07-09 22:13:39,976][26022] Updated weights on worker 0-0, policy_version 442739 (0.00091) [2022-07-09 22:13:41,690][26022] Updated weights on worker 0-0, policy_version 442749 (0.00093) [2022-07-09 22:13:43,497][26022] Updated weights on worker 0-0, policy_version 442759 (0.00088) [2022-07-09 22:13:43,951][25689] Fps is (10 sec: 5584.6, 60 sec: 5668.1, 300 sec: 5652.3). Total num frames: 453387264. Throughput: 0: 5054.7. Samples: 453381058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:43,952][25689] Avg episode reward: [(0, '-44.962')] [2022-07-09 22:13:45,300][26022] Updated weights on worker 0-0, policy_version 442769 (0.00085) [2022-07-09 22:13:47,010][26022] Updated weights on worker 0-0, policy_version 442779 (0.00093) [2022-07-09 22:13:48,789][26022] Updated weights on worker 0-0, policy_version 442789 (0.00094) [2022-07-09 22:13:48,986][25689] Fps is (10 sec: 5685.9, 60 sec: 5649.2, 300 sec: 5652.7). Total num frames: 453415936. Throughput: 0: 5926.3. Samples: 453415678. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:48,986][25689] Avg episode reward: [(0, '-44.250')] [2022-07-09 22:13:50,723][26022] Updated weights on worker 0-0, policy_version 442799 (0.00092) [2022-07-09 22:13:52,572][26022] Updated weights on worker 0-0, policy_version 442809 (0.00085) [2022-07-09 22:13:54,022][25689] Fps is (10 sec: 5795.4, 60 sec: 5676.2, 300 sec: 5656.0). Total num frames: 453445632. Throughput: 0: 5942.4. Samples: 453449710. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:54,023][25689] Avg episode reward: [(0, '-45.059')] [2022-07-09 22:13:54,085][26022] Updated weights on worker 0-0, policy_version 442819 (0.00082) [2022-07-09 22:13:56,161][26022] Updated weights on worker 0-0, policy_version 442829 (0.00093) [2022-07-09 22:13:57,672][26022] Updated weights on worker 0-0, policy_version 442839 (0.00087) [2022-07-09 22:13:59,034][25689] Fps is (10 sec: 5707.2, 60 sec: 5659.7, 300 sec: 5652.9). Total num frames: 453473280. Throughput: 0: 5111.4. Samples: 453466872. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:13:59,035][25689] Avg episode reward: [(0, '-45.970')] [2022-07-09 22:13:59,728][26022] Updated weights on worker 0-0, policy_version 442849 (0.00088) [2022-07-09 22:14:01,162][26022] Updated weights on worker 0-0, policy_version 442859 (0.00088) [2022-07-09 22:14:03,519][26022] Updated weights on worker 0-0, policy_version 442869 (0.00086) [2022-07-09 22:14:04,064][25689] Fps is (10 sec: 5404.8, 60 sec: 5657.5, 300 sec: 5656.1). Total num frames: 453499904. Throughput: 0: 5874.3. Samples: 453499208. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:04,064][25689] Avg episode reward: [(0, '-45.547')] [2022-07-09 22:14:05,367][26022] Updated weights on worker 0-0, policy_version 442879 (0.00093) [2022-07-09 22:14:07,128][26022] Updated weights on worker 0-0, policy_version 442889 (0.00089) [2022-07-09 22:14:09,101][25689] Fps is (10 sec: 5391.0, 60 sec: 5655.8, 300 sec: 5647.6). Total num frames: 453527552. Throughput: 0: 5837.3. Samples: 453533094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:09,101][25689] Avg episode reward: [(0, '-45.824')] [2022-07-09 22:14:09,110][26022] Updated weights on worker 0-0, policy_version 442899 (0.00088) [2022-07-09 22:14:10,709][26022] Updated weights on worker 0-0, policy_version 442909 (0.00098) [2022-07-09 22:14:12,712][26022] Updated weights on worker 0-0, policy_version 442919 (0.00084) [2022-07-09 22:14:14,147][25689] Fps is (10 sec: 5788.8, 60 sec: 5672.7, 300 sec: 5661.4). Total num frames: 453558272. Throughput: 0: 4990.6. Samples: 453550142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:14,147][25689] Avg episode reward: [(0, '-46.423')] [2022-07-09 22:14:14,221][26022] Updated weights on worker 0-0, policy_version 442929 (0.00089) [2022-07-09 22:14:16,282][26022] Updated weights on worker 0-0, policy_version 442939 (0.00084) [2022-07-09 22:14:17,918][26022] Updated weights on worker 0-0, policy_version 442949 (0.00103) [2022-07-09 22:14:19,155][25689] Fps is (10 sec: 5703.4, 60 sec: 5638.7, 300 sec: 5648.4). Total num frames: 453584896. Throughput: 0: 5855.3. Samples: 453584690. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:19,156][25689] Avg episode reward: [(0, '-47.244')] [2022-07-09 22:14:19,719][26022] Updated weights on worker 0-0, policy_version 442959 (0.00088) [2022-07-09 22:14:21,545][26022] Updated weights on worker 0-0, policy_version 442969 (0.00090) [2022-07-09 22:14:23,304][26022] Updated weights on worker 0-0, policy_version 442979 (0.00084) [2022-07-09 22:14:24,179][25689] Fps is (10 sec: 5715.7, 60 sec: 5658.7, 300 sec: 5662.6). Total num frames: 453615616. Throughput: 0: 5948.0. Samples: 453618856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:24,180][25689] Avg episode reward: [(0, '-46.666')] [2022-07-09 22:14:25,519][26022] Updated weights on worker 0-0, policy_version 442989 (0.00088) [2022-07-09 22:14:26,892][26022] Updated weights on worker 0-0, policy_version 442999 (0.00089) [2022-07-09 22:14:29,040][26022] Updated weights on worker 0-0, policy_version 443009 (0.00072) [2022-07-09 22:14:29,191][25689] Fps is (10 sec: 5713.9, 60 sec: 5644.7, 300 sec: 5650.8). Total num frames: 453642240. Throughput: 0: 5100.7. Samples: 453635568. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:29,192][25689] Avg episode reward: [(0, '-45.897')] [2022-07-09 22:14:30,488][26022] Updated weights on worker 0-0, policy_version 443019 (0.00089) [2022-07-09 22:14:32,396][26022] Updated weights on worker 0-0, policy_version 443029 (0.00090) [2022-07-09 22:14:34,282][25689] Fps is (10 sec: 5473.4, 60 sec: 5627.1, 300 sec: 5656.7). Total num frames: 453670912. Throughput: 0: 5932.2. Samples: 453669590. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:34,284][25689] Avg episode reward: [(0, '-45.770')] [2022-07-09 22:14:34,294][26022] Updated weights on worker 0-0, policy_version 443039 (0.00083) [2022-07-09 22:14:35,894][26022] Updated weights on worker 0-0, policy_version 443049 (0.00092) [2022-07-09 22:14:37,939][26022] Updated weights on worker 0-0, policy_version 443059 (0.00548) [2022-07-09 22:14:39,286][25689] Fps is (10 sec: 5680.6, 60 sec: 5646.9, 300 sec: 5653.8). Total num frames: 453699584. Throughput: 0: 5902.5. Samples: 453703512. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:39,286][25689] Avg episode reward: [(0, '-45.172')] [2022-07-09 22:14:39,688][26022] Updated weights on worker 0-0, policy_version 443069 (0.00094) [2022-07-09 22:14:41,544][26022] Updated weights on worker 0-0, policy_version 443079 (0.00084) [2022-07-09 22:14:43,450][26022] Updated weights on worker 0-0, policy_version 443089 (0.00091) [2022-07-09 22:14:44,291][25689] Fps is (10 sec: 5626.8, 60 sec: 5634.2, 300 sec: 5655.0). Total num frames: 453727232. Throughput: 0: 5909.1. Samples: 453737700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:44,293][25689] Avg episode reward: [(0, '-45.376')] [2022-07-09 22:14:44,829][26022] Updated weights on worker 0-0, policy_version 443099 (0.00087) [2022-07-09 22:14:47,033][26022] Updated weights on worker 0-0, policy_version 443109 (0.00095) [2022-07-09 22:14:48,622][26022] Updated weights on worker 0-0, policy_version 443119 (0.00087) [2022-07-09 22:14:49,302][25689] Fps is (10 sec: 5725.1, 60 sec: 5653.4, 300 sec: 5656.0). Total num frames: 453756928. Throughput: 0: 5933.3. Samples: 453754894. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:49,303][25689] Avg episode reward: [(0, '-44.551')] [2022-07-09 22:14:50,625][26022] Updated weights on worker 0-0, policy_version 443129 (0.00086) [2022-07-09 22:14:52,260][26022] Updated weights on worker 0-0, policy_version 443139 (0.00093) [2022-07-09 22:14:54,205][26022] Updated weights on worker 0-0, policy_version 443149 (0.00084) [2022-07-09 22:14:54,387][25689] Fps is (10 sec: 5781.6, 60 sec: 5631.9, 300 sec: 5654.6). Total num frames: 453785600. Throughput: 0: 5946.5. Samples: 453789142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:54,389][25689] Avg episode reward: [(0, '-44.362')] [2022-07-09 22:14:56,096][26022] Updated weights on worker 0-0, policy_version 443159 (0.00094) [2022-07-09 22:14:57,696][26022] Updated weights on worker 0-0, policy_version 443169 (0.00086) [2022-07-09 22:14:59,426][25689] Fps is (10 sec: 5664.2, 60 sec: 5646.3, 300 sec: 5661.1). Total num frames: 453814272. Throughput: 0: 5950.3. Samples: 453823352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:14:59,427][25689] Avg episode reward: [(0, '-45.198')] [2022-07-09 22:14:59,435][26022] Updated weights on worker 0-0, policy_version 443179 (0.00095) [2022-07-09 22:15:01,707][26022] Updated weights on worker 0-0, policy_version 443189 (0.00092) [2022-07-09 22:15:03,538][26022] Updated weights on worker 0-0, policy_version 443199 (0.00091) [2022-07-09 22:15:04,451][25689] Fps is (10 sec: 5494.5, 60 sec: 5646.8, 300 sec: 5661.1). Total num frames: 453840896. Throughput: 0: 4992.4. Samples: 453838344. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:04,451][25689] Avg episode reward: [(0, '-44.791')] [2022-07-09 22:15:05,345][26022] Updated weights on worker 0-0, policy_version 443209 (0.00085) [2022-07-09 22:15:07,032][26022] Updated weights on worker 0-0, policy_version 443219 (0.00091) [2022-07-09 22:15:08,828][26022] Updated weights on worker 0-0, policy_version 443229 (0.00092) [2022-07-09 22:15:09,459][25689] Fps is (10 sec: 5613.4, 60 sec: 5683.4, 300 sec: 5659.3). Total num frames: 453870592. Throughput: 0: 5853.7. Samples: 453872888. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:09,460][25689] Avg episode reward: [(0, '-44.861')] [2022-07-09 22:15:10,481][26022] Updated weights on worker 0-0, policy_version 443239 (0.00097) [2022-07-09 22:15:12,439][26022] Updated weights on worker 0-0, policy_version 443249 (0.00087) [2022-07-09 22:15:14,257][26022] Updated weights on worker 0-0, policy_version 443259 (0.00091) [2022-07-09 22:15:14,590][25689] Fps is (10 sec: 5655.4, 60 sec: 5624.5, 300 sec: 5657.8). Total num frames: 453898240. Throughput: 0: 5852.2. Samples: 453907378. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:14,591][25689] Avg episode reward: [(0, '-45.596')] [2022-07-09 22:15:15,935][26022] Updated weights on worker 0-0, policy_version 443269 (0.00086) [2022-07-09 22:15:17,771][26022] Updated weights on worker 0-0, policy_version 443279 (0.00085) [2022-07-09 22:15:19,447][26022] Updated weights on worker 0-0, policy_version 443289 (0.00088) [2022-07-09 22:15:19,659][25689] Fps is (10 sec: 5722.7, 60 sec: 5686.7, 300 sec: 5660.3). Total num frames: 453928960. Throughput: 0: 5005.2. Samples: 453924620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:19,659][25689] Avg episode reward: [(0, '-46.401')] [2022-07-09 22:15:21,380][26022] Updated weights on worker 0-0, policy_version 443299 (0.00081) [2022-07-09 22:15:23,120][26022] Updated weights on worker 0-0, policy_version 443309 (0.00082) [2022-07-09 22:15:24,691][25689] Fps is (10 sec: 5880.2, 60 sec: 5652.1, 300 sec: 5663.4). Total num frames: 453957632. Throughput: 0: 5964.2. Samples: 453959060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:24,691][25689] Avg episode reward: [(0, '-46.681')] [2022-07-09 22:15:24,820][26022] Updated weights on worker 0-0, policy_version 443319 (0.01376) [2022-07-09 22:15:26,780][26022] Updated weights on worker 0-0, policy_version 443329 (0.00082) [2022-07-09 22:15:28,560][26022] Updated weights on worker 0-0, policy_version 443339 (0.00096) [2022-07-09 22:15:29,728][25689] Fps is (10 sec: 5491.6, 60 sec: 5649.7, 300 sec: 5656.9). Total num frames: 453984256. Throughput: 0: 5936.4. Samples: 453993210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:29,729][25689] Avg episode reward: [(0, '-46.477')] [2022-07-09 22:15:30,450][26022] Updated weights on worker 0-0, policy_version 443349 (0.00092) [2022-07-09 22:15:32,111][26022] Updated weights on worker 0-0, policy_version 443359 (0.00087) [2022-07-09 22:15:34,079][26022] Updated weights on worker 0-0, policy_version 443369 (0.00084) [2022-07-09 22:15:34,820][25689] Fps is (10 sec: 5661.4, 60 sec: 5683.5, 300 sec: 5663.3). Total num frames: 454014976. Throughput: 0: 5079.7. Samples: 454010138. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 22:15:34,821][25689] Avg episode reward: [(0, '-46.911')] [2022-07-09 22:15:35,782][26022] Updated weights on worker 0-0, policy_version 443379 (0.00095) [2022-07-09 22:15:36,412][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:15:36,429][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000443383_454024192.pth [2022-07-09 22:15:36,429][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000441392_451985408.pth [2022-07-09 22:15:37,565][26022] Updated weights on worker 0-0, policy_version 443389 (0.00088) [2022-07-09 22:15:39,278][26022] Updated weights on worker 0-0, policy_version 443399 (0.00093) [2022-07-09 22:15:39,834][25689] Fps is (10 sec: 5775.3, 60 sec: 5665.5, 300 sec: 5660.4). Total num frames: 454042624. Throughput: 0: 5932.2. Samples: 454044306. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:15:39,835][25689] Avg episode reward: [(0, '-47.611')] [2022-07-09 22:15:41,246][26022] Updated weights on worker 0-0, policy_version 443409 (0.00086) [2022-07-09 22:15:42,884][26022] Updated weights on worker 0-0, policy_version 443419 (0.00090) [2022-07-09 22:15:44,699][26022] Updated weights on worker 0-0, policy_version 443429 (0.00091) [2022-07-09 22:15:44,868][25689] Fps is (10 sec: 5604.8, 60 sec: 5679.8, 300 sec: 5657.4). Total num frames: 454071296. Throughput: 0: 5927.0. Samples: 454078652. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:15:44,870][25689] Avg episode reward: [(0, '-48.324')] [2022-07-09 22:15:46,468][26022] Updated weights on worker 0-0, policy_version 443439 (0.00094) [2022-07-09 22:15:48,402][26022] Updated weights on worker 0-0, policy_version 443449 (0.00089) [2022-07-09 22:15:49,905][25689] Fps is (10 sec: 5795.6, 60 sec: 5677.3, 300 sec: 5664.4). Total num frames: 454100992. Throughput: 0: 5088.3. Samples: 454095880. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:15:49,906][25689] Avg episode reward: [(0, '-48.001')] [2022-07-09 22:15:49,944][26022] Updated weights on worker 0-0, policy_version 443459 (0.00211) [2022-07-09 22:15:51,989][26022] Updated weights on worker 0-0, policy_version 443469 (0.00110) [2022-07-09 22:15:53,740][26022] Updated weights on worker 0-0, policy_version 443479 (0.00080) [2022-07-09 22:15:54,983][25689] Fps is (10 sec: 5770.5, 60 sec: 5677.9, 300 sec: 5666.6). Total num frames: 454129664. Throughput: 0: 5948.2. Samples: 454130074. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:15:54,984][25689] Avg episode reward: [(0, '-48.482')] [2022-07-09 22:15:55,684][26022] Updated weights on worker 0-0, policy_version 443489 (0.00083) [2022-07-09 22:15:57,227][26022] Updated weights on worker 0-0, policy_version 443499 (0.00087) [2022-07-09 22:15:59,015][26022] Updated weights on worker 0-0, policy_version 443509 (0.00098) [2022-07-09 22:16:00,011][25689] Fps is (10 sec: 5674.6, 60 sec: 5679.1, 300 sec: 5669.8). Total num frames: 454158336. Throughput: 0: 5949.0. Samples: 454164336. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:00,011][25689] Avg episode reward: [(0, '-47.470')] [2022-07-09 22:16:00,863][26022] Updated weights on worker 0-0, policy_version 443519 (0.00093) [2022-07-09 22:16:03,055][26022] Updated weights on worker 0-0, policy_version 443529 (0.00083) [2022-07-09 22:16:04,900][26022] Updated weights on worker 0-0, policy_version 443539 (0.00105) [2022-07-09 22:16:05,056][25689] Fps is (10 sec: 5489.8, 60 sec: 5677.1, 300 sec: 5665.6). Total num frames: 454184960. Throughput: 0: 5006.3. Samples: 454179716. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:05,058][25689] Avg episode reward: [(0, '-47.114')] [2022-07-09 22:16:06,575][26022] Updated weights on worker 0-0, policy_version 443549 (0.00090) [2022-07-09 22:16:08,447][26022] Updated weights on worker 0-0, policy_version 443559 (0.00085) [2022-07-09 22:16:10,124][25689] Fps is (10 sec: 5467.9, 60 sec: 5654.7, 300 sec: 5665.5). Total num frames: 454213632. Throughput: 0: 5823.8. Samples: 454213628. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:10,124][25689] Avg episode reward: [(0, '-47.071')] [2022-07-09 22:16:10,340][26022] Updated weights on worker 0-0, policy_version 443569 (0.00095) [2022-07-09 22:16:11,975][26022] Updated weights on worker 0-0, policy_version 443579 (0.00092) [2022-07-09 22:16:13,965][26022] Updated weights on worker 0-0, policy_version 443589 (0.00089) [2022-07-09 22:16:15,205][25689] Fps is (10 sec: 5751.0, 60 sec: 5693.1, 300 sec: 5664.6). Total num frames: 454243328. Throughput: 0: 5812.2. Samples: 454247606. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:15,206][25689] Avg episode reward: [(0, '-45.838')] [2022-07-09 22:16:15,592][26022] Updated weights on worker 0-0, policy_version 443599 (0.00093) [2022-07-09 22:16:17,395][26022] Updated weights on worker 0-0, policy_version 443609 (0.00085) [2022-07-09 22:16:19,490][26022] Updated weights on worker 0-0, policy_version 443619 (0.00096) [2022-07-09 22:16:20,242][25689] Fps is (10 sec: 5566.4, 60 sec: 5628.5, 300 sec: 5661.6). Total num frames: 454269952. Throughput: 0: 4952.9. Samples: 454264536. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:20,242][25689] Avg episode reward: [(0, '-45.268')] [2022-07-09 22:16:21,023][26022] Updated weights on worker 0-0, policy_version 443629 (0.00083) [2022-07-09 22:16:23,142][26022] Updated weights on worker 0-0, policy_version 443639 (0.00093) [2022-07-09 22:16:24,659][26022] Updated weights on worker 0-0, policy_version 443649 (0.00081) [2022-07-09 22:16:25,251][25689] Fps is (10 sec: 5504.5, 60 sec: 5630.7, 300 sec: 5658.1). Total num frames: 454298624. Throughput: 0: 5879.8. Samples: 454298456. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:25,251][25689] Avg episode reward: [(0, '-44.331')] [2022-07-09 22:16:26,471][26022] Updated weights on worker 0-0, policy_version 443659 (0.00086) [2022-07-09 22:16:28,294][26022] Updated weights on worker 0-0, policy_version 443669 (0.00080) [2022-07-09 22:16:30,059][26022] Updated weights on worker 0-0, policy_version 443679 (0.00057) [2022-07-09 22:16:30,253][25689] Fps is (10 sec: 5830.1, 60 sec: 5684.6, 300 sec: 5669.6). Total num frames: 454328320. Throughput: 0: 5921.7. Samples: 454332828. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:30,254][25689] Avg episode reward: [(0, '-44.741')] [2022-07-09 22:16:32,067][26022] Updated weights on worker 0-0, policy_version 443689 (0.00085) [2022-07-09 22:16:33,691][26022] Updated weights on worker 0-0, policy_version 443699 (0.00088) [2022-07-09 22:16:35,389][25689] Fps is (10 sec: 5756.9, 60 sec: 5646.7, 300 sec: 5663.7). Total num frames: 454356992. Throughput: 0: 5064.7. Samples: 454349834. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:35,390][25689] Avg episode reward: [(0, '-44.840')] [2022-07-09 22:16:35,474][26022] Updated weights on worker 0-0, policy_version 443709 (0.00085) [2022-07-09 22:16:37,312][26022] Updated weights on worker 0-0, policy_version 443719 (0.00094) [2022-07-09 22:16:39,134][26022] Updated weights on worker 0-0, policy_version 443729 (0.00092) [2022-07-09 22:16:40,395][25689] Fps is (10 sec: 5553.0, 60 sec: 5647.5, 300 sec: 5660.3). Total num frames: 454384640. Throughput: 0: 5943.9. Samples: 454384328. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:40,396][25689] Avg episode reward: [(0, '-45.163')] [2022-07-09 22:16:40,862][26022] Updated weights on worker 0-0, policy_version 443739 (0.00089) [2022-07-09 22:16:42,724][26022] Updated weights on worker 0-0, policy_version 443749 (0.00088) [2022-07-09 22:16:44,444][26022] Updated weights on worker 0-0, policy_version 443759 (0.00093) [2022-07-09 22:16:45,459][25689] Fps is (10 sec: 5593.0, 60 sec: 5644.7, 300 sec: 5657.5). Total num frames: 454413312. Throughput: 0: 5955.8. Samples: 454418814. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:45,460][25689] Avg episode reward: [(0, '-45.940')] [2022-07-09 22:16:46,274][26022] Updated weights on worker 0-0, policy_version 443769 (0.00089) [2022-07-09 22:16:48,076][26022] Updated weights on worker 0-0, policy_version 443779 (0.00088) [2022-07-09 22:16:49,931][26022] Updated weights on worker 0-0, policy_version 443789 (0.00084) [2022-07-09 22:16:50,549][25689] Fps is (10 sec: 5849.3, 60 sec: 5656.7, 300 sec: 5665.0). Total num frames: 454444032. Throughput: 0: 5082.2. Samples: 454435976. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:50,549][25689] Avg episode reward: [(0, '-47.017')] [2022-07-09 22:16:51,601][26022] Updated weights on worker 0-0, policy_version 443799 (0.00088) [2022-07-09 22:16:53,509][26022] Updated weights on worker 0-0, policy_version 443809 (0.00086) [2022-07-09 22:16:55,224][26022] Updated weights on worker 0-0, policy_version 443819 (0.00093) [2022-07-09 22:16:55,660][25689] Fps is (10 sec: 5721.7, 60 sec: 5636.7, 300 sec: 5659.8). Total num frames: 454471680. Throughput: 0: 5934.1. Samples: 454470124. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:16:55,661][25689] Avg episode reward: [(0, '-46.811')] [2022-07-09 22:16:57,106][26022] Updated weights on worker 0-0, policy_version 443829 (0.00097) [2022-07-09 22:16:58,786][26022] Updated weights on worker 0-0, policy_version 443839 (0.00091) [2022-07-09 22:17:00,668][25689] Fps is (10 sec: 5565.6, 60 sec: 5638.5, 300 sec: 5666.8). Total num frames: 454500352. Throughput: 0: 5903.4. Samples: 454504008. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:00,669][25689] Avg episode reward: [(0, '-46.691')] [2022-07-09 22:17:00,888][26022] Updated weights on worker 0-0, policy_version 443849 (0.00055) [2022-07-09 22:17:02,986][26022] Updated weights on worker 0-0, policy_version 443859 (0.00089) [2022-07-09 22:17:04,642][26022] Updated weights on worker 0-0, policy_version 443869 (0.00092) [2022-07-09 22:17:05,696][25689] Fps is (10 sec: 5612.1, 60 sec: 5657.0, 300 sec: 5666.5). Total num frames: 454528000. Throughput: 0: 5814.9. Samples: 454536488. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:05,696][25689] Avg episode reward: [(0, '-46.063')] [2022-07-09 22:17:06,610][26022] Updated weights on worker 0-0, policy_version 443879 (0.00084) [2022-07-09 22:17:08,182][26022] Updated weights on worker 0-0, policy_version 443889 (0.00094) [2022-07-09 22:17:10,285][26022] Updated weights on worker 0-0, policy_version 443899 (0.00090) [2022-07-09 22:17:10,729][25689] Fps is (10 sec: 5597.9, 60 sec: 5660.2, 300 sec: 5659.7). Total num frames: 454556672. Throughput: 0: 5829.1. Samples: 454553608. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:10,731][25689] Avg episode reward: [(0, '-46.962')] [2022-07-09 22:17:11,836][26022] Updated weights on worker 0-0, policy_version 443909 (0.00089) [2022-07-09 22:17:13,682][26022] Updated weights on worker 0-0, policy_version 443919 (0.00086) [2022-07-09 22:17:15,397][26022] Updated weights on worker 0-0, policy_version 443929 (0.00084) [2022-07-09 22:17:15,828][25689] Fps is (10 sec: 5659.7, 60 sec: 5641.7, 300 sec: 5664.9). Total num frames: 454585344. Throughput: 0: 5846.5. Samples: 454588032. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:15,828][25689] Avg episode reward: [(0, '-46.875')] [2022-07-09 22:17:17,158][26022] Updated weights on worker 0-0, policy_version 443939 (0.00097) [2022-07-09 22:17:18,967][26022] Updated weights on worker 0-0, policy_version 443949 (0.00096) [2022-07-09 22:17:20,846][25689] Fps is (10 sec: 5567.2, 60 sec: 5660.3, 300 sec: 5654.7). Total num frames: 454612992. Throughput: 0: 5865.0. Samples: 454622348. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:20,846][25689] Avg episode reward: [(0, '-45.757')] [2022-07-09 22:17:20,975][26022] Updated weights on worker 0-0, policy_version 443959 (0.00099) [2022-07-09 22:17:22,475][26022] Updated weights on worker 0-0, policy_version 443969 (0.00090) [2022-07-09 22:17:24,598][26022] Updated weights on worker 0-0, policy_version 443979 (0.00087) [2022-07-09 22:17:25,921][25689] Fps is (10 sec: 5782.8, 60 sec: 5687.9, 300 sec: 5660.8). Total num frames: 454643712. Throughput: 0: 5097.1. Samples: 454639578. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:25,922][25689] Avg episode reward: [(0, '-46.478')] [2022-07-09 22:17:26,036][26022] Updated weights on worker 0-0, policy_version 443989 (0.00092) [2022-07-09 22:17:28,095][26022] Updated weights on worker 0-0, policy_version 443999 (0.00095) [2022-07-09 22:17:29,736][26022] Updated weights on worker 0-0, policy_version 444009 (0.00090) [2022-07-09 22:17:31,014][25689] Fps is (10 sec: 5740.4, 60 sec: 5645.8, 300 sec: 5663.5). Total num frames: 454671360. Throughput: 0: 5916.2. Samples: 454673614. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:31,015][25689] Avg episode reward: [(0, '-46.439')] [2022-07-09 22:17:31,695][26022] Updated weights on worker 0-0, policy_version 444019 (0.00085) [2022-07-09 22:17:33,447][26022] Updated weights on worker 0-0, policy_version 444029 (0.00095) [2022-07-09 22:17:35,376][26022] Updated weights on worker 0-0, policy_version 444039 (0.00108) [2022-07-09 22:17:36,057][25689] Fps is (10 sec: 5455.5, 60 sec: 5637.5, 300 sec: 5656.1). Total num frames: 454699008. Throughput: 0: 5915.1. Samples: 454707690. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:36,058][25689] Avg episode reward: [(0, '-45.478')] [2022-07-09 22:17:36,528][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:17:36,546][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000444046_454703104.pth [2022-07-09 22:17:36,546][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000442052_452661248.pth [2022-07-09 22:17:36,936][26022] Updated weights on worker 0-0, policy_version 444049 (0.00082) [2022-07-09 22:17:38,973][26022] Updated weights on worker 0-0, policy_version 444059 (0.00085) [2022-07-09 22:17:40,675][26022] Updated weights on worker 0-0, policy_version 444069 (0.00084) [2022-07-09 22:17:41,066][25689] Fps is (10 sec: 5704.5, 60 sec: 5671.0, 300 sec: 5659.5). Total num frames: 454728704. Throughput: 0: 5066.4. Samples: 454724792. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:41,066][25689] Avg episode reward: [(0, '-45.501')] [2022-07-09 22:17:42,589][26022] Updated weights on worker 0-0, policy_version 444079 (0.00090) [2022-07-09 22:17:44,283][26022] Updated weights on worker 0-0, policy_version 444089 (0.00088) [2022-07-09 22:17:45,989][26022] Updated weights on worker 0-0, policy_version 444099 (0.00094) [2022-07-09 22:17:46,082][25689] Fps is (10 sec: 5822.5, 60 sec: 5675.5, 300 sec: 5656.1). Total num frames: 454757376. Throughput: 0: 5927.0. Samples: 454759070. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:46,082][25689] Avg episode reward: [(0, '-46.371')] [2022-07-09 22:17:47,868][26022] Updated weights on worker 0-0, policy_version 444109 (0.00095) [2022-07-09 22:17:49,635][26022] Updated weights on worker 0-0, policy_version 444119 (0.00090) [2022-07-09 22:17:51,091][25689] Fps is (10 sec: 5720.2, 60 sec: 5649.2, 300 sec: 5658.6). Total num frames: 454786048. Throughput: 0: 5967.8. Samples: 454793432. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:51,091][25689] Avg episode reward: [(0, '-46.821')] [2022-07-09 22:17:51,444][26022] Updated weights on worker 0-0, policy_version 444129 (0.00090) [2022-07-09 22:17:53,175][26022] Updated weights on worker 0-0, policy_version 444139 (0.00087) [2022-07-09 22:17:55,121][26022] Updated weights on worker 0-0, policy_version 444149 (0.00082) [2022-07-09 22:17:56,163][25689] Fps is (10 sec: 5789.9, 60 sec: 5686.8, 300 sec: 5661.0). Total num frames: 454815744. Throughput: 0: 5111.0. Samples: 454810450. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:17:56,163][25689] Avg episode reward: [(0, '-46.634')] [2022-07-09 22:17:56,906][26022] Updated weights on worker 0-0, policy_version 444159 (0.00088) [2022-07-09 22:17:58,606][26022] Updated weights on worker 0-0, policy_version 444169 (0.00097) [2022-07-09 22:18:00,521][26022] Updated weights on worker 0-0, policy_version 444179 (0.00097) [2022-07-09 22:18:01,165][25689] Fps is (10 sec: 5488.7, 60 sec: 5636.5, 300 sec: 5657.6). Total num frames: 454841344. Throughput: 0: 5945.1. Samples: 454844286. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:18:01,166][25689] Avg episode reward: [(0, '-46.061')] [2022-07-09 22:18:02,693][26022] Updated weights on worker 0-0, policy_version 444189 (0.00080) [2022-07-09 22:18:04,500][26022] Updated weights on worker 0-0, policy_version 444199 (0.00092) [2022-07-09 22:18:06,227][25689] Fps is (10 sec: 5188.8, 60 sec: 5616.4, 300 sec: 5653.4). Total num frames: 454867968. Throughput: 0: 5813.0. Samples: 454876178. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:18:06,228][25689] Avg episode reward: [(0, '-45.223')] [2022-07-09 22:18:06,331][26022] Updated weights on worker 0-0, policy_version 444209 (0.00095) [2022-07-09 22:18:08,047][26022] Updated weights on worker 0-0, policy_version 444219 (0.00090) [2022-07-09 22:18:09,843][26022] Updated weights on worker 0-0, policy_version 444229 (0.00088) [2022-07-09 22:18:11,262][25689] Fps is (10 sec: 5578.2, 60 sec: 5633.2, 300 sec: 5653.6). Total num frames: 454897664. Throughput: 0: 4950.8. Samples: 454893294. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:18:11,262][25689] Avg episode reward: [(0, '-44.583')] [2022-07-09 22:18:11,883][26022] Updated weights on worker 0-0, policy_version 444239 (0.00091) [2022-07-09 22:18:13,357][26022] Updated weights on worker 0-0, policy_version 444249 (0.00086) [2022-07-09 22:18:15,343][26022] Updated weights on worker 0-0, policy_version 444259 (0.00092) [2022-07-09 22:18:16,320][25689] Fps is (10 sec: 5884.8, 60 sec: 5653.9, 300 sec: 5656.0). Total num frames: 454927360. Throughput: 0: 5806.1. Samples: 454927484. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:18:16,320][25689] Avg episode reward: [(0, '-44.046')] [2022-07-09 22:18:17,072][26022] Updated weights on worker 0-0, policy_version 444269 (0.00097) [2022-07-09 22:18:18,794][26022] Updated weights on worker 0-0, policy_version 444279 (0.00090) [2022-07-09 22:18:20,767][26022] Updated weights on worker 0-0, policy_version 444289 (0.00091) [2022-07-09 22:18:21,339][25689] Fps is (10 sec: 5690.7, 60 sec: 5653.8, 300 sec: 5649.9). Total num frames: 454955008. Throughput: 0: 5823.0. Samples: 454961754. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 22:18:21,339][25689] Avg episode reward: [(0, '-44.232')] [2022-07-09 22:18:22,383][26022] Updated weights on worker 0-0, policy_version 444299 (0.00088) [2022-07-09 22:18:24,184][26022] Updated weights on worker 0-0, policy_version 444309 (0.00086) [2022-07-09 22:18:26,208][26022] Updated weights on worker 0-0, policy_version 444319 (0.00078) [2022-07-09 22:18:26,349][25689] Fps is (10 sec: 5513.5, 60 sec: 5609.1, 300 sec: 5650.5). Total num frames: 454982656. Throughput: 0: 5100.2. Samples: 454978802. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:26,350][25689] Avg episode reward: [(0, '-43.922')] [2022-07-09 22:18:27,934][26022] Updated weights on worker 0-0, policy_version 444329 (0.00095) [2022-07-09 22:18:29,640][26022] Updated weights on worker 0-0, policy_version 444339 (0.00090) [2022-07-09 22:18:31,352][25689] Fps is (10 sec: 5726.9, 60 sec: 5651.3, 300 sec: 5652.1). Total num frames: 455012352. Throughput: 0: 5939.2. Samples: 455012612. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:31,352][25689] Avg episode reward: [(0, '-43.902')] [2022-07-09 22:18:31,514][26022] Updated weights on worker 0-0, policy_version 444349 (0.00094) [2022-07-09 22:18:33,278][26022] Updated weights on worker 0-0, policy_version 444359 (0.00084) [2022-07-09 22:18:35,243][26022] Updated weights on worker 0-0, policy_version 444369 (0.00093) [2022-07-09 22:18:36,402][25689] Fps is (10 sec: 5704.2, 60 sec: 5650.7, 300 sec: 5651.8). Total num frames: 455040000. Throughput: 0: 5945.5. Samples: 455046882. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:36,402][25689] Avg episode reward: [(0, '-44.434')] [2022-07-09 22:18:36,969][26022] Updated weights on worker 0-0, policy_version 444379 (0.00088) [2022-07-09 22:18:38,709][26022] Updated weights on worker 0-0, policy_version 444389 (0.00082) [2022-07-09 22:18:40,468][26022] Updated weights on worker 0-0, policy_version 444399 (0.00089) [2022-07-09 22:18:41,411][25689] Fps is (10 sec: 5700.8, 60 sec: 5650.7, 300 sec: 5656.0). Total num frames: 455069696. Throughput: 0: 5091.7. Samples: 455063954. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:41,411][25689] Avg episode reward: [(0, '-45.632')] [2022-07-09 22:18:42,567][26022] Updated weights on worker 0-0, policy_version 444409 (0.00086) [2022-07-09 22:18:44,076][26022] Updated weights on worker 0-0, policy_version 444419 (0.00085) [2022-07-09 22:18:46,025][26022] Updated weights on worker 0-0, policy_version 444429 (0.00086) [2022-07-09 22:18:46,481][25689] Fps is (10 sec: 5791.2, 60 sec: 5645.6, 300 sec: 5655.3). Total num frames: 455098368. Throughput: 0: 5944.9. Samples: 455098482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:46,481][25689] Avg episode reward: [(0, '-45.628')] [2022-07-09 22:18:47,558][26022] Updated weights on worker 0-0, policy_version 444439 (0.00083) [2022-07-09 22:18:49,598][26022] Updated weights on worker 0-0, policy_version 444449 (0.00088) [2022-07-09 22:18:51,133][26022] Updated weights on worker 0-0, policy_version 444459 (0.00101) [2022-07-09 22:18:51,493][25689] Fps is (10 sec: 5687.7, 60 sec: 5645.4, 300 sec: 5652.4). Total num frames: 455127040. Throughput: 0: 5976.1. Samples: 455132976. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:51,493][25689] Avg episode reward: [(0, '-45.707')] [2022-07-09 22:18:53,186][26022] Updated weights on worker 0-0, policy_version 444469 (0.00080) [2022-07-09 22:18:54,872][26022] Updated weights on worker 0-0, policy_version 444479 (0.00085) [2022-07-09 22:18:56,604][25689] Fps is (10 sec: 5664.7, 60 sec: 5624.8, 300 sec: 5653.9). Total num frames: 455155712. Throughput: 0: 5097.3. Samples: 455149856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:18:56,604][25689] Avg episode reward: [(0, '-46.333')] [2022-07-09 22:18:56,624][26022] Updated weights on worker 0-0, policy_version 444489 (0.00097) [2022-07-09 22:18:58,454][26022] Updated weights on worker 0-0, policy_version 444499 (0.00090) [2022-07-09 22:19:00,326][26022] Updated weights on worker 0-0, policy_version 444509 (0.00088) [2022-07-09 22:19:01,606][25689] Fps is (10 sec: 5670.3, 60 sec: 5675.7, 300 sec: 5661.3). Total num frames: 455184384. Throughput: 0: 5955.9. Samples: 455184236. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:01,611][25689] Avg episode reward: [(0, '-46.428')] [2022-07-09 22:19:02,399][26022] Updated weights on worker 0-0, policy_version 444519 (0.00088) [2022-07-09 22:19:04,227][26022] Updated weights on worker 0-0, policy_version 444529 (0.00407) [2022-07-09 22:19:06,138][26022] Updated weights on worker 0-0, policy_version 444539 (0.00083) [2022-07-09 22:19:06,631][25689] Fps is (10 sec: 5412.5, 60 sec: 5662.2, 300 sec: 5654.7). Total num frames: 455209984. Throughput: 0: 5850.3. Samples: 455216366. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:06,631][25689] Avg episode reward: [(0, '-45.620')] [2022-07-09 22:19:07,670][26022] Updated weights on worker 0-0, policy_version 444549 (0.00083) [2022-07-09 22:19:09,643][26022] Updated weights on worker 0-0, policy_version 444559 (0.00084) [2022-07-09 22:19:11,247][26022] Updated weights on worker 0-0, policy_version 444569 (0.00086) [2022-07-09 22:19:11,660][25689] Fps is (10 sec: 5499.7, 60 sec: 5662.7, 300 sec: 5651.5). Total num frames: 455239680. Throughput: 0: 4988.6. Samples: 455233584. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:11,661][25689] Avg episode reward: [(0, '-45.720')] [2022-07-09 22:19:13,161][26022] Updated weights on worker 0-0, policy_version 444579 (0.00094) [2022-07-09 22:19:15,163][26022] Updated weights on worker 0-0, policy_version 444589 (0.00089) [2022-07-09 22:19:16,711][25689] Fps is (10 sec: 5892.3, 60 sec: 5663.4, 300 sec: 5661.1). Total num frames: 455269376. Throughput: 0: 5866.2. Samples: 455267806. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:16,711][25689] Avg episode reward: [(0, '-44.151')] [2022-07-09 22:19:16,712][26022] Updated weights on worker 0-0, policy_version 444599 (0.00084) [2022-07-09 22:19:18,664][26022] Updated weights on worker 0-0, policy_version 444609 (0.00089) [2022-07-09 22:19:20,349][26022] Updated weights on worker 0-0, policy_version 444619 (0.00084) [2022-07-09 22:19:21,788][25689] Fps is (10 sec: 5662.4, 60 sec: 5657.9, 300 sec: 5649.7). Total num frames: 455297024. Throughput: 0: 5851.2. Samples: 455302322. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:21,788][25689] Avg episode reward: [(0, '-44.331')] [2022-07-09 22:19:22,118][26022] Updated weights on worker 0-0, policy_version 444629 (0.00091) [2022-07-09 22:19:24,040][26022] Updated weights on worker 0-0, policy_version 444639 (0.00086) [2022-07-09 22:19:25,592][26022] Updated weights on worker 0-0, policy_version 444649 (0.00096) [2022-07-09 22:19:26,799][25689] Fps is (10 sec: 5481.2, 60 sec: 5657.9, 300 sec: 5653.2). Total num frames: 455324672. Throughput: 0: 5121.7. Samples: 455319660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:26,799][25689] Avg episode reward: [(0, '-44.346')] [2022-07-09 22:19:27,443][26022] Updated weights on worker 0-0, policy_version 444659 (0.00079) [2022-07-09 22:19:29,169][26022] Updated weights on worker 0-0, policy_version 444669 (0.00086) [2022-07-09 22:19:31,056][26022] Updated weights on worker 0-0, policy_version 444679 (0.00598) [2022-07-09 22:19:31,811][25689] Fps is (10 sec: 5720.9, 60 sec: 5657.0, 300 sec: 5658.1). Total num frames: 455354368. Throughput: 0: 5978.7. Samples: 455354060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:31,812][25689] Avg episode reward: [(0, '-45.217')] [2022-07-09 22:19:33,068][26022] Updated weights on worker 0-0, policy_version 444689 (0.00092) [2022-07-09 22:19:34,556][26022] Updated weights on worker 0-0, policy_version 444699 (0.00089) [2022-07-09 22:19:36,539][26022] Updated weights on worker 0-0, policy_version 444709 (0.00093) [2022-07-09 22:19:36,652][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:19:36,666][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000444710_455383040.pth [2022-07-09 22:19:36,667][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000442719_453344256.pth [2022-07-09 22:19:36,899][25689] Fps is (10 sec: 5779.0, 60 sec: 5670.4, 300 sec: 5656.5). Total num frames: 455383040. Throughput: 0: 5958.6. Samples: 455388100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:36,899][25689] Avg episode reward: [(0, '-44.707')] [2022-07-09 22:19:38,214][26022] Updated weights on worker 0-0, policy_version 444719 (0.00082) [2022-07-09 22:19:40,263][26022] Updated weights on worker 0-0, policy_version 444729 (0.00085) [2022-07-09 22:19:41,902][26022] Updated weights on worker 0-0, policy_version 444739 (0.00082) [2022-07-09 22:19:41,906][25689] Fps is (10 sec: 5782.2, 60 sec: 5670.6, 300 sec: 5663.4). Total num frames: 455412736. Throughput: 0: 5116.4. Samples: 455405254. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:41,906][25689] Avg episode reward: [(0, '-45.305')] [2022-07-09 22:19:43,659][26022] Updated weights on worker 0-0, policy_version 444749 (0.00088) [2022-07-09 22:19:45,538][26022] Updated weights on worker 0-0, policy_version 444759 (0.00089) [2022-07-09 22:19:46,910][25689] Fps is (10 sec: 5830.0, 60 sec: 5676.7, 300 sec: 5660.1). Total num frames: 455441408. Throughput: 0: 5957.4. Samples: 455439474. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:46,911][25689] Avg episode reward: [(0, '-46.232')] [2022-07-09 22:19:47,127][26022] Updated weights on worker 0-0, policy_version 444769 (0.00083) [2022-07-09 22:19:49,075][26022] Updated weights on worker 0-0, policy_version 444779 (0.00087) [2022-07-09 22:19:50,619][26022] Updated weights on worker 0-0, policy_version 444789 (0.00086) [2022-07-09 22:19:51,922][25689] Fps is (10 sec: 5725.2, 60 sec: 5676.8, 300 sec: 5661.5). Total num frames: 455470080. Throughput: 0: 5968.7. Samples: 455474094. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:51,922][25689] Avg episode reward: [(0, '-46.515')] [2022-07-09 22:19:52,739][26022] Updated weights on worker 0-0, policy_version 444799 (0.00093) [2022-07-09 22:19:54,282][26022] Updated weights on worker 0-0, policy_version 444809 (0.00083) [2022-07-09 22:19:56,183][26022] Updated weights on worker 0-0, policy_version 444819 (0.00089) [2022-07-09 22:19:56,982][25689] Fps is (10 sec: 5693.7, 60 sec: 5681.5, 300 sec: 5661.1). Total num frames: 455498752. Throughput: 0: 5989.7. Samples: 455508392. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:19:56,983][25689] Avg episode reward: [(0, '-46.995')] [2022-07-09 22:19:57,902][26022] Updated weights on worker 0-0, policy_version 444829 (0.00094) [2022-07-09 22:19:59,777][26022] Updated weights on worker 0-0, policy_version 444839 (0.00082) [2022-07-09 22:20:02,005][25689] Fps is (10 sec: 5382.1, 60 sec: 5628.7, 300 sec: 5657.6). Total num frames: 455524352. Throughput: 0: 5979.7. Samples: 455525444. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:02,006][25689] Avg episode reward: [(0, '-47.057')] [2022-07-09 22:20:02,247][26022] Updated weights on worker 0-0, policy_version 444849 (0.00096) [2022-07-09 22:20:03,594][26022] Updated weights on worker 0-0, policy_version 444859 (0.00087) [2022-07-09 22:20:05,644][26022] Updated weights on worker 0-0, policy_version 444869 (0.00086) [2022-07-09 22:20:07,009][25689] Fps is (10 sec: 5412.6, 60 sec: 5681.5, 300 sec: 5654.3). Total num frames: 455553024. Throughput: 0: 5859.5. Samples: 455557240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:07,009][25689] Avg episode reward: [(0, '-47.172')] [2022-07-09 22:20:07,278][26022] Updated weights on worker 0-0, policy_version 444879 (0.00085) [2022-07-09 22:20:09,239][26022] Updated weights on worker 0-0, policy_version 444889 (0.00084) [2022-07-09 22:20:10,954][26022] Updated weights on worker 0-0, policy_version 444899 (0.00087) [2022-07-09 22:20:12,010][25689] Fps is (10 sec: 5731.9, 60 sec: 5667.2, 300 sec: 5660.2). Total num frames: 455581696. Throughput: 0: 5850.5. Samples: 455591620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:12,010][25689] Avg episode reward: [(0, '-46.410')] [2022-07-09 22:20:12,762][26022] Updated weights on worker 0-0, policy_version 444909 (0.00091) [2022-07-09 22:20:14,502][26022] Updated weights on worker 0-0, policy_version 444919 (0.00093) [2022-07-09 22:20:16,574][26022] Updated weights on worker 0-0, policy_version 444929 (0.00094) [2022-07-09 22:20:17,097][25689] Fps is (10 sec: 5684.5, 60 sec: 5646.9, 300 sec: 5653.0). Total num frames: 455610368. Throughput: 0: 4991.0. Samples: 455608784. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:17,097][25689] Avg episode reward: [(0, '-45.877')] [2022-07-09 22:20:17,950][26022] Updated weights on worker 0-0, policy_version 444939 (0.00095) [2022-07-09 22:20:20,142][26022] Updated weights on worker 0-0, policy_version 444949 (0.00433) [2022-07-09 22:20:21,519][26022] Updated weights on worker 0-0, policy_version 444959 (0.00093) [2022-07-09 22:20:22,140][25689] Fps is (10 sec: 5762.1, 60 sec: 5684.0, 300 sec: 5656.2). Total num frames: 455640064. Throughput: 0: 5837.4. Samples: 455642974. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:22,140][25689] Avg episode reward: [(0, '-46.164')] [2022-07-09 22:20:23,724][26022] Updated weights on worker 0-0, policy_version 444969 (0.00088) [2022-07-09 22:20:25,348][26022] Updated weights on worker 0-0, policy_version 444979 (0.00092) [2022-07-09 22:20:27,090][26022] Updated weights on worker 0-0, policy_version 444989 (0.00097) [2022-07-09 22:20:27,189][25689] Fps is (10 sec: 5783.5, 60 sec: 5697.4, 300 sec: 5662.9). Total num frames: 455668736. Throughput: 0: 5943.1. Samples: 455677172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:27,189][25689] Avg episode reward: [(0, '-45.811')] [2022-07-09 22:20:29,064][26022] Updated weights on worker 0-0, policy_version 444999 (0.00086) [2022-07-09 22:20:30,736][26022] Updated weights on worker 0-0, policy_version 445009 (0.00093) [2022-07-09 22:20:32,192][25689] Fps is (10 sec: 5500.7, 60 sec: 5647.3, 300 sec: 5650.8). Total num frames: 455695360. Throughput: 0: 5075.5. Samples: 455694054. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:32,193][25689] Avg episode reward: [(0, '-46.016')] [2022-07-09 22:20:32,712][26022] Updated weights on worker 0-0, policy_version 445019 (0.00088) [2022-07-09 22:20:34,535][26022] Updated weights on worker 0-0, policy_version 445029 (0.00089) [2022-07-09 22:20:36,210][26022] Updated weights on worker 0-0, policy_version 445039 (0.00111) [2022-07-09 22:20:37,267][25689] Fps is (10 sec: 5588.3, 60 sec: 5665.5, 300 sec: 5656.5). Total num frames: 455725056. Throughput: 0: 5892.7. Samples: 455727642. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:37,267][25689] Avg episode reward: [(0, '-46.558')] [2022-07-09 22:20:38,164][26022] Updated weights on worker 0-0, policy_version 445049 (0.00090) [2022-07-09 22:20:39,687][26022] Updated weights on worker 0-0, policy_version 445059 (0.00085) [2022-07-09 22:20:41,832][26022] Updated weights on worker 0-0, policy_version 445069 (0.00087) [2022-07-09 22:20:42,272][25689] Fps is (10 sec: 5689.1, 60 sec: 5631.8, 300 sec: 5653.6). Total num frames: 455752704. Throughput: 0: 5910.9. Samples: 455761972. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:42,272][25689] Avg episode reward: [(0, '-46.346')] [2022-07-09 22:20:43,510][26022] Updated weights on worker 0-0, policy_version 445079 (0.00094) [2022-07-09 22:20:45,259][26022] Updated weights on worker 0-0, policy_version 445089 (0.00079) [2022-07-09 22:20:47,075][26022] Updated weights on worker 0-0, policy_version 445099 (0.00087) [2022-07-09 22:20:47,277][25689] Fps is (10 sec: 5728.9, 60 sec: 5648.7, 300 sec: 5654.2). Total num frames: 455782400. Throughput: 0: 5087.7. Samples: 455779372. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:47,277][25689] Avg episode reward: [(0, '-46.466')] [2022-07-09 22:20:48,680][26022] Updated weights on worker 0-0, policy_version 445109 (0.00090) [2022-07-09 22:20:50,766][26022] Updated weights on worker 0-0, policy_version 445119 (0.00103) [2022-07-09 22:20:52,352][25689] Fps is (10 sec: 5891.7, 60 sec: 5659.6, 300 sec: 5657.7). Total num frames: 455812096. Throughput: 0: 5951.5. Samples: 455814038. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:52,353][25689] Avg episode reward: [(0, '-45.895')] [2022-07-09 22:20:52,358][26022] Updated weights on worker 0-0, policy_version 445129 (0.00084) [2022-07-09 22:20:54,371][26022] Updated weights on worker 0-0, policy_version 445139 (0.00087) [2022-07-09 22:20:55,912][26022] Updated weights on worker 0-0, policy_version 445149 (0.00083) [2022-07-09 22:20:57,453][25689] Fps is (10 sec: 5634.8, 60 sec: 5638.9, 300 sec: 5652.9). Total num frames: 455839744. Throughput: 0: 5967.7. Samples: 455848108. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:20:57,454][25689] Avg episode reward: [(0, '-46.616')] [2022-07-09 22:20:58,028][26022] Updated weights on worker 0-0, policy_version 445159 (0.00092) [2022-07-09 22:20:59,594][26022] Updated weights on worker 0-0, policy_version 445169 (0.00085) [2022-07-09 22:21:01,486][26022] Updated weights on worker 0-0, policy_version 445179 (0.00091) [2022-07-09 22:21:02,461][25689] Fps is (10 sec: 5470.3, 60 sec: 5674.3, 300 sec: 5657.0). Total num frames: 455867392. Throughput: 0: 5108.4. Samples: 455865106. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:21:02,462][25689] Avg episode reward: [(0, '-46.082')] [2022-07-09 22:21:03,627][26022] Updated weights on worker 0-0, policy_version 445189 (0.00082) [2022-07-09 22:21:05,433][26022] Updated weights on worker 0-0, policy_version 445199 (0.00086) [2022-07-09 22:21:07,344][26022] Updated weights on worker 0-0, policy_version 445209 (0.00086) [2022-07-09 22:21:07,466][25689] Fps is (10 sec: 5624.9, 60 sec: 5674.1, 300 sec: 5658.2). Total num frames: 455896064. Throughput: 0: 5820.8. Samples: 455896890. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-09 22:21:07,466][25689] Avg episode reward: [(0, '-46.182')] [2022-07-09 22:21:09,320][26022] Updated weights on worker 0-0, policy_version 445219 (0.00084) [2022-07-09 22:21:10,755][26022] Updated weights on worker 0-0, policy_version 445229 (0.00082) [2022-07-09 22:21:12,518][25689] Fps is (10 sec: 5396.3, 60 sec: 5618.5, 300 sec: 5645.0). Total num frames: 455921664. Throughput: 0: 5788.9. Samples: 455930776. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:12,519][25689] Avg episode reward: [(0, '-45.989')] [2022-07-09 22:21:12,981][26022] Updated weights on worker 0-0, policy_version 445239 (0.00085) [2022-07-09 22:21:14,423][26022] Updated weights on worker 0-0, policy_version 445249 (0.00084) [2022-07-09 22:21:16,257][26022] Updated weights on worker 0-0, policy_version 445259 (0.00094) [2022-07-09 22:21:17,600][25689] Fps is (10 sec: 5658.4, 60 sec: 5669.7, 300 sec: 5661.4). Total num frames: 455953408. Throughput: 0: 4960.1. Samples: 455948038. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:17,601][25689] Avg episode reward: [(0, '-46.193')] [2022-07-09 22:21:17,974][26022] Updated weights on worker 0-0, policy_version 445269 (0.00088) [2022-07-09 22:21:19,953][26022] Updated weights on worker 0-0, policy_version 445279 (0.00085) [2022-07-09 22:21:21,548][26022] Updated weights on worker 0-0, policy_version 445289 (0.00085) [2022-07-09 22:21:22,678][25689] Fps is (10 sec: 5845.6, 60 sec: 5632.6, 300 sec: 5656.6). Total num frames: 455981056. Throughput: 0: 5789.8. Samples: 455982160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:22,679][25689] Avg episode reward: [(0, '-46.615')] [2022-07-09 22:21:23,642][26022] Updated weights on worker 0-0, policy_version 445299 (0.00087) [2022-07-09 22:21:25,327][26022] Updated weights on worker 0-0, policy_version 445309 (0.00096) [2022-07-09 22:21:27,302][26022] Updated weights on worker 0-0, policy_version 445319 (0.00619) [2022-07-09 22:21:27,680][25689] Fps is (10 sec: 5587.5, 60 sec: 5637.1, 300 sec: 5653.2). Total num frames: 456009728. Throughput: 0: 5896.8. Samples: 456016086. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:27,680][25689] Avg episode reward: [(0, '-46.209')] [2022-07-09 22:21:28,755][26022] Updated weights on worker 0-0, policy_version 445329 (0.00089) [2022-07-09 22:21:30,931][26022] Updated weights on worker 0-0, policy_version 445339 (0.00103) [2022-07-09 22:21:32,411][26022] Updated weights on worker 0-0, policy_version 445349 (0.00088) [2022-07-09 22:21:32,741][25689] Fps is (10 sec: 5596.9, 60 sec: 5648.6, 300 sec: 5651.2). Total num frames: 456037376. Throughput: 0: 5063.7. Samples: 456033178. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:32,743][25689] Avg episode reward: [(0, '-46.284')] [2022-07-09 22:21:34,539][26022] Updated weights on worker 0-0, policy_version 445359 (0.00084) [2022-07-09 22:21:36,234][26022] Updated weights on worker 0-0, policy_version 445369 (0.01064) [2022-07-09 22:21:36,768][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:21:36,777][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000445372_456060928.pth [2022-07-09 22:21:36,791][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000443383_454024192.pth [2022-07-09 22:21:37,787][25689] Fps is (10 sec: 5471.1, 60 sec: 5617.5, 300 sec: 5650.4). Total num frames: 456065024. Throughput: 0: 5882.5. Samples: 456066786. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:37,787][25689] Avg episode reward: [(0, '-46.859')] [2022-07-09 22:21:37,993][26022] Updated weights on worker 0-0, policy_version 445379 (0.00084) [2022-07-09 22:21:39,941][26022] Updated weights on worker 0-0, policy_version 445389 (0.00080) [2022-07-09 22:21:41,683][26022] Updated weights on worker 0-0, policy_version 445399 (0.00093) [2022-07-09 22:21:42,811][25689] Fps is (10 sec: 5694.7, 60 sec: 5649.5, 300 sec: 5654.6). Total num frames: 456094720. Throughput: 0: 5884.2. Samples: 456100624. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:42,811][25689] Avg episode reward: [(0, '-46.467')] [2022-07-09 22:21:43,611][26022] Updated weights on worker 0-0, policy_version 445409 (0.00086) [2022-07-09 22:21:45,161][26022] Updated weights on worker 0-0, policy_version 445419 (0.00091) [2022-07-09 22:21:47,223][26022] Updated weights on worker 0-0, policy_version 445429 (0.00084) [2022-07-09 22:21:47,823][25689] Fps is (10 sec: 5816.0, 60 sec: 5632.0, 300 sec: 5649.2). Total num frames: 456123392. Throughput: 0: 5043.7. Samples: 456117680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:47,823][25689] Avg episode reward: [(0, '-46.514')] [2022-07-09 22:21:48,828][26022] Updated weights on worker 0-0, policy_version 445439 (0.00091) [2022-07-09 22:21:50,703][26022] Updated weights on worker 0-0, policy_version 445449 (0.00087) [2022-07-09 22:21:52,305][26022] Updated weights on worker 0-0, policy_version 445459 (0.00087) [2022-07-09 22:21:52,856][25689] Fps is (10 sec: 5708.5, 60 sec: 5619.0, 300 sec: 5654.1). Total num frames: 456152064. Throughput: 0: 5913.2. Samples: 456152122. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:52,856][25689] Avg episode reward: [(0, '-46.254')] [2022-07-09 22:21:54,163][26022] Updated weights on worker 0-0, policy_version 445469 (0.00082) [2022-07-09 22:21:56,058][26022] Updated weights on worker 0-0, policy_version 445479 (0.00091) [2022-07-09 22:21:57,650][26022] Updated weights on worker 0-0, policy_version 445489 (0.00087) [2022-07-09 22:21:57,968][25689] Fps is (10 sec: 5753.1, 60 sec: 5651.8, 300 sec: 5655.6). Total num frames: 456181760. Throughput: 0: 5935.5. Samples: 456186572. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:21:57,969][25689] Avg episode reward: [(0, '-47.115')] [2022-07-09 22:21:59,704][26022] Updated weights on worker 0-0, policy_version 445499 (0.00089) [2022-07-09 22:22:01,363][26022] Updated weights on worker 0-0, policy_version 445509 (0.00083) [2022-07-09 22:22:03,004][25689] Fps is (10 sec: 5449.1, 60 sec: 5615.3, 300 sec: 5648.6). Total num frames: 456207360. Throughput: 0: 5111.3. Samples: 456203836. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:03,004][25689] Avg episode reward: [(0, '-46.986')] [2022-07-09 22:22:03,556][26022] Updated weights on worker 0-0, policy_version 445519 (0.00091) [2022-07-09 22:22:05,542][26022] Updated weights on worker 0-0, policy_version 445529 (0.00086) [2022-07-09 22:22:07,034][26022] Updated weights on worker 0-0, policy_version 445539 (0.00083) [2022-07-09 22:22:08,052][25689] Fps is (10 sec: 5381.8, 60 sec: 5611.3, 300 sec: 5648.3). Total num frames: 456236032. Throughput: 0: 5851.5. Samples: 456236054. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:08,053][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 22:22:09,063][26022] Updated weights on worker 0-0, policy_version 445549 (0.00436) [2022-07-09 22:22:10,667][26022] Updated weights on worker 0-0, policy_version 445559 (0.00082) [2022-07-09 22:22:12,584][26022] Updated weights on worker 0-0, policy_version 445569 (0.00093) [2022-07-09 22:22:13,075][25689] Fps is (10 sec: 5795.6, 60 sec: 5681.7, 300 sec: 5653.2). Total num frames: 456265728. Throughput: 0: 5858.7. Samples: 456270578. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:13,075][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 22:22:14,166][26022] Updated weights on worker 0-0, policy_version 445579 (0.00087) [2022-07-09 22:22:15,873][26022] Updated weights on worker 0-0, policy_version 445589 (0.00090) [2022-07-09 22:22:17,770][26022] Updated weights on worker 0-0, policy_version 445599 (0.00092) [2022-07-09 22:22:18,205][25689] Fps is (10 sec: 5849.8, 60 sec: 5643.4, 300 sec: 5657.9). Total num frames: 456295424. Throughput: 0: 5015.6. Samples: 456288074. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:18,206][25689] Avg episode reward: [(0, '-46.373')] [2022-07-09 22:22:19,487][26022] Updated weights on worker 0-0, policy_version 445609 (0.00087) [2022-07-09 22:22:21,287][26022] Updated weights on worker 0-0, policy_version 445619 (0.00084) [2022-07-09 22:22:23,212][25689] Fps is (10 sec: 5656.8, 60 sec: 5650.0, 300 sec: 5648.9). Total num frames: 456323072. Throughput: 0: 5877.7. Samples: 456322614. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:23,213][25689] Avg episode reward: [(0, '-44.776')] [2022-07-09 22:22:23,291][26022] Updated weights on worker 0-0, policy_version 445629 (0.00094) [2022-07-09 22:22:24,748][26022] Updated weights on worker 0-0, policy_version 445639 (0.00083) [2022-07-09 22:22:26,795][26022] Updated weights on worker 0-0, policy_version 445649 (0.00091) [2022-07-09 22:22:28,256][25689] Fps is (10 sec: 5705.4, 60 sec: 5662.9, 300 sec: 5656.7). Total num frames: 456352768. Throughput: 0: 5985.6. Samples: 456356986. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:28,257][25689] Avg episode reward: [(0, '-44.590')] [2022-07-09 22:22:28,346][26022] Updated weights on worker 0-0, policy_version 445659 (0.00086) [2022-07-09 22:22:30,471][26022] Updated weights on worker 0-0, policy_version 445669 (0.00092) [2022-07-09 22:22:32,031][26022] Updated weights on worker 0-0, policy_version 445679 (0.00096) [2022-07-09 22:22:33,259][25689] Fps is (10 sec: 5707.5, 60 sec: 5668.4, 300 sec: 5657.5). Total num frames: 456380416. Throughput: 0: 5129.4. Samples: 456374112. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:33,260][25689] Avg episode reward: [(0, '-44.539')] [2022-07-09 22:22:33,951][26022] Updated weights on worker 0-0, policy_version 445689 (0.00084) [2022-07-09 22:22:35,787][26022] Updated weights on worker 0-0, policy_version 445699 (0.00098) [2022-07-09 22:22:37,422][26022] Updated weights on worker 0-0, policy_version 445709 (0.00088) [2022-07-09 22:22:38,384][25689] Fps is (10 sec: 5661.9, 60 sec: 5694.8, 300 sec: 5655.2). Total num frames: 456410112. Throughput: 0: 5956.8. Samples: 456408276. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:38,384][25689] Avg episode reward: [(0, '-44.217')] [2022-07-09 22:22:39,518][26022] Updated weights on worker 0-0, policy_version 445719 (0.00085) [2022-07-09 22:22:41,026][26022] Updated weights on worker 0-0, policy_version 445729 (0.00090) [2022-07-09 22:22:42,968][26022] Updated weights on worker 0-0, policy_version 445739 (0.00083) [2022-07-09 22:22:43,393][25689] Fps is (10 sec: 5860.8, 60 sec: 5696.2, 300 sec: 5658.8). Total num frames: 456439808. Throughput: 0: 5946.9. Samples: 456442628. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:43,394][25689] Avg episode reward: [(0, '-43.905')] [2022-07-09 22:22:44,616][26022] Updated weights on worker 0-0, policy_version 445749 (0.00083) [2022-07-09 22:22:46,591][26022] Updated weights on worker 0-0, policy_version 445759 (0.00086) [2022-07-09 22:22:48,286][26022] Updated weights on worker 0-0, policy_version 445769 (0.00103) [2022-07-09 22:22:48,435][25689] Fps is (10 sec: 5807.1, 60 sec: 5693.3, 300 sec: 5658.2). Total num frames: 456468480. Throughput: 0: 5938.7. Samples: 456476824. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:48,436][25689] Avg episode reward: [(0, '-44.340')] [2022-07-09 22:22:50,138][26022] Updated weights on worker 0-0, policy_version 445779 (0.00088) [2022-07-09 22:22:51,901][26022] Updated weights on worker 0-0, policy_version 445789 (0.00090) [2022-07-09 22:22:53,450][25689] Fps is (10 sec: 5600.2, 60 sec: 5678.2, 300 sec: 5652.4). Total num frames: 456496128. Throughput: 0: 5943.0. Samples: 456494104. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:53,450][25689] Avg episode reward: [(0, '-46.056')] [2022-07-09 22:22:53,669][26022] Updated weights on worker 0-0, policy_version 445799 (0.00086) [2022-07-09 22:22:55,593][26022] Updated weights on worker 0-0, policy_version 445809 (0.00082) [2022-07-09 22:22:57,307][26022] Updated weights on worker 0-0, policy_version 445819 (0.00088) [2022-07-09 22:22:58,506][25689] Fps is (10 sec: 5592.5, 60 sec: 5666.6, 300 sec: 5661.7). Total num frames: 456524800. Throughput: 0: 5952.9. Samples: 456528058. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:22:58,506][25689] Avg episode reward: [(0, '-46.051')] [2022-07-09 22:22:59,071][26022] Updated weights on worker 0-0, policy_version 445829 (0.00093) [2022-07-09 22:23:00,847][26022] Updated weights on worker 0-0, policy_version 445839 (0.00093) [2022-07-09 22:23:03,059][26022] Updated weights on worker 0-0, policy_version 445849 (0.00087) [2022-07-09 22:23:03,550][25689] Fps is (10 sec: 5373.4, 60 sec: 5665.8, 300 sec: 5658.6). Total num frames: 456550400. Throughput: 0: 5821.5. Samples: 456559972. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:03,551][25689] Avg episode reward: [(0, '-45.826')] [2022-07-09 22:23:04,842][26022] Updated weights on worker 0-0, policy_version 445859 (0.00085) [2022-07-09 22:23:06,836][26022] Updated weights on worker 0-0, policy_version 445869 (0.00084) [2022-07-09 22:23:08,379][26022] Updated weights on worker 0-0, policy_version 445879 (0.00096) [2022-07-09 22:23:08,577][25689] Fps is (10 sec: 5490.4, 60 sec: 5684.7, 300 sec: 5658.7). Total num frames: 456580096. Throughput: 0: 4980.4. Samples: 456577140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:08,578][25689] Avg episode reward: [(0, '-45.554')] [2022-07-09 22:23:10,428][26022] Updated weights on worker 0-0, policy_version 445889 (0.00087) [2022-07-09 22:23:12,278][26022] Updated weights on worker 0-0, policy_version 445899 (0.00095) [2022-07-09 22:23:13,601][25689] Fps is (10 sec: 5807.0, 60 sec: 5667.6, 300 sec: 5655.9). Total num frames: 456608768. Throughput: 0: 5823.6. Samples: 456611458. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:13,602][25689] Avg episode reward: [(0, '-45.741')] [2022-07-09 22:23:13,808][26022] Updated weights on worker 0-0, policy_version 445909 (0.00090) [2022-07-09 22:23:15,622][26022] Updated weights on worker 0-0, policy_version 445919 (0.00093) [2022-07-09 22:23:17,438][26022] Updated weights on worker 0-0, policy_version 445929 (0.00090) [2022-07-09 22:23:18,685][25689] Fps is (10 sec: 5876.0, 60 sec: 5688.9, 300 sec: 5665.0). Total num frames: 456639488. Throughput: 0: 5861.4. Samples: 456646336. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:18,685][25689] Avg episode reward: [(0, '-45.949')] [2022-07-09 22:23:19,238][26022] Updated weights on worker 0-0, policy_version 445939 (0.00091) [2022-07-09 22:23:21,003][26022] Updated weights on worker 0-0, policy_version 445949 (0.00085) [2022-07-09 22:23:22,729][26022] Updated weights on worker 0-0, policy_version 445959 (0.00087) [2022-07-09 22:23:23,725][25689] Fps is (10 sec: 5765.3, 60 sec: 5685.8, 300 sec: 5664.5). Total num frames: 456667136. Throughput: 0: 5144.1. Samples: 456663756. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:23,726][25689] Avg episode reward: [(0, '-45.825')] [2022-07-09 22:23:24,423][26022] Updated weights on worker 0-0, policy_version 445969 (0.00087) [2022-07-09 22:23:26,241][26022] Updated weights on worker 0-0, policy_version 445979 (0.00080) [2022-07-09 22:23:27,992][26022] Updated weights on worker 0-0, policy_version 445989 (0.00077) [2022-07-09 22:23:28,751][25689] Fps is (10 sec: 5595.0, 60 sec: 5670.6, 300 sec: 5660.6). Total num frames: 456695808. Throughput: 0: 6011.2. Samples: 456698408. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:28,751][25689] Avg episode reward: [(0, '-46.013')] [2022-07-09 22:23:29,886][26022] Updated weights on worker 0-0, policy_version 445999 (0.00090) [2022-07-09 22:23:31,668][26022] Updated weights on worker 0-0, policy_version 446009 (0.00089) [2022-07-09 22:23:33,294][26022] Updated weights on worker 0-0, policy_version 446019 (0.00086) [2022-07-09 22:23:33,804][25689] Fps is (10 sec: 5791.2, 60 sec: 5699.7, 300 sec: 5667.4). Total num frames: 456725504. Throughput: 0: 5997.8. Samples: 456732630. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:33,805][25689] Avg episode reward: [(0, '-46.940')] [2022-07-09 22:23:35,319][26022] Updated weights on worker 0-0, policy_version 446029 (0.00087) [2022-07-09 22:23:36,768][26022] Updated weights on worker 0-0, policy_version 446039 (0.00098) [2022-07-09 22:23:36,994][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:23:37,008][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000446040_456744960.pth [2022-07-09 22:23:37,009][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000444046_454703104.pth [2022-07-09 22:23:38,858][25689] Fps is (10 sec: 5673.5, 60 sec: 5672.5, 300 sec: 5659.7). Total num frames: 456753152. Throughput: 0: 5127.2. Samples: 456749770. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:38,859][25689] Avg episode reward: [(0, '-46.858')] [2022-07-09 22:23:38,996][26022] Updated weights on worker 0-0, policy_version 446049 (0.00088) [2022-07-09 22:23:40,482][26022] Updated weights on worker 0-0, policy_version 446059 (0.00085) [2022-07-09 22:23:42,681][26022] Updated weights on worker 0-0, policy_version 446069 (0.00088) [2022-07-09 22:23:43,946][25689] Fps is (10 sec: 5654.3, 60 sec: 5665.1, 300 sec: 5662.8). Total num frames: 456782848. Throughput: 0: 5939.3. Samples: 456783850. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:43,947][25689] Avg episode reward: [(0, '-47.275')] [2022-07-09 22:23:44,223][26022] Updated weights on worker 0-0, policy_version 446079 (0.00085) [2022-07-09 22:23:46,002][26022] Updated weights on worker 0-0, policy_version 446089 (0.00090) [2022-07-09 22:23:47,587][26022] Updated weights on worker 0-0, policy_version 446099 (0.00085) [2022-07-09 22:23:48,990][25689] Fps is (10 sec: 5760.8, 60 sec: 5664.9, 300 sec: 5662.2). Total num frames: 456811520. Throughput: 0: 5945.5. Samples: 456818740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:48,991][25689] Avg episode reward: [(0, '-46.014')] [2022-07-09 22:23:49,564][26022] Updated weights on worker 0-0, policy_version 446109 (0.00090) [2022-07-09 22:23:51,319][26022] Updated weights on worker 0-0, policy_version 446119 (0.00087) [2022-07-09 22:23:53,347][26022] Updated weights on worker 0-0, policy_version 446129 (0.00082) [2022-07-09 22:23:54,012][25689] Fps is (10 sec: 5798.2, 60 sec: 5698.0, 300 sec: 5667.3). Total num frames: 456841216. Throughput: 0: 5098.1. Samples: 456835654. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-09 22:23:54,013][25689] Avg episode reward: [(0, '-45.415')] [2022-07-09 22:23:54,868][26022] Updated weights on worker 0-0, policy_version 446139 (0.00094) [2022-07-09 22:23:56,654][26022] Updated weights on worker 0-0, policy_version 446149 (0.00084) [2022-07-09 22:23:58,366][26022] Updated weights on worker 0-0, policy_version 446159 (0.00088) [2022-07-09 22:23:59,077][25689] Fps is (10 sec: 5786.8, 60 sec: 5697.3, 300 sec: 5666.1). Total num frames: 456869888. Throughput: 0: 5959.2. Samples: 456870254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:23:59,077][25689] Avg episode reward: [(0, '-44.982')] [2022-07-09 22:24:00,493][26022] Updated weights on worker 0-0, policy_version 446169 (0.00085) [2022-07-09 22:24:02,491][26022] Updated weights on worker 0-0, policy_version 446179 (0.00092) [2022-07-09 22:24:04,146][25689] Fps is (10 sec: 5456.5, 60 sec: 5711.8, 300 sec: 5668.7). Total num frames: 456896512. Throughput: 0: 5875.6. Samples: 456902540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:04,147][25689] Avg episode reward: [(0, '-44.057')] [2022-07-09 22:24:04,236][26022] Updated weights on worker 0-0, policy_version 446189 (0.00085) [2022-07-09 22:24:06,077][26022] Updated weights on worker 0-0, policy_version 446199 (0.00085) [2022-07-09 22:24:07,788][26022] Updated weights on worker 0-0, policy_version 446209 (0.00085) [2022-07-09 22:24:09,154][25689] Fps is (10 sec: 5487.1, 60 sec: 5696.7, 300 sec: 5665.7). Total num frames: 456925184. Throughput: 0: 5008.7. Samples: 456919736. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:09,155][25689] Avg episode reward: [(0, '-43.344')] [2022-07-09 22:24:09,655][26022] Updated weights on worker 0-0, policy_version 446219 (0.00086) [2022-07-09 22:24:11,389][26022] Updated weights on worker 0-0, policy_version 446229 (0.00096) [2022-07-09 22:24:13,171][26022] Updated weights on worker 0-0, policy_version 446239 (0.00087) [2022-07-09 22:24:14,183][25689] Fps is (10 sec: 5713.1, 60 sec: 5696.2, 300 sec: 5662.6). Total num frames: 456953856. Throughput: 0: 5873.3. Samples: 456954126. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:14,185][25689] Avg episode reward: [(0, '-43.003')] [2022-07-09 22:24:15,087][26022] Updated weights on worker 0-0, policy_version 446249 (0.00082) [2022-07-09 22:24:16,798][26022] Updated weights on worker 0-0, policy_version 446259 (0.00089) [2022-07-09 22:24:18,676][26022] Updated weights on worker 0-0, policy_version 446269 (0.00087) [2022-07-09 22:24:19,257][25689] Fps is (10 sec: 5777.4, 60 sec: 5680.2, 300 sec: 5669.6). Total num frames: 456983552. Throughput: 0: 5866.4. Samples: 456988640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:19,257][25689] Avg episode reward: [(0, '-42.755')] [2022-07-09 22:24:20,391][26022] Updated weights on worker 0-0, policy_version 446279 (0.00094) [2022-07-09 22:24:22,196][26022] Updated weights on worker 0-0, policy_version 446289 (0.00088) [2022-07-09 22:24:23,867][26022] Updated weights on worker 0-0, policy_version 446299 (0.00087) [2022-07-09 22:24:24,261][25689] Fps is (10 sec: 5791.8, 60 sec: 5700.6, 300 sec: 5673.2). Total num frames: 457012224. Throughput: 0: 5141.8. Samples: 457005966. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:24,262][25689] Avg episode reward: [(0, '-42.636')] [2022-07-09 22:24:25,694][26022] Updated weights on worker 0-0, policy_version 446309 (0.00091) [2022-07-09 22:24:27,392][26022] Updated weights on worker 0-0, policy_version 446319 (0.00095) [2022-07-09 22:24:29,286][26022] Updated weights on worker 0-0, policy_version 446329 (0.00085) [2022-07-09 22:24:29,302][25689] Fps is (10 sec: 5606.6, 60 sec: 5682.2, 300 sec: 5665.7). Total num frames: 457039872. Throughput: 0: 5999.1. Samples: 457040606. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:29,303][25689] Avg episode reward: [(0, '-43.315')] [2022-07-09 22:24:30,796][26022] Updated weights on worker 0-0, policy_version 446339 (0.00096) [2022-07-09 22:24:32,779][26022] Updated weights on worker 0-0, policy_version 446349 (0.00078) [2022-07-09 22:24:34,326][25689] Fps is (10 sec: 5799.0, 60 sec: 5701.8, 300 sec: 5673.8). Total num frames: 457070592. Throughput: 0: 6027.7. Samples: 457075540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:34,327][25689] Avg episode reward: [(0, '-43.609')] [2022-07-09 22:24:34,510][26022] Updated weights on worker 0-0, policy_version 446359 (0.00090) [2022-07-09 22:24:36,387][26022] Updated weights on worker 0-0, policy_version 446369 (0.00079) [2022-07-09 22:24:37,933][26022] Updated weights on worker 0-0, policy_version 446379 (0.00089) [2022-07-09 22:24:39,374][25689] Fps is (10 sec: 5896.8, 60 sec: 5719.4, 300 sec: 5669.6). Total num frames: 457099264. Throughput: 0: 5182.1. Samples: 457092890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:39,376][25689] Avg episode reward: [(0, '-43.983')] [2022-07-09 22:24:39,796][26022] Updated weights on worker 0-0, policy_version 446389 (0.00095) [2022-07-09 22:24:41,547][26022] Updated weights on worker 0-0, policy_version 446399 (0.00087) [2022-07-09 22:24:43,487][26022] Updated weights on worker 0-0, policy_version 446409 (0.00079) [2022-07-09 22:24:44,379][25689] Fps is (10 sec: 5806.0, 60 sec: 5727.2, 300 sec: 5673.0). Total num frames: 457128960. Throughput: 0: 6030.7. Samples: 457127292. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:44,381][25689] Avg episode reward: [(0, '-44.483')] [2022-07-09 22:24:45,150][26022] Updated weights on worker 0-0, policy_version 446419 (0.00085) [2022-07-09 22:24:46,792][26022] Updated weights on worker 0-0, policy_version 446429 (0.00091) [2022-07-09 22:24:48,678][26022] Updated weights on worker 0-0, policy_version 446439 (0.00089) [2022-07-09 22:24:49,401][25689] Fps is (10 sec: 5719.0, 60 sec: 5712.3, 300 sec: 5669.4). Total num frames: 457156608. Throughput: 0: 6048.3. Samples: 457162170. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:49,402][25689] Avg episode reward: [(0, '-45.528')] [2022-07-09 22:24:50,246][26022] Updated weights on worker 0-0, policy_version 446449 (0.00089) [2022-07-09 22:24:52,364][26022] Updated weights on worker 0-0, policy_version 446459 (0.00085) [2022-07-09 22:24:53,886][26022] Updated weights on worker 0-0, policy_version 446469 (0.00092) [2022-07-09 22:24:54,422][25689] Fps is (10 sec: 5710.3, 60 sec: 5712.5, 300 sec: 5673.6). Total num frames: 457186304. Throughput: 0: 5186.4. Samples: 457179762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:54,423][25689] Avg episode reward: [(0, '-46.298')] [2022-07-09 22:24:55,685][26022] Updated weights on worker 0-0, policy_version 446479 (0.00090) [2022-07-09 22:24:57,441][26022] Updated weights on worker 0-0, policy_version 446489 (0.00092) [2022-07-09 22:24:59,321][26022] Updated weights on worker 0-0, policy_version 446499 (0.00086) [2022-07-09 22:24:59,572][25689] Fps is (10 sec: 5839.4, 60 sec: 5721.3, 300 sec: 5684.9). Total num frames: 457216000. Throughput: 0: 6021.8. Samples: 457214518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:24:59,573][25689] Avg episode reward: [(0, '-45.602')] [2022-07-09 22:25:01,140][26022] Updated weights on worker 0-0, policy_version 446509 (0.00087) [2022-07-09 22:25:03,305][26022] Updated weights on worker 0-0, policy_version 446519 (0.00084) [2022-07-09 22:25:04,581][25689] Fps is (10 sec: 5543.7, 60 sec: 5727.1, 300 sec: 5678.0). Total num frames: 457242624. Throughput: 0: 5908.0. Samples: 457246642. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:04,581][25689] Avg episode reward: [(0, '-45.576')] [2022-07-09 22:25:05,000][26022] Updated weights on worker 0-0, policy_version 446529 (0.00096) [2022-07-09 22:25:07,019][26022] Updated weights on worker 0-0, policy_version 446539 (0.00087) [2022-07-09 22:25:08,691][26022] Updated weights on worker 0-0, policy_version 446549 (0.00089) [2022-07-09 22:25:09,597][25689] Fps is (10 sec: 5515.6, 60 sec: 5726.2, 300 sec: 5677.7). Total num frames: 457271296. Throughput: 0: 5869.0. Samples: 457280702. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:09,598][25689] Avg episode reward: [(0, '-44.331')] [2022-07-09 22:25:10,675][26022] Updated weights on worker 0-0, policy_version 446559 (0.00091) [2022-07-09 22:25:12,174][26022] Updated weights on worker 0-0, policy_version 446569 (0.00077) [2022-07-09 22:25:14,025][26022] Updated weights on worker 0-0, policy_version 446579 (0.00051) [2022-07-09 22:25:14,684][25689] Fps is (10 sec: 5777.1, 60 sec: 5737.7, 300 sec: 5681.1). Total num frames: 457300992. Throughput: 0: 5848.1. Samples: 457298260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:14,685][25689] Avg episode reward: [(0, '-43.789')] [2022-07-09 22:25:15,813][26022] Updated weights on worker 0-0, policy_version 446589 (0.00085) [2022-07-09 22:25:17,643][26022] Updated weights on worker 0-0, policy_version 446599 (0.00090) [2022-07-09 22:25:19,420][26022] Updated weights on worker 0-0, policy_version 446609 (0.00095) [2022-07-09 22:25:19,784][25689] Fps is (10 sec: 5830.5, 60 sec: 5735.2, 300 sec: 5680.0). Total num frames: 457330688. Throughput: 0: 5855.8. Samples: 457332874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:19,785][25689] Avg episode reward: [(0, '-44.446')] [2022-07-09 22:25:21,059][26022] Updated weights on worker 0-0, policy_version 446619 (0.00091) [2022-07-09 22:25:22,931][26022] Updated weights on worker 0-0, policy_version 446629 (0.00087) [2022-07-09 22:25:24,708][26022] Updated weights on worker 0-0, policy_version 446639 (0.00090) [2022-07-09 22:25:24,854][25689] Fps is (10 sec: 5739.0, 60 sec: 5728.9, 300 sec: 5679.6). Total num frames: 457359360. Throughput: 0: 5970.2. Samples: 457367680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:24,855][25689] Avg episode reward: [(0, '-44.324')] [2022-07-09 22:25:26,486][26022] Updated weights on worker 0-0, policy_version 446649 (0.00094) [2022-07-09 22:25:28,128][26022] Updated weights on worker 0-0, policy_version 446659 (0.00086) [2022-07-09 22:25:29,906][25689] Fps is (10 sec: 5564.0, 60 sec: 5727.9, 300 sec: 5682.1). Total num frames: 457387008. Throughput: 0: 5116.8. Samples: 457384618. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:29,907][25689] Avg episode reward: [(0, '-43.666')] [2022-07-09 22:25:30,056][26022] Updated weights on worker 0-0, policy_version 446669 (0.00094) [2022-07-09 22:25:31,628][26022] Updated weights on worker 0-0, policy_version 446679 (0.00092) [2022-07-09 22:25:33,560][26022] Updated weights on worker 0-0, policy_version 446689 (0.00089) [2022-07-09 22:25:34,933][25689] Fps is (10 sec: 5791.4, 60 sec: 5727.7, 300 sec: 5686.5). Total num frames: 457417728. Throughput: 0: 5972.7. Samples: 457419200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:34,933][25689] Avg episode reward: [(0, '-44.544')] [2022-07-09 22:25:35,395][26022] Updated weights on worker 0-0, policy_version 446699 (0.00097) [2022-07-09 22:25:37,057][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:25:37,067][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000446709_457430016.pth [2022-07-09 22:25:37,068][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000444710_455383040.pth [2022-07-09 22:25:37,075][26022] Updated weights on worker 0-0, policy_version 446709 (0.00085) [2022-07-09 22:25:38,973][26022] Updated weights on worker 0-0, policy_version 446719 (0.00093) [2022-07-09 22:25:39,972][25689] Fps is (10 sec: 5798.4, 60 sec: 5711.6, 300 sec: 5685.8). Total num frames: 457445376. Throughput: 0: 5964.4. Samples: 457453286. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:39,973][25689] Avg episode reward: [(0, '-44.352')] [2022-07-09 22:25:40,577][26022] Updated weights on worker 0-0, policy_version 446729 (0.00080) [2022-07-09 22:25:42,694][26022] Updated weights on worker 0-0, policy_version 446739 (0.00081) [2022-07-09 22:25:44,195][26022] Updated weights on worker 0-0, policy_version 446749 (0.00087) [2022-07-09 22:25:44,987][25689] Fps is (10 sec: 5500.1, 60 sec: 5676.9, 300 sec: 5678.8). Total num frames: 457473024. Throughput: 0: 5108.5. Samples: 457470524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:44,987][25689] Avg episode reward: [(0, '-44.005')] [2022-07-09 22:25:45,936][26022] Updated weights on worker 0-0, policy_version 446759 (0.00086) [2022-07-09 22:25:48,099][26022] Updated weights on worker 0-0, policy_version 446769 (0.00089) [2022-07-09 22:25:49,573][26022] Updated weights on worker 0-0, policy_version 446779 (0.00086) [2022-07-09 22:25:49,994][25689] Fps is (10 sec: 5824.3, 60 sec: 5729.0, 300 sec: 5683.5). Total num frames: 457503744. Throughput: 0: 6012.2. Samples: 457505388. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:49,994][25689] Avg episode reward: [(0, '-43.772')] [2022-07-09 22:25:51,482][26022] Updated weights on worker 0-0, policy_version 446789 (0.00086) [2022-07-09 22:25:53,139][26022] Updated weights on worker 0-0, policy_version 446799 (0.00092) [2022-07-09 22:25:54,943][26022] Updated weights on worker 0-0, policy_version 446809 (0.00085) [2022-07-09 22:25:55,001][25689] Fps is (10 sec: 5930.7, 60 sec: 5713.4, 300 sec: 5688.7). Total num frames: 457532416. Throughput: 0: 6011.6. Samples: 457539840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:25:55,001][25689] Avg episode reward: [(0, '-43.628')] [2022-07-09 22:25:56,773][26022] Updated weights on worker 0-0, policy_version 446819 (0.00083) [2022-07-09 22:25:58,665][26022] Updated weights on worker 0-0, policy_version 446829 (0.00082) [2022-07-09 22:26:00,171][25689] Fps is (10 sec: 5634.5, 60 sec: 5694.6, 300 sec: 5689.1). Total num frames: 457561088. Throughput: 0: 5137.4. Samples: 457557056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:00,173][25689] Avg episode reward: [(0, '-43.741')] [2022-07-09 22:26:00,473][26022] Updated weights on worker 0-0, policy_version 446839 (0.00077) [2022-07-09 22:26:02,205][26022] Updated weights on worker 0-0, policy_version 446849 (0.00080) [2022-07-09 22:26:04,277][26022] Updated weights on worker 0-0, policy_version 446859 (0.00052) [2022-07-09 22:26:05,246][25689] Fps is (10 sec: 5497.1, 60 sec: 5705.2, 300 sec: 5684.3). Total num frames: 457588736. Throughput: 0: 5880.3. Samples: 457589658. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:05,247][25689] Avg episode reward: [(0, '-44.149')] [2022-07-09 22:26:05,928][26022] Updated weights on worker 0-0, policy_version 446869 (0.00087) [2022-07-09 22:26:07,844][26022] Updated weights on worker 0-0, policy_version 446879 (0.00088) [2022-07-09 22:26:09,670][26022] Updated weights on worker 0-0, policy_version 446889 (0.00101) [2022-07-09 22:26:10,270][25689] Fps is (10 sec: 5475.4, 60 sec: 5687.7, 300 sec: 5691.7). Total num frames: 457616384. Throughput: 0: 5846.2. Samples: 457623928. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:10,271][25689] Avg episode reward: [(0, '-45.238')] [2022-07-09 22:26:11,326][26022] Updated weights on worker 0-0, policy_version 446899 (0.00083) [2022-07-09 22:26:13,198][26022] Updated weights on worker 0-0, policy_version 446909 (0.00090) [2022-07-09 22:26:14,872][26022] Updated weights on worker 0-0, policy_version 446919 (0.00597) [2022-07-09 22:26:15,319][25689] Fps is (10 sec: 5693.2, 60 sec: 5691.3, 300 sec: 5685.5). Total num frames: 457646080. Throughput: 0: 4991.4. Samples: 457641254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:15,319][25689] Avg episode reward: [(0, '-45.499')] [2022-07-09 22:26:16,699][26022] Updated weights on worker 0-0, policy_version 446929 (0.00088) [2022-07-09 22:26:18,562][26022] Updated weights on worker 0-0, policy_version 446939 (0.00090) [2022-07-09 22:26:20,269][26022] Updated weights on worker 0-0, policy_version 446949 (0.00080) [2022-07-09 22:26:20,358][25689] Fps is (10 sec: 5887.5, 60 sec: 5697.0, 300 sec: 5693.1). Total num frames: 457675776. Throughput: 0: 5885.3. Samples: 457675862. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:20,358][25689] Avg episode reward: [(0, '-45.846')] [2022-07-09 22:26:22,214][26022] Updated weights on worker 0-0, policy_version 446959 (0.00095) [2022-07-09 22:26:23,739][26022] Updated weights on worker 0-0, policy_version 446969 (0.00085) [2022-07-09 22:26:25,362][25689] Fps is (10 sec: 5709.5, 60 sec: 5686.3, 300 sec: 5689.6). Total num frames: 457703424. Throughput: 0: 5992.0. Samples: 457710192. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:25,363][25689] Avg episode reward: [(0, '-45.678')] [2022-07-09 22:26:25,733][26022] Updated weights on worker 0-0, policy_version 446979 (0.00092) [2022-07-09 22:26:27,689][26022] Updated weights on worker 0-0, policy_version 446989 (0.00089) [2022-07-09 22:26:29,108][26022] Updated weights on worker 0-0, policy_version 446999 (0.00086) [2022-07-09 22:26:30,413][25689] Fps is (10 sec: 5601.1, 60 sec: 5703.3, 300 sec: 5693.2). Total num frames: 457732096. Throughput: 0: 5139.3. Samples: 457727450. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:30,414][25689] Avg episode reward: [(0, '-45.452')] [2022-07-09 22:26:31,346][26022] Updated weights on worker 0-0, policy_version 447009 (0.00083) [2022-07-09 22:26:32,659][26022] Updated weights on worker 0-0, policy_version 447019 (0.00081) [2022-07-09 22:26:34,911][26022] Updated weights on worker 0-0, policy_version 447029 (0.00099) [2022-07-09 22:26:35,434][25689] Fps is (10 sec: 5795.1, 60 sec: 5686.9, 300 sec: 5700.6). Total num frames: 457761792. Throughput: 0: 5993.5. Samples: 457761816. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:35,435][25689] Avg episode reward: [(0, '-45.021')] [2022-07-09 22:26:36,222][26022] Updated weights on worker 0-0, policy_version 447039 (0.00082) [2022-07-09 22:26:38,590][26022] Updated weights on worker 0-0, policy_version 447049 (0.00098) [2022-07-09 22:26:40,127][26022] Updated weights on worker 0-0, policy_version 447059 (0.00086) [2022-07-09 22:26:40,519][25689] Fps is (10 sec: 5877.2, 60 sec: 5716.5, 300 sec: 5699.4). Total num frames: 457791488. Throughput: 0: 5957.8. Samples: 457795976. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 22:26:40,519][25689] Avg episode reward: [(0, '-45.652')] [2022-07-09 22:26:42,075][26022] Updated weights on worker 0-0, policy_version 447069 (0.00109) [2022-07-09 22:26:43,460][26022] Updated weights on worker 0-0, policy_version 447079 (0.00095) [2022-07-09 22:26:45,543][25689] Fps is (10 sec: 5571.6, 60 sec: 5698.7, 300 sec: 5692.3). Total num frames: 457818112. Throughput: 0: 5099.3. Samples: 457813098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:26:45,543][25689] Avg episode reward: [(0, '-45.073')] [2022-07-09 22:26:45,685][26022] Updated weights on worker 0-0, policy_version 447089 (0.00093) [2022-07-09 22:26:47,192][26022] Updated weights on worker 0-0, policy_version 447099 (0.00082) [2022-07-09 22:26:49,170][26022] Updated weights on worker 0-0, policy_version 447109 (0.00150) [2022-07-09 22:26:50,545][26022] Updated weights on worker 0-0, policy_version 447119 (0.00492) [2022-07-09 22:26:50,570][25689] Fps is (10 sec: 5806.7, 60 sec: 5713.6, 300 sec: 5702.8). Total num frames: 457849856. Throughput: 0: 5960.9. Samples: 457847606. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:26:50,571][25689] Avg episode reward: [(0, '-45.824')] [2022-07-09 22:26:52,594][26022] Updated weights on worker 0-0, policy_version 447129 (0.00083) [2022-07-09 22:26:54,344][26022] Updated weights on worker 0-0, policy_version 447139 (0.00618) [2022-07-09 22:26:55,577][25689] Fps is (10 sec: 5918.9, 60 sec: 5696.8, 300 sec: 5697.9). Total num frames: 457877504. Throughput: 0: 5979.7. Samples: 457882264. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:26:55,578][25689] Avg episode reward: [(0, '-45.884')] [2022-07-09 22:26:56,412][26022] Updated weights on worker 0-0, policy_version 447149 (0.00093) [2022-07-09 22:26:57,823][26022] Updated weights on worker 0-0, policy_version 447159 (0.00082) [2022-07-09 22:26:59,902][26022] Updated weights on worker 0-0, policy_version 447169 (0.00087) [2022-07-09 22:27:00,651][25689] Fps is (10 sec: 5688.4, 60 sec: 5722.7, 300 sec: 5710.9). Total num frames: 457907200. Throughput: 0: 5143.3. Samples: 457899524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:00,652][25689] Avg episode reward: [(0, '-46.534')] [2022-07-09 22:27:01,399][26022] Updated weights on worker 0-0, policy_version 447179 (0.00103) [2022-07-09 22:27:03,829][26022] Updated weights on worker 0-0, policy_version 447189 (0.00616) [2022-07-09 22:27:05,443][26022] Updated weights on worker 0-0, policy_version 447199 (0.00085) [2022-07-09 22:27:05,703][25689] Fps is (10 sec: 5461.0, 60 sec: 5691.1, 300 sec: 5700.5). Total num frames: 457932800. Throughput: 0: 5896.9. Samples: 457931980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:05,703][25689] Avg episode reward: [(0, '-46.769')] [2022-07-09 22:27:07,294][26022] Updated weights on worker 0-0, policy_version 447209 (0.00087) [2022-07-09 22:27:09,106][26022] Updated weights on worker 0-0, policy_version 447219 (0.00084) [2022-07-09 22:27:10,721][25689] Fps is (10 sec: 5389.4, 60 sec: 5708.5, 300 sec: 5697.2). Total num frames: 457961472. Throughput: 0: 5905.1. Samples: 457966600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:10,722][25689] Avg episode reward: [(0, '-45.632')] [2022-07-09 22:27:10,860][26022] Updated weights on worker 0-0, policy_version 447229 (0.00087) [2022-07-09 22:27:12,655][26022] Updated weights on worker 0-0, policy_version 447239 (0.00088) [2022-07-09 22:27:14,346][26022] Updated weights on worker 0-0, policy_version 447249 (0.00091) [2022-07-09 22:27:15,731][25689] Fps is (10 sec: 5718.3, 60 sec: 5695.3, 300 sec: 5696.0). Total num frames: 457990144. Throughput: 0: 5032.7. Samples: 457983692. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:15,732][25689] Avg episode reward: [(0, '-45.799')] [2022-07-09 22:27:16,071][26022] Updated weights on worker 0-0, policy_version 447259 (0.00085) [2022-07-09 22:27:17,981][26022] Updated weights on worker 0-0, policy_version 447269 (0.00091) [2022-07-09 22:27:19,618][26022] Updated weights on worker 0-0, policy_version 447279 (0.00092) [2022-07-09 22:27:20,831][25689] Fps is (10 sec: 5672.4, 60 sec: 5672.6, 300 sec: 5697.7). Total num frames: 458018816. Throughput: 0: 5867.7. Samples: 458017930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:20,831][25689] Avg episode reward: [(0, '-45.538')] [2022-07-09 22:27:21,630][26022] Updated weights on worker 0-0, policy_version 447289 (0.01223) [2022-07-09 22:27:23,143][26022] Updated weights on worker 0-0, policy_version 447299 (0.00081) [2022-07-09 22:27:24,948][26022] Updated weights on worker 0-0, policy_version 447309 (0.00090) [2022-07-09 22:27:25,854][25689] Fps is (10 sec: 5867.2, 60 sec: 5721.7, 300 sec: 5701.6). Total num frames: 458049536. Throughput: 0: 6001.8. Samples: 458052922. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:25,855][25689] Avg episode reward: [(0, '-45.423')] [2022-07-09 22:27:26,990][26022] Updated weights on worker 0-0, policy_version 447319 (0.00082) [2022-07-09 22:27:28,614][26022] Updated weights on worker 0-0, policy_version 447329 (0.00053) [2022-07-09 22:27:30,532][26022] Updated weights on worker 0-0, policy_version 447339 (0.00087) [2022-07-09 22:27:30,881][25689] Fps is (10 sec: 5807.7, 60 sec: 5707.0, 300 sec: 5701.1). Total num frames: 458077184. Throughput: 0: 5131.3. Samples: 458070044. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:30,881][25689] Avg episode reward: [(0, '-45.203')] [2022-07-09 22:27:31,971][26022] Updated weights on worker 0-0, policy_version 447349 (0.00090) [2022-07-09 22:27:34,066][26022] Updated weights on worker 0-0, policy_version 447359 (0.00088) [2022-07-09 22:27:35,651][26022] Updated weights on worker 0-0, policy_version 447369 (0.00092) [2022-07-09 22:27:35,897][25689] Fps is (10 sec: 5607.8, 60 sec: 5690.5, 300 sec: 5699.7). Total num frames: 458105856. Throughput: 0: 5997.1. Samples: 458104630. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:35,898][25689] Avg episode reward: [(0, '-45.387')] [2022-07-09 22:27:37,157][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:27:37,170][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000447377_458114048.pth [2022-07-09 22:27:37,171][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000445372_456060928.pth [2022-07-09 22:27:37,171][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000447377_458114048.pth.milestone [2022-07-09 22:27:37,607][26022] Updated weights on worker 0-0, policy_version 447379 (0.00090) [2022-07-09 22:27:39,467][26022] Updated weights on worker 0-0, policy_version 447389 (0.00084) [2022-07-09 22:27:40,987][25689] Fps is (10 sec: 5775.2, 60 sec: 5689.9, 300 sec: 5698.2). Total num frames: 458135552. Throughput: 0: 6014.3. Samples: 458139158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:40,988][25689] Avg episode reward: [(0, '-46.283')] [2022-07-09 22:27:41,187][26022] Updated weights on worker 0-0, policy_version 447399 (0.00082) [2022-07-09 22:27:43,056][26022] Updated weights on worker 0-0, policy_version 447409 (0.00082) [2022-07-09 22:27:44,668][26022] Updated weights on worker 0-0, policy_version 447419 (0.00087) [2022-07-09 22:27:46,014][25689] Fps is (10 sec: 5668.2, 60 sec: 5706.7, 300 sec: 5695.0). Total num frames: 458163200. Throughput: 0: 5984.6. Samples: 458173572. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:46,014][25689] Avg episode reward: [(0, '-46.668')] [2022-07-09 22:27:46,408][26022] Updated weights on worker 0-0, policy_version 447429 (0.00091) [2022-07-09 22:27:48,463][26022] Updated weights on worker 0-0, policy_version 447439 (0.00087) [2022-07-09 22:27:49,987][26022] Updated weights on worker 0-0, policy_version 447449 (0.00082) [2022-07-09 22:27:51,042][25689] Fps is (10 sec: 5703.3, 60 sec: 5672.8, 300 sec: 5701.7). Total num frames: 458192896. Throughput: 0: 5992.6. Samples: 458190862. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:51,042][25689] Avg episode reward: [(0, '-46.811')] [2022-07-09 22:27:51,975][26022] Updated weights on worker 0-0, policy_version 447459 (0.00085) [2022-07-09 22:27:53,707][26022] Updated weights on worker 0-0, policy_version 447469 (0.00083) [2022-07-09 22:27:55,294][26022] Updated weights on worker 0-0, policy_version 447479 (0.00082) [2022-07-09 22:27:56,071][25689] Fps is (10 sec: 5803.4, 60 sec: 5687.6, 300 sec: 5702.2). Total num frames: 458221568. Throughput: 0: 5977.9. Samples: 458225230. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:27:56,072][25689] Avg episode reward: [(0, '-45.866')] [2022-07-09 22:27:57,447][26022] Updated weights on worker 0-0, policy_version 447489 (0.00084) [2022-07-09 22:27:58,880][26022] Updated weights on worker 0-0, policy_version 447499 (0.00084) [2022-07-09 22:28:00,825][26022] Updated weights on worker 0-0, policy_version 447509 (0.00086) [2022-07-09 22:28:01,183][25689] Fps is (10 sec: 5755.7, 60 sec: 5684.0, 300 sec: 5714.6). Total num frames: 458251264. Throughput: 0: 5958.0. Samples: 458259484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:01,183][25689] Avg episode reward: [(0, '-45.660')] [2022-07-09 22:28:02,936][26022] Updated weights on worker 0-0, policy_version 447519 (0.00091) [2022-07-09 22:28:04,687][26022] Updated weights on worker 0-0, policy_version 447529 (0.00093) [2022-07-09 22:28:06,194][25689] Fps is (10 sec: 5563.5, 60 sec: 5704.7, 300 sec: 5704.6). Total num frames: 458277888. Throughput: 0: 5001.2. Samples: 458274498. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:06,195][25689] Avg episode reward: [(0, '-44.850')] [2022-07-09 22:28:06,643][26022] Updated weights on worker 0-0, policy_version 447539 (0.00098) [2022-07-09 22:28:08,201][26022] Updated weights on worker 0-0, policy_version 447549 (0.00085) [2022-07-09 22:28:10,294][26022] Updated weights on worker 0-0, policy_version 447559 (0.00092) [2022-07-09 22:28:11,205][25689] Fps is (10 sec: 5415.0, 60 sec: 5688.5, 300 sec: 5701.4). Total num frames: 458305536. Throughput: 0: 5858.3. Samples: 458308986. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:11,206][25689] Avg episode reward: [(0, '-44.521')] [2022-07-09 22:28:11,879][26022] Updated weights on worker 0-0, policy_version 447569 (0.00087) [2022-07-09 22:28:13,751][26022] Updated weights on worker 0-0, policy_version 447579 (0.00087) [2022-07-09 22:28:15,321][26022] Updated weights on worker 0-0, policy_version 447589 (0.00087) [2022-07-09 22:28:16,222][25689] Fps is (10 sec: 5616.3, 60 sec: 5687.8, 300 sec: 5695.8). Total num frames: 458334208. Throughput: 0: 5865.1. Samples: 458343418. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:16,223][25689] Avg episode reward: [(0, '-44.077')] [2022-07-09 22:28:17,479][26022] Updated weights on worker 0-0, policy_version 447599 (0.00087) [2022-07-09 22:28:18,935][26022] Updated weights on worker 0-0, policy_version 447609 (0.00080) [2022-07-09 22:28:20,967][26022] Updated weights on worker 0-0, policy_version 447619 (0.00079) [2022-07-09 22:28:21,325][25689] Fps is (10 sec: 5767.7, 60 sec: 5704.4, 300 sec: 5701.5). Total num frames: 458363904. Throughput: 0: 5018.2. Samples: 458360562. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:21,326][25689] Avg episode reward: [(0, '-45.335')] [2022-07-09 22:28:22,423][26022] Updated weights on worker 0-0, policy_version 447629 (0.00052) [2022-07-09 22:28:24,439][26022] Updated weights on worker 0-0, policy_version 447639 (0.00090) [2022-07-09 22:28:26,104][26022] Updated weights on worker 0-0, policy_version 447649 (0.00094) [2022-07-09 22:28:26,352][25689] Fps is (10 sec: 5863.0, 60 sec: 5687.2, 300 sec: 5704.9). Total num frames: 458393600. Throughput: 0: 6006.2. Samples: 458395570. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:26,352][25689] Avg episode reward: [(0, '-46.163')] [2022-07-09 22:28:27,968][26022] Updated weights on worker 0-0, policy_version 447659 (0.00090) [2022-07-09 22:28:29,603][26022] Updated weights on worker 0-0, policy_version 447669 (0.00090) [2022-07-09 22:28:31,415][25689] Fps is (10 sec: 5784.8, 60 sec: 5700.7, 300 sec: 5701.3). Total num frames: 458422272. Throughput: 0: 5973.3. Samples: 458429704. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:31,416][25689] Avg episode reward: [(0, '-45.917')] [2022-07-09 22:28:31,764][26022] Updated weights on worker 0-0, policy_version 447679 (0.00096) [2022-07-09 22:28:33,017][26022] Updated weights on worker 0-0, policy_version 447689 (0.00089) [2022-07-09 22:28:35,187][26022] Updated weights on worker 0-0, policy_version 447699 (0.00087) [2022-07-09 22:28:36,485][25689] Fps is (10 sec: 5861.3, 60 sec: 5729.4, 300 sec: 5711.3). Total num frames: 458452992. Throughput: 0: 5116.0. Samples: 458447082. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:36,485][25689] Avg episode reward: [(0, '-45.809')] [2022-07-09 22:28:36,656][26022] Updated weights on worker 0-0, policy_version 447709 (0.00087) [2022-07-09 22:28:38,820][26022] Updated weights on worker 0-0, policy_version 447719 (0.00090) [2022-07-09 22:28:40,311][26022] Updated weights on worker 0-0, policy_version 447729 (0.00094) [2022-07-09 22:28:41,530][25689] Fps is (10 sec: 5669.5, 60 sec: 5683.0, 300 sec: 5701.8). Total num frames: 458479616. Throughput: 0: 5978.2. Samples: 458481348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:41,531][25689] Avg episode reward: [(0, '-45.859')] [2022-07-09 22:28:42,286][26022] Updated weights on worker 0-0, policy_version 447739 (0.00093) [2022-07-09 22:28:44,128][26022] Updated weights on worker 0-0, policy_version 447749 (0.00088) [2022-07-09 22:28:45,925][26022] Updated weights on worker 0-0, policy_version 447759 (0.00090) [2022-07-09 22:28:46,534][25689] Fps is (10 sec: 5502.8, 60 sec: 5702.0, 300 sec: 5702.6). Total num frames: 458508288. Throughput: 0: 5953.4. Samples: 458515718. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:46,534][25689] Avg episode reward: [(0, '-44.901')] [2022-07-09 22:28:47,536][26022] Updated weights on worker 0-0, policy_version 447769 (0.00084) [2022-07-09 22:28:49,602][26022] Updated weights on worker 0-0, policy_version 447779 (0.00084) [2022-07-09 22:28:51,127][26022] Updated weights on worker 0-0, policy_version 447789 (0.00100) [2022-07-09 22:28:51,577][25689] Fps is (10 sec: 5809.2, 60 sec: 5700.6, 300 sec: 5702.2). Total num frames: 458537984. Throughput: 0: 5108.2. Samples: 458532696. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:51,578][25689] Avg episode reward: [(0, '-45.188')] [2022-07-09 22:28:53,295][26022] Updated weights on worker 0-0, policy_version 447799 (0.00084) [2022-07-09 22:28:54,641][26022] Updated weights on worker 0-0, policy_version 447809 (0.00080) [2022-07-09 22:28:56,587][25689] Fps is (10 sec: 5704.1, 60 sec: 5685.5, 300 sec: 5699.8). Total num frames: 458565632. Throughput: 0: 5966.0. Samples: 458567010. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:28:56,587][25689] Avg episode reward: [(0, '-46.118')] [2022-07-09 22:28:56,774][26022] Updated weights on worker 0-0, policy_version 447819 (0.00084) [2022-07-09 22:28:58,277][26022] Updated weights on worker 0-0, policy_version 447829 (0.00094) [2022-07-09 22:29:00,194][26022] Updated weights on worker 0-0, policy_version 447839 (0.00080) [2022-07-09 22:29:01,663][25689] Fps is (10 sec: 5584.3, 60 sec: 5672.0, 300 sec: 5706.5). Total num frames: 458594304. Throughput: 0: 5966.8. Samples: 458601478. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:29:01,663][25689] Avg episode reward: [(0, '-46.176')] [2022-07-09 22:29:02,267][26022] Updated weights on worker 0-0, policy_version 447849 (0.00089) [2022-07-09 22:29:03,923][26022] Updated weights on worker 0-0, policy_version 447859 (0.00077) [2022-07-09 22:29:06,019][26022] Updated weights on worker 0-0, policy_version 447869 (0.00082) [2022-07-09 22:29:06,692][25689] Fps is (10 sec: 5674.4, 60 sec: 5704.1, 300 sec: 5706.1). Total num frames: 458622976. Throughput: 0: 5010.1. Samples: 458616716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:29:06,693][25689] Avg episode reward: [(0, '-45.833')] [2022-07-09 22:29:07,569][26022] Updated weights on worker 0-0, policy_version 447879 (0.00088) [2022-07-09 22:29:09,359][26022] Updated weights on worker 0-0, policy_version 447889 (0.00092) [2022-07-09 22:29:10,993][26022] Updated weights on worker 0-0, policy_version 447899 (0.00084) [2022-07-09 22:29:11,723][25689] Fps is (10 sec: 5598.4, 60 sec: 5702.3, 300 sec: 5702.7). Total num frames: 458650624. Throughput: 0: 5906.6. Samples: 458651686. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:29:11,723][25689] Avg episode reward: [(0, '-45.075')] [2022-07-09 22:29:12,924][26022] Updated weights on worker 0-0, policy_version 447909 (0.00088) [2022-07-09 22:29:14,943][26022] Updated weights on worker 0-0, policy_version 447919 (0.00089) [2022-07-09 22:29:16,417][26022] Updated weights on worker 0-0, policy_version 447929 (0.00091) [2022-07-09 22:29:16,728][25689] Fps is (10 sec: 5713.9, 60 sec: 5720.3, 300 sec: 5704.0). Total num frames: 458680320. Throughput: 0: 5911.4. Samples: 458686076. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:29:16,729][25689] Avg episode reward: [(0, '-44.525')] [2022-07-09 22:29:18,362][26022] Updated weights on worker 0-0, policy_version 447939 (0.00090) [2022-07-09 22:29:20,247][26022] Updated weights on worker 0-0, policy_version 447949 (0.00085) [2022-07-09 22:29:21,711][26022] Updated weights on worker 0-0, policy_version 447959 (0.00083) [2022-07-09 22:29:21,772][25689] Fps is (10 sec: 5909.9, 60 sec: 5725.9, 300 sec: 5706.6). Total num frames: 458710016. Throughput: 0: 5053.0. Samples: 458703092. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:29:21,773][25689] Avg episode reward: [(0, '-44.712')] [2022-07-09 22:29:23,926][26022] Updated weights on worker 0-0, policy_version 447969 (0.00087) [2022-07-09 22:29:25,441][26022] Updated weights on worker 0-0, policy_version 447979 (0.00088) [2022-07-09 22:29:26,780][25689] Fps is (10 sec: 5602.8, 60 sec: 5676.8, 300 sec: 5703.8). Total num frames: 458736640. Throughput: 0: 6013.2. Samples: 458737510. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-09 22:29:26,781][25689] Avg episode reward: [(0, '-44.692')] [2022-07-09 22:29:27,372][26022] Updated weights on worker 0-0, policy_version 447989 (0.00096) [2022-07-09 22:29:29,263][26022] Updated weights on worker 0-0, policy_version 447999 (0.00102) [2022-07-09 22:29:30,746][26022] Updated weights on worker 0-0, policy_version 448009 (0.00086) [2022-07-09 22:29:31,801][25689] Fps is (10 sec: 5514.1, 60 sec: 5680.9, 300 sec: 5697.0). Total num frames: 458765312. Throughput: 0: 5995.8. Samples: 458772068. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:29:31,801][25689] Avg episode reward: [(0, '-45.743')] [2022-07-09 22:29:32,719][26022] Updated weights on worker 0-0, policy_version 448019 (0.00098) [2022-07-09 22:29:34,344][26022] Updated weights on worker 0-0, policy_version 448029 (0.00091) [2022-07-09 22:29:36,141][26022] Updated weights on worker 0-0, policy_version 448039 (0.00089) [2022-07-09 22:29:36,833][25689] Fps is (10 sec: 5908.0, 60 sec: 5684.3, 300 sec: 5704.2). Total num frames: 458796032. Throughput: 0: 5136.2. Samples: 458789338. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:29:36,834][25689] Avg episode reward: [(0, '-46.226')] [2022-07-09 22:29:37,194][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:29:37,210][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000448045_458798080.pth [2022-07-09 22:29:37,210][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000446040_456744960.pth [2022-07-09 22:29:38,108][26022] Updated weights on worker 0-0, policy_version 448049 (0.00080) [2022-07-09 22:29:39,675][26022] Updated weights on worker 0-0, policy_version 448059 (0.00084) [2022-07-09 22:29:41,658][26022] Updated weights on worker 0-0, policy_version 448069 (0.00088) [2022-07-09 22:29:41,933][25689] Fps is (10 sec: 5760.6, 60 sec: 5696.1, 300 sec: 5695.5). Total num frames: 458823680. Throughput: 0: 5996.2. Samples: 458823978. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:29:41,935][25689] Avg episode reward: [(0, '-46.745')] [2022-07-09 22:29:43,299][26022] Updated weights on worker 0-0, policy_version 448079 (0.00086) [2022-07-09 22:29:45,158][26022] Updated weights on worker 0-0, policy_version 448089 (0.00091) [2022-07-09 22:29:46,984][25689] Fps is (10 sec: 5649.5, 60 sec: 5708.6, 300 sec: 5701.8). Total num frames: 458853376. Throughput: 0: 5964.6. Samples: 458858014. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:29:46,984][25689] Avg episode reward: [(0, '-46.480')] [2022-07-09 22:29:46,992][26022] Updated weights on worker 0-0, policy_version 448099 (0.00089) [2022-07-09 22:29:48,715][26022] Updated weights on worker 0-0, policy_version 448109 (0.00091) [2022-07-09 22:29:50,497][26022] Updated weights on worker 0-0, policy_version 448119 (0.00087) [2022-07-09 22:29:52,039][25689] Fps is (10 sec: 5876.7, 60 sec: 5707.5, 300 sec: 5701.2). Total num frames: 458883072. Throughput: 0: 5097.0. Samples: 458875224. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:29:52,040][25689] Avg episode reward: [(0, '-45.484')] [2022-07-09 22:29:52,303][26022] Updated weights on worker 0-0, policy_version 448129 (0.00093) [2022-07-09 22:29:54,198][26022] Updated weights on worker 0-0, policy_version 448139 (0.00088) [2022-07-09 22:29:55,931][26022] Updated weights on worker 0-0, policy_version 448149 (0.00085) [2022-07-09 22:29:57,059][25689] Fps is (10 sec: 5691.6, 60 sec: 5706.6, 300 sec: 5696.8). Total num frames: 458910720. Throughput: 0: 5954.6. Samples: 458909774. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:29:57,062][25689] Avg episode reward: [(0, '-45.114')] [2022-07-09 22:29:57,757][26022] Updated weights on worker 0-0, policy_version 448159 (0.00085) [2022-07-09 22:29:59,367][26022] Updated weights on worker 0-0, policy_version 448169 (0.00090) [2022-07-09 22:30:01,210][26022] Updated weights on worker 0-0, policy_version 448179 (0.00090) [2022-07-09 22:30:02,119][25689] Fps is (10 sec: 5688.9, 60 sec: 5725.0, 300 sec: 5706.1). Total num frames: 458940416. Throughput: 0: 5973.0. Samples: 458944552. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:02,120][25689] Avg episode reward: [(0, '-45.169')] [2022-07-09 22:30:03,365][26022] Updated weights on worker 0-0, policy_version 448189 (0.00083) [2022-07-09 22:30:05,143][26022] Updated weights on worker 0-0, policy_version 448199 (0.00084) [2022-07-09 22:30:07,056][26022] Updated weights on worker 0-0, policy_version 448209 (0.00088) [2022-07-09 22:30:07,151][25689] Fps is (10 sec: 5479.2, 60 sec: 5674.0, 300 sec: 5695.5). Total num frames: 458966016. Throughput: 0: 5030.8. Samples: 458959472. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:07,151][25689] Avg episode reward: [(0, '-45.694')] [2022-07-09 22:30:08,830][26022] Updated weights on worker 0-0, policy_version 448219 (0.00088) [2022-07-09 22:30:10,572][26022] Updated weights on worker 0-0, policy_version 448229 (0.00085) [2022-07-09 22:30:12,174][25689] Fps is (10 sec: 5499.6, 60 sec: 5708.6, 300 sec: 5696.7). Total num frames: 458995712. Throughput: 0: 5894.2. Samples: 458993900. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:12,174][25689] Avg episode reward: [(0, '-46.288')] [2022-07-09 22:30:12,424][26022] Updated weights on worker 0-0, policy_version 448239 (0.00092) [2022-07-09 22:30:14,115][26022] Updated weights on worker 0-0, policy_version 448249 (0.00470) [2022-07-09 22:30:15,971][26022] Updated weights on worker 0-0, policy_version 448259 (0.00093) [2022-07-09 22:30:17,184][25689] Fps is (10 sec: 5817.6, 60 sec: 5691.2, 300 sec: 5695.0). Total num frames: 459024384. Throughput: 0: 5900.4. Samples: 459028520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:17,184][25689] Avg episode reward: [(0, '-47.048')] [2022-07-09 22:30:17,680][26022] Updated weights on worker 0-0, policy_version 448269 (0.00086) [2022-07-09 22:30:19,661][26022] Updated weights on worker 0-0, policy_version 448279 (0.00104) [2022-07-09 22:30:20,910][26022] Updated weights on worker 0-0, policy_version 448289 (0.00084) [2022-07-09 22:30:22,254][25689] Fps is (10 sec: 5587.3, 60 sec: 5654.9, 300 sec: 5691.6). Total num frames: 459052032. Throughput: 0: 5864.7. Samples: 459062636. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:22,254][25689] Avg episode reward: [(0, '-46.790')] [2022-07-09 22:30:23,312][26022] Updated weights on worker 0-0, policy_version 448299 (0.00240) [2022-07-09 22:30:24,810][26022] Updated weights on worker 0-0, policy_version 448309 (0.00096) [2022-07-09 22:30:26,742][26022] Updated weights on worker 0-0, policy_version 448319 (0.00090) [2022-07-09 22:30:27,320][25689] Fps is (10 sec: 5657.4, 60 sec: 5700.2, 300 sec: 5698.2). Total num frames: 459081728. Throughput: 0: 5966.7. Samples: 459079816. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:27,320][25689] Avg episode reward: [(0, '-46.189')] [2022-07-09 22:30:28,348][26022] Updated weights on worker 0-0, policy_version 448329 (0.00089) [2022-07-09 22:30:30,152][26022] Updated weights on worker 0-0, policy_version 448339 (0.00087) [2022-07-09 22:30:31,959][26022] Updated weights on worker 0-0, policy_version 448349 (0.00085) [2022-07-09 22:30:32,344][25689] Fps is (10 sec: 5784.8, 60 sec: 5699.9, 300 sec: 5691.4). Total num frames: 459110400. Throughput: 0: 5959.3. Samples: 459114100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:32,344][25689] Avg episode reward: [(0, '-45.309')] [2022-07-09 22:30:33,828][26022] Updated weights on worker 0-0, policy_version 448359 (0.00091) [2022-07-09 22:30:35,633][26022] Updated weights on worker 0-0, policy_version 448369 (0.00090) [2022-07-09 22:30:37,374][25689] Fps is (10 sec: 5602.0, 60 sec: 5649.4, 300 sec: 5691.5). Total num frames: 459138048. Throughput: 0: 5942.7. Samples: 459148502. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:37,374][25689] Avg episode reward: [(0, '-44.786')] [2022-07-09 22:30:37,601][26022] Updated weights on worker 0-0, policy_version 448379 (0.00086) [2022-07-09 22:30:39,076][26022] Updated weights on worker 0-0, policy_version 448389 (0.00091) [2022-07-09 22:30:41,071][26022] Updated weights on worker 0-0, policy_version 448399 (0.00098) [2022-07-09 22:30:42,496][25689] Fps is (10 sec: 5749.2, 60 sec: 5698.0, 300 sec: 5699.8). Total num frames: 459168768. Throughput: 0: 5085.1. Samples: 459165570. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:42,498][25689] Avg episode reward: [(0, '-44.628')] [2022-07-09 22:30:42,701][26022] Updated weights on worker 0-0, policy_version 448409 (0.00091) [2022-07-09 22:30:44,690][26022] Updated weights on worker 0-0, policy_version 448419 (0.00087) [2022-07-09 22:30:46,323][26022] Updated weights on worker 0-0, policy_version 448429 (0.00086) [2022-07-09 22:30:47,567][25689] Fps is (10 sec: 5826.7, 60 sec: 5679.2, 300 sec: 5691.7). Total num frames: 459197440. Throughput: 0: 5934.4. Samples: 459199968. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:47,567][25689] Avg episode reward: [(0, '-44.700')] [2022-07-09 22:30:48,245][26022] Updated weights on worker 0-0, policy_version 448439 (0.00087) [2022-07-09 22:30:50,061][26022] Updated weights on worker 0-0, policy_version 448449 (0.00089) [2022-07-09 22:30:51,977][26022] Updated weights on worker 0-0, policy_version 448459 (0.00092) [2022-07-09 22:30:52,570][25689] Fps is (10 sec: 5590.5, 60 sec: 5650.3, 300 sec: 5688.3). Total num frames: 459225088. Throughput: 0: 5934.6. Samples: 459234136. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:52,571][25689] Avg episode reward: [(0, '-43.974')] [2022-07-09 22:30:53,489][26022] Updated weights on worker 0-0, policy_version 448469 (0.00079) [2022-07-09 22:30:55,520][26022] Updated weights on worker 0-0, policy_version 448479 (0.00383) [2022-07-09 22:30:56,869][26022] Updated weights on worker 0-0, policy_version 448489 (0.00082) [2022-07-09 22:30:57,620][25689] Fps is (10 sec: 5806.1, 60 sec: 5698.2, 300 sec: 5697.5). Total num frames: 459255808. Throughput: 0: 5093.6. Samples: 459251628. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:30:57,620][25689] Avg episode reward: [(0, '-44.283')] [2022-07-09 22:30:59,107][26022] Updated weights on worker 0-0, policy_version 448499 (0.00086) [2022-07-09 22:31:00,671][26022] Updated weights on worker 0-0, policy_version 448509 (0.00093) [2022-07-09 22:31:02,655][25689] Fps is (10 sec: 5584.9, 60 sec: 5632.9, 300 sec: 5691.4). Total num frames: 459281408. Throughput: 0: 5978.2. Samples: 459286082. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:02,655][25689] Avg episode reward: [(0, '-44.614')] [2022-07-09 22:31:02,899][26022] Updated weights on worker 0-0, policy_version 448519 (0.00095) [2022-07-09 22:31:04,792][26022] Updated weights on worker 0-0, policy_version 448529 (0.00095) [2022-07-09 22:31:06,535][26022] Updated weights on worker 0-0, policy_version 448539 (0.00089) [2022-07-09 22:31:07,657][25689] Fps is (10 sec: 5304.8, 60 sec: 5669.5, 300 sec: 5691.8). Total num frames: 459309056. Throughput: 0: 5880.0. Samples: 459318100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:07,658][25689] Avg episode reward: [(0, '-44.224')] [2022-07-09 22:31:08,185][26022] Updated weights on worker 0-0, policy_version 448549 (0.00081) [2022-07-09 22:31:10,189][26022] Updated weights on worker 0-0, policy_version 448559 (0.00092) [2022-07-09 22:31:11,688][26022] Updated weights on worker 0-0, policy_version 448569 (0.00086) [2022-07-09 22:31:12,665][25689] Fps is (10 sec: 5626.5, 60 sec: 5654.0, 300 sec: 5689.2). Total num frames: 459337728. Throughput: 0: 5029.0. Samples: 459335190. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:12,665][25689] Avg episode reward: [(0, '-44.560')] [2022-07-09 22:31:13,724][26022] Updated weights on worker 0-0, policy_version 448579 (0.00101) [2022-07-09 22:31:15,331][26022] Updated weights on worker 0-0, policy_version 448589 (0.00087) [2022-07-09 22:31:17,165][26022] Updated weights on worker 0-0, policy_version 448599 (0.00092) [2022-07-09 22:31:17,688][25689] Fps is (10 sec: 5921.2, 60 sec: 5686.6, 300 sec: 5692.9). Total num frames: 459368448. Throughput: 0: 5890.8. Samples: 459369844. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:17,689][25689] Avg episode reward: [(0, '-44.647')] [2022-07-09 22:31:19,087][26022] Updated weights on worker 0-0, policy_version 448609 (0.00086) [2022-07-09 22:31:20,647][26022] Updated weights on worker 0-0, policy_version 448619 (0.00088) [2022-07-09 22:31:22,710][26022] Updated weights on worker 0-0, policy_version 448629 (0.00091) [2022-07-09 22:31:22,783][25689] Fps is (10 sec: 5768.4, 60 sec: 5684.3, 300 sec: 5691.2). Total num frames: 459396096. Throughput: 0: 5853.1. Samples: 459403896. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:22,784][25689] Avg episode reward: [(0, '-44.387')] [2022-07-09 22:31:24,492][26022] Updated weights on worker 0-0, policy_version 448639 (0.00083) [2022-07-09 22:31:26,351][26022] Updated weights on worker 0-0, policy_version 448649 (0.00082) [2022-07-09 22:31:27,847][25689] Fps is (10 sec: 5543.8, 60 sec: 5667.6, 300 sec: 5690.9). Total num frames: 459424768. Throughput: 0: 5100.5. Samples: 459421076. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:27,847][25689] Avg episode reward: [(0, '-44.672')] [2022-07-09 22:31:27,994][26022] Updated weights on worker 0-0, policy_version 448659 (0.00102) [2022-07-09 22:31:29,859][26022] Updated weights on worker 0-0, policy_version 448669 (0.00087) [2022-07-09 22:31:31,727][26022] Updated weights on worker 0-0, policy_version 448679 (0.00087) [2022-07-09 22:31:32,871][25689] Fps is (10 sec: 5684.3, 60 sec: 5667.5, 300 sec: 5687.4). Total num frames: 459453440. Throughput: 0: 5933.7. Samples: 459455090. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:32,872][25689] Avg episode reward: [(0, '-44.921')] [2022-07-09 22:31:33,356][26022] Updated weights on worker 0-0, policy_version 448689 (0.00081) [2022-07-09 22:31:35,488][26022] Updated weights on worker 0-0, policy_version 448699 (0.00088) [2022-07-09 22:31:37,051][26022] Updated weights on worker 0-0, policy_version 448709 (0.00088) [2022-07-09 22:31:37,372][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:31:37,390][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000448711_459480064.pth [2022-07-09 22:31:37,391][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000446709_457430016.pth [2022-07-09 22:31:37,887][25689] Fps is (10 sec: 5711.6, 60 sec: 5685.8, 300 sec: 5685.3). Total num frames: 459482112. Throughput: 0: 5899.8. Samples: 459489014. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:37,887][25689] Avg episode reward: [(0, '-44.762')] [2022-07-09 22:31:39,046][26022] Updated weights on worker 0-0, policy_version 448719 (0.00089) [2022-07-09 22:31:40,654][26022] Updated weights on worker 0-0, policy_version 448729 (0.00086) [2022-07-09 22:31:42,495][26022] Updated weights on worker 0-0, policy_version 448739 (0.00087) [2022-07-09 22:31:42,939][25689] Fps is (10 sec: 5695.7, 60 sec: 5658.5, 300 sec: 5691.6). Total num frames: 459510784. Throughput: 0: 5066.9. Samples: 459506026. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:42,940][25689] Avg episode reward: [(0, '-43.871')] [2022-07-09 22:31:44,285][26022] Updated weights on worker 0-0, policy_version 448749 (0.00087) [2022-07-09 22:31:46,000][26022] Updated weights on worker 0-0, policy_version 448759 (0.00087) [2022-07-09 22:31:47,957][25689] Fps is (10 sec: 5592.8, 60 sec: 5646.5, 300 sec: 5678.1). Total num frames: 459538432. Throughput: 0: 5938.4. Samples: 459540498. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:47,958][25689] Avg episode reward: [(0, '-44.467')] [2022-07-09 22:31:48,052][26022] Updated weights on worker 0-0, policy_version 448769 (0.00089) [2022-07-09 22:31:49,638][26022] Updated weights on worker 0-0, policy_version 448779 (0.00083) [2022-07-09 22:31:51,548][26022] Updated weights on worker 0-0, policy_version 448789 (0.00085) [2022-07-09 22:31:52,989][25689] Fps is (10 sec: 5604.1, 60 sec: 5660.8, 300 sec: 5681.0). Total num frames: 459567104. Throughput: 0: 5934.6. Samples: 459574482. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:52,990][25689] Avg episode reward: [(0, '-45.492')] [2022-07-09 22:31:53,283][26022] Updated weights on worker 0-0, policy_version 448799 (0.00083) [2022-07-09 22:31:55,043][26022] Updated weights on worker 0-0, policy_version 448809 (0.00102) [2022-07-09 22:31:56,875][26022] Updated weights on worker 0-0, policy_version 448819 (0.00086) [2022-07-09 22:31:57,999][25689] Fps is (10 sec: 5812.5, 60 sec: 5647.5, 300 sec: 5682.2). Total num frames: 459596800. Throughput: 0: 5097.9. Samples: 459591542. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:31:58,000][25689] Avg episode reward: [(0, '-45.023')] [2022-07-09 22:31:58,521][26022] Updated weights on worker 0-0, policy_version 448829 (0.00093) [2022-07-09 22:32:00,467][26022] Updated weights on worker 0-0, policy_version 448839 (0.00094) [2022-07-09 22:32:02,899][26022] Updated weights on worker 0-0, policy_version 448849 (0.00094) [2022-07-09 22:32:03,070][25689] Fps is (10 sec: 5485.1, 60 sec: 5644.1, 300 sec: 5681.9). Total num frames: 459622400. Throughput: 0: 5951.4. Samples: 459625834. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:32:03,072][25689] Avg episode reward: [(0, '-45.467')] [2022-07-09 22:32:04,285][26022] Updated weights on worker 0-0, policy_version 448859 (0.00083) [2022-07-09 22:32:06,280][26022] Updated weights on worker 0-0, policy_version 448869 (0.00105) [2022-07-09 22:32:07,812][26022] Updated weights on worker 0-0, policy_version 448879 (0.00089) [2022-07-09 22:32:08,099][25689] Fps is (10 sec: 5576.2, 60 sec: 5692.5, 300 sec: 5688.6). Total num frames: 459653120. Throughput: 0: 5848.0. Samples: 459658288. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:32:08,100][25689] Avg episode reward: [(0, '-44.611')] [2022-07-09 22:32:09,760][26022] Updated weights on worker 0-0, policy_version 448889 (0.00082) [2022-07-09 22:32:11,635][26022] Updated weights on worker 0-0, policy_version 448899 (0.00089) [2022-07-09 22:32:13,119][25689] Fps is (10 sec: 5706.7, 60 sec: 5657.4, 300 sec: 5681.5). Total num frames: 459679744. Throughput: 0: 5021.8. Samples: 459675570. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-09 22:32:13,120][25689] Avg episode reward: [(0, '-44.963')] [2022-07-09 22:32:13,312][26022] Updated weights on worker 0-0, policy_version 448909 (0.00099) [2022-07-09 22:32:15,152][26022] Updated weights on worker 0-0, policy_version 448919 (0.00091) [2022-07-09 22:32:16,775][26022] Updated weights on worker 0-0, policy_version 448929 (0.00093) [2022-07-09 22:32:18,136][25689] Fps is (10 sec: 5610.9, 60 sec: 5641.0, 300 sec: 5686.5). Total num frames: 459709440. Throughput: 0: 5897.0. Samples: 459710294. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:18,138][25689] Avg episode reward: [(0, '-45.248')] [2022-07-09 22:32:18,761][26022] Updated weights on worker 0-0, policy_version 448939 (0.00091) [2022-07-09 22:32:20,395][26022] Updated weights on worker 0-0, policy_version 448949 (0.00084) [2022-07-09 22:32:22,212][26022] Updated weights on worker 0-0, policy_version 448959 (0.00085) [2022-07-09 22:32:23,251][25689] Fps is (10 sec: 5962.9, 60 sec: 5690.1, 300 sec: 5684.7). Total num frames: 459740160. Throughput: 0: 5892.9. Samples: 459744756. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:23,253][25689] Avg episode reward: [(0, '-45.537')] [2022-07-09 22:32:23,902][26022] Updated weights on worker 0-0, policy_version 448969 (0.00086) [2022-07-09 22:32:25,817][26022] Updated weights on worker 0-0, policy_version 448979 (0.00100) [2022-07-09 22:32:27,525][26022] Updated weights on worker 0-0, policy_version 448989 (0.00091) [2022-07-09 22:32:28,309][25689] Fps is (10 sec: 5838.6, 60 sec: 5690.6, 300 sec: 5687.6). Total num frames: 459768832. Throughput: 0: 5145.3. Samples: 459762274. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:28,310][25689] Avg episode reward: [(0, '-45.355')] [2022-07-09 22:32:29,409][26022] Updated weights on worker 0-0, policy_version 448999 (0.00094) [2022-07-09 22:32:31,071][26022] Updated weights on worker 0-0, policy_version 449009 (0.00084) [2022-07-09 22:32:32,895][26022] Updated weights on worker 0-0, policy_version 449019 (0.00093) [2022-07-09 22:32:33,350][25689] Fps is (10 sec: 5678.1, 60 sec: 5689.0, 300 sec: 5687.1). Total num frames: 459797504. Throughput: 0: 5971.3. Samples: 459796376. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:33,350][25689] Avg episode reward: [(0, '-45.596')] [2022-07-09 22:32:34,736][26022] Updated weights on worker 0-0, policy_version 449029 (0.00091) [2022-07-09 22:32:36,463][26022] Updated weights on worker 0-0, policy_version 449039 (0.00100) [2022-07-09 22:32:38,387][25689] Fps is (10 sec: 5588.4, 60 sec: 5670.1, 300 sec: 5681.3). Total num frames: 459825152. Throughput: 0: 5956.5. Samples: 459830916. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:38,388][25689] Avg episode reward: [(0, '-46.216')] [2022-07-09 22:32:38,486][26022] Updated weights on worker 0-0, policy_version 449049 (0.00090) [2022-07-09 22:32:40,107][26022] Updated weights on worker 0-0, policy_version 449059 (0.00095) [2022-07-09 22:32:41,777][26022] Updated weights on worker 0-0, policy_version 449069 (0.00085) [2022-07-09 22:32:43,503][25689] Fps is (10 sec: 5748.8, 60 sec: 5697.9, 300 sec: 5689.9). Total num frames: 459855872. Throughput: 0: 5094.8. Samples: 459847936. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:43,503][25689] Avg episode reward: [(0, '-45.616')] [2022-07-09 22:32:43,793][26022] Updated weights on worker 0-0, policy_version 449079 (0.00050) [2022-07-09 22:32:45,415][26022] Updated weights on worker 0-0, policy_version 449089 (0.00650) [2022-07-09 22:32:47,180][26022] Updated weights on worker 0-0, policy_version 449099 (0.00091) [2022-07-09 22:32:48,582][25689] Fps is (10 sec: 5925.9, 60 sec: 5726.0, 300 sec: 5688.9). Total num frames: 459885568. Throughput: 0: 5940.6. Samples: 459882710. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:48,582][25689] Avg episode reward: [(0, '-45.145')] [2022-07-09 22:32:49,089][26022] Updated weights on worker 0-0, policy_version 449109 (0.00094) [2022-07-09 22:32:50,691][26022] Updated weights on worker 0-0, policy_version 449119 (0.00087) [2022-07-09 22:32:52,617][26022] Updated weights on worker 0-0, policy_version 449129 (0.00080) [2022-07-09 22:32:53,614][25689] Fps is (10 sec: 5772.9, 60 sec: 5726.0, 300 sec: 5688.8). Total num frames: 459914240. Throughput: 0: 5981.0. Samples: 459917574. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:53,614][25689] Avg episode reward: [(0, '-45.486')] [2022-07-09 22:32:54,266][26022] Updated weights on worker 0-0, policy_version 449139 (0.00086) [2022-07-09 22:32:55,945][26022] Updated weights on worker 0-0, policy_version 449149 (0.00085) [2022-07-09 22:32:57,859][26022] Updated weights on worker 0-0, policy_version 449159 (0.00097) [2022-07-09 22:32:58,643][25689] Fps is (10 sec: 5699.4, 60 sec: 5707.2, 300 sec: 5687.0). Total num frames: 459942912. Throughput: 0: 5992.0. Samples: 459952292. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:32:58,644][25689] Avg episode reward: [(0, '-46.218')] [2022-07-09 22:32:59,559][26022] Updated weights on worker 0-0, policy_version 449169 (0.00088) [2022-07-09 22:33:01,424][26022] Updated weights on worker 0-0, policy_version 449179 (0.00079) [2022-07-09 22:33:03,541][26022] Updated weights on worker 0-0, policy_version 449189 (0.00086) [2022-07-09 22:33:03,746][25689] Fps is (10 sec: 5558.5, 60 sec: 5738.1, 300 sec: 5688.7). Total num frames: 459970560. Throughput: 0: 5977.6. Samples: 459968940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:03,746][25689] Avg episode reward: [(0, '-46.266')] [2022-07-09 22:33:05,312][26022] Updated weights on worker 0-0, policy_version 449199 (0.00084) [2022-07-09 22:33:07,147][26022] Updated weights on worker 0-0, policy_version 449209 (0.00087) [2022-07-09 22:33:08,789][25689] Fps is (10 sec: 5551.0, 60 sec: 5702.9, 300 sec: 5691.5). Total num frames: 459999232. Throughput: 0: 5886.3. Samples: 460001656. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:08,791][25689] Avg episode reward: [(0, '-46.634')] [2022-07-09 22:33:08,797][26022] Updated weights on worker 0-0, policy_version 449219 (0.00087) [2022-07-09 22:33:10,992][26022] Updated weights on worker 0-0, policy_version 449229 (0.00089) [2022-07-09 22:33:12,461][26022] Updated weights on worker 0-0, policy_version 449239 (0.00087) [2022-07-09 22:33:13,858][25689] Fps is (10 sec: 5468.4, 60 sec: 5698.3, 300 sec: 5683.6). Total num frames: 460025856. Throughput: 0: 5843.7. Samples: 460035876. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:13,858][25689] Avg episode reward: [(0, '-46.728')] [2022-07-09 22:33:14,351][26022] Updated weights on worker 0-0, policy_version 449249 (0.00082) [2022-07-09 22:33:16,111][26022] Updated weights on worker 0-0, policy_version 449259 (0.00083) [2022-07-09 22:33:17,800][26022] Updated weights on worker 0-0, policy_version 449269 (0.00087) [2022-07-09 22:33:18,870][25689] Fps is (10 sec: 5586.6, 60 sec: 5698.8, 300 sec: 5685.3). Total num frames: 460055552. Throughput: 0: 4986.7. Samples: 460053160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:18,871][25689] Avg episode reward: [(0, '-47.180')] [2022-07-09 22:33:19,834][26022] Updated weights on worker 0-0, policy_version 449279 (0.00087) [2022-07-09 22:33:21,600][26022] Updated weights on worker 0-0, policy_version 449289 (0.00086) [2022-07-09 22:33:23,291][26022] Updated weights on worker 0-0, policy_version 449299 (0.00097) [2022-07-09 22:33:23,903][25689] Fps is (10 sec: 5912.5, 60 sec: 5689.6, 300 sec: 5685.2). Total num frames: 460085248. Throughput: 0: 5875.2. Samples: 460087366. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:23,904][25689] Avg episode reward: [(0, '-46.381')] [2022-07-09 22:33:25,154][26022] Updated weights on worker 0-0, policy_version 449309 (0.00090) [2022-07-09 22:33:26,846][26022] Updated weights on worker 0-0, policy_version 449319 (0.00088) [2022-07-09 22:33:28,647][26022] Updated weights on worker 0-0, policy_version 449329 (0.00077) [2022-07-09 22:33:28,931][25689] Fps is (10 sec: 5801.5, 60 sec: 5692.4, 300 sec: 5685.9). Total num frames: 460113920. Throughput: 0: 5965.5. Samples: 460121814. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:28,933][25689] Avg episode reward: [(0, '-45.589')] [2022-07-09 22:33:30,543][26022] Updated weights on worker 0-0, policy_version 449339 (0.00081) [2022-07-09 22:33:32,180][26022] Updated weights on worker 0-0, policy_version 449349 (0.00088) [2022-07-09 22:33:33,947][25689] Fps is (10 sec: 5607.2, 60 sec: 5677.9, 300 sec: 5676.6). Total num frames: 460141568. Throughput: 0: 5135.7. Samples: 460139048. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:33,948][25689] Avg episode reward: [(0, '-44.671')] [2022-07-09 22:33:34,151][26022] Updated weights on worker 0-0, policy_version 449359 (0.00088) [2022-07-09 22:33:35,553][26022] Updated weights on worker 0-0, policy_version 449369 (0.00083) [2022-07-09 22:33:37,410][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:33:37,422][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000449377_460162048.pth [2022-07-09 22:33:37,422][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000447377_458114048.pth [2022-07-09 22:33:37,650][26022] Updated weights on worker 0-0, policy_version 449379 (0.00052) [2022-07-09 22:33:38,971][25689] Fps is (10 sec: 5711.8, 60 sec: 5712.9, 300 sec: 5687.3). Total num frames: 460171264. Throughput: 0: 5984.5. Samples: 460173450. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:38,972][25689] Avg episode reward: [(0, '-44.274')] [2022-07-09 22:33:39,506][26022] Updated weights on worker 0-0, policy_version 449389 (0.00052) [2022-07-09 22:33:41,117][26022] Updated weights on worker 0-0, policy_version 449399 (0.00104) [2022-07-09 22:33:42,801][26022] Updated weights on worker 0-0, policy_version 449409 (0.00053) [2022-07-09 22:33:44,100][25689] Fps is (10 sec: 5950.4, 60 sec: 5711.7, 300 sec: 5691.8). Total num frames: 460201984. Throughput: 0: 5981.4. Samples: 460208174. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:44,101][25689] Avg episode reward: [(0, '-44.528')] [2022-07-09 22:33:44,799][26022] Updated weights on worker 0-0, policy_version 449419 (0.00083) [2022-07-09 22:33:46,440][26022] Updated weights on worker 0-0, policy_version 449429 (0.00098) [2022-07-09 22:33:48,352][26022] Updated weights on worker 0-0, policy_version 449439 (0.00086) [2022-07-09 22:33:49,136][25689] Fps is (10 sec: 5741.8, 60 sec: 5681.9, 300 sec: 5685.1). Total num frames: 460229632. Throughput: 0: 5124.3. Samples: 460225350. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:49,137][25689] Avg episode reward: [(0, '-44.772')] [2022-07-09 22:33:50,078][26022] Updated weights on worker 0-0, policy_version 449449 (0.00076) [2022-07-09 22:33:52,071][26022] Updated weights on worker 0-0, policy_version 449459 (0.00087) [2022-07-09 22:33:53,869][26022] Updated weights on worker 0-0, policy_version 449469 (0.00087) [2022-07-09 22:33:54,141][25689] Fps is (10 sec: 5507.0, 60 sec: 5667.5, 300 sec: 5685.2). Total num frames: 460257280. Throughput: 0: 5932.2. Samples: 460258844. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:54,142][25689] Avg episode reward: [(0, '-45.266')] [2022-07-09 22:33:55,664][26022] Updated weights on worker 0-0, policy_version 449479 (0.00087) [2022-07-09 22:33:57,515][26022] Updated weights on worker 0-0, policy_version 449489 (0.00083) [2022-07-09 22:33:59,190][25689] Fps is (10 sec: 5602.1, 60 sec: 5665.7, 300 sec: 5685.7). Total num frames: 460285952. Throughput: 0: 5903.5. Samples: 460292812. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:33:59,190][25689] Avg episode reward: [(0, '-45.249')] [2022-07-09 22:33:59,222][26022] Updated weights on worker 0-0, policy_version 449499 (0.00091) [2022-07-09 22:34:01,066][26022] Updated weights on worker 0-0, policy_version 449509 (0.00087) [2022-07-09 22:34:03,304][26022] Updated weights on worker 0-0, policy_version 449519 (0.00091) [2022-07-09 22:34:04,311][25689] Fps is (10 sec: 5538.1, 60 sec: 5664.0, 300 sec: 5680.5). Total num frames: 460313600. Throughput: 0: 4959.9. Samples: 460308416. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:04,311][25689] Avg episode reward: [(0, '-45.519')] [2022-07-09 22:34:04,881][26022] Updated weights on worker 0-0, policy_version 449529 (0.00086) [2022-07-09 22:34:06,941][26022] Updated weights on worker 0-0, policy_version 449539 (0.00081) [2022-07-09 22:34:08,834][26022] Updated weights on worker 0-0, policy_version 449549 (0.00087) [2022-07-09 22:34:09,339][25689] Fps is (10 sec: 5448.4, 60 sec: 5648.5, 300 sec: 5680.6). Total num frames: 460341248. Throughput: 0: 5781.0. Samples: 460342140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:09,340][25689] Avg episode reward: [(0, '-45.570')] [2022-07-09 22:34:10,390][26022] Updated weights on worker 0-0, policy_version 449559 (0.00089) [2022-07-09 22:34:12,295][26022] Updated weights on worker 0-0, policy_version 449569 (0.00093) [2022-07-09 22:34:14,230][26022] Updated weights on worker 0-0, policy_version 449579 (0.00091) [2022-07-09 22:34:14,351][25689] Fps is (10 sec: 5507.7, 60 sec: 5670.8, 300 sec: 5673.6). Total num frames: 460368896. Throughput: 0: 5806.6. Samples: 460376190. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:14,352][25689] Avg episode reward: [(0, '-45.654')] [2022-07-09 22:34:15,604][26022] Updated weights on worker 0-0, policy_version 449589 (0.00084) [2022-07-09 22:34:17,836][26022] Updated weights on worker 0-0, policy_version 449599 (0.00090) [2022-07-09 22:34:19,129][26022] Updated weights on worker 0-0, policy_version 449609 (0.00083) [2022-07-09 22:34:19,392][25689] Fps is (10 sec: 5907.5, 60 sec: 5701.9, 300 sec: 5680.5). Total num frames: 460400640. Throughput: 0: 4982.2. Samples: 460393466. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:19,393][25689] Avg episode reward: [(0, '-46.405')] [2022-07-09 22:34:21,290][26022] Updated weights on worker 0-0, policy_version 449619 (0.00090) [2022-07-09 22:34:22,873][26022] Updated weights on worker 0-0, policy_version 449629 (0.00090) [2022-07-09 22:34:24,507][25689] Fps is (10 sec: 5948.7, 60 sec: 5677.3, 300 sec: 5685.3). Total num frames: 460429312. Throughput: 0: 5903.1. Samples: 460427634. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:24,507][25689] Avg episode reward: [(0, '-45.952')] [2022-07-09 22:34:24,591][26022] Updated weights on worker 0-0, policy_version 449639 (0.00081) [2022-07-09 22:34:26,525][26022] Updated weights on worker 0-0, policy_version 449649 (0.00088) [2022-07-09 22:34:28,381][26022] Updated weights on worker 0-0, policy_version 449659 (0.00099) [2022-07-09 22:34:29,539][25689] Fps is (10 sec: 5449.8, 60 sec: 5643.1, 300 sec: 5678.2). Total num frames: 460455936. Throughput: 0: 5937.8. Samples: 460462084. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:29,539][25689] Avg episode reward: [(0, '-47.005')] [2022-07-09 22:34:30,107][26022] Updated weights on worker 0-0, policy_version 449669 (0.00087) [2022-07-09 22:34:32,052][26022] Updated weights on worker 0-0, policy_version 449679 (0.00096) [2022-07-09 22:34:33,548][26022] Updated weights on worker 0-0, policy_version 449689 (0.00082) [2022-07-09 22:34:34,578][25689] Fps is (10 sec: 5592.0, 60 sec: 5674.7, 300 sec: 5674.7). Total num frames: 460485632. Throughput: 0: 5095.4. Samples: 460479258. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:34,579][25689] Avg episode reward: [(0, '-46.723')] [2022-07-09 22:34:35,552][26022] Updated weights on worker 0-0, policy_version 449699 (0.00087) [2022-07-09 22:34:37,159][26022] Updated weights on worker 0-0, policy_version 449709 (0.00097) [2022-07-09 22:34:39,047][26022] Updated weights on worker 0-0, policy_version 449719 (0.00086) [2022-07-09 22:34:39,602][25689] Fps is (10 sec: 5800.3, 60 sec: 5657.8, 300 sec: 5679.5). Total num frames: 460514304. Throughput: 0: 5966.4. Samples: 460514042. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:39,602][25689] Avg episode reward: [(0, '-47.255')] [2022-07-09 22:34:40,842][26022] Updated weights on worker 0-0, policy_version 449729 (0.00082) [2022-07-09 22:34:42,671][26022] Updated weights on worker 0-0, policy_version 449739 (0.00085) [2022-07-09 22:34:44,549][26022] Updated weights on worker 0-0, policy_version 449749 (0.00087) [2022-07-09 22:34:44,732][25689] Fps is (10 sec: 5748.2, 60 sec: 5640.8, 300 sec: 5678.0). Total num frames: 460544000. Throughput: 0: 5964.9. Samples: 460548278. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:44,734][25689] Avg episode reward: [(0, '-46.230')] [2022-07-09 22:34:46,233][26022] Updated weights on worker 0-0, policy_version 449759 (0.00087) [2022-07-09 22:34:48,088][26022] Updated weights on worker 0-0, policy_version 449769 (0.00092) [2022-07-09 22:34:49,756][25689] Fps is (10 sec: 5747.8, 60 sec: 5658.8, 300 sec: 5675.2). Total num frames: 460572672. Throughput: 0: 5964.6. Samples: 460582674. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:49,757][25689] Avg episode reward: [(0, '-45.510')] [2022-07-09 22:34:49,871][26022] Updated weights on worker 0-0, policy_version 449779 (0.00087) [2022-07-09 22:34:51,583][26022] Updated weights on worker 0-0, policy_version 449789 (0.00088) [2022-07-09 22:34:53,698][26022] Updated weights on worker 0-0, policy_version 449799 (0.00090) [2022-07-09 22:34:54,797][25689] Fps is (10 sec: 5799.2, 60 sec: 5689.3, 300 sec: 5681.7). Total num frames: 460602368. Throughput: 0: 5940.5. Samples: 460599368. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:54,798][25689] Avg episode reward: [(0, '-44.580')] [2022-07-09 22:34:55,039][26022] Updated weights on worker 0-0, policy_version 449809 (0.00085) [2022-07-09 22:34:57,198][26022] Updated weights on worker 0-0, policy_version 449819 (0.00087) [2022-07-09 22:34:58,895][26022] Updated weights on worker 0-0, policy_version 449829 (0.00093) [2022-07-09 22:34:59,819][25689] Fps is (10 sec: 5596.7, 60 sec: 5657.9, 300 sec: 5672.1). Total num frames: 460628992. Throughput: 0: 5914.5. Samples: 460633620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2022-07-09 22:34:59,821][25689] Avg episode reward: [(0, '-44.434')] [2022-07-09 22:35:00,655][26022] Updated weights on worker 0-0, policy_version 449839 (0.00090) [2022-07-09 22:35:02,826][26022] Updated weights on worker 0-0, policy_version 449849 (0.00073) [2022-07-09 22:35:04,704][26022] Updated weights on worker 0-0, policy_version 449859 (0.00093) [2022-07-09 22:35:04,946][25689] Fps is (10 sec: 5347.5, 60 sec: 5657.4, 300 sec: 5677.1). Total num frames: 460656640. Throughput: 0: 5800.9. Samples: 460665536. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:04,946][25689] Avg episode reward: [(0, '-45.636')] [2022-07-09 22:35:06,491][26022] Updated weights on worker 0-0, policy_version 449869 (0.00499) [2022-07-09 22:35:08,262][26022] Updated weights on worker 0-0, policy_version 449879 (0.00087) [2022-07-09 22:35:09,792][26022] Updated weights on worker 0-0, policy_version 449889 (0.00096) [2022-07-09 22:35:09,999][25689] Fps is (10 sec: 5633.3, 60 sec: 5688.9, 300 sec: 5676.6). Total num frames: 460686336. Throughput: 0: 4935.5. Samples: 460682582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:09,999][25689] Avg episode reward: [(0, '-45.158')] [2022-07-09 22:35:11,927][26022] Updated weights on worker 0-0, policy_version 449899 (0.00363) [2022-07-09 22:35:13,559][26022] Updated weights on worker 0-0, policy_version 449909 (0.00089) [2022-07-09 22:35:15,031][25689] Fps is (10 sec: 5686.1, 60 sec: 5687.0, 300 sec: 5672.7). Total num frames: 460713984. Throughput: 0: 5800.9. Samples: 460716744. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:15,032][25689] Avg episode reward: [(0, '-46.387')] [2022-07-09 22:35:15,528][26022] Updated weights on worker 0-0, policy_version 449919 (0.00088) [2022-07-09 22:35:17,297][26022] Updated weights on worker 0-0, policy_version 449929 (0.00088) [2022-07-09 22:35:18,864][26022] Updated weights on worker 0-0, policy_version 449939 (0.00091) [2022-07-09 22:35:20,083][25689] Fps is (10 sec: 5483.5, 60 sec: 5618.5, 300 sec: 5673.1). Total num frames: 460741632. Throughput: 0: 5814.6. Samples: 460751444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:20,084][25689] Avg episode reward: [(0, '-47.956')] [2022-07-09 22:35:20,881][26022] Updated weights on worker 0-0, policy_version 449949 (0.00094) [2022-07-09 22:35:22,614][26022] Updated weights on worker 0-0, policy_version 449959 (0.00085) [2022-07-09 22:35:24,289][26022] Updated weights on worker 0-0, policy_version 449969 (0.00088) [2022-07-09 22:35:25,188][25689] Fps is (10 sec: 5847.6, 60 sec: 5670.0, 300 sec: 5679.2). Total num frames: 460773376. Throughput: 0: 5084.4. Samples: 460768452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:25,188][25689] Avg episode reward: [(0, '-47.524')] [2022-07-09 22:35:26,352][26022] Updated weights on worker 0-0, policy_version 449979 (0.00084) [2022-07-09 22:35:27,785][26022] Updated weights on worker 0-0, policy_version 449989 (0.00086) [2022-07-09 22:35:29,919][26022] Updated weights on worker 0-0, policy_version 449999 (0.00091) [2022-07-09 22:35:30,217][25689] Fps is (10 sec: 5759.7, 60 sec: 5670.3, 300 sec: 5672.2). Total num frames: 460800000. Throughput: 0: 5935.4. Samples: 460802582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:30,217][25689] Avg episode reward: [(0, '-46.731')] [2022-07-09 22:35:31,640][26022] Updated weights on worker 0-0, policy_version 450009 (0.00088) [2022-07-09 22:35:33,391][26022] Updated weights on worker 0-0, policy_version 450019 (0.00097) [2022-07-09 22:35:35,238][25689] Fps is (10 sec: 5603.6, 60 sec: 5672.0, 300 sec: 5679.3). Total num frames: 460829696. Throughput: 0: 5957.9. Samples: 460837136. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:35,239][25689] Avg episode reward: [(0, '-46.066')] [2022-07-09 22:35:35,247][26022] Updated weights on worker 0-0, policy_version 450029 (0.00055) [2022-07-09 22:35:36,986][26022] Updated weights on worker 0-0, policy_version 450039 (0.00092) [2022-07-09 22:35:37,586][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:35:37,598][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000450042_460843008.pth [2022-07-09 22:35:37,598][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000448045_458798080.pth [2022-07-09 22:35:38,794][26022] Updated weights on worker 0-0, policy_version 450049 (0.00087) [2022-07-09 22:35:40,261][25689] Fps is (10 sec: 5708.9, 60 sec: 5655.1, 300 sec: 5670.8). Total num frames: 460857344. Throughput: 0: 5089.6. Samples: 460854144. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:40,262][25689] Avg episode reward: [(0, '-45.105')] [2022-07-09 22:35:40,809][26022] Updated weights on worker 0-0, policy_version 450059 (0.00089) [2022-07-09 22:35:42,396][26022] Updated weights on worker 0-0, policy_version 450069 (0.00087) [2022-07-09 22:35:44,326][26022] Updated weights on worker 0-0, policy_version 450079 (0.00089) [2022-07-09 22:35:45,303][25689] Fps is (10 sec: 5697.7, 60 sec: 5663.5, 300 sec: 5674.8). Total num frames: 460887040. Throughput: 0: 5960.2. Samples: 460888342. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:45,303][25689] Avg episode reward: [(0, '-44.187')] [2022-07-09 22:35:46,098][26022] Updated weights on worker 0-0, policy_version 450089 (0.00081) [2022-07-09 22:35:47,778][26022] Updated weights on worker 0-0, policy_version 450099 (0.00092) [2022-07-09 22:35:49,795][26022] Updated weights on worker 0-0, policy_version 450109 (0.00079) [2022-07-09 22:35:50,307][25689] Fps is (10 sec: 5810.1, 60 sec: 5665.3, 300 sec: 5678.2). Total num frames: 460915712. Throughput: 0: 5979.2. Samples: 460922708. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:50,308][25689] Avg episode reward: [(0, '-43.789')] [2022-07-09 22:35:51,448][26022] Updated weights on worker 0-0, policy_version 450119 (0.00092) [2022-07-09 22:35:53,150][26022] Updated weights on worker 0-0, policy_version 450129 (0.00085) [2022-07-09 22:35:54,966][26022] Updated weights on worker 0-0, policy_version 450139 (0.00086) [2022-07-09 22:35:55,331][25689] Fps is (10 sec: 5616.3, 60 sec: 5633.1, 300 sec: 5668.4). Total num frames: 460943360. Throughput: 0: 5096.0. Samples: 460939528. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:35:55,331][25689] Avg episode reward: [(0, '-43.795')] [2022-07-09 22:35:56,619][26022] Updated weights on worker 0-0, policy_version 450149 (0.00079) [2022-07-09 22:35:58,704][26022] Updated weights on worker 0-0, policy_version 450159 (0.00091) [2022-07-09 22:36:00,224][26022] Updated weights on worker 0-0, policy_version 450169 (0.00090) [2022-07-09 22:36:00,354][25689] Fps is (10 sec: 5707.6, 60 sec: 5683.7, 300 sec: 5682.4). Total num frames: 460973056. Throughput: 0: 5970.7. Samples: 460974112. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:00,360][25689] Avg episode reward: [(0, '-44.254')] [2022-07-09 22:36:02,466][26022] Updated weights on worker 0-0, policy_version 450179 (0.00079) [2022-07-09 22:36:04,376][26022] Updated weights on worker 0-0, policy_version 450189 (0.00081) [2022-07-09 22:36:05,399][25689] Fps is (10 sec: 5593.9, 60 sec: 5674.5, 300 sec: 5678.2). Total num frames: 460999680. Throughput: 0: 5893.0. Samples: 461006768. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:05,399][25689] Avg episode reward: [(0, '-44.781')] [2022-07-09 22:36:05,833][26022] Updated weights on worker 0-0, policy_version 450199 (0.00086) [2022-07-09 22:36:07,999][26022] Updated weights on worker 0-0, policy_version 450209 (0.00084) [2022-07-09 22:36:09,531][26022] Updated weights on worker 0-0, policy_version 450219 (0.00087) [2022-07-09 22:36:10,426][25689] Fps is (10 sec: 5490.4, 60 sec: 5660.0, 300 sec: 5677.8). Total num frames: 461028352. Throughput: 0: 5029.1. Samples: 461023886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:10,427][25689] Avg episode reward: [(0, '-43.888')] [2022-07-09 22:36:11,388][26022] Updated weights on worker 0-0, policy_version 450229 (0.00086) [2022-07-09 22:36:13,208][26022] Updated weights on worker 0-0, policy_version 450239 (0.00091) [2022-07-09 22:36:14,961][26022] Updated weights on worker 0-0, policy_version 450249 (0.00083) [2022-07-09 22:36:15,456][25689] Fps is (10 sec: 5701.9, 60 sec: 5677.1, 300 sec: 5670.8). Total num frames: 461057024. Throughput: 0: 5893.6. Samples: 461058138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:15,462][25689] Avg episode reward: [(0, '-44.777')] [2022-07-09 22:36:16,806][26022] Updated weights on worker 0-0, policy_version 450259 (0.00090) [2022-07-09 22:36:18,656][26022] Updated weights on worker 0-0, policy_version 450269 (0.00083) [2022-07-09 22:36:20,303][26022] Updated weights on worker 0-0, policy_version 450279 (0.00088) [2022-07-09 22:36:20,471][25689] Fps is (10 sec: 5912.1, 60 sec: 5731.4, 300 sec: 5682.6). Total num frames: 461087744. Throughput: 0: 5904.6. Samples: 461092896. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:20,475][25689] Avg episode reward: [(0, '-45.602')] [2022-07-09 22:36:22,359][26022] Updated weights on worker 0-0, policy_version 450289 (0.00092) [2022-07-09 22:36:23,830][26022] Updated weights on worker 0-0, policy_version 450299 (0.01209) [2022-07-09 22:36:25,553][25689] Fps is (10 sec: 5679.0, 60 sec: 5648.8, 300 sec: 5675.4). Total num frames: 461114368. Throughput: 0: 5111.3. Samples: 461109784. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:25,554][25689] Avg episode reward: [(0, '-45.999')] [2022-07-09 22:36:25,786][26022] Updated weights on worker 0-0, policy_version 450309 (0.00087) [2022-07-09 22:36:27,487][26022] Updated weights on worker 0-0, policy_version 450319 (0.00093) [2022-07-09 22:36:29,383][26022] Updated weights on worker 0-0, policy_version 450329 (0.00084) [2022-07-09 22:36:30,609][25689] Fps is (10 sec: 5454.6, 60 sec: 5680.2, 300 sec: 5674.8). Total num frames: 461143040. Throughput: 0: 5961.2. Samples: 461144204. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:30,610][25689] Avg episode reward: [(0, '-45.153')] [2022-07-09 22:36:31,145][26022] Updated weights on worker 0-0, policy_version 450339 (0.01138) [2022-07-09 22:36:33,014][26022] Updated weights on worker 0-0, policy_version 450349 (0.00374) [2022-07-09 22:36:34,752][26022] Updated weights on worker 0-0, policy_version 450359 (0.00088) [2022-07-09 22:36:35,616][25689] Fps is (10 sec: 5800.7, 60 sec: 5681.6, 300 sec: 5678.4). Total num frames: 461172736. Throughput: 0: 5973.8. Samples: 461178568. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:35,616][25689] Avg episode reward: [(0, '-45.478')] [2022-07-09 22:36:36,525][26022] Updated weights on worker 0-0, policy_version 450369 (0.00093) [2022-07-09 22:36:38,296][26022] Updated weights on worker 0-0, policy_version 450379 (0.00091) [2022-07-09 22:36:40,061][26022] Updated weights on worker 0-0, policy_version 450389 (0.00088) [2022-07-09 22:36:40,627][25689] Fps is (10 sec: 5826.4, 60 sec: 5699.7, 300 sec: 5679.2). Total num frames: 461201408. Throughput: 0: 5106.1. Samples: 461195812. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:40,627][25689] Avg episode reward: [(0, '-46.175')] [2022-07-09 22:36:41,950][26022] Updated weights on worker 0-0, policy_version 450399 (0.00089) [2022-07-09 22:36:43,503][26022] Updated weights on worker 0-0, policy_version 450409 (0.00079) [2022-07-09 22:36:45,477][26022] Updated weights on worker 0-0, policy_version 450419 (0.00096) [2022-07-09 22:36:45,765][25689] Fps is (10 sec: 5851.7, 60 sec: 5707.5, 300 sec: 5687.2). Total num frames: 461232128. Throughput: 0: 5956.6. Samples: 461230178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:45,766][25689] Avg episode reward: [(0, '-46.057')] [2022-07-09 22:36:47,159][26022] Updated weights on worker 0-0, policy_version 450429 (0.00088) [2022-07-09 22:36:48,922][26022] Updated weights on worker 0-0, policy_version 450439 (0.00091) [2022-07-09 22:36:50,809][25689] Fps is (10 sec: 5632.0, 60 sec: 5669.9, 300 sec: 5680.1). Total num frames: 461258752. Throughput: 0: 5959.4. Samples: 461264582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:50,809][25689] Avg episode reward: [(0, '-45.855')] [2022-07-09 22:36:50,827][26022] Updated weights on worker 0-0, policy_version 450449 (0.00090) [2022-07-09 22:36:52,610][26022] Updated weights on worker 0-0, policy_version 450459 (0.00096) [2022-07-09 22:36:54,389][26022] Updated weights on worker 0-0, policy_version 450469 (0.00100) [2022-07-09 22:36:55,877][25689] Fps is (10 sec: 5468.5, 60 sec: 5682.7, 300 sec: 5675.6). Total num frames: 461287424. Throughput: 0: 5083.9. Samples: 461281574. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:36:55,877][25689] Avg episode reward: [(0, '-46.318')] [2022-07-09 22:36:56,407][26022] Updated weights on worker 0-0, policy_version 450479 (0.00089) [2022-07-09 22:36:57,848][26022] Updated weights on worker 0-0, policy_version 450489 (0.00086) [2022-07-09 22:36:59,858][26022] Updated weights on worker 0-0, policy_version 450499 (0.00090) [2022-07-09 22:37:00,934][25689] Fps is (10 sec: 5966.8, 60 sec: 5713.3, 300 sec: 5696.5). Total num frames: 461319168. Throughput: 0: 5932.3. Samples: 461316280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:00,935][25689] Avg episode reward: [(0, '-46.512')] [2022-07-09 22:37:01,361][26022] Updated weights on worker 0-0, policy_version 450509 (0.00086) [2022-07-09 22:37:03,794][26022] Updated weights on worker 0-0, policy_version 450519 (0.00088) [2022-07-09 22:37:05,702][26022] Updated weights on worker 0-0, policy_version 450529 (0.00088) [2022-07-09 22:37:06,047][25689] Fps is (10 sec: 5437.3, 60 sec: 5656.3, 300 sec: 5670.8). Total num frames: 461342720. Throughput: 0: 5830.8. Samples: 461348436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:06,047][25689] Avg episode reward: [(0, '-45.486')] [2022-07-09 22:37:07,212][26022] Updated weights on worker 0-0, policy_version 450539 (0.00097) [2022-07-09 22:37:09,191][26022] Updated weights on worker 0-0, policy_version 450549 (0.00087) [2022-07-09 22:37:10,755][26022] Updated weights on worker 0-0, policy_version 450559 (0.00084) [2022-07-09 22:37:11,058][25689] Fps is (10 sec: 5360.9, 60 sec: 5691.5, 300 sec: 5684.7). Total num frames: 461373440. Throughput: 0: 4990.8. Samples: 461365640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:11,058][25689] Avg episode reward: [(0, '-46.566')] [2022-07-09 22:37:12,617][26022] Updated weights on worker 0-0, policy_version 450569 (0.00452) [2022-07-09 22:37:14,399][26022] Updated weights on worker 0-0, policy_version 450579 (0.00092) [2022-07-09 22:37:16,061][25689] Fps is (10 sec: 5930.8, 60 sec: 5694.0, 300 sec: 5681.5). Total num frames: 461402112. Throughput: 0: 5863.2. Samples: 461399916. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:16,062][25689] Avg episode reward: [(0, '-46.308')] [2022-07-09 22:37:16,245][26022] Updated weights on worker 0-0, policy_version 450589 (0.00090) [2022-07-09 22:37:17,972][26022] Updated weights on worker 0-0, policy_version 450599 (0.00101) [2022-07-09 22:37:20,047][26022] Updated weights on worker 0-0, policy_version 450609 (0.00082) [2022-07-09 22:37:21,087][25689] Fps is (10 sec: 5717.5, 60 sec: 5659.2, 300 sec: 5676.3). Total num frames: 461430784. Throughput: 0: 5851.2. Samples: 461434200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:21,088][25689] Avg episode reward: [(0, '-45.556')] [2022-07-09 22:37:21,571][26022] Updated weights on worker 0-0, policy_version 450619 (0.00084) [2022-07-09 22:37:23,551][26022] Updated weights on worker 0-0, policy_version 450629 (0.00095) [2022-07-09 22:37:25,193][26022] Updated weights on worker 0-0, policy_version 450639 (0.00086) [2022-07-09 22:37:26,175][25689] Fps is (10 sec: 5568.3, 60 sec: 5675.6, 300 sec: 5672.3). Total num frames: 461458432. Throughput: 0: 5967.6. Samples: 461468556. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:26,176][25689] Avg episode reward: [(0, '-44.322')] [2022-07-09 22:37:26,884][26022] Updated weights on worker 0-0, policy_version 450649 (0.00093) [2022-07-09 22:37:28,824][26022] Updated weights on worker 0-0, policy_version 450659 (0.00095) [2022-07-09 22:37:30,491][26022] Updated weights on worker 0-0, policy_version 450669 (0.00095) [2022-07-09 22:37:31,212][25689] Fps is (10 sec: 5562.9, 60 sec: 5677.4, 300 sec: 5672.4). Total num frames: 461487104. Throughput: 0: 5961.6. Samples: 461485788. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:31,212][25689] Avg episode reward: [(0, '-45.350')] [2022-07-09 22:37:32,390][26022] Updated weights on worker 0-0, policy_version 450679 (0.00093) [2022-07-09 22:37:34,237][26022] Updated weights on worker 0-0, policy_version 450689 (0.00086) [2022-07-09 22:37:35,872][26022] Updated weights on worker 0-0, policy_version 450699 (0.00095) [2022-07-09 22:37:36,235][25689] Fps is (10 sec: 5903.7, 60 sec: 5692.7, 300 sec: 5683.0). Total num frames: 461517824. Throughput: 0: 5965.9. Samples: 461520274. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:36,236][25689] Avg episode reward: [(0, '-44.799')] [2022-07-09 22:37:37,900][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:37:37,913][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000450709_461526016.pth [2022-07-09 22:37:37,914][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000448711_459480064.pth [2022-07-09 22:37:37,915][26022] Updated weights on worker 0-0, policy_version 450709 (0.00085) [2022-07-09 22:37:39,360][26022] Updated weights on worker 0-0, policy_version 450719 (0.00103) [2022-07-09 22:37:41,218][26022] Updated weights on worker 0-0, policy_version 450729 (0.00082) [2022-07-09 22:37:41,260][25689] Fps is (10 sec: 5910.6, 60 sec: 5691.4, 300 sec: 5677.9). Total num frames: 461546496. Throughput: 0: 5982.2. Samples: 461554876. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:41,260][25689] Avg episode reward: [(0, '-45.083')] [2022-07-09 22:37:43,162][26022] Updated weights on worker 0-0, policy_version 450739 (0.00095) [2022-07-09 22:37:44,608][26022] Updated weights on worker 0-0, policy_version 450749 (0.00086) [2022-07-09 22:37:46,383][25689] Fps is (10 sec: 5550.1, 60 sec: 5642.2, 300 sec: 5670.1). Total num frames: 461574144. Throughput: 0: 5124.1. Samples: 461572102. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2022-07-09 22:37:46,383][25689] Avg episode reward: [(0, '-44.994')] [2022-07-09 22:37:46,720][26022] Updated weights on worker 0-0, policy_version 450759 (0.00086) [2022-07-09 22:37:48,256][26022] Updated weights on worker 0-0, policy_version 450769 (0.00090) [2022-07-09 22:37:50,248][26022] Updated weights on worker 0-0, policy_version 450779 (0.00089) [2022-07-09 22:37:51,460][25689] Fps is (10 sec: 5722.1, 60 sec: 5706.6, 300 sec: 5676.2). Total num frames: 461604864. Throughput: 0: 5965.1. Samples: 461606574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:37:51,461][25689] Avg episode reward: [(0, '-46.778')] [2022-07-09 22:37:51,948][26022] Updated weights on worker 0-0, policy_version 450789 (0.00100) [2022-07-09 22:37:53,948][26022] Updated weights on worker 0-0, policy_version 450799 (0.00082) [2022-07-09 22:37:55,816][26022] Updated weights on worker 0-0, policy_version 450809 (0.00087) [2022-07-09 22:37:56,539][25689] Fps is (10 sec: 5847.8, 60 sec: 5705.6, 300 sec: 5675.2). Total num frames: 461633536. Throughput: 0: 5923.7. Samples: 461640548. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:37:56,540][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 22:37:57,400][26022] Updated weights on worker 0-0, policy_version 450819 (0.00086) [2022-07-09 22:37:59,354][26022] Updated weights on worker 0-0, policy_version 450829 (0.00089) [2022-07-09 22:38:00,932][26022] Updated weights on worker 0-0, policy_version 450839 (0.00078) [2022-07-09 22:38:01,563][25689] Fps is (10 sec: 5574.9, 60 sec: 5641.2, 300 sec: 5676.7). Total num frames: 461661184. Throughput: 0: 5071.2. Samples: 461657836. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:01,563][25689] Avg episode reward: [(0, '-45.925')] [2022-07-09 22:38:03,075][26022] Updated weights on worker 0-0, policy_version 450849 (0.00081) [2022-07-09 22:38:04,972][26022] Updated weights on worker 0-0, policy_version 450859 (0.00081) [2022-07-09 22:38:06,664][25689] Fps is (10 sec: 5562.6, 60 sec: 5726.7, 300 sec: 5675.6). Total num frames: 461689856. Throughput: 0: 5835.0. Samples: 461690442. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:06,664][25689] Avg episode reward: [(0, '-45.724')] [2022-07-09 22:38:06,665][26022] Updated weights on worker 0-0, policy_version 450869 (0.00076) [2022-07-09 22:38:08,596][26022] Updated weights on worker 0-0, policy_version 450879 (0.00092) [2022-07-09 22:38:10,179][26022] Updated weights on worker 0-0, policy_version 450889 (0.00083) [2022-07-09 22:38:11,714][25689] Fps is (10 sec: 5547.9, 60 sec: 5672.3, 300 sec: 5679.4). Total num frames: 461717504. Throughput: 0: 5848.6. Samples: 461725032. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:11,715][25689] Avg episode reward: [(0, '-46.238')] [2022-07-09 22:38:12,099][26022] Updated weights on worker 0-0, policy_version 450899 (0.00083) [2022-07-09 22:38:13,711][26022] Updated weights on worker 0-0, policy_version 450909 (0.00081) [2022-07-09 22:38:15,627][26022] Updated weights on worker 0-0, policy_version 450919 (0.00088) [2022-07-09 22:38:16,716][25689] Fps is (10 sec: 5806.4, 60 sec: 5706.2, 300 sec: 5683.0). Total num frames: 461748224. Throughput: 0: 5049.7. Samples: 461742436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:16,717][25689] Avg episode reward: [(0, '-46.618')] [2022-07-09 22:38:17,176][26022] Updated weights on worker 0-0, policy_version 450929 (0.00066) [2022-07-09 22:38:18,954][26022] Updated weights on worker 0-0, policy_version 450939 (0.00082) [2022-07-09 22:38:20,971][26022] Updated weights on worker 0-0, policy_version 450949 (0.00090) [2022-07-09 22:38:21,727][25689] Fps is (10 sec: 5829.6, 60 sec: 5690.8, 300 sec: 5676.6). Total num frames: 461775872. Throughput: 0: 5918.3. Samples: 461777172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:21,727][25689] Avg episode reward: [(0, '-46.619')] [2022-07-09 22:38:22,725][26022] Updated weights on worker 0-0, policy_version 450959 (0.00089) [2022-07-09 22:38:24,429][26022] Updated weights on worker 0-0, policy_version 450969 (0.00080) [2022-07-09 22:38:26,254][26022] Updated weights on worker 0-0, policy_version 450979 (0.00086) [2022-07-09 22:38:26,839][25689] Fps is (10 sec: 5563.9, 60 sec: 5705.4, 300 sec: 5675.0). Total num frames: 461804544. Throughput: 0: 5996.0. Samples: 461811410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:26,839][25689] Avg episode reward: [(0, '-46.907')] [2022-07-09 22:38:28,056][26022] Updated weights on worker 0-0, policy_version 450989 (0.00091) [2022-07-09 22:38:29,830][26022] Updated weights on worker 0-0, policy_version 450999 (0.00089) [2022-07-09 22:38:31,663][26022] Updated weights on worker 0-0, policy_version 451009 (0.00091) [2022-07-09 22:38:31,853][25689] Fps is (10 sec: 5764.2, 60 sec: 5724.4, 300 sec: 5681.9). Total num frames: 461834240. Throughput: 0: 5148.5. Samples: 461828714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:31,853][25689] Avg episode reward: [(0, '-47.207')] [2022-07-09 22:38:33,347][26022] Updated weights on worker 0-0, policy_version 451019 (0.00565) [2022-07-09 22:38:35,251][26022] Updated weights on worker 0-0, policy_version 451029 (0.00080) [2022-07-09 22:38:36,859][25689] Fps is (10 sec: 5722.5, 60 sec: 5675.4, 300 sec: 5675.4). Total num frames: 461861888. Throughput: 0: 6003.0. Samples: 461863356. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:36,860][25689] Avg episode reward: [(0, '-47.443')] [2022-07-09 22:38:36,999][26022] Updated weights on worker 0-0, policy_version 451039 (0.00085) [2022-07-09 22:38:38,668][26022] Updated weights on worker 0-0, policy_version 451049 (0.00085) [2022-07-09 22:38:40,448][26022] Updated weights on worker 0-0, policy_version 451059 (0.00081) [2022-07-09 22:38:41,876][25689] Fps is (10 sec: 5720.9, 60 sec: 5693.0, 300 sec: 5674.1). Total num frames: 461891584. Throughput: 0: 6018.2. Samples: 461898436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:41,877][25689] Avg episode reward: [(0, '-47.943')] [2022-07-09 22:38:42,214][26022] Updated weights on worker 0-0, policy_version 451069 (0.00086) [2022-07-09 22:38:43,891][26022] Updated weights on worker 0-0, policy_version 451079 (0.00091) [2022-07-09 22:38:45,863][26022] Updated weights on worker 0-0, policy_version 451089 (0.00094) [2022-07-09 22:38:46,925][25689] Fps is (10 sec: 6002.3, 60 sec: 5750.7, 300 sec: 5684.1). Total num frames: 461922304. Throughput: 0: 5185.8. Samples: 461915574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:46,925][25689] Avg episode reward: [(0, '-47.603')] [2022-07-09 22:38:47,515][26022] Updated weights on worker 0-0, policy_version 451099 (0.00084) [2022-07-09 22:38:49,263][26022] Updated weights on worker 0-0, policy_version 451109 (0.00092) [2022-07-09 22:38:51,024][26022] Updated weights on worker 0-0, policy_version 451119 (0.00092) [2022-07-09 22:38:51,938][25689] Fps is (10 sec: 5699.2, 60 sec: 5689.1, 300 sec: 5680.5). Total num frames: 461948928. Throughput: 0: 6073.9. Samples: 461950710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:51,938][25689] Avg episode reward: [(0, '-47.188')] [2022-07-09 22:38:52,860][26022] Updated weights on worker 0-0, policy_version 451129 (0.00094) [2022-07-09 22:38:54,691][26022] Updated weights on worker 0-0, policy_version 451139 (0.00087) [2022-07-09 22:38:56,523][26022] Updated weights on worker 0-0, policy_version 451149 (0.00086) [2022-07-09 22:38:56,971][25689] Fps is (10 sec: 5606.3, 60 sec: 5710.4, 300 sec: 5684.3). Total num frames: 461978624. Throughput: 0: 6029.0. Samples: 461984608. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:38:56,971][25689] Avg episode reward: [(0, '-48.146')] [2022-07-09 22:38:58,080][26022] Updated weights on worker 0-0, policy_version 451159 (0.00089) [2022-07-09 22:38:59,989][26022] Updated weights on worker 0-0, policy_version 451169 (0.00087) [2022-07-09 22:39:01,981][25689] Fps is (10 sec: 5607.9, 60 sec: 5694.7, 300 sec: 5683.0). Total num frames: 462005248. Throughput: 0: 5152.7. Samples: 462002030. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:01,981][25689] Avg episode reward: [(0, '-47.801')] [2022-07-09 22:39:02,115][26022] Updated weights on worker 0-0, policy_version 451179 (0.00091) [2022-07-09 22:39:04,142][26022] Updated weights on worker 0-0, policy_version 451189 (0.00092) [2022-07-09 22:39:05,720][26022] Updated weights on worker 0-0, policy_version 451199 (0.00086) [2022-07-09 22:39:07,083][25689] Fps is (10 sec: 5468.4, 60 sec: 5694.6, 300 sec: 5685.0). Total num frames: 462033920. Throughput: 0: 5885.7. Samples: 462034218. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:07,083][25689] Avg episode reward: [(0, '-47.144')] [2022-07-09 22:39:07,708][26022] Updated weights on worker 0-0, policy_version 451209 (0.00089) [2022-07-09 22:39:09,445][26022] Updated weights on worker 0-0, policy_version 451219 (0.00083) [2022-07-09 22:39:11,155][26022] Updated weights on worker 0-0, policy_version 451229 (0.00085) [2022-07-09 22:39:12,099][25689] Fps is (10 sec: 5869.6, 60 sec: 5748.7, 300 sec: 5695.2). Total num frames: 462064640. Throughput: 0: 5848.8. Samples: 462068630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:12,100][25689] Avg episode reward: [(0, '-46.063')] [2022-07-09 22:39:12,983][26022] Updated weights on worker 0-0, policy_version 451239 (0.00087) [2022-07-09 22:39:14,856][26022] Updated weights on worker 0-0, policy_version 451249 (0.00095) [2022-07-09 22:39:16,523][26022] Updated weights on worker 0-0, policy_version 451259 (0.00086) [2022-07-09 22:39:17,175][25689] Fps is (10 sec: 5783.5, 60 sec: 5690.9, 300 sec: 5680.8). Total num frames: 462092288. Throughput: 0: 5009.8. Samples: 462085826. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:17,176][25689] Avg episode reward: [(0, '-45.901')] [2022-07-09 22:39:18,583][26022] Updated weights on worker 0-0, policy_version 451269 (0.00087) [2022-07-09 22:39:19,999][26022] Updated weights on worker 0-0, policy_version 451279 (0.00436) [2022-07-09 22:39:22,023][26022] Updated weights on worker 0-0, policy_version 451289 (0.00089) [2022-07-09 22:39:22,215][25689] Fps is (10 sec: 5669.1, 60 sec: 5722.0, 300 sec: 5685.7). Total num frames: 462121984. Throughput: 0: 5847.9. Samples: 462120354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:22,215][25689] Avg episode reward: [(0, '-45.170')] [2022-07-09 22:39:23,488][26022] Updated weights on worker 0-0, policy_version 451299 (0.00091) [2022-07-09 22:39:25,613][26022] Updated weights on worker 0-0, policy_version 451309 (0.00096) [2022-07-09 22:39:27,211][26022] Updated weights on worker 0-0, policy_version 451319 (0.00090) [2022-07-09 22:39:27,333][25689] Fps is (10 sec: 5746.3, 60 sec: 5721.4, 300 sec: 5690.9). Total num frames: 462150656. Throughput: 0: 5934.2. Samples: 462154384. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:27,333][25689] Avg episode reward: [(0, '-45.091')] [2022-07-09 22:39:29,218][26022] Updated weights on worker 0-0, policy_version 451329 (0.00091) [2022-07-09 22:39:30,846][26022] Updated weights on worker 0-0, policy_version 451339 (0.00088) [2022-07-09 22:39:32,351][25689] Fps is (10 sec: 5556.2, 60 sec: 5687.2, 300 sec: 5684.4). Total num frames: 462178304. Throughput: 0: 5083.5. Samples: 462171578. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:32,351][25689] Avg episode reward: [(0, '-44.834')] [2022-07-09 22:39:32,710][26022] Updated weights on worker 0-0, policy_version 451349 (0.00090) [2022-07-09 22:39:34,483][26022] Updated weights on worker 0-0, policy_version 451359 (0.00086) [2022-07-09 22:39:36,239][26022] Updated weights on worker 0-0, policy_version 451369 (0.00085) [2022-07-09 22:39:37,388][25689] Fps is (10 sec: 5600.8, 60 sec: 5701.2, 300 sec: 5684.2). Total num frames: 462206976. Throughput: 0: 5969.2. Samples: 462206482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:37,390][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 22:39:37,940][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:39:37,954][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000451379_462212096.pth [2022-07-09 22:39:37,954][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000449377_460162048.pth [2022-07-09 22:39:37,956][26022] Updated weights on worker 0-0, policy_version 451379 (0.00084) [2022-07-09 22:39:39,692][26022] Updated weights on worker 0-0, policy_version 451389 (0.00091) [2022-07-09 22:39:41,580][26022] Updated weights on worker 0-0, policy_version 451399 (0.00080) [2022-07-09 22:39:42,406][25689] Fps is (10 sec: 5906.4, 60 sec: 5718.0, 300 sec: 5689.8). Total num frames: 462237696. Throughput: 0: 5974.6. Samples: 462240992. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:42,407][25689] Avg episode reward: [(0, '-45.149')] [2022-07-09 22:39:43,383][26022] Updated weights on worker 0-0, policy_version 451409 (0.00093) [2022-07-09 22:39:45,171][26022] Updated weights on worker 0-0, policy_version 451419 (0.00084) [2022-07-09 22:39:47,043][26022] Updated weights on worker 0-0, policy_version 451429 (0.00085) [2022-07-09 22:39:47,533][25689] Fps is (10 sec: 5753.7, 60 sec: 5660.0, 300 sec: 5684.4). Total num frames: 462265344. Throughput: 0: 5982.1. Samples: 462275224. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:47,535][25689] Avg episode reward: [(0, '-45.827')] [2022-07-09 22:39:48,567][26022] Updated weights on worker 0-0, policy_version 451439 (0.00090) [2022-07-09 22:39:50,711][26022] Updated weights on worker 0-0, policy_version 451449 (0.00090) [2022-07-09 22:39:52,281][26022] Updated weights on worker 0-0, policy_version 451459 (0.00094) [2022-07-09 22:39:52,605][25689] Fps is (10 sec: 5723.3, 60 sec: 5722.0, 300 sec: 5687.2). Total num frames: 462296064. Throughput: 0: 5981.3. Samples: 462292722. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:52,605][25689] Avg episode reward: [(0, '-45.731')] [2022-07-09 22:39:54,042][26022] Updated weights on worker 0-0, policy_version 451469 (0.00083) [2022-07-09 22:39:55,800][26022] Updated weights on worker 0-0, policy_version 451479 (0.00094) [2022-07-09 22:39:57,619][25689] Fps is (10 sec: 5888.4, 60 sec: 5706.9, 300 sec: 5694.2). Total num frames: 462324736. Throughput: 0: 5945.5. Samples: 462326764. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:39:57,619][25689] Avg episode reward: [(0, '-46.209')] [2022-07-09 22:39:57,624][26022] Updated weights on worker 0-0, policy_version 451489 (0.00086) [2022-07-09 22:39:59,298][26022] Updated weights on worker 0-0, policy_version 451499 (0.00091) [2022-07-09 22:40:01,222][26022] Updated weights on worker 0-0, policy_version 451509 (0.00087) [2022-07-09 22:40:02,659][25689] Fps is (10 sec: 5398.1, 60 sec: 5687.2, 300 sec: 5689.0). Total num frames: 462350336. Throughput: 0: 5858.6. Samples: 462359642. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:02,659][25689] Avg episode reward: [(0, '-45.997')] [2022-07-09 22:40:03,412][26022] Updated weights on worker 0-0, policy_version 451519 (0.00089) [2022-07-09 22:40:05,339][26022] Updated weights on worker 0-0, policy_version 451529 (0.00086) [2022-07-09 22:40:06,904][26022] Updated weights on worker 0-0, policy_version 451539 (0.00095) [2022-07-09 22:40:07,790][25689] Fps is (10 sec: 5436.4, 60 sec: 5701.3, 300 sec: 5687.5). Total num frames: 462380032. Throughput: 0: 4988.1. Samples: 462376272. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:07,791][25689] Avg episode reward: [(0, '-46.865')] [2022-07-09 22:40:08,874][26022] Updated weights on worker 0-0, policy_version 451549 (0.00088) [2022-07-09 22:40:10,575][26022] Updated weights on worker 0-0, policy_version 451559 (0.00083) [2022-07-09 22:40:12,439][26022] Updated weights on worker 0-0, policy_version 451569 (0.00093) [2022-07-09 22:40:12,855][25689] Fps is (10 sec: 5724.4, 60 sec: 5663.1, 300 sec: 5690.3). Total num frames: 462408704. Throughput: 0: 5826.7. Samples: 462410714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:12,856][25689] Avg episode reward: [(0, '-47.605')] [2022-07-09 22:40:14,023][26022] Updated weights on worker 0-0, policy_version 451579 (0.00088) [2022-07-09 22:40:15,982][26022] Updated weights on worker 0-0, policy_version 451589 (0.00088) [2022-07-09 22:40:17,697][26022] Updated weights on worker 0-0, policy_version 451599 (0.00084) [2022-07-09 22:40:17,859][25689] Fps is (10 sec: 5797.1, 60 sec: 5703.5, 300 sec: 5698.1). Total num frames: 462438400. Throughput: 0: 5850.9. Samples: 462445186. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:17,859][25689] Avg episode reward: [(0, '-47.579')] [2022-07-09 22:40:19,519][26022] Updated weights on worker 0-0, policy_version 451609 (0.00084) [2022-07-09 22:40:21,275][26022] Updated weights on worker 0-0, policy_version 451619 (0.00091) [2022-07-09 22:40:22,898][25689] Fps is (10 sec: 5811.6, 60 sec: 5686.7, 300 sec: 5689.1). Total num frames: 462467072. Throughput: 0: 5082.5. Samples: 462462510. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:22,899][25689] Avg episode reward: [(0, '-48.270')] [2022-07-09 22:40:23,063][26022] Updated weights on worker 0-0, policy_version 451629 (0.00084) [2022-07-09 22:40:25,003][26022] Updated weights on worker 0-0, policy_version 451639 (0.00083) [2022-07-09 22:40:26,745][26022] Updated weights on worker 0-0, policy_version 451649 (0.00087) [2022-07-09 22:40:27,966][25689] Fps is (10 sec: 5572.6, 60 sec: 5674.5, 300 sec: 5691.8). Total num frames: 462494720. Throughput: 0: 5948.1. Samples: 462496276. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:27,966][25689] Avg episode reward: [(0, '-47.807')] [2022-07-09 22:40:28,464][26022] Updated weights on worker 0-0, policy_version 451659 (0.00084) [2022-07-09 22:40:30,605][26022] Updated weights on worker 0-0, policy_version 451669 (0.00088) [2022-07-09 22:40:32,093][26022] Updated weights on worker 0-0, policy_version 451679 (0.00085) [2022-07-09 22:40:33,024][25689] Fps is (10 sec: 5562.0, 60 sec: 5687.6, 300 sec: 5687.6). Total num frames: 462523392. Throughput: 0: 5926.8. Samples: 462530252. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-09 22:40:33,025][25689] Avg episode reward: [(0, '-47.209')] [2022-07-09 22:40:34,096][26022] Updated weights on worker 0-0, policy_version 451689 (0.00093) [2022-07-09 22:40:35,842][26022] Updated weights on worker 0-0, policy_version 451699 (0.00086) [2022-07-09 22:40:37,604][26022] Updated weights on worker 0-0, policy_version 451709 (0.00088) [2022-07-09 22:40:38,059][25689] Fps is (10 sec: 5782.6, 60 sec: 5704.7, 300 sec: 5694.3). Total num frames: 462553088. Throughput: 0: 5058.9. Samples: 462547378. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:40:38,062][25689] Avg episode reward: [(0, '-46.709')] [2022-07-09 22:40:39,487][26022] Updated weights on worker 0-0, policy_version 451719 (0.00089) [2022-07-09 22:40:41,164][26022] Updated weights on worker 0-0, policy_version 451729 (0.00084) [2022-07-09 22:40:43,049][26022] Updated weights on worker 0-0, policy_version 451739 (0.00100) [2022-07-09 22:40:43,107][25689] Fps is (10 sec: 5687.6, 60 sec: 5651.4, 300 sec: 5687.3). Total num frames: 462580736. Throughput: 0: 5895.3. Samples: 462581644. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:40:43,107][25689] Avg episode reward: [(0, '-47.211')] [2022-07-09 22:40:44,778][26022] Updated weights on worker 0-0, policy_version 451749 (0.00892) [2022-07-09 22:40:46,487][26022] Updated weights on worker 0-0, policy_version 451759 (0.00089) [2022-07-09 22:40:48,235][25689] Fps is (10 sec: 5534.9, 60 sec: 5668.1, 300 sec: 5684.9). Total num frames: 462609408. Throughput: 0: 5928.0. Samples: 462616432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:40:48,250][25689] Avg episode reward: [(0, '-46.895')] [2022-07-09 22:40:48,338][26022] Updated weights on worker 0-0, policy_version 451769 (0.00090) [2022-07-09 22:40:50,295][26022] Updated weights on worker 0-0, policy_version 451779 (0.00089) [2022-07-09 22:40:51,895][26022] Updated weights on worker 0-0, policy_version 451789 (0.00084) [2022-07-09 22:40:53,346][25689] Fps is (10 sec: 5700.3, 60 sec: 5647.6, 300 sec: 5690.1). Total num frames: 462639104. Throughput: 0: 5078.4. Samples: 462633454. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:40:53,346][25689] Avg episode reward: [(0, '-46.466')] [2022-07-09 22:40:53,905][26022] Updated weights on worker 0-0, policy_version 451799 (0.00090) [2022-07-09 22:40:55,567][26022] Updated weights on worker 0-0, policy_version 451809 (0.00086) [2022-07-09 22:40:57,528][26022] Updated weights on worker 0-0, policy_version 451819 (0.00092) [2022-07-09 22:40:58,423][25689] Fps is (10 sec: 5829.5, 60 sec: 5658.6, 300 sec: 5689.1). Total num frames: 462668800. Throughput: 0: 5907.2. Samples: 462667670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:40:58,424][25689] Avg episode reward: [(0, '-46.872')] [2022-07-09 22:40:59,033][26022] Updated weights on worker 0-0, policy_version 451829 (0.00098) [2022-07-09 22:41:01,004][26022] Updated weights on worker 0-0, policy_version 451839 (0.00080) [2022-07-09 22:41:02,852][26022] Updated weights on worker 0-0, policy_version 451849 (0.00061) [2022-07-09 22:41:03,475][25689] Fps is (10 sec: 5560.4, 60 sec: 5674.3, 300 sec: 5689.0). Total num frames: 462695424. Throughput: 0: 5813.3. Samples: 462700050. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:03,475][25689] Avg episode reward: [(0, '-46.650')] [2022-07-09 22:41:05,049][26022] Updated weights on worker 0-0, policy_version 451859 (0.00092) [2022-07-09 22:41:06,398][26022] Updated weights on worker 0-0, policy_version 451869 (0.00097) [2022-07-09 22:41:08,515][25689] Fps is (10 sec: 5377.9, 60 sec: 5649.1, 300 sec: 5685.3). Total num frames: 462723072. Throughput: 0: 4976.8. Samples: 462717360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:08,515][25689] Avg episode reward: [(0, '-46.853')] [2022-07-09 22:41:08,595][26022] Updated weights on worker 0-0, policy_version 451879 (0.00090) [2022-07-09 22:41:10,010][26022] Updated weights on worker 0-0, policy_version 451889 (0.00061) [2022-07-09 22:41:12,060][26022] Updated weights on worker 0-0, policy_version 451899 (0.00086) [2022-07-09 22:41:13,591][25689] Fps is (10 sec: 5769.8, 60 sec: 5681.8, 300 sec: 5691.3). Total num frames: 462753792. Throughput: 0: 5855.9. Samples: 462752006. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:13,593][25689] Avg episode reward: [(0, '-45.979')] [2022-07-09 22:41:13,750][26022] Updated weights on worker 0-0, policy_version 451909 (0.00090) [2022-07-09 22:41:15,590][26022] Updated weights on worker 0-0, policy_version 451919 (0.00096) [2022-07-09 22:41:17,145][26022] Updated weights on worker 0-0, policy_version 451929 (0.00086) [2022-07-09 22:41:18,656][25689] Fps is (10 sec: 5856.3, 60 sec: 5659.2, 300 sec: 5683.5). Total num frames: 462782464. Throughput: 0: 5889.2. Samples: 462786826. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:18,658][25689] Avg episode reward: [(0, '-45.847')] [2022-07-09 22:41:19,074][26022] Updated weights on worker 0-0, policy_version 451939 (0.00085) [2022-07-09 22:41:20,738][26022] Updated weights on worker 0-0, policy_version 451949 (0.00089) [2022-07-09 22:41:22,655][26022] Updated weights on worker 0-0, policy_version 451959 (0.00086) [2022-07-09 22:41:23,661][25689] Fps is (10 sec: 5796.0, 60 sec: 5679.2, 300 sec: 5695.2). Total num frames: 462812160. Throughput: 0: 6030.1. Samples: 462821778. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:23,662][25689] Avg episode reward: [(0, '-45.564')] [2022-07-09 22:41:24,099][26022] Updated weights on worker 0-0, policy_version 451969 (0.00086) [2022-07-09 22:41:26,505][26022] Updated weights on worker 0-0, policy_version 451979 (0.00086) [2022-07-09 22:41:27,709][26022] Updated weights on worker 0-0, policy_version 451989 (0.00085) [2022-07-09 22:41:28,823][25689] Fps is (10 sec: 5741.0, 60 sec: 5687.3, 300 sec: 5693.2). Total num frames: 462840832. Throughput: 0: 5987.2. Samples: 462838950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:28,823][25689] Avg episode reward: [(0, '-45.401')] [2022-07-09 22:41:29,878][26022] Updated weights on worker 0-0, policy_version 451999 (0.00089) [2022-07-09 22:41:31,340][26022] Updated weights on worker 0-0, policy_version 452009 (0.00088) [2022-07-09 22:41:33,237][26022] Updated weights on worker 0-0, policy_version 452019 (0.00091) [2022-07-09 22:41:33,828][25689] Fps is (10 sec: 5842.0, 60 sec: 5726.0, 300 sec: 5696.7). Total num frames: 462871552. Throughput: 0: 6015.5. Samples: 462873740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:33,828][25689] Avg episode reward: [(0, '-45.216')] [2022-07-09 22:41:34,915][26022] Updated weights on worker 0-0, policy_version 452029 (0.00085) [2022-07-09 22:41:36,591][26022] Updated weights on worker 0-0, policy_version 452039 (0.00097) [2022-07-09 22:41:37,983][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:41:38,002][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000452045_462894080.pth [2022-07-09 22:41:38,002][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000450042_460843008.pth [2022-07-09 22:41:38,535][26022] Updated weights on worker 0-0, policy_version 452049 (0.00076) [2022-07-09 22:41:38,864][25689] Fps is (10 sec: 5813.0, 60 sec: 5692.2, 300 sec: 5692.8). Total num frames: 462899200. Throughput: 0: 5991.0. Samples: 462907890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:38,864][25689] Avg episode reward: [(0, '-45.262')] [2022-07-09 22:41:40,464][26022] Updated weights on worker 0-0, policy_version 452059 (0.00086) [2022-07-09 22:41:42,134][26022] Updated weights on worker 0-0, policy_version 452069 (0.00086) [2022-07-09 22:41:43,877][25689] Fps is (10 sec: 5502.4, 60 sec: 5695.4, 300 sec: 5684.8). Total num frames: 462926848. Throughput: 0: 5110.1. Samples: 462925082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:43,878][25689] Avg episode reward: [(0, '-45.421')] [2022-07-09 22:41:44,034][26022] Updated weights on worker 0-0, policy_version 452079 (0.00095) [2022-07-09 22:41:45,557][26022] Updated weights on worker 0-0, policy_version 452089 (0.00086) [2022-07-09 22:41:47,635][26022] Updated weights on worker 0-0, policy_version 452099 (0.00097) [2022-07-09 22:41:48,976][25689] Fps is (10 sec: 5772.0, 60 sec: 5731.8, 300 sec: 5697.5). Total num frames: 462957568. Throughput: 0: 5972.6. Samples: 462959318. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:48,977][25689] Avg episode reward: [(0, '-45.927')] [2022-07-09 22:41:49,375][26022] Updated weights on worker 0-0, policy_version 452109 (0.00098) [2022-07-09 22:41:51,202][26022] Updated weights on worker 0-0, policy_version 452119 (0.00089) [2022-07-09 22:41:53,025][26022] Updated weights on worker 0-0, policy_version 452129 (0.00094) [2022-07-09 22:41:53,981][25689] Fps is (10 sec: 5675.7, 60 sec: 5691.3, 300 sec: 5691.9). Total num frames: 462984192. Throughput: 0: 5939.3. Samples: 462993434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:53,981][25689] Avg episode reward: [(0, '-45.865')] [2022-07-09 22:41:54,700][26022] Updated weights on worker 0-0, policy_version 452139 (0.00091) [2022-07-09 22:41:56,676][26022] Updated weights on worker 0-0, policy_version 452149 (0.00086) [2022-07-09 22:41:58,318][26022] Updated weights on worker 0-0, policy_version 452159 (0.00091) [2022-07-09 22:41:58,983][25689] Fps is (10 sec: 5628.1, 60 sec: 5698.3, 300 sec: 5686.0). Total num frames: 463013888. Throughput: 0: 5092.6. Samples: 463010350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:41:58,984][25689] Avg episode reward: [(0, '-46.504')] [2022-07-09 22:42:00,095][26022] Updated weights on worker 0-0, policy_version 452169 (0.00067) [2022-07-09 22:42:02,378][26022] Updated weights on worker 0-0, policy_version 452179 (0.00090) [2022-07-09 22:42:04,007][25689] Fps is (10 sec: 5515.2, 60 sec: 5684.0, 300 sec: 5694.6). Total num frames: 463039488. Throughput: 0: 5836.2. Samples: 463042562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:04,007][25689] Avg episode reward: [(0, '-45.933')] [2022-07-09 22:42:04,208][26022] Updated weights on worker 0-0, policy_version 452189 (0.00086) [2022-07-09 22:42:06,095][26022] Updated weights on worker 0-0, policy_version 452199 (0.00086) [2022-07-09 22:42:07,579][26022] Updated weights on worker 0-0, policy_version 452209 (0.00078) [2022-07-09 22:42:09,063][25689] Fps is (10 sec: 5384.4, 60 sec: 5699.4, 300 sec: 5686.9). Total num frames: 463068160. Throughput: 0: 5853.5. Samples: 463076894. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:09,063][25689] Avg episode reward: [(0, '-45.388')] [2022-07-09 22:42:09,705][26022] Updated weights on worker 0-0, policy_version 452219 (0.00097) [2022-07-09 22:42:11,128][26022] Updated weights on worker 0-0, policy_version 452229 (0.00088) [2022-07-09 22:42:13,411][26022] Updated weights on worker 0-0, policy_version 452239 (0.00091) [2022-07-09 22:42:14,068][25689] Fps is (10 sec: 5699.6, 60 sec: 5672.3, 300 sec: 5686.8). Total num frames: 463096832. Throughput: 0: 5014.0. Samples: 463094150. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:14,068][25689] Avg episode reward: [(0, '-45.090')] [2022-07-09 22:42:14,740][26022] Updated weights on worker 0-0, policy_version 452249 (0.00090) [2022-07-09 22:42:16,763][26022] Updated weights on worker 0-0, policy_version 452259 (0.00089) [2022-07-09 22:42:18,733][26022] Updated weights on worker 0-0, policy_version 452269 (0.00091) [2022-07-09 22:42:19,099][25689] Fps is (10 sec: 5713.6, 60 sec: 5675.4, 300 sec: 5686.8). Total num frames: 463125504. Throughput: 0: 5854.0. Samples: 463128108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:19,100][25689] Avg episode reward: [(0, '-45.065')] [2022-07-09 22:42:20,322][26022] Updated weights on worker 0-0, policy_version 452279 (0.00087) [2022-07-09 22:42:22,318][26022] Updated weights on worker 0-0, policy_version 452289 (0.00089) [2022-07-09 22:42:24,123][25689] Fps is (10 sec: 5600.8, 60 sec: 5639.8, 300 sec: 5688.0). Total num frames: 463153152. Throughput: 0: 5942.5. Samples: 463162106. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:24,124][25689] Avg episode reward: [(0, '-45.402')] [2022-07-09 22:42:24,194][26022] Updated weights on worker 0-0, policy_version 452299 (0.00088) [2022-07-09 22:42:25,804][26022] Updated weights on worker 0-0, policy_version 452309 (0.00090) [2022-07-09 22:42:27,813][26022] Updated weights on worker 0-0, policy_version 452319 (0.00090) [2022-07-09 22:42:29,188][25689] Fps is (10 sec: 5683.8, 60 sec: 5665.8, 300 sec: 5690.9). Total num frames: 463182848. Throughput: 0: 5072.4. Samples: 463178978. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:29,192][25689] Avg episode reward: [(0, '-45.508')] [2022-07-09 22:42:29,497][26022] Updated weights on worker 0-0, policy_version 452329 (0.00087) [2022-07-09 22:42:31,400][26022] Updated weights on worker 0-0, policy_version 452339 (0.00084) [2022-07-09 22:42:33,326][26022] Updated weights on worker 0-0, policy_version 452350 (0.00088) [2022-07-09 22:42:34,203][25689] Fps is (10 sec: 5790.9, 60 sec: 5631.0, 300 sec: 5684.2). Total num frames: 463211520. Throughput: 0: 5885.1. Samples: 463212646. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:34,203][25689] Avg episode reward: [(0, '-45.445')] [2022-07-09 22:42:35,243][26022] Updated weights on worker 0-0, policy_version 452360 (0.00196) [2022-07-09 22:42:36,838][26022] Updated weights on worker 0-0, policy_version 452370 (0.00091) [2022-07-09 22:42:38,857][26022] Updated weights on worker 0-0, policy_version 452380 (0.00082) [2022-07-09 22:42:39,210][25689] Fps is (10 sec: 5517.6, 60 sec: 5616.7, 300 sec: 5677.6). Total num frames: 463238144. Throughput: 0: 5916.8. Samples: 463247100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:39,212][25689] Avg episode reward: [(0, '-46.447')] [2022-07-09 22:42:40,451][26022] Updated weights on worker 0-0, policy_version 452390 (0.00084) [2022-07-09 22:42:42,400][26022] Updated weights on worker 0-0, policy_version 452400 (0.00088) [2022-07-09 22:42:44,093][26022] Updated weights on worker 0-0, policy_version 452410 (0.00083) [2022-07-09 22:42:44,214][25689] Fps is (10 sec: 5625.7, 60 sec: 5651.5, 300 sec: 5686.8). Total num frames: 463267840. Throughput: 0: 5074.2. Samples: 463264048. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:44,215][25689] Avg episode reward: [(0, '-46.605')] [2022-07-09 22:42:45,942][26022] Updated weights on worker 0-0, policy_version 452420 (0.00092) [2022-07-09 22:42:47,843][26022] Updated weights on worker 0-0, policy_version 452430 (0.00084) [2022-07-09 22:42:49,288][25689] Fps is (10 sec: 5893.2, 60 sec: 5636.8, 300 sec: 5683.4). Total num frames: 463297536. Throughput: 0: 5929.4. Samples: 463298158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:49,289][25689] Avg episode reward: [(0, '-45.827')] [2022-07-09 22:42:49,520][26022] Updated weights on worker 0-0, policy_version 452440 (0.00086) [2022-07-09 22:42:51,244][26022] Updated weights on worker 0-0, policy_version 452450 (0.00085) [2022-07-09 22:42:53,288][26022] Updated weights on worker 0-0, policy_version 452460 (0.00651) [2022-07-09 22:42:54,363][25689] Fps is (10 sec: 5751.1, 60 sec: 5664.2, 300 sec: 5683.5). Total num frames: 463326208. Throughput: 0: 5937.7. Samples: 463332350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:54,363][25689] Avg episode reward: [(0, '-45.902')] [2022-07-09 22:42:54,976][26022] Updated weights on worker 0-0, policy_version 452470 (0.00072) [2022-07-09 22:42:56,829][26022] Updated weights on worker 0-0, policy_version 452480 (0.00084) [2022-07-09 22:42:58,558][26022] Updated weights on worker 0-0, policy_version 452490 (0.00083) [2022-07-09 22:42:59,427][25689] Fps is (10 sec: 5554.7, 60 sec: 5624.5, 300 sec: 5682.7). Total num frames: 463353856. Throughput: 0: 5055.4. Samples: 463349306. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:42:59,428][25689] Avg episode reward: [(0, '-45.843')] [2022-07-09 22:43:00,361][26022] Updated weights on worker 0-0, policy_version 452500 (0.00079) [2022-07-09 22:43:02,677][26022] Updated weights on worker 0-0, policy_version 452510 (0.00087) [2022-07-09 22:43:04,435][25689] Fps is (10 sec: 5286.5, 60 sec: 5625.9, 300 sec: 5674.2). Total num frames: 463379456. Throughput: 0: 5776.8. Samples: 463380862. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:43:04,436][25689] Avg episode reward: [(0, '-46.047')] [2022-07-09 22:43:04,704][26022] Updated weights on worker 0-0, policy_version 452520 (0.00087) [2022-07-09 22:43:06,139][26022] Updated weights on worker 0-0, policy_version 452530 (0.00083) [2022-07-09 22:43:08,104][26022] Updated weights on worker 0-0, policy_version 452540 (0.00634) [2022-07-09 22:43:09,596][25689] Fps is (10 sec: 5438.0, 60 sec: 5633.2, 300 sec: 5678.9). Total num frames: 463409152. Throughput: 0: 5751.6. Samples: 463414958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:43:09,596][25689] Avg episode reward: [(0, '-45.846')] [2022-07-09 22:43:09,773][26022] Updated weights on worker 0-0, policy_version 452550 (0.00086) [2022-07-09 22:43:11,666][26022] Updated weights on worker 0-0, policy_version 452560 (0.00090) [2022-07-09 22:43:13,454][26022] Updated weights on worker 0-0, policy_version 452570 (0.00048) [2022-07-09 22:43:14,636][25689] Fps is (10 sec: 5722.0, 60 sec: 5629.9, 300 sec: 5671.3). Total num frames: 463437824. Throughput: 0: 4917.3. Samples: 463432038. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:43:14,636][25689] Avg episode reward: [(0, '-46.073')] [2022-07-09 22:43:15,083][26022] Updated weights on worker 0-0, policy_version 452580 (0.00086) [2022-07-09 22:43:16,982][26022] Updated weights on worker 0-0, policy_version 452590 (0.00101) [2022-07-09 22:43:18,809][26022] Updated weights on worker 0-0, policy_version 452600 (0.00091) [2022-07-09 22:43:19,675][25689] Fps is (10 sec: 5689.1, 60 sec: 5629.2, 300 sec: 5674.2). Total num frames: 463466496. Throughput: 0: 5789.8. Samples: 463466538. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-09 22:43:19,681][25689] Avg episode reward: [(0, '-46.381')] [2022-07-09 22:43:20,493][26022] Updated weights on worker 0-0, policy_version 452610 (0.00093) [2022-07-09 22:43:22,515][26022] Updated weights on worker 0-0, policy_version 452620 (0.00092) [2022-07-09 22:43:23,998][26022] Updated weights on worker 0-0, policy_version 452630 (0.00087) [2022-07-09 22:43:24,703][25689] Fps is (10 sec: 5797.6, 60 sec: 5662.6, 300 sec: 5679.2). Total num frames: 463496192. Throughput: 0: 5923.6. Samples: 463500922. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:24,704][25689] Avg episode reward: [(0, '-46.243')] [2022-07-09 22:43:26,033][26022] Updated weights on worker 0-0, policy_version 452640 (0.00114) [2022-07-09 22:43:27,862][26022] Updated weights on worker 0-0, policy_version 452650 (0.00096) [2022-07-09 22:43:29,632][26022] Updated weights on worker 0-0, policy_version 452660 (0.00090) [2022-07-09 22:43:29,836][25689] Fps is (10 sec: 5744.3, 60 sec: 5639.4, 300 sec: 5673.5). Total num frames: 463524864. Throughput: 0: 5078.2. Samples: 463517744. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:29,837][25689] Avg episode reward: [(0, '-46.952')] [2022-07-09 22:43:31,517][26022] Updated weights on worker 0-0, policy_version 452670 (0.00086) [2022-07-09 22:43:33,273][26022] Updated weights on worker 0-0, policy_version 452680 (0.00090) [2022-07-09 22:43:34,846][25689] Fps is (10 sec: 5553.1, 60 sec: 5623.0, 300 sec: 5673.5). Total num frames: 463552512. Throughput: 0: 5912.2. Samples: 463551522. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:34,846][25689] Avg episode reward: [(0, '-47.439')] [2022-07-09 22:43:35,114][26022] Updated weights on worker 0-0, policy_version 452690 (0.00090) [2022-07-09 22:43:36,912][26022] Updated weights on worker 0-0, policy_version 452700 (0.00089) [2022-07-09 22:43:38,032][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:43:38,048][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000452706_463570944.pth [2022-07-09 22:43:38,048][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000450709_461526016.pth [2022-07-09 22:43:38,683][26022] Updated weights on worker 0-0, policy_version 452710 (0.00093) [2022-07-09 22:43:39,875][25689] Fps is (10 sec: 5610.4, 60 sec: 5654.7, 300 sec: 5669.8). Total num frames: 463581184. Throughput: 0: 5906.3. Samples: 463585842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:39,875][25689] Avg episode reward: [(0, '-47.662')] [2022-07-09 22:43:40,426][26022] Updated weights on worker 0-0, policy_version 452720 (0.00091) [2022-07-09 22:43:42,223][26022] Updated weights on worker 0-0, policy_version 452730 (0.00087) [2022-07-09 22:43:43,996][26022] Updated weights on worker 0-0, policy_version 452740 (0.00094) [2022-07-09 22:43:44,895][25689] Fps is (10 sec: 5706.3, 60 sec: 5636.3, 300 sec: 5663.5). Total num frames: 463609856. Throughput: 0: 5054.6. Samples: 463602980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:44,895][25689] Avg episode reward: [(0, '-47.687')] [2022-07-09 22:43:45,910][26022] Updated weights on worker 0-0, policy_version 452750 (0.00086) [2022-07-09 22:43:47,577][26022] Updated weights on worker 0-0, policy_version 452760 (0.00093) [2022-07-09 22:43:49,432][26022] Updated weights on worker 0-0, policy_version 452770 (0.00089) [2022-07-09 22:43:49,943][25689] Fps is (10 sec: 5695.3, 60 sec: 5621.8, 300 sec: 5669.7). Total num frames: 463638528. Throughput: 0: 5948.3. Samples: 463637346. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:49,944][25689] Avg episode reward: [(0, '-48.080')] [2022-07-09 22:43:51,121][26022] Updated weights on worker 0-0, policy_version 452780 (0.00090) [2022-07-09 22:43:53,120][26022] Updated weights on worker 0-0, policy_version 452790 (0.00083) [2022-07-09 22:43:55,007][25689] Fps is (10 sec: 5569.5, 60 sec: 5606.0, 300 sec: 5662.2). Total num frames: 463666176. Throughput: 0: 5928.3. Samples: 463671044. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:43:55,007][25689] Avg episode reward: [(0, '-48.138')] [2022-07-09 22:43:55,081][26022] Updated weights on worker 0-0, policy_version 452800 (0.00089) [2022-07-09 22:43:56,743][26022] Updated weights on worker 0-0, policy_version 452810 (0.00092) [2022-07-09 22:43:58,500][26022] Updated weights on worker 0-0, policy_version 452820 (0.00081) [2022-07-09 22:44:00,038][25689] Fps is (10 sec: 5680.5, 60 sec: 5642.8, 300 sec: 5672.1). Total num frames: 463695872. Throughput: 0: 5921.9. Samples: 463705248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:00,039][25689] Avg episode reward: [(0, '-49.454')] [2022-07-09 22:44:00,448][26022] Updated weights on worker 0-0, policy_version 452830 (0.00085) [2022-07-09 22:44:02,453][26022] Updated weights on worker 0-0, policy_version 452840 (0.00092) [2022-07-09 22:44:04,406][26022] Updated weights on worker 0-0, policy_version 452850 (0.00088) [2022-07-09 22:44:05,046][25689] Fps is (10 sec: 5507.9, 60 sec: 5642.8, 300 sec: 5663.6). Total num frames: 463721472. Throughput: 0: 5812.1. Samples: 463720102. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:05,047][25689] Avg episode reward: [(0, '-49.164')] [2022-07-09 22:44:05,970][26022] Updated weights on worker 0-0, policy_version 452860 (0.00089) [2022-07-09 22:44:08,036][26022] Updated weights on worker 0-0, policy_version 452870 (0.00083) [2022-07-09 22:44:09,683][26022] Updated weights on worker 0-0, policy_version 452880 (0.00089) [2022-07-09 22:44:10,131][25689] Fps is (10 sec: 5478.9, 60 sec: 5649.9, 300 sec: 5658.9). Total num frames: 463751168. Throughput: 0: 5785.9. Samples: 463754148. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:10,131][25689] Avg episode reward: [(0, '-48.217')] [2022-07-09 22:44:11,455][26022] Updated weights on worker 0-0, policy_version 452890 (0.00086) [2022-07-09 22:44:13,259][26022] Updated weights on worker 0-0, policy_version 452900 (0.00085) [2022-07-09 22:44:15,149][25689] Fps is (10 sec: 5676.1, 60 sec: 5635.0, 300 sec: 5660.0). Total num frames: 463778816. Throughput: 0: 5846.0. Samples: 463788794. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:15,149][25689] Avg episode reward: [(0, '-48.067')] [2022-07-09 22:44:15,199][26022] Updated weights on worker 0-0, policy_version 452910 (0.00095) [2022-07-09 22:44:16,769][26022] Updated weights on worker 0-0, policy_version 452920 (0.00087) [2022-07-09 22:44:18,706][26022] Updated weights on worker 0-0, policy_version 452930 (0.00089) [2022-07-09 22:44:20,171][25689] Fps is (10 sec: 5813.5, 60 sec: 5670.5, 300 sec: 5663.8). Total num frames: 463809536. Throughput: 0: 5003.1. Samples: 463805972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:20,171][25689] Avg episode reward: [(0, '-47.173')] [2022-07-09 22:44:20,324][26022] Updated weights on worker 0-0, policy_version 452940 (0.00088) [2022-07-09 22:44:22,182][26022] Updated weights on worker 0-0, policy_version 452950 (0.00880) [2022-07-09 22:44:24,028][26022] Updated weights on worker 0-0, policy_version 452960 (0.00084) [2022-07-09 22:44:25,183][25689] Fps is (10 sec: 5919.2, 60 sec: 5655.1, 300 sec: 5665.8). Total num frames: 463838208. Throughput: 0: 5961.2. Samples: 463840140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:25,183][25689] Avg episode reward: [(0, '-46.048')] [2022-07-09 22:44:26,048][26022] Updated weights on worker 0-0, policy_version 452970 (0.00083) [2022-07-09 22:44:27,645][26022] Updated weights on worker 0-0, policy_version 452980 (0.00090) [2022-07-09 22:44:29,452][26022] Updated weights on worker 0-0, policy_version 452990 (0.00088) [2022-07-09 22:44:30,288][25689] Fps is (10 sec: 5465.7, 60 sec: 5623.8, 300 sec: 5660.7). Total num frames: 463864832. Throughput: 0: 5972.4. Samples: 463874534. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:30,290][25689] Avg episode reward: [(0, '-45.847')] [2022-07-09 22:44:31,064][26022] Updated weights on worker 0-0, policy_version 453000 (0.00088) [2022-07-09 22:44:33,180][26022] Updated weights on worker 0-0, policy_version 453010 (0.00098) [2022-07-09 22:44:34,726][26022] Updated weights on worker 0-0, policy_version 453020 (0.00096) [2022-07-09 22:44:35,327][25689] Fps is (10 sec: 5552.0, 60 sec: 5654.9, 300 sec: 5664.1). Total num frames: 463894528. Throughput: 0: 5098.0. Samples: 463891660. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:35,328][25689] Avg episode reward: [(0, '-45.795')] [2022-07-09 22:44:36,597][26022] Updated weights on worker 0-0, policy_version 453030 (0.00093) [2022-07-09 22:44:38,463][26022] Updated weights on worker 0-0, policy_version 453040 (0.00087) [2022-07-09 22:44:40,226][26022] Updated weights on worker 0-0, policy_version 453050 (0.00082) [2022-07-09 22:44:40,359][25689] Fps is (10 sec: 5897.4, 60 sec: 5671.6, 300 sec: 5660.4). Total num frames: 463924224. Throughput: 0: 5944.1. Samples: 463925970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:40,359][25689] Avg episode reward: [(0, '-45.842')] [2022-07-09 22:44:41,892][26022] Updated weights on worker 0-0, policy_version 453060 (0.00086) [2022-07-09 22:44:43,875][26022] Updated weights on worker 0-0, policy_version 453070 (0.00089) [2022-07-09 22:44:45,367][25689] Fps is (10 sec: 5813.3, 60 sec: 5672.7, 300 sec: 5666.1). Total num frames: 463952896. Throughput: 0: 5957.4. Samples: 463960386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:45,368][25689] Avg episode reward: [(0, '-46.375')] [2022-07-09 22:44:45,491][26022] Updated weights on worker 0-0, policy_version 453080 (0.00088) [2022-07-09 22:44:47,356][26022] Updated weights on worker 0-0, policy_version 453090 (0.00080) [2022-07-09 22:44:48,943][26022] Updated weights on worker 0-0, policy_version 453100 (0.00081) [2022-07-09 22:44:50,486][25689] Fps is (10 sec: 5560.9, 60 sec: 5649.1, 300 sec: 5654.9). Total num frames: 463980544. Throughput: 0: 5103.7. Samples: 463977622. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:50,487][25689] Avg episode reward: [(0, '-46.672')] [2022-07-09 22:44:50,910][26022] Updated weights on worker 0-0, policy_version 453110 (0.00093) [2022-07-09 22:44:52,489][26022] Updated weights on worker 0-0, policy_version 453120 (0.00617) [2022-07-09 22:44:54,402][26022] Updated weights on worker 0-0, policy_version 453130 (0.00096) [2022-07-09 22:44:55,534][25689] Fps is (10 sec: 5741.1, 60 sec: 5701.4, 300 sec: 5661.1). Total num frames: 464011264. Throughput: 0: 5973.1. Samples: 464012360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:44:55,534][25689] Avg episode reward: [(0, '-46.184')] [2022-07-09 22:44:56,127][26022] Updated weights on worker 0-0, policy_version 453140 (0.00088) [2022-07-09 22:44:57,851][26022] Updated weights on worker 0-0, policy_version 453150 (0.00085) [2022-07-09 22:44:59,904][26022] Updated weights on worker 0-0, policy_version 453160 (0.01014) [2022-07-09 22:45:00,537][25689] Fps is (10 sec: 5909.2, 60 sec: 5687.1, 300 sec: 5672.1). Total num frames: 464039936. Throughput: 0: 5986.3. Samples: 464046766. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:00,538][25689] Avg episode reward: [(0, '-45.811')] [2022-07-09 22:45:01,613][26022] Updated weights on worker 0-0, policy_version 453170 (0.00086) [2022-07-09 22:45:03,825][26022] Updated weights on worker 0-0, policy_version 453180 (0.00097) [2022-07-09 22:45:05,557][25689] Fps is (10 sec: 5414.5, 60 sec: 5686.0, 300 sec: 5660.5). Total num frames: 464065536. Throughput: 0: 5025.9. Samples: 464061860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:05,558][25689] Avg episode reward: [(0, '-45.503')] [2022-07-09 22:45:05,674][26022] Updated weights on worker 0-0, policy_version 453190 (0.00084) [2022-07-09 22:45:07,367][26022] Updated weights on worker 0-0, policy_version 453200 (0.00089) [2022-07-09 22:45:09,323][26022] Updated weights on worker 0-0, policy_version 453210 (0.00059) [2022-07-09 22:45:10,681][25689] Fps is (10 sec: 5451.2, 60 sec: 5682.3, 300 sec: 5662.8). Total num frames: 464095232. Throughput: 0: 5850.7. Samples: 464095774. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:10,682][25689] Avg episode reward: [(0, '-45.108')] [2022-07-09 22:45:11,008][26022] Updated weights on worker 0-0, policy_version 453220 (0.00090) [2022-07-09 22:45:12,729][26022] Updated weights on worker 0-0, policy_version 453230 (0.00088) [2022-07-09 22:45:14,599][26022] Updated weights on worker 0-0, policy_version 453240 (0.00087) [2022-07-09 22:45:15,743][25689] Fps is (10 sec: 5730.3, 60 sec: 5695.1, 300 sec: 5658.3). Total num frames: 464123904. Throughput: 0: 5826.8. Samples: 464130116. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:15,744][25689] Avg episode reward: [(0, '-45.236')] [2022-07-09 22:45:16,303][26022] Updated weights on worker 0-0, policy_version 453250 (0.00096) [2022-07-09 22:45:18,281][26022] Updated weights on worker 0-0, policy_version 453260 (0.00083) [2022-07-09 22:45:19,979][26022] Updated weights on worker 0-0, policy_version 453270 (0.00086) [2022-07-09 22:45:20,758][25689] Fps is (10 sec: 5792.2, 60 sec: 5678.8, 300 sec: 5662.2). Total num frames: 464153600. Throughput: 0: 4970.1. Samples: 464147266. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:20,759][25689] Avg episode reward: [(0, '-44.915')] [2022-07-09 22:45:21,682][26022] Updated weights on worker 0-0, policy_version 453280 (0.00098) [2022-07-09 22:45:23,616][26022] Updated weights on worker 0-0, policy_version 453290 (0.00092) [2022-07-09 22:45:25,135][26022] Updated weights on worker 0-0, policy_version 453300 (0.00086) [2022-07-09 22:45:25,810][25689] Fps is (10 sec: 5696.0, 60 sec: 5658.1, 300 sec: 5662.4). Total num frames: 464181248. Throughput: 0: 5923.8. Samples: 464181836. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:25,811][25689] Avg episode reward: [(0, '-45.723')] [2022-07-09 22:45:27,269][26022] Updated weights on worker 0-0, policy_version 453310 (0.00083) [2022-07-09 22:45:28,847][26022] Updated weights on worker 0-0, policy_version 453320 (0.00088) [2022-07-09 22:45:30,879][25689] Fps is (10 sec: 5463.6, 60 sec: 5678.4, 300 sec: 5658.8). Total num frames: 464208896. Throughput: 0: 5945.8. Samples: 464215866. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:30,879][25689] Avg episode reward: [(0, '-46.220')] [2022-07-09 22:45:31,002][26022] Updated weights on worker 0-0, policy_version 453330 (0.00095) [2022-07-09 22:45:32,456][26022] Updated weights on worker 0-0, policy_version 453340 (0.00094) [2022-07-09 22:45:34,511][26022] Updated weights on worker 0-0, policy_version 453350 (0.00086) [2022-07-09 22:45:35,959][25689] Fps is (10 sec: 5751.4, 60 sec: 5691.5, 300 sec: 5661.4). Total num frames: 464239616. Throughput: 0: 5089.9. Samples: 464233014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:35,960][25689] Avg episode reward: [(0, '-46.473')] [2022-07-09 22:45:36,130][26022] Updated weights on worker 0-0, policy_version 453360 (0.00089) [2022-07-09 22:45:38,150][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:45:38,162][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000453370_464250880.pth [2022-07-09 22:45:38,162][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000451379_462212096.pth [2022-07-09 22:45:38,169][26022] Updated weights on worker 0-0, policy_version 453370 (0.00084) [2022-07-09 22:45:39,743][26022] Updated weights on worker 0-0, policy_version 453380 (0.00083) [2022-07-09 22:45:41,033][25689] Fps is (10 sec: 5748.0, 60 sec: 5653.7, 300 sec: 5660.9). Total num frames: 464267264. Throughput: 0: 5900.7. Samples: 464266904. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:41,034][25689] Avg episode reward: [(0, '-46.011')] [2022-07-09 22:45:41,511][26022] Updated weights on worker 0-0, policy_version 453390 (0.00091) [2022-07-09 22:45:43,437][26022] Updated weights on worker 0-0, policy_version 453400 (0.00090) [2022-07-09 22:45:45,163][26022] Updated weights on worker 0-0, policy_version 453410 (0.00092) [2022-07-09 22:45:46,059][25689] Fps is (10 sec: 5677.5, 60 sec: 5669.0, 300 sec: 5666.3). Total num frames: 464296960. Throughput: 0: 5889.6. Samples: 464301092. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:46,060][25689] Avg episode reward: [(0, '-46.525')] [2022-07-09 22:45:47,028][26022] Updated weights on worker 0-0, policy_version 453420 (0.00089) [2022-07-09 22:45:48,917][26022] Updated weights on worker 0-0, policy_version 453430 (0.00087) [2022-07-09 22:45:50,602][26022] Updated weights on worker 0-0, policy_version 453440 (0.00086) [2022-07-09 22:45:51,131][25689] Fps is (10 sec: 5679.3, 60 sec: 5673.5, 300 sec: 5660.1). Total num frames: 464324608. Throughput: 0: 5054.6. Samples: 464318230. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:51,131][25689] Avg episode reward: [(0, '-46.387')] [2022-07-09 22:45:52,358][26022] Updated weights on worker 0-0, policy_version 453450 (0.00085) [2022-07-09 22:45:54,175][26022] Updated weights on worker 0-0, policy_version 453460 (0.00614) [2022-07-09 22:45:55,944][26022] Updated weights on worker 0-0, policy_version 453470 (0.00091) [2022-07-09 22:45:56,183][25689] Fps is (10 sec: 5563.3, 60 sec: 5639.3, 300 sec: 5657.2). Total num frames: 464353280. Throughput: 0: 5893.2. Samples: 464352196. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:45:56,183][25689] Avg episode reward: [(0, '-46.728')] [2022-07-09 22:45:57,886][26022] Updated weights on worker 0-0, policy_version 453480 (0.00088) [2022-07-09 22:45:59,462][26022] Updated weights on worker 0-0, policy_version 453490 (0.00092) [2022-07-09 22:46:01,239][25689] Fps is (10 sec: 5672.8, 60 sec: 5634.4, 300 sec: 5664.0). Total num frames: 464381952. Throughput: 0: 5911.4. Samples: 464386348. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:46:01,240][25689] Avg episode reward: [(0, '-46.449')] [2022-07-09 22:46:01,372][26022] Updated weights on worker 0-0, policy_version 453500 (0.00087) [2022-07-09 22:46:03,507][26022] Updated weights on worker 0-0, policy_version 453510 (0.00087) [2022-07-09 22:46:05,279][26022] Updated weights on worker 0-0, policy_version 453520 (0.00083) [2022-07-09 22:46:06,301][25689] Fps is (10 sec: 5464.9, 60 sec: 5647.3, 300 sec: 5660.1). Total num frames: 464408576. Throughput: 0: 5807.6. Samples: 464418650. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:46:06,303][25689] Avg episode reward: [(0, '-46.482')] [2022-07-09 22:46:07,172][26022] Updated weights on worker 0-0, policy_version 453530 (0.00094) [2022-07-09 22:46:08,967][26022] Updated weights on worker 0-0, policy_version 453540 (0.00084) [2022-07-09 22:46:10,706][26022] Updated weights on worker 0-0, policy_version 453550 (0.00092) [2022-07-09 22:46:11,351][25689] Fps is (10 sec: 5671.1, 60 sec: 5671.1, 300 sec: 5660.6). Total num frames: 464439296. Throughput: 0: 5817.4. Samples: 464435858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:11,352][25689] Avg episode reward: [(0, '-46.651')] [2022-07-09 22:46:12,566][26022] Updated weights on worker 0-0, policy_version 453560 (0.00088) [2022-07-09 22:46:14,233][26022] Updated weights on worker 0-0, policy_version 453570 (0.00097) [2022-07-09 22:46:16,086][26022] Updated weights on worker 0-0, policy_version 453580 (0.00085) [2022-07-09 22:46:16,353][25689] Fps is (10 sec: 5806.4, 60 sec: 5659.8, 300 sec: 5658.4). Total num frames: 464466944. Throughput: 0: 5860.2. Samples: 464470400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:16,354][25689] Avg episode reward: [(0, '-46.296')] [2022-07-09 22:46:17,935][26022] Updated weights on worker 0-0, policy_version 453590 (0.00085) [2022-07-09 22:46:19,822][26022] Updated weights on worker 0-0, policy_version 453600 (0.01356) [2022-07-09 22:46:21,378][25689] Fps is (10 sec: 5718.8, 60 sec: 5658.9, 300 sec: 5658.0). Total num frames: 464496640. Throughput: 0: 5890.9. Samples: 464504984. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:21,379][25689] Avg episode reward: [(0, '-46.439')] [2022-07-09 22:46:21,380][26022] Updated weights on worker 0-0, policy_version 453610 (0.00084) [2022-07-09 22:46:23,251][26022] Updated weights on worker 0-0, policy_version 453620 (0.00086) [2022-07-09 22:46:24,977][26022] Updated weights on worker 0-0, policy_version 453630 (0.00092) [2022-07-09 22:46:26,402][25689] Fps is (10 sec: 5808.6, 60 sec: 5678.4, 300 sec: 5660.6). Total num frames: 464525312. Throughput: 0: 5154.4. Samples: 464522258. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:26,403][25689] Avg episode reward: [(0, '-46.073')] [2022-07-09 22:46:26,725][26022] Updated weights on worker 0-0, policy_version 453640 (0.00091) [2022-07-09 22:46:28,625][26022] Updated weights on worker 0-0, policy_version 453650 (0.00089) [2022-07-09 22:46:30,513][26022] Updated weights on worker 0-0, policy_version 453660 (0.00100) [2022-07-09 22:46:31,454][25689] Fps is (10 sec: 5589.6, 60 sec: 5680.0, 300 sec: 5649.4). Total num frames: 464552960. Throughput: 0: 5996.2. Samples: 464556402. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:31,454][25689] Avg episode reward: [(0, '-45.783')] [2022-07-09 22:46:32,175][26022] Updated weights on worker 0-0, policy_version 453670 (0.00089) [2022-07-09 22:46:33,944][26022] Updated weights on worker 0-0, policy_version 453680 (0.00106) [2022-07-09 22:46:35,646][26022] Updated weights on worker 0-0, policy_version 453690 (0.00094) [2022-07-09 22:46:36,465][25689] Fps is (10 sec: 5596.8, 60 sec: 5652.6, 300 sec: 5653.3). Total num frames: 464581632. Throughput: 0: 5967.1. Samples: 464590408. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:36,466][25689] Avg episode reward: [(0, '-45.860')] [2022-07-09 22:46:37,631][26022] Updated weights on worker 0-0, policy_version 453700 (0.00092) [2022-07-09 22:46:39,294][26022] Updated weights on worker 0-0, policy_version 453710 (0.00092) [2022-07-09 22:46:41,254][26022] Updated weights on worker 0-0, policy_version 453720 (0.00086) [2022-07-09 22:46:41,482][25689] Fps is (10 sec: 5718.6, 60 sec: 5674.9, 300 sec: 5656.7). Total num frames: 464610304. Throughput: 0: 5102.0. Samples: 464607552. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:41,482][25689] Avg episode reward: [(0, '-46.140')] [2022-07-09 22:46:42,939][26022] Updated weights on worker 0-0, policy_version 453730 (0.00096) [2022-07-09 22:46:44,990][26022] Updated weights on worker 0-0, policy_version 453740 (0.00086) [2022-07-09 22:46:46,490][25689] Fps is (10 sec: 5720.2, 60 sec: 5659.7, 300 sec: 5651.5). Total num frames: 464638976. Throughput: 0: 5946.4. Samples: 464641708. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:46,490][25689] Avg episode reward: [(0, '-45.652')] [2022-07-09 22:46:46,543][26022] Updated weights on worker 0-0, policy_version 453750 (0.00088) [2022-07-09 22:46:48,511][26022] Updated weights on worker 0-0, policy_version 453760 (0.00085) [2022-07-09 22:46:50,180][26022] Updated weights on worker 0-0, policy_version 453770 (0.00087) [2022-07-09 22:46:51,602][25689] Fps is (10 sec: 5666.4, 60 sec: 5672.8, 300 sec: 5656.4). Total num frames: 464667648. Throughput: 0: 5940.5. Samples: 464676090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:51,602][25689] Avg episode reward: [(0, '-45.296')] [2022-07-09 22:46:51,882][26022] Updated weights on worker 0-0, policy_version 453780 (0.00084) [2022-07-09 22:46:53,842][26022] Updated weights on worker 0-0, policy_version 453790 (0.00091) [2022-07-09 22:46:55,714][26022] Updated weights on worker 0-0, policy_version 453800 (0.00164) [2022-07-09 22:46:56,631][25689] Fps is (10 sec: 5553.7, 60 sec: 5658.0, 300 sec: 5649.0). Total num frames: 464695296. Throughput: 0: 5095.4. Samples: 464693160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:46:56,631][25689] Avg episode reward: [(0, '-45.488')] [2022-07-09 22:46:57,462][26022] Updated weights on worker 0-0, policy_version 453810 (0.00090) [2022-07-09 22:46:59,432][26022] Updated weights on worker 0-0, policy_version 453820 (0.00086) [2022-07-09 22:47:00,866][26022] Updated weights on worker 0-0, policy_version 453830 (0.00095) [2022-07-09 22:47:01,641][25689] Fps is (10 sec: 5711.8, 60 sec: 5679.3, 300 sec: 5663.0). Total num frames: 464724992. Throughput: 0: 5934.4. Samples: 464727186. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:01,642][25689] Avg episode reward: [(0, '-45.445')] [2022-07-09 22:47:03,429][26022] Updated weights on worker 0-0, policy_version 453840 (0.00081) [2022-07-09 22:47:04,844][26022] Updated weights on worker 0-0, policy_version 453850 (0.00090) [2022-07-09 22:47:06,648][25689] Fps is (10 sec: 5622.2, 60 sec: 5684.5, 300 sec: 5657.0). Total num frames: 464751616. Throughput: 0: 5828.4. Samples: 464759200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:06,650][25689] Avg episode reward: [(0, '-45.085')] [2022-07-09 22:47:06,840][26022] Updated weights on worker 0-0, policy_version 453860 (0.00095) [2022-07-09 22:47:08,795][26022] Updated weights on worker 0-0, policy_version 453870 (0.00083) [2022-07-09 22:47:10,509][26022] Updated weights on worker 0-0, policy_version 453880 (0.00086) [2022-07-09 22:47:11,754][25689] Fps is (10 sec: 5468.1, 60 sec: 5645.3, 300 sec: 5655.1). Total num frames: 464780288. Throughput: 0: 4968.0. Samples: 464776206. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:11,756][25689] Avg episode reward: [(0, '-45.826')] [2022-07-09 22:47:12,267][26022] Updated weights on worker 0-0, policy_version 453890 (0.00082) [2022-07-09 22:47:14,183][26022] Updated weights on worker 0-0, policy_version 453901 (0.00085) [2022-07-09 22:47:15,955][26022] Updated weights on worker 0-0, policy_version 453911 (0.00088) [2022-07-09 22:47:16,781][25689] Fps is (10 sec: 5760.3, 60 sec: 5676.9, 300 sec: 5658.7). Total num frames: 464809984. Throughput: 0: 5824.5. Samples: 464810526. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:16,783][25689] Avg episode reward: [(0, '-47.039')] [2022-07-09 22:47:17,790][26022] Updated weights on worker 0-0, policy_version 453921 (0.00101) [2022-07-09 22:47:19,515][26022] Updated weights on worker 0-0, policy_version 453931 (0.00093) [2022-07-09 22:47:21,465][26022] Updated weights on worker 0-0, policy_version 453941 (0.00097) [2022-07-09 22:47:21,784][25689] Fps is (10 sec: 5717.3, 60 sec: 5645.0, 300 sec: 5659.1). Total num frames: 464837632. Throughput: 0: 5841.8. Samples: 464844854. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:21,786][25689] Avg episode reward: [(0, '-47.286')] [2022-07-09 22:47:23,144][26022] Updated weights on worker 0-0, policy_version 453951 (0.00087) [2022-07-09 22:47:24,889][26022] Updated weights on worker 0-0, policy_version 453961 (0.00094) [2022-07-09 22:47:26,739][26022] Updated weights on worker 0-0, policy_version 453971 (0.00074) [2022-07-09 22:47:26,800][25689] Fps is (10 sec: 5723.6, 60 sec: 5662.7, 300 sec: 5660.0). Total num frames: 464867328. Throughput: 0: 5116.2. Samples: 464862300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:26,801][25689] Avg episode reward: [(0, '-47.219')] [2022-07-09 22:47:28,453][26022] Updated weights on worker 0-0, policy_version 453981 (0.00087) [2022-07-09 22:47:30,390][26022] Updated weights on worker 0-0, policy_version 453991 (0.00097) [2022-07-09 22:47:31,838][25689] Fps is (10 sec: 5601.7, 60 sec: 5647.1, 300 sec: 5652.7). Total num frames: 464893952. Throughput: 0: 5980.4. Samples: 464896318. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:31,838][25689] Avg episode reward: [(0, '-46.424')] [2022-07-09 22:47:32,025][26022] Updated weights on worker 0-0, policy_version 454001 (0.00095) [2022-07-09 22:47:33,799][26022] Updated weights on worker 0-0, policy_version 454011 (0.00080) [2022-07-09 22:47:35,551][26022] Updated weights on worker 0-0, policy_version 454021 (0.00088) [2022-07-09 22:47:36,869][25689] Fps is (10 sec: 5593.3, 60 sec: 5662.1, 300 sec: 5662.5). Total num frames: 464923648. Throughput: 0: 5977.0. Samples: 464930592. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:36,869][25689] Avg episode reward: [(0, '-46.373')] [2022-07-09 22:47:37,599][26022] Updated weights on worker 0-0, policy_version 454031 (0.00094) [2022-07-09 22:47:38,173][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:47:38,195][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000454035_464931840.pth [2022-07-09 22:47:38,195][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000452045_462894080.pth [2022-07-09 22:47:39,445][26022] Updated weights on worker 0-0, policy_version 454041 (0.00085) [2022-07-09 22:47:41,041][26022] Updated weights on worker 0-0, policy_version 454051 (0.00090) [2022-07-09 22:47:41,883][25689] Fps is (10 sec: 5810.6, 60 sec: 5662.4, 300 sec: 5658.9). Total num frames: 464952320. Throughput: 0: 5116.0. Samples: 464947682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:41,883][25689] Avg episode reward: [(0, '-46.422')] [2022-07-09 22:47:43,067][26022] Updated weights on worker 0-0, policy_version 454061 (0.00085) [2022-07-09 22:47:44,595][26022] Updated weights on worker 0-0, policy_version 454071 (0.00095) [2022-07-09 22:47:46,586][26022] Updated weights on worker 0-0, policy_version 454081 (0.00084) [2022-07-09 22:47:46,901][25689] Fps is (10 sec: 5614.1, 60 sec: 5644.5, 300 sec: 5653.1). Total num frames: 464979968. Throughput: 0: 5954.2. Samples: 464981986. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:46,902][25689] Avg episode reward: [(0, '-45.572')] [2022-07-09 22:47:48,103][26022] Updated weights on worker 0-0, policy_version 454091 (0.00086) [2022-07-09 22:47:50,132][26022] Updated weights on worker 0-0, policy_version 454101 (0.00090) [2022-07-09 22:47:51,676][26022] Updated weights on worker 0-0, policy_version 454111 (0.00094) [2022-07-09 22:47:51,948][25689] Fps is (10 sec: 5798.8, 60 sec: 5684.5, 300 sec: 5660.5). Total num frames: 465010688. Throughput: 0: 5970.7. Samples: 465016392. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:51,949][25689] Avg episode reward: [(0, '-45.744')] [2022-07-09 22:47:53,696][26022] Updated weights on worker 0-0, policy_version 454121 (0.00090) [2022-07-09 22:47:55,243][26022] Updated weights on worker 0-0, policy_version 454131 (0.00085) [2022-07-09 22:47:56,989][25689] Fps is (10 sec: 5785.6, 60 sec: 5683.4, 300 sec: 5660.9). Total num frames: 465038336. Throughput: 0: 5130.1. Samples: 465033812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:47:56,990][25689] Avg episode reward: [(0, '-46.322')] [2022-07-09 22:47:57,289][26022] Updated weights on worker 0-0, policy_version 454141 (0.00085) [2022-07-09 22:47:59,073][26022] Updated weights on worker 0-0, policy_version 454151 (0.00086) [2022-07-09 22:48:00,644][26022] Updated weights on worker 0-0, policy_version 454161 (0.00086) [2022-07-09 22:48:02,017][25689] Fps is (10 sec: 5593.6, 60 sec: 5664.8, 300 sec: 5670.9). Total num frames: 465067008. Throughput: 0: 5995.4. Samples: 465068394. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:02,018][25689] Avg episode reward: [(0, '-46.793')] [2022-07-09 22:48:03,045][26022] Updated weights on worker 0-0, policy_version 454171 (0.00090) [2022-07-09 22:48:04,696][26022] Updated weights on worker 0-0, policy_version 454181 (0.00084) [2022-07-09 22:48:06,652][26022] Updated weights on worker 0-0, policy_version 454191 (0.00090) [2022-07-09 22:48:07,030][25689] Fps is (10 sec: 5507.0, 60 sec: 5664.2, 300 sec: 5663.4). Total num frames: 465093632. Throughput: 0: 5882.3. Samples: 465100394. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:07,031][25689] Avg episode reward: [(0, '-47.167')] [2022-07-09 22:48:08,216][26022] Updated weights on worker 0-0, policy_version 454201 (0.00089) [2022-07-09 22:48:10,263][26022] Updated weights on worker 0-0, policy_version 454211 (0.00373) [2022-07-09 22:48:11,972][26022] Updated weights on worker 0-0, policy_version 454221 (0.00107) [2022-07-09 22:48:12,159][25689] Fps is (10 sec: 5452.5, 60 sec: 5662.1, 300 sec: 5661.7). Total num frames: 465122304. Throughput: 0: 4983.5. Samples: 465117110. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:12,159][25689] Avg episode reward: [(0, '-46.770')] [2022-07-09 22:48:13,666][26022] Updated weights on worker 0-0, policy_version 454231 (0.00091) [2022-07-09 22:48:15,575][26022] Updated weights on worker 0-0, policy_version 454241 (0.00090) [2022-07-09 22:48:17,184][25689] Fps is (10 sec: 5748.8, 60 sec: 5662.3, 300 sec: 5665.4). Total num frames: 465152000. Throughput: 0: 5816.5. Samples: 465151272. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:17,184][25689] Avg episode reward: [(0, '-46.730')] [2022-07-09 22:48:17,374][26022] Updated weights on worker 0-0, policy_version 454251 (0.00086) [2022-07-09 22:48:19,168][26022] Updated weights on worker 0-0, policy_version 454261 (0.00093) [2022-07-09 22:48:21,109][26022] Updated weights on worker 0-0, policy_version 454271 (0.00088) [2022-07-09 22:48:22,237][25689] Fps is (10 sec: 5892.9, 60 sec: 5691.4, 300 sec: 5664.9). Total num frames: 465181696. Throughput: 0: 5798.3. Samples: 465185636. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:22,238][25689] Avg episode reward: [(0, '-47.319')] [2022-07-09 22:48:22,500][26022] Updated weights on worker 0-0, policy_version 454281 (0.00086) [2022-07-09 22:48:24,639][26022] Updated weights on worker 0-0, policy_version 454291 (0.00087) [2022-07-09 22:48:26,326][26022] Updated weights on worker 0-0, policy_version 454301 (0.00095) [2022-07-09 22:48:27,275][25689] Fps is (10 sec: 5581.1, 60 sec: 5638.6, 300 sec: 5659.8). Total num frames: 465208320. Throughput: 0: 5067.1. Samples: 465202974. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:27,275][25689] Avg episode reward: [(0, '-46.826')] [2022-07-09 22:48:28,201][26022] Updated weights on worker 0-0, policy_version 454311 (0.00484) [2022-07-09 22:48:30,052][26022] Updated weights on worker 0-0, policy_version 454321 (0.00091) [2022-07-09 22:48:31,643][26022] Updated weights on worker 0-0, policy_version 454331 (0.00085) [2022-07-09 22:48:32,326][25689] Fps is (10 sec: 5582.4, 60 sec: 5688.1, 300 sec: 5665.9). Total num frames: 465238016. Throughput: 0: 5941.3. Samples: 465236932. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:32,327][25689] Avg episode reward: [(0, '-46.801')] [2022-07-09 22:48:33,645][26022] Updated weights on worker 0-0, policy_version 454341 (0.00079) [2022-07-09 22:48:35,403][26022] Updated weights on worker 0-0, policy_version 454351 (0.00091) [2022-07-09 22:48:37,201][26022] Updated weights on worker 0-0, policy_version 454361 (0.00085) [2022-07-09 22:48:37,334][25689] Fps is (10 sec: 5700.9, 60 sec: 5656.5, 300 sec: 5662.9). Total num frames: 465265664. Throughput: 0: 5945.4. Samples: 465271072. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:37,334][25689] Avg episode reward: [(0, '-46.095')] [2022-07-09 22:48:38,943][26022] Updated weights on worker 0-0, policy_version 454371 (0.00096) [2022-07-09 22:48:40,699][26022] Updated weights on worker 0-0, policy_version 454381 (0.00098) [2022-07-09 22:48:42,345][25689] Fps is (10 sec: 5723.4, 60 sec: 5673.6, 300 sec: 5666.5). Total num frames: 465295360. Throughput: 0: 5096.0. Samples: 465288106. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:42,346][25689] Avg episode reward: [(0, '-47.105')] [2022-07-09 22:48:42,465][26022] Updated weights on worker 0-0, policy_version 454391 (0.00084) [2022-07-09 22:48:44,228][26022] Updated weights on worker 0-0, policy_version 454401 (0.00089) [2022-07-09 22:48:45,980][26022] Updated weights on worker 0-0, policy_version 454411 (0.00090) [2022-07-09 22:48:47,359][25689] Fps is (10 sec: 5822.2, 60 sec: 5691.0, 300 sec: 5667.2). Total num frames: 465324032. Throughput: 0: 5969.8. Samples: 465322870. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:47,359][25689] Avg episode reward: [(0, '-46.068')] [2022-07-09 22:48:47,894][26022] Updated weights on worker 0-0, policy_version 454421 (0.00086) [2022-07-09 22:48:49,797][26022] Updated weights on worker 0-0, policy_version 454431 (0.00089) [2022-07-09 22:48:51,356][26022] Updated weights on worker 0-0, policy_version 454441 (0.00084) [2022-07-09 22:48:52,429][25689] Fps is (10 sec: 5483.9, 60 sec: 5621.1, 300 sec: 5663.6). Total num frames: 465350656. Throughput: 0: 5965.5. Samples: 465356854. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-09 22:48:52,430][25689] Avg episode reward: [(0, '-46.210')] [2022-07-09 22:48:53,347][26022] Updated weights on worker 0-0, policy_version 454451 (0.00086) [2022-07-09 22:48:54,790][26022] Updated weights on worker 0-0, policy_version 454461 (0.00091) [2022-07-09 22:48:56,798][26022] Updated weights on worker 0-0, policy_version 454471 (0.00081) [2022-07-09 22:48:57,469][25689] Fps is (10 sec: 5773.0, 60 sec: 5688.9, 300 sec: 5670.3). Total num frames: 465382400. Throughput: 0: 5122.6. Samples: 465374218. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:48:57,470][25689] Avg episode reward: [(0, '-45.977')] [2022-07-09 22:48:58,831][26022] Updated weights on worker 0-0, policy_version 454481 (0.00090) [2022-07-09 22:49:00,267][26022] Updated weights on worker 0-0, policy_version 454491 (0.00082) [2022-07-09 22:49:02,502][25689] Fps is (10 sec: 5692.8, 60 sec: 5637.7, 300 sec: 5669.9). Total num frames: 465408000. Throughput: 0: 5975.8. Samples: 465408556. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:02,502][25689] Avg episode reward: [(0, '-45.833')] [2022-07-09 22:49:02,746][26022] Updated weights on worker 0-0, policy_version 454501 (0.00088) [2022-07-09 22:49:04,299][26022] Updated weights on worker 0-0, policy_version 454511 (0.00085) [2022-07-09 22:49:06,076][26022] Updated weights on worker 0-0, policy_version 454521 (0.00091) [2022-07-09 22:49:07,506][25689] Fps is (10 sec: 5305.2, 60 sec: 5655.4, 300 sec: 5664.5). Total num frames: 465435648. Throughput: 0: 5852.8. Samples: 465440788. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:07,506][25689] Avg episode reward: [(0, '-47.088')] [2022-07-09 22:49:08,209][26022] Updated weights on worker 0-0, policy_version 454531 (0.00095) [2022-07-09 22:49:09,693][26022] Updated weights on worker 0-0, policy_version 454541 (0.00081) [2022-07-09 22:49:11,700][26022] Updated weights on worker 0-0, policy_version 454551 (0.00082) [2022-07-09 22:49:12,555][25689] Fps is (10 sec: 5704.1, 60 sec: 5679.8, 300 sec: 5670.8). Total num frames: 465465344. Throughput: 0: 5009.4. Samples: 465457674. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:12,555][25689] Avg episode reward: [(0, '-47.423')] [2022-07-09 22:49:13,056][26022] Updated weights on worker 0-0, policy_version 454561 (0.00085) [2022-07-09 22:49:15,219][26022] Updated weights on worker 0-0, policy_version 454571 (0.00078) [2022-07-09 22:49:16,865][26022] Updated weights on worker 0-0, policy_version 454581 (0.00085) [2022-07-09 22:49:17,558][25689] Fps is (10 sec: 5806.5, 60 sec: 5664.9, 300 sec: 5664.3). Total num frames: 465494016. Throughput: 0: 5888.1. Samples: 465492504. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:17,561][25689] Avg episode reward: [(0, '-48.064')] [2022-07-09 22:49:18,778][26022] Updated weights on worker 0-0, policy_version 454591 (0.00089) [2022-07-09 22:49:20,300][26022] Updated weights on worker 0-0, policy_version 454601 (0.00088) [2022-07-09 22:49:22,379][26022] Updated weights on worker 0-0, policy_version 454611 (0.00087) [2022-07-09 22:49:22,565][25689] Fps is (10 sec: 5728.6, 60 sec: 5652.3, 300 sec: 5664.4). Total num frames: 465522688. Throughput: 0: 5902.1. Samples: 465526972. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:22,567][25689] Avg episode reward: [(0, '-49.184')] [2022-07-09 22:49:24,019][26022] Updated weights on worker 0-0, policy_version 454621 (0.00086) [2022-07-09 22:49:25,797][26022] Updated weights on worker 0-0, policy_version 454631 (0.00095) [2022-07-09 22:49:27,571][25689] Fps is (10 sec: 5727.0, 60 sec: 5689.3, 300 sec: 5673.1). Total num frames: 465551360. Throughput: 0: 5156.9. Samples: 465544262. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:27,571][25689] Avg episode reward: [(0, '-50.203')] [2022-07-09 22:49:27,606][26022] Updated weights on worker 0-0, policy_version 454641 (0.00084) [2022-07-09 22:49:29,455][26022] Updated weights on worker 0-0, policy_version 454651 (0.00088) [2022-07-09 22:49:31,255][26022] Updated weights on worker 0-0, policy_version 454661 (0.00084) [2022-07-09 22:49:32,683][25689] Fps is (10 sec: 5667.5, 60 sec: 5666.6, 300 sec: 5668.3). Total num frames: 465580032. Throughput: 0: 5995.1. Samples: 465578344. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:32,683][25689] Avg episode reward: [(0, '-48.766')] [2022-07-09 22:49:33,044][26022] Updated weights on worker 0-0, policy_version 454671 (0.00090) [2022-07-09 22:49:34,804][26022] Updated weights on worker 0-0, policy_version 454681 (0.00090) [2022-07-09 22:49:36,641][26022] Updated weights on worker 0-0, policy_version 454691 (0.00085) [2022-07-09 22:49:37,698][25689] Fps is (10 sec: 5662.7, 60 sec: 5682.9, 300 sec: 5665.2). Total num frames: 465608704. Throughput: 0: 5957.7. Samples: 465612490. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:37,698][25689] Avg episode reward: [(0, '-48.657')] [2022-07-09 22:49:38,250][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:49:38,258][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000454700_465612800.pth [2022-07-09 22:49:38,259][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000452706_463570944.pth [2022-07-09 22:49:38,452][26022] Updated weights on worker 0-0, policy_version 454701 (0.00095) [2022-07-09 22:49:40,298][26022] Updated weights on worker 0-0, policy_version 454711 (0.00087) [2022-07-09 22:49:42,122][26022] Updated weights on worker 0-0, policy_version 454721 (0.00088) [2022-07-09 22:49:42,712][25689] Fps is (10 sec: 5615.6, 60 sec: 5648.7, 300 sec: 5661.6). Total num frames: 465636352. Throughput: 0: 5921.9. Samples: 465646282. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:42,714][25689] Avg episode reward: [(0, '-48.176')] [2022-07-09 22:49:43,967][26022] Updated weights on worker 0-0, policy_version 454731 (0.00092) [2022-07-09 22:49:45,775][26022] Updated weights on worker 0-0, policy_version 454741 (0.00095) [2022-07-09 22:49:47,521][26022] Updated weights on worker 0-0, policy_version 454751 (0.00090) [2022-07-09 22:49:47,755][25689] Fps is (10 sec: 5701.9, 60 sec: 5662.9, 300 sec: 5670.0). Total num frames: 465666048. Throughput: 0: 5904.4. Samples: 465663434. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:47,755][25689] Avg episode reward: [(0, '-48.375')] [2022-07-09 22:49:49,446][26022] Updated weights on worker 0-0, policy_version 454761 (0.00093) [2022-07-09 22:49:51,138][26022] Updated weights on worker 0-0, policy_version 454771 (0.00080) [2022-07-09 22:49:52,812][25689] Fps is (10 sec: 5778.7, 60 sec: 5697.9, 300 sec: 5662.9). Total num frames: 465694720. Throughput: 0: 5916.8. Samples: 465697448. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:52,814][25689] Avg episode reward: [(0, '-48.058')] [2022-07-09 22:49:52,937][26022] Updated weights on worker 0-0, policy_version 454781 (0.00088) [2022-07-09 22:49:54,626][26022] Updated weights on worker 0-0, policy_version 454791 (0.00090) [2022-07-09 22:49:56,579][26022] Updated weights on worker 0-0, policy_version 454801 (0.00086) [2022-07-09 22:49:57,836][25689] Fps is (10 sec: 5688.0, 60 sec: 5648.6, 300 sec: 5662.5). Total num frames: 465723392. Throughput: 0: 5925.1. Samples: 465731814. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:49:57,837][25689] Avg episode reward: [(0, '-47.925')] [2022-07-09 22:49:58,195][26022] Updated weights on worker 0-0, policy_version 454811 (0.00087) [2022-07-09 22:50:00,003][26022] Updated weights on worker 0-0, policy_version 454821 (0.00083) [2022-07-09 22:50:01,902][26022] Updated weights on worker 0-0, policy_version 454831 (0.00098) [2022-07-09 22:50:02,905][25689] Fps is (10 sec: 5377.2, 60 sec: 5645.2, 300 sec: 5661.6). Total num frames: 465748992. Throughput: 0: 5089.9. Samples: 465749064. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:02,906][25689] Avg episode reward: [(0, '-46.384')] [2022-07-09 22:50:04,143][26022] Updated weights on worker 0-0, policy_version 454841 (0.00098) [2022-07-09 22:50:05,840][26022] Updated weights on worker 0-0, policy_version 454851 (0.00085) [2022-07-09 22:50:07,631][26022] Updated weights on worker 0-0, policy_version 454861 (0.00087) [2022-07-09 22:50:07,989][25689] Fps is (10 sec: 5547.3, 60 sec: 5688.6, 300 sec: 5665.8). Total num frames: 465779712. Throughput: 0: 5822.7. Samples: 465781252. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:07,989][25689] Avg episode reward: [(0, '-46.937')] [2022-07-09 22:50:09,610][26022] Updated weights on worker 0-0, policy_version 454871 (0.00088) [2022-07-09 22:50:11,268][26022] Updated weights on worker 0-0, policy_version 454881 (0.00105) [2022-07-09 22:50:13,078][25689] Fps is (10 sec: 5737.5, 60 sec: 5650.9, 300 sec: 5661.8). Total num frames: 465807360. Throughput: 0: 5814.0. Samples: 465815274. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:13,079][25689] Avg episode reward: [(0, '-46.628')] [2022-07-09 22:50:13,096][26022] Updated weights on worker 0-0, policy_version 454891 (0.00091) [2022-07-09 22:50:14,783][26022] Updated weights on worker 0-0, policy_version 454901 (0.00087) [2022-07-09 22:50:16,672][26022] Updated weights on worker 0-0, policy_version 454911 (0.00086) [2022-07-09 22:50:18,085][25689] Fps is (10 sec: 5781.2, 60 sec: 5684.5, 300 sec: 5665.4). Total num frames: 465838080. Throughput: 0: 4980.5. Samples: 465832664. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:18,086][25689] Avg episode reward: [(0, '-45.942')] [2022-07-09 22:50:18,400][26022] Updated weights on worker 0-0, policy_version 454921 (0.00087) [2022-07-09 22:50:20,311][26022] Updated weights on worker 0-0, policy_version 454931 (0.00085) [2022-07-09 22:50:21,857][26022] Updated weights on worker 0-0, policy_version 454941 (0.00095) [2022-07-09 22:50:23,118][25689] Fps is (10 sec: 5712.0, 60 sec: 5648.2, 300 sec: 5662.3). Total num frames: 465864704. Throughput: 0: 5838.6. Samples: 465867076. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:23,120][25689] Avg episode reward: [(0, '-45.549')] [2022-07-09 22:50:23,852][26022] Updated weights on worker 0-0, policy_version 454951 (0.00084) [2022-07-09 22:50:25,648][26022] Updated weights on worker 0-0, policy_version 454961 (0.00091) [2022-07-09 22:50:27,428][26022] Updated weights on worker 0-0, policy_version 454971 (0.00090) [2022-07-09 22:50:28,174][25689] Fps is (10 sec: 5582.4, 60 sec: 5660.5, 300 sec: 5669.5). Total num frames: 465894400. Throughput: 0: 5957.6. Samples: 465901506. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:28,174][25689] Avg episode reward: [(0, '-45.926')] [2022-07-09 22:50:29,281][26022] Updated weights on worker 0-0, policy_version 454981 (0.00085) [2022-07-09 22:50:30,975][26022] Updated weights on worker 0-0, policy_version 454991 (0.00079) [2022-07-09 22:50:32,823][26022] Updated weights on worker 0-0, policy_version 455001 (0.00090) [2022-07-09 22:50:33,279][25689] Fps is (10 sec: 5844.8, 60 sec: 5678.0, 300 sec: 5665.5). Total num frames: 465924096. Throughput: 0: 5116.0. Samples: 465918620. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:33,280][25689] Avg episode reward: [(0, '-46.146')] [2022-07-09 22:50:34,747][26022] Updated weights on worker 0-0, policy_version 455011 (0.00089) [2022-07-09 22:50:36,397][26022] Updated weights on worker 0-0, policy_version 455021 (0.00085) [2022-07-09 22:50:38,204][26022] Updated weights on worker 0-0, policy_version 455031 (0.00086) [2022-07-09 22:50:38,324][25689] Fps is (10 sec: 5649.4, 60 sec: 5658.2, 300 sec: 5666.1). Total num frames: 465951744. Throughput: 0: 5938.9. Samples: 465952862. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:38,325][25689] Avg episode reward: [(0, '-46.183')] [2022-07-09 22:50:39,942][26022] Updated weights on worker 0-0, policy_version 455041 (0.00098) [2022-07-09 22:50:41,683][26022] Updated weights on worker 0-0, policy_version 455051 (0.00093) [2022-07-09 22:50:43,330][25689] Fps is (10 sec: 5603.5, 60 sec: 5675.9, 300 sec: 5663.0). Total num frames: 465980416. Throughput: 0: 5938.5. Samples: 465987106. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:43,330][25689] Avg episode reward: [(0, '-45.912')] [2022-07-09 22:50:43,846][26022] Updated weights on worker 0-0, policy_version 455061 (0.00086) [2022-07-09 22:50:45,156][26022] Updated weights on worker 0-0, policy_version 455071 (0.00091) [2022-07-09 22:50:47,395][26022] Updated weights on worker 0-0, policy_version 455081 (0.00086) [2022-07-09 22:50:48,420][25689] Fps is (10 sec: 5984.5, 60 sec: 5705.3, 300 sec: 5676.5). Total num frames: 466012160. Throughput: 0: 5076.4. Samples: 466004282. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:48,420][25689] Avg episode reward: [(0, '-46.640')] [2022-07-09 22:50:48,640][26022] Updated weights on worker 0-0, policy_version 455091 (0.00088) [2022-07-09 22:50:50,891][26022] Updated weights on worker 0-0, policy_version 455101 (0.00093) [2022-07-09 22:50:52,653][26022] Updated weights on worker 0-0, policy_version 455111 (0.00096) [2022-07-09 22:50:53,486][25689] Fps is (10 sec: 5646.1, 60 sec: 5653.8, 300 sec: 5665.9). Total num frames: 466037760. Throughput: 0: 5938.2. Samples: 466038614. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:53,487][25689] Avg episode reward: [(0, '-47.147')] [2022-07-09 22:50:54,276][26022] Updated weights on worker 0-0, policy_version 455121 (0.00093) [2022-07-09 22:50:56,306][26022] Updated weights on worker 0-0, policy_version 455131 (0.00619) [2022-07-09 22:50:57,827][26022] Updated weights on worker 0-0, policy_version 455141 (0.00086) [2022-07-09 22:50:58,549][25689] Fps is (10 sec: 5357.7, 60 sec: 5650.2, 300 sec: 5665.7). Total num frames: 466066432. Throughput: 0: 5931.9. Samples: 466072836. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:50:58,550][25689] Avg episode reward: [(0, '-47.529')] [2022-07-09 22:50:59,954][26022] Updated weights on worker 0-0, policy_version 455151 (0.00088) [2022-07-09 22:51:01,589][26022] Updated weights on worker 0-0, policy_version 455161 (0.00083) [2022-07-09 22:51:03,564][25689] Fps is (10 sec: 5487.2, 60 sec: 5672.2, 300 sec: 5666.6). Total num frames: 466093056. Throughput: 0: 5069.1. Samples: 466089672. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:03,566][25689] Avg episode reward: [(0, '-47.614')] [2022-07-09 22:51:03,755][26022] Updated weights on worker 0-0, policy_version 455171 (0.00084) [2022-07-09 22:51:05,679][26022] Updated weights on worker 0-0, policy_version 455181 (0.00088) [2022-07-09 22:51:07,305][26022] Updated weights on worker 0-0, policy_version 455191 (0.00082) [2022-07-09 22:51:08,569][25689] Fps is (10 sec: 5620.7, 60 sec: 5662.5, 300 sec: 5664.0). Total num frames: 466122752. Throughput: 0: 5838.8. Samples: 466121932. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:08,570][25689] Avg episode reward: [(0, '-47.293')] [2022-07-09 22:51:09,343][26022] Updated weights on worker 0-0, policy_version 455201 (0.00086) [2022-07-09 22:51:11,016][26022] Updated weights on worker 0-0, policy_version 455211 (0.00085) [2022-07-09 22:51:12,831][26022] Updated weights on worker 0-0, policy_version 455221 (0.00095) [2022-07-09 22:51:13,640][25689] Fps is (10 sec: 5792.5, 60 sec: 5681.2, 300 sec: 5666.2). Total num frames: 466151424. Throughput: 0: 5834.8. Samples: 466156206. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:13,640][25689] Avg episode reward: [(0, '-47.380')] [2022-07-09 22:51:14,545][26022] Updated weights on worker 0-0, policy_version 455231 (0.00081) [2022-07-09 22:51:16,247][26022] Updated weights on worker 0-0, policy_version 455241 (0.00096) [2022-07-09 22:51:18,104][26022] Updated weights on worker 0-0, policy_version 455251 (0.00085) [2022-07-09 22:51:18,719][25689] Fps is (10 sec: 5751.0, 60 sec: 5657.6, 300 sec: 5665.2). Total num frames: 466181120. Throughput: 0: 5001.8. Samples: 466173718. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:18,721][25689] Avg episode reward: [(0, '-47.038')] [2022-07-09 22:51:19,888][26022] Updated weights on worker 0-0, policy_version 455261 (0.00090) [2022-07-09 22:51:21,569][26022] Updated weights on worker 0-0, policy_version 455271 (0.00086) [2022-07-09 22:51:23,486][26022] Updated weights on worker 0-0, policy_version 455281 (0.00084) [2022-07-09 22:51:23,740][25689] Fps is (10 sec: 5677.7, 60 sec: 5675.5, 300 sec: 5661.8). Total num frames: 466208768. Throughput: 0: 5881.0. Samples: 466208330. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:23,740][25689] Avg episode reward: [(0, '-46.876')] [2022-07-09 22:51:25,036][26022] Updated weights on worker 0-0, policy_version 455291 (0.00084) [2022-07-09 22:51:27,103][26022] Updated weights on worker 0-0, policy_version 455301 (0.00086) [2022-07-09 22:51:28,616][26022] Updated weights on worker 0-0, policy_version 455311 (0.00091) [2022-07-09 22:51:28,749][25689] Fps is (10 sec: 5717.0, 60 sec: 5679.9, 300 sec: 5669.5). Total num frames: 466238464. Throughput: 0: 5986.0. Samples: 466242728. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:28,750][25689] Avg episode reward: [(0, '-46.842')] [2022-07-09 22:51:30,670][26022] Updated weights on worker 0-0, policy_version 455321 (0.00092) [2022-07-09 22:51:32,232][26022] Updated weights on worker 0-0, policy_version 455331 (0.00089) [2022-07-09 22:51:33,825][25689] Fps is (10 sec: 5787.3, 60 sec: 5665.7, 300 sec: 5668.2). Total num frames: 466267136. Throughput: 0: 5139.2. Samples: 466259942. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:33,826][25689] Avg episode reward: [(0, '-46.192')] [2022-07-09 22:51:34,188][26022] Updated weights on worker 0-0, policy_version 455341 (0.00090) [2022-07-09 22:51:35,843][26022] Updated weights on worker 0-0, policy_version 455351 (0.00084) [2022-07-09 22:51:37,688][26022] Updated weights on worker 0-0, policy_version 455361 (0.00087) [2022-07-09 22:51:38,268][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:51:38,289][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000455365_466293760.pth [2022-07-09 22:51:38,295][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000453370_464250880.pth [2022-07-09 22:51:38,882][25689] Fps is (10 sec: 5658.9, 60 sec: 5681.5, 300 sec: 5667.4). Total num frames: 466295808. Throughput: 0: 5982.2. Samples: 466294342. Policy #0 lag: (min: 0.0, avg: 7.0, max: 19.0) [2022-07-09 22:51:38,883][25689] Avg episode reward: [(0, '-46.948')] [2022-07-09 22:51:39,602][26022] Updated weights on worker 0-0, policy_version 455371 (0.00081) [2022-07-09 22:51:41,220][26022] Updated weights on worker 0-0, policy_version 455381 (0.00089) [2022-07-09 22:51:43,239][26022] Updated weights on worker 0-0, policy_version 455391 (0.00083) [2022-07-09 22:51:43,911][25689] Fps is (10 sec: 5787.5, 60 sec: 5696.3, 300 sec: 5670.5). Total num frames: 466325504. Throughput: 0: 5957.0. Samples: 466328486. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:51:43,911][25689] Avg episode reward: [(0, '-47.512')] [2022-07-09 22:51:44,880][26022] Updated weights on worker 0-0, policy_version 455401 (0.00085) [2022-07-09 22:51:46,642][26022] Updated weights on worker 0-0, policy_version 455411 (0.00088) [2022-07-09 22:51:48,430][26022] Updated weights on worker 0-0, policy_version 455421 (0.00083) [2022-07-09 22:51:48,913][25689] Fps is (10 sec: 5614.9, 60 sec: 5619.9, 300 sec: 5665.7). Total num frames: 466352128. Throughput: 0: 5964.1. Samples: 466362988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:51:48,914][25689] Avg episode reward: [(0, '-47.640')] [2022-07-09 22:51:50,264][26022] Updated weights on worker 0-0, policy_version 455431 (0.00088) [2022-07-09 22:51:52,075][26022] Updated weights on worker 0-0, policy_version 455441 (0.00097) [2022-07-09 22:51:53,651][26022] Updated weights on worker 0-0, policy_version 455451 (0.00088) [2022-07-09 22:51:53,984][25689] Fps is (10 sec: 5692.8, 60 sec: 5704.2, 300 sec: 5675.3). Total num frames: 466382848. Throughput: 0: 5973.1. Samples: 466380348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:51:53,984][25689] Avg episode reward: [(0, '-46.370')] [2022-07-09 22:51:55,641][26022] Updated weights on worker 0-0, policy_version 455461 (0.00087) [2022-07-09 22:51:57,452][26022] Updated weights on worker 0-0, policy_version 455471 (0.00082) [2022-07-09 22:51:58,875][26022] Updated weights on worker 0-0, policy_version 455481 (0.00101) [2022-07-09 22:51:59,026][25689] Fps is (10 sec: 6075.4, 60 sec: 5740.0, 300 sec: 5678.1). Total num frames: 466413568. Throughput: 0: 5994.7. Samples: 466415096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:51:59,026][25689] Avg episode reward: [(0, '-46.245')] [2022-07-09 22:52:01,044][26022] Updated weights on worker 0-0, policy_version 455491 (0.00091) [2022-07-09 22:52:02,729][26022] Updated weights on worker 0-0, policy_version 455501 (0.00083) [2022-07-09 22:52:04,059][25689] Fps is (10 sec: 5488.4, 60 sec: 5704.4, 300 sec: 5670.7). Total num frames: 466438144. Throughput: 0: 5900.2. Samples: 466447364. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:04,059][25689] Avg episode reward: [(0, '-46.204')] [2022-07-09 22:52:05,013][26022] Updated weights on worker 0-0, policy_version 455511 (0.00086) [2022-07-09 22:52:06,631][26022] Updated weights on worker 0-0, policy_version 455521 (0.00086) [2022-07-09 22:52:08,502][26022] Updated weights on worker 0-0, policy_version 455531 (0.00084) [2022-07-09 22:52:09,061][25689] Fps is (10 sec: 5408.0, 60 sec: 5704.7, 300 sec: 5676.1). Total num frames: 466467840. Throughput: 0: 5037.7. Samples: 466464490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:09,062][25689] Avg episode reward: [(0, '-46.086')] [2022-07-09 22:52:10,314][26022] Updated weights on worker 0-0, policy_version 455541 (0.00086) [2022-07-09 22:52:12,005][26022] Updated weights on worker 0-0, policy_version 455551 (0.00087) [2022-07-09 22:52:14,005][26022] Updated weights on worker 0-0, policy_version 455561 (0.00088) [2022-07-09 22:52:14,109][25689] Fps is (10 sec: 5603.5, 60 sec: 5672.9, 300 sec: 5665.4). Total num frames: 466494464. Throughput: 0: 5875.3. Samples: 466498594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:14,110][25689] Avg episode reward: [(0, '-45.872')] [2022-07-09 22:52:15,629][26022] Updated weights on worker 0-0, policy_version 455571 (0.00091) [2022-07-09 22:52:17,490][26022] Updated weights on worker 0-0, policy_version 455581 (0.00089) [2022-07-09 22:52:19,134][25689] Fps is (10 sec: 5591.5, 60 sec: 5678.0, 300 sec: 5671.9). Total num frames: 466524160. Throughput: 0: 5858.7. Samples: 466532902. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:19,136][25689] Avg episode reward: [(0, '-46.295')] [2022-07-09 22:52:19,213][26022] Updated weights on worker 0-0, policy_version 455591 (0.00087) [2022-07-09 22:52:20,888][26022] Updated weights on worker 0-0, policy_version 455601 (0.00085) [2022-07-09 22:52:23,014][26022] Updated weights on worker 0-0, policy_version 455611 (0.00083) [2022-07-09 22:52:24,142][25689] Fps is (10 sec: 5919.8, 60 sec: 5713.2, 300 sec: 5672.0). Total num frames: 466553856. Throughput: 0: 5117.6. Samples: 466550144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:24,144][25689] Avg episode reward: [(0, '-47.274')] [2022-07-09 22:52:24,395][26022] Updated weights on worker 0-0, policy_version 455621 (0.00084) [2022-07-09 22:52:26,459][26022] Updated weights on worker 0-0, policy_version 455631 (0.00087) [2022-07-09 22:52:28,134][26022] Updated weights on worker 0-0, policy_version 455641 (0.00084) [2022-07-09 22:52:29,167][25689] Fps is (10 sec: 5613.6, 60 sec: 5660.9, 300 sec: 5672.3). Total num frames: 466580480. Throughput: 0: 5976.4. Samples: 466584646. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:29,168][25689] Avg episode reward: [(0, '-47.782')] [2022-07-09 22:52:29,851][26022] Updated weights on worker 0-0, policy_version 455651 (0.00086) [2022-07-09 22:52:31,967][26022] Updated weights on worker 0-0, policy_version 455661 (0.00094) [2022-07-09 22:52:33,467][26022] Updated weights on worker 0-0, policy_version 455671 (0.00088) [2022-07-09 22:52:34,232][25689] Fps is (10 sec: 5582.1, 60 sec: 5678.9, 300 sec: 5671.6). Total num frames: 466610176. Throughput: 0: 5971.4. Samples: 466618752. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:34,232][25689] Avg episode reward: [(0, '-46.920')] [2022-07-09 22:52:35,420][26022] Updated weights on worker 0-0, policy_version 455681 (0.00096) [2022-07-09 22:52:37,262][26022] Updated weights on worker 0-0, policy_version 455691 (0.00088) [2022-07-09 22:52:38,890][26022] Updated weights on worker 0-0, policy_version 455701 (0.00094) [2022-07-09 22:52:39,331][25689] Fps is (10 sec: 5742.6, 60 sec: 5674.9, 300 sec: 5670.0). Total num frames: 466638848. Throughput: 0: 5095.1. Samples: 466635806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:39,333][25689] Avg episode reward: [(0, '-46.085')] [2022-07-09 22:52:40,973][26022] Updated weights on worker 0-0, policy_version 455711 (0.00092) [2022-07-09 22:52:42,574][26022] Updated weights on worker 0-0, policy_version 455721 (0.00088) [2022-07-09 22:52:44,247][26022] Updated weights on worker 0-0, policy_version 455731 (0.00093) [2022-07-09 22:52:44,373][25689] Fps is (10 sec: 5755.1, 60 sec: 5673.6, 300 sec: 5676.4). Total num frames: 466668544. Throughput: 0: 5931.2. Samples: 466670140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:44,374][25689] Avg episode reward: [(0, '-45.799')] [2022-07-09 22:52:46,273][26022] Updated weights on worker 0-0, policy_version 455741 (0.00089) [2022-07-09 22:52:47,828][26022] Updated weights on worker 0-0, policy_version 455751 (0.00102) [2022-07-09 22:52:49,383][25689] Fps is (10 sec: 5704.7, 60 sec: 5689.8, 300 sec: 5666.8). Total num frames: 466696192. Throughput: 0: 5912.0. Samples: 466704164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:49,385][25689] Avg episode reward: [(0, '-44.682')] [2022-07-09 22:52:49,862][26022] Updated weights on worker 0-0, policy_version 455761 (0.00088) [2022-07-09 22:52:51,553][26022] Updated weights on worker 0-0, policy_version 455771 (0.00091) [2022-07-09 22:52:53,191][26022] Updated weights on worker 0-0, policy_version 455781 (0.00086) [2022-07-09 22:52:54,463][25689] Fps is (10 sec: 5683.5, 60 sec: 5672.0, 300 sec: 5672.9). Total num frames: 466725888. Throughput: 0: 5078.5. Samples: 466721498. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:54,464][25689] Avg episode reward: [(0, '-44.454')] [2022-07-09 22:52:55,229][26022] Updated weights on worker 0-0, policy_version 455791 (0.00090) [2022-07-09 22:52:56,759][26022] Updated weights on worker 0-0, policy_version 455801 (0.00084) [2022-07-09 22:52:58,669][26022] Updated weights on worker 0-0, policy_version 455811 (0.00092) [2022-07-09 22:52:59,484][25689] Fps is (10 sec: 5879.9, 60 sec: 5657.1, 300 sec: 5676.5). Total num frames: 466755584. Throughput: 0: 5981.4. Samples: 466756350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:52:59,484][25689] Avg episode reward: [(0, '-44.474')] [2022-07-09 22:53:00,495][26022] Updated weights on worker 0-0, policy_version 455821 (0.00090) [2022-07-09 22:53:02,343][26022] Updated weights on worker 0-0, policy_version 455831 (0.00091) [2022-07-09 22:53:04,409][26022] Updated weights on worker 0-0, policy_version 455841 (0.00088) [2022-07-09 22:53:04,500][25689] Fps is (10 sec: 5509.4, 60 sec: 5675.6, 300 sec: 5673.0). Total num frames: 466781184. Throughput: 0: 5891.6. Samples: 466788716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:04,500][25689] Avg episode reward: [(0, '-45.304')] [2022-07-09 22:53:06,076][26022] Updated weights on worker 0-0, policy_version 455851 (0.00093) [2022-07-09 22:53:07,902][26022] Updated weights on worker 0-0, policy_version 455861 (0.00089) [2022-07-09 22:53:09,536][25689] Fps is (10 sec: 5398.9, 60 sec: 5655.5, 300 sec: 5674.8). Total num frames: 466809856. Throughput: 0: 5046.1. Samples: 466805860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:09,537][25689] Avg episode reward: [(0, '-45.501')] [2022-07-09 22:53:09,884][26022] Updated weights on worker 0-0, policy_version 455871 (0.00086) [2022-07-09 22:53:11,428][26022] Updated weights on worker 0-0, policy_version 455881 (0.00085) [2022-07-09 22:53:13,230][26022] Updated weights on worker 0-0, policy_version 455891 (0.00082) [2022-07-09 22:53:14,609][25689] Fps is (10 sec: 5773.7, 60 sec: 5704.0, 300 sec: 5673.9). Total num frames: 466839552. Throughput: 0: 5911.4. Samples: 466840590. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:14,610][25689] Avg episode reward: [(0, '-47.146')] [2022-07-09 22:53:15,203][26022] Updated weights on worker 0-0, policy_version 455901 (0.00082) [2022-07-09 22:53:16,742][26022] Updated weights on worker 0-0, policy_version 455911 (0.00095) [2022-07-09 22:53:18,668][26022] Updated weights on worker 0-0, policy_version 455921 (0.00081) [2022-07-09 22:53:19,645][25689] Fps is (10 sec: 5774.2, 60 sec: 5686.0, 300 sec: 5670.8). Total num frames: 466868224. Throughput: 0: 5905.7. Samples: 466875414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:19,645][25689] Avg episode reward: [(0, '-46.882')] [2022-07-09 22:53:20,323][26022] Updated weights on worker 0-0, policy_version 455931 (0.00091) [2022-07-09 22:53:22,181][26022] Updated weights on worker 0-0, policy_version 455941 (0.00084) [2022-07-09 22:53:24,002][26022] Updated weights on worker 0-0, policy_version 455951 (0.00093) [2022-07-09 22:53:24,657][25689] Fps is (10 sec: 5706.9, 60 sec: 5668.7, 300 sec: 5678.1). Total num frames: 466896896. Throughput: 0: 5159.1. Samples: 466892710. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:24,658][25689] Avg episode reward: [(0, '-47.586')] [2022-07-09 22:53:25,779][26022] Updated weights on worker 0-0, policy_version 455961 (0.00100) [2022-07-09 22:53:27,606][26022] Updated weights on worker 0-0, policy_version 455971 (0.00075) [2022-07-09 22:53:29,177][26022] Updated weights on worker 0-0, policy_version 455981 (0.00089) [2022-07-09 22:53:29,663][25689] Fps is (10 sec: 5723.9, 60 sec: 5704.3, 300 sec: 5675.6). Total num frames: 466925568. Throughput: 0: 6022.1. Samples: 466927066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:29,663][25689] Avg episode reward: [(0, '-47.910')] [2022-07-09 22:53:31,233][26022] Updated weights on worker 0-0, policy_version 455991 (0.00087) [2022-07-09 22:53:33,062][26022] Updated weights on worker 0-0, policy_version 456001 (0.00096) [2022-07-09 22:53:34,607][26022] Updated weights on worker 0-0, policy_version 456011 (0.00086) [2022-07-09 22:53:34,791][25689] Fps is (10 sec: 5760.0, 60 sec: 5698.4, 300 sec: 5680.1). Total num frames: 466955264. Throughput: 0: 5980.4. Samples: 466961284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:34,791][25689] Avg episode reward: [(0, '-48.778')] [2022-07-09 22:53:36,600][26022] Updated weights on worker 0-0, policy_version 456021 (0.00090) [2022-07-09 22:53:38,347][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:53:38,363][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000456031_466975744.pth [2022-07-09 22:53:38,363][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000454035_464931840.pth [2022-07-09 22:53:38,367][26022] Updated weights on worker 0-0, policy_version 456031 (0.00091) [2022-07-09 22:53:39,848][25689] Fps is (10 sec: 5831.2, 60 sec: 5719.2, 300 sec: 5679.3). Total num frames: 466984960. Throughput: 0: 5108.4. Samples: 466978622. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:39,849][25689] Avg episode reward: [(0, '-47.874')] [2022-07-09 22:53:40,083][26022] Updated weights on worker 0-0, policy_version 456041 (0.00100) [2022-07-09 22:53:41,958][26022] Updated weights on worker 0-0, policy_version 456051 (0.00935) [2022-07-09 22:53:43,454][26022] Updated weights on worker 0-0, policy_version 456061 (0.00093) [2022-07-09 22:53:44,885][25689] Fps is (10 sec: 5680.8, 60 sec: 5685.9, 300 sec: 5675.4). Total num frames: 467012608. Throughput: 0: 5951.1. Samples: 467013086. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:44,886][25689] Avg episode reward: [(0, '-48.882')] [2022-07-09 22:53:45,467][26022] Updated weights on worker 0-0, policy_version 456071 (0.00094) [2022-07-09 22:53:47,315][26022] Updated weights on worker 0-0, policy_version 456081 (0.00087) [2022-07-09 22:53:48,927][26022] Updated weights on worker 0-0, policy_version 456091 (0.00086) [2022-07-09 22:53:49,893][25689] Fps is (10 sec: 5709.0, 60 sec: 5719.9, 300 sec: 5686.9). Total num frames: 467042304. Throughput: 0: 5943.9. Samples: 467047310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:49,893][25689] Avg episode reward: [(0, '-50.300')] [2022-07-09 22:53:50,958][26022] Updated weights on worker 0-0, policy_version 456101 (0.00093) [2022-07-09 22:53:52,476][26022] Updated weights on worker 0-0, policy_version 456111 (0.00093) [2022-07-09 22:53:54,652][26022] Updated weights on worker 0-0, policy_version 456121 (0.00095) [2022-07-09 22:53:54,971][25689] Fps is (10 sec: 5685.3, 60 sec: 5686.2, 300 sec: 5672.4). Total num frames: 467069952. Throughput: 0: 5080.5. Samples: 467063812. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:53:54,972][25689] Avg episode reward: [(0, '-48.636')] [2022-07-09 22:53:56,387][26022] Updated weights on worker 0-0, policy_version 456131 (0.00086) [2022-07-09 22:53:58,110][26022] Updated weights on worker 0-0, policy_version 456141 (0.00094) [2022-07-09 22:53:59,938][26022] Updated weights on worker 0-0, policy_version 456151 (0.00088) [2022-07-09 22:54:00,053][25689] Fps is (10 sec: 5542.9, 60 sec: 5663.6, 300 sec: 5681.8). Total num frames: 467098624. Throughput: 0: 5912.2. Samples: 467098078. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:00,054][25689] Avg episode reward: [(0, '-48.404')] [2022-07-09 22:54:01,818][26022] Updated weights on worker 0-0, policy_version 456161 (0.00081) [2022-07-09 22:54:03,898][26022] Updated weights on worker 0-0, policy_version 456171 (0.00091) [2022-07-09 22:54:05,071][25689] Fps is (10 sec: 5475.0, 60 sec: 5680.3, 300 sec: 5678.1). Total num frames: 467125248. Throughput: 0: 5801.0. Samples: 467130186. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:05,072][25689] Avg episode reward: [(0, '-48.199')] [2022-07-09 22:54:05,819][26022] Updated weights on worker 0-0, policy_version 456181 (0.00086) [2022-07-09 22:54:07,243][26022] Updated weights on worker 0-0, policy_version 456191 (0.00088) [2022-07-09 22:54:09,570][26022] Updated weights on worker 0-0, policy_version 456201 (0.00094) [2022-07-09 22:54:10,125][25689] Fps is (10 sec: 5490.3, 60 sec: 5678.7, 300 sec: 5674.5). Total num frames: 467153920. Throughput: 0: 4938.7. Samples: 467147230. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:10,126][25689] Avg episode reward: [(0, '-47.652')] [2022-07-09 22:54:10,832][26022] Updated weights on worker 0-0, policy_version 456211 (0.00100) [2022-07-09 22:54:12,978][26022] Updated weights on worker 0-0, policy_version 456221 (0.00084) [2022-07-09 22:54:14,494][26022] Updated weights on worker 0-0, policy_version 456231 (0.00425) [2022-07-09 22:54:15,215][25689] Fps is (10 sec: 5754.0, 60 sec: 5677.1, 300 sec: 5676.3). Total num frames: 467183616. Throughput: 0: 5825.8. Samples: 467181748. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:15,217][25689] Avg episode reward: [(0, '-46.757')] [2022-07-09 22:54:16,562][26022] Updated weights on worker 0-0, policy_version 456241 (0.00087) [2022-07-09 22:54:18,103][26022] Updated weights on worker 0-0, policy_version 456251 (0.00094) [2022-07-09 22:54:20,120][26022] Updated weights on worker 0-0, policy_version 456261 (0.00093) [2022-07-09 22:54:20,227][25689] Fps is (10 sec: 5777.8, 60 sec: 5679.3, 300 sec: 5676.2). Total num frames: 467212288. Throughput: 0: 5844.9. Samples: 467215990. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:20,229][25689] Avg episode reward: [(0, '-46.830')] [2022-07-09 22:54:21,685][26022] Updated weights on worker 0-0, policy_version 456271 (0.00091) [2022-07-09 22:54:23,783][26022] Updated weights on worker 0-0, policy_version 456281 (0.00089) [2022-07-09 22:54:25,271][25689] Fps is (10 sec: 5702.4, 60 sec: 5676.3, 300 sec: 5675.5). Total num frames: 467240960. Throughput: 0: 5949.8. Samples: 467250372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:25,272][25689] Avg episode reward: [(0, '-47.950')] [2022-07-09 22:54:25,279][26022] Updated weights on worker 0-0, policy_version 456291 (0.00095) [2022-07-09 22:54:27,259][26022] Updated weights on worker 0-0, policy_version 456301 (0.00080) [2022-07-09 22:54:28,810][26022] Updated weights on worker 0-0, policy_version 456311 (0.00075) [2022-07-09 22:54:30,315][25689] Fps is (10 sec: 5582.9, 60 sec: 5655.9, 300 sec: 5673.4). Total num frames: 467268608. Throughput: 0: 5963.6. Samples: 467267636. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-09 22:54:30,315][25689] Avg episode reward: [(0, '-47.215')] [2022-07-09 22:54:30,816][26022] Updated weights on worker 0-0, policy_version 456321 (0.00082) [2022-07-09 22:54:32,484][26022] Updated weights on worker 0-0, policy_version 456331 (0.00081) [2022-07-09 22:54:34,317][26022] Updated weights on worker 0-0, policy_version 456341 (0.01101) [2022-07-09 22:54:35,386][25689] Fps is (10 sec: 5770.5, 60 sec: 5678.1, 300 sec: 5679.2). Total num frames: 467299328. Throughput: 0: 5967.5. Samples: 467302118. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:54:35,386][25689] Avg episode reward: [(0, '-47.262')] [2022-07-09 22:54:36,128][26022] Updated weights on worker 0-0, policy_version 456351 (0.00086) [2022-07-09 22:54:37,899][26022] Updated weights on worker 0-0, policy_version 456361 (0.00089) [2022-07-09 22:54:39,859][26022] Updated weights on worker 0-0, policy_version 456371 (0.00087) [2022-07-09 22:54:40,447][25689] Fps is (10 sec: 5861.7, 60 sec: 5660.8, 300 sec: 5681.7). Total num frames: 467328000. Throughput: 0: 5962.6. Samples: 467336556. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:54:40,448][25689] Avg episode reward: [(0, '-47.564')] [2022-07-09 22:54:41,361][26022] Updated weights on worker 0-0, policy_version 456381 (0.00082) [2022-07-09 22:54:43,316][26022] Updated weights on worker 0-0, policy_version 456391 (0.00087) [2022-07-09 22:54:45,019][26022] Updated weights on worker 0-0, policy_version 456401 (0.00089) [2022-07-09 22:54:45,468][25689] Fps is (10 sec: 5586.1, 60 sec: 5662.3, 300 sec: 5675.3). Total num frames: 467355648. Throughput: 0: 5118.5. Samples: 467353750. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:54:45,470][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 22:54:46,832][26022] Updated weights on worker 0-0, policy_version 456411 (0.00090) [2022-07-09 22:54:48,848][26022] Updated weights on worker 0-0, policy_version 456421 (0.00090) [2022-07-09 22:54:50,343][26022] Updated weights on worker 0-0, policy_version 456431 (0.00091) [2022-07-09 22:54:50,481][25689] Fps is (10 sec: 5714.8, 60 sec: 5661.8, 300 sec: 5679.5). Total num frames: 467385344. Throughput: 0: 5973.8. Samples: 467388106. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:54:50,482][25689] Avg episode reward: [(0, '-46.997')] [2022-07-09 22:54:52,245][26022] Updated weights on worker 0-0, policy_version 456441 (0.00096) [2022-07-09 22:54:53,887][26022] Updated weights on worker 0-0, policy_version 456451 (0.00088) [2022-07-09 22:54:55,598][25689] Fps is (10 sec: 5863.3, 60 sec: 5692.1, 300 sec: 5681.2). Total num frames: 467415040. Throughput: 0: 5963.1. Samples: 467422642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:54:55,598][25689] Avg episode reward: [(0, '-45.677')] [2022-07-09 22:54:55,658][26022] Updated weights on worker 0-0, policy_version 456461 (0.00084) [2022-07-09 22:54:57,784][26022] Updated weights on worker 0-0, policy_version 456471 (0.00085) [2022-07-09 22:54:59,357][26022] Updated weights on worker 0-0, policy_version 456481 (0.00089) [2022-07-09 22:55:00,619][25689] Fps is (10 sec: 5656.8, 60 sec: 5680.9, 300 sec: 5689.0). Total num frames: 467442688. Throughput: 0: 5118.3. Samples: 467439800. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:00,619][25689] Avg episode reward: [(0, '-45.365')] [2022-07-09 22:55:01,103][26022] Updated weights on worker 0-0, policy_version 456491 (0.00102) [2022-07-09 22:55:03,348][26022] Updated weights on worker 0-0, policy_version 456501 (0.00086) [2022-07-09 22:55:05,092][26022] Updated weights on worker 0-0, policy_version 456511 (0.00095) [2022-07-09 22:55:05,639][25689] Fps is (10 sec: 5404.5, 60 sec: 5680.6, 300 sec: 5676.5). Total num frames: 467469312. Throughput: 0: 5850.7. Samples: 467471768. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:05,640][25689] Avg episode reward: [(0, '-45.091')] [2022-07-09 22:55:07,024][26022] Updated weights on worker 0-0, policy_version 456521 (0.00091) [2022-07-09 22:55:08,768][26022] Updated weights on worker 0-0, policy_version 456531 (0.00083) [2022-07-09 22:55:10,620][26022] Updated weights on worker 0-0, policy_version 456541 (0.00090) [2022-07-09 22:55:10,654][25689] Fps is (10 sec: 5510.2, 60 sec: 5684.3, 300 sec: 5681.3). Total num frames: 467497984. Throughput: 0: 5843.2. Samples: 467505978. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:10,654][25689] Avg episode reward: [(0, '-45.459')] [2022-07-09 22:55:12,316][26022] Updated weights on worker 0-0, policy_version 456551 (0.00092) [2022-07-09 22:55:14,063][26022] Updated weights on worker 0-0, policy_version 456561 (0.00093) [2022-07-09 22:55:15,698][25689] Fps is (10 sec: 5701.0, 60 sec: 5671.7, 300 sec: 5673.7). Total num frames: 467526656. Throughput: 0: 4990.5. Samples: 467522954. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:15,699][25689] Avg episode reward: [(0, '-45.661')] [2022-07-09 22:55:15,757][26022] Updated weights on worker 0-0, policy_version 456571 (0.00085) [2022-07-09 22:55:18,091][26022] Updated weights on worker 0-0, policy_version 456581 (0.00097) [2022-07-09 22:55:19,527][26022] Updated weights on worker 0-0, policy_version 456591 (0.00100) [2022-07-09 22:55:20,730][25689] Fps is (10 sec: 5691.3, 60 sec: 5669.8, 300 sec: 5680.6). Total num frames: 467555328. Throughput: 0: 5819.4. Samples: 467556834. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:20,730][25689] Avg episode reward: [(0, '-45.951')] [2022-07-09 22:55:21,561][26022] Updated weights on worker 0-0, policy_version 456601 (0.00091) [2022-07-09 22:55:23,255][26022] Updated weights on worker 0-0, policy_version 456611 (0.00096) [2022-07-09 22:55:25,077][26022] Updated weights on worker 0-0, policy_version 456621 (0.00086) [2022-07-09 22:55:25,739][25689] Fps is (10 sec: 5507.3, 60 sec: 5639.2, 300 sec: 5671.2). Total num frames: 467581952. Throughput: 0: 5929.0. Samples: 467590936. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:25,739][25689] Avg episode reward: [(0, '-46.815')] [2022-07-09 22:55:26,779][26022] Updated weights on worker 0-0, policy_version 456631 (0.00086) [2022-07-09 22:55:28,891][26022] Updated weights on worker 0-0, policy_version 456641 (0.00095) [2022-07-09 22:55:30,234][26022] Updated weights on worker 0-0, policy_version 456651 (0.00093) [2022-07-09 22:55:30,749][25689] Fps is (10 sec: 5621.3, 60 sec: 5676.3, 300 sec: 5673.0). Total num frames: 467611648. Throughput: 0: 5078.2. Samples: 467608024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:30,752][25689] Avg episode reward: [(0, '-46.376')] [2022-07-09 22:55:32,523][26022] Updated weights on worker 0-0, policy_version 456661 (0.00084) [2022-07-09 22:55:33,827][26022] Updated weights on worker 0-0, policy_version 456671 (0.00082) [2022-07-09 22:55:35,797][25689] Fps is (10 sec: 5701.1, 60 sec: 5627.6, 300 sec: 5673.0). Total num frames: 467639296. Throughput: 0: 5928.8. Samples: 467642116. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:35,798][25689] Avg episode reward: [(0, '-46.607')] [2022-07-09 22:55:36,092][26022] Updated weights on worker 0-0, policy_version 456681 (0.00084) [2022-07-09 22:55:37,463][26022] Updated weights on worker 0-0, policy_version 456691 (0.00087) [2022-07-09 22:55:38,436][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:55:38,452][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000456695_467655680.pth [2022-07-09 22:55:38,453][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000454700_465612800.pth [2022-07-09 22:55:39,638][26022] Updated weights on worker 0-0, policy_version 456701 (0.00087) [2022-07-09 22:55:40,813][25689] Fps is (10 sec: 5697.8, 60 sec: 5648.8, 300 sec: 5676.2). Total num frames: 467668992. Throughput: 0: 5956.6. Samples: 467676462. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:40,813][25689] Avg episode reward: [(0, '-46.162')] [2022-07-09 22:55:41,223][26022] Updated weights on worker 0-0, policy_version 456711 (0.00087) [2022-07-09 22:55:43,170][26022] Updated weights on worker 0-0, policy_version 456721 (0.00085) [2022-07-09 22:55:44,643][26022] Updated weights on worker 0-0, policy_version 456731 (0.00084) [2022-07-09 22:55:45,829][25689] Fps is (10 sec: 5716.2, 60 sec: 5649.2, 300 sec: 5663.8). Total num frames: 467696640. Throughput: 0: 5111.9. Samples: 467693636. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:45,829][25689] Avg episode reward: [(0, '-45.905')] [2022-07-09 22:55:46,716][26022] Updated weights on worker 0-0, policy_version 456741 (0.00093) [2022-07-09 22:55:48,185][26022] Updated weights on worker 0-0, policy_version 456751 (0.00091) [2022-07-09 22:55:50,480][26022] Updated weights on worker 0-0, policy_version 456761 (0.00090) [2022-07-09 22:55:50,852][25689] Fps is (10 sec: 5609.9, 60 sec: 5631.3, 300 sec: 5675.0). Total num frames: 467725312. Throughput: 0: 5968.3. Samples: 467728008. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:50,853][25689] Avg episode reward: [(0, '-46.293')] [2022-07-09 22:55:51,958][26022] Updated weights on worker 0-0, policy_version 456771 (0.00083) [2022-07-09 22:55:54,137][26022] Updated weights on worker 0-0, policy_version 456781 (0.00083) [2022-07-09 22:55:55,546][26022] Updated weights on worker 0-0, policy_version 456791 (0.00089) [2022-07-09 22:55:55,953][25689] Fps is (10 sec: 5765.1, 60 sec: 5632.7, 300 sec: 5677.7). Total num frames: 467755008. Throughput: 0: 5955.5. Samples: 467762156. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:55:55,955][25689] Avg episode reward: [(0, '-46.833')] [2022-07-09 22:55:57,607][26022] Updated weights on worker 0-0, policy_version 456801 (0.00085) [2022-07-09 22:55:59,072][26022] Updated weights on worker 0-0, policy_version 456811 (0.00086) [2022-07-09 22:56:00,966][25689] Fps is (10 sec: 5771.5, 60 sec: 5650.5, 300 sec: 5684.6). Total num frames: 467783680. Throughput: 0: 5110.3. Samples: 467779448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:00,966][25689] Avg episode reward: [(0, '-47.557')] [2022-07-09 22:56:01,086][26022] Updated weights on worker 0-0, policy_version 456821 (0.00096) [2022-07-09 22:56:03,102][26022] Updated weights on worker 0-0, policy_version 456831 (0.00087) [2022-07-09 22:56:05,014][26022] Updated weights on worker 0-0, policy_version 456841 (0.00090) [2022-07-09 22:56:06,001][25689] Fps is (10 sec: 5503.4, 60 sec: 5649.2, 300 sec: 5673.8). Total num frames: 467810304. Throughput: 0: 5873.3. Samples: 467812112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:06,002][25689] Avg episode reward: [(0, '-48.321')] [2022-07-09 22:56:06,792][26022] Updated weights on worker 0-0, policy_version 456851 (0.00088) [2022-07-09 22:56:08,543][26022] Updated weights on worker 0-0, policy_version 456861 (0.00088) [2022-07-09 22:56:10,500][26022] Updated weights on worker 0-0, policy_version 456871 (0.00088) [2022-07-09 22:56:11,003][25689] Fps is (10 sec: 5509.0, 60 sec: 5650.3, 300 sec: 5675.1). Total num frames: 467838976. Throughput: 0: 5855.5. Samples: 467846000. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:11,005][25689] Avg episode reward: [(0, '-48.030')] [2022-07-09 22:56:12,202][26022] Updated weights on worker 0-0, policy_version 456881 (0.00091) [2022-07-09 22:56:13,924][26022] Updated weights on worker 0-0, policy_version 456891 (0.00086) [2022-07-09 22:56:15,751][26022] Updated weights on worker 0-0, policy_version 456901 (0.00089) [2022-07-09 22:56:16,056][25689] Fps is (10 sec: 5702.6, 60 sec: 5649.5, 300 sec: 5672.1). Total num frames: 467867648. Throughput: 0: 5021.6. Samples: 467863106. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:16,057][25689] Avg episode reward: [(0, '-47.684')] [2022-07-09 22:56:17,612][26022] Updated weights on worker 0-0, policy_version 456911 (0.00092) [2022-07-09 22:56:19,397][26022] Updated weights on worker 0-0, policy_version 456921 (0.00091) [2022-07-09 22:56:21,075][25689] Fps is (10 sec: 5693.5, 60 sec: 5650.7, 300 sec: 5675.6). Total num frames: 467896320. Throughput: 0: 5859.4. Samples: 467897276. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:21,075][25689] Avg episode reward: [(0, '-47.698')] [2022-07-09 22:56:21,235][26022] Updated weights on worker 0-0, policy_version 456931 (0.00087) [2022-07-09 22:56:22,972][26022] Updated weights on worker 0-0, policy_version 456941 (0.00087) [2022-07-09 22:56:24,874][26022] Updated weights on worker 0-0, policy_version 456951 (0.00092) [2022-07-09 22:56:26,080][25689] Fps is (10 sec: 5721.2, 60 sec: 5685.0, 300 sec: 5672.2). Total num frames: 467924992. Throughput: 0: 5950.6. Samples: 467931594. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:26,080][25689] Avg episode reward: [(0, '-47.678')] [2022-07-09 22:56:26,548][26022] Updated weights on worker 0-0, policy_version 456961 (0.00081) [2022-07-09 22:56:28,355][26022] Updated weights on worker 0-0, policy_version 456971 (0.00091) [2022-07-09 22:56:30,162][26022] Updated weights on worker 0-0, policy_version 456981 (0.00094) [2022-07-09 22:56:31,127][25689] Fps is (10 sec: 5602.8, 60 sec: 5647.6, 300 sec: 5669.3). Total num frames: 467952640. Throughput: 0: 5100.2. Samples: 467948636. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:31,127][25689] Avg episode reward: [(0, '-46.695')] [2022-07-09 22:56:31,889][26022] Updated weights on worker 0-0, policy_version 456991 (0.00090) [2022-07-09 22:56:33,918][26022] Updated weights on worker 0-0, policy_version 457001 (0.00087) [2022-07-09 22:56:35,331][26022] Updated weights on worker 0-0, policy_version 457011 (0.00097) [2022-07-09 22:56:36,254][25689] Fps is (10 sec: 5636.1, 60 sec: 5674.1, 300 sec: 5671.5). Total num frames: 467982336. Throughput: 0: 5925.5. Samples: 467982786. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:36,255][25689] Avg episode reward: [(0, '-47.552')] [2022-07-09 22:56:37,543][26022] Updated weights on worker 0-0, policy_version 457021 (0.00086) [2022-07-09 22:56:39,111][26022] Updated weights on worker 0-0, policy_version 457031 (0.00097) [2022-07-09 22:56:41,158][26022] Updated weights on worker 0-0, policy_version 457041 (0.00084) [2022-07-09 22:56:41,323][25689] Fps is (10 sec: 5624.0, 60 sec: 5635.3, 300 sec: 5663.8). Total num frames: 468009984. Throughput: 0: 5896.2. Samples: 468016664. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:41,323][25689] Avg episode reward: [(0, '-47.748')] [2022-07-09 22:56:42,605][26022] Updated weights on worker 0-0, policy_version 457051 (0.00083) [2022-07-09 22:56:44,571][26022] Updated weights on worker 0-0, policy_version 457061 (0.00082) [2022-07-09 22:56:46,294][26022] Updated weights on worker 0-0, policy_version 457071 (0.00097) [2022-07-09 22:56:46,387][25689] Fps is (10 sec: 5760.2, 60 sec: 5681.6, 300 sec: 5676.4). Total num frames: 468040704. Throughput: 0: 5036.6. Samples: 468033874. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:46,387][25689] Avg episode reward: [(0, '-47.531')] [2022-07-09 22:56:48,096][26022] Updated weights on worker 0-0, policy_version 457081 (0.00095) [2022-07-09 22:56:50,166][26022] Updated weights on worker 0-0, policy_version 457091 (0.00089) [2022-07-09 22:56:51,472][25689] Fps is (10 sec: 5953.2, 60 sec: 5692.7, 300 sec: 5672.7). Total num frames: 468070400. Throughput: 0: 5863.6. Samples: 468067930. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:51,472][25689] Avg episode reward: [(0, '-47.177')] [2022-07-09 22:56:51,828][26022] Updated weights on worker 0-0, policy_version 457101 (0.00089) [2022-07-09 22:56:53,618][26022] Updated weights on worker 0-0, policy_version 457111 (0.00085) [2022-07-09 22:56:55,322][26022] Updated weights on worker 0-0, policy_version 457121 (0.00081) [2022-07-09 22:56:56,550][25689] Fps is (10 sec: 5642.5, 60 sec: 5661.0, 300 sec: 5661.7). Total num frames: 468098048. Throughput: 0: 5890.1. Samples: 468102332. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:56:56,550][25689] Avg episode reward: [(0, '-46.910')] [2022-07-09 22:56:57,106][26022] Updated weights on worker 0-0, policy_version 457131 (0.00087) [2022-07-09 22:56:58,947][26022] Updated weights on worker 0-0, policy_version 457141 (0.00085) [2022-07-09 22:57:00,809][26022] Updated weights on worker 0-0, policy_version 457151 (0.00053) [2022-07-09 22:57:01,610][25689] Fps is (10 sec: 5454.3, 60 sec: 5639.7, 300 sec: 5671.5). Total num frames: 468125696. Throughput: 0: 5077.8. Samples: 468119674. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:57:01,615][25689] Avg episode reward: [(0, '-46.385')] [2022-07-09 22:57:02,671][26022] Updated weights on worker 0-0, policy_version 457161 (0.00094) [2022-07-09 22:57:04,754][26022] Updated weights on worker 0-0, policy_version 457171 (0.00089) [2022-07-09 22:57:06,276][26022] Updated weights on worker 0-0, policy_version 457181 (0.00085) [2022-07-09 22:57:06,664][25689] Fps is (10 sec: 5568.2, 60 sec: 5671.7, 300 sec: 5667.0). Total num frames: 468154368. Throughput: 0: 5830.3. Samples: 468152098. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:57:06,665][25689] Avg episode reward: [(0, '-45.997')] [2022-07-09 22:57:08,266][26022] Updated weights on worker 0-0, policy_version 457191 (0.00825) [2022-07-09 22:57:10,108][26022] Updated weights on worker 0-0, policy_version 457201 (0.00085) [2022-07-09 22:57:11,731][25689] Fps is (10 sec: 5767.0, 60 sec: 5682.5, 300 sec: 5677.0). Total num frames: 468184064. Throughput: 0: 5851.5. Samples: 468186476. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:57:11,732][25689] Avg episode reward: [(0, '-46.445')] [2022-07-09 22:57:11,735][26022] Updated weights on worker 0-0, policy_version 457211 (0.00086) [2022-07-09 22:57:13,734][26022] Updated weights on worker 0-0, policy_version 457221 (0.00091) [2022-07-09 22:57:15,391][26022] Updated weights on worker 0-0, policy_version 457231 (0.00089) [2022-07-09 22:57:16,817][25689] Fps is (10 sec: 5749.1, 60 sec: 5679.5, 300 sec: 5672.4). Total num frames: 468212736. Throughput: 0: 5851.7. Samples: 468220928. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-09 22:57:16,817][25689] Avg episode reward: [(0, '-47.092')] [2022-07-09 22:57:17,123][26022] Updated weights on worker 0-0, policy_version 457241 (0.00089) [2022-07-09 22:57:18,910][26022] Updated weights on worker 0-0, policy_version 457251 (0.00093) [2022-07-09 22:57:20,828][26022] Updated weights on worker 0-0, policy_version 457261 (0.00091) [2022-07-09 22:57:21,919][25689] Fps is (10 sec: 5528.1, 60 sec: 5654.9, 300 sec: 5663.7). Total num frames: 468240384. Throughput: 0: 5825.3. Samples: 468237980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:21,920][25689] Avg episode reward: [(0, '-47.660')] [2022-07-09 22:57:22,447][26022] Updated weights on worker 0-0, policy_version 457271 (0.00091) [2022-07-09 22:57:24,556][26022] Updated weights on worker 0-0, policy_version 457281 (0.00090) [2022-07-09 22:57:25,955][26022] Updated weights on worker 0-0, policy_version 457291 (0.00093) [2022-07-09 22:57:26,933][25689] Fps is (10 sec: 5567.2, 60 sec: 5653.9, 300 sec: 5670.8). Total num frames: 468269056. Throughput: 0: 5932.1. Samples: 468272336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:26,935][25689] Avg episode reward: [(0, '-48.105')] [2022-07-09 22:57:28,168][26022] Updated weights on worker 0-0, policy_version 457301 (0.00088) [2022-07-09 22:57:29,697][26022] Updated weights on worker 0-0, policy_version 457311 (0.00095) [2022-07-09 22:57:31,542][26022] Updated weights on worker 0-0, policy_version 457321 (0.00088) [2022-07-09 22:57:31,954][25689] Fps is (10 sec: 5714.2, 60 sec: 5673.3, 300 sec: 5668.2). Total num frames: 468297728. Throughput: 0: 5930.0. Samples: 468306400. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:31,954][25689] Avg episode reward: [(0, '-48.404')] [2022-07-09 22:57:33,326][26022] Updated weights on worker 0-0, policy_version 457331 (0.00086) [2022-07-09 22:57:35,302][26022] Updated weights on worker 0-0, policy_version 457341 (0.00087) [2022-07-09 22:57:36,923][26022] Updated weights on worker 0-0, policy_version 457351 (0.00104) [2022-07-09 22:57:37,019][25689] Fps is (10 sec: 5787.3, 60 sec: 5679.1, 300 sec: 5672.3). Total num frames: 468327424. Throughput: 0: 5078.7. Samples: 468323528. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:37,019][25689] Avg episode reward: [(0, '-48.318')] [2022-07-09 22:57:38,613][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:57:38,626][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000457359_468335616.pth [2022-07-09 22:57:38,626][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000455365_466293760.pth [2022-07-09 22:57:38,807][26022] Updated weights on worker 0-0, policy_version 457361 (0.00091) [2022-07-09 22:57:40,456][26022] Updated weights on worker 0-0, policy_version 457371 (0.00094) [2022-07-09 22:57:42,030][25689] Fps is (10 sec: 5690.8, 60 sec: 5684.4, 300 sec: 5666.0). Total num frames: 468355072. Throughput: 0: 5949.1. Samples: 468357628. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:42,032][25689] Avg episode reward: [(0, '-48.574')] [2022-07-09 22:57:42,553][26022] Updated weights on worker 0-0, policy_version 457381 (0.00081) [2022-07-09 22:57:44,285][26022] Updated weights on worker 0-0, policy_version 457391 (0.00095) [2022-07-09 22:57:45,923][26022] Updated weights on worker 0-0, policy_version 457401 (0.00096) [2022-07-09 22:57:47,033][25689] Fps is (10 sec: 5623.9, 60 sec: 5656.4, 300 sec: 5669.6). Total num frames: 468383744. Throughput: 0: 5946.0. Samples: 468391850. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:47,035][25689] Avg episode reward: [(0, '-47.714')] [2022-07-09 22:57:47,811][26022] Updated weights on worker 0-0, policy_version 457411 (0.00088) [2022-07-09 22:57:49,419][26022] Updated weights on worker 0-0, policy_version 457421 (0.00086) [2022-07-09 22:57:51,399][26022] Updated weights on worker 0-0, policy_version 457431 (0.00086) [2022-07-09 22:57:52,076][25689] Fps is (10 sec: 5708.2, 60 sec: 5643.4, 300 sec: 5666.8). Total num frames: 468412416. Throughput: 0: 5100.2. Samples: 468409028. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:52,077][25689] Avg episode reward: [(0, '-47.325')] [2022-07-09 22:57:53,187][26022] Updated weights on worker 0-0, policy_version 457441 (0.00086) [2022-07-09 22:57:55,038][26022] Updated weights on worker 0-0, policy_version 457451 (0.00087) [2022-07-09 22:57:56,868][26022] Updated weights on worker 0-0, policy_version 457461 (0.00086) [2022-07-09 22:57:57,154][25689] Fps is (10 sec: 5767.0, 60 sec: 5677.2, 300 sec: 5665.8). Total num frames: 468442112. Throughput: 0: 5958.9. Samples: 468443512. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:57:57,155][25689] Avg episode reward: [(0, '-46.772')] [2022-07-09 22:57:58,462][26022] Updated weights on worker 0-0, policy_version 457471 (0.00093) [2022-07-09 22:58:00,398][26022] Updated weights on worker 0-0, policy_version 457481 (0.00091) [2022-07-09 22:58:02,263][25689] Fps is (10 sec: 5629.5, 60 sec: 5672.6, 300 sec: 5670.9). Total num frames: 468469760. Throughput: 0: 5930.0. Samples: 468477604. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:02,263][25689] Avg episode reward: [(0, '-46.844')] [2022-07-09 22:58:02,500][26022] Updated weights on worker 0-0, policy_version 457491 (0.00090) [2022-07-09 22:58:04,317][26022] Updated weights on worker 0-0, policy_version 457501 (0.00193) [2022-07-09 22:58:06,090][26022] Updated weights on worker 0-0, policy_version 457511 (0.00095) [2022-07-09 22:58:07,276][25689] Fps is (10 sec: 5462.9, 60 sec: 5659.6, 300 sec: 5667.9). Total num frames: 468497408. Throughput: 0: 4972.8. Samples: 468492514. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:07,277][25689] Avg episode reward: [(0, '-46.337')] [2022-07-09 22:58:07,889][26022] Updated weights on worker 0-0, policy_version 457521 (0.00092) [2022-07-09 22:58:09,605][26022] Updated weights on worker 0-0, policy_version 457531 (0.00102) [2022-07-09 22:58:11,447][26022] Updated weights on worker 0-0, policy_version 457541 (0.00091) [2022-07-09 22:58:12,311][25689] Fps is (10 sec: 5605.1, 60 sec: 5645.7, 300 sec: 5665.2). Total num frames: 468526080. Throughput: 0: 5830.9. Samples: 468527012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:12,311][25689] Avg episode reward: [(0, '-45.127')] [2022-07-09 22:58:13,311][26022] Updated weights on worker 0-0, policy_version 457551 (0.00086) [2022-07-09 22:58:14,988][26022] Updated weights on worker 0-0, policy_version 457561 (0.00090) [2022-07-09 22:58:16,890][26022] Updated weights on worker 0-0, policy_version 457571 (0.00099) [2022-07-09 22:58:17,356][25689] Fps is (10 sec: 5790.9, 60 sec: 5666.4, 300 sec: 5668.4). Total num frames: 468555776. Throughput: 0: 5833.5. Samples: 468561358. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:17,356][25689] Avg episode reward: [(0, '-45.711')] [2022-07-09 22:58:18,735][26022] Updated weights on worker 0-0, policy_version 457581 (0.00088) [2022-07-09 22:58:20,463][26022] Updated weights on worker 0-0, policy_version 457591 (0.00083) [2022-07-09 22:58:22,288][26022] Updated weights on worker 0-0, policy_version 457601 (0.00094) [2022-07-09 22:58:22,374][25689] Fps is (10 sec: 5698.4, 60 sec: 5674.3, 300 sec: 5664.9). Total num frames: 468583424. Throughput: 0: 5011.6. Samples: 468578394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:22,374][25689] Avg episode reward: [(0, '-45.125')] [2022-07-09 22:58:24,024][26022] Updated weights on worker 0-0, policy_version 457611 (0.00090) [2022-07-09 22:58:25,908][26022] Updated weights on worker 0-0, policy_version 457621 (0.00091) [2022-07-09 22:58:27,414][25689] Fps is (10 sec: 5599.2, 60 sec: 5671.9, 300 sec: 5664.2). Total num frames: 468612096. Throughput: 0: 5970.5. Samples: 468612748. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:27,415][25689] Avg episode reward: [(0, '-45.954')] [2022-07-09 22:58:27,790][26022] Updated weights on worker 0-0, policy_version 457631 (0.00092) [2022-07-09 22:58:29,280][26022] Updated weights on worker 0-0, policy_version 457641 (0.00090) [2022-07-09 22:58:31,213][26022] Updated weights on worker 0-0, policy_version 457651 (0.00082) [2022-07-09 22:58:32,451][25689] Fps is (10 sec: 5792.1, 60 sec: 5687.3, 300 sec: 5665.9). Total num frames: 468641792. Throughput: 0: 5961.9. Samples: 468647088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:32,452][25689] Avg episode reward: [(0, '-45.900')] [2022-07-09 22:58:33,086][26022] Updated weights on worker 0-0, policy_version 457661 (0.00087) [2022-07-09 22:58:34,898][26022] Updated weights on worker 0-0, policy_version 457671 (0.00093) [2022-07-09 22:58:36,807][26022] Updated weights on worker 0-0, policy_version 457681 (0.00089) [2022-07-09 22:58:37,566][25689] Fps is (10 sec: 5648.7, 60 sec: 5648.8, 300 sec: 5658.0). Total num frames: 468669440. Throughput: 0: 5094.6. Samples: 468664322. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:37,567][25689] Avg episode reward: [(0, '-46.396')] [2022-07-09 22:58:38,218][26022] Updated weights on worker 0-0, policy_version 457691 (0.00094) [2022-07-09 22:58:40,343][26022] Updated weights on worker 0-0, policy_version 457701 (0.00087) [2022-07-09 22:58:41,781][26022] Updated weights on worker 0-0, policy_version 457711 (0.00085) [2022-07-09 22:58:42,587][25689] Fps is (10 sec: 5657.4, 60 sec: 5681.7, 300 sec: 5665.2). Total num frames: 468699136. Throughput: 0: 5954.1. Samples: 468698746. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:42,588][25689] Avg episode reward: [(0, '-46.381')] [2022-07-09 22:58:43,951][26022] Updated weights on worker 0-0, policy_version 457721 (0.00087) [2022-07-09 22:58:45,474][26022] Updated weights on worker 0-0, policy_version 457731 (0.00086) [2022-07-09 22:58:47,346][26022] Updated weights on worker 0-0, policy_version 457741 (0.00086) [2022-07-09 22:58:47,590][25689] Fps is (10 sec: 5822.7, 60 sec: 5681.7, 300 sec: 5661.8). Total num frames: 468727808. Throughput: 0: 5980.9. Samples: 468733418. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:47,591][25689] Avg episode reward: [(0, '-47.026')] [2022-07-09 22:58:49,094][26022] Updated weights on worker 0-0, policy_version 457751 (0.00083) [2022-07-09 22:58:50,835][26022] Updated weights on worker 0-0, policy_version 457761 (0.00087) [2022-07-09 22:58:52,596][25689] Fps is (10 sec: 5831.8, 60 sec: 5702.1, 300 sec: 5670.1). Total num frames: 468757504. Throughput: 0: 5145.0. Samples: 468750732. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:52,596][25689] Avg episode reward: [(0, '-47.759')] [2022-07-09 22:58:52,601][26022] Updated weights on worker 0-0, policy_version 457771 (0.00088) [2022-07-09 22:58:54,571][26022] Updated weights on worker 0-0, policy_version 457781 (0.00083) [2022-07-09 22:58:56,278][26022] Updated weights on worker 0-0, policy_version 457791 (0.00085) [2022-07-09 22:58:57,689][25689] Fps is (10 sec: 5779.9, 60 sec: 5683.8, 300 sec: 5669.9). Total num frames: 468786176. Throughput: 0: 5979.0. Samples: 468784636. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:58:57,689][25689] Avg episode reward: [(0, '-47.048')] [2022-07-09 22:58:58,086][26022] Updated weights on worker 0-0, policy_version 457801 (0.00108) [2022-07-09 22:58:59,760][26022] Updated weights on worker 0-0, policy_version 457811 (0.00081) [2022-07-09 22:59:01,687][26022] Updated weights on worker 0-0, policy_version 457821 (0.00081) [2022-07-09 22:59:02,729][25689] Fps is (10 sec: 5457.0, 60 sec: 5673.3, 300 sec: 5669.4). Total num frames: 468812800. Throughput: 0: 5941.8. Samples: 468818424. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:02,729][25689] Avg episode reward: [(0, '-47.067')] [2022-07-09 22:59:03,823][26022] Updated weights on worker 0-0, policy_version 457831 (0.00082) [2022-07-09 22:59:05,466][26022] Updated weights on worker 0-0, policy_version 457841 (0.00087) [2022-07-09 22:59:07,459][26022] Updated weights on worker 0-0, policy_version 457851 (0.00086) [2022-07-09 22:59:07,761][25689] Fps is (10 sec: 5489.7, 60 sec: 5688.4, 300 sec: 5669.8). Total num frames: 468841472. Throughput: 0: 5002.7. Samples: 468834328. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:07,762][25689] Avg episode reward: [(0, '-47.174')] [2022-07-09 22:59:09,151][26022] Updated weights on worker 0-0, policy_version 457861 (0.00084) [2022-07-09 22:59:10,910][26022] Updated weights on worker 0-0, policy_version 457871 (0.00436) [2022-07-09 22:59:12,620][26022] Updated weights on worker 0-0, policy_version 457881 (0.00092) [2022-07-09 22:59:12,764][25689] Fps is (10 sec: 5714.5, 60 sec: 5691.4, 300 sec: 5668.1). Total num frames: 468870144. Throughput: 0: 5850.5. Samples: 468868724. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:12,764][25689] Avg episode reward: [(0, '-47.588')] [2022-07-09 22:59:14,644][26022] Updated weights on worker 0-0, policy_version 457891 (0.00081) [2022-07-09 22:59:16,230][26022] Updated weights on worker 0-0, policy_version 457901 (0.00097) [2022-07-09 22:59:17,864][25689] Fps is (10 sec: 5676.4, 60 sec: 5669.3, 300 sec: 5666.4). Total num frames: 468898816. Throughput: 0: 5856.2. Samples: 468902784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:17,865][25689] Avg episode reward: [(0, '-46.588')] [2022-07-09 22:59:18,207][26022] Updated weights on worker 0-0, policy_version 457911 (0.00053) [2022-07-09 22:59:19,896][26022] Updated weights on worker 0-0, policy_version 457921 (0.00084) [2022-07-09 22:59:21,749][26022] Updated weights on worker 0-0, policy_version 457931 (0.00080) [2022-07-09 22:59:22,884][25689] Fps is (10 sec: 5565.3, 60 sec: 5669.2, 300 sec: 5663.4). Total num frames: 468926464. Throughput: 0: 5037.8. Samples: 468919960. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:22,884][25689] Avg episode reward: [(0, '-46.602')] [2022-07-09 22:59:23,566][26022] Updated weights on worker 0-0, policy_version 457941 (0.00090) [2022-07-09 22:59:25,290][26022] Updated weights on worker 0-0, policy_version 457951 (0.00085) [2022-07-09 22:59:27,150][26022] Updated weights on worker 0-0, policy_version 457961 (0.00088) [2022-07-09 22:59:27,917][25689] Fps is (10 sec: 5703.8, 60 sec: 5686.7, 300 sec: 5670.5). Total num frames: 468956160. Throughput: 0: 5952.6. Samples: 468954308. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:27,918][25689] Avg episode reward: [(0, '-47.700')] [2022-07-09 22:59:28,917][26022] Updated weights on worker 0-0, policy_version 457971 (0.00090) [2022-07-09 22:59:30,670][26022] Updated weights on worker 0-0, policy_version 457981 (0.00087) [2022-07-09 22:59:32,509][26022] Updated weights on worker 0-0, policy_version 457991 (0.00094) [2022-07-09 22:59:32,935][25689] Fps is (10 sec: 5705.3, 60 sec: 5654.7, 300 sec: 5661.2). Total num frames: 468983808. Throughput: 0: 5935.3. Samples: 468988444. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:32,936][25689] Avg episode reward: [(0, '-48.482')] [2022-07-09 22:59:34,304][26022] Updated weights on worker 0-0, policy_version 458001 (0.00090) [2022-07-09 22:59:36,351][26022] Updated weights on worker 0-0, policy_version 458011 (0.00095) [2022-07-09 22:59:37,687][26022] Updated weights on worker 0-0, policy_version 458021 (0.00094) [2022-07-09 22:59:37,984][25689] Fps is (10 sec: 5899.6, 60 sec: 5728.6, 300 sec: 5671.7). Total num frames: 469015552. Throughput: 0: 5971.1. Samples: 469022928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:37,986][25689] Avg episode reward: [(0, '-47.281')] [2022-07-09 22:59:38,692][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 22:59:38,711][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000458026_469018624.pth [2022-07-09 22:59:38,712][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000456031_466975744.pth [2022-07-09 22:59:39,880][26022] Updated weights on worker 0-0, policy_version 458031 (0.00083) [2022-07-09 22:59:41,221][26022] Updated weights on worker 0-0, policy_version 458041 (0.00084) [2022-07-09 22:59:42,991][25689] Fps is (10 sec: 5702.4, 60 sec: 5662.2, 300 sec: 5665.1). Total num frames: 469041152. Throughput: 0: 5971.6. Samples: 469040032. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:42,993][25689] Avg episode reward: [(0, '-48.028')] [2022-07-09 22:59:43,471][26022] Updated weights on worker 0-0, policy_version 458051 (0.00086) [2022-07-09 22:59:45,052][26022] Updated weights on worker 0-0, policy_version 458061 (0.00089) [2022-07-09 22:59:46,800][26022] Updated weights on worker 0-0, policy_version 458071 (0.00093) [2022-07-09 22:59:48,027][25689] Fps is (10 sec: 5404.4, 60 sec: 5659.1, 300 sec: 5661.3). Total num frames: 469069824. Throughput: 0: 5971.9. Samples: 469074398. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:48,028][25689] Avg episode reward: [(0, '-48.675')] [2022-07-09 22:59:48,828][26022] Updated weights on worker 0-0, policy_version 458081 (0.00080) [2022-07-09 22:59:50,264][26022] Updated weights on worker 0-0, policy_version 458091 (0.00086) [2022-07-09 22:59:52,280][26022] Updated weights on worker 0-0, policy_version 458101 (0.00099) [2022-07-09 22:59:53,039][25689] Fps is (10 sec: 5808.5, 60 sec: 5658.4, 300 sec: 5663.2). Total num frames: 469099520. Throughput: 0: 5971.0. Samples: 469108490. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:53,042][25689] Avg episode reward: [(0, '-47.977')] [2022-07-09 22:59:54,063][26022] Updated weights on worker 0-0, policy_version 458111 (0.00085) [2022-07-09 22:59:55,871][26022] Updated weights on worker 0-0, policy_version 458121 (0.00089) [2022-07-09 22:59:57,814][26022] Updated weights on worker 0-0, policy_version 458131 (0.00089) [2022-07-09 22:59:58,106][25689] Fps is (10 sec: 5790.8, 60 sec: 5660.9, 300 sec: 5665.8). Total num frames: 469128192. Throughput: 0: 5096.1. Samples: 469125470. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 22:59:58,108][25689] Avg episode reward: [(0, '-47.232')] [2022-07-09 22:59:59,438][26022] Updated weights on worker 0-0, policy_version 458141 (0.00086) [2022-07-09 23:00:01,344][26022] Updated weights on worker 0-0, policy_version 458151 (0.00085) [2022-07-09 23:00:03,119][25689] Fps is (10 sec: 5384.4, 60 sec: 5646.5, 300 sec: 5662.5). Total num frames: 469153792. Throughput: 0: 5853.2. Samples: 469157846. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-09 23:00:03,121][25689] Avg episode reward: [(0, '-47.046')] [2022-07-09 23:00:03,616][26022] Updated weights on worker 0-0, policy_version 458161 (0.00082) [2022-07-09 23:00:05,245][26022] Updated weights on worker 0-0, policy_version 458171 (0.00084) [2022-07-09 23:00:07,063][26022] Updated weights on worker 0-0, policy_version 458181 (0.00088) [2022-07-09 23:00:08,137][25689] Fps is (10 sec: 5308.2, 60 sec: 5630.9, 300 sec: 5659.0). Total num frames: 469181440. Throughput: 0: 5837.6. Samples: 469191796. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:08,139][25689] Avg episode reward: [(0, '-46.992')] [2022-07-09 23:00:08,839][26022] Updated weights on worker 0-0, policy_version 458191 (0.00085) [2022-07-09 23:00:10,680][26022] Updated weights on worker 0-0, policy_version 458201 (0.00087) [2022-07-09 23:00:12,610][26022] Updated weights on worker 0-0, policy_version 458211 (0.00093) [2022-07-09 23:00:13,165][25689] Fps is (10 sec: 5809.8, 60 sec: 5662.4, 300 sec: 5666.2). Total num frames: 469212160. Throughput: 0: 4976.5. Samples: 469208646. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:13,167][25689] Avg episode reward: [(0, '-46.702')] [2022-07-09 23:00:14,307][26022] Updated weights on worker 0-0, policy_version 458221 (0.00089) [2022-07-09 23:00:16,121][26022] Updated weights on worker 0-0, policy_version 458231 (0.00092) [2022-07-09 23:00:17,944][26022] Updated weights on worker 0-0, policy_version 458241 (0.00089) [2022-07-09 23:00:18,257][25689] Fps is (10 sec: 5768.0, 60 sec: 5646.2, 300 sec: 5661.6). Total num frames: 469239808. Throughput: 0: 5828.4. Samples: 469242916. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:18,257][25689] Avg episode reward: [(0, '-47.031')] [2022-07-09 23:00:19,643][26022] Updated weights on worker 0-0, policy_version 458251 (0.00091) [2022-07-09 23:00:21,594][26022] Updated weights on worker 0-0, policy_version 458261 (0.00072) [2022-07-09 23:00:23,282][25689] Fps is (10 sec: 5465.7, 60 sec: 5645.7, 300 sec: 5664.7). Total num frames: 469267456. Throughput: 0: 5904.1. Samples: 469276892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:23,283][25689] Avg episode reward: [(0, '-47.854')] [2022-07-09 23:00:23,434][26022] Updated weights on worker 0-0, policy_version 458271 (0.00085) [2022-07-09 23:00:25,003][26022] Updated weights on worker 0-0, policy_version 458281 (0.00086) [2022-07-09 23:00:26,979][26022] Updated weights on worker 0-0, policy_version 458291 (0.00088) [2022-07-09 23:00:28,299][25689] Fps is (10 sec: 5812.3, 60 sec: 5664.3, 300 sec: 5668.0). Total num frames: 469298176. Throughput: 0: 5060.2. Samples: 469293816. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:28,299][25689] Avg episode reward: [(0, '-47.516')] [2022-07-09 23:00:28,780][26022] Updated weights on worker 0-0, policy_version 458301 (0.00089) [2022-07-09 23:00:30,648][26022] Updated weights on worker 0-0, policy_version 458311 (0.00099) [2022-07-09 23:00:32,391][26022] Updated weights on worker 0-0, policy_version 458321 (0.00087) [2022-07-09 23:00:33,325][25689] Fps is (10 sec: 5812.2, 60 sec: 5663.4, 300 sec: 5668.5). Total num frames: 469325824. Throughput: 0: 5931.4. Samples: 469328218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:33,325][25689] Avg episode reward: [(0, '-47.770')] [2022-07-09 23:00:34,103][26022] Updated weights on worker 0-0, policy_version 458331 (0.00081) [2022-07-09 23:00:35,923][26022] Updated weights on worker 0-0, policy_version 458341 (0.00049) [2022-07-09 23:00:37,900][26022] Updated weights on worker 0-0, policy_version 458351 (0.00097) [2022-07-09 23:00:38,454][25689] Fps is (10 sec: 5545.7, 60 sec: 5605.2, 300 sec: 5662.9). Total num frames: 469354496. Throughput: 0: 5921.0. Samples: 469362506. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:38,455][25689] Avg episode reward: [(0, '-47.931')] [2022-07-09 23:00:39,368][26022] Updated weights on worker 0-0, policy_version 458361 (0.00107) [2022-07-09 23:00:41,412][26022] Updated weights on worker 0-0, policy_version 458371 (0.00087) [2022-07-09 23:00:43,087][26022] Updated weights on worker 0-0, policy_version 458381 (0.00088) [2022-07-09 23:00:43,473][25689] Fps is (10 sec: 5751.7, 60 sec: 5671.8, 300 sec: 5669.7). Total num frames: 469384192. Throughput: 0: 5083.8. Samples: 469379536. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:43,473][25689] Avg episode reward: [(0, '-47.257')] [2022-07-09 23:00:44,905][26022] Updated weights on worker 0-0, policy_version 458391 (0.00089) [2022-07-09 23:00:46,827][26022] Updated weights on worker 0-0, policy_version 458401 (0.00088) [2022-07-09 23:00:48,511][25689] Fps is (10 sec: 5702.4, 60 sec: 5654.7, 300 sec: 5666.0). Total num frames: 469411840. Throughput: 0: 5931.7. Samples: 469413706. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:48,511][25689] Avg episode reward: [(0, '-47.428')] [2022-07-09 23:00:48,704][26022] Updated weights on worker 0-0, policy_version 458411 (0.00090) [2022-07-09 23:00:50,400][26022] Updated weights on worker 0-0, policy_version 458421 (0.00086) [2022-07-09 23:00:52,073][26022] Updated weights on worker 0-0, policy_version 458431 (0.00084) [2022-07-09 23:00:53,584][25689] Fps is (10 sec: 5570.2, 60 sec: 5632.1, 300 sec: 5663.1). Total num frames: 469440512. Throughput: 0: 5915.7. Samples: 469448064. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:53,584][25689] Avg episode reward: [(0, '-47.844')] [2022-07-09 23:00:54,026][26022] Updated weights on worker 0-0, policy_version 458441 (0.00094) [2022-07-09 23:00:55,736][26022] Updated weights on worker 0-0, policy_version 458451 (0.00093) [2022-07-09 23:00:57,690][26022] Updated weights on worker 0-0, policy_version 458461 (0.00094) [2022-07-09 23:00:58,641][25689] Fps is (10 sec: 5761.4, 60 sec: 5649.9, 300 sec: 5665.7). Total num frames: 469470208. Throughput: 0: 5075.1. Samples: 469464958. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:00:58,642][25689] Avg episode reward: [(0, '-46.607')] [2022-07-09 23:00:59,194][26022] Updated weights on worker 0-0, policy_version 458471 (0.00088) [2022-07-09 23:01:01,295][26022] Updated weights on worker 0-0, policy_version 458481 (0.00075) [2022-07-09 23:01:03,375][26022] Updated weights on worker 0-0, policy_version 458491 (0.00089) [2022-07-09 23:01:03,645][25689] Fps is (10 sec: 5597.6, 60 sec: 5667.6, 300 sec: 5666.3). Total num frames: 469496832. Throughput: 0: 5833.5. Samples: 469497210. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:03,646][25689] Avg episode reward: [(0, '-46.375')] [2022-07-09 23:01:04,988][26022] Updated weights on worker 0-0, policy_version 458501 (0.00088) [2022-07-09 23:01:06,795][26022] Updated weights on worker 0-0, policy_version 458511 (0.00092) [2022-07-09 23:01:08,571][26022] Updated weights on worker 0-0, policy_version 458521 (0.00086) [2022-07-09 23:01:08,657][25689] Fps is (10 sec: 5520.9, 60 sec: 5685.1, 300 sec: 5666.1). Total num frames: 469525504. Throughput: 0: 5865.7. Samples: 469531880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:08,658][25689] Avg episode reward: [(0, '-46.415')] [2022-07-09 23:01:10,317][26022] Updated weights on worker 0-0, policy_version 458531 (0.00083) [2022-07-09 23:01:12,287][26022] Updated weights on worker 0-0, policy_version 458541 (0.00091) [2022-07-09 23:01:13,664][25689] Fps is (10 sec: 5825.8, 60 sec: 5670.2, 300 sec: 5670.4). Total num frames: 469555200. Throughput: 0: 5035.7. Samples: 469549182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:13,666][25689] Avg episode reward: [(0, '-45.877')] [2022-07-09 23:01:13,755][26022] Updated weights on worker 0-0, policy_version 458551 (0.00081) [2022-07-09 23:01:15,795][26022] Updated weights on worker 0-0, policy_version 458561 (0.00088) [2022-07-09 23:01:17,510][26022] Updated weights on worker 0-0, policy_version 458571 (0.00096) [2022-07-09 23:01:18,798][25689] Fps is (10 sec: 5654.6, 60 sec: 5666.2, 300 sec: 5664.7). Total num frames: 469582848. Throughput: 0: 5913.2. Samples: 469584150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:18,800][25689] Avg episode reward: [(0, '-45.381')] [2022-07-09 23:01:19,284][26022] Updated weights on worker 0-0, policy_version 458581 (0.00098) [2022-07-09 23:01:21,072][26022] Updated weights on worker 0-0, policy_version 458591 (0.00088) [2022-07-09 23:01:22,828][26022] Updated weights on worker 0-0, policy_version 458601 (0.00081) [2022-07-09 23:01:23,847][25689] Fps is (10 sec: 5631.3, 60 sec: 5697.8, 300 sec: 5667.3). Total num frames: 469612544. Throughput: 0: 6007.4. Samples: 469618572. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:23,847][25689] Avg episode reward: [(0, '-45.704')] [2022-07-09 23:01:24,724][26022] Updated weights on worker 0-0, policy_version 458611 (0.00101) [2022-07-09 23:01:26,443][26022] Updated weights on worker 0-0, policy_version 458621 (0.00094) [2022-07-09 23:01:28,251][26022] Updated weights on worker 0-0, policy_version 458631 (0.00082) [2022-07-09 23:01:28,890][25689] Fps is (10 sec: 5783.9, 60 sec: 5661.6, 300 sec: 5670.9). Total num frames: 469641216. Throughput: 0: 5113.3. Samples: 469635336. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:28,890][25689] Avg episode reward: [(0, '-45.499')] [2022-07-09 23:01:29,835][26022] Updated weights on worker 0-0, policy_version 458641 (0.00089) [2022-07-09 23:01:31,896][26022] Updated weights on worker 0-0, policy_version 458651 (0.00087) [2022-07-09 23:01:33,474][26022] Updated weights on worker 0-0, policy_version 458661 (0.00086) [2022-07-09 23:01:33,899][25689] Fps is (10 sec: 5704.8, 60 sec: 5680.0, 300 sec: 5669.7). Total num frames: 469669888. Throughput: 0: 5972.1. Samples: 469670028. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:33,899][25689] Avg episode reward: [(0, '-45.652')] [2022-07-09 23:01:35,446][26022] Updated weights on worker 0-0, policy_version 458671 (0.00086) [2022-07-09 23:01:37,194][26022] Updated weights on worker 0-0, policy_version 458681 (0.00086) [2022-07-09 23:01:38,921][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:01:38,934][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000458690_469698560.pth [2022-07-09 23:01:38,935][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000456695_467655680.pth [2022-07-09 23:01:39,036][25689] Fps is (10 sec: 5651.9, 60 sec: 5679.4, 300 sec: 5671.8). Total num frames: 469698560. Throughput: 0: 5940.9. Samples: 469704380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:39,036][25689] Avg episode reward: [(0, '-46.274')] [2022-07-09 23:01:39,234][26022] Updated weights on worker 0-0, policy_version 458691 (0.00082) [2022-07-09 23:01:40,734][26022] Updated weights on worker 0-0, policy_version 458701 (0.00083) [2022-07-09 23:01:42,789][26022] Updated weights on worker 0-0, policy_version 458711 (0.00084) [2022-07-09 23:01:44,088][25689] Fps is (10 sec: 5829.0, 60 sec: 5693.1, 300 sec: 5672.1). Total num frames: 469729280. Throughput: 0: 5076.6. Samples: 469721328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:44,089][25689] Avg episode reward: [(0, '-45.853')] [2022-07-09 23:01:44,183][26022] Updated weights on worker 0-0, policy_version 458721 (0.00091) [2022-07-09 23:01:46,342][26022] Updated weights on worker 0-0, policy_version 458731 (0.00093) [2022-07-09 23:01:47,962][26022] Updated weights on worker 0-0, policy_version 458741 (0.00087) [2022-07-09 23:01:49,102][25689] Fps is (10 sec: 5798.3, 60 sec: 5695.3, 300 sec: 5666.5). Total num frames: 469756928. Throughput: 0: 5965.6. Samples: 469755916. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:49,103][25689] Avg episode reward: [(0, '-45.652')] [2022-07-09 23:01:49,742][26022] Updated weights on worker 0-0, policy_version 458751 (0.00085) [2022-07-09 23:01:51,454][26022] Updated weights on worker 0-0, policy_version 458761 (0.00085) [2022-07-09 23:01:53,353][26022] Updated weights on worker 0-0, policy_version 458771 (0.00085) [2022-07-09 23:01:54,119][25689] Fps is (10 sec: 5716.9, 60 sec: 5717.5, 300 sec: 5674.6). Total num frames: 469786624. Throughput: 0: 5963.5. Samples: 469790608. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:54,119][25689] Avg episode reward: [(0, '-46.078')] [2022-07-09 23:01:55,007][26022] Updated weights on worker 0-0, policy_version 458781 (0.00090) [2022-07-09 23:01:56,981][26022] Updated weights on worker 0-0, policy_version 458791 (0.00091) [2022-07-09 23:01:58,554][26022] Updated weights on worker 0-0, policy_version 458801 (0.00079) [2022-07-09 23:01:59,194][25689] Fps is (10 sec: 5682.2, 60 sec: 5682.1, 300 sec: 5674.3). Total num frames: 469814272. Throughput: 0: 5111.5. Samples: 469807418. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:01:59,195][25689] Avg episode reward: [(0, '-45.963')] [2022-07-09 23:02:00,446][26022] Updated weights on worker 0-0, policy_version 458811 (0.00082) [2022-07-09 23:02:02,768][26022] Updated weights on worker 0-0, policy_version 458821 (0.00085) [2022-07-09 23:02:04,204][25689] Fps is (10 sec: 5482.7, 60 sec: 5698.4, 300 sec: 5671.7). Total num frames: 469841920. Throughput: 0: 5893.5. Samples: 469839882. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:04,205][25689] Avg episode reward: [(0, '-45.819')] [2022-07-09 23:02:04,415][26022] Updated weights on worker 0-0, policy_version 458831 (0.01101) [2022-07-09 23:02:06,277][26022] Updated weights on worker 0-0, policy_version 458841 (0.00085) [2022-07-09 23:02:07,919][26022] Updated weights on worker 0-0, policy_version 458851 (0.00057) [2022-07-09 23:02:09,235][25689] Fps is (10 sec: 5609.2, 60 sec: 5696.6, 300 sec: 5668.9). Total num frames: 469870592. Throughput: 0: 5876.1. Samples: 469874216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:09,235][25689] Avg episode reward: [(0, '-46.336')] [2022-07-09 23:02:09,682][26022] Updated weights on worker 0-0, policy_version 458861 (0.00085) [2022-07-09 23:02:11,670][26022] Updated weights on worker 0-0, policy_version 458871 (0.00078) [2022-07-09 23:02:13,361][26022] Updated weights on worker 0-0, policy_version 458881 (0.00086) [2022-07-09 23:02:14,239][25689] Fps is (10 sec: 5714.7, 60 sec: 5680.0, 300 sec: 5670.5). Total num frames: 469899264. Throughput: 0: 5008.0. Samples: 469891368. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:14,239][25689] Avg episode reward: [(0, '-45.372')] [2022-07-09 23:02:15,268][26022] Updated weights on worker 0-0, policy_version 458891 (0.00089) [2022-07-09 23:02:17,017][26022] Updated weights on worker 0-0, policy_version 458901 (0.00096) [2022-07-09 23:02:18,719][26022] Updated weights on worker 0-0, policy_version 458911 (0.00089) [2022-07-09 23:02:19,333][25689] Fps is (10 sec: 5678.3, 60 sec: 5700.6, 300 sec: 5674.1). Total num frames: 469927936. Throughput: 0: 5887.5. Samples: 469925990. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:19,334][25689] Avg episode reward: [(0, '-44.788')] [2022-07-09 23:02:20,507][26022] Updated weights on worker 0-0, policy_version 458921 (0.00085) [2022-07-09 23:02:22,394][26022] Updated weights on worker 0-0, policy_version 458931 (0.00086) [2022-07-09 23:02:23,975][26022] Updated weights on worker 0-0, policy_version 458941 (0.00086) [2022-07-09 23:02:24,343][25689] Fps is (10 sec: 5776.4, 60 sec: 5704.3, 300 sec: 5677.6). Total num frames: 469957632. Throughput: 0: 5983.3. Samples: 469960380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:24,344][25689] Avg episode reward: [(0, '-45.159')] [2022-07-09 23:02:26,056][26022] Updated weights on worker 0-0, policy_version 458951 (0.00089) [2022-07-09 23:02:27,491][26022] Updated weights on worker 0-0, policy_version 458961 (0.00089) [2022-07-09 23:02:29,349][25689] Fps is (10 sec: 5725.7, 60 sec: 5690.9, 300 sec: 5674.4). Total num frames: 469985280. Throughput: 0: 6000.7. Samples: 469994914. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:29,349][25689] Avg episode reward: [(0, '-44.681')] [2022-07-09 23:02:29,431][26022] Updated weights on worker 0-0, policy_version 458971 (0.00089) [2022-07-09 23:02:31,267][26022] Updated weights on worker 0-0, policy_version 458981 (0.00068) [2022-07-09 23:02:32,935][26022] Updated weights on worker 0-0, policy_version 458991 (0.00085) [2022-07-09 23:02:34,386][25689] Fps is (10 sec: 5709.9, 60 sec: 5705.1, 300 sec: 5675.0). Total num frames: 470014976. Throughput: 0: 5994.4. Samples: 470012140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:34,387][25689] Avg episode reward: [(0, '-45.296')] [2022-07-09 23:02:34,791][26022] Updated weights on worker 0-0, policy_version 459001 (0.00086) [2022-07-09 23:02:36,383][26022] Updated weights on worker 0-0, policy_version 459011 (0.00089) [2022-07-09 23:02:38,582][26022] Updated weights on worker 0-0, policy_version 459021 (0.00093) [2022-07-09 23:02:39,499][25689] Fps is (10 sec: 5851.5, 60 sec: 5724.4, 300 sec: 5679.9). Total num frames: 470044672. Throughput: 0: 5993.4. Samples: 470046848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:39,499][25689] Avg episode reward: [(0, '-46.302')] [2022-07-09 23:02:40,008][26022] Updated weights on worker 0-0, policy_version 459031 (0.00083) [2022-07-09 23:02:41,913][26022] Updated weights on worker 0-0, policy_version 459041 (0.00084) [2022-07-09 23:02:43,677][26022] Updated weights on worker 0-0, policy_version 459051 (0.00087) [2022-07-09 23:02:44,507][25689] Fps is (10 sec: 5666.1, 60 sec: 5677.7, 300 sec: 5676.4). Total num frames: 470072320. Throughput: 0: 5991.3. Samples: 470081186. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:44,507][25689] Avg episode reward: [(0, '-47.165')] [2022-07-09 23:02:45,397][26022] Updated weights on worker 0-0, policy_version 459061 (0.00088) [2022-07-09 23:02:47,258][26022] Updated weights on worker 0-0, policy_version 459071 (0.00088) [2022-07-09 23:02:48,942][26022] Updated weights on worker 0-0, policy_version 459081 (0.00095) [2022-07-09 23:02:49,524][25689] Fps is (10 sec: 5617.8, 60 sec: 5694.4, 300 sec: 5676.9). Total num frames: 470100992. Throughput: 0: 5118.4. Samples: 470098180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:49,525][25689] Avg episode reward: [(0, '-47.433')] [2022-07-09 23:02:50,756][26022] Updated weights on worker 0-0, policy_version 459091 (0.00093) [2022-07-09 23:02:52,764][26022] Updated weights on worker 0-0, policy_version 459101 (0.00088) [2022-07-09 23:02:54,171][26022] Updated weights on worker 0-0, policy_version 459111 (0.00091) [2022-07-09 23:02:54,543][25689] Fps is (10 sec: 5713.8, 60 sec: 5677.2, 300 sec: 5674.6). Total num frames: 470129664. Throughput: 0: 5975.6. Samples: 470132588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-09 23:02:54,543][25689] Avg episode reward: [(0, '-47.500')] [2022-07-09 23:02:56,182][26022] Updated weights on worker 0-0, policy_version 459121 (0.00095) [2022-07-09 23:02:57,869][26022] Updated weights on worker 0-0, policy_version 459131 (0.00092) [2022-07-09 23:02:59,635][25689] Fps is (10 sec: 5671.7, 60 sec: 5692.6, 300 sec: 5678.3). Total num frames: 470158336. Throughput: 0: 5946.6. Samples: 470166588. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:02:59,635][25689] Avg episode reward: [(0, '-47.453')] [2022-07-09 23:02:59,962][26022] Updated weights on worker 0-0, policy_version 459141 (0.00097) [2022-07-09 23:03:01,603][26022] Updated weights on worker 0-0, policy_version 459151 (0.00388) [2022-07-09 23:03:03,892][26022] Updated weights on worker 0-0, policy_version 459161 (0.00082) [2022-07-09 23:03:04,686][25689] Fps is (10 sec: 5451.4, 60 sec: 5671.8, 300 sec: 5674.2). Total num frames: 470184960. Throughput: 0: 5009.3. Samples: 470182270. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:04,687][25689] Avg episode reward: [(0, '-47.071')] [2022-07-09 23:03:05,657][26022] Updated weights on worker 0-0, policy_version 459171 (0.00083) [2022-07-09 23:03:07,475][26022] Updated weights on worker 0-0, policy_version 459181 (0.00086) [2022-07-09 23:03:09,287][26022] Updated weights on worker 0-0, policy_version 459191 (0.00087) [2022-07-09 23:03:09,721][25689] Fps is (10 sec: 5482.1, 60 sec: 5671.4, 300 sec: 5674.2). Total num frames: 470213632. Throughput: 0: 5813.8. Samples: 470215602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:09,722][25689] Avg episode reward: [(0, '-47.018')] [2022-07-09 23:03:11,034][26022] Updated weights on worker 0-0, policy_version 459201 (0.00087) [2022-07-09 23:03:12,791][26022] Updated weights on worker 0-0, policy_version 459211 (0.00082) [2022-07-09 23:03:14,699][26022] Updated weights on worker 0-0, policy_version 459221 (0.00087) [2022-07-09 23:03:14,805][25689] Fps is (10 sec: 5667.2, 60 sec: 5663.9, 300 sec: 5670.0). Total num frames: 470242304. Throughput: 0: 5788.2. Samples: 470249868. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:14,805][25689] Avg episode reward: [(0, '-47.174')] [2022-07-09 23:03:16,298][26022] Updated weights on worker 0-0, policy_version 459231 (0.00083) [2022-07-09 23:03:18,354][26022] Updated weights on worker 0-0, policy_version 459241 (0.00086) [2022-07-09 23:03:19,859][25689] Fps is (10 sec: 5858.2, 60 sec: 5701.5, 300 sec: 5679.6). Total num frames: 470273024. Throughput: 0: 4966.6. Samples: 470267036. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:19,860][25689] Avg episode reward: [(0, '-47.616')] [2022-07-09 23:03:19,862][26022] Updated weights on worker 0-0, policy_version 459251 (0.00096) [2022-07-09 23:03:21,799][26022] Updated weights on worker 0-0, policy_version 459261 (0.00100) [2022-07-09 23:03:23,587][26022] Updated weights on worker 0-0, policy_version 459271 (0.00084) [2022-07-09 23:03:24,956][25689] Fps is (10 sec: 5749.5, 60 sec: 5659.5, 300 sec: 5675.1). Total num frames: 470300672. Throughput: 0: 5881.6. Samples: 470301492. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:24,957][25689] Avg episode reward: [(0, '-47.754')] [2022-07-09 23:03:25,239][26022] Updated weights on worker 0-0, policy_version 459281 (0.00088) [2022-07-09 23:03:27,326][26022] Updated weights on worker 0-0, policy_version 459291 (0.00092) [2022-07-09 23:03:28,893][26022] Updated weights on worker 0-0, policy_version 459301 (0.00082) [2022-07-09 23:03:30,053][25689] Fps is (10 sec: 5424.8, 60 sec: 5651.0, 300 sec: 5667.1). Total num frames: 470328320. Throughput: 0: 5898.3. Samples: 470335524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:30,053][25689] Avg episode reward: [(0, '-47.671')] [2022-07-09 23:03:30,818][26022] Updated weights on worker 0-0, policy_version 459311 (0.00094) [2022-07-09 23:03:32,422][26022] Updated weights on worker 0-0, policy_version 459321 (0.00085) [2022-07-09 23:03:34,316][26022] Updated weights on worker 0-0, policy_version 459331 (0.00091) [2022-07-09 23:03:35,072][25689] Fps is (10 sec: 5769.9, 60 sec: 5669.6, 300 sec: 5679.2). Total num frames: 470359040. Throughput: 0: 5071.4. Samples: 470352654. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:35,073][25689] Avg episode reward: [(0, '-46.732')] [2022-07-09 23:03:36,231][26022] Updated weights on worker 0-0, policy_version 459341 (0.00110) [2022-07-09 23:03:37,923][26022] Updated weights on worker 0-0, policy_version 459351 (0.00088) [2022-07-09 23:03:38,966][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:03:38,980][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000459356_470380544.pth [2022-07-09 23:03:38,980][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000457359_468335616.pth [2022-07-09 23:03:39,597][26022] Updated weights on worker 0-0, policy_version 459361 (0.00090) [2022-07-09 23:03:40,207][25689] Fps is (10 sec: 5949.5, 60 sec: 5667.5, 300 sec: 5677.0). Total num frames: 470388736. Throughput: 0: 5908.3. Samples: 470387258. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:40,208][25689] Avg episode reward: [(0, '-46.517')] [2022-07-09 23:03:41,327][26022] Updated weights on worker 0-0, policy_version 459371 (0.00086) [2022-07-09 23:03:43,310][26022] Updated weights on worker 0-0, policy_version 459381 (0.00086) [2022-07-09 23:03:45,052][26022] Updated weights on worker 0-0, policy_version 459391 (0.00090) [2022-07-09 23:03:45,209][25689] Fps is (10 sec: 5758.1, 60 sec: 5684.9, 300 sec: 5677.1). Total num frames: 470417408. Throughput: 0: 5941.9. Samples: 470421830. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:45,210][25689] Avg episode reward: [(0, '-46.081')] [2022-07-09 23:03:46,922][26022] Updated weights on worker 0-0, policy_version 459401 (0.00082) [2022-07-09 23:03:48,603][26022] Updated weights on worker 0-0, policy_version 459411 (0.00092) [2022-07-09 23:03:50,220][25689] Fps is (10 sec: 5625.1, 60 sec: 5668.7, 300 sec: 5670.1). Total num frames: 470445056. Throughput: 0: 5124.7. Samples: 470438874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:50,220][25689] Avg episode reward: [(0, '-46.527')] [2022-07-09 23:03:50,520][26022] Updated weights on worker 0-0, policy_version 459421 (0.00085) [2022-07-09 23:03:52,209][26022] Updated weights on worker 0-0, policy_version 459431 (0.00094) [2022-07-09 23:03:54,104][26022] Updated weights on worker 0-0, policy_version 459441 (0.00090) [2022-07-09 23:03:55,245][25689] Fps is (10 sec: 5509.8, 60 sec: 5651.2, 300 sec: 5667.9). Total num frames: 470472704. Throughput: 0: 5962.6. Samples: 470472936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:03:55,245][25689] Avg episode reward: [(0, '-46.787')] [2022-07-09 23:03:55,963][26022] Updated weights on worker 0-0, policy_version 459451 (0.00091) [2022-07-09 23:03:57,559][26022] Updated weights on worker 0-0, policy_version 459461 (0.00087) [2022-07-09 23:03:59,487][26022] Updated weights on worker 0-0, policy_version 459471 (0.00087) [2022-07-09 23:04:00,343][25689] Fps is (10 sec: 5664.7, 60 sec: 5667.5, 300 sec: 5677.2). Total num frames: 470502400. Throughput: 0: 5943.4. Samples: 470506930. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:00,343][25689] Avg episode reward: [(0, '-47.133')] [2022-07-09 23:04:01,314][26022] Updated weights on worker 0-0, policy_version 459481 (0.00086) [2022-07-09 23:04:03,491][26022] Updated weights on worker 0-0, policy_version 459491 (0.00087) [2022-07-09 23:04:05,223][26022] Updated weights on worker 0-0, policy_version 459501 (0.00083) [2022-07-09 23:04:05,349][25689] Fps is (10 sec: 5574.1, 60 sec: 5671.8, 300 sec: 5670.8). Total num frames: 470529024. Throughput: 0: 4973.3. Samples: 470521992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:05,349][25689] Avg episode reward: [(0, '-48.405')] [2022-07-09 23:04:06,916][26022] Updated weights on worker 0-0, policy_version 459511 (0.00088) [2022-07-09 23:04:08,754][26022] Updated weights on worker 0-0, policy_version 459521 (0.00099) [2022-07-09 23:04:10,359][25689] Fps is (10 sec: 5418.2, 60 sec: 5657.2, 300 sec: 5667.2). Total num frames: 470556672. Throughput: 0: 5839.5. Samples: 470556480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:10,360][25689] Avg episode reward: [(0, '-48.900')] [2022-07-09 23:04:10,779][26022] Updated weights on worker 0-0, policy_version 459531 (0.00349) [2022-07-09 23:04:12,169][26022] Updated weights on worker 0-0, policy_version 459541 (0.00086) [2022-07-09 23:04:14,439][26022] Updated weights on worker 0-0, policy_version 459551 (0.00093) [2022-07-09 23:04:15,371][25689] Fps is (10 sec: 5925.9, 60 sec: 5714.6, 300 sec: 5679.2). Total num frames: 470588416. Throughput: 0: 5870.3. Samples: 470591084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:15,372][25689] Avg episode reward: [(0, '-48.816')] [2022-07-09 23:04:15,889][26022] Updated weights on worker 0-0, policy_version 459561 (0.00092) [2022-07-09 23:04:17,752][26022] Updated weights on worker 0-0, policy_version 459571 (0.00093) [2022-07-09 23:04:19,687][26022] Updated weights on worker 0-0, policy_version 459581 (0.00088) [2022-07-09 23:04:20,410][25689] Fps is (10 sec: 5807.2, 60 sec: 5648.4, 300 sec: 5675.4). Total num frames: 470615040. Throughput: 0: 5050.7. Samples: 470608284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:20,411][25689] Avg episode reward: [(0, '-48.067')] [2022-07-09 23:04:21,397][26022] Updated weights on worker 0-0, policy_version 459591 (0.00091) [2022-07-09 23:04:23,094][26022] Updated weights on worker 0-0, policy_version 459601 (0.00083) [2022-07-09 23:04:25,118][26022] Updated weights on worker 0-0, policy_version 459611 (0.00087) [2022-07-09 23:04:25,437][25689] Fps is (10 sec: 5493.3, 60 sec: 5671.9, 300 sec: 5672.1). Total num frames: 470643712. Throughput: 0: 6011.2. Samples: 470642748. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:25,438][25689] Avg episode reward: [(0, '-47.246')] [2022-07-09 23:04:26,620][26022] Updated weights on worker 0-0, policy_version 459621 (0.00093) [2022-07-09 23:04:28,685][26022] Updated weights on worker 0-0, policy_version 459631 (0.00088) [2022-07-09 23:04:30,156][26022] Updated weights on worker 0-0, policy_version 459641 (0.00088) [2022-07-09 23:04:30,439][25689] Fps is (10 sec: 5718.1, 60 sec: 5697.7, 300 sec: 5675.8). Total num frames: 470672384. Throughput: 0: 5987.6. Samples: 470676708. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:30,439][25689] Avg episode reward: [(0, '-47.132')] [2022-07-09 23:04:32,210][26022] Updated weights on worker 0-0, policy_version 459651 (0.00094) [2022-07-09 23:04:34,126][26022] Updated weights on worker 0-0, policy_version 459661 (0.00086) [2022-07-09 23:04:35,450][25689] Fps is (10 sec: 5726.9, 60 sec: 5664.6, 300 sec: 5666.2). Total num frames: 470701056. Throughput: 0: 5121.1. Samples: 470693912. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:35,451][25689] Avg episode reward: [(0, '-46.210')] [2022-07-09 23:04:35,629][26022] Updated weights on worker 0-0, policy_version 459671 (0.00091) [2022-07-09 23:04:37,622][26022] Updated weights on worker 0-0, policy_version 459681 (0.00087) [2022-07-09 23:04:39,235][26022] Updated weights on worker 0-0, policy_version 459691 (0.00083) [2022-07-09 23:04:40,492][25689] Fps is (10 sec: 5602.2, 60 sec: 5639.4, 300 sec: 5672.4). Total num frames: 470728704. Throughput: 0: 5963.6. Samples: 470728044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:40,492][25689] Avg episode reward: [(0, '-45.555')] [2022-07-09 23:04:41,043][26022] Updated weights on worker 0-0, policy_version 459701 (0.00091) [2022-07-09 23:04:43,006][26022] Updated weights on worker 0-0, policy_version 459711 (0.00086) [2022-07-09 23:04:44,548][26022] Updated weights on worker 0-0, policy_version 459721 (0.00084) [2022-07-09 23:04:45,498][25689] Fps is (10 sec: 5605.3, 60 sec: 5639.0, 300 sec: 5673.0). Total num frames: 470757376. Throughput: 0: 5965.4. Samples: 470762418. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:45,500][25689] Avg episode reward: [(0, '-46.120')] [2022-07-09 23:04:46,563][26022] Updated weights on worker 0-0, policy_version 459731 (0.00087) [2022-07-09 23:04:48,234][26022] Updated weights on worker 0-0, policy_version 459741 (0.00085) [2022-07-09 23:04:49,899][26022] Updated weights on worker 0-0, policy_version 459751 (0.00079) [2022-07-09 23:04:50,515][25689] Fps is (10 sec: 5823.6, 60 sec: 5672.4, 300 sec: 5672.9). Total num frames: 470787072. Throughput: 0: 5119.0. Samples: 470779476. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:50,516][25689] Avg episode reward: [(0, '-46.167')] [2022-07-09 23:04:51,802][26022] Updated weights on worker 0-0, policy_version 459761 (0.00090) [2022-07-09 23:04:53,634][26022] Updated weights on worker 0-0, policy_version 459771 (0.00084) [2022-07-09 23:04:55,307][26022] Updated weights on worker 0-0, policy_version 459781 (0.00094) [2022-07-09 23:04:55,526][25689] Fps is (10 sec: 5922.7, 60 sec: 5707.7, 300 sec: 5677.4). Total num frames: 470816768. Throughput: 0: 5990.5. Samples: 470814174. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:04:55,528][25689] Avg episode reward: [(0, '-46.706')] [2022-07-09 23:04:57,420][26022] Updated weights on worker 0-0, policy_version 459791 (0.00090) [2022-07-09 23:04:58,974][26022] Updated weights on worker 0-0, policy_version 459801 (0.00091) [2022-07-09 23:05:00,596][25689] Fps is (10 sec: 5586.3, 60 sec: 5659.3, 300 sec: 5679.8). Total num frames: 470843392. Throughput: 0: 5974.2. Samples: 470848152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:00,597][25689] Avg episode reward: [(0, '-46.480')] [2022-07-09 23:05:00,951][26022] Updated weights on worker 0-0, policy_version 459811 (0.00085) [2022-07-09 23:05:02,980][26022] Updated weights on worker 0-0, policy_version 459821 (0.00090) [2022-07-09 23:05:04,804][26022] Updated weights on worker 0-0, policy_version 459831 (0.00090) [2022-07-09 23:05:05,677][25689] Fps is (10 sec: 5447.5, 60 sec: 5686.3, 300 sec: 5682.0). Total num frames: 470872064. Throughput: 0: 5847.9. Samples: 470880420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:05,678][25689] Avg episode reward: [(0, '-47.119')] [2022-07-09 23:05:06,621][26022] Updated weights on worker 0-0, policy_version 459841 (0.00088) [2022-07-09 23:05:08,399][26022] Updated weights on worker 0-0, policy_version 459851 (0.00089) [2022-07-09 23:05:10,227][26022] Updated weights on worker 0-0, policy_version 459861 (0.00086) [2022-07-09 23:05:10,694][25689] Fps is (10 sec: 5577.4, 60 sec: 5685.6, 300 sec: 5671.9). Total num frames: 470899712. Throughput: 0: 5861.0. Samples: 470897748. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:10,695][25689] Avg episode reward: [(0, '-47.670')] [2022-07-09 23:05:11,928][26022] Updated weights on worker 0-0, policy_version 459871 (0.00082) [2022-07-09 23:05:13,571][26022] Updated weights on worker 0-0, policy_version 459881 (0.00083) [2022-07-09 23:05:15,688][26022] Updated weights on worker 0-0, policy_version 459891 (0.00089) [2022-07-09 23:05:15,781][25689] Fps is (10 sec: 5573.6, 60 sec: 5627.6, 300 sec: 5675.4). Total num frames: 470928384. Throughput: 0: 5835.4. Samples: 470932374. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:15,782][25689] Avg episode reward: [(0, '-47.858')] [2022-07-09 23:05:17,247][26022] Updated weights on worker 0-0, policy_version 459901 (0.00085) [2022-07-09 23:05:19,209][26022] Updated weights on worker 0-0, policy_version 459911 (0.00085) [2022-07-09 23:05:20,781][26022] Updated weights on worker 0-0, policy_version 459921 (0.00091) [2022-07-09 23:05:20,838][25689] Fps is (10 sec: 5855.0, 60 sec: 5693.8, 300 sec: 5685.1). Total num frames: 470959104. Throughput: 0: 5861.2. Samples: 470966794. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:20,839][25689] Avg episode reward: [(0, '-47.485')] [2022-07-09 23:05:22,634][26022] Updated weights on worker 0-0, policy_version 459931 (0.00090) [2022-07-09 23:05:24,395][26022] Updated weights on worker 0-0, policy_version 459941 (0.00084) [2022-07-09 23:05:25,841][25689] Fps is (10 sec: 5802.5, 60 sec: 5679.2, 300 sec: 5675.1). Total num frames: 470986752. Throughput: 0: 5133.1. Samples: 470983924. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:25,841][25689] Avg episode reward: [(0, '-47.461')] [2022-07-09 23:05:26,162][26022] Updated weights on worker 0-0, policy_version 459951 (0.00093) [2022-07-09 23:05:28,067][26022] Updated weights on worker 0-0, policy_version 459961 (0.00082) [2022-07-09 23:05:29,774][26022] Updated weights on worker 0-0, policy_version 459971 (0.00086) [2022-07-09 23:05:30,845][25689] Fps is (10 sec: 5525.7, 60 sec: 5661.9, 300 sec: 5675.5). Total num frames: 471014400. Throughput: 0: 5972.9. Samples: 471018108. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:30,846][25689] Avg episode reward: [(0, '-47.227')] [2022-07-09 23:05:31,529][26022] Updated weights on worker 0-0, policy_version 459981 (0.00091) [2022-07-09 23:05:33,499][26022] Updated weights on worker 0-0, policy_version 459991 (0.00092) [2022-07-09 23:05:35,158][26022] Updated weights on worker 0-0, policy_version 460001 (0.00084) [2022-07-09 23:05:35,875][25689] Fps is (10 sec: 5612.8, 60 sec: 5660.2, 300 sec: 5677.4). Total num frames: 471043072. Throughput: 0: 5994.5. Samples: 471052822. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:35,875][25689] Avg episode reward: [(0, '-47.882')] [2022-07-09 23:05:37,016][26022] Updated weights on worker 0-0, policy_version 460011 (0.00090) [2022-07-09 23:05:38,739][26022] Updated weights on worker 0-0, policy_version 460021 (0.00093) [2022-07-09 23:05:39,221][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:05:39,233][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000460022_471062528.pth [2022-07-09 23:05:39,234][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000458026_469018624.pth [2022-07-09 23:05:40,536][26022] Updated weights on worker 0-0, policy_version 460031 (0.00090) [2022-07-09 23:05:40,931][25689] Fps is (10 sec: 5787.0, 60 sec: 5692.7, 300 sec: 5676.7). Total num frames: 471072768. Throughput: 0: 5131.5. Samples: 471069902. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:05:40,932][25689] Avg episode reward: [(0, '-47.382')] [2022-07-09 23:05:42,584][26022] Updated weights on worker 0-0, policy_version 460041 (0.00085) [2022-07-09 23:05:44,319][26022] Updated weights on worker 0-0, policy_version 460051 (0.00093) [2022-07-09 23:05:45,942][25689] Fps is (10 sec: 5797.7, 60 sec: 5692.3, 300 sec: 5680.6). Total num frames: 471101440. Throughput: 0: 5965.4. Samples: 471103838. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:05:45,943][25689] Avg episode reward: [(0, '-47.151')] [2022-07-09 23:05:46,042][26022] Updated weights on worker 0-0, policy_version 460061 (0.00989) [2022-07-09 23:05:47,872][26022] Updated weights on worker 0-0, policy_version 460071 (0.00090) [2022-07-09 23:05:49,399][26022] Updated weights on worker 0-0, policy_version 460081 (0.00084) [2022-07-09 23:05:50,953][25689] Fps is (10 sec: 5619.6, 60 sec: 5658.9, 300 sec: 5678.4). Total num frames: 471129088. Throughput: 0: 5977.8. Samples: 471138310. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:05:50,954][25689] Avg episode reward: [(0, '-47.817')] [2022-07-09 23:05:51,603][26022] Updated weights on worker 0-0, policy_version 460091 (0.00079) [2022-07-09 23:05:53,089][26022] Updated weights on worker 0-0, policy_version 460101 (0.00097) [2022-07-09 23:05:54,982][26022] Updated weights on worker 0-0, policy_version 460111 (0.00091) [2022-07-09 23:05:55,962][25689] Fps is (10 sec: 5723.3, 60 sec: 5659.2, 300 sec: 5679.3). Total num frames: 471158784. Throughput: 0: 5110.0. Samples: 471155466. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:05:55,962][25689] Avg episode reward: [(0, '-48.926')] [2022-07-09 23:05:56,783][26022] Updated weights on worker 0-0, policy_version 460121 (0.00083) [2022-07-09 23:05:58,636][26022] Updated weights on worker 0-0, policy_version 460131 (0.00090) [2022-07-09 23:06:00,313][26022] Updated weights on worker 0-0, policy_version 460141 (0.00080) [2022-07-09 23:06:01,010][25689] Fps is (10 sec: 5905.7, 60 sec: 5712.1, 300 sec: 5688.8). Total num frames: 471188480. Throughput: 0: 5955.8. Samples: 471189488. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:01,011][25689] Avg episode reward: [(0, '-48.528')] [2022-07-09 23:06:02,611][26022] Updated weights on worker 0-0, policy_version 460151 (0.00095) [2022-07-09 23:06:04,109][26022] Updated weights on worker 0-0, policy_version 460161 (0.00090) [2022-07-09 23:06:06,015][25689] Fps is (10 sec: 5398.6, 60 sec: 5651.4, 300 sec: 5675.2). Total num frames: 471213056. Throughput: 0: 5861.7. Samples: 471221496. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:06,015][25689] Avg episode reward: [(0, '-46.738')] [2022-07-09 23:06:06,451][26022] Updated weights on worker 0-0, policy_version 460171 (0.00095) [2022-07-09 23:06:07,752][26022] Updated weights on worker 0-0, policy_version 460181 (0.00090) [2022-07-09 23:06:09,976][26022] Updated weights on worker 0-0, policy_version 460191 (0.00104) [2022-07-09 23:06:11,082][25689] Fps is (10 sec: 5388.6, 60 sec: 5680.6, 300 sec: 5674.0). Total num frames: 471242752. Throughput: 0: 4973.0. Samples: 471238408. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:11,083][25689] Avg episode reward: [(0, '-46.752')] [2022-07-09 23:06:11,698][26022] Updated weights on worker 0-0, policy_version 460201 (0.00089) [2022-07-09 23:06:13,419][26022] Updated weights on worker 0-0, policy_version 460211 (0.00102) [2022-07-09 23:06:15,120][26022] Updated weights on worker 0-0, policy_version 460221 (0.00092) [2022-07-09 23:06:16,095][25689] Fps is (10 sec: 5892.0, 60 sec: 5704.6, 300 sec: 5683.2). Total num frames: 471272448. Throughput: 0: 5836.6. Samples: 471272976. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:16,100][25689] Avg episode reward: [(0, '-46.540')] [2022-07-09 23:06:17,136][26022] Updated weights on worker 0-0, policy_version 460231 (0.00091) [2022-07-09 23:06:18,567][26022] Updated weights on worker 0-0, policy_version 460241 (0.00087) [2022-07-09 23:06:20,719][26022] Updated weights on worker 0-0, policy_version 460251 (0.00086) [2022-07-09 23:06:21,165][25689] Fps is (10 sec: 5788.5, 60 sec: 5669.4, 300 sec: 5679.4). Total num frames: 471301120. Throughput: 0: 5853.8. Samples: 471307472. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:21,167][25689] Avg episode reward: [(0, '-46.111')] [2022-07-09 23:06:22,146][26022] Updated weights on worker 0-0, policy_version 460261 (0.00616) [2022-07-09 23:06:24,135][26022] Updated weights on worker 0-0, policy_version 460271 (0.00090) [2022-07-09 23:06:25,784][26022] Updated weights on worker 0-0, policy_version 460281 (0.00095) [2022-07-09 23:06:26,212][25689] Fps is (10 sec: 5566.8, 60 sec: 5665.2, 300 sec: 5675.8). Total num frames: 471328768. Throughput: 0: 5103.8. Samples: 471324578. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:26,212][25689] Avg episode reward: [(0, '-45.293')] [2022-07-09 23:06:27,652][26022] Updated weights on worker 0-0, policy_version 460291 (0.00092) [2022-07-09 23:06:29,622][26022] Updated weights on worker 0-0, policy_version 460301 (0.00093) [2022-07-09 23:06:31,267][25689] Fps is (10 sec: 5575.6, 60 sec: 5677.5, 300 sec: 5675.0). Total num frames: 471357440. Throughput: 0: 5961.5. Samples: 471358738. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:31,268][25689] Avg episode reward: [(0, '-45.650')] [2022-07-09 23:06:31,483][26022] Updated weights on worker 0-0, policy_version 460311 (0.00085) [2022-07-09 23:06:33,169][26022] Updated weights on worker 0-0, policy_version 460321 (0.00085) [2022-07-09 23:06:35,054][26022] Updated weights on worker 0-0, policy_version 460331 (0.00092) [2022-07-09 23:06:36,302][25689] Fps is (10 sec: 5886.1, 60 sec: 5710.8, 300 sec: 5683.8). Total num frames: 471388160. Throughput: 0: 5935.0. Samples: 471392906. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:36,304][25689] Avg episode reward: [(0, '-46.425')] [2022-07-09 23:06:36,444][26022] Updated weights on worker 0-0, policy_version 460341 (0.00092) [2022-07-09 23:06:38,699][26022] Updated weights on worker 0-0, policy_version 460351 (0.00083) [2022-07-09 23:06:40,268][26022] Updated weights on worker 0-0, policy_version 460361 (0.00086) [2022-07-09 23:06:41,394][25689] Fps is (10 sec: 5662.1, 60 sec: 5656.6, 300 sec: 5669.3). Total num frames: 471414784. Throughput: 0: 5062.0. Samples: 471409870. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:41,395][25689] Avg episode reward: [(0, '-47.229')] [2022-07-09 23:06:42,056][26022] Updated weights on worker 0-0, policy_version 460371 (0.00088) [2022-07-09 23:06:43,929][26022] Updated weights on worker 0-0, policy_version 460381 (0.00093) [2022-07-09 23:06:45,450][26022] Updated weights on worker 0-0, policy_version 460391 (0.00086) [2022-07-09 23:06:46,486][25689] Fps is (10 sec: 5429.9, 60 sec: 5649.1, 300 sec: 5671.2). Total num frames: 471443456. Throughput: 0: 5909.5. Samples: 471444386. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:46,486][25689] Avg episode reward: [(0, '-46.061')] [2022-07-09 23:06:47,507][26022] Updated weights on worker 0-0, policy_version 460401 (0.00086) [2022-07-09 23:06:49,266][26022] Updated weights on worker 0-0, policy_version 460411 (0.00087) [2022-07-09 23:06:51,033][26022] Updated weights on worker 0-0, policy_version 460421 (0.00089) [2022-07-09 23:06:51,498][25689] Fps is (10 sec: 5878.0, 60 sec: 5699.7, 300 sec: 5674.7). Total num frames: 471474176. Throughput: 0: 5917.7. Samples: 471478464. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:51,499][25689] Avg episode reward: [(0, '-47.042')] [2022-07-09 23:06:53,088][26022] Updated weights on worker 0-0, policy_version 460431 (0.00094) [2022-07-09 23:06:54,639][26022] Updated weights on worker 0-0, policy_version 460441 (0.00085) [2022-07-09 23:06:56,501][26022] Updated weights on worker 0-0, policy_version 460451 (0.00086) [2022-07-09 23:06:56,533][25689] Fps is (10 sec: 5809.0, 60 sec: 5663.4, 300 sec: 5675.5). Total num frames: 471501824. Throughput: 0: 5078.6. Samples: 471495652. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:06:56,534][25689] Avg episode reward: [(0, '-46.896')] [2022-07-09 23:06:58,166][26022] Updated weights on worker 0-0, policy_version 460461 (0.00092) [2022-07-09 23:07:00,127][26022] Updated weights on worker 0-0, policy_version 460471 (0.00093) [2022-07-09 23:07:01,664][25689] Fps is (10 sec: 5540.2, 60 sec: 5638.8, 300 sec: 5676.6). Total num frames: 471530496. Throughput: 0: 5923.6. Samples: 471529942. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:01,665][25689] Avg episode reward: [(0, '-46.954')] [2022-07-09 23:07:01,881][26022] Updated weights on worker 0-0, policy_version 460481 (0.00093) [2022-07-09 23:07:04,046][26022] Updated weights on worker 0-0, policy_version 460491 (0.00091) [2022-07-09 23:07:05,712][26022] Updated weights on worker 0-0, policy_version 460501 (0.00093) [2022-07-09 23:07:06,700][25689] Fps is (10 sec: 5439.1, 60 sec: 5669.7, 300 sec: 5669.7). Total num frames: 471557120. Throughput: 0: 5822.8. Samples: 471562090. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:06,700][25689] Avg episode reward: [(0, '-46.640')] [2022-07-09 23:07:07,602][26022] Updated weights on worker 0-0, policy_version 460511 (0.00084) [2022-07-09 23:07:09,359][26022] Updated weights on worker 0-0, policy_version 460521 (0.00086) [2022-07-09 23:07:11,029][26022] Updated weights on worker 0-0, policy_version 460531 (0.00093) [2022-07-09 23:07:11,713][25689] Fps is (10 sec: 5503.0, 60 sec: 5657.9, 300 sec: 5669.5). Total num frames: 471585792. Throughput: 0: 4977.8. Samples: 471579086. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:11,713][25689] Avg episode reward: [(0, '-46.768')] [2022-07-09 23:07:13,006][26022] Updated weights on worker 0-0, policy_version 460541 (0.00090) [2022-07-09 23:07:14,697][26022] Updated weights on worker 0-0, policy_version 460551 (0.00079) [2022-07-09 23:07:16,455][26022] Updated weights on worker 0-0, policy_version 460561 (0.00114) [2022-07-09 23:07:16,737][25689] Fps is (10 sec: 5713.3, 60 sec: 5639.9, 300 sec: 5670.8). Total num frames: 471614464. Throughput: 0: 5842.6. Samples: 471613692. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:16,737][25689] Avg episode reward: [(0, '-46.824')] [2022-07-09 23:07:18,333][26022] Updated weights on worker 0-0, policy_version 460571 (0.00092) [2022-07-09 23:07:20,056][26022] Updated weights on worker 0-0, policy_version 460581 (0.00089) [2022-07-09 23:07:21,787][25689] Fps is (10 sec: 5692.2, 60 sec: 5641.8, 300 sec: 5666.6). Total num frames: 471643136. Throughput: 0: 5866.8. Samples: 471647996. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:21,787][25689] Avg episode reward: [(0, '-46.461')] [2022-07-09 23:07:22,009][26022] Updated weights on worker 0-0, policy_version 460591 (0.00085) [2022-07-09 23:07:23,740][26022] Updated weights on worker 0-0, policy_version 460601 (0.00084) [2022-07-09 23:07:25,520][26022] Updated weights on worker 0-0, policy_version 460611 (0.00084) [2022-07-09 23:07:26,806][25689] Fps is (10 sec: 5796.5, 60 sec: 5678.1, 300 sec: 5673.3). Total num frames: 471672832. Throughput: 0: 5127.0. Samples: 471665176. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:26,807][25689] Avg episode reward: [(0, '-46.714')] [2022-07-09 23:07:27,219][26022] Updated weights on worker 0-0, policy_version 460621 (0.00089) [2022-07-09 23:07:29,076][26022] Updated weights on worker 0-0, policy_version 460631 (0.00089) [2022-07-09 23:07:30,997][26022] Updated weights on worker 0-0, policy_version 460641 (0.00108) [2022-07-09 23:07:31,831][25689] Fps is (10 sec: 5811.2, 60 sec: 5681.0, 300 sec: 5670.1). Total num frames: 471701504. Throughput: 0: 5970.8. Samples: 471699208. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:31,831][25689] Avg episode reward: [(0, '-47.041')] [2022-07-09 23:07:32,699][26022] Updated weights on worker 0-0, policy_version 460651 (0.00082) [2022-07-09 23:07:34,554][26022] Updated weights on worker 0-0, policy_version 460661 (0.00090) [2022-07-09 23:07:36,290][26022] Updated weights on worker 0-0, policy_version 460671 (0.00092) [2022-07-09 23:07:36,834][25689] Fps is (10 sec: 5616.3, 60 sec: 5633.2, 300 sec: 5665.3). Total num frames: 471729152. Throughput: 0: 5968.8. Samples: 471733650. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:36,835][25689] Avg episode reward: [(0, '-47.049')] [2022-07-09 23:07:38,106][26022] Updated weights on worker 0-0, policy_version 460681 (0.00094) [2022-07-09 23:07:39,288][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:07:39,304][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000460688_471744512.pth [2022-07-09 23:07:39,304][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000458690_469698560.pth [2022-07-09 23:07:39,829][26022] Updated weights on worker 0-0, policy_version 460691 (0.00084) [2022-07-09 23:07:41,627][26022] Updated weights on worker 0-0, policy_version 460701 (0.00090) [2022-07-09 23:07:41,882][25689] Fps is (10 sec: 5705.0, 60 sec: 5688.1, 300 sec: 5671.4). Total num frames: 471758848. Throughput: 0: 5978.5. Samples: 471768138. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:41,884][25689] Avg episode reward: [(0, '-46.856')] [2022-07-09 23:07:43,594][26022] Updated weights on worker 0-0, policy_version 460711 (0.00092) [2022-07-09 23:07:45,136][26022] Updated weights on worker 0-0, policy_version 460721 (0.00093) [2022-07-09 23:07:46,950][25689] Fps is (10 sec: 5769.9, 60 sec: 5690.3, 300 sec: 5670.4). Total num frames: 471787520. Throughput: 0: 5962.0. Samples: 471785274. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:46,951][25689] Avg episode reward: [(0, '-46.591')] [2022-07-09 23:07:47,121][26022] Updated weights on worker 0-0, policy_version 460731 (0.00061) [2022-07-09 23:07:48,721][26022] Updated weights on worker 0-0, policy_version 460741 (0.00085) [2022-07-09 23:07:50,701][26022] Updated weights on worker 0-0, policy_version 460751 (0.00083) [2022-07-09 23:07:52,015][25689] Fps is (10 sec: 5659.5, 60 sec: 5651.6, 300 sec: 5669.5). Total num frames: 471816192. Throughput: 0: 5976.4. Samples: 471819834. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:52,015][25689] Avg episode reward: [(0, '-46.468')] [2022-07-09 23:07:52,399][26022] Updated weights on worker 0-0, policy_version 460761 (0.00093) [2022-07-09 23:07:54,250][26022] Updated weights on worker 0-0, policy_version 460771 (0.00092) [2022-07-09 23:07:56,061][26022] Updated weights on worker 0-0, policy_version 460781 (0.00093) [2022-07-09 23:07:57,095][25689] Fps is (10 sec: 5652.6, 60 sec: 5664.3, 300 sec: 5669.8). Total num frames: 471844864. Throughput: 0: 5926.2. Samples: 471853718. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:07:57,095][25689] Avg episode reward: [(0, '-46.388')] [2022-07-09 23:07:57,692][26022] Updated weights on worker 0-0, policy_version 460791 (0.00086) [2022-07-09 23:07:59,636][26022] Updated weights on worker 0-0, policy_version 460801 (0.00088) [2022-07-09 23:08:01,568][26022] Updated weights on worker 0-0, policy_version 460811 (0.00094) [2022-07-09 23:08:02,153][25689] Fps is (10 sec: 5453.9, 60 sec: 5637.2, 300 sec: 5669.6). Total num frames: 471871488. Throughput: 0: 5065.5. Samples: 471870822. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:08:02,154][25689] Avg episode reward: [(0, '-46.314')] [2022-07-09 23:08:03,661][26022] Updated weights on worker 0-0, policy_version 460821 (0.00087) [2022-07-09 23:08:05,475][26022] Updated weights on worker 0-0, policy_version 460831 (0.00088) [2022-07-09 23:08:07,191][25689] Fps is (10 sec: 5476.8, 60 sec: 5670.9, 300 sec: 5669.6). Total num frames: 471900160. Throughput: 0: 5791.0. Samples: 471902490. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:08:07,192][25689] Avg episode reward: [(0, '-45.460')] [2022-07-09 23:08:07,198][26022] Updated weights on worker 0-0, policy_version 460841 (0.00084) [2022-07-09 23:08:09,304][26022] Updated weights on worker 0-0, policy_version 460851 (0.00095) [2022-07-09 23:08:10,680][26022] Updated weights on worker 0-0, policy_version 460861 (0.00089) [2022-07-09 23:08:12,226][25689] Fps is (10 sec: 5692.8, 60 sec: 5668.8, 300 sec: 5670.5). Total num frames: 471928832. Throughput: 0: 5796.9. Samples: 471937000. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:08:12,227][25689] Avg episode reward: [(0, '-45.339')] [2022-07-09 23:08:12,743][26022] Updated weights on worker 0-0, policy_version 460871 (0.00085) [2022-07-09 23:08:14,288][26022] Updated weights on worker 0-0, policy_version 460881 (0.00084) [2022-07-09 23:08:16,275][26022] Updated weights on worker 0-0, policy_version 460891 (0.00090) [2022-07-09 23:08:17,255][25689] Fps is (10 sec: 5698.1, 60 sec: 5668.4, 300 sec: 5664.1). Total num frames: 471957504. Throughput: 0: 4988.2. Samples: 471954280. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:08:17,255][25689] Avg episode reward: [(0, '-45.349')] [2022-07-09 23:08:18,106][26022] Updated weights on worker 0-0, policy_version 460901 (0.00087) [2022-07-09 23:08:19,790][26022] Updated weights on worker 0-0, policy_version 460911 (0.00094) [2022-07-09 23:08:21,509][26022] Updated weights on worker 0-0, policy_version 460921 (0.00084) [2022-07-09 23:08:22,376][25689] Fps is (10 sec: 5750.7, 60 sec: 5678.6, 300 sec: 5670.5). Total num frames: 471987200. Throughput: 0: 5820.1. Samples: 471988520. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:08:22,377][25689] Avg episode reward: [(0, '-45.818')] [2022-07-09 23:08:23,535][26022] Updated weights on worker 0-0, policy_version 460931 (0.00092) [2022-07-09 23:08:25,016][26022] Updated weights on worker 0-0, policy_version 460941 (0.00081) [2022-07-09 23:08:27,092][26022] Updated weights on worker 0-0, policy_version 460951 (0.00096) [2022-07-09 23:08:27,397][25689] Fps is (10 sec: 5755.3, 60 sec: 5661.6, 300 sec: 5675.4). Total num frames: 472015872. Throughput: 0: 5959.1. Samples: 472022896. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-09 23:08:27,397][25689] Avg episode reward: [(0, '-45.845')] [2022-07-09 23:08:28,696][26022] Updated weights on worker 0-0, policy_version 460961 (0.00090) [2022-07-09 23:08:30,552][26022] Updated weights on worker 0-0, policy_version 460971 (0.00086) [2022-07-09 23:08:32,401][25689] Fps is (10 sec: 5515.8, 60 sec: 5629.7, 300 sec: 5661.9). Total num frames: 472042496. Throughput: 0: 5092.1. Samples: 472039732. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:08:32,402][25689] Avg episode reward: [(0, '-46.741')] [2022-07-09 23:08:32,538][26022] Updated weights on worker 0-0, policy_version 460981 (0.00091) [2022-07-09 23:08:34,140][26022] Updated weights on worker 0-0, policy_version 460991 (0.00085) [2022-07-09 23:08:35,911][26022] Updated weights on worker 0-0, policy_version 461001 (0.00087) [2022-07-09 23:08:37,446][25689] Fps is (10 sec: 5706.0, 60 sec: 5676.5, 300 sec: 5667.1). Total num frames: 472073216. Throughput: 0: 5921.7. Samples: 472073848. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:08:37,447][25689] Avg episode reward: [(0, '-47.179')] [2022-07-09 23:08:37,898][26022] Updated weights on worker 0-0, policy_version 461011 (0.00089) [2022-07-09 23:08:39,530][26022] Updated weights on worker 0-0, policy_version 461021 (0.00097) [2022-07-09 23:08:41,484][26022] Updated weights on worker 0-0, policy_version 461031 (0.00089) [2022-07-09 23:08:42,490][25689] Fps is (10 sec: 5785.3, 60 sec: 5643.1, 300 sec: 5662.8). Total num frames: 472100864. Throughput: 0: 5940.1. Samples: 472108000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:08:42,491][25689] Avg episode reward: [(0, '-47.516')] [2022-07-09 23:08:43,232][26022] Updated weights on worker 0-0, policy_version 461041 (0.00096) [2022-07-09 23:08:45,034][26022] Updated weights on worker 0-0, policy_version 461051 (0.00086) [2022-07-09 23:08:46,830][26022] Updated weights on worker 0-0, policy_version 461061 (0.00089) [2022-07-09 23:08:47,496][25689] Fps is (10 sec: 5705.9, 60 sec: 5665.8, 300 sec: 5669.8). Total num frames: 472130560. Throughput: 0: 5087.2. Samples: 472125148. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:08:47,497][25689] Avg episode reward: [(0, '-47.081')] [2022-07-09 23:08:48,691][26022] Updated weights on worker 0-0, policy_version 461071 (0.00086) [2022-07-09 23:08:50,262][26022] Updated weights on worker 0-0, policy_version 461081 (0.00091) [2022-07-09 23:08:52,291][26022] Updated weights on worker 0-0, policy_version 461091 (0.00084) [2022-07-09 23:08:52,507][25689] Fps is (10 sec: 5622.7, 60 sec: 5637.0, 300 sec: 5666.7). Total num frames: 472157184. Throughput: 0: 5945.3. Samples: 472159266. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:08:52,507][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 23:08:53,877][26022] Updated weights on worker 0-0, policy_version 461101 (0.00085) [2022-07-09 23:08:55,966][26022] Updated weights on worker 0-0, policy_version 461111 (0.00086) [2022-07-09 23:08:57,528][25689] Fps is (10 sec: 5614.0, 60 sec: 5659.4, 300 sec: 5668.1). Total num frames: 472186880. Throughput: 0: 5951.5. Samples: 472193366. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:08:57,529][25689] Avg episode reward: [(0, '-46.455')] [2022-07-09 23:08:57,659][26022] Updated weights on worker 0-0, policy_version 461121 (0.00094) [2022-07-09 23:08:59,436][26022] Updated weights on worker 0-0, policy_version 461131 (0.00083) [2022-07-09 23:09:01,302][26022] Updated weights on worker 0-0, policy_version 461141 (0.00093) [2022-07-09 23:09:02,653][25689] Fps is (10 sec: 5551.1, 60 sec: 5653.2, 300 sec: 5665.9). Total num frames: 472213504. Throughput: 0: 5076.6. Samples: 472210356. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:02,653][25689] Avg episode reward: [(0, '-46.309')] [2022-07-09 23:09:03,638][26022] Updated weights on worker 0-0, policy_version 461151 (0.00090) [2022-07-09 23:09:05,203][26022] Updated weights on worker 0-0, policy_version 461161 (0.00085) [2022-07-09 23:09:07,394][26022] Updated weights on worker 0-0, policy_version 461171 (0.00095) [2022-07-09 23:09:07,726][25689] Fps is (10 sec: 5322.0, 60 sec: 5633.0, 300 sec: 5664.7). Total num frames: 472241152. Throughput: 0: 5786.9. Samples: 472242216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:07,727][25689] Avg episode reward: [(0, '-46.001')] [2022-07-09 23:09:08,711][26022] Updated weights on worker 0-0, policy_version 461181 (0.00085) [2022-07-09 23:09:10,919][26022] Updated weights on worker 0-0, policy_version 461191 (0.00084) [2022-07-09 23:09:12,366][26022] Updated weights on worker 0-0, policy_version 461201 (0.00087) [2022-07-09 23:09:12,740][25689] Fps is (10 sec: 5786.3, 60 sec: 5668.8, 300 sec: 5661.2). Total num frames: 472271872. Throughput: 0: 5793.6. Samples: 472276488. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:12,740][25689] Avg episode reward: [(0, '-45.820')] [2022-07-09 23:09:14,292][26022] Updated weights on worker 0-0, policy_version 461211 (0.00085) [2022-07-09 23:09:15,902][26022] Updated weights on worker 0-0, policy_version 461221 (0.00086) [2022-07-09 23:09:17,749][25689] Fps is (10 sec: 5823.3, 60 sec: 5653.7, 300 sec: 5665.2). Total num frames: 472299520. Throughput: 0: 4971.0. Samples: 472293884. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:17,750][25689] Avg episode reward: [(0, '-46.056')] [2022-07-09 23:09:17,863][26022] Updated weights on worker 0-0, policy_version 461231 (0.00087) [2022-07-09 23:09:19,762][26022] Updated weights on worker 0-0, policy_version 461241 (0.00089) [2022-07-09 23:09:21,332][26022] Updated weights on worker 0-0, policy_version 461251 (0.00081) [2022-07-09 23:09:22,796][25689] Fps is (10 sec: 5600.3, 60 sec: 5643.7, 300 sec: 5664.8). Total num frames: 472328192. Throughput: 0: 5844.0. Samples: 472328078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:22,797][25689] Avg episode reward: [(0, '-45.705')] [2022-07-09 23:09:23,316][26022] Updated weights on worker 0-0, policy_version 461261 (0.00092) [2022-07-09 23:09:24,995][26022] Updated weights on worker 0-0, policy_version 461271 (0.00101) [2022-07-09 23:09:26,788][26022] Updated weights on worker 0-0, policy_version 461281 (0.00084) [2022-07-09 23:09:27,809][25689] Fps is (10 sec: 5802.0, 60 sec: 5661.3, 300 sec: 5668.0). Total num frames: 472357888. Throughput: 0: 5993.3. Samples: 472362580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:27,810][25689] Avg episode reward: [(0, '-44.674')] [2022-07-09 23:09:28,563][26022] Updated weights on worker 0-0, policy_version 461291 (0.00082) [2022-07-09 23:09:30,166][26022] Updated weights on worker 0-0, policy_version 461301 (0.00090) [2022-07-09 23:09:32,166][26022] Updated weights on worker 0-0, policy_version 461311 (0.00092) [2022-07-09 23:09:32,822][25689] Fps is (10 sec: 5719.8, 60 sec: 5677.5, 300 sec: 5664.6). Total num frames: 472385536. Throughput: 0: 5141.4. Samples: 472379738. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:32,823][25689] Avg episode reward: [(0, '-44.621')] [2022-07-09 23:09:33,922][26022] Updated weights on worker 0-0, policy_version 461321 (0.00091) [2022-07-09 23:09:35,867][26022] Updated weights on worker 0-0, policy_version 461331 (0.00095) [2022-07-09 23:09:37,561][26022] Updated weights on worker 0-0, policy_version 461341 (0.00087) [2022-07-09 23:09:37,939][25689] Fps is (10 sec: 5559.9, 60 sec: 5636.9, 300 sec: 5666.6). Total num frames: 472414208. Throughput: 0: 5931.5. Samples: 472413638. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:37,939][25689] Avg episode reward: [(0, '-44.163')] [2022-07-09 23:09:39,401][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:09:39,430][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000461351_472423424.pth [2022-07-09 23:09:39,431][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000459356_470380544.pth [2022-07-09 23:09:39,437][26022] Updated weights on worker 0-0, policy_version 461351 (0.00095) [2022-07-09 23:09:41,179][26022] Updated weights on worker 0-0, policy_version 461361 (0.00088) [2022-07-09 23:09:43,031][25689] Fps is (10 sec: 5617.1, 60 sec: 5649.4, 300 sec: 5664.9). Total num frames: 472442880. Throughput: 0: 5930.2. Samples: 472448072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:43,031][25689] Avg episode reward: [(0, '-43.673')] [2022-07-09 23:09:43,075][26022] Updated weights on worker 0-0, policy_version 461371 (0.00092) [2022-07-09 23:09:44,620][26022] Updated weights on worker 0-0, policy_version 461381 (0.00091) [2022-07-09 23:09:46,481][26022] Updated weights on worker 0-0, policy_version 461391 (0.00080) [2022-07-09 23:09:48,085][25689] Fps is (10 sec: 5752.9, 60 sec: 5644.9, 300 sec: 5664.2). Total num frames: 472472576. Throughput: 0: 5915.4. Samples: 472482518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:48,085][25689] Avg episode reward: [(0, '-44.650')] [2022-07-09 23:09:48,391][26022] Updated weights on worker 0-0, policy_version 461401 (0.00080) [2022-07-09 23:09:50,123][26022] Updated weights on worker 0-0, policy_version 461411 (0.00087) [2022-07-09 23:09:51,992][26022] Updated weights on worker 0-0, policy_version 461421 (0.00085) [2022-07-09 23:09:53,099][25689] Fps is (10 sec: 5899.2, 60 sec: 5695.3, 300 sec: 5664.2). Total num frames: 472502272. Throughput: 0: 5927.4. Samples: 472499926. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:53,099][25689] Avg episode reward: [(0, '-45.213')] [2022-07-09 23:09:53,768][26022] Updated weights on worker 0-0, policy_version 461431 (0.00092) [2022-07-09 23:09:55,482][26022] Updated weights on worker 0-0, policy_version 461441 (0.00105) [2022-07-09 23:09:57,341][26022] Updated weights on worker 0-0, policy_version 461451 (0.00091) [2022-07-09 23:09:58,100][25689] Fps is (10 sec: 5827.6, 60 sec: 5680.2, 300 sec: 5672.4). Total num frames: 472530944. Throughput: 0: 6001.5. Samples: 472534638. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:09:58,102][25689] Avg episode reward: [(0, '-45.186')] [2022-07-09 23:09:58,982][26022] Updated weights on worker 0-0, policy_version 461461 (0.00085) [2022-07-09 23:10:00,801][26022] Updated weights on worker 0-0, policy_version 461471 (0.00092) [2022-07-09 23:10:02,902][26022] Updated weights on worker 0-0, policy_version 461481 (0.00094) [2022-07-09 23:10:03,142][25689] Fps is (10 sec: 5404.0, 60 sec: 5671.1, 300 sec: 5662.8). Total num frames: 472556544. Throughput: 0: 5904.7. Samples: 472566820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:03,142][25689] Avg episode reward: [(0, '-45.220')] [2022-07-09 23:10:04,826][26022] Updated weights on worker 0-0, policy_version 461491 (0.00086) [2022-07-09 23:10:06,675][26022] Updated weights on worker 0-0, policy_version 461501 (0.00079) [2022-07-09 23:10:08,155][25689] Fps is (10 sec: 5499.6, 60 sec: 5710.6, 300 sec: 5669.8). Total num frames: 472586240. Throughput: 0: 5039.7. Samples: 472583666. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:08,157][25689] Avg episode reward: [(0, '-44.779')] [2022-07-09 23:10:08,283][26022] Updated weights on worker 0-0, policy_version 461511 (0.00087) [2022-07-09 23:10:10,294][26022] Updated weights on worker 0-0, policy_version 461521 (0.00086) [2022-07-09 23:10:11,987][26022] Updated weights on worker 0-0, policy_version 461531 (0.00099) [2022-07-09 23:10:13,181][25689] Fps is (10 sec: 5711.8, 60 sec: 5658.6, 300 sec: 5667.5). Total num frames: 472613888. Throughput: 0: 5874.9. Samples: 472617908. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:13,182][25689] Avg episode reward: [(0, '-44.658')] [2022-07-09 23:10:13,826][26022] Updated weights on worker 0-0, policy_version 461541 (0.00087) [2022-07-09 23:10:15,575][26022] Updated weights on worker 0-0, policy_version 461551 (0.00085) [2022-07-09 23:10:17,336][26022] Updated weights on worker 0-0, policy_version 461561 (0.00083) [2022-07-09 23:10:18,199][25689] Fps is (10 sec: 5709.3, 60 sec: 5691.7, 300 sec: 5664.8). Total num frames: 472643584. Throughput: 0: 5862.8. Samples: 472652472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:18,200][25689] Avg episode reward: [(0, '-44.591')] [2022-07-09 23:10:19,122][26022] Updated weights on worker 0-0, policy_version 461571 (0.00086) [2022-07-09 23:10:20,972][26022] Updated weights on worker 0-0, policy_version 461581 (0.00085) [2022-07-09 23:10:22,659][26022] Updated weights on worker 0-0, policy_version 461591 (0.00103) [2022-07-09 23:10:23,238][25689] Fps is (10 sec: 5702.2, 60 sec: 5675.6, 300 sec: 5664.1). Total num frames: 472671232. Throughput: 0: 5107.6. Samples: 472669462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:23,238][25689] Avg episode reward: [(0, '-44.610')] [2022-07-09 23:10:24,599][26022] Updated weights on worker 0-0, policy_version 461601 (0.00094) [2022-07-09 23:10:26,394][26022] Updated weights on worker 0-0, policy_version 461611 (0.00086) [2022-07-09 23:10:28,063][26022] Updated weights on worker 0-0, policy_version 461621 (0.00085) [2022-07-09 23:10:28,247][25689] Fps is (10 sec: 5809.0, 60 sec: 5692.9, 300 sec: 5674.3). Total num frames: 472701952. Throughput: 0: 5989.6. Samples: 472704008. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:28,247][25689] Avg episode reward: [(0, '-43.978')] [2022-07-09 23:10:29,942][26022] Updated weights on worker 0-0, policy_version 461631 (0.00423) [2022-07-09 23:10:31,533][26022] Updated weights on worker 0-0, policy_version 461641 (0.00085) [2022-07-09 23:10:33,268][25689] Fps is (10 sec: 5717.1, 60 sec: 5675.2, 300 sec: 5667.6). Total num frames: 472728576. Throughput: 0: 6000.7. Samples: 472738442. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:33,269][25689] Avg episode reward: [(0, '-44.687')] [2022-07-09 23:10:33,461][26022] Updated weights on worker 0-0, policy_version 461651 (0.00086) [2022-07-09 23:10:35,441][26022] Updated weights on worker 0-0, policy_version 461661 (0.00088) [2022-07-09 23:10:37,032][26022] Updated weights on worker 0-0, policy_version 461671 (0.00093) [2022-07-09 23:10:38,298][25689] Fps is (10 sec: 5501.6, 60 sec: 5683.3, 300 sec: 5664.7). Total num frames: 472757248. Throughput: 0: 5110.7. Samples: 472755190. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:38,298][25689] Avg episode reward: [(0, '-45.204')] [2022-07-09 23:10:39,042][26022] Updated weights on worker 0-0, policy_version 461681 (0.00064) [2022-07-09 23:10:40,444][26022] Updated weights on worker 0-0, policy_version 461691 (0.00092) [2022-07-09 23:10:42,389][26022] Updated weights on worker 0-0, policy_version 461701 (0.00095) [2022-07-09 23:10:43,348][25689] Fps is (10 sec: 5790.3, 60 sec: 5704.2, 300 sec: 5667.3). Total num frames: 472786944. Throughput: 0: 5983.7. Samples: 472789798. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:43,349][25689] Avg episode reward: [(0, '-45.539')] [2022-07-09 23:10:44,343][26022] Updated weights on worker 0-0, policy_version 461711 (0.00085) [2022-07-09 23:10:45,950][26022] Updated weights on worker 0-0, policy_version 461721 (0.00091) [2022-07-09 23:10:47,964][26022] Updated weights on worker 0-0, policy_version 461731 (0.00083) [2022-07-09 23:10:48,355][25689] Fps is (10 sec: 5803.7, 60 sec: 5691.7, 300 sec: 5670.9). Total num frames: 472815616. Throughput: 0: 5983.7. Samples: 472824328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:48,355][25689] Avg episode reward: [(0, '-46.597')] [2022-07-09 23:10:49,690][26022] Updated weights on worker 0-0, policy_version 461741 (0.00089) [2022-07-09 23:10:51,387][26022] Updated weights on worker 0-0, policy_version 461751 (0.00050) [2022-07-09 23:10:53,357][26022] Updated weights on worker 0-0, policy_version 461761 (0.00093) [2022-07-09 23:10:53,382][25689] Fps is (10 sec: 5613.2, 60 sec: 5656.5, 300 sec: 5663.6). Total num frames: 472843264. Throughput: 0: 5122.6. Samples: 472841476. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:53,382][25689] Avg episode reward: [(0, '-47.021')] [2022-07-09 23:10:54,783][26022] Updated weights on worker 0-0, policy_version 461771 (0.00093) [2022-07-09 23:10:56,847][26022] Updated weights on worker 0-0, policy_version 461781 (0.00088) [2022-07-09 23:10:58,396][25689] Fps is (10 sec: 5609.1, 60 sec: 5655.4, 300 sec: 5660.9). Total num frames: 472871936. Throughput: 0: 5991.8. Samples: 472875612. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:10:58,396][25689] Avg episode reward: [(0, '-47.221')] [2022-07-09 23:10:58,729][26022] Updated weights on worker 0-0, policy_version 461791 (0.00083) [2022-07-09 23:11:00,154][26022] Updated weights on worker 0-0, policy_version 461801 (0.00087) [2022-07-09 23:11:02,783][26022] Updated weights on worker 0-0, policy_version 461811 (0.00093) [2022-07-09 23:11:03,453][25689] Fps is (10 sec: 5693.5, 60 sec: 5704.8, 300 sec: 5673.6). Total num frames: 472900608. Throughput: 0: 5877.5. Samples: 472907966. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:11:03,454][25689] Avg episode reward: [(0, '-46.588')] [2022-07-09 23:11:04,216][26022] Updated weights on worker 0-0, policy_version 461821 (0.00085) [2022-07-09 23:11:06,233][26022] Updated weights on worker 0-0, policy_version 461831 (0.00184) [2022-07-09 23:11:08,055][26022] Updated weights on worker 0-0, policy_version 461841 (0.00090) [2022-07-09 23:11:08,461][25689] Fps is (10 sec: 5595.5, 60 sec: 5671.4, 300 sec: 5667.9). Total num frames: 472928256. Throughput: 0: 5005.3. Samples: 472924964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:11:08,463][25689] Avg episode reward: [(0, '-46.875')] [2022-07-09 23:11:09,760][26022] Updated weights on worker 0-0, policy_version 461851 (0.00088) [2022-07-09 23:11:11,615][26022] Updated weights on worker 0-0, policy_version 461861 (0.00095) [2022-07-09 23:11:13,346][26022] Updated weights on worker 0-0, policy_version 461871 (0.00089) [2022-07-09 23:11:13,515][25689] Fps is (10 sec: 5495.7, 60 sec: 5668.7, 300 sec: 5660.2). Total num frames: 472955904. Throughput: 0: 5842.3. Samples: 472959102. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:11:13,516][25689] Avg episode reward: [(0, '-46.236')] [2022-07-09 23:11:15,138][26022] Updated weights on worker 0-0, policy_version 461881 (0.00094) [2022-07-09 23:11:16,927][26022] Updated weights on worker 0-0, policy_version 461891 (0.00088) [2022-07-09 23:11:18,526][25689] Fps is (10 sec: 5697.4, 60 sec: 5669.4, 300 sec: 5664.8). Total num frames: 472985600. Throughput: 0: 5852.3. Samples: 472993420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-09 23:11:18,535][25689] Avg episode reward: [(0, '-45.463')] [2022-07-09 23:11:18,595][26022] Updated weights on worker 0-0, policy_version 461901 (0.00094) [2022-07-09 23:11:20,540][26022] Updated weights on worker 0-0, policy_version 461911 (0.00086) [2022-07-09 23:11:22,323][26022] Updated weights on worker 0-0, policy_version 461921 (0.00086) [2022-07-09 23:11:23,694][25689] Fps is (10 sec: 5633.9, 60 sec: 5657.3, 300 sec: 5662.5). Total num frames: 473013248. Throughput: 0: 5055.5. Samples: 473010290. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:23,694][25689] Avg episode reward: [(0, '-44.730')] [2022-07-09 23:11:24,058][26022] Updated weights on worker 0-0, policy_version 461931 (0.00081) [2022-07-09 23:11:26,140][26022] Updated weights on worker 0-0, policy_version 461941 (0.00087) [2022-07-09 23:11:27,650][26022] Updated weights on worker 0-0, policy_version 461951 (0.00091) [2022-07-09 23:11:28,756][25689] Fps is (10 sec: 5505.3, 60 sec: 5618.5, 300 sec: 5662.3). Total num frames: 473041920. Throughput: 0: 5890.0. Samples: 473044502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:28,756][25689] Avg episode reward: [(0, '-45.387')] [2022-07-09 23:11:29,563][26022] Updated weights on worker 0-0, policy_version 461961 (0.00093) [2022-07-09 23:11:31,407][26022] Updated weights on worker 0-0, policy_version 461971 (0.00096) [2022-07-09 23:11:33,043][26022] Updated weights on worker 0-0, policy_version 461981 (0.00078) [2022-07-09 23:11:33,849][25689] Fps is (10 sec: 5848.2, 60 sec: 5679.4, 300 sec: 5661.2). Total num frames: 473072640. Throughput: 0: 5867.3. Samples: 473078408. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:33,850][25689] Avg episode reward: [(0, '-45.861')] [2022-07-09 23:11:35,113][26022] Updated weights on worker 0-0, policy_version 461991 (0.00087) [2022-07-09 23:11:36,671][26022] Updated weights on worker 0-0, policy_version 462001 (0.00087) [2022-07-09 23:11:38,609][26022] Updated weights on worker 0-0, policy_version 462011 (0.00088) [2022-07-09 23:11:38,910][25689] Fps is (10 sec: 5748.4, 60 sec: 5659.6, 300 sec: 5665.3). Total num frames: 473100288. Throughput: 0: 5014.0. Samples: 473095624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:38,910][25689] Avg episode reward: [(0, '-45.336')] [2022-07-09 23:11:39,611][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:11:39,622][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000462017_473105408.pth [2022-07-09 23:11:39,622][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000460022_471062528.pth [2022-07-09 23:11:40,405][26022] Updated weights on worker 0-0, policy_version 462021 (0.00087) [2022-07-09 23:11:42,068][26022] Updated weights on worker 0-0, policy_version 462031 (0.00087) [2022-07-09 23:11:43,956][26022] Updated weights on worker 0-0, policy_version 462041 (0.00117) [2022-07-09 23:11:43,979][25689] Fps is (10 sec: 5661.1, 60 sec: 5657.9, 300 sec: 5669.1). Total num frames: 473129984. Throughput: 0: 5889.8. Samples: 473129766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:43,979][25689] Avg episode reward: [(0, '-45.915')] [2022-07-09 23:11:45,833][26022] Updated weights on worker 0-0, policy_version 462051 (0.00088) [2022-07-09 23:11:47,482][26022] Updated weights on worker 0-0, policy_version 462061 (0.00084) [2022-07-09 23:11:48,985][25689] Fps is (10 sec: 5691.2, 60 sec: 5641.0, 300 sec: 5658.9). Total num frames: 473157632. Throughput: 0: 5911.1. Samples: 473164082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:48,987][25689] Avg episode reward: [(0, '-46.324')] [2022-07-09 23:11:49,485][26022] Updated weights on worker 0-0, policy_version 462071 (0.00083) [2022-07-09 23:11:50,900][26022] Updated weights on worker 0-0, policy_version 462081 (0.00091) [2022-07-09 23:11:53,011][26022] Updated weights on worker 0-0, policy_version 462091 (0.00088) [2022-07-09 23:11:54,001][25689] Fps is (10 sec: 5823.5, 60 sec: 5692.7, 300 sec: 5669.6). Total num frames: 473188352. Throughput: 0: 5108.8. Samples: 473181362. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:54,002][25689] Avg episode reward: [(0, '-46.953')] [2022-07-09 23:11:54,600][26022] Updated weights on worker 0-0, policy_version 462101 (0.00083) [2022-07-09 23:11:56,461][26022] Updated weights on worker 0-0, policy_version 462111 (0.00080) [2022-07-09 23:11:58,259][26022] Updated weights on worker 0-0, policy_version 462121 (0.00088) [2022-07-09 23:11:59,043][25689] Fps is (10 sec: 5803.4, 60 sec: 5673.2, 300 sec: 5667.9). Total num frames: 473216000. Throughput: 0: 5977.8. Samples: 473215978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:11:59,043][25689] Avg episode reward: [(0, '-46.323')] [2022-07-09 23:11:59,920][26022] Updated weights on worker 0-0, policy_version 462131 (0.00094) [2022-07-09 23:12:01,865][26022] Updated weights on worker 0-0, policy_version 462141 (0.00085) [2022-07-09 23:12:04,024][26022] Updated weights on worker 0-0, policy_version 462151 (0.00091) [2022-07-09 23:12:04,100][25689] Fps is (10 sec: 5374.3, 60 sec: 5639.5, 300 sec: 5667.5). Total num frames: 473242624. Throughput: 0: 5871.1. Samples: 473247900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:04,102][25689] Avg episode reward: [(0, '-45.999')] [2022-07-09 23:12:05,734][26022] Updated weights on worker 0-0, policy_version 462161 (0.00089) [2022-07-09 23:12:07,711][26022] Updated weights on worker 0-0, policy_version 462171 (0.00084) [2022-07-09 23:12:09,107][25689] Fps is (10 sec: 5392.8, 60 sec: 5639.6, 300 sec: 5664.1). Total num frames: 473270272. Throughput: 0: 5008.5. Samples: 473264860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:09,107][25689] Avg episode reward: [(0, '-44.414')] [2022-07-09 23:12:09,455][26022] Updated weights on worker 0-0, policy_version 462181 (0.00084) [2022-07-09 23:12:11,251][26022] Updated weights on worker 0-0, policy_version 462191 (0.00084) [2022-07-09 23:12:13,082][26022] Updated weights on worker 0-0, policy_version 462201 (0.00085) [2022-07-09 23:12:14,140][25689] Fps is (10 sec: 5609.3, 60 sec: 5658.4, 300 sec: 5664.0). Total num frames: 473298944. Throughput: 0: 5838.6. Samples: 473298946. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:14,141][25689] Avg episode reward: [(0, '-44.569')] [2022-07-09 23:12:14,655][26022] Updated weights on worker 0-0, policy_version 462211 (0.00084) [2022-07-09 23:12:16,751][26022] Updated weights on worker 0-0, policy_version 462221 (0.00091) [2022-07-09 23:12:18,421][26022] Updated weights on worker 0-0, policy_version 462231 (0.00115) [2022-07-09 23:12:19,152][25689] Fps is (10 sec: 5810.2, 60 sec: 5658.2, 300 sec: 5668.1). Total num frames: 473328640. Throughput: 0: 5829.6. Samples: 473333208. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:19,153][25689] Avg episode reward: [(0, '-44.509')] [2022-07-09 23:12:20,347][26022] Updated weights on worker 0-0, policy_version 462241 (0.00086) [2022-07-09 23:12:21,995][26022] Updated weights on worker 0-0, policy_version 462251 (0.00087) [2022-07-09 23:12:23,938][26022] Updated weights on worker 0-0, policy_version 462261 (0.00089) [2022-07-09 23:12:24,247][25689] Fps is (10 sec: 5775.2, 60 sec: 5682.0, 300 sec: 5663.2). Total num frames: 473357312. Throughput: 0: 5076.8. Samples: 473350182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:24,249][25689] Avg episode reward: [(0, '-43.997')] [2022-07-09 23:12:25,517][26022] Updated weights on worker 0-0, policy_version 462271 (0.00090) [2022-07-09 23:12:27,588][26022] Updated weights on worker 0-0, policy_version 462281 (0.00098) [2022-07-09 23:12:28,951][26022] Updated weights on worker 0-0, policy_version 462291 (0.00091) [2022-07-09 23:12:29,258][25689] Fps is (10 sec: 5775.6, 60 sec: 5703.7, 300 sec: 5666.9). Total num frames: 473387008. Throughput: 0: 5946.5. Samples: 473384692. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:29,259][25689] Avg episode reward: [(0, '-43.471')] [2022-07-09 23:12:31,099][26022] Updated weights on worker 0-0, policy_version 462301 (0.00085) [2022-07-09 23:12:32,893][26022] Updated weights on worker 0-0, policy_version 462311 (0.00095) [2022-07-09 23:12:34,298][25689] Fps is (10 sec: 5705.2, 60 sec: 5657.9, 300 sec: 5666.2). Total num frames: 473414656. Throughput: 0: 5939.1. Samples: 473418664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:34,300][25689] Avg episode reward: [(0, '-43.592')] [2022-07-09 23:12:34,596][26022] Updated weights on worker 0-0, policy_version 462321 (0.00088) [2022-07-09 23:12:36,487][26022] Updated weights on worker 0-0, policy_version 462331 (0.00088) [2022-07-09 23:12:38,195][26022] Updated weights on worker 0-0, policy_version 462341 (0.00085) [2022-07-09 23:12:39,368][25689] Fps is (10 sec: 5469.6, 60 sec: 5657.1, 300 sec: 5658.9). Total num frames: 473442304. Throughput: 0: 5074.6. Samples: 473435794. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:39,370][25689] Avg episode reward: [(0, '-44.780')] [2022-07-09 23:12:40,001][26022] Updated weights on worker 0-0, policy_version 462351 (0.00085) [2022-07-09 23:12:41,933][26022] Updated weights on worker 0-0, policy_version 462361 (0.00090) [2022-07-09 23:12:43,649][26022] Updated weights on worker 0-0, policy_version 462371 (0.00330) [2022-07-09 23:12:44,499][25689] Fps is (10 sec: 5621.7, 60 sec: 5651.3, 300 sec: 5661.2). Total num frames: 473472000. Throughput: 0: 5916.3. Samples: 473469996. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:44,500][25689] Avg episode reward: [(0, '-44.152')] [2022-07-09 23:12:45,366][26022] Updated weights on worker 0-0, policy_version 462381 (0.00080) [2022-07-09 23:12:47,235][26022] Updated weights on worker 0-0, policy_version 462391 (0.00082) [2022-07-09 23:12:49,043][26022] Updated weights on worker 0-0, policy_version 462401 (0.00095) [2022-07-09 23:12:49,534][25689] Fps is (10 sec: 5842.3, 60 sec: 5682.5, 300 sec: 5665.2). Total num frames: 473501696. Throughput: 0: 5896.9. Samples: 473504254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:49,535][25689] Avg episode reward: [(0, '-44.129')] [2022-07-09 23:12:50,855][26022] Updated weights on worker 0-0, policy_version 462411 (0.00102) [2022-07-09 23:12:52,603][26022] Updated weights on worker 0-0, policy_version 462421 (0.00086) [2022-07-09 23:12:54,540][25689] Fps is (10 sec: 5609.0, 60 sec: 5615.8, 300 sec: 5659.7). Total num frames: 473528320. Throughput: 0: 5935.6. Samples: 473538810. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:54,541][25689] Avg episode reward: [(0, '-44.488')] [2022-07-09 23:12:54,544][26022] Updated weights on worker 0-0, policy_version 462431 (0.00090) [2022-07-09 23:12:56,122][26022] Updated weights on worker 0-0, policy_version 462441 (0.00060) [2022-07-09 23:12:58,203][26022] Updated weights on worker 0-0, policy_version 462451 (0.00087) [2022-07-09 23:12:59,625][25689] Fps is (10 sec: 5682.6, 60 sec: 5662.4, 300 sec: 5673.0). Total num frames: 473559040. Throughput: 0: 5936.9. Samples: 473556058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:12:59,626][25689] Avg episode reward: [(0, '-44.395')] [2022-07-09 23:12:59,742][26022] Updated weights on worker 0-0, policy_version 462461 (0.00090) [2022-07-09 23:13:01,520][26022] Updated weights on worker 0-0, policy_version 462471 (0.00088) [2022-07-09 23:13:03,851][26022] Updated weights on worker 0-0, policy_version 462481 (0.00090) [2022-07-09 23:13:04,683][25689] Fps is (10 sec: 5451.5, 60 sec: 5628.5, 300 sec: 5658.8). Total num frames: 473583616. Throughput: 0: 5858.0. Samples: 473588236. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:04,683][25689] Avg episode reward: [(0, '-44.637')] [2022-07-09 23:13:05,563][26022] Updated weights on worker 0-0, policy_version 462491 (0.00094) [2022-07-09 23:13:07,463][26022] Updated weights on worker 0-0, policy_version 462501 (0.00085) [2022-07-09 23:13:08,872][26022] Updated weights on worker 0-0, policy_version 462511 (0.00092) [2022-07-09 23:13:09,729][25689] Fps is (10 sec: 5472.5, 60 sec: 5675.5, 300 sec: 5665.5). Total num frames: 473614336. Throughput: 0: 5847.8. Samples: 473622352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:09,730][25689] Avg episode reward: [(0, '-44.767')] [2022-07-09 23:13:11,130][26022] Updated weights on worker 0-0, policy_version 462521 (0.00100) [2022-07-09 23:13:12,678][26022] Updated weights on worker 0-0, policy_version 462531 (0.00092) [2022-07-09 23:13:14,538][26022] Updated weights on worker 0-0, policy_version 462541 (0.00087) [2022-07-09 23:13:14,775][25689] Fps is (10 sec: 5884.9, 60 sec: 5674.4, 300 sec: 5665.2). Total num frames: 473643008. Throughput: 0: 4957.7. Samples: 473639132. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:14,776][25689] Avg episode reward: [(0, '-45.381')] [2022-07-09 23:13:16,773][26022] Updated weights on worker 0-0, policy_version 462551 (0.00087) [2022-07-09 23:13:18,129][26022] Updated weights on worker 0-0, policy_version 462561 (0.00086) [2022-07-09 23:13:19,874][25689] Fps is (10 sec: 5551.9, 60 sec: 5632.6, 300 sec: 5658.7). Total num frames: 473670656. Throughput: 0: 5787.1. Samples: 473673240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:19,874][25689] Avg episode reward: [(0, '-45.161')] [2022-07-09 23:13:20,105][26022] Updated weights on worker 0-0, policy_version 462571 (0.00081) [2022-07-09 23:13:21,683][26022] Updated weights on worker 0-0, policy_version 462581 (0.00082) [2022-07-09 23:13:23,521][26022] Updated weights on worker 0-0, policy_version 462591 (0.00088) [2022-07-09 23:13:24,936][25689] Fps is (10 sec: 5542.9, 60 sec: 5635.6, 300 sec: 5657.9). Total num frames: 473699328. Throughput: 0: 5895.8. Samples: 473707644. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:24,936][25689] Avg episode reward: [(0, '-45.141')] [2022-07-09 23:13:25,491][26022] Updated weights on worker 0-0, policy_version 462601 (0.00081) [2022-07-09 23:13:26,987][26022] Updated weights on worker 0-0, policy_version 462611 (0.00088) [2022-07-09 23:13:29,047][26022] Updated weights on worker 0-0, policy_version 462621 (0.00081) [2022-07-09 23:13:29,986][25689] Fps is (10 sec: 5771.7, 60 sec: 5631.9, 300 sec: 5667.4). Total num frames: 473729024. Throughput: 0: 5050.7. Samples: 473724664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:29,987][25689] Avg episode reward: [(0, '-45.486')] [2022-07-09 23:13:30,699][26022] Updated weights on worker 0-0, policy_version 462631 (0.00088) [2022-07-09 23:13:32,591][26022] Updated weights on worker 0-0, policy_version 462641 (0.00089) [2022-07-09 23:13:34,485][26022] Updated weights on worker 0-0, policy_version 462651 (0.00092) [2022-07-09 23:13:35,000][25689] Fps is (10 sec: 5698.0, 60 sec: 5634.4, 300 sec: 5657.7). Total num frames: 473756672. Throughput: 0: 5928.8. Samples: 473759040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:35,000][25689] Avg episode reward: [(0, '-45.610')] [2022-07-09 23:13:36,134][26022] Updated weights on worker 0-0, policy_version 462661 (0.00086) [2022-07-09 23:13:37,921][26022] Updated weights on worker 0-0, policy_version 462671 (0.00086) [2022-07-09 23:13:39,847][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:13:39,852][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000462681_473785344.pth [2022-07-09 23:13:39,857][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000460688_471744512.pth [2022-07-09 23:13:39,871][26022] Updated weights on worker 0-0, policy_version 462681 (0.00091) [2022-07-09 23:13:40,032][25689] Fps is (10 sec: 5708.3, 60 sec: 5671.6, 300 sec: 5664.8). Total num frames: 473786368. Throughput: 0: 5963.4. Samples: 473793454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:40,033][25689] Avg episode reward: [(0, '-44.387')] [2022-07-09 23:13:41,666][26022] Updated weights on worker 0-0, policy_version 462691 (0.00085) [2022-07-09 23:13:43,511][26022] Updated weights on worker 0-0, policy_version 462701 (0.00087) [2022-07-09 23:13:45,137][25689] Fps is (10 sec: 5758.0, 60 sec: 5657.2, 300 sec: 5659.4). Total num frames: 473815040. Throughput: 0: 5076.7. Samples: 473810198. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:45,137][25689] Avg episode reward: [(0, '-44.444')] [2022-07-09 23:13:45,336][26022] Updated weights on worker 0-0, policy_version 462711 (0.00097) [2022-07-09 23:13:47,002][26022] Updated weights on worker 0-0, policy_version 462721 (0.00086) [2022-07-09 23:13:48,825][26022] Updated weights on worker 0-0, policy_version 462731 (0.00084) [2022-07-09 23:13:50,159][25689] Fps is (10 sec: 5561.4, 60 sec: 5624.6, 300 sec: 5662.7). Total num frames: 473842688. Throughput: 0: 5940.6. Samples: 473844502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:50,159][25689] Avg episode reward: [(0, '-44.382')] [2022-07-09 23:13:50,706][26022] Updated weights on worker 0-0, policy_version 462741 (0.00082) [2022-07-09 23:13:52,433][26022] Updated weights on worker 0-0, policy_version 462751 (0.00084) [2022-07-09 23:13:54,536][26022] Updated weights on worker 0-0, policy_version 462761 (0.00095) [2022-07-09 23:13:55,251][25689] Fps is (10 sec: 5669.7, 60 sec: 5667.2, 300 sec: 5661.3). Total num frames: 473872384. Throughput: 0: 5876.6. Samples: 473878046. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:13:55,251][25689] Avg episode reward: [(0, '-44.087')] [2022-07-09 23:13:56,025][26022] Updated weights on worker 0-0, policy_version 462771 (0.00093) [2022-07-09 23:13:58,026][26022] Updated weights on worker 0-0, policy_version 462781 (0.00084) [2022-07-09 23:13:59,737][26022] Updated weights on worker 0-0, policy_version 462791 (0.00089) [2022-07-09 23:14:00,304][25689] Fps is (10 sec: 5652.5, 60 sec: 5619.6, 300 sec: 5666.1). Total num frames: 473900032. Throughput: 0: 5012.5. Samples: 473895066. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:14:00,304][25689] Avg episode reward: [(0, '-43.398')] [2022-07-09 23:14:01,459][26022] Updated weights on worker 0-0, policy_version 462801 (0.00088) [2022-07-09 23:14:03,835][26022] Updated weights on worker 0-0, policy_version 462811 (0.00095) [2022-07-09 23:14:05,414][25689] Fps is (10 sec: 5440.7, 60 sec: 5665.4, 300 sec: 5665.4). Total num frames: 473927680. Throughput: 0: 5769.2. Samples: 473927184. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-09 23:14:05,417][25689] Avg episode reward: [(0, '-43.044')] [2022-07-09 23:14:05,431][26022] Updated weights on worker 0-0, policy_version 462821 (0.00080) [2022-07-09 23:14:07,456][26022] Updated weights on worker 0-0, policy_version 462831 (0.00082) [2022-07-09 23:14:09,063][26022] Updated weights on worker 0-0, policy_version 462841 (0.00086) [2022-07-09 23:14:10,455][25689] Fps is (10 sec: 5548.1, 60 sec: 5632.1, 300 sec: 5658.0). Total num frames: 473956352. Throughput: 0: 5756.2. Samples: 473961330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:10,456][25689] Avg episode reward: [(0, '-43.981')] [2022-07-09 23:14:10,913][26022] Updated weights on worker 0-0, policy_version 462851 (0.00087) [2022-07-09 23:14:12,836][26022] Updated weights on worker 0-0, policy_version 462861 (0.00086) [2022-07-09 23:14:14,372][26022] Updated weights on worker 0-0, policy_version 462871 (0.00084) [2022-07-09 23:14:15,494][25689] Fps is (10 sec: 5688.8, 60 sec: 5632.7, 300 sec: 5660.9). Total num frames: 473985024. Throughput: 0: 4952.1. Samples: 473978294. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:15,495][25689] Avg episode reward: [(0, '-44.397')] [2022-07-09 23:14:16,291][26022] Updated weights on worker 0-0, policy_version 462881 (0.00090) [2022-07-09 23:14:18,240][26022] Updated weights on worker 0-0, policy_version 462891 (0.00089) [2022-07-09 23:14:19,879][26022] Updated weights on worker 0-0, policy_version 462901 (0.00095) [2022-07-09 23:14:20,499][25689] Fps is (10 sec: 5811.1, 60 sec: 5675.2, 300 sec: 5665.1). Total num frames: 474014720. Throughput: 0: 5828.4. Samples: 474012774. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:20,500][25689] Avg episode reward: [(0, '-44.873')] [2022-07-09 23:14:21,945][26022] Updated weights on worker 0-0, policy_version 462911 (0.00078) [2022-07-09 23:14:23,371][26022] Updated weights on worker 0-0, policy_version 462921 (0.00091) [2022-07-09 23:14:25,341][26022] Updated weights on worker 0-0, policy_version 462931 (0.00087) [2022-07-09 23:14:25,589][25689] Fps is (10 sec: 5782.2, 60 sec: 5672.7, 300 sec: 5660.2). Total num frames: 474043392. Throughput: 0: 5951.9. Samples: 474047262. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:25,589][25689] Avg episode reward: [(0, '-45.359')] [2022-07-09 23:14:26,917][26022] Updated weights on worker 0-0, policy_version 462941 (0.00090) [2022-07-09 23:14:28,730][26022] Updated weights on worker 0-0, policy_version 462951 (0.00088) [2022-07-09 23:14:30,649][25689] Fps is (10 sec: 5549.2, 60 sec: 5638.0, 300 sec: 5659.3). Total num frames: 474071040. Throughput: 0: 5102.1. Samples: 474064360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:30,655][25689] Avg episode reward: [(0, '-45.893')] [2022-07-09 23:14:30,675][26022] Updated weights on worker 0-0, policy_version 462961 (0.00086) [2022-07-09 23:14:32,466][26022] Updated weights on worker 0-0, policy_version 462971 (0.00513) [2022-07-09 23:14:34,060][26022] Updated weights on worker 0-0, policy_version 462981 (0.00084) [2022-07-09 23:14:35,695][25689] Fps is (10 sec: 5573.1, 60 sec: 5651.9, 300 sec: 5660.7). Total num frames: 474099712. Throughput: 0: 5974.6. Samples: 474098982. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:35,696][25689] Avg episode reward: [(0, '-46.490')] [2022-07-09 23:14:36,044][26022] Updated weights on worker 0-0, policy_version 462991 (0.00093) [2022-07-09 23:14:37,671][26022] Updated weights on worker 0-0, policy_version 463001 (0.00089) [2022-07-09 23:14:39,545][26022] Updated weights on worker 0-0, policy_version 463011 (0.00084) [2022-07-09 23:14:40,746][25689] Fps is (10 sec: 5881.6, 60 sec: 5666.9, 300 sec: 5668.3). Total num frames: 474130432. Throughput: 0: 5945.3. Samples: 474133150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:40,748][25689] Avg episode reward: [(0, '-45.489')] [2022-07-09 23:14:41,429][26022] Updated weights on worker 0-0, policy_version 463021 (0.00084) [2022-07-09 23:14:43,087][26022] Updated weights on worker 0-0, policy_version 463031 (0.00084) [2022-07-09 23:14:45,042][26022] Updated weights on worker 0-0, policy_version 463041 (0.00086) [2022-07-09 23:14:45,889][25689] Fps is (10 sec: 5826.1, 60 sec: 5663.4, 300 sec: 5663.2). Total num frames: 474159104. Throughput: 0: 5916.5. Samples: 474167366. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:45,890][25689] Avg episode reward: [(0, '-44.920')] [2022-07-09 23:14:46,675][26022] Updated weights on worker 0-0, policy_version 463051 (0.00094) [2022-07-09 23:14:48,694][26022] Updated weights on worker 0-0, policy_version 463061 (0.00091) [2022-07-09 23:14:50,201][26022] Updated weights on worker 0-0, policy_version 463071 (0.00862) [2022-07-09 23:14:50,984][25689] Fps is (10 sec: 5601.2, 60 sec: 5673.4, 300 sec: 5658.2). Total num frames: 474187776. Throughput: 0: 5910.4. Samples: 474184552. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:50,986][25689] Avg episode reward: [(0, '-45.007')] [2022-07-09 23:14:52,303][26022] Updated weights on worker 0-0, policy_version 463081 (0.00089) [2022-07-09 23:14:53,792][26022] Updated weights on worker 0-0, policy_version 463091 (0.00090) [2022-07-09 23:14:55,759][26022] Updated weights on worker 0-0, policy_version 463101 (0.00086) [2022-07-09 23:14:56,020][25689] Fps is (10 sec: 5660.1, 60 sec: 5661.8, 300 sec: 5657.6). Total num frames: 474216448. Throughput: 0: 5906.3. Samples: 474219030. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:14:56,022][25689] Avg episode reward: [(0, '-44.996')] [2022-07-09 23:14:57,519][26022] Updated weights on worker 0-0, policy_version 463111 (0.00090) [2022-07-09 23:14:59,302][26022] Updated weights on worker 0-0, policy_version 463121 (0.00090) [2022-07-09 23:15:00,942][26022] Updated weights on worker 0-0, policy_version 463131 (0.00085) [2022-07-09 23:15:01,083][25689] Fps is (10 sec: 5779.6, 60 sec: 5694.6, 300 sec: 5670.9). Total num frames: 474246144. Throughput: 0: 5923.4. Samples: 474253612. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:01,084][25689] Avg episode reward: [(0, '-44.366')] [2022-07-09 23:15:03,322][26022] Updated weights on worker 0-0, policy_version 463141 (0.00086) [2022-07-09 23:15:04,939][26022] Updated weights on worker 0-0, policy_version 463151 (0.00086) [2022-07-09 23:15:06,182][25689] Fps is (10 sec: 5542.4, 60 sec: 5678.8, 300 sec: 5659.0). Total num frames: 474272768. Throughput: 0: 4986.9. Samples: 474268564. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:06,184][25689] Avg episode reward: [(0, '-44.463')] [2022-07-09 23:15:06,795][26022] Updated weights on worker 0-0, policy_version 463161 (0.00092) [2022-07-09 23:15:08,584][26022] Updated weights on worker 0-0, policy_version 463171 (0.00088) [2022-07-09 23:15:10,447][26022] Updated weights on worker 0-0, policy_version 463181 (0.00100) [2022-07-09 23:15:11,193][25689] Fps is (10 sec: 5469.2, 60 sec: 5681.5, 300 sec: 5662.7). Total num frames: 474301440. Throughput: 0: 5856.6. Samples: 474302910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:11,195][25689] Avg episode reward: [(0, '-44.863')] [2022-07-09 23:15:12,167][26022] Updated weights on worker 0-0, policy_version 463191 (0.00079) [2022-07-09 23:15:14,219][26022] Updated weights on worker 0-0, policy_version 463201 (0.00083) [2022-07-09 23:15:15,839][26022] Updated weights on worker 0-0, policy_version 463211 (0.00085) [2022-07-09 23:15:16,221][25689] Fps is (10 sec: 5712.0, 60 sec: 5682.7, 300 sec: 5659.1). Total num frames: 474330112. Throughput: 0: 5831.5. Samples: 474336832. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:16,223][25689] Avg episode reward: [(0, '-44.232')] [2022-07-09 23:15:17,645][26022] Updated weights on worker 0-0, policy_version 463221 (0.00084) [2022-07-09 23:15:19,561][26022] Updated weights on worker 0-0, policy_version 463231 (0.00083) [2022-07-09 23:15:21,164][26022] Updated weights on worker 0-0, policy_version 463241 (0.00085) [2022-07-09 23:15:21,297][25689] Fps is (10 sec: 5675.8, 60 sec: 5659.2, 300 sec: 5661.8). Total num frames: 474358784. Throughput: 0: 4982.6. Samples: 474354328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:21,298][25689] Avg episode reward: [(0, '-43.879')] [2022-07-09 23:15:23,044][26022] Updated weights on worker 0-0, policy_version 463251 (0.00087) [2022-07-09 23:15:24,783][26022] Updated weights on worker 0-0, policy_version 463261 (0.00082) [2022-07-09 23:15:26,337][25689] Fps is (10 sec: 5668.5, 60 sec: 5663.8, 300 sec: 5654.3). Total num frames: 474387456. Throughput: 0: 5950.2. Samples: 474388492. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:26,338][25689] Avg episode reward: [(0, '-45.196')] [2022-07-09 23:15:26,637][26022] Updated weights on worker 0-0, policy_version 463271 (0.00093) [2022-07-09 23:15:28,525][26022] Updated weights on worker 0-0, policy_version 463281 (0.00093) [2022-07-09 23:15:30,133][26022] Updated weights on worker 0-0, policy_version 463291 (0.00082) [2022-07-09 23:15:31,352][25689] Fps is (10 sec: 5703.0, 60 sec: 5684.8, 300 sec: 5661.3). Total num frames: 474416128. Throughput: 0: 5941.6. Samples: 474422682. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:31,352][25689] Avg episode reward: [(0, '-45.254')] [2022-07-09 23:15:31,922][26022] Updated weights on worker 0-0, policy_version 463301 (0.00086) [2022-07-09 23:15:33,738][26022] Updated weights on worker 0-0, policy_version 463311 (0.00085) [2022-07-09 23:15:35,353][26022] Updated weights on worker 0-0, policy_version 463321 (0.00084) [2022-07-09 23:15:36,378][25689] Fps is (10 sec: 5813.3, 60 sec: 5703.6, 300 sec: 5664.8). Total num frames: 474445824. Throughput: 0: 5117.6. Samples: 474439986. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:36,378][25689] Avg episode reward: [(0, '-44.759')] [2022-07-09 23:15:37,225][26022] Updated weights on worker 0-0, policy_version 463331 (0.00088) [2022-07-09 23:15:39,092][26022] Updated weights on worker 0-0, policy_version 463341 (0.00086) [2022-07-09 23:15:40,022][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:15:40,034][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000463347_474467328.pth [2022-07-09 23:15:40,034][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000461351_472423424.pth [2022-07-09 23:15:40,802][26022] Updated weights on worker 0-0, policy_version 463351 (0.00088) [2022-07-09 23:15:41,383][25689] Fps is (10 sec: 5818.3, 60 sec: 5674.2, 300 sec: 5662.3). Total num frames: 474474496. Throughput: 0: 5997.3. Samples: 474474794. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:41,384][25689] Avg episode reward: [(0, '-45.122')] [2022-07-09 23:15:42,737][26022] Updated weights on worker 0-0, policy_version 463361 (0.00088) [2022-07-09 23:15:44,387][26022] Updated weights on worker 0-0, policy_version 463371 (0.00082) [2022-07-09 23:15:46,199][26022] Updated weights on worker 0-0, policy_version 463381 (0.00096) [2022-07-09 23:15:46,475][25689] Fps is (10 sec: 5678.8, 60 sec: 5678.9, 300 sec: 5660.6). Total num frames: 474503168. Throughput: 0: 5977.6. Samples: 474508870. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:46,476][25689] Avg episode reward: [(0, '-45.184')] [2022-07-09 23:15:48,046][26022] Updated weights on worker 0-0, policy_version 463391 (0.00089) [2022-07-09 23:15:49,869][26022] Updated weights on worker 0-0, policy_version 463401 (0.00083) [2022-07-09 23:15:51,497][25689] Fps is (10 sec: 5670.0, 60 sec: 5685.8, 300 sec: 5664.2). Total num frames: 474531840. Throughput: 0: 5130.7. Samples: 474526042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:51,497][25689] Avg episode reward: [(0, '-43.743')] [2022-07-09 23:15:51,570][26022] Updated weights on worker 0-0, policy_version 463411 (0.00101) [2022-07-09 23:15:53,280][26022] Updated weights on worker 0-0, policy_version 463421 (0.00091) [2022-07-09 23:15:55,212][26022] Updated weights on worker 0-0, policy_version 463431 (0.00050) [2022-07-09 23:15:56,517][25689] Fps is (10 sec: 5812.8, 60 sec: 5704.2, 300 sec: 5667.5). Total num frames: 474561536. Throughput: 0: 5990.7. Samples: 474560634. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:15:56,517][25689] Avg episode reward: [(0, '-44.150')] [2022-07-09 23:15:56,995][26022] Updated weights on worker 0-0, policy_version 463441 (0.00088) [2022-07-09 23:15:58,572][26022] Updated weights on worker 0-0, policy_version 463451 (0.00088) [2022-07-09 23:16:00,502][26022] Updated weights on worker 0-0, policy_version 463461 (0.00087) [2022-07-09 23:16:01,577][25689] Fps is (10 sec: 5689.1, 60 sec: 5670.7, 300 sec: 5664.0). Total num frames: 474589184. Throughput: 0: 5962.3. Samples: 474595192. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:01,577][25689] Avg episode reward: [(0, '-44.618')] [2022-07-09 23:16:02,627][26022] Updated weights on worker 0-0, policy_version 463471 (0.00090) [2022-07-09 23:16:04,395][26022] Updated weights on worker 0-0, policy_version 463481 (0.00089) [2022-07-09 23:16:06,512][26022] Updated weights on worker 0-0, policy_version 463491 (0.00052) [2022-07-09 23:16:06,740][25689] Fps is (10 sec: 5409.1, 60 sec: 5681.6, 300 sec: 5661.1). Total num frames: 474616832. Throughput: 0: 4999.4. Samples: 474610178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:06,743][25689] Avg episode reward: [(0, '-44.653')] [2022-07-09 23:16:08,007][26022] Updated weights on worker 0-0, policy_version 463501 (0.00081) [2022-07-09 23:16:09,851][26022] Updated weights on worker 0-0, policy_version 463511 (0.00096) [2022-07-09 23:16:11,547][26022] Updated weights on worker 0-0, policy_version 463521 (0.01311) [2022-07-09 23:16:11,783][25689] Fps is (10 sec: 5518.3, 60 sec: 5678.6, 300 sec: 5664.7). Total num frames: 474645504. Throughput: 0: 5838.2. Samples: 474644474. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:11,784][25689] Avg episode reward: [(0, '-44.092')] [2022-07-09 23:16:13,510][26022] Updated weights on worker 0-0, policy_version 463531 (0.00097) [2022-07-09 23:16:15,205][26022] Updated weights on worker 0-0, policy_version 463541 (0.00080) [2022-07-09 23:16:16,797][25689] Fps is (10 sec: 5600.0, 60 sec: 5663.0, 300 sec: 5657.8). Total num frames: 474673152. Throughput: 0: 5834.6. Samples: 474678960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:16,797][25689] Avg episode reward: [(0, '-45.924')] [2022-07-09 23:16:17,099][26022] Updated weights on worker 0-0, policy_version 463551 (0.00094) [2022-07-09 23:16:18,556][26022] Updated weights on worker 0-0, policy_version 463561 (0.00089) [2022-07-09 23:16:20,730][26022] Updated weights on worker 0-0, policy_version 463571 (0.00083) [2022-07-09 23:16:21,849][25689] Fps is (10 sec: 5899.9, 60 sec: 5715.9, 300 sec: 5673.7). Total num frames: 474704896. Throughput: 0: 4983.4. Samples: 474696218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:21,850][25689] Avg episode reward: [(0, '-46.563')] [2022-07-09 23:16:22,233][26022] Updated weights on worker 0-0, policy_version 463581 (0.00084) [2022-07-09 23:16:24,228][26022] Updated weights on worker 0-0, policy_version 463591 (0.00106) [2022-07-09 23:16:25,998][26022] Updated weights on worker 0-0, policy_version 463601 (0.00093) [2022-07-09 23:16:26,956][25689] Fps is (10 sec: 5745.1, 60 sec: 5675.8, 300 sec: 5666.0). Total num frames: 474731520. Throughput: 0: 5947.5. Samples: 474730418. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:26,957][25689] Avg episode reward: [(0, '-44.564')] [2022-07-09 23:16:27,842][26022] Updated weights on worker 0-0, policy_version 463611 (0.00087) [2022-07-09 23:16:29,667][26022] Updated weights on worker 0-0, policy_version 463621 (0.00086) [2022-07-09 23:16:31,528][26022] Updated weights on worker 0-0, policy_version 463631 (0.00115) [2022-07-09 23:16:31,978][25689] Fps is (10 sec: 5560.3, 60 sec: 5692.0, 300 sec: 5663.9). Total num frames: 474761216. Throughput: 0: 5958.8. Samples: 474764816. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:31,980][25689] Avg episode reward: [(0, '-45.515')] [2022-07-09 23:16:33,118][26022] Updated weights on worker 0-0, policy_version 463641 (0.00086) [2022-07-09 23:16:35,105][26022] Updated weights on worker 0-0, policy_version 463651 (0.00088) [2022-07-09 23:16:36,807][26022] Updated weights on worker 0-0, policy_version 463661 (0.00086) [2022-07-09 23:16:36,990][25689] Fps is (10 sec: 5817.4, 60 sec: 5676.5, 300 sec: 5668.3). Total num frames: 474789888. Throughput: 0: 5948.7. Samples: 474799082. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:36,990][25689] Avg episode reward: [(0, '-45.002')] [2022-07-09 23:16:38,641][26022] Updated weights on worker 0-0, policy_version 463671 (0.00094) [2022-07-09 23:16:40,132][26022] Updated weights on worker 0-0, policy_version 463681 (0.00092) [2022-07-09 23:16:42,042][25689] Fps is (10 sec: 5698.4, 60 sec: 5672.2, 300 sec: 5665.2). Total num frames: 474818560. Throughput: 0: 5952.4. Samples: 474816410. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:42,043][25689] Avg episode reward: [(0, '-44.641')] [2022-07-09 23:16:42,180][26022] Updated weights on worker 0-0, policy_version 463691 (0.00088) [2022-07-09 23:16:43,945][26022] Updated weights on worker 0-0, policy_version 463701 (0.00090) [2022-07-09 23:16:45,859][26022] Updated weights on worker 0-0, policy_version 463711 (0.00094) [2022-07-09 23:16:47,089][25689] Fps is (10 sec: 5779.2, 60 sec: 5693.2, 300 sec: 5671.3). Total num frames: 474848256. Throughput: 0: 5978.6. Samples: 474850784. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:47,090][25689] Avg episode reward: [(0, '-43.604')] [2022-07-09 23:16:47,393][26022] Updated weights on worker 0-0, policy_version 463721 (0.00087) [2022-07-09 23:16:49,299][26022] Updated weights on worker 0-0, policy_version 463731 (0.00092) [2022-07-09 23:16:50,947][26022] Updated weights on worker 0-0, policy_version 463741 (0.00089) [2022-07-09 23:16:52,099][25689] Fps is (10 sec: 5701.8, 60 sec: 5677.5, 300 sec: 5661.1). Total num frames: 474875904. Throughput: 0: 5988.2. Samples: 474885300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:16:52,099][25689] Avg episode reward: [(0, '-43.181')] [2022-07-09 23:16:52,958][26022] Updated weights on worker 0-0, policy_version 463751 (0.00082) [2022-07-09 23:16:54,627][26022] Updated weights on worker 0-0, policy_version 463761 (0.00086) [2022-07-09 23:16:56,378][26022] Updated weights on worker 0-0, policy_version 463771 (0.00087) [2022-07-09 23:16:57,127][25689] Fps is (10 sec: 5713.1, 60 sec: 5676.7, 300 sec: 5668.2). Total num frames: 474905600. Throughput: 0: 5143.9. Samples: 474902664. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:16:57,127][25689] Avg episode reward: [(0, '-42.974')] [2022-07-09 23:16:58,050][26022] Updated weights on worker 0-0, policy_version 463781 (0.00096) [2022-07-09 23:17:00,127][26022] Updated weights on worker 0-0, policy_version 463791 (0.00087) [2022-07-09 23:17:02,147][25689] Fps is (10 sec: 5503.2, 60 sec: 5646.6, 300 sec: 5665.5). Total num frames: 474931200. Throughput: 0: 6012.7. Samples: 474937296. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:02,148][25689] Avg episode reward: [(0, '-42.647')] [2022-07-09 23:17:02,170][26022] Updated weights on worker 0-0, policy_version 463801 (0.00089) [2022-07-09 23:17:03,942][26022] Updated weights on worker 0-0, policy_version 463811 (0.00091) [2022-07-09 23:17:05,648][26022] Updated weights on worker 0-0, policy_version 463821 (0.00085) [2022-07-09 23:17:07,211][25689] Fps is (10 sec: 5381.9, 60 sec: 5672.8, 300 sec: 5667.8). Total num frames: 474959872. Throughput: 0: 5894.8. Samples: 474969396. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:07,211][25689] Avg episode reward: [(0, '-42.827')] [2022-07-09 23:17:07,580][26022] Updated weights on worker 0-0, policy_version 463831 (0.00085) [2022-07-09 23:17:09,405][26022] Updated weights on worker 0-0, policy_version 463841 (0.00087) [2022-07-09 23:17:11,032][26022] Updated weights on worker 0-0, policy_version 463851 (0.00087) [2022-07-09 23:17:12,219][25689] Fps is (10 sec: 5693.3, 60 sec: 5676.1, 300 sec: 5668.3). Total num frames: 474988544. Throughput: 0: 5036.6. Samples: 474986638. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:12,220][25689] Avg episode reward: [(0, '-42.926')] [2022-07-09 23:17:12,976][26022] Updated weights on worker 0-0, policy_version 463861 (0.00094) [2022-07-09 23:17:14,417][26022] Updated weights on worker 0-0, policy_version 463871 (0.00419) [2022-07-09 23:17:16,642][26022] Updated weights on worker 0-0, policy_version 463881 (0.00094) [2022-07-09 23:17:17,222][25689] Fps is (10 sec: 5932.6, 60 sec: 5728.0, 300 sec: 5671.9). Total num frames: 475019264. Throughput: 0: 5870.5. Samples: 475020632. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:17,222][25689] Avg episode reward: [(0, '-44.289')] [2022-07-09 23:17:18,572][26022] Updated weights on worker 0-0, policy_version 463891 (0.00086) [2022-07-09 23:17:19,990][26022] Updated weights on worker 0-0, policy_version 463901 (0.00116) [2022-07-09 23:17:22,096][26022] Updated weights on worker 0-0, policy_version 463911 (0.00100) [2022-07-09 23:17:22,233][25689] Fps is (10 sec: 5623.8, 60 sec: 5630.1, 300 sec: 5663.2). Total num frames: 475044864. Throughput: 0: 5843.8. Samples: 475054678. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:22,234][25689] Avg episode reward: [(0, '-44.567')] [2022-07-09 23:17:23,528][26022] Updated weights on worker 0-0, policy_version 463921 (0.00419) [2022-07-09 23:17:25,512][26022] Updated weights on worker 0-0, policy_version 463931 (0.00086) [2022-07-09 23:17:27,347][25689] Fps is (10 sec: 5461.4, 60 sec: 5680.4, 300 sec: 5661.2). Total num frames: 475074560. Throughput: 0: 5082.1. Samples: 475071730. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:27,347][25689] Avg episode reward: [(0, '-44.722')] [2022-07-09 23:17:27,487][26022] Updated weights on worker 0-0, policy_version 463941 (0.00093) [2022-07-09 23:17:29,061][26022] Updated weights on worker 0-0, policy_version 463951 (0.00084) [2022-07-09 23:17:31,024][26022] Updated weights on worker 0-0, policy_version 463961 (0.00097) [2022-07-09 23:17:32,368][25689] Fps is (10 sec: 5759.4, 60 sec: 5663.5, 300 sec: 5665.0). Total num frames: 475103232. Throughput: 0: 5909.3. Samples: 475105704. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:32,368][25689] Avg episode reward: [(0, '-43.828')] [2022-07-09 23:17:32,770][26022] Updated weights on worker 0-0, policy_version 463971 (0.00084) [2022-07-09 23:17:34,678][26022] Updated weights on worker 0-0, policy_version 463981 (0.00088) [2022-07-09 23:17:36,400][26022] Updated weights on worker 0-0, policy_version 463991 (0.00093) [2022-07-09 23:17:37,371][25689] Fps is (10 sec: 5822.7, 60 sec: 5681.2, 300 sec: 5673.2). Total num frames: 475132928. Throughput: 0: 5911.2. Samples: 475139738. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:37,373][25689] Avg episode reward: [(0, '-44.291')] [2022-07-09 23:17:38,415][26022] Updated weights on worker 0-0, policy_version 464001 (0.00093) [2022-07-09 23:17:40,013][26022] Updated weights on worker 0-0, policy_version 464011 (0.00085) [2022-07-09 23:17:40,049][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:17:40,062][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000464012_475148288.pth [2022-07-09 23:17:40,062][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000462017_473105408.pth [2022-07-09 23:17:41,951][26022] Updated weights on worker 0-0, policy_version 464021 (0.00091) [2022-07-09 23:17:42,396][25689] Fps is (10 sec: 5616.2, 60 sec: 5649.8, 300 sec: 5664.9). Total num frames: 475159552. Throughput: 0: 5054.8. Samples: 475156596. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:42,401][25689] Avg episode reward: [(0, '-44.600')] [2022-07-09 23:17:43,486][26022] Updated weights on worker 0-0, policy_version 464031 (0.00088) [2022-07-09 23:17:45,679][26022] Updated weights on worker 0-0, policy_version 464041 (0.00790) [2022-07-09 23:17:47,291][26022] Updated weights on worker 0-0, policy_version 464051 (0.00087) [2022-07-09 23:17:47,477][25689] Fps is (10 sec: 5471.8, 60 sec: 5629.8, 300 sec: 5660.6). Total num frames: 475188224. Throughput: 0: 5893.3. Samples: 475190362. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:47,477][25689] Avg episode reward: [(0, '-44.406')] [2022-07-09 23:17:49,152][26022] Updated weights on worker 0-0, policy_version 464061 (0.00402) [2022-07-09 23:17:50,963][26022] Updated weights on worker 0-0, policy_version 464071 (0.00090) [2022-07-09 23:17:52,514][25689] Fps is (10 sec: 5667.5, 60 sec: 5644.1, 300 sec: 5666.8). Total num frames: 475216896. Throughput: 0: 5893.5. Samples: 475224436. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:52,516][25689] Avg episode reward: [(0, '-45.176')] [2022-07-09 23:17:52,726][26022] Updated weights on worker 0-0, policy_version 464081 (0.00093) [2022-07-09 23:17:54,401][26022] Updated weights on worker 0-0, policy_version 464091 (0.00087) [2022-07-09 23:17:56,419][26022] Updated weights on worker 0-0, policy_version 464101 (0.00086) [2022-07-09 23:17:57,529][25689] Fps is (10 sec: 5806.5, 60 sec: 5645.4, 300 sec: 5664.8). Total num frames: 475246592. Throughput: 0: 5059.4. Samples: 475241724. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:17:57,529][25689] Avg episode reward: [(0, '-45.437')] [2022-07-09 23:17:57,839][26022] Updated weights on worker 0-0, policy_version 464111 (0.00091) [2022-07-09 23:18:00,077][26022] Updated weights on worker 0-0, policy_version 464121 (0.00091) [2022-07-09 23:18:01,500][26022] Updated weights on worker 0-0, policy_version 464131 (0.00095) [2022-07-09 23:18:02,555][25689] Fps is (10 sec: 5609.2, 60 sec: 5661.8, 300 sec: 5672.2). Total num frames: 475273216. Throughput: 0: 5936.9. Samples: 475276276. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:02,555][25689] Avg episode reward: [(0, '-45.255')] [2022-07-09 23:18:03,979][26022] Updated weights on worker 0-0, policy_version 464141 (0.00093) [2022-07-09 23:18:05,652][26022] Updated weights on worker 0-0, policy_version 464151 (0.00085) [2022-07-09 23:18:07,416][26022] Updated weights on worker 0-0, policy_version 464161 (0.00087) [2022-07-09 23:18:07,592][25689] Fps is (10 sec: 5494.8, 60 sec: 5664.3, 300 sec: 5665.5). Total num frames: 475301888. Throughput: 0: 5868.9. Samples: 475308418. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:07,593][25689] Avg episode reward: [(0, '-44.655')] [2022-07-09 23:18:09,257][26022] Updated weights on worker 0-0, policy_version 464171 (0.00088) [2022-07-09 23:18:10,846][26022] Updated weights on worker 0-0, policy_version 464181 (0.00091) [2022-07-09 23:18:12,601][25689] Fps is (10 sec: 5606.1, 60 sec: 5647.2, 300 sec: 5662.8). Total num frames: 475329536. Throughput: 0: 5038.3. Samples: 475325640. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:12,601][25689] Avg episode reward: [(0, '-43.928')] [2022-07-09 23:18:12,862][26022] Updated weights on worker 0-0, policy_version 464191 (0.00077) [2022-07-09 23:18:14,632][26022] Updated weights on worker 0-0, policy_version 464201 (0.00092) [2022-07-09 23:18:16,423][26022] Updated weights on worker 0-0, policy_version 464211 (0.00079) [2022-07-09 23:18:17,620][25689] Fps is (10 sec: 5616.4, 60 sec: 5611.8, 300 sec: 5667.8). Total num frames: 475358208. Throughput: 0: 5863.7. Samples: 475359532. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:17,620][25689] Avg episode reward: [(0, '-43.629')] [2022-07-09 23:18:18,347][26022] Updated weights on worker 0-0, policy_version 464221 (0.00076) [2022-07-09 23:18:20,060][26022] Updated weights on worker 0-0, policy_version 464231 (0.00087) [2022-07-09 23:18:21,998][26022] Updated weights on worker 0-0, policy_version 464241 (0.00095) [2022-07-09 23:18:22,631][25689] Fps is (10 sec: 5717.4, 60 sec: 5662.8, 300 sec: 5668.7). Total num frames: 475386880. Throughput: 0: 5834.9. Samples: 475393418. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:22,631][25689] Avg episode reward: [(0, '-42.697')] [2022-07-09 23:18:23,700][26022] Updated weights on worker 0-0, policy_version 464251 (0.00085) [2022-07-09 23:18:25,709][26022] Updated weights on worker 0-0, policy_version 464261 (0.00082) [2022-07-09 23:18:27,330][26022] Updated weights on worker 0-0, policy_version 464271 (0.00088) [2022-07-09 23:18:27,702][25689] Fps is (10 sec: 5789.0, 60 sec: 5666.7, 300 sec: 5668.3). Total num frames: 475416576. Throughput: 0: 5079.5. Samples: 475410568. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:27,703][25689] Avg episode reward: [(0, '-43.155')] [2022-07-09 23:18:29,321][26022] Updated weights on worker 0-0, policy_version 464281 (0.00088) [2022-07-09 23:18:30,875][26022] Updated weights on worker 0-0, policy_version 464291 (0.00086) [2022-07-09 23:18:32,787][25689] Fps is (10 sec: 5545.3, 60 sec: 5626.8, 300 sec: 5663.5). Total num frames: 475443200. Throughput: 0: 5894.1. Samples: 475444618. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:32,788][25689] Avg episode reward: [(0, '-43.045')] [2022-07-09 23:18:32,847][26022] Updated weights on worker 0-0, policy_version 464301 (0.00093) [2022-07-09 23:18:34,469][26022] Updated weights on worker 0-0, policy_version 464311 (0.00095) [2022-07-09 23:18:36,342][26022] Updated weights on worker 0-0, policy_version 464321 (0.00079) [2022-07-09 23:18:37,788][25689] Fps is (10 sec: 5482.5, 60 sec: 5610.1, 300 sec: 5660.7). Total num frames: 475471872. Throughput: 0: 5918.8. Samples: 475478904. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:37,789][25689] Avg episode reward: [(0, '-43.172')] [2022-07-09 23:18:38,012][26022] Updated weights on worker 0-0, policy_version 464331 (0.00084) [2022-07-09 23:18:39,898][26022] Updated weights on worker 0-0, policy_version 464341 (0.00088) [2022-07-09 23:18:41,761][26022] Updated weights on worker 0-0, policy_version 464351 (0.00082) [2022-07-09 23:18:42,793][25689] Fps is (10 sec: 5833.2, 60 sec: 5662.8, 300 sec: 5666.0). Total num frames: 475501568. Throughput: 0: 5094.0. Samples: 475496126. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:42,794][25689] Avg episode reward: [(0, '-43.285')] [2022-07-09 23:18:43,453][26022] Updated weights on worker 0-0, policy_version 464361 (0.00096) [2022-07-09 23:18:45,426][26022] Updated weights on worker 0-0, policy_version 464371 (0.00095) [2022-07-09 23:18:47,159][26022] Updated weights on worker 0-0, policy_version 464381 (0.00086) [2022-07-09 23:18:47,846][25689] Fps is (10 sec: 5701.4, 60 sec: 5648.4, 300 sec: 5665.4). Total num frames: 475529216. Throughput: 0: 5931.5. Samples: 475530050. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:47,847][25689] Avg episode reward: [(0, '-44.005')] [2022-07-09 23:18:48,889][26022] Updated weights on worker 0-0, policy_version 464391 (0.00090) [2022-07-09 23:18:50,771][26022] Updated weights on worker 0-0, policy_version 464401 (0.00086) [2022-07-09 23:18:52,549][26022] Updated weights on worker 0-0, policy_version 464411 (0.00087) [2022-07-09 23:18:52,877][25689] Fps is (10 sec: 5686.7, 60 sec: 5666.0, 300 sec: 5666.6). Total num frames: 475558912. Throughput: 0: 5945.2. Samples: 475564056. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:52,878][25689] Avg episode reward: [(0, '-43.782')] [2022-07-09 23:18:54,299][26022] Updated weights on worker 0-0, policy_version 464421 (0.00089) [2022-07-09 23:18:56,272][26022] Updated weights on worker 0-0, policy_version 464431 (0.00081) [2022-07-09 23:18:57,773][26022] Updated weights on worker 0-0, policy_version 464441 (0.00088) [2022-07-09 23:18:57,891][25689] Fps is (10 sec: 5810.8, 60 sec: 5649.1, 300 sec: 5670.8). Total num frames: 475587584. Throughput: 0: 5096.6. Samples: 475581358. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:18:57,891][25689] Avg episode reward: [(0, '-43.726')] [2022-07-09 23:18:59,887][26022] Updated weights on worker 0-0, policy_version 464451 (0.00079) [2022-07-09 23:19:01,400][26022] Updated weights on worker 0-0, policy_version 464461 (0.00084) [2022-07-09 23:19:02,966][25689] Fps is (10 sec: 5379.3, 60 sec: 5627.5, 300 sec: 5664.6). Total num frames: 475613184. Throughput: 0: 5927.6. Samples: 475615702. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:02,969][25689] Avg episode reward: [(0, '-43.547')] [2022-07-09 23:19:03,812][26022] Updated weights on worker 0-0, policy_version 464471 (0.00091) [2022-07-09 23:19:05,353][26022] Updated weights on worker 0-0, policy_version 464481 (0.00088) [2022-07-09 23:19:07,211][26022] Updated weights on worker 0-0, policy_version 464491 (0.00090) [2022-07-09 23:19:08,074][25689] Fps is (10 sec: 5530.9, 60 sec: 5654.9, 300 sec: 5670.2). Total num frames: 475643904. Throughput: 0: 5832.3. Samples: 475648024. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:08,074][25689] Avg episode reward: [(0, '-43.717')] [2022-07-09 23:19:09,114][26022] Updated weights on worker 0-0, policy_version 464501 (0.00086) [2022-07-09 23:19:10,651][26022] Updated weights on worker 0-0, policy_version 464511 (0.00082) [2022-07-09 23:19:12,535][26022] Updated weights on worker 0-0, policy_version 464521 (0.00093) [2022-07-09 23:19:13,145][25689] Fps is (10 sec: 5835.1, 60 sec: 5666.0, 300 sec: 5669.6). Total num frames: 475672576. Throughput: 0: 4997.0. Samples: 475665332. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:13,145][25689] Avg episode reward: [(0, '-44.173')] [2022-07-09 23:19:14,447][26022] Updated weights on worker 0-0, policy_version 464531 (0.00084) [2022-07-09 23:19:16,165][26022] Updated weights on worker 0-0, policy_version 464541 (0.00098) [2022-07-09 23:19:18,003][26022] Updated weights on worker 0-0, policy_version 464551 (0.00089) [2022-07-09 23:19:18,161][25689] Fps is (10 sec: 5684.9, 60 sec: 5666.3, 300 sec: 5665.9). Total num frames: 475701248. Throughput: 0: 5822.4. Samples: 475699378. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:18,161][25689] Avg episode reward: [(0, '-44.214')] [2022-07-09 23:19:20,011][26022] Updated weights on worker 0-0, policy_version 464561 (0.00086) [2022-07-09 23:19:21,523][26022] Updated weights on worker 0-0, policy_version 464571 (0.00089) [2022-07-09 23:19:23,214][25689] Fps is (10 sec: 5593.0, 60 sec: 5645.4, 300 sec: 5663.2). Total num frames: 475728896. Throughput: 0: 5818.8. Samples: 475733522. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:23,215][25689] Avg episode reward: [(0, '-44.706')] [2022-07-09 23:19:23,567][26022] Updated weights on worker 0-0, policy_version 464581 (0.00087) [2022-07-09 23:19:25,399][26022] Updated weights on worker 0-0, policy_version 464591 (0.00083) [2022-07-09 23:19:27,002][26022] Updated weights on worker 0-0, policy_version 464601 (0.00081) [2022-07-09 23:19:28,310][25689] Fps is (10 sec: 5650.0, 60 sec: 5643.1, 300 sec: 5669.4). Total num frames: 475758592. Throughput: 0: 5925.7. Samples: 475767940. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:28,311][25689] Avg episode reward: [(0, '-45.748')] [2022-07-09 23:19:28,957][26022] Updated weights on worker 0-0, policy_version 464611 (0.00091) [2022-07-09 23:19:30,443][26022] Updated weights on worker 0-0, policy_version 464621 (0.00090) [2022-07-09 23:19:32,524][26022] Updated weights on worker 0-0, policy_version 464631 (0.00089) [2022-07-09 23:19:33,344][25689] Fps is (10 sec: 5863.1, 60 sec: 5698.6, 300 sec: 5673.0). Total num frames: 475788288. Throughput: 0: 5925.5. Samples: 475785024. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:33,345][25689] Avg episode reward: [(0, '-45.648')] [2022-07-09 23:19:34,135][26022] Updated weights on worker 0-0, policy_version 464641 (0.00083) [2022-07-09 23:19:35,999][26022] Updated weights on worker 0-0, policy_version 464651 (0.00097) [2022-07-09 23:19:37,938][26022] Updated weights on worker 0-0, policy_version 464661 (0.00092) [2022-07-09 23:19:38,364][25689] Fps is (10 sec: 5601.7, 60 sec: 5663.0, 300 sec: 5659.9). Total num frames: 475814912. Throughput: 0: 5932.1. Samples: 475819226. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:38,365][25689] Avg episode reward: [(0, '-45.281')] [2022-07-09 23:19:39,437][26022] Updated weights on worker 0-0, policy_version 464671 (0.00198) [2022-07-09 23:19:40,204][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:19:40,215][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000464674_475826176.pth [2022-07-09 23:19:40,215][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000462681_473785344.pth [2022-07-09 23:19:41,438][26022] Updated weights on worker 0-0, policy_version 464681 (0.00087) [2022-07-09 23:19:43,175][26022] Updated weights on worker 0-0, policy_version 464691 (0.00086) [2022-07-09 23:19:43,372][25689] Fps is (10 sec: 5514.3, 60 sec: 5645.8, 300 sec: 5662.4). Total num frames: 475843584. Throughput: 0: 5963.8. Samples: 475853736. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-09 23:19:43,372][25689] Avg episode reward: [(0, '-46.088')] [2022-07-09 23:19:44,781][26022] Updated weights on worker 0-0, policy_version 464701 (0.00086) [2022-07-09 23:19:46,892][26022] Updated weights on worker 0-0, policy_version 464711 (0.00095) [2022-07-09 23:19:48,269][26022] Updated weights on worker 0-0, policy_version 464721 (0.00089) [2022-07-09 23:19:48,495][25689] Fps is (10 sec: 5862.6, 60 sec: 5689.9, 300 sec: 5668.8). Total num frames: 475874304. Throughput: 0: 5104.3. Samples: 475870968. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:19:48,495][25689] Avg episode reward: [(0, '-46.035')] [2022-07-09 23:19:50,484][26022] Updated weights on worker 0-0, policy_version 464731 (0.00083) [2022-07-09 23:19:52,206][26022] Updated weights on worker 0-0, policy_version 464741 (0.00082) [2022-07-09 23:19:53,498][25689] Fps is (10 sec: 5663.0, 60 sec: 5641.9, 300 sec: 5662.5). Total num frames: 475900928. Throughput: 0: 5963.7. Samples: 475905214. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:19:53,498][25689] Avg episode reward: [(0, '-45.839')] [2022-07-09 23:19:53,882][26022] Updated weights on worker 0-0, policy_version 464751 (0.00089) [2022-07-09 23:19:55,610][26022] Updated weights on worker 0-0, policy_version 464761 (0.00097) [2022-07-09 23:19:57,599][26022] Updated weights on worker 0-0, policy_version 464771 (0.00084) [2022-07-09 23:19:58,519][25689] Fps is (10 sec: 5618.5, 60 sec: 5658.1, 300 sec: 5663.3). Total num frames: 475930624. Throughput: 0: 5975.2. Samples: 475939654. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:19:58,519][25689] Avg episode reward: [(0, '-45.023')] [2022-07-09 23:19:59,331][26022] Updated weights on worker 0-0, policy_version 464781 (0.00102) [2022-07-09 23:20:01,096][26022] Updated weights on worker 0-0, policy_version 464791 (0.00084) [2022-07-09 23:20:03,090][26022] Updated weights on worker 0-0, policy_version 464801 (0.00098) [2022-07-09 23:20:03,543][25689] Fps is (10 sec: 5708.8, 60 sec: 5696.7, 300 sec: 5668.2). Total num frames: 475958272. Throughput: 0: 5120.6. Samples: 475957022. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:03,543][25689] Avg episode reward: [(0, '-45.078')] [2022-07-09 23:20:04,922][26022] Updated weights on worker 0-0, policy_version 464811 (0.00092) [2022-07-09 23:20:06,771][26022] Updated weights on worker 0-0, policy_version 464821 (0.00092) [2022-07-09 23:20:08,467][26022] Updated weights on worker 0-0, policy_version 464831 (0.00098) [2022-07-09 23:20:08,668][25689] Fps is (10 sec: 5548.9, 60 sec: 5661.2, 300 sec: 5666.0). Total num frames: 475986944. Throughput: 0: 5863.9. Samples: 475989264. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:08,671][25689] Avg episode reward: [(0, '-44.639')] [2022-07-09 23:20:10,378][26022] Updated weights on worker 0-0, policy_version 464841 (0.00086) [2022-07-09 23:20:12,139][26022] Updated weights on worker 0-0, policy_version 464851 (0.00085) [2022-07-09 23:20:13,703][25689] Fps is (10 sec: 5644.0, 60 sec: 5664.6, 300 sec: 5665.9). Total num frames: 476015616. Throughput: 0: 5853.0. Samples: 476023474. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:13,704][25689] Avg episode reward: [(0, '-44.734')] [2022-07-09 23:20:13,884][26022] Updated weights on worker 0-0, policy_version 464861 (0.00088) [2022-07-09 23:20:15,899][26022] Updated weights on worker 0-0, policy_version 464871 (0.00084) [2022-07-09 23:20:17,424][26022] Updated weights on worker 0-0, policy_version 464881 (0.00091) [2022-07-09 23:20:18,707][25689] Fps is (10 sec: 5712.6, 60 sec: 5665.8, 300 sec: 5667.3). Total num frames: 476044288. Throughput: 0: 5012.8. Samples: 476040852. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:18,708][25689] Avg episode reward: [(0, '-44.407')] [2022-07-09 23:20:19,403][26022] Updated weights on worker 0-0, policy_version 464891 (0.00086) [2022-07-09 23:20:21,079][26022] Updated weights on worker 0-0, policy_version 464901 (0.00075) [2022-07-09 23:20:22,843][26022] Updated weights on worker 0-0, policy_version 464911 (0.00086) [2022-07-09 23:20:23,727][25689] Fps is (10 sec: 5720.7, 60 sec: 5685.9, 300 sec: 5667.7). Total num frames: 476072960. Throughput: 0: 5849.3. Samples: 476075084. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:23,727][25689] Avg episode reward: [(0, '-44.312')] [2022-07-09 23:20:24,630][26022] Updated weights on worker 0-0, policy_version 464921 (0.00099) [2022-07-09 23:20:26,611][26022] Updated weights on worker 0-0, policy_version 464931 (0.00081) [2022-07-09 23:20:28,170][26022] Updated weights on worker 0-0, policy_version 464941 (0.00081) [2022-07-09 23:20:28,794][25689] Fps is (10 sec: 5887.7, 60 sec: 5705.4, 300 sec: 5673.5). Total num frames: 476103680. Throughput: 0: 5964.6. Samples: 476109306. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:28,795][25689] Avg episode reward: [(0, '-44.032')] [2022-07-09 23:20:30,285][26022] Updated weights on worker 0-0, policy_version 464951 (0.00102) [2022-07-09 23:20:31,725][26022] Updated weights on worker 0-0, policy_version 464961 (0.00088) [2022-07-09 23:20:33,823][25689] Fps is (10 sec: 5578.5, 60 sec: 5638.2, 300 sec: 5659.7). Total num frames: 476129280. Throughput: 0: 5113.0. Samples: 476126346. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:33,823][25689] Avg episode reward: [(0, '-44.444')] [2022-07-09 23:20:33,827][26022] Updated weights on worker 0-0, policy_version 464971 (0.00093) [2022-07-09 23:20:35,199][26022] Updated weights on worker 0-0, policy_version 464981 (0.00084) [2022-07-09 23:20:37,355][26022] Updated weights on worker 0-0, policy_version 464991 (0.00089) [2022-07-09 23:20:38,828][26022] Updated weights on worker 0-0, policy_version 465001 (0.00087) [2022-07-09 23:20:38,847][25689] Fps is (10 sec: 5704.5, 60 sec: 5722.5, 300 sec: 5669.7). Total num frames: 476161024. Throughput: 0: 5953.0. Samples: 476160746. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:38,847][25689] Avg episode reward: [(0, '-44.550')] [2022-07-09 23:20:40,969][26022] Updated weights on worker 0-0, policy_version 465011 (0.00092) [2022-07-09 23:20:42,496][26022] Updated weights on worker 0-0, policy_version 465021 (0.00093) [2022-07-09 23:20:43,880][25689] Fps is (10 sec: 5803.6, 60 sec: 5686.3, 300 sec: 5663.9). Total num frames: 476187648. Throughput: 0: 5962.9. Samples: 476195254. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:43,880][25689] Avg episode reward: [(0, '-43.559')] [2022-07-09 23:20:44,543][26022] Updated weights on worker 0-0, policy_version 465031 (0.00083) [2022-07-09 23:20:46,061][26022] Updated weights on worker 0-0, policy_version 465041 (0.00085) [2022-07-09 23:20:48,039][26022] Updated weights on worker 0-0, policy_version 465051 (0.00087) [2022-07-09 23:20:48,996][25689] Fps is (10 sec: 5649.8, 60 sec: 5686.9, 300 sec: 5669.0). Total num frames: 476218368. Throughput: 0: 5104.2. Samples: 476212422. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:48,997][25689] Avg episode reward: [(0, '-43.227')] [2022-07-09 23:20:49,400][26022] Updated weights on worker 0-0, policy_version 465061 (0.00082) [2022-07-09 23:20:51,753][26022] Updated weights on worker 0-0, policy_version 465071 (0.00082) [2022-07-09 23:20:53,272][26022] Updated weights on worker 0-0, policy_version 465081 (0.00095) [2022-07-09 23:20:54,063][25689] Fps is (10 sec: 5631.3, 60 sec: 5680.9, 300 sec: 5657.8). Total num frames: 476244992. Throughput: 0: 5942.8. Samples: 476246630. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:54,064][25689] Avg episode reward: [(0, '-43.850')] [2022-07-09 23:20:55,201][26022] Updated weights on worker 0-0, policy_version 465091 (0.00096) [2022-07-09 23:20:57,044][26022] Updated weights on worker 0-0, policy_version 465101 (0.00090) [2022-07-09 23:20:58,647][26022] Updated weights on worker 0-0, policy_version 465111 (0.00089) [2022-07-09 23:20:59,081][25689] Fps is (10 sec: 5686.3, 60 sec: 5698.1, 300 sec: 5668.9). Total num frames: 476275712. Throughput: 0: 5960.5. Samples: 476281354. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:20:59,082][25689] Avg episode reward: [(0, '-44.678')] [2022-07-09 23:21:00,494][26022] Updated weights on worker 0-0, policy_version 465121 (0.00088) [2022-07-09 23:21:02,454][26022] Updated weights on worker 0-0, policy_version 465131 (0.00081) [2022-07-09 23:21:04,086][25689] Fps is (10 sec: 5721.2, 60 sec: 5682.9, 300 sec: 5668.5). Total num frames: 476302336. Throughput: 0: 5113.1. Samples: 476298570. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:04,088][25689] Avg episode reward: [(0, '-44.728')] [2022-07-09 23:21:04,382][26022] Updated weights on worker 0-0, policy_version 465141 (0.00093) [2022-07-09 23:21:06,234][26022] Updated weights on worker 0-0, policy_version 465151 (0.00100) [2022-07-09 23:21:07,608][26022] Updated weights on worker 0-0, policy_version 465161 (0.00091) [2022-07-09 23:21:09,185][25689] Fps is (10 sec: 5473.0, 60 sec: 5685.5, 300 sec: 5667.4). Total num frames: 476331008. Throughput: 0: 5919.8. Samples: 476331932. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:09,185][25689] Avg episode reward: [(0, '-46.122')] [2022-07-09 23:21:09,762][26022] Updated weights on worker 0-0, policy_version 465171 (0.00081) [2022-07-09 23:21:11,451][26022] Updated weights on worker 0-0, policy_version 465181 (0.00081) [2022-07-09 23:21:13,128][26022] Updated weights on worker 0-0, policy_version 465191 (0.00093) [2022-07-09 23:21:14,233][25689] Fps is (10 sec: 5853.3, 60 sec: 5718.0, 300 sec: 5677.1). Total num frames: 476361728. Throughput: 0: 5932.1. Samples: 476366280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:14,233][25689] Avg episode reward: [(0, '-46.130')] [2022-07-09 23:21:14,997][26022] Updated weights on worker 0-0, policy_version 465201 (0.00109) [2022-07-09 23:21:16,832][26022] Updated weights on worker 0-0, policy_version 465211 (0.00088) [2022-07-09 23:21:18,659][26022] Updated weights on worker 0-0, policy_version 465221 (0.00091) [2022-07-09 23:21:19,332][25689] Fps is (10 sec: 5853.2, 60 sec: 5709.1, 300 sec: 5665.9). Total num frames: 476390400. Throughput: 0: 5053.3. Samples: 476383686. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:19,332][25689] Avg episode reward: [(0, '-45.833')] [2022-07-09 23:21:20,251][26022] Updated weights on worker 0-0, policy_version 465231 (0.00087) [2022-07-09 23:21:22,013][26022] Updated weights on worker 0-0, policy_version 465241 (0.00087) [2022-07-09 23:21:23,949][26022] Updated weights on worker 0-0, policy_version 465251 (0.00057) [2022-07-09 23:21:24,368][25689] Fps is (10 sec: 5657.9, 60 sec: 5707.5, 300 sec: 5674.1). Total num frames: 476419072. Throughput: 0: 5911.5. Samples: 476418468. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:24,369][25689] Avg episode reward: [(0, '-45.696')] [2022-07-09 23:21:25,480][26022] Updated weights on worker 0-0, policy_version 465261 (0.00080) [2022-07-09 23:21:27,567][26022] Updated weights on worker 0-0, policy_version 465271 (0.00093) [2022-07-09 23:21:29,191][26022] Updated weights on worker 0-0, policy_version 465281 (0.00383) [2022-07-09 23:21:29,425][25689] Fps is (10 sec: 5783.0, 60 sec: 5691.7, 300 sec: 5673.5). Total num frames: 476448768. Throughput: 0: 5972.2. Samples: 476452810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:29,427][25689] Avg episode reward: [(0, '-46.190')] [2022-07-09 23:21:31,144][26022] Updated weights on worker 0-0, policy_version 465291 (0.00089) [2022-07-09 23:21:32,674][26022] Updated weights on worker 0-0, policy_version 465301 (0.00070) [2022-07-09 23:21:34,465][25689] Fps is (10 sec: 5679.4, 60 sec: 5724.3, 300 sec: 5669.5). Total num frames: 476476416. Throughput: 0: 5131.3. Samples: 476470098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:34,466][25689] Avg episode reward: [(0, '-46.003')] [2022-07-09 23:21:34,536][26022] Updated weights on worker 0-0, policy_version 465311 (0.00078) [2022-07-09 23:21:36,167][26022] Updated weights on worker 0-0, policy_version 465321 (0.00086) [2022-07-09 23:21:38,123][26022] Updated weights on worker 0-0, policy_version 465331 (0.00086) [2022-07-09 23:21:39,478][25689] Fps is (10 sec: 5805.6, 60 sec: 5708.4, 300 sec: 5677.1). Total num frames: 476507136. Throughput: 0: 6014.0. Samples: 476504850. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:39,480][25689] Avg episode reward: [(0, '-44.888')] [2022-07-09 23:21:39,908][26022] Updated weights on worker 0-0, policy_version 465341 (0.00092) [2022-07-09 23:21:40,656][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:21:40,669][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000465344_476512256.pth [2022-07-09 23:21:40,675][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000463347_474467328.pth [2022-07-09 23:21:41,695][26022] Updated weights on worker 0-0, policy_version 465351 (0.00073) [2022-07-09 23:21:43,450][26022] Updated weights on worker 0-0, policy_version 465361 (0.00086) [2022-07-09 23:21:44,497][25689] Fps is (10 sec: 5716.3, 60 sec: 5709.8, 300 sec: 5667.3). Total num frames: 476533760. Throughput: 0: 6004.6. Samples: 476539332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:44,507][25689] Avg episode reward: [(0, '-46.497')] [2022-07-09 23:21:45,232][26022] Updated weights on worker 0-0, policy_version 465371 (0.00093) [2022-07-09 23:21:47,203][26022] Updated weights on worker 0-0, policy_version 465381 (0.00089) [2022-07-09 23:21:48,675][26022] Updated weights on worker 0-0, policy_version 465391 (0.00085) [2022-07-09 23:21:49,543][25689] Fps is (10 sec: 5697.5, 60 sec: 5716.4, 300 sec: 5676.9). Total num frames: 476564480. Throughput: 0: 5163.0. Samples: 476556682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:49,544][25689] Avg episode reward: [(0, '-45.456')] [2022-07-09 23:21:50,636][26022] Updated weights on worker 0-0, policy_version 465401 (0.00091) [2022-07-09 23:21:52,341][26022] Updated weights on worker 0-0, policy_version 465411 (0.00093) [2022-07-09 23:21:54,098][26022] Updated weights on worker 0-0, policy_version 465421 (0.00093) [2022-07-09 23:21:54,583][25689] Fps is (10 sec: 5888.3, 60 sec: 5752.8, 300 sec: 5673.3). Total num frames: 476593152. Throughput: 0: 6018.5. Samples: 476591180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:54,584][25689] Avg episode reward: [(0, '-45.367')] [2022-07-09 23:21:56,074][26022] Updated weights on worker 0-0, policy_version 465431 (0.00106) [2022-07-09 23:21:57,710][26022] Updated weights on worker 0-0, policy_version 465441 (0.00091) [2022-07-09 23:21:59,506][26022] Updated weights on worker 0-0, policy_version 465451 (0.00080) [2022-07-09 23:21:59,640][25689] Fps is (10 sec: 5780.7, 60 sec: 5732.2, 300 sec: 5686.3). Total num frames: 476622848. Throughput: 0: 5998.6. Samples: 476625794. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:21:59,641][25689] Avg episode reward: [(0, '-45.184')] [2022-07-09 23:22:01,490][26022] Updated weights on worker 0-0, policy_version 465461 (0.00092) [2022-07-09 23:22:03,373][26022] Updated weights on worker 0-0, policy_version 465471 (0.00084) [2022-07-09 23:22:04,651][25689] Fps is (10 sec: 5492.6, 60 sec: 5714.8, 300 sec: 5677.0). Total num frames: 476648448. Throughput: 0: 5906.4. Samples: 476658368. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:22:04,651][25689] Avg episode reward: [(0, '-45.118')] [2022-07-09 23:22:05,414][26022] Updated weights on worker 0-0, policy_version 465481 (0.00085) [2022-07-09 23:22:06,890][26022] Updated weights on worker 0-0, policy_version 465491 (0.00100) [2022-07-09 23:22:08,891][26022] Updated weights on worker 0-0, policy_version 465501 (0.00097) [2022-07-09 23:22:09,708][25689] Fps is (10 sec: 5594.2, 60 sec: 5752.5, 300 sec: 5683.0). Total num frames: 476679168. Throughput: 0: 5891.9. Samples: 476675490. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:22:09,708][25689] Avg episode reward: [(0, '-44.832')] [2022-07-09 23:22:10,576][26022] Updated weights on worker 0-0, policy_version 465511 (0.00087) [2022-07-09 23:22:12,432][26022] Updated weights on worker 0-0, policy_version 465521 (0.00091) [2022-07-09 23:22:14,186][26022] Updated weights on worker 0-0, policy_version 465531 (0.00100) [2022-07-09 23:22:14,747][25689] Fps is (10 sec: 5781.2, 60 sec: 5702.6, 300 sec: 5672.0). Total num frames: 476706816. Throughput: 0: 5871.6. Samples: 476709572. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:22:14,747][25689] Avg episode reward: [(0, '-44.454')] [2022-07-09 23:22:16,007][26022] Updated weights on worker 0-0, policy_version 465541 (0.00078) [2022-07-09 23:22:17,753][26022] Updated weights on worker 0-0, policy_version 465551 (0.00091) [2022-07-09 23:22:19,564][26022] Updated weights on worker 0-0, policy_version 465561 (0.00094) [2022-07-09 23:22:19,793][25689] Fps is (10 sec: 5584.5, 60 sec: 5707.6, 300 sec: 5681.6). Total num frames: 476735488. Throughput: 0: 5862.5. Samples: 476743938. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:22:19,793][25689] Avg episode reward: [(0, '-43.645')] [2022-07-09 23:22:21,279][26022] Updated weights on worker 0-0, policy_version 465571 (0.00091) [2022-07-09 23:22:23,235][26022] Updated weights on worker 0-0, policy_version 465581 (0.00089) [2022-07-09 23:22:24,803][25689] Fps is (10 sec: 5804.0, 60 sec: 5727.0, 300 sec: 5683.6). Total num frames: 476765184. Throughput: 0: 5099.7. Samples: 476761140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:22:24,804][25689] Avg episode reward: [(0, '-43.633')] [2022-07-09 23:22:24,810][26022] Updated weights on worker 0-0, policy_version 465591 (0.00089) [2022-07-09 23:22:26,807][26022] Updated weights on worker 0-0, policy_version 465601 (0.00115) [2022-07-09 23:22:28,280][26022] Updated weights on worker 0-0, policy_version 465611 (0.00083) [2022-07-09 23:22:29,871][25689] Fps is (10 sec: 5690.0, 60 sec: 5692.1, 300 sec: 5679.3). Total num frames: 476792832. Throughput: 0: 5965.2. Samples: 476795768. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-09 23:22:29,871][25689] Avg episode reward: [(0, '-43.078')] [2022-07-09 23:22:30,354][26022] Updated weights on worker 0-0, policy_version 465621 (0.00991) [2022-07-09 23:22:32,069][26022] Updated weights on worker 0-0, policy_version 465631 (0.00083) [2022-07-09 23:22:33,845][26022] Updated weights on worker 0-0, policy_version 465641 (0.00088) [2022-07-09 23:22:34,875][25689] Fps is (10 sec: 5592.1, 60 sec: 5712.5, 300 sec: 5675.8). Total num frames: 476821504. Throughput: 0: 5999.7. Samples: 476830334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:22:34,875][25689] Avg episode reward: [(0, '-43.800')] [2022-07-09 23:22:35,639][26022] Updated weights on worker 0-0, policy_version 465651 (0.00090) [2022-07-09 23:22:37,331][26022] Updated weights on worker 0-0, policy_version 465661 (0.00085) [2022-07-09 23:22:39,215][26022] Updated weights on worker 0-0, policy_version 465671 (0.00088) [2022-07-09 23:22:39,890][25689] Fps is (10 sec: 5826.0, 60 sec: 5695.4, 300 sec: 5686.3). Total num frames: 476851200. Throughput: 0: 5162.3. Samples: 476847682. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:22:39,890][25689] Avg episode reward: [(0, '-43.355')] [2022-07-09 23:22:40,870][26022] Updated weights on worker 0-0, policy_version 465681 (0.00098) [2022-07-09 23:22:42,741][26022] Updated weights on worker 0-0, policy_version 465691 (0.00086) [2022-07-09 23:22:44,466][26022] Updated weights on worker 0-0, policy_version 465701 (0.00088) [2022-07-09 23:22:44,906][25689] Fps is (10 sec: 5716.6, 60 sec: 5712.5, 300 sec: 5684.1). Total num frames: 476878848. Throughput: 0: 6024.1. Samples: 476882240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:22:44,906][25689] Avg episode reward: [(0, '-44.458')] [2022-07-09 23:22:46,212][26022] Updated weights on worker 0-0, policy_version 465711 (0.00081) [2022-07-09 23:22:48,011][26022] Updated weights on worker 0-0, policy_version 465721 (0.00088) [2022-07-09 23:22:49,780][26022] Updated weights on worker 0-0, policy_version 465731 (0.00088) [2022-07-09 23:22:49,967][25689] Fps is (10 sec: 5690.5, 60 sec: 5694.2, 300 sec: 5687.1). Total num frames: 476908544. Throughput: 0: 6026.5. Samples: 476916874. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:22:49,968][25689] Avg episode reward: [(0, '-44.681')] [2022-07-09 23:22:51,504][26022] Updated weights on worker 0-0, policy_version 465741 (0.00093) [2022-07-09 23:22:53,499][26022] Updated weights on worker 0-0, policy_version 465751 (0.00084) [2022-07-09 23:22:54,989][25689] Fps is (10 sec: 5890.1, 60 sec: 5712.8, 300 sec: 5687.0). Total num frames: 476938240. Throughput: 0: 5148.6. Samples: 476933896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:22:54,991][25689] Avg episode reward: [(0, '-45.100')] [2022-07-09 23:22:55,118][26022] Updated weights on worker 0-0, policy_version 465761 (0.00088) [2022-07-09 23:22:57,170][26022] Updated weights on worker 0-0, policy_version 465771 (0.00090) [2022-07-09 23:22:58,712][26022] Updated weights on worker 0-0, policy_version 465781 (0.00090) [2022-07-09 23:23:00,000][25689] Fps is (10 sec: 5715.1, 60 sec: 5683.2, 300 sec: 5690.7). Total num frames: 476965888. Throughput: 0: 6007.7. Samples: 476968502. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:00,001][25689] Avg episode reward: [(0, '-45.573')] [2022-07-09 23:23:00,593][26022] Updated weights on worker 0-0, policy_version 465791 (0.00089) [2022-07-09 23:23:02,673][26022] Updated weights on worker 0-0, policy_version 465801 (0.00084) [2022-07-09 23:23:04,571][26022] Updated weights on worker 0-0, policy_version 465811 (0.00088) [2022-07-09 23:23:05,019][25689] Fps is (10 sec: 5411.2, 60 sec: 5699.4, 300 sec: 5684.2). Total num frames: 476992512. Throughput: 0: 5893.3. Samples: 477000772. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:05,019][25689] Avg episode reward: [(0, '-46.086')] [2022-07-09 23:23:06,251][26022] Updated weights on worker 0-0, policy_version 465821 (0.00090) [2022-07-09 23:23:08,319][26022] Updated weights on worker 0-0, policy_version 465831 (0.00084) [2022-07-09 23:23:09,885][26022] Updated weights on worker 0-0, policy_version 465841 (0.00087) [2022-07-09 23:23:10,135][25689] Fps is (10 sec: 5557.4, 60 sec: 5676.9, 300 sec: 5689.0). Total num frames: 477022208. Throughput: 0: 5009.5. Samples: 477017906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:10,135][25689] Avg episode reward: [(0, '-46.743')] [2022-07-09 23:23:11,839][26022] Updated weights on worker 0-0, policy_version 465851 (0.00079) [2022-07-09 23:23:13,399][26022] Updated weights on worker 0-0, policy_version 465861 (0.00094) [2022-07-09 23:23:15,159][25689] Fps is (10 sec: 5655.2, 60 sec: 5678.3, 300 sec: 5685.5). Total num frames: 477049856. Throughput: 0: 5866.7. Samples: 477052224. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:15,159][25689] Avg episode reward: [(0, '-45.201')] [2022-07-09 23:23:15,415][26022] Updated weights on worker 0-0, policy_version 465871 (0.00094) [2022-07-09 23:23:17,027][26022] Updated weights on worker 0-0, policy_version 465881 (0.00083) [2022-07-09 23:23:18,877][26022] Updated weights on worker 0-0, policy_version 465891 (0.00088) [2022-07-09 23:23:20,161][25689] Fps is (10 sec: 5821.7, 60 sec: 5716.4, 300 sec: 5692.5). Total num frames: 477080576. Throughput: 0: 5866.5. Samples: 477086772. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:20,161][25689] Avg episode reward: [(0, '-45.569')] [2022-07-09 23:23:20,567][26022] Updated weights on worker 0-0, policy_version 465901 (0.00088) [2022-07-09 23:23:22,571][26022] Updated weights on worker 0-0, policy_version 465911 (0.00094) [2022-07-09 23:23:24,062][26022] Updated weights on worker 0-0, policy_version 465921 (0.00084) [2022-07-09 23:23:25,166][25689] Fps is (10 sec: 5730.3, 60 sec: 5666.0, 300 sec: 5683.5). Total num frames: 477107200. Throughput: 0: 5118.3. Samples: 477103890. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:25,166][25689] Avg episode reward: [(0, '-45.149')] [2022-07-09 23:23:26,314][26022] Updated weights on worker 0-0, policy_version 465931 (0.00093) [2022-07-09 23:23:27,844][26022] Updated weights on worker 0-0, policy_version 465941 (0.00087) [2022-07-09 23:23:29,753][26022] Updated weights on worker 0-0, policy_version 465951 (0.00083) [2022-07-09 23:23:30,265][25689] Fps is (10 sec: 5573.7, 60 sec: 5697.0, 300 sec: 5693.5). Total num frames: 477136896. Throughput: 0: 5957.0. Samples: 477137824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:30,266][25689] Avg episode reward: [(0, '-44.322')] [2022-07-09 23:23:31,596][26022] Updated weights on worker 0-0, policy_version 465961 (0.00082) [2022-07-09 23:23:33,304][26022] Updated weights on worker 0-0, policy_version 465971 (0.00091) [2022-07-09 23:23:35,199][26022] Updated weights on worker 0-0, policy_version 465981 (0.00089) [2022-07-09 23:23:35,274][25689] Fps is (10 sec: 5673.0, 60 sec: 5679.5, 300 sec: 5689.9). Total num frames: 477164544. Throughput: 0: 5954.8. Samples: 477172006. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:35,274][25689] Avg episode reward: [(0, '-43.574')] [2022-07-09 23:23:36,767][26022] Updated weights on worker 0-0, policy_version 465991 (0.00085) [2022-07-09 23:23:38,743][26022] Updated weights on worker 0-0, policy_version 466001 (0.00088) [2022-07-09 23:23:40,287][25689] Fps is (10 sec: 5722.1, 60 sec: 5679.7, 300 sec: 5689.8). Total num frames: 477194240. Throughput: 0: 5086.6. Samples: 477189144. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:40,287][25689] Avg episode reward: [(0, '-43.439')] [2022-07-09 23:23:40,383][26022] Updated weights on worker 0-0, policy_version 466011 (0.00086) [2022-07-09 23:23:40,794][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:23:40,807][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000466013_477197312.pth [2022-07-09 23:23:40,807][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000464012_475148288.pth [2022-07-09 23:23:42,174][26022] Updated weights on worker 0-0, policy_version 466021 (0.00083) [2022-07-09 23:23:43,975][26022] Updated weights on worker 0-0, policy_version 466031 (0.00086) [2022-07-09 23:23:45,311][25689] Fps is (10 sec: 5917.3, 60 sec: 5712.9, 300 sec: 5697.2). Total num frames: 477223936. Throughput: 0: 5952.8. Samples: 477223810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:45,311][25689] Avg episode reward: [(0, '-44.060')] [2022-07-09 23:23:45,649][26022] Updated weights on worker 0-0, policy_version 466041 (0.00092) [2022-07-09 23:23:47,503][26022] Updated weights on worker 0-0, policy_version 466051 (0.00081) [2022-07-09 23:23:49,429][26022] Updated weights on worker 0-0, policy_version 466061 (0.00088) [2022-07-09 23:23:50,391][25689] Fps is (10 sec: 5776.6, 60 sec: 5694.1, 300 sec: 5692.8). Total num frames: 477252608. Throughput: 0: 6003.6. Samples: 477258650. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:50,391][25689] Avg episode reward: [(0, '-44.907')] [2022-07-09 23:23:51,210][26022] Updated weights on worker 0-0, policy_version 466071 (0.00097) [2022-07-09 23:23:52,938][26022] Updated weights on worker 0-0, policy_version 466081 (0.00091) [2022-07-09 23:23:54,511][26022] Updated weights on worker 0-0, policy_version 466091 (0.00092) [2022-07-09 23:23:55,421][25689] Fps is (10 sec: 5671.8, 60 sec: 5676.5, 300 sec: 5692.5). Total num frames: 477281280. Throughput: 0: 5147.8. Samples: 477275720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:23:55,422][25689] Avg episode reward: [(0, '-44.573')] [2022-07-09 23:23:56,367][26022] Updated weights on worker 0-0, policy_version 466101 (0.00090) [2022-07-09 23:23:58,397][26022] Updated weights on worker 0-0, policy_version 466111 (0.00082) [2022-07-09 23:23:59,992][26022] Updated weights on worker 0-0, policy_version 466121 (0.00092) [2022-07-09 23:24:00,468][25689] Fps is (10 sec: 5589.0, 60 sec: 5673.1, 300 sec: 5699.9). Total num frames: 477308928. Throughput: 0: 5991.2. Samples: 477310054. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:00,468][25689] Avg episode reward: [(0, '-45.027')] [2022-07-09 23:24:02,183][26022] Updated weights on worker 0-0, policy_version 466131 (0.00909) [2022-07-09 23:24:03,983][26022] Updated weights on worker 0-0, policy_version 466141 (0.00087) [2022-07-09 23:24:05,487][25689] Fps is (10 sec: 5391.7, 60 sec: 5673.0, 300 sec: 5687.9). Total num frames: 477335552. Throughput: 0: 5892.7. Samples: 477342702. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:05,488][25689] Avg episode reward: [(0, '-44.963')] [2022-07-09 23:24:05,729][26022] Updated weights on worker 0-0, policy_version 466151 (0.00085) [2022-07-09 23:24:07,511][26022] Updated weights on worker 0-0, policy_version 466161 (0.00091) [2022-07-09 23:24:09,201][26022] Updated weights on worker 0-0, policy_version 466171 (0.00092) [2022-07-09 23:24:10,616][25689] Fps is (10 sec: 5751.7, 60 sec: 5705.7, 300 sec: 5697.1). Total num frames: 477367296. Throughput: 0: 5011.1. Samples: 477360000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:10,616][25689] Avg episode reward: [(0, '-45.106')] [2022-07-09 23:24:11,159][26022] Updated weights on worker 0-0, policy_version 466181 (0.00086) [2022-07-09 23:24:13,025][26022] Updated weights on worker 0-0, policy_version 466191 (0.00079) [2022-07-09 23:24:14,630][26022] Updated weights on worker 0-0, policy_version 466201 (0.00056) [2022-07-09 23:24:15,680][25689] Fps is (10 sec: 5826.6, 60 sec: 5701.9, 300 sec: 5692.7). Total num frames: 477394944. Throughput: 0: 5865.1. Samples: 477394542. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:15,681][25689] Avg episode reward: [(0, '-45.529')] [2022-07-09 23:24:16,516][26022] Updated weights on worker 0-0, policy_version 466211 (0.00084) [2022-07-09 23:24:18,381][26022] Updated weights on worker 0-0, policy_version 466221 (0.00084) [2022-07-09 23:24:19,938][26022] Updated weights on worker 0-0, policy_version 466231 (0.00063) [2022-07-09 23:24:20,700][25689] Fps is (10 sec: 5686.8, 60 sec: 5683.4, 300 sec: 5700.3). Total num frames: 477424640. Throughput: 0: 5888.0. Samples: 477429178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:20,700][25689] Avg episode reward: [(0, '-44.687')] [2022-07-09 23:24:21,812][26022] Updated weights on worker 0-0, policy_version 466241 (0.00086) [2022-07-09 23:24:23,439][26022] Updated weights on worker 0-0, policy_version 466251 (0.00097) [2022-07-09 23:24:25,497][26022] Updated weights on worker 0-0, policy_version 466261 (0.00084) [2022-07-09 23:24:25,715][25689] Fps is (10 sec: 5816.9, 60 sec: 5716.2, 300 sec: 5698.4). Total num frames: 477453312. Throughput: 0: 5126.0. Samples: 477446388. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:25,715][25689] Avg episode reward: [(0, '-45.366')] [2022-07-09 23:24:26,992][26022] Updated weights on worker 0-0, policy_version 466271 (0.00092) [2022-07-09 23:24:29,087][26022] Updated weights on worker 0-0, policy_version 466281 (0.00086) [2022-07-09 23:24:30,558][26022] Updated weights on worker 0-0, policy_version 466291 (0.00086) [2022-07-09 23:24:30,763][25689] Fps is (10 sec: 5800.1, 60 sec: 5721.1, 300 sec: 5698.1). Total num frames: 477483008. Throughput: 0: 5979.0. Samples: 477480458. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:30,763][25689] Avg episode reward: [(0, '-45.405')] [2022-07-09 23:24:32,539][26022] Updated weights on worker 0-0, policy_version 466301 (0.00092) [2022-07-09 23:24:34,195][26022] Updated weights on worker 0-0, policy_version 466311 (0.00086) [2022-07-09 23:24:35,779][25689] Fps is (10 sec: 5697.9, 60 sec: 5720.4, 300 sec: 5701.6). Total num frames: 477510656. Throughput: 0: 6001.2. Samples: 477515156. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:35,779][25689] Avg episode reward: [(0, '-44.926')] [2022-07-09 23:24:36,213][26022] Updated weights on worker 0-0, policy_version 466321 (0.00092) [2022-07-09 23:24:37,662][26022] Updated weights on worker 0-0, policy_version 466331 (0.00085) [2022-07-09 23:24:39,810][26022] Updated weights on worker 0-0, policy_version 466341 (0.00090) [2022-07-09 23:24:40,798][25689] Fps is (10 sec: 5714.1, 60 sec: 5719.8, 300 sec: 5704.8). Total num frames: 477540352. Throughput: 0: 5134.0. Samples: 477532364. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:40,799][25689] Avg episode reward: [(0, '-44.610')] [2022-07-09 23:24:41,117][26022] Updated weights on worker 0-0, policy_version 466351 (0.00084) [2022-07-09 23:24:43,458][26022] Updated weights on worker 0-0, policy_version 466361 (0.00087) [2022-07-09 23:24:44,716][26022] Updated weights on worker 0-0, policy_version 466371 (0.00091) [2022-07-09 23:24:45,804][25689] Fps is (10 sec: 5719.6, 60 sec: 5687.6, 300 sec: 5696.7). Total num frames: 477568000. Throughput: 0: 6015.5. Samples: 477567238. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:45,805][25689] Avg episode reward: [(0, '-45.189')] [2022-07-09 23:24:46,799][26022] Updated weights on worker 0-0, policy_version 466381 (0.00083) [2022-07-09 23:24:48,383][26022] Updated weights on worker 0-0, policy_version 466391 (0.00085) [2022-07-09 23:24:50,155][26022] Updated weights on worker 0-0, policy_version 466401 (0.00096) [2022-07-09 23:24:50,927][25689] Fps is (10 sec: 5661.6, 60 sec: 5700.5, 300 sec: 5704.8). Total num frames: 477597696. Throughput: 0: 6021.9. Samples: 477601884. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:50,927][25689] Avg episode reward: [(0, '-44.903')] [2022-07-09 23:24:52,078][26022] Updated weights on worker 0-0, policy_version 466411 (0.00088) [2022-07-09 23:24:53,834][26022] Updated weights on worker 0-0, policy_version 466421 (0.00094) [2022-07-09 23:24:55,665][26022] Updated weights on worker 0-0, policy_version 466431 (0.00083) [2022-07-09 23:24:55,948][25689] Fps is (10 sec: 5955.9, 60 sec: 5735.2, 300 sec: 5708.2). Total num frames: 477628416. Throughput: 0: 5999.5. Samples: 477636164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:24:55,949][25689] Avg episode reward: [(0, '-45.468')] [2022-07-09 23:24:57,555][26022] Updated weights on worker 0-0, policy_version 466441 (0.00090) [2022-07-09 23:24:59,071][26022] Updated weights on worker 0-0, policy_version 466451 (0.00081) [2022-07-09 23:25:00,951][25689] Fps is (10 sec: 5720.8, 60 sec: 5722.5, 300 sec: 5705.2). Total num frames: 477655040. Throughput: 0: 6005.2. Samples: 477653382. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:25:00,951][25689] Avg episode reward: [(0, '-45.662')] [2022-07-09 23:25:01,080][26022] Updated weights on worker 0-0, policy_version 466461 (0.00086) [2022-07-09 23:25:02,928][26022] Updated weights on worker 0-0, policy_version 466471 (0.00085) [2022-07-09 23:25:04,958][26022] Updated weights on worker 0-0, policy_version 466481 (0.00082) [2022-07-09 23:25:05,998][25689] Fps is (10 sec: 5502.1, 60 sec: 5753.6, 300 sec: 5706.7). Total num frames: 477683712. Throughput: 0: 5890.2. Samples: 477686184. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:25:05,999][25689] Avg episode reward: [(0, '-46.027')] [2022-07-09 23:25:06,660][26022] Updated weights on worker 0-0, policy_version 466491 (0.00085) [2022-07-09 23:25:08,523][26022] Updated weights on worker 0-0, policy_version 466501 (0.00088) [2022-07-09 23:25:10,272][26022] Updated weights on worker 0-0, policy_version 466511 (0.00056) [2022-07-09 23:25:11,089][25689] Fps is (10 sec: 5656.3, 60 sec: 5706.5, 300 sec: 5705.6). Total num frames: 477712384. Throughput: 0: 5875.3. Samples: 477720342. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:25:11,089][25689] Avg episode reward: [(0, '-45.956')] [2022-07-09 23:25:12,135][26022] Updated weights on worker 0-0, policy_version 466521 (0.00097) [2022-07-09 23:25:13,954][26022] Updated weights on worker 0-0, policy_version 466531 (0.00093) [2022-07-09 23:25:15,664][26022] Updated weights on worker 0-0, policy_version 466541 (0.00096) [2022-07-09 23:25:16,109][25689] Fps is (10 sec: 5570.5, 60 sec: 5710.7, 300 sec: 5701.9). Total num frames: 477740032. Throughput: 0: 5026.7. Samples: 477737504. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:25:16,110][25689] Avg episode reward: [(0, '-45.688')] [2022-07-09 23:25:17,298][26022] Updated weights on worker 0-0, policy_version 466551 (0.00091) [2022-07-09 23:25:19,172][26022] Updated weights on worker 0-0, policy_version 466561 (0.00085) [2022-07-09 23:25:21,058][26022] Updated weights on worker 0-0, policy_version 466571 (0.00242) [2022-07-09 23:25:21,117][25689] Fps is (10 sec: 5718.5, 60 sec: 5711.8, 300 sec: 5705.5). Total num frames: 477769728. Throughput: 0: 5886.4. Samples: 477772086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-09 23:25:21,117][25689] Avg episode reward: [(0, '-45.295')] [2022-07-09 23:25:22,670][26022] Updated weights on worker 0-0, policy_version 466581 (0.00087) [2022-07-09 23:25:24,641][26022] Updated weights on worker 0-0, policy_version 466591 (0.00084) [2022-07-09 23:25:26,195][25689] Fps is (10 sec: 5685.6, 60 sec: 5688.9, 300 sec: 5695.0). Total num frames: 477797376. Throughput: 0: 5935.2. Samples: 477806052. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:26,195][25689] Avg episode reward: [(0, '-43.448')] [2022-07-09 23:25:26,516][26022] Updated weights on worker 0-0, policy_version 466601 (0.00092) [2022-07-09 23:25:28,477][26022] Updated weights on worker 0-0, policy_version 466611 (0.00090) [2022-07-09 23:25:29,982][26022] Updated weights on worker 0-0, policy_version 466621 (0.00086) [2022-07-09 23:25:31,263][25689] Fps is (10 sec: 5550.7, 60 sec: 5670.1, 300 sec: 5704.6). Total num frames: 477826048. Throughput: 0: 5090.3. Samples: 477823034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:31,263][25689] Avg episode reward: [(0, '-43.485')] [2022-07-09 23:25:31,701][26022] Updated weights on worker 0-0, policy_version 466631 (0.00093) [2022-07-09 23:25:33,615][26022] Updated weights on worker 0-0, policy_version 466641 (0.00089) [2022-07-09 23:25:35,343][26022] Updated weights on worker 0-0, policy_version 466651 (0.00086) [2022-07-09 23:25:36,277][25689] Fps is (10 sec: 5687.2, 60 sec: 5687.1, 300 sec: 5694.4). Total num frames: 477854720. Throughput: 0: 5954.4. Samples: 477857598. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:36,278][25689] Avg episode reward: [(0, '-42.671')] [2022-07-09 23:25:37,239][26022] Updated weights on worker 0-0, policy_version 466661 (0.00088) [2022-07-09 23:25:38,983][26022] Updated weights on worker 0-0, policy_version 466671 (0.00088) [2022-07-09 23:25:40,609][26022] Updated weights on worker 0-0, policy_version 466681 (0.00086) [2022-07-09 23:25:40,857][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:25:40,870][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000466682_477882368.pth [2022-07-09 23:25:40,871][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000464674_475826176.pth [2022-07-09 23:25:41,313][25689] Fps is (10 sec: 5807.5, 60 sec: 5685.6, 300 sec: 5704.7). Total num frames: 477884416. Throughput: 0: 5939.5. Samples: 477892046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:41,323][25689] Avg episode reward: [(0, '-43.005')] [2022-07-09 23:25:42,474][26022] Updated weights on worker 0-0, policy_version 466691 (0.00084) [2022-07-09 23:25:44,405][26022] Updated weights on worker 0-0, policy_version 466701 (0.00081) [2022-07-09 23:25:45,980][26022] Updated weights on worker 0-0, policy_version 466711 (0.00091) [2022-07-09 23:25:46,331][25689] Fps is (10 sec: 5907.7, 60 sec: 5718.4, 300 sec: 5703.2). Total num frames: 477914112. Throughput: 0: 5134.6. Samples: 477909444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:46,331][25689] Avg episode reward: [(0, '-43.137')] [2022-07-09 23:25:48,013][26022] Updated weights on worker 0-0, policy_version 466721 (0.00086) [2022-07-09 23:25:49,404][26022] Updated weights on worker 0-0, policy_version 466731 (0.00083) [2022-07-09 23:25:51,425][25689] Fps is (10 sec: 5670.9, 60 sec: 5687.1, 300 sec: 5706.1). Total num frames: 477941760. Throughput: 0: 6013.6. Samples: 477944282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:51,426][25689] Avg episode reward: [(0, '-43.655')] [2022-07-09 23:25:51,539][26022] Updated weights on worker 0-0, policy_version 466741 (0.00084) [2022-07-09 23:25:53,153][26022] Updated weights on worker 0-0, policy_version 466751 (0.00090) [2022-07-09 23:25:54,979][26022] Updated weights on worker 0-0, policy_version 466761 (0.00055) [2022-07-09 23:25:56,427][25689] Fps is (10 sec: 5578.1, 60 sec: 5655.1, 300 sec: 5699.5). Total num frames: 477970432. Throughput: 0: 6004.9. Samples: 477978596. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:25:56,428][25689] Avg episode reward: [(0, '-44.798')] [2022-07-09 23:25:56,813][26022] Updated weights on worker 0-0, policy_version 466771 (0.00088) [2022-07-09 23:25:58,375][26022] Updated weights on worker 0-0, policy_version 466781 (0.00091) [2022-07-09 23:26:00,372][26022] Updated weights on worker 0-0, policy_version 466791 (0.00089) [2022-07-09 23:26:01,447][25689] Fps is (10 sec: 5824.5, 60 sec: 5704.3, 300 sec: 5709.5). Total num frames: 478000128. Throughput: 0: 5155.2. Samples: 477995836. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:01,447][25689] Avg episode reward: [(0, '-44.740')] [2022-07-09 23:26:02,709][26022] Updated weights on worker 0-0, policy_version 466801 (0.00081) [2022-07-09 23:26:04,192][26022] Updated weights on worker 0-0, policy_version 466811 (0.00091) [2022-07-09 23:26:06,171][26022] Updated weights on worker 0-0, policy_version 466821 (0.00086) [2022-07-09 23:26:06,470][25689] Fps is (10 sec: 5709.8, 60 sec: 5689.6, 300 sec: 5707.5). Total num frames: 478027776. Throughput: 0: 5912.4. Samples: 478028518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:06,471][25689] Avg episode reward: [(0, '-45.231')] [2022-07-09 23:26:07,886][26022] Updated weights on worker 0-0, policy_version 466831 (0.00081) [2022-07-09 23:26:09,653][26022] Updated weights on worker 0-0, policy_version 466841 (0.00088) [2022-07-09 23:26:11,454][26022] Updated weights on worker 0-0, policy_version 466851 (0.00091) [2022-07-09 23:26:11,524][25689] Fps is (10 sec: 5487.0, 60 sec: 5676.1, 300 sec: 5697.1). Total num frames: 478055424. Throughput: 0: 5899.3. Samples: 478062850. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:11,525][25689] Avg episode reward: [(0, '-45.143')] [2022-07-09 23:26:13,151][26022] Updated weights on worker 0-0, policy_version 466861 (0.00099) [2022-07-09 23:26:14,959][26022] Updated weights on worker 0-0, policy_version 466871 (0.00086) [2022-07-09 23:26:16,528][25689] Fps is (10 sec: 5599.7, 60 sec: 5694.6, 300 sec: 5698.9). Total num frames: 478084096. Throughput: 0: 5049.5. Samples: 478080096. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:16,530][25689] Avg episode reward: [(0, '-45.260')] [2022-07-09 23:26:16,623][26022] Updated weights on worker 0-0, policy_version 466881 (0.00086) [2022-07-09 23:26:18,559][26022] Updated weights on worker 0-0, policy_version 466891 (0.00089) [2022-07-09 23:26:20,283][26022] Updated weights on worker 0-0, policy_version 466901 (0.00098) [2022-07-09 23:26:21,551][25689] Fps is (10 sec: 5719.0, 60 sec: 5676.2, 300 sec: 5699.2). Total num frames: 478112768. Throughput: 0: 5911.9. Samples: 478114692. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:21,552][25689] Avg episode reward: [(0, '-44.978')] [2022-07-09 23:26:22,226][26022] Updated weights on worker 0-0, policy_version 466911 (0.00084) [2022-07-09 23:26:23,825][26022] Updated weights on worker 0-0, policy_version 466921 (0.00092) [2022-07-09 23:26:25,651][26022] Updated weights on worker 0-0, policy_version 466931 (0.00084) [2022-07-09 23:26:26,554][25689] Fps is (10 sec: 5719.5, 60 sec: 5700.2, 300 sec: 5696.7). Total num frames: 478141440. Throughput: 0: 5990.3. Samples: 478148826. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:26,557][25689] Avg episode reward: [(0, '-44.939')] [2022-07-09 23:26:27,603][26022] Updated weights on worker 0-0, policy_version 466941 (0.00087) [2022-07-09 23:26:29,230][26022] Updated weights on worker 0-0, policy_version 466951 (0.00083) [2022-07-09 23:26:31,184][26022] Updated weights on worker 0-0, policy_version 466961 (0.00085) [2022-07-09 23:26:31,689][25689] Fps is (10 sec: 5757.6, 60 sec: 5710.9, 300 sec: 5701.8). Total num frames: 478171136. Throughput: 0: 5111.0. Samples: 478165910. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:31,689][25689] Avg episode reward: [(0, '-45.382')] [2022-07-09 23:26:32,960][26022] Updated weights on worker 0-0, policy_version 466971 (0.00093) [2022-07-09 23:26:34,878][26022] Updated weights on worker 0-0, policy_version 466981 (0.00051) [2022-07-09 23:26:36,376][26022] Updated weights on worker 0-0, policy_version 466991 (0.00088) [2022-07-09 23:26:36,717][25689] Fps is (10 sec: 5743.5, 60 sec: 5709.7, 300 sec: 5694.7). Total num frames: 478199808. Throughput: 0: 5947.8. Samples: 478200172. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:36,717][25689] Avg episode reward: [(0, '-44.625')] [2022-07-09 23:26:38,332][26022] Updated weights on worker 0-0, policy_version 467001 (0.00092) [2022-07-09 23:26:39,967][26022] Updated weights on worker 0-0, policy_version 467011 (0.00086) [2022-07-09 23:26:41,736][25689] Fps is (10 sec: 5707.6, 60 sec: 5694.3, 300 sec: 5701.5). Total num frames: 478228480. Throughput: 0: 5947.5. Samples: 478234740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:41,736][25689] Avg episode reward: [(0, '-44.842')] [2022-07-09 23:26:41,858][26022] Updated weights on worker 0-0, policy_version 467021 (0.00094) [2022-07-09 23:26:43,527][26022] Updated weights on worker 0-0, policy_version 467031 (0.00092) [2022-07-09 23:26:45,544][26022] Updated weights on worker 0-0, policy_version 467041 (0.00068) [2022-07-09 23:26:46,739][25689] Fps is (10 sec: 5823.5, 60 sec: 5695.6, 300 sec: 5698.9). Total num frames: 478258176. Throughput: 0: 5111.3. Samples: 478251998. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:46,741][25689] Avg episode reward: [(0, '-44.057')] [2022-07-09 23:26:47,187][26022] Updated weights on worker 0-0, policy_version 467051 (0.00097) [2022-07-09 23:26:49,144][26022] Updated weights on worker 0-0, policy_version 467061 (0.00090) [2022-07-09 23:26:50,564][26022] Updated weights on worker 0-0, policy_version 467071 (0.00085) [2022-07-09 23:26:51,862][25689] Fps is (10 sec: 5663.1, 60 sec: 5693.0, 300 sec: 5693.9). Total num frames: 478285824. Throughput: 0: 5985.0. Samples: 478286644. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:51,862][25689] Avg episode reward: [(0, '-44.694')] [2022-07-09 23:26:52,557][26022] Updated weights on worker 0-0, policy_version 467081 (0.00093) [2022-07-09 23:26:54,233][26022] Updated weights on worker 0-0, policy_version 467091 (0.00084) [2022-07-09 23:26:56,085][26022] Updated weights on worker 0-0, policy_version 467101 (0.00095) [2022-07-09 23:26:56,875][25689] Fps is (10 sec: 5758.8, 60 sec: 5725.8, 300 sec: 5698.2). Total num frames: 478316544. Throughput: 0: 6007.0. Samples: 478321264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:26:56,876][25689] Avg episode reward: [(0, '-44.713')] [2022-07-09 23:26:57,826][26022] Updated weights on worker 0-0, policy_version 467111 (0.00103) [2022-07-09 23:26:59,639][26022] Updated weights on worker 0-0, policy_version 467121 (0.00086) [2022-07-09 23:27:01,350][26022] Updated weights on worker 0-0, policy_version 467131 (0.00089) [2022-07-09 23:27:01,927][25689] Fps is (10 sec: 5900.8, 60 sec: 5705.8, 300 sec: 5707.7). Total num frames: 478345216. Throughput: 0: 5145.9. Samples: 478338642. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:01,927][25689] Avg episode reward: [(0, '-44.912')] [2022-07-09 23:27:03,551][26022] Updated weights on worker 0-0, policy_version 467141 (0.00086) [2022-07-09 23:27:05,194][26022] Updated weights on worker 0-0, policy_version 467151 (0.00049) [2022-07-09 23:27:06,930][25689] Fps is (10 sec: 5499.2, 60 sec: 5690.8, 300 sec: 5695.0). Total num frames: 478371840. Throughput: 0: 5910.3. Samples: 478371332. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:06,931][25689] Avg episode reward: [(0, '-44.777')] [2022-07-09 23:27:07,187][26022] Updated weights on worker 0-0, policy_version 467161 (0.00092) [2022-07-09 23:27:08,732][26022] Updated weights on worker 0-0, policy_version 467171 (0.00079) [2022-07-09 23:27:10,689][26022] Updated weights on worker 0-0, policy_version 467181 (0.00085) [2022-07-09 23:27:12,017][25689] Fps is (10 sec: 5581.7, 60 sec: 5721.6, 300 sec: 5700.9). Total num frames: 478401536. Throughput: 0: 5916.9. Samples: 478405900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:12,017][25689] Avg episode reward: [(0, '-44.545')] [2022-07-09 23:27:12,254][26022] Updated weights on worker 0-0, policy_version 467191 (0.00089) [2022-07-09 23:27:14,232][26022] Updated weights on worker 0-0, policy_version 467201 (0.00496) [2022-07-09 23:27:15,770][26022] Updated weights on worker 0-0, policy_version 467211 (0.00084) [2022-07-09 23:27:17,085][25689] Fps is (10 sec: 5647.2, 60 sec: 5698.6, 300 sec: 5697.1). Total num frames: 478429184. Throughput: 0: 5044.1. Samples: 478423204. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:17,085][25689] Avg episode reward: [(0, '-44.988')] [2022-07-09 23:27:17,867][26022] Updated weights on worker 0-0, policy_version 467221 (0.00099) [2022-07-09 23:27:19,463][26022] Updated weights on worker 0-0, policy_version 467231 (0.00085) [2022-07-09 23:27:21,390][26022] Updated weights on worker 0-0, policy_version 467241 (0.00090) [2022-07-09 23:27:22,086][25689] Fps is (10 sec: 5695.1, 60 sec: 5717.6, 300 sec: 5697.3). Total num frames: 478458880. Throughput: 0: 5898.1. Samples: 478457542. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:22,086][25689] Avg episode reward: [(0, '-44.140')] [2022-07-09 23:27:22,959][26022] Updated weights on worker 0-0, policy_version 467251 (0.00095) [2022-07-09 23:27:24,983][26022] Updated weights on worker 0-0, policy_version 467261 (0.00107) [2022-07-09 23:27:26,711][26022] Updated weights on worker 0-0, policy_version 467271 (0.00552) [2022-07-09 23:27:27,166][25689] Fps is (10 sec: 5789.5, 60 sec: 5710.3, 300 sec: 5700.5). Total num frames: 478487552. Throughput: 0: 5964.5. Samples: 478492030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:27,167][25689] Avg episode reward: [(0, '-43.788')] [2022-07-09 23:27:28,642][26022] Updated weights on worker 0-0, policy_version 467281 (0.00086) [2022-07-09 23:27:30,179][26022] Updated weights on worker 0-0, policy_version 467291 (0.00089) [2022-07-09 23:27:32,217][25689] Fps is (10 sec: 5458.0, 60 sec: 5667.4, 300 sec: 5692.7). Total num frames: 478514176. Throughput: 0: 5934.9. Samples: 478525786. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:32,217][25689] Avg episode reward: [(0, '-44.603')] [2022-07-09 23:27:32,339][26022] Updated weights on worker 0-0, policy_version 467301 (0.00084) [2022-07-09 23:27:33,873][26022] Updated weights on worker 0-0, policy_version 467311 (0.00085) [2022-07-09 23:27:35,918][26022] Updated weights on worker 0-0, policy_version 467321 (0.00093) [2022-07-09 23:27:37,311][25689] Fps is (10 sec: 5652.5, 60 sec: 5695.0, 300 sec: 5694.6). Total num frames: 478544896. Throughput: 0: 5921.2. Samples: 478542970. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:37,313][25689] Avg episode reward: [(0, '-44.976')] [2022-07-09 23:27:37,412][26022] Updated weights on worker 0-0, policy_version 467331 (0.00091) [2022-07-09 23:27:39,387][26022] Updated weights on worker 0-0, policy_version 467341 (0.00090) [2022-07-09 23:27:40,890][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:27:40,902][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000467350_478566400.pth [2022-07-09 23:27:40,904][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000465344_476512256.pth [2022-07-09 23:27:40,932][26022] Updated weights on worker 0-0, policy_version 467351 (0.00091) [2022-07-09 23:27:42,332][25689] Fps is (10 sec: 5972.9, 60 sec: 5711.8, 300 sec: 5701.4). Total num frames: 478574592. Throughput: 0: 5924.4. Samples: 478577488. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:42,333][25689] Avg episode reward: [(0, '-45.021')] [2022-07-09 23:27:42,829][26022] Updated weights on worker 0-0, policy_version 467361 (0.00084) [2022-07-09 23:27:44,531][26022] Updated weights on worker 0-0, policy_version 467371 (0.00085) [2022-07-09 23:27:46,423][26022] Updated weights on worker 0-0, policy_version 467381 (0.00083) [2022-07-09 23:27:47,367][25689] Fps is (10 sec: 5804.2, 60 sec: 5691.9, 300 sec: 5698.4). Total num frames: 478603264. Throughput: 0: 5955.5. Samples: 478612338. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:47,368][25689] Avg episode reward: [(0, '-45.816')] [2022-07-09 23:27:48,193][26022] Updated weights on worker 0-0, policy_version 467391 (0.00087) [2022-07-09 23:27:49,962][26022] Updated weights on worker 0-0, policy_version 467401 (0.00080) [2022-07-09 23:27:51,652][26022] Updated weights on worker 0-0, policy_version 467411 (0.00086) [2022-07-09 23:27:52,410][25689] Fps is (10 sec: 5791.3, 60 sec: 5733.2, 300 sec: 5698.0). Total num frames: 478632960. Throughput: 0: 5141.9. Samples: 478629620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:52,411][25689] Avg episode reward: [(0, '-46.261')] [2022-07-09 23:27:53,567][26022] Updated weights on worker 0-0, policy_version 467421 (0.00084) [2022-07-09 23:27:55,017][26022] Updated weights on worker 0-0, policy_version 467431 (0.00087) [2022-07-09 23:27:57,208][26022] Updated weights on worker 0-0, policy_version 467441 (0.00086) [2022-07-09 23:27:57,421][25689] Fps is (10 sec: 5805.4, 60 sec: 5699.6, 300 sec: 5701.5). Total num frames: 478661632. Throughput: 0: 6033.2. Samples: 478664298. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:27:57,422][25689] Avg episode reward: [(0, '-45.889')] [2022-07-09 23:27:58,624][26022] Updated weights on worker 0-0, policy_version 467451 (0.00088) [2022-07-09 23:28:00,702][26022] Updated weights on worker 0-0, policy_version 467461 (0.00086) [2022-07-09 23:28:02,510][25689] Fps is (10 sec: 5474.9, 60 sec: 5662.2, 300 sec: 5700.1). Total num frames: 478688256. Throughput: 0: 5936.1. Samples: 478697270. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:28:02,511][25689] Avg episode reward: [(0, '-45.813')] [2022-07-09 23:28:02,808][26022] Updated weights on worker 0-0, policy_version 467471 (0.00093) [2022-07-09 23:28:04,600][26022] Updated weights on worker 0-0, policy_version 467481 (0.00095) [2022-07-09 23:28:06,391][26022] Updated weights on worker 0-0, policy_version 467491 (0.00098) [2022-07-09 23:28:07,532][25689] Fps is (10 sec: 5570.3, 60 sec: 5711.2, 300 sec: 5701.9). Total num frames: 478717952. Throughput: 0: 5038.3. Samples: 478713934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-09 23:28:07,533][25689] Avg episode reward: [(0, '-45.767')] [2022-07-09 23:28:08,144][26022] Updated weights on worker 0-0, policy_version 467501 (0.00088) [2022-07-09 23:28:09,985][26022] Updated weights on worker 0-0, policy_version 467511 (0.00085) [2022-07-09 23:28:11,795][26022] Updated weights on worker 0-0, policy_version 467521 (0.00082) [2022-07-09 23:28:12,667][25689] Fps is (10 sec: 5746.7, 60 sec: 5689.8, 300 sec: 5703.3). Total num frames: 478746624. Throughput: 0: 5862.2. Samples: 478748368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:12,668][25689] Avg episode reward: [(0, '-45.966')] [2022-07-09 23:28:13,520][26022] Updated weights on worker 0-0, policy_version 467531 (0.00094) [2022-07-09 23:28:15,223][26022] Updated weights on worker 0-0, policy_version 467541 (0.00095) [2022-07-09 23:28:17,112][26022] Updated weights on worker 0-0, policy_version 467551 (0.00094) [2022-07-09 23:28:17,702][25689] Fps is (10 sec: 5538.3, 60 sec: 5692.9, 300 sec: 5692.3). Total num frames: 478774272. Throughput: 0: 5845.9. Samples: 478782854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:17,702][25689] Avg episode reward: [(0, '-45.291')] [2022-07-09 23:28:18,791][26022] Updated weights on worker 0-0, policy_version 467561 (0.00087) [2022-07-09 23:28:20,895][26022] Updated weights on worker 0-0, policy_version 467571 (0.00479) [2022-07-09 23:28:22,352][26022] Updated weights on worker 0-0, policy_version 467581 (0.00088) [2022-07-09 23:28:22,753][25689] Fps is (10 sec: 5787.0, 60 sec: 5705.1, 300 sec: 5705.2). Total num frames: 478804992. Throughput: 0: 5078.8. Samples: 478800078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:22,754][25689] Avg episode reward: [(0, '-46.432')] [2022-07-09 23:28:24,368][26022] Updated weights on worker 0-0, policy_version 467591 (0.00084) [2022-07-09 23:28:26,044][26022] Updated weights on worker 0-0, policy_version 467601 (0.00091) [2022-07-09 23:28:27,785][25689] Fps is (10 sec: 5686.7, 60 sec: 5675.8, 300 sec: 5696.2). Total num frames: 478831616. Throughput: 0: 5944.7. Samples: 478834332. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:27,786][25689] Avg episode reward: [(0, '-45.903')] [2022-07-09 23:28:28,011][26022] Updated weights on worker 0-0, policy_version 467611 (0.00088) [2022-07-09 23:28:29,761][26022] Updated weights on worker 0-0, policy_version 467621 (0.00081) [2022-07-09 23:28:31,458][26022] Updated weights on worker 0-0, policy_version 467631 (0.00090) [2022-07-09 23:28:32,950][25689] Fps is (10 sec: 5523.0, 60 sec: 5715.7, 300 sec: 5700.0). Total num frames: 478861312. Throughput: 0: 5918.6. Samples: 478868416. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:32,951][25689] Avg episode reward: [(0, '-45.675')] [2022-07-09 23:28:33,250][26022] Updated weights on worker 0-0, policy_version 467641 (0.00092) [2022-07-09 23:28:35,071][26022] Updated weights on worker 0-0, policy_version 467651 (0.00096) [2022-07-09 23:28:36,856][26022] Updated weights on worker 0-0, policy_version 467661 (0.00100) [2022-07-09 23:28:37,977][25689] Fps is (10 sec: 5827.6, 60 sec: 5705.3, 300 sec: 5699.8). Total num frames: 478891008. Throughput: 0: 5070.3. Samples: 478885656. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:37,977][25689] Avg episode reward: [(0, '-46.018')] [2022-07-09 23:28:38,830][26022] Updated weights on worker 0-0, policy_version 467671 (0.00089) [2022-07-09 23:28:40,511][26022] Updated weights on worker 0-0, policy_version 467681 (0.00087) [2022-07-09 23:28:42,267][26022] Updated weights on worker 0-0, policy_version 467691 (0.00093) [2022-07-09 23:28:42,982][25689] Fps is (10 sec: 5716.1, 60 sec: 5672.9, 300 sec: 5693.3). Total num frames: 478918656. Throughput: 0: 5908.2. Samples: 478919596. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:42,983][25689] Avg episode reward: [(0, '-47.282')] [2022-07-09 23:28:44,090][26022] Updated weights on worker 0-0, policy_version 467701 (0.00100) [2022-07-09 23:28:45,717][26022] Updated weights on worker 0-0, policy_version 467711 (0.00086) [2022-07-09 23:28:47,755][26022] Updated weights on worker 0-0, policy_version 467721 (0.00089) [2022-07-09 23:28:48,018][25689] Fps is (10 sec: 5710.7, 60 sec: 5689.7, 300 sec: 5697.5). Total num frames: 478948352. Throughput: 0: 5912.9. Samples: 478953968. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:48,019][25689] Avg episode reward: [(0, '-46.828')] [2022-07-09 23:28:49,514][26022] Updated weights on worker 0-0, policy_version 467731 (0.00089) [2022-07-09 23:28:51,268][26022] Updated weights on worker 0-0, policy_version 467741 (0.00094) [2022-07-09 23:28:53,122][25689] Fps is (10 sec: 5655.6, 60 sec: 5650.4, 300 sec: 5692.7). Total num frames: 478976000. Throughput: 0: 5093.1. Samples: 478971150. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:53,122][25689] Avg episode reward: [(0, '-46.453')] [2022-07-09 23:28:53,180][26022] Updated weights on worker 0-0, policy_version 467751 (0.00084) [2022-07-09 23:28:54,786][26022] Updated weights on worker 0-0, policy_version 467761 (0.00089) [2022-07-09 23:28:56,648][26022] Updated weights on worker 0-0, policy_version 467771 (0.00099) [2022-07-09 23:28:58,123][25689] Fps is (10 sec: 5675.3, 60 sec: 5668.2, 300 sec: 5700.5). Total num frames: 479005696. Throughput: 0: 5942.8. Samples: 479005378. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:28:58,126][25689] Avg episode reward: [(0, '-46.467')] [2022-07-09 23:28:58,537][26022] Updated weights on worker 0-0, policy_version 467781 (0.00056) [2022-07-09 23:29:00,163][26022] Updated weights on worker 0-0, policy_version 467791 (0.00087) [2022-07-09 23:29:02,379][26022] Updated weights on worker 0-0, policy_version 467801 (0.00091) [2022-07-09 23:29:03,128][25689] Fps is (10 sec: 5628.7, 60 sec: 5676.0, 300 sec: 5700.7). Total num frames: 479032320. Throughput: 0: 5847.6. Samples: 479037398. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:03,128][25689] Avg episode reward: [(0, '-46.529')] [2022-07-09 23:29:04,223][26022] Updated weights on worker 0-0, policy_version 467811 (0.00083) [2022-07-09 23:29:05,907][26022] Updated weights on worker 0-0, policy_version 467821 (0.00084) [2022-07-09 23:29:07,811][26022] Updated weights on worker 0-0, policy_version 467831 (0.00086) [2022-07-09 23:29:08,139][25689] Fps is (10 sec: 5418.5, 60 sec: 5643.2, 300 sec: 5689.2). Total num frames: 479059968. Throughput: 0: 5006.4. Samples: 479054698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:08,139][25689] Avg episode reward: [(0, '-46.133')] [2022-07-09 23:29:09,409][26022] Updated weights on worker 0-0, policy_version 467841 (0.00618) [2022-07-09 23:29:11,280][26022] Updated weights on worker 0-0, policy_version 467851 (0.00086) [2022-07-09 23:29:12,950][26022] Updated weights on worker 0-0, policy_version 467861 (0.00090) [2022-07-09 23:29:13,191][25689] Fps is (10 sec: 5698.2, 60 sec: 5667.8, 300 sec: 5696.3). Total num frames: 479089664. Throughput: 0: 5893.3. Samples: 479089424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:13,192][25689] Avg episode reward: [(0, '-45.359')] [2022-07-09 23:29:14,851][26022] Updated weights on worker 0-0, policy_version 467871 (0.00093) [2022-07-09 23:29:16,567][26022] Updated weights on worker 0-0, policy_version 467881 (0.00605) [2022-07-09 23:29:18,194][25689] Fps is (10 sec: 5906.7, 60 sec: 5704.7, 300 sec: 5696.7). Total num frames: 479119360. Throughput: 0: 5907.5. Samples: 479123948. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:18,194][25689] Avg episode reward: [(0, '-45.106')] [2022-07-09 23:29:18,244][26022] Updated weights on worker 0-0, policy_version 467891 (0.00089) [2022-07-09 23:29:20,078][26022] Updated weights on worker 0-0, policy_version 467901 (0.00083) [2022-07-09 23:29:21,929][26022] Updated weights on worker 0-0, policy_version 467911 (0.00092) [2022-07-09 23:29:23,204][25689] Fps is (10 sec: 5829.4, 60 sec: 5674.7, 300 sec: 5696.7). Total num frames: 479148032. Throughput: 0: 5171.3. Samples: 479141218. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:23,204][25689] Avg episode reward: [(0, '-45.147')] [2022-07-09 23:29:23,792][26022] Updated weights on worker 0-0, policy_version 467921 (0.00085) [2022-07-09 23:29:25,451][26022] Updated weights on worker 0-0, policy_version 467931 (0.00082) [2022-07-09 23:29:27,338][26022] Updated weights on worker 0-0, policy_version 467941 (0.00103) [2022-07-09 23:29:28,206][25689] Fps is (10 sec: 5522.9, 60 sec: 5677.5, 300 sec: 5687.3). Total num frames: 479174656. Throughput: 0: 6034.5. Samples: 479175794. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:28,206][25689] Avg episode reward: [(0, '-45.757')] [2022-07-09 23:29:29,080][26022] Updated weights on worker 0-0, policy_version 467951 (0.00092) [2022-07-09 23:29:31,000][26022] Updated weights on worker 0-0, policy_version 467961 (0.00093) [2022-07-09 23:29:32,704][26022] Updated weights on worker 0-0, policy_version 467971 (0.00087) [2022-07-09 23:29:33,243][25689] Fps is (10 sec: 5712.2, 60 sec: 5706.6, 300 sec: 5697.2). Total num frames: 479205376. Throughput: 0: 6010.9. Samples: 479209952. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:33,243][25689] Avg episode reward: [(0, '-45.664')] [2022-07-09 23:29:34,572][26022] Updated weights on worker 0-0, policy_version 467981 (0.00086) [2022-07-09 23:29:36,139][26022] Updated weights on worker 0-0, policy_version 467991 (0.00085) [2022-07-09 23:29:38,056][26022] Updated weights on worker 0-0, policy_version 468001 (0.00109) [2022-07-09 23:29:38,279][25689] Fps is (10 sec: 5794.2, 60 sec: 5671.7, 300 sec: 5690.0). Total num frames: 479233024. Throughput: 0: 5150.2. Samples: 479227394. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:38,280][25689] Avg episode reward: [(0, '-45.868')] [2022-07-09 23:29:39,843][26022] Updated weights on worker 0-0, policy_version 468011 (0.00090) [2022-07-09 23:29:40,977][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:29:40,990][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000468018_479250432.pth [2022-07-09 23:29:40,990][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000466013_477197312.pth [2022-07-09 23:29:41,619][26022] Updated weights on worker 0-0, policy_version 468021 (0.00084) [2022-07-09 23:29:43,287][25689] Fps is (10 sec: 5709.0, 60 sec: 5705.4, 300 sec: 5696.9). Total num frames: 479262720. Throughput: 0: 6016.6. Samples: 479262052. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:43,292][25689] Avg episode reward: [(0, '-46.565')] [2022-07-09 23:29:43,317][26022] Updated weights on worker 0-0, policy_version 468031 (0.00091) [2022-07-09 23:29:45,262][26022] Updated weights on worker 0-0, policy_version 468041 (0.00087) [2022-07-09 23:29:47,020][26022] Updated weights on worker 0-0, policy_version 468051 (0.00093) [2022-07-09 23:29:48,300][25689] Fps is (10 sec: 5926.9, 60 sec: 5707.6, 300 sec: 5699.0). Total num frames: 479292416. Throughput: 0: 6010.8. Samples: 479296576. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:48,301][25689] Avg episode reward: [(0, '-46.564')] [2022-07-09 23:29:48,728][26022] Updated weights on worker 0-0, policy_version 468061 (0.00085) [2022-07-09 23:29:50,457][26022] Updated weights on worker 0-0, policy_version 468071 (0.00086) [2022-07-09 23:29:52,282][26022] Updated weights on worker 0-0, policy_version 468081 (0.00091) [2022-07-09 23:29:53,382][25689] Fps is (10 sec: 5782.2, 60 sec: 5726.7, 300 sec: 5690.9). Total num frames: 479321088. Throughput: 0: 5152.8. Samples: 479313724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:53,382][25689] Avg episode reward: [(0, '-46.906')] [2022-07-09 23:29:54,027][26022] Updated weights on worker 0-0, policy_version 468091 (0.00082) [2022-07-09 23:29:55,926][26022] Updated weights on worker 0-0, policy_version 468101 (0.00086) [2022-07-09 23:29:57,588][26022] Updated weights on worker 0-0, policy_version 468111 (0.00094) [2022-07-09 23:29:58,453][25689] Fps is (10 sec: 5647.8, 60 sec: 5702.9, 300 sec: 5696.5). Total num frames: 479349760. Throughput: 0: 5999.9. Samples: 479348436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:29:58,454][25689] Avg episode reward: [(0, '-46.660')] [2022-07-09 23:29:59,398][26022] Updated weights on worker 0-0, policy_version 468121 (0.00091) [2022-07-09 23:30:01,487][26022] Updated weights on worker 0-0, policy_version 468132 (0.00095) [2022-07-09 23:30:03,460][25689] Fps is (10 sec: 5486.8, 60 sec: 5702.8, 300 sec: 5690.4). Total num frames: 479376384. Throughput: 0: 5870.2. Samples: 479380468. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:03,460][25689] Avg episode reward: [(0, '-46.417')] [2022-07-09 23:30:03,572][26022] Updated weights on worker 0-0, policy_version 468142 (0.00093) [2022-07-09 23:30:05,349][26022] Updated weights on worker 0-0, policy_version 468152 (0.00089) [2022-07-09 23:30:07,187][26022] Updated weights on worker 0-0, policy_version 468162 (0.00337) [2022-07-09 23:30:08,477][25689] Fps is (10 sec: 5516.8, 60 sec: 5719.3, 300 sec: 5691.8). Total num frames: 479405056. Throughput: 0: 5861.2. Samples: 479414834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:08,477][25689] Avg episode reward: [(0, '-47.300')] [2022-07-09 23:30:08,818][26022] Updated weights on worker 0-0, policy_version 468172 (0.00093) [2022-07-09 23:30:10,765][26022] Updated weights on worker 0-0, policy_version 468182 (0.00079) [2022-07-09 23:30:12,426][26022] Updated weights on worker 0-0, policy_version 468192 (0.00088) [2022-07-09 23:30:13,520][25689] Fps is (10 sec: 5700.0, 60 sec: 5703.1, 300 sec: 5694.8). Total num frames: 479433728. Throughput: 0: 5871.9. Samples: 479431974. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:13,522][25689] Avg episode reward: [(0, '-47.347')] [2022-07-09 23:30:14,228][26022] Updated weights on worker 0-0, policy_version 468202 (0.00085) [2022-07-09 23:30:15,999][26022] Updated weights on worker 0-0, policy_version 468212 (0.01087) [2022-07-09 23:30:17,794][26022] Updated weights on worker 0-0, policy_version 468222 (0.00082) [2022-07-09 23:30:18,538][25689] Fps is (10 sec: 5699.7, 60 sec: 5684.8, 300 sec: 5691.2). Total num frames: 479462400. Throughput: 0: 5891.5. Samples: 479466762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:18,538][25689] Avg episode reward: [(0, '-47.821')] [2022-07-09 23:30:19,562][26022] Updated weights on worker 0-0, policy_version 468232 (0.00082) [2022-07-09 23:30:21,594][26022] Updated weights on worker 0-0, policy_version 468242 (0.00082) [2022-07-09 23:30:23,039][26022] Updated weights on worker 0-0, policy_version 468252 (0.00086) [2022-07-09 23:30:23,592][25689] Fps is (10 sec: 5795.2, 60 sec: 5697.6, 300 sec: 5698.5). Total num frames: 479492096. Throughput: 0: 5996.4. Samples: 479501188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:23,593][25689] Avg episode reward: [(0, '-46.888')] [2022-07-09 23:30:25,196][26022] Updated weights on worker 0-0, policy_version 468262 (0.00089) [2022-07-09 23:30:26,840][26022] Updated weights on worker 0-0, policy_version 468272 (0.00092) [2022-07-09 23:30:28,597][26022] Updated weights on worker 0-0, policy_version 468282 (0.00087) [2022-07-09 23:30:28,683][25689] Fps is (10 sec: 5753.2, 60 sec: 5723.0, 300 sec: 5698.1). Total num frames: 479520768. Throughput: 0: 5109.0. Samples: 479518068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:28,683][25689] Avg episode reward: [(0, '-47.052')] [2022-07-09 23:30:30,510][26022] Updated weights on worker 0-0, policy_version 468292 (0.00089) [2022-07-09 23:30:32,022][26022] Updated weights on worker 0-0, policy_version 468302 (0.00084) [2022-07-09 23:30:33,801][25689] Fps is (10 sec: 5516.3, 60 sec: 5664.6, 300 sec: 5692.6). Total num frames: 479548416. Throughput: 0: 5936.3. Samples: 479552370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:33,802][25689] Avg episode reward: [(0, '-47.234')] [2022-07-09 23:30:34,194][26022] Updated weights on worker 0-0, policy_version 468312 (0.00089) [2022-07-09 23:30:35,545][26022] Updated weights on worker 0-0, policy_version 468322 (0.00093) [2022-07-09 23:30:37,571][26022] Updated weights on worker 0-0, policy_version 468332 (0.00089) [2022-07-09 23:30:38,855][25689] Fps is (10 sec: 5838.7, 60 sec: 5730.6, 300 sec: 5699.2). Total num frames: 479580160. Throughput: 0: 5924.2. Samples: 479587126. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:38,856][25689] Avg episode reward: [(0, '-47.184')] [2022-07-09 23:30:39,078][26022] Updated weights on worker 0-0, policy_version 468342 (0.00090) [2022-07-09 23:30:41,175][26022] Updated weights on worker 0-0, policy_version 468352 (0.00087) [2022-07-09 23:30:42,550][26022] Updated weights on worker 0-0, policy_version 468362 (0.00085) [2022-07-09 23:30:43,915][25689] Fps is (10 sec: 5872.5, 60 sec: 5691.9, 300 sec: 5691.5). Total num frames: 479607808. Throughput: 0: 5091.4. Samples: 479604662. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:43,916][25689] Avg episode reward: [(0, '-46.832')] [2022-07-09 23:30:44,586][26022] Updated weights on worker 0-0, policy_version 468372 (0.00089) [2022-07-09 23:30:46,224][26022] Updated weights on worker 0-0, policy_version 468382 (0.00080) [2022-07-09 23:30:48,175][26022] Updated weights on worker 0-0, policy_version 468392 (0.00092) [2022-07-09 23:30:49,012][25689] Fps is (10 sec: 5646.0, 60 sec: 5684.1, 300 sec: 5698.3). Total num frames: 479637504. Throughput: 0: 5969.7. Samples: 479639424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:49,012][25689] Avg episode reward: [(0, '-46.900')] [2022-07-09 23:30:49,916][26022] Updated weights on worker 0-0, policy_version 468402 (0.00099) [2022-07-09 23:30:51,700][26022] Updated weights on worker 0-0, policy_version 468412 (0.00086) [2022-07-09 23:30:53,509][26022] Updated weights on worker 0-0, policy_version 468422 (0.00086) [2022-07-09 23:30:54,128][25689] Fps is (10 sec: 5915.7, 60 sec: 5714.5, 300 sec: 5703.0). Total num frames: 479668224. Throughput: 0: 5965.3. Samples: 479673624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:54,128][25689] Avg episode reward: [(0, '-47.790')] [2022-07-09 23:30:55,428][26022] Updated weights on worker 0-0, policy_version 468432 (0.00088) [2022-07-09 23:30:56,954][26022] Updated weights on worker 0-0, policy_version 468442 (0.00091) [2022-07-09 23:30:59,082][26022] Updated weights on worker 0-0, policy_version 468452 (0.00086) [2022-07-09 23:30:59,178][25689] Fps is (10 sec: 5640.3, 60 sec: 5682.8, 300 sec: 5692.1). Total num frames: 479694848. Throughput: 0: 5085.3. Samples: 479690476. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:30:59,179][25689] Avg episode reward: [(0, '-48.732')] [2022-07-09 23:31:00,717][26022] Updated weights on worker 0-0, policy_version 468462 (0.00090) [2022-07-09 23:31:02,832][26022] Updated weights on worker 0-0, policy_version 468472 (0.00091) [2022-07-09 23:31:04,229][25689] Fps is (10 sec: 5373.0, 60 sec: 5695.5, 300 sec: 5691.6). Total num frames: 479722496. Throughput: 0: 5803.4. Samples: 479722550. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:04,229][25689] Avg episode reward: [(0, '-48.986')] [2022-07-09 23:31:04,885][26022] Updated weights on worker 0-0, policy_version 468482 (0.00084) [2022-07-09 23:31:06,356][26022] Updated weights on worker 0-0, policy_version 468492 (0.00092) [2022-07-09 23:31:08,491][26022] Updated weights on worker 0-0, policy_version 468502 (0.00084) [2022-07-09 23:31:09,307][25689] Fps is (10 sec: 5661.8, 60 sec: 5706.7, 300 sec: 5698.0). Total num frames: 479752192. Throughput: 0: 5782.5. Samples: 479756780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:09,307][25689] Avg episode reward: [(0, '-48.843')] [2022-07-09 23:31:10,117][26022] Updated weights on worker 0-0, policy_version 468512 (0.00087) [2022-07-09 23:31:11,824][26022] Updated weights on worker 0-0, policy_version 468522 (0.00084) [2022-07-09 23:31:13,735][26022] Updated weights on worker 0-0, policy_version 468532 (0.00086) [2022-07-09 23:31:14,375][25689] Fps is (10 sec: 5752.8, 60 sec: 5704.3, 300 sec: 5696.8). Total num frames: 479780864. Throughput: 0: 4960.9. Samples: 479774066. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:14,375][25689] Avg episode reward: [(0, '-48.497')] [2022-07-09 23:31:15,340][26022] Updated weights on worker 0-0, policy_version 468542 (0.00091) [2022-07-09 23:31:17,159][26022] Updated weights on worker 0-0, policy_version 468552 (0.00085) [2022-07-09 23:31:19,005][26022] Updated weights on worker 0-0, policy_version 468562 (0.00086) [2022-07-09 23:31:19,403][25689] Fps is (10 sec: 5679.8, 60 sec: 5703.3, 300 sec: 5696.7). Total num frames: 479809536. Throughput: 0: 5858.2. Samples: 479808954. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:19,403][25689] Avg episode reward: [(0, '-48.413')] [2022-07-09 23:31:20,695][26022] Updated weights on worker 0-0, policy_version 468572 (0.00082) [2022-07-09 23:31:22,709][26022] Updated weights on worker 0-0, policy_version 468582 (0.00087) [2022-07-09 23:31:24,304][26022] Updated weights on worker 0-0, policy_version 468592 (0.00080) [2022-07-09 23:31:24,419][25689] Fps is (10 sec: 5708.9, 60 sec: 5690.0, 300 sec: 5696.4). Total num frames: 479838208. Throughput: 0: 5985.1. Samples: 479843392. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:24,421][25689] Avg episode reward: [(0, '-46.903')] [2022-07-09 23:31:26,141][26022] Updated weights on worker 0-0, policy_version 468602 (0.00080) [2022-07-09 23:31:27,902][26022] Updated weights on worker 0-0, policy_version 468612 (0.00083) [2022-07-09 23:31:29,441][25689] Fps is (10 sec: 5712.8, 60 sec: 5696.6, 300 sec: 5695.2). Total num frames: 479866880. Throughput: 0: 5165.9. Samples: 479860790. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:29,442][25689] Avg episode reward: [(0, '-45.963')] [2022-07-09 23:31:29,715][26022] Updated weights on worker 0-0, policy_version 468622 (0.00082) [2022-07-09 23:31:31,485][26022] Updated weights on worker 0-0, policy_version 468632 (0.00087) [2022-07-09 23:31:33,279][26022] Updated weights on worker 0-0, policy_version 468642 (0.00087) [2022-07-09 23:31:34,515][25689] Fps is (10 sec: 5781.3, 60 sec: 5734.4, 300 sec: 5697.7). Total num frames: 479896576. Throughput: 0: 6019.0. Samples: 479895292. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:34,516][25689] Avg episode reward: [(0, '-45.187')] [2022-07-09 23:31:34,866][26022] Updated weights on worker 0-0, policy_version 468652 (0.00089) [2022-07-09 23:31:36,995][26022] Updated weights on worker 0-0, policy_version 468662 (0.00084) [2022-07-09 23:31:38,418][26022] Updated weights on worker 0-0, policy_version 468672 (0.00091) [2022-07-09 23:31:39,612][25689] Fps is (10 sec: 5738.4, 60 sec: 5679.8, 300 sec: 5696.2). Total num frames: 479925248. Throughput: 0: 5963.4. Samples: 479929470. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:39,614][25689] Avg episode reward: [(0, '-45.136')] [2022-07-09 23:31:40,547][26022] Updated weights on worker 0-0, policy_version 468682 (0.00080) [2022-07-09 23:31:41,123][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:31:41,135][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000468686_479934464.pth [2022-07-09 23:31:41,135][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000466682_477882368.pth [2022-07-09 23:31:42,162][26022] Updated weights on worker 0-0, policy_version 468692 (0.00090) [2022-07-09 23:31:44,107][26022] Updated weights on worker 0-0, policy_version 468702 (0.00098) [2022-07-09 23:31:44,693][25689] Fps is (10 sec: 5634.5, 60 sec: 5694.7, 300 sec: 5691.3). Total num frames: 479953920. Throughput: 0: 5942.4. Samples: 479963864. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:44,693][25689] Avg episode reward: [(0, '-45.093')] [2022-07-09 23:31:45,896][26022] Updated weights on worker 0-0, policy_version 468712 (0.00083) [2022-07-09 23:31:47,596][26022] Updated weights on worker 0-0, policy_version 468722 (0.00091) [2022-07-09 23:31:49,456][26022] Updated weights on worker 0-0, policy_version 468732 (0.00081) [2022-07-09 23:31:49,703][25689] Fps is (10 sec: 5682.8, 60 sec: 5685.9, 300 sec: 5696.9). Total num frames: 479982592. Throughput: 0: 5929.1. Samples: 479980926. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:49,703][25689] Avg episode reward: [(0, '-45.440')] [2022-07-09 23:31:51,369][26022] Updated weights on worker 0-0, policy_version 468742 (0.00082) [2022-07-09 23:31:53,027][26022] Updated weights on worker 0-0, policy_version 468752 (0.00085) [2022-07-09 23:31:54,799][25689] Fps is (10 sec: 5674.2, 60 sec: 5654.1, 300 sec: 5688.4). Total num frames: 480011264. Throughput: 0: 5911.7. Samples: 480015202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:54,799][25689] Avg episode reward: [(0, '-45.635')] [2022-07-09 23:31:54,832][26022] Updated weights on worker 0-0, policy_version 468762 (0.00093) [2022-07-09 23:31:56,496][26022] Updated weights on worker 0-0, policy_version 468772 (0.00084) [2022-07-09 23:31:58,387][26022] Updated weights on worker 0-0, policy_version 468782 (0.00088) [2022-07-09 23:31:59,817][25689] Fps is (10 sec: 5771.1, 60 sec: 5707.8, 300 sec: 5692.5). Total num frames: 480040960. Throughput: 0: 5961.2. Samples: 480049914. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:31:59,817][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 23:32:00,130][26022] Updated weights on worker 0-0, policy_version 468792 (0.00094) [2022-07-09 23:32:02,302][26022] Updated weights on worker 0-0, policy_version 468802 (0.00093) [2022-07-09 23:32:04,055][26022] Updated weights on worker 0-0, policy_version 468812 (0.00100) [2022-07-09 23:32:04,823][25689] Fps is (10 sec: 5414.2, 60 sec: 5661.3, 300 sec: 5685.6). Total num frames: 480065536. Throughput: 0: 5024.8. Samples: 480065014. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:04,823][25689] Avg episode reward: [(0, '-46.530')] [2022-07-09 23:32:06,048][26022] Updated weights on worker 0-0, policy_version 468822 (0.00085) [2022-07-09 23:32:07,467][26022] Updated weights on worker 0-0, policy_version 468832 (0.00084) [2022-07-09 23:32:09,470][26022] Updated weights on worker 0-0, policy_version 468842 (0.00084) [2022-07-09 23:32:09,839][25689] Fps is (10 sec: 5517.2, 60 sec: 5684.0, 300 sec: 5690.4). Total num frames: 480096256. Throughput: 0: 5887.5. Samples: 480099478. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:09,840][25689] Avg episode reward: [(0, '-45.638')] [2022-07-09 23:32:11,019][26022] Updated weights on worker 0-0, policy_version 468852 (0.00088) [2022-07-09 23:32:13,106][26022] Updated weights on worker 0-0, policy_version 468862 (0.00086) [2022-07-09 23:32:14,445][26022] Updated weights on worker 0-0, policy_version 468872 (0.00083) [2022-07-09 23:32:14,976][25689] Fps is (10 sec: 6051.3, 60 sec: 5711.3, 300 sec: 5699.4). Total num frames: 480126976. Throughput: 0: 5897.7. Samples: 480134200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:14,976][25689] Avg episode reward: [(0, '-44.888')] [2022-07-09 23:32:16,689][26022] Updated weights on worker 0-0, policy_version 468882 (0.00087) [2022-07-09 23:32:18,009][26022] Updated weights on worker 0-0, policy_version 468892 (0.00084) [2022-07-09 23:32:20,004][25689] Fps is (10 sec: 5641.4, 60 sec: 5677.5, 300 sec: 5688.5). Total num frames: 480153600. Throughput: 0: 5043.3. Samples: 480151724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:20,005][25689] Avg episode reward: [(0, '-44.881')] [2022-07-09 23:32:20,324][26022] Updated weights on worker 0-0, policy_version 468902 (0.00089) [2022-07-09 23:32:21,596][26022] Updated weights on worker 0-0, policy_version 468912 (0.00087) [2022-07-09 23:32:23,710][26022] Updated weights on worker 0-0, policy_version 468922 (0.00090) [2022-07-09 23:32:25,035][25689] Fps is (10 sec: 5700.6, 60 sec: 5709.9, 300 sec: 5696.4). Total num frames: 480184320. Throughput: 0: 6013.5. Samples: 480186562. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:25,036][25689] Avg episode reward: [(0, '-44.874')] [2022-07-09 23:32:25,186][26022] Updated weights on worker 0-0, policy_version 468932 (0.00084) [2022-07-09 23:32:27,324][26022] Updated weights on worker 0-0, policy_version 468942 (0.00090) [2022-07-09 23:32:28,871][26022] Updated weights on worker 0-0, policy_version 468952 (0.00084) [2022-07-09 23:32:30,082][25689] Fps is (10 sec: 5791.9, 60 sec: 5690.7, 300 sec: 5699.9). Total num frames: 480211968. Throughput: 0: 6007.6. Samples: 480221086. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:30,082][25689] Avg episode reward: [(0, '-44.446')] [2022-07-09 23:32:30,859][26022] Updated weights on worker 0-0, policy_version 468962 (0.00089) [2022-07-09 23:32:32,391][26022] Updated weights on worker 0-0, policy_version 468972 (0.00078) [2022-07-09 23:32:34,287][26022] Updated weights on worker 0-0, policy_version 468982 (0.00088) [2022-07-09 23:32:35,181][25689] Fps is (10 sec: 5652.1, 60 sec: 5688.4, 300 sec: 5696.3). Total num frames: 480241664. Throughput: 0: 5147.0. Samples: 480238194. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:35,181][25689] Avg episode reward: [(0, '-44.672')] [2022-07-09 23:32:36,082][26022] Updated weights on worker 0-0, policy_version 468992 (0.00093) [2022-07-09 23:32:37,776][26022] Updated weights on worker 0-0, policy_version 469002 (0.00090) [2022-07-09 23:32:39,792][26022] Updated weights on worker 0-0, policy_version 469012 (0.00085) [2022-07-09 23:32:40,220][25689] Fps is (10 sec: 5656.4, 60 sec: 5676.9, 300 sec: 5689.1). Total num frames: 480269312. Throughput: 0: 5970.7. Samples: 480272424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:40,220][25689] Avg episode reward: [(0, '-45.010')] [2022-07-09 23:32:41,472][26022] Updated weights on worker 0-0, policy_version 469022 (0.00082) [2022-07-09 23:32:43,550][26022] Updated weights on worker 0-0, policy_version 469032 (0.00090) [2022-07-09 23:32:45,009][26022] Updated weights on worker 0-0, policy_version 469042 (0.00095) [2022-07-09 23:32:45,227][25689] Fps is (10 sec: 5708.0, 60 sec: 5700.7, 300 sec: 5693.1). Total num frames: 480299008. Throughput: 0: 5941.6. Samples: 480306534. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:45,227][25689] Avg episode reward: [(0, '-44.956')] [2022-07-09 23:32:46,943][26022] Updated weights on worker 0-0, policy_version 469052 (0.00080) [2022-07-09 23:32:48,747][26022] Updated weights on worker 0-0, policy_version 469062 (0.00052) [2022-07-09 23:32:50,256][25689] Fps is (10 sec: 5917.7, 60 sec: 5715.9, 300 sec: 5693.4). Total num frames: 480328704. Throughput: 0: 5087.2. Samples: 480323716. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:50,258][25689] Avg episode reward: [(0, '-45.404')] [2022-07-09 23:32:50,453][26022] Updated weights on worker 0-0, policy_version 469072 (0.00088) [2022-07-09 23:32:52,479][26022] Updated weights on worker 0-0, policy_version 469082 (0.00084) [2022-07-09 23:32:54,049][26022] Updated weights on worker 0-0, policy_version 469092 (0.00079) [2022-07-09 23:32:55,379][25689] Fps is (10 sec: 5547.7, 60 sec: 5679.5, 300 sec: 5684.4). Total num frames: 480355328. Throughput: 0: 5941.9. Samples: 480358208. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:32:55,379][25689] Avg episode reward: [(0, '-45.933')] [2022-07-09 23:32:55,871][26022] Updated weights on worker 0-0, policy_version 469102 (0.00091) [2022-07-09 23:32:57,777][26022] Updated weights on worker 0-0, policy_version 469112 (0.00094) [2022-07-09 23:32:59,333][26022] Updated weights on worker 0-0, policy_version 469122 (0.00086) [2022-07-09 23:33:00,386][25689] Fps is (10 sec: 5660.8, 60 sec: 5697.4, 300 sec: 5699.7). Total num frames: 480386048. Throughput: 0: 5970.9. Samples: 480392836. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:00,386][25689] Avg episode reward: [(0, '-46.296')] [2022-07-09 23:33:01,284][26022] Updated weights on worker 0-0, policy_version 469132 (0.00092) [2022-07-09 23:33:03,319][26022] Updated weights on worker 0-0, policy_version 469142 (0.00087) [2022-07-09 23:33:05,245][26022] Updated weights on worker 0-0, policy_version 469152 (0.00085) [2022-07-09 23:33:05,434][25689] Fps is (10 sec: 5804.9, 60 sec: 5744.2, 300 sec: 5692.3). Total num frames: 480413696. Throughput: 0: 5014.4. Samples: 480407860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:05,434][25689] Avg episode reward: [(0, '-46.271')] [2022-07-09 23:33:07,075][26022] Updated weights on worker 0-0, policy_version 469162 (0.00095) [2022-07-09 23:33:08,608][26022] Updated weights on worker 0-0, policy_version 469172 (0.00090) [2022-07-09 23:33:10,441][25689] Fps is (10 sec: 5397.3, 60 sec: 5677.4, 300 sec: 5687.9). Total num frames: 480440320. Throughput: 0: 5854.1. Samples: 480441884. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:10,442][25689] Avg episode reward: [(0, '-47.313')] [2022-07-09 23:33:10,705][26022] Updated weights on worker 0-0, policy_version 469182 (0.00087) [2022-07-09 23:33:12,248][26022] Updated weights on worker 0-0, policy_version 469192 (0.00095) [2022-07-09 23:33:14,145][26022] Updated weights on worker 0-0, policy_version 469202 (0.00085) [2022-07-09 23:33:15,496][25689] Fps is (10 sec: 5698.8, 60 sec: 5685.1, 300 sec: 5697.8). Total num frames: 480471040. Throughput: 0: 5872.3. Samples: 480476344. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:15,497][25689] Avg episode reward: [(0, '-46.514')] [2022-07-09 23:33:15,930][26022] Updated weights on worker 0-0, policy_version 469212 (0.00090) [2022-07-09 23:33:17,689][26022] Updated weights on worker 0-0, policy_version 469222 (0.00090) [2022-07-09 23:33:19,735][26022] Updated weights on worker 0-0, policy_version 469232 (0.00082) [2022-07-09 23:33:20,565][25689] Fps is (10 sec: 5765.7, 60 sec: 5698.2, 300 sec: 5687.1). Total num frames: 480498688. Throughput: 0: 4995.6. Samples: 480493640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:20,565][25689] Avg episode reward: [(0, '-46.133')] [2022-07-09 23:33:21,137][26022] Updated weights on worker 0-0, policy_version 469242 (0.00587) [2022-07-09 23:33:23,189][26022] Updated weights on worker 0-0, policy_version 469252 (0.00085) [2022-07-09 23:33:24,877][26022] Updated weights on worker 0-0, policy_version 469262 (0.00086) [2022-07-09 23:33:25,649][25689] Fps is (10 sec: 5648.2, 60 sec: 5676.3, 300 sec: 5696.5). Total num frames: 480528384. Throughput: 0: 5951.9. Samples: 480528178. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:25,649][25689] Avg episode reward: [(0, '-46.593')] [2022-07-09 23:33:26,857][26022] Updated weights on worker 0-0, policy_version 469272 (0.00080) [2022-07-09 23:33:28,568][26022] Updated weights on worker 0-0, policy_version 469282 (0.00090) [2022-07-09 23:33:30,149][26022] Updated weights on worker 0-0, policy_version 469292 (0.00091) [2022-07-09 23:33:30,715][25689] Fps is (10 sec: 5851.4, 60 sec: 5708.3, 300 sec: 5698.4). Total num frames: 480558080. Throughput: 0: 5931.3. Samples: 480562130. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:30,715][25689] Avg episode reward: [(0, '-46.539')] [2022-07-09 23:33:32,215][26022] Updated weights on worker 0-0, policy_version 469302 (0.00091) [2022-07-09 23:33:33,907][26022] Updated weights on worker 0-0, policy_version 469312 (0.00090) [2022-07-09 23:33:35,742][25689] Fps is (10 sec: 5681.2, 60 sec: 5681.2, 300 sec: 5691.5). Total num frames: 480585728. Throughput: 0: 5073.5. Samples: 480579064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:35,743][25689] Avg episode reward: [(0, '-45.881')] [2022-07-09 23:33:35,749][26022] Updated weights on worker 0-0, policy_version 469322 (0.00083) [2022-07-09 23:33:37,428][26022] Updated weights on worker 0-0, policy_version 469332 (0.00096) [2022-07-09 23:33:39,124][26022] Updated weights on worker 0-0, policy_version 469342 (0.00087) [2022-07-09 23:33:40,810][25689] Fps is (10 sec: 5680.1, 60 sec: 5712.3, 300 sec: 5697.1). Total num frames: 480615424. Throughput: 0: 5919.2. Samples: 480613478. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:40,811][25689] Avg episode reward: [(0, '-45.418')] [2022-07-09 23:33:41,161][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:33:41,173][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000469352_480616448.pth [2022-07-09 23:33:41,174][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000467350_478566400.pth [2022-07-09 23:33:41,178][26022] Updated weights on worker 0-0, policy_version 469352 (0.00092) [2022-07-09 23:33:43,123][26022] Updated weights on worker 0-0, policy_version 469362 (0.00089) [2022-07-09 23:33:44,665][26022] Updated weights on worker 0-0, policy_version 469372 (0.00050) [2022-07-09 23:33:45,837][25689] Fps is (10 sec: 5680.8, 60 sec: 5676.7, 300 sec: 5690.4). Total num frames: 480643072. Throughput: 0: 5934.3. Samples: 480647980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-09 23:33:45,837][25689] Avg episode reward: [(0, '-46.110')] [2022-07-09 23:33:46,560][26022] Updated weights on worker 0-0, policy_version 469382 (0.00091) [2022-07-09 23:33:48,270][26022] Updated weights on worker 0-0, policy_version 469392 (0.00098) [2022-07-09 23:33:50,218][26022] Updated weights on worker 0-0, policy_version 469402 (0.00087) [2022-07-09 23:33:50,907][25689] Fps is (10 sec: 5577.9, 60 sec: 5655.9, 300 sec: 5694.5). Total num frames: 480671744. Throughput: 0: 5091.9. Samples: 480664948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:33:50,908][25689] Avg episode reward: [(0, '-45.712')] [2022-07-09 23:33:51,963][26022] Updated weights on worker 0-0, policy_version 469412 (0.00052) [2022-07-09 23:33:53,807][26022] Updated weights on worker 0-0, policy_version 469422 (0.00078) [2022-07-09 23:33:55,503][26022] Updated weights on worker 0-0, policy_version 469432 (0.00088) [2022-07-09 23:33:55,979][25689] Fps is (10 sec: 5553.2, 60 sec: 5677.6, 300 sec: 5686.3). Total num frames: 480699392. Throughput: 0: 5925.6. Samples: 480698976. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:33:55,979][25689] Avg episode reward: [(0, '-44.491')] [2022-07-09 23:33:57,273][26022] Updated weights on worker 0-0, policy_version 469442 (0.00084) [2022-07-09 23:33:59,138][26022] Updated weights on worker 0-0, policy_version 469452 (0.00088) [2022-07-09 23:34:00,744][26022] Updated weights on worker 0-0, policy_version 469462 (0.00085) [2022-07-09 23:34:00,992][25689] Fps is (10 sec: 5787.8, 60 sec: 5677.1, 300 sec: 5699.9). Total num frames: 480730112. Throughput: 0: 5951.6. Samples: 480733590. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:00,992][25689] Avg episode reward: [(0, '-44.599')] [2022-07-09 23:34:03,278][26022] Updated weights on worker 0-0, policy_version 469472 (0.00777) [2022-07-09 23:34:04,792][26022] Updated weights on worker 0-0, policy_version 469482 (0.00084) [2022-07-09 23:34:06,000][25689] Fps is (10 sec: 5619.9, 60 sec: 5647.0, 300 sec: 5693.1). Total num frames: 480755712. Throughput: 0: 4994.6. Samples: 480748688. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:06,001][25689] Avg episode reward: [(0, '-44.430')] [2022-07-09 23:34:06,847][26022] Updated weights on worker 0-0, policy_version 469492 (0.00087) [2022-07-09 23:34:08,334][26022] Updated weights on worker 0-0, policy_version 469502 (0.00091) [2022-07-09 23:34:10,335][26022] Updated weights on worker 0-0, policy_version 469512 (0.00089) [2022-07-09 23:34:11,012][25689] Fps is (10 sec: 5416.1, 60 sec: 5680.4, 300 sec: 5690.4). Total num frames: 480784384. Throughput: 0: 5868.2. Samples: 480782928. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:11,013][25689] Avg episode reward: [(0, '-44.385')] [2022-07-09 23:34:12,007][26022] Updated weights on worker 0-0, policy_version 469522 (0.00084) [2022-07-09 23:34:14,001][26022] Updated weights on worker 0-0, policy_version 469532 (0.00092) [2022-07-09 23:34:15,632][26022] Updated weights on worker 0-0, policy_version 469542 (0.00091) [2022-07-09 23:34:16,081][25689] Fps is (10 sec: 5789.8, 60 sec: 5662.1, 300 sec: 5689.1). Total num frames: 480814080. Throughput: 0: 5884.9. Samples: 480817278. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:16,082][25689] Avg episode reward: [(0, '-44.048')] [2022-07-09 23:34:17,262][26022] Updated weights on worker 0-0, policy_version 469552 (0.00090) [2022-07-09 23:34:19,223][26022] Updated weights on worker 0-0, policy_version 469562 (0.00091) [2022-07-09 23:34:20,814][26022] Updated weights on worker 0-0, policy_version 469572 (0.00088) [2022-07-09 23:34:21,102][25689] Fps is (10 sec: 5683.3, 60 sec: 5666.6, 300 sec: 5685.5). Total num frames: 480841728. Throughput: 0: 5025.5. Samples: 480834654. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:21,103][25689] Avg episode reward: [(0, '-44.945')] [2022-07-09 23:34:22,789][26022] Updated weights on worker 0-0, policy_version 469582 (0.00089) [2022-07-09 23:34:24,429][26022] Updated weights on worker 0-0, policy_version 469592 (0.00068) [2022-07-09 23:34:26,117][25689] Fps is (10 sec: 5612.0, 60 sec: 5656.1, 300 sec: 5692.1). Total num frames: 480870400. Throughput: 0: 5994.1. Samples: 480869270. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:26,117][25689] Avg episode reward: [(0, '-45.827')] [2022-07-09 23:34:26,310][26022] Updated weights on worker 0-0, policy_version 469602 (0.00092) [2022-07-09 23:34:28,170][26022] Updated weights on worker 0-0, policy_version 469612 (0.00090) [2022-07-09 23:34:29,943][26022] Updated weights on worker 0-0, policy_version 469622 (0.00084) [2022-07-09 23:34:31,146][25689] Fps is (10 sec: 5811.2, 60 sec: 5659.6, 300 sec: 5688.8). Total num frames: 480900096. Throughput: 0: 5981.9. Samples: 480903366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:31,147][25689] Avg episode reward: [(0, '-45.827')] [2022-07-09 23:34:31,581][26022] Updated weights on worker 0-0, policy_version 469632 (0.00086) [2022-07-09 23:34:33,619][26022] Updated weights on worker 0-0, policy_version 469642 (0.00086) [2022-07-09 23:34:35,088][26022] Updated weights on worker 0-0, policy_version 469652 (0.00093) [2022-07-09 23:34:36,253][25689] Fps is (10 sec: 5556.3, 60 sec: 5635.2, 300 sec: 5684.0). Total num frames: 480926720. Throughput: 0: 5973.2. Samples: 480937770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:36,254][25689] Avg episode reward: [(0, '-45.651')] [2022-07-09 23:34:37,156][26022] Updated weights on worker 0-0, policy_version 469662 (0.00131) [2022-07-09 23:34:38,747][26022] Updated weights on worker 0-0, policy_version 469672 (0.00089) [2022-07-09 23:34:40,657][26022] Updated weights on worker 0-0, policy_version 469682 (0.00088) [2022-07-09 23:34:41,278][25689] Fps is (10 sec: 5761.1, 60 sec: 5673.1, 300 sec: 5690.6). Total num frames: 480958464. Throughput: 0: 5953.5. Samples: 480954768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:41,278][25689] Avg episode reward: [(0, '-45.770')] [2022-07-09 23:34:42,403][26022] Updated weights on worker 0-0, policy_version 469692 (0.00086) [2022-07-09 23:34:44,115][26022] Updated weights on worker 0-0, policy_version 469702 (0.00091) [2022-07-09 23:34:45,984][26022] Updated weights on worker 0-0, policy_version 469712 (0.00091) [2022-07-09 23:34:46,298][25689] Fps is (10 sec: 5912.5, 60 sec: 5673.7, 300 sec: 5683.6). Total num frames: 480986112. Throughput: 0: 5963.5. Samples: 480989622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:46,299][25689] Avg episode reward: [(0, '-46.521')] [2022-07-09 23:34:47,766][26022] Updated weights on worker 0-0, policy_version 469722 (0.00095) [2022-07-09 23:34:49,510][26022] Updated weights on worker 0-0, policy_version 469732 (0.00091) [2022-07-09 23:34:51,286][26022] Updated weights on worker 0-0, policy_version 469742 (0.00095) [2022-07-09 23:34:51,324][25689] Fps is (10 sec: 5708.1, 60 sec: 5694.9, 300 sec: 5688.1). Total num frames: 481015808. Throughput: 0: 5996.2. Samples: 481024354. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:51,324][25689] Avg episode reward: [(0, '-45.666')] [2022-07-09 23:34:52,937][26022] Updated weights on worker 0-0, policy_version 469752 (0.00091) [2022-07-09 23:34:54,779][26022] Updated weights on worker 0-0, policy_version 469762 (0.00086) [2022-07-09 23:34:56,375][25689] Fps is (10 sec: 5792.6, 60 sec: 5713.8, 300 sec: 5688.5). Total num frames: 481044480. Throughput: 0: 5162.8. Samples: 481041652. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:34:56,377][25689] Avg episode reward: [(0, '-44.212')] [2022-07-09 23:34:56,595][26022] Updated weights on worker 0-0, policy_version 469772 (0.00091) [2022-07-09 23:34:58,276][26022] Updated weights on worker 0-0, policy_version 469782 (0.00089) [2022-07-09 23:35:00,274][26022] Updated weights on worker 0-0, policy_version 469792 (0.00098) [2022-07-09 23:35:01,403][25689] Fps is (10 sec: 5689.0, 60 sec: 5678.4, 300 sec: 5694.9). Total num frames: 481073152. Throughput: 0: 6035.5. Samples: 481076236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:01,404][25689] Avg episode reward: [(0, '-44.390')] [2022-07-09 23:35:02,109][26022] Updated weights on worker 0-0, policy_version 469802 (0.00098) [2022-07-09 23:35:04,165][26022] Updated weights on worker 0-0, policy_version 469812 (0.00083) [2022-07-09 23:35:05,800][26022] Updated weights on worker 0-0, policy_version 469822 (0.00085) [2022-07-09 23:35:06,414][25689] Fps is (10 sec: 5610.0, 60 sec: 5712.1, 300 sec: 5691.6). Total num frames: 481100800. Throughput: 0: 5922.9. Samples: 481108764. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:06,416][25689] Avg episode reward: [(0, '-44.445')] [2022-07-09 23:35:07,786][26022] Updated weights on worker 0-0, policy_version 469832 (0.00100) [2022-07-09 23:35:09,546][26022] Updated weights on worker 0-0, policy_version 469842 (0.00082) [2022-07-09 23:35:11,187][26022] Updated weights on worker 0-0, policy_version 469852 (0.00081) [2022-07-09 23:35:11,428][25689] Fps is (10 sec: 5618.2, 60 sec: 5711.9, 300 sec: 5692.2). Total num frames: 481129472. Throughput: 0: 5041.8. Samples: 481125716. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:11,431][25689] Avg episode reward: [(0, '-43.610')] [2022-07-09 23:35:13,118][26022] Updated weights on worker 0-0, policy_version 469862 (0.00089) [2022-07-09 23:35:14,786][26022] Updated weights on worker 0-0, policy_version 469872 (0.00082) [2022-07-09 23:35:16,567][25689] Fps is (10 sec: 5648.1, 60 sec: 5688.4, 300 sec: 5689.9). Total num frames: 481158144. Throughput: 0: 5879.5. Samples: 481160372. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:16,569][25689] Avg episode reward: [(0, '-43.846')] [2022-07-09 23:35:16,632][26022] Updated weights on worker 0-0, policy_version 469882 (0.00093) [2022-07-09 23:35:18,347][26022] Updated weights on worker 0-0, policy_version 469892 (0.00083) [2022-07-09 23:35:20,103][26022] Updated weights on worker 0-0, policy_version 469902 (0.00085) [2022-07-09 23:35:21,598][25689] Fps is (10 sec: 5839.7, 60 sec: 5738.1, 300 sec: 5693.7). Total num frames: 481188864. Throughput: 0: 5883.4. Samples: 481195052. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:21,600][25689] Avg episode reward: [(0, '-44.650')] [2022-07-09 23:35:21,931][26022] Updated weights on worker 0-0, policy_version 469912 (0.00080) [2022-07-09 23:35:23,689][26022] Updated weights on worker 0-0, policy_version 469922 (0.00088) [2022-07-09 23:35:25,437][26022] Updated weights on worker 0-0, policy_version 469932 (0.00089) [2022-07-09 23:35:26,609][25689] Fps is (10 sec: 5710.3, 60 sec: 5704.7, 300 sec: 5688.4). Total num frames: 481215488. Throughput: 0: 5128.8. Samples: 481212340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:26,611][25689] Avg episode reward: [(0, '-45.271')] [2022-07-09 23:35:27,155][26022] Updated weights on worker 0-0, policy_version 469942 (0.00094) [2022-07-09 23:35:29,142][26022] Updated weights on worker 0-0, policy_version 469952 (0.00102) [2022-07-09 23:35:30,725][26022] Updated weights on worker 0-0, policy_version 469962 (0.00091) [2022-07-09 23:35:31,615][25689] Fps is (10 sec: 5622.6, 60 sec: 5706.9, 300 sec: 5697.4). Total num frames: 481245184. Throughput: 0: 6005.0. Samples: 481246940. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:31,616][25689] Avg episode reward: [(0, '-46.265')] [2022-07-09 23:35:32,567][26022] Updated weights on worker 0-0, policy_version 469972 (0.00086) [2022-07-09 23:35:34,330][26022] Updated weights on worker 0-0, policy_version 469982 (0.00093) [2022-07-09 23:35:36,131][26022] Updated weights on worker 0-0, policy_version 469992 (0.00084) [2022-07-09 23:35:36,659][25689] Fps is (10 sec: 5909.9, 60 sec: 5763.7, 300 sec: 5690.7). Total num frames: 481274880. Throughput: 0: 6035.6. Samples: 481281638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:36,659][25689] Avg episode reward: [(0, '-46.905')] [2022-07-09 23:35:37,962][26022] Updated weights on worker 0-0, policy_version 470002 (0.00088) [2022-07-09 23:35:39,684][26022] Updated weights on worker 0-0, policy_version 470012 (0.00089) [2022-07-09 23:35:41,204][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:35:41,214][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000470020_481300480.pth [2022-07-09 23:35:41,214][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000468018_479250432.pth [2022-07-09 23:35:41,433][26022] Updated weights on worker 0-0, policy_version 470022 (0.00081) [2022-07-09 23:35:41,684][25689] Fps is (10 sec: 5797.0, 60 sec: 5712.8, 300 sec: 5694.8). Total num frames: 481303552. Throughput: 0: 5164.6. Samples: 481298786. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:41,684][25689] Avg episode reward: [(0, '-47.165')] [2022-07-09 23:35:43,447][26022] Updated weights on worker 0-0, policy_version 470032 (0.00090) [2022-07-09 23:35:44,998][26022] Updated weights on worker 0-0, policy_version 470042 (0.00089) [2022-07-09 23:35:46,711][25689] Fps is (10 sec: 5602.9, 60 sec: 5712.2, 300 sec: 5689.3). Total num frames: 481331200. Throughput: 0: 6016.3. Samples: 481333276. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:46,711][25689] Avg episode reward: [(0, '-46.867')] [2022-07-09 23:35:46,881][26022] Updated weights on worker 0-0, policy_version 470052 (0.00095) [2022-07-09 23:35:48,506][26022] Updated weights on worker 0-0, policy_version 470062 (0.00087) [2022-07-09 23:35:50,461][26022] Updated weights on worker 0-0, policy_version 470072 (0.00084) [2022-07-09 23:35:51,715][25689] Fps is (10 sec: 5614.5, 60 sec: 5697.3, 300 sec: 5684.5). Total num frames: 481359872. Throughput: 0: 6009.2. Samples: 481367724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:51,715][25689] Avg episode reward: [(0, '-46.888')] [2022-07-09 23:35:52,402][26022] Updated weights on worker 0-0, policy_version 470082 (0.00097) [2022-07-09 23:35:53,886][26022] Updated weights on worker 0-0, policy_version 470092 (0.00089) [2022-07-09 23:35:55,864][26022] Updated weights on worker 0-0, policy_version 470102 (0.00086) [2022-07-09 23:35:56,760][25689] Fps is (10 sec: 5808.3, 60 sec: 5714.8, 300 sec: 5695.0). Total num frames: 481389568. Throughput: 0: 5137.7. Samples: 481384910. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:35:56,760][25689] Avg episode reward: [(0, '-47.011')] [2022-07-09 23:35:57,305][26022] Updated weights on worker 0-0, policy_version 470112 (0.00092) [2022-07-09 23:35:59,499][26022] Updated weights on worker 0-0, policy_version 470122 (0.00081) [2022-07-09 23:36:01,128][26022] Updated weights on worker 0-0, policy_version 470132 (0.00094) [2022-07-09 23:36:01,775][25689] Fps is (10 sec: 5598.5, 60 sec: 5682.2, 300 sec: 5692.2). Total num frames: 481416192. Throughput: 0: 5992.0. Samples: 481419170. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:01,775][25689] Avg episode reward: [(0, '-47.166')] [2022-07-09 23:36:03,455][26022] Updated weights on worker 0-0, policy_version 470142 (0.00088) [2022-07-09 23:36:05,351][26022] Updated weights on worker 0-0, policy_version 470152 (0.00087) [2022-07-09 23:36:06,777][25689] Fps is (10 sec: 5520.2, 60 sec: 5700.0, 300 sec: 5690.2). Total num frames: 481444864. Throughput: 0: 5874.5. Samples: 481451154. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:06,777][25689] Avg episode reward: [(0, '-47.186')] [2022-07-09 23:36:06,838][26022] Updated weights on worker 0-0, policy_version 470162 (0.00100) [2022-07-09 23:36:08,814][26022] Updated weights on worker 0-0, policy_version 470172 (0.00088) [2022-07-09 23:36:10,594][26022] Updated weights on worker 0-0, policy_version 470182 (0.00088) [2022-07-09 23:36:11,787][25689] Fps is (10 sec: 5625.0, 60 sec: 5683.3, 300 sec: 5687.9). Total num frames: 481472512. Throughput: 0: 5011.4. Samples: 481468314. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:11,789][25689] Avg episode reward: [(0, '-47.280')] [2022-07-09 23:36:12,358][26022] Updated weights on worker 0-0, policy_version 470192 (0.00086) [2022-07-09 23:36:14,232][26022] Updated weights on worker 0-0, policy_version 470202 (0.00091) [2022-07-09 23:36:15,950][26022] Updated weights on worker 0-0, policy_version 470212 (0.01282) [2022-07-09 23:36:16,897][25689] Fps is (10 sec: 5666.4, 60 sec: 5703.1, 300 sec: 5689.8). Total num frames: 481502208. Throughput: 0: 5848.4. Samples: 481502680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:16,898][25689] Avg episode reward: [(0, '-47.765')] [2022-07-09 23:36:17,852][26022] Updated weights on worker 0-0, policy_version 470222 (0.00089) [2022-07-09 23:36:19,345][26022] Updated weights on worker 0-0, policy_version 470232 (0.00086) [2022-07-09 23:36:21,403][26022] Updated weights on worker 0-0, policy_version 470242 (0.00096) [2022-07-09 23:36:21,951][25689] Fps is (10 sec: 5843.7, 60 sec: 5684.0, 300 sec: 5692.5). Total num frames: 481531904. Throughput: 0: 5874.1. Samples: 481537686. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:21,951][25689] Avg episode reward: [(0, '-48.467')] [2022-07-09 23:36:22,840][26022] Updated weights on worker 0-0, policy_version 470252 (0.00081) [2022-07-09 23:36:24,867][26022] Updated weights on worker 0-0, policy_version 470262 (0.00095) [2022-07-09 23:36:26,675][26022] Updated weights on worker 0-0, policy_version 470272 (0.00097) [2022-07-09 23:36:27,049][25689] Fps is (10 sec: 5749.0, 60 sec: 5709.6, 300 sec: 5691.0). Total num frames: 481560576. Throughput: 0: 5128.9. Samples: 481555130. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:27,053][25689] Avg episode reward: [(0, '-47.845')] [2022-07-09 23:36:28,462][26022] Updated weights on worker 0-0, policy_version 470282 (0.00083) [2022-07-09 23:36:30,153][26022] Updated weights on worker 0-0, policy_version 470292 (0.00089) [2022-07-09 23:36:32,121][25689] Fps is (10 sec: 5638.2, 60 sec: 5686.4, 300 sec: 5687.6). Total num frames: 481589248. Throughput: 0: 5953.1. Samples: 481589364. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:32,122][25689] Avg episode reward: [(0, '-47.121')] [2022-07-09 23:36:32,127][26022] Updated weights on worker 0-0, policy_version 470302 (0.00227) [2022-07-09 23:36:33,710][26022] Updated weights on worker 0-0, policy_version 470312 (0.00091) [2022-07-09 23:36:35,755][26022] Updated weights on worker 0-0, policy_version 470322 (0.00094) [2022-07-09 23:36:37,183][25689] Fps is (10 sec: 5759.8, 60 sec: 5684.7, 300 sec: 5691.7). Total num frames: 481618944. Throughput: 0: 5956.9. Samples: 481623524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-09 23:36:37,184][25689] Avg episode reward: [(0, '-46.724')] [2022-07-09 23:36:37,219][26022] Updated weights on worker 0-0, policy_version 470332 (0.00082) [2022-07-09 23:36:39,316][26022] Updated weights on worker 0-0, policy_version 470342 (0.00085) [2022-07-09 23:36:40,824][26022] Updated weights on worker 0-0, policy_version 470352 (0.00083) [2022-07-09 23:36:42,211][25689] Fps is (10 sec: 5582.1, 60 sec: 5650.6, 300 sec: 5685.9). Total num frames: 481645568. Throughput: 0: 5077.1. Samples: 481640548. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:36:42,211][25689] Avg episode reward: [(0, '-46.601')] [2022-07-09 23:36:42,751][26022] Updated weights on worker 0-0, policy_version 470362 (0.00088) [2022-07-09 23:36:44,451][26022] Updated weights on worker 0-0, policy_version 470372 (0.01291) [2022-07-09 23:36:46,270][26022] Updated weights on worker 0-0, policy_version 470382 (0.00099) [2022-07-09 23:36:47,223][25689] Fps is (10 sec: 5609.8, 60 sec: 5685.9, 300 sec: 5689.3). Total num frames: 481675264. Throughput: 0: 5937.4. Samples: 481674908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:36:47,223][25689] Avg episode reward: [(0, '-45.717')] [2022-07-09 23:36:48,108][26022] Updated weights on worker 0-0, policy_version 470392 (0.00091) [2022-07-09 23:36:49,895][26022] Updated weights on worker 0-0, policy_version 470402 (0.00086) [2022-07-09 23:36:51,559][26022] Updated weights on worker 0-0, policy_version 470412 (0.00092) [2022-07-09 23:36:52,267][25689] Fps is (10 sec: 6007.9, 60 sec: 5716.0, 300 sec: 5697.2). Total num frames: 481705984. Throughput: 0: 5969.4. Samples: 481709620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:36:52,267][25689] Avg episode reward: [(0, '-46.085')] [2022-07-09 23:36:53,626][26022] Updated weights on worker 0-0, policy_version 470422 (0.00084) [2022-07-09 23:36:55,022][26022] Updated weights on worker 0-0, policy_version 470432 (0.00085) [2022-07-09 23:36:57,217][26022] Updated weights on worker 0-0, policy_version 470442 (0.00087) [2022-07-09 23:36:57,343][25689] Fps is (10 sec: 5868.8, 60 sec: 5696.1, 300 sec: 5692.6). Total num frames: 481734656. Throughput: 0: 5128.7. Samples: 481726916. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:36:57,343][25689] Avg episode reward: [(0, '-46.387')] [2022-07-09 23:36:58,825][26022] Updated weights on worker 0-0, policy_version 470452 (0.00085) [2022-07-09 23:37:00,418][26022] Updated weights on worker 0-0, policy_version 470462 (0.00092) [2022-07-09 23:37:02,407][25689] Fps is (10 sec: 5554.2, 60 sec: 5708.4, 300 sec: 5701.8). Total num frames: 481762304. Throughput: 0: 5980.4. Samples: 481761328. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:02,409][25689] Avg episode reward: [(0, '-46.431')] [2022-07-09 23:37:02,468][26022] Updated weights on worker 0-0, policy_version 470472 (0.00099) [2022-07-09 23:37:04,515][26022] Updated weights on worker 0-0, policy_version 470482 (0.00083) [2022-07-09 23:37:06,193][26022] Updated weights on worker 0-0, policy_version 470492 (0.00087) [2022-07-09 23:37:07,416][25689] Fps is (10 sec: 5489.4, 60 sec: 5690.8, 300 sec: 5691.6). Total num frames: 481789952. Throughput: 0: 5888.9. Samples: 481793824. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:07,416][25689] Avg episode reward: [(0, '-46.037')] [2022-07-09 23:37:08,069][26022] Updated weights on worker 0-0, policy_version 470502 (0.00085) [2022-07-09 23:37:09,588][26022] Updated weights on worker 0-0, policy_version 470512 (0.00082) [2022-07-09 23:37:11,632][26022] Updated weights on worker 0-0, policy_version 470522 (0.00084) [2022-07-09 23:37:12,456][25689] Fps is (10 sec: 5706.5, 60 sec: 5721.9, 300 sec: 5690.0). Total num frames: 481819648. Throughput: 0: 5868.7. Samples: 481828102. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:12,456][25689] Avg episode reward: [(0, '-45.809')] [2022-07-09 23:37:13,280][26022] Updated weights on worker 0-0, policy_version 470532 (0.00086) [2022-07-09 23:37:15,197][26022] Updated weights on worker 0-0, policy_version 470542 (0.00097) [2022-07-09 23:37:16,932][26022] Updated weights on worker 0-0, policy_version 470552 (0.00085) [2022-07-09 23:37:17,529][25689] Fps is (10 sec: 5670.2, 60 sec: 5691.5, 300 sec: 5692.6). Total num frames: 481847296. Throughput: 0: 5863.5. Samples: 481845280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:17,531][25689] Avg episode reward: [(0, '-45.672')] [2022-07-09 23:37:18,826][26022] Updated weights on worker 0-0, policy_version 470562 (0.00097) [2022-07-09 23:37:20,562][26022] Updated weights on worker 0-0, policy_version 470572 (0.00090) [2022-07-09 23:37:22,436][26022] Updated weights on worker 0-0, policy_version 470582 (0.00081) [2022-07-09 23:37:22,534][25689] Fps is (10 sec: 5588.0, 60 sec: 5679.2, 300 sec: 5686.2). Total num frames: 481875968. Throughput: 0: 5878.9. Samples: 481879656. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:22,535][25689] Avg episode reward: [(0, '-44.823')] [2022-07-09 23:37:24,080][26022] Updated weights on worker 0-0, policy_version 470592 (0.00090) [2022-07-09 23:37:26,190][26022] Updated weights on worker 0-0, policy_version 470602 (0.00096) [2022-07-09 23:37:27,587][25689] Fps is (10 sec: 5701.5, 60 sec: 5683.5, 300 sec: 5689.6). Total num frames: 481904640. Throughput: 0: 5951.0. Samples: 481913862. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:27,588][25689] Avg episode reward: [(0, '-45.969')] [2022-07-09 23:37:27,678][26022] Updated weights on worker 0-0, policy_version 470612 (0.00094) [2022-07-09 23:37:29,673][26022] Updated weights on worker 0-0, policy_version 470622 (0.00084) [2022-07-09 23:37:31,301][26022] Updated weights on worker 0-0, policy_version 470632 (0.00094) [2022-07-09 23:37:32,613][25689] Fps is (10 sec: 5791.4, 60 sec: 5704.7, 300 sec: 5691.0). Total num frames: 481934336. Throughput: 0: 5104.7. Samples: 481930998. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:32,613][25689] Avg episode reward: [(0, '-45.162')] [2022-07-09 23:37:33,172][26022] Updated weights on worker 0-0, policy_version 470642 (0.00094) [2022-07-09 23:37:35,053][26022] Updated weights on worker 0-0, policy_version 470652 (0.00079) [2022-07-09 23:37:36,731][26022] Updated weights on worker 0-0, policy_version 470662 (0.00053) [2022-07-09 23:37:37,740][25689] Fps is (10 sec: 5647.7, 60 sec: 5664.7, 300 sec: 5689.3). Total num frames: 481961984. Throughput: 0: 5940.0. Samples: 481965336. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:37,741][25689] Avg episode reward: [(0, '-45.160')] [2022-07-09 23:37:38,559][26022] Updated weights on worker 0-0, policy_version 470672 (0.00085) [2022-07-09 23:37:40,425][26022] Updated weights on worker 0-0, policy_version 470682 (0.00086) [2022-07-09 23:37:41,223][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:37:41,239][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000470686_481982464.pth [2022-07-09 23:37:41,240][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000468686_479934464.pth [2022-07-09 23:37:42,037][26022] Updated weights on worker 0-0, policy_version 470692 (0.00087) [2022-07-09 23:37:42,798][25689] Fps is (10 sec: 5730.7, 60 sec: 5729.5, 300 sec: 5691.7). Total num frames: 481992704. Throughput: 0: 5922.0. Samples: 481999656. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:42,798][25689] Avg episode reward: [(0, '-44.804')] [2022-07-09 23:37:44,092][26022] Updated weights on worker 0-0, policy_version 470702 (0.00088) [2022-07-09 23:37:45,603][26022] Updated weights on worker 0-0, policy_version 470712 (0.00096) [2022-07-09 23:37:47,551][26022] Updated weights on worker 0-0, policy_version 470722 (0.00089) [2022-07-09 23:37:47,806][25689] Fps is (10 sec: 5798.6, 60 sec: 5696.1, 300 sec: 5685.2). Total num frames: 482020352. Throughput: 0: 5101.8. Samples: 482017018. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:47,808][25689] Avg episode reward: [(0, '-45.897')] [2022-07-09 23:37:49,211][26022] Updated weights on worker 0-0, policy_version 470732 (0.00084) [2022-07-09 23:37:51,169][26022] Updated weights on worker 0-0, policy_version 470742 (0.00081) [2022-07-09 23:37:52,823][25689] Fps is (10 sec: 5617.8, 60 sec: 5664.8, 300 sec: 5694.2). Total num frames: 482049024. Throughput: 0: 5958.1. Samples: 482051412. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:52,823][25689] Avg episode reward: [(0, '-45.498')] [2022-07-09 23:37:52,828][26022] Updated weights on worker 0-0, policy_version 470752 (0.00086) [2022-07-09 23:37:54,732][26022] Updated weights on worker 0-0, policy_version 470762 (0.00096) [2022-07-09 23:37:56,507][26022] Updated weights on worker 0-0, policy_version 470772 (0.00091) [2022-07-09 23:37:57,948][25689] Fps is (10 sec: 5755.2, 60 sec: 5677.1, 300 sec: 5688.5). Total num frames: 482078720. Throughput: 0: 5964.2. Samples: 482085858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:37:57,949][25689] Avg episode reward: [(0, '-45.970')] [2022-07-09 23:37:58,262][26022] Updated weights on worker 0-0, policy_version 470782 (0.00088) [2022-07-09 23:38:00,009][26022] Updated weights on worker 0-0, policy_version 470792 (0.00085) [2022-07-09 23:38:01,759][26022] Updated weights on worker 0-0, policy_version 470802 (0.00079) [2022-07-09 23:38:03,026][25689] Fps is (10 sec: 5520.0, 60 sec: 5658.9, 300 sec: 5684.5). Total num frames: 482105344. Throughput: 0: 5116.5. Samples: 482103152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:03,027][25689] Avg episode reward: [(0, '-46.159')] [2022-07-09 23:38:03,896][26022] Updated weights on worker 0-0, policy_version 470812 (0.00094) [2022-07-09 23:38:05,577][26022] Updated weights on worker 0-0, policy_version 470822 (0.00082) [2022-07-09 23:38:07,772][26022] Updated weights on worker 0-0, policy_version 470832 (0.00087) [2022-07-09 23:38:08,045][25689] Fps is (10 sec: 5374.9, 60 sec: 5658.0, 300 sec: 5687.7). Total num frames: 482132992. Throughput: 0: 5830.9. Samples: 482135030. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:08,046][25689] Avg episode reward: [(0, '-45.659')] [2022-07-09 23:38:09,166][26022] Updated weights on worker 0-0, policy_version 470842 (0.00092) [2022-07-09 23:38:11,243][26022] Updated weights on worker 0-0, policy_version 470852 (0.00085) [2022-07-09 23:38:12,977][26022] Updated weights on worker 0-0, policy_version 470862 (0.00086) [2022-07-09 23:38:13,076][25689] Fps is (10 sec: 5705.8, 60 sec: 5658.8, 300 sec: 5684.7). Total num frames: 482162688. Throughput: 0: 5822.6. Samples: 482169338. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:13,076][25689] Avg episode reward: [(0, '-45.898')] [2022-07-09 23:38:14,732][26022] Updated weights on worker 0-0, policy_version 470872 (0.00090) [2022-07-09 23:38:16,467][26022] Updated weights on worker 0-0, policy_version 470882 (0.00087) [2022-07-09 23:38:18,135][25689] Fps is (10 sec: 5886.6, 60 sec: 5694.0, 300 sec: 5691.8). Total num frames: 482192384. Throughput: 0: 4998.2. Samples: 482186756. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:18,135][25689] Avg episode reward: [(0, '-45.045')] [2022-07-09 23:38:18,222][26022] Updated weights on worker 0-0, policy_version 470892 (0.00093) [2022-07-09 23:38:19,943][26022] Updated weights on worker 0-0, policy_version 470902 (0.00082) [2022-07-09 23:38:21,871][26022] Updated weights on worker 0-0, policy_version 470912 (0.00100) [2022-07-09 23:38:23,142][25689] Fps is (10 sec: 6002.0, 60 sec: 5727.6, 300 sec: 5696.7). Total num frames: 482223104. Throughput: 0: 5881.1. Samples: 482221458. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:23,142][25689] Avg episode reward: [(0, '-45.003')] [2022-07-09 23:38:23,402][26022] Updated weights on worker 0-0, policy_version 470922 (0.00083) [2022-07-09 23:38:25,523][26022] Updated weights on worker 0-0, policy_version 470932 (0.00087) [2022-07-09 23:38:27,067][26022] Updated weights on worker 0-0, policy_version 470942 (0.00061) [2022-07-09 23:38:28,153][25689] Fps is (10 sec: 5622.0, 60 sec: 5680.8, 300 sec: 5684.0). Total num frames: 482248704. Throughput: 0: 5994.0. Samples: 482255552. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:28,153][25689] Avg episode reward: [(0, '-45.228')] [2022-07-09 23:38:28,994][26022] Updated weights on worker 0-0, policy_version 470952 (0.00084) [2022-07-09 23:38:30,939][26022] Updated weights on worker 0-0, policy_version 470962 (0.00089) [2022-07-09 23:38:32,487][26022] Updated weights on worker 0-0, policy_version 470972 (0.00612) [2022-07-09 23:38:33,158][25689] Fps is (10 sec: 5520.6, 60 sec: 5682.7, 300 sec: 5691.3). Total num frames: 482278400. Throughput: 0: 5153.8. Samples: 482272838. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:33,159][25689] Avg episode reward: [(0, '-45.928')] [2022-07-09 23:38:34,495][26022] Updated weights on worker 0-0, policy_version 470982 (0.00093) [2022-07-09 23:38:36,073][26022] Updated weights on worker 0-0, policy_version 470992 (0.00083) [2022-07-09 23:38:37,883][26022] Updated weights on worker 0-0, policy_version 471002 (0.00085) [2022-07-09 23:38:38,267][25689] Fps is (10 sec: 5770.7, 60 sec: 5701.4, 300 sec: 5687.0). Total num frames: 482307072. Throughput: 0: 5987.8. Samples: 482307304. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:38,268][25689] Avg episode reward: [(0, '-45.737')] [2022-07-09 23:38:39,911][26022] Updated weights on worker 0-0, policy_version 471012 (0.00085) [2022-07-09 23:38:41,468][26022] Updated weights on worker 0-0, policy_version 471022 (0.00086) [2022-07-09 23:38:43,304][25689] Fps is (10 sec: 5652.2, 60 sec: 5669.5, 300 sec: 5690.3). Total num frames: 482335744. Throughput: 0: 5957.8. Samples: 482341578. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:43,304][25689] Avg episode reward: [(0, '-46.503')] [2022-07-09 23:38:43,379][26022] Updated weights on worker 0-0, policy_version 471032 (0.00082) [2022-07-09 23:38:45,236][26022] Updated weights on worker 0-0, policy_version 471042 (0.00086) [2022-07-09 23:38:46,742][26022] Updated weights on worker 0-0, policy_version 471052 (0.00477) [2022-07-09 23:38:48,316][25689] Fps is (10 sec: 5808.4, 60 sec: 5703.0, 300 sec: 5694.8). Total num frames: 482365440. Throughput: 0: 5123.4. Samples: 482358858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:48,317][25689] Avg episode reward: [(0, '-45.943')] [2022-07-09 23:38:48,883][26022] Updated weights on worker 0-0, policy_version 471062 (0.00083) [2022-07-09 23:38:50,235][26022] Updated weights on worker 0-0, policy_version 471072 (0.00085) [2022-07-09 23:38:52,437][26022] Updated weights on worker 0-0, policy_version 471082 (0.00086) [2022-07-09 23:38:53,349][25689] Fps is (10 sec: 5708.4, 60 sec: 5684.5, 300 sec: 5695.6). Total num frames: 482393088. Throughput: 0: 5951.4. Samples: 482393002. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:53,350][25689] Avg episode reward: [(0, '-46.007')] [2022-07-09 23:38:54,011][26022] Updated weights on worker 0-0, policy_version 471092 (0.00087) [2022-07-09 23:38:55,793][26022] Updated weights on worker 0-0, policy_version 471102 (0.00081) [2022-07-09 23:38:57,805][26022] Updated weights on worker 0-0, policy_version 471112 (0.00081) [2022-07-09 23:38:58,459][25689] Fps is (10 sec: 5754.7, 60 sec: 5702.9, 300 sec: 5693.7). Total num frames: 482423808. Throughput: 0: 5940.2. Samples: 482427246. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:38:58,459][25689] Avg episode reward: [(0, '-46.144')] [2022-07-09 23:38:59,419][26022] Updated weights on worker 0-0, policy_version 471122 (0.00085) [2022-07-09 23:39:01,275][26022] Updated weights on worker 0-0, policy_version 471132 (0.00090) [2022-07-09 23:39:03,443][26022] Updated weights on worker 0-0, policy_version 471142 (0.00092) [2022-07-09 23:39:03,473][25689] Fps is (10 sec: 5563.1, 60 sec: 5692.0, 300 sec: 5693.6). Total num frames: 482449408. Throughput: 0: 5100.7. Samples: 482444456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:39:03,474][25689] Avg episode reward: [(0, '-46.049')] [2022-07-09 23:39:05,133][26022] Updated weights on worker 0-0, policy_version 471152 (0.00086) [2022-07-09 23:39:07,077][26022] Updated weights on worker 0-0, policy_version 471162 (0.00087) [2022-07-09 23:39:08,479][25689] Fps is (10 sec: 5416.2, 60 sec: 5710.2, 300 sec: 5693.7). Total num frames: 482478080. Throughput: 0: 5834.9. Samples: 482476506. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:39:08,480][25689] Avg episode reward: [(0, '-47.171')] [2022-07-09 23:39:08,888][26022] Updated weights on worker 0-0, policy_version 471172 (0.00089) [2022-07-09 23:39:10,570][26022] Updated weights on worker 0-0, policy_version 471182 (0.00093) [2022-07-09 23:39:12,516][26022] Updated weights on worker 0-0, policy_version 471192 (0.00086) [2022-07-09 23:39:13,505][25689] Fps is (10 sec: 5614.1, 60 sec: 5676.7, 300 sec: 5687.7). Total num frames: 482505728. Throughput: 0: 5830.8. Samples: 482510526. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:39:13,507][25689] Avg episode reward: [(0, '-46.178')] [2022-07-09 23:39:14,143][26022] Updated weights on worker 0-0, policy_version 471202 (0.00082) [2022-07-09 23:39:15,998][26022] Updated weights on worker 0-0, policy_version 471212 (0.00085) [2022-07-09 23:39:17,579][26022] Updated weights on worker 0-0, policy_version 471222 (0.00090) [2022-07-09 23:39:18,569][25689] Fps is (10 sec: 5581.4, 60 sec: 5659.3, 300 sec: 5690.3). Total num frames: 482534400. Throughput: 0: 5863.3. Samples: 482545162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:39:18,571][25689] Avg episode reward: [(0, '-46.610')] [2022-07-09 23:39:19,493][26022] Updated weights on worker 0-0, policy_version 471232 (0.00082) [2022-07-09 23:39:21,377][26022] Updated weights on worker 0-0, policy_version 471242 (0.00085) [2022-07-09 23:39:23,061][26022] Updated weights on worker 0-0, policy_version 471252 (0.00100) [2022-07-09 23:39:23,602][25689] Fps is (10 sec: 5780.9, 60 sec: 5640.0, 300 sec: 5693.4). Total num frames: 482564096. Throughput: 0: 5857.5. Samples: 482562360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:39:23,602][25689] Avg episode reward: [(0, '-46.140')] [2022-07-09 23:39:25,042][26022] Updated weights on worker 0-0, policy_version 471262 (0.00087) [2022-07-09 23:39:26,584][26022] Updated weights on worker 0-0, policy_version 471272 (0.00086) [2022-07-09 23:39:28,522][26022] Updated weights on worker 0-0, policy_version 471282 (0.00088) [2022-07-09 23:39:28,632][25689] Fps is (10 sec: 5800.4, 60 sec: 5688.9, 300 sec: 5689.9). Total num frames: 482592768. Throughput: 0: 5964.3. Samples: 482596706. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-09 23:39:28,633][25689] Avg episode reward: [(0, '-46.472')] [2022-07-09 23:39:30,317][26022] Updated weights on worker 0-0, policy_version 471292 (0.00092) [2022-07-09 23:39:32,029][26022] Updated weights on worker 0-0, policy_version 471302 (0.00084) [2022-07-09 23:39:33,710][25689] Fps is (10 sec: 5774.4, 60 sec: 5682.2, 300 sec: 5700.8). Total num frames: 482622464. Throughput: 0: 5985.5. Samples: 482631462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:39:33,710][25689] Avg episode reward: [(0, '-46.808')] [2022-07-09 23:39:33,799][26022] Updated weights on worker 0-0, policy_version 471312 (0.00088) [2022-07-09 23:39:35,842][26022] Updated weights on worker 0-0, policy_version 471322 (0.00091) [2022-07-09 23:39:37,169][26022] Updated weights on worker 0-0, policy_version 471332 (0.00091) [2022-07-09 23:39:38,813][25689] Fps is (10 sec: 5632.4, 60 sec: 5665.8, 300 sec: 5685.5). Total num frames: 482650112. Throughput: 0: 5115.2. Samples: 482648712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:39:38,814][25689] Avg episode reward: [(0, '-46.603')] [2022-07-09 23:39:39,346][26022] Updated weights on worker 0-0, policy_version 471342 (0.00086) [2022-07-09 23:39:40,787][26022] Updated weights on worker 0-0, policy_version 471352 (0.00087) [2022-07-09 23:39:41,485][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:39:41,507][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000471354_482666496.pth [2022-07-09 23:39:41,508][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000469352_480616448.pth [2022-07-09 23:39:42,960][26022] Updated weights on worker 0-0, policy_version 471362 (0.00087) [2022-07-09 23:39:43,816][25689] Fps is (10 sec: 5573.1, 60 sec: 5669.0, 300 sec: 5689.3). Total num frames: 482678784. Throughput: 0: 5934.5. Samples: 482682318. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:39:43,816][25689] Avg episode reward: [(0, '-47.242')] [2022-07-09 23:39:44,532][26022] Updated weights on worker 0-0, policy_version 471372 (0.00091) [2022-07-09 23:39:46,667][26022] Updated weights on worker 0-0, policy_version 471382 (0.00091) [2022-07-09 23:39:48,164][26022] Updated weights on worker 0-0, policy_version 471392 (0.00086) [2022-07-09 23:39:48,837][25689] Fps is (10 sec: 5720.5, 60 sec: 5651.2, 300 sec: 5685.9). Total num frames: 482707456. Throughput: 0: 5931.8. Samples: 482716560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:39:48,838][25689] Avg episode reward: [(0, '-48.043')] [2022-07-09 23:39:50,174][26022] Updated weights on worker 0-0, policy_version 471402 (0.00092) [2022-07-09 23:39:51,821][26022] Updated weights on worker 0-0, policy_version 471412 (0.00089) [2022-07-09 23:39:53,580][26022] Updated weights on worker 0-0, policy_version 471422 (0.00087) [2022-07-09 23:39:53,842][25689] Fps is (10 sec: 5923.8, 60 sec: 5704.7, 300 sec: 5693.7). Total num frames: 482738176. Throughput: 0: 5086.3. Samples: 482733858. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:39:53,843][25689] Avg episode reward: [(0, '-47.486')] [2022-07-09 23:39:55,411][26022] Updated weights on worker 0-0, policy_version 471432 (0.00085) [2022-07-09 23:39:57,240][26022] Updated weights on worker 0-0, policy_version 471442 (0.00092) [2022-07-09 23:39:58,930][25689] Fps is (10 sec: 5681.7, 60 sec: 5638.9, 300 sec: 5685.7). Total num frames: 482764800. Throughput: 0: 5945.0. Samples: 482768308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:39:58,931][25689] Avg episode reward: [(0, '-47.234')] [2022-07-09 23:39:59,067][26022] Updated weights on worker 0-0, policy_version 471452 (0.00088) [2022-07-09 23:40:00,664][26022] Updated weights on worker 0-0, policy_version 471462 (0.00087) [2022-07-09 23:40:03,032][26022] Updated weights on worker 0-0, policy_version 471472 (0.00087) [2022-07-09 23:40:03,969][25689] Fps is (10 sec: 5358.9, 60 sec: 5670.5, 300 sec: 5685.2). Total num frames: 482792448. Throughput: 0: 5874.1. Samples: 482800700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:03,970][25689] Avg episode reward: [(0, '-47.142')] [2022-07-09 23:40:04,573][26022] Updated weights on worker 0-0, policy_version 471482 (0.00087) [2022-07-09 23:40:06,562][26022] Updated weights on worker 0-0, policy_version 471492 (0.00085) [2022-07-09 23:40:08,248][26022] Updated weights on worker 0-0, policy_version 471502 (0.00092) [2022-07-09 23:40:09,037][25689] Fps is (10 sec: 5572.3, 60 sec: 5664.6, 300 sec: 5684.1). Total num frames: 482821120. Throughput: 0: 5017.9. Samples: 482817922. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:09,039][25689] Avg episode reward: [(0, '-47.087')] [2022-07-09 23:40:10,130][26022] Updated weights on worker 0-0, policy_version 471512 (0.00089) [2022-07-09 23:40:12,042][26022] Updated weights on worker 0-0, policy_version 471522 (0.00091) [2022-07-09 23:40:13,776][26022] Updated weights on worker 0-0, policy_version 471532 (0.00089) [2022-07-09 23:40:14,045][25689] Fps is (10 sec: 5691.2, 60 sec: 5683.3, 300 sec: 5686.6). Total num frames: 482849792. Throughput: 0: 5852.8. Samples: 482852104. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:14,046][25689] Avg episode reward: [(0, '-47.472')] [2022-07-09 23:40:15,387][26022] Updated weights on worker 0-0, policy_version 471542 (0.00090) [2022-07-09 23:40:17,281][26022] Updated weights on worker 0-0, policy_version 471552 (0.00093) [2022-07-09 23:40:18,982][26022] Updated weights on worker 0-0, policy_version 471562 (0.00087) [2022-07-09 23:40:19,094][25689] Fps is (10 sec: 5804.0, 60 sec: 5701.6, 300 sec: 5682.9). Total num frames: 482879488. Throughput: 0: 5873.4. Samples: 482886738. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:19,095][25689] Avg episode reward: [(0, '-46.143')] [2022-07-09 23:40:20,879][26022] Updated weights on worker 0-0, policy_version 471572 (0.00088) [2022-07-09 23:40:22,521][26022] Updated weights on worker 0-0, policy_version 471582 (0.00082) [2022-07-09 23:40:24,109][25689] Fps is (10 sec: 5901.6, 60 sec: 5703.3, 300 sec: 5693.1). Total num frames: 482909184. Throughput: 0: 5138.2. Samples: 482904182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:24,109][25689] Avg episode reward: [(0, '-45.759')] [2022-07-09 23:40:24,239][26022] Updated weights on worker 0-0, policy_version 471592 (0.00081) [2022-07-09 23:40:26,208][26022] Updated weights on worker 0-0, policy_version 471602 (0.00084) [2022-07-09 23:40:27,735][26022] Updated weights on worker 0-0, policy_version 471612 (0.00087) [2022-07-09 23:40:29,142][25689] Fps is (10 sec: 5707.2, 60 sec: 5686.1, 300 sec: 5685.7). Total num frames: 482936832. Throughput: 0: 6016.1. Samples: 482938872. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:29,142][25689] Avg episode reward: [(0, '-45.761')] [2022-07-09 23:40:29,790][26022] Updated weights on worker 0-0, policy_version 471622 (0.00091) [2022-07-09 23:40:31,515][26022] Updated weights on worker 0-0, policy_version 471632 (0.00081) [2022-07-09 23:40:33,304][26022] Updated weights on worker 0-0, policy_version 471642 (0.00103) [2022-07-09 23:40:34,173][25689] Fps is (10 sec: 5697.6, 60 sec: 5690.5, 300 sec: 5685.9). Total num frames: 482966528. Throughput: 0: 6023.6. Samples: 482973350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:34,174][25689] Avg episode reward: [(0, '-45.430')] [2022-07-09 23:40:35,030][26022] Updated weights on worker 0-0, policy_version 471652 (0.00089) [2022-07-09 23:40:36,897][26022] Updated weights on worker 0-0, policy_version 471662 (0.00092) [2022-07-09 23:40:38,599][26022] Updated weights on worker 0-0, policy_version 471672 (0.00084) [2022-07-09 23:40:39,219][25689] Fps is (10 sec: 5893.8, 60 sec: 5729.8, 300 sec: 5689.0). Total num frames: 482996224. Throughput: 0: 5160.9. Samples: 482990600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:39,219][25689] Avg episode reward: [(0, '-44.832')] [2022-07-09 23:40:40,474][26022] Updated weights on worker 0-0, policy_version 471682 (0.00799) [2022-07-09 23:40:42,048][26022] Updated weights on worker 0-0, policy_version 471692 (0.00094) [2022-07-09 23:40:43,974][26022] Updated weights on worker 0-0, policy_version 471702 (0.00641) [2022-07-09 23:40:44,227][25689] Fps is (10 sec: 5703.9, 60 sec: 5712.3, 300 sec: 5689.3). Total num frames: 483023872. Throughput: 0: 6016.8. Samples: 483025228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:44,227][25689] Avg episode reward: [(0, '-44.992')] [2022-07-09 23:40:45,719][26022] Updated weights on worker 0-0, policy_version 471712 (0.00090) [2022-07-09 23:40:47,442][26022] Updated weights on worker 0-0, policy_version 471722 (0.00080) [2022-07-09 23:40:49,232][25689] Fps is (10 sec: 5624.4, 60 sec: 5713.9, 300 sec: 5689.3). Total num frames: 483052544. Throughput: 0: 6012.6. Samples: 483059668. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:49,233][25689] Avg episode reward: [(0, '-46.020')] [2022-07-09 23:40:49,329][26022] Updated weights on worker 0-0, policy_version 471732 (0.00086) [2022-07-09 23:40:51,082][26022] Updated weights on worker 0-0, policy_version 471742 (0.00083) [2022-07-09 23:40:52,739][26022] Updated weights on worker 0-0, policy_version 471752 (0.00085) [2022-07-09 23:40:54,252][25689] Fps is (10 sec: 5822.1, 60 sec: 5695.4, 300 sec: 5689.8). Total num frames: 483082240. Throughput: 0: 5161.5. Samples: 483076986. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:54,253][25689] Avg episode reward: [(0, '-46.414')] [2022-07-09 23:40:54,662][26022] Updated weights on worker 0-0, policy_version 471762 (0.00084) [2022-07-09 23:40:56,278][26022] Updated weights on worker 0-0, policy_version 471772 (0.00094) [2022-07-09 23:40:58,181][26022] Updated weights on worker 0-0, policy_version 471782 (0.00080) [2022-07-09 23:40:59,363][25689] Fps is (10 sec: 5761.6, 60 sec: 5727.3, 300 sec: 5694.9). Total num frames: 483110912. Throughput: 0: 6020.0. Samples: 483111866. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:40:59,363][25689] Avg episode reward: [(0, '-46.411')] [2022-07-09 23:40:59,880][26022] Updated weights on worker 0-0, policy_version 471792 (0.00083) [2022-07-09 23:41:02,133][26022] Updated weights on worker 0-0, policy_version 471802 (0.00088) [2022-07-09 23:41:03,882][26022] Updated weights on worker 0-0, policy_version 471812 (0.00087) [2022-07-09 23:41:04,383][25689] Fps is (10 sec: 5559.0, 60 sec: 5729.0, 300 sec: 5691.1). Total num frames: 483138560. Throughput: 0: 5899.1. Samples: 483144134. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:04,384][25689] Avg episode reward: [(0, '-46.231')] [2022-07-09 23:41:05,575][26022] Updated weights on worker 0-0, policy_version 471822 (0.00083) [2022-07-09 23:41:07,363][26022] Updated weights on worker 0-0, policy_version 471832 (0.00080) [2022-07-09 23:41:09,309][26022] Updated weights on worker 0-0, policy_version 471842 (0.00089) [2022-07-09 23:41:09,398][25689] Fps is (10 sec: 5510.0, 60 sec: 5717.1, 300 sec: 5691.0). Total num frames: 483166208. Throughput: 0: 5052.6. Samples: 483161560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:09,399][25689] Avg episode reward: [(0, '-46.193')] [2022-07-09 23:41:10,938][26022] Updated weights on worker 0-0, policy_version 471852 (0.00089) [2022-07-09 23:41:12,872][26022] Updated weights on worker 0-0, policy_version 471862 (0.00088) [2022-07-09 23:41:14,423][25689] Fps is (10 sec: 5711.5, 60 sec: 5732.4, 300 sec: 5692.6). Total num frames: 483195904. Throughput: 0: 5900.8. Samples: 483196012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:14,424][25689] Avg episode reward: [(0, '-46.039')] [2022-07-09 23:41:14,435][26022] Updated weights on worker 0-0, policy_version 471872 (0.00093) [2022-07-09 23:41:16,504][26022] Updated weights on worker 0-0, policy_version 471882 (0.00091) [2022-07-09 23:41:18,172][26022] Updated weights on worker 0-0, policy_version 471892 (0.00084) [2022-07-09 23:41:19,482][25689] Fps is (10 sec: 5788.6, 60 sec: 5714.6, 300 sec: 5689.1). Total num frames: 483224576. Throughput: 0: 5888.1. Samples: 483230328. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:19,482][25689] Avg episode reward: [(0, '-45.187')] [2022-07-09 23:41:20,038][26022] Updated weights on worker 0-0, policy_version 471902 (0.00085) [2022-07-09 23:41:21,691][26022] Updated weights on worker 0-0, policy_version 471912 (0.00104) [2022-07-09 23:41:23,552][26022] Updated weights on worker 0-0, policy_version 471922 (0.00092) [2022-07-09 23:41:24,498][25689] Fps is (10 sec: 5793.4, 60 sec: 5714.4, 300 sec: 5694.1). Total num frames: 483254272. Throughput: 0: 5136.9. Samples: 483247462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:24,499][25689] Avg episode reward: [(0, '-44.664')] [2022-07-09 23:41:25,222][26022] Updated weights on worker 0-0, policy_version 471932 (0.00088) [2022-07-09 23:41:27,041][26022] Updated weights on worker 0-0, policy_version 471942 (0.00088) [2022-07-09 23:41:28,759][26022] Updated weights on worker 0-0, policy_version 471952 (0.00084) [2022-07-09 23:41:29,521][25689] Fps is (10 sec: 5609.8, 60 sec: 5698.4, 300 sec: 5688.1). Total num frames: 483280896. Throughput: 0: 5984.7. Samples: 483281990. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:29,522][25689] Avg episode reward: [(0, '-44.478')] [2022-07-09 23:41:30,764][26022] Updated weights on worker 0-0, policy_version 471962 (0.00087) [2022-07-09 23:41:32,407][26022] Updated weights on worker 0-0, policy_version 471972 (0.00086) [2022-07-09 23:41:34,214][26022] Updated weights on worker 0-0, policy_version 471982 (0.00084) [2022-07-09 23:41:34,559][25689] Fps is (10 sec: 5700.0, 60 sec: 5714.8, 300 sec: 5692.0). Total num frames: 483311616. Throughput: 0: 5987.4. Samples: 483316570. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:34,559][25689] Avg episode reward: [(0, '-44.943')] [2022-07-09 23:41:35,931][26022] Updated weights on worker 0-0, policy_version 471992 (0.00089) [2022-07-09 23:41:37,742][26022] Updated weights on worker 0-0, policy_version 472002 (0.00089) [2022-07-09 23:41:39,490][26022] Updated weights on worker 0-0, policy_version 472012 (0.00054) [2022-07-09 23:41:39,624][25689] Fps is (10 sec: 5980.3, 60 sec: 5712.9, 300 sec: 5701.6). Total num frames: 483341312. Throughput: 0: 5138.7. Samples: 483333832. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:39,624][25689] Avg episode reward: [(0, '-44.350')] [2022-07-09 23:41:41,346][26022] Updated weights on worker 0-0, policy_version 472022 (0.00084) [2022-07-09 23:41:41,530][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:41:41,542][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000472023_483351552.pth [2022-07-09 23:41:41,543][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000470020_481300480.pth [2022-07-09 23:41:42,865][26022] Updated weights on worker 0-0, policy_version 472032 (0.00086) [2022-07-09 23:41:44,723][25689] Fps is (10 sec: 5641.5, 60 sec: 5704.3, 300 sec: 5693.1). Total num frames: 483368960. Throughput: 0: 5974.5. Samples: 483368296. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:44,724][25689] Avg episode reward: [(0, '-44.725')] [2022-07-09 23:41:45,110][26022] Updated weights on worker 0-0, policy_version 472042 (0.00091) [2022-07-09 23:41:46,744][26022] Updated weights on worker 0-0, policy_version 472052 (0.00086) [2022-07-09 23:41:48,568][26022] Updated weights on worker 0-0, policy_version 472062 (0.00085) [2022-07-09 23:41:49,734][25689] Fps is (10 sec: 5570.5, 60 sec: 5703.8, 300 sec: 5686.8). Total num frames: 483397632. Throughput: 0: 5969.1. Samples: 483402644. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:49,735][25689] Avg episode reward: [(0, '-46.069')] [2022-07-09 23:41:50,167][26022] Updated weights on worker 0-0, policy_version 472072 (0.00088) [2022-07-09 23:41:51,916][26022] Updated weights on worker 0-0, policy_version 472082 (0.00080) [2022-07-09 23:41:54,001][26022] Updated weights on worker 0-0, policy_version 472092 (0.00088) [2022-07-09 23:41:54,782][25689] Fps is (10 sec: 5803.0, 60 sec: 5701.1, 300 sec: 5690.8). Total num frames: 483427328. Throughput: 0: 5108.5. Samples: 483419880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:54,782][25689] Avg episode reward: [(0, '-45.685')] [2022-07-09 23:41:55,571][26022] Updated weights on worker 0-0, policy_version 472102 (0.00082) [2022-07-09 23:41:57,481][26022] Updated weights on worker 0-0, policy_version 472112 (0.00088) [2022-07-09 23:41:59,147][26022] Updated weights on worker 0-0, policy_version 472122 (0.00096) [2022-07-09 23:41:59,915][25689] Fps is (10 sec: 5632.3, 60 sec: 5682.1, 300 sec: 5689.5). Total num frames: 483454976. Throughput: 0: 5952.3. Samples: 483454614. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:41:59,916][25689] Avg episode reward: [(0, '-46.195')] [2022-07-09 23:42:00,832][26022] Updated weights on worker 0-0, policy_version 472132 (0.00090) [2022-07-09 23:42:03,304][26022] Updated weights on worker 0-0, policy_version 472142 (0.00090) [2022-07-09 23:42:04,868][26022] Updated weights on worker 0-0, policy_version 472152 (0.00090) [2022-07-09 23:42:04,929][25689] Fps is (10 sec: 5550.5, 60 sec: 5699.7, 300 sec: 5692.9). Total num frames: 483483648. Throughput: 0: 5882.1. Samples: 483487146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:42:04,929][25689] Avg episode reward: [(0, '-45.455')] [2022-07-09 23:42:06,751][26022] Updated weights on worker 0-0, policy_version 472162 (0.00092) [2022-07-09 23:42:08,564][26022] Updated weights on worker 0-0, policy_version 472172 (0.00084) [2022-07-09 23:42:09,947][25689] Fps is (10 sec: 5716.4, 60 sec: 5716.3, 300 sec: 5689.8). Total num frames: 483512320. Throughput: 0: 5872.2. Samples: 483521336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:42:09,947][25689] Avg episode reward: [(0, '-45.744')] [2022-07-09 23:42:10,243][26022] Updated weights on worker 0-0, policy_version 472182 (0.00096) [2022-07-09 23:42:12,055][26022] Updated weights on worker 0-0, policy_version 472192 (0.00095) [2022-07-09 23:42:13,824][26022] Updated weights on worker 0-0, policy_version 472202 (0.00094) [2022-07-09 23:42:14,981][25689] Fps is (10 sec: 5500.7, 60 sec: 5664.7, 300 sec: 5687.1). Total num frames: 483538944. Throughput: 0: 5881.4. Samples: 483538682. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-09 23:42:14,982][25689] Avg episode reward: [(0, '-44.977')] [2022-07-09 23:42:15,546][26022] Updated weights on worker 0-0, policy_version 472212 (0.00090) [2022-07-09 23:42:17,519][26022] Updated weights on worker 0-0, policy_version 472222 (0.00100) [2022-07-09 23:42:19,126][26022] Updated weights on worker 0-0, policy_version 472232 (0.00092) [2022-07-09 23:42:20,094][25689] Fps is (10 sec: 5752.0, 60 sec: 5710.2, 300 sec: 5695.4). Total num frames: 483570688. Throughput: 0: 5858.1. Samples: 483572824. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:20,095][25689] Avg episode reward: [(0, '-45.512')] [2022-07-09 23:42:21,292][26022] Updated weights on worker 0-0, policy_version 472242 (0.00089) [2022-07-09 23:42:22,678][26022] Updated weights on worker 0-0, policy_version 472252 (0.00087) [2022-07-09 23:42:24,767][26022] Updated weights on worker 0-0, policy_version 472262 (0.00088) [2022-07-09 23:42:25,121][25689] Fps is (10 sec: 5756.1, 60 sec: 5658.6, 300 sec: 5689.0). Total num frames: 483597312. Throughput: 0: 5936.5. Samples: 483607020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:25,122][25689] Avg episode reward: [(0, '-44.752')] [2022-07-09 23:42:26,401][26022] Updated weights on worker 0-0, policy_version 472272 (0.00085) [2022-07-09 23:42:28,473][26022] Updated weights on worker 0-0, policy_version 472282 (0.00082) [2022-07-09 23:42:30,117][26022] Updated weights on worker 0-0, policy_version 472292 (0.00082) [2022-07-09 23:42:30,216][25689] Fps is (10 sec: 5564.3, 60 sec: 5702.5, 300 sec: 5687.7). Total num frames: 483627008. Throughput: 0: 5067.2. Samples: 483624042. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:30,216][25689] Avg episode reward: [(0, '-44.270')] [2022-07-09 23:42:31,858][26022] Updated weights on worker 0-0, policy_version 472302 (0.00083) [2022-07-09 23:42:33,630][26022] Updated weights on worker 0-0, policy_version 472312 (0.00089) [2022-07-09 23:42:35,264][25689] Fps is (10 sec: 5956.6, 60 sec: 5701.5, 300 sec: 5699.5). Total num frames: 483657728. Throughput: 0: 5921.5. Samples: 483658788. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:35,265][25689] Avg episode reward: [(0, '-44.305')] [2022-07-09 23:42:35,277][26022] Updated weights on worker 0-0, policy_version 472322 (0.00087) [2022-07-09 23:42:37,266][26022] Updated weights on worker 0-0, policy_version 472332 (0.00093) [2022-07-09 23:42:38,812][26022] Updated weights on worker 0-0, policy_version 472342 (0.00090) [2022-07-09 23:42:40,351][25689] Fps is (10 sec: 5759.0, 60 sec: 5665.7, 300 sec: 5688.6). Total num frames: 483685376. Throughput: 0: 5928.4. Samples: 483692914. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:40,351][25689] Avg episode reward: [(0, '-44.311')] [2022-07-09 23:42:40,766][26022] Updated weights on worker 0-0, policy_version 472352 (0.00092) [2022-07-09 23:42:42,514][26022] Updated weights on worker 0-0, policy_version 472362 (0.00086) [2022-07-09 23:42:44,422][26022] Updated weights on worker 0-0, policy_version 472372 (0.00089) [2022-07-09 23:42:45,388][25689] Fps is (10 sec: 5664.0, 60 sec: 5705.3, 300 sec: 5695.0). Total num frames: 483715072. Throughput: 0: 5089.5. Samples: 483710168. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:45,390][25689] Avg episode reward: [(0, '-44.264')] [2022-07-09 23:42:46,088][26022] Updated weights on worker 0-0, policy_version 472382 (0.00092) [2022-07-09 23:42:47,932][26022] Updated weights on worker 0-0, policy_version 472392 (0.00091) [2022-07-09 23:42:49,578][26022] Updated weights on worker 0-0, policy_version 472402 (0.00097) [2022-07-09 23:42:50,414][25689] Fps is (10 sec: 5698.3, 60 sec: 5687.0, 300 sec: 5691.4). Total num frames: 483742720. Throughput: 0: 5967.7. Samples: 483744580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:50,416][25689] Avg episode reward: [(0, '-42.866')] [2022-07-09 23:42:51,442][26022] Updated weights on worker 0-0, policy_version 472412 (0.00085) [2022-07-09 23:42:53,459][26022] Updated weights on worker 0-0, policy_version 472422 (0.00083) [2022-07-09 23:42:55,011][26022] Updated weights on worker 0-0, policy_version 472432 (0.00822) [2022-07-09 23:42:55,428][25689] Fps is (10 sec: 5814.0, 60 sec: 5707.1, 300 sec: 5696.9). Total num frames: 483773440. Throughput: 0: 5980.7. Samples: 483779380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:42:55,428][25689] Avg episode reward: [(0, '-43.517')] [2022-07-09 23:42:56,974][26022] Updated weights on worker 0-0, policy_version 472442 (0.00088) [2022-07-09 23:42:58,411][26022] Updated weights on worker 0-0, policy_version 472452 (0.00083) [2022-07-09 23:43:00,399][26022] Updated weights on worker 0-0, policy_version 472462 (0.00087) [2022-07-09 23:43:00,555][25689] Fps is (10 sec: 5755.8, 60 sec: 5707.7, 300 sec: 5699.4). Total num frames: 483801088. Throughput: 0: 5140.0. Samples: 483796762. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:00,555][25689] Avg episode reward: [(0, '-43.259')] [2022-07-09 23:43:02,396][26022] Updated weights on worker 0-0, policy_version 472472 (0.00084) [2022-07-09 23:43:04,331][26022] Updated weights on worker 0-0, policy_version 472482 (0.00090) [2022-07-09 23:43:05,599][25689] Fps is (10 sec: 5335.6, 60 sec: 5671.0, 300 sec: 5695.5). Total num frames: 483827712. Throughput: 0: 5889.1. Samples: 483829192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:05,601][25689] Avg episode reward: [(0, '-42.717')] [2022-07-09 23:43:06,027][26022] Updated weights on worker 0-0, policy_version 472492 (0.00085) [2022-07-09 23:43:07,643][26022] Updated weights on worker 0-0, policy_version 472502 (0.00094) [2022-07-09 23:43:09,631][26022] Updated weights on worker 0-0, policy_version 472512 (0.00084) [2022-07-09 23:43:10,609][25689] Fps is (10 sec: 5703.7, 60 sec: 5705.6, 300 sec: 5699.4). Total num frames: 483858432. Throughput: 0: 5889.4. Samples: 483863514. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:10,609][25689] Avg episode reward: [(0, '-42.375')] [2022-07-09 23:43:11,416][26022] Updated weights on worker 0-0, policy_version 472522 (0.00086) [2022-07-09 23:43:13,202][26022] Updated weights on worker 0-0, policy_version 472532 (0.00093) [2022-07-09 23:43:15,148][26022] Updated weights on worker 0-0, policy_version 472542 (0.00090) [2022-07-09 23:43:15,639][25689] Fps is (10 sec: 5711.7, 60 sec: 5706.0, 300 sec: 5689.6). Total num frames: 483885056. Throughput: 0: 5012.0. Samples: 483880680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:15,640][25689] Avg episode reward: [(0, '-43.340')] [2022-07-09 23:43:16,791][26022] Updated weights on worker 0-0, policy_version 472552 (0.00104) [2022-07-09 23:43:18,583][26022] Updated weights on worker 0-0, policy_version 472562 (0.00078) [2022-07-09 23:43:20,443][26022] Updated weights on worker 0-0, policy_version 472572 (0.00079) [2022-07-09 23:43:20,724][25689] Fps is (10 sec: 5567.7, 60 sec: 5674.8, 300 sec: 5684.6). Total num frames: 483914752. Throughput: 0: 5868.3. Samples: 483915122. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:20,725][25689] Avg episode reward: [(0, '-43.775')] [2022-07-09 23:43:22,098][26022] Updated weights on worker 0-0, policy_version 472582 (0.00086) [2022-07-09 23:43:24,051][26022] Updated weights on worker 0-0, policy_version 472592 (0.00084) [2022-07-09 23:43:25,599][26022] Updated weights on worker 0-0, policy_version 472602 (0.00091) [2022-07-09 23:43:25,782][25689] Fps is (10 sec: 5855.7, 60 sec: 5722.6, 300 sec: 5697.5). Total num frames: 483944448. Throughput: 0: 5960.3. Samples: 483949486. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:25,782][25689] Avg episode reward: [(0, '-43.654')] [2022-07-09 23:43:27,548][26022] Updated weights on worker 0-0, policy_version 472612 (0.00089) [2022-07-09 23:43:29,194][26022] Updated weights on worker 0-0, policy_version 472622 (0.00091) [2022-07-09 23:43:30,799][25689] Fps is (10 sec: 5691.7, 60 sec: 5696.1, 300 sec: 5690.4). Total num frames: 483972096. Throughput: 0: 5093.1. Samples: 483966348. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:30,800][25689] Avg episode reward: [(0, '-43.317')] [2022-07-09 23:43:31,000][26022] Updated weights on worker 0-0, policy_version 472632 (0.00093) [2022-07-09 23:43:33,051][26022] Updated weights on worker 0-0, policy_version 472642 (0.00095) [2022-07-09 23:43:34,639][26022] Updated weights on worker 0-0, policy_version 472652 (0.00085) [2022-07-09 23:43:35,818][25689] Fps is (10 sec: 5713.4, 60 sec: 5681.9, 300 sec: 5695.6). Total num frames: 484001792. Throughput: 0: 5962.4. Samples: 484000998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:35,819][25689] Avg episode reward: [(0, '-43.371')] [2022-07-09 23:43:36,479][26022] Updated weights on worker 0-0, policy_version 472662 (0.00088) [2022-07-09 23:43:38,418][26022] Updated weights on worker 0-0, policy_version 472672 (0.00095) [2022-07-09 23:43:39,859][26022] Updated weights on worker 0-0, policy_version 472682 (0.00088) [2022-07-09 23:43:40,905][25689] Fps is (10 sec: 5775.7, 60 sec: 5698.8, 300 sec: 5694.6). Total num frames: 484030464. Throughput: 0: 5965.2. Samples: 484035506. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:40,907][25689] Avg episode reward: [(0, '-43.923')] [2022-07-09 23:43:41,662][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:43:41,674][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000472690_484034560.pth [2022-07-09 23:43:41,675][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000470686_481982464.pth [2022-07-09 23:43:42,013][26022] Updated weights on worker 0-0, policy_version 472692 (0.00090) [2022-07-09 23:43:43,444][26022] Updated weights on worker 0-0, policy_version 472702 (0.00088) [2022-07-09 23:43:45,511][26022] Updated weights on worker 0-0, policy_version 472712 (0.00090) [2022-07-09 23:43:45,930][25689] Fps is (10 sec: 5772.6, 60 sec: 5700.0, 300 sec: 5694.4). Total num frames: 484060160. Throughput: 0: 5132.8. Samples: 484052902. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:45,932][25689] Avg episode reward: [(0, '-44.184')] [2022-07-09 23:43:47,114][26022] Updated weights on worker 0-0, policy_version 472722 (0.00085) [2022-07-09 23:43:48,942][26022] Updated weights on worker 0-0, policy_version 472732 (0.00093) [2022-07-09 23:43:50,640][26022] Updated weights on worker 0-0, policy_version 472742 (0.00083) [2022-07-09 23:43:50,954][25689] Fps is (10 sec: 5706.8, 60 sec: 5700.2, 300 sec: 5694.5). Total num frames: 484087808. Throughput: 0: 6002.7. Samples: 484087328. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:50,954][25689] Avg episode reward: [(0, '-43.149')] [2022-07-09 23:43:52,599][26022] Updated weights on worker 0-0, policy_version 472752 (0.00084) [2022-07-09 23:43:54,324][26022] Updated weights on worker 0-0, policy_version 472762 (0.00562) [2022-07-09 23:43:55,969][25689] Fps is (10 sec: 5609.8, 60 sec: 5666.2, 300 sec: 5689.4). Total num frames: 484116480. Throughput: 0: 5995.3. Samples: 484121808. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:43:55,970][25689] Avg episode reward: [(0, '-42.817')] [2022-07-09 23:43:56,214][26022] Updated weights on worker 0-0, policy_version 472772 (0.00087) [2022-07-09 23:43:57,777][26022] Updated weights on worker 0-0, policy_version 472782 (0.00081) [2022-07-09 23:43:59,894][26022] Updated weights on worker 0-0, policy_version 472792 (0.00083) [2022-07-09 23:44:01,028][25689] Fps is (10 sec: 5895.5, 60 sec: 5723.4, 300 sec: 5705.8). Total num frames: 484147200. Throughput: 0: 5148.6. Samples: 484139108. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:01,029][25689] Avg episode reward: [(0, '-43.358')] [2022-07-09 23:44:01,386][26022] Updated weights on worker 0-0, policy_version 472802 (0.00088) [2022-07-09 23:44:03,819][26022] Updated weights on worker 0-0, policy_version 472812 (0.00082) [2022-07-09 23:44:05,306][26022] Updated weights on worker 0-0, policy_version 472822 (0.00082) [2022-07-09 23:44:06,038][25689] Fps is (10 sec: 5593.4, 60 sec: 5709.7, 300 sec: 5695.4). Total num frames: 484172800. Throughput: 0: 5880.7. Samples: 484171156. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:06,039][25689] Avg episode reward: [(0, '-43.515')] [2022-07-09 23:44:07,221][26022] Updated weights on worker 0-0, policy_version 472832 (0.00081) [2022-07-09 23:44:08,967][26022] Updated weights on worker 0-0, policy_version 472842 (0.00084) [2022-07-09 23:44:10,618][26022] Updated weights on worker 0-0, policy_version 472852 (0.00083) [2022-07-09 23:44:11,042][25689] Fps is (10 sec: 5419.5, 60 sec: 5676.3, 300 sec: 5699.3). Total num frames: 484201472. Throughput: 0: 5897.7. Samples: 484205804. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:11,043][25689] Avg episode reward: [(0, '-43.007')] [2022-07-09 23:44:12,693][26022] Updated weights on worker 0-0, policy_version 472862 (0.00083) [2022-07-09 23:44:14,360][26022] Updated weights on worker 0-0, policy_version 472872 (0.00094) [2022-07-09 23:44:16,045][25689] Fps is (10 sec: 5628.1, 60 sec: 5695.8, 300 sec: 5697.0). Total num frames: 484229120. Throughput: 0: 5033.0. Samples: 484222850. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:16,046][25689] Avg episode reward: [(0, '-43.771')] [2022-07-09 23:44:16,167][26022] Updated weights on worker 0-0, policy_version 472882 (0.00088) [2022-07-09 23:44:17,985][26022] Updated weights on worker 0-0, policy_version 472892 (0.00086) [2022-07-09 23:44:19,779][26022] Updated weights on worker 0-0, policy_version 472902 (0.00086) [2022-07-09 23:44:21,135][25689] Fps is (10 sec: 5681.6, 60 sec: 5695.4, 300 sec: 5695.9). Total num frames: 484258816. Throughput: 0: 5871.7. Samples: 484257170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:21,135][25689] Avg episode reward: [(0, '-44.202')] [2022-07-09 23:44:21,671][26022] Updated weights on worker 0-0, policy_version 472912 (0.00088) [2022-07-09 23:44:23,271][26022] Updated weights on worker 0-0, policy_version 472922 (0.00343) [2022-07-09 23:44:24,935][26022] Updated weights on worker 0-0, policy_version 472932 (0.00083) [2022-07-09 23:44:26,142][25689] Fps is (10 sec: 5882.4, 60 sec: 5700.2, 300 sec: 5699.8). Total num frames: 484288512. Throughput: 0: 6014.3. Samples: 484292064. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:26,142][25689] Avg episode reward: [(0, '-44.749')] [2022-07-09 23:44:26,931][26022] Updated weights on worker 0-0, policy_version 472942 (0.00089) [2022-07-09 23:44:28,655][26022] Updated weights on worker 0-0, policy_version 472952 (0.00085) [2022-07-09 23:44:30,526][26022] Updated weights on worker 0-0, policy_version 472962 (0.00091) [2022-07-09 23:44:31,157][25689] Fps is (10 sec: 5721.8, 60 sec: 5700.4, 300 sec: 5694.1). Total num frames: 484316160. Throughput: 0: 5985.7. Samples: 484326206. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:31,157][25689] Avg episode reward: [(0, '-44.546')] [2022-07-09 23:44:32,327][26022] Updated weights on worker 0-0, policy_version 472972 (0.00091) [2022-07-09 23:44:33,991][26022] Updated weights on worker 0-0, policy_version 472982 (0.00095) [2022-07-09 23:44:35,947][26022] Updated weights on worker 0-0, policy_version 472992 (0.00091) [2022-07-09 23:44:36,167][25689] Fps is (10 sec: 5720.4, 60 sec: 5701.3, 300 sec: 5702.8). Total num frames: 484345856. Throughput: 0: 6001.6. Samples: 484343610. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:36,167][25689] Avg episode reward: [(0, '-44.566')] [2022-07-09 23:44:37,512][26022] Updated weights on worker 0-0, policy_version 473002 (0.00079) [2022-07-09 23:44:39,443][26022] Updated weights on worker 0-0, policy_version 473012 (0.00085) [2022-07-09 23:44:41,269][25689] Fps is (10 sec: 5771.8, 60 sec: 5699.8, 300 sec: 5700.9). Total num frames: 484374528. Throughput: 0: 6005.9. Samples: 484378096. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:41,270][25689] Avg episode reward: [(0, '-44.119')] [2022-07-09 23:44:41,275][26022] Updated weights on worker 0-0, policy_version 473022 (0.00089) [2022-07-09 23:44:43,014][26022] Updated weights on worker 0-0, policy_version 473032 (0.00085) [2022-07-09 23:44:44,904][26022] Updated weights on worker 0-0, policy_version 473042 (0.00088) [2022-07-09 23:44:46,314][25689] Fps is (10 sec: 5752.0, 60 sec: 5697.9, 300 sec: 5703.9). Total num frames: 484404224. Throughput: 0: 5980.3. Samples: 484412698. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:46,314][25689] Avg episode reward: [(0, '-43.823')] [2022-07-09 23:44:46,472][26022] Updated weights on worker 0-0, policy_version 473052 (0.00087) [2022-07-09 23:44:48,220][26022] Updated weights on worker 0-0, policy_version 473062 (0.00089) [2022-07-09 23:44:50,177][26022] Updated weights on worker 0-0, policy_version 473072 (0.00083) [2022-07-09 23:44:51,379][25689] Fps is (10 sec: 5773.4, 60 sec: 5711.0, 300 sec: 5695.8). Total num frames: 484432896. Throughput: 0: 5117.7. Samples: 484429698. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:51,380][25689] Avg episode reward: [(0, '-44.778')] [2022-07-09 23:44:51,769][26022] Updated weights on worker 0-0, policy_version 473082 (0.00088) [2022-07-09 23:44:53,728][26022] Updated weights on worker 0-0, policy_version 473092 (0.00087) [2022-07-09 23:44:55,508][26022] Updated weights on worker 0-0, policy_version 473102 (0.00089) [2022-07-09 23:44:56,386][25689] Fps is (10 sec: 5490.2, 60 sec: 5677.9, 300 sec: 5697.4). Total num frames: 484459520. Throughput: 0: 5964.9. Samples: 484464216. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:44:56,386][25689] Avg episode reward: [(0, '-44.952')] [2022-07-09 23:44:57,195][26022] Updated weights on worker 0-0, policy_version 473112 (0.00082) [2022-07-09 23:44:59,000][26022] Updated weights on worker 0-0, policy_version 473122 (0.00093) [2022-07-09 23:45:00,659][26022] Updated weights on worker 0-0, policy_version 473132 (0.00091) [2022-07-09 23:45:01,539][25689] Fps is (10 sec: 5543.6, 60 sec: 5652.1, 300 sec: 5702.1). Total num frames: 484489216. Throughput: 0: 5933.0. Samples: 484498354. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:45:01,539][25689] Avg episode reward: [(0, '-44.745')] [2022-07-09 23:45:03,036][26022] Updated weights on worker 0-0, policy_version 473142 (0.00086) [2022-07-09 23:45:04,847][26022] Updated weights on worker 0-0, policy_version 473152 (0.00347) [2022-07-09 23:45:06,551][25689] Fps is (10 sec: 5641.3, 60 sec: 5685.8, 300 sec: 5699.7). Total num frames: 484516864. Throughput: 0: 4987.9. Samples: 484513638. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2022-07-09 23:45:06,552][25689] Avg episode reward: [(0, '-44.398')] [2022-07-09 23:45:06,610][26022] Updated weights on worker 0-0, policy_version 473162 (0.00099) [2022-07-09 23:45:08,515][26022] Updated weights on worker 0-0, policy_version 473172 (0.00084) [2022-07-09 23:45:10,268][26022] Updated weights on worker 0-0, policy_version 473182 (0.00091) [2022-07-09 23:45:11,596][25689] Fps is (10 sec: 5497.9, 60 sec: 5665.0, 300 sec: 5695.6). Total num frames: 484544512. Throughput: 0: 5826.7. Samples: 484547498. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:11,598][25689] Avg episode reward: [(0, '-44.043')] [2022-07-09 23:45:12,101][26022] Updated weights on worker 0-0, policy_version 473192 (0.00087) [2022-07-09 23:45:13,864][26022] Updated weights on worker 0-0, policy_version 473202 (0.00083) [2022-07-09 23:45:15,623][26022] Updated weights on worker 0-0, policy_version 473212 (0.00090) [2022-07-09 23:45:16,679][25689] Fps is (10 sec: 5762.9, 60 sec: 5708.3, 300 sec: 5698.4). Total num frames: 484575232. Throughput: 0: 5815.4. Samples: 484582230. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:16,680][25689] Avg episode reward: [(0, '-43.580')] [2022-07-09 23:45:17,323][26022] Updated weights on worker 0-0, policy_version 473222 (0.00086) [2022-07-09 23:45:19,215][26022] Updated weights on worker 0-0, policy_version 473232 (0.00097) [2022-07-09 23:45:20,829][26022] Updated weights on worker 0-0, policy_version 473242 (0.00085) [2022-07-09 23:45:21,744][25689] Fps is (10 sec: 5751.6, 60 sec: 5676.7, 300 sec: 5690.5). Total num frames: 484602880. Throughput: 0: 4991.6. Samples: 484599214. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:21,745][25689] Avg episode reward: [(0, '-42.509')] [2022-07-09 23:45:22,743][26022] Updated weights on worker 0-0, policy_version 473252 (0.00089) [2022-07-09 23:45:24,484][26022] Updated weights on worker 0-0, policy_version 473262 (0.00084) [2022-07-09 23:45:26,403][26022] Updated weights on worker 0-0, policy_version 473272 (0.00083) [2022-07-09 23:45:26,764][25689] Fps is (10 sec: 5685.9, 60 sec: 5675.5, 300 sec: 5697.6). Total num frames: 484632576. Throughput: 0: 5926.7. Samples: 484633438. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:26,766][25689] Avg episode reward: [(0, '-41.694')] [2022-07-09 23:45:28,302][26022] Updated weights on worker 0-0, policy_version 473282 (0.00096) [2022-07-09 23:45:29,789][26022] Updated weights on worker 0-0, policy_version 473292 (0.00088) [2022-07-09 23:45:31,768][25689] Fps is (10 sec: 5721.0, 60 sec: 5676.6, 300 sec: 5691.3). Total num frames: 484660224. Throughput: 0: 5954.9. Samples: 484667618. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:31,769][25689] Avg episode reward: [(0, '-41.747')] [2022-07-09 23:45:31,935][26022] Updated weights on worker 0-0, policy_version 473302 (0.00080) [2022-07-09 23:45:33,468][26022] Updated weights on worker 0-0, policy_version 473312 (0.00089) [2022-07-09 23:45:35,359][26022] Updated weights on worker 0-0, policy_version 473322 (0.00087) [2022-07-09 23:45:36,812][25689] Fps is (10 sec: 5605.6, 60 sec: 5656.5, 300 sec: 5687.9). Total num frames: 484688896. Throughput: 0: 5093.7. Samples: 484684778. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:36,812][25689] Avg episode reward: [(0, '-42.036')] [2022-07-09 23:45:37,045][26022] Updated weights on worker 0-0, policy_version 473332 (0.00090) [2022-07-09 23:45:38,999][26022] Updated weights on worker 0-0, policy_version 473342 (0.00088) [2022-07-09 23:45:40,576][26022] Updated weights on worker 0-0, policy_version 473352 (0.00085) [2022-07-09 23:45:41,948][25689] Fps is (10 sec: 5733.3, 60 sec: 5670.2, 300 sec: 5692.3). Total num frames: 484718592. Throughput: 0: 5935.5. Samples: 484719134. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:41,949][25689] Avg episode reward: [(0, '-41.816')] [2022-07-09 23:45:42,065][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:45:42,077][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000473359_484719616.pth [2022-07-09 23:45:42,077][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000471354_482666496.pth [2022-07-09 23:45:42,594][26022] Updated weights on worker 0-0, policy_version 473362 (0.00088) [2022-07-09 23:45:44,267][26022] Updated weights on worker 0-0, policy_version 473372 (0.00088) [2022-07-09 23:45:46,059][26022] Updated weights on worker 0-0, policy_version 473382 (0.00085) [2022-07-09 23:45:46,992][25689] Fps is (10 sec: 5733.2, 60 sec: 5653.4, 300 sec: 5691.6). Total num frames: 484747264. Throughput: 0: 5941.1. Samples: 484753614. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:46,993][25689] Avg episode reward: [(0, '-43.092')] [2022-07-09 23:45:47,948][26022] Updated weights on worker 0-0, policy_version 473392 (0.00387) [2022-07-09 23:45:49,611][26022] Updated weights on worker 0-0, policy_version 473402 (0.00097) [2022-07-09 23:45:51,428][26022] Updated weights on worker 0-0, policy_version 473412 (0.00084) [2022-07-09 23:45:52,025][25689] Fps is (10 sec: 5894.1, 60 sec: 5690.2, 300 sec: 5694.8). Total num frames: 484777984. Throughput: 0: 5093.7. Samples: 484770802. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:52,026][25689] Avg episode reward: [(0, '-44.711')] [2022-07-09 23:45:53,223][26022] Updated weights on worker 0-0, policy_version 473422 (0.00083) [2022-07-09 23:45:54,891][26022] Updated weights on worker 0-0, policy_version 473432 (0.00083) [2022-07-09 23:45:57,006][26022] Updated weights on worker 0-0, policy_version 473442 (0.00087) [2022-07-09 23:45:57,101][25689] Fps is (10 sec: 5672.8, 60 sec: 5683.7, 300 sec: 5688.5). Total num frames: 484804608. Throughput: 0: 5942.1. Samples: 484805340. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:45:57,101][25689] Avg episode reward: [(0, '-44.545')] [2022-07-09 23:45:58,639][26022] Updated weights on worker 0-0, policy_version 473452 (0.00094) [2022-07-09 23:46:00,374][26022] Updated weights on worker 0-0, policy_version 473462 (0.00092) [2022-07-09 23:46:02,135][25689] Fps is (10 sec: 5469.7, 60 sec: 5678.0, 300 sec: 5691.7). Total num frames: 484833280. Throughput: 0: 5969.3. Samples: 484839630. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:02,135][25689] Avg episode reward: [(0, '-45.241')] [2022-07-09 23:46:02,755][26022] Updated weights on worker 0-0, policy_version 473472 (0.00094) [2022-07-09 23:46:04,146][26022] Updated weights on worker 0-0, policy_version 473482 (0.00088) [2022-07-09 23:46:06,302][26022] Updated weights on worker 0-0, policy_version 473492 (0.00080) [2022-07-09 23:46:07,141][25689] Fps is (10 sec: 5711.3, 60 sec: 5695.4, 300 sec: 5695.3). Total num frames: 484861952. Throughput: 0: 5008.0. Samples: 484854520. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:07,142][25689] Avg episode reward: [(0, '-44.922')] [2022-07-09 23:46:07,833][26022] Updated weights on worker 0-0, policy_version 473502 (0.00091) [2022-07-09 23:46:09,775][26022] Updated weights on worker 0-0, policy_version 473512 (0.00783) [2022-07-09 23:46:11,569][26022] Updated weights on worker 0-0, policy_version 473522 (0.00081) [2022-07-09 23:46:12,162][25689] Fps is (10 sec: 5514.4, 60 sec: 5680.8, 300 sec: 5685.1). Total num frames: 484888576. Throughput: 0: 5864.4. Samples: 484888894. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:12,163][25689] Avg episode reward: [(0, '-44.735')] [2022-07-09 23:46:13,287][26022] Updated weights on worker 0-0, policy_version 473532 (0.00088) [2022-07-09 23:46:15,174][26022] Updated weights on worker 0-0, policy_version 473542 (0.00553) [2022-07-09 23:46:16,923][26022] Updated weights on worker 0-0, policy_version 473552 (0.00083) [2022-07-09 23:46:17,186][25689] Fps is (10 sec: 5709.0, 60 sec: 5686.4, 300 sec: 5692.6). Total num frames: 484919296. Throughput: 0: 5880.2. Samples: 484923444. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:17,186][25689] Avg episode reward: [(0, '-45.124')] [2022-07-09 23:46:18,721][26022] Updated weights on worker 0-0, policy_version 473562 (0.00090) [2022-07-09 23:46:20,398][26022] Updated weights on worker 0-0, policy_version 473572 (0.00090) [2022-07-09 23:46:22,227][25689] Fps is (10 sec: 5799.4, 60 sec: 5688.7, 300 sec: 5685.3). Total num frames: 484946944. Throughput: 0: 5027.8. Samples: 484940650. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:22,227][25689] Avg episode reward: [(0, '-44.836')] [2022-07-09 23:46:22,290][26022] Updated weights on worker 0-0, policy_version 473582 (0.00084) [2022-07-09 23:46:24,094][26022] Updated weights on worker 0-0, policy_version 473592 (0.00082) [2022-07-09 23:46:25,731][26022] Updated weights on worker 0-0, policy_version 473602 (0.00096) [2022-07-09 23:46:27,238][25689] Fps is (10 sec: 5500.8, 60 sec: 5655.6, 300 sec: 5688.9). Total num frames: 484974592. Throughput: 0: 6002.2. Samples: 484975146. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:27,239][25689] Avg episode reward: [(0, '-44.403')] [2022-07-09 23:46:27,621][26022] Updated weights on worker 0-0, policy_version 473612 (0.00086) [2022-07-09 23:46:29,460][26022] Updated weights on worker 0-0, policy_version 473622 (0.00090) [2022-07-09 23:46:31,067][26022] Updated weights on worker 0-0, policy_version 473632 (0.00097) [2022-07-09 23:46:32,275][25689] Fps is (10 sec: 5808.9, 60 sec: 5703.3, 300 sec: 5689.0). Total num frames: 485005312. Throughput: 0: 5986.1. Samples: 485009290. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:32,275][25689] Avg episode reward: [(0, '-43.691')] [2022-07-09 23:46:33,313][26022] Updated weights on worker 0-0, policy_version 473642 (0.00088) [2022-07-09 23:46:34,662][26022] Updated weights on worker 0-0, policy_version 473652 (0.00084) [2022-07-09 23:46:36,744][26022] Updated weights on worker 0-0, policy_version 473662 (0.00092) [2022-07-09 23:46:37,306][25689] Fps is (10 sec: 5797.3, 60 sec: 5687.5, 300 sec: 5682.7). Total num frames: 485032960. Throughput: 0: 5118.2. Samples: 485026424. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:37,307][25689] Avg episode reward: [(0, '-43.736')] [2022-07-09 23:46:38,190][26022] Updated weights on worker 0-0, policy_version 473672 (0.00089) [2022-07-09 23:46:40,211][26022] Updated weights on worker 0-0, policy_version 473682 (0.00103) [2022-07-09 23:46:42,040][26022] Updated weights on worker 0-0, policy_version 473692 (0.00086) [2022-07-09 23:46:42,391][25689] Fps is (10 sec: 5668.2, 60 sec: 5692.4, 300 sec: 5689.9). Total num frames: 485062656. Throughput: 0: 5959.0. Samples: 485060812. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:42,392][25689] Avg episode reward: [(0, '-43.058')] [2022-07-09 23:46:43,724][26022] Updated weights on worker 0-0, policy_version 473702 (0.00087) [2022-07-09 23:46:45,692][26022] Updated weights on worker 0-0, policy_version 473712 (0.00086) [2022-07-09 23:46:47,408][25689] Fps is (10 sec: 5778.3, 60 sec: 5695.0, 300 sec: 5689.8). Total num frames: 485091328. Throughput: 0: 5946.0. Samples: 485095074. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:47,408][25689] Avg episode reward: [(0, '-42.788')] [2022-07-09 23:46:47,409][26022] Updated weights on worker 0-0, policy_version 473722 (0.00100) [2022-07-09 23:46:49,278][26022] Updated weights on worker 0-0, policy_version 473732 (0.00094) [2022-07-09 23:46:51,033][26022] Updated weights on worker 0-0, policy_version 473742 (0.00084) [2022-07-09 23:46:52,420][25689] Fps is (10 sec: 5718.0, 60 sec: 5663.0, 300 sec: 5687.0). Total num frames: 485120000. Throughput: 0: 5954.5. Samples: 485129246. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:52,421][25689] Avg episode reward: [(0, '-43.476')] [2022-07-09 23:46:52,867][26022] Updated weights on worker 0-0, policy_version 473752 (0.00079) [2022-07-09 23:46:54,540][26022] Updated weights on worker 0-0, policy_version 473762 (0.00576) [2022-07-09 23:46:56,466][26022] Updated weights on worker 0-0, policy_version 473772 (0.00085) [2022-07-09 23:46:57,463][25689] Fps is (10 sec: 5804.9, 60 sec: 5717.0, 300 sec: 5695.6). Total num frames: 485149696. Throughput: 0: 5960.0. Samples: 485146556. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:46:57,464][25689] Avg episode reward: [(0, '-43.652')] [2022-07-09 23:46:58,052][26022] Updated weights on worker 0-0, policy_version 473782 (0.00081) [2022-07-09 23:46:59,999][26022] Updated weights on worker 0-0, policy_version 473792 (0.00083) [2022-07-09 23:47:01,703][26022] Updated weights on worker 0-0, policy_version 473802 (0.00098) [2022-07-09 23:47:02,528][25689] Fps is (10 sec: 5369.2, 60 sec: 5646.2, 300 sec: 5680.9). Total num frames: 485174272. Throughput: 0: 5977.4. Samples: 485181178. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:02,529][25689] Avg episode reward: [(0, '-44.214')] [2022-07-09 23:47:03,920][26022] Updated weights on worker 0-0, policy_version 473812 (0.00089) [2022-07-09 23:47:05,653][26022] Updated weights on worker 0-0, policy_version 473822 (0.00086) [2022-07-09 23:47:07,444][26022] Updated weights on worker 0-0, policy_version 473832 (0.00084) [2022-07-09 23:47:07,552][25689] Fps is (10 sec: 5379.3, 60 sec: 5661.5, 300 sec: 5684.2). Total num frames: 485203968. Throughput: 0: 5879.1. Samples: 485213502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:07,552][25689] Avg episode reward: [(0, '-45.045')] [2022-07-09 23:47:09,200][26022] Updated weights on worker 0-0, policy_version 473842 (0.00076) [2022-07-09 23:47:11,039][26022] Updated weights on worker 0-0, policy_version 473852 (0.00087) [2022-07-09 23:47:12,631][25689] Fps is (10 sec: 5777.6, 60 sec: 5690.0, 300 sec: 5690.2). Total num frames: 485232640. Throughput: 0: 5013.5. Samples: 485230572. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:12,631][25689] Avg episode reward: [(0, '-45.405')] [2022-07-09 23:47:12,794][26022] Updated weights on worker 0-0, policy_version 473862 (0.00090) [2022-07-09 23:47:14,659][26022] Updated weights on worker 0-0, policy_version 473872 (0.00094) [2022-07-09 23:47:16,256][26022] Updated weights on worker 0-0, policy_version 473882 (0.00091) [2022-07-09 23:47:17,669][25689] Fps is (10 sec: 5769.4, 60 sec: 5671.7, 300 sec: 5684.8). Total num frames: 485262336. Throughput: 0: 5869.4. Samples: 485265154. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:17,669][25689] Avg episode reward: [(0, '-45.611')] [2022-07-09 23:47:18,010][26022] Updated weights on worker 0-0, policy_version 473892 (0.00080) [2022-07-09 23:47:19,963][26022] Updated weights on worker 0-0, policy_version 473902 (0.00080) [2022-07-09 23:47:21,579][26022] Updated weights on worker 0-0, policy_version 473912 (0.00072) [2022-07-09 23:47:22,767][25689] Fps is (10 sec: 5758.5, 60 sec: 5683.2, 300 sec: 5690.3). Total num frames: 485291008. Throughput: 0: 5862.4. Samples: 485299824. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:22,767][25689] Avg episode reward: [(0, '-45.686')] [2022-07-09 23:47:23,562][26022] Updated weights on worker 0-0, policy_version 473922 (0.00120) [2022-07-09 23:47:24,997][26022] Updated weights on worker 0-0, policy_version 473932 (0.00088) [2022-07-09 23:47:27,014][26022] Updated weights on worker 0-0, policy_version 473942 (0.00086) [2022-07-09 23:47:27,790][25689] Fps is (10 sec: 5767.0, 60 sec: 5716.0, 300 sec: 5691.7). Total num frames: 485320704. Throughput: 0: 5119.5. Samples: 485317110. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:27,790][25689] Avg episode reward: [(0, '-46.127')] [2022-07-09 23:47:28,812][26022] Updated weights on worker 0-0, policy_version 473952 (0.00091) [2022-07-09 23:47:30,729][26022] Updated weights on worker 0-0, policy_version 473962 (0.00091) [2022-07-09 23:47:32,513][26022] Updated weights on worker 0-0, policy_version 473972 (0.00093) [2022-07-09 23:47:32,811][25689] Fps is (10 sec: 5709.2, 60 sec: 5666.7, 300 sec: 5681.9). Total num frames: 485348352. Throughput: 0: 5974.9. Samples: 485351146. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:32,811][25689] Avg episode reward: [(0, '-46.281')] [2022-07-09 23:47:34,327][26022] Updated weights on worker 0-0, policy_version 473982 (0.00490) [2022-07-09 23:47:36,132][26022] Updated weights on worker 0-0, policy_version 473992 (0.00095) [2022-07-09 23:47:37,732][26022] Updated weights on worker 0-0, policy_version 474002 (0.00091) [2022-07-09 23:47:37,830][25689] Fps is (10 sec: 5711.1, 60 sec: 5701.7, 300 sec: 5690.0). Total num frames: 485378048. Throughput: 0: 5982.4. Samples: 485385770. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:37,831][25689] Avg episode reward: [(0, '-45.075')] [2022-07-09 23:47:39,641][26022] Updated weights on worker 0-0, policy_version 474012 (0.00086) [2022-07-09 23:47:41,318][26022] Updated weights on worker 0-0, policy_version 474022 (0.00089) [2022-07-09 23:47:42,172][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:47:42,184][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000474026_485402624.pth [2022-07-09 23:47:42,185][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000472023_483351552.pth [2022-07-09 23:47:42,867][25689] Fps is (10 sec: 5906.3, 60 sec: 5706.3, 300 sec: 5690.1). Total num frames: 485407744. Throughput: 0: 5131.2. Samples: 485402960. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:42,867][25689] Avg episode reward: [(0, '-44.304')] [2022-07-09 23:47:43,003][26022] Updated weights on worker 0-0, policy_version 474032 (0.00088) [2022-07-09 23:47:45,043][26022] Updated weights on worker 0-0, policy_version 474042 (0.00091) [2022-07-09 23:47:46,664][26022] Updated weights on worker 0-0, policy_version 474052 (0.00089) [2022-07-09 23:47:47,871][25689] Fps is (10 sec: 5711.0, 60 sec: 5690.4, 300 sec: 5690.5). Total num frames: 485435392. Throughput: 0: 5994.8. Samples: 485437496. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:47,872][25689] Avg episode reward: [(0, '-45.057')] [2022-07-09 23:47:48,347][26022] Updated weights on worker 0-0, policy_version 474062 (0.00079) [2022-07-09 23:47:50,161][26022] Updated weights on worker 0-0, policy_version 474072 (0.00090) [2022-07-09 23:47:52,158][26022] Updated weights on worker 0-0, policy_version 474082 (0.00094) [2022-07-09 23:47:52,903][25689] Fps is (10 sec: 5713.7, 60 sec: 5705.6, 300 sec: 5686.7). Total num frames: 485465088. Throughput: 0: 6028.9. Samples: 485472278. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:52,903][25689] Avg episode reward: [(0, '-44.416')] [2022-07-09 23:47:53,828][26022] Updated weights on worker 0-0, policy_version 474092 (0.00086) [2022-07-09 23:47:55,511][26022] Updated weights on worker 0-0, policy_version 474102 (0.00099) [2022-07-09 23:47:57,211][26022] Updated weights on worker 0-0, policy_version 474112 (0.00086) [2022-07-09 23:47:57,943][25689] Fps is (10 sec: 5795.4, 60 sec: 5688.9, 300 sec: 5691.8). Total num frames: 485493760. Throughput: 0: 5164.1. Samples: 485489632. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-09 23:47:57,943][25689] Avg episode reward: [(0, '-44.099')] [2022-07-09 23:47:59,136][26022] Updated weights on worker 0-0, policy_version 474122 (0.00080) [2022-07-09 23:48:00,988][26022] Updated weights on worker 0-0, policy_version 474132 (0.00087) [2022-07-09 23:48:03,061][25689] Fps is (10 sec: 5544.4, 60 sec: 5734.7, 300 sec: 5693.8). Total num frames: 485521408. Throughput: 0: 5993.8. Samples: 485524000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:03,061][25689] Avg episode reward: [(0, '-44.146')] [2022-07-09 23:48:03,068][26022] Updated weights on worker 0-0, policy_version 474142 (0.00086) [2022-07-09 23:48:04,913][26022] Updated weights on worker 0-0, policy_version 474152 (0.00090) [2022-07-09 23:48:06,443][26022] Updated weights on worker 0-0, policy_version 474162 (0.00083) [2022-07-09 23:48:08,071][25689] Fps is (10 sec: 5459.3, 60 sec: 5702.1, 300 sec: 5683.5). Total num frames: 485549056. Throughput: 0: 5898.5. Samples: 485556646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:08,072][25689] Avg episode reward: [(0, '-44.234')] [2022-07-09 23:48:08,343][26022] Updated weights on worker 0-0, policy_version 474172 (0.00090) [2022-07-09 23:48:10,175][26022] Updated weights on worker 0-0, policy_version 474182 (0.00089) [2022-07-09 23:48:11,940][26022] Updated weights on worker 0-0, policy_version 474192 (0.00614) [2022-07-09 23:48:13,074][25689] Fps is (10 sec: 5624.5, 60 sec: 5709.3, 300 sec: 5690.9). Total num frames: 485577728. Throughput: 0: 5043.4. Samples: 485574010. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:13,075][25689] Avg episode reward: [(0, '-43.706')] [2022-07-09 23:48:13,671][26022] Updated weights on worker 0-0, policy_version 474202 (0.00087) [2022-07-09 23:48:15,241][26022] Updated weights on worker 0-0, policy_version 474212 (0.00084) [2022-07-09 23:48:17,335][26022] Updated weights on worker 0-0, policy_version 474222 (0.00082) [2022-07-09 23:48:18,084][25689] Fps is (10 sec: 6033.9, 60 sec: 5745.8, 300 sec: 5699.2). Total num frames: 485609472. Throughput: 0: 5931.0. Samples: 485609092. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:18,085][25689] Avg episode reward: [(0, '-43.035')] [2022-07-09 23:48:19,063][26022] Updated weights on worker 0-0, policy_version 474232 (0.00082) [2022-07-09 23:48:20,578][26022] Updated weights on worker 0-0, policy_version 474242 (0.00091) [2022-07-09 23:48:22,771][26022] Updated weights on worker 0-0, policy_version 474252 (0.00089) [2022-07-09 23:48:23,135][25689] Fps is (10 sec: 5801.6, 60 sec: 5716.4, 300 sec: 5689.0). Total num frames: 485636096. Throughput: 0: 5963.8. Samples: 485643718. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:23,135][25689] Avg episode reward: [(0, '-43.406')] [2022-07-09 23:48:24,223][26022] Updated weights on worker 0-0, policy_version 474262 (0.00090) [2022-07-09 23:48:26,318][26022] Updated weights on worker 0-0, policy_version 474272 (0.00096) [2022-07-09 23:48:27,776][26022] Updated weights on worker 0-0, policy_version 474282 (0.00081) [2022-07-09 23:48:28,165][25689] Fps is (10 sec: 5688.6, 60 sec: 5732.7, 300 sec: 5699.1). Total num frames: 485666816. Throughput: 0: 5193.7. Samples: 485661004. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:28,165][25689] Avg episode reward: [(0, '-42.934')] [2022-07-09 23:48:29,850][26022] Updated weights on worker 0-0, policy_version 474292 (0.00095) [2022-07-09 23:48:31,347][26022] Updated weights on worker 0-0, policy_version 474302 (0.00071) [2022-07-09 23:48:33,184][25689] Fps is (10 sec: 5706.1, 60 sec: 5715.9, 300 sec: 5688.8). Total num frames: 485693440. Throughput: 0: 6022.1. Samples: 485695116. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:33,185][25689] Avg episode reward: [(0, '-43.151')] [2022-07-09 23:48:33,401][26022] Updated weights on worker 0-0, policy_version 474312 (0.00092) [2022-07-09 23:48:35,022][26022] Updated weights on worker 0-0, policy_version 474322 (0.00091) [2022-07-09 23:48:36,839][26022] Updated weights on worker 0-0, policy_version 474332 (0.00082) [2022-07-09 23:48:38,236][25689] Fps is (10 sec: 5591.9, 60 sec: 5712.8, 300 sec: 5692.9). Total num frames: 485723136. Throughput: 0: 5974.7. Samples: 485729496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:38,237][25689] Avg episode reward: [(0, '-43.408')] [2022-07-09 23:48:38,809][26022] Updated weights on worker 0-0, policy_version 474342 (0.00095) [2022-07-09 23:48:40,443][26022] Updated weights on worker 0-0, policy_version 474352 (0.00099) [2022-07-09 23:48:42,387][26022] Updated weights on worker 0-0, policy_version 474362 (0.00089) [2022-07-09 23:48:43,316][25689] Fps is (10 sec: 5760.9, 60 sec: 5691.8, 300 sec: 5688.4). Total num frames: 485751808. Throughput: 0: 5094.4. Samples: 485746530. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:43,316][25689] Avg episode reward: [(0, '-43.688')] [2022-07-09 23:48:44,207][26022] Updated weights on worker 0-0, policy_version 474372 (0.00082) [2022-07-09 23:48:45,919][26022] Updated weights on worker 0-0, policy_version 474382 (0.00090) [2022-07-09 23:48:47,495][26022] Updated weights on worker 0-0, policy_version 474392 (0.00086) [2022-07-09 23:48:48,319][25689] Fps is (10 sec: 5789.2, 60 sec: 5725.8, 300 sec: 5695.7). Total num frames: 485781504. Throughput: 0: 5949.1. Samples: 485780902. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:48,320][25689] Avg episode reward: [(0, '-44.073')] [2022-07-09 23:48:49,504][26022] Updated weights on worker 0-0, policy_version 474402 (0.00096) [2022-07-09 23:48:51,114][26022] Updated weights on worker 0-0, policy_version 474412 (0.00088) [2022-07-09 23:48:53,202][26022] Updated weights on worker 0-0, policy_version 474422 (0.00094) [2022-07-09 23:48:53,338][25689] Fps is (10 sec: 5721.7, 60 sec: 5693.1, 300 sec: 5692.2). Total num frames: 485809152. Throughput: 0: 5975.9. Samples: 485815554. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:53,340][25689] Avg episode reward: [(0, '-44.023')] [2022-07-09 23:48:54,787][26022] Updated weights on worker 0-0, policy_version 474432 (0.00087) [2022-07-09 23:48:56,699][26022] Updated weights on worker 0-0, policy_version 474442 (0.00088) [2022-07-09 23:48:58,278][26022] Updated weights on worker 0-0, policy_version 474452 (0.00087) [2022-07-09 23:48:58,356][25689] Fps is (10 sec: 5713.0, 60 sec: 5712.1, 300 sec: 5689.5). Total num frames: 485838848. Throughput: 0: 5126.8. Samples: 485832646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:48:58,358][25689] Avg episode reward: [(0, '-44.491')] [2022-07-09 23:49:00,300][26022] Updated weights on worker 0-0, policy_version 474462 (0.00094) [2022-07-09 23:49:01,928][26022] Updated weights on worker 0-0, policy_version 474472 (0.00103) [2022-07-09 23:49:03,441][25689] Fps is (10 sec: 5574.9, 60 sec: 5698.3, 300 sec: 5691.5). Total num frames: 485865472. Throughput: 0: 5946.2. Samples: 485866196. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:03,441][25689] Avg episode reward: [(0, '-43.877')] [2022-07-09 23:49:04,243][26022] Updated weights on worker 0-0, policy_version 474482 (0.00089) [2022-07-09 23:49:05,578][26022] Updated weights on worker 0-0, policy_version 474492 (0.00103) [2022-07-09 23:49:07,729][26022] Updated weights on worker 0-0, policy_version 474502 (0.00089) [2022-07-09 23:49:08,461][25689] Fps is (10 sec: 5573.4, 60 sec: 5731.3, 300 sec: 5694.6). Total num frames: 485895168. Throughput: 0: 5903.8. Samples: 485899820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:08,462][25689] Avg episode reward: [(0, '-43.792')] [2022-07-09 23:49:09,312][26022] Updated weights on worker 0-0, policy_version 474512 (0.00085) [2022-07-09 23:49:11,147][26022] Updated weights on worker 0-0, policy_version 474522 (0.00083) [2022-07-09 23:49:12,937][26022] Updated weights on worker 0-0, policy_version 474532 (0.00091) [2022-07-09 23:49:13,485][25689] Fps is (10 sec: 5709.0, 60 sec: 5712.3, 300 sec: 5694.2). Total num frames: 485922816. Throughput: 0: 5044.0. Samples: 485917176. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:13,486][25689] Avg episode reward: [(0, '-43.899')] [2022-07-09 23:49:14,789][26022] Updated weights on worker 0-0, policy_version 474542 (0.00092) [2022-07-09 23:49:16,675][26022] Updated weights on worker 0-0, policy_version 474552 (0.00092) [2022-07-09 23:49:18,399][26022] Updated weights on worker 0-0, policy_version 474562 (0.00082) [2022-07-09 23:49:18,490][25689] Fps is (10 sec: 5615.8, 60 sec: 5662.0, 300 sec: 5692.4). Total num frames: 485951488. Throughput: 0: 5922.6. Samples: 485951892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:18,491][25689] Avg episode reward: [(0, '-42.932')] [2022-07-09 23:49:20,040][26022] Updated weights on worker 0-0, policy_version 474572 (0.00090) [2022-07-09 23:49:21,932][26022] Updated weights on worker 0-0, policy_version 474582 (0.00093) [2022-07-09 23:49:23,594][25689] Fps is (10 sec: 5774.0, 60 sec: 5707.8, 300 sec: 5690.6). Total num frames: 485981184. Throughput: 0: 5939.3. Samples: 485985892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:23,595][25689] Avg episode reward: [(0, '-43.021')] [2022-07-09 23:49:23,631][26022] Updated weights on worker 0-0, policy_version 474592 (0.00094) [2022-07-09 23:49:25,549][26022] Updated weights on worker 0-0, policy_version 474602 (0.00089) [2022-07-09 23:49:27,327][26022] Updated weights on worker 0-0, policy_version 474612 (0.00088) [2022-07-09 23:49:28,665][25689] Fps is (10 sec: 5736.7, 60 sec: 5670.1, 300 sec: 5692.9). Total num frames: 486009856. Throughput: 0: 5104.2. Samples: 486002942. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:28,674][25689] Avg episode reward: [(0, '-43.419')] [2022-07-09 23:49:29,079][26022] Updated weights on worker 0-0, policy_version 474622 (0.00096) [2022-07-09 23:49:31,095][26022] Updated weights on worker 0-0, policy_version 474632 (0.00093) [2022-07-09 23:49:32,683][26022] Updated weights on worker 0-0, policy_version 474642 (0.00089) [2022-07-09 23:49:33,773][25689] Fps is (10 sec: 5633.4, 60 sec: 5695.5, 300 sec: 5687.6). Total num frames: 486038528. Throughput: 0: 5912.7. Samples: 486037134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:33,774][25689] Avg episode reward: [(0, '-44.086')] [2022-07-09 23:49:34,522][26022] Updated weights on worker 0-0, policy_version 474652 (0.00090) [2022-07-09 23:49:36,378][26022] Updated weights on worker 0-0, policy_version 474662 (0.00086) [2022-07-09 23:49:38,015][26022] Updated weights on worker 0-0, policy_version 474672 (0.00086) [2022-07-09 23:49:38,820][25689] Fps is (10 sec: 5646.5, 60 sec: 5679.1, 300 sec: 5688.7). Total num frames: 486067200. Throughput: 0: 5883.8. Samples: 486071510. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:38,821][25689] Avg episode reward: [(0, '-43.578')] [2022-07-09 23:49:39,770][26022] Updated weights on worker 0-0, policy_version 474682 (0.00088) [2022-07-09 23:49:41,660][26022] Updated weights on worker 0-0, policy_version 474692 (0.00089) [2022-07-09 23:49:42,208][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:49:42,216][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000474695_486087680.pth [2022-07-09 23:49:42,217][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000472690_484034560.pth [2022-07-09 23:49:43,431][26022] Updated weights on worker 0-0, policy_version 474702 (0.00093) [2022-07-09 23:49:43,897][25689] Fps is (10 sec: 5866.5, 60 sec: 5713.2, 300 sec: 5691.5). Total num frames: 486097920. Throughput: 0: 5917.1. Samples: 486106028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:43,898][25689] Avg episode reward: [(0, '-44.202')] [2022-07-09 23:49:45,148][26022] Updated weights on worker 0-0, policy_version 474712 (0.00081) [2022-07-09 23:49:46,952][26022] Updated weights on worker 0-0, policy_version 474722 (0.00091) [2022-07-09 23:49:48,830][26022] Updated weights on worker 0-0, policy_version 474732 (0.00081) [2022-07-09 23:49:48,927][25689] Fps is (10 sec: 5775.1, 60 sec: 5676.8, 300 sec: 5688.7). Total num frames: 486125568. Throughput: 0: 5921.5. Samples: 486122926. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:48,928][25689] Avg episode reward: [(0, '-45.127')] [2022-07-09 23:49:50,657][26022] Updated weights on worker 0-0, policy_version 474742 (0.00084) [2022-07-09 23:49:52,331][26022] Updated weights on worker 0-0, policy_version 474752 (0.00085) [2022-07-09 23:49:53,954][25689] Fps is (10 sec: 5600.3, 60 sec: 5693.0, 300 sec: 5695.2). Total num frames: 486154240. Throughput: 0: 5968.1. Samples: 486157574. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:53,954][25689] Avg episode reward: [(0, '-43.941')] [2022-07-09 23:49:54,299][26022] Updated weights on worker 0-0, policy_version 474762 (0.00086) [2022-07-09 23:49:55,862][26022] Updated weights on worker 0-0, policy_version 474772 (0.00086) [2022-07-09 23:49:57,935][26022] Updated weights on worker 0-0, policy_version 474782 (0.00090) [2022-07-09 23:49:59,020][25689] Fps is (10 sec: 5783.1, 60 sec: 5688.5, 300 sec: 5696.9). Total num frames: 486183936. Throughput: 0: 5959.5. Samples: 486191890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:49:59,022][25689] Avg episode reward: [(0, '-43.285')] [2022-07-09 23:49:59,371][26022] Updated weights on worker 0-0, policy_version 474792 (0.00090) [2022-07-09 23:50:01,867][26022] Updated weights on worker 0-0, policy_version 474802 (0.00089) [2022-07-09 23:50:03,254][26022] Updated weights on worker 0-0, policy_version 474812 (0.00090) [2022-07-09 23:50:04,110][25689] Fps is (10 sec: 5444.6, 60 sec: 5671.1, 300 sec: 5688.5). Total num frames: 486209536. Throughput: 0: 5001.2. Samples: 486207116. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:04,112][25689] Avg episode reward: [(0, '-43.525')] [2022-07-09 23:50:05,336][26022] Updated weights on worker 0-0, policy_version 474822 (0.00110) [2022-07-09 23:50:06,999][26022] Updated weights on worker 0-0, policy_version 474832 (0.00087) [2022-07-09 23:50:08,768][26022] Updated weights on worker 0-0, policy_version 474842 (0.00091) [2022-07-09 23:50:09,116][25689] Fps is (10 sec: 5578.3, 60 sec: 5689.4, 300 sec: 5699.6). Total num frames: 486240256. Throughput: 0: 5867.0. Samples: 486241376. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:09,117][25689] Avg episode reward: [(0, '-43.426')] [2022-07-09 23:50:10,599][26022] Updated weights on worker 0-0, policy_version 474852 (0.00089) [2022-07-09 23:50:12,528][26022] Updated weights on worker 0-0, policy_version 474862 (0.00093) [2022-07-09 23:50:14,127][25689] Fps is (10 sec: 5826.7, 60 sec: 5690.6, 300 sec: 5690.6). Total num frames: 486267904. Throughput: 0: 5855.0. Samples: 486275690. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:14,129][25689] Avg episode reward: [(0, '-42.738')] [2022-07-09 23:50:14,308][26022] Updated weights on worker 0-0, policy_version 474872 (0.00086) [2022-07-09 23:50:15,930][26022] Updated weights on worker 0-0, policy_version 474882 (0.00092) [2022-07-09 23:50:17,732][26022] Updated weights on worker 0-0, policy_version 474892 (0.00082) [2022-07-09 23:50:19,154][25689] Fps is (10 sec: 5712.7, 60 sec: 5705.4, 300 sec: 5698.3). Total num frames: 486297600. Throughput: 0: 5027.5. Samples: 486293114. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:19,156][25689] Avg episode reward: [(0, '-42.994')] [2022-07-09 23:50:19,514][26022] Updated weights on worker 0-0, policy_version 474902 (0.00089) [2022-07-09 23:50:21,370][26022] Updated weights on worker 0-0, policy_version 474912 (0.00097) [2022-07-09 23:50:23,108][26022] Updated weights on worker 0-0, policy_version 474922 (0.00087) [2022-07-09 23:50:24,223][25689] Fps is (10 sec: 5680.0, 60 sec: 5674.9, 300 sec: 5690.4). Total num frames: 486325248. Throughput: 0: 5979.7. Samples: 486327386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:24,224][25689] Avg episode reward: [(0, '-44.307')] [2022-07-09 23:50:24,922][26022] Updated weights on worker 0-0, policy_version 474932 (0.00051) [2022-07-09 23:50:26,630][26022] Updated weights on worker 0-0, policy_version 474942 (0.00097) [2022-07-09 23:50:28,616][26022] Updated weights on worker 0-0, policy_version 474952 (0.00965) [2022-07-09 23:50:29,266][25689] Fps is (10 sec: 5569.8, 60 sec: 5677.5, 300 sec: 5693.1). Total num frames: 486353920. Throughput: 0: 5962.5. Samples: 486361518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:29,266][25689] Avg episode reward: [(0, '-43.933')] [2022-07-09 23:50:30,409][26022] Updated weights on worker 0-0, policy_version 474962 (0.00086) [2022-07-09 23:50:32,212][26022] Updated weights on worker 0-0, policy_version 474972 (0.00085) [2022-07-09 23:50:33,996][26022] Updated weights on worker 0-0, policy_version 474982 (0.00083) [2022-07-09 23:50:34,288][25689] Fps is (10 sec: 5697.6, 60 sec: 5685.7, 300 sec: 5693.6). Total num frames: 486382592. Throughput: 0: 5108.5. Samples: 486378680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:34,288][25689] Avg episode reward: [(0, '-44.510')] [2022-07-09 23:50:35,608][26022] Updated weights on worker 0-0, policy_version 474992 (0.00089) [2022-07-09 23:50:37,517][26022] Updated weights on worker 0-0, policy_version 475002 (0.00087) [2022-07-09 23:50:39,171][26022] Updated weights on worker 0-0, policy_version 475012 (0.00084) [2022-07-09 23:50:39,308][25689] Fps is (10 sec: 5812.3, 60 sec: 5705.1, 300 sec: 5695.8). Total num frames: 486412288. Throughput: 0: 5978.9. Samples: 486413612. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:39,309][25689] Avg episode reward: [(0, '-44.617')] [2022-07-09 23:50:40,930][26022] Updated weights on worker 0-0, policy_version 475022 (0.00088) [2022-07-09 23:50:42,750][26022] Updated weights on worker 0-0, policy_version 475032 (0.00090) [2022-07-09 23:50:44,366][25689] Fps is (10 sec: 5994.5, 60 sec: 5706.9, 300 sec: 5702.4). Total num frames: 486443008. Throughput: 0: 6010.3. Samples: 486448452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-09 23:50:44,367][25689] Avg episode reward: [(0, '-43.885')] [2022-07-09 23:50:44,369][26022] Updated weights on worker 0-0, policy_version 475042 (0.00084) [2022-07-09 23:50:46,180][26022] Updated weights on worker 0-0, policy_version 475052 (0.00089) [2022-07-09 23:50:48,213][26022] Updated weights on worker 0-0, policy_version 475062 (0.00083) [2022-07-09 23:50:49,399][25689] Fps is (10 sec: 5885.9, 60 sec: 5723.6, 300 sec: 5695.5). Total num frames: 486471680. Throughput: 0: 5184.9. Samples: 486465904. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:50:49,399][25689] Avg episode reward: [(0, '-43.282')] [2022-07-09 23:50:49,807][26022] Updated weights on worker 0-0, policy_version 475072 (0.00088) [2022-07-09 23:50:51,598][26022] Updated weights on worker 0-0, policy_version 475082 (0.00086) [2022-07-09 23:50:53,363][26022] Updated weights on worker 0-0, policy_version 475092 (0.00089) [2022-07-09 23:50:54,409][25689] Fps is (10 sec: 5710.1, 60 sec: 5725.1, 300 sec: 5703.7). Total num frames: 486500352. Throughput: 0: 6060.7. Samples: 486500628. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:50:54,410][25689] Avg episode reward: [(0, '-43.161')] [2022-07-09 23:50:55,066][26022] Updated weights on worker 0-0, policy_version 475102 (0.00088) [2022-07-09 23:50:57,100][26022] Updated weights on worker 0-0, policy_version 475112 (0.00091) [2022-07-09 23:50:58,642][26022] Updated weights on worker 0-0, policy_version 475122 (0.00083) [2022-07-09 23:50:59,434][25689] Fps is (10 sec: 5612.0, 60 sec: 5695.1, 300 sec: 5700.4). Total num frames: 486528000. Throughput: 0: 6035.4. Samples: 486535082. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:50:59,435][25689] Avg episode reward: [(0, '-43.494')] [2022-07-09 23:51:00,563][26022] Updated weights on worker 0-0, policy_version 475132 (0.00085) [2022-07-09 23:51:02,729][26022] Updated weights on worker 0-0, policy_version 475142 (0.00090) [2022-07-09 23:51:04,393][26022] Updated weights on worker 0-0, policy_version 475152 (0.00087) [2022-07-09 23:51:04,542][25689] Fps is (10 sec: 5456.8, 60 sec: 5727.3, 300 sec: 5695.0). Total num frames: 486555648. Throughput: 0: 5136.2. Samples: 486552080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:04,543][25689] Avg episode reward: [(0, '-43.250')] [2022-07-09 23:51:06,202][26022] Updated weights on worker 0-0, policy_version 475162 (0.00082) [2022-07-09 23:51:08,069][26022] Updated weights on worker 0-0, policy_version 475172 (0.00082) [2022-07-09 23:51:09,550][25689] Fps is (10 sec: 5668.6, 60 sec: 5710.2, 300 sec: 5705.6). Total num frames: 486585344. Throughput: 0: 5875.9. Samples: 486584314. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:09,551][25689] Avg episode reward: [(0, '-42.842')] [2022-07-09 23:51:09,783][26022] Updated weights on worker 0-0, policy_version 475182 (0.00085) [2022-07-09 23:51:11,815][26022] Updated weights on worker 0-0, policy_version 475192 (0.00086) [2022-07-09 23:51:13,436][26022] Updated weights on worker 0-0, policy_version 475202 (0.00088) [2022-07-09 23:51:14,570][25689] Fps is (10 sec: 5616.5, 60 sec: 5692.4, 300 sec: 5691.9). Total num frames: 486611968. Throughput: 0: 5850.7. Samples: 486618584. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:14,571][25689] Avg episode reward: [(0, '-43.213')] [2022-07-09 23:51:15,251][26022] Updated weights on worker 0-0, policy_version 475212 (0.00095) [2022-07-09 23:51:17,083][26022] Updated weights on worker 0-0, policy_version 475222 (0.00086) [2022-07-09 23:51:18,751][26022] Updated weights on worker 0-0, policy_version 475232 (0.00087) [2022-07-09 23:51:19,607][25689] Fps is (10 sec: 5600.5, 60 sec: 5691.5, 300 sec: 5698.8). Total num frames: 486641664. Throughput: 0: 4992.0. Samples: 486635782. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:19,607][25689] Avg episode reward: [(0, '-43.378')] [2022-07-09 23:51:20,656][26022] Updated weights on worker 0-0, policy_version 475242 (0.00086) [2022-07-09 23:51:22,226][26022] Updated weights on worker 0-0, policy_version 475252 (0.00081) [2022-07-09 23:51:24,276][26022] Updated weights on worker 0-0, policy_version 475262 (0.00085) [2022-07-09 23:51:24,718][25689] Fps is (10 sec: 5852.9, 60 sec: 5721.4, 300 sec: 5703.8). Total num frames: 486671360. Throughput: 0: 5859.9. Samples: 486670304. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:24,718][25689] Avg episode reward: [(0, '-43.332')] [2022-07-09 23:51:26,047][26022] Updated weights on worker 0-0, policy_version 475272 (0.00056) [2022-07-09 23:51:27,648][26022] Updated weights on worker 0-0, policy_version 475282 (0.00086) [2022-07-09 23:51:29,694][26022] Updated weights on worker 0-0, policy_version 475292 (0.00084) [2022-07-09 23:51:29,725][25689] Fps is (10 sec: 5667.1, 60 sec: 5707.8, 300 sec: 5694.0). Total num frames: 486699008. Throughput: 0: 5951.6. Samples: 486704388. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:29,726][25689] Avg episode reward: [(0, '-43.258')] [2022-07-09 23:51:31,180][26022] Updated weights on worker 0-0, policy_version 475302 (0.00089) [2022-07-09 23:51:33,414][26022] Updated weights on worker 0-0, policy_version 475312 (0.00088) [2022-07-09 23:51:34,756][25689] Fps is (10 sec: 5610.5, 60 sec: 5706.9, 300 sec: 5697.5). Total num frames: 486727680. Throughput: 0: 5089.5. Samples: 486721320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:34,756][25689] Avg episode reward: [(0, '-43.089')] [2022-07-09 23:51:35,120][26022] Updated weights on worker 0-0, policy_version 475322 (0.00088) [2022-07-09 23:51:36,889][26022] Updated weights on worker 0-0, policy_version 475332 (0.00087) [2022-07-09 23:51:38,675][26022] Updated weights on worker 0-0, policy_version 475342 (0.00084) [2022-07-09 23:51:39,815][25689] Fps is (10 sec: 5683.6, 60 sec: 5686.4, 300 sec: 5694.6). Total num frames: 486756352. Throughput: 0: 5927.6. Samples: 486755568. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:39,816][25689] Avg episode reward: [(0, '-43.326')] [2022-07-09 23:51:40,415][26022] Updated weights on worker 0-0, policy_version 475352 (0.00082) [2022-07-09 23:51:42,136][26022] Updated weights on worker 0-0, policy_version 475362 (0.00087) [2022-07-09 23:51:42,627][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:51:42,639][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000475364_486772736.pth [2022-07-09 23:51:42,640][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000473359_484719616.pth [2022-07-09 23:51:44,111][26022] Updated weights on worker 0-0, policy_version 475372 (0.00052) [2022-07-09 23:51:44,896][25689] Fps is (10 sec: 5655.1, 60 sec: 5650.4, 300 sec: 5693.3). Total num frames: 486785024. Throughput: 0: 5900.7. Samples: 486789372. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:44,897][25689] Avg episode reward: [(0, '-42.932')] [2022-07-09 23:51:46,019][26022] Updated weights on worker 0-0, policy_version 475382 (0.00078) [2022-07-09 23:51:47,644][26022] Updated weights on worker 0-0, policy_version 475392 (0.00097) [2022-07-09 23:51:49,667][26022] Updated weights on worker 0-0, policy_version 475402 (0.00091) [2022-07-09 23:51:49,903][25689] Fps is (10 sec: 5582.9, 60 sec: 5635.9, 300 sec: 5690.0). Total num frames: 486812672. Throughput: 0: 5057.4. Samples: 486806432. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:49,903][25689] Avg episode reward: [(0, '-43.457')] [2022-07-09 23:51:51,176][26022] Updated weights on worker 0-0, policy_version 475412 (0.00088) [2022-07-09 23:51:53,092][26022] Updated weights on worker 0-0, policy_version 475422 (0.00085) [2022-07-09 23:51:54,755][26022] Updated weights on worker 0-0, policy_version 475432 (0.00095) [2022-07-09 23:51:54,911][25689] Fps is (10 sec: 5726.1, 60 sec: 5653.0, 300 sec: 5690.6). Total num frames: 486842368. Throughput: 0: 5929.4. Samples: 486840826. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:54,911][25689] Avg episode reward: [(0, '-42.637')] [2022-07-09 23:51:56,669][26022] Updated weights on worker 0-0, policy_version 475442 (0.00094) [2022-07-09 23:51:58,436][26022] Updated weights on worker 0-0, policy_version 475452 (0.00090) [2022-07-09 23:51:59,937][25689] Fps is (10 sec: 5816.6, 60 sec: 5669.8, 300 sec: 5705.2). Total num frames: 486871040. Throughput: 0: 5951.2. Samples: 486875322. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:51:59,938][25689] Avg episode reward: [(0, '-41.610')] [2022-07-09 23:52:00,098][26022] Updated weights on worker 0-0, policy_version 475462 (0.00085) [2022-07-09 23:52:02,232][26022] Updated weights on worker 0-0, policy_version 475472 (0.00086) [2022-07-09 23:52:04,265][26022] Updated weights on worker 0-0, policy_version 475482 (0.00089) [2022-07-09 23:52:05,072][25689] Fps is (10 sec: 5542.3, 60 sec: 5667.3, 300 sec: 5696.1). Total num frames: 486898688. Throughput: 0: 5848.7. Samples: 486907378. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:05,073][25689] Avg episode reward: [(0, '-41.590')] [2022-07-09 23:52:05,877][26022] Updated weights on worker 0-0, policy_version 475492 (0.00089) [2022-07-09 23:52:07,782][26022] Updated weights on worker 0-0, policy_version 475502 (0.00089) [2022-07-09 23:52:09,443][26022] Updated weights on worker 0-0, policy_version 475512 (0.00085) [2022-07-09 23:52:10,089][25689] Fps is (10 sec: 5346.3, 60 sec: 5615.7, 300 sec: 5690.5). Total num frames: 486925312. Throughput: 0: 5847.4. Samples: 486924468. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:10,089][25689] Avg episode reward: [(0, '-41.357')] [2022-07-09 23:52:11,303][26022] Updated weights on worker 0-0, policy_version 475522 (0.00084) [2022-07-09 23:52:13,245][26022] Updated weights on worker 0-0, policy_version 475532 (0.00080) [2022-07-09 23:52:14,687][26022] Updated weights on worker 0-0, policy_version 475542 (0.00083) [2022-07-09 23:52:15,151][25689] Fps is (10 sec: 5791.1, 60 sec: 5696.3, 300 sec: 5696.9). Total num frames: 486957056. Throughput: 0: 5843.7. Samples: 486959106. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:15,152][25689] Avg episode reward: [(0, '-41.086')] [2022-07-09 23:52:16,668][26022] Updated weights on worker 0-0, policy_version 475552 (0.00086) [2022-07-09 23:52:18,335][26022] Updated weights on worker 0-0, policy_version 475562 (0.00082) [2022-07-09 23:52:20,222][25689] Fps is (10 sec: 5962.4, 60 sec: 5676.2, 300 sec: 5697.4). Total num frames: 486985728. Throughput: 0: 5847.3. Samples: 486993930. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:20,222][25689] Avg episode reward: [(0, '-41.007')] [2022-07-09 23:52:20,223][26022] Updated weights on worker 0-0, policy_version 475572 (0.00104) [2022-07-09 23:52:22,085][26022] Updated weights on worker 0-0, policy_version 475582 (0.00078) [2022-07-09 23:52:23,582][26022] Updated weights on worker 0-0, policy_version 475592 (0.00090) [2022-07-09 23:52:25,332][25689] Fps is (10 sec: 5632.4, 60 sec: 5659.3, 300 sec: 5692.3). Total num frames: 487014400. Throughput: 0: 5121.7. Samples: 487011144. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:25,333][25689] Avg episode reward: [(0, '-40.819')] [2022-07-09 23:52:25,607][26022] Updated weights on worker 0-0, policy_version 475602 (0.00086) [2022-07-09 23:52:27,252][26022] Updated weights on worker 0-0, policy_version 475612 (0.00104) [2022-07-09 23:52:29,202][26022] Updated weights on worker 0-0, policy_version 475622 (0.00101) [2022-07-09 23:52:30,351][25689] Fps is (10 sec: 5762.5, 60 sec: 5692.1, 300 sec: 5699.2). Total num frames: 487044096. Throughput: 0: 5989.6. Samples: 487045828. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:30,351][25689] Avg episode reward: [(0, '-40.738')] [2022-07-09 23:52:30,928][26022] Updated weights on worker 0-0, policy_version 475632 (0.00828) [2022-07-09 23:52:32,810][26022] Updated weights on worker 0-0, policy_version 475642 (0.00111) [2022-07-09 23:52:34,554][26022] Updated weights on worker 0-0, policy_version 475652 (0.00054) [2022-07-09 23:52:35,363][25689] Fps is (10 sec: 5716.9, 60 sec: 5677.0, 300 sec: 5692.5). Total num frames: 487071744. Throughput: 0: 5988.6. Samples: 487080146. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:35,364][25689] Avg episode reward: [(0, '-40.397')] [2022-07-09 23:52:36,316][26022] Updated weights on worker 0-0, policy_version 475662 (0.00079) [2022-07-09 23:52:38,080][26022] Updated weights on worker 0-0, policy_version 475672 (0.00276) [2022-07-09 23:52:39,719][26022] Updated weights on worker 0-0, policy_version 475682 (0.00092) [2022-07-09 23:52:40,381][25689] Fps is (10 sec: 5614.9, 60 sec: 5680.8, 300 sec: 5689.4). Total num frames: 487100416. Throughput: 0: 5138.5. Samples: 487097520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:40,382][25689] Avg episode reward: [(0, '-40.750')] [2022-07-09 23:52:41,503][26022] Updated weights on worker 0-0, policy_version 475692 (0.00088) [2022-07-09 23:52:43,448][26022] Updated weights on worker 0-0, policy_version 475702 (0.00084) [2022-07-09 23:52:45,076][26022] Updated weights on worker 0-0, policy_version 475712 (0.00091) [2022-07-09 23:52:45,444][25689] Fps is (10 sec: 5891.7, 60 sec: 5716.4, 300 sec: 5698.6). Total num frames: 487131136. Throughput: 0: 6011.1. Samples: 487132034. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:45,444][25689] Avg episode reward: [(0, '-40.652')] [2022-07-09 23:52:46,859][26022] Updated weights on worker 0-0, policy_version 475722 (0.00560) [2022-07-09 23:52:48,647][26022] Updated weights on worker 0-0, policy_version 475732 (0.00085) [2022-07-09 23:52:50,355][26022] Updated weights on worker 0-0, policy_version 475742 (0.00086) [2022-07-09 23:52:50,544][25689] Fps is (10 sec: 5844.1, 60 sec: 5724.4, 300 sec: 5693.8). Total num frames: 487159808. Throughput: 0: 5980.6. Samples: 487166596. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:50,544][25689] Avg episode reward: [(0, '-41.444')] [2022-07-09 23:52:52,417][26022] Updated weights on worker 0-0, policy_version 475752 (0.00084) [2022-07-09 23:52:53,940][26022] Updated weights on worker 0-0, policy_version 475762 (0.00088) [2022-07-09 23:52:55,556][25689] Fps is (10 sec: 5569.4, 60 sec: 5690.3, 300 sec: 5690.9). Total num frames: 487187456. Throughput: 0: 5129.7. Samples: 487183730. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:52:55,557][25689] Avg episode reward: [(0, '-42.878')] [2022-07-09 23:52:55,926][26022] Updated weights on worker 0-0, policy_version 475772 (0.00091) [2022-07-09 23:52:57,476][26022] Updated weights on worker 0-0, policy_version 475782 (0.00084) [2022-07-09 23:52:59,592][26022] Updated weights on worker 0-0, policy_version 475792 (0.00095) [2022-07-09 23:53:00,575][25689] Fps is (10 sec: 5716.5, 60 sec: 5707.9, 300 sec: 5699.7). Total num frames: 487217152. Throughput: 0: 5975.2. Samples: 487218184. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:00,577][25689] Avg episode reward: [(0, '-43.212')] [2022-07-09 23:53:01,318][26022] Updated weights on worker 0-0, policy_version 475802 (0.00087) [2022-07-09 23:53:03,673][26022] Updated weights on worker 0-0, policy_version 475812 (0.00087) [2022-07-09 23:53:05,139][26022] Updated weights on worker 0-0, policy_version 475822 (0.00083) [2022-07-09 23:53:05,698][25689] Fps is (10 sec: 5552.9, 60 sec: 5692.1, 300 sec: 5694.1). Total num frames: 487243776. Throughput: 0: 5827.8. Samples: 487250074. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:05,701][25689] Avg episode reward: [(0, '-43.588')] [2022-07-09 23:53:07,138][26022] Updated weights on worker 0-0, policy_version 475832 (0.00085) [2022-07-09 23:53:08,735][26022] Updated weights on worker 0-0, policy_version 475842 (0.00088) [2022-07-09 23:53:10,600][26022] Updated weights on worker 0-0, policy_version 475852 (0.00097) [2022-07-09 23:53:10,720][25689] Fps is (10 sec: 5551.9, 60 sec: 5742.3, 300 sec: 5697.2). Total num frames: 487273472. Throughput: 0: 4993.3. Samples: 487267340. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:10,721][25689] Avg episode reward: [(0, '-44.059')] [2022-07-09 23:53:12,416][26022] Updated weights on worker 0-0, policy_version 475862 (0.00095) [2022-07-09 23:53:14,103][26022] Updated weights on worker 0-0, policy_version 475872 (0.00090) [2022-07-09 23:53:15,735][25689] Fps is (10 sec: 5815.7, 60 sec: 5696.1, 300 sec: 5686.8). Total num frames: 487302144. Throughput: 0: 5863.4. Samples: 487302046. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:15,737][25689] Avg episode reward: [(0, '-44.750')] [2022-07-09 23:53:15,885][26022] Updated weights on worker 0-0, policy_version 475882 (0.00088) [2022-07-09 23:53:17,677][26022] Updated weights on worker 0-0, policy_version 475892 (0.00086) [2022-07-09 23:53:19,707][26022] Updated weights on worker 0-0, policy_version 475902 (0.00083) [2022-07-09 23:53:20,755][25689] Fps is (10 sec: 5612.3, 60 sec: 5683.9, 300 sec: 5690.8). Total num frames: 487329792. Throughput: 0: 5848.4. Samples: 487336202. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:20,756][25689] Avg episode reward: [(0, '-44.414')] [2022-07-09 23:53:21,253][26022] Updated weights on worker 0-0, policy_version 475912 (0.00084) [2022-07-09 23:53:23,152][26022] Updated weights on worker 0-0, policy_version 475922 (0.00090) [2022-07-09 23:53:24,845][26022] Updated weights on worker 0-0, policy_version 475932 (0.00089) [2022-07-09 23:53:25,823][25689] Fps is (10 sec: 5582.9, 60 sec: 5687.9, 300 sec: 5683.2). Total num frames: 487358464. Throughput: 0: 5141.8. Samples: 487353548. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:25,823][25689] Avg episode reward: [(0, '-43.634')] [2022-07-09 23:53:26,655][26022] Updated weights on worker 0-0, policy_version 475942 (0.00100) [2022-07-09 23:53:28,467][26022] Updated weights on worker 0-0, policy_version 475952 (0.00094) [2022-07-09 23:53:30,210][26022] Updated weights on worker 0-0, policy_version 475962 (0.00096) [2022-07-09 23:53:30,825][25689] Fps is (10 sec: 5694.5, 60 sec: 5672.5, 300 sec: 5690.4). Total num frames: 487387136. Throughput: 0: 5992.7. Samples: 487387824. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:30,825][25689] Avg episode reward: [(0, '-42.962')] [2022-07-09 23:53:32,144][26022] Updated weights on worker 0-0, policy_version 475972 (0.00093) [2022-07-09 23:53:33,901][26022] Updated weights on worker 0-0, policy_version 475982 (0.00091) [2022-07-09 23:53:35,504][26022] Updated weights on worker 0-0, policy_version 475992 (0.00085) [2022-07-09 23:53:35,831][25689] Fps is (10 sec: 5729.8, 60 sec: 5690.1, 300 sec: 5687.9). Total num frames: 487415808. Throughput: 0: 5969.9. Samples: 487422018. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-09 23:53:35,831][25689] Avg episode reward: [(0, '-43.162')] [2022-07-09 23:53:37,474][26022] Updated weights on worker 0-0, policy_version 476002 (0.00609) [2022-07-09 23:53:39,450][26022] Updated weights on worker 0-0, policy_version 476012 (0.00086) [2022-07-09 23:53:40,844][25689] Fps is (10 sec: 5825.4, 60 sec: 5707.4, 300 sec: 5692.6). Total num frames: 487445504. Throughput: 0: 5124.1. Samples: 487439144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:53:40,845][25689] Avg episode reward: [(0, '-43.125')] [2022-07-09 23:53:41,011][26022] Updated weights on worker 0-0, policy_version 476022 (0.00087) [2022-07-09 23:53:42,872][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:53:42,884][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000476030_487454720.pth [2022-07-09 23:53:42,885][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000474026_485402624.pth [2022-07-09 23:53:43,200][26022] Updated weights on worker 0-0, policy_version 476032 (0.00091) [2022-07-09 23:53:44,631][26022] Updated weights on worker 0-0, policy_version 476042 (0.00055) [2022-07-09 23:53:45,905][25689] Fps is (10 sec: 5590.7, 60 sec: 5639.9, 300 sec: 5681.1). Total num frames: 487472128. Throughput: 0: 5947.7. Samples: 487472990. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:53:45,905][25689] Avg episode reward: [(0, '-42.407')] [2022-07-09 23:53:46,652][26022] Updated weights on worker 0-0, policy_version 476052 (0.00085) [2022-07-09 23:53:48,215][26022] Updated weights on worker 0-0, policy_version 476062 (0.00084) [2022-07-09 23:53:50,158][26022] Updated weights on worker 0-0, policy_version 476072 (0.00086) [2022-07-09 23:53:50,913][25689] Fps is (10 sec: 5695.0, 60 sec: 5682.3, 300 sec: 5691.7). Total num frames: 487502848. Throughput: 0: 5965.1. Samples: 487507656. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:53:50,914][25689] Avg episode reward: [(0, '-41.924')] [2022-07-09 23:53:52,043][26022] Updated weights on worker 0-0, policy_version 476082 (0.00084) [2022-07-09 23:53:53,745][26022] Updated weights on worker 0-0, policy_version 476092 (0.01043) [2022-07-09 23:53:55,428][26022] Updated weights on worker 0-0, policy_version 476102 (0.00096) [2022-07-09 23:53:55,932][25689] Fps is (10 sec: 5821.1, 60 sec: 5681.8, 300 sec: 5684.8). Total num frames: 487530496. Throughput: 0: 5098.4. Samples: 487524500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:53:55,932][25689] Avg episode reward: [(0, '-42.381')] [2022-07-09 23:53:57,434][26022] Updated weights on worker 0-0, policy_version 476112 (0.00080) [2022-07-09 23:53:59,047][26022] Updated weights on worker 0-0, policy_version 476122 (0.00091) [2022-07-09 23:54:00,907][26022] Updated weights on worker 0-0, policy_version 476132 (0.00091) [2022-07-09 23:54:00,943][25689] Fps is (10 sec: 5615.6, 60 sec: 5665.6, 300 sec: 5693.1). Total num frames: 487559168. Throughput: 0: 5960.7. Samples: 487558944. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:00,943][25689] Avg episode reward: [(0, '-42.979')] [2022-07-09 23:54:02,928][26022] Updated weights on worker 0-0, policy_version 476142 (0.00613) [2022-07-09 23:54:04,923][26022] Updated weights on worker 0-0, policy_version 476152 (0.00085) [2022-07-09 23:54:06,003][25689] Fps is (10 sec: 5592.3, 60 sec: 5688.5, 300 sec: 5685.4). Total num frames: 487586816. Throughput: 0: 5870.2. Samples: 487590970. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:06,003][25689] Avg episode reward: [(0, '-42.501')] [2022-07-09 23:54:06,607][26022] Updated weights on worker 0-0, policy_version 476162 (0.00094) [2022-07-09 23:54:08,568][26022] Updated weights on worker 0-0, policy_version 476172 (0.00088) [2022-07-09 23:54:10,135][26022] Updated weights on worker 0-0, policy_version 476182 (0.00080) [2022-07-09 23:54:11,011][25689] Fps is (10 sec: 5390.6, 60 sec: 5638.8, 300 sec: 5682.3). Total num frames: 487613440. Throughput: 0: 4994.7. Samples: 487608034. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:11,012][25689] Avg episode reward: [(0, '-42.061')] [2022-07-09 23:54:12,122][26022] Updated weights on worker 0-0, policy_version 476192 (0.00179) [2022-07-09 23:54:13,880][26022] Updated weights on worker 0-0, policy_version 476202 (0.00080) [2022-07-09 23:54:15,506][26022] Updated weights on worker 0-0, policy_version 476212 (0.00089) [2022-07-09 23:54:16,026][25689] Fps is (10 sec: 5516.9, 60 sec: 5638.8, 300 sec: 5682.1). Total num frames: 487642112. Throughput: 0: 5859.1. Samples: 487642234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:16,029][25689] Avg episode reward: [(0, '-42.331')] [2022-07-09 23:54:17,504][26022] Updated weights on worker 0-0, policy_version 476222 (0.00085) [2022-07-09 23:54:19,355][26022] Updated weights on worker 0-0, policy_version 476232 (0.00084) [2022-07-09 23:54:20,975][26022] Updated weights on worker 0-0, policy_version 476242 (0.00086) [2022-07-09 23:54:21,038][25689] Fps is (10 sec: 5922.8, 60 sec: 5690.4, 300 sec: 5687.3). Total num frames: 487672832. Throughput: 0: 5848.3. Samples: 487676470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:21,040][25689] Avg episode reward: [(0, '-43.508')] [2022-07-09 23:54:22,975][26022] Updated weights on worker 0-0, policy_version 476252 (0.00080) [2022-07-09 23:54:24,468][26022] Updated weights on worker 0-0, policy_version 476262 (0.00091) [2022-07-09 23:54:26,155][25689] Fps is (10 sec: 5661.2, 60 sec: 5651.9, 300 sec: 5679.5). Total num frames: 487699456. Throughput: 0: 5101.7. Samples: 487693782. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:26,156][25689] Avg episode reward: [(0, '-42.674')] [2022-07-09 23:54:26,544][26022] Updated weights on worker 0-0, policy_version 476272 (0.00081) [2022-07-09 23:54:28,110][26022] Updated weights on worker 0-0, policy_version 476282 (0.00101) [2022-07-09 23:54:30,028][26022] Updated weights on worker 0-0, policy_version 476292 (0.00086) [2022-07-09 23:54:31,256][25689] Fps is (10 sec: 5511.8, 60 sec: 5659.6, 300 sec: 5683.1). Total num frames: 487729152. Throughput: 0: 5930.6. Samples: 487728104. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:31,257][25689] Avg episode reward: [(0, '-42.469')] [2022-07-09 23:54:31,976][26022] Updated weights on worker 0-0, policy_version 476302 (0.00091) [2022-07-09 23:54:33,514][26022] Updated weights on worker 0-0, policy_version 476312 (0.00086) [2022-07-09 23:54:35,430][26022] Updated weights on worker 0-0, policy_version 476322 (0.00101) [2022-07-09 23:54:36,282][25689] Fps is (10 sec: 5865.0, 60 sec: 5674.7, 300 sec: 5687.0). Total num frames: 487758848. Throughput: 0: 5919.2. Samples: 487762132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:36,282][25689] Avg episode reward: [(0, '-42.628')] [2022-07-09 23:54:37,167][26022] Updated weights on worker 0-0, policy_version 476332 (0.00090) [2022-07-09 23:54:39,098][26022] Updated weights on worker 0-0, policy_version 476342 (0.00086) [2022-07-09 23:54:40,736][26022] Updated weights on worker 0-0, policy_version 476352 (0.00090) [2022-07-09 23:54:41,318][25689] Fps is (10 sec: 5699.2, 60 sec: 5638.7, 300 sec: 5677.4). Total num frames: 487786496. Throughput: 0: 5907.8. Samples: 487796280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:41,319][25689] Avg episode reward: [(0, '-42.188')] [2022-07-09 23:54:42,484][26022] Updated weights on worker 0-0, policy_version 476362 (0.00090) [2022-07-09 23:54:44,357][26022] Updated weights on worker 0-0, policy_version 476372 (0.00083) [2022-07-09 23:54:45,993][26022] Updated weights on worker 0-0, policy_version 476382 (0.00084) [2022-07-09 23:54:46,428][25689] Fps is (10 sec: 5752.6, 60 sec: 5701.7, 300 sec: 5686.2). Total num frames: 487817216. Throughput: 0: 5912.1. Samples: 487813638. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:46,428][25689] Avg episode reward: [(0, '-42.768')] [2022-07-09 23:54:48,044][26022] Updated weights on worker 0-0, policy_version 476392 (0.00085) [2022-07-09 23:54:49,505][26022] Updated weights on worker 0-0, policy_version 476402 (0.00087) [2022-07-09 23:54:51,467][25689] Fps is (10 sec: 5751.3, 60 sec: 5648.2, 300 sec: 5682.5). Total num frames: 487844864. Throughput: 0: 5953.0. Samples: 487848416. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:51,467][25689] Avg episode reward: [(0, '-42.566')] [2022-07-09 23:54:51,514][26022] Updated weights on worker 0-0, policy_version 476412 (0.00087) [2022-07-09 23:54:53,012][26022] Updated weights on worker 0-0, policy_version 476422 (0.00081) [2022-07-09 23:54:55,119][26022] Updated weights on worker 0-0, policy_version 476432 (0.00083) [2022-07-09 23:54:56,506][25689] Fps is (10 sec: 5588.1, 60 sec: 5663.1, 300 sec: 5679.6). Total num frames: 487873536. Throughput: 0: 5966.0. Samples: 487882794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:54:56,507][25689] Avg episode reward: [(0, '-42.384')] [2022-07-09 23:54:56,856][26022] Updated weights on worker 0-0, policy_version 476442 (0.00086) [2022-07-09 23:54:58,683][26022] Updated weights on worker 0-0, policy_version 476452 (0.00090) [2022-07-09 23:55:00,300][26022] Updated weights on worker 0-0, policy_version 476462 (0.00085) [2022-07-09 23:55:01,531][25689] Fps is (10 sec: 5697.6, 60 sec: 5661.8, 300 sec: 5691.2). Total num frames: 487902208. Throughput: 0: 5135.8. Samples: 487900092. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:01,532][25689] Avg episode reward: [(0, '-41.596')] [2022-07-09 23:55:02,783][26022] Updated weights on worker 0-0, policy_version 476472 (0.00082) [2022-07-09 23:55:04,185][26022] Updated weights on worker 0-0, policy_version 476482 (0.00092) [2022-07-09 23:55:06,315][26022] Updated weights on worker 0-0, policy_version 476492 (0.00078) [2022-07-09 23:55:06,630][25689] Fps is (10 sec: 5563.4, 60 sec: 5658.2, 300 sec: 5679.1). Total num frames: 487929856. Throughput: 0: 5862.4. Samples: 487932070. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:06,632][25689] Avg episode reward: [(0, '-41.566')] [2022-07-09 23:55:07,757][26022] Updated weights on worker 0-0, policy_version 476502 (0.00084) [2022-07-09 23:55:09,897][26022] Updated weights on worker 0-0, policy_version 476512 (0.00093) [2022-07-09 23:55:11,344][26022] Updated weights on worker 0-0, policy_version 476522 (0.01128) [2022-07-09 23:55:11,681][25689] Fps is (10 sec: 5649.7, 60 sec: 5704.8, 300 sec: 5685.2). Total num frames: 487959552. Throughput: 0: 5843.0. Samples: 487966530. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:11,682][25689] Avg episode reward: [(0, '-42.591')] [2022-07-09 23:55:13,346][26022] Updated weights on worker 0-0, policy_version 476532 (0.00085) [2022-07-09 23:55:15,007][26022] Updated weights on worker 0-0, policy_version 476542 (0.00081) [2022-07-09 23:55:16,739][25689] Fps is (10 sec: 5773.5, 60 sec: 5700.7, 300 sec: 5681.1). Total num frames: 487988224. Throughput: 0: 4989.8. Samples: 487983752. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:16,741][25689] Avg episode reward: [(0, '-42.319')] [2022-07-09 23:55:16,908][26022] Updated weights on worker 0-0, policy_version 476552 (0.00093) [2022-07-09 23:55:18,434][26022] Updated weights on worker 0-0, policy_version 476562 (0.00089) [2022-07-09 23:55:20,639][26022] Updated weights on worker 0-0, policy_version 476572 (0.00090) [2022-07-09 23:55:21,791][25689] Fps is (10 sec: 5672.4, 60 sec: 5663.4, 300 sec: 5684.9). Total num frames: 488016896. Throughput: 0: 5827.5. Samples: 488018152. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:21,791][25689] Avg episode reward: [(0, '-43.132')] [2022-07-09 23:55:22,331][26022] Updated weights on worker 0-0, policy_version 476582 (0.00088) [2022-07-09 23:55:24,127][26022] Updated weights on worker 0-0, policy_version 476592 (0.00087) [2022-07-09 23:55:25,779][26022] Updated weights on worker 0-0, policy_version 476602 (0.00086) [2022-07-09 23:55:26,914][25689] Fps is (10 sec: 5635.9, 60 sec: 5696.5, 300 sec: 5683.4). Total num frames: 488045568. Throughput: 0: 5930.1. Samples: 488052358. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:26,915][25689] Avg episode reward: [(0, '-44.196')] [2022-07-09 23:55:27,754][26022] Updated weights on worker 0-0, policy_version 476612 (0.00086) [2022-07-09 23:55:29,355][26022] Updated weights on worker 0-0, policy_version 476622 (0.00089) [2022-07-09 23:55:31,399][26022] Updated weights on worker 0-0, policy_version 476632 (0.00087) [2022-07-09 23:55:31,927][25689] Fps is (10 sec: 5657.2, 60 sec: 5687.9, 300 sec: 5683.5). Total num frames: 488074240. Throughput: 0: 5096.5. Samples: 488069712. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:31,928][25689] Avg episode reward: [(0, '-44.393')] [2022-07-09 23:55:32,988][26022] Updated weights on worker 0-0, policy_version 476642 (0.00085) [2022-07-09 23:55:34,975][26022] Updated weights on worker 0-0, policy_version 476652 (0.00081) [2022-07-09 23:55:36,637][26022] Updated weights on worker 0-0, policy_version 476662 (0.00088) [2022-07-09 23:55:36,968][25689] Fps is (10 sec: 5805.5, 60 sec: 5686.4, 300 sec: 5683.2). Total num frames: 488103936. Throughput: 0: 5925.0. Samples: 488103606. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:36,969][25689] Avg episode reward: [(0, '-44.720')] [2022-07-09 23:55:38,457][26022] Updated weights on worker 0-0, policy_version 476672 (0.00086) [2022-07-09 23:55:40,236][26022] Updated weights on worker 0-0, policy_version 476682 (0.00094) [2022-07-09 23:55:41,983][25689] Fps is (10 sec: 5702.8, 60 sec: 5688.5, 300 sec: 5673.7). Total num frames: 488131584. Throughput: 0: 5929.3. Samples: 488137876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:41,983][25689] Avg episode reward: [(0, '-44.337')] [2022-07-09 23:55:42,277][26022] Updated weights on worker 0-0, policy_version 476692 (0.00087) [2022-07-09 23:55:42,913][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:55:42,928][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000476696_488136704.pth [2022-07-09 23:55:42,929][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000474695_486087680.pth [2022-07-09 23:55:43,759][26022] Updated weights on worker 0-0, policy_version 476702 (0.00087) [2022-07-09 23:55:45,850][26022] Updated weights on worker 0-0, policy_version 476712 (0.00089) [2022-07-09 23:55:47,129][25689] Fps is (10 sec: 5643.5, 60 sec: 5668.2, 300 sec: 5674.9). Total num frames: 488161280. Throughput: 0: 5068.7. Samples: 488154822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:47,130][25689] Avg episode reward: [(0, '-44.493')] [2022-07-09 23:55:47,397][26022] Updated weights on worker 0-0, policy_version 476722 (0.00086) [2022-07-09 23:55:49,354][26022] Updated weights on worker 0-0, policy_version 476732 (0.00085) [2022-07-09 23:55:51,035][26022] Updated weights on worker 0-0, policy_version 476742 (0.00085) [2022-07-09 23:55:52,136][25689] Fps is (10 sec: 5648.0, 60 sec: 5671.2, 300 sec: 5671.6). Total num frames: 488188928. Throughput: 0: 5901.3. Samples: 488188968. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:52,136][25689] Avg episode reward: [(0, '-44.493')] [2022-07-09 23:55:52,871][26022] Updated weights on worker 0-0, policy_version 476752 (0.00085) [2022-07-09 23:55:54,645][26022] Updated weights on worker 0-0, policy_version 476762 (0.00084) [2022-07-09 23:55:56,437][26022] Updated weights on worker 0-0, policy_version 476772 (0.00087) [2022-07-09 23:55:57,164][25689] Fps is (10 sec: 5714.2, 60 sec: 5689.1, 300 sec: 5678.4). Total num frames: 488218624. Throughput: 0: 5935.7. Samples: 488223486. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:55:57,165][25689] Avg episode reward: [(0, '-43.934')] [2022-07-09 23:55:58,119][26022] Updated weights on worker 0-0, policy_version 476782 (0.00079) [2022-07-09 23:56:00,123][26022] Updated weights on worker 0-0, policy_version 476792 (0.00094) [2022-07-09 23:56:01,805][26022] Updated weights on worker 0-0, policy_version 476802 (0.00086) [2022-07-09 23:56:02,191][25689] Fps is (10 sec: 5601.0, 60 sec: 5655.2, 300 sec: 5676.5). Total num frames: 488245248. Throughput: 0: 5087.0. Samples: 488240676. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:56:02,191][25689] Avg episode reward: [(0, '-43.398')] [2022-07-09 23:56:04,007][26022] Updated weights on worker 0-0, policy_version 476812 (0.00093) [2022-07-09 23:56:05,888][26022] Updated weights on worker 0-0, policy_version 476822 (0.00084) [2022-07-09 23:56:07,280][25689] Fps is (10 sec: 5365.2, 60 sec: 5656.0, 300 sec: 5668.1). Total num frames: 488272896. Throughput: 0: 5837.9. Samples: 488272460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:56:07,281][25689] Avg episode reward: [(0, '-43.686')] [2022-07-09 23:56:07,735][26022] Updated weights on worker 0-0, policy_version 476832 (0.00085) [2022-07-09 23:56:09,313][26022] Updated weights on worker 0-0, policy_version 476842 (0.00082) [2022-07-09 23:56:11,294][26022] Updated weights on worker 0-0, policy_version 476852 (0.00090) [2022-07-09 23:56:12,355][25689] Fps is (10 sec: 5641.6, 60 sec: 5653.8, 300 sec: 5677.3). Total num frames: 488302592. Throughput: 0: 5822.3. Samples: 488306694. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:56:12,356][25689] Avg episode reward: [(0, '-43.944')] [2022-07-09 23:56:12,900][26022] Updated weights on worker 0-0, policy_version 476862 (0.00057) [2022-07-09 23:56:14,813][26022] Updated weights on worker 0-0, policy_version 476872 (0.00085) [2022-07-09 23:56:16,498][26022] Updated weights on worker 0-0, policy_version 476882 (0.00446) [2022-07-09 23:56:17,381][25689] Fps is (10 sec: 5778.6, 60 sec: 5656.9, 300 sec: 5674.1). Total num frames: 488331264. Throughput: 0: 4962.3. Samples: 488323806. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:56:17,381][25689] Avg episode reward: [(0, '-44.544')] [2022-07-09 23:56:18,415][26022] Updated weights on worker 0-0, policy_version 476892 (0.00093) [2022-07-09 23:56:20,140][26022] Updated weights on worker 0-0, policy_version 476902 (0.00087) [2022-07-09 23:56:22,022][26022] Updated weights on worker 0-0, policy_version 476912 (0.00092) [2022-07-09 23:56:22,382][25689] Fps is (10 sec: 5616.8, 60 sec: 5644.6, 300 sec: 5669.3). Total num frames: 488358912. Throughput: 0: 5820.7. Samples: 488358206. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:56:22,383][25689] Avg episode reward: [(0, '-44.246')] [2022-07-09 23:56:23,862][26022] Updated weights on worker 0-0, policy_version 476922 (0.00092) [2022-07-09 23:56:25,704][26022] Updated weights on worker 0-0, policy_version 476932 (0.00085) [2022-07-09 23:56:27,306][26022] Updated weights on worker 0-0, policy_version 476942 (0.00086) [2022-07-09 23:56:27,422][25689] Fps is (10 sec: 5710.6, 60 sec: 5669.3, 300 sec: 5675.6). Total num frames: 488388608. Throughput: 0: 5950.0. Samples: 488392306. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2022-07-09 23:56:27,423][25689] Avg episode reward: [(0, '-44.436')] [2022-07-09 23:56:29,323][26022] Updated weights on worker 0-0, policy_version 476952 (0.00085) [2022-07-09 23:56:30,771][26022] Updated weights on worker 0-0, policy_version 476962 (0.00101) [2022-07-09 23:56:32,441][25689] Fps is (10 sec: 5802.7, 60 sec: 5668.8, 300 sec: 5675.8). Total num frames: 488417280. Throughput: 0: 5972.4. Samples: 488426654. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:56:32,442][25689] Avg episode reward: [(0, '-45.079')] [2022-07-09 23:56:32,940][26022] Updated weights on worker 0-0, policy_version 476972 (0.00086) [2022-07-09 23:56:34,582][26022] Updated weights on worker 0-0, policy_version 476982 (0.00094) [2022-07-09 23:56:36,581][26022] Updated weights on worker 0-0, policy_version 476992 (0.00086) [2022-07-09 23:56:37,452][25689] Fps is (10 sec: 5717.6, 60 sec: 5654.7, 300 sec: 5676.7). Total num frames: 488445952. Throughput: 0: 5966.4. Samples: 488443558. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:56:37,452][25689] Avg episode reward: [(0, '-45.293')] [2022-07-09 23:56:38,104][26022] Updated weights on worker 0-0, policy_version 477002 (0.00084) [2022-07-09 23:56:39,911][26022] Updated weights on worker 0-0, policy_version 477012 (0.00090) [2022-07-09 23:56:41,735][26022] Updated weights on worker 0-0, policy_version 477022 (0.00087) [2022-07-09 23:56:42,493][25689] Fps is (10 sec: 5602.7, 60 sec: 5652.1, 300 sec: 5674.0). Total num frames: 488473600. Throughput: 0: 5953.2. Samples: 488477930. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:56:42,494][25689] Avg episode reward: [(0, '-44.671')] [2022-07-09 23:56:43,432][26022] Updated weights on worker 0-0, policy_version 477032 (0.00093) [2022-07-09 23:56:45,504][26022] Updated weights on worker 0-0, policy_version 477042 (0.00089) [2022-07-09 23:56:47,183][26022] Updated weights on worker 0-0, policy_version 477052 (0.00096) [2022-07-09 23:56:47,575][25689] Fps is (10 sec: 5563.6, 60 sec: 5641.3, 300 sec: 5676.0). Total num frames: 488502272. Throughput: 0: 5948.6. Samples: 488512184. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:56:47,575][25689] Avg episode reward: [(0, '-44.575')] [2022-07-09 23:56:48,953][26022] Updated weights on worker 0-0, policy_version 477062 (0.00086) [2022-07-09 23:56:50,987][26022] Updated weights on worker 0-0, policy_version 477072 (0.00094) [2022-07-09 23:56:52,449][26022] Updated weights on worker 0-0, policy_version 477082 (0.00087) [2022-07-09 23:56:52,622][25689] Fps is (10 sec: 5762.4, 60 sec: 5671.3, 300 sec: 5675.3). Total num frames: 488531968. Throughput: 0: 5084.9. Samples: 488529274. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:56:52,623][25689] Avg episode reward: [(0, '-43.631')] [2022-07-09 23:56:54,635][26022] Updated weights on worker 0-0, policy_version 477092 (0.00085) [2022-07-09 23:56:55,962][26022] Updated weights on worker 0-0, policy_version 477102 (0.00085) [2022-07-09 23:56:57,641][25689] Fps is (10 sec: 5696.5, 60 sec: 5638.4, 300 sec: 5672.0). Total num frames: 488559616. Throughput: 0: 5940.7. Samples: 488563498. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:56:57,642][25689] Avg episode reward: [(0, '-43.581')] [2022-07-09 23:56:58,099][26022] Updated weights on worker 0-0, policy_version 477112 (0.00084) [2022-07-09 23:56:59,886][26022] Updated weights on worker 0-0, policy_version 477122 (0.00083) [2022-07-09 23:57:01,429][26022] Updated weights on worker 0-0, policy_version 477132 (0.00098) [2022-07-09 23:57:02,650][25689] Fps is (10 sec: 5514.5, 60 sec: 5657.0, 300 sec: 5674.4). Total num frames: 488587264. Throughput: 0: 5850.6. Samples: 488595858. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:02,650][25689] Avg episode reward: [(0, '-43.213')] [2022-07-09 23:57:03,766][26022] Updated weights on worker 0-0, policy_version 477142 (0.00087) [2022-07-09 23:57:05,350][26022] Updated weights on worker 0-0, policy_version 477152 (0.00100) [2022-07-09 23:57:07,343][26022] Updated weights on worker 0-0, policy_version 477162 (0.00089) [2022-07-09 23:57:07,721][25689] Fps is (10 sec: 5587.6, 60 sec: 5675.6, 300 sec: 5680.3). Total num frames: 488615936. Throughput: 0: 4993.3. Samples: 488612780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:07,721][25689] Avg episode reward: [(0, '-43.396')] [2022-07-09 23:57:09,248][26022] Updated weights on worker 0-0, policy_version 477172 (0.00094) [2022-07-09 23:57:10,836][26022] Updated weights on worker 0-0, policy_version 477182 (0.00085) [2022-07-09 23:57:12,754][25689] Fps is (10 sec: 5675.3, 60 sec: 5662.6, 300 sec: 5670.5). Total num frames: 488644608. Throughput: 0: 5842.2. Samples: 488646886. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:12,755][25689] Avg episode reward: [(0, '-44.091')] [2022-07-09 23:57:12,769][26022] Updated weights on worker 0-0, policy_version 477192 (0.00098) [2022-07-09 23:57:14,320][26022] Updated weights on worker 0-0, policy_version 477202 (0.00092) [2022-07-09 23:57:16,403][26022] Updated weights on worker 0-0, policy_version 477212 (0.00082) [2022-07-09 23:57:17,770][25689] Fps is (10 sec: 5706.3, 60 sec: 5663.5, 300 sec: 5671.5). Total num frames: 488673280. Throughput: 0: 5861.5. Samples: 488681484. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:17,771][25689] Avg episode reward: [(0, '-42.874')] [2022-07-09 23:57:18,008][26022] Updated weights on worker 0-0, policy_version 477222 (0.00088) [2022-07-09 23:57:19,778][26022] Updated weights on worker 0-0, policy_version 477232 (0.00079) [2022-07-09 23:57:21,873][26022] Updated weights on worker 0-0, policy_version 477242 (0.01150) [2022-07-09 23:57:22,798][25689] Fps is (10 sec: 5607.5, 60 sec: 5661.1, 300 sec: 5669.7). Total num frames: 488700928. Throughput: 0: 5105.4. Samples: 488698722. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:22,798][25689] Avg episode reward: [(0, '-43.276')] [2022-07-09 23:57:23,318][26022] Updated weights on worker 0-0, policy_version 477252 (0.00091) [2022-07-09 23:57:25,394][26022] Updated weights on worker 0-0, policy_version 477262 (0.00090) [2022-07-09 23:57:26,900][26022] Updated weights on worker 0-0, policy_version 477272 (0.00093) [2022-07-09 23:57:27,879][25689] Fps is (10 sec: 5672.7, 60 sec: 5657.2, 300 sec: 5668.5). Total num frames: 488730624. Throughput: 0: 5959.2. Samples: 488732906. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:27,879][25689] Avg episode reward: [(0, '-44.096')] [2022-07-09 23:57:28,977][26022] Updated weights on worker 0-0, policy_version 477282 (0.00087) [2022-07-09 23:57:30,562][26022] Updated weights on worker 0-0, policy_version 477292 (0.00087) [2022-07-09 23:57:32,398][26022] Updated weights on worker 0-0, policy_version 477302 (0.00080) [2022-07-09 23:57:32,899][25689] Fps is (10 sec: 5879.9, 60 sec: 5674.1, 300 sec: 5675.2). Total num frames: 488760320. Throughput: 0: 5978.0. Samples: 488767310. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:32,899][25689] Avg episode reward: [(0, '-43.463')] [2022-07-09 23:57:34,307][26022] Updated weights on worker 0-0, policy_version 477312 (0.00095) [2022-07-09 23:57:36,015][26022] Updated weights on worker 0-0, policy_version 477322 (0.00084) [2022-07-09 23:57:37,784][26022] Updated weights on worker 0-0, policy_version 477332 (0.00084) [2022-07-09 23:57:37,923][25689] Fps is (10 sec: 5709.3, 60 sec: 5655.9, 300 sec: 5671.7). Total num frames: 488787968. Throughput: 0: 5109.4. Samples: 488784452. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:37,923][25689] Avg episode reward: [(0, '-42.928')] [2022-07-09 23:57:39,429][26022] Updated weights on worker 0-0, policy_version 477342 (0.00087) [2022-07-09 23:57:41,462][26022] Updated weights on worker 0-0, policy_version 477352 (0.00094) [2022-07-09 23:57:42,951][25689] Fps is (10 sec: 5704.6, 60 sec: 5691.0, 300 sec: 5668.9). Total num frames: 488817664. Throughput: 0: 5970.1. Samples: 488819038. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:42,951][25689] Avg episode reward: [(0, '-42.753')] [2022-07-09 23:57:43,063][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:57:43,085][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000477362_488818688.pth [2022-07-09 23:57:43,085][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000475364_486772736.pth [2022-07-09 23:57:43,090][26022] Updated weights on worker 0-0, policy_version 477362 (0.00096) [2022-07-09 23:57:44,860][26022] Updated weights on worker 0-0, policy_version 477372 (0.00089) [2022-07-09 23:57:46,764][26022] Updated weights on worker 0-0, policy_version 477382 (0.00086) [2022-07-09 23:57:48,076][25689] Fps is (10 sec: 5849.3, 60 sec: 5703.8, 300 sec: 5671.8). Total num frames: 488847360. Throughput: 0: 5979.7. Samples: 488853682. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:48,077][25689] Avg episode reward: [(0, '-42.903')] [2022-07-09 23:57:48,525][26022] Updated weights on worker 0-0, policy_version 477392 (0.00095) [2022-07-09 23:57:50,300][26022] Updated weights on worker 0-0, policy_version 477402 (0.00083) [2022-07-09 23:57:51,946][26022] Updated weights on worker 0-0, policy_version 477412 (0.00093) [2022-07-09 23:57:53,120][25689] Fps is (10 sec: 5739.8, 60 sec: 5687.2, 300 sec: 5674.7). Total num frames: 488876032. Throughput: 0: 5121.8. Samples: 488870880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:53,120][25689] Avg episode reward: [(0, '-42.656')] [2022-07-09 23:57:53,780][26022] Updated weights on worker 0-0, policy_version 477422 (0.00084) [2022-07-09 23:57:55,632][26022] Updated weights on worker 0-0, policy_version 477432 (0.00099) [2022-07-09 23:57:57,254][26022] Updated weights on worker 0-0, policy_version 477442 (0.00619) [2022-07-09 23:57:58,166][25689] Fps is (10 sec: 5683.5, 60 sec: 5701.6, 300 sec: 5670.7). Total num frames: 488904704. Throughput: 0: 5991.3. Samples: 488905734. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:57:58,167][25689] Avg episode reward: [(0, '-41.842')] [2022-07-09 23:57:59,173][26022] Updated weights on worker 0-0, policy_version 477452 (0.00080) [2022-07-09 23:58:00,775][26022] Updated weights on worker 0-0, policy_version 477462 (0.00084) [2022-07-09 23:58:02,988][26022] Updated weights on worker 0-0, policy_version 477472 (0.00085) [2022-07-09 23:58:03,187][25689] Fps is (10 sec: 5696.3, 60 sec: 5717.4, 300 sec: 5679.5). Total num frames: 488933376. Throughput: 0: 5896.1. Samples: 488938350. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:03,189][25689] Avg episode reward: [(0, '-41.976')] [2022-07-09 23:58:04,814][26022] Updated weights on worker 0-0, policy_version 477482 (0.00082) [2022-07-09 23:58:06,397][26022] Updated weights on worker 0-0, policy_version 477492 (0.00089) [2022-07-09 23:58:08,246][25689] Fps is (10 sec: 5384.2, 60 sec: 5667.7, 300 sec: 5665.1). Total num frames: 488958976. Throughput: 0: 5047.1. Samples: 488955478. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:08,249][25689] Avg episode reward: [(0, '-42.572')] [2022-07-09 23:58:08,643][26022] Updated weights on worker 0-0, policy_version 477502 (0.00089) [2022-07-09 23:58:10,144][26022] Updated weights on worker 0-0, policy_version 477512 (0.00088) [2022-07-09 23:58:11,930][26022] Updated weights on worker 0-0, policy_version 477522 (0.00088) [2022-07-09 23:58:13,270][25689] Fps is (10 sec: 5585.5, 60 sec: 5702.4, 300 sec: 5671.8). Total num frames: 488989696. Throughput: 0: 5902.0. Samples: 488989806. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:13,272][25689] Avg episode reward: [(0, '-42.262')] [2022-07-09 23:58:13,703][26022] Updated weights on worker 0-0, policy_version 477532 (0.01118) [2022-07-09 23:58:15,467][26022] Updated weights on worker 0-0, policy_version 477542 (0.00097) [2022-07-09 23:58:17,415][26022] Updated weights on worker 0-0, policy_version 477552 (0.00085) [2022-07-09 23:58:18,302][25689] Fps is (10 sec: 5905.7, 60 sec: 5700.9, 300 sec: 5675.0). Total num frames: 489018368. Throughput: 0: 5904.2. Samples: 489024624. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:18,303][25689] Avg episode reward: [(0, '-42.632')] [2022-07-09 23:58:18,948][26022] Updated weights on worker 0-0, policy_version 477562 (0.00093) [2022-07-09 23:58:21,019][26022] Updated weights on worker 0-0, policy_version 477572 (0.00086) [2022-07-09 23:58:22,530][26022] Updated weights on worker 0-0, policy_version 477582 (0.00083) [2022-07-09 23:58:23,340][25689] Fps is (10 sec: 5694.9, 60 sec: 5716.9, 300 sec: 5675.6). Total num frames: 489047040. Throughput: 0: 5132.4. Samples: 489041782. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:23,340][25689] Avg episode reward: [(0, '-43.019')] [2022-07-09 23:58:24,455][26022] Updated weights on worker 0-0, policy_version 477592 (0.00089) [2022-07-09 23:58:26,170][26022] Updated weights on worker 0-0, policy_version 477602 (0.00089) [2022-07-09 23:58:28,002][26022] Updated weights on worker 0-0, policy_version 477612 (0.00091) [2022-07-09 23:58:28,403][25689] Fps is (10 sec: 5778.5, 60 sec: 5718.5, 300 sec: 5677.8). Total num frames: 489076736. Throughput: 0: 5988.3. Samples: 489076188. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:28,404][25689] Avg episode reward: [(0, '-44.017')] [2022-07-09 23:58:29,871][26022] Updated weights on worker 0-0, policy_version 477622 (0.00111) [2022-07-09 23:58:31,580][26022] Updated weights on worker 0-0, policy_version 477632 (0.00083) [2022-07-09 23:58:33,406][25689] Fps is (10 sec: 5696.6, 60 sec: 5686.3, 300 sec: 5674.4). Total num frames: 489104384. Throughput: 0: 5988.9. Samples: 489110398. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:33,406][25689] Avg episode reward: [(0, '-43.713')] [2022-07-09 23:58:33,437][26022] Updated weights on worker 0-0, policy_version 477642 (0.01314) [2022-07-09 23:58:35,108][26022] Updated weights on worker 0-0, policy_version 477652 (0.00083) [2022-07-09 23:58:37,139][26022] Updated weights on worker 0-0, policy_version 477662 (0.00086) [2022-07-09 23:58:38,409][25689] Fps is (10 sec: 5526.4, 60 sec: 5688.3, 300 sec: 5667.8). Total num frames: 489132032. Throughput: 0: 5109.4. Samples: 489127356. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:38,413][25689] Avg episode reward: [(0, '-43.887')] [2022-07-09 23:58:38,850][26022] Updated weights on worker 0-0, policy_version 477672 (0.00089) [2022-07-09 23:58:40,600][26022] Updated weights on worker 0-0, policy_version 477682 (0.00091) [2022-07-09 23:58:42,467][26022] Updated weights on worker 0-0, policy_version 477692 (0.00084) [2022-07-09 23:58:43,421][25689] Fps is (10 sec: 5725.9, 60 sec: 5689.9, 300 sec: 5679.0). Total num frames: 489161728. Throughput: 0: 5962.6. Samples: 489161518. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:43,421][25689] Avg episode reward: [(0, '-43.207')] [2022-07-09 23:58:44,335][26022] Updated weights on worker 0-0, policy_version 477702 (0.00095) [2022-07-09 23:58:46,118][26022] Updated weights on worker 0-0, policy_version 477712 (0.00092) [2022-07-09 23:58:47,907][26022] Updated weights on worker 0-0, policy_version 477722 (0.00085) [2022-07-09 23:58:48,529][25689] Fps is (10 sec: 5666.3, 60 sec: 5657.6, 300 sec: 5666.8). Total num frames: 489189376. Throughput: 0: 5919.6. Samples: 489195326. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:48,530][25689] Avg episode reward: [(0, '-43.209')] [2022-07-09 23:58:49,691][26022] Updated weights on worker 0-0, policy_version 477732 (0.00841) [2022-07-09 23:58:51,615][26022] Updated weights on worker 0-0, policy_version 477742 (0.00083) [2022-07-09 23:58:53,362][26022] Updated weights on worker 0-0, policy_version 477752 (0.00088) [2022-07-09 23:58:53,555][25689] Fps is (10 sec: 5658.2, 60 sec: 5676.1, 300 sec: 5673.5). Total num frames: 489219072. Throughput: 0: 5056.1. Samples: 489212276. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:53,556][25689] Avg episode reward: [(0, '-43.081')] [2022-07-09 23:58:55,192][26022] Updated weights on worker 0-0, policy_version 477762 (0.00087) [2022-07-09 23:58:56,963][26022] Updated weights on worker 0-0, policy_version 477772 (0.00087) [2022-07-09 23:58:58,582][25689] Fps is (10 sec: 5704.6, 60 sec: 5661.0, 300 sec: 5669.8). Total num frames: 489246720. Throughput: 0: 5898.8. Samples: 489246350. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:58:58,582][25689] Avg episode reward: [(0, '-43.402')] [2022-07-09 23:58:58,774][26022] Updated weights on worker 0-0, policy_version 477782 (0.00087) [2022-07-09 23:59:00,312][26022] Updated weights on worker 0-0, policy_version 477792 (0.00091) [2022-07-09 23:59:02,878][26022] Updated weights on worker 0-0, policy_version 477802 (0.00084) [2022-07-09 23:59:03,608][25689] Fps is (10 sec: 5399.0, 60 sec: 5626.7, 300 sec: 5667.0). Total num frames: 489273344. Throughput: 0: 5811.5. Samples: 489278834. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:59:03,609][25689] Avg episode reward: [(0, '-43.452')] [2022-07-09 23:59:04,249][26022] Updated weights on worker 0-0, policy_version 477812 (0.00093) [2022-07-09 23:59:06,433][26022] Updated weights on worker 0-0, policy_version 477822 (0.00081) [2022-07-09 23:59:07,900][26022] Updated weights on worker 0-0, policy_version 477832 (0.00084) [2022-07-09 23:59:08,651][25689] Fps is (10 sec: 5593.0, 60 sec: 5695.9, 300 sec: 5676.6). Total num frames: 489303040. Throughput: 0: 4996.4. Samples: 489295864. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:59:08,652][25689] Avg episode reward: [(0, '-43.513')] [2022-07-09 23:59:10,126][26022] Updated weights on worker 0-0, policy_version 477842 (0.00081) [2022-07-09 23:59:11,683][26022] Updated weights on worker 0-0, policy_version 477852 (0.00081) [2022-07-09 23:59:13,605][26022] Updated weights on worker 0-0, policy_version 477862 (0.00087) [2022-07-09 23:59:13,665][25689] Fps is (10 sec: 5803.2, 60 sec: 5663.0, 300 sec: 5676.7). Total num frames: 489331712. Throughput: 0: 5861.2. Samples: 489330144. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:59:13,666][25689] Avg episode reward: [(0, '-43.626')] [2022-07-09 23:59:15,085][26022] Updated weights on worker 0-0, policy_version 477872 (0.00087) [2022-07-09 23:59:17,172][26022] Updated weights on worker 0-0, policy_version 477882 (0.00092) [2022-07-09 23:59:18,596][26022] Updated weights on worker 0-0, policy_version 477892 (0.00098) [2022-07-09 23:59:18,676][25689] Fps is (10 sec: 5822.7, 60 sec: 5682.0, 300 sec: 5673.3). Total num frames: 489361408. Throughput: 0: 5888.4. Samples: 489364672. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-09 23:59:18,677][25689] Avg episode reward: [(0, '-43.662')] [2022-07-09 23:59:20,695][26022] Updated weights on worker 0-0, policy_version 477902 (0.00084) [2022-07-09 23:59:22,251][26022] Updated weights on worker 0-0, policy_version 477912 (0.00088) [2022-07-09 23:59:23,687][25689] Fps is (10 sec: 5620.0, 60 sec: 5650.5, 300 sec: 5675.3). Total num frames: 489388032. Throughput: 0: 5137.3. Samples: 489381988. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:23,687][25689] Avg episode reward: [(0, '-43.349')] [2022-07-09 23:59:24,266][26022] Updated weights on worker 0-0, policy_version 477922 (0.00092) [2022-07-09 23:59:26,073][26022] Updated weights on worker 0-0, policy_version 477932 (0.00094) [2022-07-09 23:59:27,841][26022] Updated weights on worker 0-0, policy_version 477942 (0.00086) [2022-07-09 23:59:28,719][25689] Fps is (10 sec: 5505.7, 60 sec: 5636.5, 300 sec: 5673.2). Total num frames: 489416704. Throughput: 0: 5984.6. Samples: 489415960. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:28,719][25689] Avg episode reward: [(0, '-43.267')] [2022-07-09 23:59:29,667][26022] Updated weights on worker 0-0, policy_version 477952 (0.00082) [2022-07-09 23:59:31,599][26022] Updated weights on worker 0-0, policy_version 477962 (0.00090) [2022-07-09 23:59:33,081][26022] Updated weights on worker 0-0, policy_version 477972 (0.00087) [2022-07-09 23:59:33,727][25689] Fps is (10 sec: 5915.6, 60 sec: 5686.9, 300 sec: 5676.9). Total num frames: 489447424. Throughput: 0: 5989.5. Samples: 489450300. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:33,727][25689] Avg episode reward: [(0, '-41.762')] [2022-07-09 23:59:35,140][26022] Updated weights on worker 0-0, policy_version 477982 (0.00095) [2022-07-09 23:59:36,655][26022] Updated weights on worker 0-0, policy_version 477992 (0.00092) [2022-07-09 23:59:38,713][26022] Updated weights on worker 0-0, policy_version 478002 (0.00082) [2022-07-09 23:59:38,732][25689] Fps is (10 sec: 5726.9, 60 sec: 5669.8, 300 sec: 5674.1). Total num frames: 489474048. Throughput: 0: 5114.5. Samples: 489467250. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:38,732][25689] Avg episode reward: [(0, '-41.906')] [2022-07-09 23:59:40,237][26022] Updated weights on worker 0-0, policy_version 478012 (0.00084) [2022-07-09 23:59:42,403][26022] Updated weights on worker 0-0, policy_version 478022 (0.00086) [2022-07-09 23:59:43,121][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-09 23:59:43,130][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000478027_489499648.pth [2022-07-09 23:59:43,130][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000476030_487454720.pth [2022-07-09 23:59:43,735][25689] Fps is (10 sec: 5524.7, 60 sec: 5653.5, 300 sec: 5669.2). Total num frames: 489502720. Throughput: 0: 5948.8. Samples: 489501254. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:43,736][25689] Avg episode reward: [(0, '-41.708')] [2022-07-09 23:59:44,021][26022] Updated weights on worker 0-0, policy_version 478032 (0.00092) [2022-07-09 23:59:46,010][26022] Updated weights on worker 0-0, policy_version 478042 (0.00097) [2022-07-09 23:59:47,690][26022] Updated weights on worker 0-0, policy_version 478052 (0.00113) [2022-07-09 23:59:48,804][25689] Fps is (10 sec: 5693.5, 60 sec: 5674.3, 300 sec: 5672.1). Total num frames: 489531392. Throughput: 0: 5926.8. Samples: 489535000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:48,804][25689] Avg episode reward: [(0, '-41.657')] [2022-07-09 23:59:49,598][26022] Updated weights on worker 0-0, policy_version 478062 (0.00083) [2022-07-09 23:59:51,338][26022] Updated weights on worker 0-0, policy_version 478072 (0.00611) [2022-07-09 23:59:53,173][26022] Updated weights on worker 0-0, policy_version 478082 (0.00096) [2022-07-09 23:59:53,822][25689] Fps is (10 sec: 5583.6, 60 sec: 5641.1, 300 sec: 5669.1). Total num frames: 489559040. Throughput: 0: 5064.1. Samples: 489552066. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:53,823][25689] Avg episode reward: [(0, '-42.594')] [2022-07-09 23:59:54,883][26022] Updated weights on worker 0-0, policy_version 478092 (0.00090) [2022-07-09 23:59:56,712][26022] Updated weights on worker 0-0, policy_version 478102 (0.00092) [2022-07-09 23:59:58,612][26022] Updated weights on worker 0-0, policy_version 478112 (0.00076) [2022-07-09 23:59:58,831][25689] Fps is (10 sec: 5616.5, 60 sec: 5659.6, 300 sec: 5669.4). Total num frames: 489587712. Throughput: 0: 5926.5. Samples: 489586370. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-09 23:59:58,832][25689] Avg episode reward: [(0, '-42.719')] [2022-07-10 00:00:00,459][26022] Updated weights on worker 0-0, policy_version 478122 (0.00081) [2022-07-10 00:00:02,567][26022] Updated weights on worker 0-0, policy_version 478132 (0.00093) [2022-07-10 00:00:03,836][25689] Fps is (10 sec: 5419.8, 60 sec: 5644.7, 300 sec: 5664.3). Total num frames: 489613312. Throughput: 0: 5801.7. Samples: 489617870. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:03,836][25689] Avg episode reward: [(0, '-44.313')] [2022-07-10 00:00:04,437][26022] Updated weights on worker 0-0, policy_version 478142 (0.00090) [2022-07-10 00:00:06,142][26022] Updated weights on worker 0-0, policy_version 478152 (0.00086) [2022-07-10 00:00:07,973][26022] Updated weights on worker 0-0, policy_version 478162 (0.00090) [2022-07-10 00:00:08,940][25689] Fps is (10 sec: 5470.0, 60 sec: 5639.0, 300 sec: 5663.3). Total num frames: 489643008. Throughput: 0: 5809.6. Samples: 489651986. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:08,941][25689] Avg episode reward: [(0, '-44.217')] [2022-07-10 00:00:09,910][26022] Updated weights on worker 0-0, policy_version 478172 (0.00090) [2022-07-10 00:00:11,558][26022] Updated weights on worker 0-0, policy_version 478182 (0.00091) [2022-07-10 00:00:13,486][26022] Updated weights on worker 0-0, policy_version 478192 (0.00089) [2022-07-10 00:00:13,947][25689] Fps is (10 sec: 5772.8, 60 sec: 5639.7, 300 sec: 5664.2). Total num frames: 489671680. Throughput: 0: 5818.0. Samples: 489669150. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:13,947][25689] Avg episode reward: [(0, '-43.761')] [2022-07-10 00:00:14,984][26022] Updated weights on worker 0-0, policy_version 478202 (0.00085) [2022-07-10 00:00:16,882][26022] Updated weights on worker 0-0, policy_version 478212 (0.00085) [2022-07-10 00:00:18,815][26022] Updated weights on worker 0-0, policy_version 478222 (0.00096) [2022-07-10 00:00:18,954][25689] Fps is (10 sec: 5726.4, 60 sec: 5622.9, 300 sec: 5665.1). Total num frames: 489700352. Throughput: 0: 5828.8. Samples: 489703662. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:18,955][25689] Avg episode reward: [(0, '-43.758')] [2022-07-10 00:00:20,474][26022] Updated weights on worker 0-0, policy_version 478232 (0.00056) [2022-07-10 00:00:22,355][26022] Updated weights on worker 0-0, policy_version 478242 (0.00091) [2022-07-10 00:00:23,957][25689] Fps is (10 sec: 5830.5, 60 sec: 5674.7, 300 sec: 5670.8). Total num frames: 489730048. Throughput: 0: 5980.0. Samples: 489738198. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:23,958][25689] Avg episode reward: [(0, '-43.270')] [2022-07-10 00:00:23,958][26022] Updated weights on worker 0-0, policy_version 478252 (0.00092) [2022-07-10 00:00:25,891][26022] Updated weights on worker 0-0, policy_version 478262 (0.00092) [2022-07-10 00:00:27,853][26022] Updated weights on worker 0-0, policy_version 478272 (0.00086) [2022-07-10 00:00:29,005][25689] Fps is (10 sec: 5705.2, 60 sec: 5656.2, 300 sec: 5666.7). Total num frames: 489757696. Throughput: 0: 5138.2. Samples: 489755084. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:29,006][25689] Avg episode reward: [(0, '-42.394')] [2022-07-10 00:00:29,472][26022] Updated weights on worker 0-0, policy_version 478282 (0.00089) [2022-07-10 00:00:31,276][26022] Updated weights on worker 0-0, policy_version 478292 (0.00091) [2022-07-10 00:00:33,033][26022] Updated weights on worker 0-0, policy_version 478302 (0.00087) [2022-07-10 00:00:34,016][25689] Fps is (10 sec: 5497.6, 60 sec: 5605.0, 300 sec: 5660.4). Total num frames: 489785344. Throughput: 0: 5981.9. Samples: 489789200. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:34,017][25689] Avg episode reward: [(0, '-41.013')] [2022-07-10 00:00:34,958][26022] Updated weights on worker 0-0, policy_version 478312 (0.00083) [2022-07-10 00:00:36,691][26022] Updated weights on worker 0-0, policy_version 478322 (0.00093) [2022-07-10 00:00:38,418][26022] Updated weights on worker 0-0, policy_version 478332 (0.00088) [2022-07-10 00:00:39,019][25689] Fps is (10 sec: 5624.1, 60 sec: 5639.1, 300 sec: 5664.1). Total num frames: 489814016. Throughput: 0: 5983.5. Samples: 489823722. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:39,020][25689] Avg episode reward: [(0, '-41.133')] [2022-07-10 00:00:40,291][26022] Updated weights on worker 0-0, policy_version 478342 (0.00094) [2022-07-10 00:00:42,356][26022] Updated weights on worker 0-0, policy_version 478352 (0.00093) [2022-07-10 00:00:43,935][26022] Updated weights on worker 0-0, policy_version 478362 (0.00086) [2022-07-10 00:00:44,030][25689] Fps is (10 sec: 5726.5, 60 sec: 5638.5, 300 sec: 5663.2). Total num frames: 489842688. Throughput: 0: 5093.4. Samples: 489840434. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:44,031][25689] Avg episode reward: [(0, '-41.213')] [2022-07-10 00:00:45,864][26022] Updated weights on worker 0-0, policy_version 478372 (0.00090) [2022-07-10 00:00:47,595][26022] Updated weights on worker 0-0, policy_version 478382 (0.00087) [2022-07-10 00:00:49,075][25689] Fps is (10 sec: 5702.9, 60 sec: 5640.7, 300 sec: 5665.9). Total num frames: 489871360. Throughput: 0: 5950.1. Samples: 489874498. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:49,075][25689] Avg episode reward: [(0, '-41.065')] [2022-07-10 00:00:49,502][26022] Updated weights on worker 0-0, policy_version 478392 (0.00090) [2022-07-10 00:00:51,084][26022] Updated weights on worker 0-0, policy_version 478402 (0.00087) [2022-07-10 00:00:53,052][26022] Updated weights on worker 0-0, policy_version 478412 (0.00079) [2022-07-10 00:00:54,093][25689] Fps is (10 sec: 5698.1, 60 sec: 5657.6, 300 sec: 5662.7). Total num frames: 489900032. Throughput: 0: 5951.3. Samples: 489908688. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:54,094][25689] Avg episode reward: [(0, '-40.818')] [2022-07-10 00:00:54,584][26022] Updated weights on worker 0-0, policy_version 478422 (0.00080) [2022-07-10 00:00:56,524][26022] Updated weights on worker 0-0, policy_version 478432 (0.00090) [2022-07-10 00:00:58,317][26022] Updated weights on worker 0-0, policy_version 478442 (0.00088) [2022-07-10 00:00:59,121][25689] Fps is (10 sec: 5606.0, 60 sec: 5638.9, 300 sec: 5666.1). Total num frames: 489927680. Throughput: 0: 5078.4. Samples: 489925806. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:00:59,122][25689] Avg episode reward: [(0, '-41.778')] [2022-07-10 00:01:00,085][26022] Updated weights on worker 0-0, policy_version 478452 (0.00080) [2022-07-10 00:01:02,249][26022] Updated weights on worker 0-0, policy_version 478462 (0.00087) [2022-07-10 00:01:04,000][26022] Updated weights on worker 0-0, policy_version 478472 (0.00086) [2022-07-10 00:01:04,133][25689] Fps is (10 sec: 5609.8, 60 sec: 5689.2, 300 sec: 5671.0). Total num frames: 489956352. Throughput: 0: 5857.5. Samples: 489958190. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:04,133][25689] Avg episode reward: [(0, '-43.043')] [2022-07-10 00:01:06,052][26022] Updated weights on worker 0-0, policy_version 478482 (0.00095) [2022-07-10 00:01:07,609][26022] Updated weights on worker 0-0, policy_version 478492 (0.00087) [2022-07-10 00:01:09,227][25689] Fps is (10 sec: 5573.0, 60 sec: 5656.2, 300 sec: 5663.8). Total num frames: 489984000. Throughput: 0: 5841.8. Samples: 489992224. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:09,227][25689] Avg episode reward: [(0, '-42.772')] [2022-07-10 00:01:09,467][26022] Updated weights on worker 0-0, policy_version 478502 (0.00086) [2022-07-10 00:01:11,339][26022] Updated weights on worker 0-0, policy_version 478512 (0.00086) [2022-07-10 00:01:13,410][26022] Updated weights on worker 0-0, policy_version 478522 (0.00091) [2022-07-10 00:01:14,228][25689] Fps is (10 sec: 5477.3, 60 sec: 5639.7, 300 sec: 5660.8). Total num frames: 490011648. Throughput: 0: 4978.4. Samples: 490008928. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:14,229][25689] Avg episode reward: [(0, '-43.026')] [2022-07-10 00:01:14,943][26022] Updated weights on worker 0-0, policy_version 478532 (0.00088) [2022-07-10 00:01:16,958][26022] Updated weights on worker 0-0, policy_version 478542 (0.00089) [2022-07-10 00:01:18,411][26022] Updated weights on worker 0-0, policy_version 478552 (0.00084) [2022-07-10 00:01:19,245][25689] Fps is (10 sec: 5621.6, 60 sec: 5638.8, 300 sec: 5663.9). Total num frames: 490040320. Throughput: 0: 5847.4. Samples: 490043482. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:19,246][25689] Avg episode reward: [(0, '-43.904')] [2022-07-10 00:01:20,302][26022] Updated weights on worker 0-0, policy_version 478562 (0.00088) [2022-07-10 00:01:22,104][26022] Updated weights on worker 0-0, policy_version 478572 (0.00092) [2022-07-10 00:01:23,781][26022] Updated weights on worker 0-0, policy_version 478582 (0.00092) [2022-07-10 00:01:24,285][25689] Fps is (10 sec: 5804.1, 60 sec: 5635.4, 300 sec: 5663.9). Total num frames: 490070016. Throughput: 0: 5947.6. Samples: 490078044. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:24,285][25689] Avg episode reward: [(0, '-43.516')] [2022-07-10 00:01:25,665][26022] Updated weights on worker 0-0, policy_version 478592 (0.00083) [2022-07-10 00:01:27,381][26022] Updated weights on worker 0-0, policy_version 478602 (0.00093) [2022-07-10 00:01:29,144][26022] Updated weights on worker 0-0, policy_version 478612 (0.00087) [2022-07-10 00:01:29,331][25689] Fps is (10 sec: 5889.0, 60 sec: 5669.5, 300 sec: 5666.9). Total num frames: 490099712. Throughput: 0: 5128.1. Samples: 490095318. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:29,331][25689] Avg episode reward: [(0, '-43.583')] [2022-07-10 00:01:31,098][26022] Updated weights on worker 0-0, policy_version 478622 (0.00089) [2022-07-10 00:01:32,812][26022] Updated weights on worker 0-0, policy_version 478632 (0.00086) [2022-07-10 00:01:34,332][25689] Fps is (10 sec: 5707.6, 60 sec: 5670.4, 300 sec: 5663.6). Total num frames: 490127360. Throughput: 0: 6022.8. Samples: 490130006. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:34,332][25689] Avg episode reward: [(0, '-43.250')] [2022-07-10 00:01:34,488][26022] Updated weights on worker 0-0, policy_version 478642 (0.00083) [2022-07-10 00:01:36,469][26022] Updated weights on worker 0-0, policy_version 478652 (0.00086) [2022-07-10 00:01:37,886][26022] Updated weights on worker 0-0, policy_version 478662 (0.00089) [2022-07-10 00:01:39,363][25689] Fps is (10 sec: 5512.0, 60 sec: 5650.9, 300 sec: 5663.8). Total num frames: 490155008. Throughput: 0: 6015.2. Samples: 490164490. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:39,363][25689] Avg episode reward: [(0, '-43.637')] [2022-07-10 00:01:40,050][26022] Updated weights on worker 0-0, policy_version 478672 (0.00090) [2022-07-10 00:01:41,515][26022] Updated weights on worker 0-0, policy_version 478682 (0.00095) [2022-07-10 00:01:43,194][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:01:43,206][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000478689_490177536.pth [2022-07-10 00:01:43,206][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000476696_488136704.pth [2022-07-10 00:01:43,592][26022] Updated weights on worker 0-0, policy_version 478692 (0.00050) [2022-07-10 00:01:44,378][25689] Fps is (10 sec: 5810.1, 60 sec: 5684.4, 300 sec: 5672.0). Total num frames: 490185728. Throughput: 0: 5153.2. Samples: 490181586. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:44,387][25689] Avg episode reward: [(0, '-43.789')] [2022-07-10 00:01:45,519][26022] Updated weights on worker 0-0, policy_version 478702 (0.00088) [2022-07-10 00:01:47,054][26022] Updated weights on worker 0-0, policy_version 478712 (0.00092) [2022-07-10 00:01:48,978][26022] Updated weights on worker 0-0, policy_version 478722 (0.00051) [2022-07-10 00:01:49,494][25689] Fps is (10 sec: 5761.2, 60 sec: 5660.7, 300 sec: 5663.8). Total num frames: 490213376. Throughput: 0: 5987.5. Samples: 490216044. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:49,495][25689] Avg episode reward: [(0, '-43.575')] [2022-07-10 00:01:50,550][26022] Updated weights on worker 0-0, policy_version 478732 (0.00085) [2022-07-10 00:01:52,590][26022] Updated weights on worker 0-0, policy_version 478742 (0.00088) [2022-07-10 00:01:54,255][26022] Updated weights on worker 0-0, policy_version 478752 (0.00084) [2022-07-10 00:01:54,566][25689] Fps is (10 sec: 5628.6, 60 sec: 5672.7, 300 sec: 5669.6). Total num frames: 490243072. Throughput: 0: 5939.6. Samples: 490250188. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:54,566][25689] Avg episode reward: [(0, '-43.297')] [2022-07-10 00:01:56,131][26022] Updated weights on worker 0-0, policy_version 478762 (0.00377) [2022-07-10 00:01:57,839][26022] Updated weights on worker 0-0, policy_version 478772 (0.00089) [2022-07-10 00:01:59,582][25689] Fps is (10 sec: 5785.6, 60 sec: 5690.7, 300 sec: 5672.9). Total num frames: 490271744. Throughput: 0: 5083.7. Samples: 490267280. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:01:59,583][25689] Avg episode reward: [(0, '-44.398')] [2022-07-10 00:01:59,720][26022] Updated weights on worker 0-0, policy_version 478782 (0.00089) [2022-07-10 00:02:01,955][26022] Updated weights on worker 0-0, policy_version 478792 (0.00084) [2022-07-10 00:02:03,672][26022] Updated weights on worker 0-0, policy_version 478802 (0.00086) [2022-07-10 00:02:04,658][25689] Fps is (10 sec: 5479.1, 60 sec: 5650.8, 300 sec: 5666.0). Total num frames: 490298368. Throughput: 0: 5805.1. Samples: 490299314. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:02:04,658][25689] Avg episode reward: [(0, '-44.350')] [2022-07-10 00:02:05,500][26022] Updated weights on worker 0-0, policy_version 478812 (0.00110) [2022-07-10 00:02:07,335][26022] Updated weights on worker 0-0, policy_version 478822 (0.00084) [2022-07-10 00:02:09,155][26022] Updated weights on worker 0-0, policy_version 478832 (0.00084) [2022-07-10 00:02:09,774][25689] Fps is (10 sec: 5425.7, 60 sec: 5665.7, 300 sec: 5664.4). Total num frames: 490327040. Throughput: 0: 5791.1. Samples: 490333488. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 00:02:09,774][25689] Avg episode reward: [(0, '-44.917')] [2022-07-10 00:02:10,925][26022] Updated weights on worker 0-0, policy_version 478842 (0.00107) [2022-07-10 00:02:12,671][26022] Updated weights on worker 0-0, policy_version 478852 (0.00090) [2022-07-10 00:02:14,484][26022] Updated weights on worker 0-0, policy_version 478862 (0.00089) [2022-07-10 00:02:14,806][25689] Fps is (10 sec: 5751.6, 60 sec: 5696.6, 300 sec: 5667.5). Total num frames: 490356736. Throughput: 0: 4952.9. Samples: 490350436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:14,806][25689] Avg episode reward: [(0, '-44.059')] [2022-07-10 00:02:16,491][26022] Updated weights on worker 0-0, policy_version 478872 (0.00088) [2022-07-10 00:02:18,153][26022] Updated weights on worker 0-0, policy_version 478882 (0.00090) [2022-07-10 00:02:19,857][25689] Fps is (10 sec: 5686.9, 60 sec: 5676.5, 300 sec: 5667.1). Total num frames: 490384384. Throughput: 0: 5793.1. Samples: 490384736. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:19,858][25689] Avg episode reward: [(0, '-43.551')] [2022-07-10 00:02:20,024][26022] Updated weights on worker 0-0, policy_version 478892 (0.00083) [2022-07-10 00:02:21,641][26022] Updated weights on worker 0-0, policy_version 478902 (0.00089) [2022-07-10 00:02:23,469][26022] Updated weights on worker 0-0, policy_version 478912 (0.00096) [2022-07-10 00:02:24,879][25689] Fps is (10 sec: 5692.6, 60 sec: 5678.1, 300 sec: 5668.2). Total num frames: 490414080. Throughput: 0: 5925.9. Samples: 490419146. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:24,880][25689] Avg episode reward: [(0, '-43.657')] [2022-07-10 00:02:25,406][26022] Updated weights on worker 0-0, policy_version 478922 (0.00093) [2022-07-10 00:02:26,983][26022] Updated weights on worker 0-0, policy_version 478932 (0.00085) [2022-07-10 00:02:29,008][26022] Updated weights on worker 0-0, policy_version 478942 (0.00088) [2022-07-10 00:02:29,919][25689] Fps is (10 sec: 5699.4, 60 sec: 5644.9, 300 sec: 5660.9). Total num frames: 490441728. Throughput: 0: 5100.2. Samples: 490436230. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:29,919][25689] Avg episode reward: [(0, '-42.991')] [2022-07-10 00:02:30,482][26022] Updated weights on worker 0-0, policy_version 478952 (0.00084) [2022-07-10 00:02:32,588][26022] Updated weights on worker 0-0, policy_version 478962 (0.00084) [2022-07-10 00:02:34,180][26022] Updated weights on worker 0-0, policy_version 478972 (0.00088) [2022-07-10 00:02:34,947][25689] Fps is (10 sec: 5594.0, 60 sec: 5659.3, 300 sec: 5664.3). Total num frames: 490470400. Throughput: 0: 5967.5. Samples: 490470630. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:34,948][25689] Avg episode reward: [(0, '-42.832')] [2022-07-10 00:02:36,255][26022] Updated weights on worker 0-0, policy_version 478982 (0.00086) [2022-07-10 00:02:37,716][26022] Updated weights on worker 0-0, policy_version 478992 (0.00086) [2022-07-10 00:02:39,623][26022] Updated weights on worker 0-0, policy_version 479002 (0.00087) [2022-07-10 00:02:39,953][25689] Fps is (10 sec: 5714.6, 60 sec: 5678.5, 300 sec: 5661.3). Total num frames: 490499072. Throughput: 0: 5969.5. Samples: 490504698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:39,954][25689] Avg episode reward: [(0, '-43.350')] [2022-07-10 00:02:41,426][26022] Updated weights on worker 0-0, policy_version 479012 (0.00088) [2022-07-10 00:02:43,313][26022] Updated weights on worker 0-0, policy_version 479022 (0.00088) [2022-07-10 00:02:45,015][25689] Fps is (10 sec: 5695.7, 60 sec: 5640.3, 300 sec: 5659.0). Total num frames: 490527744. Throughput: 0: 5092.0. Samples: 490521678. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:45,016][25689] Avg episode reward: [(0, '-43.822')] [2022-07-10 00:02:45,083][26022] Updated weights on worker 0-0, policy_version 479032 (0.00088) [2022-07-10 00:02:46,911][26022] Updated weights on worker 0-0, policy_version 479042 (0.00087) [2022-07-10 00:02:48,562][26022] Updated weights on worker 0-0, policy_version 479052 (0.00087) [2022-07-10 00:02:50,127][25689] Fps is (10 sec: 5536.0, 60 sec: 5640.8, 300 sec: 5654.3). Total num frames: 490555392. Throughput: 0: 5926.9. Samples: 490556000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:50,127][25689] Avg episode reward: [(0, '-44.340')] [2022-07-10 00:02:50,655][26022] Updated weights on worker 0-0, policy_version 479062 (0.00092) [2022-07-10 00:02:52,145][26022] Updated weights on worker 0-0, policy_version 479072 (0.00088) [2022-07-10 00:02:54,158][26022] Updated weights on worker 0-0, policy_version 479082 (0.00082) [2022-07-10 00:02:55,184][25689] Fps is (10 sec: 5840.6, 60 sec: 5675.9, 300 sec: 5664.4). Total num frames: 490587136. Throughput: 0: 5916.8. Samples: 490590366. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:02:55,184][25689] Avg episode reward: [(0, '-43.747')] [2022-07-10 00:02:55,770][26022] Updated weights on worker 0-0, policy_version 479092 (0.00094) [2022-07-10 00:02:57,493][26022] Updated weights on worker 0-0, policy_version 479102 (0.00078) [2022-07-10 00:02:59,503][26022] Updated weights on worker 0-0, policy_version 479112 (0.00084) [2022-07-10 00:03:00,196][25689] Fps is (10 sec: 5796.8, 60 sec: 5642.6, 300 sec: 5657.7). Total num frames: 490613760. Throughput: 0: 5084.4. Samples: 490607618. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:00,196][25689] Avg episode reward: [(0, '-44.053')] [2022-07-10 00:03:01,100][26022] Updated weights on worker 0-0, policy_version 479122 (0.00085) [2022-07-10 00:03:03,450][26022] Updated weights on worker 0-0, policy_version 479132 (0.00100) [2022-07-10 00:03:05,130][26022] Updated weights on worker 0-0, policy_version 479142 (0.00096) [2022-07-10 00:03:05,214][25689] Fps is (10 sec: 5410.8, 60 sec: 5664.8, 300 sec: 5665.4). Total num frames: 490641408. Throughput: 0: 5846.4. Samples: 490639770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:05,215][25689] Avg episode reward: [(0, '-44.705')] [2022-07-10 00:03:06,932][26022] Updated weights on worker 0-0, policy_version 479152 (0.00087) [2022-07-10 00:03:09,071][26022] Updated weights on worker 0-0, policy_version 479162 (0.00095) [2022-07-10 00:03:10,285][25689] Fps is (10 sec: 5582.3, 60 sec: 5669.1, 300 sec: 5657.6). Total num frames: 490670080. Throughput: 0: 5831.1. Samples: 490673544. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:10,285][25689] Avg episode reward: [(0, '-44.143')] [2022-07-10 00:03:10,564][26022] Updated weights on worker 0-0, policy_version 479172 (0.00086) [2022-07-10 00:03:12,525][26022] Updated weights on worker 0-0, policy_version 479182 (0.00090) [2022-07-10 00:03:14,208][26022] Updated weights on worker 0-0, policy_version 479192 (0.00049) [2022-07-10 00:03:15,295][25689] Fps is (10 sec: 5485.3, 60 sec: 5620.3, 300 sec: 5651.1). Total num frames: 490696704. Throughput: 0: 4991.0. Samples: 490690740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:15,297][25689] Avg episode reward: [(0, '-44.644')] [2022-07-10 00:03:15,980][26022] Updated weights on worker 0-0, policy_version 479202 (0.00101) [2022-07-10 00:03:17,772][26022] Updated weights on worker 0-0, policy_version 479212 (0.00087) [2022-07-10 00:03:19,703][26022] Updated weights on worker 0-0, policy_version 479222 (0.00082) [2022-07-10 00:03:20,302][25689] Fps is (10 sec: 5724.6, 60 sec: 5675.3, 300 sec: 5658.6). Total num frames: 490727424. Throughput: 0: 5839.9. Samples: 490725036. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:20,314][25689] Avg episode reward: [(0, '-44.931')] [2022-07-10 00:03:21,403][26022] Updated weights on worker 0-0, policy_version 479232 (0.00571) [2022-07-10 00:03:23,263][26022] Updated weights on worker 0-0, policy_version 479242 (0.00085) [2022-07-10 00:03:24,854][26022] Updated weights on worker 0-0, policy_version 479252 (0.00097) [2022-07-10 00:03:25,402][25689] Fps is (10 sec: 5876.0, 60 sec: 5651.0, 300 sec: 5654.4). Total num frames: 490756096. Throughput: 0: 5929.4. Samples: 490759474. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:25,404][25689] Avg episode reward: [(0, '-45.131')] [2022-07-10 00:03:26,931][26022] Updated weights on worker 0-0, policy_version 479262 (0.00086) [2022-07-10 00:03:28,624][26022] Updated weights on worker 0-0, policy_version 479272 (0.00090) [2022-07-10 00:03:30,515][25689] Fps is (10 sec: 5514.5, 60 sec: 5644.2, 300 sec: 5652.3). Total num frames: 490783744. Throughput: 0: 5911.9. Samples: 490793142. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:30,517][25689] Avg episode reward: [(0, '-44.506')] [2022-07-10 00:03:30,556][26022] Updated weights on worker 0-0, policy_version 479282 (0.00083) [2022-07-10 00:03:32,259][26022] Updated weights on worker 0-0, policy_version 479292 (0.00084) [2022-07-10 00:03:34,182][26022] Updated weights on worker 0-0, policy_version 479302 (0.00062) [2022-07-10 00:03:35,581][25689] Fps is (10 sec: 5734.4, 60 sec: 5674.5, 300 sec: 5661.5). Total num frames: 490814464. Throughput: 0: 5906.0. Samples: 490810550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:35,583][25689] Avg episode reward: [(0, '-44.543')] [2022-07-10 00:03:35,683][26022] Updated weights on worker 0-0, policy_version 479312 (0.00084) [2022-07-10 00:03:37,858][26022] Updated weights on worker 0-0, policy_version 479322 (0.00096) [2022-07-10 00:03:39,157][26022] Updated weights on worker 0-0, policy_version 479332 (0.00084) [2022-07-10 00:03:40,679][25689] Fps is (10 sec: 5742.7, 60 sec: 5649.0, 300 sec: 5652.9). Total num frames: 490842112. Throughput: 0: 5870.5. Samples: 490844658. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:40,679][25689] Avg episode reward: [(0, '-44.432')] [2022-07-10 00:03:41,221][26022] Updated weights on worker 0-0, policy_version 479342 (0.00084) [2022-07-10 00:03:42,931][26022] Updated weights on worker 0-0, policy_version 479352 (0.00080) [2022-07-10 00:03:43,257][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:03:43,269][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000479353_490857472.pth [2022-07-10 00:03:43,270][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000477362_488818688.pth [2022-07-10 00:03:44,764][26022] Updated weights on worker 0-0, policy_version 479362 (0.00092) [2022-07-10 00:03:45,727][25689] Fps is (10 sec: 5651.8, 60 sec: 5667.1, 300 sec: 5661.0). Total num frames: 490871808. Throughput: 0: 5874.5. Samples: 490878872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:45,728][25689] Avg episode reward: [(0, '-44.558')] [2022-07-10 00:03:46,602][26022] Updated weights on worker 0-0, policy_version 479372 (0.00083) [2022-07-10 00:03:48,379][26022] Updated weights on worker 0-0, policy_version 479382 (0.00096) [2022-07-10 00:03:50,362][26022] Updated weights on worker 0-0, policy_version 479392 (0.00091) [2022-07-10 00:03:50,794][25689] Fps is (10 sec: 5567.7, 60 sec: 5654.4, 300 sec: 5649.9). Total num frames: 490898432. Throughput: 0: 5058.1. Samples: 490895718. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:50,796][25689] Avg episode reward: [(0, '-44.728')] [2022-07-10 00:03:52,081][26022] Updated weights on worker 0-0, policy_version 479402 (0.00088) [2022-07-10 00:03:54,033][26022] Updated weights on worker 0-0, policy_version 479412 (0.00094) [2022-07-10 00:03:55,585][26022] Updated weights on worker 0-0, policy_version 479422 (0.00090) [2022-07-10 00:03:55,824][25689] Fps is (10 sec: 5679.5, 60 sec: 5640.1, 300 sec: 5660.1). Total num frames: 490929152. Throughput: 0: 5879.4. Samples: 490929566. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:03:55,824][25689] Avg episode reward: [(0, '-44.240')] [2022-07-10 00:03:57,709][26022] Updated weights on worker 0-0, policy_version 479432 (0.00088) [2022-07-10 00:03:59,289][26022] Updated weights on worker 0-0, policy_version 479442 (0.00078) [2022-07-10 00:04:00,827][25689] Fps is (10 sec: 5715.7, 60 sec: 5640.9, 300 sec: 5660.6). Total num frames: 490955776. Throughput: 0: 5912.1. Samples: 490963776. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:00,828][25689] Avg episode reward: [(0, '-44.724')] [2022-07-10 00:04:01,189][26022] Updated weights on worker 0-0, policy_version 479452 (0.00087) [2022-07-10 00:04:03,270][26022] Updated weights on worker 0-0, policy_version 479462 (0.00088) [2022-07-10 00:04:05,127][26022] Updated weights on worker 0-0, policy_version 479472 (0.00548) [2022-07-10 00:04:05,832][25689] Fps is (10 sec: 5423.0, 60 sec: 5642.2, 300 sec: 5654.4). Total num frames: 490983424. Throughput: 0: 4969.2. Samples: 490978774. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:05,834][25689] Avg episode reward: [(0, '-45.246')] [2022-07-10 00:04:06,982][26022] Updated weights on worker 0-0, policy_version 479482 (0.00088) [2022-07-10 00:04:08,846][26022] Updated weights on worker 0-0, policy_version 479492 (0.00092) [2022-07-10 00:04:10,538][26022] Updated weights on worker 0-0, policy_version 479502 (0.00093) [2022-07-10 00:04:10,893][25689] Fps is (10 sec: 5697.0, 60 sec: 5660.0, 300 sec: 5657.0). Total num frames: 491013120. Throughput: 0: 5826.1. Samples: 491012812. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:10,893][25689] Avg episode reward: [(0, '-44.588')] [2022-07-10 00:04:12,310][26022] Updated weights on worker 0-0, policy_version 479512 (0.00091) [2022-07-10 00:04:13,916][26022] Updated weights on worker 0-0, policy_version 479522 (0.00085) [2022-07-10 00:04:15,902][25689] Fps is (10 sec: 5592.5, 60 sec: 5660.0, 300 sec: 5646.7). Total num frames: 491039744. Throughput: 0: 5858.3. Samples: 491047190. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:15,904][25689] Avg episode reward: [(0, '-43.935')] [2022-07-10 00:04:15,976][26022] Updated weights on worker 0-0, policy_version 479532 (0.00078) [2022-07-10 00:04:17,612][26022] Updated weights on worker 0-0, policy_version 479542 (0.00096) [2022-07-10 00:04:19,443][26022] Updated weights on worker 0-0, policy_version 479552 (0.00087) [2022-07-10 00:04:20,910][25689] Fps is (10 sec: 5519.8, 60 sec: 5626.1, 300 sec: 5653.6). Total num frames: 491068416. Throughput: 0: 5005.2. Samples: 491064298. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:20,912][25689] Avg episode reward: [(0, '-44.040')] [2022-07-10 00:04:21,251][26022] Updated weights on worker 0-0, policy_version 479562 (0.00081) [2022-07-10 00:04:22,852][26022] Updated weights on worker 0-0, policy_version 479572 (0.00084) [2022-07-10 00:04:24,889][26022] Updated weights on worker 0-0, policy_version 479582 (0.00091) [2022-07-10 00:04:25,913][25689] Fps is (10 sec: 5932.5, 60 sec: 5669.1, 300 sec: 5661.0). Total num frames: 491099136. Throughput: 0: 5982.1. Samples: 491098906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:25,914][25689] Avg episode reward: [(0, '-43.351')] [2022-07-10 00:04:26,459][26022] Updated weights on worker 0-0, policy_version 479592 (0.00085) [2022-07-10 00:04:28,566][26022] Updated weights on worker 0-0, policy_version 479602 (0.00093) [2022-07-10 00:04:30,104][26022] Updated weights on worker 0-0, policy_version 479612 (0.00090) [2022-07-10 00:04:30,981][25689] Fps is (10 sec: 5694.1, 60 sec: 5656.3, 300 sec: 5646.1). Total num frames: 491125760. Throughput: 0: 5996.9. Samples: 491133282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:30,982][25689] Avg episode reward: [(0, '-42.981')] [2022-07-10 00:04:31,967][26022] Updated weights on worker 0-0, policy_version 479622 (0.00094) [2022-07-10 00:04:33,931][26022] Updated weights on worker 0-0, policy_version 479632 (0.00051) [2022-07-10 00:04:35,496][26022] Updated weights on worker 0-0, policy_version 479642 (0.00091) [2022-07-10 00:04:35,998][25689] Fps is (10 sec: 5686.5, 60 sec: 5661.0, 300 sec: 5659.7). Total num frames: 491156480. Throughput: 0: 5137.4. Samples: 491150428. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:35,998][25689] Avg episode reward: [(0, '-42.646')] [2022-07-10 00:04:37,352][26022] Updated weights on worker 0-0, policy_version 479652 (0.00087) [2022-07-10 00:04:38,960][26022] Updated weights on worker 0-0, policy_version 479662 (0.00090) [2022-07-10 00:04:41,002][26022] Updated weights on worker 0-0, policy_version 479672 (0.00093) [2022-07-10 00:04:41,012][25689] Fps is (10 sec: 5819.2, 60 sec: 5668.8, 300 sec: 5656.0). Total num frames: 491184128. Throughput: 0: 6012.1. Samples: 491185148. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:41,012][25689] Avg episode reward: [(0, '-42.489')] [2022-07-10 00:04:42,482][26022] Updated weights on worker 0-0, policy_version 479682 (0.00084) [2022-07-10 00:04:44,683][26022] Updated weights on worker 0-0, policy_version 479692 (0.00083) [2022-07-10 00:04:46,027][25689] Fps is (10 sec: 5717.8, 60 sec: 5672.0, 300 sec: 5660.5). Total num frames: 491213824. Throughput: 0: 5991.1. Samples: 491219406. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:46,027][25689] Avg episode reward: [(0, '-42.726')] [2022-07-10 00:04:46,031][26022] Updated weights on worker 0-0, policy_version 479702 (0.00092) [2022-07-10 00:04:48,134][26022] Updated weights on worker 0-0, policy_version 479712 (0.00085) [2022-07-10 00:04:49,622][26022] Updated weights on worker 0-0, policy_version 479722 (0.00088) [2022-07-10 00:04:51,149][25689] Fps is (10 sec: 5555.6, 60 sec: 5666.8, 300 sec: 5655.1). Total num frames: 491240448. Throughput: 0: 5112.7. Samples: 491236394. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:51,150][25689] Avg episode reward: [(0, '-42.438')] [2022-07-10 00:04:51,806][26022] Updated weights on worker 0-0, policy_version 479732 (0.00081) [2022-07-10 00:04:53,455][26022] Updated weights on worker 0-0, policy_version 479742 (0.00087) [2022-07-10 00:04:55,402][26022] Updated weights on worker 0-0, policy_version 479752 (0.00729) [2022-07-10 00:04:56,162][25689] Fps is (10 sec: 5557.0, 60 sec: 5651.4, 300 sec: 5658.5). Total num frames: 491270144. Throughput: 0: 5956.1. Samples: 491270526. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:04:56,162][25689] Avg episode reward: [(0, '-42.022')] [2022-07-10 00:04:57,069][26022] Updated weights on worker 0-0, policy_version 479762 (0.00091) [2022-07-10 00:04:59,140][26022] Updated weights on worker 0-0, policy_version 479772 (0.00089) [2022-07-10 00:05:00,516][26022] Updated weights on worker 0-0, policy_version 479782 (0.00095) [2022-07-10 00:05:01,178][25689] Fps is (10 sec: 5921.9, 60 sec: 5701.0, 300 sec: 5672.0). Total num frames: 491299840. Throughput: 0: 5920.9. Samples: 491304554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 00:05:01,179][25689] Avg episode reward: [(0, '-41.808')] [2022-07-10 00:05:03,106][26022] Updated weights on worker 0-0, policy_version 479792 (0.00082) [2022-07-10 00:05:04,538][26022] Updated weights on worker 0-0, policy_version 479802 (0.00098) [2022-07-10 00:05:06,186][25689] Fps is (10 sec: 5516.1, 60 sec: 5666.8, 300 sec: 5660.1). Total num frames: 491325440. Throughput: 0: 4964.1. Samples: 491319480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:06,188][25689] Avg episode reward: [(0, '-42.011')] [2022-07-10 00:05:06,651][26022] Updated weights on worker 0-0, policy_version 479812 (0.00084) [2022-07-10 00:05:08,243][26022] Updated weights on worker 0-0, policy_version 479822 (0.00093) [2022-07-10 00:05:10,119][26022] Updated weights on worker 0-0, policy_version 479832 (0.00081) [2022-07-10 00:05:11,243][25689] Fps is (10 sec: 5392.2, 60 sec: 5650.2, 300 sec: 5659.1). Total num frames: 491354112. Throughput: 0: 5850.5. Samples: 491353954. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:11,244][25689] Avg episode reward: [(0, '-41.816')] [2022-07-10 00:05:11,746][26022] Updated weights on worker 0-0, policy_version 479842 (0.00084) [2022-07-10 00:05:13,662][26022] Updated weights on worker 0-0, policy_version 479852 (0.00094) [2022-07-10 00:05:15,489][26022] Updated weights on worker 0-0, policy_version 479862 (0.00091) [2022-07-10 00:05:16,275][25689] Fps is (10 sec: 5683.9, 60 sec: 5682.0, 300 sec: 5658.6). Total num frames: 491382784. Throughput: 0: 5848.3. Samples: 491388154. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:16,276][25689] Avg episode reward: [(0, '-41.461')] [2022-07-10 00:05:17,230][26022] Updated weights on worker 0-0, policy_version 479872 (0.00095) [2022-07-10 00:05:18,988][26022] Updated weights on worker 0-0, policy_version 479882 (0.00083) [2022-07-10 00:05:20,726][26022] Updated weights on worker 0-0, policy_version 479892 (0.00088) [2022-07-10 00:05:21,367][25689] Fps is (10 sec: 5562.9, 60 sec: 5657.2, 300 sec: 5650.1). Total num frames: 491410432. Throughput: 0: 4998.6. Samples: 491405472. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:21,369][25689] Avg episode reward: [(0, '-41.564')] [2022-07-10 00:05:22,510][26022] Updated weights on worker 0-0, policy_version 479902 (0.00091) [2022-07-10 00:05:24,570][26022] Updated weights on worker 0-0, policy_version 479912 (0.00085) [2022-07-10 00:05:26,082][26022] Updated weights on worker 0-0, policy_version 479922 (0.00089) [2022-07-10 00:05:26,437][25689] Fps is (10 sec: 5743.8, 60 sec: 5651.0, 300 sec: 5660.0). Total num frames: 491441152. Throughput: 0: 5952.7. Samples: 491440026. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:26,437][25689] Avg episode reward: [(0, '-42.135')] [2022-07-10 00:05:28,027][26022] Updated weights on worker 0-0, policy_version 479932 (0.00094) [2022-07-10 00:05:29,745][26022] Updated weights on worker 0-0, policy_version 479942 (0.00089) [2022-07-10 00:05:31,499][25689] Fps is (10 sec: 5963.0, 60 sec: 5702.2, 300 sec: 5665.9). Total num frames: 491470848. Throughput: 0: 5935.5. Samples: 491474184. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:31,500][25689] Avg episode reward: [(0, '-43.874')] [2022-07-10 00:05:31,505][26022] Updated weights on worker 0-0, policy_version 479952 (0.00093) [2022-07-10 00:05:33,446][26022] Updated weights on worker 0-0, policy_version 479962 (0.00094) [2022-07-10 00:05:34,991][26022] Updated weights on worker 0-0, policy_version 479972 (0.00084) [2022-07-10 00:05:36,517][25689] Fps is (10 sec: 5587.2, 60 sec: 5634.4, 300 sec: 5658.7). Total num frames: 491497472. Throughput: 0: 5094.3. Samples: 491491274. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:36,518][25689] Avg episode reward: [(0, '-43.716')] [2022-07-10 00:05:36,990][26022] Updated weights on worker 0-0, policy_version 479982 (0.00461) [2022-07-10 00:05:38,548][26022] Updated weights on worker 0-0, policy_version 479992 (0.00087) [2022-07-10 00:05:40,485][26022] Updated weights on worker 0-0, policy_version 480002 (0.00093) [2022-07-10 00:05:41,540][25689] Fps is (10 sec: 5710.9, 60 sec: 5684.3, 300 sec: 5665.4). Total num frames: 491528192. Throughput: 0: 5966.9. Samples: 491525842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:41,541][25689] Avg episode reward: [(0, '-43.249')] [2022-07-10 00:05:42,235][26022] Updated weights on worker 0-0, policy_version 480012 (0.00091) [2022-07-10 00:05:43,286][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:05:43,294][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000480017_491537408.pth [2022-07-10 00:05:43,299][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000478027_489499648.pth [2022-07-10 00:05:44,143][26022] Updated weights on worker 0-0, policy_version 480022 (0.00617) [2022-07-10 00:05:45,775][26022] Updated weights on worker 0-0, policy_version 480032 (0.00092) [2022-07-10 00:05:46,560][25689] Fps is (10 sec: 5913.9, 60 sec: 5667.0, 300 sec: 5665.8). Total num frames: 491556864. Throughput: 0: 5975.2. Samples: 491560266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:46,560][25689] Avg episode reward: [(0, '-42.874')] [2022-07-10 00:05:47,740][26022] Updated weights on worker 0-0, policy_version 480042 (0.00087) [2022-07-10 00:05:49,407][26022] Updated weights on worker 0-0, policy_version 480052 (0.00093) [2022-07-10 00:05:51,274][26022] Updated weights on worker 0-0, policy_version 480062 (0.00094) [2022-07-10 00:05:51,648][25689] Fps is (10 sec: 5774.7, 60 sec: 5721.0, 300 sec: 5668.0). Total num frames: 491586560. Throughput: 0: 5978.4. Samples: 491594642. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:51,648][25689] Avg episode reward: [(0, '-43.794')] [2022-07-10 00:05:53,003][26022] Updated weights on worker 0-0, policy_version 480072 (0.00093) [2022-07-10 00:05:54,797][26022] Updated weights on worker 0-0, policy_version 480082 (0.00088) [2022-07-10 00:05:56,650][26022] Updated weights on worker 0-0, policy_version 480092 (0.00084) [2022-07-10 00:05:56,655][25689] Fps is (10 sec: 5782.0, 60 sec: 5704.6, 300 sec: 5671.8). Total num frames: 491615232. Throughput: 0: 5995.8. Samples: 491612014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:05:56,655][25689] Avg episode reward: [(0, '-43.801')] [2022-07-10 00:05:58,307][26022] Updated weights on worker 0-0, policy_version 480102 (0.00085) [2022-07-10 00:06:00,098][26022] Updated weights on worker 0-0, policy_version 480112 (0.00090) [2022-07-10 00:06:01,723][25689] Fps is (10 sec: 5488.6, 60 sec: 5649.0, 300 sec: 5663.9). Total num frames: 491641856. Throughput: 0: 5985.3. Samples: 491646640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:01,723][25689] Avg episode reward: [(0, '-44.764')] [2022-07-10 00:06:02,249][26022] Updated weights on worker 0-0, policy_version 480122 (0.00084) [2022-07-10 00:06:04,050][26022] Updated weights on worker 0-0, policy_version 480132 (0.00090) [2022-07-10 00:06:05,772][26022] Updated weights on worker 0-0, policy_version 480142 (0.00734) [2022-07-10 00:06:06,746][25689] Fps is (10 sec: 5377.8, 60 sec: 5681.3, 300 sec: 5665.2). Total num frames: 491669504. Throughput: 0: 5877.6. Samples: 491678914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:06,747][25689] Avg episode reward: [(0, '-44.034')] [2022-07-10 00:06:07,820][26022] Updated weights on worker 0-0, policy_version 480152 (0.00089) [2022-07-10 00:06:09,419][26022] Updated weights on worker 0-0, policy_version 480162 (0.00086) [2022-07-10 00:06:11,123][26022] Updated weights on worker 0-0, policy_version 480172 (0.01294) [2022-07-10 00:06:11,836][25689] Fps is (10 sec: 5670.2, 60 sec: 5695.2, 300 sec: 5670.4). Total num frames: 491699200. Throughput: 0: 5021.0. Samples: 491696006. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:11,836][25689] Avg episode reward: [(0, '-45.145')] [2022-07-10 00:06:13,243][26022] Updated weights on worker 0-0, policy_version 480182 (0.00096) [2022-07-10 00:06:14,752][26022] Updated weights on worker 0-0, policy_version 480192 (0.00091) [2022-07-10 00:06:16,785][26022] Updated weights on worker 0-0, policy_version 480202 (0.00087) [2022-07-10 00:06:16,839][25689] Fps is (10 sec: 5783.1, 60 sec: 5697.9, 300 sec: 5670.7). Total num frames: 491727872. Throughput: 0: 5850.3. Samples: 491730100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:16,840][25689] Avg episode reward: [(0, '-45.595')] [2022-07-10 00:06:18,434][26022] Updated weights on worker 0-0, policy_version 480212 (0.00084) [2022-07-10 00:06:20,253][26022] Updated weights on worker 0-0, policy_version 480222 (0.01332) [2022-07-10 00:06:21,860][25689] Fps is (10 sec: 5618.4, 60 sec: 5704.6, 300 sec: 5664.1). Total num frames: 491755520. Throughput: 0: 5857.9. Samples: 491764602. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:21,860][25689] Avg episode reward: [(0, '-44.163')] [2022-07-10 00:06:22,139][26022] Updated weights on worker 0-0, policy_version 480232 (0.00093) [2022-07-10 00:06:23,862][26022] Updated weights on worker 0-0, policy_version 480242 (0.00093) [2022-07-10 00:06:25,661][26022] Updated weights on worker 0-0, policy_version 480252 (0.00088) [2022-07-10 00:06:26,887][25689] Fps is (10 sec: 5707.1, 60 sec: 5691.7, 300 sec: 5664.5). Total num frames: 491785216. Throughput: 0: 5116.1. Samples: 491781956. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:26,888][25689] Avg episode reward: [(0, '-43.300')] [2022-07-10 00:06:27,539][26022] Updated weights on worker 0-0, policy_version 480262 (0.00095) [2022-07-10 00:06:29,199][26022] Updated weights on worker 0-0, policy_version 480272 (0.00090) [2022-07-10 00:06:31,039][26022] Updated weights on worker 0-0, policy_version 480282 (0.00089) [2022-07-10 00:06:31,940][25689] Fps is (10 sec: 5689.1, 60 sec: 5658.7, 300 sec: 5663.5). Total num frames: 491812864. Throughput: 0: 5963.5. Samples: 491815894. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:31,940][25689] Avg episode reward: [(0, '-43.869')] [2022-07-10 00:06:32,866][26022] Updated weights on worker 0-0, policy_version 480292 (0.00090) [2022-07-10 00:06:34,830][26022] Updated weights on worker 0-0, policy_version 480302 (0.00096) [2022-07-10 00:06:36,503][26022] Updated weights on worker 0-0, policy_version 480312 (0.00094) [2022-07-10 00:06:36,942][25689] Fps is (10 sec: 5601.4, 60 sec: 5694.1, 300 sec: 5667.5). Total num frames: 491841536. Throughput: 0: 5956.0. Samples: 491849830. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:36,942][25689] Avg episode reward: [(0, '-43.484')] [2022-07-10 00:06:38,420][26022] Updated weights on worker 0-0, policy_version 480322 (0.00144) [2022-07-10 00:06:40,086][26022] Updated weights on worker 0-0, policy_version 480332 (0.00084) [2022-07-10 00:06:41,945][25689] Fps is (10 sec: 5629.0, 60 sec: 5645.1, 300 sec: 5657.4). Total num frames: 491869184. Throughput: 0: 5081.3. Samples: 491866660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:41,946][25689] Avg episode reward: [(0, '-43.856')] [2022-07-10 00:06:42,180][26022] Updated weights on worker 0-0, policy_version 480342 (0.00086) [2022-07-10 00:06:43,632][26022] Updated weights on worker 0-0, policy_version 480352 (0.00080) [2022-07-10 00:06:45,620][26022] Updated weights on worker 0-0, policy_version 480362 (0.00096) [2022-07-10 00:06:46,956][25689] Fps is (10 sec: 5623.8, 60 sec: 5645.9, 300 sec: 5662.8). Total num frames: 491897856. Throughput: 0: 5907.6. Samples: 491900518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:46,957][25689] Avg episode reward: [(0, '-44.306')] [2022-07-10 00:06:47,504][26022] Updated weights on worker 0-0, policy_version 480372 (0.00093) [2022-07-10 00:06:49,257][26022] Updated weights on worker 0-0, policy_version 480382 (0.00096) [2022-07-10 00:06:51,038][26022] Updated weights on worker 0-0, policy_version 480392 (0.00086) [2022-07-10 00:06:52,110][25689] Fps is (10 sec: 5742.2, 60 sec: 5639.8, 300 sec: 5661.3). Total num frames: 491927552. Throughput: 0: 5887.2. Samples: 491934640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:52,110][25689] Avg episode reward: [(0, '-44.925')] [2022-07-10 00:06:52,946][26022] Updated weights on worker 0-0, policy_version 480402 (0.00083) [2022-07-10 00:06:54,484][26022] Updated weights on worker 0-0, policy_version 480412 (0.00091) [2022-07-10 00:06:56,467][26022] Updated weights on worker 0-0, policy_version 480422 (0.00081) [2022-07-10 00:06:57,136][25689] Fps is (10 sec: 5633.0, 60 sec: 5621.0, 300 sec: 5657.6). Total num frames: 491955200. Throughput: 0: 5055.3. Samples: 491951920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:06:57,138][25689] Avg episode reward: [(0, '-44.956')] [2022-07-10 00:06:57,945][26022] Updated weights on worker 0-0, policy_version 480432 (0.00089) [2022-07-10 00:07:00,106][26022] Updated weights on worker 0-0, policy_version 480442 (0.00091) [2022-07-10 00:07:01,717][26022] Updated weights on worker 0-0, policy_version 480452 (0.00083) [2022-07-10 00:07:02,234][25689] Fps is (10 sec: 5663.9, 60 sec: 5669.0, 300 sec: 5667.5). Total num frames: 491984896. Throughput: 0: 5891.5. Samples: 491986194. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:02,239][25689] Avg episode reward: [(0, '-44.862')] [2022-07-10 00:07:04,019][26022] Updated weights on worker 0-0, policy_version 480462 (0.00084) [2022-07-10 00:07:05,616][26022] Updated weights on worker 0-0, policy_version 480472 (0.00089) [2022-07-10 00:07:07,330][25689] Fps is (10 sec: 5525.0, 60 sec: 5645.4, 300 sec: 5661.0). Total num frames: 492011520. Throughput: 0: 5773.8. Samples: 492018150. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:07,330][25689] Avg episode reward: [(0, '-44.764')] [2022-07-10 00:07:07,478][26022] Updated weights on worker 0-0, policy_version 480482 (0.00086) [2022-07-10 00:07:09,521][26022] Updated weights on worker 0-0, policy_version 480492 (0.00086) [2022-07-10 00:07:11,056][26022] Updated weights on worker 0-0, policy_version 480502 (0.00081) [2022-07-10 00:07:12,419][25689] Fps is (10 sec: 5429.2, 60 sec: 5628.5, 300 sec: 5656.5). Total num frames: 492040192. Throughput: 0: 4955.5. Samples: 492035284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:12,420][25689] Avg episode reward: [(0, '-44.283')] [2022-07-10 00:07:13,122][26022] Updated weights on worker 0-0, policy_version 480512 (0.00076) [2022-07-10 00:07:14,616][26022] Updated weights on worker 0-0, policy_version 480522 (0.00087) [2022-07-10 00:07:16,496][26022] Updated weights on worker 0-0, policy_version 480532 (0.00093) [2022-07-10 00:07:17,423][25689] Fps is (10 sec: 5783.0, 60 sec: 5645.3, 300 sec: 5664.3). Total num frames: 492069888. Throughput: 0: 5806.4. Samples: 492069712. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:17,424][25689] Avg episode reward: [(0, '-43.257')] [2022-07-10 00:07:18,511][26022] Updated weights on worker 0-0, policy_version 480542 (0.00088) [2022-07-10 00:07:20,126][26022] Updated weights on worker 0-0, policy_version 480552 (0.00082) [2022-07-10 00:07:22,008][26022] Updated weights on worker 0-0, policy_version 480562 (0.00088) [2022-07-10 00:07:22,430][25689] Fps is (10 sec: 5728.4, 60 sec: 5646.7, 300 sec: 5657.7). Total num frames: 492097536. Throughput: 0: 5830.8. Samples: 492103948. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:22,430][25689] Avg episode reward: [(0, '-43.432')] [2022-07-10 00:07:23,844][26022] Updated weights on worker 0-0, policy_version 480572 (0.00085) [2022-07-10 00:07:25,276][26022] Updated weights on worker 0-0, policy_version 480582 (0.00087) [2022-07-10 00:07:27,327][26022] Updated weights on worker 0-0, policy_version 480592 (0.00083) [2022-07-10 00:07:27,449][25689] Fps is (10 sec: 5617.6, 60 sec: 5630.5, 300 sec: 5661.5). Total num frames: 492126208. Throughput: 0: 5134.8. Samples: 492121456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:27,449][25689] Avg episode reward: [(0, '-43.621')] [2022-07-10 00:07:28,742][26022] Updated weights on worker 0-0, policy_version 480602 (0.00092) [2022-07-10 00:07:30,871][26022] Updated weights on worker 0-0, policy_version 480612 (0.00085) [2022-07-10 00:07:32,507][25689] Fps is (10 sec: 5792.2, 60 sec: 5663.8, 300 sec: 5664.4). Total num frames: 492155904. Throughput: 0: 5997.8. Samples: 492155764. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:32,507][25689] Avg episode reward: [(0, '-43.443')] [2022-07-10 00:07:32,611][26022] Updated weights on worker 0-0, policy_version 480622 (0.00086) [2022-07-10 00:07:34,342][26022] Updated weights on worker 0-0, policy_version 480632 (0.00086) [2022-07-10 00:07:36,253][26022] Updated weights on worker 0-0, policy_version 480642 (0.00959) [2022-07-10 00:07:37,532][25689] Fps is (10 sec: 5788.3, 60 sec: 5661.6, 300 sec: 5664.1). Total num frames: 492184576. Throughput: 0: 5985.2. Samples: 492190070. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:37,533][25689] Avg episode reward: [(0, '-43.716')] [2022-07-10 00:07:38,098][26022] Updated weights on worker 0-0, policy_version 480652 (0.00091) [2022-07-10 00:07:39,838][26022] Updated weights on worker 0-0, policy_version 480662 (0.00097) [2022-07-10 00:07:41,587][26022] Updated weights on worker 0-0, policy_version 480672 (0.00092) [2022-07-10 00:07:42,538][25689] Fps is (10 sec: 5716.4, 60 sec: 5678.3, 300 sec: 5665.1). Total num frames: 492213248. Throughput: 0: 5131.3. Samples: 492207130. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:42,539][25689] Avg episode reward: [(0, '-44.598')] [2022-07-10 00:07:43,304][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:07:43,321][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000480681_492217344.pth [2022-07-10 00:07:43,321][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000478689_490177536.pth [2022-07-10 00:07:43,533][26022] Updated weights on worker 0-0, policy_version 480682 (0.00088) [2022-07-10 00:07:45,094][26022] Updated weights on worker 0-0, policy_version 480692 (0.00092) [2022-07-10 00:07:47,208][26022] Updated weights on worker 0-0, policy_version 480702 (0.00085) [2022-07-10 00:07:47,571][25689] Fps is (10 sec: 5508.4, 60 sec: 5642.5, 300 sec: 5663.2). Total num frames: 492239872. Throughput: 0: 5944.7. Samples: 492241076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:47,571][25689] Avg episode reward: [(0, '-44.878')] [2022-07-10 00:07:48,695][26022] Updated weights on worker 0-0, policy_version 480712 (0.00088) [2022-07-10 00:07:50,657][26022] Updated weights on worker 0-0, policy_version 480722 (0.00053) [2022-07-10 00:07:52,428][26022] Updated weights on worker 0-0, policy_version 480732 (0.00088) [2022-07-10 00:07:52,611][25689] Fps is (10 sec: 5591.5, 60 sec: 5653.1, 300 sec: 5656.6). Total num frames: 492269568. Throughput: 0: 5942.8. Samples: 492275238. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 00:07:52,611][25689] Avg episode reward: [(0, '-44.858')] [2022-07-10 00:07:54,197][26022] Updated weights on worker 0-0, policy_version 480742 (0.00083) [2022-07-10 00:07:56,251][26022] Updated weights on worker 0-0, policy_version 480752 (0.00097) [2022-07-10 00:07:57,615][25689] Fps is (10 sec: 5913.3, 60 sec: 5689.0, 300 sec: 5667.1). Total num frames: 492299264. Throughput: 0: 5082.6. Samples: 492292140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:07:57,616][25689] Avg episode reward: [(0, '-44.921')] [2022-07-10 00:07:57,843][26022] Updated weights on worker 0-0, policy_version 480762 (0.00088) [2022-07-10 00:07:59,894][26022] Updated weights on worker 0-0, policy_version 480772 (0.00085) [2022-07-10 00:08:01,718][26022] Updated weights on worker 0-0, policy_version 480782 (0.00085) [2022-07-10 00:08:02,617][25689] Fps is (10 sec: 5423.6, 60 sec: 5613.2, 300 sec: 5657.1). Total num frames: 492323840. Throughput: 0: 5916.1. Samples: 492325920. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:02,619][25689] Avg episode reward: [(0, '-44.527')] [2022-07-10 00:08:03,600][26022] Updated weights on worker 0-0, policy_version 480792 (0.00087) [2022-07-10 00:08:05,630][26022] Updated weights on worker 0-0, policy_version 480802 (0.00090) [2022-07-10 00:08:07,233][26022] Updated weights on worker 0-0, policy_version 480812 (0.00088) [2022-07-10 00:08:07,631][25689] Fps is (10 sec: 5316.3, 60 sec: 5654.9, 300 sec: 5658.2). Total num frames: 492352512. Throughput: 0: 5833.5. Samples: 492358096. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:07,632][25689] Avg episode reward: [(0, '-44.714')] [2022-07-10 00:08:09,263][26022] Updated weights on worker 0-0, policy_version 480822 (0.00090) [2022-07-10 00:08:11,100][26022] Updated weights on worker 0-0, policy_version 480832 (0.00095) [2022-07-10 00:08:12,690][25689] Fps is (10 sec: 5692.8, 60 sec: 5657.6, 300 sec: 5664.1). Total num frames: 492381184. Throughput: 0: 4977.7. Samples: 492375188. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:12,692][25689] Avg episode reward: [(0, '-45.195')] [2022-07-10 00:08:12,840][26022] Updated weights on worker 0-0, policy_version 480842 (0.00089) [2022-07-10 00:08:14,486][26022] Updated weights on worker 0-0, policy_version 480852 (0.00084) [2022-07-10 00:08:16,257][26022] Updated weights on worker 0-0, policy_version 480862 (0.00082) [2022-07-10 00:08:17,723][25689] Fps is (10 sec: 5783.2, 60 sec: 5654.9, 300 sec: 5660.2). Total num frames: 492410880. Throughput: 0: 5841.9. Samples: 492409614. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:17,724][25689] Avg episode reward: [(0, '-45.461')] [2022-07-10 00:08:18,010][26022] Updated weights on worker 0-0, policy_version 480872 (0.00058) [2022-07-10 00:08:19,919][26022] Updated weights on worker 0-0, policy_version 480882 (0.00089) [2022-07-10 00:08:21,656][26022] Updated weights on worker 0-0, policy_version 480892 (0.00087) [2022-07-10 00:08:22,736][25689] Fps is (10 sec: 5708.6, 60 sec: 5654.4, 300 sec: 5658.4). Total num frames: 492438528. Throughput: 0: 5861.2. Samples: 492443836. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:22,736][25689] Avg episode reward: [(0, '-45.848')] [2022-07-10 00:08:23,496][26022] Updated weights on worker 0-0, policy_version 480902 (0.00078) [2022-07-10 00:08:25,214][26022] Updated weights on worker 0-0, policy_version 480912 (0.00085) [2022-07-10 00:08:27,113][26022] Updated weights on worker 0-0, policy_version 480922 (0.00088) [2022-07-10 00:08:27,761][25689] Fps is (10 sec: 5610.7, 60 sec: 5653.7, 300 sec: 5663.5). Total num frames: 492467200. Throughput: 0: 5116.8. Samples: 492461102. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:27,762][25689] Avg episode reward: [(0, '-45.163')] [2022-07-10 00:08:28,790][26022] Updated weights on worker 0-0, policy_version 480932 (0.00097) [2022-07-10 00:08:30,734][26022] Updated weights on worker 0-0, policy_version 480942 (0.01177) [2022-07-10 00:08:32,532][26022] Updated weights on worker 0-0, policy_version 480952 (0.00079) [2022-07-10 00:08:32,869][25689] Fps is (10 sec: 5760.0, 60 sec: 5649.1, 300 sec: 5659.3). Total num frames: 492496896. Throughput: 0: 5940.1. Samples: 492495050. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:32,869][25689] Avg episode reward: [(0, '-44.549')] [2022-07-10 00:08:34,476][26022] Updated weights on worker 0-0, policy_version 480962 (0.00083) [2022-07-10 00:08:36,003][26022] Updated weights on worker 0-0, policy_version 480972 (0.00087) [2022-07-10 00:08:37,896][25689] Fps is (10 sec: 5658.1, 60 sec: 5632.0, 300 sec: 5660.6). Total num frames: 492524544. Throughput: 0: 5923.0. Samples: 492529098. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:37,897][25689] Avg episode reward: [(0, '-44.240')] [2022-07-10 00:08:38,139][26022] Updated weights on worker 0-0, policy_version 480982 (0.00087) [2022-07-10 00:08:39,508][26022] Updated weights on worker 0-0, policy_version 480992 (0.00087) [2022-07-10 00:08:41,534][26022] Updated weights on worker 0-0, policy_version 481002 (0.00086) [2022-07-10 00:08:42,913][25689] Fps is (10 sec: 5607.3, 60 sec: 5631.0, 300 sec: 5657.8). Total num frames: 492553216. Throughput: 0: 5077.0. Samples: 492546276. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:42,913][25689] Avg episode reward: [(0, '-43.927')] [2022-07-10 00:08:43,467][26022] Updated weights on worker 0-0, policy_version 481012 (0.00084) [2022-07-10 00:08:44,950][26022] Updated weights on worker 0-0, policy_version 481022 (0.00081) [2022-07-10 00:08:46,993][26022] Updated weights on worker 0-0, policy_version 481032 (0.00091) [2022-07-10 00:08:47,938][25689] Fps is (10 sec: 5710.5, 60 sec: 5665.6, 300 sec: 5665.5). Total num frames: 492581888. Throughput: 0: 5918.9. Samples: 492580526. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:47,940][25689] Avg episode reward: [(0, '-44.476')] [2022-07-10 00:08:48,637][26022] Updated weights on worker 0-0, policy_version 481042 (0.00100) [2022-07-10 00:08:50,584][26022] Updated weights on worker 0-0, policy_version 481052 (0.00089) [2022-07-10 00:08:52,496][26022] Updated weights on worker 0-0, policy_version 481062 (0.00083) [2022-07-10 00:08:52,979][25689] Fps is (10 sec: 5595.1, 60 sec: 5631.6, 300 sec: 5654.9). Total num frames: 492609536. Throughput: 0: 5929.3. Samples: 492614288. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:52,979][25689] Avg episode reward: [(0, '-45.013')] [2022-07-10 00:08:54,162][26022] Updated weights on worker 0-0, policy_version 481072 (0.00535) [2022-07-10 00:08:55,935][26022] Updated weights on worker 0-0, policy_version 481082 (0.00097) [2022-07-10 00:08:57,955][26022] Updated weights on worker 0-0, policy_version 481092 (0.00088) [2022-07-10 00:08:58,059][25689] Fps is (10 sec: 5564.6, 60 sec: 5607.5, 300 sec: 5660.3). Total num frames: 492638208. Throughput: 0: 5919.7. Samples: 492648456. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:08:58,061][25689] Avg episode reward: [(0, '-44.884')] [2022-07-10 00:08:59,347][26022] Updated weights on worker 0-0, policy_version 481102 (0.00079) [2022-07-10 00:09:01,490][26022] Updated weights on worker 0-0, policy_version 481112 (0.00104) [2022-07-10 00:09:03,072][25689] Fps is (10 sec: 5681.4, 60 sec: 5674.3, 300 sec: 5663.6). Total num frames: 492666880. Throughput: 0: 5911.0. Samples: 492665438. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:03,074][25689] Avg episode reward: [(0, '-44.995')] [2022-07-10 00:09:03,340][26022] Updated weights on worker 0-0, policy_version 481122 (0.00096) [2022-07-10 00:09:05,494][26022] Updated weights on worker 0-0, policy_version 481132 (0.00092) [2022-07-10 00:09:07,215][26022] Updated weights on worker 0-0, policy_version 481142 (0.00086) [2022-07-10 00:09:08,087][25689] Fps is (10 sec: 5514.1, 60 sec: 5640.3, 300 sec: 5654.2). Total num frames: 492693504. Throughput: 0: 5798.8. Samples: 492697366. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:08,088][25689] Avg episode reward: [(0, '-44.134')] [2022-07-10 00:09:08,975][26022] Updated weights on worker 0-0, policy_version 481152 (0.00088) [2022-07-10 00:09:10,939][26022] Updated weights on worker 0-0, policy_version 481162 (0.00086) [2022-07-10 00:09:12,404][26022] Updated weights on worker 0-0, policy_version 481172 (0.00092) [2022-07-10 00:09:13,171][25689] Fps is (10 sec: 5678.4, 60 sec: 5671.9, 300 sec: 5666.5). Total num frames: 492724224. Throughput: 0: 5817.6. Samples: 492731756. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:13,174][25689] Avg episode reward: [(0, '-45.498')] [2022-07-10 00:09:14,415][26022] Updated weights on worker 0-0, policy_version 481182 (0.00104) [2022-07-10 00:09:16,063][26022] Updated weights on worker 0-0, policy_version 481192 (0.00085) [2022-07-10 00:09:17,982][26022] Updated weights on worker 0-0, policy_version 481202 (0.00088) [2022-07-10 00:09:18,218][25689] Fps is (10 sec: 5660.3, 60 sec: 5619.8, 300 sec: 5658.9). Total num frames: 492750848. Throughput: 0: 4975.6. Samples: 492748760. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:18,220][25689] Avg episode reward: [(0, '-44.936')] [2022-07-10 00:09:19,752][26022] Updated weights on worker 0-0, policy_version 481212 (0.00082) [2022-07-10 00:09:21,683][26022] Updated weights on worker 0-0, policy_version 481222 (0.00087) [2022-07-10 00:09:23,234][25689] Fps is (10 sec: 5596.6, 60 sec: 5653.3, 300 sec: 5655.2). Total num frames: 492780544. Throughput: 0: 5816.3. Samples: 492782706. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:23,235][25689] Avg episode reward: [(0, '-44.291')] [2022-07-10 00:09:23,284][26022] Updated weights on worker 0-0, policy_version 481232 (0.00094) [2022-07-10 00:09:25,336][26022] Updated weights on worker 0-0, policy_version 481242 (0.00089) [2022-07-10 00:09:26,914][26022] Updated weights on worker 0-0, policy_version 481252 (0.00085) [2022-07-10 00:09:28,253][25689] Fps is (10 sec: 5714.7, 60 sec: 5637.0, 300 sec: 5659.6). Total num frames: 492808192. Throughput: 0: 5933.0. Samples: 492817008. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:28,253][25689] Avg episode reward: [(0, '-45.382')] [2022-07-10 00:09:28,908][26022] Updated weights on worker 0-0, policy_version 481262 (0.00084) [2022-07-10 00:09:30,620][26022] Updated weights on worker 0-0, policy_version 481272 (0.00091) [2022-07-10 00:09:32,170][26022] Updated weights on worker 0-0, policy_version 481282 (0.00089) [2022-07-10 00:09:33,328][25689] Fps is (10 sec: 5579.6, 60 sec: 5623.1, 300 sec: 5651.6). Total num frames: 492836864. Throughput: 0: 5078.3. Samples: 492834120. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:33,329][25689] Avg episode reward: [(0, '-45.734')] [2022-07-10 00:09:34,123][26022] Updated weights on worker 0-0, policy_version 481292 (0.00092) [2022-07-10 00:09:35,917][26022] Updated weights on worker 0-0, policy_version 481302 (0.00085) [2022-07-10 00:09:37,713][26022] Updated weights on worker 0-0, policy_version 481312 (0.00087) [2022-07-10 00:09:38,405][25689] Fps is (10 sec: 5850.2, 60 sec: 5669.3, 300 sec: 5660.7). Total num frames: 492867584. Throughput: 0: 5951.1. Samples: 492868892. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:38,405][25689] Avg episode reward: [(0, '-44.840')] [2022-07-10 00:09:39,480][26022] Updated weights on worker 0-0, policy_version 481322 (0.00080) [2022-07-10 00:09:41,118][26022] Updated weights on worker 0-0, policy_version 481332 (0.00090) [2022-07-10 00:09:42,954][26022] Updated weights on worker 0-0, policy_version 481342 (0.00091) [2022-07-10 00:09:43,466][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:09:43,476][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000481344_492896256.pth [2022-07-10 00:09:43,477][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000479353_490857472.pth [2022-07-10 00:09:43,478][25689] Fps is (10 sec: 5851.7, 60 sec: 5664.0, 300 sec: 5656.2). Total num frames: 492896256. Throughput: 0: 5973.1. Samples: 492903622. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:43,478][25689] Avg episode reward: [(0, '-44.450')] [2022-07-10 00:09:44,847][26022] Updated weights on worker 0-0, policy_version 481352 (0.00088) [2022-07-10 00:09:46,546][26022] Updated weights on worker 0-0, policy_version 481362 (0.00082) [2022-07-10 00:09:48,446][26022] Updated weights on worker 0-0, policy_version 481372 (0.00094) [2022-07-10 00:09:48,544][25689] Fps is (10 sec: 5656.0, 60 sec: 5660.2, 300 sec: 5664.1). Total num frames: 492924928. Throughput: 0: 5117.8. Samples: 492920852. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:48,544][25689] Avg episode reward: [(0, '-44.442')] [2022-07-10 00:09:50,059][26022] Updated weights on worker 0-0, policy_version 481382 (0.00091) [2022-07-10 00:09:52,016][26022] Updated weights on worker 0-0, policy_version 481392 (0.00087) [2022-07-10 00:09:53,658][25689] Fps is (10 sec: 5733.6, 60 sec: 5687.1, 300 sec: 5662.2). Total num frames: 492954624. Throughput: 0: 5942.7. Samples: 492954932. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:53,658][25689] Avg episode reward: [(0, '-43.768')] [2022-07-10 00:09:53,952][26022] Updated weights on worker 0-0, policy_version 481402 (0.00094) [2022-07-10 00:09:55,551][26022] Updated weights on worker 0-0, policy_version 481412 (0.00092) [2022-07-10 00:09:57,430][26022] Updated weights on worker 0-0, policy_version 481422 (0.00099) [2022-07-10 00:09:58,703][25689] Fps is (10 sec: 5745.4, 60 sec: 5690.4, 300 sec: 5658.2). Total num frames: 492983296. Throughput: 0: 5932.6. Samples: 492989310. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:09:58,704][25689] Avg episode reward: [(0, '-42.897')] [2022-07-10 00:09:59,154][26022] Updated weights on worker 0-0, policy_version 481432 (0.00087) [2022-07-10 00:10:01,177][26022] Updated weights on worker 0-0, policy_version 481442 (0.00085) [2022-07-10 00:10:03,147][26022] Updated weights on worker 0-0, policy_version 481452 (0.00086) [2022-07-10 00:10:03,719][25689] Fps is (10 sec: 5496.0, 60 sec: 5656.4, 300 sec: 5661.5). Total num frames: 493009920. Throughput: 0: 5067.9. Samples: 493006202. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:03,719][25689] Avg episode reward: [(0, '-42.541')] [2022-07-10 00:10:04,959][26022] Updated weights on worker 0-0, policy_version 481462 (0.00060) [2022-07-10 00:10:06,854][26022] Updated weights on worker 0-0, policy_version 481472 (0.00091) [2022-07-10 00:10:08,561][26022] Updated weights on worker 0-0, policy_version 481482 (0.00089) [2022-07-10 00:10:08,750][25689] Fps is (10 sec: 5401.7, 60 sec: 5671.8, 300 sec: 5658.5). Total num frames: 493037568. Throughput: 0: 5809.4. Samples: 493038238. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:08,751][25689] Avg episode reward: [(0, '-41.928')] [2022-07-10 00:10:10,398][26022] Updated weights on worker 0-0, policy_version 481492 (0.00088) [2022-07-10 00:10:12,275][26022] Updated weights on worker 0-0, policy_version 481502 (0.00088) [2022-07-10 00:10:13,823][25689] Fps is (10 sec: 5675.6, 60 sec: 5655.9, 300 sec: 5661.2). Total num frames: 493067264. Throughput: 0: 5837.3. Samples: 493072640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:13,823][25689] Avg episode reward: [(0, '-41.948')] [2022-07-10 00:10:13,876][26022] Updated weights on worker 0-0, policy_version 481512 (0.00092) [2022-07-10 00:10:15,860][26022] Updated weights on worker 0-0, policy_version 481522 (0.00087) [2022-07-10 00:10:17,507][26022] Updated weights on worker 0-0, policy_version 481532 (0.00086) [2022-07-10 00:10:18,855][25689] Fps is (10 sec: 5775.9, 60 sec: 5691.0, 300 sec: 5665.8). Total num frames: 493095936. Throughput: 0: 4992.6. Samples: 493089924. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:18,856][25689] Avg episode reward: [(0, '-42.239')] [2022-07-10 00:10:19,402][26022] Updated weights on worker 0-0, policy_version 481542 (0.00089) [2022-07-10 00:10:21,283][26022] Updated weights on worker 0-0, policy_version 481552 (0.00088) [2022-07-10 00:10:22,755][26022] Updated weights on worker 0-0, policy_version 481562 (0.00106) [2022-07-10 00:10:23,906][25689] Fps is (10 sec: 5687.1, 60 sec: 5670.9, 300 sec: 5659.3). Total num frames: 493124608. Throughput: 0: 5830.4. Samples: 493123900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:23,906][25689] Avg episode reward: [(0, '-42.951')] [2022-07-10 00:10:24,991][26022] Updated weights on worker 0-0, policy_version 481572 (0.00093) [2022-07-10 00:10:26,586][26022] Updated weights on worker 0-0, policy_version 481582 (0.00057) [2022-07-10 00:10:28,401][26022] Updated weights on worker 0-0, policy_version 481592 (0.00092) [2022-07-10 00:10:28,921][25689] Fps is (10 sec: 5798.6, 60 sec: 5705.0, 300 sec: 5660.2). Total num frames: 493154304. Throughput: 0: 5959.0. Samples: 493158440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:28,922][25689] Avg episode reward: [(0, '-43.256')] [2022-07-10 00:10:30,064][26022] Updated weights on worker 0-0, policy_version 481602 (0.00084) [2022-07-10 00:10:31,933][26022] Updated weights on worker 0-0, policy_version 481612 (0.00086) [2022-07-10 00:10:33,524][26022] Updated weights on worker 0-0, policy_version 481622 (0.00080) [2022-07-10 00:10:33,981][25689] Fps is (10 sec: 5691.3, 60 sec: 5689.5, 300 sec: 5662.8). Total num frames: 493181952. Throughput: 0: 5101.9. Samples: 493175488. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:33,982][25689] Avg episode reward: [(0, '-43.608')] [2022-07-10 00:10:35,519][26022] Updated weights on worker 0-0, policy_version 481632 (0.00082) [2022-07-10 00:10:37,301][26022] Updated weights on worker 0-0, policy_version 481642 (0.00093) [2022-07-10 00:10:39,018][25689] Fps is (10 sec: 5679.2, 60 sec: 5676.4, 300 sec: 5659.1). Total num frames: 493211648. Throughput: 0: 5947.1. Samples: 493209836. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 00:10:39,023][25689] Avg episode reward: [(0, '-45.028')] [2022-07-10 00:10:39,030][26022] Updated weights on worker 0-0, policy_version 481652 (0.00115) [2022-07-10 00:10:41,106][26022] Updated weights on worker 0-0, policy_version 481662 (0.00088) [2022-07-10 00:10:42,372][26022] Updated weights on worker 0-0, policy_version 481672 (0.00090) [2022-07-10 00:10:44,054][25689] Fps is (10 sec: 5693.1, 60 sec: 5662.9, 300 sec: 5655.3). Total num frames: 493239296. Throughput: 0: 5968.7. Samples: 493244158. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:10:44,055][25689] Avg episode reward: [(0, '-45.224')] [2022-07-10 00:10:44,693][26022] Updated weights on worker 0-0, policy_version 481682 (0.00083) [2022-07-10 00:10:46,142][26022] Updated weights on worker 0-0, policy_version 481692 (0.00088) [2022-07-10 00:10:48,040][26022] Updated weights on worker 0-0, policy_version 481702 (0.00090) [2022-07-10 00:10:49,111][25689] Fps is (10 sec: 5478.8, 60 sec: 5646.8, 300 sec: 5649.0). Total num frames: 493266944. Throughput: 0: 5086.2. Samples: 493261130. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:10:49,112][25689] Avg episode reward: [(0, '-45.830')] [2022-07-10 00:10:49,951][26022] Updated weights on worker 0-0, policy_version 481712 (0.00094) [2022-07-10 00:10:51,450][26022] Updated weights on worker 0-0, policy_version 481722 (0.00096) [2022-07-10 00:10:53,568][26022] Updated weights on worker 0-0, policy_version 481732 (0.00088) [2022-07-10 00:10:54,181][25689] Fps is (10 sec: 5662.2, 60 sec: 5650.9, 300 sec: 5651.3). Total num frames: 493296640. Throughput: 0: 5919.7. Samples: 493295066. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:10:54,191][25689] Avg episode reward: [(0, '-45.224')] [2022-07-10 00:10:55,063][26022] Updated weights on worker 0-0, policy_version 481742 (0.00088) [2022-07-10 00:10:57,084][26022] Updated weights on worker 0-0, policy_version 481752 (0.00087) [2022-07-10 00:10:58,922][26022] Updated weights on worker 0-0, policy_version 481762 (0.00365) [2022-07-10 00:10:59,196][25689] Fps is (10 sec: 5686.0, 60 sec: 5636.8, 300 sec: 5655.7). Total num frames: 493324288. Throughput: 0: 5915.0. Samples: 493329188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:10:59,197][25689] Avg episode reward: [(0, '-44.928')] [2022-07-10 00:11:00,819][26022] Updated weights on worker 0-0, policy_version 481772 (0.00089) [2022-07-10 00:11:02,945][26022] Updated weights on worker 0-0, policy_version 481782 (0.00085) [2022-07-10 00:11:04,215][25689] Fps is (10 sec: 5511.0, 60 sec: 5653.5, 300 sec: 5655.8). Total num frames: 493351936. Throughput: 0: 5823.2. Samples: 493361560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:04,216][25689] Avg episode reward: [(0, '-45.859')] [2022-07-10 00:11:04,688][26022] Updated weights on worker 0-0, policy_version 481792 (0.00093) [2022-07-10 00:11:06,515][26022] Updated weights on worker 0-0, policy_version 481802 (0.00088) [2022-07-10 00:11:08,244][26022] Updated weights on worker 0-0, policy_version 481812 (0.00094) [2022-07-10 00:11:09,261][25689] Fps is (10 sec: 5697.5, 60 sec: 5685.9, 300 sec: 5656.7). Total num frames: 493381632. Throughput: 0: 5829.6. Samples: 493378596. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:09,262][25689] Avg episode reward: [(0, '-45.332')] [2022-07-10 00:11:10,390][26022] Updated weights on worker 0-0, policy_version 481822 (0.00090) [2022-07-10 00:11:11,749][26022] Updated weights on worker 0-0, policy_version 481832 (0.00080) [2022-07-10 00:11:13,790][26022] Updated weights on worker 0-0, policy_version 481842 (0.00083) [2022-07-10 00:11:14,307][25689] Fps is (10 sec: 5682.2, 60 sec: 5654.6, 300 sec: 5652.4). Total num frames: 493409280. Throughput: 0: 5849.3. Samples: 493412786. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:14,308][25689] Avg episode reward: [(0, '-45.898')] [2022-07-10 00:11:15,469][26022] Updated weights on worker 0-0, policy_version 481852 (0.00086) [2022-07-10 00:11:17,331][26022] Updated weights on worker 0-0, policy_version 481862 (0.00094) [2022-07-10 00:11:19,178][26022] Updated weights on worker 0-0, policy_version 481872 (0.00088) [2022-07-10 00:11:19,329][25689] Fps is (10 sec: 5696.0, 60 sec: 5672.6, 300 sec: 5659.3). Total num frames: 493438976. Throughput: 0: 5858.1. Samples: 493447124. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:19,329][25689] Avg episode reward: [(0, '-45.910')] [2022-07-10 00:11:20,748][26022] Updated weights on worker 0-0, policy_version 481882 (0.00086) [2022-07-10 00:11:22,599][26022] Updated weights on worker 0-0, policy_version 481892 (0.00088) [2022-07-10 00:11:24,364][25689] Fps is (10 sec: 5702.0, 60 sec: 5657.1, 300 sec: 5652.2). Total num frames: 493466624. Throughput: 0: 5099.1. Samples: 493464300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:24,364][25689] Avg episode reward: [(0, '-46.470')] [2022-07-10 00:11:24,636][26022] Updated weights on worker 0-0, policy_version 481902 (0.00084) [2022-07-10 00:11:26,207][26022] Updated weights on worker 0-0, policy_version 481912 (0.00082) [2022-07-10 00:11:28,129][26022] Updated weights on worker 0-0, policy_version 481922 (0.00100) [2022-07-10 00:11:29,372][25689] Fps is (10 sec: 5607.7, 60 sec: 5640.8, 300 sec: 5656.5). Total num frames: 493495296. Throughput: 0: 5966.0. Samples: 493498576. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:29,373][25689] Avg episode reward: [(0, '-46.654')] [2022-07-10 00:11:29,746][26022] Updated weights on worker 0-0, policy_version 481932 (0.00092) [2022-07-10 00:11:31,712][26022] Updated weights on worker 0-0, policy_version 481942 (0.00090) [2022-07-10 00:11:33,531][26022] Updated weights on worker 0-0, policy_version 481952 (0.00083) [2022-07-10 00:11:34,451][25689] Fps is (10 sec: 5684.7, 60 sec: 5656.0, 300 sec: 5655.1). Total num frames: 493523968. Throughput: 0: 5954.3. Samples: 493532730. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:34,452][25689] Avg episode reward: [(0, '-46.143')] [2022-07-10 00:11:35,056][26022] Updated weights on worker 0-0, policy_version 481962 (0.00094) [2022-07-10 00:11:37,146][26022] Updated weights on worker 0-0, policy_version 481972 (0.00085) [2022-07-10 00:11:38,815][26022] Updated weights on worker 0-0, policy_version 481982 (0.00090) [2022-07-10 00:11:39,468][25689] Fps is (10 sec: 5680.0, 60 sec: 5641.0, 300 sec: 5658.2). Total num frames: 493552640. Throughput: 0: 5109.3. Samples: 493550022. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:39,468][25689] Avg episode reward: [(0, '-46.526')] [2022-07-10 00:11:40,506][26022] Updated weights on worker 0-0, policy_version 481992 (0.00080) [2022-07-10 00:11:42,499][26022] Updated weights on worker 0-0, policy_version 482002 (0.00087) [2022-07-10 00:11:43,526][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:11:43,540][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000482008_493576192.pth [2022-07-10 00:11:43,540][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000480017_491537408.pth [2022-07-10 00:11:44,175][26022] Updated weights on worker 0-0, policy_version 482012 (0.00081) [2022-07-10 00:11:44,511][25689] Fps is (10 sec: 5700.5, 60 sec: 5657.2, 300 sec: 5657.6). Total num frames: 493581312. Throughput: 0: 5947.8. Samples: 493584130. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:44,511][25689] Avg episode reward: [(0, '-45.677')] [2022-07-10 00:11:46,213][26022] Updated weights on worker 0-0, policy_version 482022 (0.00088) [2022-07-10 00:11:47,736][26022] Updated weights on worker 0-0, policy_version 482032 (0.00079) [2022-07-10 00:11:49,535][25689] Fps is (10 sec: 5696.2, 60 sec: 5677.2, 300 sec: 5656.7). Total num frames: 493609984. Throughput: 0: 5946.5. Samples: 493618476. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:49,535][25689] Avg episode reward: [(0, '-45.097')] [2022-07-10 00:11:49,653][26022] Updated weights on worker 0-0, policy_version 482042 (0.00083) [2022-07-10 00:11:51,288][26022] Updated weights on worker 0-0, policy_version 482052 (0.00105) [2022-07-10 00:11:53,155][26022] Updated weights on worker 0-0, policy_version 482062 (0.00083) [2022-07-10 00:11:54,575][25689] Fps is (10 sec: 5595.9, 60 sec: 5646.1, 300 sec: 5656.4). Total num frames: 493637632. Throughput: 0: 5103.0. Samples: 493635422. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:54,576][25689] Avg episode reward: [(0, '-43.968')] [2022-07-10 00:11:55,240][26022] Updated weights on worker 0-0, policy_version 482072 (0.00096) [2022-07-10 00:11:56,869][26022] Updated weights on worker 0-0, policy_version 482082 (0.00093) [2022-07-10 00:11:58,628][26022] Updated weights on worker 0-0, policy_version 482092 (0.00090) [2022-07-10 00:11:59,596][25689] Fps is (10 sec: 5699.3, 60 sec: 5679.5, 300 sec: 5657.9). Total num frames: 493667328. Throughput: 0: 5931.2. Samples: 493669410. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:11:59,597][25689] Avg episode reward: [(0, '-42.974')] [2022-07-10 00:12:00,404][26022] Updated weights on worker 0-0, policy_version 482102 (0.00088) [2022-07-10 00:12:02,778][26022] Updated weights on worker 0-0, policy_version 482112 (0.00091) [2022-07-10 00:12:04,611][25689] Fps is (10 sec: 5408.0, 60 sec: 5629.0, 300 sec: 5652.5). Total num frames: 493691904. Throughput: 0: 5820.5. Samples: 493701122. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:04,611][25689] Avg episode reward: [(0, '-43.211')] [2022-07-10 00:12:04,657][26022] Updated weights on worker 0-0, policy_version 482122 (0.00094) [2022-07-10 00:12:06,355][26022] Updated weights on worker 0-0, policy_version 482132 (0.00094) [2022-07-10 00:12:08,083][26022] Updated weights on worker 0-0, policy_version 482142 (0.00090) [2022-07-10 00:12:09,628][25689] Fps is (10 sec: 5410.4, 60 sec: 5631.7, 300 sec: 5657.3). Total num frames: 493721600. Throughput: 0: 4953.1. Samples: 493717996. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:09,629][25689] Avg episode reward: [(0, '-43.596')] [2022-07-10 00:12:09,882][26022] Updated weights on worker 0-0, policy_version 482152 (0.00064) [2022-07-10 00:12:11,748][26022] Updated weights on worker 0-0, policy_version 482162 (0.00085) [2022-07-10 00:12:13,658][26022] Updated weights on worker 0-0, policy_version 482172 (0.00089) [2022-07-10 00:12:14,680][25689] Fps is (10 sec: 5796.6, 60 sec: 5648.1, 300 sec: 5653.0). Total num frames: 493750272. Throughput: 0: 5783.5. Samples: 493751698. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:14,681][25689] Avg episode reward: [(0, '-43.736')] [2022-07-10 00:12:15,495][26022] Updated weights on worker 0-0, policy_version 482182 (0.00086) [2022-07-10 00:12:17,135][26022] Updated weights on worker 0-0, policy_version 482192 (0.00094) [2022-07-10 00:12:19,291][26022] Updated weights on worker 0-0, policy_version 482202 (0.00090) [2022-07-10 00:12:19,691][25689] Fps is (10 sec: 5596.7, 60 sec: 5615.2, 300 sec: 5652.9). Total num frames: 493777920. Throughput: 0: 5800.9. Samples: 493785974. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:19,691][25689] Avg episode reward: [(0, '-43.915')] [2022-07-10 00:12:20,640][26022] Updated weights on worker 0-0, policy_version 482212 (0.00087) [2022-07-10 00:12:22,626][26022] Updated weights on worker 0-0, policy_version 482222 (0.00085) [2022-07-10 00:12:24,332][26022] Updated weights on worker 0-0, policy_version 482232 (0.00088) [2022-07-10 00:12:24,697][25689] Fps is (10 sec: 5622.2, 60 sec: 5634.8, 300 sec: 5653.1). Total num frames: 493806592. Throughput: 0: 5083.6. Samples: 493803234. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:24,698][25689] Avg episode reward: [(0, '-44.304')] [2022-07-10 00:12:25,943][26022] Updated weights on worker 0-0, policy_version 482242 (0.00089) [2022-07-10 00:12:27,978][26022] Updated weights on worker 0-0, policy_version 482252 (0.00093) [2022-07-10 00:12:29,667][26022] Updated weights on worker 0-0, policy_version 482262 (0.00081) [2022-07-10 00:12:29,750][25689] Fps is (10 sec: 5802.5, 60 sec: 5647.6, 300 sec: 5653.2). Total num frames: 493836288. Throughput: 0: 5956.0. Samples: 493837844. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:29,750][25689] Avg episode reward: [(0, '-44.501')] [2022-07-10 00:12:31,428][26022] Updated weights on worker 0-0, policy_version 482272 (0.00090) [2022-07-10 00:12:33,518][26022] Updated weights on worker 0-0, policy_version 482282 (0.00096) [2022-07-10 00:12:34,853][25689] Fps is (10 sec: 5747.2, 60 sec: 5645.4, 300 sec: 5651.8). Total num frames: 493864960. Throughput: 0: 5967.9. Samples: 493872090. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:34,854][25689] Avg episode reward: [(0, '-43.519')] [2022-07-10 00:12:34,948][26022] Updated weights on worker 0-0, policy_version 482292 (0.00092) [2022-07-10 00:12:37,023][26022] Updated weights on worker 0-0, policy_version 482302 (0.00098) [2022-07-10 00:12:38,735][26022] Updated weights on worker 0-0, policy_version 482312 (0.00081) [2022-07-10 00:12:39,856][25689] Fps is (10 sec: 5674.3, 60 sec: 5646.7, 300 sec: 5651.8). Total num frames: 493893632. Throughput: 0: 5129.1. Samples: 493889402. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:39,856][25689] Avg episode reward: [(0, '-43.494')] [2022-07-10 00:12:40,303][26022] Updated weights on worker 0-0, policy_version 482322 (0.00088) [2022-07-10 00:12:42,371][26022] Updated weights on worker 0-0, policy_version 482332 (0.00060) [2022-07-10 00:12:43,826][26022] Updated weights on worker 0-0, policy_version 482342 (0.00052) [2022-07-10 00:12:44,879][25689] Fps is (10 sec: 5719.6, 60 sec: 5648.5, 300 sec: 5658.9). Total num frames: 493922304. Throughput: 0: 5996.1. Samples: 493924246. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:44,880][25689] Avg episode reward: [(0, '-42.829')] [2022-07-10 00:12:45,820][26022] Updated weights on worker 0-0, policy_version 482352 (0.00084) [2022-07-10 00:12:47,587][26022] Updated weights on worker 0-0, policy_version 482362 (0.00087) [2022-07-10 00:12:49,312][26022] Updated weights on worker 0-0, policy_version 482372 (0.00091) [2022-07-10 00:12:49,948][25689] Fps is (10 sec: 5783.4, 60 sec: 5661.3, 300 sec: 5658.3). Total num frames: 493952000. Throughput: 0: 5984.6. Samples: 493958722. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:49,949][25689] Avg episode reward: [(0, '-42.509')] [2022-07-10 00:12:50,996][26022] Updated weights on worker 0-0, policy_version 482382 (0.00091) [2022-07-10 00:12:52,779][26022] Updated weights on worker 0-0, policy_version 482392 (0.00089) [2022-07-10 00:12:54,636][26022] Updated weights on worker 0-0, policy_version 482402 (0.00618) [2022-07-10 00:12:55,023][25689] Fps is (10 sec: 5855.2, 60 sec: 5691.9, 300 sec: 5657.0). Total num frames: 493981696. Throughput: 0: 5142.3. Samples: 493975808. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:12:55,023][25689] Avg episode reward: [(0, '-43.198')] [2022-07-10 00:12:56,674][26022] Updated weights on worker 0-0, policy_version 482412 (0.00091) [2022-07-10 00:12:58,027][26022] Updated weights on worker 0-0, policy_version 482422 (0.00050) [2022-07-10 00:13:00,057][25689] Fps is (10 sec: 5571.5, 60 sec: 5639.9, 300 sec: 5663.3). Total num frames: 494008320. Throughput: 0: 5978.8. Samples: 494010180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:00,058][25689] Avg episode reward: [(0, '-43.407')] [2022-07-10 00:13:00,268][26022] Updated weights on worker 0-0, policy_version 482432 (0.00089) [2022-07-10 00:13:01,871][26022] Updated weights on worker 0-0, policy_version 482442 (0.00084) [2022-07-10 00:13:04,117][26022] Updated weights on worker 0-0, policy_version 482452 (0.00081) [2022-07-10 00:13:05,063][25689] Fps is (10 sec: 5507.4, 60 sec: 5708.4, 300 sec: 5663.4). Total num frames: 494036992. Throughput: 0: 5857.6. Samples: 494042474. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:05,063][25689] Avg episode reward: [(0, '-43.719')] [2022-07-10 00:13:05,799][26022] Updated weights on worker 0-0, policy_version 482462 (0.00085) [2022-07-10 00:13:07,595][26022] Updated weights on worker 0-0, policy_version 482472 (0.00090) [2022-07-10 00:13:09,382][26022] Updated weights on worker 0-0, policy_version 482482 (0.00085) [2022-07-10 00:13:10,090][25689] Fps is (10 sec: 5613.5, 60 sec: 5673.6, 300 sec: 5660.6). Total num frames: 494064640. Throughput: 0: 5012.0. Samples: 494059670. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:10,091][25689] Avg episode reward: [(0, '-43.920')] [2022-07-10 00:13:11,062][26022] Updated weights on worker 0-0, policy_version 482492 (0.00086) [2022-07-10 00:13:12,872][26022] Updated weights on worker 0-0, policy_version 482502 (0.00084) [2022-07-10 00:13:14,871][26022] Updated weights on worker 0-0, policy_version 482512 (0.00090) [2022-07-10 00:13:15,149][25689] Fps is (10 sec: 5685.3, 60 sec: 5689.9, 300 sec: 5660.1). Total num frames: 494094336. Throughput: 0: 5880.0. Samples: 494094152. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:15,150][25689] Avg episode reward: [(0, '-43.997')] [2022-07-10 00:13:16,407][26022] Updated weights on worker 0-0, policy_version 482522 (0.00078) [2022-07-10 00:13:18,381][26022] Updated weights on worker 0-0, policy_version 482532 (0.00093) [2022-07-10 00:13:20,228][25689] Fps is (10 sec: 5656.1, 60 sec: 5683.5, 300 sec: 5658.8). Total num frames: 494121984. Throughput: 0: 5851.2. Samples: 494128208. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:20,229][25689] Avg episode reward: [(0, '-43.595')] [2022-07-10 00:13:20,398][26022] Updated weights on worker 0-0, policy_version 482542 (0.00086) [2022-07-10 00:13:21,812][26022] Updated weights on worker 0-0, policy_version 482552 (0.00091) [2022-07-10 00:13:23,793][26022] Updated weights on worker 0-0, policy_version 482562 (0.00079) [2022-07-10 00:13:25,259][25689] Fps is (10 sec: 5773.6, 60 sec: 5715.0, 300 sec: 5665.6). Total num frames: 494152704. Throughput: 0: 5090.8. Samples: 494145288. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:25,259][25689] Avg episode reward: [(0, '-43.638')] [2022-07-10 00:13:25,615][26022] Updated weights on worker 0-0, policy_version 482572 (0.00087) [2022-07-10 00:13:27,460][26022] Updated weights on worker 0-0, policy_version 482582 (0.00093) [2022-07-10 00:13:29,319][26022] Updated weights on worker 0-0, policy_version 482592 (0.00096) [2022-07-10 00:13:30,299][25689] Fps is (10 sec: 5592.3, 60 sec: 5648.6, 300 sec: 5653.1). Total num frames: 494178304. Throughput: 0: 5904.1. Samples: 494178990. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 00:13:30,300][25689] Avg episode reward: [(0, '-43.884')] [2022-07-10 00:13:30,944][26022] Updated weights on worker 0-0, policy_version 482602 (0.00084) [2022-07-10 00:13:32,963][26022] Updated weights on worker 0-0, policy_version 482612 (0.00544) [2022-07-10 00:13:34,770][26022] Updated weights on worker 0-0, policy_version 482622 (0.00446) [2022-07-10 00:13:35,360][25689] Fps is (10 sec: 5575.7, 60 sec: 5686.4, 300 sec: 5662.8). Total num frames: 494209024. Throughput: 0: 5875.4. Samples: 494212898. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:13:35,360][25689] Avg episode reward: [(0, '-43.444')] [2022-07-10 00:13:36,579][26022] Updated weights on worker 0-0, policy_version 482632 (0.00082) [2022-07-10 00:13:38,295][26022] Updated weights on worker 0-0, policy_version 482642 (0.00085) [2022-07-10 00:13:40,158][26022] Updated weights on worker 0-0, policy_version 482652 (0.00082) [2022-07-10 00:13:40,392][25689] Fps is (10 sec: 5884.7, 60 sec: 5683.6, 300 sec: 5662.5). Total num frames: 494237696. Throughput: 0: 5052.2. Samples: 494230080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:13:40,392][25689] Avg episode reward: [(0, '-43.544')] [2022-07-10 00:13:41,938][26022] Updated weights on worker 0-0, policy_version 482662 (0.00095) [2022-07-10 00:13:43,405][26022] Updated weights on worker 0-0, policy_version 482672 (0.00087) [2022-07-10 00:13:43,677][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:13:43,689][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000482673_494257152.pth [2022-07-10 00:13:43,689][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000480681_492217344.pth [2022-07-10 00:13:45,423][25689] Fps is (10 sec: 5596.4, 60 sec: 5665.9, 300 sec: 5659.0). Total num frames: 494265344. Throughput: 0: 5912.0. Samples: 494264502. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:13:45,424][25689] Avg episode reward: [(0, '-43.570')] [2022-07-10 00:13:45,595][26022] Updated weights on worker 0-0, policy_version 482682 (0.00085) [2022-07-10 00:13:47,123][26022] Updated weights on worker 0-0, policy_version 482692 (0.00086) [2022-07-10 00:13:49,108][26022] Updated weights on worker 0-0, policy_version 482702 (0.00113) [2022-07-10 00:13:50,446][25689] Fps is (10 sec: 5703.7, 60 sec: 5670.3, 300 sec: 5666.2). Total num frames: 494295040. Throughput: 0: 5946.7. Samples: 494298796. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:13:50,446][25689] Avg episode reward: [(0, '-43.616')] [2022-07-10 00:13:50,815][26022] Updated weights on worker 0-0, policy_version 482712 (0.00483) [2022-07-10 00:13:52,679][26022] Updated weights on worker 0-0, policy_version 482722 (0.00106) [2022-07-10 00:13:54,504][26022] Updated weights on worker 0-0, policy_version 482732 (0.00085) [2022-07-10 00:13:55,502][25689] Fps is (10 sec: 5690.0, 60 sec: 5638.2, 300 sec: 5663.2). Total num frames: 494322688. Throughput: 0: 5096.6. Samples: 494315552. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:13:55,502][25689] Avg episode reward: [(0, '-44.110')] [2022-07-10 00:13:56,185][26022] Updated weights on worker 0-0, policy_version 482742 (0.00083) [2022-07-10 00:13:58,119][26022] Updated weights on worker 0-0, policy_version 482752 (0.00084) [2022-07-10 00:14:00,024][26022] Updated weights on worker 0-0, policy_version 482762 (0.00081) [2022-07-10 00:14:00,508][25689] Fps is (10 sec: 5495.4, 60 sec: 5657.7, 300 sec: 5659.9). Total num frames: 494350336. Throughput: 0: 5965.7. Samples: 494350086. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:00,508][25689] Avg episode reward: [(0, '-44.045')] [2022-07-10 00:14:01,604][26022] Updated weights on worker 0-0, policy_version 482772 (0.00085) [2022-07-10 00:14:03,928][26022] Updated weights on worker 0-0, policy_version 482782 (0.00089) [2022-07-10 00:14:05,496][26022] Updated weights on worker 0-0, policy_version 482792 (0.00056) [2022-07-10 00:14:05,511][25689] Fps is (10 sec: 5626.9, 60 sec: 5658.0, 300 sec: 5667.0). Total num frames: 494379008. Throughput: 0: 5870.4. Samples: 494382422. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:05,511][25689] Avg episode reward: [(0, '-44.214')] [2022-07-10 00:14:07,427][26022] Updated weights on worker 0-0, policy_version 482802 (0.00086) [2022-07-10 00:14:09,239][26022] Updated weights on worker 0-0, policy_version 482812 (0.00092) [2022-07-10 00:14:10,519][25689] Fps is (10 sec: 5625.9, 60 sec: 5659.8, 300 sec: 5658.1). Total num frames: 494406656. Throughput: 0: 5023.4. Samples: 494399630. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:10,519][25689] Avg episode reward: [(0, '-44.807')] [2022-07-10 00:14:10,887][26022] Updated weights on worker 0-0, policy_version 482822 (0.00064) [2022-07-10 00:14:12,793][26022] Updated weights on worker 0-0, policy_version 482832 (0.00090) [2022-07-10 00:14:14,369][26022] Updated weights on worker 0-0, policy_version 482842 (0.00092) [2022-07-10 00:14:15,597][25689] Fps is (10 sec: 5583.5, 60 sec: 5641.1, 300 sec: 5664.4). Total num frames: 494435328. Throughput: 0: 5895.9. Samples: 494434036. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:15,598][25689] Avg episode reward: [(0, '-44.369')] [2022-07-10 00:14:16,210][26022] Updated weights on worker 0-0, policy_version 482852 (0.00094) [2022-07-10 00:14:18,058][26022] Updated weights on worker 0-0, policy_version 482862 (0.00086) [2022-07-10 00:14:19,959][26022] Updated weights on worker 0-0, policy_version 482872 (0.00087) [2022-07-10 00:14:20,619][25689] Fps is (10 sec: 5778.9, 60 sec: 5680.3, 300 sec: 5664.3). Total num frames: 494465024. Throughput: 0: 5893.8. Samples: 494468616. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:20,619][25689] Avg episode reward: [(0, '-43.968')] [2022-07-10 00:14:21,700][26022] Updated weights on worker 0-0, policy_version 482882 (0.00089) [2022-07-10 00:14:23,480][26022] Updated weights on worker 0-0, policy_version 482892 (0.00090) [2022-07-10 00:14:25,381][26022] Updated weights on worker 0-0, policy_version 482902 (0.00092) [2022-07-10 00:14:25,647][25689] Fps is (10 sec: 5706.3, 60 sec: 5629.7, 300 sec: 5664.1). Total num frames: 494492672. Throughput: 0: 5977.3. Samples: 494502780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:25,647][25689] Avg episode reward: [(0, '-44.842')] [2022-07-10 00:14:27,072][26022] Updated weights on worker 0-0, policy_version 482912 (0.00095) [2022-07-10 00:14:29,030][26022] Updated weights on worker 0-0, policy_version 482922 (0.00099) [2022-07-10 00:14:30,665][25689] Fps is (10 sec: 5606.0, 60 sec: 5682.6, 300 sec: 5665.2). Total num frames: 494521344. Throughput: 0: 5950.5. Samples: 494519512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:30,666][25689] Avg episode reward: [(0, '-44.736')] [2022-07-10 00:14:30,787][26022] Updated weights on worker 0-0, policy_version 482932 (0.00087) [2022-07-10 00:14:32,383][26022] Updated weights on worker 0-0, policy_version 482942 (0.00094) [2022-07-10 00:14:34,458][26022] Updated weights on worker 0-0, policy_version 482952 (0.00093) [2022-07-10 00:14:35,695][25689] Fps is (10 sec: 5808.5, 60 sec: 5668.6, 300 sec: 5662.7). Total num frames: 494551040. Throughput: 0: 5955.2. Samples: 494553722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:35,696][25689] Avg episode reward: [(0, '-44.816')] [2022-07-10 00:14:35,948][26022] Updated weights on worker 0-0, policy_version 482962 (0.00093) [2022-07-10 00:14:38,094][26022] Updated weights on worker 0-0, policy_version 482972 (0.00084) [2022-07-10 00:14:39,585][26022] Updated weights on worker 0-0, policy_version 482982 (0.00090) [2022-07-10 00:14:40,721][25689] Fps is (10 sec: 5601.1, 60 sec: 5635.3, 300 sec: 5656.7). Total num frames: 494577664. Throughput: 0: 5942.4. Samples: 494588068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:40,721][25689] Avg episode reward: [(0, '-44.718')] [2022-07-10 00:14:41,489][26022] Updated weights on worker 0-0, policy_version 482992 (0.00388) [2022-07-10 00:14:43,159][26022] Updated weights on worker 0-0, policy_version 483002 (0.00081) [2022-07-10 00:14:45,178][26022] Updated weights on worker 0-0, policy_version 483012 (0.00084) [2022-07-10 00:14:45,771][25689] Fps is (10 sec: 5589.9, 60 sec: 5667.5, 300 sec: 5660.4). Total num frames: 494607360. Throughput: 0: 5092.0. Samples: 494605252. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:45,771][25689] Avg episode reward: [(0, '-44.748')] [2022-07-10 00:14:46,721][26022] Updated weights on worker 0-0, policy_version 483022 (0.00087) [2022-07-10 00:14:48,840][26022] Updated weights on worker 0-0, policy_version 483032 (0.00084) [2022-07-10 00:14:50,372][26022] Updated weights on worker 0-0, policy_version 483042 (0.00088) [2022-07-10 00:14:50,777][25689] Fps is (10 sec: 5803.8, 60 sec: 5651.9, 300 sec: 5659.0). Total num frames: 494636032. Throughput: 0: 5969.0. Samples: 494639562. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:50,778][25689] Avg episode reward: [(0, '-44.283')] [2022-07-10 00:14:52,291][26022] Updated weights on worker 0-0, policy_version 483052 (0.00085) [2022-07-10 00:14:53,979][26022] Updated weights on worker 0-0, policy_version 483062 (0.00089) [2022-07-10 00:14:55,840][25689] Fps is (10 sec: 5796.5, 60 sec: 5685.2, 300 sec: 5662.1). Total num frames: 494665728. Throughput: 0: 5955.0. Samples: 494673684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:14:55,841][25689] Avg episode reward: [(0, '-43.524')] [2022-07-10 00:14:55,843][26022] Updated weights on worker 0-0, policy_version 483072 (0.00084) [2022-07-10 00:14:57,710][26022] Updated weights on worker 0-0, policy_version 483082 (0.00086) [2022-07-10 00:14:59,595][26022] Updated weights on worker 0-0, policy_version 483092 (0.00104) [2022-07-10 00:15:00,903][25689] Fps is (10 sec: 5663.4, 60 sec: 5679.9, 300 sec: 5664.7). Total num frames: 494693376. Throughput: 0: 5085.1. Samples: 494690698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:00,903][25689] Avg episode reward: [(0, '-43.680')] [2022-07-10 00:15:01,244][26022] Updated weights on worker 0-0, policy_version 483102 (0.00094) [2022-07-10 00:15:03,556][26022] Updated weights on worker 0-0, policy_version 483112 (0.00085) [2022-07-10 00:15:05,167][26022] Updated weights on worker 0-0, policy_version 483122 (0.00092) [2022-07-10 00:15:05,907][25689] Fps is (10 sec: 5289.6, 60 sec: 5629.0, 300 sec: 5658.3). Total num frames: 494718976. Throughput: 0: 5852.8. Samples: 494723104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:05,907][25689] Avg episode reward: [(0, '-42.749')] [2022-07-10 00:15:07,050][26022] Updated weights on worker 0-0, policy_version 483132 (0.00091) [2022-07-10 00:15:08,832][26022] Updated weights on worker 0-0, policy_version 483142 (0.00089) [2022-07-10 00:15:10,610][26022] Updated weights on worker 0-0, policy_version 483152 (0.00103) [2022-07-10 00:15:10,923][25689] Fps is (10 sec: 5620.6, 60 sec: 5679.1, 300 sec: 5662.8). Total num frames: 494749696. Throughput: 0: 5847.8. Samples: 494757370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:10,924][25689] Avg episode reward: [(0, '-42.652')] [2022-07-10 00:15:12,472][26022] Updated weights on worker 0-0, policy_version 483162 (0.00204) [2022-07-10 00:15:14,407][26022] Updated weights on worker 0-0, policy_version 483172 (0.00084) [2022-07-10 00:15:15,986][25689] Fps is (10 sec: 5790.7, 60 sec: 5663.5, 300 sec: 5658.8). Total num frames: 494777344. Throughput: 0: 4992.9. Samples: 494774272. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:15,987][25689] Avg episode reward: [(0, '-43.395')] [2022-07-10 00:15:16,125][26022] Updated weights on worker 0-0, policy_version 483182 (0.00090) [2022-07-10 00:15:17,945][26022] Updated weights on worker 0-0, policy_version 483192 (0.00095) [2022-07-10 00:15:19,563][26022] Updated weights on worker 0-0, policy_version 483202 (0.00091) [2022-07-10 00:15:20,993][25689] Fps is (10 sec: 5592.9, 60 sec: 5648.0, 300 sec: 5659.6). Total num frames: 494806016. Throughput: 0: 5857.9. Samples: 494808384. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:20,995][25689] Avg episode reward: [(0, '-43.902')] [2022-07-10 00:15:21,570][26022] Updated weights on worker 0-0, policy_version 483212 (0.00084) [2022-07-10 00:15:23,121][26022] Updated weights on worker 0-0, policy_version 483222 (0.00088) [2022-07-10 00:15:24,998][26022] Updated weights on worker 0-0, policy_version 483232 (0.00090) [2022-07-10 00:15:26,033][25689] Fps is (10 sec: 5809.7, 60 sec: 5680.7, 300 sec: 5659.2). Total num frames: 494835712. Throughput: 0: 5951.0. Samples: 494842876. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:26,034][25689] Avg episode reward: [(0, '-43.741')] [2022-07-10 00:15:26,676][26022] Updated weights on worker 0-0, policy_version 483242 (0.00088) [2022-07-10 00:15:28,563][26022] Updated weights on worker 0-0, policy_version 483252 (0.00084) [2022-07-10 00:15:30,288][26022] Updated weights on worker 0-0, policy_version 483262 (0.00088) [2022-07-10 00:15:31,035][25689] Fps is (10 sec: 5710.2, 60 sec: 5665.3, 300 sec: 5660.3). Total num frames: 494863360. Throughput: 0: 5094.7. Samples: 494859836. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:31,036][25689] Avg episode reward: [(0, '-44.560')] [2022-07-10 00:15:32,244][26022] Updated weights on worker 0-0, policy_version 483272 (0.00604) [2022-07-10 00:15:34,074][26022] Updated weights on worker 0-0, policy_version 483282 (0.00091) [2022-07-10 00:15:35,912][26022] Updated weights on worker 0-0, policy_version 483292 (0.00085) [2022-07-10 00:15:36,134][25689] Fps is (10 sec: 5474.2, 60 sec: 5625.0, 300 sec: 5652.2). Total num frames: 494891008. Throughput: 0: 5938.4. Samples: 494893918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:36,135][25689] Avg episode reward: [(0, '-43.967')] [2022-07-10 00:15:37,433][26022] Updated weights on worker 0-0, policy_version 483302 (0.00082) [2022-07-10 00:15:39,660][26022] Updated weights on worker 0-0, policy_version 483312 (0.00086) [2022-07-10 00:15:41,147][25689] Fps is (10 sec: 5671.3, 60 sec: 5677.0, 300 sec: 5659.5). Total num frames: 494920704. Throughput: 0: 5937.7. Samples: 494928050. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:41,147][25689] Avg episode reward: [(0, '-44.431')] [2022-07-10 00:15:41,188][26022] Updated weights on worker 0-0, policy_version 483322 (0.00087) [2022-07-10 00:15:43,180][26022] Updated weights on worker 0-0, policy_version 483332 (0.00087) [2022-07-10 00:15:43,719][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:15:43,732][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000483336_494936064.pth [2022-07-10 00:15:43,733][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000481344_492896256.pth [2022-07-10 00:15:44,923][26022] Updated weights on worker 0-0, policy_version 483342 (0.00089) [2022-07-10 00:15:46,188][25689] Fps is (10 sec: 5703.9, 60 sec: 5644.0, 300 sec: 5659.8). Total num frames: 494948352. Throughput: 0: 5072.8. Samples: 494945118. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:46,188][25689] Avg episode reward: [(0, '-43.892')] [2022-07-10 00:15:46,938][26022] Updated weights on worker 0-0, policy_version 483352 (0.00088) [2022-07-10 00:15:48,489][26022] Updated weights on worker 0-0, policy_version 483362 (0.00084) [2022-07-10 00:15:50,504][26022] Updated weights on worker 0-0, policy_version 483372 (0.00088) [2022-07-10 00:15:51,212][25689] Fps is (10 sec: 5798.7, 60 sec: 5676.2, 300 sec: 5664.2). Total num frames: 494979072. Throughput: 0: 5922.3. Samples: 494979330. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:51,213][25689] Avg episode reward: [(0, '-43.807')] [2022-07-10 00:15:52,135][26022] Updated weights on worker 0-0, policy_version 483382 (0.00092) [2022-07-10 00:15:54,007][26022] Updated weights on worker 0-0, policy_version 483392 (0.00077) [2022-07-10 00:15:55,701][26022] Updated weights on worker 0-0, policy_version 483402 (0.00099) [2022-07-10 00:15:56,311][25689] Fps is (10 sec: 5664.8, 60 sec: 5622.1, 300 sec: 5659.1). Total num frames: 495005696. Throughput: 0: 5913.3. Samples: 495013226. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:15:56,311][25689] Avg episode reward: [(0, '-44.305')] [2022-07-10 00:15:57,545][26022] Updated weights on worker 0-0, policy_version 483412 (0.00092) [2022-07-10 00:15:59,405][26022] Updated weights on worker 0-0, policy_version 483422 (0.00091) [2022-07-10 00:16:01,288][26022] Updated weights on worker 0-0, policy_version 483432 (0.00084) [2022-07-10 00:16:01,334][25689] Fps is (10 sec: 5462.9, 60 sec: 5642.6, 300 sec: 5662.5). Total num frames: 495034368. Throughput: 0: 5052.1. Samples: 495030040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:16:01,335][25689] Avg episode reward: [(0, '-44.195')] [2022-07-10 00:16:03,352][26022] Updated weights on worker 0-0, policy_version 483442 (0.00082) [2022-07-10 00:16:05,243][26022] Updated weights on worker 0-0, policy_version 483452 (0.00062) [2022-07-10 00:16:06,347][25689] Fps is (10 sec: 5305.6, 60 sec: 5624.9, 300 sec: 5645.9). Total num frames: 495058944. Throughput: 0: 5780.3. Samples: 495061642. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:16:06,347][25689] Avg episode reward: [(0, '-45.527')] [2022-07-10 00:16:07,030][26022] Updated weights on worker 0-0, policy_version 483462 (0.00094) [2022-07-10 00:16:08,883][26022] Updated weights on worker 0-0, policy_version 483472 (0.00089) [2022-07-10 00:16:10,764][26022] Updated weights on worker 0-0, policy_version 483482 (0.00091) [2022-07-10 00:16:11,355][25689] Fps is (10 sec: 5415.8, 60 sec: 5608.7, 300 sec: 5653.5). Total num frames: 495088640. Throughput: 0: 5751.1. Samples: 495095176. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:16:11,356][25689] Avg episode reward: [(0, '-44.547')] [2022-07-10 00:16:12,583][26022] Updated weights on worker 0-0, policy_version 483492 (0.00085) [2022-07-10 00:16:14,321][26022] Updated weights on worker 0-0, policy_version 483502 (0.00093) [2022-07-10 00:16:16,116][26022] Updated weights on worker 0-0, policy_version 483512 (0.00086) [2022-07-10 00:16:16,467][25689] Fps is (10 sec: 5767.5, 60 sec: 5621.1, 300 sec: 5648.3). Total num frames: 495117312. Throughput: 0: 4909.2. Samples: 495112178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:16:16,467][25689] Avg episode reward: [(0, '-45.033')] [2022-07-10 00:16:18,033][26022] Updated weights on worker 0-0, policy_version 483522 (0.00082) [2022-07-10 00:16:19,727][26022] Updated weights on worker 0-0, policy_version 483532 (0.00086) [2022-07-10 00:16:21,483][25689] Fps is (10 sec: 5662.4, 60 sec: 5620.3, 300 sec: 5652.1). Total num frames: 495145984. Throughput: 0: 5785.5. Samples: 495146608. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:16:21,483][25689] Avg episode reward: [(0, '-44.448')] [2022-07-10 00:16:21,531][26022] Updated weights on worker 0-0, policy_version 483542 (0.00087) [2022-07-10 00:16:23,320][26022] Updated weights on worker 0-0, policy_version 483552 (0.00083) [2022-07-10 00:16:25,077][26022] Updated weights on worker 0-0, policy_version 483562 (0.00100) [2022-07-10 00:16:26,491][25689] Fps is (10 sec: 5618.6, 60 sec: 5589.4, 300 sec: 5648.7). Total num frames: 495173632. Throughput: 0: 5913.8. Samples: 495180770. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:26,491][25689] Avg episode reward: [(0, '-44.969')] [2022-07-10 00:16:27,006][26022] Updated weights on worker 0-0, policy_version 483572 (0.00087) [2022-07-10 00:16:28,752][26022] Updated weights on worker 0-0, policy_version 483582 (0.00095) [2022-07-10 00:16:30,573][26022] Updated weights on worker 0-0, policy_version 483592 (0.00084) [2022-07-10 00:16:31,545][25689] Fps is (10 sec: 5698.6, 60 sec: 5618.4, 300 sec: 5652.6). Total num frames: 495203328. Throughput: 0: 5077.4. Samples: 495197688. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:31,546][25689] Avg episode reward: [(0, '-44.621')] [2022-07-10 00:16:32,242][26022] Updated weights on worker 0-0, policy_version 483602 (0.00080) [2022-07-10 00:16:34,150][26022] Updated weights on worker 0-0, policy_version 483612 (0.00089) [2022-07-10 00:16:35,905][26022] Updated weights on worker 0-0, policy_version 483622 (0.00089) [2022-07-10 00:16:36,649][25689] Fps is (10 sec: 5846.6, 60 sec: 5651.8, 300 sec: 5654.4). Total num frames: 495233024. Throughput: 0: 5945.1. Samples: 495232164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:36,650][25689] Avg episode reward: [(0, '-44.053')] [2022-07-10 00:16:37,850][26022] Updated weights on worker 0-0, policy_version 483632 (0.00092) [2022-07-10 00:16:39,322][26022] Updated weights on worker 0-0, policy_version 483642 (0.00093) [2022-07-10 00:16:41,297][26022] Updated weights on worker 0-0, policy_version 483652 (0.00091) [2022-07-10 00:16:41,652][25689] Fps is (10 sec: 5674.1, 60 sec: 5618.8, 300 sec: 5651.7). Total num frames: 495260672. Throughput: 0: 5945.8. Samples: 495266530. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:41,653][25689] Avg episode reward: [(0, '-44.297')] [2022-07-10 00:16:42,973][26022] Updated weights on worker 0-0, policy_version 483662 (0.00091) [2022-07-10 00:16:44,899][26022] Updated weights on worker 0-0, policy_version 483672 (0.00089) [2022-07-10 00:16:46,687][25689] Fps is (10 sec: 5610.9, 60 sec: 5636.3, 300 sec: 5651.5). Total num frames: 495289344. Throughput: 0: 5098.1. Samples: 495283730. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:46,687][25689] Avg episode reward: [(0, '-44.202')] [2022-07-10 00:16:46,751][26022] Updated weights on worker 0-0, policy_version 483682 (0.00081) [2022-07-10 00:16:48,497][26022] Updated weights on worker 0-0, policy_version 483692 (0.00085) [2022-07-10 00:16:50,171][26022] Updated weights on worker 0-0, policy_version 483702 (0.00299) [2022-07-10 00:16:51,690][25689] Fps is (10 sec: 5712.3, 60 sec: 5604.4, 300 sec: 5655.7). Total num frames: 495318016. Throughput: 0: 5989.0. Samples: 495318336. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:51,695][25689] Avg episode reward: [(0, '-44.810')] [2022-07-10 00:16:52,015][26022] Updated weights on worker 0-0, policy_version 483712 (0.00088) [2022-07-10 00:16:53,733][26022] Updated weights on worker 0-0, policy_version 483722 (0.00089) [2022-07-10 00:16:55,566][26022] Updated weights on worker 0-0, policy_version 483732 (0.00078) [2022-07-10 00:16:56,771][25689] Fps is (10 sec: 5788.1, 60 sec: 5656.9, 300 sec: 5654.5). Total num frames: 495347712. Throughput: 0: 6003.1. Samples: 495352956. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:16:56,772][25689] Avg episode reward: [(0, '-44.346')] [2022-07-10 00:16:57,309][26022] Updated weights on worker 0-0, policy_version 483742 (0.00396) [2022-07-10 00:16:59,135][26022] Updated weights on worker 0-0, policy_version 483752 (0.00626) [2022-07-10 00:17:01,004][26022] Updated weights on worker 0-0, policy_version 483762 (0.00089) [2022-07-10 00:17:01,826][25689] Fps is (10 sec: 5859.8, 60 sec: 5670.9, 300 sec: 5671.0). Total num frames: 495377408. Throughput: 0: 5132.2. Samples: 495370068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:01,826][25689] Avg episode reward: [(0, '-44.955')] [2022-07-10 00:17:03,083][26022] Updated weights on worker 0-0, policy_version 483772 (0.00084) [2022-07-10 00:17:04,856][26022] Updated weights on worker 0-0, policy_version 483782 (0.00094) [2022-07-10 00:17:06,831][25689] Fps is (10 sec: 5394.9, 60 sec: 5671.6, 300 sec: 5654.0). Total num frames: 495401984. Throughput: 0: 5884.8. Samples: 495402274. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:06,833][25689] Avg episode reward: [(0, '-44.635')] [2022-07-10 00:17:06,863][26022] Updated weights on worker 0-0, policy_version 483792 (0.00093) [2022-07-10 00:17:08,361][26022] Updated weights on worker 0-0, policy_version 483802 (0.00091) [2022-07-10 00:17:10,331][26022] Updated weights on worker 0-0, policy_version 483812 (0.00085) [2022-07-10 00:17:11,792][26022] Updated weights on worker 0-0, policy_version 483822 (0.00618) [2022-07-10 00:17:11,885][25689] Fps is (10 sec: 5599.0, 60 sec: 5701.2, 300 sec: 5664.3). Total num frames: 495433728. Throughput: 0: 5870.6. Samples: 495436890. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:11,886][25689] Avg episode reward: [(0, '-44.864')] [2022-07-10 00:17:13,717][26022] Updated weights on worker 0-0, policy_version 483832 (0.00080) [2022-07-10 00:17:15,616][26022] Updated weights on worker 0-0, policy_version 483842 (0.00091) [2022-07-10 00:17:16,946][25689] Fps is (10 sec: 5871.7, 60 sec: 5689.0, 300 sec: 5663.3). Total num frames: 495461376. Throughput: 0: 5024.5. Samples: 495454324. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:16,946][25689] Avg episode reward: [(0, '-43.764')] [2022-07-10 00:17:17,363][26022] Updated weights on worker 0-0, policy_version 483852 (0.00088) [2022-07-10 00:17:19,184][26022] Updated weights on worker 0-0, policy_version 483862 (0.01004) [2022-07-10 00:17:20,988][26022] Updated weights on worker 0-0, policy_version 483872 (0.00091) [2022-07-10 00:17:21,981][25689] Fps is (10 sec: 5578.3, 60 sec: 5687.1, 300 sec: 5662.8). Total num frames: 495490048. Throughput: 0: 5887.8. Samples: 495488738. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:21,982][25689] Avg episode reward: [(0, '-44.454')] [2022-07-10 00:17:22,671][26022] Updated weights on worker 0-0, policy_version 483882 (0.00094) [2022-07-10 00:17:24,582][26022] Updated weights on worker 0-0, policy_version 483892 (0.00093) [2022-07-10 00:17:26,250][26022] Updated weights on worker 0-0, policy_version 483902 (0.00087) [2022-07-10 00:17:27,008][25689] Fps is (10 sec: 5800.7, 60 sec: 5719.3, 300 sec: 5663.3). Total num frames: 495519744. Throughput: 0: 5981.6. Samples: 495522966. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:27,008][25689] Avg episode reward: [(0, '-45.571')] [2022-07-10 00:17:28,308][26022] Updated weights on worker 0-0, policy_version 483912 (0.00093) [2022-07-10 00:17:29,804][26022] Updated weights on worker 0-0, policy_version 483922 (0.00088) [2022-07-10 00:17:31,835][26022] Updated weights on worker 0-0, policy_version 483932 (0.00081) [2022-07-10 00:17:32,039][25689] Fps is (10 sec: 5701.6, 60 sec: 5687.6, 300 sec: 5661.2). Total num frames: 495547392. Throughput: 0: 5975.4. Samples: 495557316. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:32,039][25689] Avg episode reward: [(0, '-44.874')] [2022-07-10 00:17:33,383][26022] Updated weights on worker 0-0, policy_version 483942 (0.00089) [2022-07-10 00:17:35,374][26022] Updated weights on worker 0-0, policy_version 483952 (0.00087) [2022-07-10 00:17:37,086][25689] Fps is (10 sec: 5689.9, 60 sec: 5693.0, 300 sec: 5663.8). Total num frames: 495577088. Throughput: 0: 5964.5. Samples: 495574452. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:37,087][25689] Avg episode reward: [(0, '-44.691')] [2022-07-10 00:17:37,088][26022] Updated weights on worker 0-0, policy_version 483962 (0.00092) [2022-07-10 00:17:38,843][26022] Updated weights on worker 0-0, policy_version 483972 (0.00098) [2022-07-10 00:17:40,679][26022] Updated weights on worker 0-0, policy_version 483982 (0.00086) [2022-07-10 00:17:42,181][25689] Fps is (10 sec: 5754.7, 60 sec: 5701.1, 300 sec: 5662.4). Total num frames: 495605760. Throughput: 0: 5962.8. Samples: 495609188. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:42,182][25689] Avg episode reward: [(0, '-44.726')] [2022-07-10 00:17:42,358][26022] Updated weights on worker 0-0, policy_version 483992 (0.00089) [2022-07-10 00:17:43,803][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:17:43,818][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000484000_495616000.pth [2022-07-10 00:17:43,818][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000482008_493576192.pth [2022-07-10 00:17:44,043][26022] Updated weights on worker 0-0, policy_version 484002 (0.00084) [2022-07-10 00:17:45,962][26022] Updated weights on worker 0-0, policy_version 484012 (0.00084) [2022-07-10 00:17:47,254][25689] Fps is (10 sec: 5740.4, 60 sec: 5714.5, 300 sec: 5662.4). Total num frames: 495635456. Throughput: 0: 5976.4. Samples: 495643964. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:47,254][25689] Avg episode reward: [(0, '-45.281')] [2022-07-10 00:17:47,729][26022] Updated weights on worker 0-0, policy_version 484022 (0.00084) [2022-07-10 00:17:49,403][26022] Updated weights on worker 0-0, policy_version 484032 (0.00100) [2022-07-10 00:17:51,263][26022] Updated weights on worker 0-0, policy_version 484042 (0.00092) [2022-07-10 00:17:52,327][25689] Fps is (10 sec: 5854.0, 60 sec: 5724.9, 300 sec: 5662.4). Total num frames: 495665152. Throughput: 0: 5127.0. Samples: 495661334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:52,327][25689] Avg episode reward: [(0, '-45.283')] [2022-07-10 00:17:52,985][26022] Updated weights on worker 0-0, policy_version 484052 (0.00086) [2022-07-10 00:17:54,732][26022] Updated weights on worker 0-0, policy_version 484062 (0.00080) [2022-07-10 00:17:56,629][26022] Updated weights on worker 0-0, policy_version 484072 (0.00089) [2022-07-10 00:17:57,436][25689] Fps is (10 sec: 5732.5, 60 sec: 5705.3, 300 sec: 5667.9). Total num frames: 495693824. Throughput: 0: 5962.8. Samples: 495695794. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:17:57,436][25689] Avg episode reward: [(0, '-44.426')] [2022-07-10 00:17:58,255][26022] Updated weights on worker 0-0, policy_version 484082 (0.00083) [2022-07-10 00:18:00,183][26022] Updated weights on worker 0-0, policy_version 484092 (0.00088) [2022-07-10 00:18:02,403][26022] Updated weights on worker 0-0, policy_version 484102 (0.00087) [2022-07-10 00:18:02,477][25689] Fps is (10 sec: 5447.6, 60 sec: 5655.9, 300 sec: 5660.3). Total num frames: 495720448. Throughput: 0: 5862.3. Samples: 495728170. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:02,478][25689] Avg episode reward: [(0, '-44.351')] [2022-07-10 00:18:04,244][26022] Updated weights on worker 0-0, policy_version 484112 (0.00078) [2022-07-10 00:18:05,983][26022] Updated weights on worker 0-0, policy_version 484122 (0.00085) [2022-07-10 00:18:07,496][25689] Fps is (10 sec: 5394.8, 60 sec: 5705.3, 300 sec: 5660.5). Total num frames: 495748096. Throughput: 0: 4965.6. Samples: 495744478. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:07,497][25689] Avg episode reward: [(0, '-43.899')] [2022-07-10 00:18:08,078][26022] Updated weights on worker 0-0, policy_version 484132 (0.00088) [2022-07-10 00:18:09,409][26022] Updated weights on worker 0-0, policy_version 484142 (0.00088) [2022-07-10 00:18:11,455][26022] Updated weights on worker 0-0, policy_version 484152 (0.00083) [2022-07-10 00:18:12,500][25689] Fps is (10 sec: 5619.5, 60 sec: 5659.3, 300 sec: 5658.1). Total num frames: 495776768. Throughput: 0: 5816.6. Samples: 495778670. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:12,500][25689] Avg episode reward: [(0, '-43.812')] [2022-07-10 00:18:13,002][26022] Updated weights on worker 0-0, policy_version 484162 (0.00084) [2022-07-10 00:18:15,049][26022] Updated weights on worker 0-0, policy_version 484172 (0.00829) [2022-07-10 00:18:16,799][26022] Updated weights on worker 0-0, policy_version 484182 (0.00090) [2022-07-10 00:18:17,576][25689] Fps is (10 sec: 5790.5, 60 sec: 5691.7, 300 sec: 5665.0). Total num frames: 495806464. Throughput: 0: 5839.2. Samples: 495813396. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:17,577][25689] Avg episode reward: [(0, '-43.880')] [2022-07-10 00:18:18,529][26022] Updated weights on worker 0-0, policy_version 484192 (0.00086) [2022-07-10 00:18:20,206][26022] Updated weights on worker 0-0, policy_version 484202 (0.00086) [2022-07-10 00:18:22,355][26022] Updated weights on worker 0-0, policy_version 484212 (0.00090) [2022-07-10 00:18:22,585][25689] Fps is (10 sec: 5787.3, 60 sec: 5694.1, 300 sec: 5658.5). Total num frames: 495835136. Throughput: 0: 5096.4. Samples: 495830648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:22,586][25689] Avg episode reward: [(0, '-43.071')] [2022-07-10 00:18:23,640][26022] Updated weights on worker 0-0, policy_version 484222 (0.00090) [2022-07-10 00:18:25,886][26022] Updated weights on worker 0-0, policy_version 484232 (0.00088) [2022-07-10 00:18:27,307][26022] Updated weights on worker 0-0, policy_version 484242 (0.00090) [2022-07-10 00:18:27,644][25689] Fps is (10 sec: 5797.3, 60 sec: 5691.1, 300 sec: 5672.0). Total num frames: 495864832. Throughput: 0: 5975.7. Samples: 495864876. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:27,645][25689] Avg episode reward: [(0, '-42.648')] [2022-07-10 00:18:29,327][26022] Updated weights on worker 0-0, policy_version 484252 (0.00087) [2022-07-10 00:18:31,134][26022] Updated weights on worker 0-0, policy_version 484262 (0.00080) [2022-07-10 00:18:32,650][25689] Fps is (10 sec: 5595.7, 60 sec: 5676.5, 300 sec: 5659.2). Total num frames: 495891456. Throughput: 0: 5950.0. Samples: 495898566. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:32,651][25689] Avg episode reward: [(0, '-42.828')] [2022-07-10 00:18:33,100][26022] Updated weights on worker 0-0, policy_version 484272 (0.00082) [2022-07-10 00:18:34,986][26022] Updated weights on worker 0-0, policy_version 484282 (0.00091) [2022-07-10 00:18:36,571][26022] Updated weights on worker 0-0, policy_version 484292 (0.00100) [2022-07-10 00:18:37,755][25689] Fps is (10 sec: 5570.3, 60 sec: 5671.2, 300 sec: 5661.3). Total num frames: 495921152. Throughput: 0: 5063.3. Samples: 495915568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:37,755][25689] Avg episode reward: [(0, '-42.422')] [2022-07-10 00:18:38,390][26022] Updated weights on worker 0-0, policy_version 484302 (0.00096) [2022-07-10 00:18:40,148][26022] Updated weights on worker 0-0, policy_version 484312 (0.00088) [2022-07-10 00:18:42,024][26022] Updated weights on worker 0-0, policy_version 484322 (0.00092) [2022-07-10 00:18:42,796][25689] Fps is (10 sec: 5752.7, 60 sec: 5676.2, 300 sec: 5664.5). Total num frames: 495949824. Throughput: 0: 5895.7. Samples: 495949806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:42,797][25689] Avg episode reward: [(0, '-43.319')] [2022-07-10 00:18:43,786][26022] Updated weights on worker 0-0, policy_version 484332 (0.00089) [2022-07-10 00:18:45,574][26022] Updated weights on worker 0-0, policy_version 484342 (0.00050) [2022-07-10 00:18:47,359][26022] Updated weights on worker 0-0, policy_version 484352 (0.00085) [2022-07-10 00:18:47,831][25689] Fps is (10 sec: 5691.3, 60 sec: 5662.9, 300 sec: 5660.9). Total num frames: 495978496. Throughput: 0: 5912.3. Samples: 495984224. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:47,831][25689] Avg episode reward: [(0, '-43.648')] [2022-07-10 00:18:49,448][26022] Updated weights on worker 0-0, policy_version 484362 (0.00088) [2022-07-10 00:18:50,843][26022] Updated weights on worker 0-0, policy_version 484372 (0.00095) [2022-07-10 00:18:52,854][25689] Fps is (10 sec: 5497.8, 60 sec: 5616.8, 300 sec: 5658.0). Total num frames: 496005120. Throughput: 0: 5093.7. Samples: 496001480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:52,855][25689] Avg episode reward: [(0, '-44.479')] [2022-07-10 00:18:53,123][26022] Updated weights on worker 0-0, policy_version 484382 (0.00088) [2022-07-10 00:18:54,560][26022] Updated weights on worker 0-0, policy_version 484392 (0.00093) [2022-07-10 00:18:56,245][26022] Updated weights on worker 0-0, policy_version 484402 (0.00091) [2022-07-10 00:18:57,899][25689] Fps is (10 sec: 5593.7, 60 sec: 5639.7, 300 sec: 5664.2). Total num frames: 496034816. Throughput: 0: 5938.6. Samples: 496035196. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:18:57,900][25689] Avg episode reward: [(0, '-45.726')] [2022-07-10 00:18:58,344][26022] Updated weights on worker 0-0, policy_version 484412 (0.00093) [2022-07-10 00:19:00,004][26022] Updated weights on worker 0-0, policy_version 484422 (0.00089) [2022-07-10 00:19:02,385][26022] Updated weights on worker 0-0, policy_version 484432 (0.00097) [2022-07-10 00:19:02,903][25689] Fps is (10 sec: 5604.6, 60 sec: 5643.2, 300 sec: 5657.3). Total num frames: 496061440. Throughput: 0: 5837.8. Samples: 496067186. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:19:02,904][25689] Avg episode reward: [(0, '-45.840')] [2022-07-10 00:19:03,976][26022] Updated weights on worker 0-0, policy_version 484442 (0.00090) [2022-07-10 00:19:05,807][26022] Updated weights on worker 0-0, policy_version 484452 (0.00082) [2022-07-10 00:19:07,747][26022] Updated weights on worker 0-0, policy_version 484462 (0.00086) [2022-07-10 00:19:07,917][25689] Fps is (10 sec: 5519.6, 60 sec: 5660.5, 300 sec: 5660.6). Total num frames: 496090112. Throughput: 0: 4973.4. Samples: 496084122. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:19:07,918][25689] Avg episode reward: [(0, '-45.437')] [2022-07-10 00:19:09,351][26022] Updated weights on worker 0-0, policy_version 484472 (0.00084) [2022-07-10 00:19:11,170][26022] Updated weights on worker 0-0, policy_version 484482 (0.00090) [2022-07-10 00:19:12,922][25689] Fps is (10 sec: 5621.4, 60 sec: 5643.5, 300 sec: 5658.6). Total num frames: 496117760. Throughput: 0: 5847.0. Samples: 496118816. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 00:19:12,922][25689] Avg episode reward: [(0, '-45.075')] [2022-07-10 00:19:13,037][26022] Updated weights on worker 0-0, policy_version 484492 (0.00085) [2022-07-10 00:19:14,731][26022] Updated weights on worker 0-0, policy_version 484502 (0.00083) [2022-07-10 00:19:16,579][26022] Updated weights on worker 0-0, policy_version 484512 (0.00081) [2022-07-10 00:19:17,974][25689] Fps is (10 sec: 5803.7, 60 sec: 5662.7, 300 sec: 5661.4). Total num frames: 496148480. Throughput: 0: 5913.0. Samples: 496153900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:17,975][25689] Avg episode reward: [(0, '-43.826')] [2022-07-10 00:19:18,237][26022] Updated weights on worker 0-0, policy_version 484522 (0.00096) [2022-07-10 00:19:20,007][26022] Updated weights on worker 0-0, policy_version 484532 (0.00084) [2022-07-10 00:19:22,056][26022] Updated weights on worker 0-0, policy_version 484542 (0.00088) [2022-07-10 00:19:22,984][25689] Fps is (10 sec: 6004.2, 60 sec: 5679.6, 300 sec: 5668.7). Total num frames: 496178176. Throughput: 0: 5179.4. Samples: 496171194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:22,985][25689] Avg episode reward: [(0, '-43.789')] [2022-07-10 00:19:23,543][26022] Updated weights on worker 0-0, policy_version 484552 (0.00087) [2022-07-10 00:19:25,451][26022] Updated weights on worker 0-0, policy_version 484562 (0.00086) [2022-07-10 00:19:27,207][26022] Updated weights on worker 0-0, policy_version 484572 (0.00085) [2022-07-10 00:19:27,989][25689] Fps is (10 sec: 5623.6, 60 sec: 5633.7, 300 sec: 5662.0). Total num frames: 496204800. Throughput: 0: 6060.2. Samples: 496205762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:27,990][25689] Avg episode reward: [(0, '-42.489')] [2022-07-10 00:19:28,956][26022] Updated weights on worker 0-0, policy_version 484582 (0.00083) [2022-07-10 00:19:30,977][26022] Updated weights on worker 0-0, policy_version 484592 (0.00092) [2022-07-10 00:19:32,663][26022] Updated weights on worker 0-0, policy_version 484602 (0.00078) [2022-07-10 00:19:32,999][25689] Fps is (10 sec: 5623.9, 60 sec: 5684.3, 300 sec: 5662.4). Total num frames: 496234496. Throughput: 0: 6025.6. Samples: 496239790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:32,999][25689] Avg episode reward: [(0, '-42.710')] [2022-07-10 00:19:34,393][26022] Updated weights on worker 0-0, policy_version 484612 (0.00092) [2022-07-10 00:19:36,326][26022] Updated weights on worker 0-0, policy_version 484622 (0.00091) [2022-07-10 00:19:37,888][26022] Updated weights on worker 0-0, policy_version 484632 (0.00092) [2022-07-10 00:19:38,042][25689] Fps is (10 sec: 5806.3, 60 sec: 5673.2, 300 sec: 5669.0). Total num frames: 496263168. Throughput: 0: 5129.2. Samples: 496256830. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:38,042][25689] Avg episode reward: [(0, '-42.511')] [2022-07-10 00:19:39,922][26022] Updated weights on worker 0-0, policy_version 484642 (0.00089) [2022-07-10 00:19:41,630][26022] Updated weights on worker 0-0, policy_version 484652 (0.00092) [2022-07-10 00:19:43,059][25689] Fps is (10 sec: 5496.6, 60 sec: 5641.5, 300 sec: 5659.3). Total num frames: 496289792. Throughput: 0: 5956.7. Samples: 496290772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:43,059][25689] Avg episode reward: [(0, '-42.904')] [2022-07-10 00:19:43,524][26022] Updated weights on worker 0-0, policy_version 484662 (0.00087) [2022-07-10 00:19:44,042][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:19:44,053][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000484665_496296960.pth [2022-07-10 00:19:44,054][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000482673_494257152.pth [2022-07-10 00:19:45,193][26022] Updated weights on worker 0-0, policy_version 484672 (0.00087) [2022-07-10 00:19:47,094][26022] Updated weights on worker 0-0, policy_version 484682 (0.00080) [2022-07-10 00:19:48,071][25689] Fps is (10 sec: 5717.9, 60 sec: 5677.6, 300 sec: 5666.1). Total num frames: 496320512. Throughput: 0: 5933.6. Samples: 496324916. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:48,071][25689] Avg episode reward: [(0, '-42.591')] [2022-07-10 00:19:48,898][26022] Updated weights on worker 0-0, policy_version 484692 (0.00064) [2022-07-10 00:19:50,635][26022] Updated weights on worker 0-0, policy_version 484702 (0.00094) [2022-07-10 00:19:52,311][26022] Updated weights on worker 0-0, policy_version 484712 (0.00090) [2022-07-10 00:19:53,083][25689] Fps is (10 sec: 5822.4, 60 sec: 5695.6, 300 sec: 5660.1). Total num frames: 496348160. Throughput: 0: 5100.8. Samples: 496342238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:53,084][25689] Avg episode reward: [(0, '-42.924')] [2022-07-10 00:19:54,256][26022] Updated weights on worker 0-0, policy_version 484722 (0.00093) [2022-07-10 00:19:56,134][26022] Updated weights on worker 0-0, policy_version 484732 (0.00078) [2022-07-10 00:19:57,838][26022] Updated weights on worker 0-0, policy_version 484742 (0.00090) [2022-07-10 00:19:58,155][25689] Fps is (10 sec: 5585.1, 60 sec: 5676.1, 300 sec: 5663.4). Total num frames: 496376832. Throughput: 0: 5965.6. Samples: 496376816. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:19:58,155][25689] Avg episode reward: [(0, '-43.248')] [2022-07-10 00:19:59,621][26022] Updated weights on worker 0-0, policy_version 484752 (0.00081) [2022-07-10 00:20:01,442][26022] Updated weights on worker 0-0, policy_version 484762 (0.00092) [2022-07-10 00:20:03,164][25689] Fps is (10 sec: 5586.8, 60 sec: 5692.6, 300 sec: 5670.2). Total num frames: 496404480. Throughput: 0: 5873.1. Samples: 496408854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:03,165][25689] Avg episode reward: [(0, '-42.822')] [2022-07-10 00:20:03,612][26022] Updated weights on worker 0-0, policy_version 484772 (0.00090) [2022-07-10 00:20:05,402][26022] Updated weights on worker 0-0, policy_version 484782 (0.00087) [2022-07-10 00:20:07,416][26022] Updated weights on worker 0-0, policy_version 484792 (0.00394) [2022-07-10 00:20:08,219][25689] Fps is (10 sec: 5596.2, 60 sec: 5688.7, 300 sec: 5662.6). Total num frames: 496433152. Throughput: 0: 5011.4. Samples: 496425886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:08,219][25689] Avg episode reward: [(0, '-43.184')] [2022-07-10 00:20:08,957][26022] Updated weights on worker 0-0, policy_version 484802 (0.00085) [2022-07-10 00:20:10,882][26022] Updated weights on worker 0-0, policy_version 484812 (0.00086) [2022-07-10 00:20:12,476][26022] Updated weights on worker 0-0, policy_version 484822 (0.00101) [2022-07-10 00:20:13,243][25689] Fps is (10 sec: 5588.2, 60 sec: 5686.9, 300 sec: 5663.3). Total num frames: 496460800. Throughput: 0: 5857.5. Samples: 496460322. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:13,244][25689] Avg episode reward: [(0, '-43.803')] [2022-07-10 00:20:14,351][26022] Updated weights on worker 0-0, policy_version 484832 (0.00092) [2022-07-10 00:20:16,028][26022] Updated weights on worker 0-0, policy_version 484842 (0.00086) [2022-07-10 00:20:18,055][26022] Updated weights on worker 0-0, policy_version 484852 (0.00081) [2022-07-10 00:20:18,337][25689] Fps is (10 sec: 5566.3, 60 sec: 5649.1, 300 sec: 5661.6). Total num frames: 496489472. Throughput: 0: 5840.9. Samples: 496494698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:18,337][25689] Avg episode reward: [(0, '-44.659')] [2022-07-10 00:20:19,811][26022] Updated weights on worker 0-0, policy_version 484862 (0.00097) [2022-07-10 00:20:21,645][26022] Updated weights on worker 0-0, policy_version 484872 (0.00114) [2022-07-10 00:20:23,347][25689] Fps is (10 sec: 5675.2, 60 sec: 5632.1, 300 sec: 5658.8). Total num frames: 496518144. Throughput: 0: 5101.2. Samples: 496511810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:23,349][25689] Avg episode reward: [(0, '-44.716')] [2022-07-10 00:20:23,378][26022] Updated weights on worker 0-0, policy_version 484882 (0.00086) [2022-07-10 00:20:25,205][26022] Updated weights on worker 0-0, policy_version 484892 (0.00096) [2022-07-10 00:20:27,124][26022] Updated weights on worker 0-0, policy_version 484902 (0.00084) [2022-07-10 00:20:28,387][25689] Fps is (10 sec: 5705.9, 60 sec: 5662.7, 300 sec: 5661.5). Total num frames: 496546816. Throughput: 0: 5934.3. Samples: 496545570. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:28,387][25689] Avg episode reward: [(0, '-44.710')] [2022-07-10 00:20:28,807][26022] Updated weights on worker 0-0, policy_version 484912 (0.00083) [2022-07-10 00:20:30,724][26022] Updated weights on worker 0-0, policy_version 484922 (0.00095) [2022-07-10 00:20:32,431][26022] Updated weights on worker 0-0, policy_version 484932 (0.00079) [2022-07-10 00:20:33,426][25689] Fps is (10 sec: 5689.3, 60 sec: 5643.0, 300 sec: 5666.1). Total num frames: 496575488. Throughput: 0: 5898.3. Samples: 496579370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:33,427][25689] Avg episode reward: [(0, '-45.467')] [2022-07-10 00:20:34,219][26022] Updated weights on worker 0-0, policy_version 484942 (0.00084) [2022-07-10 00:20:36,043][26022] Updated weights on worker 0-0, policy_version 484952 (0.00093) [2022-07-10 00:20:37,884][26022] Updated weights on worker 0-0, policy_version 484962 (0.00080) [2022-07-10 00:20:38,543][25689] Fps is (10 sec: 5646.2, 60 sec: 5636.1, 300 sec: 5660.6). Total num frames: 496604160. Throughput: 0: 5894.8. Samples: 496613810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:38,545][25689] Avg episode reward: [(0, '-45.190')] [2022-07-10 00:20:39,574][26022] Updated weights on worker 0-0, policy_version 484972 (0.00101) [2022-07-10 00:20:41,459][26022] Updated weights on worker 0-0, policy_version 484982 (0.00085) [2022-07-10 00:20:43,137][26022] Updated weights on worker 0-0, policy_version 484992 (0.00095) [2022-07-10 00:20:43,610][25689] Fps is (10 sec: 5731.3, 60 sec: 5682.2, 300 sec: 5667.0). Total num frames: 496633856. Throughput: 0: 5879.3. Samples: 496630944. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:43,611][25689] Avg episode reward: [(0, '-44.558')] [2022-07-10 00:20:44,928][26022] Updated weights on worker 0-0, policy_version 485002 (0.00088) [2022-07-10 00:20:47,025][26022] Updated weights on worker 0-0, policy_version 485013 (0.00084) [2022-07-10 00:20:48,619][25689] Fps is (10 sec: 5792.6, 60 sec: 5648.6, 300 sec: 5660.4). Total num frames: 496662528. Throughput: 0: 5903.2. Samples: 496665006. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:48,620][25689] Avg episode reward: [(0, '-45.149')] [2022-07-10 00:20:48,893][26022] Updated weights on worker 0-0, policy_version 485023 (0.00093) [2022-07-10 00:20:50,619][26022] Updated weights on worker 0-0, policy_version 485033 (0.00088) [2022-07-10 00:20:52,383][26022] Updated weights on worker 0-0, policy_version 485043 (0.00088) [2022-07-10 00:20:53,633][25689] Fps is (10 sec: 5721.6, 60 sec: 5665.5, 300 sec: 5669.0). Total num frames: 496691200. Throughput: 0: 5924.6. Samples: 496699084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:53,634][25689] Avg episode reward: [(0, '-45.912')] [2022-07-10 00:20:54,052][26022] Updated weights on worker 0-0, policy_version 485053 (0.00087) [2022-07-10 00:20:56,333][26022] Updated weights on worker 0-0, policy_version 485063 (0.00089) [2022-07-10 00:20:57,908][26022] Updated weights on worker 0-0, policy_version 485073 (0.00088) [2022-07-10 00:20:58,728][25689] Fps is (10 sec: 5672.4, 60 sec: 5663.2, 300 sec: 5667.6). Total num frames: 496719872. Throughput: 0: 5073.4. Samples: 496716218. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:20:58,729][25689] Avg episode reward: [(0, '-44.928')] [2022-07-10 00:20:59,709][26022] Updated weights on worker 0-0, policy_version 485083 (0.00086) [2022-07-10 00:21:01,524][26022] Updated weights on worker 0-0, policy_version 485093 (0.00092) [2022-07-10 00:21:03,598][26022] Updated weights on worker 0-0, policy_version 485103 (0.00096) [2022-07-10 00:21:03,788][25689] Fps is (10 sec: 5344.5, 60 sec: 5624.8, 300 sec: 5670.1). Total num frames: 496745472. Throughput: 0: 5819.6. Samples: 496748368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:03,788][25689] Avg episode reward: [(0, '-44.076')] [2022-07-10 00:21:05,452][26022] Updated weights on worker 0-0, policy_version 485113 (0.00089) [2022-07-10 00:21:07,303][26022] Updated weights on worker 0-0, policy_version 485123 (0.00083) [2022-07-10 00:21:08,802][25689] Fps is (10 sec: 5387.7, 60 sec: 5628.5, 300 sec: 5666.6). Total num frames: 496774144. Throughput: 0: 5806.8. Samples: 496782202. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:08,802][25689] Avg episode reward: [(0, '-44.146')] [2022-07-10 00:21:09,134][26022] Updated weights on worker 0-0, policy_version 485133 (0.00082) [2022-07-10 00:21:11,098][26022] Updated weights on worker 0-0, policy_version 485143 (0.00090) [2022-07-10 00:21:12,843][26022] Updated weights on worker 0-0, policy_version 485153 (0.00083) [2022-07-10 00:21:13,815][25689] Fps is (10 sec: 5718.9, 60 sec: 5646.4, 300 sec: 5668.5). Total num frames: 496802816. Throughput: 0: 4958.7. Samples: 496799160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:13,815][25689] Avg episode reward: [(0, '-44.409')] [2022-07-10 00:21:14,617][26022] Updated weights on worker 0-0, policy_version 485163 (0.00094) [2022-07-10 00:21:16,156][26022] Updated weights on worker 0-0, policy_version 485173 (0.00092) [2022-07-10 00:21:18,054][26022] Updated weights on worker 0-0, policy_version 485183 (0.00085) [2022-07-10 00:21:18,923][25689] Fps is (10 sec: 5564.6, 60 sec: 5628.2, 300 sec: 5663.3). Total num frames: 496830464. Throughput: 0: 5792.2. Samples: 496833188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:18,923][25689] Avg episode reward: [(0, '-43.981')] [2022-07-10 00:21:20,027][26022] Updated weights on worker 0-0, policy_version 485193 (0.00088) [2022-07-10 00:21:21,719][26022] Updated weights on worker 0-0, policy_version 485203 (0.00089) [2022-07-10 00:21:23,695][26022] Updated weights on worker 0-0, policy_version 485213 (0.00085) [2022-07-10 00:21:23,986][25689] Fps is (10 sec: 5638.1, 60 sec: 5640.2, 300 sec: 5669.1). Total num frames: 496860160. Throughput: 0: 5902.2. Samples: 496867580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:23,986][25689] Avg episode reward: [(0, '-44.036')] [2022-07-10 00:21:25,163][26022] Updated weights on worker 0-0, policy_version 485223 (0.00087) [2022-07-10 00:21:27,171][26022] Updated weights on worker 0-0, policy_version 485233 (0.00111) [2022-07-10 00:21:28,801][26022] Updated weights on worker 0-0, policy_version 485243 (0.00087) [2022-07-10 00:21:29,027][25689] Fps is (10 sec: 5776.6, 60 sec: 5640.1, 300 sec: 5665.9). Total num frames: 496888832. Throughput: 0: 5059.5. Samples: 496884532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:29,028][25689] Avg episode reward: [(0, '-44.957')] [2022-07-10 00:21:30,697][26022] Updated weights on worker 0-0, policy_version 485253 (0.00087) [2022-07-10 00:21:32,506][26022] Updated weights on worker 0-0, policy_version 485263 (0.00080) [2022-07-10 00:21:34,041][25689] Fps is (10 sec: 5804.7, 60 sec: 5659.4, 300 sec: 5667.6). Total num frames: 496918528. Throughput: 0: 5914.7. Samples: 496918792. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:34,041][25689] Avg episode reward: [(0, '-45.158')] [2022-07-10 00:21:34,264][26022] Updated weights on worker 0-0, policy_version 485273 (0.00092) [2022-07-10 00:21:36,120][26022] Updated weights on worker 0-0, policy_version 485283 (0.00091) [2022-07-10 00:21:37,888][26022] Updated weights on worker 0-0, policy_version 485293 (0.00092) [2022-07-10 00:21:39,135][25689] Fps is (10 sec: 5774.6, 60 sec: 5661.5, 300 sec: 5669.3). Total num frames: 496947200. Throughput: 0: 5920.3. Samples: 496952850. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:39,135][25689] Avg episode reward: [(0, '-44.949')] [2022-07-10 00:21:39,684][26022] Updated weights on worker 0-0, policy_version 485303 (0.00085) [2022-07-10 00:21:41,611][26022] Updated weights on worker 0-0, policy_version 485313 (0.00100) [2022-07-10 00:21:43,367][26022] Updated weights on worker 0-0, policy_version 485323 (0.00085) [2022-07-10 00:21:44,116][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:21:44,130][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000485327_496974848.pth [2022-07-10 00:21:44,131][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000483336_494936064.pth [2022-07-10 00:21:44,231][25689] Fps is (10 sec: 5526.7, 60 sec: 5625.0, 300 sec: 5664.7). Total num frames: 496974848. Throughput: 0: 5054.2. Samples: 496969908. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:44,232][25689] Avg episode reward: [(0, '-45.089')] [2022-07-10 00:21:45,172][26022] Updated weights on worker 0-0, policy_version 485333 (0.00078) [2022-07-10 00:21:47,012][26022] Updated weights on worker 0-0, policy_version 485343 (0.00081) [2022-07-10 00:21:48,637][26022] Updated weights on worker 0-0, policy_version 485353 (0.00091) [2022-07-10 00:21:49,241][25689] Fps is (10 sec: 5674.1, 60 sec: 5641.8, 300 sec: 5668.0). Total num frames: 497004544. Throughput: 0: 5919.0. Samples: 497004180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:49,242][25689] Avg episode reward: [(0, '-44.780')] [2022-07-10 00:21:50,471][26022] Updated weights on worker 0-0, policy_version 485363 (0.00087) [2022-07-10 00:21:52,342][26022] Updated weights on worker 0-0, policy_version 485373 (0.00086) [2022-07-10 00:21:54,039][26022] Updated weights on worker 0-0, policy_version 485383 (0.00097) [2022-07-10 00:21:54,243][25689] Fps is (10 sec: 5829.9, 60 sec: 5642.8, 300 sec: 5666.1). Total num frames: 497033216. Throughput: 0: 5947.1. Samples: 497038938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:54,244][25689] Avg episode reward: [(0, '-45.117')] [2022-07-10 00:21:55,876][26022] Updated weights on worker 0-0, policy_version 485393 (0.00086) [2022-07-10 00:21:57,464][26022] Updated weights on worker 0-0, policy_version 485403 (0.00089) [2022-07-10 00:21:59,320][25689] Fps is (10 sec: 5791.4, 60 sec: 5661.6, 300 sec: 5665.7). Total num frames: 497062912. Throughput: 0: 5122.9. Samples: 497056254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:21:59,320][25689] Avg episode reward: [(0, '-44.892')] [2022-07-10 00:21:59,323][26022] Updated weights on worker 0-0, policy_version 485413 (0.00088) [2022-07-10 00:22:01,588][26022] Updated weights on worker 0-0, policy_version 485423 (0.00095) [2022-07-10 00:22:03,251][26022] Updated weights on worker 0-0, policy_version 485433 (0.00085) [2022-07-10 00:22:04,335][25689] Fps is (10 sec: 5479.1, 60 sec: 5665.6, 300 sec: 5668.9). Total num frames: 497088512. Throughput: 0: 5901.7. Samples: 497088556. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 00:22:04,336][25689] Avg episode reward: [(0, '-45.628')] [2022-07-10 00:22:05,215][26022] Updated weights on worker 0-0, policy_version 485443 (0.00097) [2022-07-10 00:22:06,879][26022] Updated weights on worker 0-0, policy_version 485453 (0.00085) [2022-07-10 00:22:08,732][26022] Updated weights on worker 0-0, policy_version 485463 (0.00085) [2022-07-10 00:22:09,430][25689] Fps is (10 sec: 5469.4, 60 sec: 5675.0, 300 sec: 5661.3). Total num frames: 497118208. Throughput: 0: 5872.5. Samples: 497122736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:09,431][25689] Avg episode reward: [(0, '-45.686')] [2022-07-10 00:22:10,593][26022] Updated weights on worker 0-0, policy_version 485473 (0.00091) [2022-07-10 00:22:12,118][26022] Updated weights on worker 0-0, policy_version 485483 (0.00086) [2022-07-10 00:22:14,318][26022] Updated weights on worker 0-0, policy_version 485493 (0.00093) [2022-07-10 00:22:14,448][25689] Fps is (10 sec: 5670.8, 60 sec: 5657.7, 300 sec: 5662.1). Total num frames: 497145856. Throughput: 0: 4990.1. Samples: 497139760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:14,448][25689] Avg episode reward: [(0, '-45.536')] [2022-07-10 00:22:15,811][26022] Updated weights on worker 0-0, policy_version 485503 (0.00087) [2022-07-10 00:22:17,817][26022] Updated weights on worker 0-0, policy_version 485513 (0.00091) [2022-07-10 00:22:19,509][25689] Fps is (10 sec: 5587.4, 60 sec: 5678.9, 300 sec: 5661.6). Total num frames: 497174528. Throughput: 0: 5822.1. Samples: 497173800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:19,510][25689] Avg episode reward: [(0, '-45.387')] [2022-07-10 00:22:19,591][26022] Updated weights on worker 0-0, policy_version 485523 (0.00090) [2022-07-10 00:22:21,408][26022] Updated weights on worker 0-0, policy_version 485533 (0.00090) [2022-07-10 00:22:23,299][26022] Updated weights on worker 0-0, policy_version 485543 (0.00088) [2022-07-10 00:22:24,534][25689] Fps is (10 sec: 5685.5, 60 sec: 5665.6, 300 sec: 5658.2). Total num frames: 497203200. Throughput: 0: 5914.5. Samples: 497208018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:24,534][25689] Avg episode reward: [(0, '-45.589')] [2022-07-10 00:22:24,955][26022] Updated weights on worker 0-0, policy_version 485553 (0.00088) [2022-07-10 00:22:26,943][26022] Updated weights on worker 0-0, policy_version 485563 (0.00096) [2022-07-10 00:22:28,612][26022] Updated weights on worker 0-0, policy_version 485573 (0.00503) [2022-07-10 00:22:29,543][25689] Fps is (10 sec: 5613.3, 60 sec: 5651.7, 300 sec: 5658.6). Total num frames: 497230848. Throughput: 0: 5089.0. Samples: 497225092. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:29,543][25689] Avg episode reward: [(0, '-45.102')] [2022-07-10 00:22:30,264][26022] Updated weights on worker 0-0, policy_version 485583 (0.00094) [2022-07-10 00:22:32,285][26022] Updated weights on worker 0-0, policy_version 485593 (0.00084) [2022-07-10 00:22:33,988][26022] Updated weights on worker 0-0, policy_version 485603 (0.00085) [2022-07-10 00:22:34,545][25689] Fps is (10 sec: 5727.8, 60 sec: 5652.8, 300 sec: 5659.5). Total num frames: 497260544. Throughput: 0: 5936.1. Samples: 497259062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:34,546][25689] Avg episode reward: [(0, '-45.725')] [2022-07-10 00:22:35,701][26022] Updated weights on worker 0-0, policy_version 485613 (0.00093) [2022-07-10 00:22:37,588][26022] Updated weights on worker 0-0, policy_version 485623 (0.00092) [2022-07-10 00:22:39,326][26022] Updated weights on worker 0-0, policy_version 485633 (0.00088) [2022-07-10 00:22:39,651][25689] Fps is (10 sec: 5774.4, 60 sec: 5651.7, 300 sec: 5659.3). Total num frames: 497289216. Throughput: 0: 5913.6. Samples: 497292908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:39,651][25689] Avg episode reward: [(0, '-45.512')] [2022-07-10 00:22:41,323][26022] Updated weights on worker 0-0, policy_version 485643 (0.00091) [2022-07-10 00:22:43,006][26022] Updated weights on worker 0-0, policy_version 485653 (0.00081) [2022-07-10 00:22:44,664][25689] Fps is (10 sec: 5666.9, 60 sec: 5676.4, 300 sec: 5656.9). Total num frames: 497317888. Throughput: 0: 5920.3. Samples: 497327198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:44,665][25689] Avg episode reward: [(0, '-45.617')] [2022-07-10 00:22:44,719][26022] Updated weights on worker 0-0, policy_version 485663 (0.00091) [2022-07-10 00:22:46,467][26022] Updated weights on worker 0-0, policy_version 485673 (0.00083) [2022-07-10 00:22:48,276][26022] Updated weights on worker 0-0, policy_version 485683 (0.00084) [2022-07-10 00:22:49,714][25689] Fps is (10 sec: 5698.5, 60 sec: 5655.7, 300 sec: 5653.9). Total num frames: 497346560. Throughput: 0: 5914.6. Samples: 497344396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:49,714][25689] Avg episode reward: [(0, '-45.051')] [2022-07-10 00:22:50,304][26022] Updated weights on worker 0-0, policy_version 485693 (0.00095) [2022-07-10 00:22:52,144][26022] Updated weights on worker 0-0, policy_version 485703 (0.00092) [2022-07-10 00:22:53,873][26022] Updated weights on worker 0-0, policy_version 485713 (0.00082) [2022-07-10 00:22:54,726][25689] Fps is (10 sec: 5699.1, 60 sec: 5654.8, 300 sec: 5655.8). Total num frames: 497375232. Throughput: 0: 5924.9. Samples: 497378632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:54,727][25689] Avg episode reward: [(0, '-45.465')] [2022-07-10 00:22:55,574][26022] Updated weights on worker 0-0, policy_version 485723 (0.00079) [2022-07-10 00:22:57,462][26022] Updated weights on worker 0-0, policy_version 485733 (0.00087) [2022-07-10 00:22:59,194][26022] Updated weights on worker 0-0, policy_version 485743 (0.00091) [2022-07-10 00:22:59,840][25689] Fps is (10 sec: 5561.8, 60 sec: 5617.4, 300 sec: 5657.8). Total num frames: 497402880. Throughput: 0: 5943.1. Samples: 497412894. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:22:59,840][25689] Avg episode reward: [(0, '-43.761')] [2022-07-10 00:23:01,009][26022] Updated weights on worker 0-0, policy_version 485753 (0.00084) [2022-07-10 00:23:03,255][26022] Updated weights on worker 0-0, policy_version 485763 (0.00095) [2022-07-10 00:23:04,848][26022] Updated weights on worker 0-0, policy_version 485773 (0.00081) [2022-07-10 00:23:04,962][25689] Fps is (10 sec: 5502.1, 60 sec: 5658.2, 300 sec: 5659.3). Total num frames: 497431552. Throughput: 0: 4962.4. Samples: 497427924. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:04,963][25689] Avg episode reward: [(0, '-43.943')] [2022-07-10 00:23:06,693][26022] Updated weights on worker 0-0, policy_version 485783 (0.00086) [2022-07-10 00:23:08,360][26022] Updated weights on worker 0-0, policy_version 485793 (0.00093) [2022-07-10 00:23:09,983][25689] Fps is (10 sec: 5552.4, 60 sec: 5631.2, 300 sec: 5655.5). Total num frames: 497459200. Throughput: 0: 5817.4. Samples: 497462312. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:09,984][25689] Avg episode reward: [(0, '-43.946')] [2022-07-10 00:23:10,458][26022] Updated weights on worker 0-0, policy_version 485803 (0.00086) [2022-07-10 00:23:12,097][26022] Updated weights on worker 0-0, policy_version 485813 (0.00088) [2022-07-10 00:23:14,035][26022] Updated weights on worker 0-0, policy_version 485823 (0.00094) [2022-07-10 00:23:14,998][25689] Fps is (10 sec: 5714.2, 60 sec: 5665.4, 300 sec: 5656.7). Total num frames: 497488896. Throughput: 0: 5837.0. Samples: 497496956. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:14,998][25689] Avg episode reward: [(0, '-44.130')] [2022-07-10 00:23:15,827][26022] Updated weights on worker 0-0, policy_version 485833 (0.00095) [2022-07-10 00:23:17,643][26022] Updated weights on worker 0-0, policy_version 485843 (0.00087) [2022-07-10 00:23:19,225][26022] Updated weights on worker 0-0, policy_version 485853 (0.00091) [2022-07-10 00:23:20,134][25689] Fps is (10 sec: 5750.3, 60 sec: 5658.4, 300 sec: 5654.3). Total num frames: 497517568. Throughput: 0: 4981.5. Samples: 497513988. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:20,134][25689] Avg episode reward: [(0, '-44.414')] [2022-07-10 00:23:21,100][26022] Updated weights on worker 0-0, policy_version 485863 (0.00089) [2022-07-10 00:23:22,986][26022] Updated weights on worker 0-0, policy_version 485873 (0.00089) [2022-07-10 00:23:24,678][26022] Updated weights on worker 0-0, policy_version 485883 (0.00089) [2022-07-10 00:23:25,170][25689] Fps is (10 sec: 5637.4, 60 sec: 5657.4, 300 sec: 5651.3). Total num frames: 497546240. Throughput: 0: 5957.9. Samples: 497548316. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:25,170][25689] Avg episode reward: [(0, '-44.449')] [2022-07-10 00:23:26,555][26022] Updated weights on worker 0-0, policy_version 485893 (0.00083) [2022-07-10 00:23:28,439][26022] Updated weights on worker 0-0, policy_version 485903 (0.00093) [2022-07-10 00:23:30,183][25689] Fps is (10 sec: 5706.2, 60 sec: 5673.8, 300 sec: 5658.0). Total num frames: 497574912. Throughput: 0: 5932.0. Samples: 497582138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:30,185][25689] Avg episode reward: [(0, '-44.165')] [2022-07-10 00:23:30,190][26022] Updated weights on worker 0-0, policy_version 485913 (0.00094) [2022-07-10 00:23:32,007][26022] Updated weights on worker 0-0, policy_version 485923 (0.00089) [2022-07-10 00:23:33,557][26022] Updated weights on worker 0-0, policy_version 485933 (0.00091) [2022-07-10 00:23:35,225][25689] Fps is (10 sec: 5702.5, 60 sec: 5653.2, 300 sec: 5655.8). Total num frames: 497603584. Throughput: 0: 5066.0. Samples: 497599430. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:35,226][25689] Avg episode reward: [(0, '-44.531')] [2022-07-10 00:23:35,795][26022] Updated weights on worker 0-0, policy_version 485943 (0.00087) [2022-07-10 00:23:37,274][26022] Updated weights on worker 0-0, policy_version 485953 (0.00091) [2022-07-10 00:23:39,265][26022] Updated weights on worker 0-0, policy_version 485963 (0.00085) [2022-07-10 00:23:40,283][25689] Fps is (10 sec: 5677.8, 60 sec: 5657.7, 300 sec: 5655.5). Total num frames: 497632256. Throughput: 0: 5918.9. Samples: 497633248. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:40,283][25689] Avg episode reward: [(0, '-43.959')] [2022-07-10 00:23:40,924][26022] Updated weights on worker 0-0, policy_version 485973 (0.00086) [2022-07-10 00:23:42,759][26022] Updated weights on worker 0-0, policy_version 485983 (0.00091) [2022-07-10 00:23:44,294][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:23:44,306][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000485992_497655808.pth [2022-07-10 00:23:44,307][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000484000_495616000.pth [2022-07-10 00:23:44,548][26022] Updated weights on worker 0-0, policy_version 485993 (0.00088) [2022-07-10 00:23:45,358][25689] Fps is (10 sec: 5659.6, 60 sec: 5652.0, 300 sec: 5654.7). Total num frames: 497660928. Throughput: 0: 5915.8. Samples: 497667744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:45,358][25689] Avg episode reward: [(0, '-43.444')] [2022-07-10 00:23:46,339][26022] Updated weights on worker 0-0, policy_version 486003 (0.00093) [2022-07-10 00:23:48,143][26022] Updated weights on worker 0-0, policy_version 486013 (0.00085) [2022-07-10 00:23:49,981][26022] Updated weights on worker 0-0, policy_version 486023 (0.00094) [2022-07-10 00:23:50,391][25689] Fps is (10 sec: 5774.2, 60 sec: 5670.4, 300 sec: 5664.9). Total num frames: 497690624. Throughput: 0: 5085.3. Samples: 497684902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:50,392][25689] Avg episode reward: [(0, '-43.397')] [2022-07-10 00:23:51,863][26022] Updated weights on worker 0-0, policy_version 486033 (0.00088) [2022-07-10 00:23:53,800][26022] Updated weights on worker 0-0, policy_version 486043 (0.00087) [2022-07-10 00:23:55,267][26022] Updated weights on worker 0-0, policy_version 486053 (0.00091) [2022-07-10 00:23:55,466][25689] Fps is (10 sec: 5774.1, 60 sec: 5664.5, 300 sec: 5660.8). Total num frames: 497719296. Throughput: 0: 5888.1. Samples: 497718610. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:23:55,467][25689] Avg episode reward: [(0, '-44.037')] [2022-07-10 00:23:57,317][26022] Updated weights on worker 0-0, policy_version 486063 (0.00085) [2022-07-10 00:23:58,766][26022] Updated weights on worker 0-0, policy_version 486073 (0.00086) [2022-07-10 00:24:00,521][25689] Fps is (10 sec: 5458.7, 60 sec: 5653.2, 300 sec: 5659.9). Total num frames: 497745920. Throughput: 0: 5902.9. Samples: 497752712. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:00,522][25689] Avg episode reward: [(0, '-44.047')] [2022-07-10 00:24:01,032][26022] Updated weights on worker 0-0, policy_version 486083 (0.00088) [2022-07-10 00:24:02,788][26022] Updated weights on worker 0-0, policy_version 486093 (0.00091) [2022-07-10 00:24:04,676][26022] Updated weights on worker 0-0, policy_version 486103 (0.00171) [2022-07-10 00:24:05,540][25689] Fps is (10 sec: 5591.0, 60 sec: 5679.7, 300 sec: 5663.2). Total num frames: 497775616. Throughput: 0: 4957.2. Samples: 497767790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:05,540][25689] Avg episode reward: [(0, '-43.593')] [2022-07-10 00:24:06,732][26022] Updated weights on worker 0-0, policy_version 486113 (0.00085) [2022-07-10 00:24:08,308][26022] Updated weights on worker 0-0, policy_version 486123 (0.00092) [2022-07-10 00:24:10,322][26022] Updated weights on worker 0-0, policy_version 486133 (0.00089) [2022-07-10 00:24:10,592][25689] Fps is (10 sec: 5490.4, 60 sec: 5643.0, 300 sec: 5655.4). Total num frames: 497801216. Throughput: 0: 5793.9. Samples: 497801946. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:10,594][25689] Avg episode reward: [(0, '-44.831')] [2022-07-10 00:24:11,792][26022] Updated weights on worker 0-0, policy_version 486143 (0.00092) [2022-07-10 00:24:13,930][26022] Updated weights on worker 0-0, policy_version 486153 (0.00092) [2022-07-10 00:24:15,622][25689] Fps is (10 sec: 5382.7, 60 sec: 5624.6, 300 sec: 5649.0). Total num frames: 497829888. Throughput: 0: 5822.0. Samples: 497835958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:15,623][25689] Avg episode reward: [(0, '-45.351')] [2022-07-10 00:24:15,768][26022] Updated weights on worker 0-0, policy_version 486163 (0.00087) [2022-07-10 00:24:17,351][26022] Updated weights on worker 0-0, policy_version 486173 (0.00089) [2022-07-10 00:24:19,225][26022] Updated weights on worker 0-0, policy_version 486183 (0.00095) [2022-07-10 00:24:20,569][26022] Updated weights on worker 0-0, policy_version 486193 (0.00084) [2022-07-10 00:24:20,712][25689] Fps is (10 sec: 5970.2, 60 sec: 5679.7, 300 sec: 5654.3). Total num frames: 497861632. Throughput: 0: 4981.5. Samples: 497853294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:20,714][25689] Avg episode reward: [(0, '-45.480')] [2022-07-10 00:24:22,866][26022] Updated weights on worker 0-0, policy_version 486203 (0.00092) [2022-07-10 00:24:24,373][26022] Updated weights on worker 0-0, policy_version 486213 (0.00088) [2022-07-10 00:24:25,767][25689] Fps is (10 sec: 5753.7, 60 sec: 5644.1, 300 sec: 5653.4). Total num frames: 497888256. Throughput: 0: 5926.7. Samples: 497887670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:25,768][25689] Avg episode reward: [(0, '-46.580')] [2022-07-10 00:24:26,453][26022] Updated weights on worker 0-0, policy_version 486223 (0.00091) [2022-07-10 00:24:27,943][26022] Updated weights on worker 0-0, policy_version 486233 (0.00085) [2022-07-10 00:24:29,840][26022] Updated weights on worker 0-0, policy_version 486243 (0.00085) [2022-07-10 00:24:30,789][25689] Fps is (10 sec: 5589.1, 60 sec: 5660.2, 300 sec: 5653.2). Total num frames: 497917952. Throughput: 0: 5939.9. Samples: 497921910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:30,791][25689] Avg episode reward: [(0, '-45.824')] [2022-07-10 00:24:31,620][26022] Updated weights on worker 0-0, policy_version 486253 (0.00091) [2022-07-10 00:24:33,556][26022] Updated weights on worker 0-0, policy_version 486263 (0.00080) [2022-07-10 00:24:35,219][26022] Updated weights on worker 0-0, policy_version 486273 (0.00091) [2022-07-10 00:24:35,869][25689] Fps is (10 sec: 5676.7, 60 sec: 5639.8, 300 sec: 5649.0). Total num frames: 497945600. Throughput: 0: 5097.2. Samples: 497939156. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:35,870][25689] Avg episode reward: [(0, '-44.805')] [2022-07-10 00:24:37,000][26022] Updated weights on worker 0-0, policy_version 486283 (0.00094) [2022-07-10 00:24:38,900][26022] Updated weights on worker 0-0, policy_version 486293 (0.00337) [2022-07-10 00:24:40,707][26022] Updated weights on worker 0-0, policy_version 486303 (0.00091) [2022-07-10 00:24:40,908][25689] Fps is (10 sec: 5667.0, 60 sec: 5658.4, 300 sec: 5658.9). Total num frames: 497975296. Throughput: 0: 5958.9. Samples: 497973638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:40,908][25689] Avg episode reward: [(0, '-44.378')] [2022-07-10 00:24:42,499][26022] Updated weights on worker 0-0, policy_version 486313 (0.00806) [2022-07-10 00:24:44,326][26022] Updated weights on worker 0-0, policy_version 486323 (0.00086) [2022-07-10 00:24:45,909][26022] Updated weights on worker 0-0, policy_version 486333 (0.00854) [2022-07-10 00:24:45,952][25689] Fps is (10 sec: 5889.9, 60 sec: 5678.1, 300 sec: 5654.9). Total num frames: 498004992. Throughput: 0: 5960.6. Samples: 498007988. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:45,953][25689] Avg episode reward: [(0, '-43.162')] [2022-07-10 00:24:47,958][26022] Updated weights on worker 0-0, policy_version 486343 (0.00094) [2022-07-10 00:24:49,643][26022] Updated weights on worker 0-0, policy_version 486353 (0.00054) [2022-07-10 00:24:50,954][25689] Fps is (10 sec: 5708.2, 60 sec: 5647.3, 300 sec: 5655.1). Total num frames: 498032640. Throughput: 0: 5998.8. Samples: 498042874. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:50,954][25689] Avg episode reward: [(0, '-42.864')] [2022-07-10 00:24:51,248][26022] Updated weights on worker 0-0, policy_version 486363 (0.00097) [2022-07-10 00:24:53,249][26022] Updated weights on worker 0-0, policy_version 486373 (0.00088) [2022-07-10 00:24:54,620][26022] Updated weights on worker 0-0, policy_version 486383 (0.00084) [2022-07-10 00:24:55,959][25689] Fps is (10 sec: 5526.0, 60 sec: 5636.9, 300 sec: 5652.9). Total num frames: 498060288. Throughput: 0: 6032.1. Samples: 498060342. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 00:24:55,960][25689] Avg episode reward: [(0, '-43.158')] [2022-07-10 00:24:56,753][26022] Updated weights on worker 0-0, policy_version 486393 (0.00084) [2022-07-10 00:24:58,364][26022] Updated weights on worker 0-0, policy_version 486403 (0.00091) [2022-07-10 00:25:00,052][26022] Updated weights on worker 0-0, policy_version 486413 (0.00086) [2022-07-10 00:25:01,077][25689] Fps is (10 sec: 5866.6, 60 sec: 5715.5, 300 sec: 5664.6). Total num frames: 498092032. Throughput: 0: 6014.6. Samples: 498094950. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:01,078][25689] Avg episode reward: [(0, '-42.949')] [2022-07-10 00:25:02,589][26022] Updated weights on worker 0-0, policy_version 486423 (0.00094) [2022-07-10 00:25:04,012][26022] Updated weights on worker 0-0, policy_version 486433 (0.00089) [2022-07-10 00:25:05,934][26022] Updated weights on worker 0-0, policy_version 486443 (0.00087) [2022-07-10 00:25:06,100][25689] Fps is (10 sec: 5654.5, 60 sec: 5647.5, 300 sec: 5654.9). Total num frames: 498117632. Throughput: 0: 5922.5. Samples: 498127314. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:06,101][25689] Avg episode reward: [(0, '-43.470')] [2022-07-10 00:25:07,667][26022] Updated weights on worker 0-0, policy_version 486453 (0.00098) [2022-07-10 00:25:09,465][26022] Updated weights on worker 0-0, policy_version 486463 (0.00095) [2022-07-10 00:25:11,118][25689] Fps is (10 sec: 5507.4, 60 sec: 5718.5, 300 sec: 5661.9). Total num frames: 498147328. Throughput: 0: 5048.1. Samples: 498144664. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:11,120][25689] Avg episode reward: [(0, '-43.700')] [2022-07-10 00:25:11,243][26022] Updated weights on worker 0-0, policy_version 486473 (0.00086) [2022-07-10 00:25:13,053][26022] Updated weights on worker 0-0, policy_version 486483 (0.00095) [2022-07-10 00:25:14,927][26022] Updated weights on worker 0-0, policy_version 486493 (0.00095) [2022-07-10 00:25:16,211][25689] Fps is (10 sec: 5772.8, 60 sec: 5712.5, 300 sec: 5661.9). Total num frames: 498176000. Throughput: 0: 5850.3. Samples: 498178824. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:16,212][25689] Avg episode reward: [(0, '-43.046')] [2022-07-10 00:25:16,639][26022] Updated weights on worker 0-0, policy_version 486503 (0.00085) [2022-07-10 00:25:18,493][26022] Updated weights on worker 0-0, policy_version 486513 (0.00094) [2022-07-10 00:25:20,115][26022] Updated weights on worker 0-0, policy_version 486523 (0.00094) [2022-07-10 00:25:21,325][25689] Fps is (10 sec: 5718.4, 60 sec: 5676.4, 300 sec: 5663.3). Total num frames: 498205696. Throughput: 0: 5846.8. Samples: 498213332. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:21,339][25689] Avg episode reward: [(0, '-42.833')] [2022-07-10 00:25:22,099][26022] Updated weights on worker 0-0, policy_version 486533 (0.00084) [2022-07-10 00:25:23,840][26022] Updated weights on worker 0-0, policy_version 486543 (0.00098) [2022-07-10 00:25:25,487][26022] Updated weights on worker 0-0, policy_version 486553 (0.00091) [2022-07-10 00:25:26,360][25689] Fps is (10 sec: 5751.2, 60 sec: 5712.1, 300 sec: 5663.4). Total num frames: 498234368. Throughput: 0: 5095.2. Samples: 498230540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:26,360][25689] Avg episode reward: [(0, '-42.345')] [2022-07-10 00:25:27,392][26022] Updated weights on worker 0-0, policy_version 486563 (0.00095) [2022-07-10 00:25:29,154][26022] Updated weights on worker 0-0, policy_version 486573 (0.00086) [2022-07-10 00:25:31,127][26022] Updated weights on worker 0-0, policy_version 486583 (0.00455) [2022-07-10 00:25:31,428][25689] Fps is (10 sec: 5675.7, 60 sec: 5690.8, 300 sec: 5662.9). Total num frames: 498263040. Throughput: 0: 5900.8. Samples: 498264512. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:31,429][25689] Avg episode reward: [(0, '-42.713')] [2022-07-10 00:25:32,822][26022] Updated weights on worker 0-0, policy_version 486593 (0.00081) [2022-07-10 00:25:34,656][26022] Updated weights on worker 0-0, policy_version 486603 (0.00092) [2022-07-10 00:25:36,457][25689] Fps is (10 sec: 5476.3, 60 sec: 5678.7, 300 sec: 5657.7). Total num frames: 498289664. Throughput: 0: 5907.1. Samples: 498298420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:36,457][25689] Avg episode reward: [(0, '-42.321')] [2022-07-10 00:25:36,808][26022] Updated weights on worker 0-0, policy_version 486613 (0.00084) [2022-07-10 00:25:38,253][26022] Updated weights on worker 0-0, policy_version 486623 (0.00090) [2022-07-10 00:25:40,142][26022] Updated weights on worker 0-0, policy_version 486633 (0.00092) [2022-07-10 00:25:41,572][25689] Fps is (10 sec: 5552.2, 60 sec: 5671.6, 300 sec: 5656.8). Total num frames: 498319360. Throughput: 0: 5032.3. Samples: 498315222. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:41,572][25689] Avg episode reward: [(0, '-42.212')] [2022-07-10 00:25:41,851][26022] Updated weights on worker 0-0, policy_version 486643 (0.00087) [2022-07-10 00:25:43,757][26022] Updated weights on worker 0-0, policy_version 486653 (0.00089) [2022-07-10 00:25:44,467][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:25:44,475][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000486657_498336768.pth [2022-07-10 00:25:44,476][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000484665_496296960.pth [2022-07-10 00:25:45,692][26022] Updated weights on worker 0-0, policy_version 486663 (0.00089) [2022-07-10 00:25:46,611][25689] Fps is (10 sec: 5748.4, 60 sec: 5655.3, 300 sec: 5656.2). Total num frames: 498348032. Throughput: 0: 5856.2. Samples: 498349134. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:46,612][25689] Avg episode reward: [(0, '-42.247')] [2022-07-10 00:25:47,283][26022] Updated weights on worker 0-0, policy_version 486673 (0.00085) [2022-07-10 00:25:49,163][26022] Updated weights on worker 0-0, policy_version 486683 (0.00095) [2022-07-10 00:25:51,046][26022] Updated weights on worker 0-0, policy_version 486693 (0.00096) [2022-07-10 00:25:51,649][25689] Fps is (10 sec: 5588.7, 60 sec: 5651.8, 300 sec: 5652.3). Total num frames: 498375680. Throughput: 0: 5868.6. Samples: 498383182. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:51,651][25689] Avg episode reward: [(0, '-42.596')] [2022-07-10 00:25:52,790][26022] Updated weights on worker 0-0, policy_version 486703 (0.00089) [2022-07-10 00:25:54,775][26022] Updated weights on worker 0-0, policy_version 486713 (0.00087) [2022-07-10 00:25:56,334][26022] Updated weights on worker 0-0, policy_version 486723 (0.00084) [2022-07-10 00:25:56,678][25689] Fps is (10 sec: 5696.0, 60 sec: 5683.3, 300 sec: 5657.0). Total num frames: 498405376. Throughput: 0: 5042.7. Samples: 498400390. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:25:56,680][25689] Avg episode reward: [(0, '-42.577')] [2022-07-10 00:25:58,244][26022] Updated weights on worker 0-0, policy_version 486733 (0.00085) [2022-07-10 00:25:59,962][26022] Updated weights on worker 0-0, policy_version 486743 (0.00098) [2022-07-10 00:26:01,783][25689] Fps is (10 sec: 5658.8, 60 sec: 5617.2, 300 sec: 5663.0). Total num frames: 498433024. Throughput: 0: 5913.8. Samples: 498434746. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:01,783][25689] Avg episode reward: [(0, '-42.652')] [2022-07-10 00:26:02,130][26022] Updated weights on worker 0-0, policy_version 486753 (0.00088) [2022-07-10 00:26:03,923][26022] Updated weights on worker 0-0, policy_version 486763 (0.00087) [2022-07-10 00:26:05,874][26022] Updated weights on worker 0-0, policy_version 486773 (0.00083) [2022-07-10 00:26:06,825][25689] Fps is (10 sec: 5449.6, 60 sec: 5649.1, 300 sec: 5659.0). Total num frames: 498460672. Throughput: 0: 5824.2. Samples: 498466866. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:06,826][25689] Avg episode reward: [(0, '-43.382')] [2022-07-10 00:26:07,504][26022] Updated weights on worker 0-0, policy_version 486783 (0.00090) [2022-07-10 00:26:09,413][26022] Updated weights on worker 0-0, policy_version 486793 (0.00089) [2022-07-10 00:26:11,098][26022] Updated weights on worker 0-0, policy_version 486803 (0.00083) [2022-07-10 00:26:11,868][25689] Fps is (10 sec: 5787.2, 60 sec: 5663.5, 300 sec: 5665.3). Total num frames: 498491392. Throughput: 0: 4980.5. Samples: 498483884. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:11,869][25689] Avg episode reward: [(0, '-43.586')] [2022-07-10 00:26:13,111][26022] Updated weights on worker 0-0, policy_version 486813 (0.00088) [2022-07-10 00:26:14,444][26022] Updated weights on worker 0-0, policy_version 486823 (0.00093) [2022-07-10 00:26:16,464][26022] Updated weights on worker 0-0, policy_version 486833 (0.00083) [2022-07-10 00:26:16,875][25689] Fps is (10 sec: 5706.1, 60 sec: 5637.9, 300 sec: 5663.8). Total num frames: 498518016. Throughput: 0: 5836.7. Samples: 498518270. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:16,875][25689] Avg episode reward: [(0, '-43.878')] [2022-07-10 00:26:18,435][26022] Updated weights on worker 0-0, policy_version 486843 (0.00096) [2022-07-10 00:26:20,095][26022] Updated weights on worker 0-0, policy_version 486853 (0.00093) [2022-07-10 00:26:21,931][25689] Fps is (10 sec: 5597.2, 60 sec: 5643.3, 300 sec: 5664.0). Total num frames: 498547712. Throughput: 0: 5831.8. Samples: 498552244. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:21,932][25689] Avg episode reward: [(0, '-43.789')] [2022-07-10 00:26:21,942][26022] Updated weights on worker 0-0, policy_version 486863 (0.00089) [2022-07-10 00:26:23,696][26022] Updated weights on worker 0-0, policy_version 486873 (0.00091) [2022-07-10 00:26:25,531][26022] Updated weights on worker 0-0, policy_version 486883 (0.00084) [2022-07-10 00:26:27,003][25689] Fps is (10 sec: 5661.7, 60 sec: 5622.9, 300 sec: 5659.9). Total num frames: 498575360. Throughput: 0: 5072.8. Samples: 498569224. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:27,004][25689] Avg episode reward: [(0, '-44.132')] [2022-07-10 00:26:27,620][26022] Updated weights on worker 0-0, policy_version 486893 (0.00087) [2022-07-10 00:26:29,044][26022] Updated weights on worker 0-0, policy_version 486903 (0.00091) [2022-07-10 00:26:31,267][26022] Updated weights on worker 0-0, policy_version 486913 (0.00091) [2022-07-10 00:26:32,028][25689] Fps is (10 sec: 5476.2, 60 sec: 5610.0, 300 sec: 5652.9). Total num frames: 498603008. Throughput: 0: 5905.7. Samples: 498602940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:32,029][25689] Avg episode reward: [(0, '-44.350')] [2022-07-10 00:26:32,775][26022] Updated weights on worker 0-0, policy_version 486923 (0.00089) [2022-07-10 00:26:34,872][26022] Updated weights on worker 0-0, policy_version 486933 (0.00089) [2022-07-10 00:26:36,433][26022] Updated weights on worker 0-0, policy_version 486943 (0.00094) [2022-07-10 00:26:37,042][25689] Fps is (10 sec: 5609.9, 60 sec: 5645.2, 300 sec: 5654.4). Total num frames: 498631680. Throughput: 0: 5864.5. Samples: 498636544. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:37,043][25689] Avg episode reward: [(0, '-45.043')] [2022-07-10 00:26:38,486][26022] Updated weights on worker 0-0, policy_version 486953 (0.00087) [2022-07-10 00:26:40,002][26022] Updated weights on worker 0-0, policy_version 486963 (0.00088) [2022-07-10 00:26:42,060][26022] Updated weights on worker 0-0, policy_version 486973 (0.00089) [2022-07-10 00:26:42,104][25689] Fps is (10 sec: 5691.1, 60 sec: 5633.2, 300 sec: 5658.5). Total num frames: 498660352. Throughput: 0: 5026.3. Samples: 498653642. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:42,105][25689] Avg episode reward: [(0, '-43.924')] [2022-07-10 00:26:43,593][26022] Updated weights on worker 0-0, policy_version 486983 (0.00088) [2022-07-10 00:26:45,655][26022] Updated weights on worker 0-0, policy_version 486993 (0.00094) [2022-07-10 00:26:47,116][25689] Fps is (10 sec: 5794.0, 60 sec: 5652.6, 300 sec: 5658.4). Total num frames: 498690048. Throughput: 0: 5906.5. Samples: 498688024. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:47,117][25689] Avg episode reward: [(0, '-43.817')] [2022-07-10 00:26:47,337][26022] Updated weights on worker 0-0, policy_version 487003 (0.00083) [2022-07-10 00:26:49,252][26022] Updated weights on worker 0-0, policy_version 487013 (0.00086) [2022-07-10 00:26:51,017][26022] Updated weights on worker 0-0, policy_version 487023 (0.00089) [2022-07-10 00:26:52,122][25689] Fps is (10 sec: 5826.2, 60 sec: 5672.6, 300 sec: 5658.4). Total num frames: 498718720. Throughput: 0: 5927.5. Samples: 498722048. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:52,123][25689] Avg episode reward: [(0, '-42.848')] [2022-07-10 00:26:52,916][26022] Updated weights on worker 0-0, policy_version 487033 (0.00091) [2022-07-10 00:26:54,541][26022] Updated weights on worker 0-0, policy_version 487043 (0.00090) [2022-07-10 00:26:56,536][26022] Updated weights on worker 0-0, policy_version 487053 (0.00092) [2022-07-10 00:26:57,131][25689] Fps is (10 sec: 5521.6, 60 sec: 5623.7, 300 sec: 5649.3). Total num frames: 498745344. Throughput: 0: 5111.4. Samples: 498739224. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:26:57,131][25689] Avg episode reward: [(0, '-42.469')] [2022-07-10 00:26:57,908][26022] Updated weights on worker 0-0, policy_version 487063 (0.00241) [2022-07-10 00:27:00,287][26022] Updated weights on worker 0-0, policy_version 487073 (0.00092) [2022-07-10 00:27:01,595][26022] Updated weights on worker 0-0, policy_version 487083 (0.00088) [2022-07-10 00:27:02,235][25689] Fps is (10 sec: 5569.4, 60 sec: 5657.6, 300 sec: 5661.4). Total num frames: 498775040. Throughput: 0: 5956.3. Samples: 498773546. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:02,235][25689] Avg episode reward: [(0, '-42.151')] [2022-07-10 00:27:03,982][26022] Updated weights on worker 0-0, policy_version 487093 (0.00098) [2022-07-10 00:27:05,716][26022] Updated weights on worker 0-0, policy_version 487103 (0.00094) [2022-07-10 00:27:07,250][25689] Fps is (10 sec: 5565.9, 60 sec: 5643.3, 300 sec: 5652.6). Total num frames: 498801664. Throughput: 0: 5841.9. Samples: 498805640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:07,250][25689] Avg episode reward: [(0, '-41.679')] [2022-07-10 00:27:07,578][26022] Updated weights on worker 0-0, policy_version 487113 (0.00050) [2022-07-10 00:27:09,268][26022] Updated weights on worker 0-0, policy_version 487123 (0.00085) [2022-07-10 00:27:11,516][26022] Updated weights on worker 0-0, policy_version 487133 (0.00084) [2022-07-10 00:27:12,267][25689] Fps is (10 sec: 5511.6, 60 sec: 5611.7, 300 sec: 5656.0). Total num frames: 498830336. Throughput: 0: 5833.6. Samples: 498839566. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:12,268][25689] Avg episode reward: [(0, '-43.180')] [2022-07-10 00:27:12,838][26022] Updated weights on worker 0-0, policy_version 487143 (0.00089) [2022-07-10 00:27:15,183][26022] Updated weights on worker 0-0, policy_version 487153 (0.00088) [2022-07-10 00:27:16,271][26022] Updated weights on worker 0-0, policy_version 487163 (0.00099) [2022-07-10 00:27:17,289][25689] Fps is (10 sec: 5712.2, 60 sec: 5644.3, 300 sec: 5656.8). Total num frames: 498859008. Throughput: 0: 5846.3. Samples: 498857072. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:17,290][25689] Avg episode reward: [(0, '-42.909')] [2022-07-10 00:27:18,492][26022] Updated weights on worker 0-0, policy_version 487173 (0.00081) [2022-07-10 00:27:20,078][26022] Updated weights on worker 0-0, policy_version 487183 (0.00086) [2022-07-10 00:27:21,909][26022] Updated weights on worker 0-0, policy_version 487193 (0.00087) [2022-07-10 00:27:22,409][25689] Fps is (10 sec: 5654.2, 60 sec: 5621.3, 300 sec: 5655.0). Total num frames: 498887680. Throughput: 0: 5858.7. Samples: 498891742. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:22,410][25689] Avg episode reward: [(0, '-43.790')] [2022-07-10 00:27:23,573][26022] Updated weights on worker 0-0, policy_version 487203 (0.00081) [2022-07-10 00:27:25,502][26022] Updated weights on worker 0-0, policy_version 487213 (0.00097) [2022-07-10 00:27:27,080][26022] Updated weights on worker 0-0, policy_version 487223 (0.00087) [2022-07-10 00:27:27,507][25689] Fps is (10 sec: 5812.2, 60 sec: 5669.7, 300 sec: 5663.6). Total num frames: 498918400. Throughput: 0: 5961.3. Samples: 498926400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:27,508][25689] Avg episode reward: [(0, '-43.177')] [2022-07-10 00:27:29,161][26022] Updated weights on worker 0-0, policy_version 487233 (0.00086) [2022-07-10 00:27:30,729][26022] Updated weights on worker 0-0, policy_version 487243 (0.00086) [2022-07-10 00:27:32,553][25689] Fps is (10 sec: 5753.9, 60 sec: 5667.7, 300 sec: 5655.9). Total num frames: 498946048. Throughput: 0: 5122.9. Samples: 498943488. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:32,554][25689] Avg episode reward: [(0, '-43.121')] [2022-07-10 00:27:32,664][26022] Updated weights on worker 0-0, policy_version 487253 (0.00093) [2022-07-10 00:27:34,267][26022] Updated weights on worker 0-0, policy_version 487263 (0.00092) [2022-07-10 00:27:36,212][26022] Updated weights on worker 0-0, policy_version 487273 (0.00085) [2022-07-10 00:27:37,569][25689] Fps is (10 sec: 5699.2, 60 sec: 5684.5, 300 sec: 5661.0). Total num frames: 498975744. Throughput: 0: 5954.3. Samples: 498977826. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:37,569][25689] Avg episode reward: [(0, '-43.473')] [2022-07-10 00:27:37,903][26022] Updated weights on worker 0-0, policy_version 487283 (0.00085) [2022-07-10 00:27:39,875][26022] Updated weights on worker 0-0, policy_version 487293 (0.00086) [2022-07-10 00:27:41,401][26022] Updated weights on worker 0-0, policy_version 487303 (0.00088) [2022-07-10 00:27:42,621][25689] Fps is (10 sec: 5797.5, 60 sec: 5685.4, 300 sec: 5660.3). Total num frames: 499004416. Throughput: 0: 5954.9. Samples: 499012102. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:42,622][25689] Avg episode reward: [(0, '-43.622')] [2022-07-10 00:27:43,467][26022] Updated weights on worker 0-0, policy_version 487313 (0.00089) [2022-07-10 00:27:44,573][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:27:44,592][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000487320_499015680.pth [2022-07-10 00:27:44,592][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000485327_496974848.pth [2022-07-10 00:27:44,964][26022] Updated weights on worker 0-0, policy_version 487323 (0.00081) [2022-07-10 00:27:47,103][26022] Updated weights on worker 0-0, policy_version 487333 (0.00093) [2022-07-10 00:27:47,680][25689] Fps is (10 sec: 5569.9, 60 sec: 5647.2, 300 sec: 5656.7). Total num frames: 499032064. Throughput: 0: 5094.9. Samples: 499029182. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2022-07-10 00:27:47,681][25689] Avg episode reward: [(0, '-43.792')] [2022-07-10 00:27:48,483][26022] Updated weights on worker 0-0, policy_version 487343 (0.00084) [2022-07-10 00:27:50,544][26022] Updated weights on worker 0-0, policy_version 487353 (0.00088) [2022-07-10 00:27:52,299][26022] Updated weights on worker 0-0, policy_version 487363 (0.00085) [2022-07-10 00:27:52,723][25689] Fps is (10 sec: 5676.5, 60 sec: 5660.6, 300 sec: 5659.5). Total num frames: 499061760. Throughput: 0: 5944.4. Samples: 499063386. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:27:52,724][25689] Avg episode reward: [(0, '-43.318')] [2022-07-10 00:27:54,163][26022] Updated weights on worker 0-0, policy_version 487373 (0.00615) [2022-07-10 00:27:55,973][26022] Updated weights on worker 0-0, policy_version 487383 (0.00091) [2022-07-10 00:27:57,813][25689] Fps is (10 sec: 5659.3, 60 sec: 5669.9, 300 sec: 5660.0). Total num frames: 499089408. Throughput: 0: 5915.0. Samples: 499097570. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:27:57,814][25689] Avg episode reward: [(0, '-43.283')] [2022-07-10 00:27:57,876][26022] Updated weights on worker 0-0, policy_version 487393 (0.00074) [2022-07-10 00:27:59,406][26022] Updated weights on worker 0-0, policy_version 487403 (0.00090) [2022-07-10 00:28:01,510][26022] Updated weights on worker 0-0, policy_version 487413 (0.00101) [2022-07-10 00:28:02,883][25689] Fps is (10 sec: 5442.8, 60 sec: 5639.4, 300 sec: 5657.6). Total num frames: 499117056. Throughput: 0: 5054.3. Samples: 499114510. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:02,883][25689] Avg episode reward: [(0, '-43.565')] [2022-07-10 00:28:03,346][26022] Updated weights on worker 0-0, policy_version 487423 (0.00082) [2022-07-10 00:28:05,548][26022] Updated weights on worker 0-0, policy_version 487433 (0.00090) [2022-07-10 00:28:06,959][26022] Updated weights on worker 0-0, policy_version 487443 (0.00084) [2022-07-10 00:28:07,888][25689] Fps is (10 sec: 5488.6, 60 sec: 5657.2, 300 sec: 5657.9). Total num frames: 499144704. Throughput: 0: 5819.3. Samples: 499146776. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:07,888][25689] Avg episode reward: [(0, '-43.832')] [2022-07-10 00:28:09,086][26022] Updated weights on worker 0-0, policy_version 487453 (0.00086) [2022-07-10 00:28:10,607][26022] Updated weights on worker 0-0, policy_version 487463 (0.00083) [2022-07-10 00:28:12,560][26022] Updated weights on worker 0-0, policy_version 487473 (0.00093) [2022-07-10 00:28:12,901][25689] Fps is (10 sec: 5621.6, 60 sec: 5657.6, 300 sec: 5654.4). Total num frames: 499173376. Throughput: 0: 5829.0. Samples: 499181004. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:12,902][25689] Avg episode reward: [(0, '-44.277')] [2022-07-10 00:28:14,246][26022] Updated weights on worker 0-0, policy_version 487483 (0.00088) [2022-07-10 00:28:16,218][26022] Updated weights on worker 0-0, policy_version 487493 (0.00084) [2022-07-10 00:28:17,750][26022] Updated weights on worker 0-0, policy_version 487503 (0.00088) [2022-07-10 00:28:17,918][25689] Fps is (10 sec: 5921.3, 60 sec: 5691.8, 300 sec: 5663.6). Total num frames: 499204096. Throughput: 0: 5001.3. Samples: 499198120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:17,919][25689] Avg episode reward: [(0, '-44.601')] [2022-07-10 00:28:19,655][26022] Updated weights on worker 0-0, policy_version 487513 (0.00087) [2022-07-10 00:28:21,323][26022] Updated weights on worker 0-0, policy_version 487523 (0.00092) [2022-07-10 00:28:23,043][25689] Fps is (10 sec: 5755.5, 60 sec: 5674.5, 300 sec: 5658.5). Total num frames: 499231744. Throughput: 0: 5846.8. Samples: 499232380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:23,043][25689] Avg episode reward: [(0, '-45.028')] [2022-07-10 00:28:23,249][26022] Updated weights on worker 0-0, policy_version 487533 (0.00090) [2022-07-10 00:28:25,052][26022] Updated weights on worker 0-0, policy_version 487543 (0.00082) [2022-07-10 00:28:26,872][26022] Updated weights on worker 0-0, policy_version 487553 (0.00092) [2022-07-10 00:28:28,047][25689] Fps is (10 sec: 5459.1, 60 sec: 5632.5, 300 sec: 5655.2). Total num frames: 499259392. Throughput: 0: 5933.0. Samples: 499266382. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:28,048][25689] Avg episode reward: [(0, '-44.271')] [2022-07-10 00:28:28,803][26022] Updated weights on worker 0-0, policy_version 487563 (0.00090) [2022-07-10 00:28:30,552][26022] Updated weights on worker 0-0, policy_version 487573 (0.00084) [2022-07-10 00:28:32,258][26022] Updated weights on worker 0-0, policy_version 487583 (0.00086) [2022-07-10 00:28:33,087][25689] Fps is (10 sec: 5709.1, 60 sec: 5667.0, 300 sec: 5658.7). Total num frames: 499289088. Throughput: 0: 5073.6. Samples: 499283418. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:33,087][25689] Avg episode reward: [(0, '-43.828')] [2022-07-10 00:28:34,334][26022] Updated weights on worker 0-0, policy_version 487593 (0.00097) [2022-07-10 00:28:35,926][26022] Updated weights on worker 0-0, policy_version 487603 (0.00084) [2022-07-10 00:28:37,826][26022] Updated weights on worker 0-0, policy_version 487613 (0.00086) [2022-07-10 00:28:38,107][25689] Fps is (10 sec: 5801.9, 60 sec: 5649.6, 300 sec: 5659.4). Total num frames: 499317760. Throughput: 0: 5916.9. Samples: 499317578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:38,108][25689] Avg episode reward: [(0, '-43.950')] [2022-07-10 00:28:39,552][26022] Updated weights on worker 0-0, policy_version 487623 (0.00092) [2022-07-10 00:28:41,326][26022] Updated weights on worker 0-0, policy_version 487633 (0.00085) [2022-07-10 00:28:43,156][26022] Updated weights on worker 0-0, policy_version 487643 (0.00106) [2022-07-10 00:28:43,248][25689] Fps is (10 sec: 5643.6, 60 sec: 5641.4, 300 sec: 5658.1). Total num frames: 499346432. Throughput: 0: 5915.4. Samples: 499351902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:43,248][25689] Avg episode reward: [(0, '-43.020')] [2022-07-10 00:28:44,723][26022] Updated weights on worker 0-0, policy_version 487653 (0.00107) [2022-07-10 00:28:46,778][26022] Updated weights on worker 0-0, policy_version 487663 (0.00087) [2022-07-10 00:28:48,280][25689] Fps is (10 sec: 5636.8, 60 sec: 5660.7, 300 sec: 5654.7). Total num frames: 499375104. Throughput: 0: 5067.5. Samples: 499368914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:48,281][25689] Avg episode reward: [(0, '-42.903')] [2022-07-10 00:28:48,608][26022] Updated weights on worker 0-0, policy_version 487673 (0.00085) [2022-07-10 00:28:50,215][26022] Updated weights on worker 0-0, policy_version 487683 (0.00084) [2022-07-10 00:28:52,266][26022] Updated weights on worker 0-0, policy_version 487693 (0.00089) [2022-07-10 00:28:53,294][25689] Fps is (10 sec: 5708.1, 60 sec: 5646.6, 300 sec: 5655.9). Total num frames: 499403776. Throughput: 0: 5914.6. Samples: 499402934. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:53,295][25689] Avg episode reward: [(0, '-43.355')] [2022-07-10 00:28:53,888][26022] Updated weights on worker 0-0, policy_version 487703 (0.00280) [2022-07-10 00:28:55,812][26022] Updated weights on worker 0-0, policy_version 487713 (0.00088) [2022-07-10 00:28:57,636][26022] Updated weights on worker 0-0, policy_version 487723 (0.00090) [2022-07-10 00:28:58,386][25689] Fps is (10 sec: 5674.3, 60 sec: 5663.2, 300 sec: 5662.1). Total num frames: 499432448. Throughput: 0: 5893.6. Samples: 499437094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:28:58,387][25689] Avg episode reward: [(0, '-43.457')] [2022-07-10 00:28:59,237][26022] Updated weights on worker 0-0, policy_version 487733 (0.00086) [2022-07-10 00:29:01,358][26022] Updated weights on worker 0-0, policy_version 487743 (0.00085) [2022-07-10 00:29:03,265][26022] Updated weights on worker 0-0, policy_version 487753 (0.00087) [2022-07-10 00:29:03,453][25689] Fps is (10 sec: 5442.8, 60 sec: 5646.6, 300 sec: 5650.8). Total num frames: 499459072. Throughput: 0: 5063.7. Samples: 499454216. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:03,454][25689] Avg episode reward: [(0, '-43.312')] [2022-07-10 00:29:05,191][26022] Updated weights on worker 0-0, policy_version 487763 (0.00084) [2022-07-10 00:29:06,867][26022] Updated weights on worker 0-0, policy_version 487773 (0.00093) [2022-07-10 00:29:08,471][25689] Fps is (10 sec: 5584.6, 60 sec: 5679.2, 300 sec: 5665.2). Total num frames: 499488768. Throughput: 0: 5822.8. Samples: 499486480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:08,472][25689] Avg episode reward: [(0, '-43.574')] [2022-07-10 00:29:08,615][26022] Updated weights on worker 0-0, policy_version 487783 (0.00084) [2022-07-10 00:29:10,343][26022] Updated weights on worker 0-0, policy_version 487793 (0.00093) [2022-07-10 00:29:12,261][26022] Updated weights on worker 0-0, policy_version 487803 (0.00091) [2022-07-10 00:29:13,491][25689] Fps is (10 sec: 5610.9, 60 sec: 5644.8, 300 sec: 5658.6). Total num frames: 499515392. Throughput: 0: 5824.3. Samples: 499520568. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:13,491][25689] Avg episode reward: [(0, '-44.250')] [2022-07-10 00:29:14,177][26022] Updated weights on worker 0-0, policy_version 487813 (0.00090) [2022-07-10 00:29:15,862][26022] Updated weights on worker 0-0, policy_version 487823 (0.00091) [2022-07-10 00:29:17,754][26022] Updated weights on worker 0-0, policy_version 487833 (0.00086) [2022-07-10 00:29:18,523][25689] Fps is (10 sec: 5399.4, 60 sec: 5592.7, 300 sec: 5645.9). Total num frames: 499543040. Throughput: 0: 5828.4. Samples: 499554456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:18,523][25689] Avg episode reward: [(0, '-43.724')] [2022-07-10 00:29:19,520][26022] Updated weights on worker 0-0, policy_version 487843 (0.00091) [2022-07-10 00:29:21,383][26022] Updated weights on worker 0-0, policy_version 487853 (0.00085) [2022-07-10 00:29:23,073][26022] Updated weights on worker 0-0, policy_version 487863 (0.00094) [2022-07-10 00:29:23,623][25689] Fps is (10 sec: 5760.4, 60 sec: 5645.6, 300 sec: 5658.8). Total num frames: 499573760. Throughput: 0: 5800.7. Samples: 499571218. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:23,624][25689] Avg episode reward: [(0, '-43.252')] [2022-07-10 00:29:25,021][26022] Updated weights on worker 0-0, policy_version 487873 (0.00088) [2022-07-10 00:29:26,691][26022] Updated weights on worker 0-0, policy_version 487883 (0.00091) [2022-07-10 00:29:28,632][26022] Updated weights on worker 0-0, policy_version 487893 (0.01040) [2022-07-10 00:29:28,659][25689] Fps is (10 sec: 5859.5, 60 sec: 5659.6, 300 sec: 5655.1). Total num frames: 499602432. Throughput: 0: 5892.5. Samples: 499605436. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:28,659][25689] Avg episode reward: [(0, '-43.434')] [2022-07-10 00:29:30,301][26022] Updated weights on worker 0-0, policy_version 487903 (0.00082) [2022-07-10 00:29:32,184][26022] Updated weights on worker 0-0, policy_version 487913 (0.00095) [2022-07-10 00:29:33,681][25689] Fps is (10 sec: 5600.0, 60 sec: 5627.5, 300 sec: 5656.2). Total num frames: 499630080. Throughput: 0: 5893.4. Samples: 499639554. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:33,681][25689] Avg episode reward: [(0, '-43.293')] [2022-07-10 00:29:33,993][26022] Updated weights on worker 0-0, policy_version 487923 (0.00089) [2022-07-10 00:29:35,918][26022] Updated weights on worker 0-0, policy_version 487933 (0.00090) [2022-07-10 00:29:37,600][26022] Updated weights on worker 0-0, policy_version 487943 (0.00090) [2022-07-10 00:29:38,683][25689] Fps is (10 sec: 5720.6, 60 sec: 5646.1, 300 sec: 5656.9). Total num frames: 499659776. Throughput: 0: 5073.1. Samples: 499656730. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:38,683][25689] Avg episode reward: [(0, '-44.458')] [2022-07-10 00:29:39,603][26022] Updated weights on worker 0-0, policy_version 487953 (0.00100) [2022-07-10 00:29:41,283][26022] Updated weights on worker 0-0, policy_version 487963 (0.00084) [2022-07-10 00:29:43,087][26022] Updated weights on worker 0-0, policy_version 487973 (0.00091) [2022-07-10 00:29:43,755][25689] Fps is (10 sec: 5895.1, 60 sec: 5669.4, 300 sec: 5656.4). Total num frames: 499689472. Throughput: 0: 5938.3. Samples: 499690768. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:43,756][25689] Avg episode reward: [(0, '-44.315')] [2022-07-10 00:29:44,634][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:29:44,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000487981_499692544.pth [2022-07-10 00:29:44,646][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000485992_497655808.pth [2022-07-10 00:29:44,958][26022] Updated weights on worker 0-0, policy_version 487983 (0.00084) [2022-07-10 00:29:46,533][26022] Updated weights on worker 0-0, policy_version 487993 (0.00085) [2022-07-10 00:29:48,570][26022] Updated weights on worker 0-0, policy_version 488003 (0.00052) [2022-07-10 00:29:48,838][25689] Fps is (10 sec: 5546.0, 60 sec: 5630.9, 300 sec: 5651.4). Total num frames: 499716096. Throughput: 0: 5902.1. Samples: 499724536. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:48,838][25689] Avg episode reward: [(0, '-43.487')] [2022-07-10 00:29:50,051][26022] Updated weights on worker 0-0, policy_version 488013 (0.00091) [2022-07-10 00:29:52,223][26022] Updated weights on worker 0-0, policy_version 488023 (0.00080) [2022-07-10 00:29:53,666][26022] Updated weights on worker 0-0, policy_version 488033 (0.00096) [2022-07-10 00:29:53,845][25689] Fps is (10 sec: 5582.0, 60 sec: 5648.4, 300 sec: 5658.2). Total num frames: 499745792. Throughput: 0: 5067.3. Samples: 499741732. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:53,845][25689] Avg episode reward: [(0, '-43.631')] [2022-07-10 00:29:55,706][26022] Updated weights on worker 0-0, policy_version 488043 (0.00087) [2022-07-10 00:29:57,562][26022] Updated weights on worker 0-0, policy_version 488053 (0.00091) [2022-07-10 00:29:58,865][25689] Fps is (10 sec: 5718.9, 60 sec: 5638.2, 300 sec: 5646.3). Total num frames: 499773440. Throughput: 0: 5900.6. Samples: 499775816. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:29:58,865][25689] Avg episode reward: [(0, '-43.683')] [2022-07-10 00:29:59,384][26022] Updated weights on worker 0-0, policy_version 488063 (0.00087) [2022-07-10 00:30:01,205][26022] Updated weights on worker 0-0, policy_version 488073 (0.00096) [2022-07-10 00:30:03,557][26022] Updated weights on worker 0-0, policy_version 488083 (0.00085) [2022-07-10 00:30:03,914][25689] Fps is (10 sec: 5389.8, 60 sec: 5639.9, 300 sec: 5649.3). Total num frames: 499800064. Throughput: 0: 5802.9. Samples: 499807748. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:03,915][25689] Avg episode reward: [(0, '-44.032')] [2022-07-10 00:30:04,964][26022] Updated weights on worker 0-0, policy_version 488093 (0.00089) [2022-07-10 00:30:07,128][26022] Updated weights on worker 0-0, policy_version 488103 (0.00092) [2022-07-10 00:30:08,585][26022] Updated weights on worker 0-0, policy_version 488113 (0.00086) [2022-07-10 00:30:08,922][25689] Fps is (10 sec: 5498.4, 60 sec: 5623.9, 300 sec: 5646.0). Total num frames: 499828736. Throughput: 0: 4994.7. Samples: 499824848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:08,922][25689] Avg episode reward: [(0, '-42.362')] [2022-07-10 00:30:10,635][26022] Updated weights on worker 0-0, policy_version 488123 (0.00091) [2022-07-10 00:30:12,333][26022] Updated weights on worker 0-0, policy_version 488133 (0.00089) [2022-07-10 00:30:13,927][25689] Fps is (10 sec: 5625.2, 60 sec: 5642.2, 300 sec: 5644.3). Total num frames: 499856384. Throughput: 0: 5852.1. Samples: 499859252. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:13,927][25689] Avg episode reward: [(0, '-42.157')] [2022-07-10 00:30:14,238][26022] Updated weights on worker 0-0, policy_version 488143 (0.00081) [2022-07-10 00:30:15,919][26022] Updated weights on worker 0-0, policy_version 488153 (0.00091) [2022-07-10 00:30:17,718][26022] Updated weights on worker 0-0, policy_version 488163 (0.00092) [2022-07-10 00:30:18,935][25689] Fps is (10 sec: 5624.2, 60 sec: 5661.3, 300 sec: 5642.8). Total num frames: 499885056. Throughput: 0: 5862.5. Samples: 499893480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:18,936][25689] Avg episode reward: [(0, '-41.483')] [2022-07-10 00:30:19,562][26022] Updated weights on worker 0-0, policy_version 488173 (0.00091) [2022-07-10 00:30:21,249][26022] Updated weights on worker 0-0, policy_version 488183 (0.00086) [2022-07-10 00:30:23,186][26022] Updated weights on worker 0-0, policy_version 488193 (0.00089) [2022-07-10 00:30:24,008][25689] Fps is (10 sec: 5789.4, 60 sec: 5647.0, 300 sec: 5645.5). Total num frames: 499914752. Throughput: 0: 5117.3. Samples: 499910576. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:24,010][25689] Avg episode reward: [(0, '-42.039')] [2022-07-10 00:30:24,701][26022] Updated weights on worker 0-0, policy_version 488203 (0.00091) [2022-07-10 00:30:26,851][26022] Updated weights on worker 0-0, policy_version 488213 (0.00088) [2022-07-10 00:30:28,639][26022] Updated weights on worker 0-0, policy_version 488223 (0.00083) [2022-07-10 00:30:29,039][25689] Fps is (10 sec: 5777.0, 60 sec: 5647.4, 300 sec: 5646.2). Total num frames: 499943424. Throughput: 0: 5957.7. Samples: 499944702. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:29,040][25689] Avg episode reward: [(0, '-42.558')] [2022-07-10 00:30:30,259][26022] Updated weights on worker 0-0, policy_version 488233 (0.00096) [2022-07-10 00:30:32,162][26022] Updated weights on worker 0-0, policy_version 488243 (0.00093) [2022-07-10 00:30:33,899][26022] Updated weights on worker 0-0, policy_version 488253 (0.00085) [2022-07-10 00:30:34,066][25689] Fps is (10 sec: 5599.9, 60 sec: 5647.0, 300 sec: 5649.7). Total num frames: 499971072. Throughput: 0: 5946.0. Samples: 499979000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:34,066][25689] Avg episode reward: [(0, '-42.935')] [2022-07-10 00:30:35,663][26022] Updated weights on worker 0-0, policy_version 488263 (0.00093) [2022-07-10 00:30:37,570][26022] Updated weights on worker 0-0, policy_version 488273 (0.00096) [2022-07-10 00:30:39,093][25689] Fps is (10 sec: 5601.8, 60 sec: 5627.7, 300 sec: 5647.9). Total num frames: 499999744. Throughput: 0: 5080.3. Samples: 499995886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:39,093][25689] Avg episode reward: [(0, '-43.554')] [2022-07-10 00:30:39,143][26022] Updated weights on worker 0-0, policy_version 488283 (0.00085) [2022-07-10 00:30:41,318][26022] Updated weights on worker 0-0, policy_version 488293 (0.00091) [2022-07-10 00:30:42,914][26022] Updated weights on worker 0-0, policy_version 488303 (0.00105) [2022-07-10 00:30:44,184][25689] Fps is (10 sec: 5667.3, 60 sec: 5609.0, 300 sec: 5647.0). Total num frames: 500028416. Throughput: 0: 5904.3. Samples: 500029698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 00:30:44,185][25689] Avg episode reward: [(0, '-44.231')] [2022-07-10 00:30:44,738][26022] Updated weights on worker 0-0, policy_version 488313 (0.00079) [2022-07-10 00:30:46,583][26022] Updated weights on worker 0-0, policy_version 488323 (0.00097) [2022-07-10 00:30:48,547][26022] Updated weights on worker 0-0, policy_version 488333 (0.00068) [2022-07-10 00:30:49,229][25689] Fps is (10 sec: 5657.3, 60 sec: 5646.4, 300 sec: 5650.3). Total num frames: 500057088. Throughput: 0: 5894.8. Samples: 500063720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:30:49,230][25689] Avg episode reward: [(0, '-44.531')] [2022-07-10 00:30:50,392][26022] Updated weights on worker 0-0, policy_version 488343 (0.00088) [2022-07-10 00:30:52,246][26022] Updated weights on worker 0-0, policy_version 488353 (0.00091) [2022-07-10 00:30:53,846][26022] Updated weights on worker 0-0, policy_version 488363 (0.00088) [2022-07-10 00:30:54,272][25689] Fps is (10 sec: 5684.6, 60 sec: 5626.1, 300 sec: 5646.6). Total num frames: 500085760. Throughput: 0: 5033.3. Samples: 500080702. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:30:54,273][25689] Avg episode reward: [(0, '-45.022')] [2022-07-10 00:30:55,706][26022] Updated weights on worker 0-0, policy_version 488373 (0.00086) [2022-07-10 00:30:57,406][26022] Updated weights on worker 0-0, policy_version 488383 (0.00092) [2022-07-10 00:30:59,268][26022] Updated weights on worker 0-0, policy_version 488393 (0.00087) [2022-07-10 00:30:59,275][25689] Fps is (10 sec: 5708.1, 60 sec: 5644.6, 300 sec: 5651.9). Total num frames: 500114432. Throughput: 0: 5910.1. Samples: 500115166. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:30:59,276][25689] Avg episode reward: [(0, '-45.572')] [2022-07-10 00:31:00,906][26022] Updated weights on worker 0-0, policy_version 488403 (0.00094) [2022-07-10 00:31:03,379][26022] Updated weights on worker 0-0, policy_version 488413 (0.00091) [2022-07-10 00:31:04,363][25689] Fps is (10 sec: 5581.2, 60 sec: 5658.0, 300 sec: 5651.1). Total num frames: 500142080. Throughput: 0: 5820.9. Samples: 500147156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:04,363][25689] Avg episode reward: [(0, '-45.063')] [2022-07-10 00:31:04,907][26022] Updated weights on worker 0-0, policy_version 488423 (0.00095) [2022-07-10 00:31:06,982][26022] Updated weights on worker 0-0, policy_version 488433 (0.00085) [2022-07-10 00:31:08,606][26022] Updated weights on worker 0-0, policy_version 488443 (0.00084) [2022-07-10 00:31:09,398][25689] Fps is (10 sec: 5361.7, 60 sec: 5621.5, 300 sec: 5637.5). Total num frames: 500168704. Throughput: 0: 4989.3. Samples: 500164346. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:09,398][25689] Avg episode reward: [(0, '-44.725')] [2022-07-10 00:31:10,450][26022] Updated weights on worker 0-0, policy_version 488453 (0.00095) [2022-07-10 00:31:12,374][26022] Updated weights on worker 0-0, policy_version 488463 (0.00110) [2022-07-10 00:31:13,949][26022] Updated weights on worker 0-0, policy_version 488473 (0.00086) [2022-07-10 00:31:14,430][25689] Fps is (10 sec: 5696.3, 60 sec: 5669.8, 300 sec: 5650.8). Total num frames: 500199424. Throughput: 0: 5857.7. Samples: 500198782. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:14,430][25689] Avg episode reward: [(0, '-43.573')] [2022-07-10 00:31:15,768][26022] Updated weights on worker 0-0, policy_version 488483 (0.00089) [2022-07-10 00:31:17,502][26022] Updated weights on worker 0-0, policy_version 488493 (0.00087) [2022-07-10 00:31:19,434][26022] Updated weights on worker 0-0, policy_version 488503 (0.00093) [2022-07-10 00:31:19,465][25689] Fps is (10 sec: 5798.0, 60 sec: 5650.4, 300 sec: 5644.3). Total num frames: 500227072. Throughput: 0: 5840.2. Samples: 500233074. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:19,465][25689] Avg episode reward: [(0, '-43.971')] [2022-07-10 00:31:21,279][26022] Updated weights on worker 0-0, policy_version 488513 (0.00085) [2022-07-10 00:31:22,835][26022] Updated weights on worker 0-0, policy_version 488523 (0.00084) [2022-07-10 00:31:24,580][25689] Fps is (10 sec: 5548.4, 60 sec: 5629.6, 300 sec: 5646.9). Total num frames: 500255744. Throughput: 0: 5085.2. Samples: 500249966. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:24,581][25689] Avg episode reward: [(0, '-42.966')] [2022-07-10 00:31:24,872][26022] Updated weights on worker 0-0, policy_version 488533 (0.00094) [2022-07-10 00:31:26,588][26022] Updated weights on worker 0-0, policy_version 488543 (0.00093) [2022-07-10 00:31:28,493][26022] Updated weights on worker 0-0, policy_version 488553 (0.00094) [2022-07-10 00:31:29,639][25689] Fps is (10 sec: 5736.8, 60 sec: 5643.8, 300 sec: 5653.1). Total num frames: 500285440. Throughput: 0: 5918.7. Samples: 500284148. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:29,639][25689] Avg episode reward: [(0, '-43.584')] [2022-07-10 00:31:30,141][26022] Updated weights on worker 0-0, policy_version 488563 (0.00084) [2022-07-10 00:31:31,923][26022] Updated weights on worker 0-0, policy_version 488573 (0.00215) [2022-07-10 00:31:33,806][26022] Updated weights on worker 0-0, policy_version 488583 (0.00747) [2022-07-10 00:31:34,671][25689] Fps is (10 sec: 5784.0, 60 sec: 5660.2, 300 sec: 5652.8). Total num frames: 500314112. Throughput: 0: 5928.3. Samples: 500318782. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:34,672][25689] Avg episode reward: [(0, '-44.238')] [2022-07-10 00:31:35,594][26022] Updated weights on worker 0-0, policy_version 488593 (0.00089) [2022-07-10 00:31:37,238][26022] Updated weights on worker 0-0, policy_version 488603 (0.00089) [2022-07-10 00:31:39,192][26022] Updated weights on worker 0-0, policy_version 488613 (0.00089) [2022-07-10 00:31:39,699][25689] Fps is (10 sec: 5598.6, 60 sec: 5643.3, 300 sec: 5650.0). Total num frames: 500341760. Throughput: 0: 5094.9. Samples: 500336164. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:39,700][25689] Avg episode reward: [(0, '-44.784')] [2022-07-10 00:31:40,748][26022] Updated weights on worker 0-0, policy_version 488623 (0.00695) [2022-07-10 00:31:42,906][26022] Updated weights on worker 0-0, policy_version 488633 (0.00105) [2022-07-10 00:31:44,499][26022] Updated weights on worker 0-0, policy_version 488643 (0.00094) [2022-07-10 00:31:44,709][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:31:44,726][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000488645_500372480.pth [2022-07-10 00:31:44,727][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000486657_498336768.pth [2022-07-10 00:31:44,833][25689] Fps is (10 sec: 5744.2, 60 sec: 5673.1, 300 sec: 5651.1). Total num frames: 500372480. Throughput: 0: 5952.1. Samples: 500370510. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:44,833][25689] Avg episode reward: [(0, '-44.895')] [2022-07-10 00:31:46,303][26022] Updated weights on worker 0-0, policy_version 488653 (0.00090) [2022-07-10 00:31:47,919][26022] Updated weights on worker 0-0, policy_version 488663 (0.00090) [2022-07-10 00:31:49,791][26022] Updated weights on worker 0-0, policy_version 488673 (0.00088) [2022-07-10 00:31:49,855][25689] Fps is (10 sec: 5847.9, 60 sec: 5675.2, 300 sec: 5650.8). Total num frames: 500401152. Throughput: 0: 5951.7. Samples: 500404466. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:49,855][25689] Avg episode reward: [(0, '-45.466')] [2022-07-10 00:31:51,701][26022] Updated weights on worker 0-0, policy_version 488683 (0.00104) [2022-07-10 00:31:53,488][26022] Updated weights on worker 0-0, policy_version 488693 (0.00348) [2022-07-10 00:31:54,947][25689] Fps is (10 sec: 5669.3, 60 sec: 5670.5, 300 sec: 5656.1). Total num frames: 500429824. Throughput: 0: 5905.4. Samples: 500438520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:54,949][25689] Avg episode reward: [(0, '-45.463')] [2022-07-10 00:31:55,366][26022] Updated weights on worker 0-0, policy_version 488703 (0.00084) [2022-07-10 00:31:57,092][26022] Updated weights on worker 0-0, policy_version 488713 (0.00104) [2022-07-10 00:31:58,896][26022] Updated weights on worker 0-0, policy_version 488723 (0.00092) [2022-07-10 00:31:59,967][25689] Fps is (10 sec: 5670.9, 60 sec: 5669.1, 300 sec: 5654.3). Total num frames: 500458496. Throughput: 0: 5898.3. Samples: 500455712. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:31:59,967][25689] Avg episode reward: [(0, '-45.540')] [2022-07-10 00:32:00,722][26022] Updated weights on worker 0-0, policy_version 488733 (0.00096) [2022-07-10 00:32:02,871][26022] Updated weights on worker 0-0, policy_version 488743 (0.00089) [2022-07-10 00:32:04,741][26022] Updated weights on worker 0-0, policy_version 488753 (0.00082) [2022-07-10 00:32:05,099][25689] Fps is (10 sec: 5447.0, 60 sec: 5648.0, 300 sec: 5652.0). Total num frames: 500485120. Throughput: 0: 5783.9. Samples: 500487730. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:05,099][25689] Avg episode reward: [(0, '-44.579')] [2022-07-10 00:32:06,514][26022] Updated weights on worker 0-0, policy_version 488763 (0.00082) [2022-07-10 00:32:08,319][26022] Updated weights on worker 0-0, policy_version 488773 (0.00090) [2022-07-10 00:32:10,140][25689] Fps is (10 sec: 5335.0, 60 sec: 5664.4, 300 sec: 5648.2). Total num frames: 500512768. Throughput: 0: 5802.3. Samples: 500522164. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:10,140][25689] Avg episode reward: [(0, '-43.913')] [2022-07-10 00:32:10,184][26022] Updated weights on worker 0-0, policy_version 488783 (0.00092) [2022-07-10 00:32:11,915][26022] Updated weights on worker 0-0, policy_version 488793 (0.00089) [2022-07-10 00:32:13,722][26022] Updated weights on worker 0-0, policy_version 488803 (0.00084) [2022-07-10 00:32:15,152][25689] Fps is (10 sec: 5704.1, 60 sec: 5649.3, 300 sec: 5651.8). Total num frames: 500542464. Throughput: 0: 4990.0. Samples: 500539342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:15,153][25689] Avg episode reward: [(0, '-43.730')] [2022-07-10 00:32:15,299][26022] Updated weights on worker 0-0, policy_version 488813 (0.00084) [2022-07-10 00:32:17,281][26022] Updated weights on worker 0-0, policy_version 488823 (0.00086) [2022-07-10 00:32:18,823][26022] Updated weights on worker 0-0, policy_version 488833 (0.00085) [2022-07-10 00:32:20,166][25689] Fps is (10 sec: 5821.6, 60 sec: 5668.2, 300 sec: 5653.8). Total num frames: 500571136. Throughput: 0: 5853.7. Samples: 500573950. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:20,166][25689] Avg episode reward: [(0, '-42.898')] [2022-07-10 00:32:20,714][26022] Updated weights on worker 0-0, policy_version 488843 (0.00062) [2022-07-10 00:32:22,536][26022] Updated weights on worker 0-0, policy_version 488853 (0.00087) [2022-07-10 00:32:24,366][26022] Updated weights on worker 0-0, policy_version 488863 (0.00084) [2022-07-10 00:32:25,243][25689] Fps is (10 sec: 5683.0, 60 sec: 5671.8, 300 sec: 5647.3). Total num frames: 500599808. Throughput: 0: 5982.2. Samples: 500608234. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:25,243][25689] Avg episode reward: [(0, '-42.520')] [2022-07-10 00:32:26,183][26022] Updated weights on worker 0-0, policy_version 488873 (0.00096) [2022-07-10 00:32:27,863][26022] Updated weights on worker 0-0, policy_version 488883 (0.00093) [2022-07-10 00:32:29,616][26022] Updated weights on worker 0-0, policy_version 488893 (0.00084) [2022-07-10 00:32:30,247][25689] Fps is (10 sec: 5789.7, 60 sec: 5676.9, 300 sec: 5655.0). Total num frames: 500629504. Throughput: 0: 5129.2. Samples: 500625300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:30,248][25689] Avg episode reward: [(0, '-43.490')] [2022-07-10 00:32:31,645][26022] Updated weights on worker 0-0, policy_version 488903 (0.00094) [2022-07-10 00:32:33,216][26022] Updated weights on worker 0-0, policy_version 488913 (0.00089) [2022-07-10 00:32:35,197][26022] Updated weights on worker 0-0, policy_version 488923 (0.00372) [2022-07-10 00:32:35,267][25689] Fps is (10 sec: 5822.7, 60 sec: 5678.0, 300 sec: 5651.5). Total num frames: 500658176. Throughput: 0: 5982.3. Samples: 500659674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:35,267][25689] Avg episode reward: [(0, '-43.402')] [2022-07-10 00:32:37,071][26022] Updated weights on worker 0-0, policy_version 488933 (0.00088) [2022-07-10 00:32:38,615][26022] Updated weights on worker 0-0, policy_version 488943 (0.00087) [2022-07-10 00:32:40,284][25689] Fps is (10 sec: 5611.3, 60 sec: 5679.0, 300 sec: 5648.7). Total num frames: 500685824. Throughput: 0: 5969.8. Samples: 500694052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:40,285][25689] Avg episode reward: [(0, '-44.600')] [2022-07-10 00:32:40,717][26022] Updated weights on worker 0-0, policy_version 488953 (0.00087) [2022-07-10 00:32:42,079][26022] Updated weights on worker 0-0, policy_version 488963 (0.00087) [2022-07-10 00:32:44,134][26022] Updated weights on worker 0-0, policy_version 488973 (0.00090) [2022-07-10 00:32:45,329][25689] Fps is (10 sec: 5699.0, 60 sec: 5670.4, 300 sec: 5655.8). Total num frames: 500715520. Throughput: 0: 5125.2. Samples: 500711180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:45,330][25689] Avg episode reward: [(0, '-45.094')] [2022-07-10 00:32:46,031][26022] Updated weights on worker 0-0, policy_version 488983 (0.00086) [2022-07-10 00:32:47,648][26022] Updated weights on worker 0-0, policy_version 488993 (0.00087) [2022-07-10 00:32:49,492][26022] Updated weights on worker 0-0, policy_version 489003 (0.00096) [2022-07-10 00:32:50,346][25689] Fps is (10 sec: 5698.9, 60 sec: 5654.0, 300 sec: 5649.5). Total num frames: 500743168. Throughput: 0: 5978.6. Samples: 500745464. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:50,347][25689] Avg episode reward: [(0, '-45.577')] [2022-07-10 00:32:51,273][26022] Updated weights on worker 0-0, policy_version 489013 (0.00082) [2022-07-10 00:32:53,152][26022] Updated weights on worker 0-0, policy_version 489023 (0.00088) [2022-07-10 00:32:54,916][26022] Updated weights on worker 0-0, policy_version 489033 (0.00087) [2022-07-10 00:32:55,349][25689] Fps is (10 sec: 5722.9, 60 sec: 5679.3, 300 sec: 5658.0). Total num frames: 500772864. Throughput: 0: 5978.0. Samples: 500779726. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:32:55,350][25689] Avg episode reward: [(0, '-44.986')] [2022-07-10 00:32:56,751][26022] Updated weights on worker 0-0, policy_version 489043 (0.00082) [2022-07-10 00:32:58,371][26022] Updated weights on worker 0-0, policy_version 489053 (0.00086) [2022-07-10 00:33:00,295][26022] Updated weights on worker 0-0, policy_version 489063 (0.00088) [2022-07-10 00:33:00,363][25689] Fps is (10 sec: 5725.0, 60 sec: 5662.9, 300 sec: 5659.1). Total num frames: 500800512. Throughput: 0: 5123.9. Samples: 500796934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:00,363][25689] Avg episode reward: [(0, '-45.382')] [2022-07-10 00:33:02,275][26022] Updated weights on worker 0-0, policy_version 489073 (0.00097) [2022-07-10 00:33:04,245][26022] Updated weights on worker 0-0, policy_version 489083 (0.00086) [2022-07-10 00:33:05,401][25689] Fps is (10 sec: 5500.9, 60 sec: 5688.7, 300 sec: 5658.4). Total num frames: 500828160. Throughput: 0: 5868.6. Samples: 500828976. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:05,402][25689] Avg episode reward: [(0, '-45.777')] [2022-07-10 00:33:05,978][26022] Updated weights on worker 0-0, policy_version 489093 (0.00085) [2022-07-10 00:33:07,687][26022] Updated weights on worker 0-0, policy_version 489103 (0.00089) [2022-07-10 00:33:09,579][26022] Updated weights on worker 0-0, policy_version 489113 (0.00085) [2022-07-10 00:33:10,495][25689] Fps is (10 sec: 5558.5, 60 sec: 5700.6, 300 sec: 5656.9). Total num frames: 500856832. Throughput: 0: 5861.4. Samples: 500863564. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:10,495][25689] Avg episode reward: [(0, '-45.804')] [2022-07-10 00:33:11,242][26022] Updated weights on worker 0-0, policy_version 489123 (0.00083) [2022-07-10 00:33:13,131][26022] Updated weights on worker 0-0, policy_version 489133 (0.00089) [2022-07-10 00:33:14,851][26022] Updated weights on worker 0-0, policy_version 489143 (0.00086) [2022-07-10 00:33:15,573][25689] Fps is (10 sec: 5536.8, 60 sec: 5660.6, 300 sec: 5645.4). Total num frames: 500884480. Throughput: 0: 4994.6. Samples: 500880738. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:15,575][25689] Avg episode reward: [(0, '-44.998')] [2022-07-10 00:33:16,667][26022] Updated weights on worker 0-0, policy_version 489153 (0.00083) [2022-07-10 00:33:18,535][26022] Updated weights on worker 0-0, policy_version 489163 (0.00093) [2022-07-10 00:33:20,259][26022] Updated weights on worker 0-0, policy_version 489173 (0.00090) [2022-07-10 00:33:20,602][25689] Fps is (10 sec: 5876.0, 60 sec: 5709.9, 300 sec: 5661.0). Total num frames: 500916224. Throughput: 0: 5838.7. Samples: 500915108. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:20,603][25689] Avg episode reward: [(0, '-45.298')] [2022-07-10 00:33:22,084][26022] Updated weights on worker 0-0, policy_version 489183 (0.00087) [2022-07-10 00:33:23,992][26022] Updated weights on worker 0-0, policy_version 489193 (0.00090) [2022-07-10 00:33:25,642][26022] Updated weights on worker 0-0, policy_version 489203 (0.00089) [2022-07-10 00:33:25,657][25689] Fps is (10 sec: 5890.1, 60 sec: 5695.1, 300 sec: 5660.0). Total num frames: 500943872. Throughput: 0: 5923.8. Samples: 500948964. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:25,657][25689] Avg episode reward: [(0, '-45.458')] [2022-07-10 00:33:27,651][26022] Updated weights on worker 0-0, policy_version 489213 (0.00086) [2022-07-10 00:33:29,263][26022] Updated weights on worker 0-0, policy_version 489223 (0.00088) [2022-07-10 00:33:30,667][25689] Fps is (10 sec: 5392.4, 60 sec: 5643.7, 300 sec: 5650.3). Total num frames: 500970496. Throughput: 0: 5076.5. Samples: 500965968. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:30,667][25689] Avg episode reward: [(0, '-44.490')] [2022-07-10 00:33:31,273][26022] Updated weights on worker 0-0, policy_version 489233 (0.00102) [2022-07-10 00:33:32,898][26022] Updated weights on worker 0-0, policy_version 489243 (0.00076) [2022-07-10 00:33:34,858][26022] Updated weights on worker 0-0, policy_version 489253 (0.00091) [2022-07-10 00:33:35,707][25689] Fps is (10 sec: 5603.9, 60 sec: 5658.8, 300 sec: 5653.3). Total num frames: 501000192. Throughput: 0: 5924.5. Samples: 501000020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:33:35,707][25689] Avg episode reward: [(0, '-44.244')] [2022-07-10 00:33:36,580][26022] Updated weights on worker 0-0, policy_version 489263 (0.00088) [2022-07-10 00:33:38,599][26022] Updated weights on worker 0-0, policy_version 489273 (0.00089) [2022-07-10 00:33:40,308][26022] Updated weights on worker 0-0, policy_version 489283 (0.00086) [2022-07-10 00:33:40,737][25689] Fps is (10 sec: 5592.7, 60 sec: 5640.6, 300 sec: 5648.6). Total num frames: 501026816. Throughput: 0: 5877.5. Samples: 501033450. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:33:40,739][25689] Avg episode reward: [(0, '-45.183')] [2022-07-10 00:33:42,186][26022] Updated weights on worker 0-0, policy_version 489293 (0.00093) [2022-07-10 00:33:43,975][26022] Updated weights on worker 0-0, policy_version 489303 (0.00088) [2022-07-10 00:33:44,935][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:33:44,949][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000489308_501051392.pth [2022-07-10 00:33:44,949][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000487320_499015680.pth [2022-07-10 00:33:45,783][25689] Fps is (10 sec: 5589.2, 60 sec: 5640.5, 300 sec: 5651.7). Total num frames: 501056512. Throughput: 0: 5041.5. Samples: 501050434. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:33:45,784][25689] Avg episode reward: [(0, '-45.440')] [2022-07-10 00:33:45,790][26022] Updated weights on worker 0-0, policy_version 489313 (0.00087) [2022-07-10 00:33:47,529][26022] Updated weights on worker 0-0, policy_version 489323 (0.00087) [2022-07-10 00:33:49,378][26022] Updated weights on worker 0-0, policy_version 489333 (0.00085) [2022-07-10 00:33:50,799][25689] Fps is (10 sec: 5801.0, 60 sec: 5657.6, 300 sec: 5651.7). Total num frames: 501085184. Throughput: 0: 5914.0. Samples: 501085028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:33:50,799][25689] Avg episode reward: [(0, '-45.220')] [2022-07-10 00:33:51,191][26022] Updated weights on worker 0-0, policy_version 489343 (0.00086) [2022-07-10 00:33:53,122][26022] Updated weights on worker 0-0, policy_version 489353 (0.00087) [2022-07-10 00:33:54,782][26022] Updated weights on worker 0-0, policy_version 489363 (0.00097) [2022-07-10 00:33:55,817][25689] Fps is (10 sec: 5613.2, 60 sec: 5622.3, 300 sec: 5649.7). Total num frames: 501112832. Throughput: 0: 5887.8. Samples: 501118424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:33:55,817][25689] Avg episode reward: [(0, '-44.955')] [2022-07-10 00:33:56,783][26022] Updated weights on worker 0-0, policy_version 489373 (0.00086) [2022-07-10 00:33:58,341][26022] Updated weights on worker 0-0, policy_version 489383 (0.00090) [2022-07-10 00:34:00,218][26022] Updated weights on worker 0-0, policy_version 489393 (0.00090) [2022-07-10 00:34:00,843][25689] Fps is (10 sec: 5811.2, 60 sec: 5672.0, 300 sec: 5664.2). Total num frames: 501143552. Throughput: 0: 5081.8. Samples: 501135622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:00,843][25689] Avg episode reward: [(0, '-45.348')] [2022-07-10 00:34:02,367][26022] Updated weights on worker 0-0, policy_version 489403 (0.00088) [2022-07-10 00:34:04,090][26022] Updated weights on worker 0-0, policy_version 489413 (0.00092) [2022-07-10 00:34:05,980][25689] Fps is (10 sec: 5441.0, 60 sec: 5612.0, 300 sec: 5644.7). Total num frames: 501168128. Throughput: 0: 5803.1. Samples: 501167636. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:05,980][25689] Avg episode reward: [(0, '-45.481')] [2022-07-10 00:34:06,001][26022] Updated weights on worker 0-0, policy_version 489423 (0.00086) [2022-07-10 00:34:07,851][26022] Updated weights on worker 0-0, policy_version 489433 (0.00088) [2022-07-10 00:34:09,501][26022] Updated weights on worker 0-0, policy_version 489443 (0.00088) [2022-07-10 00:34:11,038][25689] Fps is (10 sec: 5222.6, 60 sec: 5615.3, 300 sec: 5650.9). Total num frames: 501196800. Throughput: 0: 5763.0. Samples: 501201668. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:11,039][25689] Avg episode reward: [(0, '-44.048')] [2022-07-10 00:34:11,498][26022] Updated weights on worker 0-0, policy_version 489453 (0.00078) [2022-07-10 00:34:13,052][26022] Updated weights on worker 0-0, policy_version 489463 (0.00085) [2022-07-10 00:34:15,058][26022] Updated weights on worker 0-0, policy_version 489473 (0.00084) [2022-07-10 00:34:16,068][25689] Fps is (10 sec: 5887.0, 60 sec: 5670.5, 300 sec: 5661.2). Total num frames: 501227520. Throughput: 0: 4959.0. Samples: 501218852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:16,069][25689] Avg episode reward: [(0, '-43.572')] [2022-07-10 00:34:16,608][26022] Updated weights on worker 0-0, policy_version 489483 (0.00083) [2022-07-10 00:34:18,657][26022] Updated weights on worker 0-0, policy_version 489493 (0.00087) [2022-07-10 00:34:20,277][26022] Updated weights on worker 0-0, policy_version 489503 (0.00108) [2022-07-10 00:34:21,106][25689] Fps is (10 sec: 5696.0, 60 sec: 5585.1, 300 sec: 5648.7). Total num frames: 501254144. Throughput: 0: 5823.2. Samples: 501253618. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:21,106][25689] Avg episode reward: [(0, '-43.756')] [2022-07-10 00:34:22,143][26022] Updated weights on worker 0-0, policy_version 489513 (0.00087) [2022-07-10 00:34:23,958][26022] Updated weights on worker 0-0, policy_version 489523 (0.00083) [2022-07-10 00:34:25,519][26022] Updated weights on worker 0-0, policy_version 489533 (0.00093) [2022-07-10 00:34:26,178][25689] Fps is (10 sec: 5570.9, 60 sec: 5617.3, 300 sec: 5651.4). Total num frames: 501283840. Throughput: 0: 5943.4. Samples: 501287684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:26,179][25689] Avg episode reward: [(0, '-44.589')] [2022-07-10 00:34:27,499][26022] Updated weights on worker 0-0, policy_version 489543 (0.00089) [2022-07-10 00:34:29,347][26022] Updated weights on worker 0-0, policy_version 489553 (0.00083) [2022-07-10 00:34:31,102][26022] Updated weights on worker 0-0, policy_version 489563 (0.00085) [2022-07-10 00:34:31,228][25689] Fps is (10 sec: 5766.2, 60 sec: 5647.4, 300 sec: 5654.3). Total num frames: 501312512. Throughput: 0: 5951.2. Samples: 501321822. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:31,229][25689] Avg episode reward: [(0, '-44.597')] [2022-07-10 00:34:32,943][26022] Updated weights on worker 0-0, policy_version 489573 (0.00086) [2022-07-10 00:34:34,429][26022] Updated weights on worker 0-0, policy_version 489583 (0.00096) [2022-07-10 00:34:36,258][25689] Fps is (10 sec: 5689.1, 60 sec: 5631.5, 300 sec: 5650.4). Total num frames: 501341184. Throughput: 0: 5966.2. Samples: 501339306. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:36,258][25689] Avg episode reward: [(0, '-44.195')] [2022-07-10 00:34:36,486][26022] Updated weights on worker 0-0, policy_version 489593 (0.00085) [2022-07-10 00:34:38,336][26022] Updated weights on worker 0-0, policy_version 489603 (0.00108) [2022-07-10 00:34:40,101][26022] Updated weights on worker 0-0, policy_version 489613 (0.00087) [2022-07-10 00:34:41,291][25689] Fps is (10 sec: 5698.8, 60 sec: 5665.1, 300 sec: 5647.7). Total num frames: 501369856. Throughput: 0: 5931.3. Samples: 501373340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:41,291][25689] Avg episode reward: [(0, '-44.795')] [2022-07-10 00:34:41,916][26022] Updated weights on worker 0-0, policy_version 489623 (0.00091) [2022-07-10 00:34:43,551][26022] Updated weights on worker 0-0, policy_version 489633 (0.00097) [2022-07-10 00:34:45,734][26022] Updated weights on worker 0-0, policy_version 489643 (0.00089) [2022-07-10 00:34:46,381][25689] Fps is (10 sec: 5866.6, 60 sec: 5677.8, 300 sec: 5661.3). Total num frames: 501400576. Throughput: 0: 5929.4. Samples: 501407478. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:46,382][25689] Avg episode reward: [(0, '-44.456')] [2022-07-10 00:34:47,095][26022] Updated weights on worker 0-0, policy_version 489653 (0.00080) [2022-07-10 00:34:49,095][26022] Updated weights on worker 0-0, policy_version 489663 (0.00085) [2022-07-10 00:34:50,904][26022] Updated weights on worker 0-0, policy_version 489673 (0.00080) [2022-07-10 00:34:51,401][25689] Fps is (10 sec: 5570.3, 60 sec: 5626.7, 300 sec: 5647.3). Total num frames: 501426176. Throughput: 0: 5105.6. Samples: 501424818. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:51,402][25689] Avg episode reward: [(0, '-44.957')] [2022-07-10 00:34:52,587][26022] Updated weights on worker 0-0, policy_version 489683 (0.00088) [2022-07-10 00:34:54,638][26022] Updated weights on worker 0-0, policy_version 489693 (0.00089) [2022-07-10 00:34:56,309][26022] Updated weights on worker 0-0, policy_version 489703 (0.00096) [2022-07-10 00:34:56,423][25689] Fps is (10 sec: 5506.5, 60 sec: 5660.1, 300 sec: 5654.1). Total num frames: 501455872. Throughput: 0: 5917.7. Samples: 501458640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:34:56,424][25689] Avg episode reward: [(0, '-44.726')] [2022-07-10 00:34:58,127][26022] Updated weights on worker 0-0, policy_version 489713 (0.00086) [2022-07-10 00:34:59,868][26022] Updated weights on worker 0-0, policy_version 489723 (0.00093) [2022-07-10 00:35:01,438][25689] Fps is (10 sec: 5815.2, 60 sec: 5627.3, 300 sec: 5661.7). Total num frames: 501484544. Throughput: 0: 5935.6. Samples: 501492930. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:01,439][25689] Avg episode reward: [(0, '-43.718')] [2022-07-10 00:35:01,898][26022] Updated weights on worker 0-0, policy_version 489733 (0.00098) [2022-07-10 00:35:03,943][26022] Updated weights on worker 0-0, policy_version 489743 (0.00088) [2022-07-10 00:35:05,668][26022] Updated weights on worker 0-0, policy_version 489753 (0.00094) [2022-07-10 00:35:06,554][25689] Fps is (10 sec: 5458.5, 60 sec: 5663.2, 300 sec: 5652.7). Total num frames: 501511168. Throughput: 0: 4971.9. Samples: 501507772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:06,556][25689] Avg episode reward: [(0, '-44.293')] [2022-07-10 00:35:07,592][26022] Updated weights on worker 0-0, policy_version 489763 (0.00095) [2022-07-10 00:35:09,324][26022] Updated weights on worker 0-0, policy_version 489773 (0.00084) [2022-07-10 00:35:11,057][26022] Updated weights on worker 0-0, policy_version 489783 (0.00086) [2022-07-10 00:35:11,559][25689] Fps is (10 sec: 5565.1, 60 sec: 5685.1, 300 sec: 5659.6). Total num frames: 501540864. Throughput: 0: 5804.5. Samples: 501541822. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:11,560][25689] Avg episode reward: [(0, '-44.150')] [2022-07-10 00:35:13,097][26022] Updated weights on worker 0-0, policy_version 489793 (0.00087) [2022-07-10 00:35:14,538][26022] Updated weights on worker 0-0, policy_version 489803 (0.00091) [2022-07-10 00:35:16,507][26022] Updated weights on worker 0-0, policy_version 489813 (0.00115) [2022-07-10 00:35:16,582][25689] Fps is (10 sec: 5718.0, 60 sec: 5634.9, 300 sec: 5655.9). Total num frames: 501568512. Throughput: 0: 5838.2. Samples: 501576332. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:16,591][25689] Avg episode reward: [(0, '-43.052')] [2022-07-10 00:35:18,197][26022] Updated weights on worker 0-0, policy_version 489823 (0.00082) [2022-07-10 00:35:20,120][26022] Updated weights on worker 0-0, policy_version 489833 (0.00089) [2022-07-10 00:35:21,634][25689] Fps is (10 sec: 5691.6, 60 sec: 5684.4, 300 sec: 5656.3). Total num frames: 501598208. Throughput: 0: 4980.6. Samples: 501593514. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:21,635][25689] Avg episode reward: [(0, '-41.803')] [2022-07-10 00:35:21,913][26022] Updated weights on worker 0-0, policy_version 489843 (0.00088) [2022-07-10 00:35:23,712][26022] Updated weights on worker 0-0, policy_version 489853 (0.00094) [2022-07-10 00:35:25,351][26022] Updated weights on worker 0-0, policy_version 489863 (0.00093) [2022-07-10 00:35:26,684][25689] Fps is (10 sec: 5676.9, 60 sec: 5652.6, 300 sec: 5652.5). Total num frames: 501625856. Throughput: 0: 5960.7. Samples: 501627762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:26,684][25689] Avg episode reward: [(0, '-41.472')] [2022-07-10 00:35:27,441][26022] Updated weights on worker 0-0, policy_version 489873 (0.00090) [2022-07-10 00:35:28,901][26022] Updated weights on worker 0-0, policy_version 489883 (0.00094) [2022-07-10 00:35:30,876][26022] Updated weights on worker 0-0, policy_version 489893 (0.01128) [2022-07-10 00:35:31,735][25689] Fps is (10 sec: 5575.3, 60 sec: 5652.5, 300 sec: 5655.4). Total num frames: 501654528. Throughput: 0: 5962.4. Samples: 501662124. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:31,736][25689] Avg episode reward: [(0, '-41.171')] [2022-07-10 00:35:32,524][26022] Updated weights on worker 0-0, policy_version 489903 (0.00090) [2022-07-10 00:35:34,477][26022] Updated weights on worker 0-0, policy_version 489913 (0.00068) [2022-07-10 00:35:36,223][26022] Updated weights on worker 0-0, policy_version 489923 (0.00087) [2022-07-10 00:35:36,746][25689] Fps is (10 sec: 5902.6, 60 sec: 5688.1, 300 sec: 5662.7). Total num frames: 501685248. Throughput: 0: 5114.2. Samples: 501679452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:36,747][25689] Avg episode reward: [(0, '-40.808')] [2022-07-10 00:35:37,987][26022] Updated weights on worker 0-0, policy_version 489933 (0.00095) [2022-07-10 00:35:39,725][26022] Updated weights on worker 0-0, policy_version 489943 (0.01064) [2022-07-10 00:35:41,637][26022] Updated weights on worker 0-0, policy_version 489953 (0.00086) [2022-07-10 00:35:41,824][25689] Fps is (10 sec: 5785.8, 60 sec: 5667.0, 300 sec: 5659.5). Total num frames: 501712896. Throughput: 0: 5949.0. Samples: 501713624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:41,824][25689] Avg episode reward: [(0, '-40.894')] [2022-07-10 00:35:43,267][26022] Updated weights on worker 0-0, policy_version 489963 (0.00081) [2022-07-10 00:35:45,048][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:35:45,054][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000489972_501731328.pth [2022-07-10 00:35:45,055][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000487981_499692544.pth [2022-07-10 00:35:45,278][26022] Updated weights on worker 0-0, policy_version 489973 (0.00092) [2022-07-10 00:35:46,900][25689] Fps is (10 sec: 5647.1, 60 sec: 5651.4, 300 sec: 5662.3). Total num frames: 501742592. Throughput: 0: 5936.9. Samples: 501747786. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:46,901][25689] Avg episode reward: [(0, '-41.446')] [2022-07-10 00:35:46,901][26022] Updated weights on worker 0-0, policy_version 489983 (0.00084) [2022-07-10 00:35:48,819][26022] Updated weights on worker 0-0, policy_version 489993 (0.00085) [2022-07-10 00:35:50,452][26022] Updated weights on worker 0-0, policy_version 490003 (0.00113) [2022-07-10 00:35:51,920][25689] Fps is (10 sec: 5679.6, 60 sec: 5685.2, 300 sec: 5659.3). Total num frames: 501770240. Throughput: 0: 5100.5. Samples: 501765076. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:51,921][25689] Avg episode reward: [(0, '-41.627')] [2022-07-10 00:35:52,350][26022] Updated weights on worker 0-0, policy_version 490013 (0.00088) [2022-07-10 00:35:54,168][26022] Updated weights on worker 0-0, policy_version 490023 (0.00085) [2022-07-10 00:35:56,052][26022] Updated weights on worker 0-0, policy_version 490033 (0.00118) [2022-07-10 00:35:56,935][25689] Fps is (10 sec: 5510.7, 60 sec: 5652.1, 300 sec: 5655.6). Total num frames: 501797888. Throughput: 0: 5915.1. Samples: 501798872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:35:56,937][25689] Avg episode reward: [(0, '-41.381')] [2022-07-10 00:35:57,803][26022] Updated weights on worker 0-0, policy_version 490043 (0.00087) [2022-07-10 00:36:00,010][26022] Updated weights on worker 0-0, policy_version 490053 (0.00099) [2022-07-10 00:36:01,720][26022] Updated weights on worker 0-0, policy_version 490063 (0.00092) [2022-07-10 00:36:02,003][25689] Fps is (10 sec: 5484.3, 60 sec: 5630.2, 300 sec: 5656.0). Total num frames: 501825536. Throughput: 0: 5886.1. Samples: 501832400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:36:02,003][25689] Avg episode reward: [(0, '-41.374')] [2022-07-10 00:36:03,860][26022] Updated weights on worker 0-0, policy_version 490073 (0.00089) [2022-07-10 00:36:05,277][26022] Updated weights on worker 0-0, policy_version 490083 (0.00086) [2022-07-10 00:36:07,071][25689] Fps is (10 sec: 5556.4, 60 sec: 5668.5, 300 sec: 5662.3). Total num frames: 501854208. Throughput: 0: 4959.5. Samples: 501847820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:36:07,071][25689] Avg episode reward: [(0, '-42.606')] [2022-07-10 00:36:07,242][26022] Updated weights on worker 0-0, policy_version 490093 (0.00092) [2022-07-10 00:36:09,052][26022] Updated weights on worker 0-0, policy_version 490103 (0.00087) [2022-07-10 00:36:10,843][26022] Updated weights on worker 0-0, policy_version 490113 (0.00082) [2022-07-10 00:36:12,121][25689] Fps is (10 sec: 5566.0, 60 sec: 5630.4, 300 sec: 5651.6). Total num frames: 501881856. Throughput: 0: 5794.2. Samples: 501882126. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:36:12,122][25689] Avg episode reward: [(0, '-42.181')] [2022-07-10 00:36:12,667][26022] Updated weights on worker 0-0, policy_version 490123 (0.00087) [2022-07-10 00:36:14,419][26022] Updated weights on worker 0-0, policy_version 490133 (0.00095) [2022-07-10 00:36:16,187][26022] Updated weights on worker 0-0, policy_version 490143 (0.00088) [2022-07-10 00:36:17,147][25689] Fps is (10 sec: 5691.1, 60 sec: 5664.1, 300 sec: 5658.7). Total num frames: 501911552. Throughput: 0: 5823.9. Samples: 501916584. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:36:17,147][25689] Avg episode reward: [(0, '-42.313')] [2022-07-10 00:36:17,899][26022] Updated weights on worker 0-0, policy_version 490153 (0.00091) [2022-07-10 00:36:19,864][26022] Updated weights on worker 0-0, policy_version 490163 (0.00093) [2022-07-10 00:36:21,472][26022] Updated weights on worker 0-0, policy_version 490173 (0.00091) [2022-07-10 00:36:22,150][25689] Fps is (10 sec: 5820.3, 60 sec: 5651.7, 300 sec: 5660.8). Total num frames: 501940224. Throughput: 0: 5034.0. Samples: 501933820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:36:22,150][25689] Avg episode reward: [(0, '-43.007')] [2022-07-10 00:36:23,474][26022] Updated weights on worker 0-0, policy_version 490183 (0.00091) [2022-07-10 00:36:25,222][26022] Updated weights on worker 0-0, policy_version 490193 (0.00086) [2022-07-10 00:36:26,911][26022] Updated weights on worker 0-0, policy_version 490203 (0.00091) [2022-07-10 00:36:27,200][25689] Fps is (10 sec: 5805.9, 60 sec: 5685.5, 300 sec: 5661.0). Total num frames: 501969920. Throughput: 0: 5971.9. Samples: 501968030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 00:36:27,200][25689] Avg episode reward: [(0, '-43.402')] [2022-07-10 00:36:28,923][26022] Updated weights on worker 0-0, policy_version 490213 (0.00083) [2022-07-10 00:36:30,416][26022] Updated weights on worker 0-0, policy_version 490223 (0.00095) [2022-07-10 00:36:32,245][25689] Fps is (10 sec: 5680.2, 60 sec: 5669.2, 300 sec: 5657.3). Total num frames: 501997568. Throughput: 0: 5965.8. Samples: 502002182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:36:32,246][25689] Avg episode reward: [(0, '-42.541')] [2022-07-10 00:36:32,359][26022] Updated weights on worker 0-0, policy_version 490233 (0.00097) [2022-07-10 00:36:34,171][26022] Updated weights on worker 0-0, policy_version 490243 (0.00098) [2022-07-10 00:36:36,020][26022] Updated weights on worker 0-0, policy_version 490253 (0.00242) [2022-07-10 00:36:37,322][25689] Fps is (10 sec: 5463.1, 60 sec: 5612.3, 300 sec: 5656.3). Total num frames: 502025216. Throughput: 0: 5089.9. Samples: 502019272. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:36:37,322][25689] Avg episode reward: [(0, '-43.088')] [2022-07-10 00:36:37,784][26022] Updated weights on worker 0-0, policy_version 490263 (0.00097) [2022-07-10 00:36:39,515][26022] Updated weights on worker 0-0, policy_version 490273 (0.00086) [2022-07-10 00:36:41,362][26022] Updated weights on worker 0-0, policy_version 490283 (0.00086) [2022-07-10 00:36:42,341][25689] Fps is (10 sec: 5781.2, 60 sec: 5668.4, 300 sec: 5658.5). Total num frames: 502055936. Throughput: 0: 5946.7. Samples: 502053894. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:36:42,342][25689] Avg episode reward: [(0, '-43.219')] [2022-07-10 00:36:43,197][26022] Updated weights on worker 0-0, policy_version 490293 (0.00090) [2022-07-10 00:36:44,870][26022] Updated weights on worker 0-0, policy_version 490303 (0.00085) [2022-07-10 00:36:46,820][26022] Updated weights on worker 0-0, policy_version 490313 (0.00090) [2022-07-10 00:36:47,404][25689] Fps is (10 sec: 5890.9, 60 sec: 5652.9, 300 sec: 5657.7). Total num frames: 502084608. Throughput: 0: 5947.7. Samples: 502088196. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:36:47,404][25689] Avg episode reward: [(0, '-43.783')] [2022-07-10 00:36:48,498][26022] Updated weights on worker 0-0, policy_version 490323 (0.00081) [2022-07-10 00:36:50,278][26022] Updated weights on worker 0-0, policy_version 490333 (0.00088) [2022-07-10 00:36:52,120][26022] Updated weights on worker 0-0, policy_version 490343 (0.00083) [2022-07-10 00:36:52,410][25689] Fps is (10 sec: 5695.2, 60 sec: 5671.0, 300 sec: 5659.4). Total num frames: 502113280. Throughput: 0: 5121.8. Samples: 502105462. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:36:52,411][25689] Avg episode reward: [(0, '-44.448')] [2022-07-10 00:36:53,994][26022] Updated weights on worker 0-0, policy_version 490353 (0.00091) [2022-07-10 00:36:55,740][26022] Updated weights on worker 0-0, policy_version 490363 (0.00086) [2022-07-10 00:36:57,453][25689] Fps is (10 sec: 5604.2, 60 sec: 5668.4, 300 sec: 5655.5). Total num frames: 502140928. Throughput: 0: 5977.5. Samples: 502139608. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:36:57,454][25689] Avg episode reward: [(0, '-44.807')] [2022-07-10 00:36:57,495][26022] Updated weights on worker 0-0, policy_version 490373 (0.00090) [2022-07-10 00:36:59,279][26022] Updated weights on worker 0-0, policy_version 490383 (0.00093) [2022-07-10 00:37:00,909][26022] Updated weights on worker 0-0, policy_version 490393 (0.00093) [2022-07-10 00:37:02,499][25689] Fps is (10 sec: 5481.0, 60 sec: 5670.5, 300 sec: 5660.6). Total num frames: 502168576. Throughput: 0: 5958.2. Samples: 502173996. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:02,501][25689] Avg episode reward: [(0, '-45.827')] [2022-07-10 00:37:03,282][26022] Updated weights on worker 0-0, policy_version 490403 (0.00094) [2022-07-10 00:37:04,895][26022] Updated weights on worker 0-0, policy_version 490413 (0.00093) [2022-07-10 00:37:06,924][26022] Updated weights on worker 0-0, policy_version 490423 (0.00084) [2022-07-10 00:37:07,537][25689] Fps is (10 sec: 5483.5, 60 sec: 5656.3, 300 sec: 5660.6). Total num frames: 502196224. Throughput: 0: 5841.2. Samples: 502205800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:07,539][25689] Avg episode reward: [(0, '-45.312')] [2022-07-10 00:37:08,518][26022] Updated weights on worker 0-0, policy_version 490433 (0.00094) [2022-07-10 00:37:10,596][26022] Updated weights on worker 0-0, policy_version 490443 (0.00086) [2022-07-10 00:37:12,364][26022] Updated weights on worker 0-0, policy_version 490453 (0.00087) [2022-07-10 00:37:12,557][25689] Fps is (10 sec: 5599.4, 60 sec: 5676.1, 300 sec: 5657.0). Total num frames: 502224896. Throughput: 0: 5804.4. Samples: 502222402. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:12,559][25689] Avg episode reward: [(0, '-44.892')] [2022-07-10 00:37:14,043][26022] Updated weights on worker 0-0, policy_version 490463 (0.00095) [2022-07-10 00:37:15,876][26022] Updated weights on worker 0-0, policy_version 490473 (0.00095) [2022-07-10 00:37:17,570][25689] Fps is (10 sec: 5715.9, 60 sec: 5660.4, 300 sec: 5657.1). Total num frames: 502253568. Throughput: 0: 5808.3. Samples: 502256450. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:17,571][25689] Avg episode reward: [(0, '-45.404')] [2022-07-10 00:37:17,651][26022] Updated weights on worker 0-0, policy_version 490483 (0.00091) [2022-07-10 00:37:19,635][26022] Updated weights on worker 0-0, policy_version 490493 (0.00089) [2022-07-10 00:37:21,402][26022] Updated weights on worker 0-0, policy_version 490503 (0.00082) [2022-07-10 00:37:22,587][25689] Fps is (10 sec: 5615.5, 60 sec: 5642.1, 300 sec: 5654.8). Total num frames: 502281216. Throughput: 0: 5807.7. Samples: 502290660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:22,588][25689] Avg episode reward: [(0, '-45.335')] [2022-07-10 00:37:23,186][26022] Updated weights on worker 0-0, policy_version 490513 (0.00084) [2022-07-10 00:37:25,048][26022] Updated weights on worker 0-0, policy_version 490523 (0.00093) [2022-07-10 00:37:26,866][26022] Updated weights on worker 0-0, policy_version 490533 (0.00089) [2022-07-10 00:37:27,651][25689] Fps is (10 sec: 5688.2, 60 sec: 5640.8, 300 sec: 5653.6). Total num frames: 502310912. Throughput: 0: 5054.0. Samples: 502307454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:27,651][25689] Avg episode reward: [(0, '-45.701')] [2022-07-10 00:37:28,698][26022] Updated weights on worker 0-0, policy_version 490543 (0.00082) [2022-07-10 00:37:30,398][26022] Updated weights on worker 0-0, policy_version 490553 (0.00085) [2022-07-10 00:37:32,392][26022] Updated weights on worker 0-0, policy_version 490563 (0.00094) [2022-07-10 00:37:32,667][25689] Fps is (10 sec: 5587.0, 60 sec: 5626.6, 300 sec: 5646.8). Total num frames: 502337536. Throughput: 0: 5923.3. Samples: 502341520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:32,668][25689] Avg episode reward: [(0, '-45.293')] [2022-07-10 00:37:33,966][26022] Updated weights on worker 0-0, policy_version 490573 (0.00094) [2022-07-10 00:37:35,877][26022] Updated weights on worker 0-0, policy_version 490583 (0.00086) [2022-07-10 00:37:37,691][25689] Fps is (10 sec: 5507.6, 60 sec: 5648.5, 300 sec: 5650.1). Total num frames: 502366208. Throughput: 0: 5927.6. Samples: 502375720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:37,691][25689] Avg episode reward: [(0, '-45.928')] [2022-07-10 00:37:37,801][26022] Updated weights on worker 0-0, policy_version 490593 (0.00091) [2022-07-10 00:37:39,316][26022] Updated weights on worker 0-0, policy_version 490603 (0.00086) [2022-07-10 00:37:41,305][26022] Updated weights on worker 0-0, policy_version 490613 (0.00089) [2022-07-10 00:37:42,706][25689] Fps is (10 sec: 5916.4, 60 sec: 5648.9, 300 sec: 5654.1). Total num frames: 502396928. Throughput: 0: 5080.7. Samples: 502392880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:42,706][25689] Avg episode reward: [(0, '-45.520')] [2022-07-10 00:37:42,930][26022] Updated weights on worker 0-0, policy_version 490623 (0.00088) [2022-07-10 00:37:44,769][26022] Updated weights on worker 0-0, policy_version 490633 (0.00093) [2022-07-10 00:37:45,091][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:37:45,107][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000490636_502411264.pth [2022-07-10 00:37:45,107][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000488645_500372480.pth [2022-07-10 00:37:46,813][26022] Updated weights on worker 0-0, policy_version 490643 (0.00089) [2022-07-10 00:37:47,782][25689] Fps is (10 sec: 5885.2, 60 sec: 5647.5, 300 sec: 5656.4). Total num frames: 502425600. Throughput: 0: 5943.6. Samples: 502427110. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:47,783][25689] Avg episode reward: [(0, '-44.834')] [2022-07-10 00:37:48,244][26022] Updated weights on worker 0-0, policy_version 490653 (0.00088) [2022-07-10 00:37:50,355][26022] Updated weights on worker 0-0, policy_version 490663 (0.00083) [2022-07-10 00:37:52,039][26022] Updated weights on worker 0-0, policy_version 490673 (0.00082) [2022-07-10 00:37:52,793][25689] Fps is (10 sec: 5583.5, 60 sec: 5630.2, 300 sec: 5649.4). Total num frames: 502453248. Throughput: 0: 5972.8. Samples: 502461726. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:52,793][25689] Avg episode reward: [(0, '-44.010')] [2022-07-10 00:37:53,764][26022] Updated weights on worker 0-0, policy_version 490683 (0.00087) [2022-07-10 00:37:55,681][26022] Updated weights on worker 0-0, policy_version 490693 (0.00087) [2022-07-10 00:37:57,259][26022] Updated weights on worker 0-0, policy_version 490703 (0.00085) [2022-07-10 00:37:57,859][25689] Fps is (10 sec: 5690.6, 60 sec: 5661.9, 300 sec: 5655.3). Total num frames: 502482944. Throughput: 0: 5105.0. Samples: 502478682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:37:57,860][25689] Avg episode reward: [(0, '-43.193')] [2022-07-10 00:37:59,218][26022] Updated weights on worker 0-0, policy_version 490713 (0.00083) [2022-07-10 00:38:00,986][26022] Updated weights on worker 0-0, policy_version 490723 (0.00084) [2022-07-10 00:38:02,950][25689] Fps is (10 sec: 5443.9, 60 sec: 5623.8, 300 sec: 5647.4). Total num frames: 502508544. Throughput: 0: 5927.9. Samples: 502512886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:02,951][25689] Avg episode reward: [(0, '-42.570')] [2022-07-10 00:38:03,162][26022] Updated weights on worker 0-0, policy_version 490733 (0.00091) [2022-07-10 00:38:05,027][26022] Updated weights on worker 0-0, policy_version 490743 (0.00097) [2022-07-10 00:38:06,688][26022] Updated weights on worker 0-0, policy_version 490753 (0.00096) [2022-07-10 00:38:08,020][25689] Fps is (10 sec: 5341.3, 60 sec: 5637.8, 300 sec: 5647.9). Total num frames: 502537216. Throughput: 0: 5806.3. Samples: 502544616. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:08,021][25689] Avg episode reward: [(0, '-42.068')] [2022-07-10 00:38:08,583][26022] Updated weights on worker 0-0, policy_version 490763 (0.00091) [2022-07-10 00:38:10,285][26022] Updated weights on worker 0-0, policy_version 490773 (0.00083) [2022-07-10 00:38:12,287][26022] Updated weights on worker 0-0, policy_version 490783 (0.00085) [2022-07-10 00:38:13,038][25689] Fps is (10 sec: 5785.9, 60 sec: 5654.9, 300 sec: 5655.9). Total num frames: 502566912. Throughput: 0: 4934.4. Samples: 502561626. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:13,039][25689] Avg episode reward: [(0, '-42.858')] [2022-07-10 00:38:13,980][26022] Updated weights on worker 0-0, policy_version 490793 (0.00087) [2022-07-10 00:38:15,751][26022] Updated weights on worker 0-0, policy_version 490803 (0.00083) [2022-07-10 00:38:17,544][26022] Updated weights on worker 0-0, policy_version 490813 (0.00086) [2022-07-10 00:38:18,055][25689] Fps is (10 sec: 5714.3, 60 sec: 5637.5, 300 sec: 5642.4). Total num frames: 502594560. Throughput: 0: 5813.1. Samples: 502596084. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:18,056][25689] Avg episode reward: [(0, '-42.395')] [2022-07-10 00:38:19,551][26022] Updated weights on worker 0-0, policy_version 490823 (0.00093) [2022-07-10 00:38:21,155][26022] Updated weights on worker 0-0, policy_version 490833 (0.00094) [2022-07-10 00:38:23,002][26022] Updated weights on worker 0-0, policy_version 490843 (0.00093) [2022-07-10 00:38:23,120][25689] Fps is (10 sec: 5586.1, 60 sec: 5650.0, 300 sec: 5645.6). Total num frames: 502623232. Throughput: 0: 5812.7. Samples: 502630130. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:23,121][25689] Avg episode reward: [(0, '-42.267')] [2022-07-10 00:38:24,988][26022] Updated weights on worker 0-0, policy_version 490853 (0.00136) [2022-07-10 00:38:26,562][26022] Updated weights on worker 0-0, policy_version 490863 (0.00082) [2022-07-10 00:38:28,230][25689] Fps is (10 sec: 5636.1, 60 sec: 5628.9, 300 sec: 5650.6). Total num frames: 502651904. Throughput: 0: 5072.3. Samples: 502647126. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:28,232][25689] Avg episode reward: [(0, '-42.679')] [2022-07-10 00:38:28,565][26022] Updated weights on worker 0-0, policy_version 490873 (0.00092) [2022-07-10 00:38:30,304][26022] Updated weights on worker 0-0, policy_version 490883 (0.00090) [2022-07-10 00:38:32,066][26022] Updated weights on worker 0-0, policy_version 490893 (0.00085) [2022-07-10 00:38:33,282][25689] Fps is (10 sec: 5643.2, 60 sec: 5659.4, 300 sec: 5646.9). Total num frames: 502680576. Throughput: 0: 5886.8. Samples: 502680796. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:33,283][25689] Avg episode reward: [(0, '-43.632')] [2022-07-10 00:38:33,878][26022] Updated weights on worker 0-0, policy_version 490903 (0.00090) [2022-07-10 00:38:35,640][26022] Updated weights on worker 0-0, policy_version 490913 (0.00086) [2022-07-10 00:38:37,573][26022] Updated weights on worker 0-0, policy_version 490923 (0.00088) [2022-07-10 00:38:38,286][25689] Fps is (10 sec: 5804.1, 60 sec: 5678.1, 300 sec: 5657.7). Total num frames: 502710272. Throughput: 0: 5880.9. Samples: 502715058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:38,287][25689] Avg episode reward: [(0, '-43.067')] [2022-07-10 00:38:39,339][26022] Updated weights on worker 0-0, policy_version 490933 (0.00092) [2022-07-10 00:38:40,977][26022] Updated weights on worker 0-0, policy_version 490943 (0.00098) [2022-07-10 00:38:43,055][26022] Updated weights on worker 0-0, policy_version 490953 (0.00087) [2022-07-10 00:38:43,300][25689] Fps is (10 sec: 5724.2, 60 sec: 5627.5, 300 sec: 5651.5). Total num frames: 502737920. Throughput: 0: 5052.7. Samples: 502732090. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:43,300][25689] Avg episode reward: [(0, '-42.916')] [2022-07-10 00:38:44,561][26022] Updated weights on worker 0-0, policy_version 490963 (0.01155) [2022-07-10 00:38:46,540][26022] Updated weights on worker 0-0, policy_version 490973 (0.00085) [2022-07-10 00:38:48,395][25689] Fps is (10 sec: 5469.8, 60 sec: 5608.8, 300 sec: 5646.5). Total num frames: 502765568. Throughput: 0: 5899.2. Samples: 502766088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:48,396][25689] Avg episode reward: [(0, '-42.985')] [2022-07-10 00:38:48,579][26022] Updated weights on worker 0-0, policy_version 490983 (0.00086) [2022-07-10 00:38:50,047][26022] Updated weights on worker 0-0, policy_version 490993 (0.00095) [2022-07-10 00:38:52,119][26022] Updated weights on worker 0-0, policy_version 491003 (0.00083) [2022-07-10 00:38:53,398][25689] Fps is (10 sec: 5779.7, 60 sec: 5660.2, 300 sec: 5657.1). Total num frames: 502796288. Throughput: 0: 5925.5. Samples: 502799996. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:53,399][25689] Avg episode reward: [(0, '-43.177')] [2022-07-10 00:38:53,731][26022] Updated weights on worker 0-0, policy_version 491013 (0.00095) [2022-07-10 00:38:55,531][26022] Updated weights on worker 0-0, policy_version 491023 (0.00094) [2022-07-10 00:38:57,435][26022] Updated weights on worker 0-0, policy_version 491033 (0.00092) [2022-07-10 00:38:58,449][25689] Fps is (10 sec: 5806.0, 60 sec: 5627.9, 300 sec: 5646.3). Total num frames: 502823936. Throughput: 0: 5065.3. Samples: 502817186. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:38:58,449][25689] Avg episode reward: [(0, '-42.901')] [2022-07-10 00:38:59,104][26022] Updated weights on worker 0-0, policy_version 491043 (0.00088) [2022-07-10 00:39:00,971][26022] Updated weights on worker 0-0, policy_version 491053 (0.00086) [2022-07-10 00:39:03,141][26022] Updated weights on worker 0-0, policy_version 491063 (0.00094) [2022-07-10 00:39:03,471][25689] Fps is (10 sec: 5184.8, 60 sec: 5617.4, 300 sec: 5648.5). Total num frames: 502848512. Throughput: 0: 5907.7. Samples: 502851256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:39:03,472][25689] Avg episode reward: [(0, '-42.593')] [2022-07-10 00:39:04,838][26022] Updated weights on worker 0-0, policy_version 491073 (0.00097) [2022-07-10 00:39:06,715][26022] Updated weights on worker 0-0, policy_version 491083 (0.00083) [2022-07-10 00:39:08,453][26022] Updated weights on worker 0-0, policy_version 491093 (0.00087) [2022-07-10 00:39:08,565][25689] Fps is (10 sec: 5465.9, 60 sec: 5649.0, 300 sec: 5654.7). Total num frames: 502879232. Throughput: 0: 5832.8. Samples: 502883734. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:39:08,566][25689] Avg episode reward: [(0, '-42.997')] [2022-07-10 00:39:10,199][26022] Updated weights on worker 0-0, policy_version 491103 (0.00089) [2022-07-10 00:39:12,094][26022] Updated weights on worker 0-0, policy_version 491113 (0.00088) [2022-07-10 00:39:13,584][25689] Fps is (10 sec: 5872.5, 60 sec: 5631.9, 300 sec: 5648.0). Total num frames: 502907904. Throughput: 0: 4999.4. Samples: 502900914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:39:13,585][25689] Avg episode reward: [(0, '-43.482')] [2022-07-10 00:39:13,777][26022] Updated weights on worker 0-0, policy_version 491123 (0.00092) [2022-07-10 00:39:15,777][26022] Updated weights on worker 0-0, policy_version 491133 (0.00087) [2022-07-10 00:39:17,319][26022] Updated weights on worker 0-0, policy_version 491143 (0.00083) [2022-07-10 00:39:18,600][25689] Fps is (10 sec: 5612.6, 60 sec: 5632.1, 300 sec: 5651.9). Total num frames: 502935552. Throughput: 0: 5863.1. Samples: 502935334. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 00:39:18,600][25689] Avg episode reward: [(0, '-42.971')] [2022-07-10 00:39:19,274][26022] Updated weights on worker 0-0, policy_version 491153 (0.00085) [2022-07-10 00:39:21,015][26022] Updated weights on worker 0-0, policy_version 491163 (0.00096) [2022-07-10 00:39:22,724][26022] Updated weights on worker 0-0, policy_version 491173 (0.00080) [2022-07-10 00:39:23,614][25689] Fps is (10 sec: 5717.6, 60 sec: 5653.8, 300 sec: 5653.0). Total num frames: 502965248. Throughput: 0: 5880.2. Samples: 502969700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:23,615][25689] Avg episode reward: [(0, '-42.620')] [2022-07-10 00:39:24,463][26022] Updated weights on worker 0-0, policy_version 491183 (0.00084) [2022-07-10 00:39:26,413][26022] Updated weights on worker 0-0, policy_version 491193 (0.00088) [2022-07-10 00:39:28,232][26022] Updated weights on worker 0-0, policy_version 491203 (0.00090) [2022-07-10 00:39:28,721][25689] Fps is (10 sec: 5766.6, 60 sec: 5653.9, 300 sec: 5651.9). Total num frames: 502993920. Throughput: 0: 5110.5. Samples: 502986742. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:28,722][25689] Avg episode reward: [(0, '-43.261')] [2022-07-10 00:39:30,029][26022] Updated weights on worker 0-0, policy_version 491213 (0.00093) [2022-07-10 00:39:31,753][26022] Updated weights on worker 0-0, policy_version 491223 (0.00051) [2022-07-10 00:39:33,527][26022] Updated weights on worker 0-0, policy_version 491233 (0.00093) [2022-07-10 00:39:33,729][25689] Fps is (10 sec: 5770.5, 60 sec: 5675.1, 300 sec: 5655.8). Total num frames: 503023616. Throughput: 0: 5952.6. Samples: 503020824. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:33,729][25689] Avg episode reward: [(0, '-42.753')] [2022-07-10 00:39:35,477][26022] Updated weights on worker 0-0, policy_version 491243 (0.00087) [2022-07-10 00:39:37,138][26022] Updated weights on worker 0-0, policy_version 491253 (0.00085) [2022-07-10 00:39:38,784][25689] Fps is (10 sec: 5698.6, 60 sec: 5636.4, 300 sec: 5651.9). Total num frames: 503051264. Throughput: 0: 5932.7. Samples: 503055084. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:38,785][25689] Avg episode reward: [(0, '-42.730')] [2022-07-10 00:39:39,005][26022] Updated weights on worker 0-0, policy_version 491263 (0.00083) [2022-07-10 00:39:40,845][26022] Updated weights on worker 0-0, policy_version 491273 (0.00091) [2022-07-10 00:39:42,569][26022] Updated weights on worker 0-0, policy_version 491283 (0.00087) [2022-07-10 00:39:43,840][25689] Fps is (10 sec: 5570.1, 60 sec: 5649.4, 300 sec: 5645.7). Total num frames: 503079936. Throughput: 0: 5928.5. Samples: 503089610. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:43,840][25689] Avg episode reward: [(0, '-41.962')] [2022-07-10 00:39:44,419][26022] Updated weights on worker 0-0, policy_version 491293 (0.00090) [2022-07-10 00:39:45,212][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:39:45,229][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000491298_503089152.pth [2022-07-10 00:39:45,230][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000489308_501051392.pth [2022-07-10 00:39:46,380][26022] Updated weights on worker 0-0, policy_version 491303 (0.00082) [2022-07-10 00:39:47,871][26022] Updated weights on worker 0-0, policy_version 491313 (0.00090) [2022-07-10 00:39:48,908][25689] Fps is (10 sec: 5765.3, 60 sec: 5685.8, 300 sec: 5658.5). Total num frames: 503109632. Throughput: 0: 5938.6. Samples: 503106624. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:48,909][25689] Avg episode reward: [(0, '-41.698')] [2022-07-10 00:39:49,899][26022] Updated weights on worker 0-0, policy_version 491323 (0.00087) [2022-07-10 00:39:51,710][26022] Updated weights on worker 0-0, policy_version 491333 (0.00090) [2022-07-10 00:39:53,273][26022] Updated weights on worker 0-0, policy_version 491343 (0.00086) [2022-07-10 00:39:54,005][25689] Fps is (10 sec: 5741.8, 60 sec: 5643.2, 300 sec: 5653.7). Total num frames: 503138304. Throughput: 0: 5928.6. Samples: 503141036. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:54,006][25689] Avg episode reward: [(0, '-42.012')] [2022-07-10 00:39:55,285][26022] Updated weights on worker 0-0, policy_version 491353 (0.00087) [2022-07-10 00:39:56,835][26022] Updated weights on worker 0-0, policy_version 491363 (0.00086) [2022-07-10 00:39:58,740][26022] Updated weights on worker 0-0, policy_version 491373 (0.00084) [2022-07-10 00:39:59,063][25689] Fps is (10 sec: 5646.8, 60 sec: 5659.3, 300 sec: 5652.9). Total num frames: 503166976. Throughput: 0: 5923.1. Samples: 503175198. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:39:59,064][25689] Avg episode reward: [(0, '-42.243')] [2022-07-10 00:40:00,668][26022] Updated weights on worker 0-0, policy_version 491383 (0.00094) [2022-07-10 00:40:02,667][26022] Updated weights on worker 0-0, policy_version 491393 (0.00091) [2022-07-10 00:40:04,145][25689] Fps is (10 sec: 5453.3, 60 sec: 5687.6, 300 sec: 5653.5). Total num frames: 503193600. Throughput: 0: 4983.5. Samples: 503190802. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:04,146][25689] Avg episode reward: [(0, '-42.761')] [2022-07-10 00:40:04,667][26022] Updated weights on worker 0-0, policy_version 491403 (0.00090) [2022-07-10 00:40:06,364][26022] Updated weights on worker 0-0, policy_version 491413 (0.00092) [2022-07-10 00:40:08,040][26022] Updated weights on worker 0-0, policy_version 491423 (0.00081) [2022-07-10 00:40:09,215][25689] Fps is (10 sec: 5447.2, 60 sec: 5656.1, 300 sec: 5648.8). Total num frames: 503222272. Throughput: 0: 5813.1. Samples: 503224668. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:09,215][25689] Avg episode reward: [(0, '-43.883')] [2022-07-10 00:40:10,137][26022] Updated weights on worker 0-0, policy_version 491433 (0.00085) [2022-07-10 00:40:11,546][26022] Updated weights on worker 0-0, policy_version 491443 (0.00093) [2022-07-10 00:40:13,639][26022] Updated weights on worker 0-0, policy_version 491453 (0.00093) [2022-07-10 00:40:14,233][25689] Fps is (10 sec: 5684.7, 60 sec: 5656.2, 300 sec: 5652.4). Total num frames: 503250944. Throughput: 0: 5818.5. Samples: 503258728. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:14,234][25689] Avg episode reward: [(0, '-43.797')] [2022-07-10 00:40:15,330][26022] Updated weights on worker 0-0, policy_version 491463 (0.00098) [2022-07-10 00:40:17,229][26022] Updated weights on worker 0-0, policy_version 491473 (0.00105) [2022-07-10 00:40:18,952][26022] Updated weights on worker 0-0, policy_version 491483 (0.00087) [2022-07-10 00:40:19,247][25689] Fps is (10 sec: 5613.7, 60 sec: 5656.3, 300 sec: 5646.2). Total num frames: 503278592. Throughput: 0: 4985.7. Samples: 503275830. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:19,248][25689] Avg episode reward: [(0, '-43.999')] [2022-07-10 00:40:20,784][26022] Updated weights on worker 0-0, policy_version 491493 (0.00087) [2022-07-10 00:40:22,577][26022] Updated weights on worker 0-0, policy_version 491503 (0.00087) [2022-07-10 00:40:24,268][25689] Fps is (10 sec: 5714.2, 60 sec: 5655.7, 300 sec: 5653.6). Total num frames: 503308288. Throughput: 0: 5932.6. Samples: 503310182. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:24,268][25689] Avg episode reward: [(0, '-44.188')] [2022-07-10 00:40:24,345][26022] Updated weights on worker 0-0, policy_version 491513 (0.00087) [2022-07-10 00:40:26,052][26022] Updated weights on worker 0-0, policy_version 491523 (0.00093) [2022-07-10 00:40:27,982][26022] Updated weights on worker 0-0, policy_version 491533 (0.00094) [2022-07-10 00:40:29,332][25689] Fps is (10 sec: 5787.8, 60 sec: 5659.7, 300 sec: 5653.4). Total num frames: 503336960. Throughput: 0: 5957.2. Samples: 503344510. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:29,334][25689] Avg episode reward: [(0, '-43.548')] [2022-07-10 00:40:29,755][26022] Updated weights on worker 0-0, policy_version 491543 (0.00089) [2022-07-10 00:40:31,541][26022] Updated weights on worker 0-0, policy_version 491553 (0.00096) [2022-07-10 00:40:33,427][26022] Updated weights on worker 0-0, policy_version 491563 (0.00086) [2022-07-10 00:40:34,346][25689] Fps is (10 sec: 5690.0, 60 sec: 5642.2, 300 sec: 5646.4). Total num frames: 503365632. Throughput: 0: 5105.4. Samples: 503361416. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:34,348][25689] Avg episode reward: [(0, '-43.458')] [2022-07-10 00:40:35,116][26022] Updated weights on worker 0-0, policy_version 491573 (0.00091) [2022-07-10 00:40:37,054][26022] Updated weights on worker 0-0, policy_version 491583 (0.00085) [2022-07-10 00:40:38,871][26022] Updated weights on worker 0-0, policy_version 491593 (0.00090) [2022-07-10 00:40:39,387][25689] Fps is (10 sec: 5702.8, 60 sec: 5660.4, 300 sec: 5650.6). Total num frames: 503394304. Throughput: 0: 5941.1. Samples: 503395484. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:39,388][25689] Avg episode reward: [(0, '-43.475')] [2022-07-10 00:40:40,579][26022] Updated weights on worker 0-0, policy_version 491603 (0.00091) [2022-07-10 00:40:42,458][26022] Updated weights on worker 0-0, policy_version 491613 (0.00092) [2022-07-10 00:40:44,143][26022] Updated weights on worker 0-0, policy_version 491623 (0.00091) [2022-07-10 00:40:44,402][25689] Fps is (10 sec: 5702.4, 60 sec: 5664.3, 300 sec: 5648.3). Total num frames: 503422976. Throughput: 0: 5935.3. Samples: 503429684. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:44,404][25689] Avg episode reward: [(0, '-44.457')] [2022-07-10 00:40:45,961][26022] Updated weights on worker 0-0, policy_version 491633 (0.00092) [2022-07-10 00:40:47,902][26022] Updated weights on worker 0-0, policy_version 491643 (0.00084) [2022-07-10 00:40:49,488][25689] Fps is (10 sec: 5677.5, 60 sec: 5645.7, 300 sec: 5650.5). Total num frames: 503451648. Throughput: 0: 5065.9. Samples: 503446616. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:49,489][25689] Avg episode reward: [(0, '-44.777')] [2022-07-10 00:40:49,530][26022] Updated weights on worker 0-0, policy_version 491653 (0.00088) [2022-07-10 00:40:51,489][26022] Updated weights on worker 0-0, policy_version 491663 (0.00091) [2022-07-10 00:40:53,324][26022] Updated weights on worker 0-0, policy_version 491673 (0.00084) [2022-07-10 00:40:54,514][25689] Fps is (10 sec: 5468.4, 60 sec: 5618.5, 300 sec: 5646.8). Total num frames: 503478272. Throughput: 0: 5916.0. Samples: 503480730. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:54,515][25689] Avg episode reward: [(0, '-43.998')] [2022-07-10 00:40:55,026][26022] Updated weights on worker 0-0, policy_version 491683 (0.00088) [2022-07-10 00:40:57,251][26022] Updated weights on worker 0-0, policy_version 491693 (0.00093) [2022-07-10 00:40:58,565][26022] Updated weights on worker 0-0, policy_version 491703 (0.00085) [2022-07-10 00:40:59,548][25689] Fps is (10 sec: 5700.0, 60 sec: 5654.6, 300 sec: 5657.8). Total num frames: 503508992. Throughput: 0: 5891.0. Samples: 503514250. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:40:59,552][25689] Avg episode reward: [(0, '-43.377')] [2022-07-10 00:41:00,799][26022] Updated weights on worker 0-0, policy_version 491713 (0.00092) [2022-07-10 00:41:02,728][26022] Updated weights on worker 0-0, policy_version 491723 (0.00093) [2022-07-10 00:41:04,561][25689] Fps is (10 sec: 5503.8, 60 sec: 5627.1, 300 sec: 5645.1). Total num frames: 503533568. Throughput: 0: 4926.5. Samples: 503528996. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:04,562][25689] Avg episode reward: [(0, '-42.927')] [2022-07-10 00:41:04,566][26022] Updated weights on worker 0-0, policy_version 491733 (0.00083) [2022-07-10 00:41:06,563][26022] Updated weights on worker 0-0, policy_version 491743 (0.00095) [2022-07-10 00:41:08,222][26022] Updated weights on worker 0-0, policy_version 491753 (0.00084) [2022-07-10 00:41:09,619][25689] Fps is (10 sec: 5287.4, 60 sec: 5628.2, 300 sec: 5648.4). Total num frames: 503562240. Throughput: 0: 5786.5. Samples: 503563104. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:09,619][25689] Avg episode reward: [(0, '-42.597')] [2022-07-10 00:41:10,160][26022] Updated weights on worker 0-0, policy_version 491763 (0.00087) [2022-07-10 00:41:12,007][26022] Updated weights on worker 0-0, policy_version 491773 (0.00095) [2022-07-10 00:41:13,649][26022] Updated weights on worker 0-0, policy_version 491783 (0.00091) [2022-07-10 00:41:14,681][25689] Fps is (10 sec: 5767.4, 60 sec: 5641.0, 300 sec: 5647.7). Total num frames: 503591936. Throughput: 0: 5758.7. Samples: 503596868. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:14,682][25689] Avg episode reward: [(0, '-41.830')] [2022-07-10 00:41:15,582][26022] Updated weights on worker 0-0, policy_version 491793 (0.00092) [2022-07-10 00:41:17,269][26022] Updated weights on worker 0-0, policy_version 491803 (0.00087) [2022-07-10 00:41:19,283][26022] Updated weights on worker 0-0, policy_version 491813 (0.00078) [2022-07-10 00:41:19,704][25689] Fps is (10 sec: 5685.9, 60 sec: 5640.3, 300 sec: 5643.9). Total num frames: 503619584. Throughput: 0: 4948.3. Samples: 503613988. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:19,705][25689] Avg episode reward: [(0, '-41.776')] [2022-07-10 00:41:20,871][26022] Updated weights on worker 0-0, policy_version 491823 (0.00109) [2022-07-10 00:41:22,830][26022] Updated weights on worker 0-0, policy_version 491833 (0.00092) [2022-07-10 00:41:24,597][26022] Updated weights on worker 0-0, policy_version 491843 (0.00087) [2022-07-10 00:41:24,741][25689] Fps is (10 sec: 5598.9, 60 sec: 5621.8, 300 sec: 5640.7). Total num frames: 503648256. Throughput: 0: 5901.3. Samples: 503648082. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:24,741][25689] Avg episode reward: [(0, '-42.985')] [2022-07-10 00:41:26,328][26022] Updated weights on worker 0-0, policy_version 491853 (0.00085) [2022-07-10 00:41:28,059][26022] Updated weights on worker 0-0, policy_version 491863 (0.00087) [2022-07-10 00:41:29,857][25689] Fps is (10 sec: 5648.3, 60 sec: 5617.0, 300 sec: 5642.8). Total num frames: 503676928. Throughput: 0: 5892.8. Samples: 503682362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:29,857][25689] Avg episode reward: [(0, '-42.830')] [2022-07-10 00:41:29,971][26022] Updated weights on worker 0-0, policy_version 491873 (0.00098) [2022-07-10 00:41:31,623][26022] Updated weights on worker 0-0, policy_version 491883 (0.00092) [2022-07-10 00:41:33,584][26022] Updated weights on worker 0-0, policy_version 491893 (0.00085) [2022-07-10 00:41:34,927][25689] Fps is (10 sec: 5729.7, 60 sec: 5628.7, 300 sec: 5649.8). Total num frames: 503706624. Throughput: 0: 5060.8. Samples: 503699330. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:34,928][25689] Avg episode reward: [(0, '-44.676')] [2022-07-10 00:41:35,165][26022] Updated weights on worker 0-0, policy_version 491903 (0.00085) [2022-07-10 00:41:37,074][26022] Updated weights on worker 0-0, policy_version 491913 (0.00082) [2022-07-10 00:41:38,888][26022] Updated weights on worker 0-0, policy_version 491923 (0.00092) [2022-07-10 00:41:39,979][25689] Fps is (10 sec: 5664.9, 60 sec: 5610.8, 300 sec: 5638.9). Total num frames: 503734272. Throughput: 0: 5888.4. Samples: 503733376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:39,980][25689] Avg episode reward: [(0, '-45.089')] [2022-07-10 00:41:40,668][26022] Updated weights on worker 0-0, policy_version 491933 (0.00092) [2022-07-10 00:41:42,515][26022] Updated weights on worker 0-0, policy_version 491943 (0.00091) [2022-07-10 00:41:44,423][26022] Updated weights on worker 0-0, policy_version 491953 (0.00089) [2022-07-10 00:41:45,026][25689] Fps is (10 sec: 5576.8, 60 sec: 5607.8, 300 sec: 5639.1). Total num frames: 503762944. Throughput: 0: 5884.0. Samples: 503767444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:45,027][25689] Avg episode reward: [(0, '-46.310')] [2022-07-10 00:41:45,302][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:41:45,311][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000491958_503764992.pth [2022-07-10 00:41:45,312][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000489972_501731328.pth [2022-07-10 00:41:46,214][26022] Updated weights on worker 0-0, policy_version 491963 (0.00094) [2022-07-10 00:41:48,108][26022] Updated weights on worker 0-0, policy_version 491973 (0.00054) [2022-07-10 00:41:49,759][26022] Updated weights on worker 0-0, policy_version 491983 (0.00094) [2022-07-10 00:41:50,108][25689] Fps is (10 sec: 5661.7, 60 sec: 5608.2, 300 sec: 5637.7). Total num frames: 503791616. Throughput: 0: 5865.3. Samples: 503801142. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:50,108][25689] Avg episode reward: [(0, '-44.710')] [2022-07-10 00:41:51,628][26022] Updated weights on worker 0-0, policy_version 491993 (0.00088) [2022-07-10 00:41:53,435][26022] Updated weights on worker 0-0, policy_version 492003 (0.00090) [2022-07-10 00:41:55,074][26022] Updated weights on worker 0-0, policy_version 492013 (0.00085) [2022-07-10 00:41:55,115][25689] Fps is (10 sec: 5785.6, 60 sec: 5660.7, 300 sec: 5645.3). Total num frames: 503821312. Throughput: 0: 5898.8. Samples: 503818412. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:41:55,116][25689] Avg episode reward: [(0, '-44.332')] [2022-07-10 00:41:57,171][26022] Updated weights on worker 0-0, policy_version 492023 (0.00086) [2022-07-10 00:41:58,654][26022] Updated weights on worker 0-0, policy_version 492033 (0.00092) [2022-07-10 00:42:00,135][25689] Fps is (10 sec: 5616.9, 60 sec: 5594.4, 300 sec: 5642.3). Total num frames: 503847936. Throughput: 0: 5916.0. Samples: 503852616. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:42:00,136][25689] Avg episode reward: [(0, '-42.904')] [2022-07-10 00:42:00,771][26022] Updated weights on worker 0-0, policy_version 492043 (0.00095) [2022-07-10 00:42:02,769][26022] Updated weights on worker 0-0, policy_version 492053 (0.00087) [2022-07-10 00:42:04,583][26022] Updated weights on worker 0-0, policy_version 492063 (0.00089) [2022-07-10 00:42:05,142][25689] Fps is (10 sec: 5514.8, 60 sec: 5662.5, 300 sec: 5646.4). Total num frames: 503876608. Throughput: 0: 5815.4. Samples: 503884422. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:42:05,142][25689] Avg episode reward: [(0, '-42.917')] [2022-07-10 00:42:06,624][26022] Updated weights on worker 0-0, policy_version 492073 (0.00096) [2022-07-10 00:42:07,952][26022] Updated weights on worker 0-0, policy_version 492083 (0.00086) [2022-07-10 00:42:10,204][25689] Fps is (10 sec: 5390.0, 60 sec: 5611.4, 300 sec: 5635.2). Total num frames: 503902208. Throughput: 0: 5004.2. Samples: 503901706. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 00:42:10,205][25689] Avg episode reward: [(0, '-41.925')] [2022-07-10 00:42:10,219][26022] Updated weights on worker 0-0, policy_version 492093 (0.00083) [2022-07-10 00:42:11,568][26022] Updated weights on worker 0-0, policy_version 492103 (0.00083) [2022-07-10 00:42:13,677][26022] Updated weights on worker 0-0, policy_version 492113 (0.00089) [2022-07-10 00:42:15,215][25689] Fps is (10 sec: 5591.0, 60 sec: 5633.1, 300 sec: 5642.1). Total num frames: 503932928. Throughput: 0: 5835.3. Samples: 503935702. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:15,215][25689] Avg episode reward: [(0, '-42.468')] [2022-07-10 00:42:15,403][26022] Updated weights on worker 0-0, policy_version 492123 (0.00098) [2022-07-10 00:42:17,199][26022] Updated weights on worker 0-0, policy_version 492133 (0.00083) [2022-07-10 00:42:19,035][26022] Updated weights on worker 0-0, policy_version 492143 (0.00085) [2022-07-10 00:42:20,238][25689] Fps is (10 sec: 5816.7, 60 sec: 5633.0, 300 sec: 5642.0). Total num frames: 503960576. Throughput: 0: 5851.8. Samples: 503970258. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:20,239][25689] Avg episode reward: [(0, '-43.166')] [2022-07-10 00:42:20,765][26022] Updated weights on worker 0-0, policy_version 492153 (0.00102) [2022-07-10 00:42:22,476][26022] Updated weights on worker 0-0, policy_version 492163 (0.00093) [2022-07-10 00:42:24,465][26022] Updated weights on worker 0-0, policy_version 492173 (0.00092) [2022-07-10 00:42:25,251][25689] Fps is (10 sec: 5612.0, 60 sec: 5635.3, 300 sec: 5639.6). Total num frames: 503989248. Throughput: 0: 5116.6. Samples: 503987312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:25,251][25689] Avg episode reward: [(0, '-43.756')] [2022-07-10 00:42:25,976][26022] Updated weights on worker 0-0, policy_version 492183 (0.00103) [2022-07-10 00:42:28,042][26022] Updated weights on worker 0-0, policy_version 492193 (0.00086) [2022-07-10 00:42:29,548][26022] Updated weights on worker 0-0, policy_version 492203 (0.00100) [2022-07-10 00:42:30,327][25689] Fps is (10 sec: 5683.8, 60 sec: 5639.0, 300 sec: 5645.3). Total num frames: 504017920. Throughput: 0: 5950.9. Samples: 504021460. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:30,328][25689] Avg episode reward: [(0, '-44.507')] [2022-07-10 00:42:31,451][26022] Updated weights on worker 0-0, policy_version 492213 (0.00081) [2022-07-10 00:42:33,642][26022] Updated weights on worker 0-0, policy_version 492223 (0.00746) [2022-07-10 00:42:35,155][26022] Updated weights on worker 0-0, policy_version 492233 (0.00081) [2022-07-10 00:42:35,408][25689] Fps is (10 sec: 5746.6, 60 sec: 5638.1, 300 sec: 5647.7). Total num frames: 504047616. Throughput: 0: 5933.8. Samples: 504055522. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:35,408][25689] Avg episode reward: [(0, '-45.559')] [2022-07-10 00:42:37,059][26022] Updated weights on worker 0-0, policy_version 492243 (0.00074) [2022-07-10 00:42:38,715][26022] Updated weights on worker 0-0, policy_version 492253 (0.00095) [2022-07-10 00:42:40,460][25689] Fps is (10 sec: 5760.5, 60 sec: 5655.0, 300 sec: 5640.1). Total num frames: 504076288. Throughput: 0: 5060.9. Samples: 504072598. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:40,460][25689] Avg episode reward: [(0, '-43.897')] [2022-07-10 00:42:40,715][26022] Updated weights on worker 0-0, policy_version 492263 (0.00084) [2022-07-10 00:42:42,398][26022] Updated weights on worker 0-0, policy_version 492273 (0.00089) [2022-07-10 00:42:44,146][26022] Updated weights on worker 0-0, policy_version 492283 (0.00085) [2022-07-10 00:42:45,469][25689] Fps is (10 sec: 5801.3, 60 sec: 5675.5, 300 sec: 5644.8). Total num frames: 504105984. Throughput: 0: 5919.2. Samples: 504106988. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:45,471][25689] Avg episode reward: [(0, '-42.803')] [2022-07-10 00:42:45,862][26022] Updated weights on worker 0-0, policy_version 492293 (0.00082) [2022-07-10 00:42:47,638][26022] Updated weights on worker 0-0, policy_version 492303 (0.00092) [2022-07-10 00:42:49,567][26022] Updated weights on worker 0-0, policy_version 492313 (0.00055) [2022-07-10 00:42:50,540][25689] Fps is (10 sec: 5587.2, 60 sec: 5642.6, 300 sec: 5640.2). Total num frames: 504132608. Throughput: 0: 5934.0. Samples: 504141402. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:50,541][25689] Avg episode reward: [(0, '-42.614')] [2022-07-10 00:42:51,254][26022] Updated weights on worker 0-0, policy_version 492323 (0.00087) [2022-07-10 00:42:53,235][26022] Updated weights on worker 0-0, policy_version 492333 (0.00090) [2022-07-10 00:42:54,707][26022] Updated weights on worker 0-0, policy_version 492343 (0.00093) [2022-07-10 00:42:55,559][25689] Fps is (10 sec: 5581.5, 60 sec: 5641.4, 300 sec: 5641.1). Total num frames: 504162304. Throughput: 0: 5122.4. Samples: 504158748. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:42:55,560][25689] Avg episode reward: [(0, '-42.983')] [2022-07-10 00:42:56,703][26022] Updated weights on worker 0-0, policy_version 492353 (0.00088) [2022-07-10 00:42:58,312][26022] Updated weights on worker 0-0, policy_version 492363 (0.00115) [2022-07-10 00:43:00,158][26022] Updated weights on worker 0-0, policy_version 492373 (0.00095) [2022-07-10 00:43:00,567][25689] Fps is (10 sec: 5821.0, 60 sec: 5676.4, 300 sec: 5653.0). Total num frames: 504190976. Throughput: 0: 6013.2. Samples: 504193506. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:00,567][25689] Avg episode reward: [(0, '-42.480')] [2022-07-10 00:43:02,080][26022] Updated weights on worker 0-0, policy_version 492383 (0.00098) [2022-07-10 00:43:04,277][26022] Updated weights on worker 0-0, policy_version 492393 (0.00094) [2022-07-10 00:43:05,582][25689] Fps is (10 sec: 5619.4, 60 sec: 5658.8, 300 sec: 5650.6). Total num frames: 504218624. Throughput: 0: 5888.4. Samples: 504225422. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:05,582][25689] Avg episode reward: [(0, '-41.664')] [2022-07-10 00:43:06,032][26022] Updated weights on worker 0-0, policy_version 492403 (0.00086) [2022-07-10 00:43:07,871][26022] Updated weights on worker 0-0, policy_version 492413 (0.00098) [2022-07-10 00:43:09,421][26022] Updated weights on worker 0-0, policy_version 492423 (0.00084) [2022-07-10 00:43:10,725][25689] Fps is (10 sec: 5544.4, 60 sec: 5701.9, 300 sec: 5644.8). Total num frames: 504247296. Throughput: 0: 5011.3. Samples: 504242556. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:10,726][25689] Avg episode reward: [(0, '-41.891')] [2022-07-10 00:43:11,471][26022] Updated weights on worker 0-0, policy_version 492433 (0.00082) [2022-07-10 00:43:12,945][26022] Updated weights on worker 0-0, policy_version 492443 (0.00102) [2022-07-10 00:43:14,983][26022] Updated weights on worker 0-0, policy_version 492453 (0.00086) [2022-07-10 00:43:15,728][25689] Fps is (10 sec: 5651.8, 60 sec: 5668.9, 300 sec: 5648.5). Total num frames: 504275968. Throughput: 0: 5874.8. Samples: 504277234. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:15,728][25689] Avg episode reward: [(0, '-42.849')] [2022-07-10 00:43:16,615][26022] Updated weights on worker 0-0, policy_version 492463 (0.00062) [2022-07-10 00:43:18,527][26022] Updated weights on worker 0-0, policy_version 492473 (0.00106) [2022-07-10 00:43:20,259][26022] Updated weights on worker 0-0, policy_version 492483 (0.00081) [2022-07-10 00:43:20,736][25689] Fps is (10 sec: 5728.1, 60 sec: 5687.2, 300 sec: 5649.6). Total num frames: 504304640. Throughput: 0: 5854.4. Samples: 504311582. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:20,736][25689] Avg episode reward: [(0, '-42.620')] [2022-07-10 00:43:22,049][26022] Updated weights on worker 0-0, policy_version 492493 (0.00085) [2022-07-10 00:43:23,901][26022] Updated weights on worker 0-0, policy_version 492503 (0.00088) [2022-07-10 00:43:25,687][26022] Updated weights on worker 0-0, policy_version 492513 (0.00088) [2022-07-10 00:43:25,815][25689] Fps is (10 sec: 5786.5, 60 sec: 5697.9, 300 sec: 5653.6). Total num frames: 504334336. Throughput: 0: 5110.4. Samples: 504328822. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:25,816][25689] Avg episode reward: [(0, '-42.888')] [2022-07-10 00:43:27,561][26022] Updated weights on worker 0-0, policy_version 492523 (0.00090) [2022-07-10 00:43:29,151][26022] Updated weights on worker 0-0, policy_version 492533 (0.00096) [2022-07-10 00:43:30,941][25689] Fps is (10 sec: 5719.5, 60 sec: 5693.2, 300 sec: 5652.2). Total num frames: 504363008. Throughput: 0: 5958.9. Samples: 504363022. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:30,946][25689] Avg episode reward: [(0, '-42.899')] [2022-07-10 00:43:30,994][26022] Updated weights on worker 0-0, policy_version 492543 (0.00087) [2022-07-10 00:43:32,886][26022] Updated weights on worker 0-0, policy_version 492553 (0.00090) [2022-07-10 00:43:34,680][26022] Updated weights on worker 0-0, policy_version 492563 (0.00087) [2022-07-10 00:43:36,004][25689] Fps is (10 sec: 5628.1, 60 sec: 5678.0, 300 sec: 5647.7). Total num frames: 504391680. Throughput: 0: 5903.2. Samples: 504396926. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:36,004][25689] Avg episode reward: [(0, '-43.747')] [2022-07-10 00:43:36,389][26022] Updated weights on worker 0-0, policy_version 492573 (0.00487) [2022-07-10 00:43:38,255][26022] Updated weights on worker 0-0, policy_version 492583 (0.00090) [2022-07-10 00:43:40,080][26022] Updated weights on worker 0-0, policy_version 492593 (0.00090) [2022-07-10 00:43:41,019][25689] Fps is (10 sec: 5690.3, 60 sec: 5681.5, 300 sec: 5651.1). Total num frames: 504420352. Throughput: 0: 5053.9. Samples: 504414088. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:41,020][25689] Avg episode reward: [(0, '-43.620')] [2022-07-10 00:43:41,840][26022] Updated weights on worker 0-0, policy_version 492603 (0.00088) [2022-07-10 00:43:43,571][26022] Updated weights on worker 0-0, policy_version 492613 (0.00085) [2022-07-10 00:43:45,570][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:43:45,593][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000492623_504445952.pth [2022-07-10 00:43:45,594][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000490636_502411264.pth [2022-07-10 00:43:45,601][26022] Updated weights on worker 0-0, policy_version 492623 (0.00086) [2022-07-10 00:43:46,088][25689] Fps is (10 sec: 5686.5, 60 sec: 5658.9, 300 sec: 5655.0). Total num frames: 504449024. Throughput: 0: 5910.7. Samples: 504448650. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:46,089][25689] Avg episode reward: [(0, '-43.330')] [2022-07-10 00:43:47,068][26022] Updated weights on worker 0-0, policy_version 492633 (0.00095) [2022-07-10 00:43:48,967][26022] Updated weights on worker 0-0, policy_version 492643 (0.00089) [2022-07-10 00:43:50,749][26022] Updated weights on worker 0-0, policy_version 492653 (0.00099) [2022-07-10 00:43:51,159][25689] Fps is (10 sec: 5655.0, 60 sec: 5692.7, 300 sec: 5646.8). Total num frames: 504477696. Throughput: 0: 5950.3. Samples: 504483324. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:51,159][25689] Avg episode reward: [(0, '-43.853')] [2022-07-10 00:43:52,571][26022] Updated weights on worker 0-0, policy_version 492663 (0.00088) [2022-07-10 00:43:54,412][26022] Updated weights on worker 0-0, policy_version 492673 (0.00085) [2022-07-10 00:43:56,167][25689] Fps is (10 sec: 5791.0, 60 sec: 5693.8, 300 sec: 5654.5). Total num frames: 504507392. Throughput: 0: 5124.9. Samples: 504500260. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:43:56,167][25689] Avg episode reward: [(0, '-44.043')] [2022-07-10 00:43:56,171][26022] Updated weights on worker 0-0, policy_version 492683 (0.00114) [2022-07-10 00:43:57,913][26022] Updated weights on worker 0-0, policy_version 492693 (0.00093) [2022-07-10 00:43:59,611][26022] Updated weights on worker 0-0, policy_version 492703 (0.00093) [2022-07-10 00:44:01,197][25689] Fps is (10 sec: 5916.7, 60 sec: 5708.6, 300 sec: 5671.6). Total num frames: 504537088. Throughput: 0: 5982.9. Samples: 504534814. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:01,197][25689] Avg episode reward: [(0, '-42.847')] [2022-07-10 00:44:01,498][26022] Updated weights on worker 0-0, policy_version 492713 (0.00075) [2022-07-10 00:44:03,877][26022] Updated weights on worker 0-0, policy_version 492723 (0.00085) [2022-07-10 00:44:05,536][26022] Updated weights on worker 0-0, policy_version 492733 (0.00091) [2022-07-10 00:44:06,199][25689] Fps is (10 sec: 5512.1, 60 sec: 5676.0, 300 sec: 5656.1). Total num frames: 504562688. Throughput: 0: 5876.3. Samples: 504566828. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:06,199][25689] Avg episode reward: [(0, '-42.368')] [2022-07-10 00:44:07,335][26022] Updated weights on worker 0-0, policy_version 492743 (0.00495) [2022-07-10 00:44:09,014][26022] Updated weights on worker 0-0, policy_version 492753 (0.00107) [2022-07-10 00:44:11,013][26022] Updated weights on worker 0-0, policy_version 492763 (0.00088) [2022-07-10 00:44:11,245][25689] Fps is (10 sec: 5299.2, 60 sec: 5668.2, 300 sec: 5652.2). Total num frames: 504590336. Throughput: 0: 5014.0. Samples: 504584038. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:11,246][25689] Avg episode reward: [(0, '-42.123')] [2022-07-10 00:44:12,541][26022] Updated weights on worker 0-0, policy_version 492773 (0.00083) [2022-07-10 00:44:14,475][26022] Updated weights on worker 0-0, policy_version 492783 (0.00092) [2022-07-10 00:44:16,103][26022] Updated weights on worker 0-0, policy_version 492793 (0.00094) [2022-07-10 00:44:16,256][25689] Fps is (10 sec: 5701.6, 60 sec: 5684.3, 300 sec: 5659.1). Total num frames: 504620032. Throughput: 0: 5880.8. Samples: 504618402. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:16,257][25689] Avg episode reward: [(0, '-41.006')] [2022-07-10 00:44:18,164][26022] Updated weights on worker 0-0, policy_version 492803 (0.00089) [2022-07-10 00:44:19,697][26022] Updated weights on worker 0-0, policy_version 492813 (0.00082) [2022-07-10 00:44:21,267][25689] Fps is (10 sec: 5721.9, 60 sec: 5667.1, 300 sec: 5652.3). Total num frames: 504647680. Throughput: 0: 5865.6. Samples: 504652538. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:21,268][25689] Avg episode reward: [(0, '-40.570')] [2022-07-10 00:44:21,756][26022] Updated weights on worker 0-0, policy_version 492823 (0.00087) [2022-07-10 00:44:23,365][26022] Updated weights on worker 0-0, policy_version 492833 (0.00077) [2022-07-10 00:44:25,371][26022] Updated weights on worker 0-0, policy_version 492843 (0.00093) [2022-07-10 00:44:26,287][25689] Fps is (10 sec: 5615.0, 60 sec: 5655.8, 300 sec: 5654.0). Total num frames: 504676352. Throughput: 0: 5128.4. Samples: 504669846. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:26,287][25689] Avg episode reward: [(0, '-40.287')] [2022-07-10 00:44:26,947][26022] Updated weights on worker 0-0, policy_version 492853 (0.00085) [2022-07-10 00:44:28,955][26022] Updated weights on worker 0-0, policy_version 492863 (0.00085) [2022-07-10 00:44:30,617][26022] Updated weights on worker 0-0, policy_version 492873 (0.00087) [2022-07-10 00:44:31,383][25689] Fps is (10 sec: 5871.3, 60 sec: 5692.5, 300 sec: 5655.7). Total num frames: 504707072. Throughput: 0: 5964.6. Samples: 504704150. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:31,383][25689] Avg episode reward: [(0, '-40.762')] [2022-07-10 00:44:32,587][26022] Updated weights on worker 0-0, policy_version 492883 (0.00087) [2022-07-10 00:44:34,156][26022] Updated weights on worker 0-0, policy_version 492893 (0.00082) [2022-07-10 00:44:36,276][26022] Updated weights on worker 0-0, policy_version 492903 (0.00093) [2022-07-10 00:44:36,435][25689] Fps is (10 sec: 5650.6, 60 sec: 5659.6, 300 sec: 5652.4). Total num frames: 504733696. Throughput: 0: 5940.0. Samples: 504738262. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:36,436][25689] Avg episode reward: [(0, '-42.388')] [2022-07-10 00:44:37,699][26022] Updated weights on worker 0-0, policy_version 492913 (0.00081) [2022-07-10 00:44:39,751][26022] Updated weights on worker 0-0, policy_version 492923 (0.00089) [2022-07-10 00:44:41,447][26022] Updated weights on worker 0-0, policy_version 492933 (0.01310) [2022-07-10 00:44:41,531][25689] Fps is (10 sec: 5549.6, 60 sec: 5668.9, 300 sec: 5655.0). Total num frames: 504763392. Throughput: 0: 5915.2. Samples: 504772404. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:41,532][25689] Avg episode reward: [(0, '-42.821')] [2022-07-10 00:44:43,254][26022] Updated weights on worker 0-0, policy_version 492943 (0.00315) [2022-07-10 00:44:45,027][26022] Updated weights on worker 0-0, policy_version 492953 (0.00086) [2022-07-10 00:44:46,579][25689] Fps is (10 sec: 5854.9, 60 sec: 5687.9, 300 sec: 5655.4). Total num frames: 504793088. Throughput: 0: 5895.3. Samples: 504789474. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:46,579][25689] Avg episode reward: [(0, '-42.769')] [2022-07-10 00:44:46,789][26022] Updated weights on worker 0-0, policy_version 492963 (0.00101) [2022-07-10 00:44:48,636][26022] Updated weights on worker 0-0, policy_version 492973 (0.00087) [2022-07-10 00:44:50,414][26022] Updated weights on worker 0-0, policy_version 492983 (0.00086) [2022-07-10 00:44:51,679][25689] Fps is (10 sec: 5752.0, 60 sec: 5685.2, 300 sec: 5655.4). Total num frames: 504821760. Throughput: 0: 5906.3. Samples: 504824024. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:51,679][25689] Avg episode reward: [(0, '-43.610')] [2022-07-10 00:44:52,061][26022] Updated weights on worker 0-0, policy_version 492993 (0.00084) [2022-07-10 00:44:54,077][26022] Updated weights on worker 0-0, policy_version 493003 (0.00086) [2022-07-10 00:44:55,782][26022] Updated weights on worker 0-0, policy_version 493013 (0.00093) [2022-07-10 00:44:56,701][25689] Fps is (10 sec: 5665.4, 60 sec: 5666.9, 300 sec: 5656.1). Total num frames: 504850432. Throughput: 0: 5923.8. Samples: 504858312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:44:56,701][25689] Avg episode reward: [(0, '-43.288')] [2022-07-10 00:44:57,575][26022] Updated weights on worker 0-0, policy_version 493023 (0.00088) [2022-07-10 00:44:59,263][26022] Updated weights on worker 0-0, policy_version 493033 (0.00090) [2022-07-10 00:45:01,114][26022] Updated weights on worker 0-0, policy_version 493043 (0.00087) [2022-07-10 00:45:01,723][25689] Fps is (10 sec: 5505.3, 60 sec: 5616.9, 300 sec: 5657.2). Total num frames: 504877056. Throughput: 0: 5112.1. Samples: 504875624. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 00:45:01,723][25689] Avg episode reward: [(0, '-42.459')] [2022-07-10 00:45:03,057][26022] Updated weights on worker 0-0, policy_version 493053 (0.00089) [2022-07-10 00:45:05,231][26022] Updated weights on worker 0-0, policy_version 493063 (0.00094) [2022-07-10 00:45:06,749][25689] Fps is (10 sec: 5502.9, 60 sec: 5665.4, 300 sec: 5658.0). Total num frames: 504905728. Throughput: 0: 5872.8. Samples: 504907930. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:06,750][25689] Avg episode reward: [(0, '-41.775')] [2022-07-10 00:45:06,774][26022] Updated weights on worker 0-0, policy_version 493073 (0.00085) [2022-07-10 00:45:08,717][26022] Updated weights on worker 0-0, policy_version 493083 (0.00085) [2022-07-10 00:45:10,465][26022] Updated weights on worker 0-0, policy_version 493093 (0.00089) [2022-07-10 00:45:11,799][25689] Fps is (10 sec: 5691.1, 60 sec: 5682.0, 300 sec: 5657.4). Total num frames: 504934400. Throughput: 0: 5882.0. Samples: 504942370. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:11,799][25689] Avg episode reward: [(0, '-42.099')] [2022-07-10 00:45:12,035][26022] Updated weights on worker 0-0, policy_version 493103 (0.00087) [2022-07-10 00:45:14,088][26022] Updated weights on worker 0-0, policy_version 493113 (0.00093) [2022-07-10 00:45:15,755][26022] Updated weights on worker 0-0, policy_version 493123 (0.00089) [2022-07-10 00:45:16,866][25689] Fps is (10 sec: 5769.4, 60 sec: 5676.7, 300 sec: 5663.3). Total num frames: 504964096. Throughput: 0: 5023.6. Samples: 504959614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:16,867][25689] Avg episode reward: [(0, '-42.095')] [2022-07-10 00:45:17,547][26022] Updated weights on worker 0-0, policy_version 493133 (0.00083) [2022-07-10 00:45:19,506][26022] Updated weights on worker 0-0, policy_version 493143 (0.00094) [2022-07-10 00:45:20,984][26022] Updated weights on worker 0-0, policy_version 493153 (0.00081) [2022-07-10 00:45:21,895][25689] Fps is (10 sec: 5680.0, 60 sec: 5675.1, 300 sec: 5656.3). Total num frames: 504991744. Throughput: 0: 5862.5. Samples: 504993882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:21,895][25689] Avg episode reward: [(0, '-42.102')] [2022-07-10 00:45:23,042][26022] Updated weights on worker 0-0, policy_version 493163 (0.00087) [2022-07-10 00:45:24,672][26022] Updated weights on worker 0-0, policy_version 493173 (0.00087) [2022-07-10 00:45:26,442][26022] Updated weights on worker 0-0, policy_version 493183 (0.00085) [2022-07-10 00:45:26,908][25689] Fps is (10 sec: 5812.4, 60 sec: 5709.4, 300 sec: 5664.1). Total num frames: 505022464. Throughput: 0: 5984.7. Samples: 505028574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:26,908][25689] Avg episode reward: [(0, '-42.407')] [2022-07-10 00:45:28,258][26022] Updated weights on worker 0-0, policy_version 493193 (0.00087) [2022-07-10 00:45:30,064][26022] Updated weights on worker 0-0, policy_version 493203 (0.00085) [2022-07-10 00:45:31,973][25689] Fps is (10 sec: 5791.1, 60 sec: 5661.6, 300 sec: 5659.7). Total num frames: 505050112. Throughput: 0: 5120.2. Samples: 505045668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:31,974][25689] Avg episode reward: [(0, '-43.823')] [2022-07-10 00:45:31,975][26022] Updated weights on worker 0-0, policy_version 493213 (0.00090) [2022-07-10 00:45:33,761][26022] Updated weights on worker 0-0, policy_version 493223 (0.00089) [2022-07-10 00:45:35,506][26022] Updated weights on worker 0-0, policy_version 493233 (0.00092) [2022-07-10 00:45:37,015][25689] Fps is (10 sec: 5572.4, 60 sec: 5696.4, 300 sec: 5659.7). Total num frames: 505078784. Throughput: 0: 5976.6. Samples: 505080036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:37,015][25689] Avg episode reward: [(0, '-43.227')] [2022-07-10 00:45:37,413][26022] Updated weights on worker 0-0, policy_version 493243 (0.00088) [2022-07-10 00:45:39,167][26022] Updated weights on worker 0-0, policy_version 493253 (0.00078) [2022-07-10 00:45:40,929][26022] Updated weights on worker 0-0, policy_version 493263 (0.00089) [2022-07-10 00:45:42,090][25689] Fps is (10 sec: 5668.4, 60 sec: 5681.5, 300 sec: 5658.5). Total num frames: 505107456. Throughput: 0: 5951.0. Samples: 505114064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:42,091][25689] Avg episode reward: [(0, '-43.118')] [2022-07-10 00:45:42,574][26022] Updated weights on worker 0-0, policy_version 493273 (0.00090) [2022-07-10 00:45:44,585][26022] Updated weights on worker 0-0, policy_version 493283 (0.00703) [2022-07-10 00:45:45,642][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:45:45,654][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000493291_505129984.pth [2022-07-10 00:45:45,654][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000491298_503089152.pth [2022-07-10 00:45:46,333][26022] Updated weights on worker 0-0, policy_version 493293 (0.00094) [2022-07-10 00:45:47,158][25689] Fps is (10 sec: 5754.5, 60 sec: 5679.6, 300 sec: 5662.3). Total num frames: 505137152. Throughput: 0: 5080.4. Samples: 505131452. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:47,158][25689] Avg episode reward: [(0, '-42.904')] [2022-07-10 00:45:48,128][26022] Updated weights on worker 0-0, policy_version 493303 (0.00088) [2022-07-10 00:45:49,894][26022] Updated weights on worker 0-0, policy_version 493313 (0.00088) [2022-07-10 00:45:51,597][26022] Updated weights on worker 0-0, policy_version 493323 (0.00096) [2022-07-10 00:45:52,247][25689] Fps is (10 sec: 5746.2, 60 sec: 5680.6, 300 sec: 5668.0). Total num frames: 505165824. Throughput: 0: 5921.8. Samples: 505165726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:52,248][25689] Avg episode reward: [(0, '-42.612')] [2022-07-10 00:45:53,417][26022] Updated weights on worker 0-0, policy_version 493333 (0.00086) [2022-07-10 00:45:55,187][26022] Updated weights on worker 0-0, policy_version 493343 (0.00084) [2022-07-10 00:45:57,106][26022] Updated weights on worker 0-0, policy_version 493353 (0.00093) [2022-07-10 00:45:57,262][25689] Fps is (10 sec: 5675.3, 60 sec: 5681.2, 300 sec: 5661.5). Total num frames: 505194496. Throughput: 0: 5910.2. Samples: 505199700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:45:57,263][25689] Avg episode reward: [(0, '-43.283')] [2022-07-10 00:45:58,860][26022] Updated weights on worker 0-0, policy_version 493363 (0.00089) [2022-07-10 00:46:00,664][26022] Updated weights on worker 0-0, policy_version 493373 (0.00093) [2022-07-10 00:46:02,307][25689] Fps is (10 sec: 5496.9, 60 sec: 5679.1, 300 sec: 5667.8). Total num frames: 505221120. Throughput: 0: 5095.1. Samples: 505217072. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:02,307][25689] Avg episode reward: [(0, '-42.067')] [2022-07-10 00:46:02,673][26022] Updated weights on worker 0-0, policy_version 493383 (0.00088) [2022-07-10 00:46:04,543][26022] Updated weights on worker 0-0, policy_version 493393 (0.00084) [2022-07-10 00:46:06,238][26022] Updated weights on worker 0-0, policy_version 493403 (0.00093) [2022-07-10 00:46:07,321][25689] Fps is (10 sec: 5395.5, 60 sec: 5663.4, 300 sec: 5665.2). Total num frames: 505248768. Throughput: 0: 5848.1. Samples: 505249366. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:07,321][25689] Avg episode reward: [(0, '-41.936')] [2022-07-10 00:46:08,252][26022] Updated weights on worker 0-0, policy_version 493413 (0.00092) [2022-07-10 00:46:09,902][26022] Updated weights on worker 0-0, policy_version 493423 (0.00086) [2022-07-10 00:46:11,907][26022] Updated weights on worker 0-0, policy_version 493433 (0.00089) [2022-07-10 00:46:12,445][25689] Fps is (10 sec: 5555.2, 60 sec: 5656.4, 300 sec: 5660.5). Total num frames: 505277440. Throughput: 0: 5822.3. Samples: 505283324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:12,446][25689] Avg episode reward: [(0, '-41.702')] [2022-07-10 00:46:13,476][26022] Updated weights on worker 0-0, policy_version 493443 (0.00087) [2022-07-10 00:46:15,613][26022] Updated weights on worker 0-0, policy_version 493453 (0.00105) [2022-07-10 00:46:17,079][26022] Updated weights on worker 0-0, policy_version 493463 (0.00084) [2022-07-10 00:46:17,515][25689] Fps is (10 sec: 5826.3, 60 sec: 5673.1, 300 sec: 5670.0). Total num frames: 505308160. Throughput: 0: 4979.6. Samples: 505300550. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:17,515][25689] Avg episode reward: [(0, '-42.284')] [2022-07-10 00:46:19,066][26022] Updated weights on worker 0-0, policy_version 493473 (0.00854) [2022-07-10 00:46:20,722][26022] Updated weights on worker 0-0, policy_version 493483 (0.00089) [2022-07-10 00:46:22,594][25689] Fps is (10 sec: 5751.1, 60 sec: 5668.3, 300 sec: 5665.7). Total num frames: 505335808. Throughput: 0: 5819.4. Samples: 505335132. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:22,595][25689] Avg episode reward: [(0, '-41.064')] [2022-07-10 00:46:22,634][26022] Updated weights on worker 0-0, policy_version 493493 (0.00082) [2022-07-10 00:46:24,271][26022] Updated weights on worker 0-0, policy_version 493503 (0.00082) [2022-07-10 00:46:26,174][26022] Updated weights on worker 0-0, policy_version 493513 (0.00092) [2022-07-10 00:46:27,599][25689] Fps is (10 sec: 5686.2, 60 sec: 5652.2, 300 sec: 5671.3). Total num frames: 505365504. Throughput: 0: 5922.4. Samples: 505369464. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:27,600][25689] Avg episode reward: [(0, '-41.351')] [2022-07-10 00:46:27,908][26022] Updated weights on worker 0-0, policy_version 493523 (0.00082) [2022-07-10 00:46:29,683][26022] Updated weights on worker 0-0, policy_version 493533 (0.00090) [2022-07-10 00:46:31,566][26022] Updated weights on worker 0-0, policy_version 493543 (0.00090) [2022-07-10 00:46:32,664][25689] Fps is (10 sec: 5796.2, 60 sec: 5669.1, 300 sec: 5667.9). Total num frames: 505394176. Throughput: 0: 5952.1. Samples: 505403670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:32,665][25689] Avg episode reward: [(0, '-42.094')] [2022-07-10 00:46:33,181][26022] Updated weights on worker 0-0, policy_version 493553 (0.00084) [2022-07-10 00:46:35,277][26022] Updated weights on worker 0-0, policy_version 493563 (0.00091) [2022-07-10 00:46:36,860][26022] Updated weights on worker 0-0, policy_version 493573 (0.00092) [2022-07-10 00:46:37,706][25689] Fps is (10 sec: 5572.4, 60 sec: 5652.2, 300 sec: 5668.1). Total num frames: 505421824. Throughput: 0: 5958.0. Samples: 505420852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:37,707][25689] Avg episode reward: [(0, '-42.522')] [2022-07-10 00:46:38,769][26022] Updated weights on worker 0-0, policy_version 493583 (0.00082) [2022-07-10 00:46:40,510][26022] Updated weights on worker 0-0, policy_version 493593 (0.00089) [2022-07-10 00:46:42,335][26022] Updated weights on worker 0-0, policy_version 493603 (0.00087) [2022-07-10 00:46:42,739][25689] Fps is (10 sec: 5691.6, 60 sec: 5673.0, 300 sec: 5671.8). Total num frames: 505451520. Throughput: 0: 5950.9. Samples: 505455014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:42,740][25689] Avg episode reward: [(0, '-43.653')] [2022-07-10 00:46:44,016][26022] Updated weights on worker 0-0, policy_version 493613 (0.00090) [2022-07-10 00:46:46,110][26022] Updated weights on worker 0-0, policy_version 493623 (0.00085) [2022-07-10 00:46:47,603][26022] Updated weights on worker 0-0, policy_version 493633 (0.00081) [2022-07-10 00:46:47,761][25689] Fps is (10 sec: 5804.9, 60 sec: 5660.4, 300 sec: 5673.0). Total num frames: 505480192. Throughput: 0: 5952.7. Samples: 505489482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:47,762][25689] Avg episode reward: [(0, '-43.974')] [2022-07-10 00:46:49,629][26022] Updated weights on worker 0-0, policy_version 493643 (0.00089) [2022-07-10 00:46:51,151][26022] Updated weights on worker 0-0, policy_version 493653 (0.00097) [2022-07-10 00:46:52,862][25689] Fps is (10 sec: 5664.8, 60 sec: 5659.3, 300 sec: 5667.7). Total num frames: 505508864. Throughput: 0: 5105.6. Samples: 505506794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:52,864][25689] Avg episode reward: [(0, '-43.934')] [2022-07-10 00:46:53,140][26022] Updated weights on worker 0-0, policy_version 493663 (0.00079) [2022-07-10 00:46:54,747][26022] Updated weights on worker 0-0, policy_version 493673 (0.00089) [2022-07-10 00:46:56,723][26022] Updated weights on worker 0-0, policy_version 493683 (0.00086) [2022-07-10 00:46:57,885][25689] Fps is (10 sec: 5664.5, 60 sec: 5658.6, 300 sec: 5674.6). Total num frames: 505537536. Throughput: 0: 5964.0. Samples: 505541196. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:46:57,886][25689] Avg episode reward: [(0, '-43.651')] [2022-07-10 00:46:58,376][26022] Updated weights on worker 0-0, policy_version 493693 (0.00095) [2022-07-10 00:47:00,131][26022] Updated weights on worker 0-0, policy_version 493703 (0.00088) [2022-07-10 00:47:02,491][26022] Updated weights on worker 0-0, policy_version 493713 (0.00080) [2022-07-10 00:47:02,897][25689] Fps is (10 sec: 5510.6, 60 sec: 5661.7, 300 sec: 5667.6). Total num frames: 505564160. Throughput: 0: 5873.2. Samples: 505573402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:02,899][25689] Avg episode reward: [(0, '-43.898')] [2022-07-10 00:47:04,066][26022] Updated weights on worker 0-0, policy_version 493723 (0.00099) [2022-07-10 00:47:05,885][26022] Updated weights on worker 0-0, policy_version 493733 (0.00094) [2022-07-10 00:47:07,845][26022] Updated weights on worker 0-0, policy_version 493743 (0.00081) [2022-07-10 00:47:07,905][25689] Fps is (10 sec: 5620.7, 60 sec: 5696.0, 300 sec: 5682.4). Total num frames: 505593856. Throughput: 0: 5035.5. Samples: 505590914. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:07,906][25689] Avg episode reward: [(0, '-43.806')] [2022-07-10 00:47:09,467][26022] Updated weights on worker 0-0, policy_version 493753 (0.00094) [2022-07-10 00:47:11,526][26022] Updated weights on worker 0-0, policy_version 493763 (0.00087) [2022-07-10 00:47:12,969][25689] Fps is (10 sec: 5794.7, 60 sec: 5701.6, 300 sec: 5674.5). Total num frames: 505622528. Throughput: 0: 5885.1. Samples: 505625126. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:12,971][25689] Avg episode reward: [(0, '-43.383')] [2022-07-10 00:47:13,064][26022] Updated weights on worker 0-0, policy_version 493773 (0.00089) [2022-07-10 00:47:14,987][26022] Updated weights on worker 0-0, policy_version 493783 (0.00080) [2022-07-10 00:47:16,613][26022] Updated weights on worker 0-0, policy_version 493793 (0.00085) [2022-07-10 00:47:17,982][25689] Fps is (10 sec: 5690.2, 60 sec: 5673.1, 300 sec: 5678.1). Total num frames: 505651200. Throughput: 0: 5906.5. Samples: 505659902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:17,983][25689] Avg episode reward: [(0, '-43.536')] [2022-07-10 00:47:18,525][26022] Updated weights on worker 0-0, policy_version 493803 (0.00084) [2022-07-10 00:47:20,119][26022] Updated weights on worker 0-0, policy_version 493813 (0.00083) [2022-07-10 00:47:21,888][26022] Updated weights on worker 0-0, policy_version 493823 (0.00087) [2022-07-10 00:47:22,997][25689] Fps is (10 sec: 5820.8, 60 sec: 5713.1, 300 sec: 5681.5). Total num frames: 505680896. Throughput: 0: 5179.0. Samples: 505677500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:22,998][25689] Avg episode reward: [(0, '-43.778')] [2022-07-10 00:47:23,728][26022] Updated weights on worker 0-0, policy_version 493833 (0.00087) [2022-07-10 00:47:25,391][26022] Updated weights on worker 0-0, policy_version 493843 (0.00092) [2022-07-10 00:47:27,395][26022] Updated weights on worker 0-0, policy_version 493853 (0.00086) [2022-07-10 00:47:28,013][25689] Fps is (10 sec: 5818.8, 60 sec: 5695.1, 300 sec: 5682.7). Total num frames: 505709568. Throughput: 0: 6037.8. Samples: 505712326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:28,014][25689] Avg episode reward: [(0, '-43.799')] [2022-07-10 00:47:28,940][26022] Updated weights on worker 0-0, policy_version 493863 (0.00085) [2022-07-10 00:47:31,016][26022] Updated weights on worker 0-0, policy_version 493873 (0.00089) [2022-07-10 00:47:32,456][26022] Updated weights on worker 0-0, policy_version 493883 (0.00087) [2022-07-10 00:47:33,055][25689] Fps is (10 sec: 5701.2, 60 sec: 5697.3, 300 sec: 5680.0). Total num frames: 505738240. Throughput: 0: 6050.9. Samples: 505746662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:33,056][25689] Avg episode reward: [(0, '-43.085')] [2022-07-10 00:47:34,343][26022] Updated weights on worker 0-0, policy_version 493893 (0.00091) [2022-07-10 00:47:36,149][26022] Updated weights on worker 0-0, policy_version 493903 (0.00088) [2022-07-10 00:47:37,847][26022] Updated weights on worker 0-0, policy_version 493913 (0.00080) [2022-07-10 00:47:38,071][25689] Fps is (10 sec: 5803.4, 60 sec: 5733.7, 300 sec: 5684.1). Total num frames: 505767936. Throughput: 0: 5188.6. Samples: 505764134. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:38,072][25689] Avg episode reward: [(0, '-42.910')] [2022-07-10 00:47:39,755][26022] Updated weights on worker 0-0, policy_version 493923 (0.00082) [2022-07-10 00:47:41,572][26022] Updated weights on worker 0-0, policy_version 493933 (0.00087) [2022-07-10 00:47:43,087][25689] Fps is (10 sec: 5716.4, 60 sec: 5701.4, 300 sec: 5677.1). Total num frames: 505795584. Throughput: 0: 6023.4. Samples: 505798508. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:43,087][25689] Avg episode reward: [(0, '-43.352')] [2022-07-10 00:47:43,218][26022] Updated weights on worker 0-0, policy_version 493943 (0.00082) [2022-07-10 00:47:44,948][26022] Updated weights on worker 0-0, policy_version 493953 (0.00086) [2022-07-10 00:47:45,835][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:47:45,845][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000493958_505812992.pth [2022-07-10 00:47:45,851][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000491958_503764992.pth [2022-07-10 00:47:46,867][26022] Updated weights on worker 0-0, policy_version 493963 (0.00092) [2022-07-10 00:47:48,107][25689] Fps is (10 sec: 5713.6, 60 sec: 5718.5, 300 sec: 5688.4). Total num frames: 505825280. Throughput: 0: 6023.0. Samples: 505833352. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:48,108][25689] Avg episode reward: [(0, '-42.961')] [2022-07-10 00:47:48,393][26022] Updated weights on worker 0-0, policy_version 493973 (0.00088) [2022-07-10 00:47:50,543][26022] Updated weights on worker 0-0, policy_version 493983 (0.00092) [2022-07-10 00:47:52,033][26022] Updated weights on worker 0-0, policy_version 493993 (0.00084) [2022-07-10 00:47:53,161][25689] Fps is (10 sec: 5793.6, 60 sec: 5723.0, 300 sec: 5684.3). Total num frames: 505853952. Throughput: 0: 5179.6. Samples: 505850804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:53,162][25689] Avg episode reward: [(0, '-42.986')] [2022-07-10 00:47:53,951][26022] Updated weights on worker 0-0, policy_version 494003 (0.00085) [2022-07-10 00:47:55,583][26022] Updated weights on worker 0-0, policy_version 494013 (0.00086) [2022-07-10 00:47:57,404][26022] Updated weights on worker 0-0, policy_version 494023 (0.00090) [2022-07-10 00:47:58,163][25689] Fps is (10 sec: 5702.9, 60 sec: 5724.9, 300 sec: 5684.4). Total num frames: 505882624. Throughput: 0: 6036.8. Samples: 505885424. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:47:58,163][25689] Avg episode reward: [(0, '-42.768')] [2022-07-10 00:47:59,233][26022] Updated weights on worker 0-0, policy_version 494033 (0.00079) [2022-07-10 00:48:00,909][26022] Updated weights on worker 0-0, policy_version 494043 (0.00080) [2022-07-10 00:48:03,018][26022] Updated weights on worker 0-0, policy_version 494053 (0.00091) [2022-07-10 00:48:03,183][25689] Fps is (10 sec: 5619.9, 60 sec: 5741.2, 300 sec: 5684.3). Total num frames: 505910272. Throughput: 0: 5935.7. Samples: 505917792. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:03,183][25689] Avg episode reward: [(0, '-42.230')] [2022-07-10 00:48:05,022][26022] Updated weights on worker 0-0, policy_version 494063 (0.00089) [2022-07-10 00:48:06,665][26022] Updated weights on worker 0-0, policy_version 494073 (0.00090) [2022-07-10 00:48:08,185][25689] Fps is (10 sec: 5619.7, 60 sec: 5724.8, 300 sec: 5687.0). Total num frames: 505938944. Throughput: 0: 5056.1. Samples: 505934864. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:08,185][25689] Avg episode reward: [(0, '-42.158')] [2022-07-10 00:48:08,472][26022] Updated weights on worker 0-0, policy_version 494083 (0.00095) [2022-07-10 00:48:10,283][26022] Updated weights on worker 0-0, policy_version 494093 (0.00084) [2022-07-10 00:48:11,987][26022] Updated weights on worker 0-0, policy_version 494103 (0.00083) [2022-07-10 00:48:13,263][25689] Fps is (10 sec: 5688.9, 60 sec: 5723.5, 300 sec: 5685.5). Total num frames: 505967616. Throughput: 0: 5899.5. Samples: 505969392. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:13,263][25689] Avg episode reward: [(0, '-41.930')] [2022-07-10 00:48:14,027][26022] Updated weights on worker 0-0, policy_version 494113 (0.00104) [2022-07-10 00:48:15,836][26022] Updated weights on worker 0-0, policy_version 494123 (0.00084) [2022-07-10 00:48:17,482][26022] Updated weights on worker 0-0, policy_version 494133 (0.00082) [2022-07-10 00:48:18,269][25689] Fps is (10 sec: 5788.2, 60 sec: 5741.1, 300 sec: 5689.0). Total num frames: 505997312. Throughput: 0: 5887.2. Samples: 506003792. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:18,269][25689] Avg episode reward: [(0, '-41.963')] [2022-07-10 00:48:19,323][26022] Updated weights on worker 0-0, policy_version 494143 (0.00083) [2022-07-10 00:48:20,887][26022] Updated weights on worker 0-0, policy_version 494153 (0.00090) [2022-07-10 00:48:23,027][26022] Updated weights on worker 0-0, policy_version 494163 (0.00088) [2022-07-10 00:48:23,277][25689] Fps is (10 sec: 5726.4, 60 sec: 5707.8, 300 sec: 5683.5). Total num frames: 506024960. Throughput: 0: 5137.3. Samples: 506021022. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:23,277][25689] Avg episode reward: [(0, '-42.068')] [2022-07-10 00:48:24,647][26022] Updated weights on worker 0-0, policy_version 494173 (0.00084) [2022-07-10 00:48:26,267][26022] Updated weights on worker 0-0, policy_version 494183 (0.00091) [2022-07-10 00:48:28,163][26022] Updated weights on worker 0-0, policy_version 494193 (0.00084) [2022-07-10 00:48:28,303][25689] Fps is (10 sec: 5714.7, 60 sec: 5723.8, 300 sec: 5688.8). Total num frames: 506054656. Throughput: 0: 5997.3. Samples: 506055520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:28,304][25689] Avg episode reward: [(0, '-41.445')] [2022-07-10 00:48:30,071][26022] Updated weights on worker 0-0, policy_version 494203 (0.00092) [2022-07-10 00:48:31,777][26022] Updated weights on worker 0-0, policy_version 494213 (0.00093) [2022-07-10 00:48:33,427][25689] Fps is (10 sec: 5649.3, 60 sec: 5699.0, 300 sec: 5684.2). Total num frames: 506082304. Throughput: 0: 5976.0. Samples: 506089896. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:33,428][25689] Avg episode reward: [(0, '-41.498')] [2022-07-10 00:48:33,741][26022] Updated weights on worker 0-0, policy_version 494223 (0.00092) [2022-07-10 00:48:35,296][26022] Updated weights on worker 0-0, policy_version 494233 (0.00088) [2022-07-10 00:48:37,267][26022] Updated weights on worker 0-0, policy_version 494243 (0.00095) [2022-07-10 00:48:38,523][25689] Fps is (10 sec: 5711.2, 60 sec: 5708.5, 300 sec: 5689.6). Total num frames: 506113024. Throughput: 0: 5105.0. Samples: 506107190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:38,525][25689] Avg episode reward: [(0, '-41.987')] [2022-07-10 00:48:38,874][26022] Updated weights on worker 0-0, policy_version 494253 (0.00086) [2022-07-10 00:48:40,762][26022] Updated weights on worker 0-0, policy_version 494263 (0.00087) [2022-07-10 00:48:42,455][26022] Updated weights on worker 0-0, policy_version 494273 (0.00093) [2022-07-10 00:48:43,532][25689] Fps is (10 sec: 5877.5, 60 sec: 5726.0, 300 sec: 5690.7). Total num frames: 506141696. Throughput: 0: 5949.6. Samples: 506141534. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:43,534][25689] Avg episode reward: [(0, '-41.508')] [2022-07-10 00:48:44,267][26022] Updated weights on worker 0-0, policy_version 494283 (0.00092) [2022-07-10 00:48:46,066][26022] Updated weights on worker 0-0, policy_version 494293 (0.00086) [2022-07-10 00:48:47,849][26022] Updated weights on worker 0-0, policy_version 494303 (0.00085) [2022-07-10 00:48:48,634][25689] Fps is (10 sec: 5570.0, 60 sec: 5684.5, 300 sec: 5686.7). Total num frames: 506169344. Throughput: 0: 5939.1. Samples: 506176270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:48,635][25689] Avg episode reward: [(0, '-42.130')] [2022-07-10 00:48:49,278][26022] Updated weights on worker 0-0, policy_version 494313 (0.00086) [2022-07-10 00:48:51,755][26022] Updated weights on worker 0-0, policy_version 494323 (0.00082) [2022-07-10 00:48:53,170][26022] Updated weights on worker 0-0, policy_version 494333 (0.00087) [2022-07-10 00:48:53,745][25689] Fps is (10 sec: 5515.0, 60 sec: 5679.2, 300 sec: 5681.3). Total num frames: 506198016. Throughput: 0: 5081.2. Samples: 506193132. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:53,745][25689] Avg episode reward: [(0, '-41.828')] [2022-07-10 00:48:55,257][26022] Updated weights on worker 0-0, policy_version 494343 (0.00084) [2022-07-10 00:48:56,837][26022] Updated weights on worker 0-0, policy_version 494353 (0.00086) [2022-07-10 00:48:58,635][26022] Updated weights on worker 0-0, policy_version 494363 (0.00085) [2022-07-10 00:48:58,782][25689] Fps is (10 sec: 5751.8, 60 sec: 5692.7, 300 sec: 5681.1). Total num frames: 506227712. Throughput: 0: 5920.2. Samples: 506227128. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:48:58,783][25689] Avg episode reward: [(0, '-42.981')] [2022-07-10 00:49:00,395][26022] Updated weights on worker 0-0, policy_version 494373 (0.00094) [2022-07-10 00:49:02,564][26022] Updated weights on worker 0-0, policy_version 494383 (0.00084) [2022-07-10 00:49:03,803][25689] Fps is (10 sec: 5700.8, 60 sec: 5692.6, 300 sec: 5687.7). Total num frames: 506255360. Throughput: 0: 5827.3. Samples: 506259660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:03,804][25689] Avg episode reward: [(0, '-43.481')] [2022-07-10 00:49:04,404][26022] Updated weights on worker 0-0, policy_version 494393 (0.00083) [2022-07-10 00:49:06,315][26022] Updated weights on worker 0-0, policy_version 494403 (0.00080) [2022-07-10 00:49:07,746][26022] Updated weights on worker 0-0, policy_version 494413 (0.00087) [2022-07-10 00:49:08,855][25689] Fps is (10 sec: 5489.6, 60 sec: 5671.0, 300 sec: 5687.6). Total num frames: 506283008. Throughput: 0: 4978.2. Samples: 506276932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:08,856][25689] Avg episode reward: [(0, '-43.167')] [2022-07-10 00:49:09,961][26022] Updated weights on worker 0-0, policy_version 494423 (0.00275) [2022-07-10 00:49:11,342][26022] Updated weights on worker 0-0, policy_version 494433 (0.00088) [2022-07-10 00:49:13,407][26022] Updated weights on worker 0-0, policy_version 494443 (0.00094) [2022-07-10 00:49:13,918][25689] Fps is (10 sec: 5770.9, 60 sec: 5706.3, 300 sec: 5690.0). Total num frames: 506313728. Throughput: 0: 5866.4. Samples: 506311476. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:13,918][25689] Avg episode reward: [(0, '-43.017')] [2022-07-10 00:49:15,115][26022] Updated weights on worker 0-0, policy_version 494453 (0.00093) [2022-07-10 00:49:16,848][26022] Updated weights on worker 0-0, policy_version 494463 (0.00093) [2022-07-10 00:49:18,538][26022] Updated weights on worker 0-0, policy_version 494473 (0.00093) [2022-07-10 00:49:18,951][25689] Fps is (10 sec: 5781.1, 60 sec: 5669.9, 300 sec: 5689.6). Total num frames: 506341376. Throughput: 0: 5880.0. Samples: 506345722. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:18,953][25689] Avg episode reward: [(0, '-43.035')] [2022-07-10 00:49:20,538][26022] Updated weights on worker 0-0, policy_version 494483 (0.00083) [2022-07-10 00:49:22,195][26022] Updated weights on worker 0-0, policy_version 494493 (0.00081) [2022-07-10 00:49:23,988][25689] Fps is (10 sec: 5592.8, 60 sec: 5684.1, 300 sec: 5689.3). Total num frames: 506370048. Throughput: 0: 5116.8. Samples: 506362940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:23,990][25689] Avg episode reward: [(0, '-42.808')] [2022-07-10 00:49:24,147][26022] Updated weights on worker 0-0, policy_version 494503 (0.00086) [2022-07-10 00:49:25,792][26022] Updated weights on worker 0-0, policy_version 494513 (0.00096) [2022-07-10 00:49:27,554][26022] Updated weights on worker 0-0, policy_version 494523 (0.00094) [2022-07-10 00:49:29,035][25689] Fps is (10 sec: 5788.4, 60 sec: 5682.2, 300 sec: 5686.7). Total num frames: 506399744. Throughput: 0: 5978.8. Samples: 506397584. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:29,037][25689] Avg episode reward: [(0, '-43.193')] [2022-07-10 00:49:29,379][26022] Updated weights on worker 0-0, policy_version 494533 (0.00085) [2022-07-10 00:49:31,196][26022] Updated weights on worker 0-0, policy_version 494543 (0.00091) [2022-07-10 00:49:32,827][26022] Updated weights on worker 0-0, policy_version 494553 (0.00596) [2022-07-10 00:49:34,086][25689] Fps is (10 sec: 5678.7, 60 sec: 5689.0, 300 sec: 5690.2). Total num frames: 506427392. Throughput: 0: 5975.7. Samples: 506431996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:34,088][25689] Avg episode reward: [(0, '-43.449')] [2022-07-10 00:49:34,896][26022] Updated weights on worker 0-0, policy_version 494563 (0.00094) [2022-07-10 00:49:36,375][26022] Updated weights on worker 0-0, policy_version 494573 (0.00087) [2022-07-10 00:49:38,325][26022] Updated weights on worker 0-0, policy_version 494583 (0.00187) [2022-07-10 00:49:39,115][25689] Fps is (10 sec: 5790.4, 60 sec: 5695.3, 300 sec: 5694.9). Total num frames: 506458112. Throughput: 0: 5989.8. Samples: 506466498. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:39,117][25689] Avg episode reward: [(0, '-43.747')] [2022-07-10 00:49:40,149][26022] Updated weights on worker 0-0, policy_version 494593 (0.00097) [2022-07-10 00:49:41,863][26022] Updated weights on worker 0-0, policy_version 494603 (0.00092) [2022-07-10 00:49:43,790][26022] Updated weights on worker 0-0, policy_version 494613 (0.00087) [2022-07-10 00:49:44,138][25689] Fps is (10 sec: 5806.3, 60 sec: 5677.1, 300 sec: 5688.5). Total num frames: 506485760. Throughput: 0: 5991.0. Samples: 506483662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:44,140][25689] Avg episode reward: [(0, '-43.209')] [2022-07-10 00:49:45,465][26022] Updated weights on worker 0-0, policy_version 494623 (0.00082) [2022-07-10 00:49:45,971][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:49:45,986][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000494625_506496000.pth [2022-07-10 00:49:45,986][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000492623_504445952.pth [2022-07-10 00:49:47,354][26022] Updated weights on worker 0-0, policy_version 494633 (0.00094) [2022-07-10 00:49:49,153][25689] Fps is (10 sec: 5508.8, 60 sec: 5685.3, 300 sec: 5686.7). Total num frames: 506513408. Throughput: 0: 5965.4. Samples: 506517596. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:49,153][25689] Avg episode reward: [(0, '-43.395')] [2022-07-10 00:49:49,377][26022] Updated weights on worker 0-0, policy_version 494643 (0.00088) [2022-07-10 00:49:50,819][26022] Updated weights on worker 0-0, policy_version 494653 (0.00079) [2022-07-10 00:49:52,913][26022] Updated weights on worker 0-0, policy_version 494663 (0.01140) [2022-07-10 00:49:54,227][25689] Fps is (10 sec: 5785.6, 60 sec: 5722.5, 300 sec: 5692.6). Total num frames: 506544128. Throughput: 0: 5963.3. Samples: 506552104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:54,228][25689] Avg episode reward: [(0, '-43.196')] [2022-07-10 00:49:54,332][26022] Updated weights on worker 0-0, policy_version 494673 (0.00084) [2022-07-10 00:49:56,339][26022] Updated weights on worker 0-0, policy_version 494683 (0.00085) [2022-07-10 00:49:57,884][26022] Updated weights on worker 0-0, policy_version 494693 (0.00086) [2022-07-10 00:49:59,261][25689] Fps is (10 sec: 5774.1, 60 sec: 5689.0, 300 sec: 5695.8). Total num frames: 506571776. Throughput: 0: 5111.6. Samples: 506569478. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:49:59,263][25689] Avg episode reward: [(0, '-42.401')] [2022-07-10 00:49:59,847][26022] Updated weights on worker 0-0, policy_version 494703 (0.00094) [2022-07-10 00:50:01,629][26022] Updated weights on worker 0-0, policy_version 494713 (0.00090) [2022-07-10 00:50:03,739][26022] Updated weights on worker 0-0, policy_version 494723 (0.00086) [2022-07-10 00:50:04,283][25689] Fps is (10 sec: 5498.9, 60 sec: 5689.0, 300 sec: 5692.5). Total num frames: 506599424. Throughput: 0: 5880.3. Samples: 506602116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:04,283][25689] Avg episode reward: [(0, '-42.610')] [2022-07-10 00:50:05,454][26022] Updated weights on worker 0-0, policy_version 494733 (0.00084) [2022-07-10 00:50:07,195][26022] Updated weights on worker 0-0, policy_version 494743 (0.00092) [2022-07-10 00:50:09,111][26022] Updated weights on worker 0-0, policy_version 494753 (0.00103) [2022-07-10 00:50:09,294][25689] Fps is (10 sec: 5511.5, 60 sec: 5692.7, 300 sec: 5689.7). Total num frames: 506627072. Throughput: 0: 5895.5. Samples: 506636340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:09,295][25689] Avg episode reward: [(0, '-42.303')] [2022-07-10 00:50:10,720][26022] Updated weights on worker 0-0, policy_version 494763 (0.00095) [2022-07-10 00:50:12,870][26022] Updated weights on worker 0-0, policy_version 494773 (0.00086) [2022-07-10 00:50:14,431][25689] Fps is (10 sec: 5650.5, 60 sec: 5668.8, 300 sec: 5688.4). Total num frames: 506656768. Throughput: 0: 4998.0. Samples: 506653086. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:14,432][25689] Avg episode reward: [(0, '-43.755')] [2022-07-10 00:50:14,495][26022] Updated weights on worker 0-0, policy_version 494783 (0.00093) [2022-07-10 00:50:16,336][26022] Updated weights on worker 0-0, policy_version 494793 (0.00070) [2022-07-10 00:50:18,098][26022] Updated weights on worker 0-0, policy_version 494803 (0.00084) [2022-07-10 00:50:19,435][25689] Fps is (10 sec: 5654.8, 60 sec: 5671.6, 300 sec: 5688.9). Total num frames: 506684416. Throughput: 0: 5858.4. Samples: 506687662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:19,435][25689] Avg episode reward: [(0, '-43.844')] [2022-07-10 00:50:19,944][26022] Updated weights on worker 0-0, policy_version 494813 (0.00087) [2022-07-10 00:50:21,580][26022] Updated weights on worker 0-0, policy_version 494823 (0.00088) [2022-07-10 00:50:23,531][26022] Updated weights on worker 0-0, policy_version 494833 (0.00090) [2022-07-10 00:50:24,475][25689] Fps is (10 sec: 5709.6, 60 sec: 5688.2, 300 sec: 5684.9). Total num frames: 506714112. Throughput: 0: 5958.5. Samples: 506722430. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:24,475][25689] Avg episode reward: [(0, '-43.320')] [2022-07-10 00:50:24,953][26022] Updated weights on worker 0-0, policy_version 494843 (0.00094) [2022-07-10 00:50:27,092][26022] Updated weights on worker 0-0, policy_version 494853 (0.00087) [2022-07-10 00:50:28,783][26022] Updated weights on worker 0-0, policy_version 494863 (0.00094) [2022-07-10 00:50:29,477][25689] Fps is (10 sec: 5710.3, 60 sec: 5658.6, 300 sec: 5686.1). Total num frames: 506741760. Throughput: 0: 5121.6. Samples: 506739708. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:29,478][25689] Avg episode reward: [(0, '-43.648')] [2022-07-10 00:50:30,538][26022] Updated weights on worker 0-0, policy_version 494873 (0.00086) [2022-07-10 00:50:32,351][26022] Updated weights on worker 0-0, policy_version 494883 (0.00092) [2022-07-10 00:50:33,965][26022] Updated weights on worker 0-0, policy_version 494893 (0.00088) [2022-07-10 00:50:34,545][25689] Fps is (10 sec: 5897.5, 60 sec: 5724.7, 300 sec: 5696.0). Total num frames: 506773504. Throughput: 0: 6016.6. Samples: 506774104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:34,546][25689] Avg episode reward: [(0, '-43.706')] [2022-07-10 00:50:35,883][26022] Updated weights on worker 0-0, policy_version 494903 (0.00092) [2022-07-10 00:50:37,595][26022] Updated weights on worker 0-0, policy_version 494913 (0.00088) [2022-07-10 00:50:39,503][26022] Updated weights on worker 0-0, policy_version 494923 (0.00090) [2022-07-10 00:50:39,562][25689] Fps is (10 sec: 5889.1, 60 sec: 5675.0, 300 sec: 5693.6). Total num frames: 506801152. Throughput: 0: 6024.1. Samples: 506808910. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:39,563][25689] Avg episode reward: [(0, '-43.602')] [2022-07-10 00:50:41,119][26022] Updated weights on worker 0-0, policy_version 494933 (0.00083) [2022-07-10 00:50:42,959][26022] Updated weights on worker 0-0, policy_version 494943 (0.00093) [2022-07-10 00:50:44,578][25689] Fps is (10 sec: 5613.9, 60 sec: 5692.7, 300 sec: 5691.2). Total num frames: 506829824. Throughput: 0: 5165.2. Samples: 506826266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:44,591][25689] Avg episode reward: [(0, '-43.293')] [2022-07-10 00:50:44,764][26022] Updated weights on worker 0-0, policy_version 494953 (0.00089) [2022-07-10 00:50:46,704][26022] Updated weights on worker 0-0, policy_version 494963 (0.00091) [2022-07-10 00:50:48,354][26022] Updated weights on worker 0-0, policy_version 494973 (0.00087) [2022-07-10 00:50:49,608][25689] Fps is (10 sec: 5708.2, 60 sec: 5708.1, 300 sec: 5692.3). Total num frames: 506858496. Throughput: 0: 5987.7. Samples: 506860246. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 00:50:49,608][25689] Avg episode reward: [(0, '-42.666')] [2022-07-10 00:50:50,320][26022] Updated weights on worker 0-0, policy_version 494983 (0.00093) [2022-07-10 00:50:52,015][26022] Updated weights on worker 0-0, policy_version 494993 (0.00088) [2022-07-10 00:50:53,910][26022] Updated weights on worker 0-0, policy_version 495003 (0.00093) [2022-07-10 00:50:54,711][25689] Fps is (10 sec: 5658.7, 60 sec: 5671.5, 300 sec: 5690.6). Total num frames: 506887168. Throughput: 0: 5963.0. Samples: 506894354. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:50:54,712][25689] Avg episode reward: [(0, '-42.969')] [2022-07-10 00:50:55,687][26022] Updated weights on worker 0-0, policy_version 495013 (0.00088) [2022-07-10 00:50:57,522][26022] Updated weights on worker 0-0, policy_version 495023 (0.00084) [2022-07-10 00:50:59,203][26022] Updated weights on worker 0-0, policy_version 495033 (0.00082) [2022-07-10 00:50:59,736][25689] Fps is (10 sec: 5662.0, 60 sec: 5689.4, 300 sec: 5697.9). Total num frames: 506915840. Throughput: 0: 5084.1. Samples: 506911476. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:50:59,737][25689] Avg episode reward: [(0, '-43.071')] [2022-07-10 00:51:01,012][26022] Updated weights on worker 0-0, policy_version 495043 (0.00086) [2022-07-10 00:51:03,134][26022] Updated weights on worker 0-0, policy_version 495053 (0.00082) [2022-07-10 00:51:04,790][25689] Fps is (10 sec: 5588.2, 60 sec: 5686.4, 300 sec: 5697.1). Total num frames: 506943488. Throughput: 0: 5831.6. Samples: 506944134. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:04,790][25689] Avg episode reward: [(0, '-42.785')] [2022-07-10 00:51:04,838][26022] Updated weights on worker 0-0, policy_version 495063 (0.00090) [2022-07-10 00:51:06,611][26022] Updated weights on worker 0-0, policy_version 495073 (0.00095) [2022-07-10 00:51:08,519][26022] Updated weights on worker 0-0, policy_version 495083 (0.00084) [2022-07-10 00:51:09,820][25689] Fps is (10 sec: 5584.8, 60 sec: 5701.5, 300 sec: 5698.9). Total num frames: 506972160. Throughput: 0: 5861.5. Samples: 506978722. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:09,821][25689] Avg episode reward: [(0, '-42.375')] [2022-07-10 00:51:10,324][26022] Updated weights on worker 0-0, policy_version 495093 (0.00091) [2022-07-10 00:51:12,020][26022] Updated weights on worker 0-0, policy_version 495103 (0.00081) [2022-07-10 00:51:13,821][26022] Updated weights on worker 0-0, policy_version 495113 (0.00095) [2022-07-10 00:51:14,883][25689] Fps is (10 sec: 5783.0, 60 sec: 5708.5, 300 sec: 5695.6). Total num frames: 507001856. Throughput: 0: 5028.7. Samples: 506995790. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:14,883][25689] Avg episode reward: [(0, '-43.059')] [2022-07-10 00:51:15,597][26022] Updated weights on worker 0-0, policy_version 495123 (0.00087) [2022-07-10 00:51:17,459][26022] Updated weights on worker 0-0, policy_version 495133 (0.00083) [2022-07-10 00:51:19,311][26022] Updated weights on worker 0-0, policy_version 495143 (0.00087) [2022-07-10 00:51:19,886][25689] Fps is (10 sec: 5696.9, 60 sec: 5708.5, 300 sec: 5697.1). Total num frames: 507029504. Throughput: 0: 5885.0. Samples: 507030062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:19,887][25689] Avg episode reward: [(0, '-43.399')] [2022-07-10 00:51:20,960][26022] Updated weights on worker 0-0, policy_version 495153 (0.00081) [2022-07-10 00:51:22,931][26022] Updated weights on worker 0-0, policy_version 495163 (0.00501) [2022-07-10 00:51:24,553][26022] Updated weights on worker 0-0, policy_version 495173 (0.00091) [2022-07-10 00:51:24,913][25689] Fps is (10 sec: 5819.0, 60 sec: 5726.7, 300 sec: 5700.1). Total num frames: 507060224. Throughput: 0: 5981.9. Samples: 507064512. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:24,914][25689] Avg episode reward: [(0, '-42.717')] [2022-07-10 00:51:26,663][26022] Updated weights on worker 0-0, policy_version 495183 (0.00085) [2022-07-10 00:51:28,235][26022] Updated weights on worker 0-0, policy_version 495193 (0.00086) [2022-07-10 00:51:29,940][25689] Fps is (10 sec: 5602.0, 60 sec: 5690.5, 300 sec: 5690.5). Total num frames: 507085824. Throughput: 0: 5106.3. Samples: 507081458. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:29,940][25689] Avg episode reward: [(0, '-42.745')] [2022-07-10 00:51:30,243][26022] Updated weights on worker 0-0, policy_version 495203 (0.00097) [2022-07-10 00:51:31,747][26022] Updated weights on worker 0-0, policy_version 495213 (0.00083) [2022-07-10 00:51:33,576][26022] Updated weights on worker 0-0, policy_version 495223 (0.00086) [2022-07-10 00:51:35,068][25689] Fps is (10 sec: 5445.3, 60 sec: 5651.0, 300 sec: 5695.7). Total num frames: 507115520. Throughput: 0: 5942.0. Samples: 507115732. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:35,069][25689] Avg episode reward: [(0, '-42.356')] [2022-07-10 00:51:35,358][26022] Updated weights on worker 0-0, policy_version 495233 (0.00093) [2022-07-10 00:51:37,160][26022] Updated weights on worker 0-0, policy_version 495243 (0.00087) [2022-07-10 00:51:38,906][26022] Updated weights on worker 0-0, policy_version 495253 (0.00088) [2022-07-10 00:51:40,090][25689] Fps is (10 sec: 5851.3, 60 sec: 5684.4, 300 sec: 5696.0). Total num frames: 507145216. Throughput: 0: 5959.5. Samples: 507150466. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:40,090][25689] Avg episode reward: [(0, '-41.575')] [2022-07-10 00:51:40,884][26022] Updated weights on worker 0-0, policy_version 495263 (0.00098) [2022-07-10 00:51:42,225][26022] Updated weights on worker 0-0, policy_version 495273 (0.00089) [2022-07-10 00:51:44,376][26022] Updated weights on worker 0-0, policy_version 495283 (0.00083) [2022-07-10 00:51:45,139][25689] Fps is (10 sec: 5795.9, 60 sec: 5681.3, 300 sec: 5695.4). Total num frames: 507173888. Throughput: 0: 5101.9. Samples: 507167700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:45,139][25689] Avg episode reward: [(0, '-40.800')] [2022-07-10 00:51:46,002][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:51:46,011][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000495293_507180032.pth [2022-07-10 00:51:46,012][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000493291_505129984.pth [2022-07-10 00:51:46,014][26022] Updated weights on worker 0-0, policy_version 495293 (0.00090) [2022-07-10 00:51:47,843][26022] Updated weights on worker 0-0, policy_version 495303 (0.00093) [2022-07-10 00:51:49,596][26022] Updated weights on worker 0-0, policy_version 495313 (0.00086) [2022-07-10 00:51:50,155][25689] Fps is (10 sec: 5697.4, 60 sec: 5682.6, 300 sec: 5697.1). Total num frames: 507202560. Throughput: 0: 5986.9. Samples: 507202484. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:50,155][25689] Avg episode reward: [(0, '-41.028')] [2022-07-10 00:51:51,386][26022] Updated weights on worker 0-0, policy_version 495323 (0.00086) [2022-07-10 00:51:53,343][26022] Updated weights on worker 0-0, policy_version 495333 (0.00087) [2022-07-10 00:51:55,036][26022] Updated weights on worker 0-0, policy_version 495343 (0.00084) [2022-07-10 00:51:55,252][25689] Fps is (10 sec: 5771.3, 60 sec: 5700.1, 300 sec: 5699.1). Total num frames: 507232256. Throughput: 0: 6005.0. Samples: 507236938. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:51:55,253][25689] Avg episode reward: [(0, '-43.316')] [2022-07-10 00:51:56,698][26022] Updated weights on worker 0-0, policy_version 495353 (0.00085) [2022-07-10 00:51:58,715][26022] Updated weights on worker 0-0, policy_version 495363 (0.00093) [2022-07-10 00:52:00,257][25689] Fps is (10 sec: 5777.5, 60 sec: 5701.9, 300 sec: 5706.1). Total num frames: 507260928. Throughput: 0: 5133.9. Samples: 507254004. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:00,258][25689] Avg episode reward: [(0, '-42.961')] [2022-07-10 00:52:00,303][26022] Updated weights on worker 0-0, policy_version 495373 (0.00090) [2022-07-10 00:52:02,614][26022] Updated weights on worker 0-0, policy_version 495383 (0.00082) [2022-07-10 00:52:04,278][26022] Updated weights on worker 0-0, policy_version 495393 (0.00094) [2022-07-10 00:52:05,285][25689] Fps is (10 sec: 5409.6, 60 sec: 5670.6, 300 sec: 5692.0). Total num frames: 507286528. Throughput: 0: 5882.2. Samples: 507286202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:05,285][25689] Avg episode reward: [(0, '-43.148')] [2022-07-10 00:52:06,159][26022] Updated weights on worker 0-0, policy_version 495403 (0.00090) [2022-07-10 00:52:07,915][26022] Updated weights on worker 0-0, policy_version 495413 (0.00092) [2022-07-10 00:52:09,687][26022] Updated weights on worker 0-0, policy_version 495423 (0.00089) [2022-07-10 00:52:10,326][25689] Fps is (10 sec: 5390.0, 60 sec: 5669.5, 300 sec: 5692.4). Total num frames: 507315200. Throughput: 0: 5835.2. Samples: 507320190. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:10,327][25689] Avg episode reward: [(0, '-42.995')] [2022-07-10 00:52:11,516][26022] Updated weights on worker 0-0, policy_version 495433 (0.00095) [2022-07-10 00:52:13,328][26022] Updated weights on worker 0-0, policy_version 495443 (0.00094) [2022-07-10 00:52:15,061][26022] Updated weights on worker 0-0, policy_version 495453 (0.00096) [2022-07-10 00:52:15,387][25689] Fps is (10 sec: 5878.9, 60 sec: 5686.6, 300 sec: 5698.4). Total num frames: 507345920. Throughput: 0: 4982.7. Samples: 507337264. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:15,396][25689] Avg episode reward: [(0, '-41.974')] [2022-07-10 00:52:17,000][26022] Updated weights on worker 0-0, policy_version 495463 (0.00088) [2022-07-10 00:52:18,601][26022] Updated weights on worker 0-0, policy_version 495473 (0.00089) [2022-07-10 00:52:20,400][25689] Fps is (10 sec: 5794.3, 60 sec: 5685.7, 300 sec: 5691.5). Total num frames: 507373568. Throughput: 0: 5832.0. Samples: 507371474. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:20,400][25689] Avg episode reward: [(0, '-41.077')] [2022-07-10 00:52:20,601][26022] Updated weights on worker 0-0, policy_version 495483 (0.00081) [2022-07-10 00:52:22,301][26022] Updated weights on worker 0-0, policy_version 495493 (0.00085) [2022-07-10 00:52:24,039][26022] Updated weights on worker 0-0, policy_version 495503 (0.00109) [2022-07-10 00:52:25,437][25689] Fps is (10 sec: 5705.6, 60 sec: 5667.8, 300 sec: 5694.5). Total num frames: 507403264. Throughput: 0: 5952.2. Samples: 507406156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:25,439][25689] Avg episode reward: [(0, '-40.593')] [2022-07-10 00:52:25,954][26022] Updated weights on worker 0-0, policy_version 495513 (0.00082) [2022-07-10 00:52:27,715][26022] Updated weights on worker 0-0, policy_version 495523 (0.00090) [2022-07-10 00:52:29,480][26022] Updated weights on worker 0-0, policy_version 495533 (0.00085) [2022-07-10 00:52:30,458][25689] Fps is (10 sec: 5701.2, 60 sec: 5702.2, 300 sec: 5691.5). Total num frames: 507430912. Throughput: 0: 5994.6. Samples: 507440872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:30,460][25689] Avg episode reward: [(0, '-39.994')] [2022-07-10 00:52:31,323][26022] Updated weights on worker 0-0, policy_version 495543 (0.00094) [2022-07-10 00:52:33,025][26022] Updated weights on worker 0-0, policy_version 495553 (0.00084) [2022-07-10 00:52:34,774][26022] Updated weights on worker 0-0, policy_version 495563 (0.00087) [2022-07-10 00:52:35,539][25689] Fps is (10 sec: 5677.0, 60 sec: 5706.7, 300 sec: 5690.3). Total num frames: 507460608. Throughput: 0: 5987.0. Samples: 507457910. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:35,539][25689] Avg episode reward: [(0, '-40.157')] [2022-07-10 00:52:36,527][26022] Updated weights on worker 0-0, policy_version 495573 (0.00089) [2022-07-10 00:52:38,410][26022] Updated weights on worker 0-0, policy_version 495583 (0.00100) [2022-07-10 00:52:40,130][26022] Updated weights on worker 0-0, policy_version 495593 (0.00088) [2022-07-10 00:52:40,579][25689] Fps is (10 sec: 5666.0, 60 sec: 5671.1, 300 sec: 5689.8). Total num frames: 507488256. Throughput: 0: 5990.9. Samples: 507492364. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:40,579][25689] Avg episode reward: [(0, '-41.086')] [2022-07-10 00:52:41,805][26022] Updated weights on worker 0-0, policy_version 495603 (0.00084) [2022-07-10 00:52:43,896][26022] Updated weights on worker 0-0, policy_version 495613 (0.00087) [2022-07-10 00:52:45,377][26022] Updated weights on worker 0-0, policy_version 495623 (0.00082) [2022-07-10 00:52:45,584][25689] Fps is (10 sec: 5810.7, 60 sec: 5709.1, 300 sec: 5693.6). Total num frames: 507518976. Throughput: 0: 6000.9. Samples: 507527050. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:45,584][25689] Avg episode reward: [(0, '-40.694')] [2022-07-10 00:52:47,512][26022] Updated weights on worker 0-0, policy_version 495633 (0.00093) [2022-07-10 00:52:49,040][26022] Updated weights on worker 0-0, policy_version 495643 (0.00091) [2022-07-10 00:52:50,594][25689] Fps is (10 sec: 5827.9, 60 sec: 5692.7, 300 sec: 5690.9). Total num frames: 507546624. Throughput: 0: 5122.0. Samples: 507544010. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:50,595][25689] Avg episode reward: [(0, '-41.807')] [2022-07-10 00:52:50,796][26022] Updated weights on worker 0-0, policy_version 495653 (0.00085) [2022-07-10 00:52:52,851][26022] Updated weights on worker 0-0, policy_version 495663 (0.00087) [2022-07-10 00:52:54,468][26022] Updated weights on worker 0-0, policy_version 495673 (0.00102) [2022-07-10 00:52:55,667][25689] Fps is (10 sec: 5585.2, 60 sec: 5678.1, 300 sec: 5689.6). Total num frames: 507575296. Throughput: 0: 5979.8. Samples: 507578276. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:52:55,668][25689] Avg episode reward: [(0, '-42.268')] [2022-07-10 00:52:56,263][26022] Updated weights on worker 0-0, policy_version 495683 (0.00097) [2022-07-10 00:52:58,136][26022] Updated weights on worker 0-0, policy_version 495693 (0.00087) [2022-07-10 00:53:00,028][26022] Updated weights on worker 0-0, policy_version 495703 (0.00087) [2022-07-10 00:53:00,737][25689] Fps is (10 sec: 5653.9, 60 sec: 5672.0, 300 sec: 5692.1). Total num frames: 507603968. Throughput: 0: 5958.4. Samples: 507612472. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:00,737][25689] Avg episode reward: [(0, '-42.387')] [2022-07-10 00:53:01,499][26022] Updated weights on worker 0-0, policy_version 495713 (0.00083) [2022-07-10 00:53:03,927][26022] Updated weights on worker 0-0, policy_version 495723 (0.00084) [2022-07-10 00:53:05,632][26022] Updated weights on worker 0-0, policy_version 495733 (0.00093) [2022-07-10 00:53:05,762][25689] Fps is (10 sec: 5578.9, 60 sec: 5706.0, 300 sec: 5688.2). Total num frames: 507631616. Throughput: 0: 4981.9. Samples: 507627580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:05,763][25689] Avg episode reward: [(0, '-42.096')] [2022-07-10 00:53:07,419][26022] Updated weights on worker 0-0, policy_version 495743 (0.00095) [2022-07-10 00:53:09,144][26022] Updated weights on worker 0-0, policy_version 495753 (0.00093) [2022-07-10 00:53:10,780][25689] Fps is (10 sec: 5505.5, 60 sec: 5691.3, 300 sec: 5685.9). Total num frames: 507659264. Throughput: 0: 5822.5. Samples: 507661544. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:10,781][25689] Avg episode reward: [(0, '-42.379')] [2022-07-10 00:53:11,270][26022] Updated weights on worker 0-0, policy_version 495763 (0.00095) [2022-07-10 00:53:12,772][26022] Updated weights on worker 0-0, policy_version 495773 (0.00089) [2022-07-10 00:53:14,686][26022] Updated weights on worker 0-0, policy_version 495783 (0.00109) [2022-07-10 00:53:15,850][25689] Fps is (10 sec: 5684.7, 60 sec: 5673.6, 300 sec: 5684.7). Total num frames: 507688960. Throughput: 0: 5837.1. Samples: 507696084. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:15,850][25689] Avg episode reward: [(0, '-42.259')] [2022-07-10 00:53:16,160][26022] Updated weights on worker 0-0, policy_version 495793 (0.00093) [2022-07-10 00:53:18,238][26022] Updated weights on worker 0-0, policy_version 495803 (0.00087) [2022-07-10 00:53:20,157][26022] Updated weights on worker 0-0, policy_version 495813 (0.00087) [2022-07-10 00:53:20,876][25689] Fps is (10 sec: 5578.4, 60 sec: 5655.4, 300 sec: 5680.9). Total num frames: 507715584. Throughput: 0: 5001.1. Samples: 507713192. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:20,877][25689] Avg episode reward: [(0, '-41.869')] [2022-07-10 00:53:21,712][26022] Updated weights on worker 0-0, policy_version 495823 (0.00088) [2022-07-10 00:53:23,762][26022] Updated weights on worker 0-0, policy_version 495833 (0.00087) [2022-07-10 00:53:25,303][26022] Updated weights on worker 0-0, policy_version 495843 (0.00088) [2022-07-10 00:53:25,884][25689] Fps is (10 sec: 5613.0, 60 sec: 5658.2, 300 sec: 5681.2). Total num frames: 507745280. Throughput: 0: 5951.7. Samples: 507747336. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:25,884][25689] Avg episode reward: [(0, '-42.184')] [2022-07-10 00:53:27,410][26022] Updated weights on worker 0-0, policy_version 495853 (0.00092) [2022-07-10 00:53:28,937][26022] Updated weights on worker 0-0, policy_version 495863 (0.00099) [2022-07-10 00:53:30,899][25689] Fps is (10 sec: 5721.2, 60 sec: 5658.7, 300 sec: 5683.3). Total num frames: 507772928. Throughput: 0: 5950.3. Samples: 507781258. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:30,900][25689] Avg episode reward: [(0, '-41.481')] [2022-07-10 00:53:30,957][26022] Updated weights on worker 0-0, policy_version 495873 (0.00095) [2022-07-10 00:53:32,616][26022] Updated weights on worker 0-0, policy_version 495883 (0.00090) [2022-07-10 00:53:34,552][26022] Updated weights on worker 0-0, policy_version 495893 (0.00079) [2022-07-10 00:53:35,947][25689] Fps is (10 sec: 5698.5, 60 sec: 5661.7, 300 sec: 5680.8). Total num frames: 507802624. Throughput: 0: 5098.2. Samples: 507798542. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:35,948][25689] Avg episode reward: [(0, '-41.575')] [2022-07-10 00:53:36,093][26022] Updated weights on worker 0-0, policy_version 495903 (0.00094) [2022-07-10 00:53:38,121][26022] Updated weights on worker 0-0, policy_version 495913 (0.00086) [2022-07-10 00:53:39,822][26022] Updated weights on worker 0-0, policy_version 495923 (0.00081) [2022-07-10 00:53:41,006][25689] Fps is (10 sec: 5775.2, 60 sec: 5676.9, 300 sec: 5679.8). Total num frames: 507831296. Throughput: 0: 5941.3. Samples: 507832788. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 00:53:41,007][25689] Avg episode reward: [(0, '-41.723')] [2022-07-10 00:53:41,816][26022] Updated weights on worker 0-0, policy_version 495933 (0.00093) [2022-07-10 00:53:43,320][26022] Updated weights on worker 0-0, policy_version 495943 (0.00088) [2022-07-10 00:53:45,106][26022] Updated weights on worker 0-0, policy_version 495953 (0.00086) [2022-07-10 00:53:46,061][25689] Fps is (10 sec: 5669.7, 60 sec: 5638.3, 300 sec: 5684.2). Total num frames: 507859968. Throughput: 0: 5947.9. Samples: 507867348. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:53:46,062][25689] Avg episode reward: [(0, '-41.551')] [2022-07-10 00:53:46,066][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:53:46,078][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000495958_507860992.pth [2022-07-10 00:53:46,078][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000493958_505812992.pth [2022-07-10 00:53:46,992][26022] Updated weights on worker 0-0, policy_version 495963 (0.00092) [2022-07-10 00:53:48,606][26022] Updated weights on worker 0-0, policy_version 495973 (0.00083) [2022-07-10 00:53:50,506][26022] Updated weights on worker 0-0, policy_version 495983 (0.00092) [2022-07-10 00:53:51,066][25689] Fps is (10 sec: 5802.3, 60 sec: 5672.8, 300 sec: 5689.6). Total num frames: 507889664. Throughput: 0: 5121.1. Samples: 507884532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:53:51,066][25689] Avg episode reward: [(0, '-41.150')] [2022-07-10 00:53:52,251][26022] Updated weights on worker 0-0, policy_version 495993 (0.00092) [2022-07-10 00:53:54,153][26022] Updated weights on worker 0-0, policy_version 496003 (0.00087) [2022-07-10 00:53:56,018][26022] Updated weights on worker 0-0, policy_version 496013 (0.00097) [2022-07-10 00:53:56,151][25689] Fps is (10 sec: 5683.4, 60 sec: 5654.7, 300 sec: 5681.8). Total num frames: 507917312. Throughput: 0: 5947.6. Samples: 507918706. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:53:56,151][25689] Avg episode reward: [(0, '-42.354')] [2022-07-10 00:53:57,632][26022] Updated weights on worker 0-0, policy_version 496023 (0.00086) [2022-07-10 00:53:59,566][26022] Updated weights on worker 0-0, policy_version 496033 (0.00086) [2022-07-10 00:54:01,155][25689] Fps is (10 sec: 5784.8, 60 sec: 5694.7, 300 sec: 5692.5). Total num frames: 507948032. Throughput: 0: 5974.1. Samples: 507953162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:01,156][25689] Avg episode reward: [(0, '-42.454')] [2022-07-10 00:54:01,157][26022] Updated weights on worker 0-0, policy_version 496043 (0.00106) [2022-07-10 00:54:03,377][26022] Updated weights on worker 0-0, policy_version 496053 (0.00092) [2022-07-10 00:54:05,134][26022] Updated weights on worker 0-0, policy_version 496063 (0.00085) [2022-07-10 00:54:06,209][25689] Fps is (10 sec: 5701.0, 60 sec: 5675.1, 300 sec: 5689.0). Total num frames: 507974656. Throughput: 0: 5015.1. Samples: 507968394. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:06,210][25689] Avg episode reward: [(0, '-42.529')] [2022-07-10 00:54:06,988][26022] Updated weights on worker 0-0, policy_version 496073 (0.00093) [2022-07-10 00:54:08,826][26022] Updated weights on worker 0-0, policy_version 496083 (0.00087) [2022-07-10 00:54:10,549][26022] Updated weights on worker 0-0, policy_version 496093 (0.00081) [2022-07-10 00:54:11,260][25689] Fps is (10 sec: 5371.3, 60 sec: 5672.0, 300 sec: 5678.9). Total num frames: 508002304. Throughput: 0: 5869.7. Samples: 508003064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:11,260][25689] Avg episode reward: [(0, '-43.116')] [2022-07-10 00:54:12,250][26022] Updated weights on worker 0-0, policy_version 496103 (0.00094) [2022-07-10 00:54:14,137][26022] Updated weights on worker 0-0, policy_version 496113 (0.00088) [2022-07-10 00:54:15,946][26022] Updated weights on worker 0-0, policy_version 496123 (0.00085) [2022-07-10 00:54:16,315][25689] Fps is (10 sec: 5674.3, 60 sec: 5673.4, 300 sec: 5685.4). Total num frames: 508032000. Throughput: 0: 5877.2. Samples: 508037216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:16,316][25689] Avg episode reward: [(0, '-44.030')] [2022-07-10 00:54:17,732][26022] Updated weights on worker 0-0, policy_version 496133 (0.00090) [2022-07-10 00:54:19,598][26022] Updated weights on worker 0-0, policy_version 496143 (0.00082) [2022-07-10 00:54:21,334][25689] Fps is (10 sec: 5692.2, 60 sec: 5691.0, 300 sec: 5682.3). Total num frames: 508059648. Throughput: 0: 5004.0. Samples: 508054134. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:21,334][25689] Avg episode reward: [(0, '-44.978')] [2022-07-10 00:54:21,455][26022] Updated weights on worker 0-0, policy_version 496153 (0.00078) [2022-07-10 00:54:23,147][26022] Updated weights on worker 0-0, policy_version 496163 (0.00098) [2022-07-10 00:54:25,062][26022] Updated weights on worker 0-0, policy_version 496173 (0.00083) [2022-07-10 00:54:26,363][25689] Fps is (10 sec: 5707.2, 60 sec: 5689.0, 300 sec: 5682.6). Total num frames: 508089344. Throughput: 0: 5973.9. Samples: 508088788. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:26,363][25689] Avg episode reward: [(0, '-45.270')] [2022-07-10 00:54:26,524][26022] Updated weights on worker 0-0, policy_version 496183 (0.00085) [2022-07-10 00:54:28,666][26022] Updated weights on worker 0-0, policy_version 496193 (0.00086) [2022-07-10 00:54:30,032][26022] Updated weights on worker 0-0, policy_version 496203 (0.00092) [2022-07-10 00:54:31,391][25689] Fps is (10 sec: 5701.5, 60 sec: 5687.8, 300 sec: 5683.0). Total num frames: 508116992. Throughput: 0: 5971.8. Samples: 508123286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:31,392][25689] Avg episode reward: [(0, '-46.330')] [2022-07-10 00:54:32,164][26022] Updated weights on worker 0-0, policy_version 496213 (0.00081) [2022-07-10 00:54:33,875][26022] Updated weights on worker 0-0, policy_version 496223 (0.00083) [2022-07-10 00:54:35,612][26022] Updated weights on worker 0-0, policy_version 496233 (0.00085) [2022-07-10 00:54:36,447][25689] Fps is (10 sec: 5787.8, 60 sec: 5703.9, 300 sec: 5682.5). Total num frames: 508147712. Throughput: 0: 5127.5. Samples: 508140444. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:36,449][25689] Avg episode reward: [(0, '-46.732')] [2022-07-10 00:54:37,460][26022] Updated weights on worker 0-0, policy_version 496243 (0.00082) [2022-07-10 00:54:39,216][26022] Updated weights on worker 0-0, policy_version 496253 (0.00085) [2022-07-10 00:54:40,873][26022] Updated weights on worker 0-0, policy_version 496263 (0.00083) [2022-07-10 00:54:41,518][25689] Fps is (10 sec: 5864.8, 60 sec: 5702.8, 300 sec: 5685.1). Total num frames: 508176384. Throughput: 0: 5992.5. Samples: 508175090. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:41,519][25689] Avg episode reward: [(0, '-45.816')] [2022-07-10 00:54:42,703][26022] Updated weights on worker 0-0, policy_version 496273 (0.00089) [2022-07-10 00:54:44,310][26022] Updated weights on worker 0-0, policy_version 496283 (0.00090) [2022-07-10 00:54:46,349][26022] Updated weights on worker 0-0, policy_version 496293 (0.00087) [2022-07-10 00:54:46,546][25689] Fps is (10 sec: 5678.5, 60 sec: 5705.4, 300 sec: 5688.3). Total num frames: 508205056. Throughput: 0: 6001.1. Samples: 508209908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:46,546][25689] Avg episode reward: [(0, '-45.793')] [2022-07-10 00:54:48,002][26022] Updated weights on worker 0-0, policy_version 496303 (0.00084) [2022-07-10 00:54:49,922][26022] Updated weights on worker 0-0, policy_version 496313 (0.00093) [2022-07-10 00:54:51,559][25689] Fps is (10 sec: 5711.2, 60 sec: 5687.7, 300 sec: 5682.5). Total num frames: 508233728. Throughput: 0: 5139.5. Samples: 508226936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:51,561][25689] Avg episode reward: [(0, '-44.727')] [2022-07-10 00:54:51,718][26022] Updated weights on worker 0-0, policy_version 496323 (0.00093) [2022-07-10 00:54:53,418][26022] Updated weights on worker 0-0, policy_version 496333 (0.00089) [2022-07-10 00:54:55,371][26022] Updated weights on worker 0-0, policy_version 496343 (0.00091) [2022-07-10 00:54:56,625][25689] Fps is (10 sec: 5790.8, 60 sec: 5723.3, 300 sec: 5688.8). Total num frames: 508263424. Throughput: 0: 5997.1. Samples: 508261452. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:54:56,626][25689] Avg episode reward: [(0, '-44.388')] [2022-07-10 00:54:56,951][26022] Updated weights on worker 0-0, policy_version 496353 (0.00094) [2022-07-10 00:54:58,883][26022] Updated weights on worker 0-0, policy_version 496363 (0.00087) [2022-07-10 00:55:00,584][26022] Updated weights on worker 0-0, policy_version 496373 (0.00087) [2022-07-10 00:55:01,629][25689] Fps is (10 sec: 5592.6, 60 sec: 5655.6, 300 sec: 5685.7). Total num frames: 508290048. Throughput: 0: 6010.6. Samples: 508295968. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:01,630][25689] Avg episode reward: [(0, '-43.614')] [2022-07-10 00:55:02,652][26022] Updated weights on worker 0-0, policy_version 496383 (0.00090) [2022-07-10 00:55:04,465][26022] Updated weights on worker 0-0, policy_version 496393 (0.00091) [2022-07-10 00:55:06,254][26022] Updated weights on worker 0-0, policy_version 496403 (0.00085) [2022-07-10 00:55:06,633][25689] Fps is (10 sec: 5422.8, 60 sec: 5677.2, 300 sec: 5685.8). Total num frames: 508317696. Throughput: 0: 5030.6. Samples: 508310960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:06,634][25689] Avg episode reward: [(0, '-41.973')] [2022-07-10 00:55:08,206][26022] Updated weights on worker 0-0, policy_version 496413 (0.00085) [2022-07-10 00:55:10,099][26022] Updated weights on worker 0-0, policy_version 496423 (0.00079) [2022-07-10 00:55:11,647][25689] Fps is (10 sec: 5621.7, 60 sec: 5697.6, 300 sec: 5684.8). Total num frames: 508346368. Throughput: 0: 5899.3. Samples: 508345446. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:11,648][25689] Avg episode reward: [(0, '-42.645')] [2022-07-10 00:55:11,739][26022] Updated weights on worker 0-0, policy_version 496433 (0.00090) [2022-07-10 00:55:13,360][26022] Updated weights on worker 0-0, policy_version 496443 (0.00081) [2022-07-10 00:55:15,138][26022] Updated weights on worker 0-0, policy_version 496453 (0.00084) [2022-07-10 00:55:16,728][25689] Fps is (10 sec: 5782.2, 60 sec: 5695.2, 300 sec: 5690.2). Total num frames: 508376064. Throughput: 0: 5893.6. Samples: 508379928. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:16,729][25689] Avg episode reward: [(0, '-43.220')] [2022-07-10 00:55:17,048][26022] Updated weights on worker 0-0, policy_version 496463 (0.00089) [2022-07-10 00:55:18,718][26022] Updated weights on worker 0-0, policy_version 496473 (0.01169) [2022-07-10 00:55:20,646][26022] Updated weights on worker 0-0, policy_version 496483 (0.00083) [2022-07-10 00:55:21,737][25689] Fps is (10 sec: 5784.7, 60 sec: 5713.0, 300 sec: 5687.3). Total num frames: 508404736. Throughput: 0: 5893.3. Samples: 508414472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:21,738][25689] Avg episode reward: [(0, '-43.267')] [2022-07-10 00:55:22,164][26022] Updated weights on worker 0-0, policy_version 496493 (0.00091) [2022-07-10 00:55:24,429][26022] Updated weights on worker 0-0, policy_version 496503 (0.00092) [2022-07-10 00:55:25,754][26022] Updated weights on worker 0-0, policy_version 496513 (0.00093) [2022-07-10 00:55:26,763][25689] Fps is (10 sec: 5714.1, 60 sec: 5696.4, 300 sec: 5690.3). Total num frames: 508433408. Throughput: 0: 5998.5. Samples: 508431710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:26,765][25689] Avg episode reward: [(0, '-43.178')] [2022-07-10 00:55:27,812][26022] Updated weights on worker 0-0, policy_version 496523 (0.00088) [2022-07-10 00:55:29,403][26022] Updated weights on worker 0-0, policy_version 496533 (0.00096) [2022-07-10 00:55:31,159][26022] Updated weights on worker 0-0, policy_version 496543 (0.00091) [2022-07-10 00:55:31,788][25689] Fps is (10 sec: 5705.7, 60 sec: 5713.7, 300 sec: 5680.8). Total num frames: 508462080. Throughput: 0: 6002.2. Samples: 508466332. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:31,789][25689] Avg episode reward: [(0, '-43.758')] [2022-07-10 00:55:33,120][26022] Updated weights on worker 0-0, policy_version 496553 (0.00092) [2022-07-10 00:55:34,861][26022] Updated weights on worker 0-0, policy_version 496563 (0.00091) [2022-07-10 00:55:36,490][26022] Updated weights on worker 0-0, policy_version 496573 (0.00086) [2022-07-10 00:55:36,868][25689] Fps is (10 sec: 5877.5, 60 sec: 5711.4, 300 sec: 5689.9). Total num frames: 508492800. Throughput: 0: 6007.2. Samples: 508500916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:36,869][25689] Avg episode reward: [(0, '-44.296')] [2022-07-10 00:55:38,561][26022] Updated weights on worker 0-0, policy_version 496583 (0.00086) [2022-07-10 00:55:40,184][26022] Updated weights on worker 0-0, policy_version 496593 (0.00088) [2022-07-10 00:55:41,892][25689] Fps is (10 sec: 5877.7, 60 sec: 5715.8, 300 sec: 5689.8). Total num frames: 508521472. Throughput: 0: 5143.8. Samples: 508518144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:41,894][25689] Avg episode reward: [(0, '-44.051')] [2022-07-10 00:55:41,907][26022] Updated weights on worker 0-0, policy_version 496603 (0.00090) [2022-07-10 00:55:43,803][26022] Updated weights on worker 0-0, policy_version 496613 (0.00108) [2022-07-10 00:55:45,514][26022] Updated weights on worker 0-0, policy_version 496623 (0.00086) [2022-07-10 00:55:46,311][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:55:46,322][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000496627_508546048.pth [2022-07-10 00:55:46,322][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000494625_506496000.pth [2022-07-10 00:55:46,902][25689] Fps is (10 sec: 5612.7, 60 sec: 5700.5, 300 sec: 5686.7). Total num frames: 508549120. Throughput: 0: 6005.9. Samples: 508552664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:46,904][25689] Avg episode reward: [(0, '-43.948')] [2022-07-10 00:55:47,527][26022] Updated weights on worker 0-0, policy_version 496633 (0.00088) [2022-07-10 00:55:49,131][26022] Updated weights on worker 0-0, policy_version 496643 (0.00087) [2022-07-10 00:55:51,052][26022] Updated weights on worker 0-0, policy_version 496653 (0.00088) [2022-07-10 00:55:51,916][25689] Fps is (10 sec: 5721.0, 60 sec: 5717.4, 300 sec: 5691.9). Total num frames: 508578816. Throughput: 0: 5980.6. Samples: 508586710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:51,916][25689] Avg episode reward: [(0, '-43.422')] [2022-07-10 00:55:52,891][26022] Updated weights on worker 0-0, policy_version 496663 (0.00077) [2022-07-10 00:55:54,619][26022] Updated weights on worker 0-0, policy_version 496673 (0.00082) [2022-07-10 00:55:56,248][26022] Updated weights on worker 0-0, policy_version 496683 (0.00097) [2022-07-10 00:55:57,014][25689] Fps is (10 sec: 5671.0, 60 sec: 5680.5, 300 sec: 5687.0). Total num frames: 508606464. Throughput: 0: 5111.8. Samples: 508603900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:55:57,015][25689] Avg episode reward: [(0, '-42.982')] [2022-07-10 00:55:58,070][26022] Updated weights on worker 0-0, policy_version 496693 (0.00080) [2022-07-10 00:56:00,010][26022] Updated weights on worker 0-0, policy_version 496703 (0.00086) [2022-07-10 00:56:01,584][26022] Updated weights on worker 0-0, policy_version 496713 (0.00085) [2022-07-10 00:56:02,084][25689] Fps is (10 sec: 5739.9, 60 sec: 5742.0, 300 sec: 5697.0). Total num frames: 508637184. Throughput: 0: 5964.2. Samples: 508638574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:02,085][25689] Avg episode reward: [(0, '-44.248')] [2022-07-10 00:56:03,738][26022] Updated weights on worker 0-0, policy_version 496723 (0.00081) [2022-07-10 00:56:05,461][26022] Updated weights on worker 0-0, policy_version 496733 (0.00085) [2022-07-10 00:56:07,168][25689] Fps is (10 sec: 5748.4, 60 sec: 5734.5, 300 sec: 5692.6). Total num frames: 508664832. Throughput: 0: 5840.3. Samples: 508671022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:07,169][25689] Avg episode reward: [(0, '-43.737')] [2022-07-10 00:56:07,179][26022] Updated weights on worker 0-0, policy_version 496743 (0.00083) [2022-07-10 00:56:09,011][26022] Updated weights on worker 0-0, policy_version 496753 (0.00090) [2022-07-10 00:56:10,967][26022] Updated weights on worker 0-0, policy_version 496763 (0.00090) [2022-07-10 00:56:12,207][25689] Fps is (10 sec: 5361.3, 60 sec: 5698.3, 300 sec: 5682.7). Total num frames: 508691456. Throughput: 0: 5011.0. Samples: 508688392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:12,209][25689] Avg episode reward: [(0, '-44.084')] [2022-07-10 00:56:12,606][26022] Updated weights on worker 0-0, policy_version 496773 (0.00090) [2022-07-10 00:56:14,418][26022] Updated weights on worker 0-0, policy_version 496783 (0.00084) [2022-07-10 00:56:16,222][26022] Updated weights on worker 0-0, policy_version 496793 (0.00079) [2022-07-10 00:56:17,303][25689] Fps is (10 sec: 5658.2, 60 sec: 5713.8, 300 sec: 5691.3). Total num frames: 508722176. Throughput: 0: 5867.2. Samples: 508722936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:17,303][25689] Avg episode reward: [(0, '-43.853')] [2022-07-10 00:56:18,103][26022] Updated weights on worker 0-0, policy_version 496803 (0.00091) [2022-07-10 00:56:19,768][26022] Updated weights on worker 0-0, policy_version 496813 (0.00088) [2022-07-10 00:56:21,695][26022] Updated weights on worker 0-0, policy_version 496823 (0.00087) [2022-07-10 00:56:22,316][25689] Fps is (10 sec: 5774.0, 60 sec: 5696.5, 300 sec: 5681.2). Total num frames: 508749824. Throughput: 0: 5855.6. Samples: 508757042. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:22,319][25689] Avg episode reward: [(0, '-45.308')] [2022-07-10 00:56:23,397][26022] Updated weights on worker 0-0, policy_version 496833 (0.00086) [2022-07-10 00:56:25,219][26022] Updated weights on worker 0-0, policy_version 496843 (0.00088) [2022-07-10 00:56:26,996][26022] Updated weights on worker 0-0, policy_version 496853 (0.00083) [2022-07-10 00:56:27,394][25689] Fps is (10 sec: 5682.5, 60 sec: 5708.5, 300 sec: 5694.0). Total num frames: 508779520. Throughput: 0: 5104.3. Samples: 508774262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:27,395][25689] Avg episode reward: [(0, '-44.390')] [2022-07-10 00:56:29,023][26022] Updated weights on worker 0-0, policy_version 496863 (0.00085) [2022-07-10 00:56:30,603][26022] Updated weights on worker 0-0, policy_version 496873 (0.00087) [2022-07-10 00:56:32,338][26022] Updated weights on worker 0-0, policy_version 496883 (0.00092) [2022-07-10 00:56:32,402][25689] Fps is (10 sec: 5786.8, 60 sec: 5710.0, 300 sec: 5692.8). Total num frames: 508808192. Throughput: 0: 5942.5. Samples: 508808402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 00:56:32,403][25689] Avg episode reward: [(0, '-43.811')] [2022-07-10 00:56:34,216][26022] Updated weights on worker 0-0, policy_version 496893 (0.00617) [2022-07-10 00:56:36,202][26022] Updated weights on worker 0-0, policy_version 496903 (0.00086) [2022-07-10 00:56:37,489][25689] Fps is (10 sec: 5680.7, 60 sec: 5675.7, 300 sec: 5688.1). Total num frames: 508836864. Throughput: 0: 5936.8. Samples: 508842776. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:56:37,489][25689] Avg episode reward: [(0, '-44.017')] [2022-07-10 00:56:37,778][26022] Updated weights on worker 0-0, policy_version 496913 (0.00083) [2022-07-10 00:56:39,793][26022] Updated weights on worker 0-0, policy_version 496923 (0.00087) [2022-07-10 00:56:41,296][26022] Updated weights on worker 0-0, policy_version 496933 (0.00092) [2022-07-10 00:56:42,531][25689] Fps is (10 sec: 5560.7, 60 sec: 5657.1, 300 sec: 5684.8). Total num frames: 508864512. Throughput: 0: 5082.8. Samples: 508859790. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:56:42,531][25689] Avg episode reward: [(0, '-44.728')] [2022-07-10 00:56:43,259][26022] Updated weights on worker 0-0, policy_version 496943 (0.00087) [2022-07-10 00:56:44,993][26022] Updated weights on worker 0-0, policy_version 496953 (0.00091) [2022-07-10 00:56:46,547][26022] Updated weights on worker 0-0, policy_version 496963 (0.00078) [2022-07-10 00:56:47,563][25689] Fps is (10 sec: 5692.2, 60 sec: 5688.9, 300 sec: 5688.0). Total num frames: 508894208. Throughput: 0: 5963.4. Samples: 508894536. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:56:47,563][25689] Avg episode reward: [(0, '-44.539')] [2022-07-10 00:56:48,395][26022] Updated weights on worker 0-0, policy_version 496973 (0.00088) [2022-07-10 00:56:50,359][26022] Updated weights on worker 0-0, policy_version 496983 (0.00087) [2022-07-10 00:56:51,962][26022] Updated weights on worker 0-0, policy_version 496993 (0.00686) [2022-07-10 00:56:52,584][25689] Fps is (10 sec: 5805.8, 60 sec: 5671.2, 300 sec: 5686.0). Total num frames: 508922880. Throughput: 0: 5961.8. Samples: 508928720. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:56:52,585][25689] Avg episode reward: [(0, '-44.075')] [2022-07-10 00:56:54,044][26022] Updated weights on worker 0-0, policy_version 497003 (0.00086) [2022-07-10 00:56:55,877][26022] Updated weights on worker 0-0, policy_version 497013 (0.00089) [2022-07-10 00:56:57,609][26022] Updated weights on worker 0-0, policy_version 497023 (0.00080) [2022-07-10 00:56:57,700][25689] Fps is (10 sec: 5757.9, 60 sec: 5703.4, 300 sec: 5687.3). Total num frames: 508952576. Throughput: 0: 5092.8. Samples: 508945706. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:56:57,700][25689] Avg episode reward: [(0, '-44.573')] [2022-07-10 00:56:59,322][26022] Updated weights on worker 0-0, policy_version 497033 (0.00089) [2022-07-10 00:57:01,006][26022] Updated weights on worker 0-0, policy_version 497043 (0.00089) [2022-07-10 00:57:02,755][25689] Fps is (10 sec: 5537.4, 60 sec: 5637.3, 300 sec: 5690.2). Total num frames: 508979200. Throughput: 0: 5942.3. Samples: 508979968. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:02,755][25689] Avg episode reward: [(0, '-44.519')] [2022-07-10 00:57:03,289][26022] Updated weights on worker 0-0, policy_version 497053 (0.00083) [2022-07-10 00:57:05,383][26022] Updated weights on worker 0-0, policy_version 497063 (0.00090) [2022-07-10 00:57:06,801][26022] Updated weights on worker 0-0, policy_version 497073 (0.00083) [2022-07-10 00:57:07,810][25689] Fps is (10 sec: 5469.1, 60 sec: 5656.8, 300 sec: 5690.0). Total num frames: 509007872. Throughput: 0: 5803.3. Samples: 509012038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:07,811][25689] Avg episode reward: [(0, '-43.241')] [2022-07-10 00:57:08,885][26022] Updated weights on worker 0-0, policy_version 497083 (0.00091) [2022-07-10 00:57:10,310][26022] Updated weights on worker 0-0, policy_version 497093 (0.00094) [2022-07-10 00:57:12,407][26022] Updated weights on worker 0-0, policy_version 497103 (0.00084) [2022-07-10 00:57:12,850][25689] Fps is (10 sec: 5578.9, 60 sec: 5673.6, 300 sec: 5680.0). Total num frames: 509035520. Throughput: 0: 4950.8. Samples: 509029054. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:12,851][25689] Avg episode reward: [(0, '-43.115')] [2022-07-10 00:57:14,030][26022] Updated weights on worker 0-0, policy_version 497113 (0.00083) [2022-07-10 00:57:15,991][26022] Updated weights on worker 0-0, policy_version 497123 (0.00090) [2022-07-10 00:57:17,616][26022] Updated weights on worker 0-0, policy_version 497133 (0.00085) [2022-07-10 00:57:17,967][25689] Fps is (10 sec: 5746.8, 60 sec: 5671.6, 300 sec: 5688.4). Total num frames: 509066240. Throughput: 0: 5809.9. Samples: 509063456. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:17,967][25689] Avg episode reward: [(0, '-42.551')] [2022-07-10 00:57:19,463][26022] Updated weights on worker 0-0, policy_version 497143 (0.00085) [2022-07-10 00:57:21,268][26022] Updated weights on worker 0-0, policy_version 497153 (0.00093) [2022-07-10 00:57:23,054][25689] Fps is (10 sec: 5719.9, 60 sec: 5664.7, 300 sec: 5680.5). Total num frames: 509093888. Throughput: 0: 5799.7. Samples: 509097700. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:23,055][25689] Avg episode reward: [(0, '-42.397')] [2022-07-10 00:57:23,254][26022] Updated weights on worker 0-0, policy_version 497163 (0.00094) [2022-07-10 00:57:24,706][26022] Updated weights on worker 0-0, policy_version 497173 (0.00084) [2022-07-10 00:57:26,875][26022] Updated weights on worker 0-0, policy_version 497183 (0.00094) [2022-07-10 00:57:28,060][25689] Fps is (10 sec: 5681.7, 60 sec: 5671.5, 300 sec: 5687.7). Total num frames: 509123584. Throughput: 0: 5920.1. Samples: 509131916. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:28,060][25689] Avg episode reward: [(0, '-41.745')] [2022-07-10 00:57:28,553][26022] Updated weights on worker 0-0, policy_version 497193 (0.00051) [2022-07-10 00:57:30,286][26022] Updated weights on worker 0-0, policy_version 497203 (0.00084) [2022-07-10 00:57:32,127][26022] Updated weights on worker 0-0, policy_version 497213 (0.00089) [2022-07-10 00:57:33,155][25689] Fps is (10 sec: 5677.2, 60 sec: 5646.5, 300 sec: 5680.5). Total num frames: 509151232. Throughput: 0: 5906.3. Samples: 509148982. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:33,156][25689] Avg episode reward: [(0, '-41.973')] [2022-07-10 00:57:33,937][26022] Updated weights on worker 0-0, policy_version 497223 (0.00092) [2022-07-10 00:57:35,729][26022] Updated weights on worker 0-0, policy_version 497233 (0.00093) [2022-07-10 00:57:37,442][26022] Updated weights on worker 0-0, policy_version 497243 (0.00081) [2022-07-10 00:57:38,208][25689] Fps is (10 sec: 5549.7, 60 sec: 5649.6, 300 sec: 5683.7). Total num frames: 509179904. Throughput: 0: 5915.8. Samples: 509183198. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:38,209][25689] Avg episode reward: [(0, '-42.220')] [2022-07-10 00:57:39,439][26022] Updated weights on worker 0-0, policy_version 497253 (0.00089) [2022-07-10 00:57:41,064][26022] Updated weights on worker 0-0, policy_version 497263 (0.00089) [2022-07-10 00:57:42,999][26022] Updated weights on worker 0-0, policy_version 497273 (0.00086) [2022-07-10 00:57:43,278][25689] Fps is (10 sec: 5563.6, 60 sec: 5647.0, 300 sec: 5672.2). Total num frames: 509207552. Throughput: 0: 5906.0. Samples: 509217142. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:43,279][25689] Avg episode reward: [(0, '-41.774')] [2022-07-10 00:57:44,688][26022] Updated weights on worker 0-0, policy_version 497283 (0.00093) [2022-07-10 00:57:46,356][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:57:46,369][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000497291_509225984.pth [2022-07-10 00:57:46,370][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000495293_507180032.pth [2022-07-10 00:57:46,658][26022] Updated weights on worker 0-0, policy_version 497293 (0.00090) [2022-07-10 00:57:48,306][25689] Fps is (10 sec: 5679.0, 60 sec: 5647.4, 300 sec: 5678.7). Total num frames: 509237248. Throughput: 0: 5045.2. Samples: 509234060. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:48,306][25689] Avg episode reward: [(0, '-41.744')] [2022-07-10 00:57:48,364][26022] Updated weights on worker 0-0, policy_version 497303 (0.00084) [2022-07-10 00:57:50,037][26022] Updated weights on worker 0-0, policy_version 497313 (0.00084) [2022-07-10 00:57:51,976][26022] Updated weights on worker 0-0, policy_version 497323 (0.00103) [2022-07-10 00:57:53,311][25689] Fps is (10 sec: 5817.7, 60 sec: 5648.9, 300 sec: 5680.0). Total num frames: 509265920. Throughput: 0: 5923.3. Samples: 509268370. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:53,311][25689] Avg episode reward: [(0, '-40.884')] [2022-07-10 00:57:53,801][26022] Updated weights on worker 0-0, policy_version 497333 (0.00084) [2022-07-10 00:57:55,516][26022] Updated weights on worker 0-0, policy_version 497343 (0.00084) [2022-07-10 00:57:57,187][26022] Updated weights on worker 0-0, policy_version 497353 (0.00088) [2022-07-10 00:57:58,358][25689] Fps is (10 sec: 5704.4, 60 sec: 5638.4, 300 sec: 5680.4). Total num frames: 509294592. Throughput: 0: 5929.9. Samples: 509302686. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:57:58,359][25689] Avg episode reward: [(0, '-41.621')] [2022-07-10 00:57:59,021][26022] Updated weights on worker 0-0, policy_version 497363 (0.00402) [2022-07-10 00:58:00,898][26022] Updated weights on worker 0-0, policy_version 497373 (0.00090) [2022-07-10 00:58:03,112][26022] Updated weights on worker 0-0, policy_version 497383 (0.00093) [2022-07-10 00:58:03,386][25689] Fps is (10 sec: 5589.8, 60 sec: 5657.8, 300 sec: 5680.4). Total num frames: 509322240. Throughput: 0: 5104.5. Samples: 509319784. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:03,387][25689] Avg episode reward: [(0, '-42.064')] [2022-07-10 00:58:04,861][26022] Updated weights on worker 0-0, policy_version 497393 (0.00086) [2022-07-10 00:58:06,649][26022] Updated weights on worker 0-0, policy_version 497403 (0.00091) [2022-07-10 00:58:08,390][25689] Fps is (10 sec: 5614.0, 60 sec: 5662.6, 300 sec: 5684.1). Total num frames: 509350912. Throughput: 0: 5858.4. Samples: 509351724. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:08,391][25689] Avg episode reward: [(0, '-41.143')] [2022-07-10 00:58:08,404][26022] Updated weights on worker 0-0, policy_version 497413 (0.00099) [2022-07-10 00:58:10,343][26022] Updated weights on worker 0-0, policy_version 497423 (0.00091) [2022-07-10 00:58:12,187][26022] Updated weights on worker 0-0, policy_version 497433 (0.00088) [2022-07-10 00:58:13,411][25689] Fps is (10 sec: 5618.0, 60 sec: 5664.3, 300 sec: 5678.2). Total num frames: 509378560. Throughput: 0: 5847.6. Samples: 509385908. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:13,412][25689] Avg episode reward: [(0, '-41.760')] [2022-07-10 00:58:13,903][26022] Updated weights on worker 0-0, policy_version 497443 (0.00084) [2022-07-10 00:58:15,769][26022] Updated weights on worker 0-0, policy_version 497453 (0.00086) [2022-07-10 00:58:17,640][26022] Updated weights on worker 0-0, policy_version 497463 (0.00088) [2022-07-10 00:58:18,495][25689] Fps is (10 sec: 5675.0, 60 sec: 5650.5, 300 sec: 5687.4). Total num frames: 509408256. Throughput: 0: 4992.0. Samples: 509403208. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:18,495][25689] Avg episode reward: [(0, '-42.560')] [2022-07-10 00:58:19,267][26022] Updated weights on worker 0-0, policy_version 497473 (0.00093) [2022-07-10 00:58:21,132][26022] Updated weights on worker 0-0, policy_version 497483 (0.00089) [2022-07-10 00:58:22,934][26022] Updated weights on worker 0-0, policy_version 497493 (0.00089) [2022-07-10 00:58:23,511][25689] Fps is (10 sec: 5678.0, 60 sec: 5657.3, 300 sec: 5680.3). Total num frames: 509435904. Throughput: 0: 5852.8. Samples: 509437568. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:23,511][25689] Avg episode reward: [(0, '-42.852')] [2022-07-10 00:58:24,871][26022] Updated weights on worker 0-0, policy_version 497503 (0.00084) [2022-07-10 00:58:26,364][26022] Updated weights on worker 0-0, policy_version 497513 (0.00086) [2022-07-10 00:58:28,343][26022] Updated weights on worker 0-0, policy_version 497523 (0.00086) [2022-07-10 00:58:28,515][25689] Fps is (10 sec: 5620.9, 60 sec: 5640.4, 300 sec: 5684.0). Total num frames: 509464576. Throughput: 0: 5959.7. Samples: 509471660. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:28,515][25689] Avg episode reward: [(0, '-42.542')] [2022-07-10 00:58:29,868][26022] Updated weights on worker 0-0, policy_version 497533 (0.00090) [2022-07-10 00:58:31,815][26022] Updated weights on worker 0-0, policy_version 497543 (0.00089) [2022-07-10 00:58:33,521][25689] Fps is (10 sec: 5626.0, 60 sec: 5648.7, 300 sec: 5677.9). Total num frames: 509492224. Throughput: 0: 5112.5. Samples: 509488720. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:33,522][25689] Avg episode reward: [(0, '-42.779')] [2022-07-10 00:58:33,949][26022] Updated weights on worker 0-0, policy_version 497553 (0.00091) [2022-07-10 00:58:35,291][26022] Updated weights on worker 0-0, policy_version 497563 (0.00099) [2022-07-10 00:58:37,480][26022] Updated weights on worker 0-0, policy_version 497573 (0.00089) [2022-07-10 00:58:38,582][25689] Fps is (10 sec: 5899.4, 60 sec: 5698.9, 300 sec: 5688.2). Total num frames: 509523968. Throughput: 0: 5960.9. Samples: 509522946. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:38,583][25689] Avg episode reward: [(0, '-42.809')] [2022-07-10 00:58:38,750][26022] Updated weights on worker 0-0, policy_version 497583 (0.00084) [2022-07-10 00:58:40,855][26022] Updated weights on worker 0-0, policy_version 497593 (0.00095) [2022-07-10 00:58:42,619][26022] Updated weights on worker 0-0, policy_version 497603 (0.00089) [2022-07-10 00:58:43,586][25689] Fps is (10 sec: 5697.8, 60 sec: 5671.2, 300 sec: 5678.9). Total num frames: 509549568. Throughput: 0: 5970.7. Samples: 509557430. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:43,586][25689] Avg episode reward: [(0, '-42.673')] [2022-07-10 00:58:44,260][26022] Updated weights on worker 0-0, policy_version 497613 (0.00089) [2022-07-10 00:58:46,228][26022] Updated weights on worker 0-0, policy_version 497623 (0.00095) [2022-07-10 00:58:47,844][26022] Updated weights on worker 0-0, policy_version 497633 (0.00095) [2022-07-10 00:58:48,599][25689] Fps is (10 sec: 5622.4, 60 sec: 5689.5, 300 sec: 5682.1). Total num frames: 509580288. Throughput: 0: 5130.7. Samples: 509574708. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:48,600][25689] Avg episode reward: [(0, '-41.773')] [2022-07-10 00:58:49,786][26022] Updated weights on worker 0-0, policy_version 497643 (0.00090) [2022-07-10 00:58:51,587][26022] Updated weights on worker 0-0, policy_version 497653 (0.00081) [2022-07-10 00:58:53,558][26022] Updated weights on worker 0-0, policy_version 497663 (0.00083) [2022-07-10 00:58:53,614][25689] Fps is (10 sec: 5717.9, 60 sec: 5654.6, 300 sec: 5680.0). Total num frames: 509606912. Throughput: 0: 5969.4. Samples: 509608664. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:53,615][25689] Avg episode reward: [(0, '-41.418')] [2022-07-10 00:58:55,219][26022] Updated weights on worker 0-0, policy_version 497673 (0.00085) [2022-07-10 00:58:57,187][26022] Updated weights on worker 0-0, policy_version 497683 (0.00088) [2022-07-10 00:58:58,675][25689] Fps is (10 sec: 5487.9, 60 sec: 5653.3, 300 sec: 5672.1). Total num frames: 509635584. Throughput: 0: 5950.0. Samples: 509642500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:58:58,676][25689] Avg episode reward: [(0, '-41.234')] [2022-07-10 00:58:58,767][26022] Updated weights on worker 0-0, policy_version 497693 (0.00089) [2022-07-10 00:59:00,741][26022] Updated weights on worker 0-0, policy_version 497703 (0.00094) [2022-07-10 00:59:02,805][26022] Updated weights on worker 0-0, policy_version 497713 (0.00083) [2022-07-10 00:59:03,678][25689] Fps is (10 sec: 5494.9, 60 sec: 5638.8, 300 sec: 5673.0). Total num frames: 509662208. Throughput: 0: 5091.0. Samples: 509659718. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:59:03,678][25689] Avg episode reward: [(0, '-41.289')] [2022-07-10 00:59:04,737][26022] Updated weights on worker 0-0, policy_version 497723 (0.00084) [2022-07-10 00:59:06,498][26022] Updated weights on worker 0-0, policy_version 497733 (0.00084) [2022-07-10 00:59:08,391][26022] Updated weights on worker 0-0, policy_version 497743 (0.00102) [2022-07-10 00:59:08,679][25689] Fps is (10 sec: 5527.6, 60 sec: 5639.0, 300 sec: 5677.4). Total num frames: 509690880. Throughput: 0: 5827.6. Samples: 509691724. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:59:08,680][25689] Avg episode reward: [(0, '-41.052')] [2022-07-10 00:59:10,175][26022] Updated weights on worker 0-0, policy_version 497753 (0.00090) [2022-07-10 00:59:11,903][26022] Updated weights on worker 0-0, policy_version 497763 (0.00086) [2022-07-10 00:59:13,706][25689] Fps is (10 sec: 5616.2, 60 sec: 5638.5, 300 sec: 5671.1). Total num frames: 509718528. Throughput: 0: 5824.7. Samples: 509725690. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:59:13,706][25689] Avg episode reward: [(0, '-41.246')] [2022-07-10 00:59:13,858][26022] Updated weights on worker 0-0, policy_version 497773 (0.00089) [2022-07-10 00:59:15,493][26022] Updated weights on worker 0-0, policy_version 497783 (0.00082) [2022-07-10 00:59:17,545][26022] Updated weights on worker 0-0, policy_version 497793 (0.00086) [2022-07-10 00:59:18,749][25689] Fps is (10 sec: 5593.0, 60 sec: 5625.3, 300 sec: 5674.1). Total num frames: 509747200. Throughput: 0: 4992.8. Samples: 509742722. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:59:18,749][25689] Avg episode reward: [(0, '-41.823')] [2022-07-10 00:59:19,143][26022] Updated weights on worker 0-0, policy_version 497803 (0.00092) [2022-07-10 00:59:21,063][26022] Updated weights on worker 0-0, policy_version 497813 (0.00094) [2022-07-10 00:59:22,613][26022] Updated weights on worker 0-0, policy_version 497823 (0.00089) [2022-07-10 00:59:23,784][25689] Fps is (10 sec: 5690.1, 60 sec: 5640.5, 300 sec: 5670.5). Total num frames: 509775872. Throughput: 0: 5823.4. Samples: 509776804. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:59:23,784][25689] Avg episode reward: [(0, '-41.981')] [2022-07-10 00:59:24,679][26022] Updated weights on worker 0-0, policy_version 497833 (0.00090) [2022-07-10 00:59:26,216][26022] Updated weights on worker 0-0, policy_version 497843 (0.00090) [2022-07-10 00:59:28,367][26022] Updated weights on worker 0-0, policy_version 497853 (0.00087) [2022-07-10 00:59:28,815][25689] Fps is (10 sec: 5696.3, 60 sec: 5637.9, 300 sec: 5673.9). Total num frames: 509804544. Throughput: 0: 5908.3. Samples: 509810696. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 00:59:28,817][25689] Avg episode reward: [(0, '-42.409')] [2022-07-10 00:59:29,842][26022] Updated weights on worker 0-0, policy_version 497863 (0.00086) [2022-07-10 00:59:31,884][26022] Updated weights on worker 0-0, policy_version 497873 (0.00091) [2022-07-10 00:59:33,475][26022] Updated weights on worker 0-0, policy_version 497883 (0.00087) [2022-07-10 00:59:33,827][25689] Fps is (10 sec: 5607.5, 60 sec: 5637.4, 300 sec: 5664.4). Total num frames: 509832192. Throughput: 0: 5069.9. Samples: 509827704. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 00:59:33,829][25689] Avg episode reward: [(0, '-42.423')] [2022-07-10 00:59:35,257][26022] Updated weights on worker 0-0, policy_version 497893 (0.00084) [2022-07-10 00:59:37,275][26022] Updated weights on worker 0-0, policy_version 497903 (0.00084) [2022-07-10 00:59:38,677][26022] Updated weights on worker 0-0, policy_version 497913 (0.00084) [2022-07-10 00:59:38,932][25689] Fps is (10 sec: 5870.9, 60 sec: 5633.3, 300 sec: 5674.1). Total num frames: 509863936. Throughput: 0: 5925.0. Samples: 509862308. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 00:59:38,932][25689] Avg episode reward: [(0, '-41.947')] [2022-07-10 00:59:40,815][26022] Updated weights on worker 0-0, policy_version 497923 (0.00092) [2022-07-10 00:59:42,376][26022] Updated weights on worker 0-0, policy_version 497933 (0.00088) [2022-07-10 00:59:43,974][25689] Fps is (10 sec: 5853.2, 60 sec: 5663.6, 300 sec: 5670.4). Total num frames: 509891584. Throughput: 0: 5943.8. Samples: 509896814. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 00:59:43,975][25689] Avg episode reward: [(0, '-42.962')] [2022-07-10 00:59:44,216][26022] Updated weights on worker 0-0, policy_version 497943 (0.00084) [2022-07-10 00:59:45,861][26022] Updated weights on worker 0-0, policy_version 497953 (0.00090) [2022-07-10 00:59:46,494][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 00:59:46,506][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000497955_509905920.pth [2022-07-10 00:59:46,507][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000495958_507860992.pth [2022-07-10 00:59:47,846][26022] Updated weights on worker 0-0, policy_version 497963 (0.00087) [2022-07-10 00:59:49,003][25689] Fps is (10 sec: 5592.0, 60 sec: 5628.2, 300 sec: 5670.1). Total num frames: 509920256. Throughput: 0: 5120.2. Samples: 509914062. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 00:59:49,004][25689] Avg episode reward: [(0, '-43.386')] [2022-07-10 00:59:49,765][26022] Updated weights on worker 0-0, policy_version 497973 (0.00094) [2022-07-10 00:59:51,367][26022] Updated weights on worker 0-0, policy_version 497983 (0.00085) [2022-07-10 00:59:53,199][26022] Updated weights on worker 0-0, policy_version 497993 (0.00085) [2022-07-10 00:59:54,051][25689] Fps is (10 sec: 5792.5, 60 sec: 5676.1, 300 sec: 5670.4). Total num frames: 509949952. Throughput: 0: 5958.8. Samples: 509948210. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 00:59:54,051][25689] Avg episode reward: [(0, '-44.740')] [2022-07-10 00:59:55,002][26022] Updated weights on worker 0-0, policy_version 498003 (0.00086) [2022-07-10 00:59:56,867][26022] Updated weights on worker 0-0, policy_version 498013 (0.00094) [2022-07-10 00:59:58,774][26022] Updated weights on worker 0-0, policy_version 498023 (0.00621) [2022-07-10 00:59:59,127][25689] Fps is (10 sec: 5664.5, 60 sec: 5657.7, 300 sec: 5672.5). Total num frames: 509977600. Throughput: 0: 5934.4. Samples: 509982152. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 00:59:59,127][25689] Avg episode reward: [(0, '-44.283')] [2022-07-10 01:00:00,359][26022] Updated weights on worker 0-0, policy_version 498033 (0.00090) [2022-07-10 01:00:02,670][26022] Updated weights on worker 0-0, policy_version 498043 (0.00092) [2022-07-10 01:00:04,140][25689] Fps is (10 sec: 5277.8, 60 sec: 5639.8, 300 sec: 5665.4). Total num frames: 510003200. Throughput: 0: 5053.9. Samples: 509998728. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:04,140][25689] Avg episode reward: [(0, '-44.122')] [2022-07-10 01:00:04,540][26022] Updated weights on worker 0-0, policy_version 498053 (0.00085) [2022-07-10 01:00:06,233][26022] Updated weights on worker 0-0, policy_version 498063 (0.00280) [2022-07-10 01:00:08,132][26022] Updated weights on worker 0-0, policy_version 498073 (0.00086) [2022-07-10 01:00:09,173][25689] Fps is (10 sec: 5503.9, 60 sec: 5653.7, 300 sec: 5668.5). Total num frames: 510032896. Throughput: 0: 5819.9. Samples: 510031450. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:09,174][25689] Avg episode reward: [(0, '-43.345')] [2022-07-10 01:00:09,895][26022] Updated weights on worker 0-0, policy_version 498083 (0.00077) [2022-07-10 01:00:11,884][26022] Updated weights on worker 0-0, policy_version 498093 (0.00079) [2022-07-10 01:00:13,336][26022] Updated weights on worker 0-0, policy_version 498103 (0.00089) [2022-07-10 01:00:14,193][25689] Fps is (10 sec: 5806.0, 60 sec: 5671.3, 300 sec: 5666.2). Total num frames: 510061568. Throughput: 0: 5839.4. Samples: 510065828. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:14,193][25689] Avg episode reward: [(0, '-42.379')] [2022-07-10 01:00:15,316][26022] Updated weights on worker 0-0, policy_version 498113 (0.00090) [2022-07-10 01:00:16,894][26022] Updated weights on worker 0-0, policy_version 498123 (0.00083) [2022-07-10 01:00:18,842][26022] Updated weights on worker 0-0, policy_version 498133 (0.00089) [2022-07-10 01:00:19,242][25689] Fps is (10 sec: 5593.5, 60 sec: 5653.8, 300 sec: 5662.0). Total num frames: 510089216. Throughput: 0: 5026.5. Samples: 510083262. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:19,243][25689] Avg episode reward: [(0, '-41.671')] [2022-07-10 01:00:20,504][26022] Updated weights on worker 0-0, policy_version 498143 (0.00079) [2022-07-10 01:00:22,305][26022] Updated weights on worker 0-0, policy_version 498153 (0.00090) [2022-07-10 01:00:24,234][26022] Updated weights on worker 0-0, policy_version 498163 (0.00085) [2022-07-10 01:00:24,249][25689] Fps is (10 sec: 5702.3, 60 sec: 5673.3, 300 sec: 5665.8). Total num frames: 510118912. Throughput: 0: 5913.1. Samples: 510117638. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:24,250][25689] Avg episode reward: [(0, '-40.696')] [2022-07-10 01:00:26,125][26022] Updated weights on worker 0-0, policy_version 498173 (0.00090) [2022-07-10 01:00:27,799][26022] Updated weights on worker 0-0, policy_version 498183 (0.00089) [2022-07-10 01:00:29,259][25689] Fps is (10 sec: 5724.8, 60 sec: 5658.4, 300 sec: 5662.7). Total num frames: 510146560. Throughput: 0: 5963.1. Samples: 510151224. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:29,260][25689] Avg episode reward: [(0, '-41.616')] [2022-07-10 01:00:29,894][26022] Updated weights on worker 0-0, policy_version 498193 (0.00089) [2022-07-10 01:00:31,329][26022] Updated weights on worker 0-0, policy_version 498203 (0.00096) [2022-07-10 01:00:33,451][26022] Updated weights on worker 0-0, policy_version 498213 (0.00089) [2022-07-10 01:00:34,263][25689] Fps is (10 sec: 5624.0, 60 sec: 5676.1, 300 sec: 5657.2). Total num frames: 510175232. Throughput: 0: 5102.5. Samples: 510168238. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:34,264][25689] Avg episode reward: [(0, '-42.104')] [2022-07-10 01:00:34,971][26022] Updated weights on worker 0-0, policy_version 498223 (0.00095) [2022-07-10 01:00:36,913][26022] Updated weights on worker 0-0, policy_version 498233 (0.00085) [2022-07-10 01:00:38,704][26022] Updated weights on worker 0-0, policy_version 498243 (0.00081) [2022-07-10 01:00:39,390][25689] Fps is (10 sec: 5862.6, 60 sec: 5657.1, 300 sec: 5662.2). Total num frames: 510205952. Throughput: 0: 5919.3. Samples: 510202520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:39,390][25689] Avg episode reward: [(0, '-42.456')] [2022-07-10 01:00:40,452][26022] Updated weights on worker 0-0, policy_version 498253 (0.00086) [2022-07-10 01:00:42,165][26022] Updated weights on worker 0-0, policy_version 498263 (0.00092) [2022-07-10 01:00:44,135][26022] Updated weights on worker 0-0, policy_version 498273 (0.00083) [2022-07-10 01:00:44,424][25689] Fps is (10 sec: 5744.2, 60 sec: 5657.8, 300 sec: 5661.7). Total num frames: 510233600. Throughput: 0: 5917.6. Samples: 510237028. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:44,425][25689] Avg episode reward: [(0, '-43.003')] [2022-07-10 01:00:45,685][26022] Updated weights on worker 0-0, policy_version 498283 (0.00090) [2022-07-10 01:00:47,522][26022] Updated weights on worker 0-0, policy_version 498293 (0.00086) [2022-07-10 01:00:49,375][26022] Updated weights on worker 0-0, policy_version 498303 (0.00099) [2022-07-10 01:00:49,469][25689] Fps is (10 sec: 5587.6, 60 sec: 5656.4, 300 sec: 5657.7). Total num frames: 510262272. Throughput: 0: 5965.3. Samples: 510271782. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:49,470][25689] Avg episode reward: [(0, '-43.365')] [2022-07-10 01:00:51,113][26022] Updated weights on worker 0-0, policy_version 498313 (0.00093) [2022-07-10 01:00:52,970][26022] Updated weights on worker 0-0, policy_version 498323 (0.00095) [2022-07-10 01:00:54,493][25689] Fps is (10 sec: 5797.3, 60 sec: 5658.6, 300 sec: 5666.0). Total num frames: 510291968. Throughput: 0: 5974.2. Samples: 510289092. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:54,493][25689] Avg episode reward: [(0, '-42.885')] [2022-07-10 01:00:54,603][26022] Updated weights on worker 0-0, policy_version 498333 (0.00085) [2022-07-10 01:00:56,487][26022] Updated weights on worker 0-0, policy_version 498343 (0.00082) [2022-07-10 01:00:58,304][26022] Updated weights on worker 0-0, policy_version 498353 (0.00081) [2022-07-10 01:00:59,618][25689] Fps is (10 sec: 5650.0, 60 sec: 5653.9, 300 sec: 5654.6). Total num frames: 510319616. Throughput: 0: 5963.6. Samples: 510323156. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:00:59,619][25689] Avg episode reward: [(0, '-41.670')] [2022-07-10 01:01:00,171][26022] Updated weights on worker 0-0, policy_version 498363 (0.00084) [2022-07-10 01:01:01,808][26022] Updated weights on worker 0-0, policy_version 498373 (0.00089) [2022-07-10 01:01:04,223][26022] Updated weights on worker 0-0, policy_version 498383 (0.00091) [2022-07-10 01:01:04,693][25689] Fps is (10 sec: 5421.2, 60 sec: 5682.0, 300 sec: 5654.8). Total num frames: 510347264. Throughput: 0: 5835.0. Samples: 510355292. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:04,693][25689] Avg episode reward: [(0, '-42.394')] [2022-07-10 01:01:05,756][26022] Updated weights on worker 0-0, policy_version 498393 (0.00085) [2022-07-10 01:01:07,650][26022] Updated weights on worker 0-0, policy_version 498403 (0.00095) [2022-07-10 01:01:09,417][26022] Updated weights on worker 0-0, policy_version 498413 (0.00330) [2022-07-10 01:01:09,720][25689] Fps is (10 sec: 5676.9, 60 sec: 5682.7, 300 sec: 5665.3). Total num frames: 510376960. Throughput: 0: 4974.9. Samples: 510372526. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:09,720][25689] Avg episode reward: [(0, '-42.486')] [2022-07-10 01:01:11,188][26022] Updated weights on worker 0-0, policy_version 498423 (0.00086) [2022-07-10 01:01:13,070][26022] Updated weights on worker 0-0, policy_version 498433 (0.00095) [2022-07-10 01:01:14,752][25689] Fps is (10 sec: 5598.6, 60 sec: 5647.6, 300 sec: 5652.8). Total num frames: 510403584. Throughput: 0: 5800.3. Samples: 510406604. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:14,753][25689] Avg episode reward: [(0, '-42.771')] [2022-07-10 01:01:14,925][26022] Updated weights on worker 0-0, policy_version 498443 (0.00081) [2022-07-10 01:01:16,691][26022] Updated weights on worker 0-0, policy_version 498453 (0.00092) [2022-07-10 01:01:18,498][26022] Updated weights on worker 0-0, policy_version 498463 (0.00085) [2022-07-10 01:01:19,803][25689] Fps is (10 sec: 5686.8, 60 sec: 5698.2, 300 sec: 5662.4). Total num frames: 510434304. Throughput: 0: 5821.2. Samples: 510440658. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:19,804][25689] Avg episode reward: [(0, '-43.097')] [2022-07-10 01:01:20,258][26022] Updated weights on worker 0-0, policy_version 498473 (0.00049) [2022-07-10 01:01:22,067][26022] Updated weights on worker 0-0, policy_version 498483 (0.00083) [2022-07-10 01:01:23,822][26022] Updated weights on worker 0-0, policy_version 498493 (0.00086) [2022-07-10 01:01:24,869][25689] Fps is (10 sec: 5769.3, 60 sec: 5658.8, 300 sec: 5655.7). Total num frames: 510461952. Throughput: 0: 5084.6. Samples: 510457882. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:24,870][25689] Avg episode reward: [(0, '-44.436')] [2022-07-10 01:01:25,570][26022] Updated weights on worker 0-0, policy_version 498503 (0.00091) [2022-07-10 01:01:27,441][26022] Updated weights on worker 0-0, policy_version 498513 (0.00086) [2022-07-10 01:01:29,406][26022] Updated weights on worker 0-0, policy_version 498523 (0.00104) [2022-07-10 01:01:29,886][25689] Fps is (10 sec: 5586.0, 60 sec: 5675.1, 300 sec: 5655.6). Total num frames: 510490624. Throughput: 0: 5923.0. Samples: 510491970. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:29,886][25689] Avg episode reward: [(0, '-44.305')] [2022-07-10 01:01:31,095][26022] Updated weights on worker 0-0, policy_version 498533 (0.00089) [2022-07-10 01:01:32,989][26022] Updated weights on worker 0-0, policy_version 498543 (0.00087) [2022-07-10 01:01:34,674][26022] Updated weights on worker 0-0, policy_version 498553 (0.00088) [2022-07-10 01:01:34,904][25689] Fps is (10 sec: 5714.6, 60 sec: 5673.8, 300 sec: 5656.8). Total num frames: 510519296. Throughput: 0: 5930.6. Samples: 510526116. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:34,904][25689] Avg episode reward: [(0, '-44.088')] [2022-07-10 01:01:36,737][26022] Updated weights on worker 0-0, policy_version 498563 (0.00092) [2022-07-10 01:01:38,319][26022] Updated weights on worker 0-0, policy_version 498573 (0.00082) [2022-07-10 01:01:39,971][25689] Fps is (10 sec: 5685.8, 60 sec: 5645.5, 300 sec: 5659.8). Total num frames: 510547968. Throughput: 0: 5075.6. Samples: 510543022. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:39,972][25689] Avg episode reward: [(0, '-42.805')] [2022-07-10 01:01:40,186][26022] Updated weights on worker 0-0, policy_version 498583 (0.00082) [2022-07-10 01:01:41,883][26022] Updated weights on worker 0-0, policy_version 498593 (0.00089) [2022-07-10 01:01:43,814][26022] Updated weights on worker 0-0, policy_version 498603 (0.00084) [2022-07-10 01:01:45,016][25689] Fps is (10 sec: 5772.0, 60 sec: 5678.4, 300 sec: 5659.6). Total num frames: 510577664. Throughput: 0: 5925.2. Samples: 510577256. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:45,017][25689] Avg episode reward: [(0, '-43.221')] [2022-07-10 01:01:45,336][26022] Updated weights on worker 0-0, policy_version 498613 (0.00088) [2022-07-10 01:01:46,936][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:01:46,962][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000498619_510585856.pth [2022-07-10 01:01:46,963][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000496627_508546048.pth [2022-07-10 01:01:47,435][26022] Updated weights on worker 0-0, policy_version 498623 (0.00086) [2022-07-10 01:01:49,101][26022] Updated weights on worker 0-0, policy_version 498633 (0.00093) [2022-07-10 01:01:50,030][25689] Fps is (10 sec: 5497.2, 60 sec: 5630.5, 300 sec: 5649.4). Total num frames: 510603264. Throughput: 0: 5920.0. Samples: 510611226. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:50,031][25689] Avg episode reward: [(0, '-43.693')] [2022-07-10 01:01:51,075][26022] Updated weights on worker 0-0, policy_version 498643 (0.00094) [2022-07-10 01:01:52,843][26022] Updated weights on worker 0-0, policy_version 498653 (0.00084) [2022-07-10 01:01:54,601][26022] Updated weights on worker 0-0, policy_version 498663 (0.00091) [2022-07-10 01:01:55,045][25689] Fps is (10 sec: 5513.6, 60 sec: 5631.3, 300 sec: 5651.3). Total num frames: 510632960. Throughput: 0: 5077.2. Samples: 510628378. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:01:55,046][25689] Avg episode reward: [(0, '-43.217')] [2022-07-10 01:01:56,419][26022] Updated weights on worker 0-0, policy_version 498673 (0.00084) [2022-07-10 01:01:58,250][26022] Updated weights on worker 0-0, policy_version 498683 (0.00088) [2022-07-10 01:02:00,093][25689] Fps is (10 sec: 5800.1, 60 sec: 5655.5, 300 sec: 5658.3). Total num frames: 510661632. Throughput: 0: 5929.3. Samples: 510662332. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:02:00,094][25689] Avg episode reward: [(0, '-43.000')] [2022-07-10 01:02:00,095][26022] Updated weights on worker 0-0, policy_version 498693 (0.00091) [2022-07-10 01:02:02,061][26022] Updated weights on worker 0-0, policy_version 498703 (0.00096) [2022-07-10 01:02:04,150][26022] Updated weights on worker 0-0, policy_version 498713 (0.00090) [2022-07-10 01:02:05,187][25689] Fps is (10 sec: 5452.6, 60 sec: 5636.8, 300 sec: 5650.7). Total num frames: 510688256. Throughput: 0: 5799.7. Samples: 510694238. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:02:05,187][25689] Avg episode reward: [(0, '-43.805')] [2022-07-10 01:02:05,702][26022] Updated weights on worker 0-0, policy_version 498723 (0.00081) [2022-07-10 01:02:07,708][26022] Updated weights on worker 0-0, policy_version 498733 (0.00090) [2022-07-10 01:02:09,491][26022] Updated weights on worker 0-0, policy_version 498743 (0.00086) [2022-07-10 01:02:10,221][25689] Fps is (10 sec: 5560.8, 60 sec: 5636.1, 300 sec: 5657.7). Total num frames: 510717952. Throughput: 0: 4950.5. Samples: 510711182. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:02:10,222][25689] Avg episode reward: [(0, '-43.278')] [2022-07-10 01:02:11,379][26022] Updated weights on worker 0-0, policy_version 498753 (0.00088) [2022-07-10 01:02:12,932][26022] Updated weights on worker 0-0, policy_version 498763 (0.00094) [2022-07-10 01:02:14,645][26022] Updated weights on worker 0-0, policy_version 498773 (0.00081) [2022-07-10 01:02:15,274][25689] Fps is (10 sec: 5685.0, 60 sec: 5651.2, 300 sec: 5648.6). Total num frames: 510745600. Throughput: 0: 5789.4. Samples: 510745486. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:02:15,274][25689] Avg episode reward: [(0, '-42.630')] [2022-07-10 01:02:16,643][26022] Updated weights on worker 0-0, policy_version 498783 (0.00093) [2022-07-10 01:02:18,247][26022] Updated weights on worker 0-0, policy_version 498793 (0.00088) [2022-07-10 01:02:20,300][26022] Updated weights on worker 0-0, policy_version 498803 (0.00085) [2022-07-10 01:02:20,399][25689] Fps is (10 sec: 5533.8, 60 sec: 5610.4, 300 sec: 5651.3). Total num frames: 510774272. Throughput: 0: 5788.0. Samples: 510779860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 01:02:20,399][25689] Avg episode reward: [(0, '-42.066')] [2022-07-10 01:02:21,623][26022] Updated weights on worker 0-0, policy_version 498813 (0.00091) [2022-07-10 01:02:23,944][26022] Updated weights on worker 0-0, policy_version 498823 (0.00088) [2022-07-10 01:02:25,429][25689] Fps is (10 sec: 5747.4, 60 sec: 5647.6, 300 sec: 5650.8). Total num frames: 510803968. Throughput: 0: 5091.5. Samples: 510797302. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:25,430][25689] Avg episode reward: [(0, '-42.850')] [2022-07-10 01:02:25,461][26022] Updated weights on worker 0-0, policy_version 498833 (0.00084) [2022-07-10 01:02:27,328][26022] Updated weights on worker 0-0, policy_version 498843 (0.00094) [2022-07-10 01:02:29,073][26022] Updated weights on worker 0-0, policy_version 498853 (0.00086) [2022-07-10 01:02:30,439][25689] Fps is (10 sec: 5813.6, 60 sec: 5648.2, 300 sec: 5655.9). Total num frames: 510832640. Throughput: 0: 5941.8. Samples: 510831310. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:30,444][25689] Avg episode reward: [(0, '-44.143')] [2022-07-10 01:02:30,903][26022] Updated weights on worker 0-0, policy_version 498863 (0.00085) [2022-07-10 01:02:32,782][26022] Updated weights on worker 0-0, policy_version 498873 (0.00085) [2022-07-10 01:02:34,625][26022] Updated weights on worker 0-0, policy_version 498883 (0.00083) [2022-07-10 01:02:35,492][25689] Fps is (10 sec: 5698.4, 60 sec: 5644.9, 300 sec: 5655.9). Total num frames: 510861312. Throughput: 0: 5937.2. Samples: 510865528. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:35,493][25689] Avg episode reward: [(0, '-43.536')] [2022-07-10 01:02:36,271][26022] Updated weights on worker 0-0, policy_version 498893 (0.00092) [2022-07-10 01:02:38,162][26022] Updated weights on worker 0-0, policy_version 498903 (0.00086) [2022-07-10 01:02:39,942][26022] Updated weights on worker 0-0, policy_version 498913 (0.00090) [2022-07-10 01:02:40,567][25689] Fps is (10 sec: 5661.9, 60 sec: 5644.2, 300 sec: 5659.3). Total num frames: 510889984. Throughput: 0: 5091.4. Samples: 510882544. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:40,567][25689] Avg episode reward: [(0, '-43.122')] [2022-07-10 01:02:41,665][26022] Updated weights on worker 0-0, policy_version 498923 (0.00087) [2022-07-10 01:02:43,664][26022] Updated weights on worker 0-0, policy_version 498933 (0.00085) [2022-07-10 01:02:45,455][26022] Updated weights on worker 0-0, policy_version 498943 (0.00079) [2022-07-10 01:02:45,586][25689] Fps is (10 sec: 5681.1, 60 sec: 5629.7, 300 sec: 5656.0). Total num frames: 510918656. Throughput: 0: 5932.2. Samples: 510916878. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:45,587][25689] Avg episode reward: [(0, '-43.756')] [2022-07-10 01:02:47,101][26022] Updated weights on worker 0-0, policy_version 498953 (0.00091) [2022-07-10 01:02:48,896][26022] Updated weights on worker 0-0, policy_version 498963 (0.00090) [2022-07-10 01:02:50,548][26022] Updated weights on worker 0-0, policy_version 498973 (0.00093) [2022-07-10 01:02:50,602][25689] Fps is (10 sec: 5816.4, 60 sec: 5697.2, 300 sec: 5659.2). Total num frames: 510948352. Throughput: 0: 5967.7. Samples: 510951638. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:50,603][25689] Avg episode reward: [(0, '-44.388')] [2022-07-10 01:02:52,639][26022] Updated weights on worker 0-0, policy_version 498983 (0.00086) [2022-07-10 01:02:54,182][26022] Updated weights on worker 0-0, policy_version 498993 (0.00092) [2022-07-10 01:02:55,605][25689] Fps is (10 sec: 5723.7, 60 sec: 5664.5, 300 sec: 5656.6). Total num frames: 510976000. Throughput: 0: 5127.2. Samples: 510968648. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:02:55,606][25689] Avg episode reward: [(0, '-42.621')] [2022-07-10 01:02:56,345][26022] Updated weights on worker 0-0, policy_version 499003 (0.00089) [2022-07-10 01:02:57,729][26022] Updated weights on worker 0-0, policy_version 499013 (0.00086) [2022-07-10 01:02:59,922][26022] Updated weights on worker 0-0, policy_version 499023 (0.00106) [2022-07-10 01:03:00,687][25689] Fps is (10 sec: 5685.8, 60 sec: 5678.2, 300 sec: 5662.5). Total num frames: 511005696. Throughput: 0: 5971.3. Samples: 511002690. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:00,688][25689] Avg episode reward: [(0, '-42.935')] [2022-07-10 01:03:01,364][26022] Updated weights on worker 0-0, policy_version 499033 (0.00090) [2022-07-10 01:03:03,973][26022] Updated weights on worker 0-0, policy_version 499043 (0.00087) [2022-07-10 01:03:05,323][26022] Updated weights on worker 0-0, policy_version 499053 (0.00090) [2022-07-10 01:03:05,759][25689] Fps is (10 sec: 5445.7, 60 sec: 5663.3, 300 sec: 5650.8). Total num frames: 511031296. Throughput: 0: 5824.8. Samples: 511034380. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:05,761][25689] Avg episode reward: [(0, '-43.583')] [2022-07-10 01:03:07,570][26022] Updated weights on worker 0-0, policy_version 499063 (0.00095) [2022-07-10 01:03:09,245][26022] Updated weights on worker 0-0, policy_version 499073 (0.00088) [2022-07-10 01:03:10,790][25689] Fps is (10 sec: 5169.1, 60 sec: 5612.9, 300 sec: 5647.2). Total num frames: 511057920. Throughput: 0: 5793.5. Samples: 511068600. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:10,791][25689] Avg episode reward: [(0, '-44.540')] [2022-07-10 01:03:11,151][26022] Updated weights on worker 0-0, policy_version 499083 (0.00082) [2022-07-10 01:03:12,740][26022] Updated weights on worker 0-0, policy_version 499093 (0.00090) [2022-07-10 01:03:14,536][26022] Updated weights on worker 0-0, policy_version 499103 (0.00092) [2022-07-10 01:03:15,800][25689] Fps is (10 sec: 5710.9, 60 sec: 5667.6, 300 sec: 5652.1). Total num frames: 511088640. Throughput: 0: 5803.9. Samples: 511085858. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:15,800][25689] Avg episode reward: [(0, '-43.286')] [2022-07-10 01:03:16,224][26022] Updated weights on worker 0-0, policy_version 499113 (0.00089) [2022-07-10 01:03:18,239][26022] Updated weights on worker 0-0, policy_version 499123 (0.00084) [2022-07-10 01:03:19,726][26022] Updated weights on worker 0-0, policy_version 499133 (0.00087) [2022-07-10 01:03:20,949][25689] Fps is (10 sec: 5746.0, 60 sec: 5648.5, 300 sec: 5649.5). Total num frames: 511116288. Throughput: 0: 5811.8. Samples: 511120444. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:20,950][25689] Avg episode reward: [(0, '-43.568')] [2022-07-10 01:03:21,861][26022] Updated weights on worker 0-0, policy_version 499143 (0.00087) [2022-07-10 01:03:23,546][26022] Updated weights on worker 0-0, policy_version 499153 (0.00088) [2022-07-10 01:03:25,143][26022] Updated weights on worker 0-0, policy_version 499163 (0.00084) [2022-07-10 01:03:25,992][25689] Fps is (10 sec: 5827.0, 60 sec: 5681.1, 300 sec: 5659.1). Total num frames: 511148032. Throughput: 0: 5957.5. Samples: 511154922. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:25,995][25689] Avg episode reward: [(0, '-43.318')] [2022-07-10 01:03:27,252][26022] Updated weights on worker 0-0, policy_version 499173 (0.00090) [2022-07-10 01:03:28,763][26022] Updated weights on worker 0-0, policy_version 499183 (0.00087) [2022-07-10 01:03:30,944][26022] Updated weights on worker 0-0, policy_version 499193 (0.00091) [2022-07-10 01:03:31,041][25689] Fps is (10 sec: 5884.8, 60 sec: 5660.5, 300 sec: 5658.3). Total num frames: 511175680. Throughput: 0: 5090.0. Samples: 511171676. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:31,042][25689] Avg episode reward: [(0, '-42.803')] [2022-07-10 01:03:32,575][26022] Updated weights on worker 0-0, policy_version 499203 (0.00088) [2022-07-10 01:03:34,282][26022] Updated weights on worker 0-0, policy_version 499213 (0.00086) [2022-07-10 01:03:36,091][25689] Fps is (10 sec: 5374.0, 60 sec: 5627.0, 300 sec: 5641.3). Total num frames: 511202304. Throughput: 0: 5915.4. Samples: 511205890. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:36,092][25689] Avg episode reward: [(0, '-42.242')] [2022-07-10 01:03:36,315][26022] Updated weights on worker 0-0, policy_version 499223 (0.00087) [2022-07-10 01:03:37,872][26022] Updated weights on worker 0-0, policy_version 499233 (0.00089) [2022-07-10 01:03:39,846][26022] Updated weights on worker 0-0, policy_version 499243 (0.00087) [2022-07-10 01:03:41,152][25689] Fps is (10 sec: 5671.3, 60 sec: 5662.1, 300 sec: 5657.4). Total num frames: 511233024. Throughput: 0: 5915.3. Samples: 511239954. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:41,153][25689] Avg episode reward: [(0, '-42.810')] [2022-07-10 01:03:41,497][26022] Updated weights on worker 0-0, policy_version 499253 (0.00092) [2022-07-10 01:03:43,278][26022] Updated weights on worker 0-0, policy_version 499263 (0.00088) [2022-07-10 01:03:45,166][26022] Updated weights on worker 0-0, policy_version 499273 (0.00099) [2022-07-10 01:03:46,182][25689] Fps is (10 sec: 5886.1, 60 sec: 5661.2, 300 sec: 5650.2). Total num frames: 511261696. Throughput: 0: 5064.0. Samples: 511257160. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:46,182][25689] Avg episode reward: [(0, '-42.887')] [2022-07-10 01:03:47,020][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:03:47,030][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000499283_511265792.pth [2022-07-10 01:03:47,031][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000497291_509225984.pth [2022-07-10 01:03:47,042][26022] Updated weights on worker 0-0, policy_version 499283 (0.00087) [2022-07-10 01:03:48,631][26022] Updated weights on worker 0-0, policy_version 499293 (0.00353) [2022-07-10 01:03:50,822][26022] Updated weights on worker 0-0, policy_version 499303 (0.00095) [2022-07-10 01:03:51,207][25689] Fps is (10 sec: 5499.4, 60 sec: 5609.6, 300 sec: 5650.0). Total num frames: 511288320. Throughput: 0: 5937.2. Samples: 511291406. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:51,209][25689] Avg episode reward: [(0, '-43.609')] [2022-07-10 01:03:52,054][26022] Updated weights on worker 0-0, policy_version 499313 (0.00091) [2022-07-10 01:03:54,338][26022] Updated weights on worker 0-0, policy_version 499323 (0.00086) [2022-07-10 01:03:55,698][26022] Updated weights on worker 0-0, policy_version 499333 (0.00091) [2022-07-10 01:03:56,277][25689] Fps is (10 sec: 5578.5, 60 sec: 5637.1, 300 sec: 5653.3). Total num frames: 511318016. Throughput: 0: 5918.3. Samples: 511325358. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:03:56,278][25689] Avg episode reward: [(0, '-44.041')] [2022-07-10 01:03:57,714][26022] Updated weights on worker 0-0, policy_version 499343 (0.00109) [2022-07-10 01:03:59,952][26022] Updated weights on worker 0-0, policy_version 499353 (0.00092) [2022-07-10 01:04:01,350][26022] Updated weights on worker 0-0, policy_version 499363 (0.00101) [2022-07-10 01:04:01,351][25689] Fps is (10 sec: 5854.7, 60 sec: 5637.9, 300 sec: 5662.3). Total num frames: 511347712. Throughput: 0: 5063.2. Samples: 511342226. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:01,352][25689] Avg episode reward: [(0, '-43.818')] [2022-07-10 01:04:03,721][26022] Updated weights on worker 0-0, policy_version 499373 (0.00088) [2022-07-10 01:04:05,257][26022] Updated weights on worker 0-0, policy_version 499383 (0.00087) [2022-07-10 01:04:06,372][25689] Fps is (10 sec: 5477.9, 60 sec: 5642.6, 300 sec: 5651.6). Total num frames: 511373312. Throughput: 0: 5797.7. Samples: 511374218. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:06,372][25689] Avg episode reward: [(0, '-44.346')] [2022-07-10 01:04:07,104][26022] Updated weights on worker 0-0, policy_version 499393 (0.00097) [2022-07-10 01:04:08,955][26022] Updated weights on worker 0-0, policy_version 499403 (0.00085) [2022-07-10 01:04:10,869][26022] Updated weights on worker 0-0, policy_version 499413 (0.00096) [2022-07-10 01:04:11,374][25689] Fps is (10 sec: 5313.0, 60 sec: 5662.3, 300 sec: 5652.0). Total num frames: 511400960. Throughput: 0: 5800.2. Samples: 511408376. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:11,374][25689] Avg episode reward: [(0, '-43.193')] [2022-07-10 01:04:12,460][26022] Updated weights on worker 0-0, policy_version 499423 (0.00086) [2022-07-10 01:04:14,579][26022] Updated weights on worker 0-0, policy_version 499433 (0.00092) [2022-07-10 01:04:16,081][26022] Updated weights on worker 0-0, policy_version 499443 (0.00093) [2022-07-10 01:04:16,406][25689] Fps is (10 sec: 5714.9, 60 sec: 5643.3, 300 sec: 5655.7). Total num frames: 511430656. Throughput: 0: 4976.6. Samples: 511425528. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:16,407][25689] Avg episode reward: [(0, '-43.782')] [2022-07-10 01:04:18,105][26022] Updated weights on worker 0-0, policy_version 499453 (0.00086) [2022-07-10 01:04:19,502][26022] Updated weights on worker 0-0, policy_version 499463 (0.00093) [2022-07-10 01:04:21,479][25689] Fps is (10 sec: 5775.6, 60 sec: 5667.2, 300 sec: 5655.0). Total num frames: 511459328. Throughput: 0: 5835.2. Samples: 511459678. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:21,480][25689] Avg episode reward: [(0, '-43.091')] [2022-07-10 01:04:21,577][26022] Updated weights on worker 0-0, policy_version 499473 (0.00085) [2022-07-10 01:04:23,248][26022] Updated weights on worker 0-0, policy_version 499483 (0.00618) [2022-07-10 01:04:25,283][26022] Updated weights on worker 0-0, policy_version 499493 (0.00107) [2022-07-10 01:04:26,513][25689] Fps is (10 sec: 5673.4, 60 sec: 5617.4, 300 sec: 5654.9). Total num frames: 511488000. Throughput: 0: 5942.4. Samples: 511493908. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:26,515][25689] Avg episode reward: [(0, '-42.556')] [2022-07-10 01:04:26,882][26022] Updated weights on worker 0-0, policy_version 499503 (0.00053) [2022-07-10 01:04:28,852][26022] Updated weights on worker 0-0, policy_version 499513 (0.00082) [2022-07-10 01:04:30,663][26022] Updated weights on worker 0-0, policy_version 499523 (0.00091) [2022-07-10 01:04:31,548][25689] Fps is (10 sec: 5593.6, 60 sec: 5618.7, 300 sec: 5654.5). Total num frames: 511515648. Throughput: 0: 5066.0. Samples: 511510580. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:31,550][25689] Avg episode reward: [(0, '-42.730')] [2022-07-10 01:04:32,499][26022] Updated weights on worker 0-0, policy_version 499533 (0.00091) [2022-07-10 01:04:34,455][26022] Updated weights on worker 0-0, policy_version 499543 (0.00089) [2022-07-10 01:04:35,963][26022] Updated weights on worker 0-0, policy_version 499553 (0.00086) [2022-07-10 01:04:36,551][25689] Fps is (10 sec: 5611.1, 60 sec: 5657.0, 300 sec: 5646.1). Total num frames: 511544320. Throughput: 0: 5910.7. Samples: 511544598. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:36,551][25689] Avg episode reward: [(0, '-42.862')] [2022-07-10 01:04:37,974][26022] Updated weights on worker 0-0, policy_version 499563 (0.00086) [2022-07-10 01:04:39,671][26022] Updated weights on worker 0-0, policy_version 499573 (0.00050) [2022-07-10 01:04:41,245][26022] Updated weights on worker 0-0, policy_version 499583 (0.00840) [2022-07-10 01:04:41,658][25689] Fps is (10 sec: 5874.3, 60 sec: 5652.6, 300 sec: 5655.2). Total num frames: 511575040. Throughput: 0: 5896.1. Samples: 511578656. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:41,659][25689] Avg episode reward: [(0, '-42.307')] [2022-07-10 01:04:43,332][26022] Updated weights on worker 0-0, policy_version 499593 (0.00086) [2022-07-10 01:04:45,040][26022] Updated weights on worker 0-0, policy_version 499603 (0.00086) [2022-07-10 01:04:46,707][25689] Fps is (10 sec: 5646.3, 60 sec: 5617.0, 300 sec: 5647.9). Total num frames: 511601664. Throughput: 0: 5053.7. Samples: 511595960. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:46,707][25689] Avg episode reward: [(0, '-42.840')] [2022-07-10 01:04:46,859][26022] Updated weights on worker 0-0, policy_version 499613 (0.00084) [2022-07-10 01:04:48,667][26022] Updated weights on worker 0-0, policy_version 499623 (0.00090) [2022-07-10 01:04:50,357][26022] Updated weights on worker 0-0, policy_version 499633 (0.00086) [2022-07-10 01:04:51,731][25689] Fps is (10 sec: 5591.4, 60 sec: 5667.8, 300 sec: 5648.4). Total num frames: 511631360. Throughput: 0: 5935.8. Samples: 511630382. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:51,732][25689] Avg episode reward: [(0, '-43.824')] [2022-07-10 01:04:52,174][26022] Updated weights on worker 0-0, policy_version 499643 (0.00086) [2022-07-10 01:04:54,123][26022] Updated weights on worker 0-0, policy_version 499653 (0.00088) [2022-07-10 01:04:55,766][26022] Updated weights on worker 0-0, policy_version 499663 (0.00114) [2022-07-10 01:04:56,825][25689] Fps is (10 sec: 5768.4, 60 sec: 5648.7, 300 sec: 5651.5). Total num frames: 511660032. Throughput: 0: 5927.3. Samples: 511664772. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:04:56,826][25689] Avg episode reward: [(0, '-43.851')] [2022-07-10 01:04:57,607][26022] Updated weights on worker 0-0, policy_version 499673 (0.00086) [2022-07-10 01:04:59,293][26022] Updated weights on worker 0-0, policy_version 499683 (0.00086) [2022-07-10 01:05:01,159][26022] Updated weights on worker 0-0, policy_version 499693 (0.00081) [2022-07-10 01:05:01,902][25689] Fps is (10 sec: 5637.8, 60 sec: 5631.5, 300 sec: 5660.6). Total num frames: 511688704. Throughput: 0: 5932.8. Samples: 511698760. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:05:01,903][25689] Avg episode reward: [(0, '-44.211')] [2022-07-10 01:05:03,367][26022] Updated weights on worker 0-0, policy_version 499703 (0.00087) [2022-07-10 01:05:05,363][26022] Updated weights on worker 0-0, policy_version 499713 (0.00094) [2022-07-10 01:05:06,898][26022] Updated weights on worker 0-0, policy_version 499723 (0.00085) [2022-07-10 01:05:06,952][25689] Fps is (10 sec: 5561.4, 60 sec: 5662.5, 300 sec: 5653.4). Total num frames: 511716352. Throughput: 0: 5815.4. Samples: 511713696. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:05:06,954][25689] Avg episode reward: [(0, '-43.796')] [2022-07-10 01:05:09,042][26022] Updated weights on worker 0-0, policy_version 499733 (0.00084) [2022-07-10 01:05:10,396][26022] Updated weights on worker 0-0, policy_version 499743 (0.00091) [2022-07-10 01:05:11,983][25689] Fps is (10 sec: 5485.5, 60 sec: 5659.9, 300 sec: 5649.7). Total num frames: 511744000. Throughput: 0: 5803.9. Samples: 511747922. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 01:05:11,983][25689] Avg episode reward: [(0, '-42.654')] [2022-07-10 01:05:12,444][26022] Updated weights on worker 0-0, policy_version 499753 (0.00087) [2022-07-10 01:05:14,375][26022] Updated weights on worker 0-0, policy_version 499763 (0.00277) [2022-07-10 01:05:15,966][26022] Updated weights on worker 0-0, policy_version 499773 (0.00087) [2022-07-10 01:05:17,003][25689] Fps is (10 sec: 5603.8, 60 sec: 5644.1, 300 sec: 5653.7). Total num frames: 511772672. Throughput: 0: 5830.5. Samples: 511782416. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:17,003][25689] Avg episode reward: [(0, '-42.179')] [2022-07-10 01:05:17,863][26022] Updated weights on worker 0-0, policy_version 499783 (0.00095) [2022-07-10 01:05:19,495][26022] Updated weights on worker 0-0, policy_version 499793 (0.00103) [2022-07-10 01:05:21,351][26022] Updated weights on worker 0-0, policy_version 499803 (0.00088) [2022-07-10 01:05:22,043][25689] Fps is (10 sec: 5802.0, 60 sec: 5664.1, 300 sec: 5653.1). Total num frames: 511802368. Throughput: 0: 5003.9. Samples: 511799540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:22,043][25689] Avg episode reward: [(0, '-42.426')] [2022-07-10 01:05:23,018][26022] Updated weights on worker 0-0, policy_version 499813 (0.00088) [2022-07-10 01:05:24,831][26022] Updated weights on worker 0-0, policy_version 499823 (0.00097) [2022-07-10 01:05:26,852][26022] Updated weights on worker 0-0, policy_version 499833 (0.00087) [2022-07-10 01:05:27,108][25689] Fps is (10 sec: 5674.6, 60 sec: 5644.3, 300 sec: 5652.0). Total num frames: 511830016. Throughput: 0: 5964.7. Samples: 511833920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:27,109][25689] Avg episode reward: [(0, '-41.688')] [2022-07-10 01:05:28,463][26022] Updated weights on worker 0-0, policy_version 499843 (0.00094) [2022-07-10 01:05:30,478][26022] Updated weights on worker 0-0, policy_version 499853 (0.00091) [2022-07-10 01:05:32,128][25689] Fps is (10 sec: 5584.2, 60 sec: 5662.6, 300 sec: 5651.7). Total num frames: 511858688. Throughput: 0: 5981.6. Samples: 511868424. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:32,129][25689] Avg episode reward: [(0, '-42.364')] [2022-07-10 01:05:32,210][26022] Updated weights on worker 0-0, policy_version 499863 (0.00092) [2022-07-10 01:05:33,837][26022] Updated weights on worker 0-0, policy_version 499873 (0.00084) [2022-07-10 01:05:35,645][26022] Updated weights on worker 0-0, policy_version 499883 (0.00094) [2022-07-10 01:05:37,143][25689] Fps is (10 sec: 5714.6, 60 sec: 5661.5, 300 sec: 5647.0). Total num frames: 511887360. Throughput: 0: 5107.4. Samples: 511885280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:37,144][25689] Avg episode reward: [(0, '-42.284')] [2022-07-10 01:05:37,467][26022] Updated weights on worker 0-0, policy_version 499893 (0.00090) [2022-07-10 01:05:39,345][26022] Updated weights on worker 0-0, policy_version 499903 (0.00090) [2022-07-10 01:05:40,934][26022] Updated weights on worker 0-0, policy_version 499913 (0.00094) [2022-07-10 01:05:42,230][25689] Fps is (10 sec: 5778.0, 60 sec: 5646.5, 300 sec: 5652.9). Total num frames: 511917056. Throughput: 0: 5952.5. Samples: 511919706. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:42,231][25689] Avg episode reward: [(0, '-41.816')] [2022-07-10 01:05:43,026][26022] Updated weights on worker 0-0, policy_version 499923 (0.00088) [2022-07-10 01:05:44,579][26022] Updated weights on worker 0-0, policy_version 499933 (0.00092) [2022-07-10 01:05:46,378][26022] Updated weights on worker 0-0, policy_version 499943 (0.00085) [2022-07-10 01:05:47,158][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:05:47,168][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000499947_511945728.pth [2022-07-10 01:05:47,169][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000497955_509905920.pth [2022-07-10 01:05:47,247][25689] Fps is (10 sec: 5776.5, 60 sec: 5683.2, 300 sec: 5653.4). Total num frames: 511945728. Throughput: 0: 5962.3. Samples: 511953994. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:47,247][25689] Avg episode reward: [(0, '-42.126')] [2022-07-10 01:05:48,295][26022] Updated weights on worker 0-0, policy_version 499953 (0.00083) [2022-07-10 01:05:50,005][26022] Updated weights on worker 0-0, policy_version 499963 (0.00108) [2022-07-10 01:05:51,786][26022] Updated weights on worker 0-0, policy_version 499973 (0.00094) [2022-07-10 01:05:52,283][25689] Fps is (10 sec: 5602.6, 60 sec: 5648.3, 300 sec: 5646.3). Total num frames: 511973376. Throughput: 0: 5092.9. Samples: 511971068. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:52,284][25689] Avg episode reward: [(0, '-43.007')] [2022-07-10 01:05:53,647][26022] Updated weights on worker 0-0, policy_version 499983 (0.00089) [2022-07-10 01:05:55,389][26022] Updated weights on worker 0-0, policy_version 499993 (0.00091) [2022-07-10 01:05:57,286][25689] Fps is (10 sec: 5610.1, 60 sec: 5656.8, 300 sec: 5652.0). Total num frames: 512002048. Throughput: 0: 5954.1. Samples: 512005216. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:05:57,287][25689] Avg episode reward: [(0, '-41.704')] [2022-07-10 01:05:57,468][26022] Updated weights on worker 0-0, policy_version 500003 (0.00086) [2022-07-10 01:05:59,052][26022] Updated weights on worker 0-0, policy_version 500013 (0.00088) [2022-07-10 01:06:01,051][26022] Updated weights on worker 0-0, policy_version 500023 (0.00094) [2022-07-10 01:06:02,370][25689] Fps is (10 sec: 5583.4, 60 sec: 5639.3, 300 sec: 5651.9). Total num frames: 512029696. Throughput: 0: 5913.9. Samples: 512038808. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:02,370][25689] Avg episode reward: [(0, '-41.913')] [2022-07-10 01:06:02,980][26022] Updated weights on worker 0-0, policy_version 500033 (0.00097) [2022-07-10 01:06:04,959][26022] Updated weights on worker 0-0, policy_version 500043 (0.00086) [2022-07-10 01:06:06,654][26022] Updated weights on worker 0-0, policy_version 500053 (0.00507) [2022-07-10 01:06:07,387][25689] Fps is (10 sec: 5575.9, 60 sec: 5659.3, 300 sec: 5648.6). Total num frames: 512058368. Throughput: 0: 4988.2. Samples: 512054454. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:07,389][25689] Avg episode reward: [(0, '-42.122')] [2022-07-10 01:06:08,452][26022] Updated weights on worker 0-0, policy_version 500063 (0.00085) [2022-07-10 01:06:10,281][26022] Updated weights on worker 0-0, policy_version 500073 (0.00084) [2022-07-10 01:06:11,820][26022] Updated weights on worker 0-0, policy_version 500083 (0.00084) [2022-07-10 01:06:12,427][25689] Fps is (10 sec: 5803.6, 60 sec: 5692.3, 300 sec: 5658.8). Total num frames: 512088064. Throughput: 0: 5852.4. Samples: 512088960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:12,429][25689] Avg episode reward: [(0, '-42.138')] [2022-07-10 01:06:14,020][26022] Updated weights on worker 0-0, policy_version 500093 (0.00086) [2022-07-10 01:06:15,409][26022] Updated weights on worker 0-0, policy_version 500103 (0.00085) [2022-07-10 01:06:17,444][25689] Fps is (10 sec: 5599.9, 60 sec: 5658.6, 300 sec: 5645.7). Total num frames: 512114688. Throughput: 0: 5867.8. Samples: 512123498. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:17,446][25689] Avg episode reward: [(0, '-42.499')] [2022-07-10 01:06:17,566][26022] Updated weights on worker 0-0, policy_version 500113 (0.00054) [2022-07-10 01:06:19,165][26022] Updated weights on worker 0-0, policy_version 500123 (0.00088) [2022-07-10 01:06:20,997][26022] Updated weights on worker 0-0, policy_version 500133 (0.00052) [2022-07-10 01:06:22,493][25689] Fps is (10 sec: 5595.1, 60 sec: 5657.8, 300 sec: 5652.9). Total num frames: 512144384. Throughput: 0: 5058.8. Samples: 512140608. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:22,493][25689] Avg episode reward: [(0, '-42.002')] [2022-07-10 01:06:22,980][26022] Updated weights on worker 0-0, policy_version 500143 (0.00090) [2022-07-10 01:06:24,474][26022] Updated weights on worker 0-0, policy_version 500153 (0.00084) [2022-07-10 01:06:26,500][26022] Updated weights on worker 0-0, policy_version 500163 (0.00087) [2022-07-10 01:06:27,502][25689] Fps is (10 sec: 5803.3, 60 sec: 5680.1, 300 sec: 5653.0). Total num frames: 512173056. Throughput: 0: 5975.4. Samples: 512174650. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:27,502][25689] Avg episode reward: [(0, '-41.646')] [2022-07-10 01:06:27,980][26022] Updated weights on worker 0-0, policy_version 500173 (0.00084) [2022-07-10 01:06:30,054][26022] Updated weights on worker 0-0, policy_version 500183 (0.00091) [2022-07-10 01:06:31,786][26022] Updated weights on worker 0-0, policy_version 500193 (0.00077) [2022-07-10 01:06:32,523][25689] Fps is (10 sec: 5717.4, 60 sec: 5680.0, 300 sec: 5653.0). Total num frames: 512201728. Throughput: 0: 5973.1. Samples: 512208994. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:32,523][25689] Avg episode reward: [(0, '-41.074')] [2022-07-10 01:06:33,668][26022] Updated weights on worker 0-0, policy_version 500203 (0.00091) [2022-07-10 01:06:35,418][26022] Updated weights on worker 0-0, policy_version 500213 (0.00085) [2022-07-10 01:06:37,134][26022] Updated weights on worker 0-0, policy_version 500223 (0.00092) [2022-07-10 01:06:37,536][25689] Fps is (10 sec: 5715.0, 60 sec: 5680.1, 300 sec: 5654.0). Total num frames: 512230400. Throughput: 0: 5099.9. Samples: 512225964. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:37,536][25689] Avg episode reward: [(0, '-42.117')] [2022-07-10 01:06:38,981][26022] Updated weights on worker 0-0, policy_version 500233 (0.00089) [2022-07-10 01:06:40,728][26022] Updated weights on worker 0-0, policy_version 500243 (0.00090) [2022-07-10 01:06:42,444][26022] Updated weights on worker 0-0, policy_version 500253 (0.00092) [2022-07-10 01:06:42,596][25689] Fps is (10 sec: 5794.4, 60 sec: 5682.7, 300 sec: 5653.7). Total num frames: 512260096. Throughput: 0: 5962.0. Samples: 512260462. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:42,596][25689] Avg episode reward: [(0, '-41.015')] [2022-07-10 01:06:44,309][26022] Updated weights on worker 0-0, policy_version 500263 (0.00094) [2022-07-10 01:06:45,881][26022] Updated weights on worker 0-0, policy_version 500273 (0.00089) [2022-07-10 01:06:47,669][25689] Fps is (10 sec: 5658.7, 60 sec: 5660.4, 300 sec: 5659.5). Total num frames: 512287744. Throughput: 0: 5973.2. Samples: 512295116. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:47,670][25689] Avg episode reward: [(0, '-41.157')] [2022-07-10 01:06:47,826][26022] Updated weights on worker 0-0, policy_version 500283 (0.00097) [2022-07-10 01:06:49,538][26022] Updated weights on worker 0-0, policy_version 500293 (0.00110) [2022-07-10 01:06:51,380][26022] Updated weights on worker 0-0, policy_version 500303 (0.00090) [2022-07-10 01:06:52,680][25689] Fps is (10 sec: 5585.0, 60 sec: 5679.7, 300 sec: 5656.1). Total num frames: 512316416. Throughput: 0: 5126.0. Samples: 512312320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:52,680][25689] Avg episode reward: [(0, '-41.302')] [2022-07-10 01:06:53,356][26022] Updated weights on worker 0-0, policy_version 500313 (0.00084) [2022-07-10 01:06:54,958][26022] Updated weights on worker 0-0, policy_version 500323 (0.00086) [2022-07-10 01:06:57,014][26022] Updated weights on worker 0-0, policy_version 500333 (0.00089) [2022-07-10 01:06:57,733][25689] Fps is (10 sec: 5800.0, 60 sec: 5692.0, 300 sec: 5659.5). Total num frames: 512346112. Throughput: 0: 5966.8. Samples: 512346476. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:06:57,733][25689] Avg episode reward: [(0, '-41.732')] [2022-07-10 01:06:58,534][26022] Updated weights on worker 0-0, policy_version 500343 (0.00094) [2022-07-10 01:07:00,585][26022] Updated weights on worker 0-0, policy_version 500353 (0.00088) [2022-07-10 01:07:02,493][26022] Updated weights on worker 0-0, policy_version 500363 (0.00087) [2022-07-10 01:07:02,786][25689] Fps is (10 sec: 5572.9, 60 sec: 5677.9, 300 sec: 5660.2). Total num frames: 512372736. Throughput: 0: 5862.1. Samples: 512378818. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:02,788][25689] Avg episode reward: [(0, '-41.376')] [2022-07-10 01:07:04,473][26022] Updated weights on worker 0-0, policy_version 500373 (0.00087) [2022-07-10 01:07:06,153][26022] Updated weights on worker 0-0, policy_version 500383 (0.00086) [2022-07-10 01:07:07,859][25689] Fps is (10 sec: 5359.7, 60 sec: 5655.7, 300 sec: 5652.6). Total num frames: 512400384. Throughput: 0: 4965.4. Samples: 512395364. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:07,861][25689] Avg episode reward: [(0, '-41.841')] [2022-07-10 01:07:08,035][26022] Updated weights on worker 0-0, policy_version 500393 (0.00088) [2022-07-10 01:07:09,775][26022] Updated weights on worker 0-0, policy_version 500403 (0.00086) [2022-07-10 01:07:11,519][26022] Updated weights on worker 0-0, policy_version 500413 (0.00090) [2022-07-10 01:07:12,896][25689] Fps is (10 sec: 5570.7, 60 sec: 5639.1, 300 sec: 5656.3). Total num frames: 512429056. Throughput: 0: 5796.0. Samples: 512429492. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:12,897][25689] Avg episode reward: [(0, '-42.566')] [2022-07-10 01:07:13,523][26022] Updated weights on worker 0-0, policy_version 500423 (0.00093) [2022-07-10 01:07:15,181][26022] Updated weights on worker 0-0, policy_version 500433 (0.00084) [2022-07-10 01:07:16,907][26022] Updated weights on worker 0-0, policy_version 500443 (0.00082) [2022-07-10 01:07:17,959][25689] Fps is (10 sec: 5779.1, 60 sec: 5685.6, 300 sec: 5661.0). Total num frames: 512458752. Throughput: 0: 5823.2. Samples: 512464256. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:17,961][25689] Avg episode reward: [(0, '-41.700')] [2022-07-10 01:07:18,869][26022] Updated weights on worker 0-0, policy_version 500453 (0.00090) [2022-07-10 01:07:20,540][26022] Updated weights on worker 0-0, policy_version 500463 (0.00091) [2022-07-10 01:07:22,327][26022] Updated weights on worker 0-0, policy_version 500473 (0.00227) [2022-07-10 01:07:23,092][25689] Fps is (10 sec: 5825.1, 60 sec: 5677.7, 300 sec: 5659.0). Total num frames: 512488448. Throughput: 0: 5048.7. Samples: 512481336. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:23,092][25689] Avg episode reward: [(0, '-40.916')] [2022-07-10 01:07:24,147][26022] Updated weights on worker 0-0, policy_version 500483 (0.00088) [2022-07-10 01:07:25,791][26022] Updated weights on worker 0-0, policy_version 500493 (0.00087) [2022-07-10 01:07:27,742][26022] Updated weights on worker 0-0, policy_version 500503 (0.00093) [2022-07-10 01:07:28,119][25689] Fps is (10 sec: 5643.9, 60 sec: 5659.1, 300 sec: 5655.2). Total num frames: 512516096. Throughput: 0: 5926.6. Samples: 512515438. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:28,120][25689] Avg episode reward: [(0, '-41.072')] [2022-07-10 01:07:29,624][26022] Updated weights on worker 0-0, policy_version 500513 (0.00085) [2022-07-10 01:07:31,331][26022] Updated weights on worker 0-0, policy_version 500523 (0.00090) [2022-07-10 01:07:33,176][25689] Fps is (10 sec: 5585.0, 60 sec: 5655.7, 300 sec: 5655.2). Total num frames: 512544768. Throughput: 0: 5911.7. Samples: 512549382. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:33,178][25689] Avg episode reward: [(0, '-39.829')] [2022-07-10 01:07:33,264][26022] Updated weights on worker 0-0, policy_version 500533 (0.00088) [2022-07-10 01:07:35,148][26022] Updated weights on worker 0-0, policy_version 500543 (0.00094) [2022-07-10 01:07:36,642][26022] Updated weights on worker 0-0, policy_version 500553 (0.00091) [2022-07-10 01:07:38,219][25689] Fps is (10 sec: 5677.9, 60 sec: 5653.0, 300 sec: 5655.8). Total num frames: 512573440. Throughput: 0: 5888.9. Samples: 512583564. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:38,219][25689] Avg episode reward: [(0, '-38.537')] [2022-07-10 01:07:38,756][26022] Updated weights on worker 0-0, policy_version 500563 (0.00095) [2022-07-10 01:07:40,264][26022] Updated weights on worker 0-0, policy_version 500573 (0.00087) [2022-07-10 01:07:42,317][26022] Updated weights on worker 0-0, policy_version 500583 (0.00082) [2022-07-10 01:07:43,300][25689] Fps is (10 sec: 5765.3, 60 sec: 5651.0, 300 sec: 5658.0). Total num frames: 512603136. Throughput: 0: 5911.1. Samples: 512600788. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:43,301][25689] Avg episode reward: [(0, '-37.921')] [2022-07-10 01:07:43,895][26022] Updated weights on worker 0-0, policy_version 500593 (0.00085) [2022-07-10 01:07:45,763][26022] Updated weights on worker 0-0, policy_version 500603 (0.00086) [2022-07-10 01:07:47,297][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:07:47,306][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000500611_512625664.pth [2022-07-10 01:07:47,306][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000498619_510585856.pth [2022-07-10 01:07:47,466][26022] Updated weights on worker 0-0, policy_version 500613 (0.00086) [2022-07-10 01:07:48,337][25689] Fps is (10 sec: 5667.5, 60 sec: 5654.4, 300 sec: 5650.8). Total num frames: 512630784. Throughput: 0: 5925.0. Samples: 512635228. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:48,337][25689] Avg episode reward: [(0, '-38.421')] [2022-07-10 01:07:49,348][26022] Updated weights on worker 0-0, policy_version 500623 (0.00085) [2022-07-10 01:07:51,029][26022] Updated weights on worker 0-0, policy_version 500633 (0.00088) [2022-07-10 01:07:52,749][26022] Updated weights on worker 0-0, policy_version 500643 (0.00088) [2022-07-10 01:07:53,361][25689] Fps is (10 sec: 5699.4, 60 sec: 5670.0, 300 sec: 5657.2). Total num frames: 512660480. Throughput: 0: 5979.6. Samples: 512670082. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:53,362][25689] Avg episode reward: [(0, '-38.088')] [2022-07-10 01:07:54,615][26022] Updated weights on worker 0-0, policy_version 500653 (0.00086) [2022-07-10 01:07:56,375][26022] Updated weights on worker 0-0, policy_version 500663 (0.00082) [2022-07-10 01:07:58,316][26022] Updated weights on worker 0-0, policy_version 500673 (0.00084) [2022-07-10 01:07:58,383][25689] Fps is (10 sec: 5810.1, 60 sec: 5656.1, 300 sec: 5655.0). Total num frames: 512689152. Throughput: 0: 5144.4. Samples: 512687296. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:07:58,383][25689] Avg episode reward: [(0, '-37.587')] [2022-07-10 01:07:59,879][26022] Updated weights on worker 0-0, policy_version 500683 (0.00096) [2022-07-10 01:08:02,099][26022] Updated weights on worker 0-0, policy_version 500693 (0.00086) [2022-07-10 01:08:03,463][25689] Fps is (10 sec: 5677.0, 60 sec: 5687.3, 300 sec: 5665.1). Total num frames: 512717824. Throughput: 0: 5900.7. Samples: 512719762. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:08:03,463][25689] Avg episode reward: [(0, '-38.894')] [2022-07-10 01:08:03,801][26022] Updated weights on worker 0-0, policy_version 500703 (0.00080) [2022-07-10 01:08:05,893][26022] Updated weights on worker 0-0, policy_version 500713 (0.00086) [2022-07-10 01:08:07,611][26022] Updated weights on worker 0-0, policy_version 500723 (0.00088) [2022-07-10 01:08:08,483][25689] Fps is (10 sec: 5677.2, 60 sec: 5709.1, 300 sec: 5672.2). Total num frames: 512746496. Throughput: 0: 5887.1. Samples: 512753834. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 01:08:08,484][25689] Avg episode reward: [(0, '-39.613')] [2022-07-10 01:08:09,448][26022] Updated weights on worker 0-0, policy_version 500733 (0.00087) [2022-07-10 01:08:11,127][26022] Updated weights on worker 0-0, policy_version 500743 (0.00085) [2022-07-10 01:08:13,047][26022] Updated weights on worker 0-0, policy_version 500753 (0.00093) [2022-07-10 01:08:13,575][25689] Fps is (10 sec: 5468.0, 60 sec: 5670.2, 300 sec: 5656.9). Total num frames: 512773120. Throughput: 0: 4986.6. Samples: 512770880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:13,576][25689] Avg episode reward: [(0, '-40.822')] [2022-07-10 01:08:14,613][26022] Updated weights on worker 0-0, policy_version 500763 (0.00083) [2022-07-10 01:08:16,651][26022] Updated weights on worker 0-0, policy_version 500773 (0.00914) [2022-07-10 01:08:17,995][26022] Updated weights on worker 0-0, policy_version 500783 (0.00085) [2022-07-10 01:08:18,585][25689] Fps is (10 sec: 5575.1, 60 sec: 5675.1, 300 sec: 5666.4). Total num frames: 512802816. Throughput: 0: 5845.2. Samples: 512805386. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:18,586][25689] Avg episode reward: [(0, '-40.683')] [2022-07-10 01:08:20,127][26022] Updated weights on worker 0-0, policy_version 500793 (0.00085) [2022-07-10 01:08:21,708][26022] Updated weights on worker 0-0, policy_version 500803 (0.00087) [2022-07-10 01:08:23,534][26022] Updated weights on worker 0-0, policy_version 500813 (0.00086) [2022-07-10 01:08:23,670][25689] Fps is (10 sec: 5984.7, 60 sec: 5696.6, 300 sec: 5662.2). Total num frames: 512833536. Throughput: 0: 5964.2. Samples: 512840284. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:23,671][25689] Avg episode reward: [(0, '-40.603')] [2022-07-10 01:08:25,329][26022] Updated weights on worker 0-0, policy_version 500823 (0.00086) [2022-07-10 01:08:26,963][26022] Updated weights on worker 0-0, policy_version 500833 (0.00081) [2022-07-10 01:08:28,714][25689] Fps is (10 sec: 5762.8, 60 sec: 5695.0, 300 sec: 5662.3). Total num frames: 512861184. Throughput: 0: 5129.7. Samples: 512857610. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:28,716][25689] Avg episode reward: [(0, '-40.078')] [2022-07-10 01:08:29,124][26022] Updated weights on worker 0-0, policy_version 500843 (0.00079) [2022-07-10 01:08:30,417][26022] Updated weights on worker 0-0, policy_version 500853 (0.00086) [2022-07-10 01:08:32,445][26022] Updated weights on worker 0-0, policy_version 500863 (0.00087) [2022-07-10 01:08:33,725][25689] Fps is (10 sec: 5702.9, 60 sec: 5716.2, 300 sec: 5673.3). Total num frames: 512890880. Throughput: 0: 6023.0. Samples: 512892244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:33,726][25689] Avg episode reward: [(0, '-39.400')] [2022-07-10 01:08:34,249][26022] Updated weights on worker 0-0, policy_version 500873 (0.00090) [2022-07-10 01:08:35,945][26022] Updated weights on worker 0-0, policy_version 500883 (0.00086) [2022-07-10 01:08:37,959][26022] Updated weights on worker 0-0, policy_version 500893 (0.00094) [2022-07-10 01:08:38,734][25689] Fps is (10 sec: 5723.0, 60 sec: 5702.5, 300 sec: 5664.0). Total num frames: 512918528. Throughput: 0: 6006.5. Samples: 512926406. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:38,736][25689] Avg episode reward: [(0, '-39.640')] [2022-07-10 01:08:39,537][26022] Updated weights on worker 0-0, policy_version 500903 (0.00088) [2022-07-10 01:08:41,363][26022] Updated weights on worker 0-0, policy_version 500913 (0.00084) [2022-07-10 01:08:43,274][26022] Updated weights on worker 0-0, policy_version 500923 (0.00087) [2022-07-10 01:08:43,773][25689] Fps is (10 sec: 5707.2, 60 sec: 5706.5, 300 sec: 5667.3). Total num frames: 512948224. Throughput: 0: 5128.6. Samples: 512943384. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:43,775][25689] Avg episode reward: [(0, '-39.421')] [2022-07-10 01:08:45,073][26022] Updated weights on worker 0-0, policy_version 500933 (0.00087) [2022-07-10 01:08:46,655][26022] Updated weights on worker 0-0, policy_version 500943 (0.00088) [2022-07-10 01:08:48,590][26022] Updated weights on worker 0-0, policy_version 500953 (0.00087) [2022-07-10 01:08:48,779][25689] Fps is (10 sec: 5810.2, 60 sec: 5726.3, 300 sec: 5674.5). Total num frames: 512976896. Throughput: 0: 5998.4. Samples: 512977972. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:48,780][25689] Avg episode reward: [(0, '-39.957')] [2022-07-10 01:08:50,224][26022] Updated weights on worker 0-0, policy_version 500963 (0.00087) [2022-07-10 01:08:52,235][26022] Updated weights on worker 0-0, policy_version 500973 (0.00084) [2022-07-10 01:08:53,801][25689] Fps is (10 sec: 5616.3, 60 sec: 5692.7, 300 sec: 5668.6). Total num frames: 513004544. Throughput: 0: 5973.2. Samples: 513012160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:53,803][25689] Avg episode reward: [(0, '-40.270')] [2022-07-10 01:08:54,102][26022] Updated weights on worker 0-0, policy_version 500983 (0.00098) [2022-07-10 01:08:55,708][26022] Updated weights on worker 0-0, policy_version 500993 (0.00094) [2022-07-10 01:08:57,612][26022] Updated weights on worker 0-0, policy_version 501003 (0.00091) [2022-07-10 01:08:58,822][25689] Fps is (10 sec: 5709.9, 60 sec: 5709.6, 300 sec: 5669.6). Total num frames: 513034240. Throughput: 0: 5126.2. Samples: 513029384. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:08:58,823][25689] Avg episode reward: [(0, '-39.892')] [2022-07-10 01:08:59,515][26022] Updated weights on worker 0-0, policy_version 501013 (0.00086) [2022-07-10 01:09:01,095][26022] Updated weights on worker 0-0, policy_version 501023 (0.00081) [2022-07-10 01:09:03,345][26022] Updated weights on worker 0-0, policy_version 501033 (0.00089) [2022-07-10 01:09:03,860][25689] Fps is (10 sec: 5599.1, 60 sec: 5679.7, 300 sec: 5672.7). Total num frames: 513060864. Throughput: 0: 5945.7. Samples: 513062814. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:03,860][25689] Avg episode reward: [(0, '-39.239')] [2022-07-10 01:09:05,087][26022] Updated weights on worker 0-0, policy_version 501043 (0.00084) [2022-07-10 01:09:06,773][26022] Updated weights on worker 0-0, policy_version 501053 (0.00091) [2022-07-10 01:09:08,774][26022] Updated weights on worker 0-0, policy_version 501063 (0.00087) [2022-07-10 01:09:08,867][25689] Fps is (10 sec: 5403.3, 60 sec: 5664.1, 300 sec: 5672.6). Total num frames: 513088512. Throughput: 0: 5874.4. Samples: 513095972. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:08,867][25689] Avg episode reward: [(0, '-38.533')] [2022-07-10 01:09:10,382][26022] Updated weights on worker 0-0, policy_version 501073 (0.00636) [2022-07-10 01:09:12,470][26022] Updated weights on worker 0-0, policy_version 501083 (0.00088) [2022-07-10 01:09:13,890][25689] Fps is (10 sec: 5717.3, 60 sec: 5721.5, 300 sec: 5672.8). Total num frames: 513118208. Throughput: 0: 5021.2. Samples: 513113028. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:13,891][25689] Avg episode reward: [(0, '-38.560')] [2022-07-10 01:09:14,072][26022] Updated weights on worker 0-0, policy_version 501093 (0.00095) [2022-07-10 01:09:15,915][26022] Updated weights on worker 0-0, policy_version 501103 (0.00087) [2022-07-10 01:09:17,555][26022] Updated weights on worker 0-0, policy_version 501113 (0.00099) [2022-07-10 01:09:18,905][25689] Fps is (10 sec: 5712.4, 60 sec: 5687.0, 300 sec: 5670.4). Total num frames: 513145856. Throughput: 0: 5880.1. Samples: 513147474. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:18,906][25689] Avg episode reward: [(0, '-38.606')] [2022-07-10 01:09:19,478][26022] Updated weights on worker 0-0, policy_version 501123 (0.00091) [2022-07-10 01:09:21,235][26022] Updated weights on worker 0-0, policy_version 501133 (0.00085) [2022-07-10 01:09:22,967][26022] Updated weights on worker 0-0, policy_version 501143 (0.00091) [2022-07-10 01:09:23,966][25689] Fps is (10 sec: 5690.8, 60 sec: 5672.3, 300 sec: 5673.3). Total num frames: 513175552. Throughput: 0: 5919.9. Samples: 513181842. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:23,967][25689] Avg episode reward: [(0, '-38.892')] [2022-07-10 01:09:24,671][26022] Updated weights on worker 0-0, policy_version 501153 (0.00086) [2022-07-10 01:09:26,638][26022] Updated weights on worker 0-0, policy_version 501163 (0.00085) [2022-07-10 01:09:28,395][26022] Updated weights on worker 0-0, policy_version 501173 (0.00092) [2022-07-10 01:09:28,979][25689] Fps is (10 sec: 5793.7, 60 sec: 5692.1, 300 sec: 5677.2). Total num frames: 513204224. Throughput: 0: 5121.6. Samples: 513198982. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:28,980][25689] Avg episode reward: [(0, '-38.546')] [2022-07-10 01:09:30,075][26022] Updated weights on worker 0-0, policy_version 501183 (0.00087) [2022-07-10 01:09:31,938][26022] Updated weights on worker 0-0, policy_version 501193 (0.00087) [2022-07-10 01:09:33,911][26022] Updated weights on worker 0-0, policy_version 501203 (0.00088) [2022-07-10 01:09:33,981][25689] Fps is (10 sec: 5623.7, 60 sec: 5659.1, 300 sec: 5673.8). Total num frames: 513231872. Throughput: 0: 5984.7. Samples: 513233268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:33,981][25689] Avg episode reward: [(0, '-39.197')] [2022-07-10 01:09:35,609][26022] Updated weights on worker 0-0, policy_version 501213 (0.00087) [2022-07-10 01:09:37,387][26022] Updated weights on worker 0-0, policy_version 501223 (0.00084) [2022-07-10 01:09:38,990][25689] Fps is (10 sec: 5728.3, 60 sec: 5693.0, 300 sec: 5672.2). Total num frames: 513261568. Throughput: 0: 5973.2. Samples: 513267446. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:38,991][25689] Avg episode reward: [(0, '-38.004')] [2022-07-10 01:09:39,162][26022] Updated weights on worker 0-0, policy_version 501233 (0.00066) [2022-07-10 01:09:41,101][26022] Updated weights on worker 0-0, policy_version 501243 (0.00100) [2022-07-10 01:09:42,718][26022] Updated weights on worker 0-0, policy_version 501253 (0.00087) [2022-07-10 01:09:44,101][25689] Fps is (10 sec: 5767.3, 60 sec: 5669.2, 300 sec: 5677.9). Total num frames: 513290240. Throughput: 0: 5107.4. Samples: 513284680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:44,102][25689] Avg episode reward: [(0, '-37.416')] [2022-07-10 01:09:44,567][26022] Updated weights on worker 0-0, policy_version 501263 (0.00082) [2022-07-10 01:09:46,236][26022] Updated weights on worker 0-0, policy_version 501273 (0.00089) [2022-07-10 01:09:47,452][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:09:47,466][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000501279_513309696.pth [2022-07-10 01:09:47,467][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000499283_511265792.pth [2022-07-10 01:09:48,083][26022] Updated weights on worker 0-0, policy_version 501283 (0.00088) [2022-07-10 01:09:49,123][25689] Fps is (10 sec: 5659.3, 60 sec: 5667.8, 300 sec: 5674.5). Total num frames: 513318912. Throughput: 0: 5966.8. Samples: 513319172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:49,124][25689] Avg episode reward: [(0, '-38.179')] [2022-07-10 01:09:49,819][26022] Updated weights on worker 0-0, policy_version 501293 (0.00090) [2022-07-10 01:09:51,712][26022] Updated weights on worker 0-0, policy_version 501303 (0.00082) [2022-07-10 01:09:53,402][26022] Updated weights on worker 0-0, policy_version 501313 (0.00085) [2022-07-10 01:09:54,133][25689] Fps is (10 sec: 5716.5, 60 sec: 5685.9, 300 sec: 5676.1). Total num frames: 513347584. Throughput: 0: 5977.6. Samples: 513353728. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:54,134][25689] Avg episode reward: [(0, '-37.596')] [2022-07-10 01:09:55,398][26022] Updated weights on worker 0-0, policy_version 501323 (0.00084) [2022-07-10 01:09:57,101][26022] Updated weights on worker 0-0, policy_version 501333 (0.00094) [2022-07-10 01:09:58,813][26022] Updated weights on worker 0-0, policy_version 501343 (0.00090) [2022-07-10 01:09:59,164][25689] Fps is (10 sec: 5812.8, 60 sec: 5685.0, 300 sec: 5680.4). Total num frames: 513377280. Throughput: 0: 5123.6. Samples: 513370810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:09:59,166][25689] Avg episode reward: [(0, '-37.335')] [2022-07-10 01:10:00,689][26022] Updated weights on worker 0-0, policy_version 501353 (0.00087) [2022-07-10 01:10:02,619][26022] Updated weights on worker 0-0, policy_version 501363 (0.00086) [2022-07-10 01:10:04,258][25689] Fps is (10 sec: 5562.2, 60 sec: 5679.6, 300 sec: 5676.1). Total num frames: 513403904. Throughput: 0: 5877.2. Samples: 513403146. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:04,259][25689] Avg episode reward: [(0, '-38.209')] [2022-07-10 01:10:04,666][26022] Updated weights on worker 0-0, policy_version 501373 (0.00085) [2022-07-10 01:10:06,254][26022] Updated weights on worker 0-0, policy_version 501383 (0.00082) [2022-07-10 01:10:08,150][26022] Updated weights on worker 0-0, policy_version 501393 (0.00091) [2022-07-10 01:10:09,298][25689] Fps is (10 sec: 5355.2, 60 sec: 5676.5, 300 sec: 5676.0). Total num frames: 513431552. Throughput: 0: 5865.7. Samples: 513437518. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:09,300][25689] Avg episode reward: [(0, '-38.615')] [2022-07-10 01:10:10,005][26022] Updated weights on worker 0-0, policy_version 501403 (0.00096) [2022-07-10 01:10:11,885][26022] Updated weights on worker 0-0, policy_version 501413 (0.00094) [2022-07-10 01:10:13,463][26022] Updated weights on worker 0-0, policy_version 501423 (0.00093) [2022-07-10 01:10:14,339][25689] Fps is (10 sec: 5688.2, 60 sec: 5674.8, 300 sec: 5679.0). Total num frames: 513461248. Throughput: 0: 4981.3. Samples: 513454384. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:14,340][25689] Avg episode reward: [(0, '-38.214')] [2022-07-10 01:10:15,506][26022] Updated weights on worker 0-0, policy_version 501433 (0.00082) [2022-07-10 01:10:16,964][26022] Updated weights on worker 0-0, policy_version 501443 (0.00056) [2022-07-10 01:10:19,164][26022] Updated weights on worker 0-0, policy_version 501453 (0.00084) [2022-07-10 01:10:19,353][25689] Fps is (10 sec: 5703.1, 60 sec: 5674.9, 300 sec: 5672.6). Total num frames: 513488896. Throughput: 0: 5842.7. Samples: 513488770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:19,354][25689] Avg episode reward: [(0, '-37.616')] [2022-07-10 01:10:20,699][26022] Updated weights on worker 0-0, policy_version 501463 (0.00083) [2022-07-10 01:10:22,482][26022] Updated weights on worker 0-0, policy_version 501473 (0.00086) [2022-07-10 01:10:24,096][26022] Updated weights on worker 0-0, policy_version 501483 (0.00084) [2022-07-10 01:10:24,461][25689] Fps is (10 sec: 5665.6, 60 sec: 5670.6, 300 sec: 5678.7). Total num frames: 513518592. Throughput: 0: 5949.1. Samples: 513523334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:24,461][25689] Avg episode reward: [(0, '-38.333')] [2022-07-10 01:10:26,139][26022] Updated weights on worker 0-0, policy_version 501493 (0.00116) [2022-07-10 01:10:28,043][26022] Updated weights on worker 0-0, policy_version 501503 (0.00091) [2022-07-10 01:10:29,492][25689] Fps is (10 sec: 5858.2, 60 sec: 5685.9, 300 sec: 5681.9). Total num frames: 513548288. Throughput: 0: 5098.0. Samples: 513540464. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:29,492][25689] Avg episode reward: [(0, '-37.664')] [2022-07-10 01:10:29,716][26022] Updated weights on worker 0-0, policy_version 501513 (0.00088) [2022-07-10 01:10:31,588][26022] Updated weights on worker 0-0, policy_version 501523 (0.00092) [2022-07-10 01:10:33,247][26022] Updated weights on worker 0-0, policy_version 501533 (0.00084) [2022-07-10 01:10:34,497][25689] Fps is (10 sec: 5815.9, 60 sec: 5702.4, 300 sec: 5682.1). Total num frames: 513576960. Throughput: 0: 5964.8. Samples: 513574620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:34,497][25689] Avg episode reward: [(0, '-37.445')] [2022-07-10 01:10:35,123][26022] Updated weights on worker 0-0, policy_version 501543 (0.00094) [2022-07-10 01:10:36,803][26022] Updated weights on worker 0-0, policy_version 501553 (0.00104) [2022-07-10 01:10:38,779][26022] Updated weights on worker 0-0, policy_version 501563 (0.00087) [2022-07-10 01:10:39,527][25689] Fps is (10 sec: 5510.2, 60 sec: 5649.7, 300 sec: 5672.9). Total num frames: 513603584. Throughput: 0: 5961.1. Samples: 513609026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:39,527][25689] Avg episode reward: [(0, '-36.912')] [2022-07-10 01:10:40,331][26022] Updated weights on worker 0-0, policy_version 501573 (0.00607) [2022-07-10 01:10:42,499][26022] Updated weights on worker 0-0, policy_version 501583 (0.00090) [2022-07-10 01:10:43,922][26022] Updated weights on worker 0-0, policy_version 501593 (0.00088) [2022-07-10 01:10:44,623][25689] Fps is (10 sec: 5663.1, 60 sec: 5685.0, 300 sec: 5678.3). Total num frames: 513634304. Throughput: 0: 5940.2. Samples: 513643100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:44,623][25689] Avg episode reward: [(0, '-36.229')] [2022-07-10 01:10:45,888][26022] Updated weights on worker 0-0, policy_version 501603 (0.00082) [2022-07-10 01:10:47,436][26022] Updated weights on worker 0-0, policy_version 501613 (0.00087) [2022-07-10 01:10:49,639][25689] Fps is (10 sec: 5670.9, 60 sec: 5651.6, 300 sec: 5675.2). Total num frames: 513660928. Throughput: 0: 5949.4. Samples: 513660328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:49,639][25689] Avg episode reward: [(0, '-36.568')] [2022-07-10 01:10:49,700][26022] Updated weights on worker 0-0, policy_version 501623 (0.00087) [2022-07-10 01:10:51,207][26022] Updated weights on worker 0-0, policy_version 501633 (0.00079) [2022-07-10 01:10:53,247][26022] Updated weights on worker 0-0, policy_version 501643 (0.00087) [2022-07-10 01:10:54,643][25689] Fps is (10 sec: 5620.9, 60 sec: 5669.2, 300 sec: 5678.6). Total num frames: 513690624. Throughput: 0: 5955.8. Samples: 513694604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:54,643][25689] Avg episode reward: [(0, '-36.398')] [2022-07-10 01:10:54,830][26022] Updated weights on worker 0-0, policy_version 501653 (0.00086) [2022-07-10 01:10:56,706][26022] Updated weights on worker 0-0, policy_version 501663 (0.00084) [2022-07-10 01:10:58,399][26022] Updated weights on worker 0-0, policy_version 501673 (0.00083) [2022-07-10 01:10:59,654][25689] Fps is (10 sec: 5930.0, 60 sec: 5671.0, 300 sec: 5686.9). Total num frames: 513720320. Throughput: 0: 5971.2. Samples: 513729212. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:10:59,655][25689] Avg episode reward: [(0, '-37.751')] [2022-07-10 01:11:00,221][26022] Updated weights on worker 0-0, policy_version 501683 (0.00087) [2022-07-10 01:11:02,469][26022] Updated weights on worker 0-0, policy_version 501693 (0.00083) [2022-07-10 01:11:04,133][26022] Updated weights on worker 0-0, policy_version 501703 (0.00093) [2022-07-10 01:11:04,722][25689] Fps is (10 sec: 5587.7, 60 sec: 5673.5, 300 sec: 5679.0). Total num frames: 513746944. Throughput: 0: 5029.8. Samples: 513744194. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:04,724][25689] Avg episode reward: [(0, '-38.697')] [2022-07-10 01:11:06,071][26022] Updated weights on worker 0-0, policy_version 501713 (0.00082) [2022-07-10 01:11:07,703][26022] Updated weights on worker 0-0, policy_version 501723 (0.00093) [2022-07-10 01:11:09,623][26022] Updated weights on worker 0-0, policy_version 501733 (0.00086) [2022-07-10 01:11:09,738][25689] Fps is (10 sec: 5585.1, 60 sec: 5709.6, 300 sec: 5679.5). Total num frames: 513776640. Throughput: 0: 5875.3. Samples: 513778418. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:09,739][25689] Avg episode reward: [(0, '-39.671')] [2022-07-10 01:11:11,677][26022] Updated weights on worker 0-0, policy_version 501743 (0.00091) [2022-07-10 01:11:13,027][26022] Updated weights on worker 0-0, policy_version 501753 (0.00099) [2022-07-10 01:11:14,802][25689] Fps is (10 sec: 5587.0, 60 sec: 5656.6, 300 sec: 5678.6). Total num frames: 513803264. Throughput: 0: 5863.2. Samples: 513812804. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:14,803][25689] Avg episode reward: [(0, '-41.922')] [2022-07-10 01:11:15,101][26022] Updated weights on worker 0-0, policy_version 501763 (0.00083) [2022-07-10 01:11:16,490][26022] Updated weights on worker 0-0, policy_version 501773 (0.00091) [2022-07-10 01:11:18,587][26022] Updated weights on worker 0-0, policy_version 501783 (0.00089) [2022-07-10 01:11:19,871][25689] Fps is (10 sec: 5659.3, 60 sec: 5702.3, 300 sec: 5681.7). Total num frames: 513833984. Throughput: 0: 4999.1. Samples: 513830278. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:19,872][25689] Avg episode reward: [(0, '-40.978')] [2022-07-10 01:11:20,091][26022] Updated weights on worker 0-0, policy_version 501793 (0.00087) [2022-07-10 01:11:22,178][26022] Updated weights on worker 0-0, policy_version 501803 (0.00092) [2022-07-10 01:11:23,730][26022] Updated weights on worker 0-0, policy_version 501813 (0.00081) [2022-07-10 01:11:24,988][25689] Fps is (10 sec: 5730.5, 60 sec: 5667.5, 300 sec: 5676.1). Total num frames: 513861632. Throughput: 0: 5950.6. Samples: 513864790. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:24,989][25689] Avg episode reward: [(0, '-40.745')] [2022-07-10 01:11:25,803][26022] Updated weights on worker 0-0, policy_version 501823 (0.00083) [2022-07-10 01:11:27,298][26022] Updated weights on worker 0-0, policy_version 501833 (0.00094) [2022-07-10 01:11:29,339][26022] Updated weights on worker 0-0, policy_version 501843 (0.00087) [2022-07-10 01:11:30,025][25689] Fps is (10 sec: 5647.4, 60 sec: 5666.9, 300 sec: 5679.3). Total num frames: 513891328. Throughput: 0: 5955.1. Samples: 513899228. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:30,026][25689] Avg episode reward: [(0, '-40.173')] [2022-07-10 01:11:30,907][26022] Updated weights on worker 0-0, policy_version 501853 (0.00084) [2022-07-10 01:11:32,717][26022] Updated weights on worker 0-0, policy_version 501863 (0.00089) [2022-07-10 01:11:34,503][26022] Updated weights on worker 0-0, policy_version 501873 (0.00087) [2022-07-10 01:11:35,035][25689] Fps is (10 sec: 5911.4, 60 sec: 5683.4, 300 sec: 5682.8). Total num frames: 513921024. Throughput: 0: 5126.9. Samples: 513916534. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:35,036][25689] Avg episode reward: [(0, '-40.157')] [2022-07-10 01:11:36,449][26022] Updated weights on worker 0-0, policy_version 501883 (0.00085) [2022-07-10 01:11:37,998][26022] Updated weights on worker 0-0, policy_version 501893 (0.00080) [2022-07-10 01:11:39,900][26022] Updated weights on worker 0-0, policy_version 501903 (0.00085) [2022-07-10 01:11:40,122][25689] Fps is (10 sec: 5780.7, 60 sec: 5711.9, 300 sec: 5678.8). Total num frames: 513949696. Throughput: 0: 5957.0. Samples: 513950914. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:40,123][25689] Avg episode reward: [(0, '-39.866')] [2022-07-10 01:11:41,519][26022] Updated weights on worker 0-0, policy_version 501913 (0.00092) [2022-07-10 01:11:43,483][26022] Updated weights on worker 0-0, policy_version 501923 (0.00087) [2022-07-10 01:11:44,958][26022] Updated weights on worker 0-0, policy_version 501933 (0.00081) [2022-07-10 01:11:45,176][25689] Fps is (10 sec: 5755.9, 60 sec: 5699.0, 300 sec: 5686.1). Total num frames: 513979392. Throughput: 0: 5989.8. Samples: 513985710. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:45,176][25689] Avg episode reward: [(0, '-38.738')] [2022-07-10 01:11:46,951][26022] Updated weights on worker 0-0, policy_version 501943 (0.00093) [2022-07-10 01:11:47,617][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:11:47,626][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000501947_513993728.pth [2022-07-10 01:11:47,626][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000499947_511945728.pth [2022-07-10 01:11:48,681][26022] Updated weights on worker 0-0, policy_version 501953 (0.00084) [2022-07-10 01:11:50,197][25689] Fps is (10 sec: 5793.4, 60 sec: 5732.3, 300 sec: 5685.9). Total num frames: 514008064. Throughput: 0: 5153.7. Samples: 514003188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:50,198][25689] Avg episode reward: [(0, '-37.354')] [2022-07-10 01:11:50,426][26022] Updated weights on worker 0-0, policy_version 501963 (0.00091) [2022-07-10 01:11:52,347][26022] Updated weights on worker 0-0, policy_version 501973 (0.00085) [2022-07-10 01:11:53,868][26022] Updated weights on worker 0-0, policy_version 501983 (0.00099) [2022-07-10 01:11:55,224][25689] Fps is (10 sec: 5604.7, 60 sec: 5696.3, 300 sec: 5679.5). Total num frames: 514035712. Throughput: 0: 5999.3. Samples: 514037656. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:11:55,225][25689] Avg episode reward: [(0, '-37.362')] [2022-07-10 01:11:55,767][26022] Updated weights on worker 0-0, policy_version 501993 (0.00090) [2022-07-10 01:11:57,546][26022] Updated weights on worker 0-0, policy_version 502003 (0.00086) [2022-07-10 01:11:59,384][26022] Updated weights on worker 0-0, policy_version 502013 (0.00096) [2022-07-10 01:12:00,249][25689] Fps is (10 sec: 5806.5, 60 sec: 5711.9, 300 sec: 5693.8). Total num frames: 514066432. Throughput: 0: 6020.1. Samples: 514072082. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:00,250][25689] Avg episode reward: [(0, '-38.061')] [2022-07-10 01:12:01,459][26022] Updated weights on worker 0-0, policy_version 502023 (0.00100) [2022-07-10 01:12:03,244][26022] Updated weights on worker 0-0, policy_version 502033 (0.00088) [2022-07-10 01:12:05,184][26022] Updated weights on worker 0-0, policy_version 502043 (0.00087) [2022-07-10 01:12:05,386][25689] Fps is (10 sec: 5542.6, 60 sec: 5688.5, 300 sec: 5685.7). Total num frames: 514092032. Throughput: 0: 5012.1. Samples: 514087004. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:05,386][25689] Avg episode reward: [(0, '-37.659')] [2022-07-10 01:12:06,918][26022] Updated weights on worker 0-0, policy_version 502053 (0.00090) [2022-07-10 01:12:08,763][26022] Updated weights on worker 0-0, policy_version 502063 (0.00090) [2022-07-10 01:12:10,426][25689] Fps is (10 sec: 5433.6, 60 sec: 5686.3, 300 sec: 5689.1). Total num frames: 514121728. Throughput: 0: 5828.4. Samples: 514121090. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:10,427][25689] Avg episode reward: [(0, '-37.770')] [2022-07-10 01:12:10,551][26022] Updated weights on worker 0-0, policy_version 502073 (0.00093) [2022-07-10 01:12:12,283][26022] Updated weights on worker 0-0, policy_version 502083 (0.00083) [2022-07-10 01:12:14,332][26022] Updated weights on worker 0-0, policy_version 502093 (0.00084) [2022-07-10 01:12:15,501][25689] Fps is (10 sec: 5770.2, 60 sec: 5719.0, 300 sec: 5685.4). Total num frames: 514150400. Throughput: 0: 5810.0. Samples: 514155464. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:15,502][25689] Avg episode reward: [(0, '-39.304')] [2022-07-10 01:12:15,873][26022] Updated weights on worker 0-0, policy_version 502103 (0.00081) [2022-07-10 01:12:17,896][26022] Updated weights on worker 0-0, policy_version 502113 (0.00083) [2022-07-10 01:12:19,531][26022] Updated weights on worker 0-0, policy_version 502123 (0.00075) [2022-07-10 01:12:20,541][25689] Fps is (10 sec: 5669.3, 60 sec: 5688.0, 300 sec: 5683.7). Total num frames: 514179072. Throughput: 0: 4959.5. Samples: 514172718. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:20,541][25689] Avg episode reward: [(0, '-39.680')] [2022-07-10 01:12:21,270][26022] Updated weights on worker 0-0, policy_version 502133 (0.00090) [2022-07-10 01:12:23,150][26022] Updated weights on worker 0-0, policy_version 502143 (0.00088) [2022-07-10 01:12:24,843][26022] Updated weights on worker 0-0, policy_version 502153 (0.00084) [2022-07-10 01:12:25,697][25689] Fps is (10 sec: 5724.4, 60 sec: 5718.0, 300 sec: 5688.2). Total num frames: 514208768. Throughput: 0: 5912.2. Samples: 514207092. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:25,698][25689] Avg episode reward: [(0, '-38.750')] [2022-07-10 01:12:26,838][26022] Updated weights on worker 0-0, policy_version 502163 (0.00079) [2022-07-10 01:12:28,407][26022] Updated weights on worker 0-0, policy_version 502173 (0.00089) [2022-07-10 01:12:30,464][26022] Updated weights on worker 0-0, policy_version 502183 (0.00090) [2022-07-10 01:12:30,719][25689] Fps is (10 sec: 5634.1, 60 sec: 5685.7, 300 sec: 5685.4). Total num frames: 514236416. Throughput: 0: 5927.6. Samples: 514241378. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:30,720][25689] Avg episode reward: [(0, '-38.283')] [2022-07-10 01:12:32,103][26022] Updated weights on worker 0-0, policy_version 502193 (0.00087) [2022-07-10 01:12:34,138][26022] Updated weights on worker 0-0, policy_version 502203 (0.00091) [2022-07-10 01:12:35,767][25689] Fps is (10 sec: 5593.2, 60 sec: 5665.3, 300 sec: 5685.3). Total num frames: 514265088. Throughput: 0: 5084.8. Samples: 514258510. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:35,767][25689] Avg episode reward: [(0, '-39.501')] [2022-07-10 01:12:35,800][26022] Updated weights on worker 0-0, policy_version 502213 (0.00094) [2022-07-10 01:12:37,701][26022] Updated weights on worker 0-0, policy_version 502223 (0.00092) [2022-07-10 01:12:39,338][26022] Updated weights on worker 0-0, policy_version 502233 (0.00528) [2022-07-10 01:12:40,772][25689] Fps is (10 sec: 5908.0, 60 sec: 5706.7, 300 sec: 5690.2). Total num frames: 514295808. Throughput: 0: 5931.4. Samples: 514292716. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:40,774][25689] Avg episode reward: [(0, '-38.026')] [2022-07-10 01:12:41,120][26022] Updated weights on worker 0-0, policy_version 502243 (0.00086) [2022-07-10 01:12:42,834][26022] Updated weights on worker 0-0, policy_version 502253 (0.00086) [2022-07-10 01:12:44,753][26022] Updated weights on worker 0-0, policy_version 502263 (0.00089) [2022-07-10 01:12:45,826][25689] Fps is (10 sec: 5802.1, 60 sec: 5672.9, 300 sec: 5689.9). Total num frames: 514323456. Throughput: 0: 5957.1. Samples: 514327002. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:45,827][25689] Avg episode reward: [(0, '-38.879')] [2022-07-10 01:12:46,424][26022] Updated weights on worker 0-0, policy_version 502273 (0.00083) [2022-07-10 01:12:48,374][26022] Updated weights on worker 0-0, policy_version 502283 (0.00089) [2022-07-10 01:12:49,775][26022] Updated weights on worker 0-0, policy_version 502293 (0.00091) [2022-07-10 01:12:50,855][25689] Fps is (10 sec: 5585.6, 60 sec: 5672.2, 300 sec: 5686.3). Total num frames: 514352128. Throughput: 0: 5980.9. Samples: 514361808. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:50,856][25689] Avg episode reward: [(0, '-39.451')] [2022-07-10 01:12:51,961][26022] Updated weights on worker 0-0, policy_version 502303 (0.00086) [2022-07-10 01:12:53,728][26022] Updated weights on worker 0-0, policy_version 502313 (0.00092) [2022-07-10 01:12:55,397][26022] Updated weights on worker 0-0, policy_version 502323 (0.00093) [2022-07-10 01:12:55,872][25689] Fps is (10 sec: 5606.4, 60 sec: 5673.1, 300 sec: 5683.0). Total num frames: 514379776. Throughput: 0: 5988.1. Samples: 514378904. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:12:55,873][25689] Avg episode reward: [(0, '-39.748')] [2022-07-10 01:12:57,186][26022] Updated weights on worker 0-0, policy_version 502333 (0.00091) [2022-07-10 01:12:59,183][26022] Updated weights on worker 0-0, policy_version 502343 (0.00086) [2022-07-10 01:13:00,490][26022] Updated weights on worker 0-0, policy_version 502353 (0.00085) [2022-07-10 01:13:00,921][25689] Fps is (10 sec: 5798.8, 60 sec: 5671.0, 300 sec: 5690.5). Total num frames: 514410496. Throughput: 0: 5984.2. Samples: 514413290. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:00,921][25689] Avg episode reward: [(0, '-39.600')] [2022-07-10 01:13:03,170][26022] Updated weights on worker 0-0, policy_version 502363 (0.00095) [2022-07-10 01:13:04,598][26022] Updated weights on worker 0-0, policy_version 502373 (0.00091) [2022-07-10 01:13:06,013][25689] Fps is (10 sec: 5553.5, 60 sec: 5675.0, 300 sec: 5678.8). Total num frames: 514436096. Throughput: 0: 5858.4. Samples: 514445266. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:06,014][25689] Avg episode reward: [(0, '-38.574')] [2022-07-10 01:13:06,706][26022] Updated weights on worker 0-0, policy_version 502383 (0.00091) [2022-07-10 01:13:08,319][26022] Updated weights on worker 0-0, policy_version 502393 (0.00086) [2022-07-10 01:13:10,191][26022] Updated weights on worker 0-0, policy_version 502403 (0.00088) [2022-07-10 01:13:11,043][25689] Fps is (10 sec: 5462.6, 60 sec: 5676.0, 300 sec: 5690.3). Total num frames: 514465792. Throughput: 0: 4980.5. Samples: 514462356. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:11,044][25689] Avg episode reward: [(0, '-38.353')] [2022-07-10 01:13:11,961][26022] Updated weights on worker 0-0, policy_version 502413 (0.00090) [2022-07-10 01:13:13,795][26022] Updated weights on worker 0-0, policy_version 502423 (0.00092) [2022-07-10 01:13:15,497][26022] Updated weights on worker 0-0, policy_version 502433 (0.00086) [2022-07-10 01:13:16,071][25689] Fps is (10 sec: 5803.6, 60 sec: 5680.5, 300 sec: 5686.5). Total num frames: 514494464. Throughput: 0: 5844.7. Samples: 514496958. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:16,071][25689] Avg episode reward: [(0, '-38.072')] [2022-07-10 01:13:17,287][26022] Updated weights on worker 0-0, policy_version 502443 (0.00094) [2022-07-10 01:13:19,038][26022] Updated weights on worker 0-0, policy_version 502453 (0.00084) [2022-07-10 01:13:20,961][26022] Updated weights on worker 0-0, policy_version 502463 (0.00085) [2022-07-10 01:13:21,102][25689] Fps is (10 sec: 5700.7, 60 sec: 5681.3, 300 sec: 5680.6). Total num frames: 514523136. Throughput: 0: 5848.7. Samples: 514531326. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:21,103][25689] Avg episode reward: [(0, '-38.077')] [2022-07-10 01:13:22,911][26022] Updated weights on worker 0-0, policy_version 502473 (0.00084) [2022-07-10 01:13:24,448][26022] Updated weights on worker 0-0, policy_version 502483 (0.00080) [2022-07-10 01:13:26,150][25689] Fps is (10 sec: 5587.8, 60 sec: 5657.6, 300 sec: 5680.5). Total num frames: 514550784. Throughput: 0: 5125.1. Samples: 514548466. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:26,150][25689] Avg episode reward: [(0, '-37.953')] [2022-07-10 01:13:26,356][26022] Updated weights on worker 0-0, policy_version 502493 (0.00110) [2022-07-10 01:13:27,976][26022] Updated weights on worker 0-0, policy_version 502503 (0.00100) [2022-07-10 01:13:29,993][26022] Updated weights on worker 0-0, policy_version 502513 (0.00098) [2022-07-10 01:13:31,184][25689] Fps is (10 sec: 5687.8, 60 sec: 5690.3, 300 sec: 5680.1). Total num frames: 514580480. Throughput: 0: 5968.5. Samples: 514582568. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:31,185][25689] Avg episode reward: [(0, '-37.848')] [2022-07-10 01:13:31,567][26022] Updated weights on worker 0-0, policy_version 502523 (0.00098) [2022-07-10 01:13:33,461][26022] Updated weights on worker 0-0, policy_version 502533 (0.00083) [2022-07-10 01:13:35,464][26022] Updated weights on worker 0-0, policy_version 502543 (0.00091) [2022-07-10 01:13:36,221][25689] Fps is (10 sec: 5795.8, 60 sec: 5691.3, 300 sec: 5683.0). Total num frames: 514609152. Throughput: 0: 5935.4. Samples: 514616556. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:36,221][25689] Avg episode reward: [(0, '-38.624')] [2022-07-10 01:13:37,002][26022] Updated weights on worker 0-0, policy_version 502553 (0.00085) [2022-07-10 01:13:38,874][26022] Updated weights on worker 0-0, policy_version 502563 (0.00087) [2022-07-10 01:13:40,554][26022] Updated weights on worker 0-0, policy_version 502573 (0.00089) [2022-07-10 01:13:41,287][25689] Fps is (10 sec: 5473.5, 60 sec: 5617.9, 300 sec: 5672.2). Total num frames: 514635776. Throughput: 0: 5075.0. Samples: 514633762. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:41,287][25689] Avg episode reward: [(0, '-38.828')] [2022-07-10 01:13:42,432][26022] Updated weights on worker 0-0, policy_version 502583 (0.00091) [2022-07-10 01:13:44,285][26022] Updated weights on worker 0-0, policy_version 502593 (0.00091) [2022-07-10 01:13:46,040][26022] Updated weights on worker 0-0, policy_version 502603 (0.00087) [2022-07-10 01:13:46,341][25689] Fps is (10 sec: 5767.3, 60 sec: 5685.6, 300 sec: 5681.6). Total num frames: 514667520. Throughput: 0: 5941.7. Samples: 514668436. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:46,342][25689] Avg episode reward: [(0, '-39.221')] [2022-07-10 01:13:47,646][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:13:47,655][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000502612_514674688.pth [2022-07-10 01:13:47,656][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000500611_512625664.pth [2022-07-10 01:13:47,834][26022] Updated weights on worker 0-0, policy_version 502613 (0.00090) [2022-07-10 01:13:49,645][26022] Updated weights on worker 0-0, policy_version 502623 (0.00094) [2022-07-10 01:13:51,352][25689] Fps is (10 sec: 5900.6, 60 sec: 5670.3, 300 sec: 5681.8). Total num frames: 514695168. Throughput: 0: 5947.0. Samples: 514702506. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:51,353][25689] Avg episode reward: [(0, '-38.435')] [2022-07-10 01:13:51,426][26022] Updated weights on worker 0-0, policy_version 502633 (0.00087) [2022-07-10 01:13:53,247][26022] Updated weights on worker 0-0, policy_version 502643 (0.00087) [2022-07-10 01:13:55,144][26022] Updated weights on worker 0-0, policy_version 502653 (0.00085) [2022-07-10 01:13:56,359][25689] Fps is (10 sec: 5520.0, 60 sec: 5671.4, 300 sec: 5675.2). Total num frames: 514722816. Throughput: 0: 5123.7. Samples: 514719736. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 01:13:56,359][25689] Avg episode reward: [(0, '-38.520')] [2022-07-10 01:13:56,714][26022] Updated weights on worker 0-0, policy_version 502663 (0.00084) [2022-07-10 01:13:58,798][26022] Updated weights on worker 0-0, policy_version 502673 (0.00084) [2022-07-10 01:14:00,256][26022] Updated weights on worker 0-0, policy_version 502683 (0.00095) [2022-07-10 01:14:01,363][25689] Fps is (10 sec: 5830.5, 60 sec: 5675.5, 300 sec: 5689.6). Total num frames: 514753536. Throughput: 0: 6001.7. Samples: 514754254. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:01,365][25689] Avg episode reward: [(0, '-38.777')] [2022-07-10 01:14:02,463][26022] Updated weights on worker 0-0, policy_version 502693 (0.00086) [2022-07-10 01:14:04,216][26022] Updated weights on worker 0-0, policy_version 502703 (0.00092) [2022-07-10 01:14:06,246][26022] Updated weights on worker 0-0, policy_version 502713 (0.00086) [2022-07-10 01:14:06,394][25689] Fps is (10 sec: 5510.0, 60 sec: 5664.3, 300 sec: 5678.8). Total num frames: 514778112. Throughput: 0: 5884.9. Samples: 514786446. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:06,395][25689] Avg episode reward: [(0, '-37.567')] [2022-07-10 01:14:07,855][26022] Updated weights on worker 0-0, policy_version 502723 (0.00096) [2022-07-10 01:14:09,551][26022] Updated weights on worker 0-0, policy_version 502733 (0.00094) [2022-07-10 01:14:11,330][26022] Updated weights on worker 0-0, policy_version 502743 (0.00086) [2022-07-10 01:14:11,398][25689] Fps is (10 sec: 5510.2, 60 sec: 5683.7, 300 sec: 5682.6). Total num frames: 514808832. Throughput: 0: 5054.4. Samples: 514803826. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:11,399][25689] Avg episode reward: [(0, '-37.412')] [2022-07-10 01:14:13,261][26022] Updated weights on worker 0-0, policy_version 502753 (0.00648) [2022-07-10 01:14:14,996][26022] Updated weights on worker 0-0, policy_version 502763 (0.00086) [2022-07-10 01:14:16,405][25689] Fps is (10 sec: 5932.7, 60 sec: 5685.6, 300 sec: 5686.2). Total num frames: 514837504. Throughput: 0: 5913.0. Samples: 514838272. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:16,406][25689] Avg episode reward: [(0, '-36.557')] [2022-07-10 01:14:16,783][26022] Updated weights on worker 0-0, policy_version 502773 (0.00089) [2022-07-10 01:14:18,554][26022] Updated weights on worker 0-0, policy_version 502783 (0.00081) [2022-07-10 01:14:20,313][26022] Updated weights on worker 0-0, policy_version 502793 (0.00084) [2022-07-10 01:14:21,415][25689] Fps is (10 sec: 5724.5, 60 sec: 5687.6, 300 sec: 5683.7). Total num frames: 514866176. Throughput: 0: 5929.0. Samples: 514873146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:21,416][25689] Avg episode reward: [(0, '-37.181')] [2022-07-10 01:14:22,105][26022] Updated weights on worker 0-0, policy_version 502803 (0.00087) [2022-07-10 01:14:23,991][26022] Updated weights on worker 0-0, policy_version 502813 (0.00915) [2022-07-10 01:14:25,437][26022] Updated weights on worker 0-0, policy_version 502823 (0.00085) [2022-07-10 01:14:26,495][25689] Fps is (10 sec: 5683.2, 60 sec: 5701.6, 300 sec: 5682.5). Total num frames: 514894848. Throughput: 0: 5178.0. Samples: 514890530. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:26,496][25689] Avg episode reward: [(0, '-37.141')] [2022-07-10 01:14:27,439][26022] Updated weights on worker 0-0, policy_version 502833 (0.00089) [2022-07-10 01:14:29,306][26022] Updated weights on worker 0-0, policy_version 502843 (0.00086) [2022-07-10 01:14:30,920][26022] Updated weights on worker 0-0, policy_version 502853 (0.00097) [2022-07-10 01:14:31,524][25689] Fps is (10 sec: 5773.9, 60 sec: 5702.1, 300 sec: 5688.8). Total num frames: 514924544. Throughput: 0: 6039.0. Samples: 514925368. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:31,525][25689] Avg episode reward: [(0, '-38.361')] [2022-07-10 01:14:32,820][26022] Updated weights on worker 0-0, policy_version 502863 (0.00091) [2022-07-10 01:14:34,403][26022] Updated weights on worker 0-0, policy_version 502873 (0.00082) [2022-07-10 01:14:36,250][26022] Updated weights on worker 0-0, policy_version 502883 (0.00085) [2022-07-10 01:14:36,535][25689] Fps is (10 sec: 5813.9, 60 sec: 5704.5, 300 sec: 5685.4). Total num frames: 514953216. Throughput: 0: 6058.0. Samples: 514960218. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:36,535][25689] Avg episode reward: [(0, '-37.355')] [2022-07-10 01:14:38,083][26022] Updated weights on worker 0-0, policy_version 502893 (0.00084) [2022-07-10 01:14:39,922][26022] Updated weights on worker 0-0, policy_version 502903 (0.00090) [2022-07-10 01:14:41,540][25689] Fps is (10 sec: 5725.4, 60 sec: 5744.3, 300 sec: 5687.4). Total num frames: 514981888. Throughput: 0: 5188.2. Samples: 514977556. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:41,541][25689] Avg episode reward: [(0, '-38.143')] [2022-07-10 01:14:41,611][26022] Updated weights on worker 0-0, policy_version 502913 (0.00088) [2022-07-10 01:14:43,300][26022] Updated weights on worker 0-0, policy_version 502923 (0.00094) [2022-07-10 01:14:45,213][26022] Updated weights on worker 0-0, policy_version 502933 (0.00088) [2022-07-10 01:14:46,587][25689] Fps is (10 sec: 5806.6, 60 sec: 5711.0, 300 sec: 5690.3). Total num frames: 515011584. Throughput: 0: 6043.6. Samples: 515011954. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:46,587][25689] Avg episode reward: [(0, '-38.402')] [2022-07-10 01:14:47,003][26022] Updated weights on worker 0-0, policy_version 502943 (0.00078) [2022-07-10 01:14:48,850][26022] Updated weights on worker 0-0, policy_version 502953 (0.00104) [2022-07-10 01:14:50,284][26022] Updated weights on worker 0-0, policy_version 502963 (0.00089) [2022-07-10 01:14:51,614][25689] Fps is (10 sec: 5692.6, 60 sec: 5709.5, 300 sec: 5686.6). Total num frames: 515039232. Throughput: 0: 6050.3. Samples: 515046914. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:51,614][25689] Avg episode reward: [(0, '-38.524')] [2022-07-10 01:14:52,199][26022] Updated weights on worker 0-0, policy_version 502973 (0.00091) [2022-07-10 01:14:54,099][26022] Updated weights on worker 0-0, policy_version 502983 (0.00088) [2022-07-10 01:14:55,749][26022] Updated weights on worker 0-0, policy_version 502993 (0.00088) [2022-07-10 01:14:56,633][25689] Fps is (10 sec: 5606.0, 60 sec: 5725.3, 300 sec: 5683.4). Total num frames: 515067904. Throughput: 0: 5182.1. Samples: 515064370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:14:56,634][25689] Avg episode reward: [(0, '-37.036')] [2022-07-10 01:14:57,606][26022] Updated weights on worker 0-0, policy_version 503003 (0.00088) [2022-07-10 01:14:59,385][26022] Updated weights on worker 0-0, policy_version 503013 (0.00091) [2022-07-10 01:15:01,077][26022] Updated weights on worker 0-0, policy_version 503023 (0.00092) [2022-07-10 01:15:01,639][25689] Fps is (10 sec: 5924.1, 60 sec: 5725.1, 300 sec: 5698.8). Total num frames: 515098624. Throughput: 0: 6038.8. Samples: 515098930. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:01,641][25689] Avg episode reward: [(0, '-37.603')] [2022-07-10 01:15:03,294][26022] Updated weights on worker 0-0, policy_version 503033 (0.00088) [2022-07-10 01:15:05,006][26022] Updated weights on worker 0-0, policy_version 503043 (0.00080) [2022-07-10 01:15:06,718][25689] Fps is (10 sec: 5584.6, 60 sec: 5737.6, 300 sec: 5691.2). Total num frames: 515124224. Throughput: 0: 5923.8. Samples: 515131208. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:06,718][25689] Avg episode reward: [(0, '-37.639')] [2022-07-10 01:15:06,825][26022] Updated weights on worker 0-0, policy_version 503053 (0.00087) [2022-07-10 01:15:08,739][26022] Updated weights on worker 0-0, policy_version 503063 (0.00084) [2022-07-10 01:15:10,374][26022] Updated weights on worker 0-0, policy_version 503073 (0.00053) [2022-07-10 01:15:11,728][25689] Fps is (10 sec: 5278.1, 60 sec: 5686.1, 300 sec: 5684.9). Total num frames: 515151872. Throughput: 0: 5041.7. Samples: 515148322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:11,728][25689] Avg episode reward: [(0, '-37.373')] [2022-07-10 01:15:12,309][26022] Updated weights on worker 0-0, policy_version 503083 (0.00085) [2022-07-10 01:15:14,043][26022] Updated weights on worker 0-0, policy_version 503093 (0.00081) [2022-07-10 01:15:15,766][26022] Updated weights on worker 0-0, policy_version 503103 (0.00084) [2022-07-10 01:15:16,756][25689] Fps is (10 sec: 5814.5, 60 sec: 5718.0, 300 sec: 5695.0). Total num frames: 515182592. Throughput: 0: 5877.0. Samples: 515182634. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:16,757][25689] Avg episode reward: [(0, '-36.138')] [2022-07-10 01:15:17,757][26022] Updated weights on worker 0-0, policy_version 503113 (0.00091) [2022-07-10 01:15:19,344][26022] Updated weights on worker 0-0, policy_version 503123 (0.00085) [2022-07-10 01:15:21,194][26022] Updated weights on worker 0-0, policy_version 503133 (0.00084) [2022-07-10 01:15:21,764][25689] Fps is (10 sec: 5917.7, 60 sec: 5718.3, 300 sec: 5693.4). Total num frames: 515211264. Throughput: 0: 5864.4. Samples: 515216950. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:21,764][25689] Avg episode reward: [(0, '-36.008')] [2022-07-10 01:15:23,009][26022] Updated weights on worker 0-0, policy_version 503143 (0.00077) [2022-07-10 01:15:24,666][26022] Updated weights on worker 0-0, policy_version 503153 (0.00089) [2022-07-10 01:15:26,567][26022] Updated weights on worker 0-0, policy_version 503163 (0.00090) [2022-07-10 01:15:26,855][25689] Fps is (10 sec: 5880.9, 60 sec: 5751.1, 300 sec: 5695.7). Total num frames: 515241984. Throughput: 0: 5116.2. Samples: 515234236. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:26,856][25689] Avg episode reward: [(0, '-37.028')] [2022-07-10 01:15:28,286][26022] Updated weights on worker 0-0, policy_version 503173 (0.00091) [2022-07-10 01:15:29,934][26022] Updated weights on worker 0-0, policy_version 503183 (0.00098) [2022-07-10 01:15:31,873][25689] Fps is (10 sec: 5672.6, 60 sec: 5701.3, 300 sec: 5688.6). Total num frames: 515268608. Throughput: 0: 5975.9. Samples: 515268708. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:31,873][25689] Avg episode reward: [(0, '-37.225')] [2022-07-10 01:15:32,119][26022] Updated weights on worker 0-0, policy_version 503193 (0.00086) [2022-07-10 01:15:33,732][26022] Updated weights on worker 0-0, policy_version 503203 (0.00084) [2022-07-10 01:15:35,439][26022] Updated weights on worker 0-0, policy_version 503213 (0.00082) [2022-07-10 01:15:36,972][25689] Fps is (10 sec: 5465.9, 60 sec: 5692.9, 300 sec: 5694.1). Total num frames: 515297280. Throughput: 0: 5949.2. Samples: 515302902. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:36,973][25689] Avg episode reward: [(0, '-37.141')] [2022-07-10 01:15:37,369][26022] Updated weights on worker 0-0, policy_version 503223 (0.00087) [2022-07-10 01:15:39,112][26022] Updated weights on worker 0-0, policy_version 503233 (0.00083) [2022-07-10 01:15:40,903][26022] Updated weights on worker 0-0, policy_version 503243 (0.00086) [2022-07-10 01:15:42,010][25689] Fps is (10 sec: 5757.7, 60 sec: 5706.8, 300 sec: 5691.8). Total num frames: 515326976. Throughput: 0: 5099.9. Samples: 515320204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:42,011][25689] Avg episode reward: [(0, '-36.717')] [2022-07-10 01:15:42,703][26022] Updated weights on worker 0-0, policy_version 503253 (0.00053) [2022-07-10 01:15:44,423][26022] Updated weights on worker 0-0, policy_version 503263 (0.00087) [2022-07-10 01:15:46,266][26022] Updated weights on worker 0-0, policy_version 503273 (0.00093) [2022-07-10 01:15:47,075][25689] Fps is (10 sec: 5777.3, 60 sec: 5688.1, 300 sec: 5697.8). Total num frames: 515355648. Throughput: 0: 5949.0. Samples: 515354522. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:47,076][25689] Avg episode reward: [(0, '-36.283')] [2022-07-10 01:15:47,680][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:15:47,693][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000503281_515359744.pth [2022-07-10 01:15:47,694][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000501279_513309696.pth [2022-07-10 01:15:47,695][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000503281_515359744.pth.milestone [2022-07-10 01:15:48,073][26022] Updated weights on worker 0-0, policy_version 503283 (0.00087) [2022-07-10 01:15:49,763][26022] Updated weights on worker 0-0, policy_version 503293 (0.00102) [2022-07-10 01:15:51,527][26022] Updated weights on worker 0-0, policy_version 503303 (0.00083) [2022-07-10 01:15:52,099][25689] Fps is (10 sec: 5684.1, 60 sec: 5705.4, 300 sec: 5693.9). Total num frames: 515384320. Throughput: 0: 5952.2. Samples: 515389096. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:52,099][25689] Avg episode reward: [(0, '-35.529')] [2022-07-10 01:15:53,397][26022] Updated weights on worker 0-0, policy_version 503313 (0.00085) [2022-07-10 01:15:55,222][26022] Updated weights on worker 0-0, policy_version 503323 (0.00091) [2022-07-10 01:15:57,088][26022] Updated weights on worker 0-0, policy_version 503333 (0.00094) [2022-07-10 01:15:57,174][25689] Fps is (10 sec: 5678.4, 60 sec: 5700.1, 300 sec: 5689.3). Total num frames: 515412992. Throughput: 0: 5116.7. Samples: 515406268. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:15:57,174][25689] Avg episode reward: [(0, '-35.573')] [2022-07-10 01:15:58,671][26022] Updated weights on worker 0-0, policy_version 503343 (0.00087) [2022-07-10 01:16:00,499][26022] Updated weights on worker 0-0, policy_version 503353 (0.00093) [2022-07-10 01:16:02,210][25689] Fps is (10 sec: 5468.5, 60 sec: 5629.6, 300 sec: 5689.9). Total num frames: 515439616. Throughput: 0: 5954.6. Samples: 515440488. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:02,211][25689] Avg episode reward: [(0, '-35.074')] [2022-07-10 01:16:02,694][26022] Updated weights on worker 0-0, policy_version 503363 (0.00084) [2022-07-10 01:16:04,483][26022] Updated weights on worker 0-0, policy_version 503373 (0.00094) [2022-07-10 01:16:06,411][26022] Updated weights on worker 0-0, policy_version 503383 (0.00084) [2022-07-10 01:16:07,297][25689] Fps is (10 sec: 5563.3, 60 sec: 5696.5, 300 sec: 5688.5). Total num frames: 515469312. Throughput: 0: 5847.2. Samples: 515472764. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:07,299][25689] Avg episode reward: [(0, '-35.819')] [2022-07-10 01:16:08,100][26022] Updated weights on worker 0-0, policy_version 503393 (0.00090) [2022-07-10 01:16:09,755][26022] Updated weights on worker 0-0, policy_version 503403 (0.00087) [2022-07-10 01:16:11,754][26022] Updated weights on worker 0-0, policy_version 503413 (0.00087) [2022-07-10 01:16:12,303][25689] Fps is (10 sec: 5682.0, 60 sec: 5696.9, 300 sec: 5693.1). Total num frames: 515496960. Throughput: 0: 5846.5. Samples: 515507218. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:12,303][25689] Avg episode reward: [(0, '-35.110')] [2022-07-10 01:16:13,211][26022] Updated weights on worker 0-0, policy_version 503423 (0.00085) [2022-07-10 01:16:15,447][26022] Updated weights on worker 0-0, policy_version 503433 (0.00085) [2022-07-10 01:16:17,148][26022] Updated weights on worker 0-0, policy_version 503443 (0.00089) [2022-07-10 01:16:17,326][25689] Fps is (10 sec: 5717.8, 60 sec: 5680.4, 300 sec: 5690.5). Total num frames: 515526656. Throughput: 0: 5869.3. Samples: 515524548. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:17,327][25689] Avg episode reward: [(0, '-35.309')] [2022-07-10 01:16:18,813][26022] Updated weights on worker 0-0, policy_version 503453 (0.00098) [2022-07-10 01:16:20,763][26022] Updated weights on worker 0-0, policy_version 503463 (0.00083) [2022-07-10 01:16:22,264][26022] Updated weights on worker 0-0, policy_version 503473 (0.00081) [2022-07-10 01:16:22,363][25689] Fps is (10 sec: 5903.6, 60 sec: 5694.6, 300 sec: 5698.9). Total num frames: 515556352. Throughput: 0: 5876.0. Samples: 515558904. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:22,367][25689] Avg episode reward: [(0, '-35.575')] [2022-07-10 01:16:24,254][26022] Updated weights on worker 0-0, policy_version 503483 (0.00088) [2022-07-10 01:16:25,962][26022] Updated weights on worker 0-0, policy_version 503493 (0.00095) [2022-07-10 01:16:27,487][25689] Fps is (10 sec: 5744.4, 60 sec: 5657.8, 300 sec: 5693.8). Total num frames: 515585024. Throughput: 0: 5965.8. Samples: 515593212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:27,488][25689] Avg episode reward: [(0, '-35.734')] [2022-07-10 01:16:27,679][26022] Updated weights on worker 0-0, policy_version 503503 (0.00086) [2022-07-10 01:16:29,684][26022] Updated weights on worker 0-0, policy_version 503513 (0.00085) [2022-07-10 01:16:31,379][26022] Updated weights on worker 0-0, policy_version 503523 (0.00429) [2022-07-10 01:16:32,493][25689] Fps is (10 sec: 5660.4, 60 sec: 5692.6, 300 sec: 5690.4). Total num frames: 515613696. Throughput: 0: 5095.0. Samples: 515610090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:32,494][25689] Avg episode reward: [(0, '-35.136')] [2022-07-10 01:16:33,128][26022] Updated weights on worker 0-0, policy_version 503533 (0.00086) [2022-07-10 01:16:35,015][26022] Updated weights on worker 0-0, policy_version 503543 (0.00090) [2022-07-10 01:16:36,611][26022] Updated weights on worker 0-0, policy_version 503553 (0.00099) [2022-07-10 01:16:37,512][25689] Fps is (10 sec: 5618.0, 60 sec: 5683.3, 300 sec: 5688.3). Total num frames: 515641344. Throughput: 0: 5947.3. Samples: 515644598. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:37,512][25689] Avg episode reward: [(0, '-35.777')] [2022-07-10 01:16:38,485][26022] Updated weights on worker 0-0, policy_version 503563 (0.00081) [2022-07-10 01:16:40,319][26022] Updated weights on worker 0-0, policy_version 503573 (0.00089) [2022-07-10 01:16:42,145][26022] Updated weights on worker 0-0, policy_version 503583 (0.00090) [2022-07-10 01:16:42,550][25689] Fps is (10 sec: 5600.6, 60 sec: 5666.4, 300 sec: 5685.2). Total num frames: 515670016. Throughput: 0: 5952.4. Samples: 515679064. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:42,550][25689] Avg episode reward: [(0, '-37.061')] [2022-07-10 01:16:43,937][26022] Updated weights on worker 0-0, policy_version 503593 (0.00085) [2022-07-10 01:16:45,778][26022] Updated weights on worker 0-0, policy_version 503603 (0.00092) [2022-07-10 01:16:47,495][26022] Updated weights on worker 0-0, policy_version 503613 (0.00088) [2022-07-10 01:16:47,651][25689] Fps is (10 sec: 5757.0, 60 sec: 5679.9, 300 sec: 5687.1). Total num frames: 515699712. Throughput: 0: 5111.1. Samples: 515696270. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 01:16:47,651][25689] Avg episode reward: [(0, '-36.528')] [2022-07-10 01:16:49,210][26022] Updated weights on worker 0-0, policy_version 503623 (0.00089) [2022-07-10 01:16:51,058][26022] Updated weights on worker 0-0, policy_version 503633 (0.00086) [2022-07-10 01:16:52,672][25689] Fps is (10 sec: 5867.4, 60 sec: 5697.0, 300 sec: 5694.1). Total num frames: 515729408. Throughput: 0: 5974.9. Samples: 515730654. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:16:52,672][25689] Avg episode reward: [(0, '-36.863')] [2022-07-10 01:16:52,731][26022] Updated weights on worker 0-0, policy_version 503643 (0.00085) [2022-07-10 01:16:54,742][26022] Updated weights on worker 0-0, policy_version 503653 (0.00088) [2022-07-10 01:16:56,270][26022] Updated weights on worker 0-0, policy_version 503663 (0.00090) [2022-07-10 01:16:57,687][25689] Fps is (10 sec: 5815.5, 60 sec: 5702.7, 300 sec: 5687.4). Total num frames: 515758080. Throughput: 0: 5990.8. Samples: 515765464. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:16:57,688][25689] Avg episode reward: [(0, '-36.961')] [2022-07-10 01:16:58,320][26022] Updated weights on worker 0-0, policy_version 503673 (0.00095) [2022-07-10 01:16:59,760][26022] Updated weights on worker 0-0, policy_version 503683 (0.00091) [2022-07-10 01:17:02,098][26022] Updated weights on worker 0-0, policy_version 503693 (0.00078) [2022-07-10 01:17:02,701][25689] Fps is (10 sec: 5411.3, 60 sec: 5687.9, 300 sec: 5689.7). Total num frames: 515783680. Throughput: 0: 5134.9. Samples: 515782542. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:02,702][25689] Avg episode reward: [(0, '-37.574')] [2022-07-10 01:17:03,830][26022] Updated weights on worker 0-0, policy_version 503703 (0.00085) [2022-07-10 01:17:05,912][26022] Updated weights on worker 0-0, policy_version 503713 (0.00112) [2022-07-10 01:17:07,436][26022] Updated weights on worker 0-0, policy_version 503723 (0.00087) [2022-07-10 01:17:07,808][25689] Fps is (10 sec: 5463.6, 60 sec: 5686.0, 300 sec: 5688.5). Total num frames: 515813376. Throughput: 0: 5872.0. Samples: 515814634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:07,810][25689] Avg episode reward: [(0, '-37.273')] [2022-07-10 01:17:09,472][26022] Updated weights on worker 0-0, policy_version 503733 (0.00088) [2022-07-10 01:17:11,026][26022] Updated weights on worker 0-0, policy_version 503743 (0.00086) [2022-07-10 01:17:12,823][25689] Fps is (10 sec: 5665.5, 60 sec: 5685.1, 300 sec: 5686.2). Total num frames: 515841024. Throughput: 0: 5867.1. Samples: 515848882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:12,823][25689] Avg episode reward: [(0, '-37.120')] [2022-07-10 01:17:13,141][26022] Updated weights on worker 0-0, policy_version 503753 (0.00086) [2022-07-10 01:17:14,531][26022] Updated weights on worker 0-0, policy_version 503763 (0.00095) [2022-07-10 01:17:16,708][26022] Updated weights on worker 0-0, policy_version 503773 (0.00093) [2022-07-10 01:17:17,857][25689] Fps is (10 sec: 5706.3, 60 sec: 5684.1, 300 sec: 5689.7). Total num frames: 515870720. Throughput: 0: 4981.3. Samples: 515865938. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:17,858][25689] Avg episode reward: [(0, '-38.366')] [2022-07-10 01:17:18,217][26022] Updated weights on worker 0-0, policy_version 503783 (0.00085) [2022-07-10 01:17:20,422][26022] Updated weights on worker 0-0, policy_version 503793 (0.00087) [2022-07-10 01:17:21,948][26022] Updated weights on worker 0-0, policy_version 503803 (0.00083) [2022-07-10 01:17:22,904][25689] Fps is (10 sec: 5789.7, 60 sec: 5666.2, 300 sec: 5688.4). Total num frames: 515899392. Throughput: 0: 5798.2. Samples: 515899682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:22,905][25689] Avg episode reward: [(0, '-39.298')] [2022-07-10 01:17:23,951][26022] Updated weights on worker 0-0, policy_version 503813 (0.00091) [2022-07-10 01:17:25,642][26022] Updated weights on worker 0-0, policy_version 503823 (0.00092) [2022-07-10 01:17:27,485][26022] Updated weights on worker 0-0, policy_version 503833 (0.00089) [2022-07-10 01:17:28,005][25689] Fps is (10 sec: 5549.8, 60 sec: 5651.4, 300 sec: 5686.9). Total num frames: 515927040. Throughput: 0: 5899.5. Samples: 515933788. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:28,007][25689] Avg episode reward: [(0, '-39.855')] [2022-07-10 01:17:29,248][26022] Updated weights on worker 0-0, policy_version 503843 (0.00096) [2022-07-10 01:17:31,196][26022] Updated weights on worker 0-0, policy_version 503853 (0.00085) [2022-07-10 01:17:32,957][26022] Updated weights on worker 0-0, policy_version 503863 (0.00093) [2022-07-10 01:17:33,023][25689] Fps is (10 sec: 5566.0, 60 sec: 5650.4, 300 sec: 5687.4). Total num frames: 515955712. Throughput: 0: 5038.5. Samples: 515950660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:33,023][25689] Avg episode reward: [(0, '-40.739')] [2022-07-10 01:17:34,795][26022] Updated weights on worker 0-0, policy_version 503873 (0.00091) [2022-07-10 01:17:36,666][26022] Updated weights on worker 0-0, policy_version 503883 (0.00086) [2022-07-10 01:17:38,045][25689] Fps is (10 sec: 5712.0, 60 sec: 5667.0, 300 sec: 5680.2). Total num frames: 515984384. Throughput: 0: 5882.1. Samples: 515984682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:38,047][25689] Avg episode reward: [(0, '-40.085')] [2022-07-10 01:17:38,407][26022] Updated weights on worker 0-0, policy_version 503893 (0.00093) [2022-07-10 01:17:40,070][26022] Updated weights on worker 0-0, policy_version 503903 (0.00083) [2022-07-10 01:17:42,046][26022] Updated weights on worker 0-0, policy_version 503913 (0.00084) [2022-07-10 01:17:43,052][25689] Fps is (10 sec: 5819.9, 60 sec: 5686.8, 300 sec: 5688.0). Total num frames: 516014080. Throughput: 0: 5916.3. Samples: 516018880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:43,052][25689] Avg episode reward: [(0, '-39.979')] [2022-07-10 01:17:43,725][26022] Updated weights on worker 0-0, policy_version 503923 (0.00081) [2022-07-10 01:17:45,520][26022] Updated weights on worker 0-0, policy_version 503933 (0.00092) [2022-07-10 01:17:47,200][26022] Updated weights on worker 0-0, policy_version 503943 (0.00106) [2022-07-10 01:17:47,745][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:17:47,760][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000503945_516039680.pth [2022-07-10 01:17:47,761][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000501947_513993728.pth [2022-07-10 01:17:48,107][25689] Fps is (10 sec: 5698.7, 60 sec: 5657.2, 300 sec: 5684.1). Total num frames: 516041728. Throughput: 0: 5086.7. Samples: 516036040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:48,108][25689] Avg episode reward: [(0, '-38.913')] [2022-07-10 01:17:49,166][26022] Updated weights on worker 0-0, policy_version 503953 (0.00083) [2022-07-10 01:17:50,935][26022] Updated weights on worker 0-0, policy_version 503963 (0.00079) [2022-07-10 01:17:52,700][26022] Updated weights on worker 0-0, policy_version 503973 (0.00091) [2022-07-10 01:17:53,155][25689] Fps is (10 sec: 5574.6, 60 sec: 5637.8, 300 sec: 5686.9). Total num frames: 516070400. Throughput: 0: 5936.9. Samples: 516070180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:53,155][25689] Avg episode reward: [(0, '-39.444')] [2022-07-10 01:17:54,538][26022] Updated weights on worker 0-0, policy_version 503983 (0.00085) [2022-07-10 01:17:56,235][26022] Updated weights on worker 0-0, policy_version 503993 (0.00082) [2022-07-10 01:17:57,952][26022] Updated weights on worker 0-0, policy_version 504003 (0.00085) [2022-07-10 01:17:58,167][25689] Fps is (10 sec: 5802.4, 60 sec: 5655.1, 300 sec: 5684.2). Total num frames: 516100096. Throughput: 0: 5954.9. Samples: 516104506. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:17:58,167][25689] Avg episode reward: [(0, '-38.910')] [2022-07-10 01:17:59,867][26022] Updated weights on worker 0-0, policy_version 504013 (0.00559) [2022-07-10 01:18:01,457][26022] Updated weights on worker 0-0, policy_version 504023 (0.00085) [2022-07-10 01:18:03,197][25689] Fps is (10 sec: 5404.3, 60 sec: 5636.6, 300 sec: 5681.9). Total num frames: 516124672. Throughput: 0: 5109.7. Samples: 516121816. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:03,198][25689] Avg episode reward: [(0, '-39.394')] [2022-07-10 01:18:03,966][26022] Updated weights on worker 0-0, policy_version 504033 (0.00086) [2022-07-10 01:18:05,580][26022] Updated weights on worker 0-0, policy_version 504043 (0.00094) [2022-07-10 01:18:07,530][26022] Updated weights on worker 0-0, policy_version 504053 (0.00088) [2022-07-10 01:18:08,256][25689] Fps is (10 sec: 5480.8, 60 sec: 5658.1, 300 sec: 5684.8). Total num frames: 516155392. Throughput: 0: 5839.0. Samples: 516153686. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:08,256][25689] Avg episode reward: [(0, '-39.726')] [2022-07-10 01:18:08,972][26022] Updated weights on worker 0-0, policy_version 504063 (0.00084) [2022-07-10 01:18:11,074][26022] Updated weights on worker 0-0, policy_version 504073 (0.00090) [2022-07-10 01:18:12,780][26022] Updated weights on worker 0-0, policy_version 504083 (0.00094) [2022-07-10 01:18:13,266][25689] Fps is (10 sec: 5695.3, 60 sec: 5641.6, 300 sec: 5678.3). Total num frames: 516182016. Throughput: 0: 5863.5. Samples: 516188100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:13,266][25689] Avg episode reward: [(0, '-39.380')] [2022-07-10 01:18:14,653][26022] Updated weights on worker 0-0, policy_version 504093 (0.00089) [2022-07-10 01:18:16,394][26022] Updated weights on worker 0-0, policy_version 504103 (0.00087) [2022-07-10 01:18:18,145][26022] Updated weights on worker 0-0, policy_version 504113 (0.00093) [2022-07-10 01:18:18,273][25689] Fps is (10 sec: 5724.7, 60 sec: 5661.1, 300 sec: 5685.6). Total num frames: 516212736. Throughput: 0: 5020.2. Samples: 516205442. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:18,273][25689] Avg episode reward: [(0, '-38.465')] [2022-07-10 01:18:20,036][26022] Updated weights on worker 0-0, policy_version 504123 (0.00072) [2022-07-10 01:18:21,820][26022] Updated weights on worker 0-0, policy_version 504133 (0.00197) [2022-07-10 01:18:23,301][25689] Fps is (10 sec: 5816.1, 60 sec: 5645.8, 300 sec: 5686.0). Total num frames: 516240384. Throughput: 0: 5862.2. Samples: 516239672. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:23,302][25689] Avg episode reward: [(0, '-37.021')] [2022-07-10 01:18:23,530][26022] Updated weights on worker 0-0, policy_version 504143 (0.00098) [2022-07-10 01:18:25,380][26022] Updated weights on worker 0-0, policy_version 504153 (0.00091) [2022-07-10 01:18:27,179][26022] Updated weights on worker 0-0, policy_version 504163 (0.00090) [2022-07-10 01:18:28,347][25689] Fps is (10 sec: 5489.0, 60 sec: 5651.1, 300 sec: 5678.9). Total num frames: 516268032. Throughput: 0: 5975.7. Samples: 516273744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:28,347][25689] Avg episode reward: [(0, '-35.782')] [2022-07-10 01:18:29,001][26022] Updated weights on worker 0-0, policy_version 504173 (0.00087) [2022-07-10 01:18:30,720][26022] Updated weights on worker 0-0, policy_version 504183 (0.00090) [2022-07-10 01:18:32,365][26022] Updated weights on worker 0-0, policy_version 504193 (0.00079) [2022-07-10 01:18:33,363][25689] Fps is (10 sec: 5698.9, 60 sec: 5668.1, 300 sec: 5682.7). Total num frames: 516297728. Throughput: 0: 5115.6. Samples: 516290912. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:33,364][25689] Avg episode reward: [(0, '-35.602')] [2022-07-10 01:18:34,413][26022] Updated weights on worker 0-0, policy_version 504203 (0.00094) [2022-07-10 01:18:35,921][26022] Updated weights on worker 0-0, policy_version 504213 (0.00083) [2022-07-10 01:18:38,021][26022] Updated weights on worker 0-0, policy_version 504223 (0.00089) [2022-07-10 01:18:38,370][25689] Fps is (10 sec: 5823.2, 60 sec: 5669.5, 300 sec: 5690.7). Total num frames: 516326400. Throughput: 0: 5973.9. Samples: 516325502. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:38,371][25689] Avg episode reward: [(0, '-35.352')] [2022-07-10 01:18:39,483][26022] Updated weights on worker 0-0, policy_version 504233 (0.00081) [2022-07-10 01:18:41,489][26022] Updated weights on worker 0-0, policy_version 504243 (0.00085) [2022-07-10 01:18:43,175][26022] Updated weights on worker 0-0, policy_version 504253 (0.00085) [2022-07-10 01:18:43,376][25689] Fps is (10 sec: 5727.1, 60 sec: 5652.6, 300 sec: 5681.3). Total num frames: 516355072. Throughput: 0: 5984.9. Samples: 516359820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:43,377][25689] Avg episode reward: [(0, '-34.606')] [2022-07-10 01:18:45,043][26022] Updated weights on worker 0-0, policy_version 504263 (0.00092) [2022-07-10 01:18:46,849][26022] Updated weights on worker 0-0, policy_version 504273 (0.00051) [2022-07-10 01:18:48,463][25689] Fps is (10 sec: 5681.4, 60 sec: 5666.6, 300 sec: 5683.3). Total num frames: 516383744. Throughput: 0: 5136.5. Samples: 516377074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:48,464][25689] Avg episode reward: [(0, '-35.431')] [2022-07-10 01:18:48,776][26022] Updated weights on worker 0-0, policy_version 504283 (0.00093) [2022-07-10 01:18:50,415][26022] Updated weights on worker 0-0, policy_version 504293 (0.00083) [2022-07-10 01:18:52,468][26022] Updated weights on worker 0-0, policy_version 504303 (0.00099) [2022-07-10 01:18:53,486][25689] Fps is (10 sec: 5773.4, 60 sec: 5685.9, 300 sec: 5689.9). Total num frames: 516413440. Throughput: 0: 5973.6. Samples: 516411116. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:53,486][25689] Avg episode reward: [(0, '-35.736')] [2022-07-10 01:18:53,971][26022] Updated weights on worker 0-0, policy_version 504313 (0.00089) [2022-07-10 01:18:55,899][26022] Updated weights on worker 0-0, policy_version 504323 (0.00084) [2022-07-10 01:18:57,588][26022] Updated weights on worker 0-0, policy_version 504333 (0.00092) [2022-07-10 01:18:58,534][25689] Fps is (10 sec: 5694.2, 60 sec: 5648.6, 300 sec: 5678.7). Total num frames: 516441088. Throughput: 0: 5960.2. Samples: 516445684. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:18:58,535][25689] Avg episode reward: [(0, '-35.829')] [2022-07-10 01:18:59,411][26022] Updated weights on worker 0-0, policy_version 504343 (0.00084) [2022-07-10 01:19:01,374][26022] Updated weights on worker 0-0, policy_version 504353 (0.00086) [2022-07-10 01:19:03,423][26022] Updated weights on worker 0-0, policy_version 504363 (0.00095) [2022-07-10 01:19:03,538][25689] Fps is (10 sec: 5398.8, 60 sec: 5685.0, 300 sec: 5686.1). Total num frames: 516467712. Throughput: 0: 5844.5. Samples: 516477658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:03,539][25689] Avg episode reward: [(0, '-35.890')] [2022-07-10 01:19:05,283][26022] Updated weights on worker 0-0, policy_version 504373 (0.00087) [2022-07-10 01:19:07,031][26022] Updated weights on worker 0-0, policy_version 504383 (0.00088) [2022-07-10 01:19:08,644][25689] Fps is (10 sec: 5469.7, 60 sec: 5646.7, 300 sec: 5677.3). Total num frames: 516496384. Throughput: 0: 5837.1. Samples: 516494868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:08,644][25689] Avg episode reward: [(0, '-35.497')] [2022-07-10 01:19:08,803][26022] Updated weights on worker 0-0, policy_version 504393 (0.00080) [2022-07-10 01:19:10,502][26022] Updated weights on worker 0-0, policy_version 504403 (0.00083) [2022-07-10 01:19:12,372][26022] Updated weights on worker 0-0, policy_version 504413 (0.00086) [2022-07-10 01:19:13,687][25689] Fps is (10 sec: 5751.6, 60 sec: 5694.4, 300 sec: 5680.1). Total num frames: 516526080. Throughput: 0: 5837.3. Samples: 516529034. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:13,687][25689] Avg episode reward: [(0, '-36.207')] [2022-07-10 01:19:14,033][26022] Updated weights on worker 0-0, policy_version 504423 (0.00085) [2022-07-10 01:19:16,036][26022] Updated weights on worker 0-0, policy_version 504433 (0.00095) [2022-07-10 01:19:17,671][26022] Updated weights on worker 0-0, policy_version 504443 (0.00098) [2022-07-10 01:19:18,719][25689] Fps is (10 sec: 5793.0, 60 sec: 5658.1, 300 sec: 5679.6). Total num frames: 516554752. Throughput: 0: 5827.9. Samples: 516563322. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:18,720][25689] Avg episode reward: [(0, '-35.400')] [2022-07-10 01:19:19,623][26022] Updated weights on worker 0-0, policy_version 504453 (0.00058) [2022-07-10 01:19:21,385][26022] Updated weights on worker 0-0, policy_version 504463 (0.00083) [2022-07-10 01:19:23,363][26022] Updated weights on worker 0-0, policy_version 504473 (0.00094) [2022-07-10 01:19:23,776][25689] Fps is (10 sec: 5683.5, 60 sec: 5672.4, 300 sec: 5680.1). Total num frames: 516583424. Throughput: 0: 5075.8. Samples: 516580380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:23,777][25689] Avg episode reward: [(0, '-35.088')] [2022-07-10 01:19:24,749][26022] Updated weights on worker 0-0, policy_version 504483 (0.00086) [2022-07-10 01:19:26,797][26022] Updated weights on worker 0-0, policy_version 504493 (0.00092) [2022-07-10 01:19:28,532][26022] Updated weights on worker 0-0, policy_version 504503 (0.00094) [2022-07-10 01:19:28,871][25689] Fps is (10 sec: 5649.0, 60 sec: 5684.8, 300 sec: 5675.4). Total num frames: 516612096. Throughput: 0: 5922.5. Samples: 516614662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:28,871][25689] Avg episode reward: [(0, '-35.101')] [2022-07-10 01:19:30,462][26022] Updated weights on worker 0-0, policy_version 504513 (0.00092) [2022-07-10 01:19:32,107][26022] Updated weights on worker 0-0, policy_version 504523 (0.00092) [2022-07-10 01:19:33,899][25689] Fps is (10 sec: 5665.0, 60 sec: 5666.8, 300 sec: 5675.0). Total num frames: 516640768. Throughput: 0: 5919.4. Samples: 516648678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:33,899][25689] Avg episode reward: [(0, '-34.600')] [2022-07-10 01:19:34,073][26022] Updated weights on worker 0-0, policy_version 504533 (0.00088) [2022-07-10 01:19:35,638][26022] Updated weights on worker 0-0, policy_version 504543 (0.00091) [2022-07-10 01:19:37,650][26022] Updated weights on worker 0-0, policy_version 504553 (0.00089) [2022-07-10 01:19:38,925][25689] Fps is (10 sec: 5805.4, 60 sec: 5681.9, 300 sec: 5678.1). Total num frames: 516670464. Throughput: 0: 5081.9. Samples: 516666004. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:38,925][25689] Avg episode reward: [(0, '-34.526')] [2022-07-10 01:19:39,274][26022] Updated weights on worker 0-0, policy_version 504563 (0.00087) [2022-07-10 01:19:41,397][26022] Updated weights on worker 0-0, policy_version 504573 (0.00085) [2022-07-10 01:19:42,887][26022] Updated weights on worker 0-0, policy_version 504583 (0.00087) [2022-07-10 01:19:43,945][25689] Fps is (10 sec: 5708.1, 60 sec: 5663.7, 300 sec: 5671.7). Total num frames: 516698112. Throughput: 0: 5948.8. Samples: 516700360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 01:19:43,945][25689] Avg episode reward: [(0, '-35.052')] [2022-07-10 01:19:44,865][26022] Updated weights on worker 0-0, policy_version 504593 (0.00084) [2022-07-10 01:19:46,441][26022] Updated weights on worker 0-0, policy_version 504603 (0.00090) [2022-07-10 01:19:47,876][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:19:47,889][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000504611_516721664.pth [2022-07-10 01:19:47,890][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000502612_514674688.pth [2022-07-10 01:19:48,436][26022] Updated weights on worker 0-0, policy_version 504613 (0.00157) [2022-07-10 01:19:49,015][25689] Fps is (10 sec: 5581.7, 60 sec: 5665.3, 300 sec: 5674.3). Total num frames: 516726784. Throughput: 0: 5949.2. Samples: 516734504. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:19:49,015][25689] Avg episode reward: [(0, '-35.913')] [2022-07-10 01:19:50,313][26022] Updated weights on worker 0-0, policy_version 504623 (0.00086) [2022-07-10 01:19:51,939][26022] Updated weights on worker 0-0, policy_version 504633 (0.00092) [2022-07-10 01:19:53,720][26022] Updated weights on worker 0-0, policy_version 504643 (0.00090) [2022-07-10 01:19:54,049][25689] Fps is (10 sec: 5675.4, 60 sec: 5647.3, 300 sec: 5674.1). Total num frames: 516755456. Throughput: 0: 5113.2. Samples: 516751710. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:19:54,049][25689] Avg episode reward: [(0, '-36.493')] [2022-07-10 01:19:55,664][26022] Updated weights on worker 0-0, policy_version 504653 (0.00085) [2022-07-10 01:19:57,327][26022] Updated weights on worker 0-0, policy_version 504663 (0.00092) [2022-07-10 01:19:58,942][26022] Updated weights on worker 0-0, policy_version 504673 (0.00091) [2022-07-10 01:19:59,141][25689] Fps is (10 sec: 5763.9, 60 sec: 5677.0, 300 sec: 5669.0). Total num frames: 516785152. Throughput: 0: 5929.1. Samples: 516785870. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:19:59,143][25689] Avg episode reward: [(0, '-37.362')] [2022-07-10 01:20:00,770][26022] Updated weights on worker 0-0, policy_version 504683 (0.00087) [2022-07-10 01:20:02,951][26022] Updated weights on worker 0-0, policy_version 504693 (0.00090) [2022-07-10 01:20:04,204][25689] Fps is (10 sec: 5445.0, 60 sec: 5654.6, 300 sec: 5669.3). Total num frames: 516810752. Throughput: 0: 5812.3. Samples: 516818114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:04,205][25689] Avg episode reward: [(0, '-36.365')] [2022-07-10 01:20:04,784][26022] Updated weights on worker 0-0, policy_version 504703 (0.00081) [2022-07-10 01:20:06,723][26022] Updated weights on worker 0-0, policy_version 504713 (0.00085) [2022-07-10 01:20:08,446][26022] Updated weights on worker 0-0, policy_version 504723 (0.00090) [2022-07-10 01:20:09,250][25689] Fps is (10 sec: 5470.3, 60 sec: 5677.1, 300 sec: 5675.5). Total num frames: 516840448. Throughput: 0: 4973.0. Samples: 516835132. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:09,250][25689] Avg episode reward: [(0, '-36.292')] [2022-07-10 01:20:10,454][26022] Updated weights on worker 0-0, policy_version 504733 (0.00086) [2022-07-10 01:20:11,918][26022] Updated weights on worker 0-0, policy_version 504743 (0.00084) [2022-07-10 01:20:13,983][26022] Updated weights on worker 0-0, policy_version 504753 (0.00095) [2022-07-10 01:20:14,272][25689] Fps is (10 sec: 5797.6, 60 sec: 5662.1, 300 sec: 5668.7). Total num frames: 516869120. Throughput: 0: 5823.8. Samples: 516869484. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:14,272][25689] Avg episode reward: [(0, '-36.068')] [2022-07-10 01:20:15,377][26022] Updated weights on worker 0-0, policy_version 504763 (0.00083) [2022-07-10 01:20:17,544][26022] Updated weights on worker 0-0, policy_version 504773 (0.00093) [2022-07-10 01:20:19,244][26022] Updated weights on worker 0-0, policy_version 504783 (0.00094) [2022-07-10 01:20:19,309][25689] Fps is (10 sec: 5700.5, 60 sec: 5661.7, 300 sec: 5668.1). Total num frames: 516897792. Throughput: 0: 5865.4. Samples: 516904164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:19,310][25689] Avg episode reward: [(0, '-36.102')] [2022-07-10 01:20:20,863][26022] Updated weights on worker 0-0, policy_version 504793 (0.00087) [2022-07-10 01:20:22,623][26022] Updated weights on worker 0-0, policy_version 504803 (0.00087) [2022-07-10 01:20:24,329][25689] Fps is (10 sec: 5803.7, 60 sec: 5682.1, 300 sec: 5666.1). Total num frames: 516927488. Throughput: 0: 5120.2. Samples: 516921156. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:24,329][25689] Avg episode reward: [(0, '-35.218')] [2022-07-10 01:20:24,584][26022] Updated weights on worker 0-0, policy_version 504813 (0.00092) [2022-07-10 01:20:26,348][26022] Updated weights on worker 0-0, policy_version 504823 (0.00081) [2022-07-10 01:20:28,194][26022] Updated weights on worker 0-0, policy_version 504833 (0.00081) [2022-07-10 01:20:29,375][25689] Fps is (10 sec: 5696.9, 60 sec: 5669.7, 300 sec: 5669.0). Total num frames: 516955136. Throughput: 0: 5977.5. Samples: 516955432. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:29,375][25689] Avg episode reward: [(0, '-35.036')] [2022-07-10 01:20:29,917][26022] Updated weights on worker 0-0, policy_version 504843 (0.00086) [2022-07-10 01:20:31,839][26022] Updated weights on worker 0-0, policy_version 504853 (0.00087) [2022-07-10 01:20:33,387][26022] Updated weights on worker 0-0, policy_version 504863 (0.00086) [2022-07-10 01:20:34,437][25689] Fps is (10 sec: 5571.5, 60 sec: 5666.5, 300 sec: 5669.7). Total num frames: 516983808. Throughput: 0: 5962.8. Samples: 516989728. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:34,438][25689] Avg episode reward: [(0, '-35.358')] [2022-07-10 01:20:35,347][26022] Updated weights on worker 0-0, policy_version 504873 (0.00086) [2022-07-10 01:20:37,051][26022] Updated weights on worker 0-0, policy_version 504883 (0.00097) [2022-07-10 01:20:38,755][26022] Updated weights on worker 0-0, policy_version 504893 (0.00089) [2022-07-10 01:20:39,447][25689] Fps is (10 sec: 5693.2, 60 sec: 5651.0, 300 sec: 5666.8). Total num frames: 517012480. Throughput: 0: 5107.9. Samples: 517007028. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:39,448][25689] Avg episode reward: [(0, '-35.894')] [2022-07-10 01:20:40,813][26022] Updated weights on worker 0-0, policy_version 504903 (0.00093) [2022-07-10 01:20:42,258][26022] Updated weights on worker 0-0, policy_version 504913 (0.00367) [2022-07-10 01:20:44,339][26022] Updated weights on worker 0-0, policy_version 504923 (0.00088) [2022-07-10 01:20:44,475][25689] Fps is (10 sec: 5713.1, 60 sec: 5667.3, 300 sec: 5667.5). Total num frames: 517041152. Throughput: 0: 5975.7. Samples: 517041542. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:44,475][25689] Avg episode reward: [(0, '-37.087')] [2022-07-10 01:20:45,815][26022] Updated weights on worker 0-0, policy_version 504933 (0.00086) [2022-07-10 01:20:47,901][26022] Updated weights on worker 0-0, policy_version 504943 (0.00091) [2022-07-10 01:20:49,602][25689] Fps is (10 sec: 5748.1, 60 sec: 5678.8, 300 sec: 5668.9). Total num frames: 517070848. Throughput: 0: 5957.8. Samples: 517075938. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:49,602][25689] Avg episode reward: [(0, '-36.618')] [2022-07-10 01:20:49,627][26022] Updated weights on worker 0-0, policy_version 504953 (0.00083) [2022-07-10 01:20:51,672][26022] Updated weights on worker 0-0, policy_version 504963 (0.00091) [2022-07-10 01:20:53,242][26022] Updated weights on worker 0-0, policy_version 504973 (0.00085) [2022-07-10 01:20:54,627][25689] Fps is (10 sec: 5648.4, 60 sec: 5662.8, 300 sec: 5666.5). Total num frames: 517098496. Throughput: 0: 5121.0. Samples: 517093118. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:54,628][25689] Avg episode reward: [(0, '-37.490')] [2022-07-10 01:20:55,220][26022] Updated weights on worker 0-0, policy_version 504983 (0.00090) [2022-07-10 01:20:56,727][26022] Updated weights on worker 0-0, policy_version 504993 (0.00088) [2022-07-10 01:20:58,683][26022] Updated weights on worker 0-0, policy_version 505003 (0.00091) [2022-07-10 01:20:59,675][25689] Fps is (10 sec: 5794.4, 60 sec: 5683.8, 300 sec: 5680.0). Total num frames: 517129216. Throughput: 0: 5952.1. Samples: 517127424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:20:59,676][25689] Avg episode reward: [(0, '-36.736')] [2022-07-10 01:21:00,416][26022] Updated weights on worker 0-0, policy_version 505013 (0.00091) [2022-07-10 01:21:02,772][26022] Updated weights on worker 0-0, policy_version 505023 (0.00086) [2022-07-10 01:21:04,390][26022] Updated weights on worker 0-0, policy_version 505033 (0.00090) [2022-07-10 01:21:04,692][25689] Fps is (10 sec: 5595.7, 60 sec: 5688.2, 300 sec: 5667.6). Total num frames: 517154816. Throughput: 0: 5837.4. Samples: 517159558. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:04,692][25689] Avg episode reward: [(0, '-36.782')] [2022-07-10 01:21:06,268][26022] Updated weights on worker 0-0, policy_version 505043 (0.00090) [2022-07-10 01:21:07,859][26022] Updated weights on worker 0-0, policy_version 505053 (0.00085) [2022-07-10 01:21:09,770][25689] Fps is (10 sec: 5375.9, 60 sec: 5668.2, 300 sec: 5669.6). Total num frames: 517183488. Throughput: 0: 5860.2. Samples: 517194130. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:09,771][25689] Avg episode reward: [(0, '-36.559')] [2022-07-10 01:21:09,903][26022] Updated weights on worker 0-0, policy_version 505063 (0.00088) [2022-07-10 01:21:11,399][26022] Updated weights on worker 0-0, policy_version 505073 (0.00084) [2022-07-10 01:21:13,454][26022] Updated weights on worker 0-0, policy_version 505083 (0.00094) [2022-07-10 01:21:14,807][25689] Fps is (10 sec: 5871.9, 60 sec: 5700.7, 300 sec: 5672.8). Total num frames: 517214208. Throughput: 0: 5865.6. Samples: 517211482. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:14,807][25689] Avg episode reward: [(0, '-35.923')] [2022-07-10 01:21:14,891][26022] Updated weights on worker 0-0, policy_version 505093 (0.00555) [2022-07-10 01:21:16,931][26022] Updated weights on worker 0-0, policy_version 505103 (0.00083) [2022-07-10 01:21:18,708][26022] Updated weights on worker 0-0, policy_version 505113 (0.00085) [2022-07-10 01:21:19,823][25689] Fps is (10 sec: 5704.1, 60 sec: 5668.7, 300 sec: 5662.9). Total num frames: 517240832. Throughput: 0: 5888.4. Samples: 517246068. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:19,824][25689] Avg episode reward: [(0, '-35.521')] [2022-07-10 01:21:20,264][26022] Updated weights on worker 0-0, policy_version 505123 (0.00091) [2022-07-10 01:21:22,179][26022] Updated weights on worker 0-0, policy_version 505133 (0.00086) [2022-07-10 01:21:23,862][26022] Updated weights on worker 0-0, policy_version 505143 (0.00091) [2022-07-10 01:21:24,844][25689] Fps is (10 sec: 5508.8, 60 sec: 5651.7, 300 sec: 5664.8). Total num frames: 517269504. Throughput: 0: 6008.3. Samples: 517280640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:24,845][25689] Avg episode reward: [(0, '-35.419')] [2022-07-10 01:21:25,698][26022] Updated weights on worker 0-0, policy_version 505153 (0.00084) [2022-07-10 01:21:27,612][26022] Updated weights on worker 0-0, policy_version 505163 (0.00087) [2022-07-10 01:21:29,210][26022] Updated weights on worker 0-0, policy_version 505173 (0.00090) [2022-07-10 01:21:29,967][25689] Fps is (10 sec: 5956.0, 60 sec: 5712.2, 300 sec: 5672.9). Total num frames: 517301248. Throughput: 0: 5133.0. Samples: 517297800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:29,968][25689] Avg episode reward: [(0, '-35.806')] [2022-07-10 01:21:31,414][26022] Updated weights on worker 0-0, policy_version 505183 (0.00086) [2022-07-10 01:21:32,750][26022] Updated weights on worker 0-0, policy_version 505193 (0.00083) [2022-07-10 01:21:34,800][26022] Updated weights on worker 0-0, policy_version 505203 (0.00085) [2022-07-10 01:21:34,972][25689] Fps is (10 sec: 5763.3, 60 sec: 5683.8, 300 sec: 5669.8). Total num frames: 517327872. Throughput: 0: 5966.8. Samples: 517331804. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:34,972][25689] Avg episode reward: [(0, '-34.486')] [2022-07-10 01:21:36,573][26022] Updated weights on worker 0-0, policy_version 505213 (0.00090) [2022-07-10 01:21:38,408][26022] Updated weights on worker 0-0, policy_version 505223 (0.00092) [2022-07-10 01:21:40,061][25689] Fps is (10 sec: 5579.3, 60 sec: 5693.2, 300 sec: 5672.2). Total num frames: 517357568. Throughput: 0: 5925.3. Samples: 517365984. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:40,062][25689] Avg episode reward: [(0, '-34.731')] [2022-07-10 01:21:40,252][26022] Updated weights on worker 0-0, policy_version 505233 (0.00092) [2022-07-10 01:21:41,977][26022] Updated weights on worker 0-0, policy_version 505243 (0.00091) [2022-07-10 01:21:43,530][26022] Updated weights on worker 0-0, policy_version 505253 (0.00088) [2022-07-10 01:21:45,110][25689] Fps is (10 sec: 5757.3, 60 sec: 5691.2, 300 sec: 5669.8). Total num frames: 517386240. Throughput: 0: 5042.1. Samples: 517382822. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:45,111][25689] Avg episode reward: [(0, '-35.094')] [2022-07-10 01:21:45,659][26022] Updated weights on worker 0-0, policy_version 505263 (0.00095) [2022-07-10 01:21:47,447][26022] Updated weights on worker 0-0, policy_version 505273 (0.00799) [2022-07-10 01:21:47,895][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:21:47,913][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000505276_517402624.pth [2022-07-10 01:21:47,914][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000503281_515359744.pth [2022-07-10 01:21:49,058][26022] Updated weights on worker 0-0, policy_version 505283 (0.00086) [2022-07-10 01:21:50,240][25689] Fps is (10 sec: 5734.4, 60 sec: 5690.9, 300 sec: 5667.7). Total num frames: 517415936. Throughput: 0: 5904.6. Samples: 517417504. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:50,241][25689] Avg episode reward: [(0, '-34.863')] [2022-07-10 01:21:50,808][26022] Updated weights on worker 0-0, policy_version 505293 (0.00088) [2022-07-10 01:21:52,704][26022] Updated weights on worker 0-0, policy_version 505303 (0.00086) [2022-07-10 01:21:54,619][26022] Updated weights on worker 0-0, policy_version 505313 (0.00085) [2022-07-10 01:21:55,269][25689] Fps is (10 sec: 5745.7, 60 sec: 5707.5, 300 sec: 5667.5). Total num frames: 517444608. Throughput: 0: 5904.1. Samples: 517451638. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:21:55,269][25689] Avg episode reward: [(0, '-34.492')] [2022-07-10 01:21:56,403][26022] Updated weights on worker 0-0, policy_version 505323 (0.00090) [2022-07-10 01:21:58,082][26022] Updated weights on worker 0-0, policy_version 505333 (0.00083) [2022-07-10 01:21:59,941][26022] Updated weights on worker 0-0, policy_version 505343 (0.00085) [2022-07-10 01:22:00,340][25689] Fps is (10 sec: 5677.5, 60 sec: 5671.5, 300 sec: 5676.7). Total num frames: 517473280. Throughput: 0: 5075.0. Samples: 517468898. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:00,341][25689] Avg episode reward: [(0, '-34.674')] [2022-07-10 01:22:01,529][26022] Updated weights on worker 0-0, policy_version 505353 (0.00089) [2022-07-10 01:22:03,899][26022] Updated weights on worker 0-0, policy_version 505363 (0.01318) [2022-07-10 01:22:05,386][25689] Fps is (10 sec: 5465.6, 60 sec: 5685.7, 300 sec: 5667.5). Total num frames: 517499904. Throughput: 0: 5841.4. Samples: 517501260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:05,386][25689] Avg episode reward: [(0, '-35.653')] [2022-07-10 01:22:05,871][26022] Updated weights on worker 0-0, policy_version 505373 (0.00093) [2022-07-10 01:22:07,483][26022] Updated weights on worker 0-0, policy_version 505383 (0.00085) [2022-07-10 01:22:09,294][26022] Updated weights on worker 0-0, policy_version 505393 (0.00082) [2022-07-10 01:22:10,454][25689] Fps is (10 sec: 5568.5, 60 sec: 5703.5, 300 sec: 5673.4). Total num frames: 517529600. Throughput: 0: 5834.5. Samples: 517535444. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:10,455][25689] Avg episode reward: [(0, '-35.912')] [2022-07-10 01:22:10,997][26022] Updated weights on worker 0-0, policy_version 505403 (0.00087) [2022-07-10 01:22:12,834][26022] Updated weights on worker 0-0, policy_version 505413 (0.00088) [2022-07-10 01:22:14,795][26022] Updated weights on worker 0-0, policy_version 505423 (0.00083) [2022-07-10 01:22:15,467][25689] Fps is (10 sec: 5485.0, 60 sec: 5621.3, 300 sec: 5660.0). Total num frames: 517555200. Throughput: 0: 4990.1. Samples: 517552430. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:15,468][25689] Avg episode reward: [(0, '-37.598')] [2022-07-10 01:22:16,281][26022] Updated weights on worker 0-0, policy_version 505433 (0.00095) [2022-07-10 01:22:18,523][26022] Updated weights on worker 0-0, policy_version 505443 (0.00086) [2022-07-10 01:22:19,898][26022] Updated weights on worker 0-0, policy_version 505453 (0.00087) [2022-07-10 01:22:20,557][25689] Fps is (10 sec: 5676.0, 60 sec: 5698.8, 300 sec: 5669.5). Total num frames: 517586944. Throughput: 0: 5834.6. Samples: 517586856. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:20,558][25689] Avg episode reward: [(0, '-38.214')] [2022-07-10 01:22:21,989][26022] Updated weights on worker 0-0, policy_version 505463 (0.00087) [2022-07-10 01:22:23,531][26022] Updated weights on worker 0-0, policy_version 505473 (0.00097) [2022-07-10 01:22:25,410][26022] Updated weights on worker 0-0, policy_version 505483 (0.00089) [2022-07-10 01:22:25,608][25689] Fps is (10 sec: 5957.7, 60 sec: 5696.0, 300 sec: 5673.9). Total num frames: 517615616. Throughput: 0: 5919.7. Samples: 517620970. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:25,609][25689] Avg episode reward: [(0, '-38.724')] [2022-07-10 01:22:27,416][26022] Updated weights on worker 0-0, policy_version 505493 (0.00087) [2022-07-10 01:22:29,037][26022] Updated weights on worker 0-0, policy_version 505503 (0.00081) [2022-07-10 01:22:30,760][25689] Fps is (10 sec: 5519.8, 60 sec: 5625.9, 300 sec: 5667.9). Total num frames: 517643264. Throughput: 0: 5038.1. Samples: 517637748. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:30,761][25689] Avg episode reward: [(0, '-38.706')] [2022-07-10 01:22:31,035][26022] Updated weights on worker 0-0, policy_version 505513 (0.00088) [2022-07-10 01:22:32,808][26022] Updated weights on worker 0-0, policy_version 505523 (0.00086) [2022-07-10 01:22:34,651][26022] Updated weights on worker 0-0, policy_version 505533 (0.00090) [2022-07-10 01:22:35,842][25689] Fps is (10 sec: 5503.2, 60 sec: 5652.4, 300 sec: 5666.8). Total num frames: 517671936. Throughput: 0: 5849.1. Samples: 517671604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 01:22:35,842][25689] Avg episode reward: [(0, '-37.220')] [2022-07-10 01:22:36,396][26022] Updated weights on worker 0-0, policy_version 505543 (0.00103) [2022-07-10 01:22:38,191][26022] Updated weights on worker 0-0, policy_version 505553 (0.00089) [2022-07-10 01:22:39,951][26022] Updated weights on worker 0-0, policy_version 505563 (0.00085) [2022-07-10 01:22:40,888][25689] Fps is (10 sec: 5763.1, 60 sec: 5656.5, 300 sec: 5666.0). Total num frames: 517701632. Throughput: 0: 5859.9. Samples: 517705994. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:22:40,889][25689] Avg episode reward: [(0, '-37.171')] [2022-07-10 01:22:41,852][26022] Updated weights on worker 0-0, policy_version 505573 (0.00086) [2022-07-10 01:22:43,515][26022] Updated weights on worker 0-0, policy_version 505583 (0.00099) [2022-07-10 01:22:45,468][26022] Updated weights on worker 0-0, policy_version 505593 (0.00082) [2022-07-10 01:22:45,895][25689] Fps is (10 sec: 5704.1, 60 sec: 5643.5, 300 sec: 5667.0). Total num frames: 517729280. Throughput: 0: 5033.9. Samples: 517723094. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:22:45,896][25689] Avg episode reward: [(0, '-36.767')] [2022-07-10 01:22:47,036][26022] Updated weights on worker 0-0, policy_version 505603 (0.00083) [2022-07-10 01:22:49,002][26022] Updated weights on worker 0-0, policy_version 505613 (0.00086) [2022-07-10 01:22:50,768][26022] Updated weights on worker 0-0, policy_version 505623 (0.00090) [2022-07-10 01:22:50,939][25689] Fps is (10 sec: 5705.7, 60 sec: 5651.5, 300 sec: 5670.5). Total num frames: 517758976. Throughput: 0: 5938.9. Samples: 517757584. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:22:50,939][25689] Avg episode reward: [(0, '-37.308')] [2022-07-10 01:22:52,589][26022] Updated weights on worker 0-0, policy_version 505633 (0.00081) [2022-07-10 01:22:54,075][26022] Updated weights on worker 0-0, policy_version 505643 (0.00092) [2022-07-10 01:22:55,991][25689] Fps is (10 sec: 5679.9, 60 sec: 5632.5, 300 sec: 5662.8). Total num frames: 517786624. Throughput: 0: 5974.9. Samples: 517791992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:22:55,991][25689] Avg episode reward: [(0, '-35.567')] [2022-07-10 01:22:56,271][26022] Updated weights on worker 0-0, policy_version 505653 (0.00085) [2022-07-10 01:22:57,857][26022] Updated weights on worker 0-0, policy_version 505663 (0.00092) [2022-07-10 01:22:59,795][26022] Updated weights on worker 0-0, policy_version 505673 (0.00086) [2022-07-10 01:23:01,018][25689] Fps is (10 sec: 5689.3, 60 sec: 5653.5, 300 sec: 5680.1). Total num frames: 517816320. Throughput: 0: 5116.7. Samples: 517808992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:01,020][25689] Avg episode reward: [(0, '-35.582')] [2022-07-10 01:23:01,578][26022] Updated weights on worker 0-0, policy_version 505683 (0.00088) [2022-07-10 01:23:03,547][26022] Updated weights on worker 0-0, policy_version 505693 (0.00094) [2022-07-10 01:23:05,633][26022] Updated weights on worker 0-0, policy_version 505703 (0.00093) [2022-07-10 01:23:06,033][25689] Fps is (10 sec: 5608.4, 60 sec: 5656.3, 300 sec: 5667.2). Total num frames: 517842944. Throughput: 0: 5870.8. Samples: 517841320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:06,035][25689] Avg episode reward: [(0, '-34.908')] [2022-07-10 01:23:07,191][26022] Updated weights on worker 0-0, policy_version 505713 (0.00089) [2022-07-10 01:23:08,971][26022] Updated weights on worker 0-0, policy_version 505723 (0.00082) [2022-07-10 01:23:10,993][26022] Updated weights on worker 0-0, policy_version 505733 (0.00081) [2022-07-10 01:23:11,160][25689] Fps is (10 sec: 5452.1, 60 sec: 5634.0, 300 sec: 5671.8). Total num frames: 517871616. Throughput: 0: 5847.8. Samples: 517875836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:11,162][25689] Avg episode reward: [(0, '-34.636')] [2022-07-10 01:23:12,505][26022] Updated weights on worker 0-0, policy_version 505743 (0.00088) [2022-07-10 01:23:14,388][26022] Updated weights on worker 0-0, policy_version 505753 (0.00086) [2022-07-10 01:23:16,092][26022] Updated weights on worker 0-0, policy_version 505763 (0.00091) [2022-07-10 01:23:16,229][25689] Fps is (10 sec: 5825.0, 60 sec: 5713.1, 300 sec: 5670.6). Total num frames: 517902336. Throughput: 0: 5865.4. Samples: 517910696. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:16,230][25689] Avg episode reward: [(0, '-35.124')] [2022-07-10 01:23:17,893][26022] Updated weights on worker 0-0, policy_version 505773 (0.00086) [2022-07-10 01:23:19,827][26022] Updated weights on worker 0-0, policy_version 505783 (0.00093) [2022-07-10 01:23:21,244][25689] Fps is (10 sec: 5788.3, 60 sec: 5652.6, 300 sec: 5670.9). Total num frames: 517929984. Throughput: 0: 5882.1. Samples: 517927964. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:21,244][25689] Avg episode reward: [(0, '-35.243')] [2022-07-10 01:23:21,557][26022] Updated weights on worker 0-0, policy_version 505793 (0.00089) [2022-07-10 01:23:23,067][26022] Updated weights on worker 0-0, policy_version 505803 (0.00086) [2022-07-10 01:23:25,002][26022] Updated weights on worker 0-0, policy_version 505813 (0.00087) [2022-07-10 01:23:26,263][25689] Fps is (10 sec: 5715.1, 60 sec: 5672.5, 300 sec: 5678.3). Total num frames: 517959680. Throughput: 0: 5984.7. Samples: 517962390. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:26,265][25689] Avg episode reward: [(0, '-35.027')] [2022-07-10 01:23:26,875][26022] Updated weights on worker 0-0, policy_version 505823 (0.00084) [2022-07-10 01:23:28,648][26022] Updated weights on worker 0-0, policy_version 505833 (0.00088) [2022-07-10 01:23:30,354][26022] Updated weights on worker 0-0, policy_version 505843 (0.00089) [2022-07-10 01:23:31,355][25689] Fps is (10 sec: 5772.9, 60 sec: 5695.1, 300 sec: 5673.4). Total num frames: 517988352. Throughput: 0: 5971.9. Samples: 517996436. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:31,355][25689] Avg episode reward: [(0, '-34.201')] [2022-07-10 01:23:32,203][26022] Updated weights on worker 0-0, policy_version 505853 (0.00083) [2022-07-10 01:23:33,912][26022] Updated weights on worker 0-0, policy_version 505863 (0.00086) [2022-07-10 01:23:35,805][26022] Updated weights on worker 0-0, policy_version 505873 (0.00087) [2022-07-10 01:23:36,389][25689] Fps is (10 sec: 5562.0, 60 sec: 5682.6, 300 sec: 5669.4). Total num frames: 518016000. Throughput: 0: 5109.2. Samples: 518013696. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:36,390][25689] Avg episode reward: [(0, '-33.756')] [2022-07-10 01:23:37,438][26022] Updated weights on worker 0-0, policy_version 505883 (0.00087) [2022-07-10 01:23:39,430][26022] Updated weights on worker 0-0, policy_version 505893 (0.00089) [2022-07-10 01:23:40,952][26022] Updated weights on worker 0-0, policy_version 505903 (0.00089) [2022-07-10 01:23:41,431][25689] Fps is (10 sec: 5792.6, 60 sec: 5699.9, 300 sec: 5675.6). Total num frames: 518046720. Throughput: 0: 5976.1. Samples: 518048604. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:41,433][25689] Avg episode reward: [(0, '-33.594')] [2022-07-10 01:23:42,872][26022] Updated weights on worker 0-0, policy_version 505913 (0.00090) [2022-07-10 01:23:44,705][26022] Updated weights on worker 0-0, policy_version 505923 (0.00087) [2022-07-10 01:23:46,431][26022] Updated weights on worker 0-0, policy_version 505933 (0.00085) [2022-07-10 01:23:46,495][25689] Fps is (10 sec: 5876.5, 60 sec: 5711.4, 300 sec: 5676.1). Total num frames: 518075392. Throughput: 0: 5969.1. Samples: 518083160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:46,496][25689] Avg episode reward: [(0, '-32.882')] [2022-07-10 01:23:47,973][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:23:47,982][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000505941_518083584.pth [2022-07-10 01:23:47,987][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000503945_516039680.pth [2022-07-10 01:23:48,395][26022] Updated weights on worker 0-0, policy_version 505943 (0.00084) [2022-07-10 01:23:50,073][26022] Updated weights on worker 0-0, policy_version 505953 (0.00049) [2022-07-10 01:23:51,617][25689] Fps is (10 sec: 5629.5, 60 sec: 5687.1, 300 sec: 5670.7). Total num frames: 518104064. Throughput: 0: 5130.0. Samples: 518100382. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:51,618][25689] Avg episode reward: [(0, '-32.937')] [2022-07-10 01:23:51,834][26022] Updated weights on worker 0-0, policy_version 505963 (0.00081) [2022-07-10 01:23:53,686][26022] Updated weights on worker 0-0, policy_version 505973 (0.00085) [2022-07-10 01:23:55,307][26022] Updated weights on worker 0-0, policy_version 505983 (0.00085) [2022-07-10 01:23:56,634][25689] Fps is (10 sec: 5555.1, 60 sec: 5690.5, 300 sec: 5671.3). Total num frames: 518131712. Throughput: 0: 5982.1. Samples: 518134804. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:23:56,636][25689] Avg episode reward: [(0, '-34.025')] [2022-07-10 01:23:57,266][26022] Updated weights on worker 0-0, policy_version 505993 (0.00083) [2022-07-10 01:23:58,974][26022] Updated weights on worker 0-0, policy_version 506003 (0.00093) [2022-07-10 01:24:00,791][26022] Updated weights on worker 0-0, policy_version 506013 (0.00087) [2022-07-10 01:24:01,670][25689] Fps is (10 sec: 5704.5, 60 sec: 5689.7, 300 sec: 5681.0). Total num frames: 518161408. Throughput: 0: 5934.2. Samples: 518168704. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:01,671][25689] Avg episode reward: [(0, '-34.402')] [2022-07-10 01:24:03,055][26022] Updated weights on worker 0-0, policy_version 506023 (0.00086) [2022-07-10 01:24:04,679][26022] Updated weights on worker 0-0, policy_version 506033 (0.00087) [2022-07-10 01:24:06,682][26022] Updated weights on worker 0-0, policy_version 506043 (0.00052) [2022-07-10 01:24:06,739][25689] Fps is (10 sec: 5573.4, 60 sec: 5684.6, 300 sec: 5674.9). Total num frames: 518188032. Throughput: 0: 4964.3. Samples: 518183654. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:06,739][25689] Avg episode reward: [(0, '-34.762')] [2022-07-10 01:24:08,151][26022] Updated weights on worker 0-0, policy_version 506053 (0.00083) [2022-07-10 01:24:10,147][26022] Updated weights on worker 0-0, policy_version 506063 (0.00092) [2022-07-10 01:24:11,806][25689] Fps is (10 sec: 5556.4, 60 sec: 5707.1, 300 sec: 5674.4). Total num frames: 518217728. Throughput: 0: 5836.4. Samples: 518218210. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:11,806][25689] Avg episode reward: [(0, '-35.837')] [2022-07-10 01:24:11,930][26022] Updated weights on worker 0-0, policy_version 506073 (0.00403) [2022-07-10 01:24:13,625][26022] Updated weights on worker 0-0, policy_version 506083 (0.00086) [2022-07-10 01:24:15,690][26022] Updated weights on worker 0-0, policy_version 506093 (0.00093) [2022-07-10 01:24:16,810][25689] Fps is (10 sec: 5693.9, 60 sec: 5662.5, 300 sec: 5671.5). Total num frames: 518245376. Throughput: 0: 5843.8. Samples: 518252710. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:16,811][25689] Avg episode reward: [(0, '-35.146')] [2022-07-10 01:24:17,084][26022] Updated weights on worker 0-0, policy_version 506103 (0.00091) [2022-07-10 01:24:19,037][26022] Updated weights on worker 0-0, policy_version 506113 (0.00103) [2022-07-10 01:24:20,795][26022] Updated weights on worker 0-0, policy_version 506123 (0.00088) [2022-07-10 01:24:21,831][25689] Fps is (10 sec: 5720.1, 60 sec: 5695.8, 300 sec: 5675.6). Total num frames: 518275072. Throughput: 0: 5026.5. Samples: 518270044. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:21,831][25689] Avg episode reward: [(0, '-35.131')] [2022-07-10 01:24:22,509][26022] Updated weights on worker 0-0, policy_version 506133 (0.00086) [2022-07-10 01:24:24,431][26022] Updated weights on worker 0-0, policy_version 506143 (0.00099) [2022-07-10 01:24:26,081][26022] Updated weights on worker 0-0, policy_version 506153 (0.00093) [2022-07-10 01:24:26,834][25689] Fps is (10 sec: 5720.5, 60 sec: 5663.4, 300 sec: 5673.9). Total num frames: 518302720. Throughput: 0: 6024.0. Samples: 518304710. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:26,835][25689] Avg episode reward: [(0, '-35.193')] [2022-07-10 01:24:28,026][26022] Updated weights on worker 0-0, policy_version 506163 (0.00089) [2022-07-10 01:24:29,921][26022] Updated weights on worker 0-0, policy_version 506173 (0.00089) [2022-07-10 01:24:31,626][26022] Updated weights on worker 0-0, policy_version 506183 (0.00085) [2022-07-10 01:24:31,912][25689] Fps is (10 sec: 5688.2, 60 sec: 5681.6, 300 sec: 5676.4). Total num frames: 518332416. Throughput: 0: 5992.2. Samples: 518338690. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:31,912][25689] Avg episode reward: [(0, '-35.414')] [2022-07-10 01:24:33,475][26022] Updated weights on worker 0-0, policy_version 506193 (0.00083) [2022-07-10 01:24:35,194][26022] Updated weights on worker 0-0, policy_version 506203 (0.00100) [2022-07-10 01:24:36,942][25689] Fps is (10 sec: 5876.1, 60 sec: 5715.8, 300 sec: 5676.3). Total num frames: 518362112. Throughput: 0: 5125.8. Samples: 518355900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:36,942][25689] Avg episode reward: [(0, '-35.647')] [2022-07-10 01:24:36,944][26022] Updated weights on worker 0-0, policy_version 506213 (0.00086) [2022-07-10 01:24:38,757][26022] Updated weights on worker 0-0, policy_version 506223 (0.00081) [2022-07-10 01:24:40,448][26022] Updated weights on worker 0-0, policy_version 506233 (0.00086) [2022-07-10 01:24:41,988][25689] Fps is (10 sec: 5690.9, 60 sec: 5664.7, 300 sec: 5675.8). Total num frames: 518389760. Throughput: 0: 5967.8. Samples: 518390340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:41,989][25689] Avg episode reward: [(0, '-35.941')] [2022-07-10 01:24:42,430][26022] Updated weights on worker 0-0, policy_version 506243 (0.00088) [2022-07-10 01:24:44,083][26022] Updated weights on worker 0-0, policy_version 506253 (0.00086) [2022-07-10 01:24:45,952][26022] Updated weights on worker 0-0, policy_version 506263 (0.00083) [2022-07-10 01:24:47,035][25689] Fps is (10 sec: 5580.1, 60 sec: 5666.4, 300 sec: 5676.3). Total num frames: 518418432. Throughput: 0: 5938.0. Samples: 518424660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:47,036][25689] Avg episode reward: [(0, '-35.268')] [2022-07-10 01:24:47,561][26022] Updated weights on worker 0-0, policy_version 506273 (0.00086) [2022-07-10 01:24:49,559][26022] Updated weights on worker 0-0, policy_version 506283 (0.00084) [2022-07-10 01:24:51,202][26022] Updated weights on worker 0-0, policy_version 506293 (0.00085) [2022-07-10 01:24:52,142][25689] Fps is (10 sec: 5748.5, 60 sec: 5684.7, 300 sec: 5678.3). Total num frames: 518448128. Throughput: 0: 5110.6. Samples: 518442078. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:52,142][25689] Avg episode reward: [(0, '-34.705')] [2022-07-10 01:24:52,900][26022] Updated weights on worker 0-0, policy_version 506303 (0.00076) [2022-07-10 01:24:54,552][26022] Updated weights on worker 0-0, policy_version 506313 (0.00093) [2022-07-10 01:24:56,472][26022] Updated weights on worker 0-0, policy_version 506323 (0.00087) [2022-07-10 01:24:57,150][25689] Fps is (10 sec: 5770.2, 60 sec: 5702.4, 300 sec: 5676.5). Total num frames: 518476800. Throughput: 0: 6002.5. Samples: 518477202. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:24:57,151][25689] Avg episode reward: [(0, '-34.961')] [2022-07-10 01:24:58,366][26022] Updated weights on worker 0-0, policy_version 506333 (0.00087) [2022-07-10 01:25:00,161][26022] Updated weights on worker 0-0, policy_version 506343 (0.00084) [2022-07-10 01:25:02,167][25689] Fps is (10 sec: 5618.0, 60 sec: 5670.3, 300 sec: 5684.3). Total num frames: 518504448. Throughput: 0: 5988.8. Samples: 518511186. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:02,167][25689] Avg episode reward: [(0, '-35.700')] [2022-07-10 01:25:02,221][26022] Updated weights on worker 0-0, policy_version 506353 (0.00090) [2022-07-10 01:25:04,111][26022] Updated weights on worker 0-0, policy_version 506363 (0.00100) [2022-07-10 01:25:05,763][26022] Updated weights on worker 0-0, policy_version 506373 (0.00088) [2022-07-10 01:25:07,230][25689] Fps is (10 sec: 5688.9, 60 sec: 5721.7, 300 sec: 5683.9). Total num frames: 518534144. Throughput: 0: 5056.1. Samples: 518526770. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:07,231][25689] Avg episode reward: [(0, '-35.155')] [2022-07-10 01:25:07,439][26022] Updated weights on worker 0-0, policy_version 506383 (0.00081) [2022-07-10 01:25:09,421][26022] Updated weights on worker 0-0, policy_version 506393 (0.00085) [2022-07-10 01:25:11,239][26022] Updated weights on worker 0-0, policy_version 506403 (0.00088) [2022-07-10 01:25:12,339][25689] Fps is (10 sec: 5737.9, 60 sec: 5700.8, 300 sec: 5682.3). Total num frames: 518562816. Throughput: 0: 5904.4. Samples: 518561332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:12,340][25689] Avg episode reward: [(0, '-35.004')] [2022-07-10 01:25:12,900][26022] Updated weights on worker 0-0, policy_version 506413 (0.00100) [2022-07-10 01:25:14,968][26022] Updated weights on worker 0-0, policy_version 506423 (0.00090) [2022-07-10 01:25:16,336][26022] Updated weights on worker 0-0, policy_version 506433 (0.00483) [2022-07-10 01:25:17,373][25689] Fps is (10 sec: 5653.9, 60 sec: 5715.0, 300 sec: 5682.3). Total num frames: 518591488. Throughput: 0: 5858.5. Samples: 518595674. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:17,373][25689] Avg episode reward: [(0, '-36.368')] [2022-07-10 01:25:18,440][26022] Updated weights on worker 0-0, policy_version 506443 (0.00083) [2022-07-10 01:25:20,191][26022] Updated weights on worker 0-0, policy_version 506453 (0.00082) [2022-07-10 01:25:21,834][26022] Updated weights on worker 0-0, policy_version 506463 (0.00497) [2022-07-10 01:25:22,393][25689] Fps is (10 sec: 5805.5, 60 sec: 5715.0, 300 sec: 5682.3). Total num frames: 518621184. Throughput: 0: 5876.7. Samples: 518630050. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:22,394][25689] Avg episode reward: [(0, '-36.489')] [2022-07-10 01:25:23,838][26022] Updated weights on worker 0-0, policy_version 506473 (0.00087) [2022-07-10 01:25:25,421][26022] Updated weights on worker 0-0, policy_version 506483 (0.00778) [2022-07-10 01:25:27,238][26022] Updated weights on worker 0-0, policy_version 506493 (0.00089) [2022-07-10 01:25:27,441][25689] Fps is (10 sec: 5797.0, 60 sec: 5727.7, 300 sec: 5685.7). Total num frames: 518649856. Throughput: 0: 5962.4. Samples: 518647278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:27,442][25689] Avg episode reward: [(0, '-37.396')] [2022-07-10 01:25:29,084][26022] Updated weights on worker 0-0, policy_version 506503 (0.00087) [2022-07-10 01:25:30,900][26022] Updated weights on worker 0-0, policy_version 506513 (0.00082) [2022-07-10 01:25:32,540][25689] Fps is (10 sec: 5651.3, 60 sec: 5708.8, 300 sec: 5685.0). Total num frames: 518678528. Throughput: 0: 5947.6. Samples: 518681478. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 01:25:32,541][25689] Avg episode reward: [(0, '-37.704')] [2022-07-10 01:25:32,650][26022] Updated weights on worker 0-0, policy_version 506523 (0.00089) [2022-07-10 01:25:34,247][26022] Updated weights on worker 0-0, policy_version 506533 (0.00092) [2022-07-10 01:25:36,242][26022] Updated weights on worker 0-0, policy_version 506543 (0.00087) [2022-07-10 01:25:37,541][25689] Fps is (10 sec: 5779.0, 60 sec: 5711.5, 300 sec: 5688.6). Total num frames: 518708224. Throughput: 0: 5974.5. Samples: 518716172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:25:37,542][25689] Avg episode reward: [(0, '-38.740')] [2022-07-10 01:25:37,846][26022] Updated weights on worker 0-0, policy_version 506553 (0.00092) [2022-07-10 01:25:39,899][26022] Updated weights on worker 0-0, policy_version 506563 (0.00456) [2022-07-10 01:25:41,410][26022] Updated weights on worker 0-0, policy_version 506573 (0.00085) [2022-07-10 01:25:42,593][25689] Fps is (10 sec: 5704.2, 60 sec: 5711.0, 300 sec: 5684.7). Total num frames: 518735872. Throughput: 0: 5127.1. Samples: 518733618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:25:42,594][25689] Avg episode reward: [(0, '-38.695')] [2022-07-10 01:25:43,384][26022] Updated weights on worker 0-0, policy_version 506583 (0.00185) [2022-07-10 01:25:45,007][26022] Updated weights on worker 0-0, policy_version 506593 (0.00092) [2022-07-10 01:25:46,909][26022] Updated weights on worker 0-0, policy_version 506603 (0.00094) [2022-07-10 01:25:47,619][25689] Fps is (10 sec: 5690.4, 60 sec: 5729.8, 300 sec: 5686.7). Total num frames: 518765568. Throughput: 0: 5999.6. Samples: 518768334. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:25:47,619][25689] Avg episode reward: [(0, '-37.489')] [2022-07-10 01:25:47,992][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:25:48,002][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000506609_518767616.pth [2022-07-10 01:25:48,002][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000504611_516721664.pth [2022-07-10 01:25:48,635][26022] Updated weights on worker 0-0, policy_version 506613 (0.00099) [2022-07-10 01:25:50,495][26022] Updated weights on worker 0-0, policy_version 506623 (0.00095) [2022-07-10 01:25:52,215][26022] Updated weights on worker 0-0, policy_version 506633 (0.00087) [2022-07-10 01:25:52,691][25689] Fps is (10 sec: 5881.4, 60 sec: 5733.1, 300 sec: 5692.6). Total num frames: 518795264. Throughput: 0: 5991.1. Samples: 518802206. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:25:52,692][25689] Avg episode reward: [(0, '-38.158')] [2022-07-10 01:25:54,069][26022] Updated weights on worker 0-0, policy_version 506643 (0.00089) [2022-07-10 01:25:55,766][26022] Updated weights on worker 0-0, policy_version 506653 (0.00087) [2022-07-10 01:25:57,647][26022] Updated weights on worker 0-0, policy_version 506663 (0.00095) [2022-07-10 01:25:57,703][25689] Fps is (10 sec: 5686.6, 60 sec: 5715.9, 300 sec: 5683.0). Total num frames: 518822912. Throughput: 0: 5125.9. Samples: 518819518. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:25:57,703][25689] Avg episode reward: [(0, '-38.646')] [2022-07-10 01:25:59,374][26022] Updated weights on worker 0-0, policy_version 506673 (0.00091) [2022-07-10 01:26:01,285][26022] Updated weights on worker 0-0, policy_version 506683 (0.00083) [2022-07-10 01:26:02,705][25689] Fps is (10 sec: 5419.8, 60 sec: 5700.3, 300 sec: 5686.7). Total num frames: 518849536. Throughput: 0: 5977.1. Samples: 518853830. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:02,706][25689] Avg episode reward: [(0, '-37.511')] [2022-07-10 01:26:03,164][26022] Updated weights on worker 0-0, policy_version 506693 (0.00081) [2022-07-10 01:26:05,105][26022] Updated weights on worker 0-0, policy_version 506703 (0.00102) [2022-07-10 01:26:06,890][26022] Updated weights on worker 0-0, policy_version 506713 (0.00096) [2022-07-10 01:26:07,736][25689] Fps is (10 sec: 5511.0, 60 sec: 5686.4, 300 sec: 5687.6). Total num frames: 518878208. Throughput: 0: 5860.9. Samples: 518886242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:07,738][25689] Avg episode reward: [(0, '-36.103')] [2022-07-10 01:26:08,791][26022] Updated weights on worker 0-0, policy_version 506723 (0.00099) [2022-07-10 01:26:10,487][26022] Updated weights on worker 0-0, policy_version 506733 (0.00093) [2022-07-10 01:26:12,435][26022] Updated weights on worker 0-0, policy_version 506743 (0.00089) [2022-07-10 01:26:12,827][25689] Fps is (10 sec: 5665.2, 60 sec: 5688.1, 300 sec: 5679.7). Total num frames: 518906880. Throughput: 0: 5030.0. Samples: 518903490. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:12,828][25689] Avg episode reward: [(0, '-35.914')] [2022-07-10 01:26:14,015][26022] Updated weights on worker 0-0, policy_version 506753 (0.00090) [2022-07-10 01:26:16,010][26022] Updated weights on worker 0-0, policy_version 506763 (0.00101) [2022-07-10 01:26:17,727][26022] Updated weights on worker 0-0, policy_version 506773 (0.00092) [2022-07-10 01:26:17,847][25689] Fps is (10 sec: 5773.2, 60 sec: 5706.4, 300 sec: 5690.0). Total num frames: 518936576. Throughput: 0: 5868.6. Samples: 518937734. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:17,847][25689] Avg episode reward: [(0, '-35.961')] [2022-07-10 01:26:19,536][26022] Updated weights on worker 0-0, policy_version 506783 (0.00086) [2022-07-10 01:26:21,344][26022] Updated weights on worker 0-0, policy_version 506793 (0.00618) [2022-07-10 01:26:22,866][25689] Fps is (10 sec: 5814.1, 60 sec: 5689.5, 300 sec: 5690.0). Total num frames: 518965248. Throughput: 0: 5876.4. Samples: 518972306. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:22,868][25689] Avg episode reward: [(0, '-35.402')] [2022-07-10 01:26:23,126][26022] Updated weights on worker 0-0, policy_version 506803 (0.00087) [2022-07-10 01:26:24,898][26022] Updated weights on worker 0-0, policy_version 506813 (0.00078) [2022-07-10 01:26:26,730][26022] Updated weights on worker 0-0, policy_version 506823 (0.00098) [2022-07-10 01:26:27,879][25689] Fps is (10 sec: 5613.6, 60 sec: 5675.9, 300 sec: 5678.3). Total num frames: 518992896. Throughput: 0: 5127.0. Samples: 518989516. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:27,880][25689] Avg episode reward: [(0, '-34.346')] [2022-07-10 01:26:28,486][26022] Updated weights on worker 0-0, policy_version 506833 (0.00096) [2022-07-10 01:26:30,373][26022] Updated weights on worker 0-0, policy_version 506843 (0.00091) [2022-07-10 01:26:32,168][26022] Updated weights on worker 0-0, policy_version 506853 (0.00089) [2022-07-10 01:26:32,980][25689] Fps is (10 sec: 5670.2, 60 sec: 5692.7, 300 sec: 5686.8). Total num frames: 519022592. Throughput: 0: 5949.7. Samples: 519023390. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:32,980][25689] Avg episode reward: [(0, '-34.622')] [2022-07-10 01:26:33,956][26022] Updated weights on worker 0-0, policy_version 506863 (0.00080) [2022-07-10 01:26:35,828][26022] Updated weights on worker 0-0, policy_version 506873 (0.00088) [2022-07-10 01:26:37,510][26022] Updated weights on worker 0-0, policy_version 506883 (0.00086) [2022-07-10 01:26:38,020][25689] Fps is (10 sec: 5857.1, 60 sec: 5689.0, 300 sec: 5687.8). Total num frames: 519052288. Throughput: 0: 5945.5. Samples: 519057672. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:38,020][25689] Avg episode reward: [(0, '-35.769')] [2022-07-10 01:26:39,353][26022] Updated weights on worker 0-0, policy_version 506893 (0.00084) [2022-07-10 01:26:41,082][26022] Updated weights on worker 0-0, policy_version 506903 (0.00089) [2022-07-10 01:26:42,951][26022] Updated weights on worker 0-0, policy_version 506913 (0.00084) [2022-07-10 01:26:43,036][25689] Fps is (10 sec: 5600.3, 60 sec: 5675.4, 300 sec: 5681.5). Total num frames: 519078912. Throughput: 0: 5072.9. Samples: 519074626. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:43,037][25689] Avg episode reward: [(0, '-36.972')] [2022-07-10 01:26:44,690][26022] Updated weights on worker 0-0, policy_version 506923 (0.00085) [2022-07-10 01:26:46,383][26022] Updated weights on worker 0-0, policy_version 506933 (0.00088) [2022-07-10 01:26:48,057][25689] Fps is (10 sec: 5610.9, 60 sec: 5675.8, 300 sec: 5683.6). Total num frames: 519108608. Throughput: 0: 5935.6. Samples: 519109284. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:48,058][25689] Avg episode reward: [(0, '-37.775')] [2022-07-10 01:26:48,107][26022] Updated weights on worker 0-0, policy_version 506943 (0.00096) [2022-07-10 01:26:50,114][26022] Updated weights on worker 0-0, policy_version 506953 (0.00092) [2022-07-10 01:26:51,664][26022] Updated weights on worker 0-0, policy_version 506963 (0.00090) [2022-07-10 01:26:53,131][25689] Fps is (10 sec: 5883.6, 60 sec: 5675.8, 300 sec: 5686.2). Total num frames: 519138304. Throughput: 0: 5979.7. Samples: 519143888. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:53,131][25689] Avg episode reward: [(0, '-39.436')] [2022-07-10 01:26:53,683][26022] Updated weights on worker 0-0, policy_version 506973 (0.00081) [2022-07-10 01:26:55,054][26022] Updated weights on worker 0-0, policy_version 506983 (0.00087) [2022-07-10 01:26:57,333][26022] Updated weights on worker 0-0, policy_version 506993 (0.00084) [2022-07-10 01:26:58,143][25689] Fps is (10 sec: 5685.9, 60 sec: 5675.7, 300 sec: 5683.9). Total num frames: 519165952. Throughput: 0: 5149.1. Samples: 519161288. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:26:58,143][25689] Avg episode reward: [(0, '-39.742')] [2022-07-10 01:26:58,631][26022] Updated weights on worker 0-0, policy_version 507003 (0.00100) [2022-07-10 01:27:01,028][26022] Updated weights on worker 0-0, policy_version 507013 (0.00085) [2022-07-10 01:27:02,771][26022] Updated weights on worker 0-0, policy_version 507023 (0.00089) [2022-07-10 01:27:03,159][25689] Fps is (10 sec: 5412.0, 60 sec: 5674.4, 300 sec: 5684.4). Total num frames: 519192576. Throughput: 0: 5982.4. Samples: 519195006. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:03,159][25689] Avg episode reward: [(0, '-39.520')] [2022-07-10 01:27:04,829][26022] Updated weights on worker 0-0, policy_version 507033 (0.00090) [2022-07-10 01:27:06,427][26022] Updated weights on worker 0-0, policy_version 507043 (0.00091) [2022-07-10 01:27:08,171][25689] Fps is (10 sec: 5513.9, 60 sec: 5676.2, 300 sec: 5682.1). Total num frames: 519221248. Throughput: 0: 5868.5. Samples: 519227322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:08,172][25689] Avg episode reward: [(0, '-40.236')] [2022-07-10 01:27:08,218][26022] Updated weights on worker 0-0, policy_version 507053 (0.00092) [2022-07-10 01:27:10,118][26022] Updated weights on worker 0-0, policy_version 507063 (0.00086) [2022-07-10 01:27:11,864][26022] Updated weights on worker 0-0, policy_version 507073 (0.00086) [2022-07-10 01:27:13,309][25689] Fps is (10 sec: 5750.4, 60 sec: 5688.7, 300 sec: 5693.4). Total num frames: 519250944. Throughput: 0: 4986.1. Samples: 519244500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:13,310][25689] Avg episode reward: [(0, '-39.687')] [2022-07-10 01:27:13,685][26022] Updated weights on worker 0-0, policy_version 507083 (0.00087) [2022-07-10 01:27:15,408][26022] Updated weights on worker 0-0, policy_version 507093 (0.00106) [2022-07-10 01:27:17,128][26022] Updated weights on worker 0-0, policy_version 507103 (0.00090) [2022-07-10 01:27:18,348][25689] Fps is (10 sec: 5835.9, 60 sec: 5686.9, 300 sec: 5687.5). Total num frames: 519280640. Throughput: 0: 5837.3. Samples: 519279234. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:18,349][25689] Avg episode reward: [(0, '-39.871')] [2022-07-10 01:27:19,113][26022] Updated weights on worker 0-0, policy_version 507113 (0.00082) [2022-07-10 01:27:20,690][26022] Updated weights on worker 0-0, policy_version 507123 (0.00091) [2022-07-10 01:27:22,645][26022] Updated weights on worker 0-0, policy_version 507133 (0.00092) [2022-07-10 01:27:23,365][25689] Fps is (10 sec: 5804.5, 60 sec: 5687.2, 300 sec: 5688.2). Total num frames: 519309312. Throughput: 0: 5882.6. Samples: 519313870. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:23,365][25689] Avg episode reward: [(0, '-40.240')] [2022-07-10 01:27:24,303][26022] Updated weights on worker 0-0, policy_version 507143 (0.00090) [2022-07-10 01:27:26,019][26022] Updated weights on worker 0-0, policy_version 507153 (0.00096) [2022-07-10 01:27:27,902][26022] Updated weights on worker 0-0, policy_version 507163 (0.00089) [2022-07-10 01:27:28,389][25689] Fps is (10 sec: 5608.8, 60 sec: 5686.1, 300 sec: 5690.6). Total num frames: 519336960. Throughput: 0: 5136.6. Samples: 519331174. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:28,390][25689] Avg episode reward: [(0, '-41.438')] [2022-07-10 01:27:29,564][26022] Updated weights on worker 0-0, policy_version 507173 (0.00091) [2022-07-10 01:27:31,477][26022] Updated weights on worker 0-0, policy_version 507183 (0.00087) [2022-07-10 01:27:33,518][25689] Fps is (10 sec: 5546.9, 60 sec: 5666.5, 300 sec: 5689.7). Total num frames: 519365632. Throughput: 0: 5960.6. Samples: 519364958. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:33,519][25689] Avg episode reward: [(0, '-41.134')] [2022-07-10 01:27:33,519][26022] Updated weights on worker 0-0, policy_version 507193 (0.00082) [2022-07-10 01:27:35,080][26022] Updated weights on worker 0-0, policy_version 507203 (0.00077) [2022-07-10 01:27:36,969][26022] Updated weights on worker 0-0, policy_version 507213 (0.00093) [2022-07-10 01:27:38,561][25689] Fps is (10 sec: 5738.1, 60 sec: 5666.2, 300 sec: 5689.8). Total num frames: 519395328. Throughput: 0: 5942.0. Samples: 519399342. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:38,562][25689] Avg episode reward: [(0, '-39.747')] [2022-07-10 01:27:38,690][26022] Updated weights on worker 0-0, policy_version 507223 (0.00058) [2022-07-10 01:27:40,612][26022] Updated weights on worker 0-0, policy_version 507233 (0.00087) [2022-07-10 01:27:42,517][26022] Updated weights on worker 0-0, policy_version 507243 (0.00090) [2022-07-10 01:27:43,640][25689] Fps is (10 sec: 5766.4, 60 sec: 5694.2, 300 sec: 5691.8). Total num frames: 519424000. Throughput: 0: 5065.5. Samples: 519416576. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:43,642][25689] Avg episode reward: [(0, '-39.100')] [2022-07-10 01:27:43,984][26022] Updated weights on worker 0-0, policy_version 507253 (0.00086) [2022-07-10 01:27:46,082][26022] Updated weights on worker 0-0, policy_version 507263 (0.00090) [2022-07-10 01:27:47,658][26022] Updated weights on worker 0-0, policy_version 507273 (0.00081) [2022-07-10 01:27:48,183][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:27:48,193][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000507275_519449600.pth [2022-07-10 01:27:48,194][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000505276_517402624.pth [2022-07-10 01:27:48,677][25689] Fps is (10 sec: 5668.7, 60 sec: 5675.7, 300 sec: 5688.5). Total num frames: 519452672. Throughput: 0: 5890.5. Samples: 519450680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:48,678][25689] Avg episode reward: [(0, '-40.003')] [2022-07-10 01:27:49,461][26022] Updated weights on worker 0-0, policy_version 507283 (0.00082) [2022-07-10 01:27:51,450][26022] Updated weights on worker 0-0, policy_version 507293 (0.00086) [2022-07-10 01:27:52,803][26022] Updated weights on worker 0-0, policy_version 507303 (0.00087) [2022-07-10 01:27:53,729][25689] Fps is (10 sec: 5684.1, 60 sec: 5660.9, 300 sec: 5692.0). Total num frames: 519481344. Throughput: 0: 5964.3. Samples: 519485500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:53,729][25689] Avg episode reward: [(0, '-38.431')] [2022-07-10 01:27:54,969][26022] Updated weights on worker 0-0, policy_version 507313 (0.00077) [2022-07-10 01:27:56,758][26022] Updated weights on worker 0-0, policy_version 507323 (0.00084) [2022-07-10 01:27:58,158][26022] Updated weights on worker 0-0, policy_version 507333 (0.00084) [2022-07-10 01:27:58,761][25689] Fps is (10 sec: 5788.5, 60 sec: 5692.8, 300 sec: 5691.9). Total num frames: 519511040. Throughput: 0: 5998.6. Samples: 519520510. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:27:58,761][25689] Avg episode reward: [(0, '-38.258')] [2022-07-10 01:28:00,262][26022] Updated weights on worker 0-0, policy_version 507343 (0.00093) [2022-07-10 01:28:02,115][26022] Updated weights on worker 0-0, policy_version 507353 (0.00090) [2022-07-10 01:28:03,768][25689] Fps is (10 sec: 5508.1, 60 sec: 5676.8, 300 sec: 5688.6). Total num frames: 519536640. Throughput: 0: 5977.9. Samples: 519536896. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:28:03,768][25689] Avg episode reward: [(0, '-39.531')] [2022-07-10 01:28:04,047][26022] Updated weights on worker 0-0, policy_version 507363 (0.00073) [2022-07-10 01:28:05,865][26022] Updated weights on worker 0-0, policy_version 507373 (0.00089) [2022-07-10 01:28:07,712][26022] Updated weights on worker 0-0, policy_version 507383 (0.00091) [2022-07-10 01:28:08,784][25689] Fps is (10 sec: 5414.9, 60 sec: 5676.5, 300 sec: 5690.7). Total num frames: 519565312. Throughput: 0: 5929.3. Samples: 519569896. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:28:08,784][25689] Avg episode reward: [(0, '-40.108')] [2022-07-10 01:28:09,392][26022] Updated weights on worker 0-0, policy_version 507393 (0.00088) [2022-07-10 01:28:11,228][26022] Updated weights on worker 0-0, policy_version 507403 (0.00085) [2022-07-10 01:28:12,883][26022] Updated weights on worker 0-0, policy_version 507413 (0.00081) [2022-07-10 01:28:13,820][25689] Fps is (10 sec: 5908.1, 60 sec: 5702.9, 300 sec: 5691.3). Total num frames: 519596032. Throughput: 0: 5915.8. Samples: 519604358. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:28:13,821][25689] Avg episode reward: [(0, '-40.057')] [2022-07-10 01:28:14,766][26022] Updated weights on worker 0-0, policy_version 507423 (0.00093) [2022-07-10 01:28:16,477][26022] Updated weights on worker 0-0, policy_version 507433 (0.00083) [2022-07-10 01:28:18,495][26022] Updated weights on worker 0-0, policy_version 507443 (0.00084) [2022-07-10 01:28:18,864][25689] Fps is (10 sec: 5688.7, 60 sec: 5651.7, 300 sec: 5687.4). Total num frames: 519622656. Throughput: 0: 5016.1. Samples: 519621350. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:28:18,865][25689] Avg episode reward: [(0, '-38.583')] [2022-07-10 01:28:20,195][26022] Updated weights on worker 0-0, policy_version 507453 (0.00077) [2022-07-10 01:28:21,952][26022] Updated weights on worker 0-0, policy_version 507463 (0.00096) [2022-07-10 01:28:23,643][26022] Updated weights on worker 0-0, policy_version 507473 (0.00094) [2022-07-10 01:28:23,870][25689] Fps is (10 sec: 5604.2, 60 sec: 5669.6, 300 sec: 5687.6). Total num frames: 519652352. Throughput: 0: 5918.5. Samples: 519655872. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 01:28:23,870][25689] Avg episode reward: [(0, '-38.455')] [2022-07-10 01:28:25,788][26022] Updated weights on worker 0-0, policy_version 507483 (0.00096) [2022-07-10 01:28:27,208][26022] Updated weights on worker 0-0, policy_version 507493 (0.00086) [2022-07-10 01:28:28,890][25689] Fps is (10 sec: 5719.7, 60 sec: 5670.0, 300 sec: 5685.5). Total num frames: 519680000. Throughput: 0: 5986.0. Samples: 519690252. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:28,890][25689] Avg episode reward: [(0, '-38.079')] [2022-07-10 01:28:29,221][26022] Updated weights on worker 0-0, policy_version 507503 (0.00088) [2022-07-10 01:28:30,887][26022] Updated weights on worker 0-0, policy_version 507513 (0.00087) [2022-07-10 01:28:32,687][26022] Updated weights on worker 0-0, policy_version 507523 (0.00086) [2022-07-10 01:28:33,962][25689] Fps is (10 sec: 5580.5, 60 sec: 5675.3, 300 sec: 5688.2). Total num frames: 519708672. Throughput: 0: 5107.9. Samples: 519707240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:33,963][25689] Avg episode reward: [(0, '-38.036')] [2022-07-10 01:28:34,544][26022] Updated weights on worker 0-0, policy_version 507533 (0.00086) [2022-07-10 01:28:36,284][26022] Updated weights on worker 0-0, policy_version 507543 (0.00093) [2022-07-10 01:28:38,035][26022] Updated weights on worker 0-0, policy_version 507553 (0.00089) [2022-07-10 01:28:38,966][25689] Fps is (10 sec: 5996.1, 60 sec: 5713.0, 300 sec: 5692.4). Total num frames: 519740416. Throughput: 0: 5977.6. Samples: 519741510. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:38,966][25689] Avg episode reward: [(0, '-37.937')] [2022-07-10 01:28:40,227][26022] Updated weights on worker 0-0, policy_version 507563 (0.00083) [2022-07-10 01:28:41,528][26022] Updated weights on worker 0-0, policy_version 507573 (0.00089) [2022-07-10 01:28:43,648][26022] Updated weights on worker 0-0, policy_version 507583 (0.00086) [2022-07-10 01:28:44,049][25689] Fps is (10 sec: 5786.7, 60 sec: 5678.7, 300 sec: 5685.2). Total num frames: 519767040. Throughput: 0: 5946.1. Samples: 519775858. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:44,049][25689] Avg episode reward: [(0, '-38.652')] [2022-07-10 01:28:45,268][26022] Updated weights on worker 0-0, policy_version 507593 (0.00085) [2022-07-10 01:28:46,888][26022] Updated weights on worker 0-0, policy_version 507603 (0.00082) [2022-07-10 01:28:48,988][26022] Updated weights on worker 0-0, policy_version 507613 (0.00086) [2022-07-10 01:28:49,083][25689] Fps is (10 sec: 5465.4, 60 sec: 5678.9, 300 sec: 5686.9). Total num frames: 519795712. Throughput: 0: 5094.5. Samples: 519793126. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:49,084][25689] Avg episode reward: [(0, '-40.198')] [2022-07-10 01:28:50,276][26022] Updated weights on worker 0-0, policy_version 507623 (0.00091) [2022-07-10 01:28:52,434][26022] Updated weights on worker 0-0, policy_version 507633 (0.00087) [2022-07-10 01:28:54,194][25689] Fps is (10 sec: 5753.1, 60 sec: 5690.2, 300 sec: 5691.9). Total num frames: 519825408. Throughput: 0: 5946.8. Samples: 519827556. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:54,195][25689] Avg episode reward: [(0, '-38.537')] [2022-07-10 01:28:54,466][26022] Updated weights on worker 0-0, policy_version 507643 (0.00091) [2022-07-10 01:28:55,866][26022] Updated weights on worker 0-0, policy_version 507653 (0.00085) [2022-07-10 01:28:58,091][26022] Updated weights on worker 0-0, policy_version 507663 (0.00088) [2022-07-10 01:28:59,239][25689] Fps is (10 sec: 5848.0, 60 sec: 5689.1, 300 sec: 5691.8). Total num frames: 519855104. Throughput: 0: 5942.4. Samples: 519861982. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:28:59,241][25689] Avg episode reward: [(0, '-37.369')] [2022-07-10 01:28:59,476][26022] Updated weights on worker 0-0, policy_version 507673 (0.00095) [2022-07-10 01:29:01,498][26022] Updated weights on worker 0-0, policy_version 507683 (0.00089) [2022-07-10 01:29:03,715][26022] Updated weights on worker 0-0, policy_version 507693 (0.00087) [2022-07-10 01:29:04,274][25689] Fps is (10 sec: 5485.9, 60 sec: 5686.4, 300 sec: 5689.0). Total num frames: 519880704. Throughput: 0: 5019.1. Samples: 519877370. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:04,274][25689] Avg episode reward: [(0, '-37.999')] [2022-07-10 01:29:05,278][26022] Updated weights on worker 0-0, policy_version 507703 (0.00104) [2022-07-10 01:29:07,232][26022] Updated weights on worker 0-0, policy_version 507713 (0.00080) [2022-07-10 01:29:08,897][26022] Updated weights on worker 0-0, policy_version 507723 (0.00091) [2022-07-10 01:29:09,277][25689] Fps is (10 sec: 5406.6, 60 sec: 5687.6, 300 sec: 5686.8). Total num frames: 519909376. Throughput: 0: 5856.3. Samples: 519911388. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:09,278][25689] Avg episode reward: [(0, '-38.485')] [2022-07-10 01:29:10,677][26022] Updated weights on worker 0-0, policy_version 507733 (0.00090) [2022-07-10 01:29:12,606][26022] Updated weights on worker 0-0, policy_version 507743 (0.00094) [2022-07-10 01:29:14,244][26022] Updated weights on worker 0-0, policy_version 507753 (0.00089) [2022-07-10 01:29:14,344][25689] Fps is (10 sec: 5897.6, 60 sec: 5684.7, 300 sec: 5695.9). Total num frames: 519940096. Throughput: 0: 5872.7. Samples: 519945892. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:14,345][25689] Avg episode reward: [(0, '-36.144')] [2022-07-10 01:29:16,067][26022] Updated weights on worker 0-0, policy_version 507763 (0.00088) [2022-07-10 01:29:17,776][26022] Updated weights on worker 0-0, policy_version 507773 (0.00086) [2022-07-10 01:29:19,429][25689] Fps is (10 sec: 5749.8, 60 sec: 5697.9, 300 sec: 5687.8). Total num frames: 519967744. Throughput: 0: 5008.4. Samples: 519963100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:19,429][25689] Avg episode reward: [(0, '-35.927')] [2022-07-10 01:29:19,769][26022] Updated weights on worker 0-0, policy_version 507783 (0.00094) [2022-07-10 01:29:21,167][26022] Updated weights on worker 0-0, policy_version 507793 (0.00086) [2022-07-10 01:29:23,415][26022] Updated weights on worker 0-0, policy_version 507803 (0.00092) [2022-07-10 01:29:24,477][25689] Fps is (10 sec: 5659.2, 60 sec: 5693.8, 300 sec: 5693.8). Total num frames: 519997440. Throughput: 0: 5945.9. Samples: 519997498. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:24,478][25689] Avg episode reward: [(0, '-36.744')] [2022-07-10 01:29:24,863][26022] Updated weights on worker 0-0, policy_version 507813 (0.00093) [2022-07-10 01:29:26,873][26022] Updated weights on worker 0-0, policy_version 507823 (0.00083) [2022-07-10 01:29:28,636][26022] Updated weights on worker 0-0, policy_version 507833 (0.00090) [2022-07-10 01:29:29,511][25689] Fps is (10 sec: 5586.1, 60 sec: 5675.7, 300 sec: 5684.3). Total num frames: 520024064. Throughput: 0: 5939.5. Samples: 520031566. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:29,511][25689] Avg episode reward: [(0, '-37.324')] [2022-07-10 01:29:30,530][26022] Updated weights on worker 0-0, policy_version 507843 (0.00055) [2022-07-10 01:29:32,435][26022] Updated weights on worker 0-0, policy_version 507853 (0.00051) [2022-07-10 01:29:34,188][26022] Updated weights on worker 0-0, policy_version 507863 (0.00088) [2022-07-10 01:29:34,582][25689] Fps is (10 sec: 5573.4, 60 sec: 5692.6, 300 sec: 5683.5). Total num frames: 520053760. Throughput: 0: 5055.8. Samples: 520048210. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:34,583][25689] Avg episode reward: [(0, '-37.345')] [2022-07-10 01:29:36,021][26022] Updated weights on worker 0-0, policy_version 507873 (0.00088) [2022-07-10 01:29:37,702][26022] Updated weights on worker 0-0, policy_version 507883 (0.00089) [2022-07-10 01:29:39,478][26022] Updated weights on worker 0-0, policy_version 507893 (0.00092) [2022-07-10 01:29:39,607][25689] Fps is (10 sec: 5882.5, 60 sec: 5656.9, 300 sec: 5690.8). Total num frames: 520083456. Throughput: 0: 5913.1. Samples: 520082418. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:39,608][25689] Avg episode reward: [(0, '-36.480')] [2022-07-10 01:29:41,418][26022] Updated weights on worker 0-0, policy_version 507903 (0.00092) [2022-07-10 01:29:42,880][26022] Updated weights on worker 0-0, policy_version 507913 (0.00091) [2022-07-10 01:29:44,618][25689] Fps is (10 sec: 5612.1, 60 sec: 5663.6, 300 sec: 5684.6). Total num frames: 520110080. Throughput: 0: 5919.9. Samples: 520116728. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:44,618][25689] Avg episode reward: [(0, '-35.774')] [2022-07-10 01:29:45,000][26022] Updated weights on worker 0-0, policy_version 507923 (0.00090) [2022-07-10 01:29:46,547][26022] Updated weights on worker 0-0, policy_version 507933 (0.00086) [2022-07-10 01:29:48,228][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:29:48,237][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000507941_520131584.pth [2022-07-10 01:29:48,238][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000505941_518083584.pth [2022-07-10 01:29:48,741][26022] Updated weights on worker 0-0, policy_version 507943 (0.00096) [2022-07-10 01:29:49,640][25689] Fps is (10 sec: 5715.5, 60 sec: 5698.6, 300 sec: 5689.7). Total num frames: 520140800. Throughput: 0: 5070.0. Samples: 520133622. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:49,641][25689] Avg episode reward: [(0, '-35.143')] [2022-07-10 01:29:50,264][26022] Updated weights on worker 0-0, policy_version 507953 (0.00097) [2022-07-10 01:29:52,108][26022] Updated weights on worker 0-0, policy_version 507963 (0.00091) [2022-07-10 01:29:54,033][26022] Updated weights on worker 0-0, policy_version 507973 (0.00089) [2022-07-10 01:29:54,767][25689] Fps is (10 sec: 5751.1, 60 sec: 5663.3, 300 sec: 5684.0). Total num frames: 520168448. Throughput: 0: 5932.4. Samples: 520167952. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:54,768][25689] Avg episode reward: [(0, '-34.915')] [2022-07-10 01:29:55,575][26022] Updated weights on worker 0-0, policy_version 507983 (0.00092) [2022-07-10 01:29:57,488][26022] Updated weights on worker 0-0, policy_version 507993 (0.00080) [2022-07-10 01:29:59,150][26022] Updated weights on worker 0-0, policy_version 508003 (0.00089) [2022-07-10 01:29:59,778][25689] Fps is (10 sec: 5656.5, 60 sec: 5666.4, 300 sec: 5691.0). Total num frames: 520198144. Throughput: 0: 5956.4. Samples: 520202564. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:29:59,778][25689] Avg episode reward: [(0, '-34.722')] [2022-07-10 01:30:01,157][26022] Updated weights on worker 0-0, policy_version 508013 (0.00084) [2022-07-10 01:30:03,199][26022] Updated weights on worker 0-0, policy_version 508023 (0.00075) [2022-07-10 01:30:04,790][25689] Fps is (10 sec: 5619.2, 60 sec: 5685.5, 300 sec: 5681.6). Total num frames: 520224768. Throughput: 0: 4989.3. Samples: 520217372. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:04,792][25689] Avg episode reward: [(0, '-34.613')] [2022-07-10 01:30:05,004][26022] Updated weights on worker 0-0, policy_version 508033 (0.00087) [2022-07-10 01:30:06,772][26022] Updated weights on worker 0-0, policy_version 508043 (0.00084) [2022-07-10 01:30:08,668][26022] Updated weights on worker 0-0, policy_version 508053 (0.00091) [2022-07-10 01:30:09,846][25689] Fps is (10 sec: 5492.0, 60 sec: 5680.5, 300 sec: 5682.6). Total num frames: 520253440. Throughput: 0: 5853.8. Samples: 520251904. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:09,847][25689] Avg episode reward: [(0, '-34.828')] [2022-07-10 01:30:10,282][26022] Updated weights on worker 0-0, policy_version 508063 (0.00382) [2022-07-10 01:30:12,279][26022] Updated weights on worker 0-0, policy_version 508073 (0.00084) [2022-07-10 01:30:13,813][26022] Updated weights on worker 0-0, policy_version 508083 (0.00087) [2022-07-10 01:30:14,915][25689] Fps is (10 sec: 5461.3, 60 sec: 5612.8, 300 sec: 5675.1). Total num frames: 520280064. Throughput: 0: 5872.7. Samples: 520286274. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:14,915][25689] Avg episode reward: [(0, '-35.296')] [2022-07-10 01:30:15,862][26022] Updated weights on worker 0-0, policy_version 508093 (0.00089) [2022-07-10 01:30:17,479][26022] Updated weights on worker 0-0, policy_version 508103 (0.00087) [2022-07-10 01:30:19,311][26022] Updated weights on worker 0-0, policy_version 508113 (0.00087) [2022-07-10 01:30:19,936][25689] Fps is (10 sec: 5784.9, 60 sec: 5686.3, 300 sec: 5681.9). Total num frames: 520311808. Throughput: 0: 4995.8. Samples: 520303270. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:19,937][25689] Avg episode reward: [(0, '-34.929')] [2022-07-10 01:30:21,311][26022] Updated weights on worker 0-0, policy_version 508123 (0.00092) [2022-07-10 01:30:22,739][26022] Updated weights on worker 0-0, policy_version 508133 (0.00051) [2022-07-10 01:30:24,799][26022] Updated weights on worker 0-0, policy_version 508143 (0.00078) [2022-07-10 01:30:24,938][25689] Fps is (10 sec: 5925.2, 60 sec: 5656.8, 300 sec: 5679.4). Total num frames: 520339456. Throughput: 0: 5978.7. Samples: 520337834. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:24,939][25689] Avg episode reward: [(0, '-35.782')] [2022-07-10 01:30:26,391][26022] Updated weights on worker 0-0, policy_version 508153 (0.00087) [2022-07-10 01:30:28,328][26022] Updated weights on worker 0-0, policy_version 508163 (0.00093) [2022-07-10 01:30:29,957][25689] Fps is (10 sec: 5722.4, 60 sec: 5709.0, 300 sec: 5684.4). Total num frames: 520369152. Throughput: 0: 5976.0. Samples: 520372086. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:29,959][25689] Avg episode reward: [(0, '-35.475')] [2022-07-10 01:30:29,966][26022] Updated weights on worker 0-0, policy_version 508173 (0.00088) [2022-07-10 01:30:32,026][26022] Updated weights on worker 0-0, policy_version 508183 (0.00087) [2022-07-10 01:30:33,724][26022] Updated weights on worker 0-0, policy_version 508193 (0.00090) [2022-07-10 01:30:34,997][25689] Fps is (10 sec: 5598.9, 60 sec: 5661.1, 300 sec: 5673.3). Total num frames: 520395776. Throughput: 0: 5965.3. Samples: 520406072. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:34,998][25689] Avg episode reward: [(0, '-36.401')] [2022-07-10 01:30:35,641][26022] Updated weights on worker 0-0, policy_version 508203 (0.00088) [2022-07-10 01:30:37,319][26022] Updated weights on worker 0-0, policy_version 508213 (0.00094) [2022-07-10 01:30:39,042][26022] Updated weights on worker 0-0, policy_version 508223 (0.00086) [2022-07-10 01:30:40,061][25689] Fps is (10 sec: 5574.2, 60 sec: 5657.5, 300 sec: 5679.9). Total num frames: 520425472. Throughput: 0: 5965.8. Samples: 520423330. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:40,062][25689] Avg episode reward: [(0, '-38.001')] [2022-07-10 01:30:40,811][26022] Updated weights on worker 0-0, policy_version 508233 (0.00092) [2022-07-10 01:30:42,576][26022] Updated weights on worker 0-0, policy_version 508243 (0.00098) [2022-07-10 01:30:44,518][26022] Updated weights on worker 0-0, policy_version 508253 (0.00090) [2022-07-10 01:30:45,091][25689] Fps is (10 sec: 5782.5, 60 sec: 5689.5, 300 sec: 5676.4). Total num frames: 520454144. Throughput: 0: 5956.8. Samples: 520457882. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:45,092][25689] Avg episode reward: [(0, '-38.483')] [2022-07-10 01:30:46,314][26022] Updated weights on worker 0-0, policy_version 508263 (0.00086) [2022-07-10 01:30:47,957][26022] Updated weights on worker 0-0, policy_version 508273 (0.00083) [2022-07-10 01:30:50,015][26022] Updated weights on worker 0-0, policy_version 508283 (0.00088) [2022-07-10 01:30:50,150][25689] Fps is (10 sec: 5582.5, 60 sec: 5635.4, 300 sec: 5669.8). Total num frames: 520481792. Throughput: 0: 5969.1. Samples: 520492616. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:50,150][25689] Avg episode reward: [(0, '-37.622')] [2022-07-10 01:30:51,447][26022] Updated weights on worker 0-0, policy_version 508293 (0.00083) [2022-07-10 01:30:53,408][26022] Updated weights on worker 0-0, policy_version 508303 (0.00088) [2022-07-10 01:30:55,079][26022] Updated weights on worker 0-0, policy_version 508313 (0.00089) [2022-07-10 01:30:55,221][25689] Fps is (10 sec: 5862.9, 60 sec: 5708.2, 300 sec: 5682.4). Total num frames: 520513536. Throughput: 0: 5115.2. Samples: 520509522. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:30:55,222][25689] Avg episode reward: [(0, '-36.604')] [2022-07-10 01:30:56,896][26022] Updated weights on worker 0-0, policy_version 508323 (0.00084) [2022-07-10 01:30:58,660][26022] Updated weights on worker 0-0, policy_version 508333 (0.00085) [2022-07-10 01:31:00,227][25689] Fps is (10 sec: 5893.3, 60 sec: 5674.8, 300 sec: 5685.8). Total num frames: 520541184. Throughput: 0: 5992.5. Samples: 520544178. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:31:00,229][25689] Avg episode reward: [(0, '-36.138')] [2022-07-10 01:31:00,522][26022] Updated weights on worker 0-0, policy_version 508343 (0.00082) [2022-07-10 01:31:02,708][26022] Updated weights on worker 0-0, policy_version 508353 (0.00084) [2022-07-10 01:31:04,476][26022] Updated weights on worker 0-0, policy_version 508363 (0.00088) [2022-07-10 01:31:05,236][25689] Fps is (10 sec: 5317.1, 60 sec: 5658.2, 300 sec: 5675.9). Total num frames: 520566784. Throughput: 0: 5879.4. Samples: 520576320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:31:05,238][25689] Avg episode reward: [(0, '-35.959')] [2022-07-10 01:31:06,336][26022] Updated weights on worker 0-0, policy_version 508373 (0.00088) [2022-07-10 01:31:08,168][26022] Updated weights on worker 0-0, policy_version 508383 (0.00093) [2022-07-10 01:31:09,854][26022] Updated weights on worker 0-0, policy_version 508393 (0.00094) [2022-07-10 01:31:10,238][25689] Fps is (10 sec: 5625.8, 60 sec: 5697.2, 300 sec: 5684.5). Total num frames: 520597504. Throughput: 0: 5013.7. Samples: 520593336. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:31:10,239][25689] Avg episode reward: [(0, '-33.306')] [2022-07-10 01:31:11,867][26022] Updated weights on worker 0-0, policy_version 508403 (0.00082) [2022-07-10 01:31:13,332][26022] Updated weights on worker 0-0, policy_version 508413 (0.00086) [2022-07-10 01:31:15,216][26022] Updated weights on worker 0-0, policy_version 508423 (0.00082) [2022-07-10 01:31:15,353][25689] Fps is (10 sec: 5769.1, 60 sec: 5709.7, 300 sec: 5675.8). Total num frames: 520625152. Throughput: 0: 5887.3. Samples: 520628044. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:31:15,354][25689] Avg episode reward: [(0, '-34.442')] [2022-07-10 01:31:16,911][26022] Updated weights on worker 0-0, policy_version 508433 (0.00086) [2022-07-10 01:31:18,908][26022] Updated weights on worker 0-0, policy_version 508443 (0.00090) [2022-07-10 01:31:20,370][25689] Fps is (10 sec: 5559.2, 60 sec: 5659.4, 300 sec: 5675.8). Total num frames: 520653824. Throughput: 0: 5862.7. Samples: 520662266. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 01:31:20,372][25689] Avg episode reward: [(0, '-33.934')] [2022-07-10 01:31:20,565][26022] Updated weights on worker 0-0, policy_version 508453 (0.00088) [2022-07-10 01:31:22,376][26022] Updated weights on worker 0-0, policy_version 508463 (0.00085) [2022-07-10 01:31:24,227][26022] Updated weights on worker 0-0, policy_version 508473 (0.00084) [2022-07-10 01:31:25,402][25689] Fps is (10 sec: 5706.5, 60 sec: 5673.5, 300 sec: 5678.9). Total num frames: 520682496. Throughput: 0: 5110.4. Samples: 520679378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:25,403][25689] Avg episode reward: [(0, '-33.650')] [2022-07-10 01:31:25,957][26022] Updated weights on worker 0-0, policy_version 508483 (0.00088) [2022-07-10 01:31:27,815][26022] Updated weights on worker 0-0, policy_version 508493 (0.00090) [2022-07-10 01:31:29,451][26022] Updated weights on worker 0-0, policy_version 508503 (0.00080) [2022-07-10 01:31:30,426][25689] Fps is (10 sec: 5702.2, 60 sec: 5656.1, 300 sec: 5676.9). Total num frames: 520711168. Throughput: 0: 5971.2. Samples: 520713880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:30,427][25689] Avg episode reward: [(0, '-33.270')] [2022-07-10 01:31:31,490][26022] Updated weights on worker 0-0, policy_version 508513 (0.00092) [2022-07-10 01:31:33,016][26022] Updated weights on worker 0-0, policy_version 508523 (0.00090) [2022-07-10 01:31:35,139][26022] Updated weights on worker 0-0, policy_version 508533 (0.00099) [2022-07-10 01:31:35,510][25689] Fps is (10 sec: 5673.4, 60 sec: 5685.9, 300 sec: 5672.6). Total num frames: 520739840. Throughput: 0: 5938.4. Samples: 520747740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:35,511][25689] Avg episode reward: [(0, '-34.331')] [2022-07-10 01:31:36,603][26022] Updated weights on worker 0-0, policy_version 508543 (0.00093) [2022-07-10 01:31:38,611][26022] Updated weights on worker 0-0, policy_version 508553 (0.00085) [2022-07-10 01:31:40,429][26022] Updated weights on worker 0-0, policy_version 508563 (0.00095) [2022-07-10 01:31:40,515][25689] Fps is (10 sec: 5684.1, 60 sec: 5674.4, 300 sec: 5679.7). Total num frames: 520768512. Throughput: 0: 5089.6. Samples: 520764796. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:40,515][25689] Avg episode reward: [(0, '-33.510')] [2022-07-10 01:31:42,226][26022] Updated weights on worker 0-0, policy_version 508573 (0.00079) [2022-07-10 01:31:43,935][26022] Updated weights on worker 0-0, policy_version 508583 (0.00096) [2022-07-10 01:31:45,534][25689] Fps is (10 sec: 5720.7, 60 sec: 5675.5, 300 sec: 5676.3). Total num frames: 520797184. Throughput: 0: 5960.0. Samples: 520799362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:45,534][25689] Avg episode reward: [(0, '-34.329')] [2022-07-10 01:31:45,691][26022] Updated weights on worker 0-0, policy_version 508593 (0.00091) [2022-07-10 01:31:47,450][26022] Updated weights on worker 0-0, policy_version 508603 (0.00086) [2022-07-10 01:31:48,272][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:31:48,281][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000508607_520813568.pth [2022-07-10 01:31:48,281][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000506609_518767616.pth [2022-07-10 01:31:49,296][26022] Updated weights on worker 0-0, policy_version 508613 (0.00088) [2022-07-10 01:31:50,579][25689] Fps is (10 sec: 5799.7, 60 sec: 5710.6, 300 sec: 5676.9). Total num frames: 520826880. Throughput: 0: 5959.8. Samples: 520833986. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:50,579][25689] Avg episode reward: [(0, '-34.248')] [2022-07-10 01:31:51,220][26022] Updated weights on worker 0-0, policy_version 508623 (0.00095) [2022-07-10 01:31:52,956][26022] Updated weights on worker 0-0, policy_version 508633 (0.00090) [2022-07-10 01:31:54,632][26022] Updated weights on worker 0-0, policy_version 508643 (0.00086) [2022-07-10 01:31:55,626][25689] Fps is (10 sec: 5783.4, 60 sec: 5662.1, 300 sec: 5679.6). Total num frames: 520855552. Throughput: 0: 5144.7. Samples: 520851232. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:31:55,626][25689] Avg episode reward: [(0, '-36.225')] [2022-07-10 01:31:56,307][26022] Updated weights on worker 0-0, policy_version 508653 (0.00091) [2022-07-10 01:31:58,240][26022] Updated weights on worker 0-0, policy_version 508663 (0.00084) [2022-07-10 01:31:59,910][26022] Updated weights on worker 0-0, policy_version 508673 (0.00084) [2022-07-10 01:32:00,670][25689] Fps is (10 sec: 5784.0, 60 sec: 5692.4, 300 sec: 5689.4). Total num frames: 520885248. Throughput: 0: 6007.3. Samples: 520885874. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:00,670][25689] Avg episode reward: [(0, '-36.281')] [2022-07-10 01:32:02,370][26022] Updated weights on worker 0-0, policy_version 508683 (0.00082) [2022-07-10 01:32:03,896][26022] Updated weights on worker 0-0, policy_version 508693 (0.00087) [2022-07-10 01:32:05,702][25689] Fps is (10 sec: 5487.9, 60 sec: 5690.2, 300 sec: 5678.7). Total num frames: 520910848. Throughput: 0: 5896.7. Samples: 520918288. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:05,703][25689] Avg episode reward: [(0, '-36.265')] [2022-07-10 01:32:05,779][26022] Updated weights on worker 0-0, policy_version 508703 (0.00095) [2022-07-10 01:32:07,318][26022] Updated weights on worker 0-0, policy_version 508713 (0.00087) [2022-07-10 01:32:09,372][26022] Updated weights on worker 0-0, policy_version 508723 (0.00091) [2022-07-10 01:32:10,732][25689] Fps is (10 sec: 5495.4, 60 sec: 5670.7, 300 sec: 5680.8). Total num frames: 520940544. Throughput: 0: 5024.3. Samples: 520935244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:10,733][25689] Avg episode reward: [(0, '-35.892')] [2022-07-10 01:32:11,239][26022] Updated weights on worker 0-0, policy_version 508733 (0.00098) [2022-07-10 01:32:12,979][26022] Updated weights on worker 0-0, policy_version 508743 (0.00085) [2022-07-10 01:32:14,528][26022] Updated weights on worker 0-0, policy_version 508753 (0.00085) [2022-07-10 01:32:15,788][25689] Fps is (10 sec: 5786.9, 60 sec: 5693.1, 300 sec: 5677.0). Total num frames: 520969216. Throughput: 0: 5871.0. Samples: 520969604. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:15,789][25689] Avg episode reward: [(0, '-36.364')] [2022-07-10 01:32:16,485][26022] Updated weights on worker 0-0, policy_version 508763 (0.00090) [2022-07-10 01:32:18,366][26022] Updated weights on worker 0-0, policy_version 508773 (0.00089) [2022-07-10 01:32:20,097][26022] Updated weights on worker 0-0, policy_version 508783 (0.00093) [2022-07-10 01:32:20,800][25689] Fps is (10 sec: 5593.9, 60 sec: 5676.6, 300 sec: 5673.7). Total num frames: 520996864. Throughput: 0: 5831.8. Samples: 521003270. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:20,802][25689] Avg episode reward: [(0, '-37.369')] [2022-07-10 01:32:21,968][26022] Updated weights on worker 0-0, policy_version 508793 (0.00090) [2022-07-10 01:32:23,887][26022] Updated weights on worker 0-0, policy_version 508803 (0.00064) [2022-07-10 01:32:25,572][26022] Updated weights on worker 0-0, policy_version 508813 (0.00099) [2022-07-10 01:32:25,828][25689] Fps is (10 sec: 5609.6, 60 sec: 5677.0, 300 sec: 5677.1). Total num frames: 521025536. Throughput: 0: 5068.0. Samples: 521020288. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:25,828][25689] Avg episode reward: [(0, '-35.571')] [2022-07-10 01:32:27,510][26022] Updated weights on worker 0-0, policy_version 508823 (0.00092) [2022-07-10 01:32:29,181][26022] Updated weights on worker 0-0, policy_version 508833 (0.00087) [2022-07-10 01:32:30,842][25689] Fps is (10 sec: 5506.4, 60 sec: 5644.0, 300 sec: 5672.4). Total num frames: 521052160. Throughput: 0: 5904.5. Samples: 521053984. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:30,844][25689] Avg episode reward: [(0, '-35.029')] [2022-07-10 01:32:31,115][26022] Updated weights on worker 0-0, policy_version 508843 (0.00087) [2022-07-10 01:32:32,723][26022] Updated weights on worker 0-0, policy_version 508853 (0.00089) [2022-07-10 01:32:34,710][26022] Updated weights on worker 0-0, policy_version 508863 (0.00093) [2022-07-10 01:32:35,900][25689] Fps is (10 sec: 5693.2, 60 sec: 5680.3, 300 sec: 5675.5). Total num frames: 521082880. Throughput: 0: 5898.1. Samples: 521088228. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:35,902][25689] Avg episode reward: [(0, '-34.404')] [2022-07-10 01:32:36,436][26022] Updated weights on worker 0-0, policy_version 508873 (0.00086) [2022-07-10 01:32:38,254][26022] Updated weights on worker 0-0, policy_version 508883 (0.00091) [2022-07-10 01:32:40,067][26022] Updated weights on worker 0-0, policy_version 508893 (0.00084) [2022-07-10 01:32:40,907][25689] Fps is (10 sec: 5697.5, 60 sec: 5646.3, 300 sec: 5670.0). Total num frames: 521109504. Throughput: 0: 5063.3. Samples: 521105076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:40,908][25689] Avg episode reward: [(0, '-34.882')] [2022-07-10 01:32:42,043][26022] Updated weights on worker 0-0, policy_version 508903 (0.00083) [2022-07-10 01:32:43,647][26022] Updated weights on worker 0-0, policy_version 508913 (0.00087) [2022-07-10 01:32:45,722][26022] Updated weights on worker 0-0, policy_version 508923 (0.00087) [2022-07-10 01:32:45,939][25689] Fps is (10 sec: 5508.3, 60 sec: 5645.0, 300 sec: 5670.1). Total num frames: 521138176. Throughput: 0: 5900.6. Samples: 521138954. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:45,940][25689] Avg episode reward: [(0, '-33.453')] [2022-07-10 01:32:47,315][26022] Updated weights on worker 0-0, policy_version 508933 (0.00105) [2022-07-10 01:32:49,363][26022] Updated weights on worker 0-0, policy_version 508943 (0.00097) [2022-07-10 01:32:50,942][25689] Fps is (10 sec: 5816.2, 60 sec: 5648.9, 300 sec: 5674.5). Total num frames: 521167872. Throughput: 0: 5905.1. Samples: 521172678. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:50,943][25689] Avg episode reward: [(0, '-33.814')] [2022-07-10 01:32:50,949][26022] Updated weights on worker 0-0, policy_version 508953 (0.00086) [2022-07-10 01:32:52,999][26022] Updated weights on worker 0-0, policy_version 508963 (0.00495) [2022-07-10 01:32:54,640][26022] Updated weights on worker 0-0, policy_version 508973 (0.00093) [2022-07-10 01:32:55,987][25689] Fps is (10 sec: 5605.3, 60 sec: 5615.3, 300 sec: 5663.9). Total num frames: 521194496. Throughput: 0: 5053.3. Samples: 521189730. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:32:55,987][25689] Avg episode reward: [(0, '-34.202')] [2022-07-10 01:32:56,391][26022] Updated weights on worker 0-0, policy_version 508983 (0.00093) [2022-07-10 01:32:58,359][26022] Updated weights on worker 0-0, policy_version 508993 (0.00099) [2022-07-10 01:33:00,341][26022] Updated weights on worker 0-0, policy_version 509003 (0.00128) [2022-07-10 01:33:00,999][25689] Fps is (10 sec: 5600.5, 60 sec: 5618.3, 300 sec: 5677.6). Total num frames: 521224192. Throughput: 0: 5913.5. Samples: 521223888. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:00,999][25689] Avg episode reward: [(0, '-35.937')] [2022-07-10 01:33:02,403][26022] Updated weights on worker 0-0, policy_version 509013 (0.00089) [2022-07-10 01:33:04,139][26022] Updated weights on worker 0-0, policy_version 509023 (0.00092) [2022-07-10 01:33:05,792][26022] Updated weights on worker 0-0, policy_version 509033 (0.00085) [2022-07-10 01:33:06,018][25689] Fps is (10 sec: 5614.4, 60 sec: 5636.4, 300 sec: 5670.6). Total num frames: 521250816. Throughput: 0: 5824.3. Samples: 521255900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:06,019][25689] Avg episode reward: [(0, '-36.165')] [2022-07-10 01:33:07,738][26022] Updated weights on worker 0-0, policy_version 509043 (0.00543) [2022-07-10 01:33:09,263][26022] Updated weights on worker 0-0, policy_version 509053 (0.00090) [2022-07-10 01:33:11,042][25689] Fps is (10 sec: 5403.9, 60 sec: 5603.1, 300 sec: 5660.5). Total num frames: 521278464. Throughput: 0: 4991.6. Samples: 521273006. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:11,042][25689] Avg episode reward: [(0, '-36.195')] [2022-07-10 01:33:11,447][26022] Updated weights on worker 0-0, policy_version 509063 (0.00084) [2022-07-10 01:33:12,965][26022] Updated weights on worker 0-0, policy_version 509073 (0.00085) [2022-07-10 01:33:14,823][26022] Updated weights on worker 0-0, policy_version 509083 (0.00085) [2022-07-10 01:33:16,162][25689] Fps is (10 sec: 5653.3, 60 sec: 5614.1, 300 sec: 5669.4). Total num frames: 521308160. Throughput: 0: 5822.2. Samples: 521307192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:16,162][25689] Avg episode reward: [(0, '-36.394')] [2022-07-10 01:33:16,627][26022] Updated weights on worker 0-0, policy_version 509093 (0.00088) [2022-07-10 01:33:18,424][26022] Updated weights on worker 0-0, policy_version 509103 (0.00089) [2022-07-10 01:33:20,351][26022] Updated weights on worker 0-0, policy_version 509113 (0.00088) [2022-07-10 01:33:21,167][25689] Fps is (10 sec: 5663.6, 60 sec: 5614.8, 300 sec: 5662.5). Total num frames: 521335808. Throughput: 0: 5827.3. Samples: 521341414. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:21,167][25689] Avg episode reward: [(0, '-36.405')] [2022-07-10 01:33:22,073][26022] Updated weights on worker 0-0, policy_version 509123 (0.00082) [2022-07-10 01:33:23,801][26022] Updated weights on worker 0-0, policy_version 509133 (0.00095) [2022-07-10 01:33:25,652][26022] Updated weights on worker 0-0, policy_version 509143 (0.00099) [2022-07-10 01:33:26,189][25689] Fps is (10 sec: 5718.6, 60 sec: 5632.2, 300 sec: 5669.4). Total num frames: 521365504. Throughput: 0: 5103.9. Samples: 521358852. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:26,190][25689] Avg episode reward: [(0, '-35.748')] [2022-07-10 01:33:27,487][26022] Updated weights on worker 0-0, policy_version 509153 (0.00085) [2022-07-10 01:33:29,157][26022] Updated weights on worker 0-0, policy_version 509163 (0.00086) [2022-07-10 01:33:30,909][26022] Updated weights on worker 0-0, policy_version 509173 (0.00092) [2022-07-10 01:33:31,229][25689] Fps is (10 sec: 5902.7, 60 sec: 5680.7, 300 sec: 5673.5). Total num frames: 521395200. Throughput: 0: 5959.1. Samples: 521393302. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:31,229][25689] Avg episode reward: [(0, '-34.568')] [2022-07-10 01:33:32,799][26022] Updated weights on worker 0-0, policy_version 509183 (0.00055) [2022-07-10 01:33:34,547][26022] Updated weights on worker 0-0, policy_version 509193 (0.00089) [2022-07-10 01:33:36,277][25689] Fps is (10 sec: 5684.5, 60 sec: 5630.8, 300 sec: 5658.8). Total num frames: 521422848. Throughput: 0: 5991.4. Samples: 521427712. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:36,278][25689] Avg episode reward: [(0, '-34.030')] [2022-07-10 01:33:36,504][26022] Updated weights on worker 0-0, policy_version 509203 (0.00091) [2022-07-10 01:33:38,153][26022] Updated weights on worker 0-0, policy_version 509213 (0.00084) [2022-07-10 01:33:40,082][26022] Updated weights on worker 0-0, policy_version 509223 (0.00084) [2022-07-10 01:33:41,300][25689] Fps is (10 sec: 5693.7, 60 sec: 5680.1, 300 sec: 5670.3). Total num frames: 521452544. Throughput: 0: 5139.9. Samples: 521444898. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:41,309][25689] Avg episode reward: [(0, '-35.158')] [2022-07-10 01:33:41,595][26022] Updated weights on worker 0-0, policy_version 509233 (0.00084) [2022-07-10 01:33:43,476][26022] Updated weights on worker 0-0, policy_version 509243 (0.00082) [2022-07-10 01:33:45,306][26022] Updated weights on worker 0-0, policy_version 509253 (0.00085) [2022-07-10 01:33:46,311][25689] Fps is (10 sec: 5715.0, 60 sec: 5665.2, 300 sec: 5667.3). Total num frames: 521480192. Throughput: 0: 5994.5. Samples: 521479472. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:46,312][25689] Avg episode reward: [(0, '-35.264')] [2022-07-10 01:33:47,046][26022] Updated weights on worker 0-0, policy_version 509263 (0.00080) [2022-07-10 01:33:48,284][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:33:48,292][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000509270_521492480.pth [2022-07-10 01:33:48,298][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000507275_519449600.pth [2022-07-10 01:33:48,962][26022] Updated weights on worker 0-0, policy_version 509273 (0.00089) [2022-07-10 01:33:50,578][26022] Updated weights on worker 0-0, policy_version 509283 (0.00093) [2022-07-10 01:33:51,339][25689] Fps is (10 sec: 5508.4, 60 sec: 5629.0, 300 sec: 5662.0). Total num frames: 521507840. Throughput: 0: 5992.3. Samples: 521513808. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:51,339][25689] Avg episode reward: [(0, '-36.614')] [2022-07-10 01:33:52,537][26022] Updated weights on worker 0-0, policy_version 509293 (0.00085) [2022-07-10 01:33:54,289][26022] Updated weights on worker 0-0, policy_version 509303 (0.00098) [2022-07-10 01:33:55,827][26022] Updated weights on worker 0-0, policy_version 509313 (0.00093) [2022-07-10 01:33:56,431][25689] Fps is (10 sec: 5767.7, 60 sec: 5692.3, 300 sec: 5664.5). Total num frames: 521538560. Throughput: 0: 5111.3. Samples: 521530724. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:33:56,432][25689] Avg episode reward: [(0, '-37.031')] [2022-07-10 01:33:57,822][26022] Updated weights on worker 0-0, policy_version 509323 (0.00093) [2022-07-10 01:33:59,370][26022] Updated weights on worker 0-0, policy_version 509333 (0.00098) [2022-07-10 01:34:01,441][25689] Fps is (10 sec: 5676.6, 60 sec: 5641.7, 300 sec: 5668.5). Total num frames: 521565184. Throughput: 0: 5973.0. Samples: 521565196. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:34:01,447][25689] Avg episode reward: [(0, '-37.191')] [2022-07-10 01:34:01,801][26022] Updated weights on worker 0-0, policy_version 509343 (0.00097) [2022-07-10 01:34:03,844][26022] Updated weights on worker 0-0, policy_version 509353 (0.00091) [2022-07-10 01:34:05,357][26022] Updated weights on worker 0-0, policy_version 509363 (0.00087) [2022-07-10 01:34:06,455][25689] Fps is (10 sec: 5414.1, 60 sec: 5659.1, 300 sec: 5664.8). Total num frames: 521592832. Throughput: 0: 5855.5. Samples: 521597424. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:34:06,457][25689] Avg episode reward: [(0, '-36.139')] [2022-07-10 01:34:07,278][26022] Updated weights on worker 0-0, policy_version 509373 (0.00093) [2022-07-10 01:34:09,012][26022] Updated weights on worker 0-0, policy_version 509383 (0.00088) [2022-07-10 01:34:10,762][26022] Updated weights on worker 0-0, policy_version 509393 (0.00089) [2022-07-10 01:34:11,464][25689] Fps is (10 sec: 5721.2, 60 sec: 5694.4, 300 sec: 5662.5). Total num frames: 521622528. Throughput: 0: 5012.4. Samples: 521614682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:34:11,465][25689] Avg episode reward: [(0, '-36.133')] [2022-07-10 01:34:12,686][26022] Updated weights on worker 0-0, policy_version 509403 (0.00053) [2022-07-10 01:34:14,271][26022] Updated weights on worker 0-0, policy_version 509413 (0.00086) [2022-07-10 01:34:16,187][26022] Updated weights on worker 0-0, policy_version 509423 (0.00083) [2022-07-10 01:34:16,548][25689] Fps is (10 sec: 5782.9, 60 sec: 5680.7, 300 sec: 5665.9). Total num frames: 521651200. Throughput: 0: 5878.2. Samples: 521648978. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 01:34:16,549][25689] Avg episode reward: [(0, '-36.744')] [2022-07-10 01:34:17,762][26022] Updated weights on worker 0-0, policy_version 509433 (0.00092) [2022-07-10 01:34:19,770][26022] Updated weights on worker 0-0, policy_version 509443 (0.00086) [2022-07-10 01:34:21,223][26022] Updated weights on worker 0-0, policy_version 509453 (0.00096) [2022-07-10 01:34:21,615][25689] Fps is (10 sec: 5750.0, 60 sec: 5708.9, 300 sec: 5665.6). Total num frames: 521680896. Throughput: 0: 5865.5. Samples: 521683526. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:21,615][25689] Avg episode reward: [(0, '-35.423')] [2022-07-10 01:34:23,345][26022] Updated weights on worker 0-0, policy_version 509463 (0.00086) [2022-07-10 01:34:25,028][26022] Updated weights on worker 0-0, policy_version 509473 (0.00086) [2022-07-10 01:34:26,690][25689] Fps is (10 sec: 5654.4, 60 sec: 5670.1, 300 sec: 5668.2). Total num frames: 521708544. Throughput: 0: 5960.3. Samples: 521718028. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:26,690][25689] Avg episode reward: [(0, '-36.587')] [2022-07-10 01:34:26,897][26022] Updated weights on worker 0-0, policy_version 509483 (0.00087) [2022-07-10 01:34:28,551][26022] Updated weights on worker 0-0, policy_version 509493 (0.00086) [2022-07-10 01:34:30,328][26022] Updated weights on worker 0-0, policy_version 509503 (0.00081) [2022-07-10 01:34:31,737][25689] Fps is (10 sec: 5664.9, 60 sec: 5669.3, 300 sec: 5668.7). Total num frames: 521738240. Throughput: 0: 5947.5. Samples: 521735260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:31,738][25689] Avg episode reward: [(0, '-37.315')] [2022-07-10 01:34:32,157][26022] Updated weights on worker 0-0, policy_version 509513 (0.00084) [2022-07-10 01:34:34,077][26022] Updated weights on worker 0-0, policy_version 509523 (0.00088) [2022-07-10 01:34:35,782][26022] Updated weights on worker 0-0, policy_version 509533 (0.00086) [2022-07-10 01:34:36,796][25689] Fps is (10 sec: 5876.9, 60 sec: 5702.2, 300 sec: 5668.1). Total num frames: 521767936. Throughput: 0: 5965.9. Samples: 521769772. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:36,796][25689] Avg episode reward: [(0, '-37.619')] [2022-07-10 01:34:37,769][26022] Updated weights on worker 0-0, policy_version 509543 (0.00081) [2022-07-10 01:34:39,259][26022] Updated weights on worker 0-0, policy_version 509553 (0.00085) [2022-07-10 01:34:41,237][26022] Updated weights on worker 0-0, policy_version 509563 (0.00085) [2022-07-10 01:34:41,888][25689] Fps is (10 sec: 5750.3, 60 sec: 5678.8, 300 sec: 5673.4). Total num frames: 521796608. Throughput: 0: 5943.1. Samples: 521804012. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:41,888][25689] Avg episode reward: [(0, '-37.630')] [2022-07-10 01:34:43,011][26022] Updated weights on worker 0-0, policy_version 509573 (0.00093) [2022-07-10 01:34:44,740][26022] Updated weights on worker 0-0, policy_version 509583 (0.00085) [2022-07-10 01:34:46,502][26022] Updated weights on worker 0-0, policy_version 509593 (0.00084) [2022-07-10 01:34:46,980][25689] Fps is (10 sec: 5630.8, 60 sec: 5688.1, 300 sec: 5665.2). Total num frames: 521825280. Throughput: 0: 5090.0. Samples: 521821304. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:46,983][25689] Avg episode reward: [(0, '-37.022')] [2022-07-10 01:34:48,259][26022] Updated weights on worker 0-0, policy_version 509603 (0.00092) [2022-07-10 01:34:50,092][26022] Updated weights on worker 0-0, policy_version 509613 (0.00086) [2022-07-10 01:34:51,742][26022] Updated weights on worker 0-0, policy_version 509623 (0.00085) [2022-07-10 01:34:51,991][25689] Fps is (10 sec: 5878.7, 60 sec: 5740.3, 300 sec: 5677.7). Total num frames: 521856000. Throughput: 0: 5960.6. Samples: 521855982. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:51,991][25689] Avg episode reward: [(0, '-36.710')] [2022-07-10 01:34:53,739][26022] Updated weights on worker 0-0, policy_version 509633 (0.00094) [2022-07-10 01:34:55,183][26022] Updated weights on worker 0-0, policy_version 509643 (0.00096) [2022-07-10 01:34:57,032][25689] Fps is (10 sec: 5704.7, 60 sec: 5677.5, 300 sec: 5666.8). Total num frames: 521882624. Throughput: 0: 5975.6. Samples: 521890694. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:34:57,032][25689] Avg episode reward: [(0, '-37.394')] [2022-07-10 01:34:57,180][26022] Updated weights on worker 0-0, policy_version 509653 (0.00083) [2022-07-10 01:34:58,839][26022] Updated weights on worker 0-0, policy_version 509663 (0.00084) [2022-07-10 01:35:00,781][26022] Updated weights on worker 0-0, policy_version 509673 (0.00086) [2022-07-10 01:35:02,100][25689] Fps is (10 sec: 5571.1, 60 sec: 5722.7, 300 sec: 5676.1). Total num frames: 521912320. Throughput: 0: 5154.6. Samples: 521908198. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:02,100][25689] Avg episode reward: [(0, '-37.008')] [2022-07-10 01:35:02,844][26022] Updated weights on worker 0-0, policy_version 509683 (0.00088) [2022-07-10 01:35:04,615][26022] Updated weights on worker 0-0, policy_version 509693 (0.00088) [2022-07-10 01:35:06,301][26022] Updated weights on worker 0-0, policy_version 509703 (0.00085) [2022-07-10 01:35:07,116][25689] Fps is (10 sec: 5584.6, 60 sec: 5705.6, 300 sec: 5669.9). Total num frames: 521938944. Throughput: 0: 5927.8. Samples: 521940672. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:07,117][25689] Avg episode reward: [(0, '-37.266')] [2022-07-10 01:35:08,099][26022] Updated weights on worker 0-0, policy_version 509713 (0.00102) [2022-07-10 01:35:10,063][26022] Updated weights on worker 0-0, policy_version 509723 (0.00088) [2022-07-10 01:35:11,650][26022] Updated weights on worker 0-0, policy_version 509733 (0.00084) [2022-07-10 01:35:12,151][25689] Fps is (10 sec: 5501.5, 60 sec: 5686.3, 300 sec: 5677.5). Total num frames: 521967616. Throughput: 0: 5916.9. Samples: 521975270. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:12,152][25689] Avg episode reward: [(0, '-37.039')] [2022-07-10 01:35:13,411][26022] Updated weights on worker 0-0, policy_version 509743 (0.00083) [2022-07-10 01:35:15,384][26022] Updated weights on worker 0-0, policy_version 509753 (0.00096) [2022-07-10 01:35:17,139][26022] Updated weights on worker 0-0, policy_version 509763 (0.00082) [2022-07-10 01:35:17,196][25689] Fps is (10 sec: 5790.8, 60 sec: 5706.9, 300 sec: 5670.1). Total num frames: 521997312. Throughput: 0: 5055.7. Samples: 521992638. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:17,196][25689] Avg episode reward: [(0, '-37.730')] [2022-07-10 01:35:18,993][26022] Updated weights on worker 0-0, policy_version 509773 (0.00085) [2022-07-10 01:35:20,718][26022] Updated weights on worker 0-0, policy_version 509783 (0.00092) [2022-07-10 01:35:22,199][25689] Fps is (10 sec: 5809.3, 60 sec: 5696.0, 300 sec: 5673.6). Total num frames: 522025984. Throughput: 0: 5912.0. Samples: 522027020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:22,199][25689] Avg episode reward: [(0, '-36.392')] [2022-07-10 01:35:22,456][26022] Updated weights on worker 0-0, policy_version 509793 (0.00086) [2022-07-10 01:35:24,092][26022] Updated weights on worker 0-0, policy_version 509803 (0.00086) [2022-07-10 01:35:25,957][26022] Updated weights on worker 0-0, policy_version 509813 (0.00084) [2022-07-10 01:35:27,230][25689] Fps is (10 sec: 5816.8, 60 sec: 5733.9, 300 sec: 5673.3). Total num frames: 522055680. Throughput: 0: 6010.6. Samples: 522061568. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:27,231][25689] Avg episode reward: [(0, '-35.498')] [2022-07-10 01:35:27,639][26022] Updated weights on worker 0-0, policy_version 509823 (0.00091) [2022-07-10 01:35:29,701][26022] Updated weights on worker 0-0, policy_version 509833 (0.00087) [2022-07-10 01:35:31,182][26022] Updated weights on worker 0-0, policy_version 509843 (0.00081) [2022-07-10 01:35:32,259][25689] Fps is (10 sec: 5801.5, 60 sec: 5718.8, 300 sec: 5680.4). Total num frames: 522084352. Throughput: 0: 5156.2. Samples: 522078952. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:32,260][25689] Avg episode reward: [(0, '-35.427')] [2022-07-10 01:35:33,258][26022] Updated weights on worker 0-0, policy_version 509853 (0.00079) [2022-07-10 01:35:34,770][26022] Updated weights on worker 0-0, policy_version 509863 (0.00095) [2022-07-10 01:35:36,822][26022] Updated weights on worker 0-0, policy_version 509873 (0.00094) [2022-07-10 01:35:37,305][25689] Fps is (10 sec: 5691.9, 60 sec: 5703.0, 300 sec: 5677.3). Total num frames: 522113024. Throughput: 0: 6020.4. Samples: 522113702. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:37,305][25689] Avg episode reward: [(0, '-35.269')] [2022-07-10 01:35:38,183][26022] Updated weights on worker 0-0, policy_version 509883 (0.00092) [2022-07-10 01:35:40,245][26022] Updated weights on worker 0-0, policy_version 509893 (0.00071) [2022-07-10 01:35:41,791][26022] Updated weights on worker 0-0, policy_version 509903 (0.00088) [2022-07-10 01:35:42,327][25689] Fps is (10 sec: 5797.8, 60 sec: 5726.6, 300 sec: 5680.9). Total num frames: 522142720. Throughput: 0: 6021.1. Samples: 522148214. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:42,327][25689] Avg episode reward: [(0, '-35.854')] [2022-07-10 01:35:43,954][26022] Updated weights on worker 0-0, policy_version 509913 (0.00103) [2022-07-10 01:35:45,455][26022] Updated weights on worker 0-0, policy_version 509923 (0.00086) [2022-07-10 01:35:47,339][25689] Fps is (10 sec: 5714.7, 60 sec: 5717.2, 300 sec: 5681.8). Total num frames: 522170368. Throughput: 0: 5174.9. Samples: 522165630. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:47,340][25689] Avg episode reward: [(0, '-35.110')] [2022-07-10 01:35:47,370][26022] Updated weights on worker 0-0, policy_version 509933 (0.00087) [2022-07-10 01:35:48,331][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:35:48,344][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000509939_522177536.pth [2022-07-10 01:35:48,344][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000507941_520131584.pth [2022-07-10 01:35:48,997][26022] Updated weights on worker 0-0, policy_version 509943 (0.00088) [2022-07-10 01:35:50,995][26022] Updated weights on worker 0-0, policy_version 509953 (0.00090) [2022-07-10 01:35:52,365][25689] Fps is (10 sec: 5814.7, 60 sec: 5715.8, 300 sec: 5679.3). Total num frames: 522201088. Throughput: 0: 6017.9. Samples: 522199942. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:52,365][25689] Avg episode reward: [(0, '-35.165')] [2022-07-10 01:35:52,704][26022] Updated weights on worker 0-0, policy_version 509963 (0.00084) [2022-07-10 01:35:54,522][26022] Updated weights on worker 0-0, policy_version 509973 (0.00093) [2022-07-10 01:35:56,280][26022] Updated weights on worker 0-0, policy_version 509983 (0.00707) [2022-07-10 01:35:57,418][25689] Fps is (10 sec: 5689.7, 60 sec: 5714.7, 300 sec: 5674.9). Total num frames: 522227712. Throughput: 0: 5998.3. Samples: 522234346. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:35:57,418][25689] Avg episode reward: [(0, '-35.135')] [2022-07-10 01:35:58,051][26022] Updated weights on worker 0-0, policy_version 509993 (0.00087) [2022-07-10 01:35:59,760][26022] Updated weights on worker 0-0, policy_version 510003 (0.00087) [2022-07-10 01:36:01,656][26022] Updated weights on worker 0-0, policy_version 510013 (0.00088) [2022-07-10 01:36:02,440][25689] Fps is (10 sec: 5488.3, 60 sec: 5702.1, 300 sec: 5685.0). Total num frames: 522256384. Throughput: 0: 5146.9. Samples: 522251734. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:02,440][25689] Avg episode reward: [(0, '-35.695')] [2022-07-10 01:36:03,822][26022] Updated weights on worker 0-0, policy_version 510023 (0.00087) [2022-07-10 01:36:05,682][26022] Updated weights on worker 0-0, policy_version 510033 (0.00088) [2022-07-10 01:36:07,486][25689] Fps is (10 sec: 5492.1, 60 sec: 5699.3, 300 sec: 5670.4). Total num frames: 522283008. Throughput: 0: 5875.1. Samples: 522283994. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:07,487][25689] Avg episode reward: [(0, '-35.616')] [2022-07-10 01:36:07,635][26022] Updated weights on worker 0-0, policy_version 510043 (0.00079) [2022-07-10 01:36:09,232][26022] Updated weights on worker 0-0, policy_version 510053 (0.00092) [2022-07-10 01:36:10,916][26022] Updated weights on worker 0-0, policy_version 510063 (0.00096) [2022-07-10 01:36:12,496][25689] Fps is (10 sec: 5702.3, 60 sec: 5735.6, 300 sec: 5682.7). Total num frames: 522313728. Throughput: 0: 5886.5. Samples: 522318446. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:12,497][25689] Avg episode reward: [(0, '-35.702')] [2022-07-10 01:36:12,725][26022] Updated weights on worker 0-0, policy_version 510073 (0.00091) [2022-07-10 01:36:14,547][26022] Updated weights on worker 0-0, policy_version 510083 (0.00082) [2022-07-10 01:36:16,358][26022] Updated weights on worker 0-0, policy_version 510093 (0.00090) [2022-07-10 01:36:17,553][25689] Fps is (10 sec: 5899.7, 60 sec: 5717.4, 300 sec: 5681.9). Total num frames: 522342400. Throughput: 0: 5038.4. Samples: 522335794. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:17,554][25689] Avg episode reward: [(0, '-36.341')] [2022-07-10 01:36:18,207][26022] Updated weights on worker 0-0, policy_version 510103 (0.00080) [2022-07-10 01:36:19,961][26022] Updated weights on worker 0-0, policy_version 510113 (0.00086) [2022-07-10 01:36:21,504][26022] Updated weights on worker 0-0, policy_version 510123 (0.00089) [2022-07-10 01:36:22,564][25689] Fps is (10 sec: 5594.0, 60 sec: 5699.7, 300 sec: 5678.9). Total num frames: 522370048. Throughput: 0: 5874.7. Samples: 522369958. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:22,564][25689] Avg episode reward: [(0, '-34.953')] [2022-07-10 01:36:23,497][26022] Updated weights on worker 0-0, policy_version 510133 (0.00087) [2022-07-10 01:36:25,288][26022] Updated weights on worker 0-0, policy_version 510143 (0.00085) [2022-07-10 01:36:27,101][26022] Updated weights on worker 0-0, policy_version 510153 (0.00085) [2022-07-10 01:36:27,596][25689] Fps is (10 sec: 5607.8, 60 sec: 5682.7, 300 sec: 5678.7). Total num frames: 522398720. Throughput: 0: 5977.3. Samples: 522404198. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:27,597][25689] Avg episode reward: [(0, '-34.340')] [2022-07-10 01:36:28,856][26022] Updated weights on worker 0-0, policy_version 510163 (0.00093) [2022-07-10 01:36:30,749][26022] Updated weights on worker 0-0, policy_version 510173 (0.00080) [2022-07-10 01:36:32,290][26022] Updated weights on worker 0-0, policy_version 510183 (0.00090) [2022-07-10 01:36:32,611][25689] Fps is (10 sec: 5809.2, 60 sec: 5701.0, 300 sec: 5683.5). Total num frames: 522428416. Throughput: 0: 5111.9. Samples: 522421274. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:32,613][25689] Avg episode reward: [(0, '-34.817')] [2022-07-10 01:36:34,296][26022] Updated weights on worker 0-0, policy_version 510193 (0.00091) [2022-07-10 01:36:36,034][26022] Updated weights on worker 0-0, policy_version 510203 (0.00088) [2022-07-10 01:36:37,725][25689] Fps is (10 sec: 5661.2, 60 sec: 5677.5, 300 sec: 5678.0). Total num frames: 522456064. Throughput: 0: 5936.6. Samples: 522455550. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:37,726][25689] Avg episode reward: [(0, '-34.807')] [2022-07-10 01:36:37,943][26022] Updated weights on worker 0-0, policy_version 510213 (0.00087) [2022-07-10 01:36:39,648][26022] Updated weights on worker 0-0, policy_version 510223 (0.00087) [2022-07-10 01:36:41,455][26022] Updated weights on worker 0-0, policy_version 510233 (0.00097) [2022-07-10 01:36:42,756][25689] Fps is (10 sec: 5652.9, 60 sec: 5676.8, 300 sec: 5681.2). Total num frames: 522485760. Throughput: 0: 5949.7. Samples: 522490092. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:42,756][25689] Avg episode reward: [(0, '-35.712')] [2022-07-10 01:36:43,141][26022] Updated weights on worker 0-0, policy_version 510243 (0.00088) [2022-07-10 01:36:45,029][26022] Updated weights on worker 0-0, policy_version 510253 (0.00086) [2022-07-10 01:36:46,649][26022] Updated weights on worker 0-0, policy_version 510263 (0.00094) [2022-07-10 01:36:47,816][25689] Fps is (10 sec: 5784.6, 60 sec: 5689.2, 300 sec: 5677.5). Total num frames: 522514432. Throughput: 0: 5113.8. Samples: 522507596. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:47,816][25689] Avg episode reward: [(0, '-36.340')] [2022-07-10 01:36:48,671][26022] Updated weights on worker 0-0, policy_version 510273 (0.00098) [2022-07-10 01:36:50,309][26022] Updated weights on worker 0-0, policy_version 510283 (0.00088) [2022-07-10 01:36:52,082][26022] Updated weights on worker 0-0, policy_version 510293 (0.00084) [2022-07-10 01:36:52,831][25689] Fps is (10 sec: 5793.3, 60 sec: 5673.3, 300 sec: 5681.5). Total num frames: 522544128. Throughput: 0: 5975.8. Samples: 522542100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:52,831][25689] Avg episode reward: [(0, '-36.426')] [2022-07-10 01:36:53,847][26022] Updated weights on worker 0-0, policy_version 510303 (0.00094) [2022-07-10 01:36:55,601][26022] Updated weights on worker 0-0, policy_version 510313 (0.00087) [2022-07-10 01:36:57,480][26022] Updated weights on worker 0-0, policy_version 510323 (0.00088) [2022-07-10 01:36:57,947][25689] Fps is (10 sec: 5761.1, 60 sec: 5701.2, 300 sec: 5676.7). Total num frames: 522572800. Throughput: 0: 5995.1. Samples: 522576782. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:36:57,948][25689] Avg episode reward: [(0, '-36.037')] [2022-07-10 01:36:59,195][26022] Updated weights on worker 0-0, policy_version 510333 (0.00094) [2022-07-10 01:37:01,031][26022] Updated weights on worker 0-0, policy_version 510343 (0.00089) [2022-07-10 01:37:03,018][25689] Fps is (10 sec: 5427.9, 60 sec: 5662.8, 300 sec: 5679.4). Total num frames: 522599424. Throughput: 0: 5885.2. Samples: 522609340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:37:03,019][25689] Avg episode reward: [(0, '-34.704')] [2022-07-10 01:37:03,218][26022] Updated weights on worker 0-0, policy_version 510353 (0.00092) [2022-07-10 01:37:04,846][26022] Updated weights on worker 0-0, policy_version 510363 (0.00087) [2022-07-10 01:37:06,845][26022] Updated weights on worker 0-0, policy_version 510373 (0.00085) [2022-07-10 01:37:08,045][25689] Fps is (10 sec: 5678.8, 60 sec: 5732.2, 300 sec: 5682.9). Total num frames: 522630144. Throughput: 0: 5879.9. Samples: 522626542. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 01:37:08,046][25689] Avg episode reward: [(0, '-33.133')] [2022-07-10 01:37:08,399][26022] Updated weights on worker 0-0, policy_version 510383 (0.00098) [2022-07-10 01:37:10,331][26022] Updated weights on worker 0-0, policy_version 510393 (0.00095) [2022-07-10 01:37:12,117][26022] Updated weights on worker 0-0, policy_version 510403 (0.00089) [2022-07-10 01:37:13,100][25689] Fps is (10 sec: 5789.6, 60 sec: 5677.3, 300 sec: 5679.5). Total num frames: 522657792. Throughput: 0: 5866.5. Samples: 522661008. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:13,100][25689] Avg episode reward: [(0, '-32.251')] [2022-07-10 01:37:13,988][26022] Updated weights on worker 0-0, policy_version 510413 (0.00097) [2022-07-10 01:37:15,804][26022] Updated weights on worker 0-0, policy_version 510423 (0.00087) [2022-07-10 01:37:17,534][26022] Updated weights on worker 0-0, policy_version 510433 (0.00094) [2022-07-10 01:37:18,167][25689] Fps is (10 sec: 5665.4, 60 sec: 5693.2, 300 sec: 5685.3). Total num frames: 522687488. Throughput: 0: 5855.0. Samples: 522695168. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:18,168][25689] Avg episode reward: [(0, '-32.015')] [2022-07-10 01:37:19,481][26022] Updated weights on worker 0-0, policy_version 510443 (0.00089) [2022-07-10 01:37:21,095][26022] Updated weights on worker 0-0, policy_version 510453 (0.00088) [2022-07-10 01:37:22,794][26022] Updated weights on worker 0-0, policy_version 510463 (0.00100) [2022-07-10 01:37:23,203][25689] Fps is (10 sec: 5675.9, 60 sec: 5690.9, 300 sec: 5681.7). Total num frames: 522715136. Throughput: 0: 5103.9. Samples: 522712360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:23,203][25689] Avg episode reward: [(0, '-32.207')] [2022-07-10 01:37:24,560][26022] Updated weights on worker 0-0, policy_version 510473 (0.00085) [2022-07-10 01:37:26,675][26022] Updated weights on worker 0-0, policy_version 510483 (0.00088) [2022-07-10 01:37:28,223][25689] Fps is (10 sec: 5600.8, 60 sec: 5692.0, 300 sec: 5688.5). Total num frames: 522743808. Throughput: 0: 5938.3. Samples: 522746362. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:28,224][25689] Avg episode reward: [(0, '-32.850')] [2022-07-10 01:37:28,271][26022] Updated weights on worker 0-0, policy_version 510493 (0.00436) [2022-07-10 01:37:29,980][26022] Updated weights on worker 0-0, policy_version 510503 (0.00090) [2022-07-10 01:37:32,027][26022] Updated weights on worker 0-0, policy_version 510513 (0.00083) [2022-07-10 01:37:33,324][25689] Fps is (10 sec: 5766.6, 60 sec: 5683.9, 300 sec: 5684.2). Total num frames: 522773504. Throughput: 0: 5912.1. Samples: 522780578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:33,325][25689] Avg episode reward: [(0, '-33.922')] [2022-07-10 01:37:33,847][26022] Updated weights on worker 0-0, policy_version 510523 (0.00092) [2022-07-10 01:37:35,620][26022] Updated weights on worker 0-0, policy_version 510533 (0.00088) [2022-07-10 01:37:37,331][26022] Updated weights on worker 0-0, policy_version 510543 (0.00088) [2022-07-10 01:37:38,379][25689] Fps is (10 sec: 5545.1, 60 sec: 5672.6, 300 sec: 5683.3). Total num frames: 522800128. Throughput: 0: 5073.9. Samples: 522797724. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:38,380][25689] Avg episode reward: [(0, '-33.392')] [2022-07-10 01:37:39,060][26022] Updated weights on worker 0-0, policy_version 510553 (0.00085) [2022-07-10 01:37:40,830][26022] Updated weights on worker 0-0, policy_version 510563 (0.00090) [2022-07-10 01:37:42,684][26022] Updated weights on worker 0-0, policy_version 510573 (0.00089) [2022-07-10 01:37:43,397][25689] Fps is (10 sec: 5591.2, 60 sec: 5673.7, 300 sec: 5687.0). Total num frames: 522829824. Throughput: 0: 5930.3. Samples: 522832120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:43,398][25689] Avg episode reward: [(0, '-33.489')] [2022-07-10 01:37:44,447][26022] Updated weights on worker 0-0, policy_version 510583 (0.00087) [2022-07-10 01:37:46,393][26022] Updated weights on worker 0-0, policy_version 510593 (0.00090) [2022-07-10 01:37:47,937][26022] Updated weights on worker 0-0, policy_version 510603 (0.00090) [2022-07-10 01:37:48,407][25689] Fps is (10 sec: 5923.1, 60 sec: 5695.4, 300 sec: 5686.9). Total num frames: 522859520. Throughput: 0: 5944.7. Samples: 522866350. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:48,407][25689] Avg episode reward: [(0, '-34.242')] [2022-07-10 01:37:48,551][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:37:48,565][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000510606_522860544.pth [2022-07-10 01:37:48,565][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000508607_520813568.pth [2022-07-10 01:37:49,898][26022] Updated weights on worker 0-0, policy_version 510613 (0.00086) [2022-07-10 01:37:51,619][26022] Updated weights on worker 0-0, policy_version 510623 (0.00089) [2022-07-10 01:37:53,424][25689] Fps is (10 sec: 5719.1, 60 sec: 5661.3, 300 sec: 5690.8). Total num frames: 522887168. Throughput: 0: 5109.0. Samples: 522883268. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:53,425][25689] Avg episode reward: [(0, '-35.288')] [2022-07-10 01:37:53,610][26022] Updated weights on worker 0-0, policy_version 510633 (0.00097) [2022-07-10 01:37:55,348][26022] Updated weights on worker 0-0, policy_version 510643 (0.00092) [2022-07-10 01:37:57,130][26022] Updated weights on worker 0-0, policy_version 510653 (0.00091) [2022-07-10 01:37:58,494][25689] Fps is (10 sec: 5583.2, 60 sec: 5665.7, 300 sec: 5686.3). Total num frames: 522915840. Throughput: 0: 5946.6. Samples: 522917338. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:37:58,495][25689] Avg episode reward: [(0, '-35.395')] [2022-07-10 01:37:58,808][26022] Updated weights on worker 0-0, policy_version 510663 (0.00091) [2022-07-10 01:38:00,747][26022] Updated weights on worker 0-0, policy_version 510673 (0.00090) [2022-07-10 01:38:02,774][26022] Updated weights on worker 0-0, policy_version 510683 (0.00082) [2022-07-10 01:38:03,499][25689] Fps is (10 sec: 5387.0, 60 sec: 5655.0, 300 sec: 5683.1). Total num frames: 522941440. Throughput: 0: 5830.0. Samples: 522949312. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:03,499][25689] Avg episode reward: [(0, '-36.050')] [2022-07-10 01:38:04,618][26022] Updated weights on worker 0-0, policy_version 510693 (0.00082) [2022-07-10 01:38:06,479][26022] Updated weights on worker 0-0, policy_version 510703 (0.00089) [2022-07-10 01:38:08,324][26022] Updated weights on worker 0-0, policy_version 510713 (0.00094) [2022-07-10 01:38:08,515][25689] Fps is (10 sec: 5517.8, 60 sec: 5639.0, 300 sec: 5690.2). Total num frames: 522971136. Throughput: 0: 4981.3. Samples: 522966518. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:08,516][25689] Avg episode reward: [(0, '-37.537')] [2022-07-10 01:38:10,210][26022] Updated weights on worker 0-0, policy_version 510723 (0.00103) [2022-07-10 01:38:11,880][26022] Updated weights on worker 0-0, policy_version 510733 (0.00091) [2022-07-10 01:38:13,540][25689] Fps is (10 sec: 5812.6, 60 sec: 5658.7, 300 sec: 5688.5). Total num frames: 522999808. Throughput: 0: 5834.5. Samples: 523000636. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:13,543][25689] Avg episode reward: [(0, '-36.142')] [2022-07-10 01:38:13,772][26022] Updated weights on worker 0-0, policy_version 510743 (0.00086) [2022-07-10 01:38:15,510][26022] Updated weights on worker 0-0, policy_version 510753 (0.00089) [2022-07-10 01:38:17,232][26022] Updated weights on worker 0-0, policy_version 510763 (0.00088) [2022-07-10 01:38:18,598][25689] Fps is (10 sec: 5687.2, 60 sec: 5642.6, 300 sec: 5691.0). Total num frames: 523028480. Throughput: 0: 5853.4. Samples: 523035018. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:18,599][25689] Avg episode reward: [(0, '-35.145')] [2022-07-10 01:38:18,903][26022] Updated weights on worker 0-0, policy_version 510773 (0.00091) [2022-07-10 01:38:20,731][26022] Updated weights on worker 0-0, policy_version 510783 (0.00088) [2022-07-10 01:38:22,650][26022] Updated weights on worker 0-0, policy_version 510793 (0.00085) [2022-07-10 01:38:23,605][25689] Fps is (10 sec: 5697.3, 60 sec: 5662.3, 300 sec: 5687.8). Total num frames: 523057152. Throughput: 0: 5127.6. Samples: 523052412. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:23,606][25689] Avg episode reward: [(0, '-35.006')] [2022-07-10 01:38:24,288][26022] Updated weights on worker 0-0, policy_version 510803 (0.00089) [2022-07-10 01:38:26,360][26022] Updated weights on worker 0-0, policy_version 510813 (0.00623) [2022-07-10 01:38:27,884][26022] Updated weights on worker 0-0, policy_version 510823 (0.00089) [2022-07-10 01:38:28,623][25689] Fps is (10 sec: 5720.5, 60 sec: 5662.5, 300 sec: 5684.8). Total num frames: 523085824. Throughput: 0: 5988.8. Samples: 523086936. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:28,623][25689] Avg episode reward: [(0, '-34.702')] [2022-07-10 01:38:29,840][26022] Updated weights on worker 0-0, policy_version 510833 (0.00087) [2022-07-10 01:38:31,587][26022] Updated weights on worker 0-0, policy_version 510843 (0.00085) [2022-07-10 01:38:33,447][26022] Updated weights on worker 0-0, policy_version 510853 (0.00087) [2022-07-10 01:38:33,650][25689] Fps is (10 sec: 5709.1, 60 sec: 5652.5, 300 sec: 5688.6). Total num frames: 523114496. Throughput: 0: 5988.5. Samples: 523121062. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:33,650][25689] Avg episode reward: [(0, '-33.489')] [2022-07-10 01:38:35,086][26022] Updated weights on worker 0-0, policy_version 510863 (0.00091) [2022-07-10 01:38:37,111][26022] Updated weights on worker 0-0, policy_version 510873 (0.00084) [2022-07-10 01:38:38,754][25689] Fps is (10 sec: 5660.2, 60 sec: 5681.9, 300 sec: 5683.7). Total num frames: 523143168. Throughput: 0: 5122.8. Samples: 523138270. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:38,754][25689] Avg episode reward: [(0, '-32.928')] [2022-07-10 01:38:38,795][26022] Updated weights on worker 0-0, policy_version 510883 (0.00086) [2022-07-10 01:38:40,599][26022] Updated weights on worker 0-0, policy_version 510893 (0.00089) [2022-07-10 01:38:42,561][26022] Updated weights on worker 0-0, policy_version 510903 (0.00086) [2022-07-10 01:38:43,780][25689] Fps is (10 sec: 5660.9, 60 sec: 5664.2, 300 sec: 5686.8). Total num frames: 523171840. Throughput: 0: 5958.9. Samples: 523172628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:43,780][25689] Avg episode reward: [(0, '-32.290')] [2022-07-10 01:38:44,133][26022] Updated weights on worker 0-0, policy_version 510913 (0.00087) [2022-07-10 01:38:46,177][26022] Updated weights on worker 0-0, policy_version 510923 (0.00088) [2022-07-10 01:38:47,620][26022] Updated weights on worker 0-0, policy_version 510933 (0.00105) [2022-07-10 01:38:48,869][25689] Fps is (10 sec: 5668.8, 60 sec: 5639.7, 300 sec: 5689.1). Total num frames: 523200512. Throughput: 0: 5920.5. Samples: 523206806. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:48,870][25689] Avg episode reward: [(0, '-31.989')] [2022-07-10 01:38:49,563][26022] Updated weights on worker 0-0, policy_version 510943 (0.00086) [2022-07-10 01:38:51,267][26022] Updated weights on worker 0-0, policy_version 510953 (0.00099) [2022-07-10 01:38:53,243][26022] Updated weights on worker 0-0, policy_version 510963 (0.00084) [2022-07-10 01:38:53,922][25689] Fps is (10 sec: 5754.9, 60 sec: 5670.3, 300 sec: 5686.4). Total num frames: 523230208. Throughput: 0: 5068.4. Samples: 523223810. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:53,922][25689] Avg episode reward: [(0, '-31.857')] [2022-07-10 01:38:54,825][26022] Updated weights on worker 0-0, policy_version 510973 (0.00088) [2022-07-10 01:38:56,771][26022] Updated weights on worker 0-0, policy_version 510983 (0.00086) [2022-07-10 01:38:58,243][26022] Updated weights on worker 0-0, policy_version 510993 (0.00091) [2022-07-10 01:38:58,976][25689] Fps is (10 sec: 5876.6, 60 sec: 5688.7, 300 sec: 5695.9). Total num frames: 523259904. Throughput: 0: 5938.2. Samples: 523258354. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:38:58,976][25689] Avg episode reward: [(0, '-32.424')] [2022-07-10 01:39:00,380][26022] Updated weights on worker 0-0, policy_version 511003 (0.00091) [2022-07-10 01:39:02,265][26022] Updated weights on worker 0-0, policy_version 511013 (0.00084) [2022-07-10 01:39:04,051][25689] Fps is (10 sec: 5560.4, 60 sec: 5699.0, 300 sec: 5691.3). Total num frames: 523286528. Throughput: 0: 5830.0. Samples: 523290808. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:04,051][25689] Avg episode reward: [(0, '-30.882')] [2022-07-10 01:39:04,303][26022] Updated weights on worker 0-0, policy_version 511023 (0.00085) [2022-07-10 01:39:05,928][26022] Updated weights on worker 0-0, policy_version 511033 (0.00086) [2022-07-10 01:39:07,818][26022] Updated weights on worker 0-0, policy_version 511043 (0.00086) [2022-07-10 01:39:09,078][25689] Fps is (10 sec: 5372.5, 60 sec: 5664.3, 300 sec: 5684.0). Total num frames: 523314176. Throughput: 0: 5002.6. Samples: 523307900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:09,078][25689] Avg episode reward: [(0, '-31.138')] [2022-07-10 01:39:09,380][26022] Updated weights on worker 0-0, policy_version 511053 (0.00084) [2022-07-10 01:39:11,399][26022] Updated weights on worker 0-0, policy_version 511063 (0.00087) [2022-07-10 01:39:13,067][26022] Updated weights on worker 0-0, policy_version 511073 (0.00083) [2022-07-10 01:39:14,156][25689] Fps is (10 sec: 5776.0, 60 sec: 5693.1, 300 sec: 5691.0). Total num frames: 523344896. Throughput: 0: 5861.4. Samples: 523342410. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:14,156][25689] Avg episode reward: [(0, '-32.072')] [2022-07-10 01:39:14,953][26022] Updated weights on worker 0-0, policy_version 511083 (0.00083) [2022-07-10 01:39:16,734][26022] Updated weights on worker 0-0, policy_version 511093 (0.00247) [2022-07-10 01:39:18,834][26022] Updated weights on worker 0-0, policy_version 511103 (0.00090) [2022-07-10 01:39:19,195][25689] Fps is (10 sec: 5667.8, 60 sec: 5661.0, 300 sec: 5681.2). Total num frames: 523371520. Throughput: 0: 5858.0. Samples: 523376798. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:19,196][25689] Avg episode reward: [(0, '-31.650')] [2022-07-10 01:39:20,101][26022] Updated weights on worker 0-0, policy_version 511113 (0.00084) [2022-07-10 01:39:22,363][26022] Updated weights on worker 0-0, policy_version 511123 (0.00086) [2022-07-10 01:39:23,719][26022] Updated weights on worker 0-0, policy_version 511133 (0.00088) [2022-07-10 01:39:24,275][25689] Fps is (10 sec: 5666.7, 60 sec: 5688.0, 300 sec: 5691.5). Total num frames: 523402240. Throughput: 0: 5944.1. Samples: 523411026. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:24,276][25689] Avg episode reward: [(0, '-32.523')] [2022-07-10 01:39:25,679][26022] Updated weights on worker 0-0, policy_version 511143 (0.00089) [2022-07-10 01:39:27,612][26022] Updated weights on worker 0-0, policy_version 511153 (0.00093) [2022-07-10 01:39:29,286][25689] Fps is (10 sec: 5885.6, 60 sec: 5688.6, 300 sec: 5688.7). Total num frames: 523430912. Throughput: 0: 5948.7. Samples: 523428114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:29,287][25689] Avg episode reward: [(0, '-31.869')] [2022-07-10 01:39:29,287][26022] Updated weights on worker 0-0, policy_version 511163 (0.00086) [2022-07-10 01:39:31,316][26022] Updated weights on worker 0-0, policy_version 511173 (0.00086) [2022-07-10 01:39:32,947][26022] Updated weights on worker 0-0, policy_version 511183 (0.00086) [2022-07-10 01:39:34,327][25689] Fps is (10 sec: 5603.2, 60 sec: 5670.4, 300 sec: 5682.2). Total num frames: 523458560. Throughput: 0: 5938.3. Samples: 523462190. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:34,328][25689] Avg episode reward: [(0, '-32.329')] [2022-07-10 01:39:34,604][26022] Updated weights on worker 0-0, policy_version 511193 (0.00085) [2022-07-10 01:39:36,596][26022] Updated weights on worker 0-0, policy_version 511203 (0.00090) [2022-07-10 01:39:38,174][26022] Updated weights on worker 0-0, policy_version 511213 (0.00095) [2022-07-10 01:39:39,383][25689] Fps is (10 sec: 5578.1, 60 sec: 5674.9, 300 sec: 5682.9). Total num frames: 523487232. Throughput: 0: 5933.0. Samples: 523496572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:39,383][25689] Avg episode reward: [(0, '-30.948')] [2022-07-10 01:39:40,241][26022] Updated weights on worker 0-0, policy_version 511223 (0.00087) [2022-07-10 01:39:41,707][26022] Updated weights on worker 0-0, policy_version 511233 (0.00087) [2022-07-10 01:39:43,708][26022] Updated weights on worker 0-0, policy_version 511243 (0.00090) [2022-07-10 01:39:44,432][25689] Fps is (10 sec: 5776.0, 60 sec: 5689.6, 300 sec: 5687.1). Total num frames: 523516928. Throughput: 0: 5101.0. Samples: 523513846. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:44,433][25689] Avg episode reward: [(0, '-30.431')] [2022-07-10 01:39:45,391][26022] Updated weights on worker 0-0, policy_version 511253 (0.00095) [2022-07-10 01:39:47,295][26022] Updated weights on worker 0-0, policy_version 511263 (0.00083) [2022-07-10 01:39:48,603][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:39:48,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000511270_523540480.pth [2022-07-10 01:39:48,619][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000509270_521492480.pth [2022-07-10 01:39:49,109][26022] Updated weights on worker 0-0, policy_version 511273 (0.00087) [2022-07-10 01:39:49,451][25689] Fps is (10 sec: 5797.3, 60 sec: 5696.3, 300 sec: 5680.1). Total num frames: 523545600. Throughput: 0: 5953.9. Samples: 523548174. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:49,451][25689] Avg episode reward: [(0, '-30.214')] [2022-07-10 01:39:50,815][26022] Updated weights on worker 0-0, policy_version 511283 (0.00083) [2022-07-10 01:39:52,698][26022] Updated weights on worker 0-0, policy_version 511293 (0.00054) [2022-07-10 01:39:54,457][25689] Fps is (10 sec: 5617.7, 60 sec: 5666.7, 300 sec: 5684.2). Total num frames: 523573248. Throughput: 0: 5985.0. Samples: 523582672. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:54,458][25689] Avg episode reward: [(0, '-30.347')] [2022-07-10 01:39:54,526][26022] Updated weights on worker 0-0, policy_version 511303 (0.00088) [2022-07-10 01:39:56,327][26022] Updated weights on worker 0-0, policy_version 511313 (0.00085) [2022-07-10 01:39:58,046][26022] Updated weights on worker 0-0, policy_version 511323 (0.00090) [2022-07-10 01:39:59,518][25689] Fps is (10 sec: 5594.3, 60 sec: 5649.2, 300 sec: 5680.9). Total num frames: 523601920. Throughput: 0: 5119.7. Samples: 523599660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:39:59,519][25689] Avg episode reward: [(0, '-30.052')] [2022-07-10 01:39:59,837][26022] Updated weights on worker 0-0, policy_version 511333 (0.00093) [2022-07-10 01:40:01,924][26022] Updated weights on worker 0-0, policy_version 511343 (0.00091) [2022-07-10 01:40:03,767][26022] Updated weights on worker 0-0, policy_version 511353 (0.00092) [2022-07-10 01:40:04,546][25689] Fps is (10 sec: 5582.4, 60 sec: 5670.5, 300 sec: 5684.1). Total num frames: 523629568. Throughput: 0: 5872.3. Samples: 523631964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 01:40:04,547][25689] Avg episode reward: [(0, '-29.587')] [2022-07-10 01:40:05,572][26022] Updated weights on worker 0-0, policy_version 511363 (0.00476) [2022-07-10 01:40:07,205][26022] Updated weights on worker 0-0, policy_version 511373 (0.00084) [2022-07-10 01:40:09,190][26022] Updated weights on worker 0-0, policy_version 511383 (0.00085) [2022-07-10 01:40:09,565][25689] Fps is (10 sec: 5606.1, 60 sec: 5688.2, 300 sec: 5684.4). Total num frames: 523658240. Throughput: 0: 5881.7. Samples: 523666476. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:09,565][25689] Avg episode reward: [(0, '-31.411')] [2022-07-10 01:40:11,142][26022] Updated weights on worker 0-0, policy_version 511393 (0.00084) [2022-07-10 01:40:12,682][26022] Updated weights on worker 0-0, policy_version 511403 (0.00088) [2022-07-10 01:40:14,529][26022] Updated weights on worker 0-0, policy_version 511413 (0.00083) [2022-07-10 01:40:14,582][25689] Fps is (10 sec: 5714.3, 60 sec: 5660.1, 300 sec: 5681.5). Total num frames: 523686912. Throughput: 0: 5019.9. Samples: 523683694. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:14,582][25689] Avg episode reward: [(0, '-31.851')] [2022-07-10 01:40:16,347][26022] Updated weights on worker 0-0, policy_version 511423 (0.00081) [2022-07-10 01:40:18,158][26022] Updated weights on worker 0-0, policy_version 511433 (0.00089) [2022-07-10 01:40:19,646][25689] Fps is (10 sec: 5790.0, 60 sec: 5708.6, 300 sec: 5683.8). Total num frames: 523716608. Throughput: 0: 5884.2. Samples: 523718092. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:19,646][25689] Avg episode reward: [(0, '-31.474')] [2022-07-10 01:40:20,007][26022] Updated weights on worker 0-0, policy_version 511443 (0.00089) [2022-07-10 01:40:21,552][26022] Updated weights on worker 0-0, policy_version 511453 (0.00088) [2022-07-10 01:40:23,690][26022] Updated weights on worker 0-0, policy_version 511463 (0.00091) [2022-07-10 01:40:24,738][25689] Fps is (10 sec: 5847.8, 60 sec: 5690.5, 300 sec: 5682.6). Total num frames: 523746304. Throughput: 0: 5972.2. Samples: 523752552. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:24,738][25689] Avg episode reward: [(0, '-31.726')] [2022-07-10 01:40:25,176][26022] Updated weights on worker 0-0, policy_version 511473 (0.00097) [2022-07-10 01:40:27,072][26022] Updated weights on worker 0-0, policy_version 511483 (0.00051) [2022-07-10 01:40:28,866][26022] Updated weights on worker 0-0, policy_version 511493 (0.00086) [2022-07-10 01:40:29,807][25689] Fps is (10 sec: 5542.4, 60 sec: 5651.2, 300 sec: 5675.0). Total num frames: 523772928. Throughput: 0: 5091.0. Samples: 523769530. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:29,808][25689] Avg episode reward: [(0, '-32.693')] [2022-07-10 01:40:30,580][26022] Updated weights on worker 0-0, policy_version 511503 (0.00087) [2022-07-10 01:40:32,470][26022] Updated weights on worker 0-0, policy_version 511513 (0.00088) [2022-07-10 01:40:34,295][26022] Updated weights on worker 0-0, policy_version 511523 (0.00118) [2022-07-10 01:40:34,843][25689] Fps is (10 sec: 5573.7, 60 sec: 5685.5, 300 sec: 5678.6). Total num frames: 523802624. Throughput: 0: 5912.1. Samples: 523803478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:34,843][25689] Avg episode reward: [(0, '-32.452')] [2022-07-10 01:40:36,072][26022] Updated weights on worker 0-0, policy_version 511533 (0.00094) [2022-07-10 01:40:37,955][26022] Updated weights on worker 0-0, policy_version 511543 (0.00087) [2022-07-10 01:40:39,719][26022] Updated weights on worker 0-0, policy_version 511553 (0.00093) [2022-07-10 01:40:39,950][25689] Fps is (10 sec: 5653.4, 60 sec: 5663.8, 300 sec: 5670.1). Total num frames: 523830272. Throughput: 0: 5893.6. Samples: 523837760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:39,951][25689] Avg episode reward: [(0, '-32.876')] [2022-07-10 01:40:41,472][26022] Updated weights on worker 0-0, policy_version 511563 (0.00081) [2022-07-10 01:40:43,460][26022] Updated weights on worker 0-0, policy_version 511573 (0.00086) [2022-07-10 01:40:44,922][26022] Updated weights on worker 0-0, policy_version 511583 (0.00056) [2022-07-10 01:40:45,011][25689] Fps is (10 sec: 5739.9, 60 sec: 5679.6, 300 sec: 5679.5). Total num frames: 523860992. Throughput: 0: 5051.2. Samples: 523854956. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:45,012][25689] Avg episode reward: [(0, '-33.287')] [2022-07-10 01:40:46,999][26022] Updated weights on worker 0-0, policy_version 511593 (0.00083) [2022-07-10 01:40:48,684][26022] Updated weights on worker 0-0, policy_version 511603 (0.00100) [2022-07-10 01:40:50,058][25689] Fps is (10 sec: 5774.5, 60 sec: 5660.0, 300 sec: 5668.8). Total num frames: 523888640. Throughput: 0: 5911.7. Samples: 523889248. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:50,059][25689] Avg episode reward: [(0, '-33.281')] [2022-07-10 01:40:50,517][26022] Updated weights on worker 0-0, policy_version 511613 (0.00092) [2022-07-10 01:40:52,375][26022] Updated weights on worker 0-0, policy_version 511623 (0.00092) [2022-07-10 01:40:54,222][26022] Updated weights on worker 0-0, policy_version 511633 (0.00081) [2022-07-10 01:40:55,061][25689] Fps is (10 sec: 5604.1, 60 sec: 5677.3, 300 sec: 5676.6). Total num frames: 523917312. Throughput: 0: 5924.8. Samples: 523923268. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:40:55,062][25689] Avg episode reward: [(0, '-33.301')] [2022-07-10 01:40:55,921][26022] Updated weights on worker 0-0, policy_version 511643 (0.00085) [2022-07-10 01:40:57,950][26022] Updated weights on worker 0-0, policy_version 511653 (0.00087) [2022-07-10 01:40:59,348][26022] Updated weights on worker 0-0, policy_version 511663 (0.00086) [2022-07-10 01:41:00,106][25689] Fps is (10 sec: 5809.1, 60 sec: 5695.7, 300 sec: 5679.6). Total num frames: 523947008. Throughput: 0: 5096.4. Samples: 523940480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:00,107][25689] Avg episode reward: [(0, '-33.178')] [2022-07-10 01:41:01,329][26022] Updated weights on worker 0-0, policy_version 511673 (0.00082) [2022-07-10 01:41:03,163][26022] Updated weights on worker 0-0, policy_version 511683 (0.00085) [2022-07-10 01:41:05,091][26022] Updated weights on worker 0-0, policy_version 511693 (0.00084) [2022-07-10 01:41:05,115][25689] Fps is (10 sec: 5601.8, 60 sec: 5680.6, 300 sec: 5680.3). Total num frames: 523973632. Throughput: 0: 5879.6. Samples: 523973158. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:05,115][25689] Avg episode reward: [(0, '-33.798')] [2022-07-10 01:41:06,957][26022] Updated weights on worker 0-0, policy_version 511703 (0.00091) [2022-07-10 01:41:08,742][26022] Updated weights on worker 0-0, policy_version 511713 (0.00085) [2022-07-10 01:41:10,121][25689] Fps is (10 sec: 5418.8, 60 sec: 5664.8, 300 sec: 5670.1). Total num frames: 524001280. Throughput: 0: 5893.4. Samples: 524007488. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:10,122][25689] Avg episode reward: [(0, '-34.750')] [2022-07-10 01:41:10,482][26022] Updated weights on worker 0-0, policy_version 511723 (0.00093) [2022-07-10 01:41:12,281][26022] Updated weights on worker 0-0, policy_version 511733 (0.00094) [2022-07-10 01:41:14,101][26022] Updated weights on worker 0-0, policy_version 511743 (0.00083) [2022-07-10 01:41:15,133][25689] Fps is (10 sec: 5723.8, 60 sec: 5682.1, 300 sec: 5674.4). Total num frames: 524030976. Throughput: 0: 5047.6. Samples: 524024586. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:15,135][25689] Avg episode reward: [(0, '-33.709')] [2022-07-10 01:41:15,892][26022] Updated weights on worker 0-0, policy_version 511753 (0.00093) [2022-07-10 01:41:17,666][26022] Updated weights on worker 0-0, policy_version 511763 (0.00079) [2022-07-10 01:41:19,556][26022] Updated weights on worker 0-0, policy_version 511773 (0.00090) [2022-07-10 01:41:20,177][25689] Fps is (10 sec: 5702.7, 60 sec: 5650.2, 300 sec: 5673.7). Total num frames: 524058624. Throughput: 0: 5903.1. Samples: 524058962. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:20,178][25689] Avg episode reward: [(0, '-35.010')] [2022-07-10 01:41:21,365][26022] Updated weights on worker 0-0, policy_version 511783 (0.00087) [2022-07-10 01:41:23,067][26022] Updated weights on worker 0-0, policy_version 511793 (0.00083) [2022-07-10 01:41:24,897][26022] Updated weights on worker 0-0, policy_version 511803 (0.00088) [2022-07-10 01:41:25,191][25689] Fps is (10 sec: 5599.8, 60 sec: 5640.6, 300 sec: 5674.1). Total num frames: 524087296. Throughput: 0: 5977.2. Samples: 524093156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:25,193][25689] Avg episode reward: [(0, '-34.232')] [2022-07-10 01:41:26,911][26022] Updated weights on worker 0-0, policy_version 511813 (0.00084) [2022-07-10 01:41:28,412][26022] Updated weights on worker 0-0, policy_version 511823 (0.00090) [2022-07-10 01:41:30,200][25689] Fps is (10 sec: 5619.2, 60 sec: 5663.2, 300 sec: 5667.3). Total num frames: 524114944. Throughput: 0: 5113.7. Samples: 524110164. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:30,201][25689] Avg episode reward: [(0, '-33.249')] [2022-07-10 01:41:30,467][26022] Updated weights on worker 0-0, policy_version 511833 (0.00090) [2022-07-10 01:41:32,039][26022] Updated weights on worker 0-0, policy_version 511843 (0.00086) [2022-07-10 01:41:33,894][26022] Updated weights on worker 0-0, policy_version 511853 (0.00084) [2022-07-10 01:41:35,231][25689] Fps is (10 sec: 5711.8, 60 sec: 5663.6, 300 sec: 5675.8). Total num frames: 524144640. Throughput: 0: 5965.5. Samples: 524144476. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:35,232][25689] Avg episode reward: [(0, '-33.162')] [2022-07-10 01:41:35,749][26022] Updated weights on worker 0-0, policy_version 511863 (0.00085) [2022-07-10 01:41:37,343][26022] Updated weights on worker 0-0, policy_version 511873 (0.00939) [2022-07-10 01:41:39,363][26022] Updated weights on worker 0-0, policy_version 511883 (0.00090) [2022-07-10 01:41:40,297][25689] Fps is (10 sec: 5780.8, 60 sec: 5684.5, 300 sec: 5671.7). Total num frames: 524173312. Throughput: 0: 5964.6. Samples: 524178968. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:40,297][25689] Avg episode reward: [(0, '-32.296')] [2022-07-10 01:41:40,867][26022] Updated weights on worker 0-0, policy_version 511893 (0.00096) [2022-07-10 01:41:42,982][26022] Updated weights on worker 0-0, policy_version 511903 (0.00085) [2022-07-10 01:41:44,628][26022] Updated weights on worker 0-0, policy_version 511913 (0.00089) [2022-07-10 01:41:45,326][25689] Fps is (10 sec: 5781.8, 60 sec: 5670.5, 300 sec: 5675.7). Total num frames: 524203008. Throughput: 0: 5111.6. Samples: 524196078. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:45,326][25689] Avg episode reward: [(0, '-33.565')] [2022-07-10 01:41:46,531][26022] Updated weights on worker 0-0, policy_version 511923 (0.00086) [2022-07-10 01:41:48,142][26022] Updated weights on worker 0-0, policy_version 511933 (0.00092) [2022-07-10 01:41:48,641][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:41:48,651][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000511936_524222464.pth [2022-07-10 01:41:48,655][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000509939_522177536.pth [2022-07-10 01:41:50,186][26022] Updated weights on worker 0-0, policy_version 511943 (0.00082) [2022-07-10 01:41:50,331][25689] Fps is (10 sec: 5612.6, 60 sec: 5657.4, 300 sec: 5665.5). Total num frames: 524229632. Throughput: 0: 5975.5. Samples: 524230460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:50,332][25689] Avg episode reward: [(0, '-33.465')] [2022-07-10 01:41:51,863][26022] Updated weights on worker 0-0, policy_version 511953 (0.00088) [2022-07-10 01:41:53,620][26022] Updated weights on worker 0-0, policy_version 511963 (0.00085) [2022-07-10 01:41:55,283][26022] Updated weights on worker 0-0, policy_version 511973 (0.00088) [2022-07-10 01:41:55,350][25689] Fps is (10 sec: 5822.5, 60 sec: 5706.9, 300 sec: 5677.7). Total num frames: 524261376. Throughput: 0: 5979.8. Samples: 524264788. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:41:55,351][25689] Avg episode reward: [(0, '-33.331')] [2022-07-10 01:41:57,186][26022] Updated weights on worker 0-0, policy_version 511983 (0.00084) [2022-07-10 01:41:58,818][26022] Updated weights on worker 0-0, policy_version 511993 (0.00084) [2022-07-10 01:42:00,437][25689] Fps is (10 sec: 5877.2, 60 sec: 5669.0, 300 sec: 5680.9). Total num frames: 524289024. Throughput: 0: 5118.4. Samples: 524282052. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:00,437][25689] Avg episode reward: [(0, '-33.727')] [2022-07-10 01:42:00,726][26022] Updated weights on worker 0-0, policy_version 512003 (0.00051) [2022-07-10 01:42:02,791][26022] Updated weights on worker 0-0, policy_version 512013 (0.00090) [2022-07-10 01:42:04,733][26022] Updated weights on worker 0-0, policy_version 512023 (0.00054) [2022-07-10 01:42:05,523][25689] Fps is (10 sec: 5335.2, 60 sec: 5661.8, 300 sec: 5666.0). Total num frames: 524315648. Throughput: 0: 5856.5. Samples: 524314362. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:05,523][25689] Avg episode reward: [(0, '-32.907')] [2022-07-10 01:42:06,389][26022] Updated weights on worker 0-0, policy_version 512033 (0.00377) [2022-07-10 01:42:08,171][26022] Updated weights on worker 0-0, policy_version 512043 (0.00611) [2022-07-10 01:42:10,183][26022] Updated weights on worker 0-0, policy_version 512053 (0.00082) [2022-07-10 01:42:10,526][25689] Fps is (10 sec: 5480.7, 60 sec: 5679.0, 300 sec: 5670.4). Total num frames: 524344320. Throughput: 0: 5855.2. Samples: 524348704. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:10,527][25689] Avg episode reward: [(0, '-32.063')] [2022-07-10 01:42:11,849][26022] Updated weights on worker 0-0, policy_version 512063 (0.00085) [2022-07-10 01:42:13,718][26022] Updated weights on worker 0-0, policy_version 512073 (0.00085) [2022-07-10 01:42:15,363][26022] Updated weights on worker 0-0, policy_version 512083 (0.00089) [2022-07-10 01:42:15,539][25689] Fps is (10 sec: 5827.1, 60 sec: 5678.9, 300 sec: 5671.4). Total num frames: 524374016. Throughput: 0: 5014.3. Samples: 524366022. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:15,540][25689] Avg episode reward: [(0, '-31.821')] [2022-07-10 01:42:17,234][26022] Updated weights on worker 0-0, policy_version 512093 (0.00090) [2022-07-10 01:42:19,126][26022] Updated weights on worker 0-0, policy_version 512103 (0.00084) [2022-07-10 01:42:20,683][25689] Fps is (10 sec: 5746.7, 60 sec: 5686.5, 300 sec: 5672.8). Total num frames: 524402688. Throughput: 0: 5857.1. Samples: 524400634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:20,683][25689] Avg episode reward: [(0, '-31.622')] [2022-07-10 01:42:20,724][26022] Updated weights on worker 0-0, policy_version 512113 (0.00082) [2022-07-10 01:42:22,651][26022] Updated weights on worker 0-0, policy_version 512123 (0.00091) [2022-07-10 01:42:24,457][26022] Updated weights on worker 0-0, policy_version 512133 (0.00090) [2022-07-10 01:42:25,697][25689] Fps is (10 sec: 5645.2, 60 sec: 5686.4, 300 sec: 5672.9). Total num frames: 524431360. Throughput: 0: 5974.4. Samples: 524434894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:25,698][25689] Avg episode reward: [(0, '-32.795')] [2022-07-10 01:42:26,173][26022] Updated weights on worker 0-0, policy_version 512143 (0.00054) [2022-07-10 01:42:28,059][26022] Updated weights on worker 0-0, policy_version 512153 (0.00083) [2022-07-10 01:42:29,626][26022] Updated weights on worker 0-0, policy_version 512163 (0.00096) [2022-07-10 01:42:30,704][25689] Fps is (10 sec: 5722.3, 60 sec: 5703.6, 300 sec: 5671.3). Total num frames: 524460032. Throughput: 0: 5955.7. Samples: 524468878. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:30,705][25689] Avg episode reward: [(0, '-32.855')] [2022-07-10 01:42:31,819][26022] Updated weights on worker 0-0, policy_version 512173 (0.00080) [2022-07-10 01:42:33,221][26022] Updated weights on worker 0-0, policy_version 512183 (0.00086) [2022-07-10 01:42:35,343][26022] Updated weights on worker 0-0, policy_version 512193 (0.01064) [2022-07-10 01:42:35,744][25689] Fps is (10 sec: 5605.9, 60 sec: 5668.8, 300 sec: 5675.0). Total num frames: 524487680. Throughput: 0: 5949.6. Samples: 524486232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:35,746][25689] Avg episode reward: [(0, '-32.315')] [2022-07-10 01:42:37,000][26022] Updated weights on worker 0-0, policy_version 512203 (0.00087) [2022-07-10 01:42:38,851][26022] Updated weights on worker 0-0, policy_version 512213 (0.00085) [2022-07-10 01:42:40,361][26022] Updated weights on worker 0-0, policy_version 512223 (0.00080) [2022-07-10 01:42:40,839][25689] Fps is (10 sec: 5658.0, 60 sec: 5683.1, 300 sec: 5673.6). Total num frames: 524517376. Throughput: 0: 5959.2. Samples: 524520748. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:40,855][25689] Avg episode reward: [(0, '-32.358')] [2022-07-10 01:42:42,480][26022] Updated weights on worker 0-0, policy_version 512233 (0.00087) [2022-07-10 01:42:43,928][26022] Updated weights on worker 0-0, policy_version 512243 (0.00096) [2022-07-10 01:42:45,869][25689] Fps is (10 sec: 5663.7, 60 sec: 5649.1, 300 sec: 5666.3). Total num frames: 524545024. Throughput: 0: 5953.8. Samples: 524554990. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:45,869][25689] Avg episode reward: [(0, '-31.721')] [2022-07-10 01:42:46,101][26022] Updated weights on worker 0-0, policy_version 512253 (0.00627) [2022-07-10 01:42:47,576][26022] Updated weights on worker 0-0, policy_version 512263 (0.00059) [2022-07-10 01:42:49,432][26022] Updated weights on worker 0-0, policy_version 512273 (0.00086) [2022-07-10 01:42:50,876][25689] Fps is (10 sec: 5815.1, 60 sec: 5716.6, 300 sec: 5676.8). Total num frames: 524575744. Throughput: 0: 5139.6. Samples: 524572556. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:50,877][25689] Avg episode reward: [(0, '-31.259')] [2022-07-10 01:42:51,241][26022] Updated weights on worker 0-0, policy_version 512283 (0.00897) [2022-07-10 01:42:52,752][26022] Updated weights on worker 0-0, policy_version 512293 (0.00090) [2022-07-10 01:42:54,660][26022] Updated weights on worker 0-0, policy_version 512303 (0.00085) [2022-07-10 01:42:55,891][25689] Fps is (10 sec: 6028.0, 60 sec: 5683.2, 300 sec: 5681.3). Total num frames: 524605440. Throughput: 0: 6013.2. Samples: 524607382. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:42:55,892][25689] Avg episode reward: [(0, '-31.916')] [2022-07-10 01:42:56,648][26022] Updated weights on worker 0-0, policy_version 512313 (0.00091) [2022-07-10 01:42:58,251][26022] Updated weights on worker 0-0, policy_version 512323 (0.00081) [2022-07-10 01:43:00,223][26022] Updated weights on worker 0-0, policy_version 512333 (0.00089) [2022-07-10 01:43:00,927][25689] Fps is (10 sec: 5705.5, 60 sec: 5687.9, 300 sec: 5687.6). Total num frames: 524633088. Throughput: 0: 6035.8. Samples: 524641996. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 01:43:00,928][25689] Avg episode reward: [(0, '-31.912')] [2022-07-10 01:43:02,072][26022] Updated weights on worker 0-0, policy_version 512343 (0.00081) [2022-07-10 01:43:04,194][26022] Updated weights on worker 0-0, policy_version 512353 (0.00088) [2022-07-10 01:43:05,690][26022] Updated weights on worker 0-0, policy_version 512363 (0.00098) [2022-07-10 01:43:05,959][25689] Fps is (10 sec: 5390.7, 60 sec: 5693.0, 300 sec: 5677.0). Total num frames: 524659712. Throughput: 0: 5078.6. Samples: 524657020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:05,959][25689] Avg episode reward: [(0, '-32.206')] [2022-07-10 01:43:07,550][26022] Updated weights on worker 0-0, policy_version 512373 (0.00090) [2022-07-10 01:43:09,475][26022] Updated weights on worker 0-0, policy_version 512383 (0.00084) [2022-07-10 01:43:10,969][25689] Fps is (10 sec: 5608.7, 60 sec: 5709.4, 300 sec: 5680.7). Total num frames: 524689408. Throughput: 0: 5908.4. Samples: 524691270. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:10,969][25689] Avg episode reward: [(0, '-33.817')] [2022-07-10 01:43:11,234][26022] Updated weights on worker 0-0, policy_version 512393 (0.00099) [2022-07-10 01:43:12,995][26022] Updated weights on worker 0-0, policy_version 512403 (0.00081) [2022-07-10 01:43:14,868][26022] Updated weights on worker 0-0, policy_version 512413 (0.00086) [2022-07-10 01:43:16,036][25689] Fps is (10 sec: 5690.5, 60 sec: 5670.4, 300 sec: 5677.1). Total num frames: 524717056. Throughput: 0: 5861.1. Samples: 524725452. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:16,040][25689] Avg episode reward: [(0, '-34.084')] [2022-07-10 01:43:16,633][26022] Updated weights on worker 0-0, policy_version 512423 (0.00050) [2022-07-10 01:43:18,413][26022] Updated weights on worker 0-0, policy_version 512433 (0.00083) [2022-07-10 01:43:20,185][26022] Updated weights on worker 0-0, policy_version 512443 (0.00082) [2022-07-10 01:43:21,091][25689] Fps is (10 sec: 5665.2, 60 sec: 5695.7, 300 sec: 5679.6). Total num frames: 524746752. Throughput: 0: 4987.5. Samples: 524742562. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:21,091][25689] Avg episode reward: [(0, '-34.412')] [2022-07-10 01:43:22,149][26022] Updated weights on worker 0-0, policy_version 512453 (0.00085) [2022-07-10 01:43:23,718][26022] Updated weights on worker 0-0, policy_version 512463 (0.00083) [2022-07-10 01:43:25,707][26022] Updated weights on worker 0-0, policy_version 512473 (0.00092) [2022-07-10 01:43:26,170][25689] Fps is (10 sec: 5658.6, 60 sec: 5672.6, 300 sec: 5675.0). Total num frames: 524774400. Throughput: 0: 5931.1. Samples: 524776894. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:26,171][25689] Avg episode reward: [(0, '-33.775')] [2022-07-10 01:43:27,278][26022] Updated weights on worker 0-0, policy_version 512483 (0.00087) [2022-07-10 01:43:29,499][26022] Updated weights on worker 0-0, policy_version 512493 (0.00089) [2022-07-10 01:43:30,963][26022] Updated weights on worker 0-0, policy_version 512503 (0.00081) [2022-07-10 01:43:31,185][25689] Fps is (10 sec: 5680.8, 60 sec: 5688.8, 300 sec: 5678.7). Total num frames: 524804096. Throughput: 0: 5916.1. Samples: 524810874. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:31,186][25689] Avg episode reward: [(0, '-34.066')] [2022-07-10 01:43:32,943][26022] Updated weights on worker 0-0, policy_version 512513 (0.00089) [2022-07-10 01:43:34,623][26022] Updated weights on worker 0-0, policy_version 512523 (0.00083) [2022-07-10 01:43:36,213][25689] Fps is (10 sec: 5812.0, 60 sec: 5706.9, 300 sec: 5680.1). Total num frames: 524832768. Throughput: 0: 5082.1. Samples: 524827992. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:36,213][25689] Avg episode reward: [(0, '-34.320')] [2022-07-10 01:43:36,377][26022] Updated weights on worker 0-0, policy_version 512533 (0.00085) [2022-07-10 01:43:38,243][26022] Updated weights on worker 0-0, policy_version 512543 (0.00093) [2022-07-10 01:43:40,192][26022] Updated weights on worker 0-0, policy_version 512553 (0.00088) [2022-07-10 01:43:41,296][25689] Fps is (10 sec: 5671.4, 60 sec: 5691.0, 300 sec: 5679.0). Total num frames: 524861440. Throughput: 0: 5920.3. Samples: 524862184. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:41,297][25689] Avg episode reward: [(0, '-32.843')] [2022-07-10 01:43:41,861][26022] Updated weights on worker 0-0, policy_version 512563 (0.00093) [2022-07-10 01:43:43,732][26022] Updated weights on worker 0-0, policy_version 512573 (0.00064) [2022-07-10 01:43:45,223][26022] Updated weights on worker 0-0, policy_version 512583 (0.00086) [2022-07-10 01:43:46,300][25689] Fps is (10 sec: 5685.0, 60 sec: 5710.4, 300 sec: 5680.7). Total num frames: 524890112. Throughput: 0: 5950.8. Samples: 524896682. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:46,301][25689] Avg episode reward: [(0, '-33.236')] [2022-07-10 01:43:47,202][26022] Updated weights on worker 0-0, policy_version 512593 (0.00084) [2022-07-10 01:43:48,800][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:43:48,810][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000512602_524904448.pth [2022-07-10 01:43:48,814][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000510606_522860544.pth [2022-07-10 01:43:48,931][26022] Updated weights on worker 0-0, policy_version 512603 (0.00089) [2022-07-10 01:43:50,781][26022] Updated weights on worker 0-0, policy_version 512613 (0.00090) [2022-07-10 01:43:51,312][25689] Fps is (10 sec: 5725.7, 60 sec: 5676.2, 300 sec: 5678.0). Total num frames: 524918784. Throughput: 0: 5121.2. Samples: 524913944. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:51,312][25689] Avg episode reward: [(0, '-32.831')] [2022-07-10 01:43:52,708][26022] Updated weights on worker 0-0, policy_version 512623 (0.00085) [2022-07-10 01:43:54,342][26022] Updated weights on worker 0-0, policy_version 512633 (0.00086) [2022-07-10 01:43:56,384][25689] Fps is (10 sec: 5483.3, 60 sec: 5620.0, 300 sec: 5667.3). Total num frames: 524945408. Throughput: 0: 5932.8. Samples: 524947666. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:43:56,385][25689] Avg episode reward: [(0, '-33.376')] [2022-07-10 01:43:56,522][26022] Updated weights on worker 0-0, policy_version 512643 (0.00088) [2022-07-10 01:43:58,047][26022] Updated weights on worker 0-0, policy_version 512653 (0.00089) [2022-07-10 01:44:00,022][26022] Updated weights on worker 0-0, policy_version 512663 (0.00085) [2022-07-10 01:44:01,455][25689] Fps is (10 sec: 5552.3, 60 sec: 5650.5, 300 sec: 5677.7). Total num frames: 524975104. Throughput: 0: 5925.9. Samples: 524981642. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:01,456][25689] Avg episode reward: [(0, '-34.392')] [2022-07-10 01:44:01,901][26022] Updated weights on worker 0-0, policy_version 512673 (0.00099) [2022-07-10 01:44:03,790][26022] Updated weights on worker 0-0, policy_version 512683 (0.00084) [2022-07-10 01:44:05,518][26022] Updated weights on worker 0-0, policy_version 512693 (0.00091) [2022-07-10 01:44:06,491][25689] Fps is (10 sec: 5471.0, 60 sec: 5633.2, 300 sec: 5670.6). Total num frames: 525000704. Throughput: 0: 4941.5. Samples: 524996458. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:06,492][25689] Avg episode reward: [(0, '-35.073')] [2022-07-10 01:44:07,551][26022] Updated weights on worker 0-0, policy_version 512703 (0.00083) [2022-07-10 01:44:09,343][26022] Updated weights on worker 0-0, policy_version 512713 (0.00082) [2022-07-10 01:44:11,015][26022] Updated weights on worker 0-0, policy_version 512723 (0.00090) [2022-07-10 01:44:11,498][25689] Fps is (10 sec: 5506.1, 60 sec: 5633.5, 300 sec: 5668.6). Total num frames: 525030400. Throughput: 0: 5789.4. Samples: 525030810. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:11,499][25689] Avg episode reward: [(0, '-34.056')] [2022-07-10 01:44:13,041][26022] Updated weights on worker 0-0, policy_version 512733 (0.00087) [2022-07-10 01:44:14,633][26022] Updated weights on worker 0-0, policy_version 512743 (0.00091) [2022-07-10 01:44:16,514][25689] Fps is (10 sec: 5823.5, 60 sec: 5655.2, 300 sec: 5675.9). Total num frames: 525059072. Throughput: 0: 5833.9. Samples: 525065102. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:16,515][25689] Avg episode reward: [(0, '-34.447')] [2022-07-10 01:44:16,515][26022] Updated weights on worker 0-0, policy_version 512753 (0.00090) [2022-07-10 01:44:18,209][26022] Updated weights on worker 0-0, policy_version 512763 (0.00079) [2022-07-10 01:44:19,845][26022] Updated weights on worker 0-0, policy_version 512773 (0.00089) [2022-07-10 01:44:21,617][25689] Fps is (10 sec: 5666.9, 60 sec: 5633.8, 300 sec: 5668.6). Total num frames: 525087744. Throughput: 0: 4998.8. Samples: 525082426. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:21,619][25689] Avg episode reward: [(0, '-33.707')] [2022-07-10 01:44:21,951][26022] Updated weights on worker 0-0, policy_version 512783 (0.00091) [2022-07-10 01:44:23,495][26022] Updated weights on worker 0-0, policy_version 512793 (0.00086) [2022-07-10 01:44:25,502][26022] Updated weights on worker 0-0, policy_version 512803 (0.00090) [2022-07-10 01:44:26,634][25689] Fps is (10 sec: 5767.5, 60 sec: 5673.4, 300 sec: 5671.9). Total num frames: 525117440. Throughput: 0: 5960.5. Samples: 525116520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:26,635][25689] Avg episode reward: [(0, '-32.656')] [2022-07-10 01:44:27,060][26022] Updated weights on worker 0-0, policy_version 512813 (0.00088) [2022-07-10 01:44:28,934][26022] Updated weights on worker 0-0, policy_version 512823 (0.00091) [2022-07-10 01:44:30,952][26022] Updated weights on worker 0-0, policy_version 512833 (0.00088) [2022-07-10 01:44:31,664][25689] Fps is (10 sec: 5911.0, 60 sec: 5672.0, 300 sec: 5679.0). Total num frames: 525147136. Throughput: 0: 5947.8. Samples: 525150758. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:31,665][25689] Avg episode reward: [(0, '-32.415')] [2022-07-10 01:44:32,574][26022] Updated weights on worker 0-0, policy_version 512843 (0.00095) [2022-07-10 01:44:34,488][26022] Updated weights on worker 0-0, policy_version 512853 (0.00093) [2022-07-10 01:44:36,039][26022] Updated weights on worker 0-0, policy_version 512863 (0.00092) [2022-07-10 01:44:36,724][25689] Fps is (10 sec: 5683.7, 60 sec: 5652.2, 300 sec: 5675.5). Total num frames: 525174784. Throughput: 0: 5083.4. Samples: 525167834. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:36,725][25689] Avg episode reward: [(0, '-31.924')] [2022-07-10 01:44:37,925][26022] Updated weights on worker 0-0, policy_version 512873 (0.00086) [2022-07-10 01:44:39,738][26022] Updated weights on worker 0-0, policy_version 512883 (0.00087) [2022-07-10 01:44:41,532][26022] Updated weights on worker 0-0, policy_version 512893 (0.00092) [2022-07-10 01:44:41,845][25689] Fps is (10 sec: 5632.6, 60 sec: 5665.5, 300 sec: 5674.1). Total num frames: 525204480. Throughput: 0: 5942.0. Samples: 525202622. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:41,846][25689] Avg episode reward: [(0, '-33.128')] [2022-07-10 01:44:43,326][26022] Updated weights on worker 0-0, policy_version 512903 (0.00092) [2022-07-10 01:44:45,123][26022] Updated weights on worker 0-0, policy_version 512913 (0.00084) [2022-07-10 01:44:46,828][26022] Updated weights on worker 0-0, policy_version 512923 (0.00096) [2022-07-10 01:44:46,915][25689] Fps is (10 sec: 5727.2, 60 sec: 5659.3, 300 sec: 5673.1). Total num frames: 525233152. Throughput: 0: 5944.0. Samples: 525237066. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:46,915][25689] Avg episode reward: [(0, '-32.734')] [2022-07-10 01:44:48,552][26022] Updated weights on worker 0-0, policy_version 512933 (0.00097) [2022-07-10 01:44:50,461][26022] Updated weights on worker 0-0, policy_version 512943 (0.00081) [2022-07-10 01:44:51,963][25689] Fps is (10 sec: 5769.0, 60 sec: 5672.8, 300 sec: 5679.2). Total num frames: 525262848. Throughput: 0: 5935.4. Samples: 525271236. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:51,964][25689] Avg episode reward: [(0, '-32.458')] [2022-07-10 01:44:52,149][26022] Updated weights on worker 0-0, policy_version 512953 (0.00093) [2022-07-10 01:44:54,222][26022] Updated weights on worker 0-0, policy_version 512963 (0.00079) [2022-07-10 01:44:56,086][26022] Updated weights on worker 0-0, policy_version 512973 (0.00092) [2022-07-10 01:44:56,997][25689] Fps is (10 sec: 5687.4, 60 sec: 5693.3, 300 sec: 5676.3). Total num frames: 525290496. Throughput: 0: 5925.0. Samples: 525287958. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:44:56,999][25689] Avg episode reward: [(0, '-31.154')] [2022-07-10 01:44:57,799][26022] Updated weights on worker 0-0, policy_version 512983 (0.00091) [2022-07-10 01:44:59,545][26022] Updated weights on worker 0-0, policy_version 512993 (0.00086) [2022-07-10 01:45:01,332][26022] Updated weights on worker 0-0, policy_version 513003 (0.00095) [2022-07-10 01:45:02,111][25689] Fps is (10 sec: 5448.6, 60 sec: 5655.5, 300 sec: 5674.6). Total num frames: 525318144. Throughput: 0: 5895.0. Samples: 525322090. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:02,113][25689] Avg episode reward: [(0, '-29.971')] [2022-07-10 01:45:03,491][26022] Updated weights on worker 0-0, policy_version 513013 (0.00083) [2022-07-10 01:45:05,457][26022] Updated weights on worker 0-0, policy_version 513023 (0.00086) [2022-07-10 01:45:06,957][26022] Updated weights on worker 0-0, policy_version 513033 (0.00080) [2022-07-10 01:45:07,164][25689] Fps is (10 sec: 5439.0, 60 sec: 5687.7, 300 sec: 5670.5). Total num frames: 525345792. Throughput: 0: 5765.3. Samples: 525353810. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:07,165][25689] Avg episode reward: [(0, '-28.994')] [2022-07-10 01:45:09,142][26022] Updated weights on worker 0-0, policy_version 513043 (0.00088) [2022-07-10 01:45:10,445][26022] Updated weights on worker 0-0, policy_version 513053 (0.00087) [2022-07-10 01:45:12,168][25689] Fps is (10 sec: 5498.3, 60 sec: 5654.2, 300 sec: 5667.4). Total num frames: 525373440. Throughput: 0: 4946.0. Samples: 525371172. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:12,169][25689] Avg episode reward: [(0, '-29.544')] [2022-07-10 01:45:12,525][26022] Updated weights on worker 0-0, policy_version 513063 (0.00083) [2022-07-10 01:45:14,207][26022] Updated weights on worker 0-0, policy_version 513073 (0.00088) [2022-07-10 01:45:16,011][26022] Updated weights on worker 0-0, policy_version 513083 (0.00085) [2022-07-10 01:45:17,215][25689] Fps is (10 sec: 5705.4, 60 sec: 5668.2, 300 sec: 5667.7). Total num frames: 525403136. Throughput: 0: 5830.1. Samples: 525405828. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:17,217][25689] Avg episode reward: [(0, '-30.413')] [2022-07-10 01:45:17,799][26022] Updated weights on worker 0-0, policy_version 513093 (0.00091) [2022-07-10 01:45:19,519][26022] Updated weights on worker 0-0, policy_version 513103 (0.00086) [2022-07-10 01:45:21,488][26022] Updated weights on worker 0-0, policy_version 513113 (0.00086) [2022-07-10 01:45:22,291][25689] Fps is (10 sec: 5766.0, 60 sec: 5670.7, 300 sec: 5664.5). Total num frames: 525431808. Throughput: 0: 5852.6. Samples: 525440194. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:22,292][25689] Avg episode reward: [(0, '-30.893')] [2022-07-10 01:45:23,110][26022] Updated weights on worker 0-0, policy_version 513123 (0.00093) [2022-07-10 01:45:24,976][26022] Updated weights on worker 0-0, policy_version 513133 (0.00087) [2022-07-10 01:45:26,735][26022] Updated weights on worker 0-0, policy_version 513143 (0.00081) [2022-07-10 01:45:27,346][25689] Fps is (10 sec: 5660.0, 60 sec: 5650.3, 300 sec: 5671.7). Total num frames: 525460480. Throughput: 0: 5123.0. Samples: 525457204. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:27,347][25689] Avg episode reward: [(0, '-32.267')] [2022-07-10 01:45:28,954][26022] Updated weights on worker 0-0, policy_version 513153 (0.00086) [2022-07-10 01:45:30,474][26022] Updated weights on worker 0-0, policy_version 513163 (0.00088) [2022-07-10 01:45:32,268][26022] Updated weights on worker 0-0, policy_version 513173 (0.00091) [2022-07-10 01:45:32,356][25689] Fps is (10 sec: 5697.4, 60 sec: 5635.3, 300 sec: 5668.7). Total num frames: 525489152. Throughput: 0: 5944.5. Samples: 525491178. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:32,356][25689] Avg episode reward: [(0, '-33.086')] [2022-07-10 01:45:34,028][26022] Updated weights on worker 0-0, policy_version 513183 (0.00087) [2022-07-10 01:45:35,790][26022] Updated weights on worker 0-0, policy_version 513193 (0.00094) [2022-07-10 01:45:37,366][25689] Fps is (10 sec: 5825.2, 60 sec: 5673.6, 300 sec: 5677.5). Total num frames: 525518848. Throughput: 0: 5950.8. Samples: 525525746. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:37,367][25689] Avg episode reward: [(0, '-32.764')] [2022-07-10 01:45:37,680][26022] Updated weights on worker 0-0, policy_version 513203 (0.00090) [2022-07-10 01:45:39,369][26022] Updated weights on worker 0-0, policy_version 513213 (0.00088) [2022-07-10 01:45:41,130][26022] Updated weights on worker 0-0, policy_version 513223 (0.00088) [2022-07-10 01:45:42,425][25689] Fps is (10 sec: 5797.0, 60 sec: 5662.6, 300 sec: 5670.7). Total num frames: 525547520. Throughput: 0: 5106.0. Samples: 525542996. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:42,425][25689] Avg episode reward: [(0, '-32.384')] [2022-07-10 01:45:43,013][26022] Updated weights on worker 0-0, policy_version 513233 (0.00089) [2022-07-10 01:45:44,671][26022] Updated weights on worker 0-0, policy_version 513243 (0.00083) [2022-07-10 01:45:46,502][26022] Updated weights on worker 0-0, policy_version 513253 (0.00089) [2022-07-10 01:45:47,435][25689] Fps is (10 sec: 5695.5, 60 sec: 5668.2, 300 sec: 5674.8). Total num frames: 525576192. Throughput: 0: 5992.6. Samples: 525577584. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:47,435][25689] Avg episode reward: [(0, '-33.063')] [2022-07-10 01:45:48,343][26022] Updated weights on worker 0-0, policy_version 513263 (0.00087) [2022-07-10 01:45:48,948][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:45:48,971][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000513266_525584384.pth [2022-07-10 01:45:48,971][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000511270_523540480.pth [2022-07-10 01:45:50,133][26022] Updated weights on worker 0-0, policy_version 513273 (0.00102) [2022-07-10 01:45:51,937][26022] Updated weights on worker 0-0, policy_version 513283 (0.00086) [2022-07-10 01:45:52,442][25689] Fps is (10 sec: 5724.4, 60 sec: 5655.1, 300 sec: 5674.7). Total num frames: 525604864. Throughput: 0: 6026.2. Samples: 525612220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:52,443][25689] Avg episode reward: [(0, '-32.632')] [2022-07-10 01:45:53,566][26022] Updated weights on worker 0-0, policy_version 513293 (0.00096) [2022-07-10 01:45:55,352][26022] Updated weights on worker 0-0, policy_version 513303 (0.00082) [2022-07-10 01:45:57,313][26022] Updated weights on worker 0-0, policy_version 513313 (0.00092) [2022-07-10 01:45:57,451][25689] Fps is (10 sec: 5622.8, 60 sec: 5657.5, 300 sec: 5668.5). Total num frames: 525632512. Throughput: 0: 5151.5. Samples: 525629212. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2022-07-10 01:45:57,452][25689] Avg episode reward: [(0, '-32.400')] [2022-07-10 01:45:59,078][26022] Updated weights on worker 0-0, policy_version 513323 (0.00503) [2022-07-10 01:46:00,910][26022] Updated weights on worker 0-0, policy_version 513333 (0.00083) [2022-07-10 01:46:02,518][25689] Fps is (10 sec: 5589.6, 60 sec: 5678.8, 300 sec: 5674.3). Total num frames: 525661184. Throughput: 0: 6002.5. Samples: 525663606. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:02,519][25689] Avg episode reward: [(0, '-31.861')] [2022-07-10 01:46:03,016][26022] Updated weights on worker 0-0, policy_version 513343 (0.00091) [2022-07-10 01:46:04,831][26022] Updated weights on worker 0-0, policy_version 513353 (0.00093) [2022-07-10 01:46:06,670][26022] Updated weights on worker 0-0, policy_version 513363 (0.00098) [2022-07-10 01:46:07,543][25689] Fps is (10 sec: 5580.9, 60 sec: 5681.5, 300 sec: 5673.9). Total num frames: 525688832. Throughput: 0: 5857.1. Samples: 525695358. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:07,544][25689] Avg episode reward: [(0, '-32.236')] [2022-07-10 01:46:08,496][26022] Updated weights on worker 0-0, policy_version 513373 (0.00093) [2022-07-10 01:46:10,271][26022] Updated weights on worker 0-0, policy_version 513383 (0.00087) [2022-07-10 01:46:12,153][26022] Updated weights on worker 0-0, policy_version 513393 (0.00098) [2022-07-10 01:46:12,566][25689] Fps is (10 sec: 5503.3, 60 sec: 5679.7, 300 sec: 5666.8). Total num frames: 525716480. Throughput: 0: 4972.4. Samples: 525712284. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:12,568][25689] Avg episode reward: [(0, '-32.820')] [2022-07-10 01:46:13,943][26022] Updated weights on worker 0-0, policy_version 513403 (0.00095) [2022-07-10 01:46:15,787][26022] Updated weights on worker 0-0, policy_version 513413 (0.00085) [2022-07-10 01:46:17,426][26022] Updated weights on worker 0-0, policy_version 513423 (0.00092) [2022-07-10 01:46:17,587][25689] Fps is (10 sec: 5708.9, 60 sec: 5682.1, 300 sec: 5674.2). Total num frames: 525746176. Throughput: 0: 5829.0. Samples: 525746586. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:17,589][25689] Avg episode reward: [(0, '-33.250')] [2022-07-10 01:46:19,242][26022] Updated weights on worker 0-0, policy_version 513433 (0.00091) [2022-07-10 01:46:21,080][26022] Updated weights on worker 0-0, policy_version 513443 (0.00086) [2022-07-10 01:46:22,629][25689] Fps is (10 sec: 5800.4, 60 sec: 5685.4, 300 sec: 5673.6). Total num frames: 525774848. Throughput: 0: 5843.9. Samples: 525781130. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:22,630][25689] Avg episode reward: [(0, '-34.257')] [2022-07-10 01:46:22,819][26022] Updated weights on worker 0-0, policy_version 513453 (0.00086) [2022-07-10 01:46:24,763][26022] Updated weights on worker 0-0, policy_version 513463 (0.00083) [2022-07-10 01:46:26,329][26022] Updated weights on worker 0-0, policy_version 513473 (0.00097) [2022-07-10 01:46:27,642][25689] Fps is (10 sec: 5601.5, 60 sec: 5672.3, 300 sec: 5673.6). Total num frames: 525802496. Throughput: 0: 5126.9. Samples: 525798404. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:27,644][25689] Avg episode reward: [(0, '-34.084')] [2022-07-10 01:46:28,385][26022] Updated weights on worker 0-0, policy_version 513483 (0.00095) [2022-07-10 01:46:29,877][26022] Updated weights on worker 0-0, policy_version 513493 (0.00088) [2022-07-10 01:46:31,767][26022] Updated weights on worker 0-0, policy_version 513503 (0.00082) [2022-07-10 01:46:32,650][25689] Fps is (10 sec: 5722.1, 60 sec: 5689.5, 300 sec: 5674.0). Total num frames: 525832192. Throughput: 0: 5999.1. Samples: 525832772. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:32,651][25689] Avg episode reward: [(0, '-34.165')] [2022-07-10 01:46:33,498][26022] Updated weights on worker 0-0, policy_version 513513 (0.00088) [2022-07-10 01:46:35,291][26022] Updated weights on worker 0-0, policy_version 513523 (0.00090) [2022-07-10 01:46:37,107][26022] Updated weights on worker 0-0, policy_version 513533 (0.00090) [2022-07-10 01:46:37,674][25689] Fps is (10 sec: 5920.4, 60 sec: 5688.2, 300 sec: 5678.2). Total num frames: 525861888. Throughput: 0: 6016.3. Samples: 525867430. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:37,674][25689] Avg episode reward: [(0, '-34.215')] [2022-07-10 01:46:38,776][26022] Updated weights on worker 0-0, policy_version 513543 (0.00088) [2022-07-10 01:46:40,515][26022] Updated weights on worker 0-0, policy_version 513553 (0.00095) [2022-07-10 01:46:42,566][26022] Updated weights on worker 0-0, policy_version 513563 (0.00086) [2022-07-10 01:46:42,753][25689] Fps is (10 sec: 5676.2, 60 sec: 5669.3, 300 sec: 5670.4). Total num frames: 525889536. Throughput: 0: 5135.3. Samples: 525884474. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:42,753][25689] Avg episode reward: [(0, '-34.300')] [2022-07-10 01:46:44,225][26022] Updated weights on worker 0-0, policy_version 513573 (0.00088) [2022-07-10 01:46:46,088][26022] Updated weights on worker 0-0, policy_version 513583 (0.00087) [2022-07-10 01:46:47,786][25689] Fps is (10 sec: 5670.4, 60 sec: 5684.0, 300 sec: 5680.2). Total num frames: 525919232. Throughput: 0: 5981.8. Samples: 525918904. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:47,787][25689] Avg episode reward: [(0, '-33.579')] [2022-07-10 01:46:47,792][26022] Updated weights on worker 0-0, policy_version 513593 (0.00091) [2022-07-10 01:46:49,493][26022] Updated weights on worker 0-0, policy_version 513603 (0.00084) [2022-07-10 01:46:51,323][26022] Updated weights on worker 0-0, policy_version 513613 (0.00098) [2022-07-10 01:46:52,844][25689] Fps is (10 sec: 5885.1, 60 sec: 5696.2, 300 sec: 5672.6). Total num frames: 525948928. Throughput: 0: 6004.2. Samples: 525954022. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:52,845][25689] Avg episode reward: [(0, '-33.802')] [2022-07-10 01:46:52,966][26022] Updated weights on worker 0-0, policy_version 513623 (0.00084) [2022-07-10 01:46:54,910][26022] Updated weights on worker 0-0, policy_version 513633 (0.00088) [2022-07-10 01:46:56,631][26022] Updated weights on worker 0-0, policy_version 513643 (0.00084) [2022-07-10 01:46:57,882][25689] Fps is (10 sec: 5781.8, 60 sec: 5710.5, 300 sec: 5676.9). Total num frames: 525977600. Throughput: 0: 5133.7. Samples: 525971172. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:46:57,882][25689] Avg episode reward: [(0, '-34.202')] [2022-07-10 01:46:58,453][26022] Updated weights on worker 0-0, policy_version 513653 (0.00095) [2022-07-10 01:47:00,208][26022] Updated weights on worker 0-0, policy_version 513663 (0.00090) [2022-07-10 01:47:02,470][26022] Updated weights on worker 0-0, policy_version 513673 (0.00088) [2022-07-10 01:47:02,935][25689] Fps is (10 sec: 5378.3, 60 sec: 5660.9, 300 sec: 5674.1). Total num frames: 526003200. Throughput: 0: 5985.8. Samples: 526005284. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:02,936][25689] Avg episode reward: [(0, '-32.897')] [2022-07-10 01:47:04,257][26022] Updated weights on worker 0-0, policy_version 513683 (0.00087) [2022-07-10 01:47:06,087][26022] Updated weights on worker 0-0, policy_version 513693 (0.00088) [2022-07-10 01:47:07,767][26022] Updated weights on worker 0-0, policy_version 513703 (0.00088) [2022-07-10 01:47:08,016][25689] Fps is (10 sec: 5456.5, 60 sec: 5689.6, 300 sec: 5676.1). Total num frames: 526032896. Throughput: 0: 5861.4. Samples: 526037476. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:08,016][25689] Avg episode reward: [(0, '-33.290')] [2022-07-10 01:47:09,591][26022] Updated weights on worker 0-0, policy_version 513713 (0.00089) [2022-07-10 01:47:11,467][26022] Updated weights on worker 0-0, policy_version 513723 (0.00084) [2022-07-10 01:47:13,099][25689] Fps is (10 sec: 5742.9, 60 sec: 5700.8, 300 sec: 5671.3). Total num frames: 526061568. Throughput: 0: 4969.4. Samples: 526054674. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:13,100][25689] Avg episode reward: [(0, '-32.839')] [2022-07-10 01:47:13,193][26022] Updated weights on worker 0-0, policy_version 513733 (0.00089) [2022-07-10 01:47:15,131][26022] Updated weights on worker 0-0, policy_version 513743 (0.00082) [2022-07-10 01:47:16,702][26022] Updated weights on worker 0-0, policy_version 513753 (0.00084) [2022-07-10 01:47:18,120][25689] Fps is (10 sec: 5675.3, 60 sec: 5684.0, 300 sec: 5673.6). Total num frames: 526090240. Throughput: 0: 5807.1. Samples: 526088696. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:18,120][25689] Avg episode reward: [(0, '-31.689')] [2022-07-10 01:47:18,590][26022] Updated weights on worker 0-0, policy_version 513763 (0.00103) [2022-07-10 01:47:20,404][26022] Updated weights on worker 0-0, policy_version 513773 (0.00092) [2022-07-10 01:47:22,119][26022] Updated weights on worker 0-0, policy_version 513783 (0.00088) [2022-07-10 01:47:23,233][25689] Fps is (10 sec: 5658.6, 60 sec: 5677.2, 300 sec: 5671.8). Total num frames: 526118912. Throughput: 0: 5813.1. Samples: 526123276. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:23,233][25689] Avg episode reward: [(0, '-30.453')] [2022-07-10 01:47:23,901][26022] Updated weights on worker 0-0, policy_version 513793 (0.00086) [2022-07-10 01:47:25,728][26022] Updated weights on worker 0-0, policy_version 513803 (0.00093) [2022-07-10 01:47:27,412][26022] Updated weights on worker 0-0, policy_version 513813 (0.00090) [2022-07-10 01:47:28,285][25689] Fps is (10 sec: 5741.7, 60 sec: 5707.3, 300 sec: 5674.3). Total num frames: 526148608. Throughput: 0: 5915.5. Samples: 526157382. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:28,286][25689] Avg episode reward: [(0, '-29.778')] [2022-07-10 01:47:29,439][26022] Updated weights on worker 0-0, policy_version 513823 (0.00093) [2022-07-10 01:47:31,070][26022] Updated weights on worker 0-0, policy_version 513833 (0.00102) [2022-07-10 01:47:32,914][26022] Updated weights on worker 0-0, policy_version 513843 (0.00089) [2022-07-10 01:47:33,327][25689] Fps is (10 sec: 5680.9, 60 sec: 5670.4, 300 sec: 5674.3). Total num frames: 526176256. Throughput: 0: 5919.7. Samples: 526174418. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:33,327][25689] Avg episode reward: [(0, '-30.274')] [2022-07-10 01:47:34,597][26022] Updated weights on worker 0-0, policy_version 513853 (0.00087) [2022-07-10 01:47:36,485][26022] Updated weights on worker 0-0, policy_version 513863 (0.00086) [2022-07-10 01:47:38,020][26022] Updated weights on worker 0-0, policy_version 513873 (0.00086) [2022-07-10 01:47:38,354][25689] Fps is (10 sec: 5797.2, 60 sec: 5687.0, 300 sec: 5679.0). Total num frames: 526206976. Throughput: 0: 5943.1. Samples: 526208948. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:38,354][25689] Avg episode reward: [(0, '-30.621')] [2022-07-10 01:47:40,153][26022] Updated weights on worker 0-0, policy_version 513883 (0.00094) [2022-07-10 01:47:41,813][26022] Updated weights on worker 0-0, policy_version 513893 (0.00085) [2022-07-10 01:47:43,479][25689] Fps is (10 sec: 5749.2, 60 sec: 5682.6, 300 sec: 5677.2). Total num frames: 526234624. Throughput: 0: 5924.5. Samples: 526243226. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:43,480][25689] Avg episode reward: [(0, '-31.228')] [2022-07-10 01:47:43,709][26022] Updated weights on worker 0-0, policy_version 513903 (0.00098) [2022-07-10 01:47:45,397][26022] Updated weights on worker 0-0, policy_version 513913 (0.00090) [2022-07-10 01:47:47,588][26022] Updated weights on worker 0-0, policy_version 513923 (0.00088) [2022-07-10 01:47:48,507][25689] Fps is (10 sec: 5748.6, 60 sec: 5700.1, 300 sec: 5676.8). Total num frames: 526265344. Throughput: 0: 5085.7. Samples: 526260224. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:48,508][25689] Avg episode reward: [(0, '-31.595')] [2022-07-10 01:47:49,105][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:47:49,118][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000513933_526267392.pth [2022-07-10 01:47:49,119][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000511936_524222464.pth [2022-07-10 01:47:49,122][26022] Updated weights on worker 0-0, policy_version 513933 (0.00087) [2022-07-10 01:47:50,919][26022] Updated weights on worker 0-0, policy_version 513943 (0.00087) [2022-07-10 01:47:52,556][26022] Updated weights on worker 0-0, policy_version 513953 (0.00089) [2022-07-10 01:47:53,572][25689] Fps is (10 sec: 5681.5, 60 sec: 5648.8, 300 sec: 5665.6). Total num frames: 526291968. Throughput: 0: 5945.5. Samples: 526294788. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:53,573][25689] Avg episode reward: [(0, '-31.963')] [2022-07-10 01:47:54,452][26022] Updated weights on worker 0-0, policy_version 513963 (0.00084) [2022-07-10 01:47:56,216][26022] Updated weights on worker 0-0, policy_version 513973 (0.00092) [2022-07-10 01:47:58,172][26022] Updated weights on worker 0-0, policy_version 513983 (0.00086) [2022-07-10 01:47:58,672][25689] Fps is (10 sec: 5540.8, 60 sec: 5659.9, 300 sec: 5671.2). Total num frames: 526321664. Throughput: 0: 5905.7. Samples: 526328940. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:47:58,672][25689] Avg episode reward: [(0, '-32.478')] [2022-07-10 01:47:59,836][26022] Updated weights on worker 0-0, policy_version 513993 (0.00089) [2022-07-10 01:48:02,135][26022] Updated weights on worker 0-0, policy_version 514003 (0.00089) [2022-07-10 01:48:03,758][25689] Fps is (10 sec: 5529.2, 60 sec: 5673.6, 300 sec: 5670.2). Total num frames: 526348288. Throughput: 0: 5075.8. Samples: 526346158. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:03,759][25689] Avg episode reward: [(0, '-32.077')] [2022-07-10 01:48:03,807][26022] Updated weights on worker 0-0, policy_version 514013 (0.00084) [2022-07-10 01:48:05,871][26022] Updated weights on worker 0-0, policy_version 514023 (0.00086) [2022-07-10 01:48:07,468][26022] Updated weights on worker 0-0, policy_version 514033 (0.00087) [2022-07-10 01:48:08,778][25689] Fps is (10 sec: 5370.2, 60 sec: 5645.6, 300 sec: 5663.1). Total num frames: 526375936. Throughput: 0: 5814.3. Samples: 526378084. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:08,779][25689] Avg episode reward: [(0, '-32.535')] [2022-07-10 01:48:09,279][26022] Updated weights on worker 0-0, policy_version 514043 (0.00087) [2022-07-10 01:48:10,993][26022] Updated weights on worker 0-0, policy_version 514053 (0.00085) [2022-07-10 01:48:12,819][26022] Updated weights on worker 0-0, policy_version 514063 (0.00082) [2022-07-10 01:48:13,787][25689] Fps is (10 sec: 5717.9, 60 sec: 5669.4, 300 sec: 5671.1). Total num frames: 526405632. Throughput: 0: 5821.2. Samples: 526412462. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:13,788][25689] Avg episode reward: [(0, '-32.210')] [2022-07-10 01:48:14,658][26022] Updated weights on worker 0-0, policy_version 514073 (0.00081) [2022-07-10 01:48:16,421][26022] Updated weights on worker 0-0, policy_version 514083 (0.00091) [2022-07-10 01:48:18,148][26022] Updated weights on worker 0-0, policy_version 514093 (0.00976) [2022-07-10 01:48:18,798][25689] Fps is (10 sec: 5825.3, 60 sec: 5670.3, 300 sec: 5668.5). Total num frames: 526434304. Throughput: 0: 4994.3. Samples: 526429454. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:18,798][25689] Avg episode reward: [(0, '-31.611')] [2022-07-10 01:48:20,011][26022] Updated weights on worker 0-0, policy_version 514103 (0.00087) [2022-07-10 01:48:21,752][26022] Updated weights on worker 0-0, policy_version 514113 (0.00082) [2022-07-10 01:48:23,659][26022] Updated weights on worker 0-0, policy_version 514123 (0.00096) [2022-07-10 01:48:23,895][25689] Fps is (10 sec: 5673.4, 60 sec: 5671.8, 300 sec: 5671.6). Total num frames: 526462976. Throughput: 0: 5826.0. Samples: 526463472. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:23,895][25689] Avg episode reward: [(0, '-31.569')] [2022-07-10 01:48:25,233][26022] Updated weights on worker 0-0, policy_version 514133 (0.00088) [2022-07-10 01:48:27,286][26022] Updated weights on worker 0-0, policy_version 514143 (0.00095) [2022-07-10 01:48:28,920][25689] Fps is (10 sec: 5563.8, 60 sec: 5640.6, 300 sec: 5664.5). Total num frames: 526490624. Throughput: 0: 5926.7. Samples: 526497462. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:28,921][25689] Avg episode reward: [(0, '-30.599')] [2022-07-10 01:48:29,081][26022] Updated weights on worker 0-0, policy_version 514153 (0.00085) [2022-07-10 01:48:30,839][26022] Updated weights on worker 0-0, policy_version 514163 (0.00079) [2022-07-10 01:48:32,699][26022] Updated weights on worker 0-0, policy_version 514173 (0.00098) [2022-07-10 01:48:33,947][25689] Fps is (10 sec: 5704.7, 60 sec: 5675.8, 300 sec: 5668.0). Total num frames: 526520320. Throughput: 0: 5068.3. Samples: 526514636. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:33,947][25689] Avg episode reward: [(0, '-30.227')] [2022-07-10 01:48:34,361][26022] Updated weights on worker 0-0, policy_version 514183 (0.00090) [2022-07-10 01:48:36,235][26022] Updated weights on worker 0-0, policy_version 514193 (0.00091) [2022-07-10 01:48:38,049][26022] Updated weights on worker 0-0, policy_version 514203 (0.00089) [2022-07-10 01:48:38,950][25689] Fps is (10 sec: 5819.5, 60 sec: 5644.2, 300 sec: 5669.5). Total num frames: 526548992. Throughput: 0: 5929.4. Samples: 526548946. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:38,951][25689] Avg episode reward: [(0, '-29.570')] [2022-07-10 01:48:39,794][26022] Updated weights on worker 0-0, policy_version 514213 (0.00085) [2022-07-10 01:48:41,683][26022] Updated weights on worker 0-0, policy_version 514223 (0.00086) [2022-07-10 01:48:43,304][26022] Updated weights on worker 0-0, policy_version 514233 (0.00089) [2022-07-10 01:48:44,031][25689] Fps is (10 sec: 5686.6, 60 sec: 5665.3, 300 sec: 5668.0). Total num frames: 526577664. Throughput: 0: 5948.8. Samples: 526583256. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:44,031][25689] Avg episode reward: [(0, '-30.307')] [2022-07-10 01:48:45,256][26022] Updated weights on worker 0-0, policy_version 514243 (0.00093) [2022-07-10 01:48:47,051][26022] Updated weights on worker 0-0, policy_version 514253 (0.00087) [2022-07-10 01:48:48,937][26022] Updated weights on worker 0-0, policy_version 514263 (0.00086) [2022-07-10 01:48:49,038][25689] Fps is (10 sec: 5582.9, 60 sec: 5616.4, 300 sec: 5664.7). Total num frames: 526605312. Throughput: 0: 5118.7. Samples: 526600438. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 01:48:49,039][25689] Avg episode reward: [(0, '-32.084')] [2022-07-10 01:48:50,588][26022] Updated weights on worker 0-0, policy_version 514273 (0.00089) [2022-07-10 01:48:52,528][26022] Updated weights on worker 0-0, policy_version 514283 (0.00088) [2022-07-10 01:48:54,074][25689] Fps is (10 sec: 5709.8, 60 sec: 5670.0, 300 sec: 5675.7). Total num frames: 526635008. Throughput: 0: 5956.4. Samples: 526634522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:48:54,074][25689] Avg episode reward: [(0, '-32.476')] [2022-07-10 01:48:54,209][26022] Updated weights on worker 0-0, policy_version 514293 (0.00081) [2022-07-10 01:48:56,047][26022] Updated weights on worker 0-0, policy_version 514303 (0.00083) [2022-07-10 01:48:57,723][26022] Updated weights on worker 0-0, policy_version 514313 (0.00093) [2022-07-10 01:48:59,134][25689] Fps is (10 sec: 5781.3, 60 sec: 5656.7, 300 sec: 5672.5). Total num frames: 526663680. Throughput: 0: 5935.0. Samples: 526668738. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:48:59,136][25689] Avg episode reward: [(0, '-32.508')] [2022-07-10 01:48:59,665][26022] Updated weights on worker 0-0, policy_version 514323 (0.00349) [2022-07-10 01:49:01,727][26022] Updated weights on worker 0-0, policy_version 514333 (0.00101) [2022-07-10 01:49:03,539][26022] Updated weights on worker 0-0, policy_version 514343 (0.00619) [2022-07-10 01:49:04,184][25689] Fps is (10 sec: 5469.2, 60 sec: 5660.1, 300 sec: 5675.6). Total num frames: 526690304. Throughput: 0: 5069.4. Samples: 526685422. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:04,185][25689] Avg episode reward: [(0, '-33.487')] [2022-07-10 01:49:05,511][26022] Updated weights on worker 0-0, policy_version 514353 (0.00082) [2022-07-10 01:49:07,218][26022] Updated weights on worker 0-0, policy_version 514363 (0.00093) [2022-07-10 01:49:08,880][26022] Updated weights on worker 0-0, policy_version 514373 (0.00096) [2022-07-10 01:49:09,213][25689] Fps is (10 sec: 5486.2, 60 sec: 5676.2, 300 sec: 5671.8). Total num frames: 526718976. Throughput: 0: 5835.9. Samples: 526718178. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:09,214][25689] Avg episode reward: [(0, '-33.238')] [2022-07-10 01:49:10,797][26022] Updated weights on worker 0-0, policy_version 514383 (0.00085) [2022-07-10 01:49:12,491][26022] Updated weights on worker 0-0, policy_version 514393 (0.00090) [2022-07-10 01:49:14,226][25689] Fps is (10 sec: 5608.4, 60 sec: 5641.9, 300 sec: 5668.4). Total num frames: 526746624. Throughput: 0: 5845.3. Samples: 526752320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:14,227][25689] Avg episode reward: [(0, '-32.473')] [2022-07-10 01:49:14,462][26022] Updated weights on worker 0-0, policy_version 514403 (0.00091) [2022-07-10 01:49:16,151][26022] Updated weights on worker 0-0, policy_version 514413 (0.00087) [2022-07-10 01:49:17,962][26022] Updated weights on worker 0-0, policy_version 514423 (0.00091) [2022-07-10 01:49:19,240][25689] Fps is (10 sec: 5719.2, 60 sec: 5658.6, 300 sec: 5673.5). Total num frames: 526776320. Throughput: 0: 5020.0. Samples: 526769668. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:19,240][25689] Avg episode reward: [(0, '-33.173')] [2022-07-10 01:49:19,492][26022] Updated weights on worker 0-0, policy_version 514433 (0.00085) [2022-07-10 01:49:21,403][26022] Updated weights on worker 0-0, policy_version 514443 (0.00087) [2022-07-10 01:49:23,113][26022] Updated weights on worker 0-0, policy_version 514453 (0.00085) [2022-07-10 01:49:24,295][25689] Fps is (10 sec: 5898.9, 60 sec: 5679.5, 300 sec: 5672.8). Total num frames: 526806016. Throughput: 0: 5919.9. Samples: 526804474. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:24,295][25689] Avg episode reward: [(0, '-32.368')] [2022-07-10 01:49:24,809][26022] Updated weights on worker 0-0, policy_version 514463 (0.00088) [2022-07-10 01:49:26,787][26022] Updated weights on worker 0-0, policy_version 514473 (0.00087) [2022-07-10 01:49:28,701][26022] Updated weights on worker 0-0, policy_version 514483 (0.00091) [2022-07-10 01:49:29,307][25689] Fps is (10 sec: 5798.0, 60 sec: 5697.7, 300 sec: 5669.7). Total num frames: 526834688. Throughput: 0: 5995.3. Samples: 526838644. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:29,307][25689] Avg episode reward: [(0, '-32.360')] [2022-07-10 01:49:30,518][26022] Updated weights on worker 0-0, policy_version 514493 (0.00085) [2022-07-10 01:49:32,201][26022] Updated weights on worker 0-0, policy_version 514503 (0.00093) [2022-07-10 01:49:33,987][26022] Updated weights on worker 0-0, policy_version 514513 (0.00086) [2022-07-10 01:49:34,318][25689] Fps is (10 sec: 5721.0, 60 sec: 5682.2, 300 sec: 5674.1). Total num frames: 526863360. Throughput: 0: 5147.0. Samples: 526855732. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:34,319][25689] Avg episode reward: [(0, '-32.590')] [2022-07-10 01:49:35,867][26022] Updated weights on worker 0-0, policy_version 514523 (0.00088) [2022-07-10 01:49:37,511][26022] Updated weights on worker 0-0, policy_version 514533 (0.00089) [2022-07-10 01:49:39,331][25689] Fps is (10 sec: 5618.5, 60 sec: 5664.3, 300 sec: 5669.3). Total num frames: 526891008. Throughput: 0: 6002.0. Samples: 526890254. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:39,331][25689] Avg episode reward: [(0, '-33.220')] [2022-07-10 01:49:39,452][26022] Updated weights on worker 0-0, policy_version 514543 (0.00087) [2022-07-10 01:49:41,111][26022] Updated weights on worker 0-0, policy_version 514553 (0.00087) [2022-07-10 01:49:43,034][26022] Updated weights on worker 0-0, policy_version 514563 (0.00109) [2022-07-10 01:49:44,442][25689] Fps is (10 sec: 5765.3, 60 sec: 5695.3, 300 sec: 5675.4). Total num frames: 526921728. Throughput: 0: 5971.5. Samples: 526924784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:44,444][25689] Avg episode reward: [(0, '-34.029')] [2022-07-10 01:49:44,514][26022] Updated weights on worker 0-0, policy_version 514573 (0.00092) [2022-07-10 01:49:46,491][26022] Updated weights on worker 0-0, policy_version 514583 (0.00086) [2022-07-10 01:49:48,292][26022] Updated weights on worker 0-0, policy_version 514593 (0.00088) [2022-07-10 01:49:49,221][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:49:49,236][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000514598_526948352.pth [2022-07-10 01:49:49,236][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000512602_524904448.pth [2022-07-10 01:49:49,462][25689] Fps is (10 sec: 5761.3, 60 sec: 5694.2, 300 sec: 5669.0). Total num frames: 526949376. Throughput: 0: 5981.1. Samples: 526959194. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:49,462][25689] Avg episode reward: [(0, '-33.350')] [2022-07-10 01:49:50,041][26022] Updated weights on worker 0-0, policy_version 514603 (0.00088) [2022-07-10 01:49:51,846][26022] Updated weights on worker 0-0, policy_version 514613 (0.00094) [2022-07-10 01:49:53,629][26022] Updated weights on worker 0-0, policy_version 514623 (0.00097) [2022-07-10 01:49:54,518][25689] Fps is (10 sec: 5487.9, 60 sec: 5658.4, 300 sec: 5668.6). Total num frames: 526977024. Throughput: 0: 5972.9. Samples: 526976384. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:54,519][25689] Avg episode reward: [(0, '-33.418')] [2022-07-10 01:49:55,464][26022] Updated weights on worker 0-0, policy_version 514633 (0.00090) [2022-07-10 01:49:57,365][26022] Updated weights on worker 0-0, policy_version 514643 (0.00086) [2022-07-10 01:49:59,068][26022] Updated weights on worker 0-0, policy_version 514653 (0.00087) [2022-07-10 01:49:59,539][25689] Fps is (10 sec: 5791.9, 60 sec: 5695.9, 300 sec: 5680.7). Total num frames: 527007744. Throughput: 0: 5951.8. Samples: 527010532. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:49:59,540][25689] Avg episode reward: [(0, '-33.727')] [2022-07-10 01:50:00,910][26022] Updated weights on worker 0-0, policy_version 514663 (0.00093) [2022-07-10 01:50:02,952][26022] Updated weights on worker 0-0, policy_version 514673 (0.00091) [2022-07-10 01:50:04,679][25689] Fps is (10 sec: 5543.0, 60 sec: 5670.6, 300 sec: 5672.2). Total num frames: 527033344. Throughput: 0: 5813.3. Samples: 527042426. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:04,681][25689] Avg episode reward: [(0, '-33.552')] [2022-07-10 01:50:04,853][26022] Updated weights on worker 0-0, policy_version 514683 (0.00087) [2022-07-10 01:50:06,607][26022] Updated weights on worker 0-0, policy_version 514693 (0.00499) [2022-07-10 01:50:08,680][26022] Updated weights on worker 0-0, policy_version 514703 (0.00090) [2022-07-10 01:50:09,690][25689] Fps is (10 sec: 5346.6, 60 sec: 5672.3, 300 sec: 5675.5). Total num frames: 527062016. Throughput: 0: 4954.8. Samples: 527059422. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:09,692][25689] Avg episode reward: [(0, '-32.723')] [2022-07-10 01:50:10,371][26022] Updated weights on worker 0-0, policy_version 514713 (0.00094) [2022-07-10 01:50:11,963][26022] Updated weights on worker 0-0, policy_version 514723 (0.00075) [2022-07-10 01:50:13,961][26022] Updated weights on worker 0-0, policy_version 514733 (0.00086) [2022-07-10 01:50:14,712][25689] Fps is (10 sec: 5715.3, 60 sec: 5688.4, 300 sec: 5672.5). Total num frames: 527090688. Throughput: 0: 5819.0. Samples: 527093892. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:14,714][25689] Avg episode reward: [(0, '-31.260')] [2022-07-10 01:50:15,442][26022] Updated weights on worker 0-0, policy_version 514743 (0.00087) [2022-07-10 01:50:17,484][26022] Updated weights on worker 0-0, policy_version 514753 (0.00082) [2022-07-10 01:50:19,151][26022] Updated weights on worker 0-0, policy_version 514763 (0.00079) [2022-07-10 01:50:19,734][25689] Fps is (10 sec: 5607.1, 60 sec: 5653.7, 300 sec: 5670.1). Total num frames: 527118336. Throughput: 0: 5825.0. Samples: 527128168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:19,736][25689] Avg episode reward: [(0, '-32.543')] [2022-07-10 01:50:20,919][26022] Updated weights on worker 0-0, policy_version 514773 (0.00305) [2022-07-10 01:50:22,898][26022] Updated weights on worker 0-0, policy_version 514783 (0.00085) [2022-07-10 01:50:24,472][26022] Updated weights on worker 0-0, policy_version 514793 (0.00081) [2022-07-10 01:50:24,814][25689] Fps is (10 sec: 5777.7, 60 sec: 5668.2, 300 sec: 5676.5). Total num frames: 527149056. Throughput: 0: 5113.2. Samples: 527145384. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:24,815][25689] Avg episode reward: [(0, '-32.548')] [2022-07-10 01:50:26,373][26022] Updated weights on worker 0-0, policy_version 514803 (0.00083) [2022-07-10 01:50:28,059][26022] Updated weights on worker 0-0, policy_version 514813 (0.00090) [2022-07-10 01:50:29,835][25689] Fps is (10 sec: 5879.8, 60 sec: 5667.4, 300 sec: 5676.3). Total num frames: 527177728. Throughput: 0: 5986.1. Samples: 527180014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:29,836][25689] Avg episode reward: [(0, '-31.736')] [2022-07-10 01:50:29,942][26022] Updated weights on worker 0-0, policy_version 514823 (0.00091) [2022-07-10 01:50:31,970][26022] Updated weights on worker 0-0, policy_version 514833 (0.00086) [2022-07-10 01:50:33,578][26022] Updated weights on worker 0-0, policy_version 514843 (0.00087) [2022-07-10 01:50:34,847][25689] Fps is (10 sec: 5715.9, 60 sec: 5667.4, 300 sec: 5672.8). Total num frames: 527206400. Throughput: 0: 5962.2. Samples: 527213938. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:34,847][25689] Avg episode reward: [(0, '-32.178')] [2022-07-10 01:50:35,414][26022] Updated weights on worker 0-0, policy_version 514853 (0.00090) [2022-07-10 01:50:37,015][26022] Updated weights on worker 0-0, policy_version 514863 (0.00551) [2022-07-10 01:50:38,987][26022] Updated weights on worker 0-0, policy_version 514873 (0.00089) [2022-07-10 01:50:39,848][25689] Fps is (10 sec: 5829.4, 60 sec: 5702.3, 300 sec: 5677.3). Total num frames: 527236096. Throughput: 0: 5118.9. Samples: 527231128. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:39,849][25689] Avg episode reward: [(0, '-32.925')] [2022-07-10 01:50:40,745][26022] Updated weights on worker 0-0, policy_version 514883 (0.00082) [2022-07-10 01:50:42,475][26022] Updated weights on worker 0-0, policy_version 514893 (0.00090) [2022-07-10 01:50:44,363][26022] Updated weights on worker 0-0, policy_version 514903 (0.00081) [2022-07-10 01:50:44,887][25689] Fps is (10 sec: 5609.6, 60 sec: 5641.4, 300 sec: 5669.9). Total num frames: 527262720. Throughput: 0: 5991.1. Samples: 527265640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:44,887][25689] Avg episode reward: [(0, '-32.682')] [2022-07-10 01:50:46,128][26022] Updated weights on worker 0-0, policy_version 514913 (0.00092) [2022-07-10 01:50:48,156][26022] Updated weights on worker 0-0, policy_version 514923 (0.00084) [2022-07-10 01:50:49,663][26022] Updated weights on worker 0-0, policy_version 514933 (0.00099) [2022-07-10 01:50:49,920][25689] Fps is (10 sec: 5591.8, 60 sec: 5674.0, 300 sec: 5672.9). Total num frames: 527292416. Throughput: 0: 5972.9. Samples: 527299978. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:49,921][25689] Avg episode reward: [(0, '-31.817')] [2022-07-10 01:50:51,600][26022] Updated weights on worker 0-0, policy_version 514943 (0.00094) [2022-07-10 01:50:53,273][26022] Updated weights on worker 0-0, policy_version 514953 (0.00086) [2022-07-10 01:50:54,911][26022] Updated weights on worker 0-0, policy_version 514963 (0.00083) [2022-07-10 01:50:54,937][25689] Fps is (10 sec: 5909.3, 60 sec: 5711.6, 300 sec: 5679.6). Total num frames: 527322112. Throughput: 0: 5141.8. Samples: 527317240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:54,938][25689] Avg episode reward: [(0, '-31.453')] [2022-07-10 01:50:56,736][26022] Updated weights on worker 0-0, policy_version 514973 (0.00090) [2022-07-10 01:50:58,633][26022] Updated weights on worker 0-0, policy_version 514983 (0.00094) [2022-07-10 01:50:59,949][25689] Fps is (10 sec: 5819.7, 60 sec: 5678.5, 300 sec: 5680.6). Total num frames: 527350784. Throughput: 0: 6006.7. Samples: 527351870. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:50:59,950][25689] Avg episode reward: [(0, '-30.860')] [2022-07-10 01:51:00,404][26022] Updated weights on worker 0-0, policy_version 514993 (0.00088) [2022-07-10 01:51:02,827][26022] Updated weights on worker 0-0, policy_version 515003 (0.00089) [2022-07-10 01:51:04,423][26022] Updated weights on worker 0-0, policy_version 515013 (0.00099) [2022-07-10 01:51:05,025][25689] Fps is (10 sec: 5278.5, 60 sec: 5667.6, 300 sec: 5669.3). Total num frames: 527375360. Throughput: 0: 5801.3. Samples: 527382468. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:05,026][25689] Avg episode reward: [(0, '-30.059')] [2022-07-10 01:51:06,781][26022] Updated weights on worker 0-0, policy_version 515023 (0.00085) [2022-07-10 01:51:08,518][26022] Updated weights on worker 0-0, policy_version 515033 (0.00093) [2022-07-10 01:51:10,106][25689] Fps is (10 sec: 4940.0, 60 sec: 5610.1, 300 sec: 5661.3). Total num frames: 527400960. Throughput: 0: 4862.9. Samples: 527398142. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:10,107][25689] Avg episode reward: [(0, '-29.637')] [2022-07-10 01:51:10,534][26022] Updated weights on worker 0-0, policy_version 515043 (0.00090) [2022-07-10 01:51:12,763][26022] Updated weights on worker 0-0, policy_version 515053 (0.00091) [2022-07-10 01:51:14,307][26022] Updated weights on worker 0-0, policy_version 515063 (0.00083) [2022-07-10 01:51:15,127][25689] Fps is (10 sec: 5271.2, 60 sec: 5593.3, 300 sec: 5654.5). Total num frames: 527428608. Throughput: 0: 5558.2. Samples: 527429456. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:15,127][25689] Avg episode reward: [(0, '-30.530')] [2022-07-10 01:51:16,070][26022] Updated weights on worker 0-0, policy_version 515073 (0.00082) [2022-07-10 01:51:18,023][26022] Updated weights on worker 0-0, policy_version 515083 (0.00088) [2022-07-10 01:51:19,687][26022] Updated weights on worker 0-0, policy_version 515093 (0.00091) [2022-07-10 01:51:20,133][25689] Fps is (10 sec: 5617.1, 60 sec: 5611.8, 300 sec: 5655.1). Total num frames: 527457280. Throughput: 0: 5542.1. Samples: 527463730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:20,134][25689] Avg episode reward: [(0, '-30.691')] [2022-07-10 01:51:21,459][26022] Updated weights on worker 0-0, policy_version 515103 (0.00084) [2022-07-10 01:51:23,547][26022] Updated weights on worker 0-0, policy_version 515113 (0.00083) [2022-07-10 01:51:24,832][26022] Updated weights on worker 0-0, policy_version 515123 (0.00091) [2022-07-10 01:51:25,183][25689] Fps is (10 sec: 5804.4, 60 sec: 5597.7, 300 sec: 5661.3). Total num frames: 527486976. Throughput: 0: 4887.1. Samples: 527480980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:25,183][25689] Avg episode reward: [(0, '-30.891')] [2022-07-10 01:51:27,223][26022] Updated weights on worker 0-0, policy_version 515133 (0.00081) [2022-07-10 01:51:28,435][26022] Updated weights on worker 0-0, policy_version 515143 (0.00087) [2022-07-10 01:51:30,209][25689] Fps is (10 sec: 5589.5, 60 sec: 5563.2, 300 sec: 5650.7). Total num frames: 527513600. Throughput: 0: 5826.5. Samples: 527515270. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:30,210][25689] Avg episode reward: [(0, '-30.525')] [2022-07-10 01:51:30,698][26022] Updated weights on worker 0-0, policy_version 515153 (0.00088) [2022-07-10 01:51:32,533][26022] Updated weights on worker 0-0, policy_version 515163 (0.00091) [2022-07-10 01:51:34,313][26022] Updated weights on worker 0-0, policy_version 515173 (0.00523) [2022-07-10 01:51:35,217][25689] Fps is (10 sec: 5408.4, 60 sec: 5546.5, 300 sec: 5644.1). Total num frames: 527541248. Throughput: 0: 5944.2. Samples: 527548878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:35,218][25689] Avg episode reward: [(0, '-32.621')] [2022-07-10 01:51:36,020][26022] Updated weights on worker 0-0, policy_version 515183 (0.00084) [2022-07-10 01:51:37,896][26022] Updated weights on worker 0-0, policy_version 515193 (0.00090) [2022-07-10 01:51:39,379][26022] Updated weights on worker 0-0, policy_version 515203 (0.00086) [2022-07-10 01:51:40,219][25689] Fps is (10 sec: 5831.1, 60 sec: 5563.5, 300 sec: 5655.9). Total num frames: 527571968. Throughput: 0: 5102.4. Samples: 527566216. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:40,221][25689] Avg episode reward: [(0, '-33.025')] [2022-07-10 01:51:41,516][26022] Updated weights on worker 0-0, policy_version 515213 (0.00090) [2022-07-10 01:51:43,116][26022] Updated weights on worker 0-0, policy_version 515223 (0.00086) [2022-07-10 01:51:45,038][26022] Updated weights on worker 0-0, policy_version 515233 (0.00085) [2022-07-10 01:51:45,271][25689] Fps is (10 sec: 5805.8, 60 sec: 5579.2, 300 sec: 5648.6). Total num frames: 527599616. Throughput: 0: 5944.7. Samples: 527600398. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 01:51:45,271][25689] Avg episode reward: [(0, '-32.957')] [2022-07-10 01:51:46,751][26022] Updated weights on worker 0-0, policy_version 515243 (0.00082) [2022-07-10 01:51:48,531][26022] Updated weights on worker 0-0, policy_version 515253 (0.00085) [2022-07-10 01:51:49,258][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:51:49,265][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000515257_527623168.pth [2022-07-10 01:51:49,266][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000513266_525584384.pth [2022-07-10 01:51:50,283][25689] Fps is (10 sec: 5596.2, 60 sec: 5564.2, 300 sec: 5646.1). Total num frames: 527628288. Throughput: 0: 5960.9. Samples: 527634928. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:51:50,283][25689] Avg episode reward: [(0, '-32.972')] [2022-07-10 01:51:50,514][26022] Updated weights on worker 0-0, policy_version 515263 (0.00082) [2022-07-10 01:51:52,088][26022] Updated weights on worker 0-0, policy_version 515273 (0.00085) [2022-07-10 01:51:53,837][26022] Updated weights on worker 0-0, policy_version 515283 (0.00084) [2022-07-10 01:51:55,295][25689] Fps is (10 sec: 5822.4, 60 sec: 5564.6, 300 sec: 5650.0). Total num frames: 527657984. Throughput: 0: 5147.1. Samples: 527652222. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:51:55,296][25689] Avg episode reward: [(0, '-33.535')] [2022-07-10 01:51:55,707][26022] Updated weights on worker 0-0, policy_version 515293 (0.00096) [2022-07-10 01:51:57,471][26022] Updated weights on worker 0-0, policy_version 515303 (0.00086) [2022-07-10 01:51:59,246][26022] Updated weights on worker 0-0, policy_version 515313 (0.00089) [2022-07-10 01:52:00,303][25689] Fps is (10 sec: 5723.1, 60 sec: 5548.1, 300 sec: 5657.8). Total num frames: 527685632. Throughput: 0: 5996.0. Samples: 527686638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:00,303][25689] Avg episode reward: [(0, '-34.041')] [2022-07-10 01:52:01,056][26022] Updated weights on worker 0-0, policy_version 515323 (0.00092) [2022-07-10 01:52:03,317][26022] Updated weights on worker 0-0, policy_version 515333 (0.00090) [2022-07-10 01:52:05,044][26022] Updated weights on worker 0-0, policy_version 515343 (0.00090) [2022-07-10 01:52:05,351][25689] Fps is (10 sec: 5397.4, 60 sec: 5584.6, 300 sec: 5648.0). Total num frames: 527712256. Throughput: 0: 5876.3. Samples: 527718394. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:05,351][25689] Avg episode reward: [(0, '-33.869')] [2022-07-10 01:52:06,782][26022] Updated weights on worker 0-0, policy_version 515353 (0.00084) [2022-07-10 01:52:08,724][26022] Updated weights on worker 0-0, policy_version 515363 (0.00093) [2022-07-10 01:52:10,367][25689] Fps is (10 sec: 5494.4, 60 sec: 5641.7, 300 sec: 5649.3). Total num frames: 527740928. Throughput: 0: 5009.0. Samples: 527735528. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:10,367][25689] Avg episode reward: [(0, '-32.745')] [2022-07-10 01:52:10,502][26022] Updated weights on worker 0-0, policy_version 515373 (0.00086) [2022-07-10 01:52:12,184][26022] Updated weights on worker 0-0, policy_version 515383 (0.00094) [2022-07-10 01:52:14,171][26022] Updated weights on worker 0-0, policy_version 515393 (0.00091) [2022-07-10 01:52:15,387][25689] Fps is (10 sec: 5815.9, 60 sec: 5675.7, 300 sec: 5652.8). Total num frames: 527770624. Throughput: 0: 5852.2. Samples: 527769800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:15,387][25689] Avg episode reward: [(0, '-32.619')] [2022-07-10 01:52:15,617][26022] Updated weights on worker 0-0, policy_version 515403 (0.00090) [2022-07-10 01:52:17,798][26022] Updated weights on worker 0-0, policy_version 515413 (0.00093) [2022-07-10 01:52:19,269][26022] Updated weights on worker 0-0, policy_version 515423 (0.00084) [2022-07-10 01:52:20,419][25689] Fps is (10 sec: 5602.7, 60 sec: 5639.3, 300 sec: 5647.4). Total num frames: 527797248. Throughput: 0: 5839.4. Samples: 527804106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:20,419][25689] Avg episode reward: [(0, '-33.056')] [2022-07-10 01:52:21,268][26022] Updated weights on worker 0-0, policy_version 515433 (0.00081) [2022-07-10 01:52:22,982][26022] Updated weights on worker 0-0, policy_version 515443 (0.00083) [2022-07-10 01:52:25,030][26022] Updated weights on worker 0-0, policy_version 515453 (0.00096) [2022-07-10 01:52:25,494][25689] Fps is (10 sec: 5572.0, 60 sec: 5636.8, 300 sec: 5647.0). Total num frames: 527826944. Throughput: 0: 5104.5. Samples: 527821218. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:25,495][25689] Avg episode reward: [(0, '-31.940')] [2022-07-10 01:52:26,465][26022] Updated weights on worker 0-0, policy_version 515463 (0.00087) [2022-07-10 01:52:28,702][26022] Updated weights on worker 0-0, policy_version 515473 (0.00076) [2022-07-10 01:52:30,022][26022] Updated weights on worker 0-0, policy_version 515483 (0.00088) [2022-07-10 01:52:30,580][25689] Fps is (10 sec: 5945.6, 60 sec: 5699.1, 300 sec: 5656.5). Total num frames: 527857664. Throughput: 0: 5934.3. Samples: 527855482. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:30,581][25689] Avg episode reward: [(0, '-31.704')] [2022-07-10 01:52:32,261][26022] Updated weights on worker 0-0, policy_version 515493 (0.00087) [2022-07-10 01:52:33,729][26022] Updated weights on worker 0-0, policy_version 515503 (0.00090) [2022-07-10 01:52:35,613][25689] Fps is (10 sec: 5667.0, 60 sec: 5679.8, 300 sec: 5642.6). Total num frames: 527884288. Throughput: 0: 5914.3. Samples: 527889426. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:35,615][25689] Avg episode reward: [(0, '-32.899')] [2022-07-10 01:52:35,763][26022] Updated weights on worker 0-0, policy_version 515513 (0.00087) [2022-07-10 01:52:37,611][26022] Updated weights on worker 0-0, policy_version 515523 (0.00088) [2022-07-10 01:52:39,301][26022] Updated weights on worker 0-0, policy_version 515533 (0.00086) [2022-07-10 01:52:40,630][25689] Fps is (10 sec: 5400.2, 60 sec: 5627.5, 300 sec: 5644.6). Total num frames: 527911936. Throughput: 0: 5055.4. Samples: 527906284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:40,632][25689] Avg episode reward: [(0, '-32.053')] [2022-07-10 01:52:40,926][26022] Updated weights on worker 0-0, policy_version 515543 (0.00092) [2022-07-10 01:52:43,076][26022] Updated weights on worker 0-0, policy_version 515553 (0.00100) [2022-07-10 01:52:44,496][26022] Updated weights on worker 0-0, policy_version 515563 (0.00080) [2022-07-10 01:52:45,681][25689] Fps is (10 sec: 5797.8, 60 sec: 5678.5, 300 sec: 5644.2). Total num frames: 527942656. Throughput: 0: 5913.0. Samples: 527940580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:45,681][25689] Avg episode reward: [(0, '-31.575')] [2022-07-10 01:52:46,534][26022] Updated weights on worker 0-0, policy_version 515573 (0.00086) [2022-07-10 01:52:48,262][26022] Updated weights on worker 0-0, policy_version 515583 (0.00088) [2022-07-10 01:52:50,285][26022] Updated weights on worker 0-0, policy_version 515593 (0.00089) [2022-07-10 01:52:50,683][25689] Fps is (10 sec: 5704.5, 60 sec: 5645.6, 300 sec: 5645.4). Total num frames: 527969280. Throughput: 0: 5909.0. Samples: 527974268. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:50,683][25689] Avg episode reward: [(0, '-30.990')] [2022-07-10 01:52:51,952][26022] Updated weights on worker 0-0, policy_version 515603 (0.00091) [2022-07-10 01:52:53,759][26022] Updated weights on worker 0-0, policy_version 515613 (0.00095) [2022-07-10 01:52:55,548][26022] Updated weights on worker 0-0, policy_version 515623 (0.00081) [2022-07-10 01:52:55,702][25689] Fps is (10 sec: 5517.7, 60 sec: 5628.0, 300 sec: 5643.5). Total num frames: 527997952. Throughput: 0: 5064.3. Samples: 527991162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:52:55,703][25689] Avg episode reward: [(0, '-31.848')] [2022-07-10 01:52:57,517][26022] Updated weights on worker 0-0, policy_version 515633 (0.00095) [2022-07-10 01:52:59,191][26022] Updated weights on worker 0-0, policy_version 515643 (0.00089) [2022-07-10 01:53:00,717][25689] Fps is (10 sec: 5714.9, 60 sec: 5644.2, 300 sec: 5651.7). Total num frames: 528026624. Throughput: 0: 5919.5. Samples: 528025186. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:00,717][25689] Avg episode reward: [(0, '-31.024')] [2022-07-10 01:53:01,075][26022] Updated weights on worker 0-0, policy_version 515653 (0.00086) [2022-07-10 01:53:03,199][26022] Updated weights on worker 0-0, policy_version 515663 (0.00089) [2022-07-10 01:53:04,973][26022] Updated weights on worker 0-0, policy_version 515673 (0.00112) [2022-07-10 01:53:05,836][25689] Fps is (10 sec: 5456.8, 60 sec: 5637.6, 300 sec: 5646.4). Total num frames: 528053248. Throughput: 0: 5776.6. Samples: 528057008. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:05,836][25689] Avg episode reward: [(0, '-29.566')] [2022-07-10 01:53:06,849][26022] Updated weights on worker 0-0, policy_version 515683 (0.00080) [2022-07-10 01:53:08,727][26022] Updated weights on worker 0-0, policy_version 515693 (0.00085) [2022-07-10 01:53:10,500][26022] Updated weights on worker 0-0, policy_version 515703 (0.00090) [2022-07-10 01:53:10,878][25689] Fps is (10 sec: 5442.2, 60 sec: 5635.2, 300 sec: 5642.4). Total num frames: 528081920. Throughput: 0: 5792.4. Samples: 528091244. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:10,878][25689] Avg episode reward: [(0, '-29.174')] [2022-07-10 01:53:12,096][26022] Updated weights on worker 0-0, policy_version 515713 (0.00090) [2022-07-10 01:53:13,999][26022] Updated weights on worker 0-0, policy_version 515723 (0.00090) [2022-07-10 01:53:15,707][26022] Updated weights on worker 0-0, policy_version 515733 (0.00103) [2022-07-10 01:53:15,897][25689] Fps is (10 sec: 5699.2, 60 sec: 5618.3, 300 sec: 5642.2). Total num frames: 528110592. Throughput: 0: 5799.9. Samples: 528108292. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:15,898][25689] Avg episode reward: [(0, '-30.052')] [2022-07-10 01:53:17,832][26022] Updated weights on worker 0-0, policy_version 515743 (0.00090) [2022-07-10 01:53:19,423][26022] Updated weights on worker 0-0, policy_version 515753 (0.00084) [2022-07-10 01:53:20,959][25689] Fps is (10 sec: 5688.2, 60 sec: 5649.4, 300 sec: 5642.9). Total num frames: 528139264. Throughput: 0: 5782.3. Samples: 528142230. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:20,964][25689] Avg episode reward: [(0, '-29.822')] [2022-07-10 01:53:21,362][26022] Updated weights on worker 0-0, policy_version 515763 (0.00088) [2022-07-10 01:53:23,272][26022] Updated weights on worker 0-0, policy_version 515773 (0.00092) [2022-07-10 01:53:24,907][26022] Updated weights on worker 0-0, policy_version 515783 (0.00084) [2022-07-10 01:53:26,009][25689] Fps is (10 sec: 5569.9, 60 sec: 5618.0, 300 sec: 5642.4). Total num frames: 528166912. Throughput: 0: 5907.1. Samples: 528176174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:26,010][25689] Avg episode reward: [(0, '-29.542')] [2022-07-10 01:53:26,929][26022] Updated weights on worker 0-0, policy_version 515793 (0.00100) [2022-07-10 01:53:28,560][26022] Updated weights on worker 0-0, policy_version 515803 (0.00081) [2022-07-10 01:53:30,436][26022] Updated weights on worker 0-0, policy_version 515813 (0.00089) [2022-07-10 01:53:31,020][25689] Fps is (10 sec: 5597.7, 60 sec: 5591.0, 300 sec: 5639.2). Total num frames: 528195584. Throughput: 0: 5046.0. Samples: 528192882. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:31,020][25689] Avg episode reward: [(0, '-30.137')] [2022-07-10 01:53:32,272][26022] Updated weights on worker 0-0, policy_version 515823 (0.00103) [2022-07-10 01:53:34,148][26022] Updated weights on worker 0-0, policy_version 515833 (0.00084) [2022-07-10 01:53:35,718][26022] Updated weights on worker 0-0, policy_version 515843 (0.00088) [2022-07-10 01:53:36,025][25689] Fps is (10 sec: 5725.0, 60 sec: 5627.5, 300 sec: 5639.2). Total num frames: 528224256. Throughput: 0: 5860.9. Samples: 528226260. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:36,026][25689] Avg episode reward: [(0, '-31.376')] [2022-07-10 01:53:37,819][26022] Updated weights on worker 0-0, policy_version 515853 (0.00066) [2022-07-10 01:53:39,311][26022] Updated weights on worker 0-0, policy_version 515863 (0.00087) [2022-07-10 01:53:41,032][25689] Fps is (10 sec: 5625.4, 60 sec: 5628.5, 300 sec: 5637.2). Total num frames: 528251904. Throughput: 0: 5892.9. Samples: 528260518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:41,032][25689] Avg episode reward: [(0, '-31.241')] [2022-07-10 01:53:41,460][26022] Updated weights on worker 0-0, policy_version 515873 (0.00090) [2022-07-10 01:53:42,952][26022] Updated weights on worker 0-0, policy_version 515883 (0.00091) [2022-07-10 01:53:44,993][26022] Updated weights on worker 0-0, policy_version 515893 (0.00085) [2022-07-10 01:53:46,074][25689] Fps is (10 sec: 5706.4, 60 sec: 5612.2, 300 sec: 5643.4). Total num frames: 528281600. Throughput: 0: 5048.7. Samples: 528277478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:46,075][25689] Avg episode reward: [(0, '-30.786')] [2022-07-10 01:53:46,728][26022] Updated weights on worker 0-0, policy_version 515903 (0.00091) [2022-07-10 01:53:48,586][26022] Updated weights on worker 0-0, policy_version 515913 (0.00081) [2022-07-10 01:53:49,430][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:53:49,450][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000515919_528301056.pth [2022-07-10 01:53:49,451][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000513933_526267392.pth [2022-07-10 01:53:50,232][26022] Updated weights on worker 0-0, policy_version 515923 (0.00081) [2022-07-10 01:53:51,094][25689] Fps is (10 sec: 5597.2, 60 sec: 5610.6, 300 sec: 5633.3). Total num frames: 528308224. Throughput: 0: 5936.5. Samples: 528312050. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:51,095][25689] Avg episode reward: [(0, '-29.791')] [2022-07-10 01:53:52,090][26022] Updated weights on worker 0-0, policy_version 515933 (0.00098) [2022-07-10 01:53:53,891][26022] Updated weights on worker 0-0, policy_version 515943 (0.00083) [2022-07-10 01:53:55,735][26022] Updated weights on worker 0-0, policy_version 515953 (0.00088) [2022-07-10 01:53:56,127][25689] Fps is (10 sec: 5604.9, 60 sec: 5626.7, 300 sec: 5637.4). Total num frames: 528337920. Throughput: 0: 5979.9. Samples: 528346440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:53:56,127][25689] Avg episode reward: [(0, '-29.397')] [2022-07-10 01:53:57,535][26022] Updated weights on worker 0-0, policy_version 515963 (0.00093) [2022-07-10 01:53:59,380][26022] Updated weights on worker 0-0, policy_version 515973 (0.00089) [2022-07-10 01:54:01,107][26022] Updated weights on worker 0-0, policy_version 515983 (0.00085) [2022-07-10 01:54:01,131][25689] Fps is (10 sec: 5815.2, 60 sec: 5627.3, 300 sec: 5645.1). Total num frames: 528366592. Throughput: 0: 5112.4. Samples: 528363274. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:01,131][25689] Avg episode reward: [(0, '-30.171')] [2022-07-10 01:54:03,399][26022] Updated weights on worker 0-0, policy_version 515993 (0.00088) [2022-07-10 01:54:05,292][26022] Updated weights on worker 0-0, policy_version 516003 (0.00095) [2022-07-10 01:54:06,268][25689] Fps is (10 sec: 5349.6, 60 sec: 5608.7, 300 sec: 5632.7). Total num frames: 528392192. Throughput: 0: 5829.2. Samples: 528395188. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:06,270][25689] Avg episode reward: [(0, '-29.925')] [2022-07-10 01:54:06,902][26022] Updated weights on worker 0-0, policy_version 516013 (0.00092) [2022-07-10 01:54:08,647][26022] Updated weights on worker 0-0, policy_version 516023 (0.00085) [2022-07-10 01:54:10,474][26022] Updated weights on worker 0-0, policy_version 516033 (0.00090) [2022-07-10 01:54:11,299][25689] Fps is (10 sec: 5438.4, 60 sec: 5626.6, 300 sec: 5639.2). Total num frames: 528421888. Throughput: 0: 5803.4. Samples: 528429308. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:11,300][25689] Avg episode reward: [(0, '-29.584')] [2022-07-10 01:54:12,396][26022] Updated weights on worker 0-0, policy_version 516043 (0.00087) [2022-07-10 01:54:14,029][26022] Updated weights on worker 0-0, policy_version 516053 (0.00616) [2022-07-10 01:54:15,841][26022] Updated weights on worker 0-0, policy_version 516063 (0.00094) [2022-07-10 01:54:16,323][25689] Fps is (10 sec: 5702.5, 60 sec: 5609.2, 300 sec: 5632.1). Total num frames: 528449536. Throughput: 0: 4952.0. Samples: 528446474. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:16,324][25689] Avg episode reward: [(0, '-29.695')] [2022-07-10 01:54:17,712][26022] Updated weights on worker 0-0, policy_version 516073 (0.00086) [2022-07-10 01:54:19,519][26022] Updated weights on worker 0-0, policy_version 516083 (0.00085) [2022-07-10 01:54:21,299][26022] Updated weights on worker 0-0, policy_version 516093 (0.00085) [2022-07-10 01:54:21,331][25689] Fps is (10 sec: 5716.1, 60 sec: 5631.2, 300 sec: 5633.0). Total num frames: 528479232. Throughput: 0: 5808.4. Samples: 528480602. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:21,331][25689] Avg episode reward: [(0, '-29.506')] [2022-07-10 01:54:23,135][26022] Updated weights on worker 0-0, policy_version 516103 (0.00082) [2022-07-10 01:54:24,811][26022] Updated weights on worker 0-0, policy_version 516113 (0.00089) [2022-07-10 01:54:26,412][25689] Fps is (10 sec: 5785.5, 60 sec: 5645.3, 300 sec: 5631.7). Total num frames: 528507904. Throughput: 0: 5932.9. Samples: 528514704. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:26,413][25689] Avg episode reward: [(0, '-29.243')] [2022-07-10 01:54:26,776][26022] Updated weights on worker 0-0, policy_version 516123 (0.00090) [2022-07-10 01:54:28,579][26022] Updated weights on worker 0-0, policy_version 516133 (0.00083) [2022-07-10 01:54:30,246][26022] Updated weights on worker 0-0, policy_version 516143 (0.00089) [2022-07-10 01:54:31,511][25689] Fps is (10 sec: 5532.5, 60 sec: 5620.2, 300 sec: 5626.6). Total num frames: 528535552. Throughput: 0: 5063.7. Samples: 528531656. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:31,511][25689] Avg episode reward: [(0, '-29.670')] [2022-07-10 01:54:32,164][26022] Updated weights on worker 0-0, policy_version 516153 (0.01358) [2022-07-10 01:54:33,754][26022] Updated weights on worker 0-0, policy_version 516163 (0.00085) [2022-07-10 01:54:35,848][26022] Updated weights on worker 0-0, policy_version 516173 (0.00090) [2022-07-10 01:54:36,545][25689] Fps is (10 sec: 5659.2, 60 sec: 5634.4, 300 sec: 5633.1). Total num frames: 528565248. Throughput: 0: 5905.3. Samples: 528565888. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:36,545][25689] Avg episode reward: [(0, '-30.582')] [2022-07-10 01:54:37,556][26022] Updated weights on worker 0-0, policy_version 516183 (0.00614) [2022-07-10 01:54:39,455][26022] Updated weights on worker 0-0, policy_version 516193 (0.00090) [2022-07-10 01:54:41,007][26022] Updated weights on worker 0-0, policy_version 516203 (0.00088) [2022-07-10 01:54:41,577][25689] Fps is (10 sec: 5798.1, 60 sec: 5648.9, 300 sec: 5627.7). Total num frames: 528593920. Throughput: 0: 5911.0. Samples: 528600280. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 01:54:41,578][25689] Avg episode reward: [(0, '-31.177')] [2022-07-10 01:54:42,864][26022] Updated weights on worker 0-0, policy_version 516213 (0.00086) [2022-07-10 01:54:44,621][26022] Updated weights on worker 0-0, policy_version 516223 (0.00084) [2022-07-10 01:54:46,375][26022] Updated weights on worker 0-0, policy_version 516233 (0.00094) [2022-07-10 01:54:46,638][25689] Fps is (10 sec: 5884.5, 60 sec: 5664.1, 300 sec: 5637.3). Total num frames: 528624640. Throughput: 0: 5088.1. Samples: 528617614. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:54:46,638][25689] Avg episode reward: [(0, '-32.279')] [2022-07-10 01:54:48,275][26022] Updated weights on worker 0-0, policy_version 516243 (0.00092) [2022-07-10 01:54:49,847][26022] Updated weights on worker 0-0, policy_version 516253 (0.00087) [2022-07-10 01:54:51,667][25689] Fps is (10 sec: 5784.8, 60 sec: 5680.1, 300 sec: 5637.8). Total num frames: 528652288. Throughput: 0: 5993.8. Samples: 528652470. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:54:51,668][25689] Avg episode reward: [(0, '-33.849')] [2022-07-10 01:54:51,772][26022] Updated weights on worker 0-0, policy_version 516263 (0.00093) [2022-07-10 01:54:53,235][26022] Updated weights on worker 0-0, policy_version 516273 (0.00078) [2022-07-10 01:54:55,194][26022] Updated weights on worker 0-0, policy_version 516283 (0.00094) [2022-07-10 01:54:56,668][25689] Fps is (10 sec: 5716.9, 60 sec: 5682.7, 300 sec: 5634.7). Total num frames: 528681984. Throughput: 0: 6040.7. Samples: 528687448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:54:56,669][25689] Avg episode reward: [(0, '-32.704')] [2022-07-10 01:54:57,100][26022] Updated weights on worker 0-0, policy_version 516293 (0.00091) [2022-07-10 01:54:58,820][26022] Updated weights on worker 0-0, policy_version 516303 (0.00091) [2022-07-10 01:55:00,464][26022] Updated weights on worker 0-0, policy_version 516313 (0.00096) [2022-07-10 01:55:01,671][25689] Fps is (10 sec: 5630.1, 60 sec: 5649.4, 300 sec: 5640.8). Total num frames: 528708608. Throughput: 0: 5195.9. Samples: 528704684. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:01,671][25689] Avg episode reward: [(0, '-32.878')] [2022-07-10 01:55:02,756][26022] Updated weights on worker 0-0, policy_version 516323 (0.00092) [2022-07-10 01:55:04,423][26022] Updated weights on worker 0-0, policy_version 516333 (0.00087) [2022-07-10 01:55:06,386][26022] Updated weights on worker 0-0, policy_version 516343 (0.00086) [2022-07-10 01:55:06,719][25689] Fps is (10 sec: 5502.1, 60 sec: 5708.6, 300 sec: 5640.1). Total num frames: 528737280. Throughput: 0: 5950.4. Samples: 528737102. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:06,719][25689] Avg episode reward: [(0, '-32.388')] [2022-07-10 01:55:07,972][26022] Updated weights on worker 0-0, policy_version 516353 (0.00087) [2022-07-10 01:55:09,737][26022] Updated weights on worker 0-0, policy_version 516363 (0.00087) [2022-07-10 01:55:11,608][26022] Updated weights on worker 0-0, policy_version 516373 (0.00085) [2022-07-10 01:55:11,724][25689] Fps is (10 sec: 5703.8, 60 sec: 5694.0, 300 sec: 5640.4). Total num frames: 528765952. Throughput: 0: 5946.3. Samples: 528771734. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:11,725][25689] Avg episode reward: [(0, '-31.638')] [2022-07-10 01:55:13,402][26022] Updated weights on worker 0-0, policy_version 516383 (0.00085) [2022-07-10 01:55:15,068][26022] Updated weights on worker 0-0, policy_version 516393 (0.00098) [2022-07-10 01:55:16,732][25689] Fps is (10 sec: 5829.2, 60 sec: 5729.6, 300 sec: 5647.6). Total num frames: 528795648. Throughput: 0: 5065.8. Samples: 528789084. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:16,733][25689] Avg episode reward: [(0, '-31.573')] [2022-07-10 01:55:16,807][26022] Updated weights on worker 0-0, policy_version 516403 (0.00078) [2022-07-10 01:55:18,699][26022] Updated weights on worker 0-0, policy_version 516413 (0.00090) [2022-07-10 01:55:20,557][26022] Updated weights on worker 0-0, policy_version 516423 (0.00083) [2022-07-10 01:55:21,833][25689] Fps is (10 sec: 5774.1, 60 sec: 5703.7, 300 sec: 5640.3). Total num frames: 528824320. Throughput: 0: 5891.1. Samples: 528823462. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:21,833][25689] Avg episode reward: [(0, '-32.102')] [2022-07-10 01:55:22,244][26022] Updated weights on worker 0-0, policy_version 516433 (0.00086) [2022-07-10 01:55:24,012][26022] Updated weights on worker 0-0, policy_version 516443 (0.00088) [2022-07-10 01:55:25,867][26022] Updated weights on worker 0-0, policy_version 516453 (0.00083) [2022-07-10 01:55:26,871][25689] Fps is (10 sec: 5655.7, 60 sec: 5707.8, 300 sec: 5639.9). Total num frames: 528852992. Throughput: 0: 6009.7. Samples: 528858212. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:26,871][25689] Avg episode reward: [(0, '-31.495')] [2022-07-10 01:55:27,563][26022] Updated weights on worker 0-0, policy_version 516463 (0.00084) [2022-07-10 01:55:29,256][26022] Updated weights on worker 0-0, policy_version 516473 (0.00081) [2022-07-10 01:55:31,124][26022] Updated weights on worker 0-0, policy_version 516483 (0.00087) [2022-07-10 01:55:31,881][25689] Fps is (10 sec: 5707.0, 60 sec: 5733.2, 300 sec: 5640.0). Total num frames: 528881664. Throughput: 0: 5147.6. Samples: 528875492. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:31,881][25689] Avg episode reward: [(0, '-32.297')] [2022-07-10 01:55:32,893][26022] Updated weights on worker 0-0, policy_version 516493 (0.00080) [2022-07-10 01:55:34,748][26022] Updated weights on worker 0-0, policy_version 516503 (0.00086) [2022-07-10 01:55:36,698][26022] Updated weights on worker 0-0, policy_version 516513 (0.00082) [2022-07-10 01:55:36,908][25689] Fps is (10 sec: 5815.1, 60 sec: 5733.8, 300 sec: 5639.5). Total num frames: 528911360. Throughput: 0: 5971.8. Samples: 528909574. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:36,909][25689] Avg episode reward: [(0, '-32.622')] [2022-07-10 01:55:38,263][26022] Updated weights on worker 0-0, policy_version 516523 (0.00085) [2022-07-10 01:55:40,246][26022] Updated weights on worker 0-0, policy_version 516533 (0.00090) [2022-07-10 01:55:41,913][25689] Fps is (10 sec: 5716.2, 60 sec: 5719.5, 300 sec: 5643.6). Total num frames: 528939008. Throughput: 0: 5992.7. Samples: 528943794. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:41,913][25689] Avg episode reward: [(0, '-31.445')] [2022-07-10 01:55:41,932][26022] Updated weights on worker 0-0, policy_version 516543 (0.00096) [2022-07-10 01:55:43,904][26022] Updated weights on worker 0-0, policy_version 516553 (0.00090) [2022-07-10 01:55:45,529][26022] Updated weights on worker 0-0, policy_version 516563 (0.00092) [2022-07-10 01:55:46,988][25689] Fps is (10 sec: 5587.5, 60 sec: 5684.2, 300 sec: 5639.3). Total num frames: 528967680. Throughput: 0: 5113.9. Samples: 528961088. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:46,988][25689] Avg episode reward: [(0, '-30.887')] [2022-07-10 01:55:47,375][26022] Updated weights on worker 0-0, policy_version 516573 (0.00094) [2022-07-10 01:55:49,227][26022] Updated weights on worker 0-0, policy_version 516583 (0.00084) [2022-07-10 01:55:49,615][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:55:49,629][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000516586_528984064.pth [2022-07-10 01:55:49,629][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000514598_526948352.pth [2022-07-10 01:55:50,972][26022] Updated weights on worker 0-0, policy_version 516593 (0.00086) [2022-07-10 01:55:52,007][25689] Fps is (10 sec: 5681.0, 60 sec: 5702.2, 300 sec: 5635.9). Total num frames: 528996352. Throughput: 0: 5965.2. Samples: 528995546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:52,007][25689] Avg episode reward: [(0, '-31.710')] [2022-07-10 01:55:52,757][26022] Updated weights on worker 0-0, policy_version 516603 (0.00083) [2022-07-10 01:55:54,613][26022] Updated weights on worker 0-0, policy_version 516613 (0.00085) [2022-07-10 01:55:56,394][26022] Updated weights on worker 0-0, policy_version 516623 (0.00088) [2022-07-10 01:55:57,020][25689] Fps is (10 sec: 5920.1, 60 sec: 5718.0, 300 sec: 5642.7). Total num frames: 529027072. Throughput: 0: 5975.5. Samples: 529029754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:55:57,021][25689] Avg episode reward: [(0, '-31.265')] [2022-07-10 01:55:58,336][26022] Updated weights on worker 0-0, policy_version 516633 (0.00088) [2022-07-10 01:55:59,760][26022] Updated weights on worker 0-0, policy_version 516643 (0.00084) [2022-07-10 01:56:01,815][26022] Updated weights on worker 0-0, policy_version 516653 (0.00091) [2022-07-10 01:56:02,038][25689] Fps is (10 sec: 5716.7, 60 sec: 5716.5, 300 sec: 5650.7). Total num frames: 529053696. Throughput: 0: 5132.8. Samples: 529047092. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:02,038][25689] Avg episode reward: [(0, '-30.180')] [2022-07-10 01:56:03,820][26022] Updated weights on worker 0-0, policy_version 516663 (0.00085) [2022-07-10 01:56:05,598][26022] Updated weights on worker 0-0, policy_version 516673 (0.00094) [2022-07-10 01:56:07,147][25689] Fps is (10 sec: 5258.1, 60 sec: 5676.8, 300 sec: 5653.6). Total num frames: 529080320. Throughput: 0: 5861.5. Samples: 529079252. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:07,148][25689] Avg episode reward: [(0, '-31.427')] [2022-07-10 01:56:07,735][26022] Updated weights on worker 0-0, policy_version 516683 (0.00090) [2022-07-10 01:56:09,302][26022] Updated weights on worker 0-0, policy_version 516693 (0.00088) [2022-07-10 01:56:11,118][26022] Updated weights on worker 0-0, policy_version 516703 (0.00309) [2022-07-10 01:56:12,161][25689] Fps is (10 sec: 5563.3, 60 sec: 5693.0, 300 sec: 5660.6). Total num frames: 529110016. Throughput: 0: 5840.4. Samples: 529113256. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:12,162][25689] Avg episode reward: [(0, '-31.172')] [2022-07-10 01:56:13,108][26022] Updated weights on worker 0-0, policy_version 516713 (0.00078) [2022-07-10 01:56:14,608][26022] Updated weights on worker 0-0, policy_version 516723 (0.00090) [2022-07-10 01:56:16,451][26022] Updated weights on worker 0-0, policy_version 516733 (0.00082) [2022-07-10 01:56:17,163][25689] Fps is (10 sec: 5827.8, 60 sec: 5676.6, 300 sec: 5660.7). Total num frames: 529138688. Throughput: 0: 5885.2. Samples: 529148296. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:17,163][25689] Avg episode reward: [(0, '-29.715')] [2022-07-10 01:56:18,217][26022] Updated weights on worker 0-0, policy_version 516743 (0.00086) [2022-07-10 01:56:19,882][26022] Updated weights on worker 0-0, policy_version 516753 (0.00085) [2022-07-10 01:56:21,816][26022] Updated weights on worker 0-0, policy_version 516763 (0.00086) [2022-07-10 01:56:22,187][25689] Fps is (10 sec: 5719.8, 60 sec: 5683.8, 300 sec: 5657.8). Total num frames: 529167360. Throughput: 0: 5878.0. Samples: 529165528. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:22,187][25689] Avg episode reward: [(0, '-31.133')] [2022-07-10 01:56:23,448][26022] Updated weights on worker 0-0, policy_version 516773 (0.00086) [2022-07-10 01:56:25,403][26022] Updated weights on worker 0-0, policy_version 516783 (0.00090) [2022-07-10 01:56:27,184][26022] Updated weights on worker 0-0, policy_version 516793 (0.00108) [2022-07-10 01:56:27,298][25689] Fps is (10 sec: 5657.6, 60 sec: 5676.9, 300 sec: 5663.1). Total num frames: 529196032. Throughput: 0: 6007.3. Samples: 529200306. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:27,299][25689] Avg episode reward: [(0, '-31.269')] [2022-07-10 01:56:28,798][26022] Updated weights on worker 0-0, policy_version 516803 (0.00084) [2022-07-10 01:56:30,881][26022] Updated weights on worker 0-0, policy_version 516813 (0.00083) [2022-07-10 01:56:32,275][26022] Updated weights on worker 0-0, policy_version 516823 (0.00089) [2022-07-10 01:56:32,367][25689] Fps is (10 sec: 5834.3, 60 sec: 5705.3, 300 sec: 5672.2). Total num frames: 529226752. Throughput: 0: 6015.2. Samples: 529234796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:32,368][25689] Avg episode reward: [(0, '-31.464')] [2022-07-10 01:56:34,263][26022] Updated weights on worker 0-0, policy_version 516833 (0.00087) [2022-07-10 01:56:35,935][26022] Updated weights on worker 0-0, policy_version 516843 (0.00082) [2022-07-10 01:56:37,417][25689] Fps is (10 sec: 5768.1, 60 sec: 5669.2, 300 sec: 5661.0). Total num frames: 529254400. Throughput: 0: 5126.7. Samples: 529252136. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:37,419][25689] Avg episode reward: [(0, '-30.566')] [2022-07-10 01:56:37,697][26022] Updated weights on worker 0-0, policy_version 516853 (0.00085) [2022-07-10 01:56:39,863][26022] Updated weights on worker 0-0, policy_version 516863 (0.00085) [2022-07-10 01:56:41,233][26022] Updated weights on worker 0-0, policy_version 516873 (0.01110) [2022-07-10 01:56:42,450][25689] Fps is (10 sec: 5686.8, 60 sec: 5700.4, 300 sec: 5668.2). Total num frames: 529284096. Throughput: 0: 5956.5. Samples: 529286228. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:42,452][25689] Avg episode reward: [(0, '-30.974')] [2022-07-10 01:56:43,202][26022] Updated weights on worker 0-0, policy_version 516883 (0.00084) [2022-07-10 01:56:45,015][26022] Updated weights on worker 0-0, policy_version 516893 (0.00080) [2022-07-10 01:56:46,822][26022] Updated weights on worker 0-0, policy_version 516903 (0.00092) [2022-07-10 01:56:47,511][25689] Fps is (10 sec: 5782.5, 60 sec: 5701.7, 300 sec: 5667.3). Total num frames: 529312768. Throughput: 0: 5961.7. Samples: 529320810. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:47,512][25689] Avg episode reward: [(0, '-30.395')] [2022-07-10 01:56:48,746][26022] Updated weights on worker 0-0, policy_version 516913 (0.00084) [2022-07-10 01:56:50,406][26022] Updated weights on worker 0-0, policy_version 516923 (0.00077) [2022-07-10 01:56:52,107][26022] Updated weights on worker 0-0, policy_version 516933 (0.00094) [2022-07-10 01:56:52,532][25689] Fps is (10 sec: 5789.7, 60 sec: 5718.5, 300 sec: 5667.1). Total num frames: 529342464. Throughput: 0: 5122.4. Samples: 529338090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:52,532][25689] Avg episode reward: [(0, '-30.508')] [2022-07-10 01:56:53,947][26022] Updated weights on worker 0-0, policy_version 516943 (0.00082) [2022-07-10 01:56:55,635][26022] Updated weights on worker 0-0, policy_version 516953 (0.00080) [2022-07-10 01:56:57,554][25689] Fps is (10 sec: 5608.2, 60 sec: 5650.0, 300 sec: 5663.4). Total num frames: 529369088. Throughput: 0: 5967.6. Samples: 529372300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:56:57,554][25689] Avg episode reward: [(0, '-31.265')] [2022-07-10 01:56:57,587][26022] Updated weights on worker 0-0, policy_version 516963 (0.00083) [2022-07-10 01:56:59,195][26022] Updated weights on worker 0-0, policy_version 516973 (0.00087) [2022-07-10 01:57:01,208][26022] Updated weights on worker 0-0, policy_version 516983 (0.00094) [2022-07-10 01:57:02,576][25689] Fps is (10 sec: 5301.4, 60 sec: 5649.6, 300 sec: 5663.9). Total num frames: 529395712. Throughput: 0: 5904.9. Samples: 529405068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:02,576][25689] Avg episode reward: [(0, '-32.537')] [2022-07-10 01:57:03,117][26022] Updated weights on worker 0-0, policy_version 516993 (0.00082) [2022-07-10 01:57:05,095][26022] Updated weights on worker 0-0, policy_version 517003 (0.00092) [2022-07-10 01:57:06,708][26022] Updated weights on worker 0-0, policy_version 517013 (0.00091) [2022-07-10 01:57:07,684][25689] Fps is (10 sec: 5559.4, 60 sec: 5700.4, 300 sec: 5665.6). Total num frames: 529425408. Throughput: 0: 5005.9. Samples: 529421794. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:07,685][25689] Avg episode reward: [(0, '-32.403')] [2022-07-10 01:57:08,650][26022] Updated weights on worker 0-0, policy_version 517023 (0.00091) [2022-07-10 01:57:10,469][26022] Updated weights on worker 0-0, policy_version 517033 (0.00089) [2022-07-10 01:57:12,151][26022] Updated weights on worker 0-0, policy_version 517043 (0.00094) [2022-07-10 01:57:12,717][25689] Fps is (10 sec: 5856.4, 60 sec: 5698.7, 300 sec: 5665.3). Total num frames: 529455104. Throughput: 0: 5838.6. Samples: 529455944. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:12,718][25689] Avg episode reward: [(0, '-33.285')] [2022-07-10 01:57:13,851][26022] Updated weights on worker 0-0, policy_version 517053 (0.00090) [2022-07-10 01:57:15,660][26022] Updated weights on worker 0-0, policy_version 517063 (0.00093) [2022-07-10 01:57:17,410][26022] Updated weights on worker 0-0, policy_version 517073 (0.00113) [2022-07-10 01:57:17,721][25689] Fps is (10 sec: 5815.6, 60 sec: 5698.5, 300 sec: 5672.8). Total num frames: 529483776. Throughput: 0: 5875.0. Samples: 529490782. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:17,721][25689] Avg episode reward: [(0, '-32.410')] [2022-07-10 01:57:19,441][26022] Updated weights on worker 0-0, policy_version 517083 (0.00085) [2022-07-10 01:57:20,976][26022] Updated weights on worker 0-0, policy_version 517093 (0.00085) [2022-07-10 01:57:22,725][25689] Fps is (10 sec: 5627.4, 60 sec: 5683.4, 300 sec: 5667.2). Total num frames: 529511424. Throughput: 0: 5113.1. Samples: 529508094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:22,726][25689] Avg episode reward: [(0, '-32.242')] [2022-07-10 01:57:22,934][26022] Updated weights on worker 0-0, policy_version 517103 (0.00096) [2022-07-10 01:57:24,496][26022] Updated weights on worker 0-0, policy_version 517113 (0.00090) [2022-07-10 01:57:26,621][26022] Updated weights on worker 0-0, policy_version 517123 (0.00088) [2022-07-10 01:57:27,863][25689] Fps is (10 sec: 5654.1, 60 sec: 5697.9, 300 sec: 5662.8). Total num frames: 529541120. Throughput: 0: 5980.2. Samples: 529542466. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:27,863][25689] Avg episode reward: [(0, '-31.744')] [2022-07-10 01:57:28,207][26022] Updated weights on worker 0-0, policy_version 517133 (0.00090) [2022-07-10 01:57:30,057][26022] Updated weights on worker 0-0, policy_version 517143 (0.00088) [2022-07-10 01:57:31,680][26022] Updated weights on worker 0-0, policy_version 517153 (0.00084) [2022-07-10 01:57:32,865][25689] Fps is (10 sec: 5857.2, 60 sec: 5687.1, 300 sec: 5673.7). Total num frames: 529570816. Throughput: 0: 6013.3. Samples: 529577100. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:32,866][25689] Avg episode reward: [(0, '-30.756')] [2022-07-10 01:57:33,815][26022] Updated weights on worker 0-0, policy_version 517163 (0.00086) [2022-07-10 01:57:35,151][26022] Updated weights on worker 0-0, policy_version 517173 (0.00099) [2022-07-10 01:57:37,215][26022] Updated weights on worker 0-0, policy_version 517183 (0.00081) [2022-07-10 01:57:37,906][25689] Fps is (10 sec: 5709.9, 60 sec: 5688.1, 300 sec: 5673.3). Total num frames: 529598464. Throughput: 0: 5133.9. Samples: 529594410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 17.0) [2022-07-10 01:57:37,906][25689] Avg episode reward: [(0, '-32.050')] [2022-07-10 01:57:38,803][26022] Updated weights on worker 0-0, policy_version 517193 (0.00097) [2022-07-10 01:57:40,850][26022] Updated weights on worker 0-0, policy_version 517203 (0.00091) [2022-07-10 01:57:42,543][26022] Updated weights on worker 0-0, policy_version 517213 (0.00092) [2022-07-10 01:57:42,932][25689] Fps is (10 sec: 5696.2, 60 sec: 5688.7, 300 sec: 5670.3). Total num frames: 529628160. Throughput: 0: 5954.4. Samples: 529628414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:57:42,933][25689] Avg episode reward: [(0, '-32.344')] [2022-07-10 01:57:44,419][26022] Updated weights on worker 0-0, policy_version 517223 (0.00083) [2022-07-10 01:57:46,126][26022] Updated weights on worker 0-0, policy_version 517233 (0.00090) [2022-07-10 01:57:48,031][25689] Fps is (10 sec: 5663.5, 60 sec: 5668.3, 300 sec: 5671.9). Total num frames: 529655808. Throughput: 0: 5963.4. Samples: 529662736. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:57:48,031][25689] Avg episode reward: [(0, '-31.276')] [2022-07-10 01:57:48,169][26022] Updated weights on worker 0-0, policy_version 517243 (0.00086) [2022-07-10 01:57:49,590][26022] Updated weights on worker 0-0, policy_version 517253 (0.00929) [2022-07-10 01:57:49,883][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:57:49,897][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000517254_529668096.pth [2022-07-10 01:57:49,897][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000515257_527623168.pth [2022-07-10 01:57:51,595][26022] Updated weights on worker 0-0, policy_version 517263 (0.00087) [2022-07-10 01:57:53,043][25689] Fps is (10 sec: 5772.9, 60 sec: 5685.9, 300 sec: 5678.9). Total num frames: 529686528. Throughput: 0: 5101.1. Samples: 529680028. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:57:53,044][25689] Avg episode reward: [(0, '-30.603')] [2022-07-10 01:57:53,260][26022] Updated weights on worker 0-0, policy_version 517273 (0.00085) [2022-07-10 01:57:55,099][26022] Updated weights on worker 0-0, policy_version 517283 (0.00091) [2022-07-10 01:57:56,892][26022] Updated weights on worker 0-0, policy_version 517293 (0.00089) [2022-07-10 01:57:58,046][25689] Fps is (10 sec: 5930.2, 60 sec: 5721.6, 300 sec: 5679.2). Total num frames: 529715200. Throughput: 0: 5977.9. Samples: 529714806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:57:58,046][25689] Avg episode reward: [(0, '-31.012')] [2022-07-10 01:57:58,615][26022] Updated weights on worker 0-0, policy_version 517303 (0.00085) [2022-07-10 01:58:00,437][26022] Updated weights on worker 0-0, policy_version 517313 (0.00094) [2022-07-10 01:58:02,691][26022] Updated weights on worker 0-0, policy_version 517323 (0.00079) [2022-07-10 01:58:03,051][25689] Fps is (10 sec: 5320.5, 60 sec: 5689.3, 300 sec: 5674.4). Total num frames: 529739776. Throughput: 0: 5896.1. Samples: 529747036. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:03,052][25689] Avg episode reward: [(0, '-30.825')] [2022-07-10 01:58:04,339][26022] Updated weights on worker 0-0, policy_version 517333 (0.00088) [2022-07-10 01:58:06,300][26022] Updated weights on worker 0-0, policy_version 517343 (0.00086) [2022-07-10 01:58:07,963][26022] Updated weights on worker 0-0, policy_version 517353 (0.00084) [2022-07-10 01:58:08,106][25689] Fps is (10 sec: 5496.4, 60 sec: 5711.3, 300 sec: 5681.1). Total num frames: 529770496. Throughput: 0: 5042.6. Samples: 529763970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:08,107][25689] Avg episode reward: [(0, '-30.004')] [2022-07-10 01:58:09,749][26022] Updated weights on worker 0-0, policy_version 517363 (0.00088) [2022-07-10 01:58:11,701][26022] Updated weights on worker 0-0, policy_version 517373 (0.00090) [2022-07-10 01:58:13,115][25689] Fps is (10 sec: 5799.9, 60 sec: 5679.7, 300 sec: 5677.8). Total num frames: 529798144. Throughput: 0: 5880.5. Samples: 529798060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:13,115][25689] Avg episode reward: [(0, '-29.930')] [2022-07-10 01:58:13,279][26022] Updated weights on worker 0-0, policy_version 517383 (0.00092) [2022-07-10 01:58:15,179][26022] Updated weights on worker 0-0, policy_version 517393 (0.00086) [2022-07-10 01:58:16,899][26022] Updated weights on worker 0-0, policy_version 517403 (0.00087) [2022-07-10 01:58:18,122][25689] Fps is (10 sec: 5622.9, 60 sec: 5679.3, 300 sec: 5678.9). Total num frames: 529826816. Throughput: 0: 5867.5. Samples: 529832606. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:18,123][25689] Avg episode reward: [(0, '-29.606')] [2022-07-10 01:58:18,732][26022] Updated weights on worker 0-0, policy_version 517413 (0.00092) [2022-07-10 01:58:20,772][26022] Updated weights on worker 0-0, policy_version 517423 (0.00089) [2022-07-10 01:58:22,338][26022] Updated weights on worker 0-0, policy_version 517433 (0.00089) [2022-07-10 01:58:23,157][25689] Fps is (10 sec: 5608.3, 60 sec: 5676.5, 300 sec: 5679.2). Total num frames: 529854464. Throughput: 0: 5104.2. Samples: 529849660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:23,157][25689] Avg episode reward: [(0, '-29.920')] [2022-07-10 01:58:24,260][26022] Updated weights on worker 0-0, policy_version 517443 (0.00084) [2022-07-10 01:58:26,124][26022] Updated weights on worker 0-0, policy_version 517453 (0.00094) [2022-07-10 01:58:27,804][26022] Updated weights on worker 0-0, policy_version 517463 (0.00082) [2022-07-10 01:58:28,280][25689] Fps is (10 sec: 5746.2, 60 sec: 5694.8, 300 sec: 5683.9). Total num frames: 529885184. Throughput: 0: 5923.4. Samples: 529883468. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:28,281][25689] Avg episode reward: [(0, '-31.002')] [2022-07-10 01:58:29,695][26022] Updated weights on worker 0-0, policy_version 517473 (0.00087) [2022-07-10 01:58:31,266][26022] Updated weights on worker 0-0, policy_version 517483 (0.00082) [2022-07-10 01:58:33,315][25689] Fps is (10 sec: 5645.1, 60 sec: 5640.9, 300 sec: 5676.5). Total num frames: 529911808. Throughput: 0: 5915.4. Samples: 529917554. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:33,316][25689] Avg episode reward: [(0, '-30.052')] [2022-07-10 01:58:33,324][26022] Updated weights on worker 0-0, policy_version 517493 (0.00081) [2022-07-10 01:58:34,939][26022] Updated weights on worker 0-0, policy_version 517503 (0.00093) [2022-07-10 01:58:36,600][26022] Updated weights on worker 0-0, policy_version 517513 (0.00082) [2022-07-10 01:58:38,320][25689] Fps is (10 sec: 5609.4, 60 sec: 5678.1, 300 sec: 5683.4). Total num frames: 529941504. Throughput: 0: 5046.9. Samples: 529934544. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:38,321][25689] Avg episode reward: [(0, '-30.482')] [2022-07-10 01:58:38,831][26022] Updated weights on worker 0-0, policy_version 517523 (0.00095) [2022-07-10 01:58:40,301][26022] Updated weights on worker 0-0, policy_version 517533 (0.00090) [2022-07-10 01:58:42,365][26022] Updated weights on worker 0-0, policy_version 517543 (0.00085) [2022-07-10 01:58:43,322][25689] Fps is (10 sec: 5628.1, 60 sec: 5629.6, 300 sec: 5673.8). Total num frames: 529968128. Throughput: 0: 5877.4. Samples: 529968178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:43,322][25689] Avg episode reward: [(0, '-29.586')] [2022-07-10 01:58:44,208][26022] Updated weights on worker 0-0, policy_version 517553 (0.00088) [2022-07-10 01:58:45,955][26022] Updated weights on worker 0-0, policy_version 517563 (0.00100) [2022-07-10 01:58:47,827][26022] Updated weights on worker 0-0, policy_version 517573 (0.00093) [2022-07-10 01:58:48,407][25689] Fps is (10 sec: 5685.1, 60 sec: 5681.7, 300 sec: 5686.3). Total num frames: 529998848. Throughput: 0: 5891.8. Samples: 530002054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:48,408][25689] Avg episode reward: [(0, '-31.333')] [2022-07-10 01:58:49,599][26022] Updated weights on worker 0-0, policy_version 517583 (0.00105) [2022-07-10 01:58:51,416][26022] Updated weights on worker 0-0, policy_version 517593 (0.00084) [2022-07-10 01:58:53,253][26022] Updated weights on worker 0-0, policy_version 517603 (0.00086) [2022-07-10 01:58:53,419][25689] Fps is (10 sec: 5780.6, 60 sec: 5630.8, 300 sec: 5679.8). Total num frames: 530026496. Throughput: 0: 5067.0. Samples: 530019422. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:53,420][25689] Avg episode reward: [(0, '-31.117')] [2022-07-10 01:58:54,947][26022] Updated weights on worker 0-0, policy_version 517613 (0.00081) [2022-07-10 01:58:56,784][26022] Updated weights on worker 0-0, policy_version 517623 (0.00090) [2022-07-10 01:58:58,435][25689] Fps is (10 sec: 5616.4, 60 sec: 5629.6, 300 sec: 5679.6). Total num frames: 530055168. Throughput: 0: 5927.6. Samples: 530053776. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:58:58,436][25689] Avg episode reward: [(0, '-30.439')] [2022-07-10 01:58:58,625][26022] Updated weights on worker 0-0, policy_version 517633 (0.00086) [2022-07-10 01:59:00,209][26022] Updated weights on worker 0-0, policy_version 517643 (0.00085) [2022-07-10 01:59:02,696][26022] Updated weights on worker 0-0, policy_version 517653 (0.00086) [2022-07-10 01:59:03,502][25689] Fps is (10 sec: 5586.0, 60 sec: 5674.7, 300 sec: 5687.9). Total num frames: 530082816. Throughput: 0: 5847.9. Samples: 530086186. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:03,503][25689] Avg episode reward: [(0, '-31.762')] [2022-07-10 01:59:04,116][26022] Updated weights on worker 0-0, policy_version 517663 (0.00090) [2022-07-10 01:59:06,098][26022] Updated weights on worker 0-0, policy_version 517673 (0.00082) [2022-07-10 01:59:07,973][26022] Updated weights on worker 0-0, policy_version 517683 (0.00092) [2022-07-10 01:59:08,637][25689] Fps is (10 sec: 5420.1, 60 sec: 5616.4, 300 sec: 5679.0). Total num frames: 530110464. Throughput: 0: 5871.6. Samples: 530120836. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:08,638][25689] Avg episode reward: [(0, '-31.999')] [2022-07-10 01:59:09,542][26022] Updated weights on worker 0-0, policy_version 517693 (0.00090) [2022-07-10 01:59:11,438][26022] Updated weights on worker 0-0, policy_version 517703 (0.00082) [2022-07-10 01:59:13,107][26022] Updated weights on worker 0-0, policy_version 517713 (0.00195) [2022-07-10 01:59:13,666][25689] Fps is (10 sec: 5641.7, 60 sec: 5648.4, 300 sec: 5685.8). Total num frames: 530140160. Throughput: 0: 5852.9. Samples: 530137924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:13,666][25689] Avg episode reward: [(0, '-32.481')] [2022-07-10 01:59:14,932][26022] Updated weights on worker 0-0, policy_version 517723 (0.00090) [2022-07-10 01:59:16,823][26022] Updated weights on worker 0-0, policy_version 517733 (0.00077) [2022-07-10 01:59:18,257][26022] Updated weights on worker 0-0, policy_version 517743 (0.00083) [2022-07-10 01:59:18,689][25689] Fps is (10 sec: 6010.2, 60 sec: 5680.7, 300 sec: 5688.9). Total num frames: 530170880. Throughput: 0: 5880.7. Samples: 530172886. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:18,690][25689] Avg episode reward: [(0, '-31.783')] [2022-07-10 01:59:20,392][26022] Updated weights on worker 0-0, policy_version 517753 (0.00087) [2022-07-10 01:59:22,000][26022] Updated weights on worker 0-0, policy_version 517763 (0.00089) [2022-07-10 01:59:23,693][25689] Fps is (10 sec: 5821.2, 60 sec: 5683.6, 300 sec: 5687.0). Total num frames: 530198528. Throughput: 0: 5991.0. Samples: 530207152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:23,693][25689] Avg episode reward: [(0, '-31.164')] [2022-07-10 01:59:23,770][26022] Updated weights on worker 0-0, policy_version 517773 (0.00088) [2022-07-10 01:59:25,811][26022] Updated weights on worker 0-0, policy_version 517783 (0.00088) [2022-07-10 01:59:27,437][26022] Updated weights on worker 0-0, policy_version 517793 (0.00619) [2022-07-10 01:59:28,765][25689] Fps is (10 sec: 5386.4, 60 sec: 5620.7, 300 sec: 5684.0). Total num frames: 530225152. Throughput: 0: 5129.9. Samples: 530224092. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:28,766][25689] Avg episode reward: [(0, '-31.441')] [2022-07-10 01:59:29,342][26022] Updated weights on worker 0-0, policy_version 517803 (0.00094) [2022-07-10 01:59:31,035][26022] Updated weights on worker 0-0, policy_version 517813 (0.00087) [2022-07-10 01:59:32,811][26022] Updated weights on worker 0-0, policy_version 517823 (0.00095) [2022-07-10 01:59:33,818][25689] Fps is (10 sec: 5764.7, 60 sec: 5703.7, 300 sec: 5690.6). Total num frames: 530256896. Throughput: 0: 5972.3. Samples: 530258278. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:33,818][25689] Avg episode reward: [(0, '-30.147')] [2022-07-10 01:59:34,866][26022] Updated weights on worker 0-0, policy_version 517833 (0.00090) [2022-07-10 01:59:36,358][26022] Updated weights on worker 0-0, policy_version 517843 (0.00085) [2022-07-10 01:59:38,308][26022] Updated weights on worker 0-0, policy_version 517853 (0.00087) [2022-07-10 01:59:38,876][25689] Fps is (10 sec: 5772.7, 60 sec: 5647.9, 300 sec: 5683.2). Total num frames: 530283520. Throughput: 0: 5924.6. Samples: 530292484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:38,877][25689] Avg episode reward: [(0, '-30.831')] [2022-07-10 01:59:40,068][26022] Updated weights on worker 0-0, policy_version 517863 (0.00089) [2022-07-10 01:59:41,982][26022] Updated weights on worker 0-0, policy_version 517873 (0.00087) [2022-07-10 01:59:43,764][26022] Updated weights on worker 0-0, policy_version 517883 (0.00086) [2022-07-10 01:59:43,963][25689] Fps is (10 sec: 5450.3, 60 sec: 5673.8, 300 sec: 5675.8). Total num frames: 530312192. Throughput: 0: 5048.9. Samples: 530309496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:43,964][25689] Avg episode reward: [(0, '-32.275')] [2022-07-10 01:59:45,594][26022] Updated weights on worker 0-0, policy_version 517893 (0.00093) [2022-07-10 01:59:47,303][26022] Updated weights on worker 0-0, policy_version 517903 (0.00098) [2022-07-10 01:59:49,022][25689] Fps is (10 sec: 5753.1, 60 sec: 5659.4, 300 sec: 5682.1). Total num frames: 530341888. Throughput: 0: 5913.0. Samples: 530343868. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:49,023][25689] Avg episode reward: [(0, '-32.080')] [2022-07-10 01:59:49,090][26022] Updated weights on worker 0-0, policy_version 517913 (0.00100) [2022-07-10 01:59:49,951][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 01:59:49,962][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000517918_530348032.pth [2022-07-10 01:59:49,963][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000515919_528301056.pth [2022-07-10 01:59:50,825][26022] Updated weights on worker 0-0, policy_version 517923 (0.00091) [2022-07-10 01:59:52,765][26022] Updated weights on worker 0-0, policy_version 517933 (0.00088) [2022-07-10 01:59:54,046][25689] Fps is (10 sec: 5788.8, 60 sec: 5675.1, 300 sec: 5678.2). Total num frames: 530370560. Throughput: 0: 5926.8. Samples: 530378168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:54,047][25689] Avg episode reward: [(0, '-32.239')] [2022-07-10 01:59:54,525][26022] Updated weights on worker 0-0, policy_version 517943 (0.00083) [2022-07-10 01:59:56,291][26022] Updated weights on worker 0-0, policy_version 517953 (0.00087) [2022-07-10 01:59:58,143][26022] Updated weights on worker 0-0, policy_version 517963 (0.00085) [2022-07-10 01:59:59,099][25689] Fps is (10 sec: 5792.1, 60 sec: 5688.5, 300 sec: 5687.6). Total num frames: 530400256. Throughput: 0: 5085.4. Samples: 530395324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 01:59:59,099][25689] Avg episode reward: [(0, '-32.455')] [2022-07-10 01:59:59,636][26022] Updated weights on worker 0-0, policy_version 517973 (0.00093) [2022-07-10 02:00:02,039][26022] Updated weights on worker 0-0, policy_version 517983 (0.00114) [2022-07-10 02:00:03,837][26022] Updated weights on worker 0-0, policy_version 517993 (0.00096) [2022-07-10 02:00:04,133][25689] Fps is (10 sec: 5583.3, 60 sec: 5674.7, 300 sec: 5681.0). Total num frames: 530426880. Throughput: 0: 5858.3. Samples: 530427658. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:04,134][25689] Avg episode reward: [(0, '-32.472')] [2022-07-10 02:00:05,563][26022] Updated weights on worker 0-0, policy_version 518003 (0.00088) [2022-07-10 02:00:07,603][26022] Updated weights on worker 0-0, policy_version 518013 (0.00106) [2022-07-10 02:00:09,137][26022] Updated weights on worker 0-0, policy_version 518023 (0.00088) [2022-07-10 02:00:09,235][25689] Fps is (10 sec: 5455.2, 60 sec: 5694.7, 300 sec: 5679.1). Total num frames: 530455552. Throughput: 0: 5832.9. Samples: 530461772. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:09,236][25689] Avg episode reward: [(0, '-31.962')] [2022-07-10 02:00:11,054][26022] Updated weights on worker 0-0, policy_version 518033 (0.00086) [2022-07-10 02:00:12,825][26022] Updated weights on worker 0-0, policy_version 518043 (0.00088) [2022-07-10 02:00:14,306][25689] Fps is (10 sec: 5637.1, 60 sec: 5673.9, 300 sec: 5674.5). Total num frames: 530484224. Throughput: 0: 4968.6. Samples: 530478830. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:14,306][25689] Avg episode reward: [(0, '-31.056')] [2022-07-10 02:00:14,474][26022] Updated weights on worker 0-0, policy_version 518053 (0.00082) [2022-07-10 02:00:16,501][26022] Updated weights on worker 0-0, policy_version 518063 (0.00087) [2022-07-10 02:00:18,141][26022] Updated weights on worker 0-0, policy_version 518073 (0.00087) [2022-07-10 02:00:19,335][25689] Fps is (10 sec: 5677.9, 60 sec: 5639.6, 300 sec: 5675.9). Total num frames: 530512896. Throughput: 0: 5828.8. Samples: 530513274. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:19,336][25689] Avg episode reward: [(0, '-29.970')] [2022-07-10 02:00:20,111][26022] Updated weights on worker 0-0, policy_version 518083 (0.00088) [2022-07-10 02:00:21,797][26022] Updated weights on worker 0-0, policy_version 518093 (0.00084) [2022-07-10 02:00:23,523][26022] Updated weights on worker 0-0, policy_version 518103 (0.00086) [2022-07-10 02:00:24,378][25689] Fps is (10 sec: 5591.6, 60 sec: 5635.9, 300 sec: 5672.3). Total num frames: 530540544. Throughput: 0: 5921.4. Samples: 530547536. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:24,379][25689] Avg episode reward: [(0, '-28.745')] [2022-07-10 02:00:25,340][26022] Updated weights on worker 0-0, policy_version 518113 (0.00082) [2022-07-10 02:00:27,076][26022] Updated weights on worker 0-0, policy_version 518123 (0.00087) [2022-07-10 02:00:28,948][26022] Updated weights on worker 0-0, policy_version 518133 (0.00086) [2022-07-10 02:00:29,488][25689] Fps is (10 sec: 5748.6, 60 sec: 5699.9, 300 sec: 5677.3). Total num frames: 530571264. Throughput: 0: 5080.9. Samples: 530564676. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:29,489][25689] Avg episode reward: [(0, '-29.549')] [2022-07-10 02:00:30,806][26022] Updated weights on worker 0-0, policy_version 518143 (0.00092) [2022-07-10 02:00:32,468][26022] Updated weights on worker 0-0, policy_version 518153 (0.00090) [2022-07-10 02:00:34,519][25689] Fps is (10 sec: 5755.5, 60 sec: 5634.4, 300 sec: 5670.3). Total num frames: 530598912. Throughput: 0: 5938.1. Samples: 530598860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:34,520][25689] Avg episode reward: [(0, '-29.670')] [2022-07-10 02:00:34,532][26022] Updated weights on worker 0-0, policy_version 518163 (0.00086) [2022-07-10 02:00:36,044][26022] Updated weights on worker 0-0, policy_version 518173 (0.00090) [2022-07-10 02:00:37,943][26022] Updated weights on worker 0-0, policy_version 518183 (0.00086) [2022-07-10 02:00:39,566][25689] Fps is (10 sec: 5690.2, 60 sec: 5686.1, 300 sec: 5676.4). Total num frames: 530628608. Throughput: 0: 5933.1. Samples: 530633308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:39,566][25689] Avg episode reward: [(0, '-29.309')] [2022-07-10 02:00:39,632][26022] Updated weights on worker 0-0, policy_version 518193 (0.00090) [2022-07-10 02:00:41,453][26022] Updated weights on worker 0-0, policy_version 518203 (0.00091) [2022-07-10 02:00:43,261][26022] Updated weights on worker 0-0, policy_version 518213 (0.00082) [2022-07-10 02:00:44,573][25689] Fps is (10 sec: 5805.9, 60 sec: 5693.7, 300 sec: 5677.7). Total num frames: 530657280. Throughput: 0: 5096.3. Samples: 530650452. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:44,573][25689] Avg episode reward: [(0, '-29.537')] [2022-07-10 02:00:45,122][26022] Updated weights on worker 0-0, policy_version 518223 (0.00085) [2022-07-10 02:00:46,862][26022] Updated weights on worker 0-0, policy_version 518233 (0.00081) [2022-07-10 02:00:48,728][26022] Updated weights on worker 0-0, policy_version 518243 (0.00084) [2022-07-10 02:00:49,610][25689] Fps is (10 sec: 5708.9, 60 sec: 5678.7, 300 sec: 5677.3). Total num frames: 530685952. Throughput: 0: 5978.7. Samples: 530684982. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:49,611][25689] Avg episode reward: [(0, '-30.673')] [2022-07-10 02:00:50,412][26022] Updated weights on worker 0-0, policy_version 518253 (0.00085) [2022-07-10 02:00:52,293][26022] Updated weights on worker 0-0, policy_version 518263 (0.00092) [2022-07-10 02:00:54,047][26022] Updated weights on worker 0-0, policy_version 518273 (0.00088) [2022-07-10 02:00:54,623][25689] Fps is (10 sec: 5705.6, 60 sec: 5679.8, 300 sec: 5670.5). Total num frames: 530714624. Throughput: 0: 5982.8. Samples: 530719136. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:54,625][25689] Avg episode reward: [(0, '-31.125')] [2022-07-10 02:00:55,741][26022] Updated weights on worker 0-0, policy_version 518283 (0.00088) [2022-07-10 02:00:57,605][26022] Updated weights on worker 0-0, policy_version 518293 (0.00090) [2022-07-10 02:00:59,567][26022] Updated weights on worker 0-0, policy_version 518303 (0.00085) [2022-07-10 02:00:59,643][25689] Fps is (10 sec: 5613.9, 60 sec: 5649.1, 300 sec: 5673.9). Total num frames: 530742272. Throughput: 0: 5116.4. Samples: 530736030. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:00:59,643][25689] Avg episode reward: [(0, '-29.726')] [2022-07-10 02:01:01,182][26022] Updated weights on worker 0-0, policy_version 518313 (0.00086) [2022-07-10 02:01:03,440][26022] Updated weights on worker 0-0, policy_version 518323 (0.00090) [2022-07-10 02:01:04,661][25689] Fps is (10 sec: 5406.7, 60 sec: 5650.6, 300 sec: 5675.6). Total num frames: 530768896. Throughput: 0: 5849.5. Samples: 530767960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:04,661][25689] Avg episode reward: [(0, '-28.916')] [2022-07-10 02:01:05,251][26022] Updated weights on worker 0-0, policy_version 518333 (0.00091) [2022-07-10 02:01:06,972][26022] Updated weights on worker 0-0, policy_version 518343 (0.00089) [2022-07-10 02:01:08,863][26022] Updated weights on worker 0-0, policy_version 518353 (0.00093) [2022-07-10 02:01:09,795][25689] Fps is (10 sec: 5547.3, 60 sec: 5664.4, 300 sec: 5673.3). Total num frames: 530798592. Throughput: 0: 5799.2. Samples: 530802038. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:09,797][25689] Avg episode reward: [(0, '-31.007')] [2022-07-10 02:01:10,511][26022] Updated weights on worker 0-0, policy_version 518363 (0.00093) [2022-07-10 02:01:12,533][26022] Updated weights on worker 0-0, policy_version 518373 (0.00087) [2022-07-10 02:01:14,143][26022] Updated weights on worker 0-0, policy_version 518383 (0.00093) [2022-07-10 02:01:14,894][25689] Fps is (10 sec: 5603.8, 60 sec: 5644.9, 300 sec: 5668.0). Total num frames: 530826240. Throughput: 0: 5785.3. Samples: 530836410. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:14,894][25689] Avg episode reward: [(0, '-30.607')] [2022-07-10 02:01:16,070][26022] Updated weights on worker 0-0, policy_version 518393 (0.00087) [2022-07-10 02:01:17,808][26022] Updated weights on worker 0-0, policy_version 518403 (0.00089) [2022-07-10 02:01:19,586][26022] Updated weights on worker 0-0, policy_version 518413 (0.00071) [2022-07-10 02:01:19,899][25689] Fps is (10 sec: 5675.8, 60 sec: 5664.1, 300 sec: 5671.8). Total num frames: 530855936. Throughput: 0: 5802.0. Samples: 530853556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:19,899][25689] Avg episode reward: [(0, '-30.575')] [2022-07-10 02:01:21,503][26022] Updated weights on worker 0-0, policy_version 518423 (0.00089) [2022-07-10 02:01:23,276][26022] Updated weights on worker 0-0, policy_version 518433 (0.00086) [2022-07-10 02:01:24,914][25689] Fps is (10 sec: 5825.2, 60 sec: 5683.7, 300 sec: 5673.7). Total num frames: 530884608. Throughput: 0: 5932.9. Samples: 530888118. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:24,915][25689] Avg episode reward: [(0, '-31.421')] [2022-07-10 02:01:24,932][26022] Updated weights on worker 0-0, policy_version 518443 (0.00080) [2022-07-10 02:01:26,960][26022] Updated weights on worker 0-0, policy_version 518453 (0.00088) [2022-07-10 02:01:28,420][26022] Updated weights on worker 0-0, policy_version 518463 (0.00086) [2022-07-10 02:01:29,956][25689] Fps is (10 sec: 5600.0, 60 sec: 5639.2, 300 sec: 5663.9). Total num frames: 530912256. Throughput: 0: 5960.0. Samples: 530922194. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:29,956][25689] Avg episode reward: [(0, '-32.103')] [2022-07-10 02:01:30,532][26022] Updated weights on worker 0-0, policy_version 518473 (0.00090) [2022-07-10 02:01:32,128][26022] Updated weights on worker 0-0, policy_version 518483 (0.00089) [2022-07-10 02:01:34,171][26022] Updated weights on worker 0-0, policy_version 518493 (0.00091) [2022-07-10 02:01:34,963][25689] Fps is (10 sec: 5706.5, 60 sec: 5675.4, 300 sec: 5671.6). Total num frames: 530941952. Throughput: 0: 5123.0. Samples: 530939222. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:34,964][25689] Avg episode reward: [(0, '-31.343')] [2022-07-10 02:01:35,887][26022] Updated weights on worker 0-0, policy_version 518503 (0.00085) [2022-07-10 02:01:37,710][26022] Updated weights on worker 0-0, policy_version 518513 (0.00081) [2022-07-10 02:01:39,336][26022] Updated weights on worker 0-0, policy_version 518523 (0.00091) [2022-07-10 02:01:39,973][25689] Fps is (10 sec: 5826.6, 60 sec: 5661.8, 300 sec: 5668.6). Total num frames: 530970624. Throughput: 0: 5985.0. Samples: 530973702. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:39,975][25689] Avg episode reward: [(0, '-30.023')] [2022-07-10 02:01:41,243][26022] Updated weights on worker 0-0, policy_version 518533 (0.00088) [2022-07-10 02:01:42,828][26022] Updated weights on worker 0-0, policy_version 518543 (0.00078) [2022-07-10 02:01:44,817][26022] Updated weights on worker 0-0, policy_version 518553 (0.00099) [2022-07-10 02:01:45,011][25689] Fps is (10 sec: 5707.0, 60 sec: 5658.9, 300 sec: 5669.0). Total num frames: 530999296. Throughput: 0: 5964.1. Samples: 531007978. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:45,012][25689] Avg episode reward: [(0, '-29.724')] [2022-07-10 02:01:46,587][26022] Updated weights on worker 0-0, policy_version 518563 (0.00083) [2022-07-10 02:01:48,330][26022] Updated weights on worker 0-0, policy_version 518573 (0.00088) [2022-07-10 02:01:50,138][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:01:50,150][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000518582_531027968.pth [2022-07-10 02:01:50,150][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000516586_528984064.pth [2022-07-10 02:01:50,159][25689] Fps is (10 sec: 5529.1, 60 sec: 5631.7, 300 sec: 5659.7). Total num frames: 531026944. Throughput: 0: 5086.2. Samples: 531024954. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:50,160][25689] Avg episode reward: [(0, '-29.120')] [2022-07-10 02:01:50,365][26022] Updated weights on worker 0-0, policy_version 518583 (0.00104) [2022-07-10 02:01:51,931][26022] Updated weights on worker 0-0, policy_version 518593 (0.00085) [2022-07-10 02:01:53,889][26022] Updated weights on worker 0-0, policy_version 518603 (0.00092) [2022-07-10 02:01:55,183][25689] Fps is (10 sec: 5738.1, 60 sec: 5664.5, 300 sec: 5673.4). Total num frames: 531057664. Throughput: 0: 5940.8. Samples: 531059344. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:01:55,183][25689] Avg episode reward: [(0, '-29.341')] [2022-07-10 02:01:55,562][26022] Updated weights on worker 0-0, policy_version 518613 (0.00102) [2022-07-10 02:01:57,342][26022] Updated weights on worker 0-0, policy_version 518623 (0.00084) [2022-07-10 02:01:59,101][26022] Updated weights on worker 0-0, policy_version 518633 (0.00083) [2022-07-10 02:02:00,185][25689] Fps is (10 sec: 5924.1, 60 sec: 5683.0, 300 sec: 5680.7). Total num frames: 531086336. Throughput: 0: 5941.8. Samples: 531093796. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:00,186][25689] Avg episode reward: [(0, '-30.418')] [2022-07-10 02:02:01,054][26022] Updated weights on worker 0-0, policy_version 518643 (0.00081) [2022-07-10 02:02:02,958][26022] Updated weights on worker 0-0, policy_version 518653 (0.00089) [2022-07-10 02:02:04,918][26022] Updated weights on worker 0-0, policy_version 518663 (0.00089) [2022-07-10 02:02:05,189][25689] Fps is (10 sec: 5321.3, 60 sec: 5650.5, 300 sec: 5665.5). Total num frames: 531110912. Throughput: 0: 4989.5. Samples: 531108658. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:05,190][25689] Avg episode reward: [(0, '-32.144')] [2022-07-10 02:02:06,491][26022] Updated weights on worker 0-0, policy_version 518673 (0.00095) [2022-07-10 02:02:08,677][26022] Updated weights on worker 0-0, policy_version 518683 (0.00088) [2022-07-10 02:02:10,228][26022] Updated weights on worker 0-0, policy_version 518693 (0.00090) [2022-07-10 02:02:10,327][25689] Fps is (10 sec: 5452.1, 60 sec: 5667.1, 300 sec: 5666.9). Total num frames: 531141632. Throughput: 0: 5850.7. Samples: 531142948. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:10,327][25689] Avg episode reward: [(0, '-32.896')] [2022-07-10 02:02:12,160][26022] Updated weights on worker 0-0, policy_version 518703 (0.00084) [2022-07-10 02:02:13,810][26022] Updated weights on worker 0-0, policy_version 518713 (0.00099) [2022-07-10 02:02:15,406][25689] Fps is (10 sec: 5813.1, 60 sec: 5685.9, 300 sec: 5665.5). Total num frames: 531170304. Throughput: 0: 5832.8. Samples: 531177304. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:15,407][25689] Avg episode reward: [(0, '-33.566')] [2022-07-10 02:02:15,748][26022] Updated weights on worker 0-0, policy_version 518723 (0.00091) [2022-07-10 02:02:17,396][26022] Updated weights on worker 0-0, policy_version 518733 (0.00085) [2022-07-10 02:02:19,191][26022] Updated weights on worker 0-0, policy_version 518743 (0.00101) [2022-07-10 02:02:20,429][25689] Fps is (10 sec: 5676.5, 60 sec: 5667.2, 300 sec: 5668.6). Total num frames: 531198976. Throughput: 0: 4968.5. Samples: 531194380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:20,430][25689] Avg episode reward: [(0, '-33.988')] [2022-07-10 02:02:21,029][26022] Updated weights on worker 0-0, policy_version 518753 (0.00087) [2022-07-10 02:02:22,925][26022] Updated weights on worker 0-0, policy_version 518763 (0.00410) [2022-07-10 02:02:24,684][26022] Updated weights on worker 0-0, policy_version 518773 (0.00086) [2022-07-10 02:02:25,444][25689] Fps is (10 sec: 5712.9, 60 sec: 5667.3, 300 sec: 5667.5). Total num frames: 531227648. Throughput: 0: 5932.9. Samples: 531228826. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:25,444][25689] Avg episode reward: [(0, '-34.347')] [2022-07-10 02:02:26,282][26022] Updated weights on worker 0-0, policy_version 518783 (0.00096) [2022-07-10 02:02:28,255][26022] Updated weights on worker 0-0, policy_version 518793 (0.00082) [2022-07-10 02:02:30,015][26022] Updated weights on worker 0-0, policy_version 518803 (0.00082) [2022-07-10 02:02:30,527][25689] Fps is (10 sec: 5679.0, 60 sec: 5680.4, 300 sec: 5662.5). Total num frames: 531256320. Throughput: 0: 5951.5. Samples: 531263166. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:30,527][25689] Avg episode reward: [(0, '-34.034')] [2022-07-10 02:02:31,783][26022] Updated weights on worker 0-0, policy_version 518813 (0.00087) [2022-07-10 02:02:33,815][26022] Updated weights on worker 0-0, policy_version 518823 (0.00085) [2022-07-10 02:02:35,361][26022] Updated weights on worker 0-0, policy_version 518833 (0.01016) [2022-07-10 02:02:35,601][25689] Fps is (10 sec: 5746.2, 60 sec: 5674.0, 300 sec: 5668.7). Total num frames: 531286016. Throughput: 0: 5093.8. Samples: 531280176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:35,602][25689] Avg episode reward: [(0, '-32.989')] [2022-07-10 02:02:37,348][26022] Updated weights on worker 0-0, policy_version 518843 (0.00087) [2022-07-10 02:02:38,886][26022] Updated weights on worker 0-0, policy_version 518853 (0.00087) [2022-07-10 02:02:40,673][25689] Fps is (10 sec: 5651.6, 60 sec: 5651.4, 300 sec: 5661.0). Total num frames: 531313664. Throughput: 0: 5923.4. Samples: 531314294. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:40,674][25689] Avg episode reward: [(0, '-34.325')] [2022-07-10 02:02:40,892][26022] Updated weights on worker 0-0, policy_version 518863 (0.00087) [2022-07-10 02:02:42,636][26022] Updated weights on worker 0-0, policy_version 518873 (0.00087) [2022-07-10 02:02:44,354][26022] Updated weights on worker 0-0, policy_version 518883 (0.00081) [2022-07-10 02:02:45,767][25689] Fps is (10 sec: 5540.2, 60 sec: 5646.1, 300 sec: 5664.5). Total num frames: 531342336. Throughput: 0: 5896.5. Samples: 531348664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:45,768][25689] Avg episode reward: [(0, '-33.408')] [2022-07-10 02:02:46,187][26022] Updated weights on worker 0-0, policy_version 518893 (0.00084) [2022-07-10 02:02:47,829][26022] Updated weights on worker 0-0, policy_version 518903 (0.01152) [2022-07-10 02:02:49,832][26022] Updated weights on worker 0-0, policy_version 518913 (0.00090) [2022-07-10 02:02:50,896][25689] Fps is (10 sec: 5809.9, 60 sec: 5698.6, 300 sec: 5662.3). Total num frames: 531373056. Throughput: 0: 5034.6. Samples: 531365716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:50,897][25689] Avg episode reward: [(0, '-34.612')] [2022-07-10 02:02:51,551][26022] Updated weights on worker 0-0, policy_version 518923 (0.00093) [2022-07-10 02:02:53,342][26022] Updated weights on worker 0-0, policy_version 518933 (0.00087) [2022-07-10 02:02:55,207][26022] Updated weights on worker 0-0, policy_version 518943 (0.00082) [2022-07-10 02:02:55,934][25689] Fps is (10 sec: 5741.2, 60 sec: 5646.7, 300 sec: 5658.2). Total num frames: 531400704. Throughput: 0: 5894.9. Samples: 531400032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:02:55,935][25689] Avg episode reward: [(0, '-34.209')] [2022-07-10 02:02:56,937][26022] Updated weights on worker 0-0, policy_version 518953 (0.00094) [2022-07-10 02:02:58,659][26022] Updated weights on worker 0-0, policy_version 518963 (0.00089) [2022-07-10 02:03:00,522][26022] Updated weights on worker 0-0, policy_version 518973 (0.00089) [2022-07-10 02:03:00,975][25689] Fps is (10 sec: 5587.7, 60 sec: 5643.0, 300 sec: 5671.3). Total num frames: 531429376. Throughput: 0: 5922.1. Samples: 531434524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:00,976][25689] Avg episode reward: [(0, '-34.223')] [2022-07-10 02:03:02,694][26022] Updated weights on worker 0-0, policy_version 518983 (0.00085) [2022-07-10 02:03:04,623][26022] Updated weights on worker 0-0, policy_version 518993 (0.00089) [2022-07-10 02:03:05,997][25689] Fps is (10 sec: 5596.3, 60 sec: 5691.9, 300 sec: 5661.6). Total num frames: 531457024. Throughput: 0: 4984.5. Samples: 531449502. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:05,999][25689] Avg episode reward: [(0, '-35.127')] [2022-07-10 02:03:06,242][26022] Updated weights on worker 0-0, policy_version 519003 (0.00089) [2022-07-10 02:03:08,033][26022] Updated weights on worker 0-0, policy_version 519013 (0.00085) [2022-07-10 02:03:09,943][26022] Updated weights on worker 0-0, policy_version 519023 (0.00092) [2022-07-10 02:03:11,074][25689] Fps is (10 sec: 5576.6, 60 sec: 5663.9, 300 sec: 5663.7). Total num frames: 531485696. Throughput: 0: 5856.9. Samples: 531483900. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:11,075][25689] Avg episode reward: [(0, '-33.808')] [2022-07-10 02:03:11,590][26022] Updated weights on worker 0-0, policy_version 519033 (0.00095) [2022-07-10 02:03:13,554][26022] Updated weights on worker 0-0, policy_version 519043 (0.00087) [2022-07-10 02:03:15,162][26022] Updated weights on worker 0-0, policy_version 519053 (0.00087) [2022-07-10 02:03:16,145][25689] Fps is (10 sec: 5751.8, 60 sec: 5681.5, 300 sec: 5665.9). Total num frames: 531515392. Throughput: 0: 5860.5. Samples: 531518482. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:16,146][25689] Avg episode reward: [(0, '-34.075')] [2022-07-10 02:03:17,165][26022] Updated weights on worker 0-0, policy_version 519063 (0.00088) [2022-07-10 02:03:18,736][26022] Updated weights on worker 0-0, policy_version 519073 (0.00086) [2022-07-10 02:03:20,810][26022] Updated weights on worker 0-0, policy_version 519083 (0.00086) [2022-07-10 02:03:21,154][25689] Fps is (10 sec: 5587.2, 60 sec: 5649.0, 300 sec: 5663.0). Total num frames: 531542016. Throughput: 0: 5849.7. Samples: 531552568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:21,155][25689] Avg episode reward: [(0, '-33.529')] [2022-07-10 02:03:22,342][26022] Updated weights on worker 0-0, policy_version 519093 (0.00079) [2022-07-10 02:03:24,343][26022] Updated weights on worker 0-0, policy_version 519103 (0.00092) [2022-07-10 02:03:25,901][26022] Updated weights on worker 0-0, policy_version 519113 (0.00084) [2022-07-10 02:03:26,159][25689] Fps is (10 sec: 5726.7, 60 sec: 5683.7, 300 sec: 5665.2). Total num frames: 531572736. Throughput: 0: 5974.8. Samples: 531569962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:26,159][25689] Avg episode reward: [(0, '-33.649')] [2022-07-10 02:03:27,865][26022] Updated weights on worker 0-0, policy_version 519123 (0.00086) [2022-07-10 02:03:29,563][26022] Updated weights on worker 0-0, policy_version 519133 (0.00089) [2022-07-10 02:03:31,228][25689] Fps is (10 sec: 5895.6, 60 sec: 5685.0, 300 sec: 5671.4). Total num frames: 531601408. Throughput: 0: 5959.5. Samples: 531604010. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 02:03:31,230][25689] Avg episode reward: [(0, '-33.050')] [2022-07-10 02:03:31,371][26022] Updated weights on worker 0-0, policy_version 519143 (0.00085) [2022-07-10 02:03:33,206][26022] Updated weights on worker 0-0, policy_version 519153 (0.00089) [2022-07-10 02:03:35,011][26022] Updated weights on worker 0-0, policy_version 519163 (0.00096) [2022-07-10 02:03:36,302][25689] Fps is (10 sec: 5451.3, 60 sec: 5634.4, 300 sec: 5659.8). Total num frames: 531628032. Throughput: 0: 5939.9. Samples: 531638214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:03:36,304][25689] Avg episode reward: [(0, '-32.923')] [2022-07-10 02:03:36,702][26022] Updated weights on worker 0-0, policy_version 519173 (0.00085) [2022-07-10 02:03:38,898][26022] Updated weights on worker 0-0, policy_version 519183 (0.00094) [2022-07-10 02:03:40,391][26022] Updated weights on worker 0-0, policy_version 519193 (0.00935) [2022-07-10 02:03:41,329][25689] Fps is (10 sec: 5576.1, 60 sec: 5672.4, 300 sec: 5669.7). Total num frames: 531657728. Throughput: 0: 5088.0. Samples: 531655216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:03:41,329][25689] Avg episode reward: [(0, '-33.197')] [2022-07-10 02:03:42,432][26022] Updated weights on worker 0-0, policy_version 519203 (0.00086) [2022-07-10 02:03:43,911][26022] Updated weights on worker 0-0, policy_version 519213 (0.00086) [2022-07-10 02:03:45,827][26022] Updated weights on worker 0-0, policy_version 519223 (0.00085) [2022-07-10 02:03:46,343][25689] Fps is (10 sec: 5813.3, 60 sec: 5679.9, 300 sec: 5664.1). Total num frames: 531686400. Throughput: 0: 5932.1. Samples: 531689696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:03:46,344][25689] Avg episode reward: [(0, '-32.937')] [2022-07-10 02:03:47,677][26022] Updated weights on worker 0-0, policy_version 519233 (0.00087) [2022-07-10 02:03:49,422][26022] Updated weights on worker 0-0, policy_version 519243 (0.00090) [2022-07-10 02:03:50,375][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:03:50,388][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000519248_531709952.pth [2022-07-10 02:03:50,389][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000517254_529668096.pth [2022-07-10 02:03:51,402][25689] Fps is (10 sec: 5692.7, 60 sec: 5652.6, 300 sec: 5666.7). Total num frames: 531715072. Throughput: 0: 5926.4. Samples: 531723566. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:03:51,403][25689] Avg episode reward: [(0, '-33.347')] [2022-07-10 02:03:51,407][26022] Updated weights on worker 0-0, policy_version 519253 (0.00080) [2022-07-10 02:03:52,894][26022] Updated weights on worker 0-0, policy_version 519263 (0.00087) [2022-07-10 02:03:54,867][26022] Updated weights on worker 0-0, policy_version 519273 (0.00053) [2022-07-10 02:03:56,478][25689] Fps is (10 sec: 5658.1, 60 sec: 5666.0, 300 sec: 5665.5). Total num frames: 531743744. Throughput: 0: 5083.9. Samples: 531740784. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:03:56,479][25689] Avg episode reward: [(0, '-33.711')] [2022-07-10 02:03:56,727][26022] Updated weights on worker 0-0, policy_version 519283 (0.00093) [2022-07-10 02:03:58,268][26022] Updated weights on worker 0-0, policy_version 519293 (0.00095) [2022-07-10 02:04:00,391][26022] Updated weights on worker 0-0, policy_version 519303 (0.00081) [2022-07-10 02:04:01,570][25689] Fps is (10 sec: 5740.6, 60 sec: 5678.1, 300 sec: 5671.9). Total num frames: 531773440. Throughput: 0: 5934.5. Samples: 531775336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:01,570][25689] Avg episode reward: [(0, '-32.780')] [2022-07-10 02:04:02,235][26022] Updated weights on worker 0-0, policy_version 519313 (0.00089) [2022-07-10 02:04:04,159][26022] Updated weights on worker 0-0, policy_version 519323 (0.00082) [2022-07-10 02:04:05,908][26022] Updated weights on worker 0-0, policy_version 519333 (0.00091) [2022-07-10 02:04:06,607][25689] Fps is (10 sec: 5560.2, 60 sec: 5659.8, 300 sec: 5670.4). Total num frames: 531800064. Throughput: 0: 5835.7. Samples: 531807952. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:06,608][25689] Avg episode reward: [(0, '-32.411')] [2022-07-10 02:04:07,515][26022] Updated weights on worker 0-0, policy_version 519343 (0.00087) [2022-07-10 02:04:09,487][26022] Updated weights on worker 0-0, policy_version 519353 (0.00089) [2022-07-10 02:04:11,259][26022] Updated weights on worker 0-0, policy_version 519363 (0.00089) [2022-07-10 02:04:11,678][25689] Fps is (10 sec: 5571.6, 60 sec: 5677.3, 300 sec: 5669.6). Total num frames: 531829760. Throughput: 0: 5011.1. Samples: 531825170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:11,679][25689] Avg episode reward: [(0, '-33.363')] [2022-07-10 02:04:12,914][26022] Updated weights on worker 0-0, policy_version 519373 (0.00087) [2022-07-10 02:04:14,931][26022] Updated weights on worker 0-0, policy_version 519383 (0.00712) [2022-07-10 02:04:16,548][26022] Updated weights on worker 0-0, policy_version 519393 (0.00090) [2022-07-10 02:04:16,747][25689] Fps is (10 sec: 5756.1, 60 sec: 5660.5, 300 sec: 5661.8). Total num frames: 531858432. Throughput: 0: 5856.3. Samples: 531859488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:16,748][25689] Avg episode reward: [(0, '-33.501')] [2022-07-10 02:04:18,412][26022] Updated weights on worker 0-0, policy_version 519403 (0.00092) [2022-07-10 02:04:20,128][26022] Updated weights on worker 0-0, policy_version 519413 (0.00089) [2022-07-10 02:04:21,765][25689] Fps is (10 sec: 5684.7, 60 sec: 5693.5, 300 sec: 5665.0). Total num frames: 531887104. Throughput: 0: 5864.3. Samples: 531893770. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:21,766][25689] Avg episode reward: [(0, '-34.496')] [2022-07-10 02:04:21,992][26022] Updated weights on worker 0-0, policy_version 519423 (0.00086) [2022-07-10 02:04:23,667][26022] Updated weights on worker 0-0, policy_version 519433 (0.00091) [2022-07-10 02:04:25,671][26022] Updated weights on worker 0-0, policy_version 519443 (0.00083) [2022-07-10 02:04:26,779][25689] Fps is (10 sec: 5818.3, 60 sec: 5675.7, 300 sec: 5676.4). Total num frames: 531916800. Throughput: 0: 5099.7. Samples: 531910822. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:26,779][25689] Avg episode reward: [(0, '-35.924')] [2022-07-10 02:04:27,236][26022] Updated weights on worker 0-0, policy_version 519453 (0.00084) [2022-07-10 02:04:29,211][26022] Updated weights on worker 0-0, policy_version 519463 (0.00086) [2022-07-10 02:04:30,802][26022] Updated weights on worker 0-0, policy_version 519473 (0.00087) [2022-07-10 02:04:31,910][25689] Fps is (10 sec: 5652.8, 60 sec: 5653.1, 300 sec: 5661.2). Total num frames: 531944448. Throughput: 0: 5910.8. Samples: 531944756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:31,910][25689] Avg episode reward: [(0, '-36.212')] [2022-07-10 02:04:32,842][26022] Updated weights on worker 0-0, policy_version 519483 (0.00086) [2022-07-10 02:04:34,786][26022] Updated weights on worker 0-0, policy_version 519493 (0.00090) [2022-07-10 02:04:36,211][26022] Updated weights on worker 0-0, policy_version 519503 (0.00086) [2022-07-10 02:04:36,959][25689] Fps is (10 sec: 5632.9, 60 sec: 5706.1, 300 sec: 5671.7). Total num frames: 531974144. Throughput: 0: 5914.0. Samples: 531979022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:36,960][25689] Avg episode reward: [(0, '-35.226')] [2022-07-10 02:04:38,268][26022] Updated weights on worker 0-0, policy_version 519513 (0.00084) [2022-07-10 02:04:39,974][26022] Updated weights on worker 0-0, policy_version 519523 (0.00079) [2022-07-10 02:04:41,755][26022] Updated weights on worker 0-0, policy_version 519533 (0.00101) [2022-07-10 02:04:42,006][25689] Fps is (10 sec: 5781.4, 60 sec: 5687.3, 300 sec: 5672.4). Total num frames: 532002816. Throughput: 0: 5068.4. Samples: 531996356. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:42,006][25689] Avg episode reward: [(0, '-34.935')] [2022-07-10 02:04:43,726][26022] Updated weights on worker 0-0, policy_version 519543 (0.00086) [2022-07-10 02:04:45,259][26022] Updated weights on worker 0-0, policy_version 519553 (0.00089) [2022-07-10 02:04:47,009][25689] Fps is (10 sec: 5706.1, 60 sec: 5688.4, 300 sec: 5670.0). Total num frames: 532031488. Throughput: 0: 5944.4. Samples: 532031076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:47,010][25689] Avg episode reward: [(0, '-34.801')] [2022-07-10 02:04:47,159][26022] Updated weights on worker 0-0, policy_version 519563 (0.00053) [2022-07-10 02:04:48,903][26022] Updated weights on worker 0-0, policy_version 519573 (0.00083) [2022-07-10 02:04:50,643][26022] Updated weights on worker 0-0, policy_version 519583 (0.00093) [2022-07-10 02:04:52,069][25689] Fps is (10 sec: 5698.1, 60 sec: 5688.2, 300 sec: 5669.4). Total num frames: 532060160. Throughput: 0: 5984.9. Samples: 532065408. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:52,070][25689] Avg episode reward: [(0, '-32.587')] [2022-07-10 02:04:52,438][26022] Updated weights on worker 0-0, policy_version 519593 (0.00083) [2022-07-10 02:04:54,279][26022] Updated weights on worker 0-0, policy_version 519603 (0.00089) [2022-07-10 02:04:55,918][26022] Updated weights on worker 0-0, policy_version 519613 (0.00085) [2022-07-10 02:04:57,102][25689] Fps is (10 sec: 5681.4, 60 sec: 5692.3, 300 sec: 5666.3). Total num frames: 532088832. Throughput: 0: 5151.0. Samples: 532082778. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:04:57,102][25689] Avg episode reward: [(0, '-32.299')] [2022-07-10 02:04:57,932][26022] Updated weights on worker 0-0, policy_version 519623 (0.00089) [2022-07-10 02:04:59,410][26022] Updated weights on worker 0-0, policy_version 519633 (0.00095) [2022-07-10 02:05:01,345][26022] Updated weights on worker 0-0, policy_version 519643 (0.00088) [2022-07-10 02:05:02,122][25689] Fps is (10 sec: 5704.5, 60 sec: 5682.1, 300 sec: 5673.5). Total num frames: 532117504. Throughput: 0: 6008.7. Samples: 532117228. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:02,124][25689] Avg episode reward: [(0, '-31.978')] [2022-07-10 02:05:03,498][26022] Updated weights on worker 0-0, policy_version 519653 (0.00082) [2022-07-10 02:05:05,325][26022] Updated weights on worker 0-0, policy_version 519663 (0.00093) [2022-07-10 02:05:07,135][25689] Fps is (10 sec: 5511.1, 60 sec: 5684.3, 300 sec: 5668.3). Total num frames: 532144128. Throughput: 0: 5869.0. Samples: 532149200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:07,137][25689] Avg episode reward: [(0, '-32.357')] [2022-07-10 02:05:07,232][26022] Updated weights on worker 0-0, policy_version 519673 (0.00089) [2022-07-10 02:05:09,027][26022] Updated weights on worker 0-0, policy_version 519683 (0.00090) [2022-07-10 02:05:10,636][26022] Updated weights on worker 0-0, policy_version 519693 (0.00092) [2022-07-10 02:05:12,193][25689] Fps is (10 sec: 5388.7, 60 sec: 5651.7, 300 sec: 5665.1). Total num frames: 532171776. Throughput: 0: 5018.1. Samples: 532166392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:12,194][25689] Avg episode reward: [(0, '-31.669')] [2022-07-10 02:05:12,669][26022] Updated weights on worker 0-0, policy_version 519703 (0.00089) [2022-07-10 02:05:14,204][26022] Updated weights on worker 0-0, policy_version 519713 (0.00087) [2022-07-10 02:05:16,165][26022] Updated weights on worker 0-0, policy_version 519723 (0.00089) [2022-07-10 02:05:17,204][25689] Fps is (10 sec: 5797.1, 60 sec: 5691.1, 300 sec: 5672.3). Total num frames: 532202496. Throughput: 0: 5866.2. Samples: 532200700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:17,206][25689] Avg episode reward: [(0, '-32.210')] [2022-07-10 02:05:17,679][26022] Updated weights on worker 0-0, policy_version 519733 (0.00087) [2022-07-10 02:05:19,708][26022] Updated weights on worker 0-0, policy_version 519743 (0.00091) [2022-07-10 02:05:21,399][26022] Updated weights on worker 0-0, policy_version 519753 (0.00086) [2022-07-10 02:05:22,217][25689] Fps is (10 sec: 5822.9, 60 sec: 5674.6, 300 sec: 5672.9). Total num frames: 532230144. Throughput: 0: 5856.2. Samples: 532234910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:22,218][25689] Avg episode reward: [(0, '-31.384')] [2022-07-10 02:05:23,251][26022] Updated weights on worker 0-0, policy_version 519763 (0.00091) [2022-07-10 02:05:24,995][26022] Updated weights on worker 0-0, policy_version 519773 (0.00089) [2022-07-10 02:05:26,861][26022] Updated weights on worker 0-0, policy_version 519783 (0.00087) [2022-07-10 02:05:27,223][25689] Fps is (10 sec: 5723.8, 60 sec: 5675.4, 300 sec: 5671.4). Total num frames: 532259840. Throughput: 0: 5131.7. Samples: 532252280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:27,224][25689] Avg episode reward: [(0, '-32.017')] [2022-07-10 02:05:28,675][26022] Updated weights on worker 0-0, policy_version 519793 (0.00102) [2022-07-10 02:05:30,550][26022] Updated weights on worker 0-0, policy_version 519803 (0.00097) [2022-07-10 02:05:32,276][25689] Fps is (10 sec: 5802.5, 60 sec: 5699.6, 300 sec: 5674.4). Total num frames: 532288512. Throughput: 0: 5960.7. Samples: 532286100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:32,277][25689] Avg episode reward: [(0, '-32.629')] [2022-07-10 02:05:32,283][26022] Updated weights on worker 0-0, policy_version 519813 (0.00097) [2022-07-10 02:05:34,114][26022] Updated weights on worker 0-0, policy_version 519823 (0.00083) [2022-07-10 02:05:35,647][26022] Updated weights on worker 0-0, policy_version 519833 (0.00084) [2022-07-10 02:05:37,291][25689] Fps is (10 sec: 5593.6, 60 sec: 5668.9, 300 sec: 5668.1). Total num frames: 532316160. Throughput: 0: 5971.5. Samples: 532320650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:37,292][25689] Avg episode reward: [(0, '-33.533')] [2022-07-10 02:05:37,766][26022] Updated weights on worker 0-0, policy_version 519843 (0.00089) [2022-07-10 02:05:39,408][26022] Updated weights on worker 0-0, policy_version 519853 (0.00089) [2022-07-10 02:05:41,300][26022] Updated weights on worker 0-0, policy_version 519863 (0.00088) [2022-07-10 02:05:42,311][25689] Fps is (10 sec: 5714.5, 60 sec: 5688.4, 300 sec: 5671.3). Total num frames: 532345856. Throughput: 0: 5973.1. Samples: 532354934. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:42,311][25689] Avg episode reward: [(0, '-32.780')] [2022-07-10 02:05:42,971][26022] Updated weights on worker 0-0, policy_version 519873 (0.00086) [2022-07-10 02:05:44,994][26022] Updated weights on worker 0-0, policy_version 519883 (0.00084) [2022-07-10 02:05:46,544][26022] Updated weights on worker 0-0, policy_version 519893 (0.00096) [2022-07-10 02:05:47,320][25689] Fps is (10 sec: 5717.9, 60 sec: 5670.8, 300 sec: 5668.5). Total num frames: 532373504. Throughput: 0: 5966.9. Samples: 532372200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:47,320][25689] Avg episode reward: [(0, '-32.957')] [2022-07-10 02:05:48,508][26022] Updated weights on worker 0-0, policy_version 519903 (0.00079) [2022-07-10 02:05:50,069][26022] Updated weights on worker 0-0, policy_version 519913 (0.00093) [2022-07-10 02:05:50,701][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:05:50,710][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000519915_532392960.pth [2022-07-10 02:05:50,717][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000517918_530348032.pth [2022-07-10 02:05:52,104][26022] Updated weights on worker 0-0, policy_version 519923 (0.00089) [2022-07-10 02:05:52,417][25689] Fps is (10 sec: 5775.7, 60 sec: 5701.4, 300 sec: 5673.7). Total num frames: 532404224. Throughput: 0: 5961.0. Samples: 532406158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:52,417][25689] Avg episode reward: [(0, '-33.022')] [2022-07-10 02:05:53,831][26022] Updated weights on worker 0-0, policy_version 519933 (0.00906) [2022-07-10 02:05:55,624][26022] Updated weights on worker 0-0, policy_version 519943 (0.00091) [2022-07-10 02:05:57,432][25689] Fps is (10 sec: 5771.9, 60 sec: 5686.0, 300 sec: 5673.8). Total num frames: 532431872. Throughput: 0: 5934.2. Samples: 532440172. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:05:57,434][25689] Avg episode reward: [(0, '-32.772')] [2022-07-10 02:05:57,442][26022] Updated weights on worker 0-0, policy_version 519953 (0.00086) [2022-07-10 02:05:59,362][26022] Updated weights on worker 0-0, policy_version 519963 (0.00303) [2022-07-10 02:06:01,119][26022] Updated weights on worker 0-0, policy_version 519973 (0.00089) [2022-07-10 02:06:02,475][25689] Fps is (10 sec: 5294.1, 60 sec: 5633.0, 300 sec: 5669.9). Total num frames: 532457472. Throughput: 0: 5083.6. Samples: 532457438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:06:02,476][25689] Avg episode reward: [(0, '-31.865')] [2022-07-10 02:06:03,252][26022] Updated weights on worker 0-0, policy_version 519983 (0.00090) [2022-07-10 02:06:04,822][26022] Updated weights on worker 0-0, policy_version 519993 (0.00089) [2022-07-10 02:06:06,676][26022] Updated weights on worker 0-0, policy_version 520003 (0.00088) [2022-07-10 02:06:07,555][25689] Fps is (10 sec: 5462.8, 60 sec: 5677.7, 300 sec: 5670.9). Total num frames: 532487168. Throughput: 0: 5799.3. Samples: 532489548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:06:07,555][25689] Avg episode reward: [(0, '-32.168')] [2022-07-10 02:06:08,692][26022] Updated weights on worker 0-0, policy_version 520013 (0.00095) [2022-07-10 02:06:10,420][26022] Updated weights on worker 0-0, policy_version 520023 (0.00087) [2022-07-10 02:06:12,193][26022] Updated weights on worker 0-0, policy_version 520033 (0.00087) [2022-07-10 02:06:12,671][25689] Fps is (10 sec: 5624.1, 60 sec: 5672.2, 300 sec: 5670.6). Total num frames: 532514816. Throughput: 0: 5812.1. Samples: 532523878. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:06:12,671][25689] Avg episode reward: [(0, '-32.195')] [2022-07-10 02:06:13,856][26022] Updated weights on worker 0-0, policy_version 520043 (0.00094) [2022-07-10 02:06:16,024][26022] Updated weights on worker 0-0, policy_version 520053 (0.00088) [2022-07-10 02:06:17,431][26022] Updated weights on worker 0-0, policy_version 520063 (0.00096) [2022-07-10 02:06:17,679][25689] Fps is (10 sec: 5663.8, 60 sec: 5655.5, 300 sec: 5670.5). Total num frames: 532544512. Throughput: 0: 4987.8. Samples: 532541160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:06:17,680][25689] Avg episode reward: [(0, '-31.602')] [2022-07-10 02:06:19,218][26022] Updated weights on worker 0-0, policy_version 520073 (0.00085) [2022-07-10 02:06:21,016][26022] Updated weights on worker 0-0, policy_version 520083 (0.00088) [2022-07-10 02:06:22,741][25689] Fps is (10 sec: 5897.8, 60 sec: 5684.8, 300 sec: 5673.1). Total num frames: 532574208. Throughput: 0: 5828.0. Samples: 532575550. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:06:22,742][25689] Avg episode reward: [(0, '-31.232')] [2022-07-10 02:06:22,999][26022] Updated weights on worker 0-0, policy_version 520093 (0.00093) [2022-07-10 02:06:24,898][26022] Updated weights on worker 0-0, policy_version 520103 (0.00101) [2022-07-10 02:06:26,443][26022] Updated weights on worker 0-0, policy_version 520113 (0.00083) [2022-07-10 02:06:27,802][25689] Fps is (10 sec: 5665.0, 60 sec: 5645.7, 300 sec: 5672.7). Total num frames: 532601856. Throughput: 0: 5935.0. Samples: 532609716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 02:06:27,803][25689] Avg episode reward: [(0, '-30.366')] [2022-07-10 02:06:28,348][26022] Updated weights on worker 0-0, policy_version 520123 (0.00085) [2022-07-10 02:06:30,238][26022] Updated weights on worker 0-0, policy_version 520133 (0.00084) [2022-07-10 02:06:31,982][26022] Updated weights on worker 0-0, policy_version 520143 (0.00081) [2022-07-10 02:06:32,866][25689] Fps is (10 sec: 5663.4, 60 sec: 5661.6, 300 sec: 5671.6). Total num frames: 532631552. Throughput: 0: 5084.5. Samples: 532626562. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:06:32,867][25689] Avg episode reward: [(0, '-31.093')] [2022-07-10 02:06:33,780][26022] Updated weights on worker 0-0, policy_version 520153 (0.00089) [2022-07-10 02:06:35,509][26022] Updated weights on worker 0-0, policy_version 520163 (0.00088) [2022-07-10 02:06:37,365][26022] Updated weights on worker 0-0, policy_version 520173 (0.00087) [2022-07-10 02:06:37,878][25689] Fps is (10 sec: 5589.6, 60 sec: 5645.1, 300 sec: 5664.7). Total num frames: 532658176. Throughput: 0: 5921.5. Samples: 532660764. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:06:37,879][25689] Avg episode reward: [(0, '-30.442')] [2022-07-10 02:06:39,192][26022] Updated weights on worker 0-0, policy_version 520183 (0.00083) [2022-07-10 02:06:41,033][26022] Updated weights on worker 0-0, policy_version 520193 (0.00093) [2022-07-10 02:06:42,807][26022] Updated weights on worker 0-0, policy_version 520203 (0.00087) [2022-07-10 02:06:42,906][25689] Fps is (10 sec: 5609.7, 60 sec: 5644.3, 300 sec: 5668.4). Total num frames: 532687872. Throughput: 0: 5925.8. Samples: 532695044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:06:42,907][25689] Avg episode reward: [(0, '-31.204')] [2022-07-10 02:06:44,437][26022] Updated weights on worker 0-0, policy_version 520213 (0.00088) [2022-07-10 02:06:46,406][26022] Updated weights on worker 0-0, policy_version 520223 (0.00087) [2022-07-10 02:06:47,968][25689] Fps is (10 sec: 5885.9, 60 sec: 5673.1, 300 sec: 5676.9). Total num frames: 532717568. Throughput: 0: 5082.3. Samples: 532712208. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:06:47,969][25689] Avg episode reward: [(0, '-31.695')] [2022-07-10 02:06:48,217][26022] Updated weights on worker 0-0, policy_version 520233 (0.00085) [2022-07-10 02:06:50,115][26022] Updated weights on worker 0-0, policy_version 520243 (0.00476) [2022-07-10 02:06:51,807][26022] Updated weights on worker 0-0, policy_version 520253 (0.00085) [2022-07-10 02:06:53,051][25689] Fps is (10 sec: 5753.7, 60 sec: 5640.7, 300 sec: 5668.9). Total num frames: 532746240. Throughput: 0: 5919.7. Samples: 532746046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:06:53,051][25689] Avg episode reward: [(0, '-32.456')] [2022-07-10 02:06:53,735][26022] Updated weights on worker 0-0, policy_version 520263 (0.00085) [2022-07-10 02:06:55,428][26022] Updated weights on worker 0-0, policy_version 520273 (0.00087) [2022-07-10 02:06:57,204][26022] Updated weights on worker 0-0, policy_version 520283 (0.00084) [2022-07-10 02:06:58,092][25689] Fps is (10 sec: 5563.2, 60 sec: 5638.3, 300 sec: 5664.7). Total num frames: 532773888. Throughput: 0: 5913.5. Samples: 532780300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:06:58,097][25689] Avg episode reward: [(0, '-33.531')] [2022-07-10 02:06:59,216][26022] Updated weights on worker 0-0, policy_version 520293 (0.00086) [2022-07-10 02:07:00,747][26022] Updated weights on worker 0-0, policy_version 520303 (0.00090) [2022-07-10 02:07:03,103][25689] Fps is (10 sec: 5297.1, 60 sec: 5641.2, 300 sec: 5668.0). Total num frames: 532799488. Throughput: 0: 5067.8. Samples: 532797396. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:03,103][25689] Avg episode reward: [(0, '-32.171')] [2022-07-10 02:07:03,111][26022] Updated weights on worker 0-0, policy_version 520313 (0.00088) [2022-07-10 02:07:04,634][26022] Updated weights on worker 0-0, policy_version 520323 (0.00082) [2022-07-10 02:07:06,587][26022] Updated weights on worker 0-0, policy_version 520333 (0.00084) [2022-07-10 02:07:08,142][25689] Fps is (10 sec: 5604.0, 60 sec: 5661.9, 300 sec: 5669.9). Total num frames: 532830208. Throughput: 0: 5827.7. Samples: 532829774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:08,142][25689] Avg episode reward: [(0, '-33.155')] [2022-07-10 02:07:08,192][26022] Updated weights on worker 0-0, policy_version 520343 (0.00093) [2022-07-10 02:07:10,116][26022] Updated weights on worker 0-0, policy_version 520353 (0.00079) [2022-07-10 02:07:11,938][26022] Updated weights on worker 0-0, policy_version 520363 (0.00087) [2022-07-10 02:07:13,208][25689] Fps is (10 sec: 5776.1, 60 sec: 5666.6, 300 sec: 5666.7). Total num frames: 532857856. Throughput: 0: 5849.6. Samples: 532863958. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:13,208][25689] Avg episode reward: [(0, '-32.616')] [2022-07-10 02:07:13,796][26022] Updated weights on worker 0-0, policy_version 520373 (0.00086) [2022-07-10 02:07:15,492][26022] Updated weights on worker 0-0, policy_version 520383 (0.00089) [2022-07-10 02:07:17,327][26022] Updated weights on worker 0-0, policy_version 520393 (0.00090) [2022-07-10 02:07:18,210][25689] Fps is (10 sec: 5594.0, 60 sec: 5650.3, 300 sec: 5667.1). Total num frames: 532886528. Throughput: 0: 5015.2. Samples: 532881194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:18,212][25689] Avg episode reward: [(0, '-31.962')] [2022-07-10 02:07:18,963][26022] Updated weights on worker 0-0, policy_version 520403 (0.00092) [2022-07-10 02:07:20,930][26022] Updated weights on worker 0-0, policy_version 520413 (0.00088) [2022-07-10 02:07:22,549][26022] Updated weights on worker 0-0, policy_version 520423 (0.00082) [2022-07-10 02:07:23,242][25689] Fps is (10 sec: 5816.8, 60 sec: 5653.0, 300 sec: 5670.2). Total num frames: 532916224. Throughput: 0: 5861.6. Samples: 532915446. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:23,243][25689] Avg episode reward: [(0, '-31.768')] [2022-07-10 02:07:24,609][26022] Updated weights on worker 0-0, policy_version 520433 (0.00094) [2022-07-10 02:07:26,181][26022] Updated weights on worker 0-0, policy_version 520443 (0.00088) [2022-07-10 02:07:28,082][26022] Updated weights on worker 0-0, policy_version 520453 (0.00089) [2022-07-10 02:07:28,278][25689] Fps is (10 sec: 5695.5, 60 sec: 5655.4, 300 sec: 5667.7). Total num frames: 532943872. Throughput: 0: 5950.9. Samples: 532949602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:28,279][25689] Avg episode reward: [(0, '-30.441')] [2022-07-10 02:07:29,850][26022] Updated weights on worker 0-0, policy_version 520463 (0.00082) [2022-07-10 02:07:31,754][26022] Updated weights on worker 0-0, policy_version 520473 (0.00087) [2022-07-10 02:07:33,334][25689] Fps is (10 sec: 5682.5, 60 sec: 5656.2, 300 sec: 5668.0). Total num frames: 532973568. Throughput: 0: 5100.9. Samples: 532966618. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:33,334][25689] Avg episode reward: [(0, '-28.965')] [2022-07-10 02:07:33,477][26022] Updated weights on worker 0-0, policy_version 520483 (0.00084) [2022-07-10 02:07:35,239][26022] Updated weights on worker 0-0, policy_version 520493 (0.00084) [2022-07-10 02:07:37,079][26022] Updated weights on worker 0-0, policy_version 520503 (0.00087) [2022-07-10 02:07:38,392][25689] Fps is (10 sec: 5771.1, 60 sec: 5685.7, 300 sec: 5671.7). Total num frames: 533002240. Throughput: 0: 5939.5. Samples: 533001066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:38,393][25689] Avg episode reward: [(0, '-29.492')] [2022-07-10 02:07:38,847][26022] Updated weights on worker 0-0, policy_version 520513 (0.00084) [2022-07-10 02:07:40,546][26022] Updated weights on worker 0-0, policy_version 520523 (0.00087) [2022-07-10 02:07:42,398][26022] Updated weights on worker 0-0, policy_version 520533 (0.00087) [2022-07-10 02:07:43,427][25689] Fps is (10 sec: 5681.5, 60 sec: 5668.2, 300 sec: 5672.8). Total num frames: 533030912. Throughput: 0: 5948.3. Samples: 533035510. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:43,427][25689] Avg episode reward: [(0, '-29.551')] [2022-07-10 02:07:44,111][26022] Updated weights on worker 0-0, policy_version 520543 (0.00101) [2022-07-10 02:07:46,081][26022] Updated weights on worker 0-0, policy_version 520553 (0.00082) [2022-07-10 02:07:47,556][26022] Updated weights on worker 0-0, policy_version 520563 (0.00092) [2022-07-10 02:07:48,430][25689] Fps is (10 sec: 5610.9, 60 sec: 5639.8, 300 sec: 5664.9). Total num frames: 533058560. Throughput: 0: 5110.5. Samples: 533052586. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:48,430][25689] Avg episode reward: [(0, '-29.222')] [2022-07-10 02:07:49,546][26022] Updated weights on worker 0-0, policy_version 520573 (0.00084) [2022-07-10 02:07:50,811][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:07:50,827][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000520581_533074944.pth [2022-07-10 02:07:50,827][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000518582_531027968.pth [2022-07-10 02:07:51,511][26022] Updated weights on worker 0-0, policy_version 520583 (0.00086) [2022-07-10 02:07:53,054][26022] Updated weights on worker 0-0, policy_version 520593 (0.00079) [2022-07-10 02:07:53,493][25689] Fps is (10 sec: 5900.1, 60 sec: 5692.4, 300 sec: 5678.2). Total num frames: 533090304. Throughput: 0: 5971.5. Samples: 533087000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:53,494][25689] Avg episode reward: [(0, '-30.344')] [2022-07-10 02:07:54,979][26022] Updated weights on worker 0-0, policy_version 520603 (0.00085) [2022-07-10 02:07:56,545][26022] Updated weights on worker 0-0, policy_version 520613 (0.00097) [2022-07-10 02:07:58,516][25689] Fps is (10 sec: 5888.2, 60 sec: 5694.1, 300 sec: 5675.1). Total num frames: 533117952. Throughput: 0: 5998.3. Samples: 533121778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:07:58,517][25689] Avg episode reward: [(0, '-30.487')] [2022-07-10 02:07:58,519][26022] Updated weights on worker 0-0, policy_version 520623 (0.00083) [2022-07-10 02:08:00,364][26022] Updated weights on worker 0-0, policy_version 520633 (0.00085) [2022-07-10 02:08:02,058][26022] Updated weights on worker 0-0, policy_version 520643 (0.00090) [2022-07-10 02:08:03,563][25689] Fps is (10 sec: 5186.2, 60 sec: 5673.8, 300 sec: 5664.3). Total num frames: 533142528. Throughput: 0: 5870.3. Samples: 533153714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:03,563][25689] Avg episode reward: [(0, '-31.735')] [2022-07-10 02:08:04,318][26022] Updated weights on worker 0-0, policy_version 520653 (0.00087) [2022-07-10 02:08:06,061][26022] Updated weights on worker 0-0, policy_version 520663 (0.00087) [2022-07-10 02:08:07,811][26022] Updated weights on worker 0-0, policy_version 520673 (0.00087) [2022-07-10 02:08:08,601][25689] Fps is (10 sec: 5584.6, 60 sec: 5690.8, 300 sec: 5675.4). Total num frames: 533174272. Throughput: 0: 5870.4. Samples: 533171000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:08,603][25689] Avg episode reward: [(0, '-30.553')] [2022-07-10 02:08:09,648][26022] Updated weights on worker 0-0, policy_version 520683 (0.00088) [2022-07-10 02:08:11,364][26022] Updated weights on worker 0-0, policy_version 520693 (0.00081) [2022-07-10 02:08:13,041][26022] Updated weights on worker 0-0, policy_version 520703 (0.00088) [2022-07-10 02:08:13,640][25689] Fps is (10 sec: 5995.1, 60 sec: 5710.3, 300 sec: 5672.5). Total num frames: 533202944. Throughput: 0: 5894.2. Samples: 533205752. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:13,641][25689] Avg episode reward: [(0, '-31.382')] [2022-07-10 02:08:14,954][26022] Updated weights on worker 0-0, policy_version 520713 (0.00093) [2022-07-10 02:08:16,689][26022] Updated weights on worker 0-0, policy_version 520723 (0.00093) [2022-07-10 02:08:18,609][26022] Updated weights on worker 0-0, policy_version 520733 (0.00707) [2022-07-10 02:08:18,687][25689] Fps is (10 sec: 5584.2, 60 sec: 5689.2, 300 sec: 5675.3). Total num frames: 533230592. Throughput: 0: 5858.5. Samples: 533239946. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:18,687][25689] Avg episode reward: [(0, '-32.766')] [2022-07-10 02:08:20,307][26022] Updated weights on worker 0-0, policy_version 520743 (0.00088) [2022-07-10 02:08:22,130][26022] Updated weights on worker 0-0, policy_version 520753 (0.00087) [2022-07-10 02:08:23,695][25689] Fps is (10 sec: 5601.4, 60 sec: 5674.5, 300 sec: 5668.3). Total num frames: 533259264. Throughput: 0: 5134.2. Samples: 533257076. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:23,695][25689] Avg episode reward: [(0, '-33.655')] [2022-07-10 02:08:23,911][26022] Updated weights on worker 0-0, policy_version 520763 (0.00085) [2022-07-10 02:08:25,938][26022] Updated weights on worker 0-0, policy_version 520773 (0.00093) [2022-07-10 02:08:27,494][26022] Updated weights on worker 0-0, policy_version 520783 (0.00086) [2022-07-10 02:08:28,697][25689] Fps is (10 sec: 5728.2, 60 sec: 5694.6, 300 sec: 5669.6). Total num frames: 533287936. Throughput: 0: 5985.1. Samples: 533291278. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:28,698][25689] Avg episode reward: [(0, '-33.110')] [2022-07-10 02:08:29,512][26022] Updated weights on worker 0-0, policy_version 520793 (0.00081) [2022-07-10 02:08:31,040][26022] Updated weights on worker 0-0, policy_version 520803 (0.00097) [2022-07-10 02:08:33,024][26022] Updated weights on worker 0-0, policy_version 520813 (0.00083) [2022-07-10 02:08:33,758][25689] Fps is (10 sec: 5799.9, 60 sec: 5694.1, 300 sec: 5680.2). Total num frames: 533317632. Throughput: 0: 5954.7. Samples: 533325550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:33,759][25689] Avg episode reward: [(0, '-33.252')] [2022-07-10 02:08:34,739][26022] Updated weights on worker 0-0, policy_version 520823 (0.00090) [2022-07-10 02:08:36,490][26022] Updated weights on worker 0-0, policy_version 520833 (0.00086) [2022-07-10 02:08:38,403][26022] Updated weights on worker 0-0, policy_version 520843 (0.00083) [2022-07-10 02:08:38,839][25689] Fps is (10 sec: 5654.4, 60 sec: 5675.1, 300 sec: 5672.3). Total num frames: 533345280. Throughput: 0: 5100.5. Samples: 533342732. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:38,839][25689] Avg episode reward: [(0, '-33.292')] [2022-07-10 02:08:40,087][26022] Updated weights on worker 0-0, policy_version 520853 (0.00091) [2022-07-10 02:08:41,775][26022] Updated weights on worker 0-0, policy_version 520863 (0.00088) [2022-07-10 02:08:43,746][26022] Updated weights on worker 0-0, policy_version 520873 (0.00084) [2022-07-10 02:08:43,847][25689] Fps is (10 sec: 5582.6, 60 sec: 5677.6, 300 sec: 5672.4). Total num frames: 533373952. Throughput: 0: 5956.7. Samples: 533377116. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:43,847][25689] Avg episode reward: [(0, '-31.711')] [2022-07-10 02:08:45,576][26022] Updated weights on worker 0-0, policy_version 520883 (0.00099) [2022-07-10 02:08:47,193][26022] Updated weights on worker 0-0, policy_version 520893 (0.00087) [2022-07-10 02:08:48,940][25689] Fps is (10 sec: 5778.4, 60 sec: 5703.0, 300 sec: 5675.2). Total num frames: 533403648. Throughput: 0: 5932.9. Samples: 533411374. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:48,940][25689] Avg episode reward: [(0, '-30.631')] [2022-07-10 02:08:49,111][26022] Updated weights on worker 0-0, policy_version 520903 (0.00086) [2022-07-10 02:08:50,796][26022] Updated weights on worker 0-0, policy_version 520913 (0.00089) [2022-07-10 02:08:52,799][26022] Updated weights on worker 0-0, policy_version 520923 (0.00084) [2022-07-10 02:08:54,007][25689] Fps is (10 sec: 5644.0, 60 sec: 5634.9, 300 sec: 5671.9). Total num frames: 533431296. Throughput: 0: 5065.0. Samples: 533428108. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:54,008][25689] Avg episode reward: [(0, '-30.271')] [2022-07-10 02:08:54,483][26022] Updated weights on worker 0-0, policy_version 520933 (0.00088) [2022-07-10 02:08:56,380][26022] Updated weights on worker 0-0, policy_version 520943 (0.00091) [2022-07-10 02:08:58,131][26022] Updated weights on worker 0-0, policy_version 520953 (0.00089) [2022-07-10 02:08:59,036][25689] Fps is (10 sec: 5578.3, 60 sec: 5651.3, 300 sec: 5669.6). Total num frames: 533459968. Throughput: 0: 5899.9. Samples: 533461894. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:08:59,037][25689] Avg episode reward: [(0, '-30.008')] [2022-07-10 02:09:00,131][26022] Updated weights on worker 0-0, policy_version 520963 (0.00091) [2022-07-10 02:09:02,062][26022] Updated weights on worker 0-0, policy_version 520973 (0.00101) [2022-07-10 02:09:04,038][25689] Fps is (10 sec: 5410.3, 60 sec: 5672.4, 300 sec: 5666.9). Total num frames: 533485568. Throughput: 0: 5772.9. Samples: 533493680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:09:04,039][25689] Avg episode reward: [(0, '-29.904')] [2022-07-10 02:09:04,061][26022] Updated weights on worker 0-0, policy_version 520983 (0.00088) [2022-07-10 02:09:05,854][26022] Updated weights on worker 0-0, policy_version 520993 (0.00097) [2022-07-10 02:09:07,527][26022] Updated weights on worker 0-0, policy_version 521003 (0.00085) [2022-07-10 02:09:09,059][25689] Fps is (10 sec: 5517.2, 60 sec: 5640.2, 300 sec: 5667.8). Total num frames: 533515264. Throughput: 0: 4941.5. Samples: 533510790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:09:09,059][25689] Avg episode reward: [(0, '-30.252')] [2022-07-10 02:09:09,513][26022] Updated weights on worker 0-0, policy_version 521013 (0.00093) [2022-07-10 02:09:11,259][26022] Updated weights on worker 0-0, policy_version 521023 (0.00499) [2022-07-10 02:09:13,069][26022] Updated weights on worker 0-0, policy_version 521033 (0.00085) [2022-07-10 02:09:14,165][25689] Fps is (10 sec: 5763.7, 60 sec: 5633.9, 300 sec: 5667.1). Total num frames: 533543936. Throughput: 0: 5801.1. Samples: 533545048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:09:14,166][25689] Avg episode reward: [(0, '-29.931')] [2022-07-10 02:09:15,059][26022] Updated weights on worker 0-0, policy_version 521043 (0.00087) [2022-07-10 02:09:16,507][26022] Updated weights on worker 0-0, policy_version 521053 (0.00094) [2022-07-10 02:09:18,607][26022] Updated weights on worker 0-0, policy_version 521063 (0.00087) [2022-07-10 02:09:19,243][25689] Fps is (10 sec: 5731.0, 60 sec: 5664.8, 300 sec: 5669.4). Total num frames: 533573632. Throughput: 0: 5809.5. Samples: 533579288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 02:09:19,254][25689] Avg episode reward: [(0, '-29.297')] [2022-07-10 02:09:20,075][26022] Updated weights on worker 0-0, policy_version 521073 (0.00102) [2022-07-10 02:09:22,110][26022] Updated weights on worker 0-0, policy_version 521083 (0.00092) [2022-07-10 02:09:23,970][26022] Updated weights on worker 0-0, policy_version 521093 (0.00088) [2022-07-10 02:09:24,271][25689] Fps is (10 sec: 5573.0, 60 sec: 5629.2, 300 sec: 5658.8). Total num frames: 533600256. Throughput: 0: 5082.7. Samples: 533596516. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:24,272][25689] Avg episode reward: [(0, '-28.789')] [2022-07-10 02:09:25,543][26022] Updated weights on worker 0-0, policy_version 521103 (0.00083) [2022-07-10 02:09:27,425][26022] Updated weights on worker 0-0, policy_version 521113 (0.00088) [2022-07-10 02:09:29,111][26022] Updated weights on worker 0-0, policy_version 521123 (0.00080) [2022-07-10 02:09:29,333][25689] Fps is (10 sec: 5581.7, 60 sec: 5640.5, 300 sec: 5667.0). Total num frames: 533629952. Throughput: 0: 5919.2. Samples: 533630800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:29,334][25689] Avg episode reward: [(0, '-28.776')] [2022-07-10 02:09:31,054][26022] Updated weights on worker 0-0, policy_version 521133 (0.00410) [2022-07-10 02:09:32,967][26022] Updated weights on worker 0-0, policy_version 521143 (0.00089) [2022-07-10 02:09:34,434][25689] Fps is (10 sec: 5843.9, 60 sec: 5636.8, 300 sec: 5666.0). Total num frames: 533659648. Throughput: 0: 5932.9. Samples: 533665302. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:34,434][25689] Avg episode reward: [(0, '-29.098')] [2022-07-10 02:09:34,457][26022] Updated weights on worker 0-0, policy_version 521153 (0.00087) [2022-07-10 02:09:36,496][26022] Updated weights on worker 0-0, policy_version 521163 (0.00090) [2022-07-10 02:09:37,866][26022] Updated weights on worker 0-0, policy_version 521173 (0.00098) [2022-07-10 02:09:39,447][25689] Fps is (10 sec: 5670.1, 60 sec: 5643.1, 300 sec: 5663.2). Total num frames: 533687296. Throughput: 0: 5113.2. Samples: 533682594. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:39,447][25689] Avg episode reward: [(0, '-30.459')] [2022-07-10 02:09:40,069][26022] Updated weights on worker 0-0, policy_version 521183 (0.00088) [2022-07-10 02:09:41,501][26022] Updated weights on worker 0-0, policy_version 521193 (0.00084) [2022-07-10 02:09:43,475][26022] Updated weights on worker 0-0, policy_version 521203 (0.00595) [2022-07-10 02:09:44,449][25689] Fps is (10 sec: 5827.9, 60 sec: 5677.4, 300 sec: 5670.1). Total num frames: 533718016. Throughput: 0: 5966.2. Samples: 533716904. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:44,450][25689] Avg episode reward: [(0, '-32.084')] [2022-07-10 02:09:45,222][26022] Updated weights on worker 0-0, policy_version 521213 (0.00093) [2022-07-10 02:09:47,084][26022] Updated weights on worker 0-0, policy_version 521223 (0.00088) [2022-07-10 02:09:48,815][26022] Updated weights on worker 0-0, policy_version 521233 (0.00071) [2022-07-10 02:09:49,457][25689] Fps is (10 sec: 5831.1, 60 sec: 5651.6, 300 sec: 5667.7). Total num frames: 533745664. Throughput: 0: 5988.6. Samples: 533751310. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:49,457][25689] Avg episode reward: [(0, '-32.059')] [2022-07-10 02:09:50,671][26022] Updated weights on worker 0-0, policy_version 521243 (0.00089) [2022-07-10 02:09:50,846][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:09:50,862][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000521244_533753856.pth [2022-07-10 02:09:50,862][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000519248_531709952.pth [2022-07-10 02:09:52,455][26022] Updated weights on worker 0-0, policy_version 521253 (0.00086) [2022-07-10 02:09:54,133][26022] Updated weights on worker 0-0, policy_version 521263 (0.00083) [2022-07-10 02:09:54,527][25689] Fps is (10 sec: 5588.7, 60 sec: 5668.2, 300 sec: 5667.0). Total num frames: 533774336. Throughput: 0: 5137.3. Samples: 533768526. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:54,527][25689] Avg episode reward: [(0, '-32.624')] [2022-07-10 02:09:56,068][26022] Updated weights on worker 0-0, policy_version 521273 (0.00093) [2022-07-10 02:09:57,802][26022] Updated weights on worker 0-0, policy_version 521283 (0.00087) [2022-07-10 02:09:59,536][25689] Fps is (10 sec: 5791.1, 60 sec: 5687.1, 300 sec: 5670.6). Total num frames: 533804032. Throughput: 0: 5977.2. Samples: 533802668. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:09:59,536][25689] Avg episode reward: [(0, '-32.888')] [2022-07-10 02:09:59,547][26022] Updated weights on worker 0-0, policy_version 521293 (0.00563) [2022-07-10 02:10:01,395][26022] Updated weights on worker 0-0, policy_version 521303 (0.00056) [2022-07-10 02:10:03,504][26022] Updated weights on worker 0-0, policy_version 521313 (0.00096) [2022-07-10 02:10:04,591][25689] Fps is (10 sec: 5494.6, 60 sec: 5682.1, 300 sec: 5666.4). Total num frames: 533829632. Throughput: 0: 5852.5. Samples: 533834780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:04,591][25689] Avg episode reward: [(0, '-31.417')] [2022-07-10 02:10:05,520][26022] Updated weights on worker 0-0, policy_version 521323 (0.00090) [2022-07-10 02:10:07,109][26022] Updated weights on worker 0-0, policy_version 521333 (0.00086) [2022-07-10 02:10:08,891][26022] Updated weights on worker 0-0, policy_version 521343 (0.00083) [2022-07-10 02:10:09,607][25689] Fps is (10 sec: 5287.3, 60 sec: 5648.7, 300 sec: 5667.2). Total num frames: 533857280. Throughput: 0: 4985.7. Samples: 533851772. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:09,607][25689] Avg episode reward: [(0, '-31.256')] [2022-07-10 02:10:10,919][26022] Updated weights on worker 0-0, policy_version 521353 (0.00083) [2022-07-10 02:10:12,422][26022] Updated weights on worker 0-0, policy_version 521363 (0.00088) [2022-07-10 02:10:14,387][26022] Updated weights on worker 0-0, policy_version 521373 (0.00084) [2022-07-10 02:10:14,645][25689] Fps is (10 sec: 5805.1, 60 sec: 5688.9, 300 sec: 5666.7). Total num frames: 533888000. Throughput: 0: 5852.3. Samples: 533886264. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:14,646][25689] Avg episode reward: [(0, '-30.807')] [2022-07-10 02:10:16,163][26022] Updated weights on worker 0-0, policy_version 521383 (0.00090) [2022-07-10 02:10:17,785][26022] Updated weights on worker 0-0, policy_version 521393 (0.00087) [2022-07-10 02:10:19,683][25689] Fps is (10 sec: 5690.7, 60 sec: 5641.8, 300 sec: 5662.8). Total num frames: 533914624. Throughput: 0: 5865.6. Samples: 533920846. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:19,684][25689] Avg episode reward: [(0, '-30.846')] [2022-07-10 02:10:19,926][26022] Updated weights on worker 0-0, policy_version 521403 (0.00085) [2022-07-10 02:10:21,493][26022] Updated weights on worker 0-0, policy_version 521413 (0.00084) [2022-07-10 02:10:23,295][26022] Updated weights on worker 0-0, policy_version 521423 (0.00090) [2022-07-10 02:10:24,697][25689] Fps is (10 sec: 5602.8, 60 sec: 5693.9, 300 sec: 5662.6). Total num frames: 533944320. Throughput: 0: 5133.4. Samples: 533937996. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:24,698][25689] Avg episode reward: [(0, '-32.263')] [2022-07-10 02:10:25,171][26022] Updated weights on worker 0-0, policy_version 521433 (0.00095) [2022-07-10 02:10:26,991][26022] Updated weights on worker 0-0, policy_version 521443 (0.00097) [2022-07-10 02:10:28,826][26022] Updated weights on worker 0-0, policy_version 521453 (0.00083) [2022-07-10 02:10:29,703][25689] Fps is (10 sec: 5825.4, 60 sec: 5682.3, 300 sec: 5663.5). Total num frames: 533972992. Throughput: 0: 5980.6. Samples: 533971960. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:29,703][25689] Avg episode reward: [(0, '-32.540')] [2022-07-10 02:10:30,471][26022] Updated weights on worker 0-0, policy_version 521463 (0.00091) [2022-07-10 02:10:32,307][26022] Updated weights on worker 0-0, policy_version 521473 (0.00087) [2022-07-10 02:10:34,125][26022] Updated weights on worker 0-0, policy_version 521483 (0.00070) [2022-07-10 02:10:34,761][25689] Fps is (10 sec: 5697.9, 60 sec: 5669.3, 300 sec: 5666.1). Total num frames: 534001664. Throughput: 0: 5965.6. Samples: 534006268. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:34,762][25689] Avg episode reward: [(0, '-33.829')] [2022-07-10 02:10:36,226][26022] Updated weights on worker 0-0, policy_version 521493 (0.00095) [2022-07-10 02:10:37,524][26022] Updated weights on worker 0-0, policy_version 521503 (0.00086) [2022-07-10 02:10:39,694][26022] Updated weights on worker 0-0, policy_version 521513 (0.00086) [2022-07-10 02:10:39,794][25689] Fps is (10 sec: 5784.1, 60 sec: 5701.4, 300 sec: 5665.9). Total num frames: 534031360. Throughput: 0: 5099.1. Samples: 534023392. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:39,795][25689] Avg episode reward: [(0, '-32.873')] [2022-07-10 02:10:41,182][26022] Updated weights on worker 0-0, policy_version 521523 (0.00081) [2022-07-10 02:10:43,087][26022] Updated weights on worker 0-0, policy_version 521533 (0.00094) [2022-07-10 02:10:44,797][25689] Fps is (10 sec: 5713.9, 60 sec: 5650.4, 300 sec: 5666.0). Total num frames: 534059008. Throughput: 0: 5963.2. Samples: 534057856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:44,798][25689] Avg episode reward: [(0, '-32.902')] [2022-07-10 02:10:44,948][26022] Updated weights on worker 0-0, policy_version 521543 (0.00084) [2022-07-10 02:10:46,605][26022] Updated weights on worker 0-0, policy_version 521553 (0.00092) [2022-07-10 02:10:48,432][26022] Updated weights on worker 0-0, policy_version 521563 (0.00090) [2022-07-10 02:10:49,803][25689] Fps is (10 sec: 5729.6, 60 sec: 5684.6, 300 sec: 5664.3). Total num frames: 534088704. Throughput: 0: 5979.8. Samples: 534092152. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:49,803][25689] Avg episode reward: [(0, '-32.643')] [2022-07-10 02:10:50,237][26022] Updated weights on worker 0-0, policy_version 521573 (0.00088) [2022-07-10 02:10:52,166][26022] Updated weights on worker 0-0, policy_version 521583 (0.00897) [2022-07-10 02:10:53,831][26022] Updated weights on worker 0-0, policy_version 521593 (0.00090) [2022-07-10 02:10:54,871][25689] Fps is (10 sec: 5692.3, 60 sec: 5667.7, 300 sec: 5663.3). Total num frames: 534116352. Throughput: 0: 5968.2. Samples: 534126288. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:54,872][25689] Avg episode reward: [(0, '-31.813')] [2022-07-10 02:10:55,677][26022] Updated weights on worker 0-0, policy_version 521603 (0.00085) [2022-07-10 02:10:57,433][26022] Updated weights on worker 0-0, policy_version 521613 (0.00087) [2022-07-10 02:10:59,257][26022] Updated weights on worker 0-0, policy_version 521623 (0.00090) [2022-07-10 02:10:59,915][25689] Fps is (10 sec: 5569.4, 60 sec: 5647.5, 300 sec: 5673.6). Total num frames: 534145024. Throughput: 0: 5962.3. Samples: 534143358. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:10:59,917][25689] Avg episode reward: [(0, '-32.073')] [2022-07-10 02:11:01,151][26022] Updated weights on worker 0-0, policy_version 521633 (0.00087) [2022-07-10 02:11:03,128][26022] Updated weights on worker 0-0, policy_version 521643 (0.00094) [2022-07-10 02:11:04,863][26022] Updated weights on worker 0-0, policy_version 521653 (0.00087) [2022-07-10 02:11:04,920][25689] Fps is (10 sec: 5604.8, 60 sec: 5686.1, 300 sec: 5668.1). Total num frames: 534172672. Throughput: 0: 5851.0. Samples: 534175592. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:04,920][25689] Avg episode reward: [(0, '-31.784')] [2022-07-10 02:11:06,976][26022] Updated weights on worker 0-0, policy_version 521663 (0.00092) [2022-07-10 02:11:08,457][26022] Updated weights on worker 0-0, policy_version 521673 (0.00080) [2022-07-10 02:11:09,924][25689] Fps is (10 sec: 5422.4, 60 sec: 5670.3, 300 sec: 5666.8). Total num frames: 534199296. Throughput: 0: 5843.8. Samples: 534209738. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:09,924][25689] Avg episode reward: [(0, '-33.187')] [2022-07-10 02:11:10,484][26022] Updated weights on worker 0-0, policy_version 521683 (0.00097) [2022-07-10 02:11:12,080][26022] Updated weights on worker 0-0, policy_version 521693 (0.00085) [2022-07-10 02:11:14,013][26022] Updated weights on worker 0-0, policy_version 521703 (0.00086) [2022-07-10 02:11:15,049][25689] Fps is (10 sec: 5661.3, 60 sec: 5662.2, 300 sec: 5668.0). Total num frames: 534230016. Throughput: 0: 4986.4. Samples: 534226902. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:15,052][25689] Avg episode reward: [(0, '-32.724')] [2022-07-10 02:11:15,829][26022] Updated weights on worker 0-0, policy_version 521713 (0.00090) [2022-07-10 02:11:17,624][26022] Updated weights on worker 0-0, policy_version 521723 (0.00093) [2022-07-10 02:11:19,409][26022] Updated weights on worker 0-0, policy_version 521733 (0.00092) [2022-07-10 02:11:20,118][25689] Fps is (10 sec: 5826.1, 60 sec: 5693.2, 300 sec: 5664.5). Total num frames: 534258688. Throughput: 0: 5825.9. Samples: 534261058. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:20,120][25689] Avg episode reward: [(0, '-31.721')] [2022-07-10 02:11:21,248][26022] Updated weights on worker 0-0, policy_version 521743 (0.00088) [2022-07-10 02:11:23,057][26022] Updated weights on worker 0-0, policy_version 521753 (0.00087) [2022-07-10 02:11:24,827][26022] Updated weights on worker 0-0, policy_version 521763 (0.00086) [2022-07-10 02:11:25,193][25689] Fps is (10 sec: 5652.7, 60 sec: 5670.5, 300 sec: 5667.6). Total num frames: 534287360. Throughput: 0: 5901.4. Samples: 534295234. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:25,194][25689] Avg episode reward: [(0, '-32.632')] [2022-07-10 02:11:26,610][26022] Updated weights on worker 0-0, policy_version 521773 (0.00096) [2022-07-10 02:11:28,485][26022] Updated weights on worker 0-0, policy_version 521783 (0.00090) [2022-07-10 02:11:30,255][25689] Fps is (10 sec: 5556.1, 60 sec: 5648.3, 300 sec: 5660.8). Total num frames: 534315008. Throughput: 0: 5010.1. Samples: 534311598. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:30,255][25689] Avg episode reward: [(0, '-32.999')] [2022-07-10 02:11:30,301][26022] Updated weights on worker 0-0, policy_version 521793 (0.00092) [2022-07-10 02:11:32,139][26022] Updated weights on worker 0-0, policy_version 521803 (0.00094) [2022-07-10 02:11:33,934][26022] Updated weights on worker 0-0, policy_version 521813 (0.00090) [2022-07-10 02:11:35,398][25689] Fps is (10 sec: 5619.4, 60 sec: 5657.3, 300 sec: 5668.6). Total num frames: 534344704. Throughput: 0: 5844.5. Samples: 534345832. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:35,398][25689] Avg episode reward: [(0, '-33.361')] [2022-07-10 02:11:35,687][26022] Updated weights on worker 0-0, policy_version 521823 (0.00094) [2022-07-10 02:11:37,596][26022] Updated weights on worker 0-0, policy_version 521833 (0.00092) [2022-07-10 02:11:39,174][26022] Updated weights on worker 0-0, policy_version 521843 (0.00082) [2022-07-10 02:11:40,468][25689] Fps is (10 sec: 5614.5, 60 sec: 5620.1, 300 sec: 5660.9). Total num frames: 534372352. Throughput: 0: 5854.2. Samples: 534380192. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:40,469][25689] Avg episode reward: [(0, '-32.646')] [2022-07-10 02:11:41,137][26022] Updated weights on worker 0-0, policy_version 521853 (0.00088) [2022-07-10 02:11:42,800][26022] Updated weights on worker 0-0, policy_version 521863 (0.00090) [2022-07-10 02:11:44,623][26022] Updated weights on worker 0-0, policy_version 521873 (0.00089) [2022-07-10 02:11:45,485][25689] Fps is (10 sec: 5685.0, 60 sec: 5652.6, 300 sec: 5661.8). Total num frames: 534402048. Throughput: 0: 5029.7. Samples: 534397294. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:45,485][25689] Avg episode reward: [(0, '-32.009')] [2022-07-10 02:11:46,460][26022] Updated weights on worker 0-0, policy_version 521883 (0.00088) [2022-07-10 02:11:48,194][26022] Updated weights on worker 0-0, policy_version 521893 (0.00089) [2022-07-10 02:11:49,998][26022] Updated weights on worker 0-0, policy_version 521903 (0.00084) [2022-07-10 02:11:50,497][25689] Fps is (10 sec: 5820.0, 60 sec: 5635.0, 300 sec: 5663.1). Total num frames: 534430720. Throughput: 0: 5918.6. Samples: 534431406. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:50,498][25689] Avg episode reward: [(0, '-32.725')] [2022-07-10 02:11:50,943][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:11:50,954][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000521907_534432768.pth [2022-07-10 02:11:50,954][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000519915_532392960.pth [2022-07-10 02:11:51,835][26022] Updated weights on worker 0-0, policy_version 521913 (0.00087) [2022-07-10 02:11:53,793][26022] Updated weights on worker 0-0, policy_version 521923 (0.00085) [2022-07-10 02:11:55,502][26022] Updated weights on worker 0-0, policy_version 521933 (0.00093) [2022-07-10 02:11:55,564][25689] Fps is (10 sec: 5689.6, 60 sec: 5652.1, 300 sec: 5666.1). Total num frames: 534459392. Throughput: 0: 5948.0. Samples: 534465778. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:11:55,564][25689] Avg episode reward: [(0, '-31.742')] [2022-07-10 02:11:57,266][26022] Updated weights on worker 0-0, policy_version 521943 (0.00094) [2022-07-10 02:11:58,881][26022] Updated weights on worker 0-0, policy_version 521953 (0.00096) [2022-07-10 02:12:00,604][25689] Fps is (10 sec: 5673.7, 60 sec: 5652.4, 300 sec: 5675.9). Total num frames: 534488064. Throughput: 0: 5111.4. Samples: 534483114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:12:00,605][25689] Avg episode reward: [(0, '-30.969')] [2022-07-10 02:12:00,808][26022] Updated weights on worker 0-0, policy_version 521963 (0.00091) [2022-07-10 02:12:03,038][26022] Updated weights on worker 0-0, policy_version 521973 (0.00089) [2022-07-10 02:12:04,824][26022] Updated weights on worker 0-0, policy_version 521983 (0.00087) [2022-07-10 02:12:05,608][25689] Fps is (10 sec: 5606.8, 60 sec: 5652.4, 300 sec: 5666.2). Total num frames: 534515712. Throughput: 0: 5864.8. Samples: 534515316. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:12:05,609][25689] Avg episode reward: [(0, '-32.895')] [2022-07-10 02:12:06,822][26022] Updated weights on worker 0-0, policy_version 521993 (0.00085) [2022-07-10 02:12:08,301][26022] Updated weights on worker 0-0, policy_version 522003 (0.00098) [2022-07-10 02:12:10,407][26022] Updated weights on worker 0-0, policy_version 522013 (0.00095) [2022-07-10 02:12:10,639][25689] Fps is (10 sec: 5306.3, 60 sec: 5633.1, 300 sec: 5660.0). Total num frames: 534541312. Throughput: 0: 5863.7. Samples: 534549512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:12:10,640][25689] Avg episode reward: [(0, '-33.614')] [2022-07-10 02:12:11,876][26022] Updated weights on worker 0-0, policy_version 522023 (0.00094) [2022-07-10 02:12:13,885][26022] Updated weights on worker 0-0, policy_version 522033 (0.00087) [2022-07-10 02:12:15,548][26022] Updated weights on worker 0-0, policy_version 522043 (0.00093) [2022-07-10 02:12:15,727][25689] Fps is (10 sec: 5566.1, 60 sec: 5636.6, 300 sec: 5665.2). Total num frames: 534572032. Throughput: 0: 5002.5. Samples: 534566646. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 02:12:15,727][25689] Avg episode reward: [(0, '-33.576')] [2022-07-10 02:12:17,357][26022] Updated weights on worker 0-0, policy_version 522053 (0.00093) [2022-07-10 02:12:19,258][26022] Updated weights on worker 0-0, policy_version 522063 (0.00090) [2022-07-10 02:12:20,755][25689] Fps is (10 sec: 5972.3, 60 sec: 5657.3, 300 sec: 5665.3). Total num frames: 534601728. Throughput: 0: 5856.3. Samples: 534601124. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:20,756][25689] Avg episode reward: [(0, '-33.480')] [2022-07-10 02:12:20,991][26022] Updated weights on worker 0-0, policy_version 522073 (0.00085) [2022-07-10 02:12:22,760][26022] Updated weights on worker 0-0, policy_version 522083 (0.00092) [2022-07-10 02:12:24,706][26022] Updated weights on worker 0-0, policy_version 522093 (0.00088) [2022-07-10 02:12:25,787][25689] Fps is (10 sec: 5700.3, 60 sec: 5644.4, 300 sec: 5665.4). Total num frames: 534629376. Throughput: 0: 5956.5. Samples: 534635508. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:25,787][25689] Avg episode reward: [(0, '-32.600')] [2022-07-10 02:12:26,258][26022] Updated weights on worker 0-0, policy_version 522103 (0.00088) [2022-07-10 02:12:28,175][26022] Updated weights on worker 0-0, policy_version 522113 (0.00096) [2022-07-10 02:12:29,851][26022] Updated weights on worker 0-0, policy_version 522123 (0.00087) [2022-07-10 02:12:30,791][25689] Fps is (10 sec: 5612.1, 60 sec: 5666.7, 300 sec: 5662.9). Total num frames: 534658048. Throughput: 0: 5113.0. Samples: 534652546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:30,791][25689] Avg episode reward: [(0, '-32.638')] [2022-07-10 02:12:31,782][26022] Updated weights on worker 0-0, policy_version 522133 (0.00087) [2022-07-10 02:12:33,457][26022] Updated weights on worker 0-0, policy_version 522143 (0.00092) [2022-07-10 02:12:35,462][26022] Updated weights on worker 0-0, policy_version 522153 (0.00094) [2022-07-10 02:12:35,854][25689] Fps is (10 sec: 5695.8, 60 sec: 5657.2, 300 sec: 5662.8). Total num frames: 534686720. Throughput: 0: 5961.0. Samples: 534686626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:35,856][25689] Avg episode reward: [(0, '-32.608')] [2022-07-10 02:12:37,015][26022] Updated weights on worker 0-0, policy_version 522163 (0.00086) [2022-07-10 02:12:38,938][26022] Updated weights on worker 0-0, policy_version 522173 (0.00086) [2022-07-10 02:12:40,698][26022] Updated weights on worker 0-0, policy_version 522183 (0.00090) [2022-07-10 02:12:40,881][25689] Fps is (10 sec: 5683.2, 60 sec: 5678.3, 300 sec: 5663.0). Total num frames: 534715392. Throughput: 0: 5957.3. Samples: 534721018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:40,881][25689] Avg episode reward: [(0, '-31.844')] [2022-07-10 02:12:42,509][26022] Updated weights on worker 0-0, policy_version 522193 (0.00091) [2022-07-10 02:12:44,394][26022] Updated weights on worker 0-0, policy_version 522203 (0.00086) [2022-07-10 02:12:45,896][25689] Fps is (10 sec: 5812.7, 60 sec: 5678.4, 300 sec: 5669.6). Total num frames: 534745088. Throughput: 0: 5108.4. Samples: 534738234. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:45,896][25689] Avg episode reward: [(0, '-31.706')] [2022-07-10 02:12:45,968][26022] Updated weights on worker 0-0, policy_version 522213 (0.00093) [2022-07-10 02:12:47,965][26022] Updated weights on worker 0-0, policy_version 522223 (0.00086) [2022-07-10 02:12:49,760][26022] Updated weights on worker 0-0, policy_version 522233 (0.00086) [2022-07-10 02:12:50,908][25689] Fps is (10 sec: 5718.8, 60 sec: 5661.5, 300 sec: 5656.8). Total num frames: 534772736. Throughput: 0: 5959.8. Samples: 534772442. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:50,909][25689] Avg episode reward: [(0, '-32.115')] [2022-07-10 02:12:51,343][26022] Updated weights on worker 0-0, policy_version 522243 (0.00084) [2022-07-10 02:12:53,187][26022] Updated weights on worker 0-0, policy_version 522253 (0.00091) [2022-07-10 02:12:54,871][26022] Updated weights on worker 0-0, policy_version 522263 (0.00088) [2022-07-10 02:12:55,951][25689] Fps is (10 sec: 5703.1, 60 sec: 5680.7, 300 sec: 5663.4). Total num frames: 534802432. Throughput: 0: 5997.6. Samples: 534807156. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:12:55,951][25689] Avg episode reward: [(0, '-33.704')] [2022-07-10 02:12:56,714][26022] Updated weights on worker 0-0, policy_version 522273 (0.00089) [2022-07-10 02:12:58,451][26022] Updated weights on worker 0-0, policy_version 522283 (0.00146) [2022-07-10 02:13:00,409][26022] Updated weights on worker 0-0, policy_version 522293 (0.00085) [2022-07-10 02:13:00,986][25689] Fps is (10 sec: 5791.7, 60 sec: 5681.2, 300 sec: 5677.4). Total num frames: 534831104. Throughput: 0: 5145.8. Samples: 534824474. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:00,986][25689] Avg episode reward: [(0, '-32.950')] [2022-07-10 02:13:02,124][26022] Updated weights on worker 0-0, policy_version 522303 (0.00089) [2022-07-10 02:13:04,367][26022] Updated weights on worker 0-0, policy_version 522313 (0.00085) [2022-07-10 02:13:06,047][25689] Fps is (10 sec: 5477.0, 60 sec: 5658.9, 300 sec: 5659.7). Total num frames: 534857728. Throughput: 0: 5868.3. Samples: 534856486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:06,048][25689] Avg episode reward: [(0, '-31.643')] [2022-07-10 02:13:06,209][26022] Updated weights on worker 0-0, policy_version 522323 (0.00087) [2022-07-10 02:13:07,818][26022] Updated weights on worker 0-0, policy_version 522333 (0.00082) [2022-07-10 02:13:09,902][26022] Updated weights on worker 0-0, policy_version 522343 (0.00089) [2022-07-10 02:13:11,054][25689] Fps is (10 sec: 5492.5, 60 sec: 5712.0, 300 sec: 5660.3). Total num frames: 534886400. Throughput: 0: 5867.8. Samples: 534890652. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:11,054][25689] Avg episode reward: [(0, '-30.788')] [2022-07-10 02:13:11,447][26022] Updated weights on worker 0-0, policy_version 522353 (0.00086) [2022-07-10 02:13:13,345][26022] Updated weights on worker 0-0, policy_version 522363 (0.00082) [2022-07-10 02:13:15,007][26022] Updated weights on worker 0-0, policy_version 522373 (0.00876) [2022-07-10 02:13:16,095][25689] Fps is (10 sec: 5707.1, 60 sec: 5682.5, 300 sec: 5663.9). Total num frames: 534915072. Throughput: 0: 4994.1. Samples: 534907756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:16,095][25689] Avg episode reward: [(0, '-31.686')] [2022-07-10 02:13:16,807][26022] Updated weights on worker 0-0, policy_version 522383 (0.00085) [2022-07-10 02:13:18,667][26022] Updated weights on worker 0-0, policy_version 522393 (0.00083) [2022-07-10 02:13:20,595][26022] Updated weights on worker 0-0, policy_version 522403 (0.00089) [2022-07-10 02:13:21,131][25689] Fps is (10 sec: 5690.5, 60 sec: 5664.8, 300 sec: 5663.3). Total num frames: 534943744. Throughput: 0: 5849.7. Samples: 534942316. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:21,131][25689] Avg episode reward: [(0, '-29.875')] [2022-07-10 02:13:22,151][26022] Updated weights on worker 0-0, policy_version 522413 (0.00079) [2022-07-10 02:13:24,031][26022] Updated weights on worker 0-0, policy_version 522423 (0.00094) [2022-07-10 02:13:25,691][26022] Updated weights on worker 0-0, policy_version 522433 (0.00085) [2022-07-10 02:13:26,153][25689] Fps is (10 sec: 5802.9, 60 sec: 5699.6, 300 sec: 5666.4). Total num frames: 534973440. Throughput: 0: 5975.8. Samples: 534976636. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:26,154][25689] Avg episode reward: [(0, '-29.505')] [2022-07-10 02:13:27,711][26022] Updated weights on worker 0-0, policy_version 522443 (0.00087) [2022-07-10 02:13:29,336][26022] Updated weights on worker 0-0, policy_version 522453 (0.00088) [2022-07-10 02:13:31,154][25689] Fps is (10 sec: 5720.9, 60 sec: 5682.9, 300 sec: 5660.7). Total num frames: 535001088. Throughput: 0: 5963.9. Samples: 535010530. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:31,155][25689] Avg episode reward: [(0, '-30.718')] [2022-07-10 02:13:31,284][26022] Updated weights on worker 0-0, policy_version 522463 (0.00094) [2022-07-10 02:13:33,150][26022] Updated weights on worker 0-0, policy_version 522473 (0.00619) [2022-07-10 02:13:34,907][26022] Updated weights on worker 0-0, policy_version 522483 (0.00086) [2022-07-10 02:13:36,230][25689] Fps is (10 sec: 5589.2, 60 sec: 5681.8, 300 sec: 5664.2). Total num frames: 535029760. Throughput: 0: 5940.3. Samples: 535027364. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:36,230][25689] Avg episode reward: [(0, '-31.739')] [2022-07-10 02:13:36,827][26022] Updated weights on worker 0-0, policy_version 522493 (0.00086) [2022-07-10 02:13:38,572][26022] Updated weights on worker 0-0, policy_version 522503 (0.00087) [2022-07-10 02:13:40,456][26022] Updated weights on worker 0-0, policy_version 522513 (0.00093) [2022-07-10 02:13:41,292][25689] Fps is (10 sec: 5555.6, 60 sec: 5661.5, 300 sec: 5659.7). Total num frames: 535057408. Throughput: 0: 5903.2. Samples: 535061332. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:41,292][25689] Avg episode reward: [(0, '-32.442')] [2022-07-10 02:13:42,125][26022] Updated weights on worker 0-0, policy_version 522523 (0.00088) [2022-07-10 02:13:44,097][26022] Updated weights on worker 0-0, policy_version 522533 (0.00087) [2022-07-10 02:13:45,800][26022] Updated weights on worker 0-0, policy_version 522543 (0.00090) [2022-07-10 02:13:46,345][25689] Fps is (10 sec: 5669.2, 60 sec: 5657.9, 300 sec: 5660.5). Total num frames: 535087104. Throughput: 0: 5884.6. Samples: 535095456. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:46,345][25689] Avg episode reward: [(0, '-31.923')] [2022-07-10 02:13:47,717][26022] Updated weights on worker 0-0, policy_version 522553 (0.00092) [2022-07-10 02:13:49,363][26022] Updated weights on worker 0-0, policy_version 522563 (0.00085) [2022-07-10 02:13:51,105][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:13:51,122][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000522572_535113728.pth [2022-07-10 02:13:51,123][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000520581_533074944.pth [2022-07-10 02:13:51,260][26022] Updated weights on worker 0-0, policy_version 522573 (0.00087) [2022-07-10 02:13:51,376][25689] Fps is (10 sec: 5686.3, 60 sec: 5656.1, 300 sec: 5661.2). Total num frames: 535114752. Throughput: 0: 5047.1. Samples: 535112594. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:51,377][25689] Avg episode reward: [(0, '-32.535')] [2022-07-10 02:13:52,951][26022] Updated weights on worker 0-0, policy_version 522583 (0.00086) [2022-07-10 02:13:54,899][26022] Updated weights on worker 0-0, policy_version 522593 (0.00085) [2022-07-10 02:13:56,443][25689] Fps is (10 sec: 5678.6, 60 sec: 5653.9, 300 sec: 5663.9). Total num frames: 535144448. Throughput: 0: 5893.4. Samples: 535146488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:13:56,443][25689] Avg episode reward: [(0, '-31.366')] [2022-07-10 02:13:56,504][26022] Updated weights on worker 0-0, policy_version 522603 (0.00090) [2022-07-10 02:13:58,312][26022] Updated weights on worker 0-0, policy_version 522613 (0.00110) [2022-07-10 02:14:00,112][26022] Updated weights on worker 0-0, policy_version 522623 (0.00091) [2022-07-10 02:14:01,456][25689] Fps is (10 sec: 5790.8, 60 sec: 5656.0, 300 sec: 5674.0). Total num frames: 535173120. Throughput: 0: 5922.5. Samples: 535180754. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:01,456][25689] Avg episode reward: [(0, '-30.960')] [2022-07-10 02:14:02,485][26022] Updated weights on worker 0-0, policy_version 522633 (0.00067) [2022-07-10 02:14:04,188][26022] Updated weights on worker 0-0, policy_version 522643 (0.00093) [2022-07-10 02:14:05,979][26022] Updated weights on worker 0-0, policy_version 522653 (0.00094) [2022-07-10 02:14:06,459][25689] Fps is (10 sec: 5418.2, 60 sec: 5644.4, 300 sec: 5660.6). Total num frames: 535198720. Throughput: 0: 4979.7. Samples: 535195622. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:06,462][25689] Avg episode reward: [(0, '-30.901')] [2022-07-10 02:14:07,845][26022] Updated weights on worker 0-0, policy_version 522663 (0.00092) [2022-07-10 02:14:09,780][26022] Updated weights on worker 0-0, policy_version 522673 (0.00090) [2022-07-10 02:14:11,427][26022] Updated weights on worker 0-0, policy_version 522683 (0.00084) [2022-07-10 02:14:11,463][25689] Fps is (10 sec: 5423.5, 60 sec: 5644.7, 300 sec: 5662.5). Total num frames: 535227392. Throughput: 0: 5809.0. Samples: 535229276. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:11,463][25689] Avg episode reward: [(0, '-30.770')] [2022-07-10 02:14:13,332][26022] Updated weights on worker 0-0, policy_version 522693 (0.00084) [2022-07-10 02:14:15,161][26022] Updated weights on worker 0-0, policy_version 522703 (0.00087) [2022-07-10 02:14:16,585][25689] Fps is (10 sec: 5561.9, 60 sec: 5620.2, 300 sec: 5654.8). Total num frames: 535255040. Throughput: 0: 5794.2. Samples: 535263198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:16,586][25689] Avg episode reward: [(0, '-32.450')] [2022-07-10 02:14:16,990][26022] Updated weights on worker 0-0, policy_version 522713 (0.00094) [2022-07-10 02:14:18,615][26022] Updated weights on worker 0-0, policy_version 522723 (0.00084) [2022-07-10 02:14:20,279][26022] Updated weights on worker 0-0, policy_version 522733 (0.00086) [2022-07-10 02:14:21,685][25689] Fps is (10 sec: 5609.7, 60 sec: 5631.2, 300 sec: 5663.8). Total num frames: 535284736. Throughput: 0: 4929.3. Samples: 535280470. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:21,686][25689] Avg episode reward: [(0, '-33.046')] [2022-07-10 02:14:22,225][26022] Updated weights on worker 0-0, policy_version 522743 (0.00090) [2022-07-10 02:14:24,247][26022] Updated weights on worker 0-0, policy_version 522753 (0.00096) [2022-07-10 02:14:25,763][26022] Updated weights on worker 0-0, policy_version 522763 (0.00103) [2022-07-10 02:14:26,694][25689] Fps is (10 sec: 5774.4, 60 sec: 5615.6, 300 sec: 5661.3). Total num frames: 535313408. Throughput: 0: 5882.4. Samples: 535314648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:26,694][25689] Avg episode reward: [(0, '-32.891')] [2022-07-10 02:14:27,876][26022] Updated weights on worker 0-0, policy_version 522773 (0.00087) [2022-07-10 02:14:29,395][26022] Updated weights on worker 0-0, policy_version 522783 (0.00088) [2022-07-10 02:14:31,426][26022] Updated weights on worker 0-0, policy_version 522793 (0.00087) [2022-07-10 02:14:31,728][25689] Fps is (10 sec: 5607.9, 60 sec: 5612.5, 300 sec: 5655.7). Total num frames: 535341056. Throughput: 0: 5879.2. Samples: 535348420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:31,728][25689] Avg episode reward: [(0, '-32.576')] [2022-07-10 02:14:33,033][26022] Updated weights on worker 0-0, policy_version 522803 (0.00083) [2022-07-10 02:14:34,827][26022] Updated weights on worker 0-0, policy_version 522813 (0.00089) [2022-07-10 02:14:36,800][25689] Fps is (10 sec: 5674.1, 60 sec: 5629.7, 300 sec: 5661.5). Total num frames: 535370752. Throughput: 0: 5060.3. Samples: 535365488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:36,804][26022] Updated weights on worker 0-0, policy_version 522823 (0.00307) [2022-07-10 02:14:36,805][25689] Avg episode reward: [(0, '-33.175')] [2022-07-10 02:14:38,587][26022] Updated weights on worker 0-0, policy_version 522833 (0.00085) [2022-07-10 02:14:40,253][26022] Updated weights on worker 0-0, policy_version 522843 (0.00081) [2022-07-10 02:14:41,822][25689] Fps is (10 sec: 5782.6, 60 sec: 5650.4, 300 sec: 5654.2). Total num frames: 535399424. Throughput: 0: 5933.0. Samples: 535399942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:41,822][25689] Avg episode reward: [(0, '-32.764')] [2022-07-10 02:14:42,124][26022] Updated weights on worker 0-0, policy_version 522853 (0.00087) [2022-07-10 02:14:43,732][26022] Updated weights on worker 0-0, policy_version 522863 (0.00097) [2022-07-10 02:14:45,748][26022] Updated weights on worker 0-0, policy_version 522873 (0.00082) [2022-07-10 02:14:46,875][25689] Fps is (10 sec: 5691.7, 60 sec: 5633.4, 300 sec: 5656.8). Total num frames: 535428096. Throughput: 0: 5918.9. Samples: 535434100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:46,875][25689] Avg episode reward: [(0, '-31.837')] [2022-07-10 02:14:47,376][26022] Updated weights on worker 0-0, policy_version 522883 (0.00089) [2022-07-10 02:14:49,290][26022] Updated weights on worker 0-0, policy_version 522893 (0.00093) [2022-07-10 02:14:51,083][26022] Updated weights on worker 0-0, policy_version 522903 (0.00090) [2022-07-10 02:14:51,940][25689] Fps is (10 sec: 5566.1, 60 sec: 5630.3, 300 sec: 5653.4). Total num frames: 535455744. Throughput: 0: 5081.0. Samples: 535451118. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:51,941][25689] Avg episode reward: [(0, '-31.412')] [2022-07-10 02:14:52,851][26022] Updated weights on worker 0-0, policy_version 522913 (0.00083) [2022-07-10 02:14:54,723][26022] Updated weights on worker 0-0, policy_version 522923 (0.00092) [2022-07-10 02:14:56,553][26022] Updated weights on worker 0-0, policy_version 522933 (0.00092) [2022-07-10 02:14:57,047][25689] Fps is (10 sec: 5637.2, 60 sec: 5626.5, 300 sec: 5651.6). Total num frames: 535485440. Throughput: 0: 5927.7. Samples: 535485508. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:14:57,048][25689] Avg episode reward: [(0, '-31.845')] [2022-07-10 02:14:58,306][26022] Updated weights on worker 0-0, policy_version 522943 (0.00082) [2022-07-10 02:15:00,167][26022] Updated weights on worker 0-0, policy_version 522953 (0.00089) [2022-07-10 02:15:02,066][25689] Fps is (10 sec: 5561.8, 60 sec: 5592.2, 300 sec: 5655.7). Total num frames: 535512064. Throughput: 0: 5918.1. Samples: 535519752. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:15:02,074][25689] Avg episode reward: [(0, '-31.906')] [2022-07-10 02:15:02,297][26022] Updated weights on worker 0-0, policy_version 522963 (0.00093) [2022-07-10 02:15:03,951][26022] Updated weights on worker 0-0, policy_version 522973 (0.00083) [2022-07-10 02:15:05,895][26022] Updated weights on worker 0-0, policy_version 522983 (0.00327) [2022-07-10 02:15:07,117][25689] Fps is (10 sec: 5592.8, 60 sec: 5655.3, 300 sec: 5661.9). Total num frames: 535541760. Throughput: 0: 4970.6. Samples: 535534718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:15:07,118][25689] Avg episode reward: [(0, '-31.304')] [2022-07-10 02:15:07,847][26022] Updated weights on worker 0-0, policy_version 522993 (0.00082) [2022-07-10 02:15:09,478][26022] Updated weights on worker 0-0, policy_version 523003 (0.00050) [2022-07-10 02:15:11,035][26022] Updated weights on worker 0-0, policy_version 523013 (0.00083) [2022-07-10 02:15:12,159][25689] Fps is (10 sec: 5884.4, 60 sec: 5668.6, 300 sec: 5658.4). Total num frames: 535571456. Throughput: 0: 5856.3. Samples: 535569528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 02:15:12,160][25689] Avg episode reward: [(0, '-30.963')] [2022-07-10 02:15:13,102][26022] Updated weights on worker 0-0, policy_version 523023 (0.00083) [2022-07-10 02:15:14,615][26022] Updated weights on worker 0-0, policy_version 523033 (0.00086) [2022-07-10 02:15:16,668][26022] Updated weights on worker 0-0, policy_version 523043 (0.00084) [2022-07-10 02:15:17,286][25689] Fps is (10 sec: 5740.0, 60 sec: 5685.1, 300 sec: 5663.6). Total num frames: 535600128. Throughput: 0: 5866.3. Samples: 535604234. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:17,287][25689] Avg episode reward: [(0, '-31.746')] [2022-07-10 02:15:18,242][26022] Updated weights on worker 0-0, policy_version 523053 (0.00369) [2022-07-10 02:15:20,219][26022] Updated weights on worker 0-0, policy_version 523063 (0.00096) [2022-07-10 02:15:22,019][26022] Updated weights on worker 0-0, policy_version 523073 (0.00081) [2022-07-10 02:15:22,321][25689] Fps is (10 sec: 5542.7, 60 sec: 5657.4, 300 sec: 5656.3). Total num frames: 535627776. Throughput: 0: 5010.0. Samples: 535621226. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:22,321][25689] Avg episode reward: [(0, '-31.556')] [2022-07-10 02:15:23,715][26022] Updated weights on worker 0-0, policy_version 523083 (0.00094) [2022-07-10 02:15:25,550][26022] Updated weights on worker 0-0, policy_version 523093 (0.00090) [2022-07-10 02:15:27,277][26022] Updated weights on worker 0-0, policy_version 523103 (0.00085) [2022-07-10 02:15:27,322][25689] Fps is (10 sec: 5713.7, 60 sec: 5674.9, 300 sec: 5659.8). Total num frames: 535657472. Throughput: 0: 5993.5. Samples: 535655816. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:27,323][25689] Avg episode reward: [(0, '-30.912')] [2022-07-10 02:15:29,090][26022] Updated weights on worker 0-0, policy_version 523113 (0.00098) [2022-07-10 02:15:30,915][26022] Updated weights on worker 0-0, policy_version 523123 (0.00083) [2022-07-10 02:15:32,343][25689] Fps is (10 sec: 5925.9, 60 sec: 5710.0, 300 sec: 5664.0). Total num frames: 535687168. Throughput: 0: 5975.4. Samples: 535690132. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:32,346][25689] Avg episode reward: [(0, '-30.031')] [2022-07-10 02:15:32,477][26022] Updated weights on worker 0-0, policy_version 523133 (0.00093) [2022-07-10 02:15:34,571][26022] Updated weights on worker 0-0, policy_version 523143 (0.00093) [2022-07-10 02:15:36,474][26022] Updated weights on worker 0-0, policy_version 523153 (0.00087) [2022-07-10 02:15:37,488][25689] Fps is (10 sec: 5539.9, 60 sec: 5652.5, 300 sec: 5651.5). Total num frames: 535713792. Throughput: 0: 5930.1. Samples: 535724036. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:37,489][25689] Avg episode reward: [(0, '-30.853')] [2022-07-10 02:15:38,024][26022] Updated weights on worker 0-0, policy_version 523163 (0.00086) [2022-07-10 02:15:39,861][26022] Updated weights on worker 0-0, policy_version 523173 (0.00085) [2022-07-10 02:15:41,705][26022] Updated weights on worker 0-0, policy_version 523183 (0.00082) [2022-07-10 02:15:42,531][25689] Fps is (10 sec: 5628.4, 60 sec: 5684.2, 300 sec: 5661.1). Total num frames: 535744512. Throughput: 0: 5940.8. Samples: 535741294. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:42,532][25689] Avg episode reward: [(0, '-30.551')] [2022-07-10 02:15:43,507][26022] Updated weights on worker 0-0, policy_version 523193 (0.00083) [2022-07-10 02:15:45,519][26022] Updated weights on worker 0-0, policy_version 523203 (0.00096) [2022-07-10 02:15:46,800][26022] Updated weights on worker 0-0, policy_version 523213 (0.00053) [2022-07-10 02:15:47,567][25689] Fps is (10 sec: 5893.1, 60 sec: 5685.9, 300 sec: 5657.1). Total num frames: 535773184. Throughput: 0: 5912.9. Samples: 535775520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:47,567][25689] Avg episode reward: [(0, '-30.797')] [2022-07-10 02:15:49,051][26022] Updated weights on worker 0-0, policy_version 523223 (0.00102) [2022-07-10 02:15:50,495][26022] Updated weights on worker 0-0, policy_version 523233 (0.00090) [2022-07-10 02:15:51,220][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:15:51,229][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000523235_535792640.pth [2022-07-10 02:15:51,230][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000521244_533753856.pth [2022-07-10 02:15:52,575][25689] Fps is (10 sec: 5505.4, 60 sec: 5674.3, 300 sec: 5654.8). Total num frames: 535799808. Throughput: 0: 5910.4. Samples: 535809712. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:52,578][25689] Avg episode reward: [(0, '-30.631')] [2022-07-10 02:15:52,600][26022] Updated weights on worker 0-0, policy_version 523243 (0.00092) [2022-07-10 02:15:54,422][26022] Updated weights on worker 0-0, policy_version 523253 (0.00084) [2022-07-10 02:15:56,025][26022] Updated weights on worker 0-0, policy_version 523263 (0.00097) [2022-07-10 02:15:57,683][25689] Fps is (10 sec: 5567.3, 60 sec: 5674.3, 300 sec: 5657.0). Total num frames: 535829504. Throughput: 0: 5088.0. Samples: 535826784. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:15:57,683][25689] Avg episode reward: [(0, '-30.014')] [2022-07-10 02:15:57,873][26022] Updated weights on worker 0-0, policy_version 523273 (0.00085) [2022-07-10 02:15:59,556][26022] Updated weights on worker 0-0, policy_version 523283 (0.00088) [2022-07-10 02:16:01,671][26022] Updated weights on worker 0-0, policy_version 523293 (0.00511) [2022-07-10 02:16:02,704][25689] Fps is (10 sec: 5661.3, 60 sec: 5691.0, 300 sec: 5656.7). Total num frames: 535857152. Throughput: 0: 5919.7. Samples: 535860712. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:02,705][25689] Avg episode reward: [(0, '-30.506')] [2022-07-10 02:16:03,807][26022] Updated weights on worker 0-0, policy_version 523303 (0.00080) [2022-07-10 02:16:05,228][26022] Updated weights on worker 0-0, policy_version 523313 (0.00099) [2022-07-10 02:16:07,470][26022] Updated weights on worker 0-0, policy_version 523323 (0.00096) [2022-07-10 02:16:07,707][25689] Fps is (10 sec: 5618.6, 60 sec: 5678.6, 300 sec: 5663.6). Total num frames: 535885824. Throughput: 0: 5824.5. Samples: 535892826. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:07,707][25689] Avg episode reward: [(0, '-30.649')] [2022-07-10 02:16:09,243][26022] Updated weights on worker 0-0, policy_version 523333 (0.00083) [2022-07-10 02:16:10,897][26022] Updated weights on worker 0-0, policy_version 523343 (0.00084) [2022-07-10 02:16:12,738][25689] Fps is (10 sec: 5511.3, 60 sec: 5629.0, 300 sec: 5651.6). Total num frames: 535912448. Throughput: 0: 4968.8. Samples: 535909894. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:12,738][25689] Avg episode reward: [(0, '-31.093')] [2022-07-10 02:16:12,776][26022] Updated weights on worker 0-0, policy_version 523353 (0.00096) [2022-07-10 02:16:14,540][26022] Updated weights on worker 0-0, policy_version 523363 (0.00096) [2022-07-10 02:16:16,107][26022] Updated weights on worker 0-0, policy_version 523373 (0.00097) [2022-07-10 02:16:17,888][25689] Fps is (10 sec: 5632.4, 60 sec: 5660.6, 300 sec: 5657.0). Total num frames: 535943168. Throughput: 0: 5823.1. Samples: 535944440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:17,888][25689] Avg episode reward: [(0, '-30.293')] [2022-07-10 02:16:18,046][26022] Updated weights on worker 0-0, policy_version 523383 (0.00092) [2022-07-10 02:16:19,825][26022] Updated weights on worker 0-0, policy_version 523393 (0.00088) [2022-07-10 02:16:21,635][26022] Updated weights on worker 0-0, policy_version 523403 (0.00085) [2022-07-10 02:16:22,920][25689] Fps is (10 sec: 5833.0, 60 sec: 5677.7, 300 sec: 5657.8). Total num frames: 535971840. Throughput: 0: 5849.1. Samples: 535978954. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:22,921][25689] Avg episode reward: [(0, '-30.873')] [2022-07-10 02:16:23,354][26022] Updated weights on worker 0-0, policy_version 523413 (0.00079) [2022-07-10 02:16:25,188][26022] Updated weights on worker 0-0, policy_version 523423 (0.00085) [2022-07-10 02:16:26,887][26022] Updated weights on worker 0-0, policy_version 523433 (0.00553) [2022-07-10 02:16:27,930][25689] Fps is (10 sec: 5710.1, 60 sec: 5660.0, 300 sec: 5662.2). Total num frames: 536000512. Throughput: 0: 5124.4. Samples: 535996460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:27,931][25689] Avg episode reward: [(0, '-30.781')] [2022-07-10 02:16:28,675][26022] Updated weights on worker 0-0, policy_version 523443 (0.00087) [2022-07-10 02:16:30,311][26022] Updated weights on worker 0-0, policy_version 523453 (0.00084) [2022-07-10 02:16:32,348][26022] Updated weights on worker 0-0, policy_version 523463 (0.00088) [2022-07-10 02:16:32,938][25689] Fps is (10 sec: 5826.3, 60 sec: 5661.2, 300 sec: 5664.8). Total num frames: 536030208. Throughput: 0: 6002.1. Samples: 536031138. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:32,939][25689] Avg episode reward: [(0, '-30.175')] [2022-07-10 02:16:34,011][26022] Updated weights on worker 0-0, policy_version 523473 (0.00084) [2022-07-10 02:16:36,083][26022] Updated weights on worker 0-0, policy_version 523483 (0.00085) [2022-07-10 02:16:37,485][26022] Updated weights on worker 0-0, policy_version 523493 (0.00085) [2022-07-10 02:16:38,054][25689] Fps is (10 sec: 5765.3, 60 sec: 5697.7, 300 sec: 5667.4). Total num frames: 536058880. Throughput: 0: 5996.3. Samples: 536065364. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:38,055][25689] Avg episode reward: [(0, '-30.207')] [2022-07-10 02:16:39,571][26022] Updated weights on worker 0-0, policy_version 523503 (0.00084) [2022-07-10 02:16:40,939][26022] Updated weights on worker 0-0, policy_version 523513 (0.00090) [2022-07-10 02:16:43,089][25689] Fps is (10 sec: 5548.0, 60 sec: 5647.8, 300 sec: 5660.1). Total num frames: 536086528. Throughput: 0: 5134.3. Samples: 536082510. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:43,090][25689] Avg episode reward: [(0, '-30.407')] [2022-07-10 02:16:43,161][26022] Updated weights on worker 0-0, policy_version 523523 (0.00086) [2022-07-10 02:16:44,621][26022] Updated weights on worker 0-0, policy_version 523533 (0.00094) [2022-07-10 02:16:46,549][26022] Updated weights on worker 0-0, policy_version 523543 (0.00087) [2022-07-10 02:16:48,161][25689] Fps is (10 sec: 5775.1, 60 sec: 5678.1, 300 sec: 5665.9). Total num frames: 536117248. Throughput: 0: 5962.4. Samples: 536117084. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:48,163][25689] Avg episode reward: [(0, '-29.778')] [2022-07-10 02:16:48,263][26022] Updated weights on worker 0-0, policy_version 523553 (0.00086) [2022-07-10 02:16:50,074][26022] Updated weights on worker 0-0, policy_version 523563 (0.00428) [2022-07-10 02:16:51,887][26022] Updated weights on worker 0-0, policy_version 523573 (0.00085) [2022-07-10 02:16:53,241][25689] Fps is (10 sec: 5850.3, 60 sec: 5705.2, 300 sec: 5665.6). Total num frames: 536145920. Throughput: 0: 5926.1. Samples: 536151458. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:53,242][25689] Avg episode reward: [(0, '-30.422')] [2022-07-10 02:16:53,755][26022] Updated weights on worker 0-0, policy_version 523583 (0.00095) [2022-07-10 02:16:55,481][26022] Updated weights on worker 0-0, policy_version 523593 (0.00084) [2022-07-10 02:16:57,283][26022] Updated weights on worker 0-0, policy_version 523603 (0.00087) [2022-07-10 02:16:58,316][25689] Fps is (10 sec: 5647.0, 60 sec: 5691.4, 300 sec: 5665.0). Total num frames: 536174592. Throughput: 0: 5084.3. Samples: 536168380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:16:58,317][25689] Avg episode reward: [(0, '-30.208')] [2022-07-10 02:16:59,163][26022] Updated weights on worker 0-0, policy_version 523613 (0.00093) [2022-07-10 02:17:00,772][26022] Updated weights on worker 0-0, policy_version 523623 (0.00106) [2022-07-10 02:17:03,271][26022] Updated weights on worker 0-0, policy_version 523633 (0.01299) [2022-07-10 02:17:03,356][25689] Fps is (10 sec: 5365.5, 60 sec: 5655.9, 300 sec: 5657.4). Total num frames: 536200192. Throughput: 0: 5868.2. Samples: 536201440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:03,357][25689] Avg episode reward: [(0, '-29.407')] [2022-07-10 02:17:04,884][26022] Updated weights on worker 0-0, policy_version 523643 (0.00087) [2022-07-10 02:17:06,680][26022] Updated weights on worker 0-0, policy_version 523653 (0.00085) [2022-07-10 02:17:08,378][25689] Fps is (10 sec: 5495.6, 60 sec: 5671.0, 300 sec: 5671.3). Total num frames: 536229888. Throughput: 0: 5813.3. Samples: 536234610. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:08,378][25689] Avg episode reward: [(0, '-29.604')] [2022-07-10 02:17:08,552][26022] Updated weights on worker 0-0, policy_version 523663 (0.00085) [2022-07-10 02:17:10,272][26022] Updated weights on worker 0-0, policy_version 523673 (0.00093) [2022-07-10 02:17:12,237][26022] Updated weights on worker 0-0, policy_version 523683 (0.00086) [2022-07-10 02:17:13,382][25689] Fps is (10 sec: 5821.9, 60 sec: 5707.3, 300 sec: 5666.1). Total num frames: 536258560. Throughput: 0: 4977.2. Samples: 536251702. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:13,382][25689] Avg episode reward: [(0, '-29.662')] [2022-07-10 02:17:13,939][26022] Updated weights on worker 0-0, policy_version 523693 (0.01088) [2022-07-10 02:17:15,635][26022] Updated weights on worker 0-0, policy_version 523703 (0.00091) [2022-07-10 02:17:17,457][26022] Updated weights on worker 0-0, policy_version 523713 (0.00079) [2022-07-10 02:17:18,425][25689] Fps is (10 sec: 5707.8, 60 sec: 5683.6, 300 sec: 5662.3). Total num frames: 536287232. Throughput: 0: 5851.7. Samples: 536286048. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:18,425][25689] Avg episode reward: [(0, '-28.847')] [2022-07-10 02:17:19,263][26022] Updated weights on worker 0-0, policy_version 523723 (0.00087) [2022-07-10 02:17:20,991][26022] Updated weights on worker 0-0, policy_version 523733 (0.00082) [2022-07-10 02:17:22,791][26022] Updated weights on worker 0-0, policy_version 523743 (0.00086) [2022-07-10 02:17:23,479][25689] Fps is (10 sec: 5679.2, 60 sec: 5681.5, 300 sec: 5665.3). Total num frames: 536315904. Throughput: 0: 5904.6. Samples: 536320258. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:23,479][25689] Avg episode reward: [(0, '-28.818')] [2022-07-10 02:17:24,625][26022] Updated weights on worker 0-0, policy_version 523753 (0.00092) [2022-07-10 02:17:26,506][26022] Updated weights on worker 0-0, policy_version 523763 (0.00083) [2022-07-10 02:17:28,076][26022] Updated weights on worker 0-0, policy_version 523773 (0.00090) [2022-07-10 02:17:28,543][25689] Fps is (10 sec: 5667.5, 60 sec: 5676.5, 300 sec: 5664.2). Total num frames: 536344576. Throughput: 0: 5100.3. Samples: 536337456. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:28,543][25689] Avg episode reward: [(0, '-29.508')] [2022-07-10 02:17:30,126][26022] Updated weights on worker 0-0, policy_version 523783 (0.00089) [2022-07-10 02:17:31,690][26022] Updated weights on worker 0-0, policy_version 523793 (0.00095) [2022-07-10 02:17:33,555][25689] Fps is (10 sec: 5691.1, 60 sec: 5659.1, 300 sec: 5665.2). Total num frames: 536373248. Throughput: 0: 5952.8. Samples: 536371788. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:33,555][25689] Avg episode reward: [(0, '-31.054')] [2022-07-10 02:17:33,655][26022] Updated weights on worker 0-0, policy_version 523803 (0.00098) [2022-07-10 02:17:35,558][26022] Updated weights on worker 0-0, policy_version 523813 (0.00089) [2022-07-10 02:17:37,268][26022] Updated weights on worker 0-0, policy_version 523823 (0.00089) [2022-07-10 02:17:38,605][25689] Fps is (10 sec: 5800.7, 60 sec: 5682.3, 300 sec: 5668.2). Total num frames: 536402944. Throughput: 0: 5944.7. Samples: 536406014. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:38,605][25689] Avg episode reward: [(0, '-30.744')] [2022-07-10 02:17:38,931][26022] Updated weights on worker 0-0, policy_version 523833 (0.00089) [2022-07-10 02:17:40,821][26022] Updated weights on worker 0-0, policy_version 523843 (0.00084) [2022-07-10 02:17:42,542][26022] Updated weights on worker 0-0, policy_version 523853 (0.00085) [2022-07-10 02:17:43,667][25689] Fps is (10 sec: 5772.3, 60 sec: 5696.7, 300 sec: 5663.8). Total num frames: 536431616. Throughput: 0: 5964.2. Samples: 536440662. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:43,668][25689] Avg episode reward: [(0, '-31.760')] [2022-07-10 02:17:44,453][26022] Updated weights on worker 0-0, policy_version 523863 (0.00097) [2022-07-10 02:17:45,978][26022] Updated weights on worker 0-0, policy_version 523873 (0.00091) [2022-07-10 02:17:47,970][26022] Updated weights on worker 0-0, policy_version 523883 (0.00089) [2022-07-10 02:17:48,737][25689] Fps is (10 sec: 5659.4, 60 sec: 5663.0, 300 sec: 5666.2). Total num frames: 536460288. Throughput: 0: 5967.7. Samples: 536457972. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:48,739][25689] Avg episode reward: [(0, '-32.095')] [2022-07-10 02:17:49,689][26022] Updated weights on worker 0-0, policy_version 523893 (0.00083) [2022-07-10 02:17:51,383][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:17:51,394][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000523902_536475648.pth [2022-07-10 02:17:51,394][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000521907_534432768.pth [2022-07-10 02:17:51,686][26022] Updated weights on worker 0-0, policy_version 523903 (0.00091) [2022-07-10 02:17:53,222][26022] Updated weights on worker 0-0, policy_version 523913 (0.00088) [2022-07-10 02:17:53,758][25689] Fps is (10 sec: 5783.9, 60 sec: 5685.4, 300 sec: 5666.6). Total num frames: 536489984. Throughput: 0: 5970.7. Samples: 536492416. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:53,759][25689] Avg episode reward: [(0, '-31.517')] [2022-07-10 02:17:55,200][26022] Updated weights on worker 0-0, policy_version 523923 (0.00093) [2022-07-10 02:17:56,786][26022] Updated weights on worker 0-0, policy_version 523933 (0.00085) [2022-07-10 02:17:58,643][26022] Updated weights on worker 0-0, policy_version 523943 (0.00091) [2022-07-10 02:17:58,815][25689] Fps is (10 sec: 5791.8, 60 sec: 5687.1, 300 sec: 5666.2). Total num frames: 536518656. Throughput: 0: 5986.8. Samples: 536527008. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:17:58,817][25689] Avg episode reward: [(0, '-30.951')] [2022-07-10 02:18:00,292][26022] Updated weights on worker 0-0, policy_version 523953 (0.00101) [2022-07-10 02:18:02,500][26022] Updated weights on worker 0-0, policy_version 523963 (0.00101) [2022-07-10 02:18:03,910][25689] Fps is (10 sec: 5547.6, 60 sec: 5715.7, 300 sec: 5669.0). Total num frames: 536546304. Throughput: 0: 5108.3. Samples: 536544072. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:18:03,912][25689] Avg episode reward: [(0, '-30.654')] [2022-07-10 02:18:04,383][26022] Updated weights on worker 0-0, policy_version 523973 (0.00084) [2022-07-10 02:18:06,241][26022] Updated weights on worker 0-0, policy_version 523983 (0.00093) [2022-07-10 02:18:07,919][26022] Updated weights on worker 0-0, policy_version 523993 (0.00090) [2022-07-10 02:18:08,914][25689] Fps is (10 sec: 5374.0, 60 sec: 5666.7, 300 sec: 5662.1). Total num frames: 536572928. Throughput: 0: 5845.8. Samples: 536575922. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 02:18:08,915][25689] Avg episode reward: [(0, '-30.103')] [2022-07-10 02:18:09,839][26022] Updated weights on worker 0-0, policy_version 524003 (0.00092) [2022-07-10 02:18:11,583][26022] Updated weights on worker 0-0, policy_version 524013 (0.00094) [2022-07-10 02:18:13,628][26022] Updated weights on worker 0-0, policy_version 524023 (0.00093) [2022-07-10 02:18:13,952][25689] Fps is (10 sec: 5506.4, 60 sec: 5663.5, 300 sec: 5662.2). Total num frames: 536601600. Throughput: 0: 5821.6. Samples: 536609980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:13,954][25689] Avg episode reward: [(0, '-30.129')] [2022-07-10 02:18:15,108][26022] Updated weights on worker 0-0, policy_version 524033 (0.00086) [2022-07-10 02:18:17,054][26022] Updated weights on worker 0-0, policy_version 524043 (0.00083) [2022-07-10 02:18:18,851][26022] Updated weights on worker 0-0, policy_version 524053 (0.00103) [2022-07-10 02:18:19,023][25689] Fps is (10 sec: 5774.0, 60 sec: 5677.8, 300 sec: 5665.0). Total num frames: 536631296. Throughput: 0: 4965.3. Samples: 536627346. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:19,023][25689] Avg episode reward: [(0, '-30.521')] [2022-07-10 02:18:20,526][26022] Updated weights on worker 0-0, policy_version 524063 (0.00097) [2022-07-10 02:18:22,406][26022] Updated weights on worker 0-0, policy_version 524073 (0.00080) [2022-07-10 02:18:24,030][25689] Fps is (10 sec: 5791.8, 60 sec: 5682.2, 300 sec: 5661.8). Total num frames: 536659968. Throughput: 0: 5849.0. Samples: 536661754. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:24,031][25689] Avg episode reward: [(0, '-29.758')] [2022-07-10 02:18:24,176][26022] Updated weights on worker 0-0, policy_version 524083 (0.00090) [2022-07-10 02:18:26,070][26022] Updated weights on worker 0-0, policy_version 524093 (0.00103) [2022-07-10 02:18:27,885][26022] Updated weights on worker 0-0, policy_version 524103 (0.00089) [2022-07-10 02:18:29,058][25689] Fps is (10 sec: 5714.4, 60 sec: 5685.6, 300 sec: 5664.8). Total num frames: 536688640. Throughput: 0: 5963.4. Samples: 536696048. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:29,058][25689] Avg episode reward: [(0, '-29.317')] [2022-07-10 02:18:29,610][26022] Updated weights on worker 0-0, policy_version 524113 (0.00090) [2022-07-10 02:18:31,323][26022] Updated weights on worker 0-0, policy_version 524123 (0.00085) [2022-07-10 02:18:33,195][26022] Updated weights on worker 0-0, policy_version 524133 (0.00086) [2022-07-10 02:18:34,083][25689] Fps is (10 sec: 5704.4, 60 sec: 5684.4, 300 sec: 5665.7). Total num frames: 536717312. Throughput: 0: 5123.4. Samples: 536713116. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:34,083][25689] Avg episode reward: [(0, '-30.709')] [2022-07-10 02:18:34,854][26022] Updated weights on worker 0-0, policy_version 524143 (0.00097) [2022-07-10 02:18:36,901][26022] Updated weights on worker 0-0, policy_version 524153 (0.00085) [2022-07-10 02:18:38,557][26022] Updated weights on worker 0-0, policy_version 524163 (0.00091) [2022-07-10 02:18:39,137][25689] Fps is (10 sec: 5689.0, 60 sec: 5667.0, 300 sec: 5669.3). Total num frames: 536745984. Throughput: 0: 5965.4. Samples: 536747338. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:39,138][25689] Avg episode reward: [(0, '-32.147')] [2022-07-10 02:18:40,387][26022] Updated weights on worker 0-0, policy_version 524173 (0.00089) [2022-07-10 02:18:42,200][26022] Updated weights on worker 0-0, policy_version 524183 (0.00055) [2022-07-10 02:18:43,665][26022] Updated weights on worker 0-0, policy_version 524193 (0.00086) [2022-07-10 02:18:44,155][25689] Fps is (10 sec: 5693.4, 60 sec: 5671.2, 300 sec: 5666.5). Total num frames: 536774656. Throughput: 0: 5964.1. Samples: 536781780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:44,155][25689] Avg episode reward: [(0, '-31.893')] [2022-07-10 02:18:45,900][26022] Updated weights on worker 0-0, policy_version 524203 (0.00085) [2022-07-10 02:18:47,367][26022] Updated weights on worker 0-0, policy_version 524213 (0.00084) [2022-07-10 02:18:49,142][26022] Updated weights on worker 0-0, policy_version 524223 (0.00081) [2022-07-10 02:18:49,185][25689] Fps is (10 sec: 5808.9, 60 sec: 5691.9, 300 sec: 5673.4). Total num frames: 536804352. Throughput: 0: 5117.1. Samples: 536799044. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:49,186][25689] Avg episode reward: [(0, '-31.638')] [2022-07-10 02:18:51,118][26022] Updated weights on worker 0-0, policy_version 524233 (0.00094) [2022-07-10 02:18:52,782][26022] Updated weights on worker 0-0, policy_version 524243 (0.00083) [2022-07-10 02:18:54,207][25689] Fps is (10 sec: 5704.7, 60 sec: 5658.0, 300 sec: 5667.4). Total num frames: 536832000. Throughput: 0: 5976.5. Samples: 536833388. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:54,207][25689] Avg episode reward: [(0, '-31.478')] [2022-07-10 02:18:54,648][26022] Updated weights on worker 0-0, policy_version 524253 (0.00096) [2022-07-10 02:18:56,544][26022] Updated weights on worker 0-0, policy_version 524263 (0.00089) [2022-07-10 02:18:57,994][26022] Updated weights on worker 0-0, policy_version 524273 (0.00079) [2022-07-10 02:18:59,296][25689] Fps is (10 sec: 5469.2, 60 sec: 5638.0, 300 sec: 5662.5). Total num frames: 536859648. Throughput: 0: 5982.7. Samples: 536867940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:18:59,296][25689] Avg episode reward: [(0, '-31.294')] [2022-07-10 02:18:59,975][26022] Updated weights on worker 0-0, policy_version 524283 (0.00091) [2022-07-10 02:19:01,794][26022] Updated weights on worker 0-0, policy_version 524293 (0.00087) [2022-07-10 02:19:03,804][26022] Updated weights on worker 0-0, policy_version 524303 (0.00091) [2022-07-10 02:19:04,315][25689] Fps is (10 sec: 5673.1, 60 sec: 5679.0, 300 sec: 5676.0). Total num frames: 536889344. Throughput: 0: 5127.5. Samples: 536885150. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:04,315][25689] Avg episode reward: [(0, '-30.616')] [2022-07-10 02:19:05,880][26022] Updated weights on worker 0-0, policy_version 524313 (0.00089) [2022-07-10 02:19:07,458][26022] Updated weights on worker 0-0, policy_version 524323 (0.00087) [2022-07-10 02:19:09,347][25689] Fps is (10 sec: 5603.5, 60 sec: 5676.4, 300 sec: 5668.6). Total num frames: 536915968. Throughput: 0: 5864.2. Samples: 536917274. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:09,347][25689] Avg episode reward: [(0, '-29.395')] [2022-07-10 02:19:09,502][26022] Updated weights on worker 0-0, policy_version 524333 (0.00085) [2022-07-10 02:19:11,247][26022] Updated weights on worker 0-0, policy_version 524343 (0.00089) [2022-07-10 02:19:12,945][26022] Updated weights on worker 0-0, policy_version 524353 (0.00092) [2022-07-10 02:19:14,358][25689] Fps is (10 sec: 5505.8, 60 sec: 5678.9, 300 sec: 5674.1). Total num frames: 536944640. Throughput: 0: 5863.5. Samples: 536951544. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:14,358][25689] Avg episode reward: [(0, '-29.146')] [2022-07-10 02:19:14,772][26022] Updated weights on worker 0-0, policy_version 524363 (0.00085) [2022-07-10 02:19:16,385][26022] Updated weights on worker 0-0, policy_version 524373 (0.00093) [2022-07-10 02:19:18,320][26022] Updated weights on worker 0-0, policy_version 524383 (0.00085) [2022-07-10 02:19:19,488][25689] Fps is (10 sec: 5755.6, 60 sec: 5673.4, 300 sec: 5673.6). Total num frames: 536974336. Throughput: 0: 4997.5. Samples: 536968848. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:19,488][25689] Avg episode reward: [(0, '-30.304')] [2022-07-10 02:19:20,161][26022] Updated weights on worker 0-0, policy_version 524393 (0.00081) [2022-07-10 02:19:21,812][26022] Updated weights on worker 0-0, policy_version 524403 (0.00087) [2022-07-10 02:19:23,868][26022] Updated weights on worker 0-0, policy_version 524413 (0.00088) [2022-07-10 02:19:24,492][25689] Fps is (10 sec: 5759.3, 60 sec: 5673.6, 300 sec: 5673.6). Total num frames: 537003008. Throughput: 0: 5836.0. Samples: 537002908. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:24,493][25689] Avg episode reward: [(0, '-30.154')] [2022-07-10 02:19:25,351][26022] Updated weights on worker 0-0, policy_version 524423 (0.00088) [2022-07-10 02:19:27,343][26022] Updated weights on worker 0-0, policy_version 524433 (0.00090) [2022-07-10 02:19:29,016][26022] Updated weights on worker 0-0, policy_version 524443 (0.00099) [2022-07-10 02:19:29,558][25689] Fps is (10 sec: 5694.4, 60 sec: 5670.0, 300 sec: 5676.5). Total num frames: 537031680. Throughput: 0: 5942.1. Samples: 537037372. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:29,559][25689] Avg episode reward: [(0, '-30.131')] [2022-07-10 02:19:31,030][26022] Updated weights on worker 0-0, policy_version 524453 (0.00098) [2022-07-10 02:19:32,668][26022] Updated weights on worker 0-0, policy_version 524463 (0.00085) [2022-07-10 02:19:34,356][26022] Updated weights on worker 0-0, policy_version 524473 (0.00086) [2022-07-10 02:19:34,587][25689] Fps is (10 sec: 5782.2, 60 sec: 5686.6, 300 sec: 5677.3). Total num frames: 537061376. Throughput: 0: 5086.5. Samples: 537054438. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:34,588][25689] Avg episode reward: [(0, '-29.288')] [2022-07-10 02:19:36,354][26022] Updated weights on worker 0-0, policy_version 524483 (0.00084) [2022-07-10 02:19:37,927][26022] Updated weights on worker 0-0, policy_version 524493 (0.00052) [2022-07-10 02:19:39,643][25689] Fps is (10 sec: 5584.5, 60 sec: 5652.6, 300 sec: 5669.8). Total num frames: 537088000. Throughput: 0: 5940.4. Samples: 537088578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:39,644][25689] Avg episode reward: [(0, '-29.783')] [2022-07-10 02:19:40,059][26022] Updated weights on worker 0-0, policy_version 524503 (0.00084) [2022-07-10 02:19:41,298][26022] Updated weights on worker 0-0, policy_version 524513 (0.00087) [2022-07-10 02:19:43,459][26022] Updated weights on worker 0-0, policy_version 524523 (0.00086) [2022-07-10 02:19:44,653][25689] Fps is (10 sec: 5696.9, 60 sec: 5687.2, 300 sec: 5677.5). Total num frames: 537118720. Throughput: 0: 5973.3. Samples: 537123332. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:44,653][25689] Avg episode reward: [(0, '-28.872')] [2022-07-10 02:19:45,032][26022] Updated weights on worker 0-0, policy_version 524533 (0.00089) [2022-07-10 02:19:46,881][26022] Updated weights on worker 0-0, policy_version 524543 (0.00082) [2022-07-10 02:19:48,862][26022] Updated weights on worker 0-0, policy_version 524553 (0.00090) [2022-07-10 02:19:49,700][25689] Fps is (10 sec: 6007.7, 60 sec: 5685.7, 300 sec: 5684.7). Total num frames: 537148416. Throughput: 0: 5125.3. Samples: 537140604. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:49,700][25689] Avg episode reward: [(0, '-30.091')] [2022-07-10 02:19:50,501][26022] Updated weights on worker 0-0, policy_version 524563 (0.00423) [2022-07-10 02:19:51,494][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:19:51,503][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000524568_537157632.pth [2022-07-10 02:19:51,503][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000522572_535113728.pth [2022-07-10 02:19:52,344][26022] Updated weights on worker 0-0, policy_version 524573 (0.00080) [2022-07-10 02:19:54,235][26022] Updated weights on worker 0-0, policy_version 524583 (0.00095) [2022-07-10 02:19:54,718][25689] Fps is (10 sec: 5697.0, 60 sec: 5685.9, 300 sec: 5679.5). Total num frames: 537176064. Throughput: 0: 5987.8. Samples: 537174980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:54,719][25689] Avg episode reward: [(0, '-30.432')] [2022-07-10 02:19:55,812][26022] Updated weights on worker 0-0, policy_version 524593 (0.00079) [2022-07-10 02:19:57,749][26022] Updated weights on worker 0-0, policy_version 524603 (0.00093) [2022-07-10 02:19:59,719][26022] Updated weights on worker 0-0, policy_version 524613 (0.00086) [2022-07-10 02:19:59,858][25689] Fps is (10 sec: 5543.9, 60 sec: 5698.0, 300 sec: 5684.1). Total num frames: 537204736. Throughput: 0: 5968.4. Samples: 537209232. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:19:59,859][25689] Avg episode reward: [(0, '-31.953')] [2022-07-10 02:20:01,110][26022] Updated weights on worker 0-0, policy_version 524623 (0.00094) [2022-07-10 02:20:03,587][26022] Updated weights on worker 0-0, policy_version 524633 (0.00094) [2022-07-10 02:20:04,886][25689] Fps is (10 sec: 5539.2, 60 sec: 5663.4, 300 sec: 5677.6). Total num frames: 537232384. Throughput: 0: 5853.5. Samples: 537241766. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:04,886][25689] Avg episode reward: [(0, '-33.179')] [2022-07-10 02:20:05,146][26022] Updated weights on worker 0-0, policy_version 524643 (0.00091) [2022-07-10 02:20:07,055][26022] Updated weights on worker 0-0, policy_version 524653 (0.00083) [2022-07-10 02:20:08,919][26022] Updated weights on worker 0-0, policy_version 524663 (0.00052) [2022-07-10 02:20:09,957][25689] Fps is (10 sec: 5577.2, 60 sec: 5693.6, 300 sec: 5673.7). Total num frames: 537261056. Throughput: 0: 5829.7. Samples: 537258696. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:09,957][25689] Avg episode reward: [(0, '-32.572')] [2022-07-10 02:20:10,420][26022] Updated weights on worker 0-0, policy_version 524673 (0.00101) [2022-07-10 02:20:12,390][26022] Updated weights on worker 0-0, policy_version 524683 (0.00087) [2022-07-10 02:20:14,286][26022] Updated weights on worker 0-0, policy_version 524693 (0.00088) [2022-07-10 02:20:14,969][25689] Fps is (10 sec: 5687.0, 60 sec: 5693.5, 300 sec: 5675.8). Total num frames: 537289728. Throughput: 0: 5849.9. Samples: 537293446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:14,971][25689] Avg episode reward: [(0, '-31.929')] [2022-07-10 02:20:15,792][26022] Updated weights on worker 0-0, policy_version 524703 (0.01321) [2022-07-10 02:20:17,832][26022] Updated weights on worker 0-0, policy_version 524713 (0.00089) [2022-07-10 02:20:19,300][26022] Updated weights on worker 0-0, policy_version 524723 (0.00090) [2022-07-10 02:20:20,036][25689] Fps is (10 sec: 5790.7, 60 sec: 5699.4, 300 sec: 5682.1). Total num frames: 537319424. Throughput: 0: 5901.0. Samples: 537328302. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:20,037][25689] Avg episode reward: [(0, '-31.263')] [2022-07-10 02:20:21,189][26022] Updated weights on worker 0-0, policy_version 524733 (0.00085) [2022-07-10 02:20:23,126][26022] Updated weights on worker 0-0, policy_version 524743 (0.00094) [2022-07-10 02:20:24,826][26022] Updated weights on worker 0-0, policy_version 524753 (0.00082) [2022-07-10 02:20:25,061][25689] Fps is (10 sec: 5784.0, 60 sec: 5697.5, 300 sec: 5678.2). Total num frames: 537348096. Throughput: 0: 5125.8. Samples: 537345178. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:25,062][25689] Avg episode reward: [(0, '-29.459')] [2022-07-10 02:20:26,671][26022] Updated weights on worker 0-0, policy_version 524763 (0.00090) [2022-07-10 02:20:28,582][26022] Updated weights on worker 0-0, policy_version 524773 (0.00088) [2022-07-10 02:20:30,103][25689] Fps is (10 sec: 5594.9, 60 sec: 5682.8, 300 sec: 5670.9). Total num frames: 537375744. Throughput: 0: 5973.7. Samples: 537379042. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:30,104][25689] Avg episode reward: [(0, '-28.336')] [2022-07-10 02:20:30,279][26022] Updated weights on worker 0-0, policy_version 524783 (0.00087) [2022-07-10 02:20:32,288][26022] Updated weights on worker 0-0, policy_version 524793 (0.00086) [2022-07-10 02:20:33,865][26022] Updated weights on worker 0-0, policy_version 524803 (0.00089) [2022-07-10 02:20:35,113][25689] Fps is (10 sec: 5500.5, 60 sec: 5650.7, 300 sec: 5676.9). Total num frames: 537403392. Throughput: 0: 5948.7. Samples: 537413278. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:35,115][25689] Avg episode reward: [(0, '-27.359')] [2022-07-10 02:20:35,823][26022] Updated weights on worker 0-0, policy_version 524813 (0.00086) [2022-07-10 02:20:37,572][26022] Updated weights on worker 0-0, policy_version 524823 (0.00088) [2022-07-10 02:20:39,191][26022] Updated weights on worker 0-0, policy_version 524833 (0.00092) [2022-07-10 02:20:40,214][25689] Fps is (10 sec: 5671.1, 60 sec: 5697.2, 300 sec: 5672.4). Total num frames: 537433088. Throughput: 0: 5051.8. Samples: 537430238. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:40,215][25689] Avg episode reward: [(0, '-26.995')] [2022-07-10 02:20:41,393][26022] Updated weights on worker 0-0, policy_version 524843 (0.00086) [2022-07-10 02:20:42,770][26022] Updated weights on worker 0-0, policy_version 524853 (0.00088) [2022-07-10 02:20:44,801][26022] Updated weights on worker 0-0, policy_version 524863 (0.00084) [2022-07-10 02:20:45,222][25689] Fps is (10 sec: 5875.7, 60 sec: 5680.5, 300 sec: 5676.4). Total num frames: 537462784. Throughput: 0: 5908.4. Samples: 537464298. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:45,222][25689] Avg episode reward: [(0, '-27.131')] [2022-07-10 02:20:46,584][26022] Updated weights on worker 0-0, policy_version 524873 (0.00100) [2022-07-10 02:20:48,158][26022] Updated weights on worker 0-0, policy_version 524883 (0.00085) [2022-07-10 02:20:50,269][25689] Fps is (10 sec: 5703.1, 60 sec: 5646.6, 300 sec: 5679.1). Total num frames: 537490432. Throughput: 0: 5929.8. Samples: 537498626. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:50,270][25689] Avg episode reward: [(0, '-27.882')] [2022-07-10 02:20:50,273][26022] Updated weights on worker 0-0, policy_version 524893 (0.00088) [2022-07-10 02:20:51,868][26022] Updated weights on worker 0-0, policy_version 524903 (0.00105) [2022-07-10 02:20:53,706][26022] Updated weights on worker 0-0, policy_version 524913 (0.00084) [2022-07-10 02:20:55,343][25689] Fps is (10 sec: 5665.6, 60 sec: 5675.3, 300 sec: 5679.7). Total num frames: 537520128. Throughput: 0: 5053.5. Samples: 537515508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:20:55,343][25689] Avg episode reward: [(0, '-28.817')] [2022-07-10 02:20:55,698][26022] Updated weights on worker 0-0, policy_version 524923 (0.00091) [2022-07-10 02:20:57,407][26022] Updated weights on worker 0-0, policy_version 524933 (0.00085) [2022-07-10 02:20:59,358][26022] Updated weights on worker 0-0, policy_version 524943 (0.00104) [2022-07-10 02:21:00,396][25689] Fps is (10 sec: 5763.9, 60 sec: 5683.5, 300 sec: 5682.6). Total num frames: 537548800. Throughput: 0: 5925.6. Samples: 537549826. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:21:00,396][25689] Avg episode reward: [(0, '-28.509')] [2022-07-10 02:21:00,745][26022] Updated weights on worker 0-0, policy_version 524953 (0.00088) [2022-07-10 02:21:03,290][26022] Updated weights on worker 0-0, policy_version 524963 (0.00623) [2022-07-10 02:21:04,607][26022] Updated weights on worker 0-0, policy_version 524973 (0.00080) [2022-07-10 02:21:05,455][25689] Fps is (10 sec: 5468.2, 60 sec: 5663.6, 300 sec: 5674.6). Total num frames: 537575424. Throughput: 0: 5820.3. Samples: 537582064. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 02:21:05,456][25689] Avg episode reward: [(0, '-28.859')] [2022-07-10 02:21:06,906][26022] Updated weights on worker 0-0, policy_version 524983 (0.00089) [2022-07-10 02:21:08,400][26022] Updated weights on worker 0-0, policy_version 524993 (0.00093) [2022-07-10 02:21:10,407][26022] Updated weights on worker 0-0, policy_version 525003 (0.00083) [2022-07-10 02:21:10,503][25689] Fps is (10 sec: 5369.9, 60 sec: 5648.9, 300 sec: 5677.7). Total num frames: 537603072. Throughput: 0: 4955.2. Samples: 537598886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:10,503][25689] Avg episode reward: [(0, '-29.191')] [2022-07-10 02:21:11,900][26022] Updated weights on worker 0-0, policy_version 525013 (0.00089) [2022-07-10 02:21:14,093][26022] Updated weights on worker 0-0, policy_version 525023 (0.00087) [2022-07-10 02:21:15,573][25689] Fps is (10 sec: 5566.6, 60 sec: 5643.5, 300 sec: 5672.4). Total num frames: 537631744. Throughput: 0: 5808.2. Samples: 537633006. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:15,573][25689] Avg episode reward: [(0, '-27.960')] [2022-07-10 02:21:15,791][26022] Updated weights on worker 0-0, policy_version 525033 (0.00084) [2022-07-10 02:21:17,565][26022] Updated weights on worker 0-0, policy_version 525043 (0.00086) [2022-07-10 02:21:19,106][26022] Updated weights on worker 0-0, policy_version 525053 (0.00093) [2022-07-10 02:21:20,655][25689] Fps is (10 sec: 5648.1, 60 sec: 5625.1, 300 sec: 5671.4). Total num frames: 537660416. Throughput: 0: 5807.1. Samples: 537667476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:20,656][25689] Avg episode reward: [(0, '-28.049')] [2022-07-10 02:21:21,219][26022] Updated weights on worker 0-0, policy_version 525063 (0.00096) [2022-07-10 02:21:22,879][26022] Updated weights on worker 0-0, policy_version 525073 (0.00081) [2022-07-10 02:21:24,835][26022] Updated weights on worker 0-0, policy_version 525083 (0.00092) [2022-07-10 02:21:25,695][25689] Fps is (10 sec: 5766.5, 60 sec: 5640.7, 300 sec: 5674.3). Total num frames: 537690112. Throughput: 0: 5062.7. Samples: 537684530. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:25,695][25689] Avg episode reward: [(0, '-27.968')] [2022-07-10 02:21:26,300][26022] Updated weights on worker 0-0, policy_version 525093 (0.00088) [2022-07-10 02:21:28,450][26022] Updated weights on worker 0-0, policy_version 525103 (0.00086) [2022-07-10 02:21:30,087][26022] Updated weights on worker 0-0, policy_version 525113 (0.00093) [2022-07-10 02:21:30,702][25689] Fps is (10 sec: 5809.6, 60 sec: 5660.8, 300 sec: 5670.8). Total num frames: 537718784. Throughput: 0: 5944.1. Samples: 537718954. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:30,705][25689] Avg episode reward: [(0, '-27.538')] [2022-07-10 02:21:31,928][26022] Updated weights on worker 0-0, policy_version 525123 (0.00102) [2022-07-10 02:21:33,737][26022] Updated weights on worker 0-0, policy_version 525133 (0.00088) [2022-07-10 02:21:35,386][26022] Updated weights on worker 0-0, policy_version 525143 (0.00084) [2022-07-10 02:21:35,718][25689] Fps is (10 sec: 5823.4, 60 sec: 5694.1, 300 sec: 5676.2). Total num frames: 537748480. Throughput: 0: 5979.9. Samples: 537753472. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:35,718][25689] Avg episode reward: [(0, '-27.737')] [2022-07-10 02:21:37,328][26022] Updated weights on worker 0-0, policy_version 525153 (0.00081) [2022-07-10 02:21:39,084][26022] Updated weights on worker 0-0, policy_version 525163 (0.00084) [2022-07-10 02:21:40,788][25689] Fps is (10 sec: 5685.7, 60 sec: 5663.2, 300 sec: 5675.5). Total num frames: 537776128. Throughput: 0: 5118.4. Samples: 537770522. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:40,788][25689] Avg episode reward: [(0, '-28.191')] [2022-07-10 02:21:40,902][26022] Updated weights on worker 0-0, policy_version 525173 (0.00092) [2022-07-10 02:21:42,642][26022] Updated weights on worker 0-0, policy_version 525183 (0.00085) [2022-07-10 02:21:44,419][26022] Updated weights on worker 0-0, policy_version 525193 (0.00092) [2022-07-10 02:21:45,858][25689] Fps is (10 sec: 5554.3, 60 sec: 5640.5, 300 sec: 5668.7). Total num frames: 537804800. Throughput: 0: 5956.5. Samples: 537804630. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:45,858][25689] Avg episode reward: [(0, '-28.326')] [2022-07-10 02:21:46,316][26022] Updated weights on worker 0-0, policy_version 525203 (0.00093) [2022-07-10 02:21:47,951][26022] Updated weights on worker 0-0, policy_version 525213 (0.00092) [2022-07-10 02:21:49,834][26022] Updated weights on worker 0-0, policy_version 525223 (0.00079) [2022-07-10 02:21:50,873][25689] Fps is (10 sec: 5889.1, 60 sec: 5694.2, 300 sec: 5676.8). Total num frames: 537835520. Throughput: 0: 5964.7. Samples: 537839266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:50,874][25689] Avg episode reward: [(0, '-28.782')] [2022-07-10 02:21:51,451][26022] Updated weights on worker 0-0, policy_version 525233 (0.00086) [2022-07-10 02:21:51,640][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:21:51,654][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000525234_537839616.pth [2022-07-10 02:21:51,655][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000523235_535792640.pth [2022-07-10 02:21:53,509][26022] Updated weights on worker 0-0, policy_version 525243 (0.00090) [2022-07-10 02:21:55,133][26022] Updated weights on worker 0-0, policy_version 525253 (0.00090) [2022-07-10 02:21:55,895][25689] Fps is (10 sec: 5713.2, 60 sec: 5648.3, 300 sec: 5670.9). Total num frames: 537862144. Throughput: 0: 5109.3. Samples: 537856562. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:21:55,895][25689] Avg episode reward: [(0, '-30.615')] [2022-07-10 02:21:56,935][26022] Updated weights on worker 0-0, policy_version 525263 (0.00087) [2022-07-10 02:21:58,923][26022] Updated weights on worker 0-0, policy_version 525273 (0.00087) [2022-07-10 02:22:00,386][26022] Updated weights on worker 0-0, policy_version 525283 (0.00085) [2022-07-10 02:22:00,935][25689] Fps is (10 sec: 5699.3, 60 sec: 5683.4, 300 sec: 5688.1). Total num frames: 537892864. Throughput: 0: 5972.8. Samples: 537890854. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:00,935][25689] Avg episode reward: [(0, '-30.111')] [2022-07-10 02:22:02,737][26022] Updated weights on worker 0-0, policy_version 525293 (0.00081) [2022-07-10 02:22:04,471][26022] Updated weights on worker 0-0, policy_version 525303 (0.00084) [2022-07-10 02:22:05,937][25689] Fps is (10 sec: 5710.2, 60 sec: 5688.7, 300 sec: 5678.2). Total num frames: 537919488. Throughput: 0: 5896.5. Samples: 537923028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:05,938][25689] Avg episode reward: [(0, '-29.086')] [2022-07-10 02:22:06,142][26022] Updated weights on worker 0-0, policy_version 525313 (0.00086) [2022-07-10 02:22:08,222][26022] Updated weights on worker 0-0, policy_version 525323 (0.00088) [2022-07-10 02:22:09,588][26022] Updated weights on worker 0-0, policy_version 525333 (0.00090) [2022-07-10 02:22:10,965][25689] Fps is (10 sec: 5308.7, 60 sec: 5673.6, 300 sec: 5670.9). Total num frames: 537946112. Throughput: 0: 5028.0. Samples: 537940288. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:10,966][25689] Avg episode reward: [(0, '-28.881')] [2022-07-10 02:22:11,639][26022] Updated weights on worker 0-0, policy_version 525343 (0.00091) [2022-07-10 02:22:13,522][26022] Updated weights on worker 0-0, policy_version 525353 (0.00082) [2022-07-10 02:22:15,166][26022] Updated weights on worker 0-0, policy_version 525363 (0.00079) [2022-07-10 02:22:15,983][25689] Fps is (10 sec: 5606.4, 60 sec: 5695.5, 300 sec: 5674.8). Total num frames: 537975808. Throughput: 0: 5874.8. Samples: 537974576. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:15,983][25689] Avg episode reward: [(0, '-28.636')] [2022-07-10 02:22:17,079][26022] Updated weights on worker 0-0, policy_version 525373 (0.00091) [2022-07-10 02:22:18,827][26022] Updated weights on worker 0-0, policy_version 525383 (0.00088) [2022-07-10 02:22:20,459][26022] Updated weights on worker 0-0, policy_version 525393 (0.00092) [2022-07-10 02:22:21,101][25689] Fps is (10 sec: 5758.4, 60 sec: 5692.1, 300 sec: 5673.6). Total num frames: 538004480. Throughput: 0: 5865.1. Samples: 538009132. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:21,102][25689] Avg episode reward: [(0, '-28.749')] [2022-07-10 02:22:22,379][26022] Updated weights on worker 0-0, policy_version 525403 (0.00082) [2022-07-10 02:22:23,895][26022] Updated weights on worker 0-0, policy_version 525413 (0.00084) [2022-07-10 02:22:26,012][26022] Updated weights on worker 0-0, policy_version 525423 (0.00097) [2022-07-10 02:22:26,117][25689] Fps is (10 sec: 5658.1, 60 sec: 5677.3, 300 sec: 5674.5). Total num frames: 538033152. Throughput: 0: 5978.7. Samples: 538043682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:26,118][25689] Avg episode reward: [(0, '-28.844')] [2022-07-10 02:22:27,466][26022] Updated weights on worker 0-0, policy_version 525433 (0.00082) [2022-07-10 02:22:29,713][26022] Updated weights on worker 0-0, policy_version 525443 (0.00085) [2022-07-10 02:22:31,086][26022] Updated weights on worker 0-0, policy_version 525453 (0.00080) [2022-07-10 02:22:31,180][25689] Fps is (10 sec: 5892.8, 60 sec: 5706.1, 300 sec: 5680.4). Total num frames: 538063872. Throughput: 0: 5964.4. Samples: 538060858. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:31,180][25689] Avg episode reward: [(0, '-29.996')] [2022-07-10 02:22:33,271][26022] Updated weights on worker 0-0, policy_version 525463 (0.00087) [2022-07-10 02:22:34,604][26022] Updated weights on worker 0-0, policy_version 525473 (0.00081) [2022-07-10 02:22:36,203][25689] Fps is (10 sec: 5685.5, 60 sec: 5654.5, 300 sec: 5670.6). Total num frames: 538090496. Throughput: 0: 5965.9. Samples: 538095212. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:36,204][25689] Avg episode reward: [(0, '-31.892')] [2022-07-10 02:22:36,725][26022] Updated weights on worker 0-0, policy_version 525483 (0.00087) [2022-07-10 02:22:38,257][26022] Updated weights on worker 0-0, policy_version 525493 (0.00092) [2022-07-10 02:22:40,275][26022] Updated weights on worker 0-0, policy_version 525503 (0.00082) [2022-07-10 02:22:41,274][25689] Fps is (10 sec: 5681.1, 60 sec: 5705.3, 300 sec: 5677.3). Total num frames: 538121216. Throughput: 0: 5972.2. Samples: 538129606. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:41,274][25689] Avg episode reward: [(0, '-31.307')] [2022-07-10 02:22:42,105][26022] Updated weights on worker 0-0, policy_version 525513 (0.00090) [2022-07-10 02:22:43,791][26022] Updated weights on worker 0-0, policy_version 525523 (0.00086) [2022-07-10 02:22:45,799][26022] Updated weights on worker 0-0, policy_version 525533 (0.00090) [2022-07-10 02:22:46,291][25689] Fps is (10 sec: 5887.9, 60 sec: 5710.2, 300 sec: 5678.3). Total num frames: 538149888. Throughput: 0: 5101.4. Samples: 538146594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:46,291][25689] Avg episode reward: [(0, '-31.259')] [2022-07-10 02:22:47,420][26022] Updated weights on worker 0-0, policy_version 525543 (0.00079) [2022-07-10 02:22:49,081][26022] Updated weights on worker 0-0, policy_version 525553 (0.00081) [2022-07-10 02:22:51,017][26022] Updated weights on worker 0-0, policy_version 525563 (0.01238) [2022-07-10 02:22:51,300][25689] Fps is (10 sec: 5617.3, 60 sec: 5660.0, 300 sec: 5671.6). Total num frames: 538177536. Throughput: 0: 5991.1. Samples: 538181398. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:51,300][25689] Avg episode reward: [(0, '-30.695')] [2022-07-10 02:22:52,533][26022] Updated weights on worker 0-0, policy_version 525573 (0.00085) [2022-07-10 02:22:54,551][26022] Updated weights on worker 0-0, policy_version 525583 (0.00095) [2022-07-10 02:22:56,273][26022] Updated weights on worker 0-0, policy_version 525593 (0.00086) [2022-07-10 02:22:56,330][25689] Fps is (10 sec: 5711.7, 60 sec: 5710.0, 300 sec: 5675.6). Total num frames: 538207232. Throughput: 0: 5996.5. Samples: 538215904. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:22:56,331][25689] Avg episode reward: [(0, '-29.749')] [2022-07-10 02:22:58,125][26022] Updated weights on worker 0-0, policy_version 525603 (0.00091) [2022-07-10 02:23:00,001][26022] Updated weights on worker 0-0, policy_version 525613 (0.00090) [2022-07-10 02:23:01,446][25689] Fps is (10 sec: 5752.9, 60 sec: 5669.1, 300 sec: 5678.7). Total num frames: 538235904. Throughput: 0: 5128.3. Samples: 538233054. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:01,446][25689] Avg episode reward: [(0, '-29.033')] [2022-07-10 02:23:01,639][26022] Updated weights on worker 0-0, policy_version 525623 (0.00084) [2022-07-10 02:23:03,847][26022] Updated weights on worker 0-0, policy_version 525633 (0.00111) [2022-07-10 02:23:05,489][26022] Updated weights on worker 0-0, policy_version 525643 (0.00086) [2022-07-10 02:23:06,508][25689] Fps is (10 sec: 5533.7, 60 sec: 5680.3, 300 sec: 5681.0). Total num frames: 538263552. Throughput: 0: 5909.8. Samples: 538266076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:06,509][25689] Avg episode reward: [(0, '-28.713')] [2022-07-10 02:23:07,306][26022] Updated weights on worker 0-0, policy_version 525653 (0.00082) [2022-07-10 02:23:09,209][26022] Updated weights on worker 0-0, policy_version 525663 (0.00086) [2022-07-10 02:23:10,823][26022] Updated weights on worker 0-0, policy_version 525673 (0.00081) [2022-07-10 02:23:11,553][25689] Fps is (10 sec: 5572.4, 60 sec: 5712.6, 300 sec: 5680.9). Total num frames: 538292224. Throughput: 0: 5876.0. Samples: 538300404. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:11,554][25689] Avg episode reward: [(0, '-29.267')] [2022-07-10 02:23:12,899][26022] Updated weights on worker 0-0, policy_version 525683 (0.00082) [2022-07-10 02:23:14,508][26022] Updated weights on worker 0-0, policy_version 525693 (0.00084) [2022-07-10 02:23:16,322][26022] Updated weights on worker 0-0, policy_version 525703 (0.00083) [2022-07-10 02:23:16,578][25689] Fps is (10 sec: 5796.4, 60 sec: 5711.9, 300 sec: 5681.7). Total num frames: 538321920. Throughput: 0: 5010.6. Samples: 538317356. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:16,579][25689] Avg episode reward: [(0, '-29.216')] [2022-07-10 02:23:18,091][26022] Updated weights on worker 0-0, policy_version 525713 (0.00080) [2022-07-10 02:23:19,797][26022] Updated weights on worker 0-0, policy_version 525723 (0.00280) [2022-07-10 02:23:21,635][25689] Fps is (10 sec: 5687.3, 60 sec: 5700.7, 300 sec: 5677.3). Total num frames: 538349568. Throughput: 0: 5877.6. Samples: 538351722. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:21,636][25689] Avg episode reward: [(0, '-29.292')] [2022-07-10 02:23:21,928][26022] Updated weights on worker 0-0, policy_version 525733 (0.00083) [2022-07-10 02:23:23,443][26022] Updated weights on worker 0-0, policy_version 525743 (0.00091) [2022-07-10 02:23:25,187][26022] Updated weights on worker 0-0, policy_version 525753 (0.00081) [2022-07-10 02:23:26,734][25689] Fps is (10 sec: 5545.6, 60 sec: 5693.0, 300 sec: 5676.0). Total num frames: 538378240. Throughput: 0: 5928.5. Samples: 538385982. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:26,734][25689] Avg episode reward: [(0, '-29.853')] [2022-07-10 02:23:27,074][26022] Updated weights on worker 0-0, policy_version 525763 (0.00085) [2022-07-10 02:23:28,840][26022] Updated weights on worker 0-0, policy_version 525773 (0.00083) [2022-07-10 02:23:30,707][26022] Updated weights on worker 0-0, policy_version 525783 (0.00100) [2022-07-10 02:23:31,747][25689] Fps is (10 sec: 5772.7, 60 sec: 5680.7, 300 sec: 5679.6). Total num frames: 538407936. Throughput: 0: 5092.6. Samples: 538403244. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:31,747][25689] Avg episode reward: [(0, '-31.417')] [2022-07-10 02:23:32,445][26022] Updated weights on worker 0-0, policy_version 525793 (0.00087) [2022-07-10 02:23:34,242][26022] Updated weights on worker 0-0, policy_version 525803 (0.00366) [2022-07-10 02:23:35,914][26022] Updated weights on worker 0-0, policy_version 525813 (0.00094) [2022-07-10 02:23:36,758][25689] Fps is (10 sec: 5618.4, 60 sec: 5681.9, 300 sec: 5673.6). Total num frames: 538434560. Throughput: 0: 5955.6. Samples: 538437540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:36,758][25689] Avg episode reward: [(0, '-30.885')] [2022-07-10 02:23:37,815][26022] Updated weights on worker 0-0, policy_version 525823 (0.00420) [2022-07-10 02:23:39,635][26022] Updated weights on worker 0-0, policy_version 525833 (0.00085) [2022-07-10 02:23:41,498][26022] Updated weights on worker 0-0, policy_version 525843 (0.00092) [2022-07-10 02:23:41,823][25689] Fps is (10 sec: 5589.4, 60 sec: 5665.5, 300 sec: 5676.1). Total num frames: 538464256. Throughput: 0: 5957.1. Samples: 538471980. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:41,823][25689] Avg episode reward: [(0, '-30.536')] [2022-07-10 02:23:43,381][26022] Updated weights on worker 0-0, policy_version 525853 (0.00086) [2022-07-10 02:23:45,029][26022] Updated weights on worker 0-0, policy_version 525863 (0.00085) [2022-07-10 02:23:46,868][25689] Fps is (10 sec: 5773.2, 60 sec: 5662.8, 300 sec: 5672.4). Total num frames: 538492928. Throughput: 0: 5118.5. Samples: 538489040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:46,869][25689] Avg episode reward: [(0, '-31.254')] [2022-07-10 02:23:46,937][26022] Updated weights on worker 0-0, policy_version 525873 (0.00090) [2022-07-10 02:23:48,609][26022] Updated weights on worker 0-0, policy_version 525883 (0.00080) [2022-07-10 02:23:50,559][26022] Updated weights on worker 0-0, policy_version 525893 (0.00088) [2022-07-10 02:23:51,817][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:23:51,832][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000525900_538521600.pth [2022-07-10 02:23:51,833][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000523902_536475648.pth [2022-07-10 02:23:51,937][25689] Fps is (10 sec: 5669.6, 60 sec: 5674.1, 300 sec: 5674.9). Total num frames: 538521600. Throughput: 0: 5956.9. Samples: 538523518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:51,938][25689] Avg episode reward: [(0, '-30.939')] [2022-07-10 02:23:52,258][26022] Updated weights on worker 0-0, policy_version 525903 (0.00083) [2022-07-10 02:23:54,070][26022] Updated weights on worker 0-0, policy_version 525913 (0.00091) [2022-07-10 02:23:55,988][26022] Updated weights on worker 0-0, policy_version 525923 (0.00087) [2022-07-10 02:23:56,952][25689] Fps is (10 sec: 5788.3, 60 sec: 5675.6, 300 sec: 5683.2). Total num frames: 538551296. Throughput: 0: 5932.9. Samples: 538557350. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:23:56,952][25689] Avg episode reward: [(0, '-29.442')] [2022-07-10 02:23:57,631][26022] Updated weights on worker 0-0, policy_version 525933 (0.00111) [2022-07-10 02:23:59,472][26022] Updated weights on worker 0-0, policy_version 525943 (0.00083) [2022-07-10 02:24:01,180][26022] Updated weights on worker 0-0, policy_version 525953 (0.00099) [2022-07-10 02:24:02,067][25689] Fps is (10 sec: 5560.0, 60 sec: 5641.9, 300 sec: 5671.1). Total num frames: 538577920. Throughput: 0: 5069.6. Samples: 538574608. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:24:02,067][25689] Avg episode reward: [(0, '-28.104')] [2022-07-10 02:24:03,420][26022] Updated weights on worker 0-0, policy_version 525963 (0.00092) [2022-07-10 02:24:05,343][26022] Updated weights on worker 0-0, policy_version 525973 (0.00091) [2022-07-10 02:24:06,884][26022] Updated weights on worker 0-0, policy_version 525983 (0.00085) [2022-07-10 02:24:07,092][25689] Fps is (10 sec: 5554.2, 60 sec: 5679.1, 300 sec: 5681.5). Total num frames: 538607616. Throughput: 0: 5845.3. Samples: 538607256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 02:24:07,093][25689] Avg episode reward: [(0, '-29.883')] [2022-07-10 02:24:08,759][26022] Updated weights on worker 0-0, policy_version 525993 (0.00714) [2022-07-10 02:24:10,480][26022] Updated weights on worker 0-0, policy_version 526003 (0.00091) [2022-07-10 02:24:12,126][25689] Fps is (10 sec: 5700.6, 60 sec: 5663.2, 300 sec: 5677.7). Total num frames: 538635264. Throughput: 0: 5845.5. Samples: 538641532. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:12,126][25689] Avg episode reward: [(0, '-29.150')] [2022-07-10 02:24:12,523][26022] Updated weights on worker 0-0, policy_version 526013 (0.00083) [2022-07-10 02:24:14,069][26022] Updated weights on worker 0-0, policy_version 526023 (0.00064) [2022-07-10 02:24:15,992][26022] Updated weights on worker 0-0, policy_version 526033 (0.00095) [2022-07-10 02:24:17,131][25689] Fps is (10 sec: 5610.1, 60 sec: 5648.2, 300 sec: 5676.6). Total num frames: 538663936. Throughput: 0: 5014.8. Samples: 538658544. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:17,136][25689] Avg episode reward: [(0, '-29.093')] [2022-07-10 02:24:17,525][26022] Updated weights on worker 0-0, policy_version 526043 (0.00085) [2022-07-10 02:24:19,353][26022] Updated weights on worker 0-0, policy_version 526053 (0.00086) [2022-07-10 02:24:21,401][26022] Updated weights on worker 0-0, policy_version 526063 (0.00093) [2022-07-10 02:24:22,174][25689] Fps is (10 sec: 5809.1, 60 sec: 5683.4, 300 sec: 5679.3). Total num frames: 538693632. Throughput: 0: 5895.1. Samples: 538693140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:22,174][25689] Avg episode reward: [(0, '-29.501')] [2022-07-10 02:24:23,023][26022] Updated weights on worker 0-0, policy_version 526073 (0.00088) [2022-07-10 02:24:24,848][26022] Updated weights on worker 0-0, policy_version 526083 (0.00092) [2022-07-10 02:24:26,663][26022] Updated weights on worker 0-0, policy_version 526093 (0.00082) [2022-07-10 02:24:27,176][25689] Fps is (10 sec: 5607.1, 60 sec: 5658.6, 300 sec: 5673.6). Total num frames: 538720256. Throughput: 0: 5971.6. Samples: 538727186. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:27,176][25689] Avg episode reward: [(0, '-30.194')] [2022-07-10 02:24:28,345][26022] Updated weights on worker 0-0, policy_version 526103 (0.00087) [2022-07-10 02:24:30,496][26022] Updated weights on worker 0-0, policy_version 526113 (0.00088) [2022-07-10 02:24:32,111][26022] Updated weights on worker 0-0, policy_version 526123 (0.00092) [2022-07-10 02:24:32,179][25689] Fps is (10 sec: 5628.9, 60 sec: 5659.5, 300 sec: 5674.1). Total num frames: 538749952. Throughput: 0: 5128.8. Samples: 538744382. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:32,180][25689] Avg episode reward: [(0, '-29.056')] [2022-07-10 02:24:33,816][26022] Updated weights on worker 0-0, policy_version 526133 (0.00084) [2022-07-10 02:24:35,693][26022] Updated weights on worker 0-0, policy_version 526143 (0.00095) [2022-07-10 02:24:37,191][25689] Fps is (10 sec: 5929.9, 60 sec: 5710.2, 300 sec: 5685.3). Total num frames: 538779648. Throughput: 0: 5975.2. Samples: 538778408. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:37,192][25689] Avg episode reward: [(0, '-29.730')] [2022-07-10 02:24:37,471][26022] Updated weights on worker 0-0, policy_version 526153 (0.00094) [2022-07-10 02:24:39,368][26022] Updated weights on worker 0-0, policy_version 526163 (0.00087) [2022-07-10 02:24:41,005][26022] Updated weights on worker 0-0, policy_version 526173 (0.00087) [2022-07-10 02:24:42,262][25689] Fps is (10 sec: 5687.1, 60 sec: 5675.7, 300 sec: 5673.8). Total num frames: 538807296. Throughput: 0: 5934.5. Samples: 538812356. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:42,269][25689] Avg episode reward: [(0, '-30.268')] [2022-07-10 02:24:43,052][26022] Updated weights on worker 0-0, policy_version 526183 (0.00091) [2022-07-10 02:24:44,663][26022] Updated weights on worker 0-0, policy_version 526193 (0.00085) [2022-07-10 02:24:46,612][26022] Updated weights on worker 0-0, policy_version 526203 (0.00087) [2022-07-10 02:24:47,301][25689] Fps is (10 sec: 5571.0, 60 sec: 5676.4, 300 sec: 5670.5). Total num frames: 538835968. Throughput: 0: 5073.3. Samples: 538829286. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:47,301][25689] Avg episode reward: [(0, '-30.396')] [2022-07-10 02:24:48,344][26022] Updated weights on worker 0-0, policy_version 526213 (0.00091) [2022-07-10 02:24:50,184][26022] Updated weights on worker 0-0, policy_version 526223 (0.00090) [2022-07-10 02:24:51,910][26022] Updated weights on worker 0-0, policy_version 526233 (0.00086) [2022-07-10 02:24:52,365][25689] Fps is (10 sec: 5575.0, 60 sec: 5659.9, 300 sec: 5669.7). Total num frames: 538863616. Throughput: 0: 5889.1. Samples: 538863256. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:52,365][25689] Avg episode reward: [(0, '-30.368')] [2022-07-10 02:24:53,753][26022] Updated weights on worker 0-0, policy_version 526243 (0.00094) [2022-07-10 02:24:55,733][26022] Updated weights on worker 0-0, policy_version 526253 (0.00086) [2022-07-10 02:24:57,241][26022] Updated weights on worker 0-0, policy_version 526263 (0.00088) [2022-07-10 02:24:57,402][25689] Fps is (10 sec: 5676.6, 60 sec: 5657.8, 300 sec: 5675.0). Total num frames: 538893312. Throughput: 0: 5895.2. Samples: 538897556. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:24:57,404][25689] Avg episode reward: [(0, '-30.556')] [2022-07-10 02:24:59,203][26022] Updated weights on worker 0-0, policy_version 526273 (0.00087) [2022-07-10 02:25:00,906][26022] Updated weights on worker 0-0, policy_version 526283 (0.00085) [2022-07-10 02:25:02,486][25689] Fps is (10 sec: 5564.3, 60 sec: 5660.7, 300 sec: 5670.5). Total num frames: 538919936. Throughput: 0: 5817.1. Samples: 538930000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:02,487][25689] Avg episode reward: [(0, '-30.579')] [2022-07-10 02:25:03,106][26022] Updated weights on worker 0-0, policy_version 526293 (0.00090) [2022-07-10 02:25:04,898][26022] Updated weights on worker 0-0, policy_version 526303 (0.00081) [2022-07-10 02:25:06,730][26022] Updated weights on worker 0-0, policy_version 526313 (0.00096) [2022-07-10 02:25:07,538][25689] Fps is (10 sec: 5455.5, 60 sec: 5641.2, 300 sec: 5670.9). Total num frames: 538948608. Throughput: 0: 5822.5. Samples: 538947120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:07,539][25689] Avg episode reward: [(0, '-30.606')] [2022-07-10 02:25:08,473][26022] Updated weights on worker 0-0, policy_version 526323 (0.00088) [2022-07-10 02:25:10,180][26022] Updated weights on worker 0-0, policy_version 526333 (0.00092) [2022-07-10 02:25:11,995][26022] Updated weights on worker 0-0, policy_version 526343 (0.00086) [2022-07-10 02:25:12,599][25689] Fps is (10 sec: 5771.7, 60 sec: 5672.6, 300 sec: 5673.4). Total num frames: 538978304. Throughput: 0: 5832.1. Samples: 538981266. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:12,600][25689] Avg episode reward: [(0, '-29.868')] [2022-07-10 02:25:13,927][26022] Updated weights on worker 0-0, policy_version 526353 (0.00097) [2022-07-10 02:25:15,690][26022] Updated weights on worker 0-0, policy_version 526363 (0.00090) [2022-07-10 02:25:17,482][26022] Updated weights on worker 0-0, policy_version 526373 (0.00091) [2022-07-10 02:25:17,617][25689] Fps is (10 sec: 5689.7, 60 sec: 5654.5, 300 sec: 5667.5). Total num frames: 539005952. Throughput: 0: 5837.2. Samples: 539015552. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:17,619][25689] Avg episode reward: [(0, '-30.024')] [2022-07-10 02:25:19,280][26022] Updated weights on worker 0-0, policy_version 526383 (0.00091) [2022-07-10 02:25:21,125][26022] Updated weights on worker 0-0, policy_version 526393 (0.00085) [2022-07-10 02:25:22,667][25689] Fps is (10 sec: 5695.5, 60 sec: 5653.8, 300 sec: 5670.4). Total num frames: 539035648. Throughput: 0: 5090.7. Samples: 539032736. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:22,668][25689] Avg episode reward: [(0, '-30.115')] [2022-07-10 02:25:22,848][26022] Updated weights on worker 0-0, policy_version 526403 (0.00087) [2022-07-10 02:25:24,788][26022] Updated weights on worker 0-0, policy_version 526413 (0.00085) [2022-07-10 02:25:26,388][26022] Updated weights on worker 0-0, policy_version 526423 (0.00092) [2022-07-10 02:25:27,708][25689] Fps is (10 sec: 5682.8, 60 sec: 5667.1, 300 sec: 5670.4). Total num frames: 539063296. Throughput: 0: 5943.7. Samples: 539067002. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:27,709][25689] Avg episode reward: [(0, '-32.060')] [2022-07-10 02:25:28,331][26022] Updated weights on worker 0-0, policy_version 526433 (0.00097) [2022-07-10 02:25:30,142][26022] Updated weights on worker 0-0, policy_version 526443 (0.00353) [2022-07-10 02:25:31,831][26022] Updated weights on worker 0-0, policy_version 526453 (0.00085) [2022-07-10 02:25:32,755][25689] Fps is (10 sec: 5684.7, 60 sec: 5663.0, 300 sec: 5676.6). Total num frames: 539092992. Throughput: 0: 5964.8. Samples: 539101490. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:32,756][25689] Avg episode reward: [(0, '-32.005')] [2022-07-10 02:25:33,655][26022] Updated weights on worker 0-0, policy_version 526463 (0.00089) [2022-07-10 02:25:35,515][26022] Updated weights on worker 0-0, policy_version 526473 (0.00091) [2022-07-10 02:25:37,269][26022] Updated weights on worker 0-0, policy_version 526483 (0.00085) [2022-07-10 02:25:37,766][25689] Fps is (10 sec: 5802.8, 60 sec: 5646.2, 300 sec: 5674.9). Total num frames: 539121664. Throughput: 0: 5120.2. Samples: 539118720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:37,768][25689] Avg episode reward: [(0, '-31.756')] [2022-07-10 02:25:39,032][26022] Updated weights on worker 0-0, policy_version 526493 (0.00088) [2022-07-10 02:25:40,699][26022] Updated weights on worker 0-0, policy_version 526503 (0.00092) [2022-07-10 02:25:42,579][26022] Updated weights on worker 0-0, policy_version 526513 (0.00081) [2022-07-10 02:25:42,889][25689] Fps is (10 sec: 5658.3, 60 sec: 5658.2, 300 sec: 5669.2). Total num frames: 539150336. Throughput: 0: 5957.3. Samples: 539153204. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:42,890][25689] Avg episode reward: [(0, '-31.106')] [2022-07-10 02:25:44,179][26022] Updated weights on worker 0-0, policy_version 526523 (0.00088) [2022-07-10 02:25:46,380][26022] Updated weights on worker 0-0, policy_version 526533 (0.00091) [2022-07-10 02:25:47,728][26022] Updated weights on worker 0-0, policy_version 526543 (0.00099) [2022-07-10 02:25:47,918][25689] Fps is (10 sec: 5749.5, 60 sec: 5676.0, 300 sec: 5676.5). Total num frames: 539180032. Throughput: 0: 5965.7. Samples: 539187570. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:47,919][25689] Avg episode reward: [(0, '-30.487')] [2022-07-10 02:25:49,776][26022] Updated weights on worker 0-0, policy_version 526553 (0.00087) [2022-07-10 02:25:51,362][26022] Updated weights on worker 0-0, policy_version 526563 (0.00088) [2022-07-10 02:25:51,896][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:25:51,909][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000526565_539202560.pth [2022-07-10 02:25:51,913][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000524568_537157632.pth [2022-07-10 02:25:52,940][25689] Fps is (10 sec: 5807.4, 60 sec: 5696.8, 300 sec: 5674.0). Total num frames: 539208704. Throughput: 0: 5117.7. Samples: 539204792. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:52,940][25689] Avg episode reward: [(0, '-30.826')] [2022-07-10 02:25:53,341][26022] Updated weights on worker 0-0, policy_version 526573 (0.00088) [2022-07-10 02:25:55,140][26022] Updated weights on worker 0-0, policy_version 526583 (0.00087) [2022-07-10 02:25:56,959][26022] Updated weights on worker 0-0, policy_version 526593 (0.00095) [2022-07-10 02:25:58,021][25689] Fps is (10 sec: 5676.0, 60 sec: 5675.9, 300 sec: 5673.5). Total num frames: 539237376. Throughput: 0: 5948.4. Samples: 539239204. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:25:58,022][25689] Avg episode reward: [(0, '-29.463')] [2022-07-10 02:25:58,497][26022] Updated weights on worker 0-0, policy_version 526603 (0.00090) [2022-07-10 02:26:00,589][26022] Updated weights on worker 0-0, policy_version 526613 (0.00091) [2022-07-10 02:26:02,462][26022] Updated weights on worker 0-0, policy_version 526623 (0.00087) [2022-07-10 02:26:03,148][25689] Fps is (10 sec: 5416.7, 60 sec: 5671.8, 300 sec: 5672.2). Total num frames: 539264000. Throughput: 0: 5822.5. Samples: 539271164. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:03,149][25689] Avg episode reward: [(0, '-29.305')] [2022-07-10 02:26:04,463][26022] Updated weights on worker 0-0, policy_version 526633 (0.00086) [2022-07-10 02:26:06,170][26022] Updated weights on worker 0-0, policy_version 526643 (0.00088) [2022-07-10 02:26:08,150][26022] Updated weights on worker 0-0, policy_version 526653 (0.00088) [2022-07-10 02:26:08,223][25689] Fps is (10 sec: 5420.3, 60 sec: 5669.7, 300 sec: 5675.1). Total num frames: 539292672. Throughput: 0: 4958.1. Samples: 539288242. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:08,223][25689] Avg episode reward: [(0, '-30.159')] [2022-07-10 02:26:09,606][26022] Updated weights on worker 0-0, policy_version 526663 (0.00087) [2022-07-10 02:26:11,967][26022] Updated weights on worker 0-0, policy_version 526673 (0.00092) [2022-07-10 02:26:13,237][25689] Fps is (10 sec: 5785.5, 60 sec: 5674.1, 300 sec: 5679.6). Total num frames: 539322368. Throughput: 0: 5801.2. Samples: 539322540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:13,238][25689] Avg episode reward: [(0, '-31.557')] [2022-07-10 02:26:13,291][26022] Updated weights on worker 0-0, policy_version 526683 (0.00086) [2022-07-10 02:26:15,332][26022] Updated weights on worker 0-0, policy_version 526693 (0.00097) [2022-07-10 02:26:17,003][26022] Updated weights on worker 0-0, policy_version 526703 (0.00089) [2022-07-10 02:26:18,267][25689] Fps is (10 sec: 5709.3, 60 sec: 5673.0, 300 sec: 5677.2). Total num frames: 539350016. Throughput: 0: 5814.9. Samples: 539356930. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:18,267][25689] Avg episode reward: [(0, '-32.415')] [2022-07-10 02:26:18,727][26022] Updated weights on worker 0-0, policy_version 526713 (0.00082) [2022-07-10 02:26:20,699][26022] Updated weights on worker 0-0, policy_version 526723 (0.00087) [2022-07-10 02:26:22,247][26022] Updated weights on worker 0-0, policy_version 526733 (0.00090) [2022-07-10 02:26:23,347][25689] Fps is (10 sec: 5671.9, 60 sec: 5670.1, 300 sec: 5676.4). Total num frames: 539379712. Throughput: 0: 5104.2. Samples: 539374262. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:23,348][25689] Avg episode reward: [(0, '-31.613')] [2022-07-10 02:26:24,201][26022] Updated weights on worker 0-0, policy_version 526743 (0.00086) [2022-07-10 02:26:25,899][26022] Updated weights on worker 0-0, policy_version 526753 (0.00095) [2022-07-10 02:26:27,564][26022] Updated weights on worker 0-0, policy_version 526763 (0.00090) [2022-07-10 02:26:28,356][25689] Fps is (10 sec: 5886.4, 60 sec: 5706.9, 300 sec: 5679.8). Total num frames: 539409408. Throughput: 0: 5976.7. Samples: 539408574. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:28,358][25689] Avg episode reward: [(0, '-30.160')] [2022-07-10 02:26:29,639][26022] Updated weights on worker 0-0, policy_version 526773 (0.00090) [2022-07-10 02:26:31,071][26022] Updated weights on worker 0-0, policy_version 526783 (0.00087) [2022-07-10 02:26:33,162][26022] Updated weights on worker 0-0, policy_version 526793 (0.00090) [2022-07-10 02:26:33,387][25689] Fps is (10 sec: 5813.9, 60 sec: 5691.5, 300 sec: 5676.1). Total num frames: 539438080. Throughput: 0: 5978.7. Samples: 539443008. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:33,389][25689] Avg episode reward: [(0, '-30.761')] [2022-07-10 02:26:34,811][26022] Updated weights on worker 0-0, policy_version 526803 (0.00094) [2022-07-10 02:26:36,699][26022] Updated weights on worker 0-0, policy_version 526813 (0.00085) [2022-07-10 02:26:38,420][25689] Fps is (10 sec: 5596.5, 60 sec: 5672.6, 300 sec: 5676.8). Total num frames: 539465728. Throughput: 0: 5122.3. Samples: 539460160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:38,421][25689] Avg episode reward: [(0, '-30.484')] [2022-07-10 02:26:38,494][26022] Updated weights on worker 0-0, policy_version 526823 (0.00093) [2022-07-10 02:26:40,271][26022] Updated weights on worker 0-0, policy_version 526833 (0.00085) [2022-07-10 02:26:42,035][26022] Updated weights on worker 0-0, policy_version 526843 (0.00083) [2022-07-10 02:26:43,453][25689] Fps is (10 sec: 5594.5, 60 sec: 5681.0, 300 sec: 5677.5). Total num frames: 539494400. Throughput: 0: 5982.7. Samples: 539494552. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:43,455][25689] Avg episode reward: [(0, '-30.021')] [2022-07-10 02:26:43,918][26022] Updated weights on worker 0-0, policy_version 526853 (0.00090) [2022-07-10 02:26:45,508][26022] Updated weights on worker 0-0, policy_version 526863 (0.00090) [2022-07-10 02:26:47,598][26022] Updated weights on worker 0-0, policy_version 526873 (0.00091) [2022-07-10 02:26:48,472][25689] Fps is (10 sec: 5806.5, 60 sec: 5682.0, 300 sec: 5674.0). Total num frames: 539524096. Throughput: 0: 5985.5. Samples: 539528976. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:48,473][25689] Avg episode reward: [(0, '-32.456')] [2022-07-10 02:26:49,039][26022] Updated weights on worker 0-0, policy_version 526883 (0.00086) [2022-07-10 02:26:51,019][26022] Updated weights on worker 0-0, policy_version 526893 (0.00092) [2022-07-10 02:26:52,761][26022] Updated weights on worker 0-0, policy_version 526903 (0.00085) [2022-07-10 02:26:53,487][25689] Fps is (10 sec: 5715.2, 60 sec: 5665.7, 300 sec: 5677.6). Total num frames: 539551744. Throughput: 0: 5138.2. Samples: 539546288. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:53,489][25689] Avg episode reward: [(0, '-32.513')] [2022-07-10 02:26:54,591][26022] Updated weights on worker 0-0, policy_version 526913 (0.00101) [2022-07-10 02:26:56,239][26022] Updated weights on worker 0-0, policy_version 526923 (0.00089) [2022-07-10 02:26:58,093][26022] Updated weights on worker 0-0, policy_version 526933 (0.00091) [2022-07-10 02:26:58,506][25689] Fps is (10 sec: 5714.9, 60 sec: 5688.4, 300 sec: 5674.5). Total num frames: 539581440. Throughput: 0: 6014.0. Samples: 539580960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:26:58,507][25689] Avg episode reward: [(0, '-33.000')] [2022-07-10 02:26:59,843][26022] Updated weights on worker 0-0, policy_version 526943 (0.00091) [2022-07-10 02:27:01,740][26022] Updated weights on worker 0-0, policy_version 526953 (0.00090) [2022-07-10 02:27:03,616][25689] Fps is (10 sec: 5762.6, 60 sec: 5723.9, 300 sec: 5679.4). Total num frames: 539610112. Throughput: 0: 5871.6. Samples: 539612938. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 02:27:03,616][25689] Avg episode reward: [(0, '-31.627')] [2022-07-10 02:27:03,618][26022] Updated weights on worker 0-0, policy_version 526963 (0.00086) [2022-07-10 02:27:05,793][26022] Updated weights on worker 0-0, policy_version 526973 (0.00088) [2022-07-10 02:27:07,371][26022] Updated weights on worker 0-0, policy_version 526983 (0.00084) [2022-07-10 02:27:08,674][25689] Fps is (10 sec: 5438.5, 60 sec: 5691.6, 300 sec: 5678.8). Total num frames: 539636736. Throughput: 0: 4998.5. Samples: 539629954. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:08,674][25689] Avg episode reward: [(0, '-30.932')] [2022-07-10 02:27:09,145][26022] Updated weights on worker 0-0, policy_version 526993 (0.00095) [2022-07-10 02:27:11,119][26022] Updated weights on worker 0-0, policy_version 527003 (0.00078) [2022-07-10 02:27:12,800][26022] Updated weights on worker 0-0, policy_version 527013 (0.00089) [2022-07-10 02:27:13,711][25689] Fps is (10 sec: 5376.2, 60 sec: 5655.6, 300 sec: 5671.5). Total num frames: 539664384. Throughput: 0: 5824.5. Samples: 539664080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:13,711][25689] Avg episode reward: [(0, '-32.018')] [2022-07-10 02:27:14,703][26022] Updated weights on worker 0-0, policy_version 527023 (0.00092) [2022-07-10 02:27:16,627][26022] Updated weights on worker 0-0, policy_version 527033 (0.00097) [2022-07-10 02:27:18,261][26022] Updated weights on worker 0-0, policy_version 527043 (0.00079) [2022-07-10 02:27:18,722][25689] Fps is (10 sec: 5808.9, 60 sec: 5708.2, 300 sec: 5680.5). Total num frames: 539695104. Throughput: 0: 5807.0. Samples: 539698352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:18,722][25689] Avg episode reward: [(0, '-32.048')] [2022-07-10 02:27:20,184][26022] Updated weights on worker 0-0, policy_version 527053 (0.00472) [2022-07-10 02:27:21,763][26022] Updated weights on worker 0-0, policy_version 527063 (0.00541) [2022-07-10 02:27:23,704][26022] Updated weights on worker 0-0, policy_version 527073 (0.00092) [2022-07-10 02:27:23,774][25689] Fps is (10 sec: 5799.9, 60 sec: 5676.9, 300 sec: 5676.3). Total num frames: 539722752. Throughput: 0: 5945.4. Samples: 539732788. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:23,775][25689] Avg episode reward: [(0, '-32.426')] [2022-07-10 02:27:25,470][26022] Updated weights on worker 0-0, policy_version 527083 (0.00618) [2022-07-10 02:27:27,280][26022] Updated weights on worker 0-0, policy_version 527093 (0.00084) [2022-07-10 02:27:28,820][25689] Fps is (10 sec: 5576.9, 60 sec: 5656.5, 300 sec: 5669.8). Total num frames: 539751424. Throughput: 0: 5956.2. Samples: 539749952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:28,821][25689] Avg episode reward: [(0, '-32.195')] [2022-07-10 02:27:29,036][26022] Updated weights on worker 0-0, policy_version 527103 (0.00093) [2022-07-10 02:27:31,039][26022] Updated weights on worker 0-0, policy_version 527113 (0.00080) [2022-07-10 02:27:32,636][26022] Updated weights on worker 0-0, policy_version 527123 (0.00087) [2022-07-10 02:27:33,839][25689] Fps is (10 sec: 5799.5, 60 sec: 5674.6, 300 sec: 5680.2). Total num frames: 539781120. Throughput: 0: 5960.2. Samples: 539784046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:33,839][25689] Avg episode reward: [(0, '-33.617')] [2022-07-10 02:27:34,473][26022] Updated weights on worker 0-0, policy_version 527133 (0.00086) [2022-07-10 02:27:36,324][26022] Updated weights on worker 0-0, policy_version 527143 (0.00087) [2022-07-10 02:27:37,970][26022] Updated weights on worker 0-0, policy_version 527153 (0.00096) [2022-07-10 02:27:38,852][25689] Fps is (10 sec: 5614.3, 60 sec: 5659.5, 300 sec: 5667.5). Total num frames: 539807744. Throughput: 0: 5951.1. Samples: 539818150. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:38,852][25689] Avg episode reward: [(0, '-31.892')] [2022-07-10 02:27:39,955][26022] Updated weights on worker 0-0, policy_version 527163 (0.00092) [2022-07-10 02:27:41,748][26022] Updated weights on worker 0-0, policy_version 527173 (0.00092) [2022-07-10 02:27:43,530][26022] Updated weights on worker 0-0, policy_version 527183 (0.00089) [2022-07-10 02:27:43,968][25689] Fps is (10 sec: 5661.0, 60 sec: 5685.6, 300 sec: 5672.5). Total num frames: 539838464. Throughput: 0: 5066.4. Samples: 539835100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:43,969][25689] Avg episode reward: [(0, '-30.997')] [2022-07-10 02:27:45,430][26022] Updated weights on worker 0-0, policy_version 527193 (0.00083) [2022-07-10 02:27:47,012][26022] Updated weights on worker 0-0, policy_version 527203 (0.00088) [2022-07-10 02:27:48,952][26022] Updated weights on worker 0-0, policy_version 527213 (0.00089) [2022-07-10 02:27:49,041][25689] Fps is (10 sec: 5728.7, 60 sec: 5646.7, 300 sec: 5671.3). Total num frames: 539866112. Throughput: 0: 5878.0. Samples: 539868806. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:49,041][25689] Avg episode reward: [(0, '-30.594')] [2022-07-10 02:27:50,696][26022] Updated weights on worker 0-0, policy_version 527223 (0.00089) [2022-07-10 02:27:52,008][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:27:52,023][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000527230_539883520.pth [2022-07-10 02:27:52,023][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000525234_537839616.pth [2022-07-10 02:27:52,660][26022] Updated weights on worker 0-0, policy_version 527233 (0.00097) [2022-07-10 02:27:54,059][25689] Fps is (10 sec: 5581.2, 60 sec: 5663.3, 300 sec: 5668.1). Total num frames: 539894784. Throughput: 0: 5876.5. Samples: 539902874. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:54,060][25689] Avg episode reward: [(0, '-32.336')] [2022-07-10 02:27:54,453][26022] Updated weights on worker 0-0, policy_version 527243 (0.00085) [2022-07-10 02:27:56,123][26022] Updated weights on worker 0-0, policy_version 527253 (0.00084) [2022-07-10 02:27:57,995][26022] Updated weights on worker 0-0, policy_version 527263 (0.00086) [2022-07-10 02:27:59,066][25689] Fps is (10 sec: 5719.8, 60 sec: 5647.5, 300 sec: 5670.1). Total num frames: 539923456. Throughput: 0: 5029.8. Samples: 539919824. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:27:59,067][25689] Avg episode reward: [(0, '-32.437')] [2022-07-10 02:27:59,890][26022] Updated weights on worker 0-0, policy_version 527273 (0.00095) [2022-07-10 02:28:01,426][26022] Updated weights on worker 0-0, policy_version 527283 (0.00080) [2022-07-10 02:28:03,879][26022] Updated weights on worker 0-0, policy_version 527293 (0.00091) [2022-07-10 02:28:04,191][25689] Fps is (10 sec: 5356.8, 60 sec: 5595.4, 300 sec: 5662.1). Total num frames: 539949056. Throughput: 0: 5788.9. Samples: 539952166. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:04,191][25689] Avg episode reward: [(0, '-31.789')] [2022-07-10 02:28:05,530][26022] Updated weights on worker 0-0, policy_version 527303 (0.00101) [2022-07-10 02:28:07,364][26022] Updated weights on worker 0-0, policy_version 527313 (0.00084) [2022-07-10 02:28:09,007][26022] Updated weights on worker 0-0, policy_version 527323 (0.00086) [2022-07-10 02:28:09,219][25689] Fps is (10 sec: 5547.3, 60 sec: 5665.8, 300 sec: 5669.3). Total num frames: 539979776. Throughput: 0: 5846.1. Samples: 539986770. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:09,219][25689] Avg episode reward: [(0, '-32.371')] [2022-07-10 02:28:10,907][26022] Updated weights on worker 0-0, policy_version 527333 (0.00088) [2022-07-10 02:28:12,623][26022] Updated weights on worker 0-0, policy_version 527343 (0.00085) [2022-07-10 02:28:14,223][25689] Fps is (10 sec: 5818.4, 60 sec: 5669.0, 300 sec: 5662.8). Total num frames: 540007424. Throughput: 0: 5019.7. Samples: 540004086. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:14,223][25689] Avg episode reward: [(0, '-31.245')] [2022-07-10 02:28:14,372][26022] Updated weights on worker 0-0, policy_version 527353 (0.00088) [2022-07-10 02:28:16,009][26022] Updated weights on worker 0-0, policy_version 527363 (0.00087) [2022-07-10 02:28:18,153][26022] Updated weights on worker 0-0, policy_version 527373 (0.00087) [2022-07-10 02:28:19,255][25689] Fps is (10 sec: 5713.9, 60 sec: 5650.1, 300 sec: 5670.2). Total num frames: 540037120. Throughput: 0: 5870.3. Samples: 540038336. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:19,255][25689] Avg episode reward: [(0, '-29.825')] [2022-07-10 02:28:19,691][26022] Updated weights on worker 0-0, policy_version 527383 (0.00090) [2022-07-10 02:28:21,835][26022] Updated weights on worker 0-0, policy_version 527393 (0.00087) [2022-07-10 02:28:23,491][26022] Updated weights on worker 0-0, policy_version 527403 (0.00089) [2022-07-10 02:28:24,295][25689] Fps is (10 sec: 5896.5, 60 sec: 5685.1, 300 sec: 5674.7). Total num frames: 540066816. Throughput: 0: 6004.9. Samples: 540072888. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:24,295][25689] Avg episode reward: [(0, '-29.840')] [2022-07-10 02:28:25,149][26022] Updated weights on worker 0-0, policy_version 527413 (0.00085) [2022-07-10 02:28:27,123][26022] Updated weights on worker 0-0, policy_version 527423 (0.00093) [2022-07-10 02:28:28,624][26022] Updated weights on worker 0-0, policy_version 527433 (0.00092) [2022-07-10 02:28:29,299][25689] Fps is (10 sec: 5607.1, 60 sec: 5655.2, 300 sec: 5664.6). Total num frames: 540093440. Throughput: 0: 5148.6. Samples: 540090156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:29,300][25689] Avg episode reward: [(0, '-29.371')] [2022-07-10 02:28:30,658][26022] Updated weights on worker 0-0, policy_version 527443 (0.00088) [2022-07-10 02:28:32,350][26022] Updated weights on worker 0-0, policy_version 527453 (0.00083) [2022-07-10 02:28:34,041][26022] Updated weights on worker 0-0, policy_version 527463 (0.00092) [2022-07-10 02:28:34,307][25689] Fps is (10 sec: 5625.3, 60 sec: 5656.1, 300 sec: 5675.0). Total num frames: 540123136. Throughput: 0: 5985.6. Samples: 540124302. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:34,307][25689] Avg episode reward: [(0, '-28.496')] [2022-07-10 02:28:36,039][26022] Updated weights on worker 0-0, policy_version 527473 (0.00087) [2022-07-10 02:28:37,714][26022] Updated weights on worker 0-0, policy_version 527483 (0.00092) [2022-07-10 02:28:39,323][25689] Fps is (10 sec: 5822.9, 60 sec: 5689.7, 300 sec: 5672.4). Total num frames: 540151808. Throughput: 0: 5995.9. Samples: 540158664. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:39,323][25689] Avg episode reward: [(0, '-28.989')] [2022-07-10 02:28:39,562][26022] Updated weights on worker 0-0, policy_version 527493 (0.00094) [2022-07-10 02:28:41,495][26022] Updated weights on worker 0-0, policy_version 527503 (0.00085) [2022-07-10 02:28:43,016][26022] Updated weights on worker 0-0, policy_version 527513 (0.00087) [2022-07-10 02:28:44,393][25689] Fps is (10 sec: 5583.6, 60 sec: 5643.2, 300 sec: 5668.5). Total num frames: 540179456. Throughput: 0: 5123.3. Samples: 540175858. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:44,394][25689] Avg episode reward: [(0, '-28.827')] [2022-07-10 02:28:44,917][26022] Updated weights on worker 0-0, policy_version 527523 (0.00088) [2022-07-10 02:28:46,700][26022] Updated weights on worker 0-0, policy_version 527533 (0.00085) [2022-07-10 02:28:48,474][26022] Updated weights on worker 0-0, policy_version 527543 (0.00087) [2022-07-10 02:28:49,419][25689] Fps is (10 sec: 5679.7, 60 sec: 5681.5, 300 sec: 5672.8). Total num frames: 540209152. Throughput: 0: 5967.5. Samples: 540210224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:49,420][25689] Avg episode reward: [(0, '-28.765')] [2022-07-10 02:28:50,388][26022] Updated weights on worker 0-0, policy_version 527553 (0.00085) [2022-07-10 02:28:52,119][26022] Updated weights on worker 0-0, policy_version 527563 (0.00088) [2022-07-10 02:28:53,931][26022] Updated weights on worker 0-0, policy_version 527573 (0.00088) [2022-07-10 02:28:54,458][25689] Fps is (10 sec: 5697.4, 60 sec: 5662.6, 300 sec: 5665.4). Total num frames: 540236800. Throughput: 0: 5948.9. Samples: 540244182. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:54,459][25689] Avg episode reward: [(0, '-29.237')] [2022-07-10 02:28:55,589][26022] Updated weights on worker 0-0, policy_version 527583 (0.00087) [2022-07-10 02:28:57,456][26022] Updated weights on worker 0-0, policy_version 527593 (0.00090) [2022-07-10 02:28:59,208][26022] Updated weights on worker 0-0, policy_version 527603 (0.00081) [2022-07-10 02:28:59,465][25689] Fps is (10 sec: 5810.4, 60 sec: 5696.6, 300 sec: 5681.3). Total num frames: 540267520. Throughput: 0: 5096.6. Samples: 540261318. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:28:59,465][25689] Avg episode reward: [(0, '-29.828')] [2022-07-10 02:29:01,259][26022] Updated weights on worker 0-0, policy_version 527613 (0.00092) [2022-07-10 02:29:03,159][26022] Updated weights on worker 0-0, policy_version 527623 (0.00085) [2022-07-10 02:29:04,550][25689] Fps is (10 sec: 5581.0, 60 sec: 5700.3, 300 sec: 5666.4). Total num frames: 540293120. Throughput: 0: 5853.5. Samples: 540293844. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:04,550][25689] Avg episode reward: [(0, '-30.191')] [2022-07-10 02:29:05,081][26022] Updated weights on worker 0-0, policy_version 527633 (0.00086) [2022-07-10 02:29:06,731][26022] Updated weights on worker 0-0, policy_version 527643 (0.00103) [2022-07-10 02:29:08,655][26022] Updated weights on worker 0-0, policy_version 527653 (0.00085) [2022-07-10 02:29:09,649][25689] Fps is (10 sec: 5329.2, 60 sec: 5659.7, 300 sec: 5668.6). Total num frames: 540321792. Throughput: 0: 5827.2. Samples: 540328104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:09,650][25689] Avg episode reward: [(0, '-30.593')] [2022-07-10 02:29:10,283][26022] Updated weights on worker 0-0, policy_version 527663 (0.00089) [2022-07-10 02:29:12,199][26022] Updated weights on worker 0-0, policy_version 527673 (0.00084) [2022-07-10 02:29:14,039][26022] Updated weights on worker 0-0, policy_version 527683 (0.00086) [2022-07-10 02:29:14,657][25689] Fps is (10 sec: 5673.5, 60 sec: 5676.2, 300 sec: 5668.5). Total num frames: 540350464. Throughput: 0: 4987.7. Samples: 540344928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:14,658][25689] Avg episode reward: [(0, '-30.658')] [2022-07-10 02:29:15,918][26022] Updated weights on worker 0-0, policy_version 527693 (0.00088) [2022-07-10 02:29:17,556][26022] Updated weights on worker 0-0, policy_version 527703 (0.00089) [2022-07-10 02:29:19,532][26022] Updated weights on worker 0-0, policy_version 527713 (0.00095) [2022-07-10 02:29:19,692][25689] Fps is (10 sec: 5709.7, 60 sec: 5659.0, 300 sec: 5665.2). Total num frames: 540379136. Throughput: 0: 5806.8. Samples: 540378772. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:19,693][25689] Avg episode reward: [(0, '-30.848')] [2022-07-10 02:29:21,272][26022] Updated weights on worker 0-0, policy_version 527723 (0.00092) [2022-07-10 02:29:23,225][26022] Updated weights on worker 0-0, policy_version 527733 (0.00093) [2022-07-10 02:29:24,793][25689] Fps is (10 sec: 5657.8, 60 sec: 5636.4, 300 sec: 5670.2). Total num frames: 540407808. Throughput: 0: 5863.1. Samples: 540412530. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:24,794][25689] Avg episode reward: [(0, '-30.905')] [2022-07-10 02:29:24,906][26022] Updated weights on worker 0-0, policy_version 527743 (0.00089) [2022-07-10 02:29:26,659][26022] Updated weights on worker 0-0, policy_version 527753 (0.00055) [2022-07-10 02:29:28,455][26022] Updated weights on worker 0-0, policy_version 527763 (0.00085) [2022-07-10 02:29:29,814][25689] Fps is (10 sec: 5665.0, 60 sec: 5668.7, 300 sec: 5666.4). Total num frames: 540436480. Throughput: 0: 5030.0. Samples: 540429538. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:29,815][25689] Avg episode reward: [(0, '-30.724')] [2022-07-10 02:29:30,554][26022] Updated weights on worker 0-0, policy_version 527773 (0.00088) [2022-07-10 02:29:32,087][26022] Updated weights on worker 0-0, policy_version 527783 (0.00085) [2022-07-10 02:29:34,046][26022] Updated weights on worker 0-0, policy_version 527793 (0.00080) [2022-07-10 02:29:34,834][25689] Fps is (10 sec: 5608.8, 60 sec: 5633.7, 300 sec: 5659.4). Total num frames: 540464128. Throughput: 0: 5882.8. Samples: 540463624. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:34,834][25689] Avg episode reward: [(0, '-30.673')] [2022-07-10 02:29:35,465][26022] Updated weights on worker 0-0, policy_version 527803 (0.00085) [2022-07-10 02:29:37,696][26022] Updated weights on worker 0-0, policy_version 527813 (0.00092) [2022-07-10 02:29:39,158][26022] Updated weights on worker 0-0, policy_version 527823 (0.00083) [2022-07-10 02:29:39,868][25689] Fps is (10 sec: 5602.1, 60 sec: 5632.1, 300 sec: 5663.6). Total num frames: 540492800. Throughput: 0: 5896.6. Samples: 540497742. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:39,868][25689] Avg episode reward: [(0, '-29.978')] [2022-07-10 02:29:41,178][26022] Updated weights on worker 0-0, policy_version 527833 (0.00090) [2022-07-10 02:29:42,917][26022] Updated weights on worker 0-0, policy_version 527843 (0.00094) [2022-07-10 02:29:44,969][25689] Fps is (10 sec: 5556.8, 60 sec: 5629.2, 300 sec: 5658.9). Total num frames: 540520448. Throughput: 0: 5065.8. Samples: 540514742. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:44,970][25689] Avg episode reward: [(0, '-30.464')] [2022-07-10 02:29:45,155][26022] Updated weights on worker 0-0, policy_version 527853 (0.00085) [2022-07-10 02:29:46,453][26022] Updated weights on worker 0-0, policy_version 527863 (0.00106) [2022-07-10 02:29:48,610][26022] Updated weights on worker 0-0, policy_version 527873 (0.00092) [2022-07-10 02:29:50,020][25689] Fps is (10 sec: 5749.3, 60 sec: 5643.8, 300 sec: 5669.5). Total num frames: 540551168. Throughput: 0: 5895.2. Samples: 540548654. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:50,020][25689] Avg episode reward: [(0, '-31.292')] [2022-07-10 02:29:50,093][26022] Updated weights on worker 0-0, policy_version 527883 (0.00096) [2022-07-10 02:29:52,028][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:29:52,049][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000527892_540561408.pth [2022-07-10 02:29:52,050][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000525900_538521600.pth [2022-07-10 02:29:52,162][26022] Updated weights on worker 0-0, policy_version 527893 (0.00092) [2022-07-10 02:29:54,132][26022] Updated weights on worker 0-0, policy_version 527903 (0.00093) [2022-07-10 02:29:55,023][25689] Fps is (10 sec: 5703.8, 60 sec: 5630.2, 300 sec: 5659.8). Total num frames: 540577792. Throughput: 0: 5900.9. Samples: 540582758. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:29:55,023][25689] Avg episode reward: [(0, '-31.296')] [2022-07-10 02:29:55,591][26022] Updated weights on worker 0-0, policy_version 527913 (0.00094) [2022-07-10 02:29:57,522][26022] Updated weights on worker 0-0, policy_version 527923 (0.00102) [2022-07-10 02:29:59,247][26022] Updated weights on worker 0-0, policy_version 527933 (0.00086) [2022-07-10 02:30:00,026][25689] Fps is (10 sec: 5526.2, 60 sec: 5596.7, 300 sec: 5668.2). Total num frames: 540606464. Throughput: 0: 5921.8. Samples: 540617116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 02:30:00,027][25689] Avg episode reward: [(0, '-30.270')] [2022-07-10 02:30:01,026][26022] Updated weights on worker 0-0, policy_version 527943 (0.00089) [2022-07-10 02:30:03,279][26022] Updated weights on worker 0-0, policy_version 527953 (0.00081) [2022-07-10 02:30:05,017][26022] Updated weights on worker 0-0, policy_version 527963 (0.00084) [2022-07-10 02:30:05,075][25689] Fps is (10 sec: 5603.0, 60 sec: 5633.9, 300 sec: 5664.9). Total num frames: 540634112. Throughput: 0: 5825.7. Samples: 540631870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:05,075][25689] Avg episode reward: [(0, '-29.705')] [2022-07-10 02:30:06,761][26022] Updated weights on worker 0-0, policy_version 527973 (0.00104) [2022-07-10 02:30:08,707][26022] Updated weights on worker 0-0, policy_version 527983 (0.00080) [2022-07-10 02:30:10,120][25689] Fps is (10 sec: 5680.8, 60 sec: 5655.8, 300 sec: 5665.2). Total num frames: 540663808. Throughput: 0: 5866.2. Samples: 540666568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:10,121][25689] Avg episode reward: [(0, '-29.623')] [2022-07-10 02:30:10,202][26022] Updated weights on worker 0-0, policy_version 527993 (0.00088) [2022-07-10 02:30:12,197][26022] Updated weights on worker 0-0, policy_version 528003 (0.00089) [2022-07-10 02:30:13,838][26022] Updated weights on worker 0-0, policy_version 528013 (0.00090) [2022-07-10 02:30:15,151][25689] Fps is (10 sec: 5691.1, 60 sec: 5636.8, 300 sec: 5664.9). Total num frames: 540691456. Throughput: 0: 5884.1. Samples: 540701192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:15,152][25689] Avg episode reward: [(0, '-28.843')] [2022-07-10 02:30:15,700][26022] Updated weights on worker 0-0, policy_version 528023 (0.00086) [2022-07-10 02:30:17,570][26022] Updated weights on worker 0-0, policy_version 528033 (0.00093) [2022-07-10 02:30:19,190][26022] Updated weights on worker 0-0, policy_version 528043 (0.00086) [2022-07-10 02:30:20,172][25689] Fps is (10 sec: 5602.9, 60 sec: 5638.0, 300 sec: 5662.0). Total num frames: 540720128. Throughput: 0: 5021.5. Samples: 540718280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:20,173][25689] Avg episode reward: [(0, '-28.684')] [2022-07-10 02:30:21,141][26022] Updated weights on worker 0-0, policy_version 528053 (0.00087) [2022-07-10 02:30:22,777][26022] Updated weights on worker 0-0, policy_version 528063 (0.00085) [2022-07-10 02:30:24,804][26022] Updated weights on worker 0-0, policy_version 528073 (0.00083) [2022-07-10 02:30:25,253][25689] Fps is (10 sec: 5777.6, 60 sec: 5656.8, 300 sec: 5668.1). Total num frames: 540749824. Throughput: 0: 5984.7. Samples: 540752632. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:25,254][25689] Avg episode reward: [(0, '-29.384')] [2022-07-10 02:30:26,512][26022] Updated weights on worker 0-0, policy_version 528083 (0.00093) [2022-07-10 02:30:28,381][26022] Updated weights on worker 0-0, policy_version 528093 (0.00092) [2022-07-10 02:30:30,074][26022] Updated weights on worker 0-0, policy_version 528103 (0.00609) [2022-07-10 02:30:30,292][25689] Fps is (10 sec: 5767.8, 60 sec: 5655.2, 300 sec: 5664.8). Total num frames: 540778496. Throughput: 0: 5947.4. Samples: 540786536. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:30,293][25689] Avg episode reward: [(0, '-30.324')] [2022-07-10 02:30:32,092][26022] Updated weights on worker 0-0, policy_version 528113 (0.00085) [2022-07-10 02:30:33,764][26022] Updated weights on worker 0-0, policy_version 528123 (0.00088) [2022-07-10 02:30:35,303][25689] Fps is (10 sec: 5502.3, 60 sec: 5639.1, 300 sec: 5658.0). Total num frames: 540805120. Throughput: 0: 5081.1. Samples: 540803586. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:35,303][25689] Avg episode reward: [(0, '-30.476')] [2022-07-10 02:30:35,629][26022] Updated weights on worker 0-0, policy_version 528133 (0.00094) [2022-07-10 02:30:37,395][26022] Updated weights on worker 0-0, policy_version 528143 (0.00089) [2022-07-10 02:30:39,130][26022] Updated weights on worker 0-0, policy_version 528153 (0.00088) [2022-07-10 02:30:40,313][25689] Fps is (10 sec: 5722.5, 60 sec: 5675.2, 300 sec: 5667.0). Total num frames: 540835840. Throughput: 0: 5939.7. Samples: 540837908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:40,315][25689] Avg episode reward: [(0, '-31.635')] [2022-07-10 02:30:40,839][26022] Updated weights on worker 0-0, policy_version 528163 (0.00080) [2022-07-10 02:30:42,890][26022] Updated weights on worker 0-0, policy_version 528173 (0.00101) [2022-07-10 02:30:44,454][26022] Updated weights on worker 0-0, policy_version 528183 (0.00084) [2022-07-10 02:30:45,439][25689] Fps is (10 sec: 5758.5, 60 sec: 5673.0, 300 sec: 5658.3). Total num frames: 540863488. Throughput: 0: 5930.1. Samples: 540872334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:45,439][25689] Avg episode reward: [(0, '-31.429')] [2022-07-10 02:30:46,432][26022] Updated weights on worker 0-0, policy_version 528194 (0.00085) [2022-07-10 02:30:48,353][26022] Updated weights on worker 0-0, policy_version 528204 (0.00088) [2022-07-10 02:30:50,066][26022] Updated weights on worker 0-0, policy_version 528214 (0.00088) [2022-07-10 02:30:50,479][25689] Fps is (10 sec: 5540.2, 60 sec: 5640.1, 300 sec: 5657.9). Total num frames: 540892160. Throughput: 0: 5096.9. Samples: 540889426. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:50,479][25689] Avg episode reward: [(0, '-30.428')] [2022-07-10 02:30:51,865][26022] Updated weights on worker 0-0, policy_version 528224 (0.00087) [2022-07-10 02:30:53,687][26022] Updated weights on worker 0-0, policy_version 528234 (0.00088) [2022-07-10 02:30:55,480][25689] Fps is (10 sec: 5710.9, 60 sec: 5674.1, 300 sec: 5659.5). Total num frames: 540920832. Throughput: 0: 5946.7. Samples: 540923572. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:30:55,480][25689] Avg episode reward: [(0, '-31.532')] [2022-07-10 02:30:55,537][26022] Updated weights on worker 0-0, policy_version 528244 (0.00093) [2022-07-10 02:30:57,313][26022] Updated weights on worker 0-0, policy_version 528254 (0.00085) [2022-07-10 02:30:59,093][26022] Updated weights on worker 0-0, policy_version 528264 (0.00098) [2022-07-10 02:31:00,495][25689] Fps is (10 sec: 5827.5, 60 sec: 5690.0, 300 sec: 5671.9). Total num frames: 540950528. Throughput: 0: 5940.5. Samples: 540957798. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:00,495][25689] Avg episode reward: [(0, '-30.140')] [2022-07-10 02:31:00,837][26022] Updated weights on worker 0-0, policy_version 528274 (0.00093) [2022-07-10 02:31:03,111][26022] Updated weights on worker 0-0, policy_version 528284 (0.00086) [2022-07-10 02:31:04,955][26022] Updated weights on worker 0-0, policy_version 528294 (0.00087) [2022-07-10 02:31:05,530][25689] Fps is (10 sec: 5502.1, 60 sec: 5657.4, 300 sec: 5662.3). Total num frames: 540976128. Throughput: 0: 4994.1. Samples: 540972670. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:05,530][25689] Avg episode reward: [(0, '-28.971')] [2022-07-10 02:31:06,634][26022] Updated weights on worker 0-0, policy_version 528304 (0.00086) [2022-07-10 02:31:08,473][26022] Updated weights on worker 0-0, policy_version 528314 (0.00090) [2022-07-10 02:31:10,150][26022] Updated weights on worker 0-0, policy_version 528324 (0.00086) [2022-07-10 02:31:10,600][25689] Fps is (10 sec: 5471.8, 60 sec: 5655.0, 300 sec: 5661.3). Total num frames: 541005824. Throughput: 0: 5832.2. Samples: 541006778. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:10,601][25689] Avg episode reward: [(0, '-29.048')] [2022-07-10 02:31:12,270][26022] Updated weights on worker 0-0, policy_version 528334 (0.00089) [2022-07-10 02:31:13,794][26022] Updated weights on worker 0-0, policy_version 528344 (0.00085) [2022-07-10 02:31:15,605][25689] Fps is (10 sec: 5793.1, 60 sec: 5674.3, 300 sec: 5665.2). Total num frames: 541034496. Throughput: 0: 5838.9. Samples: 541041082. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:15,606][25689] Avg episode reward: [(0, '-30.581')] [2022-07-10 02:31:15,606][26022] Updated weights on worker 0-0, policy_version 528354 (0.00093) [2022-07-10 02:31:17,492][26022] Updated weights on worker 0-0, policy_version 528364 (0.00086) [2022-07-10 02:31:19,080][26022] Updated weights on worker 0-0, policy_version 528374 (0.00088) [2022-07-10 02:31:20,631][25689] Fps is (10 sec: 5614.9, 60 sec: 5657.1, 300 sec: 5659.4). Total num frames: 541062144. Throughput: 0: 4977.9. Samples: 541058034. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:20,631][25689] Avg episode reward: [(0, '-30.237')] [2022-07-10 02:31:21,256][26022] Updated weights on worker 0-0, policy_version 528384 (0.00081) [2022-07-10 02:31:22,835][26022] Updated weights on worker 0-0, policy_version 528394 (0.00098) [2022-07-10 02:31:24,726][26022] Updated weights on worker 0-0, policy_version 528404 (0.00088) [2022-07-10 02:31:25,685][25689] Fps is (10 sec: 5689.0, 60 sec: 5659.5, 300 sec: 5658.5). Total num frames: 541091840. Throughput: 0: 5933.0. Samples: 541092252. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:25,685][25689] Avg episode reward: [(0, '-31.035')] [2022-07-10 02:31:26,451][26022] Updated weights on worker 0-0, policy_version 528414 (0.00094) [2022-07-10 02:31:28,201][26022] Updated weights on worker 0-0, policy_version 528424 (0.00083) [2022-07-10 02:31:30,148][26022] Updated weights on worker 0-0, policy_version 528434 (0.00088) [2022-07-10 02:31:30,774][25689] Fps is (10 sec: 5653.2, 60 sec: 5637.9, 300 sec: 5653.9). Total num frames: 541119488. Throughput: 0: 5941.1. Samples: 541126634. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:30,775][25689] Avg episode reward: [(0, '-31.497')] [2022-07-10 02:31:31,750][26022] Updated weights on worker 0-0, policy_version 528444 (0.00086) [2022-07-10 02:31:33,712][26022] Updated weights on worker 0-0, policy_version 528454 (0.00088) [2022-07-10 02:31:35,280][26022] Updated weights on worker 0-0, policy_version 528464 (0.00095) [2022-07-10 02:31:35,781][25689] Fps is (10 sec: 5679.8, 60 sec: 5689.0, 300 sec: 5661.3). Total num frames: 541149184. Throughput: 0: 5085.9. Samples: 541143694. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:35,782][25689] Avg episode reward: [(0, '-32.348')] [2022-07-10 02:31:37,341][26022] Updated weights on worker 0-0, policy_version 528474 (0.00083) [2022-07-10 02:31:38,875][26022] Updated weights on worker 0-0, policy_version 528484 (0.00086) [2022-07-10 02:31:40,831][25689] Fps is (10 sec: 5804.0, 60 sec: 5651.5, 300 sec: 5661.0). Total num frames: 541177856. Throughput: 0: 5937.2. Samples: 541177966. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:40,831][25689] Avg episode reward: [(0, '-31.071')] [2022-07-10 02:31:40,835][26022] Updated weights on worker 0-0, policy_version 528494 (0.00094) [2022-07-10 02:31:42,560][26022] Updated weights on worker 0-0, policy_version 528504 (0.00090) [2022-07-10 02:31:44,491][26022] Updated weights on worker 0-0, policy_version 528514 (0.00087) [2022-07-10 02:31:45,930][25689] Fps is (10 sec: 5751.5, 60 sec: 5687.9, 300 sec: 5659.5). Total num frames: 541207552. Throughput: 0: 5934.5. Samples: 541212394. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:45,930][25689] Avg episode reward: [(0, '-30.008')] [2022-07-10 02:31:46,088][26022] Updated weights on worker 0-0, policy_version 528524 (0.00094) [2022-07-10 02:31:48,141][26022] Updated weights on worker 0-0, policy_version 528534 (0.00095) [2022-07-10 02:31:49,662][26022] Updated weights on worker 0-0, policy_version 528544 (0.00089) [2022-07-10 02:31:50,976][25689] Fps is (10 sec: 5551.2, 60 sec: 5653.4, 300 sec: 5655.4). Total num frames: 541234176. Throughput: 0: 5084.1. Samples: 541229344. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:50,978][25689] Avg episode reward: [(0, '-30.258')] [2022-07-10 02:31:51,799][26022] Updated weights on worker 0-0, policy_version 528554 (0.00090) [2022-07-10 02:31:52,136][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:31:52,145][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000528557_541242368.pth [2022-07-10 02:31:52,146][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000526565_539202560.pth [2022-07-10 02:31:53,346][26022] Updated weights on worker 0-0, policy_version 528564 (0.00092) [2022-07-10 02:31:55,309][26022] Updated weights on worker 0-0, policy_version 528574 (0.00103) [2022-07-10 02:31:55,982][25689] Fps is (10 sec: 5704.4, 60 sec: 5686.8, 300 sec: 5659.1). Total num frames: 541264896. Throughput: 0: 5915.2. Samples: 541263186. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:31:55,984][25689] Avg episode reward: [(0, '-29.239')] [2022-07-10 02:31:57,196][26022] Updated weights on worker 0-0, policy_version 528584 (0.00094) [2022-07-10 02:31:58,886][26022] Updated weights on worker 0-0, policy_version 528594 (0.00084) [2022-07-10 02:32:00,668][26022] Updated weights on worker 0-0, policy_version 528604 (0.00089) [2022-07-10 02:32:01,003][25689] Fps is (10 sec: 5719.4, 60 sec: 5635.5, 300 sec: 5653.9). Total num frames: 541291520. Throughput: 0: 5915.3. Samples: 541297288. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:01,003][25689] Avg episode reward: [(0, '-29.318')] [2022-07-10 02:32:02,928][26022] Updated weights on worker 0-0, policy_version 528614 (0.00086) [2022-07-10 02:32:04,636][26022] Updated weights on worker 0-0, policy_version 528624 (0.00086) [2022-07-10 02:32:06,123][25689] Fps is (10 sec: 5351.6, 60 sec: 5661.3, 300 sec: 5656.2). Total num frames: 541319168. Throughput: 0: 4942.7. Samples: 541312206. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:06,124][25689] Avg episode reward: [(0, '-30.003')] [2022-07-10 02:32:06,430][26022] Updated weights on worker 0-0, policy_version 528634 (0.00088) [2022-07-10 02:32:08,383][26022] Updated weights on worker 0-0, policy_version 528644 (0.00091) [2022-07-10 02:32:09,926][26022] Updated weights on worker 0-0, policy_version 528654 (0.00082) [2022-07-10 02:32:11,178][25689] Fps is (10 sec: 5635.8, 60 sec: 5662.8, 300 sec: 5662.7). Total num frames: 541348864. Throughput: 0: 5808.9. Samples: 541346690. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:11,178][25689] Avg episode reward: [(0, '-30.656')] [2022-07-10 02:32:11,978][26022] Updated weights on worker 0-0, policy_version 528664 (0.00085) [2022-07-10 02:32:13,578][26022] Updated weights on worker 0-0, policy_version 528674 (0.00086) [2022-07-10 02:32:15,468][26022] Updated weights on worker 0-0, policy_version 528684 (0.00084) [2022-07-10 02:32:16,250][25689] Fps is (10 sec: 5662.7, 60 sec: 5639.7, 300 sec: 5651.3). Total num frames: 541376512. Throughput: 0: 5810.4. Samples: 541380950. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:16,251][25689] Avg episode reward: [(0, '-29.821')] [2022-07-10 02:32:17,031][26022] Updated weights on worker 0-0, policy_version 528694 (0.00079) [2022-07-10 02:32:19,004][26022] Updated weights on worker 0-0, policy_version 528704 (0.00085) [2022-07-10 02:32:20,746][26022] Updated weights on worker 0-0, policy_version 528714 (0.00092) [2022-07-10 02:32:21,255][25689] Fps is (10 sec: 5588.8, 60 sec: 5658.5, 300 sec: 5655.6). Total num frames: 541405184. Throughput: 0: 4968.3. Samples: 541397906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:21,255][25689] Avg episode reward: [(0, '-29.663')] [2022-07-10 02:32:22,718][26022] Updated weights on worker 0-0, policy_version 528724 (0.00083) [2022-07-10 02:32:24,322][26022] Updated weights on worker 0-0, policy_version 528734 (0.00085) [2022-07-10 02:32:26,369][25689] Fps is (10 sec: 5565.8, 60 sec: 5619.2, 300 sec: 5650.9). Total num frames: 541432832. Throughput: 0: 5910.3. Samples: 541431864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:26,369][25689] Avg episode reward: [(0, '-29.771')] [2022-07-10 02:32:26,453][26022] Updated weights on worker 0-0, policy_version 528744 (0.00093) [2022-07-10 02:32:27,899][26022] Updated weights on worker 0-0, policy_version 528754 (0.00091) [2022-07-10 02:32:29,979][26022] Updated weights on worker 0-0, policy_version 528764 (0.00086) [2022-07-10 02:32:31,396][25689] Fps is (10 sec: 5755.8, 60 sec: 5675.6, 300 sec: 5654.2). Total num frames: 541463552. Throughput: 0: 5898.5. Samples: 541465946. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:31,396][25689] Avg episode reward: [(0, '-28.661')] [2022-07-10 02:32:31,582][26022] Updated weights on worker 0-0, policy_version 528774 (0.00086) [2022-07-10 02:32:33,542][26022] Updated weights on worker 0-0, policy_version 528784 (0.00090) [2022-07-10 02:32:35,300][26022] Updated weights on worker 0-0, policy_version 528794 (0.00088) [2022-07-10 02:32:36,424][25689] Fps is (10 sec: 5804.7, 60 sec: 5639.8, 300 sec: 5657.3). Total num frames: 541491200. Throughput: 0: 5910.7. Samples: 541500194. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:36,425][25689] Avg episode reward: [(0, '-27.683')] [2022-07-10 02:32:37,101][26022] Updated weights on worker 0-0, policy_version 528804 (0.00086) [2022-07-10 02:32:38,751][26022] Updated weights on worker 0-0, policy_version 528814 (0.00082) [2022-07-10 02:32:40,911][26022] Updated weights on worker 0-0, policy_version 528824 (0.00096) [2022-07-10 02:32:41,475][25689] Fps is (10 sec: 5486.0, 60 sec: 5622.8, 300 sec: 5648.2). Total num frames: 541518848. Throughput: 0: 5912.7. Samples: 541517462. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:41,476][25689] Avg episode reward: [(0, '-26.991')] [2022-07-10 02:32:42,271][26022] Updated weights on worker 0-0, policy_version 528834 (0.00088) [2022-07-10 02:32:44,474][26022] Updated weights on worker 0-0, policy_version 528844 (0.00088) [2022-07-10 02:32:45,806][26022] Updated weights on worker 0-0, policy_version 528854 (0.00086) [2022-07-10 02:32:46,563][25689] Fps is (10 sec: 5656.0, 60 sec: 5623.9, 300 sec: 5654.8). Total num frames: 541548544. Throughput: 0: 5923.5. Samples: 541551482. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:46,563][25689] Avg episode reward: [(0, '-27.544')] [2022-07-10 02:32:48,024][26022] Updated weights on worker 0-0, policy_version 528864 (0.00088) [2022-07-10 02:32:49,609][26022] Updated weights on worker 0-0, policy_version 528874 (0.00089) [2022-07-10 02:32:51,496][26022] Updated weights on worker 0-0, policy_version 528884 (0.00090) [2022-07-10 02:32:51,570][25689] Fps is (10 sec: 5782.0, 60 sec: 5661.3, 300 sec: 5655.0). Total num frames: 541577216. Throughput: 0: 5931.7. Samples: 541585614. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:51,570][25689] Avg episode reward: [(0, '-27.402')] [2022-07-10 02:32:53,260][26022] Updated weights on worker 0-0, policy_version 528894 (0.00085) [2022-07-10 02:32:55,080][26022] Updated weights on worker 0-0, policy_version 528904 (0.00089) [2022-07-10 02:32:56,587][25689] Fps is (10 sec: 5720.3, 60 sec: 5626.4, 300 sec: 5654.8). Total num frames: 541605888. Throughput: 0: 5087.2. Samples: 541602768. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 02:32:56,589][25689] Avg episode reward: [(0, '-26.152')] [2022-07-10 02:32:56,823][26022] Updated weights on worker 0-0, policy_version 528914 (0.00121) [2022-07-10 02:32:58,851][26022] Updated weights on worker 0-0, policy_version 528924 (0.00089) [2022-07-10 02:33:00,442][26022] Updated weights on worker 0-0, policy_version 528934 (0.00091) [2022-07-10 02:33:01,602][25689] Fps is (10 sec: 5614.1, 60 sec: 5643.9, 300 sec: 5663.8). Total num frames: 541633536. Throughput: 0: 5931.0. Samples: 541636832. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:01,602][25689] Avg episode reward: [(0, '-27.337')] [2022-07-10 02:33:02,772][26022] Updated weights on worker 0-0, policy_version 528944 (0.00090) [2022-07-10 02:33:04,598][26022] Updated weights on worker 0-0, policy_version 528954 (0.00371) [2022-07-10 02:33:06,400][26022] Updated weights on worker 0-0, policy_version 528964 (0.00084) [2022-07-10 02:33:06,739][25689] Fps is (10 sec: 5346.0, 60 sec: 5625.4, 300 sec: 5648.0). Total num frames: 541660160. Throughput: 0: 5812.1. Samples: 541668750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:06,741][25689] Avg episode reward: [(0, '-28.031')] [2022-07-10 02:33:07,960][26022] Updated weights on worker 0-0, policy_version 528974 (0.00083) [2022-07-10 02:33:09,972][26022] Updated weights on worker 0-0, policy_version 528984 (0.00098) [2022-07-10 02:33:11,559][26022] Updated weights on worker 0-0, policy_version 528994 (0.00085) [2022-07-10 02:33:11,753][25689] Fps is (10 sec: 5547.9, 60 sec: 5629.2, 300 sec: 5654.6). Total num frames: 541689856. Throughput: 0: 4973.4. Samples: 541685994. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:11,755][25689] Avg episode reward: [(0, '-28.231')] [2022-07-10 02:33:13,633][26022] Updated weights on worker 0-0, policy_version 529004 (0.00086) [2022-07-10 02:33:15,262][26022] Updated weights on worker 0-0, policy_version 529014 (0.00080) [2022-07-10 02:33:16,771][25689] Fps is (10 sec: 5818.5, 60 sec: 5651.2, 300 sec: 5651.5). Total num frames: 541718528. Throughput: 0: 5822.0. Samples: 541720276. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:16,772][25689] Avg episode reward: [(0, '-29.154')] [2022-07-10 02:33:17,087][26022] Updated weights on worker 0-0, policy_version 529024 (0.00108) [2022-07-10 02:33:18,987][26022] Updated weights on worker 0-0, policy_version 529034 (0.00090) [2022-07-10 02:33:20,749][26022] Updated weights on worker 0-0, policy_version 529044 (0.00085) [2022-07-10 02:33:21,801][25689] Fps is (10 sec: 5707.0, 60 sec: 5648.8, 300 sec: 5648.2). Total num frames: 541747200. Throughput: 0: 5821.0. Samples: 541754414. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:21,802][25689] Avg episode reward: [(0, '-29.134')] [2022-07-10 02:33:22,524][26022] Updated weights on worker 0-0, policy_version 529054 (0.00091) [2022-07-10 02:33:24,310][26022] Updated weights on worker 0-0, policy_version 529064 (0.00094) [2022-07-10 02:33:25,996][26022] Updated weights on worker 0-0, policy_version 529074 (0.00093) [2022-07-10 02:33:26,886][25689] Fps is (10 sec: 5568.1, 60 sec: 5651.6, 300 sec: 5650.1). Total num frames: 541774848. Throughput: 0: 5103.3. Samples: 541771562. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:26,886][25689] Avg episode reward: [(0, '-30.670')] [2022-07-10 02:33:27,962][26022] Updated weights on worker 0-0, policy_version 529084 (0.00087) [2022-07-10 02:33:29,598][26022] Updated weights on worker 0-0, policy_version 529094 (0.00084) [2022-07-10 02:33:31,567][26022] Updated weights on worker 0-0, policy_version 529104 (0.00080) [2022-07-10 02:33:31,891][25689] Fps is (10 sec: 5683.2, 60 sec: 5636.6, 300 sec: 5650.2). Total num frames: 541804544. Throughput: 0: 5908.5. Samples: 541804980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:31,892][25689] Avg episode reward: [(0, '-30.599')] [2022-07-10 02:33:33,388][26022] Updated weights on worker 0-0, policy_version 529114 (0.00089) [2022-07-10 02:33:35,167][26022] Updated weights on worker 0-0, policy_version 529124 (0.01295) [2022-07-10 02:33:36,947][25689] Fps is (10 sec: 5699.4, 60 sec: 5634.1, 300 sec: 5646.0). Total num frames: 541832192. Throughput: 0: 5887.1. Samples: 541839056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:36,947][25689] Avg episode reward: [(0, '-30.282')] [2022-07-10 02:33:37,129][26022] Updated weights on worker 0-0, policy_version 529134 (0.00093) [2022-07-10 02:33:38,843][26022] Updated weights on worker 0-0, policy_version 529144 (0.00083) [2022-07-10 02:33:40,522][26022] Updated weights on worker 0-0, policy_version 529154 (0.00091) [2022-07-10 02:33:41,966][25689] Fps is (10 sec: 5488.4, 60 sec: 5637.0, 300 sec: 5647.0). Total num frames: 541859840. Throughput: 0: 5049.4. Samples: 541856236. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:41,967][25689] Avg episode reward: [(0, '-30.520')] [2022-07-10 02:33:42,463][26022] Updated weights on worker 0-0, policy_version 529164 (0.00091) [2022-07-10 02:33:44,192][26022] Updated weights on worker 0-0, policy_version 529174 (0.00085) [2022-07-10 02:33:45,981][26022] Updated weights on worker 0-0, policy_version 529184 (0.00086) [2022-07-10 02:33:47,035][25689] Fps is (10 sec: 5582.8, 60 sec: 5621.9, 300 sec: 5642.7). Total num frames: 541888512. Throughput: 0: 5897.7. Samples: 541890398. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:47,036][25689] Avg episode reward: [(0, '-29.654')] [2022-07-10 02:33:47,733][26022] Updated weights on worker 0-0, policy_version 529194 (0.00091) [2022-07-10 02:33:49,615][26022] Updated weights on worker 0-0, policy_version 529204 (0.00087) [2022-07-10 02:33:51,479][26022] Updated weights on worker 0-0, policy_version 529214 (0.00087) [2022-07-10 02:33:52,107][25689] Fps is (10 sec: 5655.1, 60 sec: 5615.9, 300 sec: 5645.5). Total num frames: 541917184. Throughput: 0: 5898.4. Samples: 541924218. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:52,107][25689] Avg episode reward: [(0, '-29.097')] [2022-07-10 02:33:52,245][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:33:52,263][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000529218_541919232.pth [2022-07-10 02:33:52,263][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000527230_539883520.pth [2022-07-10 02:33:53,210][26022] Updated weights on worker 0-0, policy_version 529224 (0.00096) [2022-07-10 02:33:55,031][26022] Updated weights on worker 0-0, policy_version 529234 (0.00089) [2022-07-10 02:33:56,881][26022] Updated weights on worker 0-0, policy_version 529244 (0.00084) [2022-07-10 02:33:57,138][25689] Fps is (10 sec: 5777.3, 60 sec: 5631.5, 300 sec: 5641.6). Total num frames: 541946880. Throughput: 0: 5075.6. Samples: 541941538. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:33:57,138][25689] Avg episode reward: [(0, '-28.243')] [2022-07-10 02:33:58,478][26022] Updated weights on worker 0-0, policy_version 529254 (0.00084) [2022-07-10 02:34:00,479][26022] Updated weights on worker 0-0, policy_version 529264 (0.00506) [2022-07-10 02:34:02,212][25689] Fps is (10 sec: 5573.5, 60 sec: 5609.1, 300 sec: 5645.3). Total num frames: 541973504. Throughput: 0: 5911.2. Samples: 541975908. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:02,212][25689] Avg episode reward: [(0, '-28.389')] [2022-07-10 02:34:02,484][26022] Updated weights on worker 0-0, policy_version 529274 (0.00088) [2022-07-10 02:34:04,536][26022] Updated weights on worker 0-0, policy_version 529284 (0.00090) [2022-07-10 02:34:06,217][26022] Updated weights on worker 0-0, policy_version 529294 (0.00098) [2022-07-10 02:34:07,275][25689] Fps is (10 sec: 5353.9, 60 sec: 5632.9, 300 sec: 5642.5). Total num frames: 542001152. Throughput: 0: 5787.4. Samples: 542007534. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:07,275][25689] Avg episode reward: [(0, '-28.528')] [2022-07-10 02:34:08,074][26022] Updated weights on worker 0-0, policy_version 529304 (0.00089) [2022-07-10 02:34:09,906][26022] Updated weights on worker 0-0, policy_version 529314 (0.00090) [2022-07-10 02:34:11,636][26022] Updated weights on worker 0-0, policy_version 529324 (0.00085) [2022-07-10 02:34:12,316][25689] Fps is (10 sec: 5776.7, 60 sec: 5647.3, 300 sec: 5648.8). Total num frames: 542031872. Throughput: 0: 4970.4. Samples: 542024664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:12,316][25689] Avg episode reward: [(0, '-28.074')] [2022-07-10 02:34:13,543][26022] Updated weights on worker 0-0, policy_version 529334 (0.00097) [2022-07-10 02:34:15,152][26022] Updated weights on worker 0-0, policy_version 529344 (0.00087) [2022-07-10 02:34:17,158][26022] Updated weights on worker 0-0, policy_version 529354 (0.00406) [2022-07-10 02:34:17,389][25689] Fps is (10 sec: 5771.1, 60 sec: 5625.3, 300 sec: 5644.6). Total num frames: 542059520. Throughput: 0: 5806.2. Samples: 542059118. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:17,389][25689] Avg episode reward: [(0, '-28.819')] [2022-07-10 02:34:18,751][26022] Updated weights on worker 0-0, policy_version 529364 (0.00097) [2022-07-10 02:34:20,678][26022] Updated weights on worker 0-0, policy_version 529374 (0.00085) [2022-07-10 02:34:22,316][26022] Updated weights on worker 0-0, policy_version 529384 (0.00084) [2022-07-10 02:34:22,409][25689] Fps is (10 sec: 5681.5, 60 sec: 5643.1, 300 sec: 5649.6). Total num frames: 542089216. Throughput: 0: 5814.4. Samples: 542093344. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:22,409][25689] Avg episode reward: [(0, '-29.029')] [2022-07-10 02:34:24,231][26022] Updated weights on worker 0-0, policy_version 529394 (0.00083) [2022-07-10 02:34:25,980][26022] Updated weights on worker 0-0, policy_version 529404 (0.00098) [2022-07-10 02:34:27,505][25689] Fps is (10 sec: 5668.3, 60 sec: 5642.0, 300 sec: 5644.7). Total num frames: 542116864. Throughput: 0: 5096.3. Samples: 542110634. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:27,506][25689] Avg episode reward: [(0, '-29.215')] [2022-07-10 02:34:27,885][26022] Updated weights on worker 0-0, policy_version 529414 (0.00096) [2022-07-10 02:34:29,729][26022] Updated weights on worker 0-0, policy_version 529424 (0.00081) [2022-07-10 02:34:31,474][26022] Updated weights on worker 0-0, policy_version 529434 (0.00088) [2022-07-10 02:34:32,516][25689] Fps is (10 sec: 5572.6, 60 sec: 5624.7, 300 sec: 5648.3). Total num frames: 542145536. Throughput: 0: 5930.6. Samples: 542144462. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:32,516][25689] Avg episode reward: [(0, '-29.020')] [2022-07-10 02:34:33,314][26022] Updated weights on worker 0-0, policy_version 529444 (0.00087) [2022-07-10 02:34:35,011][26022] Updated weights on worker 0-0, policy_version 529454 (0.00101) [2022-07-10 02:34:36,871][26022] Updated weights on worker 0-0, policy_version 529464 (0.00092) [2022-07-10 02:34:37,523][25689] Fps is (10 sec: 5724.5, 60 sec: 5646.1, 300 sec: 5648.9). Total num frames: 542174208. Throughput: 0: 5939.0. Samples: 542178692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:37,523][25689] Avg episode reward: [(0, '-29.041')] [2022-07-10 02:34:38,591][26022] Updated weights on worker 0-0, policy_version 529474 (0.00097) [2022-07-10 02:34:40,390][26022] Updated weights on worker 0-0, policy_version 529484 (0.00089) [2022-07-10 02:34:42,360][26022] Updated weights on worker 0-0, policy_version 529494 (0.00086) [2022-07-10 02:34:42,535][25689] Fps is (10 sec: 5723.2, 60 sec: 5663.7, 300 sec: 5654.0). Total num frames: 542202880. Throughput: 0: 5089.2. Samples: 542195770. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:42,536][25689] Avg episode reward: [(0, '-28.694')] [2022-07-10 02:34:44,157][26022] Updated weights on worker 0-0, policy_version 529504 (0.00089) [2022-07-10 02:34:45,874][26022] Updated weights on worker 0-0, policy_version 529514 (0.00093) [2022-07-10 02:34:47,598][25689] Fps is (10 sec: 5590.0, 60 sec: 5647.3, 300 sec: 5643.4). Total num frames: 542230528. Throughput: 0: 5928.7. Samples: 542229756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:47,598][25689] Avg episode reward: [(0, '-27.659')] [2022-07-10 02:34:47,831][26022] Updated weights on worker 0-0, policy_version 529524 (0.00089) [2022-07-10 02:34:49,369][26022] Updated weights on worker 0-0, policy_version 529534 (0.00082) [2022-07-10 02:34:51,314][26022] Updated weights on worker 0-0, policy_version 529544 (0.00082) [2022-07-10 02:34:52,611][25689] Fps is (10 sec: 5589.7, 60 sec: 5652.8, 300 sec: 5650.1). Total num frames: 542259200. Throughput: 0: 5941.3. Samples: 542263852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:52,612][25689] Avg episode reward: [(0, '-27.414')] [2022-07-10 02:34:53,186][26022] Updated weights on worker 0-0, policy_version 529554 (0.00086) [2022-07-10 02:34:54,947][26022] Updated weights on worker 0-0, policy_version 529564 (0.00088) [2022-07-10 02:34:56,773][26022] Updated weights on worker 0-0, policy_version 529574 (0.00088) [2022-07-10 02:34:57,627][25689] Fps is (10 sec: 5717.7, 60 sec: 5637.3, 300 sec: 5649.9). Total num frames: 542287872. Throughput: 0: 5082.1. Samples: 542280864. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:34:57,629][25689] Avg episode reward: [(0, '-27.516')] [2022-07-10 02:34:58,668][26022] Updated weights on worker 0-0, policy_version 529584 (0.00617) [2022-07-10 02:35:00,323][26022] Updated weights on worker 0-0, policy_version 529594 (0.00089) [2022-07-10 02:35:02,630][25689] Fps is (10 sec: 5416.9, 60 sec: 5626.9, 300 sec: 5643.9). Total num frames: 542313472. Throughput: 0: 5920.3. Samples: 542314736. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:02,632][25689] Avg episode reward: [(0, '-27.989')] [2022-07-10 02:35:02,640][26022] Updated weights on worker 0-0, policy_version 529604 (0.00088) [2022-07-10 02:35:04,525][26022] Updated weights on worker 0-0, policy_version 529614 (0.00057) [2022-07-10 02:35:06,251][26022] Updated weights on worker 0-0, policy_version 529624 (0.00115) [2022-07-10 02:35:07,739][25689] Fps is (10 sec: 5468.6, 60 sec: 5656.6, 300 sec: 5642.7). Total num frames: 542343168. Throughput: 0: 5793.6. Samples: 542346444. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:07,740][25689] Avg episode reward: [(0, '-28.467')] [2022-07-10 02:35:08,021][26022] Updated weights on worker 0-0, policy_version 529634 (0.00513) [2022-07-10 02:35:10,039][26022] Updated weights on worker 0-0, policy_version 529644 (0.00095) [2022-07-10 02:35:11,449][26022] Updated weights on worker 0-0, policy_version 529654 (0.00084) [2022-07-10 02:35:12,819][25689] Fps is (10 sec: 5627.8, 60 sec: 5602.1, 300 sec: 5641.7). Total num frames: 542370816. Throughput: 0: 5793.8. Samples: 542380934. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:12,819][25689] Avg episode reward: [(0, '-28.468')] [2022-07-10 02:35:13,483][26022] Updated weights on worker 0-0, policy_version 529664 (0.00080) [2022-07-10 02:35:15,134][26022] Updated weights on worker 0-0, policy_version 529674 (0.00085) [2022-07-10 02:35:17,011][26022] Updated weights on worker 0-0, policy_version 529684 (0.00086) [2022-07-10 02:35:17,874][25689] Fps is (10 sec: 5758.6, 60 sec: 5654.5, 300 sec: 5648.0). Total num frames: 542401536. Throughput: 0: 5795.5. Samples: 542398206. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:17,875][25689] Avg episode reward: [(0, '-29.365')] [2022-07-10 02:35:18,832][26022] Updated weights on worker 0-0, policy_version 529694 (0.00087) [2022-07-10 02:35:20,525][26022] Updated weights on worker 0-0, policy_version 529704 (0.00087) [2022-07-10 02:35:22,450][26022] Updated weights on worker 0-0, policy_version 529714 (0.00094) [2022-07-10 02:35:22,897][25689] Fps is (10 sec: 5791.7, 60 sec: 5620.4, 300 sec: 5642.2). Total num frames: 542429184. Throughput: 0: 5804.4. Samples: 542432376. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:22,897][25689] Avg episode reward: [(0, '-29.378')] [2022-07-10 02:35:24,112][26022] Updated weights on worker 0-0, policy_version 529724 (0.00087) [2022-07-10 02:35:25,976][26022] Updated weights on worker 0-0, policy_version 529734 (0.00089) [2022-07-10 02:35:27,784][26022] Updated weights on worker 0-0, policy_version 529744 (0.00085) [2022-07-10 02:35:27,978][25689] Fps is (10 sec: 5574.4, 60 sec: 5638.8, 300 sec: 5641.4). Total num frames: 542457856. Throughput: 0: 5919.3. Samples: 542466246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:27,978][25689] Avg episode reward: [(0, '-29.366')] [2022-07-10 02:35:29,542][26022] Updated weights on worker 0-0, policy_version 529754 (0.00090) [2022-07-10 02:35:31,516][26022] Updated weights on worker 0-0, policy_version 529764 (0.00091) [2022-07-10 02:35:32,997][25689] Fps is (10 sec: 5677.3, 60 sec: 5637.9, 300 sec: 5648.1). Total num frames: 542486528. Throughput: 0: 5083.7. Samples: 542483516. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:32,998][25689] Avg episode reward: [(0, '-28.091')] [2022-07-10 02:35:33,126][26022] Updated weights on worker 0-0, policy_version 529774 (0.00100) [2022-07-10 02:35:35,034][26022] Updated weights on worker 0-0, policy_version 529784 (0.00089) [2022-07-10 02:35:36,701][26022] Updated weights on worker 0-0, policy_version 529794 (0.00080) [2022-07-10 02:35:38,006][25689] Fps is (10 sec: 5717.9, 60 sec: 5637.7, 300 sec: 5641.2). Total num frames: 542515200. Throughput: 0: 5946.8. Samples: 542517928. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:38,007][25689] Avg episode reward: [(0, '-28.313')] [2022-07-10 02:35:38,620][26022] Updated weights on worker 0-0, policy_version 529804 (0.00090) [2022-07-10 02:35:40,490][26022] Updated weights on worker 0-0, policy_version 529814 (0.00087) [2022-07-10 02:35:42,158][26022] Updated weights on worker 0-0, policy_version 529824 (0.00089) [2022-07-10 02:35:43,016][25689] Fps is (10 sec: 5723.9, 60 sec: 5638.0, 300 sec: 5646.9). Total num frames: 542543872. Throughput: 0: 5959.9. Samples: 542552282. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:43,016][25689] Avg episode reward: [(0, '-28.495')] [2022-07-10 02:35:43,800][26022] Updated weights on worker 0-0, policy_version 529834 (0.00088) [2022-07-10 02:35:45,794][26022] Updated weights on worker 0-0, policy_version 529844 (0.00095) [2022-07-10 02:35:47,332][26022] Updated weights on worker 0-0, policy_version 529854 (0.00084) [2022-07-10 02:35:48,133][25689] Fps is (10 sec: 5662.6, 60 sec: 5649.8, 300 sec: 5645.4). Total num frames: 542572544. Throughput: 0: 5124.0. Samples: 542569524. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:48,134][25689] Avg episode reward: [(0, '-29.350')] [2022-07-10 02:35:49,402][26022] Updated weights on worker 0-0, policy_version 529864 (0.00083) [2022-07-10 02:35:50,936][26022] Updated weights on worker 0-0, policy_version 529874 (0.00090) [2022-07-10 02:35:52,270][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:35:52,294][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000529880_542597120.pth [2022-07-10 02:35:52,295][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000527892_540561408.pth [2022-07-10 02:35:52,932][26022] Updated weights on worker 0-0, policy_version 529884 (0.00091) [2022-07-10 02:35:53,141][25689] Fps is (10 sec: 5865.5, 60 sec: 5684.1, 300 sec: 5652.2). Total num frames: 542603264. Throughput: 0: 5981.5. Samples: 542604008. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:35:53,142][25689] Avg episode reward: [(0, '-29.298')] [2022-07-10 02:35:54,692][26022] Updated weights on worker 0-0, policy_version 529894 (0.00091) [2022-07-10 02:35:56,479][26022] Updated weights on worker 0-0, policy_version 529904 (0.00087) [2022-07-10 02:35:58,148][25689] Fps is (10 sec: 5726.0, 60 sec: 5651.2, 300 sec: 5642.0). Total num frames: 542629888. Throughput: 0: 5972.6. Samples: 542638224. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:35:58,148][25689] Avg episode reward: [(0, '-28.539')] [2022-07-10 02:35:58,239][26022] Updated weights on worker 0-0, policy_version 529914 (0.00082) [2022-07-10 02:36:00,356][26022] Updated weights on worker 0-0, policy_version 529924 (0.00095) [2022-07-10 02:36:02,447][26022] Updated weights on worker 0-0, policy_version 529935 (0.00089) [2022-07-10 02:36:03,186][25689] Fps is (10 sec: 5402.8, 60 sec: 5681.7, 300 sec: 5648.8). Total num frames: 542657536. Throughput: 0: 5096.0. Samples: 542655072. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:03,190][25689] Avg episode reward: [(0, '-30.120')] [2022-07-10 02:36:04,331][26022] Updated weights on worker 0-0, policy_version 529945 (0.00089) [2022-07-10 02:36:06,203][26022] Updated weights on worker 0-0, policy_version 529955 (0.00085) [2022-07-10 02:36:07,797][26022] Updated weights on worker 0-0, policy_version 529965 (0.00085) [2022-07-10 02:36:08,293][25689] Fps is (10 sec: 5551.2, 60 sec: 5664.9, 300 sec: 5644.7). Total num frames: 542686208. Throughput: 0: 5815.5. Samples: 542686766. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:08,294][25689] Avg episode reward: [(0, '-31.164')] [2022-07-10 02:36:09,846][26022] Updated weights on worker 0-0, policy_version 529975 (0.00093) [2022-07-10 02:36:11,590][26022] Updated weights on worker 0-0, policy_version 529985 (0.00092) [2022-07-10 02:36:13,323][25689] Fps is (10 sec: 5556.4, 60 sec: 5669.7, 300 sec: 5640.8). Total num frames: 542713856. Throughput: 0: 5784.3. Samples: 542720742. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:13,323][25689] Avg episode reward: [(0, '-32.127')] [2022-07-10 02:36:13,400][26022] Updated weights on worker 0-0, policy_version 529995 (0.00089) [2022-07-10 02:36:15,178][26022] Updated weights on worker 0-0, policy_version 530005 (0.00086) [2022-07-10 02:36:16,972][26022] Updated weights on worker 0-0, policy_version 530015 (0.00093) [2022-07-10 02:36:18,358][25689] Fps is (10 sec: 5596.1, 60 sec: 5637.8, 300 sec: 5644.0). Total num frames: 542742528. Throughput: 0: 4929.3. Samples: 542737844. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:18,358][25689] Avg episode reward: [(0, '-31.955')] [2022-07-10 02:36:18,890][26022] Updated weights on worker 0-0, policy_version 530025 (0.00098) [2022-07-10 02:36:20,568][26022] Updated weights on worker 0-0, policy_version 530035 (0.00087) [2022-07-10 02:36:22,548][26022] Updated weights on worker 0-0, policy_version 530045 (0.00085) [2022-07-10 02:36:23,376][25689] Fps is (10 sec: 5805.8, 60 sec: 5672.0, 300 sec: 5644.7). Total num frames: 542772224. Throughput: 0: 5793.9. Samples: 542772048. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:23,376][25689] Avg episode reward: [(0, '-32.920')] [2022-07-10 02:36:24,244][26022] Updated weights on worker 0-0, policy_version 530055 (0.00086) [2022-07-10 02:36:25,987][26022] Updated weights on worker 0-0, policy_version 530065 (0.00090) [2022-07-10 02:36:27,831][26022] Updated weights on worker 0-0, policy_version 530075 (0.00087) [2022-07-10 02:36:28,432][25689] Fps is (10 sec: 5691.9, 60 sec: 5657.4, 300 sec: 5645.4). Total num frames: 542799872. Throughput: 0: 5943.7. Samples: 542806466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:28,433][25689] Avg episode reward: [(0, '-32.147')] [2022-07-10 02:36:29,397][26022] Updated weights on worker 0-0, policy_version 530085 (0.00094) [2022-07-10 02:36:31,424][26022] Updated weights on worker 0-0, policy_version 530095 (0.00093) [2022-07-10 02:36:33,104][26022] Updated weights on worker 0-0, policy_version 530105 (0.00085) [2022-07-10 02:36:33,473][25689] Fps is (10 sec: 5679.5, 60 sec: 5672.4, 300 sec: 5644.7). Total num frames: 542829568. Throughput: 0: 5096.2. Samples: 542823432. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:33,473][25689] Avg episode reward: [(0, '-31.963')] [2022-07-10 02:36:34,875][26022] Updated weights on worker 0-0, policy_version 530115 (0.00081) [2022-07-10 02:36:36,725][26022] Updated weights on worker 0-0, policy_version 530125 (0.00092) [2022-07-10 02:36:38,482][25689] Fps is (10 sec: 5808.1, 60 sec: 5672.4, 300 sec: 5645.5). Total num frames: 542858240. Throughput: 0: 5969.0. Samples: 542857964. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:38,482][25689] Avg episode reward: [(0, '-30.940')] [2022-07-10 02:36:38,486][26022] Updated weights on worker 0-0, policy_version 530135 (0.00091) [2022-07-10 02:36:40,399][26022] Updated weights on worker 0-0, policy_version 530145 (0.00083) [2022-07-10 02:36:41,975][26022] Updated weights on worker 0-0, policy_version 530155 (0.00091) [2022-07-10 02:36:43,510][25689] Fps is (10 sec: 5610.9, 60 sec: 5653.7, 300 sec: 5639.9). Total num frames: 542885888. Throughput: 0: 5971.7. Samples: 542892284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:43,511][25689] Avg episode reward: [(0, '-30.457')] [2022-07-10 02:36:43,894][26022] Updated weights on worker 0-0, policy_version 530165 (0.00091) [2022-07-10 02:36:45,787][26022] Updated weights on worker 0-0, policy_version 530175 (0.00083) [2022-07-10 02:36:47,389][26022] Updated weights on worker 0-0, policy_version 530185 (0.00090) [2022-07-10 02:36:48,567][25689] Fps is (10 sec: 5686.0, 60 sec: 5676.3, 300 sec: 5650.1). Total num frames: 542915584. Throughput: 0: 5103.6. Samples: 542909224. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:48,568][25689] Avg episode reward: [(0, '-29.120')] [2022-07-10 02:36:49,439][26022] Updated weights on worker 0-0, policy_version 530195 (0.00095) [2022-07-10 02:36:51,073][26022] Updated weights on worker 0-0, policy_version 530205 (0.00089) [2022-07-10 02:36:52,967][26022] Updated weights on worker 0-0, policy_version 530215 (0.00096) [2022-07-10 02:36:53,573][25689] Fps is (10 sec: 5800.3, 60 sec: 5642.6, 300 sec: 5643.2). Total num frames: 542944256. Throughput: 0: 5960.4. Samples: 542943240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:53,574][25689] Avg episode reward: [(0, '-30.085')] [2022-07-10 02:36:54,721][26022] Updated weights on worker 0-0, policy_version 530225 (0.00089) [2022-07-10 02:36:56,417][26022] Updated weights on worker 0-0, policy_version 530235 (0.00089) [2022-07-10 02:36:58,268][26022] Updated weights on worker 0-0, policy_version 530245 (0.00090) [2022-07-10 02:36:58,579][25689] Fps is (10 sec: 5625.3, 60 sec: 5659.6, 300 sec: 5646.9). Total num frames: 542971904. Throughput: 0: 5960.3. Samples: 542977748. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:36:58,579][25689] Avg episode reward: [(0, '-30.445')] [2022-07-10 02:36:59,938][26022] Updated weights on worker 0-0, policy_version 530255 (0.00099) [2022-07-10 02:37:01,914][26022] Updated weights on worker 0-0, policy_version 530265 (0.00091) [2022-07-10 02:37:03,589][25689] Fps is (10 sec: 5316.4, 60 sec: 5628.4, 300 sec: 5642.1). Total num frames: 542997504. Throughput: 0: 5095.9. Samples: 542994604. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:03,590][25689] Avg episode reward: [(0, '-31.962')] [2022-07-10 02:37:04,382][26022] Updated weights on worker 0-0, policy_version 530275 (0.00086) [2022-07-10 02:37:05,901][26022] Updated weights on worker 0-0, policy_version 530285 (0.00083) [2022-07-10 02:37:07,799][26022] Updated weights on worker 0-0, policy_version 530295 (0.00084) [2022-07-10 02:37:08,669][25689] Fps is (10 sec: 5479.8, 60 sec: 5647.8, 300 sec: 5641.6). Total num frames: 543027200. Throughput: 0: 5826.9. Samples: 543026360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:08,670][25689] Avg episode reward: [(0, '-31.734')] [2022-07-10 02:37:09,585][26022] Updated weights on worker 0-0, policy_version 530305 (0.00094) [2022-07-10 02:37:11,347][26022] Updated weights on worker 0-0, policy_version 530315 (0.00090) [2022-07-10 02:37:13,411][26022] Updated weights on worker 0-0, policy_version 530325 (0.00100) [2022-07-10 02:37:13,676][25689] Fps is (10 sec: 5684.7, 60 sec: 5649.9, 300 sec: 5642.9). Total num frames: 543054848. Throughput: 0: 5822.8. Samples: 543060298. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:13,677][25689] Avg episode reward: [(0, '-31.969')] [2022-07-10 02:37:14,939][26022] Updated weights on worker 0-0, policy_version 530335 (0.00083) [2022-07-10 02:37:16,915][26022] Updated weights on worker 0-0, policy_version 530345 (0.00084) [2022-07-10 02:37:18,418][26022] Updated weights on worker 0-0, policy_version 530355 (0.00084) [2022-07-10 02:37:18,693][25689] Fps is (10 sec: 5721.1, 60 sec: 5668.6, 300 sec: 5646.1). Total num frames: 543084544. Throughput: 0: 4955.9. Samples: 543077430. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:18,693][25689] Avg episode reward: [(0, '-32.229')] [2022-07-10 02:37:20,407][26022] Updated weights on worker 0-0, policy_version 530365 (0.00081) [2022-07-10 02:37:22,119][26022] Updated weights on worker 0-0, policy_version 530375 (0.00097) [2022-07-10 02:37:23,699][25689] Fps is (10 sec: 5721.6, 60 sec: 5635.8, 300 sec: 5648.1). Total num frames: 543112192. Throughput: 0: 5807.6. Samples: 543111394. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:23,699][25689] Avg episode reward: [(0, '-31.131')] [2022-07-10 02:37:24,123][26022] Updated weights on worker 0-0, policy_version 530385 (0.00088) [2022-07-10 02:37:25,712][26022] Updated weights on worker 0-0, policy_version 530395 (0.00089) [2022-07-10 02:37:27,755][26022] Updated weights on worker 0-0, policy_version 530405 (0.00090) [2022-07-10 02:37:28,751][25689] Fps is (10 sec: 5599.0, 60 sec: 5653.1, 300 sec: 5640.8). Total num frames: 543140864. Throughput: 0: 5938.3. Samples: 543145614. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:28,752][25689] Avg episode reward: [(0, '-30.559')] [2022-07-10 02:37:29,494][26022] Updated weights on worker 0-0, policy_version 530415 (0.00086) [2022-07-10 02:37:31,260][26022] Updated weights on worker 0-0, policy_version 530425 (0.00090) [2022-07-10 02:37:33,019][26022] Updated weights on worker 0-0, policy_version 530435 (0.00090) [2022-07-10 02:37:33,755][25689] Fps is (10 sec: 5498.9, 60 sec: 5605.6, 300 sec: 5637.8). Total num frames: 543167488. Throughput: 0: 5080.8. Samples: 543162312. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:33,755][25689] Avg episode reward: [(0, '-29.480')] [2022-07-10 02:37:34,739][26022] Updated weights on worker 0-0, policy_version 530445 (0.00082) [2022-07-10 02:37:36,843][26022] Updated weights on worker 0-0, policy_version 530455 (0.00087) [2022-07-10 02:37:38,399][26022] Updated weights on worker 0-0, policy_version 530465 (0.00094) [2022-07-10 02:37:38,763][25689] Fps is (10 sec: 5625.5, 60 sec: 5622.7, 300 sec: 5645.5). Total num frames: 543197184. Throughput: 0: 5918.9. Samples: 543196226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:38,764][25689] Avg episode reward: [(0, '-29.135')] [2022-07-10 02:37:40,263][26022] Updated weights on worker 0-0, policy_version 530475 (0.00086) [2022-07-10 02:37:42,146][26022] Updated weights on worker 0-0, policy_version 530485 (0.00097) [2022-07-10 02:37:43,772][25689] Fps is (10 sec: 5826.4, 60 sec: 5641.5, 300 sec: 5643.5). Total num frames: 543225856. Throughput: 0: 5940.2. Samples: 543230636. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:43,773][25689] Avg episode reward: [(0, '-30.224')] [2022-07-10 02:37:43,881][26022] Updated weights on worker 0-0, policy_version 530495 (0.00091) [2022-07-10 02:37:45,691][26022] Updated weights on worker 0-0, policy_version 530505 (0.00086) [2022-07-10 02:37:47,485][26022] Updated weights on worker 0-0, policy_version 530515 (0.00093) [2022-07-10 02:37:48,904][25689] Fps is (10 sec: 5755.8, 60 sec: 5634.5, 300 sec: 5644.6). Total num frames: 543255552. Throughput: 0: 5071.2. Samples: 543247810. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:48,904][25689] Avg episode reward: [(0, '-29.852')] [2022-07-10 02:37:49,311][26022] Updated weights on worker 0-0, policy_version 530525 (0.00089) [2022-07-10 02:37:51,190][26022] Updated weights on worker 0-0, policy_version 530535 (0.00086) [2022-07-10 02:37:52,393][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:37:52,409][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000530542_543275008.pth [2022-07-10 02:37:52,410][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000528557_541242368.pth [2022-07-10 02:37:52,823][26022] Updated weights on worker 0-0, policy_version 530545 (0.00087) [2022-07-10 02:37:53,926][25689] Fps is (10 sec: 5647.9, 60 sec: 5616.1, 300 sec: 5641.1). Total num frames: 543283200. Throughput: 0: 5922.4. Samples: 543281772. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:53,928][25689] Avg episode reward: [(0, '-30.619')] [2022-07-10 02:37:54,702][26022] Updated weights on worker 0-0, policy_version 530555 (0.00087) [2022-07-10 02:37:56,394][26022] Updated weights on worker 0-0, policy_version 530565 (0.00086) [2022-07-10 02:37:58,247][26022] Updated weights on worker 0-0, policy_version 530575 (0.00085) [2022-07-10 02:37:58,936][25689] Fps is (10 sec: 5614.1, 60 sec: 5632.6, 300 sec: 5644.6). Total num frames: 543311872. Throughput: 0: 5940.1. Samples: 543316054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:37:58,937][25689] Avg episode reward: [(0, '-31.282')] [2022-07-10 02:38:00,363][26022] Updated weights on worker 0-0, policy_version 530585 (0.00085) [2022-07-10 02:38:02,024][26022] Updated weights on worker 0-0, policy_version 530595 (0.00082) [2022-07-10 02:38:03,940][25689] Fps is (10 sec: 5317.4, 60 sec: 5616.3, 300 sec: 5640.3). Total num frames: 543336448. Throughput: 0: 5796.0. Samples: 543347524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:03,940][25689] Avg episode reward: [(0, '-32.357')] [2022-07-10 02:38:04,358][26022] Updated weights on worker 0-0, policy_version 530605 (0.00090) [2022-07-10 02:38:05,795][26022] Updated weights on worker 0-0, policy_version 530615 (0.00417) [2022-07-10 02:38:07,900][26022] Updated weights on worker 0-0, policy_version 530625 (0.00086) [2022-07-10 02:38:09,068][25689] Fps is (10 sec: 5558.5, 60 sec: 5645.7, 300 sec: 5645.0). Total num frames: 543368192. Throughput: 0: 5783.4. Samples: 543364428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:09,070][25689] Avg episode reward: [(0, '-31.019')] [2022-07-10 02:38:09,400][26022] Updated weights on worker 0-0, policy_version 530635 (0.00088) [2022-07-10 02:38:11,540][26022] Updated weights on worker 0-0, policy_version 530645 (0.00081) [2022-07-10 02:38:13,176][26022] Updated weights on worker 0-0, policy_version 530655 (0.00087) [2022-07-10 02:38:14,079][25689] Fps is (10 sec: 5756.5, 60 sec: 5628.4, 300 sec: 5638.2). Total num frames: 543394816. Throughput: 0: 5802.5. Samples: 543398712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:14,081][25689] Avg episode reward: [(0, '-30.123')] [2022-07-10 02:38:14,980][26022] Updated weights on worker 0-0, policy_version 530665 (0.00092) [2022-07-10 02:38:16,648][26022] Updated weights on worker 0-0, policy_version 530675 (0.00093) [2022-07-10 02:38:18,503][26022] Updated weights on worker 0-0, policy_version 530685 (0.00091) [2022-07-10 02:38:19,133][25689] Fps is (10 sec: 5595.4, 60 sec: 5624.8, 300 sec: 5641.2). Total num frames: 543424512. Throughput: 0: 5790.5. Samples: 543433008. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:19,134][25689] Avg episode reward: [(0, '-28.885')] [2022-07-10 02:38:20,456][26022] Updated weights on worker 0-0, policy_version 530695 (0.00085) [2022-07-10 02:38:22,259][26022] Updated weights on worker 0-0, policy_version 530705 (0.00090) [2022-07-10 02:38:24,036][26022] Updated weights on worker 0-0, policy_version 530715 (0.00093) [2022-07-10 02:38:24,177][25689] Fps is (10 sec: 5679.0, 60 sec: 5621.4, 300 sec: 5642.0). Total num frames: 543452160. Throughput: 0: 5053.3. Samples: 543449788. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:24,177][25689] Avg episode reward: [(0, '-28.628')] [2022-07-10 02:38:25,808][26022] Updated weights on worker 0-0, policy_version 530725 (0.00091) [2022-07-10 02:38:27,596][26022] Updated weights on worker 0-0, policy_version 530735 (0.00091) [2022-07-10 02:38:29,248][25689] Fps is (10 sec: 5669.6, 60 sec: 5636.6, 300 sec: 5640.7). Total num frames: 543481856. Throughput: 0: 5918.7. Samples: 543483866. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:29,248][25689] Avg episode reward: [(0, '-28.647')] [2022-07-10 02:38:29,493][26022] Updated weights on worker 0-0, policy_version 530745 (0.00091) [2022-07-10 02:38:31,380][26022] Updated weights on worker 0-0, policy_version 530755 (0.00086) [2022-07-10 02:38:33,081][26022] Updated weights on worker 0-0, policy_version 530765 (0.00085) [2022-07-10 02:38:34,266][25689] Fps is (10 sec: 5582.2, 60 sec: 5635.2, 300 sec: 5638.0). Total num frames: 543508480. Throughput: 0: 5882.4. Samples: 543517458. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:34,266][25689] Avg episode reward: [(0, '-27.074')] [2022-07-10 02:38:35,158][26022] Updated weights on worker 0-0, policy_version 530775 (0.00092) [2022-07-10 02:38:36,573][26022] Updated weights on worker 0-0, policy_version 530785 (0.00083) [2022-07-10 02:38:38,753][26022] Updated weights on worker 0-0, policy_version 530795 (0.00085) [2022-07-10 02:38:39,325][25689] Fps is (10 sec: 5589.0, 60 sec: 5630.5, 300 sec: 5644.2). Total num frames: 543538176. Throughput: 0: 5011.8. Samples: 543534202. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:39,325][25689] Avg episode reward: [(0, '-28.755')] [2022-07-10 02:38:40,331][26022] Updated weights on worker 0-0, policy_version 530805 (0.00092) [2022-07-10 02:38:42,295][26022] Updated weights on worker 0-0, policy_version 530815 (0.00091) [2022-07-10 02:38:44,027][26022] Updated weights on worker 0-0, policy_version 530825 (0.00085) [2022-07-10 02:38:44,363][25689] Fps is (10 sec: 5679.2, 60 sec: 5610.9, 300 sec: 5641.3). Total num frames: 543565824. Throughput: 0: 5860.3. Samples: 543568086. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:44,363][25689] Avg episode reward: [(0, '-27.824')] [2022-07-10 02:38:45,792][26022] Updated weights on worker 0-0, policy_version 530835 (0.00086) [2022-07-10 02:38:47,619][26022] Updated weights on worker 0-0, policy_version 530845 (0.00085) [2022-07-10 02:38:49,404][25689] Fps is (10 sec: 5587.5, 60 sec: 5602.3, 300 sec: 5641.9). Total num frames: 543594496. Throughput: 0: 5880.1. Samples: 543602390. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 02:38:49,405][25689] Avg episode reward: [(0, '-28.809')] [2022-07-10 02:38:49,446][26022] Updated weights on worker 0-0, policy_version 530855 (0.00091) [2022-07-10 02:38:51,219][26022] Updated weights on worker 0-0, policy_version 530865 (0.00093) [2022-07-10 02:38:53,244][26022] Updated weights on worker 0-0, policy_version 530875 (0.00091) [2022-07-10 02:38:54,488][25689] Fps is (10 sec: 5663.8, 60 sec: 5613.6, 300 sec: 5637.4). Total num frames: 543623168. Throughput: 0: 5027.5. Samples: 543619124. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:38:54,488][25689] Avg episode reward: [(0, '-27.370')] [2022-07-10 02:38:55,117][26022] Updated weights on worker 0-0, policy_version 530885 (0.00615) [2022-07-10 02:38:56,772][26022] Updated weights on worker 0-0, policy_version 530895 (0.00085) [2022-07-10 02:38:58,659][26022] Updated weights on worker 0-0, policy_version 530905 (0.00073) [2022-07-10 02:38:59,564][25689] Fps is (10 sec: 5745.2, 60 sec: 5624.3, 300 sec: 5647.7). Total num frames: 543652864. Throughput: 0: 5885.4. Samples: 543653318. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:38:59,566][25689] Avg episode reward: [(0, '-27.229')] [2022-07-10 02:39:00,371][26022] Updated weights on worker 0-0, policy_version 530915 (0.00088) [2022-07-10 02:39:02,547][26022] Updated weights on worker 0-0, policy_version 530925 (0.00098) [2022-07-10 02:39:04,448][26022] Updated weights on worker 0-0, policy_version 530935 (0.00091) [2022-07-10 02:39:04,646][25689] Fps is (10 sec: 5342.5, 60 sec: 5617.0, 300 sec: 5637.0). Total num frames: 543677440. Throughput: 0: 5785.2. Samples: 543685428. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:04,648][25689] Avg episode reward: [(0, '-28.386')] [2022-07-10 02:39:06,009][26022] Updated weights on worker 0-0, policy_version 530945 (0.00087) [2022-07-10 02:39:07,978][26022] Updated weights on worker 0-0, policy_version 530955 (0.00092) [2022-07-10 02:39:09,725][25689] Fps is (10 sec: 5441.9, 60 sec: 5604.7, 300 sec: 5636.3). Total num frames: 543708160. Throughput: 0: 4930.1. Samples: 543702574. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:09,726][25689] Avg episode reward: [(0, '-28.019')] [2022-07-10 02:39:09,734][26022] Updated weights on worker 0-0, policy_version 530965 (0.00087) [2022-07-10 02:39:11,444][26022] Updated weights on worker 0-0, policy_version 530975 (0.00095) [2022-07-10 02:39:13,335][26022] Updated weights on worker 0-0, policy_version 530985 (0.00093) [2022-07-10 02:39:14,798][25689] Fps is (10 sec: 5951.2, 60 sec: 5649.6, 300 sec: 5643.2). Total num frames: 543737856. Throughput: 0: 5800.7. Samples: 543736938. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:14,800][25689] Avg episode reward: [(0, '-28.993')] [2022-07-10 02:39:15,016][26022] Updated weights on worker 0-0, policy_version 530995 (0.00095) [2022-07-10 02:39:16,835][26022] Updated weights on worker 0-0, policy_version 531005 (0.00076) [2022-07-10 02:39:18,525][26022] Updated weights on worker 0-0, policy_version 531015 (0.00085) [2022-07-10 02:39:19,827][25689] Fps is (10 sec: 5778.1, 60 sec: 5635.1, 300 sec: 5639.6). Total num frames: 543766528. Throughput: 0: 5831.4. Samples: 543771478. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:19,829][25689] Avg episode reward: [(0, '-29.238')] [2022-07-10 02:39:20,312][26022] Updated weights on worker 0-0, policy_version 531025 (0.00090) [2022-07-10 02:39:22,281][26022] Updated weights on worker 0-0, policy_version 531035 (0.00082) [2022-07-10 02:39:24,023][26022] Updated weights on worker 0-0, policy_version 531045 (0.00053) [2022-07-10 02:39:24,909][25689] Fps is (10 sec: 5671.5, 60 sec: 5648.4, 300 sec: 5643.3). Total num frames: 543795200. Throughput: 0: 5102.5. Samples: 543788818. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:24,910][25689] Avg episode reward: [(0, '-30.132')] [2022-07-10 02:39:25,656][26022] Updated weights on worker 0-0, policy_version 531055 (0.00084) [2022-07-10 02:39:27,676][26022] Updated weights on worker 0-0, policy_version 531065 (0.00086) [2022-07-10 02:39:29,249][26022] Updated weights on worker 0-0, policy_version 531075 (0.00093) [2022-07-10 02:39:29,953][25689] Fps is (10 sec: 5662.8, 60 sec: 5634.0, 300 sec: 5642.6). Total num frames: 543823872. Throughput: 0: 5968.3. Samples: 543823302. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:29,954][25689] Avg episode reward: [(0, '-31.079')] [2022-07-10 02:39:31,082][26022] Updated weights on worker 0-0, policy_version 531085 (0.00094) [2022-07-10 02:39:32,971][26022] Updated weights on worker 0-0, policy_version 531095 (0.00441) [2022-07-10 02:39:34,761][26022] Updated weights on worker 0-0, policy_version 531105 (0.00087) [2022-07-10 02:39:35,039][25689] Fps is (10 sec: 5660.7, 60 sec: 5661.4, 300 sec: 5641.2). Total num frames: 543852544. Throughput: 0: 5935.7. Samples: 543857082. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:35,040][25689] Avg episode reward: [(0, '-31.241')] [2022-07-10 02:39:36,596][26022] Updated weights on worker 0-0, policy_version 531115 (0.00084) [2022-07-10 02:39:38,267][26022] Updated weights on worker 0-0, policy_version 531125 (0.00089) [2022-07-10 02:39:40,067][25689] Fps is (10 sec: 5670.2, 60 sec: 5647.5, 300 sec: 5640.9). Total num frames: 543881216. Throughput: 0: 5065.8. Samples: 543874008. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:40,067][25689] Avg episode reward: [(0, '-32.018')] [2022-07-10 02:39:40,125][26022] Updated weights on worker 0-0, policy_version 531135 (0.00092) [2022-07-10 02:39:41,938][26022] Updated weights on worker 0-0, policy_version 531145 (0.00095) [2022-07-10 02:39:43,770][26022] Updated weights on worker 0-0, policy_version 531155 (0.00094) [2022-07-10 02:39:45,072][25689] Fps is (10 sec: 5715.9, 60 sec: 5667.5, 300 sec: 5645.4). Total num frames: 543909888. Throughput: 0: 5932.8. Samples: 543908438. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:45,072][25689] Avg episode reward: [(0, '-32.174')] [2022-07-10 02:39:45,388][26022] Updated weights on worker 0-0, policy_version 531165 (0.00084) [2022-07-10 02:39:47,367][26022] Updated weights on worker 0-0, policy_version 531175 (0.00069) [2022-07-10 02:39:48,970][26022] Updated weights on worker 0-0, policy_version 531185 (0.00091) [2022-07-10 02:39:50,132][25689] Fps is (10 sec: 5697.0, 60 sec: 5665.7, 300 sec: 5644.5). Total num frames: 543938560. Throughput: 0: 5927.2. Samples: 543942908. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:50,133][25689] Avg episode reward: [(0, '-32.504')] [2022-07-10 02:39:50,856][26022] Updated weights on worker 0-0, policy_version 531195 (0.00078) [2022-07-10 02:39:52,525][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:39:52,540][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000531205_543953920.pth [2022-07-10 02:39:52,540][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000529218_541919232.pth [2022-07-10 02:39:52,544][26022] Updated weights on worker 0-0, policy_version 531205 (0.00084) [2022-07-10 02:39:54,493][26022] Updated weights on worker 0-0, policy_version 531215 (0.01084) [2022-07-10 02:39:55,156][25689] Fps is (10 sec: 5686.5, 60 sec: 5671.2, 300 sec: 5644.4). Total num frames: 543967232. Throughput: 0: 5124.9. Samples: 543960178. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:39:55,156][25689] Avg episode reward: [(0, '-33.850')] [2022-07-10 02:39:56,269][26022] Updated weights on worker 0-0, policy_version 531225 (0.00090) [2022-07-10 02:39:58,239][26022] Updated weights on worker 0-0, policy_version 531235 (0.00088) [2022-07-10 02:39:59,849][26022] Updated weights on worker 0-0, policy_version 531245 (0.00090) [2022-07-10 02:40:00,207][25689] Fps is (10 sec: 5793.7, 60 sec: 5673.6, 300 sec: 5657.2). Total num frames: 543996928. Throughput: 0: 5981.9. Samples: 543994484. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:00,207][25689] Avg episode reward: [(0, '-32.562')] [2022-07-10 02:40:02,249][26022] Updated weights on worker 0-0, policy_version 531255 (0.00094) [2022-07-10 02:40:03,772][26022] Updated weights on worker 0-0, policy_version 531265 (0.00084) [2022-07-10 02:40:05,208][25689] Fps is (10 sec: 5501.1, 60 sec: 5698.1, 300 sec: 5645.5). Total num frames: 544022528. Throughput: 0: 5859.2. Samples: 544026420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:05,208][25689] Avg episode reward: [(0, '-31.930')] [2022-07-10 02:40:05,749][26022] Updated weights on worker 0-0, policy_version 531275 (0.00092) [2022-07-10 02:40:07,528][26022] Updated weights on worker 0-0, policy_version 531285 (0.00092) [2022-07-10 02:40:09,371][26022] Updated weights on worker 0-0, policy_version 531295 (0.00082) [2022-07-10 02:40:10,289][25689] Fps is (10 sec: 5382.9, 60 sec: 5664.1, 300 sec: 5648.9). Total num frames: 544051200. Throughput: 0: 5822.2. Samples: 544060264. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:10,289][25689] Avg episode reward: [(0, '-30.360')] [2022-07-10 02:40:11,132][26022] Updated weights on worker 0-0, policy_version 531305 (0.01039) [2022-07-10 02:40:12,885][26022] Updated weights on worker 0-0, policy_version 531315 (0.00083) [2022-07-10 02:40:14,524][26022] Updated weights on worker 0-0, policy_version 531325 (0.00094) [2022-07-10 02:40:15,335][25689] Fps is (10 sec: 5662.4, 60 sec: 5649.7, 300 sec: 5642.2). Total num frames: 544079872. Throughput: 0: 5815.9. Samples: 544077536. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:15,335][25689] Avg episode reward: [(0, '-29.585')] [2022-07-10 02:40:16,713][26022] Updated weights on worker 0-0, policy_version 531335 (0.00454) [2022-07-10 02:40:18,137][26022] Updated weights on worker 0-0, policy_version 531345 (0.00093) [2022-07-10 02:40:20,150][26022] Updated weights on worker 0-0, policy_version 531355 (0.00089) [2022-07-10 02:40:20,341][25689] Fps is (10 sec: 5704.8, 60 sec: 5651.9, 300 sec: 5646.0). Total num frames: 544108544. Throughput: 0: 5838.3. Samples: 544112032. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:20,341][25689] Avg episode reward: [(0, '-28.975')] [2022-07-10 02:40:21,849][26022] Updated weights on worker 0-0, policy_version 531365 (0.00082) [2022-07-10 02:40:23,501][26022] Updated weights on worker 0-0, policy_version 531375 (0.00085) [2022-07-10 02:40:25,351][25689] Fps is (10 sec: 5725.0, 60 sec: 5658.6, 300 sec: 5647.3). Total num frames: 544137216. Throughput: 0: 5964.2. Samples: 544146560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:25,352][25689] Avg episode reward: [(0, '-28.903')] [2022-07-10 02:40:25,434][26022] Updated weights on worker 0-0, policy_version 531385 (0.00084) [2022-07-10 02:40:27,112][26022] Updated weights on worker 0-0, policy_version 531395 (0.00087) [2022-07-10 02:40:29,134][26022] Updated weights on worker 0-0, policy_version 531405 (0.00088) [2022-07-10 02:40:30,465][25689] Fps is (10 sec: 5765.3, 60 sec: 5669.0, 300 sec: 5649.0). Total num frames: 544166912. Throughput: 0: 5123.7. Samples: 544163640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:30,465][25689] Avg episode reward: [(0, '-29.235')] [2022-07-10 02:40:30,820][26022] Updated weights on worker 0-0, policy_version 531415 (0.00082) [2022-07-10 02:40:32,711][26022] Updated weights on worker 0-0, policy_version 531425 (0.00089) [2022-07-10 02:40:34,335][26022] Updated weights on worker 0-0, policy_version 531435 (0.00092) [2022-07-10 02:40:35,506][25689] Fps is (10 sec: 5646.9, 60 sec: 5656.2, 300 sec: 5644.9). Total num frames: 544194560. Throughput: 0: 5958.6. Samples: 544197730. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:35,507][25689] Avg episode reward: [(0, '-30.230')] [2022-07-10 02:40:36,296][26022] Updated weights on worker 0-0, policy_version 531445 (0.00084) [2022-07-10 02:40:37,948][26022] Updated weights on worker 0-0, policy_version 531455 (0.00086) [2022-07-10 02:40:39,795][26022] Updated weights on worker 0-0, policy_version 531465 (0.00088) [2022-07-10 02:40:40,544][25689] Fps is (10 sec: 5790.8, 60 sec: 5689.1, 300 sec: 5651.2). Total num frames: 544225280. Throughput: 0: 5945.4. Samples: 544232150. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:40,545][25689] Avg episode reward: [(0, '-29.732')] [2022-07-10 02:40:41,605][26022] Updated weights on worker 0-0, policy_version 531475 (0.00096) [2022-07-10 02:40:43,295][26022] Updated weights on worker 0-0, policy_version 531485 (0.00081) [2022-07-10 02:40:44,992][26022] Updated weights on worker 0-0, policy_version 531495 (0.00081) [2022-07-10 02:40:45,548][25689] Fps is (10 sec: 5812.6, 60 sec: 5672.3, 300 sec: 5650.0). Total num frames: 544252928. Throughput: 0: 5097.3. Samples: 544249508. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:45,548][25689] Avg episode reward: [(0, '-30.294')] [2022-07-10 02:40:46,824][26022] Updated weights on worker 0-0, policy_version 531505 (0.00080) [2022-07-10 02:40:48,827][26022] Updated weights on worker 0-0, policy_version 531515 (0.00084) [2022-07-10 02:40:50,450][26022] Updated weights on worker 0-0, policy_version 531525 (0.00090) [2022-07-10 02:40:50,627][25689] Fps is (10 sec: 5586.0, 60 sec: 5670.6, 300 sec: 5641.7). Total num frames: 544281600. Throughput: 0: 5960.2. Samples: 544283808. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:50,627][25689] Avg episode reward: [(0, '-31.007')] [2022-07-10 02:40:52,391][26022] Updated weights on worker 0-0, policy_version 531535 (0.00086) [2022-07-10 02:40:54,130][26022] Updated weights on worker 0-0, policy_version 531545 (0.00088) [2022-07-10 02:40:55,643][25689] Fps is (10 sec: 5680.6, 60 sec: 5671.3, 300 sec: 5648.4). Total num frames: 544310272. Throughput: 0: 5987.9. Samples: 544318306. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:40:55,644][25689] Avg episode reward: [(0, '-30.641')] [2022-07-10 02:40:55,855][26022] Updated weights on worker 0-0, policy_version 531555 (0.00093) [2022-07-10 02:40:57,535][26022] Updated weights on worker 0-0, policy_version 531565 (0.00086) [2022-07-10 02:40:59,434][26022] Updated weights on worker 0-0, policy_version 531575 (0.00085) [2022-07-10 02:41:00,657][25689] Fps is (10 sec: 5717.0, 60 sec: 5657.8, 300 sec: 5652.4). Total num frames: 544338944. Throughput: 0: 5134.4. Samples: 544335416. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:00,657][25689] Avg episode reward: [(0, '-30.233')] [2022-07-10 02:41:01,201][26022] Updated weights on worker 0-0, policy_version 531585 (0.00091) [2022-07-10 02:41:03,409][26022] Updated weights on worker 0-0, policy_version 531595 (0.00089) [2022-07-10 02:41:05,159][26022] Updated weights on worker 0-0, policy_version 531605 (0.00089) [2022-07-10 02:41:05,664][25689] Fps is (10 sec: 5517.7, 60 sec: 5674.2, 300 sec: 5647.4). Total num frames: 544365568. Throughput: 0: 5883.8. Samples: 544367868. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:05,665][25689] Avg episode reward: [(0, '-28.471')] [2022-07-10 02:41:07,027][26022] Updated weights on worker 0-0, policy_version 531615 (0.00088) [2022-07-10 02:41:08,760][26022] Updated weights on worker 0-0, policy_version 531625 (0.00093) [2022-07-10 02:41:10,767][25689] Fps is (10 sec: 5469.7, 60 sec: 5672.2, 300 sec: 5649.4). Total num frames: 544394240. Throughput: 0: 5868.1. Samples: 544401992. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:10,767][25689] Avg episode reward: [(0, '-28.532')] [2022-07-10 02:41:10,768][26022] Updated weights on worker 0-0, policy_version 531635 (0.00126) [2022-07-10 02:41:12,365][26022] Updated weights on worker 0-0, policy_version 531645 (0.00083) [2022-07-10 02:41:14,087][26022] Updated weights on worker 0-0, policy_version 531655 (0.00089) [2022-07-10 02:41:15,853][25689] Fps is (10 sec: 5728.7, 60 sec: 5685.3, 300 sec: 5651.9). Total num frames: 544423936. Throughput: 0: 4999.2. Samples: 544419340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:15,853][25689] Avg episode reward: [(0, '-28.978')] [2022-07-10 02:41:15,970][26022] Updated weights on worker 0-0, policy_version 531665 (0.00099) [2022-07-10 02:41:17,781][26022] Updated weights on worker 0-0, policy_version 531675 (0.00056) [2022-07-10 02:41:19,556][26022] Updated weights on worker 0-0, policy_version 531685 (0.00089) [2022-07-10 02:41:20,951][25689] Fps is (10 sec: 5831.3, 60 sec: 5693.5, 300 sec: 5650.4). Total num frames: 544453632. Throughput: 0: 5833.7. Samples: 544453808. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:20,952][25689] Avg episode reward: [(0, '-29.802')] [2022-07-10 02:41:21,253][26022] Updated weights on worker 0-0, policy_version 531695 (0.00087) [2022-07-10 02:41:23,185][26022] Updated weights on worker 0-0, policy_version 531705 (0.00089) [2022-07-10 02:41:25,083][26022] Updated weights on worker 0-0, policy_version 531715 (0.00084) [2022-07-10 02:41:25,965][25689] Fps is (10 sec: 5670.8, 60 sec: 5676.3, 300 sec: 5651.2). Total num frames: 544481280. Throughput: 0: 5901.2. Samples: 544487668. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:25,966][25689] Avg episode reward: [(0, '-30.627')] [2022-07-10 02:41:26,859][26022] Updated weights on worker 0-0, policy_version 531725 (0.00085) [2022-07-10 02:41:28,547][26022] Updated weights on worker 0-0, policy_version 531735 (0.00093) [2022-07-10 02:41:30,488][26022] Updated weights on worker 0-0, policy_version 531745 (0.00084) [2022-07-10 02:41:31,037][25689] Fps is (10 sec: 5685.6, 60 sec: 5680.2, 300 sec: 5650.6). Total num frames: 544510976. Throughput: 0: 5062.6. Samples: 544504616. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:31,038][25689] Avg episode reward: [(0, '-31.363')] [2022-07-10 02:41:32,198][26022] Updated weights on worker 0-0, policy_version 531755 (0.00095) [2022-07-10 02:41:33,896][26022] Updated weights on worker 0-0, policy_version 531765 (0.00090) [2022-07-10 02:41:35,843][26022] Updated weights on worker 0-0, policy_version 531775 (0.00099) [2022-07-10 02:41:36,129][25689] Fps is (10 sec: 5641.8, 60 sec: 5675.5, 300 sec: 5645.6). Total num frames: 544538624. Throughput: 0: 5900.5. Samples: 544538980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:36,130][25689] Avg episode reward: [(0, '-31.022')] [2022-07-10 02:41:37,538][26022] Updated weights on worker 0-0, policy_version 531785 (0.00085) [2022-07-10 02:41:39,529][26022] Updated weights on worker 0-0, policy_version 531795 (0.00087) [2022-07-10 02:41:40,986][26022] Updated weights on worker 0-0, policy_version 531805 (0.00088) [2022-07-10 02:41:41,161][25689] Fps is (10 sec: 5664.2, 60 sec: 5659.1, 300 sec: 5652.4). Total num frames: 544568320. Throughput: 0: 5898.7. Samples: 544573020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:41,162][25689] Avg episode reward: [(0, '-30.598')] [2022-07-10 02:41:42,967][26022] Updated weights on worker 0-0, policy_version 531815 (0.00093) [2022-07-10 02:41:44,804][26022] Updated weights on worker 0-0, policy_version 531825 (0.00104) [2022-07-10 02:41:46,171][25689] Fps is (10 sec: 5710.6, 60 sec: 5658.6, 300 sec: 5646.4). Total num frames: 544595968. Throughput: 0: 5075.0. Samples: 544590212. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-10 02:41:46,171][25689] Avg episode reward: [(0, '-28.887')] [2022-07-10 02:41:46,598][26022] Updated weights on worker 0-0, policy_version 531835 (0.00086) [2022-07-10 02:41:48,230][26022] Updated weights on worker 0-0, policy_version 531845 (0.00089) [2022-07-10 02:41:50,395][26022] Updated weights on worker 0-0, policy_version 531855 (0.00087) [2022-07-10 02:41:51,212][25689] Fps is (10 sec: 5705.4, 60 sec: 5679.0, 300 sec: 5649.2). Total num frames: 544625664. Throughput: 0: 5952.4. Samples: 544624704. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:41:51,213][25689] Avg episode reward: [(0, '-28.601')] [2022-07-10 02:41:51,611][26022] Updated weights on worker 0-0, policy_version 531865 (0.00087) [2022-07-10 02:41:52,669][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:41:52,689][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000531870_544634880.pth [2022-07-10 02:41:52,690][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000529880_542597120.pth [2022-07-10 02:41:53,866][26022] Updated weights on worker 0-0, policy_version 531875 (0.00085) [2022-07-10 02:41:55,274][26022] Updated weights on worker 0-0, policy_version 531885 (0.00087) [2022-07-10 02:41:56,243][25689] Fps is (10 sec: 5795.1, 60 sec: 5677.6, 300 sec: 5652.1). Total num frames: 544654336. Throughput: 0: 5979.7. Samples: 544659254. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:41:56,243][25689] Avg episode reward: [(0, '-27.354')] [2022-07-10 02:41:57,375][26022] Updated weights on worker 0-0, policy_version 531895 (0.00090) [2022-07-10 02:41:59,064][26022] Updated weights on worker 0-0, policy_version 531905 (0.00092) [2022-07-10 02:42:00,791][26022] Updated weights on worker 0-0, policy_version 531915 (0.00050) [2022-07-10 02:42:01,271][25689] Fps is (10 sec: 5802.9, 60 sec: 5693.2, 300 sec: 5665.5). Total num frames: 544684032. Throughput: 0: 5145.9. Samples: 544676500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:01,271][25689] Avg episode reward: [(0, '-27.886')] [2022-07-10 02:42:03,135][26022] Updated weights on worker 0-0, policy_version 531925 (0.00083) [2022-07-10 02:42:04,671][26022] Updated weights on worker 0-0, policy_version 531935 (0.00091) [2022-07-10 02:42:06,283][25689] Fps is (10 sec: 5405.6, 60 sec: 5659.0, 300 sec: 5649.6). Total num frames: 544708608. Throughput: 0: 5890.6. Samples: 544708684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:06,283][25689] Avg episode reward: [(0, '-27.974')] [2022-07-10 02:42:06,660][26022] Updated weights on worker 0-0, policy_version 531945 (0.00092) [2022-07-10 02:42:08,495][26022] Updated weights on worker 0-0, policy_version 531955 (0.00089) [2022-07-10 02:42:10,209][26022] Updated weights on worker 0-0, policy_version 531965 (0.01066) [2022-07-10 02:42:11,408][25689] Fps is (10 sec: 5353.7, 60 sec: 5673.8, 300 sec: 5654.3). Total num frames: 544738304. Throughput: 0: 5845.1. Samples: 544742750. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:11,408][25689] Avg episode reward: [(0, '-29.261')] [2022-07-10 02:42:12,012][26022] Updated weights on worker 0-0, policy_version 531975 (0.00087) [2022-07-10 02:42:13,807][26022] Updated weights on worker 0-0, policy_version 531985 (0.00083) [2022-07-10 02:42:15,522][26022] Updated weights on worker 0-0, policy_version 531995 (0.00087) [2022-07-10 02:42:16,460][25689] Fps is (10 sec: 5734.8, 60 sec: 5660.0, 300 sec: 5650.1). Total num frames: 544766976. Throughput: 0: 4985.1. Samples: 544760042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:16,461][25689] Avg episode reward: [(0, '-29.125')] [2022-07-10 02:42:17,230][26022] Updated weights on worker 0-0, policy_version 532005 (0.00091) [2022-07-10 02:42:19,205][26022] Updated weights on worker 0-0, policy_version 532015 (0.00080) [2022-07-10 02:42:20,737][26022] Updated weights on worker 0-0, policy_version 532025 (0.00084) [2022-07-10 02:42:21,463][25689] Fps is (10 sec: 5804.6, 60 sec: 5669.0, 300 sec: 5657.1). Total num frames: 544796672. Throughput: 0: 5843.8. Samples: 544794502. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:21,464][25689] Avg episode reward: [(0, '-28.788')] [2022-07-10 02:42:22,863][26022] Updated weights on worker 0-0, policy_version 532035 (0.00091) [2022-07-10 02:42:24,427][26022] Updated weights on worker 0-0, policy_version 532045 (0.00091) [2022-07-10 02:42:26,400][26022] Updated weights on worker 0-0, policy_version 532055 (0.00088) [2022-07-10 02:42:26,468][25689] Fps is (10 sec: 5832.5, 60 sec: 5686.7, 300 sec: 5658.0). Total num frames: 544825344. Throughput: 0: 5972.6. Samples: 544829242. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:26,468][25689] Avg episode reward: [(0, '-28.467')] [2022-07-10 02:42:27,954][26022] Updated weights on worker 0-0, policy_version 532065 (0.00089) [2022-07-10 02:42:30,012][26022] Updated weights on worker 0-0, policy_version 532075 (0.00087) [2022-07-10 02:42:31,547][25689] Fps is (10 sec: 5686.9, 60 sec: 5669.2, 300 sec: 5663.4). Total num frames: 544854016. Throughput: 0: 5980.9. Samples: 544863200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:31,547][25689] Avg episode reward: [(0, '-30.048')] [2022-07-10 02:42:31,650][26022] Updated weights on worker 0-0, policy_version 532085 (0.00109) [2022-07-10 02:42:33,548][26022] Updated weights on worker 0-0, policy_version 532095 (0.00050) [2022-07-10 02:42:35,247][26022] Updated weights on worker 0-0, policy_version 532105 (0.00087) [2022-07-10 02:42:36,558][25689] Fps is (10 sec: 5581.8, 60 sec: 5676.8, 300 sec: 5656.5). Total num frames: 544881664. Throughput: 0: 5984.1. Samples: 544880306. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:36,558][25689] Avg episode reward: [(0, '-30.357')] [2022-07-10 02:42:37,204][26022] Updated weights on worker 0-0, policy_version 532115 (0.00092) [2022-07-10 02:42:38,958][26022] Updated weights on worker 0-0, policy_version 532125 (0.00085) [2022-07-10 02:42:40,809][26022] Updated weights on worker 0-0, policy_version 532135 (0.00094) [2022-07-10 02:42:41,581][25689] Fps is (10 sec: 5714.6, 60 sec: 5677.6, 300 sec: 5659.7). Total num frames: 544911360. Throughput: 0: 5944.6. Samples: 544914096. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:41,582][25689] Avg episode reward: [(0, '-29.054')] [2022-07-10 02:42:42,599][26022] Updated weights on worker 0-0, policy_version 532145 (0.00081) [2022-07-10 02:42:44,408][26022] Updated weights on worker 0-0, policy_version 532155 (0.00092) [2022-07-10 02:42:46,274][26022] Updated weights on worker 0-0, policy_version 532165 (0.00100) [2022-07-10 02:42:46,611][25689] Fps is (10 sec: 5602.4, 60 sec: 5658.8, 300 sec: 5651.3). Total num frames: 544937984. Throughput: 0: 5909.7. Samples: 544948280. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:46,611][25689] Avg episode reward: [(0, '-30.082')] [2022-07-10 02:42:48,035][26022] Updated weights on worker 0-0, policy_version 532175 (0.00083) [2022-07-10 02:42:49,814][26022] Updated weights on worker 0-0, policy_version 532185 (0.00086) [2022-07-10 02:42:51,728][25689] Fps is (10 sec: 5550.5, 60 sec: 5651.7, 300 sec: 5656.4). Total num frames: 544967680. Throughput: 0: 5061.7. Samples: 544965352. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:51,729][25689] Avg episode reward: [(0, '-31.367')] [2022-07-10 02:42:51,733][26022] Updated weights on worker 0-0, policy_version 532195 (0.00090) [2022-07-10 02:42:53,349][26022] Updated weights on worker 0-0, policy_version 532205 (0.00099) [2022-07-10 02:42:55,338][26022] Updated weights on worker 0-0, policy_version 532215 (0.00099) [2022-07-10 02:42:56,751][25689] Fps is (10 sec: 5756.3, 60 sec: 5652.4, 300 sec: 5656.1). Total num frames: 544996352. Throughput: 0: 5906.5. Samples: 544999574. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:42:56,751][25689] Avg episode reward: [(0, '-31.076')] [2022-07-10 02:42:57,027][26022] Updated weights on worker 0-0, policy_version 532225 (0.00087) [2022-07-10 02:42:58,765][26022] Updated weights on worker 0-0, policy_version 532235 (0.00090) [2022-07-10 02:43:00,689][26022] Updated weights on worker 0-0, policy_version 532245 (0.00090) [2022-07-10 02:43:01,767][25689] Fps is (10 sec: 5814.4, 60 sec: 5653.5, 300 sec: 5673.1). Total num frames: 545026048. Throughput: 0: 5937.1. Samples: 545033938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:01,767][25689] Avg episode reward: [(0, '-30.854')] [2022-07-10 02:43:02,559][26022] Updated weights on worker 0-0, policy_version 532255 (0.00089) [2022-07-10 02:43:04,479][26022] Updated weights on worker 0-0, policy_version 532265 (0.00086) [2022-07-10 02:43:06,619][26022] Updated weights on worker 0-0, policy_version 532276 (0.00362) [2022-07-10 02:43:06,812][25689] Fps is (10 sec: 5393.9, 60 sec: 5650.4, 300 sec: 5650.6). Total num frames: 545050624. Throughput: 0: 4987.5. Samples: 545049036. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:06,813][25689] Avg episode reward: [(0, '-28.952')] [2022-07-10 02:43:08,342][26022] Updated weights on worker 0-0, policy_version 532286 (0.00091) [2022-07-10 02:43:10,231][26022] Updated weights on worker 0-0, policy_version 532296 (0.00094) [2022-07-10 02:43:11,776][26022] Updated weights on worker 0-0, policy_version 532306 (0.00089) [2022-07-10 02:43:11,868][25689] Fps is (10 sec: 5474.3, 60 sec: 5673.8, 300 sec: 5663.5). Total num frames: 545081344. Throughput: 0: 5845.8. Samples: 545083084. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:11,868][25689] Avg episode reward: [(0, '-29.052')] [2022-07-10 02:43:13,724][26022] Updated weights on worker 0-0, policy_version 532316 (0.00089) [2022-07-10 02:43:15,639][26022] Updated weights on worker 0-0, policy_version 532326 (0.00085) [2022-07-10 02:43:16,919][25689] Fps is (10 sec: 5876.2, 60 sec: 5673.9, 300 sec: 5660.1). Total num frames: 545110016. Throughput: 0: 5849.7. Samples: 545117556. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:16,920][25689] Avg episode reward: [(0, '-28.376')] [2022-07-10 02:43:17,204][26022] Updated weights on worker 0-0, policy_version 532336 (0.00083) [2022-07-10 02:43:19,040][26022] Updated weights on worker 0-0, policy_version 532346 (0.00087) [2022-07-10 02:43:20,738][26022] Updated weights on worker 0-0, policy_version 532356 (0.00088) [2022-07-10 02:43:21,947][25689] Fps is (10 sec: 5587.9, 60 sec: 5637.8, 300 sec: 5660.4). Total num frames: 545137664. Throughput: 0: 4994.3. Samples: 545134724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:21,947][25689] Avg episode reward: [(0, '-28.472')] [2022-07-10 02:43:22,620][26022] Updated weights on worker 0-0, policy_version 532366 (0.00093) [2022-07-10 02:43:24,598][26022] Updated weights on worker 0-0, policy_version 532376 (0.00083) [2022-07-10 02:43:26,143][26022] Updated weights on worker 0-0, policy_version 532386 (0.00089) [2022-07-10 02:43:26,949][25689] Fps is (10 sec: 5615.2, 60 sec: 5638.0, 300 sec: 5658.3). Total num frames: 545166336. Throughput: 0: 5956.5. Samples: 545168982. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:26,951][25689] Avg episode reward: [(0, '-27.389')] [2022-07-10 02:43:28,084][26022] Updated weights on worker 0-0, policy_version 532396 (0.00089) [2022-07-10 02:43:29,889][26022] Updated weights on worker 0-0, policy_version 532406 (0.00095) [2022-07-10 02:43:31,791][26022] Updated weights on worker 0-0, policy_version 532416 (0.00089) [2022-07-10 02:43:32,057][25689] Fps is (10 sec: 5672.0, 60 sec: 5635.3, 300 sec: 5663.5). Total num frames: 545195008. Throughput: 0: 5929.7. Samples: 545202798. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:32,057][25689] Avg episode reward: [(0, '-27.915')] [2022-07-10 02:43:33,487][26022] Updated weights on worker 0-0, policy_version 532426 (0.00079) [2022-07-10 02:43:35,298][26022] Updated weights on worker 0-0, policy_version 532436 (0.00095) [2022-07-10 02:43:36,858][26022] Updated weights on worker 0-0, policy_version 532446 (0.00098) [2022-07-10 02:43:37,112][25689] Fps is (10 sec: 5844.2, 60 sec: 5681.9, 300 sec: 5667.0). Total num frames: 545225728. Throughput: 0: 5083.5. Samples: 545220204. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:37,112][25689] Avg episode reward: [(0, '-28.942')] [2022-07-10 02:43:39,011][26022] Updated weights on worker 0-0, policy_version 532456 (0.00084) [2022-07-10 02:43:40,563][26022] Updated weights on worker 0-0, policy_version 532466 (0.00089) [2022-07-10 02:43:42,120][25689] Fps is (10 sec: 5698.4, 60 sec: 5632.7, 300 sec: 5664.1). Total num frames: 545252352. Throughput: 0: 5932.0. Samples: 545254390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:42,121][25689] Avg episode reward: [(0, '-27.644')] [2022-07-10 02:43:42,537][26022] Updated weights on worker 0-0, policy_version 532476 (0.00085) [2022-07-10 02:43:44,073][26022] Updated weights on worker 0-0, policy_version 532486 (0.00074) [2022-07-10 02:43:46,128][26022] Updated weights on worker 0-0, policy_version 532496 (0.00091) [2022-07-10 02:43:47,121][25689] Fps is (10 sec: 5626.8, 60 sec: 5686.0, 300 sec: 5668.3). Total num frames: 545282048. Throughput: 0: 5939.8. Samples: 545288798. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:47,122][25689] Avg episode reward: [(0, '-28.192')] [2022-07-10 02:43:47,692][26022] Updated weights on worker 0-0, policy_version 532506 (0.00090) [2022-07-10 02:43:49,630][26022] Updated weights on worker 0-0, policy_version 532516 (0.00090) [2022-07-10 02:43:51,368][26022] Updated weights on worker 0-0, policy_version 532526 (0.00084) [2022-07-10 02:43:52,170][25689] Fps is (10 sec: 5705.5, 60 sec: 5658.5, 300 sec: 5665.5). Total num frames: 545309696. Throughput: 0: 5132.1. Samples: 545306024. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:52,171][25689] Avg episode reward: [(0, '-28.075')] [2022-07-10 02:43:52,767][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:43:52,778][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000532533_545313792.pth [2022-07-10 02:43:52,778][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000530542_543275008.pth [2022-07-10 02:43:53,326][26022] Updated weights on worker 0-0, policy_version 532536 (0.00083) [2022-07-10 02:43:54,995][26022] Updated weights on worker 0-0, policy_version 532546 (0.00088) [2022-07-10 02:43:56,857][26022] Updated weights on worker 0-0, policy_version 532556 (0.00088) [2022-07-10 02:43:57,176][25689] Fps is (10 sec: 5702.9, 60 sec: 5677.1, 300 sec: 5666.9). Total num frames: 545339392. Throughput: 0: 5983.0. Samples: 545340250. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:43:57,177][25689] Avg episode reward: [(0, '-28.729')] [2022-07-10 02:43:58,458][26022] Updated weights on worker 0-0, policy_version 532566 (0.00087) [2022-07-10 02:44:00,379][26022] Updated weights on worker 0-0, policy_version 532576 (0.00083) [2022-07-10 02:44:02,178][25689] Fps is (10 sec: 5627.6, 60 sec: 5627.5, 300 sec: 5675.3). Total num frames: 545366016. Throughput: 0: 5993.4. Samples: 545374608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:02,179][25689] Avg episode reward: [(0, '-28.262')] [2022-07-10 02:44:02,634][26022] Updated weights on worker 0-0, policy_version 532586 (0.00090) [2022-07-10 02:44:04,280][26022] Updated weights on worker 0-0, policy_version 532596 (0.00084) [2022-07-10 02:44:06,142][26022] Updated weights on worker 0-0, policy_version 532606 (0.00083) [2022-07-10 02:44:07,249][25689] Fps is (10 sec: 5388.0, 60 sec: 5676.0, 300 sec: 5665.1). Total num frames: 545393664. Throughput: 0: 5019.4. Samples: 545389828. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:07,249][25689] Avg episode reward: [(0, '-28.576')] [2022-07-10 02:44:07,795][26022] Updated weights on worker 0-0, policy_version 532616 (0.00087) [2022-07-10 02:44:09,773][26022] Updated weights on worker 0-0, policy_version 532626 (0.00080) [2022-07-10 02:44:11,590][26022] Updated weights on worker 0-0, policy_version 532636 (0.00084) [2022-07-10 02:44:12,297][25689] Fps is (10 sec: 5566.0, 60 sec: 5642.9, 300 sec: 5662.1). Total num frames: 545422336. Throughput: 0: 5867.2. Samples: 545424108. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:12,297][25689] Avg episode reward: [(0, '-28.382')] [2022-07-10 02:44:13,155][26022] Updated weights on worker 0-0, policy_version 532646 (0.00090) [2022-07-10 02:44:15,116][26022] Updated weights on worker 0-0, policy_version 532656 (0.00086) [2022-07-10 02:44:16,542][26022] Updated weights on worker 0-0, policy_version 532666 (0.00089) [2022-07-10 02:44:17,316][25689] Fps is (10 sec: 5696.0, 60 sec: 5645.9, 300 sec: 5662.3). Total num frames: 545451008. Throughput: 0: 5892.7. Samples: 545458930. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:17,317][25689] Avg episode reward: [(0, '-28.867')] [2022-07-10 02:44:18,616][26022] Updated weights on worker 0-0, policy_version 532676 (0.00092) [2022-07-10 02:44:20,431][26022] Updated weights on worker 0-0, policy_version 532686 (0.00087) [2022-07-10 02:44:22,122][26022] Updated weights on worker 0-0, policy_version 532696 (0.00086) [2022-07-10 02:44:22,322][25689] Fps is (10 sec: 5924.4, 60 sec: 5698.8, 300 sec: 5670.7). Total num frames: 545481728. Throughput: 0: 5049.0. Samples: 545476312. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:22,322][25689] Avg episode reward: [(0, '-28.469')] [2022-07-10 02:44:23,911][26022] Updated weights on worker 0-0, policy_version 532706 (0.00083) [2022-07-10 02:44:25,836][26022] Updated weights on worker 0-0, policy_version 532716 (0.00089) [2022-07-10 02:44:27,328][25689] Fps is (10 sec: 5932.1, 60 sec: 5698.4, 300 sec: 5671.4). Total num frames: 545510400. Throughput: 0: 6028.5. Samples: 545510876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:27,329][25689] Avg episode reward: [(0, '-29.061')] [2022-07-10 02:44:27,378][26022] Updated weights on worker 0-0, policy_version 532726 (0.00091) [2022-07-10 02:44:29,567][26022] Updated weights on worker 0-0, policy_version 532736 (0.00087) [2022-07-10 02:44:30,853][26022] Updated weights on worker 0-0, policy_version 532746 (0.00086) [2022-07-10 02:44:32,395][25689] Fps is (10 sec: 5489.5, 60 sec: 5668.3, 300 sec: 5664.9). Total num frames: 545537024. Throughput: 0: 6036.9. Samples: 545545438. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:32,395][25689] Avg episode reward: [(0, '-28.866')] [2022-07-10 02:44:32,929][26022] Updated weights on worker 0-0, policy_version 532756 (0.00089) [2022-07-10 02:44:34,294][26022] Updated weights on worker 0-0, policy_version 532766 (0.00081) [2022-07-10 02:44:36,398][26022] Updated weights on worker 0-0, policy_version 532776 (0.00628) [2022-07-10 02:44:37,402][25689] Fps is (10 sec: 5794.0, 60 sec: 5689.8, 300 sec: 5675.6). Total num frames: 545568768. Throughput: 0: 5171.5. Samples: 545562804. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:37,403][25689] Avg episode reward: [(0, '-30.483')] [2022-07-10 02:44:38,041][26022] Updated weights on worker 0-0, policy_version 532786 (0.00079) [2022-07-10 02:44:39,954][26022] Updated weights on worker 0-0, policy_version 532796 (0.00091) [2022-07-10 02:44:41,602][26022] Updated weights on worker 0-0, policy_version 532806 (0.00081) [2022-07-10 02:44:42,415][25689] Fps is (10 sec: 5927.5, 60 sec: 5706.4, 300 sec: 5672.0). Total num frames: 545596416. Throughput: 0: 6016.9. Samples: 545597208. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:42,415][25689] Avg episode reward: [(0, '-30.555')] [2022-07-10 02:44:43,674][26022] Updated weights on worker 0-0, policy_version 532816 (0.00089) [2022-07-10 02:44:45,194][26022] Updated weights on worker 0-0, policy_version 532826 (0.00082) [2022-07-10 02:44:47,255][26022] Updated weights on worker 0-0, policy_version 532836 (0.00082) [2022-07-10 02:44:47,423][25689] Fps is (10 sec: 5722.7, 60 sec: 5705.7, 300 sec: 5676.4). Total num frames: 545626112. Throughput: 0: 6027.4. Samples: 545631992. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 02:44:47,423][25689] Avg episode reward: [(0, '-30.312')] [2022-07-10 02:44:48,755][26022] Updated weights on worker 0-0, policy_version 532846 (0.00086) [2022-07-10 02:44:50,676][26022] Updated weights on worker 0-0, policy_version 532856 (0.00089) [2022-07-10 02:44:52,295][26022] Updated weights on worker 0-0, policy_version 532866 (0.00080) [2022-07-10 02:44:52,500][25689] Fps is (10 sec: 5787.5, 60 sec: 5720.1, 300 sec: 5675.4). Total num frames: 545654784. Throughput: 0: 5166.9. Samples: 545649318. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:44:52,500][25689] Avg episode reward: [(0, '-30.243')] [2022-07-10 02:44:54,122][26022] Updated weights on worker 0-0, policy_version 532876 (0.00086) [2022-07-10 02:44:55,835][26022] Updated weights on worker 0-0, policy_version 532886 (0.00090) [2022-07-10 02:44:57,528][25689] Fps is (10 sec: 5775.8, 60 sec: 5717.9, 300 sec: 5675.8). Total num frames: 545684480. Throughput: 0: 6034.9. Samples: 545684264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:44:57,529][25689] Avg episode reward: [(0, '-31.528')] [2022-07-10 02:44:57,651][26022] Updated weights on worker 0-0, policy_version 532896 (0.00082) [2022-07-10 02:44:59,342][26022] Updated weights on worker 0-0, policy_version 532906 (0.00086) [2022-07-10 02:45:01,292][26022] Updated weights on worker 0-0, policy_version 532916 (0.00087) [2022-07-10 02:45:02,555][25689] Fps is (10 sec: 5702.9, 60 sec: 5732.5, 300 sec: 5682.2). Total num frames: 545712128. Throughput: 0: 5997.4. Samples: 545717998. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:02,557][25689] Avg episode reward: [(0, '-30.019')] [2022-07-10 02:45:03,289][26022] Updated weights on worker 0-0, policy_version 532926 (0.00091) [2022-07-10 02:45:05,271][26022] Updated weights on worker 0-0, policy_version 532936 (0.00099) [2022-07-10 02:45:06,779][26022] Updated weights on worker 0-0, policy_version 532946 (0.00078) [2022-07-10 02:45:07,560][25689] Fps is (10 sec: 5308.0, 60 sec: 5704.8, 300 sec: 5673.4). Total num frames: 545737728. Throughput: 0: 5068.1. Samples: 545734050. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:07,561][25689] Avg episode reward: [(0, '-30.291')] [2022-07-10 02:45:08,836][26022] Updated weights on worker 0-0, policy_version 532956 (0.00607) [2022-07-10 02:45:10,793][26022] Updated weights on worker 0-0, policy_version 532966 (0.00086) [2022-07-10 02:45:12,560][26022] Updated weights on worker 0-0, policy_version 532976 (0.00092) [2022-07-10 02:45:12,603][25689] Fps is (10 sec: 5605.1, 60 sec: 5739.3, 300 sec: 5680.3). Total num frames: 545768448. Throughput: 0: 5882.4. Samples: 545767572. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:12,603][25689] Avg episode reward: [(0, '-31.822')] [2022-07-10 02:45:14,279][26022] Updated weights on worker 0-0, policy_version 532986 (0.00087) [2022-07-10 02:45:16,131][26022] Updated weights on worker 0-0, policy_version 532996 (0.00085) [2022-07-10 02:45:17,646][25689] Fps is (10 sec: 5888.3, 60 sec: 5737.0, 300 sec: 5679.6). Total num frames: 545797120. Throughput: 0: 5837.8. Samples: 545801710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:17,647][25689] Avg episode reward: [(0, '-31.799')] [2022-07-10 02:45:17,692][26022] Updated weights on worker 0-0, policy_version 533006 (0.00091) [2022-07-10 02:45:19,660][26022] Updated weights on worker 0-0, policy_version 533016 (0.00089) [2022-07-10 02:45:21,268][26022] Updated weights on worker 0-0, policy_version 533026 (0.00089) [2022-07-10 02:45:22,711][25689] Fps is (10 sec: 5470.2, 60 sec: 5663.5, 300 sec: 5671.7). Total num frames: 545823744. Throughput: 0: 5002.0. Samples: 545818822. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:22,712][25689] Avg episode reward: [(0, '-30.553')] [2022-07-10 02:45:23,185][26022] Updated weights on worker 0-0, policy_version 533036 (0.00090) [2022-07-10 02:45:25,076][26022] Updated weights on worker 0-0, policy_version 533046 (0.00094) [2022-07-10 02:45:26,819][26022] Updated weights on worker 0-0, policy_version 533056 (0.00088) [2022-07-10 02:45:27,752][25689] Fps is (10 sec: 5573.0, 60 sec: 5677.2, 300 sec: 5673.1). Total num frames: 545853440. Throughput: 0: 5889.8. Samples: 545852978. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:27,753][25689] Avg episode reward: [(0, '-28.999')] [2022-07-10 02:45:28,592][26022] Updated weights on worker 0-0, policy_version 533066 (0.00089) [2022-07-10 02:45:30,531][26022] Updated weights on worker 0-0, policy_version 533076 (0.00095) [2022-07-10 02:45:32,132][26022] Updated weights on worker 0-0, policy_version 533086 (0.00084) [2022-07-10 02:45:32,851][25689] Fps is (10 sec: 5857.3, 60 sec: 5725.0, 300 sec: 5678.8). Total num frames: 545883136. Throughput: 0: 5932.5. Samples: 545887696. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:32,853][25689] Avg episode reward: [(0, '-29.956')] [2022-07-10 02:45:34,156][26022] Updated weights on worker 0-0, policy_version 533096 (0.00103) [2022-07-10 02:45:35,803][26022] Updated weights on worker 0-0, policy_version 533106 (0.00084) [2022-07-10 02:45:37,669][26022] Updated weights on worker 0-0, policy_version 533116 (0.00086) [2022-07-10 02:45:37,922][25689] Fps is (10 sec: 5840.0, 60 sec: 5685.1, 300 sec: 5674.8). Total num frames: 545912832. Throughput: 0: 5935.5. Samples: 545922056. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:37,924][25689] Avg episode reward: [(0, '-29.768')] [2022-07-10 02:45:39,347][26022] Updated weights on worker 0-0, policy_version 533126 (0.00095) [2022-07-10 02:45:41,156][26022] Updated weights on worker 0-0, policy_version 533136 (0.00089) [2022-07-10 02:45:42,942][25689] Fps is (10 sec: 5682.9, 60 sec: 5684.4, 300 sec: 5674.5). Total num frames: 545940480. Throughput: 0: 5936.0. Samples: 545938910. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:42,944][25689] Avg episode reward: [(0, '-27.912')] [2022-07-10 02:45:43,012][26022] Updated weights on worker 0-0, policy_version 533146 (0.00088) [2022-07-10 02:45:44,619][26022] Updated weights on worker 0-0, policy_version 533156 (0.00095) [2022-07-10 02:45:46,818][26022] Updated weights on worker 0-0, policy_version 533166 (0.00099) [2022-07-10 02:45:47,961][25689] Fps is (10 sec: 5813.9, 60 sec: 5700.3, 300 sec: 5682.5). Total num frames: 545971200. Throughput: 0: 5977.9. Samples: 545973788. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:47,963][25689] Avg episode reward: [(0, '-29.029')] [2022-07-10 02:45:48,067][26022] Updated weights on worker 0-0, policy_version 533176 (0.00088) [2022-07-10 02:45:50,190][26022] Updated weights on worker 0-0, policy_version 533186 (0.00088) [2022-07-10 02:45:51,947][26022] Updated weights on worker 0-0, policy_version 533196 (0.00085) [2022-07-10 02:45:52,965][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:45:52,988][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000533201_545997824.pth [2022-07-10 02:45:52,988][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000531205_543953920.pth [2022-07-10 02:45:53,065][25689] Fps is (10 sec: 5664.5, 60 sec: 5663.9, 300 sec: 5673.9). Total num frames: 545997824. Throughput: 0: 5949.6. Samples: 546007962. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:53,066][25689] Avg episode reward: [(0, '-29.875')] [2022-07-10 02:45:53,639][26022] Updated weights on worker 0-0, policy_version 533206 (0.00087) [2022-07-10 02:45:55,545][26022] Updated weights on worker 0-0, policy_version 533216 (0.00086) [2022-07-10 02:45:57,162][26022] Updated weights on worker 0-0, policy_version 533226 (0.00626) [2022-07-10 02:45:58,090][25689] Fps is (10 sec: 5560.6, 60 sec: 5664.3, 300 sec: 5677.2). Total num frames: 546027520. Throughput: 0: 5123.7. Samples: 546025390. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:45:58,090][25689] Avg episode reward: [(0, '-29.847')] [2022-07-10 02:45:59,051][26022] Updated weights on worker 0-0, policy_version 533236 (0.00091) [2022-07-10 02:46:00,778][26022] Updated weights on worker 0-0, policy_version 533246 (0.00094) [2022-07-10 02:46:02,821][26022] Updated weights on worker 0-0, policy_version 533256 (0.00086) [2022-07-10 02:46:03,115][25689] Fps is (10 sec: 5706.2, 60 sec: 5664.5, 300 sec: 5680.3). Total num frames: 546055168. Throughput: 0: 5898.9. Samples: 546057908. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:03,115][25689] Avg episode reward: [(0, '-28.393')] [2022-07-10 02:46:04,676][26022] Updated weights on worker 0-0, policy_version 533266 (0.00087) [2022-07-10 02:46:06,338][26022] Updated weights on worker 0-0, policy_version 533276 (0.00092) [2022-07-10 02:46:08,142][25689] Fps is (10 sec: 5603.0, 60 sec: 5713.1, 300 sec: 5681.7). Total num frames: 546083840. Throughput: 0: 5874.0. Samples: 546092326. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:08,142][25689] Avg episode reward: [(0, '-28.590')] [2022-07-10 02:46:08,355][26022] Updated weights on worker 0-0, policy_version 533286 (0.00097) [2022-07-10 02:46:10,063][26022] Updated weights on worker 0-0, policy_version 533296 (0.00086) [2022-07-10 02:46:11,950][26022] Updated weights on worker 0-0, policy_version 533306 (0.00082) [2022-07-10 02:46:13,190][25689] Fps is (10 sec: 5691.3, 60 sec: 5678.8, 300 sec: 5679.0). Total num frames: 546112512. Throughput: 0: 5039.8. Samples: 546109388. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:13,191][25689] Avg episode reward: [(0, '-28.580')] [2022-07-10 02:46:13,896][26022] Updated weights on worker 0-0, policy_version 533316 (0.00104) [2022-07-10 02:46:15,285][26022] Updated weights on worker 0-0, policy_version 533326 (0.00100) [2022-07-10 02:46:17,311][26022] Updated weights on worker 0-0, policy_version 533336 (0.00092) [2022-07-10 02:46:18,194][25689] Fps is (10 sec: 5704.7, 60 sec: 5682.6, 300 sec: 5677.4). Total num frames: 546141184. Throughput: 0: 5900.4. Samples: 546144010. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:18,194][25689] Avg episode reward: [(0, '-27.917')] [2022-07-10 02:46:18,883][26022] Updated weights on worker 0-0, policy_version 533346 (0.00089) [2022-07-10 02:46:20,761][26022] Updated weights on worker 0-0, policy_version 533356 (0.00085) [2022-07-10 02:46:22,698][26022] Updated weights on worker 0-0, policy_version 533366 (0.00083) [2022-07-10 02:46:23,215][25689] Fps is (10 sec: 5618.3, 60 sec: 5703.6, 300 sec: 5677.2). Total num frames: 546168832. Throughput: 0: 6008.9. Samples: 546178688. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:23,215][25689] Avg episode reward: [(0, '-27.763')] [2022-07-10 02:46:24,311][26022] Updated weights on worker 0-0, policy_version 533376 (0.00095) [2022-07-10 02:46:26,386][26022] Updated weights on worker 0-0, policy_version 533386 (0.00096) [2022-07-10 02:46:27,811][26022] Updated weights on worker 0-0, policy_version 533396 (0.00094) [2022-07-10 02:46:28,242][25689] Fps is (10 sec: 5706.6, 60 sec: 5704.9, 300 sec: 5678.1). Total num frames: 546198528. Throughput: 0: 5139.2. Samples: 546195628. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:28,243][25689] Avg episode reward: [(0, '-27.440')] [2022-07-10 02:46:30,019][26022] Updated weights on worker 0-0, policy_version 533406 (0.00089) [2022-07-10 02:46:31,506][26022] Updated weights on worker 0-0, policy_version 533416 (0.00092) [2022-07-10 02:46:33,326][25689] Fps is (10 sec: 5772.4, 60 sec: 5689.4, 300 sec: 5681.7). Total num frames: 546227200. Throughput: 0: 5986.5. Samples: 546229932. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:33,327][25689] Avg episode reward: [(0, '-26.890')] [2022-07-10 02:46:33,478][26022] Updated weights on worker 0-0, policy_version 533426 (0.00085) [2022-07-10 02:46:35,210][26022] Updated weights on worker 0-0, policy_version 533436 (0.00093) [2022-07-10 02:46:36,835][26022] Updated weights on worker 0-0, policy_version 533446 (0.00084) [2022-07-10 02:46:38,336][25689] Fps is (10 sec: 5680.9, 60 sec: 5678.1, 300 sec: 5678.6). Total num frames: 546255872. Throughput: 0: 5972.4. Samples: 546264310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:38,337][25689] Avg episode reward: [(0, '-26.560')] [2022-07-10 02:46:38,738][26022] Updated weights on worker 0-0, policy_version 533456 (0.00088) [2022-07-10 02:46:40,583][26022] Updated weights on worker 0-0, policy_version 533466 (0.00093) [2022-07-10 02:46:42,349][26022] Updated weights on worker 0-0, policy_version 533476 (0.00084) [2022-07-10 02:46:43,363][25689] Fps is (10 sec: 5713.6, 60 sec: 5694.5, 300 sec: 5681.8). Total num frames: 546284544. Throughput: 0: 5093.1. Samples: 546281304. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:43,365][25689] Avg episode reward: [(0, '-26.399')] [2022-07-10 02:46:44,331][26022] Updated weights on worker 0-0, policy_version 533486 (0.00095) [2022-07-10 02:46:46,007][26022] Updated weights on worker 0-0, policy_version 533496 (0.00097) [2022-07-10 02:46:47,822][26022] Updated weights on worker 0-0, policy_version 533506 (0.00092) [2022-07-10 02:46:48,376][25689] Fps is (10 sec: 5711.9, 60 sec: 5661.2, 300 sec: 5678.9). Total num frames: 546313216. Throughput: 0: 5958.5. Samples: 546315594. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:48,376][25689] Avg episode reward: [(0, '-27.461')] [2022-07-10 02:46:49,559][26022] Updated weights on worker 0-0, policy_version 533516 (0.00082) [2022-07-10 02:46:51,336][26022] Updated weights on worker 0-0, policy_version 533526 (0.00083) [2022-07-10 02:46:53,120][26022] Updated weights on worker 0-0, policy_version 533536 (0.00086) [2022-07-10 02:46:53,428][25689] Fps is (10 sec: 5697.0, 60 sec: 5699.9, 300 sec: 5678.4). Total num frames: 546341888. Throughput: 0: 5976.4. Samples: 546350070. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:53,429][25689] Avg episode reward: [(0, '-26.848')] [2022-07-10 02:46:54,988][26022] Updated weights on worker 0-0, policy_version 533546 (0.00096) [2022-07-10 02:46:56,638][26022] Updated weights on worker 0-0, policy_version 533556 (0.00617) [2022-07-10 02:46:58,455][25689] Fps is (10 sec: 5689.2, 60 sec: 5682.7, 300 sec: 5675.0). Total num frames: 546370560. Throughput: 0: 5119.3. Samples: 546367304. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:46:58,456][25689] Avg episode reward: [(0, '-27.746')] [2022-07-10 02:46:58,528][26022] Updated weights on worker 0-0, policy_version 533566 (0.00090) [2022-07-10 02:47:00,169][26022] Updated weights on worker 0-0, policy_version 533576 (0.00089) [2022-07-10 02:47:01,999][26022] Updated weights on worker 0-0, policy_version 533586 (0.00083) [2022-07-10 02:47:03,475][25689] Fps is (10 sec: 5504.1, 60 sec: 5666.3, 300 sec: 5681.8). Total num frames: 546397184. Throughput: 0: 5975.8. Samples: 546401488. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:03,477][25689] Avg episode reward: [(0, '-27.838')] [2022-07-10 02:47:04,243][26022] Updated weights on worker 0-0, policy_version 533596 (0.00087) [2022-07-10 02:47:06,069][26022] Updated weights on worker 0-0, policy_version 533606 (0.00093) [2022-07-10 02:47:07,762][26022] Updated weights on worker 0-0, policy_version 533616 (0.00083) [2022-07-10 02:47:08,499][25689] Fps is (10 sec: 5607.5, 60 sec: 5683.5, 300 sec: 5683.7). Total num frames: 546426880. Throughput: 0: 5884.7. Samples: 546434012. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:08,501][25689] Avg episode reward: [(0, '-28.251')] [2022-07-10 02:47:09,795][26022] Updated weights on worker 0-0, policy_version 533626 (0.00100) [2022-07-10 02:47:11,214][26022] Updated weights on worker 0-0, policy_version 533636 (0.00088) [2022-07-10 02:47:13,432][26022] Updated weights on worker 0-0, policy_version 533646 (0.00083) [2022-07-10 02:47:13,587][25689] Fps is (10 sec: 5670.7, 60 sec: 5662.8, 300 sec: 5679.6). Total num frames: 546454528. Throughput: 0: 5024.4. Samples: 546451356. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:13,587][25689] Avg episode reward: [(0, '-27.911')] [2022-07-10 02:47:14,796][26022] Updated weights on worker 0-0, policy_version 533656 (0.00086) [2022-07-10 02:47:16,804][26022] Updated weights on worker 0-0, policy_version 533666 (0.00087) [2022-07-10 02:47:18,414][26022] Updated weights on worker 0-0, policy_version 533676 (0.00086) [2022-07-10 02:47:18,660][25689] Fps is (10 sec: 5744.6, 60 sec: 5690.2, 300 sec: 5681.7). Total num frames: 546485248. Throughput: 0: 5873.6. Samples: 546485976. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:18,660][25689] Avg episode reward: [(0, '-26.677')] [2022-07-10 02:47:20,292][26022] Updated weights on worker 0-0, policy_version 533686 (0.00084) [2022-07-10 02:47:22,032][26022] Updated weights on worker 0-0, policy_version 533696 (0.00087) [2022-07-10 02:47:23,729][25689] Fps is (10 sec: 5855.9, 60 sec: 5702.6, 300 sec: 5680.4). Total num frames: 546513920. Throughput: 0: 5886.9. Samples: 546520724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:23,730][25689] Avg episode reward: [(0, '-27.105')] [2022-07-10 02:47:23,826][26022] Updated weights on worker 0-0, policy_version 533706 (0.00091) [2022-07-10 02:47:25,555][26022] Updated weights on worker 0-0, policy_version 533716 (0.00084) [2022-07-10 02:47:27,375][26022] Updated weights on worker 0-0, policy_version 533726 (0.00084) [2022-07-10 02:47:28,747][25689] Fps is (10 sec: 5684.9, 60 sec: 5686.6, 300 sec: 5681.6). Total num frames: 546542592. Throughput: 0: 5119.3. Samples: 546537666. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:28,747][25689] Avg episode reward: [(0, '-25.928')] [2022-07-10 02:47:29,122][26022] Updated weights on worker 0-0, policy_version 533736 (0.00087) [2022-07-10 02:47:31,076][26022] Updated weights on worker 0-0, policy_version 533746 (0.00086) [2022-07-10 02:47:32,664][26022] Updated weights on worker 0-0, policy_version 533756 (0.00084) [2022-07-10 02:47:33,887][25689] Fps is (10 sec: 5746.2, 60 sec: 5698.2, 300 sec: 5686.0). Total num frames: 546572288. Throughput: 0: 5952.8. Samples: 546572196. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:33,887][25689] Avg episode reward: [(0, '-26.525')] [2022-07-10 02:47:34,606][26022] Updated weights on worker 0-0, policy_version 533766 (0.00091) [2022-07-10 02:47:36,289][26022] Updated weights on worker 0-0, policy_version 533776 (0.00090) [2022-07-10 02:47:38,054][26022] Updated weights on worker 0-0, policy_version 533786 (0.00086) [2022-07-10 02:47:38,911][25689] Fps is (10 sec: 5742.5, 60 sec: 5696.9, 300 sec: 5682.6). Total num frames: 546600960. Throughput: 0: 5967.9. Samples: 546606834. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:38,911][25689] Avg episode reward: [(0, '-27.384')] [2022-07-10 02:47:39,891][26022] Updated weights on worker 0-0, policy_version 533796 (0.00089) [2022-07-10 02:47:41,533][26022] Updated weights on worker 0-0, policy_version 533806 (0.00093) [2022-07-10 02:47:43,360][26022] Updated weights on worker 0-0, policy_version 533816 (0.00092) [2022-07-10 02:47:43,928][25689] Fps is (10 sec: 5812.8, 60 sec: 5714.7, 300 sec: 5693.1). Total num frames: 546630656. Throughput: 0: 5120.8. Samples: 546624160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 02:47:43,929][25689] Avg episode reward: [(0, '-27.299')] [2022-07-10 02:47:45,309][26022] Updated weights on worker 0-0, policy_version 533826 (0.00086) [2022-07-10 02:47:47,008][26022] Updated weights on worker 0-0, policy_version 533836 (0.00090) [2022-07-10 02:47:48,761][26022] Updated weights on worker 0-0, policy_version 533846 (0.00085) [2022-07-10 02:47:49,016][25689] Fps is (10 sec: 5775.8, 60 sec: 5707.6, 300 sec: 5690.3). Total num frames: 546659328. Throughput: 0: 5962.9. Samples: 546658534. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:47:49,017][25689] Avg episode reward: [(0, '-27.287')] [2022-07-10 02:47:50,558][26022] Updated weights on worker 0-0, policy_version 533856 (0.00085) [2022-07-10 02:47:52,265][26022] Updated weights on worker 0-0, policy_version 533866 (0.00087) [2022-07-10 02:47:53,242][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:47:53,256][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000533870_546682880.pth [2022-07-10 02:47:53,257][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000531870_544634880.pth [2022-07-10 02:47:54,062][26022] Updated weights on worker 0-0, policy_version 533876 (0.00078) [2022-07-10 02:47:54,118][25689] Fps is (10 sec: 5728.2, 60 sec: 5719.9, 300 sec: 5692.2). Total num frames: 546689024. Throughput: 0: 5977.3. Samples: 546693122. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:47:54,118][25689] Avg episode reward: [(0, '-26.800')] [2022-07-10 02:47:55,895][26022] Updated weights on worker 0-0, policy_version 533886 (0.00084) [2022-07-10 02:47:57,569][26022] Updated weights on worker 0-0, policy_version 533896 (0.00086) [2022-07-10 02:47:59,123][25689] Fps is (10 sec: 5775.0, 60 sec: 5721.9, 300 sec: 5688.9). Total num frames: 546717696. Throughput: 0: 5125.2. Samples: 546710428. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:47:59,124][25689] Avg episode reward: [(0, '-27.458')] [2022-07-10 02:47:59,423][26022] Updated weights on worker 0-0, policy_version 533906 (0.00096) [2022-07-10 02:48:01,312][26022] Updated weights on worker 0-0, policy_version 533916 (0.00094) [2022-07-10 02:48:03,238][26022] Updated weights on worker 0-0, policy_version 533926 (0.00086) [2022-07-10 02:48:04,140][25689] Fps is (10 sec: 5517.4, 60 sec: 5722.2, 300 sec: 5696.4). Total num frames: 546744320. Throughput: 0: 5918.9. Samples: 546743790. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:04,140][25689] Avg episode reward: [(0, '-25.562')] [2022-07-10 02:48:05,311][26022] Updated weights on worker 0-0, policy_version 533936 (0.00387) [2022-07-10 02:48:06,963][26022] Updated weights on worker 0-0, policy_version 533946 (0.00087) [2022-07-10 02:48:08,775][26022] Updated weights on worker 0-0, policy_version 533956 (0.00089) [2022-07-10 02:48:09,157][25689] Fps is (10 sec: 5511.0, 60 sec: 5705.9, 300 sec: 5690.2). Total num frames: 546772992. Throughput: 0: 5904.7. Samples: 546777458. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:09,159][25689] Avg episode reward: [(0, '-25.549')] [2022-07-10 02:48:10,477][26022] Updated weights on worker 0-0, policy_version 533966 (0.00084) [2022-07-10 02:48:12,367][26022] Updated weights on worker 0-0, policy_version 533976 (0.00079) [2022-07-10 02:48:14,187][26022] Updated weights on worker 0-0, policy_version 533986 (0.00084) [2022-07-10 02:48:14,199][25689] Fps is (10 sec: 5700.9, 60 sec: 5727.2, 300 sec: 5690.4). Total num frames: 546801664. Throughput: 0: 5051.5. Samples: 546794560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:14,199][25689] Avg episode reward: [(0, '-26.009')] [2022-07-10 02:48:16,085][26022] Updated weights on worker 0-0, policy_version 533996 (0.00089) [2022-07-10 02:48:17,643][26022] Updated weights on worker 0-0, policy_version 534006 (0.00100) [2022-07-10 02:48:19,228][25689] Fps is (10 sec: 5694.2, 60 sec: 5697.5, 300 sec: 5693.8). Total num frames: 546830336. Throughput: 0: 5906.6. Samples: 546829176. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:19,228][25689] Avg episode reward: [(0, '-26.505')] [2022-07-10 02:48:19,631][26022] Updated weights on worker 0-0, policy_version 534016 (0.00493) [2022-07-10 02:48:21,097][26022] Updated weights on worker 0-0, policy_version 534026 (0.00093) [2022-07-10 02:48:23,178][26022] Updated weights on worker 0-0, policy_version 534036 (0.00093) [2022-07-10 02:48:24,241][25689] Fps is (10 sec: 5812.0, 60 sec: 5719.7, 300 sec: 5697.1). Total num frames: 546860032. Throughput: 0: 5946.5. Samples: 546863324. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:24,242][25689] Avg episode reward: [(0, '-26.036')] [2022-07-10 02:48:24,935][26022] Updated weights on worker 0-0, policy_version 534046 (0.00088) [2022-07-10 02:48:26,671][26022] Updated weights on worker 0-0, policy_version 534056 (0.00099) [2022-07-10 02:48:28,279][26022] Updated weights on worker 0-0, policy_version 534066 (0.00083) [2022-07-10 02:48:29,264][25689] Fps is (10 sec: 5713.7, 60 sec: 5702.3, 300 sec: 5695.3). Total num frames: 546887680. Throughput: 0: 5135.2. Samples: 546880710. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:29,266][25689] Avg episode reward: [(0, '-26.236')] [2022-07-10 02:48:30,246][26022] Updated weights on worker 0-0, policy_version 534076 (0.00087) [2022-07-10 02:48:32,191][26022] Updated weights on worker 0-0, policy_version 534086 (0.00083) [2022-07-10 02:48:33,777][26022] Updated weights on worker 0-0, policy_version 534096 (0.00099) [2022-07-10 02:48:34,333][25689] Fps is (10 sec: 5581.0, 60 sec: 5692.1, 300 sec: 5688.1). Total num frames: 546916352. Throughput: 0: 5985.9. Samples: 546915080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:34,333][25689] Avg episode reward: [(0, '-27.606')] [2022-07-10 02:48:35,644][26022] Updated weights on worker 0-0, policy_version 534106 (0.00086) [2022-07-10 02:48:37,386][26022] Updated weights on worker 0-0, policy_version 534116 (0.00091) [2022-07-10 02:48:39,069][26022] Updated weights on worker 0-0, policy_version 534126 (0.00085) [2022-07-10 02:48:39,349][25689] Fps is (10 sec: 5685.9, 60 sec: 5692.8, 300 sec: 5694.8). Total num frames: 546945024. Throughput: 0: 5984.5. Samples: 546949594. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:39,350][25689] Avg episode reward: [(0, '-27.430')] [2022-07-10 02:48:41,155][26022] Updated weights on worker 0-0, policy_version 534136 (0.00088) [2022-07-10 02:48:42,590][26022] Updated weights on worker 0-0, policy_version 534146 (0.00096) [2022-07-10 02:48:44,355][25689] Fps is (10 sec: 5619.5, 60 sec: 5660.0, 300 sec: 5687.9). Total num frames: 546972672. Throughput: 0: 5957.4. Samples: 546983150. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:44,356][25689] Avg episode reward: [(0, '-26.681')] [2022-07-10 02:48:44,802][26022] Updated weights on worker 0-0, policy_version 534156 (0.00087) [2022-07-10 02:48:46,319][26022] Updated weights on worker 0-0, policy_version 534166 (0.00091) [2022-07-10 02:48:48,279][26022] Updated weights on worker 0-0, policy_version 534176 (0.00092) [2022-07-10 02:48:49,361][25689] Fps is (10 sec: 5727.9, 60 sec: 5684.7, 300 sec: 5695.6). Total num frames: 547002368. Throughput: 0: 5955.0. Samples: 547000386. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:49,361][25689] Avg episode reward: [(0, '-26.814')] [2022-07-10 02:48:49,942][26022] Updated weights on worker 0-0, policy_version 534186 (0.00086) [2022-07-10 02:48:51,939][26022] Updated weights on worker 0-0, policy_version 534196 (0.00080) [2022-07-10 02:48:53,681][26022] Updated weights on worker 0-0, policy_version 534206 (0.00086) [2022-07-10 02:48:54,487][25689] Fps is (10 sec: 5761.0, 60 sec: 5665.4, 300 sec: 5689.8). Total num frames: 547031040. Throughput: 0: 5924.1. Samples: 547034474. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:54,487][25689] Avg episode reward: [(0, '-26.414')] [2022-07-10 02:48:55,399][26022] Updated weights on worker 0-0, policy_version 534216 (0.00085) [2022-07-10 02:48:57,198][26022] Updated weights on worker 0-0, policy_version 534226 (0.00506) [2022-07-10 02:48:58,883][26022] Updated weights on worker 0-0, policy_version 534236 (0.00082) [2022-07-10 02:48:59,515][25689] Fps is (10 sec: 5647.3, 60 sec: 5663.3, 300 sec: 5696.2). Total num frames: 547059712. Throughput: 0: 5922.9. Samples: 547069032. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:48:59,515][25689] Avg episode reward: [(0, '-27.249')] [2022-07-10 02:49:00,833][26022] Updated weights on worker 0-0, policy_version 534246 (0.00087) [2022-07-10 02:49:02,900][26022] Updated weights on worker 0-0, policy_version 534256 (0.00093) [2022-07-10 02:49:04,577][25689] Fps is (10 sec: 5581.6, 60 sec: 5676.0, 300 sec: 5696.4). Total num frames: 547087360. Throughput: 0: 4996.1. Samples: 547084180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:04,577][25689] Avg episode reward: [(0, '-26.838')] [2022-07-10 02:49:04,638][26022] Updated weights on worker 0-0, policy_version 534266 (0.00090) [2022-07-10 02:49:06,530][26022] Updated weights on worker 0-0, policy_version 534276 (0.00095) [2022-07-10 02:49:08,197][26022] Updated weights on worker 0-0, policy_version 534286 (0.00081) [2022-07-10 02:49:09,599][25689] Fps is (10 sec: 5686.3, 60 sec: 5692.4, 300 sec: 5700.3). Total num frames: 547117056. Throughput: 0: 5833.8. Samples: 547118454. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:09,601][25689] Avg episode reward: [(0, '-27.240')] [2022-07-10 02:49:10,206][26022] Updated weights on worker 0-0, policy_version 534296 (0.00091) [2022-07-10 02:49:11,894][26022] Updated weights on worker 0-0, policy_version 534306 (0.00086) [2022-07-10 02:49:13,759][26022] Updated weights on worker 0-0, policy_version 534316 (0.00097) [2022-07-10 02:49:14,709][25689] Fps is (10 sec: 5558.6, 60 sec: 5652.2, 300 sec: 5691.7). Total num frames: 547143680. Throughput: 0: 5839.5. Samples: 547152562. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:14,709][25689] Avg episode reward: [(0, '-27.953')] [2022-07-10 02:49:15,493][26022] Updated weights on worker 0-0, policy_version 534326 (0.00083) [2022-07-10 02:49:17,318][26022] Updated weights on worker 0-0, policy_version 534336 (0.00592) [2022-07-10 02:49:19,062][26022] Updated weights on worker 0-0, policy_version 534346 (0.00095) [2022-07-10 02:49:19,723][25689] Fps is (10 sec: 5664.7, 60 sec: 5687.5, 300 sec: 5691.6). Total num frames: 547174400. Throughput: 0: 4991.7. Samples: 547169902. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:19,723][25689] Avg episode reward: [(0, '-28.571')] [2022-07-10 02:49:21,022][26022] Updated weights on worker 0-0, policy_version 534356 (0.00085) [2022-07-10 02:49:22,622][26022] Updated weights on worker 0-0, policy_version 534366 (0.00080) [2022-07-10 02:49:24,420][26022] Updated weights on worker 0-0, policy_version 534376 (0.00090) [2022-07-10 02:49:24,725][25689] Fps is (10 sec: 5827.2, 60 sec: 5654.6, 300 sec: 5688.2). Total num frames: 547202048. Throughput: 0: 5973.2. Samples: 547204532. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:24,726][25689] Avg episode reward: [(0, '-28.695')] [2022-07-10 02:49:26,122][26022] Updated weights on worker 0-0, policy_version 534386 (0.00092) [2022-07-10 02:49:28,098][26022] Updated weights on worker 0-0, policy_version 534396 (0.00086) [2022-07-10 02:49:29,763][25689] Fps is (10 sec: 5609.2, 60 sec: 5670.2, 300 sec: 5695.6). Total num frames: 547230720. Throughput: 0: 5959.3. Samples: 547238616. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:29,764][25689] Avg episode reward: [(0, '-28.651')] [2022-07-10 02:49:29,867][26022] Updated weights on worker 0-0, policy_version 534406 (0.00083) [2022-07-10 02:49:31,753][26022] Updated weights on worker 0-0, policy_version 534416 (0.00087) [2022-07-10 02:49:33,387][26022] Updated weights on worker 0-0, policy_version 534426 (0.00094) [2022-07-10 02:49:34,875][25689] Fps is (10 sec: 5650.1, 60 sec: 5666.2, 300 sec: 5683.3). Total num frames: 547259392. Throughput: 0: 5113.6. Samples: 547255682. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:34,876][25689] Avg episode reward: [(0, '-28.278')] [2022-07-10 02:49:35,292][26022] Updated weights on worker 0-0, policy_version 534436 (0.00086) [2022-07-10 02:49:37,035][26022] Updated weights on worker 0-0, policy_version 534446 (0.00090) [2022-07-10 02:49:38,795][26022] Updated weights on worker 0-0, policy_version 534456 (0.00088) [2022-07-10 02:49:39,885][25689] Fps is (10 sec: 5766.4, 60 sec: 5683.6, 300 sec: 5690.2). Total num frames: 547289088. Throughput: 0: 5962.0. Samples: 547290112. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:39,887][25689] Avg episode reward: [(0, '-29.765')] [2022-07-10 02:49:40,610][26022] Updated weights on worker 0-0, policy_version 534466 (0.00088) [2022-07-10 02:49:42,256][26022] Updated weights on worker 0-0, policy_version 534476 (0.00085) [2022-07-10 02:49:44,250][26022] Updated weights on worker 0-0, policy_version 534486 (0.00089) [2022-07-10 02:49:44,914][25689] Fps is (10 sec: 5814.2, 60 sec: 5698.4, 300 sec: 5686.4). Total num frames: 547317760. Throughput: 0: 5945.7. Samples: 547324564. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:44,914][25689] Avg episode reward: [(0, '-29.117')] [2022-07-10 02:49:45,977][26022] Updated weights on worker 0-0, policy_version 534496 (0.00144) [2022-07-10 02:49:47,737][26022] Updated weights on worker 0-0, policy_version 534506 (0.00080) [2022-07-10 02:49:49,624][26022] Updated weights on worker 0-0, policy_version 534516 (0.00092) [2022-07-10 02:49:49,927][25689] Fps is (10 sec: 5710.5, 60 sec: 5680.8, 300 sec: 5687.6). Total num frames: 547346432. Throughput: 0: 5104.9. Samples: 547341550. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:49,928][25689] Avg episode reward: [(0, '-28.715')] [2022-07-10 02:49:51,441][26022] Updated weights on worker 0-0, policy_version 534526 (0.00085) [2022-07-10 02:49:53,211][26022] Updated weights on worker 0-0, policy_version 534536 (0.00092) [2022-07-10 02:49:53,315][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:49:53,322][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000534537_547365888.pth [2022-07-10 02:49:53,339][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000532533_545313792.pth [2022-07-10 02:49:54,887][26022] Updated weights on worker 0-0, policy_version 534546 (0.00085) [2022-07-10 02:49:55,039][25689] Fps is (10 sec: 5764.6, 60 sec: 5699.0, 300 sec: 5686.0). Total num frames: 547376128. Throughput: 0: 5981.3. Samples: 547376290. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:49:55,039][25689] Avg episode reward: [(0, '-29.202')] [2022-07-10 02:49:56,661][26022] Updated weights on worker 0-0, policy_version 534556 (0.00087) [2022-07-10 02:49:58,342][26022] Updated weights on worker 0-0, policy_version 534566 (0.00084) [2022-07-10 02:50:00,121][25689] Fps is (10 sec: 5725.8, 60 sec: 5694.0, 300 sec: 5688.4). Total num frames: 547404800. Throughput: 0: 5987.7. Samples: 547411276. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:00,121][25689] Avg episode reward: [(0, '-29.296')] [2022-07-10 02:50:00,355][26022] Updated weights on worker 0-0, policy_version 534576 (0.00083) [2022-07-10 02:50:02,233][26022] Updated weights on worker 0-0, policy_version 534586 (0.00086) [2022-07-10 02:50:04,440][26022] Updated weights on worker 0-0, policy_version 534596 (0.00094) [2022-07-10 02:50:05,123][25689] Fps is (10 sec: 5483.5, 60 sec: 5682.7, 300 sec: 5691.9). Total num frames: 547431424. Throughput: 0: 5035.6. Samples: 547426330. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:05,123][25689] Avg episode reward: [(0, '-28.421')] [2022-07-10 02:50:05,668][26022] Updated weights on worker 0-0, policy_version 534606 (0.00089) [2022-07-10 02:50:07,917][26022] Updated weights on worker 0-0, policy_version 534616 (0.00094) [2022-07-10 02:50:09,370][26022] Updated weights on worker 0-0, policy_version 534626 (0.00091) [2022-07-10 02:50:10,178][25689] Fps is (10 sec: 5600.2, 60 sec: 5679.7, 300 sec: 5688.2). Total num frames: 547461120. Throughput: 0: 5891.4. Samples: 547460854. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:10,178][25689] Avg episode reward: [(0, '-28.948')] [2022-07-10 02:50:11,373][26022] Updated weights on worker 0-0, policy_version 534636 (0.00093) [2022-07-10 02:50:12,917][26022] Updated weights on worker 0-0, policy_version 534646 (0.00090) [2022-07-10 02:50:14,866][26022] Updated weights on worker 0-0, policy_version 534656 (0.00084) [2022-07-10 02:50:15,306][25689] Fps is (10 sec: 5731.9, 60 sec: 5711.7, 300 sec: 5686.6). Total num frames: 547489792. Throughput: 0: 5859.4. Samples: 547495044. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:15,307][25689] Avg episode reward: [(0, '-28.911')] [2022-07-10 02:50:16,628][26022] Updated weights on worker 0-0, policy_version 534666 (0.00081) [2022-07-10 02:50:18,626][26022] Updated weights on worker 0-0, policy_version 534676 (0.00088) [2022-07-10 02:50:20,341][25689] Fps is (10 sec: 5541.7, 60 sec: 5659.0, 300 sec: 5690.6). Total num frames: 547517440. Throughput: 0: 4972.8. Samples: 547511824. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:20,341][25689] Avg episode reward: [(0, '-29.056')] [2022-07-10 02:50:20,390][26022] Updated weights on worker 0-0, policy_version 534686 (0.00098) [2022-07-10 02:50:22,250][26022] Updated weights on worker 0-0, policy_version 534696 (0.00094) [2022-07-10 02:50:23,966][26022] Updated weights on worker 0-0, policy_version 534706 (0.00101) [2022-07-10 02:50:25,362][25689] Fps is (10 sec: 5600.4, 60 sec: 5674.2, 300 sec: 5687.5). Total num frames: 547546112. Throughput: 0: 5894.7. Samples: 547545636. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:25,363][25689] Avg episode reward: [(0, '-27.918')] [2022-07-10 02:50:25,942][26022] Updated weights on worker 0-0, policy_version 534716 (0.00083) [2022-07-10 02:50:27,683][26022] Updated weights on worker 0-0, policy_version 534726 (0.00088) [2022-07-10 02:50:29,557][26022] Updated weights on worker 0-0, policy_version 534736 (0.00090) [2022-07-10 02:50:30,453][25689] Fps is (10 sec: 5670.5, 60 sec: 5669.2, 300 sec: 5684.3). Total num frames: 547574784. Throughput: 0: 5835.0. Samples: 547579162. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:30,454][25689] Avg episode reward: [(0, '-28.393')] [2022-07-10 02:50:31,331][26022] Updated weights on worker 0-0, policy_version 534746 (0.00082) [2022-07-10 02:50:33,088][26022] Updated weights on worker 0-0, policy_version 534756 (0.00087) [2022-07-10 02:50:34,893][26022] Updated weights on worker 0-0, policy_version 534766 (0.00096) [2022-07-10 02:50:35,541][25689] Fps is (10 sec: 5734.6, 60 sec: 5688.3, 300 sec: 5683.9). Total num frames: 547604480. Throughput: 0: 5855.7. Samples: 547613532. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:35,541][25689] Avg episode reward: [(0, '-28.008')] [2022-07-10 02:50:36,506][26022] Updated weights on worker 0-0, policy_version 534776 (0.00086) [2022-07-10 02:50:38,639][26022] Updated weights on worker 0-0, policy_version 534786 (0.00083) [2022-07-10 02:50:40,109][26022] Updated weights on worker 0-0, policy_version 534796 (0.00084) [2022-07-10 02:50:40,573][25689] Fps is (10 sec: 5767.7, 60 sec: 5669.4, 300 sec: 5687.1). Total num frames: 547633152. Throughput: 0: 5869.1. Samples: 547630570. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 02:50:40,574][25689] Avg episode reward: [(0, '-27.120')] [2022-07-10 02:50:42,116][26022] Updated weights on worker 0-0, policy_version 534806 (0.00068) [2022-07-10 02:50:43,862][26022] Updated weights on worker 0-0, policy_version 534816 (0.00085) [2022-07-10 02:50:45,522][26022] Updated weights on worker 0-0, policy_version 534826 (0.00091) [2022-07-10 02:50:45,619][25689] Fps is (10 sec: 5689.6, 60 sec: 5667.7, 300 sec: 5679.8). Total num frames: 547661824. Throughput: 0: 5895.5. Samples: 547665060. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:50:45,620][25689] Avg episode reward: [(0, '-28.602')] [2022-07-10 02:50:47,383][26022] Updated weights on worker 0-0, policy_version 534836 (0.00089) [2022-07-10 02:50:49,267][26022] Updated weights on worker 0-0, policy_version 534846 (0.00088) [2022-07-10 02:50:50,667][25689] Fps is (10 sec: 5579.7, 60 sec: 5647.7, 300 sec: 5684.3). Total num frames: 547689472. Throughput: 0: 5952.5. Samples: 547699484. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:50:50,667][25689] Avg episode reward: [(0, '-27.977')] [2022-07-10 02:50:50,894][26022] Updated weights on worker 0-0, policy_version 534856 (0.00092) [2022-07-10 02:50:52,813][26022] Updated weights on worker 0-0, policy_version 534866 (0.00087) [2022-07-10 02:50:54,494][26022] Updated weights on worker 0-0, policy_version 534876 (0.00090) [2022-07-10 02:50:55,713][25689] Fps is (10 sec: 5681.3, 60 sec: 5653.8, 300 sec: 5683.9). Total num frames: 547719168. Throughput: 0: 5115.3. Samples: 547716718. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:50:55,713][25689] Avg episode reward: [(0, '-28.901')] [2022-07-10 02:50:56,358][26022] Updated weights on worker 0-0, policy_version 534886 (0.00096) [2022-07-10 02:50:58,244][26022] Updated weights on worker 0-0, policy_version 534896 (0.00092) [2022-07-10 02:50:59,930][26022] Updated weights on worker 0-0, policy_version 534906 (0.00100) [2022-07-10 02:51:00,715][25689] Fps is (10 sec: 5808.7, 60 sec: 5661.2, 300 sec: 5687.7). Total num frames: 547747840. Throughput: 0: 5979.2. Samples: 547751004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:00,716][25689] Avg episode reward: [(0, '-29.104')] [2022-07-10 02:51:02,347][26022] Updated weights on worker 0-0, policy_version 534916 (0.00084) [2022-07-10 02:51:03,883][26022] Updated weights on worker 0-0, policy_version 534926 (0.00092) [2022-07-10 02:51:05,740][25689] Fps is (10 sec: 5514.7, 60 sec: 5659.2, 300 sec: 5680.9). Total num frames: 547774464. Throughput: 0: 5858.9. Samples: 547782944. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:05,740][25689] Avg episode reward: [(0, '-28.334')] [2022-07-10 02:51:05,745][26022] Updated weights on worker 0-0, policy_version 534936 (0.00094) [2022-07-10 02:51:07,443][26022] Updated weights on worker 0-0, policy_version 534946 (0.00084) [2022-07-10 02:51:09,411][26022] Updated weights on worker 0-0, policy_version 534956 (0.00097) [2022-07-10 02:51:10,753][25689] Fps is (10 sec: 5509.0, 60 sec: 5646.2, 300 sec: 5681.6). Total num frames: 547803136. Throughput: 0: 5005.5. Samples: 547800024. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:10,753][25689] Avg episode reward: [(0, '-28.340')] [2022-07-10 02:51:10,933][26022] Updated weights on worker 0-0, policy_version 534966 (0.00086) [2022-07-10 02:51:12,931][26022] Updated weights on worker 0-0, policy_version 534976 (0.00054) [2022-07-10 02:51:14,747][26022] Updated weights on worker 0-0, policy_version 534986 (0.00353) [2022-07-10 02:51:15,811][25689] Fps is (10 sec: 5592.3, 60 sec: 5635.8, 300 sec: 5677.1). Total num frames: 547830784. Throughput: 0: 5842.1. Samples: 547834132. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:15,811][25689] Avg episode reward: [(0, '-27.804')] [2022-07-10 02:51:16,374][26022] Updated weights on worker 0-0, policy_version 534996 (0.00090) [2022-07-10 02:51:18,363][26022] Updated weights on worker 0-0, policy_version 535006 (0.00082) [2022-07-10 02:51:19,878][26022] Updated weights on worker 0-0, policy_version 535016 (0.00093) [2022-07-10 02:51:20,840][25689] Fps is (10 sec: 5583.4, 60 sec: 5653.2, 300 sec: 5680.4). Total num frames: 547859456. Throughput: 0: 5829.4. Samples: 547868318. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:20,840][25689] Avg episode reward: [(0, '-28.217')] [2022-07-10 02:51:22,111][26022] Updated weights on worker 0-0, policy_version 535026 (0.00094) [2022-07-10 02:51:23,792][26022] Updated weights on worker 0-0, policy_version 535036 (0.00088) [2022-07-10 02:51:25,583][26022] Updated weights on worker 0-0, policy_version 535046 (0.00091) [2022-07-10 02:51:25,843][25689] Fps is (10 sec: 5715.8, 60 sec: 5655.0, 300 sec: 5677.4). Total num frames: 547888128. Throughput: 0: 5099.5. Samples: 547885462. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:25,844][25689] Avg episode reward: [(0, '-27.352')] [2022-07-10 02:51:27,590][26022] Updated weights on worker 0-0, policy_version 535056 (0.00088) [2022-07-10 02:51:29,208][26022] Updated weights on worker 0-0, policy_version 535066 (0.00091) [2022-07-10 02:51:30,869][25689] Fps is (10 sec: 5717.6, 60 sec: 5661.0, 300 sec: 5678.5). Total num frames: 547916800. Throughput: 0: 5944.3. Samples: 547919602. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:30,870][25689] Avg episode reward: [(0, '-28.038')] [2022-07-10 02:51:30,966][26022] Updated weights on worker 0-0, policy_version 535076 (0.00085) [2022-07-10 02:51:32,852][26022] Updated weights on worker 0-0, policy_version 535086 (0.00091) [2022-07-10 02:51:34,427][26022] Updated weights on worker 0-0, policy_version 535096 (0.00086) [2022-07-10 02:51:35,995][25689] Fps is (10 sec: 5547.9, 60 sec: 5623.6, 300 sec: 5672.9). Total num frames: 547944448. Throughput: 0: 5919.2. Samples: 547953606. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:35,995][25689] Avg episode reward: [(0, '-27.967')] [2022-07-10 02:51:36,579][26022] Updated weights on worker 0-0, policy_version 535106 (0.00088) [2022-07-10 02:51:38,055][26022] Updated weights on worker 0-0, policy_version 535116 (0.00083) [2022-07-10 02:51:40,048][26022] Updated weights on worker 0-0, policy_version 535126 (0.00090) [2022-07-10 02:51:41,003][25689] Fps is (10 sec: 5759.6, 60 sec: 5659.7, 300 sec: 5680.1). Total num frames: 547975168. Throughput: 0: 5073.0. Samples: 547970606. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:41,004][25689] Avg episode reward: [(0, '-28.625')] [2022-07-10 02:51:41,707][26022] Updated weights on worker 0-0, policy_version 535136 (0.00090) [2022-07-10 02:51:43,602][26022] Updated weights on worker 0-0, policy_version 535146 (0.00089) [2022-07-10 02:51:45,542][26022] Updated weights on worker 0-0, policy_version 535156 (0.00089) [2022-07-10 02:51:46,032][25689] Fps is (10 sec: 5814.9, 60 sec: 5644.3, 300 sec: 5676.3). Total num frames: 548002816. Throughput: 0: 5903.8. Samples: 548004656. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:46,033][25689] Avg episode reward: [(0, '-28.879')] [2022-07-10 02:51:47,133][26022] Updated weights on worker 0-0, policy_version 535166 (0.00085) [2022-07-10 02:51:49,157][26022] Updated weights on worker 0-0, policy_version 535176 (0.00098) [2022-07-10 02:51:50,699][26022] Updated weights on worker 0-0, policy_version 535186 (0.00085) [2022-07-10 02:51:51,055][25689] Fps is (10 sec: 5603.0, 60 sec: 5663.6, 300 sec: 5676.9). Total num frames: 548031488. Throughput: 0: 5901.3. Samples: 548038724. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:51,056][25689] Avg episode reward: [(0, '-28.667')] [2022-07-10 02:51:52,686][26022] Updated weights on worker 0-0, policy_version 535196 (0.00089) [2022-07-10 02:51:53,341][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:51:53,354][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000535201_548045824.pth [2022-07-10 02:51:53,355][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000533201_545997824.pth [2022-07-10 02:51:54,460][26022] Updated weights on worker 0-0, policy_version 535206 (0.00092) [2022-07-10 02:51:56,119][25689] Fps is (10 sec: 5583.9, 60 sec: 5628.0, 300 sec: 5672.8). Total num frames: 548059136. Throughput: 0: 5085.4. Samples: 548055944. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:51:56,119][25689] Avg episode reward: [(0, '-28.210')] [2022-07-10 02:51:56,235][26022] Updated weights on worker 0-0, policy_version 535216 (0.00086) [2022-07-10 02:51:58,076][26022] Updated weights on worker 0-0, policy_version 535226 (0.00114) [2022-07-10 02:51:59,814][26022] Updated weights on worker 0-0, policy_version 535236 (0.00084) [2022-07-10 02:52:01,129][25689] Fps is (10 sec: 5692.1, 60 sec: 5644.3, 300 sec: 5683.3). Total num frames: 548088832. Throughput: 0: 5944.1. Samples: 548090238. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:01,131][25689] Avg episode reward: [(0, '-27.817')] [2022-07-10 02:52:01,643][26022] Updated weights on worker 0-0, policy_version 535246 (0.00100) [2022-07-10 02:52:03,617][26022] Updated weights on worker 0-0, policy_version 535256 (0.00085) [2022-07-10 02:52:05,580][26022] Updated weights on worker 0-0, policy_version 535266 (0.00082) [2022-07-10 02:52:06,150][25689] Fps is (10 sec: 5614.2, 60 sec: 5644.6, 300 sec: 5673.0). Total num frames: 548115456. Throughput: 0: 5866.1. Samples: 548122670. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:06,151][25689] Avg episode reward: [(0, '-28.095')] [2022-07-10 02:52:07,239][26022] Updated weights on worker 0-0, policy_version 535276 (0.00090) [2022-07-10 02:52:09,079][26022] Updated weights on worker 0-0, policy_version 535286 (0.00094) [2022-07-10 02:52:11,034][26022] Updated weights on worker 0-0, policy_version 535296 (0.00085) [2022-07-10 02:52:11,167][25689] Fps is (10 sec: 5407.1, 60 sec: 5627.3, 300 sec: 5674.4). Total num frames: 548143104. Throughput: 0: 5031.8. Samples: 548139920. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:11,167][25689] Avg episode reward: [(0, '-27.939')] [2022-07-10 02:52:12,569][26022] Updated weights on worker 0-0, policy_version 535306 (0.00088) [2022-07-10 02:52:14,569][26022] Updated weights on worker 0-0, policy_version 535316 (0.00080) [2022-07-10 02:52:16,268][25689] Fps is (10 sec: 5769.1, 60 sec: 5674.1, 300 sec: 5673.8). Total num frames: 548173824. Throughput: 0: 5883.4. Samples: 548174490. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:16,268][25689] Avg episode reward: [(0, '-27.808')] [2022-07-10 02:52:16,271][26022] Updated weights on worker 0-0, policy_version 535326 (0.00086) [2022-07-10 02:52:18,079][26022] Updated weights on worker 0-0, policy_version 535336 (0.00088) [2022-07-10 02:52:19,746][26022] Updated weights on worker 0-0, policy_version 535346 (0.00092) [2022-07-10 02:52:21,297][25689] Fps is (10 sec: 5862.4, 60 sec: 5674.1, 300 sec: 5674.6). Total num frames: 548202496. Throughput: 0: 5870.1. Samples: 548208626. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:21,298][25689] Avg episode reward: [(0, '-29.006')] [2022-07-10 02:52:21,640][26022] Updated weights on worker 0-0, policy_version 535356 (0.00095) [2022-07-10 02:52:23,247][26022] Updated weights on worker 0-0, policy_version 535366 (0.00078) [2022-07-10 02:52:25,371][26022] Updated weights on worker 0-0, policy_version 535376 (0.00082) [2022-07-10 02:52:26,330][25689] Fps is (10 sec: 5597.1, 60 sec: 5654.4, 300 sec: 5670.9). Total num frames: 548230144. Throughput: 0: 5118.2. Samples: 548225954. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:26,331][25689] Avg episode reward: [(0, '-29.595')] [2022-07-10 02:52:26,951][26022] Updated weights on worker 0-0, policy_version 535386 (0.00089) [2022-07-10 02:52:29,027][26022] Updated weights on worker 0-0, policy_version 535396 (0.00090) [2022-07-10 02:52:30,711][26022] Updated weights on worker 0-0, policy_version 535406 (0.00091) [2022-07-10 02:52:31,345][25689] Fps is (10 sec: 5605.2, 60 sec: 5655.4, 300 sec: 5669.8). Total num frames: 548258816. Throughput: 0: 5937.4. Samples: 548259730. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:31,346][25689] Avg episode reward: [(0, '-30.114')] [2022-07-10 02:52:32,466][26022] Updated weights on worker 0-0, policy_version 535416 (0.00086) [2022-07-10 02:52:34,320][26022] Updated weights on worker 0-0, policy_version 535426 (0.00076) [2022-07-10 02:52:35,942][26022] Updated weights on worker 0-0, policy_version 535436 (0.00087) [2022-07-10 02:52:36,404][25689] Fps is (10 sec: 5590.5, 60 sec: 5661.6, 300 sec: 5665.7). Total num frames: 548286464. Throughput: 0: 5921.3. Samples: 548293724. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:36,405][25689] Avg episode reward: [(0, '-30.634')] [2022-07-10 02:52:37,867][26022] Updated weights on worker 0-0, policy_version 535446 (0.00435) [2022-07-10 02:52:39,870][26022] Updated weights on worker 0-0, policy_version 535456 (0.00088) [2022-07-10 02:52:41,427][25689] Fps is (10 sec: 5586.2, 60 sec: 5626.4, 300 sec: 5662.1). Total num frames: 548315136. Throughput: 0: 5073.0. Samples: 548310744. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:41,428][25689] Avg episode reward: [(0, '-31.310')] [2022-07-10 02:52:41,580][26022] Updated weights on worker 0-0, policy_version 535466 (0.00085) [2022-07-10 02:52:43,409][26022] Updated weights on worker 0-0, policy_version 535476 (0.00086) [2022-07-10 02:52:45,118][26022] Updated weights on worker 0-0, policy_version 535486 (0.00860) [2022-07-10 02:52:46,433][25689] Fps is (10 sec: 5820.3, 60 sec: 5662.5, 300 sec: 5667.2). Total num frames: 548344832. Throughput: 0: 5928.4. Samples: 548345130. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:46,433][25689] Avg episode reward: [(0, '-29.908')] [2022-07-10 02:52:46,966][26022] Updated weights on worker 0-0, policy_version 535496 (0.00072) [2022-07-10 02:52:48,784][26022] Updated weights on worker 0-0, policy_version 535506 (0.00082) [2022-07-10 02:52:50,659][26022] Updated weights on worker 0-0, policy_version 535516 (0.00092) [2022-07-10 02:52:51,463][25689] Fps is (10 sec: 5816.2, 60 sec: 5661.8, 300 sec: 5665.1). Total num frames: 548373504. Throughput: 0: 5918.2. Samples: 548378788. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:51,463][25689] Avg episode reward: [(0, '-28.225')] [2022-07-10 02:52:52,486][26022] Updated weights on worker 0-0, policy_version 535526 (0.00089) [2022-07-10 02:52:54,179][26022] Updated weights on worker 0-0, policy_version 535536 (0.00085) [2022-07-10 02:52:55,908][26022] Updated weights on worker 0-0, policy_version 535546 (0.00087) [2022-07-10 02:52:56,565][25689] Fps is (10 sec: 5659.5, 60 sec: 5675.1, 300 sec: 5663.2). Total num frames: 548402176. Throughput: 0: 5059.4. Samples: 548395724. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:52:56,565][25689] Avg episode reward: [(0, '-29.427')] [2022-07-10 02:52:57,705][26022] Updated weights on worker 0-0, policy_version 535556 (0.00092) [2022-07-10 02:52:59,621][26022] Updated weights on worker 0-0, policy_version 535566 (0.00091) [2022-07-10 02:53:01,629][25689] Fps is (10 sec: 5439.3, 60 sec: 5619.4, 300 sec: 5662.3). Total num frames: 548428800. Throughput: 0: 5896.0. Samples: 548429852. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:01,629][25689] Avg episode reward: [(0, '-28.859')] [2022-07-10 02:53:01,670][26022] Updated weights on worker 0-0, policy_version 535576 (0.00094) [2022-07-10 02:53:03,668][26022] Updated weights on worker 0-0, policy_version 535586 (0.00094) [2022-07-10 02:53:05,535][26022] Updated weights on worker 0-0, policy_version 535596 (0.00088) [2022-07-10 02:53:06,670][25689] Fps is (10 sec: 5370.5, 60 sec: 5634.4, 300 sec: 5658.4). Total num frames: 548456448. Throughput: 0: 5742.6. Samples: 548461348. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:06,671][25689] Avg episode reward: [(0, '-28.416')] [2022-07-10 02:53:07,419][26022] Updated weights on worker 0-0, policy_version 535606 (0.00084) [2022-07-10 02:53:09,140][26022] Updated weights on worker 0-0, policy_version 535616 (0.00084) [2022-07-10 02:53:10,977][26022] Updated weights on worker 0-0, policy_version 535626 (0.00090) [2022-07-10 02:53:11,683][25689] Fps is (10 sec: 5601.8, 60 sec: 5651.6, 300 sec: 5659.0). Total num frames: 548485120. Throughput: 0: 4921.7. Samples: 548478304. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:11,684][25689] Avg episode reward: [(0, '-28.169')] [2022-07-10 02:53:12,579][26022] Updated weights on worker 0-0, policy_version 535636 (0.00095) [2022-07-10 02:53:14,612][26022] Updated weights on worker 0-0, policy_version 535646 (0.00089) [2022-07-10 02:53:16,213][26022] Updated weights on worker 0-0, policy_version 535656 (0.00618) [2022-07-10 02:53:16,768][25689] Fps is (10 sec: 5679.0, 60 sec: 5619.3, 300 sec: 5657.9). Total num frames: 548513792. Throughput: 0: 5786.8. Samples: 548512636. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:16,768][25689] Avg episode reward: [(0, '-28.311')] [2022-07-10 02:53:18,155][26022] Updated weights on worker 0-0, policy_version 535666 (0.00084) [2022-07-10 02:53:19,915][26022] Updated weights on worker 0-0, policy_version 535676 (0.00085) [2022-07-10 02:53:21,619][26022] Updated weights on worker 0-0, policy_version 535686 (0.00056) [2022-07-10 02:53:21,851][25689] Fps is (10 sec: 5740.1, 60 sec: 5631.2, 300 sec: 5656.6). Total num frames: 548543488. Throughput: 0: 5782.8. Samples: 548546794. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:21,853][25689] Avg episode reward: [(0, '-29.202')] [2022-07-10 02:53:23,651][26022] Updated weights on worker 0-0, policy_version 535696 (0.00096) [2022-07-10 02:53:25,311][26022] Updated weights on worker 0-0, policy_version 535706 (0.00096) [2022-07-10 02:53:26,857][25689] Fps is (10 sec: 5684.0, 60 sec: 5633.8, 300 sec: 5656.9). Total num frames: 548571136. Throughput: 0: 5084.4. Samples: 548563982. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:26,858][25689] Avg episode reward: [(0, '-26.887')] [2022-07-10 02:53:27,262][26022] Updated weights on worker 0-0, policy_version 535716 (0.00094) [2022-07-10 02:53:28,945][26022] Updated weights on worker 0-0, policy_version 535726 (0.00086) [2022-07-10 02:53:30,768][26022] Updated weights on worker 0-0, policy_version 535736 (0.00092) [2022-07-10 02:53:31,868][25689] Fps is (10 sec: 5520.0, 60 sec: 5617.2, 300 sec: 5654.6). Total num frames: 548598784. Throughput: 0: 5929.7. Samples: 548598000. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:31,869][25689] Avg episode reward: [(0, '-26.197')] [2022-07-10 02:53:32,555][26022] Updated weights on worker 0-0, policy_version 535746 (0.00089) [2022-07-10 02:53:34,444][26022] Updated weights on worker 0-0, policy_version 535756 (0.00089) [2022-07-10 02:53:36,277][26022] Updated weights on worker 0-0, policy_version 535766 (0.00085) [2022-07-10 02:53:37,000][25689] Fps is (10 sec: 5754.4, 60 sec: 5661.1, 300 sec: 5659.3). Total num frames: 548629504. Throughput: 0: 5909.6. Samples: 548632198. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:37,000][25689] Avg episode reward: [(0, '-27.808')] [2022-07-10 02:53:37,987][26022] Updated weights on worker 0-0, policy_version 535776 (0.00084) [2022-07-10 02:53:39,760][26022] Updated weights on worker 0-0, policy_version 535786 (0.00081) [2022-07-10 02:53:41,608][26022] Updated weights on worker 0-0, policy_version 535796 (0.00089) [2022-07-10 02:53:42,082][25689] Fps is (10 sec: 5714.9, 60 sec: 5638.7, 300 sec: 5657.8). Total num frames: 548657152. Throughput: 0: 5929.0. Samples: 548666742. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 02:53:42,082][25689] Avg episode reward: [(0, '-27.325')] [2022-07-10 02:53:43,169][26022] Updated weights on worker 0-0, policy_version 535806 (0.00234) [2022-07-10 02:53:45,183][26022] Updated weights on worker 0-0, policy_version 535816 (0.00098) [2022-07-10 02:53:46,875][26022] Updated weights on worker 0-0, policy_version 535826 (0.00091) [2022-07-10 02:53:47,106][25689] Fps is (10 sec: 5572.7, 60 sec: 5620.1, 300 sec: 5654.0). Total num frames: 548685824. Throughput: 0: 5924.6. Samples: 548683954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:53:47,107][25689] Avg episode reward: [(0, '-26.519')] [2022-07-10 02:53:48,521][26022] Updated weights on worker 0-0, policy_version 535836 (0.00094) [2022-07-10 02:53:50,605][26022] Updated weights on worker 0-0, policy_version 535846 (0.00086) [2022-07-10 02:53:52,026][26022] Updated weights on worker 0-0, policy_version 535856 (0.00085) [2022-07-10 02:53:52,123][25689] Fps is (10 sec: 5915.1, 60 sec: 5655.1, 300 sec: 5663.0). Total num frames: 548716544. Throughput: 0: 5926.4. Samples: 548718034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:53:52,123][25689] Avg episode reward: [(0, '-26.626')] [2022-07-10 02:53:53,460][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:53:53,474][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000535861_548721664.pth [2022-07-10 02:53:53,475][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000533870_546682880.pth [2022-07-10 02:53:54,306][26022] Updated weights on worker 0-0, policy_version 535866 (0.00079) [2022-07-10 02:53:55,591][26022] Updated weights on worker 0-0, policy_version 535876 (0.00084) [2022-07-10 02:53:57,213][25689] Fps is (10 sec: 5673.7, 60 sec: 5622.4, 300 sec: 5654.9). Total num frames: 548743168. Throughput: 0: 5928.0. Samples: 548752024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:53:57,214][25689] Avg episode reward: [(0, '-27.059')] [2022-07-10 02:53:57,874][26022] Updated weights on worker 0-0, policy_version 535886 (0.00427) [2022-07-10 02:53:59,366][26022] Updated weights on worker 0-0, policy_version 535896 (0.00082) [2022-07-10 02:54:01,343][26022] Updated weights on worker 0-0, policy_version 535906 (0.00088) [2022-07-10 02:54:02,235][25689] Fps is (10 sec: 5366.9, 60 sec: 5643.2, 300 sec: 5655.7). Total num frames: 548770816. Throughput: 0: 5078.2. Samples: 548769084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:02,235][25689] Avg episode reward: [(0, '-27.177')] [2022-07-10 02:54:03,503][26022] Updated weights on worker 0-0, policy_version 535916 (0.00095) [2022-07-10 02:54:05,456][26022] Updated weights on worker 0-0, policy_version 535926 (0.00083) [2022-07-10 02:54:06,887][26022] Updated weights on worker 0-0, policy_version 535936 (0.00086) [2022-07-10 02:54:07,256][25689] Fps is (10 sec: 5607.8, 60 sec: 5662.0, 300 sec: 5652.3). Total num frames: 548799488. Throughput: 0: 5809.8. Samples: 548801022. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:07,257][25689] Avg episode reward: [(0, '-25.935')] [2022-07-10 02:54:08,891][26022] Updated weights on worker 0-0, policy_version 535946 (0.00086) [2022-07-10 02:54:10,809][26022] Updated weights on worker 0-0, policy_version 535956 (0.00091) [2022-07-10 02:54:12,316][25689] Fps is (10 sec: 5790.2, 60 sec: 5674.6, 300 sec: 5663.5). Total num frames: 548829184. Throughput: 0: 5800.1. Samples: 548835156. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:12,316][25689] Avg episode reward: [(0, '-27.575')] [2022-07-10 02:54:12,329][26022] Updated weights on worker 0-0, policy_version 535966 (0.00803) [2022-07-10 02:54:14,447][26022] Updated weights on worker 0-0, policy_version 535976 (0.00082) [2022-07-10 02:54:15,968][26022] Updated weights on worker 0-0, policy_version 535986 (0.00090) [2022-07-10 02:54:17,411][25689] Fps is (10 sec: 5445.3, 60 sec: 5622.9, 300 sec: 5644.8). Total num frames: 548854784. Throughput: 0: 4959.0. Samples: 548852186. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:17,412][25689] Avg episode reward: [(0, '-28.349')] [2022-07-10 02:54:18,045][26022] Updated weights on worker 0-0, policy_version 535996 (0.00084) [2022-07-10 02:54:19,644][26022] Updated weights on worker 0-0, policy_version 536006 (0.00088) [2022-07-10 02:54:21,502][26022] Updated weights on worker 0-0, policy_version 536016 (0.00091) [2022-07-10 02:54:22,437][25689] Fps is (10 sec: 5463.4, 60 sec: 5628.3, 300 sec: 5651.2). Total num frames: 548884480. Throughput: 0: 5809.0. Samples: 548886436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:22,437][25689] Avg episode reward: [(0, '-28.546')] [2022-07-10 02:54:23,479][26022] Updated weights on worker 0-0, policy_version 536026 (0.00086) [2022-07-10 02:54:25,208][26022] Updated weights on worker 0-0, policy_version 536036 (0.00090) [2022-07-10 02:54:26,781][26022] Updated weights on worker 0-0, policy_version 536046 (0.00089) [2022-07-10 02:54:27,479][25689] Fps is (10 sec: 5899.5, 60 sec: 5658.7, 300 sec: 5654.6). Total num frames: 548914176. Throughput: 0: 5903.8. Samples: 548920410. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:27,479][25689] Avg episode reward: [(0, '-29.809')] [2022-07-10 02:54:28,851][26022] Updated weights on worker 0-0, policy_version 536056 (0.00442) [2022-07-10 02:54:30,500][26022] Updated weights on worker 0-0, policy_version 536066 (0.00086) [2022-07-10 02:54:32,288][26022] Updated weights on worker 0-0, policy_version 536076 (0.00091) [2022-07-10 02:54:32,490][25689] Fps is (10 sec: 5704.0, 60 sec: 5658.7, 300 sec: 5653.0). Total num frames: 548941824. Throughput: 0: 5069.7. Samples: 548937436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:32,491][25689] Avg episode reward: [(0, '-30.909')] [2022-07-10 02:54:34,216][26022] Updated weights on worker 0-0, policy_version 536086 (0.00083) [2022-07-10 02:54:35,961][26022] Updated weights on worker 0-0, policy_version 536096 (0.00078) [2022-07-10 02:54:37,587][25689] Fps is (10 sec: 5470.5, 60 sec: 5611.2, 300 sec: 5644.5). Total num frames: 548969472. Throughput: 0: 5925.7. Samples: 548971740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:37,587][25689] Avg episode reward: [(0, '-32.091')] [2022-07-10 02:54:37,838][26022] Updated weights on worker 0-0, policy_version 536106 (0.00100) [2022-07-10 02:54:39,647][26022] Updated weights on worker 0-0, policy_version 536116 (0.00084) [2022-07-10 02:54:41,398][26022] Updated weights on worker 0-0, policy_version 536126 (0.00091) [2022-07-10 02:54:42,660][25689] Fps is (10 sec: 5739.6, 60 sec: 5662.8, 300 sec: 5650.6). Total num frames: 549000192. Throughput: 0: 5904.3. Samples: 549005838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:42,660][25689] Avg episode reward: [(0, '-32.222')] [2022-07-10 02:54:43,292][26022] Updated weights on worker 0-0, policy_version 536136 (0.00080) [2022-07-10 02:54:45,001][26022] Updated weights on worker 0-0, policy_version 536146 (0.00086) [2022-07-10 02:54:46,766][26022] Updated weights on worker 0-0, policy_version 536156 (0.00086) [2022-07-10 02:54:47,672][25689] Fps is (10 sec: 5889.2, 60 sec: 5664.0, 300 sec: 5650.6). Total num frames: 549028864. Throughput: 0: 5079.0. Samples: 549022970. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:47,672][25689] Avg episode reward: [(0, '-32.427')] [2022-07-10 02:54:48,704][26022] Updated weights on worker 0-0, policy_version 536166 (0.00105) [2022-07-10 02:54:50,443][26022] Updated weights on worker 0-0, policy_version 536176 (0.00083) [2022-07-10 02:54:52,250][26022] Updated weights on worker 0-0, policy_version 536186 (0.00092) [2022-07-10 02:54:52,675][25689] Fps is (10 sec: 5725.7, 60 sec: 5631.4, 300 sec: 5649.2). Total num frames: 549057536. Throughput: 0: 5935.9. Samples: 549057250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:52,675][25689] Avg episode reward: [(0, '-31.274')] [2022-07-10 02:54:53,880][26022] Updated weights on worker 0-0, policy_version 536196 (0.00092) [2022-07-10 02:54:55,728][26022] Updated weights on worker 0-0, policy_version 536206 (0.00083) [2022-07-10 02:54:57,773][25689] Fps is (10 sec: 5575.3, 60 sec: 5647.6, 300 sec: 5645.5). Total num frames: 549085184. Throughput: 0: 5922.0. Samples: 549091284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:54:57,774][25689] Avg episode reward: [(0, '-30.938')] [2022-07-10 02:54:57,774][26022] Updated weights on worker 0-0, policy_version 536216 (0.00082) [2022-07-10 02:54:59,294][26022] Updated weights on worker 0-0, policy_version 536226 (0.00084) [2022-07-10 02:55:01,114][26022] Updated weights on worker 0-0, policy_version 536236 (0.00086) [2022-07-10 02:55:02,843][25689] Fps is (10 sec: 5437.8, 60 sec: 5643.0, 300 sec: 5647.6). Total num frames: 549112832. Throughput: 0: 5091.4. Samples: 549108602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:02,844][25689] Avg episode reward: [(0, '-28.791')] [2022-07-10 02:55:03,350][26022] Updated weights on worker 0-0, policy_version 536246 (0.00093) [2022-07-10 02:55:05,038][26022] Updated weights on worker 0-0, policy_version 536256 (0.00087) [2022-07-10 02:55:06,904][26022] Updated weights on worker 0-0, policy_version 536266 (0.00106) [2022-07-10 02:55:07,942][25689] Fps is (10 sec: 5538.8, 60 sec: 5635.9, 300 sec: 5643.3). Total num frames: 549141504. Throughput: 0: 5813.8. Samples: 549140816. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:07,942][25689] Avg episode reward: [(0, '-28.304')] [2022-07-10 02:55:08,868][26022] Updated weights on worker 0-0, policy_version 536276 (0.00092) [2022-07-10 02:55:10,431][26022] Updated weights on worker 0-0, policy_version 536286 (0.00086) [2022-07-10 02:55:12,368][26022] Updated weights on worker 0-0, policy_version 536296 (0.00089) [2022-07-10 02:55:12,999][25689] Fps is (10 sec: 5747.6, 60 sec: 5636.1, 300 sec: 5648.1). Total num frames: 549171200. Throughput: 0: 5804.6. Samples: 549175224. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:12,999][25689] Avg episode reward: [(0, '-27.172')] [2022-07-10 02:55:14,091][26022] Updated weights on worker 0-0, policy_version 536306 (0.00097) [2022-07-10 02:55:15,974][26022] Updated weights on worker 0-0, policy_version 536316 (0.00085) [2022-07-10 02:55:17,767][26022] Updated weights on worker 0-0, policy_version 536326 (0.00096) [2022-07-10 02:55:18,034][25689] Fps is (10 sec: 5681.8, 60 sec: 5675.5, 300 sec: 5648.1). Total num frames: 549198848. Throughput: 0: 4987.9. Samples: 549192344. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:18,035][25689] Avg episode reward: [(0, '-27.830')] [2022-07-10 02:55:19,502][26022] Updated weights on worker 0-0, policy_version 536336 (0.00090) [2022-07-10 02:55:21,487][26022] Updated weights on worker 0-0, policy_version 536346 (0.00079) [2022-07-10 02:55:23,051][25689] Fps is (10 sec: 5602.5, 60 sec: 5659.4, 300 sec: 5648.2). Total num frames: 549227520. Throughput: 0: 5834.5. Samples: 549226504. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:23,052][25689] Avg episode reward: [(0, '-27.116')] [2022-07-10 02:55:23,083][26022] Updated weights on worker 0-0, policy_version 536356 (0.00093) [2022-07-10 02:55:24,957][26022] Updated weights on worker 0-0, policy_version 536366 (0.00087) [2022-07-10 02:55:26,687][26022] Updated weights on worker 0-0, policy_version 536376 (0.00093) [2022-07-10 02:55:28,067][25689] Fps is (10 sec: 5715.7, 60 sec: 5644.9, 300 sec: 5649.6). Total num frames: 549256192. Throughput: 0: 5944.0. Samples: 549260440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:28,067][25689] Avg episode reward: [(0, '-27.245')] [2022-07-10 02:55:28,538][26022] Updated weights on worker 0-0, policy_version 536386 (0.00090) [2022-07-10 02:55:30,375][26022] Updated weights on worker 0-0, policy_version 536396 (0.00087) [2022-07-10 02:55:32,259][26022] Updated weights on worker 0-0, policy_version 536406 (0.00089) [2022-07-10 02:55:33,076][25689] Fps is (10 sec: 5720.2, 60 sec: 5662.0, 300 sec: 5647.7). Total num frames: 549284864. Throughput: 0: 5085.3. Samples: 549277324. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:33,081][25689] Avg episode reward: [(0, '-26.602')] [2022-07-10 02:55:34,113][26022] Updated weights on worker 0-0, policy_version 536416 (0.00091) [2022-07-10 02:55:35,855][26022] Updated weights on worker 0-0, policy_version 536426 (0.00100) [2022-07-10 02:55:37,579][26022] Updated weights on worker 0-0, policy_version 536436 (0.00095) [2022-07-10 02:55:38,123][25689] Fps is (10 sec: 5702.5, 60 sec: 5683.6, 300 sec: 5647.4). Total num frames: 549313536. Throughput: 0: 5919.2. Samples: 549311252. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:38,125][25689] Avg episode reward: [(0, '-27.537')] [2022-07-10 02:55:39,671][26022] Updated weights on worker 0-0, policy_version 536446 (0.00081) [2022-07-10 02:55:40,962][26022] Updated weights on worker 0-0, policy_version 536456 (0.00092) [2022-07-10 02:55:43,131][25689] Fps is (10 sec: 5499.7, 60 sec: 5622.0, 300 sec: 5641.2). Total num frames: 549340160. Throughput: 0: 5909.2. Samples: 549345156. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:43,132][25689] Avg episode reward: [(0, '-27.407')] [2022-07-10 02:55:43,257][26022] Updated weights on worker 0-0, policy_version 536466 (0.01236) [2022-07-10 02:55:44,632][26022] Updated weights on worker 0-0, policy_version 536476 (0.00094) [2022-07-10 02:55:46,730][26022] Updated weights on worker 0-0, policy_version 536486 (0.00090) [2022-07-10 02:55:48,142][25689] Fps is (10 sec: 5723.5, 60 sec: 5655.9, 300 sec: 5652.3). Total num frames: 549370880. Throughput: 0: 5074.7. Samples: 549362314. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:48,144][25689] Avg episode reward: [(0, '-27.333')] [2022-07-10 02:55:48,527][26022] Updated weights on worker 0-0, policy_version 536496 (0.00107) [2022-07-10 02:55:50,191][26022] Updated weights on worker 0-0, policy_version 536506 (0.00103) [2022-07-10 02:55:52,172][26022] Updated weights on worker 0-0, policy_version 536516 (0.00084) [2022-07-10 02:55:53,152][25689] Fps is (10 sec: 5824.4, 60 sec: 5638.4, 300 sec: 5646.1). Total num frames: 549398528. Throughput: 0: 5939.2. Samples: 549396558. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:53,153][25689] Avg episode reward: [(0, '-27.759')] [2022-07-10 02:55:53,599][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:55:53,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000536525_549401600.pth [2022-07-10 02:55:53,618][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000534537_547365888.pth [2022-07-10 02:55:53,909][26022] Updated weights on worker 0-0, policy_version 536526 (0.00093) [2022-07-10 02:55:55,523][26022] Updated weights on worker 0-0, policy_version 536536 (0.00085) [2022-07-10 02:55:57,603][26022] Updated weights on worker 0-0, policy_version 536546 (0.00089) [2022-07-10 02:55:58,295][25689] Fps is (10 sec: 5647.9, 60 sec: 5668.0, 300 sec: 5646.8). Total num frames: 549428224. Throughput: 0: 5926.9. Samples: 549430810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:55:58,296][25689] Avg episode reward: [(0, '-28.722')] [2022-07-10 02:55:58,973][26022] Updated weights on worker 0-0, policy_version 536556 (0.00084) [2022-07-10 02:56:00,987][26022] Updated weights on worker 0-0, policy_version 536566 (0.00094) [2022-07-10 02:56:03,304][25689] Fps is (10 sec: 5345.9, 60 sec: 5623.0, 300 sec: 5640.2). Total num frames: 549452800. Throughput: 0: 5837.2. Samples: 549462912. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:03,305][25689] Avg episode reward: [(0, '-28.773')] [2022-07-10 02:56:03,319][26022] Updated weights on worker 0-0, policy_version 536576 (0.00085) [2022-07-10 02:56:04,893][26022] Updated weights on worker 0-0, policy_version 536586 (0.00088) [2022-07-10 02:56:06,838][26022] Updated weights on worker 0-0, policy_version 536596 (0.00068) [2022-07-10 02:56:08,336][25689] Fps is (10 sec: 5405.1, 60 sec: 5646.1, 300 sec: 5643.3). Total num frames: 549482496. Throughput: 0: 5823.9. Samples: 549479922. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:08,336][25689] Avg episode reward: [(0, '-28.497')] [2022-07-10 02:56:08,748][26022] Updated weights on worker 0-0, policy_version 536606 (0.00091) [2022-07-10 02:56:10,350][26022] Updated weights on worker 0-0, policy_version 536616 (0.00091) [2022-07-10 02:56:12,483][26022] Updated weights on worker 0-0, policy_version 536626 (0.00081) [2022-07-10 02:56:13,363][25689] Fps is (10 sec: 5904.4, 60 sec: 5648.9, 300 sec: 5650.8). Total num frames: 549512192. Throughput: 0: 5817.0. Samples: 549514126. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:13,363][25689] Avg episode reward: [(0, '-29.936')] [2022-07-10 02:56:14,210][26022] Updated weights on worker 0-0, policy_version 536636 (0.00088) [2022-07-10 02:56:15,785][26022] Updated weights on worker 0-0, policy_version 536646 (0.00088) [2022-07-10 02:56:17,670][26022] Updated weights on worker 0-0, policy_version 536656 (0.00084) [2022-07-10 02:56:18,475][25689] Fps is (10 sec: 5756.8, 60 sec: 5658.7, 300 sec: 5649.2). Total num frames: 549540864. Throughput: 0: 5840.6. Samples: 549548672. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:18,475][25689] Avg episode reward: [(0, '-29.858')] [2022-07-10 02:56:19,332][26022] Updated weights on worker 0-0, policy_version 536666 (0.00093) [2022-07-10 02:56:21,182][26022] Updated weights on worker 0-0, policy_version 536676 (0.00086) [2022-07-10 02:56:23,041][26022] Updated weights on worker 0-0, policy_version 536686 (0.00090) [2022-07-10 02:56:23,500][25689] Fps is (10 sec: 5454.6, 60 sec: 5624.0, 300 sec: 5641.9). Total num frames: 549567488. Throughput: 0: 5101.5. Samples: 549565944. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:23,501][25689] Avg episode reward: [(0, '-29.728')] [2022-07-10 02:56:24,820][26022] Updated weights on worker 0-0, policy_version 536696 (0.00091) [2022-07-10 02:56:26,626][26022] Updated weights on worker 0-0, policy_version 536706 (0.00089) [2022-07-10 02:56:28,440][26022] Updated weights on worker 0-0, policy_version 536716 (0.00093) [2022-07-10 02:56:28,534][25689] Fps is (10 sec: 5599.0, 60 sec: 5639.3, 300 sec: 5645.2). Total num frames: 549597184. Throughput: 0: 5941.3. Samples: 549599924. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:28,534][25689] Avg episode reward: [(0, '-28.879')] [2022-07-10 02:56:30,266][26022] Updated weights on worker 0-0, policy_version 536726 (0.00094) [2022-07-10 02:56:32,091][26022] Updated weights on worker 0-0, policy_version 536736 (0.00090) [2022-07-10 02:56:33,632][25689] Fps is (10 sec: 5760.8, 60 sec: 5631.0, 300 sec: 5649.2). Total num frames: 549625856. Throughput: 0: 5918.9. Samples: 549634098. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:33,633][25689] Avg episode reward: [(0, '-28.801')] [2022-07-10 02:56:33,886][26022] Updated weights on worker 0-0, policy_version 536746 (0.00082) [2022-07-10 02:56:35,555][26022] Updated weights on worker 0-0, policy_version 536756 (0.00083) [2022-07-10 02:56:37,444][26022] Updated weights on worker 0-0, policy_version 536766 (0.00092) [2022-07-10 02:56:38,775][25689] Fps is (10 sec: 5899.0, 60 sec: 5672.7, 300 sec: 5650.1). Total num frames: 549657600. Throughput: 0: 5057.7. Samples: 549651346. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 02:56:38,776][25689] Avg episode reward: [(0, '-27.514')] [2022-07-10 02:56:38,903][26022] Updated weights on worker 0-0, policy_version 536776 (0.00098) [2022-07-10 02:56:41,055][26022] Updated weights on worker 0-0, policy_version 536786 (0.00087) [2022-07-10 02:56:42,733][26022] Updated weights on worker 0-0, policy_version 536796 (0.00085) [2022-07-10 02:56:43,833][25689] Fps is (10 sec: 5722.2, 60 sec: 5668.1, 300 sec: 5646.1). Total num frames: 549684224. Throughput: 0: 5892.6. Samples: 549685754. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:56:43,833][25689] Avg episode reward: [(0, '-26.355')] [2022-07-10 02:56:44,598][26022] Updated weights on worker 0-0, policy_version 536806 (0.00090) [2022-07-10 02:56:46,414][26022] Updated weights on worker 0-0, policy_version 536816 (0.00084) [2022-07-10 02:56:48,183][26022] Updated weights on worker 0-0, policy_version 536826 (0.00076) [2022-07-10 02:56:48,903][25689] Fps is (10 sec: 5561.2, 60 sec: 5645.7, 300 sec: 5648.6). Total num frames: 549713920. Throughput: 0: 5917.6. Samples: 549720460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:56:48,903][25689] Avg episode reward: [(0, '-26.200')] [2022-07-10 02:56:49,883][26022] Updated weights on worker 0-0, policy_version 536836 (0.00091) [2022-07-10 02:56:51,686][26022] Updated weights on worker 0-0, policy_version 536846 (0.00092) [2022-07-10 02:56:53,526][26022] Updated weights on worker 0-0, policy_version 536856 (0.00086) [2022-07-10 02:56:54,000][25689] Fps is (10 sec: 5740.4, 60 sec: 5654.4, 300 sec: 5651.4). Total num frames: 549742592. Throughput: 0: 5081.9. Samples: 549737608. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:56:54,001][25689] Avg episode reward: [(0, '-26.699')] [2022-07-10 02:56:55,161][26022] Updated weights on worker 0-0, policy_version 536866 (0.00087) [2022-07-10 02:56:57,142][26022] Updated weights on worker 0-0, policy_version 536876 (0.00055) [2022-07-10 02:56:58,619][26022] Updated weights on worker 0-0, policy_version 536886 (0.00087) [2022-07-10 02:56:59,066][25689] Fps is (10 sec: 5743.3, 60 sec: 5661.7, 300 sec: 5650.4). Total num frames: 549772288. Throughput: 0: 5932.2. Samples: 549771712. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:56:59,066][25689] Avg episode reward: [(0, '-27.345')] [2022-07-10 02:57:00,730][26022] Updated weights on worker 0-0, policy_version 536896 (0.00093) [2022-07-10 02:57:02,888][26022] Updated weights on worker 0-0, policy_version 536906 (0.00102) [2022-07-10 02:57:04,078][25689] Fps is (10 sec: 5588.7, 60 sec: 5695.1, 300 sec: 5650.6). Total num frames: 549798912. Throughput: 0: 5838.1. Samples: 549803948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:04,078][25689] Avg episode reward: [(0, '-28.827')] [2022-07-10 02:57:04,517][26022] Updated weights on worker 0-0, policy_version 536916 (0.00083) [2022-07-10 02:57:06,457][26022] Updated weights on worker 0-0, policy_version 536926 (0.00087) [2022-07-10 02:57:08,251][26022] Updated weights on worker 0-0, policy_version 536936 (0.00088) [2022-07-10 02:57:09,111][25689] Fps is (10 sec: 5402.9, 60 sec: 5661.3, 300 sec: 5650.2). Total num frames: 549826560. Throughput: 0: 4969.7. Samples: 549820886. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:09,112][25689] Avg episode reward: [(0, '-29.102')] [2022-07-10 02:57:09,919][26022] Updated weights on worker 0-0, policy_version 536946 (0.00087) [2022-07-10 02:57:11,943][26022] Updated weights on worker 0-0, policy_version 536956 (0.00083) [2022-07-10 02:57:13,335][26022] Updated weights on worker 0-0, policy_version 536966 (0.00086) [2022-07-10 02:57:14,120][25689] Fps is (10 sec: 5608.5, 60 sec: 5646.1, 300 sec: 5645.1). Total num frames: 549855232. Throughput: 0: 5850.0. Samples: 549855306. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:14,122][25689] Avg episode reward: [(0, '-29.123')] [2022-07-10 02:57:15,530][26022] Updated weights on worker 0-0, policy_version 536976 (0.00085) [2022-07-10 02:57:17,269][26022] Updated weights on worker 0-0, policy_version 536986 (0.00059) [2022-07-10 02:57:18,928][26022] Updated weights on worker 0-0, policy_version 536996 (0.00087) [2022-07-10 02:57:19,166][25689] Fps is (10 sec: 5804.4, 60 sec: 5669.1, 300 sec: 5648.2). Total num frames: 549884928. Throughput: 0: 5860.0. Samples: 549889502. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:19,167][25689] Avg episode reward: [(0, '-29.214')] [2022-07-10 02:57:20,748][26022] Updated weights on worker 0-0, policy_version 537006 (0.00089) [2022-07-10 02:57:22,536][26022] Updated weights on worker 0-0, policy_version 537016 (0.00091) [2022-07-10 02:57:24,199][25689] Fps is (10 sec: 5689.6, 60 sec: 5685.3, 300 sec: 5648.2). Total num frames: 549912576. Throughput: 0: 5101.2. Samples: 549906588. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:24,200][25689] Avg episode reward: [(0, '-28.481')] [2022-07-10 02:57:24,577][26022] Updated weights on worker 0-0, policy_version 537026 (0.00081) [2022-07-10 02:57:26,143][26022] Updated weights on worker 0-0, policy_version 537036 (0.00085) [2022-07-10 02:57:28,082][26022] Updated weights on worker 0-0, policy_version 537046 (0.00091) [2022-07-10 02:57:29,201][25689] Fps is (10 sec: 5612.5, 60 sec: 5671.3, 300 sec: 5648.5). Total num frames: 549941248. Throughput: 0: 5966.6. Samples: 549940758. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:29,202][25689] Avg episode reward: [(0, '-27.227')] [2022-07-10 02:57:29,630][26022] Updated weights on worker 0-0, policy_version 537056 (0.00092) [2022-07-10 02:57:31,531][26022] Updated weights on worker 0-0, policy_version 537066 (0.00095) [2022-07-10 02:57:33,536][26022] Updated weights on worker 0-0, policy_version 537076 (0.00089) [2022-07-10 02:57:34,238][25689] Fps is (10 sec: 5711.9, 60 sec: 5677.1, 300 sec: 5652.3). Total num frames: 549969920. Throughput: 0: 5944.7. Samples: 549974904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:34,239][25689] Avg episode reward: [(0, '-25.515')] [2022-07-10 02:57:35,150][26022] Updated weights on worker 0-0, policy_version 537086 (0.00106) [2022-07-10 02:57:37,045][26022] Updated weights on worker 0-0, policy_version 537096 (0.00091) [2022-07-10 02:57:38,852][26022] Updated weights on worker 0-0, policy_version 537106 (0.00086) [2022-07-10 02:57:39,296][25689] Fps is (10 sec: 5781.7, 60 sec: 5651.2, 300 sec: 5655.1). Total num frames: 549999616. Throughput: 0: 5083.7. Samples: 549991836. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:39,297][25689] Avg episode reward: [(0, '-25.593')] [2022-07-10 02:57:40,578][26022] Updated weights on worker 0-0, policy_version 537116 (0.00085) [2022-07-10 02:57:42,366][26022] Updated weights on worker 0-0, policy_version 537126 (0.00086) [2022-07-10 02:57:44,148][26022] Updated weights on worker 0-0, policy_version 537136 (0.00085) [2022-07-10 02:57:44,324][25689] Fps is (10 sec: 5787.2, 60 sec: 5687.8, 300 sec: 5651.2). Total num frames: 550028288. Throughput: 0: 5946.4. Samples: 550026260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:44,325][25689] Avg episode reward: [(0, '-25.898')] [2022-07-10 02:57:46,080][26022] Updated weights on worker 0-0, policy_version 537146 (0.00088) [2022-07-10 02:57:47,734][26022] Updated weights on worker 0-0, policy_version 537156 (0.00090) [2022-07-10 02:57:49,329][25689] Fps is (10 sec: 5511.9, 60 sec: 5643.2, 300 sec: 5644.8). Total num frames: 550054912. Throughput: 0: 5956.2. Samples: 550060642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:49,329][25689] Avg episode reward: [(0, '-27.553')] [2022-07-10 02:57:49,694][26022] Updated weights on worker 0-0, policy_version 537166 (0.00090) [2022-07-10 02:57:51,367][26022] Updated weights on worker 0-0, policy_version 537176 (0.00088) [2022-07-10 02:57:53,280][26022] Updated weights on worker 0-0, policy_version 537186 (0.00092) [2022-07-10 02:57:53,731][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:57:53,742][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000537189_550081536.pth [2022-07-10 02:57:53,742][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000535201_548045824.pth [2022-07-10 02:57:54,343][25689] Fps is (10 sec: 5621.1, 60 sec: 5667.9, 300 sec: 5649.9). Total num frames: 550084608. Throughput: 0: 5128.8. Samples: 550078020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:54,344][25689] Avg episode reward: [(0, '-27.789')] [2022-07-10 02:57:54,919][26022] Updated weights on worker 0-0, policy_version 537196 (0.00085) [2022-07-10 02:57:56,876][26022] Updated weights on worker 0-0, policy_version 537206 (0.00092) [2022-07-10 02:57:58,562][26022] Updated weights on worker 0-0, policy_version 537216 (0.00086) [2022-07-10 02:57:59,413][25689] Fps is (10 sec: 5889.2, 60 sec: 5667.4, 300 sec: 5660.1). Total num frames: 550114304. Throughput: 0: 5972.5. Samples: 550111986. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:57:59,414][25689] Avg episode reward: [(0, '-28.835')] [2022-07-10 02:58:00,427][26022] Updated weights on worker 0-0, policy_version 537226 (0.00085) [2022-07-10 02:58:02,283][26022] Updated weights on worker 0-0, policy_version 537236 (0.00088) [2022-07-10 02:58:04,319][26022] Updated weights on worker 0-0, policy_version 537246 (0.00083) [2022-07-10 02:58:04,456][25689] Fps is (10 sec: 5468.0, 60 sec: 5647.6, 300 sec: 5653.2). Total num frames: 550139904. Throughput: 0: 5865.1. Samples: 550144336. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:04,456][25689] Avg episode reward: [(0, '-28.440')] [2022-07-10 02:58:06,065][26022] Updated weights on worker 0-0, policy_version 537256 (0.00095) [2022-07-10 02:58:07,988][26022] Updated weights on worker 0-0, policy_version 537266 (0.00091) [2022-07-10 02:58:09,463][25689] Fps is (10 sec: 5400.7, 60 sec: 5667.0, 300 sec: 5653.4). Total num frames: 550168576. Throughput: 0: 5013.3. Samples: 550161580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:09,463][25689] Avg episode reward: [(0, '-28.846')] [2022-07-10 02:58:09,599][26022] Updated weights on worker 0-0, policy_version 537276 (0.00083) [2022-07-10 02:58:11,607][26022] Updated weights on worker 0-0, policy_version 537286 (0.00084) [2022-07-10 02:58:13,081][26022] Updated weights on worker 0-0, policy_version 537296 (0.00088) [2022-07-10 02:58:14,470][25689] Fps is (10 sec: 5828.9, 60 sec: 5684.2, 300 sec: 5658.3). Total num frames: 550198272. Throughput: 0: 5871.7. Samples: 550196196. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:14,470][25689] Avg episode reward: [(0, '-28.016')] [2022-07-10 02:58:15,159][26022] Updated weights on worker 0-0, policy_version 537306 (0.00091) [2022-07-10 02:58:16,702][26022] Updated weights on worker 0-0, policy_version 537316 (0.00077) [2022-07-10 02:58:18,741][26022] Updated weights on worker 0-0, policy_version 537326 (0.00086) [2022-07-10 02:58:19,587][25689] Fps is (10 sec: 5866.3, 60 sec: 5677.5, 300 sec: 5657.6). Total num frames: 550227968. Throughput: 0: 5884.7. Samples: 550230700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:19,589][25689] Avg episode reward: [(0, '-27.085')] [2022-07-10 02:58:20,391][26022] Updated weights on worker 0-0, policy_version 537336 (0.00086) [2022-07-10 02:58:22,201][26022] Updated weights on worker 0-0, policy_version 537346 (0.00095) [2022-07-10 02:58:24,085][26022] Updated weights on worker 0-0, policy_version 537356 (0.00086) [2022-07-10 02:58:24,679][25689] Fps is (10 sec: 5717.3, 60 sec: 5688.9, 300 sec: 5659.4). Total num frames: 550256640. Throughput: 0: 5976.0. Samples: 550265186. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:24,687][25689] Avg episode reward: [(0, '-27.656')] [2022-07-10 02:58:25,783][26022] Updated weights on worker 0-0, policy_version 537366 (0.00085) [2022-07-10 02:58:27,640][26022] Updated weights on worker 0-0, policy_version 537376 (0.00092) [2022-07-10 02:58:29,460][26022] Updated weights on worker 0-0, policy_version 537386 (0.00088) [2022-07-10 02:58:29,781][25689] Fps is (10 sec: 5524.8, 60 sec: 5662.6, 300 sec: 5657.7). Total num frames: 550284288. Throughput: 0: 5929.1. Samples: 550282050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:29,783][25689] Avg episode reward: [(0, '-27.037')] [2022-07-10 02:58:31,193][26022] Updated weights on worker 0-0, policy_version 537396 (0.00090) [2022-07-10 02:58:33,174][26022] Updated weights on worker 0-0, policy_version 537406 (0.00082) [2022-07-10 02:58:34,727][26022] Updated weights on worker 0-0, policy_version 537416 (0.00506) [2022-07-10 02:58:34,822][25689] Fps is (10 sec: 5653.4, 60 sec: 5679.2, 300 sec: 5656.0). Total num frames: 550313984. Throughput: 0: 5893.9. Samples: 550316150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:34,822][25689] Avg episode reward: [(0, '-26.727')] [2022-07-10 02:58:36,485][26022] Updated weights on worker 0-0, policy_version 537426 (0.00086) [2022-07-10 02:58:38,339][26022] Updated weights on worker 0-0, policy_version 537436 (0.00084) [2022-07-10 02:58:39,937][25689] Fps is (10 sec: 5747.1, 60 sec: 5656.9, 300 sec: 5658.8). Total num frames: 550342656. Throughput: 0: 5888.9. Samples: 550350540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:39,938][25689] Avg episode reward: [(0, '-27.083')] [2022-07-10 02:58:40,201][26022] Updated weights on worker 0-0, policy_version 537446 (0.00082) [2022-07-10 02:58:42,027][26022] Updated weights on worker 0-0, policy_version 537456 (0.00089) [2022-07-10 02:58:43,798][26022] Updated weights on worker 0-0, policy_version 537466 (0.00088) [2022-07-10 02:58:44,960][25689] Fps is (10 sec: 5757.1, 60 sec: 5674.2, 300 sec: 5662.3). Total num frames: 550372352. Throughput: 0: 5059.9. Samples: 550367810. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:44,961][25689] Avg episode reward: [(0, '-26.676')] [2022-07-10 02:58:45,421][26022] Updated weights on worker 0-0, policy_version 537476 (0.00084) [2022-07-10 02:58:47,328][26022] Updated weights on worker 0-0, policy_version 537486 (0.00084) [2022-07-10 02:58:49,187][26022] Updated weights on worker 0-0, policy_version 537496 (0.00084) [2022-07-10 02:58:49,989][25689] Fps is (10 sec: 5705.0, 60 sec: 5688.9, 300 sec: 5651.7). Total num frames: 550400000. Throughput: 0: 5960.5. Samples: 550402498. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:49,989][25689] Avg episode reward: [(0, '-27.023')] [2022-07-10 02:58:50,953][26022] Updated weights on worker 0-0, policy_version 537506 (0.00092) [2022-07-10 02:58:52,705][26022] Updated weights on worker 0-0, policy_version 537516 (0.00094) [2022-07-10 02:58:54,466][26022] Updated weights on worker 0-0, policy_version 537526 (0.00092) [2022-07-10 02:58:54,999][25689] Fps is (10 sec: 5610.5, 60 sec: 5672.4, 300 sec: 5660.1). Total num frames: 550428672. Throughput: 0: 5954.7. Samples: 550436296. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:58:54,999][25689] Avg episode reward: [(0, '-27.551')] [2022-07-10 02:58:56,348][26022] Updated weights on worker 0-0, policy_version 537536 (0.00092) [2022-07-10 02:58:58,092][26022] Updated weights on worker 0-0, policy_version 537546 (0.00087) [2022-07-10 02:58:59,944][26022] Updated weights on worker 0-0, policy_version 537556 (0.00090) [2022-07-10 02:59:00,043][25689] Fps is (10 sec: 5703.3, 60 sec: 5658.0, 300 sec: 5663.1). Total num frames: 550457344. Throughput: 0: 5098.4. Samples: 550453048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:00,044][25689] Avg episode reward: [(0, '-27.581')] [2022-07-10 02:59:01,735][26022] Updated weights on worker 0-0, policy_version 537566 (0.00082) [2022-07-10 02:59:03,832][26022] Updated weights on worker 0-0, policy_version 537576 (0.00087) [2022-07-10 02:59:05,080][25689] Fps is (10 sec: 5383.6, 60 sec: 5658.5, 300 sec: 5652.5). Total num frames: 550482944. Throughput: 0: 5840.5. Samples: 550485316. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:05,080][25689] Avg episode reward: [(0, '-26.855')] [2022-07-10 02:59:05,699][26022] Updated weights on worker 0-0, policy_version 537586 (0.00084) [2022-07-10 02:59:07,390][26022] Updated weights on worker 0-0, policy_version 537596 (0.00099) [2022-07-10 02:59:09,322][26022] Updated weights on worker 0-0, policy_version 537606 (0.00086) [2022-07-10 02:59:10,088][25689] Fps is (10 sec: 5505.1, 60 sec: 5675.3, 300 sec: 5653.5). Total num frames: 550512640. Throughput: 0: 5826.0. Samples: 550519596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:10,088][25689] Avg episode reward: [(0, '-27.487')] [2022-07-10 02:59:11,059][26022] Updated weights on worker 0-0, policy_version 537616 (0.00094) [2022-07-10 02:59:12,869][26022] Updated weights on worker 0-0, policy_version 537626 (0.00094) [2022-07-10 02:59:14,667][26022] Updated weights on worker 0-0, policy_version 537636 (0.00093) [2022-07-10 02:59:15,111][25689] Fps is (10 sec: 5818.7, 60 sec: 5656.9, 300 sec: 5665.2). Total num frames: 550541312. Throughput: 0: 5001.8. Samples: 550536892. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:15,111][25689] Avg episode reward: [(0, '-28.466')] [2022-07-10 02:59:16,369][26022] Updated weights on worker 0-0, policy_version 537646 (0.00084) [2022-07-10 02:59:18,127][26022] Updated weights on worker 0-0, policy_version 537656 (0.00084) [2022-07-10 02:59:19,991][26022] Updated weights on worker 0-0, policy_version 537666 (0.00104) [2022-07-10 02:59:20,157][25689] Fps is (10 sec: 5796.8, 60 sec: 5663.6, 300 sec: 5664.8). Total num frames: 550571008. Throughput: 0: 5901.3. Samples: 550571744. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:20,158][25689] Avg episode reward: [(0, '-27.431')] [2022-07-10 02:59:21,775][26022] Updated weights on worker 0-0, policy_version 537676 (0.00082) [2022-07-10 02:59:23,520][26022] Updated weights on worker 0-0, policy_version 537686 (0.00090) [2022-07-10 02:59:25,170][25689] Fps is (10 sec: 5700.8, 60 sec: 5654.0, 300 sec: 5658.5). Total num frames: 550598656. Throughput: 0: 6016.1. Samples: 550606180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:25,170][25689] Avg episode reward: [(0, '-26.852')] [2022-07-10 02:59:25,481][26022] Updated weights on worker 0-0, policy_version 537696 (0.00094) [2022-07-10 02:59:27,226][26022] Updated weights on worker 0-0, policy_version 537706 (0.00087) [2022-07-10 02:59:28,955][26022] Updated weights on worker 0-0, policy_version 537716 (0.00090) [2022-07-10 02:59:30,173][25689] Fps is (10 sec: 5622.7, 60 sec: 5680.2, 300 sec: 5662.1). Total num frames: 550627328. Throughput: 0: 5160.3. Samples: 550623244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:30,174][25689] Avg episode reward: [(0, '-26.520')] [2022-07-10 02:59:30,799][26022] Updated weights on worker 0-0, policy_version 537726 (0.00081) [2022-07-10 02:59:32,622][26022] Updated weights on worker 0-0, policy_version 537736 (0.00086) [2022-07-10 02:59:34,123][26022] Updated weights on worker 0-0, policy_version 537746 (0.00102) [2022-07-10 02:59:35,181][25689] Fps is (10 sec: 5830.0, 60 sec: 5683.3, 300 sec: 5670.6). Total num frames: 550657024. Throughput: 0: 6026.4. Samples: 550657846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 02:59:35,183][25689] Avg episode reward: [(0, '-26.932')] [2022-07-10 02:59:36,236][26022] Updated weights on worker 0-0, policy_version 537756 (0.00090) [2022-07-10 02:59:37,779][26022] Updated weights on worker 0-0, policy_version 537766 (0.00085) [2022-07-10 02:59:39,757][26022] Updated weights on worker 0-0, policy_version 537776 (0.00086) [2022-07-10 02:59:40,213][25689] Fps is (10 sec: 5813.3, 60 sec: 5691.1, 300 sec: 5664.5). Total num frames: 550685696. Throughput: 0: 6007.3. Samples: 550692232. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 02:59:40,214][25689] Avg episode reward: [(0, '-26.454')] [2022-07-10 02:59:41,398][26022] Updated weights on worker 0-0, policy_version 537786 (0.00373) [2022-07-10 02:59:43,242][26022] Updated weights on worker 0-0, policy_version 537796 (0.00086) [2022-07-10 02:59:45,174][26022] Updated weights on worker 0-0, policy_version 537806 (0.00092) [2022-07-10 02:59:45,229][25689] Fps is (10 sec: 5605.1, 60 sec: 5657.8, 300 sec: 5661.0). Total num frames: 550713344. Throughput: 0: 5141.6. Samples: 550709320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 02:59:45,230][25689] Avg episode reward: [(0, '-26.849')] [2022-07-10 02:59:46,802][26022] Updated weights on worker 0-0, policy_version 537816 (0.00084) [2022-07-10 02:59:48,699][26022] Updated weights on worker 0-0, policy_version 537826 (0.00092) [2022-07-10 02:59:50,241][25689] Fps is (10 sec: 5718.4, 60 sec: 5693.3, 300 sec: 5664.3). Total num frames: 550743040. Throughput: 0: 6008.1. Samples: 550743816. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 02:59:50,242][25689] Avg episode reward: [(0, '-27.930')] [2022-07-10 02:59:50,407][26022] Updated weights on worker 0-0, policy_version 537836 (0.00090) [2022-07-10 02:59:52,231][26022] Updated weights on worker 0-0, policy_version 537846 (0.00091) [2022-07-10 02:59:53,960][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 02:59:53,979][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000537855_550763520.pth [2022-07-10 02:59:53,980][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000535861_548721664.pth [2022-07-10 02:59:54,189][26022] Updated weights on worker 0-0, policy_version 537856 (0.00084) [2022-07-10 02:59:55,247][25689] Fps is (10 sec: 5724.4, 60 sec: 5676.8, 300 sec: 5666.1). Total num frames: 550770688. Throughput: 0: 5975.7. Samples: 550777752. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 02:59:55,250][25689] Avg episode reward: [(0, '-27.997')] [2022-07-10 02:59:55,755][26022] Updated weights on worker 0-0, policy_version 537866 (0.00090) [2022-07-10 02:59:57,759][26022] Updated weights on worker 0-0, policy_version 537876 (0.00094) [2022-07-10 02:59:59,501][26022] Updated weights on worker 0-0, policy_version 537886 (0.00092) [2022-07-10 03:00:00,310][25689] Fps is (10 sec: 5593.8, 60 sec: 5675.1, 300 sec: 5669.6). Total num frames: 550799360. Throughput: 0: 5091.9. Samples: 550794560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:00,310][25689] Avg episode reward: [(0, '-29.032')] [2022-07-10 03:00:01,305][26022] Updated weights on worker 0-0, policy_version 537896 (0.00087) [2022-07-10 03:00:03,456][26022] Updated weights on worker 0-0, policy_version 537906 (0.00087) [2022-07-10 03:00:05,324][25689] Fps is (10 sec: 5385.4, 60 sec: 5677.1, 300 sec: 5660.9). Total num frames: 550824960. Throughput: 0: 5824.8. Samples: 550826370. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:05,325][25689] Avg episode reward: [(0, '-28.964')] [2022-07-10 03:00:05,399][26022] Updated weights on worker 0-0, policy_version 537916 (0.00101) [2022-07-10 03:00:07,155][26022] Updated weights on worker 0-0, policy_version 537926 (0.00091) [2022-07-10 03:00:09,127][26022] Updated weights on worker 0-0, policy_version 537936 (0.00088) [2022-07-10 03:00:10,342][25689] Fps is (10 sec: 5511.9, 60 sec: 5676.2, 300 sec: 5661.7). Total num frames: 550854656. Throughput: 0: 5783.1. Samples: 550860058. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:10,343][25689] Avg episode reward: [(0, '-28.654')] [2022-07-10 03:00:10,956][26022] Updated weights on worker 0-0, policy_version 537946 (0.00090) [2022-07-10 03:00:12,601][26022] Updated weights on worker 0-0, policy_version 537956 (0.00085) [2022-07-10 03:00:14,509][26022] Updated weights on worker 0-0, policy_version 537966 (0.00086) [2022-07-10 03:00:15,356][25689] Fps is (10 sec: 5614.3, 60 sec: 5643.1, 300 sec: 5658.6). Total num frames: 550881280. Throughput: 0: 4933.4. Samples: 550876958. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:15,357][25689] Avg episode reward: [(0, '-28.169')] [2022-07-10 03:00:16,158][26022] Updated weights on worker 0-0, policy_version 537976 (0.00088) [2022-07-10 03:00:18,041][26022] Updated weights on worker 0-0, policy_version 537986 (0.00086) [2022-07-10 03:00:20,006][26022] Updated weights on worker 0-0, policy_version 537996 (0.00084) [2022-07-10 03:00:20,487][25689] Fps is (10 sec: 5551.4, 60 sec: 5635.1, 300 sec: 5659.9). Total num frames: 550910976. Throughput: 0: 5778.0. Samples: 550911144. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:20,487][25689] Avg episode reward: [(0, '-28.783')] [2022-07-10 03:00:21,548][26022] Updated weights on worker 0-0, policy_version 538006 (0.00092) [2022-07-10 03:00:23,606][26022] Updated weights on worker 0-0, policy_version 538016 (0.00094) [2022-07-10 03:00:25,268][26022] Updated weights on worker 0-0, policy_version 538026 (0.00089) [2022-07-10 03:00:25,556][25689] Fps is (10 sec: 5822.8, 60 sec: 5663.8, 300 sec: 5662.3). Total num frames: 550940672. Throughput: 0: 5872.9. Samples: 550945186. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:25,556][25689] Avg episode reward: [(0, '-28.630')] [2022-07-10 03:00:27,303][26022] Updated weights on worker 0-0, policy_version 538036 (0.00084) [2022-07-10 03:00:28,965][26022] Updated weights on worker 0-0, policy_version 538046 (0.00084) [2022-07-10 03:00:30,575][25689] Fps is (10 sec: 5684.5, 60 sec: 5645.4, 300 sec: 5658.7). Total num frames: 550968320. Throughput: 0: 5040.0. Samples: 550962032. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:30,575][25689] Avg episode reward: [(0, '-29.554')] [2022-07-10 03:00:30,696][26022] Updated weights on worker 0-0, policy_version 538056 (0.00081) [2022-07-10 03:00:32,624][26022] Updated weights on worker 0-0, policy_version 538066 (0.00086) [2022-07-10 03:00:34,340][26022] Updated weights on worker 0-0, policy_version 538076 (0.00095) [2022-07-10 03:00:35,626][25689] Fps is (10 sec: 5491.0, 60 sec: 5607.5, 300 sec: 5655.2). Total num frames: 550995968. Throughput: 0: 5867.3. Samples: 550995890. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:35,627][25689] Avg episode reward: [(0, '-29.873')] [2022-07-10 03:00:36,256][26022] Updated weights on worker 0-0, policy_version 538086 (0.00085) [2022-07-10 03:00:38,064][26022] Updated weights on worker 0-0, policy_version 538096 (0.00100) [2022-07-10 03:00:39,828][26022] Updated weights on worker 0-0, policy_version 538106 (0.00087) [2022-07-10 03:00:40,702][25689] Fps is (10 sec: 5561.4, 60 sec: 5603.5, 300 sec: 5660.8). Total num frames: 551024640. Throughput: 0: 5876.4. Samples: 551029934. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:40,702][25689] Avg episode reward: [(0, '-29.709')] [2022-07-10 03:00:41,556][26022] Updated weights on worker 0-0, policy_version 538116 (0.00092) [2022-07-10 03:00:43,575][26022] Updated weights on worker 0-0, policy_version 538126 (0.00095) [2022-07-10 03:00:45,116][26022] Updated weights on worker 0-0, policy_version 538136 (0.00092) [2022-07-10 03:00:45,705][25689] Fps is (10 sec: 5689.3, 60 sec: 5621.5, 300 sec: 5654.0). Total num frames: 551053312. Throughput: 0: 5039.5. Samples: 551046730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:45,706][25689] Avg episode reward: [(0, '-30.091')] [2022-07-10 03:00:47,381][26022] Updated weights on worker 0-0, policy_version 538146 (0.00079) [2022-07-10 03:00:48,825][26022] Updated weights on worker 0-0, policy_version 538156 (0.00086) [2022-07-10 03:00:50,724][25689] Fps is (10 sec: 5619.5, 60 sec: 5587.1, 300 sec: 5653.9). Total num frames: 551080960. Throughput: 0: 5884.3. Samples: 551080596. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:50,725][25689] Avg episode reward: [(0, '-28.872')] [2022-07-10 03:00:50,932][26022] Updated weights on worker 0-0, policy_version 538166 (0.00087) [2022-07-10 03:00:52,492][26022] Updated weights on worker 0-0, policy_version 538176 (0.00090) [2022-07-10 03:00:54,442][26022] Updated weights on worker 0-0, policy_version 538186 (0.00086) [2022-07-10 03:00:55,744][25689] Fps is (10 sec: 5712.5, 60 sec: 5619.6, 300 sec: 5656.2). Total num frames: 551110656. Throughput: 0: 5903.4. Samples: 551114652. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:00:55,746][25689] Avg episode reward: [(0, '-26.726')] [2022-07-10 03:00:56,181][26022] Updated weights on worker 0-0, policy_version 538196 (0.00071) [2022-07-10 03:00:58,053][26022] Updated weights on worker 0-0, policy_version 538206 (0.00093) [2022-07-10 03:00:59,708][26022] Updated weights on worker 0-0, policy_version 538216 (0.00083) [2022-07-10 03:01:00,848][25689] Fps is (10 sec: 5664.0, 60 sec: 5598.8, 300 sec: 5664.7). Total num frames: 551138304. Throughput: 0: 5908.4. Samples: 551148968. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:00,849][25689] Avg episode reward: [(0, '-27.334')] [2022-07-10 03:01:01,871][26022] Updated weights on worker 0-0, policy_version 538226 (0.00093) [2022-07-10 03:01:03,476][26022] Updated weights on worker 0-0, policy_version 538236 (0.00083) [2022-07-10 03:01:05,647][26022] Updated weights on worker 0-0, policy_version 538246 (0.00094) [2022-07-10 03:01:05,866][25689] Fps is (10 sec: 5361.8, 60 sec: 5615.5, 300 sec: 5654.7). Total num frames: 551164928. Throughput: 0: 5788.3. Samples: 551163424. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:05,866][25689] Avg episode reward: [(0, '-27.437')] [2022-07-10 03:01:07,435][26022] Updated weights on worker 0-0, policy_version 538256 (0.00501) [2022-07-10 03:01:09,116][26022] Updated weights on worker 0-0, policy_version 538266 (0.00088) [2022-07-10 03:01:10,875][25689] Fps is (10 sec: 5514.9, 60 sec: 5599.3, 300 sec: 5651.6). Total num frames: 551193600. Throughput: 0: 5811.8. Samples: 551197710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:10,875][25689] Avg episode reward: [(0, '-26.808')] [2022-07-10 03:01:11,005][26022] Updated weights on worker 0-0, policy_version 538276 (0.00086) [2022-07-10 03:01:12,680][26022] Updated weights on worker 0-0, policy_version 538286 (0.00085) [2022-07-10 03:01:14,629][26022] Updated weights on worker 0-0, policy_version 538296 (0.00092) [2022-07-10 03:01:15,891][25689] Fps is (10 sec: 5720.2, 60 sec: 5633.0, 300 sec: 5653.4). Total num frames: 551222272. Throughput: 0: 4976.5. Samples: 551214914. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:15,891][25689] Avg episode reward: [(0, '-26.428')] [2022-07-10 03:01:16,355][26022] Updated weights on worker 0-0, policy_version 538306 (0.00082) [2022-07-10 03:01:18,117][26022] Updated weights on worker 0-0, policy_version 538316 (0.00088) [2022-07-10 03:01:19,799][26022] Updated weights on worker 0-0, policy_version 538326 (0.00090) [2022-07-10 03:01:20,985][25689] Fps is (10 sec: 5672.0, 60 sec: 5619.5, 300 sec: 5659.0). Total num frames: 551250944. Throughput: 0: 4984.3. Samples: 551249336. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:20,985][25689] Avg episode reward: [(0, '-25.907')] [2022-07-10 03:01:21,773][26022] Updated weights on worker 0-0, policy_version 538336 (0.00091) [2022-07-10 03:01:23,303][26022] Updated weights on worker 0-0, policy_version 538346 (0.00088) [2022-07-10 03:01:25,274][26022] Updated weights on worker 0-0, policy_version 538356 (0.00090) [2022-07-10 03:01:25,987][25689] Fps is (10 sec: 5781.0, 60 sec: 5625.7, 300 sec: 5659.6). Total num frames: 551280640. Throughput: 0: 5999.0. Samples: 551284132. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:25,988][25689] Avg episode reward: [(0, '-26.556')] [2022-07-10 03:01:26,962][26022] Updated weights on worker 0-0, policy_version 538366 (0.00089) [2022-07-10 03:01:28,770][26022] Updated weights on worker 0-0, policy_version 538376 (0.00094) [2022-07-10 03:01:30,692][26022] Updated weights on worker 0-0, policy_version 538386 (0.00095) [2022-07-10 03:01:31,011][25689] Fps is (10 sec: 5719.5, 60 sec: 5625.2, 300 sec: 5657.5). Total num frames: 551308288. Throughput: 0: 5997.4. Samples: 551318474. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:31,011][25689] Avg episode reward: [(0, '-26.889')] [2022-07-10 03:01:32,339][26022] Updated weights on worker 0-0, policy_version 538396 (0.00088) [2022-07-10 03:01:34,303][26022] Updated weights on worker 0-0, policy_version 538406 (0.00086) [2022-07-10 03:01:35,872][26022] Updated weights on worker 0-0, policy_version 538416 (0.00082) [2022-07-10 03:01:36,023][25689] Fps is (10 sec: 5714.1, 60 sec: 5662.8, 300 sec: 5653.2). Total num frames: 551337984. Throughput: 0: 5984.6. Samples: 551335396. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:36,023][25689] Avg episode reward: [(0, '-26.482')] [2022-07-10 03:01:37,790][26022] Updated weights on worker 0-0, policy_version 538426 (0.00089) [2022-07-10 03:01:39,563][26022] Updated weights on worker 0-0, policy_version 538436 (0.00084) [2022-07-10 03:01:41,086][25689] Fps is (10 sec: 5793.2, 60 sec: 5663.9, 300 sec: 5659.9). Total num frames: 551366656. Throughput: 0: 5995.5. Samples: 551369854. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:41,088][25689] Avg episode reward: [(0, '-26.536')] [2022-07-10 03:01:41,417][26022] Updated weights on worker 0-0, policy_version 538446 (0.00088) [2022-07-10 03:01:43,186][26022] Updated weights on worker 0-0, policy_version 538456 (0.00089) [2022-07-10 03:01:44,918][26022] Updated weights on worker 0-0, policy_version 538466 (0.00097) [2022-07-10 03:01:46,139][25689] Fps is (10 sec: 5668.6, 60 sec: 5659.4, 300 sec: 5656.8). Total num frames: 551395328. Throughput: 0: 5964.0. Samples: 551404316. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:46,141][25689] Avg episode reward: [(0, '-28.677')] [2022-07-10 03:01:46,615][26022] Updated weights on worker 0-0, policy_version 538476 (0.00091) [2022-07-10 03:01:48,422][26022] Updated weights on worker 0-0, policy_version 538486 (0.00086) [2022-07-10 03:01:50,267][26022] Updated weights on worker 0-0, policy_version 538496 (0.00086) [2022-07-10 03:01:51,150][25689] Fps is (10 sec: 5799.9, 60 sec: 5694.0, 300 sec: 5661.9). Total num frames: 551425024. Throughput: 0: 5120.7. Samples: 551421600. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:51,152][25689] Avg episode reward: [(0, '-27.902')] [2022-07-10 03:01:52,185][26022] Updated weights on worker 0-0, policy_version 538506 (0.00088) [2022-07-10 03:01:53,727][26022] Updated weights on worker 0-0, policy_version 538516 (0.00105) [2022-07-10 03:01:54,034][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:01:54,050][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000538518_551442432.pth [2022-07-10 03:01:54,050][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000536525_549401600.pth [2022-07-10 03:01:55,617][26022] Updated weights on worker 0-0, policy_version 538526 (0.00196) [2022-07-10 03:01:56,155][25689] Fps is (10 sec: 5827.6, 60 sec: 5678.5, 300 sec: 5659.6). Total num frames: 551453696. Throughput: 0: 6001.9. Samples: 551456226. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:01:56,155][25689] Avg episode reward: [(0, '-27.210')] [2022-07-10 03:01:57,563][26022] Updated weights on worker 0-0, policy_version 538536 (0.00094) [2022-07-10 03:01:59,085][26022] Updated weights on worker 0-0, policy_version 538546 (0.00094) [2022-07-10 03:02:01,008][26022] Updated weights on worker 0-0, policy_version 538556 (0.00093) [2022-07-10 03:02:01,252][25689] Fps is (10 sec: 5777.9, 60 sec: 5713.0, 300 sec: 5668.3). Total num frames: 551483392. Throughput: 0: 6000.4. Samples: 551490856. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:01,252][25689] Avg episode reward: [(0, '-28.289')] [2022-07-10 03:02:02,987][26022] Updated weights on worker 0-0, policy_version 538566 (0.00088) [2022-07-10 03:02:04,901][26022] Updated weights on worker 0-0, policy_version 538576 (0.00086) [2022-07-10 03:02:06,311][25689] Fps is (10 sec: 5545.0, 60 sec: 5709.1, 300 sec: 5664.4). Total num frames: 551510016. Throughput: 0: 5019.9. Samples: 551505580. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:06,312][25689] Avg episode reward: [(0, '-27.845')] [2022-07-10 03:02:06,805][26022] Updated weights on worker 0-0, policy_version 538586 (0.00085) [2022-07-10 03:02:08,552][26022] Updated weights on worker 0-0, policy_version 538596 (0.00081) [2022-07-10 03:02:10,227][26022] Updated weights on worker 0-0, policy_version 538606 (0.00085) [2022-07-10 03:02:11,361][25689] Fps is (10 sec: 5368.6, 60 sec: 5688.3, 300 sec: 5660.2). Total num frames: 551537664. Throughput: 0: 5843.8. Samples: 551539710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:11,362][25689] Avg episode reward: [(0, '-28.434')] [2022-07-10 03:02:12,241][26022] Updated weights on worker 0-0, policy_version 538616 (0.00086) [2022-07-10 03:02:13,801][26022] Updated weights on worker 0-0, policy_version 538626 (0.00095) [2022-07-10 03:02:15,814][26022] Updated weights on worker 0-0, policy_version 538636 (0.00079) [2022-07-10 03:02:16,393][25689] Fps is (10 sec: 5586.2, 60 sec: 5686.8, 300 sec: 5657.0). Total num frames: 551566336. Throughput: 0: 5843.4. Samples: 551574490. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:16,394][25689] Avg episode reward: [(0, '-26.219')] [2022-07-10 03:02:17,361][26022] Updated weights on worker 0-0, policy_version 538646 (0.00089) [2022-07-10 03:02:19,197][26022] Updated weights on worker 0-0, policy_version 538656 (0.00086) [2022-07-10 03:02:21,083][26022] Updated weights on worker 0-0, policy_version 538666 (0.00079) [2022-07-10 03:02:21,494][25689] Fps is (10 sec: 5659.0, 60 sec: 5686.1, 300 sec: 5659.1). Total num frames: 551595008. Throughput: 0: 4982.6. Samples: 551591716. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:21,496][25689] Avg episode reward: [(0, '-26.641')] [2022-07-10 03:02:22,872][26022] Updated weights on worker 0-0, policy_version 538676 (0.00088) [2022-07-10 03:02:24,596][26022] Updated weights on worker 0-0, policy_version 538686 (0.00094) [2022-07-10 03:02:26,488][26022] Updated weights on worker 0-0, policy_version 538696 (0.00094) [2022-07-10 03:02:26,561][25689] Fps is (10 sec: 5740.4, 60 sec: 5680.0, 300 sec: 5661.3). Total num frames: 551624704. Throughput: 0: 5959.8. Samples: 551626266. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:26,562][25689] Avg episode reward: [(0, '-26.335')] [2022-07-10 03:02:28,104][26022] Updated weights on worker 0-0, policy_version 538706 (0.00088) [2022-07-10 03:02:30,041][26022] Updated weights on worker 0-0, policy_version 538716 (0.00087) [2022-07-10 03:02:31,646][25689] Fps is (10 sec: 5749.9, 60 sec: 5691.3, 300 sec: 5660.4). Total num frames: 551653376. Throughput: 0: 5951.1. Samples: 551660424. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:31,646][25689] Avg episode reward: [(0, '-27.608')] [2022-07-10 03:02:31,810][26022] Updated weights on worker 0-0, policy_version 538726 (0.00100) [2022-07-10 03:02:33,663][26022] Updated weights on worker 0-0, policy_version 538736 (0.00091) [2022-07-10 03:02:35,471][26022] Updated weights on worker 0-0, policy_version 538746 (0.00091) [2022-07-10 03:02:36,729][25689] Fps is (10 sec: 5639.8, 60 sec: 5667.7, 300 sec: 5656.5). Total num frames: 551682048. Throughput: 0: 5053.6. Samples: 551677258. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 03:02:36,729][25689] Avg episode reward: [(0, '-27.164')] [2022-07-10 03:02:37,319][26022] Updated weights on worker 0-0, policy_version 538756 (0.00095) [2022-07-10 03:02:38,886][26022] Updated weights on worker 0-0, policy_version 538766 (0.00089) [2022-07-10 03:02:40,944][26022] Updated weights on worker 0-0, policy_version 538776 (0.00096) [2022-07-10 03:02:41,778][25689] Fps is (10 sec: 5861.7, 60 sec: 5702.8, 300 sec: 5663.0). Total num frames: 551712768. Throughput: 0: 5906.1. Samples: 551711512. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:02:41,778][25689] Avg episode reward: [(0, '-28.393')] [2022-07-10 03:02:42,625][26022] Updated weights on worker 0-0, policy_version 538786 (0.00080) [2022-07-10 03:02:44,412][26022] Updated weights on worker 0-0, policy_version 538796 (0.00089) [2022-07-10 03:02:46,242][26022] Updated weights on worker 0-0, policy_version 538806 (0.00094) [2022-07-10 03:02:46,841][25689] Fps is (10 sec: 5670.9, 60 sec: 5668.1, 300 sec: 5661.9). Total num frames: 551739392. Throughput: 0: 5896.5. Samples: 551745846. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:02:46,841][25689] Avg episode reward: [(0, '-28.315')] [2022-07-10 03:02:47,978][26022] Updated weights on worker 0-0, policy_version 538816 (0.00086) [2022-07-10 03:02:49,857][26022] Updated weights on worker 0-0, policy_version 538826 (0.00087) [2022-07-10 03:02:51,522][26022] Updated weights on worker 0-0, policy_version 538836 (0.00087) [2022-07-10 03:02:51,847][25689] Fps is (10 sec: 5593.5, 60 sec: 5668.6, 300 sec: 5662.0). Total num frames: 551769088. Throughput: 0: 5081.9. Samples: 551763084. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:02:51,847][25689] Avg episode reward: [(0, '-27.721')] [2022-07-10 03:02:53,423][26022] Updated weights on worker 0-0, policy_version 538846 (0.00086) [2022-07-10 03:02:55,195][26022] Updated weights on worker 0-0, policy_version 538856 (0.00091) [2022-07-10 03:02:56,853][25689] Fps is (10 sec: 5932.0, 60 sec: 5685.3, 300 sec: 5663.3). Total num frames: 551798784. Throughput: 0: 5975.5. Samples: 551797510. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:02:56,854][25689] Avg episode reward: [(0, '-27.867')] [2022-07-10 03:02:56,856][26022] Updated weights on worker 0-0, policy_version 538866 (0.00092) [2022-07-10 03:02:58,847][26022] Updated weights on worker 0-0, policy_version 538876 (0.00087) [2022-07-10 03:03:00,384][26022] Updated weights on worker 0-0, policy_version 538886 (0.00082) [2022-07-10 03:03:01,893][25689] Fps is (10 sec: 5605.9, 60 sec: 5640.0, 300 sec: 5666.8). Total num frames: 551825408. Throughput: 0: 5966.6. Samples: 551831532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:01,894][25689] Avg episode reward: [(0, '-27.031')] [2022-07-10 03:03:02,722][26022] Updated weights on worker 0-0, policy_version 538896 (0.00093) [2022-07-10 03:03:04,599][26022] Updated weights on worker 0-0, policy_version 538906 (0.00089) [2022-07-10 03:03:06,327][26022] Updated weights on worker 0-0, policy_version 538916 (0.00089) [2022-07-10 03:03:06,969][25689] Fps is (10 sec: 5365.1, 60 sec: 5655.4, 300 sec: 5662.0). Total num frames: 551853056. Throughput: 0: 4987.4. Samples: 551846232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:06,969][25689] Avg episode reward: [(0, '-26.882')] [2022-07-10 03:03:08,220][26022] Updated weights on worker 0-0, policy_version 538926 (0.00086) [2022-07-10 03:03:09,909][26022] Updated weights on worker 0-0, policy_version 538936 (0.00086) [2022-07-10 03:03:11,769][26022] Updated weights on worker 0-0, policy_version 538946 (0.00084) [2022-07-10 03:03:12,004][25689] Fps is (10 sec: 5570.2, 60 sec: 5673.6, 300 sec: 5658.0). Total num frames: 551881728. Throughput: 0: 5813.7. Samples: 551880274. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:12,005][25689] Avg episode reward: [(0, '-25.743')] [2022-07-10 03:03:13,603][26022] Updated weights on worker 0-0, policy_version 538956 (0.00085) [2022-07-10 03:03:15,237][26022] Updated weights on worker 0-0, policy_version 538966 (0.00088) [2022-07-10 03:03:17,015][25689] Fps is (10 sec: 5708.2, 60 sec: 5675.6, 300 sec: 5656.6). Total num frames: 551910400. Throughput: 0: 5829.1. Samples: 551915034. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:17,015][25689] Avg episode reward: [(0, '-26.379')] [2022-07-10 03:03:17,112][26022] Updated weights on worker 0-0, policy_version 538976 (0.00092) [2022-07-10 03:03:18,799][26022] Updated weights on worker 0-0, policy_version 538986 (0.00087) [2022-07-10 03:03:20,647][26022] Updated weights on worker 0-0, policy_version 538996 (0.00092) [2022-07-10 03:03:22,065][25689] Fps is (10 sec: 5801.6, 60 sec: 5697.3, 300 sec: 5660.8). Total num frames: 551940096. Throughput: 0: 5000.9. Samples: 551932408. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:22,067][25689] Avg episode reward: [(0, '-26.428')] [2022-07-10 03:03:22,413][26022] Updated weights on worker 0-0, policy_version 539006 (0.00089) [2022-07-10 03:03:24,219][26022] Updated weights on worker 0-0, policy_version 539016 (0.00085) [2022-07-10 03:03:26,174][26022] Updated weights on worker 0-0, policy_version 539026 (0.00088) [2022-07-10 03:03:27,082][25689] Fps is (10 sec: 5797.7, 60 sec: 5685.0, 300 sec: 5665.9). Total num frames: 551968768. Throughput: 0: 6011.6. Samples: 551967146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:27,083][25689] Avg episode reward: [(0, '-26.528')] [2022-07-10 03:03:27,887][26022] Updated weights on worker 0-0, policy_version 539036 (0.00083) [2022-07-10 03:03:29,698][26022] Updated weights on worker 0-0, policy_version 539046 (0.00086) [2022-07-10 03:03:31,546][26022] Updated weights on worker 0-0, policy_version 539056 (0.00083) [2022-07-10 03:03:32,093][25689] Fps is (10 sec: 5718.4, 60 sec: 5692.0, 300 sec: 5663.0). Total num frames: 551997440. Throughput: 0: 6023.2. Samples: 552001272. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:32,093][25689] Avg episode reward: [(0, '-28.473')] [2022-07-10 03:03:33,291][26022] Updated weights on worker 0-0, policy_version 539066 (0.00084) [2022-07-10 03:03:35,072][26022] Updated weights on worker 0-0, policy_version 539076 (0.00092) [2022-07-10 03:03:36,826][26022] Updated weights on worker 0-0, policy_version 539086 (0.00092) [2022-07-10 03:03:37,128][25689] Fps is (10 sec: 5606.1, 60 sec: 5679.5, 300 sec: 5661.1). Total num frames: 552025088. Throughput: 0: 5132.9. Samples: 552018276. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:37,129][25689] Avg episode reward: [(0, '-27.600')] [2022-07-10 03:03:38,716][26022] Updated weights on worker 0-0, policy_version 539096 (0.00090) [2022-07-10 03:03:40,607][26022] Updated weights on worker 0-0, policy_version 539106 (0.00089) [2022-07-10 03:03:42,181][25689] Fps is (10 sec: 5582.9, 60 sec: 5645.3, 300 sec: 5657.1). Total num frames: 552053760. Throughput: 0: 5947.9. Samples: 552052056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:42,182][25689] Avg episode reward: [(0, '-27.044')] [2022-07-10 03:03:42,295][26022] Updated weights on worker 0-0, policy_version 539116 (0.00085) [2022-07-10 03:03:44,146][26022] Updated weights on worker 0-0, policy_version 539126 (0.00085) [2022-07-10 03:03:45,779][26022] Updated weights on worker 0-0, policy_version 539136 (0.00084) [2022-07-10 03:03:47,262][25689] Fps is (10 sec: 5557.6, 60 sec: 5660.5, 300 sec: 5656.1). Total num frames: 552081408. Throughput: 0: 5906.1. Samples: 552086332. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:47,263][25689] Avg episode reward: [(0, '-27.180')] [2022-07-10 03:03:47,617][26022] Updated weights on worker 0-0, policy_version 539146 (0.00091) [2022-07-10 03:03:49,542][26022] Updated weights on worker 0-0, policy_version 539156 (0.00087) [2022-07-10 03:03:51,363][26022] Updated weights on worker 0-0, policy_version 539166 (0.00092) [2022-07-10 03:03:52,306][25689] Fps is (10 sec: 5663.6, 60 sec: 5657.0, 300 sec: 5658.9). Total num frames: 552111104. Throughput: 0: 5885.8. Samples: 552120240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:52,307][25689] Avg episode reward: [(0, '-26.842')] [2022-07-10 03:03:53,221][26022] Updated weights on worker 0-0, policy_version 539176 (0.00084) [2022-07-10 03:03:54,183][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:03:54,205][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000539181_552121344.pth [2022-07-10 03:03:54,206][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000537189_550081536.pth [2022-07-10 03:03:54,746][26022] Updated weights on worker 0-0, policy_version 539186 (0.00089) [2022-07-10 03:03:56,771][26022] Updated weights on worker 0-0, policy_version 539196 (0.00084) [2022-07-10 03:03:57,317][25689] Fps is (10 sec: 5804.7, 60 sec: 5639.6, 300 sec: 5659.5). Total num frames: 552139776. Throughput: 0: 5888.1. Samples: 552137152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:03:57,318][25689] Avg episode reward: [(0, '-26.239')] [2022-07-10 03:03:58,564][26022] Updated weights on worker 0-0, policy_version 539206 (0.00083) [2022-07-10 03:04:00,386][26022] Updated weights on worker 0-0, policy_version 539216 (0.00088) [2022-07-10 03:04:02,365][25689] Fps is (10 sec: 5497.0, 60 sec: 5638.9, 300 sec: 5662.8). Total num frames: 552166400. Throughput: 0: 5915.4. Samples: 552171454. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:02,366][25689] Avg episode reward: [(0, '-25.460')] [2022-07-10 03:04:02,446][26022] Updated weights on worker 0-0, policy_version 539226 (0.00090) [2022-07-10 03:04:04,292][26022] Updated weights on worker 0-0, policy_version 539236 (0.00095) [2022-07-10 03:04:06,169][26022] Updated weights on worker 0-0, policy_version 539246 (0.00078) [2022-07-10 03:04:07,385][25689] Fps is (10 sec: 5492.6, 60 sec: 5661.0, 300 sec: 5659.1). Total num frames: 552195072. Throughput: 0: 5809.4. Samples: 552203232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:07,386][25689] Avg episode reward: [(0, '-26.234')] [2022-07-10 03:04:07,887][26022] Updated weights on worker 0-0, policy_version 539256 (0.00347) [2022-07-10 03:04:09,724][26022] Updated weights on worker 0-0, policy_version 539266 (0.00084) [2022-07-10 03:04:11,746][26022] Updated weights on worker 0-0, policy_version 539276 (0.00084) [2022-07-10 03:04:12,400][25689] Fps is (10 sec: 5612.3, 60 sec: 5645.9, 300 sec: 5655.8). Total num frames: 552222720. Throughput: 0: 4980.0. Samples: 552220310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:12,402][25689] Avg episode reward: [(0, '-27.445')] [2022-07-10 03:04:13,317][26022] Updated weights on worker 0-0, policy_version 539286 (0.00087) [2022-07-10 03:04:15,245][26022] Updated weights on worker 0-0, policy_version 539296 (0.00049) [2022-07-10 03:04:16,784][26022] Updated weights on worker 0-0, policy_version 539306 (0.00090) [2022-07-10 03:04:17,406][25689] Fps is (10 sec: 5620.0, 60 sec: 5646.4, 300 sec: 5653.1). Total num frames: 552251392. Throughput: 0: 5850.3. Samples: 552254676. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:17,407][25689] Avg episode reward: [(0, '-27.660')] [2022-07-10 03:04:18,900][26022] Updated weights on worker 0-0, policy_version 539316 (0.00082) [2022-07-10 03:04:20,535][26022] Updated weights on worker 0-0, policy_version 539326 (0.00093) [2022-07-10 03:04:22,446][26022] Updated weights on worker 0-0, policy_version 539336 (0.00092) [2022-07-10 03:04:22,481][25689] Fps is (10 sec: 5688.2, 60 sec: 5627.1, 300 sec: 5655.4). Total num frames: 552280064. Throughput: 0: 5846.5. Samples: 552289064. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:22,482][25689] Avg episode reward: [(0, '-28.609')] [2022-07-10 03:04:24,139][26022] Updated weights on worker 0-0, policy_version 539346 (0.00083) [2022-07-10 03:04:25,908][26022] Updated weights on worker 0-0, policy_version 539356 (0.00096) [2022-07-10 03:04:27,524][25689] Fps is (10 sec: 5768.7, 60 sec: 5641.6, 300 sec: 5658.1). Total num frames: 552309760. Throughput: 0: 5107.6. Samples: 552306096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:27,525][25689] Avg episode reward: [(0, '-28.868')] [2022-07-10 03:04:27,698][26022] Updated weights on worker 0-0, policy_version 539366 (0.00092) [2022-07-10 03:04:29,574][26022] Updated weights on worker 0-0, policy_version 539376 (0.00090) [2022-07-10 03:04:31,292][26022] Updated weights on worker 0-0, policy_version 539386 (0.00081) [2022-07-10 03:04:32,536][25689] Fps is (10 sec: 5804.9, 60 sec: 5641.5, 300 sec: 5654.5). Total num frames: 552338432. Throughput: 0: 5941.2. Samples: 552339944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:32,538][25689] Avg episode reward: [(0, '-28.665')] [2022-07-10 03:04:33,287][26022] Updated weights on worker 0-0, policy_version 539396 (0.00086) [2022-07-10 03:04:34,973][26022] Updated weights on worker 0-0, policy_version 539406 (0.00092) [2022-07-10 03:04:36,975][26022] Updated weights on worker 0-0, policy_version 539416 (0.00088) [2022-07-10 03:04:37,576][25689] Fps is (10 sec: 5602.9, 60 sec: 5641.1, 300 sec: 5651.0). Total num frames: 552366080. Throughput: 0: 5922.5. Samples: 552374132. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:37,577][25689] Avg episode reward: [(0, '-28.764')] [2022-07-10 03:04:38,723][26022] Updated weights on worker 0-0, policy_version 539426 (0.00088) [2022-07-10 03:04:40,571][26022] Updated weights on worker 0-0, policy_version 539436 (0.00085) [2022-07-10 03:04:42,376][26022] Updated weights on worker 0-0, policy_version 539446 (0.00093) [2022-07-10 03:04:42,624][25689] Fps is (10 sec: 5583.1, 60 sec: 5641.5, 300 sec: 5653.8). Total num frames: 552394752. Throughput: 0: 5065.6. Samples: 552391098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:42,626][25689] Avg episode reward: [(0, '-27.909')] [2022-07-10 03:04:44,077][26022] Updated weights on worker 0-0, policy_version 539456 (0.00086) [2022-07-10 03:04:45,821][26022] Updated weights on worker 0-0, policy_version 539466 (0.00087) [2022-07-10 03:04:47,552][26022] Updated weights on worker 0-0, policy_version 539476 (0.00090) [2022-07-10 03:04:47,639][25689] Fps is (10 sec: 5698.3, 60 sec: 5664.6, 300 sec: 5650.3). Total num frames: 552423424. Throughput: 0: 5953.8. Samples: 552425858. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:47,640][25689] Avg episode reward: [(0, '-28.215')] [2022-07-10 03:04:49,383][26022] Updated weights on worker 0-0, policy_version 539486 (0.00879) [2022-07-10 03:04:51,110][26022] Updated weights on worker 0-0, policy_version 539496 (0.00071) [2022-07-10 03:04:52,649][25689] Fps is (10 sec: 5822.3, 60 sec: 5667.8, 300 sec: 5657.1). Total num frames: 552453120. Throughput: 0: 5992.0. Samples: 552460458. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:52,649][25689] Avg episode reward: [(0, '-28.324')] [2022-07-10 03:04:52,771][26022] Updated weights on worker 0-0, policy_version 539506 (0.00099) [2022-07-10 03:04:54,588][26022] Updated weights on worker 0-0, policy_version 539516 (0.00091) [2022-07-10 03:04:56,420][26022] Updated weights on worker 0-0, policy_version 539526 (0.00083) [2022-07-10 03:04:57,669][25689] Fps is (10 sec: 5819.8, 60 sec: 5667.1, 300 sec: 5657.9). Total num frames: 552481792. Throughput: 0: 5154.6. Samples: 552477700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:04:57,669][25689] Avg episode reward: [(0, '-27.425')] [2022-07-10 03:04:58,275][26022] Updated weights on worker 0-0, policy_version 539536 (0.00093) [2022-07-10 03:05:00,035][26022] Updated weights on worker 0-0, policy_version 539546 (0.00085) [2022-07-10 03:05:02,256][26022] Updated weights on worker 0-0, policy_version 539556 (0.00095) [2022-07-10 03:05:02,783][25689] Fps is (10 sec: 5355.2, 60 sec: 5643.8, 300 sec: 5656.0). Total num frames: 552507392. Throughput: 0: 5989.9. Samples: 552511850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:02,784][25689] Avg episode reward: [(0, '-27.579')] [2022-07-10 03:05:04,064][26022] Updated weights on worker 0-0, policy_version 539566 (0.00079) [2022-07-10 03:05:05,939][26022] Updated weights on worker 0-0, policy_version 539576 (0.00083) [2022-07-10 03:05:07,686][26022] Updated weights on worker 0-0, policy_version 539586 (0.00100) [2022-07-10 03:05:07,789][25689] Fps is (10 sec: 5362.7, 60 sec: 5645.2, 300 sec: 5652.8). Total num frames: 552536064. Throughput: 0: 5849.6. Samples: 552543724. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:07,789][25689] Avg episode reward: [(0, '-27.823')] [2022-07-10 03:05:09,579][26022] Updated weights on worker 0-0, policy_version 539596 (0.00091) [2022-07-10 03:05:11,243][26022] Updated weights on worker 0-0, policy_version 539606 (0.00092) [2022-07-10 03:05:12,795][25689] Fps is (10 sec: 5625.6, 60 sec: 5646.1, 300 sec: 5656.4). Total num frames: 552563712. Throughput: 0: 4972.0. Samples: 552560622. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:12,795][25689] Avg episode reward: [(0, '-28.074')] [2022-07-10 03:05:13,195][26022] Updated weights on worker 0-0, policy_version 539616 (0.00085) [2022-07-10 03:05:14,808][26022] Updated weights on worker 0-0, policy_version 539626 (0.00086) [2022-07-10 03:05:16,733][26022] Updated weights on worker 0-0, policy_version 539636 (0.00090) [2022-07-10 03:05:17,857][25689] Fps is (10 sec: 5797.1, 60 sec: 5674.7, 300 sec: 5661.1). Total num frames: 552594432. Throughput: 0: 5805.2. Samples: 552594900. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:17,858][25689] Avg episode reward: [(0, '-27.158')] [2022-07-10 03:05:18,517][26022] Updated weights on worker 0-0, policy_version 539646 (0.00091) [2022-07-10 03:05:20,337][26022] Updated weights on worker 0-0, policy_version 539656 (0.00062) [2022-07-10 03:05:21,980][26022] Updated weights on worker 0-0, policy_version 539666 (0.00086) [2022-07-10 03:05:22,957][25689] Fps is (10 sec: 5743.8, 60 sec: 5655.5, 300 sec: 5653.7). Total num frames: 552622080. Throughput: 0: 5811.7. Samples: 552629090. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:22,958][25689] Avg episode reward: [(0, '-27.452')] [2022-07-10 03:05:23,925][26022] Updated weights on worker 0-0, policy_version 539676 (0.00086) [2022-07-10 03:05:25,748][26022] Updated weights on worker 0-0, policy_version 539686 (0.00094) [2022-07-10 03:05:27,616][26022] Updated weights on worker 0-0, policy_version 539696 (0.00088) [2022-07-10 03:05:27,974][25689] Fps is (10 sec: 5466.1, 60 sec: 5624.0, 300 sec: 5653.7). Total num frames: 552649728. Throughput: 0: 5062.7. Samples: 552645914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:27,976][25689] Avg episode reward: [(0, '-27.317')] [2022-07-10 03:05:29,376][26022] Updated weights on worker 0-0, policy_version 539706 (0.00093) [2022-07-10 03:05:30,999][26022] Updated weights on worker 0-0, policy_version 539716 (0.00084) [2022-07-10 03:05:32,877][26022] Updated weights on worker 0-0, policy_version 539726 (0.00088) [2022-07-10 03:05:33,038][25689] Fps is (10 sec: 5688.2, 60 sec: 5636.1, 300 sec: 5660.4). Total num frames: 552679424. Throughput: 0: 5888.0. Samples: 552679814. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 03:05:33,039][25689] Avg episode reward: [(0, '-26.530')] [2022-07-10 03:05:34,904][26022] Updated weights on worker 0-0, policy_version 539736 (0.00093) [2022-07-10 03:05:36,580][26022] Updated weights on worker 0-0, policy_version 539746 (0.00092) [2022-07-10 03:05:38,091][25689] Fps is (10 sec: 5667.8, 60 sec: 5634.8, 300 sec: 5657.3). Total num frames: 552707072. Throughput: 0: 5877.3. Samples: 552713820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:05:38,093][25689] Avg episode reward: [(0, '-26.824')] [2022-07-10 03:05:38,443][26022] Updated weights on worker 0-0, policy_version 539756 (0.00097) [2022-07-10 03:05:40,109][26022] Updated weights on worker 0-0, policy_version 539766 (0.00084) [2022-07-10 03:05:41,912][26022] Updated weights on worker 0-0, policy_version 539776 (0.00081) [2022-07-10 03:05:43,155][25689] Fps is (10 sec: 5667.9, 60 sec: 5650.2, 300 sec: 5659.6). Total num frames: 552736768. Throughput: 0: 5042.6. Samples: 552730944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:05:43,156][25689] Avg episode reward: [(0, '-27.668')] [2022-07-10 03:05:43,653][26022] Updated weights on worker 0-0, policy_version 539786 (0.00086) [2022-07-10 03:05:45,456][26022] Updated weights on worker 0-0, policy_version 539796 (0.00091) [2022-07-10 03:05:47,319][26022] Updated weights on worker 0-0, policy_version 539806 (0.00093) [2022-07-10 03:05:48,159][25689] Fps is (10 sec: 5695.5, 60 sec: 5634.4, 300 sec: 5659.9). Total num frames: 552764416. Throughput: 0: 5913.6. Samples: 552765284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:05:48,161][25689] Avg episode reward: [(0, '-28.211')] [2022-07-10 03:05:49,108][26022] Updated weights on worker 0-0, policy_version 539816 (0.00086) [2022-07-10 03:05:51,018][26022] Updated weights on worker 0-0, policy_version 539826 (0.00085) [2022-07-10 03:05:52,747][26022] Updated weights on worker 0-0, policy_version 539836 (0.00094) [2022-07-10 03:05:53,191][25689] Fps is (10 sec: 5713.9, 60 sec: 5632.3, 300 sec: 5659.7). Total num frames: 552794112. Throughput: 0: 5938.2. Samples: 552799488. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:05:53,193][25689] Avg episode reward: [(0, '-27.373')] [2022-07-10 03:05:54,408][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:05:54,422][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000539845_552801280.pth [2022-07-10 03:05:54,422][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000537855_550763520.pth [2022-07-10 03:05:54,663][26022] Updated weights on worker 0-0, policy_version 539846 (0.00100) [2022-07-10 03:05:56,407][26022] Updated weights on worker 0-0, policy_version 539856 (0.00092) [2022-07-10 03:05:58,233][25689] Fps is (10 sec: 5692.6, 60 sec: 5613.4, 300 sec: 5660.9). Total num frames: 552821760. Throughput: 0: 5099.0. Samples: 552816522. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:05:58,233][25689] Avg episode reward: [(0, '-27.415')] [2022-07-10 03:05:58,383][26022] Updated weights on worker 0-0, policy_version 539866 (0.00084) [2022-07-10 03:06:00,009][26022] Updated weights on worker 0-0, policy_version 539876 (0.00091) [2022-07-10 03:06:02,369][26022] Updated weights on worker 0-0, policy_version 539886 (0.00084) [2022-07-10 03:06:03,305][25689] Fps is (10 sec: 5366.3, 60 sec: 5634.3, 300 sec: 5659.8). Total num frames: 552848384. Throughput: 0: 5878.7. Samples: 552849396. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:03,305][25689] Avg episode reward: [(0, '-28.167')] [2022-07-10 03:06:03,999][26022] Updated weights on worker 0-0, policy_version 539896 (0.00082) [2022-07-10 03:06:05,915][26022] Updated weights on worker 0-0, policy_version 539906 (0.00089) [2022-07-10 03:06:07,630][26022] Updated weights on worker 0-0, policy_version 539916 (0.00087) [2022-07-10 03:06:08,361][25689] Fps is (10 sec: 5459.4, 60 sec: 5629.5, 300 sec: 5658.9). Total num frames: 552877056. Throughput: 0: 5788.8. Samples: 552882230. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:08,362][25689] Avg episode reward: [(0, '-27.849')] [2022-07-10 03:06:09,463][26022] Updated weights on worker 0-0, policy_version 539926 (0.00096) [2022-07-10 03:06:11,125][26022] Updated weights on worker 0-0, policy_version 539936 (0.01354) [2022-07-10 03:06:13,015][26022] Updated weights on worker 0-0, policy_version 539946 (0.00094) [2022-07-10 03:06:13,387][25689] Fps is (10 sec: 5687.9, 60 sec: 5644.6, 300 sec: 5658.8). Total num frames: 552905728. Throughput: 0: 4938.3. Samples: 552899218. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:13,387][25689] Avg episode reward: [(0, '-28.046')] [2022-07-10 03:06:14,887][26022] Updated weights on worker 0-0, policy_version 539956 (0.00086) [2022-07-10 03:06:16,577][26022] Updated weights on worker 0-0, policy_version 539966 (0.00093) [2022-07-10 03:06:18,323][26022] Updated weights on worker 0-0, policy_version 539976 (0.00082) [2022-07-10 03:06:18,409][25689] Fps is (10 sec: 5809.3, 60 sec: 5631.5, 300 sec: 5663.6). Total num frames: 552935424. Throughput: 0: 5802.5. Samples: 552933592. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:18,409][25689] Avg episode reward: [(0, '-27.198')] [2022-07-10 03:06:20,143][26022] Updated weights on worker 0-0, policy_version 539986 (0.00090) [2022-07-10 03:06:22,040][26022] Updated weights on worker 0-0, policy_version 539996 (0.00080) [2022-07-10 03:06:23,456][25689] Fps is (10 sec: 5796.7, 60 sec: 5653.3, 300 sec: 5659.3). Total num frames: 552964096. Throughput: 0: 5887.5. Samples: 552968036. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:23,456][25689] Avg episode reward: [(0, '-26.712')] [2022-07-10 03:06:23,787][26022] Updated weights on worker 0-0, policy_version 540006 (0.00085) [2022-07-10 03:06:25,673][26022] Updated weights on worker 0-0, policy_version 540016 (0.00094) [2022-07-10 03:06:27,192][26022] Updated weights on worker 0-0, policy_version 540026 (0.00084) [2022-07-10 03:06:28,461][25689] Fps is (10 sec: 5602.5, 60 sec: 5654.3, 300 sec: 5659.6). Total num frames: 552991744. Throughput: 0: 5128.0. Samples: 552985302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:28,462][25689] Avg episode reward: [(0, '-26.254')] [2022-07-10 03:06:29,322][26022] Updated weights on worker 0-0, policy_version 540036 (0.00091) [2022-07-10 03:06:31,084][26022] Updated weights on worker 0-0, policy_version 540046 (0.00088) [2022-07-10 03:06:32,777][26022] Updated weights on worker 0-0, policy_version 540056 (0.00086) [2022-07-10 03:06:33,562][25689] Fps is (10 sec: 5674.2, 60 sec: 5650.9, 300 sec: 5657.9). Total num frames: 553021440. Throughput: 0: 5946.2. Samples: 553019184. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:33,562][25689] Avg episode reward: [(0, '-25.497')] [2022-07-10 03:06:34,711][26022] Updated weights on worker 0-0, policy_version 540066 (0.00088) [2022-07-10 03:06:36,350][26022] Updated weights on worker 0-0, policy_version 540076 (0.00096) [2022-07-10 03:06:38,510][26022] Updated weights on worker 0-0, policy_version 540086 (0.00092) [2022-07-10 03:06:38,631][25689] Fps is (10 sec: 5638.4, 60 sec: 5649.4, 300 sec: 5654.4). Total num frames: 553049088. Throughput: 0: 5911.8. Samples: 553053146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:38,632][25689] Avg episode reward: [(0, '-24.234')] [2022-07-10 03:06:39,928][26022] Updated weights on worker 0-0, policy_version 540096 (0.00088) [2022-07-10 03:06:41,927][26022] Updated weights on worker 0-0, policy_version 540106 (0.00610) [2022-07-10 03:06:43,620][26022] Updated weights on worker 0-0, policy_version 540116 (0.00092) [2022-07-10 03:06:43,703][25689] Fps is (10 sec: 5654.7, 60 sec: 5648.7, 300 sec: 5657.5). Total num frames: 553078784. Throughput: 0: 5900.5. Samples: 553087504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:43,703][25689] Avg episode reward: [(0, '-24.298')] [2022-07-10 03:06:45,383][26022] Updated weights on worker 0-0, policy_version 540126 (0.00083) [2022-07-10 03:06:47,179][26022] Updated weights on worker 0-0, policy_version 540136 (0.00092) [2022-07-10 03:06:48,756][25689] Fps is (10 sec: 5866.1, 60 sec: 5677.9, 300 sec: 5656.7). Total num frames: 553108480. Throughput: 0: 5896.8. Samples: 553104976. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:48,757][25689] Avg episode reward: [(0, '-24.835')] [2022-07-10 03:06:48,921][26022] Updated weights on worker 0-0, policy_version 540146 (0.00086) [2022-07-10 03:06:50,611][26022] Updated weights on worker 0-0, policy_version 540156 (0.00098) [2022-07-10 03:06:52,664][26022] Updated weights on worker 0-0, policy_version 540166 (0.00081) [2022-07-10 03:06:53,781][25689] Fps is (10 sec: 5893.0, 60 sec: 5678.6, 300 sec: 5659.7). Total num frames: 553138176. Throughput: 0: 5951.4. Samples: 553139518. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:53,782][25689] Avg episode reward: [(0, '-24.764')] [2022-07-10 03:06:54,272][26022] Updated weights on worker 0-0, policy_version 540176 (0.00090) [2022-07-10 03:06:56,167][26022] Updated weights on worker 0-0, policy_version 540186 (0.00095) [2022-07-10 03:06:57,920][26022] Updated weights on worker 0-0, policy_version 540196 (0.00088) [2022-07-10 03:06:58,813][25689] Fps is (10 sec: 5599.9, 60 sec: 5662.5, 300 sec: 5650.6). Total num frames: 553164800. Throughput: 0: 5976.9. Samples: 553173772. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:06:58,814][25689] Avg episode reward: [(0, '-26.314')] [2022-07-10 03:06:59,514][26022] Updated weights on worker 0-0, policy_version 540206 (0.00102) [2022-07-10 03:07:01,609][26022] Updated weights on worker 0-0, policy_version 540216 (0.00085) [2022-07-10 03:07:03,429][26022] Updated weights on worker 0-0, policy_version 540226 (0.00313) [2022-07-10 03:07:03,855][25689] Fps is (10 sec: 5387.6, 60 sec: 5682.3, 300 sec: 5654.4). Total num frames: 553192448. Throughput: 0: 5128.8. Samples: 553190860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:03,855][25689] Avg episode reward: [(0, '-28.343')] [2022-07-10 03:07:05,394][26022] Updated weights on worker 0-0, policy_version 540236 (0.00087) [2022-07-10 03:07:07,058][26022] Updated weights on worker 0-0, policy_version 540246 (0.00091) [2022-07-10 03:07:08,878][25689] Fps is (10 sec: 5596.0, 60 sec: 5685.5, 300 sec: 5658.4). Total num frames: 553221120. Throughput: 0: 5876.8. Samples: 553223226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:08,878][25689] Avg episode reward: [(0, '-27.885')] [2022-07-10 03:07:09,015][26022] Updated weights on worker 0-0, policy_version 540256 (0.00086) [2022-07-10 03:07:10,923][26022] Updated weights on worker 0-0, policy_version 540266 (0.00085) [2022-07-10 03:07:12,461][26022] Updated weights on worker 0-0, policy_version 540276 (0.00081) [2022-07-10 03:07:13,879][25689] Fps is (10 sec: 5720.3, 60 sec: 5687.7, 300 sec: 5658.9). Total num frames: 553249792. Throughput: 0: 5854.2. Samples: 553257176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:13,880][25689] Avg episode reward: [(0, '-27.466')] [2022-07-10 03:07:14,386][26022] Updated weights on worker 0-0, policy_version 540286 (0.00073) [2022-07-10 03:07:15,982][26022] Updated weights on worker 0-0, policy_version 540296 (0.00090) [2022-07-10 03:07:18,077][26022] Updated weights on worker 0-0, policy_version 540306 (0.00084) [2022-07-10 03:07:18,933][25689] Fps is (10 sec: 5702.8, 60 sec: 5667.8, 300 sec: 5659.8). Total num frames: 553278464. Throughput: 0: 5006.3. Samples: 553274496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:18,934][25689] Avg episode reward: [(0, '-28.076')] [2022-07-10 03:07:19,662][26022] Updated weights on worker 0-0, policy_version 540316 (0.00086) [2022-07-10 03:07:21,686][26022] Updated weights on worker 0-0, policy_version 540326 (0.00090) [2022-07-10 03:07:23,242][26022] Updated weights on worker 0-0, policy_version 540336 (0.00086) [2022-07-10 03:07:23,999][25689] Fps is (10 sec: 5767.9, 60 sec: 5683.0, 300 sec: 5659.9). Total num frames: 553308160. Throughput: 0: 5871.9. Samples: 553309144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:23,999][25689] Avg episode reward: [(0, '-27.342')] [2022-07-10 03:07:25,280][26022] Updated weights on worker 0-0, policy_version 540346 (0.00087) [2022-07-10 03:07:26,837][26022] Updated weights on worker 0-0, policy_version 540356 (0.00086) [2022-07-10 03:07:28,550][26022] Updated weights on worker 0-0, policy_version 540366 (0.00092) [2022-07-10 03:07:29,010][25689] Fps is (10 sec: 5792.1, 60 sec: 5699.3, 300 sec: 5661.3). Total num frames: 553336832. Throughput: 0: 5969.0. Samples: 553343396. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:29,011][25689] Avg episode reward: [(0, '-26.026')] [2022-07-10 03:07:30,554][26022] Updated weights on worker 0-0, policy_version 540376 (0.00760) [2022-07-10 03:07:32,148][26022] Updated weights on worker 0-0, policy_version 540386 (0.00085) [2022-07-10 03:07:33,965][26022] Updated weights on worker 0-0, policy_version 540396 (0.00082) [2022-07-10 03:07:34,032][25689] Fps is (10 sec: 5715.6, 60 sec: 5689.9, 300 sec: 5662.4). Total num frames: 553365504. Throughput: 0: 5135.4. Samples: 553360666. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:34,032][25689] Avg episode reward: [(0, '-25.846')] [2022-07-10 03:07:35,848][26022] Updated weights on worker 0-0, policy_version 540406 (0.00082) [2022-07-10 03:07:37,584][26022] Updated weights on worker 0-0, policy_version 540416 (0.00087) [2022-07-10 03:07:39,079][25689] Fps is (10 sec: 5491.9, 60 sec: 5675.0, 300 sec: 5648.7). Total num frames: 553392128. Throughput: 0: 5976.5. Samples: 553394896. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:39,079][25689] Avg episode reward: [(0, '-26.320')] [2022-07-10 03:07:39,604][26022] Updated weights on worker 0-0, policy_version 540426 (0.00087) [2022-07-10 03:07:41,363][26022] Updated weights on worker 0-0, policy_version 540436 (0.00253) [2022-07-10 03:07:43,067][26022] Updated weights on worker 0-0, policy_version 540446 (0.00088) [2022-07-10 03:07:44,148][25689] Fps is (10 sec: 5668.6, 60 sec: 5692.2, 300 sec: 5662.4). Total num frames: 553422848. Throughput: 0: 5940.5. Samples: 553428838. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:44,148][25689] Avg episode reward: [(0, '-27.444')] [2022-07-10 03:07:45,035][26022] Updated weights on worker 0-0, policy_version 540456 (0.00095) [2022-07-10 03:07:46,707][26022] Updated weights on worker 0-0, policy_version 540466 (0.00084) [2022-07-10 03:07:48,561][26022] Updated weights on worker 0-0, policy_version 540476 (0.00085) [2022-07-10 03:07:49,153][25689] Fps is (10 sec: 5793.7, 60 sec: 5662.8, 300 sec: 5655.5). Total num frames: 553450496. Throughput: 0: 5094.8. Samples: 553446020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:49,154][25689] Avg episode reward: [(0, '-25.712')] [2022-07-10 03:07:50,407][26022] Updated weights on worker 0-0, policy_version 540486 (0.00092) [2022-07-10 03:07:52,178][26022] Updated weights on worker 0-0, policy_version 540496 (0.00089) [2022-07-10 03:07:53,871][26022] Updated weights on worker 0-0, policy_version 540506 (0.00982) [2022-07-10 03:07:54,171][25689] Fps is (10 sec: 5618.7, 60 sec: 5646.5, 300 sec: 5651.8). Total num frames: 553479168. Throughput: 0: 5937.1. Samples: 553480238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:54,172][25689] Avg episode reward: [(0, '-26.471')] [2022-07-10 03:07:54,508][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:07:54,522][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000540510_553482240.pth [2022-07-10 03:07:54,522][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000538518_551442432.pth [2022-07-10 03:07:55,518][26022] Updated weights on worker 0-0, policy_version 540516 (0.00083) [2022-07-10 03:07:57,452][26022] Updated weights on worker 0-0, policy_version 540526 (0.00090) [2022-07-10 03:07:59,181][25689] Fps is (10 sec: 5616.4, 60 sec: 5665.6, 300 sec: 5655.8). Total num frames: 553506816. Throughput: 0: 5961.1. Samples: 553514728. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:07:59,182][25689] Avg episode reward: [(0, '-26.606')] [2022-07-10 03:07:59,311][26022] Updated weights on worker 0-0, policy_version 540536 (0.00090) [2022-07-10 03:08:00,889][26022] Updated weights on worker 0-0, policy_version 540546 (0.00093) [2022-07-10 03:08:03,289][26022] Updated weights on worker 0-0, policy_version 540556 (0.00093) [2022-07-10 03:08:04,300][25689] Fps is (10 sec: 5560.5, 60 sec: 5675.2, 300 sec: 5658.4). Total num frames: 553535488. Throughput: 0: 5115.1. Samples: 553531918. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:04,300][25689] Avg episode reward: [(0, '-26.197')] [2022-07-10 03:08:04,892][26022] Updated weights on worker 0-0, policy_version 540566 (0.00087) [2022-07-10 03:08:06,624][26022] Updated weights on worker 0-0, policy_version 540576 (0.00088) [2022-07-10 03:08:08,897][26022] Updated weights on worker 0-0, policy_version 540586 (0.00090) [2022-07-10 03:08:09,342][25689] Fps is (10 sec: 5441.8, 60 sec: 5639.5, 300 sec: 5651.4). Total num frames: 553562112. Throughput: 0: 5857.5. Samples: 553564278. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:09,343][25689] Avg episode reward: [(0, '-25.890')] [2022-07-10 03:08:10,184][26022] Updated weights on worker 0-0, policy_version 540596 (0.00090) [2022-07-10 03:08:12,314][26022] Updated weights on worker 0-0, policy_version 540606 (0.00130) [2022-07-10 03:08:13,827][26022] Updated weights on worker 0-0, policy_version 540616 (0.00086) [2022-07-10 03:08:14,381][25689] Fps is (10 sec: 5688.1, 60 sec: 5669.9, 300 sec: 5657.8). Total num frames: 553592832. Throughput: 0: 5844.4. Samples: 553598352. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:14,382][25689] Avg episode reward: [(0, '-26.751')] [2022-07-10 03:08:15,813][26022] Updated weights on worker 0-0, policy_version 540626 (0.00084) [2022-07-10 03:08:17,548][26022] Updated weights on worker 0-0, policy_version 540636 (0.00088) [2022-07-10 03:08:19,340][26022] Updated weights on worker 0-0, policy_version 540646 (0.00086) [2022-07-10 03:08:19,407][25689] Fps is (10 sec: 5901.1, 60 sec: 5672.5, 300 sec: 5654.8). Total num frames: 553621504. Throughput: 0: 4986.8. Samples: 553615590. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:19,408][25689] Avg episode reward: [(0, '-27.232')] [2022-07-10 03:08:20,936][26022] Updated weights on worker 0-0, policy_version 540656 (0.00086) [2022-07-10 03:08:22,907][26022] Updated weights on worker 0-0, policy_version 540666 (0.00087) [2022-07-10 03:08:24,406][26022] Updated weights on worker 0-0, policy_version 540676 (0.00095) [2022-07-10 03:08:24,483][25689] Fps is (10 sec: 5980.6, 60 sec: 5705.4, 300 sec: 5664.0). Total num frames: 553653248. Throughput: 0: 5865.4. Samples: 553650302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:24,484][25689] Avg episode reward: [(0, '-27.933')] [2022-07-10 03:08:26,631][26022] Updated weights on worker 0-0, policy_version 540686 (0.00086) [2022-07-10 03:08:28,358][26022] Updated weights on worker 0-0, policy_version 540697 (0.00087) [2022-07-10 03:08:29,516][25689] Fps is (10 sec: 5773.6, 60 sec: 5669.5, 300 sec: 5656.7). Total num frames: 553679872. Throughput: 0: 5971.4. Samples: 553684744. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:29,517][25689] Avg episode reward: [(0, '-27.818')] [2022-07-10 03:08:30,277][26022] Updated weights on worker 0-0, policy_version 540707 (0.00887) [2022-07-10 03:08:32,056][26022] Updated weights on worker 0-0, policy_version 540717 (0.00087) [2022-07-10 03:08:33,758][26022] Updated weights on worker 0-0, policy_version 540727 (0.00086) [2022-07-10 03:08:34,564][25689] Fps is (10 sec: 5485.3, 60 sec: 5667.0, 300 sec: 5659.9). Total num frames: 553708544. Throughput: 0: 5136.0. Samples: 553702010. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2022-07-10 03:08:34,566][25689] Avg episode reward: [(0, '-27.592')] [2022-07-10 03:08:35,407][26022] Updated weights on worker 0-0, policy_version 540737 (0.00082) [2022-07-10 03:08:37,362][26022] Updated weights on worker 0-0, policy_version 540747 (0.00092) [2022-07-10 03:08:39,095][26022] Updated weights on worker 0-0, policy_version 540757 (0.00087) [2022-07-10 03:08:39,579][25689] Fps is (10 sec: 5698.6, 60 sec: 5703.9, 300 sec: 5660.6). Total num frames: 553737216. Throughput: 0: 5997.9. Samples: 553736582. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:08:39,581][25689] Avg episode reward: [(0, '-26.795')] [2022-07-10 03:08:40,929][26022] Updated weights on worker 0-0, policy_version 540767 (0.00087) [2022-07-10 03:08:42,960][26022] Updated weights on worker 0-0, policy_version 540777 (0.00092) [2022-07-10 03:08:44,482][26022] Updated weights on worker 0-0, policy_version 540787 (0.00085) [2022-07-10 03:08:44,646][25689] Fps is (10 sec: 5789.4, 60 sec: 5687.2, 300 sec: 5667.8). Total num frames: 553766912. Throughput: 0: 5969.8. Samples: 553770670. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:08:44,646][25689] Avg episode reward: [(0, '-26.412')] [2022-07-10 03:08:46,423][26022] Updated weights on worker 0-0, policy_version 540797 (0.00087) [2022-07-10 03:08:48,091][26022] Updated weights on worker 0-0, policy_version 540807 (0.00090) [2022-07-10 03:08:49,711][25689] Fps is (10 sec: 5760.9, 60 sec: 5698.5, 300 sec: 5663.9). Total num frames: 553795584. Throughput: 0: 5102.3. Samples: 553787784. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:08:49,711][25689] Avg episode reward: [(0, '-25.298')] [2022-07-10 03:08:49,919][26022] Updated weights on worker 0-0, policy_version 540817 (0.00090) [2022-07-10 03:08:51,919][26022] Updated weights on worker 0-0, policy_version 540827 (0.00084) [2022-07-10 03:08:53,516][26022] Updated weights on worker 0-0, policy_version 540837 (0.00091) [2022-07-10 03:08:54,783][25689] Fps is (10 sec: 5556.0, 60 sec: 5676.6, 300 sec: 5659.3). Total num frames: 553823232. Throughput: 0: 5924.5. Samples: 553821796. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:08:54,783][25689] Avg episode reward: [(0, '-26.814')] [2022-07-10 03:08:55,519][26022] Updated weights on worker 0-0, policy_version 540847 (0.00094) [2022-07-10 03:08:56,975][26022] Updated weights on worker 0-0, policy_version 540857 (0.00091) [2022-07-10 03:08:58,972][26022] Updated weights on worker 0-0, policy_version 540867 (0.00087) [2022-07-10 03:08:59,798][25689] Fps is (10 sec: 5684.7, 60 sec: 5709.8, 300 sec: 5670.3). Total num frames: 553852928. Throughput: 0: 5917.6. Samples: 553856230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:08:59,799][25689] Avg episode reward: [(0, '-27.368')] [2022-07-10 03:09:00,763][26022] Updated weights on worker 0-0, policy_version 540877 (0.00095) [2022-07-10 03:09:02,895][26022] Updated weights on worker 0-0, policy_version 540887 (0.00091) [2022-07-10 03:09:04,780][26022] Updated weights on worker 0-0, policy_version 540897 (0.00086) [2022-07-10 03:09:04,921][25689] Fps is (10 sec: 5454.0, 60 sec: 5658.7, 300 sec: 5658.0). Total num frames: 553878528. Throughput: 0: 5798.6. Samples: 553888238. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:04,922][25689] Avg episode reward: [(0, '-27.333')] [2022-07-10 03:09:06,673][26022] Updated weights on worker 0-0, policy_version 540907 (0.01153) [2022-07-10 03:09:08,230][26022] Updated weights on worker 0-0, policy_version 540917 (0.00085) [2022-07-10 03:09:09,983][25689] Fps is (10 sec: 5328.8, 60 sec: 5690.7, 300 sec: 5660.5). Total num frames: 553907200. Throughput: 0: 5797.0. Samples: 553905302. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:09,984][25689] Avg episode reward: [(0, '-27.211')] [2022-07-10 03:09:10,318][26022] Updated weights on worker 0-0, policy_version 540927 (0.00091) [2022-07-10 03:09:11,786][26022] Updated weights on worker 0-0, policy_version 540937 (0.00085) [2022-07-10 03:09:13,922][26022] Updated weights on worker 0-0, policy_version 540947 (0.00086) [2022-07-10 03:09:15,021][25689] Fps is (10 sec: 5779.2, 60 sec: 5673.9, 300 sec: 5663.4). Total num frames: 553936896. Throughput: 0: 5790.7. Samples: 553938990. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:15,021][25689] Avg episode reward: [(0, '-27.280')] [2022-07-10 03:09:15,629][26022] Updated weights on worker 0-0, policy_version 540957 (0.00098) [2022-07-10 03:09:17,455][26022] Updated weights on worker 0-0, policy_version 540967 (0.00083) [2022-07-10 03:09:19,149][26022] Updated weights on worker 0-0, policy_version 540977 (0.00093) [2022-07-10 03:09:20,083][25689] Fps is (10 sec: 5677.4, 60 sec: 5653.6, 300 sec: 5660.2). Total num frames: 553964544. Throughput: 0: 5763.5. Samples: 553973144. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:20,084][25689] Avg episode reward: [(0, '-27.067')] [2022-07-10 03:09:21,080][26022] Updated weights on worker 0-0, policy_version 540987 (0.00085) [2022-07-10 03:09:22,769][26022] Updated weights on worker 0-0, policy_version 540997 (0.00083) [2022-07-10 03:09:24,636][26022] Updated weights on worker 0-0, policy_version 541007 (0.00078) [2022-07-10 03:09:25,203][25689] Fps is (10 sec: 5632.0, 60 sec: 5615.9, 300 sec: 5658.7). Total num frames: 553994240. Throughput: 0: 5042.9. Samples: 553990516. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:25,203][25689] Avg episode reward: [(0, '-27.427')] [2022-07-10 03:09:26,437][26022] Updated weights on worker 0-0, policy_version 541017 (0.00084) [2022-07-10 03:09:28,219][26022] Updated weights on worker 0-0, policy_version 541027 (0.00095) [2022-07-10 03:09:30,012][26022] Updated weights on worker 0-0, policy_version 541037 (0.00089) [2022-07-10 03:09:30,210][25689] Fps is (10 sec: 5763.6, 60 sec: 5651.9, 300 sec: 5658.8). Total num frames: 554022912. Throughput: 0: 5893.0. Samples: 554024502. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:30,211][25689] Avg episode reward: [(0, '-26.249')] [2022-07-10 03:09:31,857][26022] Updated weights on worker 0-0, policy_version 541047 (0.00083) [2022-07-10 03:09:33,584][26022] Updated weights on worker 0-0, policy_version 541057 (0.00090) [2022-07-10 03:09:35,233][25689] Fps is (10 sec: 5615.2, 60 sec: 5637.4, 300 sec: 5659.1). Total num frames: 554050560. Throughput: 0: 5937.8. Samples: 554059004. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:35,233][25689] Avg episode reward: [(0, '-27.455')] [2022-07-10 03:09:35,371][26022] Updated weights on worker 0-0, policy_version 541067 (0.00099) [2022-07-10 03:09:37,103][26022] Updated weights on worker 0-0, policy_version 541077 (0.00091) [2022-07-10 03:09:39,071][26022] Updated weights on worker 0-0, policy_version 541087 (0.00088) [2022-07-10 03:09:40,246][25689] Fps is (10 sec: 5713.8, 60 sec: 5654.4, 300 sec: 5663.2). Total num frames: 554080256. Throughput: 0: 5104.9. Samples: 554076074. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:40,247][25689] Avg episode reward: [(0, '-28.632')] [2022-07-10 03:09:40,808][26022] Updated weights on worker 0-0, policy_version 541097 (0.00090) [2022-07-10 03:09:42,544][26022] Updated weights on worker 0-0, policy_version 541107 (0.00092) [2022-07-10 03:09:44,409][26022] Updated weights on worker 0-0, policy_version 541117 (0.00081) [2022-07-10 03:09:45,294][25689] Fps is (10 sec: 5801.4, 60 sec: 5639.4, 300 sec: 5662.6). Total num frames: 554108928. Throughput: 0: 5951.6. Samples: 554110090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:45,294][25689] Avg episode reward: [(0, '-28.944')] [2022-07-10 03:09:46,194][26022] Updated weights on worker 0-0, policy_version 541127 (0.00090) [2022-07-10 03:09:47,984][26022] Updated weights on worker 0-0, policy_version 541137 (0.00088) [2022-07-10 03:09:49,899][26022] Updated weights on worker 0-0, policy_version 541147 (0.00092) [2022-07-10 03:09:50,324][25689] Fps is (10 sec: 5588.7, 60 sec: 5625.7, 300 sec: 5655.4). Total num frames: 554136576. Throughput: 0: 5967.0. Samples: 554144520. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:50,324][25689] Avg episode reward: [(0, '-28.463')] [2022-07-10 03:09:51,421][26022] Updated weights on worker 0-0, policy_version 541157 (0.00090) [2022-07-10 03:09:53,447][26022] Updated weights on worker 0-0, policy_version 541167 (0.00084) [2022-07-10 03:09:54,540][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:09:54,560][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000541175_554163200.pth [2022-07-10 03:09:54,560][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000539181_552121344.pth [2022-07-10 03:09:55,011][26022] Updated weights on worker 0-0, policy_version 541177 (0.00091) [2022-07-10 03:09:55,339][25689] Fps is (10 sec: 5708.6, 60 sec: 5664.8, 300 sec: 5658.9). Total num frames: 554166272. Throughput: 0: 5110.9. Samples: 554161764. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:09:55,339][25689] Avg episode reward: [(0, '-27.698')] [2022-07-10 03:09:57,069][26022] Updated weights on worker 0-0, policy_version 541187 (0.00084) [2022-07-10 03:09:58,815][26022] Updated weights on worker 0-0, policy_version 541197 (0.00087) [2022-07-10 03:10:00,353][25689] Fps is (10 sec: 5717.5, 60 sec: 5631.1, 300 sec: 5667.7). Total num frames: 554193920. Throughput: 0: 5974.2. Samples: 554196198. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:00,354][25689] Avg episode reward: [(0, '-27.902')] [2022-07-10 03:10:00,515][26022] Updated weights on worker 0-0, policy_version 541207 (0.00096) [2022-07-10 03:10:02,785][26022] Updated weights on worker 0-0, policy_version 541217 (0.00083) [2022-07-10 03:10:04,539][26022] Updated weights on worker 0-0, policy_version 541227 (0.00083) [2022-07-10 03:10:05,451][25689] Fps is (10 sec: 5468.0, 60 sec: 5667.2, 300 sec: 5662.5). Total num frames: 554221568. Throughput: 0: 5859.0. Samples: 554228196. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:05,452][25689] Avg episode reward: [(0, '-27.412')] [2022-07-10 03:10:06,341][26022] Updated weights on worker 0-0, policy_version 541237 (0.00082) [2022-07-10 03:10:08,131][26022] Updated weights on worker 0-0, policy_version 541247 (0.00092) [2022-07-10 03:10:09,817][26022] Updated weights on worker 0-0, policy_version 541257 (0.00094) [2022-07-10 03:10:10,468][25689] Fps is (10 sec: 5568.4, 60 sec: 5671.5, 300 sec: 5665.7). Total num frames: 554250240. Throughput: 0: 5011.7. Samples: 554245474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:10,468][25689] Avg episode reward: [(0, '-27.370')] [2022-07-10 03:10:11,593][26022] Updated weights on worker 0-0, policy_version 541267 (0.00088) [2022-07-10 03:10:13,512][26022] Updated weights on worker 0-0, policy_version 541277 (0.00090) [2022-07-10 03:10:15,220][26022] Updated weights on worker 0-0, policy_version 541287 (0.00079) [2022-07-10 03:10:15,474][25689] Fps is (10 sec: 5721.5, 60 sec: 5657.5, 300 sec: 5659.9). Total num frames: 554278912. Throughput: 0: 5853.2. Samples: 554279620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:15,475][25689] Avg episode reward: [(0, '-27.814')] [2022-07-10 03:10:16,969][26022] Updated weights on worker 0-0, policy_version 541297 (0.00089) [2022-07-10 03:10:18,916][26022] Updated weights on worker 0-0, policy_version 541307 (0.00085) [2022-07-10 03:10:20,487][25689] Fps is (10 sec: 5723.3, 60 sec: 5679.1, 300 sec: 5665.0). Total num frames: 554307584. Throughput: 0: 5862.3. Samples: 554314226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:20,488][25689] Avg episode reward: [(0, '-28.781')] [2022-07-10 03:10:20,519][26022] Updated weights on worker 0-0, policy_version 541317 (0.00089) [2022-07-10 03:10:22,423][26022] Updated weights on worker 0-0, policy_version 541327 (0.00085) [2022-07-10 03:10:24,091][26022] Updated weights on worker 0-0, policy_version 541337 (0.00089) [2022-07-10 03:10:25,573][25689] Fps is (10 sec: 5678.4, 60 sec: 5665.3, 300 sec: 5667.1). Total num frames: 554336256. Throughput: 0: 5139.2. Samples: 554331602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:25,574][25689] Avg episode reward: [(0, '-29.536')] [2022-07-10 03:10:25,901][26022] Updated weights on worker 0-0, policy_version 541347 (0.00094) [2022-07-10 03:10:27,868][26022] Updated weights on worker 0-0, policy_version 541357 (0.00093) [2022-07-10 03:10:29,418][26022] Updated weights on worker 0-0, policy_version 541367 (0.00091) [2022-07-10 03:10:30,599][25689] Fps is (10 sec: 5671.0, 60 sec: 5663.6, 300 sec: 5664.4). Total num frames: 554364928. Throughput: 0: 5978.6. Samples: 554365830. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:30,599][25689] Avg episode reward: [(0, '-28.940')] [2022-07-10 03:10:31,551][26022] Updated weights on worker 0-0, policy_version 541377 (0.00083) [2022-07-10 03:10:33,140][26022] Updated weights on worker 0-0, policy_version 541387 (0.00088) [2022-07-10 03:10:34,966][26022] Updated weights on worker 0-0, policy_version 541397 (0.00092) [2022-07-10 03:10:35,628][25689] Fps is (10 sec: 5804.7, 60 sec: 5696.9, 300 sec: 5671.7). Total num frames: 554394624. Throughput: 0: 5978.1. Samples: 554400102. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:35,630][25689] Avg episode reward: [(0, '-27.888')] [2022-07-10 03:10:36,828][26022] Updated weights on worker 0-0, policy_version 541407 (0.00090) [2022-07-10 03:10:38,547][26022] Updated weights on worker 0-0, policy_version 541417 (0.00090) [2022-07-10 03:10:40,390][26022] Updated weights on worker 0-0, policy_version 541427 (0.00102) [2022-07-10 03:10:40,636][25689] Fps is (10 sec: 5713.0, 60 sec: 5663.5, 300 sec: 5665.9). Total num frames: 554422272. Throughput: 0: 5106.0. Samples: 554417108. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:40,637][25689] Avg episode reward: [(0, '-26.592')] [2022-07-10 03:10:42,157][26022] Updated weights on worker 0-0, policy_version 541437 (0.00086) [2022-07-10 03:10:43,939][26022] Updated weights on worker 0-0, policy_version 541447 (0.00083) [2022-07-10 03:10:45,758][25689] Fps is (10 sec: 5458.6, 60 sec: 5639.5, 300 sec: 5663.7). Total num frames: 554449920. Throughput: 0: 5924.6. Samples: 554451192. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:45,759][25689] Avg episode reward: [(0, '-26.176')] [2022-07-10 03:10:45,992][26022] Updated weights on worker 0-0, policy_version 541457 (0.00092) [2022-07-10 03:10:47,468][26022] Updated weights on worker 0-0, policy_version 541467 (0.00085) [2022-07-10 03:10:49,727][26022] Updated weights on worker 0-0, policy_version 541477 (0.00090) [2022-07-10 03:10:50,797][25689] Fps is (10 sec: 5844.9, 60 sec: 5706.4, 300 sec: 5670.4). Total num frames: 554481664. Throughput: 0: 5916.6. Samples: 554485338. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:50,798][25689] Avg episode reward: [(0, '-25.826')] [2022-07-10 03:10:50,970][26022] Updated weights on worker 0-0, policy_version 541487 (0.00096) [2022-07-10 03:10:53,064][26022] Updated weights on worker 0-0, policy_version 541497 (0.00083) [2022-07-10 03:10:54,795][26022] Updated weights on worker 0-0, policy_version 541507 (0.00082) [2022-07-10 03:10:55,840][25689] Fps is (10 sec: 5789.2, 60 sec: 5653.0, 300 sec: 5667.0). Total num frames: 554508288. Throughput: 0: 5066.3. Samples: 554502504. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:10:55,841][25689] Avg episode reward: [(0, '-26.243')] [2022-07-10 03:10:56,519][26022] Updated weights on worker 0-0, policy_version 541517 (0.00086) [2022-07-10 03:10:58,587][26022] Updated weights on worker 0-0, policy_version 541527 (0.00086) [2022-07-10 03:11:00,328][26022] Updated weights on worker 0-0, policy_version 541537 (0.00090) [2022-07-10 03:11:00,847][25689] Fps is (10 sec: 5502.5, 60 sec: 5670.7, 300 sec: 5675.1). Total num frames: 554536960. Throughput: 0: 5916.5. Samples: 554536684. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:00,847][25689] Avg episode reward: [(0, '-26.479')] [2022-07-10 03:11:02,263][26022] Updated weights on worker 0-0, policy_version 541547 (0.00084) [2022-07-10 03:11:04,446][26022] Updated weights on worker 0-0, policy_version 541557 (0.00087) [2022-07-10 03:11:05,860][26022] Updated weights on worker 0-0, policy_version 541567 (0.00091) [2022-07-10 03:11:05,947][25689] Fps is (10 sec: 5673.8, 60 sec: 5687.4, 300 sec: 5674.2). Total num frames: 554565632. Throughput: 0: 5813.7. Samples: 554568564. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:05,948][25689] Avg episode reward: [(0, '-27.381')] [2022-07-10 03:11:07,932][26022] Updated weights on worker 0-0, policy_version 541577 (0.00092) [2022-07-10 03:11:09,601][26022] Updated weights on worker 0-0, policy_version 541587 (0.00084) [2022-07-10 03:11:10,985][25689] Fps is (10 sec: 5353.2, 60 sec: 5634.6, 300 sec: 5663.7). Total num frames: 554591232. Throughput: 0: 4965.2. Samples: 554585574. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:10,985][25689] Avg episode reward: [(0, '-27.354')] [2022-07-10 03:11:11,386][26022] Updated weights on worker 0-0, policy_version 541597 (0.00085) [2022-07-10 03:11:13,332][26022] Updated weights on worker 0-0, policy_version 541607 (0.00086) [2022-07-10 03:11:14,759][26022] Updated weights on worker 0-0, policy_version 541617 (0.00092) [2022-07-10 03:11:15,990][25689] Fps is (10 sec: 5404.1, 60 sec: 5634.8, 300 sec: 5660.6). Total num frames: 554619904. Throughput: 0: 5822.4. Samples: 554619822. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:15,990][25689] Avg episode reward: [(0, '-26.787')] [2022-07-10 03:11:16,724][26022] Updated weights on worker 0-0, policy_version 541627 (0.00096) [2022-07-10 03:11:18,517][26022] Updated weights on worker 0-0, policy_version 541637 (0.00089) [2022-07-10 03:11:20,324][26022] Updated weights on worker 0-0, policy_version 541647 (0.00086) [2022-07-10 03:11:21,035][25689] Fps is (10 sec: 5909.5, 60 sec: 5665.6, 300 sec: 5667.5). Total num frames: 554650624. Throughput: 0: 5809.5. Samples: 554653970. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:21,036][25689] Avg episode reward: [(0, '-26.414')] [2022-07-10 03:11:22,156][26022] Updated weights on worker 0-0, policy_version 541657 (0.00081) [2022-07-10 03:11:23,909][26022] Updated weights on worker 0-0, policy_version 541667 (0.00091) [2022-07-10 03:11:25,684][26022] Updated weights on worker 0-0, policy_version 541677 (0.00083) [2022-07-10 03:11:26,072][25689] Fps is (10 sec: 5789.0, 60 sec: 5653.2, 300 sec: 5666.9). Total num frames: 554678272. Throughput: 0: 5100.3. Samples: 554671210. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:26,073][25689] Avg episode reward: [(0, '-26.240')] [2022-07-10 03:11:27,526][26022] Updated weights on worker 0-0, policy_version 541687 (0.00086) [2022-07-10 03:11:29,518][26022] Updated weights on worker 0-0, policy_version 541697 (0.01171) [2022-07-10 03:11:31,096][25689] Fps is (10 sec: 5598.0, 60 sec: 5653.4, 300 sec: 5664.9). Total num frames: 554706944. Throughput: 0: 5962.8. Samples: 554705492. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 03:11:31,097][25689] Avg episode reward: [(0, '-27.319')] [2022-07-10 03:11:31,104][26022] Updated weights on worker 0-0, policy_version 541707 (0.00090) [2022-07-10 03:11:32,891][26022] Updated weights on worker 0-0, policy_version 541717 (0.00097) [2022-07-10 03:11:34,680][26022] Updated weights on worker 0-0, policy_version 541727 (0.00088) [2022-07-10 03:11:36,123][25689] Fps is (10 sec: 5807.6, 60 sec: 5653.6, 300 sec: 5672.6). Total num frames: 554736640. Throughput: 0: 5971.9. Samples: 554740054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:11:36,123][25689] Avg episode reward: [(0, '-27.855')] [2022-07-10 03:11:36,487][26022] Updated weights on worker 0-0, policy_version 541737 (0.00090) [2022-07-10 03:11:38,359][26022] Updated weights on worker 0-0, policy_version 541747 (0.00105) [2022-07-10 03:11:40,037][26022] Updated weights on worker 0-0, policy_version 541757 (0.00093) [2022-07-10 03:11:41,134][25689] Fps is (10 sec: 5712.5, 60 sec: 5653.3, 300 sec: 5666.9). Total num frames: 554764288. Throughput: 0: 5999.5. Samples: 554774554. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:11:41,135][25689] Avg episode reward: [(0, '-28.469')] [2022-07-10 03:11:41,905][26022] Updated weights on worker 0-0, policy_version 541767 (0.00093) [2022-07-10 03:11:43,806][26022] Updated weights on worker 0-0, policy_version 541777 (0.00092) [2022-07-10 03:11:45,399][26022] Updated weights on worker 0-0, policy_version 541787 (0.00092) [2022-07-10 03:11:46,218][25689] Fps is (10 sec: 5680.5, 60 sec: 5690.8, 300 sec: 5666.3). Total num frames: 554793984. Throughput: 0: 5979.3. Samples: 554791664. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:11:46,218][25689] Avg episode reward: [(0, '-28.724')] [2022-07-10 03:11:47,238][26022] Updated weights on worker 0-0, policy_version 541797 (0.00085) [2022-07-10 03:11:49,088][26022] Updated weights on worker 0-0, policy_version 541807 (0.00092) [2022-07-10 03:11:50,819][26022] Updated weights on worker 0-0, policy_version 541817 (0.00085) [2022-07-10 03:11:51,230][25689] Fps is (10 sec: 5882.7, 60 sec: 5659.4, 300 sec: 5666.5). Total num frames: 554823680. Throughput: 0: 5980.5. Samples: 554825906. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:11:51,231][25689] Avg episode reward: [(0, '-28.907')] [2022-07-10 03:11:52,863][26022] Updated weights on worker 0-0, policy_version 541827 (0.00098) [2022-07-10 03:11:54,283][26022] Updated weights on worker 0-0, policy_version 541837 (0.00089) [2022-07-10 03:11:54,722][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:11:54,739][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000541840_554844160.pth [2022-07-10 03:11:54,740][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000539845_552801280.pth [2022-07-10 03:11:56,286][25689] Fps is (10 sec: 5593.7, 60 sec: 5658.2, 300 sec: 5666.1). Total num frames: 554850304. Throughput: 0: 5960.9. Samples: 554860246. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:11:56,287][25689] Avg episode reward: [(0, '-28.014')] [2022-07-10 03:11:56,452][26022] Updated weights on worker 0-0, policy_version 541847 (0.00092) [2022-07-10 03:11:57,940][26022] Updated weights on worker 0-0, policy_version 541857 (0.00089) [2022-07-10 03:11:59,928][26022] Updated weights on worker 0-0, policy_version 541867 (0.00093) [2022-07-10 03:12:01,306][25689] Fps is (10 sec: 5488.1, 60 sec: 5656.9, 300 sec: 5669.9). Total num frames: 554878976. Throughput: 0: 5093.1. Samples: 554877290. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:01,307][25689] Avg episode reward: [(0, '-27.895')] [2022-07-10 03:12:01,637][26022] Updated weights on worker 0-0, policy_version 541877 (0.00092) [2022-07-10 03:12:03,849][26022] Updated weights on worker 0-0, policy_version 541887 (0.00081) [2022-07-10 03:12:05,502][26022] Updated weights on worker 0-0, policy_version 541897 (0.00085) [2022-07-10 03:12:06,387][25689] Fps is (10 sec: 5575.8, 60 sec: 5641.8, 300 sec: 5665.4). Total num frames: 554906624. Throughput: 0: 5834.8. Samples: 554909348. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:06,388][25689] Avg episode reward: [(0, '-27.781')] [2022-07-10 03:12:07,748][26022] Updated weights on worker 0-0, policy_version 541907 (0.00089) [2022-07-10 03:12:08,908][26022] Updated weights on worker 0-0, policy_version 541917 (0.00092) [2022-07-10 03:12:11,174][26022] Updated weights on worker 0-0, policy_version 541927 (0.00086) [2022-07-10 03:12:11,407][25689] Fps is (10 sec: 5575.6, 60 sec: 5694.3, 300 sec: 5665.0). Total num frames: 554935296. Throughput: 0: 5825.4. Samples: 554943444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:11,409][25689] Avg episode reward: [(0, '-26.748')] [2022-07-10 03:12:12,736][26022] Updated weights on worker 0-0, policy_version 541937 (0.00087) [2022-07-10 03:12:14,547][26022] Updated weights on worker 0-0, policy_version 541947 (0.00084) [2022-07-10 03:12:16,415][25689] Fps is (10 sec: 5514.3, 60 sec: 5660.2, 300 sec: 5659.0). Total num frames: 554961920. Throughput: 0: 4989.9. Samples: 554960686. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:16,427][25689] Avg episode reward: [(0, '-26.592')] [2022-07-10 03:12:16,518][26022] Updated weights on worker 0-0, policy_version 541957 (0.00091) [2022-07-10 03:12:18,104][26022] Updated weights on worker 0-0, policy_version 541967 (0.00053) [2022-07-10 03:12:20,138][26022] Updated weights on worker 0-0, policy_version 541977 (0.00091) [2022-07-10 03:12:21,438][25689] Fps is (10 sec: 5716.7, 60 sec: 5662.2, 300 sec: 5663.3). Total num frames: 554992640. Throughput: 0: 5830.2. Samples: 554994664. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:21,439][25689] Avg episode reward: [(0, '-27.609')] [2022-07-10 03:12:21,903][26022] Updated weights on worker 0-0, policy_version 541987 (0.00089) [2022-07-10 03:12:23,666][26022] Updated weights on worker 0-0, policy_version 541997 (0.00093) [2022-07-10 03:12:25,479][26022] Updated weights on worker 0-0, policy_version 542007 (0.00103) [2022-07-10 03:12:26,537][25689] Fps is (10 sec: 5867.5, 60 sec: 5673.4, 300 sec: 5661.6). Total num frames: 555021312. Throughput: 0: 5925.4. Samples: 555028744. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:26,540][25689] Avg episode reward: [(0, '-29.069')] [2022-07-10 03:12:27,403][26022] Updated weights on worker 0-0, policy_version 542017 (0.00085) [2022-07-10 03:12:29,051][26022] Updated weights on worker 0-0, policy_version 542027 (0.00614) [2022-07-10 03:12:30,876][26022] Updated weights on worker 0-0, policy_version 542037 (0.00081) [2022-07-10 03:12:31,571][25689] Fps is (10 sec: 5558.5, 60 sec: 5655.5, 300 sec: 5657.9). Total num frames: 555048960. Throughput: 0: 5089.5. Samples: 555046066. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:31,571][25689] Avg episode reward: [(0, '-27.576')] [2022-07-10 03:12:32,565][26022] Updated weights on worker 0-0, policy_version 542047 (0.00087) [2022-07-10 03:12:34,333][26022] Updated weights on worker 0-0, policy_version 542057 (0.00087) [2022-07-10 03:12:36,210][26022] Updated weights on worker 0-0, policy_version 542067 (0.00086) [2022-07-10 03:12:36,592][25689] Fps is (10 sec: 5702.7, 60 sec: 5656.0, 300 sec: 5668.7). Total num frames: 555078656. Throughput: 0: 5949.0. Samples: 555080724. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:36,593][25689] Avg episode reward: [(0, '-27.701')] [2022-07-10 03:12:38,049][26022] Updated weights on worker 0-0, policy_version 542077 (0.00081) [2022-07-10 03:12:39,816][26022] Updated weights on worker 0-0, policy_version 542087 (0.00093) [2022-07-10 03:12:41,513][26022] Updated weights on worker 0-0, policy_version 542097 (0.00087) [2022-07-10 03:12:41,609][25689] Fps is (10 sec: 5814.7, 60 sec: 5672.5, 300 sec: 5662.9). Total num frames: 555107328. Throughput: 0: 5959.0. Samples: 555114858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:41,609][25689] Avg episode reward: [(0, '-28.341')] [2022-07-10 03:12:43,484][26022] Updated weights on worker 0-0, policy_version 542107 (0.00104) [2022-07-10 03:12:45,174][26022] Updated weights on worker 0-0, policy_version 542117 (0.00093) [2022-07-10 03:12:46,713][25689] Fps is (10 sec: 5564.8, 60 sec: 5636.6, 300 sec: 5661.0). Total num frames: 555134976. Throughput: 0: 5112.2. Samples: 555131890. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:46,714][25689] Avg episode reward: [(0, '-27.125')] [2022-07-10 03:12:47,087][26022] Updated weights on worker 0-0, policy_version 542127 (0.00084) [2022-07-10 03:12:48,897][26022] Updated weights on worker 0-0, policy_version 542137 (0.00099) [2022-07-10 03:12:50,703][26022] Updated weights on worker 0-0, policy_version 542147 (0.00079) [2022-07-10 03:12:51,747][25689] Fps is (10 sec: 5757.2, 60 sec: 5651.6, 300 sec: 5667.6). Total num frames: 555165696. Throughput: 0: 5951.7. Samples: 555166150. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:51,747][25689] Avg episode reward: [(0, '-26.543')] [2022-07-10 03:12:52,322][26022] Updated weights on worker 0-0, policy_version 542157 (0.00091) [2022-07-10 03:12:54,351][26022] Updated weights on worker 0-0, policy_version 542167 (0.00088) [2022-07-10 03:12:55,875][26022] Updated weights on worker 0-0, policy_version 542177 (0.00085) [2022-07-10 03:12:56,760][25689] Fps is (10 sec: 5707.9, 60 sec: 5655.6, 300 sec: 5664.1). Total num frames: 555192320. Throughput: 0: 5921.9. Samples: 555200152. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:12:56,760][25689] Avg episode reward: [(0, '-26.000')] [2022-07-10 03:12:57,958][26022] Updated weights on worker 0-0, policy_version 542187 (0.00094) [2022-07-10 03:12:59,638][26022] Updated weights on worker 0-0, policy_version 542197 (0.00093) [2022-07-10 03:13:01,522][26022] Updated weights on worker 0-0, policy_version 542207 (0.00086) [2022-07-10 03:13:01,775][25689] Fps is (10 sec: 5616.4, 60 sec: 5673.0, 300 sec: 5669.5). Total num frames: 555222016. Throughput: 0: 5073.5. Samples: 555217170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:01,775][25689] Avg episode reward: [(0, '-26.507')] [2022-07-10 03:13:03,655][26022] Updated weights on worker 0-0, policy_version 542217 (0.00087) [2022-07-10 03:13:05,432][26022] Updated weights on worker 0-0, policy_version 542227 (0.00091) [2022-07-10 03:13:06,832][25689] Fps is (10 sec: 5591.3, 60 sec: 5658.2, 300 sec: 5669.2). Total num frames: 555248640. Throughput: 0: 5831.2. Samples: 555249210. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:06,833][25689] Avg episode reward: [(0, '-26.689')] [2022-07-10 03:13:07,142][26022] Updated weights on worker 0-0, policy_version 542237 (0.00089) [2022-07-10 03:13:09,160][26022] Updated weights on worker 0-0, policy_version 542247 (0.00098) [2022-07-10 03:13:10,782][26022] Updated weights on worker 0-0, policy_version 542257 (0.00087) [2022-07-10 03:13:11,898][25689] Fps is (10 sec: 5361.2, 60 sec: 5637.1, 300 sec: 5658.4). Total num frames: 555276288. Throughput: 0: 5816.4. Samples: 555283356. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:11,898][25689] Avg episode reward: [(0, '-27.384')] [2022-07-10 03:13:12,810][26022] Updated weights on worker 0-0, policy_version 542267 (0.00087) [2022-07-10 03:13:14,270][26022] Updated weights on worker 0-0, policy_version 542277 (0.00085) [2022-07-10 03:13:16,288][26022] Updated weights on worker 0-0, policy_version 542287 (0.00091) [2022-07-10 03:13:16,938][25689] Fps is (10 sec: 5674.5, 60 sec: 5684.8, 300 sec: 5661.5). Total num frames: 555305984. Throughput: 0: 4972.3. Samples: 555300484. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:16,938][25689] Avg episode reward: [(0, '-27.687')] [2022-07-10 03:13:17,951][26022] Updated weights on worker 0-0, policy_version 542297 (0.00088) [2022-07-10 03:13:19,873][26022] Updated weights on worker 0-0, policy_version 542307 (0.00099) [2022-07-10 03:13:21,444][26022] Updated weights on worker 0-0, policy_version 542317 (0.00089) [2022-07-10 03:13:21,990][25689] Fps is (10 sec: 5682.3, 60 sec: 5631.5, 300 sec: 5648.2). Total num frames: 555333632. Throughput: 0: 5803.9. Samples: 555334494. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:21,991][25689] Avg episode reward: [(0, '-27.719')] [2022-07-10 03:13:23,671][26022] Updated weights on worker 0-0, policy_version 542327 (0.00091) [2022-07-10 03:13:25,041][26022] Updated weights on worker 0-0, policy_version 542337 (0.00089) [2022-07-10 03:13:27,095][25689] Fps is (10 sec: 5545.0, 60 sec: 5630.8, 300 sec: 5653.7). Total num frames: 555362304. Throughput: 0: 5904.6. Samples: 555368852. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:27,096][25689] Avg episode reward: [(0, '-28.277')] [2022-07-10 03:13:27,127][26022] Updated weights on worker 0-0, policy_version 542347 (0.00094) [2022-07-10 03:13:28,663][26022] Updated weights on worker 0-0, policy_version 542357 (0.00086) [2022-07-10 03:13:30,699][26022] Updated weights on worker 0-0, policy_version 542367 (0.00088) [2022-07-10 03:13:32,175][25689] Fps is (10 sec: 5730.7, 60 sec: 5660.3, 300 sec: 5656.6). Total num frames: 555392000. Throughput: 0: 5058.0. Samples: 555385916. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:32,175][25689] Avg episode reward: [(0, '-28.694')] [2022-07-10 03:13:32,635][26022] Updated weights on worker 0-0, policy_version 542377 (0.00093) [2022-07-10 03:13:34,174][26022] Updated weights on worker 0-0, policy_version 542387 (0.00086) [2022-07-10 03:13:36,230][26022] Updated weights on worker 0-0, policy_version 542397 (0.00097) [2022-07-10 03:13:37,227][25689] Fps is (10 sec: 5861.9, 60 sec: 5657.5, 300 sec: 5659.3). Total num frames: 555421696. Throughput: 0: 5898.4. Samples: 555420154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:37,227][25689] Avg episode reward: [(0, '-28.122')] [2022-07-10 03:13:37,727][26022] Updated weights on worker 0-0, policy_version 542407 (0.00084) [2022-07-10 03:13:39,591][26022] Updated weights on worker 0-0, policy_version 542417 (0.00093) [2022-07-10 03:13:41,549][26022] Updated weights on worker 0-0, policy_version 542427 (0.00085) [2022-07-10 03:13:42,262][25689] Fps is (10 sec: 5786.2, 60 sec: 5655.7, 300 sec: 5656.5). Total num frames: 555450368. Throughput: 0: 5906.5. Samples: 555454234. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:42,263][25689] Avg episode reward: [(0, '-26.595')] [2022-07-10 03:13:43,178][26022] Updated weights on worker 0-0, policy_version 542437 (0.00083) [2022-07-10 03:13:45,054][26022] Updated weights on worker 0-0, policy_version 542447 (0.00090) [2022-07-10 03:13:47,084][26022] Updated weights on worker 0-0, policy_version 542457 (0.00098) [2022-07-10 03:13:47,395][25689] Fps is (10 sec: 5438.2, 60 sec: 5636.2, 300 sec: 5648.3). Total num frames: 555476992. Throughput: 0: 5054.3. Samples: 555471454. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:47,395][25689] Avg episode reward: [(0, '-26.946')] [2022-07-10 03:13:48,711][26022] Updated weights on worker 0-0, policy_version 542467 (0.00081) [2022-07-10 03:13:50,630][26022] Updated weights on worker 0-0, policy_version 542477 (0.00099) [2022-07-10 03:13:52,285][26022] Updated weights on worker 0-0, policy_version 542487 (0.00088) [2022-07-10 03:13:52,495][25689] Fps is (10 sec: 5604.1, 60 sec: 5630.1, 300 sec: 5658.1). Total num frames: 555507712. Throughput: 0: 5896.9. Samples: 555505740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:52,495][25689] Avg episode reward: [(0, '-26.618')] [2022-07-10 03:13:54,103][26022] Updated weights on worker 0-0, policy_version 542497 (0.00094) [2022-07-10 03:13:54,783][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:13:54,792][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000542501_555521024.pth [2022-07-10 03:13:54,793][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000540510_553482240.pth [2022-07-10 03:13:56,032][26022] Updated weights on worker 0-0, policy_version 542507 (0.00094) [2022-07-10 03:13:57,544][25689] Fps is (10 sec: 5952.7, 60 sec: 5677.2, 300 sec: 5657.4). Total num frames: 555537408. Throughput: 0: 5867.5. Samples: 555539366. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:13:57,545][25689] Avg episode reward: [(0, '-25.696')] [2022-07-10 03:13:57,551][26022] Updated weights on worker 0-0, policy_version 542517 (0.00088) [2022-07-10 03:13:59,674][26022] Updated weights on worker 0-0, policy_version 542527 (0.00090) [2022-07-10 03:14:01,325][26022] Updated weights on worker 0-0, policy_version 542537 (0.00088) [2022-07-10 03:14:02,577][25689] Fps is (10 sec: 5484.7, 60 sec: 5608.3, 300 sec: 5659.2). Total num frames: 555563008. Throughput: 0: 5767.5. Samples: 555571398. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:02,577][25689] Avg episode reward: [(0, '-26.394')] [2022-07-10 03:14:03,622][26022] Updated weights on worker 0-0, policy_version 542547 (0.00084) [2022-07-10 03:14:05,377][26022] Updated weights on worker 0-0, policy_version 542557 (0.00090) [2022-07-10 03:14:06,951][26022] Updated weights on worker 0-0, policy_version 542567 (0.00094) [2022-07-10 03:14:07,643][25689] Fps is (10 sec: 5374.1, 60 sec: 5641.1, 300 sec: 5659.1). Total num frames: 555591680. Throughput: 0: 5778.8. Samples: 555588464. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:07,643][25689] Avg episode reward: [(0, '-26.110')] [2022-07-10 03:14:08,916][26022] Updated weights on worker 0-0, policy_version 542577 (0.00083) [2022-07-10 03:14:10,606][26022] Updated weights on worker 0-0, policy_version 542587 (0.00087) [2022-07-10 03:14:12,450][26022] Updated weights on worker 0-0, policy_version 542597 (0.00090) [2022-07-10 03:14:12,647][25689] Fps is (10 sec: 5694.3, 60 sec: 5663.7, 300 sec: 5656.3). Total num frames: 555620352. Throughput: 0: 5813.9. Samples: 555622904. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:12,648][25689] Avg episode reward: [(0, '-25.890')] [2022-07-10 03:14:14,354][26022] Updated weights on worker 0-0, policy_version 542607 (0.00093) [2022-07-10 03:14:15,967][26022] Updated weights on worker 0-0, policy_version 542617 (0.00087) [2022-07-10 03:14:17,654][25689] Fps is (10 sec: 5625.7, 60 sec: 5633.0, 300 sec: 5657.3). Total num frames: 555648000. Throughput: 0: 5838.6. Samples: 555656780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:17,655][25689] Avg episode reward: [(0, '-26.585')] [2022-07-10 03:14:17,834][26022] Updated weights on worker 0-0, policy_version 542627 (0.00087) [2022-07-10 03:14:19,694][26022] Updated weights on worker 0-0, policy_version 542637 (0.00078) [2022-07-10 03:14:21,522][26022] Updated weights on worker 0-0, policy_version 542647 (0.00086) [2022-07-10 03:14:22,703][25689] Fps is (10 sec: 5600.6, 60 sec: 5650.1, 300 sec: 5655.2). Total num frames: 555676672. Throughput: 0: 5103.8. Samples: 555674120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:22,704][25689] Avg episode reward: [(0, '-25.849')] [2022-07-10 03:14:23,219][26022] Updated weights on worker 0-0, policy_version 542657 (0.00095) [2022-07-10 03:14:24,963][26022] Updated weights on worker 0-0, policy_version 542667 (0.00090) [2022-07-10 03:14:26,840][26022] Updated weights on worker 0-0, policy_version 542677 (0.00084) [2022-07-10 03:14:27,753][25689] Fps is (10 sec: 5779.5, 60 sec: 5672.2, 300 sec: 5657.9). Total num frames: 555706368. Throughput: 0: 5946.3. Samples: 555708046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:27,754][25689] Avg episode reward: [(0, '-26.379')] [2022-07-10 03:14:28,666][26022] Updated weights on worker 0-0, policy_version 542687 (0.00082) [2022-07-10 03:14:30,503][26022] Updated weights on worker 0-0, policy_version 542697 (0.00087) [2022-07-10 03:14:32,265][26022] Updated weights on worker 0-0, policy_version 542707 (0.00091) [2022-07-10 03:14:32,757][25689] Fps is (10 sec: 5805.4, 60 sec: 5662.4, 300 sec: 5661.6). Total num frames: 555735040. Throughput: 0: 5935.1. Samples: 555742260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 03:14:32,758][25689] Avg episode reward: [(0, '-26.236')] [2022-07-10 03:14:34,195][26022] Updated weights on worker 0-0, policy_version 542717 (0.00090) [2022-07-10 03:14:35,788][26022] Updated weights on worker 0-0, policy_version 542727 (0.00055) [2022-07-10 03:14:37,763][25689] Fps is (10 sec: 5524.4, 60 sec: 5616.0, 300 sec: 5651.5). Total num frames: 555761664. Throughput: 0: 5103.4. Samples: 555759402. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:14:37,763][25689] Avg episode reward: [(0, '-27.098')] [2022-07-10 03:14:37,899][26022] Updated weights on worker 0-0, policy_version 542737 (0.00088) [2022-07-10 03:14:39,336][26022] Updated weights on worker 0-0, policy_version 542747 (0.00090) [2022-07-10 03:14:41,412][26022] Updated weights on worker 0-0, policy_version 542757 (0.00085) [2022-07-10 03:14:42,765][25689] Fps is (10 sec: 5729.7, 60 sec: 5652.9, 300 sec: 5659.2). Total num frames: 555792384. Throughput: 0: 5944.4. Samples: 555793378. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:14:42,766][25689] Avg episode reward: [(0, '-25.669')] [2022-07-10 03:14:42,832][26022] Updated weights on worker 0-0, policy_version 542767 (0.00086) [2022-07-10 03:14:44,999][26022] Updated weights on worker 0-0, policy_version 542777 (0.00094) [2022-07-10 03:14:46,727][26022] Updated weights on worker 0-0, policy_version 542787 (0.00086) [2022-07-10 03:14:47,807][25689] Fps is (10 sec: 5811.1, 60 sec: 5678.3, 300 sec: 5659.0). Total num frames: 555820032. Throughput: 0: 5974.9. Samples: 555827864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:14:47,807][25689] Avg episode reward: [(0, '-25.285')] [2022-07-10 03:14:48,385][26022] Updated weights on worker 0-0, policy_version 542797 (0.00095) [2022-07-10 03:14:50,274][26022] Updated weights on worker 0-0, policy_version 542807 (0.00086) [2022-07-10 03:14:52,053][26022] Updated weights on worker 0-0, policy_version 542817 (0.00082) [2022-07-10 03:14:52,818][25689] Fps is (10 sec: 5602.8, 60 sec: 5652.8, 300 sec: 5655.6). Total num frames: 555848704. Throughput: 0: 5109.6. Samples: 555844760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:14:52,818][25689] Avg episode reward: [(0, '-25.144')] [2022-07-10 03:14:53,858][26022] Updated weights on worker 0-0, policy_version 542827 (0.00089) [2022-07-10 03:14:55,719][26022] Updated weights on worker 0-0, policy_version 542837 (0.00090) [2022-07-10 03:14:57,523][26022] Updated weights on worker 0-0, policy_version 542847 (0.00094) [2022-07-10 03:14:57,830][25689] Fps is (10 sec: 5619.1, 60 sec: 5622.3, 300 sec: 5655.7). Total num frames: 555876352. Throughput: 0: 5955.9. Samples: 555878920. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:14:57,830][25689] Avg episode reward: [(0, '-25.924')] [2022-07-10 03:14:59,140][26022] Updated weights on worker 0-0, policy_version 542857 (0.00084) [2022-07-10 03:15:01,194][26022] Updated weights on worker 0-0, policy_version 542867 (0.00090) [2022-07-10 03:15:02,845][25689] Fps is (10 sec: 5514.2, 60 sec: 5657.8, 300 sec: 5657.3). Total num frames: 555904000. Throughput: 0: 5859.6. Samples: 555911038. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:02,846][25689] Avg episode reward: [(0, '-26.210')] [2022-07-10 03:15:03,223][26022] Updated weights on worker 0-0, policy_version 542877 (0.00090) [2022-07-10 03:15:05,371][26022] Updated weights on worker 0-0, policy_version 542887 (0.00085) [2022-07-10 03:15:06,838][26022] Updated weights on worker 0-0, policy_version 542897 (0.00635) [2022-07-10 03:15:07,953][25689] Fps is (10 sec: 5462.3, 60 sec: 5637.0, 300 sec: 5652.1). Total num frames: 555931648. Throughput: 0: 4974.1. Samples: 555928072. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:07,953][25689] Avg episode reward: [(0, '-26.629')] [2022-07-10 03:15:08,794][26022] Updated weights on worker 0-0, policy_version 542907 (0.00092) [2022-07-10 03:15:10,555][26022] Updated weights on worker 0-0, policy_version 542917 (0.00096) [2022-07-10 03:15:12,283][26022] Updated weights on worker 0-0, policy_version 542927 (0.00082) [2022-07-10 03:15:12,964][25689] Fps is (10 sec: 5667.2, 60 sec: 5653.3, 300 sec: 5655.4). Total num frames: 555961344. Throughput: 0: 5832.2. Samples: 555962258. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:12,965][25689] Avg episode reward: [(0, '-27.783')] [2022-07-10 03:15:14,141][26022] Updated weights on worker 0-0, policy_version 542937 (0.00081) [2022-07-10 03:15:15,856][26022] Updated weights on worker 0-0, policy_version 542947 (0.00081) [2022-07-10 03:15:17,604][26022] Updated weights on worker 0-0, policy_version 542957 (0.00093) [2022-07-10 03:15:17,982][25689] Fps is (10 sec: 5922.2, 60 sec: 5686.3, 300 sec: 5658.8). Total num frames: 555991040. Throughput: 0: 5838.2. Samples: 555996570. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:17,982][25689] Avg episode reward: [(0, '-27.860')] [2022-07-10 03:15:19,550][26022] Updated weights on worker 0-0, policy_version 542967 (0.00087) [2022-07-10 03:15:21,122][26022] Updated weights on worker 0-0, policy_version 542977 (0.00083) [2022-07-10 03:15:23,048][25689] Fps is (10 sec: 5483.4, 60 sec: 5633.7, 300 sec: 5648.8). Total num frames: 556016640. Throughput: 0: 5084.7. Samples: 556013764. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:23,049][25689] Avg episode reward: [(0, '-27.679')] [2022-07-10 03:15:23,239][26022] Updated weights on worker 0-0, policy_version 542987 (0.00081) [2022-07-10 03:15:24,734][26022] Updated weights on worker 0-0, policy_version 542997 (0.00085) [2022-07-10 03:15:26,707][26022] Updated weights on worker 0-0, policy_version 543007 (0.00085) [2022-07-10 03:15:28,179][25689] Fps is (10 sec: 5522.8, 60 sec: 5643.1, 300 sec: 5653.7). Total num frames: 556047360. Throughput: 0: 5943.8. Samples: 556048294. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:28,180][25689] Avg episode reward: [(0, '-26.940')] [2022-07-10 03:15:28,408][26022] Updated weights on worker 0-0, policy_version 543017 (0.00093) [2022-07-10 03:15:30,251][26022] Updated weights on worker 0-0, policy_version 543027 (0.00088) [2022-07-10 03:15:31,965][26022] Updated weights on worker 0-0, policy_version 543037 (0.00088) [2022-07-10 03:15:33,226][25689] Fps is (10 sec: 5835.5, 60 sec: 5639.1, 300 sec: 5649.9). Total num frames: 556076032. Throughput: 0: 5937.2. Samples: 556082556. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:33,226][25689] Avg episode reward: [(0, '-26.853')] [2022-07-10 03:15:33,892][26022] Updated weights on worker 0-0, policy_version 543047 (0.00084) [2022-07-10 03:15:35,519][26022] Updated weights on worker 0-0, policy_version 543057 (0.00084) [2022-07-10 03:15:37,381][26022] Updated weights on worker 0-0, policy_version 543067 (0.00526) [2022-07-10 03:15:38,266][25689] Fps is (10 sec: 5786.8, 60 sec: 5686.7, 300 sec: 5656.2). Total num frames: 556105728. Throughput: 0: 5088.3. Samples: 556099782. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:38,266][25689] Avg episode reward: [(0, '-27.012')] [2022-07-10 03:15:39,144][26022] Updated weights on worker 0-0, policy_version 543077 (0.00085) [2022-07-10 03:15:41,036][26022] Updated weights on worker 0-0, policy_version 543087 (0.00087) [2022-07-10 03:15:42,654][26022] Updated weights on worker 0-0, policy_version 543097 (0.00085) [2022-07-10 03:15:43,312][25689] Fps is (10 sec: 5786.9, 60 sec: 5648.8, 300 sec: 5661.1). Total num frames: 556134400. Throughput: 0: 5933.8. Samples: 556134006. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:43,313][25689] Avg episode reward: [(0, '-26.662')] [2022-07-10 03:15:44,522][26022] Updated weights on worker 0-0, policy_version 543107 (0.00087) [2022-07-10 03:15:46,146][26022] Updated weights on worker 0-0, policy_version 543117 (0.00084) [2022-07-10 03:15:48,030][26022] Updated weights on worker 0-0, policy_version 543127 (0.00090) [2022-07-10 03:15:48,358][25689] Fps is (10 sec: 5580.2, 60 sec: 5648.3, 300 sec: 5647.2). Total num frames: 556162048. Throughput: 0: 5967.8. Samples: 556168720. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:48,359][25689] Avg episode reward: [(0, '-27.670')] [2022-07-10 03:15:49,835][26022] Updated weights on worker 0-0, policy_version 543137 (0.00087) [2022-07-10 03:15:51,599][26022] Updated weights on worker 0-0, policy_version 543147 (0.00109) [2022-07-10 03:15:53,373][25689] Fps is (10 sec: 5598.0, 60 sec: 5648.0, 300 sec: 5654.6). Total num frames: 556190720. Throughput: 0: 5112.3. Samples: 556185554. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:53,375][25689] Avg episode reward: [(0, '-26.972')] [2022-07-10 03:15:53,510][26022] Updated weights on worker 0-0, policy_version 543157 (0.00093) [2022-07-10 03:15:54,966][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:15:54,977][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000543166_556201984.pth [2022-07-10 03:15:54,977][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000541175_554163200.pth [2022-07-10 03:15:55,220][26022] Updated weights on worker 0-0, policy_version 543167 (0.00085) [2022-07-10 03:15:57,125][26022] Updated weights on worker 0-0, policy_version 543177 (0.00104) [2022-07-10 03:15:58,381][25689] Fps is (10 sec: 5926.1, 60 sec: 5699.1, 300 sec: 5661.5). Total num frames: 556221440. Throughput: 0: 5977.4. Samples: 556220016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:15:58,383][25689] Avg episode reward: [(0, '-29.707')] [2022-07-10 03:15:58,877][26022] Updated weights on worker 0-0, policy_version 543187 (0.00087) [2022-07-10 03:16:00,629][26022] Updated weights on worker 0-0, policy_version 543197 (0.00096) [2022-07-10 03:16:02,901][26022] Updated weights on worker 0-0, policy_version 543207 (0.00089) [2022-07-10 03:16:03,402][25689] Fps is (10 sec: 5513.6, 60 sec: 5647.9, 300 sec: 5649.2). Total num frames: 556246016. Throughput: 0: 5870.4. Samples: 556251942. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:03,403][25689] Avg episode reward: [(0, '-29.335')] [2022-07-10 03:16:04,507][26022] Updated weights on worker 0-0, policy_version 543217 (0.00083) [2022-07-10 03:16:06,652][26022] Updated weights on worker 0-0, policy_version 543227 (0.00087) [2022-07-10 03:16:08,207][26022] Updated weights on worker 0-0, policy_version 543237 (0.00094) [2022-07-10 03:16:08,523][25689] Fps is (10 sec: 5350.9, 60 sec: 5680.4, 300 sec: 5661.4). Total num frames: 556275712. Throughput: 0: 4978.6. Samples: 556269112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:08,524][25689] Avg episode reward: [(0, '-29.949')] [2022-07-10 03:16:10,221][26022] Updated weights on worker 0-0, policy_version 543247 (0.00072) [2022-07-10 03:16:12,026][26022] Updated weights on worker 0-0, policy_version 543257 (0.00093) [2022-07-10 03:16:13,543][25689] Fps is (10 sec: 5654.8, 60 sec: 5645.8, 300 sec: 5657.7). Total num frames: 556303360. Throughput: 0: 5799.6. Samples: 556302530. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:13,543][25689] Avg episode reward: [(0, '-29.332')] [2022-07-10 03:16:13,828][26022] Updated weights on worker 0-0, policy_version 543267 (0.00092) [2022-07-10 03:16:15,539][26022] Updated weights on worker 0-0, policy_version 543277 (0.00088) [2022-07-10 03:16:17,361][26022] Updated weights on worker 0-0, policy_version 543287 (0.00086) [2022-07-10 03:16:18,569][25689] Fps is (10 sec: 5504.6, 60 sec: 5611.2, 300 sec: 5647.7). Total num frames: 556331008. Throughput: 0: 5778.3. Samples: 556336668. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:18,569][25689] Avg episode reward: [(0, '-29.907')] [2022-07-10 03:16:19,273][26022] Updated weights on worker 0-0, policy_version 543297 (0.00091) [2022-07-10 03:16:21,037][26022] Updated weights on worker 0-0, policy_version 543307 (0.00083) [2022-07-10 03:16:22,848][26022] Updated weights on worker 0-0, policy_version 543317 (0.00089) [2022-07-10 03:16:23,596][25689] Fps is (10 sec: 5703.8, 60 sec: 5682.5, 300 sec: 5654.8). Total num frames: 556360704. Throughput: 0: 5021.6. Samples: 556353350. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:23,597][25689] Avg episode reward: [(0, '-28.625')] [2022-07-10 03:16:24,698][26022] Updated weights on worker 0-0, policy_version 543327 (0.00083) [2022-07-10 03:16:26,395][26022] Updated weights on worker 0-0, policy_version 543337 (0.00087) [2022-07-10 03:16:28,299][26022] Updated weights on worker 0-0, policy_version 543347 (0.00085) [2022-07-10 03:16:28,659][25689] Fps is (10 sec: 5784.2, 60 sec: 5655.0, 300 sec: 5654.0). Total num frames: 556389376. Throughput: 0: 5882.3. Samples: 556387560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:28,660][25689] Avg episode reward: [(0, '-27.058')] [2022-07-10 03:16:30,100][26022] Updated weights on worker 0-0, policy_version 543357 (0.00090) [2022-07-10 03:16:31,874][26022] Updated weights on worker 0-0, policy_version 543367 (0.00087) [2022-07-10 03:16:33,671][25689] Fps is (10 sec: 5590.5, 60 sec: 5641.4, 300 sec: 5647.4). Total num frames: 556417024. Throughput: 0: 5912.5. Samples: 556421538. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:33,671][25689] Avg episode reward: [(0, '-27.144')] [2022-07-10 03:16:33,771][26022] Updated weights on worker 0-0, policy_version 543377 (0.00087) [2022-07-10 03:16:35,550][26022] Updated weights on worker 0-0, policy_version 543387 (0.00092) [2022-07-10 03:16:37,265][26022] Updated weights on worker 0-0, policy_version 543397 (0.00088) [2022-07-10 03:16:38,717][25689] Fps is (10 sec: 5599.6, 60 sec: 5623.8, 300 sec: 5650.2). Total num frames: 556445696. Throughput: 0: 5075.6. Samples: 556438934. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:38,718][25689] Avg episode reward: [(0, '-28.309')] [2022-07-10 03:16:39,103][26022] Updated weights on worker 0-0, policy_version 543407 (0.00084) [2022-07-10 03:16:40,782][26022] Updated weights on worker 0-0, policy_version 543417 (0.00089) [2022-07-10 03:16:42,616][26022] Updated weights on worker 0-0, policy_version 543427 (0.00093) [2022-07-10 03:16:43,737][25689] Fps is (10 sec: 5696.5, 60 sec: 5626.3, 300 sec: 5648.0). Total num frames: 556474368. Throughput: 0: 5953.0. Samples: 556473248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:43,738][25689] Avg episode reward: [(0, '-29.498')] [2022-07-10 03:16:44,544][26022] Updated weights on worker 0-0, policy_version 543437 (0.00056) [2022-07-10 03:16:46,252][26022] Updated weights on worker 0-0, policy_version 543447 (0.00082) [2022-07-10 03:16:47,959][26022] Updated weights on worker 0-0, policy_version 543457 (0.00087) [2022-07-10 03:16:48,796][25689] Fps is (10 sec: 5689.4, 60 sec: 5642.0, 300 sec: 5643.7). Total num frames: 556503040. Throughput: 0: 5942.3. Samples: 556507220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:48,797][25689] Avg episode reward: [(0, '-30.446')] [2022-07-10 03:16:49,851][26022] Updated weights on worker 0-0, policy_version 543467 (0.00086) [2022-07-10 03:16:51,784][26022] Updated weights on worker 0-0, policy_version 543477 (0.00088) [2022-07-10 03:16:53,507][26022] Updated weights on worker 0-0, policy_version 543487 (0.00085) [2022-07-10 03:16:53,842][25689] Fps is (10 sec: 5776.2, 60 sec: 5656.0, 300 sec: 5654.2). Total num frames: 556532736. Throughput: 0: 5963.8. Samples: 556541836. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:53,843][25689] Avg episode reward: [(0, '-31.672')] [2022-07-10 03:16:55,149][26022] Updated weights on worker 0-0, policy_version 543497 (0.00087) [2022-07-10 03:16:57,032][26022] Updated weights on worker 0-0, policy_version 543507 (0.00092) [2022-07-10 03:16:58,770][26022] Updated weights on worker 0-0, policy_version 543517 (0.00092) [2022-07-10 03:16:58,907][25689] Fps is (10 sec: 5773.0, 60 sec: 5616.9, 300 sec: 5653.3). Total num frames: 556561408. Throughput: 0: 5950.7. Samples: 556559076. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:16:58,907][25689] Avg episode reward: [(0, '-30.333')] [2022-07-10 03:17:00,719][26022] Updated weights on worker 0-0, policy_version 543527 (0.00084) [2022-07-10 03:17:02,698][26022] Updated weights on worker 0-0, policy_version 543537 (0.00091) [2022-07-10 03:17:03,932][25689] Fps is (10 sec: 5480.1, 60 sec: 5650.3, 300 sec: 5650.9). Total num frames: 556588032. Throughput: 0: 5857.2. Samples: 556591536. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:17:03,933][25689] Avg episode reward: [(0, '-29.498')] [2022-07-10 03:17:04,525][26022] Updated weights on worker 0-0, policy_version 543547 (0.00087) [2022-07-10 03:17:06,235][26022] Updated weights on worker 0-0, policy_version 543557 (0.00086) [2022-07-10 03:17:08,152][26022] Updated weights on worker 0-0, policy_version 543567 (0.00087) [2022-07-10 03:17:09,002][25689] Fps is (10 sec: 5477.6, 60 sec: 5638.2, 300 sec: 5650.0). Total num frames: 556616704. Throughput: 0: 5874.3. Samples: 556625912. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:17:09,002][25689] Avg episode reward: [(0, '-28.063')] [2022-07-10 03:17:09,776][26022] Updated weights on worker 0-0, policy_version 543577 (0.00092) [2022-07-10 03:17:11,798][26022] Updated weights on worker 0-0, policy_version 543587 (0.00637) [2022-07-10 03:17:13,535][26022] Updated weights on worker 0-0, policy_version 543597 (0.00085) [2022-07-10 03:17:14,076][25689] Fps is (10 sec: 5854.9, 60 sec: 5683.8, 300 sec: 5662.5). Total num frames: 556647424. Throughput: 0: 5000.6. Samples: 556643020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:17:14,078][25689] Avg episode reward: [(0, '-27.448')] [2022-07-10 03:17:15,279][26022] Updated weights on worker 0-0, policy_version 543607 (0.00083) [2022-07-10 03:17:17,113][26022] Updated weights on worker 0-0, policy_version 543617 (0.00086) [2022-07-10 03:17:18,710][26022] Updated weights on worker 0-0, policy_version 543627 (0.00094) [2022-07-10 03:17:19,123][25689] Fps is (10 sec: 5766.9, 60 sec: 5681.9, 300 sec: 5651.7). Total num frames: 556675072. Throughput: 0: 5858.8. Samples: 556677518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:17:19,124][25689] Avg episode reward: [(0, '-26.394')] [2022-07-10 03:17:20,765][26022] Updated weights on worker 0-0, policy_version 543637 (0.00091) [2022-07-10 03:17:22,457][26022] Updated weights on worker 0-0, policy_version 543647 (0.00095) [2022-07-10 03:17:24,182][25689] Fps is (10 sec: 5573.2, 60 sec: 5662.1, 300 sec: 5652.5). Total num frames: 556703744. Throughput: 0: 5934.4. Samples: 556711704. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:17:24,182][25689] Avg episode reward: [(0, '-26.712')] [2022-07-10 03:17:24,235][26022] Updated weights on worker 0-0, policy_version 543657 (0.00088) [2022-07-10 03:17:26,154][26022] Updated weights on worker 0-0, policy_version 543667 (0.00094) [2022-07-10 03:17:27,739][26022] Updated weights on worker 0-0, policy_version 543677 (0.00086) [2022-07-10 03:17:29,255][25689] Fps is (10 sec: 5659.8, 60 sec: 5661.2, 300 sec: 5655.2). Total num frames: 556732416. Throughput: 0: 5075.0. Samples: 556728696. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 03:17:29,255][25689] Avg episode reward: [(0, '-26.960')] [2022-07-10 03:17:29,731][26022] Updated weights on worker 0-0, policy_version 543687 (0.00086) [2022-07-10 03:17:31,549][26022] Updated weights on worker 0-0, policy_version 543697 (0.00090) [2022-07-10 03:17:33,155][26022] Updated weights on worker 0-0, policy_version 543707 (0.00085) [2022-07-10 03:17:34,312][25689] Fps is (10 sec: 5660.5, 60 sec: 5673.7, 300 sec: 5651.1). Total num frames: 556761088. Throughput: 0: 5928.2. Samples: 556762982. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:17:34,313][25689] Avg episode reward: [(0, '-27.740')] [2022-07-10 03:17:34,978][26022] Updated weights on worker 0-0, policy_version 543717 (0.00093) [2022-07-10 03:17:36,873][26022] Updated weights on worker 0-0, policy_version 543727 (0.00084) [2022-07-10 03:17:38,663][26022] Updated weights on worker 0-0, policy_version 543737 (0.00086) [2022-07-10 03:17:39,341][25689] Fps is (10 sec: 5685.5, 60 sec: 5675.4, 300 sec: 5650.8). Total num frames: 556789760. Throughput: 0: 5906.8. Samples: 556796940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:17:39,341][25689] Avg episode reward: [(0, '-27.831')] [2022-07-10 03:17:40,672][26022] Updated weights on worker 0-0, policy_version 543747 (0.00085) [2022-07-10 03:17:42,124][26022] Updated weights on worker 0-0, policy_version 543757 (0.00090) [2022-07-10 03:17:44,175][26022] Updated weights on worker 0-0, policy_version 543767 (0.00083) [2022-07-10 03:17:44,379][25689] Fps is (10 sec: 5696.2, 60 sec: 5673.7, 300 sec: 5655.5). Total num frames: 556818432. Throughput: 0: 5072.0. Samples: 556814146. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:17:44,380][25689] Avg episode reward: [(0, '-28.867')] [2022-07-10 03:17:45,919][26022] Updated weights on worker 0-0, policy_version 543777 (0.00087) [2022-07-10 03:17:47,654][26022] Updated weights on worker 0-0, policy_version 543787 (0.00095) [2022-07-10 03:17:49,468][25689] Fps is (10 sec: 5662.6, 60 sec: 5671.0, 300 sec: 5647.6). Total num frames: 556847104. Throughput: 0: 5932.7. Samples: 556848612. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:17:49,468][25689] Avg episode reward: [(0, '-27.839')] [2022-07-10 03:17:49,611][26022] Updated weights on worker 0-0, policy_version 543797 (0.00090) [2022-07-10 03:17:51,067][26022] Updated weights on worker 0-0, policy_version 543807 (0.00099) [2022-07-10 03:17:53,223][26022] Updated weights on worker 0-0, policy_version 543817 (0.00087) [2022-07-10 03:17:54,491][25689] Fps is (10 sec: 5873.6, 60 sec: 5689.9, 300 sec: 5661.2). Total num frames: 556877824. Throughput: 0: 5944.5. Samples: 556882934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:17:54,491][25689] Avg episode reward: [(0, '-28.080')] [2022-07-10 03:17:54,744][26022] Updated weights on worker 0-0, policy_version 543827 (0.00085) [2022-07-10 03:17:54,997][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:17:55,006][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000543828_556879872.pth [2022-07-10 03:17:55,006][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000541840_554844160.pth [2022-07-10 03:17:56,607][26022] Updated weights on worker 0-0, policy_version 543837 (0.00089) [2022-07-10 03:17:58,498][26022] Updated weights on worker 0-0, policy_version 543847 (0.00087) [2022-07-10 03:17:59,534][25689] Fps is (10 sec: 5696.4, 60 sec: 5658.2, 300 sec: 5650.3). Total num frames: 556904448. Throughput: 0: 5116.7. Samples: 556900264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:17:59,535][25689] Avg episode reward: [(0, '-28.940')] [2022-07-10 03:18:00,083][26022] Updated weights on worker 0-0, policy_version 543857 (0.00086) [2022-07-10 03:18:02,470][26022] Updated weights on worker 0-0, policy_version 543867 (0.00094) [2022-07-10 03:18:03,943][26022] Updated weights on worker 0-0, policy_version 543877 (0.00086) [2022-07-10 03:18:04,548][25689] Fps is (10 sec: 5396.2, 60 sec: 5676.1, 300 sec: 5654.6). Total num frames: 556932096. Throughput: 0: 5871.5. Samples: 556932568. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:04,549][25689] Avg episode reward: [(0, '-28.559')] [2022-07-10 03:18:05,880][26022] Updated weights on worker 0-0, policy_version 543887 (0.00092) [2022-07-10 03:18:07,644][26022] Updated weights on worker 0-0, policy_version 543897 (0.00091) [2022-07-10 03:18:09,479][26022] Updated weights on worker 0-0, policy_version 543907 (0.00092) [2022-07-10 03:18:09,603][25689] Fps is (10 sec: 5593.5, 60 sec: 5677.5, 300 sec: 5658.2). Total num frames: 556960768. Throughput: 0: 5892.6. Samples: 556967262. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:09,604][25689] Avg episode reward: [(0, '-28.786')] [2022-07-10 03:18:11,256][26022] Updated weights on worker 0-0, policy_version 543917 (0.00086) [2022-07-10 03:18:13,005][26022] Updated weights on worker 0-0, policy_version 543927 (0.00098) [2022-07-10 03:18:14,654][25689] Fps is (10 sec: 5775.7, 60 sec: 5662.8, 300 sec: 5658.0). Total num frames: 556990464. Throughput: 0: 5041.3. Samples: 556984584. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:14,656][25689] Avg episode reward: [(0, '-28.152')] [2022-07-10 03:18:14,691][26022] Updated weights on worker 0-0, policy_version 543937 (0.00083) [2022-07-10 03:18:16,665][26022] Updated weights on worker 0-0, policy_version 543947 (0.00082) [2022-07-10 03:18:18,356][26022] Updated weights on worker 0-0, policy_version 543957 (0.00089) [2022-07-10 03:18:19,687][25689] Fps is (10 sec: 5686.8, 60 sec: 5664.1, 300 sec: 5658.4). Total num frames: 557018112. Throughput: 0: 5882.8. Samples: 557018816. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:19,688][25689] Avg episode reward: [(0, '-28.278')] [2022-07-10 03:18:20,194][26022] Updated weights on worker 0-0, policy_version 543967 (0.00084) [2022-07-10 03:18:22,064][26022] Updated weights on worker 0-0, policy_version 543977 (0.00087) [2022-07-10 03:18:23,778][26022] Updated weights on worker 0-0, policy_version 543987 (0.00086) [2022-07-10 03:18:24,691][25689] Fps is (10 sec: 5611.2, 60 sec: 5669.2, 300 sec: 5660.3). Total num frames: 557046784. Throughput: 0: 5965.1. Samples: 557052722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:24,693][25689] Avg episode reward: [(0, '-27.817')] [2022-07-10 03:18:25,582][26022] Updated weights on worker 0-0, policy_version 543997 (0.00069) [2022-07-10 03:18:27,524][26022] Updated weights on worker 0-0, policy_version 544007 (0.00089) [2022-07-10 03:18:29,216][26022] Updated weights on worker 0-0, policy_version 544017 (0.00083) [2022-07-10 03:18:29,762][25689] Fps is (10 sec: 5793.2, 60 sec: 5686.3, 300 sec: 5660.5). Total num frames: 557076480. Throughput: 0: 5084.3. Samples: 557069756. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:29,762][25689] Avg episode reward: [(0, '-27.418')] [2022-07-10 03:18:31,018][26022] Updated weights on worker 0-0, policy_version 544027 (0.00090) [2022-07-10 03:18:32,852][26022] Updated weights on worker 0-0, policy_version 544037 (0.00089) [2022-07-10 03:18:34,605][26022] Updated weights on worker 0-0, policy_version 544047 (0.00087) [2022-07-10 03:18:34,774][25689] Fps is (10 sec: 5789.1, 60 sec: 5690.6, 300 sec: 5657.8). Total num frames: 557105152. Throughput: 0: 5928.4. Samples: 557103860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:34,774][25689] Avg episode reward: [(0, '-27.569')] [2022-07-10 03:18:36,639][26022] Updated weights on worker 0-0, policy_version 544057 (0.00085) [2022-07-10 03:18:38,180][26022] Updated weights on worker 0-0, policy_version 544067 (0.00091) [2022-07-10 03:18:39,783][25689] Fps is (10 sec: 5619.9, 60 sec: 5675.5, 300 sec: 5654.8). Total num frames: 557132800. Throughput: 0: 5926.8. Samples: 557137924. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:39,784][25689] Avg episode reward: [(0, '-28.586')] [2022-07-10 03:18:40,105][26022] Updated weights on worker 0-0, policy_version 544077 (0.00085) [2022-07-10 03:18:41,893][26022] Updated weights on worker 0-0, policy_version 544087 (0.00095) [2022-07-10 03:18:43,616][26022] Updated weights on worker 0-0, policy_version 544097 (0.00085) [2022-07-10 03:18:44,787][25689] Fps is (10 sec: 5624.7, 60 sec: 5678.8, 300 sec: 5664.2). Total num frames: 557161472. Throughput: 0: 5097.2. Samples: 557155152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:44,787][25689] Avg episode reward: [(0, '-27.919')] [2022-07-10 03:18:45,434][26022] Updated weights on worker 0-0, policy_version 544107 (0.00088) [2022-07-10 03:18:47,248][26022] Updated weights on worker 0-0, policy_version 544117 (0.00085) [2022-07-10 03:18:48,962][26022] Updated weights on worker 0-0, policy_version 544127 (0.00091) [2022-07-10 03:18:49,849][25689] Fps is (10 sec: 5798.7, 60 sec: 5698.2, 300 sec: 5661.4). Total num frames: 557191168. Throughput: 0: 5969.0. Samples: 557189656. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:49,851][25689] Avg episode reward: [(0, '-27.600')] [2022-07-10 03:18:51,004][26022] Updated weights on worker 0-0, policy_version 544137 (0.00100) [2022-07-10 03:18:52,636][26022] Updated weights on worker 0-0, policy_version 544147 (0.00095) [2022-07-10 03:18:54,573][26022] Updated weights on worker 0-0, policy_version 544157 (0.00050) [2022-07-10 03:18:54,907][25689] Fps is (10 sec: 5666.5, 60 sec: 5644.1, 300 sec: 5654.4). Total num frames: 557218816. Throughput: 0: 5929.5. Samples: 557223238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:54,907][25689] Avg episode reward: [(0, '-26.661')] [2022-07-10 03:18:56,256][26022] Updated weights on worker 0-0, policy_version 544167 (0.00088) [2022-07-10 03:18:57,884][26022] Updated weights on worker 0-0, policy_version 544177 (0.00085) [2022-07-10 03:18:59,741][26022] Updated weights on worker 0-0, policy_version 544187 (0.00078) [2022-07-10 03:18:59,923][25689] Fps is (10 sec: 5590.7, 60 sec: 5680.5, 300 sec: 5665.0). Total num frames: 557247488. Throughput: 0: 5088.4. Samples: 557240402. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:18:59,923][25689] Avg episode reward: [(0, '-26.826')] [2022-07-10 03:19:01,517][26022] Updated weights on worker 0-0, policy_version 544197 (0.00086) [2022-07-10 03:19:04,012][26022] Updated weights on worker 0-0, policy_version 544207 (0.00093) [2022-07-10 03:19:04,931][25689] Fps is (10 sec: 5516.2, 60 sec: 5664.2, 300 sec: 5659.3). Total num frames: 557274112. Throughput: 0: 5831.2. Samples: 557272616. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:04,933][25689] Avg episode reward: [(0, '-26.220')] [2022-07-10 03:19:05,617][26022] Updated weights on worker 0-0, policy_version 544217 (0.00093) [2022-07-10 03:19:07,515][26022] Updated weights on worker 0-0, policy_version 544227 (0.00069) [2022-07-10 03:19:09,036][26022] Updated weights on worker 0-0, policy_version 544237 (0.00084) [2022-07-10 03:19:10,047][25689] Fps is (10 sec: 5360.8, 60 sec: 5641.5, 300 sec: 5653.7). Total num frames: 557301760. Throughput: 0: 5805.2. Samples: 557306908. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:10,049][25689] Avg episode reward: [(0, '-26.276')] [2022-07-10 03:19:11,003][26022] Updated weights on worker 0-0, policy_version 544247 (0.00088) [2022-07-10 03:19:12,770][26022] Updated weights on worker 0-0, policy_version 544257 (0.00097) [2022-07-10 03:19:14,496][26022] Updated weights on worker 0-0, policy_version 544267 (0.00091) [2022-07-10 03:19:15,049][25689] Fps is (10 sec: 5667.3, 60 sec: 5646.0, 300 sec: 5660.7). Total num frames: 557331456. Throughput: 0: 5004.3. Samples: 557324038. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:15,050][25689] Avg episode reward: [(0, '-27.914')] [2022-07-10 03:19:16,422][26022] Updated weights on worker 0-0, policy_version 544277 (0.00088) [2022-07-10 03:19:18,170][26022] Updated weights on worker 0-0, policy_version 544287 (0.00087) [2022-07-10 03:19:19,914][26022] Updated weights on worker 0-0, policy_version 544297 (0.00086) [2022-07-10 03:19:20,082][25689] Fps is (10 sec: 5816.0, 60 sec: 5662.9, 300 sec: 5661.0). Total num frames: 557360128. Throughput: 0: 5859.8. Samples: 557358534. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:20,083][25689] Avg episode reward: [(0, '-28.188')] [2022-07-10 03:19:21,630][26022] Updated weights on worker 0-0, policy_version 544307 (0.00081) [2022-07-10 03:19:23,651][26022] Updated weights on worker 0-0, policy_version 544317 (0.00089) [2022-07-10 03:19:25,095][25689] Fps is (10 sec: 5708.4, 60 sec: 5662.2, 300 sec: 5658.2). Total num frames: 557388800. Throughput: 0: 5943.6. Samples: 557392462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:25,098][25689] Avg episode reward: [(0, '-28.755')] [2022-07-10 03:19:25,311][26022] Updated weights on worker 0-0, policy_version 544327 (0.00088) [2022-07-10 03:19:27,263][26022] Updated weights on worker 0-0, policy_version 544337 (0.00084) [2022-07-10 03:19:28,999][26022] Updated weights on worker 0-0, policy_version 544347 (0.00085) [2022-07-10 03:19:30,219][25689] Fps is (10 sec: 5657.3, 60 sec: 5640.3, 300 sec: 5656.0). Total num frames: 557417472. Throughput: 0: 5936.6. Samples: 557426660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:30,219][25689] Avg episode reward: [(0, '-28.556')] [2022-07-10 03:19:30,905][26022] Updated weights on worker 0-0, policy_version 544357 (0.00080) [2022-07-10 03:19:32,732][26022] Updated weights on worker 0-0, policy_version 544367 (0.00091) [2022-07-10 03:19:34,287][26022] Updated weights on worker 0-0, policy_version 544377 (0.00084) [2022-07-10 03:19:35,278][25689] Fps is (10 sec: 5631.3, 60 sec: 5635.9, 300 sec: 5661.8). Total num frames: 557446144. Throughput: 0: 5918.5. Samples: 557443762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:35,280][25689] Avg episode reward: [(0, '-28.994')] [2022-07-10 03:19:36,268][26022] Updated weights on worker 0-0, policy_version 544387 (0.00088) [2022-07-10 03:19:38,029][26022] Updated weights on worker 0-0, policy_version 544397 (0.00088) [2022-07-10 03:19:39,835][26022] Updated weights on worker 0-0, policy_version 544407 (0.00083) [2022-07-10 03:19:40,289][25689] Fps is (10 sec: 5796.1, 60 sec: 5669.6, 300 sec: 5658.2). Total num frames: 557475840. Throughput: 0: 5914.3. Samples: 557478040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:40,291][25689] Avg episode reward: [(0, '-27.887')] [2022-07-10 03:19:41,590][26022] Updated weights on worker 0-0, policy_version 544417 (0.00087) [2022-07-10 03:19:43,295][26022] Updated weights on worker 0-0, policy_version 544427 (0.00087) [2022-07-10 03:19:45,304][25689] Fps is (10 sec: 5719.3, 60 sec: 5651.5, 300 sec: 5658.7). Total num frames: 557503488. Throughput: 0: 5954.0. Samples: 557512790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:45,305][25689] Avg episode reward: [(0, '-28.382')] [2022-07-10 03:19:45,315][26022] Updated weights on worker 0-0, policy_version 544437 (0.00087) [2022-07-10 03:19:46,945][26022] Updated weights on worker 0-0, policy_version 544447 (0.00091) [2022-07-10 03:19:48,705][26022] Updated weights on worker 0-0, policy_version 544457 (0.00083) [2022-07-10 03:19:50,364][26022] Updated weights on worker 0-0, policy_version 544467 (0.00120) [2022-07-10 03:19:50,403][25689] Fps is (10 sec: 5771.1, 60 sec: 5665.1, 300 sec: 5663.9). Total num frames: 557534208. Throughput: 0: 5125.9. Samples: 557530122. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:50,403][25689] Avg episode reward: [(0, '-27.972')] [2022-07-10 03:19:52,249][26022] Updated weights on worker 0-0, policy_version 544477 (0.00956) [2022-07-10 03:19:54,015][26022] Updated weights on worker 0-0, policy_version 544487 (0.00094) [2022-07-10 03:19:55,312][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:19:55,325][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000544493_557560832.pth [2022-07-10 03:19:55,328][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000542501_555521024.pth [2022-07-10 03:19:55,410][25689] Fps is (10 sec: 5674.1, 60 sec: 5652.8, 300 sec: 5660.6). Total num frames: 557560832. Throughput: 0: 5993.8. Samples: 557564434. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:19:55,411][25689] Avg episode reward: [(0, '-27.654')] [2022-07-10 03:19:55,883][26022] Updated weights on worker 0-0, policy_version 544497 (0.00091) [2022-07-10 03:19:57,383][26022] Updated weights on worker 0-0, policy_version 544507 (0.00084) [2022-07-10 03:19:59,531][26022] Updated weights on worker 0-0, policy_version 544517 (0.00085) [2022-07-10 03:20:00,417][25689] Fps is (10 sec: 5623.8, 60 sec: 5670.6, 300 sec: 5667.6). Total num frames: 557590528. Throughput: 0: 5994.5. Samples: 557598700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:00,418][25689] Avg episode reward: [(0, '-28.407')] [2022-07-10 03:20:01,339][26022] Updated weights on worker 0-0, policy_version 544527 (0.00091) [2022-07-10 03:20:03,382][26022] Updated weights on worker 0-0, policy_version 544537 (0.00091) [2022-07-10 03:20:05,206][26022] Updated weights on worker 0-0, policy_version 544547 (0.00092) [2022-07-10 03:20:05,438][25689] Fps is (10 sec: 5616.8, 60 sec: 5669.5, 300 sec: 5665.9). Total num frames: 557617152. Throughput: 0: 5012.8. Samples: 557613714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:05,438][25689] Avg episode reward: [(0, '-29.479')] [2022-07-10 03:20:06,965][26022] Updated weights on worker 0-0, policy_version 544557 (0.00101) [2022-07-10 03:20:08,656][26022] Updated weights on worker 0-0, policy_version 544567 (0.00092) [2022-07-10 03:20:10,558][25689] Fps is (10 sec: 5352.0, 60 sec: 5669.0, 300 sec: 5656.9). Total num frames: 557644800. Throughput: 0: 5832.9. Samples: 557647684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:10,558][25689] Avg episode reward: [(0, '-29.751')] [2022-07-10 03:20:10,950][26022] Updated weights on worker 0-0, policy_version 544577 (0.00094) [2022-07-10 03:20:12,220][26022] Updated weights on worker 0-0, policy_version 544587 (0.00099) [2022-07-10 03:20:14,349][26022] Updated weights on worker 0-0, policy_version 544597 (0.00092) [2022-07-10 03:20:15,656][25689] Fps is (10 sec: 5712.3, 60 sec: 5677.0, 300 sec: 5658.8). Total num frames: 557675520. Throughput: 0: 5799.3. Samples: 557681842. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:15,656][25689] Avg episode reward: [(0, '-29.291')] [2022-07-10 03:20:15,938][26022] Updated weights on worker 0-0, policy_version 544607 (0.00092) [2022-07-10 03:20:17,797][26022] Updated weights on worker 0-0, policy_version 544617 (0.00080) [2022-07-10 03:20:19,739][26022] Updated weights on worker 0-0, policy_version 544627 (0.00097) [2022-07-10 03:20:20,671][25689] Fps is (10 sec: 5771.4, 60 sec: 5661.8, 300 sec: 5666.7). Total num frames: 557703168. Throughput: 0: 4956.1. Samples: 557699080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:20,672][25689] Avg episode reward: [(0, '-30.391')] [2022-07-10 03:20:21,230][26022] Updated weights on worker 0-0, policy_version 544637 (0.00095) [2022-07-10 03:20:23,306][26022] Updated weights on worker 0-0, policy_version 544647 (0.00090) [2022-07-10 03:20:24,939][26022] Updated weights on worker 0-0, policy_version 544657 (0.00090) [2022-07-10 03:20:25,678][25689] Fps is (10 sec: 5619.6, 60 sec: 5662.3, 300 sec: 5662.1). Total num frames: 557731840. Throughput: 0: 5900.6. Samples: 557733144. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:25,678][25689] Avg episode reward: [(0, '-31.571')] [2022-07-10 03:20:27,078][26022] Updated weights on worker 0-0, policy_version 544667 (0.00084) [2022-07-10 03:20:28,695][26022] Updated weights on worker 0-0, policy_version 544677 (0.00082) [2022-07-10 03:20:30,537][26022] Updated weights on worker 0-0, policy_version 544687 (0.00084) [2022-07-10 03:20:30,772][25689] Fps is (10 sec: 5677.0, 60 sec: 5665.1, 300 sec: 5661.2). Total num frames: 557760512. Throughput: 0: 5896.7. Samples: 557766884. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 03:20:30,773][25689] Avg episode reward: [(0, '-31.203')] [2022-07-10 03:20:32,226][26022] Updated weights on worker 0-0, policy_version 544697 (0.01158) [2022-07-10 03:20:34,098][26022] Updated weights on worker 0-0, policy_version 544707 (0.00090) [2022-07-10 03:20:35,789][25689] Fps is (10 sec: 5671.3, 60 sec: 5669.0, 300 sec: 5658.2). Total num frames: 557789184. Throughput: 0: 5078.1. Samples: 557784082. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:20:35,789][25689] Avg episode reward: [(0, '-30.081')] [2022-07-10 03:20:35,905][26022] Updated weights on worker 0-0, policy_version 544717 (0.00055) [2022-07-10 03:20:37,721][26022] Updated weights on worker 0-0, policy_version 544727 (0.00084) [2022-07-10 03:20:39,500][26022] Updated weights on worker 0-0, policy_version 544737 (0.00092) [2022-07-10 03:20:40,881][25689] Fps is (10 sec: 5672.7, 60 sec: 5644.6, 300 sec: 5657.4). Total num frames: 557817856. Throughput: 0: 5889.8. Samples: 557818112. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:20:40,883][25689] Avg episode reward: [(0, '-30.054')] [2022-07-10 03:20:41,459][26022] Updated weights on worker 0-0, policy_version 544747 (0.00084) [2022-07-10 03:20:43,090][26022] Updated weights on worker 0-0, policy_version 544757 (0.00098) [2022-07-10 03:20:45,017][26022] Updated weights on worker 0-0, policy_version 544767 (0.00342) [2022-07-10 03:20:45,895][25689] Fps is (10 sec: 5775.7, 60 sec: 5678.5, 300 sec: 5664.9). Total num frames: 557847552. Throughput: 0: 5903.6. Samples: 557852498. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:20:45,895][25689] Avg episode reward: [(0, '-30.173')] [2022-07-10 03:20:46,785][26022] Updated weights on worker 0-0, policy_version 544777 (0.00100) [2022-07-10 03:20:48,338][26022] Updated weights on worker 0-0, policy_version 544787 (0.00085) [2022-07-10 03:20:50,464][26022] Updated weights on worker 0-0, policy_version 544797 (0.00088) [2022-07-10 03:20:50,973][25689] Fps is (10 sec: 5682.2, 60 sec: 5629.7, 300 sec: 5660.2). Total num frames: 557875200. Throughput: 0: 5083.1. Samples: 557869566. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:20:50,975][25689] Avg episode reward: [(0, '-29.097')] [2022-07-10 03:20:52,037][26022] Updated weights on worker 0-0, policy_version 544807 (0.00082) [2022-07-10 03:20:53,912][26022] Updated weights on worker 0-0, policy_version 544817 (0.00090) [2022-07-10 03:20:55,679][26022] Updated weights on worker 0-0, policy_version 544827 (0.00091) [2022-07-10 03:20:55,984][25689] Fps is (10 sec: 5582.3, 60 sec: 5663.2, 300 sec: 5653.3). Total num frames: 557903872. Throughput: 0: 5935.3. Samples: 557903944. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:20:55,984][25689] Avg episode reward: [(0, '-28.484')] [2022-07-10 03:20:57,572][26022] Updated weights on worker 0-0, policy_version 544837 (0.00088) [2022-07-10 03:20:59,255][26022] Updated weights on worker 0-0, policy_version 544847 (0.00087) [2022-07-10 03:21:00,986][25689] Fps is (10 sec: 5624.6, 60 sec: 5629.8, 300 sec: 5664.0). Total num frames: 557931520. Throughput: 0: 5971.5. Samples: 557938168. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:00,987][25689] Avg episode reward: [(0, '-28.268')] [2022-07-10 03:21:01,245][26022] Updated weights on worker 0-0, policy_version 544857 (0.00085) [2022-07-10 03:21:03,058][26022] Updated weights on worker 0-0, policy_version 544867 (0.00079) [2022-07-10 03:21:05,087][26022] Updated weights on worker 0-0, policy_version 544877 (0.00089) [2022-07-10 03:21:06,007][25689] Fps is (10 sec: 5619.1, 60 sec: 5663.6, 300 sec: 5662.4). Total num frames: 557960192. Throughput: 0: 5013.6. Samples: 557953332. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:06,007][25689] Avg episode reward: [(0, '-28.268')] [2022-07-10 03:21:06,592][26022] Updated weights on worker 0-0, policy_version 544887 (0.00087) [2022-07-10 03:21:08,571][26022] Updated weights on worker 0-0, policy_version 544897 (0.00094) [2022-07-10 03:21:10,444][26022] Updated weights on worker 0-0, policy_version 544907 (0.00092) [2022-07-10 03:21:11,130][25689] Fps is (10 sec: 5552.1, 60 sec: 5663.3, 300 sec: 5660.5). Total num frames: 557987840. Throughput: 0: 5857.3. Samples: 557987632. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:11,130][25689] Avg episode reward: [(0, '-27.219')] [2022-07-10 03:21:12,167][26022] Updated weights on worker 0-0, policy_version 544917 (0.00086) [2022-07-10 03:21:14,081][26022] Updated weights on worker 0-0, policy_version 544927 (0.00082) [2022-07-10 03:21:15,734][26022] Updated weights on worker 0-0, policy_version 544937 (0.00087) [2022-07-10 03:21:16,211][25689] Fps is (10 sec: 5519.2, 60 sec: 5631.1, 300 sec: 5662.8). Total num frames: 558016512. Throughput: 0: 5841.2. Samples: 558022096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:16,212][25689] Avg episode reward: [(0, '-27.109')] [2022-07-10 03:21:17,487][26022] Updated weights on worker 0-0, policy_version 544947 (0.00095) [2022-07-10 03:21:19,458][26022] Updated weights on worker 0-0, policy_version 544957 (0.00085) [2022-07-10 03:21:21,055][26022] Updated weights on worker 0-0, policy_version 544967 (0.00607) [2022-07-10 03:21:21,276][25689] Fps is (10 sec: 5954.4, 60 sec: 5694.0, 300 sec: 5669.0). Total num frames: 558048256. Throughput: 0: 4979.0. Samples: 558039194. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:21,277][25689] Avg episode reward: [(0, '-26.895')] [2022-07-10 03:21:23,150][26022] Updated weights on worker 0-0, policy_version 544977 (0.00095) [2022-07-10 03:21:24,651][26022] Updated weights on worker 0-0, policy_version 544987 (0.00094) [2022-07-10 03:21:26,353][25689] Fps is (10 sec: 5654.2, 60 sec: 5636.8, 300 sec: 5658.4). Total num frames: 558073856. Throughput: 0: 5894.7. Samples: 558073266. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:26,353][25689] Avg episode reward: [(0, '-26.808')] [2022-07-10 03:21:26,772][26022] Updated weights on worker 0-0, policy_version 544997 (0.00083) [2022-07-10 03:21:28,306][26022] Updated weights on worker 0-0, policy_version 545007 (0.00091) [2022-07-10 03:21:30,270][26022] Updated weights on worker 0-0, policy_version 545017 (0.00093) [2022-07-10 03:21:31,411][25689] Fps is (10 sec: 5557.2, 60 sec: 5674.0, 300 sec: 5667.9). Total num frames: 558104576. Throughput: 0: 5884.2. Samples: 558106970. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:31,412][25689] Avg episode reward: [(0, '-26.363')] [2022-07-10 03:21:32,114][26022] Updated weights on worker 0-0, policy_version 545027 (0.00499) [2022-07-10 03:21:33,787][26022] Updated weights on worker 0-0, policy_version 545037 (0.00094) [2022-07-10 03:21:35,821][26022] Updated weights on worker 0-0, policy_version 545047 (0.00095) [2022-07-10 03:21:36,501][25689] Fps is (10 sec: 5751.6, 60 sec: 5650.3, 300 sec: 5663.6). Total num frames: 558132224. Throughput: 0: 5866.8. Samples: 558141132. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:36,501][25689] Avg episode reward: [(0, '-26.852')] [2022-07-10 03:21:37,361][26022] Updated weights on worker 0-0, policy_version 545057 (0.00085) [2022-07-10 03:21:39,279][26022] Updated weights on worker 0-0, policy_version 545067 (0.00083) [2022-07-10 03:21:41,059][26022] Updated weights on worker 0-0, policy_version 545077 (0.00095) [2022-07-10 03:21:41,531][25689] Fps is (10 sec: 5463.8, 60 sec: 5639.1, 300 sec: 5660.0). Total num frames: 558159872. Throughput: 0: 5873.0. Samples: 558158152. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:41,532][25689] Avg episode reward: [(0, '-27.830')] [2022-07-10 03:21:42,928][26022] Updated weights on worker 0-0, policy_version 545087 (0.00090) [2022-07-10 03:21:44,813][26022] Updated weights on worker 0-0, policy_version 545097 (0.00087) [2022-07-10 03:21:46,383][26022] Updated weights on worker 0-0, policy_version 545107 (0.00085) [2022-07-10 03:21:46,592][25689] Fps is (10 sec: 5783.9, 60 sec: 5651.6, 300 sec: 5666.8). Total num frames: 558190592. Throughput: 0: 5877.7. Samples: 558192228. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:46,593][25689] Avg episode reward: [(0, '-28.349')] [2022-07-10 03:21:48,491][26022] Updated weights on worker 0-0, policy_version 545117 (0.00086) [2022-07-10 03:21:49,927][26022] Updated weights on worker 0-0, policy_version 545127 (0.00093) [2022-07-10 03:21:51,636][25689] Fps is (10 sec: 5674.9, 60 sec: 5637.9, 300 sec: 5656.5). Total num frames: 558217216. Throughput: 0: 5910.1. Samples: 558226504. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:51,638][25689] Avg episode reward: [(0, '-29.001')] [2022-07-10 03:21:52,093][26022] Updated weights on worker 0-0, policy_version 545137 (0.00092) [2022-07-10 03:21:53,465][26022] Updated weights on worker 0-0, policy_version 545147 (0.00093) [2022-07-10 03:21:55,484][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:21:55,496][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000545156_558239744.pth [2022-07-10 03:21:55,497][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000543166_556201984.pth [2022-07-10 03:21:55,583][26022] Updated weights on worker 0-0, policy_version 545157 (0.00099) [2022-07-10 03:21:56,643][25689] Fps is (10 sec: 5705.6, 60 sec: 5672.1, 300 sec: 5664.5). Total num frames: 558247936. Throughput: 0: 5095.8. Samples: 558243770. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:21:56,644][25689] Avg episode reward: [(0, '-27.603')] [2022-07-10 03:21:57,338][26022] Updated weights on worker 0-0, policy_version 545167 (0.00091) [2022-07-10 03:21:59,175][26022] Updated weights on worker 0-0, policy_version 545177 (0.00086) [2022-07-10 03:22:00,949][26022] Updated weights on worker 0-0, policy_version 545187 (0.00090) [2022-07-10 03:22:01,678][25689] Fps is (10 sec: 5710.6, 60 sec: 5652.2, 300 sec: 5664.3). Total num frames: 558274560. Throughput: 0: 5939.8. Samples: 558277818. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:01,683][25689] Avg episode reward: [(0, '-29.139')] [2022-07-10 03:22:03,252][26022] Updated weights on worker 0-0, policy_version 545197 (0.00104) [2022-07-10 03:22:04,860][26022] Updated weights on worker 0-0, policy_version 545207 (0.00083) [2022-07-10 03:22:06,699][25689] Fps is (10 sec: 5193.3, 60 sec: 5601.5, 300 sec: 5654.9). Total num frames: 558300160. Throughput: 0: 5837.7. Samples: 558309604. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:06,699][25689] Avg episode reward: [(0, '-29.434')] [2022-07-10 03:22:06,875][26022] Updated weights on worker 0-0, policy_version 545217 (0.00091) [2022-07-10 03:22:08,458][26022] Updated weights on worker 0-0, policy_version 545227 (0.00085) [2022-07-10 03:22:10,451][26022] Updated weights on worker 0-0, policy_version 545237 (0.00093) [2022-07-10 03:22:11,781][25689] Fps is (10 sec: 5574.6, 60 sec: 5656.0, 300 sec: 5654.8). Total num frames: 558330880. Throughput: 0: 4960.7. Samples: 558326436. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:11,781][25689] Avg episode reward: [(0, '-27.361')] [2022-07-10 03:22:12,134][26022] Updated weights on worker 0-0, policy_version 545247 (0.00078) [2022-07-10 03:22:14,105][26022] Updated weights on worker 0-0, policy_version 545257 (0.00081) [2022-07-10 03:22:15,670][26022] Updated weights on worker 0-0, policy_version 545267 (0.00097) [2022-07-10 03:22:16,786][25689] Fps is (10 sec: 5888.0, 60 sec: 5663.1, 300 sec: 5659.0). Total num frames: 558359552. Throughput: 0: 5803.9. Samples: 558360678. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:16,787][25689] Avg episode reward: [(0, '-26.115')] [2022-07-10 03:22:17,519][26022] Updated weights on worker 0-0, policy_version 545277 (0.00077) [2022-07-10 03:22:19,358][26022] Updated weights on worker 0-0, policy_version 545287 (0.00086) [2022-07-10 03:22:21,165][26022] Updated weights on worker 0-0, policy_version 545297 (0.00085) [2022-07-10 03:22:21,793][25689] Fps is (10 sec: 5625.2, 60 sec: 5600.8, 300 sec: 5656.6). Total num frames: 558387200. Throughput: 0: 5834.3. Samples: 558395176. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:21,793][25689] Avg episode reward: [(0, '-27.272')] [2022-07-10 03:22:23,114][26022] Updated weights on worker 0-0, policy_version 545307 (0.00545) [2022-07-10 03:22:24,562][26022] Updated weights on worker 0-0, policy_version 545317 (0.00091) [2022-07-10 03:22:26,567][26022] Updated weights on worker 0-0, policy_version 545327 (0.00086) [2022-07-10 03:22:26,799][25689] Fps is (10 sec: 5624.7, 60 sec: 5658.2, 300 sec: 5657.8). Total num frames: 558415872. Throughput: 0: 5130.3. Samples: 558412722. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:26,799][25689] Avg episode reward: [(0, '-28.118')] [2022-07-10 03:22:28,303][26022] Updated weights on worker 0-0, policy_version 545337 (0.00096) [2022-07-10 03:22:30,078][26022] Updated weights on worker 0-0, policy_version 545347 (0.00090) [2022-07-10 03:22:31,915][25689] Fps is (10 sec: 5665.0, 60 sec: 5618.9, 300 sec: 5656.7). Total num frames: 558444544. Throughput: 0: 5960.3. Samples: 558446444. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:31,916][25689] Avg episode reward: [(0, '-26.447')] [2022-07-10 03:22:32,158][26022] Updated weights on worker 0-0, policy_version 545357 (0.00097) [2022-07-10 03:22:33,565][26022] Updated weights on worker 0-0, policy_version 545367 (0.00086) [2022-07-10 03:22:35,668][26022] Updated weights on worker 0-0, policy_version 545377 (0.00089) [2022-07-10 03:22:36,923][25689] Fps is (10 sec: 5866.2, 60 sec: 5677.4, 300 sec: 5664.0). Total num frames: 558475264. Throughput: 0: 5967.8. Samples: 558480854. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:36,923][25689] Avg episode reward: [(0, '-28.195')] [2022-07-10 03:22:37,111][26022] Updated weights on worker 0-0, policy_version 545387 (0.00099) [2022-07-10 03:22:39,285][26022] Updated weights on worker 0-0, policy_version 545397 (0.00092) [2022-07-10 03:22:40,775][26022] Updated weights on worker 0-0, policy_version 545407 (0.00081) [2022-07-10 03:22:41,932][25689] Fps is (10 sec: 5724.9, 60 sec: 5662.4, 300 sec: 5657.7). Total num frames: 558501888. Throughput: 0: 5093.1. Samples: 558497748. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:41,933][25689] Avg episode reward: [(0, '-28.760')] [2022-07-10 03:22:42,833][26022] Updated weights on worker 0-0, policy_version 545417 (0.00097) [2022-07-10 03:22:44,343][26022] Updated weights on worker 0-0, policy_version 545427 (0.00084) [2022-07-10 03:22:46,394][26022] Updated weights on worker 0-0, policy_version 545437 (0.00088) [2022-07-10 03:22:46,947][25689] Fps is (10 sec: 5516.6, 60 sec: 5632.9, 300 sec: 5659.1). Total num frames: 558530560. Throughput: 0: 5904.9. Samples: 558531694. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:46,947][25689] Avg episode reward: [(0, '-29.735')] [2022-07-10 03:22:48,187][26022] Updated weights on worker 0-0, policy_version 545447 (0.00083) [2022-07-10 03:22:50,005][26022] Updated weights on worker 0-0, policy_version 545457 (0.00086) [2022-07-10 03:22:51,863][26022] Updated weights on worker 0-0, policy_version 545467 (0.00089) [2022-07-10 03:22:52,007][25689] Fps is (10 sec: 5691.7, 60 sec: 5665.3, 300 sec: 5651.5). Total num frames: 558559232. Throughput: 0: 5947.7. Samples: 558565942. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:52,008][25689] Avg episode reward: [(0, '-29.866')] [2022-07-10 03:22:53,357][26022] Updated weights on worker 0-0, policy_version 545477 (0.00090) [2022-07-10 03:22:55,523][26022] Updated weights on worker 0-0, policy_version 545487 (0.00085) [2022-07-10 03:22:57,020][25689] Fps is (10 sec: 5692.6, 60 sec: 5630.7, 300 sec: 5658.9). Total num frames: 558587904. Throughput: 0: 5096.9. Samples: 558583286. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:22:57,020][25689] Avg episode reward: [(0, '-30.925')] [2022-07-10 03:22:57,081][26022] Updated weights on worker 0-0, policy_version 545497 (0.00087) [2022-07-10 03:22:58,863][26022] Updated weights on worker 0-0, policy_version 545507 (0.00096) [2022-07-10 03:23:00,834][26022] Updated weights on worker 0-0, policy_version 545517 (0.00082) [2022-07-10 03:23:02,064][25689] Fps is (10 sec: 5600.2, 60 sec: 5646.9, 300 sec: 5658.4). Total num frames: 558615552. Throughput: 0: 5948.6. Samples: 558617504. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:23:02,064][25689] Avg episode reward: [(0, '-31.878')] [2022-07-10 03:23:02,999][26022] Updated weights on worker 0-0, policy_version 545527 (0.00081) [2022-07-10 03:23:04,559][26022] Updated weights on worker 0-0, policy_version 545537 (0.00084) [2022-07-10 03:23:06,697][26022] Updated weights on worker 0-0, policy_version 545547 (0.00085) [2022-07-10 03:23:07,080][25689] Fps is (10 sec: 5394.9, 60 sec: 5664.3, 300 sec: 5652.2). Total num frames: 558642176. Throughput: 0: 5844.5. Samples: 558649364. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:23:07,080][25689] Avg episode reward: [(0, '-31.442')] [2022-07-10 03:23:08,259][26022] Updated weights on worker 0-0, policy_version 545557 (0.00086) [2022-07-10 03:23:10,208][26022] Updated weights on worker 0-0, policy_version 545567 (0.00086) [2022-07-10 03:23:11,891][26022] Updated weights on worker 0-0, policy_version 545577 (0.00084) [2022-07-10 03:23:12,130][25689] Fps is (10 sec: 5493.4, 60 sec: 5633.4, 300 sec: 5648.8). Total num frames: 558670848. Throughput: 0: 4996.7. Samples: 558666490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:23:12,130][25689] Avg episode reward: [(0, '-30.933')] [2022-07-10 03:23:13,671][26022] Updated weights on worker 0-0, policy_version 545587 (0.00091) [2022-07-10 03:23:15,415][26022] Updated weights on worker 0-0, policy_version 545597 (0.00087) [2022-07-10 03:23:17,152][25689] Fps is (10 sec: 5795.0, 60 sec: 5648.7, 300 sec: 5655.9). Total num frames: 558700544. Throughput: 0: 5840.7. Samples: 558700872. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:23:17,154][25689] Avg episode reward: [(0, '-30.618')] [2022-07-10 03:23:17,293][26022] Updated weights on worker 0-0, policy_version 545607 (0.00084) [2022-07-10 03:23:18,885][26022] Updated weights on worker 0-0, policy_version 545617 (0.00085) [2022-07-10 03:23:20,918][26022] Updated weights on worker 0-0, policy_version 545627 (0.00082) [2022-07-10 03:23:22,173][25689] Fps is (10 sec: 5811.7, 60 sec: 5664.4, 300 sec: 5655.6). Total num frames: 558729216. Throughput: 0: 5861.1. Samples: 558735366. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:23:22,174][25689] Avg episode reward: [(0, '-30.690')] [2022-07-10 03:23:22,633][26022] Updated weights on worker 0-0, policy_version 545637 (0.00086) [2022-07-10 03:23:24,331][26022] Updated weights on worker 0-0, policy_version 545647 (0.00089) [2022-07-10 03:23:26,511][26022] Updated weights on worker 0-0, policy_version 545657 (0.00092) [2022-07-10 03:23:27,190][25689] Fps is (10 sec: 5712.8, 60 sec: 5663.3, 300 sec: 5653.2). Total num frames: 558757888. Throughput: 0: 5135.0. Samples: 558752628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 03:23:27,190][25689] Avg episode reward: [(0, '-28.836')] [2022-07-10 03:23:27,759][26022] Updated weights on worker 0-0, policy_version 545667 (0.00091) [2022-07-10 03:23:29,907][26022] Updated weights on worker 0-0, policy_version 545677 (0.00092) [2022-07-10 03:23:31,741][26022] Updated weights on worker 0-0, policy_version 545687 (0.00087) [2022-07-10 03:23:32,307][25689] Fps is (10 sec: 5557.6, 60 sec: 5646.4, 300 sec: 5647.7). Total num frames: 558785536. Throughput: 0: 5964.7. Samples: 558786840. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:23:32,308][25689] Avg episode reward: [(0, '-28.431')] [2022-07-10 03:23:33,343][26022] Updated weights on worker 0-0, policy_version 545697 (0.00086) [2022-07-10 03:23:35,305][26022] Updated weights on worker 0-0, policy_version 545707 (0.00088) [2022-07-10 03:23:36,874][26022] Updated weights on worker 0-0, policy_version 545717 (0.00084) [2022-07-10 03:23:37,335][25689] Fps is (10 sec: 5652.6, 60 sec: 5627.5, 300 sec: 5654.3). Total num frames: 558815232. Throughput: 0: 5958.1. Samples: 558821122. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:23:37,335][25689] Avg episode reward: [(0, '-28.538')] [2022-07-10 03:23:38,703][26022] Updated weights on worker 0-0, policy_version 545727 (0.00093) [2022-07-10 03:23:40,739][26022] Updated weights on worker 0-0, policy_version 545737 (0.00091) [2022-07-10 03:23:42,107][26022] Updated weights on worker 0-0, policy_version 545747 (0.00088) [2022-07-10 03:23:42,352][25689] Fps is (10 sec: 5912.5, 60 sec: 5677.6, 300 sec: 5657.4). Total num frames: 558844928. Throughput: 0: 5113.6. Samples: 558838554. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:23:42,353][25689] Avg episode reward: [(0, '-29.139')] [2022-07-10 03:23:44,211][26022] Updated weights on worker 0-0, policy_version 545757 (0.00085) [2022-07-10 03:23:45,871][26022] Updated weights on worker 0-0, policy_version 545767 (0.00088) [2022-07-10 03:23:47,414][25689] Fps is (10 sec: 5790.7, 60 sec: 5673.1, 300 sec: 5654.0). Total num frames: 558873600. Throughput: 0: 5964.6. Samples: 558873258. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:23:47,415][25689] Avg episode reward: [(0, '-29.980')] [2022-07-10 03:23:47,623][26022] Updated weights on worker 0-0, policy_version 545777 (0.00084) [2022-07-10 03:23:49,467][26022] Updated weights on worker 0-0, policy_version 545787 (0.00619) [2022-07-10 03:23:51,035][26022] Updated weights on worker 0-0, policy_version 545797 (0.00101) [2022-07-10 03:23:52,516][25689] Fps is (10 sec: 5642.0, 60 sec: 5669.2, 300 sec: 5656.6). Total num frames: 558902272. Throughput: 0: 5968.6. Samples: 558907460. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:23:52,517][25689] Avg episode reward: [(0, '-29.597')] [2022-07-10 03:23:53,122][26022] Updated weights on worker 0-0, policy_version 545807 (0.00093) [2022-07-10 03:23:54,795][26022] Updated weights on worker 0-0, policy_version 545817 (0.00080) [2022-07-10 03:23:55,578][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:23:55,594][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000545821_558920704.pth [2022-07-10 03:23:55,595][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000543828_556879872.pth [2022-07-10 03:23:56,742][26022] Updated weights on worker 0-0, policy_version 545827 (0.00094) [2022-07-10 03:23:57,539][25689] Fps is (10 sec: 5764.9, 60 sec: 5685.2, 300 sec: 5659.9). Total num frames: 558931968. Throughput: 0: 5128.6. Samples: 558924742. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:23:57,539][25689] Avg episode reward: [(0, '-30.013')] [2022-07-10 03:23:58,235][26022] Updated weights on worker 0-0, policy_version 545837 (0.00098) [2022-07-10 03:24:00,217][26022] Updated weights on worker 0-0, policy_version 545847 (0.00094) [2022-07-10 03:24:02,044][26022] Updated weights on worker 0-0, policy_version 545857 (0.00094) [2022-07-10 03:24:02,562][25689] Fps is (10 sec: 5504.2, 60 sec: 5653.3, 300 sec: 5656.2). Total num frames: 558957568. Throughput: 0: 5959.3. Samples: 558958992. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:02,563][25689] Avg episode reward: [(0, '-30.180')] [2022-07-10 03:24:04,193][26022] Updated weights on worker 0-0, policy_version 545867 (0.00085) [2022-07-10 03:24:06,181][26022] Updated weights on worker 0-0, policy_version 545877 (0.00090) [2022-07-10 03:24:07,583][25689] Fps is (10 sec: 5505.2, 60 sec: 5703.6, 300 sec: 5664.9). Total num frames: 558987264. Throughput: 0: 5851.9. Samples: 558991284. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:07,584][25689] Avg episode reward: [(0, '-30.177')] [2022-07-10 03:24:07,674][26022] Updated weights on worker 0-0, policy_version 545887 (0.00085) [2022-07-10 03:24:09,677][26022] Updated weights on worker 0-0, policy_version 545897 (0.00425) [2022-07-10 03:24:11,400][26022] Updated weights on worker 0-0, policy_version 545907 (0.00093) [2022-07-10 03:24:12,686][25689] Fps is (10 sec: 5664.3, 60 sec: 5681.7, 300 sec: 5656.1). Total num frames: 559014912. Throughput: 0: 5867.5. Samples: 559025806. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:12,686][25689] Avg episode reward: [(0, '-29.833')] [2022-07-10 03:24:13,075][26022] Updated weights on worker 0-0, policy_version 545917 (0.00367) [2022-07-10 03:24:15,045][26022] Updated weights on worker 0-0, policy_version 545927 (0.00082) [2022-07-10 03:24:16,539][26022] Updated weights on worker 0-0, policy_version 545937 (0.00084) [2022-07-10 03:24:17,716][25689] Fps is (10 sec: 5659.3, 60 sec: 5681.0, 300 sec: 5659.6). Total num frames: 559044608. Throughput: 0: 5872.9. Samples: 559043240. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:17,716][25689] Avg episode reward: [(0, '-30.024')] [2022-07-10 03:24:18,704][26022] Updated weights on worker 0-0, policy_version 545947 (0.00095) [2022-07-10 03:24:20,233][26022] Updated weights on worker 0-0, policy_version 545957 (0.00088) [2022-07-10 03:24:22,197][26022] Updated weights on worker 0-0, policy_version 545967 (0.00084) [2022-07-10 03:24:22,743][25689] Fps is (10 sec: 5905.3, 60 sec: 5697.3, 300 sec: 5662.7). Total num frames: 559074304. Throughput: 0: 5884.0. Samples: 559077738. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:22,744][25689] Avg episode reward: [(0, '-29.155')] [2022-07-10 03:24:23,842][26022] Updated weights on worker 0-0, policy_version 545977 (0.00086) [2022-07-10 03:24:25,638][26022] Updated weights on worker 0-0, policy_version 545987 (0.00090) [2022-07-10 03:24:27,437][26022] Updated weights on worker 0-0, policy_version 545997 (0.00091) [2022-07-10 03:24:27,827][25689] Fps is (10 sec: 5671.1, 60 sec: 5674.1, 300 sec: 5660.0). Total num frames: 559101952. Throughput: 0: 5970.3. Samples: 559112148. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:27,828][25689] Avg episode reward: [(0, '-28.812')] [2022-07-10 03:24:29,202][26022] Updated weights on worker 0-0, policy_version 546007 (0.00093) [2022-07-10 03:24:31,266][26022] Updated weights on worker 0-0, policy_version 546017 (0.00085) [2022-07-10 03:24:32,743][26022] Updated weights on worker 0-0, policy_version 546027 (0.00089) [2022-07-10 03:24:32,928][25689] Fps is (10 sec: 5730.9, 60 sec: 5726.3, 300 sec: 5666.1). Total num frames: 559132672. Throughput: 0: 5105.3. Samples: 559129150. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:32,929][25689] Avg episode reward: [(0, '-29.051')] [2022-07-10 03:24:34,702][26022] Updated weights on worker 0-0, policy_version 546037 (0.00088) [2022-07-10 03:24:36,355][26022] Updated weights on worker 0-0, policy_version 546047 (0.00084) [2022-07-10 03:24:37,974][25689] Fps is (10 sec: 5853.3, 60 sec: 5707.7, 300 sec: 5662.0). Total num frames: 559161344. Throughput: 0: 5932.6. Samples: 559163426. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:37,975][25689] Avg episode reward: [(0, '-28.109')] [2022-07-10 03:24:38,042][26022] Updated weights on worker 0-0, policy_version 546057 (0.00085) [2022-07-10 03:24:39,992][26022] Updated weights on worker 0-0, policy_version 546067 (0.00110) [2022-07-10 03:24:41,599][26022] Updated weights on worker 0-0, policy_version 546077 (0.00091) [2022-07-10 03:24:42,991][25689] Fps is (10 sec: 5597.1, 60 sec: 5674.0, 300 sec: 5662.0). Total num frames: 559188992. Throughput: 0: 5930.7. Samples: 559197818. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:42,991][25689] Avg episode reward: [(0, '-27.083')] [2022-07-10 03:24:43,515][26022] Updated weights on worker 0-0, policy_version 546087 (0.00082) [2022-07-10 03:24:45,444][26022] Updated weights on worker 0-0, policy_version 546097 (0.00086) [2022-07-10 03:24:47,000][26022] Updated weights on worker 0-0, policy_version 546107 (0.00084) [2022-07-10 03:24:47,999][25689] Fps is (10 sec: 5720.4, 60 sec: 5696.0, 300 sec: 5660.3). Total num frames: 559218688. Throughput: 0: 5102.9. Samples: 559215080. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:47,999][25689] Avg episode reward: [(0, '-27.574')] [2022-07-10 03:24:49,025][26022] Updated weights on worker 0-0, policy_version 546117 (0.00090) [2022-07-10 03:24:50,542][26022] Updated weights on worker 0-0, policy_version 546127 (0.00089) [2022-07-10 03:24:52,495][26022] Updated weights on worker 0-0, policy_version 546137 (0.00089) [2022-07-10 03:24:53,062][25689] Fps is (10 sec: 5795.4, 60 sec: 5699.6, 300 sec: 5666.1). Total num frames: 559247360. Throughput: 0: 5988.3. Samples: 559249718. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:53,063][25689] Avg episode reward: [(0, '-26.709')] [2022-07-10 03:24:54,090][26022] Updated weights on worker 0-0, policy_version 546147 (0.00100) [2022-07-10 03:24:56,008][26022] Updated weights on worker 0-0, policy_version 546157 (0.00092) [2022-07-10 03:24:57,871][26022] Updated weights on worker 0-0, policy_version 546167 (0.00088) [2022-07-10 03:24:58,080][25689] Fps is (10 sec: 5688.3, 60 sec: 5683.2, 300 sec: 5662.5). Total num frames: 559276032. Throughput: 0: 5991.9. Samples: 559283896. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:24:58,083][25689] Avg episode reward: [(0, '-25.656')] [2022-07-10 03:24:59,657][26022] Updated weights on worker 0-0, policy_version 546177 (0.00087) [2022-07-10 03:25:01,444][26022] Updated weights on worker 0-0, policy_version 546187 (0.00093) [2022-07-10 03:25:03,099][25689] Fps is (10 sec: 5407.4, 60 sec: 5683.6, 300 sec: 5659.1). Total num frames: 559301632. Throughput: 0: 5133.7. Samples: 559301046. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:03,099][25689] Avg episode reward: [(0, '-26.518')] [2022-07-10 03:25:03,626][26022] Updated weights on worker 0-0, policy_version 546197 (0.00089) [2022-07-10 03:25:05,410][26022] Updated weights on worker 0-0, policy_version 546207 (0.00053) [2022-07-10 03:25:07,288][26022] Updated weights on worker 0-0, policy_version 546217 (0.00721) [2022-07-10 03:25:08,139][25689] Fps is (10 sec: 5395.0, 60 sec: 5664.8, 300 sec: 5664.0). Total num frames: 559330304. Throughput: 0: 5841.0. Samples: 559332722. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:08,140][25689] Avg episode reward: [(0, '-27.205')] [2022-07-10 03:25:09,019][26022] Updated weights on worker 0-0, policy_version 546227 (0.00086) [2022-07-10 03:25:10,929][26022] Updated weights on worker 0-0, policy_version 546237 (0.00095) [2022-07-10 03:25:12,601][26022] Updated weights on worker 0-0, policy_version 546247 (0.00077) [2022-07-10 03:25:13,233][25689] Fps is (10 sec: 5860.4, 60 sec: 5716.4, 300 sec: 5664.1). Total num frames: 559361024. Throughput: 0: 5819.9. Samples: 559367112. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:13,234][25689] Avg episode reward: [(0, '-27.849')] [2022-07-10 03:25:14,543][26022] Updated weights on worker 0-0, policy_version 546257 (0.00087) [2022-07-10 03:25:16,024][26022] Updated weights on worker 0-0, policy_version 546267 (0.00091) [2022-07-10 03:25:18,067][26022] Updated weights on worker 0-0, policy_version 546277 (0.00089) [2022-07-10 03:25:18,251][25689] Fps is (10 sec: 5671.4, 60 sec: 5666.8, 300 sec: 5660.6). Total num frames: 559387648. Throughput: 0: 4984.8. Samples: 559384444. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:18,251][25689] Avg episode reward: [(0, '-28.923')] [2022-07-10 03:25:19,799][26022] Updated weights on worker 0-0, policy_version 546287 (0.00083) [2022-07-10 03:25:21,505][26022] Updated weights on worker 0-0, policy_version 546297 (0.00090) [2022-07-10 03:25:23,295][25689] Fps is (10 sec: 5597.6, 60 sec: 5665.3, 300 sec: 5663.3). Total num frames: 559417344. Throughput: 0: 5822.2. Samples: 559418630. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:23,295][25689] Avg episode reward: [(0, '-29.762')] [2022-07-10 03:25:23,447][26022] Updated weights on worker 0-0, policy_version 546307 (0.00091) [2022-07-10 03:25:25,121][26022] Updated weights on worker 0-0, policy_version 546317 (0.00080) [2022-07-10 03:25:27,145][26022] Updated weights on worker 0-0, policy_version 546327 (0.00092) [2022-07-10 03:25:28,315][25689] Fps is (10 sec: 5799.5, 60 sec: 5688.2, 300 sec: 5664.7). Total num frames: 559446016. Throughput: 0: 5947.4. Samples: 559452714. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:28,315][25689] Avg episode reward: [(0, '-29.892')] [2022-07-10 03:25:28,775][26022] Updated weights on worker 0-0, policy_version 546337 (0.00085) [2022-07-10 03:25:30,466][26022] Updated weights on worker 0-0, policy_version 546347 (0.00086) [2022-07-10 03:25:32,492][26022] Updated weights on worker 0-0, policy_version 546357 (0.00092) [2022-07-10 03:25:33,435][25689] Fps is (10 sec: 5655.2, 60 sec: 5652.5, 300 sec: 5662.8). Total num frames: 559474688. Throughput: 0: 5088.0. Samples: 559469898. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:33,435][25689] Avg episode reward: [(0, '-29.268')] [2022-07-10 03:25:34,324][26022] Updated weights on worker 0-0, policy_version 546367 (0.00086) [2022-07-10 03:25:35,977][26022] Updated weights on worker 0-0, policy_version 546377 (0.00625) [2022-07-10 03:25:37,785][26022] Updated weights on worker 0-0, policy_version 546387 (0.00086) [2022-07-10 03:25:38,469][25689] Fps is (10 sec: 5546.4, 60 sec: 5636.7, 300 sec: 5660.4). Total num frames: 559502336. Throughput: 0: 5919.2. Samples: 559504122. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:38,470][25689] Avg episode reward: [(0, '-28.757')] [2022-07-10 03:25:39,570][26022] Updated weights on worker 0-0, policy_version 546397 (0.00078) [2022-07-10 03:25:41,460][26022] Updated weights on worker 0-0, policy_version 546407 (0.00090) [2022-07-10 03:25:43,107][26022] Updated weights on worker 0-0, policy_version 546417 (0.00081) [2022-07-10 03:25:43,476][25689] Fps is (10 sec: 5710.7, 60 sec: 5671.4, 300 sec: 5660.6). Total num frames: 559532032. Throughput: 0: 5922.6. Samples: 559538158. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:43,477][25689] Avg episode reward: [(0, '-29.719')] [2022-07-10 03:25:44,954][26022] Updated weights on worker 0-0, policy_version 546427 (0.00095) [2022-07-10 03:25:46,929][26022] Updated weights on worker 0-0, policy_version 546437 (0.00100) [2022-07-10 03:25:48,489][25689] Fps is (10 sec: 5825.0, 60 sec: 5654.0, 300 sec: 5665.2). Total num frames: 559560704. Throughput: 0: 5080.5. Samples: 559555210. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:48,490][25689] Avg episode reward: [(0, '-29.289')] [2022-07-10 03:25:48,635][26022] Updated weights on worker 0-0, policy_version 546447 (0.00083) [2022-07-10 03:25:50,524][26022] Updated weights on worker 0-0, policy_version 546457 (0.00084) [2022-07-10 03:25:52,363][26022] Updated weights on worker 0-0, policy_version 546467 (0.00539) [2022-07-10 03:25:53,581][25689] Fps is (10 sec: 5674.9, 60 sec: 5651.4, 300 sec: 5663.7). Total num frames: 559589376. Throughput: 0: 5933.6. Samples: 559589438. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:53,582][25689] Avg episode reward: [(0, '-30.014')] [2022-07-10 03:25:54,024][26022] Updated weights on worker 0-0, policy_version 546477 (0.00087) [2022-07-10 03:25:55,748][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:25:55,758][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000546486_559601664.pth [2022-07-10 03:25:55,760][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000544493_557560832.pth [2022-07-10 03:25:55,991][26022] Updated weights on worker 0-0, policy_version 546487 (0.00084) [2022-07-10 03:25:57,403][26022] Updated weights on worker 0-0, policy_version 546497 (0.00092) [2022-07-10 03:25:58,678][25689] Fps is (10 sec: 5628.5, 60 sec: 5644.0, 300 sec: 5665.4). Total num frames: 559618048. Throughput: 0: 5932.5. Samples: 559624008. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:25:58,678][25689] Avg episode reward: [(0, '-30.304')] [2022-07-10 03:25:59,355][26022] Updated weights on worker 0-0, policy_version 546507 (0.00350) [2022-07-10 03:26:00,872][26022] Updated weights on worker 0-0, policy_version 546517 (0.00089) [2022-07-10 03:26:03,253][26022] Updated weights on worker 0-0, policy_version 546527 (0.00085) [2022-07-10 03:26:03,724][25689] Fps is (10 sec: 5552.9, 60 sec: 5675.3, 300 sec: 5661.4). Total num frames: 559645696. Throughput: 0: 5090.4. Samples: 559641226. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:26:03,724][25689] Avg episode reward: [(0, '-30.628')] [2022-07-10 03:26:05,259][26022] Updated weights on worker 0-0, policy_version 546537 (0.00090) [2022-07-10 03:26:06,940][26022] Updated weights on worker 0-0, policy_version 546547 (0.00091) [2022-07-10 03:26:08,552][26022] Updated weights on worker 0-0, policy_version 546557 (0.00092) [2022-07-10 03:26:08,739][25689] Fps is (10 sec: 5699.3, 60 sec: 5694.6, 300 sec: 5670.4). Total num frames: 559675392. Throughput: 0: 5844.0. Samples: 559673548. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:26:08,740][25689] Avg episode reward: [(0, '-30.093')] [2022-07-10 03:26:10,746][26022] Updated weights on worker 0-0, policy_version 546567 (0.00090) [2022-07-10 03:26:12,123][26022] Updated weights on worker 0-0, policy_version 546577 (0.00093) [2022-07-10 03:26:13,867][25689] Fps is (10 sec: 5552.3, 60 sec: 5623.8, 300 sec: 5662.6). Total num frames: 559702016. Throughput: 0: 5829.5. Samples: 559707696. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:26:13,868][25689] Avg episode reward: [(0, '-28.355')] [2022-07-10 03:26:14,331][26022] Updated weights on worker 0-0, policy_version 546587 (0.00086) [2022-07-10 03:26:15,647][26022] Updated weights on worker 0-0, policy_version 546597 (0.00086) [2022-07-10 03:26:17,697][26022] Updated weights on worker 0-0, policy_version 546607 (0.00085) [2022-07-10 03:26:18,949][25689] Fps is (10 sec: 5716.7, 60 sec: 5702.2, 300 sec: 5662.3). Total num frames: 559733760. Throughput: 0: 4981.4. Samples: 559724988. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:26:18,950][25689] Avg episode reward: [(0, '-27.173')] [2022-07-10 03:26:19,340][26022] Updated weights on worker 0-0, policy_version 546617 (0.00089) [2022-07-10 03:26:21,224][26022] Updated weights on worker 0-0, policy_version 546627 (0.00094) [2022-07-10 03:26:22,844][26022] Updated weights on worker 0-0, policy_version 546637 (0.00087) [2022-07-10 03:26:23,982][25689] Fps is (10 sec: 5770.8, 60 sec: 5652.6, 300 sec: 5666.5). Total num frames: 559760384. Throughput: 0: 5847.0. Samples: 559759674. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:26:23,982][25689] Avg episode reward: [(0, '-26.935')] [2022-07-10 03:26:24,732][26022] Updated weights on worker 0-0, policy_version 546647 (0.00090) [2022-07-10 03:26:26,530][26022] Updated weights on worker 0-0, policy_version 546657 (0.00096) [2022-07-10 03:26:28,451][26022] Updated weights on worker 0-0, policy_version 546667 (0.00092) [2022-07-10 03:26:29,023][25689] Fps is (10 sec: 5590.6, 60 sec: 5667.5, 300 sec: 5663.4). Total num frames: 559790080. Throughput: 0: 5929.6. Samples: 559793824. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 03:26:29,024][25689] Avg episode reward: [(0, '-25.745')] [2022-07-10 03:26:30,178][26022] Updated weights on worker 0-0, policy_version 546677 (0.00083) [2022-07-10 03:26:31,946][26022] Updated weights on worker 0-0, policy_version 546687 (0.00083) [2022-07-10 03:26:33,953][26022] Updated weights on worker 0-0, policy_version 546697 (0.00085) [2022-07-10 03:26:34,097][25689] Fps is (10 sec: 5669.1, 60 sec: 5654.9, 300 sec: 5663.7). Total num frames: 559817728. Throughput: 0: 5921.9. Samples: 559827494. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:26:34,097][25689] Avg episode reward: [(0, '-26.281')] [2022-07-10 03:26:35,683][26022] Updated weights on worker 0-0, policy_version 546707 (0.00096) [2022-07-10 03:26:37,540][26022] Updated weights on worker 0-0, policy_version 546717 (0.00090) [2022-07-10 03:26:39,109][25689] Fps is (10 sec: 5584.1, 60 sec: 5673.9, 300 sec: 5667.5). Total num frames: 559846400. Throughput: 0: 5919.3. Samples: 559844320. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:26:39,109][25689] Avg episode reward: [(0, '-27.048')] [2022-07-10 03:26:39,359][26022] Updated weights on worker 0-0, policy_version 546727 (0.00090) [2022-07-10 03:26:41,031][26022] Updated weights on worker 0-0, policy_version 546737 (0.00091) [2022-07-10 03:26:43,224][26022] Updated weights on worker 0-0, policy_version 546747 (0.01132) [2022-07-10 03:26:44,195][25689] Fps is (10 sec: 5779.8, 60 sec: 5666.5, 300 sec: 5663.6). Total num frames: 559876096. Throughput: 0: 5878.5. Samples: 559878500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:26:44,196][25689] Avg episode reward: [(0, '-28.943')] [2022-07-10 03:26:44,531][26022] Updated weights on worker 0-0, policy_version 546757 (0.00080) [2022-07-10 03:26:46,620][26022] Updated weights on worker 0-0, policy_version 546767 (0.00086) [2022-07-10 03:26:48,273][26022] Updated weights on worker 0-0, policy_version 546777 (0.00091) [2022-07-10 03:26:49,293][25689] Fps is (10 sec: 5630.7, 60 sec: 5641.8, 300 sec: 5666.0). Total num frames: 559903744. Throughput: 0: 5862.5. Samples: 559912656. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:26:49,294][25689] Avg episode reward: [(0, '-30.014')] [2022-07-10 03:26:50,152][26022] Updated weights on worker 0-0, policy_version 546787 (0.00108) [2022-07-10 03:26:52,015][26022] Updated weights on worker 0-0, policy_version 546797 (0.00084) [2022-07-10 03:26:53,705][26022] Updated weights on worker 0-0, policy_version 546807 (0.00091) [2022-07-10 03:26:54,336][25689] Fps is (10 sec: 5654.8, 60 sec: 5663.1, 300 sec: 5661.9). Total num frames: 559933440. Throughput: 0: 5062.9. Samples: 559929964. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:26:54,337][25689] Avg episode reward: [(0, '-28.601')] [2022-07-10 03:26:55,555][26022] Updated weights on worker 0-0, policy_version 546817 (0.00083) [2022-07-10 03:26:57,477][26022] Updated weights on worker 0-0, policy_version 546827 (0.00084) [2022-07-10 03:26:58,818][26022] Updated weights on worker 0-0, policy_version 546837 (0.00099) [2022-07-10 03:26:59,341][25689] Fps is (10 sec: 5910.9, 60 sec: 5688.6, 300 sec: 5672.8). Total num frames: 559963136. Throughput: 0: 5932.2. Samples: 559964342. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:26:59,342][25689] Avg episode reward: [(0, '-28.755')] [2022-07-10 03:27:00,935][26022] Updated weights on worker 0-0, policy_version 546847 (0.00098) [2022-07-10 03:27:02,746][26022] Updated weights on worker 0-0, policy_version 546857 (0.00084) [2022-07-10 03:27:04,350][25689] Fps is (10 sec: 5419.8, 60 sec: 5641.4, 300 sec: 5669.6). Total num frames: 559987712. Throughput: 0: 5862.1. Samples: 559996646. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:04,350][25689] Avg episode reward: [(0, '-29.130')] [2022-07-10 03:27:04,948][26022] Updated weights on worker 0-0, policy_version 546867 (0.00090) [2022-07-10 03:27:06,412][26022] Updated weights on worker 0-0, policy_version 546877 (0.00085) [2022-07-10 03:27:08,582][26022] Updated weights on worker 0-0, policy_version 546887 (0.00086) [2022-07-10 03:27:09,379][25689] Fps is (10 sec: 5406.8, 60 sec: 5640.1, 300 sec: 5667.1). Total num frames: 560017408. Throughput: 0: 5038.6. Samples: 560013860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:09,381][25689] Avg episode reward: [(0, '-28.297')] [2022-07-10 03:27:10,182][26022] Updated weights on worker 0-0, policy_version 546897 (0.00092) [2022-07-10 03:27:12,122][26022] Updated weights on worker 0-0, policy_version 546907 (0.00088) [2022-07-10 03:27:13,765][26022] Updated weights on worker 0-0, policy_version 546917 (0.00089) [2022-07-10 03:27:14,437][25689] Fps is (10 sec: 5786.2, 60 sec: 5680.4, 300 sec: 5666.1). Total num frames: 560046080. Throughput: 0: 5867.1. Samples: 560047900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:14,438][25689] Avg episode reward: [(0, '-27.812')] [2022-07-10 03:27:15,754][26022] Updated weights on worker 0-0, policy_version 546927 (0.00493) [2022-07-10 03:27:17,387][26022] Updated weights on worker 0-0, policy_version 546937 (0.00086) [2022-07-10 03:27:19,351][26022] Updated weights on worker 0-0, policy_version 546947 (0.00085) [2022-07-10 03:27:19,451][25689] Fps is (10 sec: 5591.9, 60 sec: 5619.2, 300 sec: 5666.0). Total num frames: 560073728. Throughput: 0: 5836.4. Samples: 560081710. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:19,451][25689] Avg episode reward: [(0, '-26.252')] [2022-07-10 03:27:20,982][26022] Updated weights on worker 0-0, policy_version 546957 (0.00089) [2022-07-10 03:27:22,926][26022] Updated weights on worker 0-0, policy_version 546967 (0.00081) [2022-07-10 03:27:24,467][25689] Fps is (10 sec: 5615.2, 60 sec: 5654.5, 300 sec: 5665.8). Total num frames: 560102400. Throughput: 0: 5075.1. Samples: 560098744. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:24,468][25689] Avg episode reward: [(0, '-26.622')] [2022-07-10 03:27:24,705][26022] Updated weights on worker 0-0, policy_version 546977 (0.00086) [2022-07-10 03:27:26,454][26022] Updated weights on worker 0-0, policy_version 546987 (0.00086) [2022-07-10 03:27:28,172][26022] Updated weights on worker 0-0, policy_version 546997 (0.00095) [2022-07-10 03:27:29,479][25689] Fps is (10 sec: 5820.3, 60 sec: 5657.3, 300 sec: 5671.2). Total num frames: 560132096. Throughput: 0: 5938.8. Samples: 560133230. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:29,480][25689] Avg episode reward: [(0, '-26.430')] [2022-07-10 03:27:30,273][26022] Updated weights on worker 0-0, policy_version 547007 (0.00086) [2022-07-10 03:27:31,803][26022] Updated weights on worker 0-0, policy_version 547017 (0.00093) [2022-07-10 03:27:33,523][26022] Updated weights on worker 0-0, policy_version 547027 (0.00087) [2022-07-10 03:27:34,547][25689] Fps is (10 sec: 5587.5, 60 sec: 5640.9, 300 sec: 5656.3). Total num frames: 560158720. Throughput: 0: 5934.9. Samples: 560167248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:34,547][25689] Avg episode reward: [(0, '-26.014')] [2022-07-10 03:27:35,416][26022] Updated weights on worker 0-0, policy_version 547037 (0.00089) [2022-07-10 03:27:37,207][26022] Updated weights on worker 0-0, policy_version 547047 (0.00101) [2022-07-10 03:27:39,146][26022] Updated weights on worker 0-0, policy_version 547057 (0.00092) [2022-07-10 03:27:39,553][25689] Fps is (10 sec: 5590.8, 60 sec: 5658.4, 300 sec: 5666.7). Total num frames: 560188416. Throughput: 0: 5109.9. Samples: 560184430. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:39,553][25689] Avg episode reward: [(0, '-25.961')] [2022-07-10 03:27:40,766][26022] Updated weights on worker 0-0, policy_version 547067 (0.00085) [2022-07-10 03:27:42,751][26022] Updated weights on worker 0-0, policy_version 547077 (0.00094) [2022-07-10 03:27:44,553][26022] Updated weights on worker 0-0, policy_version 547087 (0.00093) [2022-07-10 03:27:44,555][25689] Fps is (10 sec: 5729.8, 60 sec: 5632.4, 300 sec: 5663.5). Total num frames: 560216064. Throughput: 0: 5980.3. Samples: 560218874. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:44,555][25689] Avg episode reward: [(0, '-26.514')] [2022-07-10 03:27:46,139][26022] Updated weights on worker 0-0, policy_version 547097 (0.00091) [2022-07-10 03:27:48,268][26022] Updated weights on worker 0-0, policy_version 547107 (0.00086) [2022-07-10 03:27:49,576][25689] Fps is (10 sec: 5823.3, 60 sec: 5690.5, 300 sec: 5671.1). Total num frames: 560246784. Throughput: 0: 5978.8. Samples: 560253386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:49,576][25689] Avg episode reward: [(0, '-27.601')] [2022-07-10 03:27:49,702][26022] Updated weights on worker 0-0, policy_version 547117 (0.00088) [2022-07-10 03:27:51,596][26022] Updated weights on worker 0-0, policy_version 547127 (0.00088) [2022-07-10 03:27:53,230][26022] Updated weights on worker 0-0, policy_version 547137 (0.00090) [2022-07-10 03:27:54,632][25689] Fps is (10 sec: 5893.9, 60 sec: 5672.3, 300 sec: 5670.3). Total num frames: 560275456. Throughput: 0: 5157.6. Samples: 560270838. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:54,632][25689] Avg episode reward: [(0, '-27.871')] [2022-07-10 03:27:55,160][26022] Updated weights on worker 0-0, policy_version 547147 (0.00089) [2022-07-10 03:27:55,882][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:27:55,894][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000547151_560282624.pth [2022-07-10 03:27:55,894][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000545156_558239744.pth [2022-07-10 03:27:56,927][26022] Updated weights on worker 0-0, policy_version 547157 (0.00092) [2022-07-10 03:27:58,766][26022] Updated weights on worker 0-0, policy_version 547167 (0.00088) [2022-07-10 03:27:59,647][25689] Fps is (10 sec: 5592.4, 60 sec: 5637.4, 300 sec: 5670.9). Total num frames: 560303104. Throughput: 0: 6002.5. Samples: 560305044. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:27:59,647][25689] Avg episode reward: [(0, '-27.370')] [2022-07-10 03:28:00,499][26022] Updated weights on worker 0-0, policy_version 547177 (0.00085) [2022-07-10 03:28:02,695][26022] Updated weights on worker 0-0, policy_version 547187 (0.00095) [2022-07-10 03:28:04,478][26022] Updated weights on worker 0-0, policy_version 547197 (0.00091) [2022-07-10 03:28:04,650][25689] Fps is (10 sec: 5417.4, 60 sec: 5671.9, 300 sec: 5671.1). Total num frames: 560329728. Throughput: 0: 5889.4. Samples: 560337222. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:04,651][25689] Avg episode reward: [(0, '-27.759')] [2022-07-10 03:28:06,335][26022] Updated weights on worker 0-0, policy_version 547207 (0.00085) [2022-07-10 03:28:08,137][26022] Updated weights on worker 0-0, policy_version 547217 (0.00086) [2022-07-10 03:28:09,671][25689] Fps is (10 sec: 5618.1, 60 sec: 5672.6, 300 sec: 5675.1). Total num frames: 560359424. Throughput: 0: 5015.2. Samples: 560354170. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:09,672][25689] Avg episode reward: [(0, '-27.374')] [2022-07-10 03:28:09,769][26022] Updated weights on worker 0-0, policy_version 547227 (0.00086) [2022-07-10 03:28:11,951][26022] Updated weights on worker 0-0, policy_version 547237 (0.00082) [2022-07-10 03:28:13,377][26022] Updated weights on worker 0-0, policy_version 547247 (0.00085) [2022-07-10 03:28:14,731][25689] Fps is (10 sec: 5586.5, 60 sec: 5638.5, 300 sec: 5664.1). Total num frames: 560386048. Throughput: 0: 5835.4. Samples: 560388128. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:14,732][25689] Avg episode reward: [(0, '-27.350')] [2022-07-10 03:28:15,395][26022] Updated weights on worker 0-0, policy_version 547257 (0.00093) [2022-07-10 03:28:17,053][26022] Updated weights on worker 0-0, policy_version 547267 (0.00089) [2022-07-10 03:28:18,781][26022] Updated weights on worker 0-0, policy_version 547277 (0.00095) [2022-07-10 03:28:19,735][25689] Fps is (10 sec: 5698.4, 60 sec: 5690.4, 300 sec: 5671.3). Total num frames: 560416768. Throughput: 0: 5866.4. Samples: 560422888. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:19,735][25689] Avg episode reward: [(0, '-26.583')] [2022-07-10 03:28:20,652][26022] Updated weights on worker 0-0, policy_version 547287 (0.00094) [2022-07-10 03:28:22,404][26022] Updated weights on worker 0-0, policy_version 547297 (0.00087) [2022-07-10 03:28:24,251][26022] Updated weights on worker 0-0, policy_version 547307 (0.00089) [2022-07-10 03:28:24,762][25689] Fps is (10 sec: 5920.9, 60 sec: 5689.4, 300 sec: 5671.1). Total num frames: 560445440. Throughput: 0: 5109.4. Samples: 560439984. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:24,763][25689] Avg episode reward: [(0, '-27.432')] [2022-07-10 03:28:26,119][26022] Updated weights on worker 0-0, policy_version 547317 (0.00088) [2022-07-10 03:28:27,802][26022] Updated weights on worker 0-0, policy_version 547327 (0.00090) [2022-07-10 03:28:29,766][25689] Fps is (10 sec: 5512.1, 60 sec: 5639.1, 300 sec: 5669.8). Total num frames: 560472064. Throughput: 0: 5952.8. Samples: 560473792. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:29,767][26022] Updated weights on worker 0-0, policy_version 547337 (0.00079) [2022-07-10 03:28:29,768][25689] Avg episode reward: [(0, '-27.809')] [2022-07-10 03:28:31,606][26022] Updated weights on worker 0-0, policy_version 547347 (0.00087) [2022-07-10 03:28:33,435][26022] Updated weights on worker 0-0, policy_version 547357 (0.00086) [2022-07-10 03:28:34,836][25689] Fps is (10 sec: 5590.6, 60 sec: 5689.9, 300 sec: 5669.0). Total num frames: 560501760. Throughput: 0: 5946.4. Samples: 560507680. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:34,838][25689] Avg episode reward: [(0, '-27.823')] [2022-07-10 03:28:35,196][26022] Updated weights on worker 0-0, policy_version 547367 (0.00092) [2022-07-10 03:28:37,195][26022] Updated weights on worker 0-0, policy_version 547377 (0.00089) [2022-07-10 03:28:38,650][26022] Updated weights on worker 0-0, policy_version 547387 (0.00087) [2022-07-10 03:28:39,847][25689] Fps is (10 sec: 5587.1, 60 sec: 5638.5, 300 sec: 5658.8). Total num frames: 560528384. Throughput: 0: 5053.7. Samples: 560524528. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:39,847][25689] Avg episode reward: [(0, '-27.538')] [2022-07-10 03:28:40,731][26022] Updated weights on worker 0-0, policy_version 547397 (0.00082) [2022-07-10 03:28:42,155][26022] Updated weights on worker 0-0, policy_version 547407 (0.00083) [2022-07-10 03:28:44,287][26022] Updated weights on worker 0-0, policy_version 547417 (0.01042) [2022-07-10 03:28:44,849][25689] Fps is (10 sec: 5624.6, 60 sec: 5672.4, 300 sec: 5663.4). Total num frames: 560558080. Throughput: 0: 5898.8. Samples: 560558474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:44,851][25689] Avg episode reward: [(0, '-27.161')] [2022-07-10 03:28:45,950][26022] Updated weights on worker 0-0, policy_version 547427 (0.00093) [2022-07-10 03:28:47,889][26022] Updated weights on worker 0-0, policy_version 547437 (0.00086) [2022-07-10 03:28:49,660][26022] Updated weights on worker 0-0, policy_version 547447 (0.00088) [2022-07-10 03:28:49,853][25689] Fps is (10 sec: 5833.1, 60 sec: 5640.1, 300 sec: 5665.2). Total num frames: 560586752. Throughput: 0: 5941.3. Samples: 560593134. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:49,854][25689] Avg episode reward: [(0, '-26.706')] [2022-07-10 03:28:51,552][26022] Updated weights on worker 0-0, policy_version 547457 (0.00099) [2022-07-10 03:28:53,111][26022] Updated weights on worker 0-0, policy_version 547467 (0.00081) [2022-07-10 03:28:54,899][25689] Fps is (10 sec: 5502.5, 60 sec: 5607.1, 300 sec: 5654.5). Total num frames: 560613376. Throughput: 0: 5086.7. Samples: 560609732. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:54,899][25689] Avg episode reward: [(0, '-29.002')] [2022-07-10 03:28:55,018][26022] Updated weights on worker 0-0, policy_version 547477 (0.00085) [2022-07-10 03:28:56,803][26022] Updated weights on worker 0-0, policy_version 547487 (0.00095) [2022-07-10 03:28:58,681][26022] Updated weights on worker 0-0, policy_version 547497 (0.00095) [2022-07-10 03:28:59,917][25689] Fps is (10 sec: 5799.7, 60 sec: 5674.7, 300 sec: 5675.2). Total num frames: 560645120. Throughput: 0: 5973.3. Samples: 560644414. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:28:59,918][25689] Avg episode reward: [(0, '-29.546')] [2022-07-10 03:29:00,462][26022] Updated weights on worker 0-0, policy_version 547507 (0.00085) [2022-07-10 03:29:02,351][26022] Updated weights on worker 0-0, policy_version 547517 (0.00096) [2022-07-10 03:29:04,527][26022] Updated weights on worker 0-0, policy_version 547527 (0.00084) [2022-07-10 03:29:04,943][25689] Fps is (10 sec: 5607.3, 60 sec: 5638.6, 300 sec: 5657.9). Total num frames: 560669696. Throughput: 0: 5869.9. Samples: 560676420. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:29:04,943][25689] Avg episode reward: [(0, '-30.560')] [2022-07-10 03:29:06,112][26022] Updated weights on worker 0-0, policy_version 547537 (0.00088) [2022-07-10 03:29:08,144][26022] Updated weights on worker 0-0, policy_version 547547 (0.00091) [2022-07-10 03:29:09,757][26022] Updated weights on worker 0-0, policy_version 547557 (0.00096) [2022-07-10 03:29:09,967][25689] Fps is (10 sec: 5298.5, 60 sec: 5621.4, 300 sec: 5662.8). Total num frames: 560698368. Throughput: 0: 4988.6. Samples: 560693474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:29:09,967][25689] Avg episode reward: [(0, '-30.128')] [2022-07-10 03:29:11,605][26022] Updated weights on worker 0-0, policy_version 547567 (0.00085) [2022-07-10 03:29:13,382][26022] Updated weights on worker 0-0, policy_version 547577 (0.00087) [2022-07-10 03:29:15,020][25689] Fps is (10 sec: 5690.4, 60 sec: 5656.0, 300 sec: 5659.0). Total num frames: 560727040. Throughput: 0: 5856.0. Samples: 560727562. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:29:15,020][25689] Avg episode reward: [(0, '-29.883')] [2022-07-10 03:29:15,202][26022] Updated weights on worker 0-0, policy_version 547587 (0.00087) [2022-07-10 03:29:17,031][26022] Updated weights on worker 0-0, policy_version 547597 (0.00091) [2022-07-10 03:29:18,799][26022] Updated weights on worker 0-0, policy_version 547607 (0.00095) [2022-07-10 03:29:20,046][25689] Fps is (10 sec: 5791.3, 60 sec: 5636.9, 300 sec: 5659.0). Total num frames: 560756736. Throughput: 0: 5835.4. Samples: 560761870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:29:20,048][25689] Avg episode reward: [(0, '-29.384')] [2022-07-10 03:29:20,666][26022] Updated weights on worker 0-0, policy_version 547617 (0.00092) [2022-07-10 03:29:22,555][26022] Updated weights on worker 0-0, policy_version 547627 (0.00092) [2022-07-10 03:29:24,179][26022] Updated weights on worker 0-0, policy_version 547637 (0.00084) [2022-07-10 03:29:25,055][25689] Fps is (10 sec: 5714.6, 60 sec: 5621.7, 300 sec: 5660.4). Total num frames: 560784384. Throughput: 0: 5101.5. Samples: 560779022. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:29:25,055][25689] Avg episode reward: [(0, '-27.893')] [2022-07-10 03:29:25,967][26022] Updated weights on worker 0-0, policy_version 547647 (0.00094) [2022-07-10 03:29:27,765][26022] Updated weights on worker 0-0, policy_version 547657 (0.00090) [2022-07-10 03:29:29,505][26022] Updated weights on worker 0-0, policy_version 547667 (0.00092) [2022-07-10 03:29:30,064][25689] Fps is (10 sec: 5621.5, 60 sec: 5655.1, 300 sec: 5655.3). Total num frames: 560813056. Throughput: 0: 5969.5. Samples: 560813444. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:29:30,065][25689] Avg episode reward: [(0, '-27.193')] [2022-07-10 03:29:31,591][26022] Updated weights on worker 0-0, policy_version 547677 (0.00086) [2022-07-10 03:29:33,033][26022] Updated weights on worker 0-0, policy_version 547687 (0.00087) [2022-07-10 03:29:35,111][25689] Fps is (10 sec: 5702.5, 60 sec: 5640.3, 300 sec: 5655.3). Total num frames: 560841728. Throughput: 0: 5977.0. Samples: 560847644. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:29:35,112][25689] Avg episode reward: [(0, '-27.706')] [2022-07-10 03:29:35,113][26022] Updated weights on worker 0-0, policy_version 547697 (0.00162) [2022-07-10 03:29:36,630][26022] Updated weights on worker 0-0, policy_version 547707 (0.00094) [2022-07-10 03:29:38,482][26022] Updated weights on worker 0-0, policy_version 547717 (0.00087) [2022-07-10 03:29:40,120][25689] Fps is (10 sec: 5804.4, 60 sec: 5691.4, 300 sec: 5662.3). Total num frames: 560871424. Throughput: 0: 5120.1. Samples: 560864654. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:29:40,120][25689] Avg episode reward: [(0, '-27.717')] [2022-07-10 03:29:40,337][26022] Updated weights on worker 0-0, policy_version 547727 (0.00052) [2022-07-10 03:29:42,099][26022] Updated weights on worker 0-0, policy_version 547737 (0.00092) [2022-07-10 03:29:44,102][26022] Updated weights on worker 0-0, policy_version 547747 (0.00084) [2022-07-10 03:29:45,133][25689] Fps is (10 sec: 5721.4, 60 sec: 5656.4, 300 sec: 5655.3). Total num frames: 560899072. Throughput: 0: 5970.4. Samples: 560898900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:29:45,135][25689] Avg episode reward: [(0, '-28.252')] [2022-07-10 03:29:45,529][26022] Updated weights on worker 0-0, policy_version 547757 (0.00091) [2022-07-10 03:29:47,535][26022] Updated weights on worker 0-0, policy_version 547767 (0.00085) [2022-07-10 03:29:49,315][26022] Updated weights on worker 0-0, policy_version 547777 (0.00083) [2022-07-10 03:29:50,148][25689] Fps is (10 sec: 5616.0, 60 sec: 5655.4, 300 sec: 5656.2). Total num frames: 560927744. Throughput: 0: 5973.2. Samples: 560933412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:29:50,149][25689] Avg episode reward: [(0, '-29.330')] [2022-07-10 03:29:51,080][26022] Updated weights on worker 0-0, policy_version 547787 (0.00095) [2022-07-10 03:29:52,944][26022] Updated weights on worker 0-0, policy_version 547797 (0.00099) [2022-07-10 03:29:54,536][26022] Updated weights on worker 0-0, policy_version 547807 (0.00084) [2022-07-10 03:29:55,203][25689] Fps is (10 sec: 5694.9, 60 sec: 5688.5, 300 sec: 5655.5). Total num frames: 560956416. Throughput: 0: 5117.8. Samples: 560950472. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:29:55,203][25689] Avg episode reward: [(0, '-28.874')] [2022-07-10 03:29:55,914][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:29:55,923][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000547814_560961536.pth [2022-07-10 03:29:55,923][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000545821_558920704.pth [2022-07-10 03:29:56,440][26022] Updated weights on worker 0-0, policy_version 547817 (0.00090) [2022-07-10 03:29:58,595][26022] Updated weights on worker 0-0, policy_version 547827 (0.00088) [2022-07-10 03:30:00,051][26022] Updated weights on worker 0-0, policy_version 547837 (0.00092) [2022-07-10 03:30:00,205][25689] Fps is (10 sec: 5804.1, 60 sec: 5656.1, 300 sec: 5669.6). Total num frames: 560986112. Throughput: 0: 5982.0. Samples: 560984802. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:00,205][25689] Avg episode reward: [(0, '-28.848')] [2022-07-10 03:30:02,331][26022] Updated weights on worker 0-0, policy_version 547847 (0.00098) [2022-07-10 03:30:04,047][26022] Updated weights on worker 0-0, policy_version 547857 (0.00086) [2022-07-10 03:30:05,223][25689] Fps is (10 sec: 5416.2, 60 sec: 5656.8, 300 sec: 5656.3). Total num frames: 561010688. Throughput: 0: 5871.3. Samples: 561016852. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:05,224][25689] Avg episode reward: [(0, '-27.519')] [2022-07-10 03:30:05,744][26022] Updated weights on worker 0-0, policy_version 547867 (0.00087) [2022-07-10 03:30:07,739][26022] Updated weights on worker 0-0, policy_version 547877 (0.00086) [2022-07-10 03:30:09,480][26022] Updated weights on worker 0-0, policy_version 547887 (0.00087) [2022-07-10 03:30:10,234][25689] Fps is (10 sec: 5309.1, 60 sec: 5658.0, 300 sec: 5650.9). Total num frames: 561039360. Throughput: 0: 4993.4. Samples: 561033710. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:10,235][25689] Avg episode reward: [(0, '-28.282')] [2022-07-10 03:30:11,222][26022] Updated weights on worker 0-0, policy_version 547897 (0.00088) [2022-07-10 03:30:13,113][26022] Updated weights on worker 0-0, policy_version 547907 (0.00084) [2022-07-10 03:30:14,862][26022] Updated weights on worker 0-0, policy_version 547917 (0.00085) [2022-07-10 03:30:15,299][25689] Fps is (10 sec: 5691.4, 60 sec: 5656.9, 300 sec: 5656.9). Total num frames: 561068032. Throughput: 0: 5828.3. Samples: 561067598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:15,299][25689] Avg episode reward: [(0, '-28.355')] [2022-07-10 03:30:16,731][26022] Updated weights on worker 0-0, policy_version 547927 (0.00085) [2022-07-10 03:30:18,629][26022] Updated weights on worker 0-0, policy_version 547937 (0.00091) [2022-07-10 03:30:20,198][26022] Updated weights on worker 0-0, policy_version 547947 (0.00087) [2022-07-10 03:30:20,300][25689] Fps is (10 sec: 5900.6, 60 sec: 5676.2, 300 sec: 5661.2). Total num frames: 561098752. Throughput: 0: 5841.7. Samples: 561102190. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:20,301][25689] Avg episode reward: [(0, '-27.556')] [2022-07-10 03:30:22,253][26022] Updated weights on worker 0-0, policy_version 547957 (0.00089) [2022-07-10 03:30:23,802][26022] Updated weights on worker 0-0, policy_version 547967 (0.00093) [2022-07-10 03:30:25,339][25689] Fps is (10 sec: 5711.5, 60 sec: 5656.4, 300 sec: 5654.0). Total num frames: 561125376. Throughput: 0: 5102.2. Samples: 561119486. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:25,339][25689] Avg episode reward: [(0, '-27.853')] [2022-07-10 03:30:25,659][26022] Updated weights on worker 0-0, policy_version 547977 (0.00086) [2022-07-10 03:30:27,421][26022] Updated weights on worker 0-0, policy_version 547987 (0.00088) [2022-07-10 03:30:29,223][26022] Updated weights on worker 0-0, policy_version 547997 (0.00087) [2022-07-10 03:30:30,355][25689] Fps is (10 sec: 5499.5, 60 sec: 5655.8, 300 sec: 5655.9). Total num frames: 561154048. Throughput: 0: 5960.0. Samples: 561153626. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:30,356][25689] Avg episode reward: [(0, '-28.820')] [2022-07-10 03:30:31,099][26022] Updated weights on worker 0-0, policy_version 548007 (0.00089) [2022-07-10 03:30:32,836][26022] Updated weights on worker 0-0, policy_version 548017 (0.00085) [2022-07-10 03:30:34,591][26022] Updated weights on worker 0-0, policy_version 548027 (0.00080) [2022-07-10 03:30:35,463][25689] Fps is (10 sec: 5663.9, 60 sec: 5650.0, 300 sec: 5658.0). Total num frames: 561182720. Throughput: 0: 5974.3. Samples: 561188066. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:35,464][25689] Avg episode reward: [(0, '-28.470')] [2022-07-10 03:30:36,574][26022] Updated weights on worker 0-0, policy_version 548037 (0.00096) [2022-07-10 03:30:37,999][26022] Updated weights on worker 0-0, policy_version 548047 (0.00091) [2022-07-10 03:30:40,240][26022] Updated weights on worker 0-0, policy_version 548057 (0.00090) [2022-07-10 03:30:40,496][25689] Fps is (10 sec: 5755.5, 60 sec: 5647.8, 300 sec: 5657.5). Total num frames: 561212416. Throughput: 0: 5101.8. Samples: 561205226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:40,496][25689] Avg episode reward: [(0, '-28.703')] [2022-07-10 03:30:41,663][26022] Updated weights on worker 0-0, policy_version 548067 (0.00088) [2022-07-10 03:30:43,750][26022] Updated weights on worker 0-0, policy_version 548077 (0.00092) [2022-07-10 03:30:45,465][26022] Updated weights on worker 0-0, policy_version 548087 (0.00083) [2022-07-10 03:30:45,564][25689] Fps is (10 sec: 5778.7, 60 sec: 5659.7, 300 sec: 5656.4). Total num frames: 561241088. Throughput: 0: 5943.5. Samples: 561239694. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:45,564][25689] Avg episode reward: [(0, '-29.514')] [2022-07-10 03:30:47,206][26022] Updated weights on worker 0-0, policy_version 548097 (0.00095) [2022-07-10 03:30:48,860][26022] Updated weights on worker 0-0, policy_version 548107 (0.00094) [2022-07-10 03:30:50,664][25689] Fps is (10 sec: 5740.2, 60 sec: 5668.6, 300 sec: 5659.7). Total num frames: 561270784. Throughput: 0: 5931.0. Samples: 561274082. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:50,666][25689] Avg episode reward: [(0, '-28.517')] [2022-07-10 03:30:50,830][26022] Updated weights on worker 0-0, policy_version 548117 (0.00088) [2022-07-10 03:30:52,535][26022] Updated weights on worker 0-0, policy_version 548127 (0.00094) [2022-07-10 03:30:54,543][26022] Updated weights on worker 0-0, policy_version 548137 (0.00088) [2022-07-10 03:30:55,717][25689] Fps is (10 sec: 5748.8, 60 sec: 5668.8, 300 sec: 5660.6). Total num frames: 561299456. Throughput: 0: 5923.6. Samples: 561308040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:30:55,718][25689] Avg episode reward: [(0, '-29.363')] [2022-07-10 03:30:56,071][26022] Updated weights on worker 0-0, policy_version 548147 (0.00113) [2022-07-10 03:30:58,135][26022] Updated weights on worker 0-0, policy_version 548157 (0.00092) [2022-07-10 03:30:59,676][26022] Updated weights on worker 0-0, policy_version 548167 (0.00074) [2022-07-10 03:31:00,746][25689] Fps is (10 sec: 5586.1, 60 sec: 5632.4, 300 sec: 5660.9). Total num frames: 561327104. Throughput: 0: 5927.7. Samples: 561325264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:00,747][25689] Avg episode reward: [(0, '-29.521')] [2022-07-10 03:31:01,656][26022] Updated weights on worker 0-0, policy_version 548177 (0.00091) [2022-07-10 03:31:03,777][26022] Updated weights on worker 0-0, policy_version 548187 (0.00085) [2022-07-10 03:31:05,622][26022] Updated weights on worker 0-0, policy_version 548197 (0.00091) [2022-07-10 03:31:05,772][25689] Fps is (10 sec: 5397.5, 60 sec: 5665.5, 300 sec: 5650.3). Total num frames: 561353728. Throughput: 0: 5822.7. Samples: 561357360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:05,772][25689] Avg episode reward: [(0, '-28.999')] [2022-07-10 03:31:07,268][26022] Updated weights on worker 0-0, policy_version 548207 (0.00086) [2022-07-10 03:31:09,330][26022] Updated weights on worker 0-0, policy_version 548217 (0.00096) [2022-07-10 03:31:10,787][25689] Fps is (10 sec: 5609.4, 60 sec: 5682.1, 300 sec: 5662.8). Total num frames: 561383424. Throughput: 0: 5821.2. Samples: 561391218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:10,787][25689] Avg episode reward: [(0, '-28.514')] [2022-07-10 03:31:11,089][26022] Updated weights on worker 0-0, policy_version 548227 (0.00093) [2022-07-10 03:31:12,933][26022] Updated weights on worker 0-0, policy_version 548237 (0.00085) [2022-07-10 03:31:14,706][26022] Updated weights on worker 0-0, policy_version 548247 (0.00087) [2022-07-10 03:31:15,907][25689] Fps is (10 sec: 5556.7, 60 sec: 5643.0, 300 sec: 5644.9). Total num frames: 561410048. Throughput: 0: 4959.3. Samples: 561408170. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:15,908][25689] Avg episode reward: [(0, '-27.827')] [2022-07-10 03:31:16,484][26022] Updated weights on worker 0-0, policy_version 548257 (0.00092) [2022-07-10 03:31:18,437][26022] Updated weights on worker 0-0, policy_version 548267 (0.00089) [2022-07-10 03:31:20,035][26022] Updated weights on worker 0-0, policy_version 548277 (0.00075) [2022-07-10 03:31:20,920][25689] Fps is (10 sec: 5558.0, 60 sec: 5625.1, 300 sec: 5655.6). Total num frames: 561439744. Throughput: 0: 5797.6. Samples: 561442222. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:20,920][25689] Avg episode reward: [(0, '-28.425')] [2022-07-10 03:31:21,868][26022] Updated weights on worker 0-0, policy_version 548287 (0.00094) [2022-07-10 03:31:23,740][26022] Updated weights on worker 0-0, policy_version 548297 (0.00087) [2022-07-10 03:31:25,447][26022] Updated weights on worker 0-0, policy_version 548307 (0.00090) [2022-07-10 03:31:25,930][25689] Fps is (10 sec: 5721.2, 60 sec: 5644.6, 300 sec: 5649.3). Total num frames: 561467392. Throughput: 0: 5901.8. Samples: 561476332. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:25,931][25689] Avg episode reward: [(0, '-27.147')] [2022-07-10 03:31:27,509][26022] Updated weights on worker 0-0, policy_version 548317 (0.00083) [2022-07-10 03:31:29,288][26022] Updated weights on worker 0-0, policy_version 548327 (0.00095) [2022-07-10 03:31:30,967][25689] Fps is (10 sec: 5605.7, 60 sec: 5642.7, 300 sec: 5653.4). Total num frames: 561496064. Throughput: 0: 5055.4. Samples: 561493236. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:30,967][25689] Avg episode reward: [(0, '-26.656')] [2022-07-10 03:31:30,970][26022] Updated weights on worker 0-0, policy_version 548337 (0.00093) [2022-07-10 03:31:32,973][26022] Updated weights on worker 0-0, policy_version 548347 (0.00082) [2022-07-10 03:31:34,402][26022] Updated weights on worker 0-0, policy_version 548357 (0.00093) [2022-07-10 03:31:36,086][25689] Fps is (10 sec: 5646.5, 60 sec: 5641.7, 300 sec: 5651.4). Total num frames: 561524736. Throughput: 0: 5911.2. Samples: 561527450. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:36,086][25689] Avg episode reward: [(0, '-27.556')] [2022-07-10 03:31:36,475][26022] Updated weights on worker 0-0, policy_version 548367 (0.00093) [2022-07-10 03:31:37,984][26022] Updated weights on worker 0-0, policy_version 548377 (0.00090) [2022-07-10 03:31:39,999][26022] Updated weights on worker 0-0, policy_version 548387 (0.00092) [2022-07-10 03:31:41,106][25689] Fps is (10 sec: 5756.9, 60 sec: 5642.9, 300 sec: 5652.7). Total num frames: 561554432. Throughput: 0: 5910.2. Samples: 561561524. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:41,106][25689] Avg episode reward: [(0, '-27.828')] [2022-07-10 03:31:41,841][26022] Updated weights on worker 0-0, policy_version 548397 (0.00086) [2022-07-10 03:31:43,443][26022] Updated weights on worker 0-0, policy_version 548407 (0.00085) [2022-07-10 03:31:45,421][26022] Updated weights on worker 0-0, policy_version 548417 (0.00090) [2022-07-10 03:31:46,137][25689] Fps is (10 sec: 5807.3, 60 sec: 5646.3, 300 sec: 5657.4). Total num frames: 561583104. Throughput: 0: 5056.6. Samples: 561578508. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:46,137][25689] Avg episode reward: [(0, '-29.257')] [2022-07-10 03:31:47,340][26022] Updated weights on worker 0-0, policy_version 548427 (0.00093) [2022-07-10 03:31:48,977][26022] Updated weights on worker 0-0, policy_version 548437 (0.00109) [2022-07-10 03:31:50,903][26022] Updated weights on worker 0-0, policy_version 548447 (0.00093) [2022-07-10 03:31:51,205][25689] Fps is (10 sec: 5678.1, 60 sec: 5632.5, 300 sec: 5653.5). Total num frames: 561611776. Throughput: 0: 5892.3. Samples: 561612484. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:51,205][25689] Avg episode reward: [(0, '-28.948')] [2022-07-10 03:31:52,575][26022] Updated weights on worker 0-0, policy_version 548457 (0.00084) [2022-07-10 03:31:54,457][26022] Updated weights on worker 0-0, policy_version 548467 (0.00091) [2022-07-10 03:31:55,928][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:31:55,936][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000548475_561638400.pth [2022-07-10 03:31:55,936][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000546486_559601664.pth [2022-07-10 03:31:56,111][26022] Updated weights on worker 0-0, policy_version 548477 (0.00094) [2022-07-10 03:31:56,307][25689] Fps is (10 sec: 5739.3, 60 sec: 5644.8, 300 sec: 5651.6). Total num frames: 561641472. Throughput: 0: 5897.3. Samples: 561646698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:31:56,315][25689] Avg episode reward: [(0, '-29.270')] [2022-07-10 03:31:58,195][26022] Updated weights on worker 0-0, policy_version 548487 (0.00090) [2022-07-10 03:31:59,702][26022] Updated weights on worker 0-0, policy_version 548497 (0.00097) [2022-07-10 03:32:01,355][25689] Fps is (10 sec: 5649.7, 60 sec: 5643.1, 300 sec: 5661.2). Total num frames: 561669120. Throughput: 0: 5046.0. Samples: 561663700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:32:01,355][25689] Avg episode reward: [(0, '-28.167')] [2022-07-10 03:32:02,071][26022] Updated weights on worker 0-0, policy_version 548507 (0.00164) [2022-07-10 03:32:03,613][26022] Updated weights on worker 0-0, policy_version 548517 (0.00085) [2022-07-10 03:32:05,613][26022] Updated weights on worker 0-0, policy_version 548527 (0.00087) [2022-07-10 03:32:06,404][25689] Fps is (10 sec: 5577.9, 60 sec: 5674.6, 300 sec: 5657.4). Total num frames: 561697792. Throughput: 0: 5806.8. Samples: 561696194. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:32:06,404][25689] Avg episode reward: [(0, '-27.737')] [2022-07-10 03:32:07,497][26022] Updated weights on worker 0-0, policy_version 548537 (0.00089) [2022-07-10 03:32:09,071][26022] Updated weights on worker 0-0, policy_version 548547 (0.00118) [2022-07-10 03:32:11,131][26022] Updated weights on worker 0-0, policy_version 548557 (0.00082) [2022-07-10 03:32:11,429][25689] Fps is (10 sec: 5488.8, 60 sec: 5623.0, 300 sec: 5651.1). Total num frames: 561724416. Throughput: 0: 5827.8. Samples: 561730348. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:32:11,431][25689] Avg episode reward: [(0, '-26.424')] [2022-07-10 03:32:12,784][26022] Updated weights on worker 0-0, policy_version 548567 (0.00096) [2022-07-10 03:32:14,659][26022] Updated weights on worker 0-0, policy_version 548577 (0.00086) [2022-07-10 03:32:16,375][26022] Updated weights on worker 0-0, policy_version 548587 (0.00087) [2022-07-10 03:32:16,533][25689] Fps is (10 sec: 5459.1, 60 sec: 5658.3, 300 sec: 5652.8). Total num frames: 561753088. Throughput: 0: 4987.2. Samples: 561747570. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:32:16,534][25689] Avg episode reward: [(0, '-26.174')] [2022-07-10 03:32:18,060][26022] Updated weights on worker 0-0, policy_version 548597 (0.00085) [2022-07-10 03:32:20,019][26022] Updated weights on worker 0-0, policy_version 548607 (0.00091) [2022-07-10 03:32:21,532][26022] Updated weights on worker 0-0, policy_version 548617 (0.00086) [2022-07-10 03:32:21,627][25689] Fps is (10 sec: 5824.2, 60 sec: 5667.6, 300 sec: 5658.3). Total num frames: 561783808. Throughput: 0: 5845.2. Samples: 561782194. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:32:21,627][25689] Avg episode reward: [(0, '-27.555')] [2022-07-10 03:32:23,668][26022] Updated weights on worker 0-0, policy_version 548627 (0.00088) [2022-07-10 03:32:25,076][26022] Updated weights on worker 0-0, policy_version 548637 (0.00088) [2022-07-10 03:32:26,677][25689] Fps is (10 sec: 5652.8, 60 sec: 5647.0, 300 sec: 5647.2). Total num frames: 561810432. Throughput: 0: 5942.2. Samples: 561816664. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 03:32:26,678][25689] Avg episode reward: [(0, '-26.431')] [2022-07-10 03:32:27,127][26022] Updated weights on worker 0-0, policy_version 548647 (0.00085) [2022-07-10 03:32:28,730][26022] Updated weights on worker 0-0, policy_version 548657 (0.01122) [2022-07-10 03:32:30,619][26022] Updated weights on worker 0-0, policy_version 548667 (0.00085) [2022-07-10 03:32:31,694][25689] Fps is (10 sec: 5696.0, 60 sec: 5682.6, 300 sec: 5661.9). Total num frames: 561841152. Throughput: 0: 5100.9. Samples: 561833718. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:32:31,694][25689] Avg episode reward: [(0, '-27.557')] [2022-07-10 03:32:32,391][26022] Updated weights on worker 0-0, policy_version 548677 (0.00089) [2022-07-10 03:32:34,261][26022] Updated weights on worker 0-0, policy_version 548687 (0.00093) [2022-07-10 03:32:36,027][26022] Updated weights on worker 0-0, policy_version 548697 (0.00082) [2022-07-10 03:32:36,789][25689] Fps is (10 sec: 5873.7, 60 sec: 5684.9, 300 sec: 5656.8). Total num frames: 561869824. Throughput: 0: 5938.5. Samples: 561867860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:32:36,789][25689] Avg episode reward: [(0, '-27.225')] [2022-07-10 03:32:37,864][26022] Updated weights on worker 0-0, policy_version 548707 (0.00086) [2022-07-10 03:32:39,508][26022] Updated weights on worker 0-0, policy_version 548717 (0.00092) [2022-07-10 03:32:41,708][26022] Updated weights on worker 0-0, policy_version 548727 (0.00092) [2022-07-10 03:32:41,799][25689] Fps is (10 sec: 5573.5, 60 sec: 5652.0, 300 sec: 5656.7). Total num frames: 561897472. Throughput: 0: 5963.7. Samples: 561902496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:32:41,799][25689] Avg episode reward: [(0, '-26.319')] [2022-07-10 03:32:43,146][26022] Updated weights on worker 0-0, policy_version 548737 (0.00087) [2022-07-10 03:32:44,980][26022] Updated weights on worker 0-0, policy_version 548747 (0.00085) [2022-07-10 03:32:46,801][25689] Fps is (10 sec: 5727.5, 60 sec: 5671.6, 300 sec: 5653.6). Total num frames: 561927168. Throughput: 0: 5952.2. Samples: 561936444. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:32:46,801][25689] Avg episode reward: [(0, '-26.483')] [2022-07-10 03:32:46,803][26022] Updated weights on worker 0-0, policy_version 548757 (0.00106) [2022-07-10 03:32:48,463][26022] Updated weights on worker 0-0, policy_version 548767 (0.00087) [2022-07-10 03:32:50,441][26022] Updated weights on worker 0-0, policy_version 548777 (0.00089) [2022-07-10 03:32:51,809][25689] Fps is (10 sec: 5830.8, 60 sec: 5677.2, 300 sec: 5654.5). Total num frames: 561955840. Throughput: 0: 5976.2. Samples: 561953930. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:32:51,810][25689] Avg episode reward: [(0, '-26.364')] [2022-07-10 03:32:52,142][26022] Updated weights on worker 0-0, policy_version 548787 (0.00513) [2022-07-10 03:32:53,891][26022] Updated weights on worker 0-0, policy_version 548797 (0.00094) [2022-07-10 03:32:55,727][26022] Updated weights on worker 0-0, policy_version 548807 (0.00089) [2022-07-10 03:32:56,901][25689] Fps is (10 sec: 5677.5, 60 sec: 5661.3, 300 sec: 5656.5). Total num frames: 561984512. Throughput: 0: 5993.2. Samples: 561988396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:32:56,902][25689] Avg episode reward: [(0, '-27.028')] [2022-07-10 03:32:57,397][26022] Updated weights on worker 0-0, policy_version 548817 (0.00093) [2022-07-10 03:32:59,301][26022] Updated weights on worker 0-0, policy_version 548827 (0.00082) [2022-07-10 03:33:00,931][26022] Updated weights on worker 0-0, policy_version 548837 (0.00090) [2022-07-10 03:33:01,920][25689] Fps is (10 sec: 5468.8, 60 sec: 5647.0, 300 sec: 5656.2). Total num frames: 562011136. Throughput: 0: 5884.4. Samples: 562020900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:01,921][25689] Avg episode reward: [(0, '-27.268')] [2022-07-10 03:33:03,347][26022] Updated weights on worker 0-0, policy_version 548847 (0.00090) [2022-07-10 03:33:05,168][26022] Updated weights on worker 0-0, policy_version 548857 (0.00092) [2022-07-10 03:33:06,679][26022] Updated weights on worker 0-0, policy_version 548867 (0.00088) [2022-07-10 03:33:06,943][25689] Fps is (10 sec: 5608.6, 60 sec: 5666.4, 300 sec: 5656.1). Total num frames: 562040832. Throughput: 0: 5043.7. Samples: 562038034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:06,943][25689] Avg episode reward: [(0, '-28.124')] [2022-07-10 03:33:08,771][26022] Updated weights on worker 0-0, policy_version 548877 (0.00416) [2022-07-10 03:33:10,392][26022] Updated weights on worker 0-0, policy_version 548887 (0.00093) [2022-07-10 03:33:11,964][25689] Fps is (10 sec: 5709.4, 60 sec: 5683.7, 300 sec: 5660.3). Total num frames: 562068480. Throughput: 0: 5860.2. Samples: 562072042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:11,965][25689] Avg episode reward: [(0, '-28.414')] [2022-07-10 03:33:12,352][26022] Updated weights on worker 0-0, policy_version 548897 (0.00089) [2022-07-10 03:33:13,995][26022] Updated weights on worker 0-0, policy_version 548907 (0.00092) [2022-07-10 03:33:16,022][26022] Updated weights on worker 0-0, policy_version 548917 (0.00102) [2022-07-10 03:33:17,084][25689] Fps is (10 sec: 5553.3, 60 sec: 5682.2, 300 sec: 5651.2). Total num frames: 562097152. Throughput: 0: 5829.8. Samples: 562106062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:17,085][25689] Avg episode reward: [(0, '-27.700')] [2022-07-10 03:33:17,583][26022] Updated weights on worker 0-0, policy_version 548927 (0.00096) [2022-07-10 03:33:19,591][26022] Updated weights on worker 0-0, policy_version 548937 (0.00086) [2022-07-10 03:33:21,121][26022] Updated weights on worker 0-0, policy_version 548947 (0.00087) [2022-07-10 03:33:22,180][25689] Fps is (10 sec: 5613.1, 60 sec: 5648.1, 300 sec: 5649.9). Total num frames: 562125824. Throughput: 0: 5055.0. Samples: 562123314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:22,181][25689] Avg episode reward: [(0, '-27.289')] [2022-07-10 03:33:22,980][26022] Updated weights on worker 0-0, policy_version 548957 (0.00088) [2022-07-10 03:33:24,954][26022] Updated weights on worker 0-0, policy_version 548967 (0.00091) [2022-07-10 03:33:26,634][26022] Updated weights on worker 0-0, policy_version 548977 (0.00084) [2022-07-10 03:33:27,236][25689] Fps is (10 sec: 5749.7, 60 sec: 5698.4, 300 sec: 5659.3). Total num frames: 562155520. Throughput: 0: 5881.9. Samples: 562157396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:27,238][25689] Avg episode reward: [(0, '-25.808')] [2022-07-10 03:33:28,634][26022] Updated weights on worker 0-0, policy_version 548987 (0.00084) [2022-07-10 03:33:30,484][26022] Updated weights on worker 0-0, policy_version 548997 (0.00086) [2022-07-10 03:33:32,058][26022] Updated weights on worker 0-0, policy_version 549007 (0.00092) [2022-07-10 03:33:32,278][25689] Fps is (10 sec: 5780.2, 60 sec: 5662.2, 300 sec: 5656.3). Total num frames: 562184192. Throughput: 0: 5863.8. Samples: 562191160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:32,280][25689] Avg episode reward: [(0, '-26.864')] [2022-07-10 03:33:34,274][26022] Updated weights on worker 0-0, policy_version 549017 (0.00085) [2022-07-10 03:33:35,519][26022] Updated weights on worker 0-0, policy_version 549027 (0.00088) [2022-07-10 03:33:37,397][25689] Fps is (10 sec: 5441.7, 60 sec: 5626.1, 300 sec: 5654.3). Total num frames: 562210816. Throughput: 0: 5028.3. Samples: 562208202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:37,398][25689] Avg episode reward: [(0, '-26.988')] [2022-07-10 03:33:37,719][26022] Updated weights on worker 0-0, policy_version 549037 (0.00085) [2022-07-10 03:33:39,174][26022] Updated weights on worker 0-0, policy_version 549047 (0.00087) [2022-07-10 03:33:41,158][26022] Updated weights on worker 0-0, policy_version 549057 (0.00086) [2022-07-10 03:33:42,437][25689] Fps is (10 sec: 5644.8, 60 sec: 5674.1, 300 sec: 5657.0). Total num frames: 562241536. Throughput: 0: 5884.1. Samples: 562242506. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:42,437][25689] Avg episode reward: [(0, '-27.851')] [2022-07-10 03:33:43,054][26022] Updated weights on worker 0-0, policy_version 549067 (0.00083) [2022-07-10 03:33:44,723][26022] Updated weights on worker 0-0, policy_version 549077 (0.00085) [2022-07-10 03:33:46,580][26022] Updated weights on worker 0-0, policy_version 549087 (0.00091) [2022-07-10 03:33:47,474][25689] Fps is (10 sec: 5894.0, 60 sec: 5653.9, 300 sec: 5656.4). Total num frames: 562270208. Throughput: 0: 5894.4. Samples: 562276688. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:47,475][25689] Avg episode reward: [(0, '-27.153')] [2022-07-10 03:33:48,368][26022] Updated weights on worker 0-0, policy_version 549097 (0.00087) [2022-07-10 03:33:50,029][26022] Updated weights on worker 0-0, policy_version 549107 (0.00089) [2022-07-10 03:33:51,951][26022] Updated weights on worker 0-0, policy_version 549117 (0.00088) [2022-07-10 03:33:52,495][25689] Fps is (10 sec: 5701.3, 60 sec: 5652.7, 300 sec: 5663.7). Total num frames: 562298880. Throughput: 0: 5088.8. Samples: 562294038. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:52,495][25689] Avg episode reward: [(0, '-27.636')] [2022-07-10 03:33:53,826][26022] Updated weights on worker 0-0, policy_version 549127 (0.00083) [2022-07-10 03:33:55,486][26022] Updated weights on worker 0-0, policy_version 549137 (0.00092) [2022-07-10 03:33:56,180][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:33:56,196][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000549141_562320384.pth [2022-07-10 03:33:56,196][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000547151_560282624.pth [2022-07-10 03:33:57,389][26022] Updated weights on worker 0-0, policy_version 549147 (0.00090) [2022-07-10 03:33:57,539][25689] Fps is (10 sec: 5595.8, 60 sec: 5640.3, 300 sec: 5649.5). Total num frames: 562326528. Throughput: 0: 5940.8. Samples: 562327858. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:33:57,539][25689] Avg episode reward: [(0, '-28.020')] [2022-07-10 03:33:58,887][26022] Updated weights on worker 0-0, policy_version 549157 (0.00086) [2022-07-10 03:34:01,089][26022] Updated weights on worker 0-0, policy_version 549167 (0.00095) [2022-07-10 03:34:02,562][25689] Fps is (10 sec: 5696.4, 60 sec: 5690.6, 300 sec: 5666.7). Total num frames: 562356224. Throughput: 0: 5945.4. Samples: 562362156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:02,562][25689] Avg episode reward: [(0, '-27.532')] [2022-07-10 03:34:02,867][26022] Updated weights on worker 0-0, policy_version 549177 (0.00090) [2022-07-10 03:34:05,075][26022] Updated weights on worker 0-0, policy_version 549187 (0.00095) [2022-07-10 03:34:06,664][26022] Updated weights on worker 0-0, policy_version 549197 (0.00090) [2022-07-10 03:34:07,575][25689] Fps is (10 sec: 5509.7, 60 sec: 5624.0, 300 sec: 5656.6). Total num frames: 562381824. Throughput: 0: 5013.7. Samples: 562377466. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:07,575][25689] Avg episode reward: [(0, '-26.569')] [2022-07-10 03:34:08,580][26022] Updated weights on worker 0-0, policy_version 549207 (0.00084) [2022-07-10 03:34:10,083][26022] Updated weights on worker 0-0, policy_version 549217 (0.00083) [2022-07-10 03:34:12,230][26022] Updated weights on worker 0-0, policy_version 549227 (0.00075) [2022-07-10 03:34:12,587][25689] Fps is (10 sec: 5413.2, 60 sec: 5641.7, 300 sec: 5657.4). Total num frames: 562410496. Throughput: 0: 5838.9. Samples: 562411356. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:12,588][25689] Avg episode reward: [(0, '-25.789')] [2022-07-10 03:34:13,869][26022] Updated weights on worker 0-0, policy_version 549237 (0.00092) [2022-07-10 03:34:15,752][26022] Updated weights on worker 0-0, policy_version 549247 (0.00091) [2022-07-10 03:34:17,420][26022] Updated weights on worker 0-0, policy_version 549257 (0.00084) [2022-07-10 03:34:17,631][25689] Fps is (10 sec: 5702.2, 60 sec: 5648.8, 300 sec: 5653.6). Total num frames: 562439168. Throughput: 0: 5847.2. Samples: 562445342. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:17,632][25689] Avg episode reward: [(0, '-26.019')] [2022-07-10 03:34:19,315][26022] Updated weights on worker 0-0, policy_version 549267 (0.00088) [2022-07-10 03:34:21,260][26022] Updated weights on worker 0-0, policy_version 549277 (0.00090) [2022-07-10 03:34:22,633][25689] Fps is (10 sec: 5708.2, 60 sec: 5657.6, 300 sec: 5657.2). Total num frames: 562467840. Throughput: 0: 5000.1. Samples: 562462516. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:22,634][25689] Avg episode reward: [(0, '-25.749')] [2022-07-10 03:34:22,790][26022] Updated weights on worker 0-0, policy_version 549287 (0.00085) [2022-07-10 03:34:24,654][26022] Updated weights on worker 0-0, policy_version 549297 (0.00083) [2022-07-10 03:34:26,403][26022] Updated weights on worker 0-0, policy_version 549307 (0.00090) [2022-07-10 03:34:27,664][25689] Fps is (10 sec: 5715.8, 60 sec: 5643.0, 300 sec: 5656.8). Total num frames: 562496512. Throughput: 0: 5949.2. Samples: 562496980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:27,665][25689] Avg episode reward: [(0, '-26.171')] [2022-07-10 03:34:28,175][26022] Updated weights on worker 0-0, policy_version 549317 (0.00093) [2022-07-10 03:34:30,183][26022] Updated weights on worker 0-0, policy_version 549327 (0.00086) [2022-07-10 03:34:31,812][26022] Updated weights on worker 0-0, policy_version 549337 (0.00095) [2022-07-10 03:34:32,681][25689] Fps is (10 sec: 5707.3, 60 sec: 5645.3, 300 sec: 5657.3). Total num frames: 562525184. Throughput: 0: 5948.9. Samples: 562530890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:32,683][25689] Avg episode reward: [(0, '-27.527')] [2022-07-10 03:34:33,924][26022] Updated weights on worker 0-0, policy_version 549347 (0.00090) [2022-07-10 03:34:35,443][26022] Updated weights on worker 0-0, policy_version 549357 (0.00094) [2022-07-10 03:34:37,480][26022] Updated weights on worker 0-0, policy_version 549367 (0.00085) [2022-07-10 03:34:37,735][25689] Fps is (10 sec: 5592.1, 60 sec: 5668.4, 300 sec: 5649.6). Total num frames: 562552832. Throughput: 0: 5098.7. Samples: 562547846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:37,736][25689] Avg episode reward: [(0, '-28.028')] [2022-07-10 03:34:39,037][26022] Updated weights on worker 0-0, policy_version 549377 (0.00081) [2022-07-10 03:34:40,999][26022] Updated weights on worker 0-0, policy_version 549387 (0.00093) [2022-07-10 03:34:42,559][26022] Updated weights on worker 0-0, policy_version 549397 (0.00093) [2022-07-10 03:34:42,752][25689] Fps is (10 sec: 5693.7, 60 sec: 5653.5, 300 sec: 5656.4). Total num frames: 562582528. Throughput: 0: 5946.6. Samples: 562582156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:42,754][25689] Avg episode reward: [(0, '-29.287')] [2022-07-10 03:34:44,612][26022] Updated weights on worker 0-0, policy_version 549407 (0.00090) [2022-07-10 03:34:46,273][26022] Updated weights on worker 0-0, policy_version 549417 (0.00080) [2022-07-10 03:34:47,765][25689] Fps is (10 sec: 5819.2, 60 sec: 5655.7, 300 sec: 5656.5). Total num frames: 562611200. Throughput: 0: 5957.4. Samples: 562616734. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:47,766][25689] Avg episode reward: [(0, '-29.455')] [2022-07-10 03:34:48,126][26022] Updated weights on worker 0-0, policy_version 549427 (0.00093) [2022-07-10 03:34:49,661][26022] Updated weights on worker 0-0, policy_version 549437 (0.00086) [2022-07-10 03:34:51,719][26022] Updated weights on worker 0-0, policy_version 549447 (0.00081) [2022-07-10 03:34:52,780][25689] Fps is (10 sec: 5718.8, 60 sec: 5656.4, 300 sec: 5657.2). Total num frames: 562639872. Throughput: 0: 5128.8. Samples: 562633972. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:52,781][25689] Avg episode reward: [(0, '-30.365')] [2022-07-10 03:34:53,368][26022] Updated weights on worker 0-0, policy_version 549457 (0.00052) [2022-07-10 03:34:55,313][26022] Updated weights on worker 0-0, policy_version 549467 (0.00088) [2022-07-10 03:34:56,911][26022] Updated weights on worker 0-0, policy_version 549477 (0.00089) [2022-07-10 03:34:57,842][25689] Fps is (10 sec: 5690.9, 60 sec: 5671.6, 300 sec: 5652.6). Total num frames: 562668544. Throughput: 0: 5978.2. Samples: 562668048. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:34:57,843][25689] Avg episode reward: [(0, '-30.413')] [2022-07-10 03:34:58,922][26022] Updated weights on worker 0-0, policy_version 549487 (0.00513) [2022-07-10 03:35:00,624][26022] Updated weights on worker 0-0, policy_version 549497 (0.00090) [2022-07-10 03:35:02,758][26022] Updated weights on worker 0-0, policy_version 549507 (0.00090) [2022-07-10 03:35:02,853][25689] Fps is (10 sec: 5489.3, 60 sec: 5621.8, 300 sec: 5659.6). Total num frames: 562695168. Throughput: 0: 5924.4. Samples: 562701240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:35:02,853][25689] Avg episode reward: [(0, '-29.367')] [2022-07-10 03:35:04,674][26022] Updated weights on worker 0-0, policy_version 549517 (0.00087) [2022-07-10 03:35:06,359][26022] Updated weights on worker 0-0, policy_version 549527 (0.00084) [2022-07-10 03:35:07,877][25689] Fps is (10 sec: 5408.0, 60 sec: 5654.7, 300 sec: 5656.0). Total num frames: 562722816. Throughput: 0: 4999.2. Samples: 562717278. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:35:07,878][25689] Avg episode reward: [(0, '-28.900')] [2022-07-10 03:35:08,504][26022] Updated weights on worker 0-0, policy_version 549537 (0.00086) [2022-07-10 03:35:09,863][26022] Updated weights on worker 0-0, policy_version 549547 (0.00084) [2022-07-10 03:35:11,975][26022] Updated weights on worker 0-0, policy_version 549557 (0.00084) [2022-07-10 03:35:12,923][25689] Fps is (10 sec: 5592.6, 60 sec: 5651.5, 300 sec: 5656.3). Total num frames: 562751488. Throughput: 0: 5825.0. Samples: 562751310. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:35:12,924][25689] Avg episode reward: [(0, '-26.786')] [2022-07-10 03:35:13,604][26022] Updated weights on worker 0-0, policy_version 549567 (0.00096) [2022-07-10 03:35:15,451][26022] Updated weights on worker 0-0, policy_version 549577 (0.00098) [2022-07-10 03:35:17,423][26022] Updated weights on worker 0-0, policy_version 549587 (0.00079) [2022-07-10 03:35:18,028][25689] Fps is (10 sec: 5548.2, 60 sec: 5628.9, 300 sec: 5644.0). Total num frames: 562779136. Throughput: 0: 5796.2. Samples: 562785054. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:35:18,029][25689] Avg episode reward: [(0, '-26.857')] [2022-07-10 03:35:18,993][26022] Updated weights on worker 0-0, policy_version 549597 (0.00090) [2022-07-10 03:35:21,048][26022] Updated weights on worker 0-0, policy_version 549607 (0.00084) [2022-07-10 03:35:22,719][26022] Updated weights on worker 0-0, policy_version 549617 (0.00091) [2022-07-10 03:35:23,091][25689] Fps is (10 sec: 5640.1, 60 sec: 5640.2, 300 sec: 5653.9). Total num frames: 562808832. Throughput: 0: 4994.0. Samples: 562802306. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:35:23,091][25689] Avg episode reward: [(0, '-26.778')] [2022-07-10 03:35:24,402][26022] Updated weights on worker 0-0, policy_version 549627 (0.00088) [2022-07-10 03:35:26,421][26022] Updated weights on worker 0-0, policy_version 549637 (0.00362) [2022-07-10 03:35:27,975][26022] Updated weights on worker 0-0, policy_version 549647 (0.00087) [2022-07-10 03:35:28,109][25689] Fps is (10 sec: 5891.4, 60 sec: 5658.2, 300 sec: 5657.3). Total num frames: 562838528. Throughput: 0: 5909.2. Samples: 562836834. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-10 03:35:28,110][25689] Avg episode reward: [(0, '-26.183')] [2022-07-10 03:35:30,123][26022] Updated weights on worker 0-0, policy_version 549657 (0.00085) [2022-07-10 03:35:31,808][26022] Updated weights on worker 0-0, policy_version 549667 (0.00084) [2022-07-10 03:35:33,148][25689] Fps is (10 sec: 5803.8, 60 sec: 5656.3, 300 sec: 5658.6). Total num frames: 562867200. Throughput: 0: 5909.7. Samples: 562870828. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:35:33,148][25689] Avg episode reward: [(0, '-26.502')] [2022-07-10 03:35:33,495][26022] Updated weights on worker 0-0, policy_version 549677 (0.00088) [2022-07-10 03:35:35,504][26022] Updated weights on worker 0-0, policy_version 549687 (0.00085) [2022-07-10 03:35:37,143][26022] Updated weights on worker 0-0, policy_version 549697 (0.00084) [2022-07-10 03:35:38,259][25689] Fps is (10 sec: 5649.8, 60 sec: 5667.8, 300 sec: 5653.7). Total num frames: 562895872. Throughput: 0: 5934.6. Samples: 562905116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:35:38,260][25689] Avg episode reward: [(0, '-26.341')] [2022-07-10 03:35:39,101][26022] Updated weights on worker 0-0, policy_version 549707 (0.00091) [2022-07-10 03:35:40,533][26022] Updated weights on worker 0-0, policy_version 549717 (0.00090) [2022-07-10 03:35:42,611][26022] Updated weights on worker 0-0, policy_version 549727 (0.00091) [2022-07-10 03:35:43,264][25689] Fps is (10 sec: 5668.5, 60 sec: 5652.1, 300 sec: 5654.9). Total num frames: 562924544. Throughput: 0: 5938.0. Samples: 562922094. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:35:43,264][25689] Avg episode reward: [(0, '-26.468')] [2022-07-10 03:35:44,354][26022] Updated weights on worker 0-0, policy_version 549737 (0.00094) [2022-07-10 03:35:46,275][26022] Updated weights on worker 0-0, policy_version 549747 (0.00091) [2022-07-10 03:35:48,023][26022] Updated weights on worker 0-0, policy_version 549757 (0.00095) [2022-07-10 03:35:48,327][25689] Fps is (10 sec: 5594.0, 60 sec: 5630.5, 300 sec: 5648.7). Total num frames: 562952192. Throughput: 0: 5886.7. Samples: 562955850. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:35:48,327][25689] Avg episode reward: [(0, '-26.623')] [2022-07-10 03:35:49,888][26022] Updated weights on worker 0-0, policy_version 549767 (0.00085) [2022-07-10 03:35:51,583][26022] Updated weights on worker 0-0, policy_version 549777 (0.00084) [2022-07-10 03:35:53,407][25689] Fps is (10 sec: 5552.5, 60 sec: 5624.4, 300 sec: 5648.2). Total num frames: 562980864. Throughput: 0: 5889.3. Samples: 562990142. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:35:53,409][25689] Avg episode reward: [(0, '-26.526')] [2022-07-10 03:35:53,424][26022] Updated weights on worker 0-0, policy_version 549787 (0.00140) [2022-07-10 03:35:55,096][26022] Updated weights on worker 0-0, policy_version 549797 (0.00090) [2022-07-10 03:35:56,269][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:35:56,288][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000549803_562998272.pth [2022-07-10 03:35:56,289][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000547814_560961536.pth [2022-07-10 03:35:57,171][26022] Updated weights on worker 0-0, policy_version 549807 (0.00086) [2022-07-10 03:35:58,517][25689] Fps is (10 sec: 5828.3, 60 sec: 5653.7, 300 sec: 5657.0). Total num frames: 563011584. Throughput: 0: 5042.0. Samples: 563007258. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:35:58,518][25689] Avg episode reward: [(0, '-25.681')] [2022-07-10 03:35:58,608][26022] Updated weights on worker 0-0, policy_version 549817 (0.00091) [2022-07-10 03:36:00,627][26022] Updated weights on worker 0-0, policy_version 549827 (0.00091) [2022-07-10 03:36:02,503][26022] Updated weights on worker 0-0, policy_version 549837 (0.00086) [2022-07-10 03:36:03,608][25689] Fps is (10 sec: 5520.8, 60 sec: 5629.4, 300 sec: 5652.3). Total num frames: 563037184. Throughput: 0: 5783.1. Samples: 563039748. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:03,609][25689] Avg episode reward: [(0, '-26.131')] [2022-07-10 03:36:04,537][26022] Updated weights on worker 0-0, policy_version 549847 (0.00083) [2022-07-10 03:36:06,248][26022] Updated weights on worker 0-0, policy_version 549857 (0.00086) [2022-07-10 03:36:08,104][26022] Updated weights on worker 0-0, policy_version 549867 (0.00091) [2022-07-10 03:36:08,678][25689] Fps is (10 sec: 5442.3, 60 sec: 5658.9, 300 sec: 5651.2). Total num frames: 563066880. Throughput: 0: 5802.8. Samples: 563073942. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:08,678][25689] Avg episode reward: [(0, '-27.609')] [2022-07-10 03:36:09,791][26022] Updated weights on worker 0-0, policy_version 549877 (0.00090) [2022-07-10 03:36:11,613][26022] Updated weights on worker 0-0, policy_version 549887 (0.00090) [2022-07-10 03:36:13,384][26022] Updated weights on worker 0-0, policy_version 549897 (0.00108) [2022-07-10 03:36:13,689][25689] Fps is (10 sec: 5891.4, 60 sec: 5679.0, 300 sec: 5663.6). Total num frames: 563096576. Throughput: 0: 4975.4. Samples: 563091056. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:13,690][25689] Avg episode reward: [(0, '-27.522')] [2022-07-10 03:36:15,475][26022] Updated weights on worker 0-0, policy_version 549907 (0.00080) [2022-07-10 03:36:17,114][26022] Updated weights on worker 0-0, policy_version 549917 (0.00086) [2022-07-10 03:36:18,789][25689] Fps is (10 sec: 5570.0, 60 sec: 5662.6, 300 sec: 5651.7). Total num frames: 563123200. Throughput: 0: 5802.1. Samples: 563124876. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:18,790][25689] Avg episode reward: [(0, '-27.078')] [2022-07-10 03:36:18,954][26022] Updated weights on worker 0-0, policy_version 549927 (0.00085) [2022-07-10 03:36:20,707][26022] Updated weights on worker 0-0, policy_version 549937 (0.00089) [2022-07-10 03:36:22,598][26022] Updated weights on worker 0-0, policy_version 549947 (0.00091) [2022-07-10 03:36:23,821][25689] Fps is (10 sec: 5458.1, 60 sec: 5648.6, 300 sec: 5654.7). Total num frames: 563151872. Throughput: 0: 5908.0. Samples: 563159162. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:23,821][25689] Avg episode reward: [(0, '-28.208')] [2022-07-10 03:36:24,279][26022] Updated weights on worker 0-0, policy_version 549957 (0.00709) [2022-07-10 03:36:26,278][26022] Updated weights on worker 0-0, policy_version 549967 (0.00088) [2022-07-10 03:36:28,072][26022] Updated weights on worker 0-0, policy_version 549977 (0.00087) [2022-07-10 03:36:28,823][25689] Fps is (10 sec: 5816.9, 60 sec: 5650.1, 300 sec: 5658.8). Total num frames: 563181568. Throughput: 0: 5073.7. Samples: 563176154. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:28,824][25689] Avg episode reward: [(0, '-29.552')] [2022-07-10 03:36:29,840][26022] Updated weights on worker 0-0, policy_version 549987 (0.00056) [2022-07-10 03:36:31,534][26022] Updated weights on worker 0-0, policy_version 549997 (0.00083) [2022-07-10 03:36:33,467][26022] Updated weights on worker 0-0, policy_version 550007 (0.00084) [2022-07-10 03:36:33,827][25689] Fps is (10 sec: 5730.6, 60 sec: 5636.4, 300 sec: 5657.5). Total num frames: 563209216. Throughput: 0: 5904.0. Samples: 563209948. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:33,828][25689] Avg episode reward: [(0, '-28.599')] [2022-07-10 03:36:35,259][26022] Updated weights on worker 0-0, policy_version 550017 (0.00091) [2022-07-10 03:36:37,136][26022] Updated weights on worker 0-0, policy_version 550027 (0.00087) [2022-07-10 03:36:38,753][26022] Updated weights on worker 0-0, policy_version 550037 (0.00086) [2022-07-10 03:36:38,883][25689] Fps is (10 sec: 5700.3, 60 sec: 5658.5, 300 sec: 5656.8). Total num frames: 563238912. Throughput: 0: 5942.9. Samples: 563244292. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:38,884][25689] Avg episode reward: [(0, '-28.398')] [2022-07-10 03:36:40,694][26022] Updated weights on worker 0-0, policy_version 550047 (0.00093) [2022-07-10 03:36:42,554][26022] Updated weights on worker 0-0, policy_version 550057 (0.00084) [2022-07-10 03:36:43,898][25689] Fps is (10 sec: 5694.1, 60 sec: 5640.7, 300 sec: 5653.7). Total num frames: 563266560. Throughput: 0: 5095.1. Samples: 563261458. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:43,898][25689] Avg episode reward: [(0, '-28.715')] [2022-07-10 03:36:44,236][26022] Updated weights on worker 0-0, policy_version 550067 (0.00091) [2022-07-10 03:36:46,103][26022] Updated weights on worker 0-0, policy_version 550077 (0.00079) [2022-07-10 03:36:47,647][26022] Updated weights on worker 0-0, policy_version 550087 (0.00090) [2022-07-10 03:36:48,905][25689] Fps is (10 sec: 5517.3, 60 sec: 5645.8, 300 sec: 5651.4). Total num frames: 563294208. Throughput: 0: 5940.2. Samples: 563295446. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:48,906][25689] Avg episode reward: [(0, '-28.214')] [2022-07-10 03:36:49,714][26022] Updated weights on worker 0-0, policy_version 550097 (0.00085) [2022-07-10 03:36:51,448][26022] Updated weights on worker 0-0, policy_version 550107 (0.00086) [2022-07-10 03:36:53,243][26022] Updated weights on worker 0-0, policy_version 550117 (0.00089) [2022-07-10 03:36:53,948][25689] Fps is (10 sec: 5705.6, 60 sec: 5666.2, 300 sec: 5652.5). Total num frames: 563323904. Throughput: 0: 5933.5. Samples: 563329338. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:53,950][25689] Avg episode reward: [(0, '-26.994')] [2022-07-10 03:36:55,148][26022] Updated weights on worker 0-0, policy_version 550127 (0.00087) [2022-07-10 03:36:56,899][26022] Updated weights on worker 0-0, policy_version 550137 (0.00086) [2022-07-10 03:36:58,718][26022] Updated weights on worker 0-0, policy_version 550147 (0.00090) [2022-07-10 03:36:59,024][25689] Fps is (10 sec: 5565.8, 60 sec: 5601.7, 300 sec: 5648.6). Total num frames: 563350528. Throughput: 0: 5055.5. Samples: 563346116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:36:59,026][25689] Avg episode reward: [(0, '-26.788')] [2022-07-10 03:37:00,386][26022] Updated weights on worker 0-0, policy_version 550157 (0.00094) [2022-07-10 03:37:02,936][26022] Updated weights on worker 0-0, policy_version 550167 (0.00093) [2022-07-10 03:37:04,084][25689] Fps is (10 sec: 5354.5, 60 sec: 5638.5, 300 sec: 5644.9). Total num frames: 563378176. Throughput: 0: 5758.7. Samples: 563377704. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:04,085][25689] Avg episode reward: [(0, '-26.185')] [2022-07-10 03:37:04,511][26022] Updated weights on worker 0-0, policy_version 550177 (0.00092) [2022-07-10 03:37:06,423][26022] Updated weights on worker 0-0, policy_version 550187 (0.00086) [2022-07-10 03:37:08,295][26022] Updated weights on worker 0-0, policy_version 550197 (0.00088) [2022-07-10 03:37:09,093][25689] Fps is (10 sec: 5492.0, 60 sec: 5610.3, 300 sec: 5648.7). Total num frames: 563405824. Throughput: 0: 5758.8. Samples: 563411702. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:09,095][25689] Avg episode reward: [(0, '-25.561')] [2022-07-10 03:37:10,049][26022] Updated weights on worker 0-0, policy_version 550207 (0.00093) [2022-07-10 03:37:11,919][26022] Updated weights on worker 0-0, policy_version 550217 (0.00084) [2022-07-10 03:37:13,459][26022] Updated weights on worker 0-0, policy_version 550227 (0.00088) [2022-07-10 03:37:14,129][25689] Fps is (10 sec: 5708.6, 60 sec: 5607.9, 300 sec: 5653.4). Total num frames: 563435520. Throughput: 0: 4930.8. Samples: 563428846. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:14,130][25689] Avg episode reward: [(0, '-24.848')] [2022-07-10 03:37:15,431][26022] Updated weights on worker 0-0, policy_version 550237 (0.00086) [2022-07-10 03:37:17,146][26022] Updated weights on worker 0-0, policy_version 550247 (0.00086) [2022-07-10 03:37:18,907][26022] Updated weights on worker 0-0, policy_version 550257 (0.00083) [2022-07-10 03:37:19,210][25689] Fps is (10 sec: 5667.7, 60 sec: 5626.6, 300 sec: 5643.3). Total num frames: 563463168. Throughput: 0: 5788.2. Samples: 563462956. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:19,211][25689] Avg episode reward: [(0, '-25.262')] [2022-07-10 03:37:20,878][26022] Updated weights on worker 0-0, policy_version 550267 (0.00092) [2022-07-10 03:37:22,673][26022] Updated weights on worker 0-0, policy_version 550277 (0.00084) [2022-07-10 03:37:24,215][25689] Fps is (10 sec: 5584.3, 60 sec: 5629.2, 300 sec: 5651.1). Total num frames: 563491840. Throughput: 0: 5934.7. Samples: 563497172. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:24,215][25689] Avg episode reward: [(0, '-25.676')] [2022-07-10 03:37:24,480][26022] Updated weights on worker 0-0, policy_version 550287 (0.00821) [2022-07-10 03:37:26,202][26022] Updated weights on worker 0-0, policy_version 550297 (0.00083) [2022-07-10 03:37:28,045][26022] Updated weights on worker 0-0, policy_version 550307 (0.00088) [2022-07-10 03:37:29,229][25689] Fps is (10 sec: 5723.9, 60 sec: 5611.2, 300 sec: 5644.2). Total num frames: 563520512. Throughput: 0: 5102.6. Samples: 563514448. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:29,229][25689] Avg episode reward: [(0, '-24.869')] [2022-07-10 03:37:30,042][26022] Updated weights on worker 0-0, policy_version 550317 (0.00089) [2022-07-10 03:37:31,544][26022] Updated weights on worker 0-0, policy_version 550327 (0.00092) [2022-07-10 03:37:33,569][26022] Updated weights on worker 0-0, policy_version 550337 (0.00096) [2022-07-10 03:37:34,237][25689] Fps is (10 sec: 5721.7, 60 sec: 5627.7, 300 sec: 5645.9). Total num frames: 563549184. Throughput: 0: 5939.0. Samples: 563548266. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:34,239][25689] Avg episode reward: [(0, '-24.657')] [2022-07-10 03:37:35,091][26022] Updated weights on worker 0-0, policy_version 550347 (0.00097) [2022-07-10 03:37:37,082][26022] Updated weights on worker 0-0, policy_version 550357 (0.00087) [2022-07-10 03:37:39,122][26022] Updated weights on worker 0-0, policy_version 550367 (0.00090) [2022-07-10 03:37:39,376][25689] Fps is (10 sec: 5651.0, 60 sec: 5603.0, 300 sec: 5646.9). Total num frames: 563577856. Throughput: 0: 5915.6. Samples: 563582250. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:39,377][25689] Avg episode reward: [(0, '-25.152')] [2022-07-10 03:37:40,547][26022] Updated weights on worker 0-0, policy_version 550377 (0.00071) [2022-07-10 03:37:42,616][26022] Updated weights on worker 0-0, policy_version 550387 (0.00086) [2022-07-10 03:37:44,242][26022] Updated weights on worker 0-0, policy_version 550397 (0.00088) [2022-07-10 03:37:44,382][25689] Fps is (10 sec: 5652.5, 60 sec: 5620.8, 300 sec: 5643.4). Total num frames: 563606528. Throughput: 0: 5070.7. Samples: 563599434. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:44,382][25689] Avg episode reward: [(0, '-25.310')] [2022-07-10 03:37:46,190][26022] Updated weights on worker 0-0, policy_version 550407 (0.00096) [2022-07-10 03:37:47,942][26022] Updated weights on worker 0-0, policy_version 550417 (0.00090) [2022-07-10 03:37:49,405][25689] Fps is (10 sec: 5717.7, 60 sec: 5636.2, 300 sec: 5643.1). Total num frames: 563635200. Throughput: 0: 5887.7. Samples: 563633242. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:49,406][25689] Avg episode reward: [(0, '-26.693')] [2022-07-10 03:37:49,683][26022] Updated weights on worker 0-0, policy_version 550427 (0.00084) [2022-07-10 03:37:51,535][26022] Updated weights on worker 0-0, policy_version 550437 (0.00093) [2022-07-10 03:37:53,257][26022] Updated weights on worker 0-0, policy_version 550447 (0.00086) [2022-07-10 03:37:54,414][25689] Fps is (10 sec: 5716.2, 60 sec: 5622.5, 300 sec: 5644.7). Total num frames: 563663872. Throughput: 0: 5901.2. Samples: 563667332. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:54,414][25689] Avg episode reward: [(0, '-26.642')] [2022-07-10 03:37:55,307][26022] Updated weights on worker 0-0, policy_version 550457 (0.00095) [2022-07-10 03:37:56,371][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:37:56,382][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000550463_563674112.pth [2022-07-10 03:37:56,383][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000548475_561638400.pth [2022-07-10 03:37:56,801][26022] Updated weights on worker 0-0, policy_version 550467 (0.00089) [2022-07-10 03:37:58,729][26022] Updated weights on worker 0-0, policy_version 550477 (0.00084) [2022-07-10 03:37:59,456][25689] Fps is (10 sec: 5705.5, 60 sec: 5659.5, 300 sec: 5651.1). Total num frames: 563692544. Throughput: 0: 5939.6. Samples: 563701516. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:37:59,457][25689] Avg episode reward: [(0, '-27.525')] [2022-07-10 03:38:00,607][26022] Updated weights on worker 0-0, policy_version 550487 (0.00087) [2022-07-10 03:38:02,692][26022] Updated weights on worker 0-0, policy_version 550497 (0.00091) [2022-07-10 03:38:04,495][25689] Fps is (10 sec: 5383.6, 60 sec: 5627.6, 300 sec: 5637.0). Total num frames: 563718144. Throughput: 0: 5815.6. Samples: 563716402. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:38:04,495][25689] Avg episode reward: [(0, '-27.760')] [2022-07-10 03:38:04,612][26022] Updated weights on worker 0-0, policy_version 550507 (0.00085) [2022-07-10 03:38:06,382][26022] Updated weights on worker 0-0, policy_version 550517 (0.00094) [2022-07-10 03:38:08,224][26022] Updated weights on worker 0-0, policy_version 550527 (0.00082) [2022-07-10 03:38:09,497][25689] Fps is (10 sec: 5302.9, 60 sec: 5628.2, 300 sec: 5637.4). Total num frames: 563745792. Throughput: 0: 5840.4. Samples: 563750588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:38:09,498][25689] Avg episode reward: [(0, '-27.841')] [2022-07-10 03:38:09,962][26022] Updated weights on worker 0-0, policy_version 550537 (0.00090) [2022-07-10 03:38:11,750][26022] Updated weights on worker 0-0, policy_version 550547 (0.00083) [2022-07-10 03:38:13,514][26022] Updated weights on worker 0-0, policy_version 550557 (0.00078) [2022-07-10 03:38:14,513][25689] Fps is (10 sec: 5723.8, 60 sec: 5630.1, 300 sec: 5642.8). Total num frames: 563775488. Throughput: 0: 5840.5. Samples: 563784724. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:38:14,514][25689] Avg episode reward: [(0, '-26.082')] [2022-07-10 03:38:15,563][26022] Updated weights on worker 0-0, policy_version 550567 (0.00090) [2022-07-10 03:38:16,965][26022] Updated weights on worker 0-0, policy_version 550577 (0.00087) [2022-07-10 03:38:19,083][26022] Updated weights on worker 0-0, policy_version 550587 (0.00099) [2022-07-10 03:38:19,618][25689] Fps is (10 sec: 5767.4, 60 sec: 5644.9, 300 sec: 5642.7). Total num frames: 563804160. Throughput: 0: 4959.2. Samples: 563801502. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:38:19,618][25689] Avg episode reward: [(0, '-25.185')] [2022-07-10 03:38:20,794][26022] Updated weights on worker 0-0, policy_version 550597 (0.00087) [2022-07-10 03:38:22,614][26022] Updated weights on worker 0-0, policy_version 550607 (0.00095) [2022-07-10 03:38:24,509][26022] Updated weights on worker 0-0, policy_version 550617 (0.00090) [2022-07-10 03:38:24,654][25689] Fps is (10 sec: 5655.0, 60 sec: 5641.9, 300 sec: 5639.6). Total num frames: 563832832. Throughput: 0: 5916.3. Samples: 563835668. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:38:24,654][25689] Avg episode reward: [(0, '-24.130')] [2022-07-10 03:38:26,207][26022] Updated weights on worker 0-0, policy_version 550627 (0.00093) [2022-07-10 03:38:28,039][26022] Updated weights on worker 0-0, policy_version 550637 (0.00084) [2022-07-10 03:38:29,660][25689] Fps is (10 sec: 5608.1, 60 sec: 5625.7, 300 sec: 5636.8). Total num frames: 563860480. Throughput: 0: 5911.9. Samples: 563869790. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 03:38:29,661][25689] Avg episode reward: [(0, '-24.199')] [2022-07-10 03:38:29,807][26022] Updated weights on worker 0-0, policy_version 550647 (0.00087) [2022-07-10 03:38:31,648][26022] Updated weights on worker 0-0, policy_version 550657 (0.00090) [2022-07-10 03:38:33,443][26022] Updated weights on worker 0-0, policy_version 550667 (0.00088) [2022-07-10 03:38:34,663][25689] Fps is (10 sec: 5626.8, 60 sec: 5626.2, 300 sec: 5645.9). Total num frames: 563889152. Throughput: 0: 5071.3. Samples: 563886908. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:38:34,663][25689] Avg episode reward: [(0, '-24.313')] [2022-07-10 03:38:35,267][26022] Updated weights on worker 0-0, policy_version 550677 (0.00089) [2022-07-10 03:38:37,008][26022] Updated weights on worker 0-0, policy_version 550687 (0.00084) [2022-07-10 03:38:38,933][26022] Updated weights on worker 0-0, policy_version 550697 (0.00083) [2022-07-10 03:38:39,766][25689] Fps is (10 sec: 5674.5, 60 sec: 5629.6, 300 sec: 5637.8). Total num frames: 563917824. Throughput: 0: 5920.5. Samples: 563920790. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:38:39,768][25689] Avg episode reward: [(0, '-25.014')] [2022-07-10 03:38:40,563][26022] Updated weights on worker 0-0, policy_version 550707 (0.00086) [2022-07-10 03:38:42,562][26022] Updated weights on worker 0-0, policy_version 550717 (0.00083) [2022-07-10 03:38:44,274][26022] Updated weights on worker 0-0, policy_version 550727 (0.00090) [2022-07-10 03:38:44,775][25689] Fps is (10 sec: 5771.9, 60 sec: 5646.2, 300 sec: 5641.8). Total num frames: 563947520. Throughput: 0: 5941.4. Samples: 563955220. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:38:44,776][25689] Avg episode reward: [(0, '-26.277')] [2022-07-10 03:38:46,091][26022] Updated weights on worker 0-0, policy_version 550737 (0.00088) [2022-07-10 03:38:47,970][26022] Updated weights on worker 0-0, policy_version 550747 (0.00081) [2022-07-10 03:38:49,789][25689] Fps is (10 sec: 5721.4, 60 sec: 5630.2, 300 sec: 5638.5). Total num frames: 563975168. Throughput: 0: 5074.3. Samples: 563971926. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:38:49,789][25689] Avg episode reward: [(0, '-26.469')] [2022-07-10 03:38:49,795][26022] Updated weights on worker 0-0, policy_version 550757 (0.00089) [2022-07-10 03:38:51,587][26022] Updated weights on worker 0-0, policy_version 550767 (0.00082) [2022-07-10 03:38:53,598][26022] Updated weights on worker 0-0, policy_version 550777 (0.00093) [2022-07-10 03:38:54,843][25689] Fps is (10 sec: 5594.1, 60 sec: 5625.9, 300 sec: 5641.7). Total num frames: 564003840. Throughput: 0: 5872.6. Samples: 564005418. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:38:54,843][25689] Avg episode reward: [(0, '-26.509')] [2022-07-10 03:38:55,154][26022] Updated weights on worker 0-0, policy_version 550787 (0.00092) [2022-07-10 03:38:57,050][26022] Updated weights on worker 0-0, policy_version 550797 (0.00089) [2022-07-10 03:38:58,888][26022] Updated weights on worker 0-0, policy_version 550807 (0.00086) [2022-07-10 03:38:59,883][25689] Fps is (10 sec: 5579.4, 60 sec: 5609.2, 300 sec: 5634.5). Total num frames: 564031488. Throughput: 0: 5900.7. Samples: 564039494. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:38:59,883][25689] Avg episode reward: [(0, '-26.544')] [2022-07-10 03:39:00,656][26022] Updated weights on worker 0-0, policy_version 550817 (0.00090) [2022-07-10 03:39:03,005][26022] Updated weights on worker 0-0, policy_version 550827 (0.00086) [2022-07-10 03:39:04,752][26022] Updated weights on worker 0-0, policy_version 550837 (0.00083) [2022-07-10 03:39:04,888][25689] Fps is (10 sec: 5504.7, 60 sec: 5646.2, 300 sec: 5641.6). Total num frames: 564059136. Throughput: 0: 4960.6. Samples: 564054992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:04,889][25689] Avg episode reward: [(0, '-26.440')] [2022-07-10 03:39:06,471][26022] Updated weights on worker 0-0, policy_version 550847 (0.00095) [2022-07-10 03:39:08,099][26022] Updated weights on worker 0-0, policy_version 550857 (0.00087) [2022-07-10 03:39:09,939][25689] Fps is (10 sec: 5397.0, 60 sec: 5624.8, 300 sec: 5634.0). Total num frames: 564085760. Throughput: 0: 5794.2. Samples: 564088680. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:09,939][25689] Avg episode reward: [(0, '-26.416')] [2022-07-10 03:39:10,098][26022] Updated weights on worker 0-0, policy_version 550867 (0.00087) [2022-07-10 03:39:11,762][26022] Updated weights on worker 0-0, policy_version 550877 (0.00085) [2022-07-10 03:39:13,581][26022] Updated weights on worker 0-0, policy_version 550887 (0.00091) [2022-07-10 03:39:14,947][25689] Fps is (10 sec: 5701.1, 60 sec: 5642.4, 300 sec: 5641.5). Total num frames: 564116480. Throughput: 0: 5853.3. Samples: 564123090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:14,948][25689] Avg episode reward: [(0, '-26.610')] [2022-07-10 03:39:15,404][26022] Updated weights on worker 0-0, policy_version 550897 (0.00088) [2022-07-10 03:39:17,204][26022] Updated weights on worker 0-0, policy_version 550907 (0.00625) [2022-07-10 03:39:19,147][26022] Updated weights on worker 0-0, policy_version 550917 (0.00086) [2022-07-10 03:39:20,068][25689] Fps is (10 sec: 5762.1, 60 sec: 5623.9, 300 sec: 5635.8). Total num frames: 564144128. Throughput: 0: 4984.2. Samples: 564140102. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:20,069][25689] Avg episode reward: [(0, '-27.561')] [2022-07-10 03:39:20,768][26022] Updated weights on worker 0-0, policy_version 550927 (0.00088) [2022-07-10 03:39:22,697][26022] Updated weights on worker 0-0, policy_version 550937 (0.00084) [2022-07-10 03:39:24,536][26022] Updated weights on worker 0-0, policy_version 550947 (0.00087) [2022-07-10 03:39:25,118][25689] Fps is (10 sec: 5537.1, 60 sec: 5622.6, 300 sec: 5635.4). Total num frames: 564172800. Throughput: 0: 5878.2. Samples: 564173906. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:25,119][25689] Avg episode reward: [(0, '-26.206')] [2022-07-10 03:39:26,227][26022] Updated weights on worker 0-0, policy_version 550957 (0.00061) [2022-07-10 03:39:28,117][26022] Updated weights on worker 0-0, policy_version 550967 (0.00096) [2022-07-10 03:39:29,883][26022] Updated weights on worker 0-0, policy_version 550977 (0.00087) [2022-07-10 03:39:30,129][25689] Fps is (10 sec: 5699.7, 60 sec: 5639.1, 300 sec: 5635.6). Total num frames: 564201472. Throughput: 0: 5910.7. Samples: 564208020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:30,131][25689] Avg episode reward: [(0, '-26.594')] [2022-07-10 03:39:31,701][26022] Updated weights on worker 0-0, policy_version 550987 (0.00085) [2022-07-10 03:39:33,444][26022] Updated weights on worker 0-0, policy_version 550997 (0.00088) [2022-07-10 03:39:35,185][25689] Fps is (10 sec: 5696.2, 60 sec: 5634.1, 300 sec: 5639.0). Total num frames: 564230144. Throughput: 0: 5031.0. Samples: 564224910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:35,186][25689] Avg episode reward: [(0, '-26.817')] [2022-07-10 03:39:35,340][26022] Updated weights on worker 0-0, policy_version 551007 (0.00088) [2022-07-10 03:39:37,115][26022] Updated weights on worker 0-0, policy_version 551017 (0.00085) [2022-07-10 03:39:39,067][26022] Updated weights on worker 0-0, policy_version 551027 (0.00088) [2022-07-10 03:39:40,229][25689] Fps is (10 sec: 5576.5, 60 sec: 5622.8, 300 sec: 5631.6). Total num frames: 564257792. Throughput: 0: 5883.5. Samples: 564258716. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:40,229][25689] Avg episode reward: [(0, '-27.153')] [2022-07-10 03:39:40,691][26022] Updated weights on worker 0-0, policy_version 551037 (0.00057) [2022-07-10 03:39:42,655][26022] Updated weights on worker 0-0, policy_version 551047 (0.00097) [2022-07-10 03:39:44,171][26022] Updated weights on worker 0-0, policy_version 551057 (0.00090) [2022-07-10 03:39:45,248][25689] Fps is (10 sec: 5698.8, 60 sec: 5621.9, 300 sec: 5634.9). Total num frames: 564287488. Throughput: 0: 5929.7. Samples: 564293268. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:45,248][25689] Avg episode reward: [(0, '-27.775')] [2022-07-10 03:39:46,260][26022] Updated weights on worker 0-0, policy_version 551067 (0.00087) [2022-07-10 03:39:47,699][26022] Updated weights on worker 0-0, policy_version 551077 (0.00084) [2022-07-10 03:39:49,786][26022] Updated weights on worker 0-0, policy_version 551087 (0.00051) [2022-07-10 03:39:50,252][25689] Fps is (10 sec: 5721.1, 60 sec: 5622.7, 300 sec: 5631.7). Total num frames: 564315136. Throughput: 0: 5097.5. Samples: 564310596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:50,253][25689] Avg episode reward: [(0, '-27.512')] [2022-07-10 03:39:51,463][26022] Updated weights on worker 0-0, policy_version 551097 (0.00093) [2022-07-10 03:39:53,403][26022] Updated weights on worker 0-0, policy_version 551107 (0.00090) [2022-07-10 03:39:55,066][26022] Updated weights on worker 0-0, policy_version 551117 (0.00095) [2022-07-10 03:39:55,255][25689] Fps is (10 sec: 5627.6, 60 sec: 5627.5, 300 sec: 5632.8). Total num frames: 564343808. Throughput: 0: 5967.2. Samples: 564344672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:39:55,256][25689] Avg episode reward: [(0, '-27.815')] [2022-07-10 03:39:56,415][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:39:56,427][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000551124_564350976.pth [2022-07-10 03:39:56,428][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000549141_562320384.pth [2022-07-10 03:39:56,811][26022] Updated weights on worker 0-0, policy_version 551127 (0.00091) [2022-07-10 03:39:58,811][26022] Updated weights on worker 0-0, policy_version 551137 (0.00089) [2022-07-10 03:40:00,329][25689] Fps is (10 sec: 5792.4, 60 sec: 5658.2, 300 sec: 5641.9). Total num frames: 564373504. Throughput: 0: 5985.1. Samples: 564379016. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:00,329][25689] Avg episode reward: [(0, '-28.527')] [2022-07-10 03:40:00,337][26022] Updated weights on worker 0-0, policy_version 551147 (0.00084) [2022-07-10 03:40:02,678][26022] Updated weights on worker 0-0, policy_version 551157 (0.00098) [2022-07-10 03:40:04,248][26022] Updated weights on worker 0-0, policy_version 551167 (0.00092) [2022-07-10 03:40:05,330][25689] Fps is (10 sec: 5387.1, 60 sec: 5607.8, 300 sec: 5632.0). Total num frames: 564398080. Throughput: 0: 5011.6. Samples: 564393912. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:05,330][25689] Avg episode reward: [(0, '-28.446')] [2022-07-10 03:40:06,304][26022] Updated weights on worker 0-0, policy_version 551177 (0.00082) [2022-07-10 03:40:08,155][26022] Updated weights on worker 0-0, policy_version 551187 (0.00086) [2022-07-10 03:40:09,997][26022] Updated weights on worker 0-0, policy_version 551197 (0.00087) [2022-07-10 03:40:10,341][25689] Fps is (10 sec: 5420.6, 60 sec: 5662.3, 300 sec: 5636.2). Total num frames: 564427776. Throughput: 0: 5825.3. Samples: 564427618. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:10,341][25689] Avg episode reward: [(0, '-27.144')] [2022-07-10 03:40:11,765][26022] Updated weights on worker 0-0, policy_version 551207 (0.00087) [2022-07-10 03:40:13,578][26022] Updated weights on worker 0-0, policy_version 551217 (0.00085) [2022-07-10 03:40:15,363][25689] Fps is (10 sec: 5715.4, 60 sec: 5610.1, 300 sec: 5637.7). Total num frames: 564455424. Throughput: 0: 5823.2. Samples: 564461764. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:15,365][25689] Avg episode reward: [(0, '-26.181')] [2022-07-10 03:40:15,448][26022] Updated weights on worker 0-0, policy_version 551227 (0.00093) [2022-07-10 03:40:17,203][26022] Updated weights on worker 0-0, policy_version 551237 (0.00083) [2022-07-10 03:40:18,903][26022] Updated weights on worker 0-0, policy_version 551247 (0.00096) [2022-07-10 03:40:20,500][25689] Fps is (10 sec: 5543.5, 60 sec: 5625.6, 300 sec: 5632.9). Total num frames: 564484096. Throughput: 0: 4946.1. Samples: 564478788. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:20,502][25689] Avg episode reward: [(0, '-26.361')] [2022-07-10 03:40:20,871][26022] Updated weights on worker 0-0, policy_version 551257 (0.00086) [2022-07-10 03:40:22,508][26022] Updated weights on worker 0-0, policy_version 551267 (0.00087) [2022-07-10 03:40:24,446][26022] Updated weights on worker 0-0, policy_version 551277 (0.00087) [2022-07-10 03:40:25,510][25689] Fps is (10 sec: 5853.4, 60 sec: 5663.3, 300 sec: 5636.5). Total num frames: 564514816. Throughput: 0: 5892.6. Samples: 564512824. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:25,511][25689] Avg episode reward: [(0, '-26.730')] [2022-07-10 03:40:26,138][26022] Updated weights on worker 0-0, policy_version 551287 (0.00086) [2022-07-10 03:40:27,923][26022] Updated weights on worker 0-0, policy_version 551297 (0.00091) [2022-07-10 03:40:29,806][26022] Updated weights on worker 0-0, policy_version 551307 (0.00090) [2022-07-10 03:40:30,535][25689] Fps is (10 sec: 5612.3, 60 sec: 5611.1, 300 sec: 5626.4). Total num frames: 564540416. Throughput: 0: 5909.0. Samples: 564546948. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:30,537][25689] Avg episode reward: [(0, '-26.186')] [2022-07-10 03:40:31,624][26022] Updated weights on worker 0-0, policy_version 551317 (0.00083) [2022-07-10 03:40:33,636][26022] Updated weights on worker 0-0, policy_version 551327 (0.00088) [2022-07-10 03:40:35,254][26022] Updated weights on worker 0-0, policy_version 551337 (0.00081) [2022-07-10 03:40:35,577][25689] Fps is (10 sec: 5492.4, 60 sec: 5629.3, 300 sec: 5631.1). Total num frames: 564570112. Throughput: 0: 5056.8. Samples: 564563984. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:35,578][25689] Avg episode reward: [(0, '-26.463')] [2022-07-10 03:40:37,030][26022] Updated weights on worker 0-0, policy_version 551347 (0.00093) [2022-07-10 03:40:38,890][26022] Updated weights on worker 0-0, policy_version 551357 (0.00096) [2022-07-10 03:40:40,484][26022] Updated weights on worker 0-0, policy_version 551367 (0.00095) [2022-07-10 03:40:40,712][25689] Fps is (10 sec: 5936.9, 60 sec: 5671.7, 300 sec: 5635.6). Total num frames: 564600832. Throughput: 0: 5902.5. Samples: 564598086. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:40,712][25689] Avg episode reward: [(0, '-27.027')] [2022-07-10 03:40:42,483][26022] Updated weights on worker 0-0, policy_version 551377 (0.00090) [2022-07-10 03:40:44,247][26022] Updated weights on worker 0-0, policy_version 551387 (0.00087) [2022-07-10 03:40:45,753][25689] Fps is (10 sec: 5736.0, 60 sec: 5635.7, 300 sec: 5636.0). Total num frames: 564628480. Throughput: 0: 5918.5. Samples: 564632636. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:45,754][25689] Avg episode reward: [(0, '-27.558')] [2022-07-10 03:40:45,971][26022] Updated weights on worker 0-0, policy_version 551397 (0.00084) [2022-07-10 03:40:48,016][26022] Updated weights on worker 0-0, policy_version 551407 (0.00089) [2022-07-10 03:40:49,499][26022] Updated weights on worker 0-0, policy_version 551417 (0.00080) [2022-07-10 03:40:50,760][25689] Fps is (10 sec: 5605.1, 60 sec: 5652.4, 300 sec: 5637.4). Total num frames: 564657152. Throughput: 0: 5913.2. Samples: 564666540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:50,761][25689] Avg episode reward: [(0, '-28.637')] [2022-07-10 03:40:51,554][26022] Updated weights on worker 0-0, policy_version 551427 (0.00090) [2022-07-10 03:40:53,320][26022] Updated weights on worker 0-0, policy_version 551437 (0.00091) [2022-07-10 03:40:55,200][26022] Updated weights on worker 0-0, policy_version 551447 (0.00090) [2022-07-10 03:40:55,774][25689] Fps is (10 sec: 5518.2, 60 sec: 5617.6, 300 sec: 5625.4). Total num frames: 564683776. Throughput: 0: 5899.4. Samples: 564683130. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:40:55,774][25689] Avg episode reward: [(0, '-29.227')] [2022-07-10 03:40:56,875][26022] Updated weights on worker 0-0, policy_version 551457 (0.00085) [2022-07-10 03:40:58,827][26022] Updated weights on worker 0-0, policy_version 551467 (0.00092) [2022-07-10 03:41:00,458][26022] Updated weights on worker 0-0, policy_version 551477 (0.00089) [2022-07-10 03:41:00,837][25689] Fps is (10 sec: 5589.0, 60 sec: 5618.5, 300 sec: 5639.7). Total num frames: 564713472. Throughput: 0: 5925.2. Samples: 564717330. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:00,839][25689] Avg episode reward: [(0, '-28.955')] [2022-07-10 03:41:02,719][26022] Updated weights on worker 0-0, policy_version 551487 (0.00091) [2022-07-10 03:41:04,403][26022] Updated weights on worker 0-0, policy_version 551497 (0.01103) [2022-07-10 03:41:05,844][25689] Fps is (10 sec: 5592.8, 60 sec: 5651.8, 300 sec: 5630.6). Total num frames: 564740096. Throughput: 0: 5810.9. Samples: 564749382. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:05,846][25689] Avg episode reward: [(0, '-29.578')] [2022-07-10 03:41:06,544][26022] Updated weights on worker 0-0, policy_version 551507 (0.00090) [2022-07-10 03:41:08,061][26022] Updated weights on worker 0-0, policy_version 551517 (0.00084) [2022-07-10 03:41:10,088][26022] Updated weights on worker 0-0, policy_version 551527 (0.00101) [2022-07-10 03:41:10,859][25689] Fps is (10 sec: 5517.5, 60 sec: 5634.5, 300 sec: 5627.1). Total num frames: 564768768. Throughput: 0: 4972.6. Samples: 564766484. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:10,861][25689] Avg episode reward: [(0, '-28.801')] [2022-07-10 03:41:11,534][26022] Updated weights on worker 0-0, policy_version 551537 (0.00097) [2022-07-10 03:41:13,497][26022] Updated weights on worker 0-0, policy_version 551547 (0.00088) [2022-07-10 03:41:15,375][26022] Updated weights on worker 0-0, policy_version 551557 (0.00088) [2022-07-10 03:41:15,870][25689] Fps is (10 sec: 5617.5, 60 sec: 5635.6, 300 sec: 5632.2). Total num frames: 564796416. Throughput: 0: 5837.1. Samples: 564800432. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:15,872][25689] Avg episode reward: [(0, '-28.258')] [2022-07-10 03:41:17,204][26022] Updated weights on worker 0-0, policy_version 551567 (0.00083) [2022-07-10 03:41:18,891][26022] Updated weights on worker 0-0, policy_version 551577 (0.00081) [2022-07-10 03:41:20,742][26022] Updated weights on worker 0-0, policy_version 551587 (0.00101) [2022-07-10 03:41:20,951][25689] Fps is (10 sec: 5580.6, 60 sec: 5640.8, 300 sec: 5631.3). Total num frames: 564825088. Throughput: 0: 5831.5. Samples: 564834626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:20,951][25689] Avg episode reward: [(0, '-27.413')] [2022-07-10 03:41:22,617][26022] Updated weights on worker 0-0, policy_version 551597 (0.00082) [2022-07-10 03:41:24,600][26022] Updated weights on worker 0-0, policy_version 551607 (0.00096) [2022-07-10 03:41:25,974][25689] Fps is (10 sec: 5776.7, 60 sec: 5622.6, 300 sec: 5630.9). Total num frames: 564854784. Throughput: 0: 5074.6. Samples: 564851532. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:25,974][25689] Avg episode reward: [(0, '-27.464')] [2022-07-10 03:41:26,111][26022] Updated weights on worker 0-0, policy_version 551617 (0.00089) [2022-07-10 03:41:28,134][26022] Updated weights on worker 0-0, policy_version 551627 (0.00092) [2022-07-10 03:41:29,782][26022] Updated weights on worker 0-0, policy_version 551637 (0.00085) [2022-07-10 03:41:31,052][25689] Fps is (10 sec: 5575.6, 60 sec: 5634.7, 300 sec: 5626.0). Total num frames: 564881408. Throughput: 0: 5887.6. Samples: 564885374. Policy #0 lag: (min: 0.0, avg: 8.3, max: 17.0) [2022-07-10 03:41:31,052][25689] Avg episode reward: [(0, '-27.292')] [2022-07-10 03:41:31,779][26022] Updated weights on worker 0-0, policy_version 551647 (0.00087) [2022-07-10 03:41:33,536][26022] Updated weights on worker 0-0, policy_version 551657 (0.00089) [2022-07-10 03:41:35,375][26022] Updated weights on worker 0-0, policy_version 551667 (0.00084) [2022-07-10 03:41:36,150][25689] Fps is (10 sec: 5433.6, 60 sec: 5612.5, 300 sec: 5621.8). Total num frames: 564910080. Throughput: 0: 5858.6. Samples: 564919248. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:41:36,155][25689] Avg episode reward: [(0, '-27.323')] [2022-07-10 03:41:36,994][26022] Updated weights on worker 0-0, policy_version 551677 (0.00083) [2022-07-10 03:41:39,109][26022] Updated weights on worker 0-0, policy_version 551687 (0.00091) [2022-07-10 03:41:40,261][26022] Updated weights on worker 0-0, policy_version 551697 (0.00093) [2022-07-10 03:41:41,243][25689] Fps is (10 sec: 5727.4, 60 sec: 5599.5, 300 sec: 5627.2). Total num frames: 564939776. Throughput: 0: 5021.8. Samples: 564936528. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:41:41,243][25689] Avg episode reward: [(0, '-26.885')] [2022-07-10 03:41:42,721][26022] Updated weights on worker 0-0, policy_version 551707 (0.00091) [2022-07-10 03:41:43,963][26022] Updated weights on worker 0-0, policy_version 551717 (0.00085) [2022-07-10 03:41:46,088][26022] Updated weights on worker 0-0, policy_version 551727 (0.00088) [2022-07-10 03:41:46,300][25689] Fps is (10 sec: 5952.3, 60 sec: 5648.8, 300 sec: 5636.6). Total num frames: 564970496. Throughput: 0: 5878.7. Samples: 564971026. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:41:46,301][25689] Avg episode reward: [(0, '-26.967')] [2022-07-10 03:41:47,936][26022] Updated weights on worker 0-0, policy_version 551737 (0.00086) [2022-07-10 03:41:49,686][26022] Updated weights on worker 0-0, policy_version 551747 (0.00082) [2022-07-10 03:41:51,317][25689] Fps is (10 sec: 5793.8, 60 sec: 5630.9, 300 sec: 5630.2). Total num frames: 564998144. Throughput: 0: 5899.4. Samples: 565004926. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:41:51,318][25689] Avg episode reward: [(0, '-26.208')] [2022-07-10 03:41:51,435][26022] Updated weights on worker 0-0, policy_version 551757 (0.00081) [2022-07-10 03:41:53,419][26022] Updated weights on worker 0-0, policy_version 551767 (0.01320) [2022-07-10 03:41:54,927][26022] Updated weights on worker 0-0, policy_version 551777 (0.00091) [2022-07-10 03:41:56,348][25689] Fps is (10 sec: 5401.6, 60 sec: 5629.3, 300 sec: 5631.0). Total num frames: 565024768. Throughput: 0: 5082.9. Samples: 565021910. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:41:56,348][25689] Avg episode reward: [(0, '-26.860')] [2022-07-10 03:41:56,464][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:41:56,473][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000551783_565025792.pth [2022-07-10 03:41:56,473][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000549803_562998272.pth [2022-07-10 03:41:57,169][26022] Updated weights on worker 0-0, policy_version 551787 (0.00094) [2022-07-10 03:41:58,652][26022] Updated weights on worker 0-0, policy_version 551797 (0.00091) [2022-07-10 03:42:00,699][26022] Updated weights on worker 0-0, policy_version 551807 (0.00087) [2022-07-10 03:42:01,402][25689] Fps is (10 sec: 5686.0, 60 sec: 5647.0, 300 sec: 5641.5). Total num frames: 565055488. Throughput: 0: 5916.7. Samples: 565055806. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:01,403][25689] Avg episode reward: [(0, '-26.961')] [2022-07-10 03:42:02,686][26022] Updated weights on worker 0-0, policy_version 551817 (0.00088) [2022-07-10 03:42:04,834][26022] Updated weights on worker 0-0, policy_version 551827 (0.00085) [2022-07-10 03:42:06,417][25689] Fps is (10 sec: 5491.9, 60 sec: 5612.6, 300 sec: 5631.0). Total num frames: 565080064. Throughput: 0: 5779.0. Samples: 565087278. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:06,417][25689] Avg episode reward: [(0, '-26.080')] [2022-07-10 03:42:06,639][26022] Updated weights on worker 0-0, policy_version 551837 (0.00085) [2022-07-10 03:42:08,198][26022] Updated weights on worker 0-0, policy_version 551847 (0.00076) [2022-07-10 03:42:10,190][26022] Updated weights on worker 0-0, policy_version 551857 (0.00095) [2022-07-10 03:42:11,432][25689] Fps is (10 sec: 5308.9, 60 sec: 5612.5, 300 sec: 5628.0). Total num frames: 565108736. Throughput: 0: 4937.2. Samples: 565104238. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:11,433][25689] Avg episode reward: [(0, '-25.478')] [2022-07-10 03:42:11,888][26022] Updated weights on worker 0-0, policy_version 551867 (0.00089) [2022-07-10 03:42:13,538][26022] Updated weights on worker 0-0, policy_version 551877 (0.00084) [2022-07-10 03:42:15,422][26022] Updated weights on worker 0-0, policy_version 551887 (0.00096) [2022-07-10 03:42:16,453][25689] Fps is (10 sec: 5713.6, 60 sec: 5628.5, 300 sec: 5632.6). Total num frames: 565137408. Throughput: 0: 5795.3. Samples: 565138426. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:16,453][25689] Avg episode reward: [(0, '-27.512')] [2022-07-10 03:42:17,192][26022] Updated weights on worker 0-0, policy_version 551897 (0.00091) [2022-07-10 03:42:19,197][26022] Updated weights on worker 0-0, policy_version 551907 (0.00399) [2022-07-10 03:42:20,831][26022] Updated weights on worker 0-0, policy_version 551917 (0.00089) [2022-07-10 03:42:21,591][25689] Fps is (10 sec: 5644.9, 60 sec: 5623.2, 300 sec: 5630.0). Total num frames: 565166080. Throughput: 0: 5777.8. Samples: 565172452. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:21,591][25689] Avg episode reward: [(0, '-27.447')] [2022-07-10 03:42:22,575][26022] Updated weights on worker 0-0, policy_version 551927 (0.00085) [2022-07-10 03:42:24,447][26022] Updated weights on worker 0-0, policy_version 551937 (0.00083) [2022-07-10 03:42:26,438][26022] Updated weights on worker 0-0, policy_version 551947 (0.00083) [2022-07-10 03:42:26,643][25689] Fps is (10 sec: 5627.3, 60 sec: 5603.6, 300 sec: 5629.3). Total num frames: 565194752. Throughput: 0: 5048.7. Samples: 565189398. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:26,644][25689] Avg episode reward: [(0, '-26.891')] [2022-07-10 03:42:28,031][26022] Updated weights on worker 0-0, policy_version 551957 (0.00086) [2022-07-10 03:42:29,965][26022] Updated weights on worker 0-0, policy_version 551967 (0.00090) [2022-07-10 03:42:31,677][25689] Fps is (10 sec: 5685.2, 60 sec: 5641.5, 300 sec: 5628.8). Total num frames: 565223424. Throughput: 0: 5907.0. Samples: 565223826. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:31,678][25689] Avg episode reward: [(0, '-27.163')] [2022-07-10 03:42:31,755][26022] Updated weights on worker 0-0, policy_version 551977 (0.00095) [2022-07-10 03:42:33,509][26022] Updated weights on worker 0-0, policy_version 551987 (0.00081) [2022-07-10 03:42:35,314][26022] Updated weights on worker 0-0, policy_version 551997 (0.00076) [2022-07-10 03:42:36,714][25689] Fps is (10 sec: 5694.2, 60 sec: 5647.2, 300 sec: 5630.8). Total num frames: 565252096. Throughput: 0: 5901.0. Samples: 565257986. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:36,714][25689] Avg episode reward: [(0, '-27.283')] [2022-07-10 03:42:37,047][26022] Updated weights on worker 0-0, policy_version 552007 (0.00091) [2022-07-10 03:42:38,926][26022] Updated weights on worker 0-0, policy_version 552017 (0.00091) [2022-07-10 03:42:40,578][26022] Updated weights on worker 0-0, policy_version 552027 (0.00091) [2022-07-10 03:42:41,747][25689] Fps is (10 sec: 5695.0, 60 sec: 5635.9, 300 sec: 5630.3). Total num frames: 565280768. Throughput: 0: 5091.7. Samples: 565275078. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:41,747][25689] Avg episode reward: [(0, '-26.585')] [2022-07-10 03:42:42,551][26022] Updated weights on worker 0-0, policy_version 552037 (0.00074) [2022-07-10 03:42:44,145][26022] Updated weights on worker 0-0, policy_version 552047 (0.00087) [2022-07-10 03:42:46,142][26022] Updated weights on worker 0-0, policy_version 552057 (0.00089) [2022-07-10 03:42:46,749][25689] Fps is (10 sec: 5714.2, 60 sec: 5607.1, 300 sec: 5630.7). Total num frames: 565309440. Throughput: 0: 5955.1. Samples: 565309132. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:46,750][25689] Avg episode reward: [(0, '-25.904')] [2022-07-10 03:42:48,011][26022] Updated weights on worker 0-0, policy_version 552067 (0.00090) [2022-07-10 03:42:49,735][26022] Updated weights on worker 0-0, policy_version 552077 (0.00086) [2022-07-10 03:42:51,496][26022] Updated weights on worker 0-0, policy_version 552087 (0.00080) [2022-07-10 03:42:51,755][25689] Fps is (10 sec: 5729.7, 60 sec: 5625.1, 300 sec: 5630.7). Total num frames: 565338112. Throughput: 0: 5962.3. Samples: 565343534. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:51,755][25689] Avg episode reward: [(0, '-26.038')] [2022-07-10 03:42:53,398][26022] Updated weights on worker 0-0, policy_version 552097 (0.00203) [2022-07-10 03:42:55,103][26022] Updated weights on worker 0-0, policy_version 552107 (0.00103) [2022-07-10 03:42:56,772][25689] Fps is (10 sec: 5517.0, 60 sec: 5626.3, 300 sec: 5624.3). Total num frames: 565364736. Throughput: 0: 5118.6. Samples: 565360656. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:42:56,773][25689] Avg episode reward: [(0, '-25.306')] [2022-07-10 03:42:57,143][26022] Updated weights on worker 0-0, policy_version 552117 (0.00092) [2022-07-10 03:42:58,819][26022] Updated weights on worker 0-0, policy_version 552127 (0.00098) [2022-07-10 03:43:00,627][26022] Updated weights on worker 0-0, policy_version 552137 (0.00099) [2022-07-10 03:43:01,819][25689] Fps is (10 sec: 5596.2, 60 sec: 5610.1, 300 sec: 5637.9). Total num frames: 565394432. Throughput: 0: 5939.3. Samples: 565394292. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:01,819][25689] Avg episode reward: [(0, '-25.106')] [2022-07-10 03:43:03,157][26022] Updated weights on worker 0-0, policy_version 552147 (0.00091) [2022-07-10 03:43:04,456][26022] Updated weights on worker 0-0, policy_version 552157 (0.00086) [2022-07-10 03:43:06,608][26022] Updated weights on worker 0-0, policy_version 552167 (0.00094) [2022-07-10 03:43:06,835][25689] Fps is (10 sec: 5495.2, 60 sec: 5626.9, 300 sec: 5630.8). Total num frames: 565420032. Throughput: 0: 5822.1. Samples: 565426072. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:06,835][25689] Avg episode reward: [(0, '-24.750')] [2022-07-10 03:43:08,099][26022] Updated weights on worker 0-0, policy_version 552177 (0.00092) [2022-07-10 03:43:09,963][26022] Updated weights on worker 0-0, policy_version 552187 (0.00084) [2022-07-10 03:43:11,843][25689] Fps is (10 sec: 5414.2, 60 sec: 5627.6, 300 sec: 5627.5). Total num frames: 565448704. Throughput: 0: 4947.0. Samples: 565442908. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:11,843][25689] Avg episode reward: [(0, '-24.119')] [2022-07-10 03:43:11,998][26022] Updated weights on worker 0-0, policy_version 552197 (0.00104) [2022-07-10 03:43:13,564][26022] Updated weights on worker 0-0, policy_version 552207 (0.00084) [2022-07-10 03:43:15,648][26022] Updated weights on worker 0-0, policy_version 552217 (0.00082) [2022-07-10 03:43:16,858][25689] Fps is (10 sec: 5823.4, 60 sec: 5645.1, 300 sec: 5632.6). Total num frames: 565478400. Throughput: 0: 5787.0. Samples: 565476890. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:16,858][25689] Avg episode reward: [(0, '-24.098')] [2022-07-10 03:43:17,012][26022] Updated weights on worker 0-0, policy_version 552227 (0.00097) [2022-07-10 03:43:19,263][26022] Updated weights on worker 0-0, policy_version 552237 (0.00083) [2022-07-10 03:43:21,058][26022] Updated weights on worker 0-0, policy_version 552247 (0.00782) [2022-07-10 03:43:21,948][25689] Fps is (10 sec: 5573.2, 60 sec: 5615.6, 300 sec: 5624.7). Total num frames: 565505024. Throughput: 0: 5777.9. Samples: 565510598. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:21,949][25689] Avg episode reward: [(0, '-24.887')] [2022-07-10 03:43:22,694][26022] Updated weights on worker 0-0, policy_version 552257 (0.00087) [2022-07-10 03:43:24,731][26022] Updated weights on worker 0-0, policy_version 552267 (0.00093) [2022-07-10 03:43:26,402][26022] Updated weights on worker 0-0, policy_version 552277 (0.00089) [2022-07-10 03:43:26,964][25689] Fps is (10 sec: 5572.9, 60 sec: 5636.0, 300 sec: 5631.4). Total num frames: 565534720. Throughput: 0: 5050.4. Samples: 565527732. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:26,964][25689] Avg episode reward: [(0, '-25.418')] [2022-07-10 03:43:28,226][26022] Updated weights on worker 0-0, policy_version 552287 (0.00091) [2022-07-10 03:43:30,219][26022] Updated weights on worker 0-0, policy_version 552297 (0.00794) [2022-07-10 03:43:31,673][26022] Updated weights on worker 0-0, policy_version 552307 (0.00089) [2022-07-10 03:43:31,973][25689] Fps is (10 sec: 5719.9, 60 sec: 5621.3, 300 sec: 5627.8). Total num frames: 565562368. Throughput: 0: 5882.7. Samples: 565561332. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:31,974][25689] Avg episode reward: [(0, '-25.213')] [2022-07-10 03:43:33,879][26022] Updated weights on worker 0-0, policy_version 552317 (0.00084) [2022-07-10 03:43:35,440][26022] Updated weights on worker 0-0, policy_version 552327 (0.00086) [2022-07-10 03:43:37,018][25689] Fps is (10 sec: 5601.8, 60 sec: 5620.6, 300 sec: 5629.0). Total num frames: 565591040. Throughput: 0: 5885.1. Samples: 565595534. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:37,018][25689] Avg episode reward: [(0, '-24.921')] [2022-07-10 03:43:37,230][26022] Updated weights on worker 0-0, policy_version 552337 (0.00095) [2022-07-10 03:43:39,083][26022] Updated weights on worker 0-0, policy_version 552347 (0.00080) [2022-07-10 03:43:40,978][26022] Updated weights on worker 0-0, policy_version 552357 (0.00084) [2022-07-10 03:43:42,078][25689] Fps is (10 sec: 5675.1, 60 sec: 5618.0, 300 sec: 5624.5). Total num frames: 565619712. Throughput: 0: 5062.0. Samples: 565612496. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:42,080][25689] Avg episode reward: [(0, '-25.714')] [2022-07-10 03:43:42,671][26022] Updated weights on worker 0-0, policy_version 552367 (0.00089) [2022-07-10 03:43:44,459][26022] Updated weights on worker 0-0, policy_version 552377 (0.00096) [2022-07-10 03:43:46,321][26022] Updated weights on worker 0-0, policy_version 552387 (0.00089) [2022-07-10 03:43:47,145][25689] Fps is (10 sec: 5560.9, 60 sec: 5595.1, 300 sec: 5623.5). Total num frames: 565647360. Throughput: 0: 5881.8. Samples: 565646438. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:47,146][25689] Avg episode reward: [(0, '-25.097')] [2022-07-10 03:43:48,134][26022] Updated weights on worker 0-0, policy_version 552397 (0.00090) [2022-07-10 03:43:49,928][26022] Updated weights on worker 0-0, policy_version 552407 (0.00091) [2022-07-10 03:43:51,890][26022] Updated weights on worker 0-0, policy_version 552417 (0.00093) [2022-07-10 03:43:52,156][25689] Fps is (10 sec: 5588.3, 60 sec: 5594.6, 300 sec: 5624.3). Total num frames: 565676032. Throughput: 0: 5891.0. Samples: 565680228. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:52,158][25689] Avg episode reward: [(0, '-25.770')] [2022-07-10 03:43:53,600][26022] Updated weights on worker 0-0, policy_version 552427 (0.00091) [2022-07-10 03:43:55,472][26022] Updated weights on worker 0-0, policy_version 552437 (0.00091) [2022-07-10 03:43:56,503][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:43:56,514][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000552443_565701632.pth [2022-07-10 03:43:56,514][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000550463_563674112.pth [2022-07-10 03:43:57,165][25689] Fps is (10 sec: 5621.1, 60 sec: 5612.3, 300 sec: 5624.9). Total num frames: 565703680. Throughput: 0: 5886.9. Samples: 565714138. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:43:57,167][25689] Avg episode reward: [(0, '-26.520')] [2022-07-10 03:43:57,296][26022] Updated weights on worker 0-0, policy_version 552447 (0.00097) [2022-07-10 03:43:59,156][26022] Updated weights on worker 0-0, policy_version 552457 (0.00088) [2022-07-10 03:44:00,831][26022] Updated weights on worker 0-0, policy_version 552467 (0.00090) [2022-07-10 03:44:02,266][25689] Fps is (10 sec: 5469.7, 60 sec: 5573.4, 300 sec: 5623.1). Total num frames: 565731328. Throughput: 0: 5884.7. Samples: 565731294. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:44:02,267][25689] Avg episode reward: [(0, '-27.464')] [2022-07-10 03:44:03,028][26022] Updated weights on worker 0-0, policy_version 552477 (0.00086) [2022-07-10 03:44:04,946][26022] Updated weights on worker 0-0, policy_version 552487 (0.00090) [2022-07-10 03:44:06,921][26022] Updated weights on worker 0-0, policy_version 552497 (0.00086) [2022-07-10 03:44:07,307][25689] Fps is (10 sec: 5654.2, 60 sec: 5638.8, 300 sec: 5633.6). Total num frames: 565761024. Throughput: 0: 5792.5. Samples: 565763222. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:44:07,307][25689] Avg episode reward: [(0, '-27.574')] [2022-07-10 03:44:08,476][26022] Updated weights on worker 0-0, policy_version 552507 (0.00086) [2022-07-10 03:44:10,251][26022] Updated weights on worker 0-0, policy_version 552517 (0.00092) [2022-07-10 03:44:11,897][26022] Updated weights on worker 0-0, policy_version 552527 (0.00087) [2022-07-10 03:44:12,358][25689] Fps is (10 sec: 5681.9, 60 sec: 5617.9, 300 sec: 5622.5). Total num frames: 565788672. Throughput: 0: 5800.3. Samples: 565797404. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:44:12,359][25689] Avg episode reward: [(0, '-27.727')] [2022-07-10 03:44:13,910][26022] Updated weights on worker 0-0, policy_version 552537 (0.00079) [2022-07-10 03:44:15,825][26022] Updated weights on worker 0-0, policy_version 552547 (0.00085) [2022-07-10 03:44:17,339][26022] Updated weights on worker 0-0, policy_version 552557 (0.00093) [2022-07-10 03:44:17,391][25689] Fps is (10 sec: 5686.6, 60 sec: 5616.2, 300 sec: 5631.1). Total num frames: 565818368. Throughput: 0: 4960.6. Samples: 565814470. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:44:17,391][25689] Avg episode reward: [(0, '-27.514')] [2022-07-10 03:44:19,535][26022] Updated weights on worker 0-0, policy_version 552567 (0.00598) [2022-07-10 03:44:21,238][26022] Updated weights on worker 0-0, policy_version 552577 (0.00091) [2022-07-10 03:44:22,503][25689] Fps is (10 sec: 5551.6, 60 sec: 5614.2, 300 sec: 5623.0). Total num frames: 565844992. Throughput: 0: 5791.0. Samples: 565848488. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:44:22,503][25689] Avg episode reward: [(0, '-26.222')] [2022-07-10 03:44:22,959][26022] Updated weights on worker 0-0, policy_version 552587 (0.00082) [2022-07-10 03:44:24,891][26022] Updated weights on worker 0-0, policy_version 552597 (0.00085) [2022-07-10 03:44:26,464][26022] Updated weights on worker 0-0, policy_version 552607 (0.00088) [2022-07-10 03:44:27,523][25689] Fps is (10 sec: 5457.5, 60 sec: 5596.9, 300 sec: 5622.8). Total num frames: 565873664. Throughput: 0: 5896.4. Samples: 565882426. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 03:44:27,523][25689] Avg episode reward: [(0, '-26.056')] [2022-07-10 03:44:28,535][26022] Updated weights on worker 0-0, policy_version 552617 (0.00093) [2022-07-10 03:44:30,265][26022] Updated weights on worker 0-0, policy_version 552627 (0.00095) [2022-07-10 03:44:32,177][26022] Updated weights on worker 0-0, policy_version 552637 (0.00240) [2022-07-10 03:44:32,542][25689] Fps is (10 sec: 5814.3, 60 sec: 5629.9, 300 sec: 5627.0). Total num frames: 565903360. Throughput: 0: 5053.4. Samples: 565899400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:44:32,542][25689] Avg episode reward: [(0, '-25.084')] [2022-07-10 03:44:33,866][26022] Updated weights on worker 0-0, policy_version 552647 (0.00089) [2022-07-10 03:44:35,817][26022] Updated weights on worker 0-0, policy_version 552657 (0.00090) [2022-07-10 03:44:37,287][26022] Updated weights on worker 0-0, policy_version 552667 (0.00058) [2022-07-10 03:44:37,543][25689] Fps is (10 sec: 5825.3, 60 sec: 5633.9, 300 sec: 5631.2). Total num frames: 565932032. Throughput: 0: 5923.5. Samples: 565933840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:44:37,543][25689] Avg episode reward: [(0, '-25.598')] [2022-07-10 03:44:39,571][26022] Updated weights on worker 0-0, policy_version 552677 (0.00085) [2022-07-10 03:44:40,842][26022] Updated weights on worker 0-0, policy_version 552687 (0.00088) [2022-07-10 03:44:42,644][25689] Fps is (10 sec: 5574.8, 60 sec: 5613.2, 300 sec: 5622.8). Total num frames: 565959680. Throughput: 0: 5937.4. Samples: 565968074. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:44:42,645][25689] Avg episode reward: [(0, '-25.488')] [2022-07-10 03:44:42,915][26022] Updated weights on worker 0-0, policy_version 552697 (0.00460) [2022-07-10 03:44:44,449][26022] Updated weights on worker 0-0, policy_version 552707 (0.00083) [2022-07-10 03:44:46,355][26022] Updated weights on worker 0-0, policy_version 552717 (0.00098) [2022-07-10 03:44:47,669][25689] Fps is (10 sec: 5763.8, 60 sec: 5667.8, 300 sec: 5632.7). Total num frames: 565990400. Throughput: 0: 5113.2. Samples: 565985438. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:44:47,670][25689] Avg episode reward: [(0, '-26.256')] [2022-07-10 03:44:48,299][26022] Updated weights on worker 0-0, policy_version 552727 (0.00089) [2022-07-10 03:44:50,006][26022] Updated weights on worker 0-0, policy_version 552737 (0.00085) [2022-07-10 03:44:51,748][26022] Updated weights on worker 0-0, policy_version 552747 (0.00090) [2022-07-10 03:44:52,698][25689] Fps is (10 sec: 5805.8, 60 sec: 5649.3, 300 sec: 5628.8). Total num frames: 566018048. Throughput: 0: 5960.2. Samples: 566019534. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:44:52,698][25689] Avg episode reward: [(0, '-27.158')] [2022-07-10 03:44:53,663][26022] Updated weights on worker 0-0, policy_version 552757 (0.00089) [2022-07-10 03:44:55,170][26022] Updated weights on worker 0-0, policy_version 552767 (0.00086) [2022-07-10 03:44:57,392][26022] Updated weights on worker 0-0, policy_version 552777 (0.00087) [2022-07-10 03:44:57,779][25689] Fps is (10 sec: 5469.7, 60 sec: 5642.5, 300 sec: 5621.8). Total num frames: 566045696. Throughput: 0: 5904.1. Samples: 566053316. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:44:57,779][25689] Avg episode reward: [(0, '-27.295')] [2022-07-10 03:44:59,028][26022] Updated weights on worker 0-0, policy_version 552787 (0.00090) [2022-07-10 03:45:00,896][26022] Updated weights on worker 0-0, policy_version 552797 (0.00091) [2022-07-10 03:45:02,919][25689] Fps is (10 sec: 5309.8, 60 sec: 5622.0, 300 sec: 5626.0). Total num frames: 566072320. Throughput: 0: 5040.8. Samples: 566070274. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:02,919][25689] Avg episode reward: [(0, '-26.678')] [2022-07-10 03:45:03,094][26022] Updated weights on worker 0-0, policy_version 552807 (0.00087) [2022-07-10 03:45:04,632][26022] Updated weights on worker 0-0, policy_version 552817 (0.00093) [2022-07-10 03:45:06,671][26022] Updated weights on worker 0-0, policy_version 552827 (0.00086) [2022-07-10 03:45:08,009][25689] Fps is (10 sec: 5605.4, 60 sec: 5634.3, 300 sec: 5628.0). Total num frames: 566103040. Throughput: 0: 5750.2. Samples: 566102394. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:08,011][25689] Avg episode reward: [(0, '-27.197')] [2022-07-10 03:45:08,583][26022] Updated weights on worker 0-0, policy_version 552837 (0.00087) [2022-07-10 03:45:10,238][26022] Updated weights on worker 0-0, policy_version 552847 (0.00086) [2022-07-10 03:45:12,290][26022] Updated weights on worker 0-0, policy_version 552857 (0.00087) [2022-07-10 03:45:13,064][25689] Fps is (10 sec: 5752.9, 60 sec: 5633.9, 300 sec: 5627.3). Total num frames: 566130688. Throughput: 0: 5748.2. Samples: 566136608. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:13,065][25689] Avg episode reward: [(0, '-26.242')] [2022-07-10 03:45:13,653][26022] Updated weights on worker 0-0, policy_version 552867 (0.00079) [2022-07-10 03:45:15,791][26022] Updated weights on worker 0-0, policy_version 552877 (0.00084) [2022-07-10 03:45:17,223][26022] Updated weights on worker 0-0, policy_version 552887 (0.00093) [2022-07-10 03:45:18,122][25689] Fps is (10 sec: 5468.0, 60 sec: 5597.9, 300 sec: 5625.4). Total num frames: 566158336. Throughput: 0: 4937.1. Samples: 566153750. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:18,122][25689] Avg episode reward: [(0, '-25.636')] [2022-07-10 03:45:19,295][26022] Updated weights on worker 0-0, policy_version 552897 (0.00082) [2022-07-10 03:45:21,044][26022] Updated weights on worker 0-0, policy_version 552907 (0.00080) [2022-07-10 03:45:22,928][26022] Updated weights on worker 0-0, policy_version 552917 (0.00089) [2022-07-10 03:45:23,183][25689] Fps is (10 sec: 5768.2, 60 sec: 5670.1, 300 sec: 5624.4). Total num frames: 566189056. Throughput: 0: 5818.2. Samples: 566188178. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:23,184][25689] Avg episode reward: [(0, '-25.306')] [2022-07-10 03:45:24,465][26022] Updated weights on worker 0-0, policy_version 552927 (0.00080) [2022-07-10 03:45:26,468][26022] Updated weights on worker 0-0, policy_version 552937 (0.00083) [2022-07-10 03:45:28,147][26022] Updated weights on worker 0-0, policy_version 552947 (0.00091) [2022-07-10 03:45:28,229][25689] Fps is (10 sec: 5876.0, 60 sec: 5667.7, 300 sec: 5634.4). Total num frames: 566217728. Throughput: 0: 5918.3. Samples: 566222064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:28,230][25689] Avg episode reward: [(0, '-25.024')] [2022-07-10 03:45:30,151][26022] Updated weights on worker 0-0, policy_version 552957 (0.00090) [2022-07-10 03:45:32,008][26022] Updated weights on worker 0-0, policy_version 552967 (0.00084) [2022-07-10 03:45:33,235][25689] Fps is (10 sec: 5603.2, 60 sec: 5635.2, 300 sec: 5628.2). Total num frames: 566245376. Throughput: 0: 5075.9. Samples: 566238990. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:33,235][25689] Avg episode reward: [(0, '-25.245')] [2022-07-10 03:45:33,693][26022] Updated weights on worker 0-0, policy_version 552977 (0.00091) [2022-07-10 03:45:35,640][26022] Updated weights on worker 0-0, policy_version 552987 (0.00089) [2022-07-10 03:45:37,437][26022] Updated weights on worker 0-0, policy_version 552997 (0.00086) [2022-07-10 03:45:38,255][25689] Fps is (10 sec: 5515.3, 60 sec: 5616.5, 300 sec: 5620.0). Total num frames: 566273024. Throughput: 0: 5911.8. Samples: 566272774. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:38,256][25689] Avg episode reward: [(0, '-26.211')] [2022-07-10 03:45:39,244][26022] Updated weights on worker 0-0, policy_version 553007 (0.00088) [2022-07-10 03:45:41,193][26022] Updated weights on worker 0-0, policy_version 553017 (0.00089) [2022-07-10 03:45:42,903][26022] Updated weights on worker 0-0, policy_version 553027 (0.00089) [2022-07-10 03:45:43,341][25689] Fps is (10 sec: 5673.8, 60 sec: 5651.6, 300 sec: 5626.0). Total num frames: 566302720. Throughput: 0: 5853.6. Samples: 566306176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:43,342][25689] Avg episode reward: [(0, '-25.398')] [2022-07-10 03:45:44,765][26022] Updated weights on worker 0-0, policy_version 553037 (0.00083) [2022-07-10 03:45:46,456][26022] Updated weights on worker 0-0, policy_version 553047 (0.00524) [2022-07-10 03:45:48,379][25689] Fps is (10 sec: 5664.4, 60 sec: 5599.9, 300 sec: 5622.0). Total num frames: 566330368. Throughput: 0: 5025.4. Samples: 566323322. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:48,379][25689] Avg episode reward: [(0, '-25.284')] [2022-07-10 03:45:48,381][26022] Updated weights on worker 0-0, policy_version 553057 (0.00098) [2022-07-10 03:45:50,275][26022] Updated weights on worker 0-0, policy_version 553067 (0.00090) [2022-07-10 03:45:51,827][26022] Updated weights on worker 0-0, policy_version 553077 (0.00080) [2022-07-10 03:45:53,416][25689] Fps is (10 sec: 5488.5, 60 sec: 5599.1, 300 sec: 5625.0). Total num frames: 566358016. Throughput: 0: 5873.1. Samples: 566357518. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:53,417][25689] Avg episode reward: [(0, '-26.055')] [2022-07-10 03:45:53,702][26022] Updated weights on worker 0-0, policy_version 553087 (0.00093) [2022-07-10 03:45:55,427][26022] Updated weights on worker 0-0, policy_version 553097 (0.00088) [2022-07-10 03:45:56,643][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:45:56,657][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000553102_566376448.pth [2022-07-10 03:45:56,658][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000551124_564350976.pth [2022-07-10 03:45:57,410][26022] Updated weights on worker 0-0, policy_version 553107 (0.00097) [2022-07-10 03:45:58,427][25689] Fps is (10 sec: 5706.8, 60 sec: 5639.4, 300 sec: 5626.0). Total num frames: 566387712. Throughput: 0: 5885.5. Samples: 566391496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:45:58,427][25689] Avg episode reward: [(0, '-25.709')] [2022-07-10 03:45:59,399][26022] Updated weights on worker 0-0, policy_version 553117 (0.00089) [2022-07-10 03:46:00,879][26022] Updated weights on worker 0-0, policy_version 553127 (0.00092) [2022-07-10 03:46:03,331][26022] Updated weights on worker 0-0, policy_version 553137 (0.00083) [2022-07-10 03:46:03,481][25689] Fps is (10 sec: 5494.3, 60 sec: 5630.5, 300 sec: 5621.7). Total num frames: 566413312. Throughput: 0: 5813.0. Samples: 566423246. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:03,481][25689] Avg episode reward: [(0, '-26.244')] [2022-07-10 03:46:04,778][26022] Updated weights on worker 0-0, policy_version 553147 (0.00084) [2022-07-10 03:46:06,836][26022] Updated weights on worker 0-0, policy_version 553157 (0.00085) [2022-07-10 03:46:08,492][25689] Fps is (10 sec: 5392.2, 60 sec: 5604.0, 300 sec: 5621.7). Total num frames: 566441984. Throughput: 0: 5816.2. Samples: 566440304. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:08,493][25689] Avg episode reward: [(0, '-25.946')] [2022-07-10 03:46:08,502][26022] Updated weights on worker 0-0, policy_version 553167 (0.00083) [2022-07-10 03:46:10,338][26022] Updated weights on worker 0-0, policy_version 553177 (0.00090) [2022-07-10 03:46:12,172][26022] Updated weights on worker 0-0, policy_version 553187 (0.00086) [2022-07-10 03:46:13,586][25689] Fps is (10 sec: 5776.1, 60 sec: 5634.2, 300 sec: 5627.1). Total num frames: 566471680. Throughput: 0: 5791.1. Samples: 566474322. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:13,588][25689] Avg episode reward: [(0, '-27.198')] [2022-07-10 03:46:13,946][26022] Updated weights on worker 0-0, policy_version 553197 (0.00091) [2022-07-10 03:46:15,802][26022] Updated weights on worker 0-0, policy_version 553207 (0.00086) [2022-07-10 03:46:17,747][26022] Updated weights on worker 0-0, policy_version 553217 (0.00088) [2022-07-10 03:46:18,624][25689] Fps is (10 sec: 5659.5, 60 sec: 5636.0, 300 sec: 5624.4). Total num frames: 566499328. Throughput: 0: 5793.8. Samples: 566508514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:18,625][25689] Avg episode reward: [(0, '-26.841')] [2022-07-10 03:46:19,294][26022] Updated weights on worker 0-0, policy_version 553227 (0.00094) [2022-07-10 03:46:21,403][26022] Updated weights on worker 0-0, policy_version 553237 (0.00090) [2022-07-10 03:46:22,868][26022] Updated weights on worker 0-0, policy_version 553247 (0.00088) [2022-07-10 03:46:23,741][25689] Fps is (10 sec: 5546.0, 60 sec: 5597.1, 300 sec: 5619.2). Total num frames: 566528000. Throughput: 0: 5051.0. Samples: 566525580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:23,741][25689] Avg episode reward: [(0, '-27.071')] [2022-07-10 03:46:24,972][26022] Updated weights on worker 0-0, policy_version 553257 (0.00098) [2022-07-10 03:46:26,614][26022] Updated weights on worker 0-0, policy_version 553267 (0.00087) [2022-07-10 03:46:28,327][26022] Updated weights on worker 0-0, policy_version 553277 (0.00088) [2022-07-10 03:46:28,750][25689] Fps is (10 sec: 5865.5, 60 sec: 5634.3, 300 sec: 5634.3). Total num frames: 566558720. Throughput: 0: 5901.9. Samples: 566559864. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:28,750][25689] Avg episode reward: [(0, '-26.158')] [2022-07-10 03:46:30,407][26022] Updated weights on worker 0-0, policy_version 553287 (0.00091) [2022-07-10 03:46:31,851][26022] Updated weights on worker 0-0, policy_version 553297 (0.00085) [2022-07-10 03:46:33,766][25689] Fps is (10 sec: 5720.0, 60 sec: 5616.4, 300 sec: 5629.0). Total num frames: 566585344. Throughput: 0: 5928.1. Samples: 566593952. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:33,766][25689] Avg episode reward: [(0, '-26.070')] [2022-07-10 03:46:33,858][26022] Updated weights on worker 0-0, policy_version 553307 (0.00094) [2022-07-10 03:46:35,738][26022] Updated weights on worker 0-0, policy_version 553317 (0.00079) [2022-07-10 03:46:37,365][26022] Updated weights on worker 0-0, policy_version 553327 (0.00085) [2022-07-10 03:46:38,792][25689] Fps is (10 sec: 5404.2, 60 sec: 5615.8, 300 sec: 5623.3). Total num frames: 566612992. Throughput: 0: 5071.8. Samples: 566610804. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:38,793][25689] Avg episode reward: [(0, '-26.566')] [2022-07-10 03:46:39,195][26022] Updated weights on worker 0-0, policy_version 553337 (0.00082) [2022-07-10 03:46:41,228][26022] Updated weights on worker 0-0, policy_version 553347 (0.00086) [2022-07-10 03:46:42,779][26022] Updated weights on worker 0-0, policy_version 553357 (0.00084) [2022-07-10 03:46:43,917][25689] Fps is (10 sec: 5649.0, 60 sec: 5612.3, 300 sec: 5618.6). Total num frames: 566642688. Throughput: 0: 5909.6. Samples: 566644814. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:43,918][25689] Avg episode reward: [(0, '-26.684')] [2022-07-10 03:46:44,810][26022] Updated weights on worker 0-0, policy_version 553367 (0.00091) [2022-07-10 03:46:46,571][26022] Updated weights on worker 0-0, policy_version 553377 (0.00096) [2022-07-10 03:46:48,325][26022] Updated weights on worker 0-0, policy_version 553387 (0.00087) [2022-07-10 03:46:49,001][25689] Fps is (10 sec: 5717.2, 60 sec: 5624.8, 300 sec: 5620.8). Total num frames: 566671360. Throughput: 0: 5869.0. Samples: 566678722. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:49,002][25689] Avg episode reward: [(0, '-25.988')] [2022-07-10 03:46:50,163][26022] Updated weights on worker 0-0, policy_version 553397 (0.00084) [2022-07-10 03:46:52,068][26022] Updated weights on worker 0-0, policy_version 553407 (0.00104) [2022-07-10 03:46:53,747][26022] Updated weights on worker 0-0, policy_version 553417 (0.00089) [2022-07-10 03:46:54,064][25689] Fps is (10 sec: 5651.5, 60 sec: 5639.4, 300 sec: 5627.1). Total num frames: 566700032. Throughput: 0: 5016.2. Samples: 566695770. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:54,064][25689] Avg episode reward: [(0, '-26.868')] [2022-07-10 03:46:55,780][26022] Updated weights on worker 0-0, policy_version 553427 (0.00089) [2022-07-10 03:46:57,260][26022] Updated weights on worker 0-0, policy_version 553437 (0.00081) [2022-07-10 03:46:59,069][25689] Fps is (10 sec: 5695.9, 60 sec: 5623.0, 300 sec: 5621.1). Total num frames: 566728704. Throughput: 0: 5879.2. Samples: 566730014. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:46:59,070][25689] Avg episode reward: [(0, '-26.797')] [2022-07-10 03:46:59,247][26022] Updated weights on worker 0-0, policy_version 553447 (0.00576) [2022-07-10 03:47:00,997][26022] Updated weights on worker 0-0, policy_version 553457 (0.00091) [2022-07-10 03:47:03,190][26022] Updated weights on worker 0-0, policy_version 553467 (0.00092) [2022-07-10 03:47:04,138][25689] Fps is (10 sec: 5387.4, 60 sec: 5621.6, 300 sec: 5623.5). Total num frames: 566754304. Throughput: 0: 5772.0. Samples: 566761528. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:47:04,141][25689] Avg episode reward: [(0, '-26.085')] [2022-07-10 03:47:04,919][26022] Updated weights on worker 0-0, policy_version 553477 (0.00084) [2022-07-10 03:47:06,904][26022] Updated weights on worker 0-0, policy_version 553487 (0.00092) [2022-07-10 03:47:08,518][26022] Updated weights on worker 0-0, policy_version 553497 (0.00089) [2022-07-10 03:47:09,152][25689] Fps is (10 sec: 5382.5, 60 sec: 5621.3, 300 sec: 5623.5). Total num frames: 566782976. Throughput: 0: 4963.1. Samples: 566778730. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:47:09,152][25689] Avg episode reward: [(0, '-25.689')] [2022-07-10 03:47:10,672][26022] Updated weights on worker 0-0, policy_version 553507 (0.00092) [2022-07-10 03:47:12,181][26022] Updated weights on worker 0-0, policy_version 553517 (0.00091) [2022-07-10 03:47:14,223][25689] Fps is (10 sec: 5584.3, 60 sec: 5589.7, 300 sec: 5619.2). Total num frames: 566810624. Throughput: 0: 5779.5. Samples: 566812280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:47:14,225][25689] Avg episode reward: [(0, '-26.664')] [2022-07-10 03:47:14,230][26022] Updated weights on worker 0-0, policy_version 553527 (0.00099) [2022-07-10 03:47:15,938][26022] Updated weights on worker 0-0, policy_version 553537 (0.00091) [2022-07-10 03:47:17,629][26022] Updated weights on worker 0-0, policy_version 553547 (0.00094) [2022-07-10 03:47:19,227][25689] Fps is (10 sec: 5590.3, 60 sec: 5609.8, 300 sec: 5621.7). Total num frames: 566839296. Throughput: 0: 5766.7. Samples: 566846256. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:47:19,227][25689] Avg episode reward: [(0, '-26.260')] [2022-07-10 03:47:19,635][26022] Updated weights on worker 0-0, policy_version 553557 (0.00083) [2022-07-10 03:47:21,572][26022] Updated weights on worker 0-0, policy_version 553567 (0.00047) [2022-07-10 03:47:23,035][26022] Updated weights on worker 0-0, policy_version 553577 (0.00086) [2022-07-10 03:47:24,281][25689] Fps is (10 sec: 5803.2, 60 sec: 5632.5, 300 sec: 5625.1). Total num frames: 566868992. Throughput: 0: 5046.3. Samples: 566863176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:47:24,282][25689] Avg episode reward: [(0, '-26.123')] [2022-07-10 03:47:25,140][26022] Updated weights on worker 0-0, policy_version 553587 (0.00090) [2022-07-10 03:47:26,682][26022] Updated weights on worker 0-0, policy_version 553597 (0.00087) [2022-07-10 03:47:28,672][26022] Updated weights on worker 0-0, policy_version 553607 (0.00090) [2022-07-10 03:47:29,305][25689] Fps is (10 sec: 5689.9, 60 sec: 5580.4, 300 sec: 5621.8). Total num frames: 566896640. Throughput: 0: 5885.1. Samples: 566897332. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 03:47:29,306][25689] Avg episode reward: [(0, '-27.294')] [2022-07-10 03:47:30,274][26022] Updated weights on worker 0-0, policy_version 553617 (0.00086) [2022-07-10 03:47:32,299][26022] Updated weights on worker 0-0, policy_version 553627 (0.00083) [2022-07-10 03:47:33,943][26022] Updated weights on worker 0-0, policy_version 553637 (0.00092) [2022-07-10 03:47:34,314][25689] Fps is (10 sec: 5715.4, 60 sec: 5631.7, 300 sec: 5625.8). Total num frames: 566926336. Throughput: 0: 5940.7. Samples: 566931634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:47:34,315][25689] Avg episode reward: [(0, '-27.956')] [2022-07-10 03:47:35,912][26022] Updated weights on worker 0-0, policy_version 553647 (0.00092) [2022-07-10 03:47:37,428][26022] Updated weights on worker 0-0, policy_version 553657 (0.00094) [2022-07-10 03:47:39,324][25689] Fps is (10 sec: 5621.2, 60 sec: 5616.4, 300 sec: 5619.4). Total num frames: 566952960. Throughput: 0: 5097.2. Samples: 566948696. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:47:39,326][25689] Avg episode reward: [(0, '-27.518')] [2022-07-10 03:47:39,579][26022] Updated weights on worker 0-0, policy_version 553667 (0.00085) [2022-07-10 03:47:41,150][26022] Updated weights on worker 0-0, policy_version 553677 (0.00090) [2022-07-10 03:47:43,038][26022] Updated weights on worker 0-0, policy_version 553687 (0.00091) [2022-07-10 03:47:44,368][25689] Fps is (10 sec: 5601.9, 60 sec: 5623.9, 300 sec: 5622.0). Total num frames: 566982656. Throughput: 0: 5967.0. Samples: 566983036. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:47:44,368][25689] Avg episode reward: [(0, '-27.732')] [2022-07-10 03:47:44,670][26022] Updated weights on worker 0-0, policy_version 553697 (0.00103) [2022-07-10 03:47:46,643][26022] Updated weights on worker 0-0, policy_version 553707 (0.00090) [2022-07-10 03:47:48,257][26022] Updated weights on worker 0-0, policy_version 553717 (0.00090) [2022-07-10 03:47:49,396][25689] Fps is (10 sec: 5896.9, 60 sec: 5646.1, 300 sec: 5625.0). Total num frames: 567012352. Throughput: 0: 5977.2. Samples: 567017420. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:47:49,396][25689] Avg episode reward: [(0, '-28.091')] [2022-07-10 03:47:50,367][26022] Updated weights on worker 0-0, policy_version 553727 (0.00087) [2022-07-10 03:47:51,630][26022] Updated weights on worker 0-0, policy_version 553737 (0.00092) [2022-07-10 03:47:53,860][26022] Updated weights on worker 0-0, policy_version 553747 (0.00083) [2022-07-10 03:47:54,418][25689] Fps is (10 sec: 5807.5, 60 sec: 5649.8, 300 sec: 5631.8). Total num frames: 567041024. Throughput: 0: 5119.8. Samples: 567034564. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:47:54,419][25689] Avg episode reward: [(0, '-26.896')] [2022-07-10 03:47:55,095][26022] Updated weights on worker 0-0, policy_version 553757 (0.00085) [2022-07-10 03:47:56,933][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:47:56,943][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000553764_567054336.pth [2022-07-10 03:47:56,944][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000551783_565025792.pth [2022-07-10 03:47:57,457][26022] Updated weights on worker 0-0, policy_version 553767 (0.00094) [2022-07-10 03:47:58,767][26022] Updated weights on worker 0-0, policy_version 553777 (0.00088) [2022-07-10 03:47:59,455][25689] Fps is (10 sec: 5700.8, 60 sec: 5646.9, 300 sec: 5628.6). Total num frames: 567069696. Throughput: 0: 5979.4. Samples: 567069066. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:47:59,455][25689] Avg episode reward: [(0, '-27.310')] [2022-07-10 03:48:01,046][26022] Updated weights on worker 0-0, policy_version 553787 (0.00093) [2022-07-10 03:48:03,188][26022] Updated weights on worker 0-0, policy_version 553797 (0.00084) [2022-07-10 03:48:04,494][25689] Fps is (10 sec: 5487.8, 60 sec: 5666.6, 300 sec: 5631.6). Total num frames: 567096320. Throughput: 0: 5872.6. Samples: 567101230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:04,495][25689] Avg episode reward: [(0, '-27.016')] [2022-07-10 03:48:04,850][26022] Updated weights on worker 0-0, policy_version 553807 (0.00084) [2022-07-10 03:48:06,646][26022] Updated weights on worker 0-0, policy_version 553817 (0.00087) [2022-07-10 03:48:08,512][26022] Updated weights on worker 0-0, policy_version 553827 (0.00080) [2022-07-10 03:48:09,508][25689] Fps is (10 sec: 5500.2, 60 sec: 5666.6, 300 sec: 5631.4). Total num frames: 567124992. Throughput: 0: 5024.7. Samples: 567118478. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:09,509][25689] Avg episode reward: [(0, '-25.821')] [2022-07-10 03:48:10,195][26022] Updated weights on worker 0-0, policy_version 553837 (0.00089) [2022-07-10 03:48:12,153][26022] Updated weights on worker 0-0, policy_version 553847 (0.00092) [2022-07-10 03:48:13,758][26022] Updated weights on worker 0-0, policy_version 553857 (0.00057) [2022-07-10 03:48:14,525][25689] Fps is (10 sec: 5716.9, 60 sec: 5688.7, 300 sec: 5628.0). Total num frames: 567153664. Throughput: 0: 5866.5. Samples: 567152520. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:14,525][25689] Avg episode reward: [(0, '-26.351')] [2022-07-10 03:48:15,749][26022] Updated weights on worker 0-0, policy_version 553867 (0.00090) [2022-07-10 03:48:17,473][26022] Updated weights on worker 0-0, policy_version 553877 (0.00085) [2022-07-10 03:48:19,247][26022] Updated weights on worker 0-0, policy_version 553887 (0.00082) [2022-07-10 03:48:19,527][25689] Fps is (10 sec: 5723.8, 60 sec: 5688.9, 300 sec: 5636.5). Total num frames: 567182336. Throughput: 0: 5876.5. Samples: 567187020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:19,527][25689] Avg episode reward: [(0, '-26.008')] [2022-07-10 03:48:21,026][26022] Updated weights on worker 0-0, policy_version 553897 (0.00089) [2022-07-10 03:48:22,908][26022] Updated weights on worker 0-0, policy_version 553907 (0.00089) [2022-07-10 03:48:24,650][25689] Fps is (10 sec: 5663.8, 60 sec: 5665.5, 300 sec: 5631.1). Total num frames: 567211008. Throughput: 0: 5090.4. Samples: 567203828. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:24,650][25689] Avg episode reward: [(0, '-26.541')] [2022-07-10 03:48:24,651][26022] Updated weights on worker 0-0, policy_version 553917 (0.00087) [2022-07-10 03:48:26,573][26022] Updated weights on worker 0-0, policy_version 553927 (0.00088) [2022-07-10 03:48:28,206][26022] Updated weights on worker 0-0, policy_version 553937 (0.00093) [2022-07-10 03:48:29,663][25689] Fps is (10 sec: 5556.0, 60 sec: 5666.4, 300 sec: 5631.0). Total num frames: 567238656. Throughput: 0: 5914.0. Samples: 567237676. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:29,664][25689] Avg episode reward: [(0, '-26.651')] [2022-07-10 03:48:30,267][26022] Updated weights on worker 0-0, policy_version 553947 (0.00097) [2022-07-10 03:48:31,943][26022] Updated weights on worker 0-0, policy_version 553957 (0.00089) [2022-07-10 03:48:33,777][26022] Updated weights on worker 0-0, policy_version 553967 (0.00089) [2022-07-10 03:48:34,664][25689] Fps is (10 sec: 5623.6, 60 sec: 5650.2, 300 sec: 5631.8). Total num frames: 567267328. Throughput: 0: 5890.7. Samples: 567271156. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:34,665][25689] Avg episode reward: [(0, '-26.304')] [2022-07-10 03:48:35,673][26022] Updated weights on worker 0-0, policy_version 553977 (0.00085) [2022-07-10 03:48:37,306][26022] Updated weights on worker 0-0, policy_version 553987 (0.00094) [2022-07-10 03:48:39,368][26022] Updated weights on worker 0-0, policy_version 553997 (0.00084) [2022-07-10 03:48:39,717][25689] Fps is (10 sec: 5703.7, 60 sec: 5680.1, 300 sec: 5631.9). Total num frames: 567296000. Throughput: 0: 5007.2. Samples: 567288118. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:39,718][25689] Avg episode reward: [(0, '-26.386')] [2022-07-10 03:48:41,151][26022] Updated weights on worker 0-0, policy_version 554007 (0.00090) [2022-07-10 03:48:42,796][26022] Updated weights on worker 0-0, policy_version 554017 (0.00090) [2022-07-10 03:48:44,609][26022] Updated weights on worker 0-0, policy_version 554027 (0.00085) [2022-07-10 03:48:44,838][25689] Fps is (10 sec: 5636.4, 60 sec: 5655.9, 300 sec: 5634.4). Total num frames: 567324672. Throughput: 0: 5859.9. Samples: 567322132. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:44,839][25689] Avg episode reward: [(0, '-25.824')] [2022-07-10 03:48:46,662][26022] Updated weights on worker 0-0, policy_version 554037 (0.00092) [2022-07-10 03:48:48,107][26022] Updated weights on worker 0-0, policy_version 554047 (0.00089) [2022-07-10 03:48:49,865][25689] Fps is (10 sec: 5549.9, 60 sec: 5622.1, 300 sec: 5630.6). Total num frames: 567352320. Throughput: 0: 5885.7. Samples: 567356578. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:49,866][25689] Avg episode reward: [(0, '-26.393')] [2022-07-10 03:48:50,203][26022] Updated weights on worker 0-0, policy_version 554057 (0.00099) [2022-07-10 03:48:51,713][26022] Updated weights on worker 0-0, policy_version 554067 (0.00092) [2022-07-10 03:48:53,843][26022] Updated weights on worker 0-0, policy_version 554077 (0.00104) [2022-07-10 03:48:54,884][25689] Fps is (10 sec: 5606.2, 60 sec: 5622.5, 300 sec: 5633.9). Total num frames: 567380992. Throughput: 0: 5887.7. Samples: 567390206. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:54,885][25689] Avg episode reward: [(0, '-26.174')] [2022-07-10 03:48:55,550][26022] Updated weights on worker 0-0, policy_version 554087 (0.00095) [2022-07-10 03:48:57,199][26022] Updated weights on worker 0-0, policy_version 554097 (0.00088) [2022-07-10 03:48:59,074][26022] Updated weights on worker 0-0, policy_version 554107 (0.00089) [2022-07-10 03:48:59,910][25689] Fps is (10 sec: 5810.9, 60 sec: 5640.4, 300 sec: 5642.2). Total num frames: 567410688. Throughput: 0: 5906.4. Samples: 567407382. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:48:59,911][25689] Avg episode reward: [(0, '-25.004')] [2022-07-10 03:49:00,828][26022] Updated weights on worker 0-0, policy_version 554117 (0.00088) [2022-07-10 03:49:02,977][26022] Updated weights on worker 0-0, policy_version 554127 (0.00086) [2022-07-10 03:49:04,884][26022] Updated weights on worker 0-0, policy_version 554137 (0.00087) [2022-07-10 03:49:04,975][25689] Fps is (10 sec: 5480.2, 60 sec: 5621.1, 300 sec: 5628.0). Total num frames: 567436288. Throughput: 0: 5821.5. Samples: 567439356. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:04,975][25689] Avg episode reward: [(0, '-25.451')] [2022-07-10 03:49:06,668][26022] Updated weights on worker 0-0, policy_version 554147 (0.00094) [2022-07-10 03:49:08,425][26022] Updated weights on worker 0-0, policy_version 554157 (0.00098) [2022-07-10 03:49:09,985][25689] Fps is (10 sec: 5284.8, 60 sec: 5604.5, 300 sec: 5628.7). Total num frames: 567463936. Throughput: 0: 5809.4. Samples: 567473464. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:09,986][25689] Avg episode reward: [(0, '-25.047')] [2022-07-10 03:49:10,334][26022] Updated weights on worker 0-0, policy_version 554167 (0.00088) [2022-07-10 03:49:11,920][26022] Updated weights on worker 0-0, policy_version 554177 (0.00086) [2022-07-10 03:49:13,930][26022] Updated weights on worker 0-0, policy_version 554187 (0.00090) [2022-07-10 03:49:14,988][25689] Fps is (10 sec: 5931.4, 60 sec: 5656.6, 300 sec: 5636.2). Total num frames: 567495680. Throughput: 0: 5007.4. Samples: 567490872. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:14,988][25689] Avg episode reward: [(0, '-25.405')] [2022-07-10 03:49:15,415][26022] Updated weights on worker 0-0, policy_version 554197 (0.00088) [2022-07-10 03:49:17,437][26022] Updated weights on worker 0-0, policy_version 554207 (0.00085) [2022-07-10 03:49:19,463][26022] Updated weights on worker 0-0, policy_version 554217 (0.00090) [2022-07-10 03:49:20,023][25689] Fps is (10 sec: 5611.1, 60 sec: 5585.8, 300 sec: 5630.8). Total num frames: 567520256. Throughput: 0: 5867.4. Samples: 567525392. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:20,023][25689] Avg episode reward: [(0, '-24.244')] [2022-07-10 03:49:20,842][26022] Updated weights on worker 0-0, policy_version 554227 (0.00088) [2022-07-10 03:49:23,051][26022] Updated weights on worker 0-0, policy_version 554237 (0.00087) [2022-07-10 03:49:24,564][26022] Updated weights on worker 0-0, policy_version 554247 (0.00091) [2022-07-10 03:49:25,123][25689] Fps is (10 sec: 5556.8, 60 sec: 5638.7, 300 sec: 5639.6). Total num frames: 567552000. Throughput: 0: 5957.4. Samples: 567559390. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:25,124][25689] Avg episode reward: [(0, '-25.991')] [2022-07-10 03:49:26,584][26022] Updated weights on worker 0-0, policy_version 554257 (0.00087) [2022-07-10 03:49:28,336][26022] Updated weights on worker 0-0, policy_version 554267 (0.00097) [2022-07-10 03:49:29,978][26022] Updated weights on worker 0-0, policy_version 554277 (0.00104) [2022-07-10 03:49:30,128][25689] Fps is (10 sec: 5877.1, 60 sec: 5639.5, 300 sec: 5632.9). Total num frames: 567579648. Throughput: 0: 5100.6. Samples: 567576204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:30,129][25689] Avg episode reward: [(0, '-25.452')] [2022-07-10 03:49:32,008][26022] Updated weights on worker 0-0, policy_version 554287 (0.00085) [2022-07-10 03:49:33,784][26022] Updated weights on worker 0-0, policy_version 554297 (0.00083) [2022-07-10 03:49:35,171][25689] Fps is (10 sec: 5503.5, 60 sec: 5618.7, 300 sec: 5628.7). Total num frames: 567607296. Throughput: 0: 5917.8. Samples: 567610310. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:35,171][25689] Avg episode reward: [(0, '-26.150')] [2022-07-10 03:49:35,462][26022] Updated weights on worker 0-0, policy_version 554307 (0.00085) [2022-07-10 03:49:37,319][26022] Updated weights on worker 0-0, policy_version 554317 (0.00098) [2022-07-10 03:49:39,112][26022] Updated weights on worker 0-0, policy_version 554327 (0.00091) [2022-07-10 03:49:40,209][25689] Fps is (10 sec: 5688.3, 60 sec: 5637.0, 300 sec: 5636.8). Total num frames: 567636992. Throughput: 0: 5892.0. Samples: 567644332. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:40,210][25689] Avg episode reward: [(0, '-25.861')] [2022-07-10 03:49:40,989][26022] Updated weights on worker 0-0, policy_version 554337 (0.00115) [2022-07-10 03:49:42,772][26022] Updated weights on worker 0-0, policy_version 554347 (0.00090) [2022-07-10 03:49:44,495][26022] Updated weights on worker 0-0, policy_version 554357 (0.00081) [2022-07-10 03:49:45,281][25689] Fps is (10 sec: 5671.8, 60 sec: 5624.6, 300 sec: 5625.6). Total num frames: 567664640. Throughput: 0: 5057.8. Samples: 567661344. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:45,282][25689] Avg episode reward: [(0, '-26.584')] [2022-07-10 03:49:46,274][26022] Updated weights on worker 0-0, policy_version 554367 (0.00624) [2022-07-10 03:49:48,198][26022] Updated weights on worker 0-0, policy_version 554377 (0.00089) [2022-07-10 03:49:50,021][26022] Updated weights on worker 0-0, policy_version 554387 (0.00082) [2022-07-10 03:49:50,286][25689] Fps is (10 sec: 5792.2, 60 sec: 5677.5, 300 sec: 5636.3). Total num frames: 567695360. Throughput: 0: 5921.7. Samples: 567695574. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:50,287][25689] Avg episode reward: [(0, '-26.994')] [2022-07-10 03:49:51,820][26022] Updated weights on worker 0-0, policy_version 554397 (0.00090) [2022-07-10 03:49:53,574][26022] Updated weights on worker 0-0, policy_version 554407 (0.00089) [2022-07-10 03:49:55,297][25689] Fps is (10 sec: 5827.7, 60 sec: 5661.4, 300 sec: 5637.7). Total num frames: 567723008. Throughput: 0: 5931.8. Samples: 567729694. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:49:55,297][25689] Avg episode reward: [(0, '-26.866')] [2022-07-10 03:49:55,302][26022] Updated weights on worker 0-0, policy_version 554417 (0.00085) [2022-07-10 03:49:57,035][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:49:57,054][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000554425_567731200.pth [2022-07-10 03:49:57,054][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000552443_565701632.pth [2022-07-10 03:49:57,249][26022] Updated weights on worker 0-0, policy_version 554427 (0.00086) [2022-07-10 03:49:58,876][26022] Updated weights on worker 0-0, policy_version 554437 (0.00087) [2022-07-10 03:50:00,301][25689] Fps is (10 sec: 5419.1, 60 sec: 5612.4, 300 sec: 5640.2). Total num frames: 567749632. Throughput: 0: 5090.4. Samples: 567746608. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:00,302][25689] Avg episode reward: [(0, '-28.474')] [2022-07-10 03:50:00,833][26022] Updated weights on worker 0-0, policy_version 554447 (0.00092) [2022-07-10 03:50:02,754][26022] Updated weights on worker 0-0, policy_version 554457 (0.00089) [2022-07-10 03:50:04,887][26022] Updated weights on worker 0-0, policy_version 554467 (0.00562) [2022-07-10 03:50:05,384][25689] Fps is (10 sec: 5481.8, 60 sec: 5661.6, 300 sec: 5633.5). Total num frames: 567778304. Throughput: 0: 5824.6. Samples: 567778434. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:05,385][25689] Avg episode reward: [(0, '-29.075')] [2022-07-10 03:50:06,542][26022] Updated weights on worker 0-0, policy_version 554477 (0.00081) [2022-07-10 03:50:08,288][26022] Updated weights on worker 0-0, policy_version 554487 (0.00109) [2022-07-10 03:50:10,081][26022] Updated weights on worker 0-0, policy_version 554497 (0.00091) [2022-07-10 03:50:10,410][25689] Fps is (10 sec: 5470.2, 60 sec: 5643.3, 300 sec: 5630.6). Total num frames: 567804928. Throughput: 0: 5824.8. Samples: 567812788. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:10,410][25689] Avg episode reward: [(0, '-27.909')] [2022-07-10 03:50:11,921][26022] Updated weights on worker 0-0, policy_version 554507 (0.00066) [2022-07-10 03:50:13,839][26022] Updated weights on worker 0-0, policy_version 554517 (0.00086) [2022-07-10 03:50:15,466][25689] Fps is (10 sec: 5484.4, 60 sec: 5587.5, 300 sec: 5634.1). Total num frames: 567833600. Throughput: 0: 4951.6. Samples: 567829566. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:15,467][25689] Avg episode reward: [(0, '-27.611')] [2022-07-10 03:50:15,607][26022] Updated weights on worker 0-0, policy_version 554527 (0.00086) [2022-07-10 03:50:17,422][26022] Updated weights on worker 0-0, policy_version 554537 (0.00083) [2022-07-10 03:50:19,231][26022] Updated weights on worker 0-0, policy_version 554547 (0.00089) [2022-07-10 03:50:20,503][25689] Fps is (10 sec: 5884.3, 60 sec: 5688.8, 300 sec: 5634.6). Total num frames: 567864320. Throughput: 0: 5805.3. Samples: 567863886. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:20,504][25689] Avg episode reward: [(0, '-27.055')] [2022-07-10 03:50:21,007][26022] Updated weights on worker 0-0, policy_version 554557 (0.00085) [2022-07-10 03:50:22,683][26022] Updated weights on worker 0-0, policy_version 554567 (0.00093) [2022-07-10 03:50:24,574][26022] Updated weights on worker 0-0, policy_version 554577 (0.00082) [2022-07-10 03:50:25,597][25689] Fps is (10 sec: 5862.4, 60 sec: 5638.7, 300 sec: 5633.6). Total num frames: 567892992. Throughput: 0: 5931.2. Samples: 567898324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:25,598][25689] Avg episode reward: [(0, '-25.026')] [2022-07-10 03:50:26,444][26022] Updated weights on worker 0-0, policy_version 554587 (0.00090) [2022-07-10 03:50:28,163][26022] Updated weights on worker 0-0, policy_version 554597 (0.00088) [2022-07-10 03:50:30,114][26022] Updated weights on worker 0-0, policy_version 554607 (0.00092) [2022-07-10 03:50:30,669][25689] Fps is (10 sec: 5640.9, 60 sec: 5649.4, 300 sec: 5635.8). Total num frames: 567921664. Throughput: 0: 5056.8. Samples: 567915234. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 03:50:30,669][25689] Avg episode reward: [(0, '-24.814')] [2022-07-10 03:50:31,716][26022] Updated weights on worker 0-0, policy_version 554617 (0.00091) [2022-07-10 03:50:33,766][26022] Updated weights on worker 0-0, policy_version 554627 (0.00090) [2022-07-10 03:50:35,473][26022] Updated weights on worker 0-0, policy_version 554637 (0.00090) [2022-07-10 03:50:35,692][25689] Fps is (10 sec: 5579.2, 60 sec: 5651.2, 300 sec: 5635.8). Total num frames: 567949312. Throughput: 0: 5907.2. Samples: 567949044. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:50:35,692][25689] Avg episode reward: [(0, '-25.033')] [2022-07-10 03:50:37,207][26022] Updated weights on worker 0-0, policy_version 554647 (0.00086) [2022-07-10 03:50:39,020][26022] Updated weights on worker 0-0, policy_version 554657 (0.00082) [2022-07-10 03:50:40,730][25689] Fps is (10 sec: 5597.9, 60 sec: 5634.3, 300 sec: 5633.3). Total num frames: 567977984. Throughput: 0: 5914.1. Samples: 567983508. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:50:40,730][25689] Avg episode reward: [(0, '-25.590')] [2022-07-10 03:50:40,820][26022] Updated weights on worker 0-0, policy_version 554667 (0.00088) [2022-07-10 03:50:42,681][26022] Updated weights on worker 0-0, policy_version 554677 (0.00091) [2022-07-10 03:50:44,302][26022] Updated weights on worker 0-0, policy_version 554687 (0.00093) [2022-07-10 03:50:45,799][25689] Fps is (10 sec: 5673.9, 60 sec: 5651.5, 300 sec: 5636.1). Total num frames: 568006656. Throughput: 0: 5066.0. Samples: 568000664. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:50:45,801][25689] Avg episode reward: [(0, '-24.669')] [2022-07-10 03:50:46,191][26022] Updated weights on worker 0-0, policy_version 554697 (0.00089) [2022-07-10 03:50:47,918][26022] Updated weights on worker 0-0, policy_version 554707 (0.00089) [2022-07-10 03:50:49,811][26022] Updated weights on worker 0-0, policy_version 554717 (0.00093) [2022-07-10 03:50:50,807][25689] Fps is (10 sec: 5690.6, 60 sec: 5617.4, 300 sec: 5640.1). Total num frames: 568035328. Throughput: 0: 5948.0. Samples: 568035014. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:50:50,809][25689] Avg episode reward: [(0, '-26.098')] [2022-07-10 03:50:51,646][26022] Updated weights on worker 0-0, policy_version 554727 (0.00086) [2022-07-10 03:50:53,226][26022] Updated weights on worker 0-0, policy_version 554737 (0.00089) [2022-07-10 03:50:55,182][26022] Updated weights on worker 0-0, policy_version 554747 (0.00090) [2022-07-10 03:50:55,832][25689] Fps is (10 sec: 5817.5, 60 sec: 5649.9, 300 sec: 5639.8). Total num frames: 568065024. Throughput: 0: 5965.5. Samples: 568069188. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:50:55,833][25689] Avg episode reward: [(0, '-25.059')] [2022-07-10 03:50:56,843][26022] Updated weights on worker 0-0, policy_version 554757 (0.00088) [2022-07-10 03:50:58,703][26022] Updated weights on worker 0-0, policy_version 554767 (0.00056) [2022-07-10 03:51:00,692][26022] Updated weights on worker 0-0, policy_version 554777 (0.00095) [2022-07-10 03:51:00,842][25689] Fps is (10 sec: 5612.3, 60 sec: 5649.4, 300 sec: 5644.1). Total num frames: 568091648. Throughput: 0: 5108.8. Samples: 568086256. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:00,843][25689] Avg episode reward: [(0, '-25.893')] [2022-07-10 03:51:02,725][26022] Updated weights on worker 0-0, policy_version 554787 (0.00090) [2022-07-10 03:51:04,610][26022] Updated weights on worker 0-0, policy_version 554797 (0.00087) [2022-07-10 03:51:05,898][25689] Fps is (10 sec: 5391.7, 60 sec: 5634.9, 300 sec: 5639.8). Total num frames: 568119296. Throughput: 0: 5846.9. Samples: 568118180. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:05,900][25689] Avg episode reward: [(0, '-27.094')] [2022-07-10 03:51:06,616][26022] Updated weights on worker 0-0, policy_version 554807 (0.00085) [2022-07-10 03:51:08,071][26022] Updated weights on worker 0-0, policy_version 554817 (0.00086) [2022-07-10 03:51:10,120][26022] Updated weights on worker 0-0, policy_version 554827 (0.00088) [2022-07-10 03:51:10,937][25689] Fps is (10 sec: 5578.7, 60 sec: 5667.5, 300 sec: 5637.4). Total num frames: 568147968. Throughput: 0: 5807.3. Samples: 568151918. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:10,938][25689] Avg episode reward: [(0, '-27.302')] [2022-07-10 03:51:11,795][26022] Updated weights on worker 0-0, policy_version 554837 (0.00087) [2022-07-10 03:51:13,711][26022] Updated weights on worker 0-0, policy_version 554847 (0.00095) [2022-07-10 03:51:15,603][26022] Updated weights on worker 0-0, policy_version 554857 (0.00083) [2022-07-10 03:51:15,953][25689] Fps is (10 sec: 5601.1, 60 sec: 5654.4, 300 sec: 5637.8). Total num frames: 568175616. Throughput: 0: 4950.5. Samples: 568168796. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:15,955][25689] Avg episode reward: [(0, '-28.536')] [2022-07-10 03:51:17,315][26022] Updated weights on worker 0-0, policy_version 554867 (0.00084) [2022-07-10 03:51:19,129][26022] Updated weights on worker 0-0, policy_version 554877 (0.00088) [2022-07-10 03:51:20,799][26022] Updated weights on worker 0-0, policy_version 554887 (0.00091) [2022-07-10 03:51:21,010][25689] Fps is (10 sec: 5591.4, 60 sec: 5618.7, 300 sec: 5639.0). Total num frames: 568204288. Throughput: 0: 5784.4. Samples: 568202916. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:21,012][25689] Avg episode reward: [(0, '-29.354')] [2022-07-10 03:51:22,657][26022] Updated weights on worker 0-0, policy_version 554897 (0.00086) [2022-07-10 03:51:24,282][26022] Updated weights on worker 0-0, policy_version 554907 (0.00090) [2022-07-10 03:51:26,131][25689] Fps is (10 sec: 5634.2, 60 sec: 5616.2, 300 sec: 5629.9). Total num frames: 568232960. Throughput: 0: 5886.4. Samples: 568237280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:26,131][25689] Avg episode reward: [(0, '-30.173')] [2022-07-10 03:51:26,298][26022] Updated weights on worker 0-0, policy_version 554917 (0.00090) [2022-07-10 03:51:27,927][26022] Updated weights on worker 0-0, policy_version 554927 (0.00087) [2022-07-10 03:51:29,976][26022] Updated weights on worker 0-0, policy_version 554937 (0.00093) [2022-07-10 03:51:31,141][25689] Fps is (10 sec: 5761.2, 60 sec: 5638.8, 300 sec: 5640.4). Total num frames: 568262656. Throughput: 0: 5908.0. Samples: 568271282. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:31,142][25689] Avg episode reward: [(0, '-29.228')] [2022-07-10 03:51:31,740][26022] Updated weights on worker 0-0, policy_version 554947 (0.00090) [2022-07-10 03:51:33,619][26022] Updated weights on worker 0-0, policy_version 554957 (0.00152) [2022-07-10 03:51:35,448][26022] Updated weights on worker 0-0, policy_version 554967 (0.00084) [2022-07-10 03:51:36,150][25689] Fps is (10 sec: 5621.1, 60 sec: 5623.2, 300 sec: 5637.3). Total num frames: 568289280. Throughput: 0: 5911.6. Samples: 568288194. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:36,151][25689] Avg episode reward: [(0, '-28.807')] [2022-07-10 03:51:36,913][26022] Updated weights on worker 0-0, policy_version 554977 (0.00096) [2022-07-10 03:51:39,023][26022] Updated weights on worker 0-0, policy_version 554987 (0.00083) [2022-07-10 03:51:40,739][26022] Updated weights on worker 0-0, policy_version 554997 (0.00090) [2022-07-10 03:51:41,175][25689] Fps is (10 sec: 5612.8, 60 sec: 5641.3, 300 sec: 5639.2). Total num frames: 568318976. Throughput: 0: 5916.8. Samples: 568322230. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:41,176][25689] Avg episode reward: [(0, '-26.810')] [2022-07-10 03:51:42,684][26022] Updated weights on worker 0-0, policy_version 555007 (0.00085) [2022-07-10 03:51:44,438][26022] Updated weights on worker 0-0, policy_version 555017 (0.00086) [2022-07-10 03:51:46,222][25689] Fps is (10 sec: 5693.3, 60 sec: 5626.4, 300 sec: 5636.4). Total num frames: 568346624. Throughput: 0: 5922.2. Samples: 568356268. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:46,225][26022] Updated weights on worker 0-0, policy_version 555027 (0.00095) [2022-07-10 03:51:46,223][25689] Avg episode reward: [(0, '-26.620')] [2022-07-10 03:51:48,127][26022] Updated weights on worker 0-0, policy_version 555037 (0.00088) [2022-07-10 03:51:49,879][26022] Updated weights on worker 0-0, policy_version 555047 (0.00088) [2022-07-10 03:51:51,239][25689] Fps is (10 sec: 5698.3, 60 sec: 5642.6, 300 sec: 5640.7). Total num frames: 568376320. Throughput: 0: 5076.8. Samples: 568373314. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:51,239][25689] Avg episode reward: [(0, '-26.732')] [2022-07-10 03:51:51,562][26022] Updated weights on worker 0-0, policy_version 555057 (0.00098) [2022-07-10 03:51:53,368][26022] Updated weights on worker 0-0, policy_version 555067 (0.00087) [2022-07-10 03:51:55,198][26022] Updated weights on worker 0-0, policy_version 555077 (0.00101) [2022-07-10 03:51:56,242][25689] Fps is (10 sec: 5723.1, 60 sec: 5610.7, 300 sec: 5637.3). Total num frames: 568403968. Throughput: 0: 5957.7. Samples: 568407896. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:51:56,244][25689] Avg episode reward: [(0, '-27.246')] [2022-07-10 03:51:57,180][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:51:57,193][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000555087_568409088.pth [2022-07-10 03:51:57,193][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000553102_566376448.pth [2022-07-10 03:51:57,199][26022] Updated weights on worker 0-0, policy_version 555087 (0.00093) [2022-07-10 03:51:58,575][26022] Updated weights on worker 0-0, policy_version 555097 (0.00086) [2022-07-10 03:52:00,664][26022] Updated weights on worker 0-0, policy_version 555107 (0.01063) [2022-07-10 03:52:01,250][25689] Fps is (10 sec: 5727.9, 60 sec: 5661.7, 300 sec: 5652.3). Total num frames: 568433664. Throughput: 0: 5976.6. Samples: 568442208. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:01,250][25689] Avg episode reward: [(0, '-27.710')] [2022-07-10 03:52:02,679][26022] Updated weights on worker 0-0, policy_version 555117 (0.00088) [2022-07-10 03:52:04,631][26022] Updated weights on worker 0-0, policy_version 555127 (0.00099) [2022-07-10 03:52:06,242][26022] Updated weights on worker 0-0, policy_version 555137 (0.00092) [2022-07-10 03:52:06,367][25689] Fps is (10 sec: 5562.6, 60 sec: 5639.1, 300 sec: 5643.4). Total num frames: 568460288. Throughput: 0: 5004.0. Samples: 568457070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:06,368][25689] Avg episode reward: [(0, '-26.857')] [2022-07-10 03:52:08,190][26022] Updated weights on worker 0-0, policy_version 555147 (0.00098) [2022-07-10 03:52:09,998][26022] Updated weights on worker 0-0, policy_version 555157 (0.00091) [2022-07-10 03:52:11,378][25689] Fps is (10 sec: 5459.7, 60 sec: 5641.8, 300 sec: 5648.0). Total num frames: 568488960. Throughput: 0: 5849.5. Samples: 568491120. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:11,379][25689] Avg episode reward: [(0, '-27.497')] [2022-07-10 03:52:11,919][26022] Updated weights on worker 0-0, policy_version 555167 (0.00087) [2022-07-10 03:52:13,712][26022] Updated weights on worker 0-0, policy_version 555177 (0.00089) [2022-07-10 03:52:15,452][26022] Updated weights on worker 0-0, policy_version 555187 (0.00087) [2022-07-10 03:52:16,435][25689] Fps is (10 sec: 5594.0, 60 sec: 5637.9, 300 sec: 5643.6). Total num frames: 568516608. Throughput: 0: 5808.2. Samples: 568525178. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:16,435][25689] Avg episode reward: [(0, '-26.581')] [2022-07-10 03:52:17,262][26022] Updated weights on worker 0-0, policy_version 555197 (0.00089) [2022-07-10 03:52:19,306][26022] Updated weights on worker 0-0, policy_version 555207 (0.00088) [2022-07-10 03:52:20,624][26022] Updated weights on worker 0-0, policy_version 555217 (0.00093) [2022-07-10 03:52:21,455][25689] Fps is (10 sec: 5690.8, 60 sec: 5658.3, 300 sec: 5644.2). Total num frames: 568546304. Throughput: 0: 4944.7. Samples: 568542114. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:21,455][25689] Avg episode reward: [(0, '-26.422')] [2022-07-10 03:52:23,000][26022] Updated weights on worker 0-0, policy_version 555227 (0.00094) [2022-07-10 03:52:24,169][26022] Updated weights on worker 0-0, policy_version 555237 (0.00087) [2022-07-10 03:52:26,445][26022] Updated weights on worker 0-0, policy_version 555247 (0.00093) [2022-07-10 03:52:26,533][25689] Fps is (10 sec: 5678.6, 60 sec: 5645.4, 300 sec: 5643.2). Total num frames: 568573952. Throughput: 0: 5898.3. Samples: 568576016. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:26,535][25689] Avg episode reward: [(0, '-25.678')] [2022-07-10 03:52:27,900][26022] Updated weights on worker 0-0, policy_version 555257 (0.00092) [2022-07-10 03:52:29,928][26022] Updated weights on worker 0-0, policy_version 555267 (0.00094) [2022-07-10 03:52:31,537][25689] Fps is (10 sec: 5586.0, 60 sec: 5629.0, 300 sec: 5639.8). Total num frames: 568602624. Throughput: 0: 5873.1. Samples: 568609516. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:31,537][25689] Avg episode reward: [(0, '-25.256')] [2022-07-10 03:52:31,623][26022] Updated weights on worker 0-0, policy_version 555277 (0.00091) [2022-07-10 03:52:33,646][26022] Updated weights on worker 0-0, policy_version 555287 (0.00095) [2022-07-10 03:52:35,416][26022] Updated weights on worker 0-0, policy_version 555297 (0.00088) [2022-07-10 03:52:36,575][25689] Fps is (10 sec: 5710.4, 60 sec: 5660.2, 300 sec: 5646.2). Total num frames: 568631296. Throughput: 0: 5022.6. Samples: 568626334. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:36,575][25689] Avg episode reward: [(0, '-25.107')] [2022-07-10 03:52:37,309][26022] Updated weights on worker 0-0, policy_version 555307 (0.00092) [2022-07-10 03:52:38,902][26022] Updated weights on worker 0-0, policy_version 555317 (0.00085) [2022-07-10 03:52:40,972][26022] Updated weights on worker 0-0, policy_version 555327 (0.00090) [2022-07-10 03:52:41,621][25689] Fps is (10 sec: 5584.8, 60 sec: 5624.3, 300 sec: 5639.3). Total num frames: 568658944. Throughput: 0: 5871.0. Samples: 568660512. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:41,627][25689] Avg episode reward: [(0, '-24.606')] [2022-07-10 03:52:42,361][26022] Updated weights on worker 0-0, policy_version 555337 (0.00089) [2022-07-10 03:52:44,630][26022] Updated weights on worker 0-0, policy_version 555347 (0.00097) [2022-07-10 03:52:46,229][26022] Updated weights on worker 0-0, policy_version 555357 (0.00093) [2022-07-10 03:52:46,699][25689] Fps is (10 sec: 5563.2, 60 sec: 5638.4, 300 sec: 5634.9). Total num frames: 568687616. Throughput: 0: 5868.1. Samples: 568694350. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:46,699][25689] Avg episode reward: [(0, '-24.051')] [2022-07-10 03:52:47,961][26022] Updated weights on worker 0-0, policy_version 555367 (0.00085) [2022-07-10 03:52:50,033][26022] Updated weights on worker 0-0, policy_version 555377 (0.00093) [2022-07-10 03:52:51,528][26022] Updated weights on worker 0-0, policy_version 555387 (0.00087) [2022-07-10 03:52:51,703][25689] Fps is (10 sec: 5688.1, 60 sec: 5622.6, 300 sec: 5635.2). Total num frames: 568716288. Throughput: 0: 5059.6. Samples: 568711548. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:51,703][25689] Avg episode reward: [(0, '-24.919')] [2022-07-10 03:52:53,567][26022] Updated weights on worker 0-0, policy_version 555397 (0.00093) [2022-07-10 03:52:55,162][26022] Updated weights on worker 0-0, policy_version 555407 (0.00086) [2022-07-10 03:52:56,710][25689] Fps is (10 sec: 5625.4, 60 sec: 5622.3, 300 sec: 5632.3). Total num frames: 568743936. Throughput: 0: 5926.1. Samples: 568745658. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:52:56,711][25689] Avg episode reward: [(0, '-25.153')] [2022-07-10 03:52:57,054][26022] Updated weights on worker 0-0, policy_version 555417 (0.00087) [2022-07-10 03:52:59,042][26022] Updated weights on worker 0-0, policy_version 555427 (0.00086) [2022-07-10 03:53:00,690][26022] Updated weights on worker 0-0, policy_version 555437 (0.00093) [2022-07-10 03:53:01,770][25689] Fps is (10 sec: 5594.2, 60 sec: 5600.5, 300 sec: 5638.8). Total num frames: 568772608. Throughput: 0: 5925.3. Samples: 568779900. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:01,775][25689] Avg episode reward: [(0, '-25.328')] [2022-07-10 03:53:02,917][26022] Updated weights on worker 0-0, policy_version 555447 (0.00088) [2022-07-10 03:53:04,479][26022] Updated weights on worker 0-0, policy_version 555457 (0.00097) [2022-07-10 03:53:06,455][26022] Updated weights on worker 0-0, policy_version 555467 (0.00087) [2022-07-10 03:53:06,891][25689] Fps is (10 sec: 5431.6, 60 sec: 5600.2, 300 sec: 5629.9). Total num frames: 568799232. Throughput: 0: 4981.2. Samples: 568794930. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:06,891][25689] Avg episode reward: [(0, '-25.079')] [2022-07-10 03:53:08,262][26022] Updated weights on worker 0-0, policy_version 555477 (0.00098) [2022-07-10 03:53:10,028][26022] Updated weights on worker 0-0, policy_version 555487 (0.00087) [2022-07-10 03:53:11,890][26022] Updated weights on worker 0-0, policy_version 555497 (0.00098) [2022-07-10 03:53:11,912][25689] Fps is (10 sec: 5553.2, 60 sec: 5616.1, 300 sec: 5633.3). Total num frames: 568828928. Throughput: 0: 5819.1. Samples: 568829148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:11,913][25689] Avg episode reward: [(0, '-25.327')] [2022-07-10 03:53:13,555][26022] Updated weights on worker 0-0, policy_version 555507 (0.00051) [2022-07-10 03:53:15,358][26022] Updated weights on worker 0-0, policy_version 555517 (0.00087) [2022-07-10 03:53:16,922][25689] Fps is (10 sec: 5818.3, 60 sec: 5637.4, 300 sec: 5633.1). Total num frames: 568857600. Throughput: 0: 5820.6. Samples: 568863304. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:16,924][25689] Avg episode reward: [(0, '-27.938')] [2022-07-10 03:53:17,159][26022] Updated weights on worker 0-0, policy_version 555527 (0.00086) [2022-07-10 03:53:18,962][26022] Updated weights on worker 0-0, policy_version 555537 (0.00091) [2022-07-10 03:53:20,960][26022] Updated weights on worker 0-0, policy_version 555547 (0.00086) [2022-07-10 03:53:21,925][25689] Fps is (10 sec: 5625.0, 60 sec: 5605.2, 300 sec: 5632.0). Total num frames: 568885248. Throughput: 0: 4995.5. Samples: 568880582. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:21,925][25689] Avg episode reward: [(0, '-27.525')] [2022-07-10 03:53:22,524][26022] Updated weights on worker 0-0, policy_version 555557 (0.00086) [2022-07-10 03:53:24,419][26022] Updated weights on worker 0-0, policy_version 555567 (0.00085) [2022-07-10 03:53:26,245][26022] Updated weights on worker 0-0, policy_version 555577 (0.00084) [2022-07-10 03:53:26,980][25689] Fps is (10 sec: 5701.6, 60 sec: 5641.2, 300 sec: 5638.1). Total num frames: 568914944. Throughput: 0: 5969.2. Samples: 568914846. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:26,980][25689] Avg episode reward: [(0, '-26.591')] [2022-07-10 03:53:28,163][26022] Updated weights on worker 0-0, policy_version 555587 (0.00087) [2022-07-10 03:53:29,752][26022] Updated weights on worker 0-0, policy_version 555597 (0.00089) [2022-07-10 03:53:31,907][26022] Updated weights on worker 0-0, policy_version 555607 (0.00087) [2022-07-10 03:53:32,059][25689] Fps is (10 sec: 5557.3, 60 sec: 5600.3, 300 sec: 5629.7). Total num frames: 568941568. Throughput: 0: 5926.1. Samples: 568948540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 03:53:32,061][25689] Avg episode reward: [(0, '-27.631')] [2022-07-10 03:53:33,333][26022] Updated weights on worker 0-0, policy_version 555617 (0.00313) [2022-07-10 03:53:35,508][26022] Updated weights on worker 0-0, policy_version 555627 (0.00081) [2022-07-10 03:53:37,024][26022] Updated weights on worker 0-0, policy_version 555637 (0.00087) [2022-07-10 03:53:37,112][25689] Fps is (10 sec: 5659.6, 60 sec: 5632.8, 300 sec: 5636.6). Total num frames: 568972288. Throughput: 0: 5071.7. Samples: 568965700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:53:37,113][25689] Avg episode reward: [(0, '-27.616')] [2022-07-10 03:53:39,182][26022] Updated weights on worker 0-0, policy_version 555647 (0.00089) [2022-07-10 03:53:40,543][26022] Updated weights on worker 0-0, policy_version 555657 (0.00091) [2022-07-10 03:53:42,148][25689] Fps is (10 sec: 5785.4, 60 sec: 5633.8, 300 sec: 5634.7). Total num frames: 568999936. Throughput: 0: 5889.4. Samples: 568999684. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:53:42,149][25689] Avg episode reward: [(0, '-26.585')] [2022-07-10 03:53:42,764][26022] Updated weights on worker 0-0, policy_version 555667 (0.00088) [2022-07-10 03:53:44,307][26022] Updated weights on worker 0-0, policy_version 555677 (0.00351) [2022-07-10 03:53:46,224][26022] Updated weights on worker 0-0, policy_version 555687 (0.00090) [2022-07-10 03:53:47,274][25689] Fps is (10 sec: 5542.4, 60 sec: 5629.2, 300 sec: 5636.3). Total num frames: 569028608. Throughput: 0: 5843.7. Samples: 569033438. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:53:47,275][25689] Avg episode reward: [(0, '-27.047')] [2022-07-10 03:53:47,904][26022] Updated weights on worker 0-0, policy_version 555697 (0.00088) [2022-07-10 03:53:49,877][26022] Updated weights on worker 0-0, policy_version 555707 (0.00082) [2022-07-10 03:53:51,630][26022] Updated weights on worker 0-0, policy_version 555717 (0.00090) [2022-07-10 03:53:52,298][25689] Fps is (10 sec: 5750.4, 60 sec: 5644.3, 300 sec: 5639.7). Total num frames: 569058304. Throughput: 0: 5052.3. Samples: 569050794. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:53:52,300][25689] Avg episode reward: [(0, '-26.632')] [2022-07-10 03:53:53,387][26022] Updated weights on worker 0-0, policy_version 555727 (0.00091) [2022-07-10 03:53:55,209][26022] Updated weights on worker 0-0, policy_version 555737 (0.00087) [2022-07-10 03:53:57,005][26022] Updated weights on worker 0-0, policy_version 555747 (0.00086) [2022-07-10 03:53:57,202][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:53:57,217][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000555748_569085952.pth [2022-07-10 03:53:57,218][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000553764_567054336.pth [2022-07-10 03:53:57,343][25689] Fps is (10 sec: 5694.7, 60 sec: 5640.8, 300 sec: 5632.4). Total num frames: 569085952. Throughput: 0: 5878.3. Samples: 569084626. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:53:57,345][25689] Avg episode reward: [(0, '-27.518')] [2022-07-10 03:53:58,959][26022] Updated weights on worker 0-0, policy_version 555757 (0.00098) [2022-07-10 03:54:00,814][26022] Updated weights on worker 0-0, policy_version 555767 (0.00085) [2022-07-10 03:54:02,363][25689] Fps is (10 sec: 5290.7, 60 sec: 5593.8, 300 sec: 5633.3). Total num frames: 569111552. Throughput: 0: 5859.2. Samples: 569118126. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:02,363][25689] Avg episode reward: [(0, '-27.568')] [2022-07-10 03:54:02,976][26022] Updated weights on worker 0-0, policy_version 555777 (0.00087) [2022-07-10 03:54:04,549][26022] Updated weights on worker 0-0, policy_version 555787 (0.00094) [2022-07-10 03:54:06,622][26022] Updated weights on worker 0-0, policy_version 555797 (0.00092) [2022-07-10 03:54:07,473][25689] Fps is (10 sec: 5560.2, 60 sec: 5662.4, 300 sec: 5641.7). Total num frames: 569142272. Throughput: 0: 5776.1. Samples: 569150108. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:07,473][25689] Avg episode reward: [(0, '-27.353')] [2022-07-10 03:54:08,356][26022] Updated weights on worker 0-0, policy_version 555807 (0.00094) [2022-07-10 03:54:10,219][26022] Updated weights on worker 0-0, policy_version 555817 (0.00089) [2022-07-10 03:54:12,156][26022] Updated weights on worker 0-0, policy_version 555827 (0.00092) [2022-07-10 03:54:12,513][25689] Fps is (10 sec: 5649.7, 60 sec: 5610.0, 300 sec: 5623.8). Total num frames: 569168896. Throughput: 0: 5736.7. Samples: 569166758. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:12,513][25689] Avg episode reward: [(0, '-26.686')] [2022-07-10 03:54:13,810][26022] Updated weights on worker 0-0, policy_version 555837 (0.00054) [2022-07-10 03:54:15,603][26022] Updated weights on worker 0-0, policy_version 555847 (0.00091) [2022-07-10 03:54:17,603][25689] Fps is (10 sec: 5458.6, 60 sec: 5602.6, 300 sec: 5636.5). Total num frames: 569197568. Throughput: 0: 5734.5. Samples: 569200804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:17,603][25689] Avg episode reward: [(0, '-27.335')] [2022-07-10 03:54:17,612][26022] Updated weights on worker 0-0, policy_version 555857 (0.00093) [2022-07-10 03:54:19,354][26022] Updated weights on worker 0-0, policy_version 555867 (0.00079) [2022-07-10 03:54:21,169][26022] Updated weights on worker 0-0, policy_version 555877 (0.00086) [2022-07-10 03:54:22,655][25689] Fps is (10 sec: 5654.1, 60 sec: 5614.9, 300 sec: 5627.1). Total num frames: 569226240. Throughput: 0: 5759.2. Samples: 569234992. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:22,655][25689] Avg episode reward: [(0, '-26.381')] [2022-07-10 03:54:22,939][26022] Updated weights on worker 0-0, policy_version 555887 (0.00076) [2022-07-10 03:54:24,643][26022] Updated weights on worker 0-0, policy_version 555897 (0.00090) [2022-07-10 03:54:26,599][26022] Updated weights on worker 0-0, policy_version 555907 (0.00086) [2022-07-10 03:54:27,773][25689] Fps is (10 sec: 5638.1, 60 sec: 5592.2, 300 sec: 5628.4). Total num frames: 569254912. Throughput: 0: 5025.1. Samples: 569252122. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:27,774][25689] Avg episode reward: [(0, '-25.324')] [2022-07-10 03:54:28,264][26022] Updated weights on worker 0-0, policy_version 555917 (0.00090) [2022-07-10 03:54:30,180][26022] Updated weights on worker 0-0, policy_version 555927 (0.00088) [2022-07-10 03:54:31,840][26022] Updated weights on worker 0-0, policy_version 555937 (0.00091) [2022-07-10 03:54:32,793][25689] Fps is (10 sec: 5656.4, 60 sec: 5631.4, 300 sec: 5632.3). Total num frames: 569283584. Throughput: 0: 5879.5. Samples: 569285994. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:32,793][25689] Avg episode reward: [(0, '-25.869')] [2022-07-10 03:54:33,631][26022] Updated weights on worker 0-0, policy_version 555947 (0.00084) [2022-07-10 03:54:35,646][26022] Updated weights on worker 0-0, policy_version 555957 (0.00093) [2022-07-10 03:54:37,186][26022] Updated weights on worker 0-0, policy_version 555967 (0.00087) [2022-07-10 03:54:37,836][25689] Fps is (10 sec: 5698.7, 60 sec: 5598.6, 300 sec: 5628.7). Total num frames: 569312256. Throughput: 0: 5902.1. Samples: 569320224. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:37,837][25689] Avg episode reward: [(0, '-25.369')] [2022-07-10 03:54:39,144][26022] Updated weights on worker 0-0, policy_version 555977 (0.00102) [2022-07-10 03:54:40,881][26022] Updated weights on worker 0-0, policy_version 555987 (0.00086) [2022-07-10 03:54:42,589][26022] Updated weights on worker 0-0, policy_version 555997 (0.00086) [2022-07-10 03:54:42,854][25689] Fps is (10 sec: 5699.2, 60 sec: 5617.1, 300 sec: 5633.2). Total num frames: 569340928. Throughput: 0: 5061.3. Samples: 569337228. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:42,856][25689] Avg episode reward: [(0, '-25.601')] [2022-07-10 03:54:44,672][26022] Updated weights on worker 0-0, policy_version 556007 (0.00097) [2022-07-10 03:54:46,372][26022] Updated weights on worker 0-0, policy_version 556017 (0.00087) [2022-07-10 03:54:47,890][25689] Fps is (10 sec: 5704.0, 60 sec: 5625.5, 300 sec: 5625.7). Total num frames: 569369600. Throughput: 0: 5911.3. Samples: 569371034. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:47,893][25689] Avg episode reward: [(0, '-24.897')] [2022-07-10 03:54:48,187][26022] Updated weights on worker 0-0, policy_version 556027 (0.00082) [2022-07-10 03:54:50,218][26022] Updated weights on worker 0-0, policy_version 556037 (0.00090) [2022-07-10 03:54:51,737][26022] Updated weights on worker 0-0, policy_version 556047 (0.00089) [2022-07-10 03:54:52,923][25689] Fps is (10 sec: 5695.1, 60 sec: 5607.7, 300 sec: 5628.7). Total num frames: 569398272. Throughput: 0: 5921.9. Samples: 569405206. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:52,925][25689] Avg episode reward: [(0, '-25.028')] [2022-07-10 03:54:53,921][26022] Updated weights on worker 0-0, policy_version 556057 (0.00083) [2022-07-10 03:54:55,217][26022] Updated weights on worker 0-0, policy_version 556067 (0.00086) [2022-07-10 03:54:57,396][26022] Updated weights on worker 0-0, policy_version 556077 (0.00089) [2022-07-10 03:54:57,927][25689] Fps is (10 sec: 5815.1, 60 sec: 5645.4, 300 sec: 5639.1). Total num frames: 569427968. Throughput: 0: 5086.6. Samples: 569422418. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:54:57,928][25689] Avg episode reward: [(0, '-25.193')] [2022-07-10 03:54:58,828][26022] Updated weights on worker 0-0, policy_version 556087 (0.00094) [2022-07-10 03:55:00,825][26022] Updated weights on worker 0-0, policy_version 556097 (0.00099) [2022-07-10 03:55:02,900][26022] Updated weights on worker 0-0, policy_version 556107 (0.00091) [2022-07-10 03:55:02,996][25689] Fps is (10 sec: 5489.4, 60 sec: 5640.7, 300 sec: 5629.0). Total num frames: 569453568. Throughput: 0: 5877.2. Samples: 569455608. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:02,997][25689] Avg episode reward: [(0, '-25.260')] [2022-07-10 03:55:04,709][26022] Updated weights on worker 0-0, policy_version 556117 (0.00087) [2022-07-10 03:55:06,628][26022] Updated weights on worker 0-0, policy_version 556127 (0.00087) [2022-07-10 03:55:08,053][25689] Fps is (10 sec: 5460.6, 60 sec: 5628.7, 300 sec: 5638.7). Total num frames: 569483264. Throughput: 0: 5841.6. Samples: 569488822. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:08,054][25689] Avg episode reward: [(0, '-25.593')] [2022-07-10 03:55:08,384][26022] Updated weights on worker 0-0, policy_version 556137 (0.00093) [2022-07-10 03:55:10,163][26022] Updated weights on worker 0-0, policy_version 556147 (0.00093) [2022-07-10 03:55:12,198][26022] Updated weights on worker 0-0, policy_version 556157 (0.00095) [2022-07-10 03:55:13,082][25689] Fps is (10 sec: 5584.2, 60 sec: 5629.8, 300 sec: 5632.4). Total num frames: 569509888. Throughput: 0: 4975.1. Samples: 569505498. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:13,083][25689] Avg episode reward: [(0, '-25.362')] [2022-07-10 03:55:13,795][26022] Updated weights on worker 0-0, policy_version 556167 (0.00087) [2022-07-10 03:55:15,580][26022] Updated weights on worker 0-0, policy_version 556177 (0.00086) [2022-07-10 03:55:17,532][26022] Updated weights on worker 0-0, policy_version 556187 (0.00089) [2022-07-10 03:55:18,156][25689] Fps is (10 sec: 5473.1, 60 sec: 5631.2, 300 sec: 5624.8). Total num frames: 569538560. Throughput: 0: 5795.8. Samples: 569539664. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:18,157][25689] Avg episode reward: [(0, '-25.916')] [2022-07-10 03:55:19,080][26022] Updated weights on worker 0-0, policy_version 556197 (0.00086) [2022-07-10 03:55:21,048][26022] Updated weights on worker 0-0, policy_version 556207 (0.00099) [2022-07-10 03:55:22,909][26022] Updated weights on worker 0-0, policy_version 556217 (0.00090) [2022-07-10 03:55:23,182][25689] Fps is (10 sec: 5576.3, 60 sec: 5616.8, 300 sec: 5622.7). Total num frames: 569566208. Throughput: 0: 5847.0. Samples: 569573632. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:23,183][25689] Avg episode reward: [(0, '-26.253')] [2022-07-10 03:55:24,712][26022] Updated weights on worker 0-0, policy_version 556227 (0.00091) [2022-07-10 03:55:26,782][26022] Updated weights on worker 0-0, policy_version 556237 (0.00088) [2022-07-10 03:55:28,246][25689] Fps is (10 sec: 5581.8, 60 sec: 5621.8, 300 sec: 5622.8). Total num frames: 569594880. Throughput: 0: 5033.0. Samples: 569590454. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:28,247][25689] Avg episode reward: [(0, '-27.106')] [2022-07-10 03:55:28,466][26022] Updated weights on worker 0-0, policy_version 556247 (0.00088) [2022-07-10 03:55:30,309][26022] Updated weights on worker 0-0, policy_version 556257 (0.00092) [2022-07-10 03:55:32,028][26022] Updated weights on worker 0-0, policy_version 556267 (0.00086) [2022-07-10 03:55:33,255][25689] Fps is (10 sec: 5794.7, 60 sec: 5639.8, 300 sec: 5630.0). Total num frames: 569624576. Throughput: 0: 5887.0. Samples: 569624254. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:33,257][25689] Avg episode reward: [(0, '-28.361')] [2022-07-10 03:55:33,856][26022] Updated weights on worker 0-0, policy_version 556277 (0.00093) [2022-07-10 03:55:35,758][26022] Updated weights on worker 0-0, policy_version 556287 (0.00081) [2022-07-10 03:55:37,404][26022] Updated weights on worker 0-0, policy_version 556297 (0.00092) [2022-07-10 03:55:38,303][25689] Fps is (10 sec: 5803.9, 60 sec: 5639.3, 300 sec: 5629.8). Total num frames: 569653248. Throughput: 0: 5889.1. Samples: 569658310. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:38,305][25689] Avg episode reward: [(0, '-28.190')] [2022-07-10 03:55:39,360][26022] Updated weights on worker 0-0, policy_version 556307 (0.00088) [2022-07-10 03:55:41,143][26022] Updated weights on worker 0-0, policy_version 556317 (0.00091) [2022-07-10 03:55:43,018][26022] Updated weights on worker 0-0, policy_version 556327 (0.00108) [2022-07-10 03:55:43,370][25689] Fps is (10 sec: 5669.2, 60 sec: 5634.8, 300 sec: 5629.8). Total num frames: 569681920. Throughput: 0: 5034.7. Samples: 569675272. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:43,372][25689] Avg episode reward: [(0, '-28.344')] [2022-07-10 03:55:44,908][26022] Updated weights on worker 0-0, policy_version 556337 (0.00094) [2022-07-10 03:55:46,712][26022] Updated weights on worker 0-0, policy_version 556347 (0.00093) [2022-07-10 03:55:48,243][26022] Updated weights on worker 0-0, policy_version 556357 (0.00092) [2022-07-10 03:55:48,480][25689] Fps is (10 sec: 5533.9, 60 sec: 5610.9, 300 sec: 5624.4). Total num frames: 569709568. Throughput: 0: 5875.2. Samples: 569709332. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:48,481][25689] Avg episode reward: [(0, '-27.318')] [2022-07-10 03:55:50,352][26022] Updated weights on worker 0-0, policy_version 556367 (0.00057) [2022-07-10 03:55:51,899][26022] Updated weights on worker 0-0, policy_version 556377 (0.00096) [2022-07-10 03:55:53,514][25689] Fps is (10 sec: 5451.1, 60 sec: 5594.0, 300 sec: 5617.4). Total num frames: 569737216. Throughput: 0: 5887.3. Samples: 569743526. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:53,516][25689] Avg episode reward: [(0, '-27.510')] [2022-07-10 03:55:53,817][26022] Updated weights on worker 0-0, policy_version 556387 (0.00085) [2022-07-10 03:55:55,541][26022] Updated weights on worker 0-0, policy_version 556397 (0.00082) [2022-07-10 03:55:57,358][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:55:57,371][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000556407_569760768.pth [2022-07-10 03:55:57,387][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000554425_567731200.pth [2022-07-10 03:55:57,391][26022] Updated weights on worker 0-0, policy_version 556407 (0.00092) [2022-07-10 03:55:58,550][25689] Fps is (10 sec: 5694.8, 60 sec: 5591.0, 300 sec: 5627.2). Total num frames: 569766912. Throughput: 0: 5056.8. Samples: 569760694. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:55:58,551][25689] Avg episode reward: [(0, '-26.885')] [2022-07-10 03:55:59,203][26022] Updated weights on worker 0-0, policy_version 556417 (0.00081) [2022-07-10 03:56:01,135][26022] Updated weights on worker 0-0, policy_version 556427 (0.00086) [2022-07-10 03:56:03,119][26022] Updated weights on worker 0-0, policy_version 556437 (0.00096) [2022-07-10 03:56:03,574][25689] Fps is (10 sec: 5700.0, 60 sec: 5629.0, 300 sec: 5627.8). Total num frames: 569794560. Throughput: 0: 5911.3. Samples: 569794706. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:03,575][25689] Avg episode reward: [(0, '-25.876')] [2022-07-10 03:56:05,366][26022] Updated weights on worker 0-0, policy_version 556447 (0.00087) [2022-07-10 03:56:06,576][26022] Updated weights on worker 0-0, policy_version 556457 (0.00094) [2022-07-10 03:56:08,655][25689] Fps is (10 sec: 5269.5, 60 sec: 5559.2, 300 sec: 5616.7). Total num frames: 569820160. Throughput: 0: 5804.5. Samples: 569826434. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:08,656][25689] Avg episode reward: [(0, '-24.632')] [2022-07-10 03:56:08,784][26022] Updated weights on worker 0-0, policy_version 556467 (0.00089) [2022-07-10 03:56:10,448][26022] Updated weights on worker 0-0, policy_version 556477 (0.00086) [2022-07-10 03:56:12,136][26022] Updated weights on worker 0-0, policy_version 556487 (0.00090) [2022-07-10 03:56:13,682][25689] Fps is (10 sec: 5572.0, 60 sec: 5626.9, 300 sec: 5626.8). Total num frames: 569850880. Throughput: 0: 4947.3. Samples: 569843304. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:13,684][25689] Avg episode reward: [(0, '-24.648')] [2022-07-10 03:56:14,099][26022] Updated weights on worker 0-0, policy_version 556497 (0.00096) [2022-07-10 03:56:15,674][26022] Updated weights on worker 0-0, policy_version 556507 (0.00055) [2022-07-10 03:56:17,789][26022] Updated weights on worker 0-0, policy_version 556517 (0.00085) [2022-07-10 03:56:18,701][25689] Fps is (10 sec: 5911.7, 60 sec: 5632.1, 300 sec: 5627.5). Total num frames: 569879552. Throughput: 0: 5783.6. Samples: 569877242. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:18,702][25689] Avg episode reward: [(0, '-24.027')] [2022-07-10 03:56:19,423][26022] Updated weights on worker 0-0, policy_version 556527 (0.00081) [2022-07-10 03:56:21,196][26022] Updated weights on worker 0-0, policy_version 556537 (0.00088) [2022-07-10 03:56:23,135][26022] Updated weights on worker 0-0, policy_version 556547 (0.00081) [2022-07-10 03:56:23,706][25689] Fps is (10 sec: 5618.4, 60 sec: 5634.0, 300 sec: 5626.3). Total num frames: 569907200. Throughput: 0: 5801.3. Samples: 569911498. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:23,707][25689] Avg episode reward: [(0, '-23.735')] [2022-07-10 03:56:24,852][26022] Updated weights on worker 0-0, policy_version 556557 (0.00623) [2022-07-10 03:56:26,733][26022] Updated weights on worker 0-0, policy_version 556567 (0.00086) [2022-07-10 03:56:28,513][26022] Updated weights on worker 0-0, policy_version 556577 (0.00087) [2022-07-10 03:56:28,783][25689] Fps is (10 sec: 5687.7, 60 sec: 5649.7, 300 sec: 5625.0). Total num frames: 569936896. Throughput: 0: 5900.1. Samples: 569945194. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:28,784][25689] Avg episode reward: [(0, '-24.533')] [2022-07-10 03:56:30,372][26022] Updated weights on worker 0-0, policy_version 556587 (0.00091) [2022-07-10 03:56:32,149][26022] Updated weights on worker 0-0, policy_version 556597 (0.00087) [2022-07-10 03:56:33,868][25689] Fps is (10 sec: 5542.5, 60 sec: 5591.9, 300 sec: 5623.6). Total num frames: 569963520. Throughput: 0: 5886.8. Samples: 569962132. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 03:56:33,868][25689] Avg episode reward: [(0, '-24.555')] [2022-07-10 03:56:34,027][26022] Updated weights on worker 0-0, policy_version 556607 (0.00089) [2022-07-10 03:56:35,699][26022] Updated weights on worker 0-0, policy_version 556617 (0.00084) [2022-07-10 03:56:37,768][26022] Updated weights on worker 0-0, policy_version 556627 (0.00089) [2022-07-10 03:56:38,936][25689] Fps is (10 sec: 5648.1, 60 sec: 5623.9, 300 sec: 5626.2). Total num frames: 569994240. Throughput: 0: 5867.5. Samples: 569995970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:56:38,937][25689] Avg episode reward: [(0, '-25.272')] [2022-07-10 03:56:39,142][26022] Updated weights on worker 0-0, policy_version 556637 (0.00088) [2022-07-10 03:56:41,506][26022] Updated weights on worker 0-0, policy_version 556647 (0.00091) [2022-07-10 03:56:42,891][26022] Updated weights on worker 0-0, policy_version 556657 (0.00109) [2022-07-10 03:56:43,960][25689] Fps is (10 sec: 5681.8, 60 sec: 5594.0, 300 sec: 5623.2). Total num frames: 570020864. Throughput: 0: 5841.8. Samples: 570029818. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:56:43,961][25689] Avg episode reward: [(0, '-25.673')] [2022-07-10 03:56:44,931][26022] Updated weights on worker 0-0, policy_version 556667 (0.00093) [2022-07-10 03:56:46,691][26022] Updated weights on worker 0-0, policy_version 556677 (0.00089) [2022-07-10 03:56:48,476][26022] Updated weights on worker 0-0, policy_version 556687 (0.00098) [2022-07-10 03:56:49,058][25689] Fps is (10 sec: 5463.1, 60 sec: 5612.1, 300 sec: 5618.2). Total num frames: 570049536. Throughput: 0: 5010.5. Samples: 570046780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:56:49,058][25689] Avg episode reward: [(0, '-26.112')] [2022-07-10 03:56:50,327][26022] Updated weights on worker 0-0, policy_version 556697 (0.00084) [2022-07-10 03:56:52,276][26022] Updated weights on worker 0-0, policy_version 556707 (0.00090) [2022-07-10 03:56:53,805][26022] Updated weights on worker 0-0, policy_version 556717 (0.00081) [2022-07-10 03:56:54,066][25689] Fps is (10 sec: 5876.9, 60 sec: 5665.2, 300 sec: 5628.5). Total num frames: 570080256. Throughput: 0: 5885.8. Samples: 570081016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:56:54,067][25689] Avg episode reward: [(0, '-26.351')] [2022-07-10 03:56:55,722][26022] Updated weights on worker 0-0, policy_version 556727 (0.00086) [2022-07-10 03:56:57,295][26022] Updated weights on worker 0-0, policy_version 556737 (0.00082) [2022-07-10 03:56:59,091][25689] Fps is (10 sec: 5715.6, 60 sec: 5615.5, 300 sec: 5617.8). Total num frames: 570106880. Throughput: 0: 5927.5. Samples: 570115436. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:56:59,091][25689] Avg episode reward: [(0, '-25.809')] [2022-07-10 03:56:59,366][26022] Updated weights on worker 0-0, policy_version 556747 (0.00083) [2022-07-10 03:57:01,039][26022] Updated weights on worker 0-0, policy_version 556757 (0.00091) [2022-07-10 03:57:03,259][26022] Updated weights on worker 0-0, policy_version 556767 (0.00095) [2022-07-10 03:57:04,094][25689] Fps is (10 sec: 5310.2, 60 sec: 5600.6, 300 sec: 5620.0). Total num frames: 570133504. Throughput: 0: 5077.1. Samples: 570132038. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:04,094][25689] Avg episode reward: [(0, '-25.036')] [2022-07-10 03:57:05,124][26022] Updated weights on worker 0-0, policy_version 556777 (0.00082) [2022-07-10 03:57:06,950][26022] Updated weights on worker 0-0, policy_version 556787 (0.00092) [2022-07-10 03:57:08,778][26022] Updated weights on worker 0-0, policy_version 556797 (0.00094) [2022-07-10 03:57:09,198][25689] Fps is (10 sec: 5470.8, 60 sec: 5649.1, 300 sec: 5618.2). Total num frames: 570162176. Throughput: 0: 5833.0. Samples: 570164258. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:09,199][25689] Avg episode reward: [(0, '-23.730')] [2022-07-10 03:57:10,543][26022] Updated weights on worker 0-0, policy_version 556807 (0.00088) [2022-07-10 03:57:12,188][26022] Updated weights on worker 0-0, policy_version 556817 (0.00089) [2022-07-10 03:57:14,208][25689] Fps is (10 sec: 5568.1, 60 sec: 5599.9, 300 sec: 5619.1). Total num frames: 570189824. Throughput: 0: 5821.5. Samples: 570198274. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:14,209][25689] Avg episode reward: [(0, '-24.361')] [2022-07-10 03:57:14,263][26022] Updated weights on worker 0-0, policy_version 556827 (0.00089) [2022-07-10 03:57:15,866][26022] Updated weights on worker 0-0, policy_version 556837 (0.00093) [2022-07-10 03:57:17,907][26022] Updated weights on worker 0-0, policy_version 556847 (0.00096) [2022-07-10 03:57:19,247][25689] Fps is (10 sec: 5706.6, 60 sec: 5615.1, 300 sec: 5618.7). Total num frames: 570219520. Throughput: 0: 4952.0. Samples: 570215250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:19,247][25689] Avg episode reward: [(0, '-25.720')] [2022-07-10 03:57:19,379][26022] Updated weights on worker 0-0, policy_version 556857 (0.00087) [2022-07-10 03:57:21,433][26022] Updated weights on worker 0-0, policy_version 556867 (0.00080) [2022-07-10 03:57:22,991][26022] Updated weights on worker 0-0, policy_version 556877 (0.00103) [2022-07-10 03:57:24,281][25689] Fps is (10 sec: 5693.2, 60 sec: 5612.4, 300 sec: 5619.6). Total num frames: 570247168. Throughput: 0: 5819.0. Samples: 570249508. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:24,281][25689] Avg episode reward: [(0, '-24.322')] [2022-07-10 03:57:25,064][26022] Updated weights on worker 0-0, policy_version 556887 (0.00097) [2022-07-10 03:57:26,718][26022] Updated weights on worker 0-0, policy_version 556897 (0.00087) [2022-07-10 03:57:28,688][26022] Updated weights on worker 0-0, policy_version 556907 (0.00054) [2022-07-10 03:57:29,406][25689] Fps is (10 sec: 5543.6, 60 sec: 5591.0, 300 sec: 5617.3). Total num frames: 570275840. Throughput: 0: 5883.7. Samples: 570283156. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:29,407][25689] Avg episode reward: [(0, '-24.325')] [2022-07-10 03:57:30,528][26022] Updated weights on worker 0-0, policy_version 556917 (0.00090) [2022-07-10 03:57:32,416][26022] Updated weights on worker 0-0, policy_version 556927 (0.00091) [2022-07-10 03:57:34,046][26022] Updated weights on worker 0-0, policy_version 556937 (0.00092) [2022-07-10 03:57:34,461][25689] Fps is (10 sec: 5733.3, 60 sec: 5644.4, 300 sec: 5620.4). Total num frames: 570305536. Throughput: 0: 5022.5. Samples: 570299994. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:34,462][25689] Avg episode reward: [(0, '-24.861')] [2022-07-10 03:57:36,094][26022] Updated weights on worker 0-0, policy_version 556947 (0.00092) [2022-07-10 03:57:37,572][26022] Updated weights on worker 0-0, policy_version 556957 (0.01278) [2022-07-10 03:57:39,479][25689] Fps is (10 sec: 5591.5, 60 sec: 5581.5, 300 sec: 5617.5). Total num frames: 570332160. Throughput: 0: 5863.9. Samples: 570333888. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:39,479][25689] Avg episode reward: [(0, '-26.142')] [2022-07-10 03:57:39,868][26022] Updated weights on worker 0-0, policy_version 556967 (0.00092) [2022-07-10 03:57:41,312][26022] Updated weights on worker 0-0, policy_version 556977 (0.00096) [2022-07-10 03:57:43,489][26022] Updated weights on worker 0-0, policy_version 556987 (0.00088) [2022-07-10 03:57:44,503][25689] Fps is (10 sec: 5608.3, 60 sec: 5632.2, 300 sec: 5621.9). Total num frames: 570361856. Throughput: 0: 5826.4. Samples: 570367334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:44,504][25689] Avg episode reward: [(0, '-26.459')] [2022-07-10 03:57:44,953][26022] Updated weights on worker 0-0, policy_version 556997 (0.00094) [2022-07-10 03:57:47,143][26022] Updated weights on worker 0-0, policy_version 557007 (0.00906) [2022-07-10 03:57:48,570][26022] Updated weights on worker 0-0, policy_version 557017 (0.00095) [2022-07-10 03:57:49,615][25689] Fps is (10 sec: 5657.2, 60 sec: 5614.0, 300 sec: 5616.5). Total num frames: 570389504. Throughput: 0: 5002.7. Samples: 570384256. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:49,616][25689] Avg episode reward: [(0, '-25.775')] [2022-07-10 03:57:50,769][26022] Updated weights on worker 0-0, policy_version 557027 (0.00085) [2022-07-10 03:57:52,143][26022] Updated weights on worker 0-0, policy_version 557037 (0.00091) [2022-07-10 03:57:54,322][26022] Updated weights on worker 0-0, policy_version 557047 (0.01119) [2022-07-10 03:57:54,644][25689] Fps is (10 sec: 5553.8, 60 sec: 5578.2, 300 sec: 5619.5). Total num frames: 570418176. Throughput: 0: 5856.8. Samples: 570418202. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:54,645][25689] Avg episode reward: [(0, '-25.742')] [2022-07-10 03:57:56,008][26022] Updated weights on worker 0-0, policy_version 557057 (0.00103) [2022-07-10 03:57:57,402][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:57:57,416][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000557065_570434560.pth [2022-07-10 03:57:57,417][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000555087_568409088.pth [2022-07-10 03:57:57,937][26022] Updated weights on worker 0-0, policy_version 557067 (0.00086) [2022-07-10 03:57:59,580][26022] Updated weights on worker 0-0, policy_version 557077 (0.00091) [2022-07-10 03:57:59,664][25689] Fps is (10 sec: 5706.4, 60 sec: 5612.5, 300 sec: 5620.3). Total num frames: 570446848. Throughput: 0: 5866.1. Samples: 570452298. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:57:59,665][25689] Avg episode reward: [(0, '-27.126')] [2022-07-10 03:58:01,821][26022] Updated weights on worker 0-0, policy_version 557087 (0.00097) [2022-07-10 03:58:03,415][26022] Updated weights on worker 0-0, policy_version 557097 (0.00094) [2022-07-10 03:58:04,665][25689] Fps is (10 sec: 5415.8, 60 sec: 5595.7, 300 sec: 5619.1). Total num frames: 570472448. Throughput: 0: 4967.6. Samples: 570467490. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:04,666][25689] Avg episode reward: [(0, '-25.957')] [2022-07-10 03:58:05,570][26022] Updated weights on worker 0-0, policy_version 557107 (0.00089) [2022-07-10 03:58:07,100][26022] Updated weights on worker 0-0, policy_version 557117 (0.00086) [2022-07-10 03:58:08,994][26022] Updated weights on worker 0-0, policy_version 557127 (0.00089) [2022-07-10 03:58:09,732][25689] Fps is (10 sec: 5594.1, 60 sec: 5633.1, 300 sec: 5621.7). Total num frames: 570503168. Throughput: 0: 5805.7. Samples: 570501048. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:09,733][25689] Avg episode reward: [(0, '-25.077')] [2022-07-10 03:58:10,919][26022] Updated weights on worker 0-0, policy_version 557137 (0.00087) [2022-07-10 03:58:12,551][26022] Updated weights on worker 0-0, policy_version 557147 (0.00085) [2022-07-10 03:58:14,447][26022] Updated weights on worker 0-0, policy_version 557157 (0.00089) [2022-07-10 03:58:14,747][25689] Fps is (10 sec: 5688.2, 60 sec: 5615.7, 300 sec: 5614.7). Total num frames: 570529792. Throughput: 0: 5806.6. Samples: 570534928. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:14,747][25689] Avg episode reward: [(0, '-24.840')] [2022-07-10 03:58:16,262][26022] Updated weights on worker 0-0, policy_version 557167 (0.00088) [2022-07-10 03:58:18,191][26022] Updated weights on worker 0-0, policy_version 557177 (0.00089) [2022-07-10 03:58:19,749][25689] Fps is (10 sec: 5417.8, 60 sec: 5585.2, 300 sec: 5614.7). Total num frames: 570557440. Throughput: 0: 4956.7. Samples: 570551854. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:19,750][25689] Avg episode reward: [(0, '-25.312')] [2022-07-10 03:58:19,991][26022] Updated weights on worker 0-0, policy_version 557187 (0.00089) [2022-07-10 03:58:21,729][26022] Updated weights on worker 0-0, policy_version 557197 (0.00102) [2022-07-10 03:58:23,560][26022] Updated weights on worker 0-0, policy_version 557207 (0.00085) [2022-07-10 03:58:24,753][25689] Fps is (10 sec: 5730.5, 60 sec: 5621.8, 300 sec: 5615.6). Total num frames: 570587136. Throughput: 0: 5902.4. Samples: 570586058. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:24,754][25689] Avg episode reward: [(0, '-25.711')] [2022-07-10 03:58:25,293][26022] Updated weights on worker 0-0, policy_version 557217 (0.00082) [2022-07-10 03:58:27,224][26022] Updated weights on worker 0-0, policy_version 557227 (0.00094) [2022-07-10 03:58:28,972][26022] Updated weights on worker 0-0, policy_version 557237 (0.00089) [2022-07-10 03:58:29,832][25689] Fps is (10 sec: 5585.8, 60 sec: 5592.3, 300 sec: 5615.7). Total num frames: 570613760. Throughput: 0: 5898.6. Samples: 570619610. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:29,833][25689] Avg episode reward: [(0, '-24.251')] [2022-07-10 03:58:30,958][26022] Updated weights on worker 0-0, policy_version 557247 (0.00092) [2022-07-10 03:58:32,499][26022] Updated weights on worker 0-0, policy_version 557257 (0.00091) [2022-07-10 03:58:34,601][26022] Updated weights on worker 0-0, policy_version 557267 (0.00089) [2022-07-10 03:58:34,850][25689] Fps is (10 sec: 5679.4, 60 sec: 5612.7, 300 sec: 5616.3). Total num frames: 570644480. Throughput: 0: 5054.5. Samples: 570636542. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:34,851][25689] Avg episode reward: [(0, '-24.916')] [2022-07-10 03:58:36,152][26022] Updated weights on worker 0-0, policy_version 557277 (0.00098) [2022-07-10 03:58:37,941][26022] Updated weights on worker 0-0, policy_version 557287 (0.00087) [2022-07-10 03:58:39,915][25689] Fps is (10 sec: 5687.4, 60 sec: 5608.3, 300 sec: 5612.3). Total num frames: 570671104. Throughput: 0: 5885.5. Samples: 570670536. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:39,915][25689] Avg episode reward: [(0, '-24.878')] [2022-07-10 03:58:40,011][26022] Updated weights on worker 0-0, policy_version 557297 (0.00096) [2022-07-10 03:58:41,674][26022] Updated weights on worker 0-0, policy_version 557307 (0.00084) [2022-07-10 03:58:43,417][26022] Updated weights on worker 0-0, policy_version 557317 (0.00085) [2022-07-10 03:58:44,966][25689] Fps is (10 sec: 5567.5, 60 sec: 5605.8, 300 sec: 5617.2). Total num frames: 570700800. Throughput: 0: 5869.6. Samples: 570704698. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:44,967][25689] Avg episode reward: [(0, '-25.570')] [2022-07-10 03:58:45,508][26022] Updated weights on worker 0-0, policy_version 557327 (0.00087) [2022-07-10 03:58:47,005][26022] Updated weights on worker 0-0, policy_version 557337 (0.00393) [2022-07-10 03:58:49,064][26022] Updated weights on worker 0-0, policy_version 557347 (0.00091) [2022-07-10 03:58:50,038][25689] Fps is (10 sec: 5766.0, 60 sec: 5626.5, 300 sec: 5612.9). Total num frames: 570729472. Throughput: 0: 5897.3. Samples: 570738768. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:50,038][25689] Avg episode reward: [(0, '-24.971')] [2022-07-10 03:58:50,394][26022] Updated weights on worker 0-0, policy_version 557357 (0.00081) [2022-07-10 03:58:52,652][26022] Updated weights on worker 0-0, policy_version 557367 (0.00085) [2022-07-10 03:58:54,304][26022] Updated weights on worker 0-0, policy_version 557377 (0.00080) [2022-07-10 03:58:55,081][25689] Fps is (10 sec: 5466.9, 60 sec: 5591.3, 300 sec: 5609.5). Total num frames: 570756096. Throughput: 0: 5877.7. Samples: 570755452. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:58:55,081][25689] Avg episode reward: [(0, '-25.146')] [2022-07-10 03:58:56,068][26022] Updated weights on worker 0-0, policy_version 557387 (0.00081) [2022-07-10 03:58:57,983][26022] Updated weights on worker 0-0, policy_version 557397 (0.00088) [2022-07-10 03:58:59,995][26022] Updated weights on worker 0-0, policy_version 557407 (0.00083) [2022-07-10 03:59:00,103][25689] Fps is (10 sec: 5493.5, 60 sec: 5591.1, 300 sec: 5619.7). Total num frames: 570784768. Throughput: 0: 5909.5. Samples: 570789840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:00,104][25689] Avg episode reward: [(0, '-25.749')] [2022-07-10 03:59:01,758][26022] Updated weights on worker 0-0, policy_version 557417 (0.00090) [2022-07-10 03:59:04,011][26022] Updated weights on worker 0-0, policy_version 557427 (0.00101) [2022-07-10 03:59:05,201][25689] Fps is (10 sec: 5666.2, 60 sec: 5632.9, 300 sec: 5613.1). Total num frames: 570813440. Throughput: 0: 5771.7. Samples: 570821488. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:05,202][25689] Avg episode reward: [(0, '-25.310')] [2022-07-10 03:59:05,330][26022] Updated weights on worker 0-0, policy_version 557437 (0.00086) [2022-07-10 03:59:07,688][26022] Updated weights on worker 0-0, policy_version 557447 (0.00091) [2022-07-10 03:59:09,134][26022] Updated weights on worker 0-0, policy_version 557457 (0.00084) [2022-07-10 03:59:10,249][25689] Fps is (10 sec: 5551.0, 60 sec: 5583.9, 300 sec: 5616.3). Total num frames: 570841088. Throughput: 0: 4925.2. Samples: 570838314. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:10,250][25689] Avg episode reward: [(0, '-25.641')] [2022-07-10 03:59:11,245][26022] Updated weights on worker 0-0, policy_version 557467 (0.00085) [2022-07-10 03:59:12,842][26022] Updated weights on worker 0-0, policy_version 557477 (0.00090) [2022-07-10 03:59:14,632][26022] Updated weights on worker 0-0, policy_version 557487 (0.00096) [2022-07-10 03:59:15,331][25689] Fps is (10 sec: 5458.7, 60 sec: 5594.6, 300 sec: 5613.1). Total num frames: 570868736. Throughput: 0: 5766.8. Samples: 570872230. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:15,332][25689] Avg episode reward: [(0, '-25.372')] [2022-07-10 03:59:16,538][26022] Updated weights on worker 0-0, policy_version 557497 (0.00085) [2022-07-10 03:59:18,483][26022] Updated weights on worker 0-0, policy_version 557507 (0.00100) [2022-07-10 03:59:20,197][26022] Updated weights on worker 0-0, policy_version 557517 (0.00619) [2022-07-10 03:59:20,394][25689] Fps is (10 sec: 5551.4, 60 sec: 5605.9, 300 sec: 5612.8). Total num frames: 570897408. Throughput: 0: 5721.3. Samples: 570905932. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:20,399][25689] Avg episode reward: [(0, '-25.012')] [2022-07-10 03:59:22,138][26022] Updated weights on worker 0-0, policy_version 557527 (0.00090) [2022-07-10 03:59:23,719][26022] Updated weights on worker 0-0, policy_version 557537 (0.00083) [2022-07-10 03:59:25,450][25689] Fps is (10 sec: 5666.8, 60 sec: 5584.2, 300 sec: 5614.0). Total num frames: 570926080. Throughput: 0: 5014.8. Samples: 570923034. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:25,451][25689] Avg episode reward: [(0, '-24.994')] [2022-07-10 03:59:25,822][26022] Updated weights on worker 0-0, policy_version 557547 (0.00096) [2022-07-10 03:59:27,298][26022] Updated weights on worker 0-0, policy_version 557557 (0.00088) [2022-07-10 03:59:29,464][26022] Updated weights on worker 0-0, policy_version 557567 (0.00089) [2022-07-10 03:59:30,570][25689] Fps is (10 sec: 5836.9, 60 sec: 5647.9, 300 sec: 5619.0). Total num frames: 570956800. Throughput: 0: 5828.3. Samples: 570956748. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:30,570][25689] Avg episode reward: [(0, '-25.016')] [2022-07-10 03:59:30,962][26022] Updated weights on worker 0-0, policy_version 557577 (0.00092) [2022-07-10 03:59:33,066][26022] Updated weights on worker 0-0, policy_version 557587 (0.00087) [2022-07-10 03:59:34,615][26022] Updated weights on worker 0-0, policy_version 557597 (0.00092) [2022-07-10 03:59:35,595][25689] Fps is (10 sec: 5652.6, 60 sec: 5579.8, 300 sec: 5612.5). Total num frames: 570983424. Throughput: 0: 5849.0. Samples: 570990754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 03:59:35,596][25689] Avg episode reward: [(0, '-24.593')] [2022-07-10 03:59:36,604][26022] Updated weights on worker 0-0, policy_version 557607 (0.00087) [2022-07-10 03:59:38,231][26022] Updated weights on worker 0-0, policy_version 557617 (0.00099) [2022-07-10 03:59:40,227][26022] Updated weights on worker 0-0, policy_version 557627 (0.00091) [2022-07-10 03:59:40,645][25689] Fps is (10 sec: 5589.6, 60 sec: 5631.7, 300 sec: 5615.3). Total num frames: 571013120. Throughput: 0: 5032.2. Samples: 571007840. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 03:59:40,646][25689] Avg episode reward: [(0, '-24.342')] [2022-07-10 03:59:41,895][26022] Updated weights on worker 0-0, policy_version 557637 (0.00084) [2022-07-10 03:59:43,886][26022] Updated weights on worker 0-0, policy_version 557647 (0.00096) [2022-07-10 03:59:45,329][26022] Updated weights on worker 0-0, policy_version 557657 (0.00085) [2022-07-10 03:59:45,681][25689] Fps is (10 sec: 5787.3, 60 sec: 5616.3, 300 sec: 5615.3). Total num frames: 571041792. Throughput: 0: 5883.8. Samples: 571042064. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 03:59:45,681][25689] Avg episode reward: [(0, '-24.397')] [2022-07-10 03:59:47,659][26022] Updated weights on worker 0-0, policy_version 557667 (0.00075) [2022-07-10 03:59:49,017][26022] Updated weights on worker 0-0, policy_version 557677 (0.00085) [2022-07-10 03:59:50,766][25689] Fps is (10 sec: 5463.7, 60 sec: 5581.3, 300 sec: 5607.5). Total num frames: 571068416. Throughput: 0: 5878.9. Samples: 571075482. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 03:59:50,767][25689] Avg episode reward: [(0, '-25.460')] [2022-07-10 03:59:51,167][26022] Updated weights on worker 0-0, policy_version 557687 (0.00085) [2022-07-10 03:59:52,973][26022] Updated weights on worker 0-0, policy_version 557697 (0.00090) [2022-07-10 03:59:54,678][26022] Updated weights on worker 0-0, policy_version 557707 (0.00087) [2022-07-10 03:59:55,771][25689] Fps is (10 sec: 5581.7, 60 sec: 5635.5, 300 sec: 5607.4). Total num frames: 571098112. Throughput: 0: 5043.0. Samples: 571092500. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 03:59:55,772][25689] Avg episode reward: [(0, '-25.394')] [2022-07-10 03:59:56,436][26022] Updated weights on worker 0-0, policy_version 557717 (0.00086) [2022-07-10 03:59:57,516][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 03:59:57,527][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000557723_571108352.pth [2022-07-10 03:59:57,529][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000555748_569085952.pth [2022-07-10 03:59:58,324][26022] Updated weights on worker 0-0, policy_version 557727 (0.00089) [2022-07-10 04:00:00,056][26022] Updated weights on worker 0-0, policy_version 557737 (0.00087) [2022-07-10 04:00:00,838][25689] Fps is (10 sec: 5795.3, 60 sec: 5631.3, 300 sec: 5617.8). Total num frames: 571126784. Throughput: 0: 5887.0. Samples: 571126712. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:00,838][25689] Avg episode reward: [(0, '-26.065')] [2022-07-10 04:00:02,304][26022] Updated weights on worker 0-0, policy_version 557747 (0.00092) [2022-07-10 04:00:04,039][26022] Updated weights on worker 0-0, policy_version 557757 (0.00093) [2022-07-10 04:00:05,881][25689] Fps is (10 sec: 5368.0, 60 sec: 5585.8, 300 sec: 5604.3). Total num frames: 571152384. Throughput: 0: 5773.3. Samples: 571158686. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:05,881][25689] Avg episode reward: [(0, '-25.690')] [2022-07-10 04:00:06,016][26022] Updated weights on worker 0-0, policy_version 557767 (0.00087) [2022-07-10 04:00:07,419][26022] Updated weights on worker 0-0, policy_version 557777 (0.00092) [2022-07-10 04:00:09,687][26022] Updated weights on worker 0-0, policy_version 557787 (0.00083) [2022-07-10 04:00:10,930][25689] Fps is (10 sec: 5580.5, 60 sec: 5636.3, 300 sec: 5617.7). Total num frames: 571183104. Throughput: 0: 4977.7. Samples: 571175852. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:10,931][25689] Avg episode reward: [(0, '-24.790')] [2022-07-10 04:00:11,035][26022] Updated weights on worker 0-0, policy_version 557797 (0.00093) [2022-07-10 04:00:13,161][26022] Updated weights on worker 0-0, policy_version 557807 (0.00090) [2022-07-10 04:00:14,471][26022] Updated weights on worker 0-0, policy_version 557817 (0.00085) [2022-07-10 04:00:15,980][25689] Fps is (10 sec: 5678.0, 60 sec: 5622.4, 300 sec: 5611.3). Total num frames: 571209728. Throughput: 0: 5830.0. Samples: 571210322. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:15,981][25689] Avg episode reward: [(0, '-24.840')] [2022-07-10 04:00:16,775][26022] Updated weights on worker 0-0, policy_version 557827 (0.00089) [2022-07-10 04:00:18,499][26022] Updated weights on worker 0-0, policy_version 557837 (0.00078) [2022-07-10 04:00:20,309][26022] Updated weights on worker 0-0, policy_version 557847 (0.00090) [2022-07-10 04:00:20,995][25689] Fps is (10 sec: 5595.9, 60 sec: 5643.8, 300 sec: 5618.4). Total num frames: 571239424. Throughput: 0: 5829.2. Samples: 571244210. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:20,995][25689] Avg episode reward: [(0, '-24.961')] [2022-07-10 04:00:22,158][26022] Updated weights on worker 0-0, policy_version 557857 (0.00736) [2022-07-10 04:00:23,818][26022] Updated weights on worker 0-0, policy_version 557867 (0.00091) [2022-07-10 04:00:25,691][26022] Updated weights on worker 0-0, policy_version 557877 (0.00089) [2022-07-10 04:00:26,046][25689] Fps is (10 sec: 5697.2, 60 sec: 5627.4, 300 sec: 5615.2). Total num frames: 571267072. Throughput: 0: 5083.7. Samples: 571261196. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:26,046][25689] Avg episode reward: [(0, '-24.857')] [2022-07-10 04:00:27,522][26022] Updated weights on worker 0-0, policy_version 557887 (0.00084) [2022-07-10 04:00:29,248][26022] Updated weights on worker 0-0, policy_version 557897 (0.00087) [2022-07-10 04:00:31,129][25689] Fps is (10 sec: 5557.5, 60 sec: 5596.9, 300 sec: 5610.3). Total num frames: 571295744. Throughput: 0: 5881.6. Samples: 571294652. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:31,129][25689] Avg episode reward: [(0, '-24.011')] [2022-07-10 04:00:31,187][26022] Updated weights on worker 0-0, policy_version 557907 (0.00430) [2022-07-10 04:00:33,012][26022] Updated weights on worker 0-0, policy_version 557917 (0.00112) [2022-07-10 04:00:35,032][26022] Updated weights on worker 0-0, policy_version 557927 (0.00087) [2022-07-10 04:00:36,151][25689] Fps is (10 sec: 5674.7, 60 sec: 5631.1, 300 sec: 5610.8). Total num frames: 571324416. Throughput: 0: 5867.8. Samples: 571328678. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:36,151][25689] Avg episode reward: [(0, '-25.068')] [2022-07-10 04:00:36,595][26022] Updated weights on worker 0-0, policy_version 557937 (0.00093) [2022-07-10 04:00:38,551][26022] Updated weights on worker 0-0, policy_version 557947 (0.00094) [2022-07-10 04:00:40,125][26022] Updated weights on worker 0-0, policy_version 557957 (0.00087) [2022-07-10 04:00:41,153][25689] Fps is (10 sec: 5618.2, 60 sec: 5601.7, 300 sec: 5608.6). Total num frames: 571352064. Throughput: 0: 5029.2. Samples: 571345592. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:41,154][25689] Avg episode reward: [(0, '-25.706')] [2022-07-10 04:00:42,192][26022] Updated weights on worker 0-0, policy_version 557967 (0.00085) [2022-07-10 04:00:43,951][26022] Updated weights on worker 0-0, policy_version 557977 (0.00098) [2022-07-10 04:00:45,785][26022] Updated weights on worker 0-0, policy_version 557987 (0.00086) [2022-07-10 04:00:46,186][25689] Fps is (10 sec: 5612.0, 60 sec: 5601.9, 300 sec: 5613.5). Total num frames: 571380736. Throughput: 0: 5876.3. Samples: 571379550. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:46,188][25689] Avg episode reward: [(0, '-25.498')] [2022-07-10 04:00:47,480][26022] Updated weights on worker 0-0, policy_version 557997 (0.00088) [2022-07-10 04:00:49,427][26022] Updated weights on worker 0-0, policy_version 558007 (0.00093) [2022-07-10 04:00:51,247][25689] Fps is (10 sec: 5579.6, 60 sec: 5621.1, 300 sec: 5613.0). Total num frames: 571408384. Throughput: 0: 5883.3. Samples: 571413016. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:51,248][25689] Avg episode reward: [(0, '-25.249')] [2022-07-10 04:00:51,315][26022] Updated weights on worker 0-0, policy_version 558017 (0.00093) [2022-07-10 04:00:53,220][26022] Updated weights on worker 0-0, policy_version 558027 (0.01422) [2022-07-10 04:00:54,903][26022] Updated weights on worker 0-0, policy_version 558037 (0.00084) [2022-07-10 04:00:56,287][25689] Fps is (10 sec: 5576.1, 60 sec: 5600.9, 300 sec: 5609.5). Total num frames: 571437056. Throughput: 0: 5036.8. Samples: 571430098. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:00:56,287][25689] Avg episode reward: [(0, '-25.753')] [2022-07-10 04:00:56,642][26022] Updated weights on worker 0-0, policy_version 558047 (0.00091) [2022-07-10 04:00:58,416][26022] Updated weights on worker 0-0, policy_version 558057 (0.00084) [2022-07-10 04:01:00,114][26022] Updated weights on worker 0-0, policy_version 558067 (0.00050) [2022-07-10 04:01:01,307][25689] Fps is (10 sec: 5802.2, 60 sec: 5622.2, 300 sec: 5616.4). Total num frames: 571466752. Throughput: 0: 5898.0. Samples: 571464458. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:01,307][25689] Avg episode reward: [(0, '-25.221')] [2022-07-10 04:01:02,510][26022] Updated weights on worker 0-0, policy_version 558077 (0.00083) [2022-07-10 04:01:04,251][26022] Updated weights on worker 0-0, policy_version 558087 (0.00087) [2022-07-10 04:01:06,077][26022] Updated weights on worker 0-0, policy_version 558097 (0.00097) [2022-07-10 04:01:06,313][25689] Fps is (10 sec: 5515.1, 60 sec: 5625.6, 300 sec: 5617.8). Total num frames: 571492352. Throughput: 0: 5799.5. Samples: 571496274. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:06,315][25689] Avg episode reward: [(0, '-25.554')] [2022-07-10 04:01:07,792][26022] Updated weights on worker 0-0, policy_version 558107 (0.00087) [2022-07-10 04:01:09,802][26022] Updated weights on worker 0-0, policy_version 558117 (0.00085) [2022-07-10 04:01:11,452][25689] Fps is (10 sec: 5349.4, 60 sec: 5583.4, 300 sec: 5608.8). Total num frames: 571521024. Throughput: 0: 5807.2. Samples: 571530350. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:11,453][25689] Avg episode reward: [(0, '-25.328')] [2022-07-10 04:01:11,528][26022] Updated weights on worker 0-0, policy_version 558127 (0.00092) [2022-07-10 04:01:13,334][26022] Updated weights on worker 0-0, policy_version 558137 (0.00085) [2022-07-10 04:01:15,128][26022] Updated weights on worker 0-0, policy_version 558147 (0.00083) [2022-07-10 04:01:16,476][25689] Fps is (10 sec: 5642.4, 60 sec: 5619.7, 300 sec: 5608.7). Total num frames: 571549696. Throughput: 0: 5815.9. Samples: 571547518. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:16,477][25689] Avg episode reward: [(0, '-26.865')] [2022-07-10 04:01:17,031][26022] Updated weights on worker 0-0, policy_version 558157 (0.00090) [2022-07-10 04:01:18,739][26022] Updated weights on worker 0-0, policy_version 558167 (0.00096) [2022-07-10 04:01:20,548][26022] Updated weights on worker 0-0, policy_version 558177 (0.00081) [2022-07-10 04:01:21,499][25689] Fps is (10 sec: 5605.5, 60 sec: 5585.0, 300 sec: 5608.4). Total num frames: 571577344. Throughput: 0: 5786.5. Samples: 571581302. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:21,501][25689] Avg episode reward: [(0, '-27.181')] [2022-07-10 04:01:22,443][26022] Updated weights on worker 0-0, policy_version 558187 (0.00083) [2022-07-10 04:01:24,170][26022] Updated weights on worker 0-0, policy_version 558197 (0.00095) [2022-07-10 04:01:26,057][26022] Updated weights on worker 0-0, policy_version 558207 (0.00081) [2022-07-10 04:01:26,502][25689] Fps is (10 sec: 5617.5, 60 sec: 5606.4, 300 sec: 5606.4). Total num frames: 571606016. Throughput: 0: 5901.6. Samples: 571615422. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:26,503][25689] Avg episode reward: [(0, '-27.614')] [2022-07-10 04:01:27,803][26022] Updated weights on worker 0-0, policy_version 558217 (0.00094) [2022-07-10 04:01:29,707][26022] Updated weights on worker 0-0, policy_version 558227 (0.00089) [2022-07-10 04:01:31,463][26022] Updated weights on worker 0-0, policy_version 558237 (0.00094) [2022-07-10 04:01:31,631][25689] Fps is (10 sec: 5760.7, 60 sec: 5619.0, 300 sec: 5615.8). Total num frames: 571635712. Throughput: 0: 5049.7. Samples: 571632250. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:31,632][25689] Avg episode reward: [(0, '-27.322')] [2022-07-10 04:01:33,227][26022] Updated weights on worker 0-0, policy_version 558247 (0.00080) [2022-07-10 04:01:34,968][26022] Updated weights on worker 0-0, policy_version 558257 (0.00096) [2022-07-10 04:01:36,704][25689] Fps is (10 sec: 5721.2, 60 sec: 5614.4, 300 sec: 5608.9). Total num frames: 571664384. Throughput: 0: 5867.1. Samples: 571666196. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:36,704][25689] Avg episode reward: [(0, '-27.374')] [2022-07-10 04:01:36,859][26022] Updated weights on worker 0-0, policy_version 558267 (0.00094) [2022-07-10 04:01:38,912][26022] Updated weights on worker 0-0, policy_version 558277 (0.00084) [2022-07-10 04:01:40,531][26022] Updated weights on worker 0-0, policy_version 558287 (0.00086) [2022-07-10 04:01:41,728][25689] Fps is (10 sec: 5476.9, 60 sec: 5595.5, 300 sec: 5608.9). Total num frames: 571691008. Throughput: 0: 5868.2. Samples: 571700006. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:41,728][25689] Avg episode reward: [(0, '-25.709')] [2022-07-10 04:01:42,359][26022] Updated weights on worker 0-0, policy_version 558297 (0.00086) [2022-07-10 04:01:44,047][26022] Updated weights on worker 0-0, policy_version 558307 (0.00085) [2022-07-10 04:01:45,987][26022] Updated weights on worker 0-0, policy_version 558317 (0.00089) [2022-07-10 04:01:46,755][25689] Fps is (10 sec: 5603.2, 60 sec: 5612.9, 300 sec: 5613.6). Total num frames: 571720704. Throughput: 0: 5026.6. Samples: 571717228. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:46,756][25689] Avg episode reward: [(0, '-26.341')] [2022-07-10 04:01:47,878][26022] Updated weights on worker 0-0, policy_version 558327 (0.00086) [2022-07-10 04:01:49,723][26022] Updated weights on worker 0-0, policy_version 558337 (0.00086) [2022-07-10 04:01:51,287][26022] Updated weights on worker 0-0, policy_version 558347 (0.00097) [2022-07-10 04:01:51,866][25689] Fps is (10 sec: 5757.4, 60 sec: 5625.2, 300 sec: 5604.8). Total num frames: 571749376. Throughput: 0: 5864.6. Samples: 571750916. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:51,866][25689] Avg episode reward: [(0, '-26.114')] [2022-07-10 04:01:53,271][26022] Updated weights on worker 0-0, policy_version 558357 (0.00100) [2022-07-10 04:01:54,886][26022] Updated weights on worker 0-0, policy_version 558367 (0.00086) [2022-07-10 04:01:56,875][25689] Fps is (10 sec: 5666.4, 60 sec: 5628.0, 300 sec: 5612.0). Total num frames: 571778048. Throughput: 0: 5895.5. Samples: 571785116. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:01:56,876][25689] Avg episode reward: [(0, '-25.683')] [2022-07-10 04:01:56,878][26022] Updated weights on worker 0-0, policy_version 558377 (0.00088) [2022-07-10 04:01:57,555][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:01:57,564][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000558382_571783168.pth [2022-07-10 04:01:57,565][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000556407_569760768.pth [2022-07-10 04:01:58,624][26022] Updated weights on worker 0-0, policy_version 558387 (0.00082) [2022-07-10 04:02:00,409][26022] Updated weights on worker 0-0, policy_version 558397 (0.00087) [2022-07-10 04:02:01,898][25689] Fps is (10 sec: 5613.5, 60 sec: 5593.9, 300 sec: 5615.1). Total num frames: 571805696. Throughput: 0: 5073.0. Samples: 571802332. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:01,899][25689] Avg episode reward: [(0, '-25.650')] [2022-07-10 04:02:02,645][26022] Updated weights on worker 0-0, policy_version 558407 (0.00086) [2022-07-10 04:02:04,159][26022] Updated weights on worker 0-0, policy_version 558417 (0.00089) [2022-07-10 04:02:06,170][26022] Updated weights on worker 0-0, policy_version 558427 (0.00095) [2022-07-10 04:02:06,996][25689] Fps is (10 sec: 5463.5, 60 sec: 5619.2, 300 sec: 5611.7). Total num frames: 571833344. Throughput: 0: 5802.2. Samples: 571834670. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:06,997][25689] Avg episode reward: [(0, '-25.710')] [2022-07-10 04:02:07,837][26022] Updated weights on worker 0-0, policy_version 558437 (0.00087) [2022-07-10 04:02:09,912][26022] Updated weights on worker 0-0, policy_version 558447 (0.00085) [2022-07-10 04:02:11,582][26022] Updated weights on worker 0-0, policy_version 558457 (0.00092) [2022-07-10 04:02:12,067][25689] Fps is (10 sec: 5639.1, 60 sec: 5642.4, 300 sec: 5617.5). Total num frames: 571863040. Throughput: 0: 5825.1. Samples: 571868594. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:12,069][25689] Avg episode reward: [(0, '-26.174')] [2022-07-10 04:02:13,463][26022] Updated weights on worker 0-0, policy_version 558467 (0.00053) [2022-07-10 04:02:15,036][26022] Updated weights on worker 0-0, policy_version 558477 (0.00094) [2022-07-10 04:02:17,008][26022] Updated weights on worker 0-0, policy_version 558487 (0.00093) [2022-07-10 04:02:17,084][25689] Fps is (10 sec: 5684.5, 60 sec: 5626.2, 300 sec: 5611.0). Total num frames: 571890688. Throughput: 0: 4974.1. Samples: 571885636. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:17,086][25689] Avg episode reward: [(0, '-25.536')] [2022-07-10 04:02:18,714][26022] Updated weights on worker 0-0, policy_version 558497 (0.00085) [2022-07-10 04:02:20,685][26022] Updated weights on worker 0-0, policy_version 558507 (0.00089) [2022-07-10 04:02:22,136][25689] Fps is (10 sec: 5491.7, 60 sec: 5623.5, 300 sec: 5610.7). Total num frames: 571918336. Throughput: 0: 5791.0. Samples: 571919530. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:22,137][25689] Avg episode reward: [(0, '-26.214')] [2022-07-10 04:02:22,355][26022] Updated weights on worker 0-0, policy_version 558517 (0.00098) [2022-07-10 04:02:24,340][26022] Updated weights on worker 0-0, policy_version 558527 (0.00089) [2022-07-10 04:02:25,983][26022] Updated weights on worker 0-0, policy_version 558537 (0.00097) [2022-07-10 04:02:27,180][25689] Fps is (10 sec: 5679.7, 60 sec: 5636.6, 300 sec: 5615.6). Total num frames: 571948032. Throughput: 0: 5881.5. Samples: 571953382. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:27,181][25689] Avg episode reward: [(0, '-25.931')] [2022-07-10 04:02:28,035][26022] Updated weights on worker 0-0, policy_version 558547 (0.00084) [2022-07-10 04:02:29,672][26022] Updated weights on worker 0-0, policy_version 558557 (0.00092) [2022-07-10 04:02:31,407][26022] Updated weights on worker 0-0, policy_version 558567 (0.00078) [2022-07-10 04:02:32,317][25689] Fps is (10 sec: 5733.5, 60 sec: 5619.0, 300 sec: 5610.7). Total num frames: 571976704. Throughput: 0: 5029.6. Samples: 571970440. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:32,317][25689] Avg episode reward: [(0, '-25.955')] [2022-07-10 04:02:33,403][26022] Updated weights on worker 0-0, policy_version 558577 (0.00092) [2022-07-10 04:02:34,983][26022] Updated weights on worker 0-0, policy_version 558587 (0.00094) [2022-07-10 04:02:37,004][26022] Updated weights on worker 0-0, policy_version 558597 (0.00086) [2022-07-10 04:02:37,374][25689] Fps is (10 sec: 5725.7, 60 sec: 5637.3, 300 sec: 5620.2). Total num frames: 572006400. Throughput: 0: 5868.2. Samples: 572004704. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 04:02:37,375][25689] Avg episode reward: [(0, '-26.500')] [2022-07-10 04:02:38,514][26022] Updated weights on worker 0-0, policy_version 558607 (0.00100) [2022-07-10 04:02:40,560][26022] Updated weights on worker 0-0, policy_version 558617 (0.00088) [2022-07-10 04:02:42,245][26022] Updated weights on worker 0-0, policy_version 558627 (0.00093) [2022-07-10 04:02:42,452][25689] Fps is (10 sec: 5758.7, 60 sec: 5666.0, 300 sec: 5615.8). Total num frames: 572035072. Throughput: 0: 5869.9. Samples: 572038782. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:02:42,453][25689] Avg episode reward: [(0, '-25.885')] [2022-07-10 04:02:44,047][26022] Updated weights on worker 0-0, policy_version 558637 (0.00085) [2022-07-10 04:02:45,816][26022] Updated weights on worker 0-0, policy_version 558647 (0.00097) [2022-07-10 04:02:47,522][25689] Fps is (10 sec: 5650.8, 60 sec: 5645.2, 300 sec: 5620.0). Total num frames: 572063744. Throughput: 0: 5868.1. Samples: 572072750. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:02:47,522][25689] Avg episode reward: [(0, '-27.768')] [2022-07-10 04:02:47,636][26022] Updated weights on worker 0-0, policy_version 558657 (0.00087) [2022-07-10 04:02:49,606][26022] Updated weights on worker 0-0, policy_version 558667 (0.00088) [2022-07-10 04:02:51,616][26022] Updated weights on worker 0-0, policy_version 558677 (0.00090) [2022-07-10 04:02:52,635][25689] Fps is (10 sec: 5631.6, 60 sec: 5645.0, 300 sec: 5618.4). Total num frames: 572092416. Throughput: 0: 5846.0. Samples: 572089220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:02:52,635][25689] Avg episode reward: [(0, '-27.001')] [2022-07-10 04:02:53,149][26022] Updated weights on worker 0-0, policy_version 558687 (0.00092) [2022-07-10 04:02:55,212][26022] Updated weights on worker 0-0, policy_version 558697 (0.00089) [2022-07-10 04:02:56,788][26022] Updated weights on worker 0-0, policy_version 558707 (0.00079) [2022-07-10 04:02:57,671][25689] Fps is (10 sec: 5549.3, 60 sec: 5625.6, 300 sec: 5614.7). Total num frames: 572120064. Throughput: 0: 5836.0. Samples: 572123158. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:02:57,673][25689] Avg episode reward: [(0, '-27.244')] [2022-07-10 04:02:58,807][26022] Updated weights on worker 0-0, policy_version 558717 (0.00102) [2022-07-10 04:03:00,344][26022] Updated weights on worker 0-0, policy_version 558727 (0.01460) [2022-07-10 04:03:02,690][25689] Fps is (10 sec: 5295.5, 60 sec: 5592.4, 300 sec: 5614.4). Total num frames: 572145664. Throughput: 0: 5754.1. Samples: 572155232. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:02,691][25689] Avg episode reward: [(0, '-26.758')] [2022-07-10 04:03:02,779][26022] Updated weights on worker 0-0, policy_version 558737 (0.00087) [2022-07-10 04:03:04,424][26022] Updated weights on worker 0-0, policy_version 558747 (0.00087) [2022-07-10 04:03:06,336][26022] Updated weights on worker 0-0, policy_version 558757 (0.00090) [2022-07-10 04:03:07,701][25689] Fps is (10 sec: 5411.3, 60 sec: 5617.3, 300 sec: 5608.5). Total num frames: 572174336. Throughput: 0: 4931.1. Samples: 572172252. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:07,701][25689] Avg episode reward: [(0, '-25.530')] [2022-07-10 04:03:07,995][26022] Updated weights on worker 0-0, policy_version 558767 (0.00086) [2022-07-10 04:03:10,035][26022] Updated weights on worker 0-0, policy_version 558777 (0.00086) [2022-07-10 04:03:11,621][26022] Updated weights on worker 0-0, policy_version 558787 (0.00095) [2022-07-10 04:03:12,788][25689] Fps is (10 sec: 5780.0, 60 sec: 5615.7, 300 sec: 5617.5). Total num frames: 572204032. Throughput: 0: 5810.4. Samples: 572206320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:12,789][25689] Avg episode reward: [(0, '-26.227')] [2022-07-10 04:03:13,664][26022] Updated weights on worker 0-0, policy_version 558797 (0.00086) [2022-07-10 04:03:15,139][26022] Updated weights on worker 0-0, policy_version 558807 (0.00091) [2022-07-10 04:03:17,240][26022] Updated weights on worker 0-0, policy_version 558817 (0.00083) [2022-07-10 04:03:17,823][25689] Fps is (10 sec: 5665.2, 60 sec: 5614.1, 300 sec: 5616.9). Total num frames: 572231680. Throughput: 0: 5822.7. Samples: 572240494. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:17,823][25689] Avg episode reward: [(0, '-25.226')] [2022-07-10 04:03:18,965][26022] Updated weights on worker 0-0, policy_version 558827 (0.00096) [2022-07-10 04:03:20,951][26022] Updated weights on worker 0-0, policy_version 558837 (0.00094) [2022-07-10 04:03:22,566][26022] Updated weights on worker 0-0, policy_version 558847 (0.00090) [2022-07-10 04:03:22,847][25689] Fps is (10 sec: 5598.9, 60 sec: 5633.5, 300 sec: 5613.1). Total num frames: 572260352. Throughput: 0: 5065.9. Samples: 572257346. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:22,848][25689] Avg episode reward: [(0, '-25.363')] [2022-07-10 04:03:24,479][26022] Updated weights on worker 0-0, policy_version 558857 (0.00093) [2022-07-10 04:03:26,335][26022] Updated weights on worker 0-0, policy_version 558867 (0.00103) [2022-07-10 04:03:27,863][25689] Fps is (10 sec: 5711.5, 60 sec: 5619.3, 300 sec: 5621.1). Total num frames: 572289024. Throughput: 0: 5898.1. Samples: 572291172. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:27,863][25689] Avg episode reward: [(0, '-25.340')] [2022-07-10 04:03:27,943][26022] Updated weights on worker 0-0, policy_version 558877 (0.00093) [2022-07-10 04:03:30,079][26022] Updated weights on worker 0-0, policy_version 558887 (0.00084) [2022-07-10 04:03:31,535][26022] Updated weights on worker 0-0, policy_version 558897 (0.00086) [2022-07-10 04:03:32,915][25689] Fps is (10 sec: 5492.3, 60 sec: 5593.3, 300 sec: 5606.7). Total num frames: 572315648. Throughput: 0: 5903.1. Samples: 572325132. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:32,916][25689] Avg episode reward: [(0, '-25.014')] [2022-07-10 04:03:33,551][26022] Updated weights on worker 0-0, policy_version 558907 (0.00082) [2022-07-10 04:03:35,130][26022] Updated weights on worker 0-0, policy_version 558917 (0.00086) [2022-07-10 04:03:37,042][26022] Updated weights on worker 0-0, policy_version 558927 (0.00088) [2022-07-10 04:03:37,927][25689] Fps is (10 sec: 5596.0, 60 sec: 5597.5, 300 sec: 5618.0). Total num frames: 572345344. Throughput: 0: 5060.1. Samples: 572342224. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:37,928][25689] Avg episode reward: [(0, '-26.626')] [2022-07-10 04:03:38,979][26022] Updated weights on worker 0-0, policy_version 558937 (0.00082) [2022-07-10 04:03:40,599][26022] Updated weights on worker 0-0, policy_version 558947 (0.00095) [2022-07-10 04:03:42,519][26022] Updated weights on worker 0-0, policy_version 558957 (0.00091) [2022-07-10 04:03:42,943][25689] Fps is (10 sec: 5923.0, 60 sec: 5620.2, 300 sec: 5618.7). Total num frames: 572375040. Throughput: 0: 5926.2. Samples: 572376436. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:42,943][25689] Avg episode reward: [(0, '-25.747')] [2022-07-10 04:03:44,425][26022] Updated weights on worker 0-0, policy_version 558967 (0.00088) [2022-07-10 04:03:45,994][26022] Updated weights on worker 0-0, policy_version 558977 (0.00085) [2022-07-10 04:03:47,916][26022] Updated weights on worker 0-0, policy_version 558987 (0.00087) [2022-07-10 04:03:47,950][25689] Fps is (10 sec: 5721.1, 60 sec: 5609.0, 300 sec: 5616.5). Total num frames: 572402688. Throughput: 0: 5938.1. Samples: 572410454. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:47,951][25689] Avg episode reward: [(0, '-25.537')] [2022-07-10 04:03:49,723][26022] Updated weights on worker 0-0, policy_version 558997 (0.00086) [2022-07-10 04:03:51,662][26022] Updated weights on worker 0-0, policy_version 559007 (0.00090) [2022-07-10 04:03:53,002][25689] Fps is (10 sec: 5497.1, 60 sec: 5597.8, 300 sec: 5619.7). Total num frames: 572430336. Throughput: 0: 5084.2. Samples: 572427256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:53,002][25689] Avg episode reward: [(0, '-26.926')] [2022-07-10 04:03:53,522][26022] Updated weights on worker 0-0, policy_version 559017 (0.00088) [2022-07-10 04:03:55,194][26022] Updated weights on worker 0-0, policy_version 559027 (0.00089) [2022-07-10 04:03:57,162][26022] Updated weights on worker 0-0, policy_version 559037 (0.00084) [2022-07-10 04:03:57,675][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:03:57,685][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000559040_572456960.pth [2022-07-10 04:03:57,686][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000557065_570434560.pth [2022-07-10 04:03:57,686][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000559040_572456960.pth.milestone [2022-07-10 04:03:58,046][25689] Fps is (10 sec: 5578.9, 60 sec: 5614.0, 300 sec: 5619.3). Total num frames: 572459008. Throughput: 0: 5897.5. Samples: 572460872. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:03:58,046][25689] Avg episode reward: [(0, '-26.644')] [2022-07-10 04:03:58,806][26022] Updated weights on worker 0-0, policy_version 559047 (0.00088) [2022-07-10 04:04:00,891][26022] Updated weights on worker 0-0, policy_version 559057 (0.00089) [2022-07-10 04:04:02,663][26022] Updated weights on worker 0-0, policy_version 559067 (0.00083) [2022-07-10 04:04:03,048][25689] Fps is (10 sec: 5504.4, 60 sec: 5632.6, 300 sec: 5614.3). Total num frames: 572485632. Throughput: 0: 5797.6. Samples: 572492996. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:03,048][25689] Avg episode reward: [(0, '-25.897')] [2022-07-10 04:04:04,823][26022] Updated weights on worker 0-0, policy_version 559077 (0.00093) [2022-07-10 04:04:06,276][26022] Updated weights on worker 0-0, policy_version 559087 (0.00091) [2022-07-10 04:04:08,055][25689] Fps is (10 sec: 5319.9, 60 sec: 5599.0, 300 sec: 5611.6). Total num frames: 572512256. Throughput: 0: 4951.3. Samples: 572509996. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:08,055][25689] Avg episode reward: [(0, '-25.920')] [2022-07-10 04:04:08,472][26022] Updated weights on worker 0-0, policy_version 559097 (0.00092) [2022-07-10 04:04:10,112][26022] Updated weights on worker 0-0, policy_version 559107 (0.00093) [2022-07-10 04:04:11,947][26022] Updated weights on worker 0-0, policy_version 559117 (0.00084) [2022-07-10 04:04:13,185][25689] Fps is (10 sec: 5555.4, 60 sec: 5595.0, 300 sec: 5617.6). Total num frames: 572541952. Throughput: 0: 5739.2. Samples: 572543094. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:13,186][25689] Avg episode reward: [(0, '-25.320')] [2022-07-10 04:04:14,112][26022] Updated weights on worker 0-0, policy_version 559127 (0.00093) [2022-07-10 04:04:15,620][26022] Updated weights on worker 0-0, policy_version 559137 (0.00091) [2022-07-10 04:04:17,607][26022] Updated weights on worker 0-0, policy_version 559147 (0.00905) [2022-07-10 04:04:18,222][25689] Fps is (10 sec: 5539.5, 60 sec: 5577.8, 300 sec: 5611.2). Total num frames: 572568576. Throughput: 0: 5730.5. Samples: 572576492. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:18,222][25689] Avg episode reward: [(0, '-25.351')] [2022-07-10 04:04:19,419][26022] Updated weights on worker 0-0, policy_version 559157 (0.00084) [2022-07-10 04:04:21,196][26022] Updated weights on worker 0-0, policy_version 559167 (0.00087) [2022-07-10 04:04:23,119][26022] Updated weights on worker 0-0, policy_version 559177 (0.00088) [2022-07-10 04:04:23,256][25689] Fps is (10 sec: 5490.9, 60 sec: 5577.0, 300 sec: 5611.6). Total num frames: 572597248. Throughput: 0: 4969.1. Samples: 572593414. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:23,257][25689] Avg episode reward: [(0, '-23.819')] [2022-07-10 04:04:25,035][26022] Updated weights on worker 0-0, policy_version 559187 (0.00084) [2022-07-10 04:04:26,417][26022] Updated weights on worker 0-0, policy_version 559197 (0.00084) [2022-07-10 04:04:28,262][25689] Fps is (10 sec: 5711.3, 60 sec: 5577.8, 300 sec: 5606.9). Total num frames: 572625920. Throughput: 0: 5799.5. Samples: 572627188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:28,264][25689] Avg episode reward: [(0, '-24.099')] [2022-07-10 04:04:28,779][26022] Updated weights on worker 0-0, policy_version 559207 (0.00093) [2022-07-10 04:04:30,114][26022] Updated weights on worker 0-0, policy_version 559217 (0.00087) [2022-07-10 04:04:32,282][26022] Updated weights on worker 0-0, policy_version 559227 (0.00087) [2022-07-10 04:04:33,326][25689] Fps is (10 sec: 5694.3, 60 sec: 5610.6, 300 sec: 5613.0). Total num frames: 572654592. Throughput: 0: 5861.3. Samples: 572661146. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:33,327][25689] Avg episode reward: [(0, '-24.209')] [2022-07-10 04:04:33,793][26022] Updated weights on worker 0-0, policy_version 559237 (0.00092) [2022-07-10 04:04:35,707][26022] Updated weights on worker 0-0, policy_version 559247 (0.00081) [2022-07-10 04:04:37,487][26022] Updated weights on worker 0-0, policy_version 559257 (0.00099) [2022-07-10 04:04:38,351][25689] Fps is (10 sec: 5683.9, 60 sec: 5592.5, 300 sec: 5610.1). Total num frames: 572683264. Throughput: 0: 5051.1. Samples: 572678168. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:38,353][25689] Avg episode reward: [(0, '-26.136')] [2022-07-10 04:04:39,419][26022] Updated weights on worker 0-0, policy_version 559267 (0.00096) [2022-07-10 04:04:41,056][26022] Updated weights on worker 0-0, policy_version 559277 (0.00083) [2022-07-10 04:04:43,112][26022] Updated weights on worker 0-0, policy_version 559287 (0.00091) [2022-07-10 04:04:43,392][25689] Fps is (10 sec: 5595.1, 60 sec: 5556.2, 300 sec: 5606.5). Total num frames: 572710912. Throughput: 0: 5904.1. Samples: 572712302. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:43,393][25689] Avg episode reward: [(0, '-26.823')] [2022-07-10 04:04:44,620][26022] Updated weights on worker 0-0, policy_version 559297 (0.00091) [2022-07-10 04:04:46,630][26022] Updated weights on worker 0-0, policy_version 559307 (0.00098) [2022-07-10 04:04:48,407][25689] Fps is (10 sec: 5600.8, 60 sec: 5572.6, 300 sec: 5614.7). Total num frames: 572739584. Throughput: 0: 5921.2. Samples: 572746468. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:48,407][25689] Avg episode reward: [(0, '-27.196')] [2022-07-10 04:04:48,521][26022] Updated weights on worker 0-0, policy_version 559317 (0.00100) [2022-07-10 04:04:50,156][26022] Updated weights on worker 0-0, policy_version 559327 (0.00084) [2022-07-10 04:04:52,159][26022] Updated weights on worker 0-0, policy_version 559337 (0.00093) [2022-07-10 04:04:53,454][25689] Fps is (10 sec: 5801.0, 60 sec: 5606.8, 300 sec: 5613.9). Total num frames: 572769280. Throughput: 0: 5079.9. Samples: 572763392. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:53,454][25689] Avg episode reward: [(0, '-28.156')] [2022-07-10 04:04:53,738][26022] Updated weights on worker 0-0, policy_version 559347 (0.00093) [2022-07-10 04:04:55,661][26022] Updated weights on worker 0-0, policy_version 559357 (0.00103) [2022-07-10 04:04:57,510][26022] Updated weights on worker 0-0, policy_version 559367 (0.00094) [2022-07-10 04:04:58,475][25689] Fps is (10 sec: 5695.6, 60 sec: 5592.0, 300 sec: 5611.3). Total num frames: 572796928. Throughput: 0: 5900.6. Samples: 572796910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:04:58,475][25689] Avg episode reward: [(0, '-27.760')] [2022-07-10 04:04:59,533][26022] Updated weights on worker 0-0, policy_version 559377 (0.00092) [2022-07-10 04:05:01,016][26022] Updated weights on worker 0-0, policy_version 559387 (0.00091) [2022-07-10 04:05:03,284][26022] Updated weights on worker 0-0, policy_version 559397 (0.00083) [2022-07-10 04:05:03,478][25689] Fps is (10 sec: 5311.8, 60 sec: 5574.9, 300 sec: 5612.1). Total num frames: 572822528. Throughput: 0: 5816.8. Samples: 572829138. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:03,480][25689] Avg episode reward: [(0, '-28.078')] [2022-07-10 04:05:04,883][26022] Updated weights on worker 0-0, policy_version 559407 (0.00107) [2022-07-10 04:05:06,940][26022] Updated weights on worker 0-0, policy_version 559417 (0.00077) [2022-07-10 04:05:08,501][25689] Fps is (10 sec: 5413.0, 60 sec: 5607.4, 300 sec: 5605.7). Total num frames: 572851200. Throughput: 0: 4974.7. Samples: 572846430. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:08,503][25689] Avg episode reward: [(0, '-27.702')] [2022-07-10 04:05:08,641][26022] Updated weights on worker 0-0, policy_version 559427 (0.00097) [2022-07-10 04:05:10,371][26022] Updated weights on worker 0-0, policy_version 559437 (0.00112) [2022-07-10 04:05:12,530][26022] Updated weights on worker 0-0, policy_version 559447 (0.00091) [2022-07-10 04:05:13,530][25689] Fps is (10 sec: 5806.9, 60 sec: 5616.8, 300 sec: 5616.4). Total num frames: 572880896. Throughput: 0: 5822.6. Samples: 572880286. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:13,530][25689] Avg episode reward: [(0, '-27.502')] [2022-07-10 04:05:13,940][26022] Updated weights on worker 0-0, policy_version 559457 (0.00091) [2022-07-10 04:05:16,033][26022] Updated weights on worker 0-0, policy_version 559467 (0.00086) [2022-07-10 04:05:17,526][26022] Updated weights on worker 0-0, policy_version 559477 (0.00079) [2022-07-10 04:05:18,540][25689] Fps is (10 sec: 5508.0, 60 sec: 5602.2, 300 sec: 5602.7). Total num frames: 572906496. Throughput: 0: 5851.0. Samples: 572914314. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:18,541][25689] Avg episode reward: [(0, '-27.786')] [2022-07-10 04:05:19,552][26022] Updated weights on worker 0-0, policy_version 559487 (0.00095) [2022-07-10 04:05:21,523][26022] Updated weights on worker 0-0, policy_version 559497 (0.00095) [2022-07-10 04:05:23,000][26022] Updated weights on worker 0-0, policy_version 559507 (0.00086) [2022-07-10 04:05:23,543][25689] Fps is (10 sec: 5624.5, 60 sec: 5639.1, 300 sec: 5614.0). Total num frames: 572937216. Throughput: 0: 5100.1. Samples: 572931470. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:23,544][25689] Avg episode reward: [(0, '-27.558')] [2022-07-10 04:05:25,047][26022] Updated weights on worker 0-0, policy_version 559517 (0.00096) [2022-07-10 04:05:26,587][26022] Updated weights on worker 0-0, policy_version 559527 (0.00093) [2022-07-10 04:05:28,561][25689] Fps is (10 sec: 5824.4, 60 sec: 5621.0, 300 sec: 5611.8). Total num frames: 572964864. Throughput: 0: 5930.5. Samples: 572965398. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:28,562][25689] Avg episode reward: [(0, '-27.671')] [2022-07-10 04:05:28,708][26022] Updated weights on worker 0-0, policy_version 559537 (0.00091) [2022-07-10 04:05:30,434][26022] Updated weights on worker 0-0, policy_version 559547 (0.00086) [2022-07-10 04:05:32,338][26022] Updated weights on worker 0-0, policy_version 559557 (0.00082) [2022-07-10 04:05:33,629][25689] Fps is (10 sec: 5685.6, 60 sec: 5637.7, 300 sec: 5614.3). Total num frames: 572994560. Throughput: 0: 5909.4. Samples: 572999060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:33,631][25689] Avg episode reward: [(0, '-27.938')] [2022-07-10 04:05:34,096][26022] Updated weights on worker 0-0, policy_version 559567 (0.00087) [2022-07-10 04:05:35,923][26022] Updated weights on worker 0-0, policy_version 559577 (0.00090) [2022-07-10 04:05:37,685][26022] Updated weights on worker 0-0, policy_version 559587 (0.00095) [2022-07-10 04:05:38,657][25689] Fps is (10 sec: 5578.7, 60 sec: 5603.4, 300 sec: 5610.4). Total num frames: 573021184. Throughput: 0: 5049.4. Samples: 573015892. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 04:05:38,659][25689] Avg episode reward: [(0, '-27.637')] [2022-07-10 04:05:39,395][26022] Updated weights on worker 0-0, policy_version 559597 (0.00088) [2022-07-10 04:05:41,541][26022] Updated weights on worker 0-0, policy_version 559607 (0.00081) [2022-07-10 04:05:43,133][26022] Updated weights on worker 0-0, policy_version 559617 (0.00089) [2022-07-10 04:05:43,671][25689] Fps is (10 sec: 5608.3, 60 sec: 5639.9, 300 sec: 5614.2). Total num frames: 573050880. Throughput: 0: 5882.2. Samples: 573049866. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:05:43,672][25689] Avg episode reward: [(0, '-28.088')] [2022-07-10 04:05:44,923][26022] Updated weights on worker 0-0, policy_version 559627 (0.00081) [2022-07-10 04:05:46,644][26022] Updated weights on worker 0-0, policy_version 559637 (0.00085) [2022-07-10 04:05:48,579][26022] Updated weights on worker 0-0, policy_version 559647 (0.00096) [2022-07-10 04:05:48,727][25689] Fps is (10 sec: 5694.7, 60 sec: 5619.1, 300 sec: 5614.3). Total num frames: 573078528. Throughput: 0: 5883.8. Samples: 573084046. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:05:48,727][25689] Avg episode reward: [(0, '-27.492')] [2022-07-10 04:05:50,466][26022] Updated weights on worker 0-0, policy_version 559657 (0.00096) [2022-07-10 04:05:52,140][26022] Updated weights on worker 0-0, policy_version 559667 (0.00091) [2022-07-10 04:05:53,843][25689] Fps is (10 sec: 5436.2, 60 sec: 5578.8, 300 sec: 5609.4). Total num frames: 573106176. Throughput: 0: 5028.6. Samples: 573100708. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:05:53,843][25689] Avg episode reward: [(0, '-26.752')] [2022-07-10 04:05:54,263][26022] Updated weights on worker 0-0, policy_version 559677 (0.00085) [2022-07-10 04:05:55,757][26022] Updated weights on worker 0-0, policy_version 559687 (0.00089) [2022-07-10 04:05:57,682][26022] Updated weights on worker 0-0, policy_version 559697 (0.00080) [2022-07-10 04:05:57,717][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:05:57,725][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000559698_573130752.pth [2022-07-10 04:05:57,725][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000557723_571108352.pth [2022-07-10 04:05:58,861][25689] Fps is (10 sec: 5658.4, 60 sec: 5612.9, 300 sec: 5609.5). Total num frames: 573135872. Throughput: 0: 5868.1. Samples: 573134450. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:05:58,861][25689] Avg episode reward: [(0, '-25.223')] [2022-07-10 04:05:59,502][26022] Updated weights on worker 0-0, policy_version 559707 (0.00092) [2022-07-10 04:06:01,219][26022] Updated weights on worker 0-0, policy_version 559717 (0.00085) [2022-07-10 04:06:03,478][26022] Updated weights on worker 0-0, policy_version 559727 (0.00089) [2022-07-10 04:06:03,909][25689] Fps is (10 sec: 5594.9, 60 sec: 5625.7, 300 sec: 5612.1). Total num frames: 573162496. Throughput: 0: 5761.6. Samples: 573166470. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:03,910][25689] Avg episode reward: [(0, '-24.946')] [2022-07-10 04:06:05,429][26022] Updated weights on worker 0-0, policy_version 559737 (0.00091) [2022-07-10 04:06:07,084][26022] Updated weights on worker 0-0, policy_version 559747 (0.00088) [2022-07-10 04:06:08,934][25689] Fps is (10 sec: 5286.0, 60 sec: 5591.6, 300 sec: 5607.4). Total num frames: 573189120. Throughput: 0: 4913.2. Samples: 573183330. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:08,935][25689] Avg episode reward: [(0, '-25.194')] [2022-07-10 04:06:09,169][26022] Updated weights on worker 0-0, policy_version 559757 (0.00099) [2022-07-10 04:06:10,557][26022] Updated weights on worker 0-0, policy_version 559767 (0.00092) [2022-07-10 04:06:12,643][26022] Updated weights on worker 0-0, policy_version 559777 (0.00089) [2022-07-10 04:06:13,997][25689] Fps is (10 sec: 5582.9, 60 sec: 5588.5, 300 sec: 5610.1). Total num frames: 573218816. Throughput: 0: 5776.8. Samples: 573217136. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:13,998][25689] Avg episode reward: [(0, '-24.586')] [2022-07-10 04:06:14,417][26022] Updated weights on worker 0-0, policy_version 559787 (0.00095) [2022-07-10 04:06:16,011][26022] Updated weights on worker 0-0, policy_version 559797 (0.00086) [2022-07-10 04:06:18,175][26022] Updated weights on worker 0-0, policy_version 559807 (0.00087) [2022-07-10 04:06:19,007][25689] Fps is (10 sec: 5794.4, 60 sec: 5639.3, 300 sec: 5613.8). Total num frames: 573247488. Throughput: 0: 5792.3. Samples: 573251144. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:19,008][25689] Avg episode reward: [(0, '-26.267')] [2022-07-10 04:06:19,660][26022] Updated weights on worker 0-0, policy_version 559817 (0.00088) [2022-07-10 04:06:21,667][26022] Updated weights on worker 0-0, policy_version 559827 (0.00134) [2022-07-10 04:06:23,196][26022] Updated weights on worker 0-0, policy_version 559837 (0.00095) [2022-07-10 04:06:24,040][25689] Fps is (10 sec: 5608.1, 60 sec: 5585.8, 300 sec: 5609.8). Total num frames: 573275136. Throughput: 0: 5897.5. Samples: 573285190. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:24,040][25689] Avg episode reward: [(0, '-27.154')] [2022-07-10 04:06:25,217][26022] Updated weights on worker 0-0, policy_version 559847 (0.00091) [2022-07-10 04:06:27,142][26022] Updated weights on worker 0-0, policy_version 559857 (0.00088) [2022-07-10 04:06:28,672][26022] Updated weights on worker 0-0, policy_version 559867 (0.00091) [2022-07-10 04:06:29,126][25689] Fps is (10 sec: 5666.9, 60 sec: 5613.3, 300 sec: 5610.6). Total num frames: 573304832. Throughput: 0: 5888.1. Samples: 573302224. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:29,127][25689] Avg episode reward: [(0, '-27.077')] [2022-07-10 04:06:30,525][26022] Updated weights on worker 0-0, policy_version 559877 (0.00089) [2022-07-10 04:06:32,607][26022] Updated weights on worker 0-0, policy_version 559887 (0.00099) [2022-07-10 04:06:34,162][26022] Updated weights on worker 0-0, policy_version 559897 (0.00095) [2022-07-10 04:06:34,234][25689] Fps is (10 sec: 5826.0, 60 sec: 5609.6, 300 sec: 5613.4). Total num frames: 573334528. Throughput: 0: 5889.7. Samples: 573336326. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:34,235][25689] Avg episode reward: [(0, '-27.041')] [2022-07-10 04:06:36,095][26022] Updated weights on worker 0-0, policy_version 559907 (0.00086) [2022-07-10 04:06:37,524][26022] Updated weights on worker 0-0, policy_version 559917 (0.00084) [2022-07-10 04:06:39,241][25689] Fps is (10 sec: 5568.2, 60 sec: 5611.5, 300 sec: 5613.7). Total num frames: 573361152. Throughput: 0: 5906.9. Samples: 573370662. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:39,242][25689] Avg episode reward: [(0, '-28.977')] [2022-07-10 04:06:39,763][26022] Updated weights on worker 0-0, policy_version 559927 (0.00088) [2022-07-10 04:06:41,555][26022] Updated weights on worker 0-0, policy_version 559937 (0.00088) [2022-07-10 04:06:43,262][26022] Updated weights on worker 0-0, policy_version 559947 (0.00092) [2022-07-10 04:06:44,252][25689] Fps is (10 sec: 5622.0, 60 sec: 5611.8, 300 sec: 5614.0). Total num frames: 573390848. Throughput: 0: 5076.1. Samples: 573387788. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:44,252][25689] Avg episode reward: [(0, '-28.221')] [2022-07-10 04:06:45,112][26022] Updated weights on worker 0-0, policy_version 559957 (0.00091) [2022-07-10 04:06:47,080][26022] Updated weights on worker 0-0, policy_version 559967 (0.00091) [2022-07-10 04:06:48,590][26022] Updated weights on worker 0-0, policy_version 559977 (0.00090) [2022-07-10 04:06:49,254][25689] Fps is (10 sec: 5931.8, 60 sec: 5650.6, 300 sec: 5619.5). Total num frames: 573420544. Throughput: 0: 5950.5. Samples: 573421990. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:49,254][25689] Avg episode reward: [(0, '-27.152')] [2022-07-10 04:06:50,579][26022] Updated weights on worker 0-0, policy_version 559987 (0.00082) [2022-07-10 04:06:52,121][26022] Updated weights on worker 0-0, policy_version 559997 (0.00085) [2022-07-10 04:06:54,215][26022] Updated weights on worker 0-0, policy_version 560007 (0.00086) [2022-07-10 04:06:54,388][25689] Fps is (10 sec: 5556.1, 60 sec: 5632.0, 300 sec: 5610.2). Total num frames: 573447168. Throughput: 0: 5934.8. Samples: 573455938. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:54,389][25689] Avg episode reward: [(0, '-26.855')] [2022-07-10 04:06:55,945][26022] Updated weights on worker 0-0, policy_version 560017 (0.00086) [2022-07-10 04:06:57,879][26022] Updated weights on worker 0-0, policy_version 560027 (0.00095) [2022-07-10 04:06:59,412][25689] Fps is (10 sec: 5443.2, 60 sec: 5614.5, 300 sec: 5613.7). Total num frames: 573475840. Throughput: 0: 5066.7. Samples: 573472864. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:06:59,413][25689] Avg episode reward: [(0, '-27.182')] [2022-07-10 04:06:59,620][26022] Updated weights on worker 0-0, policy_version 560037 (0.00102) [2022-07-10 04:07:01,375][26022] Updated weights on worker 0-0, policy_version 560047 (0.00081) [2022-07-10 04:07:03,459][26022] Updated weights on worker 0-0, policy_version 560057 (0.00087) [2022-07-10 04:07:04,471][25689] Fps is (10 sec: 5586.1, 60 sec: 5630.5, 300 sec: 5614.4). Total num frames: 573503488. Throughput: 0: 5792.7. Samples: 573504910. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:04,471][25689] Avg episode reward: [(0, '-27.083')] [2022-07-10 04:07:05,487][26022] Updated weights on worker 0-0, policy_version 560067 (0.00087) [2022-07-10 04:07:07,050][26022] Updated weights on worker 0-0, policy_version 560077 (0.00087) [2022-07-10 04:07:08,951][26022] Updated weights on worker 0-0, policy_version 560087 (0.00082) [2022-07-10 04:07:09,502][25689] Fps is (10 sec: 5480.6, 60 sec: 5646.8, 300 sec: 5608.3). Total num frames: 573531136. Throughput: 0: 5778.1. Samples: 573538986. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:09,502][25689] Avg episode reward: [(0, '-26.242')] [2022-07-10 04:07:10,699][26022] Updated weights on worker 0-0, policy_version 560097 (0.00085) [2022-07-10 04:07:12,644][26022] Updated weights on worker 0-0, policy_version 560107 (0.00086) [2022-07-10 04:07:14,345][26022] Updated weights on worker 0-0, policy_version 560117 (0.00089) [2022-07-10 04:07:14,615][25689] Fps is (10 sec: 5652.9, 60 sec: 5642.2, 300 sec: 5613.3). Total num frames: 573560832. Throughput: 0: 4939.6. Samples: 573555848. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:14,615][25689] Avg episode reward: [(0, '-26.677')] [2022-07-10 04:07:16,140][26022] Updated weights on worker 0-0, policy_version 560127 (0.00088) [2022-07-10 04:07:17,928][26022] Updated weights on worker 0-0, policy_version 560137 (0.00093) [2022-07-10 04:07:19,626][25689] Fps is (10 sec: 5765.0, 60 sec: 5642.1, 300 sec: 5617.6). Total num frames: 573589504. Throughput: 0: 5797.7. Samples: 573590060. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:19,627][25689] Avg episode reward: [(0, '-26.622')] [2022-07-10 04:07:19,882][26022] Updated weights on worker 0-0, policy_version 560147 (0.00087) [2022-07-10 04:07:21,505][26022] Updated weights on worker 0-0, policy_version 560157 (0.00086) [2022-07-10 04:07:23,651][26022] Updated weights on worker 0-0, policy_version 560167 (0.00101) [2022-07-10 04:07:24,682][25689] Fps is (10 sec: 5695.9, 60 sec: 5656.8, 300 sec: 5613.9). Total num frames: 573618176. Throughput: 0: 5883.7. Samples: 573623830. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:24,683][25689] Avg episode reward: [(0, '-27.213')] [2022-07-10 04:07:25,236][26022] Updated weights on worker 0-0, policy_version 560177 (0.00090) [2022-07-10 04:07:27,072][26022] Updated weights on worker 0-0, policy_version 560187 (0.00101) [2022-07-10 04:07:29,067][26022] Updated weights on worker 0-0, policy_version 560197 (0.00086) [2022-07-10 04:07:29,739][25689] Fps is (10 sec: 5569.2, 60 sec: 5625.8, 300 sec: 5612.0). Total num frames: 573645824. Throughput: 0: 5021.5. Samples: 573640608. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:29,741][25689] Avg episode reward: [(0, '-26.558')] [2022-07-10 04:07:30,589][26022] Updated weights on worker 0-0, policy_version 560207 (0.00629) [2022-07-10 04:07:32,433][26022] Updated weights on worker 0-0, policy_version 560217 (0.00088) [2022-07-10 04:07:34,276][26022] Updated weights on worker 0-0, policy_version 560227 (0.00090) [2022-07-10 04:07:34,811][25689] Fps is (10 sec: 5459.4, 60 sec: 5595.3, 300 sec: 5604.8). Total num frames: 573673472. Throughput: 0: 5873.1. Samples: 573674462. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:34,813][25689] Avg episode reward: [(0, '-26.840')] [2022-07-10 04:07:36,139][26022] Updated weights on worker 0-0, policy_version 560237 (0.00093) [2022-07-10 04:07:37,908][26022] Updated weights on worker 0-0, policy_version 560247 (0.00089) [2022-07-10 04:07:39,795][26022] Updated weights on worker 0-0, policy_version 560257 (0.00094) [2022-07-10 04:07:39,891][25689] Fps is (10 sec: 5648.8, 60 sec: 5639.2, 300 sec: 5608.2). Total num frames: 573703168. Throughput: 0: 5854.9. Samples: 573708704. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:39,891][25689] Avg episode reward: [(0, '-25.992')] [2022-07-10 04:07:41,578][26022] Updated weights on worker 0-0, policy_version 560267 (0.00087) [2022-07-10 04:07:43,404][26022] Updated weights on worker 0-0, policy_version 560277 (0.00091) [2022-07-10 04:07:44,900][25689] Fps is (10 sec: 5785.1, 60 sec: 5622.4, 300 sec: 5609.3). Total num frames: 573731840. Throughput: 0: 5039.3. Samples: 573725714. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:44,901][25689] Avg episode reward: [(0, '-25.870')] [2022-07-10 04:07:45,278][26022] Updated weights on worker 0-0, policy_version 560287 (0.00087) [2022-07-10 04:07:47,046][26022] Updated weights on worker 0-0, policy_version 560297 (0.00087) [2022-07-10 04:07:48,811][26022] Updated weights on worker 0-0, policy_version 560307 (0.00094) [2022-07-10 04:07:49,903][25689] Fps is (10 sec: 5625.1, 60 sec: 5588.6, 300 sec: 5608.0). Total num frames: 573759488. Throughput: 0: 5900.3. Samples: 573759580. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:49,904][25689] Avg episode reward: [(0, '-25.267')] [2022-07-10 04:07:50,692][26022] Updated weights on worker 0-0, policy_version 560317 (0.00088) [2022-07-10 04:07:52,503][26022] Updated weights on worker 0-0, policy_version 560327 (0.00088) [2022-07-10 04:07:54,299][26022] Updated weights on worker 0-0, policy_version 560337 (0.00089) [2022-07-10 04:07:55,030][25689] Fps is (10 sec: 5559.8, 60 sec: 5623.1, 300 sec: 5609.7). Total num frames: 573788160. Throughput: 0: 5871.5. Samples: 573793180. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:07:55,035][25689] Avg episode reward: [(0, '-24.570')] [2022-07-10 04:07:56,194][26022] Updated weights on worker 0-0, policy_version 560347 (0.00085) [2022-07-10 04:07:57,819][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:07:57,829][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000560356_573804544.pth [2022-07-10 04:07:57,830][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000558382_571783168.pth [2022-07-10 04:07:57,985][26022] Updated weights on worker 0-0, policy_version 560357 (0.00091) [2022-07-10 04:07:59,854][26022] Updated weights on worker 0-0, policy_version 560367 (0.00083) [2022-07-10 04:08:00,087][25689] Fps is (10 sec: 5831.8, 60 sec: 5653.8, 300 sec: 5626.2). Total num frames: 573818880. Throughput: 0: 5000.1. Samples: 573809688. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:00,087][25689] Avg episode reward: [(0, '-24.488')] [2022-07-10 04:08:02,027][26022] Updated weights on worker 0-0, policy_version 560377 (0.00088) [2022-07-10 04:08:03,627][26022] Updated weights on worker 0-0, policy_version 560387 (0.00093) [2022-07-10 04:08:05,103][25689] Fps is (10 sec: 5286.1, 60 sec: 5573.2, 300 sec: 5605.4). Total num frames: 573841408. Throughput: 0: 5737.9. Samples: 573841638. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:05,104][25689] Avg episode reward: [(0, '-24.895')] [2022-07-10 04:08:05,649][26022] Updated weights on worker 0-0, policy_version 560397 (0.00082) [2022-07-10 04:08:07,223][26022] Updated weights on worker 0-0, policy_version 560407 (0.00081) [2022-07-10 04:08:09,267][26022] Updated weights on worker 0-0, policy_version 560417 (0.00086) [2022-07-10 04:08:10,121][25689] Fps is (10 sec: 5306.5, 60 sec: 5625.1, 300 sec: 5610.2). Total num frames: 573872128. Throughput: 0: 5746.9. Samples: 573875774. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:10,122][25689] Avg episode reward: [(0, '-25.766')] [2022-07-10 04:08:11,101][26022] Updated weights on worker 0-0, policy_version 560427 (0.00089) [2022-07-10 04:08:12,746][26022] Updated weights on worker 0-0, policy_version 560437 (0.00080) [2022-07-10 04:08:14,668][26022] Updated weights on worker 0-0, policy_version 560447 (0.00097) [2022-07-10 04:08:15,176][25689] Fps is (10 sec: 5896.3, 60 sec: 5613.6, 300 sec: 5613.3). Total num frames: 573900800. Throughput: 0: 5788.1. Samples: 573909788. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:15,177][25689] Avg episode reward: [(0, '-26.121')] [2022-07-10 04:08:16,578][26022] Updated weights on worker 0-0, policy_version 560457 (0.00090) [2022-07-10 04:08:18,098][26022] Updated weights on worker 0-0, policy_version 560467 (0.00105) [2022-07-10 04:08:20,220][25689] Fps is (10 sec: 5475.5, 60 sec: 5576.8, 300 sec: 5606.0). Total num frames: 573927424. Throughput: 0: 5816.4. Samples: 573926792. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:20,221][25689] Avg episode reward: [(0, '-26.367')] [2022-07-10 04:08:20,253][26022] Updated weights on worker 0-0, policy_version 560477 (0.00082) [2022-07-10 04:08:21,766][26022] Updated weights on worker 0-0, policy_version 560487 (0.00092) [2022-07-10 04:08:23,702][26022] Updated weights on worker 0-0, policy_version 560497 (0.00083) [2022-07-10 04:08:25,246][25689] Fps is (10 sec: 5694.7, 60 sec: 5613.4, 300 sec: 5612.7). Total num frames: 573958144. Throughput: 0: 5931.0. Samples: 573961104. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:25,246][25689] Avg episode reward: [(0, '-27.020')] [2022-07-10 04:08:25,313][26022] Updated weights on worker 0-0, policy_version 560507 (0.00084) [2022-07-10 04:08:27,204][26022] Updated weights on worker 0-0, policy_version 560517 (0.00589) [2022-07-10 04:08:29,168][26022] Updated weights on worker 0-0, policy_version 560527 (0.00094) [2022-07-10 04:08:30,278][25689] Fps is (10 sec: 5803.3, 60 sec: 5615.7, 300 sec: 5616.5). Total num frames: 573985792. Throughput: 0: 5912.2. Samples: 573994944. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:30,279][25689] Avg episode reward: [(0, '-25.764')] [2022-07-10 04:08:31,060][26022] Updated weights on worker 0-0, policy_version 560537 (0.00091) [2022-07-10 04:08:32,776][26022] Updated weights on worker 0-0, policy_version 560547 (0.00095) [2022-07-10 04:08:34,705][26022] Updated weights on worker 0-0, policy_version 560557 (0.00090) [2022-07-10 04:08:35,341][25689] Fps is (10 sec: 5579.1, 60 sec: 5633.5, 300 sec: 5612.1). Total num frames: 574014464. Throughput: 0: 5066.6. Samples: 574011954. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:35,341][25689] Avg episode reward: [(0, '-25.336')] [2022-07-10 04:08:36,285][26022] Updated weights on worker 0-0, policy_version 560567 (0.00092) [2022-07-10 04:08:38,287][26022] Updated weights on worker 0-0, policy_version 560577 (0.00092) [2022-07-10 04:08:39,852][26022] Updated weights on worker 0-0, policy_version 560587 (0.00089) [2022-07-10 04:08:40,363][25689] Fps is (10 sec: 5787.7, 60 sec: 5638.8, 300 sec: 5612.0). Total num frames: 574044160. Throughput: 0: 5928.3. Samples: 574046204. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2022-07-10 04:08:40,363][25689] Avg episode reward: [(0, '-25.816')] [2022-07-10 04:08:41,796][26022] Updated weights on worker 0-0, policy_version 560597 (0.00090) [2022-07-10 04:08:43,622][26022] Updated weights on worker 0-0, policy_version 560607 (0.00084) [2022-07-10 04:08:45,403][25689] Fps is (10 sec: 5698.9, 60 sec: 5619.0, 300 sec: 5611.4). Total num frames: 574071808. Throughput: 0: 5914.3. Samples: 574080320. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:08:45,404][25689] Avg episode reward: [(0, '-24.476')] [2022-07-10 04:08:45,404][26022] Updated weights on worker 0-0, policy_version 560617 (0.00090) [2022-07-10 04:08:47,191][26022] Updated weights on worker 0-0, policy_version 560627 (0.00095) [2022-07-10 04:08:48,902][26022] Updated weights on worker 0-0, policy_version 560637 (0.00086) [2022-07-10 04:08:50,423][25689] Fps is (10 sec: 5598.2, 60 sec: 5634.3, 300 sec: 5615.4). Total num frames: 574100480. Throughput: 0: 5083.2. Samples: 574097346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:08:50,424][25689] Avg episode reward: [(0, '-25.346')] [2022-07-10 04:08:50,729][26022] Updated weights on worker 0-0, policy_version 560647 (0.00094) [2022-07-10 04:08:52,747][26022] Updated weights on worker 0-0, policy_version 560657 (0.00094) [2022-07-10 04:08:54,353][26022] Updated weights on worker 0-0, policy_version 560667 (0.00093) [2022-07-10 04:08:55,517][25689] Fps is (10 sec: 5669.7, 60 sec: 5637.4, 300 sec: 5614.5). Total num frames: 574129152. Throughput: 0: 5901.6. Samples: 574131028. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:08:55,519][25689] Avg episode reward: [(0, '-25.242')] [2022-07-10 04:08:56,411][26022] Updated weights on worker 0-0, policy_version 560677 (0.00097) [2022-07-10 04:08:57,960][26022] Updated weights on worker 0-0, policy_version 560687 (0.00089) [2022-07-10 04:08:59,930][26022] Updated weights on worker 0-0, policy_version 560697 (0.00092) [2022-07-10 04:09:00,587][25689] Fps is (10 sec: 5541.2, 60 sec: 5585.4, 300 sec: 5616.6). Total num frames: 574156800. Throughput: 0: 5868.1. Samples: 574164882. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:00,588][25689] Avg episode reward: [(0, '-25.684')] [2022-07-10 04:09:02,071][26022] Updated weights on worker 0-0, policy_version 560707 (0.00718) [2022-07-10 04:09:03,659][26022] Updated weights on worker 0-0, policy_version 560717 (0.00083) [2022-07-10 04:09:05,602][25689] Fps is (10 sec: 5381.4, 60 sec: 5653.3, 300 sec: 5616.5). Total num frames: 574183424. Throughput: 0: 4932.5. Samples: 574179952. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:05,603][25689] Avg episode reward: [(0, '-25.739')] [2022-07-10 04:09:05,731][26022] Updated weights on worker 0-0, policy_version 560727 (0.00084) [2022-07-10 04:09:07,368][26022] Updated weights on worker 0-0, policy_version 560737 (0.00090) [2022-07-10 04:09:09,371][26022] Updated weights on worker 0-0, policy_version 560747 (0.00421) [2022-07-10 04:09:10,638][25689] Fps is (10 sec: 5501.7, 60 sec: 5617.8, 300 sec: 5614.8). Total num frames: 574212096. Throughput: 0: 5773.0. Samples: 574214044. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:10,638][25689] Avg episode reward: [(0, '-25.166')] [2022-07-10 04:09:11,194][26022] Updated weights on worker 0-0, policy_version 560757 (0.00085) [2022-07-10 04:09:12,996][26022] Updated weights on worker 0-0, policy_version 560767 (0.00089) [2022-07-10 04:09:14,769][26022] Updated weights on worker 0-0, policy_version 560777 (0.00092) [2022-07-10 04:09:15,696][25689] Fps is (10 sec: 5579.5, 60 sec: 5600.5, 300 sec: 5617.8). Total num frames: 574239744. Throughput: 0: 5805.0. Samples: 574248168. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:15,697][25689] Avg episode reward: [(0, '-24.821')] [2022-07-10 04:09:16,617][26022] Updated weights on worker 0-0, policy_version 560787 (0.00087) [2022-07-10 04:09:18,477][26022] Updated weights on worker 0-0, policy_version 560797 (0.00092) [2022-07-10 04:09:20,189][26022] Updated weights on worker 0-0, policy_version 560807 (0.00085) [2022-07-10 04:09:20,756][25689] Fps is (10 sec: 5667.4, 60 sec: 5649.8, 300 sec: 5620.8). Total num frames: 574269440. Throughput: 0: 4962.2. Samples: 574264964. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:20,757][25689] Avg episode reward: [(0, '-24.933')] [2022-07-10 04:09:22,060][26022] Updated weights on worker 0-0, policy_version 560817 (0.00080) [2022-07-10 04:09:23,651][26022] Updated weights on worker 0-0, policy_version 560827 (0.00086) [2022-07-10 04:09:25,724][26022] Updated weights on worker 0-0, policy_version 560837 (0.00094) [2022-07-10 04:09:25,773][25689] Fps is (10 sec: 5691.0, 60 sec: 5599.9, 300 sec: 5617.2). Total num frames: 574297088. Throughput: 0: 5896.9. Samples: 574298894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:25,773][25689] Avg episode reward: [(0, '-24.659')] [2022-07-10 04:09:27,389][26022] Updated weights on worker 0-0, policy_version 560847 (0.00099) [2022-07-10 04:09:29,340][26022] Updated weights on worker 0-0, policy_version 560857 (0.00091) [2022-07-10 04:09:30,783][25689] Fps is (10 sec: 5719.2, 60 sec: 5635.8, 300 sec: 5621.6). Total num frames: 574326784. Throughput: 0: 5889.2. Samples: 574332680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:30,783][25689] Avg episode reward: [(0, '-23.796')] [2022-07-10 04:09:30,980][26022] Updated weights on worker 0-0, policy_version 560867 (0.00090) [2022-07-10 04:09:33,092][26022] Updated weights on worker 0-0, policy_version 560877 (0.00091) [2022-07-10 04:09:34,761][26022] Updated weights on worker 0-0, policy_version 560887 (0.00085) [2022-07-10 04:09:35,891][25689] Fps is (10 sec: 5566.3, 60 sec: 5597.8, 300 sec: 5613.2). Total num frames: 574353408. Throughput: 0: 5019.6. Samples: 574349534. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:35,891][25689] Avg episode reward: [(0, '-24.887')] [2022-07-10 04:09:36,713][26022] Updated weights on worker 0-0, policy_version 560897 (0.00088) [2022-07-10 04:09:38,254][26022] Updated weights on worker 0-0, policy_version 560907 (0.00092) [2022-07-10 04:09:40,341][26022] Updated weights on worker 0-0, policy_version 560917 (0.00084) [2022-07-10 04:09:40,934][25689] Fps is (10 sec: 5447.2, 60 sec: 5578.9, 300 sec: 5616.6). Total num frames: 574382080. Throughput: 0: 5870.1. Samples: 574383410. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:40,936][25689] Avg episode reward: [(0, '-25.275')] [2022-07-10 04:09:41,972][26022] Updated weights on worker 0-0, policy_version 560927 (0.00086) [2022-07-10 04:09:43,980][26022] Updated weights on worker 0-0, policy_version 560937 (0.00087) [2022-07-10 04:09:45,559][26022] Updated weights on worker 0-0, policy_version 560947 (0.00084) [2022-07-10 04:09:45,953][25689] Fps is (10 sec: 5800.4, 60 sec: 5614.6, 300 sec: 5619.9). Total num frames: 574411776. Throughput: 0: 5868.8. Samples: 574417330. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:45,954][25689] Avg episode reward: [(0, '-25.820')] [2022-07-10 04:09:47,558][26022] Updated weights on worker 0-0, policy_version 560957 (0.00091) [2022-07-10 04:09:49,304][26022] Updated weights on worker 0-0, policy_version 560967 (0.00087) [2022-07-10 04:09:50,991][25689] Fps is (10 sec: 5600.4, 60 sec: 5579.3, 300 sec: 5609.8). Total num frames: 574438400. Throughput: 0: 5023.5. Samples: 574434194. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:50,992][25689] Avg episode reward: [(0, '-25.194')] [2022-07-10 04:09:51,131][26022] Updated weights on worker 0-0, policy_version 560977 (0.00088) [2022-07-10 04:09:52,838][26022] Updated weights on worker 0-0, policy_version 560987 (0.00085) [2022-07-10 04:09:54,711][26022] Updated weights on worker 0-0, policy_version 560997 (0.00508) [2022-07-10 04:09:56,109][25689] Fps is (10 sec: 5646.3, 60 sec: 5610.8, 300 sec: 5618.3). Total num frames: 574469120. Throughput: 0: 5879.0. Samples: 574468400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:09:56,110][25689] Avg episode reward: [(0, '-24.957')] [2022-07-10 04:09:56,331][26022] Updated weights on worker 0-0, policy_version 561007 (0.00089) [2022-07-10 04:09:57,872][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:09:57,886][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000561014_574478336.pth [2022-07-10 04:09:57,886][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000559040_572456960.pth [2022-07-10 04:09:58,462][26022] Updated weights on worker 0-0, policy_version 561017 (0.00091) [2022-07-10 04:09:59,995][26022] Updated weights on worker 0-0, policy_version 561027 (0.00088) [2022-07-10 04:10:01,121][25689] Fps is (10 sec: 5761.4, 60 sec: 5616.1, 300 sec: 5625.0). Total num frames: 574496768. Throughput: 0: 5897.3. Samples: 574502460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:01,127][25689] Avg episode reward: [(0, '-24.122')] [2022-07-10 04:10:02,424][26022] Updated weights on worker 0-0, policy_version 561037 (0.00088) [2022-07-10 04:10:03,947][26022] Updated weights on worker 0-0, policy_version 561047 (0.00085) [2022-07-10 04:10:05,919][26022] Updated weights on worker 0-0, policy_version 561057 (0.00093) [2022-07-10 04:10:06,154][25689] Fps is (10 sec: 5301.2, 60 sec: 5597.6, 300 sec: 5614.5). Total num frames: 574522368. Throughput: 0: 4948.3. Samples: 574517288. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:06,154][25689] Avg episode reward: [(0, '-23.529')] [2022-07-10 04:10:07,583][26022] Updated weights on worker 0-0, policy_version 561067 (0.00091) [2022-07-10 04:10:09,551][26022] Updated weights on worker 0-0, policy_version 561077 (0.00098) [2022-07-10 04:10:11,239][25689] Fps is (10 sec: 5465.4, 60 sec: 5610.0, 300 sec: 5613.4). Total num frames: 574552064. Throughput: 0: 5792.5. Samples: 574551482. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:11,241][25689] Avg episode reward: [(0, '-23.838')] [2022-07-10 04:10:11,250][26022] Updated weights on worker 0-0, policy_version 561087 (0.00085) [2022-07-10 04:10:13,272][26022] Updated weights on worker 0-0, policy_version 561097 (0.00090) [2022-07-10 04:10:14,896][26022] Updated weights on worker 0-0, policy_version 561107 (0.00093) [2022-07-10 04:10:16,373][25689] Fps is (10 sec: 5711.5, 60 sec: 5619.8, 300 sec: 5621.4). Total num frames: 574580736. Throughput: 0: 5767.3. Samples: 574585268. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:16,375][25689] Avg episode reward: [(0, '-23.168')] [2022-07-10 04:10:16,888][26022] Updated weights on worker 0-0, policy_version 561117 (0.00095) [2022-07-10 04:10:18,535][26022] Updated weights on worker 0-0, policy_version 561127 (0.00481) [2022-07-10 04:10:20,408][26022] Updated weights on worker 0-0, policy_version 561137 (0.00090) [2022-07-10 04:10:21,386][25689] Fps is (10 sec: 5651.3, 60 sec: 5607.3, 300 sec: 5614.3). Total num frames: 574609408. Throughput: 0: 4919.4. Samples: 574602150. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:21,386][25689] Avg episode reward: [(0, '-23.272')] [2022-07-10 04:10:22,238][26022] Updated weights on worker 0-0, policy_version 561147 (0.00090) [2022-07-10 04:10:23,988][26022] Updated weights on worker 0-0, policy_version 561157 (0.00088) [2022-07-10 04:10:25,995][26022] Updated weights on worker 0-0, policy_version 561167 (0.00092) [2022-07-10 04:10:26,442][25689] Fps is (10 sec: 5593.4, 60 sec: 5603.6, 300 sec: 5613.6). Total num frames: 574637056. Throughput: 0: 5865.5. Samples: 574636290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:26,444][25689] Avg episode reward: [(0, '-24.417')] [2022-07-10 04:10:27,595][26022] Updated weights on worker 0-0, policy_version 561177 (0.00085) [2022-07-10 04:10:29,582][26022] Updated weights on worker 0-0, policy_version 561187 (0.00094) [2022-07-10 04:10:31,283][26022] Updated weights on worker 0-0, policy_version 561197 (0.00091) [2022-07-10 04:10:31,475][25689] Fps is (10 sec: 5582.5, 60 sec: 5584.7, 300 sec: 5610.9). Total num frames: 574665728. Throughput: 0: 5857.2. Samples: 574670008. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:31,475][25689] Avg episode reward: [(0, '-24.690')] [2022-07-10 04:10:33,043][26022] Updated weights on worker 0-0, policy_version 561207 (0.00087) [2022-07-10 04:10:35,036][26022] Updated weights on worker 0-0, policy_version 561217 (0.00088) [2022-07-10 04:10:36,526][25689] Fps is (10 sec: 5686.5, 60 sec: 5623.6, 300 sec: 5617.3). Total num frames: 574694400. Throughput: 0: 5889.0. Samples: 574703950. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:36,528][25689] Avg episode reward: [(0, '-24.265')] [2022-07-10 04:10:36,779][26022] Updated weights on worker 0-0, policy_version 561227 (0.00092) [2022-07-10 04:10:38,480][26022] Updated weights on worker 0-0, policy_version 561237 (0.00090) [2022-07-10 04:10:40,502][26022] Updated weights on worker 0-0, policy_version 561247 (0.00052) [2022-07-10 04:10:41,534][25689] Fps is (10 sec: 5802.2, 60 sec: 5643.8, 300 sec: 5617.4). Total num frames: 574724096. Throughput: 0: 5898.8. Samples: 574721002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:41,535][25689] Avg episode reward: [(0, '-24.853')] [2022-07-10 04:10:41,918][26022] Updated weights on worker 0-0, policy_version 561257 (0.00096) [2022-07-10 04:10:44,139][26022] Updated weights on worker 0-0, policy_version 561267 (0.00083) [2022-07-10 04:10:45,638][26022] Updated weights on worker 0-0, policy_version 561277 (0.00093) [2022-07-10 04:10:46,585][25689] Fps is (10 sec: 5599.1, 60 sec: 5590.2, 300 sec: 5614.0). Total num frames: 574750720. Throughput: 0: 5901.1. Samples: 574755156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:46,587][25689] Avg episode reward: [(0, '-24.378')] [2022-07-10 04:10:47,688][26022] Updated weights on worker 0-0, policy_version 561287 (0.00099) [2022-07-10 04:10:49,276][26022] Updated weights on worker 0-0, policy_version 561297 (0.00091) [2022-07-10 04:10:51,246][26022] Updated weights on worker 0-0, policy_version 561307 (0.00091) [2022-07-10 04:10:51,602][25689] Fps is (10 sec: 5695.8, 60 sec: 5659.7, 300 sec: 5626.2). Total num frames: 574781440. Throughput: 0: 5917.5. Samples: 574789112. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:51,603][25689] Avg episode reward: [(0, '-25.754')] [2022-07-10 04:10:53,069][26022] Updated weights on worker 0-0, policy_version 561317 (0.00071) [2022-07-10 04:10:54,871][26022] Updated weights on worker 0-0, policy_version 561327 (0.00593) [2022-07-10 04:10:56,627][26022] Updated weights on worker 0-0, policy_version 561337 (0.00086) [2022-07-10 04:10:56,692][25689] Fps is (10 sec: 5775.4, 60 sec: 5611.7, 300 sec: 5618.0). Total num frames: 574809088. Throughput: 0: 5066.4. Samples: 574806116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:10:56,692][25689] Avg episode reward: [(0, '-25.361')] [2022-07-10 04:10:58,530][26022] Updated weights on worker 0-0, policy_version 561347 (0.00089) [2022-07-10 04:11:00,150][26022] Updated weights on worker 0-0, policy_version 561357 (0.00110) [2022-07-10 04:11:01,699][25689] Fps is (10 sec: 5476.3, 60 sec: 5612.1, 300 sec: 5622.2). Total num frames: 574836736. Throughput: 0: 5916.8. Samples: 574840314. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:01,701][25689] Avg episode reward: [(0, '-26.122')] [2022-07-10 04:11:02,683][26022] Updated weights on worker 0-0, policy_version 561367 (0.00088) [2022-07-10 04:11:04,044][26022] Updated weights on worker 0-0, policy_version 561377 (0.00086) [2022-07-10 04:11:06,218][26022] Updated weights on worker 0-0, policy_version 561387 (0.00088) [2022-07-10 04:11:06,722][25689] Fps is (10 sec: 5512.8, 60 sec: 5646.8, 300 sec: 5625.7). Total num frames: 574864384. Throughput: 0: 5807.8. Samples: 574872106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:06,723][25689] Avg episode reward: [(0, '-26.034')] [2022-07-10 04:11:07,776][26022] Updated weights on worker 0-0, policy_version 561397 (0.00080) [2022-07-10 04:11:09,842][26022] Updated weights on worker 0-0, policy_version 561407 (0.00089) [2022-07-10 04:11:11,595][26022] Updated weights on worker 0-0, policy_version 561417 (0.00083) [2022-07-10 04:11:11,743][25689] Fps is (10 sec: 5505.6, 60 sec: 5618.9, 300 sec: 5619.6). Total num frames: 574892032. Throughput: 0: 4970.6. Samples: 574889224. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:11,743][25689] Avg episode reward: [(0, '-26.145')] [2022-07-10 04:11:13,448][26022] Updated weights on worker 0-0, policy_version 561427 (0.00087) [2022-07-10 04:11:15,095][26022] Updated weights on worker 0-0, policy_version 561437 (0.00094) [2022-07-10 04:11:16,910][25689] Fps is (10 sec: 5528.3, 60 sec: 5615.9, 300 sec: 5616.7). Total num frames: 574920704. Throughput: 0: 5782.8. Samples: 574923032. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:16,912][25689] Avg episode reward: [(0, '-25.519')] [2022-07-10 04:11:17,178][26022] Updated weights on worker 0-0, policy_version 561447 (0.00081) [2022-07-10 04:11:18,676][26022] Updated weights on worker 0-0, policy_version 561457 (0.00089) [2022-07-10 04:11:20,621][26022] Updated weights on worker 0-0, policy_version 561467 (0.00087) [2022-07-10 04:11:21,939][25689] Fps is (10 sec: 5724.6, 60 sec: 5631.3, 300 sec: 5623.6). Total num frames: 574950400. Throughput: 0: 5778.7. Samples: 574957272. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:21,939][25689] Avg episode reward: [(0, '-25.523')] [2022-07-10 04:11:22,307][26022] Updated weights on worker 0-0, policy_version 561477 (0.00092) [2022-07-10 04:11:24,229][26022] Updated weights on worker 0-0, policy_version 561487 (0.00091) [2022-07-10 04:11:26,180][26022] Updated weights on worker 0-0, policy_version 561497 (0.00094) [2022-07-10 04:11:27,002][25689] Fps is (10 sec: 5681.7, 60 sec: 5630.6, 300 sec: 5617.2). Total num frames: 574978048. Throughput: 0: 5030.5. Samples: 574974126. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:27,003][25689] Avg episode reward: [(0, '-25.043')] [2022-07-10 04:11:27,999][26022] Updated weights on worker 0-0, policy_version 561507 (0.00082) [2022-07-10 04:11:29,632][26022] Updated weights on worker 0-0, policy_version 561517 (0.00092) [2022-07-10 04:11:31,624][26022] Updated weights on worker 0-0, policy_version 561527 (0.00089) [2022-07-10 04:11:32,059][25689] Fps is (10 sec: 5464.1, 60 sec: 5611.5, 300 sec: 5611.2). Total num frames: 575005696. Throughput: 0: 5828.5. Samples: 575007634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:32,060][25689] Avg episode reward: [(0, '-25.208')] [2022-07-10 04:11:33,220][26022] Updated weights on worker 0-0, policy_version 561537 (0.00086) [2022-07-10 04:11:35,302][26022] Updated weights on worker 0-0, policy_version 561547 (0.00091) [2022-07-10 04:11:37,011][26022] Updated weights on worker 0-0, policy_version 561557 (0.00094) [2022-07-10 04:11:37,111][25689] Fps is (10 sec: 5571.5, 60 sec: 5611.5, 300 sec: 5617.3). Total num frames: 575034368. Throughput: 0: 5858.7. Samples: 575041386. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:37,111][25689] Avg episode reward: [(0, '-25.530')] [2022-07-10 04:11:38,975][26022] Updated weights on worker 0-0, policy_version 561567 (0.00086) [2022-07-10 04:11:40,587][26022] Updated weights on worker 0-0, policy_version 561577 (0.00091) [2022-07-10 04:11:42,113][25689] Fps is (10 sec: 5703.6, 60 sec: 5595.1, 300 sec: 5614.0). Total num frames: 575063040. Throughput: 0: 5004.6. Samples: 575058234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 04:11:42,113][25689] Avg episode reward: [(0, '-25.899')] [2022-07-10 04:11:42,323][26022] Updated weights on worker 0-0, policy_version 561587 (0.00085) [2022-07-10 04:11:44,221][26022] Updated weights on worker 0-0, policy_version 561597 (0.00082) [2022-07-10 04:11:46,127][26022] Updated weights on worker 0-0, policy_version 561607 (0.00089) [2022-07-10 04:11:47,116][25689] Fps is (10 sec: 5629.0, 60 sec: 5616.4, 300 sec: 5607.1). Total num frames: 575090688. Throughput: 0: 5884.8. Samples: 575092492. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:11:47,117][25689] Avg episode reward: [(0, '-25.977')] [2022-07-10 04:11:47,710][26022] Updated weights on worker 0-0, policy_version 561617 (0.00079) [2022-07-10 04:11:49,723][26022] Updated weights on worker 0-0, policy_version 561627 (0.00088) [2022-07-10 04:11:51,402][26022] Updated weights on worker 0-0, policy_version 561637 (0.00090) [2022-07-10 04:11:52,137][25689] Fps is (10 sec: 5618.4, 60 sec: 5582.2, 300 sec: 5616.1). Total num frames: 575119360. Throughput: 0: 5915.5. Samples: 575126406. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:11:52,137][25689] Avg episode reward: [(0, '-26.003')] [2022-07-10 04:11:53,278][26022] Updated weights on worker 0-0, policy_version 561647 (0.00089) [2022-07-10 04:11:54,971][26022] Updated weights on worker 0-0, policy_version 561657 (0.00084) [2022-07-10 04:11:56,878][26022] Updated weights on worker 0-0, policy_version 561667 (0.00095) [2022-07-10 04:11:57,247][25689] Fps is (10 sec: 5660.5, 60 sec: 5597.3, 300 sec: 5614.5). Total num frames: 575148032. Throughput: 0: 5068.8. Samples: 575143450. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:11:57,247][25689] Avg episode reward: [(0, '-26.642')] [2022-07-10 04:11:58,304][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:11:58,320][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000561674_575154176.pth [2022-07-10 04:11:58,321][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000559698_573130752.pth [2022-07-10 04:11:58,738][26022] Updated weights on worker 0-0, policy_version 561677 (0.00091) [2022-07-10 04:12:00,496][26022] Updated weights on worker 0-0, policy_version 561687 (0.00085) [2022-07-10 04:12:02,324][25689] Fps is (10 sec: 5428.1, 60 sec: 5574.0, 300 sec: 5610.7). Total num frames: 575174656. Throughput: 0: 5901.7. Samples: 575177514. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:02,324][25689] Avg episode reward: [(0, '-26.614')] [2022-07-10 04:12:02,903][26022] Updated weights on worker 0-0, policy_version 561697 (0.00391) [2022-07-10 04:12:04,430][26022] Updated weights on worker 0-0, policy_version 561707 (0.00089) [2022-07-10 04:12:06,374][26022] Updated weights on worker 0-0, policy_version 561717 (0.00073) [2022-07-10 04:12:07,370][25689] Fps is (10 sec: 5563.4, 60 sec: 5605.6, 300 sec: 5617.3). Total num frames: 575204352. Throughput: 0: 5769.7. Samples: 575209350. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:07,371][25689] Avg episode reward: [(0, '-26.060')] [2022-07-10 04:12:07,867][26022] Updated weights on worker 0-0, policy_version 561727 (0.00095) [2022-07-10 04:12:09,974][26022] Updated weights on worker 0-0, policy_version 561737 (0.00796) [2022-07-10 04:12:11,700][26022] Updated weights on worker 0-0, policy_version 561747 (0.00088) [2022-07-10 04:12:12,443][25689] Fps is (10 sec: 5667.0, 60 sec: 5600.8, 300 sec: 5611.2). Total num frames: 575232000. Throughput: 0: 4925.7. Samples: 575226426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:12,443][25689] Avg episode reward: [(0, '-24.984')] [2022-07-10 04:12:13,604][26022] Updated weights on worker 0-0, policy_version 561757 (0.00089) [2022-07-10 04:12:15,285][26022] Updated weights on worker 0-0, policy_version 561767 (0.00089) [2022-07-10 04:12:17,267][26022] Updated weights on worker 0-0, policy_version 561777 (0.00088) [2022-07-10 04:12:17,479][25689] Fps is (10 sec: 5469.9, 60 sec: 5595.9, 300 sec: 5607.3). Total num frames: 575259648. Throughput: 0: 5775.6. Samples: 575260304. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:17,480][25689] Avg episode reward: [(0, '-25.701')] [2022-07-10 04:12:19,052][26022] Updated weights on worker 0-0, policy_version 561787 (0.00096) [2022-07-10 04:12:20,828][26022] Updated weights on worker 0-0, policy_version 561797 (0.00092) [2022-07-10 04:12:22,484][25689] Fps is (10 sec: 5711.0, 60 sec: 5598.2, 300 sec: 5611.7). Total num frames: 575289344. Throughput: 0: 5802.5. Samples: 575294492. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:22,484][25689] Avg episode reward: [(0, '-25.014')] [2022-07-10 04:12:22,606][26022] Updated weights on worker 0-0, policy_version 561807 (0.00089) [2022-07-10 04:12:24,416][26022] Updated weights on worker 0-0, policy_version 561817 (0.00080) [2022-07-10 04:12:26,300][26022] Updated weights on worker 0-0, policy_version 561827 (0.00089) [2022-07-10 04:12:27,489][25689] Fps is (10 sec: 5728.8, 60 sec: 5603.6, 300 sec: 5612.7). Total num frames: 575316992. Throughput: 0: 5086.7. Samples: 575311690. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:27,490][25689] Avg episode reward: [(0, '-25.037')] [2022-07-10 04:12:27,991][26022] Updated weights on worker 0-0, policy_version 561837 (0.00095) [2022-07-10 04:12:29,837][26022] Updated weights on worker 0-0, policy_version 561847 (0.00093) [2022-07-10 04:12:31,432][26022] Updated weights on worker 0-0, policy_version 561857 (0.00085) [2022-07-10 04:12:32,496][25689] Fps is (10 sec: 5727.4, 60 sec: 5642.1, 300 sec: 5620.8). Total num frames: 575346688. Throughput: 0: 5955.9. Samples: 575345862. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:32,497][25689] Avg episode reward: [(0, '-25.128')] [2022-07-10 04:12:33,549][26022] Updated weights on worker 0-0, policy_version 561867 (0.00102) [2022-07-10 04:12:35,343][26022] Updated weights on worker 0-0, policy_version 561877 (0.00081) [2022-07-10 04:12:37,179][26022] Updated weights on worker 0-0, policy_version 561887 (0.00103) [2022-07-10 04:12:37,547][25689] Fps is (10 sec: 5599.6, 60 sec: 5608.3, 300 sec: 5611.0). Total num frames: 575373312. Throughput: 0: 5922.5. Samples: 575379154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:37,548][25689] Avg episode reward: [(0, '-25.058')] [2022-07-10 04:12:38,865][26022] Updated weights on worker 0-0, policy_version 561897 (0.00083) [2022-07-10 04:12:40,764][26022] Updated weights on worker 0-0, policy_version 561907 (0.00103) [2022-07-10 04:12:42,536][26022] Updated weights on worker 0-0, policy_version 561917 (0.00090) [2022-07-10 04:12:42,595][25689] Fps is (10 sec: 5678.1, 60 sec: 5637.9, 300 sec: 5617.1). Total num frames: 575404032. Throughput: 0: 5066.1. Samples: 575396378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:42,596][25689] Avg episode reward: [(0, '-25.247')] [2022-07-10 04:12:44,457][26022] Updated weights on worker 0-0, policy_version 561927 (0.00085) [2022-07-10 04:12:46,192][26022] Updated weights on worker 0-0, policy_version 561937 (0.00085) [2022-07-10 04:12:47,663][25689] Fps is (10 sec: 5871.2, 60 sec: 5648.8, 300 sec: 5619.3). Total num frames: 575432704. Throughput: 0: 5894.5. Samples: 575430604. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:47,665][25689] Avg episode reward: [(0, '-25.222')] [2022-07-10 04:12:47,784][26022] Updated weights on worker 0-0, policy_version 561947 (0.00083) [2022-07-10 04:12:49,828][26022] Updated weights on worker 0-0, policy_version 561957 (0.00086) [2022-07-10 04:12:51,497][26022] Updated weights on worker 0-0, policy_version 561967 (0.00081) [2022-07-10 04:12:52,716][25689] Fps is (10 sec: 5564.8, 60 sec: 5628.9, 300 sec: 5617.3). Total num frames: 575460352. Throughput: 0: 5890.6. Samples: 575464970. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:52,718][25689] Avg episode reward: [(0, '-24.624')] [2022-07-10 04:12:53,193][26022] Updated weights on worker 0-0, policy_version 561977 (0.00095) [2022-07-10 04:12:55,362][26022] Updated weights on worker 0-0, policy_version 561987 (0.00093) [2022-07-10 04:12:56,760][26022] Updated weights on worker 0-0, policy_version 561997 (0.00089) [2022-07-10 04:12:57,787][25689] Fps is (10 sec: 5663.8, 60 sec: 5649.4, 300 sec: 5613.6). Total num frames: 575490048. Throughput: 0: 5924.9. Samples: 575499076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:12:57,788][25689] Avg episode reward: [(0, '-24.928')] [2022-07-10 04:12:58,783][26022] Updated weights on worker 0-0, policy_version 562007 (0.00086) [2022-07-10 04:13:00,526][26022] Updated weights on worker 0-0, policy_version 562017 (0.00085) [2022-07-10 04:13:02,687][26022] Updated weights on worker 0-0, policy_version 562027 (0.00080) [2022-07-10 04:13:02,802][25689] Fps is (10 sec: 5583.8, 60 sec: 5655.2, 300 sec: 5627.4). Total num frames: 575516672. Throughput: 0: 5932.2. Samples: 575516252. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:02,803][25689] Avg episode reward: [(0, '-24.566')] [2022-07-10 04:13:04,727][26022] Updated weights on worker 0-0, policy_version 562037 (0.00078) [2022-07-10 04:13:06,210][26022] Updated weights on worker 0-0, policy_version 562047 (0.00098) [2022-07-10 04:13:07,823][25689] Fps is (10 sec: 5305.7, 60 sec: 5606.7, 300 sec: 5613.6). Total num frames: 575543296. Throughput: 0: 5825.3. Samples: 575548046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:07,824][25689] Avg episode reward: [(0, '-24.510')] [2022-07-10 04:13:08,291][26022] Updated weights on worker 0-0, policy_version 562057 (0.00093) [2022-07-10 04:13:09,991][26022] Updated weights on worker 0-0, policy_version 562067 (0.00096) [2022-07-10 04:13:11,945][26022] Updated weights on worker 0-0, policy_version 562077 (0.00088) [2022-07-10 04:13:12,835][25689] Fps is (10 sec: 5511.9, 60 sec: 5629.4, 300 sec: 5614.4). Total num frames: 575571968. Throughput: 0: 5799.7. Samples: 575581652. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:12,835][25689] Avg episode reward: [(0, '-23.476')] [2022-07-10 04:13:13,714][26022] Updated weights on worker 0-0, policy_version 562087 (0.00085) [2022-07-10 04:13:15,540][26022] Updated weights on worker 0-0, policy_version 562097 (0.00103) [2022-07-10 04:13:17,002][26022] Updated weights on worker 0-0, policy_version 562107 (0.00085) [2022-07-10 04:13:17,899][25689] Fps is (10 sec: 5691.5, 60 sec: 5643.7, 300 sec: 5620.9). Total num frames: 575600640. Throughput: 0: 4958.4. Samples: 575598796. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:17,899][25689] Avg episode reward: [(0, '-23.609')] [2022-07-10 04:13:19,091][26022] Updated weights on worker 0-0, policy_version 562117 (0.00084) [2022-07-10 04:13:20,725][26022] Updated weights on worker 0-0, policy_version 562127 (0.00630) [2022-07-10 04:13:22,701][26022] Updated weights on worker 0-0, policy_version 562137 (0.00094) [2022-07-10 04:13:22,962][25689] Fps is (10 sec: 5763.1, 60 sec: 5638.2, 300 sec: 5616.7). Total num frames: 575630336. Throughput: 0: 5791.9. Samples: 575633016. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:22,963][25689] Avg episode reward: [(0, '-23.853')] [2022-07-10 04:13:24,615][26022] Updated weights on worker 0-0, policy_version 562147 (0.00567) [2022-07-10 04:13:26,427][26022] Updated weights on worker 0-0, policy_version 562157 (0.00075) [2022-07-10 04:13:27,978][25689] Fps is (10 sec: 5689.5, 60 sec: 5637.3, 300 sec: 5617.0). Total num frames: 575657984. Throughput: 0: 5866.9. Samples: 575666290. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:27,978][25689] Avg episode reward: [(0, '-23.335')] [2022-07-10 04:13:28,215][26022] Updated weights on worker 0-0, policy_version 562167 (0.00090) [2022-07-10 04:13:29,939][26022] Updated weights on worker 0-0, policy_version 562177 (0.00088) [2022-07-10 04:13:31,776][26022] Updated weights on worker 0-0, policy_version 562187 (0.00091) [2022-07-10 04:13:33,010][25689] Fps is (10 sec: 5503.6, 60 sec: 5601.1, 300 sec: 5614.2). Total num frames: 575685632. Throughput: 0: 5039.6. Samples: 575683326. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:33,010][25689] Avg episode reward: [(0, '-23.716')] [2022-07-10 04:13:33,610][26022] Updated weights on worker 0-0, policy_version 562197 (0.00085) [2022-07-10 04:13:35,419][26022] Updated weights on worker 0-0, policy_version 562207 (0.00091) [2022-07-10 04:13:37,216][26022] Updated weights on worker 0-0, policy_version 562217 (0.00093) [2022-07-10 04:13:38,066][25689] Fps is (10 sec: 5582.8, 60 sec: 5634.5, 300 sec: 5610.1). Total num frames: 575714304. Throughput: 0: 5868.0. Samples: 575717136. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:38,066][25689] Avg episode reward: [(0, '-24.900')] [2022-07-10 04:13:39,039][26022] Updated weights on worker 0-0, policy_version 562227 (0.00085) [2022-07-10 04:13:40,919][26022] Updated weights on worker 0-0, policy_version 562237 (0.00092) [2022-07-10 04:13:42,578][26022] Updated weights on worker 0-0, policy_version 562247 (0.00092) [2022-07-10 04:13:43,082][25689] Fps is (10 sec: 5693.2, 60 sec: 5603.6, 300 sec: 5614.0). Total num frames: 575742976. Throughput: 0: 5882.1. Samples: 575751362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:43,083][25689] Avg episode reward: [(0, '-25.504')] [2022-07-10 04:13:44,394][26022] Updated weights on worker 0-0, policy_version 562257 (0.00085) [2022-07-10 04:13:46,312][26022] Updated weights on worker 0-0, policy_version 562267 (0.00095) [2022-07-10 04:13:48,119][25689] Fps is (10 sec: 5602.5, 60 sec: 5589.5, 300 sec: 5610.2). Total num frames: 575770624. Throughput: 0: 5065.9. Samples: 575768322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:48,119][25689] Avg episode reward: [(0, '-25.328')] [2022-07-10 04:13:48,167][26022] Updated weights on worker 0-0, policy_version 562277 (0.00094) [2022-07-10 04:13:49,921][26022] Updated weights on worker 0-0, policy_version 562287 (0.00095) [2022-07-10 04:13:51,927][26022] Updated weights on worker 0-0, policy_version 562297 (0.00099) [2022-07-10 04:13:53,121][25689] Fps is (10 sec: 5610.3, 60 sec: 5611.2, 300 sec: 5612.0). Total num frames: 575799296. Throughput: 0: 5892.7. Samples: 575801834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:53,121][25689] Avg episode reward: [(0, '-26.173')] [2022-07-10 04:13:53,607][26022] Updated weights on worker 0-0, policy_version 562307 (0.00095) [2022-07-10 04:13:55,443][26022] Updated weights on worker 0-0, policy_version 562317 (0.00095) [2022-07-10 04:13:57,308][26022] Updated weights on worker 0-0, policy_version 562327 (0.00083) [2022-07-10 04:13:58,182][25689] Fps is (10 sec: 5596.4, 60 sec: 5578.2, 300 sec: 5612.1). Total num frames: 575826944. Throughput: 0: 5874.9. Samples: 575835318. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:13:58,183][25689] Avg episode reward: [(0, '-25.805')] [2022-07-10 04:13:58,492][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:13:58,504][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000562333_575828992.pth [2022-07-10 04:13:58,504][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000560356_573804544.pth [2022-07-10 04:13:58,984][26022] Updated weights on worker 0-0, policy_version 562337 (0.00092) [2022-07-10 04:14:00,813][26022] Updated weights on worker 0-0, policy_version 562347 (0.00089) [2022-07-10 04:14:02,991][26022] Updated weights on worker 0-0, policy_version 562357 (0.00089) [2022-07-10 04:14:03,186][25689] Fps is (10 sec: 5494.0, 60 sec: 5596.3, 300 sec: 5615.8). Total num frames: 575854592. Throughput: 0: 5025.5. Samples: 575852392. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:03,188][25689] Avg episode reward: [(0, '-24.709')] [2022-07-10 04:14:04,979][26022] Updated weights on worker 0-0, policy_version 562367 (0.00092) [2022-07-10 04:14:06,796][26022] Updated weights on worker 0-0, policy_version 562377 (0.00088) [2022-07-10 04:14:08,209][25689] Fps is (10 sec: 5412.7, 60 sec: 5596.0, 300 sec: 5609.1). Total num frames: 575881216. Throughput: 0: 5747.6. Samples: 575883794. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:08,210][25689] Avg episode reward: [(0, '-24.153')] [2022-07-10 04:14:08,561][26022] Updated weights on worker 0-0, policy_version 562387 (0.00083) [2022-07-10 04:14:10,448][26022] Updated weights on worker 0-0, policy_version 562397 (0.00089) [2022-07-10 04:14:12,155][26022] Updated weights on worker 0-0, policy_version 562407 (0.00086) [2022-07-10 04:14:13,246][25689] Fps is (10 sec: 5394.4, 60 sec: 5576.7, 300 sec: 5609.5). Total num frames: 575908864. Throughput: 0: 5757.5. Samples: 575917708. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:13,247][25689] Avg episode reward: [(0, '-24.699')] [2022-07-10 04:14:13,987][26022] Updated weights on worker 0-0, policy_version 562417 (0.00093) [2022-07-10 04:14:16,058][26022] Updated weights on worker 0-0, policy_version 562427 (0.00095) [2022-07-10 04:14:17,619][26022] Updated weights on worker 0-0, policy_version 562437 (0.00095) [2022-07-10 04:14:18,356][25689] Fps is (10 sec: 5752.3, 60 sec: 5606.4, 300 sec: 5612.0). Total num frames: 575939584. Throughput: 0: 4924.1. Samples: 575934656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:18,357][25689] Avg episode reward: [(0, '-24.433')] [2022-07-10 04:14:19,490][26022] Updated weights on worker 0-0, policy_version 562447 (0.00088) [2022-07-10 04:14:21,299][26022] Updated weights on worker 0-0, policy_version 562457 (0.00095) [2022-07-10 04:14:23,337][26022] Updated weights on worker 0-0, policy_version 562467 (0.00095) [2022-07-10 04:14:23,372][25689] Fps is (10 sec: 5663.5, 60 sec: 5560.0, 300 sec: 5608.6). Total num frames: 575966208. Throughput: 0: 5766.0. Samples: 575968784. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:23,372][25689] Avg episode reward: [(0, '-24.237')] [2022-07-10 04:14:24,839][26022] Updated weights on worker 0-0, policy_version 562477 (0.00089) [2022-07-10 04:14:26,824][26022] Updated weights on worker 0-0, policy_version 562487 (0.00092) [2022-07-10 04:14:28,390][25689] Fps is (10 sec: 5612.7, 60 sec: 5593.5, 300 sec: 5608.5). Total num frames: 575995904. Throughput: 0: 5906.9. Samples: 576003004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:28,391][25689] Avg episode reward: [(0, '-24.468')] [2022-07-10 04:14:28,471][26022] Updated weights on worker 0-0, policy_version 562497 (0.00091) [2022-07-10 04:14:30,378][26022] Updated weights on worker 0-0, policy_version 562507 (0.00088) [2022-07-10 04:14:32,083][26022] Updated weights on worker 0-0, policy_version 562517 (0.00085) [2022-07-10 04:14:33,405][25689] Fps is (10 sec: 5817.4, 60 sec: 5612.1, 300 sec: 5617.1). Total num frames: 576024576. Throughput: 0: 5074.9. Samples: 576020010. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:33,405][25689] Avg episode reward: [(0, '-25.641')] [2022-07-10 04:14:33,975][26022] Updated weights on worker 0-0, policy_version 562527 (0.00083) [2022-07-10 04:14:35,667][26022] Updated weights on worker 0-0, policy_version 562537 (0.00085) [2022-07-10 04:14:37,541][26022] Updated weights on worker 0-0, policy_version 562547 (0.00057) [2022-07-10 04:14:38,506][25689] Fps is (10 sec: 5668.7, 60 sec: 5607.9, 300 sec: 5616.0). Total num frames: 576053248. Throughput: 0: 5916.9. Samples: 576053884. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:38,507][25689] Avg episode reward: [(0, '-25.187')] [2022-07-10 04:14:39,379][26022] Updated weights on worker 0-0, policy_version 562557 (0.00098) [2022-07-10 04:14:41,139][26022] Updated weights on worker 0-0, policy_version 562567 (0.00082) [2022-07-10 04:14:42,785][26022] Updated weights on worker 0-0, policy_version 562577 (0.00088) [2022-07-10 04:14:43,559][25689] Fps is (10 sec: 5748.4, 60 sec: 5621.5, 300 sec: 5615.4). Total num frames: 576082944. Throughput: 0: 5920.6. Samples: 576088304. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 04:14:43,559][25689] Avg episode reward: [(0, '-24.235')] [2022-07-10 04:14:44,915][26022] Updated weights on worker 0-0, policy_version 562587 (0.00084) [2022-07-10 04:14:46,379][26022] Updated weights on worker 0-0, policy_version 562597 (0.00086) [2022-07-10 04:14:48,540][26022] Updated weights on worker 0-0, policy_version 562607 (0.00087) [2022-07-10 04:14:48,573][25689] Fps is (10 sec: 5594.5, 60 sec: 5606.6, 300 sec: 5615.8). Total num frames: 576109568. Throughput: 0: 5067.5. Samples: 576105280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:14:48,574][25689] Avg episode reward: [(0, '-23.438')] [2022-07-10 04:14:50,022][26022] Updated weights on worker 0-0, policy_version 562617 (0.00091) [2022-07-10 04:14:51,999][26022] Updated weights on worker 0-0, policy_version 562627 (0.00088) [2022-07-10 04:14:53,586][25689] Fps is (10 sec: 5514.3, 60 sec: 5605.6, 300 sec: 5610.9). Total num frames: 576138240. Throughput: 0: 5913.9. Samples: 576139362. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:14:53,588][25689] Avg episode reward: [(0, '-23.782')] [2022-07-10 04:14:53,790][26022] Updated weights on worker 0-0, policy_version 562637 (0.00088) [2022-07-10 04:14:55,706][26022] Updated weights on worker 0-0, policy_version 562647 (0.00090) [2022-07-10 04:14:57,420][26022] Updated weights on worker 0-0, policy_version 562657 (0.00362) [2022-07-10 04:14:58,695][25689] Fps is (10 sec: 5867.6, 60 sec: 5651.9, 300 sec: 5619.4). Total num frames: 576168960. Throughput: 0: 5923.5. Samples: 576173474. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:14:58,697][25689] Avg episode reward: [(0, '-23.575')] [2022-07-10 04:14:59,587][26022] Updated weights on worker 0-0, policy_version 562667 (0.00098) [2022-07-10 04:15:00,857][26022] Updated weights on worker 0-0, policy_version 562677 (0.00095) [2022-07-10 04:15:03,631][26022] Updated weights on worker 0-0, policy_version 562687 (0.00091) [2022-07-10 04:15:03,762][25689] Fps is (10 sec: 5333.7, 60 sec: 5578.4, 300 sec: 5611.9). Total num frames: 576192512. Throughput: 0: 5043.6. Samples: 576190200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:03,762][25689] Avg episode reward: [(0, '-23.538')] [2022-07-10 04:15:04,893][26022] Updated weights on worker 0-0, policy_version 562697 (0.00088) [2022-07-10 04:15:07,029][26022] Updated weights on worker 0-0, policy_version 562707 (0.00260) [2022-07-10 04:15:08,687][26022] Updated weights on worker 0-0, policy_version 562717 (0.00090) [2022-07-10 04:15:08,778][25689] Fps is (10 sec: 5280.9, 60 sec: 5629.8, 300 sec: 5613.2). Total num frames: 576222208. Throughput: 0: 5766.4. Samples: 576221790. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:08,779][25689] Avg episode reward: [(0, '-24.893')] [2022-07-10 04:15:10,527][26022] Updated weights on worker 0-0, policy_version 562727 (0.00087) [2022-07-10 04:15:12,447][26022] Updated weights on worker 0-0, policy_version 562737 (0.00084) [2022-07-10 04:15:13,794][25689] Fps is (10 sec: 5818.0, 60 sec: 5648.7, 300 sec: 5615.4). Total num frames: 576250880. Throughput: 0: 5763.0. Samples: 576255818. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:13,796][25689] Avg episode reward: [(0, '-25.343')] [2022-07-10 04:15:14,111][26022] Updated weights on worker 0-0, policy_version 562747 (0.00088) [2022-07-10 04:15:16,061][26022] Updated weights on worker 0-0, policy_version 562757 (0.00086) [2022-07-10 04:15:17,862][26022] Updated weights on worker 0-0, policy_version 562767 (0.00084) [2022-07-10 04:15:18,837][25689] Fps is (10 sec: 5599.3, 60 sec: 5604.2, 300 sec: 5611.4). Total num frames: 576278528. Throughput: 0: 4912.0. Samples: 576272410. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:18,837][25689] Avg episode reward: [(0, '-25.041')] [2022-07-10 04:15:19,671][26022] Updated weights on worker 0-0, policy_version 562777 (0.00088) [2022-07-10 04:15:21,553][26022] Updated weights on worker 0-0, policy_version 562787 (0.00088) [2022-07-10 04:15:23,371][26022] Updated weights on worker 0-0, policy_version 562797 (0.00095) [2022-07-10 04:15:23,844][25689] Fps is (10 sec: 5603.7, 60 sec: 5638.7, 300 sec: 5615.8). Total num frames: 576307200. Throughput: 0: 5794.8. Samples: 576306576. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:23,845][25689] Avg episode reward: [(0, '-24.486')] [2022-07-10 04:15:24,858][26022] Updated weights on worker 0-0, policy_version 562807 (0.00083) [2022-07-10 04:15:27,089][26022] Updated weights on worker 0-0, policy_version 562817 (0.00097) [2022-07-10 04:15:28,665][26022] Updated weights on worker 0-0, policy_version 562827 (0.00087) [2022-07-10 04:15:28,871][25689] Fps is (10 sec: 5715.1, 60 sec: 5621.1, 300 sec: 5615.9). Total num frames: 576335872. Throughput: 0: 5906.1. Samples: 576340456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:28,871][25689] Avg episode reward: [(0, '-25.046')] [2022-07-10 04:15:30,563][26022] Updated weights on worker 0-0, policy_version 562837 (0.00091) [2022-07-10 04:15:32,286][26022] Updated weights on worker 0-0, policy_version 562847 (0.00066) [2022-07-10 04:15:33,901][25689] Fps is (10 sec: 5702.3, 60 sec: 5619.7, 300 sec: 5616.3). Total num frames: 576364544. Throughput: 0: 5054.8. Samples: 576357454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:33,901][25689] Avg episode reward: [(0, '-23.532')] [2022-07-10 04:15:34,135][26022] Updated weights on worker 0-0, policy_version 562857 (0.00092) [2022-07-10 04:15:36,145][26022] Updated weights on worker 0-0, policy_version 562867 (0.00414) [2022-07-10 04:15:37,800][26022] Updated weights on worker 0-0, policy_version 562877 (0.00088) [2022-07-10 04:15:38,958][25689] Fps is (10 sec: 5481.9, 60 sec: 5590.0, 300 sec: 5605.0). Total num frames: 576391168. Throughput: 0: 5908.4. Samples: 576391292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:38,958][25689] Avg episode reward: [(0, '-23.801')] [2022-07-10 04:15:39,541][26022] Updated weights on worker 0-0, policy_version 562887 (0.00082) [2022-07-10 04:15:41,270][26022] Updated weights on worker 0-0, policy_version 562897 (0.00092) [2022-07-10 04:15:43,331][26022] Updated weights on worker 0-0, policy_version 562907 (0.00087) [2022-07-10 04:15:43,974][25689] Fps is (10 sec: 5692.9, 60 sec: 5610.3, 300 sec: 5619.5). Total num frames: 576421888. Throughput: 0: 5902.7. Samples: 576425392. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:43,974][25689] Avg episode reward: [(0, '-24.956')] [2022-07-10 04:15:45,127][26022] Updated weights on worker 0-0, policy_version 562917 (0.00084) [2022-07-10 04:15:46,844][26022] Updated weights on worker 0-0, policy_version 562927 (0.00091) [2022-07-10 04:15:48,615][26022] Updated weights on worker 0-0, policy_version 562937 (0.00085) [2022-07-10 04:15:48,981][25689] Fps is (10 sec: 5822.9, 60 sec: 5627.9, 300 sec: 5609.3). Total num frames: 576449536. Throughput: 0: 5074.8. Samples: 576442514. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:48,982][25689] Avg episode reward: [(0, '-25.840')] [2022-07-10 04:15:50,368][26022] Updated weights on worker 0-0, policy_version 562947 (0.00085) [2022-07-10 04:15:52,209][26022] Updated weights on worker 0-0, policy_version 562957 (0.00084) [2022-07-10 04:15:53,992][25689] Fps is (10 sec: 5519.2, 60 sec: 5611.1, 300 sec: 5610.8). Total num frames: 576477184. Throughput: 0: 5938.8. Samples: 576476774. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:53,993][25689] Avg episode reward: [(0, '-26.362')] [2022-07-10 04:15:54,000][26022] Updated weights on worker 0-0, policy_version 562967 (0.00087) [2022-07-10 04:15:55,763][26022] Updated weights on worker 0-0, policy_version 562977 (0.00089) [2022-07-10 04:15:57,703][26022] Updated weights on worker 0-0, policy_version 562987 (0.00088) [2022-07-10 04:15:58,584][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:15:58,598][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000562993_576504832.pth [2022-07-10 04:15:58,599][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000561014_574478336.pth [2022-07-10 04:15:59,029][25689] Fps is (10 sec: 5605.1, 60 sec: 5583.8, 300 sec: 5613.7). Total num frames: 576505856. Throughput: 0: 5951.7. Samples: 576510754. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:15:59,030][25689] Avg episode reward: [(0, '-26.263')] [2022-07-10 04:15:59,540][26022] Updated weights on worker 0-0, policy_version 562997 (0.00086) [2022-07-10 04:16:01,275][26022] Updated weights on worker 0-0, policy_version 563007 (0.00084) [2022-07-10 04:16:03,380][26022] Updated weights on worker 0-0, policy_version 563017 (0.00092) [2022-07-10 04:16:04,032][25689] Fps is (10 sec: 5405.9, 60 sec: 5623.7, 300 sec: 5607.2). Total num frames: 576531456. Throughput: 0: 5098.4. Samples: 576527658. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:04,033][25689] Avg episode reward: [(0, '-26.631')] [2022-07-10 04:16:05,261][26022] Updated weights on worker 0-0, policy_version 563027 (0.00095) [2022-07-10 04:16:07,368][26022] Updated weights on worker 0-0, policy_version 563037 (0.00089) [2022-07-10 04:16:09,023][26022] Updated weights on worker 0-0, policy_version 563047 (0.00095) [2022-07-10 04:16:09,050][25689] Fps is (10 sec: 5416.1, 60 sec: 5606.6, 300 sec: 5610.7). Total num frames: 576560128. Throughput: 0: 5808.2. Samples: 576559076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:09,051][25689] Avg episode reward: [(0, '-25.993')] [2022-07-10 04:16:10,723][26022] Updated weights on worker 0-0, policy_version 563057 (0.00091) [2022-07-10 04:16:12,415][26022] Updated weights on worker 0-0, policy_version 563067 (0.00087) [2022-07-10 04:16:14,055][25689] Fps is (10 sec: 5721.2, 60 sec: 5607.6, 300 sec: 5613.7). Total num frames: 576588800. Throughput: 0: 5813.4. Samples: 576593406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:14,056][25689] Avg episode reward: [(0, '-25.702')] [2022-07-10 04:16:14,295][26022] Updated weights on worker 0-0, policy_version 563077 (0.00087) [2022-07-10 04:16:16,052][26022] Updated weights on worker 0-0, policy_version 563087 (0.00074) [2022-07-10 04:16:17,963][26022] Updated weights on worker 0-0, policy_version 563097 (0.00077) [2022-07-10 04:16:19,138][25689] Fps is (10 sec: 5684.6, 60 sec: 5620.9, 300 sec: 5609.3). Total num frames: 576617472. Throughput: 0: 4947.2. Samples: 576610234. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:19,138][25689] Avg episode reward: [(0, '-25.720')] [2022-07-10 04:16:19,739][26022] Updated weights on worker 0-0, policy_version 563107 (0.00095) [2022-07-10 04:16:21,791][26022] Updated weights on worker 0-0, policy_version 563117 (0.00082) [2022-07-10 04:16:23,330][26022] Updated weights on worker 0-0, policy_version 563127 (0.00092) [2022-07-10 04:16:24,153][25689] Fps is (10 sec: 5577.3, 60 sec: 5603.2, 300 sec: 5610.2). Total num frames: 576645120. Throughput: 0: 5787.3. Samples: 576644104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:24,153][25689] Avg episode reward: [(0, '-25.251')] [2022-07-10 04:16:25,279][26022] Updated weights on worker 0-0, policy_version 563137 (0.00088) [2022-07-10 04:16:27,003][26022] Updated weights on worker 0-0, policy_version 563147 (0.00090) [2022-07-10 04:16:28,687][26022] Updated weights on worker 0-0, policy_version 563157 (0.00087) [2022-07-10 04:16:29,160][25689] Fps is (10 sec: 5721.6, 60 sec: 5621.9, 300 sec: 5618.0). Total num frames: 576674816. Throughput: 0: 5935.9. Samples: 576678448. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:29,162][25689] Avg episode reward: [(0, '-26.456')] [2022-07-10 04:16:30,785][26022] Updated weights on worker 0-0, policy_version 563167 (0.00086) [2022-07-10 04:16:32,304][26022] Updated weights on worker 0-0, policy_version 563177 (0.00186) [2022-07-10 04:16:34,168][25689] Fps is (10 sec: 5725.6, 60 sec: 5607.0, 300 sec: 5615.4). Total num frames: 576702464. Throughput: 0: 5069.0. Samples: 576695362. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:34,169][25689] Avg episode reward: [(0, '-26.043')] [2022-07-10 04:16:34,382][26022] Updated weights on worker 0-0, policy_version 563187 (0.00476) [2022-07-10 04:16:36,091][26022] Updated weights on worker 0-0, policy_version 563197 (0.00099) [2022-07-10 04:16:37,862][26022] Updated weights on worker 0-0, policy_version 563207 (0.00095) [2022-07-10 04:16:39,303][25689] Fps is (10 sec: 5552.7, 60 sec: 5633.7, 300 sec: 5612.9). Total num frames: 576731136. Throughput: 0: 5910.0. Samples: 576729410. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:39,304][25689] Avg episode reward: [(0, '-26.043')] [2022-07-10 04:16:39,744][26022] Updated weights on worker 0-0, policy_version 563217 (0.00091) [2022-07-10 04:16:41,544][26022] Updated weights on worker 0-0, policy_version 563227 (0.00088) [2022-07-10 04:16:43,199][26022] Updated weights on worker 0-0, policy_version 563237 (0.00082) [2022-07-10 04:16:44,355][25689] Fps is (10 sec: 5730.0, 60 sec: 5613.4, 300 sec: 5618.9). Total num frames: 576760832. Throughput: 0: 5915.5. Samples: 576763608. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:44,355][25689] Avg episode reward: [(0, '-25.904')] [2022-07-10 04:16:45,259][26022] Updated weights on worker 0-0, policy_version 563247 (0.00094) [2022-07-10 04:16:46,755][26022] Updated weights on worker 0-0, policy_version 563257 (0.00085) [2022-07-10 04:16:48,768][26022] Updated weights on worker 0-0, policy_version 563267 (0.00090) [2022-07-10 04:16:49,366][25689] Fps is (10 sec: 5698.5, 60 sec: 5613.1, 300 sec: 5615.6). Total num frames: 576788480. Throughput: 0: 5890.1. Samples: 576797462. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:49,366][25689] Avg episode reward: [(0, '-26.157')] [2022-07-10 04:16:50,525][26022] Updated weights on worker 0-0, policy_version 563277 (0.00297) [2022-07-10 04:16:52,345][26022] Updated weights on worker 0-0, policy_version 563287 (0.00083) [2022-07-10 04:16:54,199][26022] Updated weights on worker 0-0, policy_version 563297 (0.00089) [2022-07-10 04:16:54,465][25689] Fps is (10 sec: 5570.4, 60 sec: 5621.8, 300 sec: 5615.8). Total num frames: 576817152. Throughput: 0: 5864.2. Samples: 576814388. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:54,466][25689] Avg episode reward: [(0, '-24.426')] [2022-07-10 04:16:55,981][26022] Updated weights on worker 0-0, policy_version 563307 (0.00098) [2022-07-10 04:16:57,710][26022] Updated weights on worker 0-0, policy_version 563317 (0.00089) [2022-07-10 04:16:59,533][25689] Fps is (10 sec: 5640.0, 60 sec: 5618.9, 300 sec: 5622.9). Total num frames: 576845824. Throughput: 0: 5876.5. Samples: 576848294. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:16:59,534][25689] Avg episode reward: [(0, '-24.135')] [2022-07-10 04:16:59,672][26022] Updated weights on worker 0-0, policy_version 563327 (0.00092) [2022-07-10 04:17:01,439][26022] Updated weights on worker 0-0, policy_version 563337 (0.00089) [2022-07-10 04:17:03,630][26022] Updated weights on worker 0-0, policy_version 563347 (0.00093) [2022-07-10 04:17:04,624][25689] Fps is (10 sec: 5543.9, 60 sec: 5644.5, 300 sec: 5615.1). Total num frames: 576873472. Throughput: 0: 5750.2. Samples: 576880162. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:04,625][25689] Avg episode reward: [(0, '-24.606')] [2022-07-10 04:17:05,491][26022] Updated weights on worker 0-0, policy_version 563357 (0.00086) [2022-07-10 04:17:07,229][26022] Updated weights on worker 0-0, policy_version 563367 (0.00086) [2022-07-10 04:17:09,115][26022] Updated weights on worker 0-0, policy_version 563377 (0.00086) [2022-07-10 04:17:09,649][25689] Fps is (10 sec: 5466.5, 60 sec: 5627.0, 300 sec: 5616.0). Total num frames: 576901120. Throughput: 0: 4926.0. Samples: 576897376. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:09,649][25689] Avg episode reward: [(0, '-25.308')] [2022-07-10 04:17:10,820][26022] Updated weights on worker 0-0, policy_version 563387 (0.00084) [2022-07-10 04:17:12,691][26022] Updated weights on worker 0-0, policy_version 563397 (0.00085) [2022-07-10 04:17:14,426][26022] Updated weights on worker 0-0, policy_version 563407 (0.00804) [2022-07-10 04:17:14,680][25689] Fps is (10 sec: 5498.8, 60 sec: 5607.7, 300 sec: 5616.1). Total num frames: 576928768. Throughput: 0: 5764.3. Samples: 576930914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:14,681][25689] Avg episode reward: [(0, '-25.842')] [2022-07-10 04:17:16,467][26022] Updated weights on worker 0-0, policy_version 563417 (0.00097) [2022-07-10 04:17:18,245][26022] Updated weights on worker 0-0, policy_version 563427 (0.00092) [2022-07-10 04:17:19,750][25689] Fps is (10 sec: 5575.7, 60 sec: 5608.9, 300 sec: 5611.5). Total num frames: 576957440. Throughput: 0: 5745.5. Samples: 576964448. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:19,751][25689] Avg episode reward: [(0, '-26.419')] [2022-07-10 04:17:20,045][26022] Updated weights on worker 0-0, policy_version 563437 (0.00096) [2022-07-10 04:17:21,771][26022] Updated weights on worker 0-0, policy_version 563447 (0.00089) [2022-07-10 04:17:23,656][26022] Updated weights on worker 0-0, policy_version 563457 (0.00089) [2022-07-10 04:17:24,833][25689] Fps is (10 sec: 5748.9, 60 sec: 5636.4, 300 sec: 5616.9). Total num frames: 576987136. Throughput: 0: 5014.1. Samples: 576981492. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:24,835][25689] Avg episode reward: [(0, '-26.286')] [2022-07-10 04:17:25,465][26022] Updated weights on worker 0-0, policy_version 563467 (0.00083) [2022-07-10 04:17:27,348][26022] Updated weights on worker 0-0, policy_version 563477 (0.00086) [2022-07-10 04:17:28,887][26022] Updated weights on worker 0-0, policy_version 563487 (0.00086) [2022-07-10 04:17:29,885][25689] Fps is (10 sec: 5556.7, 60 sec: 5581.6, 300 sec: 5605.7). Total num frames: 577013760. Throughput: 0: 5842.0. Samples: 577015598. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:29,887][25689] Avg episode reward: [(0, '-27.072')] [2022-07-10 04:17:30,933][26022] Updated weights on worker 0-0, policy_version 563497 (0.00083) [2022-07-10 04:17:32,633][26022] Updated weights on worker 0-0, policy_version 563507 (0.00086) [2022-07-10 04:17:34,474][26022] Updated weights on worker 0-0, policy_version 563517 (0.00092) [2022-07-10 04:17:34,935][25689] Fps is (10 sec: 5575.4, 60 sec: 5611.5, 300 sec: 5616.0). Total num frames: 577043456. Throughput: 0: 5857.3. Samples: 577049550. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:34,937][25689] Avg episode reward: [(0, '-27.727')] [2022-07-10 04:17:36,193][26022] Updated weights on worker 0-0, policy_version 563527 (0.00098) [2022-07-10 04:17:38,158][26022] Updated weights on worker 0-0, policy_version 563537 (0.00098) [2022-07-10 04:17:39,930][26022] Updated weights on worker 0-0, policy_version 563547 (0.00090) [2022-07-10 04:17:39,996][25689] Fps is (10 sec: 5773.0, 60 sec: 5618.3, 300 sec: 5608.9). Total num frames: 577072128. Throughput: 0: 5032.8. Samples: 577066342. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:39,998][25689] Avg episode reward: [(0, '-28.002')] [2022-07-10 04:17:42,030][26022] Updated weights on worker 0-0, policy_version 563557 (0.00098) [2022-07-10 04:17:43,488][26022] Updated weights on worker 0-0, policy_version 563567 (0.00088) [2022-07-10 04:17:45,030][25689] Fps is (10 sec: 5477.7, 60 sec: 5569.4, 300 sec: 5602.7). Total num frames: 577098752. Throughput: 0: 5872.9. Samples: 577100102. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 04:17:45,031][25689] Avg episode reward: [(0, '-28.103')] [2022-07-10 04:17:45,681][26022] Updated weights on worker 0-0, policy_version 563577 (0.00089) [2022-07-10 04:17:47,099][26022] Updated weights on worker 0-0, policy_version 563587 (0.00085) [2022-07-10 04:17:49,279][26022] Updated weights on worker 0-0, policy_version 563597 (0.00091) [2022-07-10 04:17:50,032][25689] Fps is (10 sec: 5713.9, 60 sec: 5620.8, 300 sec: 5614.0). Total num frames: 577129472. Throughput: 0: 5871.2. Samples: 577133882. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:17:50,032][25689] Avg episode reward: [(0, '-28.178')] [2022-07-10 04:17:50,840][26022] Updated weights on worker 0-0, policy_version 563607 (0.00092) [2022-07-10 04:17:52,910][26022] Updated weights on worker 0-0, policy_version 563617 (0.00086) [2022-07-10 04:17:54,322][26022] Updated weights on worker 0-0, policy_version 563627 (0.00091) [2022-07-10 04:17:55,073][25689] Fps is (10 sec: 5811.4, 60 sec: 5609.3, 300 sec: 5607.6). Total num frames: 577157120. Throughput: 0: 5032.2. Samples: 577150890. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:17:55,074][25689] Avg episode reward: [(0, '-28.241')] [2022-07-10 04:17:56,463][26022] Updated weights on worker 0-0, policy_version 563637 (0.00081) [2022-07-10 04:17:58,163][26022] Updated weights on worker 0-0, policy_version 563647 (0.00066) [2022-07-10 04:17:58,912][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:17:58,922][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000563651_577178624.pth [2022-07-10 04:17:58,930][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000561674_575154176.pth [2022-07-10 04:18:00,034][26022] Updated weights on worker 0-0, policy_version 563657 (0.00916) [2022-07-10 04:18:00,140][25689] Fps is (10 sec: 5470.6, 60 sec: 5592.5, 300 sec: 5610.1). Total num frames: 577184768. Throughput: 0: 5867.9. Samples: 577184544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:00,140][25689] Avg episode reward: [(0, '-28.124')] [2022-07-10 04:18:01,979][26022] Updated weights on worker 0-0, policy_version 563667 (0.00497) [2022-07-10 04:18:04,126][26022] Updated weights on worker 0-0, policy_version 563677 (0.00100) [2022-07-10 04:18:05,150][25689] Fps is (10 sec: 5284.0, 60 sec: 5566.1, 300 sec: 5606.9). Total num frames: 577210368. Throughput: 0: 5772.3. Samples: 577216246. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:05,151][25689] Avg episode reward: [(0, '-27.333')] [2022-07-10 04:18:05,730][26022] Updated weights on worker 0-0, policy_version 563687 (0.00099) [2022-07-10 04:18:07,829][26022] Updated weights on worker 0-0, policy_version 563697 (0.00088) [2022-07-10 04:18:09,409][26022] Updated weights on worker 0-0, policy_version 563707 (0.00091) [2022-07-10 04:18:10,154][25689] Fps is (10 sec: 5419.4, 60 sec: 5585.0, 300 sec: 5607.0). Total num frames: 577239040. Throughput: 0: 4937.0. Samples: 577233224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:10,155][25689] Avg episode reward: [(0, '-27.433')] [2022-07-10 04:18:11,350][26022] Updated weights on worker 0-0, policy_version 563717 (0.00098) [2022-07-10 04:18:13,097][26022] Updated weights on worker 0-0, policy_version 563727 (0.00091) [2022-07-10 04:18:15,142][26022] Updated weights on worker 0-0, policy_version 563737 (0.00092) [2022-07-10 04:18:15,167][25689] Fps is (10 sec: 5622.5, 60 sec: 5586.7, 300 sec: 5604.5). Total num frames: 577266688. Throughput: 0: 5777.7. Samples: 577266988. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:15,169][25689] Avg episode reward: [(0, '-25.830')] [2022-07-10 04:18:16,846][26022] Updated weights on worker 0-0, policy_version 563747 (0.00094) [2022-07-10 04:18:18,854][26022] Updated weights on worker 0-0, policy_version 563757 (0.00092) [2022-07-10 04:18:20,262][25689] Fps is (10 sec: 5571.8, 60 sec: 5584.3, 300 sec: 5600.5). Total num frames: 577295360. Throughput: 0: 5746.2. Samples: 577300172. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:20,264][25689] Avg episode reward: [(0, '-25.682')] [2022-07-10 04:18:20,543][26022] Updated weights on worker 0-0, policy_version 563767 (0.00090) [2022-07-10 04:18:22,467][26022] Updated weights on worker 0-0, policy_version 563777 (0.00090) [2022-07-10 04:18:24,118][26022] Updated weights on worker 0-0, policy_version 563787 (0.00119) [2022-07-10 04:18:25,311][25689] Fps is (10 sec: 5552.5, 60 sec: 5553.7, 300 sec: 5599.9). Total num frames: 577323008. Throughput: 0: 5005.3. Samples: 577317154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:25,312][25689] Avg episode reward: [(0, '-24.857')] [2022-07-10 04:18:25,992][26022] Updated weights on worker 0-0, policy_version 563797 (0.00095) [2022-07-10 04:18:28,057][26022] Updated weights on worker 0-0, policy_version 563807 (0.00085) [2022-07-10 04:18:29,580][26022] Updated weights on worker 0-0, policy_version 563817 (0.00091) [2022-07-10 04:18:30,337][25689] Fps is (10 sec: 5691.7, 60 sec: 5606.9, 300 sec: 5606.9). Total num frames: 577352704. Throughput: 0: 5835.5. Samples: 577351002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:30,338][25689] Avg episode reward: [(0, '-24.537')] [2022-07-10 04:18:31,684][26022] Updated weights on worker 0-0, policy_version 563827 (0.00086) [2022-07-10 04:18:33,027][26022] Updated weights on worker 0-0, policy_version 563837 (0.00086) [2022-07-10 04:18:35,136][26022] Updated weights on worker 0-0, policy_version 563847 (0.00091) [2022-07-10 04:18:35,348][25689] Fps is (10 sec: 5712.8, 60 sec: 5576.5, 300 sec: 5604.3). Total num frames: 577380352. Throughput: 0: 5849.2. Samples: 577385030. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:35,349][25689] Avg episode reward: [(0, '-25.005')] [2022-07-10 04:18:36,732][26022] Updated weights on worker 0-0, policy_version 563857 (0.00091) [2022-07-10 04:18:38,814][26022] Updated weights on worker 0-0, policy_version 563867 (0.00088) [2022-07-10 04:18:40,367][26022] Updated weights on worker 0-0, policy_version 563877 (0.00091) [2022-07-10 04:18:40,422][25689] Fps is (10 sec: 5686.2, 60 sec: 5592.3, 300 sec: 5606.6). Total num frames: 577410048. Throughput: 0: 5879.0. Samples: 577418692. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:40,422][25689] Avg episode reward: [(0, '-24.453')] [2022-07-10 04:18:42,247][26022] Updated weights on worker 0-0, policy_version 563887 (0.00087) [2022-07-10 04:18:43,861][26022] Updated weights on worker 0-0, policy_version 563897 (0.00086) [2022-07-10 04:18:45,486][25689] Fps is (10 sec: 5757.4, 60 sec: 5623.3, 300 sec: 5609.5). Total num frames: 577438720. Throughput: 0: 5886.9. Samples: 577435926. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:45,487][25689] Avg episode reward: [(0, '-25.063')] [2022-07-10 04:18:46,201][26022] Updated weights on worker 0-0, policy_version 563907 (0.00094) [2022-07-10 04:18:47,525][26022] Updated weights on worker 0-0, policy_version 563917 (0.00080) [2022-07-10 04:18:49,606][26022] Updated weights on worker 0-0, policy_version 563927 (0.00085) [2022-07-10 04:18:50,500][25689] Fps is (10 sec: 5588.3, 60 sec: 5571.4, 300 sec: 5605.9). Total num frames: 577466368. Throughput: 0: 5908.5. Samples: 577470136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:50,501][25689] Avg episode reward: [(0, '-24.987')] [2022-07-10 04:18:51,184][26022] Updated weights on worker 0-0, policy_version 563937 (0.00085) [2022-07-10 04:18:53,347][26022] Updated weights on worker 0-0, policy_version 563947 (0.00087) [2022-07-10 04:18:54,868][26022] Updated weights on worker 0-0, policy_version 563957 (0.00095) [2022-07-10 04:18:55,559][25689] Fps is (10 sec: 5591.5, 60 sec: 5586.8, 300 sec: 5609.4). Total num frames: 577495040. Throughput: 0: 5882.3. Samples: 577503914. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:18:55,560][25689] Avg episode reward: [(0, '-25.419')] [2022-07-10 04:18:56,773][26022] Updated weights on worker 0-0, policy_version 563967 (0.00085) [2022-07-10 04:18:58,486][26022] Updated weights on worker 0-0, policy_version 563977 (0.00084) [2022-07-10 04:19:00,451][26022] Updated weights on worker 0-0, policy_version 563987 (0.00088) [2022-07-10 04:19:00,598][25689] Fps is (10 sec: 5577.3, 60 sec: 5589.3, 300 sec: 5608.7). Total num frames: 577522688. Throughput: 0: 5067.9. Samples: 577520944. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:00,599][25689] Avg episode reward: [(0, '-25.094')] [2022-07-10 04:19:02,549][26022] Updated weights on worker 0-0, policy_version 563997 (0.00088) [2022-07-10 04:19:04,423][26022] Updated weights on worker 0-0, policy_version 564007 (0.00093) [2022-07-10 04:19:05,673][25689] Fps is (10 sec: 5467.2, 60 sec: 5617.2, 300 sec: 5611.2). Total num frames: 577550336. Throughput: 0: 5803.7. Samples: 577553082. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:05,675][25689] Avg episode reward: [(0, '-24.156')] [2022-07-10 04:19:06,019][26022] Updated weights on worker 0-0, policy_version 564017 (0.00091) [2022-07-10 04:19:07,923][26022] Updated weights on worker 0-0, policy_version 564027 (0.00091) [2022-07-10 04:19:09,890][26022] Updated weights on worker 0-0, policy_version 564037 (0.00092) [2022-07-10 04:19:10,749][25689] Fps is (10 sec: 5548.4, 60 sec: 5610.5, 300 sec: 5613.9). Total num frames: 577579008. Throughput: 0: 5774.0. Samples: 577587052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:10,749][25689] Avg episode reward: [(0, '-24.630')] [2022-07-10 04:19:11,626][26022] Updated weights on worker 0-0, policy_version 564047 (0.00088) [2022-07-10 04:19:13,479][26022] Updated weights on worker 0-0, policy_version 564057 (0.00089) [2022-07-10 04:19:15,113][26022] Updated weights on worker 0-0, policy_version 564067 (0.00087) [2022-07-10 04:19:15,779][25689] Fps is (10 sec: 5674.4, 60 sec: 5625.9, 300 sec: 5608.5). Total num frames: 577607680. Throughput: 0: 4956.2. Samples: 577604128. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:15,781][25689] Avg episode reward: [(0, '-25.656')] [2022-07-10 04:19:17,027][26022] Updated weights on worker 0-0, policy_version 564077 (0.00098) [2022-07-10 04:19:18,778][26022] Updated weights on worker 0-0, policy_version 564087 (0.00084) [2022-07-10 04:19:20,599][26022] Updated weights on worker 0-0, policy_version 564097 (0.00099) [2022-07-10 04:19:20,888][25689] Fps is (10 sec: 5554.6, 60 sec: 5607.6, 300 sec: 5610.2). Total num frames: 577635328. Throughput: 0: 5773.9. Samples: 577638096. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:20,889][25689] Avg episode reward: [(0, '-25.374')] [2022-07-10 04:19:22,360][26022] Updated weights on worker 0-0, policy_version 564107 (0.00086) [2022-07-10 04:19:24,574][26022] Updated weights on worker 0-0, policy_version 564117 (0.00092) [2022-07-10 04:19:25,946][25689] Fps is (10 sec: 5640.1, 60 sec: 5640.6, 300 sec: 5609.4). Total num frames: 577665024. Throughput: 0: 5870.4. Samples: 577672092. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:25,946][25689] Avg episode reward: [(0, '-25.851')] [2022-07-10 04:19:26,070][26022] Updated weights on worker 0-0, policy_version 564127 (0.00093) [2022-07-10 04:19:28,044][26022] Updated weights on worker 0-0, policy_version 564137 (0.00083) [2022-07-10 04:19:29,584][26022] Updated weights on worker 0-0, policy_version 564147 (0.00087) [2022-07-10 04:19:30,966][25689] Fps is (10 sec: 5588.4, 60 sec: 5590.4, 300 sec: 5602.4). Total num frames: 577691648. Throughput: 0: 5039.2. Samples: 577688932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:30,967][25689] Avg episode reward: [(0, '-27.205')] [2022-07-10 04:19:31,712][26022] Updated weights on worker 0-0, policy_version 564157 (0.00088) [2022-07-10 04:19:33,457][26022] Updated weights on worker 0-0, policy_version 564167 (0.00091) [2022-07-10 04:19:35,289][26022] Updated weights on worker 0-0, policy_version 564177 (0.00091) [2022-07-10 04:19:35,983][25689] Fps is (10 sec: 5610.9, 60 sec: 5623.7, 300 sec: 5607.5). Total num frames: 577721344. Throughput: 0: 5863.4. Samples: 577722596. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:35,984][25689] Avg episode reward: [(0, '-26.774')] [2022-07-10 04:19:37,152][26022] Updated weights on worker 0-0, policy_version 564187 (0.00088) [2022-07-10 04:19:39,130][26022] Updated weights on worker 0-0, policy_version 564197 (0.00091) [2022-07-10 04:19:40,621][26022] Updated weights on worker 0-0, policy_version 564207 (0.00093) [2022-07-10 04:19:41,093][25689] Fps is (10 sec: 5763.7, 60 sec: 5603.4, 300 sec: 5603.0). Total num frames: 577750016. Throughput: 0: 5850.4. Samples: 577756302. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:41,094][25689] Avg episode reward: [(0, '-25.596')] [2022-07-10 04:19:42,657][26022] Updated weights on worker 0-0, policy_version 564217 (0.00084) [2022-07-10 04:19:44,471][26022] Updated weights on worker 0-0, policy_version 564227 (0.00087) [2022-07-10 04:19:46,138][25689] Fps is (10 sec: 5546.0, 60 sec: 5588.3, 300 sec: 5605.8). Total num frames: 577777664. Throughput: 0: 5005.6. Samples: 577773168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:46,139][25689] Avg episode reward: [(0, '-24.689')] [2022-07-10 04:19:46,175][26022] Updated weights on worker 0-0, policy_version 564237 (0.00094) [2022-07-10 04:19:48,204][26022] Updated weights on worker 0-0, policy_version 564247 (0.00091) [2022-07-10 04:19:49,683][26022] Updated weights on worker 0-0, policy_version 564257 (0.00087) [2022-07-10 04:19:51,184][25689] Fps is (10 sec: 5479.9, 60 sec: 5585.4, 300 sec: 5601.8). Total num frames: 577805312. Throughput: 0: 5836.5. Samples: 577806932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:51,184][25689] Avg episode reward: [(0, '-25.261')] [2022-07-10 04:19:51,816][26022] Updated weights on worker 0-0, policy_version 564267 (0.00090) [2022-07-10 04:19:53,248][26022] Updated weights on worker 0-0, policy_version 564277 (0.00089) [2022-07-10 04:19:55,309][26022] Updated weights on worker 0-0, policy_version 564287 (0.00092) [2022-07-10 04:19:56,200][25689] Fps is (10 sec: 5699.6, 60 sec: 5606.3, 300 sec: 5600.1). Total num frames: 577835008. Throughput: 0: 5857.0. Samples: 577841002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:19:56,200][25689] Avg episode reward: [(0, '-24.432')] [2022-07-10 04:19:57,176][26022] Updated weights on worker 0-0, policy_version 564297 (0.00088) [2022-07-10 04:19:58,728][26022] Updated weights on worker 0-0, policy_version 564307 (0.00092) [2022-07-10 04:19:58,974][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:19:58,996][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000564308_577851392.pth [2022-07-10 04:19:58,997][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000562333_575828992.pth [2022-07-10 04:20:00,922][26022] Updated weights on worker 0-0, policy_version 564317 (0.00093) [2022-07-10 04:20:01,290][25689] Fps is (10 sec: 5573.1, 60 sec: 5584.7, 300 sec: 5610.0). Total num frames: 577861632. Throughput: 0: 5039.9. Samples: 577858090. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:01,290][25689] Avg episode reward: [(0, '-24.607')] [2022-07-10 04:20:02,819][26022] Updated weights on worker 0-0, policy_version 564327 (0.00080) [2022-07-10 04:20:04,736][26022] Updated weights on worker 0-0, policy_version 564337 (0.00086) [2022-07-10 04:20:06,320][25689] Fps is (10 sec: 5463.7, 60 sec: 5605.7, 300 sec: 5606.3). Total num frames: 577890304. Throughput: 0: 5784.4. Samples: 577889906. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:06,322][25689] Avg episode reward: [(0, '-24.954')] [2022-07-10 04:20:06,422][26022] Updated weights on worker 0-0, policy_version 564347 (0.00090) [2022-07-10 04:20:08,524][26022] Updated weights on worker 0-0, policy_version 564357 (0.00083) [2022-07-10 04:20:10,054][26022] Updated weights on worker 0-0, policy_version 564367 (0.00093) [2022-07-10 04:20:11,339][25689] Fps is (10 sec: 5604.6, 60 sec: 5594.1, 300 sec: 5602.8). Total num frames: 577917952. Throughput: 0: 5800.6. Samples: 577923842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:11,340][25689] Avg episode reward: [(0, '-24.722')] [2022-07-10 04:20:11,927][26022] Updated weights on worker 0-0, policy_version 564377 (0.00452) [2022-07-10 04:20:13,776][26022] Updated weights on worker 0-0, policy_version 564387 (0.00087) [2022-07-10 04:20:15,814][26022] Updated weights on worker 0-0, policy_version 564397 (0.00087) [2022-07-10 04:20:16,385][25689] Fps is (10 sec: 5595.4, 60 sec: 5592.5, 300 sec: 5606.1). Total num frames: 577946624. Throughput: 0: 4941.1. Samples: 577940742. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:16,386][25689] Avg episode reward: [(0, '-24.565')] [2022-07-10 04:20:17,405][26022] Updated weights on worker 0-0, policy_version 564407 (0.00097) [2022-07-10 04:20:19,364][26022] Updated weights on worker 0-0, policy_version 564417 (0.00101) [2022-07-10 04:20:20,961][26022] Updated weights on worker 0-0, policy_version 564427 (0.00089) [2022-07-10 04:20:21,507][25689] Fps is (10 sec: 5538.6, 60 sec: 5591.4, 300 sec: 5600.5). Total num frames: 577974272. Throughput: 0: 5747.9. Samples: 577974298. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:21,508][25689] Avg episode reward: [(0, '-23.659')] [2022-07-10 04:20:22,927][26022] Updated weights on worker 0-0, policy_version 564437 (0.00088) [2022-07-10 04:20:24,549][26022] Updated weights on worker 0-0, policy_version 564447 (0.00089) [2022-07-10 04:20:26,523][25689] Fps is (10 sec: 5555.7, 60 sec: 5578.4, 300 sec: 5600.7). Total num frames: 578002944. Throughput: 0: 5863.7. Samples: 578008368. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:26,523][25689] Avg episode reward: [(0, '-22.992')] [2022-07-10 04:20:26,558][26022] Updated weights on worker 0-0, policy_version 564457 (0.00087) [2022-07-10 04:20:28,249][26022] Updated weights on worker 0-0, policy_version 564467 (0.00087) [2022-07-10 04:20:30,123][26022] Updated weights on worker 0-0, policy_version 564477 (0.00088) [2022-07-10 04:20:31,533][25689] Fps is (10 sec: 5719.6, 60 sec: 5613.1, 300 sec: 5601.1). Total num frames: 578031616. Throughput: 0: 5019.6. Samples: 578025210. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:31,533][25689] Avg episode reward: [(0, '-23.946')] [2022-07-10 04:20:31,879][26022] Updated weights on worker 0-0, policy_version 564487 (0.00086) [2022-07-10 04:20:33,799][26022] Updated weights on worker 0-0, policy_version 564497 (0.00086) [2022-07-10 04:20:35,579][26022] Updated weights on worker 0-0, policy_version 564507 (0.00084) [2022-07-10 04:20:36,546][25689] Fps is (10 sec: 5721.4, 60 sec: 5596.6, 300 sec: 5608.8). Total num frames: 578060288. Throughput: 0: 5885.2. Samples: 578059388. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:36,546][25689] Avg episode reward: [(0, '-24.312')] [2022-07-10 04:20:37,384][26022] Updated weights on worker 0-0, policy_version 564517 (0.00085) [2022-07-10 04:20:39,187][26022] Updated weights on worker 0-0, policy_version 564527 (0.00088) [2022-07-10 04:20:40,972][26022] Updated weights on worker 0-0, policy_version 564537 (0.00092) [2022-07-10 04:20:41,594][25689] Fps is (10 sec: 5801.5, 60 sec: 5619.3, 300 sec: 5604.8). Total num frames: 578089984. Throughput: 0: 5943.1. Samples: 578093674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:41,594][25689] Avg episode reward: [(0, '-23.718')] [2022-07-10 04:20:42,832][26022] Updated weights on worker 0-0, policy_version 564547 (0.00091) [2022-07-10 04:20:44,585][26022] Updated weights on worker 0-0, policy_version 564557 (0.00091) [2022-07-10 04:20:46,416][26022] Updated weights on worker 0-0, policy_version 564567 (0.00086) [2022-07-10 04:20:46,613][25689] Fps is (10 sec: 5695.9, 60 sec: 5621.7, 300 sec: 5604.6). Total num frames: 578117632. Throughput: 0: 5087.3. Samples: 578110574. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2022-07-10 04:20:46,614][25689] Avg episode reward: [(0, '-24.829')] [2022-07-10 04:20:48,159][26022] Updated weights on worker 0-0, policy_version 564577 (0.00083) [2022-07-10 04:20:49,983][26022] Updated weights on worker 0-0, policy_version 564587 (0.00089) [2022-07-10 04:20:51,628][25689] Fps is (10 sec: 5510.8, 60 sec: 5624.5, 300 sec: 5604.5). Total num frames: 578145280. Throughput: 0: 5927.7. Samples: 578144326. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:20:51,628][25689] Avg episode reward: [(0, '-25.406')] [2022-07-10 04:20:51,840][26022] Updated weights on worker 0-0, policy_version 564597 (0.00084) [2022-07-10 04:20:53,792][26022] Updated weights on worker 0-0, policy_version 564607 (0.00081) [2022-07-10 04:20:55,341][26022] Updated weights on worker 0-0, policy_version 564617 (0.00092) [2022-07-10 04:20:56,651][25689] Fps is (10 sec: 5508.8, 60 sec: 5590.0, 300 sec: 5601.3). Total num frames: 578172928. Throughput: 0: 5904.3. Samples: 578178096. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:20:56,651][25689] Avg episode reward: [(0, '-25.318')] [2022-07-10 04:20:57,626][26022] Updated weights on worker 0-0, policy_version 564627 (0.00083) [2022-07-10 04:20:59,103][26022] Updated weights on worker 0-0, policy_version 564637 (0.00090) [2022-07-10 04:21:01,051][26022] Updated weights on worker 0-0, policy_version 564647 (0.00081) [2022-07-10 04:21:01,700][25689] Fps is (10 sec: 5795.0, 60 sec: 5661.5, 300 sec: 5617.6). Total num frames: 578203648. Throughput: 0: 5029.6. Samples: 578194800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:01,701][25689] Avg episode reward: [(0, '-25.449')] [2022-07-10 04:21:03,337][26022] Updated weights on worker 0-0, policy_version 564657 (0.00097) [2022-07-10 04:21:04,850][26022] Updated weights on worker 0-0, policy_version 564667 (0.00091) [2022-07-10 04:21:06,733][25689] Fps is (10 sec: 5281.3, 60 sec: 5559.6, 300 sec: 5596.7). Total num frames: 578226176. Throughput: 0: 5760.7. Samples: 578226480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:06,734][25689] Avg episode reward: [(0, '-24.938')] [2022-07-10 04:21:07,054][26022] Updated weights on worker 0-0, policy_version 564677 (0.00093) [2022-07-10 04:21:08,482][26022] Updated weights on worker 0-0, policy_version 564687 (0.00090) [2022-07-10 04:21:10,527][26022] Updated weights on worker 0-0, policy_version 564697 (0.00076) [2022-07-10 04:21:11,766][25689] Fps is (10 sec: 5289.9, 60 sec: 5609.1, 300 sec: 5603.1). Total num frames: 578256896. Throughput: 0: 5779.9. Samples: 578260722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:11,768][25689] Avg episode reward: [(0, '-25.094')] [2022-07-10 04:21:12,330][26022] Updated weights on worker 0-0, policy_version 564707 (0.00086) [2022-07-10 04:21:13,878][26022] Updated weights on worker 0-0, policy_version 564717 (0.00090) [2022-07-10 04:21:16,008][26022] Updated weights on worker 0-0, policy_version 564727 (0.00080) [2022-07-10 04:21:16,788][25689] Fps is (10 sec: 6008.6, 60 sec: 5628.4, 300 sec: 5607.7). Total num frames: 578286592. Throughput: 0: 5795.0. Samples: 578294792. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:16,789][25689] Avg episode reward: [(0, '-25.408')] [2022-07-10 04:21:17,747][26022] Updated weights on worker 0-0, policy_version 564737 (0.00085) [2022-07-10 04:21:19,469][26022] Updated weights on worker 0-0, policy_version 564747 (0.00090) [2022-07-10 04:21:21,353][26022] Updated weights on worker 0-0, policy_version 564757 (0.00085) [2022-07-10 04:21:21,881][25689] Fps is (10 sec: 5669.2, 60 sec: 5631.0, 300 sec: 5606.2). Total num frames: 578314240. Throughput: 0: 5801.3. Samples: 578311876. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:21,881][25689] Avg episode reward: [(0, '-24.177')] [2022-07-10 04:21:22,998][26022] Updated weights on worker 0-0, policy_version 564767 (0.00096) [2022-07-10 04:21:24,920][26022] Updated weights on worker 0-0, policy_version 564777 (0.00090) [2022-07-10 04:21:26,671][26022] Updated weights on worker 0-0, policy_version 564787 (0.00091) [2022-07-10 04:21:26,926][25689] Fps is (10 sec: 5454.5, 60 sec: 5611.4, 300 sec: 5598.6). Total num frames: 578341888. Throughput: 0: 5918.5. Samples: 578345990. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:26,926][25689] Avg episode reward: [(0, '-23.429')] [2022-07-10 04:21:28,409][26022] Updated weights on worker 0-0, policy_version 564797 (0.00094) [2022-07-10 04:21:30,398][26022] Updated weights on worker 0-0, policy_version 564807 (0.00087) [2022-07-10 04:21:31,912][26022] Updated weights on worker 0-0, policy_version 564817 (0.00088) [2022-07-10 04:21:31,939][25689] Fps is (10 sec: 5803.3, 60 sec: 5645.0, 300 sec: 5608.8). Total num frames: 578372608. Throughput: 0: 5920.4. Samples: 578380152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:31,940][25689] Avg episode reward: [(0, '-24.994')] [2022-07-10 04:21:33,978][26022] Updated weights on worker 0-0, policy_version 564827 (0.00085) [2022-07-10 04:21:35,654][26022] Updated weights on worker 0-0, policy_version 564837 (0.00096) [2022-07-10 04:21:36,947][25689] Fps is (10 sec: 5722.2, 60 sec: 5611.5, 300 sec: 5604.3). Total num frames: 578399232. Throughput: 0: 5081.9. Samples: 578397238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:36,948][25689] Avg episode reward: [(0, '-25.058')] [2022-07-10 04:21:37,687][26022] Updated weights on worker 0-0, policy_version 564847 (0.00089) [2022-07-10 04:21:39,314][26022] Updated weights on worker 0-0, policy_version 564857 (0.00088) [2022-07-10 04:21:41,123][26022] Updated weights on worker 0-0, policy_version 564867 (0.00086) [2022-07-10 04:21:42,018][25689] Fps is (10 sec: 5486.2, 60 sec: 5592.5, 300 sec: 5600.5). Total num frames: 578427904. Throughput: 0: 5933.7. Samples: 578431362. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:42,018][25689] Avg episode reward: [(0, '-25.720')] [2022-07-10 04:21:42,753][26022] Updated weights on worker 0-0, policy_version 564877 (0.00091) [2022-07-10 04:21:44,636][26022] Updated weights on worker 0-0, policy_version 564887 (0.00093) [2022-07-10 04:21:46,540][26022] Updated weights on worker 0-0, policy_version 564897 (0.00520) [2022-07-10 04:21:47,065][25689] Fps is (10 sec: 5667.3, 60 sec: 5606.8, 300 sec: 5603.3). Total num frames: 578456576. Throughput: 0: 5931.1. Samples: 578465440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:47,066][25689] Avg episode reward: [(0, '-26.056')] [2022-07-10 04:21:48,380][26022] Updated weights on worker 0-0, policy_version 564907 (0.00090) [2022-07-10 04:21:50,346][26022] Updated weights on worker 0-0, policy_version 564917 (0.00092) [2022-07-10 04:21:51,969][26022] Updated weights on worker 0-0, policy_version 564927 (0.00089) [2022-07-10 04:21:52,078][25689] Fps is (10 sec: 5802.1, 60 sec: 5640.9, 300 sec: 5608.4). Total num frames: 578486272. Throughput: 0: 5088.3. Samples: 578482624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:52,078][25689] Avg episode reward: [(0, '-27.434')] [2022-07-10 04:21:53,792][26022] Updated weights on worker 0-0, policy_version 564937 (0.00087) [2022-07-10 04:21:55,514][26022] Updated weights on worker 0-0, policy_version 564947 (0.00083) [2022-07-10 04:21:57,089][25689] Fps is (10 sec: 5720.7, 60 sec: 5642.0, 300 sec: 5606.0). Total num frames: 578513920. Throughput: 0: 5947.2. Samples: 578517028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:21:57,090][25689] Avg episode reward: [(0, '-27.440')] [2022-07-10 04:21:57,208][26022] Updated weights on worker 0-0, policy_version 564957 (0.00089) [2022-07-10 04:21:59,061][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:21:59,074][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000564966_578525184.pth [2022-07-10 04:21:59,074][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000562993_576504832.pth [2022-07-10 04:21:59,337][26022] Updated weights on worker 0-0, policy_version 564967 (0.00088) [2022-07-10 04:22:00,736][26022] Updated weights on worker 0-0, policy_version 564977 (0.00092) [2022-07-10 04:22:02,165][25689] Fps is (10 sec: 5481.8, 60 sec: 5588.7, 300 sec: 5606.3). Total num frames: 578541568. Throughput: 0: 5924.7. Samples: 578550726. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:02,174][25689] Avg episode reward: [(0, '-26.292')] [2022-07-10 04:22:03,171][26022] Updated weights on worker 0-0, policy_version 564987 (0.00091) [2022-07-10 04:22:04,832][26022] Updated weights on worker 0-0, policy_version 564997 (0.00086) [2022-07-10 04:22:06,610][26022] Updated weights on worker 0-0, policy_version 565007 (0.00073) [2022-07-10 04:22:07,194][25689] Fps is (10 sec: 5573.8, 60 sec: 5690.7, 300 sec: 5609.6). Total num frames: 578570240. Throughput: 0: 4988.5. Samples: 578565850. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:07,194][25689] Avg episode reward: [(0, '-26.953')] [2022-07-10 04:22:08,558][26022] Updated weights on worker 0-0, policy_version 565017 (0.00086) [2022-07-10 04:22:10,190][26022] Updated weights on worker 0-0, policy_version 565027 (0.00097) [2022-07-10 04:22:12,205][25689] Fps is (10 sec: 5507.2, 60 sec: 5624.9, 300 sec: 5606.6). Total num frames: 578596864. Throughput: 0: 5831.3. Samples: 578599996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:12,206][25689] Avg episode reward: [(0, '-25.655')] [2022-07-10 04:22:12,306][26022] Updated weights on worker 0-0, policy_version 565037 (0.00086) [2022-07-10 04:22:13,920][26022] Updated weights on worker 0-0, policy_version 565047 (0.00094) [2022-07-10 04:22:15,857][26022] Updated weights on worker 0-0, policy_version 565057 (0.00089) [2022-07-10 04:22:17,206][25689] Fps is (10 sec: 5522.7, 60 sec: 5609.9, 300 sec: 5607.9). Total num frames: 578625536. Throughput: 0: 5812.7. Samples: 578633962. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:17,207][25689] Avg episode reward: [(0, '-26.127')] [2022-07-10 04:22:17,501][26022] Updated weights on worker 0-0, policy_version 565067 (0.00052) [2022-07-10 04:22:19,262][26022] Updated weights on worker 0-0, policy_version 565077 (0.00079) [2022-07-10 04:22:21,162][26022] Updated weights on worker 0-0, policy_version 565087 (0.00095) [2022-07-10 04:22:22,361][25689] Fps is (10 sec: 5848.2, 60 sec: 5655.0, 300 sec: 5610.0). Total num frames: 578656256. Throughput: 0: 4971.7. Samples: 578651136. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:22,361][25689] Avg episode reward: [(0, '-26.198')] [2022-07-10 04:22:22,998][26022] Updated weights on worker 0-0, policy_version 565097 (0.00090) [2022-07-10 04:22:24,591][26022] Updated weights on worker 0-0, policy_version 565107 (0.00092) [2022-07-10 04:22:26,695][26022] Updated weights on worker 0-0, policy_version 565117 (0.00083) [2022-07-10 04:22:27,365][25689] Fps is (10 sec: 5645.1, 60 sec: 5641.9, 300 sec: 5610.9). Total num frames: 578682880. Throughput: 0: 5927.1. Samples: 578685404. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:27,365][25689] Avg episode reward: [(0, '-25.921')] [2022-07-10 04:22:28,246][26022] Updated weights on worker 0-0, policy_version 565127 (0.00082) [2022-07-10 04:22:30,279][26022] Updated weights on worker 0-0, policy_version 565137 (0.00089) [2022-07-10 04:22:32,083][26022] Updated weights on worker 0-0, policy_version 565147 (0.00090) [2022-07-10 04:22:32,387][25689] Fps is (10 sec: 5515.4, 60 sec: 5607.2, 300 sec: 5608.0). Total num frames: 578711552. Throughput: 0: 5909.1. Samples: 578719250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:32,387][25689] Avg episode reward: [(0, '-24.943')] [2022-07-10 04:22:33,886][26022] Updated weights on worker 0-0, policy_version 565157 (0.00094) [2022-07-10 04:22:35,653][26022] Updated weights on worker 0-0, policy_version 565167 (0.00088) [2022-07-10 04:22:37,361][26022] Updated weights on worker 0-0, policy_version 565177 (0.00086) [2022-07-10 04:22:37,396][25689] Fps is (10 sec: 5818.4, 60 sec: 5657.9, 300 sec: 5612.4). Total num frames: 578741248. Throughput: 0: 5080.2. Samples: 578736530. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:37,397][25689] Avg episode reward: [(0, '-25.298')] [2022-07-10 04:22:39,180][26022] Updated weights on worker 0-0, policy_version 565187 (0.00089) [2022-07-10 04:22:40,990][26022] Updated weights on worker 0-0, policy_version 565197 (0.00086) [2022-07-10 04:22:42,502][25689] Fps is (10 sec: 5770.3, 60 sec: 5654.6, 300 sec: 5617.9). Total num frames: 578769920. Throughput: 0: 5940.0. Samples: 578770774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:42,503][25689] Avg episode reward: [(0, '-24.710')] [2022-07-10 04:22:42,874][26022] Updated weights on worker 0-0, policy_version 565207 (0.00082) [2022-07-10 04:22:44,389][26022] Updated weights on worker 0-0, policy_version 565217 (0.00091) [2022-07-10 04:22:46,555][26022] Updated weights on worker 0-0, policy_version 565227 (0.00112) [2022-07-10 04:22:47,534][25689] Fps is (10 sec: 5757.6, 60 sec: 5673.0, 300 sec: 5613.9). Total num frames: 578799616. Throughput: 0: 5923.5. Samples: 578804876. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:47,534][25689] Avg episode reward: [(0, '-24.132')] [2022-07-10 04:22:48,110][26022] Updated weights on worker 0-0, policy_version 565237 (0.00093) [2022-07-10 04:22:50,132][26022] Updated weights on worker 0-0, policy_version 565247 (0.00090) [2022-07-10 04:22:51,665][26022] Updated weights on worker 0-0, policy_version 565257 (0.00085) [2022-07-10 04:22:52,549][25689] Fps is (10 sec: 5605.7, 60 sec: 5622.0, 300 sec: 5610.9). Total num frames: 578826240. Throughput: 0: 5097.9. Samples: 578822034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:52,549][25689] Avg episode reward: [(0, '-24.144')] [2022-07-10 04:22:53,587][26022] Updated weights on worker 0-0, policy_version 565267 (0.00089) [2022-07-10 04:22:55,561][26022] Updated weights on worker 0-0, policy_version 565277 (0.00085) [2022-07-10 04:22:57,206][26022] Updated weights on worker 0-0, policy_version 565287 (0.00088) [2022-07-10 04:22:57,580][25689] Fps is (10 sec: 5708.3, 60 sec: 5671.0, 300 sec: 5621.9). Total num frames: 578856960. Throughput: 0: 5931.8. Samples: 578856254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:22:57,581][25689] Avg episode reward: [(0, '-25.845')] [2022-07-10 04:22:59,031][26022] Updated weights on worker 0-0, policy_version 565297 (0.00083) [2022-07-10 04:23:00,742][26022] Updated weights on worker 0-0, policy_version 565307 (0.00090) [2022-07-10 04:23:02,694][25689] Fps is (10 sec: 5450.3, 60 sec: 5616.6, 300 sec: 5616.5). Total num frames: 578881536. Throughput: 0: 5854.1. Samples: 578888982. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:02,695][25689] Avg episode reward: [(0, '-26.316')] [2022-07-10 04:23:03,178][26022] Updated weights on worker 0-0, policy_version 565317 (0.00085) [2022-07-10 04:23:04,712][26022] Updated weights on worker 0-0, policy_version 565327 (0.00081) [2022-07-10 04:23:06,546][26022] Updated weights on worker 0-0, policy_version 565337 (0.00087) [2022-07-10 04:23:07,713][25689] Fps is (10 sec: 5254.7, 60 sec: 5617.5, 300 sec: 5616.3). Total num frames: 578910208. Throughput: 0: 4963.9. Samples: 578905044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:07,714][25689] Avg episode reward: [(0, '-26.389')] [2022-07-10 04:23:08,468][26022] Updated weights on worker 0-0, policy_version 565347 (0.00093) [2022-07-10 04:23:10,349][26022] Updated weights on worker 0-0, policy_version 565357 (0.00084) [2022-07-10 04:23:11,841][26022] Updated weights on worker 0-0, policy_version 565367 (0.00084) [2022-07-10 04:23:12,728][25689] Fps is (10 sec: 5817.1, 60 sec: 5667.9, 300 sec: 5623.1). Total num frames: 578939904. Throughput: 0: 5805.1. Samples: 578939178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:12,729][25689] Avg episode reward: [(0, '-26.722')] [2022-07-10 04:23:13,895][26022] Updated weights on worker 0-0, policy_version 565377 (0.00087) [2022-07-10 04:23:15,559][26022] Updated weights on worker 0-0, policy_version 565387 (0.00087) [2022-07-10 04:23:17,403][26022] Updated weights on worker 0-0, policy_version 565397 (0.00089) [2022-07-10 04:23:17,739][25689] Fps is (10 sec: 5719.7, 60 sec: 5650.1, 300 sec: 5621.3). Total num frames: 578967552. Throughput: 0: 5810.0. Samples: 578973378. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:17,739][25689] Avg episode reward: [(0, '-26.348')] [2022-07-10 04:23:19,405][26022] Updated weights on worker 0-0, policy_version 565407 (0.00092) [2022-07-10 04:23:21,067][26022] Updated weights on worker 0-0, policy_version 565417 (0.00090) [2022-07-10 04:23:22,828][25689] Fps is (10 sec: 5576.3, 60 sec: 5622.4, 300 sec: 5623.9). Total num frames: 578996224. Throughput: 0: 5024.0. Samples: 578990134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:22,829][25689] Avg episode reward: [(0, '-24.788')] [2022-07-10 04:23:22,991][26022] Updated weights on worker 0-0, policy_version 565427 (0.00103) [2022-07-10 04:23:24,762][26022] Updated weights on worker 0-0, policy_version 565437 (0.00087) [2022-07-10 04:23:26,490][26022] Updated weights on worker 0-0, policy_version 565447 (0.00091) [2022-07-10 04:23:27,901][25689] Fps is (10 sec: 5643.1, 60 sec: 5649.8, 300 sec: 5619.6). Total num frames: 579024896. Throughput: 0: 5895.4. Samples: 579024058. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:27,901][25689] Avg episode reward: [(0, '-24.090')] [2022-07-10 04:23:28,442][26022] Updated weights on worker 0-0, policy_version 565457 (0.00088) [2022-07-10 04:23:30,166][26022] Updated weights on worker 0-0, policy_version 565467 (0.00087) [2022-07-10 04:23:32,103][26022] Updated weights on worker 0-0, policy_version 565477 (0.00084) [2022-07-10 04:23:32,908][25689] Fps is (10 sec: 5688.8, 60 sec: 5651.1, 300 sec: 5623.1). Total num frames: 579053568. Throughput: 0: 5893.6. Samples: 579058110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:32,909][25689] Avg episode reward: [(0, '-23.379')] [2022-07-10 04:23:33,691][26022] Updated weights on worker 0-0, policy_version 565487 (0.00088) [2022-07-10 04:23:35,562][26022] Updated weights on worker 0-0, policy_version 565497 (0.00096) [2022-07-10 04:23:37,395][26022] Updated weights on worker 0-0, policy_version 565507 (0.00097) [2022-07-10 04:23:37,952][25689] Fps is (10 sec: 5603.3, 60 sec: 5614.1, 300 sec: 5616.8). Total num frames: 579081216. Throughput: 0: 5024.9. Samples: 579074948. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:37,952][25689] Avg episode reward: [(0, '-22.716')] [2022-07-10 04:23:39,243][26022] Updated weights on worker 0-0, policy_version 565517 (0.00086) [2022-07-10 04:23:40,982][26022] Updated weights on worker 0-0, policy_version 565527 (0.00108) [2022-07-10 04:23:42,856][26022] Updated weights on worker 0-0, policy_version 565537 (0.00098) [2022-07-10 04:23:43,004][25689] Fps is (10 sec: 5680.0, 60 sec: 5636.0, 300 sec: 5620.5). Total num frames: 579110912. Throughput: 0: 5896.8. Samples: 579109108. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:43,005][25689] Avg episode reward: [(0, '-23.233')] [2022-07-10 04:23:44,459][26022] Updated weights on worker 0-0, policy_version 565547 (0.00083) [2022-07-10 04:23:46,498][26022] Updated weights on worker 0-0, policy_version 565557 (0.00087) [2022-07-10 04:23:48,091][25689] Fps is (10 sec: 5756.9, 60 sec: 5614.0, 300 sec: 5622.5). Total num frames: 579139584. Throughput: 0: 5907.6. Samples: 579143334. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:23:48,091][25689] Avg episode reward: [(0, '-23.878')] [2022-07-10 04:23:48,199][26022] Updated weights on worker 0-0, policy_version 565567 (0.00086) [2022-07-10 04:23:49,989][26022] Updated weights on worker 0-0, policy_version 565577 (0.00091) [2022-07-10 04:23:51,953][26022] Updated weights on worker 0-0, policy_version 565587 (0.00084) [2022-07-10 04:23:53,136][25689] Fps is (10 sec: 5659.9, 60 sec: 5645.0, 300 sec: 5622.8). Total num frames: 579168256. Throughput: 0: 5875.1. Samples: 579176948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:23:53,136][25689] Avg episode reward: [(0, '-25.220')] [2022-07-10 04:23:53,651][26022] Updated weights on worker 0-0, policy_version 565597 (0.00079) [2022-07-10 04:23:55,558][26022] Updated weights on worker 0-0, policy_version 565607 (0.00094) [2022-07-10 04:23:57,362][26022] Updated weights on worker 0-0, policy_version 565617 (0.00093) [2022-07-10 04:23:58,219][25689] Fps is (10 sec: 5560.9, 60 sec: 5589.5, 300 sec: 5622.0). Total num frames: 579195904. Throughput: 0: 5877.1. Samples: 579194058. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:23:58,219][25689] Avg episode reward: [(0, '-25.017')] [2022-07-10 04:23:59,019][26022] Updated weights on worker 0-0, policy_version 565627 (0.00063) [2022-07-10 04:23:59,125][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:23:59,146][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000565628_579203072.pth [2022-07-10 04:23:59,157][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000563651_577178624.pth [2022-07-10 04:24:00,911][26022] Updated weights on worker 0-0, policy_version 565637 (0.00085) [2022-07-10 04:24:03,031][26022] Updated weights on worker 0-0, policy_version 565647 (0.00092) [2022-07-10 04:24:03,273][25689] Fps is (10 sec: 5454.9, 60 sec: 5645.8, 300 sec: 5622.4). Total num frames: 579223552. Throughput: 0: 5784.6. Samples: 579226354. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:03,273][25689] Avg episode reward: [(0, '-26.162')] [2022-07-10 04:24:04,975][26022] Updated weights on worker 0-0, policy_version 565657 (0.00096) [2022-07-10 04:24:06,615][26022] Updated weights on worker 0-0, policy_version 565667 (0.00094) [2022-07-10 04:24:08,310][25689] Fps is (10 sec: 5581.1, 60 sec: 5644.1, 300 sec: 5623.1). Total num frames: 579252224. Throughput: 0: 5791.6. Samples: 579260436. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:08,310][25689] Avg episode reward: [(0, '-26.794')] [2022-07-10 04:24:08,388][26022] Updated weights on worker 0-0, policy_version 565677 (0.00092) [2022-07-10 04:24:10,375][26022] Updated weights on worker 0-0, policy_version 565687 (0.00087) [2022-07-10 04:24:12,020][26022] Updated weights on worker 0-0, policy_version 565697 (0.00087) [2022-07-10 04:24:13,335][25689] Fps is (10 sec: 5596.9, 60 sec: 5609.4, 300 sec: 5619.7). Total num frames: 579279872. Throughput: 0: 4983.6. Samples: 579277616. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:13,336][25689] Avg episode reward: [(0, '-26.994')] [2022-07-10 04:24:13,966][26022] Updated weights on worker 0-0, policy_version 565707 (0.00053) [2022-07-10 04:24:15,537][26022] Updated weights on worker 0-0, policy_version 565717 (0.00083) [2022-07-10 04:24:17,496][26022] Updated weights on worker 0-0, policy_version 565727 (0.00101) [2022-07-10 04:24:18,375][25689] Fps is (10 sec: 5697.5, 60 sec: 5640.5, 300 sec: 5628.0). Total num frames: 579309568. Throughput: 0: 5850.5. Samples: 579311980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:18,375][25689] Avg episode reward: [(0, '-29.267')] [2022-07-10 04:24:19,333][26022] Updated weights on worker 0-0, policy_version 565737 (0.00085) [2022-07-10 04:24:21,163][26022] Updated weights on worker 0-0, policy_version 565747 (0.00079) [2022-07-10 04:24:22,823][26022] Updated weights on worker 0-0, policy_version 565757 (0.00080) [2022-07-10 04:24:23,477][25689] Fps is (10 sec: 5856.2, 60 sec: 5656.1, 300 sec: 5627.1). Total num frames: 579339264. Throughput: 0: 5927.4. Samples: 579346114. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:23,478][25689] Avg episode reward: [(0, '-28.460')] [2022-07-10 04:24:24,714][26022] Updated weights on worker 0-0, policy_version 565767 (0.00088) [2022-07-10 04:24:26,313][26022] Updated weights on worker 0-0, policy_version 565777 (0.00081) [2022-07-10 04:24:28,420][26022] Updated weights on worker 0-0, policy_version 565787 (0.00095) [2022-07-10 04:24:28,500][25689] Fps is (10 sec: 5562.0, 60 sec: 5627.0, 300 sec: 5627.1). Total num frames: 579365888. Throughput: 0: 5084.3. Samples: 579363090. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:28,501][25689] Avg episode reward: [(0, '-29.444')] [2022-07-10 04:24:29,866][26022] Updated weights on worker 0-0, policy_version 565797 (0.00084) [2022-07-10 04:24:31,995][26022] Updated weights on worker 0-0, policy_version 565807 (0.00091) [2022-07-10 04:24:33,519][25689] Fps is (10 sec: 5608.7, 60 sec: 5642.9, 300 sec: 5627.0). Total num frames: 579395584. Throughput: 0: 5936.6. Samples: 579397436. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:33,519][25689] Avg episode reward: [(0, '-27.966')] [2022-07-10 04:24:33,572][26022] Updated weights on worker 0-0, policy_version 565817 (0.00086) [2022-07-10 04:24:35,519][26022] Updated weights on worker 0-0, policy_version 565827 (0.00092) [2022-07-10 04:24:37,244][26022] Updated weights on worker 0-0, policy_version 565837 (0.00089) [2022-07-10 04:24:38,557][25689] Fps is (10 sec: 5804.3, 60 sec: 5660.3, 300 sec: 5628.4). Total num frames: 579424256. Throughput: 0: 5926.5. Samples: 579431588. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:38,557][25689] Avg episode reward: [(0, '-28.165')] [2022-07-10 04:24:39,235][26022] Updated weights on worker 0-0, policy_version 565847 (0.00089) [2022-07-10 04:24:40,776][26022] Updated weights on worker 0-0, policy_version 565857 (0.00095) [2022-07-10 04:24:42,861][26022] Updated weights on worker 0-0, policy_version 565867 (0.00088) [2022-07-10 04:24:43,637][25689] Fps is (10 sec: 5768.5, 60 sec: 5657.6, 300 sec: 5634.6). Total num frames: 579453952. Throughput: 0: 5088.0. Samples: 579448688. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:43,638][25689] Avg episode reward: [(0, '-26.743')] [2022-07-10 04:24:44,271][26022] Updated weights on worker 0-0, policy_version 565877 (0.00091) [2022-07-10 04:24:46,504][26022] Updated weights on worker 0-0, policy_version 565887 (0.00091) [2022-07-10 04:24:48,073][26022] Updated weights on worker 0-0, policy_version 565897 (0.00063) [2022-07-10 04:24:48,694][25689] Fps is (10 sec: 5555.4, 60 sec: 5626.6, 300 sec: 5630.9). Total num frames: 579480576. Throughput: 0: 5932.5. Samples: 579482890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:48,696][25689] Avg episode reward: [(0, '-26.160')] [2022-07-10 04:24:49,859][26022] Updated weights on worker 0-0, policy_version 565907 (0.00102) [2022-07-10 04:24:51,820][26022] Updated weights on worker 0-0, policy_version 565917 (0.00093) [2022-07-10 04:24:53,312][26022] Updated weights on worker 0-0, policy_version 565927 (0.00092) [2022-07-10 04:24:53,726][25689] Fps is (10 sec: 5582.5, 60 sec: 5644.8, 300 sec: 5630.6). Total num frames: 579510272. Throughput: 0: 5911.7. Samples: 579516894. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:53,726][25689] Avg episode reward: [(0, '-27.071')] [2022-07-10 04:24:55,540][26022] Updated weights on worker 0-0, policy_version 565937 (0.00088) [2022-07-10 04:24:57,137][26022] Updated weights on worker 0-0, policy_version 565947 (0.00087) [2022-07-10 04:24:58,747][25689] Fps is (10 sec: 5704.7, 60 sec: 5650.6, 300 sec: 5635.4). Total num frames: 579537920. Throughput: 0: 5069.1. Samples: 579533932. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:24:58,748][25689] Avg episode reward: [(0, '-26.300')] [2022-07-10 04:24:59,020][26022] Updated weights on worker 0-0, policy_version 565957 (0.00087) [2022-07-10 04:25:00,678][26022] Updated weights on worker 0-0, policy_version 565967 (0.00101) [2022-07-10 04:25:03,069][26022] Updated weights on worker 0-0, policy_version 565977 (0.00088) [2022-07-10 04:25:03,872][25689] Fps is (10 sec: 5449.7, 60 sec: 5643.8, 300 sec: 5630.1). Total num frames: 579565568. Throughput: 0: 5781.4. Samples: 579565676. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:03,874][25689] Avg episode reward: [(0, '-26.662')] [2022-07-10 04:25:04,667][26022] Updated weights on worker 0-0, policy_version 565987 (0.00086) [2022-07-10 04:25:06,647][26022] Updated weights on worker 0-0, policy_version 565997 (0.00089) [2022-07-10 04:25:08,338][26022] Updated weights on worker 0-0, policy_version 566007 (0.00095) [2022-07-10 04:25:08,888][25689] Fps is (10 sec: 5452.7, 60 sec: 5629.0, 300 sec: 5630.2). Total num frames: 579593216. Throughput: 0: 5773.4. Samples: 579599472. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:08,889][25689] Avg episode reward: [(0, '-25.608')] [2022-07-10 04:25:10,336][26022] Updated weights on worker 0-0, policy_version 566017 (0.00095) [2022-07-10 04:25:11,911][26022] Updated weights on worker 0-0, policy_version 566027 (0.00093) [2022-07-10 04:25:13,899][25689] Fps is (10 sec: 5617.0, 60 sec: 5647.2, 300 sec: 5630.9). Total num frames: 579621888. Throughput: 0: 4939.2. Samples: 579616534. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:13,901][25689] Avg episode reward: [(0, '-24.298')] [2022-07-10 04:25:13,903][26022] Updated weights on worker 0-0, policy_version 566037 (0.00090) [2022-07-10 04:25:15,519][26022] Updated weights on worker 0-0, policy_version 566047 (0.00087) [2022-07-10 04:25:17,501][26022] Updated weights on worker 0-0, policy_version 566057 (0.00105) [2022-07-10 04:25:18,928][25689] Fps is (10 sec: 5711.5, 60 sec: 5631.3, 300 sec: 5636.1). Total num frames: 579650560. Throughput: 0: 5780.2. Samples: 579650582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:18,930][25689] Avg episode reward: [(0, '-24.363')] [2022-07-10 04:25:19,265][26022] Updated weights on worker 0-0, policy_version 566067 (0.00092) [2022-07-10 04:25:21,148][26022] Updated weights on worker 0-0, policy_version 566077 (0.00087) [2022-07-10 04:25:23,088][26022] Updated weights on worker 0-0, policy_version 566087 (0.00086) [2022-07-10 04:25:24,084][25689] Fps is (10 sec: 5630.5, 60 sec: 5609.4, 300 sec: 5633.4). Total num frames: 579679232. Throughput: 0: 5867.1. Samples: 579684258. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:24,084][25689] Avg episode reward: [(0, '-23.865')] [2022-07-10 04:25:24,688][26022] Updated weights on worker 0-0, policy_version 566097 (0.00085) [2022-07-10 04:25:26,625][26022] Updated weights on worker 0-0, policy_version 566107 (0.00089) [2022-07-10 04:25:28,332][26022] Updated weights on worker 0-0, policy_version 566117 (0.00101) [2022-07-10 04:25:29,089][25689] Fps is (10 sec: 5643.2, 60 sec: 5644.9, 300 sec: 5633.5). Total num frames: 579707904. Throughput: 0: 5028.8. Samples: 579701062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:29,090][25689] Avg episode reward: [(0, '-24.377')] [2022-07-10 04:25:30,331][26022] Updated weights on worker 0-0, policy_version 566127 (0.00625) [2022-07-10 04:25:31,908][26022] Updated weights on worker 0-0, policy_version 566137 (0.00171) [2022-07-10 04:25:33,755][26022] Updated weights on worker 0-0, policy_version 566147 (0.00085) [2022-07-10 04:25:34,105][25689] Fps is (10 sec: 5722.3, 60 sec: 5628.2, 300 sec: 5633.5). Total num frames: 579736576. Throughput: 0: 5874.3. Samples: 579735228. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:34,105][25689] Avg episode reward: [(0, '-24.437')] [2022-07-10 04:25:35,725][26022] Updated weights on worker 0-0, policy_version 566157 (0.00092) [2022-07-10 04:25:37,531][26022] Updated weights on worker 0-0, policy_version 566167 (0.00088) [2022-07-10 04:25:39,113][25689] Fps is (10 sec: 5721.0, 60 sec: 5631.0, 300 sec: 5630.8). Total num frames: 579765248. Throughput: 0: 5882.6. Samples: 579769322. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:39,113][25689] Avg episode reward: [(0, '-23.805')] [2022-07-10 04:25:39,118][26022] Updated weights on worker 0-0, policy_version 566177 (0.00094) [2022-07-10 04:25:41,118][26022] Updated weights on worker 0-0, policy_version 566187 (0.00090) [2022-07-10 04:25:42,775][26022] Updated weights on worker 0-0, policy_version 566197 (0.00093) [2022-07-10 04:25:44,259][25689] Fps is (10 sec: 5546.2, 60 sec: 5591.1, 300 sec: 5628.4). Total num frames: 579792896. Throughput: 0: 5053.7. Samples: 579786220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:44,260][25689] Avg episode reward: [(0, '-24.152')] [2022-07-10 04:25:44,715][26022] Updated weights on worker 0-0, policy_version 566207 (0.00082) [2022-07-10 04:25:46,473][26022] Updated weights on worker 0-0, policy_version 566217 (0.00087) [2022-07-10 04:25:48,301][26022] Updated weights on worker 0-0, policy_version 566227 (0.00100) [2022-07-10 04:25:49,263][25689] Fps is (10 sec: 5447.5, 60 sec: 5612.9, 300 sec: 5628.6). Total num frames: 579820544. Throughput: 0: 5917.8. Samples: 579820450. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:49,264][25689] Avg episode reward: [(0, '-25.037')] [2022-07-10 04:25:50,120][26022] Updated weights on worker 0-0, policy_version 566237 (0.00092) [2022-07-10 04:25:51,995][26022] Updated weights on worker 0-0, policy_version 566247 (0.00085) [2022-07-10 04:25:53,614][26022] Updated weights on worker 0-0, policy_version 566257 (0.00082) [2022-07-10 04:25:54,309][25689] Fps is (10 sec: 5706.0, 60 sec: 5611.6, 300 sec: 5635.0). Total num frames: 579850240. Throughput: 0: 5894.6. Samples: 579854324. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:54,310][25689] Avg episode reward: [(0, '-25.511')] [2022-07-10 04:25:55,548][26022] Updated weights on worker 0-0, policy_version 566267 (0.00088) [2022-07-10 04:25:57,065][26022] Updated weights on worker 0-0, policy_version 566277 (0.00085) [2022-07-10 04:25:59,313][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:25:59,321][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000566287_579877888.pth [2022-07-10 04:25:59,322][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000564308_577851392.pth [2022-07-10 04:25:59,323][25689] Fps is (10 sec: 5700.6, 60 sec: 5612.2, 300 sec: 5625.4). Total num frames: 579877888. Throughput: 0: 5051.5. Samples: 579871414. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:25:59,323][25689] Avg episode reward: [(0, '-25.383')] [2022-07-10 04:25:59,327][26022] Updated weights on worker 0-0, policy_version 566287 (0.00085) [2022-07-10 04:26:01,064][26022] Updated weights on worker 0-0, policy_version 566297 (0.00085) [2022-07-10 04:26:03,127][26022] Updated weights on worker 0-0, policy_version 566307 (0.00084) [2022-07-10 04:26:04,449][25689] Fps is (10 sec: 5554.4, 60 sec: 5629.1, 300 sec: 5644.3). Total num frames: 579906560. Throughput: 0: 5861.1. Samples: 579904552. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:04,449][25689] Avg episode reward: [(0, '-25.582')] [2022-07-10 04:26:04,953][26022] Updated weights on worker 0-0, policy_version 566317 (0.00078) [2022-07-10 04:26:06,709][26022] Updated weights on worker 0-0, policy_version 566327 (0.00091) [2022-07-10 04:26:08,639][26022] Updated weights on worker 0-0, policy_version 566337 (0.00083) [2022-07-10 04:26:09,472][25689] Fps is (10 sec: 5751.1, 60 sec: 5662.2, 300 sec: 5641.0). Total num frames: 579936256. Throughput: 0: 5797.3. Samples: 579937602. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:09,474][25689] Avg episode reward: [(0, '-26.072')] [2022-07-10 04:26:10,496][26022] Updated weights on worker 0-0, policy_version 566347 (0.00105) [2022-07-10 04:26:12,163][26022] Updated weights on worker 0-0, policy_version 566357 (0.00086) [2022-07-10 04:26:13,893][26022] Updated weights on worker 0-0, policy_version 566367 (0.00082) [2022-07-10 04:26:14,528][25689] Fps is (10 sec: 5486.1, 60 sec: 5607.3, 300 sec: 5626.6). Total num frames: 579961856. Throughput: 0: 5807.7. Samples: 579971750. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:14,529][25689] Avg episode reward: [(0, '-25.380')] [2022-07-10 04:26:15,639][26022] Updated weights on worker 0-0, policy_version 566377 (0.00091) [2022-07-10 04:26:17,570][26022] Updated weights on worker 0-0, policy_version 566387 (0.00097) [2022-07-10 04:26:19,268][26022] Updated weights on worker 0-0, policy_version 566397 (0.00088) [2022-07-10 04:26:19,531][25689] Fps is (10 sec: 5395.3, 60 sec: 5609.7, 300 sec: 5631.7). Total num frames: 579990528. Throughput: 0: 5802.9. Samples: 579988680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:19,531][25689] Avg episode reward: [(0, '-24.698')] [2022-07-10 04:26:20,911][26022] Updated weights on worker 0-0, policy_version 566407 (0.00089) [2022-07-10 04:26:22,887][26022] Updated weights on worker 0-0, policy_version 566417 (0.00083) [2022-07-10 04:26:24,520][26022] Updated weights on worker 0-0, policy_version 566427 (0.00091) [2022-07-10 04:26:24,605][25689] Fps is (10 sec: 5893.7, 60 sec: 5651.1, 300 sec: 5641.5). Total num frames: 580021248. Throughput: 0: 5868.9. Samples: 580022848. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:24,606][25689] Avg episode reward: [(0, '-24.061')] [2022-07-10 04:26:26,607][26022] Updated weights on worker 0-0, policy_version 566437 (0.00088) [2022-07-10 04:26:28,262][26022] Updated weights on worker 0-0, policy_version 566447 (0.00085) [2022-07-10 04:26:29,624][25689] Fps is (10 sec: 5782.5, 60 sec: 5632.9, 300 sec: 5631.1). Total num frames: 580048896. Throughput: 0: 5920.4. Samples: 580056916. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:29,626][25689] Avg episode reward: [(0, '-24.265')] [2022-07-10 04:26:30,241][26022] Updated weights on worker 0-0, policy_version 566457 (0.00089) [2022-07-10 04:26:31,999][26022] Updated weights on worker 0-0, policy_version 566467 (0.00084) [2022-07-10 04:26:33,701][26022] Updated weights on worker 0-0, policy_version 566477 (0.00088) [2022-07-10 04:26:34,699][25689] Fps is (10 sec: 5478.4, 60 sec: 5610.6, 300 sec: 5633.3). Total num frames: 580076544. Throughput: 0: 5068.6. Samples: 580073988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:34,701][25689] Avg episode reward: [(0, '-24.359')] [2022-07-10 04:26:35,610][26022] Updated weights on worker 0-0, policy_version 566487 (0.00093) [2022-07-10 04:26:37,322][26022] Updated weights on worker 0-0, policy_version 566497 (0.00091) [2022-07-10 04:26:39,165][26022] Updated weights on worker 0-0, policy_version 566507 (0.00086) [2022-07-10 04:26:39,781][25689] Fps is (10 sec: 5545.3, 60 sec: 5603.7, 300 sec: 5633.0). Total num frames: 580105216. Throughput: 0: 5899.1. Samples: 580108138. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:39,781][25689] Avg episode reward: [(0, '-23.826')] [2022-07-10 04:26:40,896][26022] Updated weights on worker 0-0, policy_version 566517 (0.00084) [2022-07-10 04:26:42,833][26022] Updated weights on worker 0-0, policy_version 566527 (0.00091) [2022-07-10 04:26:44,524][26022] Updated weights on worker 0-0, policy_version 566537 (0.00091) [2022-07-10 04:26:44,851][25689] Fps is (10 sec: 5850.0, 60 sec: 5661.5, 300 sec: 5639.5). Total num frames: 580135936. Throughput: 0: 5896.9. Samples: 580142236. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:44,852][25689] Avg episode reward: [(0, '-24.157')] [2022-07-10 04:26:46,491][26022] Updated weights on worker 0-0, policy_version 566547 (0.00093) [2022-07-10 04:26:48,140][26022] Updated weights on worker 0-0, policy_version 566557 (0.00085) [2022-07-10 04:26:49,896][25689] Fps is (10 sec: 5770.6, 60 sec: 5657.7, 300 sec: 5632.0). Total num frames: 580163584. Throughput: 0: 5055.8. Samples: 580159402. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 04:26:49,896][25689] Avg episode reward: [(0, '-24.181')] [2022-07-10 04:26:50,072][26022] Updated weights on worker 0-0, policy_version 566567 (0.00090) [2022-07-10 04:26:51,731][26022] Updated weights on worker 0-0, policy_version 566577 (0.00091) [2022-07-10 04:26:53,540][26022] Updated weights on worker 0-0, policy_version 566587 (0.00086) [2022-07-10 04:26:54,898][25689] Fps is (10 sec: 5605.9, 60 sec: 5644.9, 300 sec: 5635.6). Total num frames: 580192256. Throughput: 0: 5919.1. Samples: 580193548. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:26:54,899][25689] Avg episode reward: [(0, '-24.389')] [2022-07-10 04:26:55,326][26022] Updated weights on worker 0-0, policy_version 566597 (0.00089) [2022-07-10 04:26:57,227][26022] Updated weights on worker 0-0, policy_version 566607 (0.00083) [2022-07-10 04:26:58,935][26022] Updated weights on worker 0-0, policy_version 566617 (0.00093) [2022-07-10 04:26:59,938][25689] Fps is (10 sec: 5710.4, 60 sec: 5659.3, 300 sec: 5639.7). Total num frames: 580220928. Throughput: 0: 5943.3. Samples: 580227936. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:26:59,938][25689] Avg episode reward: [(0, '-23.772')] [2022-07-10 04:27:00,759][26022] Updated weights on worker 0-0, policy_version 566627 (0.00082) [2022-07-10 04:27:02,860][26022] Updated weights on worker 0-0, policy_version 566637 (0.00085) [2022-07-10 04:27:04,735][26022] Updated weights on worker 0-0, policy_version 566647 (0.00098) [2022-07-10 04:27:04,987][25689] Fps is (10 sec: 5480.9, 60 sec: 5632.7, 300 sec: 5632.5). Total num frames: 580247552. Throughput: 0: 5012.1. Samples: 580243158. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:04,987][25689] Avg episode reward: [(0, '-24.385')] [2022-07-10 04:27:06,551][26022] Updated weights on worker 0-0, policy_version 566657 (0.00096) [2022-07-10 04:27:08,261][26022] Updated weights on worker 0-0, policy_version 566667 (0.00086) [2022-07-10 04:27:09,995][25689] Fps is (10 sec: 5396.2, 60 sec: 5600.2, 300 sec: 5636.0). Total num frames: 580275200. Throughput: 0: 5830.2. Samples: 580276586. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:09,996][25689] Avg episode reward: [(0, '-25.393')] [2022-07-10 04:27:10,327][26022] Updated weights on worker 0-0, policy_version 566677 (0.00094) [2022-07-10 04:27:11,796][26022] Updated weights on worker 0-0, policy_version 566687 (0.00090) [2022-07-10 04:27:13,955][26022] Updated weights on worker 0-0, policy_version 566697 (0.00096) [2022-07-10 04:27:15,027][25689] Fps is (10 sec: 5813.7, 60 sec: 5687.2, 300 sec: 5642.3). Total num frames: 580305920. Throughput: 0: 5816.7. Samples: 580310630. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:15,027][25689] Avg episode reward: [(0, '-26.023')] [2022-07-10 04:27:15,553][26022] Updated weights on worker 0-0, policy_version 566707 (0.00087) [2022-07-10 04:27:17,451][26022] Updated weights on worker 0-0, policy_version 566717 (0.00089) [2022-07-10 04:27:19,291][26022] Updated weights on worker 0-0, policy_version 566727 (0.00085) [2022-07-10 04:27:20,055][25689] Fps is (10 sec: 5700.0, 60 sec: 5650.8, 300 sec: 5630.9). Total num frames: 580332544. Throughput: 0: 4956.2. Samples: 580327644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:20,056][25689] Avg episode reward: [(0, '-25.736')] [2022-07-10 04:27:21,015][26022] Updated weights on worker 0-0, policy_version 566737 (0.00096) [2022-07-10 04:27:23,002][26022] Updated weights on worker 0-0, policy_version 566747 (0.00093) [2022-07-10 04:27:24,556][26022] Updated weights on worker 0-0, policy_version 566757 (0.00092) [2022-07-10 04:27:25,117][25689] Fps is (10 sec: 5581.3, 60 sec: 5635.1, 300 sec: 5640.1). Total num frames: 580362240. Throughput: 0: 5881.5. Samples: 580361556. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:25,118][25689] Avg episode reward: [(0, '-25.887')] [2022-07-10 04:27:26,491][26022] Updated weights on worker 0-0, policy_version 566767 (0.00083) [2022-07-10 04:27:28,377][26022] Updated weights on worker 0-0, policy_version 566777 (0.00092) [2022-07-10 04:27:30,084][26022] Updated weights on worker 0-0, policy_version 566787 (0.00084) [2022-07-10 04:27:30,177][25689] Fps is (10 sec: 5665.4, 60 sec: 5631.3, 300 sec: 5636.0). Total num frames: 580389888. Throughput: 0: 5897.4. Samples: 580395608. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:30,178][25689] Avg episode reward: [(0, '-25.810')] [2022-07-10 04:27:32,009][26022] Updated weights on worker 0-0, policy_version 566797 (0.00085) [2022-07-10 04:27:33,577][26022] Updated weights on worker 0-0, policy_version 566807 (0.00093) [2022-07-10 04:27:35,204][25689] Fps is (10 sec: 5583.5, 60 sec: 5652.7, 300 sec: 5632.2). Total num frames: 580418560. Throughput: 0: 5068.4. Samples: 580412898. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:35,205][25689] Avg episode reward: [(0, '-25.886')] [2022-07-10 04:27:35,502][26022] Updated weights on worker 0-0, policy_version 566817 (0.00083) [2022-07-10 04:27:37,175][26022] Updated weights on worker 0-0, policy_version 566827 (0.00090) [2022-07-10 04:27:38,915][26022] Updated weights on worker 0-0, policy_version 566837 (0.00090) [2022-07-10 04:27:40,256][25689] Fps is (10 sec: 5791.2, 60 sec: 5672.4, 300 sec: 5636.7). Total num frames: 580448256. Throughput: 0: 5906.2. Samples: 580446952. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:40,256][25689] Avg episode reward: [(0, '-25.560')] [2022-07-10 04:27:41,065][26022] Updated weights on worker 0-0, policy_version 566847 (0.00083) [2022-07-10 04:27:42,869][26022] Updated weights on worker 0-0, policy_version 566857 (0.00081) [2022-07-10 04:27:44,517][26022] Updated weights on worker 0-0, policy_version 566867 (0.00091) [2022-07-10 04:27:45,353][25689] Fps is (10 sec: 5650.3, 60 sec: 5619.1, 300 sec: 5628.5). Total num frames: 580475904. Throughput: 0: 5902.8. Samples: 580481004. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:45,354][25689] Avg episode reward: [(0, '-25.610')] [2022-07-10 04:27:46,416][26022] Updated weights on worker 0-0, policy_version 566877 (0.00092) [2022-07-10 04:27:48,142][26022] Updated weights on worker 0-0, policy_version 566887 (0.00089) [2022-07-10 04:27:50,088][26022] Updated weights on worker 0-0, policy_version 566897 (0.00086) [2022-07-10 04:27:50,379][25689] Fps is (10 sec: 5563.4, 60 sec: 5637.8, 300 sec: 5635.2). Total num frames: 580504576. Throughput: 0: 5075.9. Samples: 580498152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:50,380][25689] Avg episode reward: [(0, '-25.054')] [2022-07-10 04:27:51,675][26022] Updated weights on worker 0-0, policy_version 566907 (0.00088) [2022-07-10 04:27:53,692][26022] Updated weights on worker 0-0, policy_version 566917 (0.00079) [2022-07-10 04:27:55,283][26022] Updated weights on worker 0-0, policy_version 566927 (0.00093) [2022-07-10 04:27:55,384][25689] Fps is (10 sec: 5717.0, 60 sec: 5637.6, 300 sec: 5628.8). Total num frames: 580533248. Throughput: 0: 5897.0. Samples: 580531896. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:27:55,384][25689] Avg episode reward: [(0, '-25.098')] [2022-07-10 04:27:57,404][26022] Updated weights on worker 0-0, policy_version 566937 (0.00081) [2022-07-10 04:27:58,990][26022] Updated weights on worker 0-0, policy_version 566947 (0.00085) [2022-07-10 04:27:59,414][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:27:59,424][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000566949_580555776.pth [2022-07-10 04:27:59,424][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000564966_578525184.pth [2022-07-10 04:28:00,411][25689] Fps is (10 sec: 5614.3, 60 sec: 5621.8, 300 sec: 5640.8). Total num frames: 580560896. Throughput: 0: 5914.8. Samples: 580566164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:00,413][25689] Avg episode reward: [(0, '-25.082')] [2022-07-10 04:28:00,854][26022] Updated weights on worker 0-0, policy_version 566957 (0.00089) [2022-07-10 04:28:02,891][26022] Updated weights on worker 0-0, policy_version 566967 (0.00085) [2022-07-10 04:28:04,831][26022] Updated weights on worker 0-0, policy_version 566977 (0.00092) [2022-07-10 04:28:05,542][25689] Fps is (10 sec: 5342.7, 60 sec: 5614.2, 300 sec: 5631.8). Total num frames: 580587520. Throughput: 0: 4956.5. Samples: 580581068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:05,542][25689] Avg episode reward: [(0, '-24.711')] [2022-07-10 04:28:06,530][26022] Updated weights on worker 0-0, policy_version 566987 (0.00097) [2022-07-10 04:28:08,587][26022] Updated weights on worker 0-0, policy_version 566997 (0.00088) [2022-07-10 04:28:10,191][26022] Updated weights on worker 0-0, policy_version 567007 (0.00095) [2022-07-10 04:28:10,619][25689] Fps is (10 sec: 5517.1, 60 sec: 5641.6, 300 sec: 5630.6). Total num frames: 580617216. Throughput: 0: 5782.6. Samples: 580615190. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:10,620][25689] Avg episode reward: [(0, '-24.477')] [2022-07-10 04:28:11,988][26022] Updated weights on worker 0-0, policy_version 567017 (0.00092) [2022-07-10 04:28:13,820][26022] Updated weights on worker 0-0, policy_version 567027 (0.00086) [2022-07-10 04:28:15,488][26022] Updated weights on worker 0-0, policy_version 567037 (0.00093) [2022-07-10 04:28:15,633][25689] Fps is (10 sec: 5784.0, 60 sec: 5609.4, 300 sec: 5634.0). Total num frames: 580645888. Throughput: 0: 5818.8. Samples: 580649724. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:15,634][25689] Avg episode reward: [(0, '-24.382')] [2022-07-10 04:28:17,233][26022] Updated weights on worker 0-0, policy_version 567047 (0.00086) [2022-07-10 04:28:19,205][26022] Updated weights on worker 0-0, policy_version 567057 (0.00085) [2022-07-10 04:28:20,652][25689] Fps is (10 sec: 5715.6, 60 sec: 5644.1, 300 sec: 5635.3). Total num frames: 580674560. Throughput: 0: 5821.1. Samples: 580683990. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:20,652][25689] Avg episode reward: [(0, '-25.268')] [2022-07-10 04:28:20,851][26022] Updated weights on worker 0-0, policy_version 567067 (0.00081) [2022-07-10 04:28:22,916][26022] Updated weights on worker 0-0, policy_version 567077 (0.00090) [2022-07-10 04:28:24,426][26022] Updated weights on worker 0-0, policy_version 567087 (0.00791) [2022-07-10 04:28:25,703][25689] Fps is (10 sec: 5593.1, 60 sec: 5611.4, 300 sec: 5632.3). Total num frames: 580702208. Throughput: 0: 5939.1. Samples: 580700806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:25,703][25689] Avg episode reward: [(0, '-25.981')] [2022-07-10 04:28:26,376][26022] Updated weights on worker 0-0, policy_version 567097 (0.00441) [2022-07-10 04:28:28,134][26022] Updated weights on worker 0-0, policy_version 567107 (0.00078) [2022-07-10 04:28:29,907][26022] Updated weights on worker 0-0, policy_version 567117 (0.00091) [2022-07-10 04:28:30,714][25689] Fps is (10 sec: 5699.0, 60 sec: 5649.7, 300 sec: 5635.7). Total num frames: 580731904. Throughput: 0: 5964.2. Samples: 580735042. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:30,715][25689] Avg episode reward: [(0, '-26.082')] [2022-07-10 04:28:31,814][26022] Updated weights on worker 0-0, policy_version 567127 (0.00088) [2022-07-10 04:28:33,450][26022] Updated weights on worker 0-0, policy_version 567137 (0.00096) [2022-07-10 04:28:35,335][26022] Updated weights on worker 0-0, policy_version 567147 (0.00093) [2022-07-10 04:28:35,727][25689] Fps is (10 sec: 5925.0, 60 sec: 5668.0, 300 sec: 5643.1). Total num frames: 580761600. Throughput: 0: 5936.6. Samples: 580769012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:35,727][25689] Avg episode reward: [(0, '-25.572')] [2022-07-10 04:28:37,323][26022] Updated weights on worker 0-0, policy_version 567157 (0.00094) [2022-07-10 04:28:38,999][26022] Updated weights on worker 0-0, policy_version 567167 (0.00091) [2022-07-10 04:28:40,720][26022] Updated weights on worker 0-0, policy_version 567177 (0.00093) [2022-07-10 04:28:40,735][25689] Fps is (10 sec: 5722.7, 60 sec: 5638.2, 300 sec: 5637.1). Total num frames: 580789248. Throughput: 0: 5087.2. Samples: 580786154. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:40,735][25689] Avg episode reward: [(0, '-24.676')] [2022-07-10 04:28:42,840][26022] Updated weights on worker 0-0, policy_version 567187 (0.00092) [2022-07-10 04:28:44,292][26022] Updated weights on worker 0-0, policy_version 567197 (0.00085) [2022-07-10 04:28:45,838][25689] Fps is (10 sec: 5367.3, 60 sec: 5620.7, 300 sec: 5629.9). Total num frames: 580815872. Throughput: 0: 5899.1. Samples: 580819588. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:45,839][25689] Avg episode reward: [(0, '-24.888')] [2022-07-10 04:28:46,319][26022] Updated weights on worker 0-0, policy_version 567207 (0.00088) [2022-07-10 04:28:47,908][26022] Updated weights on worker 0-0, policy_version 567217 (0.00089) [2022-07-10 04:28:49,843][26022] Updated weights on worker 0-0, policy_version 567227 (0.00084) [2022-07-10 04:28:50,859][25689] Fps is (10 sec: 5562.6, 60 sec: 5638.1, 300 sec: 5633.8). Total num frames: 580845568. Throughput: 0: 5896.2. Samples: 580853822. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:50,860][25689] Avg episode reward: [(0, '-24.509')] [2022-07-10 04:28:51,763][26022] Updated weights on worker 0-0, policy_version 567237 (0.00087) [2022-07-10 04:28:53,584][26022] Updated weights on worker 0-0, policy_version 567247 (0.00089) [2022-07-10 04:28:55,324][26022] Updated weights on worker 0-0, policy_version 567257 (0.00089) [2022-07-10 04:28:55,937][25689] Fps is (10 sec: 5779.5, 60 sec: 5631.2, 300 sec: 5637.3). Total num frames: 580874240. Throughput: 0: 5017.0. Samples: 580870408. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:28:55,938][25689] Avg episode reward: [(0, '-24.935')] [2022-07-10 04:28:57,190][26022] Updated weights on worker 0-0, policy_version 567267 (0.00099) [2022-07-10 04:28:58,786][26022] Updated weights on worker 0-0, policy_version 567277 (0.00087) [2022-07-10 04:29:00,941][25689] Fps is (10 sec: 5485.1, 60 sec: 5616.5, 300 sec: 5634.8). Total num frames: 580900864. Throughput: 0: 5855.9. Samples: 580904478. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:00,941][25689] Avg episode reward: [(0, '-24.438')] [2022-07-10 04:29:01,043][26022] Updated weights on worker 0-0, policy_version 567287 (0.00083) [2022-07-10 04:29:02,652][26022] Updated weights on worker 0-0, policy_version 567297 (0.00090) [2022-07-10 04:29:04,826][26022] Updated weights on worker 0-0, policy_version 567307 (0.00089) [2022-07-10 04:29:06,015][25689] Fps is (10 sec: 5487.2, 60 sec: 5655.7, 300 sec: 5634.1). Total num frames: 580929536. Throughput: 0: 5792.7. Samples: 580936464. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:06,015][25689] Avg episode reward: [(0, '-24.700')] [2022-07-10 04:29:06,566][26022] Updated weights on worker 0-0, policy_version 567317 (0.00088) [2022-07-10 04:29:08,445][26022] Updated weights on worker 0-0, policy_version 567327 (0.00111) [2022-07-10 04:29:10,246][26022] Updated weights on worker 0-0, policy_version 567337 (0.00091) [2022-07-10 04:29:11,096][25689] Fps is (10 sec: 5546.0, 60 sec: 5621.5, 300 sec: 5633.1). Total num frames: 580957184. Throughput: 0: 4927.4. Samples: 580953534. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:11,097][25689] Avg episode reward: [(0, '-25.263')] [2022-07-10 04:29:11,967][26022] Updated weights on worker 0-0, policy_version 567347 (0.00086) [2022-07-10 04:29:13,628][26022] Updated weights on worker 0-0, policy_version 567357 (0.00087) [2022-07-10 04:29:15,718][26022] Updated weights on worker 0-0, policy_version 567367 (0.00087) [2022-07-10 04:29:16,158][25689] Fps is (10 sec: 5653.1, 60 sec: 5633.9, 300 sec: 5632.7). Total num frames: 580986880. Throughput: 0: 5811.7. Samples: 580987926. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:16,159][25689] Avg episode reward: [(0, '-25.600')] [2022-07-10 04:29:17,163][26022] Updated weights on worker 0-0, policy_version 567377 (0.00082) [2022-07-10 04:29:19,155][26022] Updated weights on worker 0-0, policy_version 567387 (0.00090) [2022-07-10 04:29:20,871][26022] Updated weights on worker 0-0, policy_version 567397 (0.00087) [2022-07-10 04:29:21,194][25689] Fps is (10 sec: 5678.5, 60 sec: 5615.4, 300 sec: 5627.0). Total num frames: 581014528. Throughput: 0: 5832.8. Samples: 581022612. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:21,194][25689] Avg episode reward: [(0, '-26.335')] [2022-07-10 04:29:22,563][26022] Updated weights on worker 0-0, policy_version 567407 (0.00090) [2022-07-10 04:29:24,621][26022] Updated weights on worker 0-0, policy_version 567417 (0.00085) [2022-07-10 04:29:26,249][25689] Fps is (10 sec: 5784.2, 60 sec: 5665.7, 300 sec: 5640.2). Total num frames: 581045248. Throughput: 0: 5086.8. Samples: 581039394. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:26,250][25689] Avg episode reward: [(0, '-26.166')] [2022-07-10 04:29:26,251][26022] Updated weights on worker 0-0, policy_version 567427 (0.00089) [2022-07-10 04:29:28,138][26022] Updated weights on worker 0-0, policy_version 567437 (0.00585) [2022-07-10 04:29:30,021][26022] Updated weights on worker 0-0, policy_version 567447 (0.00086) [2022-07-10 04:29:31,324][25689] Fps is (10 sec: 5862.7, 60 sec: 5642.9, 300 sec: 5635.7). Total num frames: 581073920. Throughput: 0: 5933.3. Samples: 581073558. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:31,325][25689] Avg episode reward: [(0, '-25.911')] [2022-07-10 04:29:31,707][26022] Updated weights on worker 0-0, policy_version 567457 (0.00084) [2022-07-10 04:29:33,604][26022] Updated weights on worker 0-0, policy_version 567467 (0.00087) [2022-07-10 04:29:35,422][26022] Updated weights on worker 0-0, policy_version 567477 (0.00091) [2022-07-10 04:29:36,347][25689] Fps is (10 sec: 5577.8, 60 sec: 5608.2, 300 sec: 5632.5). Total num frames: 581101568. Throughput: 0: 5938.7. Samples: 581107818. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:36,347][25689] Avg episode reward: [(0, '-26.025')] [2022-07-10 04:29:37,139][26022] Updated weights on worker 0-0, policy_version 567487 (0.00088) [2022-07-10 04:29:38,962][26022] Updated weights on worker 0-0, policy_version 567497 (0.00083) [2022-07-10 04:29:40,753][26022] Updated weights on worker 0-0, policy_version 567507 (0.00083) [2022-07-10 04:29:41,359][25689] Fps is (10 sec: 5612.7, 60 sec: 5624.7, 300 sec: 5630.4). Total num frames: 581130240. Throughput: 0: 5075.0. Samples: 581124948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:41,359][25689] Avg episode reward: [(0, '-25.712')] [2022-07-10 04:29:42,476][26022] Updated weights on worker 0-0, policy_version 567517 (0.00090) [2022-07-10 04:29:44,459][26022] Updated weights on worker 0-0, policy_version 567527 (0.00088) [2022-07-10 04:29:46,032][26022] Updated weights on worker 0-0, policy_version 567537 (0.00081) [2022-07-10 04:29:46,425][25689] Fps is (10 sec: 5791.3, 60 sec: 5678.8, 300 sec: 5640.5). Total num frames: 581159936. Throughput: 0: 5947.2. Samples: 581159384. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:46,426][25689] Avg episode reward: [(0, '-24.204')] [2022-07-10 04:29:47,973][26022] Updated weights on worker 0-0, policy_version 567547 (0.00621) [2022-07-10 04:29:49,609][26022] Updated weights on worker 0-0, policy_version 567557 (0.00092) [2022-07-10 04:29:51,464][25689] Fps is (10 sec: 5776.0, 60 sec: 5660.3, 300 sec: 5637.0). Total num frames: 581188608. Throughput: 0: 5975.6. Samples: 581193906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 04:29:51,465][25689] Avg episode reward: [(0, '-24.837')] [2022-07-10 04:29:51,467][26022] Updated weights on worker 0-0, policy_version 567567 (0.00087) [2022-07-10 04:29:53,276][26022] Updated weights on worker 0-0, policy_version 567577 (0.00083) [2022-07-10 04:29:55,107][26022] Updated weights on worker 0-0, policy_version 567587 (0.00086) [2022-07-10 04:29:56,466][25689] Fps is (10 sec: 5711.2, 60 sec: 5667.4, 300 sec: 5640.7). Total num frames: 581217280. Throughput: 0: 5123.7. Samples: 581210904. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:29:56,466][25689] Avg episode reward: [(0, '-24.978')] [2022-07-10 04:29:57,043][26022] Updated weights on worker 0-0, policy_version 567597 (0.00095) [2022-07-10 04:29:58,655][26022] Updated weights on worker 0-0, policy_version 567607 (0.00093) [2022-07-10 04:29:59,537][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:29:59,553][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000567612_581234688.pth [2022-07-10 04:29:59,553][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000565628_579203072.pth [2022-07-10 04:30:00,516][26022] Updated weights on worker 0-0, policy_version 567617 (0.00083) [2022-07-10 04:30:01,538][25689] Fps is (10 sec: 5590.6, 60 sec: 5677.9, 300 sec: 5641.8). Total num frames: 581244928. Throughput: 0: 5949.7. Samples: 581245008. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:01,539][25689] Avg episode reward: [(0, '-24.547')] [2022-07-10 04:30:02,672][26022] Updated weights on worker 0-0, policy_version 567627 (0.00086) [2022-07-10 04:30:04,464][26022] Updated weights on worker 0-0, policy_version 567637 (0.00084) [2022-07-10 04:30:06,251][26022] Updated weights on worker 0-0, policy_version 567647 (0.00098) [2022-07-10 04:30:06,679][25689] Fps is (10 sec: 5514.6, 60 sec: 5671.6, 300 sec: 5642.8). Total num frames: 581273600. Throughput: 0: 5796.2. Samples: 581276778. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:06,679][25689] Avg episode reward: [(0, '-23.777')] [2022-07-10 04:30:08,297][26022] Updated weights on worker 0-0, policy_version 567657 (0.00092) [2022-07-10 04:30:09,843][26022] Updated weights on worker 0-0, policy_version 567667 (0.00094) [2022-07-10 04:30:11,703][25689] Fps is (10 sec: 5440.0, 60 sec: 5660.0, 300 sec: 5635.7). Total num frames: 581300224. Throughput: 0: 4934.9. Samples: 581293782. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:11,703][25689] Avg episode reward: [(0, '-24.308')] [2022-07-10 04:30:11,931][26022] Updated weights on worker 0-0, policy_version 567677 (0.00095) [2022-07-10 04:30:13,276][26022] Updated weights on worker 0-0, policy_version 567687 (0.00106) [2022-07-10 04:30:15,490][26022] Updated weights on worker 0-0, policy_version 567697 (0.00093) [2022-07-10 04:30:16,710][25689] Fps is (10 sec: 5614.8, 60 sec: 5665.3, 300 sec: 5639.6). Total num frames: 581329920. Throughput: 0: 5773.2. Samples: 581327774. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:16,710][25689] Avg episode reward: [(0, '-24.562')] [2022-07-10 04:30:17,016][26022] Updated weights on worker 0-0, policy_version 567707 (0.00091) [2022-07-10 04:30:19,030][26022] Updated weights on worker 0-0, policy_version 567717 (0.00092) [2022-07-10 04:30:20,654][26022] Updated weights on worker 0-0, policy_version 567727 (0.00088) [2022-07-10 04:30:21,758][25689] Fps is (10 sec: 5601.3, 60 sec: 5647.2, 300 sec: 5634.7). Total num frames: 581356544. Throughput: 0: 5801.1. Samples: 581362304. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:21,760][25689] Avg episode reward: [(0, '-24.201')] [2022-07-10 04:30:22,594][26022] Updated weights on worker 0-0, policy_version 567737 (0.00080) [2022-07-10 04:30:24,150][26022] Updated weights on worker 0-0, policy_version 567747 (0.00096) [2022-07-10 04:30:26,114][26022] Updated weights on worker 0-0, policy_version 567757 (0.00096) [2022-07-10 04:30:26,842][25689] Fps is (10 sec: 5659.7, 60 sec: 5644.6, 300 sec: 5640.1). Total num frames: 581387264. Throughput: 0: 5942.8. Samples: 581396602. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:26,842][25689] Avg episode reward: [(0, '-24.263')] [2022-07-10 04:30:27,854][26022] Updated weights on worker 0-0, policy_version 567767 (0.00091) [2022-07-10 04:30:29,723][26022] Updated weights on worker 0-0, policy_version 567777 (0.00088) [2022-07-10 04:30:31,531][26022] Updated weights on worker 0-0, policy_version 567787 (0.00049) [2022-07-10 04:30:31,877][25689] Fps is (10 sec: 5768.1, 60 sec: 5631.3, 300 sec: 5636.3). Total num frames: 581414912. Throughput: 0: 5943.6. Samples: 581413688. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:31,880][25689] Avg episode reward: [(0, '-24.472')] [2022-07-10 04:30:33,305][26022] Updated weights on worker 0-0, policy_version 567797 (0.00084) [2022-07-10 04:30:35,231][26022] Updated weights on worker 0-0, policy_version 567807 (0.00088) [2022-07-10 04:30:36,877][26022] Updated weights on worker 0-0, policy_version 567817 (0.00085) [2022-07-10 04:30:36,964][25689] Fps is (10 sec: 5766.4, 60 sec: 5676.0, 300 sec: 5641.7). Total num frames: 581445632. Throughput: 0: 5946.4. Samples: 581448214. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:36,964][25689] Avg episode reward: [(0, '-24.273')] [2022-07-10 04:30:38,630][26022] Updated weights on worker 0-0, policy_version 567827 (0.00086) [2022-07-10 04:30:40,516][26022] Updated weights on worker 0-0, policy_version 567837 (0.00085) [2022-07-10 04:30:41,977][25689] Fps is (10 sec: 5779.1, 60 sec: 5659.1, 300 sec: 5644.2). Total num frames: 581473280. Throughput: 0: 5943.0. Samples: 581482464. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:41,977][25689] Avg episode reward: [(0, '-24.288')] [2022-07-10 04:30:42,276][26022] Updated weights on worker 0-0, policy_version 567847 (0.00085) [2022-07-10 04:30:44,115][26022] Updated weights on worker 0-0, policy_version 567857 (0.00081) [2022-07-10 04:30:45,770][26022] Updated weights on worker 0-0, policy_version 567867 (0.00087) [2022-07-10 04:30:47,039][25689] Fps is (10 sec: 5590.0, 60 sec: 5642.6, 300 sec: 5646.6). Total num frames: 581501952. Throughput: 0: 5110.0. Samples: 581499810. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:47,039][25689] Avg episode reward: [(0, '-24.540')] [2022-07-10 04:30:47,666][26022] Updated weights on worker 0-0, policy_version 567877 (0.00083) [2022-07-10 04:30:49,351][26022] Updated weights on worker 0-0, policy_version 567887 (0.00093) [2022-07-10 04:30:51,263][26022] Updated weights on worker 0-0, policy_version 567897 (0.00103) [2022-07-10 04:30:52,052][25689] Fps is (10 sec: 5691.6, 60 sec: 5645.0, 300 sec: 5643.7). Total num frames: 581530624. Throughput: 0: 5974.0. Samples: 581534212. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:52,052][25689] Avg episode reward: [(0, '-24.996')] [2022-07-10 04:30:52,910][26022] Updated weights on worker 0-0, policy_version 567907 (0.00091) [2022-07-10 04:30:54,841][26022] Updated weights on worker 0-0, policy_version 567917 (0.00088) [2022-07-10 04:30:56,600][26022] Updated weights on worker 0-0, policy_version 567927 (0.00104) [2022-07-10 04:30:57,105][25689] Fps is (10 sec: 5696.8, 60 sec: 5640.2, 300 sec: 5646.4). Total num frames: 581559296. Throughput: 0: 5954.7. Samples: 581568150. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:30:57,105][25689] Avg episode reward: [(0, '-26.221')] [2022-07-10 04:30:58,575][26022] Updated weights on worker 0-0, policy_version 567937 (0.00091) [2022-07-10 04:31:00,065][26022] Updated weights on worker 0-0, policy_version 567947 (0.00087) [2022-07-10 04:31:02,128][25689] Fps is (10 sec: 5487.9, 60 sec: 5627.9, 300 sec: 5641.5). Total num frames: 581585920. Throughput: 0: 5098.8. Samples: 581585210. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:02,128][25689] Avg episode reward: [(0, '-26.060')] [2022-07-10 04:31:02,389][26022] Updated weights on worker 0-0, policy_version 567957 (0.00088) [2022-07-10 04:31:04,288][26022] Updated weights on worker 0-0, policy_version 567967 (0.00091) [2022-07-10 04:31:06,129][26022] Updated weights on worker 0-0, policy_version 567977 (0.00095) [2022-07-10 04:31:07,233][25689] Fps is (10 sec: 5459.6, 60 sec: 5631.2, 300 sec: 5636.5). Total num frames: 581614592. Throughput: 0: 5815.4. Samples: 581617248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:07,234][25689] Avg episode reward: [(0, '-26.773')] [2022-07-10 04:31:07,847][26022] Updated weights on worker 0-0, policy_version 567987 (0.00583) [2022-07-10 04:31:09,693][26022] Updated weights on worker 0-0, policy_version 567997 (0.00083) [2022-07-10 04:31:11,381][26022] Updated weights on worker 0-0, policy_version 568007 (0.00086) [2022-07-10 04:31:12,274][25689] Fps is (10 sec: 5752.7, 60 sec: 5680.3, 300 sec: 5650.6). Total num frames: 581644288. Throughput: 0: 5812.5. Samples: 581651754. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:12,274][25689] Avg episode reward: [(0, '-27.332')] [2022-07-10 04:31:13,199][26022] Updated weights on worker 0-0, policy_version 568017 (0.00086) [2022-07-10 04:31:15,011][26022] Updated weights on worker 0-0, policy_version 568027 (0.00083) [2022-07-10 04:31:16,709][26022] Updated weights on worker 0-0, policy_version 568037 (0.00090) [2022-07-10 04:31:17,290][25689] Fps is (10 sec: 5803.8, 60 sec: 5662.5, 300 sec: 5650.3). Total num frames: 581672960. Throughput: 0: 5000.6. Samples: 581669088. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:17,291][25689] Avg episode reward: [(0, '-26.464')] [2022-07-10 04:31:18,624][26022] Updated weights on worker 0-0, policy_version 568047 (0.00081) [2022-07-10 04:31:20,166][26022] Updated weights on worker 0-0, policy_version 568057 (0.00093) [2022-07-10 04:31:22,155][26022] Updated weights on worker 0-0, policy_version 568067 (0.00095) [2022-07-10 04:31:22,300][25689] Fps is (10 sec: 5719.5, 60 sec: 5699.9, 300 sec: 5644.7). Total num frames: 581701632. Throughput: 0: 5862.3. Samples: 581703466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:22,301][25689] Avg episode reward: [(0, '-26.489')] [2022-07-10 04:31:23,968][26022] Updated weights on worker 0-0, policy_version 568077 (0.00089) [2022-07-10 04:31:25,620][26022] Updated weights on worker 0-0, policy_version 568087 (0.00091) [2022-07-10 04:31:27,349][25689] Fps is (10 sec: 5598.8, 60 sec: 5652.4, 300 sec: 5644.1). Total num frames: 581729280. Throughput: 0: 5974.9. Samples: 581737440. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:27,350][25689] Avg episode reward: [(0, '-25.022')] [2022-07-10 04:31:27,694][26022] Updated weights on worker 0-0, policy_version 568097 (0.00087) [2022-07-10 04:31:29,301][26022] Updated weights on worker 0-0, policy_version 568107 (0.00085) [2022-07-10 04:31:31,185][26022] Updated weights on worker 0-0, policy_version 568117 (0.00087) [2022-07-10 04:31:32,367][25689] Fps is (10 sec: 5594.8, 60 sec: 5671.0, 300 sec: 5648.6). Total num frames: 581757952. Throughput: 0: 5092.4. Samples: 581754074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:32,367][25689] Avg episode reward: [(0, '-26.042')] [2022-07-10 04:31:33,154][26022] Updated weights on worker 0-0, policy_version 568127 (0.00087) [2022-07-10 04:31:34,788][26022] Updated weights on worker 0-0, policy_version 568137 (0.00086) [2022-07-10 04:31:36,753][26022] Updated weights on worker 0-0, policy_version 568147 (0.00089) [2022-07-10 04:31:37,388][25689] Fps is (10 sec: 5814.4, 60 sec: 5660.2, 300 sec: 5653.2). Total num frames: 581787648. Throughput: 0: 5925.5. Samples: 581788178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:37,388][25689] Avg episode reward: [(0, '-25.376')] [2022-07-10 04:31:38,187][26022] Updated weights on worker 0-0, policy_version 568157 (0.00093) [2022-07-10 04:31:40,334][26022] Updated weights on worker 0-0, policy_version 568167 (0.00090) [2022-07-10 04:31:41,941][26022] Updated weights on worker 0-0, policy_version 568177 (0.00091) [2022-07-10 04:31:42,399][25689] Fps is (10 sec: 5614.1, 60 sec: 5643.5, 300 sec: 5640.6). Total num frames: 581814272. Throughput: 0: 5904.3. Samples: 581822134. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:42,399][25689] Avg episode reward: [(0, '-25.831')] [2022-07-10 04:31:43,993][26022] Updated weights on worker 0-0, policy_version 568187 (0.00085) [2022-07-10 04:31:45,709][26022] Updated weights on worker 0-0, policy_version 568197 (0.00094) [2022-07-10 04:31:47,446][25689] Fps is (10 sec: 5497.7, 60 sec: 5644.9, 300 sec: 5644.0). Total num frames: 581842944. Throughput: 0: 5051.9. Samples: 581838966. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:47,446][25689] Avg episode reward: [(0, '-25.604')] [2022-07-10 04:31:47,545][26022] Updated weights on worker 0-0, policy_version 568207 (0.00091) [2022-07-10 04:31:49,231][26022] Updated weights on worker 0-0, policy_version 568217 (0.00086) [2022-07-10 04:31:51,227][26022] Updated weights on worker 0-0, policy_version 568227 (0.00078) [2022-07-10 04:31:52,455][25689] Fps is (10 sec: 5804.2, 60 sec: 5662.2, 300 sec: 5647.3). Total num frames: 581872640. Throughput: 0: 5925.5. Samples: 581873106. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:52,455][25689] Avg episode reward: [(0, '-26.707')] [2022-07-10 04:31:52,898][26022] Updated weights on worker 0-0, policy_version 568237 (0.00091) [2022-07-10 04:31:54,815][26022] Updated weights on worker 0-0, policy_version 568247 (0.00081) [2022-07-10 04:31:56,327][26022] Updated weights on worker 0-0, policy_version 568257 (0.00088) [2022-07-10 04:31:57,459][25689] Fps is (10 sec: 5727.1, 60 sec: 5649.9, 300 sec: 5644.5). Total num frames: 581900288. Throughput: 0: 5947.4. Samples: 581907546. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:31:57,459][25689] Avg episode reward: [(0, '-27.234')] [2022-07-10 04:31:58,264][26022] Updated weights on worker 0-0, policy_version 568267 (0.00094) [2022-07-10 04:31:59,702][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:31:59,714][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000568275_581913600.pth [2022-07-10 04:31:59,714][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000566287_579877888.pth [2022-07-10 04:32:00,059][26022] Updated weights on worker 0-0, policy_version 568277 (0.00084) [2022-07-10 04:32:01,836][26022] Updated weights on worker 0-0, policy_version 568287 (0.00090) [2022-07-10 04:32:02,469][25689] Fps is (10 sec: 5317.3, 60 sec: 5634.1, 300 sec: 5641.8). Total num frames: 581925888. Throughput: 0: 5090.7. Samples: 581924306. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:02,469][25689] Avg episode reward: [(0, '-27.193')] [2022-07-10 04:32:04,147][26022] Updated weights on worker 0-0, policy_version 568297 (0.00082) [2022-07-10 04:32:05,771][26022] Updated weights on worker 0-0, policy_version 568307 (0.00088) [2022-07-10 04:32:07,541][25689] Fps is (10 sec: 5484.4, 60 sec: 5654.2, 300 sec: 5647.5). Total num frames: 581955584. Throughput: 0: 5849.1. Samples: 581956504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:07,543][25689] Avg episode reward: [(0, '-26.047')] [2022-07-10 04:32:07,552][26022] Updated weights on worker 0-0, policy_version 568317 (0.00090) [2022-07-10 04:32:09,313][26022] Updated weights on worker 0-0, policy_version 568327 (0.00082) [2022-07-10 04:32:11,216][26022] Updated weights on worker 0-0, policy_version 568337 (0.00080) [2022-07-10 04:32:12,554][25689] Fps is (10 sec: 5787.2, 60 sec: 5639.8, 300 sec: 5640.9). Total num frames: 581984256. Throughput: 0: 5855.5. Samples: 581990800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:12,555][25689] Avg episode reward: [(0, '-26.667')] [2022-07-10 04:32:13,088][26022] Updated weights on worker 0-0, policy_version 568347 (0.00092) [2022-07-10 04:32:14,811][26022] Updated weights on worker 0-0, policy_version 568357 (0.00091) [2022-07-10 04:32:16,539][26022] Updated weights on worker 0-0, policy_version 568367 (0.00091) [2022-07-10 04:32:17,557][25689] Fps is (10 sec: 5623.0, 60 sec: 5624.0, 300 sec: 5644.9). Total num frames: 582011904. Throughput: 0: 4991.7. Samples: 582007872. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:17,558][25689] Avg episode reward: [(0, '-27.188')] [2022-07-10 04:32:18,333][26022] Updated weights on worker 0-0, policy_version 568377 (0.00087) [2022-07-10 04:32:20,259][26022] Updated weights on worker 0-0, policy_version 568387 (0.00092) [2022-07-10 04:32:21,899][26022] Updated weights on worker 0-0, policy_version 568397 (0.00086) [2022-07-10 04:32:22,578][25689] Fps is (10 sec: 5720.9, 60 sec: 5640.0, 300 sec: 5645.7). Total num frames: 582041600. Throughput: 0: 5871.9. Samples: 582042386. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:22,579][25689] Avg episode reward: [(0, '-26.014')] [2022-07-10 04:32:23,767][26022] Updated weights on worker 0-0, policy_version 568407 (0.00096) [2022-07-10 04:32:25,604][26022] Updated weights on worker 0-0, policy_version 568417 (0.00095) [2022-07-10 04:32:27,492][26022] Updated weights on worker 0-0, policy_version 568427 (0.00084) [2022-07-10 04:32:27,610][25689] Fps is (10 sec: 5704.0, 60 sec: 5641.6, 300 sec: 5646.2). Total num frames: 582069248. Throughput: 0: 5971.1. Samples: 582076340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:27,611][25689] Avg episode reward: [(0, '-24.965')] [2022-07-10 04:32:29,295][26022] Updated weights on worker 0-0, policy_version 568437 (0.00626) [2022-07-10 04:32:31,062][26022] Updated weights on worker 0-0, policy_version 568447 (0.00089) [2022-07-10 04:32:32,627][25689] Fps is (10 sec: 5706.7, 60 sec: 5658.7, 300 sec: 5649.8). Total num frames: 582098944. Throughput: 0: 5106.6. Samples: 582093300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:32,627][25689] Avg episode reward: [(0, '-24.180')] [2022-07-10 04:32:32,909][26022] Updated weights on worker 0-0, policy_version 568457 (0.00090) [2022-07-10 04:32:34,832][26022] Updated weights on worker 0-0, policy_version 568467 (0.00088) [2022-07-10 04:32:36,496][26022] Updated weights on worker 0-0, policy_version 568477 (0.00093) [2022-07-10 04:32:37,647][25689] Fps is (10 sec: 5611.6, 60 sec: 5607.8, 300 sec: 5640.1). Total num frames: 582125568. Throughput: 0: 5950.4. Samples: 582127414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:37,647][25689] Avg episode reward: [(0, '-23.945')] [2022-07-10 04:32:38,276][26022] Updated weights on worker 0-0, policy_version 568487 (0.00090) [2022-07-10 04:32:40,139][26022] Updated weights on worker 0-0, policy_version 568497 (0.00082) [2022-07-10 04:32:42,013][26022] Updated weights on worker 0-0, policy_version 568507 (0.00085) [2022-07-10 04:32:42,659][25689] Fps is (10 sec: 5613.9, 60 sec: 5658.7, 300 sec: 5648.6). Total num frames: 582155264. Throughput: 0: 5928.1. Samples: 582161426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:42,659][25689] Avg episode reward: [(0, '-24.460')] [2022-07-10 04:32:43,806][26022] Updated weights on worker 0-0, policy_version 568517 (0.00089) [2022-07-10 04:32:45,475][26022] Updated weights on worker 0-0, policy_version 568527 (0.00085) [2022-07-10 04:32:47,417][26022] Updated weights on worker 0-0, policy_version 568537 (0.00085) [2022-07-10 04:32:47,723][25689] Fps is (10 sec: 5792.8, 60 sec: 5657.1, 300 sec: 5647.9). Total num frames: 582183936. Throughput: 0: 5077.5. Samples: 582178460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:47,723][25689] Avg episode reward: [(0, '-25.509')] [2022-07-10 04:32:49,238][26022] Updated weights on worker 0-0, policy_version 568547 (0.00091) [2022-07-10 04:32:50,955][26022] Updated weights on worker 0-0, policy_version 568557 (0.00086) [2022-07-10 04:32:52,730][25689] Fps is (10 sec: 5592.0, 60 sec: 5623.2, 300 sec: 5644.4). Total num frames: 582211584. Throughput: 0: 5934.2. Samples: 582212600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:32:52,732][25689] Avg episode reward: [(0, '-25.662')] [2022-07-10 04:32:52,762][26022] Updated weights on worker 0-0, policy_version 568567 (0.00086) [2022-07-10 04:32:54,395][26022] Updated weights on worker 0-0, policy_version 568577 (0.00062) [2022-07-10 04:32:56,445][26022] Updated weights on worker 0-0, policy_version 568587 (0.00091) [2022-07-10 04:32:57,767][25689] Fps is (10 sec: 5709.2, 60 sec: 5654.2, 300 sec: 5651.1). Total num frames: 582241280. Throughput: 0: 5933.8. Samples: 582246802. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:32:57,768][25689] Avg episode reward: [(0, '-26.713')] [2022-07-10 04:32:58,106][26022] Updated weights on worker 0-0, policy_version 568597 (0.00092) [2022-07-10 04:33:00,063][26022] Updated weights on worker 0-0, policy_version 568607 (0.00090) [2022-07-10 04:33:01,842][26022] Updated weights on worker 0-0, policy_version 568617 (0.00096) [2022-07-10 04:33:02,775][25689] Fps is (10 sec: 5505.1, 60 sec: 5654.4, 300 sec: 5650.0). Total num frames: 582266880. Throughput: 0: 5078.4. Samples: 582263584. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:02,775][25689] Avg episode reward: [(0, '-26.981')] [2022-07-10 04:33:04,171][26022] Updated weights on worker 0-0, policy_version 568627 (0.00095) [2022-07-10 04:33:05,835][26022] Updated weights on worker 0-0, policy_version 568637 (0.00095) [2022-07-10 04:33:07,748][26022] Updated weights on worker 0-0, policy_version 568647 (0.00085) [2022-07-10 04:33:07,810][25689] Fps is (10 sec: 5302.0, 60 sec: 5623.9, 300 sec: 5643.9). Total num frames: 582294528. Throughput: 0: 5805.2. Samples: 582295070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:07,810][25689] Avg episode reward: [(0, '-27.600')] [2022-07-10 04:33:09,578][26022] Updated weights on worker 0-0, policy_version 568657 (0.00094) [2022-07-10 04:33:11,276][26022] Updated weights on worker 0-0, policy_version 568667 (0.00076) [2022-07-10 04:33:12,832][25689] Fps is (10 sec: 5599.8, 60 sec: 5623.0, 300 sec: 5643.7). Total num frames: 582323200. Throughput: 0: 5797.3. Samples: 582329138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:12,833][25689] Avg episode reward: [(0, '-27.328')] [2022-07-10 04:33:13,135][26022] Updated weights on worker 0-0, policy_version 568677 (0.00088) [2022-07-10 04:33:14,918][26022] Updated weights on worker 0-0, policy_version 568687 (0.00090) [2022-07-10 04:33:16,716][26022] Updated weights on worker 0-0, policy_version 568697 (0.00093) [2022-07-10 04:33:17,863][25689] Fps is (10 sec: 5806.1, 60 sec: 5654.4, 300 sec: 5647.0). Total num frames: 582352896. Throughput: 0: 4948.5. Samples: 582346246. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:17,863][25689] Avg episode reward: [(0, '-26.685')] [2022-07-10 04:33:18,598][26022] Updated weights on worker 0-0, policy_version 568707 (0.00087) [2022-07-10 04:33:20,368][26022] Updated weights on worker 0-0, policy_version 568717 (0.00094) [2022-07-10 04:33:22,275][26022] Updated weights on worker 0-0, policy_version 568727 (0.00511) [2022-07-10 04:33:22,882][25689] Fps is (10 sec: 5604.2, 60 sec: 5603.6, 300 sec: 5644.1). Total num frames: 582379520. Throughput: 0: 5792.9. Samples: 582380062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:22,885][25689] Avg episode reward: [(0, '-25.020')] [2022-07-10 04:33:23,954][26022] Updated weights on worker 0-0, policy_version 568737 (0.00089) [2022-07-10 04:33:25,743][26022] Updated weights on worker 0-0, policy_version 568747 (0.00089) [2022-07-10 04:33:27,516][26022] Updated weights on worker 0-0, policy_version 568757 (0.00092) [2022-07-10 04:33:27,956][25689] Fps is (10 sec: 5681.3, 60 sec: 5650.6, 300 sec: 5646.4). Total num frames: 582410240. Throughput: 0: 5908.1. Samples: 582414096. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:27,957][25689] Avg episode reward: [(0, '-26.224')] [2022-07-10 04:33:29,469][26022] Updated weights on worker 0-0, policy_version 568767 (0.00089) [2022-07-10 04:33:31,189][26022] Updated weights on worker 0-0, policy_version 568777 (0.00093) [2022-07-10 04:33:32,985][25689] Fps is (10 sec: 5574.3, 60 sec: 5581.6, 300 sec: 5632.3). Total num frames: 582435840. Throughput: 0: 5055.9. Samples: 582431030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:32,986][25689] Avg episode reward: [(0, '-26.217')] [2022-07-10 04:33:33,135][26022] Updated weights on worker 0-0, policy_version 568787 (0.00089) [2022-07-10 04:33:34,705][26022] Updated weights on worker 0-0, policy_version 568797 (0.00088) [2022-07-10 04:33:36,674][26022] Updated weights on worker 0-0, policy_version 568807 (0.00085) [2022-07-10 04:33:38,001][25689] Fps is (10 sec: 5505.1, 60 sec: 5632.9, 300 sec: 5639.0). Total num frames: 582465536. Throughput: 0: 5902.0. Samples: 582465100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:38,001][25689] Avg episode reward: [(0, '-25.436')] [2022-07-10 04:33:38,337][26022] Updated weights on worker 0-0, policy_version 568817 (0.00096) [2022-07-10 04:33:40,290][26022] Updated weights on worker 0-0, policy_version 568827 (0.00099) [2022-07-10 04:33:41,903][26022] Updated weights on worker 0-0, policy_version 568837 (0.00083) [2022-07-10 04:33:43,006][25689] Fps is (10 sec: 5824.9, 60 sec: 5616.6, 300 sec: 5647.8). Total num frames: 582494208. Throughput: 0: 5911.6. Samples: 582499028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:43,006][25689] Avg episode reward: [(0, '-24.568')] [2022-07-10 04:33:44,037][26022] Updated weights on worker 0-0, policy_version 568847 (0.00082) [2022-07-10 04:33:45,515][26022] Updated weights on worker 0-0, policy_version 568857 (0.00081) [2022-07-10 04:33:47,597][26022] Updated weights on worker 0-0, policy_version 568867 (0.00090) [2022-07-10 04:33:48,131][25689] Fps is (10 sec: 5660.5, 60 sec: 5610.9, 300 sec: 5642.4). Total num frames: 582522880. Throughput: 0: 5050.3. Samples: 582515986. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:48,132][25689] Avg episode reward: [(0, '-25.106')] [2022-07-10 04:33:49,409][26022] Updated weights on worker 0-0, policy_version 568877 (0.00094) [2022-07-10 04:33:51,258][26022] Updated weights on worker 0-0, policy_version 568887 (0.00091) [2022-07-10 04:33:53,076][26022] Updated weights on worker 0-0, policy_version 568897 (0.00091) [2022-07-10 04:33:53,168][25689] Fps is (10 sec: 5542.1, 60 sec: 5608.1, 300 sec: 5639.7). Total num frames: 582550528. Throughput: 0: 5862.3. Samples: 582549346. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:53,169][25689] Avg episode reward: [(0, '-24.487')] [2022-07-10 04:33:54,838][26022] Updated weights on worker 0-0, policy_version 568907 (0.00082) [2022-07-10 04:33:56,624][26022] Updated weights on worker 0-0, policy_version 568917 (0.00568) [2022-07-10 04:33:58,234][25689] Fps is (10 sec: 5574.8, 60 sec: 5588.5, 300 sec: 5645.4). Total num frames: 582579200. Throughput: 0: 5846.4. Samples: 582583392. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:33:58,234][25689] Avg episode reward: [(0, '-23.813')] [2022-07-10 04:33:58,707][26022] Updated weights on worker 0-0, policy_version 568927 (0.00084) [2022-07-10 04:33:59,765][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:33:59,783][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000568934_582588416.pth [2022-07-10 04:33:59,783][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000566949_580555776.pth [2022-07-10 04:34:00,052][26022] Updated weights on worker 0-0, policy_version 568937 (0.00303) [2022-07-10 04:34:02,679][26022] Updated weights on worker 0-0, policy_version 568947 (0.00091) [2022-07-10 04:34:03,236][25689] Fps is (10 sec: 5390.5, 60 sec: 5589.0, 300 sec: 5636.4). Total num frames: 582604800. Throughput: 0: 5748.1. Samples: 582615314. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:03,238][25689] Avg episode reward: [(0, '-23.500')] [2022-07-10 04:34:04,189][26022] Updated weights on worker 0-0, policy_version 568957 (0.00093) [2022-07-10 04:34:06,355][26022] Updated weights on worker 0-0, policy_version 568967 (0.00088) [2022-07-10 04:34:07,856][26022] Updated weights on worker 0-0, policy_version 568977 (0.00098) [2022-07-10 04:34:08,322][25689] Fps is (10 sec: 5481.2, 60 sec: 5618.2, 300 sec: 5643.2). Total num frames: 582634496. Throughput: 0: 5767.6. Samples: 582632440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:08,323][25689] Avg episode reward: [(0, '-23.468')] [2022-07-10 04:34:09,870][26022] Updated weights on worker 0-0, policy_version 568987 (0.00087) [2022-07-10 04:34:11,350][26022] Updated weights on worker 0-0, policy_version 568997 (0.00092) [2022-07-10 04:34:13,340][25689] Fps is (10 sec: 5675.7, 60 sec: 5601.7, 300 sec: 5637.2). Total num frames: 582662144. Throughput: 0: 5808.5. Samples: 582666512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:13,340][25689] Avg episode reward: [(0, '-24.442')] [2022-07-10 04:34:13,523][26022] Updated weights on worker 0-0, policy_version 569007 (0.00091) [2022-07-10 04:34:15,070][26022] Updated weights on worker 0-0, policy_version 569017 (0.00100) [2022-07-10 04:34:16,894][26022] Updated weights on worker 0-0, policy_version 569027 (0.00086) [2022-07-10 04:34:18,352][25689] Fps is (10 sec: 5717.6, 60 sec: 5603.4, 300 sec: 5644.5). Total num frames: 582691840. Throughput: 0: 5837.3. Samples: 582700824. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:18,352][25689] Avg episode reward: [(0, '-23.857')] [2022-07-10 04:34:18,521][26022] Updated weights on worker 0-0, policy_version 569037 (0.00095) [2022-07-10 04:34:20,596][26022] Updated weights on worker 0-0, policy_version 569047 (0.00085) [2022-07-10 04:34:22,264][26022] Updated weights on worker 0-0, policy_version 569057 (0.00089) [2022-07-10 04:34:23,409][25689] Fps is (10 sec: 5694.7, 60 sec: 5616.8, 300 sec: 5634.1). Total num frames: 582719488. Throughput: 0: 5085.0. Samples: 582717894. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:23,410][25689] Avg episode reward: [(0, '-24.828')] [2022-07-10 04:34:24,128][26022] Updated weights on worker 0-0, policy_version 569067 (0.00093) [2022-07-10 04:34:25,839][26022] Updated weights on worker 0-0, policy_version 569077 (0.00083) [2022-07-10 04:34:27,766][26022] Updated weights on worker 0-0, policy_version 569087 (0.00085) [2022-07-10 04:34:28,533][25689] Fps is (10 sec: 5632.2, 60 sec: 5595.3, 300 sec: 5636.7). Total num frames: 582749184. Throughput: 0: 5922.2. Samples: 582752132. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:28,534][25689] Avg episode reward: [(0, '-24.345')] [2022-07-10 04:34:29,332][26022] Updated weights on worker 0-0, policy_version 569097 (0.00081) [2022-07-10 04:34:31,274][26022] Updated weights on worker 0-0, policy_version 569107 (0.00092) [2022-07-10 04:34:32,995][26022] Updated weights on worker 0-0, policy_version 569117 (0.00081) [2022-07-10 04:34:33,598][25689] Fps is (10 sec: 5728.5, 60 sec: 5642.6, 300 sec: 5639.3). Total num frames: 582777856. Throughput: 0: 5897.7. Samples: 582785992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:33,599][25689] Avg episode reward: [(0, '-24.044')] [2022-07-10 04:34:35,013][26022] Updated weights on worker 0-0, policy_version 569127 (0.00090) [2022-07-10 04:34:36,629][26022] Updated weights on worker 0-0, policy_version 569137 (0.00082) [2022-07-10 04:34:38,675][25689] Fps is (10 sec: 5553.0, 60 sec: 5603.1, 300 sec: 5634.6). Total num frames: 582805504. Throughput: 0: 5030.6. Samples: 582803064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:38,676][25689] Avg episode reward: [(0, '-23.553')] [2022-07-10 04:34:38,684][26022] Updated weights on worker 0-0, policy_version 569147 (0.00084) [2022-07-10 04:34:40,312][26022] Updated weights on worker 0-0, policy_version 569157 (0.00085) [2022-07-10 04:34:42,126][26022] Updated weights on worker 0-0, policy_version 569167 (0.00087) [2022-07-10 04:34:43,757][25689] Fps is (10 sec: 5644.7, 60 sec: 5612.9, 300 sec: 5634.3). Total num frames: 582835200. Throughput: 0: 5876.4. Samples: 582837466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:43,758][25689] Avg episode reward: [(0, '-23.134')] [2022-07-10 04:34:43,956][26022] Updated weights on worker 0-0, policy_version 569177 (0.00093) [2022-07-10 04:34:45,639][26022] Updated weights on worker 0-0, policy_version 569187 (0.00376) [2022-07-10 04:34:47,624][26022] Updated weights on worker 0-0, policy_version 569197 (0.00089) [2022-07-10 04:34:48,834][25689] Fps is (10 sec: 5947.1, 60 sec: 5651.1, 300 sec: 5640.5). Total num frames: 582865920. Throughput: 0: 5884.8. Samples: 582871600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:48,835][25689] Avg episode reward: [(0, '-22.825')] [2022-07-10 04:34:49,523][26022] Updated weights on worker 0-0, policy_version 569207 (0.00082) [2022-07-10 04:34:51,107][26022] Updated weights on worker 0-0, policy_version 569217 (0.00088) [2022-07-10 04:34:53,086][26022] Updated weights on worker 0-0, policy_version 569227 (0.00084) [2022-07-10 04:34:53,932][25689] Fps is (10 sec: 5736.4, 60 sec: 5645.4, 300 sec: 5635.2). Total num frames: 582893568. Throughput: 0: 5048.9. Samples: 582888656. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:53,933][25689] Avg episode reward: [(0, '-23.152')] [2022-07-10 04:34:54,748][26022] Updated weights on worker 0-0, policy_version 569237 (0.00089) [2022-07-10 04:34:56,474][26022] Updated weights on worker 0-0, policy_version 569247 (0.00090) [2022-07-10 04:34:58,424][26022] Updated weights on worker 0-0, policy_version 569257 (0.00087) [2022-07-10 04:34:58,969][25689] Fps is (10 sec: 5657.9, 60 sec: 5665.0, 300 sec: 5642.8). Total num frames: 582923264. Throughput: 0: 5895.9. Samples: 582922716. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:34:58,970][25689] Avg episode reward: [(0, '-23.454')] [2022-07-10 04:35:00,364][26022] Updated weights on worker 0-0, policy_version 569267 (0.00079) [2022-07-10 04:35:02,221][26022] Updated weights on worker 0-0, policy_version 569277 (0.00091) [2022-07-10 04:35:04,035][25689] Fps is (10 sec: 5371.8, 60 sec: 5642.2, 300 sec: 5630.4). Total num frames: 582947840. Throughput: 0: 5784.4. Samples: 582954762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:04,036][25689] Avg episode reward: [(0, '-23.763')] [2022-07-10 04:35:04,214][26022] Updated weights on worker 0-0, policy_version 569287 (0.00083) [2022-07-10 04:35:05,818][26022] Updated weights on worker 0-0, policy_version 569297 (0.00087) [2022-07-10 04:35:07,767][26022] Updated weights on worker 0-0, policy_version 569307 (0.00088) [2022-07-10 04:35:09,092][25689] Fps is (10 sec: 5361.6, 60 sec: 5644.9, 300 sec: 5640.1). Total num frames: 582977536. Throughput: 0: 4952.4. Samples: 582971924. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:09,092][25689] Avg episode reward: [(0, '-24.630')] [2022-07-10 04:35:09,554][26022] Updated weights on worker 0-0, policy_version 569317 (0.00100) [2022-07-10 04:35:11,210][26022] Updated weights on worker 0-0, policy_version 569327 (0.00089) [2022-07-10 04:35:13,186][26022] Updated weights on worker 0-0, policy_version 569337 (0.00094) [2022-07-10 04:35:14,192][25689] Fps is (10 sec: 5948.4, 60 sec: 5687.7, 300 sec: 5641.8). Total num frames: 583008256. Throughput: 0: 5819.5. Samples: 583006558. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:14,192][25689] Avg episode reward: [(0, '-25.420')] [2022-07-10 04:35:14,844][26022] Updated weights on worker 0-0, policy_version 569347 (0.00090) [2022-07-10 04:35:16,593][26022] Updated weights on worker 0-0, policy_version 569357 (0.00086) [2022-07-10 04:35:18,550][26022] Updated weights on worker 0-0, policy_version 569367 (0.00089) [2022-07-10 04:35:19,250][25689] Fps is (10 sec: 5746.2, 60 sec: 5649.8, 300 sec: 5645.1). Total num frames: 583035904. Throughput: 0: 5834.1. Samples: 583041032. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:19,250][25689] Avg episode reward: [(0, '-24.446')] [2022-07-10 04:35:20,088][26022] Updated weights on worker 0-0, policy_version 569377 (0.00081) [2022-07-10 04:35:21,984][26022] Updated weights on worker 0-0, policy_version 569387 (0.00091) [2022-07-10 04:35:23,644][26022] Updated weights on worker 0-0, policy_version 569397 (0.00083) [2022-07-10 04:35:24,289][25689] Fps is (10 sec: 5578.1, 60 sec: 5668.4, 300 sec: 5639.0). Total num frames: 583064576. Throughput: 0: 5107.9. Samples: 583058214. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:24,289][25689] Avg episode reward: [(0, '-24.522')] [2022-07-10 04:35:25,464][26022] Updated weights on worker 0-0, policy_version 569407 (0.00091) [2022-07-10 04:35:27,388][26022] Updated weights on worker 0-0, policy_version 569417 (0.00092) [2022-07-10 04:35:29,154][26022] Updated weights on worker 0-0, policy_version 569427 (0.00091) [2022-07-10 04:35:29,387][25689] Fps is (10 sec: 5757.5, 60 sec: 5670.7, 300 sec: 5644.7). Total num frames: 583094272. Throughput: 0: 5934.7. Samples: 583092372. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:29,388][25689] Avg episode reward: [(0, '-25.131')] [2022-07-10 04:35:30,996][26022] Updated weights on worker 0-0, policy_version 569437 (0.00090) [2022-07-10 04:35:32,960][26022] Updated weights on worker 0-0, policy_version 569447 (0.00087) [2022-07-10 04:35:34,434][25689] Fps is (10 sec: 5753.0, 60 sec: 5672.4, 300 sec: 5638.6). Total num frames: 583122944. Throughput: 0: 5909.4. Samples: 583126178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:34,435][25689] Avg episode reward: [(0, '-25.264')] [2022-07-10 04:35:34,678][26022] Updated weights on worker 0-0, policy_version 569457 (0.00090) [2022-07-10 04:35:36,380][26022] Updated weights on worker 0-0, policy_version 569467 (0.00096) [2022-07-10 04:35:38,280][26022] Updated weights on worker 0-0, policy_version 569477 (0.00093) [2022-07-10 04:35:39,485][25689] Fps is (10 sec: 5577.4, 60 sec: 5674.9, 300 sec: 5637.9). Total num frames: 583150592. Throughput: 0: 5883.2. Samples: 583160082. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:39,486][25689] Avg episode reward: [(0, '-24.285')] [2022-07-10 04:35:40,000][26022] Updated weights on worker 0-0, policy_version 569487 (0.00090) [2022-07-10 04:35:41,894][26022] Updated weights on worker 0-0, policy_version 569497 (0.00085) [2022-07-10 04:35:43,671][26022] Updated weights on worker 0-0, policy_version 569507 (0.00084) [2022-07-10 04:35:44,565][25689] Fps is (10 sec: 5559.0, 60 sec: 5658.2, 300 sec: 5637.5). Total num frames: 583179264. Throughput: 0: 5857.7. Samples: 583176990. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:44,566][25689] Avg episode reward: [(0, '-25.809')] [2022-07-10 04:35:45,597][26022] Updated weights on worker 0-0, policy_version 569517 (0.00078) [2022-07-10 04:35:47,345][26022] Updated weights on worker 0-0, policy_version 569527 (0.00078) [2022-07-10 04:35:48,976][26022] Updated weights on worker 0-0, policy_version 569537 (0.00224) [2022-07-10 04:35:49,671][25689] Fps is (10 sec: 5730.0, 60 sec: 5638.7, 300 sec: 5639.2). Total num frames: 583208960. Throughput: 0: 5850.4. Samples: 583211042. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:49,672][25689] Avg episode reward: [(0, '-25.734')] [2022-07-10 04:35:51,015][26022] Updated weights on worker 0-0, policy_version 569547 (0.00094) [2022-07-10 04:35:52,732][26022] Updated weights on worker 0-0, policy_version 569557 (0.00084) [2022-07-10 04:35:54,596][26022] Updated weights on worker 0-0, policy_version 569567 (0.00108) [2022-07-10 04:35:54,695][25689] Fps is (10 sec: 5661.1, 60 sec: 5645.6, 300 sec: 5636.3). Total num frames: 583236608. Throughput: 0: 5880.1. Samples: 583245314. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:54,695][25689] Avg episode reward: [(0, '-25.862')] [2022-07-10 04:35:56,352][26022] Updated weights on worker 0-0, policy_version 569577 (0.00080) [2022-07-10 04:35:58,129][26022] Updated weights on worker 0-0, policy_version 569587 (0.00088) [2022-07-10 04:35:59,709][25689] Fps is (10 sec: 5712.9, 60 sec: 5647.7, 300 sec: 5646.8). Total num frames: 583266304. Throughput: 0: 5068.3. Samples: 583262584. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 04:35:59,710][25689] Avg episode reward: [(0, '-26.093')] [2022-07-10 04:35:59,791][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:35:59,827][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000569597_583267328.pth [2022-07-10 04:35:59,827][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000567612_581234688.pth [2022-07-10 04:35:59,833][26022] Updated weights on worker 0-0, policy_version 569597 (0.00092) [2022-07-10 04:36:01,696][26022] Updated weights on worker 0-0, policy_version 569607 (0.00079) [2022-07-10 04:36:04,015][26022] Updated weights on worker 0-0, policy_version 569617 (0.00087) [2022-07-10 04:36:04,798][25689] Fps is (10 sec: 5574.8, 60 sec: 5679.3, 300 sec: 5640.2). Total num frames: 583292928. Throughput: 0: 5803.6. Samples: 583294408. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:04,798][25689] Avg episode reward: [(0, '-24.954')] [2022-07-10 04:36:05,983][26022] Updated weights on worker 0-0, policy_version 569627 (0.00096) [2022-07-10 04:36:07,573][26022] Updated weights on worker 0-0, policy_version 569637 (0.00088) [2022-07-10 04:36:09,383][26022] Updated weights on worker 0-0, policy_version 569647 (0.00107) [2022-07-10 04:36:09,890][25689] Fps is (10 sec: 5230.1, 60 sec: 5625.4, 300 sec: 5628.9). Total num frames: 583319552. Throughput: 0: 5790.2. Samples: 583328112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:09,891][25689] Avg episode reward: [(0, '-24.738')] [2022-07-10 04:36:11,287][26022] Updated weights on worker 0-0, policy_version 569657 (0.00085) [2022-07-10 04:36:12,923][26022] Updated weights on worker 0-0, policy_version 569667 (0.00090) [2022-07-10 04:36:14,911][25689] Fps is (10 sec: 5467.7, 60 sec: 5599.0, 300 sec: 5628.9). Total num frames: 583348224. Throughput: 0: 4941.6. Samples: 583345212. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:14,911][25689] Avg episode reward: [(0, '-23.759')] [2022-07-10 04:36:15,044][26022] Updated weights on worker 0-0, policy_version 569677 (0.00085) [2022-07-10 04:36:16,532][26022] Updated weights on worker 0-0, policy_version 569687 (0.00096) [2022-07-10 04:36:18,592][26022] Updated weights on worker 0-0, policy_version 569697 (0.00094) [2022-07-10 04:36:19,932][25689] Fps is (10 sec: 5914.6, 60 sec: 5653.0, 300 sec: 5635.5). Total num frames: 583378944. Throughput: 0: 5771.3. Samples: 583379296. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:19,932][25689] Avg episode reward: [(0, '-23.448')] [2022-07-10 04:36:20,004][26022] Updated weights on worker 0-0, policy_version 569707 (0.00087) [2022-07-10 04:36:22,244][26022] Updated weights on worker 0-0, policy_version 569717 (0.00087) [2022-07-10 04:36:23,681][26022] Updated weights on worker 0-0, policy_version 569727 (0.00083) [2022-07-10 04:36:24,940][25689] Fps is (10 sec: 5717.8, 60 sec: 5622.2, 300 sec: 5632.9). Total num frames: 583405568. Throughput: 0: 5901.3. Samples: 583413274. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:24,940][25689] Avg episode reward: [(0, '-24.109')] [2022-07-10 04:36:25,732][26022] Updated weights on worker 0-0, policy_version 569737 (0.00090) [2022-07-10 04:36:27,350][26022] Updated weights on worker 0-0, policy_version 569747 (0.00090) [2022-07-10 04:36:29,335][26022] Updated weights on worker 0-0, policy_version 569757 (0.00087) [2022-07-10 04:36:30,079][25689] Fps is (10 sec: 5550.6, 60 sec: 5618.5, 300 sec: 5634.0). Total num frames: 583435264. Throughput: 0: 5057.7. Samples: 583430218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:30,079][25689] Avg episode reward: [(0, '-24.244')] [2022-07-10 04:36:31,279][26022] Updated weights on worker 0-0, policy_version 569767 (0.00095) [2022-07-10 04:36:33,073][26022] Updated weights on worker 0-0, policy_version 569777 (0.00092) [2022-07-10 04:36:34,552][26022] Updated weights on worker 0-0, policy_version 569787 (0.00087) [2022-07-10 04:36:35,153][25689] Fps is (10 sec: 5715.2, 60 sec: 5616.0, 300 sec: 5629.6). Total num frames: 583463936. Throughput: 0: 5883.1. Samples: 583464296. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:35,153][25689] Avg episode reward: [(0, '-24.600')] [2022-07-10 04:36:36,796][26022] Updated weights on worker 0-0, policy_version 569797 (0.00652) [2022-07-10 04:36:38,090][26022] Updated weights on worker 0-0, policy_version 569807 (0.00096) [2022-07-10 04:36:40,163][25689] Fps is (10 sec: 5585.0, 60 sec: 5619.7, 300 sec: 5633.0). Total num frames: 583491584. Throughput: 0: 5889.5. Samples: 583498446. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:40,163][25689] Avg episode reward: [(0, '-24.137')] [2022-07-10 04:36:40,275][26022] Updated weights on worker 0-0, policy_version 569817 (0.00081) [2022-07-10 04:36:41,905][26022] Updated weights on worker 0-0, policy_version 569827 (0.00095) [2022-07-10 04:36:43,825][26022] Updated weights on worker 0-0, policy_version 569837 (0.00084) [2022-07-10 04:36:45,183][25689] Fps is (10 sec: 5717.2, 60 sec: 5642.2, 300 sec: 5637.0). Total num frames: 583521280. Throughput: 0: 5053.0. Samples: 583515560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:45,183][25689] Avg episode reward: [(0, '-23.697')] [2022-07-10 04:36:45,604][26022] Updated weights on worker 0-0, policy_version 569847 (0.00090) [2022-07-10 04:36:47,462][26022] Updated weights on worker 0-0, policy_version 569857 (0.00955) [2022-07-10 04:36:49,185][26022] Updated weights on worker 0-0, policy_version 569867 (0.00090) [2022-07-10 04:36:50,239][25689] Fps is (10 sec: 5691.2, 60 sec: 5613.1, 300 sec: 5629.2). Total num frames: 583548928. Throughput: 0: 5908.5. Samples: 583549334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:50,239][25689] Avg episode reward: [(0, '-23.047')] [2022-07-10 04:36:51,141][26022] Updated weights on worker 0-0, policy_version 569877 (0.00092) [2022-07-10 04:36:52,549][26022] Updated weights on worker 0-0, policy_version 569887 (0.00085) [2022-07-10 04:36:54,649][26022] Updated weights on worker 0-0, policy_version 569897 (0.00081) [2022-07-10 04:36:55,305][25689] Fps is (10 sec: 5766.3, 60 sec: 5659.8, 300 sec: 5638.3). Total num frames: 583579648. Throughput: 0: 5925.6. Samples: 583583710. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:36:55,305][25689] Avg episode reward: [(0, '-22.842')] [2022-07-10 04:36:56,495][26022] Updated weights on worker 0-0, policy_version 569907 (0.00087) [2022-07-10 04:36:58,108][26022] Updated weights on worker 0-0, policy_version 569917 (0.00085) [2022-07-10 04:37:00,300][26022] Updated weights on worker 0-0, policy_version 569927 (0.00086) [2022-07-10 04:37:00,347][25689] Fps is (10 sec: 5571.8, 60 sec: 5589.7, 300 sec: 5637.7). Total num frames: 583605248. Throughput: 0: 5064.1. Samples: 583600660. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:00,347][25689] Avg episode reward: [(0, '-22.055')] [2022-07-10 04:37:01,720][26022] Updated weights on worker 0-0, policy_version 569937 (0.00087) [2022-07-10 04:37:04,078][26022] Updated weights on worker 0-0, policy_version 569947 (0.00089) [2022-07-10 04:37:05,354][25689] Fps is (10 sec: 5400.8, 60 sec: 5631.0, 300 sec: 5635.5). Total num frames: 583633920. Throughput: 0: 5794.8. Samples: 583632448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:05,354][25689] Avg episode reward: [(0, '-22.010')] [2022-07-10 04:37:05,795][26022] Updated weights on worker 0-0, policy_version 569957 (0.00079) [2022-07-10 04:37:07,655][26022] Updated weights on worker 0-0, policy_version 569967 (0.00090) [2022-07-10 04:37:09,512][26022] Updated weights on worker 0-0, policy_version 569977 (0.00086) [2022-07-10 04:37:10,433][25689] Fps is (10 sec: 5685.6, 60 sec: 5666.1, 300 sec: 5634.3). Total num frames: 583662592. Throughput: 0: 5819.2. Samples: 583666848. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:10,433][25689] Avg episode reward: [(0, '-22.559')] [2022-07-10 04:37:11,139][26022] Updated weights on worker 0-0, policy_version 569987 (0.00088) [2022-07-10 04:37:12,887][26022] Updated weights on worker 0-0, policy_version 569997 (0.00095) [2022-07-10 04:37:14,925][26022] Updated weights on worker 0-0, policy_version 570007 (0.00087) [2022-07-10 04:37:15,435][25689] Fps is (10 sec: 5485.3, 60 sec: 5634.0, 300 sec: 5630.9). Total num frames: 583689216. Throughput: 0: 4973.4. Samples: 583683822. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:15,435][25689] Avg episode reward: [(0, '-24.085')] [2022-07-10 04:37:16,397][26022] Updated weights on worker 0-0, policy_version 570017 (0.00092) [2022-07-10 04:37:18,374][26022] Updated weights on worker 0-0, policy_version 570027 (0.00083) [2022-07-10 04:37:20,145][26022] Updated weights on worker 0-0, policy_version 570037 (0.00082) [2022-07-10 04:37:20,455][25689] Fps is (10 sec: 5619.5, 60 sec: 5617.2, 300 sec: 5630.9). Total num frames: 583718912. Throughput: 0: 5848.7. Samples: 583718266. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:20,455][25689] Avg episode reward: [(0, '-24.820')] [2022-07-10 04:37:21,991][26022] Updated weights on worker 0-0, policy_version 570047 (0.00068) [2022-07-10 04:37:23,746][26022] Updated weights on worker 0-0, policy_version 570057 (0.00829) [2022-07-10 04:37:25,351][26022] Updated weights on worker 0-0, policy_version 570067 (0.00088) [2022-07-10 04:37:25,484][25689] Fps is (10 sec: 5910.2, 60 sec: 5666.0, 300 sec: 5637.8). Total num frames: 583748608. Throughput: 0: 5980.1. Samples: 583752826. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:25,484][25689] Avg episode reward: [(0, '-24.280')] [2022-07-10 04:37:27,249][26022] Updated weights on worker 0-0, policy_version 570077 (0.00084) [2022-07-10 04:37:29,183][26022] Updated weights on worker 0-0, policy_version 570087 (0.00087) [2022-07-10 04:37:30,546][25689] Fps is (10 sec: 5783.9, 60 sec: 5656.2, 300 sec: 5633.5). Total num frames: 583777280. Throughput: 0: 5130.0. Samples: 583770032. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:30,547][25689] Avg episode reward: [(0, '-24.922')] [2022-07-10 04:37:30,878][26022] Updated weights on worker 0-0, policy_version 570097 (0.00086) [2022-07-10 04:37:32,782][26022] Updated weights on worker 0-0, policy_version 570107 (0.00096) [2022-07-10 04:37:34,443][26022] Updated weights on worker 0-0, policy_version 570117 (0.00084) [2022-07-10 04:37:35,548][25689] Fps is (10 sec: 5698.0, 60 sec: 5663.0, 300 sec: 5640.7). Total num frames: 583805952. Throughput: 0: 5985.9. Samples: 583804216. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:35,548][25689] Avg episode reward: [(0, '-25.078')] [2022-07-10 04:37:36,291][26022] Updated weights on worker 0-0, policy_version 570127 (0.00093) [2022-07-10 04:37:38,113][26022] Updated weights on worker 0-0, policy_version 570137 (0.00086) [2022-07-10 04:37:39,856][26022] Updated weights on worker 0-0, policy_version 570147 (0.00092) [2022-07-10 04:37:40,549][25689] Fps is (10 sec: 5630.6, 60 sec: 5663.8, 300 sec: 5634.1). Total num frames: 583833600. Throughput: 0: 5985.1. Samples: 583838530. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:40,550][25689] Avg episode reward: [(0, '-26.306')] [2022-07-10 04:37:41,691][26022] Updated weights on worker 0-0, policy_version 570157 (0.00087) [2022-07-10 04:37:43,580][26022] Updated weights on worker 0-0, policy_version 570167 (0.00090) [2022-07-10 04:37:45,144][26022] Updated weights on worker 0-0, policy_version 570177 (0.00094) [2022-07-10 04:37:45,552][25689] Fps is (10 sec: 5732.2, 60 sec: 5665.4, 300 sec: 5638.7). Total num frames: 583863296. Throughput: 0: 5122.2. Samples: 583855618. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:45,553][25689] Avg episode reward: [(0, '-25.582')] [2022-07-10 04:37:47,076][26022] Updated weights on worker 0-0, policy_version 570187 (0.00090) [2022-07-10 04:37:48,641][26022] Updated weights on worker 0-0, policy_version 570197 (0.00086) [2022-07-10 04:37:50,512][26022] Updated weights on worker 0-0, policy_version 570207 (0.00085) [2022-07-10 04:37:50,632][25689] Fps is (10 sec: 5788.7, 60 sec: 5680.1, 300 sec: 5640.7). Total num frames: 583891968. Throughput: 0: 5976.1. Samples: 583890066. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:50,632][25689] Avg episode reward: [(0, '-25.403')] [2022-07-10 04:37:52,477][26022] Updated weights on worker 0-0, policy_version 570217 (0.00086) [2022-07-10 04:37:54,261][26022] Updated weights on worker 0-0, policy_version 570227 (0.00089) [2022-07-10 04:37:55,634][25689] Fps is (10 sec: 5586.0, 60 sec: 5635.2, 300 sec: 5634.5). Total num frames: 583919616. Throughput: 0: 5982.2. Samples: 583924376. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:37:55,634][25689] Avg episode reward: [(0, '-24.968')] [2022-07-10 04:37:55,992][26022] Updated weights on worker 0-0, policy_version 570237 (0.00091) [2022-07-10 04:37:57,855][26022] Updated weights on worker 0-0, policy_version 570247 (0.00087) [2022-07-10 04:37:59,387][26022] Updated weights on worker 0-0, policy_version 570257 (0.00098) [2022-07-10 04:37:59,881][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:37:59,897][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000570259_583945216.pth [2022-07-10 04:37:59,903][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000568275_581913600.pth [2022-07-10 04:38:00,644][25689] Fps is (10 sec: 5625.3, 60 sec: 5689.2, 300 sec: 5644.8). Total num frames: 583948288. Throughput: 0: 5129.7. Samples: 583941612. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:00,644][25689] Avg episode reward: [(0, '-25.289')] [2022-07-10 04:38:01,411][26022] Updated weights on worker 0-0, policy_version 570267 (0.00087) [2022-07-10 04:38:03,375][26022] Updated weights on worker 0-0, policy_version 570277 (0.00087) [2022-07-10 04:38:05,350][26022] Updated weights on worker 0-0, policy_version 570287 (0.00090) [2022-07-10 04:38:05,657][25689] Fps is (10 sec: 5517.0, 60 sec: 5654.7, 300 sec: 5641.8). Total num frames: 583974912. Throughput: 0: 5870.1. Samples: 583973638. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:05,659][25689] Avg episode reward: [(0, '-26.169')] [2022-07-10 04:38:07,110][26022] Updated weights on worker 0-0, policy_version 570297 (0.00086) [2022-07-10 04:38:08,916][26022] Updated weights on worker 0-0, policy_version 570307 (0.00087) [2022-07-10 04:38:10,726][25689] Fps is (10 sec: 5586.2, 60 sec: 5672.6, 300 sec: 5644.3). Total num frames: 584004608. Throughput: 0: 5855.9. Samples: 584007734. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:10,730][26022] Updated weights on worker 0-0, policy_version 570317 (0.00092) [2022-07-10 04:38:10,731][25689] Avg episode reward: [(0, '-25.666')] [2022-07-10 04:38:12,531][26022] Updated weights on worker 0-0, policy_version 570327 (0.00380) [2022-07-10 04:38:14,214][26022] Updated weights on worker 0-0, policy_version 570337 (0.00085) [2022-07-10 04:38:15,751][25689] Fps is (10 sec: 5782.6, 60 sec: 5704.4, 300 sec: 5641.0). Total num frames: 584033280. Throughput: 0: 5861.5. Samples: 584042290. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:15,751][25689] Avg episode reward: [(0, '-25.840')] [2022-07-10 04:38:16,047][26022] Updated weights on worker 0-0, policy_version 570347 (0.00086) [2022-07-10 04:38:17,760][26022] Updated weights on worker 0-0, policy_version 570357 (0.00089) [2022-07-10 04:38:19,712][26022] Updated weights on worker 0-0, policy_version 570367 (0.00090) [2022-07-10 04:38:20,759][25689] Fps is (10 sec: 5613.3, 60 sec: 5671.5, 300 sec: 5644.6). Total num frames: 584060928. Throughput: 0: 5855.4. Samples: 584059394. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:20,759][25689] Avg episode reward: [(0, '-26.291')] [2022-07-10 04:38:21,351][26022] Updated weights on worker 0-0, policy_version 570377 (0.00086) [2022-07-10 04:38:23,663][26022] Updated weights on worker 0-0, policy_version 570387 (0.00085) [2022-07-10 04:38:24,991][26022] Updated weights on worker 0-0, policy_version 570397 (0.00089) [2022-07-10 04:38:25,775][25689] Fps is (10 sec: 5720.3, 60 sec: 5672.7, 300 sec: 5642.3). Total num frames: 584090624. Throughput: 0: 5944.7. Samples: 584093236. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:25,776][25689] Avg episode reward: [(0, '-26.804')] [2022-07-10 04:38:27,032][26022] Updated weights on worker 0-0, policy_version 570407 (0.00086) [2022-07-10 04:38:28,678][26022] Updated weights on worker 0-0, policy_version 570417 (0.00092) [2022-07-10 04:38:30,694][26022] Updated weights on worker 0-0, policy_version 570427 (0.00089) [2022-07-10 04:38:30,860][25689] Fps is (10 sec: 5677.2, 60 sec: 5653.7, 300 sec: 5648.1). Total num frames: 584118272. Throughput: 0: 5926.2. Samples: 584127052. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:30,860][25689] Avg episode reward: [(0, '-26.497')] [2022-07-10 04:38:32,308][26022] Updated weights on worker 0-0, policy_version 570437 (0.00088) [2022-07-10 04:38:34,246][26022] Updated weights on worker 0-0, policy_version 570447 (0.00085) [2022-07-10 04:38:35,915][25689] Fps is (10 sec: 5554.2, 60 sec: 5648.6, 300 sec: 5643.9). Total num frames: 584146944. Throughput: 0: 5057.5. Samples: 584144272. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:35,916][25689] Avg episode reward: [(0, '-25.461')] [2022-07-10 04:38:36,092][26022] Updated weights on worker 0-0, policy_version 570457 (0.00096) [2022-07-10 04:38:37,886][26022] Updated weights on worker 0-0, policy_version 570467 (0.00084) [2022-07-10 04:38:39,721][26022] Updated weights on worker 0-0, policy_version 570477 (0.00086) [2022-07-10 04:38:40,920][25689] Fps is (10 sec: 5598.5, 60 sec: 5648.3, 300 sec: 5640.5). Total num frames: 584174592. Throughput: 0: 5892.5. Samples: 584178190. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:40,920][25689] Avg episode reward: [(0, '-26.385')] [2022-07-10 04:38:41,415][26022] Updated weights on worker 0-0, policy_version 570487 (0.00091) [2022-07-10 04:38:43,119][26022] Updated weights on worker 0-0, policy_version 570497 (0.00082) [2022-07-10 04:38:45,133][26022] Updated weights on worker 0-0, policy_version 570507 (0.00081) [2022-07-10 04:38:45,935][25689] Fps is (10 sec: 5722.9, 60 sec: 5647.1, 300 sec: 5646.0). Total num frames: 584204288. Throughput: 0: 5934.7. Samples: 584212880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:45,936][25689] Avg episode reward: [(0, '-26.347')] [2022-07-10 04:38:46,675][26022] Updated weights on worker 0-0, policy_version 570517 (0.00092) [2022-07-10 04:38:48,615][26022] Updated weights on worker 0-0, policy_version 570527 (0.00086) [2022-07-10 04:38:50,167][26022] Updated weights on worker 0-0, policy_version 570537 (0.00086) [2022-07-10 04:38:50,973][25689] Fps is (10 sec: 5805.9, 60 sec: 5651.1, 300 sec: 5649.4). Total num frames: 584232960. Throughput: 0: 5134.2. Samples: 584230316. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:50,973][25689] Avg episode reward: [(0, '-25.516')] [2022-07-10 04:38:52,243][26022] Updated weights on worker 0-0, policy_version 570547 (0.00098) [2022-07-10 04:38:53,990][26022] Updated weights on worker 0-0, policy_version 570557 (0.00100) [2022-07-10 04:38:55,652][26022] Updated weights on worker 0-0, policy_version 570567 (0.00087) [2022-07-10 04:38:55,993][25689] Fps is (10 sec: 5701.2, 60 sec: 5666.3, 300 sec: 5650.3). Total num frames: 584261632. Throughput: 0: 5982.9. Samples: 584264398. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:38:55,994][25689] Avg episode reward: [(0, '-25.550')] [2022-07-10 04:38:57,442][26022] Updated weights on worker 0-0, policy_version 570577 (0.00087) [2022-07-10 04:38:59,265][26022] Updated weights on worker 0-0, policy_version 570587 (0.00079) [2022-07-10 04:39:00,995][25689] Fps is (10 sec: 5721.4, 60 sec: 5667.0, 300 sec: 5660.6). Total num frames: 584290304. Throughput: 0: 5999.6. Samples: 584298636. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 04:39:00,996][25689] Avg episode reward: [(0, '-25.759')] [2022-07-10 04:39:01,051][26022] Updated weights on worker 0-0, policy_version 570597 (0.00088) [2022-07-10 04:39:03,247][26022] Updated weights on worker 0-0, policy_version 570607 (0.00089) [2022-07-10 04:39:05,132][26022] Updated weights on worker 0-0, policy_version 570617 (0.00096) [2022-07-10 04:39:06,006][25689] Fps is (10 sec: 5522.5, 60 sec: 5667.3, 300 sec: 5651.7). Total num frames: 584316928. Throughput: 0: 5020.2. Samples: 584313640. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:06,007][25689] Avg episode reward: [(0, '-25.043')] [2022-07-10 04:39:06,970][26022] Updated weights on worker 0-0, policy_version 570627 (0.00085) [2022-07-10 04:39:08,735][26022] Updated weights on worker 0-0, policy_version 570637 (0.00086) [2022-07-10 04:39:10,733][26022] Updated weights on worker 0-0, policy_version 570647 (0.00082) [2022-07-10 04:39:11,050][25689] Fps is (10 sec: 5295.7, 60 sec: 5618.7, 300 sec: 5647.8). Total num frames: 584343552. Throughput: 0: 5846.6. Samples: 584347702. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:11,051][25689] Avg episode reward: [(0, '-24.754')] [2022-07-10 04:39:12,346][26022] Updated weights on worker 0-0, policy_version 570657 (0.00091) [2022-07-10 04:39:14,266][26022] Updated weights on worker 0-0, policy_version 570667 (0.00093) [2022-07-10 04:39:15,783][26022] Updated weights on worker 0-0, policy_version 570677 (0.00087) [2022-07-10 04:39:16,095][25689] Fps is (10 sec: 5683.7, 60 sec: 5650.7, 300 sec: 5650.6). Total num frames: 584374272. Throughput: 0: 5852.4. Samples: 584382042. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:16,097][25689] Avg episode reward: [(0, '-24.519')] [2022-07-10 04:39:17,858][26022] Updated weights on worker 0-0, policy_version 570687 (0.00095) [2022-07-10 04:39:19,459][26022] Updated weights on worker 0-0, policy_version 570697 (0.00948) [2022-07-10 04:39:21,115][25689] Fps is (10 sec: 5901.3, 60 sec: 5666.7, 300 sec: 5654.8). Total num frames: 584402944. Throughput: 0: 4998.9. Samples: 584399210. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:21,115][25689] Avg episode reward: [(0, '-25.731')] [2022-07-10 04:39:21,180][26022] Updated weights on worker 0-0, policy_version 570707 (0.00083) [2022-07-10 04:39:23,131][26022] Updated weights on worker 0-0, policy_version 570717 (0.00092) [2022-07-10 04:39:24,881][26022] Updated weights on worker 0-0, policy_version 570727 (0.00090) [2022-07-10 04:39:26,117][25689] Fps is (10 sec: 5722.1, 60 sec: 5651.0, 300 sec: 5653.6). Total num frames: 584431616. Throughput: 0: 5965.1. Samples: 584433602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:26,118][25689] Avg episode reward: [(0, '-26.513')] [2022-07-10 04:39:26,822][26022] Updated weights on worker 0-0, policy_version 570737 (0.00092) [2022-07-10 04:39:28,410][26022] Updated weights on worker 0-0, policy_version 570747 (0.00094) [2022-07-10 04:39:30,104][26022] Updated weights on worker 0-0, policy_version 570757 (0.00081) [2022-07-10 04:39:31,176][25689] Fps is (10 sec: 5699.7, 60 sec: 5670.4, 300 sec: 5653.8). Total num frames: 584460288. Throughput: 0: 5969.4. Samples: 584467836. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:31,176][25689] Avg episode reward: [(0, '-26.301')] [2022-07-10 04:39:32,146][26022] Updated weights on worker 0-0, policy_version 570767 (0.00091) [2022-07-10 04:39:33,680][26022] Updated weights on worker 0-0, policy_version 570777 (0.00093) [2022-07-10 04:39:35,783][26022] Updated weights on worker 0-0, policy_version 570787 (0.00096) [2022-07-10 04:39:36,219][25689] Fps is (10 sec: 5575.2, 60 sec: 5654.6, 300 sec: 5654.4). Total num frames: 584487936. Throughput: 0: 5111.8. Samples: 584484910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:36,220][25689] Avg episode reward: [(0, '-26.397')] [2022-07-10 04:39:37,699][26022] Updated weights on worker 0-0, policy_version 570797 (0.00086) [2022-07-10 04:39:39,314][26022] Updated weights on worker 0-0, policy_version 570807 (0.00086) [2022-07-10 04:39:41,174][26022] Updated weights on worker 0-0, policy_version 570817 (0.00091) [2022-07-10 04:39:41,239][25689] Fps is (10 sec: 5596.7, 60 sec: 5670.1, 300 sec: 5652.1). Total num frames: 584516608. Throughput: 0: 5939.7. Samples: 584518740. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:41,239][25689] Avg episode reward: [(0, '-26.453')] [2022-07-10 04:39:42,874][26022] Updated weights on worker 0-0, policy_version 570827 (0.00090) [2022-07-10 04:39:44,560][26022] Updated weights on worker 0-0, policy_version 570837 (0.00090) [2022-07-10 04:39:46,243][25689] Fps is (10 sec: 5720.6, 60 sec: 5654.2, 300 sec: 5646.6). Total num frames: 584545280. Throughput: 0: 5934.2. Samples: 584553034. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:46,244][25689] Avg episode reward: [(0, '-26.687')] [2022-07-10 04:39:46,681][26022] Updated weights on worker 0-0, policy_version 570847 (0.00085) [2022-07-10 04:39:48,579][26022] Updated weights on worker 0-0, policy_version 570858 (0.00088) [2022-07-10 04:39:50,480][26022] Updated weights on worker 0-0, policy_version 570868 (0.00088) [2022-07-10 04:39:51,291][25689] Fps is (10 sec: 5704.7, 60 sec: 5653.2, 300 sec: 5651.0). Total num frames: 584573952. Throughput: 0: 5072.8. Samples: 584569876. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:51,291][25689] Avg episode reward: [(0, '-25.343')] [2022-07-10 04:39:51,981][26022] Updated weights on worker 0-0, policy_version 570878 (0.00087) [2022-07-10 04:39:54,129][26022] Updated weights on worker 0-0, policy_version 570888 (0.00092) [2022-07-10 04:39:55,444][26022] Updated weights on worker 0-0, policy_version 570898 (0.00090) [2022-07-10 04:39:56,320][25689] Fps is (10 sec: 5589.3, 60 sec: 5635.5, 300 sec: 5644.3). Total num frames: 584601600. Throughput: 0: 5934.0. Samples: 584604186. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:39:56,320][25689] Avg episode reward: [(0, '-23.752')] [2022-07-10 04:39:57,646][26022] Updated weights on worker 0-0, policy_version 570908 (0.00079) [2022-07-10 04:39:59,251][26022] Updated weights on worker 0-0, policy_version 570918 (0.00098) [2022-07-10 04:40:00,089][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:40:00,109][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000570921_584623104.pth [2022-07-10 04:40:00,109][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000568934_582588416.pth [2022-07-10 04:40:01,110][26022] Updated weights on worker 0-0, policy_version 570928 (0.00097) [2022-07-10 04:40:01,341][25689] Fps is (10 sec: 5705.5, 60 sec: 5650.6, 300 sec: 5662.4). Total num frames: 584631296. Throughput: 0: 5949.3. Samples: 584638336. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:01,342][25689] Avg episode reward: [(0, '-23.361')] [2022-07-10 04:40:03,356][26022] Updated weights on worker 0-0, policy_version 570938 (0.00090) [2022-07-10 04:40:05,008][26022] Updated weights on worker 0-0, policy_version 570948 (0.00084) [2022-07-10 04:40:06,346][25689] Fps is (10 sec: 5617.4, 60 sec: 5651.2, 300 sec: 5653.0). Total num frames: 584657920. Throughput: 0: 4980.2. Samples: 584653148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:06,347][25689] Avg episode reward: [(0, '-22.687')] [2022-07-10 04:40:06,908][26022] Updated weights on worker 0-0, policy_version 570958 (0.00089) [2022-07-10 04:40:08,795][26022] Updated weights on worker 0-0, policy_version 570968 (0.00087) [2022-07-10 04:40:10,543][26022] Updated weights on worker 0-0, policy_version 570978 (0.00087) [2022-07-10 04:40:11,452][25689] Fps is (10 sec: 5266.6, 60 sec: 5645.4, 300 sec: 5639.1). Total num frames: 584684544. Throughput: 0: 5796.0. Samples: 584686730. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:11,452][25689] Avg episode reward: [(0, '-22.872')] [2022-07-10 04:40:12,412][26022] Updated weights on worker 0-0, policy_version 570988 (0.00089) [2022-07-10 04:40:14,271][26022] Updated weights on worker 0-0, policy_version 570998 (0.00083) [2022-07-10 04:40:16,011][26022] Updated weights on worker 0-0, policy_version 571008 (0.00093) [2022-07-10 04:40:16,479][25689] Fps is (10 sec: 5558.0, 60 sec: 5630.2, 300 sec: 5646.6). Total num frames: 584714240. Throughput: 0: 5784.8. Samples: 584720802. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:16,479][25689] Avg episode reward: [(0, '-23.258')] [2022-07-10 04:40:17,939][26022] Updated weights on worker 0-0, policy_version 571018 (0.00087) [2022-07-10 04:40:19,604][26022] Updated weights on worker 0-0, policy_version 571028 (0.00092) [2022-07-10 04:40:21,447][26022] Updated weights on worker 0-0, policy_version 571038 (0.00083) [2022-07-10 04:40:21,487][25689] Fps is (10 sec: 5816.0, 60 sec: 5631.1, 300 sec: 5647.2). Total num frames: 584742912. Throughput: 0: 4940.3. Samples: 584737864. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:21,488][25689] Avg episode reward: [(0, '-23.861')] [2022-07-10 04:40:23,261][26022] Updated weights on worker 0-0, policy_version 571048 (0.00082) [2022-07-10 04:40:24,811][26022] Updated weights on worker 0-0, policy_version 571058 (0.00086) [2022-07-10 04:40:26,489][25689] Fps is (10 sec: 5626.0, 60 sec: 5614.2, 300 sec: 5642.1). Total num frames: 584770560. Throughput: 0: 5921.1. Samples: 584772422. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:26,490][25689] Avg episode reward: [(0, '-23.952')] [2022-07-10 04:40:26,905][26022] Updated weights on worker 0-0, policy_version 571068 (0.00091) [2022-07-10 04:40:28,546][26022] Updated weights on worker 0-0, policy_version 571078 (0.00090) [2022-07-10 04:40:30,407][26022] Updated weights on worker 0-0, policy_version 571088 (0.00090) [2022-07-10 04:40:31,606][25689] Fps is (10 sec: 5667.1, 60 sec: 5625.8, 300 sec: 5644.2). Total num frames: 584800256. Throughput: 0: 5931.7. Samples: 584806280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:31,607][25689] Avg episode reward: [(0, '-24.149')] [2022-07-10 04:40:32,235][26022] Updated weights on worker 0-0, policy_version 571098 (0.00095) [2022-07-10 04:40:34,176][26022] Updated weights on worker 0-0, policy_version 571108 (0.00092) [2022-07-10 04:40:35,759][26022] Updated weights on worker 0-0, policy_version 571118 (0.00081) [2022-07-10 04:40:36,637][25689] Fps is (10 sec: 5650.8, 60 sec: 5626.9, 300 sec: 5644.6). Total num frames: 584827904. Throughput: 0: 5086.6. Samples: 584823340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:36,638][25689] Avg episode reward: [(0, '-24.601')] [2022-07-10 04:40:37,641][26022] Updated weights on worker 0-0, policy_version 571128 (0.00095) [2022-07-10 04:40:39,133][26022] Updated weights on worker 0-0, policy_version 571138 (0.00116) [2022-07-10 04:40:41,308][26022] Updated weights on worker 0-0, policy_version 571148 (0.00089) [2022-07-10 04:40:41,655][25689] Fps is (10 sec: 5706.8, 60 sec: 5644.1, 300 sec: 5649.3). Total num frames: 584857600. Throughput: 0: 5930.8. Samples: 584857472. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:41,655][25689] Avg episode reward: [(0, '-25.126')] [2022-07-10 04:40:43,026][26022] Updated weights on worker 0-0, policy_version 571158 (0.00083) [2022-07-10 04:40:44,795][26022] Updated weights on worker 0-0, policy_version 571168 (0.00096) [2022-07-10 04:40:46,667][25689] Fps is (10 sec: 5717.7, 60 sec: 5626.4, 300 sec: 5644.2). Total num frames: 584885248. Throughput: 0: 5918.4. Samples: 584891838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:46,667][25689] Avg episode reward: [(0, '-25.561')] [2022-07-10 04:40:46,736][26022] Updated weights on worker 0-0, policy_version 571178 (0.00056) [2022-07-10 04:40:48,457][26022] Updated weights on worker 0-0, policy_version 571188 (0.00086) [2022-07-10 04:40:50,248][26022] Updated weights on worker 0-0, policy_version 571198 (0.00079) [2022-07-10 04:40:51,782][25689] Fps is (10 sec: 5662.1, 60 sec: 5637.0, 300 sec: 5649.3). Total num frames: 584914944. Throughput: 0: 5072.3. Samples: 584908620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:51,784][25689] Avg episode reward: [(0, '-25.592')] [2022-07-10 04:40:52,023][26022] Updated weights on worker 0-0, policy_version 571208 (0.00116) [2022-07-10 04:40:53,771][26022] Updated weights on worker 0-0, policy_version 571218 (0.00086) [2022-07-10 04:40:55,595][26022] Updated weights on worker 0-0, policy_version 571228 (0.00088) [2022-07-10 04:40:56,805][25689] Fps is (10 sec: 5757.1, 60 sec: 5654.5, 300 sec: 5645.7). Total num frames: 584943616. Throughput: 0: 5949.2. Samples: 584943322. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:40:56,806][25689] Avg episode reward: [(0, '-25.380')] [2022-07-10 04:40:57,359][26022] Updated weights on worker 0-0, policy_version 571238 (0.00096) [2022-07-10 04:40:59,259][26022] Updated weights on worker 0-0, policy_version 571248 (0.00084) [2022-07-10 04:41:00,943][26022] Updated weights on worker 0-0, policy_version 571258 (0.00090) [2022-07-10 04:41:01,862][25689] Fps is (10 sec: 5587.4, 60 sec: 5617.4, 300 sec: 5649.7). Total num frames: 584971264. Throughput: 0: 5943.1. Samples: 584977568. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:01,863][25689] Avg episode reward: [(0, '-24.648')] [2022-07-10 04:41:03,123][26022] Updated weights on worker 0-0, policy_version 571268 (0.00083) [2022-07-10 04:41:04,914][26022] Updated weights on worker 0-0, policy_version 571278 (0.00103) [2022-07-10 04:41:06,783][26022] Updated weights on worker 0-0, policy_version 571288 (0.00088) [2022-07-10 04:41:06,880][25689] Fps is (10 sec: 5488.2, 60 sec: 5633.0, 300 sec: 5654.6). Total num frames: 584998912. Throughput: 0: 4978.6. Samples: 584992474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:06,881][25689] Avg episode reward: [(0, '-23.540')] [2022-07-10 04:41:08,412][26022] Updated weights on worker 0-0, policy_version 571298 (0.00090) [2022-07-10 04:41:10,227][26022] Updated weights on worker 0-0, policy_version 571308 (0.00091) [2022-07-10 04:41:11,981][25689] Fps is (10 sec: 5565.8, 60 sec: 5667.3, 300 sec: 5653.1). Total num frames: 585027584. Throughput: 0: 5845.9. Samples: 585026700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:11,982][25689] Avg episode reward: [(0, '-23.677')] [2022-07-10 04:41:12,185][26022] Updated weights on worker 0-0, policy_version 571318 (0.00084) [2022-07-10 04:41:13,746][26022] Updated weights on worker 0-0, policy_version 571328 (0.00082) [2022-07-10 04:41:15,772][26022] Updated weights on worker 0-0, policy_version 571338 (0.00084) [2022-07-10 04:41:16,985][25689] Fps is (10 sec: 5776.1, 60 sec: 5669.5, 300 sec: 5649.9). Total num frames: 585057280. Throughput: 0: 5839.1. Samples: 585061158. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:16,986][25689] Avg episode reward: [(0, '-22.928')] [2022-07-10 04:41:17,446][26022] Updated weights on worker 0-0, policy_version 571348 (0.00086) [2022-07-10 04:41:19,347][26022] Updated weights on worker 0-0, policy_version 571358 (0.00087) [2022-07-10 04:41:20,996][26022] Updated weights on worker 0-0, policy_version 571368 (0.00089) [2022-07-10 04:41:22,025][25689] Fps is (10 sec: 5709.1, 60 sec: 5649.6, 300 sec: 5652.8). Total num frames: 585084928. Throughput: 0: 5838.1. Samples: 585095282. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:22,027][25689] Avg episode reward: [(0, '-23.162')] [2022-07-10 04:41:22,851][26022] Updated weights on worker 0-0, policy_version 571378 (0.00083) [2022-07-10 04:41:24,658][26022] Updated weights on worker 0-0, policy_version 571388 (0.00096) [2022-07-10 04:41:26,505][26022] Updated weights on worker 0-0, policy_version 571398 (0.00617) [2022-07-10 04:41:27,048][25689] Fps is (10 sec: 5698.9, 60 sec: 5681.5, 300 sec: 5655.0). Total num frames: 585114624. Throughput: 0: 5947.3. Samples: 585112416. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:27,048][25689] Avg episode reward: [(0, '-24.682')] [2022-07-10 04:41:28,260][26022] Updated weights on worker 0-0, policy_version 571408 (0.00084) [2022-07-10 04:41:30,263][26022] Updated weights on worker 0-0, policy_version 571418 (0.00081) [2022-07-10 04:41:31,782][26022] Updated weights on worker 0-0, policy_version 571428 (0.00091) [2022-07-10 04:41:32,143][25689] Fps is (10 sec: 5667.7, 60 sec: 5649.7, 300 sec: 5651.1). Total num frames: 585142272. Throughput: 0: 5926.5. Samples: 585146190. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:32,144][25689] Avg episode reward: [(0, '-24.699')] [2022-07-10 04:41:33,844][26022] Updated weights on worker 0-0, policy_version 571438 (0.00092) [2022-07-10 04:41:35,558][26022] Updated weights on worker 0-0, policy_version 571448 (0.00103) [2022-07-10 04:41:37,228][25689] Fps is (10 sec: 5532.0, 60 sec: 5661.6, 300 sec: 5653.2). Total num frames: 585170944. Throughput: 0: 5886.3. Samples: 585180314. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:37,229][25689] Avg episode reward: [(0, '-24.318')] [2022-07-10 04:41:37,489][26022] Updated weights on worker 0-0, policy_version 571458 (0.00096) [2022-07-10 04:41:39,317][26022] Updated weights on worker 0-0, policy_version 571468 (0.00084) [2022-07-10 04:41:40,988][26022] Updated weights on worker 0-0, policy_version 571478 (0.00087) [2022-07-10 04:41:42,266][25689] Fps is (10 sec: 5664.6, 60 sec: 5642.7, 300 sec: 5649.4). Total num frames: 585199616. Throughput: 0: 5047.8. Samples: 585197454. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:42,267][25689] Avg episode reward: [(0, '-24.874')] [2022-07-10 04:41:42,765][26022] Updated weights on worker 0-0, policy_version 571488 (0.00085) [2022-07-10 04:41:44,437][26022] Updated weights on worker 0-0, policy_version 571498 (0.00098) [2022-07-10 04:41:46,593][26022] Updated weights on worker 0-0, policy_version 571508 (0.00093) [2022-07-10 04:41:47,319][25689] Fps is (10 sec: 5784.2, 60 sec: 5672.7, 300 sec: 5656.3). Total num frames: 585229312. Throughput: 0: 5880.4. Samples: 585231622. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:47,321][25689] Avg episode reward: [(0, '-25.060')] [2022-07-10 04:41:48,311][26022] Updated weights on worker 0-0, policy_version 571518 (0.00094) [2022-07-10 04:41:50,023][26022] Updated weights on worker 0-0, policy_version 571528 (0.00077) [2022-07-10 04:41:51,995][26022] Updated weights on worker 0-0, policy_version 571538 (0.00084) [2022-07-10 04:41:52,365][25689] Fps is (10 sec: 5678.4, 60 sec: 5645.5, 300 sec: 5646.4). Total num frames: 585256960. Throughput: 0: 5911.6. Samples: 585265734. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:52,366][25689] Avg episode reward: [(0, '-24.170')] [2022-07-10 04:41:53,500][26022] Updated weights on worker 0-0, policy_version 571548 (0.00087) [2022-07-10 04:41:55,467][26022] Updated weights on worker 0-0, policy_version 571558 (0.00090) [2022-07-10 04:41:57,264][26022] Updated weights on worker 0-0, policy_version 571568 (0.00091) [2022-07-10 04:41:57,367][25689] Fps is (10 sec: 5707.2, 60 sec: 5664.3, 300 sec: 5660.9). Total num frames: 585286656. Throughput: 0: 5089.9. Samples: 585282816. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:41:57,367][25689] Avg episode reward: [(0, '-23.824')] [2022-07-10 04:41:58,843][26022] Updated weights on worker 0-0, policy_version 571578 (0.00088) [2022-07-10 04:42:00,279][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:42:00,286][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000571585_585303040.pth [2022-07-10 04:42:00,286][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000569597_583267328.pth [2022-07-10 04:42:00,885][26022] Updated weights on worker 0-0, policy_version 571588 (0.00086) [2022-07-10 04:42:02,445][25689] Fps is (10 sec: 5587.4, 60 sec: 5645.5, 300 sec: 5652.7). Total num frames: 585313280. Throughput: 0: 5930.8. Samples: 585317128. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 04:42:02,445][25689] Avg episode reward: [(0, '-23.816')] [2022-07-10 04:42:02,941][26022] Updated weights on worker 0-0, policy_version 571598 (0.00051) [2022-07-10 04:42:04,695][26022] Updated weights on worker 0-0, policy_version 571608 (0.00084) [2022-07-10 04:42:06,608][26022] Updated weights on worker 0-0, policy_version 571618 (0.00093) [2022-07-10 04:42:07,459][25689] Fps is (10 sec: 5377.5, 60 sec: 5645.8, 300 sec: 5650.5). Total num frames: 585340928. Throughput: 0: 5836.8. Samples: 585349176. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:07,460][25689] Avg episode reward: [(0, '-25.150')] [2022-07-10 04:42:08,295][26022] Updated weights on worker 0-0, policy_version 571628 (0.00097) [2022-07-10 04:42:10,230][26022] Updated weights on worker 0-0, policy_version 571638 (0.00088) [2022-07-10 04:42:11,914][26022] Updated weights on worker 0-0, policy_version 571648 (0.00082) [2022-07-10 04:42:12,562][25689] Fps is (10 sec: 5566.7, 60 sec: 5645.6, 300 sec: 5655.4). Total num frames: 585369600. Throughput: 0: 4972.7. Samples: 585366168. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:12,562][25689] Avg episode reward: [(0, '-25.295')] [2022-07-10 04:42:13,958][26022] Updated weights on worker 0-0, policy_version 571658 (0.00080) [2022-07-10 04:42:15,463][26022] Updated weights on worker 0-0, policy_version 571668 (0.00530) [2022-07-10 04:42:17,488][26022] Updated weights on worker 0-0, policy_version 571678 (0.00083) [2022-07-10 04:42:17,587][25689] Fps is (10 sec: 5662.1, 60 sec: 5626.8, 300 sec: 5651.9). Total num frames: 585398272. Throughput: 0: 5815.7. Samples: 585400410. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:17,588][25689] Avg episode reward: [(0, '-25.840')] [2022-07-10 04:42:18,994][26022] Updated weights on worker 0-0, policy_version 571688 (0.00086) [2022-07-10 04:42:21,054][26022] Updated weights on worker 0-0, policy_version 571698 (0.00083) [2022-07-10 04:42:22,524][26022] Updated weights on worker 0-0, policy_version 571708 (0.00085) [2022-07-10 04:42:22,599][25689] Fps is (10 sec: 5917.5, 60 sec: 5680.1, 300 sec: 5655.7). Total num frames: 585428992. Throughput: 0: 5835.1. Samples: 585434728. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:22,599][25689] Avg episode reward: [(0, '-26.917')] [2022-07-10 04:42:24,553][26022] Updated weights on worker 0-0, policy_version 571718 (0.00088) [2022-07-10 04:42:26,405][26022] Updated weights on worker 0-0, policy_version 571728 (0.00089) [2022-07-10 04:42:27,612][25689] Fps is (10 sec: 5618.1, 60 sec: 5613.4, 300 sec: 5646.3). Total num frames: 585454592. Throughput: 0: 5096.6. Samples: 585451886. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:27,612][25689] Avg episode reward: [(0, '-27.475')] [2022-07-10 04:42:28,019][26022] Updated weights on worker 0-0, policy_version 571738 (0.00088) [2022-07-10 04:42:29,999][26022] Updated weights on worker 0-0, policy_version 571748 (0.00082) [2022-07-10 04:42:31,704][26022] Updated weights on worker 0-0, policy_version 571758 (0.00091) [2022-07-10 04:42:32,684][25689] Fps is (10 sec: 5584.6, 60 sec: 5666.3, 300 sec: 5651.8). Total num frames: 585485312. Throughput: 0: 5945.4. Samples: 585485798. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:32,684][25689] Avg episode reward: [(0, '-26.748')] [2022-07-10 04:42:33,703][26022] Updated weights on worker 0-0, policy_version 571768 (0.00083) [2022-07-10 04:42:35,365][26022] Updated weights on worker 0-0, policy_version 571778 (0.00089) [2022-07-10 04:42:36,989][26022] Updated weights on worker 0-0, policy_version 571788 (0.00087) [2022-07-10 04:42:37,756][25689] Fps is (10 sec: 5854.9, 60 sec: 5667.5, 300 sec: 5653.9). Total num frames: 585513984. Throughput: 0: 5948.1. Samples: 585520376. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:37,757][25689] Avg episode reward: [(0, '-24.668')] [2022-07-10 04:42:39,056][26022] Updated weights on worker 0-0, policy_version 571798 (0.00093) [2022-07-10 04:42:40,598][26022] Updated weights on worker 0-0, policy_version 571808 (0.00088) [2022-07-10 04:42:42,593][26022] Updated weights on worker 0-0, policy_version 571818 (0.00091) [2022-07-10 04:42:42,793][25689] Fps is (10 sec: 5672.3, 60 sec: 5667.6, 300 sec: 5649.8). Total num frames: 585542656. Throughput: 0: 5089.2. Samples: 585537502. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:42,794][25689] Avg episode reward: [(0, '-24.417')] [2022-07-10 04:42:44,351][26022] Updated weights on worker 0-0, policy_version 571828 (0.00083) [2022-07-10 04:42:46,000][26022] Updated weights on worker 0-0, policy_version 571838 (0.00083) [2022-07-10 04:42:47,816][25689] Fps is (10 sec: 5598.4, 60 sec: 5636.6, 300 sec: 5647.5). Total num frames: 585570304. Throughput: 0: 5928.6. Samples: 585571668. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:47,818][25689] Avg episode reward: [(0, '-23.410')] [2022-07-10 04:42:48,075][26022] Updated weights on worker 0-0, policy_version 571848 (0.00090) [2022-07-10 04:42:49,768][26022] Updated weights on worker 0-0, policy_version 571858 (0.00091) [2022-07-10 04:42:51,419][26022] Updated weights on worker 0-0, policy_version 571868 (0.00082) [2022-07-10 04:42:52,891][25689] Fps is (10 sec: 5678.6, 60 sec: 5667.6, 300 sec: 5653.0). Total num frames: 585600000. Throughput: 0: 5952.4. Samples: 585606082. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:52,892][25689] Avg episode reward: [(0, '-24.235')] [2022-07-10 04:42:53,483][26022] Updated weights on worker 0-0, policy_version 571878 (0.00087) [2022-07-10 04:42:54,872][26022] Updated weights on worker 0-0, policy_version 571888 (0.00091) [2022-07-10 04:42:56,996][26022] Updated weights on worker 0-0, policy_version 571898 (0.00092) [2022-07-10 04:42:57,942][25689] Fps is (10 sec: 5865.0, 60 sec: 5663.0, 300 sec: 5655.6). Total num frames: 585629696. Throughput: 0: 5099.3. Samples: 585623314. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:42:57,943][25689] Avg episode reward: [(0, '-24.294')] [2022-07-10 04:42:58,710][26022] Updated weights on worker 0-0, policy_version 571908 (0.00091) [2022-07-10 04:43:00,441][26022] Updated weights on worker 0-0, policy_version 571918 (0.00082) [2022-07-10 04:43:02,709][26022] Updated weights on worker 0-0, policy_version 571928 (0.00083) [2022-07-10 04:43:03,029][25689] Fps is (10 sec: 5454.6, 60 sec: 5645.3, 300 sec: 5650.8). Total num frames: 585655296. Throughput: 0: 5946.0. Samples: 585657824. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:03,029][25689] Avg episode reward: [(0, '-26.044')] [2022-07-10 04:43:04,277][26022] Updated weights on worker 0-0, policy_version 571938 (0.00091) [2022-07-10 04:43:06,220][26022] Updated weights on worker 0-0, policy_version 571948 (0.00093) [2022-07-10 04:43:08,045][25689] Fps is (10 sec: 5371.9, 60 sec: 5662.0, 300 sec: 5648.3). Total num frames: 585683968. Throughput: 0: 5842.5. Samples: 585689858. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:08,047][25689] Avg episode reward: [(0, '-27.231')] [2022-07-10 04:43:08,099][26022] Updated weights on worker 0-0, policy_version 571958 (0.00092) [2022-07-10 04:43:09,723][26022] Updated weights on worker 0-0, policy_version 571968 (0.00085) [2022-07-10 04:43:11,818][26022] Updated weights on worker 0-0, policy_version 571978 (0.00089) [2022-07-10 04:43:13,187][25689] Fps is (10 sec: 5745.7, 60 sec: 5675.3, 300 sec: 5649.5). Total num frames: 585713664. Throughput: 0: 4961.9. Samples: 585706788. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:13,188][25689] Avg episode reward: [(0, '-27.189')] [2022-07-10 04:43:13,476][26022] Updated weights on worker 0-0, policy_version 571988 (0.00090) [2022-07-10 04:43:15,253][26022] Updated weights on worker 0-0, policy_version 571998 (0.00085) [2022-07-10 04:43:17,155][26022] Updated weights on worker 0-0, policy_version 572008 (0.00072) [2022-07-10 04:43:18,209][25689] Fps is (10 sec: 5743.0, 60 sec: 5675.6, 300 sec: 5652.7). Total num frames: 585742336. Throughput: 0: 5804.0. Samples: 585740940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:18,209][25689] Avg episode reward: [(0, '-26.934')] [2022-07-10 04:43:18,687][26022] Updated weights on worker 0-0, policy_version 572018 (0.00087) [2022-07-10 04:43:20,754][26022] Updated weights on worker 0-0, policy_version 572028 (0.00091) [2022-07-10 04:43:22,472][26022] Updated weights on worker 0-0, policy_version 572038 (0.00092) [2022-07-10 04:43:23,268][25689] Fps is (10 sec: 5688.6, 60 sec: 5637.4, 300 sec: 5648.5). Total num frames: 585771008. Throughput: 0: 5793.0. Samples: 585775068. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:23,269][25689] Avg episode reward: [(0, '-26.907')] [2022-07-10 04:43:24,322][26022] Updated weights on worker 0-0, policy_version 572048 (0.00082) [2022-07-10 04:43:26,093][26022] Updated weights on worker 0-0, policy_version 572058 (0.00089) [2022-07-10 04:43:27,742][26022] Updated weights on worker 0-0, policy_version 572068 (0.00082) [2022-07-10 04:43:28,281][25689] Fps is (10 sec: 5591.8, 60 sec: 5671.2, 300 sec: 5649.9). Total num frames: 585798656. Throughput: 0: 5078.3. Samples: 585792618. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:28,281][25689] Avg episode reward: [(0, '-25.972')] [2022-07-10 04:43:29,846][26022] Updated weights on worker 0-0, policy_version 572078 (0.00085) [2022-07-10 04:43:31,551][26022] Updated weights on worker 0-0, policy_version 572088 (0.00085) [2022-07-10 04:43:33,294][26022] Updated weights on worker 0-0, policy_version 572098 (0.00083) [2022-07-10 04:43:33,372][25689] Fps is (10 sec: 5776.8, 60 sec: 5669.4, 300 sec: 5656.1). Total num frames: 585829376. Throughput: 0: 5928.0. Samples: 585826440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:33,372][25689] Avg episode reward: [(0, '-24.787')] [2022-07-10 04:43:34,971][26022] Updated weights on worker 0-0, policy_version 572108 (0.00086) [2022-07-10 04:43:37,027][26022] Updated weights on worker 0-0, policy_version 572118 (0.00087) [2022-07-10 04:43:38,394][25689] Fps is (10 sec: 5973.7, 60 sec: 5690.9, 300 sec: 5662.6). Total num frames: 585859072. Throughput: 0: 5945.8. Samples: 585860960. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:38,395][25689] Avg episode reward: [(0, '-24.754')] [2022-07-10 04:43:38,396][26022] Updated weights on worker 0-0, policy_version 572128 (0.00092) [2022-07-10 04:43:40,408][26022] Updated weights on worker 0-0, policy_version 572138 (0.00084) [2022-07-10 04:43:42,227][26022] Updated weights on worker 0-0, policy_version 572148 (0.00097) [2022-07-10 04:43:43,412][25689] Fps is (10 sec: 5507.5, 60 sec: 5642.1, 300 sec: 5648.8). Total num frames: 585884672. Throughput: 0: 5962.6. Samples: 585895178. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:43,412][25689] Avg episode reward: [(0, '-25.132')] [2022-07-10 04:43:44,024][26022] Updated weights on worker 0-0, policy_version 572158 (0.00085) [2022-07-10 04:43:45,923][26022] Updated weights on worker 0-0, policy_version 572168 (0.00093) [2022-07-10 04:43:47,513][26022] Updated weights on worker 0-0, policy_version 572178 (0.00085) [2022-07-10 04:43:48,459][25689] Fps is (10 sec: 5596.0, 60 sec: 5690.5, 300 sec: 5655.5). Total num frames: 585915392. Throughput: 0: 5923.2. Samples: 585912136. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:48,459][25689] Avg episode reward: [(0, '-24.753')] [2022-07-10 04:43:49,605][26022] Updated weights on worker 0-0, policy_version 572188 (0.00093) [2022-07-10 04:43:51,316][26022] Updated weights on worker 0-0, policy_version 572198 (0.00083) [2022-07-10 04:43:53,209][26022] Updated weights on worker 0-0, policy_version 572208 (0.00054) [2022-07-10 04:43:53,585][25689] Fps is (10 sec: 5838.1, 60 sec: 5668.9, 300 sec: 5653.5). Total num frames: 585944064. Throughput: 0: 5903.6. Samples: 585945770. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:53,585][25689] Avg episode reward: [(0, '-26.301')] [2022-07-10 04:43:54,928][26022] Updated weights on worker 0-0, policy_version 572218 (0.00084) [2022-07-10 04:43:56,571][26022] Updated weights on worker 0-0, policy_version 572228 (0.00082) [2022-07-10 04:43:58,627][25689] Fps is (10 sec: 5438.2, 60 sec: 5619.1, 300 sec: 5645.9). Total num frames: 585970688. Throughput: 0: 5895.0. Samples: 585980230. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:43:58,627][25689] Avg episode reward: [(0, '-26.106')] [2022-07-10 04:43:58,690][26022] Updated weights on worker 0-0, policy_version 572238 (0.00096) [2022-07-10 04:44:00,060][26022] Updated weights on worker 0-0, policy_version 572248 (0.00088) [2022-07-10 04:44:00,308][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:44:00,322][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000572249_585982976.pth [2022-07-10 04:44:00,323][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000570259_583945216.pth [2022-07-10 04:44:02,619][26022] Updated weights on worker 0-0, policy_version 572258 (0.00080) [2022-07-10 04:44:03,710][25689] Fps is (10 sec: 5461.3, 60 sec: 5670.0, 300 sec: 5651.4). Total num frames: 585999360. Throughput: 0: 5041.1. Samples: 585997502. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:03,710][25689] Avg episode reward: [(0, '-25.788')] [2022-07-10 04:44:04,194][26022] Updated weights on worker 0-0, policy_version 572268 (0.00092) [2022-07-10 04:44:06,108][26022] Updated weights on worker 0-0, policy_version 572278 (0.00087) [2022-07-10 04:44:08,006][26022] Updated weights on worker 0-0, policy_version 572288 (0.00094) [2022-07-10 04:44:08,764][25689] Fps is (10 sec: 5555.6, 60 sec: 5649.6, 300 sec: 5654.6). Total num frames: 586027008. Throughput: 0: 5759.6. Samples: 586029088. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:08,767][25689] Avg episode reward: [(0, '-26.157')] [2022-07-10 04:44:09,622][26022] Updated weights on worker 0-0, policy_version 572298 (0.00092) [2022-07-10 04:44:11,587][26022] Updated weights on worker 0-0, policy_version 572308 (0.00087) [2022-07-10 04:44:13,358][26022] Updated weights on worker 0-0, policy_version 572318 (0.00086) [2022-07-10 04:44:13,866][25689] Fps is (10 sec: 5646.4, 60 sec: 5653.4, 300 sec: 5650.1). Total num frames: 586056704. Throughput: 0: 5778.0. Samples: 586062952. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:13,866][25689] Avg episode reward: [(0, '-26.055')] [2022-07-10 04:44:15,299][26022] Updated weights on worker 0-0, policy_version 572328 (0.00096) [2022-07-10 04:44:17,034][26022] Updated weights on worker 0-0, policy_version 572338 (0.00089) [2022-07-10 04:44:18,847][26022] Updated weights on worker 0-0, policy_version 572348 (0.00086) [2022-07-10 04:44:18,928][25689] Fps is (10 sec: 5641.9, 60 sec: 5632.7, 300 sec: 5645.8). Total num frames: 586084352. Throughput: 0: 4906.6. Samples: 586079842. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:18,930][25689] Avg episode reward: [(0, '-26.662')] [2022-07-10 04:44:20,525][26022] Updated weights on worker 0-0, policy_version 572358 (0.00097) [2022-07-10 04:44:22,450][26022] Updated weights on worker 0-0, policy_version 572368 (0.00112) [2022-07-10 04:44:23,941][25689] Fps is (10 sec: 5589.7, 60 sec: 5637.0, 300 sec: 5645.6). Total num frames: 586113024. Throughput: 0: 5759.5. Samples: 586114026. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:23,942][25689] Avg episode reward: [(0, '-25.853')] [2022-07-10 04:44:24,205][26022] Updated weights on worker 0-0, policy_version 572378 (0.00085) [2022-07-10 04:44:26,171][26022] Updated weights on worker 0-0, policy_version 572388 (0.00091) [2022-07-10 04:44:27,775][26022] Updated weights on worker 0-0, policy_version 572398 (0.00084) [2022-07-10 04:44:28,987][25689] Fps is (10 sec: 5701.0, 60 sec: 5650.8, 300 sec: 5645.9). Total num frames: 586141696. Throughput: 0: 5898.8. Samples: 586148376. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:28,987][25689] Avg episode reward: [(0, '-26.268')] [2022-07-10 04:44:29,702][26022] Updated weights on worker 0-0, policy_version 572408 (0.00091) [2022-07-10 04:44:31,307][26022] Updated weights on worker 0-0, policy_version 572418 (0.00095) [2022-07-10 04:44:33,234][26022] Updated weights on worker 0-0, policy_version 572428 (0.00088) [2022-07-10 04:44:34,035][25689] Fps is (10 sec: 5782.8, 60 sec: 5638.0, 300 sec: 5652.7). Total num frames: 586171392. Throughput: 0: 5081.8. Samples: 586165448. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:34,035][25689] Avg episode reward: [(0, '-26.600')] [2022-07-10 04:44:35,140][26022] Updated weights on worker 0-0, policy_version 572438 (0.00087) [2022-07-10 04:44:36,678][26022] Updated weights on worker 0-0, policy_version 572448 (0.00082) [2022-07-10 04:44:38,793][26022] Updated weights on worker 0-0, policy_version 572458 (0.00093) [2022-07-10 04:44:39,059][25689] Fps is (10 sec: 5591.7, 60 sec: 5587.2, 300 sec: 5645.7). Total num frames: 586198016. Throughput: 0: 5940.3. Samples: 586199424. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:39,059][25689] Avg episode reward: [(0, '-27.259')] [2022-07-10 04:44:40,264][26022] Updated weights on worker 0-0, policy_version 572468 (0.00089) [2022-07-10 04:44:42,368][26022] Updated weights on worker 0-0, policy_version 572478 (0.00090) [2022-07-10 04:44:44,020][26022] Updated weights on worker 0-0, policy_version 572488 (0.00091) [2022-07-10 04:44:44,062][25689] Fps is (10 sec: 5616.8, 60 sec: 5656.0, 300 sec: 5649.2). Total num frames: 586227712. Throughput: 0: 5939.2. Samples: 586233526. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:44,062][25689] Avg episode reward: [(0, '-27.510')] [2022-07-10 04:44:46,039][26022] Updated weights on worker 0-0, policy_version 572498 (0.00087) [2022-07-10 04:44:47,543][26022] Updated weights on worker 0-0, policy_version 572508 (0.00089) [2022-07-10 04:44:49,076][25689] Fps is (10 sec: 5622.2, 60 sec: 5591.5, 300 sec: 5642.9). Total num frames: 586254336. Throughput: 0: 5084.3. Samples: 586250518. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:49,077][25689] Avg episode reward: [(0, '-26.638')] [2022-07-10 04:44:49,491][26022] Updated weights on worker 0-0, policy_version 572518 (0.00085) [2022-07-10 04:44:51,208][26022] Updated weights on worker 0-0, policy_version 572528 (0.00091) [2022-07-10 04:44:53,116][26022] Updated weights on worker 0-0, policy_version 572538 (0.00094) [2022-07-10 04:44:54,150][25689] Fps is (10 sec: 5684.3, 60 sec: 5630.2, 300 sec: 5652.4). Total num frames: 586285056. Throughput: 0: 5918.5. Samples: 586284500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:54,150][25689] Avg episode reward: [(0, '-27.079')] [2022-07-10 04:44:54,790][26022] Updated weights on worker 0-0, policy_version 572548 (0.00094) [2022-07-10 04:44:56,604][26022] Updated weights on worker 0-0, policy_version 572558 (0.00084) [2022-07-10 04:44:58,412][26022] Updated weights on worker 0-0, policy_version 572568 (0.00085) [2022-07-10 04:44:59,224][25689] Fps is (10 sec: 5953.4, 60 sec: 5677.8, 300 sec: 5651.4). Total num frames: 586314752. Throughput: 0: 5939.8. Samples: 586319204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:44:59,225][25689] Avg episode reward: [(0, '-28.208')] [2022-07-10 04:45:00,242][26022] Updated weights on worker 0-0, policy_version 572578 (0.00083) [2022-07-10 04:45:02,334][26022] Updated weights on worker 0-0, policy_version 572588 (0.00084) [2022-07-10 04:45:04,112][26022] Updated weights on worker 0-0, policy_version 572598 (0.00087) [2022-07-10 04:45:04,250][25689] Fps is (10 sec: 5474.9, 60 sec: 5632.5, 300 sec: 5647.6). Total num frames: 586340352. Throughput: 0: 5061.1. Samples: 586335700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:45:04,250][25689] Avg episode reward: [(0, '-27.598')] [2022-07-10 04:45:06,043][26022] Updated weights on worker 0-0, policy_version 572608 (0.00088) [2022-07-10 04:45:07,810][26022] Updated weights on worker 0-0, policy_version 572618 (0.00095) [2022-07-10 04:45:09,275][25689] Fps is (10 sec: 5399.9, 60 sec: 5652.2, 300 sec: 5656.0). Total num frames: 586369024. Throughput: 0: 5843.5. Samples: 586368548. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 04:45:09,276][25689] Avg episode reward: [(0, '-26.732')] [2022-07-10 04:45:09,735][26022] Updated weights on worker 0-0, policy_version 572628 (0.00084) [2022-07-10 04:45:11,429][26022] Updated weights on worker 0-0, policy_version 572638 (0.00088) [2022-07-10 04:45:13,075][26022] Updated weights on worker 0-0, policy_version 572648 (0.00087) [2022-07-10 04:45:14,376][25689] Fps is (10 sec: 5561.5, 60 sec: 5618.3, 300 sec: 5647.7). Total num frames: 586396672. Throughput: 0: 5832.5. Samples: 586402472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:14,377][25689] Avg episode reward: [(0, '-27.190')] [2022-07-10 04:45:15,101][26022] Updated weights on worker 0-0, policy_version 572658 (0.00091) [2022-07-10 04:45:16,912][26022] Updated weights on worker 0-0, policy_version 572668 (0.00089) [2022-07-10 04:45:18,640][26022] Updated weights on worker 0-0, policy_version 572678 (0.00089) [2022-07-10 04:45:19,407][25689] Fps is (10 sec: 5558.2, 60 sec: 5638.2, 300 sec: 5647.3). Total num frames: 586425344. Throughput: 0: 4958.3. Samples: 586419278. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:19,408][25689] Avg episode reward: [(0, '-27.967')] [2022-07-10 04:45:20,640][26022] Updated weights on worker 0-0, policy_version 572688 (0.00091) [2022-07-10 04:45:22,368][26022] Updated weights on worker 0-0, policy_version 572698 (0.00088) [2022-07-10 04:45:24,108][26022] Updated weights on worker 0-0, policy_version 572708 (0.00077) [2022-07-10 04:45:24,416][25689] Fps is (10 sec: 5813.9, 60 sec: 5655.5, 300 sec: 5654.0). Total num frames: 586455040. Throughput: 0: 5829.2. Samples: 586453252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:24,418][25689] Avg episode reward: [(0, '-26.904')] [2022-07-10 04:45:26,025][26022] Updated weights on worker 0-0, policy_version 572718 (0.00093) [2022-07-10 04:45:27,687][26022] Updated weights on worker 0-0, policy_version 572728 (0.00089) [2022-07-10 04:45:29,422][25689] Fps is (10 sec: 5623.6, 60 sec: 5625.3, 300 sec: 5645.8). Total num frames: 586481664. Throughput: 0: 5897.8. Samples: 586487376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:29,427][25689] Avg episode reward: [(0, '-26.269')] [2022-07-10 04:45:29,724][26022] Updated weights on worker 0-0, policy_version 572738 (0.00084) [2022-07-10 04:45:31,313][26022] Updated weights on worker 0-0, policy_version 572748 (0.00081) [2022-07-10 04:45:33,181][26022] Updated weights on worker 0-0, policy_version 572758 (0.00089) [2022-07-10 04:45:34,514][25689] Fps is (10 sec: 5678.7, 60 sec: 5638.2, 300 sec: 5655.0). Total num frames: 586512384. Throughput: 0: 5053.9. Samples: 586504246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:34,516][25689] Avg episode reward: [(0, '-25.949')] [2022-07-10 04:45:35,054][26022] Updated weights on worker 0-0, policy_version 572768 (0.00086) [2022-07-10 04:45:36,711][26022] Updated weights on worker 0-0, policy_version 572778 (0.00088) [2022-07-10 04:45:38,762][26022] Updated weights on worker 0-0, policy_version 572788 (0.00094) [2022-07-10 04:45:39,548][25689] Fps is (10 sec: 5764.7, 60 sec: 5654.2, 300 sec: 5647.8). Total num frames: 586540032. Throughput: 0: 5907.9. Samples: 586538264. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:39,548][25689] Avg episode reward: [(0, '-24.895')] [2022-07-10 04:45:40,247][26022] Updated weights on worker 0-0, policy_version 572798 (0.00060) [2022-07-10 04:45:42,238][26022] Updated weights on worker 0-0, policy_version 572808 (0.00092) [2022-07-10 04:45:43,909][26022] Updated weights on worker 0-0, policy_version 572818 (0.00088) [2022-07-10 04:45:44,598][25689] Fps is (10 sec: 5585.2, 60 sec: 5632.9, 300 sec: 5650.5). Total num frames: 586568704. Throughput: 0: 5924.7. Samples: 586572824. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:44,598][25689] Avg episode reward: [(0, '-24.608')] [2022-07-10 04:45:45,887][26022] Updated weights on worker 0-0, policy_version 572828 (0.00086) [2022-07-10 04:45:47,596][26022] Updated weights on worker 0-0, policy_version 572838 (0.01388) [2022-07-10 04:45:49,542][26022] Updated weights on worker 0-0, policy_version 572848 (0.00089) [2022-07-10 04:45:49,607][25689] Fps is (10 sec: 5599.0, 60 sec: 5650.3, 300 sec: 5645.6). Total num frames: 586596352. Throughput: 0: 5921.7. Samples: 586606900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:49,607][25689] Avg episode reward: [(0, '-24.731')] [2022-07-10 04:45:51,070][26022] Updated weights on worker 0-0, policy_version 572858 (0.00088) [2022-07-10 04:45:53,111][26022] Updated weights on worker 0-0, policy_version 572868 (0.00102) [2022-07-10 04:45:54,639][26022] Updated weights on worker 0-0, policy_version 572878 (0.00085) [2022-07-10 04:45:54,737][25689] Fps is (10 sec: 5756.6, 60 sec: 5645.0, 300 sec: 5650.5). Total num frames: 586627072. Throughput: 0: 5917.5. Samples: 586623916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:54,738][25689] Avg episode reward: [(0, '-23.809')] [2022-07-10 04:45:56,604][26022] Updated weights on worker 0-0, policy_version 572888 (0.00096) [2022-07-10 04:45:58,347][26022] Updated weights on worker 0-0, policy_version 572898 (0.00086) [2022-07-10 04:45:59,817][25689] Fps is (10 sec: 5817.0, 60 sec: 5627.6, 300 sec: 5653.5). Total num frames: 586655744. Throughput: 0: 5922.7. Samples: 586658312. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:45:59,817][25689] Avg episode reward: [(0, '-23.684')] [2022-07-10 04:46:00,120][26022] Updated weights on worker 0-0, policy_version 572908 (0.00103) [2022-07-10 04:46:00,362][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:46:00,376][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000572909_586658816.pth [2022-07-10 04:46:00,376][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000570921_584623104.pth [2022-07-10 04:46:02,233][26022] Updated weights on worker 0-0, policy_version 572918 (0.00090) [2022-07-10 04:46:04,206][26022] Updated weights on worker 0-0, policy_version 572928 (0.00084) [2022-07-10 04:46:04,839][25689] Fps is (10 sec: 5372.6, 60 sec: 5628.0, 300 sec: 5646.5). Total num frames: 586681344. Throughput: 0: 5813.6. Samples: 586690496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:04,839][25689] Avg episode reward: [(0, '-24.123')] [2022-07-10 04:46:06,034][26022] Updated weights on worker 0-0, policy_version 572938 (0.00087) [2022-07-10 04:46:07,911][26022] Updated weights on worker 0-0, policy_version 572948 (0.00090) [2022-07-10 04:46:09,356][26022] Updated weights on worker 0-0, policy_version 572958 (0.00083) [2022-07-10 04:46:09,908][25689] Fps is (10 sec: 5580.9, 60 sec: 5657.6, 300 sec: 5654.0). Total num frames: 586712064. Throughput: 0: 4961.5. Samples: 586707626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:09,909][25689] Avg episode reward: [(0, '-25.420')] [2022-07-10 04:46:11,336][26022] Updated weights on worker 0-0, policy_version 572968 (0.00086) [2022-07-10 04:46:12,983][26022] Updated weights on worker 0-0, policy_version 572978 (0.00099) [2022-07-10 04:46:14,979][25689] Fps is (10 sec: 5655.0, 60 sec: 5643.6, 300 sec: 5642.4). Total num frames: 586738688. Throughput: 0: 5815.9. Samples: 586741642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:14,980][25689] Avg episode reward: [(0, '-25.592')] [2022-07-10 04:46:15,060][26022] Updated weights on worker 0-0, policy_version 572988 (0.00086) [2022-07-10 04:46:16,777][26022] Updated weights on worker 0-0, policy_version 572998 (0.00089) [2022-07-10 04:46:18,671][26022] Updated weights on worker 0-0, policy_version 573008 (0.00092) [2022-07-10 04:46:19,992][25689] Fps is (10 sec: 5584.9, 60 sec: 5662.1, 300 sec: 5649.8). Total num frames: 586768384. Throughput: 0: 5809.6. Samples: 586775526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:19,993][25689] Avg episode reward: [(0, '-25.801')] [2022-07-10 04:46:20,437][26022] Updated weights on worker 0-0, policy_version 573018 (0.00087) [2022-07-10 04:46:22,152][26022] Updated weights on worker 0-0, policy_version 573028 (0.00096) [2022-07-10 04:46:24,083][26022] Updated weights on worker 0-0, policy_version 573038 (0.00087) [2022-07-10 04:46:25,037][25689] Fps is (10 sec: 5701.5, 60 sec: 5625.0, 300 sec: 5642.5). Total num frames: 586796032. Throughput: 0: 5059.7. Samples: 586792692. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:25,037][25689] Avg episode reward: [(0, '-25.536')] [2022-07-10 04:46:25,816][26022] Updated weights on worker 0-0, policy_version 573048 (0.00093) [2022-07-10 04:46:27,761][26022] Updated weights on worker 0-0, policy_version 573058 (0.00080) [2022-07-10 04:46:29,669][26022] Updated weights on worker 0-0, policy_version 573068 (0.00086) [2022-07-10 04:46:30,056][25689] Fps is (10 sec: 5596.4, 60 sec: 5657.6, 300 sec: 5647.4). Total num frames: 586824704. Throughput: 0: 5922.0. Samples: 586826942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:30,056][25689] Avg episode reward: [(0, '-25.983')] [2022-07-10 04:46:31,194][26022] Updated weights on worker 0-0, policy_version 573078 (0.00088) [2022-07-10 04:46:33,104][26022] Updated weights on worker 0-0, policy_version 573088 (0.00084) [2022-07-10 04:46:34,796][26022] Updated weights on worker 0-0, policy_version 573098 (0.00094) [2022-07-10 04:46:35,193][25689] Fps is (10 sec: 5646.2, 60 sec: 5619.6, 300 sec: 5646.4). Total num frames: 586853376. Throughput: 0: 5906.4. Samples: 586861032. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:35,193][25689] Avg episode reward: [(0, '-25.709')] [2022-07-10 04:46:36,650][26022] Updated weights on worker 0-0, policy_version 573108 (0.00089) [2022-07-10 04:46:38,525][26022] Updated weights on worker 0-0, policy_version 573118 (0.00094) [2022-07-10 04:46:40,208][25689] Fps is (10 sec: 5749.4, 60 sec: 5655.1, 300 sec: 5650.3). Total num frames: 586883072. Throughput: 0: 5065.5. Samples: 586877928. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:40,208][26022] Updated weights on worker 0-0, policy_version 573128 (0.00080) [2022-07-10 04:46:40,209][25689] Avg episode reward: [(0, '-26.287')] [2022-07-10 04:46:42,066][26022] Updated weights on worker 0-0, policy_version 573138 (0.00100) [2022-07-10 04:46:43,946][26022] Updated weights on worker 0-0, policy_version 573148 (0.00091) [2022-07-10 04:46:45,280][25689] Fps is (10 sec: 5685.2, 60 sec: 5636.2, 300 sec: 5643.1). Total num frames: 586910720. Throughput: 0: 5904.2. Samples: 586912210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:45,281][25689] Avg episode reward: [(0, '-25.551')] [2022-07-10 04:46:45,602][26022] Updated weights on worker 0-0, policy_version 573158 (0.00093) [2022-07-10 04:46:47,594][26022] Updated weights on worker 0-0, policy_version 573168 (0.00086) [2022-07-10 04:46:49,300][26022] Updated weights on worker 0-0, policy_version 573178 (0.00084) [2022-07-10 04:46:50,379][25689] Fps is (10 sec: 5537.5, 60 sec: 5644.7, 300 sec: 5645.5). Total num frames: 586939392. Throughput: 0: 5876.5. Samples: 586946368. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:50,379][25689] Avg episode reward: [(0, '-26.495')] [2022-07-10 04:46:51,023][26022] Updated weights on worker 0-0, policy_version 573188 (0.00090) [2022-07-10 04:46:52,886][26022] Updated weights on worker 0-0, policy_version 573198 (0.00081) [2022-07-10 04:46:54,611][26022] Updated weights on worker 0-0, policy_version 573208 (0.00362) [2022-07-10 04:46:55,521][25689] Fps is (10 sec: 5699.3, 60 sec: 5626.8, 300 sec: 5642.8). Total num frames: 586969088. Throughput: 0: 5042.3. Samples: 586963532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:46:55,521][25689] Avg episode reward: [(0, '-26.289')] [2022-07-10 04:46:56,392][26022] Updated weights on worker 0-0, policy_version 573218 (0.00632) [2022-07-10 04:46:58,264][26022] Updated weights on worker 0-0, policy_version 573228 (0.00091) [2022-07-10 04:46:59,895][26022] Updated weights on worker 0-0, policy_version 573238 (0.00081) [2022-07-10 04:47:00,542][25689] Fps is (10 sec: 5843.6, 60 sec: 5649.0, 300 sec: 5654.2). Total num frames: 586998784. Throughput: 0: 5898.3. Samples: 586997866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:00,543][25689] Avg episode reward: [(0, '-25.549')] [2022-07-10 04:47:02,027][26022] Updated weights on worker 0-0, policy_version 573248 (0.00088) [2022-07-10 04:47:03,818][26022] Updated weights on worker 0-0, policy_version 573258 (0.00091) [2022-07-10 04:47:05,628][25689] Fps is (10 sec: 5572.5, 60 sec: 5660.0, 300 sec: 5649.4). Total num frames: 587025408. Throughput: 0: 5805.9. Samples: 587030348. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:05,628][25689] Avg episode reward: [(0, '-24.940')] [2022-07-10 04:47:05,644][26022] Updated weights on worker 0-0, policy_version 573268 (0.00090) [2022-07-10 04:47:07,528][26022] Updated weights on worker 0-0, policy_version 573278 (0.00090) [2022-07-10 04:47:09,282][26022] Updated weights on worker 0-0, policy_version 573288 (0.00088) [2022-07-10 04:47:10,678][25689] Fps is (10 sec: 5455.7, 60 sec: 5628.1, 300 sec: 5650.4). Total num frames: 587054080. Throughput: 0: 4969.4. Samples: 587047242. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:10,678][25689] Avg episode reward: [(0, '-26.326')] [2022-07-10 04:47:11,368][26022] Updated weights on worker 0-0, policy_version 573298 (0.00088) [2022-07-10 04:47:12,944][26022] Updated weights on worker 0-0, policy_version 573308 (0.00090) [2022-07-10 04:47:14,642][26022] Updated weights on worker 0-0, policy_version 573318 (0.00095) [2022-07-10 04:47:15,705][25689] Fps is (10 sec: 5690.5, 60 sec: 5665.9, 300 sec: 5650.4). Total num frames: 587082752. Throughput: 0: 5856.3. Samples: 587081736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:15,705][25689] Avg episode reward: [(0, '-26.496')] [2022-07-10 04:47:16,513][26022] Updated weights on worker 0-0, policy_version 573328 (0.00094) [2022-07-10 04:47:18,127][26022] Updated weights on worker 0-0, policy_version 573338 (0.00088) [2022-07-10 04:47:20,294][26022] Updated weights on worker 0-0, policy_version 573348 (0.00086) [2022-07-10 04:47:20,766][25689] Fps is (10 sec: 5785.6, 60 sec: 5661.4, 300 sec: 5646.0). Total num frames: 587112448. Throughput: 0: 5835.0. Samples: 587115872. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:20,767][25689] Avg episode reward: [(0, '-25.995')] [2022-07-10 04:47:21,885][26022] Updated weights on worker 0-0, policy_version 573358 (0.00089) [2022-07-10 04:47:23,668][26022] Updated weights on worker 0-0, policy_version 573368 (0.00095) [2022-07-10 04:47:25,471][26022] Updated weights on worker 0-0, policy_version 573378 (0.00086) [2022-07-10 04:47:25,798][25689] Fps is (10 sec: 5681.2, 60 sec: 5662.5, 300 sec: 5652.5). Total num frames: 587140096. Throughput: 0: 5942.7. Samples: 587150216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:25,807][25689] Avg episode reward: [(0, '-26.031')] [2022-07-10 04:47:27,068][26022] Updated weights on worker 0-0, policy_version 573388 (0.00092) [2022-07-10 04:47:29,099][26022] Updated weights on worker 0-0, policy_version 573398 (0.00093) [2022-07-10 04:47:30,708][26022] Updated weights on worker 0-0, policy_version 573408 (0.00090) [2022-07-10 04:47:30,824][25689] Fps is (10 sec: 5701.5, 60 sec: 5678.8, 300 sec: 5650.0). Total num frames: 587169792. Throughput: 0: 5961.7. Samples: 587167348. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:30,824][25689] Avg episode reward: [(0, '-26.956')] [2022-07-10 04:47:32,651][26022] Updated weights on worker 0-0, policy_version 573418 (0.00096) [2022-07-10 04:47:34,585][26022] Updated weights on worker 0-0, policy_version 573428 (0.00086) [2022-07-10 04:47:35,963][25689] Fps is (10 sec: 5742.5, 60 sec: 5678.6, 300 sec: 5648.7). Total num frames: 587198464. Throughput: 0: 5912.5. Samples: 587201510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:35,963][25689] Avg episode reward: [(0, '-25.943')] [2022-07-10 04:47:36,375][26022] Updated weights on worker 0-0, policy_version 573438 (0.00093) [2022-07-10 04:47:38,086][26022] Updated weights on worker 0-0, policy_version 573448 (0.00086) [2022-07-10 04:47:39,799][26022] Updated weights on worker 0-0, policy_version 573458 (0.00087) [2022-07-10 04:47:40,994][25689] Fps is (10 sec: 5537.8, 60 sec: 5643.4, 300 sec: 5645.4). Total num frames: 587226112. Throughput: 0: 5928.3. Samples: 587235788. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:40,996][25689] Avg episode reward: [(0, '-25.650')] [2022-07-10 04:47:41,696][26022] Updated weights on worker 0-0, policy_version 573468 (0.00083) [2022-07-10 04:47:43,659][26022] Updated weights on worker 0-0, policy_version 573478 (0.00089) [2022-07-10 04:47:45,272][26022] Updated weights on worker 0-0, policy_version 573488 (0.00089) [2022-07-10 04:47:46,001][25689] Fps is (10 sec: 5814.6, 60 sec: 5700.0, 300 sec: 5656.0). Total num frames: 587256832. Throughput: 0: 5078.2. Samples: 587252808. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:46,003][25689] Avg episode reward: [(0, '-25.435')] [2022-07-10 04:47:47,308][26022] Updated weights on worker 0-0, policy_version 573498 (0.00662) [2022-07-10 04:47:48,761][26022] Updated weights on worker 0-0, policy_version 573508 (0.00087) [2022-07-10 04:47:50,779][26022] Updated weights on worker 0-0, policy_version 573518 (0.00086) [2022-07-10 04:47:51,016][25689] Fps is (10 sec: 5721.6, 60 sec: 5674.1, 300 sec: 5646.8). Total num frames: 587283456. Throughput: 0: 5919.9. Samples: 587286886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:51,018][25689] Avg episode reward: [(0, '-26.702')] [2022-07-10 04:47:52,448][26022] Updated weights on worker 0-0, policy_version 573528 (0.00086) [2022-07-10 04:47:54,448][26022] Updated weights on worker 0-0, policy_version 573538 (0.00091) [2022-07-10 04:47:56,075][25689] Fps is (10 sec: 5489.1, 60 sec: 5665.0, 300 sec: 5643.2). Total num frames: 587312128. Throughput: 0: 5923.8. Samples: 587320650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:47:56,075][25689] Avg episode reward: [(0, '-26.074')] [2022-07-10 04:47:56,215][26022] Updated weights on worker 0-0, policy_version 573548 (0.00087) [2022-07-10 04:47:58,008][26022] Updated weights on worker 0-0, policy_version 573558 (0.00099) [2022-07-10 04:47:59,645][26022] Updated weights on worker 0-0, policy_version 573568 (0.00083) [2022-07-10 04:48:00,473][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:48:00,482][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000573572_587337728.pth [2022-07-10 04:48:00,482][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000571585_585303040.pth [2022-07-10 04:48:01,080][25689] Fps is (10 sec: 5698.3, 60 sec: 5649.7, 300 sec: 5655.1). Total num frames: 587340800. Throughput: 0: 5075.4. Samples: 587337730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:48:01,080][25689] Avg episode reward: [(0, '-25.974')] [2022-07-10 04:48:01,490][26022] Updated weights on worker 0-0, policy_version 573578 (0.00088) [2022-07-10 04:48:03,860][26022] Updated weights on worker 0-0, policy_version 573588 (0.00096) [2022-07-10 04:48:05,487][26022] Updated weights on worker 0-0, policy_version 573598 (0.00087) [2022-07-10 04:48:06,107][25689] Fps is (10 sec: 5409.8, 60 sec: 5638.2, 300 sec: 5644.6). Total num frames: 587366400. Throughput: 0: 5822.3. Samples: 587369872. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:48:06,107][25689] Avg episode reward: [(0, '-25.379')] [2022-07-10 04:48:07,467][26022] Updated weights on worker 0-0, policy_version 573608 (0.00086) [2022-07-10 04:48:09,170][26022] Updated weights on worker 0-0, policy_version 573618 (0.00093) [2022-07-10 04:48:11,063][26022] Updated weights on worker 0-0, policy_version 573628 (0.00097) [2022-07-10 04:48:11,157][25689] Fps is (10 sec: 5385.6, 60 sec: 5638.2, 300 sec: 5642.9). Total num frames: 587395072. Throughput: 0: 5804.7. Samples: 587403796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 04:48:11,157][25689] Avg episode reward: [(0, '-25.694')] [2022-07-10 04:48:12,771][26022] Updated weights on worker 0-0, policy_version 573638 (0.00088) [2022-07-10 04:48:14,499][26022] Updated weights on worker 0-0, policy_version 573648 (0.00085) [2022-07-10 04:48:16,223][25689] Fps is (10 sec: 5668.6, 60 sec: 5634.6, 300 sec: 5642.0). Total num frames: 587423744. Throughput: 0: 4986.3. Samples: 587421114. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:16,223][25689] Avg episode reward: [(0, '-25.765')] [2022-07-10 04:48:16,295][26022] Updated weights on worker 0-0, policy_version 573658 (0.00086) [2022-07-10 04:48:18,131][26022] Updated weights on worker 0-0, policy_version 573668 (0.00094) [2022-07-10 04:48:19,875][26022] Updated weights on worker 0-0, policy_version 573678 (0.00077) [2022-07-10 04:48:21,278][25689] Fps is (10 sec: 5665.7, 60 sec: 5618.2, 300 sec: 5642.1). Total num frames: 587452416. Throughput: 0: 5825.6. Samples: 587455398. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:21,279][25689] Avg episode reward: [(0, '-26.435')] [2022-07-10 04:48:21,851][26022] Updated weights on worker 0-0, policy_version 573688 (0.00084) [2022-07-10 04:48:23,428][26022] Updated weights on worker 0-0, policy_version 573698 (0.00088) [2022-07-10 04:48:25,359][26022] Updated weights on worker 0-0, policy_version 573708 (0.00085) [2022-07-10 04:48:26,289][25689] Fps is (10 sec: 5900.2, 60 sec: 5671.0, 300 sec: 5652.5). Total num frames: 587483136. Throughput: 0: 5938.7. Samples: 587489728. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:26,289][25689] Avg episode reward: [(0, '-27.098')] [2022-07-10 04:48:27,194][26022] Updated weights on worker 0-0, policy_version 573718 (0.00083) [2022-07-10 04:48:28,972][26022] Updated weights on worker 0-0, policy_version 573728 (0.00089) [2022-07-10 04:48:30,813][26022] Updated weights on worker 0-0, policy_version 573738 (0.00085) [2022-07-10 04:48:31,293][25689] Fps is (10 sec: 5623.8, 60 sec: 5605.3, 300 sec: 5636.9). Total num frames: 587508736. Throughput: 0: 5109.6. Samples: 587506682. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:31,293][25689] Avg episode reward: [(0, '-29.129')] [2022-07-10 04:48:32,724][26022] Updated weights on worker 0-0, policy_version 573748 (0.00092) [2022-07-10 04:48:34,560][26022] Updated weights on worker 0-0, policy_version 573758 (0.00084) [2022-07-10 04:48:36,260][26022] Updated weights on worker 0-0, policy_version 573768 (0.00110) [2022-07-10 04:48:36,358][25689] Fps is (10 sec: 5491.9, 60 sec: 5629.1, 300 sec: 5636.1). Total num frames: 587538432. Throughput: 0: 5934.5. Samples: 587540604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:36,358][25689] Avg episode reward: [(0, '-29.128')] [2022-07-10 04:48:38,128][26022] Updated weights on worker 0-0, policy_version 573778 (0.00096) [2022-07-10 04:48:39,930][26022] Updated weights on worker 0-0, policy_version 573788 (0.00091) [2022-07-10 04:48:41,364][25689] Fps is (10 sec: 5693.9, 60 sec: 5631.4, 300 sec: 5643.2). Total num frames: 587566080. Throughput: 0: 5933.1. Samples: 587574570. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:41,364][25689] Avg episode reward: [(0, '-28.131')] [2022-07-10 04:48:41,584][26022] Updated weights on worker 0-0, policy_version 573798 (0.00093) [2022-07-10 04:48:43,598][26022] Updated weights on worker 0-0, policy_version 573808 (0.00087) [2022-07-10 04:48:45,197][26022] Updated weights on worker 0-0, policy_version 573818 (0.00088) [2022-07-10 04:48:46,385][25689] Fps is (10 sec: 5820.8, 60 sec: 5630.1, 300 sec: 5643.7). Total num frames: 587596800. Throughput: 0: 5069.1. Samples: 587591598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:46,387][25689] Avg episode reward: [(0, '-27.021')] [2022-07-10 04:48:47,207][26022] Updated weights on worker 0-0, policy_version 573828 (0.00083) [2022-07-10 04:48:48,858][26022] Updated weights on worker 0-0, policy_version 573838 (0.00078) [2022-07-10 04:48:50,889][26022] Updated weights on worker 0-0, policy_version 573848 (0.00101) [2022-07-10 04:48:51,398][25689] Fps is (10 sec: 5612.7, 60 sec: 5613.3, 300 sec: 5635.5). Total num frames: 587622400. Throughput: 0: 5917.8. Samples: 587625662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:51,400][25689] Avg episode reward: [(0, '-26.742')] [2022-07-10 04:48:52,434][26022] Updated weights on worker 0-0, policy_version 573858 (0.00086) [2022-07-10 04:48:54,601][26022] Updated weights on worker 0-0, policy_version 573868 (0.00097) [2022-07-10 04:48:56,164][26022] Updated weights on worker 0-0, policy_version 573878 (0.00097) [2022-07-10 04:48:56,474][25689] Fps is (10 sec: 5481.0, 60 sec: 5628.7, 300 sec: 5645.2). Total num frames: 587652096. Throughput: 0: 5889.2. Samples: 587659072. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:48:56,474][25689] Avg episode reward: [(0, '-25.746')] [2022-07-10 04:48:58,068][26022] Updated weights on worker 0-0, policy_version 573888 (0.00087) [2022-07-10 04:48:59,878][26022] Updated weights on worker 0-0, policy_version 573898 (0.00089) [2022-07-10 04:49:01,481][25689] Fps is (10 sec: 5788.9, 60 sec: 5628.5, 300 sec: 5646.6). Total num frames: 587680768. Throughput: 0: 5050.8. Samples: 587676178. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:01,483][25689] Avg episode reward: [(0, '-25.819')] [2022-07-10 04:49:01,574][26022] Updated weights on worker 0-0, policy_version 573908 (0.00088) [2022-07-10 04:49:03,847][26022] Updated weights on worker 0-0, policy_version 573918 (0.00085) [2022-07-10 04:49:05,644][26022] Updated weights on worker 0-0, policy_version 573928 (0.00098) [2022-07-10 04:49:06,499][25689] Fps is (10 sec: 5413.8, 60 sec: 5629.4, 300 sec: 5640.5). Total num frames: 587706368. Throughput: 0: 5815.3. Samples: 587708564. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:06,499][25689] Avg episode reward: [(0, '-25.970')] [2022-07-10 04:49:07,404][26022] Updated weights on worker 0-0, policy_version 573938 (0.00084) [2022-07-10 04:49:09,388][26022] Updated weights on worker 0-0, policy_version 573948 (0.00094) [2022-07-10 04:49:11,007][26022] Updated weights on worker 0-0, policy_version 573958 (0.00079) [2022-07-10 04:49:11,511][25689] Fps is (10 sec: 5411.0, 60 sec: 5632.9, 300 sec: 5638.7). Total num frames: 587735040. Throughput: 0: 5811.3. Samples: 587742544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:11,513][25689] Avg episode reward: [(0, '-26.452')] [2022-07-10 04:49:12,922][26022] Updated weights on worker 0-0, policy_version 573968 (0.00089) [2022-07-10 04:49:14,717][26022] Updated weights on worker 0-0, policy_version 573978 (0.00088) [2022-07-10 04:49:16,371][26022] Updated weights on worker 0-0, policy_version 573988 (0.00088) [2022-07-10 04:49:16,554][25689] Fps is (10 sec: 5702.7, 60 sec: 5635.0, 300 sec: 5642.5). Total num frames: 587763712. Throughput: 0: 4999.6. Samples: 587759466. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:16,555][25689] Avg episode reward: [(0, '-27.227')] [2022-07-10 04:49:18,499][26022] Updated weights on worker 0-0, policy_version 573998 (0.00102) [2022-07-10 04:49:20,088][26022] Updated weights on worker 0-0, policy_version 574008 (0.00085) [2022-07-10 04:49:21,626][25689] Fps is (10 sec: 5669.3, 60 sec: 5633.5, 300 sec: 5641.4). Total num frames: 587792384. Throughput: 0: 5801.8. Samples: 587793054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:21,628][25689] Avg episode reward: [(0, '-26.878')] [2022-07-10 04:49:22,047][26022] Updated weights on worker 0-0, policy_version 574018 (0.00089) [2022-07-10 04:49:23,964][26022] Updated weights on worker 0-0, policy_version 574028 (0.00084) [2022-07-10 04:49:25,472][26022] Updated weights on worker 0-0, policy_version 574038 (0.00087) [2022-07-10 04:49:26,669][25689] Fps is (10 sec: 5568.3, 60 sec: 5579.6, 300 sec: 5638.0). Total num frames: 587820032. Throughput: 0: 5870.9. Samples: 587826980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:26,670][25689] Avg episode reward: [(0, '-27.169')] [2022-07-10 04:49:27,463][26022] Updated weights on worker 0-0, policy_version 574048 (0.00092) [2022-07-10 04:49:28,886][26022] Updated weights on worker 0-0, policy_version 574058 (0.00085) [2022-07-10 04:49:30,996][26022] Updated weights on worker 0-0, policy_version 574068 (0.00089) [2022-07-10 04:49:31,686][25689] Fps is (10 sec: 5802.0, 60 sec: 5663.2, 300 sec: 5642.0). Total num frames: 587850752. Throughput: 0: 5045.4. Samples: 587844334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:31,686][25689] Avg episode reward: [(0, '-26.887')] [2022-07-10 04:49:32,801][26022] Updated weights on worker 0-0, policy_version 574078 (0.00093) [2022-07-10 04:49:34,508][26022] Updated weights on worker 0-0, policy_version 574088 (0.00092) [2022-07-10 04:49:36,347][26022] Updated weights on worker 0-0, policy_version 574098 (0.00092) [2022-07-10 04:49:36,772][25689] Fps is (10 sec: 5776.9, 60 sec: 5627.3, 300 sec: 5644.3). Total num frames: 587878400. Throughput: 0: 5893.7. Samples: 587878626. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:36,773][25689] Avg episode reward: [(0, '-25.453')] [2022-07-10 04:49:38,063][26022] Updated weights on worker 0-0, policy_version 574108 (0.00081) [2022-07-10 04:49:39,981][26022] Updated weights on worker 0-0, policy_version 574118 (0.00090) [2022-07-10 04:49:41,726][26022] Updated weights on worker 0-0, policy_version 574128 (0.00086) [2022-07-10 04:49:41,788][25689] Fps is (10 sec: 5575.2, 60 sec: 5643.3, 300 sec: 5640.6). Total num frames: 587907072. Throughput: 0: 5929.2. Samples: 587912598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:41,788][25689] Avg episode reward: [(0, '-25.475')] [2022-07-10 04:49:43,592][26022] Updated weights on worker 0-0, policy_version 574138 (0.00094) [2022-07-10 04:49:45,422][26022] Updated weights on worker 0-0, policy_version 574148 (0.00087) [2022-07-10 04:49:46,802][25689] Fps is (10 sec: 5615.4, 60 sec: 5593.2, 300 sec: 5644.1). Total num frames: 587934720. Throughput: 0: 5101.8. Samples: 587929696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:46,802][25689] Avg episode reward: [(0, '-25.239')] [2022-07-10 04:49:47,227][26022] Updated weights on worker 0-0, policy_version 574158 (0.00082) [2022-07-10 04:49:48,985][26022] Updated weights on worker 0-0, policy_version 574168 (0.00090) [2022-07-10 04:49:50,755][26022] Updated weights on worker 0-0, policy_version 574178 (0.00084) [2022-07-10 04:49:51,835][25689] Fps is (10 sec: 5707.5, 60 sec: 5659.1, 300 sec: 5641.4). Total num frames: 587964416. Throughput: 0: 5929.0. Samples: 587963798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:51,836][25689] Avg episode reward: [(0, '-25.884')] [2022-07-10 04:49:52,532][26022] Updated weights on worker 0-0, policy_version 574188 (0.00084) [2022-07-10 04:49:54,322][26022] Updated weights on worker 0-0, policy_version 574198 (0.00084) [2022-07-10 04:49:56,317][26022] Updated weights on worker 0-0, policy_version 574208 (0.00093) [2022-07-10 04:49:56,908][25689] Fps is (10 sec: 5674.2, 60 sec: 5625.4, 300 sec: 5634.5). Total num frames: 587992064. Throughput: 0: 5922.6. Samples: 587997882. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:49:56,908][25689] Avg episode reward: [(0, '-26.031')] [2022-07-10 04:49:58,048][26022] Updated weights on worker 0-0, policy_version 574218 (0.00084) [2022-07-10 04:49:59,730][26022] Updated weights on worker 0-0, policy_version 574228 (0.00092) [2022-07-10 04:50:00,547][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:50:00,558][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000574233_588014592.pth [2022-07-10 04:50:00,559][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000572249_585982976.pth [2022-07-10 04:50:01,796][26022] Updated weights on worker 0-0, policy_version 574238 (0.00089) [2022-07-10 04:50:01,914][25689] Fps is (10 sec: 5486.1, 60 sec: 5608.6, 300 sec: 5641.8). Total num frames: 588019712. Throughput: 0: 5098.5. Samples: 588015214. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:01,915][25689] Avg episode reward: [(0, '-26.505')] [2022-07-10 04:50:03,734][26022] Updated weights on worker 0-0, policy_version 574248 (0.00106) [2022-07-10 04:50:05,560][26022] Updated weights on worker 0-0, policy_version 574258 (0.00084) [2022-07-10 04:50:06,919][25689] Fps is (10 sec: 5523.7, 60 sec: 5643.7, 300 sec: 5638.7). Total num frames: 588047360. Throughput: 0: 5848.9. Samples: 588047358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:06,921][25689] Avg episode reward: [(0, '-27.169')] [2022-07-10 04:50:07,358][26022] Updated weights on worker 0-0, policy_version 574268 (0.00108) [2022-07-10 04:50:09,199][26022] Updated weights on worker 0-0, policy_version 574278 (0.00087) [2022-07-10 04:50:10,892][26022] Updated weights on worker 0-0, policy_version 574288 (0.00086) [2022-07-10 04:50:12,009][25689] Fps is (10 sec: 5579.4, 60 sec: 5636.5, 300 sec: 5642.4). Total num frames: 588076032. Throughput: 0: 5852.7. Samples: 588081868. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:12,010][25689] Avg episode reward: [(0, '-27.219')] [2022-07-10 04:50:12,796][26022] Updated weights on worker 0-0, policy_version 574298 (0.00084) [2022-07-10 04:50:14,455][26022] Updated weights on worker 0-0, policy_version 574308 (0.00094) [2022-07-10 04:50:16,303][26022] Updated weights on worker 0-0, policy_version 574318 (0.00085) [2022-07-10 04:50:17,100][25689] Fps is (10 sec: 5934.2, 60 sec: 5682.7, 300 sec: 5651.6). Total num frames: 588107776. Throughput: 0: 5005.7. Samples: 588098956. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:17,100][25689] Avg episode reward: [(0, '-27.480')] [2022-07-10 04:50:18,117][26022] Updated weights on worker 0-0, policy_version 574328 (0.00090) [2022-07-10 04:50:19,874][26022] Updated weights on worker 0-0, policy_version 574338 (0.00082) [2022-07-10 04:50:21,821][26022] Updated weights on worker 0-0, policy_version 574348 (0.00082) [2022-07-10 04:50:22,167][25689] Fps is (10 sec: 5644.9, 60 sec: 5632.4, 300 sec: 5636.7). Total num frames: 588133376. Throughput: 0: 5818.9. Samples: 588133064. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:22,169][25689] Avg episode reward: [(0, '-26.946')] [2022-07-10 04:50:23,486][26022] Updated weights on worker 0-0, policy_version 574358 (0.00088) [2022-07-10 04:50:25,402][26022] Updated weights on worker 0-0, policy_version 574368 (0.00091) [2022-07-10 04:50:26,886][26022] Updated weights on worker 0-0, policy_version 574378 (0.00089) [2022-07-10 04:50:27,190][25689] Fps is (10 sec: 5683.2, 60 sec: 5701.9, 300 sec: 5653.6). Total num frames: 588165120. Throughput: 0: 5931.9. Samples: 588167604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:27,190][25689] Avg episode reward: [(0, '-27.267')] [2022-07-10 04:50:29,025][26022] Updated weights on worker 0-0, policy_version 574388 (0.00094) [2022-07-10 04:50:30,652][26022] Updated weights on worker 0-0, policy_version 574398 (0.00094) [2022-07-10 04:50:32,208][25689] Fps is (10 sec: 5711.3, 60 sec: 5617.3, 300 sec: 5637.8). Total num frames: 588190720. Throughput: 0: 5087.8. Samples: 588184638. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:32,208][25689] Avg episode reward: [(0, '-26.632')] [2022-07-10 04:50:32,709][26022] Updated weights on worker 0-0, policy_version 574408 (0.00096) [2022-07-10 04:50:34,173][26022] Updated weights on worker 0-0, policy_version 574418 (0.00085) [2022-07-10 04:50:36,094][26022] Updated weights on worker 0-0, policy_version 574428 (0.00086) [2022-07-10 04:50:37,291][25689] Fps is (10 sec: 5575.7, 60 sec: 5668.4, 300 sec: 5647.2). Total num frames: 588221440. Throughput: 0: 5937.0. Samples: 588218830. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:37,291][25689] Avg episode reward: [(0, '-26.743')] [2022-07-10 04:50:37,647][26022] Updated weights on worker 0-0, policy_version 574438 (0.00085) [2022-07-10 04:50:39,694][26022] Updated weights on worker 0-0, policy_version 574448 (0.00082) [2022-07-10 04:50:41,408][26022] Updated weights on worker 0-0, policy_version 574458 (0.00092) [2022-07-10 04:50:42,345][25689] Fps is (10 sec: 5858.9, 60 sec: 5664.8, 300 sec: 5647.1). Total num frames: 588250112. Throughput: 0: 5950.6. Samples: 588253132. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:42,345][25689] Avg episode reward: [(0, '-26.615')] [2022-07-10 04:50:43,155][26022] Updated weights on worker 0-0, policy_version 574468 (0.00083) [2022-07-10 04:50:45,089][26022] Updated weights on worker 0-0, policy_version 574478 (0.00093) [2022-07-10 04:50:46,691][26022] Updated weights on worker 0-0, policy_version 574488 (0.00097) [2022-07-10 04:50:47,360][25689] Fps is (10 sec: 5593.3, 60 sec: 5664.6, 300 sec: 5647.0). Total num frames: 588277760. Throughput: 0: 5939.6. Samples: 588287406. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:47,360][25689] Avg episode reward: [(0, '-25.754')] [2022-07-10 04:50:48,595][26022] Updated weights on worker 0-0, policy_version 574498 (0.00078) [2022-07-10 04:50:50,573][26022] Updated weights on worker 0-0, policy_version 574508 (0.00089) [2022-07-10 04:50:52,309][26022] Updated weights on worker 0-0, policy_version 574518 (0.00086) [2022-07-10 04:50:52,445][25689] Fps is (10 sec: 5576.0, 60 sec: 5642.9, 300 sec: 5641.0). Total num frames: 588306432. Throughput: 0: 5919.3. Samples: 588304430. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:52,446][25689] Avg episode reward: [(0, '-25.935')] [2022-07-10 04:50:53,999][26022] Updated weights on worker 0-0, policy_version 574528 (0.00084) [2022-07-10 04:50:55,813][26022] Updated weights on worker 0-0, policy_version 574538 (0.00077) [2022-07-10 04:50:57,477][25689] Fps is (10 sec: 5769.2, 60 sec: 5680.5, 300 sec: 5645.3). Total num frames: 588336128. Throughput: 0: 5919.4. Samples: 588338320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:50:57,479][25689] Avg episode reward: [(0, '-26.197')] [2022-07-10 04:50:57,594][26022] Updated weights on worker 0-0, policy_version 574548 (0.00090) [2022-07-10 04:50:59,496][26022] Updated weights on worker 0-0, policy_version 574558 (0.00091) [2022-07-10 04:51:01,488][26022] Updated weights on worker 0-0, policy_version 574568 (0.00093) [2022-07-10 04:51:02,579][25689] Fps is (10 sec: 5456.6, 60 sec: 5637.8, 300 sec: 5643.8). Total num frames: 588361728. Throughput: 0: 5793.7. Samples: 588370364. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:51:02,580][25689] Avg episode reward: [(0, '-25.155')] [2022-07-10 04:51:03,455][26022] Updated weights on worker 0-0, policy_version 574578 (0.00080) [2022-07-10 04:51:05,235][26022] Updated weights on worker 0-0, policy_version 574588 (0.00093) [2022-07-10 04:51:07,124][26022] Updated weights on worker 0-0, policy_version 574598 (0.00086) [2022-07-10 04:51:07,651][25689] Fps is (10 sec: 5435.2, 60 sec: 5665.3, 300 sec: 5640.3). Total num frames: 588391424. Throughput: 0: 4934.9. Samples: 588387540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:51:07,651][25689] Avg episode reward: [(0, '-25.858')] [2022-07-10 04:51:08,986][26022] Updated weights on worker 0-0, policy_version 574608 (0.00088) [2022-07-10 04:51:10,674][26022] Updated weights on worker 0-0, policy_version 574618 (0.00087) [2022-07-10 04:51:12,579][26022] Updated weights on worker 0-0, policy_version 574628 (0.00095) [2022-07-10 04:51:12,671][25689] Fps is (10 sec: 5682.1, 60 sec: 5654.9, 300 sec: 5644.7). Total num frames: 588419072. Throughput: 0: 5779.4. Samples: 588421324. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:51:12,672][25689] Avg episode reward: [(0, '-26.052')] [2022-07-10 04:51:14,406][26022] Updated weights on worker 0-0, policy_version 574638 (0.00081) [2022-07-10 04:51:16,193][26022] Updated weights on worker 0-0, policy_version 574648 (0.00088) [2022-07-10 04:51:17,732][25689] Fps is (10 sec: 5688.0, 60 sec: 5623.9, 300 sec: 5643.8). Total num frames: 588448768. Throughput: 0: 5771.8. Samples: 588455228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:17,734][25689] Avg episode reward: [(0, '-27.977')] [2022-07-10 04:51:17,911][26022] Updated weights on worker 0-0, policy_version 574658 (0.00080) [2022-07-10 04:51:19,767][26022] Updated weights on worker 0-0, policy_version 574668 (0.00089) [2022-07-10 04:51:21,392][26022] Updated weights on worker 0-0, policy_version 574678 (0.00090) [2022-07-10 04:51:22,784][25689] Fps is (10 sec: 5670.1, 60 sec: 5659.1, 300 sec: 5643.7). Total num frames: 588476416. Throughput: 0: 5043.4. Samples: 588472268. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:22,785][25689] Avg episode reward: [(0, '-27.401')] [2022-07-10 04:51:23,369][26022] Updated weights on worker 0-0, policy_version 574688 (0.00083) [2022-07-10 04:51:25,374][26022] Updated weights on worker 0-0, policy_version 574698 (0.00088) [2022-07-10 04:51:26,926][26022] Updated weights on worker 0-0, policy_version 574708 (0.00089) [2022-07-10 04:51:27,867][25689] Fps is (10 sec: 5658.4, 60 sec: 5619.8, 300 sec: 5645.9). Total num frames: 588506112. Throughput: 0: 5867.7. Samples: 588506160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:27,871][25689] Avg episode reward: [(0, '-28.068')] [2022-07-10 04:51:28,922][26022] Updated weights on worker 0-0, policy_version 574718 (0.00080) [2022-07-10 04:51:30,613][26022] Updated weights on worker 0-0, policy_version 574728 (0.00086) [2022-07-10 04:51:32,558][26022] Updated weights on worker 0-0, policy_version 574738 (0.00089) [2022-07-10 04:51:32,965][25689] Fps is (10 sec: 5532.4, 60 sec: 5629.3, 300 sec: 5639.8). Total num frames: 588532736. Throughput: 0: 5859.1. Samples: 588540226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:32,965][25689] Avg episode reward: [(0, '-28.525')] [2022-07-10 04:51:34,085][26022] Updated weights on worker 0-0, policy_version 574748 (0.00083) [2022-07-10 04:51:36,237][26022] Updated weights on worker 0-0, policy_version 574758 (0.00088) [2022-07-10 04:51:37,759][26022] Updated weights on worker 0-0, policy_version 574768 (0.00095) [2022-07-10 04:51:38,065][25689] Fps is (10 sec: 5622.6, 60 sec: 5627.6, 300 sec: 5641.6). Total num frames: 588563456. Throughput: 0: 5017.3. Samples: 588557246. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:38,067][25689] Avg episode reward: [(0, '-28.826')] [2022-07-10 04:51:39,811][26022] Updated weights on worker 0-0, policy_version 574778 (0.00086) [2022-07-10 04:51:41,513][26022] Updated weights on worker 0-0, policy_version 574788 (0.00083) [2022-07-10 04:51:43,128][25689] Fps is (10 sec: 5843.6, 60 sec: 5626.8, 300 sec: 5645.2). Total num frames: 588592128. Throughput: 0: 5847.7. Samples: 588591232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:43,129][25689] Avg episode reward: [(0, '-28.605')] [2022-07-10 04:51:43,201][26022] Updated weights on worker 0-0, policy_version 574798 (0.00094) [2022-07-10 04:51:44,820][26022] Updated weights on worker 0-0, policy_version 574808 (0.00087) [2022-07-10 04:51:46,779][26022] Updated weights on worker 0-0, policy_version 574818 (0.00085) [2022-07-10 04:51:48,209][25689] Fps is (10 sec: 5754.2, 60 sec: 5654.4, 300 sec: 5649.0). Total num frames: 588621824. Throughput: 0: 5865.7. Samples: 588625482. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:48,209][25689] Avg episode reward: [(0, '-27.503')] [2022-07-10 04:51:48,677][26022] Updated weights on worker 0-0, policy_version 574828 (0.00082) [2022-07-10 04:51:50,707][26022] Updated weights on worker 0-0, policy_version 574838 (0.00087) [2022-07-10 04:51:52,299][26022] Updated weights on worker 0-0, policy_version 574848 (0.00090) [2022-07-10 04:51:53,256][25689] Fps is (10 sec: 5560.9, 60 sec: 5624.3, 300 sec: 5640.4). Total num frames: 588648448. Throughput: 0: 5044.5. Samples: 588642586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:53,257][25689] Avg episode reward: [(0, '-26.743')] [2022-07-10 04:51:54,152][26022] Updated weights on worker 0-0, policy_version 574858 (0.00086) [2022-07-10 04:51:56,011][26022] Updated weights on worker 0-0, policy_version 574868 (0.00094) [2022-07-10 04:51:57,718][26022] Updated weights on worker 0-0, policy_version 574878 (0.00080) [2022-07-10 04:51:58,378][25689] Fps is (10 sec: 5437.7, 60 sec: 5599.1, 300 sec: 5635.1). Total num frames: 588677120. Throughput: 0: 5863.4. Samples: 588676348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:51:58,378][25689] Avg episode reward: [(0, '-27.736')] [2022-07-10 04:51:59,600][26022] Updated weights on worker 0-0, policy_version 574888 (0.00091) [2022-07-10 04:52:00,716][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:52:00,732][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000574894_588691456.pth [2022-07-10 04:52:00,732][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000572909_586658816.pth [2022-07-10 04:52:01,475][26022] Updated weights on worker 0-0, policy_version 574898 (0.00082) [2022-07-10 04:52:03,344][26022] Updated weights on worker 0-0, policy_version 574908 (0.00095) [2022-07-10 04:52:03,416][25689] Fps is (10 sec: 5644.2, 60 sec: 5655.5, 300 sec: 5642.9). Total num frames: 588705792. Throughput: 0: 5798.6. Samples: 588708872. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:03,417][25689] Avg episode reward: [(0, '-28.472')] [2022-07-10 04:52:05,564][26022] Updated weights on worker 0-0, policy_version 574918 (0.00090) [2022-07-10 04:52:06,945][26022] Updated weights on worker 0-0, policy_version 574928 (0.00091) [2022-07-10 04:52:08,498][25689] Fps is (10 sec: 5463.9, 60 sec: 5604.0, 300 sec: 5635.4). Total num frames: 588732416. Throughput: 0: 5772.3. Samples: 588742598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:08,500][25689] Avg episode reward: [(0, '-28.129')] [2022-07-10 04:52:08,961][26022] Updated weights on worker 0-0, policy_version 574938 (0.00085) [2022-07-10 04:52:10,770][26022] Updated weights on worker 0-0, policy_version 574948 (0.00090) [2022-07-10 04:52:12,479][26022] Updated weights on worker 0-0, policy_version 574958 (0.00089) [2022-07-10 04:52:13,567][25689] Fps is (10 sec: 5548.3, 60 sec: 5633.2, 300 sec: 5638.0). Total num frames: 588762112. Throughput: 0: 5770.0. Samples: 588759780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:13,567][25689] Avg episode reward: [(0, '-28.272')] [2022-07-10 04:52:14,336][26022] Updated weights on worker 0-0, policy_version 574968 (0.00083) [2022-07-10 04:52:16,096][26022] Updated weights on worker 0-0, policy_version 574978 (0.00089) [2022-07-10 04:52:17,935][26022] Updated weights on worker 0-0, policy_version 574988 (0.00054) [2022-07-10 04:52:18,647][25689] Fps is (10 sec: 5953.2, 60 sec: 5648.3, 300 sec: 5641.1). Total num frames: 588792832. Throughput: 0: 5818.5. Samples: 588794282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:18,647][25689] Avg episode reward: [(0, '-28.787')] [2022-07-10 04:52:19,622][26022] Updated weights on worker 0-0, policy_version 574998 (0.00087) [2022-07-10 04:52:21,462][26022] Updated weights on worker 0-0, policy_version 575008 (0.00091) [2022-07-10 04:52:23,325][26022] Updated weights on worker 0-0, policy_version 575018 (0.00092) [2022-07-10 04:52:23,745][25689] Fps is (10 sec: 5734.4, 60 sec: 5644.0, 300 sec: 5639.9). Total num frames: 588820480. Throughput: 0: 5900.6. Samples: 588828828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:23,746][25689] Avg episode reward: [(0, '-30.521')] [2022-07-10 04:52:24,976][26022] Updated weights on worker 0-0, policy_version 575028 (0.00087) [2022-07-10 04:52:27,036][26022] Updated weights on worker 0-0, policy_version 575038 (0.00097) [2022-07-10 04:52:28,512][26022] Updated weights on worker 0-0, policy_version 575048 (0.00086) [2022-07-10 04:52:28,781][25689] Fps is (10 sec: 5557.5, 60 sec: 5631.5, 300 sec: 5636.3). Total num frames: 588849152. Throughput: 0: 5078.9. Samples: 588845610. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:28,781][25689] Avg episode reward: [(0, '-30.819')] [2022-07-10 04:52:30,595][26022] Updated weights on worker 0-0, policy_version 575058 (0.00082) [2022-07-10 04:52:32,170][26022] Updated weights on worker 0-0, policy_version 575068 (0.00051) [2022-07-10 04:52:33,793][25689] Fps is (10 sec: 5707.3, 60 sec: 5673.1, 300 sec: 5638.6). Total num frames: 588877824. Throughput: 0: 5933.3. Samples: 588879786. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:33,795][25689] Avg episode reward: [(0, '-30.988')] [2022-07-10 04:52:33,987][26022] Updated weights on worker 0-0, policy_version 575078 (0.00083) [2022-07-10 04:52:35,935][26022] Updated weights on worker 0-0, policy_version 575088 (0.00089) [2022-07-10 04:52:37,599][26022] Updated weights on worker 0-0, policy_version 575098 (0.00085) [2022-07-10 04:52:38,887][25689] Fps is (10 sec: 5674.2, 60 sec: 5640.1, 300 sec: 5640.9). Total num frames: 588906496. Throughput: 0: 5918.5. Samples: 588914074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:38,887][25689] Avg episode reward: [(0, '-30.031')] [2022-07-10 04:52:39,407][26022] Updated weights on worker 0-0, policy_version 575108 (0.00091) [2022-07-10 04:52:41,463][26022] Updated weights on worker 0-0, policy_version 575118 (0.00969) [2022-07-10 04:52:43,032][26022] Updated weights on worker 0-0, policy_version 575128 (0.00088) [2022-07-10 04:52:43,925][25689] Fps is (10 sec: 5659.3, 60 sec: 5642.4, 300 sec: 5633.4). Total num frames: 588935168. Throughput: 0: 5070.5. Samples: 588931150. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:43,926][25689] Avg episode reward: [(0, '-30.247')] [2022-07-10 04:52:44,995][26022] Updated weights on worker 0-0, policy_version 575138 (0.00095) [2022-07-10 04:52:46,640][26022] Updated weights on worker 0-0, policy_version 575148 (0.00089) [2022-07-10 04:52:48,419][26022] Updated weights on worker 0-0, policy_version 575158 (0.00086) [2022-07-10 04:52:48,937][25689] Fps is (10 sec: 5705.6, 60 sec: 5631.9, 300 sec: 5640.4). Total num frames: 588963840. Throughput: 0: 5936.1. Samples: 588965260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:48,939][25689] Avg episode reward: [(0, '-28.964')] [2022-07-10 04:52:50,403][26022] Updated weights on worker 0-0, policy_version 575168 (0.00090) [2022-07-10 04:52:52,142][26022] Updated weights on worker 0-0, policy_version 575178 (0.00093) [2022-07-10 04:52:53,849][26022] Updated weights on worker 0-0, policy_version 575188 (0.00087) [2022-07-10 04:52:53,976][25689] Fps is (10 sec: 5807.4, 60 sec: 5683.3, 300 sec: 5644.2). Total num frames: 588993536. Throughput: 0: 5943.4. Samples: 588999742. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:53,978][25689] Avg episode reward: [(0, '-28.550')] [2022-07-10 04:52:55,771][26022] Updated weights on worker 0-0, policy_version 575198 (0.00083) [2022-07-10 04:52:57,451][26022] Updated weights on worker 0-0, policy_version 575208 (0.00095) [2022-07-10 04:52:59,033][25689] Fps is (10 sec: 5679.8, 60 sec: 5672.4, 300 sec: 5639.7). Total num frames: 589021184. Throughput: 0: 5086.2. Samples: 589016544. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:52:59,035][25689] Avg episode reward: [(0, '-27.469')] [2022-07-10 04:52:59,296][26022] Updated weights on worker 0-0, policy_version 575218 (0.00087) [2022-07-10 04:53:01,252][26022] Updated weights on worker 0-0, policy_version 575228 (0.00093) [2022-07-10 04:53:03,191][26022] Updated weights on worker 0-0, policy_version 575238 (0.00085) [2022-07-10 04:53:04,055][25689] Fps is (10 sec: 5283.3, 60 sec: 5623.4, 300 sec: 5639.9). Total num frames: 589046784. Throughput: 0: 5836.7. Samples: 589048638. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:04,055][25689] Avg episode reward: [(0, '-27.029')] [2022-07-10 04:53:05,205][26022] Updated weights on worker 0-0, policy_version 575248 (0.01367) [2022-07-10 04:53:06,882][26022] Updated weights on worker 0-0, policy_version 575258 (0.00085) [2022-07-10 04:53:08,738][26022] Updated weights on worker 0-0, policy_version 575268 (0.00086) [2022-07-10 04:53:09,075][25689] Fps is (10 sec: 5608.8, 60 sec: 5696.7, 300 sec: 5647.3). Total num frames: 589077504. Throughput: 0: 5834.1. Samples: 589082744. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:09,076][25689] Avg episode reward: [(0, '-27.066')] [2022-07-10 04:53:10,655][26022] Updated weights on worker 0-0, policy_version 575278 (0.00089) [2022-07-10 04:53:12,296][26022] Updated weights on worker 0-0, policy_version 575288 (0.00067) [2022-07-10 04:53:14,090][25689] Fps is (10 sec: 5612.1, 60 sec: 5634.1, 300 sec: 5637.9). Total num frames: 589103104. Throughput: 0: 4961.9. Samples: 589099546. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:14,090][25689] Avg episode reward: [(0, '-27.027')] [2022-07-10 04:53:14,408][26022] Updated weights on worker 0-0, policy_version 575298 (0.00085) [2022-07-10 04:53:16,067][26022] Updated weights on worker 0-0, policy_version 575308 (0.00088) [2022-07-10 04:53:17,785][26022] Updated weights on worker 0-0, policy_version 575318 (0.00086) [2022-07-10 04:53:19,175][25689] Fps is (10 sec: 5373.5, 60 sec: 5599.8, 300 sec: 5637.4). Total num frames: 589131776. Throughput: 0: 5801.7. Samples: 589133398. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:19,175][25689] Avg episode reward: [(0, '-27.384')] [2022-07-10 04:53:19,558][26022] Updated weights on worker 0-0, policy_version 575328 (0.00085) [2022-07-10 04:53:21,396][26022] Updated weights on worker 0-0, policy_version 575338 (0.00092) [2022-07-10 04:53:23,135][26022] Updated weights on worker 0-0, policy_version 575348 (0.00085) [2022-07-10 04:53:24,176][25689] Fps is (10 sec: 5888.8, 60 sec: 5659.7, 300 sec: 5637.6). Total num frames: 589162496. Throughput: 0: 5919.3. Samples: 589167742. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:24,176][25689] Avg episode reward: [(0, '-26.838')] [2022-07-10 04:53:25,101][26022] Updated weights on worker 0-0, policy_version 575358 (0.00097) [2022-07-10 04:53:26,817][26022] Updated weights on worker 0-0, policy_version 575368 (0.00087) [2022-07-10 04:53:28,652][26022] Updated weights on worker 0-0, policy_version 575378 (0.00107) [2022-07-10 04:53:29,185][25689] Fps is (10 sec: 5830.5, 60 sec: 5645.1, 300 sec: 5644.3). Total num frames: 589190144. Throughput: 0: 5063.3. Samples: 589184570. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:29,186][25689] Avg episode reward: [(0, '-26.869')] [2022-07-10 04:53:30,568][26022] Updated weights on worker 0-0, policy_version 575388 (0.00085) [2022-07-10 04:53:32,106][26022] Updated weights on worker 0-0, policy_version 575398 (0.00090) [2022-07-10 04:53:34,172][26022] Updated weights on worker 0-0, policy_version 575408 (0.00092) [2022-07-10 04:53:34,190][25689] Fps is (10 sec: 5521.5, 60 sec: 5628.8, 300 sec: 5638.6). Total num frames: 589217792. Throughput: 0: 5918.1. Samples: 589218502. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:34,191][25689] Avg episode reward: [(0, '-26.867')] [2022-07-10 04:53:35,842][26022] Updated weights on worker 0-0, policy_version 575418 (0.00087) [2022-07-10 04:53:37,632][26022] Updated weights on worker 0-0, policy_version 575428 (0.00087) [2022-07-10 04:53:39,289][25689] Fps is (10 sec: 5574.0, 60 sec: 5628.4, 300 sec: 5640.3). Total num frames: 589246464. Throughput: 0: 5936.5. Samples: 589252810. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:39,290][25689] Avg episode reward: [(0, '-26.836')] [2022-07-10 04:53:39,480][26022] Updated weights on worker 0-0, policy_version 575438 (0.00087) [2022-07-10 04:53:41,210][26022] Updated weights on worker 0-0, policy_version 575448 (0.00084) [2022-07-10 04:53:43,027][26022] Updated weights on worker 0-0, policy_version 575458 (0.00092) [2022-07-10 04:53:44,318][25689] Fps is (10 sec: 5763.0, 60 sec: 5646.2, 300 sec: 5636.7). Total num frames: 589276160. Throughput: 0: 5057.5. Samples: 589269614. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:44,319][25689] Avg episode reward: [(0, '-26.532')] [2022-07-10 04:53:45,051][26022] Updated weights on worker 0-0, policy_version 575468 (0.00082) [2022-07-10 04:53:46,482][26022] Updated weights on worker 0-0, policy_version 575478 (0.00088) [2022-07-10 04:53:48,657][26022] Updated weights on worker 0-0, policy_version 575488 (0.00091) [2022-07-10 04:53:49,339][25689] Fps is (10 sec: 5706.3, 60 sec: 5628.5, 300 sec: 5643.4). Total num frames: 589303808. Throughput: 0: 5904.0. Samples: 589303556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:49,339][25689] Avg episode reward: [(0, '-26.461')] [2022-07-10 04:53:50,269][26022] Updated weights on worker 0-0, policy_version 575498 (0.00091) [2022-07-10 04:53:52,279][26022] Updated weights on worker 0-0, policy_version 575508 (0.00084) [2022-07-10 04:53:54,112][26022] Updated weights on worker 0-0, policy_version 575518 (0.00092) [2022-07-10 04:53:54,350][25689] Fps is (10 sec: 5512.4, 60 sec: 5597.2, 300 sec: 5637.8). Total num frames: 589331456. Throughput: 0: 5879.6. Samples: 589337030. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:54,350][25689] Avg episode reward: [(0, '-24.998')] [2022-07-10 04:53:55,869][26022] Updated weights on worker 0-0, policy_version 575528 (0.00091) [2022-07-10 04:53:57,696][26022] Updated weights on worker 0-0, policy_version 575538 (0.00086) [2022-07-10 04:53:59,459][25689] Fps is (10 sec: 5464.1, 60 sec: 5592.4, 300 sec: 5632.4). Total num frames: 589359104. Throughput: 0: 4998.0. Samples: 589353616. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:53:59,459][25689] Avg episode reward: [(0, '-25.619')] [2022-07-10 04:53:59,769][26022] Updated weights on worker 0-0, policy_version 575548 (0.00070) [2022-07-10 04:54:00,823][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:54:00,834][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000575555_589368320.pth [2022-07-10 04:54:00,840][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000573572_587337728.pth [2022-07-10 04:54:01,413][26022] Updated weights on worker 0-0, policy_version 575558 (0.00086) [2022-07-10 04:54:03,751][26022] Updated weights on worker 0-0, policy_version 575568 (0.00088) [2022-07-10 04:54:04,483][25689] Fps is (10 sec: 5355.6, 60 sec: 5609.0, 300 sec: 5635.7). Total num frames: 589385728. Throughput: 0: 5732.3. Samples: 589385206. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:54:04,484][25689] Avg episode reward: [(0, '-24.446')] [2022-07-10 04:54:05,577][26022] Updated weights on worker 0-0, policy_version 575578 (0.00084) [2022-07-10 04:54:07,144][26022] Updated weights on worker 0-0, policy_version 575588 (0.00574) [2022-07-10 04:54:09,063][26022] Updated weights on worker 0-0, policy_version 575598 (0.00086) [2022-07-10 04:54:09,513][25689] Fps is (10 sec: 5601.6, 60 sec: 5591.2, 300 sec: 5638.8). Total num frames: 589415424. Throughput: 0: 5732.8. Samples: 589419210. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:54:09,514][25689] Avg episode reward: [(0, '-24.767')] [2022-07-10 04:54:10,831][26022] Updated weights on worker 0-0, policy_version 575608 (0.00081) [2022-07-10 04:54:12,666][26022] Updated weights on worker 0-0, policy_version 575618 (0.00086) [2022-07-10 04:54:14,524][25689] Fps is (10 sec: 5609.0, 60 sec: 5608.5, 300 sec: 5632.6). Total num frames: 589442048. Throughput: 0: 4926.1. Samples: 589436412. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:54:14,525][25689] Avg episode reward: [(0, '-25.651')] [2022-07-10 04:54:14,736][26022] Updated weights on worker 0-0, policy_version 575628 (0.00094) [2022-07-10 04:54:16,145][26022] Updated weights on worker 0-0, policy_version 575638 (0.00092) [2022-07-10 04:54:18,143][26022] Updated weights on worker 0-0, policy_version 575648 (0.00085) [2022-07-10 04:54:19,567][25689] Fps is (10 sec: 5703.4, 60 sec: 5646.3, 300 sec: 5640.0). Total num frames: 589472768. Throughput: 0: 5811.7. Samples: 589470482. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 04:54:19,569][25689] Avg episode reward: [(0, '-25.115')] [2022-07-10 04:54:19,746][26022] Updated weights on worker 0-0, policy_version 575658 (0.00089) [2022-07-10 04:54:21,776][26022] Updated weights on worker 0-0, policy_version 575668 (0.00090) [2022-07-10 04:54:23,431][26022] Updated weights on worker 0-0, policy_version 575678 (0.00093) [2022-07-10 04:54:24,576][25689] Fps is (10 sec: 5705.0, 60 sec: 5577.7, 300 sec: 5637.2). Total num frames: 589499392. Throughput: 0: 5937.7. Samples: 589504510. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:24,576][25689] Avg episode reward: [(0, '-27.199')] [2022-07-10 04:54:25,424][26022] Updated weights on worker 0-0, policy_version 575688 (0.00086) [2022-07-10 04:54:27,073][26022] Updated weights on worker 0-0, policy_version 575698 (0.00081) [2022-07-10 04:54:29,128][26022] Updated weights on worker 0-0, policy_version 575708 (0.00084) [2022-07-10 04:54:29,595][25689] Fps is (10 sec: 5514.1, 60 sec: 5593.8, 300 sec: 5630.3). Total num frames: 589528064. Throughput: 0: 5073.8. Samples: 589521104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:29,596][25689] Avg episode reward: [(0, '-27.035')] [2022-07-10 04:54:30,795][26022] Updated weights on worker 0-0, policy_version 575718 (0.00085) [2022-07-10 04:54:32,685][26022] Updated weights on worker 0-0, policy_version 575728 (0.00085) [2022-07-10 04:54:34,369][26022] Updated weights on worker 0-0, policy_version 575738 (0.00089) [2022-07-10 04:54:34,601][25689] Fps is (10 sec: 5719.9, 60 sec: 5610.6, 300 sec: 5635.2). Total num frames: 589556736. Throughput: 0: 5899.9. Samples: 589554864. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:34,602][25689] Avg episode reward: [(0, '-26.509')] [2022-07-10 04:54:36,338][26022] Updated weights on worker 0-0, policy_version 575748 (0.00094) [2022-07-10 04:54:37,975][26022] Updated weights on worker 0-0, policy_version 575758 (0.00090) [2022-07-10 04:54:39,648][25689] Fps is (10 sec: 5500.5, 60 sec: 5581.6, 300 sec: 5627.8). Total num frames: 589583360. Throughput: 0: 5897.3. Samples: 589588906. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:39,649][25689] Avg episode reward: [(0, '-26.971')] [2022-07-10 04:54:40,011][26022] Updated weights on worker 0-0, policy_version 575768 (0.00088) [2022-07-10 04:54:41,522][26022] Updated weights on worker 0-0, policy_version 575778 (0.00095) [2022-07-10 04:54:43,654][26022] Updated weights on worker 0-0, policy_version 575788 (0.00086) [2022-07-10 04:54:44,657][25689] Fps is (10 sec: 5600.7, 60 sec: 5583.4, 300 sec: 5634.7). Total num frames: 589613056. Throughput: 0: 5049.6. Samples: 589605912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:44,658][25689] Avg episode reward: [(0, '-26.318')] [2022-07-10 04:54:45,007][26022] Updated weights on worker 0-0, policy_version 575798 (0.00093) [2022-07-10 04:54:47,088][26022] Updated weights on worker 0-0, policy_version 575808 (0.00089) [2022-07-10 04:54:48,923][26022] Updated weights on worker 0-0, policy_version 575818 (0.00087) [2022-07-10 04:54:49,684][25689] Fps is (10 sec: 5816.3, 60 sec: 5599.8, 300 sec: 5631.4). Total num frames: 589641728. Throughput: 0: 5922.2. Samples: 589640070. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:49,684][25689] Avg episode reward: [(0, '-24.829')] [2022-07-10 04:54:50,666][26022] Updated weights on worker 0-0, policy_version 575828 (0.00086) [2022-07-10 04:54:52,244][26022] Updated weights on worker 0-0, policy_version 575838 (0.00091) [2022-07-10 04:54:54,358][26022] Updated weights on worker 0-0, policy_version 575848 (0.00095) [2022-07-10 04:54:54,723][25689] Fps is (10 sec: 5697.1, 60 sec: 5614.2, 300 sec: 5635.5). Total num frames: 589670400. Throughput: 0: 5950.6. Samples: 589674598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:54,723][25689] Avg episode reward: [(0, '-25.273')] [2022-07-10 04:54:56,099][26022] Updated weights on worker 0-0, policy_version 575858 (0.00084) [2022-07-10 04:54:57,813][26022] Updated weights on worker 0-0, policy_version 575868 (0.00092) [2022-07-10 04:54:59,719][26022] Updated weights on worker 0-0, policy_version 575878 (0.00089) [2022-07-10 04:54:59,771][25689] Fps is (10 sec: 5684.4, 60 sec: 5636.7, 300 sec: 5638.1). Total num frames: 589699072. Throughput: 0: 5941.2. Samples: 589708462. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:54:59,772][25689] Avg episode reward: [(0, '-24.513')] [2022-07-10 04:55:01,778][26022] Updated weights on worker 0-0, policy_version 575888 (0.00106) [2022-07-10 04:55:03,587][26022] Updated weights on worker 0-0, policy_version 575898 (0.00052) [2022-07-10 04:55:04,779][25689] Fps is (10 sec: 5498.8, 60 sec: 5638.4, 300 sec: 5634.6). Total num frames: 589725696. Throughput: 0: 5841.8. Samples: 589723458. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:04,779][25689] Avg episode reward: [(0, '-26.915')] [2022-07-10 04:55:05,575][26022] Updated weights on worker 0-0, policy_version 575908 (0.00088) [2022-07-10 04:55:06,959][26022] Updated weights on worker 0-0, policy_version 575918 (0.00088) [2022-07-10 04:55:09,139][26022] Updated weights on worker 0-0, policy_version 575928 (0.00090) [2022-07-10 04:55:09,791][25689] Fps is (10 sec: 5519.0, 60 sec: 5623.0, 300 sec: 5636.1). Total num frames: 589754368. Throughput: 0: 5862.3. Samples: 589757944. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:09,791][25689] Avg episode reward: [(0, '-25.570')] [2022-07-10 04:55:10,915][26022] Updated weights on worker 0-0, policy_version 575938 (0.00092) [2022-07-10 04:55:12,530][26022] Updated weights on worker 0-0, policy_version 575948 (0.00095) [2022-07-10 04:55:14,497][26022] Updated weights on worker 0-0, policy_version 575958 (0.00087) [2022-07-10 04:55:14,811][25689] Fps is (10 sec: 5716.1, 60 sec: 5656.2, 300 sec: 5627.1). Total num frames: 589783040. Throughput: 0: 5851.5. Samples: 589792142. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:14,811][25689] Avg episode reward: [(0, '-25.785')] [2022-07-10 04:55:16,294][26022] Updated weights on worker 0-0, policy_version 575968 (0.00085) [2022-07-10 04:55:18,026][26022] Updated weights on worker 0-0, policy_version 575978 (0.00086) [2022-07-10 04:55:19,852][25689] Fps is (10 sec: 5597.6, 60 sec: 5605.4, 300 sec: 5634.5). Total num frames: 589810688. Throughput: 0: 5023.5. Samples: 589809338. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:19,853][25689] Avg episode reward: [(0, '-26.918')] [2022-07-10 04:55:19,906][26022] Updated weights on worker 0-0, policy_version 575988 (0.00084) [2022-07-10 04:55:21,367][26022] Updated weights on worker 0-0, policy_version 575998 (0.00088) [2022-07-10 04:55:23,531][26022] Updated weights on worker 0-0, policy_version 576008 (0.00083) [2022-07-10 04:55:24,858][25689] Fps is (10 sec: 5809.5, 60 sec: 5673.6, 300 sec: 5631.4). Total num frames: 589841408. Throughput: 0: 5983.1. Samples: 589843592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:24,859][25689] Avg episode reward: [(0, '-26.812')] [2022-07-10 04:55:25,086][26022] Updated weights on worker 0-0, policy_version 576018 (0.00088) [2022-07-10 04:55:26,987][26022] Updated weights on worker 0-0, policy_version 576028 (0.00096) [2022-07-10 04:55:28,883][26022] Updated weights on worker 0-0, policy_version 576038 (0.00081) [2022-07-10 04:55:29,873][25689] Fps is (10 sec: 5722.6, 60 sec: 5640.0, 300 sec: 5634.9). Total num frames: 589868032. Throughput: 0: 5963.4. Samples: 589877702. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:29,873][25689] Avg episode reward: [(0, '-26.402')] [2022-07-10 04:55:30,570][26022] Updated weights on worker 0-0, policy_version 576048 (0.00094) [2022-07-10 04:55:32,448][26022] Updated weights on worker 0-0, policy_version 576058 (0.00090) [2022-07-10 04:55:34,187][26022] Updated weights on worker 0-0, policy_version 576068 (0.01180) [2022-07-10 04:55:34,881][25689] Fps is (10 sec: 5516.8, 60 sec: 5639.8, 300 sec: 5629.4). Total num frames: 589896704. Throughput: 0: 5120.9. Samples: 589894920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:34,883][25689] Avg episode reward: [(0, '-26.478')] [2022-07-10 04:55:35,853][26022] Updated weights on worker 0-0, policy_version 576078 (0.00088) [2022-07-10 04:55:37,956][26022] Updated weights on worker 0-0, policy_version 576088 (0.00081) [2022-07-10 04:55:39,638][26022] Updated weights on worker 0-0, policy_version 576098 (0.00087) [2022-07-10 04:55:39,931][25689] Fps is (10 sec: 5803.3, 60 sec: 5690.5, 300 sec: 5633.0). Total num frames: 589926400. Throughput: 0: 5962.8. Samples: 589929062. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:39,931][25689] Avg episode reward: [(0, '-26.772')] [2022-07-10 04:55:41,329][26022] Updated weights on worker 0-0, policy_version 576108 (0.00082) [2022-07-10 04:55:43,224][26022] Updated weights on worker 0-0, policy_version 576118 (0.00091) [2022-07-10 04:55:44,942][26022] Updated weights on worker 0-0, policy_version 576128 (0.00081) [2022-07-10 04:55:44,946][25689] Fps is (10 sec: 5697.3, 60 sec: 5656.0, 300 sec: 5633.0). Total num frames: 589954048. Throughput: 0: 5952.4. Samples: 589963166. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:44,947][25689] Avg episode reward: [(0, '-26.217')] [2022-07-10 04:55:46,850][26022] Updated weights on worker 0-0, policy_version 576138 (0.00097) [2022-07-10 04:55:48,636][26022] Updated weights on worker 0-0, policy_version 576148 (0.00087) [2022-07-10 04:55:49,947][25689] Fps is (10 sec: 5622.5, 60 sec: 5658.3, 300 sec: 5634.5). Total num frames: 589982720. Throughput: 0: 5105.4. Samples: 589980190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:49,948][25689] Avg episode reward: [(0, '-26.785')] [2022-07-10 04:55:50,539][26022] Updated weights on worker 0-0, policy_version 576158 (0.00097) [2022-07-10 04:55:52,406][26022] Updated weights on worker 0-0, policy_version 576168 (0.00093) [2022-07-10 04:55:54,171][26022] Updated weights on worker 0-0, policy_version 576178 (0.00087) [2022-07-10 04:55:54,951][25689] Fps is (10 sec: 5629.4, 60 sec: 5644.7, 300 sec: 5628.2). Total num frames: 590010368. Throughput: 0: 5933.4. Samples: 590014002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:54,951][25689] Avg episode reward: [(0, '-26.803')] [2022-07-10 04:55:55,872][26022] Updated weights on worker 0-0, policy_version 576188 (0.00087) [2022-07-10 04:55:57,770][26022] Updated weights on worker 0-0, policy_version 576198 (0.00097) [2022-07-10 04:55:59,524][26022] Updated weights on worker 0-0, policy_version 576208 (0.00090) [2022-07-10 04:55:59,995][25689] Fps is (10 sec: 5605.3, 60 sec: 5645.1, 300 sec: 5639.6). Total num frames: 590039040. Throughput: 0: 5920.7. Samples: 590047858. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:55:59,996][25689] Avg episode reward: [(0, '-24.566')] [2022-07-10 04:56:01,003][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:56:01,012][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000576216_590045184.pth [2022-07-10 04:56:01,012][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000574233_588014592.pth [2022-07-10 04:56:01,881][26022] Updated weights on worker 0-0, policy_version 576218 (0.00091) [2022-07-10 04:56:03,598][26022] Updated weights on worker 0-0, policy_version 576228 (0.00092) [2022-07-10 04:56:05,001][25689] Fps is (10 sec: 5399.9, 60 sec: 5628.2, 300 sec: 5627.1). Total num frames: 590064640. Throughput: 0: 4968.1. Samples: 590062802. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:05,002][25689] Avg episode reward: [(0, '-24.963')] [2022-07-10 04:56:05,239][26022] Updated weights on worker 0-0, policy_version 576238 (0.00083) [2022-07-10 04:56:07,083][26022] Updated weights on worker 0-0, policy_version 576248 (0.00488) [2022-07-10 04:56:08,752][26022] Updated weights on worker 0-0, policy_version 576258 (0.00099) [2022-07-10 04:56:10,007][25689] Fps is (10 sec: 5420.6, 60 sec: 5628.7, 300 sec: 5630.8). Total num frames: 590093312. Throughput: 0: 5823.2. Samples: 590097002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:10,008][25689] Avg episode reward: [(0, '-24.258')] [2022-07-10 04:56:10,743][26022] Updated weights on worker 0-0, policy_version 576268 (0.00090) [2022-07-10 04:56:12,492][26022] Updated weights on worker 0-0, policy_version 576278 (0.00085) [2022-07-10 04:56:14,356][26022] Updated weights on worker 0-0, policy_version 576288 (0.00092) [2022-07-10 04:56:15,022][25689] Fps is (10 sec: 5722.5, 60 sec: 5629.2, 300 sec: 5628.2). Total num frames: 590121984. Throughput: 0: 5831.8. Samples: 590131054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:15,022][25689] Avg episode reward: [(0, '-24.444')] [2022-07-10 04:56:16,141][26022] Updated weights on worker 0-0, policy_version 576298 (0.00082) [2022-07-10 04:56:18,043][26022] Updated weights on worker 0-0, policy_version 576308 (0.00095) [2022-07-10 04:56:19,574][26022] Updated weights on worker 0-0, policy_version 576318 (0.00093) [2022-07-10 04:56:20,090][25689] Fps is (10 sec: 5687.6, 60 sec: 5643.8, 300 sec: 5631.4). Total num frames: 590150656. Throughput: 0: 5000.3. Samples: 590148336. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:20,090][25689] Avg episode reward: [(0, '-24.710')] [2022-07-10 04:56:21,647][26022] Updated weights on worker 0-0, policy_version 576328 (0.00090) [2022-07-10 04:56:23,399][26022] Updated weights on worker 0-0, policy_version 576338 (0.00097) [2022-07-10 04:56:25,099][25689] Fps is (10 sec: 5690.5, 60 sec: 5609.4, 300 sec: 5629.3). Total num frames: 590179328. Throughput: 0: 5957.4. Samples: 590182532. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:25,099][25689] Avg episode reward: [(0, '-24.846')] [2022-07-10 04:56:25,119][26022] Updated weights on worker 0-0, policy_version 576348 (0.00088) [2022-07-10 04:56:26,993][26022] Updated weights on worker 0-0, policy_version 576358 (0.00089) [2022-07-10 04:56:28,729][26022] Updated weights on worker 0-0, policy_version 576368 (0.00081) [2022-07-10 04:56:30,107][25689] Fps is (10 sec: 5724.7, 60 sec: 5644.2, 300 sec: 5637.9). Total num frames: 590208000. Throughput: 0: 5948.8. Samples: 590216568. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:30,107][25689] Avg episode reward: [(0, '-25.165')] [2022-07-10 04:56:30,654][26022] Updated weights on worker 0-0, policy_version 576378 (0.00089) [2022-07-10 04:56:32,319][26022] Updated weights on worker 0-0, policy_version 576388 (0.00090) [2022-07-10 04:56:34,348][26022] Updated weights on worker 0-0, policy_version 576398 (0.00091) [2022-07-10 04:56:35,113][25689] Fps is (10 sec: 5726.1, 60 sec: 5644.3, 300 sec: 5632.8). Total num frames: 590236672. Throughput: 0: 5099.8. Samples: 590233516. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:35,114][25689] Avg episode reward: [(0, '-26.752')] [2022-07-10 04:56:35,904][26022] Updated weights on worker 0-0, policy_version 576408 (0.00060) [2022-07-10 04:56:37,885][26022] Updated weights on worker 0-0, policy_version 576418 (0.00086) [2022-07-10 04:56:39,323][26022] Updated weights on worker 0-0, policy_version 576428 (0.00057) [2022-07-10 04:56:40,171][25689] Fps is (10 sec: 5697.9, 60 sec: 5626.6, 300 sec: 5632.9). Total num frames: 590265344. Throughput: 0: 5955.4. Samples: 590267926. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:40,171][25689] Avg episode reward: [(0, '-26.225')] [2022-07-10 04:56:41,486][26022] Updated weights on worker 0-0, policy_version 576438 (0.00088) [2022-07-10 04:56:43,160][26022] Updated weights on worker 0-0, policy_version 576448 (0.00090) [2022-07-10 04:56:44,911][26022] Updated weights on worker 0-0, policy_version 576458 (0.00087) [2022-07-10 04:56:45,219][25689] Fps is (10 sec: 5674.7, 60 sec: 5640.5, 300 sec: 5630.1). Total num frames: 590294016. Throughput: 0: 5933.7. Samples: 590301916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:45,219][25689] Avg episode reward: [(0, '-26.231')] [2022-07-10 04:56:46,979][26022] Updated weights on worker 0-0, policy_version 576468 (0.00092) [2022-07-10 04:56:48,447][26022] Updated weights on worker 0-0, policy_version 576478 (0.00092) [2022-07-10 04:56:50,238][25689] Fps is (10 sec: 5695.8, 60 sec: 5638.8, 300 sec: 5637.5). Total num frames: 590322688. Throughput: 0: 5092.0. Samples: 590319080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:50,240][25689] Avg episode reward: [(0, '-26.166')] [2022-07-10 04:56:50,435][26022] Updated weights on worker 0-0, policy_version 576488 (0.00087) [2022-07-10 04:56:52,088][26022] Updated weights on worker 0-0, policy_version 576498 (0.00088) [2022-07-10 04:56:53,972][26022] Updated weights on worker 0-0, policy_version 576508 (0.00085) [2022-07-10 04:56:55,252][25689] Fps is (10 sec: 5715.6, 60 sec: 5654.8, 300 sec: 5639.6). Total num frames: 590351360. Throughput: 0: 5951.9. Samples: 590353376. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:56:55,252][25689] Avg episode reward: [(0, '-25.160')] [2022-07-10 04:56:56,066][26022] Updated weights on worker 0-0, policy_version 576518 (0.00609) [2022-07-10 04:56:57,385][26022] Updated weights on worker 0-0, policy_version 576528 (0.00086) [2022-07-10 04:56:59,550][26022] Updated weights on worker 0-0, policy_version 576538 (0.00089) [2022-07-10 04:57:00,379][25689] Fps is (10 sec: 5654.8, 60 sec: 5647.1, 300 sec: 5637.9). Total num frames: 590380032. Throughput: 0: 5908.3. Samples: 590387324. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:57:00,380][25689] Avg episode reward: [(0, '-25.019')] [2022-07-10 04:57:01,087][26022] Updated weights on worker 0-0, policy_version 576548 (0.00089) [2022-07-10 04:57:03,398][26022] Updated weights on worker 0-0, policy_version 576558 (0.00087) [2022-07-10 04:57:05,287][26022] Updated weights on worker 0-0, policy_version 576568 (0.00088) [2022-07-10 04:57:05,405][25689] Fps is (10 sec: 5345.2, 60 sec: 5645.2, 300 sec: 5635.5). Total num frames: 590405632. Throughput: 0: 4966.5. Samples: 590402172. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:57:05,406][25689] Avg episode reward: [(0, '-24.192')] [2022-07-10 04:57:06,834][26022] Updated weights on worker 0-0, policy_version 576578 (0.00086) [2022-07-10 04:57:08,840][26022] Updated weights on worker 0-0, policy_version 576588 (0.00087) [2022-07-10 04:57:10,424][25689] Fps is (10 sec: 5505.0, 60 sec: 5661.0, 300 sec: 5636.4). Total num frames: 590435328. Throughput: 0: 5811.4. Samples: 590436386. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:57:10,424][25689] Avg episode reward: [(0, '-24.285')] [2022-07-10 04:57:10,463][26022] Updated weights on worker 0-0, policy_version 576598 (0.00087) [2022-07-10 04:57:12,413][26022] Updated weights on worker 0-0, policy_version 576608 (0.00087) [2022-07-10 04:57:14,318][26022] Updated weights on worker 0-0, policy_version 576618 (0.00054) [2022-07-10 04:57:15,499][25689] Fps is (10 sec: 5782.3, 60 sec: 5655.3, 300 sec: 5629.6). Total num frames: 590464000. Throughput: 0: 5797.4. Samples: 590470760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:57:15,500][25689] Avg episode reward: [(0, '-24.393')] [2022-07-10 04:57:15,867][26022] Updated weights on worker 0-0, policy_version 576628 (0.00081) [2022-07-10 04:57:17,857][26022] Updated weights on worker 0-0, policy_version 576638 (0.00084) [2022-07-10 04:57:19,550][26022] Updated weights on worker 0-0, policy_version 576648 (0.00093) [2022-07-10 04:57:20,573][25689] Fps is (10 sec: 5650.2, 60 sec: 5654.7, 300 sec: 5633.5). Total num frames: 590492672. Throughput: 0: 4980.1. Samples: 590487890. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 04:57:20,574][25689] Avg episode reward: [(0, '-25.546')] [2022-07-10 04:57:21,222][26022] Updated weights on worker 0-0, policy_version 576658 (0.00084) [2022-07-10 04:57:22,975][26022] Updated weights on worker 0-0, policy_version 576668 (0.00084) [2022-07-10 04:57:25,088][26022] Updated weights on worker 0-0, policy_version 576678 (0.00088) [2022-07-10 04:57:25,670][25689] Fps is (10 sec: 5638.3, 60 sec: 5646.5, 300 sec: 5632.4). Total num frames: 590521344. Throughput: 0: 5937.6. Samples: 590522496. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:25,670][25689] Avg episode reward: [(0, '-25.975')] [2022-07-10 04:57:26,610][26022] Updated weights on worker 0-0, policy_version 576688 (0.00089) [2022-07-10 04:57:28,460][26022] Updated weights on worker 0-0, policy_version 576698 (0.00087) [2022-07-10 04:57:30,232][26022] Updated weights on worker 0-0, policy_version 576708 (0.00089) [2022-07-10 04:57:30,737][25689] Fps is (10 sec: 5742.5, 60 sec: 5657.9, 300 sec: 5634.8). Total num frames: 590551040. Throughput: 0: 5914.5. Samples: 590556528. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:30,738][25689] Avg episode reward: [(0, '-26.405')] [2022-07-10 04:57:32,204][26022] Updated weights on worker 0-0, policy_version 576718 (0.00086) [2022-07-10 04:57:33,875][26022] Updated weights on worker 0-0, policy_version 576728 (0.00094) [2022-07-10 04:57:35,725][26022] Updated weights on worker 0-0, policy_version 576738 (0.00088) [2022-07-10 04:57:35,769][25689] Fps is (10 sec: 5779.3, 60 sec: 5655.5, 300 sec: 5635.9). Total num frames: 590579712. Throughput: 0: 5076.2. Samples: 590573654. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:35,770][25689] Avg episode reward: [(0, '-26.726')] [2022-07-10 04:57:37,418][26022] Updated weights on worker 0-0, policy_version 576748 (0.00086) [2022-07-10 04:57:39,351][26022] Updated weights on worker 0-0, policy_version 576758 (0.00088) [2022-07-10 04:57:40,840][25689] Fps is (10 sec: 5676.1, 60 sec: 5654.3, 300 sec: 5635.3). Total num frames: 590608384. Throughput: 0: 5912.9. Samples: 590607726. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:40,840][25689] Avg episode reward: [(0, '-27.398')] [2022-07-10 04:57:41,068][26022] Updated weights on worker 0-0, policy_version 576768 (0.00090) [2022-07-10 04:57:42,939][26022] Updated weights on worker 0-0, policy_version 576778 (0.00082) [2022-07-10 04:57:44,655][26022] Updated weights on worker 0-0, policy_version 576788 (0.00087) [2022-07-10 04:57:45,882][25689] Fps is (10 sec: 5569.5, 60 sec: 5638.0, 300 sec: 5631.3). Total num frames: 590636032. Throughput: 0: 5918.7. Samples: 590642122. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:45,882][25689] Avg episode reward: [(0, '-26.176')] [2022-07-10 04:57:46,572][26022] Updated weights on worker 0-0, policy_version 576798 (0.00087) [2022-07-10 04:57:48,358][26022] Updated weights on worker 0-0, policy_version 576808 (0.00084) [2022-07-10 04:57:49,965][26022] Updated weights on worker 0-0, policy_version 576818 (0.00088) [2022-07-10 04:57:50,936][25689] Fps is (10 sec: 5781.4, 60 sec: 5668.5, 300 sec: 5634.5). Total num frames: 590666752. Throughput: 0: 5085.9. Samples: 590659256. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:50,936][25689] Avg episode reward: [(0, '-26.148')] [2022-07-10 04:57:51,871][26022] Updated weights on worker 0-0, policy_version 576828 (0.00084) [2022-07-10 04:57:53,569][26022] Updated weights on worker 0-0, policy_version 576838 (0.00084) [2022-07-10 04:57:55,498][26022] Updated weights on worker 0-0, policy_version 576848 (0.00088) [2022-07-10 04:57:55,997][25689] Fps is (10 sec: 5770.0, 60 sec: 5647.1, 300 sec: 5634.4). Total num frames: 590694400. Throughput: 0: 5948.9. Samples: 590693988. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:57:55,998][25689] Avg episode reward: [(0, '-25.279')] [2022-07-10 04:57:57,055][26022] Updated weights on worker 0-0, policy_version 576858 (0.00090) [2022-07-10 04:57:59,040][26022] Updated weights on worker 0-0, policy_version 576868 (0.00088) [2022-07-10 04:58:00,662][26022] Updated weights on worker 0-0, policy_version 576878 (0.00088) [2022-07-10 04:58:01,091][25689] Fps is (10 sec: 5646.8, 60 sec: 5667.2, 300 sec: 5646.8). Total num frames: 590724096. Throughput: 0: 5945.5. Samples: 590728128. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:01,091][25689] Avg episode reward: [(0, '-25.550')] [2022-07-10 04:58:01,306][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 04:58:01,318][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000576880_590725120.pth [2022-07-10 04:58:01,318][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000574894_588691456.pth [2022-07-10 04:58:03,014][26022] Updated weights on worker 0-0, policy_version 576888 (0.00081) [2022-07-10 04:58:04,735][26022] Updated weights on worker 0-0, policy_version 576898 (0.00085) [2022-07-10 04:58:06,099][25689] Fps is (10 sec: 5575.4, 60 sec: 5685.7, 300 sec: 5633.3). Total num frames: 590750720. Throughput: 0: 5006.4. Samples: 590743338. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:06,099][25689] Avg episode reward: [(0, '-25.554')] [2022-07-10 04:58:06,584][26022] Updated weights on worker 0-0, policy_version 576908 (0.00106) [2022-07-10 04:58:08,292][26022] Updated weights on worker 0-0, policy_version 576918 (0.00089) [2022-07-10 04:58:10,228][26022] Updated weights on worker 0-0, policy_version 576928 (0.00105) [2022-07-10 04:58:11,125][25689] Fps is (10 sec: 5408.7, 60 sec: 5651.3, 300 sec: 5640.0). Total num frames: 590778368. Throughput: 0: 5865.9. Samples: 590777684. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:11,125][25689] Avg episode reward: [(0, '-25.550')] [2022-07-10 04:58:11,887][26022] Updated weights on worker 0-0, policy_version 576938 (0.00086) [2022-07-10 04:58:13,844][26022] Updated weights on worker 0-0, policy_version 576948 (0.00086) [2022-07-10 04:58:15,591][26022] Updated weights on worker 0-0, policy_version 576958 (0.00108) [2022-07-10 04:58:16,138][25689] Fps is (10 sec: 5609.9, 60 sec: 5657.1, 300 sec: 5641.3). Total num frames: 590807040. Throughput: 0: 5842.5. Samples: 590811662. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:16,139][25689] Avg episode reward: [(0, '-25.851')] [2022-07-10 04:58:17,411][26022] Updated weights on worker 0-0, policy_version 576968 (0.00085) [2022-07-10 04:58:19,237][26022] Updated weights on worker 0-0, policy_version 576978 (0.00082) [2022-07-10 04:58:21,041][26022] Updated weights on worker 0-0, policy_version 576988 (0.00078) [2022-07-10 04:58:21,237][25689] Fps is (10 sec: 5873.5, 60 sec: 5688.5, 300 sec: 5639.5). Total num frames: 590837760. Throughput: 0: 5832.9. Samples: 590845638. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:21,237][25689] Avg episode reward: [(0, '-26.081')] [2022-07-10 04:58:23,081][26022] Updated weights on worker 0-0, policy_version 576998 (0.00084) [2022-07-10 04:58:24,527][26022] Updated weights on worker 0-0, policy_version 577008 (0.00058) [2022-07-10 04:58:26,278][25689] Fps is (10 sec: 5655.2, 60 sec: 5659.9, 300 sec: 5635.4). Total num frames: 590864384. Throughput: 0: 5921.2. Samples: 590862824. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:26,279][25689] Avg episode reward: [(0, '-26.571')] [2022-07-10 04:58:26,638][26022] Updated weights on worker 0-0, policy_version 577018 (0.00086) [2022-07-10 04:58:28,294][26022] Updated weights on worker 0-0, policy_version 577028 (0.00087) [2022-07-10 04:58:30,296][26022] Updated weights on worker 0-0, policy_version 577038 (0.00089) [2022-07-10 04:58:31,300][25689] Fps is (10 sec: 5494.8, 60 sec: 5647.3, 300 sec: 5638.5). Total num frames: 590893056. Throughput: 0: 5900.0. Samples: 590896718. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:31,301][25689] Avg episode reward: [(0, '-26.068')] [2022-07-10 04:58:31,970][26022] Updated weights on worker 0-0, policy_version 577048 (0.00101) [2022-07-10 04:58:33,686][26022] Updated weights on worker 0-0, policy_version 577058 (0.00088) [2022-07-10 04:58:35,599][26022] Updated weights on worker 0-0, policy_version 577068 (0.00100) [2022-07-10 04:58:36,333][25689] Fps is (10 sec: 5703.0, 60 sec: 5647.2, 300 sec: 5639.8). Total num frames: 590921728. Throughput: 0: 5905.3. Samples: 590930920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:36,335][25689] Avg episode reward: [(0, '-25.024')] [2022-07-10 04:58:37,235][26022] Updated weights on worker 0-0, policy_version 577078 (0.00080) [2022-07-10 04:58:39,129][26022] Updated weights on worker 0-0, policy_version 577088 (0.00092) [2022-07-10 04:58:40,843][26022] Updated weights on worker 0-0, policy_version 577098 (0.00091) [2022-07-10 04:58:41,451][25689] Fps is (10 sec: 5750.1, 60 sec: 5659.7, 300 sec: 5638.1). Total num frames: 590951424. Throughput: 0: 5060.6. Samples: 590947934. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:41,451][25689] Avg episode reward: [(0, '-25.013')] [2022-07-10 04:58:42,775][26022] Updated weights on worker 0-0, policy_version 577108 (0.00496) [2022-07-10 04:58:44,433][26022] Updated weights on worker 0-0, policy_version 577118 (0.00086) [2022-07-10 04:58:46,410][26022] Updated weights on worker 0-0, policy_version 577128 (0.00080) [2022-07-10 04:58:46,491][25689] Fps is (10 sec: 5645.4, 60 sec: 5659.8, 300 sec: 5637.7). Total num frames: 590979072. Throughput: 0: 5905.1. Samples: 590982182. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:46,492][25689] Avg episode reward: [(0, '-23.441')] [2022-07-10 04:58:48,135][26022] Updated weights on worker 0-0, policy_version 577138 (0.00086) [2022-07-10 04:58:49,880][26022] Updated weights on worker 0-0, policy_version 577148 (0.00085) [2022-07-10 04:58:51,503][25689] Fps is (10 sec: 5501.1, 60 sec: 5613.1, 300 sec: 5637.7). Total num frames: 591006720. Throughput: 0: 5907.9. Samples: 591016072. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:51,503][25689] Avg episode reward: [(0, '-23.192')] [2022-07-10 04:58:51,807][26022] Updated weights on worker 0-0, policy_version 577158 (0.00083) [2022-07-10 04:58:53,481][26022] Updated weights on worker 0-0, policy_version 577168 (0.00086) [2022-07-10 04:58:55,307][26022] Updated weights on worker 0-0, policy_version 577178 (0.00089) [2022-07-10 04:58:56,523][25689] Fps is (10 sec: 5716.2, 60 sec: 5650.8, 300 sec: 5646.3). Total num frames: 591036416. Throughput: 0: 5073.7. Samples: 591033356. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:58:56,523][25689] Avg episode reward: [(0, '-23.538')] [2022-07-10 04:58:57,019][26022] Updated weights on worker 0-0, policy_version 577188 (0.00086) [2022-07-10 04:58:58,909][26022] Updated weights on worker 0-0, policy_version 577198 (0.00094) [2022-07-10 04:59:00,855][26022] Updated weights on worker 0-0, policy_version 577208 (0.00116) [2022-07-10 04:59:01,575][25689] Fps is (10 sec: 5795.1, 60 sec: 5637.7, 300 sec: 5652.7). Total num frames: 591065088. Throughput: 0: 5943.4. Samples: 591067538. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:01,575][25689] Avg episode reward: [(0, '-23.731')] [2022-07-10 04:59:02,824][26022] Updated weights on worker 0-0, policy_version 577218 (0.00091) [2022-07-10 04:59:04,781][26022] Updated weights on worker 0-0, policy_version 577228 (0.00092) [2022-07-10 04:59:06,419][26022] Updated weights on worker 0-0, policy_version 577238 (0.00092) [2022-07-10 04:59:06,598][25689] Fps is (10 sec: 5488.4, 60 sec: 5636.3, 300 sec: 5642.5). Total num frames: 591091712. Throughput: 0: 5823.9. Samples: 591099282. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:06,598][25689] Avg episode reward: [(0, '-24.750')] [2022-07-10 04:59:08,412][26022] Updated weights on worker 0-0, policy_version 577248 (0.00091) [2022-07-10 04:59:10,271][26022] Updated weights on worker 0-0, policy_version 577258 (0.00081) [2022-07-10 04:59:11,678][25689] Fps is (10 sec: 5472.8, 60 sec: 5648.2, 300 sec: 5648.0). Total num frames: 591120384. Throughput: 0: 4965.9. Samples: 591116262. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:11,679][25689] Avg episode reward: [(0, '-25.862')] [2022-07-10 04:59:11,972][26022] Updated weights on worker 0-0, policy_version 577268 (0.00085) [2022-07-10 04:59:13,956][26022] Updated weights on worker 0-0, policy_version 577278 (0.00084) [2022-07-10 04:59:15,472][26022] Updated weights on worker 0-0, policy_version 577288 (0.00092) [2022-07-10 04:59:16,697][25689] Fps is (10 sec: 5678.4, 60 sec: 5647.7, 300 sec: 5641.6). Total num frames: 591149056. Throughput: 0: 5803.1. Samples: 591150426. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:16,697][25689] Avg episode reward: [(0, '-26.591')] [2022-07-10 04:59:17,362][26022] Updated weights on worker 0-0, policy_version 577298 (0.00091) [2022-07-10 04:59:19,162][26022] Updated weights on worker 0-0, policy_version 577308 (0.00093) [2022-07-10 04:59:20,861][26022] Updated weights on worker 0-0, policy_version 577318 (0.00094) [2022-07-10 04:59:21,735][25689] Fps is (10 sec: 5702.2, 60 sec: 5619.5, 300 sec: 5647.9). Total num frames: 591177728. Throughput: 0: 5793.4. Samples: 591184334. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:21,735][25689] Avg episode reward: [(0, '-27.093')] [2022-07-10 04:59:22,908][26022] Updated weights on worker 0-0, policy_version 577328 (0.00094) [2022-07-10 04:59:24,579][26022] Updated weights on worker 0-0, policy_version 577338 (0.00087) [2022-07-10 04:59:26,672][26022] Updated weights on worker 0-0, policy_version 577348 (0.00100) [2022-07-10 04:59:26,806][25689] Fps is (10 sec: 5672.2, 60 sec: 5650.6, 300 sec: 5647.0). Total num frames: 591206400. Throughput: 0: 5044.7. Samples: 591201226. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:26,807][25689] Avg episode reward: [(0, '-26.968')] [2022-07-10 04:59:28,225][26022] Updated weights on worker 0-0, policy_version 577358 (0.00079) [2022-07-10 04:59:30,135][26022] Updated weights on worker 0-0, policy_version 577368 (0.00085) [2022-07-10 04:59:31,849][25689] Fps is (10 sec: 5568.5, 60 sec: 5631.7, 300 sec: 5642.8). Total num frames: 591234048. Throughput: 0: 5890.0. Samples: 591235066. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:31,851][25689] Avg episode reward: [(0, '-26.305')] [2022-07-10 04:59:31,963][26022] Updated weights on worker 0-0, policy_version 577378 (0.00092) [2022-07-10 04:59:33,708][26022] Updated weights on worker 0-0, policy_version 577388 (0.00089) [2022-07-10 04:59:35,625][26022] Updated weights on worker 0-0, policy_version 577398 (0.00089) [2022-07-10 04:59:36,863][25689] Fps is (10 sec: 5498.6, 60 sec: 5616.6, 300 sec: 5646.9). Total num frames: 591261696. Throughput: 0: 5868.8. Samples: 591268776. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:36,864][25689] Avg episode reward: [(0, '-26.043')] [2022-07-10 04:59:37,373][26022] Updated weights on worker 0-0, policy_version 577408 (0.00097) [2022-07-10 04:59:39,415][26022] Updated weights on worker 0-0, policy_version 577418 (0.00081) [2022-07-10 04:59:40,987][26022] Updated weights on worker 0-0, policy_version 577428 (0.00088) [2022-07-10 04:59:41,919][25689] Fps is (10 sec: 5491.0, 60 sec: 5588.4, 300 sec: 5639.1). Total num frames: 591289344. Throughput: 0: 5003.8. Samples: 591285332. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:41,920][25689] Avg episode reward: [(0, '-25.561')] [2022-07-10 04:59:42,981][26022] Updated weights on worker 0-0, policy_version 577438 (0.00087) [2022-07-10 04:59:44,576][26022] Updated weights on worker 0-0, policy_version 577448 (0.00084) [2022-07-10 04:59:46,457][26022] Updated weights on worker 0-0, policy_version 577458 (0.00091) [2022-07-10 04:59:46,962][25689] Fps is (10 sec: 5576.5, 60 sec: 5605.1, 300 sec: 5638.8). Total num frames: 591318016. Throughput: 0: 5858.3. Samples: 591319306. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:46,963][25689] Avg episode reward: [(0, '-24.984')] [2022-07-10 04:59:48,503][26022] Updated weights on worker 0-0, policy_version 577468 (0.00088) [2022-07-10 04:59:50,123][26022] Updated weights on worker 0-0, policy_version 577478 (0.00089) [2022-07-10 04:59:51,965][25689] Fps is (10 sec: 5708.0, 60 sec: 5622.8, 300 sec: 5639.5). Total num frames: 591346688. Throughput: 0: 5886.5. Samples: 591353482. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:51,966][25689] Avg episode reward: [(0, '-23.625')] [2022-07-10 04:59:52,049][26022] Updated weights on worker 0-0, policy_version 577488 (0.00090) [2022-07-10 04:59:53,603][26022] Updated weights on worker 0-0, policy_version 577498 (0.00094) [2022-07-10 04:59:55,608][26022] Updated weights on worker 0-0, policy_version 577508 (0.00093) [2022-07-10 04:59:56,995][25689] Fps is (10 sec: 5817.9, 60 sec: 5621.9, 300 sec: 5643.3). Total num frames: 591376384. Throughput: 0: 5058.4. Samples: 591370612. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 04:59:56,995][25689] Avg episode reward: [(0, '-23.355')] [2022-07-10 04:59:57,436][26022] Updated weights on worker 0-0, policy_version 577518 (0.00086) [2022-07-10 04:59:59,086][26022] Updated weights on worker 0-0, policy_version 577528 (0.00090) [2022-07-10 05:00:00,914][26022] Updated weights on worker 0-0, policy_version 577538 (0.00084) [2022-07-10 05:00:01,766][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:00:01,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000577541_591401984.pth [2022-07-10 05:00:01,789][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000575555_589368320.pth [2022-07-10 05:00:02,106][25689] Fps is (10 sec: 5554.2, 60 sec: 5582.6, 300 sec: 5641.3). Total num frames: 591403008. Throughput: 0: 5921.5. Samples: 591404866. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 05:00:02,106][25689] Avg episode reward: [(0, '-23.144')] [2022-07-10 05:00:03,096][26022] Updated weights on worker 0-0, policy_version 577548 (0.00092) [2022-07-10 05:00:04,918][26022] Updated weights on worker 0-0, policy_version 577558 (0.00089) [2022-07-10 05:00:06,880][26022] Updated weights on worker 0-0, policy_version 577568 (0.00083) [2022-07-10 05:00:07,166][25689] Fps is (10 sec: 5436.5, 60 sec: 5613.0, 300 sec: 5640.4). Total num frames: 591431680. Throughput: 0: 5802.1. Samples: 591436530. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 05:00:07,166][25689] Avg episode reward: [(0, '-23.328')] [2022-07-10 05:00:08,543][26022] Updated weights on worker 0-0, policy_version 577578 (0.00085) [2022-07-10 05:00:10,428][26022] Updated weights on worker 0-0, policy_version 577588 (0.00091) [2022-07-10 05:00:11,994][26022] Updated weights on worker 0-0, policy_version 577598 (0.00085) [2022-07-10 05:00:12,179][25689] Fps is (10 sec: 5794.1, 60 sec: 5636.1, 300 sec: 5644.0). Total num frames: 591461376. Throughput: 0: 4959.2. Samples: 591453726. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 05:00:12,180][25689] Avg episode reward: [(0, '-23.697')] [2022-07-10 05:00:13,929][26022] Updated weights on worker 0-0, policy_version 577608 (0.00087) [2022-07-10 05:00:15,642][26022] Updated weights on worker 0-0, policy_version 577618 (0.00090) [2022-07-10 05:00:17,198][25689] Fps is (10 sec: 5613.7, 60 sec: 5602.2, 300 sec: 5640.9). Total num frames: 591488000. Throughput: 0: 5824.7. Samples: 591488292. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 05:00:17,199][25689] Avg episode reward: [(0, '-24.288')] [2022-07-10 05:00:17,587][26022] Updated weights on worker 0-0, policy_version 577628 (0.00091) [2022-07-10 05:00:19,291][26022] Updated weights on worker 0-0, policy_version 577638 (0.00085) [2022-07-10 05:00:21,122][26022] Updated weights on worker 0-0, policy_version 577648 (0.00086) [2022-07-10 05:00:22,268][25689] Fps is (10 sec: 5684.2, 60 sec: 5633.2, 300 sec: 5639.7). Total num frames: 591518720. Throughput: 0: 5834.6. Samples: 591522504. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2022-07-10 05:00:22,268][25689] Avg episode reward: [(0, '-23.966')] [2022-07-10 05:00:22,797][26022] Updated weights on worker 0-0, policy_version 577658 (0.00087) [2022-07-10 05:00:24,596][26022] Updated weights on worker 0-0, policy_version 577668 (0.00089) [2022-07-10 05:00:26,427][26022] Updated weights on worker 0-0, policy_version 577678 (0.00090) [2022-07-10 05:00:27,269][25689] Fps is (10 sec: 5694.2, 60 sec: 5605.8, 300 sec: 5640.0). Total num frames: 591545344. Throughput: 0: 5131.8. Samples: 591539696. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:27,269][25689] Avg episode reward: [(0, '-25.076')] [2022-07-10 05:00:28,251][26022] Updated weights on worker 0-0, policy_version 577688 (0.00084) [2022-07-10 05:00:30,167][26022] Updated weights on worker 0-0, policy_version 577698 (0.00101) [2022-07-10 05:00:31,887][26022] Updated weights on worker 0-0, policy_version 577708 (0.00092) [2022-07-10 05:00:32,353][25689] Fps is (10 sec: 5584.5, 60 sec: 5635.9, 300 sec: 5642.0). Total num frames: 591575040. Throughput: 0: 5948.9. Samples: 591573736. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:32,354][25689] Avg episode reward: [(0, '-25.825')] [2022-07-10 05:00:33,896][26022] Updated weights on worker 0-0, policy_version 577718 (0.00093) [2022-07-10 05:00:35,653][26022] Updated weights on worker 0-0, policy_version 577728 (0.00089) [2022-07-10 05:00:37,395][25689] Fps is (10 sec: 5663.1, 60 sec: 5633.2, 300 sec: 5635.3). Total num frames: 591602688. Throughput: 0: 5894.0. Samples: 591607330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:37,395][25689] Avg episode reward: [(0, '-25.310')] [2022-07-10 05:00:37,419][26022] Updated weights on worker 0-0, policy_version 577738 (0.00092) [2022-07-10 05:00:39,166][26022] Updated weights on worker 0-0, policy_version 577748 (0.00089) [2022-07-10 05:00:41,071][26022] Updated weights on worker 0-0, policy_version 577758 (0.00085) [2022-07-10 05:00:42,434][25689] Fps is (10 sec: 5586.7, 60 sec: 5651.8, 300 sec: 5638.3). Total num frames: 591631360. Throughput: 0: 5890.4. Samples: 591641292. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:42,434][25689] Avg episode reward: [(0, '-25.323')] [2022-07-10 05:00:42,912][26022] Updated weights on worker 0-0, policy_version 577768 (0.00088) [2022-07-10 05:00:44,580][26022] Updated weights on worker 0-0, policy_version 577778 (0.00091) [2022-07-10 05:00:46,470][26022] Updated weights on worker 0-0, policy_version 577788 (0.00089) [2022-07-10 05:00:47,458][25689] Fps is (10 sec: 5698.6, 60 sec: 5653.6, 300 sec: 5637.8). Total num frames: 591660032. Throughput: 0: 5873.4. Samples: 591658272. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:47,458][25689] Avg episode reward: [(0, '-25.778')] [2022-07-10 05:00:48,169][26022] Updated weights on worker 0-0, policy_version 577798 (0.00088) [2022-07-10 05:00:50,198][26022] Updated weights on worker 0-0, policy_version 577808 (0.00090) [2022-07-10 05:00:51,930][26022] Updated weights on worker 0-0, policy_version 577818 (0.00085) [2022-07-10 05:00:52,460][25689] Fps is (10 sec: 5719.2, 60 sec: 5653.6, 300 sec: 5641.3). Total num frames: 591688704. Throughput: 0: 5882.3. Samples: 591692016. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:52,461][25689] Avg episode reward: [(0, '-25.278')] [2022-07-10 05:00:53,699][26022] Updated weights on worker 0-0, policy_version 577828 (0.00081) [2022-07-10 05:00:55,444][26022] Updated weights on worker 0-0, policy_version 577838 (0.00088) [2022-07-10 05:00:57,480][25689] Fps is (10 sec: 5517.3, 60 sec: 5603.7, 300 sec: 5634.9). Total num frames: 591715328. Throughput: 0: 5902.8. Samples: 591725888. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:00:57,481][25689] Avg episode reward: [(0, '-25.557')] [2022-07-10 05:00:57,631][26022] Updated weights on worker 0-0, policy_version 577848 (0.00097) [2022-07-10 05:00:59,155][26022] Updated weights on worker 0-0, policy_version 577858 (0.00445) [2022-07-10 05:01:01,120][26022] Updated weights on worker 0-0, policy_version 577868 (0.00279) [2022-07-10 05:01:02,604][25689] Fps is (10 sec: 5451.3, 60 sec: 5636.4, 300 sec: 5643.0). Total num frames: 591744000. Throughput: 0: 5028.5. Samples: 591742718. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:02,605][25689] Avg episode reward: [(0, '-23.908')] [2022-07-10 05:01:03,273][26022] Updated weights on worker 0-0, policy_version 577878 (0.00091) [2022-07-10 05:01:05,256][26022] Updated weights on worker 0-0, policy_version 577888 (0.00092) [2022-07-10 05:01:07,188][26022] Updated weights on worker 0-0, policy_version 577898 (0.00090) [2022-07-10 05:01:07,622][25689] Fps is (10 sec: 5452.4, 60 sec: 5606.5, 300 sec: 5635.9). Total num frames: 591770624. Throughput: 0: 5728.7. Samples: 591773784. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:07,628][25689] Avg episode reward: [(0, '-23.688')] [2022-07-10 05:01:08,719][26022] Updated weights on worker 0-0, policy_version 577908 (0.00085) [2022-07-10 05:01:10,535][26022] Updated weights on worker 0-0, policy_version 577918 (0.00085) [2022-07-10 05:01:12,647][26022] Updated weights on worker 0-0, policy_version 577928 (0.00091) [2022-07-10 05:01:12,654][25689] Fps is (10 sec: 5298.4, 60 sec: 5553.9, 300 sec: 5628.6). Total num frames: 591797248. Throughput: 0: 5744.6. Samples: 591808020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:12,654][25689] Avg episode reward: [(0, '-23.589')] [2022-07-10 05:01:14,279][26022] Updated weights on worker 0-0, policy_version 577938 (0.00095) [2022-07-10 05:01:16,087][26022] Updated weights on worker 0-0, policy_version 577948 (0.00093) [2022-07-10 05:01:17,705][25689] Fps is (10 sec: 5686.8, 60 sec: 5618.7, 300 sec: 5635.8). Total num frames: 591827968. Throughput: 0: 4905.2. Samples: 591825096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:17,706][25689] Avg episode reward: [(0, '-24.653')] [2022-07-10 05:01:18,073][26022] Updated weights on worker 0-0, policy_version 577958 (0.00111) [2022-07-10 05:01:19,685][26022] Updated weights on worker 0-0, policy_version 577968 (0.00087) [2022-07-10 05:01:21,687][26022] Updated weights on worker 0-0, policy_version 577978 (0.00091) [2022-07-10 05:01:22,783][25689] Fps is (10 sec: 5762.4, 60 sec: 5567.1, 300 sec: 5631.1). Total num frames: 591855616. Throughput: 0: 5758.1. Samples: 591858910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:22,784][25689] Avg episode reward: [(0, '-24.231')] [2022-07-10 05:01:23,048][26022] Updated weights on worker 0-0, policy_version 577988 (0.00088) [2022-07-10 05:01:25,239][26022] Updated weights on worker 0-0, policy_version 577998 (0.00087) [2022-07-10 05:01:26,894][26022] Updated weights on worker 0-0, policy_version 578008 (0.00620) [2022-07-10 05:01:27,801][25689] Fps is (10 sec: 5679.9, 60 sec: 5616.3, 300 sec: 5634.3). Total num frames: 591885312. Throughput: 0: 5901.6. Samples: 591892876. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:27,802][25689] Avg episode reward: [(0, '-24.320')] [2022-07-10 05:01:28,899][26022] Updated weights on worker 0-0, policy_version 578018 (0.00086) [2022-07-10 05:01:30,292][26022] Updated weights on worker 0-0, policy_version 578028 (0.00091) [2022-07-10 05:01:32,385][26022] Updated weights on worker 0-0, policy_version 578038 (0.00090) [2022-07-10 05:01:32,839][25689] Fps is (10 sec: 5702.3, 60 sec: 5586.7, 300 sec: 5630.3). Total num frames: 591912960. Throughput: 0: 5050.8. Samples: 591909972. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:32,840][25689] Avg episode reward: [(0, '-24.728')] [2022-07-10 05:01:34,100][26022] Updated weights on worker 0-0, policy_version 578048 (0.00087) [2022-07-10 05:01:35,967][26022] Updated weights on worker 0-0, policy_version 578058 (0.00091) [2022-07-10 05:01:37,819][26022] Updated weights on worker 0-0, policy_version 578068 (0.00086) [2022-07-10 05:01:37,891][25689] Fps is (10 sec: 5582.1, 60 sec: 5602.8, 300 sec: 5630.4). Total num frames: 591941632. Throughput: 0: 5883.3. Samples: 591943852. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:37,891][25689] Avg episode reward: [(0, '-25.317')] [2022-07-10 05:01:39,467][26022] Updated weights on worker 0-0, policy_version 578078 (0.00084) [2022-07-10 05:01:41,548][26022] Updated weights on worker 0-0, policy_version 578088 (0.00090) [2022-07-10 05:01:42,955][25689] Fps is (10 sec: 5770.2, 60 sec: 5617.4, 300 sec: 5633.6). Total num frames: 591971328. Throughput: 0: 5903.7. Samples: 591977998. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:42,955][25689] Avg episode reward: [(0, '-24.558')] [2022-07-10 05:01:43,123][26022] Updated weights on worker 0-0, policy_version 578098 (0.00090) [2022-07-10 05:01:45,085][26022] Updated weights on worker 0-0, policy_version 578108 (0.00089) [2022-07-10 05:01:46,758][26022] Updated weights on worker 0-0, policy_version 578118 (0.00096) [2022-07-10 05:01:47,989][25689] Fps is (10 sec: 5678.8, 60 sec: 5599.5, 300 sec: 5629.8). Total num frames: 591998976. Throughput: 0: 5060.7. Samples: 591995038. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:47,989][25689] Avg episode reward: [(0, '-23.737')] [2022-07-10 05:01:48,669][26022] Updated weights on worker 0-0, policy_version 578128 (0.00088) [2022-07-10 05:01:50,425][26022] Updated weights on worker 0-0, policy_version 578138 (0.00084) [2022-07-10 05:01:52,082][26022] Updated weights on worker 0-0, policy_version 578148 (0.00088) [2022-07-10 05:01:52,994][25689] Fps is (10 sec: 5712.0, 60 sec: 5616.2, 300 sec: 5633.4). Total num frames: 592028672. Throughput: 0: 5917.1. Samples: 592029230. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:52,994][25689] Avg episode reward: [(0, '-23.763')] [2022-07-10 05:01:54,128][26022] Updated weights on worker 0-0, policy_version 578158 (0.00080) [2022-07-10 05:01:55,758][26022] Updated weights on worker 0-0, policy_version 578168 (0.00052) [2022-07-10 05:01:57,633][26022] Updated weights on worker 0-0, policy_version 578178 (0.00085) [2022-07-10 05:01:58,077][25689] Fps is (10 sec: 5785.5, 60 sec: 5644.1, 300 sec: 5634.3). Total num frames: 592057344. Throughput: 0: 5929.9. Samples: 592063556. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:01:58,078][25689] Avg episode reward: [(0, '-24.108')] [2022-07-10 05:01:59,324][26022] Updated weights on worker 0-0, policy_version 578188 (0.00089) [2022-07-10 05:02:01,087][26022] Updated weights on worker 0-0, policy_version 578198 (0.00078) [2022-07-10 05:02:02,045][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:02:02,058][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000578201_592077824.pth [2022-07-10 05:02:02,058][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000576216_590045184.pth [2022-07-10 05:02:03,175][25689] Fps is (10 sec: 5431.0, 60 sec: 5612.7, 300 sec: 5636.3). Total num frames: 592083968. Throughput: 0: 5079.6. Samples: 592080712. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:03,176][25689] Avg episode reward: [(0, '-24.423')] [2022-07-10 05:02:03,289][26022] Updated weights on worker 0-0, policy_version 578208 (0.00084) [2022-07-10 05:02:05,069][26022] Updated weights on worker 0-0, policy_version 578218 (0.00086) [2022-07-10 05:02:06,980][26022] Updated weights on worker 0-0, policy_version 578228 (0.00093) [2022-07-10 05:02:08,187][25689] Fps is (10 sec: 5368.3, 60 sec: 5630.2, 300 sec: 5629.6). Total num frames: 592111616. Throughput: 0: 5811.5. Samples: 592112420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:08,187][25689] Avg episode reward: [(0, '-24.610')] [2022-07-10 05:02:08,686][26022] Updated weights on worker 0-0, policy_version 578238 (0.00091) [2022-07-10 05:02:10,592][26022] Updated weights on worker 0-0, policy_version 578248 (0.00091) [2022-07-10 05:02:12,308][26022] Updated weights on worker 0-0, policy_version 578258 (0.00084) [2022-07-10 05:02:13,227][25689] Fps is (10 sec: 5603.1, 60 sec: 5663.2, 300 sec: 5630.3). Total num frames: 592140288. Throughput: 0: 5811.1. Samples: 592146806. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:13,227][25689] Avg episode reward: [(0, '-23.949')] [2022-07-10 05:02:14,120][26022] Updated weights on worker 0-0, policy_version 578268 (0.00096) [2022-07-10 05:02:15,898][26022] Updated weights on worker 0-0, policy_version 578278 (0.00089) [2022-07-10 05:02:17,707][26022] Updated weights on worker 0-0, policy_version 578288 (0.00094) [2022-07-10 05:02:18,287][25689] Fps is (10 sec: 5677.5, 60 sec: 5628.6, 300 sec: 5630.5). Total num frames: 592168960. Throughput: 0: 4963.0. Samples: 592163858. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:18,288][25689] Avg episode reward: [(0, '-24.488')] [2022-07-10 05:02:19,719][26022] Updated weights on worker 0-0, policy_version 578298 (0.00091) [2022-07-10 05:02:21,423][26022] Updated weights on worker 0-0, policy_version 578308 (0.00084) [2022-07-10 05:02:23,240][26022] Updated weights on worker 0-0, policy_version 578318 (0.00086) [2022-07-10 05:02:23,357][25689] Fps is (10 sec: 5762.0, 60 sec: 5663.2, 300 sec: 5634.5). Total num frames: 592198656. Throughput: 0: 5797.0. Samples: 592197704. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:23,357][25689] Avg episode reward: [(0, '-24.895')] [2022-07-10 05:02:25,111][26022] Updated weights on worker 0-0, policy_version 578328 (0.00101) [2022-07-10 05:02:26,774][26022] Updated weights on worker 0-0, policy_version 578338 (0.00090) [2022-07-10 05:02:28,401][25689] Fps is (10 sec: 5568.4, 60 sec: 5610.0, 300 sec: 5624.6). Total num frames: 592225280. Throughput: 0: 5905.4. Samples: 592231794. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:28,402][25689] Avg episode reward: [(0, '-24.973')] [2022-07-10 05:02:28,741][26022] Updated weights on worker 0-0, policy_version 578348 (0.00084) [2022-07-10 05:02:30,306][26022] Updated weights on worker 0-0, policy_version 578358 (0.00084) [2022-07-10 05:02:32,251][26022] Updated weights on worker 0-0, policy_version 578368 (0.00087) [2022-07-10 05:02:33,430][25689] Fps is (10 sec: 5590.9, 60 sec: 5644.7, 300 sec: 5628.1). Total num frames: 592254976. Throughput: 0: 5048.4. Samples: 592248802. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:33,431][25689] Avg episode reward: [(0, '-25.883')] [2022-07-10 05:02:34,068][26022] Updated weights on worker 0-0, policy_version 578378 (0.00084) [2022-07-10 05:02:35,875][26022] Updated weights on worker 0-0, policy_version 578388 (0.00099) [2022-07-10 05:02:37,778][26022] Updated weights on worker 0-0, policy_version 578398 (0.00092) [2022-07-10 05:02:38,436][25689] Fps is (10 sec: 5612.5, 60 sec: 5615.1, 300 sec: 5622.4). Total num frames: 592281600. Throughput: 0: 5917.5. Samples: 592283090. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:38,436][25689] Avg episode reward: [(0, '-26.948')] [2022-07-10 05:02:39,478][26022] Updated weights on worker 0-0, policy_version 578408 (0.00087) [2022-07-10 05:02:41,277][26022] Updated weights on worker 0-0, policy_version 578418 (0.00091) [2022-07-10 05:02:43,187][26022] Updated weights on worker 0-0, policy_version 578428 (0.00092) [2022-07-10 05:02:43,544][25689] Fps is (10 sec: 5568.8, 60 sec: 5611.0, 300 sec: 5628.1). Total num frames: 592311296. Throughput: 0: 5897.9. Samples: 592316764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:43,544][25689] Avg episode reward: [(0, '-27.198')] [2022-07-10 05:02:45,142][26022] Updated weights on worker 0-0, policy_version 578438 (0.00074) [2022-07-10 05:02:46,659][26022] Updated weights on worker 0-0, policy_version 578448 (0.00092) [2022-07-10 05:02:48,563][25689] Fps is (10 sec: 5763.7, 60 sec: 5629.3, 300 sec: 5621.9). Total num frames: 592339968. Throughput: 0: 5055.6. Samples: 592333722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:48,564][25689] Avg episode reward: [(0, '-25.819')] [2022-07-10 05:02:48,593][26022] Updated weights on worker 0-0, policy_version 578458 (0.00084) [2022-07-10 05:02:50,519][26022] Updated weights on worker 0-0, policy_version 578468 (0.00096) [2022-07-10 05:02:52,319][26022] Updated weights on worker 0-0, policy_version 578478 (0.00093) [2022-07-10 05:02:53,577][25689] Fps is (10 sec: 5613.2, 60 sec: 5594.7, 300 sec: 5622.7). Total num frames: 592367616. Throughput: 0: 5911.6. Samples: 592367902. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:53,578][25689] Avg episode reward: [(0, '-25.476')] [2022-07-10 05:02:54,067][26022] Updated weights on worker 0-0, policy_version 578488 (0.00089) [2022-07-10 05:02:55,652][26022] Updated weights on worker 0-0, policy_version 578498 (0.00092) [2022-07-10 05:02:57,548][26022] Updated weights on worker 0-0, policy_version 578508 (0.00092) [2022-07-10 05:02:58,592][25689] Fps is (10 sec: 5819.7, 60 sec: 5634.8, 300 sec: 5627.7). Total num frames: 592398336. Throughput: 0: 5903.3. Samples: 592402078. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:02:58,594][25689] Avg episode reward: [(0, '-26.455')] [2022-07-10 05:02:59,331][26022] Updated weights on worker 0-0, policy_version 578518 (0.00098) [2022-07-10 05:03:01,316][26022] Updated weights on worker 0-0, policy_version 578528 (0.00082) [2022-07-10 05:03:03,486][26022] Updated weights on worker 0-0, policy_version 578538 (0.00096) [2022-07-10 05:03:03,655][25689] Fps is (10 sec: 5588.3, 60 sec: 5621.2, 300 sec: 5623.2). Total num frames: 592423936. Throughput: 0: 5088.3. Samples: 592419098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:03:03,656][25689] Avg episode reward: [(0, '-24.437')] [2022-07-10 05:03:05,282][26022] Updated weights on worker 0-0, policy_version 578548 (0.00092) [2022-07-10 05:03:06,882][26022] Updated weights on worker 0-0, policy_version 578558 (0.00092) [2022-07-10 05:03:08,736][25689] Fps is (10 sec: 5249.3, 60 sec: 5614.8, 300 sec: 5622.2). Total num frames: 592451584. Throughput: 0: 5797.7. Samples: 592450680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:03:08,736][25689] Avg episode reward: [(0, '-24.710')] [2022-07-10 05:03:08,890][26022] Updated weights on worker 0-0, policy_version 578568 (0.00098) [2022-07-10 05:03:10,566][26022] Updated weights on worker 0-0, policy_version 578578 (0.00086) [2022-07-10 05:03:12,470][26022] Updated weights on worker 0-0, policy_version 578588 (0.00091) [2022-07-10 05:03:13,778][25689] Fps is (10 sec: 5665.1, 60 sec: 5631.5, 300 sec: 5625.1). Total num frames: 592481280. Throughput: 0: 5797.5. Samples: 592485014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:03:13,779][25689] Avg episode reward: [(0, '-25.038')] [2022-07-10 05:03:14,207][26022] Updated weights on worker 0-0, policy_version 578598 (0.00091) [2022-07-10 05:03:16,193][26022] Updated weights on worker 0-0, policy_version 578608 (0.00085) [2022-07-10 05:03:17,808][26022] Updated weights on worker 0-0, policy_version 578618 (0.00080) [2022-07-10 05:03:18,793][25689] Fps is (10 sec: 5702.0, 60 sec: 5618.8, 300 sec: 5616.3). Total num frames: 592508928. Throughput: 0: 5791.0. Samples: 592519060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:03:18,793][25689] Avg episode reward: [(0, '-24.847')] [2022-07-10 05:03:19,914][26022] Updated weights on worker 0-0, policy_version 578628 (0.00092) [2022-07-10 05:03:21,403][26022] Updated weights on worker 0-0, policy_version 578638 (0.00091) [2022-07-10 05:03:23,505][26022] Updated weights on worker 0-0, policy_version 578648 (0.00093) [2022-07-10 05:03:23,907][25689] Fps is (10 sec: 5560.1, 60 sec: 5597.7, 300 sec: 5621.8). Total num frames: 592537600. Throughput: 0: 5765.7. Samples: 592535864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:03:23,908][25689] Avg episode reward: [(0, '-23.267')] [2022-07-10 05:03:24,984][26022] Updated weights on worker 0-0, policy_version 578658 (0.00096) [2022-07-10 05:03:27,046][26022] Updated weights on worker 0-0, policy_version 578668 (0.00087) [2022-07-10 05:03:28,655][26022] Updated weights on worker 0-0, policy_version 578678 (0.00088) [2022-07-10 05:03:28,993][25689] Fps is (10 sec: 5722.5, 60 sec: 5644.6, 300 sec: 5624.1). Total num frames: 592567296. Throughput: 0: 5890.2. Samples: 592569996. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 05:03:28,993][25689] Avg episode reward: [(0, '-23.425')] [2022-07-10 05:03:30,658][26022] Updated weights on worker 0-0, policy_version 578688 (0.00085) [2022-07-10 05:03:32,536][26022] Updated weights on worker 0-0, policy_version 578698 (0.00084) [2022-07-10 05:03:34,060][25689] Fps is (10 sec: 5749.0, 60 sec: 5624.2, 300 sec: 5623.4). Total num frames: 592595968. Throughput: 0: 5859.9. Samples: 592603864. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:03:34,060][25689] Avg episode reward: [(0, '-25.123')] [2022-07-10 05:03:34,305][26022] Updated weights on worker 0-0, policy_version 578708 (0.00086) [2022-07-10 05:03:36,185][26022] Updated weights on worker 0-0, policy_version 578718 (0.00086) [2022-07-10 05:03:37,825][26022] Updated weights on worker 0-0, policy_version 578728 (0.00093) [2022-07-10 05:03:39,120][25689] Fps is (10 sec: 5561.4, 60 sec: 5636.1, 300 sec: 5617.6). Total num frames: 592623616. Throughput: 0: 5015.0. Samples: 592621000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:03:39,120][25689] Avg episode reward: [(0, '-26.934')] [2022-07-10 05:03:39,767][26022] Updated weights on worker 0-0, policy_version 578738 (0.00080) [2022-07-10 05:03:41,608][26022] Updated weights on worker 0-0, policy_version 578748 (0.00090) [2022-07-10 05:03:43,262][26022] Updated weights on worker 0-0, policy_version 578758 (0.00092) [2022-07-10 05:03:44,225][25689] Fps is (10 sec: 5540.4, 60 sec: 5619.4, 300 sec: 5619.8). Total num frames: 592652288. Throughput: 0: 5857.0. Samples: 592654868. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:03:44,226][25689] Avg episode reward: [(0, '-27.649')] [2022-07-10 05:03:45,098][26022] Updated weights on worker 0-0, policy_version 578768 (0.00082) [2022-07-10 05:03:46,876][26022] Updated weights on worker 0-0, policy_version 578778 (0.00091) [2022-07-10 05:03:48,699][26022] Updated weights on worker 0-0, policy_version 578788 (0.00086) [2022-07-10 05:03:49,245][25689] Fps is (10 sec: 5764.3, 60 sec: 5636.1, 300 sec: 5626.5). Total num frames: 592681984. Throughput: 0: 5884.9. Samples: 592689182. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:03:49,246][25689] Avg episode reward: [(0, '-29.086')] [2022-07-10 05:03:50,573][26022] Updated weights on worker 0-0, policy_version 578798 (0.00084) [2022-07-10 05:03:52,451][26022] Updated weights on worker 0-0, policy_version 578808 (0.00092) [2022-07-10 05:03:54,064][26022] Updated weights on worker 0-0, policy_version 578818 (0.00091) [2022-07-10 05:03:54,294][25689] Fps is (10 sec: 5797.2, 60 sec: 5649.9, 300 sec: 5622.6). Total num frames: 592710656. Throughput: 0: 5038.4. Samples: 592705810. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:03:54,294][25689] Avg episode reward: [(0, '-29.482')] [2022-07-10 05:03:56,045][26022] Updated weights on worker 0-0, policy_version 578828 (0.00483) [2022-07-10 05:03:57,568][26022] Updated weights on worker 0-0, policy_version 578838 (0.00080) [2022-07-10 05:03:59,322][25689] Fps is (10 sec: 5487.6, 60 sec: 5581.2, 300 sec: 5616.1). Total num frames: 592737280. Throughput: 0: 5888.8. Samples: 592739966. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:03:59,322][25689] Avg episode reward: [(0, '-28.846')] [2022-07-10 05:03:59,669][26022] Updated weights on worker 0-0, policy_version 578848 (0.00105) [2022-07-10 05:04:01,369][26022] Updated weights on worker 0-0, policy_version 578858 (0.00089) [2022-07-10 05:04:02,130][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:04:02,145][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000578860_592752640.pth [2022-07-10 05:04:02,145][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000576880_590725120.pth [2022-07-10 05:04:03,587][26022] Updated weights on worker 0-0, policy_version 578868 (0.00087) [2022-07-10 05:04:04,408][25689] Fps is (10 sec: 5467.2, 60 sec: 5629.7, 300 sec: 5621.8). Total num frames: 592765952. Throughput: 0: 5804.7. Samples: 592772022. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:04,408][25689] Avg episode reward: [(0, '-28.467')] [2022-07-10 05:04:05,458][26022] Updated weights on worker 0-0, policy_version 578878 (0.00081) [2022-07-10 05:04:07,244][26022] Updated weights on worker 0-0, policy_version 578888 (0.00092) [2022-07-10 05:04:09,181][26022] Updated weights on worker 0-0, policy_version 578898 (0.01276) [2022-07-10 05:04:09,429][25689] Fps is (10 sec: 5471.1, 60 sec: 5618.3, 300 sec: 5616.1). Total num frames: 592792576. Throughput: 0: 4946.4. Samples: 592789016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:09,429][25689] Avg episode reward: [(0, '-26.637')] [2022-07-10 05:04:10,812][26022] Updated weights on worker 0-0, policy_version 578908 (0.00089) [2022-07-10 05:04:12,596][26022] Updated weights on worker 0-0, policy_version 578918 (0.00089) [2022-07-10 05:04:14,514][25689] Fps is (10 sec: 5471.5, 60 sec: 5597.5, 300 sec: 5614.8). Total num frames: 592821248. Throughput: 0: 5794.3. Samples: 592822972. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:14,514][25689] Avg episode reward: [(0, '-25.394')] [2022-07-10 05:04:14,540][26022] Updated weights on worker 0-0, policy_version 578928 (0.00095) [2022-07-10 05:04:16,162][26022] Updated weights on worker 0-0, policy_version 578938 (0.00085) [2022-07-10 05:04:17,989][26022] Updated weights on worker 0-0, policy_version 578948 (0.00088) [2022-07-10 05:04:19,566][25689] Fps is (10 sec: 5757.5, 60 sec: 5627.7, 300 sec: 5618.0). Total num frames: 592850944. Throughput: 0: 5790.4. Samples: 592857190. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:19,567][25689] Avg episode reward: [(0, '-25.006')] [2022-07-10 05:04:20,014][26022] Updated weights on worker 0-0, policy_version 578958 (0.00088) [2022-07-10 05:04:21,756][26022] Updated weights on worker 0-0, policy_version 578968 (0.00093) [2022-07-10 05:04:23,361][26022] Updated weights on worker 0-0, policy_version 578978 (0.00093) [2022-07-10 05:04:24,653][25689] Fps is (10 sec: 5958.6, 60 sec: 5664.0, 300 sec: 5624.6). Total num frames: 592881664. Throughput: 0: 5042.9. Samples: 592874118. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:24,654][25689] Avg episode reward: [(0, '-23.720')] [2022-07-10 05:04:25,349][26022] Updated weights on worker 0-0, policy_version 578988 (0.00108) [2022-07-10 05:04:26,989][26022] Updated weights on worker 0-0, policy_version 578998 (0.00085) [2022-07-10 05:04:28,915][26022] Updated weights on worker 0-0, policy_version 579008 (0.00086) [2022-07-10 05:04:29,684][25689] Fps is (10 sec: 5667.6, 60 sec: 5618.4, 300 sec: 5621.4). Total num frames: 592908288. Throughput: 0: 5904.1. Samples: 592908606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:29,686][25689] Avg episode reward: [(0, '-23.093')] [2022-07-10 05:04:30,428][26022] Updated weights on worker 0-0, policy_version 579018 (0.00110) [2022-07-10 05:04:32,707][26022] Updated weights on worker 0-0, policy_version 579028 (0.00086) [2022-07-10 05:04:34,050][26022] Updated weights on worker 0-0, policy_version 579038 (0.00098) [2022-07-10 05:04:34,715][25689] Fps is (10 sec: 5495.7, 60 sec: 5621.8, 300 sec: 5624.5). Total num frames: 592936960. Throughput: 0: 5935.7. Samples: 592942880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:34,717][25689] Avg episode reward: [(0, '-22.763')] [2022-07-10 05:04:36,107][26022] Updated weights on worker 0-0, policy_version 579048 (0.00087) [2022-07-10 05:04:37,660][26022] Updated weights on worker 0-0, policy_version 579058 (0.00085) [2022-07-10 05:04:39,695][26022] Updated weights on worker 0-0, policy_version 579068 (0.00096) [2022-07-10 05:04:39,792][25689] Fps is (10 sec: 5774.7, 60 sec: 5654.0, 300 sec: 5631.0). Total num frames: 592966656. Throughput: 0: 5090.1. Samples: 592960138. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:39,792][25689] Avg episode reward: [(0, '-23.395')] [2022-07-10 05:04:41,447][26022] Updated weights on worker 0-0, policy_version 579078 (0.00088) [2022-07-10 05:04:43,278][26022] Updated weights on worker 0-0, policy_version 579088 (0.00085) [2022-07-10 05:04:44,899][25689] Fps is (10 sec: 5731.3, 60 sec: 5653.8, 300 sec: 5629.8). Total num frames: 592995328. Throughput: 0: 5929.4. Samples: 592994162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:44,901][25689] Avg episode reward: [(0, '-23.774')] [2022-07-10 05:04:44,907][26022] Updated weights on worker 0-0, policy_version 579098 (0.00091) [2022-07-10 05:04:46,951][26022] Updated weights on worker 0-0, policy_version 579108 (0.00093) [2022-07-10 05:04:48,814][26022] Updated weights on worker 0-0, policy_version 579118 (0.00089) [2022-07-10 05:04:49,952][25689] Fps is (10 sec: 5643.9, 60 sec: 5633.9, 300 sec: 5628.8). Total num frames: 593024000. Throughput: 0: 5894.8. Samples: 593028080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:49,953][25689] Avg episode reward: [(0, '-23.578')] [2022-07-10 05:04:50,369][26022] Updated weights on worker 0-0, policy_version 579128 (0.00085) [2022-07-10 05:04:52,480][26022] Updated weights on worker 0-0, policy_version 579138 (0.00089) [2022-07-10 05:04:53,766][26022] Updated weights on worker 0-0, policy_version 579148 (0.00052) [2022-07-10 05:04:54,960][25689] Fps is (10 sec: 5598.1, 60 sec: 5620.8, 300 sec: 5622.4). Total num frames: 593051648. Throughput: 0: 5047.8. Samples: 593045070. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:04:54,961][25689] Avg episode reward: [(0, '-23.958')] [2022-07-10 05:04:56,121][26022] Updated weights on worker 0-0, policy_version 579158 (0.00087) [2022-07-10 05:04:57,441][26022] Updated weights on worker 0-0, policy_version 579168 (0.00060) [2022-07-10 05:04:59,568][26022] Updated weights on worker 0-0, policy_version 579178 (0.00089) [2022-07-10 05:05:00,033][25689] Fps is (10 sec: 5688.6, 60 sec: 5667.3, 300 sec: 5633.4). Total num frames: 593081344. Throughput: 0: 5877.2. Samples: 593079098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:00,033][25689] Avg episode reward: [(0, '-24.670')] [2022-07-10 05:05:01,365][26022] Updated weights on worker 0-0, policy_version 579188 (0.00085) [2022-07-10 05:05:03,447][26022] Updated weights on worker 0-0, policy_version 579198 (0.00085) [2022-07-10 05:05:05,092][25689] Fps is (10 sec: 5558.1, 60 sec: 5635.9, 300 sec: 5626.5). Total num frames: 593107968. Throughput: 0: 5786.3. Samples: 593111008. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:05,093][25689] Avg episode reward: [(0, '-25.200')] [2022-07-10 05:05:05,462][26022] Updated weights on worker 0-0, policy_version 579208 (0.00091) [2022-07-10 05:05:07,172][26022] Updated weights on worker 0-0, policy_version 579218 (0.00090) [2022-07-10 05:05:08,915][26022] Updated weights on worker 0-0, policy_version 579228 (0.00093) [2022-07-10 05:05:10,127][25689] Fps is (10 sec: 5477.9, 60 sec: 5668.4, 300 sec: 5622.7). Total num frames: 593136640. Throughput: 0: 5803.4. Samples: 593145162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:10,127][25689] Avg episode reward: [(0, '-25.396')] [2022-07-10 05:05:10,947][26022] Updated weights on worker 0-0, policy_version 579238 (0.00089) [2022-07-10 05:05:12,313][26022] Updated weights on worker 0-0, policy_version 579248 (0.00088) [2022-07-10 05:05:14,435][26022] Updated weights on worker 0-0, policy_version 579258 (0.00083) [2022-07-10 05:05:15,172][25689] Fps is (10 sec: 5587.3, 60 sec: 5655.3, 300 sec: 5625.6). Total num frames: 593164288. Throughput: 0: 5797.7. Samples: 593162258. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:15,174][25689] Avg episode reward: [(0, '-24.719')] [2022-07-10 05:05:16,127][26022] Updated weights on worker 0-0, policy_version 579268 (0.00090) [2022-07-10 05:05:17,951][26022] Updated weights on worker 0-0, policy_version 579278 (0.00088) [2022-07-10 05:05:19,864][26022] Updated weights on worker 0-0, policy_version 579288 (0.00083) [2022-07-10 05:05:20,198][25689] Fps is (10 sec: 5490.4, 60 sec: 5624.0, 300 sec: 5616.1). Total num frames: 593191936. Throughput: 0: 5816.3. Samples: 593196388. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:20,200][25689] Avg episode reward: [(0, '-24.162')] [2022-07-10 05:05:21,483][26022] Updated weights on worker 0-0, policy_version 579298 (0.00097) [2022-07-10 05:05:23,313][26022] Updated weights on worker 0-0, policy_version 579308 (0.00092) [2022-07-10 05:05:24,963][26022] Updated weights on worker 0-0, policy_version 579318 (0.00090) [2022-07-10 05:05:25,248][25689] Fps is (10 sec: 5792.8, 60 sec: 5627.4, 300 sec: 5629.0). Total num frames: 593222656. Throughput: 0: 5934.8. Samples: 593230630. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:25,249][25689] Avg episode reward: [(0, '-23.933')] [2022-07-10 05:05:26,864][26022] Updated weights on worker 0-0, policy_version 579328 (0.00089) [2022-07-10 05:05:28,989][26022] Updated weights on worker 0-0, policy_version 579338 (0.00085) [2022-07-10 05:05:30,272][25689] Fps is (10 sec: 5895.7, 60 sec: 5661.9, 300 sec: 5626.6). Total num frames: 593251328. Throughput: 0: 5080.1. Samples: 593247502. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:30,274][25689] Avg episode reward: [(0, '-24.331')] [2022-07-10 05:05:30,584][26022] Updated weights on worker 0-0, policy_version 579348 (0.00095) [2022-07-10 05:05:32,365][26022] Updated weights on worker 0-0, policy_version 579358 (0.00084) [2022-07-10 05:05:34,178][26022] Updated weights on worker 0-0, policy_version 579368 (0.00086) [2022-07-10 05:05:35,283][25689] Fps is (10 sec: 5510.7, 60 sec: 5629.9, 300 sec: 5623.8). Total num frames: 593277952. Throughput: 0: 5944.6. Samples: 593281806. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:35,283][25689] Avg episode reward: [(0, '-23.526')] [2022-07-10 05:05:35,932][26022] Updated weights on worker 0-0, policy_version 579378 (0.00086) [2022-07-10 05:05:37,839][26022] Updated weights on worker 0-0, policy_version 579388 (0.00087) [2022-07-10 05:05:39,653][26022] Updated weights on worker 0-0, policy_version 579398 (0.00094) [2022-07-10 05:05:40,301][25689] Fps is (10 sec: 5513.9, 60 sec: 5618.5, 300 sec: 5624.2). Total num frames: 593306624. Throughput: 0: 5914.5. Samples: 593315282. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:40,302][25689] Avg episode reward: [(0, '-23.569')] [2022-07-10 05:05:41,398][26022] Updated weights on worker 0-0, policy_version 579408 (0.00087) [2022-07-10 05:05:43,348][26022] Updated weights on worker 0-0, policy_version 579418 (0.00103) [2022-07-10 05:05:44,950][26022] Updated weights on worker 0-0, policy_version 579428 (0.00090) [2022-07-10 05:05:45,369][25689] Fps is (10 sec: 5685.2, 60 sec: 5622.1, 300 sec: 5623.4). Total num frames: 593335296. Throughput: 0: 5053.6. Samples: 593332312. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:45,370][25689] Avg episode reward: [(0, '-23.804')] [2022-07-10 05:05:46,921][26022] Updated weights on worker 0-0, policy_version 579438 (0.00085) [2022-07-10 05:05:48,612][26022] Updated weights on worker 0-0, policy_version 579448 (0.00085) [2022-07-10 05:05:50,373][25689] Fps is (10 sec: 5693.1, 60 sec: 5626.6, 300 sec: 5623.3). Total num frames: 593363968. Throughput: 0: 5922.0. Samples: 593366540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:50,374][25689] Avg episode reward: [(0, '-25.054')] [2022-07-10 05:05:50,575][26022] Updated weights on worker 0-0, policy_version 579458 (0.00100) [2022-07-10 05:05:52,170][26022] Updated weights on worker 0-0, policy_version 579468 (0.00108) [2022-07-10 05:05:54,136][26022] Updated weights on worker 0-0, policy_version 579478 (0.00104) [2022-07-10 05:05:55,382][25689] Fps is (10 sec: 5726.9, 60 sec: 5643.4, 300 sec: 5630.4). Total num frames: 593392640. Throughput: 0: 5915.8. Samples: 593400712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:05:55,383][25689] Avg episode reward: [(0, '-23.974')] [2022-07-10 05:05:55,804][26022] Updated weights on worker 0-0, policy_version 579488 (0.00085) [2022-07-10 05:05:57,724][26022] Updated weights on worker 0-0, policy_version 579498 (0.00087) [2022-07-10 05:05:59,415][26022] Updated weights on worker 0-0, policy_version 579508 (0.00094) [2022-07-10 05:06:00,391][25689] Fps is (10 sec: 5724.3, 60 sec: 5632.5, 300 sec: 5632.6). Total num frames: 593421312. Throughput: 0: 5101.5. Samples: 593417772. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:00,391][25689] Avg episode reward: [(0, '-23.653')] [2022-07-10 05:06:01,318][26022] Updated weights on worker 0-0, policy_version 579518 (0.00093) [2022-07-10 05:06:02,181][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:06:02,196][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000579521_593429504.pth [2022-07-10 05:06:02,197][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000577541_591401984.pth [2022-07-10 05:06:03,435][26022] Updated weights on worker 0-0, policy_version 579528 (0.00456) [2022-07-10 05:06:05,330][26022] Updated weights on worker 0-0, policy_version 579538 (0.00085) [2022-07-10 05:06:05,463][25689] Fps is (10 sec: 5485.2, 60 sec: 5631.3, 300 sec: 5631.6). Total num frames: 593447936. Throughput: 0: 5844.0. Samples: 593449742. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:05,465][25689] Avg episode reward: [(0, '-24.295')] [2022-07-10 05:06:07,192][26022] Updated weights on worker 0-0, policy_version 579548 (0.00088) [2022-07-10 05:06:08,880][26022] Updated weights on worker 0-0, policy_version 579558 (0.00088) [2022-07-10 05:06:10,519][25689] Fps is (10 sec: 5358.5, 60 sec: 5612.4, 300 sec: 5634.5). Total num frames: 593475584. Throughput: 0: 5814.5. Samples: 593483678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:10,520][25689] Avg episode reward: [(0, '-24.317')] [2022-07-10 05:06:10,999][26022] Updated weights on worker 0-0, policy_version 579568 (0.00085) [2022-07-10 05:06:12,484][26022] Updated weights on worker 0-0, policy_version 579578 (0.00089) [2022-07-10 05:06:14,698][26022] Updated weights on worker 0-0, policy_version 579588 (0.00090) [2022-07-10 05:06:15,535][25689] Fps is (10 sec: 5592.2, 60 sec: 5632.1, 300 sec: 5628.4). Total num frames: 593504256. Throughput: 0: 4943.5. Samples: 593500334. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:15,535][25689] Avg episode reward: [(0, '-25.044')] [2022-07-10 05:06:16,018][26022] Updated weights on worker 0-0, policy_version 579598 (0.00088) [2022-07-10 05:06:17,999][26022] Updated weights on worker 0-0, policy_version 579608 (0.00093) [2022-07-10 05:06:19,838][26022] Updated weights on worker 0-0, policy_version 579618 (0.00095) [2022-07-10 05:06:20,559][25689] Fps is (10 sec: 5711.8, 60 sec: 5649.3, 300 sec: 5632.8). Total num frames: 593532928. Throughput: 0: 5794.7. Samples: 593534638. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:20,560][25689] Avg episode reward: [(0, '-23.817')] [2022-07-10 05:06:21,599][26022] Updated weights on worker 0-0, policy_version 579628 (0.00094) [2022-07-10 05:06:23,435][26022] Updated weights on worker 0-0, policy_version 579638 (0.00092) [2022-07-10 05:06:25,228][26022] Updated weights on worker 0-0, policy_version 579648 (0.00085) [2022-07-10 05:06:25,606][25689] Fps is (10 sec: 5592.0, 60 sec: 5598.6, 300 sec: 5625.4). Total num frames: 593560576. Throughput: 0: 5908.4. Samples: 593568754. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:25,607][25689] Avg episode reward: [(0, '-23.950')] [2022-07-10 05:06:27,000][26022] Updated weights on worker 0-0, policy_version 579658 (0.00092) [2022-07-10 05:06:28,885][26022] Updated weights on worker 0-0, policy_version 579668 (0.00085) [2022-07-10 05:06:30,508][26022] Updated weights on worker 0-0, policy_version 579678 (0.00052) [2022-07-10 05:06:30,611][25689] Fps is (10 sec: 5704.5, 60 sec: 5617.3, 300 sec: 5632.9). Total num frames: 593590272. Throughput: 0: 5075.7. Samples: 593585658. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 05:06:30,613][25689] Avg episode reward: [(0, '-24.125')] [2022-07-10 05:06:32,495][26022] Updated weights on worker 0-0, policy_version 579688 (0.00090) [2022-07-10 05:06:34,265][26022] Updated weights on worker 0-0, policy_version 579698 (0.00081) [2022-07-10 05:06:35,630][25689] Fps is (10 sec: 5720.8, 60 sec: 5633.5, 300 sec: 5630.1). Total num frames: 593617920. Throughput: 0: 5945.4. Samples: 593619810. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:06:35,631][25689] Avg episode reward: [(0, '-23.659')] [2022-07-10 05:06:36,091][26022] Updated weights on worker 0-0, policy_version 579708 (0.00100) [2022-07-10 05:06:38,027][26022] Updated weights on worker 0-0, policy_version 579718 (0.00094) [2022-07-10 05:06:39,671][26022] Updated weights on worker 0-0, policy_version 579728 (0.00100) [2022-07-10 05:06:40,638][25689] Fps is (10 sec: 5616.7, 60 sec: 5634.4, 300 sec: 5627.7). Total num frames: 593646592. Throughput: 0: 5932.0. Samples: 593653750. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:06:40,639][25689] Avg episode reward: [(0, '-23.582')] [2022-07-10 05:06:41,553][26022] Updated weights on worker 0-0, policy_version 579738 (0.00085) [2022-07-10 05:06:43,339][26022] Updated weights on worker 0-0, policy_version 579748 (0.00094) [2022-07-10 05:06:45,079][26022] Updated weights on worker 0-0, policy_version 579758 (0.00093) [2022-07-10 05:06:45,704][25689] Fps is (10 sec: 5692.1, 60 sec: 5634.7, 300 sec: 5630.5). Total num frames: 593675264. Throughput: 0: 5061.4. Samples: 593670478. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:06:45,705][25689] Avg episode reward: [(0, '-24.003')] [2022-07-10 05:06:46,938][26022] Updated weights on worker 0-0, policy_version 579768 (0.00093) [2022-07-10 05:06:48,698][26022] Updated weights on worker 0-0, policy_version 579778 (0.00085) [2022-07-10 05:06:50,629][26022] Updated weights on worker 0-0, policy_version 579788 (0.00094) [2022-07-10 05:06:50,716][25689] Fps is (10 sec: 5588.5, 60 sec: 5617.0, 300 sec: 5623.5). Total num frames: 593702912. Throughput: 0: 5916.2. Samples: 593704604. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:06:50,717][25689] Avg episode reward: [(0, '-23.395')] [2022-07-10 05:06:52,308][26022] Updated weights on worker 0-0, policy_version 579798 (0.00086) [2022-07-10 05:06:54,193][26022] Updated weights on worker 0-0, policy_version 579808 (0.00086) [2022-07-10 05:06:55,731][25689] Fps is (10 sec: 5719.3, 60 sec: 5633.4, 300 sec: 5628.2). Total num frames: 593732608. Throughput: 0: 5917.0. Samples: 593738746. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:06:55,732][25689] Avg episode reward: [(0, '-23.396')] [2022-07-10 05:06:56,022][26022] Updated weights on worker 0-0, policy_version 579818 (0.00093) [2022-07-10 05:06:57,797][26022] Updated weights on worker 0-0, policy_version 579828 (0.00097) [2022-07-10 05:06:59,595][26022] Updated weights on worker 0-0, policy_version 579838 (0.00089) [2022-07-10 05:07:00,749][25689] Fps is (10 sec: 5817.6, 60 sec: 5632.5, 300 sec: 5636.6). Total num frames: 593761280. Throughput: 0: 5082.8. Samples: 593755968. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:00,749][25689] Avg episode reward: [(0, '-24.186')] [2022-07-10 05:07:01,643][26022] Updated weights on worker 0-0, policy_version 579848 (0.00098) [2022-07-10 05:07:03,455][26022] Updated weights on worker 0-0, policy_version 579858 (0.00079) [2022-07-10 05:07:05,459][26022] Updated weights on worker 0-0, policy_version 579868 (0.00086) [2022-07-10 05:07:05,851][25689] Fps is (10 sec: 5362.4, 60 sec: 5612.8, 300 sec: 5628.0). Total num frames: 593786880. Throughput: 0: 5844.8. Samples: 593788234. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:05,852][25689] Avg episode reward: [(0, '-23.554')] [2022-07-10 05:07:07,005][26022] Updated weights on worker 0-0, policy_version 579878 (0.00336) [2022-07-10 05:07:09,082][26022] Updated weights on worker 0-0, policy_version 579888 (0.00092) [2022-07-10 05:07:10,509][26022] Updated weights on worker 0-0, policy_version 579898 (0.00088) [2022-07-10 05:07:10,873][25689] Fps is (10 sec: 5461.9, 60 sec: 5649.9, 300 sec: 5631.8). Total num frames: 593816576. Throughput: 0: 5848.2. Samples: 593822486. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:10,874][25689] Avg episode reward: [(0, '-23.055')] [2022-07-10 05:07:12,559][26022] Updated weights on worker 0-0, policy_version 579908 (0.00081) [2022-07-10 05:07:14,146][26022] Updated weights on worker 0-0, policy_version 579918 (0.00089) [2022-07-10 05:07:15,907][25689] Fps is (10 sec: 5702.8, 60 sec: 5631.2, 300 sec: 5628.9). Total num frames: 593844224. Throughput: 0: 5000.4. Samples: 593839634. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:15,907][25689] Avg episode reward: [(0, '-23.883')] [2022-07-10 05:07:16,137][26022] Updated weights on worker 0-0, policy_version 579928 (0.00081) [2022-07-10 05:07:17,748][26022] Updated weights on worker 0-0, policy_version 579938 (0.00086) [2022-07-10 05:07:19,698][26022] Updated weights on worker 0-0, policy_version 579948 (0.00081) [2022-07-10 05:07:20,926][25689] Fps is (10 sec: 5704.2, 60 sec: 5648.6, 300 sec: 5629.8). Total num frames: 593873920. Throughput: 0: 5850.4. Samples: 593874010. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:20,926][25689] Avg episode reward: [(0, '-24.171')] [2022-07-10 05:07:21,304][26022] Updated weights on worker 0-0, policy_version 579958 (0.00080) [2022-07-10 05:07:23,201][26022] Updated weights on worker 0-0, policy_version 579968 (0.00089) [2022-07-10 05:07:25,203][26022] Updated weights on worker 0-0, policy_version 579978 (0.00082) [2022-07-10 05:07:26,000][25689] Fps is (10 sec: 5782.7, 60 sec: 5663.1, 300 sec: 5636.1). Total num frames: 593902592. Throughput: 0: 5960.5. Samples: 593908332. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:26,001][25689] Avg episode reward: [(0, '-24.440')] [2022-07-10 05:07:26,718][26022] Updated weights on worker 0-0, policy_version 579988 (0.00092) [2022-07-10 05:07:28,727][26022] Updated weights on worker 0-0, policy_version 579998 (0.00093) [2022-07-10 05:07:30,218][26022] Updated weights on worker 0-0, policy_version 580008 (0.00085) [2022-07-10 05:07:31,023][25689] Fps is (10 sec: 5577.8, 60 sec: 5627.5, 300 sec: 5629.4). Total num frames: 593930240. Throughput: 0: 5107.2. Samples: 593925396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:31,024][25689] Avg episode reward: [(0, '-24.241')] [2022-07-10 05:07:32,198][26022] Updated weights on worker 0-0, policy_version 580018 (0.00086) [2022-07-10 05:07:34,181][26022] Updated weights on worker 0-0, policy_version 580028 (0.00089) [2022-07-10 05:07:35,825][26022] Updated weights on worker 0-0, policy_version 580038 (0.00086) [2022-07-10 05:07:36,055][25689] Fps is (10 sec: 5601.4, 60 sec: 5643.2, 300 sec: 5635.8). Total num frames: 593958912. Throughput: 0: 5945.6. Samples: 593959426. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:36,055][25689] Avg episode reward: [(0, '-24.526')] [2022-07-10 05:07:37,597][26022] Updated weights on worker 0-0, policy_version 580048 (0.00092) [2022-07-10 05:07:39,501][26022] Updated weights on worker 0-0, policy_version 580058 (0.00051) [2022-07-10 05:07:41,071][25689] Fps is (10 sec: 5809.1, 60 sec: 5659.5, 300 sec: 5637.5). Total num frames: 593988608. Throughput: 0: 5939.8. Samples: 593993666. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:41,071][25689] Avg episode reward: [(0, '-23.900')] [2022-07-10 05:07:41,123][26022] Updated weights on worker 0-0, policy_version 580068 (0.00095) [2022-07-10 05:07:43,151][26022] Updated weights on worker 0-0, policy_version 580078 (0.00088) [2022-07-10 05:07:44,846][26022] Updated weights on worker 0-0, policy_version 580088 (0.00085) [2022-07-10 05:07:46,207][25689] Fps is (10 sec: 5648.5, 60 sec: 5636.0, 300 sec: 5631.9). Total num frames: 594016256. Throughput: 0: 5059.0. Samples: 594010558. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:46,208][25689] Avg episode reward: [(0, '-24.365')] [2022-07-10 05:07:46,804][26022] Updated weights on worker 0-0, policy_version 580098 (0.00092) [2022-07-10 05:07:48,431][26022] Updated weights on worker 0-0, policy_version 580108 (0.00088) [2022-07-10 05:07:50,261][26022] Updated weights on worker 0-0, policy_version 580118 (0.00092) [2022-07-10 05:07:51,230][25689] Fps is (10 sec: 5644.4, 60 sec: 5668.8, 300 sec: 5638.6). Total num frames: 594045952. Throughput: 0: 5924.9. Samples: 594045122. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:51,231][25689] Avg episode reward: [(0, '-23.734')] [2022-07-10 05:07:52,012][26022] Updated weights on worker 0-0, policy_version 580128 (0.00089) [2022-07-10 05:07:53,950][26022] Updated weights on worker 0-0, policy_version 580138 (0.00101) [2022-07-10 05:07:55,659][26022] Updated weights on worker 0-0, policy_version 580148 (0.00084) [2022-07-10 05:07:56,295][25689] Fps is (10 sec: 5684.3, 60 sec: 5630.2, 300 sec: 5627.3). Total num frames: 594073600. Throughput: 0: 5918.5. Samples: 594079218. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:07:56,296][25689] Avg episode reward: [(0, '-23.821')] [2022-07-10 05:07:57,495][26022] Updated weights on worker 0-0, policy_version 580158 (0.00084) [2022-07-10 05:07:59,308][26022] Updated weights on worker 0-0, policy_version 580168 (0.00090) [2022-07-10 05:08:00,912][26022] Updated weights on worker 0-0, policy_version 580178 (0.00090) [2022-07-10 05:08:01,359][25689] Fps is (10 sec: 5661.2, 60 sec: 5642.9, 300 sec: 5641.0). Total num frames: 594103296. Throughput: 0: 5056.6. Samples: 594096258. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:01,360][25689] Avg episode reward: [(0, '-23.257')] [2022-07-10 05:08:02,291][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:08:02,310][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000580183_594107392.pth [2022-07-10 05:08:02,310][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000578201_592077824.pth [2022-07-10 05:08:03,291][26022] Updated weights on worker 0-0, policy_version 580188 (0.00100) [2022-07-10 05:08:05,165][26022] Updated weights on worker 0-0, policy_version 580198 (0.00109) [2022-07-10 05:08:06,462][25689] Fps is (10 sec: 5539.4, 60 sec: 5659.7, 300 sec: 5637.2). Total num frames: 594129920. Throughput: 0: 5799.7. Samples: 594128034. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:06,463][25689] Avg episode reward: [(0, '-23.246')] [2022-07-10 05:08:06,786][26022] Updated weights on worker 0-0, policy_version 580208 (0.00085) [2022-07-10 05:08:08,857][26022] Updated weights on worker 0-0, policy_version 580218 (0.00094) [2022-07-10 05:08:10,371][26022] Updated weights on worker 0-0, policy_version 580228 (0.00091) [2022-07-10 05:08:11,495][25689] Fps is (10 sec: 5354.6, 60 sec: 5624.9, 300 sec: 5630.5). Total num frames: 594157568. Throughput: 0: 5766.3. Samples: 594161976. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:11,496][25689] Avg episode reward: [(0, '-22.695')] [2022-07-10 05:08:12,292][26022] Updated weights on worker 0-0, policy_version 580238 (0.00089) [2022-07-10 05:08:14,276][26022] Updated weights on worker 0-0, policy_version 580248 (0.00091) [2022-07-10 05:08:15,967][26022] Updated weights on worker 0-0, policy_version 580258 (0.00093) [2022-07-10 05:08:16,506][25689] Fps is (10 sec: 5811.3, 60 sec: 5677.7, 300 sec: 5640.9). Total num frames: 594188288. Throughput: 0: 5782.0. Samples: 594196078. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:16,507][25689] Avg episode reward: [(0, '-23.003')] [2022-07-10 05:08:17,726][26022] Updated weights on worker 0-0, policy_version 580268 (0.00090) [2022-07-10 05:08:19,585][26022] Updated weights on worker 0-0, policy_version 580278 (0.00082) [2022-07-10 05:08:21,416][26022] Updated weights on worker 0-0, policy_version 580288 (0.00094) [2022-07-10 05:08:21,570][25689] Fps is (10 sec: 5793.1, 60 sec: 5639.7, 300 sec: 5638.4). Total num frames: 594215936. Throughput: 0: 5774.7. Samples: 594212970. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:21,571][25689] Avg episode reward: [(0, '-22.439')] [2022-07-10 05:08:23,220][26022] Updated weights on worker 0-0, policy_version 580298 (0.00087) [2022-07-10 05:08:24,805][26022] Updated weights on worker 0-0, policy_version 580308 (0.00088) [2022-07-10 05:08:26,671][25689] Fps is (10 sec: 5440.1, 60 sec: 5620.4, 300 sec: 5631.2). Total num frames: 594243584. Throughput: 0: 5880.9. Samples: 594246878. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:26,672][25689] Avg episode reward: [(0, '-23.036')] [2022-07-10 05:08:26,794][26022] Updated weights on worker 0-0, policy_version 580318 (0.00092) [2022-07-10 05:08:28,794][26022] Updated weights on worker 0-0, policy_version 580328 (0.00089) [2022-07-10 05:08:30,461][26022] Updated weights on worker 0-0, policy_version 580338 (0.00087) [2022-07-10 05:08:31,689][25689] Fps is (10 sec: 5667.1, 60 sec: 5654.6, 300 sec: 5635.6). Total num frames: 594273280. Throughput: 0: 5872.0. Samples: 594280556. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:31,690][25689] Avg episode reward: [(0, '-24.642')] [2022-07-10 05:08:32,462][26022] Updated weights on worker 0-0, policy_version 580348 (0.00092) [2022-07-10 05:08:34,039][26022] Updated weights on worker 0-0, policy_version 580358 (0.00095) [2022-07-10 05:08:36,100][26022] Updated weights on worker 0-0, policy_version 580368 (0.00088) [2022-07-10 05:08:36,707][25689] Fps is (10 sec: 5611.8, 60 sec: 5622.1, 300 sec: 5632.9). Total num frames: 594299904. Throughput: 0: 5027.4. Samples: 594297634. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:36,707][25689] Avg episode reward: [(0, '-26.002')] [2022-07-10 05:08:37,608][26022] Updated weights on worker 0-0, policy_version 580378 (0.00083) [2022-07-10 05:08:39,697][26022] Updated weights on worker 0-0, policy_version 580388 (0.00094) [2022-07-10 05:08:41,515][26022] Updated weights on worker 0-0, policy_version 580398 (0.00083) [2022-07-10 05:08:41,731][25689] Fps is (10 sec: 5506.7, 60 sec: 5604.5, 300 sec: 5634.5). Total num frames: 594328576. Throughput: 0: 5877.7. Samples: 594331466. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:41,731][25689] Avg episode reward: [(0, '-26.348')] [2022-07-10 05:08:43,456][26022] Updated weights on worker 0-0, policy_version 580408 (0.00100) [2022-07-10 05:08:45,028][26022] Updated weights on worker 0-0, policy_version 580418 (0.00094) [2022-07-10 05:08:46,836][25689] Fps is (10 sec: 5661.0, 60 sec: 5624.2, 300 sec: 5629.4). Total num frames: 594357248. Throughput: 0: 5871.5. Samples: 594365280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:46,837][25689] Avg episode reward: [(0, '-25.656')] [2022-07-10 05:08:46,958][26022] Updated weights on worker 0-0, policy_version 580428 (0.00089) [2022-07-10 05:08:48,611][26022] Updated weights on worker 0-0, policy_version 580438 (0.00573) [2022-07-10 05:08:50,425][26022] Updated weights on worker 0-0, policy_version 580448 (0.00087) [2022-07-10 05:08:51,865][25689] Fps is (10 sec: 5658.2, 60 sec: 5606.8, 300 sec: 5629.8). Total num frames: 594385920. Throughput: 0: 5036.0. Samples: 594382164. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:51,866][25689] Avg episode reward: [(0, '-25.534')] [2022-07-10 05:08:52,353][26022] Updated weights on worker 0-0, policy_version 580458 (0.00089) [2022-07-10 05:08:54,206][26022] Updated weights on worker 0-0, policy_version 580468 (0.00094) [2022-07-10 05:08:55,830][26022] Updated weights on worker 0-0, policy_version 580478 (0.00095) [2022-07-10 05:08:56,926][25689] Fps is (10 sec: 5480.3, 60 sec: 5590.3, 300 sec: 5629.2). Total num frames: 594412544. Throughput: 0: 5848.1. Samples: 594415880. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:08:56,927][25689] Avg episode reward: [(0, '-25.066')] [2022-07-10 05:08:57,782][26022] Updated weights on worker 0-0, policy_version 580488 (0.00092) [2022-07-10 05:08:59,784][26022] Updated weights on worker 0-0, policy_version 580498 (0.00089) [2022-07-10 05:09:01,236][26022] Updated weights on worker 0-0, policy_version 580508 (0.00090) [2022-07-10 05:09:01,935][25689] Fps is (10 sec: 5694.7, 60 sec: 5612.3, 300 sec: 5637.5). Total num frames: 594443264. Throughput: 0: 5886.8. Samples: 594450406. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:01,936][25689] Avg episode reward: [(0, '-23.731')] [2022-07-10 05:09:03,703][26022] Updated weights on worker 0-0, policy_version 580518 (0.00089) [2022-07-10 05:09:05,267][26022] Updated weights on worker 0-0, policy_version 580528 (0.00092) [2022-07-10 05:09:07,064][25689] Fps is (10 sec: 5656.4, 60 sec: 5609.9, 300 sec: 5635.5). Total num frames: 594469888. Throughput: 0: 4950.6. Samples: 594465420. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:07,064][25689] Avg episode reward: [(0, '-23.252')] [2022-07-10 05:09:07,145][26022] Updated weights on worker 0-0, policy_version 580538 (0.00112) [2022-07-10 05:09:09,004][26022] Updated weights on worker 0-0, policy_version 580548 (0.00093) [2022-07-10 05:09:10,583][26022] Updated weights on worker 0-0, policy_version 580558 (0.00086) [2022-07-10 05:09:12,080][25689] Fps is (10 sec: 5349.7, 60 sec: 5611.4, 300 sec: 5633.3). Total num frames: 594497536. Throughput: 0: 5802.8. Samples: 594499464. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:12,080][25689] Avg episode reward: [(0, '-23.051')] [2022-07-10 05:09:12,627][26022] Updated weights on worker 0-0, policy_version 580568 (0.00094) [2022-07-10 05:09:14,026][26022] Updated weights on worker 0-0, policy_version 580578 (0.00092) [2022-07-10 05:09:16,244][26022] Updated weights on worker 0-0, policy_version 580588 (0.00082) [2022-07-10 05:09:17,123][25689] Fps is (10 sec: 5802.3, 60 sec: 5608.4, 300 sec: 5636.9). Total num frames: 594528256. Throughput: 0: 5819.8. Samples: 594533424. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:17,125][25689] Avg episode reward: [(0, '-23.904')] [2022-07-10 05:09:17,812][26022] Updated weights on worker 0-0, policy_version 580598 (0.00086) [2022-07-10 05:09:19,790][26022] Updated weights on worker 0-0, policy_version 580608 (0.00098) [2022-07-10 05:09:21,618][26022] Updated weights on worker 0-0, policy_version 580618 (0.00087) [2022-07-10 05:09:22,203][25689] Fps is (10 sec: 5664.8, 60 sec: 5590.1, 300 sec: 5623.3). Total num frames: 594554880. Throughput: 0: 4933.2. Samples: 594550390. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:22,204][25689] Avg episode reward: [(0, '-24.361')] [2022-07-10 05:09:23,515][26022] Updated weights on worker 0-0, policy_version 580628 (0.00091) [2022-07-10 05:09:25,186][26022] Updated weights on worker 0-0, policy_version 580638 (0.00085) [2022-07-10 05:09:27,132][26022] Updated weights on worker 0-0, policy_version 580648 (0.00078) [2022-07-10 05:09:27,294][25689] Fps is (10 sec: 5537.5, 60 sec: 5624.8, 300 sec: 5632.5). Total num frames: 594584576. Throughput: 0: 5884.9. Samples: 594584472. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:27,296][25689] Avg episode reward: [(0, '-25.522')] [2022-07-10 05:09:28,811][26022] Updated weights on worker 0-0, policy_version 580658 (0.00091) [2022-07-10 05:09:30,728][26022] Updated weights on worker 0-0, policy_version 580668 (0.00086) [2022-07-10 05:09:32,396][25689] Fps is (10 sec: 5726.4, 60 sec: 5600.2, 300 sec: 5631.2). Total num frames: 594613248. Throughput: 0: 5865.7. Samples: 594618630. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:32,396][25689] Avg episode reward: [(0, '-26.426')] [2022-07-10 05:09:32,459][26022] Updated weights on worker 0-0, policy_version 580678 (0.00089) [2022-07-10 05:09:34,415][26022] Updated weights on worker 0-0, policy_version 580688 (0.00079) [2022-07-10 05:09:35,950][26022] Updated weights on worker 0-0, policy_version 580698 (0.00088) [2022-07-10 05:09:37,419][25689] Fps is (10 sec: 5663.5, 60 sec: 5633.4, 300 sec: 5628.7). Total num frames: 594641920. Throughput: 0: 5049.0. Samples: 594635898. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 05:09:37,421][25689] Avg episode reward: [(0, '-26.409')] [2022-07-10 05:09:38,035][26022] Updated weights on worker 0-0, policy_version 580708 (0.00089) [2022-07-10 05:09:39,433][26022] Updated weights on worker 0-0, policy_version 580718 (0.00089) [2022-07-10 05:09:41,498][26022] Updated weights on worker 0-0, policy_version 580728 (0.00090) [2022-07-10 05:09:42,435][25689] Fps is (10 sec: 5814.1, 60 sec: 5651.0, 300 sec: 5633.9). Total num frames: 594671616. Throughput: 0: 5927.2. Samples: 594670308. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:09:42,436][25689] Avg episode reward: [(0, '-26.783')] [2022-07-10 05:09:42,953][26022] Updated weights on worker 0-0, policy_version 580738 (0.00080) [2022-07-10 05:09:44,954][26022] Updated weights on worker 0-0, policy_version 580748 (0.00085) [2022-07-10 05:09:46,754][26022] Updated weights on worker 0-0, policy_version 580758 (0.00084) [2022-07-10 05:09:47,554][25689] Fps is (10 sec: 5759.2, 60 sec: 5649.7, 300 sec: 5632.7). Total num frames: 594700288. Throughput: 0: 5912.2. Samples: 594704252. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:09:47,555][25689] Avg episode reward: [(0, '-26.241')] [2022-07-10 05:09:48,723][26022] Updated weights on worker 0-0, policy_version 580768 (0.00082) [2022-07-10 05:09:50,462][26022] Updated weights on worker 0-0, policy_version 580778 (0.00085) [2022-07-10 05:09:52,221][26022] Updated weights on worker 0-0, policy_version 580788 (0.00082) [2022-07-10 05:09:52,566][25689] Fps is (10 sec: 5660.3, 60 sec: 5651.4, 300 sec: 5636.0). Total num frames: 594728960. Throughput: 0: 5083.5. Samples: 594721164. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:09:52,566][25689] Avg episode reward: [(0, '-25.919')] [2022-07-10 05:09:54,112][26022] Updated weights on worker 0-0, policy_version 580798 (0.00086) [2022-07-10 05:09:55,917][26022] Updated weights on worker 0-0, policy_version 580808 (0.00093) [2022-07-10 05:09:57,574][25689] Fps is (10 sec: 5518.7, 60 sec: 5656.3, 300 sec: 5626.9). Total num frames: 594755584. Throughput: 0: 5916.5. Samples: 594755142. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:09:57,574][25689] Avg episode reward: [(0, '-25.575')] [2022-07-10 05:09:57,851][26022] Updated weights on worker 0-0, policy_version 580818 (0.00091) [2022-07-10 05:09:59,487][26022] Updated weights on worker 0-0, policy_version 580828 (0.00487) [2022-07-10 05:10:01,402][26022] Updated weights on worker 0-0, policy_version 580838 (0.00091) [2022-07-10 05:10:02,371][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:10:02,383][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000580841_594781184.pth [2022-07-10 05:10:02,384][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000578860_592752640.pth [2022-07-10 05:10:02,721][25689] Fps is (10 sec: 5243.3, 60 sec: 5576.0, 300 sec: 5625.3). Total num frames: 594782208. Throughput: 0: 5824.8. Samples: 594788472. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:02,735][25689] Avg episode reward: [(0, '-24.052')] [2022-07-10 05:10:03,652][26022] Updated weights on worker 0-0, policy_version 580848 (0.00095) [2022-07-10 05:10:05,488][26022] Updated weights on worker 0-0, policy_version 580858 (0.00105) [2022-07-10 05:10:07,415][26022] Updated weights on worker 0-0, policy_version 580868 (0.00087) [2022-07-10 05:10:07,848][25689] Fps is (10 sec: 5382.2, 60 sec: 5609.9, 300 sec: 5623.5). Total num frames: 594810880. Throughput: 0: 5727.8. Samples: 594820490. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:07,848][25689] Avg episode reward: [(0, '-24.184')] [2022-07-10 05:10:08,988][26022] Updated weights on worker 0-0, policy_version 580878 (0.00089) [2022-07-10 05:10:10,986][26022] Updated weights on worker 0-0, policy_version 580888 (0.00091) [2022-07-10 05:10:12,840][26022] Updated weights on worker 0-0, policy_version 580898 (0.00090) [2022-07-10 05:10:12,910][25689] Fps is (10 sec: 5628.3, 60 sec: 5622.5, 300 sec: 5626.7). Total num frames: 594839552. Throughput: 0: 5715.0. Samples: 594837430. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:12,910][25689] Avg episode reward: [(0, '-23.608')] [2022-07-10 05:10:14,599][26022] Updated weights on worker 0-0, policy_version 580908 (0.00092) [2022-07-10 05:10:16,564][26022] Updated weights on worker 0-0, policy_version 580918 (0.00090) [2022-07-10 05:10:17,915][25689] Fps is (10 sec: 5696.3, 60 sec: 5592.4, 300 sec: 5630.5). Total num frames: 594868224. Throughput: 0: 5705.6. Samples: 594871198. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:17,915][25689] Avg episode reward: [(0, '-23.366')] [2022-07-10 05:10:18,156][26022] Updated weights on worker 0-0, policy_version 580928 (0.00093) [2022-07-10 05:10:20,032][26022] Updated weights on worker 0-0, policy_version 580938 (0.00090) [2022-07-10 05:10:21,877][26022] Updated weights on worker 0-0, policy_version 580948 (0.00086) [2022-07-10 05:10:22,916][25689] Fps is (10 sec: 5832.8, 60 sec: 5650.1, 300 sec: 5628.0). Total num frames: 594897920. Throughput: 0: 5764.2. Samples: 594904882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:22,917][25689] Avg episode reward: [(0, '-24.014')] [2022-07-10 05:10:23,870][26022] Updated weights on worker 0-0, policy_version 580958 (0.00089) [2022-07-10 05:10:25,497][26022] Updated weights on worker 0-0, policy_version 580968 (0.00087) [2022-07-10 05:10:27,356][26022] Updated weights on worker 0-0, policy_version 580978 (0.00084) [2022-07-10 05:10:27,964][25689] Fps is (10 sec: 5706.0, 60 sec: 5620.4, 300 sec: 5624.1). Total num frames: 594925568. Throughput: 0: 5036.1. Samples: 594921800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:27,969][25689] Avg episode reward: [(0, '-24.652')] [2022-07-10 05:10:29,046][26022] Updated weights on worker 0-0, policy_version 580988 (0.00089) [2022-07-10 05:10:31,015][26022] Updated weights on worker 0-0, policy_version 580998 (0.00085) [2022-07-10 05:10:32,732][26022] Updated weights on worker 0-0, policy_version 581008 (0.00082) [2022-07-10 05:10:33,039][25689] Fps is (10 sec: 5462.4, 60 sec: 5606.0, 300 sec: 5626.3). Total num frames: 594953216. Throughput: 0: 5885.7. Samples: 594955908. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:33,040][25689] Avg episode reward: [(0, '-24.742')] [2022-07-10 05:10:34,567][26022] Updated weights on worker 0-0, policy_version 581018 (0.00083) [2022-07-10 05:10:36,540][26022] Updated weights on worker 0-0, policy_version 581028 (0.00093) [2022-07-10 05:10:38,026][26022] Updated weights on worker 0-0, policy_version 581038 (0.00100) [2022-07-10 05:10:38,113][25689] Fps is (10 sec: 5650.3, 60 sec: 5618.2, 300 sec: 5628.7). Total num frames: 594982912. Throughput: 0: 5871.1. Samples: 594989786. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:38,114][25689] Avg episode reward: [(0, '-24.775')] [2022-07-10 05:10:40,294][26022] Updated weights on worker 0-0, policy_version 581048 (0.00087) [2022-07-10 05:10:41,610][26022] Updated weights on worker 0-0, policy_version 581058 (0.00094) [2022-07-10 05:10:43,133][25689] Fps is (10 sec: 5681.0, 60 sec: 5584.0, 300 sec: 5626.1). Total num frames: 595010560. Throughput: 0: 5022.3. Samples: 595006418. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:43,135][25689] Avg episode reward: [(0, '-25.767')] [2022-07-10 05:10:43,819][26022] Updated weights on worker 0-0, policy_version 581068 (0.00084) [2022-07-10 05:10:45,423][26022] Updated weights on worker 0-0, policy_version 581078 (0.00094) [2022-07-10 05:10:47,375][26022] Updated weights on worker 0-0, policy_version 581088 (0.00087) [2022-07-10 05:10:48,222][25689] Fps is (10 sec: 5469.9, 60 sec: 5570.0, 300 sec: 5621.1). Total num frames: 595038208. Throughput: 0: 5844.1. Samples: 595040190. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:48,223][25689] Avg episode reward: [(0, '-25.542')] [2022-07-10 05:10:49,091][26022] Updated weights on worker 0-0, policy_version 581098 (0.00087) [2022-07-10 05:10:51,217][26022] Updated weights on worker 0-0, policy_version 581108 (0.00078) [2022-07-10 05:10:52,880][26022] Updated weights on worker 0-0, policy_version 581118 (0.00086) [2022-07-10 05:10:53,295][25689] Fps is (10 sec: 5542.6, 60 sec: 5564.4, 300 sec: 5619.9). Total num frames: 595066880. Throughput: 0: 5833.7. Samples: 595074072. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:53,295][25689] Avg episode reward: [(0, '-26.076')] [2022-07-10 05:10:54,623][26022] Updated weights on worker 0-0, policy_version 581128 (0.00089) [2022-07-10 05:10:56,336][26022] Updated weights on worker 0-0, policy_version 581138 (0.00094) [2022-07-10 05:10:58,043][26022] Updated weights on worker 0-0, policy_version 581148 (0.00099) [2022-07-10 05:10:58,332][25689] Fps is (10 sec: 5773.6, 60 sec: 5612.3, 300 sec: 5622.8). Total num frames: 595096576. Throughput: 0: 5014.0. Samples: 595091162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:10:58,333][25689] Avg episode reward: [(0, '-26.613')] [2022-07-10 05:11:00,195][26022] Updated weights on worker 0-0, policy_version 581158 (0.00087) [2022-07-10 05:11:02,135][26022] Updated weights on worker 0-0, policy_version 581168 (0.00084) [2022-07-10 05:11:03,337][25689] Fps is (10 sec: 5506.3, 60 sec: 5608.5, 300 sec: 5620.6). Total num frames: 595122176. Throughput: 0: 5814.0. Samples: 595123882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:03,338][25689] Avg episode reward: [(0, '-26.796')] [2022-07-10 05:11:04,108][26022] Updated weights on worker 0-0, policy_version 581178 (0.00085) [2022-07-10 05:11:05,694][26022] Updated weights on worker 0-0, policy_version 581188 (0.00084) [2022-07-10 05:11:07,514][26022] Updated weights on worker 0-0, policy_version 581198 (0.00082) [2022-07-10 05:11:08,378][25689] Fps is (10 sec: 5402.3, 60 sec: 5616.5, 300 sec: 5624.4). Total num frames: 595150848. Throughput: 0: 5815.6. Samples: 595157406. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:08,378][25689] Avg episode reward: [(0, '-26.381')] [2022-07-10 05:11:09,417][26022] Updated weights on worker 0-0, policy_version 581208 (0.00090) [2022-07-10 05:11:11,368][26022] Updated weights on worker 0-0, policy_version 581218 (0.00088) [2022-07-10 05:11:13,025][26022] Updated weights on worker 0-0, policy_version 581228 (0.00085) [2022-07-10 05:11:13,409][25689] Fps is (10 sec: 5693.4, 60 sec: 5619.3, 300 sec: 5624.1). Total num frames: 595179520. Throughput: 0: 4991.3. Samples: 595174468. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:13,410][25689] Avg episode reward: [(0, '-26.428')] [2022-07-10 05:11:14,764][26022] Updated weights on worker 0-0, policy_version 581238 (0.00086) [2022-07-10 05:11:16,464][26022] Updated weights on worker 0-0, policy_version 581248 (0.00097) [2022-07-10 05:11:18,414][25689] Fps is (10 sec: 5611.8, 60 sec: 5602.5, 300 sec: 5621.0). Total num frames: 595207168. Throughput: 0: 5858.2. Samples: 595208806. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:18,414][25689] Avg episode reward: [(0, '-25.853')] [2022-07-10 05:11:18,539][26022] Updated weights on worker 0-0, policy_version 581258 (0.00110) [2022-07-10 05:11:20,068][26022] Updated weights on worker 0-0, policy_version 581268 (0.00084) [2022-07-10 05:11:22,169][26022] Updated weights on worker 0-0, policy_version 581278 (0.00088) [2022-07-10 05:11:23,425][25689] Fps is (10 sec: 5725.2, 60 sec: 5601.6, 300 sec: 5628.5). Total num frames: 595236864. Throughput: 0: 5908.2. Samples: 595242566. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:23,427][25689] Avg episode reward: [(0, '-24.859')] [2022-07-10 05:11:23,829][26022] Updated weights on worker 0-0, policy_version 581288 (0.00094) [2022-07-10 05:11:25,744][26022] Updated weights on worker 0-0, policy_version 581298 (0.00089) [2022-07-10 05:11:27,447][26022] Updated weights on worker 0-0, policy_version 581308 (0.00083) [2022-07-10 05:11:28,501][25689] Fps is (10 sec: 5685.0, 60 sec: 5599.0, 300 sec: 5620.3). Total num frames: 595264512. Throughput: 0: 5087.6. Samples: 595259784. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:28,501][25689] Avg episode reward: [(0, '-25.668')] [2022-07-10 05:11:29,243][26022] Updated weights on worker 0-0, policy_version 581318 (0.00090) [2022-07-10 05:11:30,994][26022] Updated weights on worker 0-0, policy_version 581328 (0.00091) [2022-07-10 05:11:32,834][26022] Updated weights on worker 0-0, policy_version 581338 (0.00094) [2022-07-10 05:11:33,549][25689] Fps is (10 sec: 5563.4, 60 sec: 5618.5, 300 sec: 5623.2). Total num frames: 595293184. Throughput: 0: 5928.1. Samples: 595293856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:33,550][25689] Avg episode reward: [(0, '-25.183')] [2022-07-10 05:11:34,638][26022] Updated weights on worker 0-0, policy_version 581348 (0.00088) [2022-07-10 05:11:36,563][26022] Updated weights on worker 0-0, policy_version 581358 (0.00084) [2022-07-10 05:11:38,168][26022] Updated weights on worker 0-0, policy_version 581368 (0.00083) [2022-07-10 05:11:38,558][25689] Fps is (10 sec: 5905.7, 60 sec: 5641.4, 300 sec: 5630.1). Total num frames: 595323904. Throughput: 0: 5923.7. Samples: 595328132. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:38,559][25689] Avg episode reward: [(0, '-26.921')] [2022-07-10 05:11:40,093][26022] Updated weights on worker 0-0, policy_version 581378 (0.00085) [2022-07-10 05:11:41,820][26022] Updated weights on worker 0-0, policy_version 581388 (0.00092) [2022-07-10 05:11:43,569][25689] Fps is (10 sec: 5722.9, 60 sec: 5625.3, 300 sec: 5624.2). Total num frames: 595350528. Throughput: 0: 5091.4. Samples: 595345124. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:43,569][25689] Avg episode reward: [(0, '-26.537')] [2022-07-10 05:11:43,696][26022] Updated weights on worker 0-0, policy_version 581398 (0.00089) [2022-07-10 05:11:45,388][26022] Updated weights on worker 0-0, policy_version 581408 (0.00084) [2022-07-10 05:11:47,191][26022] Updated weights on worker 0-0, policy_version 581418 (0.00089) [2022-07-10 05:11:48,631][25689] Fps is (10 sec: 5591.1, 60 sec: 5661.7, 300 sec: 5630.2). Total num frames: 595380224. Throughput: 0: 5933.9. Samples: 595379232. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:48,634][25689] Avg episode reward: [(0, '-26.950')] [2022-07-10 05:11:49,225][26022] Updated weights on worker 0-0, policy_version 581428 (0.00624) [2022-07-10 05:11:50,961][26022] Updated weights on worker 0-0, policy_version 581438 (0.00093) [2022-07-10 05:11:52,841][26022] Updated weights on worker 0-0, policy_version 581448 (0.00088) [2022-07-10 05:11:53,665][25689] Fps is (10 sec: 5781.0, 60 sec: 5665.3, 300 sec: 5626.4). Total num frames: 595408896. Throughput: 0: 5936.9. Samples: 595413286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:53,667][25689] Avg episode reward: [(0, '-26.543')] [2022-07-10 05:11:54,496][26022] Updated weights on worker 0-0, policy_version 581458 (0.00092) [2022-07-10 05:11:56,188][26022] Updated weights on worker 0-0, policy_version 581468 (0.00087) [2022-07-10 05:11:58,199][26022] Updated weights on worker 0-0, policy_version 581478 (0.00086) [2022-07-10 05:11:58,688][25689] Fps is (10 sec: 5498.2, 60 sec: 5615.8, 300 sec: 5619.4). Total num frames: 595435520. Throughput: 0: 5076.7. Samples: 595430328. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:11:58,689][25689] Avg episode reward: [(0, '-26.067')] [2022-07-10 05:11:59,840][26022] Updated weights on worker 0-0, policy_version 581488 (0.00090) [2022-07-10 05:12:02,087][26022] Updated weights on worker 0-0, policy_version 581498 (0.00084) [2022-07-10 05:12:02,461][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:12:02,474][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000581500_595456000.pth [2022-07-10 05:12:02,475][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000579521_593429504.pth [2022-07-10 05:12:03,700][25689] Fps is (10 sec: 5306.6, 60 sec: 5632.1, 300 sec: 5624.6). Total num frames: 595462144. Throughput: 0: 5805.6. Samples: 595461996. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:03,700][25689] Avg episode reward: [(0, '-25.730')] [2022-07-10 05:12:03,828][26022] Updated weights on worker 0-0, policy_version 581508 (0.00087) [2022-07-10 05:12:05,736][26022] Updated weights on worker 0-0, policy_version 581518 (0.00085) [2022-07-10 05:12:07,511][26022] Updated weights on worker 0-0, policy_version 581528 (0.00091) [2022-07-10 05:12:08,742][25689] Fps is (10 sec: 5499.7, 60 sec: 5631.9, 300 sec: 5620.7). Total num frames: 595490816. Throughput: 0: 5804.0. Samples: 595495960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:08,743][25689] Avg episode reward: [(0, '-25.887')] [2022-07-10 05:12:09,239][26022] Updated weights on worker 0-0, policy_version 581538 (0.00088) [2022-07-10 05:12:11,270][26022] Updated weights on worker 0-0, policy_version 581548 (0.00090) [2022-07-10 05:12:13,131][26022] Updated weights on worker 0-0, policy_version 581558 (0.00084) [2022-07-10 05:12:13,752][25689] Fps is (10 sec: 5704.4, 60 sec: 5633.9, 300 sec: 5624.6). Total num frames: 595519488. Throughput: 0: 4960.0. Samples: 595512916. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:13,753][25689] Avg episode reward: [(0, '-25.620')] [2022-07-10 05:12:14,844][26022] Updated weights on worker 0-0, policy_version 581568 (0.00085) [2022-07-10 05:12:16,639][26022] Updated weights on worker 0-0, policy_version 581578 (0.00085) [2022-07-10 05:12:18,468][26022] Updated weights on worker 0-0, policy_version 581588 (0.00088) [2022-07-10 05:12:18,757][25689] Fps is (10 sec: 5726.0, 60 sec: 5650.9, 300 sec: 5621.4). Total num frames: 595548160. Throughput: 0: 5818.3. Samples: 595547096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:18,757][25689] Avg episode reward: [(0, '-25.064')] [2022-07-10 05:12:20,413][26022] Updated weights on worker 0-0, policy_version 581598 (0.00085) [2022-07-10 05:12:21,920][26022] Updated weights on worker 0-0, policy_version 581608 (0.00081) [2022-07-10 05:12:23,771][25689] Fps is (10 sec: 5519.4, 60 sec: 5599.8, 300 sec: 5615.7). Total num frames: 595574784. Throughput: 0: 5922.8. Samples: 595580874. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:23,771][25689] Avg episode reward: [(0, '-24.788')] [2022-07-10 05:12:24,032][26022] Updated weights on worker 0-0, policy_version 581618 (0.00093) [2022-07-10 05:12:25,554][26022] Updated weights on worker 0-0, policy_version 581628 (0.00082) [2022-07-10 05:12:27,642][26022] Updated weights on worker 0-0, policy_version 581638 (0.00087) [2022-07-10 05:12:28,828][25689] Fps is (10 sec: 5592.1, 60 sec: 5635.4, 300 sec: 5621.9). Total num frames: 595604480. Throughput: 0: 5072.9. Samples: 595597856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:28,829][25689] Avg episode reward: [(0, '-24.440')] [2022-07-10 05:12:29,245][26022] Updated weights on worker 0-0, policy_version 581648 (0.00068) [2022-07-10 05:12:30,937][26022] Updated weights on worker 0-0, policy_version 581658 (0.00085) [2022-07-10 05:12:32,974][26022] Updated weights on worker 0-0, policy_version 581668 (0.00088) [2022-07-10 05:12:33,838][25689] Fps is (10 sec: 5899.5, 60 sec: 5655.9, 300 sec: 5625.8). Total num frames: 595634176. Throughput: 0: 5934.5. Samples: 595632114. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:33,838][25689] Avg episode reward: [(0, '-24.584')] [2022-07-10 05:12:34,697][26022] Updated weights on worker 0-0, policy_version 581678 (0.00086) [2022-07-10 05:12:36,591][26022] Updated weights on worker 0-0, policy_version 581688 (0.00094) [2022-07-10 05:12:38,289][26022] Updated weights on worker 0-0, policy_version 581698 (0.00089) [2022-07-10 05:12:38,847][25689] Fps is (10 sec: 5621.7, 60 sec: 5588.0, 300 sec: 5615.6). Total num frames: 595660800. Throughput: 0: 5929.8. Samples: 595666224. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 05:12:38,847][25689] Avg episode reward: [(0, '-24.319')] [2022-07-10 05:12:39,998][26022] Updated weights on worker 0-0, policy_version 581708 (0.00090) [2022-07-10 05:12:42,096][26022] Updated weights on worker 0-0, policy_version 581718 (0.00087) [2022-07-10 05:12:43,558][26022] Updated weights on worker 0-0, policy_version 581728 (0.00052) [2022-07-10 05:12:43,860][25689] Fps is (10 sec: 5721.5, 60 sec: 5655.7, 300 sec: 5628.2). Total num frames: 595691520. Throughput: 0: 5094.8. Samples: 595683228. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:12:43,862][25689] Avg episode reward: [(0, '-25.414')] [2022-07-10 05:12:45,696][26022] Updated weights on worker 0-0, policy_version 581738 (0.00096) [2022-07-10 05:12:46,953][26022] Updated weights on worker 0-0, policy_version 581748 (0.00084) [2022-07-10 05:12:48,919][25689] Fps is (10 sec: 5693.4, 60 sec: 5605.1, 300 sec: 5617.3). Total num frames: 595718144. Throughput: 0: 5946.1. Samples: 595717316. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:12:48,920][25689] Avg episode reward: [(0, '-25.740')] [2022-07-10 05:12:49,245][26022] Updated weights on worker 0-0, policy_version 581758 (0.00089) [2022-07-10 05:12:50,886][26022] Updated weights on worker 0-0, policy_version 581768 (0.00091) [2022-07-10 05:12:52,544][26022] Updated weights on worker 0-0, policy_version 581778 (0.00088) [2022-07-10 05:12:53,924][25689] Fps is (10 sec: 5494.8, 60 sec: 5607.8, 300 sec: 5621.8). Total num frames: 595746816. Throughput: 0: 5948.4. Samples: 595751594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:12:53,926][25689] Avg episode reward: [(0, '-26.118')] [2022-07-10 05:12:54,519][26022] Updated weights on worker 0-0, policy_version 581788 (0.00090) [2022-07-10 05:12:56,255][26022] Updated weights on worker 0-0, policy_version 581798 (0.00085) [2022-07-10 05:12:58,039][26022] Updated weights on worker 0-0, policy_version 581808 (0.00094) [2022-07-10 05:12:58,928][25689] Fps is (10 sec: 5831.5, 60 sec: 5660.6, 300 sec: 5623.0). Total num frames: 595776512. Throughput: 0: 5112.9. Samples: 595768896. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:12:58,928][25689] Avg episode reward: [(0, '-26.528')] [2022-07-10 05:12:59,846][26022] Updated weights on worker 0-0, policy_version 581818 (0.00088) [2022-07-10 05:13:01,568][26022] Updated weights on worker 0-0, policy_version 581828 (0.00091) [2022-07-10 05:13:03,874][26022] Updated weights on worker 0-0, policy_version 581838 (0.00087) [2022-07-10 05:13:03,958][25689] Fps is (10 sec: 5510.7, 60 sec: 5641.8, 300 sec: 5620.9). Total num frames: 595802112. Throughput: 0: 5974.5. Samples: 595803302. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:03,959][25689] Avg episode reward: [(0, '-25.235')] [2022-07-10 05:13:05,448][26022] Updated weights on worker 0-0, policy_version 581848 (0.00091) [2022-07-10 05:13:07,490][26022] Updated weights on worker 0-0, policy_version 581858 (0.00092) [2022-07-10 05:13:09,011][26022] Updated weights on worker 0-0, policy_version 581868 (0.00083) [2022-07-10 05:13:09,014][25689] Fps is (10 sec: 5482.4, 60 sec: 5657.6, 300 sec: 5627.3). Total num frames: 595831808. Throughput: 0: 5899.4. Samples: 595835866. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:09,014][25689] Avg episode reward: [(0, '-24.150')] [2022-07-10 05:13:11,101][26022] Updated weights on worker 0-0, policy_version 581878 (0.00086) [2022-07-10 05:13:12,665][26022] Updated weights on worker 0-0, policy_version 581888 (0.00092) [2022-07-10 05:13:14,031][25689] Fps is (10 sec: 5693.1, 60 sec: 5640.0, 300 sec: 5616.9). Total num frames: 595859456. Throughput: 0: 5044.9. Samples: 595853030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:14,031][25689] Avg episode reward: [(0, '-24.290')] [2022-07-10 05:13:14,689][26022] Updated weights on worker 0-0, policy_version 581898 (0.00085) [2022-07-10 05:13:16,320][26022] Updated weights on worker 0-0, policy_version 581908 (0.00088) [2022-07-10 05:13:18,082][26022] Updated weights on worker 0-0, policy_version 581918 (0.00087) [2022-07-10 05:13:19,039][25689] Fps is (10 sec: 5618.0, 60 sec: 5639.6, 300 sec: 5621.4). Total num frames: 595888128. Throughput: 0: 5896.8. Samples: 595887486. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:19,039][25689] Avg episode reward: [(0, '-24.216')] [2022-07-10 05:13:19,914][26022] Updated weights on worker 0-0, policy_version 581928 (0.00087) [2022-07-10 05:13:21,709][26022] Updated weights on worker 0-0, policy_version 581938 (0.00083) [2022-07-10 05:13:23,468][26022] Updated weights on worker 0-0, policy_version 581948 (0.00087) [2022-07-10 05:13:24,048][25689] Fps is (10 sec: 5724.3, 60 sec: 5674.0, 300 sec: 5626.6). Total num frames: 595916800. Throughput: 0: 5863.1. Samples: 595921092. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:24,049][25689] Avg episode reward: [(0, '-23.035')] [2022-07-10 05:13:25,466][26022] Updated weights on worker 0-0, policy_version 581958 (0.00083) [2022-07-10 05:13:27,276][26022] Updated weights on worker 0-0, policy_version 581968 (0.00090) [2022-07-10 05:13:29,104][25689] Fps is (10 sec: 5493.6, 60 sec: 5623.2, 300 sec: 5615.5). Total num frames: 595943424. Throughput: 0: 5924.5. Samples: 595954892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:29,105][25689] Avg episode reward: [(0, '-23.923')] [2022-07-10 05:13:29,276][26022] Updated weights on worker 0-0, policy_version 581978 (0.00079) [2022-07-10 05:13:30,776][26022] Updated weights on worker 0-0, policy_version 581988 (0.00089) [2022-07-10 05:13:32,854][26022] Updated weights on worker 0-0, policy_version 581998 (0.00078) [2022-07-10 05:13:34,140][25689] Fps is (10 sec: 5783.6, 60 sec: 5654.7, 300 sec: 5632.4). Total num frames: 595975168. Throughput: 0: 5920.0. Samples: 595972078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:34,141][25689] Avg episode reward: [(0, '-24.819')] [2022-07-10 05:13:34,346][26022] Updated weights on worker 0-0, policy_version 582008 (0.00082) [2022-07-10 05:13:36,179][26022] Updated weights on worker 0-0, policy_version 582018 (0.00090) [2022-07-10 05:13:38,147][26022] Updated weights on worker 0-0, policy_version 582028 (0.00093) [2022-07-10 05:13:39,234][25689] Fps is (10 sec: 5964.1, 60 sec: 5680.6, 300 sec: 5631.1). Total num frames: 596003840. Throughput: 0: 5900.3. Samples: 596006646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:39,235][25689] Avg episode reward: [(0, '-25.084')] [2022-07-10 05:13:39,560][26022] Updated weights on worker 0-0, policy_version 582038 (0.00087) [2022-07-10 05:13:41,727][26022] Updated weights on worker 0-0, policy_version 582048 (0.00089) [2022-07-10 05:13:43,195][26022] Updated weights on worker 0-0, policy_version 582058 (0.00433) [2022-07-10 05:13:44,266][25689] Fps is (10 sec: 5460.8, 60 sec: 5611.1, 300 sec: 5625.6). Total num frames: 596030464. Throughput: 0: 5930.1. Samples: 596040988. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:44,267][25689] Avg episode reward: [(0, '-24.200')] [2022-07-10 05:13:45,363][26022] Updated weights on worker 0-0, policy_version 582068 (0.00082) [2022-07-10 05:13:46,975][26022] Updated weights on worker 0-0, policy_version 582078 (0.00089) [2022-07-10 05:13:48,731][26022] Updated weights on worker 0-0, policy_version 582088 (0.00096) [2022-07-10 05:13:49,317][25689] Fps is (10 sec: 5789.1, 60 sec: 5696.6, 300 sec: 5635.5). Total num frames: 596062208. Throughput: 0: 5103.4. Samples: 596058044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:49,317][25689] Avg episode reward: [(0, '-24.058')] [2022-07-10 05:13:50,524][26022] Updated weights on worker 0-0, policy_version 582098 (0.00094) [2022-07-10 05:13:52,378][26022] Updated weights on worker 0-0, policy_version 582108 (0.00088) [2022-07-10 05:13:54,139][26022] Updated weights on worker 0-0, policy_version 582118 (0.00066) [2022-07-10 05:13:54,321][25689] Fps is (10 sec: 5907.0, 60 sec: 5679.7, 300 sec: 5640.0). Total num frames: 596089856. Throughput: 0: 5935.1. Samples: 596091852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:54,321][25689] Avg episode reward: [(0, '-23.431')] [2022-07-10 05:13:56,079][26022] Updated weights on worker 0-0, policy_version 582128 (0.00098) [2022-07-10 05:13:57,656][26022] Updated weights on worker 0-0, policy_version 582138 (0.00094) [2022-07-10 05:13:59,362][25689] Fps is (10 sec: 5402.6, 60 sec: 5625.4, 300 sec: 5625.6). Total num frames: 596116480. Throughput: 0: 5931.2. Samples: 596126028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:13:59,363][25689] Avg episode reward: [(0, '-24.152')] [2022-07-10 05:13:59,673][26022] Updated weights on worker 0-0, policy_version 582148 (0.00079) [2022-07-10 05:14:01,538][26022] Updated weights on worker 0-0, policy_version 582158 (0.00088) [2022-07-10 05:14:02,653][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:14:02,668][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000582163_596134912.pth [2022-07-10 05:14:02,668][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000580183_594107392.pth [2022-07-10 05:14:03,600][26022] Updated weights on worker 0-0, policy_version 582168 (0.00087) [2022-07-10 05:14:04,368][25689] Fps is (10 sec: 5300.0, 60 sec: 5644.7, 300 sec: 5628.0). Total num frames: 596143104. Throughput: 0: 5013.1. Samples: 596141758. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:04,368][25689] Avg episode reward: [(0, '-24.311')] [2022-07-10 05:14:05,419][26022] Updated weights on worker 0-0, policy_version 582178 (0.00089) [2022-07-10 05:14:07,189][26022] Updated weights on worker 0-0, policy_version 582188 (0.01103) [2022-07-10 05:14:09,006][26022] Updated weights on worker 0-0, policy_version 582198 (0.00085) [2022-07-10 05:14:09,478][25689] Fps is (10 sec: 5567.5, 60 sec: 5639.6, 300 sec: 5633.1). Total num frames: 596172800. Throughput: 0: 5814.6. Samples: 596175274. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:09,479][25689] Avg episode reward: [(0, '-25.063')] [2022-07-10 05:14:10,822][26022] Updated weights on worker 0-0, policy_version 582208 (0.00086) [2022-07-10 05:14:12,594][26022] Updated weights on worker 0-0, policy_version 582218 (0.00092) [2022-07-10 05:14:14,390][26022] Updated weights on worker 0-0, policy_version 582228 (0.00615) [2022-07-10 05:14:14,495][25689] Fps is (10 sec: 5763.6, 60 sec: 5656.5, 300 sec: 5626.7). Total num frames: 596201472. Throughput: 0: 5837.9. Samples: 596209624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:14,495][25689] Avg episode reward: [(0, '-25.618')] [2022-07-10 05:14:16,351][26022] Updated weights on worker 0-0, policy_version 582238 (0.00087) [2022-07-10 05:14:18,015][26022] Updated weights on worker 0-0, policy_version 582248 (0.00091) [2022-07-10 05:14:19,511][25689] Fps is (10 sec: 5715.7, 60 sec: 5655.8, 300 sec: 5634.8). Total num frames: 596230144. Throughput: 0: 5003.5. Samples: 596226842. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:19,511][25689] Avg episode reward: [(0, '-26.203')] [2022-07-10 05:14:19,792][26022] Updated weights on worker 0-0, policy_version 582258 (0.00086) [2022-07-10 05:14:21,687][26022] Updated weights on worker 0-0, policy_version 582268 (0.00097) [2022-07-10 05:14:23,415][26022] Updated weights on worker 0-0, policy_version 582278 (0.00093) [2022-07-10 05:14:24,519][25689] Fps is (10 sec: 5618.1, 60 sec: 5638.9, 300 sec: 5629.5). Total num frames: 596257792. Throughput: 0: 5901.7. Samples: 596260688. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:24,520][25689] Avg episode reward: [(0, '-25.964')] [2022-07-10 05:14:25,312][26022] Updated weights on worker 0-0, policy_version 582288 (0.00095) [2022-07-10 05:14:27,095][26022] Updated weights on worker 0-0, policy_version 582298 (0.00092) [2022-07-10 05:14:28,960][26022] Updated weights on worker 0-0, policy_version 582308 (0.00088) [2022-07-10 05:14:29,579][25689] Fps is (10 sec: 5797.1, 60 sec: 5706.3, 300 sec: 5637.1). Total num frames: 596288512. Throughput: 0: 5935.8. Samples: 596294590. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:29,580][25689] Avg episode reward: [(0, '-25.239')] [2022-07-10 05:14:30,831][26022] Updated weights on worker 0-0, policy_version 582318 (0.00099) [2022-07-10 05:14:32,383][26022] Updated weights on worker 0-0, policy_version 582328 (0.00090) [2022-07-10 05:14:34,411][26022] Updated weights on worker 0-0, policy_version 582338 (0.00086) [2022-07-10 05:14:34,652][25689] Fps is (10 sec: 5659.3, 60 sec: 5618.2, 300 sec: 5629.3). Total num frames: 596315136. Throughput: 0: 5067.7. Samples: 596311776. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:34,653][25689] Avg episode reward: [(0, '-24.690')] [2022-07-10 05:14:35,920][26022] Updated weights on worker 0-0, policy_version 582348 (0.00085) [2022-07-10 05:14:37,995][26022] Updated weights on worker 0-0, policy_version 582358 (0.00091) [2022-07-10 05:14:39,519][26022] Updated weights on worker 0-0, policy_version 582368 (0.00083) [2022-07-10 05:14:39,684][25689] Fps is (10 sec: 5573.8, 60 sec: 5641.0, 300 sec: 5629.0). Total num frames: 596344832. Throughput: 0: 5909.2. Samples: 596346048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:39,684][25689] Avg episode reward: [(0, '-24.947')] [2022-07-10 05:14:41,458][26022] Updated weights on worker 0-0, policy_version 582378 (0.00087) [2022-07-10 05:14:43,312][26022] Updated weights on worker 0-0, policy_version 582388 (0.00086) [2022-07-10 05:14:44,727][25689] Fps is (10 sec: 5691.8, 60 sec: 5656.8, 300 sec: 5627.0). Total num frames: 596372480. Throughput: 0: 5912.6. Samples: 596380168. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:44,729][25689] Avg episode reward: [(0, '-24.002')] [2022-07-10 05:14:45,146][26022] Updated weights on worker 0-0, policy_version 582398 (0.00084) [2022-07-10 05:14:46,731][26022] Updated weights on worker 0-0, policy_version 582408 (0.00090) [2022-07-10 05:14:48,817][26022] Updated weights on worker 0-0, policy_version 582418 (0.00081) [2022-07-10 05:14:49,789][25689] Fps is (10 sec: 5775.7, 60 sec: 5638.8, 300 sec: 5632.9). Total num frames: 596403200. Throughput: 0: 5071.0. Samples: 596397076. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:49,790][25689] Avg episode reward: [(0, '-22.518')] [2022-07-10 05:14:50,456][26022] Updated weights on worker 0-0, policy_version 582428 (0.00087) [2022-07-10 05:14:52,308][26022] Updated weights on worker 0-0, policy_version 582438 (0.00107) [2022-07-10 05:14:54,076][26022] Updated weights on worker 0-0, policy_version 582448 (0.00773) [2022-07-10 05:14:54,791][25689] Fps is (10 sec: 5697.8, 60 sec: 5622.1, 300 sec: 5633.0). Total num frames: 596429824. Throughput: 0: 5925.2. Samples: 596431104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:54,792][25689] Avg episode reward: [(0, '-22.075')] [2022-07-10 05:14:55,915][26022] Updated weights on worker 0-0, policy_version 582458 (0.00081) [2022-07-10 05:14:57,838][26022] Updated weights on worker 0-0, policy_version 582468 (0.00100) [2022-07-10 05:14:59,438][26022] Updated weights on worker 0-0, policy_version 582478 (0.00085) [2022-07-10 05:14:59,805][25689] Fps is (10 sec: 5623.1, 60 sec: 5675.4, 300 sec: 5645.9). Total num frames: 596459520. Throughput: 0: 5932.6. Samples: 596465422. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:14:59,806][25689] Avg episode reward: [(0, '-21.961')] [2022-07-10 05:15:01,248][26022] Updated weights on worker 0-0, policy_version 582488 (0.00091) [2022-07-10 05:15:03,478][26022] Updated weights on worker 0-0, policy_version 582498 (0.00084) [2022-07-10 05:15:04,836][25689] Fps is (10 sec: 5504.9, 60 sec: 5656.1, 300 sec: 5637.4). Total num frames: 596485120. Throughput: 0: 4986.4. Samples: 596480440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:04,837][25689] Avg episode reward: [(0, '-21.808')] [2022-07-10 05:15:05,093][26022] Updated weights on worker 0-0, policy_version 582508 (0.00088) [2022-07-10 05:15:07,371][26022] Updated weights on worker 0-0, policy_version 582518 (0.00093) [2022-07-10 05:15:08,777][26022] Updated weights on worker 0-0, policy_version 582528 (0.00070) [2022-07-10 05:15:09,948][25689] Fps is (10 sec: 5351.1, 60 sec: 5639.1, 300 sec: 5636.5). Total num frames: 596513792. Throughput: 0: 5824.1. Samples: 596514478. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:09,948][25689] Avg episode reward: [(0, '-21.301')] [2022-07-10 05:15:10,866][26022] Updated weights on worker 0-0, policy_version 582538 (0.00088) [2022-07-10 05:15:12,501][26022] Updated weights on worker 0-0, policy_version 582548 (0.00093) [2022-07-10 05:15:14,415][26022] Updated weights on worker 0-0, policy_version 582558 (0.00097) [2022-07-10 05:15:14,982][25689] Fps is (10 sec: 5652.0, 60 sec: 5637.4, 300 sec: 5635.9). Total num frames: 596542464. Throughput: 0: 5807.4. Samples: 596548360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:14,982][25689] Avg episode reward: [(0, '-22.558')] [2022-07-10 05:15:16,264][26022] Updated weights on worker 0-0, policy_version 582568 (0.00089) [2022-07-10 05:15:17,793][26022] Updated weights on worker 0-0, policy_version 582578 (0.00085) [2022-07-10 05:15:19,786][26022] Updated weights on worker 0-0, policy_version 582588 (0.00088) [2022-07-10 05:15:19,989][25689] Fps is (10 sec: 5710.5, 60 sec: 5638.2, 300 sec: 5632.3). Total num frames: 596571136. Throughput: 0: 4954.7. Samples: 596565428. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:19,990][25689] Avg episode reward: [(0, '-23.000')] [2022-07-10 05:15:21,633][26022] Updated weights on worker 0-0, policy_version 582598 (0.00824) [2022-07-10 05:15:23,231][26022] Updated weights on worker 0-0, policy_version 582608 (0.00090) [2022-07-10 05:15:25,010][25689] Fps is (10 sec: 5616.0, 60 sec: 5637.1, 300 sec: 5632.9). Total num frames: 596598784. Throughput: 0: 5909.2. Samples: 596599654. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:25,011][25689] Avg episode reward: [(0, '-23.399')] [2022-07-10 05:15:25,355][26022] Updated weights on worker 0-0, policy_version 582618 (0.00080) [2022-07-10 05:15:26,796][26022] Updated weights on worker 0-0, policy_version 582628 (0.00089) [2022-07-10 05:15:28,873][26022] Updated weights on worker 0-0, policy_version 582638 (0.00096) [2022-07-10 05:15:30,083][25689] Fps is (10 sec: 5782.4, 60 sec: 5635.8, 300 sec: 5643.2). Total num frames: 596629504. Throughput: 0: 5930.2. Samples: 596633888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:30,084][25689] Avg episode reward: [(0, '-24.213')] [2022-07-10 05:15:30,555][26022] Updated weights on worker 0-0, policy_version 582648 (0.00087) [2022-07-10 05:15:32,377][26022] Updated weights on worker 0-0, policy_version 582658 (0.00090) [2022-07-10 05:15:34,238][26022] Updated weights on worker 0-0, policy_version 582668 (0.00086) [2022-07-10 05:15:35,107][25689] Fps is (10 sec: 5781.1, 60 sec: 5657.4, 300 sec: 5637.3). Total num frames: 596657152. Throughput: 0: 5097.4. Samples: 596650944. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:35,107][25689] Avg episode reward: [(0, '-24.465')] [2022-07-10 05:15:35,987][26022] Updated weights on worker 0-0, policy_version 582678 (0.00090) [2022-07-10 05:15:37,706][26022] Updated weights on worker 0-0, policy_version 582688 (0.00097) [2022-07-10 05:15:39,657][26022] Updated weights on worker 0-0, policy_version 582698 (0.00087) [2022-07-10 05:15:40,116][25689] Fps is (10 sec: 5511.5, 60 sec: 5625.6, 300 sec: 5637.5). Total num frames: 596684800. Throughput: 0: 5964.9. Samples: 596685482. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:40,117][25689] Avg episode reward: [(0, '-24.563')] [2022-07-10 05:15:41,213][26022] Updated weights on worker 0-0, policy_version 582708 (0.00085) [2022-07-10 05:15:43,216][26022] Updated weights on worker 0-0, policy_version 582718 (0.00090) [2022-07-10 05:15:44,980][26022] Updated weights on worker 0-0, policy_version 582728 (0.00089) [2022-07-10 05:15:45,125][25689] Fps is (10 sec: 5621.6, 60 sec: 5645.7, 300 sec: 5642.5). Total num frames: 596713472. Throughput: 0: 5961.3. Samples: 596719564. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 05:15:45,126][25689] Avg episode reward: [(0, '-25.025')] [2022-07-10 05:15:46,794][26022] Updated weights on worker 0-0, policy_version 582738 (0.00086) [2022-07-10 05:15:48,575][26022] Updated weights on worker 0-0, policy_version 582748 (0.00086) [2022-07-10 05:15:50,183][25689] Fps is (10 sec: 5798.1, 60 sec: 5629.2, 300 sec: 5646.2). Total num frames: 596743168. Throughput: 0: 5110.3. Samples: 596736602. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:15:50,183][25689] Avg episode reward: [(0, '-24.725')] [2022-07-10 05:15:50,366][26022] Updated weights on worker 0-0, policy_version 582758 (0.00085) [2022-07-10 05:15:52,109][26022] Updated weights on worker 0-0, policy_version 582768 (0.00105) [2022-07-10 05:15:54,089][26022] Updated weights on worker 0-0, policy_version 582778 (0.00090) [2022-07-10 05:15:55,207][25689] Fps is (10 sec: 5586.4, 60 sec: 5627.2, 300 sec: 5636.1). Total num frames: 596769792. Throughput: 0: 5958.0. Samples: 596770700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:15:55,207][25689] Avg episode reward: [(0, '-23.777')] [2022-07-10 05:15:55,673][26022] Updated weights on worker 0-0, policy_version 582788 (0.00082) [2022-07-10 05:15:57,732][26022] Updated weights on worker 0-0, policy_version 582798 (0.00091) [2022-07-10 05:15:59,356][26022] Updated weights on worker 0-0, policy_version 582808 (0.00086) [2022-07-10 05:16:00,214][25689] Fps is (10 sec: 5716.4, 60 sec: 5644.7, 300 sec: 5653.3). Total num frames: 596800512. Throughput: 0: 5930.2. Samples: 596804668. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:00,215][25689] Avg episode reward: [(0, '-23.758')] [2022-07-10 05:16:01,244][26022] Updated weights on worker 0-0, policy_version 582818 (0.00091) [2022-07-10 05:16:02,968][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:16:02,992][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000582825_596812800.pth [2022-07-10 05:16:02,993][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000580841_594781184.pth [2022-07-10 05:16:03,323][26022] Updated weights on worker 0-0, policy_version 582828 (0.00084) [2022-07-10 05:16:05,292][25689] Fps is (10 sec: 5584.3, 60 sec: 5640.3, 300 sec: 5642.2). Total num frames: 596826112. Throughput: 0: 4956.4. Samples: 596819520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:05,293][25689] Avg episode reward: [(0, '-23.633')] [2022-07-10 05:16:05,299][26022] Updated weights on worker 0-0, policy_version 582838 (0.00094) [2022-07-10 05:16:06,923][26022] Updated weights on worker 0-0, policy_version 582848 (0.00085) [2022-07-10 05:16:08,971][26022] Updated weights on worker 0-0, policy_version 582858 (0.00088) [2022-07-10 05:16:10,412][25689] Fps is (10 sec: 5422.7, 60 sec: 5656.5, 300 sec: 5644.0). Total num frames: 596855808. Throughput: 0: 5782.8. Samples: 596853580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:10,412][25689] Avg episode reward: [(0, '-23.016')] [2022-07-10 05:16:10,545][26022] Updated weights on worker 0-0, policy_version 582868 (0.00098) [2022-07-10 05:16:12,584][26022] Updated weights on worker 0-0, policy_version 582878 (0.00086) [2022-07-10 05:16:14,125][26022] Updated weights on worker 0-0, policy_version 582888 (0.00088) [2022-07-10 05:16:15,415][25689] Fps is (10 sec: 5563.7, 60 sec: 5625.5, 300 sec: 5640.6). Total num frames: 596882432. Throughput: 0: 5783.7. Samples: 596887578. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:15,416][25689] Avg episode reward: [(0, '-22.683')] [2022-07-10 05:16:16,137][26022] Updated weights on worker 0-0, policy_version 582898 (0.00091) [2022-07-10 05:16:17,920][26022] Updated weights on worker 0-0, policy_version 582908 (0.00090) [2022-07-10 05:16:19,659][26022] Updated weights on worker 0-0, policy_version 582918 (0.00097) [2022-07-10 05:16:20,463][25689] Fps is (10 sec: 5603.4, 60 sec: 5638.7, 300 sec: 5639.9). Total num frames: 596912128. Throughput: 0: 4933.6. Samples: 596904564. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:20,463][25689] Avg episode reward: [(0, '-22.424')] [2022-07-10 05:16:21,618][26022] Updated weights on worker 0-0, policy_version 582928 (0.00098) [2022-07-10 05:16:23,148][26022] Updated weights on worker 0-0, policy_version 582938 (0.00087) [2022-07-10 05:16:25,129][26022] Updated weights on worker 0-0, policy_version 582948 (0.00090) [2022-07-10 05:16:25,474][25689] Fps is (10 sec: 5700.7, 60 sec: 5639.6, 300 sec: 5641.1). Total num frames: 596939776. Throughput: 0: 5915.1. Samples: 596938898. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:25,475][25689] Avg episode reward: [(0, '-23.214')] [2022-07-10 05:16:26,779][26022] Updated weights on worker 0-0, policy_version 582958 (0.00086) [2022-07-10 05:16:28,757][26022] Updated weights on worker 0-0, policy_version 582968 (0.00089) [2022-07-10 05:16:30,496][26022] Updated weights on worker 0-0, policy_version 582978 (0.00091) [2022-07-10 05:16:30,545][25689] Fps is (10 sec: 5687.5, 60 sec: 5622.9, 300 sec: 5644.1). Total num frames: 596969472. Throughput: 0: 5916.9. Samples: 596972708. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:30,546][25689] Avg episode reward: [(0, '-23.366')] [2022-07-10 05:16:32,211][26022] Updated weights on worker 0-0, policy_version 582988 (0.00086) [2022-07-10 05:16:34,152][26022] Updated weights on worker 0-0, policy_version 582998 (0.00098) [2022-07-10 05:16:35,554][25689] Fps is (10 sec: 5790.5, 60 sec: 5641.1, 300 sec: 5637.2). Total num frames: 596998144. Throughput: 0: 5928.5. Samples: 597006972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:35,555][25689] Avg episode reward: [(0, '-24.091')] [2022-07-10 05:16:35,828][26022] Updated weights on worker 0-0, policy_version 583008 (0.00090) [2022-07-10 05:16:37,871][26022] Updated weights on worker 0-0, policy_version 583018 (0.00093) [2022-07-10 05:16:39,627][26022] Updated weights on worker 0-0, policy_version 583028 (0.00092) [2022-07-10 05:16:40,603][25689] Fps is (10 sec: 5599.5, 60 sec: 5637.4, 300 sec: 5640.0). Total num frames: 597025792. Throughput: 0: 5930.1. Samples: 597024000. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:40,604][25689] Avg episode reward: [(0, '-24.350')] [2022-07-10 05:16:41,265][26022] Updated weights on worker 0-0, policy_version 583038 (0.00092) [2022-07-10 05:16:43,219][26022] Updated weights on worker 0-0, policy_version 583048 (0.00097) [2022-07-10 05:16:44,967][26022] Updated weights on worker 0-0, policy_version 583058 (0.00098) [2022-07-10 05:16:45,616][25689] Fps is (10 sec: 5597.3, 60 sec: 5637.1, 300 sec: 5637.4). Total num frames: 597054464. Throughput: 0: 5920.7. Samples: 597058152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:45,618][25689] Avg episode reward: [(0, '-24.539')] [2022-07-10 05:16:46,734][26022] Updated weights on worker 0-0, policy_version 583068 (0.00089) [2022-07-10 05:16:48,747][26022] Updated weights on worker 0-0, policy_version 583078 (0.00366) [2022-07-10 05:16:50,312][26022] Updated weights on worker 0-0, policy_version 583088 (0.00087) [2022-07-10 05:16:50,759][25689] Fps is (10 sec: 5747.3, 60 sec: 5629.2, 300 sec: 5638.8). Total num frames: 597084160. Throughput: 0: 5918.3. Samples: 597092340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:50,760][25689] Avg episode reward: [(0, '-25.422')] [2022-07-10 05:16:52,159][26022] Updated weights on worker 0-0, policy_version 583098 (0.00082) [2022-07-10 05:16:53,923][26022] Updated weights on worker 0-0, policy_version 583108 (0.00086) [2022-07-10 05:16:55,537][26022] Updated weights on worker 0-0, policy_version 583118 (0.00086) [2022-07-10 05:16:55,851][25689] Fps is (10 sec: 5702.9, 60 sec: 5656.6, 300 sec: 5644.4). Total num frames: 597112832. Throughput: 0: 5047.2. Samples: 597109414. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:16:55,851][25689] Avg episode reward: [(0, '-25.007')] [2022-07-10 05:16:57,815][26022] Updated weights on worker 0-0, policy_version 583128 (0.00088) [2022-07-10 05:16:59,382][26022] Updated weights on worker 0-0, policy_version 583138 (0.00098) [2022-07-10 05:17:00,863][25689] Fps is (10 sec: 5573.8, 60 sec: 5605.5, 300 sec: 5647.8). Total num frames: 597140480. Throughput: 0: 5889.6. Samples: 597143324. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:00,864][25689] Avg episode reward: [(0, '-24.065')] [2022-07-10 05:17:01,244][26022] Updated weights on worker 0-0, policy_version 583148 (0.00946) [2022-07-10 05:17:03,256][26022] Updated weights on worker 0-0, policy_version 583158 (0.00086) [2022-07-10 05:17:05,144][26022] Updated weights on worker 0-0, policy_version 583168 (0.00084) [2022-07-10 05:17:05,949][25689] Fps is (10 sec: 5374.4, 60 sec: 5621.6, 300 sec: 5640.1). Total num frames: 597167104. Throughput: 0: 5768.9. Samples: 597175452. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:05,950][25689] Avg episode reward: [(0, '-24.938')] [2022-07-10 05:17:07,009][26022] Updated weights on worker 0-0, policy_version 583178 (0.00087) [2022-07-10 05:17:08,976][26022] Updated weights on worker 0-0, policy_version 583188 (0.00084) [2022-07-10 05:17:10,555][26022] Updated weights on worker 0-0, policy_version 583198 (0.00083) [2022-07-10 05:17:11,003][25689] Fps is (10 sec: 5554.6, 60 sec: 5627.8, 300 sec: 5642.8). Total num frames: 597196800. Throughput: 0: 4928.3. Samples: 597192108. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:11,003][25689] Avg episode reward: [(0, '-25.198')] [2022-07-10 05:17:12,702][26022] Updated weights on worker 0-0, policy_version 583208 (0.00095) [2022-07-10 05:17:14,077][26022] Updated weights on worker 0-0, policy_version 583218 (0.00088) [2022-07-10 05:17:16,023][25689] Fps is (10 sec: 5489.2, 60 sec: 5609.3, 300 sec: 5632.1). Total num frames: 597222400. Throughput: 0: 5783.8. Samples: 597226084. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:16,023][25689] Avg episode reward: [(0, '-24.818')] [2022-07-10 05:17:16,259][26022] Updated weights on worker 0-0, policy_version 583228 (0.00083) [2022-07-10 05:17:17,874][26022] Updated weights on worker 0-0, policy_version 583238 (0.00088) [2022-07-10 05:17:19,788][26022] Updated weights on worker 0-0, policy_version 583248 (0.00097) [2022-07-10 05:17:21,032][25689] Fps is (10 sec: 5615.7, 60 sec: 5629.8, 300 sec: 5646.0). Total num frames: 597253120. Throughput: 0: 5778.9. Samples: 597259874. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:21,032][25689] Avg episode reward: [(0, '-23.709')] [2022-07-10 05:17:21,534][26022] Updated weights on worker 0-0, policy_version 583258 (0.00091) [2022-07-10 05:17:23,395][26022] Updated weights on worker 0-0, policy_version 583268 (0.00086) [2022-07-10 05:17:25,215][26022] Updated weights on worker 0-0, policy_version 583278 (0.00087) [2022-07-10 05:17:26,059][25689] Fps is (10 sec: 5713.8, 60 sec: 5611.5, 300 sec: 5636.2). Total num frames: 597279744. Throughput: 0: 5044.9. Samples: 597276902. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:26,059][25689] Avg episode reward: [(0, '-24.204')] [2022-07-10 05:17:27,027][26022] Updated weights on worker 0-0, policy_version 583288 (0.00085) [2022-07-10 05:17:29,027][26022] Updated weights on worker 0-0, policy_version 583298 (0.00081) [2022-07-10 05:17:30,870][26022] Updated weights on worker 0-0, policy_version 583308 (0.00099) [2022-07-10 05:17:31,191][25689] Fps is (10 sec: 5543.8, 60 sec: 5605.8, 300 sec: 5633.9). Total num frames: 597309440. Throughput: 0: 5858.6. Samples: 597310382. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:31,191][25689] Avg episode reward: [(0, '-24.267')] [2022-07-10 05:17:32,497][26022] Updated weights on worker 0-0, policy_version 583318 (0.00086) [2022-07-10 05:17:34,306][26022] Updated weights on worker 0-0, policy_version 583328 (0.00089) [2022-07-10 05:17:36,194][26022] Updated weights on worker 0-0, policy_version 583338 (0.00093) [2022-07-10 05:17:36,289][25689] Fps is (10 sec: 5705.1, 60 sec: 5597.5, 300 sec: 5639.1). Total num frames: 597338112. Throughput: 0: 5820.8. Samples: 597344052. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:36,290][25689] Avg episode reward: [(0, '-24.418')] [2022-07-10 05:17:38,226][26022] Updated weights on worker 0-0, policy_version 583348 (0.00094) [2022-07-10 05:17:39,958][26022] Updated weights on worker 0-0, policy_version 583358 (0.00087) [2022-07-10 05:17:41,323][25689] Fps is (10 sec: 5558.7, 60 sec: 5599.0, 300 sec: 5628.4). Total num frames: 597365760. Throughput: 0: 4966.7. Samples: 597360654. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:41,323][25689] Avg episode reward: [(0, '-24.044')] [2022-07-10 05:17:41,744][26022] Updated weights on worker 0-0, policy_version 583368 (0.00065) [2022-07-10 05:17:43,394][26022] Updated weights on worker 0-0, policy_version 583378 (0.00087) [2022-07-10 05:17:45,440][26022] Updated weights on worker 0-0, policy_version 583388 (0.00092) [2022-07-10 05:17:46,332][25689] Fps is (10 sec: 5608.4, 60 sec: 5599.4, 300 sec: 5636.2). Total num frames: 597394432. Throughput: 0: 5809.7. Samples: 597394682. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:46,332][25689] Avg episode reward: [(0, '-24.491')] [2022-07-10 05:17:46,983][26022] Updated weights on worker 0-0, policy_version 583398 (0.00086) [2022-07-10 05:17:49,019][26022] Updated weights on worker 0-0, policy_version 583408 (0.00091) [2022-07-10 05:17:50,833][26022] Updated weights on worker 0-0, policy_version 583418 (0.00086) [2022-07-10 05:17:51,402][25689] Fps is (10 sec: 5689.5, 60 sec: 5589.2, 300 sec: 5635.0). Total num frames: 597423104. Throughput: 0: 5845.3. Samples: 597428520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:51,402][25689] Avg episode reward: [(0, '-24.713')] [2022-07-10 05:17:52,658][26022] Updated weights on worker 0-0, policy_version 583428 (0.00089) [2022-07-10 05:17:54,346][26022] Updated weights on worker 0-0, policy_version 583438 (0.00087) [2022-07-10 05:17:56,214][26022] Updated weights on worker 0-0, policy_version 583448 (0.00089) [2022-07-10 05:17:56,433][25689] Fps is (10 sec: 5676.8, 60 sec: 5594.8, 300 sec: 5631.0). Total num frames: 597451776. Throughput: 0: 5049.7. Samples: 597445770. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:17:56,433][25689] Avg episode reward: [(0, '-24.947')] [2022-07-10 05:17:57,915][26022] Updated weights on worker 0-0, policy_version 583458 (0.00089) [2022-07-10 05:17:59,758][26022] Updated weights on worker 0-0, policy_version 583468 (0.00085) [2022-07-10 05:18:01,534][25689] Fps is (10 sec: 5659.5, 60 sec: 5603.5, 300 sec: 5640.0). Total num frames: 597480448. Throughput: 0: 5898.5. Samples: 597479870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:01,534][25689] Avg episode reward: [(0, '-24.740')] [2022-07-10 05:18:01,690][26022] Updated weights on worker 0-0, policy_version 583478 (0.00096) [2022-07-10 05:18:03,127][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:18:03,142][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000583485_597488640.pth [2022-07-10 05:18:03,142][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000581500_595456000.pth [2022-07-10 05:18:03,896][26022] Updated weights on worker 0-0, policy_version 583488 (0.00093) [2022-07-10 05:18:05,575][26022] Updated weights on worker 0-0, policy_version 583498 (0.00090) [2022-07-10 05:18:06,538][25689] Fps is (10 sec: 5472.3, 60 sec: 5611.1, 300 sec: 5630.7). Total num frames: 597507072. Throughput: 0: 5813.2. Samples: 597512144. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:06,538][25689] Avg episode reward: [(0, '-24.858')] [2022-07-10 05:18:07,349][26022] Updated weights on worker 0-0, policy_version 583508 (0.00101) [2022-07-10 05:18:09,094][26022] Updated weights on worker 0-0, policy_version 583518 (0.00091) [2022-07-10 05:18:11,162][26022] Updated weights on worker 0-0, policy_version 583528 (0.00081) [2022-07-10 05:18:11,596][25689] Fps is (10 sec: 5495.5, 60 sec: 5593.8, 300 sec: 5633.3). Total num frames: 597535744. Throughput: 0: 4967.3. Samples: 597528832. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:11,598][25689] Avg episode reward: [(0, '-24.743')] [2022-07-10 05:18:12,875][26022] Updated weights on worker 0-0, policy_version 583538 (0.00088) [2022-07-10 05:18:14,598][26022] Updated weights on worker 0-0, policy_version 583548 (0.00090) [2022-07-10 05:18:16,380][26022] Updated weights on worker 0-0, policy_version 583558 (0.00172) [2022-07-10 05:18:16,642][25689] Fps is (10 sec: 5776.4, 60 sec: 5658.9, 300 sec: 5636.1). Total num frames: 597565440. Throughput: 0: 5800.3. Samples: 597562990. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:16,644][25689] Avg episode reward: [(0, '-26.941')] [2022-07-10 05:18:18,243][26022] Updated weights on worker 0-0, policy_version 583568 (0.00092) [2022-07-10 05:18:20,029][26022] Updated weights on worker 0-0, policy_version 583578 (0.00094) [2022-07-10 05:18:21,664][25689] Fps is (10 sec: 5695.9, 60 sec: 5607.1, 300 sec: 5632.4). Total num frames: 597593088. Throughput: 0: 5823.8. Samples: 597597102. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:21,665][25689] Avg episode reward: [(0, '-26.513')] [2022-07-10 05:18:21,884][26022] Updated weights on worker 0-0, policy_version 583588 (0.00087) [2022-07-10 05:18:23,608][26022] Updated weights on worker 0-0, policy_version 583598 (0.00087) [2022-07-10 05:18:25,528][26022] Updated weights on worker 0-0, policy_version 583608 (0.00086) [2022-07-10 05:18:26,676][25689] Fps is (10 sec: 5613.0, 60 sec: 5642.2, 300 sec: 5640.1). Total num frames: 597621760. Throughput: 0: 5070.6. Samples: 597614258. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:26,678][25689] Avg episode reward: [(0, '-25.919')] [2022-07-10 05:18:27,365][26022] Updated weights on worker 0-0, policy_version 583618 (0.00090) [2022-07-10 05:18:29,227][26022] Updated weights on worker 0-0, policy_version 583628 (0.00071) [2022-07-10 05:18:30,899][26022] Updated weights on worker 0-0, policy_version 583638 (0.00085) [2022-07-10 05:18:31,787][25689] Fps is (10 sec: 5462.2, 60 sec: 5593.5, 300 sec: 5621.5). Total num frames: 597648384. Throughput: 0: 5883.1. Samples: 597647618. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:31,787][25689] Avg episode reward: [(0, '-25.987')] [2022-07-10 05:18:32,733][26022] Updated weights on worker 0-0, policy_version 583648 (0.00091) [2022-07-10 05:18:34,670][26022] Updated weights on worker 0-0, policy_version 583658 (0.00094) [2022-07-10 05:18:36,217][26022] Updated weights on worker 0-0, policy_version 583668 (0.00084) [2022-07-10 05:18:36,820][25689] Fps is (10 sec: 5551.9, 60 sec: 5616.4, 300 sec: 5626.1). Total num frames: 597678080. Throughput: 0: 5885.5. Samples: 597681748. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:36,821][25689] Avg episode reward: [(0, '-25.579')] [2022-07-10 05:18:38,223][26022] Updated weights on worker 0-0, policy_version 583678 (0.00108) [2022-07-10 05:18:40,156][26022] Updated weights on worker 0-0, policy_version 583688 (0.00089) [2022-07-10 05:18:41,764][26022] Updated weights on worker 0-0, policy_version 583698 (0.00086) [2022-07-10 05:18:41,867][25689] Fps is (10 sec: 5790.5, 60 sec: 5632.1, 300 sec: 5632.7). Total num frames: 597706752. Throughput: 0: 5859.8. Samples: 597715490. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:41,867][25689] Avg episode reward: [(0, '-26.012')] [2022-07-10 05:18:43,683][26022] Updated weights on worker 0-0, policy_version 583708 (0.00057) [2022-07-10 05:18:45,438][26022] Updated weights on worker 0-0, policy_version 583718 (0.00091) [2022-07-10 05:18:46,904][25689] Fps is (10 sec: 5585.0, 60 sec: 5612.5, 300 sec: 5619.2). Total num frames: 597734400. Throughput: 0: 5848.0. Samples: 597732554. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:46,905][25689] Avg episode reward: [(0, '-24.752')] [2022-07-10 05:18:47,323][26022] Updated weights on worker 0-0, policy_version 583728 (0.00085) [2022-07-10 05:18:49,129][26022] Updated weights on worker 0-0, policy_version 583738 (0.00089) [2022-07-10 05:18:50,861][26022] Updated weights on worker 0-0, policy_version 583748 (0.00084) [2022-07-10 05:18:52,002][25689] Fps is (10 sec: 5556.8, 60 sec: 5610.0, 300 sec: 5620.8). Total num frames: 597763072. Throughput: 0: 5870.3. Samples: 597766288. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 05:18:52,003][25689] Avg episode reward: [(0, '-24.966')] [2022-07-10 05:18:52,944][26022] Updated weights on worker 0-0, policy_version 583758 (0.00088) [2022-07-10 05:18:54,534][26022] Updated weights on worker 0-0, policy_version 583768 (0.00093) [2022-07-10 05:18:56,491][26022] Updated weights on worker 0-0, policy_version 583778 (0.00094) [2022-07-10 05:18:57,025][25689] Fps is (10 sec: 5665.9, 60 sec: 5610.7, 300 sec: 5628.1). Total num frames: 597791744. Throughput: 0: 5852.2. Samples: 597799992. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:18:57,026][25689] Avg episode reward: [(0, '-24.694')] [2022-07-10 05:18:58,414][26022] Updated weights on worker 0-0, policy_version 583788 (0.00086) [2022-07-10 05:19:00,108][26022] Updated weights on worker 0-0, policy_version 583798 (0.00083) [2022-07-10 05:19:02,111][25689] Fps is (10 sec: 5470.3, 60 sec: 5578.4, 300 sec: 5626.5). Total num frames: 597818368. Throughput: 0: 5019.9. Samples: 597817112. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:02,111][25689] Avg episode reward: [(0, '-25.202')] [2022-07-10 05:19:02,178][26022] Updated weights on worker 0-0, policy_version 583808 (0.00084) [2022-07-10 05:19:04,062][26022] Updated weights on worker 0-0, policy_version 583818 (0.00105) [2022-07-10 05:19:05,851][26022] Updated weights on worker 0-0, policy_version 583828 (0.00087) [2022-07-10 05:19:07,140][25689] Fps is (10 sec: 5466.7, 60 sec: 5609.8, 300 sec: 5624.6). Total num frames: 597847040. Throughput: 0: 5758.6. Samples: 597849086. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:07,141][25689] Avg episode reward: [(0, '-24.355')] [2022-07-10 05:19:07,728][26022] Updated weights on worker 0-0, policy_version 583838 (0.00085) [2022-07-10 05:19:09,402][26022] Updated weights on worker 0-0, policy_version 583848 (0.00088) [2022-07-10 05:19:11,359][26022] Updated weights on worker 0-0, policy_version 583858 (0.00090) [2022-07-10 05:19:12,266][25689] Fps is (10 sec: 5646.9, 60 sec: 5603.6, 300 sec: 5622.6). Total num frames: 597875712. Throughput: 0: 5761.9. Samples: 597883044. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:12,266][25689] Avg episode reward: [(0, '-24.113')] [2022-07-10 05:19:12,992][26022] Updated weights on worker 0-0, policy_version 583868 (0.00090) [2022-07-10 05:19:14,825][26022] Updated weights on worker 0-0, policy_version 583878 (0.00055) [2022-07-10 05:19:16,468][26022] Updated weights on worker 0-0, policy_version 583888 (0.00089) [2022-07-10 05:19:17,329][25689] Fps is (10 sec: 5628.3, 60 sec: 5585.1, 300 sec: 5621.7). Total num frames: 597904384. Throughput: 0: 4930.7. Samples: 597900102. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:17,330][25689] Avg episode reward: [(0, '-24.138')] [2022-07-10 05:19:18,543][26022] Updated weights on worker 0-0, policy_version 583898 (0.00091) [2022-07-10 05:19:20,121][26022] Updated weights on worker 0-0, policy_version 583908 (0.00084) [2022-07-10 05:19:22,110][26022] Updated weights on worker 0-0, policy_version 583918 (0.00088) [2022-07-10 05:19:22,339][25689] Fps is (10 sec: 5794.5, 60 sec: 5620.0, 300 sec: 5628.5). Total num frames: 597934080. Throughput: 0: 5788.9. Samples: 597934210. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:22,339][25689] Avg episode reward: [(0, '-24.325')] [2022-07-10 05:19:23,742][26022] Updated weights on worker 0-0, policy_version 583928 (0.00095) [2022-07-10 05:19:25,624][26022] Updated weights on worker 0-0, policy_version 583938 (0.00072) [2022-07-10 05:19:27,342][25689] Fps is (10 sec: 5624.8, 60 sec: 5587.1, 300 sec: 5615.9). Total num frames: 597960704. Throughput: 0: 5902.2. Samples: 597968318. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:27,343][25689] Avg episode reward: [(0, '-24.344')] [2022-07-10 05:19:27,782][26022] Updated weights on worker 0-0, policy_version 583948 (0.00092) [2022-07-10 05:19:29,309][26022] Updated weights on worker 0-0, policy_version 583958 (0.00083) [2022-07-10 05:19:31,260][26022] Updated weights on worker 0-0, policy_version 583968 (0.00089) [2022-07-10 05:19:32,410][25689] Fps is (10 sec: 5490.4, 60 sec: 5624.8, 300 sec: 5622.8). Total num frames: 597989376. Throughput: 0: 5063.9. Samples: 597985054. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:32,412][25689] Avg episode reward: [(0, '-23.998')] [2022-07-10 05:19:32,928][26022] Updated weights on worker 0-0, policy_version 583978 (0.00093) [2022-07-10 05:19:34,771][26022] Updated weights on worker 0-0, policy_version 583988 (0.00087) [2022-07-10 05:19:36,863][26022] Updated weights on worker 0-0, policy_version 583998 (0.00087) [2022-07-10 05:19:37,450][25689] Fps is (10 sec: 5774.4, 60 sec: 5624.2, 300 sec: 5622.7). Total num frames: 598019072. Throughput: 0: 5904.8. Samples: 598018910. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:37,452][25689] Avg episode reward: [(0, '-23.532')] [2022-07-10 05:19:38,379][26022] Updated weights on worker 0-0, policy_version 584008 (0.00092) [2022-07-10 05:19:40,279][26022] Updated weights on worker 0-0, policy_version 584018 (0.00096) [2022-07-10 05:19:42,080][26022] Updated weights on worker 0-0, policy_version 584028 (0.00093) [2022-07-10 05:19:42,512][25689] Fps is (10 sec: 5676.5, 60 sec: 5605.9, 300 sec: 5622.3). Total num frames: 598046720. Throughput: 0: 5891.1. Samples: 598053054. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:42,513][25689] Avg episode reward: [(0, '-22.819')] [2022-07-10 05:19:43,844][26022] Updated weights on worker 0-0, policy_version 584038 (0.00089) [2022-07-10 05:19:45,562][26022] Updated weights on worker 0-0, policy_version 584048 (0.00095) [2022-07-10 05:19:47,528][25689] Fps is (10 sec: 5588.5, 60 sec: 5624.8, 300 sec: 5616.3). Total num frames: 598075392. Throughput: 0: 5045.0. Samples: 598070156. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:47,529][25689] Avg episode reward: [(0, '-23.862')] [2022-07-10 05:19:47,531][26022] Updated weights on worker 0-0, policy_version 584058 (0.00093) [2022-07-10 05:19:49,333][26022] Updated weights on worker 0-0, policy_version 584068 (0.00091) [2022-07-10 05:19:51,187][26022] Updated weights on worker 0-0, policy_version 584078 (0.00089) [2022-07-10 05:19:52,590][25689] Fps is (10 sec: 5690.2, 60 sec: 5628.1, 300 sec: 5622.1). Total num frames: 598104064. Throughput: 0: 5890.1. Samples: 598103914. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:52,590][25689] Avg episode reward: [(0, '-23.832')] [2022-07-10 05:19:52,955][26022] Updated weights on worker 0-0, policy_version 584088 (0.00079) [2022-07-10 05:19:54,736][26022] Updated weights on worker 0-0, policy_version 584098 (0.00086) [2022-07-10 05:19:56,439][26022] Updated weights on worker 0-0, policy_version 584108 (0.00083) [2022-07-10 05:19:57,613][25689] Fps is (10 sec: 5685.6, 60 sec: 5628.0, 300 sec: 5618.4). Total num frames: 598132736. Throughput: 0: 5903.7. Samples: 598137952. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:19:57,614][25689] Avg episode reward: [(0, '-24.148')] [2022-07-10 05:19:58,490][26022] Updated weights on worker 0-0, policy_version 584118 (0.00097) [2022-07-10 05:20:00,046][26022] Updated weights on worker 0-0, policy_version 584128 (0.00087) [2022-07-10 05:20:02,280][26022] Updated weights on worker 0-0, policy_version 584138 (0.00102) [2022-07-10 05:20:02,685][25689] Fps is (10 sec: 5376.1, 60 sec: 5612.4, 300 sec: 5617.7). Total num frames: 598158336. Throughput: 0: 5045.4. Samples: 598154832. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:02,685][25689] Avg episode reward: [(0, '-24.819')] [2022-07-10 05:20:03,322][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:20:03,335][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000584143_598162432.pth [2022-07-10 05:20:03,336][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000582163_596134912.pth [2022-07-10 05:20:04,116][26022] Updated weights on worker 0-0, policy_version 584148 (0.00056) [2022-07-10 05:20:05,977][26022] Updated weights on worker 0-0, policy_version 584158 (0.00086) [2022-07-10 05:20:07,727][25689] Fps is (10 sec: 5265.2, 60 sec: 5594.4, 300 sec: 5615.5). Total num frames: 598185984. Throughput: 0: 5757.8. Samples: 598186458. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:07,727][25689] Avg episode reward: [(0, '-25.704')] [2022-07-10 05:20:08,051][26022] Updated weights on worker 0-0, policy_version 584168 (0.00094) [2022-07-10 05:20:09,489][26022] Updated weights on worker 0-0, policy_version 584178 (0.00085) [2022-07-10 05:20:11,567][26022] Updated weights on worker 0-0, policy_version 584188 (0.00097) [2022-07-10 05:20:12,781][25689] Fps is (10 sec: 5679.8, 60 sec: 5617.9, 300 sec: 5618.6). Total num frames: 598215680. Throughput: 0: 5774.5. Samples: 598220508. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:12,783][25689] Avg episode reward: [(0, '-25.051')] [2022-07-10 05:20:13,238][26022] Updated weights on worker 0-0, policy_version 584198 (0.00091) [2022-07-10 05:20:15,095][26022] Updated weights on worker 0-0, policy_version 584208 (0.00073) [2022-07-10 05:20:16,815][26022] Updated weights on worker 0-0, policy_version 584218 (0.00087) [2022-07-10 05:20:17,785][25689] Fps is (10 sec: 5700.9, 60 sec: 5606.4, 300 sec: 5615.2). Total num frames: 598243328. Throughput: 0: 4934.8. Samples: 598237496. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:17,786][25689] Avg episode reward: [(0, '-23.966')] [2022-07-10 05:20:18,691][26022] Updated weights on worker 0-0, policy_version 584228 (0.00095) [2022-07-10 05:20:20,531][26022] Updated weights on worker 0-0, policy_version 584238 (0.00092) [2022-07-10 05:20:22,207][26022] Updated weights on worker 0-0, policy_version 584248 (0.00090) [2022-07-10 05:20:22,794][25689] Fps is (10 sec: 5726.9, 60 sec: 5606.5, 300 sec: 5622.3). Total num frames: 598273024. Throughput: 0: 5806.3. Samples: 598271592. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:22,794][25689] Avg episode reward: [(0, '-24.768')] [2022-07-10 05:20:24,287][26022] Updated weights on worker 0-0, policy_version 584258 (0.00089) [2022-07-10 05:20:25,965][26022] Updated weights on worker 0-0, policy_version 584268 (0.00085) [2022-07-10 05:20:27,814][25689] Fps is (10 sec: 5615.9, 60 sec: 5604.9, 300 sec: 5609.6). Total num frames: 598299648. Throughput: 0: 5921.3. Samples: 598305402. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:27,815][25689] Avg episode reward: [(0, '-24.488')] [2022-07-10 05:20:27,944][26022] Updated weights on worker 0-0, policy_version 584278 (0.00087) [2022-07-10 05:20:29,815][26022] Updated weights on worker 0-0, policy_version 584288 (0.00094) [2022-07-10 05:20:31,463][26022] Updated weights on worker 0-0, policy_version 584298 (0.00082) [2022-07-10 05:20:32,862][25689] Fps is (10 sec: 5492.3, 60 sec: 5606.8, 300 sec: 5612.6). Total num frames: 598328320. Throughput: 0: 5063.6. Samples: 598322190. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:32,864][25689] Avg episode reward: [(0, '-24.209')] [2022-07-10 05:20:33,230][26022] Updated weights on worker 0-0, policy_version 584308 (0.00094) [2022-07-10 05:20:35,189][26022] Updated weights on worker 0-0, policy_version 584318 (0.00091) [2022-07-10 05:20:36,874][26022] Updated weights on worker 0-0, policy_version 584328 (0.00076) [2022-07-10 05:20:37,900][25689] Fps is (10 sec: 5685.8, 60 sec: 5590.1, 300 sec: 5615.5). Total num frames: 598356992. Throughput: 0: 5886.7. Samples: 598355902. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:37,900][25689] Avg episode reward: [(0, '-25.048')] [2022-07-10 05:20:38,786][26022] Updated weights on worker 0-0, policy_version 584338 (0.00090) [2022-07-10 05:20:40,368][26022] Updated weights on worker 0-0, policy_version 584348 (0.00091) [2022-07-10 05:20:42,396][26022] Updated weights on worker 0-0, policy_version 584358 (0.00082) [2022-07-10 05:20:42,910][25689] Fps is (10 sec: 5604.8, 60 sec: 5594.9, 300 sec: 5612.0). Total num frames: 598384640. Throughput: 0: 5885.8. Samples: 598389994. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:42,912][25689] Avg episode reward: [(0, '-26.044')] [2022-07-10 05:20:43,985][26022] Updated weights on worker 0-0, policy_version 584368 (0.00086) [2022-07-10 05:20:46,060][26022] Updated weights on worker 0-0, policy_version 584378 (0.00085) [2022-07-10 05:20:47,939][25689] Fps is (10 sec: 5508.0, 60 sec: 5576.7, 300 sec: 5605.7). Total num frames: 598412288. Throughput: 0: 5040.8. Samples: 598406846. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:47,939][25689] Avg episode reward: [(0, '-24.849')] [2022-07-10 05:20:47,951][26022] Updated weights on worker 0-0, policy_version 584388 (0.00090) [2022-07-10 05:20:49,446][26022] Updated weights on worker 0-0, policy_version 584398 (0.00086) [2022-07-10 05:20:51,721][26022] Updated weights on worker 0-0, policy_version 584408 (0.00090) [2022-07-10 05:20:52,874][26022] Updated weights on worker 0-0, policy_version 584418 (0.00091) [2022-07-10 05:20:53,064][25689] Fps is (10 sec: 5849.6, 60 sec: 5621.7, 300 sec: 5621.0). Total num frames: 598444032. Throughput: 0: 5874.1. Samples: 598440856. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:53,064][25689] Avg episode reward: [(0, '-25.106')] [2022-07-10 05:20:55,173][26022] Updated weights on worker 0-0, policy_version 584428 (0.00090) [2022-07-10 05:20:56,679][26022] Updated weights on worker 0-0, policy_version 584438 (0.00089) [2022-07-10 05:20:58,083][25689] Fps is (10 sec: 5754.0, 60 sec: 5588.3, 300 sec: 5607.0). Total num frames: 598470656. Throughput: 0: 5909.3. Samples: 598475170. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:20:58,083][25689] Avg episode reward: [(0, '-24.804')] [2022-07-10 05:20:58,524][26022] Updated weights on worker 0-0, policy_version 584448 (0.00095) [2022-07-10 05:21:00,540][26022] Updated weights on worker 0-0, policy_version 584458 (0.00084) [2022-07-10 05:21:02,633][26022] Updated weights on worker 0-0, policy_version 584468 (0.00095) [2022-07-10 05:21:03,129][25689] Fps is (10 sec: 5290.5, 60 sec: 5607.6, 300 sec: 5611.0). Total num frames: 598497280. Throughput: 0: 5778.6. Samples: 598506826. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:03,129][25689] Avg episode reward: [(0, '-25.269')] [2022-07-10 05:21:04,357][26022] Updated weights on worker 0-0, policy_version 584478 (0.00091) [2022-07-10 05:21:06,214][26022] Updated weights on worker 0-0, policy_version 584488 (0.00087) [2022-07-10 05:21:07,830][26022] Updated weights on worker 0-0, policy_version 584498 (0.00098) [2022-07-10 05:21:08,184][25689] Fps is (10 sec: 5575.4, 60 sec: 5640.2, 300 sec: 5612.2). Total num frames: 598526976. Throughput: 0: 5787.3. Samples: 598524012. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:08,185][25689] Avg episode reward: [(0, '-25.833')] [2022-07-10 05:21:09,967][26022] Updated weights on worker 0-0, policy_version 584508 (0.00083) [2022-07-10 05:21:11,706][26022] Updated weights on worker 0-0, policy_version 584518 (0.00087) [2022-07-10 05:21:13,237][25689] Fps is (10 sec: 5673.0, 60 sec: 5606.5, 300 sec: 5614.7). Total num frames: 598554624. Throughput: 0: 5802.0. Samples: 598557900. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:13,238][25689] Avg episode reward: [(0, '-24.645')] [2022-07-10 05:21:13,441][26022] Updated weights on worker 0-0, policy_version 584528 (0.00086) [2022-07-10 05:21:15,435][26022] Updated weights on worker 0-0, policy_version 584538 (0.00092) [2022-07-10 05:21:17,041][26022] Updated weights on worker 0-0, policy_version 584548 (0.00085) [2022-07-10 05:21:18,312][25689] Fps is (10 sec: 5560.7, 60 sec: 5616.8, 300 sec: 5610.8). Total num frames: 598583296. Throughput: 0: 5771.9. Samples: 598591934. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:18,313][25689] Avg episode reward: [(0, '-25.975')] [2022-07-10 05:21:19,044][26022] Updated weights on worker 0-0, policy_version 584558 (0.00120) [2022-07-10 05:21:20,936][26022] Updated weights on worker 0-0, policy_version 584568 (0.00090) [2022-07-10 05:21:22,521][26022] Updated weights on worker 0-0, policy_version 584578 (0.00084) [2022-07-10 05:21:23,386][25689] Fps is (10 sec: 5650.2, 60 sec: 5593.9, 300 sec: 5613.0). Total num frames: 598611968. Throughput: 0: 5030.2. Samples: 598608724. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:23,386][25689] Avg episode reward: [(0, '-25.405')] [2022-07-10 05:21:24,524][26022] Updated weights on worker 0-0, policy_version 584588 (0.00095) [2022-07-10 05:21:26,129][26022] Updated weights on worker 0-0, policy_version 584598 (0.00090) [2022-07-10 05:21:28,031][26022] Updated weights on worker 0-0, policy_version 584608 (0.00096) [2022-07-10 05:21:28,463][25689] Fps is (10 sec: 5649.3, 60 sec: 5622.4, 300 sec: 5609.5). Total num frames: 598640640. Throughput: 0: 5846.2. Samples: 598642566. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:28,463][25689] Avg episode reward: [(0, '-25.349')] [2022-07-10 05:21:29,944][26022] Updated weights on worker 0-0, policy_version 584618 (0.00090) [2022-07-10 05:21:31,728][26022] Updated weights on worker 0-0, policy_version 584628 (0.00093) [2022-07-10 05:21:33,495][25689] Fps is (10 sec: 5570.8, 60 sec: 5606.9, 300 sec: 5605.6). Total num frames: 598668288. Throughput: 0: 5830.6. Samples: 598676020. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:33,496][25689] Avg episode reward: [(0, '-24.283')] [2022-07-10 05:21:33,623][26022] Updated weights on worker 0-0, policy_version 584638 (0.00093) [2022-07-10 05:21:35,339][26022] Updated weights on worker 0-0, policy_version 584648 (0.00081) [2022-07-10 05:21:36,965][26022] Updated weights on worker 0-0, policy_version 584658 (0.00086) [2022-07-10 05:21:38,573][25689] Fps is (10 sec: 5570.6, 60 sec: 5603.2, 300 sec: 5608.5). Total num frames: 598696960. Throughput: 0: 5006.2. Samples: 598693364. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:38,575][25689] Avg episode reward: [(0, '-24.629')] [2022-07-10 05:21:39,017][26022] Updated weights on worker 0-0, policy_version 584668 (0.00109) [2022-07-10 05:21:40,580][26022] Updated weights on worker 0-0, policy_version 584678 (0.00093) [2022-07-10 05:21:42,492][26022] Updated weights on worker 0-0, policy_version 584688 (0.00088) [2022-07-10 05:21:43,640][25689] Fps is (10 sec: 5854.6, 60 sec: 5648.7, 300 sec: 5614.4). Total num frames: 598727680. Throughput: 0: 5873.0. Samples: 598727678. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:43,640][25689] Avg episode reward: [(0, '-24.663')] [2022-07-10 05:21:44,182][26022] Updated weights on worker 0-0, policy_version 584698 (0.00083) [2022-07-10 05:21:45,919][26022] Updated weights on worker 0-0, policy_version 584708 (0.00084) [2022-07-10 05:21:47,895][26022] Updated weights on worker 0-0, policy_version 584718 (0.00093) [2022-07-10 05:21:48,737][25689] Fps is (10 sec: 5742.8, 60 sec: 5642.3, 300 sec: 5608.4). Total num frames: 598755328. Throughput: 0: 5880.1. Samples: 598761778. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:48,739][25689] Avg episode reward: [(0, '-24.284')] [2022-07-10 05:21:49,609][26022] Updated weights on worker 0-0, policy_version 584728 (0.00085) [2022-07-10 05:21:51,538][26022] Updated weights on worker 0-0, policy_version 584738 (0.00088) [2022-07-10 05:21:53,513][26022] Updated weights on worker 0-0, policy_version 584748 (0.00086) [2022-07-10 05:21:53,798][25689] Fps is (10 sec: 5544.2, 60 sec: 5597.6, 300 sec: 5608.9). Total num frames: 598784000. Throughput: 0: 5056.1. Samples: 598778674. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 05:21:53,800][25689] Avg episode reward: [(0, '-24.237')] [2022-07-10 05:21:55,025][26022] Updated weights on worker 0-0, policy_version 584758 (0.00105) [2022-07-10 05:21:57,073][26022] Updated weights on worker 0-0, policy_version 584768 (0.00087) [2022-07-10 05:21:58,743][26022] Updated weights on worker 0-0, policy_version 584778 (0.00112) [2022-07-10 05:21:58,821][25689] Fps is (10 sec: 5787.8, 60 sec: 5647.8, 300 sec: 5615.6). Total num frames: 598813696. Throughput: 0: 5887.4. Samples: 598812574. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:21:58,822][25689] Avg episode reward: [(0, '-24.412')] [2022-07-10 05:22:00,588][26022] Updated weights on worker 0-0, policy_version 584788 (0.00090) [2022-07-10 05:22:02,796][26022] Updated weights on worker 0-0, policy_version 584798 (0.00094) [2022-07-10 05:22:03,390][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:22:03,412][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000584801_598836224.pth [2022-07-10 05:22:03,412][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000582825_596812800.pth [2022-07-10 05:22:03,826][25689] Fps is (10 sec: 5412.5, 60 sec: 5617.9, 300 sec: 5610.3). Total num frames: 598838272. Throughput: 0: 5766.9. Samples: 598844086. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:03,826][25689] Avg episode reward: [(0, '-23.694')] [2022-07-10 05:22:04,607][26022] Updated weights on worker 0-0, policy_version 584808 (0.00084) [2022-07-10 05:22:06,522][26022] Updated weights on worker 0-0, policy_version 584818 (0.00085) [2022-07-10 05:22:08,254][26022] Updated weights on worker 0-0, policy_version 584828 (0.00086) [2022-07-10 05:22:08,865][25689] Fps is (10 sec: 5301.7, 60 sec: 5602.5, 300 sec: 5607.1). Total num frames: 598866944. Throughput: 0: 4937.0. Samples: 598861152. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:08,866][25689] Avg episode reward: [(0, '-23.182')] [2022-07-10 05:22:10,133][26022] Updated weights on worker 0-0, policy_version 584838 (0.00085) [2022-07-10 05:22:11,911][26022] Updated weights on worker 0-0, policy_version 584848 (0.00088) [2022-07-10 05:22:13,585][26022] Updated weights on worker 0-0, policy_version 584858 (0.00090) [2022-07-10 05:22:13,987][25689] Fps is (10 sec: 5845.0, 60 sec: 5646.7, 300 sec: 5622.4). Total num frames: 598897664. Throughput: 0: 5781.3. Samples: 598895390. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:13,987][25689] Avg episode reward: [(0, '-23.380')] [2022-07-10 05:22:15,562][26022] Updated weights on worker 0-0, policy_version 584868 (0.00086) [2022-07-10 05:22:16,983][26022] Updated weights on worker 0-0, policy_version 584878 (0.00084) [2022-07-10 05:22:18,998][25689] Fps is (10 sec: 5760.3, 60 sec: 5635.9, 300 sec: 5612.0). Total num frames: 598925312. Throughput: 0: 5825.7. Samples: 598930116. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:18,998][25689] Avg episode reward: [(0, '-22.738')] [2022-07-10 05:22:19,004][26022] Updated weights on worker 0-0, policy_version 584888 (0.00086) [2022-07-10 05:22:20,693][26022] Updated weights on worker 0-0, policy_version 584898 (0.00203) [2022-07-10 05:22:22,574][26022] Updated weights on worker 0-0, policy_version 584908 (0.00092) [2022-07-10 05:22:24,058][25689] Fps is (10 sec: 5490.3, 60 sec: 5620.2, 300 sec: 5614.8). Total num frames: 598952960. Throughput: 0: 5086.4. Samples: 598946998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:24,059][25689] Avg episode reward: [(0, '-21.928')] [2022-07-10 05:22:24,399][26022] Updated weights on worker 0-0, policy_version 584918 (0.00087) [2022-07-10 05:22:25,947][26022] Updated weights on worker 0-0, policy_version 584928 (0.00092) [2022-07-10 05:22:28,214][26022] Updated weights on worker 0-0, policy_version 584938 (0.00086) [2022-07-10 05:22:29,091][25689] Fps is (10 sec: 5884.6, 60 sec: 5675.0, 300 sec: 5623.6). Total num frames: 598984704. Throughput: 0: 5934.7. Samples: 598981184. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:29,091][25689] Avg episode reward: [(0, '-22.073')] [2022-07-10 05:22:29,560][26022] Updated weights on worker 0-0, policy_version 584948 (0.00091) [2022-07-10 05:22:31,629][26022] Updated weights on worker 0-0, policy_version 584958 (0.00095) [2022-07-10 05:22:33,338][26022] Updated weights on worker 0-0, policy_version 584968 (0.00082) [2022-07-10 05:22:34,202][25689] Fps is (10 sec: 5653.3, 60 sec: 5633.9, 300 sec: 5613.0). Total num frames: 599010304. Throughput: 0: 5931.3. Samples: 599015292. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:34,202][25689] Avg episode reward: [(0, '-22.073')] [2022-07-10 05:22:35,180][26022] Updated weights on worker 0-0, policy_version 584978 (0.00088) [2022-07-10 05:22:37,061][26022] Updated weights on worker 0-0, policy_version 584988 (0.00090) [2022-07-10 05:22:38,763][26022] Updated weights on worker 0-0, policy_version 584998 (0.00086) [2022-07-10 05:22:39,240][25689] Fps is (10 sec: 5448.3, 60 sec: 5654.5, 300 sec: 5619.8). Total num frames: 599040000. Throughput: 0: 5904.3. Samples: 599049632. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:39,240][25689] Avg episode reward: [(0, '-22.616')] [2022-07-10 05:22:40,619][26022] Updated weights on worker 0-0, policy_version 585008 (0.00091) [2022-07-10 05:22:42,292][26022] Updated weights on worker 0-0, policy_version 585018 (0.00089) [2022-07-10 05:22:44,241][26022] Updated weights on worker 0-0, policy_version 585028 (0.00087) [2022-07-10 05:22:44,255][25689] Fps is (10 sec: 5806.2, 60 sec: 5625.6, 300 sec: 5619.7). Total num frames: 599068672. Throughput: 0: 5928.3. Samples: 599066728. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:44,255][25689] Avg episode reward: [(0, '-22.333')] [2022-07-10 05:22:45,864][26022] Updated weights on worker 0-0, policy_version 585038 (0.00090) [2022-07-10 05:22:47,902][26022] Updated weights on worker 0-0, policy_version 585048 (0.00066) [2022-07-10 05:22:49,261][25689] Fps is (10 sec: 5824.3, 60 sec: 5667.8, 300 sec: 5624.3). Total num frames: 599098368. Throughput: 0: 5933.5. Samples: 599100868. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:49,262][25689] Avg episode reward: [(0, '-23.307')] [2022-07-10 05:22:49,555][26022] Updated weights on worker 0-0, policy_version 585058 (0.00086) [2022-07-10 05:22:51,502][26022] Updated weights on worker 0-0, policy_version 585068 (0.00089) [2022-07-10 05:22:53,079][26022] Updated weights on worker 0-0, policy_version 585078 (0.00093) [2022-07-10 05:22:54,366][25689] Fps is (10 sec: 5570.2, 60 sec: 5630.0, 300 sec: 5616.1). Total num frames: 599124992. Throughput: 0: 5945.2. Samples: 599135170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:54,366][25689] Avg episode reward: [(0, '-23.537')] [2022-07-10 05:22:54,872][26022] Updated weights on worker 0-0, policy_version 585088 (0.00090) [2022-07-10 05:22:56,498][26022] Updated weights on worker 0-0, policy_version 585098 (0.00087) [2022-07-10 05:22:58,716][26022] Updated weights on worker 0-0, policy_version 585108 (0.00086) [2022-07-10 05:22:59,386][25689] Fps is (10 sec: 5663.7, 60 sec: 5647.1, 300 sec: 5624.5). Total num frames: 599155712. Throughput: 0: 5091.7. Samples: 599152212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:22:59,387][25689] Avg episode reward: [(0, '-23.686')] [2022-07-10 05:23:00,227][26022] Updated weights on worker 0-0, policy_version 585118 (0.00083) [2022-07-10 05:23:02,416][26022] Updated weights on worker 0-0, policy_version 585128 (0.00078) [2022-07-10 05:23:04,284][26022] Updated weights on worker 0-0, policy_version 585138 (0.00083) [2022-07-10 05:23:04,396][25689] Fps is (10 sec: 5615.2, 60 sec: 5663.5, 300 sec: 5620.9). Total num frames: 599181312. Throughput: 0: 5837.2. Samples: 599184294. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:04,396][25689] Avg episode reward: [(0, '-23.195')] [2022-07-10 05:23:06,038][26022] Updated weights on worker 0-0, policy_version 585148 (0.00111) [2022-07-10 05:23:07,976][26022] Updated weights on worker 0-0, policy_version 585158 (0.00079) [2022-07-10 05:23:09,407][25689] Fps is (10 sec: 5313.9, 60 sec: 5649.3, 300 sec: 5618.4). Total num frames: 599208960. Throughput: 0: 5824.2. Samples: 599218198. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:09,407][25689] Avg episode reward: [(0, '-24.608')] [2022-07-10 05:23:09,903][26022] Updated weights on worker 0-0, policy_version 585168 (0.00100) [2022-07-10 05:23:11,521][26022] Updated weights on worker 0-0, policy_version 585178 (0.00085) [2022-07-10 05:23:13,419][26022] Updated weights on worker 0-0, policy_version 585188 (0.00087) [2022-07-10 05:23:14,498][25689] Fps is (10 sec: 5676.4, 60 sec: 5635.2, 300 sec: 5617.5). Total num frames: 599238656. Throughput: 0: 4980.8. Samples: 599235444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:14,500][25689] Avg episode reward: [(0, '-24.512')] [2022-07-10 05:23:15,103][26022] Updated weights on worker 0-0, policy_version 585198 (0.00088) [2022-07-10 05:23:17,015][26022] Updated weights on worker 0-0, policy_version 585208 (0.00093) [2022-07-10 05:23:18,816][26022] Updated weights on worker 0-0, policy_version 585218 (0.00092) [2022-07-10 05:23:19,529][25689] Fps is (10 sec: 5664.9, 60 sec: 5633.3, 300 sec: 5617.3). Total num frames: 599266304. Throughput: 0: 5841.0. Samples: 599269868. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:19,530][25689] Avg episode reward: [(0, '-24.173')] [2022-07-10 05:23:20,431][26022] Updated weights on worker 0-0, policy_version 585228 (0.00088) [2022-07-10 05:23:22,386][26022] Updated weights on worker 0-0, policy_version 585238 (0.00093) [2022-07-10 05:23:23,961][26022] Updated weights on worker 0-0, policy_version 585248 (0.00088) [2022-07-10 05:23:24,544][25689] Fps is (10 sec: 5708.0, 60 sec: 5671.4, 300 sec: 5620.7). Total num frames: 599296000. Throughput: 0: 5942.3. Samples: 599304022. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:24,546][25689] Avg episode reward: [(0, '-24.773')] [2022-07-10 05:23:26,021][26022] Updated weights on worker 0-0, policy_version 585258 (0.00098) [2022-07-10 05:23:27,723][26022] Updated weights on worker 0-0, policy_version 585268 (0.00094) [2022-07-10 05:23:29,549][25689] Fps is (10 sec: 5825.4, 60 sec: 5623.2, 300 sec: 5629.6). Total num frames: 599324672. Throughput: 0: 5101.5. Samples: 599320954. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:29,551][25689] Avg episode reward: [(0, '-24.352')] [2022-07-10 05:23:29,555][26022] Updated weights on worker 0-0, policy_version 585278 (0.00082) [2022-07-10 05:23:31,473][26022] Updated weights on worker 0-0, policy_version 585288 (0.00084) [2022-07-10 05:23:33,133][26022] Updated weights on worker 0-0, policy_version 585298 (0.00085) [2022-07-10 05:23:34,613][25689] Fps is (10 sec: 5593.6, 60 sec: 5661.5, 300 sec: 5622.2). Total num frames: 599352320. Throughput: 0: 5945.8. Samples: 599355044. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:34,613][25689] Avg episode reward: [(0, '-23.385')] [2022-07-10 05:23:35,177][26022] Updated weights on worker 0-0, policy_version 585308 (0.00083) [2022-07-10 05:23:36,624][26022] Updated weights on worker 0-0, policy_version 585318 (0.00090) [2022-07-10 05:23:38,808][26022] Updated weights on worker 0-0, policy_version 585328 (0.00084) [2022-07-10 05:23:39,615][25689] Fps is (10 sec: 5697.0, 60 sec: 5664.9, 300 sec: 5626.5). Total num frames: 599382016. Throughput: 0: 5936.3. Samples: 599389100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:39,615][25689] Avg episode reward: [(0, '-24.036')] [2022-07-10 05:23:40,176][26022] Updated weights on worker 0-0, policy_version 585338 (0.00089) [2022-07-10 05:23:42,266][26022] Updated weights on worker 0-0, policy_version 585348 (0.00524) [2022-07-10 05:23:43,782][26022] Updated weights on worker 0-0, policy_version 585358 (0.00089) [2022-07-10 05:23:44,657][25689] Fps is (10 sec: 5607.3, 60 sec: 5628.4, 300 sec: 5622.9). Total num frames: 599408640. Throughput: 0: 5084.6. Samples: 599406288. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:44,657][25689] Avg episode reward: [(0, '-24.360')] [2022-07-10 05:23:45,792][26022] Updated weights on worker 0-0, policy_version 585368 (0.00084) [2022-07-10 05:23:47,425][26022] Updated weights on worker 0-0, policy_version 585378 (0.00064) [2022-07-10 05:23:49,478][26022] Updated weights on worker 0-0, policy_version 585388 (0.00091) [2022-07-10 05:23:49,685][25689] Fps is (10 sec: 5491.1, 60 sec: 5609.5, 300 sec: 5624.2). Total num frames: 599437312. Throughput: 0: 5929.7. Samples: 599440352. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:49,685][25689] Avg episode reward: [(0, '-24.226')] [2022-07-10 05:23:51,156][26022] Updated weights on worker 0-0, policy_version 585398 (0.00090) [2022-07-10 05:23:53,313][26022] Updated weights on worker 0-0, policy_version 585408 (0.00088) [2022-07-10 05:23:54,758][25689] Fps is (10 sec: 5778.4, 60 sec: 5663.2, 300 sec: 5626.7). Total num frames: 599467008. Throughput: 0: 5894.8. Samples: 599473796. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:54,758][25689] Avg episode reward: [(0, '-24.205')] [2022-07-10 05:23:54,836][26022] Updated weights on worker 0-0, policy_version 585418 (0.00090) [2022-07-10 05:23:56,782][26022] Updated weights on worker 0-0, policy_version 585428 (0.00081) [2022-07-10 05:23:58,348][26022] Updated weights on worker 0-0, policy_version 585438 (0.00094) [2022-07-10 05:23:59,768][25689] Fps is (10 sec: 5687.0, 60 sec: 5613.3, 300 sec: 5631.6). Total num frames: 599494656. Throughput: 0: 5042.5. Samples: 599490724. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:23:59,769][25689] Avg episode reward: [(0, '-25.225')] [2022-07-10 05:24:00,356][26022] Updated weights on worker 0-0, policy_version 585448 (0.00092) [2022-07-10 05:24:02,554][26022] Updated weights on worker 0-0, policy_version 585458 (0.00078) [2022-07-10 05:24:03,637][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:24:03,650][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000585464_599515136.pth [2022-07-10 05:24:03,650][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000583485_597488640.pth [2022-07-10 05:24:04,483][26022] Updated weights on worker 0-0, policy_version 585468 (0.00083) [2022-07-10 05:24:04,800][25689] Fps is (10 sec: 5302.2, 60 sec: 5611.2, 300 sec: 5621.2). Total num frames: 599520256. Throughput: 0: 5765.0. Samples: 599522416. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:04,801][25689] Avg episode reward: [(0, '-25.451')] [2022-07-10 05:24:06,226][26022] Updated weights on worker 0-0, policy_version 585478 (0.00084) [2022-07-10 05:24:08,118][26022] Updated weights on worker 0-0, policy_version 585488 (0.00064) [2022-07-10 05:24:09,804][26022] Updated weights on worker 0-0, policy_version 585498 (0.00086) [2022-07-10 05:24:09,815][25689] Fps is (10 sec: 5503.9, 60 sec: 5644.8, 300 sec: 5626.8). Total num frames: 599549952. Throughput: 0: 5754.5. Samples: 599556190. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:09,815][25689] Avg episode reward: [(0, '-25.288')] [2022-07-10 05:24:11,660][26022] Updated weights on worker 0-0, policy_version 585508 (0.00087) [2022-07-10 05:24:13,698][26022] Updated weights on worker 0-0, policy_version 585518 (0.00090) [2022-07-10 05:24:14,866][25689] Fps is (10 sec: 5798.5, 60 sec: 5631.5, 300 sec: 5627.0). Total num frames: 599578624. Throughput: 0: 4941.7. Samples: 599573166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:14,867][25689] Avg episode reward: [(0, '-24.819')] [2022-07-10 05:24:15,242][26022] Updated weights on worker 0-0, policy_version 585528 (0.00094) [2022-07-10 05:24:17,303][26022] Updated weights on worker 0-0, policy_version 585538 (0.00085) [2022-07-10 05:24:18,984][26022] Updated weights on worker 0-0, policy_version 585548 (0.00102) [2022-07-10 05:24:19,886][25689] Fps is (10 sec: 5490.5, 60 sec: 5615.7, 300 sec: 5616.5). Total num frames: 599605248. Throughput: 0: 5786.8. Samples: 599607142. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:19,886][25689] Avg episode reward: [(0, '-24.813')] [2022-07-10 05:24:20,735][26022] Updated weights on worker 0-0, policy_version 585558 (0.00093) [2022-07-10 05:24:22,571][26022] Updated weights on worker 0-0, policy_version 585568 (0.00093) [2022-07-10 05:24:24,360][26022] Updated weights on worker 0-0, policy_version 585578 (0.00096) [2022-07-10 05:24:24,895][25689] Fps is (10 sec: 5616.1, 60 sec: 5616.2, 300 sec: 5626.7). Total num frames: 599634944. Throughput: 0: 5911.6. Samples: 599641206. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:24,895][25689] Avg episode reward: [(0, '-25.202')] [2022-07-10 05:24:26,154][26022] Updated weights on worker 0-0, policy_version 585588 (0.00088) [2022-07-10 05:24:28,051][26022] Updated weights on worker 0-0, policy_version 585598 (0.00090) [2022-07-10 05:24:29,696][26022] Updated weights on worker 0-0, policy_version 585608 (0.00084) [2022-07-10 05:24:29,906][25689] Fps is (10 sec: 5825.0, 60 sec: 5615.6, 300 sec: 5627.8). Total num frames: 599663616. Throughput: 0: 5070.0. Samples: 599658054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:29,907][25689] Avg episode reward: [(0, '-25.099')] [2022-07-10 05:24:31,666][26022] Updated weights on worker 0-0, policy_version 585618 (0.00095) [2022-07-10 05:24:33,368][26022] Updated weights on worker 0-0, policy_version 585628 (0.00091) [2022-07-10 05:24:34,970][25689] Fps is (10 sec: 5590.0, 60 sec: 5615.6, 300 sec: 5620.4). Total num frames: 599691264. Throughput: 0: 5904.3. Samples: 599691862. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:34,970][25689] Avg episode reward: [(0, '-25.503')] [2022-07-10 05:24:35,374][26022] Updated weights on worker 0-0, policy_version 585638 (0.00094) [2022-07-10 05:24:37,186][26022] Updated weights on worker 0-0, policy_version 585648 (0.00089) [2022-07-10 05:24:38,726][26022] Updated weights on worker 0-0, policy_version 585658 (0.00089) [2022-07-10 05:24:40,053][25689] Fps is (10 sec: 5550.6, 60 sec: 5591.2, 300 sec: 5623.5). Total num frames: 599719936. Throughput: 0: 5884.6. Samples: 599725816. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:40,055][25689] Avg episode reward: [(0, '-25.916')] [2022-07-10 05:24:40,902][26022] Updated weights on worker 0-0, policy_version 585668 (0.00082) [2022-07-10 05:24:42,552][26022] Updated weights on worker 0-0, policy_version 585678 (0.00095) [2022-07-10 05:24:44,388][26022] Updated weights on worker 0-0, policy_version 585688 (0.00092) [2022-07-10 05:24:45,094][25689] Fps is (10 sec: 5764.9, 60 sec: 5642.0, 300 sec: 5626.4). Total num frames: 599749632. Throughput: 0: 5035.9. Samples: 599742930. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:45,096][25689] Avg episode reward: [(0, '-25.462')] [2022-07-10 05:24:46,167][26022] Updated weights on worker 0-0, policy_version 585698 (0.00088) [2022-07-10 05:24:47,827][26022] Updated weights on worker 0-0, policy_version 585708 (0.00087) [2022-07-10 05:24:49,809][26022] Updated weights on worker 0-0, policy_version 585718 (0.00086) [2022-07-10 05:24:50,104][25689] Fps is (10 sec: 5603.2, 60 sec: 5609.9, 300 sec: 5620.5). Total num frames: 599776256. Throughput: 0: 5893.9. Samples: 599777100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:50,105][25689] Avg episode reward: [(0, '-25.578')] [2022-07-10 05:24:51,474][26022] Updated weights on worker 0-0, policy_version 585728 (0.00090) [2022-07-10 05:24:53,292][26022] Updated weights on worker 0-0, policy_version 585738 (0.00098) [2022-07-10 05:24:55,054][26022] Updated weights on worker 0-0, policy_version 585748 (0.00055) [2022-07-10 05:24:55,215][25689] Fps is (10 sec: 5665.9, 60 sec: 5623.2, 300 sec: 5625.8). Total num frames: 599806976. Throughput: 0: 5890.1. Samples: 599811112. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:24:55,216][25689] Avg episode reward: [(0, '-26.142')] [2022-07-10 05:24:57,011][26022] Updated weights on worker 0-0, policy_version 585758 (0.00095) [2022-07-10 05:24:58,737][26022] Updated weights on worker 0-0, policy_version 585768 (0.00086) [2022-07-10 05:25:00,268][25689] Fps is (10 sec: 5742.9, 60 sec: 5619.3, 300 sec: 5633.0). Total num frames: 599834624. Throughput: 0: 5052.3. Samples: 599827948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 05:25:00,268][25689] Avg episode reward: [(0, '-25.326')] [2022-07-10 05:25:00,602][26022] Updated weights on worker 0-0, policy_version 585778 (0.00088) [2022-07-10 05:25:02,724][26022] Updated weights on worker 0-0, policy_version 585788 (0.00094) [2022-07-10 05:25:04,415][26022] Updated weights on worker 0-0, policy_version 585798 (0.00089) [2022-07-10 05:25:05,327][25689] Fps is (10 sec: 5164.9, 60 sec: 5599.9, 300 sec: 5622.4). Total num frames: 599859200. Throughput: 0: 5793.7. Samples: 599860150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:05,327][25689] Avg episode reward: [(0, '-25.446')] [2022-07-10 05:25:06,292][26022] Updated weights on worker 0-0, policy_version 585808 (0.00091) [2022-07-10 05:25:08,278][26022] Updated weights on worker 0-0, policy_version 585818 (0.00086) [2022-07-10 05:25:09,881][26022] Updated weights on worker 0-0, policy_version 585828 (0.00087) [2022-07-10 05:25:10,344][25689] Fps is (10 sec: 5589.6, 60 sec: 5633.5, 300 sec: 5630.0). Total num frames: 599890944. Throughput: 0: 5779.0. Samples: 599894064. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:10,344][25689] Avg episode reward: [(0, '-26.027')] [2022-07-10 05:25:11,871][26022] Updated weights on worker 0-0, policy_version 585838 (0.00094) [2022-07-10 05:25:13,389][26022] Updated weights on worker 0-0, policy_version 585848 (0.00088) [2022-07-10 05:25:15,396][25689] Fps is (10 sec: 5796.6, 60 sec: 5599.6, 300 sec: 5625.6). Total num frames: 599917568. Throughput: 0: 4953.3. Samples: 599911072. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:15,397][25689] Avg episode reward: [(0, '-26.383')] [2022-07-10 05:25:15,629][26022] Updated weights on worker 0-0, policy_version 585858 (0.00093) [2022-07-10 05:25:17,118][26022] Updated weights on worker 0-0, policy_version 585868 (0.00088) [2022-07-10 05:25:19,205][26022] Updated weights on worker 0-0, policy_version 585878 (0.00093) [2022-07-10 05:25:20,411][25689] Fps is (10 sec: 5696.2, 60 sec: 5667.8, 300 sec: 5628.9). Total num frames: 599948288. Throughput: 0: 5825.0. Samples: 599945280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:20,411][25689] Avg episode reward: [(0, '-26.427')] [2022-07-10 05:25:20,649][26022] Updated weights on worker 0-0, policy_version 585888 (0.00096) [2022-07-10 05:25:22,748][26022] Updated weights on worker 0-0, policy_version 585898 (0.00084) [2022-07-10 05:25:24,483][26022] Updated weights on worker 0-0, policy_version 585908 (0.00055) [2022-07-10 05:25:25,482][25689] Fps is (10 sec: 5685.8, 60 sec: 5611.2, 300 sec: 5628.0). Total num frames: 599974912. Throughput: 0: 5905.4. Samples: 599979172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:25,482][25689] Avg episode reward: [(0, '-24.750')] [2022-07-10 05:25:26,246][26022] Updated weights on worker 0-0, policy_version 585918 (0.00090) [2022-07-10 05:25:28,162][26022] Updated weights on worker 0-0, policy_version 585928 (0.00096) [2022-07-10 05:25:30,075][26022] Updated weights on worker 0-0, policy_version 585938 (0.00083) [2022-07-10 05:25:30,499][25689] Fps is (10 sec: 5379.6, 60 sec: 5593.8, 300 sec: 5625.1). Total num frames: 600002560. Throughput: 0: 5901.8. Samples: 600013018. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:30,500][25689] Avg episode reward: [(0, '-24.662')] [2022-07-10 05:25:31,681][26022] Updated weights on worker 0-0, policy_version 585948 (0.00086) [2022-07-10 05:25:33,759][26022] Updated weights on worker 0-0, policy_version 585958 (0.00086) [2022-07-10 05:25:35,205][26022] Updated weights on worker 0-0, policy_version 585968 (0.00080) [2022-07-10 05:25:35,565][25689] Fps is (10 sec: 5788.7, 60 sec: 5644.3, 300 sec: 5631.5). Total num frames: 600033280. Throughput: 0: 5872.6. Samples: 600029514. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:35,565][25689] Avg episode reward: [(0, '-24.385')] [2022-07-10 05:25:37,542][26022] Updated weights on worker 0-0, policy_version 585978 (0.00093) [2022-07-10 05:25:38,974][26022] Updated weights on worker 0-0, policy_version 585988 (0.00084) [2022-07-10 05:25:40,593][25689] Fps is (10 sec: 5579.5, 60 sec: 5598.6, 300 sec: 5624.2). Total num frames: 600058880. Throughput: 0: 5837.8. Samples: 600063102. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:40,594][25689] Avg episode reward: [(0, '-24.356')] [2022-07-10 05:25:41,110][26022] Updated weights on worker 0-0, policy_version 585998 (0.00094) [2022-07-10 05:25:42,580][26022] Updated weights on worker 0-0, policy_version 586008 (0.00089) [2022-07-10 05:25:44,441][26022] Updated weights on worker 0-0, policy_version 586018 (0.00111) [2022-07-10 05:25:45,625][25689] Fps is (10 sec: 5394.7, 60 sec: 5582.6, 300 sec: 5627.6). Total num frames: 600087552. Throughput: 0: 5885.3. Samples: 600097722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:45,627][25689] Avg episode reward: [(0, '-24.824')] [2022-07-10 05:25:46,254][26022] Updated weights on worker 0-0, policy_version 586028 (0.00091) [2022-07-10 05:25:48,325][26022] Updated weights on worker 0-0, policy_version 586038 (0.00564) [2022-07-10 05:25:49,868][26022] Updated weights on worker 0-0, policy_version 586048 (0.00094) [2022-07-10 05:25:50,641][25689] Fps is (10 sec: 5809.2, 60 sec: 5632.8, 300 sec: 5622.8). Total num frames: 600117248. Throughput: 0: 5046.5. Samples: 600114662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:50,643][25689] Avg episode reward: [(0, '-24.536')] [2022-07-10 05:25:51,971][26022] Updated weights on worker 0-0, policy_version 586058 (0.00088) [2022-07-10 05:25:53,360][26022] Updated weights on worker 0-0, policy_version 586068 (0.00091) [2022-07-10 05:25:55,475][26022] Updated weights on worker 0-0, policy_version 586078 (0.01248) [2022-07-10 05:25:55,693][25689] Fps is (10 sec: 5695.4, 60 sec: 5587.5, 300 sec: 5625.6). Total num frames: 600144896. Throughput: 0: 5909.7. Samples: 600148468. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:25:55,694][25689] Avg episode reward: [(0, '-22.754')] [2022-07-10 05:25:56,942][26022] Updated weights on worker 0-0, policy_version 586088 (0.00087) [2022-07-10 05:25:59,063][26022] Updated weights on worker 0-0, policy_version 586098 (0.00082) [2022-07-10 05:26:00,696][25689] Fps is (10 sec: 5702.7, 60 sec: 5625.9, 300 sec: 5636.7). Total num frames: 600174592. Throughput: 0: 5949.2. Samples: 600182698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:00,697][25689] Avg episode reward: [(0, '-23.033')] [2022-07-10 05:26:00,698][26022] Updated weights on worker 0-0, policy_version 586108 (0.00089) [2022-07-10 05:26:03,080][26022] Updated weights on worker 0-0, policy_version 586118 (0.00091) [2022-07-10 05:26:04,048][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:26:04,061][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000586124_600190976.pth [2022-07-10 05:26:04,061][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000584143_598162432.pth [2022-07-10 05:26:04,670][26022] Updated weights on worker 0-0, policy_version 586128 (0.00087) [2022-07-10 05:26:05,702][25689] Fps is (10 sec: 5525.0, 60 sec: 5647.9, 300 sec: 5623.9). Total num frames: 600200192. Throughput: 0: 4971.1. Samples: 600197526. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:05,702][25689] Avg episode reward: [(0, '-23.069')] [2022-07-10 05:26:06,458][26022] Updated weights on worker 0-0, policy_version 586138 (0.00087) [2022-07-10 05:26:08,631][26022] Updated weights on worker 0-0, policy_version 586148 (0.00633) [2022-07-10 05:26:10,253][26022] Updated weights on worker 0-0, policy_version 586158 (0.00106) [2022-07-10 05:26:10,794][25689] Fps is (10 sec: 5475.9, 60 sec: 5606.9, 300 sec: 5630.0). Total num frames: 600229888. Throughput: 0: 5781.4. Samples: 600231178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:10,795][25689] Avg episode reward: [(0, '-23.081')] [2022-07-10 05:26:12,127][26022] Updated weights on worker 0-0, policy_version 586168 (0.00083) [2022-07-10 05:26:13,777][26022] Updated weights on worker 0-0, policy_version 586178 (0.00084) [2022-07-10 05:26:15,690][26022] Updated weights on worker 0-0, policy_version 586188 (0.00094) [2022-07-10 05:26:15,862][25689] Fps is (10 sec: 5643.9, 60 sec: 5622.5, 300 sec: 5626.8). Total num frames: 600257536. Throughput: 0: 5798.3. Samples: 600265412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:15,862][25689] Avg episode reward: [(0, '-23.468')] [2022-07-10 05:26:17,359][26022] Updated weights on worker 0-0, policy_version 586198 (0.00088) [2022-07-10 05:26:19,107][26022] Updated weights on worker 0-0, policy_version 586208 (0.00082) [2022-07-10 05:26:20,927][25689] Fps is (10 sec: 5558.2, 60 sec: 5583.9, 300 sec: 5626.9). Total num frames: 600286208. Throughput: 0: 4941.0. Samples: 600282660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:20,928][25689] Avg episode reward: [(0, '-23.349')] [2022-07-10 05:26:21,091][26022] Updated weights on worker 0-0, policy_version 586218 (0.00086) [2022-07-10 05:26:22,875][26022] Updated weights on worker 0-0, policy_version 586228 (0.00088) [2022-07-10 05:26:24,568][26022] Updated weights on worker 0-0, policy_version 586238 (0.00091) [2022-07-10 05:26:25,947][25689] Fps is (10 sec: 5686.3, 60 sec: 5622.6, 300 sec: 5628.0). Total num frames: 600314880. Throughput: 0: 5894.8. Samples: 600316866. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:25,948][25689] Avg episode reward: [(0, '-23.591')] [2022-07-10 05:26:26,473][26022] Updated weights on worker 0-0, policy_version 586248 (0.00090) [2022-07-10 05:26:28,194][26022] Updated weights on worker 0-0, policy_version 586258 (0.00084) [2022-07-10 05:26:30,092][26022] Updated weights on worker 0-0, policy_version 586268 (0.00087) [2022-07-10 05:26:30,969][25689] Fps is (10 sec: 5608.7, 60 sec: 5622.1, 300 sec: 5628.2). Total num frames: 600342528. Throughput: 0: 5913.8. Samples: 600350486. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:30,969][25689] Avg episode reward: [(0, '-25.000')] [2022-07-10 05:26:32,006][26022] Updated weights on worker 0-0, policy_version 586278 (0.00089) [2022-07-10 05:26:33,818][26022] Updated weights on worker 0-0, policy_version 586288 (0.00085) [2022-07-10 05:26:35,413][26022] Updated weights on worker 0-0, policy_version 586298 (0.00087) [2022-07-10 05:26:36,029][25689] Fps is (10 sec: 5687.2, 60 sec: 5605.6, 300 sec: 5632.0). Total num frames: 600372224. Throughput: 0: 5053.5. Samples: 600367328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:36,031][25689] Avg episode reward: [(0, '-24.964')] [2022-07-10 05:26:37,360][26022] Updated weights on worker 0-0, policy_version 586308 (0.00096) [2022-07-10 05:26:39,069][26022] Updated weights on worker 0-0, policy_version 586318 (0.00085) [2022-07-10 05:26:41,016][26022] Updated weights on worker 0-0, policy_version 586328 (0.00140) [2022-07-10 05:26:41,111][25689] Fps is (10 sec: 5654.0, 60 sec: 5634.6, 300 sec: 5621.4). Total num frames: 600399872. Throughput: 0: 5879.5. Samples: 600401330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:41,113][25689] Avg episode reward: [(0, '-25.163')] [2022-07-10 05:26:42,692][26022] Updated weights on worker 0-0, policy_version 586338 (0.00093) [2022-07-10 05:26:44,597][26022] Updated weights on worker 0-0, policy_version 586348 (0.00090) [2022-07-10 05:26:46,140][25689] Fps is (10 sec: 5570.6, 60 sec: 5634.9, 300 sec: 5626.1). Total num frames: 600428544. Throughput: 0: 5887.8. Samples: 600435758. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:46,140][25689] Avg episode reward: [(0, '-25.453')] [2022-07-10 05:26:46,295][26022] Updated weights on worker 0-0, policy_version 586358 (0.00088) [2022-07-10 05:26:48,087][26022] Updated weights on worker 0-0, policy_version 586368 (0.00090) [2022-07-10 05:26:50,076][26022] Updated weights on worker 0-0, policy_version 586378 (0.00079) [2022-07-10 05:26:51,199][25689] Fps is (10 sec: 5785.6, 60 sec: 5630.8, 300 sec: 5629.6). Total num frames: 600458240. Throughput: 0: 5060.5. Samples: 600452864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:51,200][25689] Avg episode reward: [(0, '-25.311')] [2022-07-10 05:26:51,692][26022] Updated weights on worker 0-0, policy_version 586388 (0.00094) [2022-07-10 05:26:53,676][26022] Updated weights on worker 0-0, policy_version 586398 (0.00088) [2022-07-10 05:26:55,456][26022] Updated weights on worker 0-0, policy_version 586408 (0.00090) [2022-07-10 05:26:56,290][25689] Fps is (10 sec: 5750.2, 60 sec: 5644.1, 300 sec: 5624.8). Total num frames: 600486912. Throughput: 0: 5895.0. Samples: 600486764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:26:56,291][25689] Avg episode reward: [(0, '-23.953')] [2022-07-10 05:26:57,121][26022] Updated weights on worker 0-0, policy_version 586418 (0.00094) [2022-07-10 05:26:59,039][26022] Updated weights on worker 0-0, policy_version 586428 (0.00096) [2022-07-10 05:27:00,758][26022] Updated weights on worker 0-0, policy_version 586438 (0.00101) [2022-07-10 05:27:01,302][25689] Fps is (10 sec: 5676.4, 60 sec: 5626.4, 300 sec: 5638.5). Total num frames: 600515584. Throughput: 0: 5912.8. Samples: 600520714. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:01,302][25689] Avg episode reward: [(0, '-23.604')] [2022-07-10 05:27:03,045][26022] Updated weights on worker 0-0, policy_version 586448 (0.00090) [2022-07-10 05:27:04,667][26022] Updated weights on worker 0-0, policy_version 586458 (0.00091) [2022-07-10 05:27:06,411][25689] Fps is (10 sec: 5463.7, 60 sec: 5633.7, 300 sec: 5630.3). Total num frames: 600542208. Throughput: 0: 4934.2. Samples: 600535780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:06,411][25689] Avg episode reward: [(0, '-24.764')] [2022-07-10 05:27:06,559][26022] Updated weights on worker 0-0, policy_version 586468 (0.00084) [2022-07-10 05:27:08,315][26022] Updated weights on worker 0-0, policy_version 586478 (0.00116) [2022-07-10 05:27:10,065][26022] Updated weights on worker 0-0, policy_version 586488 (0.00093) [2022-07-10 05:27:11,417][25689] Fps is (10 sec: 5466.8, 60 sec: 5624.9, 300 sec: 5625.6). Total num frames: 600570880. Throughput: 0: 5801.1. Samples: 600570146. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:11,418][25689] Avg episode reward: [(0, '-23.548')] [2022-07-10 05:27:11,866][26022] Updated weights on worker 0-0, policy_version 586498 (0.00083) [2022-07-10 05:27:13,767][26022] Updated weights on worker 0-0, policy_version 586508 (0.00088) [2022-07-10 05:27:15,406][26022] Updated weights on worker 0-0, policy_version 586518 (0.00119) [2022-07-10 05:27:16,558][25689] Fps is (10 sec: 5751.9, 60 sec: 5651.7, 300 sec: 5630.0). Total num frames: 600600576. Throughput: 0: 5814.0. Samples: 600604602. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:16,559][25689] Avg episode reward: [(0, '-24.082')] [2022-07-10 05:27:17,323][26022] Updated weights on worker 0-0, policy_version 586528 (0.00338) [2022-07-10 05:27:19,027][26022] Updated weights on worker 0-0, policy_version 586538 (0.00081) [2022-07-10 05:27:21,131][26022] Updated weights on worker 0-0, policy_version 586548 (0.00080) [2022-07-10 05:27:21,605][25689] Fps is (10 sec: 5729.1, 60 sec: 5653.5, 300 sec: 5633.7). Total num frames: 600629248. Throughput: 0: 4986.6. Samples: 600621956. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:21,605][25689] Avg episode reward: [(0, '-23.775')] [2022-07-10 05:27:22,474][26022] Updated weights on worker 0-0, policy_version 586558 (0.00093) [2022-07-10 05:27:24,623][26022] Updated weights on worker 0-0, policy_version 586568 (0.00086) [2022-07-10 05:27:26,052][26022] Updated weights on worker 0-0, policy_version 586578 (0.00084) [2022-07-10 05:27:26,615][25689] Fps is (10 sec: 5803.8, 60 sec: 5671.2, 300 sec: 5627.2). Total num frames: 600658944. Throughput: 0: 5965.8. Samples: 600656316. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:26,616][25689] Avg episode reward: [(0, '-23.630')] [2022-07-10 05:27:28,033][26022] Updated weights on worker 0-0, policy_version 586588 (0.00620) [2022-07-10 05:27:29,724][26022] Updated weights on worker 0-0, policy_version 586598 (0.00083) [2022-07-10 05:27:31,488][26022] Updated weights on worker 0-0, policy_version 586608 (0.00085) [2022-07-10 05:27:31,684][25689] Fps is (10 sec: 5689.3, 60 sec: 5666.9, 300 sec: 5634.9). Total num frames: 600686592. Throughput: 0: 5955.2. Samples: 600690842. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:31,684][25689] Avg episode reward: [(0, '-23.427')] [2022-07-10 05:27:33,382][26022] Updated weights on worker 0-0, policy_version 586618 (0.00089) [2022-07-10 05:27:35,111][26022] Updated weights on worker 0-0, policy_version 586628 (0.00086) [2022-07-10 05:27:36,730][25689] Fps is (10 sec: 5669.1, 60 sec: 5668.2, 300 sec: 5634.8). Total num frames: 600716288. Throughput: 0: 5973.2. Samples: 600725094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:36,731][25689] Avg episode reward: [(0, '-22.989')] [2022-07-10 05:27:36,845][26022] Updated weights on worker 0-0, policy_version 586638 (0.00113) [2022-07-10 05:27:38,894][26022] Updated weights on worker 0-0, policy_version 586648 (0.00086) [2022-07-10 05:27:40,475][26022] Updated weights on worker 0-0, policy_version 586658 (0.00094) [2022-07-10 05:27:41,755][25689] Fps is (10 sec: 5693.9, 60 sec: 5673.5, 300 sec: 5631.1). Total num frames: 600743936. Throughput: 0: 5945.4. Samples: 600741758. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:41,755][25689] Avg episode reward: [(0, '-22.689')] [2022-07-10 05:27:42,442][26022] Updated weights on worker 0-0, policy_version 586668 (0.00087) [2022-07-10 05:27:44,253][26022] Updated weights on worker 0-0, policy_version 586678 (0.00088) [2022-07-10 05:27:45,954][26022] Updated weights on worker 0-0, policy_version 586688 (0.00086) [2022-07-10 05:27:46,813][25689] Fps is (10 sec: 5484.4, 60 sec: 5653.9, 300 sec: 5623.3). Total num frames: 600771584. Throughput: 0: 5924.2. Samples: 600775972. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:46,813][25689] Avg episode reward: [(0, '-23.566')] [2022-07-10 05:27:47,828][26022] Updated weights on worker 0-0, policy_version 586698 (0.00094) [2022-07-10 05:27:49,683][26022] Updated weights on worker 0-0, policy_version 586708 (0.00090) [2022-07-10 05:27:51,435][26022] Updated weights on worker 0-0, policy_version 586718 (0.00085) [2022-07-10 05:27:51,854][25689] Fps is (10 sec: 5678.2, 60 sec: 5655.6, 300 sec: 5634.8). Total num frames: 600801280. Throughput: 0: 5916.8. Samples: 600810184. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:51,854][25689] Avg episode reward: [(0, '-24.869')] [2022-07-10 05:27:53,349][26022] Updated weights on worker 0-0, policy_version 586728 (0.00094) [2022-07-10 05:27:55,165][26022] Updated weights on worker 0-0, policy_version 586738 (0.00090) [2022-07-10 05:27:56,823][26022] Updated weights on worker 0-0, policy_version 586748 (0.00087) [2022-07-10 05:27:56,990][25689] Fps is (10 sec: 5835.9, 60 sec: 5668.3, 300 sec: 5629.2). Total num frames: 600830976. Throughput: 0: 5026.7. Samples: 600826932. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:27:56,990][25689] Avg episode reward: [(0, '-26.321')] [2022-07-10 05:27:58,803][26022] Updated weights on worker 0-0, policy_version 586758 (0.00094) [2022-07-10 05:28:00,299][26022] Updated weights on worker 0-0, policy_version 586768 (0.00092) [2022-07-10 05:28:02,006][25689] Fps is (10 sec: 5446.5, 60 sec: 5617.2, 300 sec: 5629.0). Total num frames: 600856576. Throughput: 0: 5884.2. Samples: 600860922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:28:02,007][25689] Avg episode reward: [(0, '-26.864')] [2022-07-10 05:28:02,818][26022] Updated weights on worker 0-0, policy_version 586778 (0.00093) [2022-07-10 05:28:04,150][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:28:04,162][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000586786_600868864.pth [2022-07-10 05:28:04,163][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000584801_598836224.pth [2022-07-10 05:28:04,431][26022] Updated weights on worker 0-0, policy_version 586788 (0.00094) [2022-07-10 05:28:06,402][26022] Updated weights on worker 0-0, policy_version 586798 (0.00084) [2022-07-10 05:28:07,067][25689] Fps is (10 sec: 5284.1, 60 sec: 5638.6, 300 sec: 5628.1). Total num frames: 600884224. Throughput: 0: 5754.7. Samples: 600892528. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:28:07,067][25689] Avg episode reward: [(0, '-26.979')] [2022-07-10 05:28:08,115][26022] Updated weights on worker 0-0, policy_version 586808 (0.00090) [2022-07-10 05:28:09,937][26022] Updated weights on worker 0-0, policy_version 586818 (0.00089) [2022-07-10 05:28:11,814][26022] Updated weights on worker 0-0, policy_version 586828 (0.00087) [2022-07-10 05:28:12,100][25689] Fps is (10 sec: 5579.8, 60 sec: 5636.1, 300 sec: 5625.8). Total num frames: 600912896. Throughput: 0: 4889.8. Samples: 600909182. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:12,100][25689] Avg episode reward: [(0, '-27.428')] [2022-07-10 05:28:13,839][26022] Updated weights on worker 0-0, policy_version 586838 (0.00082) [2022-07-10 05:28:15,252][26022] Updated weights on worker 0-0, policy_version 586848 (0.00090) [2022-07-10 05:28:17,227][25689] Fps is (10 sec: 5644.0, 60 sec: 5620.6, 300 sec: 5627.4). Total num frames: 600941568. Throughput: 0: 5758.5. Samples: 600943468. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:17,227][25689] Avg episode reward: [(0, '-26.911')] [2022-07-10 05:28:17,548][26022] Updated weights on worker 0-0, policy_version 586858 (0.00087) [2022-07-10 05:28:18,865][26022] Updated weights on worker 0-0, policy_version 586868 (0.00094) [2022-07-10 05:28:20,814][26022] Updated weights on worker 0-0, policy_version 586878 (0.00092) [2022-07-10 05:28:22,251][25689] Fps is (10 sec: 5750.1, 60 sec: 5639.5, 300 sec: 5627.2). Total num frames: 600971264. Throughput: 0: 5767.8. Samples: 600977686. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:22,252][25689] Avg episode reward: [(0, '-25.042')] [2022-07-10 05:28:22,619][26022] Updated weights on worker 0-0, policy_version 586888 (0.00087) [2022-07-10 05:28:24,398][26022] Updated weights on worker 0-0, policy_version 586898 (0.00089) [2022-07-10 05:28:26,319][26022] Updated weights on worker 0-0, policy_version 586908 (0.00087) [2022-07-10 05:28:27,296][25689] Fps is (10 sec: 5695.2, 60 sec: 5602.6, 300 sec: 5623.0). Total num frames: 600998912. Throughput: 0: 5054.3. Samples: 600994772. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:27,297][25689] Avg episode reward: [(0, '-24.495')] [2022-07-10 05:28:28,051][26022] Updated weights on worker 0-0, policy_version 586918 (0.00088) [2022-07-10 05:28:29,908][26022] Updated weights on worker 0-0, policy_version 586928 (0.00089) [2022-07-10 05:28:31,540][26022] Updated weights on worker 0-0, policy_version 586938 (0.00088) [2022-07-10 05:28:32,331][25689] Fps is (10 sec: 5587.2, 60 sec: 5622.6, 300 sec: 5627.0). Total num frames: 601027584. Throughput: 0: 5914.3. Samples: 601028834. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:32,333][25689] Avg episode reward: [(0, '-25.829')] [2022-07-10 05:28:33,543][26022] Updated weights on worker 0-0, policy_version 586948 (0.00088) [2022-07-10 05:28:35,092][26022] Updated weights on worker 0-0, policy_version 586958 (0.00092) [2022-07-10 05:28:37,152][26022] Updated weights on worker 0-0, policy_version 586968 (0.00094) [2022-07-10 05:28:37,383][25689] Fps is (10 sec: 5684.9, 60 sec: 5605.2, 300 sec: 5622.6). Total num frames: 601056256. Throughput: 0: 5930.5. Samples: 601063002. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:37,383][25689] Avg episode reward: [(0, '-25.798')] [2022-07-10 05:28:38,782][26022] Updated weights on worker 0-0, policy_version 586978 (0.00092) [2022-07-10 05:28:40,792][26022] Updated weights on worker 0-0, policy_version 586988 (0.00087) [2022-07-10 05:28:42,389][25689] Fps is (10 sec: 5701.4, 60 sec: 5623.8, 300 sec: 5630.2). Total num frames: 601084928. Throughput: 0: 5075.9. Samples: 601079904. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:42,390][25689] Avg episode reward: [(0, '-23.894')] [2022-07-10 05:28:42,404][26022] Updated weights on worker 0-0, policy_version 586998 (0.00080) [2022-07-10 05:28:44,311][26022] Updated weights on worker 0-0, policy_version 587008 (0.00090) [2022-07-10 05:28:46,084][26022] Updated weights on worker 0-0, policy_version 587018 (0.00091) [2022-07-10 05:28:47,404][25689] Fps is (10 sec: 5619.8, 60 sec: 5627.7, 300 sec: 5627.0). Total num frames: 601112576. Throughput: 0: 5927.6. Samples: 601113968. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:47,405][25689] Avg episode reward: [(0, '-24.034')] [2022-07-10 05:28:47,879][26022] Updated weights on worker 0-0, policy_version 587028 (0.00087) [2022-07-10 05:28:49,664][26022] Updated weights on worker 0-0, policy_version 587038 (0.00087) [2022-07-10 05:28:51,579][26022] Updated weights on worker 0-0, policy_version 587048 (0.00591) [2022-07-10 05:28:52,406][25689] Fps is (10 sec: 5622.1, 60 sec: 5614.4, 300 sec: 5624.9). Total num frames: 601141248. Throughput: 0: 5942.1. Samples: 601148124. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:52,407][25689] Avg episode reward: [(0, '-24.736')] [2022-07-10 05:28:53,499][26022] Updated weights on worker 0-0, policy_version 587058 (0.00082) [2022-07-10 05:28:55,231][26022] Updated weights on worker 0-0, policy_version 587068 (0.00080) [2022-07-10 05:28:56,946][26022] Updated weights on worker 0-0, policy_version 587078 (0.00087) [2022-07-10 05:28:57,485][25689] Fps is (10 sec: 5790.1, 60 sec: 5619.7, 300 sec: 5630.5). Total num frames: 601170944. Throughput: 0: 5077.6. Samples: 601165074. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:28:57,486][25689] Avg episode reward: [(0, '-24.943')] [2022-07-10 05:28:58,689][26022] Updated weights on worker 0-0, policy_version 587088 (0.00091) [2022-07-10 05:29:00,380][26022] Updated weights on worker 0-0, policy_version 587098 (0.00084) [2022-07-10 05:29:02,490][25689] Fps is (10 sec: 5585.5, 60 sec: 5637.8, 300 sec: 5634.4). Total num frames: 601197568. Throughput: 0: 5940.3. Samples: 601199310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:02,490][25689] Avg episode reward: [(0, '-24.761')] [2022-07-10 05:29:02,652][26022] Updated weights on worker 0-0, policy_version 587108 (0.00084) [2022-07-10 05:29:04,439][26022] Updated weights on worker 0-0, policy_version 587118 (0.00089) [2022-07-10 05:29:06,280][26022] Updated weights on worker 0-0, policy_version 587128 (0.00081) [2022-07-10 05:29:07,517][25689] Fps is (10 sec: 5409.9, 60 sec: 5640.9, 300 sec: 5627.3). Total num frames: 601225216. Throughput: 0: 5842.9. Samples: 601231484. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:07,518][25689] Avg episode reward: [(0, '-25.975')] [2022-07-10 05:29:08,251][26022] Updated weights on worker 0-0, policy_version 587138 (0.00095) [2022-07-10 05:29:09,971][26022] Updated weights on worker 0-0, policy_version 587148 (0.00104) [2022-07-10 05:29:11,810][26022] Updated weights on worker 0-0, policy_version 587158 (0.00089) [2022-07-10 05:29:12,544][25689] Fps is (10 sec: 5703.3, 60 sec: 5658.4, 300 sec: 5631.2). Total num frames: 601254912. Throughput: 0: 4989.7. Samples: 601248606. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:12,545][25689] Avg episode reward: [(0, '-26.329')] [2022-07-10 05:29:13,688][26022] Updated weights on worker 0-0, policy_version 587168 (0.00092) [2022-07-10 05:29:15,232][26022] Updated weights on worker 0-0, policy_version 587178 (0.00091) [2022-07-10 05:29:17,238][26022] Updated weights on worker 0-0, policy_version 587188 (0.00087) [2022-07-10 05:29:17,611][25689] Fps is (10 sec: 5579.6, 60 sec: 5630.1, 300 sec: 5630.3). Total num frames: 601281536. Throughput: 0: 5835.5. Samples: 601282518. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:17,613][25689] Avg episode reward: [(0, '-26.724')] [2022-07-10 05:29:18,844][26022] Updated weights on worker 0-0, policy_version 587198 (0.00092) [2022-07-10 05:29:20,863][26022] Updated weights on worker 0-0, policy_version 587208 (0.00085) [2022-07-10 05:29:22,527][26022] Updated weights on worker 0-0, policy_version 587218 (0.00085) [2022-07-10 05:29:22,664][25689] Fps is (10 sec: 5565.3, 60 sec: 5627.4, 300 sec: 5629.5). Total num frames: 601311232. Throughput: 0: 5805.7. Samples: 601316436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:22,665][25689] Avg episode reward: [(0, '-26.905')] [2022-07-10 05:29:24,390][26022] Updated weights on worker 0-0, policy_version 587228 (0.00098) [2022-07-10 05:29:26,158][26022] Updated weights on worker 0-0, policy_version 587238 (0.00088) [2022-07-10 05:29:27,727][25689] Fps is (10 sec: 5769.7, 60 sec: 5642.6, 300 sec: 5628.5). Total num frames: 601339904. Throughput: 0: 5048.1. Samples: 601333506. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:27,728][25689] Avg episode reward: [(0, '-26.332')] [2022-07-10 05:29:28,159][26022] Updated weights on worker 0-0, policy_version 587248 (0.00086) [2022-07-10 05:29:29,712][26022] Updated weights on worker 0-0, policy_version 587258 (0.00095) [2022-07-10 05:29:31,884][26022] Updated weights on worker 0-0, policy_version 587268 (0.00091) [2022-07-10 05:29:32,795][25689] Fps is (10 sec: 5660.5, 60 sec: 5639.6, 300 sec: 5631.9). Total num frames: 601368576. Throughput: 0: 5857.9. Samples: 601367230. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:32,795][25689] Avg episode reward: [(0, '-25.940')] [2022-07-10 05:29:33,364][26022] Updated weights on worker 0-0, policy_version 587278 (0.00083) [2022-07-10 05:29:35,378][26022] Updated weights on worker 0-0, policy_version 587288 (0.00093) [2022-07-10 05:29:37,122][26022] Updated weights on worker 0-0, policy_version 587298 (0.00087) [2022-07-10 05:29:37,838][25689] Fps is (10 sec: 5570.0, 60 sec: 5623.4, 300 sec: 5629.2). Total num frames: 601396224. Throughput: 0: 5873.4. Samples: 601401322. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:37,840][25689] Avg episode reward: [(0, '-25.401')] [2022-07-10 05:29:38,873][26022] Updated weights on worker 0-0, policy_version 587308 (0.00090) [2022-07-10 05:29:40,767][26022] Updated weights on worker 0-0, policy_version 587318 (0.00114) [2022-07-10 05:29:42,609][26022] Updated weights on worker 0-0, policy_version 587328 (0.00089) [2022-07-10 05:29:42,849][25689] Fps is (10 sec: 5601.5, 60 sec: 5623.0, 300 sec: 5626.3). Total num frames: 601424896. Throughput: 0: 5043.5. Samples: 601418240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:42,853][25689] Avg episode reward: [(0, '-25.395')] [2022-07-10 05:29:44,361][26022] Updated weights on worker 0-0, policy_version 587338 (0.00088) [2022-07-10 05:29:46,069][26022] Updated weights on worker 0-0, policy_version 587348 (0.00094) [2022-07-10 05:29:47,870][25689] Fps is (10 sec: 5614.1, 60 sec: 5622.5, 300 sec: 5629.5). Total num frames: 601452544. Throughput: 0: 5891.9. Samples: 601452186. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:47,872][25689] Avg episode reward: [(0, '-25.761')] [2022-07-10 05:29:48,016][26022] Updated weights on worker 0-0, policy_version 587358 (0.00085) [2022-07-10 05:29:49,810][26022] Updated weights on worker 0-0, policy_version 587368 (0.00088) [2022-07-10 05:29:51,744][26022] Updated weights on worker 0-0, policy_version 587378 (0.00087) [2022-07-10 05:29:52,882][25689] Fps is (10 sec: 5715.7, 60 sec: 5638.5, 300 sec: 5628.0). Total num frames: 601482240. Throughput: 0: 5939.0. Samples: 601486528. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:52,883][25689] Avg episode reward: [(0, '-25.312')] [2022-07-10 05:29:53,295][26022] Updated weights on worker 0-0, policy_version 587388 (0.00089) [2022-07-10 05:29:55,183][26022] Updated weights on worker 0-0, policy_version 587398 (0.00097) [2022-07-10 05:29:56,992][26022] Updated weights on worker 0-0, policy_version 587408 (0.00092) [2022-07-10 05:29:57,955][25689] Fps is (10 sec: 5686.3, 60 sec: 5605.2, 300 sec: 5627.6). Total num frames: 601509888. Throughput: 0: 5925.3. Samples: 601520518. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:29:57,955][25689] Avg episode reward: [(0, '-25.465')] [2022-07-10 05:29:58,793][26022] Updated weights on worker 0-0, policy_version 587418 (0.00086) [2022-07-10 05:30:00,518][26022] Updated weights on worker 0-0, policy_version 587428 (0.00087) [2022-07-10 05:30:02,828][26022] Updated weights on worker 0-0, policy_version 587438 (0.00085) [2022-07-10 05:30:03,015][25689] Fps is (10 sec: 5456.8, 60 sec: 5616.9, 300 sec: 5637.9). Total num frames: 601537536. Throughput: 0: 5919.5. Samples: 601537614. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:03,016][25689] Avg episode reward: [(0, '-24.875')] [2022-07-10 05:30:04,164][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:30:04,179][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000587446_601544704.pth [2022-07-10 05:30:04,180][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000585464_599515136.pth [2022-07-10 05:30:04,533][26022] Updated weights on worker 0-0, policy_version 587448 (0.00086) [2022-07-10 05:30:06,461][26022] Updated weights on worker 0-0, policy_version 587458 (0.00053) [2022-07-10 05:30:08,074][25689] Fps is (10 sec: 5565.8, 60 sec: 5631.0, 300 sec: 5626.8). Total num frames: 601566208. Throughput: 0: 5803.4. Samples: 601569436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:08,074][25689] Avg episode reward: [(0, '-25.588')] [2022-07-10 05:30:08,132][26022] Updated weights on worker 0-0, policy_version 587468 (0.00089) [2022-07-10 05:30:10,026][26022] Updated weights on worker 0-0, policy_version 587478 (0.00086) [2022-07-10 05:30:12,082][26022] Updated weights on worker 0-0, policy_version 587488 (0.00083) [2022-07-10 05:30:13,120][25689] Fps is (10 sec: 5472.4, 60 sec: 5578.5, 300 sec: 5626.9). Total num frames: 601592832. Throughput: 0: 5773.1. Samples: 601603364. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:13,121][25689] Avg episode reward: [(0, '-24.686')] [2022-07-10 05:30:13,570][26022] Updated weights on worker 0-0, policy_version 587498 (0.00078) [2022-07-10 05:30:15,571][26022] Updated weights on worker 0-0, policy_version 587508 (0.00093) [2022-07-10 05:30:17,127][26022] Updated weights on worker 0-0, policy_version 587518 (0.00086) [2022-07-10 05:30:18,207][25689] Fps is (10 sec: 5658.7, 60 sec: 5644.1, 300 sec: 5625.5). Total num frames: 601623552. Throughput: 0: 4942.2. Samples: 601620610. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:18,208][25689] Avg episode reward: [(0, '-24.143')] [2022-07-10 05:30:19,183][26022] Updated weights on worker 0-0, policy_version 587528 (0.00094) [2022-07-10 05:30:20,882][26022] Updated weights on worker 0-0, policy_version 587538 (0.00093) [2022-07-10 05:30:22,796][26022] Updated weights on worker 0-0, policy_version 587548 (0.00080) [2022-07-10 05:30:23,220][25689] Fps is (10 sec: 5779.1, 60 sec: 5614.1, 300 sec: 5630.1). Total num frames: 601651200. Throughput: 0: 5795.3. Samples: 601654704. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:23,220][25689] Avg episode reward: [(0, '-24.562')] [2022-07-10 05:30:24,438][26022] Updated weights on worker 0-0, policy_version 587558 (0.00083) [2022-07-10 05:30:26,363][26022] Updated weights on worker 0-0, policy_version 587568 (0.00086) [2022-07-10 05:30:28,003][26022] Updated weights on worker 0-0, policy_version 587578 (0.00091) [2022-07-10 05:30:28,252][25689] Fps is (10 sec: 5708.7, 60 sec: 5633.9, 300 sec: 5636.7). Total num frames: 601680896. Throughput: 0: 5930.0. Samples: 601689096. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:28,253][25689] Avg episode reward: [(0, '-25.265')] [2022-07-10 05:30:29,855][26022] Updated weights on worker 0-0, policy_version 587588 (0.00100) [2022-07-10 05:30:31,463][26022] Updated weights on worker 0-0, policy_version 587598 (0.00083) [2022-07-10 05:30:33,273][25689] Fps is (10 sec: 5805.7, 60 sec: 5638.2, 300 sec: 5630.6). Total num frames: 601709568. Throughput: 0: 5101.8. Samples: 601706182. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:33,274][25689] Avg episode reward: [(0, '-25.563')] [2022-07-10 05:30:33,326][26022] Updated weights on worker 0-0, policy_version 587608 (0.00090) [2022-07-10 05:30:35,232][26022] Updated weights on worker 0-0, policy_version 587618 (0.00387) [2022-07-10 05:30:37,096][26022] Updated weights on worker 0-0, policy_version 587628 (0.00090) [2022-07-10 05:30:38,365][25689] Fps is (10 sec: 5569.2, 60 sec: 5633.7, 300 sec: 5636.3). Total num frames: 601737216. Throughput: 0: 5927.9. Samples: 601740104. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:38,366][25689] Avg episode reward: [(0, '-26.346')] [2022-07-10 05:30:38,868][26022] Updated weights on worker 0-0, policy_version 587638 (0.00096) [2022-07-10 05:30:40,616][26022] Updated weights on worker 0-0, policy_version 587648 (0.00089) [2022-07-10 05:30:42,531][26022] Updated weights on worker 0-0, policy_version 587658 (0.00096) [2022-07-10 05:30:43,382][25689] Fps is (10 sec: 5470.2, 60 sec: 5616.3, 300 sec: 5633.2). Total num frames: 601764864. Throughput: 0: 5909.8. Samples: 601773858. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:43,382][25689] Avg episode reward: [(0, '-26.410')] [2022-07-10 05:30:44,318][26022] Updated weights on worker 0-0, policy_version 587668 (0.00093) [2022-07-10 05:30:46,277][26022] Updated weights on worker 0-0, policy_version 587678 (0.00081) [2022-07-10 05:30:48,109][26022] Updated weights on worker 0-0, policy_version 587688 (0.00088) [2022-07-10 05:30:48,384][25689] Fps is (10 sec: 5723.4, 60 sec: 5651.8, 300 sec: 5633.4). Total num frames: 601794560. Throughput: 0: 5034.5. Samples: 601790450. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:48,386][25689] Avg episode reward: [(0, '-27.615')] [2022-07-10 05:30:49,852][26022] Updated weights on worker 0-0, policy_version 587698 (0.00091) [2022-07-10 05:30:51,623][26022] Updated weights on worker 0-0, policy_version 587708 (0.00088) [2022-07-10 05:30:53,419][25689] Fps is (10 sec: 5713.1, 60 sec: 5615.8, 300 sec: 5633.7). Total num frames: 601822208. Throughput: 0: 5867.8. Samples: 601824394. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:53,420][25689] Avg episode reward: [(0, '-27.505')] [2022-07-10 05:30:53,590][26022] Updated weights on worker 0-0, policy_version 587718 (0.00093) [2022-07-10 05:30:55,185][26022] Updated weights on worker 0-0, policy_version 587728 (0.00288) [2022-07-10 05:30:57,146][26022] Updated weights on worker 0-0, policy_version 587738 (0.00086) [2022-07-10 05:30:58,505][25689] Fps is (10 sec: 5666.3, 60 sec: 5648.5, 300 sec: 5632.2). Total num frames: 601851904. Throughput: 0: 5880.2. Samples: 601858528. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:30:58,506][25689] Avg episode reward: [(0, '-27.347')] [2022-07-10 05:30:59,056][26022] Updated weights on worker 0-0, policy_version 587748 (0.00090) [2022-07-10 05:31:00,713][26022] Updated weights on worker 0-0, policy_version 587758 (0.00084) [2022-07-10 05:31:02,801][26022] Updated weights on worker 0-0, policy_version 587768 (0.00093) [2022-07-10 05:31:03,526][25689] Fps is (10 sec: 5471.3, 60 sec: 5618.3, 300 sec: 5631.9). Total num frames: 601877504. Throughput: 0: 5048.1. Samples: 601875546. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:31:03,528][25689] Avg episode reward: [(0, '-27.135')] [2022-07-10 05:31:04,648][26022] Updated weights on worker 0-0, policy_version 587778 (0.00089) [2022-07-10 05:31:06,403][26022] Updated weights on worker 0-0, policy_version 587788 (0.00091) [2022-07-10 05:31:08,470][26022] Updated weights on worker 0-0, policy_version 587798 (0.00097) [2022-07-10 05:31:08,529][25689] Fps is (10 sec: 5414.2, 60 sec: 5623.5, 300 sec: 5630.1). Total num frames: 601906176. Throughput: 0: 5821.0. Samples: 601907710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 05:31:08,529][25689] Avg episode reward: [(0, '-26.650')] [2022-07-10 05:31:10,019][26022] Updated weights on worker 0-0, policy_version 587808 (0.00090) [2022-07-10 05:31:12,105][26022] Updated weights on worker 0-0, policy_version 587818 (0.00087) [2022-07-10 05:31:13,557][25689] Fps is (10 sec: 5818.8, 60 sec: 5676.0, 300 sec: 5637.8). Total num frames: 601935872. Throughput: 0: 5850.1. Samples: 601942200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:13,557][25689] Avg episode reward: [(0, '-26.071')] [2022-07-10 05:31:13,563][26022] Updated weights on worker 0-0, policy_version 587828 (0.00094) [2022-07-10 05:31:15,606][26022] Updated weights on worker 0-0, policy_version 587838 (0.00086) [2022-07-10 05:31:17,315][26022] Updated weights on worker 0-0, policy_version 587848 (0.00082) [2022-07-10 05:31:18,630][25689] Fps is (10 sec: 5676.9, 60 sec: 5626.5, 300 sec: 5634.2). Total num frames: 601963520. Throughput: 0: 4990.4. Samples: 601958960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:18,630][25689] Avg episode reward: [(0, '-26.774')] [2022-07-10 05:31:19,237][26022] Updated weights on worker 0-0, policy_version 587858 (0.00087) [2022-07-10 05:31:20,871][26022] Updated weights on worker 0-0, policy_version 587868 (0.00096) [2022-07-10 05:31:22,747][26022] Updated weights on worker 0-0, policy_version 587878 (0.00089) [2022-07-10 05:31:23,643][25689] Fps is (10 sec: 5482.4, 60 sec: 5626.5, 300 sec: 5630.9). Total num frames: 601991168. Throughput: 0: 5829.7. Samples: 601992822. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:23,643][25689] Avg episode reward: [(0, '-25.462')] [2022-07-10 05:31:24,451][26022] Updated weights on worker 0-0, policy_version 587888 (0.00088) [2022-07-10 05:31:26,524][26022] Updated weights on worker 0-0, policy_version 587898 (0.00086) [2022-07-10 05:31:28,019][26022] Updated weights on worker 0-0, policy_version 587908 (0.00084) [2022-07-10 05:31:28,656][25689] Fps is (10 sec: 5719.2, 60 sec: 5628.2, 300 sec: 5637.9). Total num frames: 602020864. Throughput: 0: 5917.0. Samples: 602026806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:28,657][25689] Avg episode reward: [(0, '-25.803')] [2022-07-10 05:31:30,174][26022] Updated weights on worker 0-0, policy_version 587918 (0.00091) [2022-07-10 05:31:31,716][26022] Updated weights on worker 0-0, policy_version 587928 (0.00097) [2022-07-10 05:31:33,679][25689] Fps is (10 sec: 5611.5, 60 sec: 5594.2, 300 sec: 5628.3). Total num frames: 602047488. Throughput: 0: 5041.8. Samples: 602043654. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:33,680][25689] Avg episode reward: [(0, '-26.934')] [2022-07-10 05:31:33,719][26022] Updated weights on worker 0-0, policy_version 587938 (0.00089) [2022-07-10 05:31:35,383][26022] Updated weights on worker 0-0, policy_version 587948 (0.00091) [2022-07-10 05:31:37,463][26022] Updated weights on worker 0-0, policy_version 587958 (0.00087) [2022-07-10 05:31:38,737][25689] Fps is (10 sec: 5586.9, 60 sec: 5631.2, 300 sec: 5635.6). Total num frames: 602077184. Throughput: 0: 5900.6. Samples: 602077604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:38,741][25689] Avg episode reward: [(0, '-26.934')] [2022-07-10 05:31:38,986][26022] Updated weights on worker 0-0, policy_version 587968 (0.00083) [2022-07-10 05:31:41,134][26022] Updated weights on worker 0-0, policy_version 587978 (0.00088) [2022-07-10 05:31:42,558][26022] Updated weights on worker 0-0, policy_version 587988 (0.00089) [2022-07-10 05:31:43,744][25689] Fps is (10 sec: 5697.2, 60 sec: 5632.1, 300 sec: 5632.6). Total num frames: 602104832. Throughput: 0: 5911.2. Samples: 602111648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:43,745][25689] Avg episode reward: [(0, '-27.151')] [2022-07-10 05:31:44,736][26022] Updated weights on worker 0-0, policy_version 587998 (0.00093) [2022-07-10 05:31:46,256][26022] Updated weights on worker 0-0, policy_version 588008 (0.00086) [2022-07-10 05:31:48,274][26022] Updated weights on worker 0-0, policy_version 588018 (0.00442) [2022-07-10 05:31:48,759][25689] Fps is (10 sec: 5619.6, 60 sec: 5614.0, 300 sec: 5630.0). Total num frames: 602133504. Throughput: 0: 5054.2. Samples: 602128410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:48,760][25689] Avg episode reward: [(0, '-26.353')] [2022-07-10 05:31:49,907][26022] Updated weights on worker 0-0, policy_version 588028 (0.00090) [2022-07-10 05:31:51,947][26022] Updated weights on worker 0-0, policy_version 588038 (0.00081) [2022-07-10 05:31:53,424][26022] Updated weights on worker 0-0, policy_version 588048 (0.00088) [2022-07-10 05:31:53,773][25689] Fps is (10 sec: 5718.2, 60 sec: 5632.9, 300 sec: 5631.5). Total num frames: 602162176. Throughput: 0: 5897.1. Samples: 602162148. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:53,773][25689] Avg episode reward: [(0, '-26.531')] [2022-07-10 05:31:55,734][26022] Updated weights on worker 0-0, policy_version 588058 (0.00077) [2022-07-10 05:31:57,188][26022] Updated weights on worker 0-0, policy_version 588068 (0.00093) [2022-07-10 05:31:58,827][25689] Fps is (10 sec: 5492.6, 60 sec: 5585.0, 300 sec: 5623.8). Total num frames: 602188800. Throughput: 0: 5876.5. Samples: 602195660. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:31:58,827][25689] Avg episode reward: [(0, '-25.952')] [2022-07-10 05:31:59,259][26022] Updated weights on worker 0-0, policy_version 588078 (0.00103) [2022-07-10 05:32:00,782][26022] Updated weights on worker 0-0, policy_version 588088 (0.00090) [2022-07-10 05:32:03,103][26022] Updated weights on worker 0-0, policy_version 588098 (0.00088) [2022-07-10 05:32:03,838][25689] Fps is (10 sec: 5391.9, 60 sec: 5619.8, 300 sec: 5629.1). Total num frames: 602216448. Throughput: 0: 5028.6. Samples: 602212692. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:03,839][25689] Avg episode reward: [(0, '-25.434')] [2022-07-10 05:32:04,234][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:32:04,246][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000588105_602219520.pth [2022-07-10 05:32:04,246][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000586124_600190976.pth [2022-07-10 05:32:04,818][26022] Updated weights on worker 0-0, policy_version 588108 (0.00084) [2022-07-10 05:32:06,556][26022] Updated weights on worker 0-0, policy_version 588118 (0.00086) [2022-07-10 05:32:08,599][26022] Updated weights on worker 0-0, policy_version 588128 (0.00090) [2022-07-10 05:32:08,867][25689] Fps is (10 sec: 5507.5, 60 sec: 5600.4, 300 sec: 5625.2). Total num frames: 602244096. Throughput: 0: 5787.6. Samples: 602244784. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:08,869][25689] Avg episode reward: [(0, '-26.651')] [2022-07-10 05:32:10,284][26022] Updated weights on worker 0-0, policy_version 588138 (0.00084) [2022-07-10 05:32:12,167][26022] Updated weights on worker 0-0, policy_version 588148 (0.00083) [2022-07-10 05:32:13,890][25689] Fps is (10 sec: 5603.0, 60 sec: 5583.9, 300 sec: 5624.0). Total num frames: 602272768. Throughput: 0: 5809.7. Samples: 602279024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:13,891][25689] Avg episode reward: [(0, '-25.692')] [2022-07-10 05:32:13,927][26022] Updated weights on worker 0-0, policy_version 588158 (0.00100) [2022-07-10 05:32:15,863][26022] Updated weights on worker 0-0, policy_version 588168 (0.00087) [2022-07-10 05:32:17,471][26022] Updated weights on worker 0-0, policy_version 588178 (0.00091) [2022-07-10 05:32:18,981][25689] Fps is (10 sec: 5568.6, 60 sec: 5582.3, 300 sec: 5619.7). Total num frames: 602300416. Throughput: 0: 4965.7. Samples: 602295738. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:18,981][25689] Avg episode reward: [(0, '-25.427')] [2022-07-10 05:32:19,566][26022] Updated weights on worker 0-0, policy_version 588188 (0.00084) [2022-07-10 05:32:21,063][26022] Updated weights on worker 0-0, policy_version 588198 (0.00084) [2022-07-10 05:32:22,985][26022] Updated weights on worker 0-0, policy_version 588208 (0.00088) [2022-07-10 05:32:23,992][25689] Fps is (10 sec: 5676.8, 60 sec: 5616.4, 300 sec: 5619.7). Total num frames: 602330112. Throughput: 0: 5821.8. Samples: 602330020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:23,992][25689] Avg episode reward: [(0, '-24.955')] [2022-07-10 05:32:24,564][26022] Updated weights on worker 0-0, policy_version 588218 (0.00096) [2022-07-10 05:32:26,671][26022] Updated weights on worker 0-0, policy_version 588228 (0.00092) [2022-07-10 05:32:28,314][26022] Updated weights on worker 0-0, policy_version 588238 (0.00086) [2022-07-10 05:32:29,020][25689] Fps is (10 sec: 5813.8, 60 sec: 5598.1, 300 sec: 5623.9). Total num frames: 602358784. Throughput: 0: 5925.7. Samples: 602364206. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:29,021][25689] Avg episode reward: [(0, '-24.642')] [2022-07-10 05:32:30,172][26022] Updated weights on worker 0-0, policy_version 588248 (0.00091) [2022-07-10 05:32:31,676][26022] Updated weights on worker 0-0, policy_version 588258 (0.00087) [2022-07-10 05:32:33,998][26022] Updated weights on worker 0-0, policy_version 588268 (0.00082) [2022-07-10 05:32:34,046][25689] Fps is (10 sec: 5601.6, 60 sec: 5614.8, 300 sec: 5617.4). Total num frames: 602386432. Throughput: 0: 5080.3. Samples: 602381420. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:34,046][25689] Avg episode reward: [(0, '-23.138')] [2022-07-10 05:32:35,478][26022] Updated weights on worker 0-0, policy_version 588278 (0.00052) [2022-07-10 05:32:37,338][26022] Updated weights on worker 0-0, policy_version 588288 (0.00091) [2022-07-10 05:32:39,162][25689] Fps is (10 sec: 5654.2, 60 sec: 5609.4, 300 sec: 5622.6). Total num frames: 602416128. Throughput: 0: 5929.6. Samples: 602415406. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:39,163][25689] Avg episode reward: [(0, '-22.136')] [2022-07-10 05:32:39,242][26022] Updated weights on worker 0-0, policy_version 588298 (0.00079) [2022-07-10 05:32:41,153][26022] Updated weights on worker 0-0, policy_version 588308 (0.00086) [2022-07-10 05:32:42,808][26022] Updated weights on worker 0-0, policy_version 588318 (0.00085) [2022-07-10 05:32:44,223][25689] Fps is (10 sec: 5735.2, 60 sec: 5621.3, 300 sec: 5626.0). Total num frames: 602444800. Throughput: 0: 5893.0. Samples: 602449244. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:44,223][25689] Avg episode reward: [(0, '-23.078')] [2022-07-10 05:32:44,813][26022] Updated weights on worker 0-0, policy_version 588328 (0.00093) [2022-07-10 05:32:46,339][26022] Updated weights on worker 0-0, policy_version 588338 (0.00091) [2022-07-10 05:32:48,503][26022] Updated weights on worker 0-0, policy_version 588348 (0.00094) [2022-07-10 05:32:49,268][25689] Fps is (10 sec: 5674.3, 60 sec: 5618.5, 300 sec: 5622.4). Total num frames: 602473472. Throughput: 0: 5041.0. Samples: 602466276. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:49,268][25689] Avg episode reward: [(0, '-23.544')] [2022-07-10 05:32:50,096][26022] Updated weights on worker 0-0, policy_version 588359 (0.00089) [2022-07-10 05:32:52,233][26022] Updated weights on worker 0-0, policy_version 588369 (0.00083) [2022-07-10 05:32:53,811][26022] Updated weights on worker 0-0, policy_version 588379 (0.00114) [2022-07-10 05:32:54,291][25689] Fps is (10 sec: 5695.6, 60 sec: 5617.6, 300 sec: 5621.1). Total num frames: 602502144. Throughput: 0: 5884.2. Samples: 602500546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:54,292][25689] Avg episode reward: [(0, '-24.007')] [2022-07-10 05:32:55,655][26022] Updated weights on worker 0-0, policy_version 588389 (0.00091) [2022-07-10 05:32:57,530][26022] Updated weights on worker 0-0, policy_version 588399 (0.00092) [2022-07-10 05:32:59,299][26022] Updated weights on worker 0-0, policy_version 588409 (0.00090) [2022-07-10 05:32:59,361][25689] Fps is (10 sec: 5681.5, 60 sec: 5649.9, 300 sec: 5630.4). Total num frames: 602530816. Throughput: 0: 5912.7. Samples: 602534838. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:32:59,362][25689] Avg episode reward: [(0, '-24.479')] [2022-07-10 05:33:01,018][26022] Updated weights on worker 0-0, policy_version 588419 (0.00084) [2022-07-10 05:33:03,304][26022] Updated weights on worker 0-0, policy_version 588429 (0.00071) [2022-07-10 05:33:04,377][25689] Fps is (10 sec: 5482.2, 60 sec: 5632.6, 300 sec: 5627.8). Total num frames: 602557440. Throughput: 0: 5068.2. Samples: 602551390. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:04,378][25689] Avg episode reward: [(0, '-24.866')] [2022-07-10 05:33:05,004][26022] Updated weights on worker 0-0, policy_version 588439 (0.00093) [2022-07-10 05:33:06,987][26022] Updated weights on worker 0-0, policy_version 588449 (0.00086) [2022-07-10 05:33:08,731][26022] Updated weights on worker 0-0, policy_version 588459 (0.00084) [2022-07-10 05:33:09,382][25689] Fps is (10 sec: 5313.9, 60 sec: 5617.9, 300 sec: 5621.5). Total num frames: 602584064. Throughput: 0: 5849.6. Samples: 602583934. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:09,382][25689] Avg episode reward: [(0, '-24.723')] [2022-07-10 05:33:10,575][26022] Updated weights on worker 0-0, policy_version 588469 (0.00090) [2022-07-10 05:33:12,474][26022] Updated weights on worker 0-0, policy_version 588479 (0.00094) [2022-07-10 05:33:14,263][26022] Updated weights on worker 0-0, policy_version 588489 (0.00087) [2022-07-10 05:33:14,386][25689] Fps is (10 sec: 5627.0, 60 sec: 5636.6, 300 sec: 5627.3). Total num frames: 602613760. Throughput: 0: 5831.1. Samples: 602617724. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:14,387][25689] Avg episode reward: [(0, '-24.558')] [2022-07-10 05:33:16,120][26022] Updated weights on worker 0-0, policy_version 588499 (0.00089) [2022-07-10 05:33:17,875][26022] Updated weights on worker 0-0, policy_version 588509 (0.00100) [2022-07-10 05:33:19,448][25689] Fps is (10 sec: 5798.3, 60 sec: 5656.2, 300 sec: 5623.1). Total num frames: 602642432. Throughput: 0: 5793.3. Samples: 602651208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:19,449][25689] Avg episode reward: [(0, '-24.900')] [2022-07-10 05:33:19,647][26022] Updated weights on worker 0-0, policy_version 588519 (0.00102) [2022-07-10 05:33:21,473][26022] Updated weights on worker 0-0, policy_version 588529 (0.00091) [2022-07-10 05:33:23,358][26022] Updated weights on worker 0-0, policy_version 588539 (0.00085) [2022-07-10 05:33:24,522][25689] Fps is (10 sec: 5556.8, 60 sec: 5616.5, 300 sec: 5622.6). Total num frames: 602670080. Throughput: 0: 5796.8. Samples: 602668162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:24,523][25689] Avg episode reward: [(0, '-24.954')] [2022-07-10 05:33:25,244][26022] Updated weights on worker 0-0, policy_version 588549 (0.00091) [2022-07-10 05:33:27,031][26022] Updated weights on worker 0-0, policy_version 588559 (0.00101) [2022-07-10 05:33:28,948][26022] Updated weights on worker 0-0, policy_version 588569 (0.00088) [2022-07-10 05:33:29,527][25689] Fps is (10 sec: 5486.2, 60 sec: 5601.7, 300 sec: 5619.7). Total num frames: 602697728. Throughput: 0: 5835.1. Samples: 602701486. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:29,528][25689] Avg episode reward: [(0, '-25.887')] [2022-07-10 05:33:30,577][26022] Updated weights on worker 0-0, policy_version 588579 (0.00091) [2022-07-10 05:33:32,629][26022] Updated weights on worker 0-0, policy_version 588589 (0.00070) [2022-07-10 05:33:34,151][26022] Updated weights on worker 0-0, policy_version 588599 (0.00085) [2022-07-10 05:33:34,575][25689] Fps is (10 sec: 5602.2, 60 sec: 5616.6, 300 sec: 5619.7). Total num frames: 602726400. Throughput: 0: 5841.8. Samples: 602735662. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:34,576][25689] Avg episode reward: [(0, '-27.478')] [2022-07-10 05:33:36,147][26022] Updated weights on worker 0-0, policy_version 588609 (0.00111) [2022-07-10 05:33:37,787][26022] Updated weights on worker 0-0, policy_version 588619 (0.00088) [2022-07-10 05:33:39,608][26022] Updated weights on worker 0-0, policy_version 588629 (0.00095) [2022-07-10 05:33:39,658][25689] Fps is (10 sec: 5761.7, 60 sec: 5619.7, 300 sec: 5621.7). Total num frames: 602756096. Throughput: 0: 5024.4. Samples: 602752748. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:39,658][25689] Avg episode reward: [(0, '-26.378')] [2022-07-10 05:33:41,471][26022] Updated weights on worker 0-0, policy_version 588639 (0.00087) [2022-07-10 05:33:43,092][26022] Updated weights on worker 0-0, policy_version 588649 (0.00088) [2022-07-10 05:33:44,698][25689] Fps is (10 sec: 5664.6, 60 sec: 5604.7, 300 sec: 5621.3). Total num frames: 602783744. Throughput: 0: 5911.5. Samples: 602787434. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:44,701][25689] Avg episode reward: [(0, '-26.257')] [2022-07-10 05:33:45,092][26022] Updated weights on worker 0-0, policy_version 588659 (0.00087) [2022-07-10 05:33:46,927][26022] Updated weights on worker 0-0, policy_version 588669 (0.00086) [2022-07-10 05:33:48,676][26022] Updated weights on worker 0-0, policy_version 588679 (0.00087) [2022-07-10 05:33:49,712][25689] Fps is (10 sec: 5601.4, 60 sec: 5607.5, 300 sec: 5621.0). Total num frames: 602812416. Throughput: 0: 5933.1. Samples: 602821246. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:49,713][25689] Avg episode reward: [(0, '-24.870')] [2022-07-10 05:33:50,595][26022] Updated weights on worker 0-0, policy_version 588689 (0.00096) [2022-07-10 05:33:52,463][26022] Updated weights on worker 0-0, policy_version 588699 (0.00085) [2022-07-10 05:33:54,163][26022] Updated weights on worker 0-0, policy_version 588709 (0.00079) [2022-07-10 05:33:54,719][25689] Fps is (10 sec: 5722.1, 60 sec: 5609.0, 300 sec: 5618.9). Total num frames: 602841088. Throughput: 0: 5073.0. Samples: 602837856. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:54,720][25689] Avg episode reward: [(0, '-24.310')] [2022-07-10 05:33:56,178][26022] Updated weights on worker 0-0, policy_version 588719 (0.00100) [2022-07-10 05:33:57,740][26022] Updated weights on worker 0-0, policy_version 588729 (0.00090) [2022-07-10 05:33:59,787][25689] Fps is (10 sec: 5488.7, 60 sec: 5575.4, 300 sec: 5617.8). Total num frames: 602867712. Throughput: 0: 5915.0. Samples: 602871812. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:33:59,787][25689] Avg episode reward: [(0, '-24.950')] [2022-07-10 05:33:59,791][26022] Updated weights on worker 0-0, policy_version 588739 (0.00087) [2022-07-10 05:34:01,182][26022] Updated weights on worker 0-0, policy_version 588749 (0.00093) [2022-07-10 05:34:03,680][26022] Updated weights on worker 0-0, policy_version 588759 (0.00090) [2022-07-10 05:34:04,278][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:34:04,287][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000588763_602893312.pth [2022-07-10 05:34:04,287][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000586786_600868864.pth [2022-07-10 05:34:04,809][25689] Fps is (10 sec: 5480.3, 60 sec: 5608.7, 300 sec: 5621.3). Total num frames: 602896384. Throughput: 0: 5795.3. Samples: 602903986. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:34:04,812][25689] Avg episode reward: [(0, '-23.845')] [2022-07-10 05:34:05,329][26022] Updated weights on worker 0-0, policy_version 588769 (0.00659) [2022-07-10 05:34:07,304][26022] Updated weights on worker 0-0, policy_version 588779 (0.00090) [2022-07-10 05:34:09,070][26022] Updated weights on worker 0-0, policy_version 588789 (0.00093) [2022-07-10 05:34:09,815][25689] Fps is (10 sec: 5616.4, 60 sec: 5625.5, 300 sec: 5614.8). Total num frames: 602924032. Throughput: 0: 4963.5. Samples: 602921024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:34:09,816][25689] Avg episode reward: [(0, '-24.591')] [2022-07-10 05:34:10,743][26022] Updated weights on worker 0-0, policy_version 588799 (0.00098) [2022-07-10 05:34:12,527][26022] Updated weights on worker 0-0, policy_version 588809 (0.00086) [2022-07-10 05:34:14,582][26022] Updated weights on worker 0-0, policy_version 588819 (0.00086) [2022-07-10 05:34:14,824][25689] Fps is (10 sec: 5521.6, 60 sec: 5591.2, 300 sec: 5619.3). Total num frames: 602951680. Throughput: 0: 5821.9. Samples: 602954902. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 05:34:14,824][25689] Avg episode reward: [(0, '-25.393')] [2022-07-10 05:34:16,124][26022] Updated weights on worker 0-0, policy_version 588829 (0.00087) [2022-07-10 05:34:18,309][26022] Updated weights on worker 0-0, policy_version 588839 (0.00088) [2022-07-10 05:34:19,864][26022] Updated weights on worker 0-0, policy_version 588849 (0.00094) [2022-07-10 05:34:19,943][25689] Fps is (10 sec: 5661.8, 60 sec: 5602.9, 300 sec: 5618.1). Total num frames: 602981376. Throughput: 0: 5803.8. Samples: 602988794. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:19,943][25689] Avg episode reward: [(0, '-25.813')] [2022-07-10 05:34:21,692][26022] Updated weights on worker 0-0, policy_version 588859 (0.00095) [2022-07-10 05:34:23,758][26022] Updated weights on worker 0-0, policy_version 588869 (0.00084) [2022-07-10 05:34:24,958][25689] Fps is (10 sec: 5759.6, 60 sec: 5625.2, 300 sec: 5619.0). Total num frames: 603010048. Throughput: 0: 5055.2. Samples: 603005842. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:24,959][25689] Avg episode reward: [(0, '-25.757')] [2022-07-10 05:34:25,419][26022] Updated weights on worker 0-0, policy_version 588879 (0.00084) [2022-07-10 05:34:27,325][26022] Updated weights on worker 0-0, policy_version 588889 (0.00091) [2022-07-10 05:34:29,156][26022] Updated weights on worker 0-0, policy_version 588899 (0.00575) [2022-07-10 05:34:29,963][25689] Fps is (10 sec: 5518.5, 60 sec: 5608.3, 300 sec: 5613.3). Total num frames: 603036672. Throughput: 0: 5887.0. Samples: 603039638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:29,964][25689] Avg episode reward: [(0, '-24.850')] [2022-07-10 05:34:30,723][26022] Updated weights on worker 0-0, policy_version 588909 (0.00083) [2022-07-10 05:34:32,645][26022] Updated weights on worker 0-0, policy_version 588919 (0.00087) [2022-07-10 05:34:34,075][26022] Updated weights on worker 0-0, policy_version 588929 (0.00958) [2022-07-10 05:34:34,970][25689] Fps is (10 sec: 5523.2, 60 sec: 5612.1, 300 sec: 5617.4). Total num frames: 603065344. Throughput: 0: 5902.3. Samples: 603073808. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:34,971][25689] Avg episode reward: [(0, '-25.667')] [2022-07-10 05:34:36,285][26022] Updated weights on worker 0-0, policy_version 588939 (0.00101) [2022-07-10 05:34:38,089][26022] Updated weights on worker 0-0, policy_version 588949 (0.00096) [2022-07-10 05:34:39,870][26022] Updated weights on worker 0-0, policy_version 588959 (0.00088) [2022-07-10 05:34:40,043][25689] Fps is (10 sec: 5790.2, 60 sec: 5613.0, 300 sec: 5619.7). Total num frames: 603095040. Throughput: 0: 5070.1. Samples: 603090706. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:40,044][25689] Avg episode reward: [(0, '-26.042')] [2022-07-10 05:34:41,541][26022] Updated weights on worker 0-0, policy_version 588969 (0.00085) [2022-07-10 05:34:43,285][26022] Updated weights on worker 0-0, policy_version 588979 (0.00094) [2022-07-10 05:34:45,068][25689] Fps is (10 sec: 5779.9, 60 sec: 5631.4, 300 sec: 5623.1). Total num frames: 603123712. Throughput: 0: 5920.7. Samples: 603124908. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:45,069][25689] Avg episode reward: [(0, '-25.769')] [2022-07-10 05:34:45,123][26022] Updated weights on worker 0-0, policy_version 588989 (0.00052) [2022-07-10 05:34:47,203][26022] Updated weights on worker 0-0, policy_version 588999 (0.00088) [2022-07-10 05:34:48,694][26022] Updated weights on worker 0-0, policy_version 589009 (0.00090) [2022-07-10 05:34:50,078][25689] Fps is (10 sec: 5510.3, 60 sec: 5597.9, 300 sec: 5612.8). Total num frames: 603150336. Throughput: 0: 5931.7. Samples: 603158956. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:50,079][25689] Avg episode reward: [(0, '-25.652')] [2022-07-10 05:34:50,651][26022] Updated weights on worker 0-0, policy_version 589019 (0.00100) [2022-07-10 05:34:52,463][26022] Updated weights on worker 0-0, policy_version 589029 (0.00092) [2022-07-10 05:34:54,219][26022] Updated weights on worker 0-0, policy_version 589039 (0.00085) [2022-07-10 05:34:55,097][25689] Fps is (10 sec: 5615.9, 60 sec: 5613.8, 300 sec: 5620.7). Total num frames: 603180032. Throughput: 0: 5077.7. Samples: 603176008. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:34:55,099][25689] Avg episode reward: [(0, '-25.014')] [2022-07-10 05:34:56,133][26022] Updated weights on worker 0-0, policy_version 589049 (0.00091) [2022-07-10 05:34:57,826][26022] Updated weights on worker 0-0, policy_version 589059 (0.00085) [2022-07-10 05:34:59,803][26022] Updated weights on worker 0-0, policy_version 589069 (0.00095) [2022-07-10 05:35:00,166][25689] Fps is (10 sec: 5785.8, 60 sec: 5647.5, 300 sec: 5624.0). Total num frames: 603208704. Throughput: 0: 5919.8. Samples: 603209832. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:00,168][25689] Avg episode reward: [(0, '-24.345')] [2022-07-10 05:35:01,526][26022] Updated weights on worker 0-0, policy_version 589079 (0.00103) [2022-07-10 05:35:03,836][26022] Updated weights on worker 0-0, policy_version 589089 (0.00092) [2022-07-10 05:35:05,204][25689] Fps is (10 sec: 5471.0, 60 sec: 5612.2, 300 sec: 5617.5). Total num frames: 603235328. Throughput: 0: 5798.5. Samples: 603241666. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:05,205][25689] Avg episode reward: [(0, '-23.234')] [2022-07-10 05:35:05,524][26022] Updated weights on worker 0-0, policy_version 589099 (0.00090) [2022-07-10 05:35:07,329][26022] Updated weights on worker 0-0, policy_version 589109 (0.00086) [2022-07-10 05:35:09,222][26022] Updated weights on worker 0-0, policy_version 589119 (0.00089) [2022-07-10 05:35:10,225][25689] Fps is (10 sec: 5395.4, 60 sec: 5610.7, 300 sec: 5621.4). Total num frames: 603262976. Throughput: 0: 4952.7. Samples: 603258738. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:10,226][25689] Avg episode reward: [(0, '-23.248')] [2022-07-10 05:35:10,822][26022] Updated weights on worker 0-0, policy_version 589129 (0.00092) [2022-07-10 05:35:12,913][26022] Updated weights on worker 0-0, policy_version 589139 (0.00084) [2022-07-10 05:35:14,467][26022] Updated weights on worker 0-0, policy_version 589149 (0.00083) [2022-07-10 05:35:15,238][25689] Fps is (10 sec: 5715.1, 60 sec: 5644.3, 300 sec: 5619.4). Total num frames: 603292672. Throughput: 0: 5790.0. Samples: 603292624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:15,238][25689] Avg episode reward: [(0, '-24.218')] [2022-07-10 05:35:16,602][26022] Updated weights on worker 0-0, policy_version 589159 (0.00095) [2022-07-10 05:35:18,083][26022] Updated weights on worker 0-0, policy_version 589169 (0.00088) [2022-07-10 05:35:20,134][26022] Updated weights on worker 0-0, policy_version 589179 (0.00092) [2022-07-10 05:35:20,317][25689] Fps is (10 sec: 5580.6, 60 sec: 5597.1, 300 sec: 5614.6). Total num frames: 603319296. Throughput: 0: 5786.5. Samples: 603326436. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:20,318][25689] Avg episode reward: [(0, '-25.187')] [2022-07-10 05:35:21,856][26022] Updated weights on worker 0-0, policy_version 589189 (0.00095) [2022-07-10 05:35:23,768][26022] Updated weights on worker 0-0, policy_version 589199 (0.00096) [2022-07-10 05:35:25,313][26022] Updated weights on worker 0-0, policy_version 589209 (0.00087) [2022-07-10 05:35:25,342][25689] Fps is (10 sec: 5675.4, 60 sec: 5630.1, 300 sec: 5618.2). Total num frames: 603350016. Throughput: 0: 5053.8. Samples: 603343438. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:25,342][25689] Avg episode reward: [(0, '-26.298')] [2022-07-10 05:35:27,582][26022] Updated weights on worker 0-0, policy_version 589219 (0.00087) [2022-07-10 05:35:28,975][26022] Updated weights on worker 0-0, policy_version 589229 (0.00088) [2022-07-10 05:35:30,367][25689] Fps is (10 sec: 5604.3, 60 sec: 5611.3, 300 sec: 5607.8). Total num frames: 603375616. Throughput: 0: 5877.3. Samples: 603377114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:30,367][25689] Avg episode reward: [(0, '-25.401')] [2022-07-10 05:35:31,215][26022] Updated weights on worker 0-0, policy_version 589239 (0.00085) [2022-07-10 05:35:32,567][26022] Updated weights on worker 0-0, policy_version 589249 (0.00087) [2022-07-10 05:35:34,796][26022] Updated weights on worker 0-0, policy_version 589259 (0.00090) [2022-07-10 05:35:35,377][25689] Fps is (10 sec: 5612.2, 60 sec: 5644.9, 300 sec: 5619.7). Total num frames: 603406336. Throughput: 0: 5883.1. Samples: 603411106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:35,378][25689] Avg episode reward: [(0, '-26.137')] [2022-07-10 05:35:36,518][26022] Updated weights on worker 0-0, policy_version 589269 (0.00091) [2022-07-10 05:35:38,329][26022] Updated weights on worker 0-0, policy_version 589279 (0.00461) [2022-07-10 05:35:40,220][26022] Updated weights on worker 0-0, policy_version 589289 (0.00080) [2022-07-10 05:35:40,510][25689] Fps is (10 sec: 5653.4, 60 sec: 5588.5, 300 sec: 5614.1). Total num frames: 603432960. Throughput: 0: 5019.3. Samples: 603427788. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:40,511][25689] Avg episode reward: [(0, '-26.303')] [2022-07-10 05:35:41,860][26022] Updated weights on worker 0-0, policy_version 589299 (0.00089) [2022-07-10 05:35:43,708][26022] Updated weights on worker 0-0, policy_version 589309 (0.00086) [2022-07-10 05:35:45,514][25689] Fps is (10 sec: 5455.0, 60 sec: 5590.5, 300 sec: 5610.6). Total num frames: 603461632. Throughput: 0: 5858.5. Samples: 603461616. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:45,514][25689] Avg episode reward: [(0, '-26.019')] [2022-07-10 05:35:45,729][26022] Updated weights on worker 0-0, policy_version 589319 (0.00096) [2022-07-10 05:35:47,396][26022] Updated weights on worker 0-0, policy_version 589329 (0.00089) [2022-07-10 05:35:49,352][26022] Updated weights on worker 0-0, policy_version 589339 (0.00088) [2022-07-10 05:35:50,559][25689] Fps is (10 sec: 5706.5, 60 sec: 5621.1, 300 sec: 5613.8). Total num frames: 603490304. Throughput: 0: 5853.5. Samples: 603495308. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:50,560][25689] Avg episode reward: [(0, '-25.690')] [2022-07-10 05:35:51,067][26022] Updated weights on worker 0-0, policy_version 589349 (0.00095) [2022-07-10 05:35:52,828][26022] Updated weights on worker 0-0, policy_version 589359 (0.00089) [2022-07-10 05:35:54,756][26022] Updated weights on worker 0-0, policy_version 589369 (0.00093) [2022-07-10 05:35:55,659][25689] Fps is (10 sec: 5551.3, 60 sec: 5579.7, 300 sec: 5606.7). Total num frames: 603517952. Throughput: 0: 4986.4. Samples: 603512240. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:35:55,660][25689] Avg episode reward: [(0, '-25.472')] [2022-07-10 05:35:56,387][26022] Updated weights on worker 0-0, policy_version 589379 (0.00085) [2022-07-10 05:35:58,463][26022] Updated weights on worker 0-0, policy_version 589389 (0.00084) [2022-07-10 05:36:00,131][26022] Updated weights on worker 0-0, policy_version 589399 (0.00086) [2022-07-10 05:36:00,707][25689] Fps is (10 sec: 5650.7, 60 sec: 5598.6, 300 sec: 5620.0). Total num frames: 603547648. Throughput: 0: 5850.4. Samples: 603545948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:00,707][25689] Avg episode reward: [(0, '-26.118')] [2022-07-10 05:36:02,310][26022] Updated weights on worker 0-0, policy_version 589409 (0.00095) [2022-07-10 05:36:04,056][26022] Updated weights on worker 0-0, policy_version 589419 (0.00092) [2022-07-10 05:36:04,433][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:36:04,448][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000589421_603567104.pth [2022-07-10 05:36:04,448][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000587446_601544704.pth [2022-07-10 05:36:05,719][25689] Fps is (10 sec: 5598.8, 60 sec: 5601.0, 300 sec: 5612.9). Total num frames: 603574272. Throughput: 0: 5774.5. Samples: 603578288. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:05,719][25689] Avg episode reward: [(0, '-24.838')] [2022-07-10 05:36:05,956][26022] Updated weights on worker 0-0, policy_version 589429 (0.00081) [2022-07-10 05:36:07,611][26022] Updated weights on worker 0-0, policy_version 589439 (0.00089) [2022-07-10 05:36:09,502][26022] Updated weights on worker 0-0, policy_version 589449 (0.00086) [2022-07-10 05:36:10,779][25689] Fps is (10 sec: 5388.4, 60 sec: 5597.4, 300 sec: 5605.4). Total num frames: 603601920. Throughput: 0: 4947.6. Samples: 603595350. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:10,779][25689] Avg episode reward: [(0, '-25.704')] [2022-07-10 05:36:11,130][26022] Updated weights on worker 0-0, policy_version 589459 (0.00094) [2022-07-10 05:36:13,035][26022] Updated weights on worker 0-0, policy_version 589469 (0.00079) [2022-07-10 05:36:14,918][26022] Updated weights on worker 0-0, policy_version 589479 (0.00079) [2022-07-10 05:36:15,785][25689] Fps is (10 sec: 5798.2, 60 sec: 5614.9, 300 sec: 5617.0). Total num frames: 603632640. Throughput: 0: 5834.3. Samples: 603629660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:15,786][25689] Avg episode reward: [(0, '-24.683')] [2022-07-10 05:36:16,684][26022] Updated weights on worker 0-0, policy_version 589489 (0.00089) [2022-07-10 05:36:18,681][26022] Updated weights on worker 0-0, policy_version 589499 (0.00090) [2022-07-10 05:36:20,245][26022] Updated weights on worker 0-0, policy_version 589509 (0.00085) [2022-07-10 05:36:20,875][25689] Fps is (10 sec: 5578.4, 60 sec: 5597.0, 300 sec: 5608.6). Total num frames: 603658240. Throughput: 0: 5833.4. Samples: 603663596. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:20,876][25689] Avg episode reward: [(0, '-24.279')] [2022-07-10 05:36:22,200][26022] Updated weights on worker 0-0, policy_version 589519 (0.00088) [2022-07-10 05:36:23,967][26022] Updated weights on worker 0-0, policy_version 589529 (0.00086) [2022-07-10 05:36:25,654][26022] Updated weights on worker 0-0, policy_version 589539 (0.00089) [2022-07-10 05:36:25,895][25689] Fps is (10 sec: 5570.7, 60 sec: 5597.4, 300 sec: 5612.0). Total num frames: 603688960. Throughput: 0: 5916.7. Samples: 603697666. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:25,896][25689] Avg episode reward: [(0, '-24.451')] [2022-07-10 05:36:27,700][26022] Updated weights on worker 0-0, policy_version 589549 (0.00087) [2022-07-10 05:36:29,259][26022] Updated weights on worker 0-0, policy_version 589559 (0.00089) [2022-07-10 05:36:30,918][25689] Fps is (10 sec: 5709.7, 60 sec: 5614.5, 300 sec: 5612.0). Total num frames: 603715584. Throughput: 0: 5931.0. Samples: 603714796. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:30,919][25689] Avg episode reward: [(0, '-24.534')] [2022-07-10 05:36:31,176][26022] Updated weights on worker 0-0, policy_version 589569 (0.00091) [2022-07-10 05:36:32,857][26022] Updated weights on worker 0-0, policy_version 589579 (0.00086) [2022-07-10 05:36:34,565][26022] Updated weights on worker 0-0, policy_version 589589 (0.00084) [2022-07-10 05:36:36,019][25689] Fps is (10 sec: 5563.1, 60 sec: 5589.3, 300 sec: 5611.2). Total num frames: 603745280. Throughput: 0: 5911.9. Samples: 603749280. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:36,019][25689] Avg episode reward: [(0, '-24.064')] [2022-07-10 05:36:36,714][26022] Updated weights on worker 0-0, policy_version 589599 (0.00091) [2022-07-10 05:36:38,139][26022] Updated weights on worker 0-0, policy_version 589609 (0.00087) [2022-07-10 05:36:40,131][26022] Updated weights on worker 0-0, policy_version 589619 (0.00094) [2022-07-10 05:36:41,060][25689] Fps is (10 sec: 5856.2, 60 sec: 5648.5, 300 sec: 5617.4). Total num frames: 603774976. Throughput: 0: 5929.5. Samples: 603783282. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:41,060][25689] Avg episode reward: [(0, '-23.506')] [2022-07-10 05:36:41,846][26022] Updated weights on worker 0-0, policy_version 589629 (0.00087) [2022-07-10 05:36:43,728][26022] Updated weights on worker 0-0, policy_version 589639 (0.01083) [2022-07-10 05:36:45,536][26022] Updated weights on worker 0-0, policy_version 589649 (0.00087) [2022-07-10 05:36:46,068][25689] Fps is (10 sec: 5706.5, 60 sec: 5631.2, 300 sec: 5614.1). Total num frames: 603802624. Throughput: 0: 5090.1. Samples: 603800346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:46,068][25689] Avg episode reward: [(0, '-23.141')] [2022-07-10 05:36:47,325][26022] Updated weights on worker 0-0, policy_version 589659 (0.00092) [2022-07-10 05:36:49,027][26022] Updated weights on worker 0-0, policy_version 589669 (0.00084) [2022-07-10 05:36:51,068][26022] Updated weights on worker 0-0, policy_version 589679 (0.00088) [2022-07-10 05:36:51,073][25689] Fps is (10 sec: 5624.4, 60 sec: 5634.9, 300 sec: 5614.2). Total num frames: 603831296. Throughput: 0: 5951.0. Samples: 603834740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:51,074][25689] Avg episode reward: [(0, '-23.136')] [2022-07-10 05:36:52,657][26022] Updated weights on worker 0-0, policy_version 589689 (0.00085) [2022-07-10 05:36:54,566][26022] Updated weights on worker 0-0, policy_version 589699 (0.00086) [2022-07-10 05:36:56,098][25689] Fps is (10 sec: 5819.0, 60 sec: 5675.8, 300 sec: 5625.1). Total num frames: 603860992. Throughput: 0: 5945.0. Samples: 603868652. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:36:56,099][25689] Avg episode reward: [(0, '-23.153')] [2022-07-10 05:36:56,172][26022] Updated weights on worker 0-0, policy_version 589709 (0.00092) [2022-07-10 05:36:58,164][26022] Updated weights on worker 0-0, policy_version 589719 (0.00086) [2022-07-10 05:37:00,138][26022] Updated weights on worker 0-0, policy_version 589729 (0.00096) [2022-07-10 05:37:01,169][25689] Fps is (10 sec: 5680.3, 60 sec: 5639.8, 300 sec: 5624.0). Total num frames: 603888640. Throughput: 0: 5089.7. Samples: 603885630. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:37:01,169][25689] Avg episode reward: [(0, '-24.139')] [2022-07-10 05:37:02,069][26022] Updated weights on worker 0-0, policy_version 589739 (0.00097) [2022-07-10 05:37:04,056][26022] Updated weights on worker 0-0, policy_version 589749 (0.00087) [2022-07-10 05:37:05,570][26022] Updated weights on worker 0-0, policy_version 589759 (0.00088) [2022-07-10 05:37:06,193][25689] Fps is (10 sec: 5275.0, 60 sec: 5621.7, 300 sec: 5617.2). Total num frames: 603914240. Throughput: 0: 5832.7. Samples: 603917730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:37:06,195][25689] Avg episode reward: [(0, '-25.094')] [2022-07-10 05:37:07,602][26022] Updated weights on worker 0-0, policy_version 589769 (0.00086) [2022-07-10 05:37:09,415][26022] Updated weights on worker 0-0, policy_version 589779 (0.00088) [2022-07-10 05:37:11,198][25689] Fps is (10 sec: 5411.5, 60 sec: 5643.8, 300 sec: 5617.6). Total num frames: 603942912. Throughput: 0: 5814.6. Samples: 603951754. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:37:11,200][25689] Avg episode reward: [(0, '-25.445')] [2022-07-10 05:37:11,249][26022] Updated weights on worker 0-0, policy_version 589789 (0.00087) [2022-07-10 05:37:12,866][26022] Updated weights on worker 0-0, policy_version 589799 (0.00105) [2022-07-10 05:37:14,771][26022] Updated weights on worker 0-0, policy_version 589809 (0.00097) [2022-07-10 05:37:16,212][25689] Fps is (10 sec: 5825.6, 60 sec: 5626.1, 300 sec: 5625.9). Total num frames: 603972608. Throughput: 0: 4981.9. Samples: 603968856. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:37:16,215][25689] Avg episode reward: [(0, '-25.575')] [2022-07-10 05:37:16,531][26022] Updated weights on worker 0-0, policy_version 589819 (0.00094) [2022-07-10 05:37:18,523][26022] Updated weights on worker 0-0, policy_version 589829 (0.00088) [2022-07-10 05:37:20,177][26022] Updated weights on worker 0-0, policy_version 589839 (0.00094) [2022-07-10 05:37:21,360][25689] Fps is (10 sec: 5542.3, 60 sec: 5637.7, 300 sec: 5613.0). Total num frames: 603999232. Throughput: 0: 5799.5. Samples: 604002728. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 05:37:21,360][25689] Avg episode reward: [(0, '-26.004')] [2022-07-10 05:37:22,013][26022] Updated weights on worker 0-0, policy_version 589849 (0.00050) [2022-07-10 05:37:24,072][26022] Updated weights on worker 0-0, policy_version 589859 (0.00090) [2022-07-10 05:37:25,708][26022] Updated weights on worker 0-0, policy_version 589869 (0.00084) [2022-07-10 05:37:26,365][25689] Fps is (10 sec: 5648.0, 60 sec: 5639.0, 300 sec: 5620.3). Total num frames: 604029952. Throughput: 0: 5890.1. Samples: 604036546. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:26,366][25689] Avg episode reward: [(0, '-26.454')] [2022-07-10 05:37:27,528][26022] Updated weights on worker 0-0, policy_version 589879 (0.00096) [2022-07-10 05:37:29,307][26022] Updated weights on worker 0-0, policy_version 589889 (0.00091) [2022-07-10 05:37:31,398][25689] Fps is (10 sec: 5610.7, 60 sec: 5621.2, 300 sec: 5613.3). Total num frames: 604055552. Throughput: 0: 5018.3. Samples: 604053122. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:31,398][25689] Avg episode reward: [(0, '-26.969')] [2022-07-10 05:37:31,477][26022] Updated weights on worker 0-0, policy_version 589899 (0.00083) [2022-07-10 05:37:32,875][26022] Updated weights on worker 0-0, policy_version 589909 (0.00102) [2022-07-10 05:37:34,923][26022] Updated weights on worker 0-0, policy_version 589919 (0.00085) [2022-07-10 05:37:36,400][25689] Fps is (10 sec: 5714.5, 60 sec: 5664.3, 300 sec: 5622.3). Total num frames: 604087296. Throughput: 0: 5868.1. Samples: 604087320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:36,401][25689] Avg episode reward: [(0, '-27.032')] [2022-07-10 05:37:36,409][26022] Updated weights on worker 0-0, policy_version 589929 (0.00093) [2022-07-10 05:37:38,590][26022] Updated weights on worker 0-0, policy_version 589939 (0.00093) [2022-07-10 05:37:40,290][26022] Updated weights on worker 0-0, policy_version 589949 (0.00092) [2022-07-10 05:37:41,453][25689] Fps is (10 sec: 5702.7, 60 sec: 5595.3, 300 sec: 5612.1). Total num frames: 604112896. Throughput: 0: 5891.1. Samples: 604121100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:41,454][25689] Avg episode reward: [(0, '-27.063')] [2022-07-10 05:37:42,018][26022] Updated weights on worker 0-0, policy_version 589959 (0.00092) [2022-07-10 05:37:44,199][26022] Updated weights on worker 0-0, policy_version 589969 (0.00092) [2022-07-10 05:37:45,728][26022] Updated weights on worker 0-0, policy_version 589979 (0.00090) [2022-07-10 05:37:46,468][25689] Fps is (10 sec: 5594.1, 60 sec: 5645.6, 300 sec: 5619.6). Total num frames: 604143616. Throughput: 0: 5048.1. Samples: 604138026. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:46,468][25689] Avg episode reward: [(0, '-26.841')] [2022-07-10 05:37:47,586][26022] Updated weights on worker 0-0, policy_version 589989 (0.00086) [2022-07-10 05:37:49,254][26022] Updated weights on worker 0-0, policy_version 589999 (0.00089) [2022-07-10 05:37:51,183][26022] Updated weights on worker 0-0, policy_version 590009 (0.00084) [2022-07-10 05:37:51,491][25689] Fps is (10 sec: 5713.0, 60 sec: 5610.1, 300 sec: 5612.7). Total num frames: 604170240. Throughput: 0: 5927.0. Samples: 604172214. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:51,491][25689] Avg episode reward: [(0, '-26.163')] [2022-07-10 05:37:52,911][26022] Updated weights on worker 0-0, policy_version 590019 (0.00082) [2022-07-10 05:37:55,055][26022] Updated weights on worker 0-0, policy_version 590029 (0.00462) [2022-07-10 05:37:56,485][26022] Updated weights on worker 0-0, policy_version 590039 (0.00087) [2022-07-10 05:37:56,575][25689] Fps is (10 sec: 5572.4, 60 sec: 5604.6, 300 sec: 5615.9). Total num frames: 604199936. Throughput: 0: 5884.7. Samples: 604206042. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:37:56,575][25689] Avg episode reward: [(0, '-24.480')] [2022-07-10 05:37:58,537][26022] Updated weights on worker 0-0, policy_version 590049 (0.00090) [2022-07-10 05:38:00,127][26022] Updated weights on worker 0-0, policy_version 590059 (0.00087) [2022-07-10 05:38:01,705][25689] Fps is (10 sec: 5613.8, 60 sec: 5599.0, 300 sec: 5617.2). Total num frames: 604227584. Throughput: 0: 5042.1. Samples: 604223214. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:01,706][25689] Avg episode reward: [(0, '-25.051')] [2022-07-10 05:38:02,261][26022] Updated weights on worker 0-0, policy_version 590069 (0.00092) [2022-07-10 05:38:04,020][26022] Updated weights on worker 0-0, policy_version 590079 (0.00095) [2022-07-10 05:38:04,522][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:38:04,530][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000590081_604242944.pth [2022-07-10 05:38:04,534][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000588105_602219520.pth [2022-07-10 05:38:06,050][26022] Updated weights on worker 0-0, policy_version 590089 (0.00087) [2022-07-10 05:38:06,723][25689] Fps is (10 sec: 5347.9, 60 sec: 5616.5, 300 sec: 5616.9). Total num frames: 604254208. Throughput: 0: 5784.3. Samples: 604255188. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:06,723][25689] Avg episode reward: [(0, '-25.268')] [2022-07-10 05:38:07,609][26022] Updated weights on worker 0-0, policy_version 590099 (0.00090) [2022-07-10 05:38:09,659][26022] Updated weights on worker 0-0, policy_version 590109 (0.00088) [2022-07-10 05:38:11,275][26022] Updated weights on worker 0-0, policy_version 590119 (0.00085) [2022-07-10 05:38:11,785][25689] Fps is (10 sec: 5485.7, 60 sec: 5611.2, 300 sec: 5612.4). Total num frames: 604282880. Throughput: 0: 5757.1. Samples: 604289054. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:11,787][25689] Avg episode reward: [(0, '-25.946')] [2022-07-10 05:38:13,239][26022] Updated weights on worker 0-0, policy_version 590129 (0.00090) [2022-07-10 05:38:14,980][26022] Updated weights on worker 0-0, policy_version 590139 (0.00089) [2022-07-10 05:38:16,792][25689] Fps is (10 sec: 5796.5, 60 sec: 5611.9, 300 sec: 5616.9). Total num frames: 604312576. Throughput: 0: 4949.1. Samples: 604306102. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:16,793][25689] Avg episode reward: [(0, '-25.426')] [2022-07-10 05:38:16,799][26022] Updated weights on worker 0-0, policy_version 590149 (0.00089) [2022-07-10 05:38:18,576][26022] Updated weights on worker 0-0, policy_version 590159 (0.00093) [2022-07-10 05:38:20,342][26022] Updated weights on worker 0-0, policy_version 590169 (0.00091) [2022-07-10 05:38:21,845][25689] Fps is (10 sec: 5700.2, 60 sec: 5637.6, 300 sec: 5617.3). Total num frames: 604340224. Throughput: 0: 5814.3. Samples: 604340314. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:21,846][25689] Avg episode reward: [(0, '-24.008')] [2022-07-10 05:38:22,263][26022] Updated weights on worker 0-0, policy_version 590179 (0.00087) [2022-07-10 05:38:24,063][26022] Updated weights on worker 0-0, policy_version 590189 (0.00089) [2022-07-10 05:38:25,874][26022] Updated weights on worker 0-0, policy_version 590199 (0.00094) [2022-07-10 05:38:26,911][25689] Fps is (10 sec: 5667.1, 60 sec: 5615.0, 300 sec: 5623.0). Total num frames: 604369920. Throughput: 0: 5899.8. Samples: 604374294. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:26,912][25689] Avg episode reward: [(0, '-24.522')] [2022-07-10 05:38:27,750][26022] Updated weights on worker 0-0, policy_version 590209 (0.00086) [2022-07-10 05:38:29,485][26022] Updated weights on worker 0-0, policy_version 590219 (0.00090) [2022-07-10 05:38:31,371][26022] Updated weights on worker 0-0, policy_version 590229 (0.00084) [2022-07-10 05:38:31,949][25689] Fps is (10 sec: 5574.1, 60 sec: 5631.4, 300 sec: 5616.3). Total num frames: 604396544. Throughput: 0: 5070.3. Samples: 604391290. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:31,950][25689] Avg episode reward: [(0, '-24.265')] [2022-07-10 05:38:33,060][26022] Updated weights on worker 0-0, policy_version 590239 (0.00085) [2022-07-10 05:38:35,099][26022] Updated weights on worker 0-0, policy_version 590249 (0.00617) [2022-07-10 05:38:36,585][26022] Updated weights on worker 0-0, policy_version 590259 (0.00089) [2022-07-10 05:38:37,002][25689] Fps is (10 sec: 5682.6, 60 sec: 5609.8, 300 sec: 5620.3). Total num frames: 604427264. Throughput: 0: 5908.0. Samples: 604425502. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:37,003][25689] Avg episode reward: [(0, '-23.635')] [2022-07-10 05:38:38,577][26022] Updated weights on worker 0-0, policy_version 590269 (0.00083) [2022-07-10 05:38:40,281][26022] Updated weights on worker 0-0, policy_version 590279 (0.00089) [2022-07-10 05:38:42,068][25689] Fps is (10 sec: 5768.2, 60 sec: 5642.4, 300 sec: 5619.8). Total num frames: 604454912. Throughput: 0: 5891.1. Samples: 604459448. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:42,069][25689] Avg episode reward: [(0, '-23.503')] [2022-07-10 05:38:42,270][26022] Updated weights on worker 0-0, policy_version 590289 (0.00071) [2022-07-10 05:38:43,914][26022] Updated weights on worker 0-0, policy_version 590299 (0.00090) [2022-07-10 05:38:45,928][26022] Updated weights on worker 0-0, policy_version 590309 (0.00087) [2022-07-10 05:38:47,083][25689] Fps is (10 sec: 5587.0, 60 sec: 5608.6, 300 sec: 5619.8). Total num frames: 604483584. Throughput: 0: 5073.5. Samples: 604476636. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:47,086][25689] Avg episode reward: [(0, '-24.163')] [2022-07-10 05:38:47,392][26022] Updated weights on worker 0-0, policy_version 590319 (0.00087) [2022-07-10 05:38:49,373][26022] Updated weights on worker 0-0, policy_version 590329 (0.00098) [2022-07-10 05:38:51,151][26022] Updated weights on worker 0-0, policy_version 590339 (0.00093) [2022-07-10 05:38:52,114][25689] Fps is (10 sec: 5606.5, 60 sec: 5624.8, 300 sec: 5615.9). Total num frames: 604511232. Throughput: 0: 5928.6. Samples: 604510836. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:52,114][25689] Avg episode reward: [(0, '-25.180')] [2022-07-10 05:38:53,029][26022] Updated weights on worker 0-0, policy_version 590349 (0.00085) [2022-07-10 05:38:54,871][26022] Updated weights on worker 0-0, policy_version 590359 (0.00087) [2022-07-10 05:38:56,506][26022] Updated weights on worker 0-0, policy_version 590369 (0.00088) [2022-07-10 05:38:57,123][25689] Fps is (10 sec: 5711.6, 60 sec: 5631.8, 300 sec: 5627.4). Total num frames: 604540928. Throughput: 0: 5901.5. Samples: 604544242. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:38:57,123][25689] Avg episode reward: [(0, '-25.497')] [2022-07-10 05:38:58,421][26022] Updated weights on worker 0-0, policy_version 590379 (0.00087) [2022-07-10 05:39:00,108][26022] Updated weights on worker 0-0, policy_version 590389 (0.00094) [2022-07-10 05:39:02,183][25689] Fps is (10 sec: 5491.7, 60 sec: 5604.5, 300 sec: 5616.3). Total num frames: 604566528. Throughput: 0: 5865.2. Samples: 604577424. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:02,183][25689] Avg episode reward: [(0, '-24.000')] [2022-07-10 05:39:02,468][26022] Updated weights on worker 0-0, policy_version 590399 (0.00115) [2022-07-10 05:39:04,223][26022] Updated weights on worker 0-0, policy_version 590409 (0.00090) [2022-07-10 05:39:06,063][26022] Updated weights on worker 0-0, policy_version 590419 (0.00083) [2022-07-10 05:39:07,213][25689] Fps is (10 sec: 5378.8, 60 sec: 5637.2, 300 sec: 5619.3). Total num frames: 604595200. Throughput: 0: 5795.5. Samples: 604593298. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:07,213][25689] Avg episode reward: [(0, '-24.769')] [2022-07-10 05:39:07,863][26022] Updated weights on worker 0-0, policy_version 590429 (0.00084) [2022-07-10 05:39:09,843][26022] Updated weights on worker 0-0, policy_version 590439 (0.00089) [2022-07-10 05:39:11,354][26022] Updated weights on worker 0-0, policy_version 590449 (0.00085) [2022-07-10 05:39:12,217][25689] Fps is (10 sec: 5714.9, 60 sec: 5642.6, 300 sec: 5622.8). Total num frames: 604623872. Throughput: 0: 5803.8. Samples: 604627510. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:12,219][25689] Avg episode reward: [(0, '-24.847')] [2022-07-10 05:39:13,344][26022] Updated weights on worker 0-0, policy_version 590459 (0.00781) [2022-07-10 05:39:15,118][26022] Updated weights on worker 0-0, policy_version 590469 (0.00089) [2022-07-10 05:39:16,860][26022] Updated weights on worker 0-0, policy_version 590479 (0.00105) [2022-07-10 05:39:17,232][25689] Fps is (10 sec: 5723.2, 60 sec: 5624.9, 300 sec: 5621.3). Total num frames: 604652544. Throughput: 0: 5841.5. Samples: 604661712. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:17,233][25689] Avg episode reward: [(0, '-25.035')] [2022-07-10 05:39:18,712][26022] Updated weights on worker 0-0, policy_version 590489 (0.00092) [2022-07-10 05:39:20,294][26022] Updated weights on worker 0-0, policy_version 590499 (0.00087) [2022-07-10 05:39:22,344][25689] Fps is (10 sec: 5561.3, 60 sec: 5619.5, 300 sec: 5616.1). Total num frames: 604680192. Throughput: 0: 5035.5. Samples: 604678946. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:22,344][25689] Avg episode reward: [(0, '-24.329')] [2022-07-10 05:39:22,438][26022] Updated weights on worker 0-0, policy_version 590509 (0.00093) [2022-07-10 05:39:24,015][26022] Updated weights on worker 0-0, policy_version 590519 (0.00084) [2022-07-10 05:39:26,036][26022] Updated weights on worker 0-0, policy_version 590529 (0.00095) [2022-07-10 05:39:27,366][25689] Fps is (10 sec: 5557.9, 60 sec: 5606.6, 300 sec: 5622.7). Total num frames: 604708864. Throughput: 0: 5924.5. Samples: 604712694. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:27,366][25689] Avg episode reward: [(0, '-25.230')] [2022-07-10 05:39:27,831][26022] Updated weights on worker 0-0, policy_version 590539 (0.00092) [2022-07-10 05:39:29,454][26022] Updated weights on worker 0-0, policy_version 590549 (0.00086) [2022-07-10 05:39:31,487][26022] Updated weights on worker 0-0, policy_version 590559 (0.00091) [2022-07-10 05:39:32,377][25689] Fps is (10 sec: 5613.5, 60 sec: 5626.0, 300 sec: 5619.1). Total num frames: 604736512. Throughput: 0: 5912.7. Samples: 604746710. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:32,379][25689] Avg episode reward: [(0, '-25.933')] [2022-07-10 05:39:33,067][26022] Updated weights on worker 0-0, policy_version 590569 (0.00095) [2022-07-10 05:39:35,104][26022] Updated weights on worker 0-0, policy_version 590579 (0.00088) [2022-07-10 05:39:36,761][26022] Updated weights on worker 0-0, policy_version 590589 (0.00093) [2022-07-10 05:39:37,423][25689] Fps is (10 sec: 5803.6, 60 sec: 5626.7, 300 sec: 5623.1). Total num frames: 604767232. Throughput: 0: 5062.7. Samples: 604763930. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:37,424][25689] Avg episode reward: [(0, '-26.162')] [2022-07-10 05:39:38,847][26022] Updated weights on worker 0-0, policy_version 590599 (0.00086) [2022-07-10 05:39:40,207][26022] Updated weights on worker 0-0, policy_version 590609 (0.00087) [2022-07-10 05:39:42,400][26022] Updated weights on worker 0-0, policy_version 590619 (0.00089) [2022-07-10 05:39:42,498][25689] Fps is (10 sec: 5665.7, 60 sec: 5608.9, 300 sec: 5615.3). Total num frames: 604793856. Throughput: 0: 5905.7. Samples: 604797970. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:42,499][25689] Avg episode reward: [(0, '-25.556')] [2022-07-10 05:39:43,834][26022] Updated weights on worker 0-0, policy_version 590629 (0.00088) [2022-07-10 05:39:45,874][26022] Updated weights on worker 0-0, policy_version 590639 (0.00090) [2022-07-10 05:39:47,502][25689] Fps is (10 sec: 5587.8, 60 sec: 5626.8, 300 sec: 5625.7). Total num frames: 604823552. Throughput: 0: 5933.3. Samples: 604832168. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:47,503][26022] Updated weights on worker 0-0, policy_version 590649 (0.00085) [2022-07-10 05:39:47,504][25689] Avg episode reward: [(0, '-26.134')] [2022-07-10 05:39:49,386][26022] Updated weights on worker 0-0, policy_version 590659 (0.00084) [2022-07-10 05:39:51,159][26022] Updated weights on worker 0-0, policy_version 590669 (0.00100) [2022-07-10 05:39:52,545][25689] Fps is (10 sec: 5911.3, 60 sec: 5659.5, 300 sec: 5625.2). Total num frames: 604853248. Throughput: 0: 5083.9. Samples: 604849244. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:52,547][25689] Avg episode reward: [(0, '-26.886')] [2022-07-10 05:39:52,843][26022] Updated weights on worker 0-0, policy_version 590679 (0.00088) [2022-07-10 05:39:54,697][26022] Updated weights on worker 0-0, policy_version 590689 (0.00086) [2022-07-10 05:39:56,572][26022] Updated weights on worker 0-0, policy_version 590699 (0.00087) [2022-07-10 05:39:57,579][25689] Fps is (10 sec: 5690.6, 60 sec: 5623.4, 300 sec: 5622.5). Total num frames: 604880896. Throughput: 0: 5918.0. Samples: 604883212. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:39:57,579][25689] Avg episode reward: [(0, '-26.164')] [2022-07-10 05:39:58,222][26022] Updated weights on worker 0-0, policy_version 590709 (0.00087) [2022-07-10 05:40:00,466][26022] Updated weights on worker 0-0, policy_version 590719 (0.00090) [2022-07-10 05:40:02,396][26022] Updated weights on worker 0-0, policy_version 590729 (0.00093) [2022-07-10 05:40:02,707][25689] Fps is (10 sec: 5340.9, 60 sec: 5634.0, 300 sec: 5620.8). Total num frames: 604907520. Throughput: 0: 5801.6. Samples: 604915214. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:40:02,707][25689] Avg episode reward: [(0, '-25.752')] [2022-07-10 05:40:04,184][26022] Updated weights on worker 0-0, policy_version 590739 (0.00098) [2022-07-10 05:40:04,872][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:40:04,885][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000590742_604919808.pth [2022-07-10 05:40:04,886][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000588763_602893312.pth [2022-07-10 05:40:06,043][26022] Updated weights on worker 0-0, policy_version 590749 (0.00100) [2022-07-10 05:40:07,777][25689] Fps is (10 sec: 5422.0, 60 sec: 5630.2, 300 sec: 5623.3). Total num frames: 604936192. Throughput: 0: 4920.2. Samples: 604931926. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:40:07,778][25689] Avg episode reward: [(0, '-24.897')] [2022-07-10 05:40:07,826][26022] Updated weights on worker 0-0, policy_version 590759 (0.00091) [2022-07-10 05:40:09,705][26022] Updated weights on worker 0-0, policy_version 590769 (0.00087) [2022-07-10 05:40:11,627][26022] Updated weights on worker 0-0, policy_version 590779 (0.00096) [2022-07-10 05:40:12,850][25689] Fps is (10 sec: 5653.3, 60 sec: 5623.8, 300 sec: 5618.7). Total num frames: 604964864. Throughput: 0: 5724.7. Samples: 604965484. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:40:12,851][25689] Avg episode reward: [(0, '-24.096')] [2022-07-10 05:40:13,394][26022] Updated weights on worker 0-0, policy_version 590789 (0.00085) [2022-07-10 05:40:15,280][26022] Updated weights on worker 0-0, policy_version 590799 (0.00088) [2022-07-10 05:40:17,027][26022] Updated weights on worker 0-0, policy_version 590809 (0.00084) [2022-07-10 05:40:17,931][25689] Fps is (10 sec: 5647.2, 60 sec: 5617.7, 300 sec: 5625.5). Total num frames: 604993536. Throughput: 0: 5704.1. Samples: 604999306. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:40:17,932][25689] Avg episode reward: [(0, '-23.428')] [2022-07-10 05:40:18,937][26022] Updated weights on worker 0-0, policy_version 590819 (0.00085) [2022-07-10 05:40:20,601][26022] Updated weights on worker 0-0, policy_version 590829 (0.00085) [2022-07-10 05:40:22,528][26022] Updated weights on worker 0-0, policy_version 590839 (0.00091) [2022-07-10 05:40:22,978][25689] Fps is (10 sec: 5561.0, 60 sec: 5623.8, 300 sec: 5614.8). Total num frames: 605021184. Throughput: 0: 4986.1. Samples: 605016288. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:40:22,978][25689] Avg episode reward: [(0, '-23.768')] [2022-07-10 05:40:24,290][26022] Updated weights on worker 0-0, policy_version 590849 (0.00093) [2022-07-10 05:40:26,203][26022] Updated weights on worker 0-0, policy_version 590859 (0.00093) [2022-07-10 05:40:27,845][26022] Updated weights on worker 0-0, policy_version 590869 (0.00090) [2022-07-10 05:40:28,030][25689] Fps is (10 sec: 5678.6, 60 sec: 5637.9, 300 sec: 5628.1). Total num frames: 605050880. Throughput: 0: 5834.0. Samples: 605050080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-10 05:40:28,030][25689] Avg episode reward: [(0, '-25.036')] [2022-07-10 05:40:29,900][26022] Updated weights on worker 0-0, policy_version 590879 (0.01061) [2022-07-10 05:40:31,684][26022] Updated weights on worker 0-0, policy_version 590889 (0.00093) [2022-07-10 05:40:33,038][25689] Fps is (10 sec: 5598.4, 60 sec: 5621.3, 300 sec: 5614.3). Total num frames: 605077504. Throughput: 0: 5860.6. Samples: 605083796. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:40:33,038][25689] Avg episode reward: [(0, '-26.377')] [2022-07-10 05:40:33,445][26022] Updated weights on worker 0-0, policy_version 590899 (0.00092) [2022-07-10 05:40:35,098][26022] Updated weights on worker 0-0, policy_version 590909 (0.00092) [2022-07-10 05:40:36,998][26022] Updated weights on worker 0-0, policy_version 590919 (0.00080) [2022-07-10 05:40:38,090][25689] Fps is (10 sec: 5598.2, 60 sec: 5603.8, 300 sec: 5626.2). Total num frames: 605107200. Throughput: 0: 5034.5. Samples: 605100794. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:40:38,091][25689] Avg episode reward: [(0, '-26.755')] [2022-07-10 05:40:38,843][26022] Updated weights on worker 0-0, policy_version 590929 (0.00085) [2022-07-10 05:40:40,631][26022] Updated weights on worker 0-0, policy_version 590939 (0.00086) [2022-07-10 05:40:42,205][26022] Updated weights on worker 0-0, policy_version 590949 (0.00082) [2022-07-10 05:40:43,199][25689] Fps is (10 sec: 5744.1, 60 sec: 5634.4, 300 sec: 5624.2). Total num frames: 605135872. Throughput: 0: 5866.9. Samples: 605134924. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:40:43,200][25689] Avg episode reward: [(0, '-27.011')] [2022-07-10 05:40:44,263][26022] Updated weights on worker 0-0, policy_version 590959 (0.00084) [2022-07-10 05:40:45,891][26022] Updated weights on worker 0-0, policy_version 590969 (0.00072) [2022-07-10 05:40:47,964][26022] Updated weights on worker 0-0, policy_version 590979 (0.00095) [2022-07-10 05:40:48,226][25689] Fps is (10 sec: 5556.5, 60 sec: 5598.6, 300 sec: 5621.1). Total num frames: 605163520. Throughput: 0: 5888.8. Samples: 605169012. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:40:48,227][25689] Avg episode reward: [(0, '-26.542')] [2022-07-10 05:40:49,437][26022] Updated weights on worker 0-0, policy_version 590989 (0.00104) [2022-07-10 05:40:51,583][26022] Updated weights on worker 0-0, policy_version 590999 (0.00093) [2022-07-10 05:40:53,128][26022] Updated weights on worker 0-0, policy_version 591009 (0.00082) [2022-07-10 05:40:53,322][25689] Fps is (10 sec: 5765.8, 60 sec: 5610.5, 300 sec: 5631.5). Total num frames: 605194240. Throughput: 0: 5050.2. Samples: 605186236. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:40:53,323][25689] Avg episode reward: [(0, '-26.172')] [2022-07-10 05:40:55,254][26022] Updated weights on worker 0-0, policy_version 591019 (0.00095) [2022-07-10 05:40:56,704][26022] Updated weights on worker 0-0, policy_version 591029 (0.00084) [2022-07-10 05:40:58,371][25689] Fps is (10 sec: 5652.2, 60 sec: 5592.2, 300 sec: 5621.1). Total num frames: 605220864. Throughput: 0: 5894.0. Samples: 605220332. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:40:58,372][25689] Avg episode reward: [(0, '-25.406')] [2022-07-10 05:40:58,905][26022] Updated weights on worker 0-0, policy_version 591039 (0.00084) [2022-07-10 05:41:00,551][26022] Updated weights on worker 0-0, policy_version 591049 (0.00085) [2022-07-10 05:41:02,861][26022] Updated weights on worker 0-0, policy_version 591059 (0.00088) [2022-07-10 05:41:03,494][25689] Fps is (10 sec: 5234.8, 60 sec: 5592.7, 300 sec: 5619.0). Total num frames: 605247488. Throughput: 0: 5762.3. Samples: 605251870. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:03,497][25689] Avg episode reward: [(0, '-26.159')] [2022-07-10 05:41:04,501][26022] Updated weights on worker 0-0, policy_version 591069 (0.00087) [2022-07-10 05:41:06,529][26022] Updated weights on worker 0-0, policy_version 591079 (0.00089) [2022-07-10 05:41:08,242][26022] Updated weights on worker 0-0, policy_version 591089 (0.00093) [2022-07-10 05:41:08,515][25689] Fps is (10 sec: 5552.0, 60 sec: 5614.1, 300 sec: 5626.7). Total num frames: 605277184. Throughput: 0: 5749.1. Samples: 605285658. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:08,516][25689] Avg episode reward: [(0, '-27.363')] [2022-07-10 05:41:10,011][26022] Updated weights on worker 0-0, policy_version 591099 (0.00090) [2022-07-10 05:41:11,928][26022] Updated weights on worker 0-0, policy_version 591109 (0.00095) [2022-07-10 05:41:13,530][25689] Fps is (10 sec: 5714.0, 60 sec: 5602.7, 300 sec: 5616.2). Total num frames: 605304832. Throughput: 0: 5752.2. Samples: 605302474. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:13,532][25689] Avg episode reward: [(0, '-27.605')] [2022-07-10 05:41:13,708][26022] Updated weights on worker 0-0, policy_version 591119 (0.00092) [2022-07-10 05:41:15,394][26022] Updated weights on worker 0-0, policy_version 591129 (0.00088) [2022-07-10 05:41:17,184][26022] Updated weights on worker 0-0, policy_version 591139 (0.00087) [2022-07-10 05:41:18,534][25689] Fps is (10 sec: 5621.4, 60 sec: 5609.8, 300 sec: 5628.1). Total num frames: 605333504. Throughput: 0: 5774.4. Samples: 605336760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:18,535][25689] Avg episode reward: [(0, '-27.203')] [2022-07-10 05:41:19,007][26022] Updated weights on worker 0-0, policy_version 591149 (0.00086) [2022-07-10 05:41:20,951][26022] Updated weights on worker 0-0, policy_version 591159 (0.00090) [2022-07-10 05:41:22,627][26022] Updated weights on worker 0-0, policy_version 591169 (0.00090) [2022-07-10 05:41:23,578][25689] Fps is (10 sec: 5706.9, 60 sec: 5626.9, 300 sec: 5620.8). Total num frames: 605362176. Throughput: 0: 5924.9. Samples: 605370866. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:23,580][25689] Avg episode reward: [(0, '-27.040')] [2022-07-10 05:41:24,480][26022] Updated weights on worker 0-0, policy_version 591179 (0.00092) [2022-07-10 05:41:26,228][26022] Updated weights on worker 0-0, policy_version 591189 (0.00093) [2022-07-10 05:41:28,137][26022] Updated weights on worker 0-0, policy_version 591199 (0.00093) [2022-07-10 05:41:28,672][25689] Fps is (10 sec: 5555.7, 60 sec: 5589.3, 300 sec: 5622.9). Total num frames: 605389824. Throughput: 0: 5060.8. Samples: 605387664. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:28,673][25689] Avg episode reward: [(0, '-27.910')] [2022-07-10 05:41:30,046][26022] Updated weights on worker 0-0, policy_version 591209 (0.00087) [2022-07-10 05:41:31,856][26022] Updated weights on worker 0-0, policy_version 591219 (0.00083) [2022-07-10 05:41:33,608][26022] Updated weights on worker 0-0, policy_version 591229 (0.00086) [2022-07-10 05:41:33,675][25689] Fps is (10 sec: 5578.3, 60 sec: 5623.5, 300 sec: 5621.3). Total num frames: 605418496. Throughput: 0: 5916.6. Samples: 605421662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:33,676][25689] Avg episode reward: [(0, '-26.734')] [2022-07-10 05:41:35,519][26022] Updated weights on worker 0-0, policy_version 591239 (0.00095) [2022-07-10 05:41:36,949][26022] Updated weights on worker 0-0, policy_version 591249 (0.00094) [2022-07-10 05:41:38,717][25689] Fps is (10 sec: 5606.8, 60 sec: 5590.7, 300 sec: 5614.4). Total num frames: 605446144. Throughput: 0: 5875.6. Samples: 605455342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:38,718][25689] Avg episode reward: [(0, '-26.664')] [2022-07-10 05:41:39,108][26022] Updated weights on worker 0-0, policy_version 591259 (0.00085) [2022-07-10 05:41:40,775][26022] Updated weights on worker 0-0, policy_version 591269 (0.00090) [2022-07-10 05:41:42,720][26022] Updated weights on worker 0-0, policy_version 591279 (0.00086) [2022-07-10 05:41:43,799][25689] Fps is (10 sec: 5664.0, 60 sec: 5610.1, 300 sec: 5619.9). Total num frames: 605475840. Throughput: 0: 5015.9. Samples: 605472290. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:43,801][25689] Avg episode reward: [(0, '-27.591')] [2022-07-10 05:41:44,415][26022] Updated weights on worker 0-0, policy_version 591289 (0.00087) [2022-07-10 05:41:46,288][26022] Updated weights on worker 0-0, policy_version 591299 (0.00081) [2022-07-10 05:41:47,973][26022] Updated weights on worker 0-0, policy_version 591309 (0.00088) [2022-07-10 05:41:48,849][25689] Fps is (10 sec: 5760.5, 60 sec: 5624.8, 300 sec: 5619.0). Total num frames: 605504512. Throughput: 0: 5889.3. Samples: 605506496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:48,851][25689] Avg episode reward: [(0, '-29.481')] [2022-07-10 05:41:49,873][26022] Updated weights on worker 0-0, policy_version 591319 (0.00087) [2022-07-10 05:41:51,587][26022] Updated weights on worker 0-0, policy_version 591329 (0.00094) [2022-07-10 05:41:53,543][26022] Updated weights on worker 0-0, policy_version 591339 (0.00087) [2022-07-10 05:41:53,854][25689] Fps is (10 sec: 5804.7, 60 sec: 5616.3, 300 sec: 5619.4). Total num frames: 605534208. Throughput: 0: 5904.1. Samples: 605540804. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:53,856][25689] Avg episode reward: [(0, '-28.786')] [2022-07-10 05:41:55,278][26022] Updated weights on worker 0-0, policy_version 591349 (0.00086) [2022-07-10 05:41:57,131][26022] Updated weights on worker 0-0, policy_version 591359 (0.00094) [2022-07-10 05:41:58,887][25689] Fps is (10 sec: 5610.8, 60 sec: 5617.9, 300 sec: 5616.7). Total num frames: 605560832. Throughput: 0: 5091.0. Samples: 605558030. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:41:58,889][25689] Avg episode reward: [(0, '-27.076')] [2022-07-10 05:41:59,034][26022] Updated weights on worker 0-0, policy_version 591369 (0.00087) [2022-07-10 05:42:00,687][26022] Updated weights on worker 0-0, policy_version 591379 (0.00081) [2022-07-10 05:42:02,799][26022] Updated weights on worker 0-0, policy_version 591389 (0.00088) [2022-07-10 05:42:03,973][25689] Fps is (10 sec: 5262.4, 60 sec: 5621.3, 300 sec: 5619.0). Total num frames: 605587456. Throughput: 0: 5819.9. Samples: 605589700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:03,973][25689] Avg episode reward: [(0, '-26.879')] [2022-07-10 05:42:04,623][26022] Updated weights on worker 0-0, policy_version 591399 (0.00084) [2022-07-10 05:42:04,924][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:42:04,936][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000591400_605593600.pth [2022-07-10 05:42:04,936][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000589421_603567104.pth [2022-07-10 05:42:06,462][26022] Updated weights on worker 0-0, policy_version 591409 (0.00096) [2022-07-10 05:42:08,322][26022] Updated weights on worker 0-0, policy_version 591419 (0.00096) [2022-07-10 05:42:08,993][25689] Fps is (10 sec: 5471.5, 60 sec: 5604.4, 300 sec: 5618.7). Total num frames: 605616128. Throughput: 0: 5818.3. Samples: 605623700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:08,994][25689] Avg episode reward: [(0, '-25.961')] [2022-07-10 05:42:10,275][26022] Updated weights on worker 0-0, policy_version 591429 (0.00085) [2022-07-10 05:42:11,804][26022] Updated weights on worker 0-0, policy_version 591439 (0.00263) [2022-07-10 05:42:13,946][26022] Updated weights on worker 0-0, policy_version 591449 (0.00088) [2022-07-10 05:42:14,035][25689] Fps is (10 sec: 5597.0, 60 sec: 5601.9, 300 sec: 5611.3). Total num frames: 605643776. Throughput: 0: 4947.3. Samples: 605640646. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:14,037][25689] Avg episode reward: [(0, '-25.554')] [2022-07-10 05:42:15,438][26022] Updated weights on worker 0-0, policy_version 591459 (0.00087) [2022-07-10 05:42:17,472][26022] Updated weights on worker 0-0, policy_version 591469 (0.00092) [2022-07-10 05:42:19,065][25689] Fps is (10 sec: 5490.1, 60 sec: 5582.6, 300 sec: 5616.9). Total num frames: 605671424. Throughput: 0: 5758.6. Samples: 605674228. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:19,067][25689] Avg episode reward: [(0, '-25.168')] [2022-07-10 05:42:19,476][26022] Updated weights on worker 0-0, policy_version 591479 (0.00089) [2022-07-10 05:42:21,011][26022] Updated weights on worker 0-0, policy_version 591489 (0.00086) [2022-07-10 05:42:22,757][26022] Updated weights on worker 0-0, policy_version 591499 (0.00087) [2022-07-10 05:42:24,122][25689] Fps is (10 sec: 5787.0, 60 sec: 5615.3, 300 sec: 5616.0). Total num frames: 605702144. Throughput: 0: 5894.4. Samples: 605708464. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:24,122][25689] Avg episode reward: [(0, '-25.121')] [2022-07-10 05:42:24,760][26022] Updated weights on worker 0-0, policy_version 591509 (0.00088) [2022-07-10 05:42:26,542][26022] Updated weights on worker 0-0, policy_version 591519 (0.00092) [2022-07-10 05:42:28,425][26022] Updated weights on worker 0-0, policy_version 591529 (0.00088) [2022-07-10 05:42:29,144][25689] Fps is (10 sec: 5892.8, 60 sec: 5638.8, 300 sec: 5626.5). Total num frames: 605730816. Throughput: 0: 5046.2. Samples: 605725384. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:29,144][25689] Avg episode reward: [(0, '-25.385')] [2022-07-10 05:42:30,007][26022] Updated weights on worker 0-0, policy_version 591539 (0.00097) [2022-07-10 05:42:32,159][26022] Updated weights on worker 0-0, policy_version 591549 (0.00083) [2022-07-10 05:42:33,811][26022] Updated weights on worker 0-0, policy_version 591559 (0.00079) [2022-07-10 05:42:34,151][25689] Fps is (10 sec: 5513.6, 60 sec: 5604.6, 300 sec: 5609.2). Total num frames: 605757440. Throughput: 0: 5903.3. Samples: 605759392. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:34,151][25689] Avg episode reward: [(0, '-26.207')] [2022-07-10 05:42:35,487][26022] Updated weights on worker 0-0, policy_version 591569 (0.00051) [2022-07-10 05:42:37,196][26022] Updated weights on worker 0-0, policy_version 591579 (0.00091) [2022-07-10 05:42:39,165][25689] Fps is (10 sec: 5517.7, 60 sec: 5624.1, 300 sec: 5620.2). Total num frames: 605786112. Throughput: 0: 5934.0. Samples: 605793502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:39,166][25689] Avg episode reward: [(0, '-25.827')] [2022-07-10 05:42:39,385][26022] Updated weights on worker 0-0, policy_version 591589 (0.00090) [2022-07-10 05:42:41,000][26022] Updated weights on worker 0-0, policy_version 591599 (0.00094) [2022-07-10 05:42:42,728][26022] Updated weights on worker 0-0, policy_version 591609 (0.00086) [2022-07-10 05:42:44,222][25689] Fps is (10 sec: 5693.7, 60 sec: 5609.5, 300 sec: 5612.6). Total num frames: 605814784. Throughput: 0: 5069.9. Samples: 605810372. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:44,223][25689] Avg episode reward: [(0, '-23.947')] [2022-07-10 05:42:44,583][26022] Updated weights on worker 0-0, policy_version 591619 (0.00084) [2022-07-10 05:42:46,418][26022] Updated weights on worker 0-0, policy_version 591629 (0.00081) [2022-07-10 05:42:48,171][26022] Updated weights on worker 0-0, policy_version 591639 (0.00083) [2022-07-10 05:42:49,231][25689] Fps is (10 sec: 5799.0, 60 sec: 5630.3, 300 sec: 5623.2). Total num frames: 605844480. Throughput: 0: 5941.5. Samples: 605844730. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:49,231][25689] Avg episode reward: [(0, '-24.701')] [2022-07-10 05:42:49,932][26022] Updated weights on worker 0-0, policy_version 591649 (0.00092) [2022-07-10 05:42:51,756][26022] Updated weights on worker 0-0, policy_version 591659 (0.00092) [2022-07-10 05:42:53,651][26022] Updated weights on worker 0-0, policy_version 591669 (0.00099) [2022-07-10 05:42:54,244][25689] Fps is (10 sec: 5824.2, 60 sec: 5612.6, 300 sec: 5621.1). Total num frames: 605873152. Throughput: 0: 5954.3. Samples: 605879032. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:54,244][25689] Avg episode reward: [(0, '-23.346')] [2022-07-10 05:42:55,291][26022] Updated weights on worker 0-0, policy_version 591679 (0.00085) [2022-07-10 05:42:57,017][26022] Updated weights on worker 0-0, policy_version 591689 (0.00086) [2022-07-10 05:42:59,042][26022] Updated weights on worker 0-0, policy_version 591699 (0.00084) [2022-07-10 05:42:59,252][25689] Fps is (10 sec: 5620.0, 60 sec: 5631.9, 300 sec: 5623.4). Total num frames: 605900800. Throughput: 0: 5113.9. Samples: 605896222. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:42:59,252][25689] Avg episode reward: [(0, '-24.126')] [2022-07-10 05:43:00,729][26022] Updated weights on worker 0-0, policy_version 591709 (0.00088) [2022-07-10 05:43:03,000][26022] Updated weights on worker 0-0, policy_version 591719 (0.00078) [2022-07-10 05:43:04,358][25689] Fps is (10 sec: 5568.5, 60 sec: 5663.9, 300 sec: 5628.6). Total num frames: 605929472. Throughput: 0: 5846.9. Samples: 605928102. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:43:04,358][25689] Avg episode reward: [(0, '-23.609')] [2022-07-10 05:43:04,708][26022] Updated weights on worker 0-0, policy_version 591729 (0.00092) [2022-07-10 05:43:06,459][26022] Updated weights on worker 0-0, policy_version 591739 (0.00080) [2022-07-10 05:43:08,425][26022] Updated weights on worker 0-0, policy_version 591749 (0.00094) [2022-07-10 05:43:09,420][25689] Fps is (10 sec: 5437.9, 60 sec: 5626.1, 300 sec: 5621.7). Total num frames: 605956096. Throughput: 0: 5817.5. Samples: 605962184. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:43:09,421][25689] Avg episode reward: [(0, '-24.520')] [2022-07-10 05:43:10,119][26022] Updated weights on worker 0-0, policy_version 591759 (0.00085) [2022-07-10 05:43:12,036][26022] Updated weights on worker 0-0, policy_version 591769 (0.00085) [2022-07-10 05:43:13,711][26022] Updated weights on worker 0-0, policy_version 591779 (0.00086) [2022-07-10 05:43:14,448][25689] Fps is (10 sec: 5480.2, 60 sec: 5644.4, 300 sec: 5617.9). Total num frames: 605984768. Throughput: 0: 4960.7. Samples: 605979258. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:43:14,449][25689] Avg episode reward: [(0, '-24.711')] [2022-07-10 05:43:15,671][26022] Updated weights on worker 0-0, policy_version 591789 (0.00084) [2022-07-10 05:43:17,647][26022] Updated weights on worker 0-0, policy_version 591799 (0.00089) [2022-07-10 05:43:19,070][26022] Updated weights on worker 0-0, policy_version 591809 (0.00083) [2022-07-10 05:43:19,475][25689] Fps is (10 sec: 5703.5, 60 sec: 5661.6, 300 sec: 5621.8). Total num frames: 606013440. Throughput: 0: 5785.0. Samples: 606013210. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:43:19,475][25689] Avg episode reward: [(0, '-25.182')] [2022-07-10 05:43:21,047][26022] Updated weights on worker 0-0, policy_version 591819 (0.00097) [2022-07-10 05:43:22,760][26022] Updated weights on worker 0-0, policy_version 591829 (0.00089) [2022-07-10 05:43:24,593][25689] Fps is (10 sec: 5652.4, 60 sec: 5622.0, 300 sec: 5617.4). Total num frames: 606042112. Throughput: 0: 5898.4. Samples: 606047458. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:43:24,594][25689] Avg episode reward: [(0, '-25.729')] [2022-07-10 05:43:24,599][26022] Updated weights on worker 0-0, policy_version 591839 (0.00082) [2022-07-10 05:43:26,586][26022] Updated weights on worker 0-0, policy_version 591849 (0.00086) [2022-07-10 05:43:28,135][26022] Updated weights on worker 0-0, policy_version 591859 (0.00091) [2022-07-10 05:43:29,610][25689] Fps is (10 sec: 5657.9, 60 sec: 5622.5, 300 sec: 5624.7). Total num frames: 606070784. Throughput: 0: 5065.9. Samples: 606064462. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 05:43:29,611][25689] Avg episode reward: [(0, '-25.199')] [2022-07-10 05:43:30,168][26022] Updated weights on worker 0-0, policy_version 591869 (0.00084) [2022-07-10 05:43:31,595][26022] Updated weights on worker 0-0, policy_version 591879 (0.00082) [2022-07-10 05:43:33,716][26022] Updated weights on worker 0-0, policy_version 591889 (0.00086) [2022-07-10 05:43:34,645][25689] Fps is (10 sec: 5806.7, 60 sec: 5670.6, 300 sec: 5621.6). Total num frames: 606100480. Throughput: 0: 5909.7. Samples: 606098616. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:43:34,646][25689] Avg episode reward: [(0, '-25.825')] [2022-07-10 05:43:35,419][26022] Updated weights on worker 0-0, policy_version 591899 (0.00093) [2022-07-10 05:43:37,366][26022] Updated weights on worker 0-0, policy_version 591909 (0.00086) [2022-07-10 05:43:38,975][26022] Updated weights on worker 0-0, policy_version 591919 (0.00083) [2022-07-10 05:43:39,658][25689] Fps is (10 sec: 5604.9, 60 sec: 5636.9, 300 sec: 5619.1). Total num frames: 606127104. Throughput: 0: 5926.8. Samples: 606132834. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:43:39,660][25689] Avg episode reward: [(0, '-25.610')] [2022-07-10 05:43:40,775][26022] Updated weights on worker 0-0, policy_version 591929 (0.00091) [2022-07-10 05:43:42,605][26022] Updated weights on worker 0-0, policy_version 591939 (0.00094) [2022-07-10 05:43:44,461][26022] Updated weights on worker 0-0, policy_version 591949 (0.00089) [2022-07-10 05:43:44,708][25689] Fps is (10 sec: 5698.4, 60 sec: 5671.4, 300 sec: 5625.3). Total num frames: 606157824. Throughput: 0: 5086.7. Samples: 606149776. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:43:44,709][25689] Avg episode reward: [(0, '-25.546')] [2022-07-10 05:43:46,299][26022] Updated weights on worker 0-0, policy_version 591959 (0.00086) [2022-07-10 05:43:47,958][26022] Updated weights on worker 0-0, policy_version 591969 (0.00088) [2022-07-10 05:43:49,723][26022] Updated weights on worker 0-0, policy_version 591979 (0.00089) [2022-07-10 05:43:49,808][25689] Fps is (10 sec: 5851.4, 60 sec: 5645.9, 300 sec: 5627.5). Total num frames: 606186496. Throughput: 0: 5915.7. Samples: 606183950. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:43:49,809][25689] Avg episode reward: [(0, '-26.201')] [2022-07-10 05:43:51,660][26022] Updated weights on worker 0-0, policy_version 591989 (0.00088) [2022-07-10 05:43:53,403][26022] Updated weights on worker 0-0, policy_version 591999 (0.00088) [2022-07-10 05:43:54,825][25689] Fps is (10 sec: 5566.6, 60 sec: 5628.6, 300 sec: 5620.4). Total num frames: 606214144. Throughput: 0: 5907.5. Samples: 606217834. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:43:54,826][25689] Avg episode reward: [(0, '-25.433')] [2022-07-10 05:43:55,404][26022] Updated weights on worker 0-0, policy_version 592009 (0.00098) [2022-07-10 05:43:57,039][26022] Updated weights on worker 0-0, policy_version 592019 (0.00087) [2022-07-10 05:43:58,843][26022] Updated weights on worker 0-0, policy_version 592029 (0.00092) [2022-07-10 05:43:59,885][25689] Fps is (10 sec: 5487.7, 60 sec: 5623.9, 300 sec: 5627.3). Total num frames: 606241792. Throughput: 0: 5877.7. Samples: 606251720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:43:59,885][25689] Avg episode reward: [(0, '-25.167')] [2022-07-10 05:44:00,695][26022] Updated weights on worker 0-0, policy_version 592039 (0.00096) [2022-07-10 05:44:03,030][26022] Updated weights on worker 0-0, policy_version 592049 (0.00085) [2022-07-10 05:44:04,820][26022] Updated weights on worker 0-0, policy_version 592059 (0.00079) [2022-07-10 05:44:05,018][25689] Fps is (10 sec: 5324.8, 60 sec: 5587.6, 300 sec: 5618.5). Total num frames: 606268416. Throughput: 0: 5739.0. Samples: 606266334. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:05,018][25689] Avg episode reward: [(0, '-25.416')] [2022-07-10 05:44:05,084][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:44:05,098][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000592060_606269440.pth [2022-07-10 05:44:05,098][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000590081_604242944.pth [2022-07-10 05:44:06,459][26022] Updated weights on worker 0-0, policy_version 592069 (0.00092) [2022-07-10 05:44:08,332][26022] Updated weights on worker 0-0, policy_version 592079 (0.00096) [2022-07-10 05:44:10,046][25689] Fps is (10 sec: 5542.5, 60 sec: 5641.4, 300 sec: 5621.5). Total num frames: 606298112. Throughput: 0: 5749.1. Samples: 606300300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:10,047][25689] Avg episode reward: [(0, '-25.809')] [2022-07-10 05:44:10,326][26022] Updated weights on worker 0-0, policy_version 592089 (0.00089) [2022-07-10 05:44:11,957][26022] Updated weights on worker 0-0, policy_version 592099 (0.00087) [2022-07-10 05:44:14,026][26022] Updated weights on worker 0-0, policy_version 592109 (0.00087) [2022-07-10 05:44:15,125][25689] Fps is (10 sec: 5876.4, 60 sec: 5653.6, 300 sec: 5623.7). Total num frames: 606327808. Throughput: 0: 5743.4. Samples: 606334420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:15,125][25689] Avg episode reward: [(0, '-25.539')] [2022-07-10 05:44:15,556][26022] Updated weights on worker 0-0, policy_version 592119 (0.00086) [2022-07-10 05:44:17,422][26022] Updated weights on worker 0-0, policy_version 592129 (0.00081) [2022-07-10 05:44:19,395][26022] Updated weights on worker 0-0, policy_version 592139 (0.00090) [2022-07-10 05:44:20,158][25689] Fps is (10 sec: 5569.7, 60 sec: 5619.2, 300 sec: 5621.8). Total num frames: 606354432. Throughput: 0: 4917.3. Samples: 606351410. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:20,159][25689] Avg episode reward: [(0, '-25.841')] [2022-07-10 05:44:21,273][26022] Updated weights on worker 0-0, policy_version 592149 (0.00087) [2022-07-10 05:44:23,063][26022] Updated weights on worker 0-0, policy_version 592159 (0.00092) [2022-07-10 05:44:24,670][26022] Updated weights on worker 0-0, policy_version 592169 (0.00091) [2022-07-10 05:44:25,203][25689] Fps is (10 sec: 5486.6, 60 sec: 5626.0, 300 sec: 5621.3). Total num frames: 606383104. Throughput: 0: 5896.0. Samples: 606385346. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:25,203][25689] Avg episode reward: [(0, '-24.402')] [2022-07-10 05:44:26,702][26022] Updated weights on worker 0-0, policy_version 592179 (0.00091) [2022-07-10 05:44:28,370][26022] Updated weights on worker 0-0, policy_version 592189 (0.00094) [2022-07-10 05:44:30,207][26022] Updated weights on worker 0-0, policy_version 592199 (0.00092) [2022-07-10 05:44:30,276][25689] Fps is (10 sec: 5768.6, 60 sec: 5637.7, 300 sec: 5627.0). Total num frames: 606412800. Throughput: 0: 5879.4. Samples: 606419240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:30,277][25689] Avg episode reward: [(0, '-23.720')] [2022-07-10 05:44:31,769][26022] Updated weights on worker 0-0, policy_version 592209 (0.00088) [2022-07-10 05:44:33,946][26022] Updated weights on worker 0-0, policy_version 592219 (0.00086) [2022-07-10 05:44:35,314][25689] Fps is (10 sec: 5671.7, 60 sec: 5603.7, 300 sec: 5616.9). Total num frames: 606440448. Throughput: 0: 5032.1. Samples: 606436012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:35,315][25689] Avg episode reward: [(0, '-24.571')] [2022-07-10 05:44:35,632][26022] Updated weights on worker 0-0, policy_version 592229 (0.00089) [2022-07-10 05:44:37,409][26022] Updated weights on worker 0-0, policy_version 592239 (0.00086) [2022-07-10 05:44:39,137][26022] Updated weights on worker 0-0, policy_version 592249 (0.00089) [2022-07-10 05:44:40,349][25689] Fps is (10 sec: 5591.5, 60 sec: 5635.4, 300 sec: 5624.5). Total num frames: 606469120. Throughput: 0: 5893.4. Samples: 606470400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:40,349][25689] Avg episode reward: [(0, '-25.191')] [2022-07-10 05:44:40,975][26022] Updated weights on worker 0-0, policy_version 592259 (0.00090) [2022-07-10 05:44:42,875][26022] Updated weights on worker 0-0, policy_version 592269 (0.00090) [2022-07-10 05:44:44,617][26022] Updated weights on worker 0-0, policy_version 592279 (0.00087) [2022-07-10 05:44:45,426][25689] Fps is (10 sec: 5670.4, 60 sec: 5599.1, 300 sec: 5619.7). Total num frames: 606497792. Throughput: 0: 5895.0. Samples: 606504564. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:45,427][25689] Avg episode reward: [(0, '-24.661')] [2022-07-10 05:44:46,422][26022] Updated weights on worker 0-0, policy_version 592289 (0.00087) [2022-07-10 05:44:48,209][26022] Updated weights on worker 0-0, policy_version 592299 (0.00090) [2022-07-10 05:44:49,916][26022] Updated weights on worker 0-0, policy_version 592309 (0.00093) [2022-07-10 05:44:50,515][25689] Fps is (10 sec: 5640.4, 60 sec: 5600.2, 300 sec: 5615.4). Total num frames: 606526464. Throughput: 0: 5058.0. Samples: 606521610. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:50,516][25689] Avg episode reward: [(0, '-26.291')] [2022-07-10 05:44:51,793][26022] Updated weights on worker 0-0, policy_version 592319 (0.00084) [2022-07-10 05:44:53,700][26022] Updated weights on worker 0-0, policy_version 592329 (0.00088) [2022-07-10 05:44:55,454][26022] Updated weights on worker 0-0, policy_version 592339 (0.00090) [2022-07-10 05:44:55,548][25689] Fps is (10 sec: 5665.7, 60 sec: 5615.6, 300 sec: 5618.8). Total num frames: 606555136. Throughput: 0: 5920.8. Samples: 606555814. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:44:55,548][25689] Avg episode reward: [(0, '-26.900')] [2022-07-10 05:44:57,356][26022] Updated weights on worker 0-0, policy_version 592349 (0.00090) [2022-07-10 05:44:58,955][26022] Updated weights on worker 0-0, policy_version 592359 (0.00087) [2022-07-10 05:45:00,552][25689] Fps is (10 sec: 5815.2, 60 sec: 5654.4, 300 sec: 5631.5). Total num frames: 606584832. Throughput: 0: 5931.9. Samples: 606590246. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:00,553][25689] Avg episode reward: [(0, '-26.898')] [2022-07-10 05:45:00,707][26022] Updated weights on worker 0-0, policy_version 592369 (0.00093) [2022-07-10 05:45:03,092][26022] Updated weights on worker 0-0, policy_version 592379 (0.00097) [2022-07-10 05:45:04,945][26022] Updated weights on worker 0-0, policy_version 592389 (0.00084) [2022-07-10 05:45:05,648][25689] Fps is (10 sec: 5576.3, 60 sec: 5657.9, 300 sec: 5624.1). Total num frames: 606611456. Throughput: 0: 4963.2. Samples: 606604926. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:05,648][25689] Avg episode reward: [(0, '-26.384')] [2022-07-10 05:45:06,602][26022] Updated weights on worker 0-0, policy_version 592399 (0.00086) [2022-07-10 05:45:08,381][26022] Updated weights on worker 0-0, policy_version 592409 (0.00093) [2022-07-10 05:45:10,195][26022] Updated weights on worker 0-0, policy_version 592419 (0.00090) [2022-07-10 05:45:10,681][25689] Fps is (10 sec: 5257.0, 60 sec: 5606.8, 300 sec: 5618.0). Total num frames: 606638080. Throughput: 0: 5829.4. Samples: 606639166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:10,682][25689] Avg episode reward: [(0, '-25.948')] [2022-07-10 05:45:12,059][26022] Updated weights on worker 0-0, policy_version 592429 (0.00085) [2022-07-10 05:45:13,874][26022] Updated weights on worker 0-0, policy_version 592439 (0.00091) [2022-07-10 05:45:15,560][26022] Updated weights on worker 0-0, policy_version 592449 (0.00089) [2022-07-10 05:45:15,704][25689] Fps is (10 sec: 5600.6, 60 sec: 5612.0, 300 sec: 5622.6). Total num frames: 606667776. Throughput: 0: 5836.8. Samples: 606673460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:15,704][25689] Avg episode reward: [(0, '-25.915')] [2022-07-10 05:45:17,442][26022] Updated weights on worker 0-0, policy_version 592459 (0.00100) [2022-07-10 05:45:19,304][26022] Updated weights on worker 0-0, policy_version 592469 (0.00085) [2022-07-10 05:45:20,755][25689] Fps is (10 sec: 5793.9, 60 sec: 5644.1, 300 sec: 5625.9). Total num frames: 606696448. Throughput: 0: 4972.0. Samples: 606690696. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:20,756][25689] Avg episode reward: [(0, '-26.185')] [2022-07-10 05:45:20,915][26022] Updated weights on worker 0-0, policy_version 592479 (0.00078) [2022-07-10 05:45:22,772][26022] Updated weights on worker 0-0, policy_version 592489 (0.00094) [2022-07-10 05:45:24,754][26022] Updated weights on worker 0-0, policy_version 592499 (0.00084) [2022-07-10 05:45:25,839][25689] Fps is (10 sec: 5556.7, 60 sec: 5623.6, 300 sec: 5618.4). Total num frames: 606724096. Throughput: 0: 5915.8. Samples: 606724376. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:25,840][25689] Avg episode reward: [(0, '-25.689')] [2022-07-10 05:45:26,394][26022] Updated weights on worker 0-0, policy_version 592509 (0.00089) [2022-07-10 05:45:28,636][26022] Updated weights on worker 0-0, policy_version 592519 (0.00087) [2022-07-10 05:45:29,736][26022] Updated weights on worker 0-0, policy_version 592529 (0.00090) [2022-07-10 05:45:30,937][25689] Fps is (10 sec: 5632.4, 60 sec: 5621.3, 300 sec: 5627.0). Total num frames: 606753792. Throughput: 0: 5877.8. Samples: 606758222. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:30,937][25689] Avg episode reward: [(0, '-26.763')] [2022-07-10 05:45:32,111][26022] Updated weights on worker 0-0, policy_version 592539 (0.00097) [2022-07-10 05:45:33,806][26022] Updated weights on worker 0-0, policy_version 592549 (0.00094) [2022-07-10 05:45:35,583][26022] Updated weights on worker 0-0, policy_version 592559 (0.00083) [2022-07-10 05:45:35,961][25689] Fps is (10 sec: 5665.3, 60 sec: 5622.5, 300 sec: 5620.7). Total num frames: 606781440. Throughput: 0: 5008.4. Samples: 606774918. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:35,962][25689] Avg episode reward: [(0, '-27.132')] [2022-07-10 05:45:37,479][26022] Updated weights on worker 0-0, policy_version 592569 (0.00093) [2022-07-10 05:45:39,179][26022] Updated weights on worker 0-0, policy_version 592579 (0.00091) [2022-07-10 05:45:40,982][25689] Fps is (10 sec: 5606.4, 60 sec: 5623.8, 300 sec: 5622.4). Total num frames: 606810112. Throughput: 0: 5857.4. Samples: 606809172. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:40,983][25689] Avg episode reward: [(0, '-27.382')] [2022-07-10 05:45:41,130][26022] Updated weights on worker 0-0, policy_version 592589 (0.00097) [2022-07-10 05:45:42,780][26022] Updated weights on worker 0-0, policy_version 592599 (0.00096) [2022-07-10 05:45:44,560][26022] Updated weights on worker 0-0, policy_version 592609 (0.00489) [2022-07-10 05:45:46,031][25689] Fps is (10 sec: 5796.1, 60 sec: 5643.4, 300 sec: 5628.8). Total num frames: 606839808. Throughput: 0: 5908.6. Samples: 606843684. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:46,032][25689] Avg episode reward: [(0, '-27.104')] [2022-07-10 05:45:46,242][26022] Updated weights on worker 0-0, policy_version 592619 (0.00088) [2022-07-10 05:45:48,179][26022] Updated weights on worker 0-0, policy_version 592629 (0.00089) [2022-07-10 05:45:49,716][26022] Updated weights on worker 0-0, policy_version 592639 (0.00083) [2022-07-10 05:45:51,034][25689] Fps is (10 sec: 5602.7, 60 sec: 5617.5, 300 sec: 5616.8). Total num frames: 606866432. Throughput: 0: 5960.3. Samples: 606878012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:51,035][25689] Avg episode reward: [(0, '-26.513')] [2022-07-10 05:45:51,801][26022] Updated weights on worker 0-0, policy_version 592649 (0.00092) [2022-07-10 05:45:53,339][26022] Updated weights on worker 0-0, policy_version 592659 (0.00058) [2022-07-10 05:45:55,221][26022] Updated weights on worker 0-0, policy_version 592669 (0.00091) [2022-07-10 05:45:56,118][25689] Fps is (10 sec: 5888.3, 60 sec: 5680.4, 300 sec: 5636.8). Total num frames: 606899200. Throughput: 0: 5967.2. Samples: 606895196. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:45:56,119][25689] Avg episode reward: [(0, '-24.287')] [2022-07-10 05:45:57,145][26022] Updated weights on worker 0-0, policy_version 592679 (0.00088) [2022-07-10 05:45:58,886][26022] Updated weights on worker 0-0, policy_version 592689 (0.00087) [2022-07-10 05:46:00,664][26022] Updated weights on worker 0-0, policy_version 592699 (0.00090) [2022-07-10 05:46:01,141][25689] Fps is (10 sec: 5876.5, 60 sec: 5628.0, 300 sec: 5638.7). Total num frames: 606925824. Throughput: 0: 5968.7. Samples: 606929496. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:01,143][25689] Avg episode reward: [(0, '-24.266')] [2022-07-10 05:46:02,917][26022] Updated weights on worker 0-0, policy_version 592709 (0.00098) [2022-07-10 05:46:04,547][26022] Updated weights on worker 0-0, policy_version 592719 (0.00081) [2022-07-10 05:46:05,460][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:46:05,474][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000592722_606947328.pth [2022-07-10 05:46:05,474][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000590742_604919808.pth [2022-07-10 05:46:06,271][25689] Fps is (10 sec: 5143.4, 60 sec: 5607.8, 300 sec: 5622.9). Total num frames: 606951424. Throughput: 0: 5792.7. Samples: 606960928. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:06,274][25689] Avg episode reward: [(0, '-24.320')] [2022-07-10 05:46:06,741][26022] Updated weights on worker 0-0, policy_version 592729 (0.00089) [2022-07-10 05:46:08,148][26022] Updated weights on worker 0-0, policy_version 592739 (0.00088) [2022-07-10 05:46:10,282][26022] Updated weights on worker 0-0, policy_version 592749 (0.00089) [2022-07-10 05:46:11,294][25689] Fps is (10 sec: 5547.5, 60 sec: 5676.5, 300 sec: 5633.0). Total num frames: 606982144. Throughput: 0: 4929.1. Samples: 606977870. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:11,294][25689] Avg episode reward: [(0, '-25.199')] [2022-07-10 05:46:11,911][26022] Updated weights on worker 0-0, policy_version 592759 (0.00100) [2022-07-10 05:46:13,832][26022] Updated weights on worker 0-0, policy_version 592769 (0.00083) [2022-07-10 05:46:15,650][26022] Updated weights on worker 0-0, policy_version 592779 (0.00113) [2022-07-10 05:46:16,299][25689] Fps is (10 sec: 5820.8, 60 sec: 5644.3, 300 sec: 5629.6). Total num frames: 607009792. Throughput: 0: 5789.9. Samples: 607012042. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:16,301][25689] Avg episode reward: [(0, '-25.167')] [2022-07-10 05:46:17,311][26022] Updated weights on worker 0-0, policy_version 592789 (0.00098) [2022-07-10 05:46:19,318][26022] Updated weights on worker 0-0, policy_version 592799 (0.00097) [2022-07-10 05:46:21,158][26022] Updated weights on worker 0-0, policy_version 592809 (0.00086) [2022-07-10 05:46:21,353][25689] Fps is (10 sec: 5497.1, 60 sec: 5627.2, 300 sec: 5626.0). Total num frames: 607037440. Throughput: 0: 5766.8. Samples: 607046050. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:21,353][25689] Avg episode reward: [(0, '-27.147')] [2022-07-10 05:46:22,684][26022] Updated weights on worker 0-0, policy_version 592819 (0.00089) [2022-07-10 05:46:24,757][26022] Updated weights on worker 0-0, policy_version 592829 (0.00100) [2022-07-10 05:46:26,177][26022] Updated weights on worker 0-0, policy_version 592839 (0.00080) [2022-07-10 05:46:26,409][25689] Fps is (10 sec: 5773.1, 60 sec: 5680.4, 300 sec: 5637.0). Total num frames: 607068160. Throughput: 0: 5074.7. Samples: 607063122. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:26,410][25689] Avg episode reward: [(0, '-26.996')] [2022-07-10 05:46:28,489][26022] Updated weights on worker 0-0, policy_version 592849 (0.00086) [2022-07-10 05:46:29,748][26022] Updated weights on worker 0-0, policy_version 592859 (0.00092) [2022-07-10 05:46:31,431][25689] Fps is (10 sec: 5486.6, 60 sec: 5602.9, 300 sec: 5622.9). Total num frames: 607092736. Throughput: 0: 5921.1. Samples: 607097106. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:31,432][25689] Avg episode reward: [(0, '-25.721')] [2022-07-10 05:46:31,989][26022] Updated weights on worker 0-0, policy_version 592869 (0.00084) [2022-07-10 05:46:33,487][26022] Updated weights on worker 0-0, policy_version 592879 (0.00082) [2022-07-10 05:46:35,528][26022] Updated weights on worker 0-0, policy_version 592889 (0.00090) [2022-07-10 05:46:36,446][25689] Fps is (10 sec: 5611.1, 60 sec: 5671.5, 300 sec: 5637.1). Total num frames: 607124480. Throughput: 0: 5898.7. Samples: 607130886. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 05:46:36,447][25689] Avg episode reward: [(0, '-25.637')] [2022-07-10 05:46:37,320][26022] Updated weights on worker 0-0, policy_version 592899 (0.00094) [2022-07-10 05:46:39,196][26022] Updated weights on worker 0-0, policy_version 592909 (0.00088) [2022-07-10 05:46:40,936][26022] Updated weights on worker 0-0, policy_version 592919 (0.00088) [2022-07-10 05:46:41,463][25689] Fps is (10 sec: 5920.5, 60 sec: 5655.0, 300 sec: 5631.5). Total num frames: 607152128. Throughput: 0: 5067.8. Samples: 607147964. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:46:41,463][25689] Avg episode reward: [(0, '-25.733')] [2022-07-10 05:46:42,762][26022] Updated weights on worker 0-0, policy_version 592929 (0.00094) [2022-07-10 05:46:44,463][26022] Updated weights on worker 0-0, policy_version 592939 (0.00090) [2022-07-10 05:46:46,395][26022] Updated weights on worker 0-0, policy_version 592949 (0.00087) [2022-07-10 05:46:46,502][25689] Fps is (10 sec: 5498.9, 60 sec: 5622.0, 300 sec: 5628.2). Total num frames: 607179776. Throughput: 0: 5936.5. Samples: 607182404. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:46:46,503][25689] Avg episode reward: [(0, '-25.514')] [2022-07-10 05:46:47,901][26022] Updated weights on worker 0-0, policy_version 592959 (0.00086) [2022-07-10 05:46:50,095][26022] Updated weights on worker 0-0, policy_version 592969 (0.00085) [2022-07-10 05:46:51,502][26022] Updated weights on worker 0-0, policy_version 592979 (0.00086) [2022-07-10 05:46:51,510][25689] Fps is (10 sec: 5809.1, 60 sec: 5689.3, 300 sec: 5631.6). Total num frames: 607210496. Throughput: 0: 5952.0. Samples: 607216618. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:46:51,511][25689] Avg episode reward: [(0, '-24.471')] [2022-07-10 05:46:53,419][26022] Updated weights on worker 0-0, policy_version 592989 (0.00085) [2022-07-10 05:46:55,274][26022] Updated weights on worker 0-0, policy_version 592999 (0.00087) [2022-07-10 05:46:56,515][25689] Fps is (10 sec: 5727.4, 60 sec: 5595.0, 300 sec: 5632.2). Total num frames: 607237120. Throughput: 0: 5125.1. Samples: 607233738. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:46:56,515][25689] Avg episode reward: [(0, '-25.310')] [2022-07-10 05:46:57,119][26022] Updated weights on worker 0-0, policy_version 593009 (0.00083) [2022-07-10 05:46:58,980][26022] Updated weights on worker 0-0, policy_version 593019 (0.00080) [2022-07-10 05:47:00,707][26022] Updated weights on worker 0-0, policy_version 593029 (0.00094) [2022-07-10 05:47:01,531][25689] Fps is (10 sec: 5313.9, 60 sec: 5595.7, 300 sec: 5633.5). Total num frames: 607263744. Throughput: 0: 5960.5. Samples: 607267580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:01,531][25689] Avg episode reward: [(0, '-25.734')] [2022-07-10 05:47:02,856][26022] Updated weights on worker 0-0, policy_version 593039 (0.00097) [2022-07-10 05:47:04,947][26022] Updated weights on worker 0-0, policy_version 593049 (0.00057) [2022-07-10 05:47:06,572][26022] Updated weights on worker 0-0, policy_version 593059 (0.00096) [2022-07-10 05:47:06,599][25689] Fps is (10 sec: 5483.4, 60 sec: 5652.3, 300 sec: 5632.6). Total num frames: 607292416. Throughput: 0: 5814.2. Samples: 607299250. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:06,599][25689] Avg episode reward: [(0, '-25.415')] [2022-07-10 05:47:08,513][26022] Updated weights on worker 0-0, policy_version 593069 (0.00085) [2022-07-10 05:47:10,130][26022] Updated weights on worker 0-0, policy_version 593079 (0.00091) [2022-07-10 05:47:11,608][25689] Fps is (10 sec: 5589.0, 60 sec: 5602.6, 300 sec: 5633.2). Total num frames: 607320064. Throughput: 0: 4959.4. Samples: 607316290. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:11,609][25689] Avg episode reward: [(0, '-25.613')] [2022-07-10 05:47:12,255][26022] Updated weights on worker 0-0, policy_version 593089 (0.00090) [2022-07-10 05:47:13,589][26022] Updated weights on worker 0-0, policy_version 593099 (0.00083) [2022-07-10 05:47:15,772][26022] Updated weights on worker 0-0, policy_version 593109 (0.00088) [2022-07-10 05:47:16,627][25689] Fps is (10 sec: 5718.5, 60 sec: 5635.3, 300 sec: 5640.3). Total num frames: 607349760. Throughput: 0: 5814.0. Samples: 607350670. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:16,627][25689] Avg episode reward: [(0, '-26.319')] [2022-07-10 05:47:17,099][26022] Updated weights on worker 0-0, policy_version 593119 (0.00092) [2022-07-10 05:47:19,416][26022] Updated weights on worker 0-0, policy_version 593129 (0.00092) [2022-07-10 05:47:20,976][26022] Updated weights on worker 0-0, policy_version 593139 (0.00083) [2022-07-10 05:47:21,635][25689] Fps is (10 sec: 5719.1, 60 sec: 5639.6, 300 sec: 5630.9). Total num frames: 607377408. Throughput: 0: 5834.7. Samples: 607384880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:21,636][25689] Avg episode reward: [(0, '-27.227')] [2022-07-10 05:47:22,822][26022] Updated weights on worker 0-0, policy_version 593149 (0.00088) [2022-07-10 05:47:24,645][26022] Updated weights on worker 0-0, policy_version 593159 (0.01067) [2022-07-10 05:47:26,430][26022] Updated weights on worker 0-0, policy_version 593169 (0.00092) [2022-07-10 05:47:26,684][25689] Fps is (10 sec: 5600.0, 60 sec: 5606.3, 300 sec: 5630.4). Total num frames: 607406080. Throughput: 0: 5117.5. Samples: 607402034. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:26,684][25689] Avg episode reward: [(0, '-26.465')] [2022-07-10 05:47:28,186][26022] Updated weights on worker 0-0, policy_version 593179 (0.00087) [2022-07-10 05:47:30,103][26022] Updated weights on worker 0-0, policy_version 593189 (0.00084) [2022-07-10 05:47:31,607][26022] Updated weights on worker 0-0, policy_version 593199 (0.00083) [2022-07-10 05:47:31,691][25689] Fps is (10 sec: 5804.0, 60 sec: 5692.7, 300 sec: 5640.7). Total num frames: 607435776. Throughput: 0: 5969.9. Samples: 607436184. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:31,692][25689] Avg episode reward: [(0, '-27.041')] [2022-07-10 05:47:33,729][26022] Updated weights on worker 0-0, policy_version 593209 (0.00059) [2022-07-10 05:47:35,406][26022] Updated weights on worker 0-0, policy_version 593219 (0.00095) [2022-07-10 05:47:36,737][25689] Fps is (10 sec: 5602.1, 60 sec: 5604.8, 300 sec: 5633.2). Total num frames: 607462400. Throughput: 0: 5925.4. Samples: 607469832. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:36,738][25689] Avg episode reward: [(0, '-27.470')] [2022-07-10 05:47:37,208][26022] Updated weights on worker 0-0, policy_version 593229 (0.00081) [2022-07-10 05:47:39,235][26022] Updated weights on worker 0-0, policy_version 593239 (0.00097) [2022-07-10 05:47:40,823][26022] Updated weights on worker 0-0, policy_version 593249 (0.00093) [2022-07-10 05:47:41,757][25689] Fps is (10 sec: 5493.6, 60 sec: 5621.5, 300 sec: 5633.9). Total num frames: 607491072. Throughput: 0: 5072.4. Samples: 607486948. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:41,757][25689] Avg episode reward: [(0, '-25.897')] [2022-07-10 05:47:42,775][26022] Updated weights on worker 0-0, policy_version 593259 (0.00090) [2022-07-10 05:47:44,458][26022] Updated weights on worker 0-0, policy_version 593269 (0.00087) [2022-07-10 05:47:46,371][26022] Updated weights on worker 0-0, policy_version 593279 (0.00100) [2022-07-10 05:47:46,821][25689] Fps is (10 sec: 5788.4, 60 sec: 5653.2, 300 sec: 5632.9). Total num frames: 607520768. Throughput: 0: 5915.9. Samples: 607521164. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:46,821][25689] Avg episode reward: [(0, '-25.458')] [2022-07-10 05:47:48,100][26022] Updated weights on worker 0-0, policy_version 593289 (0.00094) [2022-07-10 05:47:49,866][26022] Updated weights on worker 0-0, policy_version 593299 (0.00085) [2022-07-10 05:47:51,822][25689] Fps is (10 sec: 5595.3, 60 sec: 5585.9, 300 sec: 5626.2). Total num frames: 607547392. Throughput: 0: 5900.4. Samples: 607554966. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:51,823][25689] Avg episode reward: [(0, '-24.608')] [2022-07-10 05:47:51,835][26022] Updated weights on worker 0-0, policy_version 593309 (0.00088) [2022-07-10 05:47:53,445][26022] Updated weights on worker 0-0, policy_version 593319 (0.00087) [2022-07-10 05:47:55,474][26022] Updated weights on worker 0-0, policy_version 593329 (0.00093) [2022-07-10 05:47:56,832][25689] Fps is (10 sec: 5625.6, 60 sec: 5636.3, 300 sec: 5633.1). Total num frames: 607577088. Throughput: 0: 5080.8. Samples: 607571930. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:47:56,833][25689] Avg episode reward: [(0, '-24.390')] [2022-07-10 05:47:57,151][26022] Updated weights on worker 0-0, policy_version 593339 (0.00094) [2022-07-10 05:47:59,028][26022] Updated weights on worker 0-0, policy_version 593349 (0.00085) [2022-07-10 05:48:00,644][26022] Updated weights on worker 0-0, policy_version 593359 (0.00090) [2022-07-10 05:48:01,843][25689] Fps is (10 sec: 5518.3, 60 sec: 5619.9, 300 sec: 5624.5). Total num frames: 607602688. Throughput: 0: 5925.9. Samples: 607605976. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:01,843][25689] Avg episode reward: [(0, '-24.808')] [2022-07-10 05:48:03,062][26022] Updated weights on worker 0-0, policy_version 593369 (0.00093) [2022-07-10 05:48:04,773][26022] Updated weights on worker 0-0, policy_version 593379 (0.00093) [2022-07-10 05:48:05,539][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:48:05,554][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000593382_607623168.pth [2022-07-10 05:48:05,555][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000591400_605593600.pth [2022-07-10 05:48:06,781][26022] Updated weights on worker 0-0, policy_version 593389 (0.00094) [2022-07-10 05:48:06,920][25689] Fps is (10 sec: 5278.5, 60 sec: 5602.0, 300 sec: 5627.7). Total num frames: 607630336. Throughput: 0: 5771.9. Samples: 607637174. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:06,920][25689] Avg episode reward: [(0, '-23.653')] [2022-07-10 05:48:08,563][26022] Updated weights on worker 0-0, policy_version 593399 (0.00095) [2022-07-10 05:48:10,265][26022] Updated weights on worker 0-0, policy_version 593409 (0.00090) [2022-07-10 05:48:11,949][25689] Fps is (10 sec: 5673.9, 60 sec: 5634.1, 300 sec: 5631.1). Total num frames: 607660032. Throughput: 0: 4921.6. Samples: 607654024. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:11,950][25689] Avg episode reward: [(0, '-24.731')] [2022-07-10 05:48:12,155][26022] Updated weights on worker 0-0, policy_version 593419 (0.00111) [2022-07-10 05:48:14,080][26022] Updated weights on worker 0-0, policy_version 593429 (0.00093) [2022-07-10 05:48:15,766][26022] Updated weights on worker 0-0, policy_version 593439 (0.00090) [2022-07-10 05:48:16,957][25689] Fps is (10 sec: 5713.0, 60 sec: 5601.1, 300 sec: 5628.0). Total num frames: 607687680. Throughput: 0: 5756.0. Samples: 607687770. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:16,959][25689] Avg episode reward: [(0, '-24.544')] [2022-07-10 05:48:17,884][26022] Updated weights on worker 0-0, policy_version 593449 (0.00085) [2022-07-10 05:48:19,495][26022] Updated weights on worker 0-0, policy_version 593459 (0.00086) [2022-07-10 05:48:21,187][26022] Updated weights on worker 0-0, policy_version 593469 (0.00087) [2022-07-10 05:48:21,986][25689] Fps is (10 sec: 5611.4, 60 sec: 5616.2, 300 sec: 5629.7). Total num frames: 607716352. Throughput: 0: 5741.5. Samples: 607721628. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:21,988][25689] Avg episode reward: [(0, '-23.786')] [2022-07-10 05:48:23,267][26022] Updated weights on worker 0-0, policy_version 593479 (0.00088) [2022-07-10 05:48:24,856][26022] Updated weights on worker 0-0, policy_version 593489 (0.00094) [2022-07-10 05:48:26,880][26022] Updated weights on worker 0-0, policy_version 593499 (0.00096) [2022-07-10 05:48:27,083][25689] Fps is (10 sec: 5562.2, 60 sec: 5594.8, 300 sec: 5624.8). Total num frames: 607744000. Throughput: 0: 5024.6. Samples: 607738486. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:27,083][25689] Avg episode reward: [(0, '-23.732')] [2022-07-10 05:48:28,574][26022] Updated weights on worker 0-0, policy_version 593509 (0.00080) [2022-07-10 05:48:30,555][26022] Updated weights on worker 0-0, policy_version 593519 (0.00090) [2022-07-10 05:48:32,127][25689] Fps is (10 sec: 5553.8, 60 sec: 5574.5, 300 sec: 5621.2). Total num frames: 607772672. Throughput: 0: 5851.5. Samples: 607772092. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:32,129][25689] Avg episode reward: [(0, '-23.889')] [2022-07-10 05:48:32,303][26022] Updated weights on worker 0-0, policy_version 593529 (0.00092) [2022-07-10 05:48:34,155][26022] Updated weights on worker 0-0, policy_version 593539 (0.00086) [2022-07-10 05:48:35,734][26022] Updated weights on worker 0-0, policy_version 593549 (0.00088) [2022-07-10 05:48:37,133][25689] Fps is (10 sec: 5706.0, 60 sec: 5612.1, 300 sec: 5628.2). Total num frames: 607801344. Throughput: 0: 5871.8. Samples: 607806236. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:37,133][25689] Avg episode reward: [(0, '-24.324')] [2022-07-10 05:48:37,768][26022] Updated weights on worker 0-0, policy_version 593559 (0.00088) [2022-07-10 05:48:39,392][26022] Updated weights on worker 0-0, policy_version 593569 (0.00092) [2022-07-10 05:48:41,366][26022] Updated weights on worker 0-0, policy_version 593579 (0.00402) [2022-07-10 05:48:42,193][25689] Fps is (10 sec: 5696.6, 60 sec: 5608.3, 300 sec: 5621.1). Total num frames: 607830016. Throughput: 0: 5028.8. Samples: 607823246. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:42,195][25689] Avg episode reward: [(0, '-24.397')] [2022-07-10 05:48:43,070][26022] Updated weights on worker 0-0, policy_version 593589 (0.00092) [2022-07-10 05:48:44,959][26022] Updated weights on worker 0-0, policy_version 593599 (0.00087) [2022-07-10 05:48:46,730][26022] Updated weights on worker 0-0, policy_version 593609 (0.00093) [2022-07-10 05:48:47,247][25689] Fps is (10 sec: 5669.7, 60 sec: 5592.3, 300 sec: 5622.0). Total num frames: 607858688. Throughput: 0: 5893.7. Samples: 607857326. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:47,247][25689] Avg episode reward: [(0, '-23.921')] [2022-07-10 05:48:48,604][26022] Updated weights on worker 0-0, policy_version 593619 (0.00116) [2022-07-10 05:48:50,226][26022] Updated weights on worker 0-0, policy_version 593629 (0.00091) [2022-07-10 05:48:52,263][25689] Fps is (10 sec: 5592.7, 60 sec: 5607.8, 300 sec: 5622.0). Total num frames: 607886336. Throughput: 0: 5898.8. Samples: 607890874. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:52,264][25689] Avg episode reward: [(0, '-24.571')] [2022-07-10 05:48:52,274][26022] Updated weights on worker 0-0, policy_version 593639 (0.00089) [2022-07-10 05:48:53,949][26022] Updated weights on worker 0-0, policy_version 593649 (0.00092) [2022-07-10 05:48:55,876][26022] Updated weights on worker 0-0, policy_version 593659 (0.00087) [2022-07-10 05:48:57,302][25689] Fps is (10 sec: 5601.2, 60 sec: 5588.3, 300 sec: 5625.8). Total num frames: 607915008. Throughput: 0: 5042.2. Samples: 607907932. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:48:57,302][25689] Avg episode reward: [(0, '-24.403')] [2022-07-10 05:48:57,798][26022] Updated weights on worker 0-0, policy_version 593669 (0.00085) [2022-07-10 05:48:59,190][26022] Updated weights on worker 0-0, policy_version 593679 (0.00087) [2022-07-10 05:49:01,358][26022] Updated weights on worker 0-0, policy_version 593689 (0.00088) [2022-07-10 05:49:02,315][25689] Fps is (10 sec: 5501.4, 60 sec: 5605.0, 300 sec: 5628.1). Total num frames: 607941632. Throughput: 0: 5905.0. Samples: 607942064. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:02,315][25689] Avg episode reward: [(0, '-25.485')] [2022-07-10 05:49:03,286][26022] Updated weights on worker 0-0, policy_version 593699 (0.00085) [2022-07-10 05:49:05,274][26022] Updated weights on worker 0-0, policy_version 593709 (0.00083) [2022-07-10 05:49:06,964][26022] Updated weights on worker 0-0, policy_version 593719 (0.00083) [2022-07-10 05:49:07,382][25689] Fps is (10 sec: 5384.0, 60 sec: 5605.9, 300 sec: 5620.5). Total num frames: 607969280. Throughput: 0: 5791.9. Samples: 607973946. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:07,382][25689] Avg episode reward: [(0, '-25.086')] [2022-07-10 05:49:08,806][26022] Updated weights on worker 0-0, policy_version 593729 (0.00091) [2022-07-10 05:49:10,890][26022] Updated weights on worker 0-0, policy_version 593739 (0.00095) [2022-07-10 05:49:12,383][25689] Fps is (10 sec: 5593.8, 60 sec: 5591.6, 300 sec: 5618.5). Total num frames: 607997952. Throughput: 0: 4973.2. Samples: 607990934. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:12,384][25689] Avg episode reward: [(0, '-25.121')] [2022-07-10 05:49:12,504][26022] Updated weights on worker 0-0, policy_version 593749 (0.00085) [2022-07-10 05:49:14,262][26022] Updated weights on worker 0-0, policy_version 593759 (0.00097) [2022-07-10 05:49:16,213][26022] Updated weights on worker 0-0, policy_version 593769 (0.00093) [2022-07-10 05:49:17,416][25689] Fps is (10 sec: 5714.8, 60 sec: 5606.2, 300 sec: 5625.4). Total num frames: 608026624. Throughput: 0: 5814.5. Samples: 608024888. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:17,416][25689] Avg episode reward: [(0, '-25.513')] [2022-07-10 05:49:17,778][26022] Updated weights on worker 0-0, policy_version 593779 (0.00097) [2022-07-10 05:49:19,880][26022] Updated weights on worker 0-0, policy_version 593789 (0.00619) [2022-07-10 05:49:21,435][26022] Updated weights on worker 0-0, policy_version 593799 (0.00088) [2022-07-10 05:49:22,419][25689] Fps is (10 sec: 5611.7, 60 sec: 5591.6, 300 sec: 5622.7). Total num frames: 608054272. Throughput: 0: 5804.3. Samples: 608058756. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:22,420][25689] Avg episode reward: [(0, '-25.488')] [2022-07-10 05:49:23,508][26022] Updated weights on worker 0-0, policy_version 593809 (0.00092) [2022-07-10 05:49:25,060][26022] Updated weights on worker 0-0, policy_version 593819 (0.00084) [2022-07-10 05:49:27,073][26022] Updated weights on worker 0-0, policy_version 593829 (0.00084) [2022-07-10 05:49:27,556][25689] Fps is (10 sec: 5554.0, 60 sec: 5604.8, 300 sec: 5618.1). Total num frames: 608082944. Throughput: 0: 5879.3. Samples: 608092560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:27,557][25689] Avg episode reward: [(0, '-25.029')] [2022-07-10 05:49:28,905][26022] Updated weights on worker 0-0, policy_version 593839 (0.00082) [2022-07-10 05:49:30,796][26022] Updated weights on worker 0-0, policy_version 593849 (0.00089) [2022-07-10 05:49:32,407][26022] Updated weights on worker 0-0, policy_version 593859 (0.00092) [2022-07-10 05:49:32,583][25689] Fps is (10 sec: 5742.9, 60 sec: 5623.4, 300 sec: 5625.2). Total num frames: 608112640. Throughput: 0: 5875.8. Samples: 608109624. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:32,583][25689] Avg episode reward: [(0, '-25.868')] [2022-07-10 05:49:34,196][26022] Updated weights on worker 0-0, policy_version 593869 (0.00093) [2022-07-10 05:49:35,996][26022] Updated weights on worker 0-0, policy_version 593879 (0.00087) [2022-07-10 05:49:37,652][25689] Fps is (10 sec: 5578.5, 60 sec: 5583.6, 300 sec: 5617.6). Total num frames: 608139264. Throughput: 0: 5851.2. Samples: 608143294. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:37,653][25689] Avg episode reward: [(0, '-25.181')] [2022-07-10 05:49:38,161][26022] Updated weights on worker 0-0, policy_version 593889 (0.00096) [2022-07-10 05:49:39,712][26022] Updated weights on worker 0-0, policy_version 593899 (0.00086) [2022-07-10 05:49:41,529][26022] Updated weights on worker 0-0, policy_version 593909 (0.00082) [2022-07-10 05:49:42,713][25689] Fps is (10 sec: 5559.6, 60 sec: 5600.5, 300 sec: 5621.4). Total num frames: 608168960. Throughput: 0: 5822.4. Samples: 608176914. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 05:49:42,715][25689] Avg episode reward: [(0, '-24.946')] [2022-07-10 05:49:43,642][26022] Updated weights on worker 0-0, policy_version 593919 (0.00093) [2022-07-10 05:49:45,264][26022] Updated weights on worker 0-0, policy_version 593929 (0.00084) [2022-07-10 05:49:47,050][26022] Updated weights on worker 0-0, policy_version 593939 (0.00096) [2022-07-10 05:49:47,779][25689] Fps is (10 sec: 5763.8, 60 sec: 5599.4, 300 sec: 5621.8). Total num frames: 608197632. Throughput: 0: 5003.4. Samples: 608193742. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:49:47,780][25689] Avg episode reward: [(0, '-24.668')] [2022-07-10 05:49:49,049][26022] Updated weights on worker 0-0, policy_version 593949 (0.00083) [2022-07-10 05:49:50,601][26022] Updated weights on worker 0-0, policy_version 593959 (0.00092) [2022-07-10 05:49:52,691][26022] Updated weights on worker 0-0, policy_version 593969 (0.00090) [2022-07-10 05:49:52,802][25689] Fps is (10 sec: 5480.7, 60 sec: 5581.9, 300 sec: 5615.1). Total num frames: 608224256. Throughput: 0: 5853.1. Samples: 608227970. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:49:52,803][25689] Avg episode reward: [(0, '-24.508')] [2022-07-10 05:49:53,950][26022] Updated weights on worker 0-0, policy_version 593979 (0.00087) [2022-07-10 05:49:56,249][26022] Updated weights on worker 0-0, policy_version 593989 (0.00089) [2022-07-10 05:49:57,836][25689] Fps is (10 sec: 5701.6, 60 sec: 5616.1, 300 sec: 5618.0). Total num frames: 608254976. Throughput: 0: 5882.8. Samples: 608262034. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:49:57,837][25689] Avg episode reward: [(0, '-23.819')] [2022-07-10 05:49:57,841][26022] Updated weights on worker 0-0, policy_version 593999 (0.00090) [2022-07-10 05:49:59,813][26022] Updated weights on worker 0-0, policy_version 594009 (0.00089) [2022-07-10 05:50:01,653][26022] Updated weights on worker 0-0, policy_version 594019 (0.00322) [2022-07-10 05:50:02,867][25689] Fps is (10 sec: 5595.8, 60 sec: 5597.5, 300 sec: 5615.8). Total num frames: 608280576. Throughput: 0: 5068.0. Samples: 608279054. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:02,868][25689] Avg episode reward: [(0, '-23.523')] [2022-07-10 05:50:03,771][26022] Updated weights on worker 0-0, policy_version 594029 (0.00087) [2022-07-10 05:50:05,462][26022] Updated weights on worker 0-0, policy_version 594039 (0.00094) [2022-07-10 05:50:05,649][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:50:05,663][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000594041_608297984.pth [2022-07-10 05:50:05,665][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000592060_606269440.pth [2022-07-10 05:50:07,616][26022] Updated weights on worker 0-0, policy_version 594049 (0.00087) [2022-07-10 05:50:08,001][25689] Fps is (10 sec: 5238.8, 60 sec: 5591.4, 300 sec: 5617.3). Total num frames: 608308224. Throughput: 0: 5795.0. Samples: 608310926. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:08,001][25689] Avg episode reward: [(0, '-24.257')] [2022-07-10 05:50:09,144][26022] Updated weights on worker 0-0, policy_version 594059 (0.00099) [2022-07-10 05:50:11,219][26022] Updated weights on worker 0-0, policy_version 594069 (0.00085) [2022-07-10 05:50:12,748][26022] Updated weights on worker 0-0, policy_version 594079 (0.00095) [2022-07-10 05:50:13,011][25689] Fps is (10 sec: 5653.1, 60 sec: 5607.5, 300 sec: 5617.6). Total num frames: 608337920. Throughput: 0: 5769.9. Samples: 608344572. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:13,011][25689] Avg episode reward: [(0, '-25.549')] [2022-07-10 05:50:14,705][26022] Updated weights on worker 0-0, policy_version 594089 (0.00092) [2022-07-10 05:50:16,475][26022] Updated weights on worker 0-0, policy_version 594099 (0.00084) [2022-07-10 05:50:18,020][25689] Fps is (10 sec: 5723.6, 60 sec: 5592.8, 300 sec: 5614.9). Total num frames: 608365568. Throughput: 0: 4938.6. Samples: 608361710. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:18,020][25689] Avg episode reward: [(0, '-25.328')] [2022-07-10 05:50:18,223][26022] Updated weights on worker 0-0, policy_version 594109 (0.00088) [2022-07-10 05:50:20,221][26022] Updated weights on worker 0-0, policy_version 594119 (0.00094) [2022-07-10 05:50:21,923][26022] Updated weights on worker 0-0, policy_version 594129 (0.00086) [2022-07-10 05:50:23,090][25689] Fps is (10 sec: 5587.7, 60 sec: 5603.5, 300 sec: 5618.6). Total num frames: 608394240. Throughput: 0: 5770.1. Samples: 608395744. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:23,091][25689] Avg episode reward: [(0, '-26.170')] [2022-07-10 05:50:23,591][26022] Updated weights on worker 0-0, policy_version 594139 (0.00090) [2022-07-10 05:50:25,543][26022] Updated weights on worker 0-0, policy_version 594149 (0.00083) [2022-07-10 05:50:27,328][26022] Updated weights on worker 0-0, policy_version 594159 (0.00096) [2022-07-10 05:50:28,174][25689] Fps is (10 sec: 5647.4, 60 sec: 5608.4, 300 sec: 5615.4). Total num frames: 608422912. Throughput: 0: 5887.7. Samples: 608429698. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:28,175][25689] Avg episode reward: [(0, '-26.196')] [2022-07-10 05:50:29,100][26022] Updated weights on worker 0-0, policy_version 594169 (0.00090) [2022-07-10 05:50:31,073][26022] Updated weights on worker 0-0, policy_version 594179 (0.00086) [2022-07-10 05:50:32,705][26022] Updated weights on worker 0-0, policy_version 594189 (0.00089) [2022-07-10 05:50:33,195][25689] Fps is (10 sec: 5675.1, 60 sec: 5592.0, 300 sec: 5619.0). Total num frames: 608451584. Throughput: 0: 5044.6. Samples: 608446392. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:33,196][25689] Avg episode reward: [(0, '-25.301')] [2022-07-10 05:50:34,508][26022] Updated weights on worker 0-0, policy_version 594199 (0.00082) [2022-07-10 05:50:36,459][26022] Updated weights on worker 0-0, policy_version 594209 (0.00095) [2022-07-10 05:50:38,147][26022] Updated weights on worker 0-0, policy_version 594219 (0.00090) [2022-07-10 05:50:38,211][25689] Fps is (10 sec: 5814.9, 60 sec: 5647.6, 300 sec: 5622.5). Total num frames: 608481280. Throughput: 0: 5883.0. Samples: 608480498. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:38,212][25689] Avg episode reward: [(0, '-24.568')] [2022-07-10 05:50:40,055][26022] Updated weights on worker 0-0, policy_version 594229 (0.00102) [2022-07-10 05:50:41,647][26022] Updated weights on worker 0-0, policy_version 594239 (0.00094) [2022-07-10 05:50:43,235][25689] Fps is (10 sec: 5507.6, 60 sec: 5583.5, 300 sec: 5609.2). Total num frames: 608506880. Throughput: 0: 5901.1. Samples: 608514618. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:43,236][25689] Avg episode reward: [(0, '-24.891')] [2022-07-10 05:50:43,686][26022] Updated weights on worker 0-0, policy_version 594249 (0.00084) [2022-07-10 05:50:45,307][26022] Updated weights on worker 0-0, policy_version 594259 (0.00093) [2022-07-10 05:50:47,288][26022] Updated weights on worker 0-0, policy_version 594269 (0.00093) [2022-07-10 05:50:48,367][25689] Fps is (10 sec: 5545.9, 60 sec: 5611.2, 300 sec: 5620.5). Total num frames: 608537600. Throughput: 0: 5037.5. Samples: 608531420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:48,367][25689] Avg episode reward: [(0, '-25.701')] [2022-07-10 05:50:49,089][26022] Updated weights on worker 0-0, policy_version 594279 (0.00087) [2022-07-10 05:50:50,915][26022] Updated weights on worker 0-0, policy_version 594289 (0.00091) [2022-07-10 05:50:52,759][26022] Updated weights on worker 0-0, policy_version 594299 (0.00087) [2022-07-10 05:50:53,395][25689] Fps is (10 sec: 5744.7, 60 sec: 5627.6, 300 sec: 5604.4). Total num frames: 608565248. Throughput: 0: 5900.9. Samples: 608565592. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:53,395][25689] Avg episode reward: [(0, '-25.877')] [2022-07-10 05:50:54,352][26022] Updated weights on worker 0-0, policy_version 594309 (0.00089) [2022-07-10 05:50:56,231][26022] Updated weights on worker 0-0, policy_version 594319 (0.00086) [2022-07-10 05:50:58,045][26022] Updated weights on worker 0-0, policy_version 594329 (0.00090) [2022-07-10 05:50:58,397][25689] Fps is (10 sec: 5615.2, 60 sec: 5596.8, 300 sec: 5611.7). Total num frames: 608593920. Throughput: 0: 5903.3. Samples: 608599658. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:50:58,397][25689] Avg episode reward: [(0, '-25.695')] [2022-07-10 05:50:59,876][26022] Updated weights on worker 0-0, policy_version 594339 (0.00084) [2022-07-10 05:51:01,855][26022] Updated weights on worker 0-0, policy_version 594349 (0.00102) [2022-07-10 05:51:03,421][25689] Fps is (10 sec: 5515.3, 60 sec: 5614.3, 300 sec: 5617.1). Total num frames: 608620544. Throughput: 0: 5053.8. Samples: 608616636. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:03,421][25689] Avg episode reward: [(0, '-26.358')] [2022-07-10 05:51:03,845][26022] Updated weights on worker 0-0, policy_version 594359 (0.00090) [2022-07-10 05:51:05,756][26022] Updated weights on worker 0-0, policy_version 594369 (0.00085) [2022-07-10 05:51:07,467][26022] Updated weights on worker 0-0, policy_version 594379 (0.00086) [2022-07-10 05:51:08,534][25689] Fps is (10 sec: 5454.7, 60 sec: 5633.1, 300 sec: 5608.5). Total num frames: 608649216. Throughput: 0: 5814.6. Samples: 608648684. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:08,534][25689] Avg episode reward: [(0, '-26.571')] [2022-07-10 05:51:09,303][26022] Updated weights on worker 0-0, policy_version 594389 (0.00084) [2022-07-10 05:51:11,172][26022] Updated weights on worker 0-0, policy_version 594399 (0.01108) [2022-07-10 05:51:12,752][26022] Updated weights on worker 0-0, policy_version 594409 (0.00097) [2022-07-10 05:51:13,568][25689] Fps is (10 sec: 5651.4, 60 sec: 5614.0, 300 sec: 5611.4). Total num frames: 608677888. Throughput: 0: 5806.7. Samples: 608682730. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:13,568][25689] Avg episode reward: [(0, '-25.360')] [2022-07-10 05:51:14,740][26022] Updated weights on worker 0-0, policy_version 594419 (0.00092) [2022-07-10 05:51:16,356][26022] Updated weights on worker 0-0, policy_version 594429 (0.00088) [2022-07-10 05:51:18,373][26022] Updated weights on worker 0-0, policy_version 594439 (0.00080) [2022-07-10 05:51:18,583][25689] Fps is (10 sec: 5706.3, 60 sec: 5630.3, 300 sec: 5615.6). Total num frames: 608706560. Throughput: 0: 4963.1. Samples: 608699846. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:18,584][25689] Avg episode reward: [(0, '-24.817')] [2022-07-10 05:51:20,149][26022] Updated weights on worker 0-0, policy_version 594449 (0.00376) [2022-07-10 05:51:22,045][26022] Updated weights on worker 0-0, policy_version 594459 (0.00091) [2022-07-10 05:51:23,592][25689] Fps is (10 sec: 5720.5, 60 sec: 5636.1, 300 sec: 5609.6). Total num frames: 608735232. Throughput: 0: 5812.6. Samples: 608733882. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:23,592][25689] Avg episode reward: [(0, '-24.471')] [2022-07-10 05:51:23,771][26022] Updated weights on worker 0-0, policy_version 594469 (0.00118) [2022-07-10 05:51:25,936][26022] Updated weights on worker 0-0, policy_version 594479 (0.00084) [2022-07-10 05:51:27,442][26022] Updated weights on worker 0-0, policy_version 594489 (0.00089) [2022-07-10 05:51:28,637][25689] Fps is (10 sec: 5601.6, 60 sec: 5622.7, 300 sec: 5619.5). Total num frames: 608762880. Throughput: 0: 5898.6. Samples: 608767268. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:28,638][25689] Avg episode reward: [(0, '-24.143')] [2022-07-10 05:51:29,415][26022] Updated weights on worker 0-0, policy_version 594499 (0.00084) [2022-07-10 05:51:31,006][26022] Updated weights on worker 0-0, policy_version 594509 (0.00097) [2022-07-10 05:51:32,856][26022] Updated weights on worker 0-0, policy_version 594519 (0.00057) [2022-07-10 05:51:33,665][25689] Fps is (10 sec: 5489.7, 60 sec: 5605.1, 300 sec: 5605.5). Total num frames: 608790528. Throughput: 0: 5051.6. Samples: 608784252. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:33,665][25689] Avg episode reward: [(0, '-23.026')] [2022-07-10 05:51:34,779][26022] Updated weights on worker 0-0, policy_version 594529 (0.00093) [2022-07-10 05:51:36,544][26022] Updated weights on worker 0-0, policy_version 594539 (0.00091) [2022-07-10 05:51:38,240][26022] Updated weights on worker 0-0, policy_version 594549 (0.00118) [2022-07-10 05:51:38,685][25689] Fps is (10 sec: 5707.3, 60 sec: 5604.8, 300 sec: 5612.3). Total num frames: 608820224. Throughput: 0: 5882.8. Samples: 608818102. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:38,686][25689] Avg episode reward: [(0, '-23.239')] [2022-07-10 05:51:40,402][26022] Updated weights on worker 0-0, policy_version 594559 (0.00102) [2022-07-10 05:51:41,809][26022] Updated weights on worker 0-0, policy_version 594569 (0.00088) [2022-07-10 05:51:43,701][25689] Fps is (10 sec: 5611.6, 60 sec: 5622.4, 300 sec: 5609.3). Total num frames: 608846848. Throughput: 0: 5870.5. Samples: 608851932. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:43,702][25689] Avg episode reward: [(0, '-23.199')] [2022-07-10 05:51:44,031][26022] Updated weights on worker 0-0, policy_version 594579 (0.00087) [2022-07-10 05:51:45,337][26022] Updated weights on worker 0-0, policy_version 594589 (0.00093) [2022-07-10 05:51:47,456][26022] Updated weights on worker 0-0, policy_version 594599 (0.00090) [2022-07-10 05:51:48,835][25689] Fps is (10 sec: 5548.8, 60 sec: 5605.3, 300 sec: 5603.5). Total num frames: 608876544. Throughput: 0: 5882.8. Samples: 608886086. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:48,835][25689] Avg episode reward: [(0, '-24.387')] [2022-07-10 05:51:49,258][26022] Updated weights on worker 0-0, policy_version 594609 (0.00081) [2022-07-10 05:51:50,946][26022] Updated weights on worker 0-0, policy_version 594619 (0.00087) [2022-07-10 05:51:53,085][26022] Updated weights on worker 0-0, policy_version 594629 (0.00090) [2022-07-10 05:51:53,857][25689] Fps is (10 sec: 5747.2, 60 sec: 5622.8, 300 sec: 5610.0). Total num frames: 608905216. Throughput: 0: 5877.8. Samples: 608902938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:53,857][25689] Avg episode reward: [(0, '-24.852')] [2022-07-10 05:51:54,547][26022] Updated weights on worker 0-0, policy_version 594639 (0.00090) [2022-07-10 05:51:56,592][26022] Updated weights on worker 0-0, policy_version 594649 (0.00102) [2022-07-10 05:51:58,590][26022] Updated weights on worker 0-0, policy_version 594659 (0.00081) [2022-07-10 05:51:58,869][25689] Fps is (10 sec: 5613.0, 60 sec: 5604.9, 300 sec: 5613.5). Total num frames: 608932864. Throughput: 0: 5879.3. Samples: 608936768. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:51:58,869][25689] Avg episode reward: [(0, '-25.260')] [2022-07-10 05:52:00,044][26022] Updated weights on worker 0-0, policy_version 594669 (0.00090) [2022-07-10 05:52:02,475][26022] Updated weights on worker 0-0, policy_version 594679 (0.00093) [2022-07-10 05:52:03,927][25689] Fps is (10 sec: 5491.3, 60 sec: 5618.8, 300 sec: 5610.3). Total num frames: 608960512. Throughput: 0: 5770.6. Samples: 608968646. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:03,927][25689] Avg episode reward: [(0, '-25.041')] [2022-07-10 05:52:03,971][26022] Updated weights on worker 0-0, policy_version 594689 (0.00091) [2022-07-10 05:52:05,740][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:52:05,754][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000594698_608970752.pth [2022-07-10 05:52:05,754][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000592722_606947328.pth [2022-07-10 05:52:05,917][26022] Updated weights on worker 0-0, policy_version 594699 (0.00090) [2022-07-10 05:52:07,887][26022] Updated weights on worker 0-0, policy_version 594709 (0.00384) [2022-07-10 05:52:08,967][25689] Fps is (10 sec: 5577.5, 60 sec: 5625.6, 300 sec: 5613.2). Total num frames: 608989184. Throughput: 0: 4943.8. Samples: 608985614. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:08,967][25689] Avg episode reward: [(0, '-24.176')] [2022-07-10 05:52:09,618][26022] Updated weights on worker 0-0, policy_version 594719 (0.00311) [2022-07-10 05:52:11,471][26022] Updated weights on worker 0-0, policy_version 594729 (0.00086) [2022-07-10 05:52:13,207][26022] Updated weights on worker 0-0, policy_version 594739 (0.00090) [2022-07-10 05:52:13,979][25689] Fps is (10 sec: 5602.9, 60 sec: 5610.6, 300 sec: 5606.4). Total num frames: 609016832. Throughput: 0: 5785.9. Samples: 609019360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:13,979][25689] Avg episode reward: [(0, '-23.874')] [2022-07-10 05:52:14,973][26022] Updated weights on worker 0-0, policy_version 594749 (0.00087) [2022-07-10 05:52:16,803][26022] Updated weights on worker 0-0, policy_version 594759 (0.00085) [2022-07-10 05:52:18,639][26022] Updated weights on worker 0-0, policy_version 594769 (0.00085) [2022-07-10 05:52:18,999][25689] Fps is (10 sec: 5614.0, 60 sec: 5610.2, 300 sec: 5609.6). Total num frames: 609045504. Throughput: 0: 5804.9. Samples: 609053620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:19,000][25689] Avg episode reward: [(0, '-24.036')] [2022-07-10 05:52:20,547][26022] Updated weights on worker 0-0, policy_version 594779 (0.00083) [2022-07-10 05:52:22,220][26022] Updated weights on worker 0-0, policy_version 594789 (0.00083) [2022-07-10 05:52:24,011][25689] Fps is (10 sec: 5614.0, 60 sec: 5593.0, 300 sec: 5606.9). Total num frames: 609073152. Throughput: 0: 5082.4. Samples: 609070720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:24,012][25689] Avg episode reward: [(0, '-24.301')] [2022-07-10 05:52:24,072][26022] Updated weights on worker 0-0, policy_version 594799 (0.00090) [2022-07-10 05:52:25,795][26022] Updated weights on worker 0-0, policy_version 594809 (0.00088) [2022-07-10 05:52:27,788][26022] Updated weights on worker 0-0, policy_version 594819 (0.00093) [2022-07-10 05:52:29,075][25689] Fps is (10 sec: 5691.2, 60 sec: 5625.1, 300 sec: 5605.8). Total num frames: 609102848. Throughput: 0: 5927.8. Samples: 609104810. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:29,075][25689] Avg episode reward: [(0, '-24.551')] [2022-07-10 05:52:29,484][26022] Updated weights on worker 0-0, policy_version 594829 (0.00088) [2022-07-10 05:52:31,197][26022] Updated weights on worker 0-0, policy_version 594839 (0.00052) [2022-07-10 05:52:33,162][26022] Updated weights on worker 0-0, policy_version 594849 (0.00065) [2022-07-10 05:52:34,081][25689] Fps is (10 sec: 5592.5, 60 sec: 5610.1, 300 sec: 5606.5). Total num frames: 609129472. Throughput: 0: 5944.8. Samples: 609138866. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:34,082][25689] Avg episode reward: [(0, '-27.207')] [2022-07-10 05:52:34,732][26022] Updated weights on worker 0-0, policy_version 594859 (0.00089) [2022-07-10 05:52:36,875][26022] Updated weights on worker 0-0, policy_version 594869 (0.00086) [2022-07-10 05:52:38,384][26022] Updated weights on worker 0-0, policy_version 594879 (0.00095) [2022-07-10 05:52:39,123][25689] Fps is (10 sec: 5605.1, 60 sec: 5608.2, 300 sec: 5609.6). Total num frames: 609159168. Throughput: 0: 5088.3. Samples: 609156016. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:39,123][25689] Avg episode reward: [(0, '-27.622')] [2022-07-10 05:52:40,302][26022] Updated weights on worker 0-0, policy_version 594889 (0.00087) [2022-07-10 05:52:42,197][26022] Updated weights on worker 0-0, policy_version 594899 (0.00090) [2022-07-10 05:52:44,027][26022] Updated weights on worker 0-0, policy_version 594909 (0.00085) [2022-07-10 05:52:44,150][25689] Fps is (10 sec: 5695.2, 60 sec: 5624.0, 300 sec: 5603.4). Total num frames: 609186816. Throughput: 0: 5915.2. Samples: 609189850. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:44,151][25689] Avg episode reward: [(0, '-27.986')] [2022-07-10 05:52:45,714][26022] Updated weights on worker 0-0, policy_version 594919 (0.00080) [2022-07-10 05:52:47,631][26022] Updated weights on worker 0-0, policy_version 594929 (0.00094) [2022-07-10 05:52:49,179][26022] Updated weights on worker 0-0, policy_version 594939 (0.00090) [2022-07-10 05:52:49,267][25689] Fps is (10 sec: 5753.6, 60 sec: 5642.5, 300 sec: 5615.0). Total num frames: 609217536. Throughput: 0: 5904.4. Samples: 609224036. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 05:52:49,268][25689] Avg episode reward: [(0, '-29.064')] [2022-07-10 05:52:51,247][26022] Updated weights on worker 0-0, policy_version 594949 (0.00086) [2022-07-10 05:52:52,843][26022] Updated weights on worker 0-0, policy_version 594959 (0.00089) [2022-07-10 05:52:54,283][25689] Fps is (10 sec: 5760.1, 60 sec: 5626.1, 300 sec: 5608.0). Total num frames: 609245184. Throughput: 0: 5063.6. Samples: 609241160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:52:54,284][25689] Avg episode reward: [(0, '-27.873')] [2022-07-10 05:52:54,817][26022] Updated weights on worker 0-0, policy_version 594969 (0.00093) [2022-07-10 05:52:56,575][26022] Updated weights on worker 0-0, policy_version 594979 (0.00087) [2022-07-10 05:52:58,306][26022] Updated weights on worker 0-0, policy_version 594989 (0.00088) [2022-07-10 05:52:59,290][25689] Fps is (10 sec: 5619.4, 60 sec: 5643.6, 300 sec: 5618.4). Total num frames: 609273856. Throughput: 0: 5928.6. Samples: 609275578. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:52:59,290][25689] Avg episode reward: [(0, '-25.925')] [2022-07-10 05:53:00,263][26022] Updated weights on worker 0-0, policy_version 594999 (0.00088) [2022-07-10 05:53:02,264][26022] Updated weights on worker 0-0, policy_version 595009 (0.00091) [2022-07-10 05:53:04,264][26022] Updated weights on worker 0-0, policy_version 595019 (0.00090) [2022-07-10 05:53:04,390][25689] Fps is (10 sec: 5471.2, 60 sec: 5622.7, 300 sec: 5614.5). Total num frames: 609300480. Throughput: 0: 5800.5. Samples: 609307252. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:04,391][25689] Avg episode reward: [(0, '-24.151')] [2022-07-10 05:53:05,956][26022] Updated weights on worker 0-0, policy_version 595029 (0.00083) [2022-07-10 05:53:07,853][26022] Updated weights on worker 0-0, policy_version 595039 (0.00907) [2022-07-10 05:53:09,374][26022] Updated weights on worker 0-0, policy_version 595049 (0.00085) [2022-07-10 05:53:09,468][25689] Fps is (10 sec: 5533.6, 60 sec: 5636.1, 300 sec: 5613.6). Total num frames: 609330176. Throughput: 0: 4965.3. Samples: 609324336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:09,468][25689] Avg episode reward: [(0, '-23.105')] [2022-07-10 05:53:11,367][26022] Updated weights on worker 0-0, policy_version 595059 (0.00091) [2022-07-10 05:53:13,096][26022] Updated weights on worker 0-0, policy_version 595069 (0.00095) [2022-07-10 05:53:14,511][25689] Fps is (10 sec: 5666.1, 60 sec: 5633.3, 300 sec: 5612.9). Total num frames: 609357824. Throughput: 0: 5789.7. Samples: 609358270. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:14,511][25689] Avg episode reward: [(0, '-22.581')] [2022-07-10 05:53:15,095][26022] Updated weights on worker 0-0, policy_version 595079 (0.00090) [2022-07-10 05:53:16,647][26022] Updated weights on worker 0-0, policy_version 595089 (0.00094) [2022-07-10 05:53:18,509][26022] Updated weights on worker 0-0, policy_version 595099 (0.00064) [2022-07-10 05:53:19,516][25689] Fps is (10 sec: 5604.8, 60 sec: 5634.6, 300 sec: 5613.4). Total num frames: 609386496. Throughput: 0: 5781.7. Samples: 609392522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:19,517][25689] Avg episode reward: [(0, '-22.591')] [2022-07-10 05:53:20,410][26022] Updated weights on worker 0-0, policy_version 595109 (0.00094) [2022-07-10 05:53:22,184][26022] Updated weights on worker 0-0, policy_version 595119 (0.00091) [2022-07-10 05:53:23,857][26022] Updated weights on worker 0-0, policy_version 595129 (0.00084) [2022-07-10 05:53:24,533][25689] Fps is (10 sec: 5619.7, 60 sec: 5634.2, 300 sec: 5614.9). Total num frames: 609414144. Throughput: 0: 5084.9. Samples: 609409674. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:24,533][25689] Avg episode reward: [(0, '-23.180')] [2022-07-10 05:53:25,812][26022] Updated weights on worker 0-0, policy_version 595139 (0.00090) [2022-07-10 05:53:27,641][26022] Updated weights on worker 0-0, policy_version 595149 (0.00092) [2022-07-10 05:53:29,457][26022] Updated weights on worker 0-0, policy_version 595159 (0.00079) [2022-07-10 05:53:29,603][25689] Fps is (10 sec: 5583.7, 60 sec: 5616.7, 300 sec: 5614.4). Total num frames: 609442816. Throughput: 0: 5906.7. Samples: 609443268. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:29,604][25689] Avg episode reward: [(0, '-23.081')] [2022-07-10 05:53:31,412][26022] Updated weights on worker 0-0, policy_version 595169 (0.00092) [2022-07-10 05:53:33,049][26022] Updated weights on worker 0-0, policy_version 595179 (0.00084) [2022-07-10 05:53:34,623][25689] Fps is (10 sec: 5581.7, 60 sec: 5632.4, 300 sec: 5610.7). Total num frames: 609470464. Throughput: 0: 5915.2. Samples: 609477236. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:34,623][25689] Avg episode reward: [(0, '-23.994')] [2022-07-10 05:53:35,003][26022] Updated weights on worker 0-0, policy_version 595189 (0.00093) [2022-07-10 05:53:36,577][26022] Updated weights on worker 0-0, policy_version 595199 (0.00086) [2022-07-10 05:53:38,717][26022] Updated weights on worker 0-0, policy_version 595209 (0.00090) [2022-07-10 05:53:39,644][25689] Fps is (10 sec: 5608.6, 60 sec: 5617.3, 300 sec: 5611.4). Total num frames: 609499136. Throughput: 0: 5050.5. Samples: 609494180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:39,646][25689] Avg episode reward: [(0, '-24.693')] [2022-07-10 05:53:40,265][26022] Updated weights on worker 0-0, policy_version 595219 (0.00086) [2022-07-10 05:53:42,260][26022] Updated weights on worker 0-0, policy_version 595229 (0.00096) [2022-07-10 05:53:44,101][26022] Updated weights on worker 0-0, policy_version 595239 (0.00088) [2022-07-10 05:53:44,667][25689] Fps is (10 sec: 5607.4, 60 sec: 5617.8, 300 sec: 5608.6). Total num frames: 609526784. Throughput: 0: 5861.7. Samples: 609527692. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:44,667][25689] Avg episode reward: [(0, '-25.524')] [2022-07-10 05:53:45,853][26022] Updated weights on worker 0-0, policy_version 595249 (0.00090) [2022-07-10 05:53:47,799][26022] Updated weights on worker 0-0, policy_version 595259 (0.00094) [2022-07-10 05:53:49,248][26022] Updated weights on worker 0-0, policy_version 595269 (0.00086) [2022-07-10 05:53:49,724][25689] Fps is (10 sec: 5688.8, 60 sec: 5606.4, 300 sec: 5614.7). Total num frames: 609556480. Throughput: 0: 5883.1. Samples: 609561646. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:49,725][25689] Avg episode reward: [(0, '-25.464')] [2022-07-10 05:53:51,419][26022] Updated weights on worker 0-0, policy_version 595279 (0.00079) [2022-07-10 05:53:53,322][26022] Updated weights on worker 0-0, policy_version 595289 (0.00084) [2022-07-10 05:53:54,741][25689] Fps is (10 sec: 5692.2, 60 sec: 5606.3, 300 sec: 5611.6). Total num frames: 609584128. Throughput: 0: 5023.5. Samples: 609578298. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:54,741][25689] Avg episode reward: [(0, '-26.683')] [2022-07-10 05:53:54,873][26022] Updated weights on worker 0-0, policy_version 595299 (0.00090) [2022-07-10 05:53:56,866][26022] Updated weights on worker 0-0, policy_version 595309 (0.00084) [2022-07-10 05:53:58,533][26022] Updated weights on worker 0-0, policy_version 595319 (0.00088) [2022-07-10 05:53:59,744][25689] Fps is (10 sec: 5621.0, 60 sec: 5606.6, 300 sec: 5618.7). Total num frames: 609612800. Throughput: 0: 5891.2. Samples: 609612592. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:53:59,745][25689] Avg episode reward: [(0, '-25.678')] [2022-07-10 05:54:00,627][26022] Updated weights on worker 0-0, policy_version 595329 (0.00086) [2022-07-10 05:54:02,299][26022] Updated weights on worker 0-0, policy_version 595339 (0.00099) [2022-07-10 05:54:04,453][26022] Updated weights on worker 0-0, policy_version 595349 (0.00093) [2022-07-10 05:54:04,746][25689] Fps is (10 sec: 5526.6, 60 sec: 5615.7, 300 sec: 5616.5). Total num frames: 609639424. Throughput: 0: 5820.7. Samples: 609644570. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:04,747][25689] Avg episode reward: [(0, '-25.445')] [2022-07-10 05:54:05,916][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:54:05,930][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000595358_609646592.pth [2022-07-10 05:54:05,930][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000593382_607623168.pth [2022-07-10 05:54:06,312][26022] Updated weights on worker 0-0, policy_version 595359 (0.00079) [2022-07-10 05:54:08,068][26022] Updated weights on worker 0-0, policy_version 595369 (0.00088) [2022-07-10 05:54:09,703][26022] Updated weights on worker 0-0, policy_version 595379 (0.00086) [2022-07-10 05:54:09,807][25689] Fps is (10 sec: 5495.2, 60 sec: 5600.4, 300 sec: 5615.4). Total num frames: 609668096. Throughput: 0: 4976.7. Samples: 609661590. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:09,807][25689] Avg episode reward: [(0, '-25.023')] [2022-07-10 05:54:11,625][26022] Updated weights on worker 0-0, policy_version 595389 (0.00090) [2022-07-10 05:54:13,393][26022] Updated weights on worker 0-0, policy_version 595399 (0.00085) [2022-07-10 05:54:14,827][25689] Fps is (10 sec: 5688.4, 60 sec: 5619.4, 300 sec: 5615.6). Total num frames: 609696768. Throughput: 0: 5850.1. Samples: 609695806. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:14,827][25689] Avg episode reward: [(0, '-25.229')] [2022-07-10 05:54:15,215][26022] Updated weights on worker 0-0, policy_version 595409 (0.00094) [2022-07-10 05:54:16,940][26022] Updated weights on worker 0-0, policy_version 595419 (0.00093) [2022-07-10 05:54:19,010][26022] Updated weights on worker 0-0, policy_version 595429 (0.00090) [2022-07-10 05:54:19,848][25689] Fps is (10 sec: 5710.9, 60 sec: 5618.0, 300 sec: 5618.7). Total num frames: 609725440. Throughput: 0: 5827.9. Samples: 609729754. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:19,848][25689] Avg episode reward: [(0, '-24.318')] [2022-07-10 05:54:20,514][26022] Updated weights on worker 0-0, policy_version 595439 (0.00090) [2022-07-10 05:54:22,583][26022] Updated weights on worker 0-0, policy_version 595449 (0.00091) [2022-07-10 05:54:24,180][26022] Updated weights on worker 0-0, policy_version 595459 (0.00095) [2022-07-10 05:54:24,863][25689] Fps is (10 sec: 5611.8, 60 sec: 5618.1, 300 sec: 5617.6). Total num frames: 609753088. Throughput: 0: 5083.6. Samples: 609746836. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:24,863][25689] Avg episode reward: [(0, '-23.234')] [2022-07-10 05:54:26,047][26022] Updated weights on worker 0-0, policy_version 595469 (0.00091) [2022-07-10 05:54:27,843][26022] Updated weights on worker 0-0, policy_version 595479 (0.00088) [2022-07-10 05:54:29,734][26022] Updated weights on worker 0-0, policy_version 595489 (0.00082) [2022-07-10 05:54:29,898][25689] Fps is (10 sec: 5603.8, 60 sec: 5621.4, 300 sec: 5614.0). Total num frames: 609781760. Throughput: 0: 5924.5. Samples: 609780622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:29,906][25689] Avg episode reward: [(0, '-24.026')] [2022-07-10 05:54:31,540][26022] Updated weights on worker 0-0, policy_version 595499 (0.00083) [2022-07-10 05:54:33,319][26022] Updated weights on worker 0-0, policy_version 595509 (0.00088) [2022-07-10 05:54:34,912][25689] Fps is (10 sec: 5706.2, 60 sec: 5638.9, 300 sec: 5621.9). Total num frames: 609810432. Throughput: 0: 5921.0. Samples: 609814730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:34,913][25689] Avg episode reward: [(0, '-25.312')] [2022-07-10 05:54:35,166][26022] Updated weights on worker 0-0, policy_version 595519 (0.00070) [2022-07-10 05:54:37,062][26022] Updated weights on worker 0-0, policy_version 595529 (0.00086) [2022-07-10 05:54:38,707][26022] Updated weights on worker 0-0, policy_version 595539 (0.00092) [2022-07-10 05:54:39,922][25689] Fps is (10 sec: 5618.8, 60 sec: 5623.1, 300 sec: 5616.0). Total num frames: 609838080. Throughput: 0: 5070.4. Samples: 609831536. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:39,922][25689] Avg episode reward: [(0, '-24.821')] [2022-07-10 05:54:40,523][26022] Updated weights on worker 0-0, policy_version 595549 (0.00096) [2022-07-10 05:54:42,344][26022] Updated weights on worker 0-0, policy_version 595559 (0.00088) [2022-07-10 05:54:44,184][26022] Updated weights on worker 0-0, policy_version 595569 (0.00090) [2022-07-10 05:54:44,931][25689] Fps is (10 sec: 5519.5, 60 sec: 5624.3, 300 sec: 5613.6). Total num frames: 609865728. Throughput: 0: 5932.0. Samples: 609865878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:44,931][25689] Avg episode reward: [(0, '-25.185')] [2022-07-10 05:54:45,986][26022] Updated weights on worker 0-0, policy_version 595579 (0.00090) [2022-07-10 05:54:47,751][26022] Updated weights on worker 0-0, policy_version 595589 (0.00093) [2022-07-10 05:54:49,633][26022] Updated weights on worker 0-0, policy_version 595599 (0.00086) [2022-07-10 05:54:50,007][25689] Fps is (10 sec: 5584.3, 60 sec: 5605.6, 300 sec: 5619.5). Total num frames: 609894400. Throughput: 0: 5922.6. Samples: 609899718. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:50,008][25689] Avg episode reward: [(0, '-25.715')] [2022-07-10 05:54:51,385][26022] Updated weights on worker 0-0, policy_version 595609 (0.00086) [2022-07-10 05:54:53,364][26022] Updated weights on worker 0-0, policy_version 595619 (0.00084) [2022-07-10 05:54:54,852][26022] Updated weights on worker 0-0, policy_version 595629 (0.00100) [2022-07-10 05:54:55,011][25689] Fps is (10 sec: 5790.1, 60 sec: 5640.7, 300 sec: 5616.6). Total num frames: 609924096. Throughput: 0: 5082.6. Samples: 609916884. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:54:55,013][25689] Avg episode reward: [(0, '-25.061')] [2022-07-10 05:54:56,815][26022] Updated weights on worker 0-0, policy_version 595639 (0.00097) [2022-07-10 05:54:58,542][26022] Updated weights on worker 0-0, policy_version 595649 (0.00090) [2022-07-10 05:55:00,014][25689] Fps is (10 sec: 5832.9, 60 sec: 5640.8, 300 sec: 5627.5). Total num frames: 609952768. Throughput: 0: 5965.4. Samples: 609951394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:00,014][25689] Avg episode reward: [(0, '-25.282')] [2022-07-10 05:55:00,271][26022] Updated weights on worker 0-0, policy_version 595659 (0.00096) [2022-07-10 05:55:02,471][26022] Updated weights on worker 0-0, policy_version 595669 (0.00089) [2022-07-10 05:55:04,231][26022] Updated weights on worker 0-0, policy_version 595679 (0.00093) [2022-07-10 05:55:05,019][25689] Fps is (10 sec: 5525.6, 60 sec: 5640.5, 300 sec: 5626.5). Total num frames: 609979392. Throughput: 0: 5854.5. Samples: 609983482. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:05,020][25689] Avg episode reward: [(0, '-24.065')] [2022-07-10 05:55:06,207][26022] Updated weights on worker 0-0, policy_version 595689 (0.00090) [2022-07-10 05:55:08,020][26022] Updated weights on worker 0-0, policy_version 595699 (0.00079) [2022-07-10 05:55:09,691][26022] Updated weights on worker 0-0, policy_version 595709 (0.00084) [2022-07-10 05:55:10,126][25689] Fps is (10 sec: 5367.2, 60 sec: 5619.2, 300 sec: 5617.8). Total num frames: 610007040. Throughput: 0: 5842.8. Samples: 610017266. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:10,126][25689] Avg episode reward: [(0, '-24.727')] [2022-07-10 05:55:11,625][26022] Updated weights on worker 0-0, policy_version 595719 (0.00093) [2022-07-10 05:55:13,191][26022] Updated weights on worker 0-0, policy_version 595729 (0.00086) [2022-07-10 05:55:15,161][25689] Fps is (10 sec: 5451.7, 60 sec: 5600.8, 300 sec: 5617.3). Total num frames: 610034688. Throughput: 0: 5838.2. Samples: 610034524. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:15,162][25689] Avg episode reward: [(0, '-24.346')] [2022-07-10 05:55:15,296][26022] Updated weights on worker 0-0, policy_version 595739 (0.00080) [2022-07-10 05:55:16,863][26022] Updated weights on worker 0-0, policy_version 595749 (0.00091) [2022-07-10 05:55:18,942][26022] Updated weights on worker 0-0, policy_version 595759 (0.00087) [2022-07-10 05:55:20,185][25689] Fps is (10 sec: 5700.6, 60 sec: 5617.5, 300 sec: 5621.6). Total num frames: 610064384. Throughput: 0: 5793.9. Samples: 610068264. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:20,186][25689] Avg episode reward: [(0, '-25.842')] [2022-07-10 05:55:20,716][26022] Updated weights on worker 0-0, policy_version 595769 (0.00091) [2022-07-10 05:55:22,577][26022] Updated weights on worker 0-0, policy_version 595779 (0.00087) [2022-07-10 05:55:24,175][26022] Updated weights on worker 0-0, policy_version 595789 (0.00082) [2022-07-10 05:55:25,196][25689] Fps is (10 sec: 5714.9, 60 sec: 5617.9, 300 sec: 5619.5). Total num frames: 610092032. Throughput: 0: 5880.5. Samples: 610102134. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:25,196][25689] Avg episode reward: [(0, '-26.233')] [2022-07-10 05:55:26,180][26022] Updated weights on worker 0-0, policy_version 595799 (0.00092) [2022-07-10 05:55:27,803][26022] Updated weights on worker 0-0, policy_version 595809 (0.00088) [2022-07-10 05:55:29,777][26022] Updated weights on worker 0-0, policy_version 595819 (0.00092) [2022-07-10 05:55:30,315][25689] Fps is (10 sec: 5559.8, 60 sec: 5610.1, 300 sec: 5617.7). Total num frames: 610120704. Throughput: 0: 5030.8. Samples: 610118834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:30,315][25689] Avg episode reward: [(0, '-26.065')] [2022-07-10 05:55:31,626][26022] Updated weights on worker 0-0, policy_version 595829 (0.00087) [2022-07-10 05:55:33,457][26022] Updated weights on worker 0-0, policy_version 595839 (0.00084) [2022-07-10 05:55:35,075][26022] Updated weights on worker 0-0, policy_version 595849 (0.00084) [2022-07-10 05:55:35,355][25689] Fps is (10 sec: 5845.9, 60 sec: 5641.6, 300 sec: 5620.7). Total num frames: 610151424. Throughput: 0: 5860.9. Samples: 610152878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:35,356][25689] Avg episode reward: [(0, '-26.064')] [2022-07-10 05:55:36,961][26022] Updated weights on worker 0-0, policy_version 595859 (0.00084) [2022-07-10 05:55:38,746][26022] Updated weights on worker 0-0, policy_version 595869 (0.00092) [2022-07-10 05:55:40,393][25689] Fps is (10 sec: 5690.0, 60 sec: 5622.0, 300 sec: 5623.8). Total num frames: 610178048. Throughput: 0: 5872.3. Samples: 610186930. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:40,393][25689] Avg episode reward: [(0, '-26.490')] [2022-07-10 05:55:40,676][26022] Updated weights on worker 0-0, policy_version 595879 (0.00112) [2022-07-10 05:55:42,277][26022] Updated weights on worker 0-0, policy_version 595889 (0.00086) [2022-07-10 05:55:44,291][26022] Updated weights on worker 0-0, policy_version 595899 (0.00084) [2022-07-10 05:55:45,420][25689] Fps is (10 sec: 5494.2, 60 sec: 5637.3, 300 sec: 5618.9). Total num frames: 610206720. Throughput: 0: 5035.7. Samples: 610203976. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:45,420][25689] Avg episode reward: [(0, '-26.153')] [2022-07-10 05:55:45,994][26022] Updated weights on worker 0-0, policy_version 595909 (0.00085) [2022-07-10 05:55:47,815][26022] Updated weights on worker 0-0, policy_version 595919 (0.00085) [2022-07-10 05:55:49,701][26022] Updated weights on worker 0-0, policy_version 595929 (0.00079) [2022-07-10 05:55:50,548][25689] Fps is (10 sec: 5646.5, 60 sec: 5632.4, 300 sec: 5620.5). Total num frames: 610235392. Throughput: 0: 5888.7. Samples: 610237984. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:50,549][25689] Avg episode reward: [(0, '-23.843')] [2022-07-10 05:55:51,588][26022] Updated weights on worker 0-0, policy_version 595939 (0.00913) [2022-07-10 05:55:53,351][26022] Updated weights on worker 0-0, policy_version 595949 (0.00085) [2022-07-10 05:55:55,121][26022] Updated weights on worker 0-0, policy_version 595959 (0.00093) [2022-07-10 05:55:55,593][25689] Fps is (10 sec: 5636.6, 60 sec: 5611.7, 300 sec: 5619.7). Total num frames: 610264064. Throughput: 0: 5876.8. Samples: 610271812. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 05:55:55,594][25689] Avg episode reward: [(0, '-23.853')] [2022-07-10 05:55:56,863][26022] Updated weights on worker 0-0, policy_version 595969 (0.00096) [2022-07-10 05:55:58,613][26022] Updated weights on worker 0-0, policy_version 595979 (0.00089) [2022-07-10 05:56:00,322][26022] Updated weights on worker 0-0, policy_version 595989 (0.00087) [2022-07-10 05:56:00,691][25689] Fps is (10 sec: 5754.7, 60 sec: 5619.7, 300 sec: 5628.6). Total num frames: 610293760. Throughput: 0: 5024.2. Samples: 610288914. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:00,692][25689] Avg episode reward: [(0, '-23.770')] [2022-07-10 05:56:02,771][26022] Updated weights on worker 0-0, policy_version 595999 (0.00099) [2022-07-10 05:56:04,481][26022] Updated weights on worker 0-0, policy_version 596009 (0.00087) [2022-07-10 05:56:05,718][25689] Fps is (10 sec: 5360.1, 60 sec: 5583.9, 300 sec: 5616.4). Total num frames: 610318336. Throughput: 0: 5770.6. Samples: 610321112. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:05,719][25689] Avg episode reward: [(0, '-24.173')] [2022-07-10 05:56:05,986][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:56:06,003][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000596017_610321408.pth [2022-07-10 05:56:06,004][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000594041_608297984.pth [2022-07-10 05:56:06,307][26022] Updated weights on worker 0-0, policy_version 596019 (0.00084) [2022-07-10 05:56:08,044][26022] Updated weights on worker 0-0, policy_version 596029 (0.00093) [2022-07-10 05:56:09,819][26022] Updated weights on worker 0-0, policy_version 596039 (0.00087) [2022-07-10 05:56:10,782][25689] Fps is (10 sec: 5378.5, 60 sec: 5621.7, 300 sec: 5619.3). Total num frames: 610348032. Throughput: 0: 5776.4. Samples: 610354860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:10,782][25689] Avg episode reward: [(0, '-24.622')] [2022-07-10 05:56:11,783][26022] Updated weights on worker 0-0, policy_version 596049 (0.00085) [2022-07-10 05:56:13,474][26022] Updated weights on worker 0-0, policy_version 596059 (0.00079) [2022-07-10 05:56:15,372][26022] Updated weights on worker 0-0, policy_version 596069 (0.00095) [2022-07-10 05:56:15,808][25689] Fps is (10 sec: 5886.4, 60 sec: 5656.4, 300 sec: 5622.6). Total num frames: 610377728. Throughput: 0: 4953.5. Samples: 610371946. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:15,809][25689] Avg episode reward: [(0, '-25.854')] [2022-07-10 05:56:17,124][26022] Updated weights on worker 0-0, policy_version 596079 (0.00360) [2022-07-10 05:56:18,979][26022] Updated weights on worker 0-0, policy_version 596089 (0.00091) [2022-07-10 05:56:20,766][26022] Updated weights on worker 0-0, policy_version 596099 (0.00077) [2022-07-10 05:56:20,827][25689] Fps is (10 sec: 5708.5, 60 sec: 5623.0, 300 sec: 5618.9). Total num frames: 610405376. Throughput: 0: 5818.7. Samples: 610406078. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:20,828][25689] Avg episode reward: [(0, '-25.641')] [2022-07-10 05:56:22,438][26022] Updated weights on worker 0-0, policy_version 596109 (0.00085) [2022-07-10 05:56:24,368][26022] Updated weights on worker 0-0, policy_version 596119 (0.00089) [2022-07-10 05:56:25,881][25689] Fps is (10 sec: 5489.8, 60 sec: 5619.1, 300 sec: 5618.8). Total num frames: 610433024. Throughput: 0: 5894.9. Samples: 610439966. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:25,881][25689] Avg episode reward: [(0, '-25.494')] [2022-07-10 05:56:26,295][26022] Updated weights on worker 0-0, policy_version 596129 (0.00050) [2022-07-10 05:56:27,939][26022] Updated weights on worker 0-0, policy_version 596139 (0.00086) [2022-07-10 05:56:29,852][26022] Updated weights on worker 0-0, policy_version 596149 (0.00087) [2022-07-10 05:56:30,970][25689] Fps is (10 sec: 5754.6, 60 sec: 5655.6, 300 sec: 5627.9). Total num frames: 610463744. Throughput: 0: 5035.8. Samples: 610456524. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:30,971][25689] Avg episode reward: [(0, '-25.788')] [2022-07-10 05:56:31,838][26022] Updated weights on worker 0-0, policy_version 596159 (0.00085) [2022-07-10 05:56:33,536][26022] Updated weights on worker 0-0, policy_version 596169 (0.00085) [2022-07-10 05:56:35,612][26022] Updated weights on worker 0-0, policy_version 596179 (0.00104) [2022-07-10 05:56:36,040][25689] Fps is (10 sec: 5442.6, 60 sec: 5551.5, 300 sec: 5609.8). Total num frames: 610488320. Throughput: 0: 5842.5. Samples: 610490152. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:36,041][25689] Avg episode reward: [(0, '-25.792')] [2022-07-10 05:56:37,087][26022] Updated weights on worker 0-0, policy_version 596189 (0.00057) [2022-07-10 05:56:39,187][26022] Updated weights on worker 0-0, policy_version 596199 (0.00096) [2022-07-10 05:56:40,791][26022] Updated weights on worker 0-0, policy_version 596209 (0.00087) [2022-07-10 05:56:41,085][25689] Fps is (10 sec: 5466.5, 60 sec: 5618.4, 300 sec: 5623.0). Total num frames: 610519040. Throughput: 0: 5828.6. Samples: 610524154. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:41,086][25689] Avg episode reward: [(0, '-24.906')] [2022-07-10 05:56:42,542][26022] Updated weights on worker 0-0, policy_version 596219 (0.00074) [2022-07-10 05:56:44,476][26022] Updated weights on worker 0-0, policy_version 596229 (0.00091) [2022-07-10 05:56:46,137][25689] Fps is (10 sec: 5882.4, 60 sec: 5616.1, 300 sec: 5621.1). Total num frames: 610547712. Throughput: 0: 5010.9. Samples: 610541468. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:46,137][25689] Avg episode reward: [(0, '-23.939')] [2022-07-10 05:56:46,167][26022] Updated weights on worker 0-0, policy_version 596239 (0.00086) [2022-07-10 05:56:48,188][26022] Updated weights on worker 0-0, policy_version 596249 (0.00101) [2022-07-10 05:56:49,983][26022] Updated weights on worker 0-0, policy_version 596259 (0.00091) [2022-07-10 05:56:51,216][25689] Fps is (10 sec: 5761.4, 60 sec: 5637.5, 300 sec: 5623.5). Total num frames: 610577408. Throughput: 0: 5873.0. Samples: 610575430. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:51,217][25689] Avg episode reward: [(0, '-23.538')] [2022-07-10 05:56:51,495][26022] Updated weights on worker 0-0, policy_version 596269 (0.00091) [2022-07-10 05:56:53,519][26022] Updated weights on worker 0-0, policy_version 596279 (0.00091) [2022-07-10 05:56:54,992][26022] Updated weights on worker 0-0, policy_version 596289 (0.00089) [2022-07-10 05:56:56,270][25689] Fps is (10 sec: 5659.1, 60 sec: 5619.8, 300 sec: 5622.7). Total num frames: 610605056. Throughput: 0: 5917.0. Samples: 610609850. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:56:56,270][25689] Avg episode reward: [(0, '-24.693')] [2022-07-10 05:56:57,003][26022] Updated weights on worker 0-0, policy_version 596299 (0.00088) [2022-07-10 05:56:58,767][26022] Updated weights on worker 0-0, policy_version 596309 (0.00085) [2022-07-10 05:57:00,531][26022] Updated weights on worker 0-0, policy_version 596319 (0.00086) [2022-07-10 05:57:01,303][25689] Fps is (10 sec: 5685.3, 60 sec: 5625.9, 300 sec: 5630.0). Total num frames: 610634752. Throughput: 0: 5080.7. Samples: 610626872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:01,303][25689] Avg episode reward: [(0, '-25.125')] [2022-07-10 05:57:02,801][26022] Updated weights on worker 0-0, policy_version 596329 (0.00085) [2022-07-10 05:57:04,598][26022] Updated weights on worker 0-0, policy_version 596339 (0.00087) [2022-07-10 05:57:06,317][25689] Fps is (10 sec: 5503.4, 60 sec: 5643.9, 300 sec: 5620.2). Total num frames: 610660352. Throughput: 0: 5827.6. Samples: 610659072. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:06,320][25689] Avg episode reward: [(0, '-24.721')] [2022-07-10 05:57:06,527][26022] Updated weights on worker 0-0, policy_version 596349 (0.00087) [2022-07-10 05:57:08,110][26022] Updated weights on worker 0-0, policy_version 596359 (0.00097) [2022-07-10 05:57:09,921][26022] Updated weights on worker 0-0, policy_version 596369 (0.00093) [2022-07-10 05:57:11,382][25689] Fps is (10 sec: 5384.6, 60 sec: 5626.9, 300 sec: 5622.6). Total num frames: 610689024. Throughput: 0: 5843.6. Samples: 610693268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:11,384][25689] Avg episode reward: [(0, '-24.829')] [2022-07-10 05:57:11,874][26022] Updated weights on worker 0-0, policy_version 596379 (0.00094) [2022-07-10 05:57:13,465][26022] Updated weights on worker 0-0, policy_version 596389 (0.00085) [2022-07-10 05:57:15,531][26022] Updated weights on worker 0-0, policy_version 596399 (0.00087) [2022-07-10 05:57:16,393][25689] Fps is (10 sec: 5691.4, 60 sec: 5611.4, 300 sec: 5622.8). Total num frames: 610717696. Throughput: 0: 5831.1. Samples: 610727190. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:16,395][25689] Avg episode reward: [(0, '-24.775')] [2022-07-10 05:57:17,197][26022] Updated weights on worker 0-0, policy_version 596409 (0.00090) [2022-07-10 05:57:19,098][26022] Updated weights on worker 0-0, policy_version 596419 (0.00088) [2022-07-10 05:57:20,738][26022] Updated weights on worker 0-0, policy_version 596429 (0.00084) [2022-07-10 05:57:21,418][25689] Fps is (10 sec: 5713.8, 60 sec: 5627.8, 300 sec: 5626.0). Total num frames: 610746368. Throughput: 0: 5829.6. Samples: 610744136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:21,418][25689] Avg episode reward: [(0, '-25.708')] [2022-07-10 05:57:22,810][26022] Updated weights on worker 0-0, policy_version 596439 (0.00091) [2022-07-10 05:57:24,368][26022] Updated weights on worker 0-0, policy_version 596449 (0.00093) [2022-07-10 05:57:26,355][26022] Updated weights on worker 0-0, policy_version 596459 (0.00085) [2022-07-10 05:57:26,435][25689] Fps is (10 sec: 5608.6, 60 sec: 5631.2, 300 sec: 5620.0). Total num frames: 610774016. Throughput: 0: 5927.8. Samples: 610778322. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:26,435][25689] Avg episode reward: [(0, '-25.193')] [2022-07-10 05:57:28,005][26022] Updated weights on worker 0-0, policy_version 596469 (0.00089) [2022-07-10 05:57:30,009][26022] Updated weights on worker 0-0, policy_version 596479 (0.00086) [2022-07-10 05:57:31,556][25689] Fps is (10 sec: 5555.3, 60 sec: 5594.5, 300 sec: 5624.7). Total num frames: 610802688. Throughput: 0: 5876.6. Samples: 610811824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:31,556][25689] Avg episode reward: [(0, '-24.364')] [2022-07-10 05:57:31,758][26022] Updated weights on worker 0-0, policy_version 596489 (0.00088) [2022-07-10 05:57:33,592][26022] Updated weights on worker 0-0, policy_version 596499 (0.00090) [2022-07-10 05:57:35,363][26022] Updated weights on worker 0-0, policy_version 596509 (0.00090) [2022-07-10 05:57:36,615][25689] Fps is (10 sec: 5632.9, 60 sec: 5663.1, 300 sec: 5621.0). Total num frames: 610831360. Throughput: 0: 5026.7. Samples: 610828840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:36,615][25689] Avg episode reward: [(0, '-24.924')] [2022-07-10 05:57:37,148][26022] Updated weights on worker 0-0, policy_version 596519 (0.00085) [2022-07-10 05:57:38,812][26022] Updated weights on worker 0-0, policy_version 596529 (0.00082) [2022-07-10 05:57:40,863][26022] Updated weights on worker 0-0, policy_version 596539 (0.00090) [2022-07-10 05:57:41,617][25689] Fps is (10 sec: 5699.4, 60 sec: 5633.3, 300 sec: 5624.9). Total num frames: 610860032. Throughput: 0: 5882.7. Samples: 610862964. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:41,618][25689] Avg episode reward: [(0, '-25.627')] [2022-07-10 05:57:42,508][26022] Updated weights on worker 0-0, policy_version 596549 (0.00081) [2022-07-10 05:57:44,414][26022] Updated weights on worker 0-0, policy_version 596559 (0.00094) [2022-07-10 05:57:46,375][26022] Updated weights on worker 0-0, policy_version 596569 (0.00096) [2022-07-10 05:57:46,694][25689] Fps is (10 sec: 5791.2, 60 sec: 5647.8, 300 sec: 5622.2). Total num frames: 610889728. Throughput: 0: 5854.9. Samples: 610896936. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:46,695][25689] Avg episode reward: [(0, '-24.801')] [2022-07-10 05:57:48,185][26022] Updated weights on worker 0-0, policy_version 596579 (0.00085) [2022-07-10 05:57:49,852][26022] Updated weights on worker 0-0, policy_version 596589 (0.00094) [2022-07-10 05:57:51,764][25689] Fps is (10 sec: 5651.5, 60 sec: 5614.9, 300 sec: 5621.2). Total num frames: 610917376. Throughput: 0: 5029.7. Samples: 610913464. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:51,764][25689] Avg episode reward: [(0, '-24.951')] [2022-07-10 05:57:51,766][26022] Updated weights on worker 0-0, policy_version 596599 (0.00090) [2022-07-10 05:57:53,499][26022] Updated weights on worker 0-0, policy_version 596609 (0.00090) [2022-07-10 05:57:55,272][26022] Updated weights on worker 0-0, policy_version 596619 (0.00086) [2022-07-10 05:57:56,844][25689] Fps is (10 sec: 5649.6, 60 sec: 5646.3, 300 sec: 5623.2). Total num frames: 610947072. Throughput: 0: 5888.5. Samples: 610947958. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:57:56,844][25689] Avg episode reward: [(0, '-24.598')] [2022-07-10 05:57:57,150][26022] Updated weights on worker 0-0, policy_version 596629 (0.00094) [2022-07-10 05:57:58,707][26022] Updated weights on worker 0-0, policy_version 596639 (0.00096) [2022-07-10 05:58:00,764][26022] Updated weights on worker 0-0, policy_version 596649 (0.00093) [2022-07-10 05:58:01,897][25689] Fps is (10 sec: 5456.6, 60 sec: 5576.7, 300 sec: 5620.7). Total num frames: 610972672. Throughput: 0: 5822.2. Samples: 610981040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:01,898][25689] Avg episode reward: [(0, '-24.436')] [2022-07-10 05:58:03,011][26022] Updated weights on worker 0-0, policy_version 596659 (0.00087) [2022-07-10 05:58:04,664][26022] Updated weights on worker 0-0, policy_version 596669 (0.00093) [2022-07-10 05:58:06,157][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 05:58:06,179][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000596677_610997248.pth [2022-07-10 05:58:06,180][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000594698_608970752.pth [2022-07-10 05:58:06,674][26022] Updated weights on worker 0-0, policy_version 596679 (0.00096) [2022-07-10 05:58:06,929][25689] Fps is (10 sec: 5280.0, 60 sec: 5609.0, 300 sec: 5614.7). Total num frames: 611000320. Throughput: 0: 4945.4. Samples: 610997006. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:06,929][25689] Avg episode reward: [(0, '-24.350')] [2022-07-10 05:58:08,266][26022] Updated weights on worker 0-0, policy_version 596689 (0.00086) [2022-07-10 05:58:10,324][26022] Updated weights on worker 0-0, policy_version 596699 (0.00089) [2022-07-10 05:58:11,975][25689] Fps is (10 sec: 5690.0, 60 sec: 5627.5, 300 sec: 5621.5). Total num frames: 611030016. Throughput: 0: 5797.9. Samples: 611030650. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:11,976][25689] Avg episode reward: [(0, '-23.768')] [2022-07-10 05:58:11,977][26022] Updated weights on worker 0-0, policy_version 596709 (0.00091) [2022-07-10 05:58:13,883][26022] Updated weights on worker 0-0, policy_version 596719 (0.00090) [2022-07-10 05:58:15,628][26022] Updated weights on worker 0-0, policy_version 596729 (0.00091) [2022-07-10 05:58:16,977][25689] Fps is (10 sec: 5706.8, 60 sec: 5611.5, 300 sec: 5618.1). Total num frames: 611057664. Throughput: 0: 5791.7. Samples: 611064564. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:16,978][25689] Avg episode reward: [(0, '-23.333')] [2022-07-10 05:58:17,456][26022] Updated weights on worker 0-0, policy_version 596739 (0.00093) [2022-07-10 05:58:19,400][26022] Updated weights on worker 0-0, policy_version 596749 (0.00085) [2022-07-10 05:58:21,215][26022] Updated weights on worker 0-0, policy_version 596759 (0.00086) [2022-07-10 05:58:22,004][25689] Fps is (10 sec: 5513.6, 60 sec: 5594.4, 300 sec: 5617.9). Total num frames: 611085312. Throughput: 0: 4982.4. Samples: 611081220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:22,005][25689] Avg episode reward: [(0, '-23.593')] [2022-07-10 05:58:22,937][26022] Updated weights on worker 0-0, policy_version 596769 (0.00091) [2022-07-10 05:58:24,905][26022] Updated weights on worker 0-0, policy_version 596779 (0.00117) [2022-07-10 05:58:26,521][26022] Updated weights on worker 0-0, policy_version 596789 (0.00084) [2022-07-10 05:58:27,009][25689] Fps is (10 sec: 5715.9, 60 sec: 5629.3, 300 sec: 5622.6). Total num frames: 611115008. Throughput: 0: 5901.1. Samples: 611115504. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:27,011][25689] Avg episode reward: [(0, '-22.691')] [2022-07-10 05:58:28,567][26022] Updated weights on worker 0-0, policy_version 596799 (0.00084) [2022-07-10 05:58:30,094][26022] Updated weights on worker 0-0, policy_version 596809 (0.00085) [2022-07-10 05:58:32,071][25689] Fps is (10 sec: 5696.6, 60 sec: 5617.9, 300 sec: 5621.8). Total num frames: 611142656. Throughput: 0: 5896.5. Samples: 611149142. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:32,071][25689] Avg episode reward: [(0, '-22.716')] [2022-07-10 05:58:32,077][26022] Updated weights on worker 0-0, policy_version 596819 (0.00092) [2022-07-10 05:58:33,966][26022] Updated weights on worker 0-0, policy_version 596829 (0.00090) [2022-07-10 05:58:35,594][26022] Updated weights on worker 0-0, policy_version 596839 (0.00088) [2022-07-10 05:58:37,088][25689] Fps is (10 sec: 5587.8, 60 sec: 5621.8, 300 sec: 5621.9). Total num frames: 611171328. Throughput: 0: 5050.7. Samples: 611166140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:37,089][25689] Avg episode reward: [(0, '-22.988')] [2022-07-10 05:58:37,526][26022] Updated weights on worker 0-0, policy_version 596849 (0.00096) [2022-07-10 05:58:39,275][26022] Updated weights on worker 0-0, policy_version 596859 (0.00087) [2022-07-10 05:58:40,897][26022] Updated weights on worker 0-0, policy_version 596869 (0.00085) [2022-07-10 05:58:42,154][25689] Fps is (10 sec: 5687.1, 60 sec: 5615.9, 300 sec: 5624.5). Total num frames: 611200000. Throughput: 0: 5901.9. Samples: 611200140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:42,155][25689] Avg episode reward: [(0, '-22.740')] [2022-07-10 05:58:43,069][26022] Updated weights on worker 0-0, policy_version 596879 (0.00370) [2022-07-10 05:58:44,586][26022] Updated weights on worker 0-0, policy_version 596889 (0.00092) [2022-07-10 05:58:46,432][26022] Updated weights on worker 0-0, policy_version 596899 (0.00087) [2022-07-10 05:58:47,241][25689] Fps is (10 sec: 5648.1, 60 sec: 5598.0, 300 sec: 5620.5). Total num frames: 611228672. Throughput: 0: 5860.9. Samples: 611234080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:47,242][25689] Avg episode reward: [(0, '-24.286')] [2022-07-10 05:58:48,414][26022] Updated weights on worker 0-0, policy_version 596909 (0.00097) [2022-07-10 05:58:50,086][26022] Updated weights on worker 0-0, policy_version 596919 (0.00091) [2022-07-10 05:58:52,227][26022] Updated weights on worker 0-0, policy_version 596929 (0.00093) [2022-07-10 05:58:52,317][25689] Fps is (10 sec: 5441.0, 60 sec: 5580.6, 300 sec: 5615.9). Total num frames: 611255296. Throughput: 0: 5012.8. Samples: 611250630. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:52,317][25689] Avg episode reward: [(0, '-23.001')] [2022-07-10 05:58:53,595][26022] Updated weights on worker 0-0, policy_version 596939 (0.00092) [2022-07-10 05:58:55,843][26022] Updated weights on worker 0-0, policy_version 596949 (0.00273) [2022-07-10 05:58:57,327][25689] Fps is (10 sec: 5584.3, 60 sec: 5587.0, 300 sec: 5619.3). Total num frames: 611284992. Throughput: 0: 5827.4. Samples: 611284076. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:58:57,327][25689] Avg episode reward: [(0, '-23.436')] [2022-07-10 05:58:57,524][26022] Updated weights on worker 0-0, policy_version 596959 (0.00088) [2022-07-10 05:58:59,351][26022] Updated weights on worker 0-0, policy_version 596969 (0.00086) [2022-07-10 05:59:01,177][26022] Updated weights on worker 0-0, policy_version 596979 (0.00088) [2022-07-10 05:59:02,355][25689] Fps is (10 sec: 5508.6, 60 sec: 5589.4, 300 sec: 5615.3). Total num frames: 611310592. Throughput: 0: 5817.3. Samples: 611317654. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 05:59:02,355][25689] Avg episode reward: [(0, '-23.957')] [2022-07-10 05:59:03,468][26022] Updated weights on worker 0-0, policy_version 596989 (0.00092) [2022-07-10 05:59:05,139][26022] Updated weights on worker 0-0, policy_version 596999 (0.00096) [2022-07-10 05:59:07,120][26022] Updated weights on worker 0-0, policy_version 597009 (0.00088) [2022-07-10 05:59:07,384][25689] Fps is (10 sec: 5294.2, 60 sec: 5589.5, 300 sec: 5612.5). Total num frames: 611338240. Throughput: 0: 4887.2. Samples: 611332524. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:07,385][25689] Avg episode reward: [(0, '-23.828')] [2022-07-10 05:59:08,848][26022] Updated weights on worker 0-0, policy_version 597019 (0.00085) [2022-07-10 05:59:10,655][26022] Updated weights on worker 0-0, policy_version 597029 (0.00088) [2022-07-10 05:59:12,458][25689] Fps is (10 sec: 5574.3, 60 sec: 5570.1, 300 sec: 5611.5). Total num frames: 611366912. Throughput: 0: 5746.3. Samples: 611366370. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:12,459][25689] Avg episode reward: [(0, '-23.467')] [2022-07-10 05:59:12,501][26022] Updated weights on worker 0-0, policy_version 597039 (0.00095) [2022-07-10 05:59:14,139][26022] Updated weights on worker 0-0, policy_version 597049 (0.00088) [2022-07-10 05:59:16,138][26022] Updated weights on worker 0-0, policy_version 597059 (0.00087) [2022-07-10 05:59:17,468][25689] Fps is (10 sec: 5788.6, 60 sec: 5603.2, 300 sec: 5615.1). Total num frames: 611396608. Throughput: 0: 5776.7. Samples: 611400424. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:17,468][25689] Avg episode reward: [(0, '-24.059')] [2022-07-10 05:59:18,047][26022] Updated weights on worker 0-0, policy_version 597069 (0.00080) [2022-07-10 05:59:19,594][26022] Updated weights on worker 0-0, policy_version 597079 (0.00091) [2022-07-10 05:59:21,787][26022] Updated weights on worker 0-0, policy_version 597089 (0.00085) [2022-07-10 05:59:22,507][25689] Fps is (10 sec: 5808.3, 60 sec: 5619.0, 300 sec: 5618.1). Total num frames: 611425280. Throughput: 0: 4946.6. Samples: 611417340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:22,508][25689] Avg episode reward: [(0, '-25.329')] [2022-07-10 05:59:23,111][26022] Updated weights on worker 0-0, policy_version 597099 (0.00084) [2022-07-10 05:59:25,115][26022] Updated weights on worker 0-0, policy_version 597109 (0.00090) [2022-07-10 05:59:26,987][26022] Updated weights on worker 0-0, policy_version 597119 (0.00084) [2022-07-10 05:59:27,513][25689] Fps is (10 sec: 5606.3, 60 sec: 5585.1, 300 sec: 5615.2). Total num frames: 611452928. Throughput: 0: 5929.7. Samples: 611451884. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:27,514][25689] Avg episode reward: [(0, '-25.832')] [2022-07-10 05:59:28,714][26022] Updated weights on worker 0-0, policy_version 597129 (0.00054) [2022-07-10 05:59:30,656][26022] Updated weights on worker 0-0, policy_version 597139 (0.00092) [2022-07-10 05:59:32,330][26022] Updated weights on worker 0-0, policy_version 597149 (0.00096) [2022-07-10 05:59:32,655][25689] Fps is (10 sec: 5549.9, 60 sec: 5594.6, 300 sec: 5612.8). Total num frames: 611481600. Throughput: 0: 5906.7. Samples: 611485668. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:32,656][25689] Avg episode reward: [(0, '-27.026')] [2022-07-10 05:59:34,166][26022] Updated weights on worker 0-0, policy_version 597159 (0.00091) [2022-07-10 05:59:36,071][26022] Updated weights on worker 0-0, policy_version 597169 (0.00085) [2022-07-10 05:59:37,697][25689] Fps is (10 sec: 5631.1, 60 sec: 5592.4, 300 sec: 5615.6). Total num frames: 611510272. Throughput: 0: 5903.1. Samples: 611519838. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:37,697][25689] Avg episode reward: [(0, '-27.452')] [2022-07-10 05:59:37,729][26022] Updated weights on worker 0-0, policy_version 597179 (0.00081) [2022-07-10 05:59:39,638][26022] Updated weights on worker 0-0, policy_version 597189 (0.00118) [2022-07-10 05:59:41,331][26022] Updated weights on worker 0-0, policy_version 597199 (0.00085) [2022-07-10 05:59:42,754][25689] Fps is (10 sec: 5678.3, 60 sec: 5593.1, 300 sec: 5618.1). Total num frames: 611538944. Throughput: 0: 5906.0. Samples: 611536918. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:42,755][25689] Avg episode reward: [(0, '-26.423')] [2022-07-10 05:59:43,165][26022] Updated weights on worker 0-0, policy_version 597209 (0.00093) [2022-07-10 05:59:44,965][26022] Updated weights on worker 0-0, policy_version 597219 (0.00085) [2022-07-10 05:59:46,940][26022] Updated weights on worker 0-0, policy_version 597229 (0.00087) [2022-07-10 05:59:47,830][25689] Fps is (10 sec: 5658.8, 60 sec: 5594.1, 300 sec: 5618.2). Total num frames: 611567616. Throughput: 0: 5856.0. Samples: 611570860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:47,832][25689] Avg episode reward: [(0, '-26.293')] [2022-07-10 05:59:48,539][26022] Updated weights on worker 0-0, policy_version 597239 (0.00094) [2022-07-10 05:59:50,513][26022] Updated weights on worker 0-0, policy_version 597249 (0.00092) [2022-07-10 05:59:52,315][26022] Updated weights on worker 0-0, policy_version 597259 (0.00088) [2022-07-10 05:59:52,895][25689] Fps is (10 sec: 5553.8, 60 sec: 5612.1, 300 sec: 5610.1). Total num frames: 611595264. Throughput: 0: 5878.6. Samples: 611604648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:52,897][25689] Avg episode reward: [(0, '-25.658')] [2022-07-10 05:59:53,973][26022] Updated weights on worker 0-0, policy_version 597269 (0.00086) [2022-07-10 05:59:55,969][26022] Updated weights on worker 0-0, policy_version 597279 (0.00086) [2022-07-10 05:59:57,594][26022] Updated weights on worker 0-0, policy_version 597289 (0.00055) [2022-07-10 05:59:57,922][25689] Fps is (10 sec: 5682.5, 60 sec: 5610.5, 300 sec: 5613.1). Total num frames: 611624960. Throughput: 0: 5033.6. Samples: 611621642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 05:59:57,932][25689] Avg episode reward: [(0, '-25.343')] [2022-07-10 05:59:59,487][26022] Updated weights on worker 0-0, policy_version 597299 (0.00085) [2022-07-10 06:00:01,557][26022] Updated weights on worker 0-0, policy_version 597309 (0.00098) [2022-07-10 06:00:02,948][25689] Fps is (10 sec: 5602.3, 60 sec: 5627.6, 300 sec: 5612.7). Total num frames: 611651584. Throughput: 0: 5865.7. Samples: 611655370. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:02,948][25689] Avg episode reward: [(0, '-24.520')] [2022-07-10 06:00:03,368][26022] Updated weights on worker 0-0, policy_version 597319 (0.00090) [2022-07-10 06:00:05,546][26022] Updated weights on worker 0-0, policy_version 597329 (0.00087) [2022-07-10 06:00:06,238][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:00:06,255][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000597334_611670016.pth [2022-07-10 06:00:06,255][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000595358_609646592.pth [2022-07-10 06:00:07,105][26022] Updated weights on worker 0-0, policy_version 597339 (0.00091) [2022-07-10 06:00:08,016][25689] Fps is (10 sec: 5376.3, 60 sec: 5623.9, 300 sec: 5613.4). Total num frames: 611679232. Throughput: 0: 5765.8. Samples: 611687250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:08,018][25689] Avg episode reward: [(0, '-24.318')] [2022-07-10 06:00:08,991][26022] Updated weights on worker 0-0, policy_version 597349 (0.00082) [2022-07-10 06:00:10,818][26022] Updated weights on worker 0-0, policy_version 597359 (0.00087) [2022-07-10 06:00:12,577][26022] Updated weights on worker 0-0, policy_version 597369 (0.00087) [2022-07-10 06:00:13,062][25689] Fps is (10 sec: 5568.3, 60 sec: 5626.5, 300 sec: 5616.7). Total num frames: 611707904. Throughput: 0: 4938.1. Samples: 611704238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:13,063][25689] Avg episode reward: [(0, '-24.604')] [2022-07-10 06:00:14,380][26022] Updated weights on worker 0-0, policy_version 597379 (0.00089) [2022-07-10 06:00:16,231][26022] Updated weights on worker 0-0, policy_version 597389 (0.00082) [2022-07-10 06:00:17,836][26022] Updated weights on worker 0-0, policy_version 597399 (0.00082) [2022-07-10 06:00:18,097][25689] Fps is (10 sec: 5790.2, 60 sec: 5624.2, 300 sec: 5616.5). Total num frames: 611737600. Throughput: 0: 5787.2. Samples: 611738400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:18,097][25689] Avg episode reward: [(0, '-24.380')] [2022-07-10 06:00:20,052][26022] Updated weights on worker 0-0, policy_version 597409 (0.00091) [2022-07-10 06:00:21,438][26022] Updated weights on worker 0-0, policy_version 597419 (0.00087) [2022-07-10 06:00:23,106][25689] Fps is (10 sec: 5505.8, 60 sec: 5576.4, 300 sec: 5609.6). Total num frames: 611763200. Throughput: 0: 5796.3. Samples: 611772210. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:23,106][25689] Avg episode reward: [(0, '-23.825')] [2022-07-10 06:00:23,568][26022] Updated weights on worker 0-0, policy_version 597429 (0.00059) [2022-07-10 06:00:25,135][26022] Updated weights on worker 0-0, policy_version 597439 (0.00083) [2022-07-10 06:00:27,220][26022] Updated weights on worker 0-0, policy_version 597449 (0.00091) [2022-07-10 06:00:28,126][25689] Fps is (10 sec: 5411.6, 60 sec: 5592.0, 300 sec: 5611.5). Total num frames: 611791872. Throughput: 0: 5057.8. Samples: 611788960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:28,126][25689] Avg episode reward: [(0, '-24.369')] [2022-07-10 06:00:28,909][26022] Updated weights on worker 0-0, policy_version 597459 (0.00084) [2022-07-10 06:00:30,932][26022] Updated weights on worker 0-0, policy_version 597469 (0.00089) [2022-07-10 06:00:32,532][26022] Updated weights on worker 0-0, policy_version 597479 (0.00395) [2022-07-10 06:00:33,196][25689] Fps is (10 sec: 5784.4, 60 sec: 5615.5, 300 sec: 5607.5). Total num frames: 611821568. Throughput: 0: 5894.9. Samples: 611822926. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:33,197][25689] Avg episode reward: [(0, '-24.252')] [2022-07-10 06:00:34,520][26022] Updated weights on worker 0-0, policy_version 597489 (0.00087) [2022-07-10 06:00:36,207][26022] Updated weights on worker 0-0, policy_version 597499 (0.00090) [2022-07-10 06:00:38,015][26022] Updated weights on worker 0-0, policy_version 597509 (0.00091) [2022-07-10 06:00:38,229][25689] Fps is (10 sec: 5777.4, 60 sec: 5616.3, 300 sec: 5614.5). Total num frames: 611850240. Throughput: 0: 5885.5. Samples: 611856886. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:38,229][25689] Avg episode reward: [(0, '-23.928')] [2022-07-10 06:00:39,989][26022] Updated weights on worker 0-0, policy_version 597519 (0.00092) [2022-07-10 06:00:41,571][26022] Updated weights on worker 0-0, policy_version 597529 (0.00090) [2022-07-10 06:00:43,270][25689] Fps is (10 sec: 5489.0, 60 sec: 5584.0, 300 sec: 5607.3). Total num frames: 611876864. Throughput: 0: 5032.8. Samples: 611873694. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:43,271][25689] Avg episode reward: [(0, '-23.689')] [2022-07-10 06:00:43,506][26022] Updated weights on worker 0-0, policy_version 597539 (0.00091) [2022-07-10 06:00:45,380][26022] Updated weights on worker 0-0, policy_version 597549 (0.00088) [2022-07-10 06:00:47,057][26022] Updated weights on worker 0-0, policy_version 597559 (0.00087) [2022-07-10 06:00:48,300][25689] Fps is (10 sec: 5592.3, 60 sec: 5605.2, 300 sec: 5612.6). Total num frames: 611906560. Throughput: 0: 5881.2. Samples: 611907608. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:48,300][25689] Avg episode reward: [(0, '-24.636')] [2022-07-10 06:00:49,064][26022] Updated weights on worker 0-0, policy_version 597569 (0.00088) [2022-07-10 06:00:50,644][26022] Updated weights on worker 0-0, policy_version 597579 (0.00087) [2022-07-10 06:00:52,672][26022] Updated weights on worker 0-0, policy_version 597589 (0.00083) [2022-07-10 06:00:53,353][25689] Fps is (10 sec: 5687.5, 60 sec: 5606.3, 300 sec: 5609.0). Total num frames: 611934208. Throughput: 0: 5878.1. Samples: 611941408. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:53,354][25689] Avg episode reward: [(0, '-24.229')] [2022-07-10 06:00:54,469][26022] Updated weights on worker 0-0, policy_version 597599 (0.00091) [2022-07-10 06:00:56,196][26022] Updated weights on worker 0-0, policy_version 597609 (0.00093) [2022-07-10 06:00:58,196][26022] Updated weights on worker 0-0, policy_version 597619 (0.00087) [2022-07-10 06:00:58,407][25689] Fps is (10 sec: 5572.4, 60 sec: 5586.8, 300 sec: 5606.4). Total num frames: 611962880. Throughput: 0: 5022.9. Samples: 611958238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:00:58,408][25689] Avg episode reward: [(0, '-22.862')] [2022-07-10 06:00:59,910][26022] Updated weights on worker 0-0, policy_version 597629 (0.00101) [2022-07-10 06:01:02,284][26022] Updated weights on worker 0-0, policy_version 597639 (0.00095) [2022-07-10 06:01:03,435][25689] Fps is (10 sec: 5484.7, 60 sec: 5586.7, 300 sec: 5613.3). Total num frames: 611989504. Throughput: 0: 5840.9. Samples: 611991472. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:03,437][25689] Avg episode reward: [(0, '-23.625')] [2022-07-10 06:01:03,898][26022] Updated weights on worker 0-0, policy_version 597649 (0.00084) [2022-07-10 06:01:05,718][26022] Updated weights on worker 0-0, policy_version 597659 (0.00091) [2022-07-10 06:01:07,663][26022] Updated weights on worker 0-0, policy_version 597669 (0.00085) [2022-07-10 06:01:08,456][25689] Fps is (10 sec: 5400.8, 60 sec: 5591.1, 300 sec: 5607.2). Total num frames: 612017152. Throughput: 0: 5780.4. Samples: 612024118. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:08,457][25689] Avg episode reward: [(0, '-24.210')] [2022-07-10 06:01:09,204][26022] Updated weights on worker 0-0, policy_version 597679 (0.00087) [2022-07-10 06:01:11,167][26022] Updated weights on worker 0-0, policy_version 597689 (0.00088) [2022-07-10 06:01:13,023][26022] Updated weights on worker 0-0, policy_version 597699 (0.00082) [2022-07-10 06:01:13,579][25689] Fps is (10 sec: 5652.5, 60 sec: 5600.8, 300 sec: 5605.4). Total num frames: 612046848. Throughput: 0: 4928.1. Samples: 612041088. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:13,581][25689] Avg episode reward: [(0, '-25.536')] [2022-07-10 06:01:14,726][26022] Updated weights on worker 0-0, policy_version 597709 (0.00085) [2022-07-10 06:01:16,694][26022] Updated weights on worker 0-0, policy_version 597719 (0.00091) [2022-07-10 06:01:18,407][26022] Updated weights on worker 0-0, policy_version 597729 (0.00082) [2022-07-10 06:01:18,609][25689] Fps is (10 sec: 5748.8, 60 sec: 5584.4, 300 sec: 5608.6). Total num frames: 612075520. Throughput: 0: 5782.0. Samples: 612075046. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:18,609][25689] Avg episode reward: [(0, '-24.295')] [2022-07-10 06:01:20,203][26022] Updated weights on worker 0-0, policy_version 597739 (0.00084) [2022-07-10 06:01:21,999][26022] Updated weights on worker 0-0, policy_version 597749 (0.00087) [2022-07-10 06:01:23,611][25689] Fps is (10 sec: 5716.3, 60 sec: 5635.7, 300 sec: 5613.0). Total num frames: 612104192. Throughput: 0: 5847.1. Samples: 612109446. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:23,613][25689] Avg episode reward: [(0, '-25.186')] [2022-07-10 06:01:23,716][26022] Updated weights on worker 0-0, policy_version 597759 (0.00094) [2022-07-10 06:01:25,684][26022] Updated weights on worker 0-0, policy_version 597769 (0.00082) [2022-07-10 06:01:27,453][26022] Updated weights on worker 0-0, policy_version 597779 (0.00083) [2022-07-10 06:01:28,655][25689] Fps is (10 sec: 5606.4, 60 sec: 5616.6, 300 sec: 5603.6). Total num frames: 612131840. Throughput: 0: 5050.3. Samples: 612126128. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:28,655][25689] Avg episode reward: [(0, '-25.538')] [2022-07-10 06:01:29,303][26022] Updated weights on worker 0-0, policy_version 597789 (0.00087) [2022-07-10 06:01:31,076][26022] Updated weights on worker 0-0, policy_version 597799 (0.00087) [2022-07-10 06:01:32,906][26022] Updated weights on worker 0-0, policy_version 597809 (0.00087) [2022-07-10 06:01:33,755][25689] Fps is (10 sec: 5451.3, 60 sec: 5580.1, 300 sec: 5613.3). Total num frames: 612159488. Throughput: 0: 5883.4. Samples: 612159788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:33,755][25689] Avg episode reward: [(0, '-24.903')] [2022-07-10 06:01:34,741][26022] Updated weights on worker 0-0, policy_version 597819 (0.00094) [2022-07-10 06:01:36,644][26022] Updated weights on worker 0-0, policy_version 597829 (0.00092) [2022-07-10 06:01:38,403][26022] Updated weights on worker 0-0, policy_version 597839 (0.00087) [2022-07-10 06:01:38,797][25689] Fps is (10 sec: 5653.9, 60 sec: 5596.1, 300 sec: 5609.9). Total num frames: 612189184. Throughput: 0: 5875.2. Samples: 612193656. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:38,797][25689] Avg episode reward: [(0, '-23.573')] [2022-07-10 06:01:40,367][26022] Updated weights on worker 0-0, policy_version 597849 (0.00097) [2022-07-10 06:01:42,097][26022] Updated weights on worker 0-0, policy_version 597859 (0.00090) [2022-07-10 06:01:43,804][25689] Fps is (10 sec: 5706.3, 60 sec: 5616.2, 300 sec: 5607.3). Total num frames: 612216832. Throughput: 0: 5006.2. Samples: 612210536. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:43,806][25689] Avg episode reward: [(0, '-23.679')] [2022-07-10 06:01:43,919][26022] Updated weights on worker 0-0, policy_version 597869 (0.00088) [2022-07-10 06:01:45,596][26022] Updated weights on worker 0-0, policy_version 597879 (0.00090) [2022-07-10 06:01:47,585][26022] Updated weights on worker 0-0, policy_version 597889 (0.00093) [2022-07-10 06:01:48,808][25689] Fps is (10 sec: 5625.6, 60 sec: 5601.6, 300 sec: 5605.3). Total num frames: 612245504. Throughput: 0: 5857.9. Samples: 612244188. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:48,809][25689] Avg episode reward: [(0, '-23.586')] [2022-07-10 06:01:49,381][26022] Updated weights on worker 0-0, policy_version 597899 (0.00090) [2022-07-10 06:01:51,318][26022] Updated weights on worker 0-0, policy_version 597909 (0.00086) [2022-07-10 06:01:52,928][26022] Updated weights on worker 0-0, policy_version 597919 (0.00088) [2022-07-10 06:01:53,867][25689] Fps is (10 sec: 5698.8, 60 sec: 5618.0, 300 sec: 5608.7). Total num frames: 612274176. Throughput: 0: 5876.3. Samples: 612277970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:53,868][25689] Avg episode reward: [(0, '-23.580')] [2022-07-10 06:01:54,870][26022] Updated weights on worker 0-0, policy_version 597929 (0.00088) [2022-07-10 06:01:56,728][26022] Updated weights on worker 0-0, policy_version 597939 (0.00086) [2022-07-10 06:01:58,708][26022] Updated weights on worker 0-0, policy_version 597949 (0.00090) [2022-07-10 06:01:58,889][25689] Fps is (10 sec: 5485.6, 60 sec: 5587.1, 300 sec: 5598.5). Total num frames: 612300800. Throughput: 0: 5022.3. Samples: 612294562. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:01:58,889][25689] Avg episode reward: [(0, '-23.488')] [2022-07-10 06:02:00,335][26022] Updated weights on worker 0-0, policy_version 597959 (0.00088) [2022-07-10 06:02:02,578][26022] Updated weights on worker 0-0, policy_version 597969 (0.00090) [2022-07-10 06:02:03,892][25689] Fps is (10 sec: 5311.3, 60 sec: 5589.4, 300 sec: 5602.2). Total num frames: 612327424. Throughput: 0: 5785.6. Samples: 612326756. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:02:03,893][25689] Avg episode reward: [(0, '-23.619')] [2022-07-10 06:02:04,492][26022] Updated weights on worker 0-0, policy_version 597979 (0.00090) [2022-07-10 06:02:06,194][26022] Updated weights on worker 0-0, policy_version 597989 (0.00088) [2022-07-10 06:02:06,284][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:02:06,298][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000597990_612341760.pth [2022-07-10 06:02:06,298][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000596017_610321408.pth [2022-07-10 06:02:08,196][26022] Updated weights on worker 0-0, policy_version 597999 (0.00081) [2022-07-10 06:02:08,921][25689] Fps is (10 sec: 5511.5, 60 sec: 5605.6, 300 sec: 5602.9). Total num frames: 612356096. Throughput: 0: 5755.2. Samples: 612359944. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:02:08,922][25689] Avg episode reward: [(0, '-24.694')] [2022-07-10 06:02:09,770][26022] Updated weights on worker 0-0, policy_version 598009 (0.00087) [2022-07-10 06:02:11,682][26022] Updated weights on worker 0-0, policy_version 598019 (0.00090) [2022-07-10 06:02:13,405][26022] Updated weights on worker 0-0, policy_version 598029 (0.00085) [2022-07-10 06:02:13,988][25689] Fps is (10 sec: 5679.8, 60 sec: 5593.9, 300 sec: 5601.8). Total num frames: 612384768. Throughput: 0: 4913.3. Samples: 612376834. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:13,990][25689] Avg episode reward: [(0, '-25.901')] [2022-07-10 06:02:15,555][26022] Updated weights on worker 0-0, policy_version 598039 (0.00092) [2022-07-10 06:02:17,129][26022] Updated weights on worker 0-0, policy_version 598049 (0.00087) [2022-07-10 06:02:18,916][26022] Updated weights on worker 0-0, policy_version 598059 (0.00087) [2022-07-10 06:02:19,012][25689] Fps is (10 sec: 5581.3, 60 sec: 5577.4, 300 sec: 5598.4). Total num frames: 612412416. Throughput: 0: 5780.9. Samples: 612410896. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:19,013][25689] Avg episode reward: [(0, '-26.849')] [2022-07-10 06:02:20,611][26022] Updated weights on worker 0-0, policy_version 598069 (0.00423) [2022-07-10 06:02:22,521][26022] Updated weights on worker 0-0, policy_version 598079 (0.00049) [2022-07-10 06:02:24,063][25689] Fps is (10 sec: 5590.1, 60 sec: 5572.9, 300 sec: 5601.2). Total num frames: 612441088. Throughput: 0: 5853.3. Samples: 612444824. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:24,064][25689] Avg episode reward: [(0, '-26.611')] [2022-07-10 06:02:24,302][26022] Updated weights on worker 0-0, policy_version 598089 (0.00084) [2022-07-10 06:02:26,223][26022] Updated weights on worker 0-0, policy_version 598099 (0.00091) [2022-07-10 06:02:28,154][26022] Updated weights on worker 0-0, policy_version 598109 (0.00092) [2022-07-10 06:02:29,153][25689] Fps is (10 sec: 5654.9, 60 sec: 5585.6, 300 sec: 5601.8). Total num frames: 612469760. Throughput: 0: 5871.5. Samples: 612478734. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:29,153][25689] Avg episode reward: [(0, '-26.159')] [2022-07-10 06:02:29,670][26022] Updated weights on worker 0-0, policy_version 598119 (0.00049) [2022-07-10 06:02:31,631][26022] Updated weights on worker 0-0, policy_version 598129 (0.00097) [2022-07-10 06:02:33,514][26022] Updated weights on worker 0-0, policy_version 598139 (0.00086) [2022-07-10 06:02:34,217][25689] Fps is (10 sec: 5546.8, 60 sec: 5589.0, 300 sec: 5598.2). Total num frames: 612497408. Throughput: 0: 5880.9. Samples: 612495798. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:34,217][25689] Avg episode reward: [(0, '-24.664')] [2022-07-10 06:02:35,382][26022] Updated weights on worker 0-0, policy_version 598149 (0.00086) [2022-07-10 06:02:36,969][26022] Updated weights on worker 0-0, policy_version 598159 (0.00096) [2022-07-10 06:02:39,019][26022] Updated weights on worker 0-0, policy_version 598169 (0.00092) [2022-07-10 06:02:39,297][25689] Fps is (10 sec: 5552.0, 60 sec: 5568.5, 300 sec: 5596.8). Total num frames: 612526080. Throughput: 0: 5868.3. Samples: 612529932. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:39,298][25689] Avg episode reward: [(0, '-25.003')] [2022-07-10 06:02:40,649][26022] Updated weights on worker 0-0, policy_version 598179 (0.00093) [2022-07-10 06:02:42,527][26022] Updated weights on worker 0-0, policy_version 598189 (0.00095) [2022-07-10 06:02:44,131][26022] Updated weights on worker 0-0, policy_version 598199 (0.00093) [2022-07-10 06:02:44,365][25689] Fps is (10 sec: 5852.7, 60 sec: 5613.7, 300 sec: 5600.4). Total num frames: 612556800. Throughput: 0: 5878.2. Samples: 612564160. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:44,365][25689] Avg episode reward: [(0, '-23.217')] [2022-07-10 06:02:45,946][26022] Updated weights on worker 0-0, policy_version 598209 (0.00087) [2022-07-10 06:02:47,917][26022] Updated weights on worker 0-0, policy_version 598219 (0.00092) [2022-07-10 06:02:49,400][25689] Fps is (10 sec: 5777.1, 60 sec: 5593.9, 300 sec: 5601.0). Total num frames: 612584448. Throughput: 0: 5061.8. Samples: 612581222. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:49,401][25689] Avg episode reward: [(0, '-22.010')] [2022-07-10 06:02:49,539][26022] Updated weights on worker 0-0, policy_version 598229 (0.00084) [2022-07-10 06:02:51,566][26022] Updated weights on worker 0-0, policy_version 598239 (0.00086) [2022-07-10 06:02:53,155][26022] Updated weights on worker 0-0, policy_version 598249 (0.00081) [2022-07-10 06:02:54,464][25689] Fps is (10 sec: 5576.8, 60 sec: 5593.4, 300 sec: 5597.9). Total num frames: 612613120. Throughput: 0: 5896.3. Samples: 612615180. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:54,464][25689] Avg episode reward: [(0, '-22.888')] [2022-07-10 06:02:55,065][26022] Updated weights on worker 0-0, policy_version 598259 (0.00090) [2022-07-10 06:02:56,862][26022] Updated weights on worker 0-0, policy_version 598269 (0.00091) [2022-07-10 06:02:58,704][26022] Updated weights on worker 0-0, policy_version 598279 (0.00084) [2022-07-10 06:02:59,476][25689] Fps is (10 sec: 5589.5, 60 sec: 5611.2, 300 sec: 5605.6). Total num frames: 612640768. Throughput: 0: 5908.2. Samples: 612649156. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:02:59,477][25689] Avg episode reward: [(0, '-23.194')] [2022-07-10 06:03:00,465][26022] Updated weights on worker 0-0, policy_version 598289 (0.00094) [2022-07-10 06:03:02,916][26022] Updated weights on worker 0-0, policy_version 598299 (0.00087) [2022-07-10 06:03:04,466][26022] Updated weights on worker 0-0, policy_version 598309 (0.00088) [2022-07-10 06:03:04,559][25689] Fps is (10 sec: 5477.6, 60 sec: 5620.8, 300 sec: 5604.6). Total num frames: 612668416. Throughput: 0: 4941.3. Samples: 612663944. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:04,559][25689] Avg episode reward: [(0, '-23.206')] [2022-07-10 06:03:06,559][26022] Updated weights on worker 0-0, policy_version 598319 (0.00100) [2022-07-10 06:03:07,928][26022] Updated weights on worker 0-0, policy_version 598329 (0.00091) [2022-07-10 06:03:09,615][25689] Fps is (10 sec: 5554.8, 60 sec: 5618.2, 300 sec: 5601.0). Total num frames: 612697088. Throughput: 0: 5775.5. Samples: 612697974. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:09,616][25689] Avg episode reward: [(0, '-23.997')] [2022-07-10 06:03:10,008][26022] Updated weights on worker 0-0, policy_version 598339 (0.00088) [2022-07-10 06:03:11,723][26022] Updated weights on worker 0-0, policy_version 598349 (0.00109) [2022-07-10 06:03:13,635][26022] Updated weights on worker 0-0, policy_version 598359 (0.00086) [2022-07-10 06:03:14,727][25689] Fps is (10 sec: 5639.3, 60 sec: 5614.0, 300 sec: 5602.3). Total num frames: 612725760. Throughput: 0: 5766.6. Samples: 612732032. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:14,728][25689] Avg episode reward: [(0, '-25.264')] [2022-07-10 06:03:15,422][26022] Updated weights on worker 0-0, policy_version 598369 (0.00086) [2022-07-10 06:03:17,215][26022] Updated weights on worker 0-0, policy_version 598379 (0.00095) [2022-07-10 06:03:18,912][26022] Updated weights on worker 0-0, policy_version 598389 (0.00090) [2022-07-10 06:03:19,748][25689] Fps is (10 sec: 5760.3, 60 sec: 5648.1, 300 sec: 5609.3). Total num frames: 612755456. Throughput: 0: 4927.3. Samples: 612749042. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:19,749][25689] Avg episode reward: [(0, '-24.741')] [2022-07-10 06:03:20,887][26022] Updated weights on worker 0-0, policy_version 598399 (0.00090) [2022-07-10 06:03:22,492][26022] Updated weights on worker 0-0, policy_version 598409 (0.00087) [2022-07-10 06:03:24,508][26022] Updated weights on worker 0-0, policy_version 598419 (0.00086) [2022-07-10 06:03:24,778][25689] Fps is (10 sec: 5705.2, 60 sec: 5633.1, 300 sec: 5602.0). Total num frames: 612783104. Throughput: 0: 5911.0. Samples: 612783464. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:24,779][25689] Avg episode reward: [(0, '-24.945')] [2022-07-10 06:03:25,948][26022] Updated weights on worker 0-0, policy_version 598429 (0.00096) [2022-07-10 06:03:28,039][26022] Updated weights on worker 0-0, policy_version 598439 (0.00093) [2022-07-10 06:03:29,649][26022] Updated weights on worker 0-0, policy_version 598449 (0.00085) [2022-07-10 06:03:29,804][25689] Fps is (10 sec: 5600.6, 60 sec: 5639.1, 300 sec: 5606.1). Total num frames: 612811776. Throughput: 0: 5914.9. Samples: 612817388. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:29,805][25689] Avg episode reward: [(0, '-25.157')] [2022-07-10 06:03:31,793][26022] Updated weights on worker 0-0, policy_version 598459 (0.00096) [2022-07-10 06:03:33,331][26022] Updated weights on worker 0-0, policy_version 598469 (0.00084) [2022-07-10 06:03:34,927][25689] Fps is (10 sec: 5549.4, 60 sec: 5633.6, 300 sec: 5600.7). Total num frames: 612839424. Throughput: 0: 5065.3. Samples: 612834352. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:34,929][25689] Avg episode reward: [(0, '-24.751')] [2022-07-10 06:03:35,111][26022] Updated weights on worker 0-0, policy_version 598479 (0.00088) [2022-07-10 06:03:36,886][26022] Updated weights on worker 0-0, policy_version 598489 (0.00087) [2022-07-10 06:03:38,833][26022] Updated weights on worker 0-0, policy_version 598499 (0.00092) [2022-07-10 06:03:39,950][25689] Fps is (10 sec: 5652.0, 60 sec: 5655.8, 300 sec: 5604.9). Total num frames: 612869120. Throughput: 0: 5940.3. Samples: 612869046. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:39,952][25689] Avg episode reward: [(0, '-23.354')] [2022-07-10 06:03:40,564][26022] Updated weights on worker 0-0, policy_version 598509 (0.00089) [2022-07-10 06:03:42,369][26022] Updated weights on worker 0-0, policy_version 598519 (0.00090) [2022-07-10 06:03:44,141][26022] Updated weights on worker 0-0, policy_version 598529 (0.00092) [2022-07-10 06:03:44,983][25689] Fps is (10 sec: 5906.5, 60 sec: 5642.2, 300 sec: 5609.4). Total num frames: 612898816. Throughput: 0: 5936.9. Samples: 612903414. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:44,983][25689] Avg episode reward: [(0, '-22.804')] [2022-07-10 06:03:45,791][26022] Updated weights on worker 0-0, policy_version 598539 (0.00090) [2022-07-10 06:03:47,719][26022] Updated weights on worker 0-0, policy_version 598549 (0.00082) [2022-07-10 06:03:49,611][26022] Updated weights on worker 0-0, policy_version 598559 (0.00093) [2022-07-10 06:03:50,008][25689] Fps is (10 sec: 5701.1, 60 sec: 5643.1, 300 sec: 5613.8). Total num frames: 612926464. Throughput: 0: 5105.5. Samples: 612920540. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:50,009][25689] Avg episode reward: [(0, '-23.250')] [2022-07-10 06:03:51,415][26022] Updated weights on worker 0-0, policy_version 598569 (0.00102) [2022-07-10 06:03:53,307][26022] Updated weights on worker 0-0, policy_version 598579 (0.00090) [2022-07-10 06:03:55,068][25689] Fps is (10 sec: 5584.5, 60 sec: 5643.5, 300 sec: 5609.4). Total num frames: 612955136. Throughput: 0: 5940.6. Samples: 612953998. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:03:55,068][25689] Avg episode reward: [(0, '-23.614')] [2022-07-10 06:03:55,079][26022] Updated weights on worker 0-0, policy_version 598589 (0.00094) [2022-07-10 06:03:56,832][26022] Updated weights on worker 0-0, policy_version 598599 (0.00092) [2022-07-10 06:03:58,697][26022] Updated weights on worker 0-0, policy_version 598609 (0.00082) [2022-07-10 06:04:00,124][25689] Fps is (10 sec: 5668.9, 60 sec: 5656.3, 300 sec: 5619.2). Total num frames: 612983808. Throughput: 0: 5889.6. Samples: 612987862. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:00,124][25689] Avg episode reward: [(0, '-23.779')] [2022-07-10 06:04:00,444][26022] Updated weights on worker 0-0, policy_version 598619 (0.00091) [2022-07-10 06:04:02,759][26022] Updated weights on worker 0-0, policy_version 598629 (0.00091) [2022-07-10 06:04:04,490][26022] Updated weights on worker 0-0, policy_version 598639 (0.00087) [2022-07-10 06:04:05,156][25689] Fps is (10 sec: 5379.9, 60 sec: 5627.2, 300 sec: 5612.3). Total num frames: 613009408. Throughput: 0: 4987.2. Samples: 613004020. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:05,156][25689] Avg episode reward: [(0, '-24.687')] [2022-07-10 06:04:06,233][26022] Updated weights on worker 0-0, policy_version 598649 (0.00086) [2022-07-10 06:04:06,530][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:04:06,544][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000598650_613017600.pth [2022-07-10 06:04:06,544][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000596677_610997248.pth [2022-07-10 06:04:08,206][26022] Updated weights on worker 0-0, policy_version 598659 (0.00089) [2022-07-10 06:04:09,811][26022] Updated weights on worker 0-0, policy_version 598669 (0.00094) [2022-07-10 06:04:10,157][25689] Fps is (10 sec: 5409.1, 60 sec: 5632.3, 300 sec: 5613.6). Total num frames: 613038080. Throughput: 0: 5767.8. Samples: 613036756. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:10,158][25689] Avg episode reward: [(0, '-24.664')] [2022-07-10 06:04:11,749][26022] Updated weights on worker 0-0, policy_version 598679 (0.00086) [2022-07-10 06:04:13,560][26022] Updated weights on worker 0-0, policy_version 598689 (0.00096) [2022-07-10 06:04:15,301][25689] Fps is (10 sec: 5551.2, 60 sec: 5612.5, 300 sec: 5604.2). Total num frames: 613065728. Throughput: 0: 5753.3. Samples: 613070408. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:15,302][25689] Avg episode reward: [(0, '-24.353')] [2022-07-10 06:04:15,496][26022] Updated weights on worker 0-0, policy_version 598699 (0.00090) [2022-07-10 06:04:17,082][26022] Updated weights on worker 0-0, policy_version 598709 (0.00090) [2022-07-10 06:04:18,998][26022] Updated weights on worker 0-0, policy_version 598719 (0.00095) [2022-07-10 06:04:20,366][25689] Fps is (10 sec: 5717.5, 60 sec: 5625.3, 300 sec: 5610.6). Total num frames: 613096448. Throughput: 0: 5765.9. Samples: 613104576. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:20,367][25689] Avg episode reward: [(0, '-24.384')] [2022-07-10 06:04:20,876][26022] Updated weights on worker 0-0, policy_version 598729 (0.00090) [2022-07-10 06:04:22,729][26022] Updated weights on worker 0-0, policy_version 598739 (0.00084) [2022-07-10 06:04:24,408][26022] Updated weights on worker 0-0, policy_version 598749 (0.00091) [2022-07-10 06:04:25,382][25689] Fps is (10 sec: 5790.1, 60 sec: 5626.6, 300 sec: 5610.4). Total num frames: 613124096. Throughput: 0: 5814.0. Samples: 613121614. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:25,382][25689] Avg episode reward: [(0, '-24.228')] [2022-07-10 06:04:26,328][26022] Updated weights on worker 0-0, policy_version 598759 (0.00086) [2022-07-10 06:04:28,109][26022] Updated weights on worker 0-0, policy_version 598769 (0.00091) [2022-07-10 06:04:30,140][26022] Updated weights on worker 0-0, policy_version 598779 (0.00091) [2022-07-10 06:04:30,444][25689] Fps is (10 sec: 5486.7, 60 sec: 5606.3, 300 sec: 5608.5). Total num frames: 613151744. Throughput: 0: 5833.5. Samples: 613155098. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:30,445][25689] Avg episode reward: [(0, '-23.488')] [2022-07-10 06:04:31,525][26022] Updated weights on worker 0-0, policy_version 598789 (0.00089) [2022-07-10 06:04:33,815][26022] Updated weights on worker 0-0, policy_version 598799 (0.00105) [2022-07-10 06:04:35,132][26022] Updated weights on worker 0-0, policy_version 598809 (0.00084) [2022-07-10 06:04:35,508][25689] Fps is (10 sec: 5663.0, 60 sec: 5645.7, 300 sec: 5611.5). Total num frames: 613181440. Throughput: 0: 5846.4. Samples: 613188544. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:35,508][25689] Avg episode reward: [(0, '-23.733')] [2022-07-10 06:04:37,428][26022] Updated weights on worker 0-0, policy_version 598819 (0.00090) [2022-07-10 06:04:39,095][26022] Updated weights on worker 0-0, policy_version 598830 (0.00088) [2022-07-10 06:04:40,514][25689] Fps is (10 sec: 5593.0, 60 sec: 5596.5, 300 sec: 5605.6). Total num frames: 613208064. Throughput: 0: 5015.9. Samples: 613205636. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:40,514][25689] Avg episode reward: [(0, '-24.227')] [2022-07-10 06:04:41,123][26022] Updated weights on worker 0-0, policy_version 598840 (0.00088) [2022-07-10 06:04:42,911][26022] Updated weights on worker 0-0, policy_version 598850 (0.00081) [2022-07-10 06:04:44,776][26022] Updated weights on worker 0-0, policy_version 598860 (0.00090) [2022-07-10 06:04:45,541][25689] Fps is (10 sec: 5511.2, 60 sec: 5580.1, 300 sec: 5606.5). Total num frames: 613236736. Throughput: 0: 5873.5. Samples: 613240020. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:45,542][25689] Avg episode reward: [(0, '-24.015')] [2022-07-10 06:04:46,340][26022] Updated weights on worker 0-0, policy_version 598870 (0.00088) [2022-07-10 06:04:48,376][26022] Updated weights on worker 0-0, policy_version 598880 (0.00083) [2022-07-10 06:04:49,935][26022] Updated weights on worker 0-0, policy_version 598890 (0.00082) [2022-07-10 06:04:50,547][25689] Fps is (10 sec: 5817.6, 60 sec: 5615.7, 300 sec: 5614.5). Total num frames: 613266432. Throughput: 0: 5921.4. Samples: 613274134. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:50,547][25689] Avg episode reward: [(0, '-24.787')] [2022-07-10 06:04:51,888][26022] Updated weights on worker 0-0, policy_version 598900 (0.00089) [2022-07-10 06:04:53,715][26022] Updated weights on worker 0-0, policy_version 598910 (0.00090) [2022-07-10 06:04:55,451][26022] Updated weights on worker 0-0, policy_version 598920 (0.00085) [2022-07-10 06:04:55,594][25689] Fps is (10 sec: 5806.3, 60 sec: 5616.9, 300 sec: 5610.7). Total num frames: 613295104. Throughput: 0: 5106.9. Samples: 613291120. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:04:55,594][25689] Avg episode reward: [(0, '-24.272')] [2022-07-10 06:04:57,364][26022] Updated weights on worker 0-0, policy_version 598930 (0.00095) [2022-07-10 06:04:59,063][26022] Updated weights on worker 0-0, policy_version 598940 (0.00080) [2022-07-10 06:05:00,601][25689] Fps is (10 sec: 5601.7, 60 sec: 5604.5, 300 sec: 5614.5). Total num frames: 613322752. Throughput: 0: 5944.2. Samples: 613325038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:05:00,602][25689] Avg episode reward: [(0, '-25.317')] [2022-07-10 06:05:00,934][26022] Updated weights on worker 0-0, policy_version 598950 (0.00973) [2022-07-10 06:05:03,058][26022] Updated weights on worker 0-0, policy_version 598960 (0.00096) [2022-07-10 06:05:04,894][26022] Updated weights on worker 0-0, policy_version 598970 (0.00086) [2022-07-10 06:05:05,625][25689] Fps is (10 sec: 5308.1, 60 sec: 5605.2, 300 sec: 5608.4). Total num frames: 613348352. Throughput: 0: 5827.4. Samples: 613357058. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:05:05,626][25689] Avg episode reward: [(0, '-25.562')] [2022-07-10 06:05:06,786][26022] Updated weights on worker 0-0, policy_version 598980 (0.00085) [2022-07-10 06:05:08,635][26022] Updated weights on worker 0-0, policy_version 598990 (0.00094) [2022-07-10 06:05:10,445][26022] Updated weights on worker 0-0, policy_version 599000 (0.00090) [2022-07-10 06:05:10,655][25689] Fps is (10 sec: 5398.1, 60 sec: 5602.6, 300 sec: 5608.7). Total num frames: 613377024. Throughput: 0: 4961.6. Samples: 613373902. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:05:10,656][25689] Avg episode reward: [(0, '-25.023')] [2022-07-10 06:05:12,085][26022] Updated weights on worker 0-0, policy_version 599010 (0.00080) [2022-07-10 06:05:14,047][26022] Updated weights on worker 0-0, policy_version 599020 (0.00089) [2022-07-10 06:05:15,635][26022] Updated weights on worker 0-0, policy_version 599030 (0.00088) [2022-07-10 06:05:15,715][25689] Fps is (10 sec: 5785.0, 60 sec: 5644.3, 300 sec: 5608.3). Total num frames: 613406720. Throughput: 0: 5812.2. Samples: 613408068. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 06:05:15,715][25689] Avg episode reward: [(0, '-24.655')] [2022-07-10 06:05:17,602][26022] Updated weights on worker 0-0, policy_version 599040 (0.00092) [2022-07-10 06:05:19,422][26022] Updated weights on worker 0-0, policy_version 599050 (0.00091) [2022-07-10 06:05:20,725][25689] Fps is (10 sec: 5593.0, 60 sec: 5581.6, 300 sec: 5611.7). Total num frames: 613433344. Throughput: 0: 5824.2. Samples: 613442242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:20,725][25689] Avg episode reward: [(0, '-24.762')] [2022-07-10 06:05:21,201][26022] Updated weights on worker 0-0, policy_version 599060 (0.00090) [2022-07-10 06:05:22,954][26022] Updated weights on worker 0-0, policy_version 599070 (0.00083) [2022-07-10 06:05:24,898][26022] Updated weights on worker 0-0, policy_version 599080 (0.00084) [2022-07-10 06:05:25,739][25689] Fps is (10 sec: 5618.5, 60 sec: 5615.7, 300 sec: 5615.2). Total num frames: 613463040. Throughput: 0: 5080.1. Samples: 613459236. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:25,739][25689] Avg episode reward: [(0, '-24.772')] [2022-07-10 06:05:26,515][26022] Updated weights on worker 0-0, policy_version 599090 (0.00086) [2022-07-10 06:05:28,464][26022] Updated weights on worker 0-0, policy_version 599100 (0.00112) [2022-07-10 06:05:30,149][26022] Updated weights on worker 0-0, policy_version 599110 (0.00088) [2022-07-10 06:05:30,741][25689] Fps is (10 sec: 5622.6, 60 sec: 5604.2, 300 sec: 5606.2). Total num frames: 613489664. Throughput: 0: 5937.9. Samples: 613493172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:30,742][25689] Avg episode reward: [(0, '-23.857')] [2022-07-10 06:05:32,443][26022] Updated weights on worker 0-0, policy_version 599120 (0.00088) [2022-07-10 06:05:33,902][26022] Updated weights on worker 0-0, policy_version 599130 (0.00083) [2022-07-10 06:05:35,800][25689] Fps is (10 sec: 5496.1, 60 sec: 5587.8, 300 sec: 5605.7). Total num frames: 613518336. Throughput: 0: 5915.7. Samples: 613526884. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:35,800][25689] Avg episode reward: [(0, '-23.681')] [2022-07-10 06:05:35,981][26022] Updated weights on worker 0-0, policy_version 599140 (0.00089) [2022-07-10 06:05:37,344][26022] Updated weights on worker 0-0, policy_version 599150 (0.00091) [2022-07-10 06:05:39,486][26022] Updated weights on worker 0-0, policy_version 599160 (0.00088) [2022-07-10 06:05:40,808][25689] Fps is (10 sec: 5798.4, 60 sec: 5638.5, 300 sec: 5616.7). Total num frames: 613548032. Throughput: 0: 5066.0. Samples: 613543982. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:40,808][25689] Avg episode reward: [(0, '-23.842')] [2022-07-10 06:05:41,173][26022] Updated weights on worker 0-0, policy_version 599170 (0.00090) [2022-07-10 06:05:42,964][26022] Updated weights on worker 0-0, policy_version 599180 (0.00091) [2022-07-10 06:05:44,745][26022] Updated weights on worker 0-0, policy_version 599190 (0.00095) [2022-07-10 06:05:45,845][25689] Fps is (10 sec: 5708.7, 60 sec: 5620.6, 300 sec: 5609.6). Total num frames: 613575680. Throughput: 0: 5902.7. Samples: 613577916. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:45,846][25689] Avg episode reward: [(0, '-24.922')] [2022-07-10 06:05:46,588][26022] Updated weights on worker 0-0, policy_version 599200 (0.00088) [2022-07-10 06:05:48,451][26022] Updated weights on worker 0-0, policy_version 599210 (0.00087) [2022-07-10 06:05:50,251][26022] Updated weights on worker 0-0, policy_version 599220 (0.00096) [2022-07-10 06:05:50,869][25689] Fps is (10 sec: 5699.5, 60 sec: 5618.9, 300 sec: 5617.1). Total num frames: 613605376. Throughput: 0: 5892.8. Samples: 613611780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:50,870][25689] Avg episode reward: [(0, '-24.238')] [2022-07-10 06:05:52,206][26022] Updated weights on worker 0-0, policy_version 599230 (0.00095) [2022-07-10 06:05:53,744][26022] Updated weights on worker 0-0, policy_version 599240 (0.00086) [2022-07-10 06:05:55,784][26022] Updated weights on worker 0-0, policy_version 599250 (0.00090) [2022-07-10 06:05:55,923][25689] Fps is (10 sec: 5690.1, 60 sec: 5601.3, 300 sec: 5613.6). Total num frames: 613633024. Throughput: 0: 5053.0. Samples: 613628566. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:05:55,923][25689] Avg episode reward: [(0, '-24.277')] [2022-07-10 06:05:57,439][26022] Updated weights on worker 0-0, policy_version 599260 (0.00083) [2022-07-10 06:05:59,333][26022] Updated weights on worker 0-0, policy_version 599270 (0.00085) [2022-07-10 06:06:00,934][25689] Fps is (10 sec: 5494.0, 60 sec: 5600.9, 300 sec: 5617.4). Total num frames: 613660672. Throughput: 0: 5888.9. Samples: 613662502. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:00,934][25689] Avg episode reward: [(0, '-24.639')] [2022-07-10 06:06:01,173][26022] Updated weights on worker 0-0, policy_version 599280 (0.00086) [2022-07-10 06:06:03,475][26022] Updated weights on worker 0-0, policy_version 599290 (0.00095) [2022-07-10 06:06:05,178][26022] Updated weights on worker 0-0, policy_version 599300 (0.00089) [2022-07-10 06:06:05,963][25689] Fps is (10 sec: 5405.6, 60 sec: 5617.5, 300 sec: 5613.8). Total num frames: 613687296. Throughput: 0: 5788.8. Samples: 613694374. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:05,963][25689] Avg episode reward: [(0, '-24.163')] [2022-07-10 06:06:06,692][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:06:06,709][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000599308_613691392.pth [2022-07-10 06:06:06,710][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000597334_611670016.pth [2022-07-10 06:06:06,991][26022] Updated weights on worker 0-0, policy_version 599310 (0.00087) [2022-07-10 06:06:08,755][26022] Updated weights on worker 0-0, policy_version 599320 (0.00081) [2022-07-10 06:06:10,735][26022] Updated weights on worker 0-0, policy_version 599330 (0.00094) [2022-07-10 06:06:10,983][25689] Fps is (10 sec: 5400.8, 60 sec: 5601.4, 300 sec: 5608.9). Total num frames: 613714944. Throughput: 0: 4938.6. Samples: 613711114. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:10,983][25689] Avg episode reward: [(0, '-23.880')] [2022-07-10 06:06:12,598][26022] Updated weights on worker 0-0, policy_version 599340 (0.00089) [2022-07-10 06:06:14,262][26022] Updated weights on worker 0-0, policy_version 599350 (0.00088) [2022-07-10 06:06:16,067][25689] Fps is (10 sec: 5675.3, 60 sec: 5599.1, 300 sec: 5611.3). Total num frames: 613744640. Throughput: 0: 5781.1. Samples: 613745022. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:16,069][26022] Updated weights on worker 0-0, policy_version 599360 (0.00085) [2022-07-10 06:06:16,067][25689] Avg episode reward: [(0, '-23.914')] [2022-07-10 06:06:17,995][26022] Updated weights on worker 0-0, policy_version 599370 (0.00089) [2022-07-10 06:06:19,732][26022] Updated weights on worker 0-0, policy_version 599380 (0.00086) [2022-07-10 06:06:21,134][25689] Fps is (10 sec: 5649.1, 60 sec: 5610.8, 300 sec: 5606.6). Total num frames: 613772288. Throughput: 0: 5763.2. Samples: 613778918. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:21,134][25689] Avg episode reward: [(0, '-23.765')] [2022-07-10 06:06:21,672][26022] Updated weights on worker 0-0, policy_version 599390 (0.00085) [2022-07-10 06:06:23,125][26022] Updated weights on worker 0-0, policy_version 599400 (0.00084) [2022-07-10 06:06:25,319][26022] Updated weights on worker 0-0, policy_version 599410 (0.00086) [2022-07-10 06:06:26,218][25689] Fps is (10 sec: 5649.1, 60 sec: 5604.3, 300 sec: 5612.7). Total num frames: 613801984. Throughput: 0: 5019.2. Samples: 613796038. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:26,220][25689] Avg episode reward: [(0, '-23.346')] [2022-07-10 06:06:26,904][26022] Updated weights on worker 0-0, policy_version 599420 (0.00090) [2022-07-10 06:06:28,831][26022] Updated weights on worker 0-0, policy_version 599430 (0.00084) [2022-07-10 06:06:30,750][26022] Updated weights on worker 0-0, policy_version 599440 (0.00085) [2022-07-10 06:06:31,229][25689] Fps is (10 sec: 5579.0, 60 sec: 5603.6, 300 sec: 5611.0). Total num frames: 613828608. Throughput: 0: 5861.2. Samples: 613829780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:31,229][25689] Avg episode reward: [(0, '-23.461')] [2022-07-10 06:06:32,235][26022] Updated weights on worker 0-0, policy_version 599450 (0.00106) [2022-07-10 06:06:34,445][26022] Updated weights on worker 0-0, policy_version 599460 (0.00091) [2022-07-10 06:06:35,983][26022] Updated weights on worker 0-0, policy_version 599470 (0.00092) [2022-07-10 06:06:36,365][25689] Fps is (10 sec: 5550.2, 60 sec: 5613.2, 300 sec: 5609.2). Total num frames: 613858304. Throughput: 0: 5844.5. Samples: 613863656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:36,366][25689] Avg episode reward: [(0, '-22.574')] [2022-07-10 06:06:37,801][26022] Updated weights on worker 0-0, policy_version 599480 (0.00084) [2022-07-10 06:06:39,667][26022] Updated weights on worker 0-0, policy_version 599490 (0.00089) [2022-07-10 06:06:41,438][25689] Fps is (10 sec: 5616.8, 60 sec: 5573.5, 300 sec: 5608.0). Total num frames: 613885952. Throughput: 0: 5004.4. Samples: 613880528. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:41,438][25689] Avg episode reward: [(0, '-22.200')] [2022-07-10 06:06:41,537][26022] Updated weights on worker 0-0, policy_version 599500 (0.00090) [2022-07-10 06:06:43,256][26022] Updated weights on worker 0-0, policy_version 599510 (0.00080) [2022-07-10 06:06:45,290][26022] Updated weights on worker 0-0, policy_version 599520 (0.00100) [2022-07-10 06:06:46,445][25689] Fps is (10 sec: 5587.4, 60 sec: 5593.1, 300 sec: 5607.9). Total num frames: 613914624. Throughput: 0: 5869.2. Samples: 613914756. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:46,446][25689] Avg episode reward: [(0, '-22.714')] [2022-07-10 06:06:46,943][26022] Updated weights on worker 0-0, policy_version 599530 (0.00091) [2022-07-10 06:06:48,861][26022] Updated weights on worker 0-0, policy_version 599540 (0.00086) [2022-07-10 06:06:50,398][26022] Updated weights on worker 0-0, policy_version 599550 (0.00094) [2022-07-10 06:06:51,518][25689] Fps is (10 sec: 5790.6, 60 sec: 5588.6, 300 sec: 5611.1). Total num frames: 613944320. Throughput: 0: 5859.5. Samples: 613948664. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:51,518][25689] Avg episode reward: [(0, '-22.918')] [2022-07-10 06:06:52,603][26022] Updated weights on worker 0-0, policy_version 599560 (0.00400) [2022-07-10 06:06:54,240][26022] Updated weights on worker 0-0, policy_version 599570 (0.00091) [2022-07-10 06:06:56,252][26022] Updated weights on worker 0-0, policy_version 599580 (0.00088) [2022-07-10 06:06:56,572][25689] Fps is (10 sec: 5662.6, 60 sec: 5588.6, 300 sec: 5613.9). Total num frames: 613971968. Throughput: 0: 5032.4. Samples: 613965344. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:06:56,572][25689] Avg episode reward: [(0, '-23.413')] [2022-07-10 06:06:57,774][26022] Updated weights on worker 0-0, policy_version 599590 (0.00087) [2022-07-10 06:06:59,738][26022] Updated weights on worker 0-0, policy_version 599600 (0.00090) [2022-07-10 06:07:01,508][26022] Updated weights on worker 0-0, policy_version 599610 (0.00095) [2022-07-10 06:07:01,592][25689] Fps is (10 sec: 5590.6, 60 sec: 5604.7, 300 sec: 5620.5). Total num frames: 614000640. Throughput: 0: 5884.6. Samples: 613999126. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:01,592][25689] Avg episode reward: [(0, '-23.084')] [2022-07-10 06:07:03,869][26022] Updated weights on worker 0-0, policy_version 599620 (0.00085) [2022-07-10 06:07:05,588][26022] Updated weights on worker 0-0, policy_version 599630 (0.00089) [2022-07-10 06:07:06,610][25689] Fps is (10 sec: 5508.8, 60 sec: 5605.7, 300 sec: 5613.8). Total num frames: 614027264. Throughput: 0: 5758.7. Samples: 614030878. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:06,610][25689] Avg episode reward: [(0, '-23.094')] [2022-07-10 06:07:07,288][26022] Updated weights on worker 0-0, policy_version 599640 (0.00086) [2022-07-10 06:07:09,301][26022] Updated weights on worker 0-0, policy_version 599650 (0.00094) [2022-07-10 06:07:10,874][26022] Updated weights on worker 0-0, policy_version 599660 (0.00091) [2022-07-10 06:07:11,626][25689] Fps is (10 sec: 5306.5, 60 sec: 5589.1, 300 sec: 5607.9). Total num frames: 614053888. Throughput: 0: 4931.7. Samples: 614047834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:11,627][25689] Avg episode reward: [(0, '-22.651')] [2022-07-10 06:07:12,829][26022] Updated weights on worker 0-0, policy_version 599670 (0.00423) [2022-07-10 06:07:14,696][26022] Updated weights on worker 0-0, policy_version 599680 (0.00089) [2022-07-10 06:07:16,407][26022] Updated weights on worker 0-0, policy_version 599690 (0.00085) [2022-07-10 06:07:16,693][25689] Fps is (10 sec: 5687.1, 60 sec: 5607.6, 300 sec: 5617.4). Total num frames: 614084608. Throughput: 0: 5795.7. Samples: 614081960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:16,693][25689] Avg episode reward: [(0, '-24.105')] [2022-07-10 06:07:18,372][26022] Updated weights on worker 0-0, policy_version 599700 (0.00090) [2022-07-10 06:07:20,031][26022] Updated weights on worker 0-0, policy_version 599710 (0.00093) [2022-07-10 06:07:21,696][25689] Fps is (10 sec: 5796.3, 60 sec: 5613.5, 300 sec: 5614.9). Total num frames: 614112256. Throughput: 0: 5806.0. Samples: 614115852. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:21,697][25689] Avg episode reward: [(0, '-25.487')] [2022-07-10 06:07:21,988][26022] Updated weights on worker 0-0, policy_version 599720 (0.00099) [2022-07-10 06:07:23,679][26022] Updated weights on worker 0-0, policy_version 599730 (0.00091) [2022-07-10 06:07:25,387][26022] Updated weights on worker 0-0, policy_version 599740 (0.00097) [2022-07-10 06:07:26,708][25689] Fps is (10 sec: 5521.3, 60 sec: 5586.4, 300 sec: 5612.9). Total num frames: 614139904. Throughput: 0: 5078.2. Samples: 614132940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:26,708][25689] Avg episode reward: [(0, '-24.852')] [2022-07-10 06:07:27,299][26022] Updated weights on worker 0-0, policy_version 599750 (0.00055) [2022-07-10 06:07:29,275][26022] Updated weights on worker 0-0, policy_version 599760 (0.00099) [2022-07-10 06:07:30,858][26022] Updated weights on worker 0-0, policy_version 599770 (0.00089) [2022-07-10 06:07:31,736][25689] Fps is (10 sec: 5609.5, 60 sec: 5618.6, 300 sec: 5617.0). Total num frames: 614168576. Throughput: 0: 5933.5. Samples: 614167156. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:31,737][25689] Avg episode reward: [(0, '-25.408')] [2022-07-10 06:07:32,880][26022] Updated weights on worker 0-0, policy_version 599780 (0.00088) [2022-07-10 06:07:34,461][26022] Updated weights on worker 0-0, policy_version 599790 (0.00048) [2022-07-10 06:07:36,327][26022] Updated weights on worker 0-0, policy_version 599800 (0.00346) [2022-07-10 06:07:36,829][25689] Fps is (10 sec: 5665.5, 60 sec: 5605.7, 300 sec: 5616.8). Total num frames: 614197248. Throughput: 0: 5915.6. Samples: 614201080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:36,830][25689] Avg episode reward: [(0, '-25.811')] [2022-07-10 06:07:38,119][26022] Updated weights on worker 0-0, policy_version 599810 (0.00093) [2022-07-10 06:07:39,912][26022] Updated weights on worker 0-0, policy_version 599820 (0.00083) [2022-07-10 06:07:41,791][26022] Updated weights on worker 0-0, policy_version 599830 (0.00084) [2022-07-10 06:07:41,831][25689] Fps is (10 sec: 5680.4, 60 sec: 5629.2, 300 sec: 5611.1). Total num frames: 614225920. Throughput: 0: 5932.4. Samples: 614235302. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:41,832][25689] Avg episode reward: [(0, '-24.673')] [2022-07-10 06:07:43,481][26022] Updated weights on worker 0-0, policy_version 599840 (0.00090) [2022-07-10 06:07:45,327][26022] Updated weights on worker 0-0, policy_version 599850 (0.00089) [2022-07-10 06:07:46,848][25689] Fps is (10 sec: 5723.5, 60 sec: 5628.3, 300 sec: 5614.9). Total num frames: 614254592. Throughput: 0: 5930.7. Samples: 614252388. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:46,849][25689] Avg episode reward: [(0, '-24.201')] [2022-07-10 06:07:47,296][26022] Updated weights on worker 0-0, policy_version 599860 (0.00090) [2022-07-10 06:07:48,751][26022] Updated weights on worker 0-0, policy_version 599870 (0.00087) [2022-07-10 06:07:50,946][26022] Updated weights on worker 0-0, policy_version 599880 (0.00082) [2022-07-10 06:07:51,857][25689] Fps is (10 sec: 5719.5, 60 sec: 5617.3, 300 sec: 5615.9). Total num frames: 614283264. Throughput: 0: 5937.3. Samples: 614286622. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:51,858][25689] Avg episode reward: [(0, '-23.684')] [2022-07-10 06:07:52,584][26022] Updated weights on worker 0-0, policy_version 599890 (0.00090) [2022-07-10 06:07:54,303][26022] Updated weights on worker 0-0, policy_version 599900 (0.00092) [2022-07-10 06:07:56,088][26022] Updated weights on worker 0-0, policy_version 599910 (0.00091) [2022-07-10 06:07:56,939][25689] Fps is (10 sec: 5683.0, 60 sec: 5631.7, 300 sec: 5618.1). Total num frames: 614311936. Throughput: 0: 5936.1. Samples: 614320452. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:07:56,939][25689] Avg episode reward: [(0, '-23.382')] [2022-07-10 06:07:58,152][26022] Updated weights on worker 0-0, policy_version 599920 (0.00092) [2022-07-10 06:07:59,673][26022] Updated weights on worker 0-0, policy_version 599930 (0.00088) [2022-07-10 06:08:01,941][25689] Fps is (10 sec: 5382.2, 60 sec: 5582.5, 300 sec: 5612.7). Total num frames: 614337536. Throughput: 0: 5090.0. Samples: 614337660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:01,941][25689] Avg episode reward: [(0, '-24.108')] [2022-07-10 06:08:02,365][26022] Updated weights on worker 0-0, policy_version 599940 (0.00091) [2022-07-10 06:08:03,760][26022] Updated weights on worker 0-0, policy_version 599950 (0.00086) [2022-07-10 06:08:05,794][26022] Updated weights on worker 0-0, policy_version 599960 (0.00089) [2022-07-10 06:08:06,781][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:08:06,794][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000599967_614366208.pth [2022-07-10 06:08:06,794][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000597990_612341760.pth [2022-07-10 06:08:06,945][25689] Fps is (10 sec: 5525.9, 60 sec: 5634.6, 300 sec: 5617.1). Total num frames: 614367232. Throughput: 0: 5817.8. Samples: 614369308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:06,946][25689] Avg episode reward: [(0, '-24.469')] [2022-07-10 06:08:07,375][26022] Updated weights on worker 0-0, policy_version 599970 (0.00091) [2022-07-10 06:08:09,371][26022] Updated weights on worker 0-0, policy_version 599980 (0.00096) [2022-07-10 06:08:10,967][26022] Updated weights on worker 0-0, policy_version 599990 (0.00091) [2022-07-10 06:08:11,994][25689] Fps is (10 sec: 5704.1, 60 sec: 5648.6, 300 sec: 5614.9). Total num frames: 614394880. Throughput: 0: 5787.7. Samples: 614403166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:11,995][25689] Avg episode reward: [(0, '-24.024')] [2022-07-10 06:08:13,015][26022] Updated weights on worker 0-0, policy_version 600000 (0.00092) [2022-07-10 06:08:14,760][26022] Updated weights on worker 0-0, policy_version 600010 (0.00086) [2022-07-10 06:08:16,579][26022] Updated weights on worker 0-0, policy_version 600020 (0.00081) [2022-07-10 06:08:17,043][25689] Fps is (10 sec: 5577.8, 60 sec: 5616.3, 300 sec: 5610.9). Total num frames: 614423552. Throughput: 0: 4955.9. Samples: 614420084. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:17,043][25689] Avg episode reward: [(0, '-24.069')] [2022-07-10 06:08:18,183][26022] Updated weights on worker 0-0, policy_version 600030 (0.01313) [2022-07-10 06:08:20,139][26022] Updated weights on worker 0-0, policy_version 600040 (0.00092) [2022-07-10 06:08:21,999][26022] Updated weights on worker 0-0, policy_version 600050 (0.00082) [2022-07-10 06:08:22,064][25689] Fps is (10 sec: 5592.8, 60 sec: 5614.6, 300 sec: 5611.1). Total num frames: 614451200. Throughput: 0: 5797.7. Samples: 614454328. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:22,065][25689] Avg episode reward: [(0, '-23.913')] [2022-07-10 06:08:23,466][26022] Updated weights on worker 0-0, policy_version 600060 (0.00093) [2022-07-10 06:08:25,617][26022] Updated weights on worker 0-0, policy_version 600070 (0.00082) [2022-07-10 06:08:27,092][25689] Fps is (10 sec: 5706.1, 60 sec: 5647.0, 300 sec: 5614.5). Total num frames: 614480896. Throughput: 0: 5910.0. Samples: 614488376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:27,094][25689] Avg episode reward: [(0, '-24.976')] [2022-07-10 06:08:27,219][26022] Updated weights on worker 0-0, policy_version 600080 (0.00085) [2022-07-10 06:08:29,108][26022] Updated weights on worker 0-0, policy_version 600090 (0.00092) [2022-07-10 06:08:30,968][26022] Updated weights on worker 0-0, policy_version 600100 (0.00091) [2022-07-10 06:08:32,105][25689] Fps is (10 sec: 5711.2, 60 sec: 5631.5, 300 sec: 5616.6). Total num frames: 614508544. Throughput: 0: 5073.8. Samples: 614505204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:32,106][25689] Avg episode reward: [(0, '-24.228')] [2022-07-10 06:08:32,829][26022] Updated weights on worker 0-0, policy_version 600110 (0.00088) [2022-07-10 06:08:34,469][26022] Updated weights on worker 0-0, policy_version 600120 (0.00086) [2022-07-10 06:08:36,497][26022] Updated weights on worker 0-0, policy_version 600130 (0.00086) [2022-07-10 06:08:37,241][25689] Fps is (10 sec: 5549.7, 60 sec: 5627.6, 300 sec: 5611.0). Total num frames: 614537216. Throughput: 0: 5897.9. Samples: 614539208. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:37,250][25689] Avg episode reward: [(0, '-24.436')] [2022-07-10 06:08:38,060][26022] Updated weights on worker 0-0, policy_version 600140 (0.00091) [2022-07-10 06:08:40,049][26022] Updated weights on worker 0-0, policy_version 600150 (0.00085) [2022-07-10 06:08:41,851][26022] Updated weights on worker 0-0, policy_version 600160 (0.00090) [2022-07-10 06:08:42,264][25689] Fps is (10 sec: 5644.4, 60 sec: 5625.5, 300 sec: 5607.7). Total num frames: 614565888. Throughput: 0: 5897.3. Samples: 614573454. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:42,266][25689] Avg episode reward: [(0, '-23.831')] [2022-07-10 06:08:43,423][26022] Updated weights on worker 0-0, policy_version 600170 (0.00084) [2022-07-10 06:08:45,437][26022] Updated weights on worker 0-0, policy_version 600180 (0.00083) [2022-07-10 06:08:47,363][25689] Fps is (10 sec: 5564.0, 60 sec: 5601.1, 300 sec: 5606.4). Total num frames: 614593536. Throughput: 0: 5042.2. Samples: 614590584. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:47,364][25689] Avg episode reward: [(0, '-23.781')] [2022-07-10 06:08:47,383][26022] Updated weights on worker 0-0, policy_version 600190 (0.00082) [2022-07-10 06:08:48,845][26022] Updated weights on worker 0-0, policy_version 600200 (0.00090) [2022-07-10 06:08:50,882][26022] Updated weights on worker 0-0, policy_version 600210 (0.00100) [2022-07-10 06:08:52,381][25689] Fps is (10 sec: 5667.9, 60 sec: 5617.1, 300 sec: 5610.6). Total num frames: 614623232. Throughput: 0: 5892.0. Samples: 614624676. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:52,382][25689] Avg episode reward: [(0, '-23.252')] [2022-07-10 06:08:52,646][26022] Updated weights on worker 0-0, policy_version 600220 (0.00095) [2022-07-10 06:08:54,589][26022] Updated weights on worker 0-0, policy_version 600230 (0.00093) [2022-07-10 06:08:56,430][26022] Updated weights on worker 0-0, policy_version 600240 (0.00095) [2022-07-10 06:08:57,464][25689] Fps is (10 sec: 5778.1, 60 sec: 5616.9, 300 sec: 5610.1). Total num frames: 614651904. Throughput: 0: 5892.6. Samples: 614658378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:08:57,465][25689] Avg episode reward: [(0, '-22.606')] [2022-07-10 06:08:58,002][26022] Updated weights on worker 0-0, policy_version 600250 (0.00087) [2022-07-10 06:09:00,036][26022] Updated weights on worker 0-0, policy_version 600260 (0.00091) [2022-07-10 06:09:01,626][26022] Updated weights on worker 0-0, policy_version 600270 (0.00086) [2022-07-10 06:09:02,507][25689] Fps is (10 sec: 5461.0, 60 sec: 5630.1, 300 sec: 5613.3). Total num frames: 614678528. Throughput: 0: 5036.6. Samples: 614675410. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:02,508][25689] Avg episode reward: [(0, '-23.153')] [2022-07-10 06:09:04,007][26022] Updated weights on worker 0-0, policy_version 600280 (0.00336) [2022-07-10 06:09:05,848][26022] Updated weights on worker 0-0, policy_version 600290 (0.00090) [2022-07-10 06:09:07,579][25689] Fps is (10 sec: 5365.8, 60 sec: 5590.1, 300 sec: 5608.5). Total num frames: 614706176. Throughput: 0: 5749.2. Samples: 614706810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:07,579][25689] Avg episode reward: [(0, '-23.832')] [2022-07-10 06:09:07,619][26022] Updated weights on worker 0-0, policy_version 600300 (0.00093) [2022-07-10 06:09:09,404][26022] Updated weights on worker 0-0, policy_version 600310 (0.00090) [2022-07-10 06:09:11,284][26022] Updated weights on worker 0-0, policy_version 600320 (0.00093) [2022-07-10 06:09:12,639][25689] Fps is (10 sec: 5558.6, 60 sec: 5605.9, 300 sec: 5613.5). Total num frames: 614734848. Throughput: 0: 5727.1. Samples: 614740694. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:12,640][25689] Avg episode reward: [(0, '-24.919')] [2022-07-10 06:09:13,223][26022] Updated weights on worker 0-0, policy_version 600330 (0.00080) [2022-07-10 06:09:14,909][26022] Updated weights on worker 0-0, policy_version 600340 (0.00082) [2022-07-10 06:09:16,812][26022] Updated weights on worker 0-0, policy_version 600350 (0.00090) [2022-07-10 06:09:17,711][25689] Fps is (10 sec: 5760.5, 60 sec: 5620.6, 300 sec: 5610.0). Total num frames: 614764544. Throughput: 0: 4913.7. Samples: 614757864. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:17,712][25689] Avg episode reward: [(0, '-27.037')] [2022-07-10 06:09:18,478][26022] Updated weights on worker 0-0, policy_version 600360 (0.00088) [2022-07-10 06:09:20,367][26022] Updated weights on worker 0-0, policy_version 600370 (0.00095) [2022-07-10 06:09:22,134][26022] Updated weights on worker 0-0, policy_version 600380 (0.00321) [2022-07-10 06:09:22,754][25689] Fps is (10 sec: 5770.4, 60 sec: 5635.4, 300 sec: 5612.9). Total num frames: 614793216. Throughput: 0: 5749.9. Samples: 614791830. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:22,755][25689] Avg episode reward: [(0, '-27.199')] [2022-07-10 06:09:23,958][26022] Updated weights on worker 0-0, policy_version 600390 (0.00095) [2022-07-10 06:09:25,695][26022] Updated weights on worker 0-0, policy_version 600400 (0.00085) [2022-07-10 06:09:27,465][26022] Updated weights on worker 0-0, policy_version 600410 (0.00084) [2022-07-10 06:09:27,812][25689] Fps is (10 sec: 5575.6, 60 sec: 5598.9, 300 sec: 5613.0). Total num frames: 614820864. Throughput: 0: 5881.3. Samples: 614825810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:27,813][25689] Avg episode reward: [(0, '-26.952')] [2022-07-10 06:09:29,350][26022] Updated weights on worker 0-0, policy_version 600420 (0.00088) [2022-07-10 06:09:31,248][26022] Updated weights on worker 0-0, policy_version 600430 (0.00079) [2022-07-10 06:09:32,872][25689] Fps is (10 sec: 5566.3, 60 sec: 5611.4, 300 sec: 5609.6). Total num frames: 614849536. Throughput: 0: 5055.8. Samples: 614842990. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:32,873][25689] Avg episode reward: [(0, '-26.373')] [2022-07-10 06:09:33,059][26022] Updated weights on worker 0-0, policy_version 600440 (0.00087) [2022-07-10 06:09:34,964][26022] Updated weights on worker 0-0, policy_version 600450 (0.00094) [2022-07-10 06:09:36,546][26022] Updated weights on worker 0-0, policy_version 600460 (0.00091) [2022-07-10 06:09:37,947][25689] Fps is (10 sec: 5557.3, 60 sec: 5600.2, 300 sec: 5611.8). Total num frames: 614877184. Throughput: 0: 5872.6. Samples: 614876700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:37,947][25689] Avg episode reward: [(0, '-26.132')] [2022-07-10 06:09:38,460][26022] Updated weights on worker 0-0, policy_version 600470 (0.00093) [2022-07-10 06:09:40,284][26022] Updated weights on worker 0-0, policy_version 600480 (0.00090) [2022-07-10 06:09:42,180][26022] Updated weights on worker 0-0, policy_version 600490 (0.00090) [2022-07-10 06:09:42,953][25689] Fps is (10 sec: 5688.3, 60 sec: 5618.7, 300 sec: 5615.6). Total num frames: 614906880. Throughput: 0: 5884.3. Samples: 614910688. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:42,954][25689] Avg episode reward: [(0, '-25.084')] [2022-07-10 06:09:43,783][26022] Updated weights on worker 0-0, policy_version 600500 (0.00094) [2022-07-10 06:09:45,659][26022] Updated weights on worker 0-0, policy_version 600510 (0.00085) [2022-07-10 06:09:47,520][26022] Updated weights on worker 0-0, policy_version 600520 (0.00089) [2022-07-10 06:09:47,964][25689] Fps is (10 sec: 5826.5, 60 sec: 5643.7, 300 sec: 5612.0). Total num frames: 614935552. Throughput: 0: 5061.7. Samples: 614927814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:47,966][25689] Avg episode reward: [(0, '-24.681')] [2022-07-10 06:09:49,338][26022] Updated weights on worker 0-0, policy_version 600530 (0.00083) [2022-07-10 06:09:51,152][26022] Updated weights on worker 0-0, policy_version 600540 (0.00093) [2022-07-10 06:09:52,976][25689] Fps is (10 sec: 5517.1, 60 sec: 5593.6, 300 sec: 5605.8). Total num frames: 614962176. Throughput: 0: 5900.2. Samples: 614961606. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:52,978][25689] Avg episode reward: [(0, '-25.061')] [2022-07-10 06:09:53,002][26022] Updated weights on worker 0-0, policy_version 600550 (0.00087) [2022-07-10 06:09:54,801][26022] Updated weights on worker 0-0, policy_version 600560 (0.00086) [2022-07-10 06:09:56,658][26022] Updated weights on worker 0-0, policy_version 600570 (0.00086) [2022-07-10 06:09:58,022][25689] Fps is (10 sec: 5497.8, 60 sec: 5597.0, 300 sec: 5608.5). Total num frames: 614990848. Throughput: 0: 5928.7. Samples: 614995722. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:09:58,023][25689] Avg episode reward: [(0, '-24.068')] [2022-07-10 06:09:58,311][26022] Updated weights on worker 0-0, policy_version 600580 (0.00083) [2022-07-10 06:10:00,216][26022] Updated weights on worker 0-0, policy_version 600590 (0.00085) [2022-07-10 06:10:02,193][26022] Updated weights on worker 0-0, policy_version 600600 (0.00080) [2022-07-10 06:10:03,061][25689] Fps is (10 sec: 5483.1, 60 sec: 5597.4, 300 sec: 5611.7). Total num frames: 615017472. Throughput: 0: 5811.7. Samples: 615027546. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:03,061][25689] Avg episode reward: [(0, '-24.241')] [2022-07-10 06:10:04,454][26022] Updated weights on worker 0-0, policy_version 600610 (0.00087) [2022-07-10 06:10:06,223][26022] Updated weights on worker 0-0, policy_version 600620 (0.00088) [2022-07-10 06:10:06,882][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:10:06,897][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000600626_615041024.pth [2022-07-10 06:10:06,898][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000598650_613017600.pth [2022-07-10 06:10:07,710][26022] Updated weights on worker 0-0, policy_version 600630 (0.00086) [2022-07-10 06:10:08,065][25689] Fps is (10 sec: 5506.2, 60 sec: 5620.6, 300 sec: 5612.2). Total num frames: 615046144. Throughput: 0: 5805.8. Samples: 615044512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:08,065][25689] Avg episode reward: [(0, '-24.989')] [2022-07-10 06:10:09,685][26022] Updated weights on worker 0-0, policy_version 600640 (0.00089) [2022-07-10 06:10:11,445][26022] Updated weights on worker 0-0, policy_version 600650 (0.00088) [2022-07-10 06:10:13,066][25689] Fps is (10 sec: 5628.8, 60 sec: 5609.1, 300 sec: 5606.4). Total num frames: 615073792. Throughput: 0: 5816.9. Samples: 615078470. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:13,067][25689] Avg episode reward: [(0, '-24.870')] [2022-07-10 06:10:13,259][26022] Updated weights on worker 0-0, policy_version 600660 (0.00098) [2022-07-10 06:10:15,231][26022] Updated weights on worker 0-0, policy_version 600670 (0.00084) [2022-07-10 06:10:16,767][26022] Updated weights on worker 0-0, policy_version 600680 (0.00500) [2022-07-10 06:10:18,199][25689] Fps is (10 sec: 5658.6, 60 sec: 5603.5, 300 sec: 5614.4). Total num frames: 615103488. Throughput: 0: 5804.0. Samples: 615112826. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:18,199][25689] Avg episode reward: [(0, '-24.412')] [2022-07-10 06:10:18,596][26022] Updated weights on worker 0-0, policy_version 600690 (0.00087) [2022-07-10 06:10:20,435][26022] Updated weights on worker 0-0, policy_version 600700 (0.00386) [2022-07-10 06:10:22,191][26022] Updated weights on worker 0-0, policy_version 600710 (0.00057) [2022-07-10 06:10:23,207][25689] Fps is (10 sec: 5756.0, 60 sec: 5606.8, 300 sec: 5611.1). Total num frames: 615132160. Throughput: 0: 5079.4. Samples: 615129876. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:23,207][25689] Avg episode reward: [(0, '-23.877')] [2022-07-10 06:10:24,107][26022] Updated weights on worker 0-0, policy_version 600720 (0.00086) [2022-07-10 06:10:25,670][26022] Updated weights on worker 0-0, policy_version 600730 (0.00092) [2022-07-10 06:10:27,644][26022] Updated weights on worker 0-0, policy_version 600740 (0.00383) [2022-07-10 06:10:28,219][25689] Fps is (10 sec: 5825.1, 60 sec: 5645.0, 300 sec: 5621.2). Total num frames: 615161856. Throughput: 0: 5930.7. Samples: 615164038. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:28,219][25689] Avg episode reward: [(0, '-23.461')] [2022-07-10 06:10:29,581][26022] Updated weights on worker 0-0, policy_version 600750 (0.00095) [2022-07-10 06:10:31,139][26022] Updated weights on worker 0-0, policy_version 600760 (0.00091) [2022-07-10 06:10:33,127][26022] Updated weights on worker 0-0, policy_version 600770 (0.00106) [2022-07-10 06:10:33,242][25689] Fps is (10 sec: 5612.2, 60 sec: 5614.5, 300 sec: 5615.0). Total num frames: 615188480. Throughput: 0: 5923.1. Samples: 615197972. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:33,243][25689] Avg episode reward: [(0, '-23.463')] [2022-07-10 06:10:34,783][26022] Updated weights on worker 0-0, policy_version 600780 (0.00090) [2022-07-10 06:10:36,810][26022] Updated weights on worker 0-0, policy_version 600790 (0.00090) [2022-07-10 06:10:38,293][25689] Fps is (10 sec: 5590.5, 60 sec: 5650.6, 300 sec: 5614.2). Total num frames: 615218176. Throughput: 0: 5058.2. Samples: 615214468. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:38,293][25689] Avg episode reward: [(0, '-22.905')] [2022-07-10 06:10:38,502][26022] Updated weights on worker 0-0, policy_version 600800 (0.00088) [2022-07-10 06:10:40,282][26022] Updated weights on worker 0-0, policy_version 600810 (0.00084) [2022-07-10 06:10:42,324][26022] Updated weights on worker 0-0, policy_version 600820 (0.00087) [2022-07-10 06:10:43,324][25689] Fps is (10 sec: 5688.1, 60 sec: 5614.4, 300 sec: 5614.3). Total num frames: 615245824. Throughput: 0: 5904.8. Samples: 615248662. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:43,324][25689] Avg episode reward: [(0, '-23.532')] [2022-07-10 06:10:43,891][26022] Updated weights on worker 0-0, policy_version 600830 (0.00085) [2022-07-10 06:10:45,867][26022] Updated weights on worker 0-0, policy_version 600840 (0.00094) [2022-07-10 06:10:47,598][26022] Updated weights on worker 0-0, policy_version 600850 (0.00091) [2022-07-10 06:10:48,362][25689] Fps is (10 sec: 5491.7, 60 sec: 5594.9, 300 sec: 5607.2). Total num frames: 615273472. Throughput: 0: 5880.2. Samples: 615282484. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:48,363][25689] Avg episode reward: [(0, '-23.884')] [2022-07-10 06:10:49,347][26022] Updated weights on worker 0-0, policy_version 600860 (0.00095) [2022-07-10 06:10:51,332][26022] Updated weights on worker 0-0, policy_version 600870 (0.00086) [2022-07-10 06:10:53,008][26022] Updated weights on worker 0-0, policy_version 600880 (0.00122) [2022-07-10 06:10:53,387][25689] Fps is (10 sec: 5596.8, 60 sec: 5627.6, 300 sec: 5611.2). Total num frames: 615302144. Throughput: 0: 5031.8. Samples: 615299334. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:53,387][25689] Avg episode reward: [(0, '-25.251')] [2022-07-10 06:10:54,942][26022] Updated weights on worker 0-0, policy_version 600890 (0.00087) [2022-07-10 06:10:56,707][26022] Updated weights on worker 0-0, policy_version 600900 (0.00085) [2022-07-10 06:10:58,481][25689] Fps is (10 sec: 5666.8, 60 sec: 5623.1, 300 sec: 5613.0). Total num frames: 615330816. Throughput: 0: 5882.4. Samples: 615333224. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:10:58,482][25689] Avg episode reward: [(0, '-25.728')] [2022-07-10 06:10:58,512][26022] Updated weights on worker 0-0, policy_version 600910 (0.00087) [2022-07-10 06:11:00,391][26022] Updated weights on worker 0-0, policy_version 600920 (0.01089) [2022-07-10 06:11:02,459][26022] Updated weights on worker 0-0, policy_version 600930 (0.00091) [2022-07-10 06:11:03,489][25689] Fps is (10 sec: 5372.3, 60 sec: 5609.1, 300 sec: 5610.0). Total num frames: 615356416. Throughput: 0: 5781.2. Samples: 615365240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:11:03,489][25689] Avg episode reward: [(0, '-25.343')] [2022-07-10 06:11:04,213][26022] Updated weights on worker 0-0, policy_version 600940 (0.00095) [2022-07-10 06:11:06,443][26022] Updated weights on worker 0-0, policy_version 600950 (0.00099) [2022-07-10 06:11:08,058][26022] Updated weights on worker 0-0, policy_version 600960 (0.00084) [2022-07-10 06:11:08,516][25689] Fps is (10 sec: 5408.2, 60 sec: 5606.9, 300 sec: 5613.3). Total num frames: 615385088. Throughput: 0: 4938.4. Samples: 615382014. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:11:08,517][25689] Avg episode reward: [(0, '-24.702')] [2022-07-10 06:11:09,816][26022] Updated weights on worker 0-0, policy_version 600970 (0.00085) [2022-07-10 06:11:11,615][26022] Updated weights on worker 0-0, policy_version 600980 (0.00090) [2022-07-10 06:11:13,531][25689] Fps is (10 sec: 5608.2, 60 sec: 5605.7, 300 sec: 5607.7). Total num frames: 615412736. Throughput: 0: 5769.5. Samples: 615415558. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:11:13,532][25689] Avg episode reward: [(0, '-23.700')] [2022-07-10 06:11:13,540][26022] Updated weights on worker 0-0, policy_version 600990 (0.00089) [2022-07-10 06:11:15,347][26022] Updated weights on worker 0-0, policy_version 601000 (0.00096) [2022-07-10 06:11:17,063][26022] Updated weights on worker 0-0, policy_version 601010 (0.00092) [2022-07-10 06:11:18,596][25689] Fps is (10 sec: 5689.1, 60 sec: 5611.9, 300 sec: 5614.6). Total num frames: 615442432. Throughput: 0: 5779.6. Samples: 615449480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:11:18,598][25689] Avg episode reward: [(0, '-24.659')] [2022-07-10 06:11:19,027][26022] Updated weights on worker 0-0, policy_version 601020 (0.00085) [2022-07-10 06:11:20,739][26022] Updated weights on worker 0-0, policy_version 601030 (0.00087) [2022-07-10 06:11:22,616][26022] Updated weights on worker 0-0, policy_version 601040 (0.00090) [2022-07-10 06:11:23,660][25689] Fps is (10 sec: 5661.3, 60 sec: 5589.8, 300 sec: 5608.1). Total num frames: 615470080. Throughput: 0: 5017.0. Samples: 615466440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:11:23,662][25689] Avg episode reward: [(0, '-23.277')] [2022-07-10 06:11:24,252][26022] Updated weights on worker 0-0, policy_version 601050 (0.00092) [2022-07-10 06:11:26,334][26022] Updated weights on worker 0-0, policy_version 601060 (0.00091) [2022-07-10 06:11:27,975][26022] Updated weights on worker 0-0, policy_version 601070 (0.00084) [2022-07-10 06:11:28,685][25689] Fps is (10 sec: 5480.7, 60 sec: 5554.7, 300 sec: 5611.3). Total num frames: 615497728. Throughput: 0: 5859.3. Samples: 615500188. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 06:11:28,687][25689] Avg episode reward: [(0, '-24.896')] [2022-07-10 06:11:30,047][26022] Updated weights on worker 0-0, policy_version 601080 (0.00085) [2022-07-10 06:11:31,719][26022] Updated weights on worker 0-0, policy_version 601090 (0.00084) [2022-07-10 06:11:33,562][26022] Updated weights on worker 0-0, policy_version 601100 (0.00091) [2022-07-10 06:11:33,715][25689] Fps is (10 sec: 5601.5, 60 sec: 5588.0, 300 sec: 5609.9). Total num frames: 615526400. Throughput: 0: 5869.9. Samples: 615534034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:11:33,715][25689] Avg episode reward: [(0, '-24.895')] [2022-07-10 06:11:35,341][26022] Updated weights on worker 0-0, policy_version 601110 (0.00108) [2022-07-10 06:11:37,311][26022] Updated weights on worker 0-0, policy_version 601120 (0.00095) [2022-07-10 06:11:38,788][25689] Fps is (10 sec: 5676.2, 60 sec: 5569.0, 300 sec: 5613.3). Total num frames: 615555072. Throughput: 0: 5027.8. Samples: 615551000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:11:38,788][25689] Avg episode reward: [(0, '-25.048')] [2022-07-10 06:11:39,044][26022] Updated weights on worker 0-0, policy_version 601130 (0.00098) [2022-07-10 06:11:40,776][26022] Updated weights on worker 0-0, policy_version 601140 (0.00087) [2022-07-10 06:11:42,877][26022] Updated weights on worker 0-0, policy_version 601150 (0.00086) [2022-07-10 06:11:43,816][25689] Fps is (10 sec: 5676.8, 60 sec: 5586.1, 300 sec: 5612.9). Total num frames: 615583744. Throughput: 0: 5875.7. Samples: 615584870. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:11:43,817][25689] Avg episode reward: [(0, '-24.703')] [2022-07-10 06:11:44,503][26022] Updated weights on worker 0-0, policy_version 601160 (0.00093) [2022-07-10 06:11:46,339][26022] Updated weights on worker 0-0, policy_version 601170 (0.00089) [2022-07-10 06:11:48,008][26022] Updated weights on worker 0-0, policy_version 601180 (0.00087) [2022-07-10 06:11:48,837][25689] Fps is (10 sec: 5706.4, 60 sec: 5604.7, 300 sec: 5610.5). Total num frames: 615612416. Throughput: 0: 5891.3. Samples: 615618908. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:11:48,837][25689] Avg episode reward: [(0, '-24.882')] [2022-07-10 06:11:49,824][26022] Updated weights on worker 0-0, policy_version 601190 (0.00089) [2022-07-10 06:11:51,696][26022] Updated weights on worker 0-0, policy_version 601200 (0.00095) [2022-07-10 06:11:53,466][26022] Updated weights on worker 0-0, policy_version 601210 (0.00086) [2022-07-10 06:11:53,839][25689] Fps is (10 sec: 5619.2, 60 sec: 5589.9, 300 sec: 5611.4). Total num frames: 615640064. Throughput: 0: 5066.7. Samples: 615635998. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:11:53,839][25689] Avg episode reward: [(0, '-24.604')] [2022-07-10 06:11:55,398][26022] Updated weights on worker 0-0, policy_version 601220 (0.00106) [2022-07-10 06:11:57,091][26022] Updated weights on worker 0-0, policy_version 601230 (0.00093) [2022-07-10 06:11:58,895][25689] Fps is (10 sec: 5599.7, 60 sec: 5593.5, 300 sec: 5610.8). Total num frames: 615668736. Throughput: 0: 5913.8. Samples: 615669908. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:11:58,896][25689] Avg episode reward: [(0, '-23.731')] [2022-07-10 06:11:59,047][26022] Updated weights on worker 0-0, policy_version 601240 (0.00083) [2022-07-10 06:12:00,711][26022] Updated weights on worker 0-0, policy_version 601250 (0.00098) [2022-07-10 06:12:03,087][26022] Updated weights on worker 0-0, policy_version 601260 (0.00367) [2022-07-10 06:12:03,912][25689] Fps is (10 sec: 5591.1, 60 sec: 5626.4, 300 sec: 5614.2). Total num frames: 615696384. Throughput: 0: 5811.3. Samples: 615701654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:03,914][25689] Avg episode reward: [(0, '-23.942')] [2022-07-10 06:12:04,674][26022] Updated weights on worker 0-0, policy_version 601270 (0.00084) [2022-07-10 06:12:06,582][26022] Updated weights on worker 0-0, policy_version 601280 (0.00088) [2022-07-10 06:12:07,020][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:12:07,047][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000601282_615712768.pth [2022-07-10 06:12:07,047][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000599308_613691392.pth [2022-07-10 06:12:08,401][26022] Updated weights on worker 0-0, policy_version 601290 (0.00089) [2022-07-10 06:12:08,916][25689] Fps is (10 sec: 5415.7, 60 sec: 5594.8, 300 sec: 5614.5). Total num frames: 615723008. Throughput: 0: 4960.9. Samples: 615718518. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:08,918][25689] Avg episode reward: [(0, '-24.299')] [2022-07-10 06:12:10,335][26022] Updated weights on worker 0-0, policy_version 601300 (0.00094) [2022-07-10 06:12:12,007][26022] Updated weights on worker 0-0, policy_version 601310 (0.00089) [2022-07-10 06:12:13,952][25689] Fps is (10 sec: 5405.8, 60 sec: 5592.8, 300 sec: 5604.7). Total num frames: 615750656. Throughput: 0: 5775.8. Samples: 615752164. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:13,952][25689] Avg episode reward: [(0, '-25.587')] [2022-07-10 06:12:14,067][26022] Updated weights on worker 0-0, policy_version 601320 (0.00453) [2022-07-10 06:12:15,482][26022] Updated weights on worker 0-0, policy_version 601330 (0.00097) [2022-07-10 06:12:17,571][26022] Updated weights on worker 0-0, policy_version 601340 (0.00090) [2022-07-10 06:12:19,007][25689] Fps is (10 sec: 5682.8, 60 sec: 5593.7, 300 sec: 5610.6). Total num frames: 615780352. Throughput: 0: 5772.7. Samples: 615786008. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:19,008][25689] Avg episode reward: [(0, '-26.433')] [2022-07-10 06:12:19,185][26022] Updated weights on worker 0-0, policy_version 601350 (0.00084) [2022-07-10 06:12:21,129][26022] Updated weights on worker 0-0, policy_version 601360 (0.00083) [2022-07-10 06:12:23,014][26022] Updated weights on worker 0-0, policy_version 601370 (0.00096) [2022-07-10 06:12:24,042][25689] Fps is (10 sec: 5784.6, 60 sec: 5613.4, 300 sec: 5613.6). Total num frames: 615809024. Throughput: 0: 5035.3. Samples: 615803014. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:24,043][25689] Avg episode reward: [(0, '-26.873')] [2022-07-10 06:12:24,669][26022] Updated weights on worker 0-0, policy_version 601380 (0.00086) [2022-07-10 06:12:26,484][26022] Updated weights on worker 0-0, policy_version 601390 (0.00086) [2022-07-10 06:12:28,635][26022] Updated weights on worker 0-0, policy_version 601400 (0.00090) [2022-07-10 06:12:29,075][25689] Fps is (10 sec: 5492.2, 60 sec: 5595.7, 300 sec: 5606.6). Total num frames: 615835648. Throughput: 0: 5880.6. Samples: 615837064. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:29,075][25689] Avg episode reward: [(0, '-26.481')] [2022-07-10 06:12:30,089][26022] Updated weights on worker 0-0, policy_version 601410 (0.00094) [2022-07-10 06:12:32,139][26022] Updated weights on worker 0-0, policy_version 601420 (0.00097) [2022-07-10 06:12:33,721][26022] Updated weights on worker 0-0, policy_version 601430 (0.00081) [2022-07-10 06:12:34,111][25689] Fps is (10 sec: 5593.6, 60 sec: 5612.0, 300 sec: 5611.2). Total num frames: 615865344. Throughput: 0: 5879.9. Samples: 615870696. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:34,111][25689] Avg episode reward: [(0, '-26.086')] [2022-07-10 06:12:35,828][26022] Updated weights on worker 0-0, policy_version 601440 (0.00090) [2022-07-10 06:12:37,427][26022] Updated weights on worker 0-0, policy_version 601450 (0.00086) [2022-07-10 06:12:39,198][25689] Fps is (10 sec: 5664.7, 60 sec: 5593.8, 300 sec: 5606.1). Total num frames: 615892992. Throughput: 0: 5030.1. Samples: 615887572. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:39,198][25689] Avg episode reward: [(0, '-25.340')] [2022-07-10 06:12:39,469][26022] Updated weights on worker 0-0, policy_version 601460 (0.00090) [2022-07-10 06:12:41,090][26022] Updated weights on worker 0-0, policy_version 601470 (0.00097) [2022-07-10 06:12:43,157][26022] Updated weights on worker 0-0, policy_version 601480 (0.00094) [2022-07-10 06:12:44,214][25689] Fps is (10 sec: 5675.6, 60 sec: 5611.8, 300 sec: 5609.6). Total num frames: 615922688. Throughput: 0: 5873.2. Samples: 615921490. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:44,215][25689] Avg episode reward: [(0, '-23.780')] [2022-07-10 06:12:44,704][26022] Updated weights on worker 0-0, policy_version 601490 (0.00085) [2022-07-10 06:12:46,602][26022] Updated weights on worker 0-0, policy_version 601500 (0.00090) [2022-07-10 06:12:48,188][26022] Updated weights on worker 0-0, policy_version 601510 (0.00089) [2022-07-10 06:12:49,243][25689] Fps is (10 sec: 5708.4, 60 sec: 5594.1, 300 sec: 5605.8). Total num frames: 615950336. Throughput: 0: 5881.9. Samples: 615955694. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:49,244][25689] Avg episode reward: [(0, '-23.186')] [2022-07-10 06:12:50,343][26022] Updated weights on worker 0-0, policy_version 601520 (0.00093) [2022-07-10 06:12:52,030][26022] Updated weights on worker 0-0, policy_version 601530 (0.00097) [2022-07-10 06:12:53,965][26022] Updated weights on worker 0-0, policy_version 601540 (0.00416) [2022-07-10 06:12:54,248][25689] Fps is (10 sec: 5613.0, 60 sec: 5610.8, 300 sec: 5607.2). Total num frames: 615979008. Throughput: 0: 5067.8. Samples: 615972748. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:54,249][25689] Avg episode reward: [(0, '-24.948')] [2022-07-10 06:12:55,624][26022] Updated weights on worker 0-0, policy_version 601550 (0.00092) [2022-07-10 06:12:57,475][26022] Updated weights on worker 0-0, policy_version 601560 (0.00090) [2022-07-10 06:12:59,254][26022] Updated weights on worker 0-0, policy_version 601570 (0.00089) [2022-07-10 06:12:59,354][25689] Fps is (10 sec: 5671.3, 60 sec: 5606.1, 300 sec: 5615.6). Total num frames: 616007680. Throughput: 0: 5899.0. Samples: 616006476. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:12:59,355][25689] Avg episode reward: [(0, '-25.213')] [2022-07-10 06:13:01,587][26022] Updated weights on worker 0-0, policy_version 601580 (0.00507) [2022-07-10 06:13:03,364][26022] Updated weights on worker 0-0, policy_version 601590 (0.00083) [2022-07-10 06:13:04,362][25689] Fps is (10 sec: 5366.1, 60 sec: 5573.2, 300 sec: 5601.7). Total num frames: 616033280. Throughput: 0: 5796.1. Samples: 616038268. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:04,362][25689] Avg episode reward: [(0, '-25.027')] [2022-07-10 06:13:05,174][26022] Updated weights on worker 0-0, policy_version 601600 (0.00095) [2022-07-10 06:13:06,989][26022] Updated weights on worker 0-0, policy_version 601610 (0.00088) [2022-07-10 06:13:08,871][26022] Updated weights on worker 0-0, policy_version 601620 (0.00090) [2022-07-10 06:13:09,386][25689] Fps is (10 sec: 5308.0, 60 sec: 5588.2, 300 sec: 5602.2). Total num frames: 616060928. Throughput: 0: 4938.6. Samples: 616055172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:09,388][25689] Avg episode reward: [(0, '-24.827')] [2022-07-10 06:13:10,599][26022] Updated weights on worker 0-0, policy_version 601630 (0.00089) [2022-07-10 06:13:12,686][26022] Updated weights on worker 0-0, policy_version 601640 (0.00087) [2022-07-10 06:13:14,094][26022] Updated weights on worker 0-0, policy_version 601650 (0.00085) [2022-07-10 06:13:14,431][25689] Fps is (10 sec: 5796.6, 60 sec: 5638.1, 300 sec: 5609.1). Total num frames: 616091648. Throughput: 0: 5762.7. Samples: 616089058. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:14,433][25689] Avg episode reward: [(0, '-24.812')] [2022-07-10 06:13:16,256][26022] Updated weights on worker 0-0, policy_version 601660 (0.00094) [2022-07-10 06:13:17,598][26022] Updated weights on worker 0-0, policy_version 601670 (0.00087) [2022-07-10 06:13:19,471][25689] Fps is (10 sec: 5787.5, 60 sec: 5605.7, 300 sec: 5608.8). Total num frames: 616119296. Throughput: 0: 5788.7. Samples: 616122926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:19,473][25689] Avg episode reward: [(0, '-24.958')] [2022-07-10 06:13:19,591][26022] Updated weights on worker 0-0, policy_version 601680 (0.00086) [2022-07-10 06:13:21,361][26022] Updated weights on worker 0-0, policy_version 601690 (0.00096) [2022-07-10 06:13:23,160][26022] Updated weights on worker 0-0, policy_version 601700 (0.00085) [2022-07-10 06:13:24,479][25689] Fps is (10 sec: 5503.3, 60 sec: 5591.3, 300 sec: 5602.3). Total num frames: 616146944. Throughput: 0: 5899.6. Samples: 616156950. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:24,480][25689] Avg episode reward: [(0, '-23.756')] [2022-07-10 06:13:24,997][26022] Updated weights on worker 0-0, policy_version 601710 (0.00089) [2022-07-10 06:13:26,958][26022] Updated weights on worker 0-0, policy_version 601720 (0.00089) [2022-07-10 06:13:28,680][26022] Updated weights on worker 0-0, policy_version 601730 (0.00084) [2022-07-10 06:13:29,486][25689] Fps is (10 sec: 5623.8, 60 sec: 5627.6, 300 sec: 5605.8). Total num frames: 616175616. Throughput: 0: 5897.4. Samples: 616173708. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:29,486][25689] Avg episode reward: [(0, '-24.824')] [2022-07-10 06:13:30,662][26022] Updated weights on worker 0-0, policy_version 601740 (0.00088) [2022-07-10 06:13:32,355][26022] Updated weights on worker 0-0, policy_version 601750 (0.00069) [2022-07-10 06:13:34,279][26022] Updated weights on worker 0-0, policy_version 601760 (0.00088) [2022-07-10 06:13:34,509][25689] Fps is (10 sec: 5614.8, 60 sec: 5594.8, 300 sec: 5604.5). Total num frames: 616203264. Throughput: 0: 5895.7. Samples: 616207434. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:34,510][25689] Avg episode reward: [(0, '-25.235')] [2022-07-10 06:13:35,934][26022] Updated weights on worker 0-0, policy_version 601770 (0.00091) [2022-07-10 06:13:37,935][26022] Updated weights on worker 0-0, policy_version 601780 (0.00085) [2022-07-10 06:13:39,619][25689] Fps is (10 sec: 5557.7, 60 sec: 5609.6, 300 sec: 5602.9). Total num frames: 616231936. Throughput: 0: 5870.4. Samples: 616241204. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:39,620][25689] Avg episode reward: [(0, '-25.480')] [2022-07-10 06:13:39,692][26022] Updated weights on worker 0-0, policy_version 601790 (0.00083) [2022-07-10 06:13:41,546][26022] Updated weights on worker 0-0, policy_version 601800 (0.00086) [2022-07-10 06:13:43,401][26022] Updated weights on worker 0-0, policy_version 601810 (0.00089) [2022-07-10 06:13:44,626][25689] Fps is (10 sec: 5668.2, 60 sec: 5593.6, 300 sec: 5608.1). Total num frames: 616260608. Throughput: 0: 5020.9. Samples: 616258108. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:44,628][25689] Avg episode reward: [(0, '-27.287')] [2022-07-10 06:13:44,977][26022] Updated weights on worker 0-0, policy_version 601820 (0.00092) [2022-07-10 06:13:47,003][26022] Updated weights on worker 0-0, policy_version 601830 (0.00085) [2022-07-10 06:13:48,703][26022] Updated weights on worker 0-0, policy_version 601840 (0.00083) [2022-07-10 06:13:49,692][25689] Fps is (10 sec: 5693.0, 60 sec: 5607.1, 300 sec: 5603.7). Total num frames: 616289280. Throughput: 0: 5865.2. Samples: 616292222. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:49,693][25689] Avg episode reward: [(0, '-27.205')] [2022-07-10 06:13:50,551][26022] Updated weights on worker 0-0, policy_version 601850 (0.00094) [2022-07-10 06:13:52,319][26022] Updated weights on worker 0-0, policy_version 601860 (0.00089) [2022-07-10 06:13:54,223][26022] Updated weights on worker 0-0, policy_version 601870 (0.00089) [2022-07-10 06:13:54,708][25689] Fps is (10 sec: 5484.9, 60 sec: 5572.2, 300 sec: 5598.1). Total num frames: 616315904. Throughput: 0: 5860.9. Samples: 616325814. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:54,708][25689] Avg episode reward: [(0, '-26.757')] [2022-07-10 06:13:56,090][26022] Updated weights on worker 0-0, policy_version 601880 (0.00096) [2022-07-10 06:13:57,918][26022] Updated weights on worker 0-0, policy_version 601890 (0.00087) [2022-07-10 06:13:59,689][26022] Updated weights on worker 0-0, policy_version 601900 (0.00087) [2022-07-10 06:13:59,808][25689] Fps is (10 sec: 5567.4, 60 sec: 5589.7, 300 sec: 5607.3). Total num frames: 616345600. Throughput: 0: 5029.6. Samples: 616342746. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:13:59,809][25689] Avg episode reward: [(0, '-25.983')] [2022-07-10 06:14:01,416][26022] Updated weights on worker 0-0, policy_version 601910 (0.00087) [2022-07-10 06:14:03,715][26022] Updated weights on worker 0-0, policy_version 601920 (0.00086) [2022-07-10 06:14:04,828][25689] Fps is (10 sec: 5564.9, 60 sec: 5605.5, 300 sec: 5604.9). Total num frames: 616372224. Throughput: 0: 5763.3. Samples: 616374538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:04,829][25689] Avg episode reward: [(0, '-25.075')] [2022-07-10 06:14:05,485][26022] Updated weights on worker 0-0, policy_version 601930 (0.00095) [2022-07-10 06:14:07,114][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:14:07,131][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000601939_616385536.pth [2022-07-10 06:14:07,131][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000599967_614366208.pth [2022-07-10 06:14:07,219][26022] Updated weights on worker 0-0, policy_version 601940 (0.00092) [2022-07-10 06:14:09,268][26022] Updated weights on worker 0-0, policy_version 601950 (0.00095) [2022-07-10 06:14:09,855][25689] Fps is (10 sec: 5300.2, 60 sec: 5588.4, 300 sec: 5598.6). Total num frames: 616398848. Throughput: 0: 5751.5. Samples: 616408186. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:09,855][25689] Avg episode reward: [(0, '-24.557')] [2022-07-10 06:14:10,839][26022] Updated weights on worker 0-0, policy_version 601960 (0.00089) [2022-07-10 06:14:12,811][26022] Updated weights on worker 0-0, policy_version 601970 (0.00087) [2022-07-10 06:14:14,492][26022] Updated weights on worker 0-0, policy_version 601980 (0.00078) [2022-07-10 06:14:14,869][25689] Fps is (10 sec: 5609.0, 60 sec: 5574.2, 300 sec: 5599.7). Total num frames: 616428544. Throughput: 0: 4930.1. Samples: 616425212. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:14,870][25689] Avg episode reward: [(0, '-23.379')] [2022-07-10 06:14:16,359][26022] Updated weights on worker 0-0, policy_version 601990 (0.00093) [2022-07-10 06:14:18,190][26022] Updated weights on worker 0-0, policy_version 602000 (0.00085) [2022-07-10 06:14:19,899][26022] Updated weights on worker 0-0, policy_version 602010 (0.00084) [2022-07-10 06:14:19,997][25689] Fps is (10 sec: 5855.7, 60 sec: 5600.0, 300 sec: 5601.6). Total num frames: 616458240. Throughput: 0: 5782.8. Samples: 616459496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:19,998][25689] Avg episode reward: [(0, '-22.900')] [2022-07-10 06:14:21,719][26022] Updated weights on worker 0-0, policy_version 602020 (0.00095) [2022-07-10 06:14:23,583][26022] Updated weights on worker 0-0, policy_version 602030 (0.00085) [2022-07-10 06:14:25,031][25689] Fps is (10 sec: 5744.0, 60 sec: 5614.5, 300 sec: 5605.5). Total num frames: 616486912. Throughput: 0: 5896.5. Samples: 616493662. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:25,031][25689] Avg episode reward: [(0, '-23.596')] [2022-07-10 06:14:25,327][26022] Updated weights on worker 0-0, policy_version 602040 (0.00098) [2022-07-10 06:14:27,203][26022] Updated weights on worker 0-0, policy_version 602050 (0.01339) [2022-07-10 06:14:28,870][26022] Updated weights on worker 0-0, policy_version 602060 (0.00091) [2022-07-10 06:14:30,082][25689] Fps is (10 sec: 5584.8, 60 sec: 5593.5, 300 sec: 5602.2). Total num frames: 616514560. Throughput: 0: 5070.8. Samples: 616510754. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:30,083][25689] Avg episode reward: [(0, '-23.992')] [2022-07-10 06:14:30,855][26022] Updated weights on worker 0-0, policy_version 602070 (0.00098) [2022-07-10 06:14:32,702][26022] Updated weights on worker 0-0, policy_version 602080 (0.00083) [2022-07-10 06:14:34,406][26022] Updated weights on worker 0-0, policy_version 602090 (0.00094) [2022-07-10 06:14:35,088][25689] Fps is (10 sec: 5599.9, 60 sec: 5612.0, 300 sec: 5606.9). Total num frames: 616543232. Throughput: 0: 5920.9. Samples: 616544926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 06:14:35,089][25689] Avg episode reward: [(0, '-24.564')] [2022-07-10 06:14:36,090][26022] Updated weights on worker 0-0, policy_version 602100 (0.00098) [2022-07-10 06:14:38,163][26022] Updated weights on worker 0-0, policy_version 602110 (0.00093) [2022-07-10 06:14:39,860][26022] Updated weights on worker 0-0, policy_version 602120 (0.00095) [2022-07-10 06:14:40,167][25689] Fps is (10 sec: 5686.2, 60 sec: 5614.9, 300 sec: 5602.1). Total num frames: 616571904. Throughput: 0: 5905.3. Samples: 616578602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:14:40,167][25689] Avg episode reward: [(0, '-23.922')] [2022-07-10 06:14:41,880][26022] Updated weights on worker 0-0, policy_version 602130 (0.00089) [2022-07-10 06:14:43,369][26022] Updated weights on worker 0-0, policy_version 602140 (0.00054) [2022-07-10 06:14:45,201][25689] Fps is (10 sec: 5569.1, 60 sec: 5595.5, 300 sec: 5598.2). Total num frames: 616599552. Throughput: 0: 5054.0. Samples: 616595602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:14:45,202][25689] Avg episode reward: [(0, '-23.657')] [2022-07-10 06:14:45,368][26022] Updated weights on worker 0-0, policy_version 602150 (0.00094) [2022-07-10 06:14:47,168][26022] Updated weights on worker 0-0, policy_version 602160 (0.00086) [2022-07-10 06:14:48,978][26022] Updated weights on worker 0-0, policy_version 602170 (0.00088) [2022-07-10 06:14:50,229][25689] Fps is (10 sec: 5597.0, 60 sec: 5599.0, 300 sec: 5604.8). Total num frames: 616628224. Throughput: 0: 5893.1. Samples: 616629484. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:14:50,231][25689] Avg episode reward: [(0, '-24.465')] [2022-07-10 06:14:50,832][26022] Updated weights on worker 0-0, policy_version 602180 (0.00088) [2022-07-10 06:14:52,518][26022] Updated weights on worker 0-0, policy_version 602190 (0.00091) [2022-07-10 06:14:54,277][26022] Updated weights on worker 0-0, policy_version 602200 (0.00086) [2022-07-10 06:14:55,271][25689] Fps is (10 sec: 5796.4, 60 sec: 5647.3, 300 sec: 5608.3). Total num frames: 616657920. Throughput: 0: 5880.5. Samples: 616663610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:14:55,271][25689] Avg episode reward: [(0, '-23.353')] [2022-07-10 06:14:56,276][26022] Updated weights on worker 0-0, policy_version 602210 (0.00085) [2022-07-10 06:14:57,916][26022] Updated weights on worker 0-0, policy_version 602220 (0.00096) [2022-07-10 06:14:59,826][26022] Updated weights on worker 0-0, policy_version 602230 (0.00091) [2022-07-10 06:15:00,345][25689] Fps is (10 sec: 5770.0, 60 sec: 5632.8, 300 sec: 5614.6). Total num frames: 616686592. Throughput: 0: 5056.7. Samples: 616680638. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:00,345][25689] Avg episode reward: [(0, '-23.764')] [2022-07-10 06:15:01,387][26022] Updated weights on worker 0-0, policy_version 602240 (0.00083) [2022-07-10 06:15:03,833][26022] Updated weights on worker 0-0, policy_version 602250 (0.00106) [2022-07-10 06:15:05,357][25689] Fps is (10 sec: 5380.5, 60 sec: 5616.6, 300 sec: 5604.1). Total num frames: 616712192. Throughput: 0: 5818.9. Samples: 616712890. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:05,359][25689] Avg episode reward: [(0, '-24.581')] [2022-07-10 06:15:05,480][26022] Updated weights on worker 0-0, policy_version 602260 (0.00086) [2022-07-10 06:15:07,259][26022] Updated weights on worker 0-0, policy_version 602270 (0.00060) [2022-07-10 06:15:09,058][26022] Updated weights on worker 0-0, policy_version 602280 (0.00086) [2022-07-10 06:15:10,377][25689] Fps is (10 sec: 5409.9, 60 sec: 5651.1, 300 sec: 5607.2). Total num frames: 616740864. Throughput: 0: 5847.8. Samples: 616747304. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:10,379][25689] Avg episode reward: [(0, '-25.821')] [2022-07-10 06:15:10,884][26022] Updated weights on worker 0-0, policy_version 602290 (0.00091) [2022-07-10 06:15:12,811][26022] Updated weights on worker 0-0, policy_version 602300 (0.00097) [2022-07-10 06:15:14,459][26022] Updated weights on worker 0-0, policy_version 602310 (0.00907) [2022-07-10 06:15:15,405][25689] Fps is (10 sec: 5707.2, 60 sec: 5632.9, 300 sec: 5605.7). Total num frames: 616769536. Throughput: 0: 4995.9. Samples: 616764200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:15,407][25689] Avg episode reward: [(0, '-26.570')] [2022-07-10 06:15:16,465][26022] Updated weights on worker 0-0, policy_version 602320 (0.00084) [2022-07-10 06:15:18,200][26022] Updated weights on worker 0-0, policy_version 602330 (0.00095) [2022-07-10 06:15:19,901][26022] Updated weights on worker 0-0, policy_version 602340 (0.00092) [2022-07-10 06:15:20,486][25689] Fps is (10 sec: 5672.3, 60 sec: 5620.3, 300 sec: 5604.3). Total num frames: 616798208. Throughput: 0: 5835.7. Samples: 616798178. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:20,487][25689] Avg episode reward: [(0, '-25.507')] [2022-07-10 06:15:21,831][26022] Updated weights on worker 0-0, policy_version 602350 (0.00087) [2022-07-10 06:15:23,395][26022] Updated weights on worker 0-0, policy_version 602360 (0.00092) [2022-07-10 06:15:25,365][26022] Updated weights on worker 0-0, policy_version 602370 (0.00087) [2022-07-10 06:15:25,505][25689] Fps is (10 sec: 5677.5, 60 sec: 5621.7, 300 sec: 5600.7). Total num frames: 616826880. Throughput: 0: 5936.0. Samples: 616832488. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:25,506][25689] Avg episode reward: [(0, '-24.882')] [2022-07-10 06:15:26,989][26022] Updated weights on worker 0-0, policy_version 602380 (0.00095) [2022-07-10 06:15:29,015][26022] Updated weights on worker 0-0, policy_version 602390 (0.00100) [2022-07-10 06:15:30,517][25689] Fps is (10 sec: 5819.0, 60 sec: 5659.2, 300 sec: 5611.3). Total num frames: 616856576. Throughput: 0: 5085.1. Samples: 616849718. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:30,518][25689] Avg episode reward: [(0, '-24.592')] [2022-07-10 06:15:30,842][26022] Updated weights on worker 0-0, policy_version 602400 (0.00086) [2022-07-10 06:15:32,407][26022] Updated weights on worker 0-0, policy_version 602410 (0.00079) [2022-07-10 06:15:34,459][26022] Updated weights on worker 0-0, policy_version 602420 (0.00085) [2022-07-10 06:15:35,542][25689] Fps is (10 sec: 5815.1, 60 sec: 5657.4, 300 sec: 5608.3). Total num frames: 616885248. Throughput: 0: 5949.4. Samples: 616884006. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:35,543][25689] Avg episode reward: [(0, '-22.450')] [2022-07-10 06:15:35,971][26022] Updated weights on worker 0-0, policy_version 602430 (0.00097) [2022-07-10 06:15:37,930][26022] Updated weights on worker 0-0, policy_version 602440 (0.00084) [2022-07-10 06:15:39,726][26022] Updated weights on worker 0-0, policy_version 602450 (0.00093) [2022-07-10 06:15:40,594][25689] Fps is (10 sec: 5690.4, 60 sec: 5659.9, 300 sec: 5611.4). Total num frames: 616913920. Throughput: 0: 5965.2. Samples: 616918126. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:40,595][25689] Avg episode reward: [(0, '-21.304')] [2022-07-10 06:15:41,524][26022] Updated weights on worker 0-0, policy_version 602460 (0.00085) [2022-07-10 06:15:43,427][26022] Updated weights on worker 0-0, policy_version 602470 (0.00086) [2022-07-10 06:15:45,184][26022] Updated weights on worker 0-0, policy_version 602480 (0.00084) [2022-07-10 06:15:45,619][25689] Fps is (10 sec: 5589.4, 60 sec: 5660.9, 300 sec: 5611.6). Total num frames: 616941568. Throughput: 0: 5097.7. Samples: 616935020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:45,619][25689] Avg episode reward: [(0, '-20.718')] [2022-07-10 06:15:46,895][26022] Updated weights on worker 0-0, policy_version 602490 (0.00085) [2022-07-10 06:15:48,911][26022] Updated weights on worker 0-0, policy_version 602500 (0.00087) [2022-07-10 06:15:50,629][25689] Fps is (10 sec: 5612.4, 60 sec: 5662.5, 300 sec: 5611.9). Total num frames: 616970240. Throughput: 0: 5947.4. Samples: 616969334. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:50,630][25689] Avg episode reward: [(0, '-21.754')] [2022-07-10 06:15:50,632][26022] Updated weights on worker 0-0, policy_version 602510 (0.00112) [2022-07-10 06:15:52,386][26022] Updated weights on worker 0-0, policy_version 602520 (0.00093) [2022-07-10 06:15:54,004][26022] Updated weights on worker 0-0, policy_version 602530 (0.00086) [2022-07-10 06:15:55,639][25689] Fps is (10 sec: 5722.8, 60 sec: 5648.5, 300 sec: 5613.5). Total num frames: 616998912. Throughput: 0: 5931.5. Samples: 617003208. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:15:55,639][25689] Avg episode reward: [(0, '-21.306')] [2022-07-10 06:15:56,142][26022] Updated weights on worker 0-0, policy_version 602540 (0.00091) [2022-07-10 06:15:57,822][26022] Updated weights on worker 0-0, policy_version 602550 (0.00080) [2022-07-10 06:15:59,869][26022] Updated weights on worker 0-0, policy_version 602560 (0.00090) [2022-07-10 06:16:00,708][25689] Fps is (10 sec: 5791.5, 60 sec: 5666.0, 300 sec: 5626.1). Total num frames: 617028608. Throughput: 0: 5082.1. Samples: 617020344. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:00,709][25689] Avg episode reward: [(0, '-21.938')] [2022-07-10 06:16:01,296][26022] Updated weights on worker 0-0, policy_version 602570 (0.00087) [2022-07-10 06:16:03,617][26022] Updated weights on worker 0-0, policy_version 602580 (0.00052) [2022-07-10 06:16:05,404][26022] Updated weights on worker 0-0, policy_version 602590 (0.00085) [2022-07-10 06:16:05,781][25689] Fps is (10 sec: 5452.0, 60 sec: 5660.3, 300 sec: 5614.9). Total num frames: 617054208. Throughput: 0: 5830.1. Samples: 617052570. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:05,782][25689] Avg episode reward: [(0, '-22.721')] [2022-07-10 06:16:07,267][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:16:07,282][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000602600_617062400.pth [2022-07-10 06:16:07,282][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000600626_615041024.pth [2022-07-10 06:16:07,286][26022] Updated weights on worker 0-0, policy_version 602600 (0.00100) [2022-07-10 06:16:08,831][26022] Updated weights on worker 0-0, policy_version 602610 (0.00090) [2022-07-10 06:16:10,823][25689] Fps is (10 sec: 5263.9, 60 sec: 5641.2, 300 sec: 5614.4). Total num frames: 617081856. Throughput: 0: 5806.8. Samples: 617086596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:10,825][25689] Avg episode reward: [(0, '-22.626')] [2022-07-10 06:16:10,856][26022] Updated weights on worker 0-0, policy_version 602620 (0.00089) [2022-07-10 06:16:12,532][26022] Updated weights on worker 0-0, policy_version 602630 (0.00087) [2022-07-10 06:16:14,494][26022] Updated weights on worker 0-0, policy_version 602640 (0.00085) [2022-07-10 06:16:15,844][25689] Fps is (10 sec: 5800.4, 60 sec: 5675.8, 300 sec: 5618.7). Total num frames: 617112576. Throughput: 0: 5817.1. Samples: 617120742. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:15,846][25689] Avg episode reward: [(0, '-22.373')] [2022-07-10 06:16:15,940][26022] Updated weights on worker 0-0, policy_version 602650 (0.00295) [2022-07-10 06:16:18,112][26022] Updated weights on worker 0-0, policy_version 602660 (0.00080) [2022-07-10 06:16:19,659][26022] Updated weights on worker 0-0, policy_version 602670 (0.00094) [2022-07-10 06:16:20,966][25689] Fps is (10 sec: 5653.7, 60 sec: 5638.1, 300 sec: 5614.2). Total num frames: 617139200. Throughput: 0: 5785.9. Samples: 617137558. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:20,968][25689] Avg episode reward: [(0, '-22.665')] [2022-07-10 06:16:21,747][26022] Updated weights on worker 0-0, policy_version 602680 (0.00092) [2022-07-10 06:16:23,575][26022] Updated weights on worker 0-0, policy_version 602690 (0.00088) [2022-07-10 06:16:25,059][26022] Updated weights on worker 0-0, policy_version 602700 (0.00084) [2022-07-10 06:16:26,055][25689] Fps is (10 sec: 5415.4, 60 sec: 5631.6, 300 sec: 5616.4). Total num frames: 617167872. Throughput: 0: 5874.8. Samples: 617171674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:26,055][25689] Avg episode reward: [(0, '-24.440')] [2022-07-10 06:16:27,182][26022] Updated weights on worker 0-0, policy_version 602710 (0.00090) [2022-07-10 06:16:29,024][26022] Updated weights on worker 0-0, policy_version 602720 (0.00086) [2022-07-10 06:16:30,570][26022] Updated weights on worker 0-0, policy_version 602730 (0.00100) [2022-07-10 06:16:31,060][25689] Fps is (10 sec: 5884.0, 60 sec: 5649.2, 300 sec: 5623.7). Total num frames: 617198592. Throughput: 0: 5890.6. Samples: 617205802. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:31,060][25689] Avg episode reward: [(0, '-24.182')] [2022-07-10 06:16:32,552][26022] Updated weights on worker 0-0, policy_version 602740 (0.00088) [2022-07-10 06:16:34,129][26022] Updated weights on worker 0-0, policy_version 602750 (0.00094) [2022-07-10 06:16:36,067][25689] Fps is (10 sec: 5727.3, 60 sec: 5617.0, 300 sec: 5618.1). Total num frames: 617225216. Throughput: 0: 5051.1. Samples: 617222892. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:36,068][25689] Avg episode reward: [(0, '-23.440')] [2022-07-10 06:16:36,163][26022] Updated weights on worker 0-0, policy_version 602760 (0.00085) [2022-07-10 06:16:37,940][26022] Updated weights on worker 0-0, policy_version 602770 (0.00086) [2022-07-10 06:16:39,691][26022] Updated weights on worker 0-0, policy_version 602780 (0.00088) [2022-07-10 06:16:41,207][25689] Fps is (10 sec: 5449.1, 60 sec: 5608.8, 300 sec: 5616.0). Total num frames: 617253888. Throughput: 0: 5895.0. Samples: 617256882. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:41,208][25689] Avg episode reward: [(0, '-24.606')] [2022-07-10 06:16:41,533][26022] Updated weights on worker 0-0, policy_version 602790 (0.00622) [2022-07-10 06:16:43,235][26022] Updated weights on worker 0-0, policy_version 602800 (0.00077) [2022-07-10 06:16:45,101][26022] Updated weights on worker 0-0, policy_version 602810 (0.00085) [2022-07-10 06:16:46,243][25689] Fps is (10 sec: 5836.5, 60 sec: 5658.4, 300 sec: 5622.6). Total num frames: 617284608. Throughput: 0: 5914.9. Samples: 617291086. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:46,244][25689] Avg episode reward: [(0, '-24.125')] [2022-07-10 06:16:46,951][26022] Updated weights on worker 0-0, policy_version 602820 (0.00094) [2022-07-10 06:16:48,522][26022] Updated weights on worker 0-0, policy_version 602830 (0.00082) [2022-07-10 06:16:50,418][26022] Updated weights on worker 0-0, policy_version 602840 (0.00087) [2022-07-10 06:16:51,277][25689] Fps is (10 sec: 5898.2, 60 sec: 5656.3, 300 sec: 5625.4). Total num frames: 617313280. Throughput: 0: 5078.7. Samples: 617308478. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:51,277][25689] Avg episode reward: [(0, '-23.907')] [2022-07-10 06:16:52,261][26022] Updated weights on worker 0-0, policy_version 602850 (0.00091) [2022-07-10 06:16:53,887][26022] Updated weights on worker 0-0, policy_version 602860 (0.00097) [2022-07-10 06:16:55,699][26022] Updated weights on worker 0-0, policy_version 602870 (0.00086) [2022-07-10 06:16:56,354][25689] Fps is (10 sec: 5570.5, 60 sec: 5633.2, 300 sec: 5621.6). Total num frames: 617340928. Throughput: 0: 5917.4. Samples: 617342934. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:16:56,354][25689] Avg episode reward: [(0, '-22.792')] [2022-07-10 06:16:57,696][26022] Updated weights on worker 0-0, policy_version 602880 (0.00092) [2022-07-10 06:16:59,379][26022] Updated weights on worker 0-0, policy_version 602890 (0.00083) [2022-07-10 06:17:01,444][25689] Fps is (10 sec: 5338.2, 60 sec: 5580.6, 300 sec: 5616.8). Total num frames: 617367552. Throughput: 0: 5948.8. Samples: 617377264. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:01,444][25689] Avg episode reward: [(0, '-23.852')] [2022-07-10 06:17:01,707][26022] Updated weights on worker 0-0, policy_version 602900 (0.00089) [2022-07-10 06:17:03,216][26022] Updated weights on worker 0-0, policy_version 602910 (0.00103) [2022-07-10 06:17:05,156][26022] Updated weights on worker 0-0, policy_version 602920 (0.00085) [2022-07-10 06:17:06,452][25689] Fps is (10 sec: 5576.9, 60 sec: 5654.1, 300 sec: 5627.0). Total num frames: 617397248. Throughput: 0: 5007.0. Samples: 617392274. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:06,453][25689] Avg episode reward: [(0, '-23.235')] [2022-07-10 06:17:07,029][26022] Updated weights on worker 0-0, policy_version 602930 (0.00092) [2022-07-10 06:17:08,781][26022] Updated weights on worker 0-0, policy_version 602940 (0.00079) [2022-07-10 06:17:10,704][26022] Updated weights on worker 0-0, policy_version 602950 (0.00087) [2022-07-10 06:17:11,533][25689] Fps is (10 sec: 5785.3, 60 sec: 5667.4, 300 sec: 5629.6). Total num frames: 617425920. Throughput: 0: 5804.9. Samples: 617426060. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:11,533][25689] Avg episode reward: [(0, '-22.598')] [2022-07-10 06:17:12,559][26022] Updated weights on worker 0-0, policy_version 602960 (0.00091) [2022-07-10 06:17:14,167][26022] Updated weights on worker 0-0, policy_version 602970 (0.00084) [2022-07-10 06:17:16,226][26022] Updated weights on worker 0-0, policy_version 602980 (0.00086) [2022-07-10 06:17:16,544][25689] Fps is (10 sec: 5580.6, 60 sec: 5617.6, 300 sec: 5623.5). Total num frames: 617453568. Throughput: 0: 5781.5. Samples: 617459668. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:16,545][25689] Avg episode reward: [(0, '-22.828')] [2022-07-10 06:17:17,898][26022] Updated weights on worker 0-0, policy_version 602990 (0.00113) [2022-07-10 06:17:19,792][26022] Updated weights on worker 0-0, policy_version 603000 (0.00086) [2022-07-10 06:17:21,574][26022] Updated weights on worker 0-0, policy_version 603010 (0.00106) [2022-07-10 06:17:21,665][25689] Fps is (10 sec: 5558.8, 60 sec: 5651.5, 300 sec: 5621.9). Total num frames: 617482240. Throughput: 0: 4914.4. Samples: 617476638. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:21,665][25689] Avg episode reward: [(0, '-24.037')] [2022-07-10 06:17:23,211][26022] Updated weights on worker 0-0, policy_version 603020 (0.00098) [2022-07-10 06:17:25,257][26022] Updated weights on worker 0-0, policy_version 603030 (0.00089) [2022-07-10 06:17:26,721][25689] Fps is (10 sec: 5634.8, 60 sec: 5654.5, 300 sec: 5628.4). Total num frames: 617510912. Throughput: 0: 5852.3. Samples: 617510894. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:26,725][25689] Avg episode reward: [(0, '-23.476')] [2022-07-10 06:17:26,871][26022] Updated weights on worker 0-0, policy_version 603040 (0.00085) [2022-07-10 06:17:28,945][26022] Updated weights on worker 0-0, policy_version 603050 (0.00089) [2022-07-10 06:17:30,643][26022] Updated weights on worker 0-0, policy_version 603060 (0.00094) [2022-07-10 06:17:31,747][25689] Fps is (10 sec: 5687.7, 60 sec: 5618.9, 300 sec: 5625.1). Total num frames: 617539584. Throughput: 0: 5853.7. Samples: 617544386. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:31,747][25689] Avg episode reward: [(0, '-23.904')] [2022-07-10 06:17:32,577][26022] Updated weights on worker 0-0, policy_version 603070 (0.00091) [2022-07-10 06:17:34,373][26022] Updated weights on worker 0-0, policy_version 603080 (0.00084) [2022-07-10 06:17:35,986][26022] Updated weights on worker 0-0, policy_version 603090 (0.00088) [2022-07-10 06:17:36,784][25689] Fps is (10 sec: 5698.8, 60 sec: 5649.9, 300 sec: 5629.5). Total num frames: 617568256. Throughput: 0: 5030.9. Samples: 617561490. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:36,785][25689] Avg episode reward: [(0, '-24.420')] [2022-07-10 06:17:37,878][26022] Updated weights on worker 0-0, policy_version 603100 (0.00092) [2022-07-10 06:17:39,912][26022] Updated weights on worker 0-0, policy_version 603110 (0.00090) [2022-07-10 06:17:41,614][26022] Updated weights on worker 0-0, policy_version 603120 (0.00091) [2022-07-10 06:17:41,842][25689] Fps is (10 sec: 5578.9, 60 sec: 5640.6, 300 sec: 5621.8). Total num frames: 617595904. Throughput: 0: 5882.0. Samples: 617595320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 06:17:41,842][25689] Avg episode reward: [(0, '-24.834')] [2022-07-10 06:17:43,418][26022] Updated weights on worker 0-0, policy_version 603130 (0.00086) [2022-07-10 06:17:45,289][26022] Updated weights on worker 0-0, policy_version 603140 (0.00084) [2022-07-10 06:17:46,888][25689] Fps is (10 sec: 5472.6, 60 sec: 5589.0, 300 sec: 5621.5). Total num frames: 617623552. Throughput: 0: 5842.7. Samples: 617628722. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:17:46,888][25689] Avg episode reward: [(0, '-24.906')] [2022-07-10 06:17:47,062][26022] Updated weights on worker 0-0, policy_version 603150 (0.00092) [2022-07-10 06:17:48,951][26022] Updated weights on worker 0-0, policy_version 603160 (0.00087) [2022-07-10 06:17:50,734][26022] Updated weights on worker 0-0, policy_version 603170 (0.00091) [2022-07-10 06:17:51,943][25689] Fps is (10 sec: 5575.4, 60 sec: 5587.0, 300 sec: 5620.6). Total num frames: 617652224. Throughput: 0: 5015.2. Samples: 617645676. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:17:51,944][25689] Avg episode reward: [(0, '-24.447')] [2022-07-10 06:17:52,527][26022] Updated weights on worker 0-0, policy_version 603180 (0.00089) [2022-07-10 06:17:54,387][26022] Updated weights on worker 0-0, policy_version 603190 (0.00082) [2022-07-10 06:17:56,144][26022] Updated weights on worker 0-0, policy_version 603200 (0.00081) [2022-07-10 06:17:56,947][25689] Fps is (10 sec: 5802.4, 60 sec: 5627.5, 300 sec: 5625.9). Total num frames: 617681920. Throughput: 0: 5865.9. Samples: 617679768. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:17:56,948][25689] Avg episode reward: [(0, '-24.984')] [2022-07-10 06:17:58,028][26022] Updated weights on worker 0-0, policy_version 603210 (0.00057) [2022-07-10 06:17:59,723][26022] Updated weights on worker 0-0, policy_version 603220 (0.00087) [2022-07-10 06:18:01,844][26022] Updated weights on worker 0-0, policy_version 603230 (0.00092) [2022-07-10 06:18:02,017][25689] Fps is (10 sec: 5489.3, 60 sec: 5612.5, 300 sec: 5624.8). Total num frames: 617707520. Throughput: 0: 5880.8. Samples: 617713964. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:02,017][25689] Avg episode reward: [(0, '-24.022')] [2022-07-10 06:18:03,611][26022] Updated weights on worker 0-0, policy_version 603240 (0.00092) [2022-07-10 06:18:05,380][26022] Updated weights on worker 0-0, policy_version 603250 (0.00087) [2022-07-10 06:18:07,117][25689] Fps is (10 sec: 5235.6, 60 sec: 5570.2, 300 sec: 5623.3). Total num frames: 617735168. Throughput: 0: 5806.3. Samples: 617746182. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:07,118][25689] Avg episode reward: [(0, '-25.039')] [2022-07-10 06:18:07,352][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:18:07,365][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000603260_617738240.pth [2022-07-10 06:18:07,365][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000601282_615712768.pth [2022-07-10 06:18:07,371][26022] Updated weights on worker 0-0, policy_version 603260 (0.00097) [2022-07-10 06:18:08,840][26022] Updated weights on worker 0-0, policy_version 603270 (0.00081) [2022-07-10 06:18:11,025][26022] Updated weights on worker 0-0, policy_version 603280 (0.00090) [2022-07-10 06:18:12,163][25689] Fps is (10 sec: 5853.8, 60 sec: 5624.2, 300 sec: 5626.8). Total num frames: 617766912. Throughput: 0: 5808.9. Samples: 617763128. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:12,164][25689] Avg episode reward: [(0, '-25.055')] [2022-07-10 06:18:12,538][26022] Updated weights on worker 0-0, policy_version 603290 (0.00093) [2022-07-10 06:18:14,607][26022] Updated weights on worker 0-0, policy_version 603300 (0.00084) [2022-07-10 06:18:16,362][26022] Updated weights on worker 0-0, policy_version 603310 (0.00104) [2022-07-10 06:18:17,185][25689] Fps is (10 sec: 5797.9, 60 sec: 5606.3, 300 sec: 5623.7). Total num frames: 617793536. Throughput: 0: 5802.9. Samples: 617797204. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:17,185][25689] Avg episode reward: [(0, '-25.252')] [2022-07-10 06:18:18,163][26022] Updated weights on worker 0-0, policy_version 603320 (0.00104) [2022-07-10 06:18:20,026][26022] Updated weights on worker 0-0, policy_version 603330 (0.00876) [2022-07-10 06:18:21,763][26022] Updated weights on worker 0-0, policy_version 603340 (0.00090) [2022-07-10 06:18:22,251][25689] Fps is (10 sec: 5582.6, 60 sec: 5628.2, 300 sec: 5629.4). Total num frames: 617823232. Throughput: 0: 5782.1. Samples: 617830962. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:22,253][25689] Avg episode reward: [(0, '-26.070')] [2022-07-10 06:18:23,571][26022] Updated weights on worker 0-0, policy_version 603350 (0.00085) [2022-07-10 06:18:25,412][26022] Updated weights on worker 0-0, policy_version 603360 (0.00086) [2022-07-10 06:18:27,244][26022] Updated weights on worker 0-0, policy_version 603370 (0.00093) [2022-07-10 06:18:27,265][25689] Fps is (10 sec: 5688.7, 60 sec: 5615.2, 300 sec: 5625.9). Total num frames: 617850880. Throughput: 0: 5053.9. Samples: 617848006. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:27,265][25689] Avg episode reward: [(0, '-26.151')] [2022-07-10 06:18:29,047][26022] Updated weights on worker 0-0, policy_version 603380 (0.00088) [2022-07-10 06:18:30,800][26022] Updated weights on worker 0-0, policy_version 603390 (0.00091) [2022-07-10 06:18:32,275][25689] Fps is (10 sec: 5516.6, 60 sec: 5599.8, 300 sec: 5626.1). Total num frames: 617878528. Throughput: 0: 5902.6. Samples: 617881842. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:32,276][25689] Avg episode reward: [(0, '-26.426')] [2022-07-10 06:18:32,734][26022] Updated weights on worker 0-0, policy_version 603400 (0.00087) [2022-07-10 06:18:34,548][26022] Updated weights on worker 0-0, policy_version 603410 (0.00112) [2022-07-10 06:18:36,284][26022] Updated weights on worker 0-0, policy_version 603420 (0.00088) [2022-07-10 06:18:37,279][25689] Fps is (10 sec: 5726.4, 60 sec: 5619.7, 300 sec: 5631.6). Total num frames: 617908224. Throughput: 0: 5904.0. Samples: 617915840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:37,280][25689] Avg episode reward: [(0, '-25.999')] [2022-07-10 06:18:38,012][26022] Updated weights on worker 0-0, policy_version 603430 (0.00089) [2022-07-10 06:18:40,093][26022] Updated weights on worker 0-0, policy_version 603440 (0.00411) [2022-07-10 06:18:41,747][26022] Updated weights on worker 0-0, policy_version 603450 (0.00086) [2022-07-10 06:18:42,382][25689] Fps is (10 sec: 5673.7, 60 sec: 5615.6, 300 sec: 5626.3). Total num frames: 617935872. Throughput: 0: 5057.5. Samples: 617932772. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:42,383][25689] Avg episode reward: [(0, '-24.994')] [2022-07-10 06:18:43,674][26022] Updated weights on worker 0-0, policy_version 603460 (0.00093) [2022-07-10 06:18:45,543][26022] Updated weights on worker 0-0, policy_version 603470 (0.00093) [2022-07-10 06:18:47,135][26022] Updated weights on worker 0-0, policy_version 603480 (0.00094) [2022-07-10 06:18:47,423][25689] Fps is (10 sec: 5653.2, 60 sec: 5649.9, 300 sec: 5630.2). Total num frames: 617965568. Throughput: 0: 5890.8. Samples: 617966752. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:47,423][25689] Avg episode reward: [(0, '-24.816')] [2022-07-10 06:18:48,912][26022] Updated weights on worker 0-0, policy_version 603490 (0.00089) [2022-07-10 06:18:50,792][26022] Updated weights on worker 0-0, policy_version 603500 (0.00089) [2022-07-10 06:18:52,474][25689] Fps is (10 sec: 5682.0, 60 sec: 5633.4, 300 sec: 5633.0). Total num frames: 617993216. Throughput: 0: 5870.0. Samples: 618000412. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:52,474][25689] Avg episode reward: [(0, '-23.849')] [2022-07-10 06:18:52,682][26022] Updated weights on worker 0-0, policy_version 603510 (0.00089) [2022-07-10 06:18:54,552][26022] Updated weights on worker 0-0, policy_version 603520 (0.00092) [2022-07-10 06:18:56,335][26022] Updated weights on worker 0-0, policy_version 603530 (0.00092) [2022-07-10 06:18:57,528][25689] Fps is (10 sec: 5573.4, 60 sec: 5611.8, 300 sec: 5630.4). Total num frames: 618021888. Throughput: 0: 5023.2. Samples: 618017554. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:18:57,529][25689] Avg episode reward: [(0, '-23.607')] [2022-07-10 06:18:58,051][26022] Updated weights on worker 0-0, policy_version 603540 (0.00090) [2022-07-10 06:18:59,992][26022] Updated weights on worker 0-0, policy_version 603550 (0.00084) [2022-07-10 06:19:01,830][26022] Updated weights on worker 0-0, policy_version 603560 (0.00084) [2022-07-10 06:19:02,578][25689] Fps is (10 sec: 5371.2, 60 sec: 5613.6, 300 sec: 5626.4). Total num frames: 618047488. Throughput: 0: 5864.9. Samples: 618051222. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:02,579][25689] Avg episode reward: [(0, '-23.315')] [2022-07-10 06:19:03,864][26022] Updated weights on worker 0-0, policy_version 603570 (0.00090) [2022-07-10 06:19:05,664][26022] Updated weights on worker 0-0, policy_version 603580 (0.00084) [2022-07-10 06:19:07,558][26022] Updated weights on worker 0-0, policy_version 603590 (0.00083) [2022-07-10 06:19:07,594][25689] Fps is (10 sec: 5391.3, 60 sec: 5638.4, 300 sec: 5633.5). Total num frames: 618076160. Throughput: 0: 5786.9. Samples: 618083482. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:07,595][25689] Avg episode reward: [(0, '-24.006')] [2022-07-10 06:19:09,214][26022] Updated weights on worker 0-0, policy_version 603600 (0.00081) [2022-07-10 06:19:11,098][26022] Updated weights on worker 0-0, policy_version 603610 (0.00053) [2022-07-10 06:19:12,606][25689] Fps is (10 sec: 5718.2, 60 sec: 5590.7, 300 sec: 5630.1). Total num frames: 618104832. Throughput: 0: 4977.8. Samples: 618100628. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:12,607][25689] Avg episode reward: [(0, '-26.014')] [2022-07-10 06:19:12,821][26022] Updated weights on worker 0-0, policy_version 603620 (0.00085) [2022-07-10 06:19:14,920][26022] Updated weights on worker 0-0, policy_version 603630 (0.00090) [2022-07-10 06:19:16,444][26022] Updated weights on worker 0-0, policy_version 603640 (0.00089) [2022-07-10 06:19:17,632][25689] Fps is (10 sec: 5610.6, 60 sec: 5607.2, 300 sec: 5625.1). Total num frames: 618132480. Throughput: 0: 5808.8. Samples: 618134338. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:17,633][25689] Avg episode reward: [(0, '-25.975')] [2022-07-10 06:19:18,320][26022] Updated weights on worker 0-0, policy_version 603650 (0.00078) [2022-07-10 06:19:20,093][26022] Updated weights on worker 0-0, policy_version 603660 (0.00081) [2022-07-10 06:19:21,838][26022] Updated weights on worker 0-0, policy_version 603670 (0.00089) [2022-07-10 06:19:22,727][25689] Fps is (10 sec: 5666.3, 60 sec: 5604.7, 300 sec: 5627.4). Total num frames: 618162176. Throughput: 0: 5826.0. Samples: 618168608. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:22,727][25689] Avg episode reward: [(0, '-25.099')] [2022-07-10 06:19:23,637][26022] Updated weights on worker 0-0, policy_version 603680 (0.00091) [2022-07-10 06:19:25,404][26022] Updated weights on worker 0-0, policy_version 603690 (0.00089) [2022-07-10 06:19:27,376][26022] Updated weights on worker 0-0, policy_version 603700 (0.00082) [2022-07-10 06:19:27,732][25689] Fps is (10 sec: 5779.1, 60 sec: 5622.3, 300 sec: 5631.7). Total num frames: 618190848. Throughput: 0: 5084.4. Samples: 618185872. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:27,733][25689] Avg episode reward: [(0, '-24.730')] [2022-07-10 06:19:28,923][26022] Updated weights on worker 0-0, policy_version 603710 (0.00087) [2022-07-10 06:19:31,054][26022] Updated weights on worker 0-0, policy_version 603720 (0.00084) [2022-07-10 06:19:32,755][25689] Fps is (10 sec: 5616.3, 60 sec: 5621.2, 300 sec: 5628.0). Total num frames: 618218496. Throughput: 0: 5903.9. Samples: 618219580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:32,755][25689] Avg episode reward: [(0, '-24.558')] [2022-07-10 06:19:32,774][26022] Updated weights on worker 0-0, policy_version 603730 (0.00092) [2022-07-10 06:19:34,596][26022] Updated weights on worker 0-0, policy_version 603740 (0.00089) [2022-07-10 06:19:36,453][26022] Updated weights on worker 0-0, policy_version 603750 (0.00086) [2022-07-10 06:19:37,779][25689] Fps is (10 sec: 5605.5, 60 sec: 5602.3, 300 sec: 5629.0). Total num frames: 618247168. Throughput: 0: 5916.9. Samples: 618253546. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:37,780][25689] Avg episode reward: [(0, '-24.420')] [2022-07-10 06:19:38,245][26022] Updated weights on worker 0-0, policy_version 603760 (0.00094) [2022-07-10 06:19:39,945][26022] Updated weights on worker 0-0, policy_version 603770 (0.00095) [2022-07-10 06:19:41,846][26022] Updated weights on worker 0-0, policy_version 603780 (0.00093) [2022-07-10 06:19:42,833][25689] Fps is (10 sec: 5689.8, 60 sec: 5623.8, 300 sec: 5632.1). Total num frames: 618275840. Throughput: 0: 5069.4. Samples: 618270534. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:42,834][25689] Avg episode reward: [(0, '-23.350')] [2022-07-10 06:19:43,324][26022] Updated weights on worker 0-0, policy_version 603790 (0.00089) [2022-07-10 06:19:45,538][26022] Updated weights on worker 0-0, policy_version 603800 (0.00084) [2022-07-10 06:19:47,356][26022] Updated weights on worker 0-0, policy_version 603810 (0.00095) [2022-07-10 06:19:47,847][25689] Fps is (10 sec: 5594.4, 60 sec: 5592.5, 300 sec: 5628.9). Total num frames: 618303488. Throughput: 0: 5902.9. Samples: 618304606. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:47,848][25689] Avg episode reward: [(0, '-24.081')] [2022-07-10 06:19:49,070][26022] Updated weights on worker 0-0, policy_version 603820 (0.00089) [2022-07-10 06:19:51,008][26022] Updated weights on worker 0-0, policy_version 603830 (0.00087) [2022-07-10 06:19:52,396][26022] Updated weights on worker 0-0, policy_version 603840 (0.00091) [2022-07-10 06:19:52,866][25689] Fps is (10 sec: 5817.3, 60 sec: 5646.3, 300 sec: 5632.8). Total num frames: 618334208. Throughput: 0: 5940.3. Samples: 618339050. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:52,868][25689] Avg episode reward: [(0, '-24.307')] [2022-07-10 06:19:54,568][26022] Updated weights on worker 0-0, policy_version 603850 (0.00094) [2022-07-10 06:19:56,183][26022] Updated weights on worker 0-0, policy_version 603860 (0.00077) [2022-07-10 06:19:57,870][25689] Fps is (10 sec: 5720.8, 60 sec: 5617.0, 300 sec: 5627.2). Total num frames: 618360832. Throughput: 0: 5105.0. Samples: 618356110. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:19:57,872][25689] Avg episode reward: [(0, '-24.980')] [2022-07-10 06:19:57,989][26022] Updated weights on worker 0-0, policy_version 603870 (0.00089) [2022-07-10 06:19:59,800][26022] Updated weights on worker 0-0, policy_version 603880 (0.00091) [2022-07-10 06:20:01,524][26022] Updated weights on worker 0-0, policy_version 603890 (0.00089) [2022-07-10 06:20:03,015][25689] Fps is (10 sec: 5246.8, 60 sec: 5625.2, 300 sec: 5628.2). Total num frames: 618387456. Throughput: 0: 5937.1. Samples: 618390358. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:03,016][25689] Avg episode reward: [(0, '-26.408')] [2022-07-10 06:20:03,755][26022] Updated weights on worker 0-0, policy_version 603900 (0.00094) [2022-07-10 06:20:05,710][26022] Updated weights on worker 0-0, policy_version 603910 (0.00094) [2022-07-10 06:20:07,182][26022] Updated weights on worker 0-0, policy_version 603920 (0.00089) [2022-07-10 06:20:07,374][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:20:07,392][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000603921_618415104.pth [2022-07-10 06:20:07,393][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000601939_616385536.pth [2022-07-10 06:20:08,032][25689] Fps is (10 sec: 5542.2, 60 sec: 5642.0, 300 sec: 5631.6). Total num frames: 618417152. Throughput: 0: 5829.3. Samples: 618422274. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:08,033][25689] Avg episode reward: [(0, '-25.247')] [2022-07-10 06:20:09,414][26022] Updated weights on worker 0-0, policy_version 603930 (0.00089) [2022-07-10 06:20:11,016][26022] Updated weights on worker 0-0, policy_version 603940 (0.00070) [2022-07-10 06:20:12,983][26022] Updated weights on worker 0-0, policy_version 603950 (0.00090) [2022-07-10 06:20:13,052][25689] Fps is (10 sec: 5713.1, 60 sec: 5624.3, 300 sec: 5628.4). Total num frames: 618444800. Throughput: 0: 4952.4. Samples: 618439018. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:13,053][25689] Avg episode reward: [(0, '-24.560')] [2022-07-10 06:20:14,670][26022] Updated weights on worker 0-0, policy_version 603960 (0.00094) [2022-07-10 06:20:16,517][26022] Updated weights on worker 0-0, policy_version 603970 (0.00086) [2022-07-10 06:20:18,078][25689] Fps is (10 sec: 5606.7, 60 sec: 5641.3, 300 sec: 5629.4). Total num frames: 618473472. Throughput: 0: 5777.5. Samples: 618472856. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:18,078][25689] Avg episode reward: [(0, '-24.972')] [2022-07-10 06:20:18,311][26022] Updated weights on worker 0-0, policy_version 603980 (0.00087) [2022-07-10 06:20:20,177][26022] Updated weights on worker 0-0, policy_version 603990 (0.00090) [2022-07-10 06:20:21,871][26022] Updated weights on worker 0-0, policy_version 604000 (0.00094) [2022-07-10 06:20:23,212][25689] Fps is (10 sec: 5644.0, 60 sec: 5620.6, 300 sec: 5627.2). Total num frames: 618502144. Throughput: 0: 5754.9. Samples: 618506592. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:23,213][25689] Avg episode reward: [(0, '-25.237')] [2022-07-10 06:20:23,744][26022] Updated weights on worker 0-0, policy_version 604010 (0.00085) [2022-07-10 06:20:25,630][26022] Updated weights on worker 0-0, policy_version 604020 (0.00101) [2022-07-10 06:20:27,321][26022] Updated weights on worker 0-0, policy_version 604030 (0.00086) [2022-07-10 06:20:28,253][25689] Fps is (10 sec: 5635.8, 60 sec: 5617.4, 300 sec: 5623.2). Total num frames: 618530816. Throughput: 0: 5853.3. Samples: 618540628. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:28,253][25689] Avg episode reward: [(0, '-25.300')] [2022-07-10 06:20:29,201][26022] Updated weights on worker 0-0, policy_version 604040 (0.00094) [2022-07-10 06:20:30,918][26022] Updated weights on worker 0-0, policy_version 604050 (0.00088) [2022-07-10 06:20:32,977][26022] Updated weights on worker 0-0, policy_version 604060 (0.00089) [2022-07-10 06:20:33,256][25689] Fps is (10 sec: 5607.7, 60 sec: 5619.2, 300 sec: 5620.2). Total num frames: 618558464. Throughput: 0: 5859.6. Samples: 618557402. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:33,256][25689] Avg episode reward: [(0, '-24.845')] [2022-07-10 06:20:34,542][26022] Updated weights on worker 0-0, policy_version 604070 (0.00402) [2022-07-10 06:20:36,471][26022] Updated weights on worker 0-0, policy_version 604080 (0.00091) [2022-07-10 06:20:38,307][25689] Fps is (10 sec: 5601.3, 60 sec: 5616.7, 300 sec: 5620.2). Total num frames: 618587136. Throughput: 0: 5861.7. Samples: 618591436. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:38,308][25689] Avg episode reward: [(0, '-25.471')] [2022-07-10 06:20:38,398][26022] Updated weights on worker 0-0, policy_version 604090 (0.00091) [2022-07-10 06:20:40,264][26022] Updated weights on worker 0-0, policy_version 604100 (0.00084) [2022-07-10 06:20:41,717][26022] Updated weights on worker 0-0, policy_version 604110 (0.00085) [2022-07-10 06:20:43,430][25689] Fps is (10 sec: 5636.3, 60 sec: 5610.3, 300 sec: 5621.8). Total num frames: 618615808. Throughput: 0: 5881.6. Samples: 618625502. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:43,430][25689] Avg episode reward: [(0, '-24.665')] [2022-07-10 06:20:44,032][26022] Updated weights on worker 0-0, policy_version 604120 (0.00093) [2022-07-10 06:20:45,344][26022] Updated weights on worker 0-0, policy_version 604130 (0.00090) [2022-07-10 06:20:47,566][26022] Updated weights on worker 0-0, policy_version 604140 (0.00085) [2022-07-10 06:20:48,467][25689] Fps is (10 sec: 5745.2, 60 sec: 5642.0, 300 sec: 5624.8). Total num frames: 618645504. Throughput: 0: 5043.4. Samples: 618642576. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2022-07-10 06:20:48,468][25689] Avg episode reward: [(0, '-24.002')] [2022-07-10 06:20:48,878][26022] Updated weights on worker 0-0, policy_version 604150 (0.00083) [2022-07-10 06:20:50,957][26022] Updated weights on worker 0-0, policy_version 604160 (0.00084) [2022-07-10 06:20:52,654][26022] Updated weights on worker 0-0, policy_version 604170 (0.00086) [2022-07-10 06:20:53,565][25689] Fps is (10 sec: 5658.1, 60 sec: 5584.1, 300 sec: 5619.7). Total num frames: 618673152. Throughput: 0: 5867.6. Samples: 618676566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:20:53,565][25689] Avg episode reward: [(0, '-23.986')] [2022-07-10 06:20:54,615][26022] Updated weights on worker 0-0, policy_version 604180 (0.00092) [2022-07-10 06:20:56,474][26022] Updated weights on worker 0-0, policy_version 604190 (0.00083) [2022-07-10 06:20:58,176][26022] Updated weights on worker 0-0, policy_version 604200 (0.00089) [2022-07-10 06:20:58,592][25689] Fps is (10 sec: 5663.3, 60 sec: 5632.5, 300 sec: 5620.4). Total num frames: 618702848. Throughput: 0: 5877.0. Samples: 618710652. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:20:58,593][25689] Avg episode reward: [(0, '-22.607')] [2022-07-10 06:20:59,990][26022] Updated weights on worker 0-0, policy_version 604210 (0.00612) [2022-07-10 06:21:02,278][26022] Updated weights on worker 0-0, policy_version 604220 (0.00091) [2022-07-10 06:21:03,688][25689] Fps is (10 sec: 5563.3, 60 sec: 5637.0, 300 sec: 5623.4). Total num frames: 618729472. Throughput: 0: 5041.1. Samples: 618727626. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:03,689][25689] Avg episode reward: [(0, '-23.313')] [2022-07-10 06:21:03,839][26022] Updated weights on worker 0-0, policy_version 604230 (0.00089) [2022-07-10 06:21:05,949][26022] Updated weights on worker 0-0, policy_version 604240 (0.00085) [2022-07-10 06:21:07,621][26022] Updated weights on worker 0-0, policy_version 604250 (0.00091) [2022-07-10 06:21:08,773][25689] Fps is (10 sec: 5431.3, 60 sec: 5613.9, 300 sec: 5626.1). Total num frames: 618758144. Throughput: 0: 5763.3. Samples: 618759610. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:08,774][25689] Avg episode reward: [(0, '-23.696')] [2022-07-10 06:21:09,499][26022] Updated weights on worker 0-0, policy_version 604260 (0.00096) [2022-07-10 06:21:11,078][26022] Updated weights on worker 0-0, policy_version 604270 (0.00085) [2022-07-10 06:21:13,046][26022] Updated weights on worker 0-0, policy_version 604280 (0.00898) [2022-07-10 06:21:13,860][25689] Fps is (10 sec: 5536.7, 60 sec: 5607.7, 300 sec: 5614.5). Total num frames: 618785792. Throughput: 0: 5776.6. Samples: 618793806. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:13,861][25689] Avg episode reward: [(0, '-25.072')] [2022-07-10 06:21:14,757][26022] Updated weights on worker 0-0, policy_version 604290 (0.00089) [2022-07-10 06:21:16,757][26022] Updated weights on worker 0-0, policy_version 604300 (0.00085) [2022-07-10 06:21:18,428][26022] Updated weights on worker 0-0, policy_version 604310 (0.00092) [2022-07-10 06:21:18,895][25689] Fps is (10 sec: 5766.6, 60 sec: 5640.5, 300 sec: 5629.9). Total num frames: 618816512. Throughput: 0: 4936.4. Samples: 618810874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:18,897][25689] Avg episode reward: [(0, '-25.456')] [2022-07-10 06:21:20,423][26022] Updated weights on worker 0-0, policy_version 604320 (0.00087) [2022-07-10 06:21:21,938][26022] Updated weights on worker 0-0, policy_version 604330 (0.00089) [2022-07-10 06:21:23,892][26022] Updated weights on worker 0-0, policy_version 604340 (0.00051) [2022-07-10 06:21:23,995][25689] Fps is (10 sec: 5759.3, 60 sec: 5626.9, 300 sec: 5626.3). Total num frames: 618844160. Throughput: 0: 5765.6. Samples: 618844706. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:23,996][25689] Avg episode reward: [(0, '-26.199')] [2022-07-10 06:21:25,585][26022] Updated weights on worker 0-0, policy_version 604350 (0.00105) [2022-07-10 06:21:27,519][26022] Updated weights on worker 0-0, policy_version 604360 (0.00085) [2022-07-10 06:21:29,009][25689] Fps is (10 sec: 5568.4, 60 sec: 5629.3, 300 sec: 5619.2). Total num frames: 618872832. Throughput: 0: 5897.7. Samples: 618878956. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:29,010][25689] Avg episode reward: [(0, '-25.973')] [2022-07-10 06:21:29,261][26022] Updated weights on worker 0-0, policy_version 604370 (0.00092) [2022-07-10 06:21:31,130][26022] Updated weights on worker 0-0, policy_version 604380 (0.00089) [2022-07-10 06:21:32,761][26022] Updated weights on worker 0-0, policy_version 604390 (0.00085) [2022-07-10 06:21:34,011][25689] Fps is (10 sec: 5622.8, 60 sec: 5629.4, 300 sec: 5622.8). Total num frames: 618900480. Throughput: 0: 5070.0. Samples: 618895972. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:34,012][25689] Avg episode reward: [(0, '-25.366')] [2022-07-10 06:21:34,653][26022] Updated weights on worker 0-0, policy_version 604400 (0.00094) [2022-07-10 06:21:36,512][26022] Updated weights on worker 0-0, policy_version 604410 (0.00090) [2022-07-10 06:21:38,203][26022] Updated weights on worker 0-0, policy_version 604420 (0.00097) [2022-07-10 06:21:39,036][25689] Fps is (10 sec: 5617.2, 60 sec: 5631.9, 300 sec: 5624.9). Total num frames: 618929152. Throughput: 0: 5917.2. Samples: 618930050. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:39,036][25689] Avg episode reward: [(0, '-24.737')] [2022-07-10 06:21:40,138][26022] Updated weights on worker 0-0, policy_version 604430 (0.00094) [2022-07-10 06:21:42,012][26022] Updated weights on worker 0-0, policy_version 604440 (0.00091) [2022-07-10 06:21:43,737][26022] Updated weights on worker 0-0, policy_version 604450 (0.00092) [2022-07-10 06:21:44,169][25689] Fps is (10 sec: 5746.2, 60 sec: 5647.8, 300 sec: 5619.7). Total num frames: 618958848. Throughput: 0: 5908.8. Samples: 618963910. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:44,169][25689] Avg episode reward: [(0, '-24.244')] [2022-07-10 06:21:45,619][26022] Updated weights on worker 0-0, policy_version 604460 (0.00083) [2022-07-10 06:21:47,235][26022] Updated weights on worker 0-0, policy_version 604470 (0.00088) [2022-07-10 06:21:49,081][26022] Updated weights on worker 0-0, policy_version 604480 (0.00087) [2022-07-10 06:21:49,197][25689] Fps is (10 sec: 5744.2, 60 sec: 5631.7, 300 sec: 5619.8). Total num frames: 618987520. Throughput: 0: 5052.0. Samples: 618980944. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:49,197][25689] Avg episode reward: [(0, '-24.975')] [2022-07-10 06:21:51,164][26022] Updated weights on worker 0-0, policy_version 604490 (0.00088) [2022-07-10 06:21:52,584][26022] Updated weights on worker 0-0, policy_version 604500 (0.00083) [2022-07-10 06:21:54,234][25689] Fps is (10 sec: 5493.9, 60 sec: 5620.5, 300 sec: 5617.1). Total num frames: 619014144. Throughput: 0: 5883.8. Samples: 619014958. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:54,234][25689] Avg episode reward: [(0, '-23.656')] [2022-07-10 06:21:54,751][26022] Updated weights on worker 0-0, policy_version 604510 (0.00093) [2022-07-10 06:21:56,311][26022] Updated weights on worker 0-0, policy_version 604520 (0.00097) [2022-07-10 06:21:58,248][26022] Updated weights on worker 0-0, policy_version 604530 (0.00093) [2022-07-10 06:21:59,278][25689] Fps is (10 sec: 5688.1, 60 sec: 5635.8, 300 sec: 5631.7). Total num frames: 619044864. Throughput: 0: 5882.3. Samples: 619049124. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:21:59,279][25689] Avg episode reward: [(0, '-23.705')] [2022-07-10 06:22:00,079][26022] Updated weights on worker 0-0, policy_version 604540 (0.00095) [2022-07-10 06:22:02,180][26022] Updated weights on worker 0-0, policy_version 604550 (0.00086) [2022-07-10 06:22:04,011][26022] Updated weights on worker 0-0, policy_version 604560 (0.00081) [2022-07-10 06:22:04,347][25689] Fps is (10 sec: 5569.0, 60 sec: 5621.5, 300 sec: 5616.8). Total num frames: 619070464. Throughput: 0: 5059.2. Samples: 619065996. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:04,347][25689] Avg episode reward: [(0, '-24.151')] [2022-07-10 06:22:05,751][26022] Updated weights on worker 0-0, policy_version 604570 (0.00093) [2022-07-10 06:22:07,548][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:22:07,560][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000604580_619089920.pth [2022-07-10 06:22:07,561][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000602600_617062400.pth [2022-07-10 06:22:07,563][26022] Updated weights on worker 0-0, policy_version 604580 (0.00087) [2022-07-10 06:22:09,424][25689] Fps is (10 sec: 5349.2, 60 sec: 5622.2, 300 sec: 5616.9). Total num frames: 619099136. Throughput: 0: 5809.8. Samples: 619098460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:09,425][25689] Avg episode reward: [(0, '-24.349')] [2022-07-10 06:22:09,433][26022] Updated weights on worker 0-0, policy_version 604590 (0.00089) [2022-07-10 06:22:10,895][26022] Updated weights on worker 0-0, policy_version 604600 (0.00085) [2022-07-10 06:22:12,923][26022] Updated weights on worker 0-0, policy_version 604610 (0.00084) [2022-07-10 06:22:14,508][25689] Fps is (10 sec: 5845.2, 60 sec: 5673.2, 300 sec: 5625.8). Total num frames: 619129856. Throughput: 0: 5821.8. Samples: 619132988. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:14,508][25689] Avg episode reward: [(0, '-25.297')] [2022-07-10 06:22:14,777][26022] Updated weights on worker 0-0, policy_version 604620 (0.00087) [2022-07-10 06:22:16,591][26022] Updated weights on worker 0-0, policy_version 604630 (0.00092) [2022-07-10 06:22:18,301][26022] Updated weights on worker 0-0, policy_version 604640 (0.00087) [2022-07-10 06:22:19,530][25689] Fps is (10 sec: 5876.9, 60 sec: 5640.5, 300 sec: 5627.7). Total num frames: 619158528. Throughput: 0: 5832.0. Samples: 619167234. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:19,531][25689] Avg episode reward: [(0, '-24.672')] [2022-07-10 06:22:20,407][26022] Updated weights on worker 0-0, policy_version 604650 (0.00086) [2022-07-10 06:22:21,752][26022] Updated weights on worker 0-0, policy_version 604660 (0.00091) [2022-07-10 06:22:24,079][26022] Updated weights on worker 0-0, policy_version 604670 (0.00087) [2022-07-10 06:22:24,672][25689] Fps is (10 sec: 5540.7, 60 sec: 5636.6, 300 sec: 5622.6). Total num frames: 619186176. Throughput: 0: 5803.1. Samples: 619183948. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:24,673][25689] Avg episode reward: [(0, '-24.737')] [2022-07-10 06:22:25,222][26022] Updated weights on worker 0-0, policy_version 604680 (0.00086) [2022-07-10 06:22:27,576][26022] Updated weights on worker 0-0, policy_version 604690 (0.00090) [2022-07-10 06:22:29,248][26022] Updated weights on worker 0-0, policy_version 604700 (0.00117) [2022-07-10 06:22:29,678][25689] Fps is (10 sec: 5449.0, 60 sec: 5620.5, 300 sec: 5619.5). Total num frames: 619213824. Throughput: 0: 5897.3. Samples: 619217904. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:29,679][25689] Avg episode reward: [(0, '-24.697')] [2022-07-10 06:22:31,131][26022] Updated weights on worker 0-0, policy_version 604710 (0.00089) [2022-07-10 06:22:32,937][26022] Updated weights on worker 0-0, policy_version 604720 (0.00093) [2022-07-10 06:22:34,682][25689] Fps is (10 sec: 5626.9, 60 sec: 5637.3, 300 sec: 5620.2). Total num frames: 619242496. Throughput: 0: 5887.3. Samples: 619251758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:34,682][25689] Avg episode reward: [(0, '-23.944')] [2022-07-10 06:22:34,782][26022] Updated weights on worker 0-0, policy_version 604730 (0.00092) [2022-07-10 06:22:36,375][26022] Updated weights on worker 0-0, policy_version 604740 (0.00086) [2022-07-10 06:22:38,518][26022] Updated weights on worker 0-0, policy_version 604750 (0.00087) [2022-07-10 06:22:39,694][25689] Fps is (10 sec: 5725.4, 60 sec: 5638.4, 300 sec: 5624.5). Total num frames: 619271168. Throughput: 0: 5031.6. Samples: 619268690. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:39,695][25689] Avg episode reward: [(0, '-23.512')] [2022-07-10 06:22:40,046][26022] Updated weights on worker 0-0, policy_version 604760 (0.00088) [2022-07-10 06:22:42,119][26022] Updated weights on worker 0-0, policy_version 604770 (0.00085) [2022-07-10 06:22:43,682][26022] Updated weights on worker 0-0, policy_version 604780 (0.00084) [2022-07-10 06:22:44,797][25689] Fps is (10 sec: 5669.0, 60 sec: 5624.3, 300 sec: 5626.8). Total num frames: 619299840. Throughput: 0: 5895.9. Samples: 619302600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:44,799][25689] Avg episode reward: [(0, '-24.239')] [2022-07-10 06:22:45,626][26022] Updated weights on worker 0-0, policy_version 604790 (0.00093) [2022-07-10 06:22:47,304][26022] Updated weights on worker 0-0, policy_version 604800 (0.00092) [2022-07-10 06:22:49,383][26022] Updated weights on worker 0-0, policy_version 604810 (0.00088) [2022-07-10 06:22:49,818][25689] Fps is (10 sec: 5563.1, 60 sec: 5608.1, 300 sec: 5624.1). Total num frames: 619327488. Throughput: 0: 5907.9. Samples: 619336886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:49,818][25689] Avg episode reward: [(0, '-23.360')] [2022-07-10 06:22:50,943][26022] Updated weights on worker 0-0, policy_version 604820 (0.00086) [2022-07-10 06:22:52,987][26022] Updated weights on worker 0-0, policy_version 604830 (0.00086) [2022-07-10 06:22:54,538][26022] Updated weights on worker 0-0, policy_version 604840 (0.00087) [2022-07-10 06:22:54,864][25689] Fps is (10 sec: 5798.0, 60 sec: 5674.8, 300 sec: 5626.7). Total num frames: 619358208. Throughput: 0: 5064.2. Samples: 619353966. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:54,864][25689] Avg episode reward: [(0, '-22.900')] [2022-07-10 06:22:56,580][26022] Updated weights on worker 0-0, policy_version 604850 (0.00088) [2022-07-10 06:22:58,184][26022] Updated weights on worker 0-0, policy_version 604860 (0.00091) [2022-07-10 06:22:59,878][25689] Fps is (10 sec: 5801.6, 60 sec: 5626.9, 300 sec: 5634.6). Total num frames: 619385856. Throughput: 0: 5923.9. Samples: 619388260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:22:59,879][25689] Avg episode reward: [(0, '-22.985')] [2022-07-10 06:23:00,003][26022] Updated weights on worker 0-0, policy_version 604870 (0.00088) [2022-07-10 06:23:02,243][26022] Updated weights on worker 0-0, policy_version 604880 (0.00091) [2022-07-10 06:23:04,135][26022] Updated weights on worker 0-0, policy_version 604890 (0.00092) [2022-07-10 06:23:04,928][25689] Fps is (10 sec: 5392.7, 60 sec: 5645.6, 300 sec: 5632.1). Total num frames: 619412480. Throughput: 0: 5827.3. Samples: 619419908. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:04,928][25689] Avg episode reward: [(0, '-22.511')] [2022-07-10 06:23:05,694][26022] Updated weights on worker 0-0, policy_version 604900 (0.00105) [2022-07-10 06:23:07,739][26022] Updated weights on worker 0-0, policy_version 604910 (0.00440) [2022-07-10 06:23:09,104][26022] Updated weights on worker 0-0, policy_version 604920 (0.00089) [2022-07-10 06:23:10,025][25689] Fps is (10 sec: 5348.7, 60 sec: 5626.8, 300 sec: 5617.4). Total num frames: 619440128. Throughput: 0: 4950.2. Samples: 619436916. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:10,026][25689] Avg episode reward: [(0, '-22.934')] [2022-07-10 06:23:11,391][26022] Updated weights on worker 0-0, policy_version 604930 (0.00087) [2022-07-10 06:23:12,928][26022] Updated weights on worker 0-0, policy_version 604940 (0.00088) [2022-07-10 06:23:14,826][26022] Updated weights on worker 0-0, policy_version 604950 (0.00086) [2022-07-10 06:23:15,052][25689] Fps is (10 sec: 5664.4, 60 sec: 5615.2, 300 sec: 5627.6). Total num frames: 619469824. Throughput: 0: 5789.9. Samples: 619470850. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:15,052][25689] Avg episode reward: [(0, '-23.210')] [2022-07-10 06:23:16,701][26022] Updated weights on worker 0-0, policy_version 604960 (0.00090) [2022-07-10 06:23:18,527][26022] Updated weights on worker 0-0, policy_version 604970 (0.00094) [2022-07-10 06:23:20,081][25689] Fps is (10 sec: 5804.4, 60 sec: 5614.6, 300 sec: 5624.9). Total num frames: 619498496. Throughput: 0: 5771.9. Samples: 619504868. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:20,081][25689] Avg episode reward: [(0, '-23.555')] [2022-07-10 06:23:20,187][26022] Updated weights on worker 0-0, policy_version 604980 (0.00086) [2022-07-10 06:23:22,279][26022] Updated weights on worker 0-0, policy_version 604990 (0.00086) [2022-07-10 06:23:23,798][26022] Updated weights on worker 0-0, policy_version 605000 (0.00088) [2022-07-10 06:23:25,123][25689] Fps is (10 sec: 5490.1, 60 sec: 5606.9, 300 sec: 5620.9). Total num frames: 619525120. Throughput: 0: 5040.9. Samples: 619521712. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:25,124][25689] Avg episode reward: [(0, '-23.793')] [2022-07-10 06:23:25,996][26022] Updated weights on worker 0-0, policy_version 605010 (0.00095) [2022-07-10 06:23:27,480][26022] Updated weights on worker 0-0, policy_version 605020 (0.00091) [2022-07-10 06:23:29,435][26022] Updated weights on worker 0-0, policy_version 605030 (0.00111) [2022-07-10 06:23:30,129][25689] Fps is (10 sec: 5604.8, 60 sec: 5640.8, 300 sec: 5627.9). Total num frames: 619554816. Throughput: 0: 5912.0. Samples: 619555772. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:30,130][25689] Avg episode reward: [(0, '-23.454')] [2022-07-10 06:23:31,021][26022] Updated weights on worker 0-0, policy_version 605040 (0.00086) [2022-07-10 06:23:33,134][26022] Updated weights on worker 0-0, policy_version 605050 (0.00094) [2022-07-10 06:23:34,591][26022] Updated weights on worker 0-0, policy_version 605060 (0.00085) [2022-07-10 06:23:35,180][25689] Fps is (10 sec: 5803.9, 60 sec: 5636.4, 300 sec: 5623.6). Total num frames: 619583488. Throughput: 0: 5929.3. Samples: 619590198. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:35,181][25689] Avg episode reward: [(0, '-23.127')] [2022-07-10 06:23:36,598][26022] Updated weights on worker 0-0, policy_version 605070 (0.00083) [2022-07-10 06:23:38,101][26022] Updated weights on worker 0-0, policy_version 605080 (0.00089) [2022-07-10 06:23:40,206][25689] Fps is (10 sec: 5589.3, 60 sec: 5618.2, 300 sec: 5625.0). Total num frames: 619611136. Throughput: 0: 5091.0. Samples: 619607324. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:40,207][25689] Avg episode reward: [(0, '-23.866')] [2022-07-10 06:23:40,263][26022] Updated weights on worker 0-0, policy_version 605090 (0.00386) [2022-07-10 06:23:41,854][26022] Updated weights on worker 0-0, policy_version 605100 (0.00087) [2022-07-10 06:23:43,762][26022] Updated weights on worker 0-0, policy_version 605110 (0.00088) [2022-07-10 06:23:45,315][25689] Fps is (10 sec: 5758.9, 60 sec: 5651.4, 300 sec: 5627.2). Total num frames: 619641856. Throughput: 0: 5932.0. Samples: 619641490. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:45,316][25689] Avg episode reward: [(0, '-24.620')] [2022-07-10 06:23:45,504][26022] Updated weights on worker 0-0, policy_version 605120 (0.00095) [2022-07-10 06:23:47,417][26022] Updated weights on worker 0-0, policy_version 605130 (0.00093) [2022-07-10 06:23:49,072][26022] Updated weights on worker 0-0, policy_version 605140 (0.00087) [2022-07-10 06:23:50,379][25689] Fps is (10 sec: 5737.5, 60 sec: 5647.5, 300 sec: 5626.9). Total num frames: 619669504. Throughput: 0: 5911.3. Samples: 619675472. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:50,379][25689] Avg episode reward: [(0, '-24.682')] [2022-07-10 06:23:51,214][26022] Updated weights on worker 0-0, policy_version 605150 (0.00094) [2022-07-10 06:23:52,774][26022] Updated weights on worker 0-0, policy_version 605160 (0.00084) [2022-07-10 06:23:54,720][26022] Updated weights on worker 0-0, policy_version 605170 (0.00079) [2022-07-10 06:23:55,395][25689] Fps is (10 sec: 5485.9, 60 sec: 5599.5, 300 sec: 5624.2). Total num frames: 619697152. Throughput: 0: 5027.4. Samples: 619691826. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:23:55,396][25689] Avg episode reward: [(0, '-23.974')] [2022-07-10 06:23:56,575][26022] Updated weights on worker 0-0, policy_version 605180 (0.00092) [2022-07-10 06:23:58,351][26022] Updated weights on worker 0-0, policy_version 605190 (0.00409) [2022-07-10 06:24:00,285][26022] Updated weights on worker 0-0, policy_version 605200 (0.00095) [2022-07-10 06:24:00,416][25689] Fps is (10 sec: 5611.2, 60 sec: 5615.8, 300 sec: 5635.1). Total num frames: 619725824. Throughput: 0: 5868.7. Samples: 619725930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 06:24:00,416][25689] Avg episode reward: [(0, '-24.198')] [2022-07-10 06:24:01,810][26022] Updated weights on worker 0-0, policy_version 605210 (0.00094) [2022-07-10 06:24:04,205][26022] Updated weights on worker 0-0, policy_version 605220 (0.00088) [2022-07-10 06:24:05,467][25689] Fps is (10 sec: 5591.6, 60 sec: 5632.6, 300 sec: 5631.0). Total num frames: 619753472. Throughput: 0: 5769.8. Samples: 619757762. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:05,468][25689] Avg episode reward: [(0, '-24.152')] [2022-07-10 06:24:06,218][26022] Updated weights on worker 0-0, policy_version 605230 (0.00085) [2022-07-10 06:24:07,736][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:24:07,750][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000605240_619765760.pth [2022-07-10 06:24:07,751][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000603260_617738240.pth [2022-07-10 06:24:07,754][26022] Updated weights on worker 0-0, policy_version 605240 (0.00086) [2022-07-10 06:24:09,688][26022] Updated weights on worker 0-0, policy_version 605250 (0.00088) [2022-07-10 06:24:10,499][25689] Fps is (10 sec: 5585.9, 60 sec: 5655.7, 300 sec: 5630.6). Total num frames: 619782144. Throughput: 0: 4940.3. Samples: 619774868. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:10,499][25689] Avg episode reward: [(0, '-23.645')] [2022-07-10 06:24:11,280][26022] Updated weights on worker 0-0, policy_version 605260 (0.00086) [2022-07-10 06:24:13,192][26022] Updated weights on worker 0-0, policy_version 605270 (0.00087) [2022-07-10 06:24:14,899][26022] Updated weights on worker 0-0, policy_version 605280 (0.00086) [2022-07-10 06:24:15,512][25689] Fps is (10 sec: 5607.1, 60 sec: 5623.0, 300 sec: 5630.8). Total num frames: 619809792. Throughput: 0: 5837.6. Samples: 619809260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:15,512][25689] Avg episode reward: [(0, '-23.596')] [2022-07-10 06:24:16,714][26022] Updated weights on worker 0-0, policy_version 605290 (0.00087) [2022-07-10 06:24:18,461][26022] Updated weights on worker 0-0, policy_version 605300 (0.00085) [2022-07-10 06:24:20,495][26022] Updated weights on worker 0-0, policy_version 605310 (0.00097) [2022-07-10 06:24:20,520][25689] Fps is (10 sec: 5517.9, 60 sec: 5608.1, 300 sec: 5625.6). Total num frames: 619837440. Throughput: 0: 5868.9. Samples: 619843918. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:20,520][25689] Avg episode reward: [(0, '-23.255')] [2022-07-10 06:24:21,936][26022] Updated weights on worker 0-0, policy_version 605320 (0.00093) [2022-07-10 06:24:24,071][26022] Updated weights on worker 0-0, policy_version 605330 (0.00089) [2022-07-10 06:24:25,455][26022] Updated weights on worker 0-0, policy_version 605340 (0.00501) [2022-07-10 06:24:25,612][25689] Fps is (10 sec: 5778.7, 60 sec: 5671.1, 300 sec: 5630.8). Total num frames: 619868160. Throughput: 0: 5111.5. Samples: 619860734. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:25,613][25689] Avg episode reward: [(0, '-23.366')] [2022-07-10 06:24:27,655][26022] Updated weights on worker 0-0, policy_version 605350 (0.00087) [2022-07-10 06:24:29,139][26022] Updated weights on worker 0-0, policy_version 605360 (0.00100) [2022-07-10 06:24:30,679][25689] Fps is (10 sec: 5745.4, 60 sec: 5631.6, 300 sec: 5630.0). Total num frames: 619895808. Throughput: 0: 5931.9. Samples: 619894576. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:30,679][25689] Avg episode reward: [(0, '-23.343')] [2022-07-10 06:24:31,318][26022] Updated weights on worker 0-0, policy_version 605370 (0.00085) [2022-07-10 06:24:32,934][26022] Updated weights on worker 0-0, policy_version 605380 (0.00086) [2022-07-10 06:24:34,915][26022] Updated weights on worker 0-0, policy_version 605390 (0.00087) [2022-07-10 06:24:35,708][25689] Fps is (10 sec: 5477.5, 60 sec: 5616.8, 300 sec: 5626.5). Total num frames: 619923456. Throughput: 0: 5907.0. Samples: 619928556. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:35,708][25689] Avg episode reward: [(0, '-24.265')] [2022-07-10 06:24:36,268][26022] Updated weights on worker 0-0, policy_version 605400 (0.00877) [2022-07-10 06:24:38,634][26022] Updated weights on worker 0-0, policy_version 605410 (0.00086) [2022-07-10 06:24:40,215][26022] Updated weights on worker 0-0, policy_version 605420 (0.00626) [2022-07-10 06:24:40,731][25689] Fps is (10 sec: 5704.7, 60 sec: 5650.8, 300 sec: 5630.5). Total num frames: 619953152. Throughput: 0: 5037.3. Samples: 619945728. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:40,731][25689] Avg episode reward: [(0, '-24.313')] [2022-07-10 06:24:41,961][26022] Updated weights on worker 0-0, policy_version 605430 (0.00085) [2022-07-10 06:24:43,579][26022] Updated weights on worker 0-0, policy_version 605440 (0.00097) [2022-07-10 06:24:45,527][26022] Updated weights on worker 0-0, policy_version 605450 (0.00087) [2022-07-10 06:24:45,801][25689] Fps is (10 sec: 5681.1, 60 sec: 5603.7, 300 sec: 5629.4). Total num frames: 619980800. Throughput: 0: 5883.0. Samples: 619979506. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:45,802][25689] Avg episode reward: [(0, '-22.974')] [2022-07-10 06:24:47,496][26022] Updated weights on worker 0-0, policy_version 605460 (0.00092) [2022-07-10 06:24:49,291][26022] Updated weights on worker 0-0, policy_version 605470 (0.00086) [2022-07-10 06:24:50,810][25689] Fps is (10 sec: 5689.6, 60 sec: 5642.7, 300 sec: 5626.2). Total num frames: 620010496. Throughput: 0: 5933.7. Samples: 620014026. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:50,810][25689] Avg episode reward: [(0, '-23.617')] [2022-07-10 06:24:50,901][26022] Updated weights on worker 0-0, policy_version 605480 (0.00083) [2022-07-10 06:24:52,631][26022] Updated weights on worker 0-0, policy_version 605490 (0.00537) [2022-07-10 06:24:54,515][26022] Updated weights on worker 0-0, policy_version 605500 (0.00090) [2022-07-10 06:24:55,817][25689] Fps is (10 sec: 5827.9, 60 sec: 5660.5, 300 sec: 5633.0). Total num frames: 620039168. Throughput: 0: 5939.9. Samples: 620048002. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:24:55,817][25689] Avg episode reward: [(0, '-23.177')] [2022-07-10 06:24:56,443][26022] Updated weights on worker 0-0, policy_version 605510 (0.00093) [2022-07-10 06:24:58,249][26022] Updated weights on worker 0-0, policy_version 605520 (0.00096) [2022-07-10 06:24:59,932][26022] Updated weights on worker 0-0, policy_version 605530 (0.00089) [2022-07-10 06:25:00,829][25689] Fps is (10 sec: 5518.7, 60 sec: 5627.4, 300 sec: 5635.5). Total num frames: 620065792. Throughput: 0: 5941.0. Samples: 620065132. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:00,830][25689] Avg episode reward: [(0, '-24.149')] [2022-07-10 06:25:01,871][26022] Updated weights on worker 0-0, policy_version 605540 (0.00088) [2022-07-10 06:25:04,116][26022] Updated weights on worker 0-0, policy_version 605550 (0.00085) [2022-07-10 06:25:05,870][25689] Fps is (10 sec: 5398.3, 60 sec: 5628.3, 300 sec: 5628.2). Total num frames: 620093440. Throughput: 0: 5867.3. Samples: 620097256. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:05,871][25689] Avg episode reward: [(0, '-24.461')] [2022-07-10 06:25:05,884][26022] Updated weights on worker 0-0, policy_version 605560 (0.00094) [2022-07-10 06:25:07,631][26022] Updated weights on worker 0-0, policy_version 605570 (0.00097) [2022-07-10 06:25:09,455][26022] Updated weights on worker 0-0, policy_version 605580 (0.00550) [2022-07-10 06:25:10,879][25689] Fps is (10 sec: 5604.3, 60 sec: 5630.5, 300 sec: 5631.8). Total num frames: 620122112. Throughput: 0: 5845.0. Samples: 620131330. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:10,880][25689] Avg episode reward: [(0, '-24.353')] [2022-07-10 06:25:11,231][26022] Updated weights on worker 0-0, policy_version 605590 (0.00086) [2022-07-10 06:25:13,066][26022] Updated weights on worker 0-0, policy_version 605600 (0.00091) [2022-07-10 06:25:14,843][26022] Updated weights on worker 0-0, policy_version 605610 (0.00086) [2022-07-10 06:25:15,921][25689] Fps is (10 sec: 5603.3, 60 sec: 5627.7, 300 sec: 5628.1). Total num frames: 620149760. Throughput: 0: 5003.3. Samples: 620148592. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:15,923][25689] Avg episode reward: [(0, '-24.367')] [2022-07-10 06:25:16,525][26022] Updated weights on worker 0-0, policy_version 605620 (0.00091) [2022-07-10 06:25:18,376][26022] Updated weights on worker 0-0, policy_version 605630 (0.00090) [2022-07-10 06:25:20,079][26022] Updated weights on worker 0-0, policy_version 605640 (0.00061) [2022-07-10 06:25:20,933][25689] Fps is (10 sec: 5703.5, 60 sec: 5661.3, 300 sec: 5633.8). Total num frames: 620179456. Throughput: 0: 5861.9. Samples: 620182978. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:20,935][25689] Avg episode reward: [(0, '-25.302')] [2022-07-10 06:25:21,910][26022] Updated weights on worker 0-0, policy_version 605650 (0.00109) [2022-07-10 06:25:23,787][26022] Updated weights on worker 0-0, policy_version 605660 (0.00089) [2022-07-10 06:25:25,543][26022] Updated weights on worker 0-0, policy_version 605670 (0.00090) [2022-07-10 06:25:25,989][25689] Fps is (10 sec: 5797.9, 60 sec: 5630.8, 300 sec: 5633.6). Total num frames: 620208128. Throughput: 0: 5941.3. Samples: 620216786. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:25,989][25689] Avg episode reward: [(0, '-24.759')] [2022-07-10 06:25:27,414][26022] Updated weights on worker 0-0, policy_version 605680 (0.00093) [2022-07-10 06:25:29,274][26022] Updated weights on worker 0-0, policy_version 605690 (0.00092) [2022-07-10 06:25:30,919][26022] Updated weights on worker 0-0, policy_version 605700 (0.00083) [2022-07-10 06:25:31,032][25689] Fps is (10 sec: 5678.1, 60 sec: 5649.9, 300 sec: 5636.2). Total num frames: 620236800. Throughput: 0: 5085.4. Samples: 620233818. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:31,033][25689] Avg episode reward: [(0, '-23.561')] [2022-07-10 06:25:32,867][26022] Updated weights on worker 0-0, policy_version 605710 (0.00086) [2022-07-10 06:25:34,399][26022] Updated weights on worker 0-0, policy_version 605720 (0.00091) [2022-07-10 06:25:36,059][25689] Fps is (10 sec: 5592.8, 60 sec: 5650.1, 300 sec: 5633.3). Total num frames: 620264448. Throughput: 0: 5937.1. Samples: 620268150. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:36,059][25689] Avg episode reward: [(0, '-23.933')] [2022-07-10 06:25:36,430][26022] Updated weights on worker 0-0, policy_version 605730 (0.00088) [2022-07-10 06:25:38,089][26022] Updated weights on worker 0-0, policy_version 605740 (0.00092) [2022-07-10 06:25:40,071][26022] Updated weights on worker 0-0, policy_version 605750 (0.00096) [2022-07-10 06:25:41,091][25689] Fps is (10 sec: 5701.0, 60 sec: 5649.3, 300 sec: 5638.4). Total num frames: 620294144. Throughput: 0: 5928.9. Samples: 620302492. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:41,091][25689] Avg episode reward: [(0, '-24.687')] [2022-07-10 06:25:41,548][26022] Updated weights on worker 0-0, policy_version 605760 (0.00100) [2022-07-10 06:25:43,766][26022] Updated weights on worker 0-0, policy_version 605770 (0.00091) [2022-07-10 06:25:45,264][26022] Updated weights on worker 0-0, policy_version 605780 (0.00087) [2022-07-10 06:25:46,142][25689] Fps is (10 sec: 5788.6, 60 sec: 5668.1, 300 sec: 5634.7). Total num frames: 620322816. Throughput: 0: 5099.5. Samples: 620319558. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:46,143][25689] Avg episode reward: [(0, '-25.041')] [2022-07-10 06:25:47,158][26022] Updated weights on worker 0-0, policy_version 605790 (0.00084) [2022-07-10 06:25:48,897][26022] Updated weights on worker 0-0, policy_version 605800 (0.00084) [2022-07-10 06:25:50,710][26022] Updated weights on worker 0-0, policy_version 605810 (0.00086) [2022-07-10 06:25:51,150][25689] Fps is (10 sec: 5598.9, 60 sec: 5634.2, 300 sec: 5636.4). Total num frames: 620350464. Throughput: 0: 5959.1. Samples: 620353702. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:51,150][25689] Avg episode reward: [(0, '-25.095')] [2022-07-10 06:25:52,569][26022] Updated weights on worker 0-0, policy_version 605820 (0.00086) [2022-07-10 06:25:54,328][26022] Updated weights on worker 0-0, policy_version 605830 (0.00093) [2022-07-10 06:25:56,154][25689] Fps is (10 sec: 5727.6, 60 sec: 5651.4, 300 sec: 5636.9). Total num frames: 620380160. Throughput: 0: 5946.4. Samples: 620387644. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:25:56,154][25689] Avg episode reward: [(0, '-24.932')] [2022-07-10 06:25:56,158][26022] Updated weights on worker 0-0, policy_version 605840 (0.00105) [2022-07-10 06:25:58,151][26022] Updated weights on worker 0-0, policy_version 605850 (0.00095) [2022-07-10 06:25:59,803][26022] Updated weights on worker 0-0, policy_version 605860 (0.00095) [2022-07-10 06:26:01,174][25689] Fps is (10 sec: 5618.4, 60 sec: 5650.7, 300 sec: 5638.3). Total num frames: 620406784. Throughput: 0: 5090.7. Samples: 620404730. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:01,174][25689] Avg episode reward: [(0, '-24.128')] [2022-07-10 06:26:01,976][26022] Updated weights on worker 0-0, policy_version 605870 (0.00091) [2022-07-10 06:26:03,840][26022] Updated weights on worker 0-0, policy_version 605880 (0.00092) [2022-07-10 06:26:05,739][26022] Updated weights on worker 0-0, policy_version 605890 (0.00093) [2022-07-10 06:26:06,300][25689] Fps is (10 sec: 5349.0, 60 sec: 5642.8, 300 sec: 5634.1). Total num frames: 620434432. Throughput: 0: 5784.8. Samples: 620436168. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:06,301][25689] Avg episode reward: [(0, '-23.279')] [2022-07-10 06:26:07,560][26022] Updated weights on worker 0-0, policy_version 605900 (0.00092) [2022-07-10 06:26:07,945][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:26:07,959][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000605902_620443648.pth [2022-07-10 06:26:07,960][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000603921_618415104.pth [2022-07-10 06:26:09,463][26022] Updated weights on worker 0-0, policy_version 605910 (0.00088) [2022-07-10 06:26:11,331][25689] Fps is (10 sec: 5444.2, 60 sec: 5623.8, 300 sec: 5635.1). Total num frames: 620462080. Throughput: 0: 5758.5. Samples: 620469914. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:11,332][25689] Avg episode reward: [(0, '-22.402')] [2022-07-10 06:26:11,333][26022] Updated weights on worker 0-0, policy_version 605920 (0.00090) [2022-07-10 06:26:12,954][26022] Updated weights on worker 0-0, policy_version 605930 (0.00085) [2022-07-10 06:26:14,742][26022] Updated weights on worker 0-0, policy_version 605940 (0.00085) [2022-07-10 06:26:16,352][25689] Fps is (10 sec: 5806.9, 60 sec: 5676.6, 300 sec: 5635.4). Total num frames: 620492800. Throughput: 0: 4927.3. Samples: 620487164. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:16,352][25689] Avg episode reward: [(0, '-22.788')] [2022-07-10 06:26:16,354][26022] Updated weights on worker 0-0, policy_version 605950 (0.00086) [2022-07-10 06:26:18,456][26022] Updated weights on worker 0-0, policy_version 605960 (0.00091) [2022-07-10 06:26:20,233][26022] Updated weights on worker 0-0, policy_version 605970 (0.00085) [2022-07-10 06:26:21,390][25689] Fps is (10 sec: 5598.9, 60 sec: 5606.4, 300 sec: 5629.7). Total num frames: 620518400. Throughput: 0: 5761.9. Samples: 620521212. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:21,390][25689] Avg episode reward: [(0, '-22.208')] [2022-07-10 06:26:22,017][26022] Updated weights on worker 0-0, policy_version 605980 (0.00087) [2022-07-10 06:26:23,749][26022] Updated weights on worker 0-0, policy_version 605990 (0.00094) [2022-07-10 06:26:25,746][26022] Updated weights on worker 0-0, policy_version 606000 (0.00086) [2022-07-10 06:26:26,472][25689] Fps is (10 sec: 5463.7, 60 sec: 5620.8, 300 sec: 5631.8). Total num frames: 620548096. Throughput: 0: 5887.5. Samples: 620554932. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:26,473][25689] Avg episode reward: [(0, '-22.976')] [2022-07-10 06:26:27,549][26022] Updated weights on worker 0-0, policy_version 606010 (0.00086) [2022-07-10 06:26:29,298][26022] Updated weights on worker 0-0, policy_version 606020 (0.01145) [2022-07-10 06:26:31,029][26022] Updated weights on worker 0-0, policy_version 606030 (0.00099) [2022-07-10 06:26:31,492][25689] Fps is (10 sec: 5879.3, 60 sec: 5640.0, 300 sec: 5638.4). Total num frames: 620577792. Throughput: 0: 5070.3. Samples: 620572136. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:31,492][25689] Avg episode reward: [(0, '-23.541')] [2022-07-10 06:26:33,067][26022] Updated weights on worker 0-0, policy_version 606040 (0.01015) [2022-07-10 06:26:34,483][26022] Updated weights on worker 0-0, policy_version 606050 (0.00085) [2022-07-10 06:26:36,501][25689] Fps is (10 sec: 5616.0, 60 sec: 5624.7, 300 sec: 5631.8). Total num frames: 620604416. Throughput: 0: 5925.9. Samples: 620606566. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:36,501][25689] Avg episode reward: [(0, '-23.760')] [2022-07-10 06:26:36,553][26022] Updated weights on worker 0-0, policy_version 606060 (0.00096) [2022-07-10 06:26:38,052][26022] Updated weights on worker 0-0, policy_version 606070 (0.00089) [2022-07-10 06:26:40,196][26022] Updated weights on worker 0-0, policy_version 606080 (0.00085) [2022-07-10 06:26:41,530][25689] Fps is (10 sec: 5610.8, 60 sec: 5625.0, 300 sec: 5633.8). Total num frames: 620634112. Throughput: 0: 5927.4. Samples: 620640588. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:41,531][25689] Avg episode reward: [(0, '-24.324')] [2022-07-10 06:26:41,783][26022] Updated weights on worker 0-0, policy_version 606090 (0.00093) [2022-07-10 06:26:43,683][26022] Updated weights on worker 0-0, policy_version 606100 (0.00080) [2022-07-10 06:26:45,527][26022] Updated weights on worker 0-0, policy_version 606110 (0.00093) [2022-07-10 06:26:46,595][25689] Fps is (10 sec: 5681.0, 60 sec: 5606.7, 300 sec: 5629.6). Total num frames: 620661760. Throughput: 0: 5094.0. Samples: 620657436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:46,596][25689] Avg episode reward: [(0, '-23.877')] [2022-07-10 06:26:47,540][26022] Updated weights on worker 0-0, policy_version 606120 (0.00089) [2022-07-10 06:26:49,128][26022] Updated weights on worker 0-0, policy_version 606130 (0.00084) [2022-07-10 06:26:51,003][26022] Updated weights on worker 0-0, policy_version 606140 (0.00095) [2022-07-10 06:26:51,680][25689] Fps is (10 sec: 5649.3, 60 sec: 5633.4, 300 sec: 5639.0). Total num frames: 620691456. Throughput: 0: 5911.9. Samples: 620691488. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:51,681][25689] Avg episode reward: [(0, '-23.271')] [2022-07-10 06:26:52,685][26022] Updated weights on worker 0-0, policy_version 606150 (0.00088) [2022-07-10 06:26:54,516][26022] Updated weights on worker 0-0, policy_version 606160 (0.00089) [2022-07-10 06:26:56,443][26022] Updated weights on worker 0-0, policy_version 606170 (0.00089) [2022-07-10 06:26:56,701][25689] Fps is (10 sec: 5674.5, 60 sec: 5598.1, 300 sec: 5629.2). Total num frames: 620719104. Throughput: 0: 5881.9. Samples: 620725378. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:26:56,701][25689] Avg episode reward: [(0, '-23.382')] [2022-07-10 06:26:58,246][26022] Updated weights on worker 0-0, policy_version 606180 (0.00097) [2022-07-10 06:26:59,971][26022] Updated weights on worker 0-0, policy_version 606190 (0.00083) [2022-07-10 06:27:01,717][25689] Fps is (10 sec: 5509.2, 60 sec: 5615.3, 300 sec: 5637.0). Total num frames: 620746752. Throughput: 0: 5049.0. Samples: 620742516. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:27:01,718][25689] Avg episode reward: [(0, '-22.600')] [2022-07-10 06:27:02,160][26022] Updated weights on worker 0-0, policy_version 606200 (0.00082) [2022-07-10 06:27:03,832][26022] Updated weights on worker 0-0, policy_version 606210 (0.00088) [2022-07-10 06:27:05,918][26022] Updated weights on worker 0-0, policy_version 606220 (0.00088) [2022-07-10 06:27:06,841][25689] Fps is (10 sec: 5655.2, 60 sec: 5649.4, 300 sec: 5639.6). Total num frames: 620776448. Throughput: 0: 5768.7. Samples: 620774228. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 06:27:06,841][25689] Avg episode reward: [(0, '-22.252')] [2022-07-10 06:27:07,720][26022] Updated weights on worker 0-0, policy_version 606230 (0.00091) [2022-07-10 06:27:09,640][26022] Updated weights on worker 0-0, policy_version 606240 (0.00090) [2022-07-10 06:27:11,369][26022] Updated weights on worker 0-0, policy_version 606250 (0.00086) [2022-07-10 06:27:11,850][25689] Fps is (10 sec: 5558.0, 60 sec: 5634.4, 300 sec: 5627.2). Total num frames: 620803072. Throughput: 0: 5787.3. Samples: 620808218. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:11,851][25689] Avg episode reward: [(0, '-22.369')] [2022-07-10 06:27:13,239][26022] Updated weights on worker 0-0, policy_version 606260 (0.00091) [2022-07-10 06:27:14,639][26022] Updated weights on worker 0-0, policy_version 606270 (0.00086) [2022-07-10 06:27:16,907][25689] Fps is (10 sec: 5289.5, 60 sec: 5563.4, 300 sec: 5619.7). Total num frames: 620829696. Throughput: 0: 4950.9. Samples: 620825420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:16,908][25689] Avg episode reward: [(0, '-23.255')] [2022-07-10 06:27:17,001][26022] Updated weights on worker 0-0, policy_version 606280 (0.00089) [2022-07-10 06:27:18,391][26022] Updated weights on worker 0-0, policy_version 606290 (0.00092) [2022-07-10 06:27:20,491][26022] Updated weights on worker 0-0, policy_version 606300 (0.00091) [2022-07-10 06:27:21,928][25689] Fps is (10 sec: 5690.5, 60 sec: 5649.6, 300 sec: 5632.3). Total num frames: 620860416. Throughput: 0: 5777.0. Samples: 620859270. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:21,928][25689] Avg episode reward: [(0, '-23.437')] [2022-07-10 06:27:22,050][26022] Updated weights on worker 0-0, policy_version 606310 (0.00096) [2022-07-10 06:27:24,075][26022] Updated weights on worker 0-0, policy_version 606320 (0.00090) [2022-07-10 06:27:25,655][26022] Updated weights on worker 0-0, policy_version 606330 (0.00093) [2022-07-10 06:27:26,981][25689] Fps is (10 sec: 5692.4, 60 sec: 5601.6, 300 sec: 5628.0). Total num frames: 620887040. Throughput: 0: 5883.7. Samples: 620892730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:26,982][25689] Avg episode reward: [(0, '-24.056')] [2022-07-10 06:27:27,956][26022] Updated weights on worker 0-0, policy_version 606340 (0.00091) [2022-07-10 06:27:29,284][26022] Updated weights on worker 0-0, policy_version 606350 (0.00481) [2022-07-10 06:27:31,595][26022] Updated weights on worker 0-0, policy_version 606360 (0.00095) [2022-07-10 06:27:31,988][25689] Fps is (10 sec: 5394.6, 60 sec: 5568.9, 300 sec: 5624.5). Total num frames: 620914688. Throughput: 0: 5039.6. Samples: 620909704. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:31,988][25689] Avg episode reward: [(0, '-25.276')] [2022-07-10 06:27:32,777][26022] Updated weights on worker 0-0, policy_version 606370 (0.00086) [2022-07-10 06:27:35,199][26022] Updated weights on worker 0-0, policy_version 606380 (0.00085) [2022-07-10 06:27:36,589][26022] Updated weights on worker 0-0, policy_version 606390 (0.00090) [2022-07-10 06:27:37,019][25689] Fps is (10 sec: 5712.8, 60 sec: 5617.6, 300 sec: 5627.6). Total num frames: 620944384. Throughput: 0: 5878.9. Samples: 620943654. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:37,020][25689] Avg episode reward: [(0, '-25.711')] [2022-07-10 06:27:38,503][26022] Updated weights on worker 0-0, policy_version 606400 (0.00079) [2022-07-10 06:27:40,631][26022] Updated weights on worker 0-0, policy_version 606410 (0.00104) [2022-07-10 06:27:42,043][25689] Fps is (10 sec: 5805.0, 60 sec: 5601.2, 300 sec: 5629.1). Total num frames: 620973056. Throughput: 0: 5880.4. Samples: 620977554. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:42,043][25689] Avg episode reward: [(0, '-25.387')] [2022-07-10 06:27:42,164][26022] Updated weights on worker 0-0, policy_version 606420 (0.00086) [2022-07-10 06:27:44,088][26022] Updated weights on worker 0-0, policy_version 606430 (0.00094) [2022-07-10 06:27:45,821][26022] Updated weights on worker 0-0, policy_version 606440 (0.00086) [2022-07-10 06:27:47,128][25689] Fps is (10 sec: 5571.3, 60 sec: 5599.3, 300 sec: 5627.8). Total num frames: 621000704. Throughput: 0: 5048.9. Samples: 620994450. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:47,129][25689] Avg episode reward: [(0, '-25.364')] [2022-07-10 06:27:47,599][26022] Updated weights on worker 0-0, policy_version 606450 (0.00087) [2022-07-10 06:27:49,815][26022] Updated weights on worker 0-0, policy_version 606460 (0.00854) [2022-07-10 06:27:51,196][26022] Updated weights on worker 0-0, policy_version 606470 (0.00092) [2022-07-10 06:27:52,146][25689] Fps is (10 sec: 5676.0, 60 sec: 5605.6, 300 sec: 5624.9). Total num frames: 621030400. Throughput: 0: 5890.3. Samples: 621028438. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:52,146][25689] Avg episode reward: [(0, '-24.222')] [2022-07-10 06:27:53,278][26022] Updated weights on worker 0-0, policy_version 606480 (0.00088) [2022-07-10 06:27:54,788][26022] Updated weights on worker 0-0, policy_version 606490 (0.00089) [2022-07-10 06:27:56,974][26022] Updated weights on worker 0-0, policy_version 606500 (0.00086) [2022-07-10 06:27:57,147][25689] Fps is (10 sec: 5621.2, 60 sec: 5590.4, 300 sec: 5621.7). Total num frames: 621057024. Throughput: 0: 5897.7. Samples: 621062364. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:27:57,148][25689] Avg episode reward: [(0, '-24.565')] [2022-07-10 06:27:58,466][26022] Updated weights on worker 0-0, policy_version 606510 (0.00086) [2022-07-10 06:28:00,358][26022] Updated weights on worker 0-0, policy_version 606520 (0.00085) [2022-07-10 06:28:02,186][25689] Fps is (10 sec: 5303.4, 60 sec: 5571.4, 300 sec: 5621.9). Total num frames: 621083648. Throughput: 0: 5051.2. Samples: 621079302. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:02,186][25689] Avg episode reward: [(0, '-22.225')] [2022-07-10 06:28:02,519][26022] Updated weights on worker 0-0, policy_version 606530 (0.00092) [2022-07-10 06:28:04,527][26022] Updated weights on worker 0-0, policy_version 606540 (0.00083) [2022-07-10 06:28:06,119][26022] Updated weights on worker 0-0, policy_version 606550 (0.00092) [2022-07-10 06:28:07,252][25689] Fps is (10 sec: 5573.5, 60 sec: 5576.7, 300 sec: 5629.4). Total num frames: 621113344. Throughput: 0: 5807.0. Samples: 621111312. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:07,253][25689] Avg episode reward: [(0, '-21.820')] [2022-07-10 06:28:07,974][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:28:08,002][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000606559_621116416.pth [2022-07-10 06:28:08,003][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000604580_619089920.pth [2022-07-10 06:28:08,117][26022] Updated weights on worker 0-0, policy_version 606560 (0.00089) [2022-07-10 06:28:09,608][26022] Updated weights on worker 0-0, policy_version 606570 (0.00078) [2022-07-10 06:28:11,757][26022] Updated weights on worker 0-0, policy_version 606580 (0.00086) [2022-07-10 06:28:12,275][25689] Fps is (10 sec: 5684.0, 60 sec: 5592.5, 300 sec: 5622.6). Total num frames: 621140992. Throughput: 0: 5806.9. Samples: 621145324. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:12,276][25689] Avg episode reward: [(0, '-22.159')] [2022-07-10 06:28:13,062][26022] Updated weights on worker 0-0, policy_version 606590 (0.00085) [2022-07-10 06:28:15,459][26022] Updated weights on worker 0-0, policy_version 606600 (0.00083) [2022-07-10 06:28:16,697][26022] Updated weights on worker 0-0, policy_version 606610 (0.00089) [2022-07-10 06:28:17,288][25689] Fps is (10 sec: 5714.0, 60 sec: 5647.4, 300 sec: 5626.4). Total num frames: 621170688. Throughput: 0: 5831.4. Samples: 621179812. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:17,288][25689] Avg episode reward: [(0, '-22.004')] [2022-07-10 06:28:18,823][26022] Updated weights on worker 0-0, policy_version 606620 (0.00092) [2022-07-10 06:28:20,307][26022] Updated weights on worker 0-0, policy_version 606630 (0.00085) [2022-07-10 06:28:22,304][25689] Fps is (10 sec: 5819.6, 60 sec: 5613.8, 300 sec: 5633.7). Total num frames: 621199360. Throughput: 0: 5842.3. Samples: 621196838. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:22,305][25689] Avg episode reward: [(0, '-21.487')] [2022-07-10 06:28:22,308][26022] Updated weights on worker 0-0, policy_version 606640 (0.00089) [2022-07-10 06:28:23,996][26022] Updated weights on worker 0-0, policy_version 606650 (0.00087) [2022-07-10 06:28:25,841][26022] Updated weights on worker 0-0, policy_version 606660 (0.00067) [2022-07-10 06:28:27,344][25689] Fps is (10 sec: 5600.7, 60 sec: 5632.1, 300 sec: 5626.2). Total num frames: 621227008. Throughput: 0: 5969.7. Samples: 621231254. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:27,344][25689] Avg episode reward: [(0, '-21.218')] [2022-07-10 06:28:27,662][26022] Updated weights on worker 0-0, policy_version 606670 (0.00089) [2022-07-10 06:28:29,700][26022] Updated weights on worker 0-0, policy_version 606680 (0.00096) [2022-07-10 06:28:31,203][26022] Updated weights on worker 0-0, policy_version 606690 (0.00089) [2022-07-10 06:28:32,381][25689] Fps is (10 sec: 5589.0, 60 sec: 5646.2, 300 sec: 5626.5). Total num frames: 621255680. Throughput: 0: 5956.8. Samples: 621265094. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:32,383][25689] Avg episode reward: [(0, '-21.701')] [2022-07-10 06:28:33,383][26022] Updated weights on worker 0-0, policy_version 606700 (0.00087) [2022-07-10 06:28:34,781][26022] Updated weights on worker 0-0, policy_version 606710 (0.00092) [2022-07-10 06:28:36,999][26022] Updated weights on worker 0-0, policy_version 606720 (0.00089) [2022-07-10 06:28:37,391][25689] Fps is (10 sec: 5707.7, 60 sec: 5631.3, 300 sec: 5630.2). Total num frames: 621284352. Throughput: 0: 5096.0. Samples: 621282256. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:37,391][25689] Avg episode reward: [(0, '-21.759')] [2022-07-10 06:28:38,463][26022] Updated weights on worker 0-0, policy_version 606730 (0.00080) [2022-07-10 06:28:40,499][26022] Updated weights on worker 0-0, policy_version 606740 (0.00637) [2022-07-10 06:28:42,203][26022] Updated weights on worker 0-0, policy_version 606750 (0.00085) [2022-07-10 06:28:42,397][25689] Fps is (10 sec: 5622.8, 60 sec: 5615.9, 300 sec: 5621.8). Total num frames: 621312000. Throughput: 0: 5950.9. Samples: 621316412. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:42,399][25689] Avg episode reward: [(0, '-23.020')] [2022-07-10 06:28:43,884][26022] Updated weights on worker 0-0, policy_version 606760 (0.00087) [2022-07-10 06:28:45,903][26022] Updated weights on worker 0-0, policy_version 606770 (0.00097) [2022-07-10 06:28:47,446][25689] Fps is (10 sec: 5702.9, 60 sec: 5653.2, 300 sec: 5629.0). Total num frames: 621341696. Throughput: 0: 5934.9. Samples: 621350558. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:47,447][25689] Avg episode reward: [(0, '-23.868')] [2022-07-10 06:28:47,610][26022] Updated weights on worker 0-0, policy_version 606780 (0.00089) [2022-07-10 06:28:49,523][26022] Updated weights on worker 0-0, policy_version 606790 (0.00085) [2022-07-10 06:28:51,180][26022] Updated weights on worker 0-0, policy_version 606800 (0.00090) [2022-07-10 06:28:52,538][25689] Fps is (10 sec: 5654.7, 60 sec: 5612.3, 300 sec: 5627.6). Total num frames: 621369344. Throughput: 0: 5089.6. Samples: 621367688. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:52,540][25689] Avg episode reward: [(0, '-24.941')] [2022-07-10 06:28:53,085][26022] Updated weights on worker 0-0, policy_version 606810 (0.00085) [2022-07-10 06:28:55,011][26022] Updated weights on worker 0-0, policy_version 606820 (0.00088) [2022-07-10 06:28:56,638][26022] Updated weights on worker 0-0, policy_version 606830 (0.00093) [2022-07-10 06:28:57,559][25689] Fps is (10 sec: 5569.1, 60 sec: 5644.5, 300 sec: 5627.6). Total num frames: 621398016. Throughput: 0: 5904.6. Samples: 621401340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:28:57,560][25689] Avg episode reward: [(0, '-25.080')] [2022-07-10 06:28:58,706][26022] Updated weights on worker 0-0, policy_version 606840 (0.00085) [2022-07-10 06:29:00,314][26022] Updated weights on worker 0-0, policy_version 606850 (0.00095) [2022-07-10 06:29:02,575][25689] Fps is (10 sec: 5407.6, 60 sec: 5629.6, 300 sec: 5621.3). Total num frames: 621423616. Throughput: 0: 5792.5. Samples: 621433288. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:02,575][25689] Avg episode reward: [(0, '-24.589')] [2022-07-10 06:29:02,586][26022] Updated weights on worker 0-0, policy_version 606860 (0.00102) [2022-07-10 06:29:04,262][26022] Updated weights on worker 0-0, policy_version 606870 (0.00086) [2022-07-10 06:29:06,233][26022] Updated weights on worker 0-0, policy_version 606880 (0.00085) [2022-07-10 06:29:07,647][25689] Fps is (10 sec: 5481.4, 60 sec: 5629.1, 300 sec: 5624.0). Total num frames: 621453312. Throughput: 0: 4935.8. Samples: 621450266. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:07,647][25689] Avg episode reward: [(0, '-24.880')] [2022-07-10 06:29:07,884][26022] Updated weights on worker 0-0, policy_version 606890 (0.00092) [2022-07-10 06:29:09,598][26022] Updated weights on worker 0-0, policy_version 606900 (0.00107) [2022-07-10 06:29:11,502][26022] Updated weights on worker 0-0, policy_version 606910 (0.00418) [2022-07-10 06:29:12,686][25689] Fps is (10 sec: 5873.8, 60 sec: 5661.4, 300 sec: 5630.4). Total num frames: 621483008. Throughput: 0: 5792.2. Samples: 621484388. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:12,687][25689] Avg episode reward: [(0, '-24.487')] [2022-07-10 06:29:13,481][26022] Updated weights on worker 0-0, policy_version 606920 (0.00087) [2022-07-10 06:29:15,362][26022] Updated weights on worker 0-0, policy_version 606930 (0.00094) [2022-07-10 06:29:16,962][26022] Updated weights on worker 0-0, policy_version 606940 (0.00089) [2022-07-10 06:29:17,781][25689] Fps is (10 sec: 5658.6, 60 sec: 5620.0, 300 sec: 5628.8). Total num frames: 621510656. Throughput: 0: 5770.8. Samples: 621518036. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:17,781][25689] Avg episode reward: [(0, '-24.051')] [2022-07-10 06:29:18,924][26022] Updated weights on worker 0-0, policy_version 606950 (0.00091) [2022-07-10 06:29:20,550][26022] Updated weights on worker 0-0, policy_version 606960 (0.00097) [2022-07-10 06:29:22,610][26022] Updated weights on worker 0-0, policy_version 606970 (0.01524) [2022-07-10 06:29:22,870][25689] Fps is (10 sec: 5530.4, 60 sec: 5613.2, 300 sec: 5622.0). Total num frames: 621539328. Throughput: 0: 4999.2. Samples: 621534748. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:22,870][25689] Avg episode reward: [(0, '-24.752')] [2022-07-10 06:29:24,408][26022] Updated weights on worker 0-0, policy_version 606980 (0.00091) [2022-07-10 06:29:26,076][26022] Updated weights on worker 0-0, policy_version 606990 (0.00092) [2022-07-10 06:29:27,903][25689] Fps is (10 sec: 5564.1, 60 sec: 5613.8, 300 sec: 5622.6). Total num frames: 621566976. Throughput: 0: 5838.0. Samples: 621568520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:27,903][25689] Avg episode reward: [(0, '-25.050')] [2022-07-10 06:29:28,000][26022] Updated weights on worker 0-0, policy_version 607000 (0.00092) [2022-07-10 06:29:29,947][26022] Updated weights on worker 0-0, policy_version 607010 (0.00085) [2022-07-10 06:29:31,473][26022] Updated weights on worker 0-0, policy_version 607020 (0.00090) [2022-07-10 06:29:32,958][25689] Fps is (10 sec: 5582.6, 60 sec: 5612.1, 300 sec: 5625.5). Total num frames: 621595648. Throughput: 0: 5811.2. Samples: 621602194. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:32,959][25689] Avg episode reward: [(0, '-25.356')] [2022-07-10 06:29:33,533][26022] Updated weights on worker 0-0, policy_version 607030 (0.00091) [2022-07-10 06:29:35,202][26022] Updated weights on worker 0-0, policy_version 607040 (0.00091) [2022-07-10 06:29:37,182][26022] Updated weights on worker 0-0, policy_version 607050 (0.00091) [2022-07-10 06:29:37,971][25689] Fps is (10 sec: 5695.3, 60 sec: 5611.8, 300 sec: 5622.3). Total num frames: 621624320. Throughput: 0: 5016.5. Samples: 621619322. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:37,972][25689] Avg episode reward: [(0, '-25.484')] [2022-07-10 06:29:38,956][26022] Updated weights on worker 0-0, policy_version 607060 (0.00085) [2022-07-10 06:29:40,604][26022] Updated weights on worker 0-0, policy_version 607070 (0.00078) [2022-07-10 06:29:42,703][26022] Updated weights on worker 0-0, policy_version 607080 (0.00092) [2022-07-10 06:29:42,981][25689] Fps is (10 sec: 5721.6, 60 sec: 5628.5, 300 sec: 5626.9). Total num frames: 621652992. Throughput: 0: 5897.5. Samples: 621653352. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:42,981][25689] Avg episode reward: [(0, '-25.353')] [2022-07-10 06:29:44,144][26022] Updated weights on worker 0-0, policy_version 607090 (0.00085) [2022-07-10 06:29:46,082][26022] Updated weights on worker 0-0, policy_version 607100 (0.00094) [2022-07-10 06:29:47,842][26022] Updated weights on worker 0-0, policy_version 607110 (0.00053) [2022-07-10 06:29:48,038][25689] Fps is (10 sec: 5696.2, 60 sec: 5610.7, 300 sec: 5622.5). Total num frames: 621681664. Throughput: 0: 5914.8. Samples: 621687618. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:48,039][25689] Avg episode reward: [(0, '-25.278')] [2022-07-10 06:29:49,713][26022] Updated weights on worker 0-0, policy_version 607120 (0.00080) [2022-07-10 06:29:51,500][26022] Updated weights on worker 0-0, policy_version 607130 (0.00084) [2022-07-10 06:29:53,047][25689] Fps is (10 sec: 5595.0, 60 sec: 5618.5, 300 sec: 5619.0). Total num frames: 621709312. Throughput: 0: 5104.7. Samples: 621704740. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:53,049][25689] Avg episode reward: [(0, '-24.144')] [2022-07-10 06:29:53,150][26022] Updated weights on worker 0-0, policy_version 607140 (0.00079) [2022-07-10 06:29:54,998][26022] Updated weights on worker 0-0, policy_version 607150 (0.00082) [2022-07-10 06:29:56,700][26022] Updated weights on worker 0-0, policy_version 607160 (0.00088) [2022-07-10 06:29:58,062][25689] Fps is (10 sec: 5618.7, 60 sec: 5619.0, 300 sec: 5625.9). Total num frames: 621737984. Throughput: 0: 5959.6. Samples: 621739054. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:29:58,063][25689] Avg episode reward: [(0, '-23.111')] [2022-07-10 06:29:58,778][26022] Updated weights on worker 0-0, policy_version 607170 (0.00092) [2022-07-10 06:30:00,413][26022] Updated weights on worker 0-0, policy_version 607180 (0.00082) [2022-07-10 06:30:02,707][26022] Updated weights on worker 0-0, policy_version 607190 (0.00094) [2022-07-10 06:30:03,087][25689] Fps is (10 sec: 5507.7, 60 sec: 5635.1, 300 sec: 5622.7). Total num frames: 621764608. Throughput: 0: 5862.1. Samples: 621771214. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:30:03,087][25689] Avg episode reward: [(0, '-22.367')] [2022-07-10 06:30:04,308][26022] Updated weights on worker 0-0, policy_version 607200 (0.00085) [2022-07-10 06:30:06,436][26022] Updated weights on worker 0-0, policy_version 607210 (0.00088) [2022-07-10 06:30:08,127][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:30:08,129][25689] Fps is (10 sec: 5391.2, 60 sec: 5604.0, 300 sec: 5618.6). Total num frames: 621792256. Throughput: 0: 4975.8. Samples: 621787584. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:30:08,130][25689] Avg episode reward: [(0, '-21.848')] [2022-07-10 06:30:08,144][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000607220_621793280.pth [2022-07-10 06:30:08,144][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000605240_619765760.pth [2022-07-10 06:30:08,147][26022] Updated weights on worker 0-0, policy_version 607220 (0.00087) [2022-07-10 06:30:10,012][26022] Updated weights on worker 0-0, policy_version 607230 (0.00092) [2022-07-10 06:30:11,666][26022] Updated weights on worker 0-0, policy_version 607240 (0.00086) [2022-07-10 06:30:13,146][25689] Fps is (10 sec: 5496.9, 60 sec: 5572.2, 300 sec: 5619.1). Total num frames: 621819904. Throughput: 0: 5820.2. Samples: 621821722. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 06:30:13,147][25689] Avg episode reward: [(0, '-21.334')] [2022-07-10 06:30:13,516][26022] Updated weights on worker 0-0, policy_version 607250 (0.00090) [2022-07-10 06:30:15,311][26022] Updated weights on worker 0-0, policy_version 607260 (0.00097) [2022-07-10 06:30:17,067][26022] Updated weights on worker 0-0, policy_version 607270 (0.00114) [2022-07-10 06:30:18,167][25689] Fps is (10 sec: 5712.5, 60 sec: 5612.9, 300 sec: 5618.9). Total num frames: 621849600. Throughput: 0: 5834.7. Samples: 621856362. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:18,169][25689] Avg episode reward: [(0, '-21.614')] [2022-07-10 06:30:18,769][26022] Updated weights on worker 0-0, policy_version 607280 (0.00085) [2022-07-10 06:30:20,728][26022] Updated weights on worker 0-0, policy_version 607290 (0.00091) [2022-07-10 06:30:22,354][26022] Updated weights on worker 0-0, policy_version 607300 (0.00084) [2022-07-10 06:30:23,188][25689] Fps is (10 sec: 5812.3, 60 sec: 5619.2, 300 sec: 5619.6). Total num frames: 621878272. Throughput: 0: 5090.3. Samples: 621873538. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:23,189][25689] Avg episode reward: [(0, '-21.885')] [2022-07-10 06:30:24,266][26022] Updated weights on worker 0-0, policy_version 607310 (0.00095) [2022-07-10 06:30:26,277][26022] Updated weights on worker 0-0, policy_version 607320 (0.00081) [2022-07-10 06:30:27,885][26022] Updated weights on worker 0-0, policy_version 607330 (0.00088) [2022-07-10 06:30:28,255][25689] Fps is (10 sec: 5786.1, 60 sec: 5650.0, 300 sec: 5622.6). Total num frames: 621907968. Throughput: 0: 5951.3. Samples: 621907360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:28,255][25689] Avg episode reward: [(0, '-22.753')] [2022-07-10 06:30:29,894][26022] Updated weights on worker 0-0, policy_version 607340 (0.00084) [2022-07-10 06:30:31,597][26022] Updated weights on worker 0-0, policy_version 607350 (0.00092) [2022-07-10 06:30:33,268][25689] Fps is (10 sec: 5689.2, 60 sec: 5637.1, 300 sec: 5622.9). Total num frames: 621935616. Throughput: 0: 5934.2. Samples: 621941128. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:33,268][25689] Avg episode reward: [(0, '-22.893')] [2022-07-10 06:30:33,291][26022] Updated weights on worker 0-0, policy_version 607360 (0.00087) [2022-07-10 06:30:35,292][26022] Updated weights on worker 0-0, policy_version 607370 (0.00088) [2022-07-10 06:30:36,743][26022] Updated weights on worker 0-0, policy_version 607380 (0.00093) [2022-07-10 06:30:38,281][25689] Fps is (10 sec: 5515.1, 60 sec: 5620.1, 300 sec: 5616.3). Total num frames: 621963264. Throughput: 0: 5066.4. Samples: 621958266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:38,282][25689] Avg episode reward: [(0, '-23.547')] [2022-07-10 06:30:38,872][26022] Updated weights on worker 0-0, policy_version 607390 (0.00087) [2022-07-10 06:30:40,541][26022] Updated weights on worker 0-0, policy_version 607400 (0.00088) [2022-07-10 06:30:42,309][26022] Updated weights on worker 0-0, policy_version 607410 (0.00089) [2022-07-10 06:30:43,293][25689] Fps is (10 sec: 5719.7, 60 sec: 5636.8, 300 sec: 5620.5). Total num frames: 621992960. Throughput: 0: 5912.0. Samples: 621992398. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:43,294][25689] Avg episode reward: [(0, '-23.505')] [2022-07-10 06:30:44,416][26022] Updated weights on worker 0-0, policy_version 607420 (0.00092) [2022-07-10 06:30:45,717][26022] Updated weights on worker 0-0, policy_version 607430 (0.00089) [2022-07-10 06:30:47,892][26022] Updated weights on worker 0-0, policy_version 607440 (0.00088) [2022-07-10 06:30:48,335][25689] Fps is (10 sec: 5703.7, 60 sec: 5621.3, 300 sec: 5619.9). Total num frames: 622020608. Throughput: 0: 5926.3. Samples: 622026360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:48,335][25689] Avg episode reward: [(0, '-23.640')] [2022-07-10 06:30:49,379][26022] Updated weights on worker 0-0, policy_version 607450 (0.00087) [2022-07-10 06:30:51,434][26022] Updated weights on worker 0-0, policy_version 607460 (0.00095) [2022-07-10 06:30:53,268][26022] Updated weights on worker 0-0, policy_version 607470 (0.00057) [2022-07-10 06:30:53,422][25689] Fps is (10 sec: 5560.5, 60 sec: 5630.9, 300 sec: 5614.9). Total num frames: 622049280. Throughput: 0: 5072.7. Samples: 622043364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:53,423][25689] Avg episode reward: [(0, '-23.968')] [2022-07-10 06:30:54,727][26022] Updated weights on worker 0-0, policy_version 607480 (0.00084) [2022-07-10 06:30:56,854][26022] Updated weights on worker 0-0, policy_version 607490 (0.00093) [2022-07-10 06:30:58,433][25689] Fps is (10 sec: 5780.2, 60 sec: 5648.3, 300 sec: 5625.4). Total num frames: 622078976. Throughput: 0: 5926.3. Samples: 622077692. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:30:58,433][25689] Avg episode reward: [(0, '-24.104')] [2022-07-10 06:30:58,675][26022] Updated weights on worker 0-0, policy_version 607500 (0.00087) [2022-07-10 06:31:00,310][26022] Updated weights on worker 0-0, policy_version 607510 (0.00096) [2022-07-10 06:31:02,759][26022] Updated weights on worker 0-0, policy_version 607520 (0.00084) [2022-07-10 06:31:03,461][25689] Fps is (10 sec: 5507.9, 60 sec: 5631.0, 300 sec: 5620.3). Total num frames: 622104576. Throughput: 0: 5808.0. Samples: 622109534. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:03,462][25689] Avg episode reward: [(0, '-23.966')] [2022-07-10 06:31:04,367][26022] Updated weights on worker 0-0, policy_version 607530 (0.00086) [2022-07-10 06:31:06,339][26022] Updated weights on worker 0-0, policy_version 607540 (0.00090) [2022-07-10 06:31:07,931][26022] Updated weights on worker 0-0, policy_version 607550 (0.00084) [2022-07-10 06:31:08,567][25689] Fps is (10 sec: 5355.5, 60 sec: 5642.0, 300 sec: 5622.4). Total num frames: 622133248. Throughput: 0: 4955.9. Samples: 622126626. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:08,567][25689] Avg episode reward: [(0, '-23.197')] [2022-07-10 06:31:09,823][26022] Updated weights on worker 0-0, policy_version 607560 (0.00084) [2022-07-10 06:31:11,743][26022] Updated weights on worker 0-0, policy_version 607570 (0.00054) [2022-07-10 06:31:13,387][26022] Updated weights on worker 0-0, policy_version 607580 (0.00089) [2022-07-10 06:31:13,575][25689] Fps is (10 sec: 5771.4, 60 sec: 5676.8, 300 sec: 5619.2). Total num frames: 622162944. Throughput: 0: 5817.5. Samples: 622160602. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:13,575][25689] Avg episode reward: [(0, '-23.262')] [2022-07-10 06:31:15,110][26022] Updated weights on worker 0-0, policy_version 607590 (0.00101) [2022-07-10 06:31:16,948][26022] Updated weights on worker 0-0, policy_version 607600 (0.00624) [2022-07-10 06:31:18,596][25689] Fps is (10 sec: 5819.9, 60 sec: 5659.9, 300 sec: 5629.8). Total num frames: 622191616. Throughput: 0: 5822.7. Samples: 622195094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:18,596][25689] Avg episode reward: [(0, '-23.382')] [2022-07-10 06:31:18,639][26022] Updated weights on worker 0-0, policy_version 607610 (0.00089) [2022-07-10 06:31:20,654][26022] Updated weights on worker 0-0, policy_version 607620 (0.00093) [2022-07-10 06:31:22,426][26022] Updated weights on worker 0-0, policy_version 607630 (0.00097) [2022-07-10 06:31:23,607][25689] Fps is (10 sec: 5716.1, 60 sec: 5660.8, 300 sec: 5627.7). Total num frames: 622220288. Throughput: 0: 5934.0. Samples: 622229076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:23,608][25689] Avg episode reward: [(0, '-23.420')] [2022-07-10 06:31:24,084][26022] Updated weights on worker 0-0, policy_version 607640 (0.00088) [2022-07-10 06:31:26,122][26022] Updated weights on worker 0-0, policy_version 607650 (0.00084) [2022-07-10 06:31:27,898][26022] Updated weights on worker 0-0, policy_version 607660 (0.00088) [2022-07-10 06:31:28,727][25689] Fps is (10 sec: 5458.1, 60 sec: 5605.0, 300 sec: 5615.5). Total num frames: 622246912. Throughput: 0: 5914.3. Samples: 622245858. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:28,729][25689] Avg episode reward: [(0, '-23.288')] [2022-07-10 06:31:29,641][26022] Updated weights on worker 0-0, policy_version 607670 (0.00089) [2022-07-10 06:31:31,671][26022] Updated weights on worker 0-0, policy_version 607680 (0.00092) [2022-07-10 06:31:33,120][26022] Updated weights on worker 0-0, policy_version 607690 (0.00087) [2022-07-10 06:31:33,759][25689] Fps is (10 sec: 5547.4, 60 sec: 5637.0, 300 sec: 5625.4). Total num frames: 622276608. Throughput: 0: 5899.1. Samples: 622279672. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:33,761][25689] Avg episode reward: [(0, '-23.340')] [2022-07-10 06:31:35,290][26022] Updated weights on worker 0-0, policy_version 607700 (0.00088) [2022-07-10 06:31:36,756][26022] Updated weights on worker 0-0, policy_version 607710 (0.00087) [2022-07-10 06:31:38,761][26022] Updated weights on worker 0-0, policy_version 607720 (0.00085) [2022-07-10 06:31:38,806][25689] Fps is (10 sec: 5791.0, 60 sec: 5650.9, 300 sec: 5621.6). Total num frames: 622305280. Throughput: 0: 5874.8. Samples: 622313824. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:38,806][25689] Avg episode reward: [(0, '-26.194')] [2022-07-10 06:31:40,513][26022] Updated weights on worker 0-0, policy_version 607730 (0.00083) [2022-07-10 06:31:42,179][26022] Updated weights on worker 0-0, policy_version 607740 (0.00084) [2022-07-10 06:31:43,827][25689] Fps is (10 sec: 5695.9, 60 sec: 5633.1, 300 sec: 5625.9). Total num frames: 622333952. Throughput: 0: 5046.7. Samples: 622331120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:43,827][25689] Avg episode reward: [(0, '-26.666')] [2022-07-10 06:31:44,064][26022] Updated weights on worker 0-0, policy_version 607750 (0.00097) [2022-07-10 06:31:46,079][26022] Updated weights on worker 0-0, policy_version 607760 (0.00094) [2022-07-10 06:31:47,598][26022] Updated weights on worker 0-0, policy_version 607770 (0.00097) [2022-07-10 06:31:48,951][25689] Fps is (10 sec: 5652.5, 60 sec: 5642.4, 300 sec: 5621.7). Total num frames: 622362624. Throughput: 0: 5912.6. Samples: 622365434. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:48,951][25689] Avg episode reward: [(0, '-26.550')] [2022-07-10 06:31:49,714][26022] Updated weights on worker 0-0, policy_version 607780 (0.00106) [2022-07-10 06:31:51,184][26022] Updated weights on worker 0-0, policy_version 607790 (0.00087) [2022-07-10 06:31:53,211][26022] Updated weights on worker 0-0, policy_version 607800 (0.00085) [2022-07-10 06:31:53,988][25689] Fps is (10 sec: 5744.1, 60 sec: 5663.9, 300 sec: 5628.3). Total num frames: 622392320. Throughput: 0: 5935.1. Samples: 622399734. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:53,989][25689] Avg episode reward: [(0, '-25.493')] [2022-07-10 06:31:54,768][26022] Updated weights on worker 0-0, policy_version 607810 (0.00062) [2022-07-10 06:31:56,701][26022] Updated weights on worker 0-0, policy_version 607820 (0.00089) [2022-07-10 06:31:58,467][26022] Updated weights on worker 0-0, policy_version 607830 (0.00092) [2022-07-10 06:31:59,003][25689] Fps is (10 sec: 5704.4, 60 sec: 5629.7, 300 sec: 5628.3). Total num frames: 622419968. Throughput: 0: 5097.7. Samples: 622416786. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:31:59,004][25689] Avg episode reward: [(0, '-25.285')] [2022-07-10 06:32:00,226][26022] Updated weights on worker 0-0, policy_version 607840 (0.00087) [2022-07-10 06:32:02,468][26022] Updated weights on worker 0-0, policy_version 607850 (0.00090) [2022-07-10 06:32:04,017][25689] Fps is (10 sec: 5411.6, 60 sec: 5648.0, 300 sec: 5620.0). Total num frames: 622446592. Throughput: 0: 5832.7. Samples: 622448884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:04,017][25689] Avg episode reward: [(0, '-25.875')] [2022-07-10 06:32:04,168][26022] Updated weights on worker 0-0, policy_version 607860 (0.00088) [2022-07-10 06:32:06,127][26022] Updated weights on worker 0-0, policy_version 607870 (0.00084) [2022-07-10 06:32:07,823][26022] Updated weights on worker 0-0, policy_version 607880 (0.00083) [2022-07-10 06:32:08,206][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:32:08,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000607881_622470144.pth [2022-07-10 06:32:08,228][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000605902_620443648.pth [2022-07-10 06:32:09,127][25689] Fps is (10 sec: 5461.8, 60 sec: 5647.5, 300 sec: 5625.0). Total num frames: 622475264. Throughput: 0: 5813.7. Samples: 622482736. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:09,128][25689] Avg episode reward: [(0, '-22.717')] [2022-07-10 06:32:09,774][26022] Updated weights on worker 0-0, policy_version 607890 (0.00100) [2022-07-10 06:32:11,595][26022] Updated weights on worker 0-0, policy_version 607900 (0.00084) [2022-07-10 06:32:13,403][26022] Updated weights on worker 0-0, policy_version 607910 (0.00084) [2022-07-10 06:32:14,135][25689] Fps is (10 sec: 5667.4, 60 sec: 5630.6, 300 sec: 5632.9). Total num frames: 622503936. Throughput: 0: 4961.9. Samples: 622499702. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:14,136][25689] Avg episode reward: [(0, '-22.733')] [2022-07-10 06:32:15,344][26022] Updated weights on worker 0-0, policy_version 607920 (0.00092) [2022-07-10 06:32:16,987][26022] Updated weights on worker 0-0, policy_version 607930 (0.00087) [2022-07-10 06:32:18,754][26022] Updated weights on worker 0-0, policy_version 607940 (0.00084) [2022-07-10 06:32:19,222][25689] Fps is (10 sec: 5579.5, 60 sec: 5607.6, 300 sec: 5621.3). Total num frames: 622531584. Throughput: 0: 5789.2. Samples: 622533832. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:19,222][25689] Avg episode reward: [(0, '-22.749')] [2022-07-10 06:32:20,582][26022] Updated weights on worker 0-0, policy_version 607950 (0.00089) [2022-07-10 06:32:22,523][26022] Updated weights on worker 0-0, policy_version 607960 (0.00089) [2022-07-10 06:32:24,184][26022] Updated weights on worker 0-0, policy_version 607970 (0.00087) [2022-07-10 06:32:24,252][25689] Fps is (10 sec: 5769.2, 60 sec: 5639.6, 300 sec: 5635.5). Total num frames: 622562304. Throughput: 0: 5883.4. Samples: 622567936. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:24,253][25689] Avg episode reward: [(0, '-22.722')] [2022-07-10 06:32:26,115][26022] Updated weights on worker 0-0, policy_version 607980 (0.00092) [2022-07-10 06:32:27,750][26022] Updated weights on worker 0-0, policy_version 607990 (0.00101) [2022-07-10 06:32:29,317][25689] Fps is (10 sec: 5680.0, 60 sec: 5644.7, 300 sec: 5630.9). Total num frames: 622588928. Throughput: 0: 5053.8. Samples: 622584772. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:29,319][25689] Avg episode reward: [(0, '-23.187')] [2022-07-10 06:32:29,639][26022] Updated weights on worker 0-0, policy_version 608000 (0.00091) [2022-07-10 06:32:31,383][26022] Updated weights on worker 0-0, policy_version 608010 (0.00098) [2022-07-10 06:32:33,409][26022] Updated weights on worker 0-0, policy_version 608020 (0.00091) [2022-07-10 06:32:34,351][25689] Fps is (10 sec: 5577.0, 60 sec: 5644.6, 300 sec: 5630.9). Total num frames: 622618624. Throughput: 0: 5890.0. Samples: 622618772. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:34,351][25689] Avg episode reward: [(0, '-24.185')] [2022-07-10 06:32:35,024][26022] Updated weights on worker 0-0, policy_version 608030 (0.00086) [2022-07-10 06:32:36,911][26022] Updated weights on worker 0-0, policy_version 608040 (0.00089) [2022-07-10 06:32:38,574][26022] Updated weights on worker 0-0, policy_version 608050 (0.00089) [2022-07-10 06:32:39,377][25689] Fps is (10 sec: 5700.2, 60 sec: 5629.6, 300 sec: 5627.4). Total num frames: 622646272. Throughput: 0: 5893.3. Samples: 622652616. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:39,378][25689] Avg episode reward: [(0, '-24.306')] [2022-07-10 06:32:40,675][26022] Updated weights on worker 0-0, policy_version 608060 (0.00092) [2022-07-10 06:32:42,384][26022] Updated weights on worker 0-0, policy_version 608070 (0.00110) [2022-07-10 06:32:44,295][26022] Updated weights on worker 0-0, policy_version 608080 (0.00084) [2022-07-10 06:32:44,383][25689] Fps is (10 sec: 5613.9, 60 sec: 5631.0, 300 sec: 5632.3). Total num frames: 622674944. Throughput: 0: 5036.0. Samples: 622669314. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:44,385][25689] Avg episode reward: [(0, '-23.863')] [2022-07-10 06:32:45,972][26022] Updated weights on worker 0-0, policy_version 608090 (0.00090) [2022-07-10 06:32:47,944][26022] Updated weights on worker 0-0, policy_version 608100 (0.00093) [2022-07-10 06:32:49,491][25689] Fps is (10 sec: 5568.6, 60 sec: 5615.6, 300 sec: 5623.7). Total num frames: 622702592. Throughput: 0: 5851.9. Samples: 622702828. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:49,492][25689] Avg episode reward: [(0, '-23.491')] [2022-07-10 06:32:49,703][26022] Updated weights on worker 0-0, policy_version 608110 (0.00088) [2022-07-10 06:32:51,595][26022] Updated weights on worker 0-0, policy_version 608120 (0.00101) [2022-07-10 06:32:53,342][26022] Updated weights on worker 0-0, policy_version 608130 (0.00081) [2022-07-10 06:32:54,510][25689] Fps is (10 sec: 5460.5, 60 sec: 5583.5, 300 sec: 5626.9). Total num frames: 622730240. Throughput: 0: 5841.7. Samples: 622736532. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:54,510][25689] Avg episode reward: [(0, '-22.919')] [2022-07-10 06:32:55,312][26022] Updated weights on worker 0-0, policy_version 608140 (0.00094) [2022-07-10 06:32:56,864][26022] Updated weights on worker 0-0, policy_version 608150 (0.00093) [2022-07-10 06:32:58,973][26022] Updated weights on worker 0-0, policy_version 608160 (0.00093) [2022-07-10 06:32:59,537][25689] Fps is (10 sec: 5606.0, 60 sec: 5599.2, 300 sec: 5634.0). Total num frames: 622758912. Throughput: 0: 5008.2. Samples: 622753582. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:32:59,538][25689] Avg episode reward: [(0, '-23.178')] [2022-07-10 06:33:00,655][26022] Updated weights on worker 0-0, policy_version 608170 (0.00099) [2022-07-10 06:33:02,831][26022] Updated weights on worker 0-0, policy_version 608180 (0.00094) [2022-07-10 06:33:04,562][25689] Fps is (10 sec: 5398.9, 60 sec: 5581.3, 300 sec: 5621.0). Total num frames: 622784512. Throughput: 0: 5745.0. Samples: 622785242. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:33:04,563][25689] Avg episode reward: [(0, '-21.746')] [2022-07-10 06:33:04,908][26022] Updated weights on worker 0-0, policy_version 608190 (0.00087) [2022-07-10 06:33:06,487][26022] Updated weights on worker 0-0, policy_version 608200 (0.00087) [2022-07-10 06:33:08,488][26022] Updated weights on worker 0-0, policy_version 608210 (0.00087) [2022-07-10 06:33:09,621][25689] Fps is (10 sec: 5585.6, 60 sec: 5620.0, 300 sec: 5630.6). Total num frames: 622815232. Throughput: 0: 5761.4. Samples: 622818802. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:33:09,621][25689] Avg episode reward: [(0, '-22.283')] [2022-07-10 06:33:10,166][26022] Updated weights on worker 0-0, policy_version 608220 (0.00089) [2022-07-10 06:33:11,923][26022] Updated weights on worker 0-0, policy_version 608230 (0.00097) [2022-07-10 06:33:13,887][26022] Updated weights on worker 0-0, policy_version 608240 (0.00087) [2022-07-10 06:33:14,712][25689] Fps is (10 sec: 5649.8, 60 sec: 5578.4, 300 sec: 5618.8). Total num frames: 622841856. Throughput: 0: 4918.3. Samples: 622835892. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:33:14,712][25689] Avg episode reward: [(0, '-22.881')] [2022-07-10 06:33:15,671][26022] Updated weights on worker 0-0, policy_version 608250 (0.00096) [2022-07-10 06:33:17,389][26022] Updated weights on worker 0-0, policy_version 608260 (0.00084) [2022-07-10 06:33:19,227][26022] Updated weights on worker 0-0, policy_version 608270 (0.00098) [2022-07-10 06:33:19,727][25689] Fps is (10 sec: 5471.1, 60 sec: 5601.8, 300 sec: 5618.8). Total num frames: 622870528. Throughput: 0: 5748.6. Samples: 622869646. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:33:19,728][25689] Avg episode reward: [(0, '-23.198')] [2022-07-10 06:33:21,208][26022] Updated weights on worker 0-0, policy_version 608280 (0.00094) [2022-07-10 06:33:22,955][26022] Updated weights on worker 0-0, policy_version 608290 (0.00095) [2022-07-10 06:33:24,749][25689] Fps is (10 sec: 5611.2, 60 sec: 5551.9, 300 sec: 5619.2). Total num frames: 622898176. Throughput: 0: 5871.7. Samples: 622903772. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:24,749][25689] Avg episode reward: [(0, '-23.656')] [2022-07-10 06:33:24,769][26022] Updated weights on worker 0-0, policy_version 608300 (0.00222) [2022-07-10 06:33:26,470][26022] Updated weights on worker 0-0, policy_version 608310 (0.00082) [2022-07-10 06:33:28,528][26022] Updated weights on worker 0-0, policy_version 608320 (0.00088) [2022-07-10 06:33:29,834][25689] Fps is (10 sec: 5673.7, 60 sec: 5600.8, 300 sec: 5621.7). Total num frames: 622927872. Throughput: 0: 5037.0. Samples: 622920620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:29,836][25689] Avg episode reward: [(0, '-23.405')] [2022-07-10 06:33:29,937][26022] Updated weights on worker 0-0, policy_version 608330 (0.00095) [2022-07-10 06:33:32,122][26022] Updated weights on worker 0-0, policy_version 608340 (0.00091) [2022-07-10 06:33:33,545][26022] Updated weights on worker 0-0, policy_version 608350 (0.00090) [2022-07-10 06:33:34,854][25689] Fps is (10 sec: 5776.0, 60 sec: 5585.2, 300 sec: 5621.5). Total num frames: 622956544. Throughput: 0: 5899.7. Samples: 622954726. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:34,856][25689] Avg episode reward: [(0, '-23.813')] [2022-07-10 06:33:35,751][26022] Updated weights on worker 0-0, policy_version 608360 (0.00090) [2022-07-10 06:33:37,314][26022] Updated weights on worker 0-0, policy_version 608370 (0.00092) [2022-07-10 06:33:39,355][26022] Updated weights on worker 0-0, policy_version 608380 (0.00084) [2022-07-10 06:33:39,866][25689] Fps is (10 sec: 5716.5, 60 sec: 5603.4, 300 sec: 5624.9). Total num frames: 622985216. Throughput: 0: 5912.5. Samples: 622988714. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:39,866][25689] Avg episode reward: [(0, '-23.811')] [2022-07-10 06:33:40,936][26022] Updated weights on worker 0-0, policy_version 608390 (0.00089) [2022-07-10 06:33:42,879][26022] Updated weights on worker 0-0, policy_version 608400 (0.00095) [2022-07-10 06:33:44,444][26022] Updated weights on worker 0-0, policy_version 608410 (0.00090) [2022-07-10 06:33:44,881][25689] Fps is (10 sec: 5617.0, 60 sec: 5585.7, 300 sec: 5618.6). Total num frames: 623012864. Throughput: 0: 5062.9. Samples: 623005698. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:44,881][25689] Avg episode reward: [(0, '-22.757')] [2022-07-10 06:33:46,589][26022] Updated weights on worker 0-0, policy_version 608420 (0.00093) [2022-07-10 06:33:48,195][26022] Updated weights on worker 0-0, policy_version 608430 (0.00094) [2022-07-10 06:33:50,033][25689] Fps is (10 sec: 5539.5, 60 sec: 5598.5, 300 sec: 5620.9). Total num frames: 623041536. Throughput: 0: 5905.3. Samples: 623039898. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:50,033][25689] Avg episode reward: [(0, '-22.729')] [2022-07-10 06:33:50,118][26022] Updated weights on worker 0-0, policy_version 608440 (0.00094) [2022-07-10 06:33:52,031][26022] Updated weights on worker 0-0, policy_version 608450 (0.00096) [2022-07-10 06:33:53,861][26022] Updated weights on worker 0-0, policy_version 608460 (0.00091) [2022-07-10 06:33:55,076][25689] Fps is (10 sec: 5524.1, 60 sec: 5596.2, 300 sec: 5617.0). Total num frames: 623069184. Throughput: 0: 5867.7. Samples: 623073382. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:33:55,078][25689] Avg episode reward: [(0, '-22.940')] [2022-07-10 06:33:55,607][26022] Updated weights on worker 0-0, policy_version 608470 (0.00084) [2022-07-10 06:33:57,312][26022] Updated weights on worker 0-0, policy_version 608480 (0.00091) [2022-07-10 06:33:59,096][26022] Updated weights on worker 0-0, policy_version 608490 (0.00091) [2022-07-10 06:34:00,091][25689] Fps is (10 sec: 5701.3, 60 sec: 5614.4, 300 sec: 5630.8). Total num frames: 623098880. Throughput: 0: 5874.6. Samples: 623107528. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:00,092][25689] Avg episode reward: [(0, '-22.879')] [2022-07-10 06:34:01,125][26022] Updated weights on worker 0-0, policy_version 608500 (0.00085) [2022-07-10 06:34:03,283][26022] Updated weights on worker 0-0, policy_version 608510 (0.00084) [2022-07-10 06:34:05,098][25689] Fps is (10 sec: 5313.0, 60 sec: 5582.1, 300 sec: 5611.4). Total num frames: 623122432. Throughput: 0: 5736.2. Samples: 623121670. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:05,099][25689] Avg episode reward: [(0, '-22.621')] [2022-07-10 06:34:05,262][26022] Updated weights on worker 0-0, policy_version 608520 (0.00092) [2022-07-10 06:34:07,089][26022] Updated weights on worker 0-0, policy_version 608530 (0.00095) [2022-07-10 06:34:08,349][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:34:08,358][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000608537_623141888.pth [2022-07-10 06:34:08,367][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000606559_621116416.pth [2022-07-10 06:34:09,115][26022] Updated weights on worker 0-0, policy_version 608540 (0.00087) [2022-07-10 06:34:10,243][25689] Fps is (10 sec: 5144.2, 60 sec: 5540.4, 300 sec: 5606.0). Total num frames: 623151104. Throughput: 0: 5652.7. Samples: 623154138. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:10,243][25689] Avg episode reward: [(0, '-22.765')] [2022-07-10 06:34:10,816][26022] Updated weights on worker 0-0, policy_version 608550 (0.00072) [2022-07-10 06:34:12,712][26022] Updated weights on worker 0-0, policy_version 608560 (0.00091) [2022-07-10 06:34:14,303][26022] Updated weights on worker 0-0, policy_version 608570 (0.00089) [2022-07-10 06:34:15,254][25689] Fps is (10 sec: 5747.4, 60 sec: 5598.5, 300 sec: 5614.4). Total num frames: 623180800. Throughput: 0: 5697.8. Samples: 623188350. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:15,254][25689] Avg episode reward: [(0, '-24.650')] [2022-07-10 06:34:16,352][26022] Updated weights on worker 0-0, policy_version 608580 (0.00092) [2022-07-10 06:34:17,878][26022] Updated weights on worker 0-0, policy_version 608590 (0.00087) [2022-07-10 06:34:19,827][26022] Updated weights on worker 0-0, policy_version 608600 (0.00085) [2022-07-10 06:34:20,258][25689] Fps is (10 sec: 5725.3, 60 sec: 5582.6, 300 sec: 5612.6). Total num frames: 623208448. Throughput: 0: 4862.8. Samples: 623205600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:20,259][25689] Avg episode reward: [(0, '-25.378')] [2022-07-10 06:34:21,410][26022] Updated weights on worker 0-0, policy_version 608610 (0.00088) [2022-07-10 06:34:23,325][26022] Updated weights on worker 0-0, policy_version 608620 (0.00087) [2022-07-10 06:34:25,151][26022] Updated weights on worker 0-0, policy_version 608630 (0.00090) [2022-07-10 06:34:25,331][25689] Fps is (10 sec: 5588.8, 60 sec: 5594.8, 300 sec: 5615.3). Total num frames: 623237120. Throughput: 0: 5839.7. Samples: 623239824. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:25,332][25689] Avg episode reward: [(0, '-26.632')] [2022-07-10 06:34:26,900][26022] Updated weights on worker 0-0, policy_version 608640 (0.00094) [2022-07-10 06:34:28,781][26022] Updated weights on worker 0-0, policy_version 608650 (0.00092) [2022-07-10 06:34:30,466][25689] Fps is (10 sec: 5718.2, 60 sec: 5590.2, 300 sec: 5617.2). Total num frames: 623266816. Throughput: 0: 5902.0. Samples: 623273496. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:30,466][25689] Avg episode reward: [(0, '-26.722')] [2022-07-10 06:34:30,747][26022] Updated weights on worker 0-0, policy_version 608660 (0.00088) [2022-07-10 06:34:32,351][26022] Updated weights on worker 0-0, policy_version 608670 (0.00090) [2022-07-10 06:34:34,362][26022] Updated weights on worker 0-0, policy_version 608680 (0.00090) [2022-07-10 06:34:35,480][25689] Fps is (10 sec: 5650.5, 60 sec: 5573.9, 300 sec: 5613.8). Total num frames: 623294464. Throughput: 0: 5047.3. Samples: 623290436. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:35,480][25689] Avg episode reward: [(0, '-26.065')] [2022-07-10 06:34:35,969][26022] Updated weights on worker 0-0, policy_version 608690 (0.00083) [2022-07-10 06:34:37,819][26022] Updated weights on worker 0-0, policy_version 608700 (0.00085) [2022-07-10 06:34:39,748][26022] Updated weights on worker 0-0, policy_version 608710 (0.00090) [2022-07-10 06:34:40,568][25689] Fps is (10 sec: 5575.3, 60 sec: 5566.9, 300 sec: 5612.3). Total num frames: 623323136. Throughput: 0: 5840.8. Samples: 623324222. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:40,568][25689] Avg episode reward: [(0, '-24.689')] [2022-07-10 06:34:41,427][26022] Updated weights on worker 0-0, policy_version 608720 (0.00092) [2022-07-10 06:34:43,462][26022] Updated weights on worker 0-0, policy_version 608730 (0.00092) [2022-07-10 06:34:44,980][26022] Updated weights on worker 0-0, policy_version 608740 (0.00085) [2022-07-10 06:34:45,653][25689] Fps is (10 sec: 5636.4, 60 sec: 5577.2, 300 sec: 5611.8). Total num frames: 623351808. Throughput: 0: 5838.8. Samples: 623358482. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:45,654][25689] Avg episode reward: [(0, '-23.392')] [2022-07-10 06:34:46,947][26022] Updated weights on worker 0-0, policy_version 608750 (0.00090) [2022-07-10 06:34:48,780][26022] Updated weights on worker 0-0, policy_version 608760 (0.00087) [2022-07-10 06:34:50,625][26022] Updated weights on worker 0-0, policy_version 608770 (0.00090) [2022-07-10 06:34:50,693][25689] Fps is (10 sec: 5764.4, 60 sec: 5604.4, 300 sec: 5618.1). Total num frames: 623381504. Throughput: 0: 5042.7. Samples: 623375496. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:50,694][25689] Avg episode reward: [(0, '-22.970')] [2022-07-10 06:34:52,309][26022] Updated weights on worker 0-0, policy_version 608780 (0.00088) [2022-07-10 06:34:54,242][26022] Updated weights on worker 0-0, policy_version 608790 (0.00094) [2022-07-10 06:34:55,747][25689] Fps is (10 sec: 5681.3, 60 sec: 5603.5, 300 sec: 5613.9). Total num frames: 623409152. Throughput: 0: 5881.0. Samples: 623409628. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:34:55,747][25689] Avg episode reward: [(0, '-22.231')] [2022-07-10 06:34:56,043][26022] Updated weights on worker 0-0, policy_version 608800 (0.00094) [2022-07-10 06:34:57,989][26022] Updated weights on worker 0-0, policy_version 608810 (0.00091) [2022-07-10 06:34:59,434][26022] Updated weights on worker 0-0, policy_version 608820 (0.00087) [2022-07-10 06:35:00,763][25689] Fps is (10 sec: 5593.1, 60 sec: 5586.5, 300 sec: 5620.9). Total num frames: 623437824. Throughput: 0: 5910.4. Samples: 623443582. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:00,763][25689] Avg episode reward: [(0, '-22.238')] [2022-07-10 06:35:01,492][26022] Updated weights on worker 0-0, policy_version 608830 (0.00092) [2022-07-10 06:35:03,535][26022] Updated weights on worker 0-0, policy_version 608840 (0.00069) [2022-07-10 06:35:05,387][26022] Updated weights on worker 0-0, policy_version 608850 (0.00088) [2022-07-10 06:35:05,817][25689] Fps is (10 sec: 5389.2, 60 sec: 5615.9, 300 sec: 5613.8). Total num frames: 623463424. Throughput: 0: 4959.5. Samples: 623458486. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:05,819][25689] Avg episode reward: [(0, '-22.684')] [2022-07-10 06:35:07,079][26022] Updated weights on worker 0-0, policy_version 608860 (0.00083) [2022-07-10 06:35:09,334][26022] Updated weights on worker 0-0, policy_version 608870 (0.00092) [2022-07-10 06:35:10,862][26022] Updated weights on worker 0-0, policy_version 608880 (0.00091) [2022-07-10 06:35:10,939][25689] Fps is (10 sec: 5434.0, 60 sec: 5634.9, 300 sec: 5618.7). Total num frames: 623493120. Throughput: 0: 5768.2. Samples: 623492276. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:10,939][25689] Avg episode reward: [(0, '-22.606')] [2022-07-10 06:35:12,845][26022] Updated weights on worker 0-0, policy_version 608890 (0.00087) [2022-07-10 06:35:14,358][26022] Updated weights on worker 0-0, policy_version 608900 (0.00085) [2022-07-10 06:35:15,940][25689] Fps is (10 sec: 5665.1, 60 sec: 5602.1, 300 sec: 5612.2). Total num frames: 623520768. Throughput: 0: 5774.6. Samples: 623526234. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:15,942][25689] Avg episode reward: [(0, '-22.781')] [2022-07-10 06:35:16,431][26022] Updated weights on worker 0-0, policy_version 608910 (0.00082) [2022-07-10 06:35:18,097][26022] Updated weights on worker 0-0, policy_version 608920 (0.00096) [2022-07-10 06:35:20,097][26022] Updated weights on worker 0-0, policy_version 608930 (0.00086) [2022-07-10 06:35:20,964][25689] Fps is (10 sec: 5617.6, 60 sec: 5617.1, 300 sec: 5612.2). Total num frames: 623549440. Throughput: 0: 4924.6. Samples: 623543066. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:20,965][25689] Avg episode reward: [(0, '-22.714')] [2022-07-10 06:35:21,660][26022] Updated weights on worker 0-0, policy_version 608940 (0.00087) [2022-07-10 06:35:23,698][26022] Updated weights on worker 0-0, policy_version 608950 (0.00088) [2022-07-10 06:35:25,362][26022] Updated weights on worker 0-0, policy_version 608960 (0.00086) [2022-07-10 06:35:25,981][25689] Fps is (10 sec: 5608.8, 60 sec: 5605.4, 300 sec: 5606.2). Total num frames: 623577088. Throughput: 0: 5868.4. Samples: 623576816. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:25,982][25689] Avg episode reward: [(0, '-24.748')] [2022-07-10 06:35:27,334][26022] Updated weights on worker 0-0, policy_version 608970 (0.00084) [2022-07-10 06:35:29,248][26022] Updated weights on worker 0-0, policy_version 608980 (0.00094) [2022-07-10 06:35:30,969][26022] Updated weights on worker 0-0, policy_version 608990 (0.00096) [2022-07-10 06:35:31,098][25689] Fps is (10 sec: 5557.5, 60 sec: 5590.1, 300 sec: 5607.7). Total num frames: 623605760. Throughput: 0: 5851.2. Samples: 623610234. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:31,099][25689] Avg episode reward: [(0, '-24.775')] [2022-07-10 06:35:32,718][26022] Updated weights on worker 0-0, policy_version 609000 (0.00093) [2022-07-10 06:35:34,746][26022] Updated weights on worker 0-0, policy_version 609010 (0.00083) [2022-07-10 06:35:36,123][25689] Fps is (10 sec: 5755.2, 60 sec: 5622.9, 300 sec: 5614.4). Total num frames: 623635456. Throughput: 0: 5002.6. Samples: 623627202. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:36,124][25689] Avg episode reward: [(0, '-24.194')] [2022-07-10 06:35:36,190][26022] Updated weights on worker 0-0, policy_version 609020 (0.00090) [2022-07-10 06:35:38,380][26022] Updated weights on worker 0-0, policy_version 609030 (0.00094) [2022-07-10 06:35:39,861][26022] Updated weights on worker 0-0, policy_version 609040 (0.00085) [2022-07-10 06:35:41,156][25689] Fps is (10 sec: 5599.9, 60 sec: 5594.2, 300 sec: 5603.7). Total num frames: 623662080. Throughput: 0: 5843.8. Samples: 623661060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:41,156][25689] Avg episode reward: [(0, '-24.635')] [2022-07-10 06:35:41,982][26022] Updated weights on worker 0-0, policy_version 609050 (0.00085) [2022-07-10 06:35:43,479][26022] Updated weights on worker 0-0, policy_version 609060 (0.00087) [2022-07-10 06:35:45,579][26022] Updated weights on worker 0-0, policy_version 609070 (0.00086) [2022-07-10 06:35:46,167][25689] Fps is (10 sec: 5607.1, 60 sec: 5618.0, 300 sec: 5611.1). Total num frames: 623691776. Throughput: 0: 5869.4. Samples: 623695298. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:46,168][25689] Avg episode reward: [(0, '-25.365')] [2022-07-10 06:35:47,183][26022] Updated weights on worker 0-0, policy_version 609080 (0.00087) [2022-07-10 06:35:49,176][26022] Updated weights on worker 0-0, policy_version 609090 (0.00090) [2022-07-10 06:35:50,648][26022] Updated weights on worker 0-0, policy_version 609100 (0.00088) [2022-07-10 06:35:51,240][25689] Fps is (10 sec: 5686.7, 60 sec: 5581.2, 300 sec: 5607.9). Total num frames: 623719424. Throughput: 0: 5898.9. Samples: 623729044. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:51,240][25689] Avg episode reward: [(0, '-24.692')] [2022-07-10 06:35:52,757][26022] Updated weights on worker 0-0, policy_version 609110 (0.00087) [2022-07-10 06:35:54,655][26022] Updated weights on worker 0-0, policy_version 609120 (0.00091) [2022-07-10 06:35:56,253][25689] Fps is (10 sec: 5584.3, 60 sec: 5601.8, 300 sec: 5604.5). Total num frames: 623748096. Throughput: 0: 5901.4. Samples: 623745996. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:35:56,253][25689] Avg episode reward: [(0, '-24.100')] [2022-07-10 06:35:56,287][26022] Updated weights on worker 0-0, policy_version 609130 (0.00092) [2022-07-10 06:35:58,119][26022] Updated weights on worker 0-0, policy_version 609140 (0.00085) [2022-07-10 06:35:59,891][26022] Updated weights on worker 0-0, policy_version 609150 (0.00083) [2022-07-10 06:36:01,262][25689] Fps is (10 sec: 5619.5, 60 sec: 5585.5, 300 sec: 5611.7). Total num frames: 623775744. Throughput: 0: 5893.1. Samples: 623779546. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:01,263][25689] Avg episode reward: [(0, '-24.678')] [2022-07-10 06:36:01,857][26022] Updated weights on worker 0-0, policy_version 609160 (0.00095) [2022-07-10 06:36:04,072][26022] Updated weights on worker 0-0, policy_version 609170 (0.00086) [2022-07-10 06:36:05,859][26022] Updated weights on worker 0-0, policy_version 609180 (0.00084) [2022-07-10 06:36:06,286][25689] Fps is (10 sec: 5307.4, 60 sec: 5588.4, 300 sec: 5602.9). Total num frames: 623801344. Throughput: 0: 5763.6. Samples: 623811252. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:06,286][25689] Avg episode reward: [(0, '-25.705')] [2022-07-10 06:36:07,755][26022] Updated weights on worker 0-0, policy_version 609190 (0.00089) [2022-07-10 06:36:08,548][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:36:08,562][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000609195_623815680.pth [2022-07-10 06:36:08,562][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000607220_621793280.pth [2022-07-10 06:36:09,564][26022] Updated weights on worker 0-0, policy_version 609200 (0.00090) [2022-07-10 06:36:11,283][26022] Updated weights on worker 0-0, policy_version 609210 (0.00093) [2022-07-10 06:36:11,315][25689] Fps is (10 sec: 5500.5, 60 sec: 5596.9, 300 sec: 5602.5). Total num frames: 623831040. Throughput: 0: 4929.5. Samples: 623828004. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:11,315][25689] Avg episode reward: [(0, '-25.662')] [2022-07-10 06:36:13,371][26022] Updated weights on worker 0-0, policy_version 609220 (0.00091) [2022-07-10 06:36:14,955][26022] Updated weights on worker 0-0, policy_version 609230 (0.00086) [2022-07-10 06:36:16,327][25689] Fps is (10 sec: 5609.0, 60 sec: 5578.9, 300 sec: 5595.8). Total num frames: 623857664. Throughput: 0: 5775.3. Samples: 623861928. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:16,329][25689] Avg episode reward: [(0, '-25.227')] [2022-07-10 06:36:16,854][26022] Updated weights on worker 0-0, policy_version 609240 (0.00537) [2022-07-10 06:36:18,617][26022] Updated weights on worker 0-0, policy_version 609250 (0.00091) [2022-07-10 06:36:20,252][26022] Updated weights on worker 0-0, policy_version 609260 (0.00089) [2022-07-10 06:36:21,349][25689] Fps is (10 sec: 5510.5, 60 sec: 5579.1, 300 sec: 5595.6). Total num frames: 623886336. Throughput: 0: 5816.0. Samples: 623896376. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:21,351][25689] Avg episode reward: [(0, '-25.481')] [2022-07-10 06:36:22,259][26022] Updated weights on worker 0-0, policy_version 609270 (0.00089) [2022-07-10 06:36:23,814][26022] Updated weights on worker 0-0, policy_version 609280 (0.00087) [2022-07-10 06:36:25,708][26022] Updated weights on worker 0-0, policy_version 609290 (0.00087) [2022-07-10 06:36:26,363][25689] Fps is (10 sec: 5917.7, 60 sec: 5630.3, 300 sec: 5611.4). Total num frames: 623917056. Throughput: 0: 5094.3. Samples: 623913532. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:26,363][25689] Avg episode reward: [(0, '-24.581')] [2022-07-10 06:36:27,700][26022] Updated weights on worker 0-0, policy_version 609300 (0.00106) [2022-07-10 06:36:29,418][26022] Updated weights on worker 0-0, policy_version 609310 (0.00086) [2022-07-10 06:36:31,391][25689] Fps is (10 sec: 5710.5, 60 sec: 5604.6, 300 sec: 5601.1). Total num frames: 623943680. Throughput: 0: 5937.8. Samples: 623947214. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 06:36:31,392][25689] Avg episode reward: [(0, '-23.166')] [2022-07-10 06:36:31,399][26022] Updated weights on worker 0-0, policy_version 609320 (0.00094) [2022-07-10 06:36:33,113][26022] Updated weights on worker 0-0, policy_version 609330 (0.00095) [2022-07-10 06:36:34,884][26022] Updated weights on worker 0-0, policy_version 609340 (0.00094) [2022-07-10 06:36:36,437][25689] Fps is (10 sec: 5387.6, 60 sec: 5568.8, 300 sec: 5597.7). Total num frames: 623971328. Throughput: 0: 5919.1. Samples: 623980960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:36:36,437][25689] Avg episode reward: [(0, '-22.366')] [2022-07-10 06:36:36,921][26022] Updated weights on worker 0-0, policy_version 609350 (0.00091) [2022-07-10 06:36:38,454][26022] Updated weights on worker 0-0, policy_version 609360 (0.00095) [2022-07-10 06:36:40,559][26022] Updated weights on worker 0-0, policy_version 609370 (0.00095) [2022-07-10 06:36:41,458][25689] Fps is (10 sec: 5696.4, 60 sec: 5620.8, 300 sec: 5601.1). Total num frames: 624001024. Throughput: 0: 5031.3. Samples: 623997548. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:36:41,458][25689] Avg episode reward: [(0, '-22.620')] [2022-07-10 06:36:42,314][26022] Updated weights on worker 0-0, policy_version 609380 (0.00084) [2022-07-10 06:36:44,179][26022] Updated weights on worker 0-0, policy_version 609390 (0.00086) [2022-07-10 06:36:45,946][26022] Updated weights on worker 0-0, policy_version 609400 (0.00088) [2022-07-10 06:36:46,473][25689] Fps is (10 sec: 5611.7, 60 sec: 5569.5, 300 sec: 5596.3). Total num frames: 624027648. Throughput: 0: 5869.9. Samples: 624031574. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:36:46,473][25689] Avg episode reward: [(0, '-22.975')] [2022-07-10 06:36:47,750][26022] Updated weights on worker 0-0, policy_version 609410 (0.00093) [2022-07-10 06:36:49,601][26022] Updated weights on worker 0-0, policy_version 609420 (0.00084) [2022-07-10 06:36:51,339][26022] Updated weights on worker 0-0, policy_version 609430 (0.00050) [2022-07-10 06:36:51,559][25689] Fps is (10 sec: 5474.2, 60 sec: 5585.2, 300 sec: 5591.9). Total num frames: 624056320. Throughput: 0: 5872.1. Samples: 624065642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:36:51,559][25689] Avg episode reward: [(0, '-22.642')] [2022-07-10 06:36:53,221][26022] Updated weights on worker 0-0, policy_version 609440 (0.00081) [2022-07-10 06:36:54,883][26022] Updated weights on worker 0-0, policy_version 609450 (0.00089) [2022-07-10 06:36:56,583][25689] Fps is (10 sec: 5672.0, 60 sec: 5584.2, 300 sec: 5595.2). Total num frames: 624084992. Throughput: 0: 5039.1. Samples: 624082478. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:36:56,583][25689] Avg episode reward: [(0, '-22.404')] [2022-07-10 06:36:56,819][26022] Updated weights on worker 0-0, policy_version 609460 (0.00096) [2022-07-10 06:36:58,479][26022] Updated weights on worker 0-0, policy_version 609470 (0.00083) [2022-07-10 06:37:00,357][26022] Updated weights on worker 0-0, policy_version 609480 (0.00086) [2022-07-10 06:37:01,605][25689] Fps is (10 sec: 5708.2, 60 sec: 5600.0, 300 sec: 5601.9). Total num frames: 624113664. Throughput: 0: 5904.8. Samples: 624116512. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:01,605][25689] Avg episode reward: [(0, '-21.937')] [2022-07-10 06:37:02,570][26022] Updated weights on worker 0-0, policy_version 609490 (0.00089) [2022-07-10 06:37:04,308][26022] Updated weights on worker 0-0, policy_version 609500 (0.00091) [2022-07-10 06:37:06,328][26022] Updated weights on worker 0-0, policy_version 609510 (0.00088) [2022-07-10 06:37:06,618][25689] Fps is (10 sec: 5408.0, 60 sec: 5601.0, 300 sec: 5593.5). Total num frames: 624139264. Throughput: 0: 5794.4. Samples: 624148304. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:06,619][25689] Avg episode reward: [(0, '-23.038')] [2022-07-10 06:37:07,954][26022] Updated weights on worker 0-0, policy_version 609520 (0.00093) [2022-07-10 06:37:10,021][26022] Updated weights on worker 0-0, policy_version 609530 (0.00086) [2022-07-10 06:37:11,524][26022] Updated weights on worker 0-0, policy_version 609540 (0.00085) [2022-07-10 06:37:11,652][25689] Fps is (10 sec: 5503.8, 60 sec: 5600.5, 300 sec: 5596.4). Total num frames: 624168960. Throughput: 0: 4936.4. Samples: 624164828. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:11,652][25689] Avg episode reward: [(0, '-23.721')] [2022-07-10 06:37:13,659][26022] Updated weights on worker 0-0, policy_version 609550 (0.00085) [2022-07-10 06:37:15,293][26022] Updated weights on worker 0-0, policy_version 609560 (0.00054) [2022-07-10 06:37:16,663][25689] Fps is (10 sec: 5709.0, 60 sec: 5617.6, 300 sec: 5597.8). Total num frames: 624196608. Throughput: 0: 5789.9. Samples: 624198740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:16,665][25689] Avg episode reward: [(0, '-24.034')] [2022-07-10 06:37:17,180][26022] Updated weights on worker 0-0, policy_version 609570 (0.00085) [2022-07-10 06:37:18,963][26022] Updated weights on worker 0-0, policy_version 609580 (0.00095) [2022-07-10 06:37:20,939][26022] Updated weights on worker 0-0, policy_version 609590 (0.00102) [2022-07-10 06:37:21,686][25689] Fps is (10 sec: 5612.7, 60 sec: 5617.5, 300 sec: 5591.1). Total num frames: 624225280. Throughput: 0: 5786.1. Samples: 624232704. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:21,688][25689] Avg episode reward: [(0, '-24.989')] [2022-07-10 06:37:22,519][26022] Updated weights on worker 0-0, policy_version 609600 (0.00087) [2022-07-10 06:37:24,599][26022] Updated weights on worker 0-0, policy_version 609610 (0.00078) [2022-07-10 06:37:26,163][26022] Updated weights on worker 0-0, policy_version 609620 (0.00086) [2022-07-10 06:37:26,723][25689] Fps is (10 sec: 5699.8, 60 sec: 5581.4, 300 sec: 5598.5). Total num frames: 624253952. Throughput: 0: 5052.1. Samples: 624249878. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:26,724][25689] Avg episode reward: [(0, '-25.351')] [2022-07-10 06:37:28,215][26022] Updated weights on worker 0-0, policy_version 609630 (0.00079) [2022-07-10 06:37:29,741][26022] Updated weights on worker 0-0, policy_version 609640 (0.00082) [2022-07-10 06:37:31,783][25689] Fps is (10 sec: 5476.5, 60 sec: 5578.5, 300 sec: 5587.7). Total num frames: 624280576. Throughput: 0: 5884.5. Samples: 624283290. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:31,783][25689] Avg episode reward: [(0, '-24.442')] [2022-07-10 06:37:31,826][26022] Updated weights on worker 0-0, policy_version 609650 (0.00078) [2022-07-10 06:37:33,355][26022] Updated weights on worker 0-0, policy_version 609660 (0.00078) [2022-07-10 06:37:35,387][26022] Updated weights on worker 0-0, policy_version 609670 (0.00089) [2022-07-10 06:37:36,803][25689] Fps is (10 sec: 5587.7, 60 sec: 5614.8, 300 sec: 5594.7). Total num frames: 624310272. Throughput: 0: 5884.9. Samples: 624317262. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:36,803][25689] Avg episode reward: [(0, '-23.690')] [2022-07-10 06:37:37,039][26022] Updated weights on worker 0-0, policy_version 609680 (0.00093) [2022-07-10 06:37:38,973][26022] Updated weights on worker 0-0, policy_version 609690 (0.00091) [2022-07-10 06:37:40,707][26022] Updated weights on worker 0-0, policy_version 609700 (0.00087) [2022-07-10 06:37:41,823][25689] Fps is (10 sec: 5711.7, 60 sec: 5581.0, 300 sec: 5591.0). Total num frames: 624337920. Throughput: 0: 5026.2. Samples: 624333914. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:41,823][25689] Avg episode reward: [(0, '-23.490')] [2022-07-10 06:37:42,615][26022] Updated weights on worker 0-0, policy_version 609710 (0.00087) [2022-07-10 06:37:44,352][26022] Updated weights on worker 0-0, policy_version 609720 (0.00085) [2022-07-10 06:37:46,234][26022] Updated weights on worker 0-0, policy_version 609730 (0.00087) [2022-07-10 06:37:46,829][25689] Fps is (10 sec: 5617.1, 60 sec: 5615.7, 300 sec: 5596.3). Total num frames: 624366592. Throughput: 0: 5882.0. Samples: 624368140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:46,830][25689] Avg episode reward: [(0, '-23.027')] [2022-07-10 06:37:48,110][26022] Updated weights on worker 0-0, policy_version 609740 (0.00094) [2022-07-10 06:37:49,935][26022] Updated weights on worker 0-0, policy_version 609750 (0.00103) [2022-07-10 06:37:51,859][26022] Updated weights on worker 0-0, policy_version 609760 (0.00097) [2022-07-10 06:37:51,886][25689] Fps is (10 sec: 5698.2, 60 sec: 5618.4, 300 sec: 5599.0). Total num frames: 624395264. Throughput: 0: 5892.1. Samples: 624401740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:51,887][25689] Avg episode reward: [(0, '-23.312')] [2022-07-10 06:37:53,500][26022] Updated weights on worker 0-0, policy_version 609770 (0.00095) [2022-07-10 06:37:55,549][26022] Updated weights on worker 0-0, policy_version 609780 (0.00309) [2022-07-10 06:37:56,937][25689] Fps is (10 sec: 5673.3, 60 sec: 5615.9, 300 sec: 5598.6). Total num frames: 624423936. Throughput: 0: 5038.0. Samples: 624418698. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:37:56,938][25689] Avg episode reward: [(0, '-23.157')] [2022-07-10 06:37:57,192][26022] Updated weights on worker 0-0, policy_version 609790 (0.00066) [2022-07-10 06:37:59,092][26022] Updated weights on worker 0-0, policy_version 609800 (0.00084) [2022-07-10 06:38:00,951][26022] Updated weights on worker 0-0, policy_version 609810 (0.00098) [2022-07-10 06:38:01,943][25689] Fps is (10 sec: 5498.6, 60 sec: 5583.5, 300 sec: 5602.4). Total num frames: 624450560. Throughput: 0: 5888.1. Samples: 624452382. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:01,943][25689] Avg episode reward: [(0, '-22.859')] [2022-07-10 06:38:03,158][26022] Updated weights on worker 0-0, policy_version 609820 (0.00087) [2022-07-10 06:38:04,883][26022] Updated weights on worker 0-0, policy_version 609830 (0.00087) [2022-07-10 06:38:06,782][26022] Updated weights on worker 0-0, policy_version 609840 (0.00086) [2022-07-10 06:38:06,947][25689] Fps is (10 sec: 5319.5, 60 sec: 5601.3, 300 sec: 5589.7). Total num frames: 624477184. Throughput: 0: 5755.0. Samples: 624483918. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:06,947][25689] Avg episode reward: [(0, '-22.763')] [2022-07-10 06:38:08,451][26022] Updated weights on worker 0-0, policy_version 609850 (0.00098) [2022-07-10 06:38:08,813][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:38:08,835][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000609852_624488448.pth [2022-07-10 06:38:08,836][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000607881_622470144.pth [2022-07-10 06:38:10,518][26022] Updated weights on worker 0-0, policy_version 609860 (0.00086) [2022-07-10 06:38:11,999][25689] Fps is (10 sec: 5600.5, 60 sec: 5599.6, 300 sec: 5600.7). Total num frames: 624506880. Throughput: 0: 4925.9. Samples: 624500812. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:11,999][25689] Avg episode reward: [(0, '-22.793')] [2022-07-10 06:38:12,004][26022] Updated weights on worker 0-0, policy_version 609870 (0.00087) [2022-07-10 06:38:14,017][26022] Updated weights on worker 0-0, policy_version 609880 (0.00085) [2022-07-10 06:38:15,738][26022] Updated weights on worker 0-0, policy_version 609890 (0.00085) [2022-07-10 06:38:17,013][25689] Fps is (10 sec: 5696.9, 60 sec: 5599.3, 300 sec: 5597.3). Total num frames: 624534528. Throughput: 0: 5786.4. Samples: 624534864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:17,013][25689] Avg episode reward: [(0, '-22.453')] [2022-07-10 06:38:17,443][26022] Updated weights on worker 0-0, policy_version 609900 (0.00097) [2022-07-10 06:38:19,394][26022] Updated weights on worker 0-0, policy_version 609910 (0.00087) [2022-07-10 06:38:21,164][26022] Updated weights on worker 0-0, policy_version 609920 (0.00087) [2022-07-10 06:38:22,023][25689] Fps is (10 sec: 5516.4, 60 sec: 5583.6, 300 sec: 5597.5). Total num frames: 624562176. Throughput: 0: 5800.5. Samples: 624568858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:22,026][25689] Avg episode reward: [(0, '-22.290')] [2022-07-10 06:38:22,900][26022] Updated weights on worker 0-0, policy_version 609930 (0.00087) [2022-07-10 06:38:24,750][26022] Updated weights on worker 0-0, policy_version 609940 (0.00087) [2022-07-10 06:38:26,620][26022] Updated weights on worker 0-0, policy_version 609950 (0.00080) [2022-07-10 06:38:27,116][25689] Fps is (10 sec: 5574.5, 60 sec: 5578.4, 300 sec: 5593.9). Total num frames: 624590848. Throughput: 0: 5060.5. Samples: 624585982. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:27,117][25689] Avg episode reward: [(0, '-22.666')] [2022-07-10 06:38:28,582][26022] Updated weights on worker 0-0, policy_version 609960 (0.00099) [2022-07-10 06:38:30,126][26022] Updated weights on worker 0-0, policy_version 609970 (0.00084) [2022-07-10 06:38:32,206][25689] Fps is (10 sec: 5631.3, 60 sec: 5609.5, 300 sec: 5592.6). Total num frames: 624619520. Throughput: 0: 5889.5. Samples: 624619822. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:32,207][25689] Avg episode reward: [(0, '-23.483')] [2022-07-10 06:38:32,215][26022] Updated weights on worker 0-0, policy_version 609980 (0.00090) [2022-07-10 06:38:33,809][26022] Updated weights on worker 0-0, policy_version 609990 (0.00086) [2022-07-10 06:38:35,637][26022] Updated weights on worker 0-0, policy_version 610000 (0.00094) [2022-07-10 06:38:37,224][25689] Fps is (10 sec: 5672.9, 60 sec: 5592.7, 300 sec: 5592.5). Total num frames: 624648192. Throughput: 0: 5881.1. Samples: 624653730. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:37,225][25689] Avg episode reward: [(0, '-24.532')] [2022-07-10 06:38:37,440][26022] Updated weights on worker 0-0, policy_version 610010 (0.00086) [2022-07-10 06:38:39,297][26022] Updated weights on worker 0-0, policy_version 610020 (0.00087) [2022-07-10 06:38:41,159][26022] Updated weights on worker 0-0, policy_version 610030 (0.00086) [2022-07-10 06:38:42,228][25689] Fps is (10 sec: 5721.9, 60 sec: 5611.2, 300 sec: 5596.2). Total num frames: 624676864. Throughput: 0: 5027.6. Samples: 624670442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:42,228][25689] Avg episode reward: [(0, '-23.611')] [2022-07-10 06:38:43,148][26022] Updated weights on worker 0-0, policy_version 610040 (0.00091) [2022-07-10 06:38:44,913][26022] Updated weights on worker 0-0, policy_version 610050 (0.00086) [2022-07-10 06:38:46,722][26022] Updated weights on worker 0-0, policy_version 610060 (0.00088) [2022-07-10 06:38:47,260][25689] Fps is (10 sec: 5611.9, 60 sec: 5591.8, 300 sec: 5595.0). Total num frames: 624704512. Throughput: 0: 5868.9. Samples: 624704206. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:47,261][25689] Avg episode reward: [(0, '-23.331')] [2022-07-10 06:38:48,659][26022] Updated weights on worker 0-0, policy_version 610070 (0.00089) [2022-07-10 06:38:50,146][26022] Updated weights on worker 0-0, policy_version 610080 (0.00083) [2022-07-10 06:38:52,156][26022] Updated weights on worker 0-0, policy_version 610090 (0.00092) [2022-07-10 06:38:52,411][25689] Fps is (10 sec: 5530.6, 60 sec: 5583.1, 300 sec: 5596.4). Total num frames: 624733184. Throughput: 0: 5843.4. Samples: 624737890. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:52,412][25689] Avg episode reward: [(0, '-23.450')] [2022-07-10 06:38:53,891][26022] Updated weights on worker 0-0, policy_version 610100 (0.00085) [2022-07-10 06:38:55,640][26022] Updated weights on worker 0-0, policy_version 610110 (0.00583) [2022-07-10 06:38:57,424][25689] Fps is (10 sec: 5642.0, 60 sec: 5586.7, 300 sec: 5593.0). Total num frames: 624761856. Throughput: 0: 5017.9. Samples: 624755090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:38:57,425][25689] Avg episode reward: [(0, '-23.624')] [2022-07-10 06:38:57,531][26022] Updated weights on worker 0-0, policy_version 610120 (0.00066) [2022-07-10 06:38:59,110][26022] Updated weights on worker 0-0, policy_version 610130 (0.00091) [2022-07-10 06:39:01,138][26022] Updated weights on worker 0-0, policy_version 610140 (0.00086) [2022-07-10 06:39:02,468][25689] Fps is (10 sec: 5498.3, 60 sec: 5583.1, 300 sec: 5602.6). Total num frames: 624788480. Throughput: 0: 5859.5. Samples: 624789040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:02,469][25689] Avg episode reward: [(0, '-23.712')] [2022-07-10 06:39:03,496][26022] Updated weights on worker 0-0, policy_version 610150 (0.00085) [2022-07-10 06:39:04,962][26022] Updated weights on worker 0-0, policy_version 610160 (0.00085) [2022-07-10 06:39:06,890][26022] Updated weights on worker 0-0, policy_version 610170 (0.00100) [2022-07-10 06:39:07,524][25689] Fps is (10 sec: 5474.7, 60 sec: 5612.1, 300 sec: 5604.3). Total num frames: 624817152. Throughput: 0: 5762.1. Samples: 624820970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:07,525][25689] Avg episode reward: [(0, '-23.509')] [2022-07-10 06:39:08,708][26022] Updated weights on worker 0-0, policy_version 610180 (0.00104) [2022-07-10 06:39:10,594][26022] Updated weights on worker 0-0, policy_version 610190 (0.00080) [2022-07-10 06:39:12,596][25689] Fps is (10 sec: 5459.8, 60 sec: 5559.6, 300 sec: 5592.8). Total num frames: 624843776. Throughput: 0: 5771.7. Samples: 624854390. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:12,597][25689] Avg episode reward: [(0, '-23.487')] [2022-07-10 06:39:12,735][26022] Updated weights on worker 0-0, policy_version 610200 (0.00079) [2022-07-10 06:39:14,056][26022] Updated weights on worker 0-0, policy_version 610210 (0.00087) [2022-07-10 06:39:16,211][26022] Updated weights on worker 0-0, policy_version 610220 (0.00088) [2022-07-10 06:39:17,669][25689] Fps is (10 sec: 5652.8, 60 sec: 5604.9, 300 sec: 5601.9). Total num frames: 624874496. Throughput: 0: 5767.4. Samples: 624871848. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:17,669][25689] Avg episode reward: [(0, '-24.317')] [2022-07-10 06:39:17,689][26022] Updated weights on worker 0-0, policy_version 610230 (0.00095) [2022-07-10 06:39:19,743][26022] Updated weights on worker 0-0, policy_version 610240 (0.00086) [2022-07-10 06:39:21,633][26022] Updated weights on worker 0-0, policy_version 610250 (0.00089) [2022-07-10 06:39:22,673][25689] Fps is (10 sec: 5792.2, 60 sec: 5605.4, 300 sec: 5599.7). Total num frames: 624902144. Throughput: 0: 5785.1. Samples: 624905928. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:22,674][25689] Avg episode reward: [(0, '-24.194')] [2022-07-10 06:39:23,218][26022] Updated weights on worker 0-0, policy_version 610260 (0.00091) [2022-07-10 06:39:25,215][26022] Updated weights on worker 0-0, policy_version 610270 (0.00083) [2022-07-10 06:39:26,812][26022] Updated weights on worker 0-0, policy_version 610280 (0.00087) [2022-07-10 06:39:27,731][25689] Fps is (10 sec: 5597.2, 60 sec: 5608.7, 300 sec: 5597.7). Total num frames: 624930816. Throughput: 0: 5878.1. Samples: 624939746. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:27,731][25689] Avg episode reward: [(0, '-24.053')] [2022-07-10 06:39:28,892][26022] Updated weights on worker 0-0, policy_version 610290 (0.00084) [2022-07-10 06:39:30,484][26022] Updated weights on worker 0-0, policy_version 610300 (0.00088) [2022-07-10 06:39:32,553][26022] Updated weights on worker 0-0, policy_version 610310 (0.00088) [2022-07-10 06:39:32,798][25689] Fps is (10 sec: 5663.7, 60 sec: 5610.8, 300 sec: 5600.1). Total num frames: 624959488. Throughput: 0: 5056.2. Samples: 624956534. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:32,798][25689] Avg episode reward: [(0, '-24.009')] [2022-07-10 06:39:34,163][26022] Updated weights on worker 0-0, policy_version 610320 (0.00088) [2022-07-10 06:39:36,155][26022] Updated weights on worker 0-0, policy_version 610330 (0.00094) [2022-07-10 06:39:37,635][26022] Updated weights on worker 0-0, policy_version 610340 (0.00095) [2022-07-10 06:39:37,827][25689] Fps is (10 sec: 5679.9, 60 sec: 5609.8, 300 sec: 5601.3). Total num frames: 624988160. Throughput: 0: 5893.2. Samples: 624990644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 06:39:37,827][25689] Avg episode reward: [(0, '-24.065')] [2022-07-10 06:39:39,683][26022] Updated weights on worker 0-0, policy_version 610350 (0.00095) [2022-07-10 06:39:41,422][26022] Updated weights on worker 0-0, policy_version 610360 (0.00093) [2022-07-10 06:39:42,835][25689] Fps is (10 sec: 5509.2, 60 sec: 5575.6, 300 sec: 5595.8). Total num frames: 625014784. Throughput: 0: 5866.9. Samples: 625024216. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:39:42,836][25689] Avg episode reward: [(0, '-24.097')] [2022-07-10 06:39:43,227][26022] Updated weights on worker 0-0, policy_version 610370 (0.00084) [2022-07-10 06:39:45,175][26022] Updated weights on worker 0-0, policy_version 610380 (0.00083) [2022-07-10 06:39:46,931][26022] Updated weights on worker 0-0, policy_version 610390 (0.00094) [2022-07-10 06:39:47,849][25689] Fps is (10 sec: 5517.0, 60 sec: 5594.1, 300 sec: 5592.9). Total num frames: 625043456. Throughput: 0: 5047.3. Samples: 625041292. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:39:47,850][25689] Avg episode reward: [(0, '-24.631')] [2022-07-10 06:39:48,618][26022] Updated weights on worker 0-0, policy_version 610400 (0.00087) [2022-07-10 06:39:50,674][26022] Updated weights on worker 0-0, policy_version 610410 (0.00086) [2022-07-10 06:39:52,212][26022] Updated weights on worker 0-0, policy_version 610420 (0.00090) [2022-07-10 06:39:52,949][25689] Fps is (10 sec: 5872.3, 60 sec: 5632.7, 300 sec: 5602.3). Total num frames: 625074176. Throughput: 0: 5889.3. Samples: 625075210. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:39:52,949][25689] Avg episode reward: [(0, '-24.911')] [2022-07-10 06:39:54,418][26022] Updated weights on worker 0-0, policy_version 610430 (0.00093) [2022-07-10 06:39:55,909][26022] Updated weights on worker 0-0, policy_version 610440 (0.00089) [2022-07-10 06:39:58,002][25689] Fps is (10 sec: 5648.0, 60 sec: 5595.1, 300 sec: 5594.7). Total num frames: 625100800. Throughput: 0: 5863.4. Samples: 625108942. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:39:58,004][25689] Avg episode reward: [(0, '-26.343')] [2022-07-10 06:39:58,005][26022] Updated weights on worker 0-0, policy_version 610450 (0.00091) [2022-07-10 06:39:59,670][26022] Updated weights on worker 0-0, policy_version 610460 (0.00093) [2022-07-10 06:40:01,649][26022] Updated weights on worker 0-0, policy_version 610470 (0.00087) [2022-07-10 06:40:03,018][25689] Fps is (10 sec: 5186.8, 60 sec: 5580.9, 300 sec: 5595.5). Total num frames: 625126400. Throughput: 0: 5038.1. Samples: 625125898. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:03,020][25689] Avg episode reward: [(0, '-26.913')] [2022-07-10 06:40:03,719][26022] Updated weights on worker 0-0, policy_version 610480 (0.00088) [2022-07-10 06:40:05,644][26022] Updated weights on worker 0-0, policy_version 610490 (0.00088) [2022-07-10 06:40:07,345][26022] Updated weights on worker 0-0, policy_version 610500 (0.00094) [2022-07-10 06:40:08,083][25689] Fps is (10 sec: 5586.7, 60 sec: 5613.8, 300 sec: 5600.0). Total num frames: 625157120. Throughput: 0: 5759.2. Samples: 625157824. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:08,085][25689] Avg episode reward: [(0, '-26.400')] [2022-07-10 06:40:09,132][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:40:09,146][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000610509_625161216.pth [2022-07-10 06:40:09,147][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000608537_623141888.pth [2022-07-10 06:40:09,267][26022] Updated weights on worker 0-0, policy_version 610510 (0.00085) [2022-07-10 06:40:10,807][26022] Updated weights on worker 0-0, policy_version 610520 (0.00088) [2022-07-10 06:40:12,934][26022] Updated weights on worker 0-0, policy_version 610530 (0.00087) [2022-07-10 06:40:13,155][25689] Fps is (10 sec: 5656.8, 60 sec: 5613.9, 300 sec: 5595.2). Total num frames: 625183744. Throughput: 0: 5742.3. Samples: 625191238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:13,155][25689] Avg episode reward: [(0, '-26.425')] [2022-07-10 06:40:14,420][26022] Updated weights on worker 0-0, policy_version 610540 (0.00090) [2022-07-10 06:40:16,571][26022] Updated weights on worker 0-0, policy_version 610550 (0.00888) [2022-07-10 06:40:18,134][26022] Updated weights on worker 0-0, policy_version 610560 (0.00089) [2022-07-10 06:40:18,233][25689] Fps is (10 sec: 5549.0, 60 sec: 5596.4, 300 sec: 5597.6). Total num frames: 625213440. Throughput: 0: 4901.8. Samples: 625208108. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:18,243][25689] Avg episode reward: [(0, '-24.889')] [2022-07-10 06:40:20,107][26022] Updated weights on worker 0-0, policy_version 610570 (0.00082) [2022-07-10 06:40:21,831][26022] Updated weights on worker 0-0, policy_version 610580 (0.00084) [2022-07-10 06:40:23,272][25689] Fps is (10 sec: 5769.2, 60 sec: 5610.1, 300 sec: 5600.7). Total num frames: 625242112. Throughput: 0: 5756.3. Samples: 625242488. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:23,272][25689] Avg episode reward: [(0, '-24.807')] [2022-07-10 06:40:23,562][26022] Updated weights on worker 0-0, policy_version 610590 (0.00093) [2022-07-10 06:40:25,301][26022] Updated weights on worker 0-0, policy_version 610600 (0.00089) [2022-07-10 06:40:27,518][26022] Updated weights on worker 0-0, policy_version 610610 (0.00085) [2022-07-10 06:40:28,291][25689] Fps is (10 sec: 5599.7, 60 sec: 5596.8, 300 sec: 5599.1). Total num frames: 625269760. Throughput: 0: 5864.4. Samples: 625276328. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:28,291][25689] Avg episode reward: [(0, '-23.377')] [2022-07-10 06:40:28,918][26022] Updated weights on worker 0-0, policy_version 610620 (0.00090) [2022-07-10 06:40:31,048][26022] Updated weights on worker 0-0, policy_version 610630 (0.00087) [2022-07-10 06:40:32,521][26022] Updated weights on worker 0-0, policy_version 610640 (0.00093) [2022-07-10 06:40:33,434][25689] Fps is (10 sec: 5542.3, 60 sec: 5589.8, 300 sec: 5593.4). Total num frames: 625298432. Throughput: 0: 5023.2. Samples: 625293104. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:33,434][25689] Avg episode reward: [(0, '-23.107')] [2022-07-10 06:40:34,730][26022] Updated weights on worker 0-0, policy_version 610650 (0.00085) [2022-07-10 06:40:36,102][26022] Updated weights on worker 0-0, policy_version 610660 (0.00082) [2022-07-10 06:40:38,346][26022] Updated weights on worker 0-0, policy_version 610670 (0.00099) [2022-07-10 06:40:38,438][25689] Fps is (10 sec: 5651.2, 60 sec: 5592.1, 300 sec: 5600.8). Total num frames: 625327104. Throughput: 0: 5878.4. Samples: 625326880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:38,438][25689] Avg episode reward: [(0, '-23.118')] [2022-07-10 06:40:39,811][26022] Updated weights on worker 0-0, policy_version 610680 (0.00086) [2022-07-10 06:40:41,909][26022] Updated weights on worker 0-0, policy_version 610690 (0.00082) [2022-07-10 06:40:43,474][25689] Fps is (10 sec: 5711.6, 60 sec: 5623.3, 300 sec: 5596.9). Total num frames: 625355776. Throughput: 0: 5855.1. Samples: 625360770. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:43,474][25689] Avg episode reward: [(0, '-23.682')] [2022-07-10 06:40:43,641][26022] Updated weights on worker 0-0, policy_version 610700 (0.00095) [2022-07-10 06:40:45,707][26022] Updated weights on worker 0-0, policy_version 610710 (0.00089) [2022-07-10 06:40:47,171][26022] Updated weights on worker 0-0, policy_version 610720 (0.00093) [2022-07-10 06:40:48,533][25689] Fps is (10 sec: 5578.7, 60 sec: 5602.3, 300 sec: 5597.2). Total num frames: 625383424. Throughput: 0: 5002.3. Samples: 625377588. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:48,534][25689] Avg episode reward: [(0, '-23.761')] [2022-07-10 06:40:49,341][26022] Updated weights on worker 0-0, policy_version 610730 (0.00090) [2022-07-10 06:40:50,977][26022] Updated weights on worker 0-0, policy_version 610740 (0.00084) [2022-07-10 06:40:52,904][26022] Updated weights on worker 0-0, policy_version 610750 (0.00090) [2022-07-10 06:40:53,650][25689] Fps is (10 sec: 5433.6, 60 sec: 5550.1, 300 sec: 5591.8). Total num frames: 625411072. Throughput: 0: 5841.6. Samples: 625411202. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:53,651][25689] Avg episode reward: [(0, '-24.290')] [2022-07-10 06:40:54,526][26022] Updated weights on worker 0-0, policy_version 610760 (0.00090) [2022-07-10 06:40:56,640][26022] Updated weights on worker 0-0, policy_version 610770 (0.00096) [2022-07-10 06:40:58,055][26022] Updated weights on worker 0-0, policy_version 610780 (0.00090) [2022-07-10 06:40:58,654][25689] Fps is (10 sec: 5767.3, 60 sec: 5622.2, 300 sec: 5602.2). Total num frames: 625441792. Throughput: 0: 5848.5. Samples: 625445112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:40:58,654][25689] Avg episode reward: [(0, '-23.995')] [2022-07-10 06:41:00,288][26022] Updated weights on worker 0-0, policy_version 610790 (0.00092) [2022-07-10 06:41:01,723][26022] Updated weights on worker 0-0, policy_version 610800 (0.00089) [2022-07-10 06:41:03,715][25689] Fps is (10 sec: 5392.4, 60 sec: 5584.2, 300 sec: 5594.6). Total num frames: 625465344. Throughput: 0: 5005.9. Samples: 625462096. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:03,715][25689] Avg episode reward: [(0, '-23.390')] [2022-07-10 06:41:04,126][26022] Updated weights on worker 0-0, policy_version 610810 (0.00094) [2022-07-10 06:41:05,963][26022] Updated weights on worker 0-0, policy_version 610820 (0.00090) [2022-07-10 06:41:07,566][26022] Updated weights on worker 0-0, policy_version 610830 (0.00085) [2022-07-10 06:41:08,739][25689] Fps is (10 sec: 5178.4, 60 sec: 5554.3, 300 sec: 5591.3). Total num frames: 625494016. Throughput: 0: 5756.7. Samples: 625493904. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:08,739][25689] Avg episode reward: [(0, '-24.147')] [2022-07-10 06:41:09,574][26022] Updated weights on worker 0-0, policy_version 610840 (0.00083) [2022-07-10 06:41:11,443][26022] Updated weights on worker 0-0, policy_version 610850 (0.00090) [2022-07-10 06:41:13,047][26022] Updated weights on worker 0-0, policy_version 610860 (0.01307) [2022-07-10 06:41:13,815][25689] Fps is (10 sec: 5981.7, 60 sec: 5638.2, 300 sec: 5607.3). Total num frames: 625525760. Throughput: 0: 5787.5. Samples: 625527904. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:13,815][25689] Avg episode reward: [(0, '-23.843')] [2022-07-10 06:41:15,165][26022] Updated weights on worker 0-0, policy_version 610870 (0.00099) [2022-07-10 06:41:16,616][26022] Updated weights on worker 0-0, policy_version 610880 (0.00089) [2022-07-10 06:41:18,809][26022] Updated weights on worker 0-0, policy_version 610890 (0.00082) [2022-07-10 06:41:18,828][25689] Fps is (10 sec: 5683.5, 60 sec: 5576.7, 300 sec: 5597.1). Total num frames: 625551360. Throughput: 0: 5780.4. Samples: 625561730. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:18,829][25689] Avg episode reward: [(0, '-22.946')] [2022-07-10 06:41:20,353][26022] Updated weights on worker 0-0, policy_version 610900 (0.00094) [2022-07-10 06:41:22,368][26022] Updated weights on worker 0-0, policy_version 610910 (0.00090) [2022-07-10 06:41:23,901][25689] Fps is (10 sec: 5584.1, 60 sec: 5607.4, 300 sec: 5596.0). Total num frames: 625582080. Throughput: 0: 5775.3. Samples: 625578676. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:23,901][25689] Avg episode reward: [(0, '-22.150')] [2022-07-10 06:41:23,907][26022] Updated weights on worker 0-0, policy_version 610920 (0.00092) [2022-07-10 06:41:25,780][26022] Updated weights on worker 0-0, policy_version 610930 (0.00088) [2022-07-10 06:41:27,742][26022] Updated weights on worker 0-0, policy_version 610940 (0.00088) [2022-07-10 06:41:28,991][25689] Fps is (10 sec: 5844.4, 60 sec: 5617.7, 300 sec: 5601.7). Total num frames: 625610752. Throughput: 0: 5860.5. Samples: 625612590. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:28,991][25689] Avg episode reward: [(0, '-23.152')] [2022-07-10 06:41:29,578][26022] Updated weights on worker 0-0, policy_version 610950 (0.00092) [2022-07-10 06:41:31,165][26022] Updated weights on worker 0-0, policy_version 610960 (0.00093) [2022-07-10 06:41:33,087][26022] Updated weights on worker 0-0, policy_version 610970 (0.00355) [2022-07-10 06:41:34,066][25689] Fps is (10 sec: 5540.3, 60 sec: 5607.0, 300 sec: 5601.2). Total num frames: 625638400. Throughput: 0: 5850.0. Samples: 625646374. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:34,067][25689] Avg episode reward: [(0, '-23.172')] [2022-07-10 06:41:34,803][26022] Updated weights on worker 0-0, policy_version 610980 (0.00086) [2022-07-10 06:41:36,823][26022] Updated weights on worker 0-0, policy_version 610990 (0.00083) [2022-07-10 06:41:38,475][26022] Updated weights on worker 0-0, policy_version 611000 (0.00098) [2022-07-10 06:41:39,072][25689] Fps is (10 sec: 5485.1, 60 sec: 5590.0, 300 sec: 5594.6). Total num frames: 625666048. Throughput: 0: 5030.9. Samples: 625663572. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:39,072][25689] Avg episode reward: [(0, '-21.899')] [2022-07-10 06:41:40,534][26022] Updated weights on worker 0-0, policy_version 611010 (0.00054) [2022-07-10 06:41:42,036][26022] Updated weights on worker 0-0, policy_version 611020 (0.00090) [2022-07-10 06:41:44,108][25689] Fps is (10 sec: 5506.7, 60 sec: 5573.1, 300 sec: 5597.6). Total num frames: 625693696. Throughput: 0: 5862.4. Samples: 625697138. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:44,109][25689] Avg episode reward: [(0, '-22.397')] [2022-07-10 06:41:44,247][26022] Updated weights on worker 0-0, policy_version 611030 (0.00088) [2022-07-10 06:41:45,796][26022] Updated weights on worker 0-0, policy_version 611040 (0.00083) [2022-07-10 06:41:47,731][26022] Updated weights on worker 0-0, policy_version 611050 (0.00091) [2022-07-10 06:41:49,218][25689] Fps is (10 sec: 5752.7, 60 sec: 5619.1, 300 sec: 5604.1). Total num frames: 625724416. Throughput: 0: 5858.0. Samples: 625731082. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:49,218][25689] Avg episode reward: [(0, '-23.660')] [2022-07-10 06:41:49,261][26022] Updated weights on worker 0-0, policy_version 611060 (0.00095) [2022-07-10 06:41:51,307][26022] Updated weights on worker 0-0, policy_version 611070 (0.00094) [2022-07-10 06:41:52,993][26022] Updated weights on worker 0-0, policy_version 611080 (0.00097) [2022-07-10 06:41:54,296][25689] Fps is (10 sec: 5829.5, 60 sec: 5639.6, 300 sec: 5603.0). Total num frames: 625753088. Throughput: 0: 5036.5. Samples: 625748260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:54,296][25689] Avg episode reward: [(0, '-23.183')] [2022-07-10 06:41:54,910][26022] Updated weights on worker 0-0, policy_version 611090 (0.00101) [2022-07-10 06:41:56,835][26022] Updated weights on worker 0-0, policy_version 611100 (0.00093) [2022-07-10 06:41:58,479][26022] Updated weights on worker 0-0, policy_version 611110 (0.00085) [2022-07-10 06:41:59,376][25689] Fps is (10 sec: 5544.5, 60 sec: 5581.9, 300 sec: 5598.5). Total num frames: 625780736. Throughput: 0: 5834.8. Samples: 625782042. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:41:59,376][25689] Avg episode reward: [(0, '-22.124')] [2022-07-10 06:42:00,318][26022] Updated weights on worker 0-0, policy_version 611120 (0.00105) [2022-07-10 06:42:02,507][26022] Updated weights on worker 0-0, policy_version 611130 (0.00098) [2022-07-10 06:42:04,310][26022] Updated weights on worker 0-0, policy_version 611140 (0.00100) [2022-07-10 06:42:04,404][25689] Fps is (10 sec: 5369.2, 60 sec: 5635.5, 300 sec: 5601.7). Total num frames: 625807360. Throughput: 0: 5749.5. Samples: 625813832. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:04,404][25689] Avg episode reward: [(0, '-23.542')] [2022-07-10 06:42:06,267][26022] Updated weights on worker 0-0, policy_version 611150 (0.00094) [2022-07-10 06:42:07,968][26022] Updated weights on worker 0-0, policy_version 611160 (0.00086) [2022-07-10 06:42:09,431][25689] Fps is (10 sec: 5295.7, 60 sec: 5601.5, 300 sec: 5591.5). Total num frames: 625833984. Throughput: 0: 4932.8. Samples: 625830788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:09,431][25689] Avg episode reward: [(0, '-23.516')] [2022-07-10 06:42:09,524][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:42:09,533][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000611167_625835008.pth [2022-07-10 06:42:09,534][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000609195_623815680.pth [2022-07-10 06:42:09,929][26022] Updated weights on worker 0-0, policy_version 611170 (0.00085) [2022-07-10 06:42:11,492][26022] Updated weights on worker 0-0, policy_version 611180 (0.00100) [2022-07-10 06:42:13,561][26022] Updated weights on worker 0-0, policy_version 611190 (0.00093) [2022-07-10 06:42:14,499][25689] Fps is (10 sec: 5680.6, 60 sec: 5585.4, 300 sec: 5600.7). Total num frames: 625864704. Throughput: 0: 5753.4. Samples: 625864494. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:14,499][25689] Avg episode reward: [(0, '-23.406')] [2022-07-10 06:42:15,236][26022] Updated weights on worker 0-0, policy_version 611200 (0.00101) [2022-07-10 06:42:17,315][26022] Updated weights on worker 0-0, policy_version 611210 (0.00084) [2022-07-10 06:42:18,883][26022] Updated weights on worker 0-0, policy_version 611220 (0.00071) [2022-07-10 06:42:19,595][25689] Fps is (10 sec: 5742.6, 60 sec: 5611.5, 300 sec: 5595.9). Total num frames: 625892352. Throughput: 0: 5735.2. Samples: 625898002. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:19,595][25689] Avg episode reward: [(0, '-23.479')] [2022-07-10 06:42:20,783][26022] Updated weights on worker 0-0, policy_version 611230 (0.00090) [2022-07-10 06:42:22,435][26022] Updated weights on worker 0-0, policy_version 611240 (0.00179) [2022-07-10 06:42:24,528][26022] Updated weights on worker 0-0, policy_version 611250 (0.00087) [2022-07-10 06:42:24,656][25689] Fps is (10 sec: 5645.6, 60 sec: 5595.7, 300 sec: 5598.9). Total num frames: 625922048. Throughput: 0: 5005.8. Samples: 625915210. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:24,656][25689] Avg episode reward: [(0, '-23.795')] [2022-07-10 06:42:26,227][26022] Updated weights on worker 0-0, policy_version 611260 (0.00096) [2022-07-10 06:42:27,992][26022] Updated weights on worker 0-0, policy_version 611270 (0.00094) [2022-07-10 06:42:29,723][25689] Fps is (10 sec: 5560.8, 60 sec: 5564.1, 300 sec: 5598.8). Total num frames: 625948672. Throughput: 0: 5826.9. Samples: 625949028. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:29,723][25689] Avg episode reward: [(0, '-24.414')] [2022-07-10 06:42:29,978][26022] Updated weights on worker 0-0, policy_version 611280 (0.00089) [2022-07-10 06:42:31,696][26022] Updated weights on worker 0-0, policy_version 611290 (0.00084) [2022-07-10 06:42:33,622][26022] Updated weights on worker 0-0, policy_version 611300 (0.00089) [2022-07-10 06:42:34,803][25689] Fps is (10 sec: 5550.6, 60 sec: 5597.4, 300 sec: 5597.6). Total num frames: 625978368. Throughput: 0: 5818.6. Samples: 625982634. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:34,803][25689] Avg episode reward: [(0, '-24.784')] [2022-07-10 06:42:35,268][26022] Updated weights on worker 0-0, policy_version 611310 (0.00092) [2022-07-10 06:42:37,136][26022] Updated weights on worker 0-0, policy_version 611320 (0.00089) [2022-07-10 06:42:38,929][26022] Updated weights on worker 0-0, policy_version 611330 (0.00085) [2022-07-10 06:42:39,826][25689] Fps is (10 sec: 5675.7, 60 sec: 5595.8, 300 sec: 5597.6). Total num frames: 626006016. Throughput: 0: 5020.5. Samples: 625999574. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:39,827][25689] Avg episode reward: [(0, '-24.749')] [2022-07-10 06:42:40,688][26022] Updated weights on worker 0-0, policy_version 611340 (0.00086) [2022-07-10 06:42:42,667][26022] Updated weights on worker 0-0, policy_version 611350 (0.00087) [2022-07-10 06:42:44,359][26022] Updated weights on worker 0-0, policy_version 611360 (0.00090) [2022-07-10 06:42:44,843][25689] Fps is (10 sec: 5507.4, 60 sec: 5597.5, 300 sec: 5593.9). Total num frames: 626033664. Throughput: 0: 5851.1. Samples: 626033326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 18.0) [2022-07-10 06:42:44,843][25689] Avg episode reward: [(0, '-24.745')] [2022-07-10 06:42:46,355][26022] Updated weights on worker 0-0, policy_version 611370 (0.00094) [2022-07-10 06:42:48,012][26022] Updated weights on worker 0-0, policy_version 611380 (0.00082) [2022-07-10 06:42:49,858][25689] Fps is (10 sec: 5512.0, 60 sec: 5555.7, 300 sec: 5591.3). Total num frames: 626061312. Throughput: 0: 5866.6. Samples: 626067154. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:42:49,859][25689] Avg episode reward: [(0, '-25.649')] [2022-07-10 06:42:50,023][26022] Updated weights on worker 0-0, policy_version 611390 (0.00087) [2022-07-10 06:42:51,715][26022] Updated weights on worker 0-0, policy_version 611400 (0.00093) [2022-07-10 06:42:53,692][26022] Updated weights on worker 0-0, policy_version 611410 (0.00093) [2022-07-10 06:42:54,928][25689] Fps is (10 sec: 5888.8, 60 sec: 5607.0, 300 sec: 5601.2). Total num frames: 626093056. Throughput: 0: 5043.4. Samples: 626084136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:42:54,929][25689] Avg episode reward: [(0, '-24.796')] [2022-07-10 06:42:55,096][26022] Updated weights on worker 0-0, policy_version 611420 (0.00098) [2022-07-10 06:42:57,336][26022] Updated weights on worker 0-0, policy_version 611430 (0.00092) [2022-07-10 06:42:59,028][26022] Updated weights on worker 0-0, policy_version 611440 (0.00102) [2022-07-10 06:42:59,931][25689] Fps is (10 sec: 5794.7, 60 sec: 5597.3, 300 sec: 5601.3). Total num frames: 626119680. Throughput: 0: 5891.8. Samples: 626118028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:42:59,931][25689] Avg episode reward: [(0, '-24.094')] [2022-07-10 06:43:00,960][26022] Updated weights on worker 0-0, policy_version 611450 (0.00098) [2022-07-10 06:43:02,956][26022] Updated weights on worker 0-0, policy_version 611460 (0.00088) [2022-07-10 06:43:05,021][25689] Fps is (10 sec: 5174.7, 60 sec: 5574.7, 300 sec: 5596.3). Total num frames: 626145280. Throughput: 0: 5765.7. Samples: 626149664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:05,021][25689] Avg episode reward: [(0, '-23.388')] [2022-07-10 06:43:05,026][26022] Updated weights on worker 0-0, policy_version 611470 (0.00083) [2022-07-10 06:43:06,418][26022] Updated weights on worker 0-0, policy_version 611480 (0.00085) [2022-07-10 06:43:08,642][26022] Updated weights on worker 0-0, policy_version 611490 (0.00090) [2022-07-10 06:43:10,058][25689] Fps is (10 sec: 5460.0, 60 sec: 5624.4, 300 sec: 5596.5). Total num frames: 626174976. Throughput: 0: 5751.2. Samples: 626183328. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:10,059][25689] Avg episode reward: [(0, '-22.802')] [2022-07-10 06:43:10,191][26022] Updated weights on worker 0-0, policy_version 611500 (0.00088) [2022-07-10 06:43:12,356][26022] Updated weights on worker 0-0, policy_version 611510 (0.00091) [2022-07-10 06:43:13,996][26022] Updated weights on worker 0-0, policy_version 611520 (0.00093) [2022-07-10 06:43:15,165][25689] Fps is (10 sec: 5450.7, 60 sec: 5536.4, 300 sec: 5587.9). Total num frames: 626200576. Throughput: 0: 5721.4. Samples: 626199920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:15,166][25689] Avg episode reward: [(0, '-23.098')] [2022-07-10 06:43:15,876][26022] Updated weights on worker 0-0, policy_version 611530 (0.00093) [2022-07-10 06:43:17,707][26022] Updated weights on worker 0-0, policy_version 611540 (0.00086) [2022-07-10 06:43:19,393][26022] Updated weights on worker 0-0, policy_version 611550 (0.00089) [2022-07-10 06:43:20,184][25689] Fps is (10 sec: 5561.8, 60 sec: 5594.1, 300 sec: 5598.0). Total num frames: 626231296. Throughput: 0: 5709.5. Samples: 626233666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:20,185][25689] Avg episode reward: [(0, '-23.323')] [2022-07-10 06:43:21,492][26022] Updated weights on worker 0-0, policy_version 611560 (0.00087) [2022-07-10 06:43:23,072][26022] Updated weights on worker 0-0, policy_version 611570 (0.00084) [2022-07-10 06:43:25,048][26022] Updated weights on worker 0-0, policy_version 611580 (0.00090) [2022-07-10 06:43:25,242][25689] Fps is (10 sec: 5894.1, 60 sec: 5577.5, 300 sec: 5598.7). Total num frames: 626259968. Throughput: 0: 5835.7. Samples: 626267670. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:25,242][25689] Avg episode reward: [(0, '-23.058')] [2022-07-10 06:43:26,854][26022] Updated weights on worker 0-0, policy_version 611590 (0.00087) [2022-07-10 06:43:28,480][26022] Updated weights on worker 0-0, policy_version 611600 (0.00088) [2022-07-10 06:43:30,296][25689] Fps is (10 sec: 5468.6, 60 sec: 5578.7, 300 sec: 5592.5). Total num frames: 626286592. Throughput: 0: 4996.8. Samples: 626284452. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:30,297][25689] Avg episode reward: [(0, '-23.589')] [2022-07-10 06:43:30,527][26022] Updated weights on worker 0-0, policy_version 611610 (0.00094) [2022-07-10 06:43:32,165][26022] Updated weights on worker 0-0, policy_version 611620 (0.00096) [2022-07-10 06:43:34,240][26022] Updated weights on worker 0-0, policy_version 611630 (0.00091) [2022-07-10 06:43:35,368][25689] Fps is (10 sec: 5562.0, 60 sec: 5579.5, 300 sec: 5594.9). Total num frames: 626316288. Throughput: 0: 5857.0. Samples: 626318246. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:35,368][25689] Avg episode reward: [(0, '-24.753')] [2022-07-10 06:43:35,821][26022] Updated weights on worker 0-0, policy_version 611640 (0.00084) [2022-07-10 06:43:37,741][26022] Updated weights on worker 0-0, policy_version 611650 (0.00089) [2022-07-10 06:43:39,470][26022] Updated weights on worker 0-0, policy_version 611660 (0.00094) [2022-07-10 06:43:40,373][25689] Fps is (10 sec: 5690.4, 60 sec: 5581.1, 300 sec: 5591.5). Total num frames: 626343936. Throughput: 0: 5866.9. Samples: 626352112. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:40,374][25689] Avg episode reward: [(0, '-24.286')] [2022-07-10 06:43:41,415][26022] Updated weights on worker 0-0, policy_version 611670 (0.00087) [2022-07-10 06:43:43,052][26022] Updated weights on worker 0-0, policy_version 611680 (0.00087) [2022-07-10 06:43:45,158][26022] Updated weights on worker 0-0, policy_version 611690 (0.00087) [2022-07-10 06:43:45,393][25689] Fps is (10 sec: 5413.3, 60 sec: 5563.9, 300 sec: 5588.3). Total num frames: 626370560. Throughput: 0: 5031.2. Samples: 626369056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:45,394][25689] Avg episode reward: [(0, '-24.446')] [2022-07-10 06:43:46,843][26022] Updated weights on worker 0-0, policy_version 611700 (0.00094) [2022-07-10 06:43:48,697][26022] Updated weights on worker 0-0, policy_version 611710 (0.00083) [2022-07-10 06:43:50,434][25689] Fps is (10 sec: 5598.1, 60 sec: 5595.4, 300 sec: 5593.8). Total num frames: 626400256. Throughput: 0: 5862.8. Samples: 626402516. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:50,434][25689] Avg episode reward: [(0, '-24.355')] [2022-07-10 06:43:50,494][26022] Updated weights on worker 0-0, policy_version 611720 (0.00098) [2022-07-10 06:43:52,512][26022] Updated weights on worker 0-0, policy_version 611730 (0.00098) [2022-07-10 06:43:54,138][26022] Updated weights on worker 0-0, policy_version 611740 (0.00085) [2022-07-10 06:43:55,489][25689] Fps is (10 sec: 5781.7, 60 sec: 5546.1, 300 sec: 5593.0). Total num frames: 626428928. Throughput: 0: 5859.6. Samples: 626436146. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:43:55,489][25689] Avg episode reward: [(0, '-23.516')] [2022-07-10 06:43:56,203][26022] Updated weights on worker 0-0, policy_version 611750 (0.00089) [2022-07-10 06:43:57,903][26022] Updated weights on worker 0-0, policy_version 611760 (0.00090) [2022-07-10 06:43:59,662][26022] Updated weights on worker 0-0, policy_version 611770 (0.00093) [2022-07-10 06:44:00,495][25689] Fps is (10 sec: 5598.0, 60 sec: 5562.7, 300 sec: 5597.1). Total num frames: 626456576. Throughput: 0: 5017.2. Samples: 626453064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:00,495][25689] Avg episode reward: [(0, '-22.528')] [2022-07-10 06:44:01,891][26022] Updated weights on worker 0-0, policy_version 611780 (0.00083) [2022-07-10 06:44:03,694][26022] Updated weights on worker 0-0, policy_version 611790 (0.00087) [2022-07-10 06:44:05,424][26022] Updated weights on worker 0-0, policy_version 611800 (0.00088) [2022-07-10 06:44:05,505][25689] Fps is (10 sec: 5418.3, 60 sec: 5586.9, 300 sec: 5591.1). Total num frames: 626483200. Throughput: 0: 5777.3. Samples: 626485246. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:05,505][25689] Avg episode reward: [(0, '-21.501')] [2022-07-10 06:44:07,320][26022] Updated weights on worker 0-0, policy_version 611810 (0.00088) [2022-07-10 06:44:09,005][26022] Updated weights on worker 0-0, policy_version 611820 (0.01133) [2022-07-10 06:44:09,661][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:44:09,677][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000611823_626506752.pth [2022-07-10 06:44:09,677][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000609852_624488448.pth [2022-07-10 06:44:10,512][25689] Fps is (10 sec: 5622.1, 60 sec: 5589.7, 300 sec: 5602.7). Total num frames: 626512896. Throughput: 0: 5813.2. Samples: 626519236. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:10,513][25689] Avg episode reward: [(0, '-20.908')] [2022-07-10 06:44:10,768][26022] Updated weights on worker 0-0, policy_version 611830 (0.00091) [2022-07-10 06:44:12,701][26022] Updated weights on worker 0-0, policy_version 611840 (0.00094) [2022-07-10 06:44:14,620][26022] Updated weights on worker 0-0, policy_version 611850 (0.00088) [2022-07-10 06:44:15,572][25689] Fps is (10 sec: 5594.8, 60 sec: 5611.1, 300 sec: 5589.1). Total num frames: 626539520. Throughput: 0: 4984.8. Samples: 626536256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:15,572][25689] Avg episode reward: [(0, '-20.464')] [2022-07-10 06:44:16,257][26022] Updated weights on worker 0-0, policy_version 611860 (0.00096) [2022-07-10 06:44:18,168][26022] Updated weights on worker 0-0, policy_version 611870 (0.00087) [2022-07-10 06:44:19,997][26022] Updated weights on worker 0-0, policy_version 611880 (0.00091) [2022-07-10 06:44:20,578][25689] Fps is (10 sec: 5391.8, 60 sec: 5561.4, 300 sec: 5589.1). Total num frames: 626567168. Throughput: 0: 5830.9. Samples: 626570168. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:20,579][25689] Avg episode reward: [(0, '-21.177')] [2022-07-10 06:44:21,695][26022] Updated weights on worker 0-0, policy_version 611890 (0.00061) [2022-07-10 06:44:23,696][26022] Updated weights on worker 0-0, policy_version 611900 (0.00082) [2022-07-10 06:44:25,138][26022] Updated weights on worker 0-0, policy_version 611910 (0.00084) [2022-07-10 06:44:25,607][25689] Fps is (10 sec: 5714.3, 60 sec: 5581.0, 300 sec: 5593.1). Total num frames: 626596864. Throughput: 0: 5920.9. Samples: 626604266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:25,607][25689] Avg episode reward: [(0, '-21.080')] [2022-07-10 06:44:27,258][26022] Updated weights on worker 0-0, policy_version 611920 (0.00096) [2022-07-10 06:44:28,974][26022] Updated weights on worker 0-0, policy_version 611930 (0.00093) [2022-07-10 06:44:30,633][25689] Fps is (10 sec: 5702.8, 60 sec: 5600.5, 300 sec: 5590.4). Total num frames: 626624512. Throughput: 0: 5058.3. Samples: 626621014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:30,634][25689] Avg episode reward: [(0, '-21.298')] [2022-07-10 06:44:30,851][26022] Updated weights on worker 0-0, policy_version 611940 (0.00089) [2022-07-10 06:44:32,734][26022] Updated weights on worker 0-0, policy_version 611950 (0.00085) [2022-07-10 06:44:34,640][26022] Updated weights on worker 0-0, policy_version 611960 (0.00094) [2022-07-10 06:44:35,710][25689] Fps is (10 sec: 5574.5, 60 sec: 5583.1, 300 sec: 5589.5). Total num frames: 626653184. Throughput: 0: 5875.2. Samples: 626654574. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:35,710][25689] Avg episode reward: [(0, '-21.943')] [2022-07-10 06:44:36,316][26022] Updated weights on worker 0-0, policy_version 611970 (0.00085) [2022-07-10 06:44:38,079][26022] Updated weights on worker 0-0, policy_version 611980 (0.00091) [2022-07-10 06:44:39,903][26022] Updated weights on worker 0-0, policy_version 611990 (0.00090) [2022-07-10 06:44:40,791][25689] Fps is (10 sec: 5645.4, 60 sec: 5593.1, 300 sec: 5595.0). Total num frames: 626681856. Throughput: 0: 5873.7. Samples: 626688894. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:40,792][25689] Avg episode reward: [(0, '-22.959')] [2022-07-10 06:44:41,760][26022] Updated weights on worker 0-0, policy_version 612000 (0.00088) [2022-07-10 06:44:43,573][26022] Updated weights on worker 0-0, policy_version 612010 (0.00092) [2022-07-10 06:44:45,298][26022] Updated weights on worker 0-0, policy_version 612020 (0.00092) [2022-07-10 06:44:45,814][25689] Fps is (10 sec: 5675.5, 60 sec: 5626.7, 300 sec: 5594.9). Total num frames: 626710528. Throughput: 0: 5038.3. Samples: 626706074. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:45,814][25689] Avg episode reward: [(0, '-21.734')] [2022-07-10 06:44:46,952][26022] Updated weights on worker 0-0, policy_version 612030 (0.00122) [2022-07-10 06:44:49,032][26022] Updated weights on worker 0-0, policy_version 612040 (0.00085) [2022-07-10 06:44:50,715][26022] Updated weights on worker 0-0, policy_version 612050 (0.00094) [2022-07-10 06:44:50,909][25689] Fps is (10 sec: 5769.0, 60 sec: 5621.7, 300 sec: 5591.5). Total num frames: 626740224. Throughput: 0: 5875.6. Samples: 626740142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:50,909][25689] Avg episode reward: [(0, '-22.615')] [2022-07-10 06:44:52,655][26022] Updated weights on worker 0-0, policy_version 612060 (0.00087) [2022-07-10 06:44:54,311][26022] Updated weights on worker 0-0, policy_version 612070 (0.00107) [2022-07-10 06:44:55,973][25689] Fps is (10 sec: 5644.4, 60 sec: 5603.8, 300 sec: 5594.7). Total num frames: 626767872. Throughput: 0: 5894.9. Samples: 626774024. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:44:55,974][25689] Avg episode reward: [(0, '-22.428')] [2022-07-10 06:44:56,236][26022] Updated weights on worker 0-0, policy_version 612080 (0.00092) [2022-07-10 06:44:58,082][26022] Updated weights on worker 0-0, policy_version 612090 (0.00085) [2022-07-10 06:44:59,786][26022] Updated weights on worker 0-0, policy_version 612100 (0.00084) [2022-07-10 06:45:00,975][25689] Fps is (10 sec: 5493.1, 60 sec: 5604.2, 300 sec: 5601.9). Total num frames: 626795520. Throughput: 0: 5055.0. Samples: 626790924. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:00,975][25689] Avg episode reward: [(0, '-22.284')] [2022-07-10 06:45:02,280][26022] Updated weights on worker 0-0, policy_version 612110 (0.00083) [2022-07-10 06:45:03,841][26022] Updated weights on worker 0-0, policy_version 612120 (0.00090) [2022-07-10 06:45:05,682][26022] Updated weights on worker 0-0, policy_version 612130 (0.00086) [2022-07-10 06:45:05,986][25689] Fps is (10 sec: 5420.0, 60 sec: 5604.1, 300 sec: 5589.1). Total num frames: 626822144. Throughput: 0: 5779.8. Samples: 626822668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:05,987][25689] Avg episode reward: [(0, '-22.545')] [2022-07-10 06:45:07,399][26022] Updated weights on worker 0-0, policy_version 612140 (0.00086) [2022-07-10 06:45:09,315][26022] Updated weights on worker 0-0, policy_version 612150 (0.00085) [2022-07-10 06:45:11,000][25689] Fps is (10 sec: 5516.0, 60 sec: 5586.6, 300 sec: 5597.1). Total num frames: 626850816. Throughput: 0: 5811.9. Samples: 626856910. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:11,000][25689] Avg episode reward: [(0, '-22.694')] [2022-07-10 06:45:11,093][26022] Updated weights on worker 0-0, policy_version 612160 (0.00088) [2022-07-10 06:45:12,912][26022] Updated weights on worker 0-0, policy_version 612170 (0.00094) [2022-07-10 06:45:14,665][26022] Updated weights on worker 0-0, policy_version 612180 (0.00090) [2022-07-10 06:45:16,078][25689] Fps is (10 sec: 5682.1, 60 sec: 5618.7, 300 sec: 5593.7). Total num frames: 626879488. Throughput: 0: 4970.7. Samples: 626873962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:16,079][25689] Avg episode reward: [(0, '-22.419')] [2022-07-10 06:45:16,440][26022] Updated weights on worker 0-0, policy_version 612190 (0.00088) [2022-07-10 06:45:18,231][26022] Updated weights on worker 0-0, policy_version 612200 (0.00084) [2022-07-10 06:45:19,944][26022] Updated weights on worker 0-0, policy_version 612210 (0.00085) [2022-07-10 06:45:21,111][25689] Fps is (10 sec: 5671.4, 60 sec: 5633.2, 300 sec: 5593.8). Total num frames: 626908160. Throughput: 0: 5816.4. Samples: 626908042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:21,112][25689] Avg episode reward: [(0, '-22.626')] [2022-07-10 06:45:22,119][26022] Updated weights on worker 0-0, policy_version 612220 (0.00088) [2022-07-10 06:45:23,548][26022] Updated weights on worker 0-0, policy_version 612230 (0.00087) [2022-07-10 06:45:25,428][26022] Updated weights on worker 0-0, policy_version 612240 (0.00084) [2022-07-10 06:45:26,114][25689] Fps is (10 sec: 5816.0, 60 sec: 5635.5, 300 sec: 5601.0). Total num frames: 626937856. Throughput: 0: 5956.6. Samples: 626942562. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:26,116][25689] Avg episode reward: [(0, '-22.486')] [2022-07-10 06:45:27,198][26022] Updated weights on worker 0-0, policy_version 612250 (0.00086) [2022-07-10 06:45:29,058][26022] Updated weights on worker 0-0, policy_version 612260 (0.00093) [2022-07-10 06:45:30,925][26022] Updated weights on worker 0-0, policy_version 612270 (0.00610) [2022-07-10 06:45:31,142][25689] Fps is (10 sec: 5716.7, 60 sec: 5635.5, 300 sec: 5599.7). Total num frames: 626965504. Throughput: 0: 5095.9. Samples: 626959552. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:31,143][25689] Avg episode reward: [(0, '-22.012')] [2022-07-10 06:45:32,787][26022] Updated weights on worker 0-0, policy_version 612280 (0.00089) [2022-07-10 06:45:34,458][26022] Updated weights on worker 0-0, policy_version 612290 (0.00708) [2022-07-10 06:45:36,188][25689] Fps is (10 sec: 5590.5, 60 sec: 5638.2, 300 sec: 5598.9). Total num frames: 626994176. Throughput: 0: 5938.1. Samples: 626993378. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:36,189][25689] Avg episode reward: [(0, '-21.064')] [2022-07-10 06:45:36,391][26022] Updated weights on worker 0-0, policy_version 612300 (0.00082) [2022-07-10 06:45:37,937][26022] Updated weights on worker 0-0, policy_version 612310 (0.00100) [2022-07-10 06:45:40,032][26022] Updated weights on worker 0-0, policy_version 612320 (0.00081) [2022-07-10 06:45:41,190][25689] Fps is (10 sec: 5808.6, 60 sec: 5662.6, 300 sec: 5603.0). Total num frames: 627023872. Throughput: 0: 5947.2. Samples: 627027458. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:41,191][25689] Avg episode reward: [(0, '-22.498')] [2022-07-10 06:45:41,613][26022] Updated weights on worker 0-0, policy_version 612330 (0.00080) [2022-07-10 06:45:43,586][26022] Updated weights on worker 0-0, policy_version 612340 (0.00087) [2022-07-10 06:45:45,240][26022] Updated weights on worker 0-0, policy_version 612350 (0.00088) [2022-07-10 06:45:46,207][25689] Fps is (10 sec: 5621.8, 60 sec: 5629.3, 300 sec: 5600.3). Total num frames: 627050496. Throughput: 0: 5077.5. Samples: 627044582. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:46,207][25689] Avg episode reward: [(0, '-23.882')] [2022-07-10 06:45:47,118][26022] Updated weights on worker 0-0, policy_version 612360 (0.00091) [2022-07-10 06:45:49,107][26022] Updated weights on worker 0-0, policy_version 612370 (0.00088) [2022-07-10 06:45:50,815][26022] Updated weights on worker 0-0, policy_version 612380 (0.00090) [2022-07-10 06:45:51,235][25689] Fps is (10 sec: 5505.0, 60 sec: 5618.5, 300 sec: 5605.5). Total num frames: 627079168. Throughput: 0: 5907.5. Samples: 627078252. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:51,235][25689] Avg episode reward: [(0, '-23.260')] [2022-07-10 06:45:52,684][26022] Updated weights on worker 0-0, policy_version 612390 (0.00091) [2022-07-10 06:45:54,391][26022] Updated weights on worker 0-0, policy_version 612400 (0.00088) [2022-07-10 06:45:56,267][26022] Updated weights on worker 0-0, policy_version 612410 (0.00086) [2022-07-10 06:45:56,359][25689] Fps is (10 sec: 5648.5, 60 sec: 5630.0, 300 sec: 5596.3). Total num frames: 627107840. Throughput: 0: 5881.5. Samples: 627112008. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 06:45:56,359][25689] Avg episode reward: [(0, '-23.334')] [2022-07-10 06:45:58,164][26022] Updated weights on worker 0-0, policy_version 612420 (0.00113) [2022-07-10 06:45:59,912][26022] Updated weights on worker 0-0, policy_version 612430 (0.00091) [2022-07-10 06:46:01,367][25689] Fps is (10 sec: 5558.4, 60 sec: 5629.3, 300 sec: 5611.1). Total num frames: 627135488. Throughput: 0: 5032.3. Samples: 627128996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:01,368][25689] Avg episode reward: [(0, '-24.391')] [2022-07-10 06:46:01,683][26022] Updated weights on worker 0-0, policy_version 612440 (0.00088) [2022-07-10 06:46:04,047][26022] Updated weights on worker 0-0, policy_version 612450 (0.00087) [2022-07-10 06:46:05,577][26022] Updated weights on worker 0-0, policy_version 612460 (0.00082) [2022-07-10 06:46:06,379][25689] Fps is (10 sec: 5518.2, 60 sec: 5646.2, 300 sec: 5607.8). Total num frames: 627163136. Throughput: 0: 5758.1. Samples: 627160738. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:06,380][25689] Avg episode reward: [(0, '-23.976')] [2022-07-10 06:46:07,723][26022] Updated weights on worker 0-0, policy_version 612470 (0.00086) [2022-07-10 06:46:09,195][26022] Updated weights on worker 0-0, policy_version 612480 (0.00085) [2022-07-10 06:46:09,746][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:46:09,761][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000612482_627181568.pth [2022-07-10 06:46:09,762][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000610509_625161216.pth [2022-07-10 06:46:11,324][26022] Updated weights on worker 0-0, policy_version 612490 (0.00053) [2022-07-10 06:46:11,396][25689] Fps is (10 sec: 5411.6, 60 sec: 5612.0, 300 sec: 5591.8). Total num frames: 627189760. Throughput: 0: 5757.0. Samples: 627194320. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:11,397][25689] Avg episode reward: [(0, '-23.388')] [2022-07-10 06:46:13,018][26022] Updated weights on worker 0-0, policy_version 612500 (0.00591) [2022-07-10 06:46:14,902][26022] Updated weights on worker 0-0, policy_version 612510 (0.00080) [2022-07-10 06:46:16,456][25689] Fps is (10 sec: 5588.9, 60 sec: 5630.7, 300 sec: 5604.6). Total num frames: 627219456. Throughput: 0: 5781.8. Samples: 627228208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:16,457][25689] Avg episode reward: [(0, '-22.900')] [2022-07-10 06:46:16,607][26022] Updated weights on worker 0-0, policy_version 612520 (0.00091) [2022-07-10 06:46:18,707][26022] Updated weights on worker 0-0, policy_version 612530 (0.00096) [2022-07-10 06:46:20,312][26022] Updated weights on worker 0-0, policy_version 612540 (0.00079) [2022-07-10 06:46:21,543][25689] Fps is (10 sec: 5550.5, 60 sec: 5591.7, 300 sec: 5590.6). Total num frames: 627246080. Throughput: 0: 5751.4. Samples: 627245034. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:21,543][25689] Avg episode reward: [(0, '-22.550')] [2022-07-10 06:46:22,379][26022] Updated weights on worker 0-0, policy_version 612550 (0.00088) [2022-07-10 06:46:23,881][26022] Updated weights on worker 0-0, policy_version 612560 (0.00081) [2022-07-10 06:46:25,791][26022] Updated weights on worker 0-0, policy_version 612570 (0.00092) [2022-07-10 06:46:26,570][25689] Fps is (10 sec: 5467.6, 60 sec: 5572.6, 300 sec: 5591.8). Total num frames: 627274752. Throughput: 0: 5846.2. Samples: 627278776. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:26,570][25689] Avg episode reward: [(0, '-21.242')] [2022-07-10 06:46:27,721][26022] Updated weights on worker 0-0, policy_version 612580 (0.00308) [2022-07-10 06:46:29,448][26022] Updated weights on worker 0-0, policy_version 612590 (0.00089) [2022-07-10 06:46:31,489][26022] Updated weights on worker 0-0, policy_version 612600 (0.00110) [2022-07-10 06:46:31,612][25689] Fps is (10 sec: 5695.0, 60 sec: 5588.2, 300 sec: 5595.9). Total num frames: 627303424. Throughput: 0: 5828.6. Samples: 627312152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:31,613][25689] Avg episode reward: [(0, '-21.913')] [2022-07-10 06:46:33,390][26022] Updated weights on worker 0-0, policy_version 612610 (0.00089) [2022-07-10 06:46:34,855][26022] Updated weights on worker 0-0, policy_version 612620 (0.00098) [2022-07-10 06:46:36,679][25689] Fps is (10 sec: 5470.0, 60 sec: 5552.5, 300 sec: 5591.3). Total num frames: 627330048. Throughput: 0: 4978.0. Samples: 627328874. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:36,679][25689] Avg episode reward: [(0, '-22.318')] [2022-07-10 06:46:37,051][26022] Updated weights on worker 0-0, policy_version 612630 (0.00087) [2022-07-10 06:46:38,376][26022] Updated weights on worker 0-0, policy_version 612640 (0.00094) [2022-07-10 06:46:40,575][26022] Updated weights on worker 0-0, policy_version 612650 (0.00098) [2022-07-10 06:46:41,711][25689] Fps is (10 sec: 5678.4, 60 sec: 5566.6, 300 sec: 5601.7). Total num frames: 627360768. Throughput: 0: 5834.1. Samples: 627362696. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:41,711][25689] Avg episode reward: [(0, '-22.253')] [2022-07-10 06:46:42,314][26022] Updated weights on worker 0-0, policy_version 612660 (0.00093) [2022-07-10 06:46:43,984][26022] Updated weights on worker 0-0, policy_version 612670 (0.00081) [2022-07-10 06:46:45,999][26022] Updated weights on worker 0-0, policy_version 612680 (0.00086) [2022-07-10 06:46:46,736][25689] Fps is (10 sec: 5701.8, 60 sec: 5565.8, 300 sec: 5589.5). Total num frames: 627387392. Throughput: 0: 5847.7. Samples: 627396700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:46,736][25689] Avg episode reward: [(0, '-22.486')] [2022-07-10 06:46:47,641][26022] Updated weights on worker 0-0, policy_version 612690 (0.00090) [2022-07-10 06:46:49,545][26022] Updated weights on worker 0-0, policy_version 612700 (0.00093) [2022-07-10 06:46:51,425][26022] Updated weights on worker 0-0, policy_version 612710 (0.00094) [2022-07-10 06:46:51,752][25689] Fps is (10 sec: 5506.8, 60 sec: 5566.9, 300 sec: 5590.7). Total num frames: 627416064. Throughput: 0: 5049.4. Samples: 627413848. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:51,753][25689] Avg episode reward: [(0, '-22.785')] [2022-07-10 06:46:53,088][26022] Updated weights on worker 0-0, policy_version 612720 (0.00086) [2022-07-10 06:46:55,185][26022] Updated weights on worker 0-0, policy_version 612730 (0.00087) [2022-07-10 06:46:56,538][26022] Updated weights on worker 0-0, policy_version 612740 (0.00084) [2022-07-10 06:46:56,795][25689] Fps is (10 sec: 5904.4, 60 sec: 5608.3, 300 sec: 5601.7). Total num frames: 627446784. Throughput: 0: 5916.8. Samples: 627447896. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:46:56,795][25689] Avg episode reward: [(0, '-24.428')] [2022-07-10 06:46:58,626][26022] Updated weights on worker 0-0, policy_version 612750 (0.00094) [2022-07-10 06:47:00,519][26022] Updated weights on worker 0-0, policy_version 612760 (0.00090) [2022-07-10 06:47:01,877][25689] Fps is (10 sec: 5663.4, 60 sec: 5584.5, 300 sec: 5600.7). Total num frames: 627473408. Throughput: 0: 5909.3. Samples: 627481868. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:01,878][25689] Avg episode reward: [(0, '-24.482')] [2022-07-10 06:47:02,615][26022] Updated weights on worker 0-0, policy_version 612770 (0.00096) [2022-07-10 06:47:04,446][26022] Updated weights on worker 0-0, policy_version 612780 (0.00093) [2022-07-10 06:47:06,082][26022] Updated weights on worker 0-0, policy_version 612790 (0.00089) [2022-07-10 06:47:06,892][25689] Fps is (10 sec: 5273.7, 60 sec: 5567.4, 300 sec: 5600.9). Total num frames: 627500032. Throughput: 0: 4968.6. Samples: 627496846. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:06,892][25689] Avg episode reward: [(0, '-24.056')] [2022-07-10 06:47:08,070][26022] Updated weights on worker 0-0, policy_version 612800 (0.00087) [2022-07-10 06:47:09,829][26022] Updated weights on worker 0-0, policy_version 612810 (0.00089) [2022-07-10 06:47:11,639][26022] Updated weights on worker 0-0, policy_version 612820 (0.00995) [2022-07-10 06:47:11,907][25689] Fps is (10 sec: 5513.2, 60 sec: 5601.3, 300 sec: 5595.0). Total num frames: 627528704. Throughput: 0: 5802.7. Samples: 627530802. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:11,908][25689] Avg episode reward: [(0, '-23.355')] [2022-07-10 06:47:13,444][26022] Updated weights on worker 0-0, policy_version 612830 (0.00089) [2022-07-10 06:47:15,252][26022] Updated weights on worker 0-0, policy_version 612840 (0.00084) [2022-07-10 06:47:16,959][25689] Fps is (10 sec: 5696.0, 60 sec: 5585.2, 300 sec: 5599.3). Total num frames: 627557376. Throughput: 0: 5778.3. Samples: 627564410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:16,959][25689] Avg episode reward: [(0, '-23.317')] [2022-07-10 06:47:17,230][26022] Updated weights on worker 0-0, policy_version 612850 (0.00092) [2022-07-10 06:47:19,092][26022] Updated weights on worker 0-0, policy_version 612860 (0.00085) [2022-07-10 06:47:20,870][26022] Updated weights on worker 0-0, policy_version 612870 (0.00085) [2022-07-10 06:47:21,984][25689] Fps is (10 sec: 5487.2, 60 sec: 5590.9, 300 sec: 5589.6). Total num frames: 627584000. Throughput: 0: 4951.7. Samples: 627581432. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:21,985][25689] Avg episode reward: [(0, '-21.402')] [2022-07-10 06:47:22,590][26022] Updated weights on worker 0-0, policy_version 612880 (0.00091) [2022-07-10 06:47:24,481][26022] Updated weights on worker 0-0, policy_version 612890 (0.00083) [2022-07-10 06:47:26,227][26022] Updated weights on worker 0-0, policy_version 612900 (0.00084) [2022-07-10 06:47:26,995][25689] Fps is (10 sec: 5509.8, 60 sec: 5592.4, 300 sec: 5597.6). Total num frames: 627612672. Throughput: 0: 5879.6. Samples: 627615046. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:26,995][25689] Avg episode reward: [(0, '-21.052')] [2022-07-10 06:47:28,207][26022] Updated weights on worker 0-0, policy_version 612910 (0.00094) [2022-07-10 06:47:29,895][26022] Updated weights on worker 0-0, policy_version 612920 (0.00083) [2022-07-10 06:47:31,755][26022] Updated weights on worker 0-0, policy_version 612930 (0.00088) [2022-07-10 06:47:32,008][25689] Fps is (10 sec: 5721.1, 60 sec: 5595.1, 300 sec: 5595.4). Total num frames: 627641344. Throughput: 0: 5860.7. Samples: 627648604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:32,009][25689] Avg episode reward: [(0, '-20.774')] [2022-07-10 06:47:33,731][26022] Updated weights on worker 0-0, policy_version 612940 (0.00087) [2022-07-10 06:47:35,343][26022] Updated weights on worker 0-0, policy_version 612950 (0.00097) [2022-07-10 06:47:37,075][25689] Fps is (10 sec: 5587.5, 60 sec: 5612.0, 300 sec: 5594.6). Total num frames: 627668992. Throughput: 0: 5013.5. Samples: 627665260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:37,077][25689] Avg episode reward: [(0, '-20.910')] [2022-07-10 06:47:37,362][26022] Updated weights on worker 0-0, policy_version 612960 (0.00084) [2022-07-10 06:47:39,328][26022] Updated weights on worker 0-0, policy_version 612970 (0.00092) [2022-07-10 06:47:40,878][26022] Updated weights on worker 0-0, policy_version 612980 (0.00092) [2022-07-10 06:47:42,087][25689] Fps is (10 sec: 5486.3, 60 sec: 5563.0, 300 sec: 5594.7). Total num frames: 627696640. Throughput: 0: 5837.1. Samples: 627698770. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:42,088][25689] Avg episode reward: [(0, '-21.587')] [2022-07-10 06:47:42,868][26022] Updated weights on worker 0-0, policy_version 612990 (0.00091) [2022-07-10 06:47:44,394][26022] Updated weights on worker 0-0, policy_version 613000 (0.00087) [2022-07-10 06:47:46,303][26022] Updated weights on worker 0-0, policy_version 613010 (0.00090) [2022-07-10 06:47:47,111][25689] Fps is (10 sec: 5713.6, 60 sec: 5614.0, 300 sec: 5601.4). Total num frames: 627726336. Throughput: 0: 5853.8. Samples: 627732802. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:47,113][25689] Avg episode reward: [(0, '-22.212')] [2022-07-10 06:47:48,398][26022] Updated weights on worker 0-0, policy_version 613020 (0.00089) [2022-07-10 06:47:49,767][26022] Updated weights on worker 0-0, policy_version 613030 (0.00094) [2022-07-10 06:47:52,022][26022] Updated weights on worker 0-0, policy_version 613040 (0.00706) [2022-07-10 06:47:52,141][25689] Fps is (10 sec: 5601.8, 60 sec: 5578.8, 300 sec: 5584.9). Total num frames: 627752960. Throughput: 0: 5010.2. Samples: 627749474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:52,141][25689] Avg episode reward: [(0, '-21.734')] [2022-07-10 06:47:53,625][26022] Updated weights on worker 0-0, policy_version 613050 (0.00098) [2022-07-10 06:47:55,642][26022] Updated weights on worker 0-0, policy_version 613060 (0.00093) [2022-07-10 06:47:57,101][26022] Updated weights on worker 0-0, policy_version 613070 (0.00109) [2022-07-10 06:47:57,224][25689] Fps is (10 sec: 5670.5, 60 sec: 5575.1, 300 sec: 5597.2). Total num frames: 627783680. Throughput: 0: 5849.0. Samples: 627783112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:47:57,224][25689] Avg episode reward: [(0, '-22.010')] [2022-07-10 06:47:59,163][26022] Updated weights on worker 0-0, policy_version 613080 (0.00081) [2022-07-10 06:48:01,118][26022] Updated weights on worker 0-0, policy_version 613090 (0.00083) [2022-07-10 06:48:02,279][25689] Fps is (10 sec: 5454.3, 60 sec: 5543.7, 300 sec: 5594.4). Total num frames: 627808256. Throughput: 0: 5861.6. Samples: 627817128. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:02,279][25689] Avg episode reward: [(0, '-23.128')] [2022-07-10 06:48:03,109][26022] Updated weights on worker 0-0, policy_version 613100 (0.00085) [2022-07-10 06:48:05,093][26022] Updated weights on worker 0-0, policy_version 613110 (0.00354) [2022-07-10 06:48:06,796][26022] Updated weights on worker 0-0, policy_version 613120 (0.00093) [2022-07-10 06:48:07,301][25689] Fps is (10 sec: 5385.6, 60 sec: 5593.8, 300 sec: 5594.7). Total num frames: 627837952. Throughput: 0: 4903.7. Samples: 627831808. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:07,303][25689] Avg episode reward: [(0, '-22.362')] [2022-07-10 06:48:08,522][26022] Updated weights on worker 0-0, policy_version 613130 (0.00088) [2022-07-10 06:48:09,817][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:48:09,826][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000613136_627851264.pth [2022-07-10 06:48:09,826][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000611167_625835008.pth [2022-07-10 06:48:10,436][26022] Updated weights on worker 0-0, policy_version 613140 (0.00103) [2022-07-10 06:48:12,328][25689] Fps is (10 sec: 5706.1, 60 sec: 5575.8, 300 sec: 5603.1). Total num frames: 627865600. Throughput: 0: 5754.7. Samples: 627865648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:12,329][25689] Avg episode reward: [(0, '-21.709')] [2022-07-10 06:48:12,330][26022] Updated weights on worker 0-0, policy_version 613150 (0.00098) [2022-07-10 06:48:14,073][26022] Updated weights on worker 0-0, policy_version 613160 (0.00092) [2022-07-10 06:48:15,938][26022] Updated weights on worker 0-0, policy_version 613170 (0.00096) [2022-07-10 06:48:17,399][25689] Fps is (10 sec: 5374.6, 60 sec: 5540.2, 300 sec: 5588.3). Total num frames: 627892224. Throughput: 0: 5744.7. Samples: 627899012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:17,399][25689] Avg episode reward: [(0, '-22.630')] [2022-07-10 06:48:17,863][26022] Updated weights on worker 0-0, policy_version 613180 (0.00088) [2022-07-10 06:48:19,632][26022] Updated weights on worker 0-0, policy_version 613190 (0.00092) [2022-07-10 06:48:21,501][26022] Updated weights on worker 0-0, policy_version 613200 (0.00093) [2022-07-10 06:48:22,463][25689] Fps is (10 sec: 5557.1, 60 sec: 5587.4, 300 sec: 5591.6). Total num frames: 627921920. Throughput: 0: 4878.7. Samples: 627915600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:22,463][25689] Avg episode reward: [(0, '-22.871')] [2022-07-10 06:48:23,495][26022] Updated weights on worker 0-0, policy_version 613210 (0.00089) [2022-07-10 06:48:25,175][26022] Updated weights on worker 0-0, policy_version 613220 (0.00093) [2022-07-10 06:48:27,111][26022] Updated weights on worker 0-0, policy_version 613230 (0.00085) [2022-07-10 06:48:27,466][25689] Fps is (10 sec: 5797.9, 60 sec: 5588.1, 300 sec: 5599.5). Total num frames: 627950592. Throughput: 0: 5830.5. Samples: 627949380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:27,466][25689] Avg episode reward: [(0, '-21.966')] [2022-07-10 06:48:28,766][26022] Updated weights on worker 0-0, policy_version 613240 (0.00090) [2022-07-10 06:48:30,674][26022] Updated weights on worker 0-0, policy_version 613250 (0.00085) [2022-07-10 06:48:32,496][25689] Fps is (10 sec: 5511.5, 60 sec: 5552.7, 300 sec: 5589.9). Total num frames: 627977216. Throughput: 0: 5815.4. Samples: 627982930. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:32,497][25689] Avg episode reward: [(0, '-21.103')] [2022-07-10 06:48:32,647][26022] Updated weights on worker 0-0, policy_version 613260 (0.00084) [2022-07-10 06:48:34,271][26022] Updated weights on worker 0-0, policy_version 613270 (0.00085) [2022-07-10 06:48:36,224][26022] Updated weights on worker 0-0, policy_version 613280 (0.00094) [2022-07-10 06:48:37,557][25689] Fps is (10 sec: 5581.0, 60 sec: 5587.1, 300 sec: 5595.8). Total num frames: 628006912. Throughput: 0: 4993.9. Samples: 627999682. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:37,558][25689] Avg episode reward: [(0, '-21.652')] [2022-07-10 06:48:37,928][26022] Updated weights on worker 0-0, policy_version 613290 (0.00083) [2022-07-10 06:48:39,751][26022] Updated weights on worker 0-0, policy_version 613300 (0.00081) [2022-07-10 06:48:41,687][26022] Updated weights on worker 0-0, policy_version 613310 (0.00089) [2022-07-10 06:48:42,607][25689] Fps is (10 sec: 5671.5, 60 sec: 5583.6, 300 sec: 5598.7). Total num frames: 628034560. Throughput: 0: 5853.4. Samples: 628033510. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:42,607][25689] Avg episode reward: [(0, '-20.514')] [2022-07-10 06:48:43,562][26022] Updated weights on worker 0-0, policy_version 613320 (0.00091) [2022-07-10 06:48:45,176][26022] Updated weights on worker 0-0, policy_version 613330 (0.00090) [2022-07-10 06:48:47,341][26022] Updated weights on worker 0-0, policy_version 613340 (0.00091) [2022-07-10 06:48:47,615][25689] Fps is (10 sec: 5498.0, 60 sec: 5551.3, 300 sec: 5592.4). Total num frames: 628062208. Throughput: 0: 5857.9. Samples: 628067410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:47,615][25689] Avg episode reward: [(0, '-20.961')] [2022-07-10 06:48:48,797][26022] Updated weights on worker 0-0, policy_version 613350 (0.00084) [2022-07-10 06:48:50,792][26022] Updated weights on worker 0-0, policy_version 613360 (0.00092) [2022-07-10 06:48:52,629][25689] Fps is (10 sec: 5517.2, 60 sec: 5569.6, 300 sec: 5589.7). Total num frames: 628089856. Throughput: 0: 5035.4. Samples: 628084310. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:52,630][25689] Avg episode reward: [(0, '-19.960')] [2022-07-10 06:48:52,680][26022] Updated weights on worker 0-0, policy_version 613370 (0.00091) [2022-07-10 06:48:54,504][26022] Updated weights on worker 0-0, policy_version 613380 (0.00087) [2022-07-10 06:48:56,578][26022] Updated weights on worker 0-0, policy_version 613390 (0.00088) [2022-07-10 06:48:57,705][25689] Fps is (10 sec: 5581.3, 60 sec: 5536.4, 300 sec: 5591.8). Total num frames: 628118528. Throughput: 0: 5823.4. Samples: 628117012. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:48:57,706][25689] Avg episode reward: [(0, '-20.869')] [2022-07-10 06:48:58,263][26022] Updated weights on worker 0-0, policy_version 613400 (0.00081) [2022-07-10 06:49:00,031][26022] Updated weights on worker 0-0, policy_version 613410 (0.00094) [2022-07-10 06:49:02,240][26022] Updated weights on worker 0-0, policy_version 613420 (0.00090) [2022-07-10 06:49:02,736][25689] Fps is (10 sec: 5268.7, 60 sec: 5538.6, 300 sec: 5584.6). Total num frames: 628143104. Throughput: 0: 5785.3. Samples: 628149962. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:49:02,736][25689] Avg episode reward: [(0, '-20.321')] [2022-07-10 06:49:04,026][26022] Updated weights on worker 0-0, policy_version 613430 (0.00092) [2022-07-10 06:49:05,957][26022] Updated weights on worker 0-0, policy_version 613440 (0.00095) [2022-07-10 06:49:07,537][26022] Updated weights on worker 0-0, policy_version 613450 (0.00091) [2022-07-10 06:49:07,755][25689] Fps is (10 sec: 5502.5, 60 sec: 5555.8, 300 sec: 5587.8). Total num frames: 628173824. Throughput: 0: 4889.4. Samples: 628165882. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:07,755][25689] Avg episode reward: [(0, '-19.992')] [2022-07-10 06:49:09,408][26022] Updated weights on worker 0-0, policy_version 613460 (0.00098) [2022-07-10 06:49:11,257][26022] Updated weights on worker 0-0, policy_version 613470 (0.00093) [2022-07-10 06:49:12,762][25689] Fps is (10 sec: 5719.5, 60 sec: 5540.8, 300 sec: 5588.8). Total num frames: 628200448. Throughput: 0: 5735.1. Samples: 628199772. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:12,762][25689] Avg episode reward: [(0, '-20.050')] [2022-07-10 06:49:13,508][26022] Updated weights on worker 0-0, policy_version 613480 (0.00093) [2022-07-10 06:49:14,811][26022] Updated weights on worker 0-0, policy_version 613490 (0.00080) [2022-07-10 06:49:17,027][26022] Updated weights on worker 0-0, policy_version 613500 (0.00090) [2022-07-10 06:49:17,865][25689] Fps is (10 sec: 5570.7, 60 sec: 5588.6, 300 sec: 5593.8). Total num frames: 628230144. Throughput: 0: 5767.2. Samples: 628233274. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:17,865][25689] Avg episode reward: [(0, '-21.639')] [2022-07-10 06:49:18,304][26022] Updated weights on worker 0-0, policy_version 613510 (0.00093) [2022-07-10 06:49:20,690][26022] Updated weights on worker 0-0, policy_version 613520 (0.00095) [2022-07-10 06:49:22,323][26022] Updated weights on worker 0-0, policy_version 613530 (0.00084) [2022-07-10 06:49:22,918][25689] Fps is (10 sec: 5545.1, 60 sec: 5538.8, 300 sec: 5583.0). Total num frames: 628256768. Throughput: 0: 5793.2. Samples: 628266884. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:22,919][25689] Avg episode reward: [(0, '-21.890')] [2022-07-10 06:49:24,048][26022] Updated weights on worker 0-0, policy_version 613540 (0.00090) [2022-07-10 06:49:26,193][26022] Updated weights on worker 0-0, policy_version 613550 (0.00086) [2022-07-10 06:49:27,713][26022] Updated weights on worker 0-0, policy_version 613560 (0.00086) [2022-07-10 06:49:27,949][25689] Fps is (10 sec: 5483.3, 60 sec: 5536.2, 300 sec: 5586.4). Total num frames: 628285440. Throughput: 0: 5818.2. Samples: 628283378. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:27,950][25689] Avg episode reward: [(0, '-21.769')] [2022-07-10 06:49:29,696][26022] Updated weights on worker 0-0, policy_version 613570 (0.00090) [2022-07-10 06:49:31,568][26022] Updated weights on worker 0-0, policy_version 613580 (0.00105) [2022-07-10 06:49:32,961][25689] Fps is (10 sec: 5710.0, 60 sec: 5571.7, 300 sec: 5587.6). Total num frames: 628314112. Throughput: 0: 5808.3. Samples: 628317096. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:32,961][25689] Avg episode reward: [(0, '-21.814')] [2022-07-10 06:49:33,295][26022] Updated weights on worker 0-0, policy_version 613590 (0.00086) [2022-07-10 06:49:35,229][26022] Updated weights on worker 0-0, policy_version 613600 (0.00088) [2022-07-10 06:49:37,167][26022] Updated weights on worker 0-0, policy_version 613610 (0.00097) [2022-07-10 06:49:38,047][25689] Fps is (10 sec: 5577.7, 60 sec: 5535.7, 300 sec: 5584.1). Total num frames: 628341760. Throughput: 0: 5806.1. Samples: 628350452. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:38,047][25689] Avg episode reward: [(0, '-21.298')] [2022-07-10 06:49:38,908][26022] Updated weights on worker 0-0, policy_version 613620 (0.00084) [2022-07-10 06:49:40,815][26022] Updated weights on worker 0-0, policy_version 613630 (0.00080) [2022-07-10 06:49:42,503][26022] Updated weights on worker 0-0, policy_version 613640 (0.00094) [2022-07-10 06:49:43,068][25689] Fps is (10 sec: 5572.4, 60 sec: 5555.2, 300 sec: 5584.1). Total num frames: 628370432. Throughput: 0: 4975.1. Samples: 628367130. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:43,068][25689] Avg episode reward: [(0, '-20.704')] [2022-07-10 06:49:44,417][26022] Updated weights on worker 0-0, policy_version 613650 (0.00086) [2022-07-10 06:49:46,107][26022] Updated weights on worker 0-0, policy_version 613660 (0.00084) [2022-07-10 06:49:48,016][26022] Updated weights on worker 0-0, policy_version 613670 (0.00081) [2022-07-10 06:49:48,100][25689] Fps is (10 sec: 5602.0, 60 sec: 5553.0, 300 sec: 5578.4). Total num frames: 628398080. Throughput: 0: 5834.7. Samples: 628400952. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:48,101][25689] Avg episode reward: [(0, '-19.571')] [2022-07-10 06:49:49,852][26022] Updated weights on worker 0-0, policy_version 613680 (0.00118) [2022-07-10 06:49:51,636][26022] Updated weights on worker 0-0, policy_version 613690 (0.00093) [2022-07-10 06:49:53,106][25689] Fps is (10 sec: 5508.9, 60 sec: 5553.8, 300 sec: 5579.5). Total num frames: 628425728. Throughput: 0: 5851.9. Samples: 628434980. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:53,106][25689] Avg episode reward: [(0, '-19.428')] [2022-07-10 06:49:53,575][26022] Updated weights on worker 0-0, policy_version 613700 (0.00103) [2022-07-10 06:49:55,166][26022] Updated weights on worker 0-0, policy_version 613710 (0.00090) [2022-07-10 06:49:57,205][26022] Updated weights on worker 0-0, policy_version 613720 (0.00093) [2022-07-10 06:49:58,182][25689] Fps is (10 sec: 5586.1, 60 sec: 5553.7, 300 sec: 5581.6). Total num frames: 628454400. Throughput: 0: 5027.4. Samples: 628451682. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:49:58,183][25689] Avg episode reward: [(0, '-19.157')] [2022-07-10 06:49:58,971][26022] Updated weights on worker 0-0, policy_version 613730 (0.00093) [2022-07-10 06:50:00,924][26022] Updated weights on worker 0-0, policy_version 613740 (0.00096) [2022-07-10 06:50:02,902][26022] Updated weights on worker 0-0, policy_version 613750 (0.00090) [2022-07-10 06:50:03,195][25689] Fps is (10 sec: 5480.8, 60 sec: 5589.2, 300 sec: 5581.5). Total num frames: 628481024. Throughput: 0: 5761.5. Samples: 628483090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:03,195][25689] Avg episode reward: [(0, '-19.291')] [2022-07-10 06:50:04,895][26022] Updated weights on worker 0-0, policy_version 613760 (0.00091) [2022-07-10 06:50:06,691][26022] Updated weights on worker 0-0, policy_version 613770 (0.00089) [2022-07-10 06:50:08,197][25689] Fps is (10 sec: 5317.0, 60 sec: 5523.0, 300 sec: 5574.9). Total num frames: 628507648. Throughput: 0: 5743.1. Samples: 628516370. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:08,197][25689] Avg episode reward: [(0, '-19.925')] [2022-07-10 06:50:08,734][26022] Updated weights on worker 0-0, policy_version 613780 (0.00093) [2022-07-10 06:50:09,957][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:50:09,966][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000613787_628517888.pth [2022-07-10 06:50:09,967][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000611823_626506752.pth [2022-07-10 06:50:10,491][26022] Updated weights on worker 0-0, policy_version 613790 (0.00102) [2022-07-10 06:50:12,335][26022] Updated weights on worker 0-0, policy_version 613800 (0.00084) [2022-07-10 06:50:13,206][25689] Fps is (10 sec: 5421.3, 60 sec: 5539.8, 300 sec: 5572.7). Total num frames: 628535296. Throughput: 0: 4858.6. Samples: 628532638. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:13,206][25689] Avg episode reward: [(0, '-20.254')] [2022-07-10 06:50:14,264][26022] Updated weights on worker 0-0, policy_version 613810 (0.00090) [2022-07-10 06:50:15,954][26022] Updated weights on worker 0-0, policy_version 613820 (0.00091) [2022-07-10 06:50:17,924][26022] Updated weights on worker 0-0, policy_version 613830 (0.00115) [2022-07-10 06:50:18,283][25689] Fps is (10 sec: 5584.0, 60 sec: 5525.2, 300 sec: 5571.9). Total num frames: 628563968. Throughput: 0: 5689.6. Samples: 628566046. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:18,283][25689] Avg episode reward: [(0, '-21.237')] [2022-07-10 06:50:19,656][26022] Updated weights on worker 0-0, policy_version 613840 (0.00086) [2022-07-10 06:50:21,427][26022] Updated weights on worker 0-0, policy_version 613850 (0.00080) [2022-07-10 06:50:23,301][25689] Fps is (10 sec: 5578.9, 60 sec: 5545.5, 300 sec: 5564.7). Total num frames: 628591616. Throughput: 0: 5790.9. Samples: 628599522. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:23,301][25689] Avg episode reward: [(0, '-22.154')] [2022-07-10 06:50:23,661][26022] Updated weights on worker 0-0, policy_version 613860 (0.00091) [2022-07-10 06:50:25,167][26022] Updated weights on worker 0-0, policy_version 613870 (0.00091) [2022-07-10 06:50:27,205][26022] Updated weights on worker 0-0, policy_version 613880 (0.00094) [2022-07-10 06:50:28,332][25689] Fps is (10 sec: 5604.4, 60 sec: 5545.4, 300 sec: 5568.1). Total num frames: 628620288. Throughput: 0: 4952.1. Samples: 628616082. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:28,334][25689] Avg episode reward: [(0, '-21.086')] [2022-07-10 06:50:28,987][26022] Updated weights on worker 0-0, policy_version 613890 (0.00090) [2022-07-10 06:50:30,746][26022] Updated weights on worker 0-0, policy_version 613900 (0.00088) [2022-07-10 06:50:32,621][26022] Updated weights on worker 0-0, policy_version 613910 (0.00086) [2022-07-10 06:50:33,343][25689] Fps is (10 sec: 5608.0, 60 sec: 5528.5, 300 sec: 5565.3). Total num frames: 628647936. Throughput: 0: 5797.3. Samples: 628649384. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:33,344][25689] Avg episode reward: [(0, '-20.247')] [2022-07-10 06:50:34,384][26022] Updated weights on worker 0-0, policy_version 613920 (0.00084) [2022-07-10 06:50:36,222][26022] Updated weights on worker 0-0, policy_version 613930 (0.00089) [2022-07-10 06:50:38,020][26022] Updated weights on worker 0-0, policy_version 613940 (0.00086) [2022-07-10 06:50:38,387][25689] Fps is (10 sec: 5601.4, 60 sec: 5549.3, 300 sec: 5561.1). Total num frames: 628676608. Throughput: 0: 5828.7. Samples: 628683226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:38,387][25689] Avg episode reward: [(0, '-20.296')] [2022-07-10 06:50:39,882][26022] Updated weights on worker 0-0, policy_version 613950 (0.00084) [2022-07-10 06:50:41,786][26022] Updated weights on worker 0-0, policy_version 613960 (0.00086) [2022-07-10 06:50:43,403][25689] Fps is (10 sec: 5598.6, 60 sec: 5532.8, 300 sec: 5564.5). Total num frames: 628704256. Throughput: 0: 5005.8. Samples: 628700150. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:43,404][25689] Avg episode reward: [(0, '-19.927')] [2022-07-10 06:50:43,500][26022] Updated weights on worker 0-0, policy_version 613970 (0.00083) [2022-07-10 06:50:45,471][26022] Updated weights on worker 0-0, policy_version 613980 (0.00081) [2022-07-10 06:50:47,143][26022] Updated weights on worker 0-0, policy_version 613990 (0.00095) [2022-07-10 06:50:48,409][25689] Fps is (10 sec: 5619.7, 60 sec: 5552.2, 300 sec: 5565.0). Total num frames: 628732928. Throughput: 0: 5868.3. Samples: 628733896. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:48,409][25689] Avg episode reward: [(0, '-19.093')] [2022-07-10 06:50:49,113][26022] Updated weights on worker 0-0, policy_version 614000 (0.00085) [2022-07-10 06:50:50,861][26022] Updated weights on worker 0-0, policy_version 614010 (0.00098) [2022-07-10 06:50:52,811][26022] Updated weights on worker 0-0, policy_version 614020 (0.00086) [2022-07-10 06:50:53,428][25689] Fps is (10 sec: 5617.9, 60 sec: 5551.0, 300 sec: 5563.5). Total num frames: 628760576. Throughput: 0: 5894.0. Samples: 628767764. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:53,429][25689] Avg episode reward: [(0, '-17.450')] [2022-07-10 06:50:54,371][26022] Updated weights on worker 0-0, policy_version 614030 (0.00083) [2022-07-10 06:50:56,279][26022] Updated weights on worker 0-0, policy_version 614040 (0.00084) [2022-07-10 06:50:58,322][26022] Updated weights on worker 0-0, policy_version 614050 (0.00091) [2022-07-10 06:50:58,516][25689] Fps is (10 sec: 5470.9, 60 sec: 5533.0, 300 sec: 5562.0). Total num frames: 628788224. Throughput: 0: 5037.9. Samples: 628784634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:50:58,516][25689] Avg episode reward: [(0, '-16.816')] [2022-07-10 06:50:59,774][26022] Updated weights on worker 0-0, policy_version 614060 (0.00086) [2022-07-10 06:51:02,260][26022] Updated weights on worker 0-0, policy_version 614070 (0.00090) [2022-07-10 06:51:03,587][25689] Fps is (10 sec: 5443.3, 60 sec: 5544.6, 300 sec: 5560.9). Total num frames: 628815872. Throughput: 0: 5741.2. Samples: 628816028. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:03,587][25689] Avg episode reward: [(0, '-17.219')] [2022-07-10 06:51:04,074][26022] Updated weights on worker 0-0, policy_version 614080 (0.00090) [2022-07-10 06:51:05,869][26022] Updated weights on worker 0-0, policy_version 614090 (0.00090) [2022-07-10 06:51:07,647][26022] Updated weights on worker 0-0, policy_version 614100 (0.00090) [2022-07-10 06:51:08,624][25689] Fps is (10 sec: 5470.3, 60 sec: 5558.3, 300 sec: 5563.9). Total num frames: 628843520. Throughput: 0: 5737.4. Samples: 628849882. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:08,625][25689] Avg episode reward: [(0, '-17.141')] [2022-07-10 06:51:09,447][26022] Updated weights on worker 0-0, policy_version 614110 (0.00082) [2022-07-10 06:51:11,359][26022] Updated weights on worker 0-0, policy_version 614120 (0.00093) [2022-07-10 06:51:13,184][26022] Updated weights on worker 0-0, policy_version 614130 (0.00090) [2022-07-10 06:51:13,633][25689] Fps is (10 sec: 5605.8, 60 sec: 5575.2, 300 sec: 5561.5). Total num frames: 628872192. Throughput: 0: 4895.1. Samples: 628866672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:13,634][25689] Avg episode reward: [(0, '-17.644')] [2022-07-10 06:51:14,941][26022] Updated weights on worker 0-0, policy_version 614140 (0.00090) [2022-07-10 06:51:16,779][26022] Updated weights on worker 0-0, policy_version 614150 (0.00097) [2022-07-10 06:51:18,676][26022] Updated weights on worker 0-0, policy_version 614160 (0.00085) [2022-07-10 06:51:18,727][25689] Fps is (10 sec: 5574.9, 60 sec: 5556.8, 300 sec: 5564.8). Total num frames: 628899840. Throughput: 0: 5733.0. Samples: 628900502. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:18,727][25689] Avg episode reward: [(0, '-16.613')] [2022-07-10 06:51:20,502][26022] Updated weights on worker 0-0, policy_version 614170 (0.00085) [2022-07-10 06:51:22,167][26022] Updated weights on worker 0-0, policy_version 614180 (0.00082) [2022-07-10 06:51:23,735][25689] Fps is (10 sec: 5575.2, 60 sec: 5574.6, 300 sec: 5565.1). Total num frames: 628928512. Throughput: 0: 5865.6. Samples: 628934212. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:23,736][25689] Avg episode reward: [(0, '-17.257')] [2022-07-10 06:51:24,187][26022] Updated weights on worker 0-0, policy_version 614190 (0.00089) [2022-07-10 06:51:25,895][26022] Updated weights on worker 0-0, policy_version 614200 (0.00091) [2022-07-10 06:51:27,899][26022] Updated weights on worker 0-0, policy_version 614210 (0.00092) [2022-07-10 06:51:28,775][25689] Fps is (10 sec: 5706.6, 60 sec: 5573.8, 300 sec: 5565.2). Total num frames: 628957184. Throughput: 0: 5011.2. Samples: 628950864. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:28,776][25689] Avg episode reward: [(0, '-17.892')] [2022-07-10 06:51:29,783][26022] Updated weights on worker 0-0, policy_version 614220 (0.00085) [2022-07-10 06:51:31,454][26022] Updated weights on worker 0-0, policy_version 614230 (0.00093) [2022-07-10 06:51:33,358][26022] Updated weights on worker 0-0, policy_version 614240 (0.00107) [2022-07-10 06:51:33,815][25689] Fps is (10 sec: 5384.4, 60 sec: 5537.3, 300 sec: 5562.2). Total num frames: 628982784. Throughput: 0: 5822.7. Samples: 628984182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:33,815][25689] Avg episode reward: [(0, '-18.155')] [2022-07-10 06:51:35,160][26022] Updated weights on worker 0-0, policy_version 614250 (0.00090) [2022-07-10 06:51:37,047][26022] Updated weights on worker 0-0, policy_version 614260 (0.00083) [2022-07-10 06:51:38,856][25689] Fps is (10 sec: 5485.6, 60 sec: 5554.5, 300 sec: 5558.6). Total num frames: 629012480. Throughput: 0: 5817.2. Samples: 629017596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:38,859][25689] Avg episode reward: [(0, '-19.116')] [2022-07-10 06:51:38,863][26022] Updated weights on worker 0-0, policy_version 614270 (0.00657) [2022-07-10 06:51:40,599][26022] Updated weights on worker 0-0, policy_version 614280 (0.00084) [2022-07-10 06:51:42,472][26022] Updated weights on worker 0-0, policy_version 614290 (0.00089) [2022-07-10 06:51:43,883][25689] Fps is (10 sec: 5695.8, 60 sec: 5553.5, 300 sec: 5562.0). Total num frames: 629040128. Throughput: 0: 4977.8. Samples: 629034502. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:43,884][25689] Avg episode reward: [(0, '-19.900')] [2022-07-10 06:51:44,191][26022] Updated weights on worker 0-0, policy_version 614300 (0.00085) [2022-07-10 06:51:46,012][26022] Updated weights on worker 0-0, policy_version 614310 (0.00084) [2022-07-10 06:51:47,714][26022] Updated weights on worker 0-0, policy_version 614320 (0.00093) [2022-07-10 06:51:48,907][25689] Fps is (10 sec: 5603.4, 60 sec: 5551.8, 300 sec: 5561.9). Total num frames: 629068800. Throughput: 0: 5842.0. Samples: 629068470. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:48,908][25689] Avg episode reward: [(0, '-20.518')] [2022-07-10 06:51:49,736][26022] Updated weights on worker 0-0, policy_version 614330 (0.00085) [2022-07-10 06:51:51,502][26022] Updated weights on worker 0-0, policy_version 614340 (0.00049) [2022-07-10 06:51:53,202][26022] Updated weights on worker 0-0, policy_version 614350 (0.00088) [2022-07-10 06:51:53,926][25689] Fps is (10 sec: 5607.8, 60 sec: 5551.8, 300 sec: 5552.0). Total num frames: 629096448. Throughput: 0: 5878.7. Samples: 629102408. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:53,927][25689] Avg episode reward: [(0, '-20.275')] [2022-07-10 06:51:55,080][26022] Updated weights on worker 0-0, policy_version 614360 (0.00085) [2022-07-10 06:51:56,888][26022] Updated weights on worker 0-0, policy_version 614370 (0.00081) [2022-07-10 06:51:58,763][26022] Updated weights on worker 0-0, policy_version 614380 (0.00083) [2022-07-10 06:51:59,045][25689] Fps is (10 sec: 5757.0, 60 sec: 5599.7, 300 sec: 5565.1). Total num frames: 629127168. Throughput: 0: 5047.1. Samples: 629119496. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:51:59,046][25689] Avg episode reward: [(0, '-19.717')] [2022-07-10 06:52:00,543][26022] Updated weights on worker 0-0, policy_version 614390 (0.00091) [2022-07-10 06:52:02,476][26022] Updated weights on worker 0-0, policy_version 614400 (0.00108) [2022-07-10 06:52:04,059][25689] Fps is (10 sec: 5457.0, 60 sec: 5554.2, 300 sec: 5558.2). Total num frames: 629151744. Throughput: 0: 5854.8. Samples: 629152630. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:52:04,060][25689] Avg episode reward: [(0, '-19.908')] [2022-07-10 06:52:04,695][26022] Updated weights on worker 0-0, policy_version 614410 (0.00090) [2022-07-10 06:52:06,442][26022] Updated weights on worker 0-0, policy_version 614420 (0.00086) [2022-07-10 06:52:08,318][26022] Updated weights on worker 0-0, policy_version 614430 (0.00086) [2022-07-10 06:52:09,074][25689] Fps is (10 sec: 5309.3, 60 sec: 5573.1, 300 sec: 5558.2). Total num frames: 629180416. Throughput: 0: 5791.1. Samples: 629185264. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:52:09,075][25689] Avg episode reward: [(0, '-19.879')] [2022-07-10 06:52:09,924][26022] Updated weights on worker 0-0, policy_version 614440 (0.00089) [2022-07-10 06:52:10,119][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:52:10,139][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000614441_629187584.pth [2022-07-10 06:52:10,140][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000612482_627181568.pth [2022-07-10 06:52:10,141][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000614441_629187584.pth.milestone [2022-07-10 06:52:11,989][26022] Updated weights on worker 0-0, policy_version 614450 (0.00085) [2022-07-10 06:52:13,554][26022] Updated weights on worker 0-0, policy_version 614460 (0.00087) [2022-07-10 06:52:14,122][25689] Fps is (10 sec: 5596.6, 60 sec: 5552.6, 300 sec: 5554.8). Total num frames: 629208064. Throughput: 0: 4932.7. Samples: 629202030. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 06:52:14,123][25689] Avg episode reward: [(0, '-20.364')] [2022-07-10 06:52:15,512][26022] Updated weights on worker 0-0, policy_version 614470 (0.00093) [2022-07-10 06:52:17,399][26022] Updated weights on worker 0-0, policy_version 614480 (0.00086) [2022-07-10 06:52:19,215][25689] Fps is (10 sec: 5654.8, 60 sec: 5586.5, 300 sec: 5563.9). Total num frames: 629237760. Throughput: 0: 5755.0. Samples: 629235574. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:19,216][25689] Avg episode reward: [(0, '-20.170')] [2022-07-10 06:52:19,219][26022] Updated weights on worker 0-0, policy_version 614490 (0.00093) [2022-07-10 06:52:21,105][26022] Updated weights on worker 0-0, policy_version 614500 (0.00089) [2022-07-10 06:52:22,939][26022] Updated weights on worker 0-0, policy_version 614510 (0.00085) [2022-07-10 06:52:24,233][25689] Fps is (10 sec: 5570.3, 60 sec: 5551.8, 300 sec: 5556.9). Total num frames: 629264384. Throughput: 0: 5782.4. Samples: 629269284. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:24,233][25689] Avg episode reward: [(0, '-22.181')] [2022-07-10 06:52:24,602][26022] Updated weights on worker 0-0, policy_version 614520 (0.00091) [2022-07-10 06:52:26,520][26022] Updated weights on worker 0-0, policy_version 614530 (0.00084) [2022-07-10 06:52:28,399][26022] Updated weights on worker 0-0, policy_version 614540 (0.00092) [2022-07-10 06:52:29,253][25689] Fps is (10 sec: 5508.6, 60 sec: 5553.6, 300 sec: 5556.7). Total num frames: 629293056. Throughput: 0: 5821.9. Samples: 629302744. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:29,254][25689] Avg episode reward: [(0, '-21.450')] [2022-07-10 06:52:30,148][26022] Updated weights on worker 0-0, policy_version 614550 (0.00086) [2022-07-10 06:52:32,162][26022] Updated weights on worker 0-0, policy_version 614560 (0.00086) [2022-07-10 06:52:33,944][26022] Updated weights on worker 0-0, policy_version 614570 (0.00089) [2022-07-10 06:52:34,330][25689] Fps is (10 sec: 5679.5, 60 sec: 5601.0, 300 sec: 5560.0). Total num frames: 629321728. Throughput: 0: 5822.9. Samples: 629319696. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:34,330][25689] Avg episode reward: [(0, '-21.111')] [2022-07-10 06:52:35,734][26022] Updated weights on worker 0-0, policy_version 614580 (0.00084) [2022-07-10 06:52:37,551][26022] Updated weights on worker 0-0, policy_version 614590 (0.00088) [2022-07-10 06:52:39,454][25689] Fps is (10 sec: 5621.7, 60 sec: 5576.4, 300 sec: 5561.3). Total num frames: 629350400. Throughput: 0: 5806.6. Samples: 629353092. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:39,455][25689] Avg episode reward: [(0, '-19.805')] [2022-07-10 06:52:39,455][26022] Updated weights on worker 0-0, policy_version 614600 (0.00082) [2022-07-10 06:52:41,267][26022] Updated weights on worker 0-0, policy_version 614610 (0.00092) [2022-07-10 06:52:43,288][26022] Updated weights on worker 0-0, policy_version 614620 (0.00618) [2022-07-10 06:52:44,519][25689] Fps is (10 sec: 5527.1, 60 sec: 5572.9, 300 sec: 5553.7). Total num frames: 629378048. Throughput: 0: 5764.5. Samples: 629386224. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:44,520][25689] Avg episode reward: [(0, '-19.419')] [2022-07-10 06:52:45,076][26022] Updated weights on worker 0-0, policy_version 614630 (0.00086) [2022-07-10 06:52:46,861][26022] Updated weights on worker 0-0, policy_version 614640 (0.00092) [2022-07-10 06:52:48,817][26022] Updated weights on worker 0-0, policy_version 614650 (0.00086) [2022-07-10 06:52:49,525][25689] Fps is (10 sec: 5490.3, 60 sec: 5557.6, 300 sec: 5557.6). Total num frames: 629405696. Throughput: 0: 4941.4. Samples: 629402916. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:49,527][25689] Avg episode reward: [(0, '-17.834')] [2022-07-10 06:52:50,444][26022] Updated weights on worker 0-0, policy_version 614660 (0.00090) [2022-07-10 06:52:52,482][26022] Updated weights on worker 0-0, policy_version 614670 (0.00090) [2022-07-10 06:52:54,104][26022] Updated weights on worker 0-0, policy_version 614680 (0.00797) [2022-07-10 06:52:54,590][25689] Fps is (10 sec: 5490.7, 60 sec: 5553.4, 300 sec: 5547.6). Total num frames: 629433344. Throughput: 0: 5751.9. Samples: 629436232. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:54,592][25689] Avg episode reward: [(0, '-18.297')] [2022-07-10 06:52:56,157][26022] Updated weights on worker 0-0, policy_version 614690 (0.00091) [2022-07-10 06:52:57,846][26022] Updated weights on worker 0-0, policy_version 614700 (0.00098) [2022-07-10 06:52:59,608][26022] Updated weights on worker 0-0, policy_version 614710 (0.00098) [2022-07-10 06:52:59,739][25689] Fps is (10 sec: 5614.7, 60 sec: 5533.9, 300 sec: 5563.0). Total num frames: 629463040. Throughput: 0: 5764.6. Samples: 629470026. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:52:59,739][25689] Avg episode reward: [(0, '-18.933')] [2022-07-10 06:53:01,768][26022] Updated weights on worker 0-0, policy_version 614720 (0.00092) [2022-07-10 06:53:03,736][26022] Updated weights on worker 0-0, policy_version 614730 (0.00095) [2022-07-10 06:53:04,815][25689] Fps is (10 sec: 5408.3, 60 sec: 5545.1, 300 sec: 5548.3). Total num frames: 629488640. Throughput: 0: 4855.0. Samples: 629484758. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:04,817][25689] Avg episode reward: [(0, '-19.822')] [2022-07-10 06:53:05,684][26022] Updated weights on worker 0-0, policy_version 614740 (0.00088) [2022-07-10 06:53:07,406][26022] Updated weights on worker 0-0, policy_version 614750 (0.00092) [2022-07-10 06:53:09,079][26022] Updated weights on worker 0-0, policy_version 614760 (0.00083) [2022-07-10 06:53:09,899][25689] Fps is (10 sec: 5341.9, 60 sec: 5538.9, 300 sec: 5550.7). Total num frames: 629517312. Throughput: 0: 5667.5. Samples: 629518380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:09,899][25689] Avg episode reward: [(0, '-20.799')] [2022-07-10 06:53:11,063][26022] Updated weights on worker 0-0, policy_version 614770 (0.00082) [2022-07-10 06:53:12,788][26022] Updated weights on worker 0-0, policy_version 614780 (0.00090) [2022-07-10 06:53:14,721][26022] Updated weights on worker 0-0, policy_version 614790 (0.00087) [2022-07-10 06:53:14,910][25689] Fps is (10 sec: 5781.7, 60 sec: 5575.9, 300 sec: 5562.1). Total num frames: 629547008. Throughput: 0: 5709.5. Samples: 629552248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:14,910][25689] Avg episode reward: [(0, '-19.453')] [2022-07-10 06:53:16,567][26022] Updated weights on worker 0-0, policy_version 614800 (0.00090) [2022-07-10 06:53:18,215][26022] Updated weights on worker 0-0, policy_version 614810 (0.00090) [2022-07-10 06:53:19,978][25689] Fps is (10 sec: 5587.5, 60 sec: 5527.6, 300 sec: 5551.7). Total num frames: 629573632. Throughput: 0: 4897.0. Samples: 629569138. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:19,979][25689] Avg episode reward: [(0, '-18.121')] [2022-07-10 06:53:20,400][26022] Updated weights on worker 0-0, policy_version 614820 (0.00091) [2022-07-10 06:53:21,818][26022] Updated weights on worker 0-0, policy_version 614830 (0.00079) [2022-07-10 06:53:23,840][26022] Updated weights on worker 0-0, policy_version 614840 (0.00090) [2022-07-10 06:53:25,020][25689] Fps is (10 sec: 5469.3, 60 sec: 5559.1, 300 sec: 5551.0). Total num frames: 629602304. Throughput: 0: 5844.4. Samples: 629602846. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:25,022][25689] Avg episode reward: [(0, '-16.770')] [2022-07-10 06:53:25,559][26022] Updated weights on worker 0-0, policy_version 614850 (0.00091) [2022-07-10 06:53:27,628][26022] Updated weights on worker 0-0, policy_version 614860 (0.00090) [2022-07-10 06:53:29,298][26022] Updated weights on worker 0-0, policy_version 614870 (0.00089) [2022-07-10 06:53:30,031][25689] Fps is (10 sec: 5602.4, 60 sec: 5543.1, 300 sec: 5554.8). Total num frames: 629629952. Throughput: 0: 5855.5. Samples: 629636266. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:30,031][25689] Avg episode reward: [(0, '-17.711')] [2022-07-10 06:53:31,280][26022] Updated weights on worker 0-0, policy_version 614880 (0.00084) [2022-07-10 06:53:33,015][26022] Updated weights on worker 0-0, policy_version 614890 (0.00079) [2022-07-10 06:53:34,816][26022] Updated weights on worker 0-0, policy_version 614900 (0.00083) [2022-07-10 06:53:35,069][25689] Fps is (10 sec: 5706.5, 60 sec: 5563.4, 300 sec: 5555.2). Total num frames: 629659648. Throughput: 0: 5002.7. Samples: 629653098. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:35,069][25689] Avg episode reward: [(0, '-17.907')] [2022-07-10 06:53:36,703][26022] Updated weights on worker 0-0, policy_version 614910 (0.00087) [2022-07-10 06:53:38,268][26022] Updated weights on worker 0-0, policy_version 614920 (0.00092) [2022-07-10 06:53:40,183][25689] Fps is (10 sec: 5547.5, 60 sec: 5530.6, 300 sec: 5550.5). Total num frames: 629686272. Throughput: 0: 5829.7. Samples: 629686928. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:40,184][25689] Avg episode reward: [(0, '-20.174')] [2022-07-10 06:53:40,277][26022] Updated weights on worker 0-0, policy_version 614930 (0.00090) [2022-07-10 06:53:41,850][26022] Updated weights on worker 0-0, policy_version 614940 (0.00098) [2022-07-10 06:53:44,014][26022] Updated weights on worker 0-0, policy_version 614950 (0.00088) [2022-07-10 06:53:45,195][25689] Fps is (10 sec: 5663.1, 60 sec: 5586.2, 300 sec: 5560.8). Total num frames: 629716992. Throughput: 0: 5855.3. Samples: 629720976. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:45,195][25689] Avg episode reward: [(0, '-20.171')] [2022-07-10 06:53:45,781][26022] Updated weights on worker 0-0, policy_version 614960 (0.00092) [2022-07-10 06:53:47,483][26022] Updated weights on worker 0-0, policy_version 614970 (0.00084) [2022-07-10 06:53:49,307][26022] Updated weights on worker 0-0, policy_version 614980 (0.00085) [2022-07-10 06:53:50,257][25689] Fps is (10 sec: 5794.2, 60 sec: 5581.0, 300 sec: 5559.9). Total num frames: 629744640. Throughput: 0: 5025.6. Samples: 629737914. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:50,258][25689] Avg episode reward: [(0, '-20.591')] [2022-07-10 06:53:51,211][26022] Updated weights on worker 0-0, policy_version 614990 (0.00085) [2022-07-10 06:53:52,854][26022] Updated weights on worker 0-0, policy_version 615000 (0.00086) [2022-07-10 06:53:54,769][26022] Updated weights on worker 0-0, policy_version 615010 (0.00090) [2022-07-10 06:53:55,289][25689] Fps is (10 sec: 5376.4, 60 sec: 5567.1, 300 sec: 5553.8). Total num frames: 629771264. Throughput: 0: 5869.5. Samples: 629771782. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:53:55,291][25689] Avg episode reward: [(0, '-19.856')] [2022-07-10 06:53:56,513][26022] Updated weights on worker 0-0, policy_version 615020 (0.00068) [2022-07-10 06:53:58,434][26022] Updated weights on worker 0-0, policy_version 615030 (0.00097) [2022-07-10 06:54:00,315][26022] Updated weights on worker 0-0, policy_version 615040 (0.00084) [2022-07-10 06:54:00,384][25689] Fps is (10 sec: 5561.5, 60 sec: 5572.1, 300 sec: 5569.9). Total num frames: 629800960. Throughput: 0: 5867.6. Samples: 629805456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:00,384][25689] Avg episode reward: [(0, '-18.851')] [2022-07-10 06:54:02,424][26022] Updated weights on worker 0-0, policy_version 615050 (0.00085) [2022-07-10 06:54:04,345][26022] Updated weights on worker 0-0, policy_version 615060 (0.00081) [2022-07-10 06:54:05,419][25689] Fps is (10 sec: 5459.1, 60 sec: 5575.9, 300 sec: 5552.4). Total num frames: 629826560. Throughput: 0: 4895.2. Samples: 629819976. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:05,420][25689] Avg episode reward: [(0, '-18.996')] [2022-07-10 06:54:06,174][26022] Updated weights on worker 0-0, policy_version 615070 (0.00080) [2022-07-10 06:54:08,025][26022] Updated weights on worker 0-0, policy_version 615080 (0.00088) [2022-07-10 06:54:09,693][26022] Updated weights on worker 0-0, policy_version 615090 (0.00092) [2022-07-10 06:54:10,150][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:54:10,162][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000615092_629854208.pth [2022-07-10 06:54:10,162][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000613136_627851264.pth [2022-07-10 06:54:10,441][25689] Fps is (10 sec: 5396.2, 60 sec: 5581.5, 300 sec: 5558.9). Total num frames: 629855232. Throughput: 0: 5751.5. Samples: 629854004. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:10,443][25689] Avg episode reward: [(0, '-17.850')] [2022-07-10 06:54:11,498][26022] Updated weights on worker 0-0, policy_version 615100 (0.00094) [2022-07-10 06:54:13,480][26022] Updated weights on worker 0-0, policy_version 615110 (0.00083) [2022-07-10 06:54:15,100][26022] Updated weights on worker 0-0, policy_version 615120 (0.00091) [2022-07-10 06:54:15,443][25689] Fps is (10 sec: 5720.3, 60 sec: 5565.5, 300 sec: 5557.4). Total num frames: 629883904. Throughput: 0: 5760.4. Samples: 629887876. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:15,444][25689] Avg episode reward: [(0, '-19.056')] [2022-07-10 06:54:17,121][26022] Updated weights on worker 0-0, policy_version 615130 (0.00086) [2022-07-10 06:54:18,825][26022] Updated weights on worker 0-0, policy_version 615140 (0.00085) [2022-07-10 06:54:20,498][25689] Fps is (10 sec: 5599.8, 60 sec: 5583.6, 300 sec: 5560.8). Total num frames: 629911552. Throughput: 0: 4941.1. Samples: 629904846. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:20,499][25689] Avg episode reward: [(0, '-19.092')] [2022-07-10 06:54:20,675][26022] Updated weights on worker 0-0, policy_version 615150 (0.00094) [2022-07-10 06:54:22,317][26022] Updated weights on worker 0-0, policy_version 615160 (0.00085) [2022-07-10 06:54:24,497][26022] Updated weights on worker 0-0, policy_version 615170 (0.00078) [2022-07-10 06:54:25,504][25689] Fps is (10 sec: 5597.8, 60 sec: 5586.9, 300 sec: 5561.3). Total num frames: 629940224. Throughput: 0: 5911.0. Samples: 629938702. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:25,505][25689] Avg episode reward: [(0, '-18.192')] [2022-07-10 06:54:25,929][26022] Updated weights on worker 0-0, policy_version 615180 (0.00098) [2022-07-10 06:54:28,082][26022] Updated weights on worker 0-0, policy_version 615190 (0.00086) [2022-07-10 06:54:29,605][26022] Updated weights on worker 0-0, policy_version 615200 (0.00094) [2022-07-10 06:54:30,551][25689] Fps is (10 sec: 5500.6, 60 sec: 5566.7, 300 sec: 5553.7). Total num frames: 629966848. Throughput: 0: 5886.1. Samples: 629972374. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:30,552][25689] Avg episode reward: [(0, '-18.072')] [2022-07-10 06:54:31,623][26022] Updated weights on worker 0-0, policy_version 615210 (0.00090) [2022-07-10 06:54:33,530][26022] Updated weights on worker 0-0, policy_version 615220 (0.00090) [2022-07-10 06:54:35,360][26022] Updated weights on worker 0-0, policy_version 615230 (0.00086) [2022-07-10 06:54:35,556][25689] Fps is (10 sec: 5602.8, 60 sec: 5569.7, 300 sec: 5562.1). Total num frames: 629996544. Throughput: 0: 5861.8. Samples: 630005774. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:35,557][25689] Avg episode reward: [(0, '-16.763')] [2022-07-10 06:54:37,242][26022] Updated weights on worker 0-0, policy_version 615240 (0.00081) [2022-07-10 06:54:39,024][26022] Updated weights on worker 0-0, policy_version 615250 (0.00087) [2022-07-10 06:54:40,669][25689] Fps is (10 sec: 5768.6, 60 sec: 5603.7, 300 sec: 5560.4). Total num frames: 630025216. Throughput: 0: 5835.1. Samples: 630022546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:40,670][25689] Avg episode reward: [(0, '-16.436')] [2022-07-10 06:54:40,795][26022] Updated weights on worker 0-0, policy_version 615260 (0.00083) [2022-07-10 06:54:42,770][26022] Updated weights on worker 0-0, policy_version 615270 (0.00086) [2022-07-10 06:54:44,489][26022] Updated weights on worker 0-0, policy_version 615280 (0.00085) [2022-07-10 06:54:45,745][25689] Fps is (10 sec: 5628.1, 60 sec: 5563.9, 300 sec: 5563.0). Total num frames: 630053888. Throughput: 0: 5813.7. Samples: 630056376. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:45,746][25689] Avg episode reward: [(0, '-16.014')] [2022-07-10 06:54:46,392][26022] Updated weights on worker 0-0, policy_version 615290 (0.00082) [2022-07-10 06:54:48,070][26022] Updated weights on worker 0-0, policy_version 615300 (0.00090) [2022-07-10 06:54:49,849][26022] Updated weights on worker 0-0, policy_version 615310 (0.00089) [2022-07-10 06:54:50,754][25689] Fps is (10 sec: 5483.4, 60 sec: 5551.9, 300 sec: 5559.5). Total num frames: 630080512. Throughput: 0: 5831.8. Samples: 630090190. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:50,754][25689] Avg episode reward: [(0, '-16.511')] [2022-07-10 06:54:51,821][26022] Updated weights on worker 0-0, policy_version 615320 (0.00087) [2022-07-10 06:54:53,838][26022] Updated weights on worker 0-0, policy_version 615330 (0.00090) [2022-07-10 06:54:55,365][26022] Updated weights on worker 0-0, policy_version 615340 (0.00085) [2022-07-10 06:54:55,790][25689] Fps is (10 sec: 5708.8, 60 sec: 5619.3, 300 sec: 5567.2). Total num frames: 630111232. Throughput: 0: 4999.9. Samples: 630106940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:54:55,791][25689] Avg episode reward: [(0, '-16.386')] [2022-07-10 06:54:57,437][26022] Updated weights on worker 0-0, policy_version 615350 (0.00086) [2022-07-10 06:54:58,891][26022] Updated weights on worker 0-0, policy_version 615360 (0.00091) [2022-07-10 06:55:00,855][25689] Fps is (10 sec: 5676.9, 60 sec: 5571.2, 300 sec: 5566.2). Total num frames: 630137856. Throughput: 0: 5856.5. Samples: 630140762. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:55:00,855][25689] Avg episode reward: [(0, '-15.218')] [2022-07-10 06:55:01,018][26022] Updated weights on worker 0-0, policy_version 615370 (0.00091) [2022-07-10 06:55:03,108][26022] Updated weights on worker 0-0, policy_version 615380 (0.00093) [2022-07-10 06:55:04,837][26022] Updated weights on worker 0-0, policy_version 615390 (0.00086) [2022-07-10 06:55:05,872][25689] Fps is (10 sec: 5179.9, 60 sec: 5572.8, 300 sec: 5562.5). Total num frames: 630163456. Throughput: 0: 5756.6. Samples: 630172238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:55:05,874][25689] Avg episode reward: [(0, '-15.761')] [2022-07-10 06:55:06,809][26022] Updated weights on worker 0-0, policy_version 615400 (0.00088) [2022-07-10 06:55:08,635][26022] Updated weights on worker 0-0, policy_version 615410 (0.00088) [2022-07-10 06:55:10,486][26022] Updated weights on worker 0-0, policy_version 615420 (0.00086) [2022-07-10 06:55:10,920][25689] Fps is (10 sec: 5392.2, 60 sec: 5570.5, 300 sec: 5565.2). Total num frames: 630192128. Throughput: 0: 4902.2. Samples: 630189048. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:55:10,920][25689] Avg episode reward: [(0, '-14.969')] [2022-07-10 06:55:12,276][26022] Updated weights on worker 0-0, policy_version 615430 (0.00375) [2022-07-10 06:55:14,183][26022] Updated weights on worker 0-0, policy_version 615440 (0.00086) [2022-07-10 06:55:15,705][26022] Updated weights on worker 0-0, policy_version 615450 (0.00088) [2022-07-10 06:55:15,923][25689] Fps is (10 sec: 5807.4, 60 sec: 5587.4, 300 sec: 5570.0). Total num frames: 630221824. Throughput: 0: 5751.9. Samples: 630222742. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:55:15,924][25689] Avg episode reward: [(0, '-14.373')] [2022-07-10 06:55:17,750][26022] Updated weights on worker 0-0, policy_version 615460 (0.00094) [2022-07-10 06:55:19,290][26022] Updated weights on worker 0-0, policy_version 615470 (0.00102) [2022-07-10 06:55:21,031][25689] Fps is (10 sec: 5671.5, 60 sec: 5582.5, 300 sec: 5568.3). Total num frames: 630249472. Throughput: 0: 5734.8. Samples: 630256466. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 06:55:21,031][25689] Avg episode reward: [(0, '-12.957')] [2022-07-10 06:55:21,435][26022] Updated weights on worker 0-0, policy_version 615480 (0.00086) [2022-07-10 06:55:23,087][26022] Updated weights on worker 0-0, policy_version 615490 (0.00468) [2022-07-10 06:55:24,936][26022] Updated weights on worker 0-0, policy_version 615500 (0.00091) [2022-07-10 06:55:26,090][25689] Fps is (10 sec: 5539.4, 60 sec: 5577.6, 300 sec: 5567.8). Total num frames: 630278144. Throughput: 0: 5004.6. Samples: 630273420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:26,093][25689] Avg episode reward: [(0, '-14.236')] [2022-07-10 06:55:26,700][26022] Updated weights on worker 0-0, policy_version 615510 (0.00094) [2022-07-10 06:55:28,602][26022] Updated weights on worker 0-0, policy_version 615520 (0.00087) [2022-07-10 06:55:30,309][26022] Updated weights on worker 0-0, policy_version 615530 (0.00086) [2022-07-10 06:55:31,128][25689] Fps is (10 sec: 5577.7, 60 sec: 5595.3, 300 sec: 5567.3). Total num frames: 630305792. Throughput: 0: 5845.6. Samples: 630307178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:31,129][25689] Avg episode reward: [(0, '-13.914')] [2022-07-10 06:55:32,475][26022] Updated weights on worker 0-0, policy_version 615540 (0.00083) [2022-07-10 06:55:34,122][26022] Updated weights on worker 0-0, policy_version 615550 (0.00088) [2022-07-10 06:55:36,067][26022] Updated weights on worker 0-0, policy_version 615560 (0.00094) [2022-07-10 06:55:36,171][25689] Fps is (10 sec: 5485.2, 60 sec: 5558.0, 300 sec: 5563.9). Total num frames: 630333440. Throughput: 0: 5812.0. Samples: 630340424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:36,171][25689] Avg episode reward: [(0, '-14.603')] [2022-07-10 06:55:37,820][26022] Updated weights on worker 0-0, policy_version 615570 (0.00086) [2022-07-10 06:55:39,547][26022] Updated weights on worker 0-0, policy_version 615580 (0.00081) [2022-07-10 06:55:41,277][25689] Fps is (10 sec: 5549.3, 60 sec: 5558.6, 300 sec: 5565.6). Total num frames: 630362112. Throughput: 0: 4968.9. Samples: 630357068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:41,278][25689] Avg episode reward: [(0, '-14.662')] [2022-07-10 06:55:41,521][26022] Updated weights on worker 0-0, policy_version 615590 (0.00100) [2022-07-10 06:55:43,272][26022] Updated weights on worker 0-0, policy_version 615600 (0.00091) [2022-07-10 06:55:45,127][26022] Updated weights on worker 0-0, policy_version 615610 (0.00084) [2022-07-10 06:55:46,345][25689] Fps is (10 sec: 5736.9, 60 sec: 5576.3, 300 sec: 5567.9). Total num frames: 630391808. Throughput: 0: 5807.2. Samples: 630391044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:46,345][25689] Avg episode reward: [(0, '-15.157')] [2022-07-10 06:55:47,081][26022] Updated weights on worker 0-0, policy_version 615620 (0.00050) [2022-07-10 06:55:48,599][26022] Updated weights on worker 0-0, policy_version 615630 (0.00087) [2022-07-10 06:55:50,715][26022] Updated weights on worker 0-0, policy_version 615640 (0.00092) [2022-07-10 06:55:51,419][25689] Fps is (10 sec: 5654.4, 60 sec: 5587.1, 300 sec: 5566.9). Total num frames: 630419456. Throughput: 0: 5809.8. Samples: 630425062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:51,420][25689] Avg episode reward: [(0, '-15.168')] [2022-07-10 06:55:52,286][26022] Updated weights on worker 0-0, policy_version 615650 (0.00086) [2022-07-10 06:55:54,131][26022] Updated weights on worker 0-0, policy_version 615660 (0.00082) [2022-07-10 06:55:56,065][26022] Updated weights on worker 0-0, policy_version 615670 (0.00097) [2022-07-10 06:55:56,455][25689] Fps is (10 sec: 5672.1, 60 sec: 5570.3, 300 sec: 5574.7). Total num frames: 630449152. Throughput: 0: 5009.7. Samples: 630442040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:55:56,455][25689] Avg episode reward: [(0, '-15.762')] [2022-07-10 06:55:57,776][26022] Updated weights on worker 0-0, policy_version 615680 (0.00089) [2022-07-10 06:55:59,633][26022] Updated weights on worker 0-0, policy_version 615690 (0.00090) [2022-07-10 06:56:01,498][25689] Fps is (10 sec: 5689.0, 60 sec: 5589.1, 300 sec: 5575.2). Total num frames: 630476800. Throughput: 0: 5878.5. Samples: 630475940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:01,499][25689] Avg episode reward: [(0, '-16.234')] [2022-07-10 06:56:01,502][26022] Updated weights on worker 0-0, policy_version 615700 (0.00093) [2022-07-10 06:56:03,574][26022] Updated weights on worker 0-0, policy_version 615710 (0.00082) [2022-07-10 06:56:05,606][26022] Updated weights on worker 0-0, policy_version 615720 (0.00087) [2022-07-10 06:56:06,540][25689] Fps is (10 sec: 5279.7, 60 sec: 5586.9, 300 sec: 5568.3). Total num frames: 630502400. Throughput: 0: 5774.6. Samples: 630507664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:06,541][25689] Avg episode reward: [(0, '-16.717')] [2022-07-10 06:56:07,280][26022] Updated weights on worker 0-0, policy_version 615730 (0.00084) [2022-07-10 06:56:09,196][26022] Updated weights on worker 0-0, policy_version 615740 (0.00092) [2022-07-10 06:56:10,371][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:56:10,393][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000615746_630523904.pth [2022-07-10 06:56:10,393][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000613787_628517888.pth [2022-07-10 06:56:10,936][26022] Updated weights on worker 0-0, policy_version 615750 (0.00091) [2022-07-10 06:56:11,603][25689] Fps is (10 sec: 5371.1, 60 sec: 5585.5, 300 sec: 5567.3). Total num frames: 630531072. Throughput: 0: 4922.9. Samples: 630524426. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:11,603][25689] Avg episode reward: [(0, '-15.363')] [2022-07-10 06:56:12,766][26022] Updated weights on worker 0-0, policy_version 615760 (0.00086) [2022-07-10 06:56:14,595][26022] Updated weights on worker 0-0, policy_version 615770 (0.00093) [2022-07-10 06:56:16,464][26022] Updated weights on worker 0-0, policy_version 615780 (0.00093) [2022-07-10 06:56:16,668][25689] Fps is (10 sec: 5560.8, 60 sec: 5546.0, 300 sec: 5567.8). Total num frames: 630558720. Throughput: 0: 5746.3. Samples: 630558192. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:16,669][25689] Avg episode reward: [(0, '-16.214')] [2022-07-10 06:56:18,077][26022] Updated weights on worker 0-0, policy_version 615790 (0.00090) [2022-07-10 06:56:20,235][26022] Updated weights on worker 0-0, policy_version 615800 (0.00090) [2022-07-10 06:56:21,742][25689] Fps is (10 sec: 5655.6, 60 sec: 5582.9, 300 sec: 5570.0). Total num frames: 630588416. Throughput: 0: 5731.9. Samples: 630591976. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:21,744][25689] Avg episode reward: [(0, '-15.687')] [2022-07-10 06:56:21,755][26022] Updated weights on worker 0-0, policy_version 615810 (0.00087) [2022-07-10 06:56:23,788][26022] Updated weights on worker 0-0, policy_version 615820 (0.00092) [2022-07-10 06:56:25,435][26022] Updated weights on worker 0-0, policy_version 615830 (0.00905) [2022-07-10 06:56:26,819][25689] Fps is (10 sec: 5750.2, 60 sec: 5581.3, 300 sec: 5569.3). Total num frames: 630617088. Throughput: 0: 5817.6. Samples: 630625636. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:26,820][25689] Avg episode reward: [(0, '-14.969')] [2022-07-10 06:56:27,318][26022] Updated weights on worker 0-0, policy_version 615840 (0.00097) [2022-07-10 06:56:29,198][26022] Updated weights on worker 0-0, policy_version 615850 (0.00096) [2022-07-10 06:56:30,934][26022] Updated weights on worker 0-0, policy_version 615860 (0.00088) [2022-07-10 06:56:31,911][25689] Fps is (10 sec: 5337.0, 60 sec: 5542.6, 300 sec: 5568.3). Total num frames: 630642688. Throughput: 0: 5813.5. Samples: 630642488. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:31,912][25689] Avg episode reward: [(0, '-14.241')] [2022-07-10 06:56:32,791][26022] Updated weights on worker 0-0, policy_version 615870 (0.00098) [2022-07-10 06:56:34,894][26022] Updated weights on worker 0-0, policy_version 615880 (0.00095) [2022-07-10 06:56:36,399][26022] Updated weights on worker 0-0, policy_version 615890 (0.00083) [2022-07-10 06:56:36,969][25689] Fps is (10 sec: 5649.2, 60 sec: 5608.6, 300 sec: 5574.9). Total num frames: 630674432. Throughput: 0: 5817.6. Samples: 630676296. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:36,970][25689] Avg episode reward: [(0, '-13.919')] [2022-07-10 06:56:38,552][26022] Updated weights on worker 0-0, policy_version 615900 (0.00086) [2022-07-10 06:56:40,222][26022] Updated weights on worker 0-0, policy_version 615910 (0.00093) [2022-07-10 06:56:41,907][26022] Updated weights on worker 0-0, policy_version 615920 (0.00088) [2022-07-10 06:56:42,086][25689] Fps is (10 sec: 5937.6, 60 sec: 5607.6, 300 sec: 5576.6). Total num frames: 630703104. Throughput: 0: 5799.5. Samples: 630709962. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:42,087][25689] Avg episode reward: [(0, '-13.316')] [2022-07-10 06:56:43,996][26022] Updated weights on worker 0-0, policy_version 615930 (0.00081) [2022-07-10 06:56:45,453][26022] Updated weights on worker 0-0, policy_version 615940 (0.00090) [2022-07-10 06:56:47,116][25689] Fps is (10 sec: 5449.5, 60 sec: 5560.5, 300 sec: 5569.7). Total num frames: 630729728. Throughput: 0: 4997.3. Samples: 630727070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:47,117][25689] Avg episode reward: [(0, '-13.555')] [2022-07-10 06:56:47,482][26022] Updated weights on worker 0-0, policy_version 615950 (0.00088) [2022-07-10 06:56:49,031][26022] Updated weights on worker 0-0, policy_version 615960 (0.00085) [2022-07-10 06:56:50,986][26022] Updated weights on worker 0-0, policy_version 615970 (0.00088) [2022-07-10 06:56:52,184][25689] Fps is (10 sec: 5577.7, 60 sec: 5594.8, 300 sec: 5575.6). Total num frames: 630759424. Throughput: 0: 5851.8. Samples: 630761120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:52,185][25689] Avg episode reward: [(0, '-13.856')] [2022-07-10 06:56:52,951][26022] Updated weights on worker 0-0, policy_version 615980 (0.00087) [2022-07-10 06:56:54,594][26022] Updated weights on worker 0-0, policy_version 615990 (0.00085) [2022-07-10 06:56:56,413][26022] Updated weights on worker 0-0, policy_version 616000 (0.00088) [2022-07-10 06:56:57,255][25689] Fps is (10 sec: 5655.9, 60 sec: 5557.9, 300 sec: 5566.2). Total num frames: 630787072. Throughput: 0: 5831.6. Samples: 630794594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:56:57,256][25689] Avg episode reward: [(0, '-14.743')] [2022-07-10 06:56:58,492][26022] Updated weights on worker 0-0, policy_version 616010 (0.00096) [2022-07-10 06:57:00,124][26022] Updated weights on worker 0-0, policy_version 616020 (0.00091) [2022-07-10 06:57:02,321][25689] Fps is (10 sec: 5354.0, 60 sec: 5539.0, 300 sec: 5572.1). Total num frames: 630813696. Throughput: 0: 5004.0. Samples: 630811214. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:02,321][25689] Avg episode reward: [(0, '-15.133')] [2022-07-10 06:57:02,504][26022] Updated weights on worker 0-0, policy_version 616030 (0.00089) [2022-07-10 06:57:04,472][26022] Updated weights on worker 0-0, policy_version 616040 (0.00087) [2022-07-10 06:57:06,191][26022] Updated weights on worker 0-0, policy_version 616050 (0.00086) [2022-07-10 06:57:07,408][25689] Fps is (10 sec: 5244.5, 60 sec: 5551.7, 300 sec: 5563.9). Total num frames: 630840320. Throughput: 0: 5673.5. Samples: 630842196. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:07,409][25689] Avg episode reward: [(0, '-14.593')] [2022-07-10 06:57:08,112][26022] Updated weights on worker 0-0, policy_version 616060 (0.00088) [2022-07-10 06:57:09,819][26022] Updated weights on worker 0-0, policy_version 616070 (0.00084) [2022-07-10 06:57:11,699][26022] Updated weights on worker 0-0, policy_version 616080 (0.00094) [2022-07-10 06:57:12,418][25689] Fps is (10 sec: 5577.6, 60 sec: 5573.3, 300 sec: 5571.5). Total num frames: 630870016. Throughput: 0: 5669.8. Samples: 630875844. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:12,419][25689] Avg episode reward: [(0, '-15.688')] [2022-07-10 06:57:13,578][26022] Updated weights on worker 0-0, policy_version 616090 (0.00093) [2022-07-10 06:57:15,479][26022] Updated weights on worker 0-0, policy_version 616100 (0.00101) [2022-07-10 06:57:17,155][26022] Updated weights on worker 0-0, policy_version 616110 (0.00090) [2022-07-10 06:57:17,436][25689] Fps is (10 sec: 5718.3, 60 sec: 5577.6, 300 sec: 5566.0). Total num frames: 630897664. Throughput: 0: 4862.0. Samples: 630892716. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:17,437][25689] Avg episode reward: [(0, '-16.325')] [2022-07-10 06:57:19,189][26022] Updated weights on worker 0-0, policy_version 616120 (0.00090) [2022-07-10 06:57:20,837][26022] Updated weights on worker 0-0, policy_version 616130 (0.00091) [2022-07-10 06:57:22,508][25689] Fps is (10 sec: 5480.4, 60 sec: 5544.2, 300 sec: 5568.4). Total num frames: 630925312. Throughput: 0: 5692.9. Samples: 630926138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:22,508][25689] Avg episode reward: [(0, '-16.360')] [2022-07-10 06:57:22,754][26022] Updated weights on worker 0-0, policy_version 616140 (0.00089) [2022-07-10 06:57:24,387][26022] Updated weights on worker 0-0, policy_version 616150 (0.00096) [2022-07-10 06:57:26,416][26022] Updated weights on worker 0-0, policy_version 616160 (0.00088) [2022-07-10 06:57:27,515][25689] Fps is (10 sec: 5689.5, 60 sec: 5567.4, 300 sec: 5572.1). Total num frames: 630955008. Throughput: 0: 5841.8. Samples: 630959658. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:27,516][25689] Avg episode reward: [(0, '-15.776')] [2022-07-10 06:57:28,051][26022] Updated weights on worker 0-0, policy_version 616170 (0.00094) [2022-07-10 06:57:30,226][26022] Updated weights on worker 0-0, policy_version 616180 (0.00085) [2022-07-10 06:57:32,026][26022] Updated weights on worker 0-0, policy_version 616190 (0.00086) [2022-07-10 06:57:32,611][25689] Fps is (10 sec: 5574.4, 60 sec: 5583.9, 300 sec: 5564.9). Total num frames: 630981632. Throughput: 0: 4977.7. Samples: 630976358. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:32,612][25689] Avg episode reward: [(0, '-14.392')] [2022-07-10 06:57:33,694][26022] Updated weights on worker 0-0, policy_version 616200 (0.00086) [2022-07-10 06:57:35,568][26022] Updated weights on worker 0-0, policy_version 616210 (0.00082) [2022-07-10 06:57:37,376][26022] Updated weights on worker 0-0, policy_version 616220 (0.00092) [2022-07-10 06:57:37,636][25689] Fps is (10 sec: 5362.2, 60 sec: 5519.4, 300 sec: 5563.3). Total num frames: 631009280. Throughput: 0: 5803.3. Samples: 631009942. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:37,637][25689] Avg episode reward: [(0, '-15.376')] [2022-07-10 06:57:39,243][26022] Updated weights on worker 0-0, policy_version 616230 (0.00081) [2022-07-10 06:57:41,072][26022] Updated weights on worker 0-0, policy_version 616240 (0.00095) [2022-07-10 06:57:42,725][25689] Fps is (10 sec: 5568.7, 60 sec: 5522.0, 300 sec: 5566.3). Total num frames: 631037952. Throughput: 0: 5808.0. Samples: 631043558. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:42,726][25689] Avg episode reward: [(0, '-14.769')] [2022-07-10 06:57:43,031][26022] Updated weights on worker 0-0, policy_version 616250 (0.00110) [2022-07-10 06:57:44,679][26022] Updated weights on worker 0-0, policy_version 616260 (0.00091) [2022-07-10 06:57:46,655][26022] Updated weights on worker 0-0, policy_version 616270 (0.00094) [2022-07-10 06:57:47,795][25689] Fps is (10 sec: 5745.8, 60 sec: 5569.0, 300 sec: 5572.0). Total num frames: 631067648. Throughput: 0: 4970.7. Samples: 631060462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:47,795][25689] Avg episode reward: [(0, '-12.854')] [2022-07-10 06:57:48,334][26022] Updated weights on worker 0-0, policy_version 616280 (0.00087) [2022-07-10 06:57:50,044][26022] Updated weights on worker 0-0, policy_version 616290 (0.00098) [2022-07-10 06:57:52,177][26022] Updated weights on worker 0-0, policy_version 616300 (0.00092) [2022-07-10 06:57:52,797][25689] Fps is (10 sec: 5591.4, 60 sec: 5524.3, 300 sec: 5569.7). Total num frames: 631094272. Throughput: 0: 5844.1. Samples: 631094328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:52,798][25689] Avg episode reward: [(0, '-12.920')] [2022-07-10 06:57:53,763][26022] Updated weights on worker 0-0, policy_version 616310 (0.00095) [2022-07-10 06:57:55,735][26022] Updated weights on worker 0-0, policy_version 616320 (0.00090) [2022-07-10 06:57:57,458][26022] Updated weights on worker 0-0, policy_version 616330 (0.00086) [2022-07-10 06:57:57,815][25689] Fps is (10 sec: 5518.7, 60 sec: 5546.1, 300 sec: 5568.7). Total num frames: 631122944. Throughput: 0: 5850.6. Samples: 631127996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:57:57,815][25689] Avg episode reward: [(0, '-13.176')] [2022-07-10 06:57:59,418][26022] Updated weights on worker 0-0, policy_version 616340 (0.00089) [2022-07-10 06:58:01,118][26022] Updated weights on worker 0-0, policy_version 616350 (0.00122) [2022-07-10 06:58:02,929][25689] Fps is (10 sec: 5457.8, 60 sec: 5541.7, 300 sec: 5571.4). Total num frames: 631149568. Throughput: 0: 5742.4. Samples: 631159578. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:58:02,931][25689] Avg episode reward: [(0, '-12.359')] [2022-07-10 06:58:03,328][26022] Updated weights on worker 0-0, policy_version 616360 (0.00098) [2022-07-10 06:58:05,227][26022] Updated weights on worker 0-0, policy_version 616370 (0.00095) [2022-07-10 06:58:07,084][26022] Updated weights on worker 0-0, policy_version 616380 (0.00093) [2022-07-10 06:58:08,026][25689] Fps is (10 sec: 5415.2, 60 sec: 5574.6, 300 sec: 5571.2). Total num frames: 631178240. Throughput: 0: 5724.1. Samples: 631176266. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:58:08,026][25689] Avg episode reward: [(0, '-12.513')] [2022-07-10 06:58:08,711][26022] Updated weights on worker 0-0, policy_version 616390 (0.00083) [2022-07-10 06:58:10,474][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 06:58:10,485][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000616399_631192576.pth [2022-07-10 06:58:10,486][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000614441_629187584.pth [2022-07-10 06:58:10,665][26022] Updated weights on worker 0-0, policy_version 616400 (0.00086) [2022-07-10 06:58:12,451][26022] Updated weights on worker 0-0, policy_version 616410 (0.00087) [2022-07-10 06:58:13,030][25689] Fps is (10 sec: 5575.6, 60 sec: 5541.4, 300 sec: 5564.5). Total num frames: 631205888. Throughput: 0: 5727.0. Samples: 631210200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:58:13,032][25689] Avg episode reward: [(0, '-13.124')] [2022-07-10 06:58:14,251][26022] Updated weights on worker 0-0, policy_version 616420 (0.00103) [2022-07-10 06:58:16,213][26022] Updated weights on worker 0-0, policy_version 616430 (0.00091) [2022-07-10 06:58:17,839][26022] Updated weights on worker 0-0, policy_version 616440 (0.00097) [2022-07-10 06:58:18,091][25689] Fps is (10 sec: 5595.8, 60 sec: 5554.4, 300 sec: 5571.5). Total num frames: 631234560. Throughput: 0: 5718.2. Samples: 631243938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:58:18,091][25689] Avg episode reward: [(0, '-14.095')] [2022-07-10 06:58:19,942][26022] Updated weights on worker 0-0, policy_version 616450 (0.00094) [2022-07-10 06:58:21,636][26022] Updated weights on worker 0-0, policy_version 616460 (0.00094) [2022-07-10 06:58:23,155][25689] Fps is (10 sec: 5663.4, 60 sec: 5571.9, 300 sec: 5571.1). Total num frames: 631263232. Throughput: 0: 4995.6. Samples: 631260622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:58:23,156][25689] Avg episode reward: [(0, '-12.923')] [2022-07-10 06:58:23,412][26022] Updated weights on worker 0-0, policy_version 616470 (0.00092) [2022-07-10 06:58:25,368][26022] Updated weights on worker 0-0, policy_version 616480 (0.00087) [2022-07-10 06:58:27,083][26022] Updated weights on worker 0-0, policy_version 616490 (0.00087) [2022-07-10 06:58:28,167][25689] Fps is (10 sec: 5487.9, 60 sec: 5520.9, 300 sec: 5567.6). Total num frames: 631289856. Throughput: 0: 5857.2. Samples: 631294234. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 06:58:28,167][25689] Avg episode reward: [(0, '-14.377')] [2022-07-10 06:58:28,884][26022] Updated weights on worker 0-0, policy_version 616500 (0.00095) [2022-07-10 06:58:30,976][26022] Updated weights on worker 0-0, policy_version 616510 (0.00091) [2022-07-10 06:58:32,657][26022] Updated weights on worker 0-0, policy_version 616520 (0.00092) [2022-07-10 06:58:33,186][25689] Fps is (10 sec: 5614.8, 60 sec: 5578.6, 300 sec: 5568.0). Total num frames: 631319552. Throughput: 0: 5836.9. Samples: 631327848. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:58:33,187][25689] Avg episode reward: [(0, '-15.284')] [2022-07-10 06:58:34,525][26022] Updated weights on worker 0-0, policy_version 616530 (0.00083) [2022-07-10 06:58:36,332][26022] Updated weights on worker 0-0, policy_version 616540 (0.00086) [2022-07-10 06:58:38,191][25689] Fps is (10 sec: 5618.3, 60 sec: 5563.5, 300 sec: 5570.0). Total num frames: 631346176. Throughput: 0: 4996.2. Samples: 631344364. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:58:38,192][25689] Avg episode reward: [(0, '-15.485')] [2022-07-10 06:58:38,276][26022] Updated weights on worker 0-0, policy_version 616550 (0.00098) [2022-07-10 06:58:40,175][26022] Updated weights on worker 0-0, policy_version 616560 (0.00086) [2022-07-10 06:58:41,869][26022] Updated weights on worker 0-0, policy_version 616570 (0.00091) [2022-07-10 06:58:43,288][25689] Fps is (10 sec: 5473.7, 60 sec: 5562.7, 300 sec: 5561.5). Total num frames: 631374848. Throughput: 0: 5828.6. Samples: 631377968. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:58:43,289][25689] Avg episode reward: [(0, '-16.268')] [2022-07-10 06:58:43,597][26022] Updated weights on worker 0-0, policy_version 616580 (0.00093) [2022-07-10 06:58:45,483][26022] Updated weights on worker 0-0, policy_version 616590 (0.00087) [2022-07-10 06:58:47,272][26022] Updated weights on worker 0-0, policy_version 616600 (0.00091) [2022-07-10 06:58:48,323][25689] Fps is (10 sec: 5760.9, 60 sec: 5566.0, 300 sec: 5568.9). Total num frames: 631404544. Throughput: 0: 5841.5. Samples: 631411976. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:58:48,323][25689] Avg episode reward: [(0, '-16.409')] [2022-07-10 06:58:49,127][26022] Updated weights on worker 0-0, policy_version 616610 (0.00082) [2022-07-10 06:58:50,808][26022] Updated weights on worker 0-0, policy_version 616620 (0.00102) [2022-07-10 06:58:52,742][26022] Updated weights on worker 0-0, policy_version 616630 (0.00091) [2022-07-10 06:58:53,339][25689] Fps is (10 sec: 5705.6, 60 sec: 5581.7, 300 sec: 5572.7). Total num frames: 631432192. Throughput: 0: 5004.4. Samples: 631428700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:58:53,339][25689] Avg episode reward: [(0, '-17.481')] [2022-07-10 06:58:54,630][26022] Updated weights on worker 0-0, policy_version 616640 (0.00084) [2022-07-10 06:58:56,260][26022] Updated weights on worker 0-0, policy_version 616650 (0.00095) [2022-07-10 06:58:58,285][26022] Updated weights on worker 0-0, policy_version 616660 (0.00615) [2022-07-10 06:58:58,367][25689] Fps is (10 sec: 5505.5, 60 sec: 5563.7, 300 sec: 5567.0). Total num frames: 631459840. Throughput: 0: 5859.0. Samples: 631462572. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:58:58,367][25689] Avg episode reward: [(0, '-18.167')] [2022-07-10 06:58:59,995][26022] Updated weights on worker 0-0, policy_version 616670 (0.00089) [2022-07-10 06:59:02,187][26022] Updated weights on worker 0-0, policy_version 616680 (0.00091) [2022-07-10 06:59:03,414][25689] Fps is (10 sec: 5488.1, 60 sec: 5586.8, 300 sec: 5573.7). Total num frames: 631487488. Throughput: 0: 5795.4. Samples: 631494608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:03,415][25689] Avg episode reward: [(0, '-16.880')] [2022-07-10 06:59:04,000][26022] Updated weights on worker 0-0, policy_version 616690 (0.00087) [2022-07-10 06:59:05,895][26022] Updated weights on worker 0-0, policy_version 616700 (0.00104) [2022-07-10 06:59:07,545][26022] Updated weights on worker 0-0, policy_version 616710 (0.00091) [2022-07-10 06:59:08,448][25689] Fps is (10 sec: 5485.2, 60 sec: 5575.7, 300 sec: 5570.0). Total num frames: 631515136. Throughput: 0: 4940.0. Samples: 631511392. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:08,448][25689] Avg episode reward: [(0, '-15.132')] [2022-07-10 06:59:09,479][26022] Updated weights on worker 0-0, policy_version 616720 (0.00088) [2022-07-10 06:59:10,998][26022] Updated weights on worker 0-0, policy_version 616730 (0.00081) [2022-07-10 06:59:13,226][26022] Updated weights on worker 0-0, policy_version 616740 (0.00087) [2022-07-10 06:59:13,482][25689] Fps is (10 sec: 5594.2, 60 sec: 5589.9, 300 sec: 5569.4). Total num frames: 631543808. Throughput: 0: 5793.3. Samples: 631545396. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:13,483][25689] Avg episode reward: [(0, '-15.751')] [2022-07-10 06:59:14,618][26022] Updated weights on worker 0-0, policy_version 616750 (0.00086) [2022-07-10 06:59:16,685][26022] Updated weights on worker 0-0, policy_version 616760 (0.00098) [2022-07-10 06:59:18,366][26022] Updated weights on worker 0-0, policy_version 616770 (0.00087) [2022-07-10 06:59:18,486][25689] Fps is (10 sec: 5814.7, 60 sec: 5612.1, 300 sec: 5577.3). Total num frames: 631573504. Throughput: 0: 5809.5. Samples: 631579452. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:18,486][25689] Avg episode reward: [(0, '-15.894')] [2022-07-10 06:59:20,333][26022] Updated weights on worker 0-0, policy_version 616780 (0.00083) [2022-07-10 06:59:22,160][26022] Updated weights on worker 0-0, policy_version 616790 (0.00093) [2022-07-10 06:59:23,532][25689] Fps is (10 sec: 5604.2, 60 sec: 5580.0, 300 sec: 5569.6). Total num frames: 631600128. Throughput: 0: 5055.3. Samples: 631596306. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:23,532][25689] Avg episode reward: [(0, '-14.357')] [2022-07-10 06:59:23,919][26022] Updated weights on worker 0-0, policy_version 616800 (0.00089) [2022-07-10 06:59:25,600][26022] Updated weights on worker 0-0, policy_version 616810 (0.00088) [2022-07-10 06:59:27,520][26022] Updated weights on worker 0-0, policy_version 616820 (0.00084) [2022-07-10 06:59:28,544][25689] Fps is (10 sec: 5497.6, 60 sec: 5613.8, 300 sec: 5577.2). Total num frames: 631628800. Throughput: 0: 5913.1. Samples: 631630222. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:28,544][25689] Avg episode reward: [(0, '-13.573')] [2022-07-10 06:59:29,481][26022] Updated weights on worker 0-0, policy_version 616830 (0.00086) [2022-07-10 06:59:31,099][26022] Updated weights on worker 0-0, policy_version 616840 (0.00091) [2022-07-10 06:59:33,086][26022] Updated weights on worker 0-0, policy_version 616850 (0.00088) [2022-07-10 06:59:33,582][25689] Fps is (10 sec: 5705.8, 60 sec: 5595.1, 300 sec: 5573.1). Total num frames: 631657472. Throughput: 0: 5894.8. Samples: 631663880. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:33,582][25689] Avg episode reward: [(0, '-13.193')] [2022-07-10 06:59:34,910][26022] Updated weights on worker 0-0, policy_version 616860 (0.00092) [2022-07-10 06:59:36,736][26022] Updated weights on worker 0-0, policy_version 616870 (0.00085) [2022-07-10 06:59:38,571][26022] Updated weights on worker 0-0, policy_version 616880 (0.00124) [2022-07-10 06:59:38,663][25689] Fps is (10 sec: 5565.8, 60 sec: 5605.0, 300 sec: 5570.3). Total num frames: 631685120. Throughput: 0: 5024.8. Samples: 631680838. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:38,663][25689] Avg episode reward: [(0, '-13.317')] [2022-07-10 06:59:40,374][26022] Updated weights on worker 0-0, policy_version 616890 (0.00086) [2022-07-10 06:59:42,315][26022] Updated weights on worker 0-0, policy_version 616900 (0.00085) [2022-07-10 06:59:43,795][25689] Fps is (10 sec: 5514.6, 60 sec: 5601.8, 300 sec: 5569.2). Total num frames: 631713792. Throughput: 0: 5820.9. Samples: 631714254. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:43,795][25689] Avg episode reward: [(0, '-13.096')] [2022-07-10 06:59:43,928][26022] Updated weights on worker 0-0, policy_version 616910 (0.00089) [2022-07-10 06:59:45,934][26022] Updated weights on worker 0-0, policy_version 616920 (0.00089) [2022-07-10 06:59:47,678][26022] Updated weights on worker 0-0, policy_version 616930 (0.00080) [2022-07-10 06:59:48,884][25689] Fps is (10 sec: 5610.0, 60 sec: 5579.8, 300 sec: 5574.6). Total num frames: 631742464. Throughput: 0: 5794.8. Samples: 631748092. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:48,885][25689] Avg episode reward: [(0, '-12.476')] [2022-07-10 06:59:49,500][26022] Updated weights on worker 0-0, policy_version 616940 (0.00088) [2022-07-10 06:59:51,210][26022] Updated weights on worker 0-0, policy_version 616950 (0.00089) [2022-07-10 06:59:53,239][26022] Updated weights on worker 0-0, policy_version 616960 (0.00086) [2022-07-10 06:59:53,928][25689] Fps is (10 sec: 5759.8, 60 sec: 5611.0, 300 sec: 5571.0). Total num frames: 631772160. Throughput: 0: 4974.9. Samples: 631765102. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:53,930][25689] Avg episode reward: [(0, '-13.138')] [2022-07-10 06:59:54,955][26022] Updated weights on worker 0-0, policy_version 616970 (0.00091) [2022-07-10 06:59:56,901][26022] Updated weights on worker 0-0, policy_version 616980 (0.00092) [2022-07-10 06:59:58,768][26022] Updated weights on worker 0-0, policy_version 616990 (0.00089) [2022-07-10 06:59:59,031][25689] Fps is (10 sec: 5651.7, 60 sec: 5604.1, 300 sec: 5573.7). Total num frames: 631799808. Throughput: 0: 5799.7. Samples: 631798966. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 06:59:59,031][25689] Avg episode reward: [(0, '-13.286')] [2022-07-10 07:00:00,426][26022] Updated weights on worker 0-0, policy_version 617000 (0.00082) [2022-07-10 07:00:02,735][26022] Updated weights on worker 0-0, policy_version 617010 (0.00083) [2022-07-10 07:00:04,161][25689] Fps is (10 sec: 5303.7, 60 sec: 5579.6, 300 sec: 5575.0). Total num frames: 631826432. Throughput: 0: 5712.7. Samples: 631830602. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:04,163][25689] Avg episode reward: [(0, '-14.164')] [2022-07-10 07:00:04,472][26022] Updated weights on worker 0-0, policy_version 617020 (0.00090) [2022-07-10 07:00:06,282][26022] Updated weights on worker 0-0, policy_version 617030 (0.00089) [2022-07-10 07:00:08,167][26022] Updated weights on worker 0-0, policy_version 617040 (0.00081) [2022-07-10 07:00:09,203][25689] Fps is (10 sec: 5335.4, 60 sec: 5578.9, 300 sec: 5571.7). Total num frames: 631854080. Throughput: 0: 4887.0. Samples: 631847388. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:09,204][25689] Avg episode reward: [(0, '-13.832')] [2022-07-10 07:00:09,910][26022] Updated weights on worker 0-0, policy_version 617050 (0.00094) [2022-07-10 07:00:10,563][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:00:10,575][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000617053_631862272.pth [2022-07-10 07:00:10,576][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000615092_629854208.pth [2022-07-10 07:00:11,872][26022] Updated weights on worker 0-0, policy_version 617060 (0.00081) [2022-07-10 07:00:13,722][26022] Updated weights on worker 0-0, policy_version 617070 (0.00095) [2022-07-10 07:00:14,283][25689] Fps is (10 sec: 5361.7, 60 sec: 5540.9, 300 sec: 5560.0). Total num frames: 631880704. Throughput: 0: 5682.4. Samples: 631880768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:14,284][25689] Avg episode reward: [(0, '-15.702')] [2022-07-10 07:00:15,375][26022] Updated weights on worker 0-0, policy_version 617080 (0.00088) [2022-07-10 07:00:17,594][26022] Updated weights on worker 0-0, policy_version 617090 (0.00084) [2022-07-10 07:00:19,004][26022] Updated weights on worker 0-0, policy_version 617100 (0.00086) [2022-07-10 07:00:19,288][25689] Fps is (10 sec: 5686.0, 60 sec: 5557.7, 300 sec: 5572.2). Total num frames: 631911424. Throughput: 0: 5695.7. Samples: 631914344. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:19,288][25689] Avg episode reward: [(0, '-15.100')] [2022-07-10 07:00:21,255][26022] Updated weights on worker 0-0, policy_version 617110 (0.00087) [2022-07-10 07:00:22,736][26022] Updated weights on worker 0-0, policy_version 617120 (0.00092) [2022-07-10 07:00:24,402][25689] Fps is (10 sec: 5667.1, 60 sec: 5551.5, 300 sec: 5564.3). Total num frames: 631938048. Throughput: 0: 5810.7. Samples: 631948216. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:24,403][25689] Avg episode reward: [(0, '-15.491')] [2022-07-10 07:00:24,625][26022] Updated weights on worker 0-0, policy_version 617130 (0.00087) [2022-07-10 07:00:26,577][26022] Updated weights on worker 0-0, policy_version 617140 (0.00086) [2022-07-10 07:00:28,275][26022] Updated weights on worker 0-0, policy_version 617150 (0.00082) [2022-07-10 07:00:29,414][25689] Fps is (10 sec: 5460.8, 60 sec: 5551.5, 300 sec: 5568.2). Total num frames: 631966720. Throughput: 0: 5812.3. Samples: 631964860. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:29,414][25689] Avg episode reward: [(0, '-14.734')] [2022-07-10 07:00:30,260][26022] Updated weights on worker 0-0, policy_version 617160 (0.00085) [2022-07-10 07:00:31,811][26022] Updated weights on worker 0-0, policy_version 617170 (0.00083) [2022-07-10 07:00:33,883][26022] Updated weights on worker 0-0, policy_version 617180 (0.00090) [2022-07-10 07:00:34,419][25689] Fps is (10 sec: 5929.2, 60 sec: 5588.1, 300 sec: 5579.2). Total num frames: 631997440. Throughput: 0: 5859.7. Samples: 631998758. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:34,419][25689] Avg episode reward: [(0, '-14.962')] [2022-07-10 07:00:35,537][26022] Updated weights on worker 0-0, policy_version 617190 (0.00093) [2022-07-10 07:00:37,400][26022] Updated weights on worker 0-0, policy_version 617200 (0.00091) [2022-07-10 07:00:39,320][26022] Updated weights on worker 0-0, policy_version 617210 (0.00085) [2022-07-10 07:00:39,443][25689] Fps is (10 sec: 5615.4, 60 sec: 5559.7, 300 sec: 5570.5). Total num frames: 632023040. Throughput: 0: 5871.4. Samples: 632032684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:39,444][25689] Avg episode reward: [(0, '-14.807')] [2022-07-10 07:00:41,053][26022] Updated weights on worker 0-0, policy_version 617220 (0.00085) [2022-07-10 07:00:43,031][26022] Updated weights on worker 0-0, policy_version 617230 (0.00085) [2022-07-10 07:00:44,585][25689] Fps is (10 sec: 5338.4, 60 sec: 5558.7, 300 sec: 5565.7). Total num frames: 632051712. Throughput: 0: 5009.5. Samples: 632049324. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:44,586][25689] Avg episode reward: [(0, '-12.039')] [2022-07-10 07:00:44,752][26022] Updated weights on worker 0-0, policy_version 617240 (0.00086) [2022-07-10 07:00:46,582][26022] Updated weights on worker 0-0, policy_version 617250 (0.00091) [2022-07-10 07:00:48,394][26022] Updated weights on worker 0-0, policy_version 617260 (0.00086) [2022-07-10 07:00:49,660][25689] Fps is (10 sec: 5713.2, 60 sec: 5577.0, 300 sec: 5572.5). Total num frames: 632081408. Throughput: 0: 5848.7. Samples: 632083272. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:49,660][25689] Avg episode reward: [(0, '-11.445')] [2022-07-10 07:00:50,251][26022] Updated weights on worker 0-0, policy_version 617270 (0.00103) [2022-07-10 07:00:52,053][26022] Updated weights on worker 0-0, policy_version 617280 (0.00087) [2022-07-10 07:00:53,799][26022] Updated weights on worker 0-0, policy_version 617290 (0.00924) [2022-07-10 07:00:54,683][25689] Fps is (10 sec: 5577.6, 60 sec: 5528.3, 300 sec: 5562.5). Total num frames: 632108032. Throughput: 0: 5837.9. Samples: 632117056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:54,683][25689] Avg episode reward: [(0, '-11.491')] [2022-07-10 07:00:55,671][26022] Updated weights on worker 0-0, policy_version 617300 (0.00095) [2022-07-10 07:00:57,540][26022] Updated weights on worker 0-0, policy_version 617310 (0.00083) [2022-07-10 07:00:59,151][26022] Updated weights on worker 0-0, policy_version 617320 (0.00087) [2022-07-10 07:00:59,684][25689] Fps is (10 sec: 5618.2, 60 sec: 5571.3, 300 sec: 5570.1). Total num frames: 632137728. Throughput: 0: 4992.9. Samples: 632133746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:00:59,685][25689] Avg episode reward: [(0, '-13.683')] [2022-07-10 07:01:01,237][26022] Updated weights on worker 0-0, policy_version 617330 (0.00085) [2022-07-10 07:01:03,393][26022] Updated weights on worker 0-0, policy_version 617340 (0.00085) [2022-07-10 07:01:04,775][25689] Fps is (10 sec: 5478.9, 60 sec: 5558.0, 300 sec: 5569.2). Total num frames: 632163328. Throughput: 0: 5732.3. Samples: 632165060. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:04,776][25689] Avg episode reward: [(0, '-13.483')] [2022-07-10 07:01:05,379][26022] Updated weights on worker 0-0, policy_version 617350 (0.00084) [2022-07-10 07:01:07,200][26022] Updated weights on worker 0-0, policy_version 617360 (0.00088) [2022-07-10 07:01:08,795][26022] Updated weights on worker 0-0, policy_version 617370 (0.00093) [2022-07-10 07:01:09,843][25689] Fps is (10 sec: 5342.6, 60 sec: 5572.6, 300 sec: 5569.1). Total num frames: 632192000. Throughput: 0: 5726.3. Samples: 632198846. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:09,843][25689] Avg episode reward: [(0, '-14.768')] [2022-07-10 07:01:10,974][26022] Updated weights on worker 0-0, policy_version 617380 (0.00086) [2022-07-10 07:01:12,498][26022] Updated weights on worker 0-0, policy_version 617390 (0.00085) [2022-07-10 07:01:14,478][26022] Updated weights on worker 0-0, policy_version 617400 (0.00081) [2022-07-10 07:01:14,844][25689] Fps is (10 sec: 5593.6, 60 sec: 5596.8, 300 sec: 5570.3). Total num frames: 632219648. Throughput: 0: 4891.1. Samples: 632215666. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:14,845][25689] Avg episode reward: [(0, '-15.223')] [2022-07-10 07:01:16,083][26022] Updated weights on worker 0-0, policy_version 617410 (0.00082) [2022-07-10 07:01:17,883][26022] Updated weights on worker 0-0, policy_version 617420 (0.00086) [2022-07-10 07:01:19,736][26022] Updated weights on worker 0-0, policy_version 617430 (0.00089) [2022-07-10 07:01:19,850][25689] Fps is (10 sec: 5627.7, 60 sec: 5562.8, 300 sec: 5568.1). Total num frames: 632248320. Throughput: 0: 5752.3. Samples: 632249746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:19,851][25689] Avg episode reward: [(0, '-15.170')] [2022-07-10 07:01:21,601][26022] Updated weights on worker 0-0, policy_version 617440 (0.00089) [2022-07-10 07:01:23,464][26022] Updated weights on worker 0-0, policy_version 617450 (0.00087) [2022-07-10 07:01:24,923][25689] Fps is (10 sec: 5689.7, 60 sec: 5600.5, 300 sec: 5568.2). Total num frames: 632276992. Throughput: 0: 5883.7. Samples: 632283600. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:24,924][25689] Avg episode reward: [(0, '-14.850')] [2022-07-10 07:01:25,149][26022] Updated weights on worker 0-0, policy_version 617460 (0.00084) [2022-07-10 07:01:27,072][26022] Updated weights on worker 0-0, policy_version 617470 (0.00093) [2022-07-10 07:01:28,917][26022] Updated weights on worker 0-0, policy_version 617480 (0.00085) [2022-07-10 07:01:29,925][25689] Fps is (10 sec: 5590.3, 60 sec: 5584.4, 300 sec: 5576.8). Total num frames: 632304640. Throughput: 0: 5054.7. Samples: 632300356. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:29,926][25689] Avg episode reward: [(0, '-15.296')] [2022-07-10 07:01:30,908][26022] Updated weights on worker 0-0, policy_version 617490 (0.00092) [2022-07-10 07:01:32,611][26022] Updated weights on worker 0-0, policy_version 617500 (0.00088) [2022-07-10 07:01:34,606][26022] Updated weights on worker 0-0, policy_version 617510 (0.00507) [2022-07-10 07:01:34,950][25689] Fps is (10 sec: 5616.6, 60 sec: 5548.8, 300 sec: 5567.1). Total num frames: 632333312. Throughput: 0: 5884.8. Samples: 632333986. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:34,952][25689] Avg episode reward: [(0, '-15.457')] [2022-07-10 07:01:36,068][26022] Updated weights on worker 0-0, policy_version 617520 (0.00086) [2022-07-10 07:01:38,099][26022] Updated weights on worker 0-0, policy_version 617530 (0.00088) [2022-07-10 07:01:39,651][26022] Updated weights on worker 0-0, policy_version 617540 (0.00084) [2022-07-10 07:01:39,980][25689] Fps is (10 sec: 5703.1, 60 sec: 5599.0, 300 sec: 5568.7). Total num frames: 632361984. Throughput: 0: 5868.0. Samples: 632367866. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 07:01:39,980][25689] Avg episode reward: [(0, '-13.631')] [2022-07-10 07:01:41,846][26022] Updated weights on worker 0-0, policy_version 617550 (0.00089) [2022-07-10 07:01:43,637][26022] Updated weights on worker 0-0, policy_version 617560 (0.00091) [2022-07-10 07:01:45,027][25689] Fps is (10 sec: 5588.9, 60 sec: 5590.8, 300 sec: 5571.8). Total num frames: 632389632. Throughput: 0: 5004.7. Samples: 632384218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:01:45,028][25689] Avg episode reward: [(0, '-13.130')] [2022-07-10 07:01:45,403][26022] Updated weights on worker 0-0, policy_version 617570 (0.00098) [2022-07-10 07:01:47,235][26022] Updated weights on worker 0-0, policy_version 617580 (0.00081) [2022-07-10 07:01:49,081][26022] Updated weights on worker 0-0, policy_version 617590 (0.00088) [2022-07-10 07:01:50,032][25689] Fps is (10 sec: 5500.6, 60 sec: 5563.3, 300 sec: 5566.1). Total num frames: 632417280. Throughput: 0: 5852.2. Samples: 632418030. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:01:50,033][25689] Avg episode reward: [(0, '-12.498')] [2022-07-10 07:01:50,752][26022] Updated weights on worker 0-0, policy_version 617600 (0.00081) [2022-07-10 07:01:52,624][26022] Updated weights on worker 0-0, policy_version 617610 (0.00104) [2022-07-10 07:01:54,495][26022] Updated weights on worker 0-0, policy_version 617620 (0.00086) [2022-07-10 07:01:55,071][25689] Fps is (10 sec: 5505.3, 60 sec: 5578.8, 300 sec: 5566.7). Total num frames: 632444928. Throughput: 0: 5862.5. Samples: 632451946. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:01:55,072][25689] Avg episode reward: [(0, '-12.623')] [2022-07-10 07:01:56,176][26022] Updated weights on worker 0-0, policy_version 617630 (0.00089) [2022-07-10 07:01:58,239][26022] Updated weights on worker 0-0, policy_version 617640 (0.00082) [2022-07-10 07:01:59,793][26022] Updated weights on worker 0-0, policy_version 617650 (0.00088) [2022-07-10 07:02:00,109][25689] Fps is (10 sec: 5690.8, 60 sec: 5575.5, 300 sec: 5577.5). Total num frames: 632474624. Throughput: 0: 5016.3. Samples: 632468840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:00,110][25689] Avg episode reward: [(0, '-11.285')] [2022-07-10 07:02:02,218][26022] Updated weights on worker 0-0, policy_version 617660 (0.00086) [2022-07-10 07:02:03,875][26022] Updated weights on worker 0-0, policy_version 617670 (0.00088) [2022-07-10 07:02:05,167][25689] Fps is (10 sec: 5477.0, 60 sec: 5578.5, 300 sec: 5574.7). Total num frames: 632500224. Throughput: 0: 5758.4. Samples: 632500194. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:05,168][25689] Avg episode reward: [(0, '-10.745')] [2022-07-10 07:02:05,916][26022] Updated weights on worker 0-0, policy_version 617680 (0.00085) [2022-07-10 07:02:07,598][26022] Updated weights on worker 0-0, policy_version 617690 (0.00098) [2022-07-10 07:02:09,497][26022] Updated weights on worker 0-0, policy_version 617700 (0.00082) [2022-07-10 07:02:10,179][25689] Fps is (10 sec: 5288.0, 60 sec: 5566.7, 300 sec: 5567.8). Total num frames: 632527872. Throughput: 0: 5762.5. Samples: 632534122. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:10,179][25689] Avg episode reward: [(0, '-12.314')] [2022-07-10 07:02:10,638][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:02:10,662][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000617705_632529920.pth [2022-07-10 07:02:10,663][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000615746_630523904.pth [2022-07-10 07:02:11,223][26022] Updated weights on worker 0-0, policy_version 617710 (0.01315) [2022-07-10 07:02:13,515][26022] Updated weights on worker 0-0, policy_version 617720 (0.00089) [2022-07-10 07:02:14,841][26022] Updated weights on worker 0-0, policy_version 617730 (0.00085) [2022-07-10 07:02:15,199][25689] Fps is (10 sec: 5716.5, 60 sec: 5598.9, 300 sec: 5574.6). Total num frames: 632557568. Throughput: 0: 4905.5. Samples: 632550680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:15,199][25689] Avg episode reward: [(0, '-12.260')] [2022-07-10 07:02:16,918][26022] Updated weights on worker 0-0, policy_version 617740 (0.00090) [2022-07-10 07:02:18,435][26022] Updated weights on worker 0-0, policy_version 617750 (0.00097) [2022-07-10 07:02:20,216][25689] Fps is (10 sec: 5610.8, 60 sec: 5563.9, 300 sec: 5572.2). Total num frames: 632584192. Throughput: 0: 5760.4. Samples: 632584666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:20,217][25689] Avg episode reward: [(0, '-12.318')] [2022-07-10 07:02:20,576][26022] Updated weights on worker 0-0, policy_version 617760 (0.00110) [2022-07-10 07:02:22,231][26022] Updated weights on worker 0-0, policy_version 617770 (0.00086) [2022-07-10 07:02:24,099][26022] Updated weights on worker 0-0, policy_version 617780 (0.00092) [2022-07-10 07:02:25,317][25689] Fps is (10 sec: 5465.1, 60 sec: 5561.3, 300 sec: 5567.0). Total num frames: 632612864. Throughput: 0: 5868.5. Samples: 632618440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:25,317][25689] Avg episode reward: [(0, '-12.335')] [2022-07-10 07:02:25,875][26022] Updated weights on worker 0-0, policy_version 617790 (0.00087) [2022-07-10 07:02:27,825][26022] Updated weights on worker 0-0, policy_version 617800 (0.00092) [2022-07-10 07:02:29,515][26022] Updated weights on worker 0-0, policy_version 617810 (0.00098) [2022-07-10 07:02:30,323][25689] Fps is (10 sec: 5674.1, 60 sec: 5577.9, 300 sec: 5575.5). Total num frames: 632641536. Throughput: 0: 5017.1. Samples: 632635188. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:30,324][25689] Avg episode reward: [(0, '-12.605')] [2022-07-10 07:02:31,527][26022] Updated weights on worker 0-0, policy_version 617820 (0.00092) [2022-07-10 07:02:33,317][26022] Updated weights on worker 0-0, policy_version 617830 (0.00089) [2022-07-10 07:02:34,982][26022] Updated weights on worker 0-0, policy_version 617840 (0.00094) [2022-07-10 07:02:35,327][25689] Fps is (10 sec: 5524.1, 60 sec: 5546.0, 300 sec: 5572.5). Total num frames: 632668160. Throughput: 0: 5867.4. Samples: 632668778. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:35,327][25689] Avg episode reward: [(0, '-12.607')] [2022-07-10 07:02:37,015][26022] Updated weights on worker 0-0, policy_version 617851 (0.00089) [2022-07-10 07:02:38,998][26022] Updated weights on worker 0-0, policy_version 617861 (0.00094) [2022-07-10 07:02:40,355][25689] Fps is (10 sec: 5614.0, 60 sec: 5563.1, 300 sec: 5577.1). Total num frames: 632697856. Throughput: 0: 5856.0. Samples: 632702596. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:40,355][25689] Avg episode reward: [(0, '-10.548')] [2022-07-10 07:02:40,812][26022] Updated weights on worker 0-0, policy_version 617871 (0.00093) [2022-07-10 07:02:42,573][26022] Updated weights on worker 0-0, policy_version 617881 (0.00082) [2022-07-10 07:02:44,515][26022] Updated weights on worker 0-0, policy_version 617891 (0.00085) [2022-07-10 07:02:45,419][25689] Fps is (10 sec: 5783.1, 60 sec: 5578.5, 300 sec: 5573.7). Total num frames: 632726528. Throughput: 0: 5024.1. Samples: 632719438. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:45,420][25689] Avg episode reward: [(0, '-10.826')] [2022-07-10 07:02:46,146][26022] Updated weights on worker 0-0, policy_version 617901 (0.00087) [2022-07-10 07:02:48,094][26022] Updated weights on worker 0-0, policy_version 617911 (0.00087) [2022-07-10 07:02:49,708][26022] Updated weights on worker 0-0, policy_version 617921 (0.00090) [2022-07-10 07:02:50,440][25689] Fps is (10 sec: 5686.1, 60 sec: 5594.0, 300 sec: 5580.3). Total num frames: 632755200. Throughput: 0: 5878.1. Samples: 632753438. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:50,440][25689] Avg episode reward: [(0, '-10.476')] [2022-07-10 07:02:51,577][26022] Updated weights on worker 0-0, policy_version 617931 (0.00086) [2022-07-10 07:02:53,359][26022] Updated weights on worker 0-0, policy_version 617941 (0.00093) [2022-07-10 07:02:55,394][26022] Updated weights on worker 0-0, policy_version 617951 (0.00088) [2022-07-10 07:02:55,468][25689] Fps is (10 sec: 5503.1, 60 sec: 5578.1, 300 sec: 5573.2). Total num frames: 632781824. Throughput: 0: 5893.9. Samples: 632787488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:02:55,469][25689] Avg episode reward: [(0, '-11.098')] [2022-07-10 07:02:56,927][26022] Updated weights on worker 0-0, policy_version 617961 (0.00087) [2022-07-10 07:02:58,922][26022] Updated weights on worker 0-0, policy_version 617971 (0.00087) [2022-07-10 07:03:00,526][25689] Fps is (10 sec: 5583.9, 60 sec: 5576.2, 300 sec: 5584.6). Total num frames: 632811520. Throughput: 0: 5041.1. Samples: 632804280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:00,526][25689] Avg episode reward: [(0, '-10.707')] [2022-07-10 07:03:00,671][26022] Updated weights on worker 0-0, policy_version 617981 (0.00084) [2022-07-10 07:03:02,861][26022] Updated weights on worker 0-0, policy_version 617991 (0.00090) [2022-07-10 07:03:04,683][26022] Updated weights on worker 0-0, policy_version 618001 (0.00087) [2022-07-10 07:03:05,599][25689] Fps is (10 sec: 5660.2, 60 sec: 5608.7, 300 sec: 5581.6). Total num frames: 632839168. Throughput: 0: 5777.3. Samples: 632836020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:05,599][25689] Avg episode reward: [(0, '-13.731')] [2022-07-10 07:03:06,475][26022] Updated weights on worker 0-0, policy_version 618011 (0.00086) [2022-07-10 07:03:08,312][26022] Updated weights on worker 0-0, policy_version 618021 (0.00087) [2022-07-10 07:03:10,260][26022] Updated weights on worker 0-0, policy_version 618031 (0.00096) [2022-07-10 07:03:10,604][25689] Fps is (10 sec: 5385.3, 60 sec: 5592.4, 300 sec: 5578.1). Total num frames: 632865792. Throughput: 0: 5781.2. Samples: 632870010. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:10,605][25689] Avg episode reward: [(0, '-13.704')] [2022-07-10 07:03:11,806][26022] Updated weights on worker 0-0, policy_version 618041 (0.00087) [2022-07-10 07:03:13,776][26022] Updated weights on worker 0-0, policy_version 618051 (0.00989) [2022-07-10 07:03:15,588][26022] Updated weights on worker 0-0, policy_version 618061 (0.00089) [2022-07-10 07:03:15,639][25689] Fps is (10 sec: 5507.6, 60 sec: 5574.1, 300 sec: 5578.6). Total num frames: 632894464. Throughput: 0: 5764.8. Samples: 632903770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:15,640][25689] Avg episode reward: [(0, '-14.649')] [2022-07-10 07:03:17,555][26022] Updated weights on worker 0-0, policy_version 618071 (0.00094) [2022-07-10 07:03:19,272][26022] Updated weights on worker 0-0, policy_version 618081 (0.00092) [2022-07-10 07:03:20,659][25689] Fps is (10 sec: 5601.1, 60 sec: 5590.8, 300 sec: 5576.0). Total num frames: 632922112. Throughput: 0: 5779.3. Samples: 632920634. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:20,660][25689] Avg episode reward: [(0, '-14.690')] [2022-07-10 07:03:21,055][26022] Updated weights on worker 0-0, policy_version 618091 (0.00089) [2022-07-10 07:03:22,717][26022] Updated weights on worker 0-0, policy_version 618101 (0.00090) [2022-07-10 07:03:24,891][26022] Updated weights on worker 0-0, policy_version 618111 (0.00088) [2022-07-10 07:03:25,711][25689] Fps is (10 sec: 5693.1, 60 sec: 5612.2, 300 sec: 5585.6). Total num frames: 632951808. Throughput: 0: 5895.9. Samples: 632954602. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:25,712][25689] Avg episode reward: [(0, '-12.682')] [2022-07-10 07:03:26,381][26022] Updated weights on worker 0-0, policy_version 618121 (0.00088) [2022-07-10 07:03:28,521][26022] Updated weights on worker 0-0, policy_version 618131 (0.00091) [2022-07-10 07:03:30,230][26022] Updated weights on worker 0-0, policy_version 618141 (0.00102) [2022-07-10 07:03:30,715][25689] Fps is (10 sec: 5600.5, 60 sec: 5578.5, 300 sec: 5575.5). Total num frames: 632978432. Throughput: 0: 5864.7. Samples: 632987956. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:30,715][25689] Avg episode reward: [(0, '-11.158')] [2022-07-10 07:03:32,024][26022] Updated weights on worker 0-0, policy_version 618151 (0.00084) [2022-07-10 07:03:33,818][26022] Updated weights on worker 0-0, policy_version 618161 (0.00087) [2022-07-10 07:03:35,642][26022] Updated weights on worker 0-0, policy_version 618171 (0.00092) [2022-07-10 07:03:35,727][25689] Fps is (10 sec: 5520.9, 60 sec: 5611.6, 300 sec: 5582.3). Total num frames: 633007104. Throughput: 0: 5029.1. Samples: 633004796. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:35,728][25689] Avg episode reward: [(0, '-10.406')] [2022-07-10 07:03:37,599][26022] Updated weights on worker 0-0, policy_version 618181 (0.00093) [2022-07-10 07:03:39,320][26022] Updated weights on worker 0-0, policy_version 618191 (0.00087) [2022-07-10 07:03:40,795][25689] Fps is (10 sec: 5587.4, 60 sec: 5574.1, 300 sec: 5579.4). Total num frames: 633034752. Throughput: 0: 5868.4. Samples: 633038800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:40,795][25689] Avg episode reward: [(0, '-10.666')] [2022-07-10 07:03:41,209][26022] Updated weights on worker 0-0, policy_version 618201 (0.00082) [2022-07-10 07:03:42,980][26022] Updated weights on worker 0-0, policy_version 618211 (0.00080) [2022-07-10 07:03:45,029][26022] Updated weights on worker 0-0, policy_version 618221 (0.00085) [2022-07-10 07:03:45,893][25689] Fps is (10 sec: 5439.6, 60 sec: 5554.1, 300 sec: 5571.3). Total num frames: 633062400. Throughput: 0: 5806.7. Samples: 633071788. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:45,893][25689] Avg episode reward: [(0, '-10.109')] [2022-07-10 07:03:46,596][26022] Updated weights on worker 0-0, policy_version 618231 (0.00093) [2022-07-10 07:03:48,722][26022] Updated weights on worker 0-0, policy_version 618241 (0.00092) [2022-07-10 07:03:50,202][26022] Updated weights on worker 0-0, policy_version 618251 (0.00096) [2022-07-10 07:03:50,911][25689] Fps is (10 sec: 5567.3, 60 sec: 5554.2, 300 sec: 5574.7). Total num frames: 633091072. Throughput: 0: 4986.3. Samples: 633088660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:50,911][25689] Avg episode reward: [(0, '-9.888')] [2022-07-10 07:03:52,518][26022] Updated weights on worker 0-0, policy_version 618261 (0.00094) [2022-07-10 07:03:54,118][26022] Updated weights on worker 0-0, policy_version 618271 (0.00089) [2022-07-10 07:03:55,959][25689] Fps is (10 sec: 5595.0, 60 sec: 5569.4, 300 sec: 5574.3). Total num frames: 633118720. Throughput: 0: 5766.2. Samples: 633121456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:03:55,959][25689] Avg episode reward: [(0, '-11.993')] [2022-07-10 07:03:56,148][26022] Updated weights on worker 0-0, policy_version 618281 (0.00088) [2022-07-10 07:03:57,930][26022] Updated weights on worker 0-0, policy_version 618291 (0.00093) [2022-07-10 07:03:59,768][26022] Updated weights on worker 0-0, policy_version 618301 (0.00097) [2022-07-10 07:04:00,970][25689] Fps is (10 sec: 5598.7, 60 sec: 5556.7, 300 sec: 5578.5). Total num frames: 633147392. Throughput: 0: 5774.2. Samples: 633155298. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:00,971][25689] Avg episode reward: [(0, '-11.524')] [2022-07-10 07:04:01,626][26022] Updated weights on worker 0-0, policy_version 618311 (0.00094) [2022-07-10 07:04:03,687][26022] Updated weights on worker 0-0, policy_version 618321 (0.00086) [2022-07-10 07:04:05,436][26022] Updated weights on worker 0-0, policy_version 618331 (0.00089) [2022-07-10 07:04:06,059][25689] Fps is (10 sec: 5474.9, 60 sec: 5538.3, 300 sec: 5574.0). Total num frames: 633174016. Throughput: 0: 4869.8. Samples: 633169994. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:06,061][25689] Avg episode reward: [(0, '-13.194')] [2022-07-10 07:04:07,408][26022] Updated weights on worker 0-0, policy_version 618341 (0.00089) [2022-07-10 07:04:09,128][26022] Updated weights on worker 0-0, policy_version 618351 (0.00089) [2022-07-10 07:04:10,742][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:04:10,760][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000618360_633200640.pth [2022-07-10 07:04:10,761][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000616399_631192576.pth [2022-07-10 07:04:10,991][26022] Updated weights on worker 0-0, policy_version 618361 (0.00082) [2022-07-10 07:04:11,103][25689] Fps is (10 sec: 5356.3, 60 sec: 5551.7, 300 sec: 5570.4). Total num frames: 633201664. Throughput: 0: 5691.5. Samples: 633203582. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:11,103][25689] Avg episode reward: [(0, '-12.797')] [2022-07-10 07:04:12,670][26022] Updated weights on worker 0-0, policy_version 618371 (0.00095) [2022-07-10 07:04:14,809][26022] Updated weights on worker 0-0, policy_version 618381 (0.00087) [2022-07-10 07:04:16,106][25689] Fps is (10 sec: 5503.5, 60 sec: 5537.6, 300 sec: 5563.5). Total num frames: 633229312. Throughput: 0: 5765.1. Samples: 633237608. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:16,108][25689] Avg episode reward: [(0, '-12.299')] [2022-07-10 07:04:16,366][26022] Updated weights on worker 0-0, policy_version 618391 (0.00086) [2022-07-10 07:04:18,527][26022] Updated weights on worker 0-0, policy_version 618401 (0.00081) [2022-07-10 07:04:19,870][26022] Updated weights on worker 0-0, policy_version 618411 (0.00084) [2022-07-10 07:04:21,191][25689] Fps is (10 sec: 5684.1, 60 sec: 5565.5, 300 sec: 5573.1). Total num frames: 633259008. Throughput: 0: 4914.1. Samples: 633254666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:21,193][25689] Avg episode reward: [(0, '-11.588')] [2022-07-10 07:04:22,110][26022] Updated weights on worker 0-0, policy_version 618421 (0.00084) [2022-07-10 07:04:23,607][26022] Updated weights on worker 0-0, policy_version 618431 (0.00088) [2022-07-10 07:04:25,676][26022] Updated weights on worker 0-0, policy_version 618441 (0.00090) [2022-07-10 07:04:26,263][25689] Fps is (10 sec: 5746.5, 60 sec: 5546.8, 300 sec: 5572.0). Total num frames: 633287680. Throughput: 0: 5869.2. Samples: 633288580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:26,264][25689] Avg episode reward: [(0, '-11.154')] [2022-07-10 07:04:27,201][26022] Updated weights on worker 0-0, policy_version 618451 (0.00094) [2022-07-10 07:04:29,279][26022] Updated weights on worker 0-0, policy_version 618461 (0.00085) [2022-07-10 07:04:30,911][26022] Updated weights on worker 0-0, policy_version 618471 (0.00085) [2022-07-10 07:04:31,277][25689] Fps is (10 sec: 5583.9, 60 sec: 5562.8, 300 sec: 5569.0). Total num frames: 633315328. Throughput: 0: 5882.2. Samples: 633322254. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:31,280][25689] Avg episode reward: [(0, '-11.135')] [2022-07-10 07:04:32,911][26022] Updated weights on worker 0-0, policy_version 618481 (0.00089) [2022-07-10 07:04:34,759][26022] Updated weights on worker 0-0, policy_version 618491 (0.00087) [2022-07-10 07:04:36,301][25689] Fps is (10 sec: 5610.7, 60 sec: 5561.7, 300 sec: 5573.5). Total num frames: 633344000. Throughput: 0: 5023.9. Samples: 633339066. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:36,302][25689] Avg episode reward: [(0, '-10.254')] [2022-07-10 07:04:36,347][26022] Updated weights on worker 0-0, policy_version 618501 (0.00092) [2022-07-10 07:04:38,451][26022] Updated weights on worker 0-0, policy_version 618511 (0.00085) [2022-07-10 07:04:39,968][26022] Updated weights on worker 0-0, policy_version 618521 (0.00092) [2022-07-10 07:04:41,328][25689] Fps is (10 sec: 5603.7, 60 sec: 5565.5, 300 sec: 5572.0). Total num frames: 633371648. Throughput: 0: 5858.6. Samples: 633372638. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:41,328][25689] Avg episode reward: [(0, '-9.938')] [2022-07-10 07:04:42,072][26022] Updated weights on worker 0-0, policy_version 618531 (0.00089) [2022-07-10 07:04:43,785][26022] Updated weights on worker 0-0, policy_version 618541 (0.00094) [2022-07-10 07:04:45,624][26022] Updated weights on worker 0-0, policy_version 618551 (0.00088) [2022-07-10 07:04:46,379][25689] Fps is (10 sec: 5487.1, 60 sec: 5569.8, 300 sec: 5569.3). Total num frames: 633399296. Throughput: 0: 5828.4. Samples: 633405820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 07:04:46,379][25689] Avg episode reward: [(0, '-9.669')] [2022-07-10 07:04:47,438][26022] Updated weights on worker 0-0, policy_version 618561 (0.00089) [2022-07-10 07:04:49,431][26022] Updated weights on worker 0-0, policy_version 618571 (0.00089) [2022-07-10 07:04:51,139][26022] Updated weights on worker 0-0, policy_version 618581 (0.00093) [2022-07-10 07:04:51,435][25689] Fps is (10 sec: 5471.1, 60 sec: 5549.4, 300 sec: 5562.2). Total num frames: 633426944. Throughput: 0: 4982.8. Samples: 633422694. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:04:51,435][25689] Avg episode reward: [(0, '-10.277')] [2022-07-10 07:04:53,124][26022] Updated weights on worker 0-0, policy_version 618591 (0.00091) [2022-07-10 07:04:54,851][26022] Updated weights on worker 0-0, policy_version 618601 (0.00092) [2022-07-10 07:04:56,515][25689] Fps is (10 sec: 5556.5, 60 sec: 5563.4, 300 sec: 5566.0). Total num frames: 633455616. Throughput: 0: 5794.8. Samples: 633456198. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:04:56,515][25689] Avg episode reward: [(0, '-10.008')] [2022-07-10 07:04:56,869][26022] Updated weights on worker 0-0, policy_version 618611 (0.00094) [2022-07-10 07:04:58,577][26022] Updated weights on worker 0-0, policy_version 618621 (0.00085) [2022-07-10 07:05:00,431][26022] Updated weights on worker 0-0, policy_version 618631 (0.00084) [2022-07-10 07:05:01,607][25689] Fps is (10 sec: 5637.5, 60 sec: 5556.0, 300 sec: 5573.6). Total num frames: 633484288. Throughput: 0: 5760.6. Samples: 633489456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:01,615][25689] Avg episode reward: [(0, '-9.434')] [2022-07-10 07:05:02,770][26022] Updated weights on worker 0-0, policy_version 618641 (0.00099) [2022-07-10 07:05:04,471][26022] Updated weights on worker 0-0, policy_version 618651 (0.00095) [2022-07-10 07:05:06,588][26022] Updated weights on worker 0-0, policy_version 618661 (0.00087) [2022-07-10 07:05:06,710][25689] Fps is (10 sec: 5222.9, 60 sec: 5520.9, 300 sec: 5562.2). Total num frames: 633508864. Throughput: 0: 4844.3. Samples: 633504310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:06,712][25689] Avg episode reward: [(0, '-9.841')] [2022-07-10 07:05:07,980][26022] Updated weights on worker 0-0, policy_version 618671 (0.00087) [2022-07-10 07:05:10,004][26022] Updated weights on worker 0-0, policy_version 618681 (0.00090) [2022-07-10 07:05:11,735][25689] Fps is (10 sec: 5358.6, 60 sec: 5556.4, 300 sec: 5573.5). Total num frames: 633538560. Throughput: 0: 5689.5. Samples: 633538190. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:11,736][25689] Avg episode reward: [(0, '-9.775')] [2022-07-10 07:05:11,784][26022] Updated weights on worker 0-0, policy_version 618691 (0.00083) [2022-07-10 07:05:13,552][26022] Updated weights on worker 0-0, policy_version 618701 (0.00084) [2022-07-10 07:05:15,463][26022] Updated weights on worker 0-0, policy_version 618711 (0.00084) [2022-07-10 07:05:16,807][25689] Fps is (10 sec: 5780.4, 60 sec: 5567.0, 300 sec: 5565.4). Total num frames: 633567232. Throughput: 0: 5687.7. Samples: 633571616. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:16,808][25689] Avg episode reward: [(0, '-10.240')] [2022-07-10 07:05:17,322][26022] Updated weights on worker 0-0, policy_version 618721 (0.00083) [2022-07-10 07:05:18,951][26022] Updated weights on worker 0-0, policy_version 618731 (0.00096) [2022-07-10 07:05:20,980][26022] Updated weights on worker 0-0, policy_version 618741 (0.00086) [2022-07-10 07:05:21,825][25689] Fps is (10 sec: 5581.9, 60 sec: 5539.4, 300 sec: 5570.6). Total num frames: 633594880. Throughput: 0: 4904.8. Samples: 633588620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:21,825][25689] Avg episode reward: [(0, '-11.238')] [2022-07-10 07:05:22,620][26022] Updated weights on worker 0-0, policy_version 618751 (0.00084) [2022-07-10 07:05:24,617][26022] Updated weights on worker 0-0, policy_version 618761 (0.00086) [2022-07-10 07:05:26,314][26022] Updated weights on worker 0-0, policy_version 618771 (0.00085) [2022-07-10 07:05:26,864][25689] Fps is (10 sec: 5701.9, 60 sec: 5559.3, 300 sec: 5573.6). Total num frames: 633624576. Throughput: 0: 5867.8. Samples: 633622572. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:26,866][25689] Avg episode reward: [(0, '-10.611')] [2022-07-10 07:05:28,094][26022] Updated weights on worker 0-0, policy_version 618781 (0.00087) [2022-07-10 07:05:29,936][26022] Updated weights on worker 0-0, policy_version 618791 (0.00093) [2022-07-10 07:05:31,784][26022] Updated weights on worker 0-0, policy_version 618801 (0.00092) [2022-07-10 07:05:31,959][25689] Fps is (10 sec: 5658.2, 60 sec: 5551.8, 300 sec: 5561.5). Total num frames: 633652224. Throughput: 0: 5827.2. Samples: 633656042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:31,960][25689] Avg episode reward: [(0, '-11.096')] [2022-07-10 07:05:33,603][26022] Updated weights on worker 0-0, policy_version 618811 (0.00087) [2022-07-10 07:05:35,631][26022] Updated weights on worker 0-0, policy_version 618821 (0.00087) [2022-07-10 07:05:37,031][25689] Fps is (10 sec: 5640.4, 60 sec: 5564.3, 300 sec: 5574.4). Total num frames: 633681920. Throughput: 0: 5842.0. Samples: 633689762. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:37,031][25689] Avg episode reward: [(0, '-10.037')] [2022-07-10 07:05:37,145][26022] Updated weights on worker 0-0, policy_version 618831 (0.00090) [2022-07-10 07:05:39,341][26022] Updated weights on worker 0-0, policy_version 618841 (0.00092) [2022-07-10 07:05:40,860][26022] Updated weights on worker 0-0, policy_version 618851 (0.00092) [2022-07-10 07:05:42,055][25689] Fps is (10 sec: 5477.4, 60 sec: 5530.9, 300 sec: 5566.3). Total num frames: 633707520. Throughput: 0: 5842.2. Samples: 633706808. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:42,056][25689] Avg episode reward: [(0, '-10.852')] [2022-07-10 07:05:42,855][26022] Updated weights on worker 0-0, policy_version 618861 (0.00088) [2022-07-10 07:05:44,648][26022] Updated weights on worker 0-0, policy_version 618871 (0.00098) [2022-07-10 07:05:46,315][26022] Updated weights on worker 0-0, policy_version 618881 (0.00076) [2022-07-10 07:05:47,092][25689] Fps is (10 sec: 5597.7, 60 sec: 5582.7, 300 sec: 5570.5). Total num frames: 633738240. Throughput: 0: 5828.4. Samples: 633740468. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:47,092][25689] Avg episode reward: [(0, '-10.027')] [2022-07-10 07:05:48,282][26022] Updated weights on worker 0-0, policy_version 618891 (0.00334) [2022-07-10 07:05:49,946][26022] Updated weights on worker 0-0, policy_version 618901 (0.00090) [2022-07-10 07:05:51,856][26022] Updated weights on worker 0-0, policy_version 618911 (0.00094) [2022-07-10 07:05:52,115][25689] Fps is (10 sec: 5903.3, 60 sec: 5602.6, 300 sec: 5577.3). Total num frames: 633766912. Throughput: 0: 5870.4. Samples: 633774366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:52,116][25689] Avg episode reward: [(0, '-9.934')] [2022-07-10 07:05:53,726][26022] Updated weights on worker 0-0, policy_version 618921 (0.00087) [2022-07-10 07:05:55,439][26022] Updated weights on worker 0-0, policy_version 618931 (0.00087) [2022-07-10 07:05:57,120][25689] Fps is (10 sec: 5616.5, 60 sec: 5592.7, 300 sec: 5570.4). Total num frames: 633794560. Throughput: 0: 5060.3. Samples: 633791418. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:05:57,120][25689] Avg episode reward: [(0, '-8.862')] [2022-07-10 07:05:57,219][26022] Updated weights on worker 0-0, policy_version 618941 (0.00095) [2022-07-10 07:05:59,111][26022] Updated weights on worker 0-0, policy_version 618951 (0.00086) [2022-07-10 07:06:00,765][26022] Updated weights on worker 0-0, policy_version 618961 (0.00087) [2022-07-10 07:06:02,134][25689] Fps is (10 sec: 5212.7, 60 sec: 5532.2, 300 sec: 5568.4). Total num frames: 633819136. Throughput: 0: 5894.9. Samples: 633825174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:02,134][25689] Avg episode reward: [(0, '-9.907')] [2022-07-10 07:06:03,153][26022] Updated weights on worker 0-0, policy_version 618971 (0.00086) [2022-07-10 07:06:04,931][26022] Updated weights on worker 0-0, policy_version 618981 (0.00096) [2022-07-10 07:06:06,671][26022] Updated weights on worker 0-0, policy_version 618991 (0.00087) [2022-07-10 07:06:07,242][25689] Fps is (10 sec: 5361.5, 60 sec: 5616.3, 300 sec: 5571.1). Total num frames: 633848832. Throughput: 0: 5767.5. Samples: 633856684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:07,243][25689] Avg episode reward: [(0, '-9.202')] [2022-07-10 07:06:08,825][26022] Updated weights on worker 0-0, policy_version 619001 (0.00093) [2022-07-10 07:06:10,367][26022] Updated weights on worker 0-0, policy_version 619011 (0.00088) [2022-07-10 07:06:10,763][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:06:10,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000619013_633869312.pth [2022-07-10 07:06:10,789][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000617053_631862272.pth [2022-07-10 07:06:12,247][25689] Fps is (10 sec: 5670.3, 60 sec: 5584.3, 300 sec: 5571.0). Total num frames: 633876480. Throughput: 0: 4935.7. Samples: 633873730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:12,247][25689] Avg episode reward: [(0, '-8.021')] [2022-07-10 07:06:12,268][26022] Updated weights on worker 0-0, policy_version 619021 (0.00091) [2022-07-10 07:06:14,048][26022] Updated weights on worker 0-0, policy_version 619031 (0.00087) [2022-07-10 07:06:15,867][26022] Updated weights on worker 0-0, policy_version 619041 (0.00108) [2022-07-10 07:06:17,267][25689] Fps is (10 sec: 5618.1, 60 sec: 5589.2, 300 sec: 5570.7). Total num frames: 633905152. Throughput: 0: 5759.2. Samples: 633907450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:17,268][25689] Avg episode reward: [(0, '-8.645')] [2022-07-10 07:06:17,851][26022] Updated weights on worker 0-0, policy_version 619051 (0.00095) [2022-07-10 07:06:19,418][26022] Updated weights on worker 0-0, policy_version 619061 (0.00088) [2022-07-10 07:06:21,340][26022] Updated weights on worker 0-0, policy_version 619071 (0.00086) [2022-07-10 07:06:22,310][25689] Fps is (10 sec: 5698.4, 60 sec: 5603.7, 300 sec: 5571.3). Total num frames: 633933824. Throughput: 0: 5764.0. Samples: 633941470. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:22,311][25689] Avg episode reward: [(0, '-8.908')] [2022-07-10 07:06:23,319][26022] Updated weights on worker 0-0, policy_version 619081 (0.00084) [2022-07-10 07:06:24,810][26022] Updated weights on worker 0-0, policy_version 619091 (0.00096) [2022-07-10 07:06:27,168][26022] Updated weights on worker 0-0, policy_version 619101 (0.00092) [2022-07-10 07:06:27,347][25689] Fps is (10 sec: 5587.5, 60 sec: 5570.1, 300 sec: 5570.6). Total num frames: 633961472. Throughput: 0: 5049.8. Samples: 633958208. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:27,347][25689] Avg episode reward: [(0, '-10.053')] [2022-07-10 07:06:28,587][26022] Updated weights on worker 0-0, policy_version 619111 (0.00088) [2022-07-10 07:06:30,506][26022] Updated weights on worker 0-0, policy_version 619121 (0.00093) [2022-07-10 07:06:32,355][25689] Fps is (10 sec: 5505.0, 60 sec: 5578.2, 300 sec: 5567.5). Total num frames: 633989120. Throughput: 0: 5869.8. Samples: 633991758. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:32,355][25689] Avg episode reward: [(0, '-9.882')] [2022-07-10 07:06:32,467][26022] Updated weights on worker 0-0, policy_version 619131 (0.00087) [2022-07-10 07:06:34,241][26022] Updated weights on worker 0-0, policy_version 619141 (0.00085) [2022-07-10 07:06:36,096][26022] Updated weights on worker 0-0, policy_version 619151 (0.00097) [2022-07-10 07:06:37,364][25689] Fps is (10 sec: 5724.3, 60 sec: 5583.9, 300 sec: 5571.3). Total num frames: 634018816. Throughput: 0: 5875.7. Samples: 634025536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:37,365][25689] Avg episode reward: [(0, '-10.671')] [2022-07-10 07:06:37,840][26022] Updated weights on worker 0-0, policy_version 619161 (0.00090) [2022-07-10 07:06:39,631][26022] Updated weights on worker 0-0, policy_version 619171 (0.00086) [2022-07-10 07:06:41,381][26022] Updated weights on worker 0-0, policy_version 619181 (0.00089) [2022-07-10 07:06:42,379][25689] Fps is (10 sec: 5618.4, 60 sec: 5601.7, 300 sec: 5568.5). Total num frames: 634045440. Throughput: 0: 5034.9. Samples: 634042516. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:42,379][25689] Avg episode reward: [(0, '-12.616')] [2022-07-10 07:06:43,345][26022] Updated weights on worker 0-0, policy_version 619191 (0.00092) [2022-07-10 07:06:45,169][26022] Updated weights on worker 0-0, policy_version 619201 (0.00086) [2022-07-10 07:06:46,980][26022] Updated weights on worker 0-0, policy_version 619211 (0.00087) [2022-07-10 07:06:47,441][25689] Fps is (10 sec: 5589.0, 60 sec: 5582.4, 300 sec: 5574.3). Total num frames: 634075136. Throughput: 0: 5850.2. Samples: 634075766. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:47,443][25689] Avg episode reward: [(0, '-11.792')] [2022-07-10 07:06:48,821][26022] Updated weights on worker 0-0, policy_version 619221 (0.00091) [2022-07-10 07:06:50,560][26022] Updated weights on worker 0-0, policy_version 619231 (0.00093) [2022-07-10 07:06:52,493][25689] Fps is (10 sec: 5568.8, 60 sec: 5545.9, 300 sec: 5570.7). Total num frames: 634101760. Throughput: 0: 5850.5. Samples: 634109574. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:52,494][25689] Avg episode reward: [(0, '-12.719')] [2022-07-10 07:06:52,633][26022] Updated weights on worker 0-0, policy_version 619241 (0.00091) [2022-07-10 07:06:54,279][26022] Updated weights on worker 0-0, policy_version 619251 (0.00092) [2022-07-10 07:06:56,194][26022] Updated weights on worker 0-0, policy_version 619261 (0.00087) [2022-07-10 07:06:57,526][25689] Fps is (10 sec: 5584.9, 60 sec: 5577.2, 300 sec: 5570.7). Total num frames: 634131456. Throughput: 0: 5015.0. Samples: 634126646. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:06:57,526][25689] Avg episode reward: [(0, '-12.335')] [2022-07-10 07:06:57,738][26022] Updated weights on worker 0-0, policy_version 619271 (0.00098) [2022-07-10 07:06:59,933][26022] Updated weights on worker 0-0, policy_version 619281 (0.00093) [2022-07-10 07:07:01,345][26022] Updated weights on worker 0-0, policy_version 619291 (0.00091) [2022-07-10 07:07:02,543][25689] Fps is (10 sec: 5502.0, 60 sec: 5593.9, 300 sec: 5571.5). Total num frames: 634157056. Throughput: 0: 5840.6. Samples: 634160284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:02,543][25689] Avg episode reward: [(0, '-11.932')] [2022-07-10 07:07:03,783][26022] Updated weights on worker 0-0, policy_version 619301 (0.00088) [2022-07-10 07:07:05,527][26022] Updated weights on worker 0-0, policy_version 619311 (0.00088) [2022-07-10 07:07:07,491][26022] Updated weights on worker 0-0, policy_version 619321 (0.00095) [2022-07-10 07:07:07,637][25689] Fps is (10 sec: 5266.0, 60 sec: 5561.2, 300 sec: 5570.0). Total num frames: 634184704. Throughput: 0: 5740.7. Samples: 634191706. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:07,638][25689] Avg episode reward: [(0, '-13.695')] [2022-07-10 07:07:09,291][26022] Updated weights on worker 0-0, policy_version 619331 (0.00090) [2022-07-10 07:07:11,275][26022] Updated weights on worker 0-0, policy_version 619341 (0.00088) [2022-07-10 07:07:12,658][25689] Fps is (10 sec: 5668.9, 60 sec: 5593.6, 300 sec: 5569.9). Total num frames: 634214400. Throughput: 0: 4913.2. Samples: 634208652. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:12,659][25689] Avg episode reward: [(0, '-12.439')] [2022-07-10 07:07:13,050][26022] Updated weights on worker 0-0, policy_version 619351 (0.00087) [2022-07-10 07:07:14,834][26022] Updated weights on worker 0-0, policy_version 619361 (0.00080) [2022-07-10 07:07:16,365][26022] Updated weights on worker 0-0, policy_version 619371 (0.00086) [2022-07-10 07:07:17,695][25689] Fps is (10 sec: 5599.5, 60 sec: 5558.2, 300 sec: 5569.6). Total num frames: 634241024. Throughput: 0: 5739.9. Samples: 634242418. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:17,696][25689] Avg episode reward: [(0, '-12.943')] [2022-07-10 07:07:18,488][26022] Updated weights on worker 0-0, policy_version 619381 (0.00091) [2022-07-10 07:07:20,072][26022] Updated weights on worker 0-0, policy_version 619391 (0.00088) [2022-07-10 07:07:21,966][26022] Updated weights on worker 0-0, policy_version 619401 (0.00087) [2022-07-10 07:07:22,714][25689] Fps is (10 sec: 5498.9, 60 sec: 5560.4, 300 sec: 5571.1). Total num frames: 634269696. Throughput: 0: 5727.4. Samples: 634275814. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:22,715][25689] Avg episode reward: [(0, '-12.052')] [2022-07-10 07:07:24,019][26022] Updated weights on worker 0-0, policy_version 619411 (0.00082) [2022-07-10 07:07:25,571][26022] Updated weights on worker 0-0, policy_version 619421 (0.00088) [2022-07-10 07:07:27,657][26022] Updated weights on worker 0-0, policy_version 619431 (0.00083) [2022-07-10 07:07:27,836][25689] Fps is (10 sec: 5654.7, 60 sec: 5569.5, 300 sec: 5568.9). Total num frames: 634298368. Throughput: 0: 4996.9. Samples: 634292638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:27,837][25689] Avg episode reward: [(0, '-12.042')] [2022-07-10 07:07:29,183][26022] Updated weights on worker 0-0, policy_version 619441 (0.00093) [2022-07-10 07:07:31,252][26022] Updated weights on worker 0-0, policy_version 619451 (0.00074) [2022-07-10 07:07:32,878][25689] Fps is (10 sec: 5541.3, 60 sec: 5566.4, 300 sec: 5571.7). Total num frames: 634326016. Throughput: 0: 5820.4. Samples: 634326338. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:32,878][25689] Avg episode reward: [(0, '-13.201')] [2022-07-10 07:07:33,037][26022] Updated weights on worker 0-0, policy_version 619461 (0.00090) [2022-07-10 07:07:34,832][26022] Updated weights on worker 0-0, policy_version 619471 (0.00083) [2022-07-10 07:07:36,715][26022] Updated weights on worker 0-0, policy_version 619481 (0.00084) [2022-07-10 07:07:37,899][25689] Fps is (10 sec: 5597.0, 60 sec: 5548.4, 300 sec: 5568.4). Total num frames: 634354688. Throughput: 0: 5813.6. Samples: 634359874. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:37,903][25689] Avg episode reward: [(0, '-13.750')] [2022-07-10 07:07:38,392][26022] Updated weights on worker 0-0, policy_version 619491 (0.00097) [2022-07-10 07:07:40,530][26022] Updated weights on worker 0-0, policy_version 619501 (0.00094) [2022-07-10 07:07:42,187][26022] Updated weights on worker 0-0, policy_version 619511 (0.00079) [2022-07-10 07:07:42,903][25689] Fps is (10 sec: 5719.7, 60 sec: 5583.2, 300 sec: 5569.5). Total num frames: 634383360. Throughput: 0: 5002.3. Samples: 634376806. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:42,904][25689] Avg episode reward: [(0, '-13.277')] [2022-07-10 07:07:44,138][26022] Updated weights on worker 0-0, policy_version 619521 (0.00075) [2022-07-10 07:07:45,809][26022] Updated weights on worker 0-0, policy_version 619531 (0.00090) [2022-07-10 07:07:47,755][26022] Updated weights on worker 0-0, policy_version 619541 (0.00086) [2022-07-10 07:07:48,053][25689] Fps is (10 sec: 5546.6, 60 sec: 5541.4, 300 sec: 5563.6). Total num frames: 634411008. Throughput: 0: 5824.4. Samples: 634410386. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:48,053][25689] Avg episode reward: [(0, '-12.574')] [2022-07-10 07:07:49,684][26022] Updated weights on worker 0-0, policy_version 619551 (0.00093) [2022-07-10 07:07:51,351][26022] Updated weights on worker 0-0, policy_version 619561 (0.00092) [2022-07-10 07:07:53,138][25689] Fps is (10 sec: 5402.7, 60 sec: 5555.1, 300 sec: 5566.0). Total num frames: 634438656. Throughput: 0: 5780.6. Samples: 634443456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:53,139][25689] Avg episode reward: [(0, '-13.153')] [2022-07-10 07:07:53,348][26022] Updated weights on worker 0-0, policy_version 619571 (0.00082) [2022-07-10 07:07:55,127][26022] Updated weights on worker 0-0, policy_version 619581 (0.00090) [2022-07-10 07:07:57,087][26022] Updated weights on worker 0-0, policy_version 619591 (0.00101) [2022-07-10 07:07:58,144][25689] Fps is (10 sec: 5682.4, 60 sec: 5557.6, 300 sec: 5567.0). Total num frames: 634468352. Throughput: 0: 5789.4. Samples: 634477084. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 07:07:58,145][25689] Avg episode reward: [(0, '-11.994')] [2022-07-10 07:07:58,729][26022] Updated weights on worker 0-0, policy_version 619601 (0.00084) [2022-07-10 07:08:00,667][26022] Updated weights on worker 0-0, policy_version 619611 (0.00090) [2022-07-10 07:08:02,832][26022] Updated weights on worker 0-0, policy_version 619621 (0.00087) [2022-07-10 07:08:03,166][25689] Fps is (10 sec: 5514.5, 60 sec: 5557.2, 300 sec: 5561.0). Total num frames: 634493952. Throughput: 0: 5780.6. Samples: 634493934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:03,166][25689] Avg episode reward: [(0, '-11.290')] [2022-07-10 07:08:04,625][26022] Updated weights on worker 0-0, policy_version 619631 (0.00087) [2022-07-10 07:08:06,435][26022] Updated weights on worker 0-0, policy_version 619641 (0.00088) [2022-07-10 07:08:08,286][25689] Fps is (10 sec: 5351.5, 60 sec: 5571.8, 300 sec: 5565.8). Total num frames: 634522624. Throughput: 0: 5691.3. Samples: 634525538. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:08,286][25689] Avg episode reward: [(0, '-10.438')] [2022-07-10 07:08:08,292][26022] Updated weights on worker 0-0, policy_version 619651 (0.00090) [2022-07-10 07:08:10,200][26022] Updated weights on worker 0-0, policy_version 619661 (0.00094) [2022-07-10 07:08:10,838][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:08:10,853][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000619665_634536960.pth [2022-07-10 07:08:10,854][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000617705_632529920.pth [2022-07-10 07:08:11,865][26022] Updated weights on worker 0-0, policy_version 619671 (0.00086) [2022-07-10 07:08:13,288][25689] Fps is (10 sec: 5563.8, 60 sec: 5539.7, 300 sec: 5562.9). Total num frames: 634550272. Throughput: 0: 5740.9. Samples: 634559134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:13,289][25689] Avg episode reward: [(0, '-10.674')] [2022-07-10 07:08:13,777][26022] Updated weights on worker 0-0, policy_version 619681 (0.00088) [2022-07-10 07:08:15,706][26022] Updated weights on worker 0-0, policy_version 619691 (0.00092) [2022-07-10 07:08:17,564][26022] Updated weights on worker 0-0, policy_version 619701 (0.00099) [2022-07-10 07:08:18,384][25689] Fps is (10 sec: 5475.6, 60 sec: 5551.2, 300 sec: 5561.5). Total num frames: 634577920. Throughput: 0: 4883.2. Samples: 634575920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:18,385][25689] Avg episode reward: [(0, '-9.404')] [2022-07-10 07:08:19,305][26022] Updated weights on worker 0-0, policy_version 619711 (0.00081) [2022-07-10 07:08:20,963][26022] Updated weights on worker 0-0, policy_version 619721 (0.00085) [2022-07-10 07:08:22,958][26022] Updated weights on worker 0-0, policy_version 619731 (0.00087) [2022-07-10 07:08:23,479][25689] Fps is (10 sec: 5526.8, 60 sec: 5544.3, 300 sec: 5557.3). Total num frames: 634606592. Throughput: 0: 5693.6. Samples: 634609588. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:23,479][25689] Avg episode reward: [(0, '-8.417')] [2022-07-10 07:08:24,779][26022] Updated weights on worker 0-0, policy_version 619741 (0.00084) [2022-07-10 07:08:26,621][26022] Updated weights on worker 0-0, policy_version 619751 (0.00089) [2022-07-10 07:08:28,292][26022] Updated weights on worker 0-0, policy_version 619761 (0.00085) [2022-07-10 07:08:28,513][25689] Fps is (10 sec: 5762.6, 60 sec: 5569.2, 300 sec: 5567.0). Total num frames: 634636288. Throughput: 0: 5812.6. Samples: 634643112. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:28,514][25689] Avg episode reward: [(0, '-9.241')] [2022-07-10 07:08:30,233][26022] Updated weights on worker 0-0, policy_version 619771 (0.00089) [2022-07-10 07:08:32,132][26022] Updated weights on worker 0-0, policy_version 619781 (0.00092) [2022-07-10 07:08:33,546][25689] Fps is (10 sec: 5594.4, 60 sec: 5553.1, 300 sec: 5559.8). Total num frames: 634662912. Throughput: 0: 4973.8. Samples: 634659890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:33,549][25689] Avg episode reward: [(0, '-8.835')] [2022-07-10 07:08:33,956][26022] Updated weights on worker 0-0, policy_version 619791 (0.00091) [2022-07-10 07:08:35,799][26022] Updated weights on worker 0-0, policy_version 619801 (0.00084) [2022-07-10 07:08:37,612][26022] Updated weights on worker 0-0, policy_version 619811 (0.00095) [2022-07-10 07:08:38,553][25689] Fps is (10 sec: 5609.9, 60 sec: 5571.3, 300 sec: 5567.8). Total num frames: 634692608. Throughput: 0: 5843.2. Samples: 634693766. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:38,553][25689] Avg episode reward: [(0, '-8.762')] [2022-07-10 07:08:39,567][26022] Updated weights on worker 0-0, policy_version 619821 (0.00088) [2022-07-10 07:08:41,191][26022] Updated weights on worker 0-0, policy_version 619831 (0.00089) [2022-07-10 07:08:43,109][26022] Updated weights on worker 0-0, policy_version 619841 (0.00086) [2022-07-10 07:08:43,572][25689] Fps is (10 sec: 5514.8, 60 sec: 5519.3, 300 sec: 5562.4). Total num frames: 634718208. Throughput: 0: 5868.1. Samples: 634727502. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:43,573][25689] Avg episode reward: [(0, '-8.375')] [2022-07-10 07:08:44,876][26022] Updated weights on worker 0-0, policy_version 619851 (0.00085) [2022-07-10 07:08:46,807][26022] Updated weights on worker 0-0, policy_version 619861 (0.00091) [2022-07-10 07:08:48,605][25689] Fps is (10 sec: 5398.8, 60 sec: 5546.8, 300 sec: 5562.1). Total num frames: 634746880. Throughput: 0: 5036.1. Samples: 634744298. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:48,606][25689] Avg episode reward: [(0, '-9.259')] [2022-07-10 07:08:48,699][26022] Updated weights on worker 0-0, policy_version 619871 (0.00082) [2022-07-10 07:08:50,316][26022] Updated weights on worker 0-0, policy_version 619881 (0.00093) [2022-07-10 07:08:52,317][26022] Updated weights on worker 0-0, policy_version 619891 (0.00083) [2022-07-10 07:08:53,616][25689] Fps is (10 sec: 5709.4, 60 sec: 5570.6, 300 sec: 5566.2). Total num frames: 634775552. Throughput: 0: 5845.9. Samples: 634777220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:53,617][25689] Avg episode reward: [(0, '-10.474')] [2022-07-10 07:08:54,252][26022] Updated weights on worker 0-0, policy_version 619901 (0.00167) [2022-07-10 07:08:56,000][26022] Updated weights on worker 0-0, policy_version 619911 (0.00093) [2022-07-10 07:08:57,718][26022] Updated weights on worker 0-0, policy_version 619921 (0.00090) [2022-07-10 07:08:58,658][25689] Fps is (10 sec: 5602.7, 60 sec: 5533.5, 300 sec: 5562.2). Total num frames: 634803200. Throughput: 0: 5835.4. Samples: 634811086. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:08:58,658][25689] Avg episode reward: [(0, '-9.871')] [2022-07-10 07:08:59,797][26022] Updated weights on worker 0-0, policy_version 619931 (0.00082) [2022-07-10 07:09:01,437][26022] Updated weights on worker 0-0, policy_version 619941 (0.00086) [2022-07-10 07:09:03,734][25689] Fps is (10 sec: 5262.9, 60 sec: 5528.5, 300 sec: 5559.0). Total num frames: 634828800. Throughput: 0: 4988.2. Samples: 634828070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:03,734][25689] Avg episode reward: [(0, '-10.031')] [2022-07-10 07:09:03,764][26022] Updated weights on worker 0-0, policy_version 619951 (0.00092) [2022-07-10 07:09:05,466][26022] Updated weights on worker 0-0, policy_version 619961 (0.00082) [2022-07-10 07:09:07,126][26022] Updated weights on worker 0-0, policy_version 619971 (0.00095) [2022-07-10 07:09:08,871][25689] Fps is (10 sec: 5414.2, 60 sec: 5543.8, 300 sec: 5564.2). Total num frames: 634858496. Throughput: 0: 5707.6. Samples: 634859964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:08,872][25689] Avg episode reward: [(0, '-10.314')] [2022-07-10 07:09:09,172][26022] Updated weights on worker 0-0, policy_version 619981 (0.00091) [2022-07-10 07:09:10,866][26022] Updated weights on worker 0-0, policy_version 619991 (0.00087) [2022-07-10 07:09:12,855][26022] Updated weights on worker 0-0, policy_version 620001 (0.00099) [2022-07-10 07:09:13,923][25689] Fps is (10 sec: 5828.9, 60 sec: 5573.0, 300 sec: 5570.1). Total num frames: 634888192. Throughput: 0: 5739.2. Samples: 634893764. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:13,924][25689] Avg episode reward: [(0, '-9.497')] [2022-07-10 07:09:14,598][26022] Updated weights on worker 0-0, policy_version 620011 (0.00083) [2022-07-10 07:09:16,215][26022] Updated weights on worker 0-0, policy_version 620021 (0.00086) [2022-07-10 07:09:18,265][26022] Updated weights on worker 0-0, policy_version 620031 (0.00091) [2022-07-10 07:09:18,967][25689] Fps is (10 sec: 5578.3, 60 sec: 5561.0, 300 sec: 5560.6). Total num frames: 634914816. Throughput: 0: 4902.3. Samples: 634910648. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:18,967][25689] Avg episode reward: [(0, '-10.094')] [2022-07-10 07:09:20,020][26022] Updated weights on worker 0-0, policy_version 620041 (0.00082) [2022-07-10 07:09:21,899][26022] Updated weights on worker 0-0, policy_version 620051 (0.00087) [2022-07-10 07:09:23,919][26022] Updated weights on worker 0-0, policy_version 620061 (0.00084) [2022-07-10 07:09:24,015][25689] Fps is (10 sec: 5377.8, 60 sec: 5548.3, 300 sec: 5557.6). Total num frames: 634942464. Throughput: 0: 5704.9. Samples: 634943772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:24,015][25689] Avg episode reward: [(0, '-10.177')] [2022-07-10 07:09:25,496][26022] Updated weights on worker 0-0, policy_version 620071 (0.00087) [2022-07-10 07:09:27,524][26022] Updated weights on worker 0-0, policy_version 620081 (0.00085) [2022-07-10 07:09:29,103][25689] Fps is (10 sec: 5657.2, 60 sec: 5543.3, 300 sec: 5563.1). Total num frames: 634972160. Throughput: 0: 5800.2. Samples: 634977318. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:29,104][25689] Avg episode reward: [(0, '-9.519')] [2022-07-10 07:09:29,213][26022] Updated weights on worker 0-0, policy_version 620091 (0.00085) [2022-07-10 07:09:31,187][26022] Updated weights on worker 0-0, policy_version 620101 (0.00110) [2022-07-10 07:09:32,942][26022] Updated weights on worker 0-0, policy_version 620111 (0.00088) [2022-07-10 07:09:34,194][25689] Fps is (10 sec: 5633.5, 60 sec: 5554.9, 300 sec: 5558.4). Total num frames: 634999808. Throughput: 0: 4945.6. Samples: 634994016. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:34,195][25689] Avg episode reward: [(0, '-11.918')] [2022-07-10 07:09:34,934][26022] Updated weights on worker 0-0, policy_version 620121 (0.00090) [2022-07-10 07:09:36,544][26022] Updated weights on worker 0-0, policy_version 620131 (0.00988) [2022-07-10 07:09:38,657][26022] Updated weights on worker 0-0, policy_version 620141 (0.00086) [2022-07-10 07:09:39,267][25689] Fps is (10 sec: 5541.3, 60 sec: 5532.0, 300 sec: 5561.0). Total num frames: 635028480. Throughput: 0: 5769.7. Samples: 635027774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:39,267][25689] Avg episode reward: [(0, '-12.561')] [2022-07-10 07:09:40,191][26022] Updated weights on worker 0-0, policy_version 620151 (0.00107) [2022-07-10 07:09:42,031][26022] Updated weights on worker 0-0, policy_version 620161 (0.00090) [2022-07-10 07:09:43,900][26022] Updated weights on worker 0-0, policy_version 620171 (0.00685) [2022-07-10 07:09:44,277][25689] Fps is (10 sec: 5686.9, 60 sec: 5583.5, 300 sec: 5565.2). Total num frames: 635057152. Throughput: 0: 5803.2. Samples: 635061360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:44,278][25689] Avg episode reward: [(0, '-12.235')] [2022-07-10 07:09:45,659][26022] Updated weights on worker 0-0, policy_version 620181 (0.00086) [2022-07-10 07:09:47,503][26022] Updated weights on worker 0-0, policy_version 620191 (0.00094) [2022-07-10 07:09:49,320][26022] Updated weights on worker 0-0, policy_version 620201 (0.01029) [2022-07-10 07:09:49,387][25689] Fps is (10 sec: 5666.5, 60 sec: 5576.4, 300 sec: 5567.6). Total num frames: 635085824. Throughput: 0: 5797.4. Samples: 635094908. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:49,387][25689] Avg episode reward: [(0, '-12.095')] [2022-07-10 07:09:51,196][26022] Updated weights on worker 0-0, policy_version 620211 (0.00093) [2022-07-10 07:09:52,987][26022] Updated weights on worker 0-0, policy_version 620221 (0.00089) [2022-07-10 07:09:54,471][25689] Fps is (10 sec: 5525.3, 60 sec: 5552.9, 300 sec: 5564.1). Total num frames: 635113472. Throughput: 0: 5807.2. Samples: 635111766. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:54,471][25689] Avg episode reward: [(0, '-11.896')] [2022-07-10 07:09:54,987][26022] Updated weights on worker 0-0, policy_version 620231 (0.00082) [2022-07-10 07:09:56,643][26022] Updated weights on worker 0-0, policy_version 620241 (0.00083) [2022-07-10 07:09:58,587][26022] Updated weights on worker 0-0, policy_version 620251 (0.00089) [2022-07-10 07:09:59,505][25689] Fps is (10 sec: 5464.9, 60 sec: 5553.6, 300 sec: 5561.7). Total num frames: 635141120. Throughput: 0: 5819.9. Samples: 635145558. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:09:59,506][25689] Avg episode reward: [(0, '-11.174')] [2022-07-10 07:10:00,275][26022] Updated weights on worker 0-0, policy_version 620261 (0.00081) [2022-07-10 07:10:02,658][26022] Updated weights on worker 0-0, policy_version 620271 (0.00083) [2022-07-10 07:10:04,345][26022] Updated weights on worker 0-0, policy_version 620281 (0.00088) [2022-07-10 07:10:04,512][25689] Fps is (10 sec: 5405.1, 60 sec: 5576.8, 300 sec: 5570.4). Total num frames: 635167744. Throughput: 0: 5726.8. Samples: 635177236. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:04,512][25689] Avg episode reward: [(0, '-9.981')] [2022-07-10 07:10:06,022][26022] Updated weights on worker 0-0, policy_version 620291 (0.00091) [2022-07-10 07:10:08,351][26022] Updated weights on worker 0-0, policy_version 620301 (0.00092) [2022-07-10 07:10:09,581][25689] Fps is (10 sec: 5589.8, 60 sec: 5583.0, 300 sec: 5569.6). Total num frames: 635197440. Throughput: 0: 4903.4. Samples: 635193926. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:09,581][25689] Avg episode reward: [(0, '-10.476')] [2022-07-10 07:10:09,771][26022] Updated weights on worker 0-0, policy_version 620311 (0.00085) [2022-07-10 07:10:11,083][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:10:11,094][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000620316_635203584.pth [2022-07-10 07:10:11,095][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000618360_633200640.pth [2022-07-10 07:10:11,852][26022] Updated weights on worker 0-0, policy_version 620321 (0.00086) [2022-07-10 07:10:13,542][26022] Updated weights on worker 0-0, policy_version 620331 (0.00089) [2022-07-10 07:10:14,599][25689] Fps is (10 sec: 5481.9, 60 sec: 5518.7, 300 sec: 5560.3). Total num frames: 635223040. Throughput: 0: 5733.3. Samples: 635227166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:14,599][25689] Avg episode reward: [(0, '-11.476')] [2022-07-10 07:10:15,495][26022] Updated weights on worker 0-0, policy_version 620341 (0.00086) [2022-07-10 07:10:17,307][26022] Updated weights on worker 0-0, policy_version 620351 (0.00090) [2022-07-10 07:10:19,252][26022] Updated weights on worker 0-0, policy_version 620361 (0.00092) [2022-07-10 07:10:19,610][25689] Fps is (10 sec: 5411.2, 60 sec: 5555.4, 300 sec: 5563.8). Total num frames: 635251712. Throughput: 0: 5726.1. Samples: 635260682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:19,611][25689] Avg episode reward: [(0, '-10.859')] [2022-07-10 07:10:20,971][26022] Updated weights on worker 0-0, policy_version 620371 (0.00098) [2022-07-10 07:10:22,835][26022] Updated weights on worker 0-0, policy_version 620381 (0.00084) [2022-07-10 07:10:24,618][25689] Fps is (10 sec: 5621.3, 60 sec: 5559.1, 300 sec: 5557.6). Total num frames: 635279360. Throughput: 0: 4978.0. Samples: 635277324. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:24,618][25689] Avg episode reward: [(0, '-10.719')] [2022-07-10 07:10:24,626][26022] Updated weights on worker 0-0, policy_version 620391 (0.00084) [2022-07-10 07:10:26,631][26022] Updated weights on worker 0-0, policy_version 620401 (0.00375) [2022-07-10 07:10:28,316][26022] Updated weights on worker 0-0, policy_version 620411 (0.00090) [2022-07-10 07:10:29,659][25689] Fps is (10 sec: 5502.8, 60 sec: 5529.6, 300 sec: 5558.6). Total num frames: 635307008. Throughput: 0: 5817.0. Samples: 635310720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:29,659][25689] Avg episode reward: [(0, '-11.706')] [2022-07-10 07:10:30,190][26022] Updated weights on worker 0-0, policy_version 620421 (0.00091) [2022-07-10 07:10:32,004][26022] Updated weights on worker 0-0, policy_version 620431 (0.00090) [2022-07-10 07:10:33,910][26022] Updated weights on worker 0-0, policy_version 620441 (0.00086) [2022-07-10 07:10:34,671][25689] Fps is (10 sec: 5602.3, 60 sec: 5553.7, 300 sec: 5556.2). Total num frames: 635335680. Throughput: 0: 5831.1. Samples: 635344208. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:34,671][25689] Avg episode reward: [(0, '-11.771')] [2022-07-10 07:10:35,661][26022] Updated weights on worker 0-0, policy_version 620451 (0.01508) [2022-07-10 07:10:37,625][26022] Updated weights on worker 0-0, policy_version 620461 (0.00086) [2022-07-10 07:10:39,362][26022] Updated weights on worker 0-0, policy_version 620471 (0.00089) [2022-07-10 07:10:39,680][25689] Fps is (10 sec: 5619.9, 60 sec: 5542.6, 300 sec: 5563.4). Total num frames: 635363328. Throughput: 0: 5001.6. Samples: 635361064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:39,681][25689] Avg episode reward: [(0, '-10.608')] [2022-07-10 07:10:41,286][26022] Updated weights on worker 0-0, policy_version 620481 (0.00087) [2022-07-10 07:10:43,218][26022] Updated weights on worker 0-0, policy_version 620491 (0.00092) [2022-07-10 07:10:44,683][25689] Fps is (10 sec: 5420.7, 60 sec: 5509.5, 300 sec: 5550.3). Total num frames: 635389952. Throughput: 0: 5835.9. Samples: 635394422. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:44,683][25689] Avg episode reward: [(0, '-10.255')] [2022-07-10 07:10:44,854][26022] Updated weights on worker 0-0, policy_version 620501 (0.00088) [2022-07-10 07:10:46,751][26022] Updated weights on worker 0-0, policy_version 620511 (0.00093) [2022-07-10 07:10:48,531][26022] Updated weights on worker 0-0, policy_version 620521 (0.00089) [2022-07-10 07:10:49,758][25689] Fps is (10 sec: 5588.3, 60 sec: 5529.5, 300 sec: 5552.7). Total num frames: 635419648. Throughput: 0: 5827.9. Samples: 635427858. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:49,759][25689] Avg episode reward: [(0, '-8.894')] [2022-07-10 07:10:50,447][26022] Updated weights on worker 0-0, policy_version 620531 (0.00090) [2022-07-10 07:10:52,301][26022] Updated weights on worker 0-0, policy_version 620541 (0.00094) [2022-07-10 07:10:54,066][26022] Updated weights on worker 0-0, policy_version 620551 (0.00086) [2022-07-10 07:10:54,797][25689] Fps is (10 sec: 5669.8, 60 sec: 5533.7, 300 sec: 5552.1). Total num frames: 635447296. Throughput: 0: 4988.5. Samples: 635444606. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:54,797][25689] Avg episode reward: [(0, '-11.164')] [2022-07-10 07:10:55,979][26022] Updated weights on worker 0-0, policy_version 620561 (0.00086) [2022-07-10 07:10:57,723][26022] Updated weights on worker 0-0, policy_version 620571 (0.00904) [2022-07-10 07:10:59,800][26022] Updated weights on worker 0-0, policy_version 620581 (0.00092) [2022-07-10 07:10:59,821][25689] Fps is (10 sec: 5495.0, 60 sec: 5534.6, 300 sec: 5562.2). Total num frames: 635474944. Throughput: 0: 5814.5. Samples: 635478174. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:10:59,822][25689] Avg episode reward: [(0, '-9.784')] [2022-07-10 07:11:01,418][26022] Updated weights on worker 0-0, policy_version 620591 (0.00086) [2022-07-10 07:11:03,820][26022] Updated weights on worker 0-0, policy_version 620601 (0.00090) [2022-07-10 07:11:04,861][25689] Fps is (10 sec: 5392.3, 60 sec: 5531.5, 300 sec: 5553.2). Total num frames: 635501568. Throughput: 0: 5703.8. Samples: 635509518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 07:11:04,862][25689] Avg episode reward: [(0, '-9.034')] [2022-07-10 07:11:05,425][26022] Updated weights on worker 0-0, policy_version 620611 (0.00104) [2022-07-10 07:11:07,627][26022] Updated weights on worker 0-0, policy_version 620621 (0.00086) [2022-07-10 07:11:09,046][26022] Updated weights on worker 0-0, policy_version 620631 (0.00086) [2022-07-10 07:11:09,975][25689] Fps is (10 sec: 5446.1, 60 sec: 5510.5, 300 sec: 5554.6). Total num frames: 635530240. Throughput: 0: 4867.0. Samples: 635526250. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:09,975][25689] Avg episode reward: [(0, '-9.876')] [2022-07-10 07:11:10,999][26022] Updated weights on worker 0-0, policy_version 620641 (0.00087) [2022-07-10 07:11:12,805][26022] Updated weights on worker 0-0, policy_version 620651 (0.00088) [2022-07-10 07:11:14,610][26022] Updated weights on worker 0-0, policy_version 620661 (0.00086) [2022-07-10 07:11:15,013][25689] Fps is (10 sec: 5749.7, 60 sec: 5576.4, 300 sec: 5557.7). Total num frames: 635559936. Throughput: 0: 5705.7. Samples: 635559954. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:15,013][25689] Avg episode reward: [(0, '-9.352')] [2022-07-10 07:11:16,554][26022] Updated weights on worker 0-0, policy_version 620671 (0.00084) [2022-07-10 07:11:18,267][26022] Updated weights on worker 0-0, policy_version 620681 (0.00088) [2022-07-10 07:11:20,110][25689] Fps is (10 sec: 5455.7, 60 sec: 5517.8, 300 sec: 5546.4). Total num frames: 635585536. Throughput: 0: 5683.6. Samples: 635593486. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:20,111][25689] Avg episode reward: [(0, '-8.864')] [2022-07-10 07:11:20,296][26022] Updated weights on worker 0-0, policy_version 620691 (0.00086) [2022-07-10 07:11:22,035][26022] Updated weights on worker 0-0, policy_version 620701 (0.00092) [2022-07-10 07:11:23,843][26022] Updated weights on worker 0-0, policy_version 620711 (0.00090) [2022-07-10 07:11:25,133][25689] Fps is (10 sec: 5464.0, 60 sec: 5550.2, 300 sec: 5553.5). Total num frames: 635615232. Throughput: 0: 4976.2. Samples: 635610394. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:25,134][25689] Avg episode reward: [(0, '-8.163')] [2022-07-10 07:11:25,826][26022] Updated weights on worker 0-0, policy_version 620721 (0.00088) [2022-07-10 07:11:27,522][26022] Updated weights on worker 0-0, policy_version 620731 (0.00087) [2022-07-10 07:11:29,450][26022] Updated weights on worker 0-0, policy_version 620741 (0.00090) [2022-07-10 07:11:30,220][25689] Fps is (10 sec: 5672.1, 60 sec: 5546.0, 300 sec: 5552.0). Total num frames: 635642880. Throughput: 0: 5786.0. Samples: 635643388. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:30,221][25689] Avg episode reward: [(0, '-7.829')] [2022-07-10 07:11:31,175][26022] Updated weights on worker 0-0, policy_version 620751 (0.00085) [2022-07-10 07:11:33,059][26022] Updated weights on worker 0-0, policy_version 620761 (0.00084) [2022-07-10 07:11:34,905][26022] Updated weights on worker 0-0, policy_version 620771 (0.00085) [2022-07-10 07:11:35,311][25689] Fps is (10 sec: 5533.4, 60 sec: 5538.7, 300 sec: 5547.1). Total num frames: 635671552. Throughput: 0: 5758.4. Samples: 635676840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:35,313][25689] Avg episode reward: [(0, '-7.338')] [2022-07-10 07:11:36,819][26022] Updated weights on worker 0-0, policy_version 620781 (0.00086) [2022-07-10 07:11:38,535][26022] Updated weights on worker 0-0, policy_version 620791 (0.00085) [2022-07-10 07:11:40,375][25689] Fps is (10 sec: 5546.0, 60 sec: 5533.8, 300 sec: 5549.6). Total num frames: 635699200. Throughput: 0: 4935.9. Samples: 635693512. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:40,376][25689] Avg episode reward: [(0, '-6.550')] [2022-07-10 07:11:40,399][26022] Updated weights on worker 0-0, policy_version 620801 (0.00084) [2022-07-10 07:11:42,239][26022] Updated weights on worker 0-0, policy_version 620811 (0.00091) [2022-07-10 07:11:44,339][26022] Updated weights on worker 0-0, policy_version 620821 (0.00091) [2022-07-10 07:11:45,466][25689] Fps is (10 sec: 5647.0, 60 sec: 5576.3, 300 sec: 5549.1). Total num frames: 635728896. Throughput: 0: 5729.6. Samples: 635726894. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:45,468][25689] Avg episode reward: [(0, '-6.244')] [2022-07-10 07:11:45,935][26022] Updated weights on worker 0-0, policy_version 620831 (0.00616) [2022-07-10 07:11:47,945][26022] Updated weights on worker 0-0, policy_version 620841 (0.00089) [2022-07-10 07:11:49,502][26022] Updated weights on worker 0-0, policy_version 620851 (0.00082) [2022-07-10 07:11:50,520][25689] Fps is (10 sec: 5450.6, 60 sec: 5510.9, 300 sec: 5545.6). Total num frames: 635754496. Throughput: 0: 5752.7. Samples: 635760168. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:50,520][25689] Avg episode reward: [(0, '-6.646')] [2022-07-10 07:11:51,840][26022] Updated weights on worker 0-0, policy_version 620861 (0.00092) [2022-07-10 07:11:53,316][26022] Updated weights on worker 0-0, policy_version 620871 (0.00095) [2022-07-10 07:11:55,413][26022] Updated weights on worker 0-0, policy_version 620881 (0.00085) [2022-07-10 07:11:55,538][25689] Fps is (10 sec: 5388.2, 60 sec: 5529.5, 300 sec: 5542.4). Total num frames: 635783168. Throughput: 0: 4951.9. Samples: 635777004. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:11:55,539][25689] Avg episode reward: [(0, '-6.531')] [2022-07-10 07:11:56,944][26022] Updated weights on worker 0-0, policy_version 620891 (0.00086) [2022-07-10 07:11:58,847][26022] Updated weights on worker 0-0, policy_version 620901 (0.00085) [2022-07-10 07:12:00,579][25689] Fps is (10 sec: 5700.8, 60 sec: 5544.9, 300 sec: 5552.3). Total num frames: 635811840. Throughput: 0: 5790.2. Samples: 635810496. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:00,579][25689] Avg episode reward: [(0, '-8.688')] [2022-07-10 07:12:00,799][26022] Updated weights on worker 0-0, policy_version 620911 (0.00090) [2022-07-10 07:12:02,731][26022] Updated weights on worker 0-0, policy_version 620921 (0.00085) [2022-07-10 07:12:04,784][26022] Updated weights on worker 0-0, policy_version 620931 (0.00086) [2022-07-10 07:12:05,596][25689] Fps is (10 sec: 5294.1, 60 sec: 5513.3, 300 sec: 5543.4). Total num frames: 635836416. Throughput: 0: 5727.8. Samples: 635842198. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:05,598][25689] Avg episode reward: [(0, '-9.672')] [2022-07-10 07:12:06,551][26022] Updated weights on worker 0-0, policy_version 620941 (0.00086) [2022-07-10 07:12:08,449][26022] Updated weights on worker 0-0, policy_version 620951 (0.00084) [2022-07-10 07:12:10,305][26022] Updated weights on worker 0-0, policy_version 620961 (0.00091) [2022-07-10 07:12:10,657][25689] Fps is (10 sec: 5385.1, 60 sec: 5534.9, 300 sec: 5542.7). Total num frames: 635866112. Throughput: 0: 4899.8. Samples: 635858836. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:10,657][25689] Avg episode reward: [(0, '-10.046')] [2022-07-10 07:12:11,224][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:12:11,240][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000620966_635869184.pth [2022-07-10 07:12:11,240][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000619013_633869312.pth [2022-07-10 07:12:12,156][26022] Updated weights on worker 0-0, policy_version 620971 (0.00087) [2022-07-10 07:12:14,105][26022] Updated weights on worker 0-0, policy_version 620981 (0.00093) [2022-07-10 07:12:15,730][25689] Fps is (10 sec: 5658.8, 60 sec: 5498.0, 300 sec: 5545.4). Total num frames: 635893760. Throughput: 0: 5697.9. Samples: 635892054. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:15,731][25689] Avg episode reward: [(0, '-10.016')] [2022-07-10 07:12:16,034][26022] Updated weights on worker 0-0, policy_version 620991 (0.00089) [2022-07-10 07:12:17,701][26022] Updated weights on worker 0-0, policy_version 621001 (0.00091) [2022-07-10 07:12:19,528][26022] Updated weights on worker 0-0, policy_version 621011 (0.00088) [2022-07-10 07:12:20,780][25689] Fps is (10 sec: 5563.5, 60 sec: 5552.9, 300 sec: 5544.9). Total num frames: 635922432. Throughput: 0: 5685.9. Samples: 635925358. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:20,781][25689] Avg episode reward: [(0, '-10.066')] [2022-07-10 07:12:21,278][26022] Updated weights on worker 0-0, policy_version 621021 (0.00088) [2022-07-10 07:12:23,239][26022] Updated weights on worker 0-0, policy_version 621031 (0.00086) [2022-07-10 07:12:25,038][26022] Updated weights on worker 0-0, policy_version 621041 (0.00096) [2022-07-10 07:12:25,797][25689] Fps is (10 sec: 5594.4, 60 sec: 5519.7, 300 sec: 5543.4). Total num frames: 635950080. Throughput: 0: 5765.6. Samples: 635958668. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:25,798][25689] Avg episode reward: [(0, '-10.803')] [2022-07-10 07:12:27,031][26022] Updated weights on worker 0-0, policy_version 621051 (0.00087) [2022-07-10 07:12:28,566][26022] Updated weights on worker 0-0, policy_version 621061 (0.00092) [2022-07-10 07:12:30,714][26022] Updated weights on worker 0-0, policy_version 621071 (0.00092) [2022-07-10 07:12:30,848][25689] Fps is (10 sec: 5492.4, 60 sec: 5523.0, 300 sec: 5543.2). Total num frames: 635977728. Throughput: 0: 5777.9. Samples: 635975496. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:30,848][25689] Avg episode reward: [(0, '-9.621')] [2022-07-10 07:12:32,396][26022] Updated weights on worker 0-0, policy_version 621081 (0.00099) [2022-07-10 07:12:34,223][26022] Updated weights on worker 0-0, policy_version 621091 (0.00085) [2022-07-10 07:12:35,920][25689] Fps is (10 sec: 5563.5, 60 sec: 5524.7, 300 sec: 5542.3). Total num frames: 636006400. Throughput: 0: 5804.6. Samples: 636009250. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:35,921][25689] Avg episode reward: [(0, '-9.776')] [2022-07-10 07:12:36,037][26022] Updated weights on worker 0-0, policy_version 621101 (0.00244) [2022-07-10 07:12:37,857][26022] Updated weights on worker 0-0, policy_version 621111 (0.00095) [2022-07-10 07:12:39,746][26022] Updated weights on worker 0-0, policy_version 621121 (0.00091) [2022-07-10 07:12:40,961][25689] Fps is (10 sec: 5568.9, 60 sec: 5526.8, 300 sec: 5538.2). Total num frames: 636034048. Throughput: 0: 5820.4. Samples: 636042820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:40,962][25689] Avg episode reward: [(0, '-11.421')] [2022-07-10 07:12:41,628][26022] Updated weights on worker 0-0, policy_version 621131 (0.00090) [2022-07-10 07:12:43,323][26022] Updated weights on worker 0-0, policy_version 621141 (0.00084) [2022-07-10 07:12:45,233][26022] Updated weights on worker 0-0, policy_version 621151 (0.00084) [2022-07-10 07:12:45,976][25689] Fps is (10 sec: 5499.0, 60 sec: 5499.9, 300 sec: 5540.7). Total num frames: 636061696. Throughput: 0: 5006.2. Samples: 636059688. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:45,976][25689] Avg episode reward: [(0, '-10.104')] [2022-07-10 07:12:46,923][26022] Updated weights on worker 0-0, policy_version 621161 (0.00094) [2022-07-10 07:12:48,961][26022] Updated weights on worker 0-0, policy_version 621171 (0.00085) [2022-07-10 07:12:50,716][26022] Updated weights on worker 0-0, policy_version 621181 (0.00090) [2022-07-10 07:12:51,063][25689] Fps is (10 sec: 5676.4, 60 sec: 5564.5, 300 sec: 5547.5). Total num frames: 636091392. Throughput: 0: 5813.6. Samples: 636093020. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:51,064][25689] Avg episode reward: [(0, '-10.648')] [2022-07-10 07:12:52,822][26022] Updated weights on worker 0-0, policy_version 621191 (0.00082) [2022-07-10 07:12:54,315][26022] Updated weights on worker 0-0, policy_version 621201 (0.00057) [2022-07-10 07:12:56,129][25689] Fps is (10 sec: 5547.1, 60 sec: 5526.4, 300 sec: 5536.1). Total num frames: 636118016. Throughput: 0: 5796.7. Samples: 636126394. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:12:56,129][25689] Avg episode reward: [(0, '-11.185')] [2022-07-10 07:12:56,512][26022] Updated weights on worker 0-0, policy_version 621211 (0.00085) [2022-07-10 07:12:58,045][26022] Updated weights on worker 0-0, policy_version 621221 (0.00083) [2022-07-10 07:13:00,066][26022] Updated weights on worker 0-0, policy_version 621231 (0.00091) [2022-07-10 07:13:01,146][25689] Fps is (10 sec: 5585.9, 60 sec: 5545.5, 300 sec: 5549.9). Total num frames: 636147712. Throughput: 0: 4980.1. Samples: 636143342. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:01,146][25689] Avg episode reward: [(0, '-11.410')] [2022-07-10 07:13:01,939][26022] Updated weights on worker 0-0, policy_version 621241 (0.00080) [2022-07-10 07:13:04,082][26022] Updated weights on worker 0-0, policy_version 621251 (0.00088) [2022-07-10 07:13:05,902][26022] Updated weights on worker 0-0, policy_version 621261 (0.00092) [2022-07-10 07:13:06,230][25689] Fps is (10 sec: 5474.5, 60 sec: 5556.3, 300 sec: 5540.3). Total num frames: 636173312. Throughput: 0: 5694.5. Samples: 636175022. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:06,232][25689] Avg episode reward: [(0, '-10.750')] [2022-07-10 07:13:07,742][26022] Updated weights on worker 0-0, policy_version 621271 (0.00092) [2022-07-10 07:13:09,324][26022] Updated weights on worker 0-0, policy_version 621281 (0.00084) [2022-07-10 07:13:11,316][25689] Fps is (10 sec: 5235.3, 60 sec: 5520.1, 300 sec: 5538.7). Total num frames: 636200960. Throughput: 0: 5718.3. Samples: 636208834. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:11,318][25689] Avg episode reward: [(0, '-9.844')] [2022-07-10 07:13:11,437][26022] Updated weights on worker 0-0, policy_version 621291 (0.00054) [2022-07-10 07:13:12,939][26022] Updated weights on worker 0-0, policy_version 621301 (0.00089) [2022-07-10 07:13:15,059][26022] Updated weights on worker 0-0, policy_version 621311 (0.00085) [2022-07-10 07:13:16,325][25689] Fps is (10 sec: 5781.9, 60 sec: 5576.7, 300 sec: 5550.6). Total num frames: 636231680. Throughput: 0: 4915.1. Samples: 636225658. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:16,325][25689] Avg episode reward: [(0, '-9.273')] [2022-07-10 07:13:16,558][26022] Updated weights on worker 0-0, policy_version 621321 (0.00081) [2022-07-10 07:13:18,693][26022] Updated weights on worker 0-0, policy_version 621331 (0.00093) [2022-07-10 07:13:20,621][26022] Updated weights on worker 0-0, policy_version 621341 (0.00085) [2022-07-10 07:13:21,420][25689] Fps is (10 sec: 5675.8, 60 sec: 5538.8, 300 sec: 5543.8). Total num frames: 636258304. Throughput: 0: 5712.0. Samples: 636259148. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:21,420][25689] Avg episode reward: [(0, '-11.175')] [2022-07-10 07:13:22,207][26022] Updated weights on worker 0-0, policy_version 621351 (0.00095) [2022-07-10 07:13:24,104][26022] Updated weights on worker 0-0, policy_version 621361 (0.00083) [2022-07-10 07:13:25,809][26022] Updated weights on worker 0-0, policy_version 621371 (0.00096) [2022-07-10 07:13:26,443][25689] Fps is (10 sec: 5363.8, 60 sec: 5538.3, 300 sec: 5537.1). Total num frames: 636285952. Throughput: 0: 5822.5. Samples: 636292714. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:26,444][25689] Avg episode reward: [(0, '-10.460')] [2022-07-10 07:13:27,671][26022] Updated weights on worker 0-0, policy_version 621381 (0.00087) [2022-07-10 07:13:29,572][26022] Updated weights on worker 0-0, policy_version 621391 (0.00089) [2022-07-10 07:13:31,331][26022] Updated weights on worker 0-0, policy_version 621401 (0.00084) [2022-07-10 07:13:31,521][25689] Fps is (10 sec: 5677.0, 60 sec: 5569.5, 300 sec: 5546.6). Total num frames: 636315648. Throughput: 0: 4984.6. Samples: 636309544. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:31,521][25689] Avg episode reward: [(0, '-9.216')] [2022-07-10 07:13:33,253][26022] Updated weights on worker 0-0, policy_version 621411 (0.00088) [2022-07-10 07:13:34,995][26022] Updated weights on worker 0-0, policy_version 621421 (0.00088) [2022-07-10 07:13:36,537][25689] Fps is (10 sec: 5579.4, 60 sec: 5540.9, 300 sec: 5536.1). Total num frames: 636342272. Throughput: 0: 5811.1. Samples: 636343114. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:36,538][25689] Avg episode reward: [(0, '-9.605')] [2022-07-10 07:13:37,018][26022] Updated weights on worker 0-0, policy_version 621431 (0.00093) [2022-07-10 07:13:38,692][26022] Updated weights on worker 0-0, policy_version 621441 (0.00088) [2022-07-10 07:13:40,664][26022] Updated weights on worker 0-0, policy_version 621451 (0.00096) [2022-07-10 07:13:41,550][25689] Fps is (10 sec: 5513.3, 60 sec: 5560.3, 300 sec: 5546.5). Total num frames: 636370944. Throughput: 0: 5830.7. Samples: 636376524. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:41,551][25689] Avg episode reward: [(0, '-9.680')] [2022-07-10 07:13:42,459][26022] Updated weights on worker 0-0, policy_version 621461 (0.00079) [2022-07-10 07:13:44,397][26022] Updated weights on worker 0-0, policy_version 621471 (0.00084) [2022-07-10 07:13:46,041][26022] Updated weights on worker 0-0, policy_version 621481 (0.00084) [2022-07-10 07:13:46,562][25689] Fps is (10 sec: 5617.7, 60 sec: 5560.6, 300 sec: 5543.5). Total num frames: 636398592. Throughput: 0: 5005.0. Samples: 636393412. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:46,563][25689] Avg episode reward: [(0, '-10.637')] [2022-07-10 07:13:47,888][26022] Updated weights on worker 0-0, policy_version 621491 (0.00092) [2022-07-10 07:13:49,719][26022] Updated weights on worker 0-0, policy_version 621501 (0.00087) [2022-07-10 07:13:51,594][26022] Updated weights on worker 0-0, policy_version 621511 (0.00082) [2022-07-10 07:13:51,695][25689] Fps is (10 sec: 5551.8, 60 sec: 5539.5, 300 sec: 5541.2). Total num frames: 636427264. Throughput: 0: 5847.8. Samples: 636427518. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:51,695][25689] Avg episode reward: [(0, '-9.676')] [2022-07-10 07:13:53,579][26022] Updated weights on worker 0-0, policy_version 621521 (0.00083) [2022-07-10 07:13:55,354][26022] Updated weights on worker 0-0, policy_version 621531 (0.00091) [2022-07-10 07:13:56,744][25689] Fps is (10 sec: 5531.5, 60 sec: 5557.9, 300 sec: 5541.0). Total num frames: 636454912. Throughput: 0: 5819.0. Samples: 636460700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:13:56,745][25689] Avg episode reward: [(0, '-10.067')] [2022-07-10 07:13:57,183][26022] Updated weights on worker 0-0, policy_version 621541 (0.00095) [2022-07-10 07:13:58,976][26022] Updated weights on worker 0-0, policy_version 621551 (0.00091) [2022-07-10 07:14:00,691][26022] Updated weights on worker 0-0, policy_version 621561 (0.00093) [2022-07-10 07:14:01,847][25689] Fps is (10 sec: 5446.8, 60 sec: 5516.3, 300 sec: 5547.4). Total num frames: 636482560. Throughput: 0: 4982.7. Samples: 636477654. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:14:01,847][25689] Avg episode reward: [(0, '-12.597')] [2022-07-10 07:14:03,254][26022] Updated weights on worker 0-0, policy_version 621571 (0.00088) [2022-07-10 07:14:04,863][26022] Updated weights on worker 0-0, policy_version 621581 (0.00099) [2022-07-10 07:14:06,776][26022] Updated weights on worker 0-0, policy_version 621591 (0.00085) [2022-07-10 07:14:06,868][25689] Fps is (10 sec: 5360.8, 60 sec: 5538.9, 300 sec: 5539.3). Total num frames: 636509184. Throughput: 0: 5678.5. Samples: 636508718. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:14:06,869][25689] Avg episode reward: [(0, '-11.990')] [2022-07-10 07:14:08,596][26022] Updated weights on worker 0-0, policy_version 621601 (0.00089) [2022-07-10 07:14:10,418][26022] Updated weights on worker 0-0, policy_version 621611 (0.00081) [2022-07-10 07:14:11,403][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:14:11,425][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000621614_636532736.pth [2022-07-10 07:14:11,425][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000619665_634536960.pth [2022-07-10 07:14:11,908][25689] Fps is (10 sec: 5394.2, 60 sec: 5543.2, 300 sec: 5532.6). Total num frames: 636536832. Throughput: 0: 5629.4. Samples: 636541306. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:14:11,908][25689] Avg episode reward: [(0, '-10.955')] [2022-07-10 07:14:12,228][26022] Updated weights on worker 0-0, policy_version 621621 (0.00084) [2022-07-10 07:14:14,322][26022] Updated weights on worker 0-0, policy_version 621631 (0.00080) [2022-07-10 07:14:15,904][26022] Updated weights on worker 0-0, policy_version 621641 (0.00086) [2022-07-10 07:14:16,920][25689] Fps is (10 sec: 5603.0, 60 sec: 5509.1, 300 sec: 5540.1). Total num frames: 636565504. Throughput: 0: 4824.3. Samples: 636558034. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2022-07-10 07:14:16,920][25689] Avg episode reward: [(0, '-10.753')] [2022-07-10 07:14:18,043][26022] Updated weights on worker 0-0, policy_version 621651 (0.00081) [2022-07-10 07:14:19,474][26022] Updated weights on worker 0-0, policy_version 621661 (0.00086) [2022-07-10 07:14:21,774][26022] Updated weights on worker 0-0, policy_version 621671 (0.00090) [2022-07-10 07:14:21,967][25689] Fps is (10 sec: 5497.2, 60 sec: 5513.5, 300 sec: 5536.7). Total num frames: 636592128. Throughput: 0: 5662.7. Samples: 636591588. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:21,968][25689] Avg episode reward: [(0, '-10.700')] [2022-07-10 07:14:23,321][26022] Updated weights on worker 0-0, policy_version 621681 (0.00092) [2022-07-10 07:14:25,160][26022] Updated weights on worker 0-0, policy_version 621691 (0.00079) [2022-07-10 07:14:26,975][25689] Fps is (10 sec: 5397.3, 60 sec: 5514.8, 300 sec: 5531.3). Total num frames: 636619776. Throughput: 0: 5771.4. Samples: 636624764. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:26,977][25689] Avg episode reward: [(0, '-11.906')] [2022-07-10 07:14:27,351][26022] Updated weights on worker 0-0, policy_version 621701 (0.00088) [2022-07-10 07:14:28,833][26022] Updated weights on worker 0-0, policy_version 621711 (0.00617) [2022-07-10 07:14:30,973][26022] Updated weights on worker 0-0, policy_version 621721 (0.00094) [2022-07-10 07:14:32,019][25689] Fps is (10 sec: 5602.7, 60 sec: 5501.0, 300 sec: 5535.6). Total num frames: 636648448. Throughput: 0: 5801.2. Samples: 636657976. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:32,020][25689] Avg episode reward: [(0, '-10.806')] [2022-07-10 07:14:32,640][26022] Updated weights on worker 0-0, policy_version 621731 (0.00090) [2022-07-10 07:14:34,664][26022] Updated weights on worker 0-0, policy_version 621741 (0.00110) [2022-07-10 07:14:36,191][26022] Updated weights on worker 0-0, policy_version 621751 (0.00090) [2022-07-10 07:14:37,055][25689] Fps is (10 sec: 5688.8, 60 sec: 5533.0, 300 sec: 5536.3). Total num frames: 636677120. Throughput: 0: 5791.5. Samples: 636674650. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:37,056][25689] Avg episode reward: [(0, '-11.845')] [2022-07-10 07:14:38,406][26022] Updated weights on worker 0-0, policy_version 621761 (0.00100) [2022-07-10 07:14:39,848][26022] Updated weights on worker 0-0, policy_version 621771 (0.00089) [2022-07-10 07:14:41,867][26022] Updated weights on worker 0-0, policy_version 621781 (0.00096) [2022-07-10 07:14:42,063][25689] Fps is (10 sec: 5505.6, 60 sec: 5499.7, 300 sec: 5529.5). Total num frames: 636703744. Throughput: 0: 5801.3. Samples: 636708170. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:42,063][25689] Avg episode reward: [(0, '-12.158')] [2022-07-10 07:14:43,707][26022] Updated weights on worker 0-0, policy_version 621791 (0.00085) [2022-07-10 07:14:45,435][26022] Updated weights on worker 0-0, policy_version 621801 (0.00089) [2022-07-10 07:14:47,071][25689] Fps is (10 sec: 5418.7, 60 sec: 5500.0, 300 sec: 5527.9). Total num frames: 636731392. Throughput: 0: 5836.4. Samples: 636742052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:47,072][25689] Avg episode reward: [(0, '-10.844')] [2022-07-10 07:14:47,442][26022] Updated weights on worker 0-0, policy_version 621811 (0.00086) [2022-07-10 07:14:49,020][26022] Updated weights on worker 0-0, policy_version 621821 (0.00087) [2022-07-10 07:14:51,102][26022] Updated weights on worker 0-0, policy_version 621831 (0.00079) [2022-07-10 07:14:52,215][25689] Fps is (10 sec: 5749.3, 60 sec: 5532.9, 300 sec: 5537.1). Total num frames: 636762112. Throughput: 0: 4993.7. Samples: 636758828. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:52,215][25689] Avg episode reward: [(0, '-10.871')] [2022-07-10 07:14:52,906][26022] Updated weights on worker 0-0, policy_version 621841 (0.00087) [2022-07-10 07:14:54,629][26022] Updated weights on worker 0-0, policy_version 621851 (0.00087) [2022-07-10 07:14:56,447][26022] Updated weights on worker 0-0, policy_version 621861 (0.00102) [2022-07-10 07:14:57,250][25689] Fps is (10 sec: 5633.3, 60 sec: 5517.2, 300 sec: 5533.6). Total num frames: 636788736. Throughput: 0: 5835.7. Samples: 636792502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:14:57,251][25689] Avg episode reward: [(0, '-9.610')] [2022-07-10 07:14:58,210][26022] Updated weights on worker 0-0, policy_version 621871 (0.00094) [2022-07-10 07:15:00,312][26022] Updated weights on worker 0-0, policy_version 621881 (0.00096) [2022-07-10 07:15:02,280][25689] Fps is (10 sec: 5290.1, 60 sec: 5506.9, 300 sec: 5533.2). Total num frames: 636815360. Throughput: 0: 5719.0. Samples: 636823794. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:02,282][25689] Avg episode reward: [(0, '-9.530')] [2022-07-10 07:15:02,510][26022] Updated weights on worker 0-0, policy_version 621891 (0.00089) [2022-07-10 07:15:04,255][26022] Updated weights on worker 0-0, policy_version 621901 (0.00086) [2022-07-10 07:15:06,095][26022] Updated weights on worker 0-0, policy_version 621911 (0.00080) [2022-07-10 07:15:07,297][25689] Fps is (10 sec: 5503.9, 60 sec: 5541.2, 300 sec: 5530.7). Total num frames: 636844032. Throughput: 0: 4876.0. Samples: 636840674. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:07,298][25689] Avg episode reward: [(0, '-9.230')] [2022-07-10 07:15:07,691][26022] Updated weights on worker 0-0, policy_version 621921 (0.00087) [2022-07-10 07:15:09,901][26022] Updated weights on worker 0-0, policy_version 621931 (0.00089) [2022-07-10 07:15:11,489][26022] Updated weights on worker 0-0, policy_version 621941 (0.00092) [2022-07-10 07:15:12,392][25689] Fps is (10 sec: 5468.3, 60 sec: 5519.2, 300 sec: 5532.7). Total num frames: 636870656. Throughput: 0: 5713.2. Samples: 636874108. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:12,394][25689] Avg episode reward: [(0, '-8.564')] [2022-07-10 07:15:13,413][26022] Updated weights on worker 0-0, policy_version 621951 (0.00112) [2022-07-10 07:15:15,286][26022] Updated weights on worker 0-0, policy_version 621961 (0.00087) [2022-07-10 07:15:17,086][26022] Updated weights on worker 0-0, policy_version 621971 (0.00083) [2022-07-10 07:15:17,444][25689] Fps is (10 sec: 5550.4, 60 sec: 5532.5, 300 sec: 5535.4). Total num frames: 636900352. Throughput: 0: 5718.6. Samples: 636907982. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:17,445][25689] Avg episode reward: [(0, '-8.480')] [2022-07-10 07:15:18,870][26022] Updated weights on worker 0-0, policy_version 621981 (0.00085) [2022-07-10 07:15:20,773][26022] Updated weights on worker 0-0, policy_version 621991 (0.00088) [2022-07-10 07:15:22,467][25689] Fps is (10 sec: 5692.2, 60 sec: 5551.6, 300 sec: 5535.2). Total num frames: 636928000. Throughput: 0: 4999.2. Samples: 636924710. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:22,467][25689] Avg episode reward: [(0, '-9.603')] [2022-07-10 07:15:22,611][26022] Updated weights on worker 0-0, policy_version 622001 (0.00087) [2022-07-10 07:15:24,292][26022] Updated weights on worker 0-0, policy_version 622011 (0.00092) [2022-07-10 07:15:26,193][26022] Updated weights on worker 0-0, policy_version 622021 (0.00090) [2022-07-10 07:15:27,543][25689] Fps is (10 sec: 5577.0, 60 sec: 5562.3, 300 sec: 5537.9). Total num frames: 636956672. Throughput: 0: 5805.3. Samples: 636958206. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:27,544][25689] Avg episode reward: [(0, '-9.261')] [2022-07-10 07:15:28,142][26022] Updated weights on worker 0-0, policy_version 622031 (0.00086) [2022-07-10 07:15:29,945][26022] Updated weights on worker 0-0, policy_version 622041 (0.00114) [2022-07-10 07:15:31,779][26022] Updated weights on worker 0-0, policy_version 622051 (0.00248) [2022-07-10 07:15:32,656][25689] Fps is (10 sec: 5527.4, 60 sec: 5539.1, 300 sec: 5532.6). Total num frames: 636984320. Throughput: 0: 5783.7. Samples: 636991306. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:32,658][25689] Avg episode reward: [(0, '-8.557')] [2022-07-10 07:15:33,421][26022] Updated weights on worker 0-0, policy_version 622061 (0.00085) [2022-07-10 07:15:35,367][26022] Updated weights on worker 0-0, policy_version 622071 (0.00059) [2022-07-10 07:15:37,424][26022] Updated weights on worker 0-0, policy_version 622081 (0.00091) [2022-07-10 07:15:37,687][25689] Fps is (10 sec: 5350.2, 60 sec: 5505.8, 300 sec: 5528.8). Total num frames: 637010944. Throughput: 0: 4938.1. Samples: 637007942. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:37,687][25689] Avg episode reward: [(0, '-7.769')] [2022-07-10 07:15:39,095][26022] Updated weights on worker 0-0, policy_version 622091 (0.00082) [2022-07-10 07:15:41,039][26022] Updated weights on worker 0-0, policy_version 622101 (0.00098) [2022-07-10 07:15:42,689][25689] Fps is (10 sec: 5613.4, 60 sec: 5556.9, 300 sec: 5539.1). Total num frames: 637040640. Throughput: 0: 5770.0. Samples: 637041394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:42,690][25689] Avg episode reward: [(0, '-7.907')] [2022-07-10 07:15:42,830][26022] Updated weights on worker 0-0, policy_version 622111 (0.00094) [2022-07-10 07:15:44,750][26022] Updated weights on worker 0-0, policy_version 622121 (0.00090) [2022-07-10 07:15:46,323][26022] Updated weights on worker 0-0, policy_version 622131 (0.00092) [2022-07-10 07:15:47,714][25689] Fps is (10 sec: 5719.0, 60 sec: 5555.4, 300 sec: 5533.2). Total num frames: 637068288. Throughput: 0: 5797.6. Samples: 637075150. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:47,715][25689] Avg episode reward: [(0, '-7.424')] [2022-07-10 07:15:48,547][26022] Updated weights on worker 0-0, policy_version 622141 (0.00052) [2022-07-10 07:15:50,059][26022] Updated weights on worker 0-0, policy_version 622151 (0.00096) [2022-07-10 07:15:52,138][26022] Updated weights on worker 0-0, policy_version 622161 (0.00087) [2022-07-10 07:15:52,834][25689] Fps is (10 sec: 5551.8, 60 sec: 5523.8, 300 sec: 5535.1). Total num frames: 637096960. Throughput: 0: 4978.9. Samples: 637091768. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:52,834][25689] Avg episode reward: [(0, '-7.838')] [2022-07-10 07:15:53,747][26022] Updated weights on worker 0-0, policy_version 622171 (0.00083) [2022-07-10 07:15:55,874][26022] Updated weights on worker 0-0, policy_version 622181 (0.00099) [2022-07-10 07:15:57,379][26022] Updated weights on worker 0-0, policy_version 622191 (0.00088) [2022-07-10 07:15:57,877][25689] Fps is (10 sec: 5642.2, 60 sec: 5556.9, 300 sec: 5538.2). Total num frames: 637125632. Throughput: 0: 5801.9. Samples: 637125084. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:15:57,878][25689] Avg episode reward: [(0, '-7.998')] [2022-07-10 07:15:59,379][26022] Updated weights on worker 0-0, policy_version 622201 (0.00085) [2022-07-10 07:16:01,325][26022] Updated weights on worker 0-0, policy_version 622211 (0.00086) [2022-07-10 07:16:02,929][25689] Fps is (10 sec: 5375.7, 60 sec: 5538.0, 300 sec: 5534.5). Total num frames: 637151232. Throughput: 0: 5760.7. Samples: 637157992. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:02,930][25689] Avg episode reward: [(0, '-8.800')] [2022-07-10 07:16:03,379][26022] Updated weights on worker 0-0, policy_version 622221 (0.00090) [2022-07-10 07:16:05,264][26022] Updated weights on worker 0-0, policy_version 622231 (0.00088) [2022-07-10 07:16:07,236][26022] Updated weights on worker 0-0, policy_version 622241 (0.00090) [2022-07-10 07:16:07,955][25689] Fps is (10 sec: 5283.6, 60 sec: 5520.3, 300 sec: 5532.7). Total num frames: 637178880. Throughput: 0: 4850.8. Samples: 637173336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:07,955][25689] Avg episode reward: [(0, '-9.570')] [2022-07-10 07:16:08,961][26022] Updated weights on worker 0-0, policy_version 622251 (0.00090) [2022-07-10 07:16:10,892][26022] Updated weights on worker 0-0, policy_version 622261 (0.00086) [2022-07-10 07:16:11,493][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:16:11,501][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000622265_637199360.pth [2022-07-10 07:16:11,502][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000620316_635203584.pth [2022-07-10 07:16:12,433][26022] Updated weights on worker 0-0, policy_version 622271 (0.00090) [2022-07-10 07:16:13,094][25689] Fps is (10 sec: 5541.0, 60 sec: 5550.1, 300 sec: 5527.4). Total num frames: 637207552. Throughput: 0: 5684.2. Samples: 637206930. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:13,094][25689] Avg episode reward: [(0, '-10.036')] [2022-07-10 07:16:14,452][26022] Updated weights on worker 0-0, policy_version 622281 (0.00088) [2022-07-10 07:16:16,028][26022] Updated weights on worker 0-0, policy_version 622291 (0.00093) [2022-07-10 07:16:18,098][25689] Fps is (10 sec: 5552.8, 60 sec: 5520.7, 300 sec: 5536.0). Total num frames: 637235200. Throughput: 0: 5713.1. Samples: 637240606. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:18,098][25689] Avg episode reward: [(0, '-10.944')] [2022-07-10 07:16:18,107][26022] Updated weights on worker 0-0, policy_version 622301 (0.00098) [2022-07-10 07:16:19,916][26022] Updated weights on worker 0-0, policy_version 622311 (0.00086) [2022-07-10 07:16:21,823][26022] Updated weights on worker 0-0, policy_version 622321 (0.00083) [2022-07-10 07:16:23,104][25689] Fps is (10 sec: 5728.3, 60 sec: 5555.9, 300 sec: 5536.3). Total num frames: 637264896. Throughput: 0: 4924.8. Samples: 637257350. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:23,105][25689] Avg episode reward: [(0, '-10.995')] [2022-07-10 07:16:23,555][26022] Updated weights on worker 0-0, policy_version 622331 (0.00092) [2022-07-10 07:16:25,471][26022] Updated weights on worker 0-0, policy_version 622341 (0.00094) [2022-07-10 07:16:27,362][26022] Updated weights on worker 0-0, policy_version 622351 (0.00088) [2022-07-10 07:16:28,141][25689] Fps is (10 sec: 5607.6, 60 sec: 5525.7, 300 sec: 5533.8). Total num frames: 637291520. Throughput: 0: 5810.0. Samples: 637290618. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:28,142][25689] Avg episode reward: [(0, '-10.113')] [2022-07-10 07:16:29,147][26022] Updated weights on worker 0-0, policy_version 622361 (0.00087) [2022-07-10 07:16:31,084][26022] Updated weights on worker 0-0, policy_version 622371 (0.00085) [2022-07-10 07:16:32,894][26022] Updated weights on worker 0-0, policy_version 622381 (0.00090) [2022-07-10 07:16:33,204][25689] Fps is (10 sec: 5373.5, 60 sec: 5530.3, 300 sec: 5530.9). Total num frames: 637319168. Throughput: 0: 5809.1. Samples: 637323754. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:33,205][25689] Avg episode reward: [(0, '-10.464')] [2022-07-10 07:16:34,583][26022] Updated weights on worker 0-0, policy_version 622391 (0.00092) [2022-07-10 07:16:36,708][26022] Updated weights on worker 0-0, policy_version 622401 (0.00086) [2022-07-10 07:16:38,135][26022] Updated weights on worker 0-0, policy_version 622411 (0.00088) [2022-07-10 07:16:38,232][25689] Fps is (10 sec: 5682.8, 60 sec: 5581.3, 300 sec: 5538.4). Total num frames: 637348864. Throughput: 0: 4962.7. Samples: 637340526. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:38,232][25689] Avg episode reward: [(0, '-11.776')] [2022-07-10 07:16:40,254][26022] Updated weights on worker 0-0, policy_version 622421 (0.01158) [2022-07-10 07:16:42,158][26022] Updated weights on worker 0-0, policy_version 622431 (0.00052) [2022-07-10 07:16:43,291][25689] Fps is (10 sec: 5583.8, 60 sec: 5525.4, 300 sec: 5528.7). Total num frames: 637375488. Throughput: 0: 5797.3. Samples: 637374376. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:43,291][25689] Avg episode reward: [(0, '-12.808')] [2022-07-10 07:16:43,828][26022] Updated weights on worker 0-0, policy_version 622441 (0.00089) [2022-07-10 07:16:45,771][26022] Updated weights on worker 0-0, policy_version 622451 (0.00124) [2022-07-10 07:16:47,294][26022] Updated weights on worker 0-0, policy_version 622461 (0.00087) [2022-07-10 07:16:48,363][25689] Fps is (10 sec: 5558.9, 60 sec: 5554.8, 300 sec: 5542.1). Total num frames: 637405184. Throughput: 0: 5812.9. Samples: 637408168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:48,364][25689] Avg episode reward: [(0, '-11.718')] [2022-07-10 07:16:49,312][26022] Updated weights on worker 0-0, policy_version 622471 (0.00091) [2022-07-10 07:16:51,211][26022] Updated weights on worker 0-0, policy_version 622481 (0.00086) [2022-07-10 07:16:52,881][26022] Updated weights on worker 0-0, policy_version 622491 (0.00091) [2022-07-10 07:16:53,431][25689] Fps is (10 sec: 5756.3, 60 sec: 5559.6, 300 sec: 5541.2). Total num frames: 637433856. Throughput: 0: 5833.5. Samples: 637441744. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:53,431][25689] Avg episode reward: [(0, '-12.452')] [2022-07-10 07:16:55,019][26022] Updated weights on worker 0-0, policy_version 622501 (0.00055) [2022-07-10 07:16:56,513][26022] Updated weights on worker 0-0, policy_version 622511 (0.00091) [2022-07-10 07:16:58,480][25689] Fps is (10 sec: 5364.7, 60 sec: 5508.4, 300 sec: 5530.7). Total num frames: 637459456. Throughput: 0: 5829.3. Samples: 637458558. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:16:58,480][25689] Avg episode reward: [(0, '-12.664')] [2022-07-10 07:16:58,708][26022] Updated weights on worker 0-0, policy_version 622521 (0.00091) [2022-07-10 07:17:00,283][26022] Updated weights on worker 0-0, policy_version 622531 (0.00088) [2022-07-10 07:17:02,663][26022] Updated weights on worker 0-0, policy_version 622541 (0.00094) [2022-07-10 07:17:03,486][25689] Fps is (10 sec: 5397.4, 60 sec: 5563.3, 300 sec: 5544.7). Total num frames: 637488128. Throughput: 0: 5744.0. Samples: 637490378. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:17:03,486][25689] Avg episode reward: [(0, '-11.882')] [2022-07-10 07:17:04,434][26022] Updated weights on worker 0-0, policy_version 622551 (0.00052) [2022-07-10 07:17:06,139][26022] Updated weights on worker 0-0, policy_version 622561 (0.00082) [2022-07-10 07:17:08,081][26022] Updated weights on worker 0-0, policy_version 622571 (0.00094) [2022-07-10 07:17:08,498][25689] Fps is (10 sec: 5519.9, 60 sec: 5547.7, 300 sec: 5535.3). Total num frames: 637514752. Throughput: 0: 5721.6. Samples: 637523366. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:17:08,498][25689] Avg episode reward: [(0, '-10.495')] [2022-07-10 07:17:09,998][26022] Updated weights on worker 0-0, policy_version 622581 (0.00087) [2022-07-10 07:17:11,714][26022] Updated weights on worker 0-0, policy_version 622591 (0.00089) [2022-07-10 07:17:13,548][25689] Fps is (10 sec: 5393.8, 60 sec: 5538.9, 300 sec: 5535.7). Total num frames: 637542400. Throughput: 0: 4887.8. Samples: 637540072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:17:13,548][25689] Avg episode reward: [(0, '-10.380')] [2022-07-10 07:17:13,622][26022] Updated weights on worker 0-0, policy_version 622601 (0.00092) [2022-07-10 07:17:15,395][26022] Updated weights on worker 0-0, policy_version 622611 (0.00089) [2022-07-10 07:17:17,291][26022] Updated weights on worker 0-0, policy_version 622621 (0.00090) [2022-07-10 07:17:18,557][25689] Fps is (10 sec: 5598.8, 60 sec: 5555.4, 300 sec: 5536.5). Total num frames: 637571072. Throughput: 0: 5745.4. Samples: 637573908. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:17:18,557][25689] Avg episode reward: [(0, '-9.938')] [2022-07-10 07:17:18,892][26022] Updated weights on worker 0-0, policy_version 622631 (0.00088) [2022-07-10 07:17:21,155][26022] Updated weights on worker 0-0, policy_version 622641 (0.00089) [2022-07-10 07:17:22,650][26022] Updated weights on worker 0-0, policy_version 622651 (0.00087) [2022-07-10 07:17:23,579][25689] Fps is (10 sec: 5716.7, 60 sec: 5537.0, 300 sec: 5539.8). Total num frames: 637599744. Throughput: 0: 5823.2. Samples: 637607382. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2022-07-10 07:17:23,579][25689] Avg episode reward: [(0, '-8.921')] [2022-07-10 07:17:24,807][26022] Updated weights on worker 0-0, policy_version 622661 (0.00083) [2022-07-10 07:17:26,223][26022] Updated weights on worker 0-0, policy_version 622671 (0.00093) [2022-07-10 07:17:28,355][26022] Updated weights on worker 0-0, policy_version 622681 (0.00084) [2022-07-10 07:17:28,601][25689] Fps is (10 sec: 5505.2, 60 sec: 5538.4, 300 sec: 5536.9). Total num frames: 637626368. Throughput: 0: 5003.2. Samples: 637623948. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:28,602][25689] Avg episode reward: [(0, '-9.913')] [2022-07-10 07:17:30,155][26022] Updated weights on worker 0-0, policy_version 622691 (0.00094) [2022-07-10 07:17:31,950][26022] Updated weights on worker 0-0, policy_version 622701 (0.00087) [2022-07-10 07:17:33,671][25689] Fps is (10 sec: 5377.7, 60 sec: 5537.8, 300 sec: 5533.5). Total num frames: 637654016. Throughput: 0: 5824.7. Samples: 637657282. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:33,671][25689] Avg episode reward: [(0, '-9.540')] [2022-07-10 07:17:33,975][26022] Updated weights on worker 0-0, policy_version 622711 (0.00091) [2022-07-10 07:17:35,507][26022] Updated weights on worker 0-0, policy_version 622721 (0.00086) [2022-07-10 07:17:37,659][26022] Updated weights on worker 0-0, policy_version 622731 (0.00090) [2022-07-10 07:17:38,687][25689] Fps is (10 sec: 5685.7, 60 sec: 5538.8, 300 sec: 5540.9). Total num frames: 637683712. Throughput: 0: 5809.1. Samples: 637690844. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:38,687][25689] Avg episode reward: [(0, '-9.043')] [2022-07-10 07:17:39,175][26022] Updated weights on worker 0-0, policy_version 622741 (0.00094) [2022-07-10 07:17:41,231][26022] Updated weights on worker 0-0, policy_version 622751 (0.00086) [2022-07-10 07:17:43,007][26022] Updated weights on worker 0-0, policy_version 622761 (0.00099) [2022-07-10 07:17:43,692][25689] Fps is (10 sec: 5518.0, 60 sec: 5526.8, 300 sec: 5534.2). Total num frames: 637709312. Throughput: 0: 4980.7. Samples: 637707558. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:43,692][25689] Avg episode reward: [(0, '-9.116')] [2022-07-10 07:17:44,906][26022] Updated weights on worker 0-0, policy_version 622771 (0.00094) [2022-07-10 07:17:46,914][26022] Updated weights on worker 0-0, policy_version 622781 (0.00091) [2022-07-10 07:17:48,519][26022] Updated weights on worker 0-0, policy_version 622791 (0.00082) [2022-07-10 07:17:48,719][25689] Fps is (10 sec: 5512.0, 60 sec: 5531.0, 300 sec: 5535.3). Total num frames: 637739008. Throughput: 0: 5818.0. Samples: 637740992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:48,720][25689] Avg episode reward: [(0, '-10.844')] [2022-07-10 07:17:50,499][26022] Updated weights on worker 0-0, policy_version 622801 (0.00082) [2022-07-10 07:17:52,124][26022] Updated weights on worker 0-0, policy_version 622811 (0.00086) [2022-07-10 07:17:53,781][25689] Fps is (10 sec: 5683.7, 60 sec: 5514.5, 300 sec: 5538.8). Total num frames: 637766656. Throughput: 0: 5829.2. Samples: 637774508. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:53,782][25689] Avg episode reward: [(0, '-9.163')] [2022-07-10 07:17:53,954][26022] Updated weights on worker 0-0, policy_version 622821 (0.00086) [2022-07-10 07:17:55,911][26022] Updated weights on worker 0-0, policy_version 622831 (0.00092) [2022-07-10 07:17:57,728][26022] Updated weights on worker 0-0, policy_version 622841 (0.00092) [2022-07-10 07:17:58,785][25689] Fps is (10 sec: 5594.8, 60 sec: 5569.5, 300 sec: 5535.6). Total num frames: 637795328. Throughput: 0: 4992.2. Samples: 637791180. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:17:58,786][25689] Avg episode reward: [(0, '-9.306')] [2022-07-10 07:17:59,544][26022] Updated weights on worker 0-0, policy_version 622851 (0.00094) [2022-07-10 07:18:01,451][26022] Updated weights on worker 0-0, policy_version 622861 (0.00095) [2022-07-10 07:18:03,646][26022] Updated weights on worker 0-0, policy_version 622871 (0.00087) [2022-07-10 07:18:03,791][25689] Fps is (10 sec: 5319.6, 60 sec: 5501.7, 300 sec: 5533.6). Total num frames: 637819904. Throughput: 0: 5718.7. Samples: 637822498. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:03,791][25689] Avg episode reward: [(0, '-10.289')] [2022-07-10 07:18:05,523][26022] Updated weights on worker 0-0, policy_version 622881 (0.00098) [2022-07-10 07:18:07,332][26022] Updated weights on worker 0-0, policy_version 622891 (0.00091) [2022-07-10 07:18:08,808][25689] Fps is (10 sec: 5108.3, 60 sec: 5501.1, 300 sec: 5531.5). Total num frames: 637846528. Throughput: 0: 5722.1. Samples: 637855946. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:08,809][25689] Avg episode reward: [(0, '-9.191')] [2022-07-10 07:18:09,307][26022] Updated weights on worker 0-0, policy_version 622901 (0.00094) [2022-07-10 07:18:10,936][26022] Updated weights on worker 0-0, policy_version 622911 (0.00098) [2022-07-10 07:18:11,663][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:18:11,674][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000622914_637863936.pth [2022-07-10 07:18:11,674][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000620966_635869184.pth [2022-07-10 07:18:12,870][26022] Updated weights on worker 0-0, policy_version 622921 (0.00087) [2022-07-10 07:18:13,864][25689] Fps is (10 sec: 5794.4, 60 sec: 5568.5, 300 sec: 5534.0). Total num frames: 637878272. Throughput: 0: 4875.1. Samples: 637872414. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:13,864][25689] Avg episode reward: [(0, '-8.829')] [2022-07-10 07:18:14,858][26022] Updated weights on worker 0-0, policy_version 622931 (0.00090) [2022-07-10 07:18:16,375][26022] Updated weights on worker 0-0, policy_version 622941 (0.00090) [2022-07-10 07:18:18,667][26022] Updated weights on worker 0-0, policy_version 622951 (0.00385) [2022-07-10 07:18:18,931][25689] Fps is (10 sec: 5563.3, 60 sec: 5495.3, 300 sec: 5527.7). Total num frames: 637902848. Throughput: 0: 5701.1. Samples: 637906036. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:18,932][25689] Avg episode reward: [(0, '-6.949')] [2022-07-10 07:18:20,009][26022] Updated weights on worker 0-0, policy_version 622961 (0.00087) [2022-07-10 07:18:22,206][26022] Updated weights on worker 0-0, policy_version 622971 (0.00087) [2022-07-10 07:18:23,874][26022] Updated weights on worker 0-0, policy_version 622981 (0.00083) [2022-07-10 07:18:23,968][25689] Fps is (10 sec: 5371.3, 60 sec: 5510.9, 300 sec: 5534.3). Total num frames: 637932544. Throughput: 0: 5798.5. Samples: 637939496. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:23,970][25689] Avg episode reward: [(0, '-8.045')] [2022-07-10 07:18:25,619][26022] Updated weights on worker 0-0, policy_version 622991 (0.00087) [2022-07-10 07:18:27,614][26022] Updated weights on worker 0-0, policy_version 623001 (0.00086) [2022-07-10 07:18:29,006][25689] Fps is (10 sec: 5793.4, 60 sec: 5543.3, 300 sec: 5531.6). Total num frames: 637961216. Throughput: 0: 4946.3. Samples: 637955854. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:29,008][25689] Avg episode reward: [(0, '-7.366')] [2022-07-10 07:18:29,560][26022] Updated weights on worker 0-0, policy_version 623011 (0.00094) [2022-07-10 07:18:31,454][26022] Updated weights on worker 0-0, policy_version 623021 (0.00093) [2022-07-10 07:18:33,538][26022] Updated weights on worker 0-0, policy_version 623031 (0.00083) [2022-07-10 07:18:34,071][25689] Fps is (10 sec: 5371.8, 60 sec: 5509.9, 300 sec: 5527.2). Total num frames: 637986816. Throughput: 0: 5755.1. Samples: 637988708. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:34,072][25689] Avg episode reward: [(0, '-7.097')] [2022-07-10 07:18:34,869][26022] Updated weights on worker 0-0, policy_version 623041 (0.00095) [2022-07-10 07:18:37,109][26022] Updated weights on worker 0-0, policy_version 623051 (0.00095) [2022-07-10 07:18:38,475][26022] Updated weights on worker 0-0, policy_version 623061 (0.00092) [2022-07-10 07:18:39,087][25689] Fps is (10 sec: 5485.1, 60 sec: 5509.9, 300 sec: 5530.6). Total num frames: 638016512. Throughput: 0: 5765.2. Samples: 638022240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:39,088][25689] Avg episode reward: [(0, '-7.212')] [2022-07-10 07:18:40,658][26022] Updated weights on worker 0-0, policy_version 623071 (0.00102) [2022-07-10 07:18:42,491][26022] Updated weights on worker 0-0, policy_version 623081 (0.00113) [2022-07-10 07:18:44,116][25689] Fps is (10 sec: 5708.7, 60 sec: 5541.6, 300 sec: 5530.3). Total num frames: 638044160. Throughput: 0: 4947.1. Samples: 638039170. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:44,117][25689] Avg episode reward: [(0, '-6.486')] [2022-07-10 07:18:44,239][26022] Updated weights on worker 0-0, policy_version 623091 (0.00700) [2022-07-10 07:18:46,082][26022] Updated weights on worker 0-0, policy_version 623101 (0.00090) [2022-07-10 07:18:47,937][26022] Updated weights on worker 0-0, policy_version 623111 (0.00088) [2022-07-10 07:18:49,150][25689] Fps is (10 sec: 5597.2, 60 sec: 5524.1, 300 sec: 5532.2). Total num frames: 638072832. Throughput: 0: 5815.9. Samples: 638073004. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:49,150][25689] Avg episode reward: [(0, '-6.836')] [2022-07-10 07:18:49,664][26022] Updated weights on worker 0-0, policy_version 623121 (0.00094) [2022-07-10 07:18:51,680][26022] Updated weights on worker 0-0, policy_version 623131 (0.00081) [2022-07-10 07:18:53,499][26022] Updated weights on worker 0-0, policy_version 623141 (0.00088) [2022-07-10 07:18:54,276][25689] Fps is (10 sec: 5543.2, 60 sec: 5518.2, 300 sec: 5530.7). Total num frames: 638100480. Throughput: 0: 5817.3. Samples: 638106248. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:54,277][25689] Avg episode reward: [(0, '-5.716')] [2022-07-10 07:18:55,235][26022] Updated weights on worker 0-0, policy_version 623151 (0.00081) [2022-07-10 07:18:57,135][26022] Updated weights on worker 0-0, policy_version 623161 (0.00092) [2022-07-10 07:18:58,953][26022] Updated weights on worker 0-0, policy_version 623171 (0.00086) [2022-07-10 07:18:59,305][25689] Fps is (10 sec: 5546.0, 60 sec: 5516.0, 300 sec: 5535.5). Total num frames: 638129152. Throughput: 0: 4987.2. Samples: 638123068. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:18:59,305][25689] Avg episode reward: [(0, '-6.661')] [2022-07-10 07:19:00,727][26022] Updated weights on worker 0-0, policy_version 623181 (0.00089) [2022-07-10 07:19:02,985][26022] Updated weights on worker 0-0, policy_version 623191 (0.00087) [2022-07-10 07:19:04,310][25689] Fps is (10 sec: 5408.7, 60 sec: 5532.9, 300 sec: 5532.4). Total num frames: 638154752. Throughput: 0: 5720.2. Samples: 638154686. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:04,311][25689] Avg episode reward: [(0, '-6.364')] [2022-07-10 07:19:04,911][26022] Updated weights on worker 0-0, policy_version 623201 (0.00082) [2022-07-10 07:19:06,591][26022] Updated weights on worker 0-0, policy_version 623211 (0.00092) [2022-07-10 07:19:08,569][26022] Updated weights on worker 0-0, policy_version 623221 (0.00092) [2022-07-10 07:19:09,342][25689] Fps is (10 sec: 5305.0, 60 sec: 5548.5, 300 sec: 5532.5). Total num frames: 638182400. Throughput: 0: 5704.9. Samples: 638188198. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:09,342][25689] Avg episode reward: [(0, '-5.582')] [2022-07-10 07:19:10,208][26022] Updated weights on worker 0-0, policy_version 623231 (0.00093) [2022-07-10 07:19:12,284][26022] Updated weights on worker 0-0, policy_version 623241 (0.00090) [2022-07-10 07:19:14,071][26022] Updated weights on worker 0-0, policy_version 623251 (0.00096) [2022-07-10 07:19:14,391][25689] Fps is (10 sec: 5485.6, 60 sec: 5481.5, 300 sec: 5528.4). Total num frames: 638210048. Throughput: 0: 4877.9. Samples: 638204364. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:14,391][25689] Avg episode reward: [(0, '-5.196')] [2022-07-10 07:19:15,870][26022] Updated weights on worker 0-0, policy_version 623261 (0.00349) [2022-07-10 07:19:17,883][26022] Updated weights on worker 0-0, policy_version 623271 (0.00095) [2022-07-10 07:19:19,402][25689] Fps is (10 sec: 5598.4, 60 sec: 5554.3, 300 sec: 5536.0). Total num frames: 638238720. Throughput: 0: 5699.6. Samples: 638237614. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:19,402][25689] Avg episode reward: [(0, '-5.852')] [2022-07-10 07:19:19,468][26022] Updated weights on worker 0-0, policy_version 623281 (0.00090) [2022-07-10 07:19:21,612][26022] Updated weights on worker 0-0, policy_version 623291 (0.00094) [2022-07-10 07:19:23,215][26022] Updated weights on worker 0-0, policy_version 623301 (0.00086) [2022-07-10 07:19:24,424][25689] Fps is (10 sec: 5511.0, 60 sec: 5504.8, 300 sec: 5532.3). Total num frames: 638265344. Throughput: 0: 5773.4. Samples: 638270812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:24,425][25689] Avg episode reward: [(0, '-6.588')] [2022-07-10 07:19:25,249][26022] Updated weights on worker 0-0, policy_version 623311 (0.00093) [2022-07-10 07:19:26,924][26022] Updated weights on worker 0-0, policy_version 623321 (0.00092) [2022-07-10 07:19:28,817][26022] Updated weights on worker 0-0, policy_version 623331 (0.00449) [2022-07-10 07:19:29,442][25689] Fps is (10 sec: 5507.2, 60 sec: 5506.6, 300 sec: 5532.7). Total num frames: 638294016. Throughput: 0: 4943.1. Samples: 638287556. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:29,443][25689] Avg episode reward: [(0, '-7.265')] [2022-07-10 07:19:30,555][26022] Updated weights on worker 0-0, policy_version 623341 (0.00097) [2022-07-10 07:19:32,659][26022] Updated weights on worker 0-0, policy_version 623351 (0.00091) [2022-07-10 07:19:34,487][26022] Updated weights on worker 0-0, policy_version 623361 (0.00083) [2022-07-10 07:19:34,511][25689] Fps is (10 sec: 5584.0, 60 sec: 5540.2, 300 sec: 5528.7). Total num frames: 638321664. Throughput: 0: 5787.9. Samples: 638320816. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:34,511][25689] Avg episode reward: [(0, '-8.287')] [2022-07-10 07:19:36,167][26022] Updated weights on worker 0-0, policy_version 623371 (0.00091) [2022-07-10 07:19:38,030][26022] Updated weights on worker 0-0, policy_version 623381 (0.00087) [2022-07-10 07:19:39,527][25689] Fps is (10 sec: 5584.9, 60 sec: 5523.3, 300 sec: 5535.4). Total num frames: 638350336. Throughput: 0: 5798.1. Samples: 638354300. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:39,527][25689] Avg episode reward: [(0, '-9.742')] [2022-07-10 07:19:39,827][26022] Updated weights on worker 0-0, policy_version 623391 (0.00085) [2022-07-10 07:19:41,828][26022] Updated weights on worker 0-0, policy_version 623401 (0.00085) [2022-07-10 07:19:43,760][26022] Updated weights on worker 0-0, policy_version 623411 (0.00084) [2022-07-10 07:19:44,530][25689] Fps is (10 sec: 5519.0, 60 sec: 5508.7, 300 sec: 5532.1). Total num frames: 638376960. Throughput: 0: 4983.0. Samples: 638370996. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:44,531][25689] Avg episode reward: [(0, '-11.327')] [2022-07-10 07:19:45,391][26022] Updated weights on worker 0-0, policy_version 623421 (0.00096) [2022-07-10 07:19:47,472][26022] Updated weights on worker 0-0, policy_version 623431 (0.00092) [2022-07-10 07:19:49,224][26022] Updated weights on worker 0-0, policy_version 623441 (0.00083) [2022-07-10 07:19:49,539][25689] Fps is (10 sec: 5522.8, 60 sec: 5510.9, 300 sec: 5527.7). Total num frames: 638405632. Throughput: 0: 5817.4. Samples: 638404466. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:49,540][25689] Avg episode reward: [(0, '-11.933')] [2022-07-10 07:19:51,095][26022] Updated weights on worker 0-0, policy_version 623451 (0.00093) [2022-07-10 07:19:52,755][26022] Updated weights on worker 0-0, policy_version 623461 (0.00082) [2022-07-10 07:19:54,605][25689] Fps is (10 sec: 5590.1, 60 sec: 5516.5, 300 sec: 5530.6). Total num frames: 638433280. Throughput: 0: 5831.1. Samples: 638437986. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:54,605][25689] Avg episode reward: [(0, '-12.338')] [2022-07-10 07:19:54,640][26022] Updated weights on worker 0-0, policy_version 623471 (0.00093) [2022-07-10 07:19:56,558][26022] Updated weights on worker 0-0, policy_version 623481 (0.00094) [2022-07-10 07:19:58,448][26022] Updated weights on worker 0-0, policy_version 623491 (0.00094) [2022-07-10 07:19:59,613][25689] Fps is (10 sec: 5590.5, 60 sec: 5518.3, 300 sec: 5537.9). Total num frames: 638461952. Throughput: 0: 5001.8. Samples: 638454768. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:19:59,614][25689] Avg episode reward: [(0, '-13.557')] [2022-07-10 07:20:00,248][26022] Updated weights on worker 0-0, policy_version 623501 (0.00094) [2022-07-10 07:20:02,409][26022] Updated weights on worker 0-0, policy_version 623511 (0.00087) [2022-07-10 07:20:04,144][26022] Updated weights on worker 0-0, policy_version 623521 (0.00089) [2022-07-10 07:20:04,642][25689] Fps is (10 sec: 5407.2, 60 sec: 5516.2, 300 sec: 5527.3). Total num frames: 638487552. Throughput: 0: 5711.1. Samples: 638485858. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:04,642][25689] Avg episode reward: [(0, '-13.670')] [2022-07-10 07:20:06,384][26022] Updated weights on worker 0-0, policy_version 623531 (0.00093) [2022-07-10 07:20:07,879][26022] Updated weights on worker 0-0, policy_version 623541 (0.00097) [2022-07-10 07:20:09,651][25689] Fps is (10 sec: 5203.0, 60 sec: 5501.3, 300 sec: 5528.9). Total num frames: 638514176. Throughput: 0: 5702.2. Samples: 638519146. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:09,651][25689] Avg episode reward: [(0, '-12.837')] [2022-07-10 07:20:09,895][26022] Updated weights on worker 0-0, policy_version 623551 (0.00079) [2022-07-10 07:20:11,692][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:20:11,710][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000623561_638526464.pth [2022-07-10 07:20:11,710][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000621614_636532736.pth [2022-07-10 07:20:11,714][26022] Updated weights on worker 0-0, policy_version 623561 (0.00079) [2022-07-10 07:20:13,331][26022] Updated weights on worker 0-0, policy_version 623571 (0.00087) [2022-07-10 07:20:14,720][25689] Fps is (10 sec: 5486.5, 60 sec: 5516.4, 300 sec: 5525.2). Total num frames: 638542848. Throughput: 0: 5695.5. Samples: 638552554. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:14,721][25689] Avg episode reward: [(0, '-12.500')] [2022-07-10 07:20:15,490][26022] Updated weights on worker 0-0, policy_version 623581 (0.00764) [2022-07-10 07:20:17,140][26022] Updated weights on worker 0-0, policy_version 623591 (0.00089) [2022-07-10 07:20:18,996][26022] Updated weights on worker 0-0, policy_version 623601 (0.00086) [2022-07-10 07:20:19,737][25689] Fps is (10 sec: 5685.2, 60 sec: 5515.8, 300 sec: 5528.7). Total num frames: 638571520. Throughput: 0: 5695.3. Samples: 638569380. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:19,738][25689] Avg episode reward: [(0, '-11.396')] [2022-07-10 07:20:20,850][26022] Updated weights on worker 0-0, policy_version 623611 (0.00108) [2022-07-10 07:20:22,631][26022] Updated weights on worker 0-0, policy_version 623621 (0.00094) [2022-07-10 07:20:24,466][26022] Updated weights on worker 0-0, policy_version 623631 (0.00087) [2022-07-10 07:20:24,768][25689] Fps is (10 sec: 5605.0, 60 sec: 5532.0, 300 sec: 5526.1). Total num frames: 638599168. Throughput: 0: 5819.7. Samples: 638602990. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:24,770][25689] Avg episode reward: [(0, '-9.472')] [2022-07-10 07:20:26,584][26022] Updated weights on worker 0-0, policy_version 623641 (0.00095) [2022-07-10 07:20:28,180][26022] Updated weights on worker 0-0, policy_version 623651 (0.00098) [2022-07-10 07:20:29,770][25689] Fps is (10 sec: 5511.3, 60 sec: 5516.5, 300 sec: 5528.2). Total num frames: 638626816. Throughput: 0: 5826.9. Samples: 638636382. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:29,771][25689] Avg episode reward: [(0, '-8.005')] [2022-07-10 07:20:30,070][26022] Updated weights on worker 0-0, policy_version 623661 (0.00095) [2022-07-10 07:20:31,721][26022] Updated weights on worker 0-0, policy_version 623671 (0.00089) [2022-07-10 07:20:33,607][26022] Updated weights on worker 0-0, policy_version 623681 (0.00092) [2022-07-10 07:20:34,821][25689] Fps is (10 sec: 5500.8, 60 sec: 5518.1, 300 sec: 5531.2). Total num frames: 638654464. Throughput: 0: 4991.8. Samples: 638652892. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:20:34,821][25689] Avg episode reward: [(0, '-8.237')] [2022-07-10 07:20:35,536][26022] Updated weights on worker 0-0, policy_version 623691 (0.00092) [2022-07-10 07:20:37,475][26022] Updated weights on worker 0-0, policy_version 623701 (0.00088) [2022-07-10 07:20:39,088][26022] Updated weights on worker 0-0, policy_version 623711 (0.00087) [2022-07-10 07:20:39,824][25689] Fps is (10 sec: 5602.0, 60 sec: 5519.3, 300 sec: 5527.8). Total num frames: 638683136. Throughput: 0: 5823.9. Samples: 638686362. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:20:39,825][25689] Avg episode reward: [(0, '-8.512')] [2022-07-10 07:20:41,315][26022] Updated weights on worker 0-0, policy_version 623721 (0.00092) [2022-07-10 07:20:42,758][26022] Updated weights on worker 0-0, policy_version 623731 (0.00089) [2022-07-10 07:20:44,834][25689] Fps is (10 sec: 5522.4, 60 sec: 5518.7, 300 sec: 5524.6). Total num frames: 638709760. Throughput: 0: 5814.8. Samples: 638719666. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:20:44,836][25689] Avg episode reward: [(0, '-7.714')] [2022-07-10 07:20:44,875][26022] Updated weights on worker 0-0, policy_version 623741 (0.00089) [2022-07-10 07:20:46,441][26022] Updated weights on worker 0-0, policy_version 623751 (0.00098) [2022-07-10 07:20:48,402][26022] Updated weights on worker 0-0, policy_version 623761 (0.00089) [2022-07-10 07:20:49,849][25689] Fps is (10 sec: 5617.6, 60 sec: 5535.1, 300 sec: 5530.0). Total num frames: 638739456. Throughput: 0: 4978.6. Samples: 638736348. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:20:49,851][25689] Avg episode reward: [(0, '-8.494')] [2022-07-10 07:20:50,244][26022] Updated weights on worker 0-0, policy_version 623771 (0.00052) [2022-07-10 07:20:52,195][26022] Updated weights on worker 0-0, policy_version 623781 (0.00095) [2022-07-10 07:20:54,018][26022] Updated weights on worker 0-0, policy_version 623791 (0.00090) [2022-07-10 07:20:54,935][25689] Fps is (10 sec: 5575.7, 60 sec: 5516.3, 300 sec: 5522.3). Total num frames: 638766080. Throughput: 0: 5800.3. Samples: 638769560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:20:54,935][25689] Avg episode reward: [(0, '-8.563')] [2022-07-10 07:20:55,936][26022] Updated weights on worker 0-0, policy_version 623801 (0.00088) [2022-07-10 07:20:57,490][26022] Updated weights on worker 0-0, policy_version 623811 (0.00085) [2022-07-10 07:20:59,695][26022] Updated weights on worker 0-0, policy_version 623821 (0.00097) [2022-07-10 07:21:00,000][25689] Fps is (10 sec: 5346.9, 60 sec: 5494.2, 300 sec: 5529.0). Total num frames: 638793728. Throughput: 0: 5781.7. Samples: 638803012. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:00,000][25689] Avg episode reward: [(0, '-9.661')] [2022-07-10 07:21:01,320][26022] Updated weights on worker 0-0, policy_version 623831 (0.00091) [2022-07-10 07:21:03,724][26022] Updated weights on worker 0-0, policy_version 623841 (0.00097) [2022-07-10 07:21:05,020][25689] Fps is (10 sec: 5482.9, 60 sec: 5528.9, 300 sec: 5529.1). Total num frames: 638821376. Throughput: 0: 4867.7. Samples: 638817926. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:05,020][25689] Avg episode reward: [(0, '-9.086')] [2022-07-10 07:21:05,336][26022] Updated weights on worker 0-0, policy_version 623851 (0.00088) [2022-07-10 07:21:07,327][26022] Updated weights on worker 0-0, policy_version 623861 (0.00092) [2022-07-10 07:21:09,097][26022] Updated weights on worker 0-0, policy_version 623871 (0.00088) [2022-07-10 07:21:10,047][25689] Fps is (10 sec: 5401.6, 60 sec: 5527.2, 300 sec: 5524.3). Total num frames: 638848000. Throughput: 0: 5681.2. Samples: 638851094. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:10,048][25689] Avg episode reward: [(0, '-10.524')] [2022-07-10 07:21:11,050][26022] Updated weights on worker 0-0, policy_version 623881 (0.00096) [2022-07-10 07:21:12,711][26022] Updated weights on worker 0-0, policy_version 623891 (0.00101) [2022-07-10 07:21:14,673][26022] Updated weights on worker 0-0, policy_version 623901 (0.00091) [2022-07-10 07:21:15,167][25689] Fps is (10 sec: 5449.7, 60 sec: 5522.6, 300 sec: 5525.6). Total num frames: 638876672. Throughput: 0: 5697.7. Samples: 638884836. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:15,170][25689] Avg episode reward: [(0, '-9.478')] [2022-07-10 07:21:16,436][26022] Updated weights on worker 0-0, policy_version 623911 (0.00087) [2022-07-10 07:21:18,293][26022] Updated weights on worker 0-0, policy_version 623921 (0.00093) [2022-07-10 07:21:20,132][26022] Updated weights on worker 0-0, policy_version 623931 (0.00085) [2022-07-10 07:21:20,242][25689] Fps is (10 sec: 5624.8, 60 sec: 5517.3, 300 sec: 5520.8). Total num frames: 638905344. Throughput: 0: 4879.1. Samples: 638901776. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:20,242][25689] Avg episode reward: [(0, '-8.849')] [2022-07-10 07:21:22,029][26022] Updated weights on worker 0-0, policy_version 623941 (0.00095) [2022-07-10 07:21:23,717][26022] Updated weights on worker 0-0, policy_version 623951 (0.00087) [2022-07-10 07:21:25,249][25689] Fps is (10 sec: 5687.7, 60 sec: 5536.5, 300 sec: 5528.3). Total num frames: 638934016. Throughput: 0: 5811.4. Samples: 638935484. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:25,249][25689] Avg episode reward: [(0, '-8.001')] [2022-07-10 07:21:25,573][26022] Updated weights on worker 0-0, policy_version 623961 (0.00087) [2022-07-10 07:21:27,528][26022] Updated weights on worker 0-0, policy_version 623971 (0.00094) [2022-07-10 07:21:29,303][26022] Updated weights on worker 0-0, policy_version 623981 (0.00093) [2022-07-10 07:21:30,278][25689] Fps is (10 sec: 5611.7, 60 sec: 5534.0, 300 sec: 5528.9). Total num frames: 638961664. Throughput: 0: 5819.2. Samples: 638968822. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:30,279][25689] Avg episode reward: [(0, '-7.870')] [2022-07-10 07:21:31,199][26022] Updated weights on worker 0-0, policy_version 623991 (0.00090) [2022-07-10 07:21:33,111][26022] Updated weights on worker 0-0, policy_version 624001 (0.00087) [2022-07-10 07:21:34,759][26022] Updated weights on worker 0-0, policy_version 624011 (0.00082) [2022-07-10 07:21:35,388][25689] Fps is (10 sec: 5554.5, 60 sec: 5545.4, 300 sec: 5523.9). Total num frames: 638990336. Throughput: 0: 4972.2. Samples: 638985382. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:35,389][25689] Avg episode reward: [(0, '-8.272')] [2022-07-10 07:21:36,768][26022] Updated weights on worker 0-0, policy_version 624021 (0.00095) [2022-07-10 07:21:38,559][26022] Updated weights on worker 0-0, policy_version 624031 (0.00095) [2022-07-10 07:21:40,398][25689] Fps is (10 sec: 5464.2, 60 sec: 5511.0, 300 sec: 5524.9). Total num frames: 639016960. Throughput: 0: 5782.3. Samples: 639018322. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:40,398][25689] Avg episode reward: [(0, '-7.816')] [2022-07-10 07:21:40,533][26022] Updated weights on worker 0-0, policy_version 624041 (0.00755) [2022-07-10 07:21:42,228][26022] Updated weights on worker 0-0, policy_version 624051 (0.00095) [2022-07-10 07:21:44,149][26022] Updated weights on worker 0-0, policy_version 624061 (0.00088) [2022-07-10 07:21:45,416][25689] Fps is (10 sec: 5412.1, 60 sec: 5527.2, 300 sec: 5519.0). Total num frames: 639044608. Throughput: 0: 5766.1. Samples: 639051772. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:45,417][25689] Avg episode reward: [(0, '-9.276')] [2022-07-10 07:21:45,977][26022] Updated weights on worker 0-0, policy_version 624071 (0.00314) [2022-07-10 07:21:47,856][26022] Updated weights on worker 0-0, policy_version 624081 (0.00709) [2022-07-10 07:21:49,712][26022] Updated weights on worker 0-0, policy_version 624091 (0.00091) [2022-07-10 07:21:50,443][25689] Fps is (10 sec: 5606.7, 60 sec: 5509.3, 300 sec: 5519.7). Total num frames: 639073280. Throughput: 0: 4937.4. Samples: 639068384. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:50,443][25689] Avg episode reward: [(0, '-8.656')] [2022-07-10 07:21:51,591][26022] Updated weights on worker 0-0, policy_version 624101 (0.00084) [2022-07-10 07:21:53,254][26022] Updated weights on worker 0-0, policy_version 624111 (0.00088) [2022-07-10 07:21:55,232][26022] Updated weights on worker 0-0, policy_version 624121 (0.00098) [2022-07-10 07:21:55,545][25689] Fps is (10 sec: 5661.6, 60 sec: 5541.5, 300 sec: 5529.1). Total num frames: 639101952. Throughput: 0: 5791.0. Samples: 639102108. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:21:55,545][25689] Avg episode reward: [(0, '-9.487')] [2022-07-10 07:21:56,888][26022] Updated weights on worker 0-0, policy_version 624131 (0.00098) [2022-07-10 07:21:58,760][26022] Updated weights on worker 0-0, policy_version 624141 (0.00094) [2022-07-10 07:22:00,569][25689] Fps is (10 sec: 5561.6, 60 sec: 5545.2, 300 sec: 5525.3). Total num frames: 639129600. Throughput: 0: 5820.1. Samples: 639135724. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:00,570][25689] Avg episode reward: [(0, '-10.600')] [2022-07-10 07:22:00,715][26022] Updated weights on worker 0-0, policy_version 624151 (0.00085) [2022-07-10 07:22:02,657][26022] Updated weights on worker 0-0, policy_version 624161 (0.00086) [2022-07-10 07:22:04,767][26022] Updated weights on worker 0-0, policy_version 624171 (0.00088) [2022-07-10 07:22:05,640][25689] Fps is (10 sec: 5376.0, 60 sec: 5523.7, 300 sec: 5524.2). Total num frames: 639156224. Throughput: 0: 4880.7. Samples: 639150478. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:05,642][25689] Avg episode reward: [(0, '-9.613')] [2022-07-10 07:22:06,310][26022] Updated weights on worker 0-0, policy_version 624181 (0.00121) [2022-07-10 07:22:08,349][26022] Updated weights on worker 0-0, policy_version 624191 (0.00093) [2022-07-10 07:22:10,219][26022] Updated weights on worker 0-0, policy_version 624201 (0.00065) [2022-07-10 07:22:10,647][25689] Fps is (10 sec: 5385.6, 60 sec: 5542.5, 300 sec: 5525.0). Total num frames: 639183872. Throughput: 0: 5705.9. Samples: 639183666. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:10,647][25689] Avg episode reward: [(0, '-8.399')] [2022-07-10 07:22:11,759][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:22:11,774][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000624209_639190016.pth [2022-07-10 07:22:11,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000622265_637199360.pth [2022-07-10 07:22:12,096][26022] Updated weights on worker 0-0, policy_version 624211 (0.00084) [2022-07-10 07:22:13,931][26022] Updated weights on worker 0-0, policy_version 624221 (0.00084) [2022-07-10 07:22:15,768][25689] Fps is (10 sec: 5459.8, 60 sec: 5525.4, 300 sec: 5519.5). Total num frames: 639211520. Throughput: 0: 5679.3. Samples: 639216962. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:15,768][25689] Avg episode reward: [(0, '-7.912')] [2022-07-10 07:22:15,831][26022] Updated weights on worker 0-0, policy_version 624231 (0.00082) [2022-07-10 07:22:17,637][26022] Updated weights on worker 0-0, policy_version 624241 (0.00082) [2022-07-10 07:22:19,539][26022] Updated weights on worker 0-0, policy_version 624251 (0.00098) [2022-07-10 07:22:20,779][25689] Fps is (10 sec: 5457.4, 60 sec: 5514.4, 300 sec: 5516.2). Total num frames: 639239168. Throughput: 0: 4850.5. Samples: 639233750. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:20,779][25689] Avg episode reward: [(0, '-8.008')] [2022-07-10 07:22:21,255][26022] Updated weights on worker 0-0, policy_version 624261 (0.00085) [2022-07-10 07:22:23,117][26022] Updated weights on worker 0-0, policy_version 624271 (0.00082) [2022-07-10 07:22:24,872][26022] Updated weights on worker 0-0, policy_version 624281 (0.00087) [2022-07-10 07:22:25,785][25689] Fps is (10 sec: 5622.0, 60 sec: 5514.4, 300 sec: 5523.4). Total num frames: 639267840. Throughput: 0: 5807.8. Samples: 639267480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:25,786][25689] Avg episode reward: [(0, '-6.336')] [2022-07-10 07:22:26,820][26022] Updated weights on worker 0-0, policy_version 624291 (0.00089) [2022-07-10 07:22:28,736][26022] Updated weights on worker 0-0, policy_version 624301 (0.00092) [2022-07-10 07:22:30,649][26022] Updated weights on worker 0-0, policy_version 624311 (0.00095) [2022-07-10 07:22:30,805][25689] Fps is (10 sec: 5515.0, 60 sec: 5498.4, 300 sec: 5520.9). Total num frames: 639294464. Throughput: 0: 5802.7. Samples: 639300640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:30,806][25689] Avg episode reward: [(0, '-6.347')] [2022-07-10 07:22:32,395][26022] Updated weights on worker 0-0, policy_version 624321 (0.00086) [2022-07-10 07:22:34,296][26022] Updated weights on worker 0-0, policy_version 624331 (0.00086) [2022-07-10 07:22:35,853][25689] Fps is (10 sec: 5594.2, 60 sec: 5521.0, 300 sec: 5520.3). Total num frames: 639324160. Throughput: 0: 4990.3. Samples: 639317194. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:35,854][25689] Avg episode reward: [(0, '-6.937')] [2022-07-10 07:22:36,042][26022] Updated weights on worker 0-0, policy_version 624341 (0.00085) [2022-07-10 07:22:37,923][26022] Updated weights on worker 0-0, policy_version 624351 (0.00092) [2022-07-10 07:22:39,705][26022] Updated weights on worker 0-0, policy_version 624361 (0.00086) [2022-07-10 07:22:40,907][25689] Fps is (10 sec: 5676.8, 60 sec: 5533.8, 300 sec: 5526.3). Total num frames: 639351808. Throughput: 0: 5798.0. Samples: 639350452. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:40,907][25689] Avg episode reward: [(0, '-6.360')] [2022-07-10 07:22:41,783][26022] Updated weights on worker 0-0, policy_version 624371 (0.00082) [2022-07-10 07:22:43,310][26022] Updated weights on worker 0-0, policy_version 624381 (0.00092) [2022-07-10 07:22:45,410][26022] Updated weights on worker 0-0, policy_version 624391 (0.00091) [2022-07-10 07:22:45,936][25689] Fps is (10 sec: 5484.3, 60 sec: 5532.9, 300 sec: 5519.4). Total num frames: 639379456. Throughput: 0: 5776.9. Samples: 639383886. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:45,936][25689] Avg episode reward: [(0, '-6.198')] [2022-07-10 07:22:47,004][26022] Updated weights on worker 0-0, policy_version 624401 (0.00090) [2022-07-10 07:22:48,936][26022] Updated weights on worker 0-0, policy_version 624411 (0.00090) [2022-07-10 07:22:50,762][26022] Updated weights on worker 0-0, policy_version 624421 (0.00097) [2022-07-10 07:22:50,942][25689] Fps is (10 sec: 5510.1, 60 sec: 5517.8, 300 sec: 5520.4). Total num frames: 639407104. Throughput: 0: 4969.1. Samples: 639400702. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:50,943][25689] Avg episode reward: [(0, '-7.190')] [2022-07-10 07:22:52,567][26022] Updated weights on worker 0-0, policy_version 624431 (0.00087) [2022-07-10 07:22:54,443][26022] Updated weights on worker 0-0, policy_version 624441 (0.00087) [2022-07-10 07:22:55,998][25689] Fps is (10 sec: 5698.9, 60 sec: 5538.9, 300 sec: 5522.9). Total num frames: 639436800. Throughput: 0: 5817.4. Samples: 639434386. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:22:56,000][25689] Avg episode reward: [(0, '-6.841')] [2022-07-10 07:22:56,170][26022] Updated weights on worker 0-0, policy_version 624451 (0.00088) [2022-07-10 07:22:57,957][26022] Updated weights on worker 0-0, policy_version 624461 (0.00092) [2022-07-10 07:22:59,898][26022] Updated weights on worker 0-0, policy_version 624471 (0.00093) [2022-07-10 07:23:01,037][25689] Fps is (10 sec: 5579.1, 60 sec: 5520.7, 300 sec: 5529.2). Total num frames: 639463424. Throughput: 0: 5834.0. Samples: 639467892. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:01,037][25689] Avg episode reward: [(0, '-6.402')] [2022-07-10 07:23:01,647][26022] Updated weights on worker 0-0, policy_version 624481 (0.00087) [2022-07-10 07:23:04,111][26022] Updated weights on worker 0-0, policy_version 624491 (0.00084) [2022-07-10 07:23:05,657][26022] Updated weights on worker 0-0, policy_version 624501 (0.00084) [2022-07-10 07:23:06,058][25689] Fps is (10 sec: 5293.1, 60 sec: 5525.2, 300 sec: 5529.1). Total num frames: 639490048. Throughput: 0: 5744.8. Samples: 639499484. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:06,058][25689] Avg episode reward: [(0, '-6.480')] [2022-07-10 07:23:07,663][26022] Updated weights on worker 0-0, policy_version 624511 (0.00089) [2022-07-10 07:23:09,335][26022] Updated weights on worker 0-0, policy_version 624521 (0.00084) [2022-07-10 07:23:11,079][25689] Fps is (10 sec: 5506.5, 60 sec: 5540.9, 300 sec: 5519.4). Total num frames: 639518720. Throughput: 0: 5726.1. Samples: 639516008. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:11,081][25689] Avg episode reward: [(0, '-6.764')] [2022-07-10 07:23:11,319][26022] Updated weights on worker 0-0, policy_version 624531 (0.00095) [2022-07-10 07:23:13,044][26022] Updated weights on worker 0-0, policy_version 624541 (0.00056) [2022-07-10 07:23:15,236][26022] Updated weights on worker 0-0, policy_version 624551 (0.00089) [2022-07-10 07:23:16,127][25689] Fps is (10 sec: 5491.8, 60 sec: 5530.6, 300 sec: 5526.7). Total num frames: 639545344. Throughput: 0: 5707.1. Samples: 639549262. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:16,127][25689] Avg episode reward: [(0, '-5.290')] [2022-07-10 07:23:16,878][26022] Updated weights on worker 0-0, policy_version 624561 (0.00092) [2022-07-10 07:23:18,724][26022] Updated weights on worker 0-0, policy_version 624571 (0.00092) [2022-07-10 07:23:20,428][26022] Updated weights on worker 0-0, policy_version 624581 (0.00086) [2022-07-10 07:23:21,130][25689] Fps is (10 sec: 5501.2, 60 sec: 5548.3, 300 sec: 5523.8). Total num frames: 639574016. Throughput: 0: 5720.0. Samples: 639582828. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:21,132][25689] Avg episode reward: [(0, '-4.965')] [2022-07-10 07:23:22,391][26022] Updated weights on worker 0-0, policy_version 624591 (0.00084) [2022-07-10 07:23:24,412][26022] Updated weights on worker 0-0, policy_version 624601 (0.00087) [2022-07-10 07:23:26,098][26022] Updated weights on worker 0-0, policy_version 624611 (0.00096) [2022-07-10 07:23:26,137][25689] Fps is (10 sec: 5626.1, 60 sec: 5531.3, 300 sec: 5521.0). Total num frames: 639601664. Throughput: 0: 4993.4. Samples: 639599748. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:26,139][25689] Avg episode reward: [(0, '-5.164')] [2022-07-10 07:23:27,871][26022] Updated weights on worker 0-0, policy_version 624621 (0.00105) [2022-07-10 07:23:29,703][26022] Updated weights on worker 0-0, policy_version 624631 (0.00086) [2022-07-10 07:23:31,159][25689] Fps is (10 sec: 5616.2, 60 sec: 5565.1, 300 sec: 5532.1). Total num frames: 639630336. Throughput: 0: 5844.9. Samples: 639633372. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:31,160][25689] Avg episode reward: [(0, '-6.458')] [2022-07-10 07:23:31,445][26022] Updated weights on worker 0-0, policy_version 624641 (0.00051) [2022-07-10 07:23:33,554][26022] Updated weights on worker 0-0, policy_version 624651 (0.00087) [2022-07-10 07:23:35,134][26022] Updated weights on worker 0-0, policy_version 624661 (0.00087) [2022-07-10 07:23:36,226][25689] Fps is (10 sec: 5582.6, 60 sec: 5529.4, 300 sec: 5524.3). Total num frames: 639657984. Throughput: 0: 5840.3. Samples: 639666646. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:36,226][25689] Avg episode reward: [(0, '-6.814')] [2022-07-10 07:23:37,043][26022] Updated weights on worker 0-0, policy_version 624671 (0.00096) [2022-07-10 07:23:39,059][26022] Updated weights on worker 0-0, policy_version 624681 (0.00089) [2022-07-10 07:23:40,866][26022] Updated weights on worker 0-0, policy_version 624691 (0.00100) [2022-07-10 07:23:41,251][25689] Fps is (10 sec: 5479.2, 60 sec: 5532.0, 300 sec: 5524.3). Total num frames: 639685632. Throughput: 0: 4992.7. Samples: 639683282. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:41,251][25689] Avg episode reward: [(0, '-8.615')] [2022-07-10 07:23:42,643][26022] Updated weights on worker 0-0, policy_version 624701 (0.00063) [2022-07-10 07:23:44,505][26022] Updated weights on worker 0-0, policy_version 624711 (0.00088) [2022-07-10 07:23:46,254][25689] Fps is (10 sec: 5616.3, 60 sec: 5551.4, 300 sec: 5524.9). Total num frames: 639714304. Throughput: 0: 5819.5. Samples: 639716816. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 07:23:46,254][25689] Avg episode reward: [(0, '-8.366')] [2022-07-10 07:23:46,250][26022] Updated weights on worker 0-0, policy_version 624721 (0.00096) [2022-07-10 07:23:48,199][26022] Updated weights on worker 0-0, policy_version 624731 (0.00090) [2022-07-10 07:23:49,926][26022] Updated weights on worker 0-0, policy_version 624741 (0.00088) [2022-07-10 07:23:51,262][25689] Fps is (10 sec: 5523.6, 60 sec: 5534.3, 300 sec: 5523.7). Total num frames: 639740928. Throughput: 0: 5801.6. Samples: 639750000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:23:51,262][25689] Avg episode reward: [(0, '-10.374')] [2022-07-10 07:23:51,964][26022] Updated weights on worker 0-0, policy_version 624751 (0.00085) [2022-07-10 07:23:53,826][26022] Updated weights on worker 0-0, policy_version 624761 (0.00086) [2022-07-10 07:23:55,600][26022] Updated weights on worker 0-0, policy_version 624771 (0.00089) [2022-07-10 07:23:56,353][25689] Fps is (10 sec: 5474.9, 60 sec: 5514.0, 300 sec: 5522.5). Total num frames: 639769600. Throughput: 0: 4964.1. Samples: 639766562. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:23:56,354][25689] Avg episode reward: [(0, '-8.451')] [2022-07-10 07:23:57,482][26022] Updated weights on worker 0-0, policy_version 624781 (0.00092) [2022-07-10 07:23:59,176][26022] Updated weights on worker 0-0, policy_version 624791 (0.00084) [2022-07-10 07:24:01,079][26022] Updated weights on worker 0-0, policy_version 624801 (0.00097) [2022-07-10 07:24:01,418][25689] Fps is (10 sec: 5545.3, 60 sec: 5528.6, 300 sec: 5528.3). Total num frames: 639797248. Throughput: 0: 5783.9. Samples: 639799928. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:01,418][25689] Avg episode reward: [(0, '-9.285')] [2022-07-10 07:24:03,335][26022] Updated weights on worker 0-0, policy_version 624811 (0.00093) [2022-07-10 07:24:05,081][26022] Updated weights on worker 0-0, policy_version 624821 (0.00087) [2022-07-10 07:24:06,424][25689] Fps is (10 sec: 5287.4, 60 sec: 5513.0, 300 sec: 5521.9). Total num frames: 639822848. Throughput: 0: 5694.4. Samples: 639831676. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:06,425][25689] Avg episode reward: [(0, '-8.637')] [2022-07-10 07:24:06,926][26022] Updated weights on worker 0-0, policy_version 624831 (0.00086) [2022-07-10 07:24:08,828][26022] Updated weights on worker 0-0, policy_version 624841 (0.00090) [2022-07-10 07:24:10,653][26022] Updated weights on worker 0-0, policy_version 624851 (0.00091) [2022-07-10 07:24:11,427][25689] Fps is (10 sec: 5524.5, 60 sec: 5531.6, 300 sec: 5529.6). Total num frames: 639852544. Throughput: 0: 4880.5. Samples: 639848418. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:11,428][25689] Avg episode reward: [(0, '-7.340')] [2022-07-10 07:24:11,940][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:24:11,950][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000624858_639854592.pth [2022-07-10 07:24:11,950][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000622914_637863936.pth [2022-07-10 07:24:12,504][26022] Updated weights on worker 0-0, policy_version 624861 (0.00087) [2022-07-10 07:24:14,288][26022] Updated weights on worker 0-0, policy_version 624871 (0.00094) [2022-07-10 07:24:16,185][26022] Updated weights on worker 0-0, policy_version 624881 (0.00086) [2022-07-10 07:24:16,481][25689] Fps is (10 sec: 5600.0, 60 sec: 5531.0, 300 sec: 5521.9). Total num frames: 639879168. Throughput: 0: 5734.2. Samples: 639881980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:16,482][25689] Avg episode reward: [(0, '-7.640')] [2022-07-10 07:24:18,013][26022] Updated weights on worker 0-0, policy_version 624891 (0.00080) [2022-07-10 07:24:19,950][26022] Updated weights on worker 0-0, policy_version 624901 (0.00094) [2022-07-10 07:24:21,490][25689] Fps is (10 sec: 5495.2, 60 sec: 5530.6, 300 sec: 5529.1). Total num frames: 639907840. Throughput: 0: 5762.8. Samples: 639915598. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:21,492][25689] Avg episode reward: [(0, '-6.729')] [2022-07-10 07:24:21,604][26022] Updated weights on worker 0-0, policy_version 624911 (0.00089) [2022-07-10 07:24:23,712][26022] Updated weights on worker 0-0, policy_version 624921 (0.00082) [2022-07-10 07:24:25,215][26022] Updated weights on worker 0-0, policy_version 624931 (0.00084) [2022-07-10 07:24:26,494][25689] Fps is (10 sec: 5522.6, 60 sec: 5513.9, 300 sec: 5522.4). Total num frames: 639934464. Throughput: 0: 5018.7. Samples: 639932400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:26,495][25689] Avg episode reward: [(0, '-8.110')] [2022-07-10 07:24:27,305][26022] Updated weights on worker 0-0, policy_version 624941 (0.00080) [2022-07-10 07:24:29,086][26022] Updated weights on worker 0-0, policy_version 624951 (0.00104) [2022-07-10 07:24:30,871][26022] Updated weights on worker 0-0, policy_version 624961 (0.00084) [2022-07-10 07:24:31,507][25689] Fps is (10 sec: 5520.1, 60 sec: 5514.7, 300 sec: 5526.9). Total num frames: 639963136. Throughput: 0: 5835.2. Samples: 639965588. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:31,509][25689] Avg episode reward: [(0, '-8.431')] [2022-07-10 07:24:32,687][26022] Updated weights on worker 0-0, policy_version 624971 (0.00089) [2022-07-10 07:24:34,528][26022] Updated weights on worker 0-0, policy_version 624981 (0.00085) [2022-07-10 07:24:36,422][26022] Updated weights on worker 0-0, policy_version 624991 (0.00086) [2022-07-10 07:24:36,642][25689] Fps is (10 sec: 5650.8, 60 sec: 5525.4, 300 sec: 5524.7). Total num frames: 639991808. Throughput: 0: 5812.7. Samples: 639999168. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:36,643][25689] Avg episode reward: [(0, '-9.385')] [2022-07-10 07:24:38,232][26022] Updated weights on worker 0-0, policy_version 625001 (0.00093) [2022-07-10 07:24:39,932][26022] Updated weights on worker 0-0, policy_version 625011 (0.00090) [2022-07-10 07:24:41,655][25689] Fps is (10 sec: 5449.0, 60 sec: 5509.6, 300 sec: 5524.5). Total num frames: 640018432. Throughput: 0: 4971.9. Samples: 640015856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:41,656][25689] Avg episode reward: [(0, '-9.360')] [2022-07-10 07:24:41,896][26022] Updated weights on worker 0-0, policy_version 625021 (0.00094) [2022-07-10 07:24:43,453][26022] Updated weights on worker 0-0, policy_version 625031 (0.00088) [2022-07-10 07:24:45,571][26022] Updated weights on worker 0-0, policy_version 625041 (0.00086) [2022-07-10 07:24:46,734][25689] Fps is (10 sec: 5682.4, 60 sec: 5536.5, 300 sec: 5530.1). Total num frames: 640049152. Throughput: 0: 5802.5. Samples: 640049840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:46,734][25689] Avg episode reward: [(0, '-9.471')] [2022-07-10 07:24:47,112][26022] Updated weights on worker 0-0, policy_version 625051 (0.00095) [2022-07-10 07:24:49,241][26022] Updated weights on worker 0-0, policy_version 625061 (0.00092) [2022-07-10 07:24:51,068][26022] Updated weights on worker 0-0, policy_version 625071 (0.00084) [2022-07-10 07:24:51,788][25689] Fps is (10 sec: 5658.9, 60 sec: 5532.2, 300 sec: 5526.9). Total num frames: 640075776. Throughput: 0: 5818.4. Samples: 640083592. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:51,789][25689] Avg episode reward: [(0, '-9.543')] [2022-07-10 07:24:52,826][26022] Updated weights on worker 0-0, policy_version 625081 (0.00083) [2022-07-10 07:24:54,794][26022] Updated weights on worker 0-0, policy_version 625091 (0.00092) [2022-07-10 07:24:56,491][26022] Updated weights on worker 0-0, policy_version 625101 (0.00087) [2022-07-10 07:24:56,869][25689] Fps is (10 sec: 5657.7, 60 sec: 5567.1, 300 sec: 5532.4). Total num frames: 640106496. Throughput: 0: 5010.3. Samples: 640100512. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:24:56,870][25689] Avg episode reward: [(0, '-10.370')] [2022-07-10 07:24:58,372][26022] Updated weights on worker 0-0, policy_version 625111 (0.00092) [2022-07-10 07:24:59,914][26022] Updated weights on worker 0-0, policy_version 625121 (0.00085) [2022-07-10 07:25:01,918][25689] Fps is (10 sec: 5458.9, 60 sec: 5517.8, 300 sec: 5528.6). Total num frames: 640131072. Throughput: 0: 5845.1. Samples: 640134296. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:01,918][25689] Avg episode reward: [(0, '-9.473')] [2022-07-10 07:25:02,355][26022] Updated weights on worker 0-0, policy_version 625131 (0.00086) [2022-07-10 07:25:04,223][26022] Updated weights on worker 0-0, policy_version 625141 (0.00090) [2022-07-10 07:25:05,960][26022] Updated weights on worker 0-0, policy_version 625151 (0.00083) [2022-07-10 07:25:06,923][25689] Fps is (10 sec: 5194.4, 60 sec: 5551.7, 300 sec: 5532.1). Total num frames: 640158720. Throughput: 0: 5748.1. Samples: 640165890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:06,924][25689] Avg episode reward: [(0, '-8.621')] [2022-07-10 07:25:07,774][26022] Updated weights on worker 0-0, policy_version 625161 (0.00095) [2022-07-10 07:25:09,779][26022] Updated weights on worker 0-0, policy_version 625171 (0.00084) [2022-07-10 07:25:11,483][26022] Updated weights on worker 0-0, policy_version 625181 (0.00086) [2022-07-10 07:25:11,935][25689] Fps is (10 sec: 5724.8, 60 sec: 5550.9, 300 sec: 5536.6). Total num frames: 640188416. Throughput: 0: 4918.3. Samples: 640182678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:11,935][25689] Avg episode reward: [(0, '-10.561')] [2022-07-10 07:25:13,444][26022] Updated weights on worker 0-0, policy_version 625191 (0.00084) [2022-07-10 07:25:15,098][26022] Updated weights on worker 0-0, policy_version 625201 (0.00084) [2022-07-10 07:25:17,014][25689] Fps is (10 sec: 5682.8, 60 sec: 5565.5, 300 sec: 5532.0). Total num frames: 640216064. Throughput: 0: 5730.3. Samples: 640215946. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:17,014][25689] Avg episode reward: [(0, '-10.791')] [2022-07-10 07:25:17,019][26022] Updated weights on worker 0-0, policy_version 625211 (0.00088) [2022-07-10 07:25:18,836][26022] Updated weights on worker 0-0, policy_version 625221 (0.00086) [2022-07-10 07:25:20,777][26022] Updated weights on worker 0-0, policy_version 625231 (0.00092) [2022-07-10 07:25:22,066][25689] Fps is (10 sec: 5458.1, 60 sec: 5544.6, 300 sec: 5531.6). Total num frames: 640243712. Throughput: 0: 5728.4. Samples: 640249710. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:22,066][25689] Avg episode reward: [(0, '-11.782')] [2022-07-10 07:25:22,463][26022] Updated weights on worker 0-0, policy_version 625241 (0.00087) [2022-07-10 07:25:24,350][26022] Updated weights on worker 0-0, policy_version 625251 (0.00084) [2022-07-10 07:25:26,092][26022] Updated weights on worker 0-0, policy_version 625261 (0.00094) [2022-07-10 07:25:27,074][25689] Fps is (10 sec: 5598.1, 60 sec: 5578.1, 300 sec: 5535.0). Total num frames: 640272384. Throughput: 0: 4997.0. Samples: 640266586. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:27,075][25689] Avg episode reward: [(0, '-10.989')] [2022-07-10 07:25:28,109][26022] Updated weights on worker 0-0, policy_version 625271 (0.00087) [2022-07-10 07:25:29,703][26022] Updated weights on worker 0-0, policy_version 625281 (0.00087) [2022-07-10 07:25:31,715][26022] Updated weights on worker 0-0, policy_version 625291 (0.00086) [2022-07-10 07:25:32,099][25689] Fps is (10 sec: 5715.4, 60 sec: 5577.0, 300 sec: 5538.9). Total num frames: 640301056. Throughput: 0: 5834.0. Samples: 640300316. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:32,099][25689] Avg episode reward: [(0, '-11.065')] [2022-07-10 07:25:33,420][26022] Updated weights on worker 0-0, policy_version 625301 (0.00089) [2022-07-10 07:25:35,161][26022] Updated weights on worker 0-0, policy_version 625311 (0.00053) [2022-07-10 07:25:37,083][26022] Updated weights on worker 0-0, policy_version 625321 (0.00088) [2022-07-10 07:25:37,172][25689] Fps is (10 sec: 5577.6, 60 sec: 5565.8, 300 sec: 5534.1). Total num frames: 640328704. Throughput: 0: 5859.1. Samples: 640334054. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:37,172][25689] Avg episode reward: [(0, '-11.156')] [2022-07-10 07:25:39,082][26022] Updated weights on worker 0-0, policy_version 625331 (0.00086) [2022-07-10 07:25:40,732][26022] Updated weights on worker 0-0, policy_version 625341 (0.00093) [2022-07-10 07:25:42,184][25689] Fps is (10 sec: 5381.3, 60 sec: 5565.8, 300 sec: 5534.1). Total num frames: 640355328. Throughput: 0: 5011.5. Samples: 640350534. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:42,185][25689] Avg episode reward: [(0, '-10.235')] [2022-07-10 07:25:42,800][26022] Updated weights on worker 0-0, policy_version 625351 (0.00090) [2022-07-10 07:25:44,448][26022] Updated weights on worker 0-0, policy_version 625361 (0.00098) [2022-07-10 07:25:46,428][26022] Updated weights on worker 0-0, policy_version 625371 (0.00090) [2022-07-10 07:25:47,203][25689] Fps is (10 sec: 5614.6, 60 sec: 5554.4, 300 sec: 5534.0). Total num frames: 640385024. Throughput: 0: 5822.9. Samples: 640383792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:47,203][25689] Avg episode reward: [(0, '-8.899')] [2022-07-10 07:25:48,246][26022] Updated weights on worker 0-0, policy_version 625381 (0.00082) [2022-07-10 07:25:49,858][26022] Updated weights on worker 0-0, policy_version 625391 (0.00053) [2022-07-10 07:25:51,951][26022] Updated weights on worker 0-0, policy_version 625401 (0.00087) [2022-07-10 07:25:52,206][25689] Fps is (10 sec: 5619.7, 60 sec: 5559.2, 300 sec: 5535.6). Total num frames: 640411648. Throughput: 0: 5827.9. Samples: 640417498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:52,206][25689] Avg episode reward: [(0, '-8.189')] [2022-07-10 07:25:53,496][26022] Updated weights on worker 0-0, policy_version 625411 (0.00090) [2022-07-10 07:25:55,424][26022] Updated weights on worker 0-0, policy_version 625421 (0.00099) [2022-07-10 07:25:57,335][25689] Fps is (10 sec: 5457.4, 60 sec: 5520.8, 300 sec: 5537.8). Total num frames: 640440320. Throughput: 0: 4973.9. Samples: 640434344. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:25:57,336][25689] Avg episode reward: [(0, '-8.901')] [2022-07-10 07:25:57,412][26022] Updated weights on worker 0-0, policy_version 625431 (0.00089) [2022-07-10 07:25:59,070][26022] Updated weights on worker 0-0, policy_version 625441 (0.00087) [2022-07-10 07:26:01,017][26022] Updated weights on worker 0-0, policy_version 625451 (0.00085) [2022-07-10 07:26:02,384][25689] Fps is (10 sec: 5533.4, 60 sec: 5571.6, 300 sec: 5537.3). Total num frames: 640467968. Throughput: 0: 5801.7. Samples: 640467728. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:02,385][25689] Avg episode reward: [(0, '-9.078')] [2022-07-10 07:26:03,011][26022] Updated weights on worker 0-0, policy_version 625461 (0.00093) [2022-07-10 07:26:04,987][26022] Updated weights on worker 0-0, policy_version 625471 (0.00094) [2022-07-10 07:26:07,075][26022] Updated weights on worker 0-0, policy_version 625481 (0.00054) [2022-07-10 07:26:07,391][25689] Fps is (10 sec: 5295.1, 60 sec: 5537.6, 300 sec: 5534.2). Total num frames: 640493568. Throughput: 0: 5708.5. Samples: 640499036. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:07,392][25689] Avg episode reward: [(0, '-9.401')] [2022-07-10 07:26:08,686][26022] Updated weights on worker 0-0, policy_version 625491 (0.00086) [2022-07-10 07:26:10,715][26022] Updated weights on worker 0-0, policy_version 625501 (0.00091) [2022-07-10 07:26:12,023][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:26:12,030][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000625509_640521216.pth [2022-07-10 07:26:12,030][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000623561_638526464.pth [2022-07-10 07:26:12,321][26022] Updated weights on worker 0-0, policy_version 625511 (0.00086) [2022-07-10 07:26:12,394][25689] Fps is (10 sec: 5524.3, 60 sec: 5538.4, 300 sec: 5539.8). Total num frames: 640523264. Throughput: 0: 4860.6. Samples: 640515622. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:12,394][25689] Avg episode reward: [(0, '-9.592')] [2022-07-10 07:26:14,379][26022] Updated weights on worker 0-0, policy_version 625521 (0.00065) [2022-07-10 07:26:16,235][26022] Updated weights on worker 0-0, policy_version 625531 (0.00086) [2022-07-10 07:26:17,424][25689] Fps is (10 sec: 5613.4, 60 sec: 5525.9, 300 sec: 5533.8). Total num frames: 640549888. Throughput: 0: 5692.8. Samples: 640548708. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:17,426][25689] Avg episode reward: [(0, '-9.482')] [2022-07-10 07:26:17,860][26022] Updated weights on worker 0-0, policy_version 625541 (0.00096) [2022-07-10 07:26:20,153][26022] Updated weights on worker 0-0, policy_version 625551 (0.00091) [2022-07-10 07:26:21,619][26022] Updated weights on worker 0-0, policy_version 625561 (0.00083) [2022-07-10 07:26:22,517][25689] Fps is (10 sec: 5462.4, 60 sec: 5539.1, 300 sec: 5532.2). Total num frames: 640578560. Throughput: 0: 5678.4. Samples: 640582050. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:22,517][25689] Avg episode reward: [(0, '-7.791')] [2022-07-10 07:26:23,663][26022] Updated weights on worker 0-0, policy_version 625571 (0.00090) [2022-07-10 07:26:25,331][26022] Updated weights on worker 0-0, policy_version 625581 (0.00088) [2022-07-10 07:26:27,315][26022] Updated weights on worker 0-0, policy_version 625591 (0.00089) [2022-07-10 07:26:27,545][25689] Fps is (10 sec: 5564.8, 60 sec: 5520.4, 300 sec: 5532.2). Total num frames: 640606208. Throughput: 0: 5778.9. Samples: 640615504. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:27,546][25689] Avg episode reward: [(0, '-8.309')] [2022-07-10 07:26:29,288][26022] Updated weights on worker 0-0, policy_version 625601 (0.00085) [2022-07-10 07:26:30,895][26022] Updated weights on worker 0-0, policy_version 625611 (0.00087) [2022-07-10 07:26:32,632][25689] Fps is (10 sec: 5466.5, 60 sec: 5497.8, 300 sec: 5529.2). Total num frames: 640633856. Throughput: 0: 5753.9. Samples: 640632072. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:32,633][25689] Avg episode reward: [(0, '-7.742')] [2022-07-10 07:26:32,719][26022] Updated weights on worker 0-0, policy_version 625621 (0.00572) [2022-07-10 07:26:34,652][26022] Updated weights on worker 0-0, policy_version 625631 (0.00084) [2022-07-10 07:26:36,493][26022] Updated weights on worker 0-0, policy_version 625641 (0.00057) [2022-07-10 07:26:37,782][25689] Fps is (10 sec: 5501.4, 60 sec: 5507.7, 300 sec: 5533.5). Total num frames: 640662528. Throughput: 0: 5738.4. Samples: 640665530. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:37,783][25689] Avg episode reward: [(0, '-6.355')] [2022-07-10 07:26:38,481][26022] Updated weights on worker 0-0, policy_version 625651 (0.00096) [2022-07-10 07:26:40,158][26022] Updated weights on worker 0-0, policy_version 625661 (0.00086) [2022-07-10 07:26:41,942][26022] Updated weights on worker 0-0, policy_version 625671 (0.00092) [2022-07-10 07:26:42,816][25689] Fps is (10 sec: 5530.2, 60 sec: 5522.6, 300 sec: 5533.2). Total num frames: 640690176. Throughput: 0: 5747.0. Samples: 640698712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:42,818][25689] Avg episode reward: [(0, '-8.295')] [2022-07-10 07:26:43,990][26022] Updated weights on worker 0-0, policy_version 625681 (0.00082) [2022-07-10 07:26:45,699][26022] Updated weights on worker 0-0, policy_version 625691 (0.00093) [2022-07-10 07:26:47,583][26022] Updated weights on worker 0-0, policy_version 625701 (0.00086) [2022-07-10 07:26:47,912][25689] Fps is (10 sec: 5660.8, 60 sec: 5515.6, 300 sec: 5535.3). Total num frames: 640719872. Throughput: 0: 4901.1. Samples: 640715326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:47,913][25689] Avg episode reward: [(0, '-7.463')] [2022-07-10 07:26:49,488][26022] Updated weights on worker 0-0, policy_version 625711 (0.00107) [2022-07-10 07:26:51,139][26022] Updated weights on worker 0-0, policy_version 625721 (0.00087) [2022-07-10 07:26:52,929][25689] Fps is (10 sec: 5569.3, 60 sec: 5514.3, 300 sec: 5530.0). Total num frames: 640746496. Throughput: 0: 5757.9. Samples: 640748938. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:52,930][25689] Avg episode reward: [(0, '-7.018')] [2022-07-10 07:26:53,095][26022] Updated weights on worker 0-0, policy_version 625731 (0.00088) [2022-07-10 07:26:54,847][26022] Updated weights on worker 0-0, policy_version 625741 (0.00081) [2022-07-10 07:26:56,760][26022] Updated weights on worker 0-0, policy_version 625751 (0.00110) [2022-07-10 07:26:58,026][25689] Fps is (10 sec: 5568.6, 60 sec: 5534.1, 300 sec: 5535.6). Total num frames: 640776192. Throughput: 0: 5759.9. Samples: 640782132. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 07:26:58,027][25689] Avg episode reward: [(0, '-6.235')] [2022-07-10 07:26:58,751][26022] Updated weights on worker 0-0, policy_version 625761 (0.00098) [2022-07-10 07:27:00,393][26022] Updated weights on worker 0-0, policy_version 625771 (0.00092) [2022-07-10 07:27:02,819][26022] Updated weights on worker 0-0, policy_version 625781 (0.00086) [2022-07-10 07:27:03,076][25689] Fps is (10 sec: 5348.7, 60 sec: 5483.5, 300 sec: 5529.1). Total num frames: 640800768. Throughput: 0: 4948.1. Samples: 640798958. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:03,078][25689] Avg episode reward: [(0, '-6.087')] [2022-07-10 07:27:04,522][26022] Updated weights on worker 0-0, policy_version 625791 (0.00090) [2022-07-10 07:27:06,408][26022] Updated weights on worker 0-0, policy_version 625801 (0.00074) [2022-07-10 07:27:08,119][25689] Fps is (10 sec: 5276.1, 60 sec: 5530.8, 300 sec: 5531.8). Total num frames: 640829440. Throughput: 0: 5698.1. Samples: 640830464. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:08,119][25689] Avg episode reward: [(0, '-5.021')] [2022-07-10 07:27:08,163][26022] Updated weights on worker 0-0, policy_version 625811 (0.00087) [2022-07-10 07:27:09,946][26022] Updated weights on worker 0-0, policy_version 625821 (0.00091) [2022-07-10 07:27:11,844][26022] Updated weights on worker 0-0, policy_version 625831 (0.00089) [2022-07-10 07:27:13,167][25689] Fps is (10 sec: 5682.7, 60 sec: 5509.8, 300 sec: 5536.6). Total num frames: 640858112. Throughput: 0: 5680.1. Samples: 640863890. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:13,173][25689] Avg episode reward: [(0, '-6.440')] [2022-07-10 07:27:13,882][26022] Updated weights on worker 0-0, policy_version 625841 (0.00088) [2022-07-10 07:27:15,558][26022] Updated weights on worker 0-0, policy_version 625851 (0.00099) [2022-07-10 07:27:17,528][26022] Updated weights on worker 0-0, policy_version 625861 (0.00085) [2022-07-10 07:27:18,233][25689] Fps is (10 sec: 5669.4, 60 sec: 5540.2, 300 sec: 5539.1). Total num frames: 640886784. Throughput: 0: 4879.7. Samples: 640880736. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:18,234][25689] Avg episode reward: [(0, '-7.220')] [2022-07-10 07:27:18,923][26022] Updated weights on worker 0-0, policy_version 625871 (0.00086) [2022-07-10 07:27:21,060][26022] Updated weights on worker 0-0, policy_version 625881 (0.00091) [2022-07-10 07:27:23,010][26022] Updated weights on worker 0-0, policy_version 625891 (0.00101) [2022-07-10 07:27:23,257][25689] Fps is (10 sec: 5479.9, 60 sec: 5512.8, 300 sec: 5531.8). Total num frames: 640913408. Throughput: 0: 5716.2. Samples: 640914320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:23,258][25689] Avg episode reward: [(0, '-8.608')] [2022-07-10 07:27:24,656][26022] Updated weights on worker 0-0, policy_version 625901 (0.00085) [2022-07-10 07:27:26,631][26022] Updated weights on worker 0-0, policy_version 625911 (0.00088) [2022-07-10 07:27:28,204][26022] Updated weights on worker 0-0, policy_version 625921 (0.00080) [2022-07-10 07:27:28,301][25689] Fps is (10 sec: 5594.3, 60 sec: 5545.1, 300 sec: 5541.7). Total num frames: 640943104. Throughput: 0: 5827.9. Samples: 640948084. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:28,301][25689] Avg episode reward: [(0, '-8.833')] [2022-07-10 07:27:30,224][26022] Updated weights on worker 0-0, policy_version 625931 (0.00084) [2022-07-10 07:27:32,039][26022] Updated weights on worker 0-0, policy_version 625941 (0.00050) [2022-07-10 07:27:33,326][25689] Fps is (10 sec: 5695.4, 60 sec: 5550.8, 300 sec: 5535.3). Total num frames: 640970752. Throughput: 0: 5012.4. Samples: 640964936. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:33,326][25689] Avg episode reward: [(0, '-11.637')] [2022-07-10 07:27:33,855][26022] Updated weights on worker 0-0, policy_version 625951 (0.00094) [2022-07-10 07:27:35,677][26022] Updated weights on worker 0-0, policy_version 625961 (0.00091) [2022-07-10 07:27:37,708][26022] Updated weights on worker 0-0, policy_version 625971 (0.00090) [2022-07-10 07:27:38,485][25689] Fps is (10 sec: 5429.6, 60 sec: 5533.1, 300 sec: 5533.3). Total num frames: 640998400. Throughput: 0: 5799.6. Samples: 640998186. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:38,485][25689] Avg episode reward: [(0, '-12.891')] [2022-07-10 07:27:39,275][26022] Updated weights on worker 0-0, policy_version 625981 (0.00086) [2022-07-10 07:27:41,453][26022] Updated weights on worker 0-0, policy_version 625991 (0.00086) [2022-07-10 07:27:43,062][26022] Updated weights on worker 0-0, policy_version 626001 (0.00083) [2022-07-10 07:27:43,522][25689] Fps is (10 sec: 5523.5, 60 sec: 5549.7, 300 sec: 5536.6). Total num frames: 641027072. Throughput: 0: 5756.6. Samples: 641030976. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:43,523][25689] Avg episode reward: [(0, '-12.724')] [2022-07-10 07:27:45,116][26022] Updated weights on worker 0-0, policy_version 626011 (0.00089) [2022-07-10 07:27:46,759][26022] Updated weights on worker 0-0, policy_version 626021 (0.00083) [2022-07-10 07:27:48,533][25689] Fps is (10 sec: 5503.2, 60 sec: 5506.8, 300 sec: 5533.1). Total num frames: 641053696. Throughput: 0: 4923.7. Samples: 641047700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:48,533][25689] Avg episode reward: [(0, '-10.444')] [2022-07-10 07:27:48,795][26022] Updated weights on worker 0-0, policy_version 626031 (0.00086) [2022-07-10 07:27:50,449][26022] Updated weights on worker 0-0, policy_version 626041 (0.00082) [2022-07-10 07:27:52,472][26022] Updated weights on worker 0-0, policy_version 626051 (0.00097) [2022-07-10 07:27:53,561][25689] Fps is (10 sec: 5508.2, 60 sec: 5539.5, 300 sec: 5530.1). Total num frames: 641082368. Throughput: 0: 5746.9. Samples: 641081226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:53,562][25689] Avg episode reward: [(0, '-9.812')] [2022-07-10 07:27:54,136][26022] Updated weights on worker 0-0, policy_version 626061 (0.00104) [2022-07-10 07:27:56,044][26022] Updated weights on worker 0-0, policy_version 626071 (0.00102) [2022-07-10 07:27:58,040][26022] Updated weights on worker 0-0, policy_version 626081 (0.00086) [2022-07-10 07:27:58,662][25689] Fps is (10 sec: 5459.2, 60 sec: 5488.5, 300 sec: 5529.0). Total num frames: 641108992. Throughput: 0: 5756.8. Samples: 641114340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:27:58,662][25689] Avg episode reward: [(0, '-9.064')] [2022-07-10 07:27:59,708][26022] Updated weights on worker 0-0, policy_version 626091 (0.00101) [2022-07-10 07:28:01,704][26022] Updated weights on worker 0-0, policy_version 626101 (0.00089) [2022-07-10 07:28:03,602][26022] Updated weights on worker 0-0, policy_version 626111 (0.00085) [2022-07-10 07:28:03,667][25689] Fps is (10 sec: 5471.4, 60 sec: 5560.1, 300 sec: 5536.2). Total num frames: 641137664. Throughput: 0: 5700.2. Samples: 641145808. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:03,668][25689] Avg episode reward: [(0, '-7.272')] [2022-07-10 07:28:05,684][26022] Updated weights on worker 0-0, policy_version 626121 (0.00079) [2022-07-10 07:28:07,471][26022] Updated weights on worker 0-0, policy_version 626131 (0.00093) [2022-07-10 07:28:08,679][25689] Fps is (10 sec: 5519.9, 60 sec: 5529.1, 300 sec: 5529.5). Total num frames: 641164288. Throughput: 0: 5706.1. Samples: 641162658. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:08,680][25689] Avg episode reward: [(0, '-5.832')] [2022-07-10 07:28:09,362][26022] Updated weights on worker 0-0, policy_version 626141 (0.00097) [2022-07-10 07:28:11,206][26022] Updated weights on worker 0-0, policy_version 626151 (0.00085) [2022-07-10 07:28:12,257][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:28:12,270][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000626157_641184768.pth [2022-07-10 07:28:12,271][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000624209_639190016.pth [2022-07-10 07:28:13,203][26022] Updated weights on worker 0-0, policy_version 626161 (0.00085) [2022-07-10 07:28:13,705][25689] Fps is (10 sec: 5406.8, 60 sec: 5514.2, 300 sec: 5533.3). Total num frames: 641191936. Throughput: 0: 5694.3. Samples: 641195932. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:13,706][25689] Avg episode reward: [(0, '-4.615')] [2022-07-10 07:28:14,664][26022] Updated weights on worker 0-0, policy_version 626171 (0.00088) [2022-07-10 07:28:16,807][26022] Updated weights on worker 0-0, policy_version 626181 (0.00085) [2022-07-10 07:28:18,445][26022] Updated weights on worker 0-0, policy_version 626191 (0.00059) [2022-07-10 07:28:18,831][25689] Fps is (10 sec: 5547.9, 60 sec: 5508.9, 300 sec: 5531.0). Total num frames: 641220608. Throughput: 0: 5702.9. Samples: 641229362. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:18,831][25689] Avg episode reward: [(0, '-5.123')] [2022-07-10 07:28:20,381][26022] Updated weights on worker 0-0, policy_version 626201 (0.00098) [2022-07-10 07:28:22,117][26022] Updated weights on worker 0-0, policy_version 626211 (0.00085) [2022-07-10 07:28:23,918][25689] Fps is (10 sec: 5715.3, 60 sec: 5553.8, 300 sec: 5536.4). Total num frames: 641250304. Throughput: 0: 4960.4. Samples: 641246258. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:23,919][25689] Avg episode reward: [(0, '-5.471')] [2022-07-10 07:28:23,931][26022] Updated weights on worker 0-0, policy_version 626221 (0.00096) [2022-07-10 07:28:25,767][26022] Updated weights on worker 0-0, policy_version 626231 (0.00093) [2022-07-10 07:28:27,723][26022] Updated weights on worker 0-0, policy_version 626241 (0.00086) [2022-07-10 07:28:28,946][25689] Fps is (10 sec: 5567.7, 60 sec: 5504.5, 300 sec: 5529.4). Total num frames: 641276928. Throughput: 0: 5777.6. Samples: 641279752. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:28,947][25689] Avg episode reward: [(0, '-7.919')] [2022-07-10 07:28:29,576][26022] Updated weights on worker 0-0, policy_version 626251 (0.00088) [2022-07-10 07:28:31,493][26022] Updated weights on worker 0-0, policy_version 626261 (0.00083) [2022-07-10 07:28:33,156][26022] Updated weights on worker 0-0, policy_version 626271 (0.00091) [2022-07-10 07:28:33,962][25689] Fps is (10 sec: 5403.3, 60 sec: 5505.4, 300 sec: 5530.4). Total num frames: 641304576. Throughput: 0: 5789.1. Samples: 641313200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:33,962][25689] Avg episode reward: [(0, '-6.913')] [2022-07-10 07:28:35,119][26022] Updated weights on worker 0-0, policy_version 626281 (0.00094) [2022-07-10 07:28:37,027][26022] Updated weights on worker 0-0, policy_version 626291 (0.00086) [2022-07-10 07:28:38,723][26022] Updated weights on worker 0-0, policy_version 626301 (0.00093) [2022-07-10 07:28:39,072][25689] Fps is (10 sec: 5562.0, 60 sec: 5526.7, 300 sec: 5532.2). Total num frames: 641333248. Throughput: 0: 4959.5. Samples: 641329748. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:39,073][25689] Avg episode reward: [(0, '-7.589')] [2022-07-10 07:28:40,674][26022] Updated weights on worker 0-0, policy_version 626311 (0.00087) [2022-07-10 07:28:42,608][26022] Updated weights on worker 0-0, policy_version 626321 (0.00088) [2022-07-10 07:28:44,100][25689] Fps is (10 sec: 5656.6, 60 sec: 5527.6, 300 sec: 5531.8). Total num frames: 641361920. Throughput: 0: 5815.8. Samples: 641363632. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:44,100][25689] Avg episode reward: [(0, '-9.751')] [2022-07-10 07:28:44,233][26022] Updated weights on worker 0-0, policy_version 626331 (0.00093) [2022-07-10 07:28:46,300][26022] Updated weights on worker 0-0, policy_version 626341 (0.00082) [2022-07-10 07:28:47,892][26022] Updated weights on worker 0-0, policy_version 626351 (0.00095) [2022-07-10 07:28:49,117][25689] Fps is (10 sec: 5402.9, 60 sec: 5510.1, 300 sec: 5528.2). Total num frames: 641387520. Throughput: 0: 5814.9. Samples: 641397044. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:49,118][25689] Avg episode reward: [(0, '-10.697')] [2022-07-10 07:28:49,791][26022] Updated weights on worker 0-0, policy_version 626361 (0.00092) [2022-07-10 07:28:51,731][26022] Updated weights on worker 0-0, policy_version 626371 (0.00092) [2022-07-10 07:28:53,379][26022] Updated weights on worker 0-0, policy_version 626381 (0.00094) [2022-07-10 07:28:54,140][25689] Fps is (10 sec: 5507.5, 60 sec: 5527.5, 300 sec: 5532.9). Total num frames: 641417216. Throughput: 0: 4967.6. Samples: 641413436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:54,141][25689] Avg episode reward: [(0, '-10.741')] [2022-07-10 07:28:55,352][26022] Updated weights on worker 0-0, policy_version 626391 (0.00086) [2022-07-10 07:28:57,204][26022] Updated weights on worker 0-0, policy_version 626401 (0.00095) [2022-07-10 07:28:58,983][26022] Updated weights on worker 0-0, policy_version 626411 (0.00086) [2022-07-10 07:28:59,206][25689] Fps is (10 sec: 5785.1, 60 sec: 5564.4, 300 sec: 5536.3). Total num frames: 641445888. Throughput: 0: 5809.5. Samples: 641446718. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:28:59,207][25689] Avg episode reward: [(0, '-10.488')] [2022-07-10 07:29:00,863][26022] Updated weights on worker 0-0, policy_version 626421 (0.00090) [2022-07-10 07:29:02,999][26022] Updated weights on worker 0-0, policy_version 626431 (0.00080) [2022-07-10 07:29:04,248][25689] Fps is (10 sec: 5267.8, 60 sec: 5493.5, 300 sec: 5532.2). Total num frames: 641470464. Throughput: 0: 5690.5. Samples: 641478286. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:04,249][25689] Avg episode reward: [(0, '-9.705')] [2022-07-10 07:29:05,015][26022] Updated weights on worker 0-0, policy_version 626441 (0.00115) [2022-07-10 07:29:06,576][26022] Updated weights on worker 0-0, policy_version 626451 (0.00087) [2022-07-10 07:29:08,823][26022] Updated weights on worker 0-0, policy_version 626461 (0.00088) [2022-07-10 07:29:09,255][25689] Fps is (10 sec: 5400.9, 60 sec: 5544.6, 300 sec: 5532.1). Total num frames: 641500160. Throughput: 0: 4866.2. Samples: 641495040. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:09,256][25689] Avg episode reward: [(0, '-10.150')] [2022-07-10 07:29:10,409][26022] Updated weights on worker 0-0, policy_version 626471 (0.00085) [2022-07-10 07:29:12,525][26022] Updated weights on worker 0-0, policy_version 626481 (0.00092) [2022-07-10 07:29:14,252][26022] Updated weights on worker 0-0, policy_version 626491 (0.00091) [2022-07-10 07:29:14,313][25689] Fps is (10 sec: 5595.6, 60 sec: 5524.8, 300 sec: 5532.0). Total num frames: 641526784. Throughput: 0: 5681.7. Samples: 641528054. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:14,313][25689] Avg episode reward: [(0, '-7.722')] [2022-07-10 07:29:15,987][26022] Updated weights on worker 0-0, policy_version 626501 (0.00089) [2022-07-10 07:29:18,049][26022] Updated weights on worker 0-0, policy_version 626511 (0.00086) [2022-07-10 07:29:19,458][25689] Fps is (10 sec: 5419.5, 60 sec: 5523.0, 300 sec: 5529.5). Total num frames: 641555456. Throughput: 0: 5648.3. Samples: 641561106. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:19,459][25689] Avg episode reward: [(0, '-7.468')] [2022-07-10 07:29:19,759][26022] Updated weights on worker 0-0, policy_version 626521 (0.00082) [2022-07-10 07:29:21,765][26022] Updated weights on worker 0-0, policy_version 626531 (0.00092) [2022-07-10 07:29:23,287][26022] Updated weights on worker 0-0, policy_version 626541 (0.00089) [2022-07-10 07:29:24,481][25689] Fps is (10 sec: 5538.8, 60 sec: 5495.1, 300 sec: 5532.6). Total num frames: 641583104. Throughput: 0: 4921.0. Samples: 641577854. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:24,482][25689] Avg episode reward: [(0, '-7.846')] [2022-07-10 07:29:25,199][26022] Updated weights on worker 0-0, policy_version 626551 (0.00051) [2022-07-10 07:29:27,108][26022] Updated weights on worker 0-0, policy_version 626561 (0.00094) [2022-07-10 07:29:28,807][26022] Updated weights on worker 0-0, policy_version 626571 (0.00086) [2022-07-10 07:29:29,484][25689] Fps is (10 sec: 5617.8, 60 sec: 5531.3, 300 sec: 5532.8). Total num frames: 641611776. Throughput: 0: 5755.3. Samples: 641611462. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:29,484][25689] Avg episode reward: [(0, '-7.910')] [2022-07-10 07:29:30,794][26022] Updated weights on worker 0-0, policy_version 626581 (0.00084) [2022-07-10 07:29:32,727][26022] Updated weights on worker 0-0, policy_version 626591 (0.00092) [2022-07-10 07:29:34,430][26022] Updated weights on worker 0-0, policy_version 626601 (0.00100) [2022-07-10 07:29:34,525][25689] Fps is (10 sec: 5607.7, 60 sec: 5528.9, 300 sec: 5531.1). Total num frames: 641639424. Throughput: 0: 5774.2. Samples: 641644760. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:34,525][25689] Avg episode reward: [(0, '-8.418')] [2022-07-10 07:29:36,599][26022] Updated weights on worker 0-0, policy_version 626611 (0.00087) [2022-07-10 07:29:38,144][26022] Updated weights on worker 0-0, policy_version 626621 (0.00090) [2022-07-10 07:29:39,615][25689] Fps is (10 sec: 5356.8, 60 sec: 5496.9, 300 sec: 5529.6). Total num frames: 641666048. Throughput: 0: 4974.6. Samples: 641661378. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:39,616][25689] Avg episode reward: [(0, '-5.751')] [2022-07-10 07:29:40,114][26022] Updated weights on worker 0-0, policy_version 626631 (0.00079) [2022-07-10 07:29:41,898][26022] Updated weights on worker 0-0, policy_version 626641 (0.00084) [2022-07-10 07:29:43,737][26022] Updated weights on worker 0-0, policy_version 626651 (0.00094) [2022-07-10 07:29:44,652][25689] Fps is (10 sec: 5459.8, 60 sec: 5496.0, 300 sec: 5523.5). Total num frames: 641694720. Throughput: 0: 5796.6. Samples: 641694778. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:44,653][25689] Avg episode reward: [(0, '-6.971')] [2022-07-10 07:29:45,610][26022] Updated weights on worker 0-0, policy_version 626661 (0.00088) [2022-07-10 07:29:47,429][26022] Updated weights on worker 0-0, policy_version 626671 (0.00085) [2022-07-10 07:29:49,279][26022] Updated weights on worker 0-0, policy_version 626681 (0.00091) [2022-07-10 07:29:49,743][25689] Fps is (10 sec: 5763.3, 60 sec: 5557.0, 300 sec: 5533.2). Total num frames: 641724416. Throughput: 0: 5743.0. Samples: 641727810. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:49,744][25689] Avg episode reward: [(0, '-7.379')] [2022-07-10 07:29:51,334][26022] Updated weights on worker 0-0, policy_version 626691 (0.00083) [2022-07-10 07:29:52,639][26022] Updated weights on worker 0-0, policy_version 626701 (0.00096) [2022-07-10 07:29:54,812][25689] Fps is (10 sec: 5442.6, 60 sec: 5485.2, 300 sec: 5516.2). Total num frames: 641750016. Throughput: 0: 4921.7. Samples: 641744610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:54,813][25689] Avg episode reward: [(0, '-8.509')] [2022-07-10 07:29:54,986][26022] Updated weights on worker 0-0, policy_version 626711 (0.00083) [2022-07-10 07:29:56,619][26022] Updated weights on worker 0-0, policy_version 626721 (0.00089) [2022-07-10 07:29:58,640][26022] Updated weights on worker 0-0, policy_version 626731 (0.00089) [2022-07-10 07:29:59,916][25689] Fps is (10 sec: 5435.2, 60 sec: 5498.7, 300 sec: 5532.4). Total num frames: 641779712. Throughput: 0: 5751.1. Samples: 641778132. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:29:59,917][25689] Avg episode reward: [(0, '-7.140')] [2022-07-10 07:30:00,211][26022] Updated weights on worker 0-0, policy_version 626741 (0.00090) [2022-07-10 07:30:02,675][26022] Updated weights on worker 0-0, policy_version 626751 (0.00088) [2022-07-10 07:30:04,337][26022] Updated weights on worker 0-0, policy_version 626761 (0.00096) [2022-07-10 07:30:04,955][25689] Fps is (10 sec: 5552.9, 60 sec: 5532.7, 300 sec: 5528.3). Total num frames: 641806336. Throughput: 0: 5658.9. Samples: 641809666. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 07:30:04,955][25689] Avg episode reward: [(0, '-7.655')] [2022-07-10 07:30:06,163][26022] Updated weights on worker 0-0, policy_version 626771 (0.00087) [2022-07-10 07:30:07,972][26022] Updated weights on worker 0-0, policy_version 626781 (0.00081) [2022-07-10 07:30:09,872][26022] Updated weights on worker 0-0, policy_version 626791 (0.00095) [2022-07-10 07:30:09,977][25689] Fps is (10 sec: 5394.3, 60 sec: 5497.6, 300 sec: 5521.2). Total num frames: 641833984. Throughput: 0: 5718.6. Samples: 641843524. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:09,978][25689] Avg episode reward: [(0, '-6.461')] [2022-07-10 07:30:11,551][26022] Updated weights on worker 0-0, policy_version 626801 (0.00088) [2022-07-10 07:30:12,455][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:30:12,465][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000626805_641848320.pth [2022-07-10 07:30:12,466][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000624858_639854592.pth [2022-07-10 07:30:13,508][26022] Updated weights on worker 0-0, policy_version 626811 (0.00089) [2022-07-10 07:30:15,045][25689] Fps is (10 sec: 5683.1, 60 sec: 5547.2, 300 sec: 5528.3). Total num frames: 641863680. Throughput: 0: 5717.8. Samples: 641860298. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:15,046][25689] Avg episode reward: [(0, '-7.100')] [2022-07-10 07:30:15,211][26022] Updated weights on worker 0-0, policy_version 626821 (0.00087) [2022-07-10 07:30:17,196][26022] Updated weights on worker 0-0, policy_version 626831 (0.00086) [2022-07-10 07:30:19,128][26022] Updated weights on worker 0-0, policy_version 626841 (0.00090) [2022-07-10 07:30:20,093][25689] Fps is (10 sec: 5567.5, 60 sec: 5522.4, 300 sec: 5525.0). Total num frames: 641890304. Throughput: 0: 5714.9. Samples: 641893440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:20,094][25689] Avg episode reward: [(0, '-6.313')] [2022-07-10 07:30:20,798][26022] Updated weights on worker 0-0, policy_version 626851 (0.00084) [2022-07-10 07:30:22,779][26022] Updated weights on worker 0-0, policy_version 626861 (0.00086) [2022-07-10 07:30:24,583][26022] Updated weights on worker 0-0, policy_version 626871 (0.00085) [2022-07-10 07:30:25,109][25689] Fps is (10 sec: 5494.7, 60 sec: 5539.9, 300 sec: 5524.8). Total num frames: 641918976. Throughput: 0: 5829.6. Samples: 641927154. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:25,109][25689] Avg episode reward: [(0, '-5.634')] [2022-07-10 07:30:26,453][26022] Updated weights on worker 0-0, policy_version 626881 (0.00090) [2022-07-10 07:30:28,159][26022] Updated weights on worker 0-0, policy_version 626891 (0.00090) [2022-07-10 07:30:30,122][25689] Fps is (10 sec: 5615.6, 60 sec: 5522.0, 300 sec: 5521.6). Total num frames: 641946624. Throughput: 0: 4975.4. Samples: 641943754. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:30,123][25689] Avg episode reward: [(0, '-5.263')] [2022-07-10 07:30:30,125][26022] Updated weights on worker 0-0, policy_version 626901 (0.00086) [2022-07-10 07:30:31,707][26022] Updated weights on worker 0-0, policy_version 626911 (0.00100) [2022-07-10 07:30:33,696][26022] Updated weights on worker 0-0, policy_version 626921 (0.00081) [2022-07-10 07:30:35,124][25689] Fps is (10 sec: 5623.1, 60 sec: 5542.5, 300 sec: 5526.4). Total num frames: 641975296. Throughput: 0: 5827.6. Samples: 641977312. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:35,125][25689] Avg episode reward: [(0, '-7.745')] [2022-07-10 07:30:35,319][26022] Updated weights on worker 0-0, policy_version 626931 (0.00112) [2022-07-10 07:30:37,632][26022] Updated weights on worker 0-0, policy_version 626941 (0.00092) [2022-07-10 07:30:39,097][26022] Updated weights on worker 0-0, policy_version 626951 (0.00090) [2022-07-10 07:30:40,173][25689] Fps is (10 sec: 5501.7, 60 sec: 5546.3, 300 sec: 5525.7). Total num frames: 642001920. Throughput: 0: 5831.5. Samples: 642010536. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:40,175][25689] Avg episode reward: [(0, '-7.926')] [2022-07-10 07:30:41,244][26022] Updated weights on worker 0-0, policy_version 626961 (0.00092) [2022-07-10 07:30:42,817][26022] Updated weights on worker 0-0, policy_version 626971 (0.00088) [2022-07-10 07:30:44,822][26022] Updated weights on worker 0-0, policy_version 626981 (0.00083) [2022-07-10 07:30:45,198][25689] Fps is (10 sec: 5489.0, 60 sec: 5547.4, 300 sec: 5522.1). Total num frames: 642030592. Throughput: 0: 4985.6. Samples: 642027314. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:45,199][25689] Avg episode reward: [(0, '-8.733')] [2022-07-10 07:30:46,685][26022] Updated weights on worker 0-0, policy_version 626991 (0.00080) [2022-07-10 07:30:48,430][26022] Updated weights on worker 0-0, policy_version 627001 (0.00090) [2022-07-10 07:30:50,228][25689] Fps is (10 sec: 5601.5, 60 sec: 5519.1, 300 sec: 5525.1). Total num frames: 642058240. Throughput: 0: 5815.7. Samples: 642060680. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:50,229][25689] Avg episode reward: [(0, '-9.671')] [2022-07-10 07:30:50,390][26022] Updated weights on worker 0-0, policy_version 627011 (0.00090) [2022-07-10 07:30:52,156][26022] Updated weights on worker 0-0, policy_version 627021 (0.00088) [2022-07-10 07:30:54,047][26022] Updated weights on worker 0-0, policy_version 627031 (0.00082) [2022-07-10 07:30:55,266][25689] Fps is (10 sec: 5492.5, 60 sec: 5555.9, 300 sec: 5523.3). Total num frames: 642085888. Throughput: 0: 5808.0. Samples: 642094294. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:30:55,266][25689] Avg episode reward: [(0, '-10.118')] [2022-07-10 07:30:55,800][26022] Updated weights on worker 0-0, policy_version 627041 (0.00085) [2022-07-10 07:30:57,845][26022] Updated weights on worker 0-0, policy_version 627051 (0.00085) [2022-07-10 07:30:59,458][26022] Updated weights on worker 0-0, policy_version 627061 (0.00089) [2022-07-10 07:31:00,397][25689] Fps is (10 sec: 5538.0, 60 sec: 5536.4, 300 sec: 5525.2). Total num frames: 642114560. Throughput: 0: 4971.7. Samples: 642111086. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:00,398][25689] Avg episode reward: [(0, '-10.937')] [2022-07-10 07:31:01,400][26022] Updated weights on worker 0-0, policy_version 627071 (0.00084) [2022-07-10 07:31:03,636][26022] Updated weights on worker 0-0, policy_version 627081 (0.00088) [2022-07-10 07:31:05,209][26022] Updated weights on worker 0-0, policy_version 627091 (0.00298) [2022-07-10 07:31:05,468][25689] Fps is (10 sec: 5520.2, 60 sec: 5550.3, 300 sec: 5530.9). Total num frames: 642142208. Throughput: 0: 5691.7. Samples: 642142686. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:05,469][25689] Avg episode reward: [(0, '-11.120')] [2022-07-10 07:31:07,345][26022] Updated weights on worker 0-0, policy_version 627101 (0.00089) [2022-07-10 07:31:08,894][26022] Updated weights on worker 0-0, policy_version 627111 (0.00091) [2022-07-10 07:31:10,490][25689] Fps is (10 sec: 5377.5, 60 sec: 5533.5, 300 sec: 5520.3). Total num frames: 642168832. Throughput: 0: 5724.5. Samples: 642176672. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:10,490][25689] Avg episode reward: [(0, '-9.430')] [2022-07-10 07:31:10,932][26022] Updated weights on worker 0-0, policy_version 627121 (0.00087) [2022-07-10 07:31:12,510][26022] Updated weights on worker 0-0, policy_version 627131 (0.00087) [2022-07-10 07:31:14,356][26022] Updated weights on worker 0-0, policy_version 627141 (0.00509) [2022-07-10 07:31:15,508][25689] Fps is (10 sec: 5507.8, 60 sec: 5521.1, 300 sec: 5527.4). Total num frames: 642197504. Throughput: 0: 4905.2. Samples: 642193586. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:15,508][25689] Avg episode reward: [(0, '-9.585')] [2022-07-10 07:31:16,237][26022] Updated weights on worker 0-0, policy_version 627151 (0.00091) [2022-07-10 07:31:18,051][26022] Updated weights on worker 0-0, policy_version 627161 (0.00099) [2022-07-10 07:31:19,908][26022] Updated weights on worker 0-0, policy_version 627171 (0.00089) [2022-07-10 07:31:20,616][25689] Fps is (10 sec: 5764.0, 60 sec: 5566.4, 300 sec: 5530.5). Total num frames: 642227200. Throughput: 0: 5721.3. Samples: 642226764. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:20,618][25689] Avg episode reward: [(0, '-9.753')] [2022-07-10 07:31:21,961][26022] Updated weights on worker 0-0, policy_version 627181 (0.00088) [2022-07-10 07:31:23,547][26022] Updated weights on worker 0-0, policy_version 627191 (0.00091) [2022-07-10 07:31:25,491][26022] Updated weights on worker 0-0, policy_version 627201 (0.00085) [2022-07-10 07:31:25,686][25689] Fps is (10 sec: 5533.3, 60 sec: 5527.6, 300 sec: 5526.3). Total num frames: 642253824. Throughput: 0: 5827.7. Samples: 642260510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:25,688][25689] Avg episode reward: [(0, '-8.732')] [2022-07-10 07:31:27,286][26022] Updated weights on worker 0-0, policy_version 627211 (0.00089) [2022-07-10 07:31:29,108][26022] Updated weights on worker 0-0, policy_version 627221 (0.00089) [2022-07-10 07:31:30,762][25689] Fps is (10 sec: 5349.2, 60 sec: 5521.9, 300 sec: 5526.5). Total num frames: 642281472. Throughput: 0: 4955.2. Samples: 642277124. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:30,762][25689] Avg episode reward: [(0, '-7.003')] [2022-07-10 07:31:31,231][26022] Updated weights on worker 0-0, policy_version 627231 (0.00083) [2022-07-10 07:31:32,639][26022] Updated weights on worker 0-0, policy_version 627241 (0.00092) [2022-07-10 07:31:34,739][26022] Updated weights on worker 0-0, policy_version 627251 (0.00087) [2022-07-10 07:31:35,767][25689] Fps is (10 sec: 5586.9, 60 sec: 5521.6, 300 sec: 5529.2). Total num frames: 642310144. Throughput: 0: 5780.0. Samples: 642310684. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:35,768][25689] Avg episode reward: [(0, '-7.358')] [2022-07-10 07:31:36,495][26022] Updated weights on worker 0-0, policy_version 627261 (0.00088) [2022-07-10 07:31:38,434][26022] Updated weights on worker 0-0, policy_version 627271 (0.00107) [2022-07-10 07:31:40,302][26022] Updated weights on worker 0-0, policy_version 627281 (0.00083) [2022-07-10 07:31:40,813][25689] Fps is (10 sec: 5705.0, 60 sec: 5555.7, 300 sec: 5532.4). Total num frames: 642338816. Throughput: 0: 5806.5. Samples: 642344038. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:40,814][25689] Avg episode reward: [(0, '-7.509')] [2022-07-10 07:31:42,111][26022] Updated weights on worker 0-0, policy_version 627291 (0.00091) [2022-07-10 07:31:43,903][26022] Updated weights on worker 0-0, policy_version 627301 (0.00085) [2022-07-10 07:31:45,711][26022] Updated weights on worker 0-0, policy_version 627311 (0.00071) [2022-07-10 07:31:45,912][25689] Fps is (10 sec: 5551.6, 60 sec: 5532.1, 300 sec: 5525.5). Total num frames: 642366464. Throughput: 0: 4956.2. Samples: 642360752. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:45,914][25689] Avg episode reward: [(0, '-8.310')] [2022-07-10 07:31:47,503][26022] Updated weights on worker 0-0, policy_version 627321 (0.00090) [2022-07-10 07:31:49,431][26022] Updated weights on worker 0-0, policy_version 627331 (0.00089) [2022-07-10 07:31:50,943][25689] Fps is (10 sec: 5560.1, 60 sec: 5548.8, 300 sec: 5532.1). Total num frames: 642395136. Throughput: 0: 5820.5. Samples: 642394586. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:50,943][25689] Avg episode reward: [(0, '-9.435')] [2022-07-10 07:31:51,200][26022] Updated weights on worker 0-0, policy_version 627341 (0.00086) [2022-07-10 07:31:53,045][26022] Updated weights on worker 0-0, policy_version 627351 (0.01487) [2022-07-10 07:31:54,907][26022] Updated weights on worker 0-0, policy_version 627361 (0.00088) [2022-07-10 07:31:55,987][25689] Fps is (10 sec: 5590.4, 60 sec: 5548.3, 300 sec: 5526.2). Total num frames: 642422784. Throughput: 0: 5805.8. Samples: 642428072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:31:55,987][25689] Avg episode reward: [(0, '-9.863')] [2022-07-10 07:31:56,695][26022] Updated weights on worker 0-0, policy_version 627371 (0.00085) [2022-07-10 07:31:58,465][26022] Updated weights on worker 0-0, policy_version 627381 (0.00091) [2022-07-10 07:32:00,386][26022] Updated weights on worker 0-0, policy_version 627391 (0.00092) [2022-07-10 07:32:01,105][25689] Fps is (10 sec: 5541.9, 60 sec: 5549.5, 300 sec: 5538.7). Total num frames: 642451456. Throughput: 0: 4961.6. Samples: 642444718. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:01,106][25689] Avg episode reward: [(0, '-10.445')] [2022-07-10 07:32:02,717][26022] Updated weights on worker 0-0, policy_version 627401 (0.00054) [2022-07-10 07:32:04,521][26022] Updated weights on worker 0-0, policy_version 627411 (0.00084) [2022-07-10 07:32:06,181][25689] Fps is (10 sec: 5524.5, 60 sec: 5549.0, 300 sec: 5534.6). Total num frames: 642479104. Throughput: 0: 5691.7. Samples: 642476118. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:06,183][25689] Avg episode reward: [(0, '-9.698')] [2022-07-10 07:32:06,194][26022] Updated weights on worker 0-0, policy_version 627421 (0.00087) [2022-07-10 07:32:08,186][26022] Updated weights on worker 0-0, policy_version 627431 (0.00558) [2022-07-10 07:32:10,016][26022] Updated weights on worker 0-0, policy_version 627441 (0.00088) [2022-07-10 07:32:11,225][25689] Fps is (10 sec: 5464.0, 60 sec: 5563.8, 300 sec: 5531.3). Total num frames: 642506752. Throughput: 0: 5674.0. Samples: 642509670. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:11,226][25689] Avg episode reward: [(0, '-11.977')] [2022-07-10 07:32:11,768][26022] Updated weights on worker 0-0, policy_version 627451 (0.00091) [2022-07-10 07:32:12,468][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:32:12,480][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000627455_642513920.pth [2022-07-10 07:32:12,480][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000625509_640521216.pth [2022-07-10 07:32:13,591][26022] Updated weights on worker 0-0, policy_version 627461 (0.00083) [2022-07-10 07:32:15,446][26022] Updated weights on worker 0-0, policy_version 627471 (0.00087) [2022-07-10 07:32:16,242][25689] Fps is (10 sec: 5394.1, 60 sec: 5530.2, 300 sec: 5525.3). Total num frames: 642533376. Throughput: 0: 5698.2. Samples: 642543494. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:16,245][25689] Avg episode reward: [(0, '-8.924')] [2022-07-10 07:32:17,131][26022] Updated weights on worker 0-0, policy_version 627481 (0.00085) [2022-07-10 07:32:19,278][26022] Updated weights on worker 0-0, policy_version 627491 (0.00091) [2022-07-10 07:32:20,814][26022] Updated weights on worker 0-0, policy_version 627501 (0.00086) [2022-07-10 07:32:21,339][25689] Fps is (10 sec: 5568.9, 60 sec: 5531.3, 300 sec: 5534.3). Total num frames: 642563072. Throughput: 0: 5709.3. Samples: 642560236. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:21,339][25689] Avg episode reward: [(0, '-9.074')] [2022-07-10 07:32:22,932][26022] Updated weights on worker 0-0, policy_version 627511 (0.00090) [2022-07-10 07:32:24,545][26022] Updated weights on worker 0-0, policy_version 627521 (0.00090) [2022-07-10 07:32:26,391][25689] Fps is (10 sec: 5650.5, 60 sec: 5549.8, 300 sec: 5527.2). Total num frames: 642590720. Throughput: 0: 5823.2. Samples: 642593804. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:26,391][25689] Avg episode reward: [(0, '-8.606')] [2022-07-10 07:32:26,713][26022] Updated weights on worker 0-0, policy_version 627531 (0.00096) [2022-07-10 07:32:28,392][26022] Updated weights on worker 0-0, policy_version 627541 (0.00092) [2022-07-10 07:32:30,180][26022] Updated weights on worker 0-0, policy_version 627551 (0.00080) [2022-07-10 07:32:31,392][25689] Fps is (10 sec: 5500.1, 60 sec: 5556.6, 300 sec: 5527.7). Total num frames: 642618368. Throughput: 0: 5821.5. Samples: 642627072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:31,393][25689] Avg episode reward: [(0, '-9.044')] [2022-07-10 07:32:31,872][26022] Updated weights on worker 0-0, policy_version 627561 (0.00089) [2022-07-10 07:32:34,034][26022] Updated weights on worker 0-0, policy_version 627571 (0.00089) [2022-07-10 07:32:35,571][26022] Updated weights on worker 0-0, policy_version 627581 (0.00088) [2022-07-10 07:32:36,415][25689] Fps is (10 sec: 5618.3, 60 sec: 5554.9, 300 sec: 5533.6). Total num frames: 642647040. Throughput: 0: 4969.3. Samples: 642643738. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:36,416][25689] Avg episode reward: [(0, '-8.729')] [2022-07-10 07:32:37,693][26022] Updated weights on worker 0-0, policy_version 627591 (0.00082) [2022-07-10 07:32:39,372][26022] Updated weights on worker 0-0, policy_version 627601 (0.00092) [2022-07-10 07:32:41,294][26022] Updated weights on worker 0-0, policy_version 627611 (0.00086) [2022-07-10 07:32:41,474][25689] Fps is (10 sec: 5484.6, 60 sec: 5520.0, 300 sec: 5526.4). Total num frames: 642673664. Throughput: 0: 5797.1. Samples: 642676964. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:41,475][25689] Avg episode reward: [(0, '-7.957')] [2022-07-10 07:32:43,075][26022] Updated weights on worker 0-0, policy_version 627621 (0.00082) [2022-07-10 07:32:45,057][26022] Updated weights on worker 0-0, policy_version 627631 (0.00090) [2022-07-10 07:32:46,477][25689] Fps is (10 sec: 5597.2, 60 sec: 5562.5, 300 sec: 5536.8). Total num frames: 642703360. Throughput: 0: 5816.8. Samples: 642710644. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:46,478][25689] Avg episode reward: [(0, '-10.188')] [2022-07-10 07:32:46,569][26022] Updated weights on worker 0-0, policy_version 627641 (0.00086) [2022-07-10 07:32:48,643][26022] Updated weights on worker 0-0, policy_version 627651 (0.00092) [2022-07-10 07:32:50,389][26022] Updated weights on worker 0-0, policy_version 627661 (0.00082) [2022-07-10 07:32:51,521][25689] Fps is (10 sec: 5708.0, 60 sec: 5544.5, 300 sec: 5533.1). Total num frames: 642731008. Throughput: 0: 4994.3. Samples: 642727598. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:51,522][25689] Avg episode reward: [(0, '-10.456')] [2022-07-10 07:32:52,197][26022] Updated weights on worker 0-0, policy_version 627671 (0.00091) [2022-07-10 07:32:54,026][26022] Updated weights on worker 0-0, policy_version 627681 (0.00088) [2022-07-10 07:32:55,894][26022] Updated weights on worker 0-0, policy_version 627691 (0.00087) [2022-07-10 07:32:56,548][25689] Fps is (10 sec: 5389.3, 60 sec: 5529.1, 300 sec: 5534.5). Total num frames: 642757632. Throughput: 0: 5830.6. Samples: 642761122. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:32:56,549][25689] Avg episode reward: [(0, '-12.089')] [2022-07-10 07:32:57,740][26022] Updated weights on worker 0-0, policy_version 627701 (0.00077) [2022-07-10 07:32:59,815][26022] Updated weights on worker 0-0, policy_version 627711 (0.00096) [2022-07-10 07:33:01,348][26022] Updated weights on worker 0-0, policy_version 627721 (0.00092) [2022-07-10 07:33:01,587][25689] Fps is (10 sec: 5595.0, 60 sec: 5553.3, 300 sec: 5537.3). Total num frames: 642787328. Throughput: 0: 5823.9. Samples: 642794096. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:33:01,587][25689] Avg episode reward: [(0, '-11.294')] [2022-07-10 07:33:03,767][26022] Updated weights on worker 0-0, policy_version 627731 (0.00091) [2022-07-10 07:33:05,441][26022] Updated weights on worker 0-0, policy_version 627741 (0.00089) [2022-07-10 07:33:06,591][25689] Fps is (10 sec: 5404.1, 60 sec: 5509.1, 300 sec: 5530.6). Total num frames: 642811904. Throughput: 0: 4883.3. Samples: 642808862. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:33:06,593][25689] Avg episode reward: [(0, '-11.738')] [2022-07-10 07:33:07,265][26022] Updated weights on worker 0-0, policy_version 627751 (0.00085) [2022-07-10 07:33:09,239][26022] Updated weights on worker 0-0, policy_version 627761 (0.00080) [2022-07-10 07:33:11,127][26022] Updated weights on worker 0-0, policy_version 627771 (0.00085) [2022-07-10 07:33:11,599][25689] Fps is (10 sec: 5216.2, 60 sec: 5512.4, 300 sec: 5530.9). Total num frames: 642839552. Throughput: 0: 5714.7. Samples: 642842338. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:33:11,599][25689] Avg episode reward: [(0, '-10.692')] [2022-07-10 07:33:12,748][26022] Updated weights on worker 0-0, policy_version 627781 (0.00086) [2022-07-10 07:33:14,858][26022] Updated weights on worker 0-0, policy_version 627791 (0.00089) [2022-07-10 07:33:16,303][26022] Updated weights on worker 0-0, policy_version 627801 (0.00100) [2022-07-10 07:33:16,610][25689] Fps is (10 sec: 5723.4, 60 sec: 5563.8, 300 sec: 5536.5). Total num frames: 642869248. Throughput: 0: 5719.1. Samples: 642875858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 07:33:16,612][25689] Avg episode reward: [(0, '-10.306')] [2022-07-10 07:33:18,531][26022] Updated weights on worker 0-0, policy_version 627811 (0.00086) [2022-07-10 07:33:20,262][26022] Updated weights on worker 0-0, policy_version 627821 (0.00089) [2022-07-10 07:33:21,679][25689] Fps is (10 sec: 5485.2, 60 sec: 5498.4, 300 sec: 5523.0). Total num frames: 642894848. Throughput: 0: 4899.6. Samples: 642892542. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:21,681][25689] Avg episode reward: [(0, '-9.055')] [2022-07-10 07:33:21,965][26022] Updated weights on worker 0-0, policy_version 627831 (0.00090) [2022-07-10 07:33:24,098][26022] Updated weights on worker 0-0, policy_version 627841 (0.00086) [2022-07-10 07:33:25,720][26022] Updated weights on worker 0-0, policy_version 627851 (0.00087) [2022-07-10 07:33:26,686][25689] Fps is (10 sec: 5386.3, 60 sec: 5519.5, 300 sec: 5530.3). Total num frames: 642923520. Throughput: 0: 5813.4. Samples: 642925682. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:26,686][25689] Avg episode reward: [(0, '-8.563')] [2022-07-10 07:33:27,719][26022] Updated weights on worker 0-0, policy_version 627861 (0.00086) [2022-07-10 07:33:29,568][26022] Updated weights on worker 0-0, policy_version 627871 (0.00088) [2022-07-10 07:33:31,276][26022] Updated weights on worker 0-0, policy_version 627881 (0.00093) [2022-07-10 07:33:31,692][25689] Fps is (10 sec: 5727.4, 60 sec: 5536.1, 300 sec: 5533.9). Total num frames: 642952192. Throughput: 0: 5828.4. Samples: 642959450. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:31,692][25689] Avg episode reward: [(0, '-7.956')] [2022-07-10 07:33:33,030][26022] Updated weights on worker 0-0, policy_version 627891 (0.00083) [2022-07-10 07:33:35,024][26022] Updated weights on worker 0-0, policy_version 627901 (0.00092) [2022-07-10 07:33:36,559][26022] Updated weights on worker 0-0, policy_version 627911 (0.00097) [2022-07-10 07:33:36,724][25689] Fps is (10 sec: 5712.6, 60 sec: 5535.3, 300 sec: 5535.4). Total num frames: 642980864. Throughput: 0: 4993.7. Samples: 642976302. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:36,724][25689] Avg episode reward: [(0, '-6.363')] [2022-07-10 07:33:38,688][26022] Updated weights on worker 0-0, policy_version 627921 (0.00090) [2022-07-10 07:33:40,531][26022] Updated weights on worker 0-0, policy_version 627931 (0.00088) [2022-07-10 07:33:41,779][25689] Fps is (10 sec: 5583.5, 60 sec: 5552.6, 300 sec: 5531.4). Total num frames: 643008512. Throughput: 0: 5833.5. Samples: 643009790. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:41,779][25689] Avg episode reward: [(0, '-6.992')] [2022-07-10 07:33:42,171][26022] Updated weights on worker 0-0, policy_version 627941 (0.00088) [2022-07-10 07:33:44,402][26022] Updated weights on worker 0-0, policy_version 627951 (0.00085) [2022-07-10 07:33:45,648][26022] Updated weights on worker 0-0, policy_version 627961 (0.00090) [2022-07-10 07:33:46,801][25689] Fps is (10 sec: 5588.6, 60 sec: 5533.9, 300 sec: 5541.7). Total num frames: 643037184. Throughput: 0: 5849.6. Samples: 643043350. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:46,802][25689] Avg episode reward: [(0, '-6.391')] [2022-07-10 07:33:47,906][26022] Updated weights on worker 0-0, policy_version 627971 (0.00087) [2022-07-10 07:33:49,365][26022] Updated weights on worker 0-0, policy_version 627981 (0.00088) [2022-07-10 07:33:51,605][26022] Updated weights on worker 0-0, policy_version 627991 (0.00084) [2022-07-10 07:33:51,853][25689] Fps is (10 sec: 5488.6, 60 sec: 5516.1, 300 sec: 5530.8). Total num frames: 643063808. Throughput: 0: 4990.7. Samples: 643060070. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:51,854][25689] Avg episode reward: [(0, '-8.111')] [2022-07-10 07:33:53,499][26022] Updated weights on worker 0-0, policy_version 628001 (0.00087) [2022-07-10 07:33:55,113][26022] Updated weights on worker 0-0, policy_version 628011 (0.00083) [2022-07-10 07:33:56,901][25689] Fps is (10 sec: 5373.4, 60 sec: 5531.1, 300 sec: 5527.7). Total num frames: 643091456. Throughput: 0: 5785.3. Samples: 643093036. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:33:56,902][25689] Avg episode reward: [(0, '-6.811')] [2022-07-10 07:33:57,113][26022] Updated weights on worker 0-0, policy_version 628021 (0.00087) [2022-07-10 07:33:58,892][26022] Updated weights on worker 0-0, policy_version 628031 (0.00086) [2022-07-10 07:34:00,688][26022] Updated weights on worker 0-0, policy_version 628041 (0.00089) [2022-07-10 07:34:01,971][25689] Fps is (10 sec: 5566.7, 60 sec: 5511.4, 300 sec: 5540.9). Total num frames: 643120128. Throughput: 0: 5778.0. Samples: 643126460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:01,971][25689] Avg episode reward: [(0, '-7.297')] [2022-07-10 07:34:03,123][26022] Updated weights on worker 0-0, policy_version 628051 (0.00082) [2022-07-10 07:34:04,596][26022] Updated weights on worker 0-0, policy_version 628061 (0.00088) [2022-07-10 07:34:06,482][26022] Updated weights on worker 0-0, policy_version 628071 (0.00893) [2022-07-10 07:34:07,040][25689] Fps is (10 sec: 5454.2, 60 sec: 5539.3, 300 sec: 5529.5). Total num frames: 643146752. Throughput: 0: 4841.5. Samples: 643141334. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:07,041][25689] Avg episode reward: [(0, '-8.887')] [2022-07-10 07:34:08,436][26022] Updated weights on worker 0-0, policy_version 628081 (0.00092) [2022-07-10 07:34:10,289][26022] Updated weights on worker 0-0, policy_version 628091 (0.00083) [2022-07-10 07:34:12,073][25689] Fps is (10 sec: 5473.6, 60 sec: 5554.0, 300 sec: 5536.8). Total num frames: 643175424. Throughput: 0: 5672.4. Samples: 643174764. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:12,075][25689] Avg episode reward: [(0, '-8.555')] [2022-07-10 07:34:12,078][26022] Updated weights on worker 0-0, policy_version 628101 (0.00097) [2022-07-10 07:34:12,561][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:34:12,584][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000628104_643178496.pth [2022-07-10 07:34:12,584][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000626157_641184768.pth [2022-07-10 07:34:13,798][26022] Updated weights on worker 0-0, policy_version 628111 (0.00084) [2022-07-10 07:34:15,756][26022] Updated weights on worker 0-0, policy_version 628121 (0.00085) [2022-07-10 07:34:17,164][25689] Fps is (10 sec: 5664.1, 60 sec: 5529.7, 300 sec: 5537.8). Total num frames: 643204096. Throughput: 0: 5693.0. Samples: 643208390. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:17,166][25689] Avg episode reward: [(0, '-6.579')] [2022-07-10 07:34:17,722][26022] Updated weights on worker 0-0, policy_version 628131 (0.00084) [2022-07-10 07:34:19,384][26022] Updated weights on worker 0-0, policy_version 628141 (0.00093) [2022-07-10 07:34:21,559][26022] Updated weights on worker 0-0, policy_version 628151 (0.00095) [2022-07-10 07:34:22,258][25689] Fps is (10 sec: 5328.5, 60 sec: 5527.5, 300 sec: 5529.6). Total num frames: 643229696. Throughput: 0: 4869.9. Samples: 643225258. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:22,259][25689] Avg episode reward: [(0, '-7.126')] [2022-07-10 07:34:23,002][26022] Updated weights on worker 0-0, policy_version 628161 (0.00098) [2022-07-10 07:34:25,086][26022] Updated weights on worker 0-0, policy_version 628171 (0.00093) [2022-07-10 07:34:26,717][26022] Updated weights on worker 0-0, policy_version 628181 (0.00084) [2022-07-10 07:34:27,310][25689] Fps is (10 sec: 5349.3, 60 sec: 5523.4, 300 sec: 5528.7). Total num frames: 643258368. Throughput: 0: 5772.3. Samples: 643258336. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:27,310][25689] Avg episode reward: [(0, '-6.679')] [2022-07-10 07:34:28,667][26022] Updated weights on worker 0-0, policy_version 628191 (0.00096) [2022-07-10 07:34:30,608][26022] Updated weights on worker 0-0, policy_version 628201 (0.00087) [2022-07-10 07:34:32,221][26022] Updated weights on worker 0-0, policy_version 628211 (0.00084) [2022-07-10 07:34:32,346][25689] Fps is (10 sec: 5887.5, 60 sec: 5554.4, 300 sec: 5539.1). Total num frames: 643289088. Throughput: 0: 5771.1. Samples: 643291762. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:32,346][25689] Avg episode reward: [(0, '-7.224')] [2022-07-10 07:34:34,351][26022] Updated weights on worker 0-0, policy_version 628221 (0.00089) [2022-07-10 07:34:35,748][26022] Updated weights on worker 0-0, policy_version 628231 (0.00105) [2022-07-10 07:34:37,363][25689] Fps is (10 sec: 5602.2, 60 sec: 5505.1, 300 sec: 5537.0). Total num frames: 643314688. Throughput: 0: 4959.4. Samples: 643308564. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:37,365][25689] Avg episode reward: [(0, '-6.984')] [2022-07-10 07:34:38,030][26022] Updated weights on worker 0-0, policy_version 628241 (0.00092) [2022-07-10 07:34:39,867][26022] Updated weights on worker 0-0, policy_version 628251 (0.00086) [2022-07-10 07:34:41,618][26022] Updated weights on worker 0-0, policy_version 628261 (0.00085) [2022-07-10 07:34:42,407][25689] Fps is (10 sec: 5394.4, 60 sec: 5523.0, 300 sec: 5536.9). Total num frames: 643343360. Throughput: 0: 5779.7. Samples: 643341712. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:42,412][25689] Avg episode reward: [(0, '-6.778')] [2022-07-10 07:34:43,384][26022] Updated weights on worker 0-0, policy_version 628271 (0.00090) [2022-07-10 07:34:45,259][26022] Updated weights on worker 0-0, policy_version 628281 (0.00086) [2022-07-10 07:34:47,147][26022] Updated weights on worker 0-0, policy_version 628291 (0.00089) [2022-07-10 07:34:47,415][25689] Fps is (10 sec: 5602.7, 60 sec: 5507.4, 300 sec: 5531.6). Total num frames: 643371008. Throughput: 0: 5812.3. Samples: 643375196. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:47,415][25689] Avg episode reward: [(0, '-7.901')] [2022-07-10 07:34:49,101][26022] Updated weights on worker 0-0, policy_version 628301 (0.00089) [2022-07-10 07:34:50,867][26022] Updated weights on worker 0-0, policy_version 628311 (0.00083) [2022-07-10 07:34:52,427][25689] Fps is (10 sec: 5518.6, 60 sec: 5528.0, 300 sec: 5539.5). Total num frames: 643398656. Throughput: 0: 4975.7. Samples: 643391678. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:52,429][25689] Avg episode reward: [(0, '-6.929')] [2022-07-10 07:34:52,717][26022] Updated weights on worker 0-0, policy_version 628321 (0.00088) [2022-07-10 07:34:54,714][26022] Updated weights on worker 0-0, policy_version 628331 (0.00091) [2022-07-10 07:34:56,452][26022] Updated weights on worker 0-0, policy_version 628341 (0.00085) [2022-07-10 07:34:57,458][25689] Fps is (10 sec: 5506.1, 60 sec: 5529.5, 300 sec: 5534.0). Total num frames: 643426304. Throughput: 0: 5768.9. Samples: 643424490. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:34:57,459][25689] Avg episode reward: [(0, '-7.038')] [2022-07-10 07:34:58,573][26022] Updated weights on worker 0-0, policy_version 628351 (0.00095) [2022-07-10 07:35:00,072][26022] Updated weights on worker 0-0, policy_version 628361 (0.00092) [2022-07-10 07:35:02,502][25689] Fps is (10 sec: 5183.6, 60 sec: 5464.2, 300 sec: 5527.0). Total num frames: 643450880. Throughput: 0: 5752.2. Samples: 643457302. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:02,502][25689] Avg episode reward: [(0, '-5.417')] [2022-07-10 07:35:02,528][26022] Updated weights on worker 0-0, policy_version 628371 (0.00089) [2022-07-10 07:35:04,164][26022] Updated weights on worker 0-0, policy_version 628381 (0.00097) [2022-07-10 07:35:06,213][26022] Updated weights on worker 0-0, policy_version 628391 (0.00093) [2022-07-10 07:35:07,506][25689] Fps is (10 sec: 5401.4, 60 sec: 5520.9, 300 sec: 5534.2). Total num frames: 643480576. Throughput: 0: 5682.8. Samples: 643489366. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:07,506][25689] Avg episode reward: [(0, '-7.219')] [2022-07-10 07:35:07,845][26022] Updated weights on worker 0-0, policy_version 628401 (0.00094) [2022-07-10 07:35:09,820][26022] Updated weights on worker 0-0, policy_version 628411 (0.00084) [2022-07-10 07:35:11,714][26022] Updated weights on worker 0-0, policy_version 628421 (0.00095) [2022-07-10 07:35:12,525][25689] Fps is (10 sec: 5618.9, 60 sec: 5488.2, 300 sec: 5524.8). Total num frames: 643507200. Throughput: 0: 5690.9. Samples: 643506056. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:12,525][25689] Avg episode reward: [(0, '-5.788')] [2022-07-10 07:35:13,452][26022] Updated weights on worker 0-0, policy_version 628431 (0.00082) [2022-07-10 07:35:15,460][26022] Updated weights on worker 0-0, policy_version 628441 (0.00084) [2022-07-10 07:35:17,047][26022] Updated weights on worker 0-0, policy_version 628451 (0.00090) [2022-07-10 07:35:17,531][25689] Fps is (10 sec: 5515.3, 60 sec: 5495.9, 300 sec: 5532.5). Total num frames: 643535872. Throughput: 0: 5735.2. Samples: 643539616. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:17,532][25689] Avg episode reward: [(0, '-5.445')] [2022-07-10 07:35:19,058][26022] Updated weights on worker 0-0, policy_version 628461 (0.00101) [2022-07-10 07:35:20,748][26022] Updated weights on worker 0-0, policy_version 628471 (0.00084) [2022-07-10 07:35:22,639][25689] Fps is (10 sec: 5568.6, 60 sec: 5528.7, 300 sec: 5527.3). Total num frames: 643563520. Throughput: 0: 5760.3. Samples: 643573298. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:22,639][25689] Avg episode reward: [(0, '-5.997')] [2022-07-10 07:35:22,753][26022] Updated weights on worker 0-0, policy_version 628481 (0.00090) [2022-07-10 07:35:24,382][26022] Updated weights on worker 0-0, policy_version 628491 (0.00081) [2022-07-10 07:35:26,396][26022] Updated weights on worker 0-0, policy_version 628501 (0.00089) [2022-07-10 07:35:27,653][25689] Fps is (10 sec: 5564.6, 60 sec: 5532.1, 300 sec: 5530.8). Total num frames: 643592192. Throughput: 0: 4990.9. Samples: 643589918. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:27,653][25689] Avg episode reward: [(0, '-6.062')] [2022-07-10 07:35:28,153][26022] Updated weights on worker 0-0, policy_version 628511 (0.00096) [2022-07-10 07:35:29,983][26022] Updated weights on worker 0-0, policy_version 628521 (0.00100) [2022-07-10 07:35:31,826][26022] Updated weights on worker 0-0, policy_version 628531 (0.00090) [2022-07-10 07:35:32,666][25689] Fps is (10 sec: 5514.7, 60 sec: 5466.3, 300 sec: 5523.7). Total num frames: 643618816. Throughput: 0: 5822.7. Samples: 643623330. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:32,666][25689] Avg episode reward: [(0, '-5.842')] [2022-07-10 07:35:33,618][26022] Updated weights on worker 0-0, policy_version 628541 (0.00086) [2022-07-10 07:35:35,451][26022] Updated weights on worker 0-0, policy_version 628551 (0.00089) [2022-07-10 07:35:37,347][26022] Updated weights on worker 0-0, policy_version 628561 (0.00091) [2022-07-10 07:35:37,707][25689] Fps is (10 sec: 5601.6, 60 sec: 5532.0, 300 sec: 5534.1). Total num frames: 643648512. Throughput: 0: 5809.0. Samples: 643656816. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:37,708][25689] Avg episode reward: [(0, '-6.148')] [2022-07-10 07:35:39,272][26022] Updated weights on worker 0-0, policy_version 628571 (0.00091) [2022-07-10 07:35:40,911][26022] Updated weights on worker 0-0, policy_version 628581 (0.00097) [2022-07-10 07:35:42,799][25689] Fps is (10 sec: 5659.2, 60 sec: 5510.7, 300 sec: 5529.5). Total num frames: 643676160. Throughput: 0: 4972.1. Samples: 643673536. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:42,799][25689] Avg episode reward: [(0, '-6.264')] [2022-07-10 07:35:43,022][26022] Updated weights on worker 0-0, policy_version 628591 (0.00087) [2022-07-10 07:35:44,611][26022] Updated weights on worker 0-0, policy_version 628601 (0.00084) [2022-07-10 07:35:46,703][26022] Updated weights on worker 0-0, policy_version 628611 (0.00083) [2022-07-10 07:35:47,854][25689] Fps is (10 sec: 5449.8, 60 sec: 5506.4, 300 sec: 5529.0). Total num frames: 643703808. Throughput: 0: 5784.5. Samples: 643706770. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:47,855][25689] Avg episode reward: [(0, '-5.591')] [2022-07-10 07:35:48,349][26022] Updated weights on worker 0-0, policy_version 628621 (0.00091) [2022-07-10 07:35:50,280][26022] Updated weights on worker 0-0, policy_version 628631 (0.00085) [2022-07-10 07:35:52,085][26022] Updated weights on worker 0-0, policy_version 628641 (0.00091) [2022-07-10 07:35:52,863][25689] Fps is (10 sec: 5494.3, 60 sec: 5506.6, 300 sec: 5529.5). Total num frames: 643731456. Throughput: 0: 5786.1. Samples: 643740192. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:52,863][25689] Avg episode reward: [(0, '-7.292')] [2022-07-10 07:35:53,974][26022] Updated weights on worker 0-0, policy_version 628651 (0.00103) [2022-07-10 07:35:55,988][26022] Updated weights on worker 0-0, policy_version 628661 (0.00090) [2022-07-10 07:35:57,644][26022] Updated weights on worker 0-0, policy_version 628671 (0.00093) [2022-07-10 07:35:57,908][25689] Fps is (10 sec: 5703.4, 60 sec: 5539.2, 300 sec: 5534.6). Total num frames: 643761152. Throughput: 0: 4954.5. Samples: 643756902. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:35:57,908][25689] Avg episode reward: [(0, '-7.752')] [2022-07-10 07:35:59,570][26022] Updated weights on worker 0-0, policy_version 628681 (0.00099) [2022-07-10 07:36:01,227][26022] Updated weights on worker 0-0, policy_version 628691 (0.00094) [2022-07-10 07:36:02,952][25689] Fps is (10 sec: 5379.5, 60 sec: 5539.3, 300 sec: 5524.8). Total num frames: 643785728. Throughput: 0: 5803.0. Samples: 643790484. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:36:02,954][25689] Avg episode reward: [(0, '-7.790')] [2022-07-10 07:36:03,494][26022] Updated weights on worker 0-0, policy_version 628701 (0.00093) [2022-07-10 07:36:05,413][26022] Updated weights on worker 0-0, policy_version 628711 (0.00093) [2022-07-10 07:36:07,094][26022] Updated weights on worker 0-0, policy_version 628721 (0.00089) [2022-07-10 07:36:07,999][25689] Fps is (10 sec: 5276.9, 60 sec: 5518.4, 300 sec: 5531.2). Total num frames: 643814400. Throughput: 0: 5729.8. Samples: 643822198. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:36:08,000][25689] Avg episode reward: [(0, '-10.366')] [2022-07-10 07:36:09,022][26022] Updated weights on worker 0-0, policy_version 628731 (0.00077) [2022-07-10 07:36:10,790][26022] Updated weights on worker 0-0, policy_version 628741 (0.00095) [2022-07-10 07:36:12,811][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:36:12,819][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000628751_643841024.pth [2022-07-10 07:36:12,820][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000626805_641848320.pth [2022-07-10 07:36:12,823][26022] Updated weights on worker 0-0, policy_version 628751 (0.00090) [2022-07-10 07:36:13,023][25689] Fps is (10 sec: 5592.2, 60 sec: 5534.9, 300 sec: 5527.6). Total num frames: 643842048. Throughput: 0: 4879.6. Samples: 643838564. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:36:13,031][25689] Avg episode reward: [(0, '-10.385')] [2022-07-10 07:36:14,612][26022] Updated weights on worker 0-0, policy_version 628761 (0.00094) [2022-07-10 07:36:16,397][26022] Updated weights on worker 0-0, policy_version 628771 (0.00085) [2022-07-10 07:36:18,040][25689] Fps is (10 sec: 5609.0, 60 sec: 5533.9, 300 sec: 5525.9). Total num frames: 643870720. Throughput: 0: 5725.3. Samples: 643872162. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:36:18,040][25689] Avg episode reward: [(0, '-10.827')] [2022-07-10 07:36:18,192][26022] Updated weights on worker 0-0, policy_version 628781 (0.00091) [2022-07-10 07:36:20,015][26022] Updated weights on worker 0-0, policy_version 628791 (0.00093) [2022-07-10 07:36:21,771][26022] Updated weights on worker 0-0, policy_version 628801 (0.00097) [2022-07-10 07:36:23,081][25689] Fps is (10 sec: 5701.2, 60 sec: 5556.9, 300 sec: 5533.3). Total num frames: 643899392. Throughput: 0: 5735.4. Samples: 643905934. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:36:23,081][25689] Avg episode reward: [(0, '-9.382')] [2022-07-10 07:36:23,729][26022] Updated weights on worker 0-0, policy_version 628811 (0.00084) [2022-07-10 07:36:25,575][26022] Updated weights on worker 0-0, policy_version 628821 (0.00894) [2022-07-10 07:36:27,347][26022] Updated weights on worker 0-0, policy_version 628831 (0.00091) [2022-07-10 07:36:28,109][25689] Fps is (10 sec: 5593.3, 60 sec: 5538.7, 300 sec: 5534.2). Total num frames: 643927040. Throughput: 0: 4994.5. Samples: 643922638. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2022-07-10 07:36:28,109][25689] Avg episode reward: [(0, '-8.223')] [2022-07-10 07:36:29,203][26022] Updated weights on worker 0-0, policy_version 628841 (0.00087) [2022-07-10 07:36:30,986][26022] Updated weights on worker 0-0, policy_version 628851 (0.00083) [2022-07-10 07:36:32,952][26022] Updated weights on worker 0-0, policy_version 628861 (0.00089) [2022-07-10 07:36:33,126][25689] Fps is (10 sec: 5504.8, 60 sec: 5555.3, 300 sec: 5530.5). Total num frames: 643954688. Throughput: 0: 5849.8. Samples: 643956164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:36:33,127][25689] Avg episode reward: [(0, '-7.829')] [2022-07-10 07:36:34,711][26022] Updated weights on worker 0-0, policy_version 628871 (0.00090) [2022-07-10 07:36:36,588][26022] Updated weights on worker 0-0, policy_version 628881 (0.00083) [2022-07-10 07:36:38,154][25689] Fps is (10 sec: 5504.9, 60 sec: 5522.6, 300 sec: 5527.4). Total num frames: 643982336. Throughput: 0: 5840.7. Samples: 643989642. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:36:38,154][25689] Avg episode reward: [(0, '-6.859')] [2022-07-10 07:36:38,352][26022] Updated weights on worker 0-0, policy_version 628891 (0.00091) [2022-07-10 07:36:40,263][26022] Updated weights on worker 0-0, policy_version 628901 (0.00095) [2022-07-10 07:36:42,044][26022] Updated weights on worker 0-0, policy_version 628911 (0.00097) [2022-07-10 07:36:43,198][25689] Fps is (10 sec: 5489.9, 60 sec: 5526.9, 300 sec: 5528.4). Total num frames: 644009984. Throughput: 0: 4980.8. Samples: 644006136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:36:43,199][25689] Avg episode reward: [(0, '-8.206')] [2022-07-10 07:36:43,965][26022] Updated weights on worker 0-0, policy_version 628921 (0.00088) [2022-07-10 07:36:45,650][26022] Updated weights on worker 0-0, policy_version 628931 (0.00093) [2022-07-10 07:36:47,736][26022] Updated weights on worker 0-0, policy_version 628941 (0.00088) [2022-07-10 07:36:48,209][25689] Fps is (10 sec: 5499.0, 60 sec: 5530.9, 300 sec: 5525.3). Total num frames: 644037632. Throughput: 0: 5808.9. Samples: 644039400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:36:48,210][25689] Avg episode reward: [(0, '-8.963')] [2022-07-10 07:36:49,455][26022] Updated weights on worker 0-0, policy_version 628951 (0.00100) [2022-07-10 07:36:51,363][26022] Updated weights on worker 0-0, policy_version 628961 (0.00091) [2022-07-10 07:36:53,184][26022] Updated weights on worker 0-0, policy_version 628971 (0.00087) [2022-07-10 07:36:53,227][25689] Fps is (10 sec: 5616.0, 60 sec: 5547.2, 300 sec: 5529.3). Total num frames: 644066304. Throughput: 0: 5790.1. Samples: 644072550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:36:53,227][25689] Avg episode reward: [(0, '-9.439')] [2022-07-10 07:36:55,067][26022] Updated weights on worker 0-0, policy_version 628981 (0.00085) [2022-07-10 07:36:56,878][26022] Updated weights on worker 0-0, policy_version 628991 (0.00089) [2022-07-10 07:36:58,231][25689] Fps is (10 sec: 5517.7, 60 sec: 5500.0, 300 sec: 5524.5). Total num frames: 644092928. Throughput: 0: 4947.8. Samples: 644088980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:36:58,232][25689] Avg episode reward: [(0, '-10.934')] [2022-07-10 07:36:58,633][26022] Updated weights on worker 0-0, policy_version 629001 (0.00089) [2022-07-10 07:37:00,696][26022] Updated weights on worker 0-0, policy_version 629011 (0.00096) [2022-07-10 07:37:02,829][26022] Updated weights on worker 0-0, policy_version 629021 (0.00090) [2022-07-10 07:37:03,269][25689] Fps is (10 sec: 5200.4, 60 sec: 5517.5, 300 sec: 5518.3). Total num frames: 644118528. Throughput: 0: 5778.9. Samples: 644122124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:03,271][25689] Avg episode reward: [(0, '-12.383')] [2022-07-10 07:37:04,798][26022] Updated weights on worker 0-0, policy_version 629031 (0.00084) [2022-07-10 07:37:06,649][26022] Updated weights on worker 0-0, policy_version 629041 (0.00100) [2022-07-10 07:37:08,281][25689] Fps is (10 sec: 5400.3, 60 sec: 5520.7, 300 sec: 5522.4). Total num frames: 644147200. Throughput: 0: 5679.9. Samples: 644153404. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:08,282][25689] Avg episode reward: [(0, '-10.637')] [2022-07-10 07:37:08,378][26022] Updated weights on worker 0-0, policy_version 629051 (0.00095) [2022-07-10 07:37:10,394][26022] Updated weights on worker 0-0, policy_version 629061 (0.00087) [2022-07-10 07:37:12,061][26022] Updated weights on worker 0-0, policy_version 629071 (0.00090) [2022-07-10 07:37:13,292][25689] Fps is (10 sec: 5516.8, 60 sec: 5504.9, 300 sec: 5522.5). Total num frames: 644173824. Throughput: 0: 4863.5. Samples: 644170140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:13,293][25689] Avg episode reward: [(0, '-10.829')] [2022-07-10 07:37:14,103][26022] Updated weights on worker 0-0, policy_version 629081 (0.00087) [2022-07-10 07:37:15,895][26022] Updated weights on worker 0-0, policy_version 629091 (0.00095) [2022-07-10 07:37:17,682][26022] Updated weights on worker 0-0, policy_version 629101 (0.00554) [2022-07-10 07:37:18,306][25689] Fps is (10 sec: 5515.9, 60 sec: 5505.2, 300 sec: 5520.6). Total num frames: 644202496. Throughput: 0: 5695.6. Samples: 644203318. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:18,306][25689] Avg episode reward: [(0, '-10.901')] [2022-07-10 07:37:19,524][26022] Updated weights on worker 0-0, policy_version 629111 (0.00087) [2022-07-10 07:37:21,488][26022] Updated weights on worker 0-0, policy_version 629121 (0.00086) [2022-07-10 07:37:23,159][26022] Updated weights on worker 0-0, policy_version 629131 (0.00087) [2022-07-10 07:37:23,339][25689] Fps is (10 sec: 5707.6, 60 sec: 5505.9, 300 sec: 5524.4). Total num frames: 644231168. Throughput: 0: 5705.4. Samples: 644236632. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:23,340][25689] Avg episode reward: [(0, '-11.207')] [2022-07-10 07:37:25,303][26022] Updated weights on worker 0-0, policy_version 629141 (0.00085) [2022-07-10 07:37:26,791][26022] Updated weights on worker 0-0, policy_version 629151 (0.00081) [2022-07-10 07:37:28,344][25689] Fps is (10 sec: 5304.3, 60 sec: 5457.0, 300 sec: 5513.9). Total num frames: 644255744. Throughput: 0: 4968.5. Samples: 644253090. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:28,345][25689] Avg episode reward: [(0, '-9.061')] [2022-07-10 07:37:29,075][26022] Updated weights on worker 0-0, policy_version 629161 (0.00082) [2022-07-10 07:37:30,431][26022] Updated weights on worker 0-0, policy_version 629171 (0.00081) [2022-07-10 07:37:32,718][26022] Updated weights on worker 0-0, policy_version 629181 (0.00091) [2022-07-10 07:37:33,375][25689] Fps is (10 sec: 5510.2, 60 sec: 5506.8, 300 sec: 5520.7). Total num frames: 644286464. Throughput: 0: 5767.4. Samples: 644285962. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:33,375][25689] Avg episode reward: [(0, '-8.555')] [2022-07-10 07:37:34,612][26022] Updated weights on worker 0-0, policy_version 629191 (0.00094) [2022-07-10 07:37:36,201][26022] Updated weights on worker 0-0, policy_version 629201 (0.00086) [2022-07-10 07:37:38,274][26022] Updated weights on worker 0-0, policy_version 629211 (0.00091) [2022-07-10 07:37:38,377][25689] Fps is (10 sec: 5613.6, 60 sec: 5475.1, 300 sec: 5518.3). Total num frames: 644312064. Throughput: 0: 5780.6. Samples: 644319342. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:38,378][25689] Avg episode reward: [(0, '-9.702')] [2022-07-10 07:37:39,936][26022] Updated weights on worker 0-0, policy_version 629221 (0.00092) [2022-07-10 07:37:41,814][26022] Updated weights on worker 0-0, policy_version 629231 (0.00088) [2022-07-10 07:37:43,500][25689] Fps is (10 sec: 5359.9, 60 sec: 5484.9, 300 sec: 5512.6). Total num frames: 644340736. Throughput: 0: 4930.5. Samples: 644336034. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:43,501][25689] Avg episode reward: [(0, '-8.983')] [2022-07-10 07:37:43,856][26022] Updated weights on worker 0-0, policy_version 629241 (0.00093) [2022-07-10 07:37:45,399][26022] Updated weights on worker 0-0, policy_version 629251 (0.00097) [2022-07-10 07:37:47,382][26022] Updated weights on worker 0-0, policy_version 629261 (0.00091) [2022-07-10 07:37:48,506][25689] Fps is (10 sec: 5762.3, 60 sec: 5519.4, 300 sec: 5520.2). Total num frames: 644370432. Throughput: 0: 5778.4. Samples: 644369594. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:48,507][25689] Avg episode reward: [(0, '-8.225')] [2022-07-10 07:37:49,126][26022] Updated weights on worker 0-0, policy_version 629271 (0.00087) [2022-07-10 07:37:51,025][26022] Updated weights on worker 0-0, policy_version 629281 (0.00083) [2022-07-10 07:37:52,787][26022] Updated weights on worker 0-0, policy_version 629291 (0.00083) [2022-07-10 07:37:53,522][25689] Fps is (10 sec: 5721.7, 60 sec: 5502.5, 300 sec: 5523.9). Total num frames: 644398080. Throughput: 0: 5825.7. Samples: 644403336. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:53,523][25689] Avg episode reward: [(0, '-9.612')] [2022-07-10 07:37:54,520][26022] Updated weights on worker 0-0, policy_version 629301 (0.00093) [2022-07-10 07:37:56,475][26022] Updated weights on worker 0-0, policy_version 629311 (0.00087) [2022-07-10 07:37:58,391][26022] Updated weights on worker 0-0, policy_version 629321 (0.00085) [2022-07-10 07:37:58,539][25689] Fps is (10 sec: 5409.6, 60 sec: 5501.4, 300 sec: 5513.9). Total num frames: 644424704. Throughput: 0: 4994.9. Samples: 644420048. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:37:58,541][25689] Avg episode reward: [(0, '-10.769')] [2022-07-10 07:37:59,997][26022] Updated weights on worker 0-0, policy_version 629331 (0.00093) [2022-07-10 07:38:02,485][26022] Updated weights on worker 0-0, policy_version 629341 (0.00085) [2022-07-10 07:38:03,607][25689] Fps is (10 sec: 5483.3, 60 sec: 5549.6, 300 sec: 5526.5). Total num frames: 644453376. Throughput: 0: 5824.0. Samples: 644453136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:03,607][25689] Avg episode reward: [(0, '-10.285')] [2022-07-10 07:38:04,047][26022] Updated weights on worker 0-0, policy_version 629351 (0.00086) [2022-07-10 07:38:06,153][26022] Updated weights on worker 0-0, policy_version 629361 (0.00087) [2022-07-10 07:38:07,802][26022] Updated weights on worker 0-0, policy_version 629371 (0.00085) [2022-07-10 07:38:08,627][25689] Fps is (10 sec: 5481.1, 60 sec: 5514.8, 300 sec: 5522.9). Total num frames: 644480000. Throughput: 0: 5743.1. Samples: 644485150. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:08,628][25689] Avg episode reward: [(0, '-10.306')] [2022-07-10 07:38:09,654][26022] Updated weights on worker 0-0, policy_version 629381 (0.00092) [2022-07-10 07:38:11,524][26022] Updated weights on worker 0-0, policy_version 629391 (0.00082) [2022-07-10 07:38:12,967][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:38:12,979][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000629399_644504576.pth [2022-07-10 07:38:12,980][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000627455_642513920.pth [2022-07-10 07:38:13,268][26022] Updated weights on worker 0-0, policy_version 629401 (0.00088) [2022-07-10 07:38:13,632][25689] Fps is (10 sec: 5413.4, 60 sec: 5532.4, 300 sec: 5516.1). Total num frames: 644507648. Throughput: 0: 4904.5. Samples: 644501964. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:13,634][25689] Avg episode reward: [(0, '-10.384')] [2022-07-10 07:38:15,228][26022] Updated weights on worker 0-0, policy_version 629411 (0.00092) [2022-07-10 07:38:17,028][26022] Updated weights on worker 0-0, policy_version 629421 (0.00086) [2022-07-10 07:38:18,639][25689] Fps is (10 sec: 5625.6, 60 sec: 5533.0, 300 sec: 5527.6). Total num frames: 644536320. Throughput: 0: 5732.0. Samples: 644535260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:18,640][25689] Avg episode reward: [(0, '-10.224')] [2022-07-10 07:38:18,793][26022] Updated weights on worker 0-0, policy_version 629431 (0.00090) [2022-07-10 07:38:20,647][26022] Updated weights on worker 0-0, policy_version 629441 (0.00084) [2022-07-10 07:38:22,594][26022] Updated weights on worker 0-0, policy_version 629451 (0.00078) [2022-07-10 07:38:23,692][25689] Fps is (10 sec: 5496.7, 60 sec: 5497.3, 300 sec: 5519.8). Total num frames: 644562944. Throughput: 0: 5755.2. Samples: 644568732. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:23,694][25689] Avg episode reward: [(0, '-9.369')] [2022-07-10 07:38:24,340][26022] Updated weights on worker 0-0, policy_version 629461 (0.00082) [2022-07-10 07:38:26,532][26022] Updated weights on worker 0-0, policy_version 629471 (0.00087) [2022-07-10 07:38:27,958][26022] Updated weights on worker 0-0, policy_version 629481 (0.00090) [2022-07-10 07:38:28,793][25689] Fps is (10 sec: 5445.6, 60 sec: 5556.3, 300 sec: 5518.1). Total num frames: 644591616. Throughput: 0: 4971.5. Samples: 644585404. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:28,795][25689] Avg episode reward: [(0, '-8.715')] [2022-07-10 07:38:30,200][26022] Updated weights on worker 0-0, policy_version 629491 (0.00092) [2022-07-10 07:38:31,655][26022] Updated weights on worker 0-0, policy_version 629501 (0.00087) [2022-07-10 07:38:33,599][26022] Updated weights on worker 0-0, policy_version 629511 (0.00089) [2022-07-10 07:38:33,809][25689] Fps is (10 sec: 5668.4, 60 sec: 5523.8, 300 sec: 5518.4). Total num frames: 644620288. Throughput: 0: 5772.1. Samples: 644618424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:33,810][25689] Avg episode reward: [(0, '-8.569')] [2022-07-10 07:38:35,377][26022] Updated weights on worker 0-0, policy_version 629521 (0.00105) [2022-07-10 07:38:37,298][26022] Updated weights on worker 0-0, policy_version 629531 (0.00060) [2022-07-10 07:38:38,811][25689] Fps is (10 sec: 5519.5, 60 sec: 5540.7, 300 sec: 5515.9). Total num frames: 644646912. Throughput: 0: 5778.8. Samples: 644651832. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:38,813][25689] Avg episode reward: [(0, '-8.978')] [2022-07-10 07:38:39,182][26022] Updated weights on worker 0-0, policy_version 629541 (0.00089) [2022-07-10 07:38:40,919][26022] Updated weights on worker 0-0, policy_version 629551 (0.00086) [2022-07-10 07:38:42,855][26022] Updated weights on worker 0-0, policy_version 629561 (0.00094) [2022-07-10 07:38:43,868][25689] Fps is (10 sec: 5395.1, 60 sec: 5529.8, 300 sec: 5511.8). Total num frames: 644674560. Throughput: 0: 5780.1. Samples: 644685352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:43,869][25689] Avg episode reward: [(0, '-8.468')] [2022-07-10 07:38:44,745][26022] Updated weights on worker 0-0, policy_version 629571 (0.00096) [2022-07-10 07:38:46,413][26022] Updated weights on worker 0-0, policy_version 629581 (0.00087) [2022-07-10 07:38:48,457][26022] Updated weights on worker 0-0, policy_version 629591 (0.00091) [2022-07-10 07:38:48,928][25689] Fps is (10 sec: 5566.9, 60 sec: 5508.0, 300 sec: 5518.6). Total num frames: 644703232. Throughput: 0: 5791.4. Samples: 644702014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:48,928][25689] Avg episode reward: [(0, '-10.505')] [2022-07-10 07:38:50,082][26022] Updated weights on worker 0-0, policy_version 629601 (0.00088) [2022-07-10 07:38:52,220][26022] Updated weights on worker 0-0, policy_version 629611 (0.00089) [2022-07-10 07:38:53,787][26022] Updated weights on worker 0-0, policy_version 629621 (0.00092) [2022-07-10 07:38:53,947][25689] Fps is (10 sec: 5689.7, 60 sec: 5524.7, 300 sec: 5522.6). Total num frames: 644731904. Throughput: 0: 5791.6. Samples: 644735056. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:53,947][25689] Avg episode reward: [(0, '-11.473')] [2022-07-10 07:38:55,801][26022] Updated weights on worker 0-0, policy_version 629631 (0.00086) [2022-07-10 07:38:57,615][26022] Updated weights on worker 0-0, policy_version 629641 (0.00090) [2022-07-10 07:38:58,971][25689] Fps is (10 sec: 5506.1, 60 sec: 5524.0, 300 sec: 5516.5). Total num frames: 644758528. Throughput: 0: 5783.9. Samples: 644768434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:38:58,972][25689] Avg episode reward: [(0, '-11.483')] [2022-07-10 07:38:59,575][26022] Updated weights on worker 0-0, policy_version 629651 (0.00092) [2022-07-10 07:39:01,236][26022] Updated weights on worker 0-0, policy_version 629661 (0.00088) [2022-07-10 07:39:03,666][26022] Updated weights on worker 0-0, policy_version 629671 (0.00088) [2022-07-10 07:39:04,080][25689] Fps is (10 sec: 5153.5, 60 sec: 5469.4, 300 sec: 5512.3). Total num frames: 644784128. Throughput: 0: 4875.4. Samples: 644783894. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:04,081][25689] Avg episode reward: [(0, '-11.225')] [2022-07-10 07:39:05,270][26022] Updated weights on worker 0-0, policy_version 629681 (0.00089) [2022-07-10 07:39:07,398][26022] Updated weights on worker 0-0, policy_version 629691 (0.00522) [2022-07-10 07:39:09,023][26022] Updated weights on worker 0-0, policy_version 629701 (0.00095) [2022-07-10 07:39:09,121][25689] Fps is (10 sec: 5447.6, 60 sec: 5518.3, 300 sec: 5515.6). Total num frames: 644813824. Throughput: 0: 5668.2. Samples: 644816474. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:09,122][25689] Avg episode reward: [(0, '-10.376')] [2022-07-10 07:39:11,021][26022] Updated weights on worker 0-0, policy_version 629711 (0.00086) [2022-07-10 07:39:12,743][26022] Updated weights on worker 0-0, policy_version 629721 (0.01265) [2022-07-10 07:39:14,230][25689] Fps is (10 sec: 5548.8, 60 sec: 5491.9, 300 sec: 5508.4). Total num frames: 644840448. Throughput: 0: 5652.3. Samples: 644849706. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:14,231][25689] Avg episode reward: [(0, '-10.180')] [2022-07-10 07:39:14,815][26022] Updated weights on worker 0-0, policy_version 629731 (0.00089) [2022-07-10 07:39:16,454][26022] Updated weights on worker 0-0, policy_version 629741 (0.00100) [2022-07-10 07:39:18,441][26022] Updated weights on worker 0-0, policy_version 629751 (0.00085) [2022-07-10 07:39:19,284][25689] Fps is (10 sec: 5440.9, 60 sec: 5487.6, 300 sec: 5519.5). Total num frames: 644869120. Throughput: 0: 4816.2. Samples: 644866268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:19,285][25689] Avg episode reward: [(0, '-8.518')] [2022-07-10 07:39:20,266][26022] Updated weights on worker 0-0, policy_version 629761 (0.00088) [2022-07-10 07:39:22,060][26022] Updated weights on worker 0-0, policy_version 629771 (0.00092) [2022-07-10 07:39:23,817][26022] Updated weights on worker 0-0, policy_version 629781 (0.00091) [2022-07-10 07:39:24,403][25689] Fps is (10 sec: 5637.3, 60 sec: 5515.5, 300 sec: 5518.2). Total num frames: 644897792. Throughput: 0: 5708.4. Samples: 644899902. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:24,403][25689] Avg episode reward: [(0, '-7.136')] [2022-07-10 07:39:25,611][26022] Updated weights on worker 0-0, policy_version 629791 (0.00085) [2022-07-10 07:39:27,658][26022] Updated weights on worker 0-0, policy_version 629801 (0.00089) [2022-07-10 07:39:29,408][25689] Fps is (10 sec: 5563.2, 60 sec: 5507.3, 300 sec: 5508.5). Total num frames: 644925440. Throughput: 0: 5736.3. Samples: 644932844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:29,408][25689] Avg episode reward: [(0, '-7.249')] [2022-07-10 07:39:29,504][26022] Updated weights on worker 0-0, policy_version 629811 (0.00095) [2022-07-10 07:39:31,530][26022] Updated weights on worker 0-0, policy_version 629821 (0.00094) [2022-07-10 07:39:33,175][26022] Updated weights on worker 0-0, policy_version 629831 (0.00086) [2022-07-10 07:39:34,422][25689] Fps is (10 sec: 5518.9, 60 sec: 5490.5, 300 sec: 5515.4). Total num frames: 644953088. Throughput: 0: 4948.3. Samples: 644949620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:34,422][25689] Avg episode reward: [(0, '-8.105')] [2022-07-10 07:39:34,988][26022] Updated weights on worker 0-0, policy_version 629841 (0.00089) [2022-07-10 07:39:36,896][26022] Updated weights on worker 0-0, policy_version 629851 (0.00094) [2022-07-10 07:39:38,774][26022] Updated weights on worker 0-0, policy_version 629861 (0.00086) [2022-07-10 07:39:39,449][25689] Fps is (10 sec: 5608.7, 60 sec: 5522.1, 300 sec: 5515.7). Total num frames: 644981760. Throughput: 0: 5793.5. Samples: 644983094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 07:39:39,451][25689] Avg episode reward: [(0, '-10.116')] [2022-07-10 07:39:40,640][26022] Updated weights on worker 0-0, policy_version 629871 (0.00093) [2022-07-10 07:39:42,475][26022] Updated weights on worker 0-0, policy_version 629881 (0.00091) [2022-07-10 07:39:44,338][26022] Updated weights on worker 0-0, policy_version 629891 (0.00091) [2022-07-10 07:39:44,524][25689] Fps is (10 sec: 5575.2, 60 sec: 5520.5, 300 sec: 5514.5). Total num frames: 645009408. Throughput: 0: 5769.6. Samples: 645015994. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:39:44,524][25689] Avg episode reward: [(0, '-11.091')] [2022-07-10 07:39:46,535][26022] Updated weights on worker 0-0, policy_version 629901 (0.00090) [2022-07-10 07:39:47,985][26022] Updated weights on worker 0-0, policy_version 629911 (0.00093) [2022-07-10 07:39:49,549][25689] Fps is (10 sec: 5474.8, 60 sec: 5506.7, 300 sec: 5514.2). Total num frames: 645037056. Throughput: 0: 4953.4. Samples: 645032614. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:39:49,550][25689] Avg episode reward: [(0, '-12.612')] [2022-07-10 07:39:50,011][26022] Updated weights on worker 0-0, policy_version 629921 (0.00089) [2022-07-10 07:39:51,495][26022] Updated weights on worker 0-0, policy_version 629931 (0.00079) [2022-07-10 07:39:53,579][26022] Updated weights on worker 0-0, policy_version 629941 (0.00089) [2022-07-10 07:39:54,584][25689] Fps is (10 sec: 5598.2, 60 sec: 5505.2, 300 sec: 5517.6). Total num frames: 645065728. Throughput: 0: 5788.1. Samples: 645066320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:39:54,585][25689] Avg episode reward: [(0, '-10.933')] [2022-07-10 07:39:55,319][26022] Updated weights on worker 0-0, policy_version 629951 (0.00087) [2022-07-10 07:39:57,257][26022] Updated weights on worker 0-0, policy_version 629961 (0.00077) [2022-07-10 07:39:59,016][26022] Updated weights on worker 0-0, policy_version 629971 (0.00094) [2022-07-10 07:39:59,640][25689] Fps is (10 sec: 5581.4, 60 sec: 5519.2, 300 sec: 5527.7). Total num frames: 645093376. Throughput: 0: 5755.5. Samples: 645099302. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:39:59,641][25689] Avg episode reward: [(0, '-10.928')] [2022-07-10 07:40:01,104][26022] Updated weights on worker 0-0, policy_version 629981 (0.00086) [2022-07-10 07:40:03,071][26022] Updated weights on worker 0-0, policy_version 629991 (0.00084) [2022-07-10 07:40:04,798][25689] Fps is (10 sec: 5113.0, 60 sec: 5498.0, 300 sec: 5507.6). Total num frames: 645117952. Throughput: 0: 4822.9. Samples: 645113776. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:04,799][25689] Avg episode reward: [(0, '-9.624')] [2022-07-10 07:40:05,194][26022] Updated weights on worker 0-0, policy_version 630001 (0.00087) [2022-07-10 07:40:06,809][26022] Updated weights on worker 0-0, policy_version 630011 (0.00095) [2022-07-10 07:40:08,872][26022] Updated weights on worker 0-0, policy_version 630021 (0.00089) [2022-07-10 07:40:09,819][25689] Fps is (10 sec: 5331.9, 60 sec: 5499.8, 300 sec: 5517.9). Total num frames: 645147648. Throughput: 0: 5633.9. Samples: 645146810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:09,819][25689] Avg episode reward: [(0, '-10.025')] [2022-07-10 07:40:10,615][26022] Updated weights on worker 0-0, policy_version 630031 (0.00085) [2022-07-10 07:40:12,279][26022] Updated weights on worker 0-0, policy_version 630041 (0.00084) [2022-07-10 07:40:13,024][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:40:13,037][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000630044_645165056.pth [2022-07-10 07:40:13,037][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000628104_643178496.pth [2022-07-10 07:40:14,419][26022] Updated weights on worker 0-0, policy_version 630051 (0.00094) [2022-07-10 07:40:14,888][25689] Fps is (10 sec: 5683.4, 60 sec: 5520.3, 300 sec: 5513.3). Total num frames: 645175296. Throughput: 0: 5609.6. Samples: 645180216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:14,888][25689] Avg episode reward: [(0, '-7.806')] [2022-07-10 07:40:16,299][26022] Updated weights on worker 0-0, policy_version 630061 (0.00083) [2022-07-10 07:40:17,839][26022] Updated weights on worker 0-0, policy_version 630071 (0.00087) [2022-07-10 07:40:19,873][26022] Updated weights on worker 0-0, policy_version 630081 (0.00082) [2022-07-10 07:40:19,933][25689] Fps is (10 sec: 5467.0, 60 sec: 5504.2, 300 sec: 5514.5). Total num frames: 645202944. Throughput: 0: 4820.4. Samples: 645197124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:19,938][25689] Avg episode reward: [(0, '-6.989')] [2022-07-10 07:40:21,453][26022] Updated weights on worker 0-0, policy_version 630091 (0.00088) [2022-07-10 07:40:23,451][26022] Updated weights on worker 0-0, policy_version 630101 (0.00093) [2022-07-10 07:40:25,008][25689] Fps is (10 sec: 5666.1, 60 sec: 5525.0, 300 sec: 5516.8). Total num frames: 645232640. Throughput: 0: 5795.1. Samples: 645230896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:25,009][25689] Avg episode reward: [(0, '-6.058')] [2022-07-10 07:40:25,122][26022] Updated weights on worker 0-0, policy_version 630111 (0.00089) [2022-07-10 07:40:27,107][26022] Updated weights on worker 0-0, policy_version 630121 (0.00089) [2022-07-10 07:40:28,988][26022] Updated weights on worker 0-0, policy_version 630131 (0.00082) [2022-07-10 07:40:30,094][25689] Fps is (10 sec: 5542.9, 60 sec: 5500.8, 300 sec: 5515.4). Total num frames: 645259264. Throughput: 0: 5781.7. Samples: 645264036. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:30,096][25689] Avg episode reward: [(0, '-5.585')] [2022-07-10 07:40:30,871][26022] Updated weights on worker 0-0, policy_version 630141 (0.00081) [2022-07-10 07:40:32,588][26022] Updated weights on worker 0-0, policy_version 630151 (0.00085) [2022-07-10 07:40:34,650][26022] Updated weights on worker 0-0, policy_version 630161 (0.00090) [2022-07-10 07:40:35,104][25689] Fps is (10 sec: 5477.3, 60 sec: 5518.1, 300 sec: 5512.6). Total num frames: 645287936. Throughput: 0: 4973.8. Samples: 645280768. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:35,105][25689] Avg episode reward: [(0, '-8.144')] [2022-07-10 07:40:36,099][26022] Updated weights on worker 0-0, policy_version 630171 (0.00086) [2022-07-10 07:40:38,193][26022] Updated weights on worker 0-0, policy_version 630181 (0.00086) [2022-07-10 07:40:39,972][26022] Updated weights on worker 0-0, policy_version 630191 (0.00095) [2022-07-10 07:40:40,119][25689] Fps is (10 sec: 5618.2, 60 sec: 5502.4, 300 sec: 5514.0). Total num frames: 645315584. Throughput: 0: 5811.9. Samples: 645314440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:40,119][25689] Avg episode reward: [(0, '-8.798')] [2022-07-10 07:40:41,963][26022] Updated weights on worker 0-0, policy_version 630201 (0.00088) [2022-07-10 07:40:43,694][26022] Updated weights on worker 0-0, policy_version 630211 (0.00083) [2022-07-10 07:40:45,178][25689] Fps is (10 sec: 5590.4, 60 sec: 5520.6, 300 sec: 5517.4). Total num frames: 645344256. Throughput: 0: 5800.4. Samples: 645347890. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:45,179][25689] Avg episode reward: [(0, '-9.842')] [2022-07-10 07:40:45,784][26022] Updated weights on worker 0-0, policy_version 630221 (0.00256) [2022-07-10 07:40:47,178][26022] Updated weights on worker 0-0, policy_version 630231 (0.00089) [2022-07-10 07:40:49,451][26022] Updated weights on worker 0-0, policy_version 630241 (0.01310) [2022-07-10 07:40:50,219][25689] Fps is (10 sec: 5677.2, 60 sec: 5536.0, 300 sec: 5520.2). Total num frames: 645372928. Throughput: 0: 5838.9. Samples: 645381546. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:50,220][25689] Avg episode reward: [(0, '-10.827')] [2022-07-10 07:40:50,827][26022] Updated weights on worker 0-0, policy_version 630251 (0.00088) [2022-07-10 07:40:52,979][26022] Updated weights on worker 0-0, policy_version 630261 (0.00090) [2022-07-10 07:40:54,576][26022] Updated weights on worker 0-0, policy_version 630271 (0.00089) [2022-07-10 07:40:55,227][25689] Fps is (10 sec: 5503.0, 60 sec: 5504.8, 300 sec: 5510.6). Total num frames: 645399552. Throughput: 0: 5847.6. Samples: 645398438. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:40:55,227][25689] Avg episode reward: [(0, '-10.021')] [2022-07-10 07:40:56,536][26022] Updated weights on worker 0-0, policy_version 630281 (0.00096) [2022-07-10 07:40:58,495][26022] Updated weights on worker 0-0, policy_version 630291 (0.00088) [2022-07-10 07:41:00,064][26022] Updated weights on worker 0-0, policy_version 630301 (0.00089) [2022-07-10 07:41:00,254][25689] Fps is (10 sec: 5510.4, 60 sec: 5524.3, 300 sec: 5524.7). Total num frames: 645428224. Throughput: 0: 5842.7. Samples: 645432086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:00,256][25689] Avg episode reward: [(0, '-8.704')] [2022-07-10 07:41:02,573][26022] Updated weights on worker 0-0, policy_version 630311 (0.00086) [2022-07-10 07:41:04,364][26022] Updated weights on worker 0-0, policy_version 630321 (0.00086) [2022-07-10 07:41:05,306][25689] Fps is (10 sec: 5282.7, 60 sec: 5533.9, 300 sec: 5510.8). Total num frames: 645452800. Throughput: 0: 5728.6. Samples: 645463194. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:05,308][25689] Avg episode reward: [(0, '-6.784')] [2022-07-10 07:41:05,982][26022] Updated weights on worker 0-0, policy_version 630331 (0.00093) [2022-07-10 07:41:07,940][26022] Updated weights on worker 0-0, policy_version 630341 (0.00083) [2022-07-10 07:41:09,661][26022] Updated weights on worker 0-0, policy_version 630351 (0.00090) [2022-07-10 07:41:10,332][25689] Fps is (10 sec: 5385.1, 60 sec: 5533.4, 300 sec: 5517.6). Total num frames: 645482496. Throughput: 0: 4890.0. Samples: 645479898. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:10,333][25689] Avg episode reward: [(0, '-7.951')] [2022-07-10 07:41:11,837][26022] Updated weights on worker 0-0, policy_version 630361 (0.00089) [2022-07-10 07:41:13,403][26022] Updated weights on worker 0-0, policy_version 630371 (0.00096) [2022-07-10 07:41:15,360][25689] Fps is (10 sec: 5601.8, 60 sec: 5520.3, 300 sec: 5510.6). Total num frames: 645509120. Throughput: 0: 5713.9. Samples: 645513478. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:15,363][25689] Avg episode reward: [(0, '-10.180')] [2022-07-10 07:41:15,378][26022] Updated weights on worker 0-0, policy_version 630381 (0.00092) [2022-07-10 07:41:17,132][26022] Updated weights on worker 0-0, policy_version 630391 (0.00084) [2022-07-10 07:41:18,911][26022] Updated weights on worker 0-0, policy_version 630401 (0.00095) [2022-07-10 07:41:20,365][25689] Fps is (10 sec: 5511.9, 60 sec: 5541.0, 300 sec: 5511.2). Total num frames: 645537792. Throughput: 0: 5715.4. Samples: 645547026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:20,365][25689] Avg episode reward: [(0, '-12.821')] [2022-07-10 07:41:20,895][26022] Updated weights on worker 0-0, policy_version 630411 (0.00083) [2022-07-10 07:41:22,712][26022] Updated weights on worker 0-0, policy_version 630421 (0.00369) [2022-07-10 07:41:24,538][26022] Updated weights on worker 0-0, policy_version 630431 (0.00088) [2022-07-10 07:41:25,495][25689] Fps is (10 sec: 5658.0, 60 sec: 5519.0, 300 sec: 5512.8). Total num frames: 645566464. Throughput: 0: 4975.4. Samples: 645563642. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:25,496][25689] Avg episode reward: [(0, '-13.674')] [2022-07-10 07:41:26,387][26022] Updated weights on worker 0-0, policy_version 630441 (0.00085) [2022-07-10 07:41:28,021][26022] Updated weights on worker 0-0, policy_version 630451 (0.00083) [2022-07-10 07:41:30,048][26022] Updated weights on worker 0-0, policy_version 630461 (0.00087) [2022-07-10 07:41:30,544][25689] Fps is (10 sec: 5532.6, 60 sec: 5539.2, 300 sec: 5512.2). Total num frames: 645594112. Throughput: 0: 5799.2. Samples: 645597110. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:30,545][25689] Avg episode reward: [(0, '-14.573')] [2022-07-10 07:41:31,858][26022] Updated weights on worker 0-0, policy_version 630471 (0.00098) [2022-07-10 07:41:33,655][26022] Updated weights on worker 0-0, policy_version 630481 (0.00089) [2022-07-10 07:41:35,557][25689] Fps is (10 sec: 5495.8, 60 sec: 5522.1, 300 sec: 5512.5). Total num frames: 645621760. Throughput: 0: 5803.8. Samples: 645630694. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:35,557][25689] Avg episode reward: [(0, '-15.211')] [2022-07-10 07:41:35,563][26022] Updated weights on worker 0-0, policy_version 630491 (0.00111) [2022-07-10 07:41:37,257][26022] Updated weights on worker 0-0, policy_version 630501 (0.00087) [2022-07-10 07:41:39,242][26022] Updated weights on worker 0-0, policy_version 630511 (0.00085) [2022-07-10 07:41:40,572][25689] Fps is (10 sec: 5616.5, 60 sec: 5539.0, 300 sec: 5516.5). Total num frames: 645650432. Throughput: 0: 4967.1. Samples: 645647398. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:40,572][25689] Avg episode reward: [(0, '-13.050')] [2022-07-10 07:41:40,932][26022] Updated weights on worker 0-0, policy_version 630521 (0.00092) [2022-07-10 07:41:43,010][26022] Updated weights on worker 0-0, policy_version 630531 (0.00091) [2022-07-10 07:41:44,683][26022] Updated weights on worker 0-0, policy_version 630541 (0.00080) [2022-07-10 07:41:45,699][25689] Fps is (10 sec: 5653.8, 60 sec: 5532.8, 300 sec: 5517.7). Total num frames: 645679104. Throughput: 0: 5803.9. Samples: 645680904. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:45,700][25689] Avg episode reward: [(0, '-10.370')] [2022-07-10 07:41:46,564][26022] Updated weights on worker 0-0, policy_version 630551 (0.00093) [2022-07-10 07:41:48,470][26022] Updated weights on worker 0-0, policy_version 630561 (0.00095) [2022-07-10 07:41:50,362][26022] Updated weights on worker 0-0, policy_version 630571 (0.00076) [2022-07-10 07:41:50,741][25689] Fps is (10 sec: 5639.1, 60 sec: 5532.7, 300 sec: 5517.3). Total num frames: 645707776. Throughput: 0: 5816.4. Samples: 645714582. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:50,741][25689] Avg episode reward: [(0, '-8.122')] [2022-07-10 07:41:52,031][26022] Updated weights on worker 0-0, policy_version 630581 (0.00092) [2022-07-10 07:41:53,872][26022] Updated weights on worker 0-0, policy_version 630591 (0.00092) [2022-07-10 07:41:55,515][26022] Updated weights on worker 0-0, policy_version 630601 (0.00095) [2022-07-10 07:41:55,772][25689] Fps is (10 sec: 5693.2, 60 sec: 5564.4, 300 sec: 5523.7). Total num frames: 645736448. Throughput: 0: 4986.5. Samples: 645731496. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:41:55,774][25689] Avg episode reward: [(0, '-7.296')] [2022-07-10 07:41:57,519][26022] Updated weights on worker 0-0, policy_version 630611 (0.00083) [2022-07-10 07:41:59,478][26022] Updated weights on worker 0-0, policy_version 630621 (0.00089) [2022-07-10 07:42:00,836][25689] Fps is (10 sec: 5477.7, 60 sec: 5527.2, 300 sec: 5526.6). Total num frames: 645763072. Throughput: 0: 5810.5. Samples: 645765140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:00,836][25689] Avg episode reward: [(0, '-6.290')] [2022-07-10 07:42:01,222][26022] Updated weights on worker 0-0, policy_version 630631 (0.00091) [2022-07-10 07:42:03,478][26022] Updated weights on worker 0-0, policy_version 630641 (0.00093) [2022-07-10 07:42:05,141][26022] Updated weights on worker 0-0, policy_version 630651 (0.00089) [2022-07-10 07:42:05,912][25689] Fps is (10 sec: 5251.3, 60 sec: 5558.9, 300 sec: 5518.6). Total num frames: 645789696. Throughput: 0: 5715.3. Samples: 645796424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:05,914][25689] Avg episode reward: [(0, '-6.994')] [2022-07-10 07:42:07,015][26022] Updated weights on worker 0-0, policy_version 630661 (0.00086) [2022-07-10 07:42:09,111][26022] Updated weights on worker 0-0, policy_version 630671 (0.00092) [2022-07-10 07:42:10,624][26022] Updated weights on worker 0-0, policy_version 630681 (0.00091) [2022-07-10 07:42:10,930][25689] Fps is (10 sec: 5477.9, 60 sec: 5542.7, 300 sec: 5525.3). Total num frames: 645818368. Throughput: 0: 4888.3. Samples: 645813272. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:10,935][25689] Avg episode reward: [(0, '-8.079')] [2022-07-10 07:42:12,683][26022] Updated weights on worker 0-0, policy_version 630691 (0.00099) [2022-07-10 07:42:13,131][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:42:13,162][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000630694_645830656.pth [2022-07-10 07:42:13,163][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000628751_643841024.pth [2022-07-10 07:42:14,311][26022] Updated weights on worker 0-0, policy_version 630701 (0.00090) [2022-07-10 07:42:15,969][25689] Fps is (10 sec: 5599.6, 60 sec: 5558.5, 300 sec: 5521.4). Total num frames: 645846016. Throughput: 0: 5696.2. Samples: 645846546. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:15,971][25689] Avg episode reward: [(0, '-8.090')] [2022-07-10 07:42:16,229][26022] Updated weights on worker 0-0, policy_version 630711 (0.00086) [2022-07-10 07:42:18,135][26022] Updated weights on worker 0-0, policy_version 630721 (0.00091) [2022-07-10 07:42:19,955][26022] Updated weights on worker 0-0, policy_version 630731 (0.00087) [2022-07-10 07:42:20,994][25689] Fps is (10 sec: 5494.3, 60 sec: 5539.7, 300 sec: 5518.1). Total num frames: 645873664. Throughput: 0: 5704.9. Samples: 645880142. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:20,995][25689] Avg episode reward: [(0, '-7.859')] [2022-07-10 07:42:21,763][26022] Updated weights on worker 0-0, policy_version 630741 (0.00081) [2022-07-10 07:42:23,588][26022] Updated weights on worker 0-0, policy_version 630751 (0.00086) [2022-07-10 07:42:25,344][26022] Updated weights on worker 0-0, policy_version 630761 (0.00090) [2022-07-10 07:42:26,122][25689] Fps is (10 sec: 5547.2, 60 sec: 5540.0, 300 sec: 5529.6). Total num frames: 645902336. Throughput: 0: 4971.2. Samples: 645896896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:26,123][25689] Avg episode reward: [(0, '-9.271')] [2022-07-10 07:42:27,331][26022] Updated weights on worker 0-0, policy_version 630771 (0.00097) [2022-07-10 07:42:29,052][26022] Updated weights on worker 0-0, policy_version 630781 (0.00088) [2022-07-10 07:42:31,049][26022] Updated weights on worker 0-0, policy_version 630791 (0.00087) [2022-07-10 07:42:31,177][25689] Fps is (10 sec: 5530.7, 60 sec: 5539.4, 300 sec: 5518.8). Total num frames: 645929984. Throughput: 0: 5769.9. Samples: 645930098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:31,178][25689] Avg episode reward: [(0, '-10.523')] [2022-07-10 07:42:32,847][26022] Updated weights on worker 0-0, policy_version 630801 (0.00083) [2022-07-10 07:42:34,621][26022] Updated weights on worker 0-0, policy_version 630811 (0.00086) [2022-07-10 07:42:36,231][25689] Fps is (10 sec: 5571.4, 60 sec: 5552.5, 300 sec: 5528.2). Total num frames: 645958656. Throughput: 0: 5767.9. Samples: 645963414. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:36,232][25689] Avg episode reward: [(0, '-10.097')] [2022-07-10 07:42:36,562][26022] Updated weights on worker 0-0, policy_version 630821 (0.00084) [2022-07-10 07:42:38,385][26022] Updated weights on worker 0-0, policy_version 630831 (0.00099) [2022-07-10 07:42:40,298][26022] Updated weights on worker 0-0, policy_version 630841 (0.00082) [2022-07-10 07:42:41,239][25689] Fps is (10 sec: 5495.8, 60 sec: 5519.4, 300 sec: 5523.4). Total num frames: 645985280. Throughput: 0: 4925.1. Samples: 645979848. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:41,239][25689] Avg episode reward: [(0, '-10.724')] [2022-07-10 07:42:42,198][26022] Updated weights on worker 0-0, policy_version 630851 (0.00086) [2022-07-10 07:42:43,775][26022] Updated weights on worker 0-0, policy_version 630861 (0.00088) [2022-07-10 07:42:46,004][26022] Updated weights on worker 0-0, policy_version 630871 (0.00091) [2022-07-10 07:42:46,313][25689] Fps is (10 sec: 5484.6, 60 sec: 5524.3, 300 sec: 5518.7). Total num frames: 646013952. Throughput: 0: 5753.1. Samples: 646013056. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:46,314][25689] Avg episode reward: [(0, '-10.722')] [2022-07-10 07:42:47,678][26022] Updated weights on worker 0-0, policy_version 630881 (0.00085) [2022-07-10 07:42:49,511][26022] Updated weights on worker 0-0, policy_version 630891 (0.00083) [2022-07-10 07:42:51,346][25689] Fps is (10 sec: 5673.8, 60 sec: 5525.1, 300 sec: 5521.9). Total num frames: 646042624. Throughput: 0: 5785.9. Samples: 646046790. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 07:42:51,346][25689] Avg episode reward: [(0, '-10.946')] [2022-07-10 07:42:51,346][26022] Updated weights on worker 0-0, policy_version 630901 (0.00087) [2022-07-10 07:42:53,166][26022] Updated weights on worker 0-0, policy_version 630911 (0.00084) [2022-07-10 07:42:55,109][26022] Updated weights on worker 0-0, policy_version 630921 (0.00091) [2022-07-10 07:42:56,388][25689] Fps is (10 sec: 5590.4, 60 sec: 5507.2, 300 sec: 5524.8). Total num frames: 646070272. Throughput: 0: 4965.1. Samples: 646063492. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:42:56,388][25689] Avg episode reward: [(0, '-9.739')] [2022-07-10 07:42:56,819][26022] Updated weights on worker 0-0, policy_version 630931 (0.00102) [2022-07-10 07:42:58,631][26022] Updated weights on worker 0-0, policy_version 630941 (0.00082) [2022-07-10 07:43:00,744][26022] Updated weights on worker 0-0, policy_version 630951 (0.00088) [2022-07-10 07:43:01,397][25689] Fps is (10 sec: 5603.4, 60 sec: 5546.0, 300 sec: 5525.9). Total num frames: 646098944. Throughput: 0: 5815.9. Samples: 646097084. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:01,397][25689] Avg episode reward: [(0, '-8.237')] [2022-07-10 07:43:02,623][26022] Updated weights on worker 0-0, policy_version 630961 (0.00088) [2022-07-10 07:43:04,826][26022] Updated weights on worker 0-0, policy_version 630971 (0.00103) [2022-07-10 07:43:06,428][26022] Updated weights on worker 0-0, policy_version 630981 (0.00092) [2022-07-10 07:43:06,531][25689] Fps is (10 sec: 5350.7, 60 sec: 5523.8, 300 sec: 5520.4). Total num frames: 646124544. Throughput: 0: 5690.0. Samples: 646128092. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:06,531][25689] Avg episode reward: [(0, '-8.709')] [2022-07-10 07:43:08,283][26022] Updated weights on worker 0-0, policy_version 630991 (0.00083) [2022-07-10 07:43:10,321][26022] Updated weights on worker 0-0, policy_version 631001 (0.00088) [2022-07-10 07:43:11,618][25689] Fps is (10 sec: 5309.7, 60 sec: 5517.5, 300 sec: 5522.3). Total num frames: 646153216. Throughput: 0: 5668.1. Samples: 646161696. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:11,619][25689] Avg episode reward: [(0, '-10.771')] [2022-07-10 07:43:12,037][26022] Updated weights on worker 0-0, policy_version 631011 (0.00091) [2022-07-10 07:43:13,739][26022] Updated weights on worker 0-0, policy_version 631021 (0.00088) [2022-07-10 07:43:15,543][26022] Updated weights on worker 0-0, policy_version 631031 (0.00089) [2022-07-10 07:43:16,693][25689] Fps is (10 sec: 5542.0, 60 sec: 5514.3, 300 sec: 5517.6). Total num frames: 646180864. Throughput: 0: 5664.0. Samples: 646178502. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:16,694][25689] Avg episode reward: [(0, '-10.161')] [2022-07-10 07:43:17,595][26022] Updated weights on worker 0-0, policy_version 631041 (0.00093) [2022-07-10 07:43:19,411][26022] Updated weights on worker 0-0, policy_version 631051 (0.00094) [2022-07-10 07:43:21,220][26022] Updated weights on worker 0-0, policy_version 631061 (0.00094) [2022-07-10 07:43:21,718][25689] Fps is (10 sec: 5576.2, 60 sec: 5531.1, 300 sec: 5525.0). Total num frames: 646209536. Throughput: 0: 5647.9. Samples: 646211858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:21,719][25689] Avg episode reward: [(0, '-9.816')] [2022-07-10 07:43:23,049][26022] Updated weights on worker 0-0, policy_version 631071 (0.00096) [2022-07-10 07:43:24,921][26022] Updated weights on worker 0-0, policy_version 631081 (0.00079) [2022-07-10 07:43:26,682][26022] Updated weights on worker 0-0, policy_version 631091 (0.00088) [2022-07-10 07:43:26,814][25689] Fps is (10 sec: 5564.9, 60 sec: 5517.2, 300 sec: 5521.7). Total num frames: 646237184. Throughput: 0: 5756.0. Samples: 646244842. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:26,815][25689] Avg episode reward: [(0, '-10.620')] [2022-07-10 07:43:28,703][26022] Updated weights on worker 0-0, policy_version 631101 (0.00087) [2022-07-10 07:43:30,380][26022] Updated weights on worker 0-0, policy_version 631111 (0.00084) [2022-07-10 07:43:31,884][25689] Fps is (10 sec: 5439.1, 60 sec: 5515.8, 300 sec: 5517.2). Total num frames: 646264832. Throughput: 0: 4924.2. Samples: 646261492. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:31,885][25689] Avg episode reward: [(0, '-8.901')] [2022-07-10 07:43:32,242][26022] Updated weights on worker 0-0, policy_version 631121 (0.00089) [2022-07-10 07:43:34,136][26022] Updated weights on worker 0-0, policy_version 631131 (0.00089) [2022-07-10 07:43:36,041][26022] Updated weights on worker 0-0, policy_version 631141 (0.00085) [2022-07-10 07:43:36,908][25689] Fps is (10 sec: 5579.2, 60 sec: 5518.5, 300 sec: 5523.7). Total num frames: 646293504. Throughput: 0: 5762.1. Samples: 646294984. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:36,909][25689] Avg episode reward: [(0, '-9.624')] [2022-07-10 07:43:37,927][26022] Updated weights on worker 0-0, policy_version 631151 (0.00092) [2022-07-10 07:43:39,682][26022] Updated weights on worker 0-0, policy_version 631161 (0.00088) [2022-07-10 07:43:41,474][26022] Updated weights on worker 0-0, policy_version 631171 (0.00096) [2022-07-10 07:43:41,979][25689] Fps is (10 sec: 5579.4, 60 sec: 5529.7, 300 sec: 5523.4). Total num frames: 646321152. Throughput: 0: 5762.9. Samples: 646328618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:41,979][25689] Avg episode reward: [(0, '-7.749')] [2022-07-10 07:43:43,207][26022] Updated weights on worker 0-0, policy_version 631181 (0.00086) [2022-07-10 07:43:45,286][26022] Updated weights on worker 0-0, policy_version 631191 (0.00087) [2022-07-10 07:43:46,955][26022] Updated weights on worker 0-0, policy_version 631201 (0.00087) [2022-07-10 07:43:47,096][25689] Fps is (10 sec: 5628.7, 60 sec: 5542.6, 300 sec: 5525.8). Total num frames: 646350848. Throughput: 0: 4955.8. Samples: 646345358. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:47,096][25689] Avg episode reward: [(0, '-8.640')] [2022-07-10 07:43:48,923][26022] Updated weights on worker 0-0, policy_version 631211 (0.00085) [2022-07-10 07:43:50,507][26022] Updated weights on worker 0-0, policy_version 631221 (0.00087) [2022-07-10 07:43:52,106][25689] Fps is (10 sec: 5561.3, 60 sec: 5511.0, 300 sec: 5519.1). Total num frames: 646377472. Throughput: 0: 5816.3. Samples: 646379106. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:52,106][25689] Avg episode reward: [(0, '-8.744')] [2022-07-10 07:43:52,578][26022] Updated weights on worker 0-0, policy_version 631231 (0.00088) [2022-07-10 07:43:54,322][26022] Updated weights on worker 0-0, policy_version 631241 (0.00094) [2022-07-10 07:43:56,019][26022] Updated weights on worker 0-0, policy_version 631251 (0.00098) [2022-07-10 07:43:57,138][25689] Fps is (10 sec: 5608.6, 60 sec: 5545.6, 300 sec: 5529.2). Total num frames: 646407168. Throughput: 0: 5811.5. Samples: 646412548. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:43:57,138][25689] Avg episode reward: [(0, '-8.311')] [2022-07-10 07:43:57,876][26022] Updated weights on worker 0-0, policy_version 631261 (0.00086) [2022-07-10 07:43:59,630][26022] Updated weights on worker 0-0, policy_version 631271 (0.00097) [2022-07-10 07:44:01,744][26022] Updated weights on worker 0-0, policy_version 631281 (0.00087) [2022-07-10 07:44:02,233][25689] Fps is (10 sec: 5561.6, 60 sec: 5504.1, 300 sec: 5533.0). Total num frames: 646433792. Throughput: 0: 4978.8. Samples: 646429458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:02,233][25689] Avg episode reward: [(0, '-8.199')] [2022-07-10 07:44:03,842][26022] Updated weights on worker 0-0, policy_version 631291 (0.00090) [2022-07-10 07:44:05,819][26022] Updated weights on worker 0-0, policy_version 631301 (0.00089) [2022-07-10 07:44:07,344][25689] Fps is (10 sec: 5317.5, 60 sec: 5539.8, 300 sec: 5524.8). Total num frames: 646461440. Throughput: 0: 5677.3. Samples: 646460312. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:07,345][25689] Avg episode reward: [(0, '-6.103')] [2022-07-10 07:44:07,523][26022] Updated weights on worker 0-0, policy_version 631311 (0.00087) [2022-07-10 07:44:09,600][26022] Updated weights on worker 0-0, policy_version 631321 (0.00262) [2022-07-10 07:44:11,267][26022] Updated weights on worker 0-0, policy_version 631331 (0.00095) [2022-07-10 07:44:12,350][25689] Fps is (10 sec: 5364.1, 60 sec: 5513.5, 300 sec: 5526.7). Total num frames: 646488064. Throughput: 0: 5662.3. Samples: 646493734. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:12,351][25689] Avg episode reward: [(0, '-6.888')] [2022-07-10 07:44:13,277][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:44:13,294][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000631341_646493184.pth [2022-07-10 07:44:13,295][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000629399_644504576.pth [2022-07-10 07:44:13,301][26022] Updated weights on worker 0-0, policy_version 631341 (0.00095) [2022-07-10 07:44:14,941][26022] Updated weights on worker 0-0, policy_version 631351 (0.00082) [2022-07-10 07:44:16,899][26022] Updated weights on worker 0-0, policy_version 631361 (0.00089) [2022-07-10 07:44:17,359][25689] Fps is (10 sec: 5419.0, 60 sec: 5519.5, 300 sec: 5524.1). Total num frames: 646515712. Throughput: 0: 4842.7. Samples: 646510476. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:17,360][25689] Avg episode reward: [(0, '-7.618')] [2022-07-10 07:44:18,495][26022] Updated weights on worker 0-0, policy_version 631371 (0.00103) [2022-07-10 07:44:20,463][26022] Updated weights on worker 0-0, policy_version 631381 (0.00098) [2022-07-10 07:44:22,317][26022] Updated weights on worker 0-0, policy_version 631391 (0.00090) [2022-07-10 07:44:22,388][25689] Fps is (10 sec: 5610.9, 60 sec: 5519.2, 300 sec: 5525.8). Total num frames: 646544384. Throughput: 0: 5675.8. Samples: 646543854. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:22,388][25689] Avg episode reward: [(0, '-8.876')] [2022-07-10 07:44:24,194][26022] Updated weights on worker 0-0, policy_version 631401 (0.00085) [2022-07-10 07:44:26,106][26022] Updated weights on worker 0-0, policy_version 631411 (0.00087) [2022-07-10 07:44:27,459][25689] Fps is (10 sec: 5678.0, 60 sec: 5538.3, 300 sec: 5528.0). Total num frames: 646573056. Throughput: 0: 5805.1. Samples: 646577078. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:27,459][25689] Avg episode reward: [(0, '-10.181')] [2022-07-10 07:44:27,777][26022] Updated weights on worker 0-0, policy_version 631421 (0.00095) [2022-07-10 07:44:29,858][26022] Updated weights on worker 0-0, policy_version 631431 (0.00091) [2022-07-10 07:44:31,703][26022] Updated weights on worker 0-0, policy_version 631441 (0.00097) [2022-07-10 07:44:32,489][25689] Fps is (10 sec: 5474.1, 60 sec: 5525.1, 300 sec: 5524.2). Total num frames: 646599680. Throughput: 0: 4969.1. Samples: 646593806. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:32,490][25689] Avg episode reward: [(0, '-10.504')] [2022-07-10 07:44:33,477][26022] Updated weights on worker 0-0, policy_version 631451 (0.00090) [2022-07-10 07:44:35,311][26022] Updated weights on worker 0-0, policy_version 631461 (0.00098) [2022-07-10 07:44:37,164][26022] Updated weights on worker 0-0, policy_version 631471 (0.00087) [2022-07-10 07:44:37,516][25689] Fps is (10 sec: 5396.4, 60 sec: 5508.0, 300 sec: 5520.8). Total num frames: 646627328. Throughput: 0: 5772.1. Samples: 646626820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:37,516][25689] Avg episode reward: [(0, '-10.012')] [2022-07-10 07:44:39,044][26022] Updated weights on worker 0-0, policy_version 631481 (0.00097) [2022-07-10 07:44:41,023][26022] Updated weights on worker 0-0, policy_version 631491 (0.00067) [2022-07-10 07:44:42,548][25689] Fps is (10 sec: 5497.0, 60 sec: 5511.4, 300 sec: 5521.6). Total num frames: 646654976. Throughput: 0: 5770.6. Samples: 646660194. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:42,549][25689] Avg episode reward: [(0, '-12.445')] [2022-07-10 07:44:42,847][26022] Updated weights on worker 0-0, policy_version 631501 (0.00098) [2022-07-10 07:44:44,525][26022] Updated weights on worker 0-0, policy_version 631511 (0.00085) [2022-07-10 07:44:46,403][26022] Updated weights on worker 0-0, policy_version 631521 (0.00089) [2022-07-10 07:44:47,630][25689] Fps is (10 sec: 5669.7, 60 sec: 5514.7, 300 sec: 5527.4). Total num frames: 646684672. Throughput: 0: 5785.4. Samples: 646693776. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:47,630][25689] Avg episode reward: [(0, '-10.864')] [2022-07-10 07:44:48,140][26022] Updated weights on worker 0-0, policy_version 631531 (0.00079) [2022-07-10 07:44:50,209][26022] Updated weights on worker 0-0, policy_version 631541 (0.00094) [2022-07-10 07:44:51,840][26022] Updated weights on worker 0-0, policy_version 631551 (0.00085) [2022-07-10 07:44:52,678][25689] Fps is (10 sec: 5661.3, 60 sec: 5528.1, 300 sec: 5523.7). Total num frames: 646712320. Throughput: 0: 5787.9. Samples: 646710654. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:52,678][25689] Avg episode reward: [(0, '-8.053')] [2022-07-10 07:44:54,001][26022] Updated weights on worker 0-0, policy_version 631561 (0.00083) [2022-07-10 07:44:55,421][26022] Updated weights on worker 0-0, policy_version 631571 (0.00091) [2022-07-10 07:44:57,580][26022] Updated weights on worker 0-0, policy_version 631581 (0.00091) [2022-07-10 07:44:57,733][25689] Fps is (10 sec: 5473.1, 60 sec: 5492.2, 300 sec: 5523.8). Total num frames: 646739968. Throughput: 0: 5800.8. Samples: 646744096. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:44:57,734][25689] Avg episode reward: [(0, '-9.404')] [2022-07-10 07:44:59,247][26022] Updated weights on worker 0-0, policy_version 631591 (0.00084) [2022-07-10 07:45:01,250][26022] Updated weights on worker 0-0, policy_version 631601 (0.00098) [2022-07-10 07:45:02,769][25689] Fps is (10 sec: 5377.8, 60 sec: 5497.5, 300 sec: 5532.9). Total num frames: 646766592. Throughput: 0: 5680.2. Samples: 646775054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:02,770][25689] Avg episode reward: [(0, '-8.515')] [2022-07-10 07:45:03,106][26022] Updated weights on worker 0-0, policy_version 631611 (0.00092) [2022-07-10 07:45:05,368][26022] Updated weights on worker 0-0, policy_version 631621 (0.00093) [2022-07-10 07:45:07,173][26022] Updated weights on worker 0-0, policy_version 631631 (0.00087) [2022-07-10 07:45:07,905][25689] Fps is (10 sec: 5234.9, 60 sec: 5478.4, 300 sec: 5520.4). Total num frames: 646793216. Throughput: 0: 4818.7. Samples: 646791476. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:07,905][25689] Avg episode reward: [(0, '-6.904')] [2022-07-10 07:45:08,977][26022] Updated weights on worker 0-0, policy_version 631641 (0.00098) [2022-07-10 07:45:10,896][26022] Updated weights on worker 0-0, policy_version 631651 (0.00095) [2022-07-10 07:45:12,730][26022] Updated weights on worker 0-0, policy_version 631661 (0.00097) [2022-07-10 07:45:12,918][25689] Fps is (10 sec: 5448.8, 60 sec: 5511.6, 300 sec: 5524.9). Total num frames: 646821888. Throughput: 0: 5623.0. Samples: 646824464. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:12,918][25689] Avg episode reward: [(0, '-6.880')] [2022-07-10 07:45:14,509][26022] Updated weights on worker 0-0, policy_version 631671 (0.00094) [2022-07-10 07:45:16,497][26022] Updated weights on worker 0-0, policy_version 631681 (0.00091) [2022-07-10 07:45:17,958][25689] Fps is (10 sec: 5602.0, 60 sec: 5508.7, 300 sec: 5525.0). Total num frames: 646849536. Throughput: 0: 5625.3. Samples: 646857870. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:17,959][25689] Avg episode reward: [(0, '-6.870')] [2022-07-10 07:45:18,202][26022] Updated weights on worker 0-0, policy_version 631691 (0.00091) [2022-07-10 07:45:20,148][26022] Updated weights on worker 0-0, policy_version 631701 (0.00091) [2022-07-10 07:45:21,966][26022] Updated weights on worker 0-0, policy_version 631711 (0.00088) [2022-07-10 07:45:22,989][25689] Fps is (10 sec: 5490.4, 60 sec: 5491.6, 300 sec: 5519.0). Total num frames: 646877184. Throughput: 0: 4921.6. Samples: 646874568. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:22,990][25689] Avg episode reward: [(0, '-8.017')] [2022-07-10 07:45:23,827][26022] Updated weights on worker 0-0, policy_version 631721 (0.00085) [2022-07-10 07:45:25,598][26022] Updated weights on worker 0-0, policy_version 631731 (0.00091) [2022-07-10 07:45:27,395][26022] Updated weights on worker 0-0, policy_version 631741 (0.00085) [2022-07-10 07:45:28,098][25689] Fps is (10 sec: 5554.3, 60 sec: 5488.1, 300 sec: 5525.4). Total num frames: 646905856. Throughput: 0: 5768.5. Samples: 646907962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:28,104][25689] Avg episode reward: [(0, '-9.210')] [2022-07-10 07:45:29,279][26022] Updated weights on worker 0-0, policy_version 631751 (0.00092) [2022-07-10 07:45:31,138][26022] Updated weights on worker 0-0, policy_version 631761 (0.00085) [2022-07-10 07:45:33,032][26022] Updated weights on worker 0-0, policy_version 631771 (0.00087) [2022-07-10 07:45:33,147][25689] Fps is (10 sec: 5544.5, 60 sec: 5503.4, 300 sec: 5521.2). Total num frames: 646933504. Throughput: 0: 5783.1. Samples: 646941452. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:33,147][25689] Avg episode reward: [(0, '-8.300')] [2022-07-10 07:45:34,756][26022] Updated weights on worker 0-0, policy_version 631781 (0.00098) [2022-07-10 07:45:36,682][26022] Updated weights on worker 0-0, policy_version 631791 (0.00093) [2022-07-10 07:45:38,159][25689] Fps is (10 sec: 5496.5, 60 sec: 5504.8, 300 sec: 5521.3). Total num frames: 646961152. Throughput: 0: 4977.3. Samples: 646958412. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:38,159][25689] Avg episode reward: [(0, '-9.282')] [2022-07-10 07:45:38,562][26022] Updated weights on worker 0-0, policy_version 631801 (0.00617) [2022-07-10 07:45:40,213][26022] Updated weights on worker 0-0, policy_version 631811 (0.00083) [2022-07-10 07:45:42,423][26022] Updated weights on worker 0-0, policy_version 631821 (0.00083) [2022-07-10 07:45:43,180][25689] Fps is (10 sec: 5715.3, 60 sec: 5539.5, 300 sec: 5525.4). Total num frames: 646990848. Throughput: 0: 5791.0. Samples: 646991496. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:43,181][25689] Avg episode reward: [(0, '-10.272')] [2022-07-10 07:45:43,968][26022] Updated weights on worker 0-0, policy_version 631831 (0.00151) [2022-07-10 07:45:45,845][26022] Updated weights on worker 0-0, policy_version 631841 (0.00093) [2022-07-10 07:45:47,799][26022] Updated weights on worker 0-0, policy_version 631851 (0.00087) [2022-07-10 07:45:48,242][25689] Fps is (10 sec: 5585.3, 60 sec: 5490.6, 300 sec: 5518.2). Total num frames: 647017472. Throughput: 0: 5810.3. Samples: 647025004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:48,243][25689] Avg episode reward: [(0, '-10.250')] [2022-07-10 07:45:49,321][26022] Updated weights on worker 0-0, policy_version 631861 (0.00086) [2022-07-10 07:45:51,613][26022] Updated weights on worker 0-0, policy_version 631871 (0.00087) [2022-07-10 07:45:53,014][26022] Updated weights on worker 0-0, policy_version 631881 (0.00083) [2022-07-10 07:45:53,258][25689] Fps is (10 sec: 5588.5, 60 sec: 5527.3, 300 sec: 5528.3). Total num frames: 647047168. Throughput: 0: 4976.2. Samples: 647041530. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:53,259][25689] Avg episode reward: [(0, '-8.652')] [2022-07-10 07:45:55,273][26022] Updated weights on worker 0-0, policy_version 631891 (0.00085) [2022-07-10 07:45:56,847][26022] Updated weights on worker 0-0, policy_version 631901 (0.00084) [2022-07-10 07:45:58,278][25689] Fps is (10 sec: 5510.2, 60 sec: 5496.8, 300 sec: 5518.2). Total num frames: 647072768. Throughput: 0: 5788.8. Samples: 647074876. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 07:45:58,278][25689] Avg episode reward: [(0, '-7.987')] [2022-07-10 07:45:58,797][26022] Updated weights on worker 0-0, policy_version 631911 (0.01146) [2022-07-10 07:46:00,488][26022] Updated weights on worker 0-0, policy_version 631921 (0.00091) [2022-07-10 07:46:02,812][26022] Updated weights on worker 0-0, policy_version 631931 (0.00091) [2022-07-10 07:46:03,320][25689] Fps is (10 sec: 5190.2, 60 sec: 5496.2, 300 sec: 5525.2). Total num frames: 647099392. Throughput: 0: 5695.7. Samples: 647106206. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:03,322][25689] Avg episode reward: [(0, '-8.474')] [2022-07-10 07:46:04,716][26022] Updated weights on worker 0-0, policy_version 631941 (0.00092) [2022-07-10 07:46:06,419][26022] Updated weights on worker 0-0, policy_version 631951 (0.00084) [2022-07-10 07:46:08,437][25689] Fps is (10 sec: 5442.8, 60 sec: 5531.7, 300 sec: 5520.1). Total num frames: 647128064. Throughput: 0: 4836.1. Samples: 647122668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:08,438][25689] Avg episode reward: [(0, '-8.484')] [2022-07-10 07:46:08,450][26022] Updated weights on worker 0-0, policy_version 631961 (0.00190) [2022-07-10 07:46:10,414][26022] Updated weights on worker 0-0, policy_version 631971 (0.00085) [2022-07-10 07:46:12,108][26022] Updated weights on worker 0-0, policy_version 631981 (0.00085) [2022-07-10 07:46:13,376][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:46:13,390][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000631988_647155712.pth [2022-07-10 07:46:13,391][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000630044_645165056.pth [2022-07-10 07:46:13,531][25689] Fps is (10 sec: 5515.6, 60 sec: 5507.4, 300 sec: 5522.3). Total num frames: 647155712. Throughput: 0: 5635.3. Samples: 647155774. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:13,532][25689] Avg episode reward: [(0, '-7.775')] [2022-07-10 07:46:14,202][26022] Updated weights on worker 0-0, policy_version 631991 (0.00091) [2022-07-10 07:46:15,770][26022] Updated weights on worker 0-0, policy_version 632001 (0.00085) [2022-07-10 07:46:17,731][26022] Updated weights on worker 0-0, policy_version 632011 (0.00581) [2022-07-10 07:46:18,558][25689] Fps is (10 sec: 5665.8, 60 sec: 5542.5, 300 sec: 5525.3). Total num frames: 647185408. Throughput: 0: 5643.2. Samples: 647189324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:18,559][25689] Avg episode reward: [(0, '-8.354')] [2022-07-10 07:46:19,551][26022] Updated weights on worker 0-0, policy_version 632021 (0.00091) [2022-07-10 07:46:21,389][26022] Updated weights on worker 0-0, policy_version 632031 (0.00092) [2022-07-10 07:46:23,111][26022] Updated weights on worker 0-0, policy_version 632041 (0.00083) [2022-07-10 07:46:23,648][25689] Fps is (10 sec: 5566.7, 60 sec: 5520.1, 300 sec: 5519.2). Total num frames: 647212032. Throughput: 0: 4917.6. Samples: 647206172. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:23,649][25689] Avg episode reward: [(0, '-7.928')] [2022-07-10 07:46:24,842][26022] Updated weights on worker 0-0, policy_version 632051 (0.00088) [2022-07-10 07:46:26,803][26022] Updated weights on worker 0-0, policy_version 632061 (0.00094) [2022-07-10 07:46:28,554][26022] Updated weights on worker 0-0, policy_version 632071 (0.00090) [2022-07-10 07:46:28,774][25689] Fps is (10 sec: 5412.6, 60 sec: 5518.6, 300 sec: 5521.2). Total num frames: 647240704. Throughput: 0: 5732.6. Samples: 647239254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:28,775][25689] Avg episode reward: [(0, '-7.685')] [2022-07-10 07:46:30,630][26022] Updated weights on worker 0-0, policy_version 632081 (0.00054) [2022-07-10 07:46:32,486][26022] Updated weights on worker 0-0, policy_version 632091 (0.00976) [2022-07-10 07:46:33,854][25689] Fps is (10 sec: 5518.7, 60 sec: 5515.8, 300 sec: 5520.0). Total num frames: 647268352. Throughput: 0: 5749.0. Samples: 647272608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:33,854][25689] Avg episode reward: [(0, '-8.379')] [2022-07-10 07:46:34,196][26022] Updated weights on worker 0-0, policy_version 632101 (0.00096) [2022-07-10 07:46:36,091][26022] Updated weights on worker 0-0, policy_version 632111 (0.00098) [2022-07-10 07:46:38,034][26022] Updated weights on worker 0-0, policy_version 632121 (0.00089) [2022-07-10 07:46:38,883][25689] Fps is (10 sec: 5571.7, 60 sec: 5531.1, 300 sec: 5519.7). Total num frames: 647297024. Throughput: 0: 5727.6. Samples: 647305734. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:38,883][25689] Avg episode reward: [(0, '-7.800')] [2022-07-10 07:46:39,824][26022] Updated weights on worker 0-0, policy_version 632131 (0.00086) [2022-07-10 07:46:41,828][26022] Updated weights on worker 0-0, policy_version 632141 (0.00087) [2022-07-10 07:46:43,491][26022] Updated weights on worker 0-0, policy_version 632151 (0.00096) [2022-07-10 07:46:43,954][25689] Fps is (10 sec: 5677.6, 60 sec: 5509.8, 300 sec: 5520.8). Total num frames: 647325696. Throughput: 0: 5726.6. Samples: 647322452. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:43,954][25689] Avg episode reward: [(0, '-6.749')] [2022-07-10 07:46:45,307][26022] Updated weights on worker 0-0, policy_version 632161 (0.00093) [2022-07-10 07:46:47,122][26022] Updated weights on worker 0-0, policy_version 632171 (0.00090) [2022-07-10 07:46:49,062][25689] Fps is (10 sec: 5432.0, 60 sec: 5505.6, 300 sec: 5512.7). Total num frames: 647352320. Throughput: 0: 5751.1. Samples: 647355932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:49,063][25689] Avg episode reward: [(0, '-8.323')] [2022-07-10 07:46:49,133][26022] Updated weights on worker 0-0, policy_version 632181 (0.00084) [2022-07-10 07:46:50,951][26022] Updated weights on worker 0-0, policy_version 632191 (0.00084) [2022-07-10 07:46:52,734][26022] Updated weights on worker 0-0, policy_version 632201 (0.00085) [2022-07-10 07:46:54,120][25689] Fps is (10 sec: 5439.1, 60 sec: 5484.9, 300 sec: 5512.2). Total num frames: 647380992. Throughput: 0: 5771.2. Samples: 647389570. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:54,121][25689] Avg episode reward: [(0, '-8.456')] [2022-07-10 07:46:54,728][26022] Updated weights on worker 0-0, policy_version 632211 (0.00092) [2022-07-10 07:46:56,190][26022] Updated weights on worker 0-0, policy_version 632221 (0.00099) [2022-07-10 07:46:58,417][26022] Updated weights on worker 0-0, policy_version 632231 (0.00091) [2022-07-10 07:46:59,134][25689] Fps is (10 sec: 5693.6, 60 sec: 5535.9, 300 sec: 5519.9). Total num frames: 647409664. Throughput: 0: 4971.0. Samples: 647406404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:46:59,134][25689] Avg episode reward: [(0, '-8.379')] [2022-07-10 07:46:59,902][26022] Updated weights on worker 0-0, policy_version 632241 (0.00085) [2022-07-10 07:47:02,418][26022] Updated weights on worker 0-0, policy_version 632251 (0.00094) [2022-07-10 07:47:04,008][26022] Updated weights on worker 0-0, policy_version 632261 (0.00090) [2022-07-10 07:47:04,158][25689] Fps is (10 sec: 5508.6, 60 sec: 5537.6, 300 sec: 5520.9). Total num frames: 647436288. Throughput: 0: 5714.4. Samples: 647437908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:04,159][25689] Avg episode reward: [(0, '-9.576')] [2022-07-10 07:47:05,822][26022] Updated weights on worker 0-0, policy_version 632271 (0.00091) [2022-07-10 07:47:07,646][26022] Updated weights on worker 0-0, policy_version 632281 (0.00095) [2022-07-10 07:47:09,301][25689] Fps is (10 sec: 5337.8, 60 sec: 5518.4, 300 sec: 5515.2). Total num frames: 647463936. Throughput: 0: 5678.1. Samples: 647470852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:09,302][25689] Avg episode reward: [(0, '-9.766')] [2022-07-10 07:47:09,688][26022] Updated weights on worker 0-0, policy_version 632291 (0.00093) [2022-07-10 07:47:11,335][26022] Updated weights on worker 0-0, policy_version 632301 (0.00085) [2022-07-10 07:47:13,700][26022] Updated weights on worker 0-0, policy_version 632311 (0.00085) [2022-07-10 07:47:14,350][25689] Fps is (10 sec: 5325.2, 60 sec: 5505.7, 300 sec: 5511.6). Total num frames: 647490560. Throughput: 0: 4848.9. Samples: 647487662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:14,350][25689] Avg episode reward: [(0, '-6.795')] [2022-07-10 07:47:15,074][26022] Updated weights on worker 0-0, policy_version 632321 (0.00093) [2022-07-10 07:47:17,113][26022] Updated weights on worker 0-0, policy_version 632331 (0.00086) [2022-07-10 07:47:18,654][26022] Updated weights on worker 0-0, policy_version 632341 (0.00085) [2022-07-10 07:47:19,378][25689] Fps is (10 sec: 5589.2, 60 sec: 5505.6, 300 sec: 5518.4). Total num frames: 647520256. Throughput: 0: 5671.7. Samples: 647521222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:19,381][25689] Avg episode reward: [(0, '-7.092')] [2022-07-10 07:47:20,993][26022] Updated weights on worker 0-0, policy_version 632351 (0.00087) [2022-07-10 07:47:22,444][26022] Updated weights on worker 0-0, policy_version 632361 (0.00087) [2022-07-10 07:47:24,433][25689] Fps is (10 sec: 5686.8, 60 sec: 5525.6, 300 sec: 5516.3). Total num frames: 647547904. Throughput: 0: 5762.2. Samples: 647554738. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:24,434][25689] Avg episode reward: [(0, '-10.031')] [2022-07-10 07:47:24,441][26022] Updated weights on worker 0-0, policy_version 632371 (0.00096) [2022-07-10 07:47:26,083][26022] Updated weights on worker 0-0, policy_version 632381 (0.00087) [2022-07-10 07:47:28,204][26022] Updated weights on worker 0-0, policy_version 632391 (0.00087) [2022-07-10 07:47:29,493][25689] Fps is (10 sec: 5466.5, 60 sec: 5514.7, 300 sec: 5516.2). Total num frames: 647575552. Throughput: 0: 4970.3. Samples: 647571212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:29,494][25689] Avg episode reward: [(0, '-10.123')] [2022-07-10 07:47:29,875][26022] Updated weights on worker 0-0, policy_version 632401 (0.00070) [2022-07-10 07:47:31,807][26022] Updated weights on worker 0-0, policy_version 632411 (0.00090) [2022-07-10 07:47:33,526][26022] Updated weights on worker 0-0, policy_version 632421 (0.00096) [2022-07-10 07:47:34,503][25689] Fps is (10 sec: 5593.0, 60 sec: 5537.9, 300 sec: 5517.0). Total num frames: 647604224. Throughput: 0: 5802.4. Samples: 647604600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:34,504][25689] Avg episode reward: [(0, '-9.770')] [2022-07-10 07:47:35,583][26022] Updated weights on worker 0-0, policy_version 632432 (0.00097) [2022-07-10 07:47:37,551][26022] Updated weights on worker 0-0, policy_version 632442 (0.00087) [2022-07-10 07:47:39,261][26022] Updated weights on worker 0-0, policy_version 632452 (0.00088) [2022-07-10 07:47:39,575][25689] Fps is (10 sec: 5586.5, 60 sec: 5517.1, 300 sec: 5519.2). Total num frames: 647631872. Throughput: 0: 5783.7. Samples: 647638036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:39,575][25689] Avg episode reward: [(0, '-12.794')] [2022-07-10 07:47:41,127][26022] Updated weights on worker 0-0, policy_version 632462 (0.00098) [2022-07-10 07:47:42,909][26022] Updated weights on worker 0-0, policy_version 632472 (0.00088) [2022-07-10 07:47:44,628][25689] Fps is (10 sec: 5562.3, 60 sec: 5518.7, 300 sec: 5519.6). Total num frames: 647660544. Throughput: 0: 4965.6. Samples: 647655018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:44,629][25689] Avg episode reward: [(0, '-12.610')] [2022-07-10 07:47:44,689][26022] Updated weights on worker 0-0, policy_version 632482 (0.00087) [2022-07-10 07:47:46,729][26022] Updated weights on worker 0-0, policy_version 632492 (0.00091) [2022-07-10 07:47:48,430][26022] Updated weights on worker 0-0, policy_version 632502 (0.00090) [2022-07-10 07:47:49,709][25689] Fps is (10 sec: 5557.7, 60 sec: 5538.2, 300 sec: 5515.3). Total num frames: 647688192. Throughput: 0: 5795.6. Samples: 647688372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:49,709][25689] Avg episode reward: [(0, '-9.396')] [2022-07-10 07:47:50,422][26022] Updated weights on worker 0-0, policy_version 632512 (0.00086) [2022-07-10 07:47:52,080][26022] Updated weights on worker 0-0, policy_version 632522 (0.00091) [2022-07-10 07:47:53,912][26022] Updated weights on worker 0-0, policy_version 632532 (0.00090) [2022-07-10 07:47:54,755][25689] Fps is (10 sec: 5460.6, 60 sec: 5522.4, 300 sec: 5515.2). Total num frames: 647715840. Throughput: 0: 5787.1. Samples: 647721798. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:54,755][25689] Avg episode reward: [(0, '-8.781')] [2022-07-10 07:47:55,831][26022] Updated weights on worker 0-0, policy_version 632542 (0.00090) [2022-07-10 07:47:57,700][26022] Updated weights on worker 0-0, policy_version 632552 (0.00092) [2022-07-10 07:47:59,540][26022] Updated weights on worker 0-0, policy_version 632562 (0.00084) [2022-07-10 07:47:59,767][25689] Fps is (10 sec: 5599.4, 60 sec: 5522.5, 300 sec: 5515.2). Total num frames: 647744512. Throughput: 0: 4974.6. Samples: 647738482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:47:59,767][25689] Avg episode reward: [(0, '-9.101')] [2022-07-10 07:48:01,559][26022] Updated weights on worker 0-0, policy_version 632572 (0.00086) [2022-07-10 07:48:03,439][26022] Updated weights on worker 0-0, policy_version 632582 (0.00089) [2022-07-10 07:48:04,774][25689] Fps is (10 sec: 5416.8, 60 sec: 5507.2, 300 sec: 5517.5). Total num frames: 647770112. Throughput: 0: 5696.1. Samples: 647769768. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:04,774][25689] Avg episode reward: [(0, '-8.179')] [2022-07-10 07:48:05,607][26022] Updated weights on worker 0-0, policy_version 632592 (0.00083) [2022-07-10 07:48:07,120][26022] Updated weights on worker 0-0, policy_version 632602 (0.00086) [2022-07-10 07:48:09,185][26022] Updated weights on worker 0-0, policy_version 632612 (0.00087) [2022-07-10 07:48:09,856][25689] Fps is (10 sec: 5277.7, 60 sec: 5512.8, 300 sec: 5514.2). Total num frames: 647797760. Throughput: 0: 5694.8. Samples: 647803108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:09,857][25689] Avg episode reward: [(0, '-5.698')] [2022-07-10 07:48:10,842][26022] Updated weights on worker 0-0, policy_version 632622 (0.00093) [2022-07-10 07:48:12,931][26022] Updated weights on worker 0-0, policy_version 632632 (0.00082) [2022-07-10 07:48:13,649][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:48:13,675][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000632636_647819264.pth [2022-07-10 07:48:13,675][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000630694_645830656.pth [2022-07-10 07:48:14,656][26022] Updated weights on worker 0-0, policy_version 632642 (0.00090) [2022-07-10 07:48:14,875][25689] Fps is (10 sec: 5575.3, 60 sec: 5549.2, 300 sec: 5518.7). Total num frames: 647826432. Throughput: 0: 4864.2. Samples: 647819670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:14,876][25689] Avg episode reward: [(0, '-5.883')] [2022-07-10 07:48:16,435][26022] Updated weights on worker 0-0, policy_version 632652 (0.00094) [2022-07-10 07:48:18,358][26022] Updated weights on worker 0-0, policy_version 632662 (0.00083) [2022-07-10 07:48:19,880][25689] Fps is (10 sec: 5720.7, 60 sec: 5534.5, 300 sec: 5519.0). Total num frames: 647855104. Throughput: 0: 5702.1. Samples: 647853170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:19,882][25689] Avg episode reward: [(0, '-6.703')] [2022-07-10 07:48:20,186][26022] Updated weights on worker 0-0, policy_version 632672 (0.00084) [2022-07-10 07:48:22,019][26022] Updated weights on worker 0-0, policy_version 632682 (0.00094) [2022-07-10 07:48:23,767][26022] Updated weights on worker 0-0, policy_version 632692 (0.00083) [2022-07-10 07:48:24,889][25689] Fps is (10 sec: 5420.1, 60 sec: 5504.9, 300 sec: 5513.8). Total num frames: 647880704. Throughput: 0: 5824.0. Samples: 647886916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:24,889][25689] Avg episode reward: [(0, '-7.940')] [2022-07-10 07:48:25,671][26022] Updated weights on worker 0-0, policy_version 632702 (0.00093) [2022-07-10 07:48:27,630][26022] Updated weights on worker 0-0, policy_version 632712 (0.00088) [2022-07-10 07:48:29,302][26022] Updated weights on worker 0-0, policy_version 632722 (0.00091) [2022-07-10 07:48:30,003][25689] Fps is (10 sec: 5462.5, 60 sec: 5533.8, 300 sec: 5519.8). Total num frames: 647910400. Throughput: 0: 4964.0. Samples: 647903116. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:30,004][25689] Avg episode reward: [(0, '-8.377')] [2022-07-10 07:48:31,323][26022] Updated weights on worker 0-0, policy_version 632732 (0.00082) [2022-07-10 07:48:33,151][26022] Updated weights on worker 0-0, policy_version 632742 (0.00090) [2022-07-10 07:48:34,952][26022] Updated weights on worker 0-0, policy_version 632752 (0.00092) [2022-07-10 07:48:35,022][25689] Fps is (10 sec: 5658.9, 60 sec: 5516.0, 300 sec: 5516.5). Total num frames: 647938048. Throughput: 0: 5795.1. Samples: 647936422. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:35,023][25689] Avg episode reward: [(0, '-8.015')] [2022-07-10 07:48:36,778][26022] Updated weights on worker 0-0, policy_version 632762 (0.00085) [2022-07-10 07:48:38,538][26022] Updated weights on worker 0-0, policy_version 632772 (0.00085) [2022-07-10 07:48:40,046][25689] Fps is (10 sec: 5404.0, 60 sec: 5503.5, 300 sec: 5513.9). Total num frames: 647964672. Throughput: 0: 5786.0. Samples: 647969850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:40,047][25689] Avg episode reward: [(0, '-8.943')] [2022-07-10 07:48:40,657][26022] Updated weights on worker 0-0, policy_version 632782 (0.00092) [2022-07-10 07:48:42,293][26022] Updated weights on worker 0-0, policy_version 632792 (0.00086) [2022-07-10 07:48:44,285][26022] Updated weights on worker 0-0, policy_version 632802 (0.00100) [2022-07-10 07:48:45,052][25689] Fps is (10 sec: 5615.2, 60 sec: 5524.7, 300 sec: 5516.0). Total num frames: 647994368. Throughput: 0: 5763.9. Samples: 648003136. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:45,052][25689] Avg episode reward: [(0, '-9.876')] [2022-07-10 07:48:46,212][26022] Updated weights on worker 0-0, policy_version 632812 (0.00091) [2022-07-10 07:48:47,897][26022] Updated weights on worker 0-0, policy_version 632822 (0.00086) [2022-07-10 07:48:49,779][26022] Updated weights on worker 0-0, policy_version 632832 (0.00086) [2022-07-10 07:48:50,099][25689] Fps is (10 sec: 5704.1, 60 sec: 5527.8, 300 sec: 5518.7). Total num frames: 648022016. Throughput: 0: 5811.9. Samples: 648019912. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:50,099][25689] Avg episode reward: [(0, '-8.969')] [2022-07-10 07:48:51,644][26022] Updated weights on worker 0-0, policy_version 632842 (0.00087) [2022-07-10 07:48:53,280][26022] Updated weights on worker 0-0, policy_version 632852 (0.00085) [2022-07-10 07:48:55,123][25689] Fps is (10 sec: 5490.6, 60 sec: 5529.8, 300 sec: 5512.0). Total num frames: 648049664. Throughput: 0: 5802.2. Samples: 648053052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:48:55,123][25689] Avg episode reward: [(0, '-7.266')] [2022-07-10 07:48:55,524][26022] Updated weights on worker 0-0, policy_version 632862 (0.00087) [2022-07-10 07:48:57,023][26022] Updated weights on worker 0-0, policy_version 632872 (0.00084) [2022-07-10 07:48:59,090][26022] Updated weights on worker 0-0, policy_version 632882 (0.00103) [2022-07-10 07:49:00,144][25689] Fps is (10 sec: 5504.7, 60 sec: 5512.0, 300 sec: 5516.8). Total num frames: 648077312. Throughput: 0: 5801.0. Samples: 648086440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:49:00,145][25689] Avg episode reward: [(0, '-6.825')] [2022-07-10 07:49:00,761][26022] Updated weights on worker 0-0, policy_version 632892 (0.00087) [2022-07-10 07:49:03,025][26022] Updated weights on worker 0-0, policy_version 632902 (0.00087) [2022-07-10 07:49:05,014][26022] Updated weights on worker 0-0, policy_version 632912 (0.00090) [2022-07-10 07:49:05,151][25689] Fps is (10 sec: 5310.1, 60 sec: 5512.1, 300 sec: 5511.9). Total num frames: 648102912. Throughput: 0: 4879.9. Samples: 648101216. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:49:05,151][25689] Avg episode reward: [(0, '-7.000')] [2022-07-10 07:49:06,855][26022] Updated weights on worker 0-0, policy_version 632922 (0.00096) [2022-07-10 07:49:08,543][26022] Updated weights on worker 0-0, policy_version 632932 (0.00092) [2022-07-10 07:49:10,206][25689] Fps is (10 sec: 5190.3, 60 sec: 5497.5, 300 sec: 5511.0). Total num frames: 648129536. Throughput: 0: 5690.7. Samples: 648134336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 07:49:10,207][25689] Avg episode reward: [(0, '-7.046')] [2022-07-10 07:49:10,662][26022] Updated weights on worker 0-0, policy_version 632942 (0.00085) [2022-07-10 07:49:12,235][26022] Updated weights on worker 0-0, policy_version 632952 (0.00087) [2022-07-10 07:49:14,392][26022] Updated weights on worker 0-0, policy_version 632962 (0.00094) [2022-07-10 07:49:15,211][25689] Fps is (10 sec: 5598.1, 60 sec: 5515.8, 300 sec: 5517.9). Total num frames: 648159232. Throughput: 0: 5698.6. Samples: 648167526. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:15,212][25689] Avg episode reward: [(0, '-6.581')] [2022-07-10 07:49:16,147][26022] Updated weights on worker 0-0, policy_version 632972 (0.00088) [2022-07-10 07:49:17,771][26022] Updated weights on worker 0-0, policy_version 632982 (0.00090) [2022-07-10 07:49:19,744][26022] Updated weights on worker 0-0, policy_version 632992 (0.00085) [2022-07-10 07:49:20,223][25689] Fps is (10 sec: 5622.6, 60 sec: 5481.2, 300 sec: 5511.3). Total num frames: 648185856. Throughput: 0: 4870.1. Samples: 648184224. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:20,223][25689] Avg episode reward: [(0, '-7.461')] [2022-07-10 07:49:21,343][26022] Updated weights on worker 0-0, policy_version 633002 (0.00091) [2022-07-10 07:49:23,539][26022] Updated weights on worker 0-0, policy_version 633012 (0.00083) [2022-07-10 07:49:25,174][26022] Updated weights on worker 0-0, policy_version 633022 (0.00091) [2022-07-10 07:49:25,256][25689] Fps is (10 sec: 5504.8, 60 sec: 5529.9, 300 sec: 5512.0). Total num frames: 648214528. Throughput: 0: 5793.8. Samples: 648217704. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:25,257][25689] Avg episode reward: [(0, '-8.824')] [2022-07-10 07:49:27,105][26022] Updated weights on worker 0-0, policy_version 633032 (0.00104) [2022-07-10 07:49:28,932][26022] Updated weights on worker 0-0, policy_version 633042 (0.00091) [2022-07-10 07:49:30,327][25689] Fps is (10 sec: 5573.9, 60 sec: 5499.9, 300 sec: 5514.7). Total num frames: 648242176. Throughput: 0: 5783.0. Samples: 648250694. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:30,329][25689] Avg episode reward: [(0, '-8.348')] [2022-07-10 07:49:31,020][26022] Updated weights on worker 0-0, policy_version 633052 (0.00095) [2022-07-10 07:49:32,692][26022] Updated weights on worker 0-0, policy_version 633062 (0.00081) [2022-07-10 07:49:34,520][26022] Updated weights on worker 0-0, policy_version 633072 (0.00092) [2022-07-10 07:49:35,351][25689] Fps is (10 sec: 5477.9, 60 sec: 5499.5, 300 sec: 5514.8). Total num frames: 648269824. Throughput: 0: 4966.1. Samples: 648267540. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:35,352][25689] Avg episode reward: [(0, '-8.598')] [2022-07-10 07:49:36,345][26022] Updated weights on worker 0-0, policy_version 633082 (0.00086) [2022-07-10 07:49:38,172][26022] Updated weights on worker 0-0, policy_version 633092 (0.01199) [2022-07-10 07:49:39,918][26022] Updated weights on worker 0-0, policy_version 633102 (0.00094) [2022-07-10 07:49:40,438][25689] Fps is (10 sec: 5570.0, 60 sec: 5527.6, 300 sec: 5517.2). Total num frames: 648298496. Throughput: 0: 5775.5. Samples: 648300978. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:40,439][25689] Avg episode reward: [(0, '-8.910')] [2022-07-10 07:49:41,903][26022] Updated weights on worker 0-0, policy_version 633112 (0.00084) [2022-07-10 07:49:43,654][26022] Updated weights on worker 0-0, policy_version 633122 (0.00086) [2022-07-10 07:49:45,445][25689] Fps is (10 sec: 5478.1, 60 sec: 5476.7, 300 sec: 5508.3). Total num frames: 648325120. Throughput: 0: 5764.5. Samples: 648334080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:45,445][25689] Avg episode reward: [(0, '-8.182')] [2022-07-10 07:49:45,645][26022] Updated weights on worker 0-0, policy_version 633132 (0.00092) [2022-07-10 07:49:47,485][26022] Updated weights on worker 0-0, policy_version 633142 (0.00091) [2022-07-10 07:49:49,319][26022] Updated weights on worker 0-0, policy_version 633152 (0.00089) [2022-07-10 07:49:50,511][25689] Fps is (10 sec: 5591.3, 60 sec: 5508.8, 300 sec: 5514.8). Total num frames: 648354816. Throughput: 0: 4955.0. Samples: 648350706. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:50,511][25689] Avg episode reward: [(0, '-6.820')] [2022-07-10 07:49:51,274][26022] Updated weights on worker 0-0, policy_version 633162 (0.00086) [2022-07-10 07:49:53,000][26022] Updated weights on worker 0-0, policy_version 633172 (0.00088) [2022-07-10 07:49:54,727][26022] Updated weights on worker 0-0, policy_version 633182 (0.00087) [2022-07-10 07:49:55,522][25689] Fps is (10 sec: 5690.2, 60 sec: 5510.0, 300 sec: 5515.6). Total num frames: 648382464. Throughput: 0: 5789.3. Samples: 648384320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:49:55,523][25689] Avg episode reward: [(0, '-7.725')] [2022-07-10 07:49:56,571][26022] Updated weights on worker 0-0, policy_version 633192 (0.00090) [2022-07-10 07:49:58,466][26022] Updated weights on worker 0-0, policy_version 633202 (0.00082) [2022-07-10 07:50:00,385][26022] Updated weights on worker 0-0, policy_version 633212 (0.00105) [2022-07-10 07:50:00,535][25689] Fps is (10 sec: 5516.2, 60 sec: 5510.8, 300 sec: 5519.5). Total num frames: 648410112. Throughput: 0: 5821.5. Samples: 648417972. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:00,536][25689] Avg episode reward: [(0, '-7.765')] [2022-07-10 07:50:02,487][26022] Updated weights on worker 0-0, policy_version 633222 (0.00089) [2022-07-10 07:50:04,391][26022] Updated weights on worker 0-0, policy_version 633232 (0.00085) [2022-07-10 07:50:05,548][25689] Fps is (10 sec: 5413.3, 60 sec: 5527.1, 300 sec: 5521.8). Total num frames: 648436736. Throughput: 0: 4899.3. Samples: 648432572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:05,548][25689] Avg episode reward: [(0, '-7.722')] [2022-07-10 07:50:06,256][26022] Updated weights on worker 0-0, policy_version 633242 (0.00094) [2022-07-10 07:50:08,099][26022] Updated weights on worker 0-0, policy_version 633252 (0.00089) [2022-07-10 07:50:09,886][26022] Updated weights on worker 0-0, policy_version 633262 (0.00083) [2022-07-10 07:50:10,616][25689] Fps is (10 sec: 5383.7, 60 sec: 5542.9, 300 sec: 5517.3). Total num frames: 648464384. Throughput: 0: 5734.3. Samples: 648465994. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:10,616][25689] Avg episode reward: [(0, '-7.053')] [2022-07-10 07:50:11,715][26022] Updated weights on worker 0-0, policy_version 633272 (0.00094) [2022-07-10 07:50:13,699][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:50:13,710][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000633282_648480768.pth [2022-07-10 07:50:13,711][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000631341_646493184.pth [2022-07-10 07:50:13,716][26022] Updated weights on worker 0-0, policy_version 633282 (0.00092) [2022-07-10 07:50:15,411][26022] Updated weights on worker 0-0, policy_version 633292 (0.00102) [2022-07-10 07:50:15,667][25689] Fps is (10 sec: 5464.3, 60 sec: 5504.8, 300 sec: 5517.1). Total num frames: 648492032. Throughput: 0: 5693.1. Samples: 648499008. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:15,668][25689] Avg episode reward: [(0, '-6.418')] [2022-07-10 07:50:17,372][26022] Updated weights on worker 0-0, policy_version 633302 (0.00092) [2022-07-10 07:50:19,060][26022] Updated weights on worker 0-0, policy_version 633312 (0.00090) [2022-07-10 07:50:20,723][25689] Fps is (10 sec: 5471.1, 60 sec: 5517.7, 300 sec: 5516.7). Total num frames: 648519680. Throughput: 0: 4839.7. Samples: 648515674. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:20,723][25689] Avg episode reward: [(0, '-7.094')] [2022-07-10 07:50:20,814][26022] Updated weights on worker 0-0, policy_version 633322 (0.00089) [2022-07-10 07:50:22,861][26022] Updated weights on worker 0-0, policy_version 633332 (0.00089) [2022-07-10 07:50:24,827][26022] Updated weights on worker 0-0, policy_version 633342 (0.00086) [2022-07-10 07:50:25,747][25689] Fps is (10 sec: 5486.0, 60 sec: 5501.7, 300 sec: 5514.8). Total num frames: 648547328. Throughput: 0: 5760.7. Samples: 648548932. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:25,747][25689] Avg episode reward: [(0, '-5.945')] [2022-07-10 07:50:26,690][26022] Updated weights on worker 0-0, policy_version 633352 (0.00087) [2022-07-10 07:50:28,593][26022] Updated weights on worker 0-0, policy_version 633362 (0.00092) [2022-07-10 07:50:30,360][26022] Updated weights on worker 0-0, policy_version 633372 (0.00758) [2022-07-10 07:50:30,834][25689] Fps is (10 sec: 5570.1, 60 sec: 5517.1, 300 sec: 5517.5). Total num frames: 648576000. Throughput: 0: 5732.8. Samples: 648581900. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:30,836][25689] Avg episode reward: [(0, '-5.476')] [2022-07-10 07:50:32,104][26022] Updated weights on worker 0-0, policy_version 633382 (0.00081) [2022-07-10 07:50:34,055][26022] Updated weights on worker 0-0, policy_version 633392 (0.00055) [2022-07-10 07:50:35,839][25689] Fps is (10 sec: 5479.2, 60 sec: 5501.9, 300 sec: 5514.2). Total num frames: 648602624. Throughput: 0: 4936.9. Samples: 648598592. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:35,839][25689] Avg episode reward: [(0, '-8.275')] [2022-07-10 07:50:35,883][26022] Updated weights on worker 0-0, policy_version 633402 (0.00084) [2022-07-10 07:50:37,701][26022] Updated weights on worker 0-0, policy_version 633412 (0.00097) [2022-07-10 07:50:39,646][26022] Updated weights on worker 0-0, policy_version 633422 (0.00091) [2022-07-10 07:50:40,853][25689] Fps is (10 sec: 5519.1, 60 sec: 5508.6, 300 sec: 5510.9). Total num frames: 648631296. Throughput: 0: 5775.9. Samples: 648631944. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:40,854][25689] Avg episode reward: [(0, '-9.235')] [2022-07-10 07:50:41,456][26022] Updated weights on worker 0-0, policy_version 633432 (0.00092) [2022-07-10 07:50:43,223][26022] Updated weights on worker 0-0, policy_version 633442 (0.00088) [2022-07-10 07:50:45,068][26022] Updated weights on worker 0-0, policy_version 633452 (0.00090) [2022-07-10 07:50:45,879][25689] Fps is (10 sec: 5507.5, 60 sec: 5506.8, 300 sec: 5511.6). Total num frames: 648657920. Throughput: 0: 5782.3. Samples: 648665342. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:45,889][25689] Avg episode reward: [(0, '-10.333')] [2022-07-10 07:50:46,855][26022] Updated weights on worker 0-0, policy_version 633462 (0.00100) [2022-07-10 07:50:48,852][26022] Updated weights on worker 0-0, policy_version 633472 (0.00099) [2022-07-10 07:50:50,664][26022] Updated weights on worker 0-0, policy_version 633482 (0.00092) [2022-07-10 07:50:51,026][25689] Fps is (10 sec: 5435.5, 60 sec: 5482.5, 300 sec: 5505.7). Total num frames: 648686592. Throughput: 0: 4951.8. Samples: 648681888. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:51,028][25689] Avg episode reward: [(0, '-9.601')] [2022-07-10 07:50:52,479][26022] Updated weights on worker 0-0, policy_version 633492 (0.00083) [2022-07-10 07:50:54,279][26022] Updated weights on worker 0-0, policy_version 633502 (0.00083) [2022-07-10 07:50:56,079][25689] Fps is (10 sec: 5621.8, 60 sec: 5495.6, 300 sec: 5515.4). Total num frames: 648715264. Throughput: 0: 5767.8. Samples: 648715334. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:50:56,081][25689] Avg episode reward: [(0, '-8.184')] [2022-07-10 07:50:56,146][26022] Updated weights on worker 0-0, policy_version 633512 (0.00093) [2022-07-10 07:50:58,139][26022] Updated weights on worker 0-0, policy_version 633522 (0.00094) [2022-07-10 07:50:59,741][26022] Updated weights on worker 0-0, policy_version 633532 (0.00084) [2022-07-10 07:51:01,099][25689] Fps is (10 sec: 5591.1, 60 sec: 5495.0, 300 sec: 5519.3). Total num frames: 648742912. Throughput: 0: 5775.2. Samples: 648748870. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:01,100][25689] Avg episode reward: [(0, '-7.862')] [2022-07-10 07:51:02,074][26022] Updated weights on worker 0-0, policy_version 633542 (0.00086) [2022-07-10 07:51:03,779][26022] Updated weights on worker 0-0, policy_version 633552 (0.00090) [2022-07-10 07:51:05,884][26022] Updated weights on worker 0-0, policy_version 633562 (0.00093) [2022-07-10 07:51:06,148][25689] Fps is (10 sec: 5389.9, 60 sec: 5491.7, 300 sec: 5513.6). Total num frames: 648769536. Throughput: 0: 4842.9. Samples: 648763506. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:06,149][25689] Avg episode reward: [(0, '-7.425')] [2022-07-10 07:51:07,472][26022] Updated weights on worker 0-0, policy_version 633572 (0.00088) [2022-07-10 07:51:09,544][26022] Updated weights on worker 0-0, policy_version 633582 (0.00091) [2022-07-10 07:51:11,079][26022] Updated weights on worker 0-0, policy_version 633592 (0.00091) [2022-07-10 07:51:11,230][25689] Fps is (10 sec: 5559.3, 60 sec: 5524.3, 300 sec: 5520.7). Total num frames: 648799232. Throughput: 0: 5690.7. Samples: 648796862. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:11,230][25689] Avg episode reward: [(0, '-7.810')] [2022-07-10 07:51:13,186][26022] Updated weights on worker 0-0, policy_version 633602 (0.00314) [2022-07-10 07:51:14,848][26022] Updated weights on worker 0-0, policy_version 633612 (0.00093) [2022-07-10 07:51:16,260][25689] Fps is (10 sec: 5468.5, 60 sec: 5492.4, 300 sec: 5506.9). Total num frames: 648824832. Throughput: 0: 5678.0. Samples: 648829920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:16,260][25689] Avg episode reward: [(0, '-8.011')] [2022-07-10 07:51:16,908][26022] Updated weights on worker 0-0, policy_version 633622 (0.00093) [2022-07-10 07:51:18,639][26022] Updated weights on worker 0-0, policy_version 633632 (0.00086) [2022-07-10 07:51:20,747][26022] Updated weights on worker 0-0, policy_version 633642 (0.00084) [2022-07-10 07:51:21,269][25689] Fps is (10 sec: 5405.9, 60 sec: 5513.5, 300 sec: 5515.3). Total num frames: 648853504. Throughput: 0: 4844.3. Samples: 648846580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:21,270][25689] Avg episode reward: [(0, '-9.805')] [2022-07-10 07:51:22,195][26022] Updated weights on worker 0-0, policy_version 633652 (0.00085) [2022-07-10 07:51:24,273][26022] Updated weights on worker 0-0, policy_version 633662 (0.00874) [2022-07-10 07:51:25,915][26022] Updated weights on worker 0-0, policy_version 633672 (0.00088) [2022-07-10 07:51:26,315][25689] Fps is (10 sec: 5601.1, 60 sec: 5511.5, 300 sec: 5513.4). Total num frames: 648881152. Throughput: 0: 5797.2. Samples: 648880416. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:26,316][25689] Avg episode reward: [(0, '-10.115')] [2022-07-10 07:51:27,986][26022] Updated weights on worker 0-0, policy_version 633682 (0.00094) [2022-07-10 07:51:29,698][26022] Updated weights on worker 0-0, policy_version 633692 (0.00103) [2022-07-10 07:51:31,402][25689] Fps is (10 sec: 5558.5, 60 sec: 5511.6, 300 sec: 5516.7). Total num frames: 648909824. Throughput: 0: 5784.9. Samples: 648913554. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:31,403][25689] Avg episode reward: [(0, '-9.395')] [2022-07-10 07:51:31,549][26022] Updated weights on worker 0-0, policy_version 633702 (0.00086) [2022-07-10 07:51:33,454][26022] Updated weights on worker 0-0, policy_version 633712 (0.00092) [2022-07-10 07:51:35,479][26022] Updated weights on worker 0-0, policy_version 633722 (0.00095) [2022-07-10 07:51:36,418][25689] Fps is (10 sec: 5473.5, 60 sec: 5510.6, 300 sec: 5510.0). Total num frames: 648936448. Throughput: 0: 5799.2. Samples: 648946818. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:36,419][25689] Avg episode reward: [(0, '-7.890')] [2022-07-10 07:51:37,091][26022] Updated weights on worker 0-0, policy_version 633732 (0.00086) [2022-07-10 07:51:39,064][26022] Updated weights on worker 0-0, policy_version 633742 (0.00090) [2022-07-10 07:51:40,615][26022] Updated weights on worker 0-0, policy_version 633752 (0.00058) [2022-07-10 07:51:41,427][25689] Fps is (10 sec: 5515.6, 60 sec: 5511.0, 300 sec: 5511.2). Total num frames: 648965120. Throughput: 0: 5801.6. Samples: 648963526. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:41,428][25689] Avg episode reward: [(0, '-8.196')] [2022-07-10 07:51:42,703][26022] Updated weights on worker 0-0, policy_version 633762 (0.00095) [2022-07-10 07:51:44,431][26022] Updated weights on worker 0-0, policy_version 633772 (0.00094) [2022-07-10 07:51:46,377][26022] Updated weights on worker 0-0, policy_version 633782 (0.00081) [2022-07-10 07:51:46,443][25689] Fps is (10 sec: 5617.7, 60 sec: 5528.8, 300 sec: 5516.3). Total num frames: 648992768. Throughput: 0: 5786.1. Samples: 648996878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:46,445][25689] Avg episode reward: [(0, '-7.478')] [2022-07-10 07:51:48,179][26022] Updated weights on worker 0-0, policy_version 633792 (0.00093) [2022-07-10 07:51:50,035][26022] Updated weights on worker 0-0, policy_version 633802 (0.00079) [2022-07-10 07:51:51,529][25689] Fps is (10 sec: 5575.3, 60 sec: 5534.4, 300 sec: 5515.8). Total num frames: 649021440. Throughput: 0: 5779.6. Samples: 649029882. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:51,529][25689] Avg episode reward: [(0, '-7.251')] [2022-07-10 07:51:51,843][26022] Updated weights on worker 0-0, policy_version 633812 (0.00090) [2022-07-10 07:51:53,914][26022] Updated weights on worker 0-0, policy_version 633822 (0.00088) [2022-07-10 07:51:55,482][26022] Updated weights on worker 0-0, policy_version 633832 (0.00929) [2022-07-10 07:51:56,603][25689] Fps is (10 sec: 5442.4, 60 sec: 5498.6, 300 sec: 5507.8). Total num frames: 649048064. Throughput: 0: 4942.1. Samples: 649046578. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:51:56,604][25689] Avg episode reward: [(0, '-7.690')] [2022-07-10 07:51:57,419][26022] Updated weights on worker 0-0, policy_version 633842 (0.00078) [2022-07-10 07:51:59,242][26022] Updated weights on worker 0-0, policy_version 633852 (0.00104) [2022-07-10 07:52:01,147][26022] Updated weights on worker 0-0, policy_version 633862 (0.00084) [2022-07-10 07:52:01,605][25689] Fps is (10 sec: 5589.3, 60 sec: 5534.1, 300 sec: 5518.5). Total num frames: 649077760. Throughput: 0: 5793.4. Samples: 649080426. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:52:01,607][25689] Avg episode reward: [(0, '-7.755')] [2022-07-10 07:52:03,419][26022] Updated weights on worker 0-0, policy_version 633872 (0.00084) [2022-07-10 07:52:05,109][26022] Updated weights on worker 0-0, policy_version 633882 (0.00084) [2022-07-10 07:52:06,614][25689] Fps is (10 sec: 5421.1, 60 sec: 5503.9, 300 sec: 5510.7). Total num frames: 649102336. Throughput: 0: 5700.4. Samples: 649111864. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:52:06,615][25689] Avg episode reward: [(0, '-7.651')] [2022-07-10 07:52:07,005][26022] Updated weights on worker 0-0, policy_version 633892 (0.00090) [2022-07-10 07:52:08,787][26022] Updated weights on worker 0-0, policy_version 633902 (0.00089) [2022-07-10 07:52:10,687][26022] Updated weights on worker 0-0, policy_version 633912 (0.00090) [2022-07-10 07:52:11,690][25689] Fps is (10 sec: 5280.0, 60 sec: 5487.5, 300 sec: 5517.1). Total num frames: 649131008. Throughput: 0: 4880.3. Samples: 649128274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:52:11,691][25689] Avg episode reward: [(0, '-9.050')] [2022-07-10 07:52:12,602][26022] Updated weights on worker 0-0, policy_version 633922 (0.00087) [2022-07-10 07:52:13,775][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:52:13,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000633930_649144320.pth [2022-07-10 07:52:13,789][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000631988_647155712.pth [2022-07-10 07:52:14,505][26022] Updated weights on worker 0-0, policy_version 633932 (0.00085) [2022-07-10 07:52:16,118][26022] Updated weights on worker 0-0, policy_version 633942 (0.00086) [2022-07-10 07:52:16,695][25689] Fps is (10 sec: 5688.8, 60 sec: 5540.7, 300 sec: 5514.1). Total num frames: 649159680. Throughput: 0: 5735.2. Samples: 649161806. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:52:16,697][25689] Avg episode reward: [(0, '-9.269')] [2022-07-10 07:52:18,150][26022] Updated weights on worker 0-0, policy_version 633952 (0.00093) [2022-07-10 07:52:19,834][26022] Updated weights on worker 0-0, policy_version 633962 (0.00094) [2022-07-10 07:52:21,726][25689] Fps is (10 sec: 5407.6, 60 sec: 5487.8, 300 sec: 5507.6). Total num frames: 649185280. Throughput: 0: 5695.8. Samples: 649195032. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-10 07:52:21,728][25689] Avg episode reward: [(0, '-11.055')] [2022-07-10 07:52:21,940][26022] Updated weights on worker 0-0, policy_version 633972 (0.00092) [2022-07-10 07:52:23,544][26022] Updated weights on worker 0-0, policy_version 633982 (0.00992) [2022-07-10 07:52:25,426][26022] Updated weights on worker 0-0, policy_version 633992 (0.00096) [2022-07-10 07:52:26,781][25689] Fps is (10 sec: 5482.5, 60 sec: 5520.9, 300 sec: 5514.6). Total num frames: 649214976. Throughput: 0: 4963.5. Samples: 649211958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:26,782][25689] Avg episode reward: [(0, '-10.094')] [2022-07-10 07:52:27,403][26022] Updated weights on worker 0-0, policy_version 634002 (0.00085) [2022-07-10 07:52:29,176][26022] Updated weights on worker 0-0, policy_version 634012 (0.00088) [2022-07-10 07:52:30,903][26022] Updated weights on worker 0-0, policy_version 634022 (0.00055) [2022-07-10 07:52:31,846][25689] Fps is (10 sec: 5767.4, 60 sec: 5522.8, 300 sec: 5513.6). Total num frames: 649243648. Throughput: 0: 5806.1. Samples: 649245304. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:31,847][25689] Avg episode reward: [(0, '-9.385')] [2022-07-10 07:52:32,619][26022] Updated weights on worker 0-0, policy_version 634032 (0.00088) [2022-07-10 07:52:34,796][26022] Updated weights on worker 0-0, policy_version 634042 (0.00083) [2022-07-10 07:52:36,447][26022] Updated weights on worker 0-0, policy_version 634052 (0.00093) [2022-07-10 07:52:36,942][25689] Fps is (10 sec: 5542.6, 60 sec: 5532.5, 300 sec: 5513.1). Total num frames: 649271296. Throughput: 0: 5779.0. Samples: 649278814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:36,942][25689] Avg episode reward: [(0, '-10.548')] [2022-07-10 07:52:38,362][26022] Updated weights on worker 0-0, policy_version 634062 (0.00092) [2022-07-10 07:52:40,043][26022] Updated weights on worker 0-0, policy_version 634072 (0.00084) [2022-07-10 07:52:42,002][25689] Fps is (10 sec: 5444.7, 60 sec: 5510.9, 300 sec: 5509.6). Total num frames: 649298944. Throughput: 0: 4951.4. Samples: 649295428. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:42,003][25689] Avg episode reward: [(0, '-9.428')] [2022-07-10 07:52:42,034][26022] Updated weights on worker 0-0, policy_version 634082 (0.00084) [2022-07-10 07:52:43,967][26022] Updated weights on worker 0-0, policy_version 634092 (0.00095) [2022-07-10 07:52:45,622][26022] Updated weights on worker 0-0, policy_version 634102 (0.00088) [2022-07-10 07:52:47,026][25689] Fps is (10 sec: 5585.1, 60 sec: 5527.1, 300 sec: 5514.1). Total num frames: 649327616. Throughput: 0: 5790.6. Samples: 649329190. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:47,026][25689] Avg episode reward: [(0, '-8.225')] [2022-07-10 07:52:47,638][26022] Updated weights on worker 0-0, policy_version 634112 (0.00085) [2022-07-10 07:52:49,350][26022] Updated weights on worker 0-0, policy_version 634122 (0.00089) [2022-07-10 07:52:51,291][26022] Updated weights on worker 0-0, policy_version 634132 (0.00094) [2022-07-10 07:52:52,112][25689] Fps is (10 sec: 5672.3, 60 sec: 5527.1, 300 sec: 5516.8). Total num frames: 649356288. Throughput: 0: 5783.8. Samples: 649362514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:52,112][25689] Avg episode reward: [(0, '-7.323')] [2022-07-10 07:52:53,126][26022] Updated weights on worker 0-0, policy_version 634142 (0.00090) [2022-07-10 07:52:54,882][26022] Updated weights on worker 0-0, policy_version 634152 (0.00091) [2022-07-10 07:52:56,708][26022] Updated weights on worker 0-0, policy_version 634162 (0.00085) [2022-07-10 07:52:57,127][25689] Fps is (10 sec: 5575.7, 60 sec: 5549.4, 300 sec: 5513.3). Total num frames: 649383936. Throughput: 0: 4975.1. Samples: 649379236. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:52:57,127][25689] Avg episode reward: [(0, '-7.212')] [2022-07-10 07:52:58,630][26022] Updated weights on worker 0-0, policy_version 634172 (0.00093) [2022-07-10 07:53:00,472][26022] Updated weights on worker 0-0, policy_version 634182 (0.00093) [2022-07-10 07:53:02,139][25689] Fps is (10 sec: 5412.6, 60 sec: 5497.8, 300 sec: 5516.6). Total num frames: 649410560. Throughput: 0: 5816.9. Samples: 649412560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:02,139][25689] Avg episode reward: [(0, '-6.653')] [2022-07-10 07:53:02,639][26022] Updated weights on worker 0-0, policy_version 634192 (0.00084) [2022-07-10 07:53:04,394][26022] Updated weights on worker 0-0, policy_version 634202 (0.00100) [2022-07-10 07:53:06,399][26022] Updated weights on worker 0-0, policy_version 634212 (0.00081) [2022-07-10 07:53:07,148][25689] Fps is (10 sec: 5211.1, 60 sec: 5514.7, 300 sec: 5511.1). Total num frames: 649436160. Throughput: 0: 5707.6. Samples: 649444042. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:07,149][25689] Avg episode reward: [(0, '-6.581')] [2022-07-10 07:53:08,091][26022] Updated weights on worker 0-0, policy_version 634222 (0.00121) [2022-07-10 07:53:10,011][26022] Updated weights on worker 0-0, policy_version 634232 (0.00096) [2022-07-10 07:53:11,880][26022] Updated weights on worker 0-0, policy_version 634242 (0.00090) [2022-07-10 07:53:12,219][25689] Fps is (10 sec: 5485.5, 60 sec: 5532.0, 300 sec: 5513.6). Total num frames: 649465856. Throughput: 0: 4880.1. Samples: 649460640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:12,220][25689] Avg episode reward: [(0, '-7.906')] [2022-07-10 07:53:13,817][26022] Updated weights on worker 0-0, policy_version 634252 (0.00087) [2022-07-10 07:53:15,552][26022] Updated weights on worker 0-0, policy_version 634262 (0.00092) [2022-07-10 07:53:17,241][25689] Fps is (10 sec: 5682.0, 60 sec: 5513.6, 300 sec: 5509.8). Total num frames: 649493504. Throughput: 0: 5689.7. Samples: 649493678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:17,241][25689] Avg episode reward: [(0, '-6.072')] [2022-07-10 07:53:17,432][26022] Updated weights on worker 0-0, policy_version 634272 (0.00078) [2022-07-10 07:53:19,410][26022] Updated weights on worker 0-0, policy_version 634282 (0.00093) [2022-07-10 07:53:20,984][26022] Updated weights on worker 0-0, policy_version 634292 (0.00088) [2022-07-10 07:53:22,245][25689] Fps is (10 sec: 5413.5, 60 sec: 5533.0, 300 sec: 5513.3). Total num frames: 649520128. Throughput: 0: 5676.7. Samples: 649526694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:22,245][25689] Avg episode reward: [(0, '-6.516')] [2022-07-10 07:53:23,136][26022] Updated weights on worker 0-0, policy_version 634302 (0.00091) [2022-07-10 07:53:24,919][26022] Updated weights on worker 0-0, policy_version 634312 (0.00108) [2022-07-10 07:53:26,698][26022] Updated weights on worker 0-0, policy_version 634322 (0.00083) [2022-07-10 07:53:27,267][25689] Fps is (10 sec: 5617.0, 60 sec: 5535.9, 300 sec: 5515.0). Total num frames: 649549824. Throughput: 0: 4936.0. Samples: 649543350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:27,268][25689] Avg episode reward: [(0, '-10.220')] [2022-07-10 07:53:28,701][26022] Updated weights on worker 0-0, policy_version 634332 (0.00085) [2022-07-10 07:53:30,344][26022] Updated weights on worker 0-0, policy_version 634342 (0.00091) [2022-07-10 07:53:32,396][25689] Fps is (10 sec: 5447.0, 60 sec: 5479.4, 300 sec: 5506.1). Total num frames: 649575424. Throughput: 0: 5726.2. Samples: 649576178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:32,400][25689] Avg episode reward: [(0, '-9.875')] [2022-07-10 07:53:32,450][26022] Updated weights on worker 0-0, policy_version 634352 (0.00094) [2022-07-10 07:53:34,039][26022] Updated weights on worker 0-0, policy_version 634362 (0.00089) [2022-07-10 07:53:36,088][26022] Updated weights on worker 0-0, policy_version 634372 (0.00093) [2022-07-10 07:53:37,468][25689] Fps is (10 sec: 5320.6, 60 sec: 5498.5, 300 sec: 5512.1). Total num frames: 649604096. Throughput: 0: 5717.6. Samples: 649609328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:37,468][25689] Avg episode reward: [(0, '-7.283')] [2022-07-10 07:53:38,052][26022] Updated weights on worker 0-0, policy_version 634382 (0.00090) [2022-07-10 07:53:39,549][26022] Updated weights on worker 0-0, policy_version 634392 (0.00091) [2022-07-10 07:53:41,851][26022] Updated weights on worker 0-0, policy_version 634402 (0.00117) [2022-07-10 07:53:42,516][25689] Fps is (10 sec: 5565.4, 60 sec: 5499.6, 300 sec: 5504.5). Total num frames: 649631744. Throughput: 0: 4897.5. Samples: 649625972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:42,518][25689] Avg episode reward: [(0, '-8.601')] [2022-07-10 07:53:43,352][26022] Updated weights on worker 0-0, policy_version 634412 (0.00092) [2022-07-10 07:53:45,354][26022] Updated weights on worker 0-0, policy_version 634422 (0.00093) [2022-07-10 07:53:47,048][26022] Updated weights on worker 0-0, policy_version 634432 (0.00094) [2022-07-10 07:53:47,590][25689] Fps is (10 sec: 5563.9, 60 sec: 5495.0, 300 sec: 5507.4). Total num frames: 649660416. Throughput: 0: 5706.6. Samples: 649659324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:47,591][25689] Avg episode reward: [(0, '-10.376')] [2022-07-10 07:53:49,194][26022] Updated weights on worker 0-0, policy_version 634442 (0.00084) [2022-07-10 07:53:51,015][26022] Updated weights on worker 0-0, policy_version 634452 (0.00088) [2022-07-10 07:53:52,686][25689] Fps is (10 sec: 5537.6, 60 sec: 5477.2, 300 sec: 5506.1). Total num frames: 649688064. Throughput: 0: 5692.3. Samples: 649691676. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:52,687][25689] Avg episode reward: [(0, '-9.786')] [2022-07-10 07:53:53,003][26022] Updated weights on worker 0-0, policy_version 634462 (0.00100) [2022-07-10 07:53:54,555][26022] Updated weights on worker 0-0, policy_version 634472 (0.00099) [2022-07-10 07:53:56,664][26022] Updated weights on worker 0-0, policy_version 634482 (0.00089) [2022-07-10 07:53:57,723][25689] Fps is (10 sec: 5659.3, 60 sec: 5509.0, 300 sec: 5512.7). Total num frames: 649717760. Throughput: 0: 5715.8. Samples: 649725104. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:53:57,723][25689] Avg episode reward: [(0, '-7.191')] [2022-07-10 07:53:58,426][26022] Updated weights on worker 0-0, policy_version 634492 (0.00087) [2022-07-10 07:54:00,322][26022] Updated weights on worker 0-0, policy_version 634502 (0.00093) [2022-07-10 07:54:02,566][26022] Updated weights on worker 0-0, policy_version 634512 (0.00092) [2022-07-10 07:54:02,760][25689] Fps is (10 sec: 5184.1, 60 sec: 5439.1, 300 sec: 5501.8). Total num frames: 649740288. Throughput: 0: 5713.1. Samples: 649741630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:02,761][25689] Avg episode reward: [(0, '-9.363')] [2022-07-10 07:54:04,318][26022] Updated weights on worker 0-0, policy_version 634522 (0.00086) [2022-07-10 07:54:06,231][26022] Updated weights on worker 0-0, policy_version 634532 (0.00088) [2022-07-10 07:54:07,799][25689] Fps is (10 sec: 5081.3, 60 sec: 5487.2, 300 sec: 5508.9). Total num frames: 649768960. Throughput: 0: 5617.1. Samples: 649772840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:07,799][25689] Avg episode reward: [(0, '-9.755')] [2022-07-10 07:54:08,010][26022] Updated weights on worker 0-0, policy_version 634542 (0.00083) [2022-07-10 07:54:09,903][26022] Updated weights on worker 0-0, policy_version 634552 (0.00093) [2022-07-10 07:54:11,773][26022] Updated weights on worker 0-0, policy_version 634562 (0.00088) [2022-07-10 07:54:12,852][25689] Fps is (10 sec: 5783.6, 60 sec: 5488.8, 300 sec: 5508.1). Total num frames: 649798656. Throughput: 0: 5691.4. Samples: 649806448. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:12,853][25689] Avg episode reward: [(0, '-10.440')] [2022-07-10 07:54:13,466][26022] Updated weights on worker 0-0, policy_version 634572 (0.00080) [2022-07-10 07:54:13,987][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:54:13,997][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000634574_649803776.pth [2022-07-10 07:54:13,997][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000632636_647819264.pth [2022-07-10 07:54:15,271][26022] Updated weights on worker 0-0, policy_version 634582 (0.00086) [2022-07-10 07:54:17,089][26022] Updated weights on worker 0-0, policy_version 634592 (0.00090) [2022-07-10 07:54:17,879][25689] Fps is (10 sec: 5587.2, 60 sec: 5471.4, 300 sec: 5507.8). Total num frames: 649825280. Throughput: 0: 4874.2. Samples: 649823350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:17,880][25689] Avg episode reward: [(0, '-8.692')] [2022-07-10 07:54:18,828][26022] Updated weights on worker 0-0, policy_version 634602 (0.00084) [2022-07-10 07:54:20,863][26022] Updated weights on worker 0-0, policy_version 634612 (0.00090) [2022-07-10 07:54:22,811][26022] Updated weights on worker 0-0, policy_version 634622 (0.00086) [2022-07-10 07:54:22,909][25689] Fps is (10 sec: 5396.5, 60 sec: 5485.9, 300 sec: 5504.4). Total num frames: 649852928. Throughput: 0: 5712.8. Samples: 649856736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:22,909][25689] Avg episode reward: [(0, '-9.532')] [2022-07-10 07:54:24,455][26022] Updated weights on worker 0-0, policy_version 634632 (0.00088) [2022-07-10 07:54:26,638][26022] Updated weights on worker 0-0, policy_version 634642 (0.00085) [2022-07-10 07:54:27,913][25689] Fps is (10 sec: 5613.2, 60 sec: 5470.8, 300 sec: 5509.1). Total num frames: 649881600. Throughput: 0: 5814.7. Samples: 649889796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:27,913][25689] Avg episode reward: [(0, '-9.640')] [2022-07-10 07:54:28,065][26022] Updated weights on worker 0-0, policy_version 634652 (0.00092) [2022-07-10 07:54:30,264][26022] Updated weights on worker 0-0, policy_version 634662 (0.00089) [2022-07-10 07:54:31,795][26022] Updated weights on worker 0-0, policy_version 634672 (0.00095) [2022-07-10 07:54:33,003][25689] Fps is (10 sec: 5579.6, 60 sec: 5508.0, 300 sec: 5507.9). Total num frames: 649909248. Throughput: 0: 4955.1. Samples: 649906298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:33,003][25689] Avg episode reward: [(0, '-9.451')] [2022-07-10 07:54:33,903][26022] Updated weights on worker 0-0, policy_version 634682 (0.00083) [2022-07-10 07:54:35,599][26022] Updated weights on worker 0-0, policy_version 634692 (0.00084) [2022-07-10 07:54:37,656][26022] Updated weights on worker 0-0, policy_version 634702 (0.00091) [2022-07-10 07:54:38,019][25689] Fps is (10 sec: 5370.2, 60 sec: 5479.3, 300 sec: 5502.3). Total num frames: 649935872. Throughput: 0: 5772.1. Samples: 649939598. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:38,020][25689] Avg episode reward: [(0, '-8.021')] [2022-07-10 07:54:39,186][26022] Updated weights on worker 0-0, policy_version 634712 (0.00092) [2022-07-10 07:54:41,468][26022] Updated weights on worker 0-0, policy_version 634722 (0.00088) [2022-07-10 07:54:43,087][25689] Fps is (10 sec: 5483.5, 60 sec: 5494.4, 300 sec: 5508.0). Total num frames: 649964544. Throughput: 0: 5750.1. Samples: 649972762. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:43,087][25689] Avg episode reward: [(0, '-9.347')] [2022-07-10 07:54:43,110][26022] Updated weights on worker 0-0, policy_version 634732 (0.00094) [2022-07-10 07:54:44,995][26022] Updated weights on worker 0-0, policy_version 634742 (0.00083) [2022-07-10 07:54:46,695][26022] Updated weights on worker 0-0, policy_version 634752 (0.00084) [2022-07-10 07:54:48,123][25689] Fps is (10 sec: 5675.3, 60 sec: 5497.9, 300 sec: 5505.2). Total num frames: 649993216. Throughput: 0: 4938.6. Samples: 649989606. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:48,123][25689] Avg episode reward: [(0, '-10.218')] [2022-07-10 07:54:48,660][26022] Updated weights on worker 0-0, policy_version 634762 (0.00085) [2022-07-10 07:54:50,670][26022] Updated weights on worker 0-0, policy_version 634772 (0.00094) [2022-07-10 07:54:52,365][26022] Updated weights on worker 0-0, policy_version 634782 (0.00099) [2022-07-10 07:54:53,253][25689] Fps is (10 sec: 5539.9, 60 sec: 5494.7, 300 sec: 5503.0). Total num frames: 650020864. Throughput: 0: 5750.4. Samples: 650022746. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:53,255][25689] Avg episode reward: [(0, '-9.951')] [2022-07-10 07:54:54,127][26022] Updated weights on worker 0-0, policy_version 634792 (0.00093) [2022-07-10 07:54:56,148][26022] Updated weights on worker 0-0, policy_version 634802 (0.00101) [2022-07-10 07:54:57,818][26022] Updated weights on worker 0-0, policy_version 634812 (0.00365) [2022-07-10 07:54:58,271][25689] Fps is (10 sec: 5550.0, 60 sec: 5479.6, 300 sec: 5506.3). Total num frames: 650049536. Throughput: 0: 5760.3. Samples: 650056254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:54:58,271][25689] Avg episode reward: [(0, '-10.803')] [2022-07-10 07:54:59,844][26022] Updated weights on worker 0-0, policy_version 634822 (0.00093) [2022-07-10 07:55:01,705][26022] Updated weights on worker 0-0, policy_version 634832 (0.00090) [2022-07-10 07:55:03,293][25689] Fps is (10 sec: 5303.4, 60 sec: 5514.7, 300 sec: 5499.2). Total num frames: 650074112. Throughput: 0: 4955.2. Samples: 650072890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:03,294][25689] Avg episode reward: [(0, '-7.986')] [2022-07-10 07:55:03,888][26022] Updated weights on worker 0-0, policy_version 634842 (0.00107) [2022-07-10 07:55:05,685][26022] Updated weights on worker 0-0, policy_version 634852 (0.00087) [2022-07-10 07:55:07,566][26022] Updated weights on worker 0-0, policy_version 634862 (0.00091) [2022-07-10 07:55:08,310][25689] Fps is (10 sec: 5303.9, 60 sec: 5516.8, 300 sec: 5503.6). Total num frames: 650102784. Throughput: 0: 5662.6. Samples: 650103918. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:08,311][25689] Avg episode reward: [(0, '-8.601')] [2022-07-10 07:55:09,524][26022] Updated weights on worker 0-0, policy_version 634872 (0.00089) [2022-07-10 07:55:11,135][26022] Updated weights on worker 0-0, policy_version 634882 (0.00086) [2022-07-10 07:55:13,156][26022] Updated weights on worker 0-0, policy_version 634892 (0.00085) [2022-07-10 07:55:13,438][25689] Fps is (10 sec: 5652.6, 60 sec: 5493.0, 300 sec: 5505.7). Total num frames: 650131456. Throughput: 0: 5653.6. Samples: 650136864. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:13,439][25689] Avg episode reward: [(0, '-8.058')] [2022-07-10 07:55:15,439][26022] Updated weights on worker 0-0, policy_version 634902 (0.00086) [2022-07-10 07:55:16,667][26022] Updated weights on worker 0-0, policy_version 634912 (0.00096) [2022-07-10 07:55:18,448][25689] Fps is (10 sec: 5353.1, 60 sec: 5477.6, 300 sec: 5499.6). Total num frames: 650157056. Throughput: 0: 4823.1. Samples: 650153574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:18,449][25689] Avg episode reward: [(0, '-9.340')] [2022-07-10 07:55:18,989][26022] Updated weights on worker 0-0, policy_version 634922 (0.00093) [2022-07-10 07:55:20,281][26022] Updated weights on worker 0-0, policy_version 634932 (0.00091) [2022-07-10 07:55:22,591][26022] Updated weights on worker 0-0, policy_version 634942 (0.00086) [2022-07-10 07:55:23,468][25689] Fps is (10 sec: 5513.4, 60 sec: 5512.4, 300 sec: 5506.6). Total num frames: 650186752. Throughput: 0: 5645.9. Samples: 650186792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:23,468][25689] Avg episode reward: [(0, '-9.781')] [2022-07-10 07:55:24,177][26022] Updated weights on worker 0-0, policy_version 634952 (0.00087) [2022-07-10 07:55:26,216][26022] Updated weights on worker 0-0, policy_version 634962 (0.00086) [2022-07-10 07:55:28,154][26022] Updated weights on worker 0-0, policy_version 634972 (0.00088) [2022-07-10 07:55:28,472][25689] Fps is (10 sec: 5720.7, 60 sec: 5495.4, 300 sec: 5504.7). Total num frames: 650214400. Throughput: 0: 5749.2. Samples: 650219836. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:28,473][25689] Avg episode reward: [(0, '-9.145')] [2022-07-10 07:55:29,887][26022] Updated weights on worker 0-0, policy_version 634982 (0.00879) [2022-07-10 07:55:31,671][26022] Updated weights on worker 0-0, policy_version 634992 (0.00093) [2022-07-10 07:55:33,579][25689] Fps is (10 sec: 5367.5, 60 sec: 5477.0, 300 sec: 5502.8). Total num frames: 650241024. Throughput: 0: 4940.7. Samples: 650236372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 07:55:33,585][25689] Avg episode reward: [(0, '-7.694')] [2022-07-10 07:55:33,646][26022] Updated weights on worker 0-0, policy_version 635002 (0.00089) [2022-07-10 07:55:35,232][26022] Updated weights on worker 0-0, policy_version 635012 (0.00083) [2022-07-10 07:55:37,284][26022] Updated weights on worker 0-0, policy_version 635022 (0.00085) [2022-07-10 07:55:38,592][25689] Fps is (10 sec: 5464.4, 60 sec: 5511.1, 300 sec: 5502.8). Total num frames: 650269696. Throughput: 0: 5764.0. Samples: 650269680. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:55:38,592][25689] Avg episode reward: [(0, '-8.267')] [2022-07-10 07:55:39,024][26022] Updated weights on worker 0-0, policy_version 635032 (0.00096) [2022-07-10 07:55:40,860][26022] Updated weights on worker 0-0, policy_version 635042 (0.00092) [2022-07-10 07:55:42,763][26022] Updated weights on worker 0-0, policy_version 635052 (0.00083) [2022-07-10 07:55:43,599][25689] Fps is (10 sec: 5620.6, 60 sec: 5499.7, 300 sec: 5506.6). Total num frames: 650297344. Throughput: 0: 5773.1. Samples: 650303014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:55:43,601][25689] Avg episode reward: [(0, '-8.439')] [2022-07-10 07:55:44,530][26022] Updated weights on worker 0-0, policy_version 635062 (0.00073) [2022-07-10 07:55:46,356][26022] Updated weights on worker 0-0, policy_version 635072 (0.00093) [2022-07-10 07:55:48,291][26022] Updated weights on worker 0-0, policy_version 635082 (0.00092) [2022-07-10 07:55:48,631][25689] Fps is (10 sec: 5405.9, 60 sec: 5466.2, 300 sec: 5501.8). Total num frames: 650323968. Throughput: 0: 4957.0. Samples: 650319762. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:55:48,632][25689] Avg episode reward: [(0, '-5.795')] [2022-07-10 07:55:50,226][26022] Updated weights on worker 0-0, policy_version 635092 (0.00087) [2022-07-10 07:55:52,019][26022] Updated weights on worker 0-0, policy_version 635102 (0.00098) [2022-07-10 07:55:53,718][25689] Fps is (10 sec: 5464.8, 60 sec: 5487.1, 300 sec: 5501.2). Total num frames: 650352640. Throughput: 0: 5788.3. Samples: 650352942. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:55:53,718][25689] Avg episode reward: [(0, '-5.584')] [2022-07-10 07:55:53,952][26022] Updated weights on worker 0-0, policy_version 635112 (0.00089) [2022-07-10 07:55:55,764][26022] Updated weights on worker 0-0, policy_version 635122 (0.00090) [2022-07-10 07:55:57,656][26022] Updated weights on worker 0-0, policy_version 635132 (0.00089) [2022-07-10 07:55:58,795][25689] Fps is (10 sec: 5742.5, 60 sec: 5498.6, 300 sec: 5507.0). Total num frames: 650382336. Throughput: 0: 5761.1. Samples: 650386074. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:55:58,797][25689] Avg episode reward: [(0, '-6.463')] [2022-07-10 07:55:59,478][26022] Updated weights on worker 0-0, policy_version 635142 (0.00088) [2022-07-10 07:56:01,345][26022] Updated weights on worker 0-0, policy_version 635152 (0.00092) [2022-07-10 07:56:03,578][26022] Updated weights on worker 0-0, policy_version 635162 (0.00085) [2022-07-10 07:56:03,877][25689] Fps is (10 sec: 5442.8, 60 sec: 5510.1, 300 sec: 5503.0). Total num frames: 650407936. Throughput: 0: 5649.0. Samples: 650417564. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:03,878][25689] Avg episode reward: [(0, '-8.088')] [2022-07-10 07:56:05,314][26022] Updated weights on worker 0-0, policy_version 635172 (0.00088) [2022-07-10 07:56:07,187][26022] Updated weights on worker 0-0, policy_version 635182 (0.00082) [2022-07-10 07:56:08,769][26022] Updated weights on worker 0-0, policy_version 635192 (0.00086) [2022-07-10 07:56:08,943][25689] Fps is (10 sec: 5449.2, 60 sec: 5522.6, 300 sec: 5503.3). Total num frames: 650437632. Throughput: 0: 5642.1. Samples: 650434362. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:08,943][25689] Avg episode reward: [(0, '-7.868')] [2022-07-10 07:56:10,922][26022] Updated weights on worker 0-0, policy_version 635202 (0.00084) [2022-07-10 07:56:12,642][26022] Updated weights on worker 0-0, policy_version 635212 (0.00089) [2022-07-10 07:56:14,011][25689] Fps is (10 sec: 5557.8, 60 sec: 5494.2, 300 sec: 5506.0). Total num frames: 650464256. Throughput: 0: 5648.9. Samples: 650467576. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:14,011][25689] Avg episode reward: [(0, '-9.413')] [2022-07-10 07:56:14,216][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:56:14,230][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000635220_650465280.pth [2022-07-10 07:56:14,230][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000633282_648480768.pth [2022-07-10 07:56:14,598][26022] Updated weights on worker 0-0, policy_version 635222 (0.00095) [2022-07-10 07:56:16,270][26022] Updated weights on worker 0-0, policy_version 635232 (0.00090) [2022-07-10 07:56:18,127][26022] Updated weights on worker 0-0, policy_version 635242 (0.00082) [2022-07-10 07:56:19,023][25689] Fps is (10 sec: 5384.1, 60 sec: 5527.9, 300 sec: 5502.5). Total num frames: 650491904. Throughput: 0: 5686.2. Samples: 650501092. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:19,023][25689] Avg episode reward: [(0, '-9.477')] [2022-07-10 07:56:20,135][26022] Updated weights on worker 0-0, policy_version 635252 (0.00091) [2022-07-10 07:56:21,902][26022] Updated weights on worker 0-0, policy_version 635262 (0.00053) [2022-07-10 07:56:23,526][26022] Updated weights on worker 0-0, policy_version 635272 (0.00092) [2022-07-10 07:56:24,037][25689] Fps is (10 sec: 5515.1, 60 sec: 5494.5, 300 sec: 5503.1). Total num frames: 650519552. Throughput: 0: 4968.9. Samples: 650517736. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:24,037][25689] Avg episode reward: [(0, '-9.359')] [2022-07-10 07:56:25,594][26022] Updated weights on worker 0-0, policy_version 635282 (0.00086) [2022-07-10 07:56:27,354][26022] Updated weights on worker 0-0, policy_version 635292 (0.00095) [2022-07-10 07:56:29,055][25689] Fps is (10 sec: 5512.0, 60 sec: 5493.4, 300 sec: 5501.0). Total num frames: 650547200. Throughput: 0: 5802.6. Samples: 650551064. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:29,055][25689] Avg episode reward: [(0, '-8.205')] [2022-07-10 07:56:29,304][26022] Updated weights on worker 0-0, policy_version 635302 (0.00076) [2022-07-10 07:56:30,975][26022] Updated weights on worker 0-0, policy_version 635312 (0.00061) [2022-07-10 07:56:33,085][26022] Updated weights on worker 0-0, policy_version 635322 (0.00065) [2022-07-10 07:56:34,130][25689] Fps is (10 sec: 5681.8, 60 sec: 5546.9, 300 sec: 5510.2). Total num frames: 650576896. Throughput: 0: 5789.6. Samples: 650584056. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:34,130][25689] Avg episode reward: [(0, '-8.787')] [2022-07-10 07:56:34,897][26022] Updated weights on worker 0-0, policy_version 635332 (0.00093) [2022-07-10 07:56:36,697][26022] Updated weights on worker 0-0, policy_version 635342 (0.00086) [2022-07-10 07:56:38,540][26022] Updated weights on worker 0-0, policy_version 635352 (0.00089) [2022-07-10 07:56:39,166][25689] Fps is (10 sec: 5468.8, 60 sec: 5494.1, 300 sec: 5499.4). Total num frames: 650602496. Throughput: 0: 4948.2. Samples: 650600764. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:39,166][25689] Avg episode reward: [(0, '-7.319')] [2022-07-10 07:56:40,485][26022] Updated weights on worker 0-0, policy_version 635362 (0.00088) [2022-07-10 07:56:42,254][26022] Updated weights on worker 0-0, policy_version 635372 (0.00083) [2022-07-10 07:56:44,020][26022] Updated weights on worker 0-0, policy_version 635382 (0.00090) [2022-07-10 07:56:44,188][25689] Fps is (10 sec: 5395.8, 60 sec: 5509.7, 300 sec: 5502.7). Total num frames: 650631168. Throughput: 0: 5788.7. Samples: 650634382. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:44,188][25689] Avg episode reward: [(0, '-6.742')] [2022-07-10 07:56:45,731][26022] Updated weights on worker 0-0, policy_version 635392 (0.00086) [2022-07-10 07:56:47,883][26022] Updated weights on worker 0-0, policy_version 635402 (0.00088) [2022-07-10 07:56:49,222][25689] Fps is (10 sec: 5600.7, 60 sec: 5526.4, 300 sec: 5500.2). Total num frames: 650658816. Throughput: 0: 5787.3. Samples: 650667776. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:49,222][25689] Avg episode reward: [(0, '-7.884')] [2022-07-10 07:56:49,623][26022] Updated weights on worker 0-0, policy_version 635412 (0.00100) [2022-07-10 07:56:51,504][26022] Updated weights on worker 0-0, policy_version 635422 (0.00089) [2022-07-10 07:56:53,307][26022] Updated weights on worker 0-0, policy_version 635432 (0.00089) [2022-07-10 07:56:54,285][25689] Fps is (10 sec: 5679.2, 60 sec: 5545.4, 300 sec: 5510.7). Total num frames: 650688512. Throughput: 0: 4967.3. Samples: 650684174. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:54,285][25689] Avg episode reward: [(0, '-7.351')] [2022-07-10 07:56:55,091][26022] Updated weights on worker 0-0, policy_version 635442 (0.00089) [2022-07-10 07:56:56,938][26022] Updated weights on worker 0-0, policy_version 635452 (0.00090) [2022-07-10 07:56:58,997][26022] Updated weights on worker 0-0, policy_version 635462 (0.00084) [2022-07-10 07:56:59,289][25689] Fps is (10 sec: 5594.5, 60 sec: 5501.4, 300 sec: 5500.4). Total num frames: 650715136. Throughput: 0: 5811.1. Samples: 650717700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:56:59,289][25689] Avg episode reward: [(0, '-6.243')] [2022-07-10 07:57:00,512][26022] Updated weights on worker 0-0, policy_version 635472 (0.00084) [2022-07-10 07:57:02,919][26022] Updated weights on worker 0-0, policy_version 635482 (0.00096) [2022-07-10 07:57:04,343][25689] Fps is (10 sec: 5396.0, 60 sec: 5537.8, 300 sec: 5509.9). Total num frames: 650742784. Throughput: 0: 5680.4. Samples: 650748870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:04,343][25689] Avg episode reward: [(0, '-6.952')] [2022-07-10 07:57:04,742][26022] Updated weights on worker 0-0, policy_version 635492 (0.00089) [2022-07-10 07:57:06,612][26022] Updated weights on worker 0-0, policy_version 635502 (0.00121) [2022-07-10 07:57:08,315][26022] Updated weights on worker 0-0, policy_version 635512 (0.00081) [2022-07-10 07:57:09,362][25689] Fps is (10 sec: 5285.9, 60 sec: 5474.2, 300 sec: 5500.6). Total num frames: 650768384. Throughput: 0: 4861.6. Samples: 650765692. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:09,363][25689] Avg episode reward: [(0, '-6.847')] [2022-07-10 07:57:10,367][26022] Updated weights on worker 0-0, policy_version 635522 (0.00085) [2022-07-10 07:57:12,184][26022] Updated weights on worker 0-0, policy_version 635532 (0.00091) [2022-07-10 07:57:14,031][26022] Updated weights on worker 0-0, policy_version 635542 (0.00084) [2022-07-10 07:57:14,418][25689] Fps is (10 sec: 5488.5, 60 sec: 5526.3, 300 sec: 5503.1). Total num frames: 650798080. Throughput: 0: 5700.5. Samples: 650798940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:14,418][25689] Avg episode reward: [(0, '-5.265')] [2022-07-10 07:57:15,715][26022] Updated weights on worker 0-0, policy_version 635552 (0.00098) [2022-07-10 07:57:17,544][26022] Updated weights on worker 0-0, policy_version 635562 (0.00100) [2022-07-10 07:57:19,421][25689] Fps is (10 sec: 5599.1, 60 sec: 5510.1, 300 sec: 5507.1). Total num frames: 650824704. Throughput: 0: 5712.2. Samples: 650832700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:19,422][25689] Avg episode reward: [(0, '-6.390')] [2022-07-10 07:57:19,599][26022] Updated weights on worker 0-0, policy_version 635572 (0.00078) [2022-07-10 07:57:21,264][26022] Updated weights on worker 0-0, policy_version 635582 (0.00092) [2022-07-10 07:57:23,215][26022] Updated weights on worker 0-0, policy_version 635592 (0.00086) [2022-07-10 07:57:24,445][25689] Fps is (10 sec: 5412.3, 60 sec: 5509.2, 300 sec: 5500.7). Total num frames: 650852352. Throughput: 0: 4990.8. Samples: 650849198. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:24,446][25689] Avg episode reward: [(0, '-7.218')] [2022-07-10 07:57:24,938][26022] Updated weights on worker 0-0, policy_version 635602 (0.00092) [2022-07-10 07:57:26,793][26022] Updated weights on worker 0-0, policy_version 635612 (0.00093) [2022-07-10 07:57:28,764][26022] Updated weights on worker 0-0, policy_version 635622 (0.00092) [2022-07-10 07:57:29,448][25689] Fps is (10 sec: 5515.1, 60 sec: 5510.6, 300 sec: 5498.5). Total num frames: 650880000. Throughput: 0: 5826.4. Samples: 650882716. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:29,449][25689] Avg episode reward: [(0, '-7.381')] [2022-07-10 07:57:30,307][26022] Updated weights on worker 0-0, policy_version 635632 (0.00093) [2022-07-10 07:57:32,470][26022] Updated weights on worker 0-0, policy_version 635642 (0.00091) [2022-07-10 07:57:34,056][26022] Updated weights on worker 0-0, policy_version 635652 (0.00087) [2022-07-10 07:57:34,539][25689] Fps is (10 sec: 5579.9, 60 sec: 5492.2, 300 sec: 5502.0). Total num frames: 650908672. Throughput: 0: 5826.4. Samples: 650916172. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:34,539][25689] Avg episode reward: [(0, '-8.288')] [2022-07-10 07:57:36,092][26022] Updated weights on worker 0-0, policy_version 635662 (0.00094) [2022-07-10 07:57:37,963][26022] Updated weights on worker 0-0, policy_version 635672 (0.00093) [2022-07-10 07:57:39,591][25689] Fps is (10 sec: 5653.1, 60 sec: 5541.5, 300 sec: 5505.6). Total num frames: 650937344. Throughput: 0: 4962.3. Samples: 650932790. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:39,592][25689] Avg episode reward: [(0, '-8.628')] [2022-07-10 07:57:39,865][26022] Updated weights on worker 0-0, policy_version 635682 (0.00972) [2022-07-10 07:57:41,586][26022] Updated weights on worker 0-0, policy_version 635692 (0.00105) [2022-07-10 07:57:43,555][26022] Updated weights on worker 0-0, policy_version 635702 (0.00085) [2022-07-10 07:57:44,609][25689] Fps is (10 sec: 5490.8, 60 sec: 5508.0, 300 sec: 5498.8). Total num frames: 650963968. Throughput: 0: 5793.5. Samples: 650966018. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:44,610][25689] Avg episode reward: [(0, '-10.044')] [2022-07-10 07:57:45,224][26022] Updated weights on worker 0-0, policy_version 635712 (0.00088) [2022-07-10 07:57:47,190][26022] Updated weights on worker 0-0, policy_version 635722 (0.00087) [2022-07-10 07:57:48,819][26022] Updated weights on worker 0-0, policy_version 635732 (0.00083) [2022-07-10 07:57:49,621][25689] Fps is (10 sec: 5513.3, 60 sec: 5527.0, 300 sec: 5500.2). Total num frames: 650992640. Throughput: 0: 5799.9. Samples: 650999718. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:49,621][25689] Avg episode reward: [(0, '-9.379')] [2022-07-10 07:57:50,846][26022] Updated weights on worker 0-0, policy_version 635742 (0.00099) [2022-07-10 07:57:52,589][26022] Updated weights on worker 0-0, policy_version 635752 (0.00103) [2022-07-10 07:57:54,528][26022] Updated weights on worker 0-0, policy_version 635762 (0.00092) [2022-07-10 07:57:54,692][25689] Fps is (10 sec: 5789.0, 60 sec: 5526.3, 300 sec: 5506.0). Total num frames: 651022336. Throughput: 0: 4973.2. Samples: 651016398. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:54,692][25689] Avg episode reward: [(0, '-9.105')] [2022-07-10 07:57:56,580][26022] Updated weights on worker 0-0, policy_version 635772 (0.00088) [2022-07-10 07:57:58,094][26022] Updated weights on worker 0-0, policy_version 635782 (0.00093) [2022-07-10 07:57:59,758][25689] Fps is (10 sec: 5454.6, 60 sec: 5503.6, 300 sec: 5501.6). Total num frames: 651047936. Throughput: 0: 5817.1. Samples: 651050104. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:57:59,759][25689] Avg episode reward: [(0, '-8.922')] [2022-07-10 07:58:00,189][26022] Updated weights on worker 0-0, policy_version 635792 (0.00091) [2022-07-10 07:58:01,860][26022] Updated weights on worker 0-0, policy_version 635802 (0.00124) [2022-07-10 07:58:04,203][26022] Updated weights on worker 0-0, policy_version 635812 (0.00083) [2022-07-10 07:58:04,769][25689] Fps is (10 sec: 5385.7, 60 sec: 5524.5, 300 sec: 5511.9). Total num frames: 651076608. Throughput: 0: 5730.6. Samples: 651081544. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:04,769][25689] Avg episode reward: [(0, '-10.095')] [2022-07-10 07:58:05,889][26022] Updated weights on worker 0-0, policy_version 635822 (0.00093) [2022-07-10 07:58:07,747][26022] Updated weights on worker 0-0, policy_version 635832 (0.00091) [2022-07-10 07:58:09,655][26022] Updated weights on worker 0-0, policy_version 635842 (0.00097) [2022-07-10 07:58:09,773][25689] Fps is (10 sec: 5521.7, 60 sec: 5542.9, 300 sec: 5502.8). Total num frames: 651103232. Throughput: 0: 4890.6. Samples: 651098272. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:09,773][25689] Avg episode reward: [(0, '-9.737')] [2022-07-10 07:58:11,392][26022] Updated weights on worker 0-0, policy_version 635852 (0.00085) [2022-07-10 07:58:13,213][26022] Updated weights on worker 0-0, policy_version 635862 (0.00082) [2022-07-10 07:58:14,258][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 07:58:14,270][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000635868_651128832.pth [2022-07-10 07:58:14,270][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000633930_649144320.pth [2022-07-10 07:58:14,836][25689] Fps is (10 sec: 5492.5, 60 sec: 5525.2, 300 sec: 5505.5). Total num frames: 651131904. Throughput: 0: 5717.1. Samples: 651131566. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:14,837][25689] Avg episode reward: [(0, '-9.063')] [2022-07-10 07:58:14,950][26022] Updated weights on worker 0-0, policy_version 635872 (0.00090) [2022-07-10 07:58:17,167][26022] Updated weights on worker 0-0, policy_version 635882 (0.00092) [2022-07-10 07:58:18,597][26022] Updated weights on worker 0-0, policy_version 635892 (0.00092) [2022-07-10 07:58:19,881][25689] Fps is (10 sec: 5470.4, 60 sec: 5521.4, 300 sec: 5504.7). Total num frames: 651158528. Throughput: 0: 5700.6. Samples: 651164814. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:19,881][25689] Avg episode reward: [(0, '-8.061')] [2022-07-10 07:58:20,699][26022] Updated weights on worker 0-0, policy_version 635902 (0.00087) [2022-07-10 07:58:22,379][26022] Updated weights on worker 0-0, policy_version 635912 (0.00084) [2022-07-10 07:58:24,434][26022] Updated weights on worker 0-0, policy_version 635922 (0.00101) [2022-07-10 07:58:24,915][25689] Fps is (10 sec: 5385.0, 60 sec: 5520.5, 300 sec: 5497.6). Total num frames: 651186176. Throughput: 0: 4950.1. Samples: 651181268. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:24,915][25689] Avg episode reward: [(0, '-8.906')] [2022-07-10 07:58:26,251][26022] Updated weights on worker 0-0, policy_version 635932 (0.00090) [2022-07-10 07:58:28,040][26022] Updated weights on worker 0-0, policy_version 635942 (0.00083) [2022-07-10 07:58:29,929][25689] Fps is (10 sec: 5401.4, 60 sec: 5502.5, 300 sec: 5503.2). Total num frames: 651212800. Throughput: 0: 5755.3. Samples: 651214276. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:29,930][25689] Avg episode reward: [(0, '-7.084')] [2022-07-10 07:58:30,134][26022] Updated weights on worker 0-0, policy_version 635952 (0.00064) [2022-07-10 07:58:31,830][26022] Updated weights on worker 0-0, policy_version 635962 (0.00088) [2022-07-10 07:58:33,752][26022] Updated weights on worker 0-0, policy_version 635972 (0.00110) [2022-07-10 07:58:35,057][25689] Fps is (10 sec: 5452.1, 60 sec: 5499.2, 300 sec: 5502.1). Total num frames: 651241472. Throughput: 0: 5736.8. Samples: 651247566. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:35,057][25689] Avg episode reward: [(0, '-7.769')] [2022-07-10 07:58:35,696][26022] Updated weights on worker 0-0, policy_version 635982 (0.00091) [2022-07-10 07:58:37,440][26022] Updated weights on worker 0-0, policy_version 635992 (0.00092) [2022-07-10 07:58:39,311][26022] Updated weights on worker 0-0, policy_version 636002 (0.00089) [2022-07-10 07:58:40,076][25689] Fps is (10 sec: 5751.7, 60 sec: 5519.1, 300 sec: 5509.5). Total num frames: 651271168. Throughput: 0: 4932.0. Samples: 651264418. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:40,078][25689] Avg episode reward: [(0, '-8.075')] [2022-07-10 07:58:41,008][26022] Updated weights on worker 0-0, policy_version 636012 (0.00089) [2022-07-10 07:58:42,815][26022] Updated weights on worker 0-0, policy_version 636022 (0.00085) [2022-07-10 07:58:44,719][26022] Updated weights on worker 0-0, policy_version 636032 (0.00091) [2022-07-10 07:58:45,086][25689] Fps is (10 sec: 5615.5, 60 sec: 5519.9, 300 sec: 5503.9). Total num frames: 651297792. Throughput: 0: 5785.9. Samples: 651297978. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 07:58:45,088][25689] Avg episode reward: [(0, '-8.095')] [2022-07-10 07:58:46,358][26022] Updated weights on worker 0-0, policy_version 636042 (0.00086) [2022-07-10 07:58:48,419][26022] Updated weights on worker 0-0, policy_version 636052 (0.00084) [2022-07-10 07:58:50,102][25689] Fps is (10 sec: 5515.6, 60 sec: 5519.5, 300 sec: 5508.8). Total num frames: 651326464. Throughput: 0: 5811.5. Samples: 651331512. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:58:50,102][25689] Avg episode reward: [(0, '-7.890')] [2022-07-10 07:58:50,135][26022] Updated weights on worker 0-0, policy_version 636062 (0.00094) [2022-07-10 07:58:52,066][26022] Updated weights on worker 0-0, policy_version 636072 (0.00078) [2022-07-10 07:58:53,806][26022] Updated weights on worker 0-0, policy_version 636082 (0.00086) [2022-07-10 07:58:55,185][25689] Fps is (10 sec: 5576.7, 60 sec: 5484.5, 300 sec: 5501.0). Total num frames: 651354112. Throughput: 0: 4979.2. Samples: 651347790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:58:55,187][25689] Avg episode reward: [(0, '-8.006')] [2022-07-10 07:58:55,812][26022] Updated weights on worker 0-0, policy_version 636092 (0.00083) [2022-07-10 07:58:57,626][26022] Updated weights on worker 0-0, policy_version 636102 (0.00086) [2022-07-10 07:58:59,669][26022] Updated weights on worker 0-0, policy_version 636112 (0.00083) [2022-07-10 07:59:00,222][25689] Fps is (10 sec: 5565.0, 60 sec: 5538.0, 300 sec: 5521.7). Total num frames: 651382784. Throughput: 0: 5789.4. Samples: 651381050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:00,223][25689] Avg episode reward: [(0, '-7.715')] [2022-07-10 07:59:01,282][26022] Updated weights on worker 0-0, policy_version 636122 (0.00086) [2022-07-10 07:59:03,721][26022] Updated weights on worker 0-0, policy_version 636132 (0.00081) [2022-07-10 07:59:05,243][25689] Fps is (10 sec: 5396.1, 60 sec: 5486.3, 300 sec: 5511.7). Total num frames: 651408384. Throughput: 0: 5667.4. Samples: 651412214. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:05,243][25689] Avg episode reward: [(0, '-6.475')] [2022-07-10 07:59:05,483][26022] Updated weights on worker 0-0, policy_version 636142 (0.00084) [2022-07-10 07:59:07,403][26022] Updated weights on worker 0-0, policy_version 636152 (0.00087) [2022-07-10 07:59:09,043][26022] Updated weights on worker 0-0, policy_version 636162 (0.00087) [2022-07-10 07:59:10,269][25689] Fps is (10 sec: 5198.0, 60 sec: 5484.2, 300 sec: 5501.9). Total num frames: 651435008. Throughput: 0: 4827.2. Samples: 651428864. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:10,271][25689] Avg episode reward: [(0, '-6.822')] [2022-07-10 07:59:11,172][26022] Updated weights on worker 0-0, policy_version 636172 (0.00303) [2022-07-10 07:59:12,806][26022] Updated weights on worker 0-0, policy_version 636182 (0.00088) [2022-07-10 07:59:14,821][26022] Updated weights on worker 0-0, policy_version 636192 (0.00088) [2022-07-10 07:59:15,328][25689] Fps is (10 sec: 5482.9, 60 sec: 5484.7, 300 sec: 5508.2). Total num frames: 651463680. Throughput: 0: 5680.5. Samples: 651462212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:15,329][25689] Avg episode reward: [(0, '-7.420')] [2022-07-10 07:59:16,487][26022] Updated weights on worker 0-0, policy_version 636202 (0.00085) [2022-07-10 07:59:18,329][26022] Updated weights on worker 0-0, policy_version 636212 (0.00086) [2022-07-10 07:59:20,148][26022] Updated weights on worker 0-0, policy_version 636222 (0.00088) [2022-07-10 07:59:20,336][25689] Fps is (10 sec: 5594.4, 60 sec: 5504.9, 300 sec: 5508.6). Total num frames: 651491328. Throughput: 0: 5691.4. Samples: 651495526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:20,338][25689] Avg episode reward: [(0, '-7.596')] [2022-07-10 07:59:22,373][26022] Updated weights on worker 0-0, policy_version 636232 (0.00057) [2022-07-10 07:59:24,067][26022] Updated weights on worker 0-0, policy_version 636242 (0.00087) [2022-07-10 07:59:25,342][25689] Fps is (10 sec: 5522.1, 60 sec: 5507.5, 300 sec: 5505.1). Total num frames: 651518976. Throughput: 0: 5789.4. Samples: 651528574. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:25,343][25689] Avg episode reward: [(0, '-6.982')] [2022-07-10 07:59:26,015][26022] Updated weights on worker 0-0, policy_version 636252 (0.00090) [2022-07-10 07:59:27,669][26022] Updated weights on worker 0-0, policy_version 636262 (0.00057) [2022-07-10 07:59:29,688][26022] Updated weights on worker 0-0, policy_version 636272 (0.00085) [2022-07-10 07:59:30,401][25689] Fps is (10 sec: 5494.3, 60 sec: 5520.3, 300 sec: 5505.7). Total num frames: 651546624. Throughput: 0: 5780.9. Samples: 651545242. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:30,402][25689] Avg episode reward: [(0, '-7.061')] [2022-07-10 07:59:31,424][26022] Updated weights on worker 0-0, policy_version 636282 (0.00089) [2022-07-10 07:59:33,507][26022] Updated weights on worker 0-0, policy_version 636292 (0.00088) [2022-07-10 07:59:35,135][26022] Updated weights on worker 0-0, policy_version 636302 (0.00087) [2022-07-10 07:59:35,522][25689] Fps is (10 sec: 5532.2, 60 sec: 5520.9, 300 sec: 5510.6). Total num frames: 651575296. Throughput: 0: 5750.8. Samples: 651578344. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:35,523][25689] Avg episode reward: [(0, '-7.389')] [2022-07-10 07:59:37,026][26022] Updated weights on worker 0-0, policy_version 636312 (0.00084) [2022-07-10 07:59:38,756][26022] Updated weights on worker 0-0, policy_version 636322 (0.00082) [2022-07-10 07:59:40,539][25689] Fps is (10 sec: 5454.2, 60 sec: 5470.4, 300 sec: 5504.7). Total num frames: 651601920. Throughput: 0: 5750.7. Samples: 651611704. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:40,539][25689] Avg episode reward: [(0, '-8.451')] [2022-07-10 07:59:40,877][26022] Updated weights on worker 0-0, policy_version 636332 (0.00097) [2022-07-10 07:59:42,433][26022] Updated weights on worker 0-0, policy_version 636342 (0.00087) [2022-07-10 07:59:44,613][26022] Updated weights on worker 0-0, policy_version 636352 (0.00088) [2022-07-10 07:59:45,610][25689] Fps is (10 sec: 5583.1, 60 sec: 5515.6, 300 sec: 5507.5). Total num frames: 651631616. Throughput: 0: 4919.2. Samples: 651628276. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:45,611][25689] Avg episode reward: [(0, '-5.276')] [2022-07-10 07:59:46,191][26022] Updated weights on worker 0-0, policy_version 636362 (0.00090) [2022-07-10 07:59:48,188][26022] Updated weights on worker 0-0, policy_version 636372 (0.00089) [2022-07-10 07:59:49,989][26022] Updated weights on worker 0-0, policy_version 636382 (0.00629) [2022-07-10 07:59:50,650][25689] Fps is (10 sec: 5570.2, 60 sec: 5479.5, 300 sec: 5505.7). Total num frames: 651658240. Throughput: 0: 5746.5. Samples: 651661604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:50,650][25689] Avg episode reward: [(0, '-5.529')] [2022-07-10 07:59:51,671][26022] Updated weights on worker 0-0, policy_version 636392 (0.00087) [2022-07-10 07:59:53,780][26022] Updated weights on worker 0-0, policy_version 636402 (0.00093) [2022-07-10 07:59:55,430][26022] Updated weights on worker 0-0, policy_version 636412 (0.00095) [2022-07-10 07:59:55,769][25689] Fps is (10 sec: 5443.1, 60 sec: 5493.3, 300 sec: 5503.8). Total num frames: 651686912. Throughput: 0: 5760.5. Samples: 651694974. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 07:59:55,771][25689] Avg episode reward: [(0, '-8.306')] [2022-07-10 07:59:57,284][26022] Updated weights on worker 0-0, policy_version 636422 (0.00093) [2022-07-10 07:59:59,170][26022] Updated weights on worker 0-0, policy_version 636432 (0.00086) [2022-07-10 08:00:00,844][25689] Fps is (10 sec: 5625.1, 60 sec: 5489.8, 300 sec: 5516.6). Total num frames: 651715584. Throughput: 0: 4924.2. Samples: 651711692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:00,845][25689] Avg episode reward: [(0, '-7.463')] [2022-07-10 08:00:00,970][26022] Updated weights on worker 0-0, policy_version 636442 (0.00092) [2022-07-10 08:00:03,152][26022] Updated weights on worker 0-0, policy_version 636452 (0.00089) [2022-07-10 08:00:04,915][26022] Updated weights on worker 0-0, policy_version 636462 (0.00088) [2022-07-10 08:00:05,869][25689] Fps is (10 sec: 5373.0, 60 sec: 5489.3, 300 sec: 5506.1). Total num frames: 651741184. Throughput: 0: 5670.8. Samples: 651743164. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:05,870][25689] Avg episode reward: [(0, '-7.417')] [2022-07-10 08:00:06,718][26022] Updated weights on worker 0-0, policy_version 636472 (0.00094) [2022-07-10 08:00:08,483][26022] Updated weights on worker 0-0, policy_version 636482 (0.00102) [2022-07-10 08:00:10,721][26022] Updated weights on worker 0-0, policy_version 636492 (0.00092) [2022-07-10 08:00:10,881][25689] Fps is (10 sec: 5305.4, 60 sec: 5507.6, 300 sec: 5504.8). Total num frames: 651768832. Throughput: 0: 5681.1. Samples: 651776538. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:10,881][25689] Avg episode reward: [(0, '-7.407')] [2022-07-10 08:00:12,324][26022] Updated weights on worker 0-0, policy_version 636502 (0.00090) [2022-07-10 08:00:14,246][26022] Updated weights on worker 0-0, policy_version 636512 (0.00090) [2022-07-10 08:00:14,491][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:00:14,504][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000636513_651789312.pth [2022-07-10 08:00:14,504][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000634574_649803776.pth [2022-07-10 08:00:15,962][25689] Fps is (10 sec: 5478.8, 60 sec: 5488.7, 300 sec: 5510.4). Total num frames: 651796480. Throughput: 0: 4861.9. Samples: 651793152. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:15,963][25689] Avg episode reward: [(0, '-7.443')] [2022-07-10 08:00:16,048][26022] Updated weights on worker 0-0, policy_version 636522 (0.00096) [2022-07-10 08:00:18,028][26022] Updated weights on worker 0-0, policy_version 636532 (0.00090) [2022-07-10 08:00:19,731][26022] Updated weights on worker 0-0, policy_version 636542 (0.00092) [2022-07-10 08:00:21,061][25689] Fps is (10 sec: 5532.2, 60 sec: 5497.3, 300 sec: 5505.5). Total num frames: 651825152. Throughput: 0: 5667.8. Samples: 651826278. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:21,062][25689] Avg episode reward: [(0, '-6.157')] [2022-07-10 08:00:21,733][26022] Updated weights on worker 0-0, policy_version 636552 (0.00086) [2022-07-10 08:00:23,244][26022] Updated weights on worker 0-0, policy_version 636562 (0.00082) [2022-07-10 08:00:25,369][26022] Updated weights on worker 0-0, policy_version 636572 (0.00086) [2022-07-10 08:00:26,108][25689] Fps is (10 sec: 5651.8, 60 sec: 5510.4, 300 sec: 5508.2). Total num frames: 651853824. Throughput: 0: 5762.8. Samples: 651859796. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:26,109][25689] Avg episode reward: [(0, '-3.985')] [2022-07-10 08:00:27,000][26022] Updated weights on worker 0-0, policy_version 636582 (0.00088) [2022-07-10 08:00:29,029][26022] Updated weights on worker 0-0, policy_version 636592 (0.00091) [2022-07-10 08:00:30,811][26022] Updated weights on worker 0-0, policy_version 636602 (0.00093) [2022-07-10 08:00:31,124][25689] Fps is (10 sec: 5597.0, 60 sec: 5514.4, 300 sec: 5513.3). Total num frames: 651881472. Throughput: 0: 4925.8. Samples: 651876250. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:31,124][25689] Avg episode reward: [(0, '-5.067')] [2022-07-10 08:00:32,804][26022] Updated weights on worker 0-0, policy_version 636612 (0.00085) [2022-07-10 08:00:34,784][26022] Updated weights on worker 0-0, policy_version 636622 (0.00089) [2022-07-10 08:00:36,232][25689] Fps is (10 sec: 5462.0, 60 sec: 5498.7, 300 sec: 5508.1). Total num frames: 651909120. Throughput: 0: 5735.4. Samples: 651909408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:36,233][25689] Avg episode reward: [(0, '-4.568')] [2022-07-10 08:00:36,537][26022] Updated weights on worker 0-0, policy_version 636632 (0.00096) [2022-07-10 08:00:38,304][26022] Updated weights on worker 0-0, policy_version 636642 (0.00090) [2022-07-10 08:00:40,084][26022] Updated weights on worker 0-0, policy_version 636652 (0.00085) [2022-07-10 08:00:41,245][25689] Fps is (10 sec: 5564.7, 60 sec: 5532.8, 300 sec: 5511.4). Total num frames: 651937792. Throughput: 0: 5776.3. Samples: 651942864. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:41,245][25689] Avg episode reward: [(0, '-3.955')] [2022-07-10 08:00:42,088][26022] Updated weights on worker 0-0, policy_version 636662 (0.00090) [2022-07-10 08:00:43,858][26022] Updated weights on worker 0-0, policy_version 636672 (0.00095) [2022-07-10 08:00:45,748][26022] Updated weights on worker 0-0, policy_version 636682 (0.00088) [2022-07-10 08:00:46,266][25689] Fps is (10 sec: 5612.8, 60 sec: 5503.5, 300 sec: 5515.0). Total num frames: 651965440. Throughput: 0: 4950.1. Samples: 651959580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:46,267][25689] Avg episode reward: [(0, '-3.652')] [2022-07-10 08:00:47,536][26022] Updated weights on worker 0-0, policy_version 636692 (0.00092) [2022-07-10 08:00:49,479][26022] Updated weights on worker 0-0, policy_version 636702 (0.00086) [2022-07-10 08:00:51,143][26022] Updated weights on worker 0-0, policy_version 636712 (0.00094) [2022-07-10 08:00:51,291][25689] Fps is (10 sec: 5504.0, 60 sec: 5521.8, 300 sec: 5512.8). Total num frames: 651993088. Throughput: 0: 5781.0. Samples: 651992838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:51,292][25689] Avg episode reward: [(0, '-3.457')] [2022-07-10 08:00:53,089][26022] Updated weights on worker 0-0, policy_version 636722 (0.00096) [2022-07-10 08:00:55,022][26022] Updated weights on worker 0-0, policy_version 636732 (0.00085) [2022-07-10 08:00:56,348][25689] Fps is (10 sec: 5586.4, 60 sec: 5527.4, 300 sec: 5509.7). Total num frames: 652021760. Throughput: 0: 5812.8. Samples: 652026338. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:00:56,349][25689] Avg episode reward: [(0, '-3.595')] [2022-07-10 08:00:56,610][26022] Updated weights on worker 0-0, policy_version 636742 (0.00088) [2022-07-10 08:00:58,681][26022] Updated weights on worker 0-0, policy_version 636752 (0.00087) [2022-07-10 08:01:00,394][26022] Updated weights on worker 0-0, policy_version 636762 (0.00089) [2022-07-10 08:01:01,371][25689] Fps is (10 sec: 5485.9, 60 sec: 5498.4, 300 sec: 5514.2). Total num frames: 652048384. Throughput: 0: 4972.2. Samples: 652042934. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:01,371][25689] Avg episode reward: [(0, '-5.425')] [2022-07-10 08:01:02,759][26022] Updated weights on worker 0-0, policy_version 636772 (0.00092) [2022-07-10 08:01:04,500][26022] Updated weights on worker 0-0, policy_version 636782 (0.00095) [2022-07-10 08:01:06,356][26022] Updated weights on worker 0-0, policy_version 636792 (0.00092) [2022-07-10 08:01:06,431][25689] Fps is (10 sec: 5281.1, 60 sec: 5512.2, 300 sec: 5504.0). Total num frames: 652075008. Throughput: 0: 5671.3. Samples: 652073938. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:06,431][25689] Avg episode reward: [(0, '-6.737')] [2022-07-10 08:01:08,268][26022] Updated weights on worker 0-0, policy_version 636802 (0.00092) [2022-07-10 08:01:10,217][26022] Updated weights on worker 0-0, policy_version 636812 (0.00083) [2022-07-10 08:01:11,439][25689] Fps is (10 sec: 5288.8, 60 sec: 5495.6, 300 sec: 5505.1). Total num frames: 652101632. Throughput: 0: 5675.8. Samples: 652107192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:11,439][25689] Avg episode reward: [(0, '-6.722')] [2022-07-10 08:01:11,996][26022] Updated weights on worker 0-0, policy_version 636822 (0.00096) [2022-07-10 08:01:13,918][26022] Updated weights on worker 0-0, policy_version 636832 (0.00094) [2022-07-10 08:01:15,874][26022] Updated weights on worker 0-0, policy_version 636842 (0.00087) [2022-07-10 08:01:16,564][25689] Fps is (10 sec: 5456.6, 60 sec: 5508.4, 300 sec: 5506.5). Total num frames: 652130304. Throughput: 0: 4821.1. Samples: 652123804. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:16,571][25689] Avg episode reward: [(0, '-7.126')] [2022-07-10 08:01:17,649][26022] Updated weights on worker 0-0, policy_version 636852 (0.00091) [2022-07-10 08:01:19,471][26022] Updated weights on worker 0-0, policy_version 636862 (0.00085) [2022-07-10 08:01:21,245][26022] Updated weights on worker 0-0, policy_version 636872 (0.00087) [2022-07-10 08:01:21,575][25689] Fps is (10 sec: 5657.4, 60 sec: 5516.5, 300 sec: 5510.0). Total num frames: 652158976. Throughput: 0: 5646.6. Samples: 652157018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:21,575][25689] Avg episode reward: [(0, '-7.496')] [2022-07-10 08:01:23,207][26022] Updated weights on worker 0-0, policy_version 636882 (0.00086) [2022-07-10 08:01:24,976][26022] Updated weights on worker 0-0, policy_version 636892 (0.00091) [2022-07-10 08:01:26,598][25689] Fps is (10 sec: 5613.3, 60 sec: 5501.8, 300 sec: 5509.9). Total num frames: 652186624. Throughput: 0: 5769.9. Samples: 652190300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:26,598][25689] Avg episode reward: [(0, '-8.175')] [2022-07-10 08:01:26,791][26022] Updated weights on worker 0-0, policy_version 636902 (0.00095) [2022-07-10 08:01:28,805][26022] Updated weights on worker 0-0, policy_version 636912 (0.00085) [2022-07-10 08:01:30,706][26022] Updated weights on worker 0-0, policy_version 636922 (0.00082) [2022-07-10 08:01:31,619][25689] Fps is (10 sec: 5403.5, 60 sec: 5484.3, 300 sec: 5500.5). Total num frames: 652213248. Throughput: 0: 4927.8. Samples: 652206636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:31,619][25689] Avg episode reward: [(0, '-7.660')] [2022-07-10 08:01:32,298][26022] Updated weights on worker 0-0, policy_version 636932 (0.00086) [2022-07-10 08:01:34,351][26022] Updated weights on worker 0-0, policy_version 636942 (0.00095) [2022-07-10 08:01:35,980][26022] Updated weights on worker 0-0, policy_version 636952 (0.00083) [2022-07-10 08:01:36,698][25689] Fps is (10 sec: 5373.6, 60 sec: 5487.0, 300 sec: 5506.6). Total num frames: 652240896. Throughput: 0: 5761.5. Samples: 652239802. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:36,698][25689] Avg episode reward: [(0, '-6.358')] [2022-07-10 08:01:37,952][26022] Updated weights on worker 0-0, policy_version 636962 (0.00091) [2022-07-10 08:01:39,984][26022] Updated weights on worker 0-0, policy_version 636972 (0.00081) [2022-07-10 08:01:41,443][26022] Updated weights on worker 0-0, policy_version 636982 (0.00086) [2022-07-10 08:01:41,700][25689] Fps is (10 sec: 5688.2, 60 sec: 5504.9, 300 sec: 5510.4). Total num frames: 652270592. Throughput: 0: 5765.7. Samples: 652273054. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:41,700][25689] Avg episode reward: [(0, '-7.456')] [2022-07-10 08:01:43,619][26022] Updated weights on worker 0-0, policy_version 636992 (0.00093) [2022-07-10 08:01:45,043][26022] Updated weights on worker 0-0, policy_version 637002 (0.00094) [2022-07-10 08:01:46,727][25689] Fps is (10 sec: 5513.4, 60 sec: 5470.5, 300 sec: 5503.7). Total num frames: 652296192. Throughput: 0: 5776.5. Samples: 652306576. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:46,727][25689] Avg episode reward: [(0, '-6.706')] [2022-07-10 08:01:47,181][26022] Updated weights on worker 0-0, policy_version 637012 (0.00091) [2022-07-10 08:01:49,139][26022] Updated weights on worker 0-0, policy_version 637022 (0.00085) [2022-07-10 08:01:50,796][26022] Updated weights on worker 0-0, policy_version 637032 (0.00084) [2022-07-10 08:01:51,815][25689] Fps is (10 sec: 5466.7, 60 sec: 5498.7, 300 sec: 5503.2). Total num frames: 652325888. Throughput: 0: 5770.5. Samples: 652323178. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:51,816][25689] Avg episode reward: [(0, '-5.527')] [2022-07-10 08:01:52,780][26022] Updated weights on worker 0-0, policy_version 637042 (0.00091) [2022-07-10 08:01:54,781][26022] Updated weights on worker 0-0, policy_version 637052 (0.00087) [2022-07-10 08:01:56,352][26022] Updated weights on worker 0-0, policy_version 637062 (0.00093) [2022-07-10 08:01:56,943][25689] Fps is (10 sec: 5713.6, 60 sec: 5492.2, 300 sec: 5507.8). Total num frames: 652354560. Throughput: 0: 5772.6. Samples: 652356668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 08:01:56,943][25689] Avg episode reward: [(0, '-6.168')] [2022-07-10 08:01:58,423][26022] Updated weights on worker 0-0, policy_version 637072 (0.00886) [2022-07-10 08:01:59,914][26022] Updated weights on worker 0-0, policy_version 637082 (0.00085) [2022-07-10 08:02:01,970][25689] Fps is (10 sec: 5243.7, 60 sec: 5458.0, 300 sec: 5498.0). Total num frames: 652379136. Throughput: 0: 5695.9. Samples: 652388508. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:01,970][25689] Avg episode reward: [(0, '-5.524')] [2022-07-10 08:02:02,399][26022] Updated weights on worker 0-0, policy_version 637092 (0.00090) [2022-07-10 08:02:04,075][26022] Updated weights on worker 0-0, policy_version 637102 (0.00091) [2022-07-10 08:02:05,932][26022] Updated weights on worker 0-0, policy_version 637112 (0.00079) [2022-07-10 08:02:06,995][25689] Fps is (10 sec: 5297.2, 60 sec: 5495.0, 300 sec: 5508.2). Total num frames: 652407808. Throughput: 0: 4833.6. Samples: 652404542. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:06,995][25689] Avg episode reward: [(0, '-5.229')] [2022-07-10 08:02:07,747][26022] Updated weights on worker 0-0, policy_version 637122 (0.00096) [2022-07-10 08:02:09,696][26022] Updated weights on worker 0-0, policy_version 637132 (0.00089) [2022-07-10 08:02:11,483][26022] Updated weights on worker 0-0, policy_version 637142 (0.00082) [2022-07-10 08:02:12,037][25689] Fps is (10 sec: 5594.2, 60 sec: 5508.8, 300 sec: 5501.6). Total num frames: 652435456. Throughput: 0: 5673.3. Samples: 652437904. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:12,038][25689] Avg episode reward: [(0, '-5.036')] [2022-07-10 08:02:13,474][26022] Updated weights on worker 0-0, policy_version 637152 (0.00085) [2022-07-10 08:02:14,624][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:02:14,632][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000637159_652450816.pth [2022-07-10 08:02:14,633][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000635220_650465280.pth [2022-07-10 08:02:15,232][26022] Updated weights on worker 0-0, policy_version 637162 (0.00083) [2022-07-10 08:02:17,051][26022] Updated weights on worker 0-0, policy_version 637172 (0.00094) [2022-07-10 08:02:17,153][25689] Fps is (10 sec: 5544.2, 60 sec: 5509.7, 300 sec: 5506.4). Total num frames: 652464128. Throughput: 0: 5671.6. Samples: 652471294. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:17,154][25689] Avg episode reward: [(0, '-6.194')] [2022-07-10 08:02:18,952][26022] Updated weights on worker 0-0, policy_version 637182 (0.00098) [2022-07-10 08:02:20,554][26022] Updated weights on worker 0-0, policy_version 637192 (0.00082) [2022-07-10 08:02:22,186][25689] Fps is (10 sec: 5549.3, 60 sec: 5490.7, 300 sec: 5506.2). Total num frames: 652491776. Throughput: 0: 4934.1. Samples: 652488258. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:22,187][25689] Avg episode reward: [(0, '-6.030')] [2022-07-10 08:02:22,807][26022] Updated weights on worker 0-0, policy_version 637202 (0.00087) [2022-07-10 08:02:24,258][26022] Updated weights on worker 0-0, policy_version 637212 (0.00084) [2022-07-10 08:02:26,411][26022] Updated weights on worker 0-0, policy_version 637222 (0.00090) [2022-07-10 08:02:27,209][25689] Fps is (10 sec: 5600.6, 60 sec: 5507.6, 300 sec: 5509.3). Total num frames: 652520448. Throughput: 0: 5809.1. Samples: 652521970. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:27,211][25689] Avg episode reward: [(0, '-5.881')] [2022-07-10 08:02:27,947][26022] Updated weights on worker 0-0, policy_version 637232 (0.00096) [2022-07-10 08:02:29,860][26022] Updated weights on worker 0-0, policy_version 637242 (0.00097) [2022-07-10 08:02:32,000][26022] Updated weights on worker 0-0, policy_version 637252 (0.00088) [2022-07-10 08:02:32,221][25689] Fps is (10 sec: 5612.3, 60 sec: 5525.3, 300 sec: 5507.3). Total num frames: 652548096. Throughput: 0: 5807.1. Samples: 652555114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:32,223][25689] Avg episode reward: [(0, '-7.290')] [2022-07-10 08:02:33,668][26022] Updated weights on worker 0-0, policy_version 637262 (0.00089) [2022-07-10 08:02:35,554][26022] Updated weights on worker 0-0, policy_version 637272 (0.00085) [2022-07-10 08:02:37,358][25689] Fps is (10 sec: 5448.3, 60 sec: 5520.0, 300 sec: 5502.3). Total num frames: 652575744. Throughput: 0: 4970.5. Samples: 652571724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:37,359][25689] Avg episode reward: [(0, '-6.400')] [2022-07-10 08:02:37,485][26022] Updated weights on worker 0-0, policy_version 637282 (0.00089) [2022-07-10 08:02:39,168][26022] Updated weights on worker 0-0, policy_version 637292 (0.00082) [2022-07-10 08:02:41,072][26022] Updated weights on worker 0-0, policy_version 637302 (0.00094) [2022-07-10 08:02:42,426][25689] Fps is (10 sec: 5418.5, 60 sec: 5480.3, 300 sec: 5504.8). Total num frames: 652603392. Throughput: 0: 5763.1. Samples: 652604904. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:42,427][25689] Avg episode reward: [(0, '-6.004')] [2022-07-10 08:02:42,839][26022] Updated weights on worker 0-0, policy_version 637312 (0.00086) [2022-07-10 08:02:44,668][26022] Updated weights on worker 0-0, policy_version 637322 (0.00092) [2022-07-10 08:02:46,714][26022] Updated weights on worker 0-0, policy_version 637332 (0.00090) [2022-07-10 08:02:47,439][25689] Fps is (10 sec: 5688.7, 60 sec: 5549.1, 300 sec: 5508.2). Total num frames: 652633088. Throughput: 0: 5740.8. Samples: 652638104. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:47,439][25689] Avg episode reward: [(0, '-5.813')] [2022-07-10 08:02:48,595][26022] Updated weights on worker 0-0, policy_version 637342 (0.00091) [2022-07-10 08:02:50,197][26022] Updated weights on worker 0-0, policy_version 637352 (0.00091) [2022-07-10 08:02:52,265][26022] Updated weights on worker 0-0, policy_version 637362 (0.00085) [2022-07-10 08:02:52,459][25689] Fps is (10 sec: 5511.6, 60 sec: 5487.8, 300 sec: 5495.4). Total num frames: 652658688. Throughput: 0: 4919.2. Samples: 652654666. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:52,459][25689] Avg episode reward: [(0, '-6.167')] [2022-07-10 08:02:53,913][26022] Updated weights on worker 0-0, policy_version 637372 (0.00087) [2022-07-10 08:02:55,995][26022] Updated weights on worker 0-0, policy_version 637382 (0.00098) [2022-07-10 08:02:57,536][25689] Fps is (10 sec: 5476.3, 60 sec: 5509.2, 300 sec: 5509.0). Total num frames: 652688384. Throughput: 0: 5771.3. Samples: 652688174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:02:57,536][25689] Avg episode reward: [(0, '-7.081')] [2022-07-10 08:02:57,707][26022] Updated weights on worker 0-0, policy_version 637392 (0.00088) [2022-07-10 08:02:59,551][26022] Updated weights on worker 0-0, policy_version 637402 (0.00088) [2022-07-10 08:03:02,115][26022] Updated weights on worker 0-0, policy_version 637412 (0.00088) [2022-07-10 08:03:02,598][25689] Fps is (10 sec: 5251.8, 60 sec: 5489.2, 300 sec: 5490.8). Total num frames: 652711936. Throughput: 0: 5661.9. Samples: 652719114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:02,598][25689] Avg episode reward: [(0, '-9.084')] [2022-07-10 08:03:03,530][26022] Updated weights on worker 0-0, policy_version 637422 (0.00097) [2022-07-10 08:03:05,575][26022] Updated weights on worker 0-0, policy_version 637432 (0.00084) [2022-07-10 08:03:07,206][26022] Updated weights on worker 0-0, policy_version 637442 (0.00076) [2022-07-10 08:03:07,615][25689] Fps is (10 sec: 5283.0, 60 sec: 5506.8, 300 sec: 5500.9). Total num frames: 652741632. Throughput: 0: 4840.9. Samples: 652735776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:07,616][25689] Avg episode reward: [(0, '-9.263')] [2022-07-10 08:03:09,267][26022] Updated weights on worker 0-0, policy_version 637452 (0.00088) [2022-07-10 08:03:11,002][26022] Updated weights on worker 0-0, policy_version 637462 (0.00051) [2022-07-10 08:03:12,623][25689] Fps is (10 sec: 5720.1, 60 sec: 5510.0, 300 sec: 5498.5). Total num frames: 652769280. Throughput: 0: 5685.7. Samples: 652769312. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:12,623][25689] Avg episode reward: [(0, '-9.208')] [2022-07-10 08:03:12,802][26022] Updated weights on worker 0-0, policy_version 637472 (0.00090) [2022-07-10 08:03:14,636][26022] Updated weights on worker 0-0, policy_version 637482 (0.00081) [2022-07-10 08:03:16,570][26022] Updated weights on worker 0-0, policy_version 637492 (0.00086) [2022-07-10 08:03:17,720][25689] Fps is (10 sec: 5573.5, 60 sec: 5511.6, 300 sec: 5504.4). Total num frames: 652797952. Throughput: 0: 5679.4. Samples: 652802808. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:17,720][25689] Avg episode reward: [(0, '-10.905')] [2022-07-10 08:03:18,299][26022] Updated weights on worker 0-0, policy_version 637502 (0.00085) [2022-07-10 08:03:20,446][26022] Updated weights on worker 0-0, policy_version 637512 (0.00088) [2022-07-10 08:03:21,918][26022] Updated weights on worker 0-0, policy_version 637522 (0.00078) [2022-07-10 08:03:22,738][25689] Fps is (10 sec: 5668.9, 60 sec: 5529.9, 300 sec: 5508.1). Total num frames: 652826624. Throughput: 0: 4985.1. Samples: 652819516. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:22,739][25689] Avg episode reward: [(0, '-13.484')] [2022-07-10 08:03:24,036][26022] Updated weights on worker 0-0, policy_version 637532 (0.00086) [2022-07-10 08:03:25,514][26022] Updated weights on worker 0-0, policy_version 637542 (0.00089) [2022-07-10 08:03:27,638][26022] Updated weights on worker 0-0, policy_version 637552 (0.00088) [2022-07-10 08:03:27,770][25689] Fps is (10 sec: 5604.0, 60 sec: 5512.2, 300 sec: 5511.2). Total num frames: 652854272. Throughput: 0: 5815.4. Samples: 652852984. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:27,770][25689] Avg episode reward: [(0, '-12.432')] [2022-07-10 08:03:29,376][26022] Updated weights on worker 0-0, policy_version 637562 (0.00089) [2022-07-10 08:03:31,385][26022] Updated weights on worker 0-0, policy_version 637572 (0.00089) [2022-07-10 08:03:32,841][25689] Fps is (10 sec: 5372.1, 60 sec: 5490.0, 300 sec: 5505.4). Total num frames: 652880896. Throughput: 0: 5764.1. Samples: 652885852. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:32,842][25689] Avg episode reward: [(0, '-11.333')] [2022-07-10 08:03:33,212][26022] Updated weights on worker 0-0, policy_version 637582 (0.00092) [2022-07-10 08:03:34,837][26022] Updated weights on worker 0-0, policy_version 637592 (0.00094) [2022-07-10 08:03:36,942][26022] Updated weights on worker 0-0, policy_version 637602 (0.00095) [2022-07-10 08:03:37,920][25689] Fps is (10 sec: 5548.7, 60 sec: 5529.0, 300 sec: 5504.3). Total num frames: 652910592. Throughput: 0: 5755.8. Samples: 652919076. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:37,922][25689] Avg episode reward: [(0, '-10.879')] [2022-07-10 08:03:38,765][26022] Updated weights on worker 0-0, policy_version 637612 (0.00093) [2022-07-10 08:03:40,603][26022] Updated weights on worker 0-0, policy_version 637622 (0.00091) [2022-07-10 08:03:42,636][26022] Updated weights on worker 0-0, policy_version 637632 (0.00086) [2022-07-10 08:03:42,941][25689] Fps is (10 sec: 5475.0, 60 sec: 5499.5, 300 sec: 5500.7). Total num frames: 652936192. Throughput: 0: 5748.6. Samples: 652935650. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:42,941][25689] Avg episode reward: [(0, '-9.800')] [2022-07-10 08:03:44,237][26022] Updated weights on worker 0-0, policy_version 637642 (0.00095) [2022-07-10 08:03:46,431][26022] Updated weights on worker 0-0, policy_version 637652 (0.00088) [2022-07-10 08:03:47,935][26022] Updated weights on worker 0-0, policy_version 637662 (0.00090) [2022-07-10 08:03:48,030][25689] Fps is (10 sec: 5570.9, 60 sec: 5509.5, 300 sec: 5506.2). Total num frames: 652966912. Throughput: 0: 5732.7. Samples: 652969126. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:48,030][25689] Avg episode reward: [(0, '-9.530')] [2022-07-10 08:03:49,987][26022] Updated weights on worker 0-0, policy_version 637672 (0.00089) [2022-07-10 08:03:51,577][26022] Updated weights on worker 0-0, policy_version 637682 (0.00087) [2022-07-10 08:03:53,037][25689] Fps is (10 sec: 5679.8, 60 sec: 5527.6, 300 sec: 5504.2). Total num frames: 652993536. Throughput: 0: 5775.4. Samples: 653002490. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:53,037][25689] Avg episode reward: [(0, '-9.355')] [2022-07-10 08:03:53,721][26022] Updated weights on worker 0-0, policy_version 637692 (0.00049) [2022-07-10 08:03:55,232][26022] Updated weights on worker 0-0, policy_version 637702 (0.00091) [2022-07-10 08:03:57,320][26022] Updated weights on worker 0-0, policy_version 637712 (0.00086) [2022-07-10 08:03:58,074][25689] Fps is (10 sec: 5504.9, 60 sec: 5514.2, 300 sec: 5504.2). Total num frames: 653022208. Throughput: 0: 4964.2. Samples: 653019126. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:03:58,075][25689] Avg episode reward: [(0, '-8.138')] [2022-07-10 08:03:58,995][26022] Updated weights on worker 0-0, policy_version 637722 (0.00098) [2022-07-10 08:04:01,013][26022] Updated weights on worker 0-0, policy_version 637732 (0.00084) [2022-07-10 08:04:03,093][25689] Fps is (10 sec: 5397.1, 60 sec: 5552.1, 300 sec: 5504.2). Total num frames: 653047808. Throughput: 0: 5780.4. Samples: 653052136. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:03,093][25689] Avg episode reward: [(0, '-7.977')] [2022-07-10 08:04:03,095][26022] Updated weights on worker 0-0, policy_version 637742 (0.00087) [2022-07-10 08:04:05,079][26022] Updated weights on worker 0-0, policy_version 637752 (0.00882) [2022-07-10 08:04:06,891][26022] Updated weights on worker 0-0, policy_version 637762 (0.00096) [2022-07-10 08:04:08,169][25689] Fps is (10 sec: 5173.3, 60 sec: 5495.9, 300 sec: 5503.3). Total num frames: 653074432. Throughput: 0: 5689.6. Samples: 653083714. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:08,170][25689] Avg episode reward: [(0, '-8.135')] [2022-07-10 08:04:08,584][26022] Updated weights on worker 0-0, policy_version 637772 (0.00091) [2022-07-10 08:04:10,602][26022] Updated weights on worker 0-0, policy_version 637782 (0.00088) [2022-07-10 08:04:12,307][26022] Updated weights on worker 0-0, policy_version 637792 (0.00083) [2022-07-10 08:04:13,253][25689] Fps is (10 sec: 5442.1, 60 sec: 5505.9, 300 sec: 5502.8). Total num frames: 653103104. Throughput: 0: 4844.5. Samples: 653100430. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:13,254][25689] Avg episode reward: [(0, '-7.926')] [2022-07-10 08:04:14,360][26022] Updated weights on worker 0-0, policy_version 637802 (0.00092) [2022-07-10 08:04:14,658][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:04:14,670][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000637805_653112320.pth [2022-07-10 08:04:14,671][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000635868_651128832.pth [2022-07-10 08:04:16,050][26022] Updated weights on worker 0-0, policy_version 637812 (0.00094) [2022-07-10 08:04:18,003][26022] Updated weights on worker 0-0, policy_version 637822 (0.00079) [2022-07-10 08:04:18,324][25689] Fps is (10 sec: 5748.2, 60 sec: 5525.2, 300 sec: 5508.5). Total num frames: 653132800. Throughput: 0: 5657.0. Samples: 653133676. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:18,324][25689] Avg episode reward: [(0, '-6.263')] [2022-07-10 08:04:19,960][26022] Updated weights on worker 0-0, policy_version 637832 (0.00087) [2022-07-10 08:04:21,524][26022] Updated weights on worker 0-0, policy_version 637842 (0.00104) [2022-07-10 08:04:23,331][25689] Fps is (10 sec: 5588.6, 60 sec: 5492.4, 300 sec: 5505.1). Total num frames: 653159424. Throughput: 0: 5694.4. Samples: 653167382. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:23,331][25689] Avg episode reward: [(0, '-6.907')] [2022-07-10 08:04:23,476][26022] Updated weights on worker 0-0, policy_version 637852 (0.00092) [2022-07-10 08:04:25,279][26022] Updated weights on worker 0-0, policy_version 637862 (0.00085) [2022-07-10 08:04:27,233][26022] Updated weights on worker 0-0, policy_version 637872 (0.00088) [2022-07-10 08:04:28,361][25689] Fps is (10 sec: 5509.2, 60 sec: 5509.5, 300 sec: 5509.0). Total num frames: 653188096. Throughput: 0: 4968.8. Samples: 653184040. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:28,362][25689] Avg episode reward: [(0, '-7.676')] [2022-07-10 08:04:28,991][26022] Updated weights on worker 0-0, policy_version 637882 (0.00097) [2022-07-10 08:04:30,716][26022] Updated weights on worker 0-0, policy_version 637892 (0.00095) [2022-07-10 08:04:32,773][26022] Updated weights on worker 0-0, policy_version 637902 (0.00093) [2022-07-10 08:04:33,388][25689] Fps is (10 sec: 5600.1, 60 sec: 5530.4, 300 sec: 5507.3). Total num frames: 653215744. Throughput: 0: 5795.1. Samples: 653217112. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:33,388][25689] Avg episode reward: [(0, '-7.415')] [2022-07-10 08:04:34,728][26022] Updated weights on worker 0-0, policy_version 637912 (0.00091) [2022-07-10 08:04:36,455][26022] Updated weights on worker 0-0, policy_version 637922 (0.00083) [2022-07-10 08:04:38,360][26022] Updated weights on worker 0-0, policy_version 637932 (0.00101) [2022-07-10 08:04:38,500][25689] Fps is (10 sec: 5352.4, 60 sec: 5476.6, 300 sec: 5505.6). Total num frames: 653242368. Throughput: 0: 5771.8. Samples: 653250132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:38,502][25689] Avg episode reward: [(0, '-6.887')] [2022-07-10 08:04:39,931][26022] Updated weights on worker 0-0, policy_version 637942 (0.00091) [2022-07-10 08:04:42,284][26022] Updated weights on worker 0-0, policy_version 637952 (0.00094) [2022-07-10 08:04:43,530][25689] Fps is (10 sec: 5553.1, 60 sec: 5543.4, 300 sec: 5506.3). Total num frames: 653272064. Throughput: 0: 4920.0. Samples: 653266762. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:43,531][25689] Avg episode reward: [(0, '-6.807')] [2022-07-10 08:04:43,688][26022] Updated weights on worker 0-0, policy_version 637962 (0.00098) [2022-07-10 08:04:45,816][26022] Updated weights on worker 0-0, policy_version 637972 (0.00095) [2022-07-10 08:04:47,489][26022] Updated weights on worker 0-0, policy_version 637982 (0.01135) [2022-07-10 08:04:48,563][25689] Fps is (10 sec: 5495.3, 60 sec: 5464.0, 300 sec: 5503.0). Total num frames: 653297664. Throughput: 0: 5734.7. Samples: 653299892. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:48,563][25689] Avg episode reward: [(0, '-6.512')] [2022-07-10 08:04:49,469][26022] Updated weights on worker 0-0, policy_version 637992 (0.00089) [2022-07-10 08:04:51,435][26022] Updated weights on worker 0-0, policy_version 638002 (0.00081) [2022-07-10 08:04:52,958][26022] Updated weights on worker 0-0, policy_version 638012 (0.00091) [2022-07-10 08:04:53,590][25689] Fps is (10 sec: 5394.9, 60 sec: 5496.0, 300 sec: 5504.7). Total num frames: 653326336. Throughput: 0: 5752.3. Samples: 653333320. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:53,591][25689] Avg episode reward: [(0, '-5.391')] [2022-07-10 08:04:55,023][26022] Updated weights on worker 0-0, policy_version 638022 (0.00081) [2022-07-10 08:04:56,884][26022] Updated weights on worker 0-0, policy_version 638032 (0.00091) [2022-07-10 08:04:58,500][26022] Updated weights on worker 0-0, policy_version 638042 (0.00086) [2022-07-10 08:04:58,666][25689] Fps is (10 sec: 5777.3, 60 sec: 5509.4, 300 sec: 5508.2). Total num frames: 653356032. Throughput: 0: 4959.8. Samples: 653350148. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:04:58,666][25689] Avg episode reward: [(0, '-5.348')] [2022-07-10 08:05:00,570][26022] Updated weights on worker 0-0, policy_version 638052 (0.00096) [2022-07-10 08:05:02,679][26022] Updated weights on worker 0-0, policy_version 638062 (0.00088) [2022-07-10 08:05:03,736][25689] Fps is (10 sec: 5349.2, 60 sec: 5487.8, 300 sec: 5503.9). Total num frames: 653380608. Throughput: 0: 5723.4. Samples: 653382408. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:05:03,736][25689] Avg episode reward: [(0, '-6.375')] [2022-07-10 08:05:04,642][26022] Updated weights on worker 0-0, policy_version 638072 (0.00087) [2022-07-10 08:05:06,336][26022] Updated weights on worker 0-0, policy_version 638082 (0.00095) [2022-07-10 08:05:08,382][26022] Updated weights on worker 0-0, policy_version 638092 (0.00090) [2022-07-10 08:05:08,814][25689] Fps is (10 sec: 5146.3, 60 sec: 5504.6, 300 sec: 5502.7). Total num frames: 653408256. Throughput: 0: 5652.4. Samples: 653414358. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 08:05:08,814][25689] Avg episode reward: [(0, '-6.138')] [2022-07-10 08:05:10,047][26022] Updated weights on worker 0-0, policy_version 638102 (0.00580) [2022-07-10 08:05:12,069][26022] Updated weights on worker 0-0, policy_version 638112 (0.00088) [2022-07-10 08:05:13,607][26022] Updated weights on worker 0-0, policy_version 638122 (0.00089) [2022-07-10 08:05:13,881][25689] Fps is (10 sec: 5652.4, 60 sec: 5523.0, 300 sec: 5509.8). Total num frames: 653437952. Throughput: 0: 4820.6. Samples: 653431134. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:13,881][25689] Avg episode reward: [(0, '-6.958')] [2022-07-10 08:05:15,897][26022] Updated weights on worker 0-0, policy_version 638132 (0.00092) [2022-07-10 08:05:17,411][26022] Updated weights on worker 0-0, policy_version 638142 (0.00094) [2022-07-10 08:05:18,933][25689] Fps is (10 sec: 5464.3, 60 sec: 5457.1, 300 sec: 5500.3). Total num frames: 653463552. Throughput: 0: 5635.1. Samples: 653464358. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:18,934][25689] Avg episode reward: [(0, '-6.477')] [2022-07-10 08:05:19,455][26022] Updated weights on worker 0-0, policy_version 638152 (0.00088) [2022-07-10 08:05:21,197][26022] Updated weights on worker 0-0, policy_version 638162 (0.00090) [2022-07-10 08:05:23,146][26022] Updated weights on worker 0-0, policy_version 638172 (0.00094) [2022-07-10 08:05:23,941][25689] Fps is (10 sec: 5394.6, 60 sec: 5490.8, 300 sec: 5501.1). Total num frames: 653492224. Throughput: 0: 5677.7. Samples: 653497128. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:23,942][25689] Avg episode reward: [(0, '-7.086')] [2022-07-10 08:05:25,088][26022] Updated weights on worker 0-0, policy_version 638182 (0.00090) [2022-07-10 08:05:26,979][26022] Updated weights on worker 0-0, policy_version 638192 (0.00086) [2022-07-10 08:05:28,704][26022] Updated weights on worker 0-0, policy_version 638202 (0.00089) [2022-07-10 08:05:28,974][25689] Fps is (10 sec: 5609.4, 60 sec: 5473.7, 300 sec: 5500.8). Total num frames: 653519872. Throughput: 0: 4921.6. Samples: 653513578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:28,974][25689] Avg episode reward: [(0, '-6.601')] [2022-07-10 08:05:30,695][26022] Updated weights on worker 0-0, policy_version 638212 (0.00090) [2022-07-10 08:05:32,391][26022] Updated weights on worker 0-0, policy_version 638222 (0.00058) [2022-07-10 08:05:33,978][25689] Fps is (10 sec: 5305.2, 60 sec: 5441.9, 300 sec: 5495.8). Total num frames: 653545472. Throughput: 0: 5753.3. Samples: 653546760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:33,979][25689] Avg episode reward: [(0, '-7.219')] [2022-07-10 08:05:34,303][26022] Updated weights on worker 0-0, policy_version 638232 (0.00097) [2022-07-10 08:05:36,042][26022] Updated weights on worker 0-0, policy_version 638242 (0.00093) [2022-07-10 08:05:38,083][26022] Updated weights on worker 0-0, policy_version 638252 (0.00093) [2022-07-10 08:05:39,055][25689] Fps is (10 sec: 5484.7, 60 sec: 5495.8, 300 sec: 5498.0). Total num frames: 653575168. Throughput: 0: 5735.9. Samples: 653579776. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:39,056][25689] Avg episode reward: [(0, '-7.965')] [2022-07-10 08:05:39,925][26022] Updated weights on worker 0-0, policy_version 638262 (0.00095) [2022-07-10 08:05:41,738][26022] Updated weights on worker 0-0, policy_version 638272 (0.00086) [2022-07-10 08:05:43,739][26022] Updated weights on worker 0-0, policy_version 638282 (0.00088) [2022-07-10 08:05:44,071][25689] Fps is (10 sec: 5681.9, 60 sec: 5463.3, 300 sec: 5498.2). Total num frames: 653602816. Throughput: 0: 4932.0. Samples: 653596406. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:44,071][25689] Avg episode reward: [(0, '-6.686')] [2022-07-10 08:05:45,479][26022] Updated weights on worker 0-0, policy_version 638292 (0.00101) [2022-07-10 08:05:47,459][26022] Updated weights on worker 0-0, policy_version 638302 (0.00082) [2022-07-10 08:05:49,076][25689] Fps is (10 sec: 5416.1, 60 sec: 5482.7, 300 sec: 5495.1). Total num frames: 653629440. Throughput: 0: 5763.1. Samples: 653629430. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:49,076][25689] Avg episode reward: [(0, '-6.154')] [2022-07-10 08:05:49,257][26022] Updated weights on worker 0-0, policy_version 638312 (0.00092) [2022-07-10 08:05:51,131][26022] Updated weights on worker 0-0, policy_version 638322 (0.00096) [2022-07-10 08:05:52,895][26022] Updated weights on worker 0-0, policy_version 638332 (0.00086) [2022-07-10 08:05:54,084][25689] Fps is (10 sec: 5419.7, 60 sec: 5467.5, 300 sec: 5492.5). Total num frames: 653657088. Throughput: 0: 5749.7. Samples: 653662364. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:54,085][25689] Avg episode reward: [(0, '-5.586')] [2022-07-10 08:05:54,880][26022] Updated weights on worker 0-0, policy_version 638342 (0.00093) [2022-07-10 08:05:56,792][26022] Updated weights on worker 0-0, policy_version 638352 (0.00090) [2022-07-10 08:05:58,662][26022] Updated weights on worker 0-0, policy_version 638362 (0.00084) [2022-07-10 08:05:59,169][25689] Fps is (10 sec: 5478.4, 60 sec: 5432.8, 300 sec: 5494.8). Total num frames: 653684736. Throughput: 0: 4920.0. Samples: 653678738. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:05:59,170][25689] Avg episode reward: [(0, '-5.854')] [2022-07-10 08:06:00,453][26022] Updated weights on worker 0-0, policy_version 638372 (0.00087) [2022-07-10 08:06:02,986][26022] Updated weights on worker 0-0, policy_version 638382 (0.00090) [2022-07-10 08:06:04,178][25689] Fps is (10 sec: 5377.1, 60 sec: 5472.2, 300 sec: 5495.8). Total num frames: 653711360. Throughput: 0: 5644.5. Samples: 653709900. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:04,178][25689] Avg episode reward: [(0, '-6.093')] [2022-07-10 08:06:04,579][26022] Updated weights on worker 0-0, policy_version 638392 (0.00085) [2022-07-10 08:06:06,451][26022] Updated weights on worker 0-0, policy_version 638402 (0.00075) [2022-07-10 08:06:08,224][26022] Updated weights on worker 0-0, policy_version 638412 (0.00088) [2022-07-10 08:06:09,195][25689] Fps is (10 sec: 5413.6, 60 sec: 5477.8, 300 sec: 5499.1). Total num frames: 653739008. Throughput: 0: 5642.5. Samples: 653742950. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:09,195][25689] Avg episode reward: [(0, '-5.213')] [2022-07-10 08:06:10,117][26022] Updated weights on worker 0-0, policy_version 638422 (0.00093) [2022-07-10 08:06:11,778][26022] Updated weights on worker 0-0, policy_version 638432 (0.00088) [2022-07-10 08:06:13,875][26022] Updated weights on worker 0-0, policy_version 638442 (0.00090) [2022-07-10 08:06:14,200][25689] Fps is (10 sec: 5517.2, 60 sec: 5449.4, 300 sec: 5497.8). Total num frames: 653766656. Throughput: 0: 4828.3. Samples: 653759488. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:14,201][25689] Avg episode reward: [(0, '-5.530')] [2022-07-10 08:06:14,778][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:06:14,790][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000638448_653770752.pth [2022-07-10 08:06:14,791][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000636513_651789312.pth [2022-07-10 08:06:15,547][26022] Updated weights on worker 0-0, policy_version 638452 (0.00097) [2022-07-10 08:06:17,394][26022] Updated weights on worker 0-0, policy_version 638462 (0.00084) [2022-07-10 08:06:19,249][25689] Fps is (10 sec: 5499.9, 60 sec: 5483.7, 300 sec: 5493.7). Total num frames: 653794304. Throughput: 0: 5688.5. Samples: 653792958. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:19,249][25689] Avg episode reward: [(0, '-6.295')] [2022-07-10 08:06:19,317][26022] Updated weights on worker 0-0, policy_version 638472 (0.00084) [2022-07-10 08:06:20,967][26022] Updated weights on worker 0-0, policy_version 638482 (0.00085) [2022-07-10 08:06:23,171][26022] Updated weights on worker 0-0, policy_version 638492 (0.00104) [2022-07-10 08:06:24,305][25689] Fps is (10 sec: 5573.4, 60 sec: 5479.3, 300 sec: 5496.5). Total num frames: 653822976. Throughput: 0: 5788.6. Samples: 653826410. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:24,306][25689] Avg episode reward: [(0, '-6.395')] [2022-07-10 08:06:24,746][26022] Updated weights on worker 0-0, policy_version 638502 (0.00105) [2022-07-10 08:06:26,850][26022] Updated weights on worker 0-0, policy_version 638512 (0.00089) [2022-07-10 08:06:28,528][26022] Updated weights on worker 0-0, policy_version 638522 (0.00091) [2022-07-10 08:06:29,311][25689] Fps is (10 sec: 5597.3, 60 sec: 5481.7, 300 sec: 5500.2). Total num frames: 653850624. Throughput: 0: 5816.3. Samples: 653859952. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:29,311][25689] Avg episode reward: [(0, '-5.649')] [2022-07-10 08:06:30,243][26022] Updated weights on worker 0-0, policy_version 638532 (0.00083) [2022-07-10 08:06:32,219][26022] Updated weights on worker 0-0, policy_version 638542 (0.00084) [2022-07-10 08:06:33,919][26022] Updated weights on worker 0-0, policy_version 638552 (0.00084) [2022-07-10 08:06:34,322][25689] Fps is (10 sec: 5520.5, 60 sec: 5515.1, 300 sec: 5501.5). Total num frames: 653878272. Throughput: 0: 5816.5. Samples: 653876524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:34,322][25689] Avg episode reward: [(0, '-5.890')] [2022-07-10 08:06:35,999][26022] Updated weights on worker 0-0, policy_version 638562 (0.00088) [2022-07-10 08:06:37,886][26022] Updated weights on worker 0-0, policy_version 638572 (0.00095) [2022-07-10 08:06:39,380][25689] Fps is (10 sec: 5491.5, 60 sec: 5482.9, 300 sec: 5493.6). Total num frames: 653905920. Throughput: 0: 5784.9. Samples: 653909416. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:39,381][25689] Avg episode reward: [(0, '-5.660')] [2022-07-10 08:06:39,426][26022] Updated weights on worker 0-0, policy_version 638582 (0.00085) [2022-07-10 08:06:41,526][26022] Updated weights on worker 0-0, policy_version 638592 (0.00088) [2022-07-10 08:06:43,211][26022] Updated weights on worker 0-0, policy_version 638602 (0.00092) [2022-07-10 08:06:44,388][25689] Fps is (10 sec: 5493.3, 60 sec: 5483.6, 300 sec: 5500.8). Total num frames: 653933568. Throughput: 0: 5781.7. Samples: 653942520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:44,388][25689] Avg episode reward: [(0, '-6.413')] [2022-07-10 08:06:45,281][26022] Updated weights on worker 0-0, policy_version 638612 (0.00085) [2022-07-10 08:06:47,300][26022] Updated weights on worker 0-0, policy_version 638622 (0.00099) [2022-07-10 08:06:48,922][26022] Updated weights on worker 0-0, policy_version 638632 (0.00084) [2022-07-10 08:06:49,399][25689] Fps is (10 sec: 5519.3, 60 sec: 5500.0, 300 sec: 5495.4). Total num frames: 653961216. Throughput: 0: 4933.0. Samples: 653959046. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:49,399][25689] Avg episode reward: [(0, '-5.361')] [2022-07-10 08:06:50,787][26022] Updated weights on worker 0-0, policy_version 638642 (0.00397) [2022-07-10 08:06:52,743][26022] Updated weights on worker 0-0, policy_version 638652 (0.00097) [2022-07-10 08:06:54,409][25689] Fps is (10 sec: 5518.1, 60 sec: 5499.9, 300 sec: 5494.1). Total num frames: 653988864. Throughput: 0: 5769.9. Samples: 653992424. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:54,409][25689] Avg episode reward: [(0, '-5.251')] [2022-07-10 08:06:54,442][26022] Updated weights on worker 0-0, policy_version 638662 (0.00089) [2022-07-10 08:06:56,521][26022] Updated weights on worker 0-0, policy_version 638672 (0.00087) [2022-07-10 08:06:58,160][26022] Updated weights on worker 0-0, policy_version 638682 (0.00091) [2022-07-10 08:06:59,495][25689] Fps is (10 sec: 5375.6, 60 sec: 5482.8, 300 sec: 5499.9). Total num frames: 654015488. Throughput: 0: 5791.3. Samples: 654025906. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:06:59,495][25689] Avg episode reward: [(0, '-5.458')] [2022-07-10 08:07:00,011][26022] Updated weights on worker 0-0, policy_version 638692 (0.00084) [2022-07-10 08:07:02,275][26022] Updated weights on worker 0-0, policy_version 638702 (0.00091) [2022-07-10 08:07:04,087][26022] Updated weights on worker 0-0, policy_version 638712 (0.00099) [2022-07-10 08:07:04,525][25689] Fps is (10 sec: 5364.7, 60 sec: 5497.8, 300 sec: 5496.3). Total num frames: 654043136. Throughput: 0: 4860.7. Samples: 654040402. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:04,526][25689] Avg episode reward: [(0, '-4.408')] [2022-07-10 08:07:06,137][26022] Updated weights on worker 0-0, policy_version 638722 (0.00090) [2022-07-10 08:07:07,791][26022] Updated weights on worker 0-0, policy_version 638732 (0.00085) [2022-07-10 08:07:09,555][25689] Fps is (10 sec: 5394.8, 60 sec: 5479.6, 300 sec: 5493.1). Total num frames: 654069760. Throughput: 0: 5681.1. Samples: 654073554. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:09,555][25689] Avg episode reward: [(0, '-5.019')] [2022-07-10 08:07:09,666][26022] Updated weights on worker 0-0, policy_version 638742 (0.00081) [2022-07-10 08:07:11,511][26022] Updated weights on worker 0-0, policy_version 638752 (0.00090) [2022-07-10 08:07:13,177][26022] Updated weights on worker 0-0, policy_version 638762 (0.00084) [2022-07-10 08:07:14,577][25689] Fps is (10 sec: 5500.8, 60 sec: 5495.0, 300 sec: 5494.9). Total num frames: 654098432. Throughput: 0: 5685.0. Samples: 654107084. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:14,579][25689] Avg episode reward: [(0, '-4.268')] [2022-07-10 08:07:15,392][26022] Updated weights on worker 0-0, policy_version 638772 (0.00094) [2022-07-10 08:07:17,033][26022] Updated weights on worker 0-0, policy_version 638782 (0.00095) [2022-07-10 08:07:18,856][26022] Updated weights on worker 0-0, policy_version 638792 (0.00095) [2022-07-10 08:07:19,708][25689] Fps is (10 sec: 5647.8, 60 sec: 5504.5, 300 sec: 5496.5). Total num frames: 654127104. Throughput: 0: 4843.1. Samples: 654123802. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:19,710][25689] Avg episode reward: [(0, '-5.990')] [2022-07-10 08:07:20,888][26022] Updated weights on worker 0-0, policy_version 638802 (0.00078) [2022-07-10 08:07:22,410][26022] Updated weights on worker 0-0, policy_version 638812 (0.00096) [2022-07-10 08:07:24,468][26022] Updated weights on worker 0-0, policy_version 638822 (0.00091) [2022-07-10 08:07:24,783][25689] Fps is (10 sec: 5618.7, 60 sec: 5502.8, 300 sec: 5495.5). Total num frames: 654155776. Throughput: 0: 5756.7. Samples: 654157022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:24,784][25689] Avg episode reward: [(0, '-7.337')] [2022-07-10 08:07:26,465][26022] Updated weights on worker 0-0, policy_version 638832 (0.00089) [2022-07-10 08:07:28,122][26022] Updated weights on worker 0-0, policy_version 638842 (0.00089) [2022-07-10 08:07:29,833][25689] Fps is (10 sec: 5461.2, 60 sec: 5481.9, 300 sec: 5491.4). Total num frames: 654182400. Throughput: 0: 5751.0. Samples: 654190176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:29,835][25689] Avg episode reward: [(0, '-8.650')] [2022-07-10 08:07:30,101][26022] Updated weights on worker 0-0, policy_version 638852 (0.00086) [2022-07-10 08:07:31,638][26022] Updated weights on worker 0-0, policy_version 638862 (0.00092) [2022-07-10 08:07:33,674][26022] Updated weights on worker 0-0, policy_version 638872 (0.00090) [2022-07-10 08:07:34,883][25689] Fps is (10 sec: 5475.2, 60 sec: 5495.3, 300 sec: 5496.5). Total num frames: 654211072. Throughput: 0: 4910.7. Samples: 654206796. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:34,884][25689] Avg episode reward: [(0, '-9.329')] [2022-07-10 08:07:35,691][26022] Updated weights on worker 0-0, policy_version 638882 (0.00094) [2022-07-10 08:07:37,347][26022] Updated weights on worker 0-0, policy_version 638892 (0.00114) [2022-07-10 08:07:39,479][26022] Updated weights on worker 0-0, policy_version 638902 (0.00085) [2022-07-10 08:07:39,933][25689] Fps is (10 sec: 5677.9, 60 sec: 5512.9, 300 sec: 5500.2). Total num frames: 654239744. Throughput: 0: 5739.7. Samples: 654239886. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:39,933][25689] Avg episode reward: [(0, '-8.392')] [2022-07-10 08:07:41,164][26022] Updated weights on worker 0-0, policy_version 638912 (0.00093) [2022-07-10 08:07:42,975][26022] Updated weights on worker 0-0, policy_version 638922 (0.00086) [2022-07-10 08:07:44,993][25689] Fps is (10 sec: 5469.5, 60 sec: 5491.3, 300 sec: 5489.0). Total num frames: 654266368. Throughput: 0: 5748.5. Samples: 654273194. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:44,993][25689] Avg episode reward: [(0, '-7.495')] [2022-07-10 08:07:44,994][26022] Updated weights on worker 0-0, policy_version 638932 (0.00092) [2022-07-10 08:07:46,791][26022] Updated weights on worker 0-0, policy_version 638942 (0.00084) [2022-07-10 08:07:48,349][26022] Updated weights on worker 0-0, policy_version 638952 (0.00084) [2022-07-10 08:07:50,012][25689] Fps is (10 sec: 5282.8, 60 sec: 5473.6, 300 sec: 5492.5). Total num frames: 654292992. Throughput: 0: 4935.2. Samples: 654289764. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:50,013][25689] Avg episode reward: [(0, '-5.539')] [2022-07-10 08:07:50,311][26022] Updated weights on worker 0-0, policy_version 638962 (0.00091) [2022-07-10 08:07:52,072][26022] Updated weights on worker 0-0, policy_version 638972 (0.00088) [2022-07-10 08:07:54,151][26022] Updated weights on worker 0-0, policy_version 638982 (0.00094) [2022-07-10 08:07:55,038][25689] Fps is (10 sec: 5708.4, 60 sec: 5522.9, 300 sec: 5496.9). Total num frames: 654323712. Throughput: 0: 5772.6. Samples: 654323146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:07:55,039][25689] Avg episode reward: [(0, '-5.382')] [2022-07-10 08:07:55,856][26022] Updated weights on worker 0-0, policy_version 638992 (0.00098) [2022-07-10 08:07:57,692][26022] Updated weights on worker 0-0, policy_version 639002 (0.00085) [2022-07-10 08:07:59,795][26022] Updated weights on worker 0-0, policy_version 639012 (0.00088) [2022-07-10 08:08:00,145][25689] Fps is (10 sec: 5558.1, 60 sec: 5504.0, 300 sec: 5502.9). Total num frames: 654349312. Throughput: 0: 5763.5. Samples: 654356382. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:08:00,146][25689] Avg episode reward: [(0, '-2.854')] [2022-07-10 08:08:01,736][26022] Updated weights on worker 0-0, policy_version 639022 (0.00124) [2022-07-10 08:08:03,845][26022] Updated weights on worker 0-0, policy_version 639032 (0.00093) [2022-07-10 08:08:05,179][25689] Fps is (10 sec: 5250.6, 60 sec: 5503.7, 300 sec: 5495.7). Total num frames: 654376960. Throughput: 0: 4840.3. Samples: 654370904. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:08:05,180][25689] Avg episode reward: [(0, '-3.083')] [2022-07-10 08:08:05,464][26022] Updated weights on worker 0-0, policy_version 639042 (0.00089) [2022-07-10 08:08:07,484][26022] Updated weights on worker 0-0, policy_version 639052 (0.00087) [2022-07-10 08:08:09,034][26022] Updated weights on worker 0-0, policy_version 639062 (0.00086) [2022-07-10 08:08:10,186][25689] Fps is (10 sec: 5507.3, 60 sec: 5522.7, 300 sec: 5495.7). Total num frames: 654404608. Throughput: 0: 5669.8. Samples: 654404144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:08:10,186][25689] Avg episode reward: [(0, '-3.306')] [2022-07-10 08:08:11,188][26022] Updated weights on worker 0-0, policy_version 639072 (0.00086) [2022-07-10 08:08:12,874][26022] Updated weights on worker 0-0, policy_version 639082 (0.00091) [2022-07-10 08:08:14,631][26022] Updated weights on worker 0-0, policy_version 639092 (0.00095) [2022-07-10 08:08:14,987][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:08:15,004][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000639093_654431232.pth [2022-07-10 08:08:15,004][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000637159_652450816.pth [2022-07-10 08:08:15,187][25689] Fps is (10 sec: 5525.1, 60 sec: 5507.7, 300 sec: 5494.1). Total num frames: 654432256. Throughput: 0: 5680.2. Samples: 654437598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:08:15,188][25689] Avg episode reward: [(0, '-3.312')] [2022-07-10 08:08:16,633][26022] Updated weights on worker 0-0, policy_version 639102 (0.00087) [2022-07-10 08:08:18,259][26022] Updated weights on worker 0-0, policy_version 639112 (0.00081) [2022-07-10 08:08:20,104][26022] Updated weights on worker 0-0, policy_version 639122 (0.00085) [2022-07-10 08:08:20,246][25689] Fps is (10 sec: 5598.5, 60 sec: 5514.3, 300 sec: 5493.3). Total num frames: 654460928. Throughput: 0: 4873.8. Samples: 654454346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:08:20,246][25689] Avg episode reward: [(0, '-4.355')] [2022-07-10 08:08:22,259][26022] Updated weights on worker 0-0, policy_version 639132 (0.00086) [2022-07-10 08:08:23,707][26022] Updated weights on worker 0-0, policy_version 639142 (0.00096) [2022-07-10 08:08:25,313][25689] Fps is (10 sec: 5460.8, 60 sec: 5481.2, 300 sec: 5489.2). Total num frames: 654487552. Throughput: 0: 5817.1. Samples: 654488026. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:25,314][25689] Avg episode reward: [(0, '-4.635')] [2022-07-10 08:08:25,964][26022] Updated weights on worker 0-0, policy_version 639152 (0.00096) [2022-07-10 08:08:27,659][26022] Updated weights on worker 0-0, policy_version 639162 (0.00090) [2022-07-10 08:08:29,531][26022] Updated weights on worker 0-0, policy_version 639172 (0.00087) [2022-07-10 08:08:30,334][25689] Fps is (10 sec: 5480.9, 60 sec: 5517.7, 300 sec: 5497.0). Total num frames: 654516224. Throughput: 0: 5808.2. Samples: 654521170. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:30,339][25689] Avg episode reward: [(0, '-5.422')] [2022-07-10 08:08:31,564][26022] Updated weights on worker 0-0, policy_version 639182 (0.00082) [2022-07-10 08:08:33,117][26022] Updated weights on worker 0-0, policy_version 639192 (0.00086) [2022-07-10 08:08:35,289][26022] Updated weights on worker 0-0, policy_version 639202 (0.00085) [2022-07-10 08:08:35,388][25689] Fps is (10 sec: 5488.6, 60 sec: 5483.4, 300 sec: 5487.2). Total num frames: 654542848. Throughput: 0: 4953.7. Samples: 654537666. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:35,389][25689] Avg episode reward: [(0, '-5.987')] [2022-07-10 08:08:36,930][26022] Updated weights on worker 0-0, policy_version 639212 (0.00099) [2022-07-10 08:08:38,783][26022] Updated weights on worker 0-0, policy_version 639222 (0.00089) [2022-07-10 08:08:40,462][25689] Fps is (10 sec: 5459.9, 60 sec: 5481.2, 300 sec: 5496.5). Total num frames: 654571520. Throughput: 0: 5761.3. Samples: 654570820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:40,462][25689] Avg episode reward: [(0, '-6.061')] [2022-07-10 08:08:40,831][26022] Updated weights on worker 0-0, policy_version 639232 (0.00093) [2022-07-10 08:08:42,469][26022] Updated weights on worker 0-0, policy_version 639242 (0.00089) [2022-07-10 08:08:44,375][26022] Updated weights on worker 0-0, policy_version 639252 (0.00091) [2022-07-10 08:08:45,464][25689] Fps is (10 sec: 5691.3, 60 sec: 5520.4, 300 sec: 5491.2). Total num frames: 654600192. Throughput: 0: 5774.9. Samples: 654604394. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:45,465][25689] Avg episode reward: [(0, '-7.080')] [2022-07-10 08:08:46,155][26022] Updated weights on worker 0-0, policy_version 639262 (0.00094) [2022-07-10 08:08:47,834][26022] Updated weights on worker 0-0, policy_version 639272 (0.00093) [2022-07-10 08:08:49,932][26022] Updated weights on worker 0-0, policy_version 639282 (0.00094) [2022-07-10 08:08:50,475][25689] Fps is (10 sec: 5522.3, 60 sec: 5521.1, 300 sec: 5491.2). Total num frames: 654626816. Throughput: 0: 4961.9. Samples: 654621106. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:50,476][25689] Avg episode reward: [(0, '-6.246')] [2022-07-10 08:08:51,583][26022] Updated weights on worker 0-0, policy_version 639292 (0.00088) [2022-07-10 08:08:53,611][26022] Updated weights on worker 0-0, policy_version 639302 (0.00086) [2022-07-10 08:08:55,268][26022] Updated weights on worker 0-0, policy_version 639312 (0.00088) [2022-07-10 08:08:55,499][25689] Fps is (10 sec: 5510.5, 60 sec: 5487.5, 300 sec: 5491.4). Total num frames: 654655488. Throughput: 0: 5798.0. Samples: 654654266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:08:55,499][25689] Avg episode reward: [(0, '-6.105')] [2022-07-10 08:08:56,998][26022] Updated weights on worker 0-0, policy_version 639322 (0.00087) [2022-07-10 08:08:59,270][26022] Updated weights on worker 0-0, policy_version 639332 (0.00087) [2022-07-10 08:09:00,616][25689] Fps is (10 sec: 5654.6, 60 sec: 5537.3, 300 sec: 5499.9). Total num frames: 654684160. Throughput: 0: 5808.7. Samples: 654687890. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:00,617][25689] Avg episode reward: [(0, '-4.401')] [2022-07-10 08:09:00,750][26022] Updated weights on worker 0-0, policy_version 639342 (0.00115) [2022-07-10 08:09:03,091][26022] Updated weights on worker 0-0, policy_version 639352 (0.00099) [2022-07-10 08:09:04,795][26022] Updated weights on worker 0-0, policy_version 639362 (0.00052) [2022-07-10 08:09:05,645][25689] Fps is (10 sec: 5248.1, 60 sec: 5487.0, 300 sec: 5493.9). Total num frames: 654708736. Throughput: 0: 5680.0. Samples: 654719022. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:05,645][25689] Avg episode reward: [(0, '-5.045')] [2022-07-10 08:09:07,000][26022] Updated weights on worker 0-0, policy_version 639372 (0.00091) [2022-07-10 08:09:08,746][26022] Updated weights on worker 0-0, policy_version 639382 (0.00090) [2022-07-10 08:09:10,494][26022] Updated weights on worker 0-0, policy_version 639392 (0.00617) [2022-07-10 08:09:10,687][25689] Fps is (10 sec: 5389.0, 60 sec: 5517.6, 300 sec: 5498.1). Total num frames: 654738432. Throughput: 0: 5667.1. Samples: 654735650. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:10,688][25689] Avg episode reward: [(0, '-5.543')] [2022-07-10 08:09:12,385][26022] Updated weights on worker 0-0, policy_version 639402 (0.00085) [2022-07-10 08:09:14,386][26022] Updated weights on worker 0-0, policy_version 639412 (0.00089) [2022-07-10 08:09:15,717][25689] Fps is (10 sec: 5693.5, 60 sec: 5515.1, 300 sec: 5492.0). Total num frames: 654766080. Throughput: 0: 5669.3. Samples: 654768888. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:15,717][25689] Avg episode reward: [(0, '-7.378')] [2022-07-10 08:09:16,025][26022] Updated weights on worker 0-0, policy_version 639422 (0.00095) [2022-07-10 08:09:17,951][26022] Updated weights on worker 0-0, policy_version 639432 (0.00090) [2022-07-10 08:09:19,735][26022] Updated weights on worker 0-0, policy_version 639442 (0.00083) [2022-07-10 08:09:20,806][25689] Fps is (10 sec: 5464.8, 60 sec: 5495.3, 300 sec: 5493.9). Total num frames: 654793728. Throughput: 0: 5651.9. Samples: 654802000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:20,806][25689] Avg episode reward: [(0, '-7.560')] [2022-07-10 08:09:21,655][26022] Updated weights on worker 0-0, policy_version 639452 (0.00102) [2022-07-10 08:09:23,553][26022] Updated weights on worker 0-0, policy_version 639462 (0.00092) [2022-07-10 08:09:25,358][26022] Updated weights on worker 0-0, policy_version 639472 (0.00079) [2022-07-10 08:09:25,833][25689] Fps is (10 sec: 5466.3, 60 sec: 5516.0, 300 sec: 5490.5). Total num frames: 654821376. Throughput: 0: 4935.2. Samples: 654818652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:25,833][25689] Avg episode reward: [(0, '-8.151')] [2022-07-10 08:09:27,102][26022] Updated weights on worker 0-0, policy_version 639482 (0.00091) [2022-07-10 08:09:29,284][26022] Updated weights on worker 0-0, policy_version 639492 (0.00093) [2022-07-10 08:09:30,742][26022] Updated weights on worker 0-0, policy_version 639502 (0.00088) [2022-07-10 08:09:30,838][25689] Fps is (10 sec: 5613.8, 60 sec: 5517.4, 300 sec: 5494.4). Total num frames: 654850048. Throughput: 0: 5776.9. Samples: 654852060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:30,839][25689] Avg episode reward: [(0, '-7.506')] [2022-07-10 08:09:32,836][26022] Updated weights on worker 0-0, policy_version 639512 (0.00089) [2022-07-10 08:09:34,533][26022] Updated weights on worker 0-0, policy_version 639522 (0.00085) [2022-07-10 08:09:35,930][25689] Fps is (10 sec: 5577.7, 60 sec: 5530.9, 300 sec: 5498.2). Total num frames: 654877696. Throughput: 0: 5765.8. Samples: 654885434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:35,930][25689] Avg episode reward: [(0, '-7.371')] [2022-07-10 08:09:36,510][26022] Updated weights on worker 0-0, policy_version 639532 (0.00087) [2022-07-10 08:09:38,487][26022] Updated weights on worker 0-0, policy_version 639542 (0.00102) [2022-07-10 08:09:40,109][26022] Updated weights on worker 0-0, policy_version 639552 (0.00094) [2022-07-10 08:09:40,987][25689] Fps is (10 sec: 5347.7, 60 sec: 5498.6, 300 sec: 5487.4). Total num frames: 654904320. Throughput: 0: 4940.4. Samples: 654901702. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:40,988][25689] Avg episode reward: [(0, '-5.478')] [2022-07-10 08:09:42,081][26022] Updated weights on worker 0-0, policy_version 639562 (0.00090) [2022-07-10 08:09:44,007][26022] Updated weights on worker 0-0, policy_version 639572 (0.00090) [2022-07-10 08:09:45,835][26022] Updated weights on worker 0-0, policy_version 639582 (0.00085) [2022-07-10 08:09:45,996][25689] Fps is (10 sec: 5493.3, 60 sec: 5497.9, 300 sec: 5498.1). Total num frames: 654932992. Throughput: 0: 5759.4. Samples: 654934780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:45,996][25689] Avg episode reward: [(0, '-5.832')] [2022-07-10 08:09:47,812][26022] Updated weights on worker 0-0, policy_version 639592 (0.00089) [2022-07-10 08:09:49,485][26022] Updated weights on worker 0-0, policy_version 639602 (0.00094) [2022-07-10 08:09:51,014][25689] Fps is (10 sec: 5514.6, 60 sec: 5497.3, 300 sec: 5491.4). Total num frames: 654959616. Throughput: 0: 5750.5. Samples: 654968082. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:51,015][25689] Avg episode reward: [(0, '-6.515')] [2022-07-10 08:09:51,431][26022] Updated weights on worker 0-0, policy_version 639612 (0.00085) [2022-07-10 08:09:53,061][26022] Updated weights on worker 0-0, policy_version 639622 (0.00084) [2022-07-10 08:09:55,187][26022] Updated weights on worker 0-0, policy_version 639632 (0.00093) [2022-07-10 08:09:56,021][25689] Fps is (10 sec: 5515.7, 60 sec: 5498.8, 300 sec: 5489.2). Total num frames: 654988288. Throughput: 0: 4937.3. Samples: 654984630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:09:56,023][25689] Avg episode reward: [(0, '-6.278')] [2022-07-10 08:09:56,816][26022] Updated weights on worker 0-0, policy_version 639642 (0.01051) [2022-07-10 08:09:58,888][26022] Updated weights on worker 0-0, policy_version 639652 (0.00087) [2022-07-10 08:10:00,611][26022] Updated weights on worker 0-0, policy_version 639662 (0.00088) [2022-07-10 08:10:01,099][25689] Fps is (10 sec: 5584.9, 60 sec: 5485.5, 300 sec: 5499.4). Total num frames: 655015936. Throughput: 0: 5764.5. Samples: 655017636. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:01,099][25689] Avg episode reward: [(0, '-9.048')] [2022-07-10 08:10:02,847][26022] Updated weights on worker 0-0, policy_version 639672 (0.00082) [2022-07-10 08:10:04,774][26022] Updated weights on worker 0-0, policy_version 639682 (0.00094) [2022-07-10 08:10:06,108][25689] Fps is (10 sec: 5279.0, 60 sec: 5504.2, 300 sec: 5493.8). Total num frames: 655041536. Throughput: 0: 5665.5. Samples: 655048724. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:06,110][25689] Avg episode reward: [(0, '-8.939')] [2022-07-10 08:10:06,532][26022] Updated weights on worker 0-0, policy_version 639692 (0.00088) [2022-07-10 08:10:08,598][26022] Updated weights on worker 0-0, policy_version 639702 (0.00079) [2022-07-10 08:10:10,257][26022] Updated weights on worker 0-0, policy_version 639712 (0.00094) [2022-07-10 08:10:11,135][25689] Fps is (10 sec: 5203.6, 60 sec: 5454.8, 300 sec: 5484.2). Total num frames: 655068160. Throughput: 0: 4821.9. Samples: 655065102. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:11,136][25689] Avg episode reward: [(0, '-9.078')] [2022-07-10 08:10:12,258][26022] Updated weights on worker 0-0, policy_version 639722 (0.00088) [2022-07-10 08:10:14,091][26022] Updated weights on worker 0-0, policy_version 639732 (0.00083) [2022-07-10 08:10:15,163][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:10:15,177][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000639737_655090688.pth [2022-07-10 08:10:15,178][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000637805_653112320.pth [2022-07-10 08:10:15,823][26022] Updated weights on worker 0-0, policy_version 639742 (0.00088) [2022-07-10 08:10:16,182][25689] Fps is (10 sec: 5590.7, 60 sec: 5487.1, 300 sec: 5498.1). Total num frames: 655097856. Throughput: 0: 5629.6. Samples: 655098126. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:16,182][25689] Avg episode reward: [(0, '-8.929')] [2022-07-10 08:10:18,027][26022] Updated weights on worker 0-0, policy_version 639752 (0.00091) [2022-07-10 08:10:19,573][26022] Updated weights on worker 0-0, policy_version 639762 (0.00090) [2022-07-10 08:10:21,262][25689] Fps is (10 sec: 5561.1, 60 sec: 5470.9, 300 sec: 5489.9). Total num frames: 655124480. Throughput: 0: 5646.4. Samples: 655131488. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:21,266][25689] Avg episode reward: [(0, '-7.706')] [2022-07-10 08:10:21,642][26022] Updated weights on worker 0-0, policy_version 639772 (0.00092) [2022-07-10 08:10:23,314][26022] Updated weights on worker 0-0, policy_version 639782 (0.00092) [2022-07-10 08:10:25,202][26022] Updated weights on worker 0-0, policy_version 639792 (0.00088) [2022-07-10 08:10:26,282][25689] Fps is (10 sec: 5474.5, 60 sec: 5488.5, 300 sec: 5493.5). Total num frames: 655153152. Throughput: 0: 4923.8. Samples: 655148058. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:26,284][25689] Avg episode reward: [(0, '-6.845')] [2022-07-10 08:10:27,199][26022] Updated weights on worker 0-0, policy_version 639802 (0.00083) [2022-07-10 08:10:28,867][26022] Updated weights on worker 0-0, policy_version 639812 (0.00090) [2022-07-10 08:10:30,738][26022] Updated weights on worker 0-0, policy_version 639822 (0.00092) [2022-07-10 08:10:31,384][25689] Fps is (10 sec: 5665.7, 60 sec: 5479.8, 300 sec: 5502.1). Total num frames: 655181824. Throughput: 0: 5740.3. Samples: 655181336. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:31,384][25689] Avg episode reward: [(0, '-4.583')] [2022-07-10 08:10:32,551][26022] Updated weights on worker 0-0, policy_version 639832 (0.00086) [2022-07-10 08:10:34,448][26022] Updated weights on worker 0-0, policy_version 639842 (0.00089) [2022-07-10 08:10:36,387][25689] Fps is (10 sec: 5371.1, 60 sec: 5453.9, 300 sec: 5489.7). Total num frames: 655207424. Throughput: 0: 5765.0. Samples: 655214608. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:36,387][25689] Avg episode reward: [(0, '-3.813')] [2022-07-10 08:10:36,440][26022] Updated weights on worker 0-0, policy_version 639852 (0.00089) [2022-07-10 08:10:38,074][26022] Updated weights on worker 0-0, policy_version 639862 (0.00084) [2022-07-10 08:10:40,163][26022] Updated weights on worker 0-0, policy_version 639872 (0.00087) [2022-07-10 08:10:41,460][25689] Fps is (10 sec: 5386.1, 60 sec: 5486.4, 300 sec: 5492.1). Total num frames: 655236096. Throughput: 0: 4937.2. Samples: 655231208. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:41,460][25689] Avg episode reward: [(0, '-5.003')] [2022-07-10 08:10:42,027][26022] Updated weights on worker 0-0, policy_version 639882 (0.00459) [2022-07-10 08:10:43,646][26022] Updated weights on worker 0-0, policy_version 639892 (0.00087) [2022-07-10 08:10:45,750][26022] Updated weights on worker 0-0, policy_version 639902 (0.00093) [2022-07-10 08:10:46,556][25689] Fps is (10 sec: 5639.0, 60 sec: 5478.5, 300 sec: 5497.3). Total num frames: 655264768. Throughput: 0: 5715.7. Samples: 655263936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:46,556][25689] Avg episode reward: [(0, '-5.016')] [2022-07-10 08:10:47,263][26022] Updated weights on worker 0-0, policy_version 639912 (0.00084) [2022-07-10 08:10:49,479][26022] Updated weights on worker 0-0, policy_version 639922 (0.00095) [2022-07-10 08:10:50,968][26022] Updated weights on worker 0-0, policy_version 639932 (0.00090) [2022-07-10 08:10:51,575][25689] Fps is (10 sec: 5466.7, 60 sec: 5478.4, 300 sec: 5493.6). Total num frames: 655291392. Throughput: 0: 5739.6. Samples: 655297226. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:51,575][25689] Avg episode reward: [(0, '-5.415')] [2022-07-10 08:10:53,145][26022] Updated weights on worker 0-0, policy_version 639942 (0.00087) [2022-07-10 08:10:54,803][26022] Updated weights on worker 0-0, policy_version 639952 (0.00098) [2022-07-10 08:10:56,612][25689] Fps is (10 sec: 5397.1, 60 sec: 5458.8, 300 sec: 5494.5). Total num frames: 655319040. Throughput: 0: 4904.4. Samples: 655313800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:10:56,618][25689] Avg episode reward: [(0, '-6.744')] [2022-07-10 08:10:56,972][26022] Updated weights on worker 0-0, policy_version 639962 (0.00089) [2022-07-10 08:10:58,541][26022] Updated weights on worker 0-0, policy_version 639972 (0.00090) [2022-07-10 08:11:00,560][26022] Updated weights on worker 0-0, policy_version 639982 (0.00089) [2022-07-10 08:11:01,767][25689] Fps is (10 sec: 5526.1, 60 sec: 5468.7, 300 sec: 5498.7). Total num frames: 655347712. Throughput: 0: 5692.0. Samples: 655346794. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:01,769][25689] Avg episode reward: [(0, '-7.815')] [2022-07-10 08:11:02,689][26022] Updated weights on worker 0-0, policy_version 639992 (0.00092) [2022-07-10 08:11:04,648][26022] Updated weights on worker 0-0, policy_version 640002 (0.00080) [2022-07-10 08:11:06,411][26022] Updated weights on worker 0-0, policy_version 640012 (0.00068) [2022-07-10 08:11:06,779][25689] Fps is (10 sec: 5338.1, 60 sec: 5468.5, 300 sec: 5491.9). Total num frames: 655373312. Throughput: 0: 5632.0. Samples: 655377830. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:06,779][25689] Avg episode reward: [(0, '-6.710')] [2022-07-10 08:11:08,332][26022] Updated weights on worker 0-0, policy_version 640022 (0.00097) [2022-07-10 08:11:10,275][26022] Updated weights on worker 0-0, policy_version 640032 (0.00082) [2022-07-10 08:11:11,813][25689] Fps is (10 sec: 5402.4, 60 sec: 5501.6, 300 sec: 5494.8). Total num frames: 655401984. Throughput: 0: 5609.8. Samples: 655410756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:11,814][25689] Avg episode reward: [(0, '-6.789')] [2022-07-10 08:11:11,958][26022] Updated weights on worker 0-0, policy_version 640042 (0.00086) [2022-07-10 08:11:14,115][26022] Updated weights on worker 0-0, policy_version 640052 (0.00090) [2022-07-10 08:11:15,651][26022] Updated weights on worker 0-0, policy_version 640062 (0.00530) [2022-07-10 08:11:16,843][25689] Fps is (10 sec: 5392.6, 60 sec: 5435.6, 300 sec: 5488.3). Total num frames: 655427584. Throughput: 0: 5620.2. Samples: 655427502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:16,843][25689] Avg episode reward: [(0, '-8.015')] [2022-07-10 08:11:17,645][26022] Updated weights on worker 0-0, policy_version 640072 (0.00093) [2022-07-10 08:11:19,447][26022] Updated weights on worker 0-0, policy_version 640082 (0.00083) [2022-07-10 08:11:21,206][26022] Updated weights on worker 0-0, policy_version 640092 (0.00086) [2022-07-10 08:11:21,977][25689] Fps is (10 sec: 5440.3, 60 sec: 5481.4, 300 sec: 5490.3). Total num frames: 655457280. Throughput: 0: 5633.3. Samples: 655460644. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:21,977][25689] Avg episode reward: [(0, '-6.484')] [2022-07-10 08:11:23,215][26022] Updated weights on worker 0-0, policy_version 640102 (0.00085) [2022-07-10 08:11:25,074][26022] Updated weights on worker 0-0, policy_version 640112 (0.00087) [2022-07-10 08:11:26,632][26022] Updated weights on worker 0-0, policy_version 640122 (0.00086) [2022-07-10 08:11:27,034][25689] Fps is (10 sec: 5727.5, 60 sec: 5478.1, 300 sec: 5492.8). Total num frames: 655485952. Throughput: 0: 5746.6. Samples: 655494226. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:27,034][25689] Avg episode reward: [(0, '-5.947')] [2022-07-10 08:11:28,873][26022] Updated weights on worker 0-0, policy_version 640132 (0.00085) [2022-07-10 08:11:30,366][26022] Updated weights on worker 0-0, policy_version 640142 (0.00089) [2022-07-10 08:11:32,092][25689] Fps is (10 sec: 5567.7, 60 sec: 5465.1, 300 sec: 5491.9). Total num frames: 655513600. Throughput: 0: 4940.1. Samples: 655510940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 08:11:32,093][25689] Avg episode reward: [(0, '-5.270')] [2022-07-10 08:11:32,400][26022] Updated weights on worker 0-0, policy_version 640152 (0.00091) [2022-07-10 08:11:34,274][26022] Updated weights on worker 0-0, policy_version 640162 (0.00090) [2022-07-10 08:11:36,057][26022] Updated weights on worker 0-0, policy_version 640172 (0.00057) [2022-07-10 08:11:37,175][25689] Fps is (10 sec: 5553.3, 60 sec: 5508.4, 300 sec: 5494.9). Total num frames: 655542272. Throughput: 0: 5739.2. Samples: 655544194. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:11:37,176][25689] Avg episode reward: [(0, '-5.631')] [2022-07-10 08:11:37,970][26022] Updated weights on worker 0-0, policy_version 640182 (0.00386) [2022-07-10 08:11:39,683][26022] Updated weights on worker 0-0, policy_version 640192 (0.00092) [2022-07-10 08:11:41,713][26022] Updated weights on worker 0-0, policy_version 640202 (0.00091) [2022-07-10 08:11:42,225][25689] Fps is (10 sec: 5558.3, 60 sec: 5493.7, 300 sec: 5494.1). Total num frames: 655569920. Throughput: 0: 5757.4. Samples: 655577222. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:11:42,226][25689] Avg episode reward: [(0, '-4.125')] [2022-07-10 08:11:43,287][26022] Updated weights on worker 0-0, policy_version 640212 (0.00093) [2022-07-10 08:11:45,201][26022] Updated weights on worker 0-0, policy_version 640222 (0.00089) [2022-07-10 08:11:47,234][25689] Fps is (10 sec: 5395.5, 60 sec: 5467.8, 300 sec: 5490.7). Total num frames: 655596544. Throughput: 0: 4939.1. Samples: 655593998. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:11:47,235][25689] Avg episode reward: [(0, '-2.058')] [2022-07-10 08:11:47,279][26022] Updated weights on worker 0-0, policy_version 640232 (0.00096) [2022-07-10 08:11:48,917][26022] Updated weights on worker 0-0, policy_version 640242 (0.00087) [2022-07-10 08:11:50,875][26022] Updated weights on worker 0-0, policy_version 640252 (0.00094) [2022-07-10 08:11:52,250][25689] Fps is (10 sec: 5515.6, 60 sec: 5501.8, 300 sec: 5494.0). Total num frames: 655625216. Throughput: 0: 5769.0. Samples: 655627232. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:11:52,251][25689] Avg episode reward: [(0, '-2.178')] [2022-07-10 08:11:52,818][26022] Updated weights on worker 0-0, policy_version 640262 (0.00089) [2022-07-10 08:11:54,660][26022] Updated weights on worker 0-0, policy_version 640272 (0.00084) [2022-07-10 08:11:56,372][26022] Updated weights on worker 0-0, policy_version 640282 (0.00092) [2022-07-10 08:11:57,260][25689] Fps is (10 sec: 5719.5, 60 sec: 5521.1, 300 sec: 5502.3). Total num frames: 655653888. Throughput: 0: 5803.8. Samples: 655660762. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:11:57,261][25689] Avg episode reward: [(0, '-1.737')] [2022-07-10 08:11:58,359][26022] Updated weights on worker 0-0, policy_version 640292 (0.00090) [2022-07-10 08:12:00,020][26022] Updated weights on worker 0-0, policy_version 640302 (0.00088) [2022-07-10 08:12:02,392][25689] Fps is (10 sec: 5250.4, 60 sec: 5455.7, 300 sec: 5490.1). Total num frames: 655678464. Throughput: 0: 4968.3. Samples: 655677418. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:02,394][25689] Avg episode reward: [(0, '-2.814')] [2022-07-10 08:12:02,425][26022] Updated weights on worker 0-0, policy_version 640312 (0.00093) [2022-07-10 08:12:04,141][26022] Updated weights on worker 0-0, policy_version 640322 (0.00089) [2022-07-10 08:12:06,146][26022] Updated weights on worker 0-0, policy_version 640332 (0.00096) [2022-07-10 08:12:07,415][25689] Fps is (10 sec: 5243.9, 60 sec: 5505.4, 300 sec: 5497.1). Total num frames: 655707136. Throughput: 0: 5663.2. Samples: 655708284. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:07,415][25689] Avg episode reward: [(0, '-2.918')] [2022-07-10 08:12:07,936][26022] Updated weights on worker 0-0, policy_version 640342 (0.00091) [2022-07-10 08:12:09,665][26022] Updated weights on worker 0-0, policy_version 640352 (0.00086) [2022-07-10 08:12:11,711][26022] Updated weights on worker 0-0, policy_version 640362 (0.00096) [2022-07-10 08:12:12,423][25689] Fps is (10 sec: 5512.5, 60 sec: 5473.9, 300 sec: 5490.5). Total num frames: 655733760. Throughput: 0: 5659.6. Samples: 655741402. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:12,424][25689] Avg episode reward: [(0, '-3.261')] [2022-07-10 08:12:13,465][26022] Updated weights on worker 0-0, policy_version 640372 (0.00087) [2022-07-10 08:12:15,260][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:12:15,270][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000640381_655750144.pth [2022-07-10 08:12:15,271][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000638448_653770752.pth [2022-07-10 08:12:15,390][26022] Updated weights on worker 0-0, policy_version 640382 (0.00097) [2022-07-10 08:12:17,278][26022] Updated weights on worker 0-0, policy_version 640392 (0.00089) [2022-07-10 08:12:17,505][25689] Fps is (10 sec: 5480.3, 60 sec: 5519.9, 300 sec: 5491.4). Total num frames: 655762432. Throughput: 0: 4807.7. Samples: 655758090. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:17,505][25689] Avg episode reward: [(0, '-3.706')] [2022-07-10 08:12:19,095][26022] Updated weights on worker 0-0, policy_version 640402 (0.00949) [2022-07-10 08:12:20,903][26022] Updated weights on worker 0-0, policy_version 640412 (0.00090) [2022-07-10 08:12:22,553][25689] Fps is (10 sec: 5560.0, 60 sec: 5493.9, 300 sec: 5488.5). Total num frames: 655790080. Throughput: 0: 5651.8. Samples: 655791362. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:22,554][25689] Avg episode reward: [(0, '-4.063')] [2022-07-10 08:12:22,854][26022] Updated weights on worker 0-0, policy_version 640422 (0.00087) [2022-07-10 08:12:24,555][26022] Updated weights on worker 0-0, policy_version 640432 (0.00087) [2022-07-10 08:12:26,534][26022] Updated weights on worker 0-0, policy_version 640442 (0.00085) [2022-07-10 08:12:27,562][25689] Fps is (10 sec: 5599.7, 60 sec: 5498.2, 300 sec: 5496.1). Total num frames: 655818752. Throughput: 0: 5762.3. Samples: 655824380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:27,563][25689] Avg episode reward: [(0, '-4.556')] [2022-07-10 08:12:28,318][26022] Updated weights on worker 0-0, policy_version 640452 (0.00634) [2022-07-10 08:12:30,277][26022] Updated weights on worker 0-0, policy_version 640462 (0.00091) [2022-07-10 08:12:32,019][26022] Updated weights on worker 0-0, policy_version 640472 (0.00096) [2022-07-10 08:12:32,577][25689] Fps is (10 sec: 5618.2, 60 sec: 5502.2, 300 sec: 5493.3). Total num frames: 655846400. Throughput: 0: 4941.6. Samples: 655840996. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:32,578][25689] Avg episode reward: [(0, '-6.593')] [2022-07-10 08:12:33,838][26022] Updated weights on worker 0-0, policy_version 640482 (0.00090) [2022-07-10 08:12:35,692][26022] Updated weights on worker 0-0, policy_version 640492 (0.00089) [2022-07-10 08:12:37,619][25689] Fps is (10 sec: 5396.8, 60 sec: 5472.1, 300 sec: 5486.6). Total num frames: 655873024. Throughput: 0: 5785.0. Samples: 655874450. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:37,619][25689] Avg episode reward: [(0, '-6.331')] [2022-07-10 08:12:37,631][26022] Updated weights on worker 0-0, policy_version 640502 (0.00093) [2022-07-10 08:12:39,439][26022] Updated weights on worker 0-0, policy_version 640512 (0.00086) [2022-07-10 08:12:41,241][26022] Updated weights on worker 0-0, policy_version 640522 (0.00093) [2022-07-10 08:12:42,691][25689] Fps is (10 sec: 5467.5, 60 sec: 5487.0, 300 sec: 5493.2). Total num frames: 655901696. Throughput: 0: 5760.4. Samples: 655907366. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:42,691][25689] Avg episode reward: [(0, '-7.021')] [2022-07-10 08:12:43,309][26022] Updated weights on worker 0-0, policy_version 640532 (0.00091) [2022-07-10 08:12:45,019][26022] Updated weights on worker 0-0, policy_version 640542 (0.00899) [2022-07-10 08:12:46,897][26022] Updated weights on worker 0-0, policy_version 640552 (0.00087) [2022-07-10 08:12:47,729][25689] Fps is (10 sec: 5570.2, 60 sec: 5501.3, 300 sec: 5496.3). Total num frames: 655929344. Throughput: 0: 4938.5. Samples: 655923976. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:47,730][25689] Avg episode reward: [(0, '-8.559')] [2022-07-10 08:12:48,739][26022] Updated weights on worker 0-0, policy_version 640562 (0.00088) [2022-07-10 08:12:50,385][26022] Updated weights on worker 0-0, policy_version 640572 (0.00085) [2022-07-10 08:12:52,497][26022] Updated weights on worker 0-0, policy_version 640582 (0.00051) [2022-07-10 08:12:52,745][25689] Fps is (10 sec: 5398.1, 60 sec: 5467.5, 300 sec: 5482.8). Total num frames: 655955968. Throughput: 0: 5765.9. Samples: 655957280. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:52,745][25689] Avg episode reward: [(0, '-8.300')] [2022-07-10 08:12:54,141][26022] Updated weights on worker 0-0, policy_version 640592 (0.00082) [2022-07-10 08:12:56,364][26022] Updated weights on worker 0-0, policy_version 640602 (0.00087) [2022-07-10 08:12:57,779][25689] Fps is (10 sec: 5604.3, 60 sec: 5482.2, 300 sec: 5497.9). Total num frames: 655985664. Throughput: 0: 5764.4. Samples: 655990662. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:12:57,779][25689] Avg episode reward: [(0, '-7.852')] [2022-07-10 08:12:58,052][26022] Updated weights on worker 0-0, policy_version 640612 (0.00084) [2022-07-10 08:12:59,855][26022] Updated weights on worker 0-0, policy_version 640622 (0.00110) [2022-07-10 08:13:01,850][26022] Updated weights on worker 0-0, policy_version 640632 (0.00094) [2022-07-10 08:13:02,905][25689] Fps is (10 sec: 5442.2, 60 sec: 5499.6, 300 sec: 5489.3). Total num frames: 656011264. Throughput: 0: 4949.0. Samples: 656007408. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:02,906][25689] Avg episode reward: [(0, '-5.551')] [2022-07-10 08:13:03,964][26022] Updated weights on worker 0-0, policy_version 640642 (0.00095) [2022-07-10 08:13:05,537][26022] Updated weights on worker 0-0, policy_version 640652 (0.00096) [2022-07-10 08:13:07,629][26022] Updated weights on worker 0-0, policy_version 640662 (0.00081) [2022-07-10 08:13:07,934][25689] Fps is (10 sec: 5344.4, 60 sec: 5499.1, 300 sec: 5492.3). Total num frames: 656039936. Throughput: 0: 5687.8. Samples: 656038894. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:07,934][25689] Avg episode reward: [(0, '-6.091')] [2022-07-10 08:13:09,320][26022] Updated weights on worker 0-0, policy_version 640672 (0.00093) [2022-07-10 08:13:11,254][26022] Updated weights on worker 0-0, policy_version 640682 (0.00092) [2022-07-10 08:13:12,967][25689] Fps is (10 sec: 5699.1, 60 sec: 5530.7, 300 sec: 5495.2). Total num frames: 656068608. Throughput: 0: 5680.5. Samples: 656072154. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:12,968][26022] Updated weights on worker 0-0, policy_version 640692 (0.00080) [2022-07-10 08:13:12,968][25689] Avg episode reward: [(0, '-6.426')] [2022-07-10 08:13:14,904][26022] Updated weights on worker 0-0, policy_version 640702 (0.00092) [2022-07-10 08:13:16,814][26022] Updated weights on worker 0-0, policy_version 640712 (0.00087) [2022-07-10 08:13:17,980][25689] Fps is (10 sec: 5504.1, 60 sec: 5503.1, 300 sec: 5489.1). Total num frames: 656095232. Throughput: 0: 4856.2. Samples: 656088764. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:17,980][25689] Avg episode reward: [(0, '-5.960')] [2022-07-10 08:13:18,666][26022] Updated weights on worker 0-0, policy_version 640722 (0.00092) [2022-07-10 08:13:20,354][26022] Updated weights on worker 0-0, policy_version 640732 (0.00052) [2022-07-10 08:13:22,520][26022] Updated weights on worker 0-0, policy_version 640742 (0.00081) [2022-07-10 08:13:23,106][25689] Fps is (10 sec: 5352.9, 60 sec: 5496.0, 300 sec: 5491.5). Total num frames: 656122880. Throughput: 0: 5674.8. Samples: 656122042. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:23,106][25689] Avg episode reward: [(0, '-5.369')] [2022-07-10 08:13:24,087][26022] Updated weights on worker 0-0, policy_version 640752 (0.00054) [2022-07-10 08:13:26,007][26022] Updated weights on worker 0-0, policy_version 640762 (0.00095) [2022-07-10 08:13:27,871][26022] Updated weights on worker 0-0, policy_version 640772 (0.00095) [2022-07-10 08:13:28,122][25689] Fps is (10 sec: 5452.1, 60 sec: 5478.5, 300 sec: 5488.2). Total num frames: 656150528. Throughput: 0: 5765.5. Samples: 656155290. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:28,123][25689] Avg episode reward: [(0, '-5.641')] [2022-07-10 08:13:29,691][26022] Updated weights on worker 0-0, policy_version 640782 (0.00083) [2022-07-10 08:13:31,497][26022] Updated weights on worker 0-0, policy_version 640792 (0.00082) [2022-07-10 08:13:33,135][25689] Fps is (10 sec: 5717.6, 60 sec: 5512.5, 300 sec: 5499.2). Total num frames: 656180224. Throughput: 0: 5780.6. Samples: 656188738. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:33,136][25689] Avg episode reward: [(0, '-7.234')] [2022-07-10 08:13:33,312][26022] Updated weights on worker 0-0, policy_version 640802 (0.00086) [2022-07-10 08:13:35,219][26022] Updated weights on worker 0-0, policy_version 640812 (0.00092) [2022-07-10 08:13:37,150][26022] Updated weights on worker 0-0, policy_version 640822 (0.00091) [2022-07-10 08:13:38,163][25689] Fps is (10 sec: 5609.2, 60 sec: 5513.7, 300 sec: 5493.2). Total num frames: 656206848. Throughput: 0: 5779.6. Samples: 656205412. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:38,163][25689] Avg episode reward: [(0, '-6.536')] [2022-07-10 08:13:38,992][26022] Updated weights on worker 0-0, policy_version 640832 (0.00093) [2022-07-10 08:13:40,801][26022] Updated weights on worker 0-0, policy_version 640842 (0.00083) [2022-07-10 08:13:42,739][26022] Updated weights on worker 0-0, policy_version 640852 (0.00087) [2022-07-10 08:13:43,234][25689] Fps is (10 sec: 5475.3, 60 sec: 5513.8, 300 sec: 5491.9). Total num frames: 656235520. Throughput: 0: 5785.7. Samples: 656238500. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:43,235][25689] Avg episode reward: [(0, '-5.176')] [2022-07-10 08:13:44,604][26022] Updated weights on worker 0-0, policy_version 640862 (0.00090) [2022-07-10 08:13:46,248][26022] Updated weights on worker 0-0, policy_version 640872 (0.00085) [2022-07-10 08:13:48,191][26022] Updated weights on worker 0-0, policy_version 640882 (0.00087) [2022-07-10 08:13:48,245][25689] Fps is (10 sec: 5586.1, 60 sec: 5516.3, 300 sec: 5495.4). Total num frames: 656263168. Throughput: 0: 5799.2. Samples: 656271986. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:48,246][25689] Avg episode reward: [(0, '-4.913')] [2022-07-10 08:13:50,048][26022] Updated weights on worker 0-0, policy_version 640892 (0.00084) [2022-07-10 08:13:52,011][26022] Updated weights on worker 0-0, policy_version 640902 (0.00090) [2022-07-10 08:13:53,274][25689] Fps is (10 sec: 5405.8, 60 sec: 5515.1, 300 sec: 5488.4). Total num frames: 656289792. Throughput: 0: 4947.7. Samples: 656288378. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:53,276][25689] Avg episode reward: [(0, '-4.895')] [2022-07-10 08:13:53,868][26022] Updated weights on worker 0-0, policy_version 640912 (0.00086) [2022-07-10 08:13:55,496][26022] Updated weights on worker 0-0, policy_version 640922 (0.00096) [2022-07-10 08:13:57,545][26022] Updated weights on worker 0-0, policy_version 640932 (0.00087) [2022-07-10 08:13:58,280][25689] Fps is (10 sec: 5408.4, 60 sec: 5483.8, 300 sec: 5487.0). Total num frames: 656317440. Throughput: 0: 5773.7. Samples: 656321562. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:13:58,280][25689] Avg episode reward: [(0, '-5.033')] [2022-07-10 08:13:59,414][26022] Updated weights on worker 0-0, policy_version 640942 (0.00087) [2022-07-10 08:14:01,233][26022] Updated weights on worker 0-0, policy_version 640952 (0.00091) [2022-07-10 08:14:03,323][25689] Fps is (10 sec: 5299.0, 60 sec: 5491.4, 300 sec: 5490.2). Total num frames: 656343040. Throughput: 0: 5685.6. Samples: 656352714. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:03,323][25689] Avg episode reward: [(0, '-4.038')] [2022-07-10 08:14:03,708][26022] Updated weights on worker 0-0, policy_version 640962 (0.00089) [2022-07-10 08:14:05,140][26022] Updated weights on worker 0-0, policy_version 640972 (0.00088) [2022-07-10 08:14:07,225][26022] Updated weights on worker 0-0, policy_version 640982 (0.00090) [2022-07-10 08:14:08,358][25689] Fps is (10 sec: 5385.2, 60 sec: 5490.8, 300 sec: 5486.9). Total num frames: 656371712. Throughput: 0: 4850.0. Samples: 656369534. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:08,358][25689] Avg episode reward: [(0, '-6.832')] [2022-07-10 08:14:08,830][26022] Updated weights on worker 0-0, policy_version 640992 (0.00087) [2022-07-10 08:14:10,904][26022] Updated weights on worker 0-0, policy_version 641002 (0.00098) [2022-07-10 08:14:12,784][26022] Updated weights on worker 0-0, policy_version 641012 (0.00096) [2022-07-10 08:14:13,367][25689] Fps is (10 sec: 5607.3, 60 sec: 5476.1, 300 sec: 5487.2). Total num frames: 656399360. Throughput: 0: 5693.4. Samples: 656402774. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:13,367][25689] Avg episode reward: [(0, '-8.139')] [2022-07-10 08:14:14,735][26022] Updated weights on worker 0-0, policy_version 641022 (0.00087) [2022-07-10 08:14:15,506][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:14:15,526][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000641027_656411648.pth [2022-07-10 08:14:15,527][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000639093_654431232.pth [2022-07-10 08:14:16,335][26022] Updated weights on worker 0-0, policy_version 641032 (0.00086) [2022-07-10 08:14:18,339][26022] Updated weights on worker 0-0, policy_version 641042 (0.00084) [2022-07-10 08:14:18,369][25689] Fps is (10 sec: 5523.4, 60 sec: 5494.0, 300 sec: 5488.9). Total num frames: 656427008. Throughput: 0: 5696.6. Samples: 656436002. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:18,370][25689] Avg episode reward: [(0, '-8.947')] [2022-07-10 08:14:19,928][26022] Updated weights on worker 0-0, policy_version 641052 (0.00091) [2022-07-10 08:14:22,120][26022] Updated weights on worker 0-0, policy_version 641062 (0.00091) [2022-07-10 08:14:23,543][25689] Fps is (10 sec: 5534.4, 60 sec: 5506.5, 300 sec: 5489.6). Total num frames: 656455680. Throughput: 0: 4935.8. Samples: 656452518. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:23,544][25689] Avg episode reward: [(0, '-9.783')] [2022-07-10 08:14:23,802][26022] Updated weights on worker 0-0, policy_version 641072 (0.00093) [2022-07-10 08:14:25,532][26022] Updated weights on worker 0-0, policy_version 641082 (0.00086) [2022-07-10 08:14:27,716][26022] Updated weights on worker 0-0, policy_version 641092 (0.00091) [2022-07-10 08:14:28,551][25689] Fps is (10 sec: 5531.4, 60 sec: 5507.3, 300 sec: 5486.1). Total num frames: 656483328. Throughput: 0: 5736.6. Samples: 656485374. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:28,552][25689] Avg episode reward: [(0, '-9.135')] [2022-07-10 08:14:29,597][26022] Updated weights on worker 0-0, policy_version 641102 (0.00090) [2022-07-10 08:14:31,403][26022] Updated weights on worker 0-0, policy_version 641112 (0.00097) [2022-07-10 08:14:33,351][26022] Updated weights on worker 0-0, policy_version 641122 (0.00069) [2022-07-10 08:14:33,640][25689] Fps is (10 sec: 5375.3, 60 sec: 5449.7, 300 sec: 5482.7). Total num frames: 656509952. Throughput: 0: 5688.8. Samples: 656518102. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:33,640][25689] Avg episode reward: [(0, '-10.063')] [2022-07-10 08:14:35,041][26022] Updated weights on worker 0-0, policy_version 641132 (0.00086) [2022-07-10 08:14:36,889][26022] Updated weights on worker 0-0, policy_version 641142 (0.00088) [2022-07-10 08:14:38,674][25689] Fps is (10 sec: 5462.5, 60 sec: 5482.9, 300 sec: 5490.0). Total num frames: 656538624. Throughput: 0: 4861.7. Samples: 656534718. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:38,674][25689] Avg episode reward: [(0, '-8.981')] [2022-07-10 08:14:39,016][26022] Updated weights on worker 0-0, policy_version 641152 (0.00083) [2022-07-10 08:14:40,718][26022] Updated weights on worker 0-0, policy_version 641162 (0.00087) [2022-07-10 08:14:42,813][26022] Updated weights on worker 0-0, policy_version 641172 (0.00088) [2022-07-10 08:14:43,815][25689] Fps is (10 sec: 5535.2, 60 sec: 5459.7, 300 sec: 5484.1). Total num frames: 656566272. Throughput: 0: 5679.0. Samples: 656567640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2022-07-10 08:14:43,815][25689] Avg episode reward: [(0, '-8.208')] [2022-07-10 08:14:44,373][26022] Updated weights on worker 0-0, policy_version 641182 (0.00087) [2022-07-10 08:14:46,455][26022] Updated weights on worker 0-0, policy_version 641192 (0.00091) [2022-07-10 08:14:48,115][26022] Updated weights on worker 0-0, policy_version 641202 (0.00089) [2022-07-10 08:14:48,824][25689] Fps is (10 sec: 5447.7, 60 sec: 5459.8, 300 sec: 5487.7). Total num frames: 656593920. Throughput: 0: 5683.1. Samples: 656600588. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:14:48,826][25689] Avg episode reward: [(0, '-8.578')] [2022-07-10 08:14:50,058][26022] Updated weights on worker 0-0, policy_version 641212 (0.00087) [2022-07-10 08:14:51,941][26022] Updated weights on worker 0-0, policy_version 641222 (0.00398) [2022-07-10 08:14:53,747][26022] Updated weights on worker 0-0, policy_version 641232 (0.00087) [2022-07-10 08:14:53,850][25689] Fps is (10 sec: 5510.0, 60 sec: 5477.0, 300 sec: 5483.9). Total num frames: 656621568. Throughput: 0: 4911.3. Samples: 656617362. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:14:53,851][25689] Avg episode reward: [(0, '-7.789')] [2022-07-10 08:14:55,632][26022] Updated weights on worker 0-0, policy_version 641242 (0.00086) [2022-07-10 08:14:57,487][26022] Updated weights on worker 0-0, policy_version 641252 (0.00091) [2022-07-10 08:14:58,867][25689] Fps is (10 sec: 5506.1, 60 sec: 5476.0, 300 sec: 5485.1). Total num frames: 656649216. Throughput: 0: 5729.1. Samples: 656650406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:14:58,867][25689] Avg episode reward: [(0, '-7.633')] [2022-07-10 08:14:59,435][26022] Updated weights on worker 0-0, policy_version 641262 (0.00094) [2022-07-10 08:15:01,261][26022] Updated weights on worker 0-0, policy_version 641272 (0.00092) [2022-07-10 08:15:03,420][26022] Updated weights on worker 0-0, policy_version 641282 (0.00086) [2022-07-10 08:15:03,950][25689] Fps is (10 sec: 5373.9, 60 sec: 5489.3, 300 sec: 5487.2). Total num frames: 656675840. Throughput: 0: 5659.3. Samples: 656681590. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:03,950][25689] Avg episode reward: [(0, '-8.154')] [2022-07-10 08:15:05,263][26022] Updated weights on worker 0-0, policy_version 641292 (0.00091) [2022-07-10 08:15:07,199][26022] Updated weights on worker 0-0, policy_version 641302 (0.00082) [2022-07-10 08:15:08,903][26022] Updated weights on worker 0-0, policy_version 641312 (0.00086) [2022-07-10 08:15:08,977][25689] Fps is (10 sec: 5469.1, 60 sec: 5490.0, 300 sec: 5494.0). Total num frames: 656704512. Throughput: 0: 4850.3. Samples: 656698336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:08,978][25689] Avg episode reward: [(0, '-6.696')] [2022-07-10 08:15:10,877][26022] Updated weights on worker 0-0, policy_version 641322 (0.00105) [2022-07-10 08:15:12,237][26022] Updated weights on worker 0-0, policy_version 641332 (0.00081) [2022-07-10 08:15:14,027][25689] Fps is (10 sec: 5385.4, 60 sec: 5452.5, 300 sec: 5480.2). Total num frames: 656730112. Throughput: 0: 5674.2. Samples: 656731848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:14,028][25689] Avg episode reward: [(0, '-5.780')] [2022-07-10 08:15:14,487][26022] Updated weights on worker 0-0, policy_version 641342 (0.00086) [2022-07-10 08:15:16,238][26022] Updated weights on worker 0-0, policy_version 641352 (0.00742) [2022-07-10 08:15:18,202][26022] Updated weights on worker 0-0, policy_version 641362 (0.00090) [2022-07-10 08:15:19,029][25689] Fps is (10 sec: 5704.8, 60 sec: 5520.1, 300 sec: 5498.9). Total num frames: 656761856. Throughput: 0: 5672.7. Samples: 656764782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:19,029][25689] Avg episode reward: [(0, '-5.883')] [2022-07-10 08:15:20,217][26022] Updated weights on worker 0-0, policy_version 641372 (0.00094) [2022-07-10 08:15:21,759][26022] Updated weights on worker 0-0, policy_version 641382 (0.00090) [2022-07-10 08:15:23,875][26022] Updated weights on worker 0-0, policy_version 641392 (0.00089) [2022-07-10 08:15:24,111][25689] Fps is (10 sec: 5585.3, 60 sec: 5460.9, 300 sec: 5484.0). Total num frames: 656786432. Throughput: 0: 4952.8. Samples: 656781444. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:24,111][25689] Avg episode reward: [(0, '-5.407')] [2022-07-10 08:15:25,427][26022] Updated weights on worker 0-0, policy_version 641402 (0.00091) [2022-07-10 08:15:27,543][26022] Updated weights on worker 0-0, policy_version 641412 (0.00088) [2022-07-10 08:15:29,135][25689] Fps is (10 sec: 5370.5, 60 sec: 5493.2, 300 sec: 5488.8). Total num frames: 656816128. Throughput: 0: 5766.4. Samples: 656814572. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:29,135][25689] Avg episode reward: [(0, '-4.846')] [2022-07-10 08:15:29,138][26022] Updated weights on worker 0-0, policy_version 641422 (0.00094) [2022-07-10 08:15:31,050][26022] Updated weights on worker 0-0, policy_version 641432 (0.00095) [2022-07-10 08:15:33,021][26022] Updated weights on worker 0-0, policy_version 641442 (0.00089) [2022-07-10 08:15:34,233][25689] Fps is (10 sec: 5664.8, 60 sec: 5509.2, 300 sec: 5493.9). Total num frames: 656843776. Throughput: 0: 5747.7. Samples: 656847988. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:34,234][25689] Avg episode reward: [(0, '-5.493')] [2022-07-10 08:15:34,847][26022] Updated weights on worker 0-0, policy_version 641452 (0.00084) [2022-07-10 08:15:36,552][26022] Updated weights on worker 0-0, policy_version 641462 (0.00088) [2022-07-10 08:15:38,512][26022] Updated weights on worker 0-0, policy_version 641472 (0.00095) [2022-07-10 08:15:39,241][25689] Fps is (10 sec: 5370.1, 60 sec: 5477.8, 300 sec: 5488.3). Total num frames: 656870400. Throughput: 0: 4951.7. Samples: 656864866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:39,242][25689] Avg episode reward: [(0, '-4.820')] [2022-07-10 08:15:40,219][26022] Updated weights on worker 0-0, policy_version 641482 (0.00091) [2022-07-10 08:15:42,278][26022] Updated weights on worker 0-0, policy_version 641492 (0.00085) [2022-07-10 08:15:44,267][26022] Updated weights on worker 0-0, policy_version 641502 (0.00090) [2022-07-10 08:15:44,360][25689] Fps is (10 sec: 5460.3, 60 sec: 5496.7, 300 sec: 5487.8). Total num frames: 656899072. Throughput: 0: 5767.9. Samples: 656898242. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:44,361][25689] Avg episode reward: [(0, '-6.783')] [2022-07-10 08:15:45,790][26022] Updated weights on worker 0-0, policy_version 641512 (0.00087) [2022-07-10 08:15:48,097][26022] Updated weights on worker 0-0, policy_version 641522 (0.00087) [2022-07-10 08:15:49,368][25689] Fps is (10 sec: 5662.6, 60 sec: 5513.8, 300 sec: 5494.9). Total num frames: 656927744. Throughput: 0: 5769.9. Samples: 656931316. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:49,368][25689] Avg episode reward: [(0, '-7.200')] [2022-07-10 08:15:49,591][26022] Updated weights on worker 0-0, policy_version 641532 (0.00092) [2022-07-10 08:15:51,492][26022] Updated weights on worker 0-0, policy_version 641542 (0.00111) [2022-07-10 08:15:53,303][26022] Updated weights on worker 0-0, policy_version 641552 (0.00055) [2022-07-10 08:15:54,394][25689] Fps is (10 sec: 5612.9, 60 sec: 5513.7, 300 sec: 5495.1). Total num frames: 656955392. Throughput: 0: 5790.5. Samples: 656964730. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:54,395][25689] Avg episode reward: [(0, '-6.277')] [2022-07-10 08:15:55,130][26022] Updated weights on worker 0-0, policy_version 641562 (0.00092) [2022-07-10 08:15:57,144][26022] Updated weights on worker 0-0, policy_version 641572 (0.00090) [2022-07-10 08:15:58,884][26022] Updated weights on worker 0-0, policy_version 641582 (0.00087) [2022-07-10 08:15:59,442][25689] Fps is (10 sec: 5488.9, 60 sec: 5510.9, 300 sec: 5493.7). Total num frames: 656983040. Throughput: 0: 5758.2. Samples: 656981188. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:15:59,443][25689] Avg episode reward: [(0, '-7.752')] [2022-07-10 08:16:00,547][26022] Updated weights on worker 0-0, policy_version 641592 (0.00085) [2022-07-10 08:16:02,951][26022] Updated weights on worker 0-0, policy_version 641602 (0.00093) [2022-07-10 08:16:04,551][25689] Fps is (10 sec: 5242.7, 60 sec: 5491.6, 300 sec: 5491.9). Total num frames: 657008640. Throughput: 0: 5643.7. Samples: 657012192. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:04,555][25689] Avg episode reward: [(0, '-7.066')] [2022-07-10 08:16:04,980][26022] Updated weights on worker 0-0, policy_version 641612 (0.00092) [2022-07-10 08:16:06,589][26022] Updated weights on worker 0-0, policy_version 641622 (0.00088) [2022-07-10 08:16:08,679][26022] Updated weights on worker 0-0, policy_version 641632 (0.00088) [2022-07-10 08:16:09,583][25689] Fps is (10 sec: 5250.7, 60 sec: 5474.3, 300 sec: 5488.5). Total num frames: 657036288. Throughput: 0: 5618.0. Samples: 657044886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:09,585][25689] Avg episode reward: [(0, '-8.969')] [2022-07-10 08:16:10,360][26022] Updated weights on worker 0-0, policy_version 641642 (0.00096) [2022-07-10 08:16:12,605][26022] Updated weights on worker 0-0, policy_version 641652 (0.00092) [2022-07-10 08:16:14,214][26022] Updated weights on worker 0-0, policy_version 641662 (0.00612) [2022-07-10 08:16:14,630][25689] Fps is (10 sec: 5384.6, 60 sec: 5491.5, 300 sec: 5491.6). Total num frames: 657062912. Throughput: 0: 4771.1. Samples: 657061276. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:14,631][25689] Avg episode reward: [(0, '-7.880')] [2022-07-10 08:16:15,618][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:16:15,630][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000641669_657069056.pth [2022-07-10 08:16:15,631][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000639737_655090688.pth [2022-07-10 08:16:16,162][26022] Updated weights on worker 0-0, policy_version 641672 (0.00089) [2022-07-10 08:16:17,968][26022] Updated weights on worker 0-0, policy_version 641682 (0.00093) [2022-07-10 08:16:19,662][25689] Fps is (10 sec: 5486.3, 60 sec: 5438.1, 300 sec: 5490.0). Total num frames: 657091584. Throughput: 0: 5598.0. Samples: 657094380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:19,663][25689] Avg episode reward: [(0, '-7.822')] [2022-07-10 08:16:19,762][26022] Updated weights on worker 0-0, policy_version 641692 (0.00092) [2022-07-10 08:16:21,667][26022] Updated weights on worker 0-0, policy_version 641702 (0.00085) [2022-07-10 08:16:23,554][26022] Updated weights on worker 0-0, policy_version 641712 (0.00084) [2022-07-10 08:16:24,743][25689] Fps is (10 sec: 5468.0, 60 sec: 5471.9, 300 sec: 5482.7). Total num frames: 657118208. Throughput: 0: 5714.3. Samples: 657127574. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:24,743][25689] Avg episode reward: [(0, '-9.468')] [2022-07-10 08:16:25,166][26022] Updated weights on worker 0-0, policy_version 641722 (0.00099) [2022-07-10 08:16:27,294][26022] Updated weights on worker 0-0, policy_version 641732 (0.00080) [2022-07-10 08:16:28,801][26022] Updated weights on worker 0-0, policy_version 641742 (0.00092) [2022-07-10 08:16:29,752][25689] Fps is (10 sec: 5480.3, 60 sec: 5456.4, 300 sec: 5487.1). Total num frames: 657146880. Throughput: 0: 4931.7. Samples: 657144352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:29,752][25689] Avg episode reward: [(0, '-7.559')] [2022-07-10 08:16:30,951][26022] Updated weights on worker 0-0, policy_version 641752 (0.00093) [2022-07-10 08:16:32,694][26022] Updated weights on worker 0-0, policy_version 641762 (0.00090) [2022-07-10 08:16:34,414][26022] Updated weights on worker 0-0, policy_version 641772 (0.00091) [2022-07-10 08:16:34,836][25689] Fps is (10 sec: 5783.1, 60 sec: 5491.6, 300 sec: 5490.5). Total num frames: 657176576. Throughput: 0: 5763.0. Samples: 657177720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:34,836][25689] Avg episode reward: [(0, '-7.181')] [2022-07-10 08:16:36,653][26022] Updated weights on worker 0-0, policy_version 641782 (0.00085) [2022-07-10 08:16:38,055][26022] Updated weights on worker 0-0, policy_version 641792 (0.00086) [2022-07-10 08:16:39,912][25689] Fps is (10 sec: 5543.3, 60 sec: 5485.3, 300 sec: 5486.6). Total num frames: 657203200. Throughput: 0: 5755.0. Samples: 657210918. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:39,912][25689] Avg episode reward: [(0, '-6.306')] [2022-07-10 08:16:40,241][26022] Updated weights on worker 0-0, policy_version 641802 (0.00089) [2022-07-10 08:16:42,172][26022] Updated weights on worker 0-0, policy_version 641812 (0.00085) [2022-07-10 08:16:43,744][26022] Updated weights on worker 0-0, policy_version 641822 (0.00089) [2022-07-10 08:16:44,996][25689] Fps is (10 sec: 5341.7, 60 sec: 5471.7, 300 sec: 5488.6). Total num frames: 657230848. Throughput: 0: 4926.9. Samples: 657227358. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:44,996][25689] Avg episode reward: [(0, '-5.180')] [2022-07-10 08:16:45,803][26022] Updated weights on worker 0-0, policy_version 641832 (0.00614) [2022-07-10 08:16:47,622][26022] Updated weights on worker 0-0, policy_version 641842 (0.00096) [2022-07-10 08:16:49,475][26022] Updated weights on worker 0-0, policy_version 641852 (0.00091) [2022-07-10 08:16:50,030][25689] Fps is (10 sec: 5566.3, 60 sec: 5469.2, 300 sec: 5488.3). Total num frames: 657259520. Throughput: 0: 5726.2. Samples: 657260468. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:50,031][25689] Avg episode reward: [(0, '-4.422')] [2022-07-10 08:16:51,167][26022] Updated weights on worker 0-0, policy_version 641862 (0.00091) [2022-07-10 08:16:53,117][26022] Updated weights on worker 0-0, policy_version 641872 (0.00097) [2022-07-10 08:16:54,810][26022] Updated weights on worker 0-0, policy_version 641882 (0.00087) [2022-07-10 08:16:55,101][25689] Fps is (10 sec: 5674.4, 60 sec: 5482.1, 300 sec: 5487.2). Total num frames: 657288192. Throughput: 0: 5735.4. Samples: 657293952. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:16:55,102][25689] Avg episode reward: [(0, '-4.651')] [2022-07-10 08:16:56,851][26022] Updated weights on worker 0-0, policy_version 641892 (0.00101) [2022-07-10 08:16:58,575][26022] Updated weights on worker 0-0, policy_version 641902 (0.00090) [2022-07-10 08:17:00,134][25689] Fps is (10 sec: 5472.8, 60 sec: 5466.6, 300 sec: 5495.9). Total num frames: 657314816. Throughput: 0: 4914.1. Samples: 657310290. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:00,135][25689] Avg episode reward: [(0, '-5.109')] [2022-07-10 08:17:00,557][26022] Updated weights on worker 0-0, policy_version 641912 (0.00086) [2022-07-10 08:17:02,987][26022] Updated weights on worker 0-0, policy_version 641922 (0.00089) [2022-07-10 08:17:04,560][26022] Updated weights on worker 0-0, policy_version 641932 (0.00097) [2022-07-10 08:17:05,191][25689] Fps is (10 sec: 5379.0, 60 sec: 5505.0, 300 sec: 5491.8). Total num frames: 657342464. Throughput: 0: 5658.4. Samples: 657341632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:05,191][25689] Avg episode reward: [(0, '-4.403')] [2022-07-10 08:17:06,572][26022] Updated weights on worker 0-0, policy_version 641942 (0.00092) [2022-07-10 08:17:08,204][26022] Updated weights on worker 0-0, policy_version 641952 (0.00093) [2022-07-10 08:17:10,118][26022] Updated weights on worker 0-0, policy_version 641962 (0.00081) [2022-07-10 08:17:10,214][25689] Fps is (10 sec: 5485.4, 60 sec: 5505.8, 300 sec: 5495.0). Total num frames: 657370112. Throughput: 0: 5677.2. Samples: 657375060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:10,216][25689] Avg episode reward: [(0, '-4.432')] [2022-07-10 08:17:12,090][26022] Updated weights on worker 0-0, policy_version 641972 (0.00091) [2022-07-10 08:17:13,717][26022] Updated weights on worker 0-0, policy_version 641982 (0.00086) [2022-07-10 08:17:15,226][25689] Fps is (10 sec: 5408.1, 60 sec: 5509.0, 300 sec: 5489.4). Total num frames: 657396736. Throughput: 0: 4864.8. Samples: 657391856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:15,227][25689] Avg episode reward: [(0, '-6.277')] [2022-07-10 08:17:15,716][26022] Updated weights on worker 0-0, policy_version 641992 (0.00087) [2022-07-10 08:17:17,555][26022] Updated weights on worker 0-0, policy_version 642002 (0.00093) [2022-07-10 08:17:19,489][26022] Updated weights on worker 0-0, policy_version 642012 (0.01107) [2022-07-10 08:17:20,275][25689] Fps is (10 sec: 5394.5, 60 sec: 5490.6, 300 sec: 5489.4). Total num frames: 657424384. Throughput: 0: 5678.5. Samples: 657424662. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:20,276][25689] Avg episode reward: [(0, '-7.529')] [2022-07-10 08:17:21,437][26022] Updated weights on worker 0-0, policy_version 642022 (0.00084) [2022-07-10 08:17:23,230][26022] Updated weights on worker 0-0, policy_version 642032 (0.00097) [2022-07-10 08:17:24,918][26022] Updated weights on worker 0-0, policy_version 642042 (0.00090) [2022-07-10 08:17:25,376][25689] Fps is (10 sec: 5548.9, 60 sec: 5522.5, 300 sec: 5487.7). Total num frames: 657453056. Throughput: 0: 5752.7. Samples: 657457750. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:25,376][25689] Avg episode reward: [(0, '-7.642')] [2022-07-10 08:17:26,905][26022] Updated weights on worker 0-0, policy_version 642052 (0.00084) [2022-07-10 08:17:28,869][26022] Updated weights on worker 0-0, policy_version 642062 (0.00087) [2022-07-10 08:17:30,399][25689] Fps is (10 sec: 5563.4, 60 sec: 5504.4, 300 sec: 5487.5). Total num frames: 657480704. Throughput: 0: 4910.1. Samples: 657474166. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:30,399][25689] Avg episode reward: [(0, '-8.560')] [2022-07-10 08:17:30,589][26022] Updated weights on worker 0-0, policy_version 642072 (0.00684) [2022-07-10 08:17:32,403][26022] Updated weights on worker 0-0, policy_version 642082 (0.00086) [2022-07-10 08:17:34,273][26022] Updated weights on worker 0-0, policy_version 642092 (0.00085) [2022-07-10 08:17:35,417][25689] Fps is (10 sec: 5404.9, 60 sec: 5459.6, 300 sec: 5488.0). Total num frames: 657507328. Throughput: 0: 5715.9. Samples: 657507266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:35,418][25689] Avg episode reward: [(0, '-8.944')] [2022-07-10 08:17:36,175][26022] Updated weights on worker 0-0, policy_version 642102 (0.00080) [2022-07-10 08:17:38,175][26022] Updated weights on worker 0-0, policy_version 642112 (0.00095) [2022-07-10 08:17:39,833][26022] Updated weights on worker 0-0, policy_version 642122 (0.00089) [2022-07-10 08:17:40,439][25689] Fps is (10 sec: 5507.2, 60 sec: 5498.4, 300 sec: 5488.9). Total num frames: 657536000. Throughput: 0: 5739.2. Samples: 657540388. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:40,440][25689] Avg episode reward: [(0, '-6.684')] [2022-07-10 08:17:41,907][26022] Updated weights on worker 0-0, policy_version 642132 (0.00091) [2022-07-10 08:17:43,632][26022] Updated weights on worker 0-0, policy_version 642142 (0.00089) [2022-07-10 08:17:45,415][26022] Updated weights on worker 0-0, policy_version 642152 (0.00091) [2022-07-10 08:17:45,490][25689] Fps is (10 sec: 5591.0, 60 sec: 5501.3, 300 sec: 5488.6). Total num frames: 657563648. Throughput: 0: 4937.0. Samples: 657557056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:45,491][25689] Avg episode reward: [(0, '-5.099')] [2022-07-10 08:17:47,451][26022] Updated weights on worker 0-0, policy_version 642162 (0.00083) [2022-07-10 08:17:48,944][26022] Updated weights on worker 0-0, policy_version 642172 (0.00090) [2022-07-10 08:17:50,526][25689] Fps is (10 sec: 5380.4, 60 sec: 5467.3, 300 sec: 5488.3). Total num frames: 657590272. Throughput: 0: 5766.6. Samples: 657590234. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:50,527][25689] Avg episode reward: [(0, '-4.739')] [2022-07-10 08:17:51,081][26022] Updated weights on worker 0-0, policy_version 642182 (0.00090) [2022-07-10 08:17:52,689][26022] Updated weights on worker 0-0, policy_version 642192 (0.00103) [2022-07-10 08:17:54,755][26022] Updated weights on worker 0-0, policy_version 642202 (0.00090) [2022-07-10 08:17:55,537][25689] Fps is (10 sec: 5606.0, 60 sec: 5489.8, 300 sec: 5488.7). Total num frames: 657619968. Throughput: 0: 5772.1. Samples: 657623398. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:17:55,537][25689] Avg episode reward: [(0, '-3.931')] [2022-07-10 08:17:56,746][26022] Updated weights on worker 0-0, policy_version 642212 (0.00083) [2022-07-10 08:17:58,448][26022] Updated weights on worker 0-0, policy_version 642222 (0.00090) [2022-07-10 08:18:00,517][26022] Updated weights on worker 0-0, policy_version 642232 (0.00094) [2022-07-10 08:18:00,550][25689] Fps is (10 sec: 5516.4, 60 sec: 5474.5, 300 sec: 5490.8). Total num frames: 657645568. Throughput: 0: 4941.7. Samples: 657639770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:00,551][25689] Avg episode reward: [(0, '-3.845')] [2022-07-10 08:18:02,705][26022] Updated weights on worker 0-0, policy_version 642242 (0.00088) [2022-07-10 08:18:04,373][26022] Updated weights on worker 0-0, policy_version 642252 (0.00084) [2022-07-10 08:18:05,586][25689] Fps is (10 sec: 5196.6, 60 sec: 5459.5, 300 sec: 5483.8). Total num frames: 657672192. Throughput: 0: 5653.7. Samples: 657670674. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:05,587][25689] Avg episode reward: [(0, '-3.642')] [2022-07-10 08:18:06,296][26022] Updated weights on worker 0-0, policy_version 642262 (0.00988) [2022-07-10 08:18:08,154][26022] Updated weights on worker 0-0, policy_version 642272 (0.00088) [2022-07-10 08:18:09,997][26022] Updated weights on worker 0-0, policy_version 642282 (0.00088) [2022-07-10 08:18:10,591][25689] Fps is (10 sec: 5303.3, 60 sec: 5444.3, 300 sec: 5477.4). Total num frames: 657698816. Throughput: 0: 5664.0. Samples: 657703880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:10,591][25689] Avg episode reward: [(0, '-3.888')] [2022-07-10 08:18:11,926][26022] Updated weights on worker 0-0, policy_version 642292 (0.00098) [2022-07-10 08:18:13,621][26022] Updated weights on worker 0-0, policy_version 642302 (0.00086) [2022-07-10 08:18:15,596][25689] Fps is (10 sec: 5422.2, 60 sec: 5461.8, 300 sec: 5481.0). Total num frames: 657726464. Throughput: 0: 4839.8. Samples: 657720480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:15,596][25689] Avg episode reward: [(0, '-4.742')] [2022-07-10 08:18:15,633][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:18:15,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000642312_657727488.pth [2022-07-10 08:18:15,647][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000640381_655750144.pth [2022-07-10 08:18:15,652][26022] Updated weights on worker 0-0, policy_version 642312 (0.00093) [2022-07-10 08:18:17,301][26022] Updated weights on worker 0-0, policy_version 642322 (0.00094) [2022-07-10 08:18:19,540][26022] Updated weights on worker 0-0, policy_version 642332 (0.00698) [2022-07-10 08:18:20,604][25689] Fps is (10 sec: 5624.3, 60 sec: 5482.4, 300 sec: 5486.6). Total num frames: 657755136. Throughput: 0: 5673.7. Samples: 657753552. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:20,605][25689] Avg episode reward: [(0, '-4.387')] [2022-07-10 08:18:20,996][26022] Updated weights on worker 0-0, policy_version 642342 (0.00082) [2022-07-10 08:18:23,312][26022] Updated weights on worker 0-0, policy_version 642352 (0.00089) [2022-07-10 08:18:24,756][26022] Updated weights on worker 0-0, policy_version 642362 (0.00089) [2022-07-10 08:18:25,731][25689] Fps is (10 sec: 5556.9, 60 sec: 5463.1, 300 sec: 5484.6). Total num frames: 657782784. Throughput: 0: 5770.9. Samples: 657786926. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:25,731][25689] Avg episode reward: [(0, '-5.141')] [2022-07-10 08:18:26,812][26022] Updated weights on worker 0-0, policy_version 642372 (0.00082) [2022-07-10 08:18:28,590][26022] Updated weights on worker 0-0, policy_version 642382 (0.00089) [2022-07-10 08:18:30,553][26022] Updated weights on worker 0-0, policy_version 642392 (0.00086) [2022-07-10 08:18:30,737][25689] Fps is (10 sec: 5457.2, 60 sec: 5464.6, 300 sec: 5477.8). Total num frames: 657810432. Throughput: 0: 5775.4. Samples: 657820234. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:30,738][25689] Avg episode reward: [(0, '-3.292')] [2022-07-10 08:18:32,224][26022] Updated weights on worker 0-0, policy_version 642402 (0.00085) [2022-07-10 08:18:34,087][26022] Updated weights on worker 0-0, policy_version 642412 (0.00085) [2022-07-10 08:18:35,755][25689] Fps is (10 sec: 5618.6, 60 sec: 5498.7, 300 sec: 5484.9). Total num frames: 657839104. Throughput: 0: 5781.0. Samples: 657837018. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:35,755][25689] Avg episode reward: [(0, '-3.729')] [2022-07-10 08:18:35,862][26022] Updated weights on worker 0-0, policy_version 642422 (0.00088) [2022-07-10 08:18:37,984][26022] Updated weights on worker 0-0, policy_version 642432 (0.00095) [2022-07-10 08:18:39,509][26022] Updated weights on worker 0-0, policy_version 642442 (0.00086) [2022-07-10 08:18:40,767][25689] Fps is (10 sec: 5615.4, 60 sec: 5482.6, 300 sec: 5482.5). Total num frames: 657866752. Throughput: 0: 5788.6. Samples: 657870264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:40,767][25689] Avg episode reward: [(0, '-4.359')] [2022-07-10 08:18:41,671][26022] Updated weights on worker 0-0, policy_version 642452 (0.00085) [2022-07-10 08:18:43,331][26022] Updated weights on worker 0-0, policy_version 642462 (0.00093) [2022-07-10 08:18:45,195][26022] Updated weights on worker 0-0, policy_version 642472 (0.00086) [2022-07-10 08:18:45,832][25689] Fps is (10 sec: 5487.0, 60 sec: 5481.3, 300 sec: 5481.5). Total num frames: 657894400. Throughput: 0: 5794.4. Samples: 657903402. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:45,833][25689] Avg episode reward: [(0, '-5.192')] [2022-07-10 08:18:47,100][26022] Updated weights on worker 0-0, policy_version 642482 (0.00091) [2022-07-10 08:18:49,106][26022] Updated weights on worker 0-0, policy_version 642492 (0.00092) [2022-07-10 08:18:50,736][26022] Updated weights on worker 0-0, policy_version 642502 (0.00091) [2022-07-10 08:18:50,867][25689] Fps is (10 sec: 5474.9, 60 sec: 5498.4, 300 sec: 5484.9). Total num frames: 657922048. Throughput: 0: 4948.0. Samples: 657919834. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:50,867][25689] Avg episode reward: [(0, '-5.648')] [2022-07-10 08:18:52,893][26022] Updated weights on worker 0-0, policy_version 642512 (0.00093) [2022-07-10 08:18:54,435][26022] Updated weights on worker 0-0, policy_version 642522 (0.00086) [2022-07-10 08:18:55,891][25689] Fps is (10 sec: 5497.4, 60 sec: 5463.2, 300 sec: 5484.5). Total num frames: 657949696. Throughput: 0: 5752.2. Samples: 657952846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:18:55,891][25689] Avg episode reward: [(0, '-5.565')] [2022-07-10 08:18:56,533][26022] Updated weights on worker 0-0, policy_version 642532 (0.00082) [2022-07-10 08:18:58,283][26022] Updated weights on worker 0-0, policy_version 642542 (0.00095) [2022-07-10 08:19:00,007][26022] Updated weights on worker 0-0, policy_version 642552 (0.00086) [2022-07-10 08:19:00,922][25689] Fps is (10 sec: 5499.4, 60 sec: 5495.6, 300 sec: 5491.6). Total num frames: 657977344. Throughput: 0: 5762.7. Samples: 657986410. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:00,922][25689] Avg episode reward: [(0, '-5.875')] [2022-07-10 08:19:01,998][26022] Updated weights on worker 0-0, policy_version 642562 (0.00092) [2022-07-10 08:19:03,996][26022] Updated weights on worker 0-0, policy_version 642572 (0.00089) [2022-07-10 08:19:05,954][26022] Updated weights on worker 0-0, policy_version 642582 (0.00081) [2022-07-10 08:19:05,990][25689] Fps is (10 sec: 5475.3, 60 sec: 5509.6, 300 sec: 5487.6). Total num frames: 658004992. Throughput: 0: 4830.8. Samples: 658000780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:05,990][25689] Avg episode reward: [(0, '-5.478')] [2022-07-10 08:19:07,955][26022] Updated weights on worker 0-0, policy_version 642592 (0.00090) [2022-07-10 08:19:09,541][26022] Updated weights on worker 0-0, policy_version 642602 (0.00089) [2022-07-10 08:19:11,017][25689] Fps is (10 sec: 5375.9, 60 sec: 5507.5, 300 sec: 5483.8). Total num frames: 658031616. Throughput: 0: 5672.3. Samples: 658034134. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:11,018][25689] Avg episode reward: [(0, '-5.160')] [2022-07-10 08:19:11,551][26022] Updated weights on worker 0-0, policy_version 642612 (0.00092) [2022-07-10 08:19:13,365][26022] Updated weights on worker 0-0, policy_version 642622 (0.00088) [2022-07-10 08:19:15,126][26022] Updated weights on worker 0-0, policy_version 642632 (0.00088) [2022-07-10 08:19:16,038][25689] Fps is (10 sec: 5503.0, 60 sec: 5523.0, 300 sec: 5486.9). Total num frames: 658060288. Throughput: 0: 5683.8. Samples: 658067360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:16,039][25689] Avg episode reward: [(0, '-5.678')] [2022-07-10 08:19:17,182][26022] Updated weights on worker 0-0, policy_version 642642 (0.00084) [2022-07-10 08:19:18,859][26022] Updated weights on worker 0-0, policy_version 642652 (0.00095) [2022-07-10 08:19:20,989][26022] Updated weights on worker 0-0, policy_version 642662 (0.00089) [2022-07-10 08:19:21,089][25689] Fps is (10 sec: 5388.7, 60 sec: 5468.4, 300 sec: 5478.9). Total num frames: 658085888. Throughput: 0: 4828.4. Samples: 658083784. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:21,089][25689] Avg episode reward: [(0, '-6.251')] [2022-07-10 08:19:22,556][26022] Updated weights on worker 0-0, policy_version 642672 (0.00104) [2022-07-10 08:19:24,510][26022] Updated weights on worker 0-0, policy_version 642682 (0.00088) [2022-07-10 08:19:26,158][25689] Fps is (10 sec: 5362.8, 60 sec: 5490.5, 300 sec: 5481.2). Total num frames: 658114560. Throughput: 0: 5761.3. Samples: 658116976. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:26,159][25689] Avg episode reward: [(0, '-7.137')] [2022-07-10 08:19:26,355][26022] Updated weights on worker 0-0, policy_version 642692 (0.00092) [2022-07-10 08:19:28,435][26022] Updated weights on worker 0-0, policy_version 642702 (0.00087) [2022-07-10 08:19:30,276][26022] Updated weights on worker 0-0, policy_version 642712 (0.00090) [2022-07-10 08:19:31,230][25689] Fps is (10 sec: 5553.4, 60 sec: 5484.5, 300 sec: 5484.9). Total num frames: 658142208. Throughput: 0: 5720.9. Samples: 658149770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:31,231][25689] Avg episode reward: [(0, '-6.621')] [2022-07-10 08:19:32,256][26022] Updated weights on worker 0-0, policy_version 642722 (0.00094) [2022-07-10 08:19:33,673][26022] Updated weights on worker 0-0, policy_version 642732 (0.00091) [2022-07-10 08:19:35,555][26022] Updated weights on worker 0-0, policy_version 642742 (0.00090) [2022-07-10 08:19:36,244][25689] Fps is (10 sec: 5584.3, 60 sec: 5484.9, 300 sec: 5485.3). Total num frames: 658170880. Throughput: 0: 4905.4. Samples: 658166476. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:36,244][25689] Avg episode reward: [(0, '-5.659')] [2022-07-10 08:19:37,621][26022] Updated weights on worker 0-0, policy_version 642752 (0.00088) [2022-07-10 08:19:39,423][26022] Updated weights on worker 0-0, policy_version 642762 (0.00081) [2022-07-10 08:19:41,248][25689] Fps is (10 sec: 5519.6, 60 sec: 5468.6, 300 sec: 5484.4). Total num frames: 658197504. Throughput: 0: 5763.5. Samples: 658199974. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:41,249][25689] Avg episode reward: [(0, '-5.633')] [2022-07-10 08:19:41,278][26022] Updated weights on worker 0-0, policy_version 642772 (0.00095) [2022-07-10 08:19:43,004][26022] Updated weights on worker 0-0, policy_version 642782 (0.00109) [2022-07-10 08:19:44,819][26022] Updated weights on worker 0-0, policy_version 642792 (0.00083) [2022-07-10 08:19:46,303][25689] Fps is (10 sec: 5497.2, 60 sec: 5486.6, 300 sec: 5487.0). Total num frames: 658226176. Throughput: 0: 5759.8. Samples: 658233004. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:46,303][25689] Avg episode reward: [(0, '-5.925')] [2022-07-10 08:19:46,960][26022] Updated weights on worker 0-0, policy_version 642802 (0.00086) [2022-07-10 08:19:48,533][26022] Updated weights on worker 0-0, policy_version 642812 (0.00093) [2022-07-10 08:19:50,658][26022] Updated weights on worker 0-0, policy_version 642822 (0.00085) [2022-07-10 08:19:51,313][25689] Fps is (10 sec: 5493.9, 60 sec: 5471.8, 300 sec: 5483.8). Total num frames: 658252800. Throughput: 0: 4968.9. Samples: 658249560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:51,315][25689] Avg episode reward: [(0, '-3.771')] [2022-07-10 08:19:52,302][26022] Updated weights on worker 0-0, policy_version 642832 (0.00089) [2022-07-10 08:19:54,287][26022] Updated weights on worker 0-0, policy_version 642842 (0.00086) [2022-07-10 08:19:56,024][26022] Updated weights on worker 0-0, policy_version 642852 (0.00091) [2022-07-10 08:19:56,322][25689] Fps is (10 sec: 5519.2, 60 sec: 5490.2, 300 sec: 5487.4). Total num frames: 658281472. Throughput: 0: 5791.8. Samples: 658282762. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:19:56,322][25689] Avg episode reward: [(0, '-3.470')] [2022-07-10 08:19:57,876][26022] Updated weights on worker 0-0, policy_version 642862 (0.00099) [2022-07-10 08:19:59,828][26022] Updated weights on worker 0-0, policy_version 642872 (0.00085) [2022-07-10 08:20:01,342][25689] Fps is (10 sec: 5513.5, 60 sec: 5474.1, 300 sec: 5488.6). Total num frames: 658308096. Throughput: 0: 5773.0. Samples: 658315978. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:01,343][25689] Avg episode reward: [(0, '-5.199')] [2022-07-10 08:20:01,759][26022] Updated weights on worker 0-0, policy_version 642882 (0.00088) [2022-07-10 08:20:03,889][26022] Updated weights on worker 0-0, policy_version 642892 (0.00095) [2022-07-10 08:20:05,879][26022] Updated weights on worker 0-0, policy_version 642902 (0.00094) [2022-07-10 08:20:06,468][25689] Fps is (10 sec: 5248.1, 60 sec: 5452.1, 300 sec: 5479.9). Total num frames: 658334720. Throughput: 0: 4825.3. Samples: 658330306. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:06,468][25689] Avg episode reward: [(0, '-5.967')] [2022-07-10 08:20:07,663][26022] Updated weights on worker 0-0, policy_version 642912 (0.00085) [2022-07-10 08:20:09,563][26022] Updated weights on worker 0-0, policy_version 642922 (0.00084) [2022-07-10 08:20:11,444][26022] Updated weights on worker 0-0, policy_version 642932 (0.00085) [2022-07-10 08:20:11,533][25689] Fps is (10 sec: 5426.3, 60 sec: 5482.5, 300 sec: 5489.9). Total num frames: 658363392. Throughput: 0: 5621.7. Samples: 658363228. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:11,533][25689] Avg episode reward: [(0, '-5.995')] [2022-07-10 08:20:13,283][26022] Updated weights on worker 0-0, policy_version 642942 (0.00084) [2022-07-10 08:20:15,025][26022] Updated weights on worker 0-0, policy_version 642952 (0.00084) [2022-07-10 08:20:15,824][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:20:15,837][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000642956_658386944.pth [2022-07-10 08:20:15,837][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000641027_656411648.pth [2022-07-10 08:20:16,578][25689] Fps is (10 sec: 5469.1, 60 sec: 5446.4, 300 sec: 5471.9). Total num frames: 658390016. Throughput: 0: 5614.7. Samples: 658396498. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:16,579][25689] Avg episode reward: [(0, '-5.291')] [2022-07-10 08:20:17,004][26022] Updated weights on worker 0-0, policy_version 642962 (0.00094) [2022-07-10 08:20:18,587][26022] Updated weights on worker 0-0, policy_version 642972 (0.00095) [2022-07-10 08:20:20,747][26022] Updated weights on worker 0-0, policy_version 642982 (0.00093) [2022-07-10 08:20:21,581][25689] Fps is (10 sec: 5604.8, 60 sec: 5518.4, 300 sec: 5490.6). Total num frames: 658419712. Throughput: 0: 4789.9. Samples: 658412920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:21,582][25689] Avg episode reward: [(0, '-5.573')] [2022-07-10 08:20:22,480][26022] Updated weights on worker 0-0, policy_version 642992 (0.00090) [2022-07-10 08:20:24,281][26022] Updated weights on worker 0-0, policy_version 643002 (0.00086) [2022-07-10 08:20:26,189][26022] Updated weights on worker 0-0, policy_version 643012 (0.00094) [2022-07-10 08:20:26,671][25689] Fps is (10 sec: 5580.4, 60 sec: 5482.8, 300 sec: 5479.0). Total num frames: 658446336. Throughput: 0: 5737.3. Samples: 658446218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:26,671][25689] Avg episode reward: [(0, '-6.893')] [2022-07-10 08:20:28,091][26022] Updated weights on worker 0-0, policy_version 643022 (0.00094) [2022-07-10 08:20:29,932][26022] Updated weights on worker 0-0, policy_version 643032 (0.00094) [2022-07-10 08:20:31,764][25689] Fps is (10 sec: 5330.0, 60 sec: 5480.9, 300 sec: 5479.1). Total num frames: 658473984. Throughput: 0: 5744.6. Samples: 658479446. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:31,764][25689] Avg episode reward: [(0, '-5.543')] [2022-07-10 08:20:31,875][26022] Updated weights on worker 0-0, policy_version 643042 (0.00093) [2022-07-10 08:20:33,665][26022] Updated weights on worker 0-0, policy_version 643052 (0.00087) [2022-07-10 08:20:35,477][26022] Updated weights on worker 0-0, policy_version 643062 (0.00086) [2022-07-10 08:20:36,800][25689] Fps is (10 sec: 5560.3, 60 sec: 5478.8, 300 sec: 5485.5). Total num frames: 658502656. Throughput: 0: 4929.4. Samples: 658496178. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:36,801][25689] Avg episode reward: [(0, '-4.361')] [2022-07-10 08:20:37,346][26022] Updated weights on worker 0-0, policy_version 643072 (0.00091) [2022-07-10 08:20:39,132][26022] Updated weights on worker 0-0, policy_version 643082 (0.00082) [2022-07-10 08:20:41,208][26022] Updated weights on worker 0-0, policy_version 643092 (0.00093) [2022-07-10 08:20:41,821][25689] Fps is (10 sec: 5599.9, 60 sec: 5494.2, 300 sec: 5483.9). Total num frames: 658530304. Throughput: 0: 5763.5. Samples: 658529572. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:41,821][25689] Avg episode reward: [(0, '-3.832')] [2022-07-10 08:20:42,803][26022] Updated weights on worker 0-0, policy_version 643102 (0.00102) [2022-07-10 08:20:44,960][26022] Updated weights on worker 0-0, policy_version 643112 (0.00089) [2022-07-10 08:20:46,531][26022] Updated weights on worker 0-0, policy_version 643122 (0.00087) [2022-07-10 08:20:46,884][25689] Fps is (10 sec: 5483.2, 60 sec: 5476.5, 300 sec: 5479.4). Total num frames: 658557952. Throughput: 0: 5762.9. Samples: 658562706. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:46,887][25689] Avg episode reward: [(0, '-2.853')] [2022-07-10 08:20:48,523][26022] Updated weights on worker 0-0, policy_version 643132 (0.00092) [2022-07-10 08:20:50,349][26022] Updated weights on worker 0-0, policy_version 643142 (0.00087) [2022-07-10 08:20:51,918][25689] Fps is (10 sec: 5476.5, 60 sec: 5491.3, 300 sec: 5479.3). Total num frames: 658585600. Throughput: 0: 4958.5. Samples: 658579382. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:51,920][25689] Avg episode reward: [(0, '-3.111')] [2022-07-10 08:20:52,352][26022] Updated weights on worker 0-0, policy_version 643152 (0.00089) [2022-07-10 08:20:53,913][26022] Updated weights on worker 0-0, policy_version 643162 (0.00087) [2022-07-10 08:20:56,036][26022] Updated weights on worker 0-0, policy_version 643172 (0.00091) [2022-07-10 08:20:56,974][25689] Fps is (10 sec: 5582.2, 60 sec: 5487.0, 300 sec: 5482.6). Total num frames: 658614272. Throughput: 0: 5773.1. Samples: 658612642. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:20:56,974][25689] Avg episode reward: [(0, '-3.082')] [2022-07-10 08:20:57,670][26022] Updated weights on worker 0-0, policy_version 643182 (0.00084) [2022-07-10 08:20:59,543][26022] Updated weights on worker 0-0, policy_version 643192 (0.00093) [2022-07-10 08:21:01,334][26022] Updated weights on worker 0-0, policy_version 643202 (0.00086) [2022-07-10 08:21:02,040][25689] Fps is (10 sec: 5462.8, 60 sec: 5482.9, 300 sec: 5486.8). Total num frames: 658640896. Throughput: 0: 5760.0. Samples: 658646036. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:21:02,041][25689] Avg episode reward: [(0, '-4.074')] [2022-07-10 08:21:03,751][26022] Updated weights on worker 0-0, policy_version 643212 (0.00088) [2022-07-10 08:21:05,412][26022] Updated weights on worker 0-0, policy_version 643222 (0.00086) [2022-07-10 08:21:07,117][25689] Fps is (10 sec: 5350.4, 60 sec: 5504.2, 300 sec: 5485.9). Total num frames: 658668544. Throughput: 0: 5677.6. Samples: 658677580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:21:07,118][25689] Avg episode reward: [(0, '-4.275')] [2022-07-10 08:21:07,350][26022] Updated weights on worker 0-0, policy_version 643232 (0.00090) [2022-07-10 08:21:08,890][26022] Updated weights on worker 0-0, policy_version 643242 (0.00099) [2022-07-10 08:21:10,988][26022] Updated weights on worker 0-0, policy_version 643252 (0.00614) [2022-07-10 08:21:12,121][25689] Fps is (10 sec: 5485.5, 60 sec: 5492.9, 300 sec: 5490.2). Total num frames: 658696192. Throughput: 0: 5689.9. Samples: 658694334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 08:21:12,124][25689] Avg episode reward: [(0, '-5.202')] [2022-07-10 08:21:12,672][26022] Updated weights on worker 0-0, policy_version 643262 (0.00092) [2022-07-10 08:21:14,835][26022] Updated weights on worker 0-0, policy_version 643272 (0.00101) [2022-07-10 08:21:16,448][26022] Updated weights on worker 0-0, policy_version 643282 (0.00091) [2022-07-10 08:21:17,125][25689] Fps is (10 sec: 5525.1, 60 sec: 5513.5, 300 sec: 5487.3). Total num frames: 658723840. Throughput: 0: 5686.8. Samples: 658727242. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:17,126][25689] Avg episode reward: [(0, '-5.620')] [2022-07-10 08:21:18,353][26022] Updated weights on worker 0-0, policy_version 643292 (0.00086) [2022-07-10 08:21:20,265][26022] Updated weights on worker 0-0, policy_version 643302 (0.00089) [2022-07-10 08:21:21,905][26022] Updated weights on worker 0-0, policy_version 643312 (0.00090) [2022-07-10 08:21:22,217][25689] Fps is (10 sec: 5578.5, 60 sec: 5488.6, 300 sec: 5493.9). Total num frames: 658752512. Throughput: 0: 5676.6. Samples: 658760568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:22,217][25689] Avg episode reward: [(0, '-6.890')] [2022-07-10 08:21:24,046][26022] Updated weights on worker 0-0, policy_version 643322 (0.00113) [2022-07-10 08:21:25,669][26022] Updated weights on worker 0-0, policy_version 643332 (0.00088) [2022-07-10 08:21:27,288][25689] Fps is (10 sec: 5441.2, 60 sec: 5490.2, 300 sec: 5485.9). Total num frames: 658779136. Throughput: 0: 4921.3. Samples: 658776846. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:27,288][25689] Avg episode reward: [(0, '-8.508')] [2022-07-10 08:21:27,847][26022] Updated weights on worker 0-0, policy_version 643342 (0.00095) [2022-07-10 08:21:29,456][26022] Updated weights on worker 0-0, policy_version 643352 (0.00096) [2022-07-10 08:21:31,425][26022] Updated weights on worker 0-0, policy_version 643362 (0.00079) [2022-07-10 08:21:32,332][25689] Fps is (10 sec: 5466.6, 60 sec: 5511.5, 300 sec: 5483.2). Total num frames: 658807808. Throughput: 0: 5724.3. Samples: 658810028. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:32,332][25689] Avg episode reward: [(0, '-9.044')] [2022-07-10 08:21:33,110][26022] Updated weights on worker 0-0, policy_version 643372 (0.00100) [2022-07-10 08:21:35,163][26022] Updated weights on worker 0-0, policy_version 643382 (0.00086) [2022-07-10 08:21:36,980][26022] Updated weights on worker 0-0, policy_version 643392 (0.00085) [2022-07-10 08:21:37,344][25689] Fps is (10 sec: 5702.1, 60 sec: 5513.7, 300 sec: 5491.3). Total num frames: 658836480. Throughput: 0: 5742.1. Samples: 658843342. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:37,345][25689] Avg episode reward: [(0, '-9.341')] [2022-07-10 08:21:38,706][26022] Updated weights on worker 0-0, policy_version 643402 (0.00085) [2022-07-10 08:21:40,783][26022] Updated weights on worker 0-0, policy_version 643412 (0.00090) [2022-07-10 08:21:42,371][25689] Fps is (10 sec: 5405.9, 60 sec: 5479.3, 300 sec: 5485.4). Total num frames: 658862080. Throughput: 0: 4929.7. Samples: 658859924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:42,372][25689] Avg episode reward: [(0, '-8.971')] [2022-07-10 08:21:42,458][26022] Updated weights on worker 0-0, policy_version 643422 (0.00086) [2022-07-10 08:21:44,223][26022] Updated weights on worker 0-0, policy_version 643432 (0.00088) [2022-07-10 08:21:46,163][26022] Updated weights on worker 0-0, policy_version 643442 (0.00085) [2022-07-10 08:21:47,466][25689] Fps is (10 sec: 5362.1, 60 sec: 5493.4, 300 sec: 5484.3). Total num frames: 658890752. Throughput: 0: 5764.6. Samples: 658893164. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:47,466][25689] Avg episode reward: [(0, '-9.359')] [2022-07-10 08:21:47,985][26022] Updated weights on worker 0-0, policy_version 643452 (0.00087) [2022-07-10 08:21:49,999][26022] Updated weights on worker 0-0, policy_version 643462 (0.00087) [2022-07-10 08:21:51,812][26022] Updated weights on worker 0-0, policy_version 643472 (0.00093) [2022-07-10 08:21:52,534][25689] Fps is (10 sec: 5642.6, 60 sec: 5507.2, 300 sec: 5484.4). Total num frames: 658919424. Throughput: 0: 5763.7. Samples: 658926468. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:52,535][25689] Avg episode reward: [(0, '-7.134')] [2022-07-10 08:21:53,566][26022] Updated weights on worker 0-0, policy_version 643482 (0.00091) [2022-07-10 08:21:55,511][26022] Updated weights on worker 0-0, policy_version 643492 (0.00086) [2022-07-10 08:21:57,163][26022] Updated weights on worker 0-0, policy_version 643502 (0.00082) [2022-07-10 08:21:57,585][25689] Fps is (10 sec: 5666.8, 60 sec: 5507.6, 300 sec: 5490.9). Total num frames: 658948096. Throughput: 0: 4922.6. Samples: 658942978. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:21:57,586][25689] Avg episode reward: [(0, '-5.763')] [2022-07-10 08:21:59,131][26022] Updated weights on worker 0-0, policy_version 643512 (0.00080) [2022-07-10 08:22:01,009][26022] Updated weights on worker 0-0, policy_version 643522 (0.00084) [2022-07-10 08:22:02,599][25689] Fps is (10 sec: 5290.4, 60 sec: 5478.6, 300 sec: 5481.4). Total num frames: 658972672. Throughput: 0: 5763.7. Samples: 658976512. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:02,600][25689] Avg episode reward: [(0, '-6.442')] [2022-07-10 08:22:03,225][26022] Updated weights on worker 0-0, policy_version 643532 (0.00097) [2022-07-10 08:22:05,059][26022] Updated weights on worker 0-0, policy_version 643542 (0.00091) [2022-07-10 08:22:06,853][26022] Updated weights on worker 0-0, policy_version 643552 (0.00086) [2022-07-10 08:22:07,649][25689] Fps is (10 sec: 5392.8, 60 sec: 5514.9, 300 sec: 5487.8). Total num frames: 659002368. Throughput: 0: 5687.9. Samples: 659007962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:07,649][25689] Avg episode reward: [(0, '-7.142')] [2022-07-10 08:22:08,700][26022] Updated weights on worker 0-0, policy_version 643562 (0.00088) [2022-07-10 08:22:10,696][26022] Updated weights on worker 0-0, policy_version 643572 (0.00090) [2022-07-10 08:22:12,489][26022] Updated weights on worker 0-0, policy_version 643582 (0.00094) [2022-07-10 08:22:12,655][25689] Fps is (10 sec: 5601.0, 60 sec: 5497.8, 300 sec: 5487.9). Total num frames: 659028992. Throughput: 0: 4882.8. Samples: 659024712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:12,655][25689] Avg episode reward: [(0, '-8.706')] [2022-07-10 08:22:14,201][26022] Updated weights on worker 0-0, policy_version 643592 (0.00087) [2022-07-10 08:22:16,092][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:22:16,103][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000643602_659048448.pth [2022-07-10 08:22:16,103][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000641669_657069056.pth [2022-07-10 08:22:16,106][26022] Updated weights on worker 0-0, policy_version 643602 (0.00095) [2022-07-10 08:22:17,660][25689] Fps is (10 sec: 5523.2, 60 sec: 5514.6, 300 sec: 5492.1). Total num frames: 659057664. Throughput: 0: 5743.7. Samples: 659058286. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:17,662][25689] Avg episode reward: [(0, '-8.619')] [2022-07-10 08:22:17,962][26022] Updated weights on worker 0-0, policy_version 643612 (0.00088) [2022-07-10 08:22:19,840][26022] Updated weights on worker 0-0, policy_version 643622 (0.00092) [2022-07-10 08:22:21,572][26022] Updated weights on worker 0-0, policy_version 643632 (0.00090) [2022-07-10 08:22:22,667][25689] Fps is (10 sec: 5523.0, 60 sec: 5488.4, 300 sec: 5487.0). Total num frames: 659084288. Throughput: 0: 5737.0. Samples: 659091638. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:22,668][25689] Avg episode reward: [(0, '-9.071')] [2022-07-10 08:22:23,268][26022] Updated weights on worker 0-0, policy_version 643642 (0.00090) [2022-07-10 08:22:25,334][26022] Updated weights on worker 0-0, policy_version 643652 (0.00083) [2022-07-10 08:22:27,258][26022] Updated weights on worker 0-0, policy_version 643662 (0.00085) [2022-07-10 08:22:27,711][25689] Fps is (10 sec: 5399.9, 60 sec: 5507.8, 300 sec: 5486.6). Total num frames: 659111936. Throughput: 0: 4992.3. Samples: 659108118. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:27,711][25689] Avg episode reward: [(0, '-8.915')] [2022-07-10 08:22:29,052][26022] Updated weights on worker 0-0, policy_version 643672 (0.00091) [2022-07-10 08:22:30,843][26022] Updated weights on worker 0-0, policy_version 643682 (0.00090) [2022-07-10 08:22:32,725][25689] Fps is (10 sec: 5497.3, 60 sec: 5493.6, 300 sec: 5490.1). Total num frames: 659139584. Throughput: 0: 5812.6. Samples: 659141376. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:32,727][25689] Avg episode reward: [(0, '-6.534')] [2022-07-10 08:22:32,772][26022] Updated weights on worker 0-0, policy_version 643692 (0.00086) [2022-07-10 08:22:34,811][26022] Updated weights on worker 0-0, policy_version 643702 (0.00091) [2022-07-10 08:22:36,365][26022] Updated weights on worker 0-0, policy_version 643712 (0.00085) [2022-07-10 08:22:37,758][25689] Fps is (10 sec: 5503.5, 60 sec: 5474.7, 300 sec: 5486.5). Total num frames: 659167232. Throughput: 0: 5795.0. Samples: 659174754. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:37,759][25689] Avg episode reward: [(0, '-5.969')] [2022-07-10 08:22:38,341][26022] Updated weights on worker 0-0, policy_version 643722 (0.00091) [2022-07-10 08:22:40,007][26022] Updated weights on worker 0-0, policy_version 643732 (0.00083) [2022-07-10 08:22:41,816][26022] Updated weights on worker 0-0, policy_version 643742 (0.00089) [2022-07-10 08:22:42,772][25689] Fps is (10 sec: 5606.1, 60 sec: 5526.9, 300 sec: 5490.6). Total num frames: 659195904. Throughput: 0: 4963.2. Samples: 659191424. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:42,772][25689] Avg episode reward: [(0, '-4.776')] [2022-07-10 08:22:43,706][26022] Updated weights on worker 0-0, policy_version 643752 (0.00092) [2022-07-10 08:22:45,718][26022] Updated weights on worker 0-0, policy_version 643762 (0.00102) [2022-07-10 08:22:47,594][26022] Updated weights on worker 0-0, policy_version 643772 (0.00084) [2022-07-10 08:22:47,831][25689] Fps is (10 sec: 5591.6, 60 sec: 5513.1, 300 sec: 5493.6). Total num frames: 659223552. Throughput: 0: 5783.1. Samples: 659224472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:47,831][25689] Avg episode reward: [(0, '-4.912')] [2022-07-10 08:22:49,465][26022] Updated weights on worker 0-0, policy_version 643782 (0.00084) [2022-07-10 08:22:51,243][26022] Updated weights on worker 0-0, policy_version 643792 (0.00099) [2022-07-10 08:22:52,883][25689] Fps is (10 sec: 5671.1, 60 sec: 5531.5, 300 sec: 5492.9). Total num frames: 659253248. Throughput: 0: 5789.3. Samples: 659258076. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:52,884][25689] Avg episode reward: [(0, '-5.800')] [2022-07-10 08:22:52,891][26022] Updated weights on worker 0-0, policy_version 643802 (0.00090) [2022-07-10 08:22:55,006][26022] Updated weights on worker 0-0, policy_version 643812 (0.00093) [2022-07-10 08:22:56,624][26022] Updated weights on worker 0-0, policy_version 643822 (0.00090) [2022-07-10 08:22:57,907][25689] Fps is (10 sec: 5589.3, 60 sec: 5500.1, 300 sec: 5496.1). Total num frames: 659279872. Throughput: 0: 4955.2. Samples: 659274594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:22:57,908][25689] Avg episode reward: [(0, '-5.734')] [2022-07-10 08:22:58,716][26022] Updated weights on worker 0-0, policy_version 643832 (0.00088) [2022-07-10 08:23:00,344][26022] Updated weights on worker 0-0, policy_version 643842 (0.00085) [2022-07-10 08:23:02,439][26022] Updated weights on worker 0-0, policy_version 643852 (0.00091) [2022-07-10 08:23:02,960][25689] Fps is (10 sec: 5182.8, 60 sec: 5513.5, 300 sec: 5492.4). Total num frames: 659305472. Throughput: 0: 5763.4. Samples: 659307776. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:02,960][25689] Avg episode reward: [(0, '-6.763')] [2022-07-10 08:23:04,607][26022] Updated weights on worker 0-0, policy_version 643862 (0.00089) [2022-07-10 08:23:06,284][26022] Updated weights on worker 0-0, policy_version 643872 (0.00087) [2022-07-10 08:23:07,997][25689] Fps is (10 sec: 5277.3, 60 sec: 5480.7, 300 sec: 5495.2). Total num frames: 659333120. Throughput: 0: 5678.6. Samples: 659338988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:07,998][25689] Avg episode reward: [(0, '-7.938')] [2022-07-10 08:23:08,360][26022] Updated weights on worker 0-0, policy_version 643882 (0.00094) [2022-07-10 08:23:10,004][26022] Updated weights on worker 0-0, policy_version 643892 (0.00093) [2022-07-10 08:23:11,800][26022] Updated weights on worker 0-0, policy_version 643902 (0.00089) [2022-07-10 08:23:13,085][25689] Fps is (10 sec: 5663.1, 60 sec: 5524.1, 300 sec: 5500.5). Total num frames: 659362816. Throughput: 0: 4834.3. Samples: 659355740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:13,086][25689] Avg episode reward: [(0, '-7.450')] [2022-07-10 08:23:13,735][26022] Updated weights on worker 0-0, policy_version 643912 (0.00104) [2022-07-10 08:23:15,462][26022] Updated weights on worker 0-0, policy_version 643922 (0.00085) [2022-07-10 08:23:17,488][26022] Updated weights on worker 0-0, policy_version 643932 (0.00094) [2022-07-10 08:23:18,131][25689] Fps is (10 sec: 5557.9, 60 sec: 5486.6, 300 sec: 5493.0). Total num frames: 659389440. Throughput: 0: 5670.0. Samples: 659389260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:18,131][25689] Avg episode reward: [(0, '-7.837')] [2022-07-10 08:23:19,225][26022] Updated weights on worker 0-0, policy_version 643942 (0.00092) [2022-07-10 08:23:21,007][26022] Updated weights on worker 0-0, policy_version 643952 (0.00087) [2022-07-10 08:23:22,925][26022] Updated weights on worker 0-0, policy_version 643962 (0.00268) [2022-07-10 08:23:23,138][25689] Fps is (10 sec: 5602.8, 60 sec: 5537.3, 300 sec: 5502.1). Total num frames: 659419136. Throughput: 0: 5705.7. Samples: 659422906. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:23,138][25689] Avg episode reward: [(0, '-7.996')] [2022-07-10 08:23:24,492][26022] Updated weights on worker 0-0, policy_version 643972 (0.00094) [2022-07-10 08:23:26,606][26022] Updated weights on worker 0-0, policy_version 643982 (0.00088) [2022-07-10 08:23:28,209][25689] Fps is (10 sec: 5689.8, 60 sec: 5534.9, 300 sec: 5500.9). Total num frames: 659446784. Throughput: 0: 4969.0. Samples: 659439420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:28,209][25689] Avg episode reward: [(0, '-7.512')] [2022-07-10 08:23:28,496][26022] Updated weights on worker 0-0, policy_version 643992 (0.00090) [2022-07-10 08:23:30,397][26022] Updated weights on worker 0-0, policy_version 644002 (0.00091) [2022-07-10 08:23:32,214][26022] Updated weights on worker 0-0, policy_version 644012 (0.00087) [2022-07-10 08:23:33,253][25689] Fps is (10 sec: 5264.3, 60 sec: 5498.4, 300 sec: 5490.1). Total num frames: 659472384. Throughput: 0: 5780.7. Samples: 659472320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:33,253][25689] Avg episode reward: [(0, '-6.261')] [2022-07-10 08:23:33,983][26022] Updated weights on worker 0-0, policy_version 644022 (0.00084) [2022-07-10 08:23:36,032][26022] Updated weights on worker 0-0, policy_version 644032 (0.00086) [2022-07-10 08:23:37,636][26022] Updated weights on worker 0-0, policy_version 644042 (0.00085) [2022-07-10 08:23:38,281][25689] Fps is (10 sec: 5388.2, 60 sec: 5515.7, 300 sec: 5493.2). Total num frames: 659501056. Throughput: 0: 5774.3. Samples: 659505618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:38,282][25689] Avg episode reward: [(0, '-4.987')] [2022-07-10 08:23:39,680][26022] Updated weights on worker 0-0, policy_version 644052 (0.00086) [2022-07-10 08:23:41,561][26022] Updated weights on worker 0-0, policy_version 644062 (0.00106) [2022-07-10 08:23:43,207][26022] Updated weights on worker 0-0, policy_version 644072 (0.00084) [2022-07-10 08:23:43,295][25689] Fps is (10 sec: 5710.3, 60 sec: 5515.6, 300 sec: 5497.6). Total num frames: 659529728. Throughput: 0: 4933.2. Samples: 659522346. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:43,295][25689] Avg episode reward: [(0, '-3.972')] [2022-07-10 08:23:45,241][26022] Updated weights on worker 0-0, policy_version 644082 (0.00054) [2022-07-10 08:23:47,023][26022] Updated weights on worker 0-0, policy_version 644092 (0.00092) [2022-07-10 08:23:48,340][25689] Fps is (10 sec: 5599.2, 60 sec: 5516.9, 300 sec: 5497.4). Total num frames: 659557376. Throughput: 0: 5773.0. Samples: 659555636. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:48,340][25689] Avg episode reward: [(0, '-4.068')] [2022-07-10 08:23:48,782][26022] Updated weights on worker 0-0, policy_version 644102 (0.00084) [2022-07-10 08:23:50,819][26022] Updated weights on worker 0-0, policy_version 644112 (0.00096) [2022-07-10 08:23:52,576][26022] Updated weights on worker 0-0, policy_version 644122 (0.00098) [2022-07-10 08:23:53,351][25689] Fps is (10 sec: 5396.9, 60 sec: 5469.9, 300 sec: 5494.2). Total num frames: 659584000. Throughput: 0: 5770.6. Samples: 659588300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:53,351][25689] Avg episode reward: [(0, '-3.345')] [2022-07-10 08:23:54,616][26022] Updated weights on worker 0-0, policy_version 644132 (0.00095) [2022-07-10 08:23:56,352][26022] Updated weights on worker 0-0, policy_version 644142 (0.00087) [2022-07-10 08:23:58,243][26022] Updated weights on worker 0-0, policy_version 644152 (0.00099) [2022-07-10 08:23:58,358][25689] Fps is (10 sec: 5417.1, 60 sec: 5488.3, 300 sec: 5494.6). Total num frames: 659611648. Throughput: 0: 5768.5. Samples: 659621432. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:23:58,359][25689] Avg episode reward: [(0, '-3.758')] [2022-07-10 08:24:00,065][26022] Updated weights on worker 0-0, policy_version 644162 (0.00087) [2022-07-10 08:24:02,203][26022] Updated weights on worker 0-0, policy_version 644172 (0.00109) [2022-07-10 08:24:03,365][25689] Fps is (10 sec: 5419.8, 60 sec: 5509.5, 300 sec: 5492.3). Total num frames: 659638272. Throughput: 0: 5774.6. Samples: 659638242. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:24:03,367][25689] Avg episode reward: [(0, '-4.273')] [2022-07-10 08:24:04,212][26022] Updated weights on worker 0-0, policy_version 644182 (0.00102) [2022-07-10 08:24:06,001][26022] Updated weights on worker 0-0, policy_version 644192 (0.00087) [2022-07-10 08:24:07,964][26022] Updated weights on worker 0-0, policy_version 644202 (0.00085) [2022-07-10 08:24:08,440][25689] Fps is (10 sec: 5281.7, 60 sec: 5489.1, 300 sec: 5491.4). Total num frames: 659664896. Throughput: 0: 5648.3. Samples: 659669168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:24:08,441][25689] Avg episode reward: [(0, '-4.702')] [2022-07-10 08:24:09,796][26022] Updated weights on worker 0-0, policy_version 644212 (0.00085) [2022-07-10 08:24:11,741][26022] Updated weights on worker 0-0, policy_version 644222 (0.00093) [2022-07-10 08:24:13,523][25689] Fps is (10 sec: 5342.7, 60 sec: 5455.7, 300 sec: 5486.9). Total num frames: 659692544. Throughput: 0: 5682.4. Samples: 659702924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:24:13,523][25689] Avg episode reward: [(0, '-4.131')] [2022-07-10 08:24:13,542][26022] Updated weights on worker 0-0, policy_version 644232 (0.00092) [2022-07-10 08:24:15,327][26022] Updated weights on worker 0-0, policy_version 644242 (0.00089) [2022-07-10 08:24:16,368][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:24:16,382][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000644249_659710976.pth [2022-07-10 08:24:16,383][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000642312_657727488.pth [2022-07-10 08:24:17,237][26022] Updated weights on worker 0-0, policy_version 644252 (0.00091) [2022-07-10 08:24:18,603][25689] Fps is (10 sec: 5541.8, 60 sec: 5486.4, 300 sec: 5496.6). Total num frames: 659721216. Throughput: 0: 4847.2. Samples: 659719560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:24:18,603][25689] Avg episode reward: [(0, '-3.257')] [2022-07-10 08:24:19,022][26022] Updated weights on worker 0-0, policy_version 644262 (0.00087) [2022-07-10 08:24:20,897][26022] Updated weights on worker 0-0, policy_version 644272 (0.00087) [2022-07-10 08:24:22,595][26022] Updated weights on worker 0-0, policy_version 644282 (0.00085) [2022-07-10 08:24:23,623][25689] Fps is (10 sec: 5779.2, 60 sec: 5485.3, 300 sec: 5501.0). Total num frames: 659750912. Throughput: 0: 5663.9. Samples: 659752980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 08:24:23,623][25689] Avg episode reward: [(0, '-2.926')] [2022-07-10 08:24:24,706][26022] Updated weights on worker 0-0, policy_version 644292 (0.00091) [2022-07-10 08:24:26,284][26022] Updated weights on worker 0-0, policy_version 644302 (0.00115) [2022-07-10 08:24:28,503][26022] Updated weights on worker 0-0, policy_version 644312 (0.00061) [2022-07-10 08:24:28,711][25689] Fps is (10 sec: 5470.2, 60 sec: 5449.8, 300 sec: 5493.8). Total num frames: 659776512. Throughput: 0: 5752.5. Samples: 659785780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:28,712][25689] Avg episode reward: [(0, '-3.076')] [2022-07-10 08:24:30,007][26022] Updated weights on worker 0-0, policy_version 644322 (0.00087) [2022-07-10 08:24:32,192][26022] Updated weights on worker 0-0, policy_version 644332 (0.00093) [2022-07-10 08:24:33,740][25689] Fps is (10 sec: 5364.5, 60 sec: 5502.0, 300 sec: 5493.5). Total num frames: 659805184. Throughput: 0: 4920.3. Samples: 659802396. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:33,740][25689] Avg episode reward: [(0, '-3.125')] [2022-07-10 08:24:33,776][26022] Updated weights on worker 0-0, policy_version 644342 (0.00091) [2022-07-10 08:24:35,786][26022] Updated weights on worker 0-0, policy_version 644352 (0.00087) [2022-07-10 08:24:37,451][26022] Updated weights on worker 0-0, policy_version 644362 (0.00086) [2022-07-10 08:24:38,781][25689] Fps is (10 sec: 5593.3, 60 sec: 5484.0, 300 sec: 5496.3). Total num frames: 659832832. Throughput: 0: 5768.8. Samples: 659835962. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:38,781][25689] Avg episode reward: [(0, '-4.354')] [2022-07-10 08:24:39,274][26022] Updated weights on worker 0-0, policy_version 644372 (0.00088) [2022-07-10 08:24:41,081][26022] Updated weights on worker 0-0, policy_version 644382 (0.00091) [2022-07-10 08:24:42,930][26022] Updated weights on worker 0-0, policy_version 644392 (0.00092) [2022-07-10 08:24:43,804][25689] Fps is (10 sec: 5595.7, 60 sec: 5483.0, 300 sec: 5496.9). Total num frames: 659861504. Throughput: 0: 5764.6. Samples: 659869320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:43,805][25689] Avg episode reward: [(0, '-4.720')] [2022-07-10 08:24:44,872][26022] Updated weights on worker 0-0, policy_version 644402 (0.00090) [2022-07-10 08:24:46,665][26022] Updated weights on worker 0-0, policy_version 644412 (0.00093) [2022-07-10 08:24:48,319][26022] Updated weights on worker 0-0, policy_version 644422 (0.00096) [2022-07-10 08:24:48,865][25689] Fps is (10 sec: 5584.9, 60 sec: 5481.6, 300 sec: 5499.4). Total num frames: 659889152. Throughput: 0: 4972.8. Samples: 659886000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:48,867][25689] Avg episode reward: [(0, '-7.657')] [2022-07-10 08:24:50,401][26022] Updated weights on worker 0-0, policy_version 644432 (0.00086) [2022-07-10 08:24:52,332][26022] Updated weights on worker 0-0, policy_version 644442 (0.00090) [2022-07-10 08:24:53,842][26022] Updated weights on worker 0-0, policy_version 644452 (0.00094) [2022-07-10 08:24:53,967][25689] Fps is (10 sec: 5642.6, 60 sec: 5524.1, 300 sec: 5501.1). Total num frames: 659918848. Throughput: 0: 5779.2. Samples: 659919294. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:53,968][25689] Avg episode reward: [(0, '-7.736')] [2022-07-10 08:24:56,227][26022] Updated weights on worker 0-0, policy_version 644462 (0.00088) [2022-07-10 08:24:57,592][26022] Updated weights on worker 0-0, policy_version 644472 (0.00087) [2022-07-10 08:24:58,970][25689] Fps is (10 sec: 5573.2, 60 sec: 5507.6, 300 sec: 5501.4). Total num frames: 659945472. Throughput: 0: 5782.2. Samples: 659952704. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:24:58,971][25689] Avg episode reward: [(0, '-8.283')] [2022-07-10 08:24:59,755][26022] Updated weights on worker 0-0, policy_version 644482 (0.00085) [2022-07-10 08:25:01,218][26022] Updated weights on worker 0-0, policy_version 644492 (0.00078) [2022-07-10 08:25:03,708][26022] Updated weights on worker 0-0, policy_version 644502 (0.00092) [2022-07-10 08:25:03,976][25689] Fps is (10 sec: 5422.4, 60 sec: 5524.5, 300 sec: 5507.1). Total num frames: 659973120. Throughput: 0: 4965.3. Samples: 659969478. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:03,976][25689] Avg episode reward: [(0, '-6.859')] [2022-07-10 08:25:05,605][26022] Updated weights on worker 0-0, policy_version 644512 (0.00093) [2022-07-10 08:25:07,136][26022] Updated weights on worker 0-0, policy_version 644522 (0.00092) [2022-07-10 08:25:09,047][25689] Fps is (10 sec: 5284.4, 60 sec: 5508.0, 300 sec: 5496.6). Total num frames: 659998720. Throughput: 0: 5693.1. Samples: 660000898. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:09,047][25689] Avg episode reward: [(0, '-7.742')] [2022-07-10 08:25:09,228][26022] Updated weights on worker 0-0, policy_version 644532 (0.00092) [2022-07-10 08:25:11,047][26022] Updated weights on worker 0-0, policy_version 644542 (0.00079) [2022-07-10 08:25:12,744][26022] Updated weights on worker 0-0, policy_version 644552 (0.00094) [2022-07-10 08:25:14,058][25689] Fps is (10 sec: 5484.6, 60 sec: 5548.4, 300 sec: 5507.6). Total num frames: 660028416. Throughput: 0: 5719.1. Samples: 660034196. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:14,058][25689] Avg episode reward: [(0, '-5.988')] [2022-07-10 08:25:14,772][26022] Updated weights on worker 0-0, policy_version 644562 (0.00091) [2022-07-10 08:25:16,587][26022] Updated weights on worker 0-0, policy_version 644572 (0.00083) [2022-07-10 08:25:18,345][26022] Updated weights on worker 0-0, policy_version 644582 (0.00091) [2022-07-10 08:25:19,080][25689] Fps is (10 sec: 5715.2, 60 sec: 5536.7, 300 sec: 5500.4). Total num frames: 660056064. Throughput: 0: 4883.5. Samples: 660050912. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:19,081][25689] Avg episode reward: [(0, '-4.867')] [2022-07-10 08:25:20,346][26022] Updated weights on worker 0-0, policy_version 644592 (0.00081) [2022-07-10 08:25:21,862][26022] Updated weights on worker 0-0, policy_version 644602 (0.00091) [2022-07-10 08:25:23,974][26022] Updated weights on worker 0-0, policy_version 644612 (0.00086) [2022-07-10 08:25:24,139][25689] Fps is (10 sec: 5383.4, 60 sec: 5482.4, 300 sec: 5500.9). Total num frames: 660082688. Throughput: 0: 5691.8. Samples: 660084244. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:24,140][25689] Avg episode reward: [(0, '-6.980')] [2022-07-10 08:25:25,721][26022] Updated weights on worker 0-0, policy_version 644622 (0.00082) [2022-07-10 08:25:27,665][26022] Updated weights on worker 0-0, policy_version 644632 (0.00098) [2022-07-10 08:25:29,199][25689] Fps is (10 sec: 5464.8, 60 sec: 5535.8, 300 sec: 5505.0). Total num frames: 660111360. Throughput: 0: 5789.6. Samples: 660117572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:29,200][25689] Avg episode reward: [(0, '-6.804')] [2022-07-10 08:25:29,499][26022] Updated weights on worker 0-0, policy_version 644642 (0.00085) [2022-07-10 08:25:31,422][26022] Updated weights on worker 0-0, policy_version 644652 (0.00093) [2022-07-10 08:25:33,027][26022] Updated weights on worker 0-0, policy_version 644662 (0.00085) [2022-07-10 08:25:34,233][25689] Fps is (10 sec: 5579.7, 60 sec: 5518.3, 300 sec: 5501.6). Total num frames: 660139008. Throughput: 0: 4942.9. Samples: 660133920. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:34,233][25689] Avg episode reward: [(0, '-5.639')] [2022-07-10 08:25:35,298][26022] Updated weights on worker 0-0, policy_version 644672 (0.00088) [2022-07-10 08:25:36,788][26022] Updated weights on worker 0-0, policy_version 644682 (0.00093) [2022-07-10 08:25:38,869][26022] Updated weights on worker 0-0, policy_version 644692 (0.00088) [2022-07-10 08:25:39,260][25689] Fps is (10 sec: 5598.1, 60 sec: 5536.6, 300 sec: 5504.9). Total num frames: 660167680. Throughput: 0: 5749.2. Samples: 660166926. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:39,260][25689] Avg episode reward: [(0, '-5.598')] [2022-07-10 08:25:40,583][26022] Updated weights on worker 0-0, policy_version 644702 (0.00053) [2022-07-10 08:25:42,468][26022] Updated weights on worker 0-0, policy_version 644712 (0.00088) [2022-07-10 08:25:44,211][26022] Updated weights on worker 0-0, policy_version 644722 (0.00097) [2022-07-10 08:25:44,287][25689] Fps is (10 sec: 5601.9, 60 sec: 5519.4, 300 sec: 5505.6). Total num frames: 660195328. Throughput: 0: 5760.7. Samples: 660200308. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:44,287][25689] Avg episode reward: [(0, '-5.526')] [2022-07-10 08:25:46,084][26022] Updated weights on worker 0-0, policy_version 644732 (0.00091) [2022-07-10 08:25:48,087][26022] Updated weights on worker 0-0, policy_version 644742 (0.00098) [2022-07-10 08:25:49,432][25689] Fps is (10 sec: 5536.3, 60 sec: 5528.5, 300 sec: 5507.0). Total num frames: 660224000. Throughput: 0: 4911.0. Samples: 660216938. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:49,433][25689] Avg episode reward: [(0, '-5.814')] [2022-07-10 08:25:49,883][26022] Updated weights on worker 0-0, policy_version 644752 (0.00088) [2022-07-10 08:25:51,774][26022] Updated weights on worker 0-0, policy_version 644762 (0.00050) [2022-07-10 08:25:53,667][26022] Updated weights on worker 0-0, policy_version 644772 (0.00094) [2022-07-10 08:25:54,448][25689] Fps is (10 sec: 5340.9, 60 sec: 5468.7, 300 sec: 5497.4). Total num frames: 660249600. Throughput: 0: 5759.0. Samples: 660250340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:54,449][25689] Avg episode reward: [(0, '-3.500')] [2022-07-10 08:25:55,202][26022] Updated weights on worker 0-0, policy_version 644782 (0.00081) [2022-07-10 08:25:57,409][26022] Updated weights on worker 0-0, policy_version 644792 (0.00096) [2022-07-10 08:25:58,819][26022] Updated weights on worker 0-0, policy_version 644802 (0.01265) [2022-07-10 08:25:59,454][25689] Fps is (10 sec: 5517.6, 60 sec: 5519.2, 300 sec: 5508.8). Total num frames: 660279296. Throughput: 0: 5794.0. Samples: 660283932. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:25:59,455][25689] Avg episode reward: [(0, '-3.489')] [2022-07-10 08:26:01,071][26022] Updated weights on worker 0-0, policy_version 644812 (0.00080) [2022-07-10 08:26:02,909][26022] Updated weights on worker 0-0, policy_version 644822 (0.00092) [2022-07-10 08:26:04,537][25689] Fps is (10 sec: 5480.8, 60 sec: 5478.3, 300 sec: 5501.8). Total num frames: 660304896. Throughput: 0: 4939.0. Samples: 660300326. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:04,539][25689] Avg episode reward: [(0, '-4.315')] [2022-07-10 08:26:04,855][26022] Updated weights on worker 0-0, policy_version 644832 (0.00090) [2022-07-10 08:26:06,788][26022] Updated weights on worker 0-0, policy_version 644842 (0.00090) [2022-07-10 08:26:08,467][26022] Updated weights on worker 0-0, policy_version 644852 (0.00085) [2022-07-10 08:26:09,606][25689] Fps is (10 sec: 5346.1, 60 sec: 5529.3, 300 sec: 5504.1). Total num frames: 660333568. Throughput: 0: 5728.5. Samples: 660332500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:09,606][25689] Avg episode reward: [(0, '-3.929')] [2022-07-10 08:26:10,366][26022] Updated weights on worker 0-0, policy_version 644862 (0.00093) [2022-07-10 08:26:12,295][26022] Updated weights on worker 0-0, policy_version 644872 (0.00084) [2022-07-10 08:26:13,916][26022] Updated weights on worker 0-0, policy_version 644882 (0.00084) [2022-07-10 08:26:14,610][25689] Fps is (10 sec: 5693.1, 60 sec: 5513.0, 300 sec: 5507.5). Total num frames: 660362240. Throughput: 0: 5732.5. Samples: 660365916. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:14,610][25689] Avg episode reward: [(0, '-4.978')] [2022-07-10 08:26:16,107][26022] Updated weights on worker 0-0, policy_version 644892 (0.00097) [2022-07-10 08:26:16,490][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:26:16,512][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000644895_660372480.pth [2022-07-10 08:26:16,513][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000642956_658386944.pth [2022-07-10 08:26:17,488][26022] Updated weights on worker 0-0, policy_version 644902 (0.00081) [2022-07-10 08:26:19,621][25689] Fps is (10 sec: 5521.3, 60 sec: 5497.2, 300 sec: 5502.1). Total num frames: 660388864. Throughput: 0: 5712.3. Samples: 660399130. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:19,623][25689] Avg episode reward: [(0, '-5.307')] [2022-07-10 08:26:19,652][26022] Updated weights on worker 0-0, policy_version 644912 (0.00082) [2022-07-10 08:26:21,238][26022] Updated weights on worker 0-0, policy_version 644922 (0.00080) [2022-07-10 08:26:23,309][26022] Updated weights on worker 0-0, policy_version 644932 (0.00085) [2022-07-10 08:26:24,663][25689] Fps is (10 sec: 5602.4, 60 sec: 5549.5, 300 sec: 5513.0). Total num frames: 660418560. Throughput: 0: 5745.9. Samples: 660415964. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:24,663][25689] Avg episode reward: [(0, '-4.753')] [2022-07-10 08:26:24,921][26022] Updated weights on worker 0-0, policy_version 644942 (0.00084) [2022-07-10 08:26:27,129][26022] Updated weights on worker 0-0, policy_version 644952 (0.00085) [2022-07-10 08:26:28,764][26022] Updated weights on worker 0-0, policy_version 644962 (0.00085) [2022-07-10 08:26:29,782][25689] Fps is (10 sec: 5542.8, 60 sec: 5510.2, 300 sec: 5504.7). Total num frames: 660445184. Throughput: 0: 5777.7. Samples: 660449070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:29,782][25689] Avg episode reward: [(0, '-4.328')] [2022-07-10 08:26:30,807][26022] Updated weights on worker 0-0, policy_version 644972 (0.00089) [2022-07-10 08:26:32,595][26022] Updated weights on worker 0-0, policy_version 644982 (0.00090) [2022-07-10 08:26:34,381][26022] Updated weights on worker 0-0, policy_version 644992 (0.00087) [2022-07-10 08:26:34,800][25689] Fps is (10 sec: 5353.8, 60 sec: 5511.7, 300 sec: 5501.2). Total num frames: 660472832. Throughput: 0: 5759.1. Samples: 660482192. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:34,800][25689] Avg episode reward: [(0, '-6.674')] [2022-07-10 08:26:36,410][26022] Updated weights on worker 0-0, policy_version 645002 (0.00094) [2022-07-10 08:26:38,148][26022] Updated weights on worker 0-0, policy_version 645012 (0.00093) [2022-07-10 08:26:39,843][25689] Fps is (10 sec: 5495.8, 60 sec: 5493.2, 300 sec: 5507.7). Total num frames: 660500480. Throughput: 0: 4920.7. Samples: 660498638. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:39,844][25689] Avg episode reward: [(0, '-6.374')] [2022-07-10 08:26:40,047][26022] Updated weights on worker 0-0, policy_version 645022 (0.00099) [2022-07-10 08:26:42,191][26022] Updated weights on worker 0-0, policy_version 645032 (0.00089) [2022-07-10 08:26:43,616][26022] Updated weights on worker 0-0, policy_version 645042 (0.00089) [2022-07-10 08:26:44,855][25689] Fps is (10 sec: 5499.3, 60 sec: 5494.6, 300 sec: 5505.8). Total num frames: 660528128. Throughput: 0: 5739.3. Samples: 660531852. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:44,855][25689] Avg episode reward: [(0, '-6.137')] [2022-07-10 08:26:45,871][26022] Updated weights on worker 0-0, policy_version 645052 (0.00090) [2022-07-10 08:26:47,249][26022] Updated weights on worker 0-0, policy_version 645062 (0.00081) [2022-07-10 08:26:49,382][26022] Updated weights on worker 0-0, policy_version 645072 (0.00093) [2022-07-10 08:26:49,911][25689] Fps is (10 sec: 5492.3, 60 sec: 5485.8, 300 sec: 5502.6). Total num frames: 660555776. Throughput: 0: 5748.5. Samples: 660564784. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:49,912][25689] Avg episode reward: [(0, '-6.219')] [2022-07-10 08:26:51,287][26022] Updated weights on worker 0-0, policy_version 645082 (0.00082) [2022-07-10 08:26:53,155][26022] Updated weights on worker 0-0, policy_version 645092 (0.00083) [2022-07-10 08:26:54,938][25689] Fps is (10 sec: 5483.8, 60 sec: 5518.7, 300 sec: 5499.6). Total num frames: 660583424. Throughput: 0: 4937.2. Samples: 660581620. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:54,939][25689] Avg episode reward: [(0, '-7.788')] [2022-07-10 08:26:54,962][26022] Updated weights on worker 0-0, policy_version 645102 (0.00095) [2022-07-10 08:26:56,776][26022] Updated weights on worker 0-0, policy_version 645112 (0.00095) [2022-07-10 08:26:58,520][26022] Updated weights on worker 0-0, policy_version 645122 (0.00085) [2022-07-10 08:26:59,952][25689] Fps is (10 sec: 5609.3, 60 sec: 5501.1, 300 sec: 5513.4). Total num frames: 660612096. Throughput: 0: 5776.7. Samples: 660614798. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:26:59,952][25689] Avg episode reward: [(0, '-4.917')] [2022-07-10 08:27:00,541][26022] Updated weights on worker 0-0, policy_version 645132 (0.00088) [2022-07-10 08:27:02,451][26022] Updated weights on worker 0-0, policy_version 645142 (0.00098) [2022-07-10 08:27:04,609][26022] Updated weights on worker 0-0, policy_version 645152 (0.00052) [2022-07-10 08:27:04,959][25689] Fps is (10 sec: 5313.9, 60 sec: 5491.0, 300 sec: 5497.0). Total num frames: 660636672. Throughput: 0: 5683.9. Samples: 660646122. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:04,961][25689] Avg episode reward: [(0, '-5.528')] [2022-07-10 08:27:06,350][26022] Updated weights on worker 0-0, policy_version 645162 (0.00091) [2022-07-10 08:27:08,422][26022] Updated weights on worker 0-0, policy_version 645172 (0.00095) [2022-07-10 08:27:10,087][25689] Fps is (10 sec: 5253.8, 60 sec: 5485.6, 300 sec: 5501.6). Total num frames: 660665344. Throughput: 0: 4862.6. Samples: 660662890. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:10,088][25689] Avg episode reward: [(0, '-5.695')] [2022-07-10 08:27:10,124][26022] Updated weights on worker 0-0, policy_version 645182 (0.00093) [2022-07-10 08:27:11,886][26022] Updated weights on worker 0-0, policy_version 645192 (0.00088) [2022-07-10 08:27:13,690][26022] Updated weights on worker 0-0, policy_version 645202 (0.00088) [2022-07-10 08:27:15,099][25689] Fps is (10 sec: 5655.2, 60 sec: 5484.9, 300 sec: 5501.5). Total num frames: 660694016. Throughput: 0: 5701.8. Samples: 660696572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:15,100][25689] Avg episode reward: [(0, '-5.387')] [2022-07-10 08:27:15,474][26022] Updated weights on worker 0-0, policy_version 645212 (0.00087) [2022-07-10 08:27:17,523][26022] Updated weights on worker 0-0, policy_version 645222 (0.00089) [2022-07-10 08:27:19,032][26022] Updated weights on worker 0-0, policy_version 645232 (0.00087) [2022-07-10 08:27:20,138][25689] Fps is (10 sec: 5806.9, 60 sec: 5533.1, 300 sec: 5511.2). Total num frames: 660723712. Throughput: 0: 5724.6. Samples: 660730358. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:20,139][25689] Avg episode reward: [(0, '-4.877')] [2022-07-10 08:27:20,995][26022] Updated weights on worker 0-0, policy_version 645242 (0.00085) [2022-07-10 08:27:22,647][26022] Updated weights on worker 0-0, policy_version 645252 (0.00086) [2022-07-10 08:27:24,696][26022] Updated weights on worker 0-0, policy_version 645262 (0.00086) [2022-07-10 08:27:25,177][25689] Fps is (10 sec: 5486.9, 60 sec: 5465.7, 300 sec: 5504.4). Total num frames: 660749312. Throughput: 0: 4993.9. Samples: 660747088. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:25,177][25689] Avg episode reward: [(0, '-3.979')] [2022-07-10 08:27:26,501][26022] Updated weights on worker 0-0, policy_version 645272 (0.00086) [2022-07-10 08:27:28,372][26022] Updated weights on worker 0-0, policy_version 645282 (0.00092) [2022-07-10 08:27:30,149][26022] Updated weights on worker 0-0, policy_version 645292 (0.00090) [2022-07-10 08:27:30,244][25689] Fps is (10 sec: 5471.7, 60 sec: 5521.2, 300 sec: 5510.3). Total num frames: 660779008. Throughput: 0: 5822.2. Samples: 660780250. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:30,244][25689] Avg episode reward: [(0, '-5.162')] [2022-07-10 08:27:31,963][26022] Updated weights on worker 0-0, policy_version 645302 (0.00087) [2022-07-10 08:27:34,003][26022] Updated weights on worker 0-0, policy_version 645312 (0.00093) [2022-07-10 08:27:35,271][25689] Fps is (10 sec: 5680.6, 60 sec: 5520.3, 300 sec: 5510.4). Total num frames: 660806656. Throughput: 0: 5819.9. Samples: 660813974. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 08:27:35,272][25689] Avg episode reward: [(0, '-5.230')] [2022-07-10 08:27:35,785][26022] Updated weights on worker 0-0, policy_version 645322 (0.00090) [2022-07-10 08:27:37,459][26022] Updated weights on worker 0-0, policy_version 645332 (0.00092) [2022-07-10 08:27:39,365][26022] Updated weights on worker 0-0, policy_version 645342 (0.00081) [2022-07-10 08:27:40,292][25689] Fps is (10 sec: 5604.9, 60 sec: 5539.3, 300 sec: 5510.3). Total num frames: 660835328. Throughput: 0: 4991.6. Samples: 660830960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:27:40,293][25689] Avg episode reward: [(0, '-4.506')] [2022-07-10 08:27:41,124][26022] Updated weights on worker 0-0, policy_version 645352 (0.00084) [2022-07-10 08:27:43,119][26022] Updated weights on worker 0-0, policy_version 645362 (0.00080) [2022-07-10 08:27:44,856][26022] Updated weights on worker 0-0, policy_version 645372 (0.00089) [2022-07-10 08:27:45,318][25689] Fps is (10 sec: 5605.9, 60 sec: 5538.1, 300 sec: 5510.9). Total num frames: 660862976. Throughput: 0: 5809.5. Samples: 660864098. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:27:45,319][25689] Avg episode reward: [(0, '-5.402')] [2022-07-10 08:27:46,827][26022] Updated weights on worker 0-0, policy_version 645382 (0.00084) [2022-07-10 08:27:48,664][26022] Updated weights on worker 0-0, policy_version 645392 (0.00088) [2022-07-10 08:27:50,385][25689] Fps is (10 sec: 5478.9, 60 sec: 5537.1, 300 sec: 5503.8). Total num frames: 660890624. Throughput: 0: 5825.9. Samples: 660897588. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:27:50,385][25689] Avg episode reward: [(0, '-5.561')] [2022-07-10 08:27:50,444][26022] Updated weights on worker 0-0, policy_version 645402 (0.00092) [2022-07-10 08:27:52,275][26022] Updated weights on worker 0-0, policy_version 645412 (0.00097) [2022-07-10 08:27:54,034][26022] Updated weights on worker 0-0, policy_version 645422 (0.00085) [2022-07-10 08:27:55,426][25689] Fps is (10 sec: 5571.8, 60 sec: 5552.7, 300 sec: 5510.3). Total num frames: 660919296. Throughput: 0: 4971.3. Samples: 660914168. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:27:55,426][25689] Avg episode reward: [(0, '-4.997')] [2022-07-10 08:27:55,960][26022] Updated weights on worker 0-0, policy_version 645432 (0.00093) [2022-07-10 08:27:57,856][26022] Updated weights on worker 0-0, policy_version 645442 (0.00093) [2022-07-10 08:27:59,712][26022] Updated weights on worker 0-0, policy_version 645452 (0.00087) [2022-07-10 08:28:00,472][25689] Fps is (10 sec: 5583.3, 60 sec: 5532.8, 300 sec: 5517.3). Total num frames: 660946944. Throughput: 0: 5781.4. Samples: 660947626. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:00,473][25689] Avg episode reward: [(0, '-4.140')] [2022-07-10 08:28:01,600][26022] Updated weights on worker 0-0, policy_version 645462 (0.00086) [2022-07-10 08:28:03,792][26022] Updated weights on worker 0-0, policy_version 645472 (0.00093) [2022-07-10 08:28:05,498][25689] Fps is (10 sec: 5286.6, 60 sec: 5548.0, 300 sec: 5510.6). Total num frames: 660972544. Throughput: 0: 5674.0. Samples: 660978600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:05,499][25689] Avg episode reward: [(0, '-3.324')] [2022-07-10 08:28:05,548][26022] Updated weights on worker 0-0, policy_version 645482 (0.00092) [2022-07-10 08:28:07,468][26022] Updated weights on worker 0-0, policy_version 645492 (0.00085) [2022-07-10 08:28:09,358][26022] Updated weights on worker 0-0, policy_version 645502 (0.00086) [2022-07-10 08:28:10,602][25689] Fps is (10 sec: 5357.6, 60 sec: 5550.2, 300 sec: 5506.9). Total num frames: 661001216. Throughput: 0: 4830.5. Samples: 660995246. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:10,602][25689] Avg episode reward: [(0, '-3.464')] [2022-07-10 08:28:11,147][26022] Updated weights on worker 0-0, policy_version 645512 (0.00087) [2022-07-10 08:28:13,046][26022] Updated weights on worker 0-0, policy_version 645522 (0.00093) [2022-07-10 08:28:14,897][26022] Updated weights on worker 0-0, policy_version 645532 (0.00092) [2022-07-10 08:28:15,654][25689] Fps is (10 sec: 5444.9, 60 sec: 5512.7, 300 sec: 5506.8). Total num frames: 661027840. Throughput: 0: 5653.5. Samples: 661028526. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:15,654][25689] Avg episode reward: [(0, '-2.462')] [2022-07-10 08:28:16,525][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:28:16,534][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000645542_661035008.pth [2022-07-10 08:28:16,535][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000643602_659048448.pth [2022-07-10 08:28:16,537][26022] Updated weights on worker 0-0, policy_version 645542 (0.00084) [2022-07-10 08:28:18,831][26022] Updated weights on worker 0-0, policy_version 645552 (0.00087) [2022-07-10 08:28:20,196][26022] Updated weights on worker 0-0, policy_version 645562 (0.00095) [2022-07-10 08:28:20,655][25689] Fps is (10 sec: 5500.2, 60 sec: 5499.3, 300 sec: 5503.5). Total num frames: 661056512. Throughput: 0: 5665.7. Samples: 661061978. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:20,656][25689] Avg episode reward: [(0, '-2.663')] [2022-07-10 08:28:22,186][26022] Updated weights on worker 0-0, policy_version 645572 (0.00084) [2022-07-10 08:28:24,092][26022] Updated weights on worker 0-0, policy_version 645582 (0.00100) [2022-07-10 08:28:25,669][25689] Fps is (10 sec: 5623.7, 60 sec: 5535.4, 300 sec: 5504.5). Total num frames: 661084160. Throughput: 0: 4978.4. Samples: 661079016. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:25,669][25689] Avg episode reward: [(0, '-3.790')] [2022-07-10 08:28:25,797][26022] Updated weights on worker 0-0, policy_version 645592 (0.00092) [2022-07-10 08:28:27,781][26022] Updated weights on worker 0-0, policy_version 645602 (0.00088) [2022-07-10 08:28:29,332][26022] Updated weights on worker 0-0, policy_version 645612 (0.00091) [2022-07-10 08:28:30,768][25689] Fps is (10 sec: 5569.5, 60 sec: 5515.6, 300 sec: 5513.8). Total num frames: 661112832. Throughput: 0: 5814.3. Samples: 661112496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:30,768][25689] Avg episode reward: [(0, '-4.373')] [2022-07-10 08:28:31,442][26022] Updated weights on worker 0-0, policy_version 645622 (0.00085) [2022-07-10 08:28:33,351][26022] Updated weights on worker 0-0, policy_version 645632 (0.00089) [2022-07-10 08:28:35,211][26022] Updated weights on worker 0-0, policy_version 645642 (0.00095) [2022-07-10 08:28:35,851][25689] Fps is (10 sec: 5631.4, 60 sec: 5527.4, 300 sec: 5512.8). Total num frames: 661141504. Throughput: 0: 5809.8. Samples: 661145870. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:35,852][25689] Avg episode reward: [(0, '-4.187')] [2022-07-10 08:28:36,937][26022] Updated weights on worker 0-0, policy_version 645652 (0.00087) [2022-07-10 08:28:38,732][26022] Updated weights on worker 0-0, policy_version 645662 (0.00088) [2022-07-10 08:28:40,597][26022] Updated weights on worker 0-0, policy_version 645672 (0.00422) [2022-07-10 08:28:40,855][25689] Fps is (10 sec: 5583.1, 60 sec: 5512.0, 300 sec: 5509.6). Total num frames: 661169152. Throughput: 0: 4986.9. Samples: 661162712. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:40,857][25689] Avg episode reward: [(0, '-3.888')] [2022-07-10 08:28:42,223][26022] Updated weights on worker 0-0, policy_version 645682 (0.00083) [2022-07-10 08:28:44,367][26022] Updated weights on worker 0-0, policy_version 645692 (0.00086) [2022-07-10 08:28:45,916][25689] Fps is (10 sec: 5697.8, 60 sec: 5542.7, 300 sec: 5516.1). Total num frames: 661198848. Throughput: 0: 5808.2. Samples: 661196614. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:45,916][25689] Avg episode reward: [(0, '-3.874')] [2022-07-10 08:28:45,924][26022] Updated weights on worker 0-0, policy_version 645702 (0.00094) [2022-07-10 08:28:47,955][26022] Updated weights on worker 0-0, policy_version 645712 (0.00087) [2022-07-10 08:28:49,520][26022] Updated weights on worker 0-0, policy_version 645722 (0.00087) [2022-07-10 08:28:50,983][25689] Fps is (10 sec: 5560.7, 60 sec: 5525.7, 300 sec: 5515.1). Total num frames: 661225472. Throughput: 0: 5825.0. Samples: 661230254. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:50,984][25689] Avg episode reward: [(0, '-4.018')] [2022-07-10 08:28:51,567][26022] Updated weights on worker 0-0, policy_version 645732 (0.00087) [2022-07-10 08:28:53,481][26022] Updated weights on worker 0-0, policy_version 645742 (0.00084) [2022-07-10 08:28:55,083][26022] Updated weights on worker 0-0, policy_version 645752 (0.00093) [2022-07-10 08:28:56,080][25689] Fps is (10 sec: 5541.1, 60 sec: 5537.5, 300 sec: 5520.3). Total num frames: 661255168. Throughput: 0: 5825.5. Samples: 661263710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:28:56,080][25689] Avg episode reward: [(0, '-4.807')] [2022-07-10 08:28:57,162][26022] Updated weights on worker 0-0, policy_version 645762 (0.00097) [2022-07-10 08:28:58,903][26022] Updated weights on worker 0-0, policy_version 645772 (0.00097) [2022-07-10 08:29:00,928][26022] Updated weights on worker 0-0, policy_version 645782 (0.00091) [2022-07-10 08:29:01,111][25689] Fps is (10 sec: 5662.1, 60 sec: 5538.9, 300 sec: 5523.3). Total num frames: 661282816. Throughput: 0: 5813.3. Samples: 661280466. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:01,112][25689] Avg episode reward: [(0, '-4.109')] [2022-07-10 08:29:03,048][26022] Updated weights on worker 0-0, policy_version 645792 (0.00086) [2022-07-10 08:29:04,757][26022] Updated weights on worker 0-0, policy_version 645802 (0.00088) [2022-07-10 08:29:06,118][25689] Fps is (10 sec: 5202.2, 60 sec: 5523.7, 300 sec: 5517.7). Total num frames: 661307392. Throughput: 0: 5695.7. Samples: 661311684. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:06,119][25689] Avg episode reward: [(0, '-4.353')] [2022-07-10 08:29:06,645][26022] Updated weights on worker 0-0, policy_version 645812 (0.00089) [2022-07-10 08:29:08,456][26022] Updated weights on worker 0-0, policy_version 645822 (0.00091) [2022-07-10 08:29:10,260][26022] Updated weights on worker 0-0, policy_version 645832 (0.00088) [2022-07-10 08:29:11,161][25689] Fps is (10 sec: 5400.3, 60 sec: 5546.2, 300 sec: 5525.3). Total num frames: 661337088. Throughput: 0: 5690.3. Samples: 661345070. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:11,161][25689] Avg episode reward: [(0, '-5.550')] [2022-07-10 08:29:12,311][26022] Updated weights on worker 0-0, policy_version 645842 (0.00093) [2022-07-10 08:29:13,913][26022] Updated weights on worker 0-0, policy_version 645852 (0.00094) [2022-07-10 08:29:15,920][26022] Updated weights on worker 0-0, policy_version 645862 (0.00082) [2022-07-10 08:29:16,180][25689] Fps is (10 sec: 5699.4, 60 sec: 5566.1, 300 sec: 5523.0). Total num frames: 661364736. Throughput: 0: 4886.5. Samples: 661361930. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:16,180][25689] Avg episode reward: [(0, '-5.696')] [2022-07-10 08:29:17,572][26022] Updated weights on worker 0-0, policy_version 645872 (0.00094) [2022-07-10 08:29:19,347][26022] Updated weights on worker 0-0, policy_version 645882 (0.00081) [2022-07-10 08:29:21,222][25689] Fps is (10 sec: 5495.7, 60 sec: 5545.5, 300 sec: 5515.7). Total num frames: 661392384. Throughput: 0: 5729.0. Samples: 661395684. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:21,223][25689] Avg episode reward: [(0, '-7.058')] [2022-07-10 08:29:21,276][26022] Updated weights on worker 0-0, policy_version 645892 (0.00087) [2022-07-10 08:29:23,067][26022] Updated weights on worker 0-0, policy_version 645902 (0.00089) [2022-07-10 08:29:24,858][26022] Updated weights on worker 0-0, policy_version 645912 (0.00085) [2022-07-10 08:29:26,227][25689] Fps is (10 sec: 5605.8, 60 sec: 5563.2, 300 sec: 5527.6). Total num frames: 661421056. Throughput: 0: 5852.2. Samples: 661429362. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:26,227][25689] Avg episode reward: [(0, '-5.908')] [2022-07-10 08:29:26,820][26022] Updated weights on worker 0-0, policy_version 645922 (0.00087) [2022-07-10 08:29:28,612][26022] Updated weights on worker 0-0, policy_version 645932 (0.00090) [2022-07-10 08:29:30,313][26022] Updated weights on worker 0-0, policy_version 645942 (0.00092) [2022-07-10 08:29:31,281][25689] Fps is (10 sec: 5701.3, 60 sec: 5567.3, 300 sec: 5527.1). Total num frames: 661449728. Throughput: 0: 5027.9. Samples: 661446230. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:31,281][25689] Avg episode reward: [(0, '-6.006')] [2022-07-10 08:29:32,345][26022] Updated weights on worker 0-0, policy_version 645952 (0.00091) [2022-07-10 08:29:33,897][26022] Updated weights on worker 0-0, policy_version 645962 (0.00090) [2022-07-10 08:29:36,119][26022] Updated weights on worker 0-0, policy_version 645972 (0.00093) [2022-07-10 08:29:36,291][25689] Fps is (10 sec: 5392.7, 60 sec: 5523.3, 300 sec: 5520.8). Total num frames: 661475328. Throughput: 0: 5850.6. Samples: 661479590. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:36,291][25689] Avg episode reward: [(0, '-5.190')] [2022-07-10 08:29:37,496][26022] Updated weights on worker 0-0, policy_version 645982 (0.00098) [2022-07-10 08:29:39,631][26022] Updated weights on worker 0-0, policy_version 645992 (0.00088) [2022-07-10 08:29:41,296][25689] Fps is (10 sec: 5521.3, 60 sec: 5557.1, 300 sec: 5524.6). Total num frames: 661505024. Throughput: 0: 5850.1. Samples: 661513114. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:41,297][25689] Avg episode reward: [(0, '-4.859')] [2022-07-10 08:29:41,695][26022] Updated weights on worker 0-0, policy_version 646002 (0.00082) [2022-07-10 08:29:43,140][26022] Updated weights on worker 0-0, policy_version 646012 (0.00091) [2022-07-10 08:29:45,315][26022] Updated weights on worker 0-0, policy_version 646022 (0.00083) [2022-07-10 08:29:46,366][25689] Fps is (10 sec: 5793.0, 60 sec: 5539.2, 300 sec: 5527.9). Total num frames: 661533696. Throughput: 0: 4984.3. Samples: 661529744. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:46,367][25689] Avg episode reward: [(0, '-4.626')] [2022-07-10 08:29:46,787][26022] Updated weights on worker 0-0, policy_version 646032 (0.00087) [2022-07-10 08:29:48,926][26022] Updated weights on worker 0-0, policy_version 646042 (0.00086) [2022-07-10 08:29:50,670][26022] Updated weights on worker 0-0, policy_version 646052 (0.00085) [2022-07-10 08:29:51,423][25689] Fps is (10 sec: 5460.1, 60 sec: 5540.2, 300 sec: 5518.4). Total num frames: 661560320. Throughput: 0: 5802.5. Samples: 661563106. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:51,424][25689] Avg episode reward: [(0, '-2.582')] [2022-07-10 08:29:52,496][26022] Updated weights on worker 0-0, policy_version 646062 (0.00085) [2022-07-10 08:29:54,282][26022] Updated weights on worker 0-0, policy_version 646072 (0.00116) [2022-07-10 08:29:56,450][25689] Fps is (10 sec: 5280.7, 60 sec: 5495.7, 300 sec: 5517.9). Total num frames: 661586944. Throughput: 0: 5807.1. Samples: 661596656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:29:56,451][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 08:29:56,463][26022] Updated weights on worker 0-0, policy_version 646082 (0.00080) [2022-07-10 08:29:57,954][26022] Updated weights on worker 0-0, policy_version 646092 (0.00049) [2022-07-10 08:29:59,815][26022] Updated weights on worker 0-0, policy_version 646102 (0.00086) [2022-07-10 08:30:01,523][25689] Fps is (10 sec: 5677.3, 60 sec: 5542.7, 300 sec: 5527.0). Total num frames: 661617664. Throughput: 0: 4963.2. Samples: 661613508. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:01,524][25689] Avg episode reward: [(0, '-1.127')] [2022-07-10 08:30:01,804][26022] Updated weights on worker 0-0, policy_version 646112 (0.00095) [2022-07-10 08:30:03,887][26022] Updated weights on worker 0-0, policy_version 646122 (0.00088) [2022-07-10 08:30:05,832][26022] Updated weights on worker 0-0, policy_version 646132 (0.00082) [2022-07-10 08:30:06,536][25689] Fps is (10 sec: 5583.8, 60 sec: 5559.2, 300 sec: 5528.1). Total num frames: 661643264. Throughput: 0: 5715.3. Samples: 661645020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:06,538][25689] Avg episode reward: [(0, '-1.128')] [2022-07-10 08:30:07,530][26022] Updated weights on worker 0-0, policy_version 646142 (0.00092) [2022-07-10 08:30:09,424][26022] Updated weights on worker 0-0, policy_version 646152 (0.00084) [2022-07-10 08:30:11,390][26022] Updated weights on worker 0-0, policy_version 646162 (0.00088) [2022-07-10 08:30:11,660][25689] Fps is (10 sec: 5354.2, 60 sec: 5534.8, 300 sec: 5522.5). Total num frames: 661671936. Throughput: 0: 5699.8. Samples: 661678452. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:11,660][25689] Avg episode reward: [(0, '-1.955')] [2022-07-10 08:30:13,166][26022] Updated weights on worker 0-0, policy_version 646172 (0.00086) [2022-07-10 08:30:14,964][26022] Updated weights on worker 0-0, policy_version 646182 (0.00095) [2022-07-10 08:30:16,602][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:30:16,620][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000646191_661699584.pth [2022-07-10 08:30:16,620][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000644249_659710976.pth [2022-07-10 08:30:16,701][25689] Fps is (10 sec: 5641.4, 60 sec: 5549.7, 300 sec: 5525.6). Total num frames: 661700608. Throughput: 0: 4859.8. Samples: 661695072. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:16,701][25689] Avg episode reward: [(0, '-3.144')] [2022-07-10 08:30:16,707][26022] Updated weights on worker 0-0, policy_version 646192 (0.00074) [2022-07-10 08:30:18,715][26022] Updated weights on worker 0-0, policy_version 646202 (0.00092) [2022-07-10 08:30:20,429][26022] Updated weights on worker 0-0, policy_version 646212 (0.00087) [2022-07-10 08:30:21,756][25689] Fps is (10 sec: 5477.1, 60 sec: 5531.7, 300 sec: 5525.7). Total num frames: 661727232. Throughput: 0: 5705.7. Samples: 661728946. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:21,756][25689] Avg episode reward: [(0, '-3.921')] [2022-07-10 08:30:22,156][26022] Updated weights on worker 0-0, policy_version 646222 (0.00089) [2022-07-10 08:30:24,210][26022] Updated weights on worker 0-0, policy_version 646232 (0.00093) [2022-07-10 08:30:25,676][26022] Updated weights on worker 0-0, policy_version 646242 (0.00088) [2022-07-10 08:30:26,766][25689] Fps is (10 sec: 5493.8, 60 sec: 5531.1, 300 sec: 5526.6). Total num frames: 661755904. Throughput: 0: 5793.2. Samples: 661762216. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:26,766][25689] Avg episode reward: [(0, '-3.640')] [2022-07-10 08:30:27,866][26022] Updated weights on worker 0-0, policy_version 646252 (0.00084) [2022-07-10 08:30:29,452][26022] Updated weights on worker 0-0, policy_version 646262 (0.00087) [2022-07-10 08:30:31,486][26022] Updated weights on worker 0-0, policy_version 646272 (0.00091) [2022-07-10 08:30:31,822][25689] Fps is (10 sec: 5696.7, 60 sec: 5530.9, 300 sec: 5529.7). Total num frames: 661784576. Throughput: 0: 4988.4. Samples: 661779028. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:31,822][25689] Avg episode reward: [(0, '-5.613')] [2022-07-10 08:30:33,544][26022] Updated weights on worker 0-0, policy_version 646282 (0.00096) [2022-07-10 08:30:35,035][26022] Updated weights on worker 0-0, policy_version 646292 (0.00091) [2022-07-10 08:30:36,836][25689] Fps is (10 sec: 5389.5, 60 sec: 5530.5, 300 sec: 5519.6). Total num frames: 661810176. Throughput: 0: 5815.7. Samples: 661812170. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:36,836][25689] Avg episode reward: [(0, '-6.706')] [2022-07-10 08:30:37,204][26022] Updated weights on worker 0-0, policy_version 646302 (0.00091) [2022-07-10 08:30:38,821][26022] Updated weights on worker 0-0, policy_version 646312 (0.00086) [2022-07-10 08:30:40,821][26022] Updated weights on worker 0-0, policy_version 646322 (0.00094) [2022-07-10 08:30:41,850][25689] Fps is (10 sec: 5514.2, 60 sec: 5529.7, 300 sec: 5526.7). Total num frames: 661839872. Throughput: 0: 5800.2. Samples: 661845496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:41,850][25689] Avg episode reward: [(0, '-6.546')] [2022-07-10 08:30:42,644][26022] Updated weights on worker 0-0, policy_version 646332 (0.00089) [2022-07-10 08:30:44,392][26022] Updated weights on worker 0-0, policy_version 646342 (0.00085) [2022-07-10 08:30:46,200][26022] Updated weights on worker 0-0, policy_version 646352 (0.00096) [2022-07-10 08:30:46,855][25689] Fps is (10 sec: 5723.7, 60 sec: 5518.8, 300 sec: 5525.9). Total num frames: 661867520. Throughput: 0: 4981.2. Samples: 661862282. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 08:30:46,855][25689] Avg episode reward: [(0, '-5.801')] [2022-07-10 08:30:48,071][26022] Updated weights on worker 0-0, policy_version 646362 (0.00082) [2022-07-10 08:30:50,006][26022] Updated weights on worker 0-0, policy_version 646372 (0.00091) [2022-07-10 08:30:51,902][25689] Fps is (10 sec: 5500.8, 60 sec: 5536.6, 300 sec: 5532.2). Total num frames: 661895168. Throughput: 0: 5809.7. Samples: 661895688. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:30:51,902][25689] Avg episode reward: [(0, '-6.449')] [2022-07-10 08:30:51,904][26022] Updated weights on worker 0-0, policy_version 646382 (0.00088) [2022-07-10 08:30:53,733][26022] Updated weights on worker 0-0, policy_version 646392 (0.00091) [2022-07-10 08:30:55,580][26022] Updated weights on worker 0-0, policy_version 646402 (0.00087) [2022-07-10 08:30:56,906][25689] Fps is (10 sec: 5603.3, 60 sec: 5572.6, 300 sec: 5528.8). Total num frames: 661923840. Throughput: 0: 5822.7. Samples: 661929032. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:30:56,906][25689] Avg episode reward: [(0, '-7.290')] [2022-07-10 08:30:57,271][26022] Updated weights on worker 0-0, policy_version 646412 (0.00078) [2022-07-10 08:30:59,287][26022] Updated weights on worker 0-0, policy_version 646422 (0.00088) [2022-07-10 08:31:01,079][26022] Updated weights on worker 0-0, policy_version 646432 (0.00085) [2022-07-10 08:31:01,921][25689] Fps is (10 sec: 5417.0, 60 sec: 5493.2, 300 sec: 5530.0). Total num frames: 661949440. Throughput: 0: 5003.2. Samples: 661945916. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:01,921][25689] Avg episode reward: [(0, '-4.104')] [2022-07-10 08:31:03,251][26022] Updated weights on worker 0-0, policy_version 646442 (0.00100) [2022-07-10 08:31:04,859][26022] Updated weights on worker 0-0, policy_version 646452 (0.00097) [2022-07-10 08:31:06,933][25689] Fps is (10 sec: 5208.1, 60 sec: 5510.2, 300 sec: 5524.2). Total num frames: 661976064. Throughput: 0: 5729.2. Samples: 661977316. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:06,935][25689] Avg episode reward: [(0, '-4.282')] [2022-07-10 08:31:06,978][26022] Updated weights on worker 0-0, policy_version 646462 (0.00089) [2022-07-10 08:31:08,675][26022] Updated weights on worker 0-0, policy_version 646472 (0.00084) [2022-07-10 08:31:10,541][26022] Updated weights on worker 0-0, policy_version 646482 (0.00086) [2022-07-10 08:31:12,001][25689] Fps is (10 sec: 5485.6, 60 sec: 5515.3, 300 sec: 5523.0). Total num frames: 662004736. Throughput: 0: 5723.9. Samples: 662010734. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:12,001][25689] Avg episode reward: [(0, '-5.603')] [2022-07-10 08:31:12,356][26022] Updated weights on worker 0-0, policy_version 646492 (0.00091) [2022-07-10 08:31:14,149][26022] Updated weights on worker 0-0, policy_version 646502 (0.00086) [2022-07-10 08:31:16,155][26022] Updated weights on worker 0-0, policy_version 646512 (0.00088) [2022-07-10 08:31:17,013][25689] Fps is (10 sec: 5688.8, 60 sec: 5517.9, 300 sec: 5529.9). Total num frames: 662033408. Throughput: 0: 4900.9. Samples: 662027576. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:17,015][25689] Avg episode reward: [(0, '-5.593')] [2022-07-10 08:31:17,893][26022] Updated weights on worker 0-0, policy_version 646522 (0.00093) [2022-07-10 08:31:19,779][26022] Updated weights on worker 0-0, policy_version 646532 (0.00087) [2022-07-10 08:31:21,568][26022] Updated weights on worker 0-0, policy_version 646542 (0.00082) [2022-07-10 08:31:22,024][25689] Fps is (10 sec: 5516.9, 60 sec: 5522.0, 300 sec: 5520.1). Total num frames: 662060032. Throughput: 0: 5717.3. Samples: 662060852. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:22,025][25689] Avg episode reward: [(0, '-5.265')] [2022-07-10 08:31:23,500][26022] Updated weights on worker 0-0, policy_version 646552 (0.00098) [2022-07-10 08:31:25,471][26022] Updated weights on worker 0-0, policy_version 646562 (0.00098) [2022-07-10 08:31:27,042][25689] Fps is (10 sec: 5411.9, 60 sec: 5504.3, 300 sec: 5525.5). Total num frames: 662087680. Throughput: 0: 5801.7. Samples: 662093980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:27,042][25689] Avg episode reward: [(0, '-6.757')] [2022-07-10 08:31:27,256][26022] Updated weights on worker 0-0, policy_version 646572 (0.00098) [2022-07-10 08:31:28,980][26022] Updated weights on worker 0-0, policy_version 646582 (0.00093) [2022-07-10 08:31:30,995][26022] Updated weights on worker 0-0, policy_version 646592 (0.00094) [2022-07-10 08:31:32,084][25689] Fps is (10 sec: 5598.3, 60 sec: 5505.5, 300 sec: 5528.4). Total num frames: 662116352. Throughput: 0: 4969.7. Samples: 662110542. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:32,085][25689] Avg episode reward: [(0, '-9.131')] [2022-07-10 08:31:32,721][26022] Updated weights on worker 0-0, policy_version 646602 (0.00083) [2022-07-10 08:31:34,696][26022] Updated weights on worker 0-0, policy_version 646612 (0.00115) [2022-07-10 08:31:36,361][26022] Updated weights on worker 0-0, policy_version 646622 (0.00091) [2022-07-10 08:31:37,099][25689] Fps is (10 sec: 5498.2, 60 sec: 5522.5, 300 sec: 5525.5). Total num frames: 662142976. Throughput: 0: 5779.5. Samples: 662143660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:37,099][25689] Avg episode reward: [(0, '-8.047')] [2022-07-10 08:31:38,467][26022] Updated weights on worker 0-0, policy_version 646632 (0.00095) [2022-07-10 08:31:40,276][26022] Updated weights on worker 0-0, policy_version 646642 (0.00084) [2022-07-10 08:31:42,112][25689] Fps is (10 sec: 5412.3, 60 sec: 5488.6, 300 sec: 5525.5). Total num frames: 662170624. Throughput: 0: 5780.6. Samples: 662176970. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:42,112][25689] Avg episode reward: [(0, '-7.380')] [2022-07-10 08:31:42,135][26022] Updated weights on worker 0-0, policy_version 646652 (0.00087) [2022-07-10 08:31:43,874][26022] Updated weights on worker 0-0, policy_version 646662 (0.00086) [2022-07-10 08:31:45,902][26022] Updated weights on worker 0-0, policy_version 646672 (0.00088) [2022-07-10 08:31:47,121][25689] Fps is (10 sec: 5619.6, 60 sec: 5505.2, 300 sec: 5529.8). Total num frames: 662199296. Throughput: 0: 4958.2. Samples: 662193536. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:47,122][25689] Avg episode reward: [(0, '-6.499')] [2022-07-10 08:31:47,608][26022] Updated weights on worker 0-0, policy_version 646682 (0.00083) [2022-07-10 08:31:49,528][26022] Updated weights on worker 0-0, policy_version 646692 (0.00096) [2022-07-10 08:31:51,350][26022] Updated weights on worker 0-0, policy_version 646702 (0.00090) [2022-07-10 08:31:52,164][25689] Fps is (10 sec: 5602.4, 60 sec: 5505.5, 300 sec: 5529.5). Total num frames: 662226944. Throughput: 0: 5797.8. Samples: 662226964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:52,165][25689] Avg episode reward: [(0, '-7.240')] [2022-07-10 08:31:53,094][26022] Updated weights on worker 0-0, policy_version 646712 (0.00084) [2022-07-10 08:31:55,006][26022] Updated weights on worker 0-0, policy_version 646722 (0.00087) [2022-07-10 08:31:56,727][26022] Updated weights on worker 0-0, policy_version 646732 (0.00091) [2022-07-10 08:31:57,182][25689] Fps is (10 sec: 5597.5, 60 sec: 5504.2, 300 sec: 5529.4). Total num frames: 662255616. Throughput: 0: 5826.4. Samples: 662260676. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:31:57,183][25689] Avg episode reward: [(0, '-5.075')] [2022-07-10 08:31:58,594][26022] Updated weights on worker 0-0, policy_version 646742 (0.00086) [2022-07-10 08:32:00,498][26022] Updated weights on worker 0-0, policy_version 646752 (0.00086) [2022-07-10 08:32:02,199][25689] Fps is (10 sec: 5408.2, 60 sec: 5504.0, 300 sec: 5532.7). Total num frames: 662281216. Throughput: 0: 5005.7. Samples: 662277524. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:02,200][25689] Avg episode reward: [(0, '-4.860')] [2022-07-10 08:32:02,684][26022] Updated weights on worker 0-0, policy_version 646762 (0.00095) [2022-07-10 08:32:04,640][26022] Updated weights on worker 0-0, policy_version 646772 (0.00088) [2022-07-10 08:32:06,357][26022] Updated weights on worker 0-0, policy_version 646782 (0.00088) [2022-07-10 08:32:07,220][25689] Fps is (10 sec: 5304.8, 60 sec: 5520.3, 300 sec: 5531.3). Total num frames: 662308864. Throughput: 0: 5742.2. Samples: 662308950. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:07,221][25689] Avg episode reward: [(0, '-4.700')] [2022-07-10 08:32:08,215][26022] Updated weights on worker 0-0, policy_version 646792 (0.00085) [2022-07-10 08:32:10,242][26022] Updated weights on worker 0-0, policy_version 646802 (0.00090) [2022-07-10 08:32:11,679][26022] Updated weights on worker 0-0, policy_version 646812 (0.00091) [2022-07-10 08:32:12,263][25689] Fps is (10 sec: 5596.2, 60 sec: 5522.5, 300 sec: 5530.7). Total num frames: 662337536. Throughput: 0: 5724.3. Samples: 662342018. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:12,264][25689] Avg episode reward: [(0, '-5.655')] [2022-07-10 08:32:13,966][26022] Updated weights on worker 0-0, policy_version 646822 (0.00087) [2022-07-10 08:32:15,381][26022] Updated weights on worker 0-0, policy_version 646832 (0.00087) [2022-07-10 08:32:16,689][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:32:16,702][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000646838_662362112.pth [2022-07-10 08:32:16,702][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000644895_660372480.pth [2022-07-10 08:32:17,275][25689] Fps is (10 sec: 5499.6, 60 sec: 5488.6, 300 sec: 5520.9). Total num frames: 662364160. Throughput: 0: 4891.2. Samples: 662358952. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:17,275][25689] Avg episode reward: [(0, '-6.839')] [2022-07-10 08:32:17,450][26022] Updated weights on worker 0-0, policy_version 646842 (0.00088) [2022-07-10 08:32:19,020][26022] Updated weights on worker 0-0, policy_version 646852 (0.00092) [2022-07-10 08:32:21,061][26022] Updated weights on worker 0-0, policy_version 646862 (0.00092) [2022-07-10 08:32:22,285][25689] Fps is (10 sec: 5620.2, 60 sec: 5539.7, 300 sec: 5535.2). Total num frames: 662393856. Throughput: 0: 5728.6. Samples: 662392584. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:22,285][25689] Avg episode reward: [(0, '-5.398')] [2022-07-10 08:32:22,865][26022] Updated weights on worker 0-0, policy_version 646872 (0.00095) [2022-07-10 08:32:24,676][26022] Updated weights on worker 0-0, policy_version 646882 (0.00095) [2022-07-10 08:32:26,418][26022] Updated weights on worker 0-0, policy_version 646892 (0.00091) [2022-07-10 08:32:27,312][25689] Fps is (10 sec: 5611.1, 60 sec: 5521.8, 300 sec: 5525.6). Total num frames: 662420480. Throughput: 0: 5824.1. Samples: 662425966. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:27,312][25689] Avg episode reward: [(0, '-5.485')] [2022-07-10 08:32:28,391][26022] Updated weights on worker 0-0, policy_version 646902 (0.00090) [2022-07-10 08:32:30,188][26022] Updated weights on worker 0-0, policy_version 646912 (0.00085) [2022-07-10 08:32:32,100][26022] Updated weights on worker 0-0, policy_version 646922 (0.00079) [2022-07-10 08:32:32,366][25689] Fps is (10 sec: 5484.8, 60 sec: 5520.7, 300 sec: 5528.5). Total num frames: 662449152. Throughput: 0: 5007.7. Samples: 662442686. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:32,367][25689] Avg episode reward: [(0, '-4.546')] [2022-07-10 08:32:33,729][26022] Updated weights on worker 0-0, policy_version 646932 (0.00090) [2022-07-10 08:32:35,720][26022] Updated weights on worker 0-0, policy_version 646942 (0.00087) [2022-07-10 08:32:37,382][25689] Fps is (10 sec: 5592.7, 60 sec: 5537.6, 300 sec: 5525.2). Total num frames: 662476800. Throughput: 0: 5838.9. Samples: 662476358. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:37,383][25689] Avg episode reward: [(0, '-5.224')] [2022-07-10 08:32:37,513][26022] Updated weights on worker 0-0, policy_version 646952 (0.00093) [2022-07-10 08:32:39,369][26022] Updated weights on worker 0-0, policy_version 646962 (0.00085) [2022-07-10 08:32:41,270][26022] Updated weights on worker 0-0, policy_version 646972 (0.00088) [2022-07-10 08:32:42,390][25689] Fps is (10 sec: 5618.7, 60 sec: 5555.0, 300 sec: 5528.9). Total num frames: 662505472. Throughput: 0: 5826.4. Samples: 662509726. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:42,390][25689] Avg episode reward: [(0, '-4.377')] [2022-07-10 08:32:43,164][26022] Updated weights on worker 0-0, policy_version 646982 (0.00092) [2022-07-10 08:32:44,891][26022] Updated weights on worker 0-0, policy_version 646992 (0.00094) [2022-07-10 08:32:46,832][26022] Updated weights on worker 0-0, policy_version 647002 (0.00090) [2022-07-10 08:32:47,400][25689] Fps is (10 sec: 5621.9, 60 sec: 5537.9, 300 sec: 5530.0). Total num frames: 662533120. Throughput: 0: 5004.4. Samples: 662526496. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:47,401][25689] Avg episode reward: [(0, '-4.052')] [2022-07-10 08:32:48,623][26022] Updated weights on worker 0-0, policy_version 647012 (0.00092) [2022-07-10 08:32:50,506][26022] Updated weights on worker 0-0, policy_version 647022 (0.00091) [2022-07-10 08:32:52,336][26022] Updated weights on worker 0-0, policy_version 647032 (0.00103) [2022-07-10 08:32:52,454][25689] Fps is (10 sec: 5494.3, 60 sec: 5537.0, 300 sec: 5526.3). Total num frames: 662560768. Throughput: 0: 5827.3. Samples: 662559744. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:52,455][25689] Avg episode reward: [(0, '-5.026')] [2022-07-10 08:32:54,087][26022] Updated weights on worker 0-0, policy_version 647042 (0.00093) [2022-07-10 08:32:56,054][26022] Updated weights on worker 0-0, policy_version 647052 (0.00054) [2022-07-10 08:32:57,465][25689] Fps is (10 sec: 5595.6, 60 sec: 5537.6, 300 sec: 5530.4). Total num frames: 662589440. Throughput: 0: 5815.8. Samples: 662593158. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:32:57,466][25689] Avg episode reward: [(0, '-5.122')] [2022-07-10 08:32:58,087][26022] Updated weights on worker 0-0, policy_version 647062 (0.00082) [2022-07-10 08:32:59,692][26022] Updated weights on worker 0-0, policy_version 647072 (0.00086) [2022-07-10 08:33:01,797][26022] Updated weights on worker 0-0, policy_version 647082 (0.00084) [2022-07-10 08:33:02,475][25689] Fps is (10 sec: 5313.9, 60 sec: 5521.3, 300 sec: 5527.3). Total num frames: 662614016. Throughput: 0: 4985.1. Samples: 662609850. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:02,475][25689] Avg episode reward: [(0, '-5.927')] [2022-07-10 08:33:03,687][26022] Updated weights on worker 0-0, policy_version 647092 (0.00091) [2022-07-10 08:33:05,735][26022] Updated weights on worker 0-0, policy_version 647102 (0.00096) [2022-07-10 08:33:07,491][25689] Fps is (10 sec: 5209.1, 60 sec: 5521.8, 300 sec: 5525.5). Total num frames: 662641664. Throughput: 0: 5706.8. Samples: 662641148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:07,491][25689] Avg episode reward: [(0, '-5.455')] [2022-07-10 08:33:07,518][26022] Updated weights on worker 0-0, policy_version 647112 (0.00113) [2022-07-10 08:33:09,336][26022] Updated weights on worker 0-0, policy_version 647122 (0.00101) [2022-07-10 08:33:11,365][26022] Updated weights on worker 0-0, policy_version 647132 (0.00095) [2022-07-10 08:33:12,547][25689] Fps is (10 sec: 5693.4, 60 sec: 5537.6, 300 sec: 5535.7). Total num frames: 662671360. Throughput: 0: 5698.2. Samples: 662674234. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:12,547][25689] Avg episode reward: [(0, '-5.342')] [2022-07-10 08:33:12,975][26022] Updated weights on worker 0-0, policy_version 647142 (0.00092) [2022-07-10 08:33:14,951][26022] Updated weights on worker 0-0, policy_version 647152 (0.00086) [2022-07-10 08:33:16,892][26022] Updated weights on worker 0-0, policy_version 647162 (0.00085) [2022-07-10 08:33:17,563][25689] Fps is (10 sec: 5490.1, 60 sec: 5520.2, 300 sec: 5525.1). Total num frames: 662696960. Throughput: 0: 4867.8. Samples: 662690988. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:17,563][25689] Avg episode reward: [(0, '-6.311')] [2022-07-10 08:33:18,682][26022] Updated weights on worker 0-0, policy_version 647172 (0.00087) [2022-07-10 08:33:20,603][26022] Updated weights on worker 0-0, policy_version 647182 (0.00087) [2022-07-10 08:33:22,330][26022] Updated weights on worker 0-0, policy_version 647192 (0.00095) [2022-07-10 08:33:22,571][25689] Fps is (10 sec: 5516.4, 60 sec: 5520.3, 300 sec: 5532.1). Total num frames: 662726656. Throughput: 0: 5691.1. Samples: 662724218. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:22,571][25689] Avg episode reward: [(0, '-6.005')] [2022-07-10 08:33:24,091][26022] Updated weights on worker 0-0, policy_version 647202 (0.00054) [2022-07-10 08:33:26,046][26022] Updated weights on worker 0-0, policy_version 647212 (0.00089) [2022-07-10 08:33:27,596][25689] Fps is (10 sec: 5613.1, 60 sec: 5520.5, 300 sec: 5526.6). Total num frames: 662753280. Throughput: 0: 5795.0. Samples: 662757660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:27,597][25689] Avg episode reward: [(0, '-5.919')] [2022-07-10 08:33:27,988][26022] Updated weights on worker 0-0, policy_version 647222 (0.00092) [2022-07-10 08:33:29,735][26022] Updated weights on worker 0-0, policy_version 647232 (0.00083) [2022-07-10 08:33:31,508][26022] Updated weights on worker 0-0, policy_version 647242 (0.00096) [2022-07-10 08:33:32,652][25689] Fps is (10 sec: 5383.6, 60 sec: 5503.4, 300 sec: 5523.7). Total num frames: 662780928. Throughput: 0: 5785.6. Samples: 662790554. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:32,652][25689] Avg episode reward: [(0, '-6.184')] [2022-07-10 08:33:33,403][26022] Updated weights on worker 0-0, policy_version 647252 (0.00092) [2022-07-10 08:33:35,364][26022] Updated weights on worker 0-0, policy_version 647262 (0.00091) [2022-07-10 08:33:37,168][26022] Updated weights on worker 0-0, policy_version 647272 (0.00099) [2022-07-10 08:33:37,667][25689] Fps is (10 sec: 5491.0, 60 sec: 5503.5, 300 sec: 5523.4). Total num frames: 662808576. Throughput: 0: 5789.3. Samples: 662807376. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:37,667][25689] Avg episode reward: [(0, '-6.215')] [2022-07-10 08:33:39,108][26022] Updated weights on worker 0-0, policy_version 647282 (0.00089) [2022-07-10 08:33:40,894][26022] Updated weights on worker 0-0, policy_version 647292 (0.00097) [2022-07-10 08:33:42,690][25689] Fps is (10 sec: 5508.5, 60 sec: 5485.1, 300 sec: 5517.2). Total num frames: 662836224. Throughput: 0: 5770.4. Samples: 662840316. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:42,691][25689] Avg episode reward: [(0, '-5.680')] [2022-07-10 08:33:42,848][26022] Updated weights on worker 0-0, policy_version 647302 (0.00092) [2022-07-10 08:33:44,539][26022] Updated weights on worker 0-0, policy_version 647312 (0.00091) [2022-07-10 08:33:46,496][26022] Updated weights on worker 0-0, policy_version 647322 (0.00088) [2022-07-10 08:33:47,708][25689] Fps is (10 sec: 5608.6, 60 sec: 5501.3, 300 sec: 5525.1). Total num frames: 662864896. Throughput: 0: 5765.8. Samples: 662873622. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:47,709][25689] Avg episode reward: [(0, '-5.059')] [2022-07-10 08:33:48,145][26022] Updated weights on worker 0-0, policy_version 647332 (0.00081) [2022-07-10 08:33:50,239][26022] Updated weights on worker 0-0, policy_version 647342 (0.00088) [2022-07-10 08:33:52,182][26022] Updated weights on worker 0-0, policy_version 647352 (0.00087) [2022-07-10 08:33:52,757][25689] Fps is (10 sec: 5492.7, 60 sec: 5484.8, 300 sec: 5515.6). Total num frames: 662891520. Throughput: 0: 4956.1. Samples: 662890198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:52,759][25689] Avg episode reward: [(0, '-5.284')] [2022-07-10 08:33:53,994][26022] Updated weights on worker 0-0, policy_version 647362 (0.00088) [2022-07-10 08:33:55,835][26022] Updated weights on worker 0-0, policy_version 647372 (0.00092) [2022-07-10 08:33:57,486][26022] Updated weights on worker 0-0, policy_version 647382 (0.00087) [2022-07-10 08:33:57,778][25689] Fps is (10 sec: 5491.6, 60 sec: 5484.0, 300 sec: 5519.2). Total num frames: 662920192. Throughput: 0: 5753.9. Samples: 662923092. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:33:57,778][25689] Avg episode reward: [(0, '-3.692')] [2022-07-10 08:33:59,449][26022] Updated weights on worker 0-0, policy_version 647392 (0.00096) [2022-07-10 08:34:01,301][26022] Updated weights on worker 0-0, policy_version 647402 (0.00087) [2022-07-10 08:34:02,789][25689] Fps is (10 sec: 5410.1, 60 sec: 5500.8, 300 sec: 5522.6). Total num frames: 662945792. Throughput: 0: 5689.4. Samples: 662954666. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2022-07-10 08:34:02,789][25689] Avg episode reward: [(0, '-3.388')] [2022-07-10 08:34:03,409][26022] Updated weights on worker 0-0, policy_version 647412 (0.00090) [2022-07-10 08:34:05,257][26022] Updated weights on worker 0-0, policy_version 647422 (0.00086) [2022-07-10 08:34:07,276][26022] Updated weights on worker 0-0, policy_version 647432 (0.00084) [2022-07-10 08:34:07,799][25689] Fps is (10 sec: 5211.4, 60 sec: 5484.4, 300 sec: 5512.9). Total num frames: 662972416. Throughput: 0: 4861.3. Samples: 662971286. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:07,799][25689] Avg episode reward: [(0, '-1.771')] [2022-07-10 08:34:09,025][26022] Updated weights on worker 0-0, policy_version 647442 (0.00094) [2022-07-10 08:34:10,988][26022] Updated weights on worker 0-0, policy_version 647452 (0.00083) [2022-07-10 08:34:12,699][26022] Updated weights on worker 0-0, policy_version 647462 (0.00323) [2022-07-10 08:34:12,866][25689] Fps is (10 sec: 5588.7, 60 sec: 5483.3, 300 sec: 5518.9). Total num frames: 663002112. Throughput: 0: 5681.3. Samples: 663004444. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:12,867][25689] Avg episode reward: [(0, '-1.261')] [2022-07-10 08:34:14,642][26022] Updated weights on worker 0-0, policy_version 647472 (0.00091) [2022-07-10 08:34:16,311][26022] Updated weights on worker 0-0, policy_version 647482 (0.00425) [2022-07-10 08:34:16,866][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:34:16,876][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000647485_663024640.pth [2022-07-10 08:34:16,877][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000645542_661035008.pth [2022-07-10 08:34:17,895][25689] Fps is (10 sec: 5578.5, 60 sec: 5499.2, 300 sec: 5515.7). Total num frames: 663028736. Throughput: 0: 5717.2. Samples: 663038106. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:17,895][25689] Avg episode reward: [(0, '-0.919')] [2022-07-10 08:34:18,229][26022] Updated weights on worker 0-0, policy_version 647492 (0.00096) [2022-07-10 08:34:19,962][26022] Updated weights on worker 0-0, policy_version 647502 (0.00084) [2022-07-10 08:34:21,954][26022] Updated weights on worker 0-0, policy_version 647512 (0.00094) [2022-07-10 08:34:22,901][25689] Fps is (10 sec: 5510.2, 60 sec: 5482.3, 300 sec: 5515.6). Total num frames: 663057408. Throughput: 0: 4994.6. Samples: 663055120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:22,902][25689] Avg episode reward: [(0, '-1.683')] [2022-07-10 08:34:23,627][26022] Updated weights on worker 0-0, policy_version 647522 (0.00081) [2022-07-10 08:34:25,532][26022] Updated weights on worker 0-0, policy_version 647532 (0.00092) [2022-07-10 08:34:27,437][26022] Updated weights on worker 0-0, policy_version 647542 (0.00091) [2022-07-10 08:34:27,906][25689] Fps is (10 sec: 5625.6, 60 sec: 5501.2, 300 sec: 5513.1). Total num frames: 663085056. Throughput: 0: 5836.3. Samples: 663088638. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:27,907][25689] Avg episode reward: [(0, '-2.959')] [2022-07-10 08:34:29,121][26022] Updated weights on worker 0-0, policy_version 647552 (0.00089) [2022-07-10 08:34:31,114][26022] Updated weights on worker 0-0, policy_version 647562 (0.00091) [2022-07-10 08:34:32,724][26022] Updated weights on worker 0-0, policy_version 647572 (0.00086) [2022-07-10 08:34:33,007][25689] Fps is (10 sec: 5573.2, 60 sec: 5514.0, 300 sec: 5521.8). Total num frames: 663113728. Throughput: 0: 5835.6. Samples: 663121978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:33,007][25689] Avg episode reward: [(0, '-4.438')] [2022-07-10 08:34:34,842][26022] Updated weights on worker 0-0, policy_version 647582 (0.00090) [2022-07-10 08:34:36,538][26022] Updated weights on worker 0-0, policy_version 647592 (0.00086) [2022-07-10 08:34:38,037][25689] Fps is (10 sec: 5458.0, 60 sec: 5495.7, 300 sec: 5510.9). Total num frames: 663140352. Throughput: 0: 5004.7. Samples: 663138914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:38,039][25689] Avg episode reward: [(0, '-4.556')] [2022-07-10 08:34:38,440][26022] Updated weights on worker 0-0, policy_version 647602 (0.00093) [2022-07-10 08:34:40,485][26022] Updated weights on worker 0-0, policy_version 647612 (0.00085) [2022-07-10 08:34:42,154][26022] Updated weights on worker 0-0, policy_version 647622 (0.00086) [2022-07-10 08:34:43,045][25689] Fps is (10 sec: 5508.9, 60 sec: 5514.1, 300 sec: 5512.1). Total num frames: 663169024. Throughput: 0: 5793.1. Samples: 663171812. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:43,046][25689] Avg episode reward: [(0, '-4.465')] [2022-07-10 08:34:44,084][26022] Updated weights on worker 0-0, policy_version 647632 (0.00098) [2022-07-10 08:34:45,884][26022] Updated weights on worker 0-0, policy_version 647642 (0.00087) [2022-07-10 08:34:47,884][26022] Updated weights on worker 0-0, policy_version 647652 (0.00089) [2022-07-10 08:34:48,056][25689] Fps is (10 sec: 5621.4, 60 sec: 5497.7, 300 sec: 5516.4). Total num frames: 663196672. Throughput: 0: 5782.7. Samples: 663205160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:48,058][25689] Avg episode reward: [(0, '-4.117')] [2022-07-10 08:34:49,565][26022] Updated weights on worker 0-0, policy_version 647662 (0.00103) [2022-07-10 08:34:51,622][26022] Updated weights on worker 0-0, policy_version 647672 (0.00094) [2022-07-10 08:34:53,188][25689] Fps is (10 sec: 5552.2, 60 sec: 5524.0, 300 sec: 5521.4). Total num frames: 663225344. Throughput: 0: 4945.3. Samples: 663221782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:53,189][25689] Avg episode reward: [(0, '-4.857')] [2022-07-10 08:34:53,259][26022] Updated weights on worker 0-0, policy_version 647682 (0.00088) [2022-07-10 08:34:55,172][26022] Updated weights on worker 0-0, policy_version 647692 (0.00095) [2022-07-10 08:34:57,023][26022] Updated weights on worker 0-0, policy_version 647702 (0.00088) [2022-07-10 08:34:58,215][25689] Fps is (10 sec: 5644.9, 60 sec: 5523.5, 300 sec: 5515.3). Total num frames: 663254016. Throughput: 0: 5757.0. Samples: 663255076. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:34:58,215][25689] Avg episode reward: [(0, '-4.552')] [2022-07-10 08:34:58,858][26022] Updated weights on worker 0-0, policy_version 647712 (0.00086) [2022-07-10 08:35:00,686][26022] Updated weights on worker 0-0, policy_version 647722 (0.00086) [2022-07-10 08:35:02,931][26022] Updated weights on worker 0-0, policy_version 647732 (0.00102) [2022-07-10 08:35:03,223][25689] Fps is (10 sec: 5204.5, 60 sec: 5489.9, 300 sec: 5508.5). Total num frames: 663277568. Throughput: 0: 5681.7. Samples: 663286458. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:03,225][25689] Avg episode reward: [(0, '-6.939')] [2022-07-10 08:35:04,689][26022] Updated weights on worker 0-0, policy_version 647742 (0.00091) [2022-07-10 08:35:06,551][26022] Updated weights on worker 0-0, policy_version 647752 (0.00943) [2022-07-10 08:35:08,243][25689] Fps is (10 sec: 5207.8, 60 sec: 5522.9, 300 sec: 5510.4). Total num frames: 663306240. Throughput: 0: 4853.7. Samples: 663303140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:08,243][25689] Avg episode reward: [(0, '-6.971')] [2022-07-10 08:35:08,500][26022] Updated weights on worker 0-0, policy_version 647762 (0.00091) [2022-07-10 08:35:10,296][26022] Updated weights on worker 0-0, policy_version 647772 (0.00088) [2022-07-10 08:35:12,203][26022] Updated weights on worker 0-0, policy_version 647782 (0.00087) [2022-07-10 08:35:13,303][25689] Fps is (10 sec: 5587.4, 60 sec: 5489.7, 300 sec: 5506.6). Total num frames: 663333888. Throughput: 0: 5684.1. Samples: 663336114. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:13,304][25689] Avg episode reward: [(0, '-7.900')] [2022-07-10 08:35:13,975][26022] Updated weights on worker 0-0, policy_version 647792 (0.00094) [2022-07-10 08:35:15,843][26022] Updated weights on worker 0-0, policy_version 647802 (0.00092) [2022-07-10 08:35:17,715][26022] Updated weights on worker 0-0, policy_version 647812 (0.00092) [2022-07-10 08:35:18,327][25689] Fps is (10 sec: 5585.2, 60 sec: 5524.0, 300 sec: 5514.1). Total num frames: 663362560. Throughput: 0: 5689.2. Samples: 663369496. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:18,327][25689] Avg episode reward: [(0, '-8.316')] [2022-07-10 08:35:19,704][26022] Updated weights on worker 0-0, policy_version 647822 (0.00084) [2022-07-10 08:35:21,333][26022] Updated weights on worker 0-0, policy_version 647832 (0.00495) [2022-07-10 08:35:23,345][25689] Fps is (10 sec: 5506.0, 60 sec: 5489.0, 300 sec: 5507.1). Total num frames: 663389184. Throughput: 0: 4956.1. Samples: 663386188. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:23,346][25689] Avg episode reward: [(0, '-7.555')] [2022-07-10 08:35:23,487][26022] Updated weights on worker 0-0, policy_version 647842 (0.00091) [2022-07-10 08:35:24,834][26022] Updated weights on worker 0-0, policy_version 647852 (0.00088) [2022-07-10 08:35:27,020][26022] Updated weights on worker 0-0, policy_version 647862 (0.00090) [2022-07-10 08:35:28,353][25689] Fps is (10 sec: 5617.1, 60 sec: 5522.6, 300 sec: 5511.4). Total num frames: 663418880. Throughput: 0: 5785.2. Samples: 663419482. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:28,354][25689] Avg episode reward: [(0, '-7.962')] [2022-07-10 08:35:28,549][26022] Updated weights on worker 0-0, policy_version 647872 (0.00094) [2022-07-10 08:35:30,837][26022] Updated weights on worker 0-0, policy_version 647882 (0.00088) [2022-07-10 08:35:32,460][26022] Updated weights on worker 0-0, policy_version 647892 (0.00085) [2022-07-10 08:35:33,432][25689] Fps is (10 sec: 5685.3, 60 sec: 5507.7, 300 sec: 5517.1). Total num frames: 663446528. Throughput: 0: 5805.4. Samples: 663452972. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:33,433][25689] Avg episode reward: [(0, '-7.544')] [2022-07-10 08:35:34,319][26022] Updated weights on worker 0-0, policy_version 647902 (0.00095) [2022-07-10 08:35:36,033][26022] Updated weights on worker 0-0, policy_version 647912 (0.00088) [2022-07-10 08:35:38,088][26022] Updated weights on worker 0-0, policy_version 647922 (0.00097) [2022-07-10 08:35:38,454][25689] Fps is (10 sec: 5373.2, 60 sec: 5508.4, 300 sec: 5506.6). Total num frames: 663473152. Throughput: 0: 4982.1. Samples: 663469770. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:38,454][25689] Avg episode reward: [(0, '-7.869')] [2022-07-10 08:35:39,667][26022] Updated weights on worker 0-0, policy_version 647932 (0.00089) [2022-07-10 08:35:41,727][26022] Updated weights on worker 0-0, policy_version 647942 (0.00084) [2022-07-10 08:35:43,263][26022] Updated weights on worker 0-0, policy_version 647952 (0.00086) [2022-07-10 08:35:43,474][25689] Fps is (10 sec: 5608.2, 60 sec: 5524.2, 300 sec: 5513.2). Total num frames: 663502848. Throughput: 0: 5816.0. Samples: 663503256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:43,475][25689] Avg episode reward: [(0, '-8.672')] [2022-07-10 08:35:45,454][26022] Updated weights on worker 0-0, policy_version 647962 (0.00088) [2022-07-10 08:35:47,118][26022] Updated weights on worker 0-0, policy_version 647972 (0.00089) [2022-07-10 08:35:48,481][25689] Fps is (10 sec: 5616.8, 60 sec: 5507.7, 300 sec: 5510.5). Total num frames: 663529472. Throughput: 0: 5826.7. Samples: 663536758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:48,481][25689] Avg episode reward: [(0, '-7.931')] [2022-07-10 08:35:48,918][26022] Updated weights on worker 0-0, policy_version 647982 (0.00087) [2022-07-10 08:35:50,691][26022] Updated weights on worker 0-0, policy_version 647992 (0.00063) [2022-07-10 08:35:52,710][26022] Updated weights on worker 0-0, policy_version 648002 (0.00076) [2022-07-10 08:35:53,627][25689] Fps is (10 sec: 5446.6, 60 sec: 5506.4, 300 sec: 5507.9). Total num frames: 663558144. Throughput: 0: 4968.3. Samples: 663553306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:53,627][25689] Avg episode reward: [(0, '-7.815')] [2022-07-10 08:35:54,472][26022] Updated weights on worker 0-0, policy_version 648012 (0.00094) [2022-07-10 08:35:56,617][26022] Updated weights on worker 0-0, policy_version 648022 (0.00092) [2022-07-10 08:35:58,082][26022] Updated weights on worker 0-0, policy_version 648032 (0.00085) [2022-07-10 08:35:58,685][25689] Fps is (10 sec: 5719.8, 60 sec: 5520.4, 300 sec: 5520.9). Total num frames: 663587840. Throughput: 0: 5766.5. Samples: 663586436. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:35:58,686][25689] Avg episode reward: [(0, '-9.291')] [2022-07-10 08:36:00,208][26022] Updated weights on worker 0-0, policy_version 648042 (0.00088) [2022-07-10 08:36:02,177][26022] Updated weights on worker 0-0, policy_version 648052 (0.00087) [2022-07-10 08:36:03,720][25689] Fps is (10 sec: 5275.5, 60 sec: 5518.0, 300 sec: 5510.1). Total num frames: 663611392. Throughput: 0: 5659.9. Samples: 663617846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:03,721][25689] Avg episode reward: [(0, '-6.607')] [2022-07-10 08:36:04,108][26022] Updated weights on worker 0-0, policy_version 648062 (0.00088) [2022-07-10 08:36:05,883][26022] Updated weights on worker 0-0, policy_version 648072 (0.00095) [2022-07-10 08:36:07,835][26022] Updated weights on worker 0-0, policy_version 648082 (0.00098) [2022-07-10 08:36:08,741][25689] Fps is (10 sec: 5295.5, 60 sec: 5534.9, 300 sec: 5514.4). Total num frames: 663641088. Throughput: 0: 4818.2. Samples: 663634374. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:08,741][25689] Avg episode reward: [(0, '-5.535')] [2022-07-10 08:36:09,841][26022] Updated weights on worker 0-0, policy_version 648092 (0.00087) [2022-07-10 08:36:11,610][26022] Updated weights on worker 0-0, policy_version 648102 (0.00097) [2022-07-10 08:36:13,467][26022] Updated weights on worker 0-0, policy_version 648112 (0.00093) [2022-07-10 08:36:13,789][25689] Fps is (10 sec: 5593.3, 60 sec: 5519.0, 300 sec: 5506.9). Total num frames: 663667712. Throughput: 0: 5665.6. Samples: 663667540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:13,790][25689] Avg episode reward: [(0, '-6.240')] [2022-07-10 08:36:15,305][26022] Updated weights on worker 0-0, policy_version 648122 (0.00088) [2022-07-10 08:36:16,889][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:36:16,902][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000648130_663685120.pth [2022-07-10 08:36:16,902][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000646191_661699584.pth [2022-07-10 08:36:17,039][26022] Updated weights on worker 0-0, policy_version 648132 (0.00084) [2022-07-10 08:36:18,819][25689] Fps is (10 sec: 5486.8, 60 sec: 5518.5, 300 sec: 5513.4). Total num frames: 663696384. Throughput: 0: 5694.9. Samples: 663701094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:18,819][25689] Avg episode reward: [(0, '-6.517')] [2022-07-10 08:36:18,972][26022] Updated weights on worker 0-0, policy_version 648142 (0.00090) [2022-07-10 08:36:20,772][26022] Updated weights on worker 0-0, policy_version 648152 (0.00091) [2022-07-10 08:36:22,639][26022] Updated weights on worker 0-0, policy_version 648162 (0.00094) [2022-07-10 08:36:23,871][25689] Fps is (10 sec: 5687.6, 60 sec: 5549.2, 300 sec: 5516.2). Total num frames: 663725056. Throughput: 0: 4964.5. Samples: 663717888. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:23,872][25689] Avg episode reward: [(0, '-6.304')] [2022-07-10 08:36:24,552][26022] Updated weights on worker 0-0, policy_version 648172 (0.00413) [2022-07-10 08:36:26,461][26022] Updated weights on worker 0-0, policy_version 648182 (0.00096) [2022-07-10 08:36:28,206][26022] Updated weights on worker 0-0, policy_version 648192 (0.00091) [2022-07-10 08:36:28,942][25689] Fps is (10 sec: 5563.3, 60 sec: 5509.7, 300 sec: 5512.2). Total num frames: 663752704. Throughput: 0: 5778.2. Samples: 663751102. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:28,943][25689] Avg episode reward: [(0, '-6.427')] [2022-07-10 08:36:29,974][26022] Updated weights on worker 0-0, policy_version 648202 (0.00082) [2022-07-10 08:36:31,766][26022] Updated weights on worker 0-0, policy_version 648212 (0.00085) [2022-07-10 08:36:33,812][26022] Updated weights on worker 0-0, policy_version 648222 (0.00090) [2022-07-10 08:36:34,004][25689] Fps is (10 sec: 5356.0, 60 sec: 5494.2, 300 sec: 5511.4). Total num frames: 663779328. Throughput: 0: 5767.6. Samples: 663784134. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:34,005][25689] Avg episode reward: [(0, '-5.777')] [2022-07-10 08:36:35,567][26022] Updated weights on worker 0-0, policy_version 648232 (0.00091) [2022-07-10 08:36:37,482][26022] Updated weights on worker 0-0, policy_version 648242 (0.00093) [2022-07-10 08:36:39,009][25689] Fps is (10 sec: 5493.0, 60 sec: 5529.6, 300 sec: 5515.0). Total num frames: 663808000. Throughput: 0: 5765.7. Samples: 663817504. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:39,009][25689] Avg episode reward: [(0, '-5.431')] [2022-07-10 08:36:39,303][26022] Updated weights on worker 0-0, policy_version 648252 (0.00082) [2022-07-10 08:36:41,453][26022] Updated weights on worker 0-0, policy_version 648262 (0.00093) [2022-07-10 08:36:43,166][26022] Updated weights on worker 0-0, policy_version 648272 (0.00096) [2022-07-10 08:36:44,078][25689] Fps is (10 sec: 5591.0, 60 sec: 5491.4, 300 sec: 5510.4). Total num frames: 663835648. Throughput: 0: 5736.9. Samples: 663833810. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:44,078][25689] Avg episode reward: [(0, '-7.683')] [2022-07-10 08:36:45,067][26022] Updated weights on worker 0-0, policy_version 648282 (0.00624) [2022-07-10 08:36:46,751][26022] Updated weights on worker 0-0, policy_version 648292 (0.00092) [2022-07-10 08:36:48,651][26022] Updated weights on worker 0-0, policy_version 648302 (0.00086) [2022-07-10 08:36:49,115][25689] Fps is (10 sec: 5370.2, 60 sec: 5488.6, 300 sec: 5507.1). Total num frames: 663862272. Throughput: 0: 5740.5. Samples: 663866904. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:49,116][25689] Avg episode reward: [(0, '-4.752')] [2022-07-10 08:36:50,625][26022] Updated weights on worker 0-0, policy_version 648312 (0.00086) [2022-07-10 08:36:52,483][26022] Updated weights on worker 0-0, policy_version 648322 (0.00091) [2022-07-10 08:36:54,170][25689] Fps is (10 sec: 5479.3, 60 sec: 5496.9, 300 sec: 5506.4). Total num frames: 663890944. Throughput: 0: 5749.6. Samples: 663900074. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:54,170][25689] Avg episode reward: [(0, '-3.912')] [2022-07-10 08:36:54,232][26022] Updated weights on worker 0-0, policy_version 648332 (0.00081) [2022-07-10 08:36:56,095][26022] Updated weights on worker 0-0, policy_version 648342 (0.00096) [2022-07-10 08:36:58,116][26022] Updated weights on worker 0-0, policy_version 648352 (0.00086) [2022-07-10 08:36:59,176][25689] Fps is (10 sec: 5699.8, 60 sec: 5484.8, 300 sec: 5516.9). Total num frames: 663919616. Throughput: 0: 4917.6. Samples: 663916676. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:36:59,176][25689] Avg episode reward: [(0, '-4.636')] [2022-07-10 08:36:59,879][26022] Updated weights on worker 0-0, policy_version 648362 (0.00089) [2022-07-10 08:37:02,075][26022] Updated weights on worker 0-0, policy_version 648372 (0.00087) [2022-07-10 08:37:03,716][26022] Updated weights on worker 0-0, policy_version 648382 (0.00088) [2022-07-10 08:37:04,215][25689] Fps is (10 sec: 5300.9, 60 sec: 5501.3, 300 sec: 5506.3). Total num frames: 663944192. Throughput: 0: 5673.2. Samples: 663948050. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:37:04,215][25689] Avg episode reward: [(0, '-5.012')] [2022-07-10 08:37:05,675][26022] Updated weights on worker 0-0, policy_version 648392 (0.00088) [2022-07-10 08:37:07,440][26022] Updated weights on worker 0-0, policy_version 648402 (0.00089) [2022-07-10 08:37:09,238][25689] Fps is (10 sec: 5393.3, 60 sec: 5501.0, 300 sec: 5510.1). Total num frames: 663973888. Throughput: 0: 5723.9. Samples: 663982090. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:37:09,239][25689] Avg episode reward: [(0, '-4.336')] [2022-07-10 08:37:09,240][26022] Updated weights on worker 0-0, policy_version 648412 (0.00081) [2022-07-10 08:37:11,191][26022] Updated weights on worker 0-0, policy_version 648422 (0.00088) [2022-07-10 08:37:12,988][26022] Updated weights on worker 0-0, policy_version 648432 (0.00090) [2022-07-10 08:37:14,287][25689] Fps is (10 sec: 5591.3, 60 sec: 5501.0, 300 sec: 5509.4). Total num frames: 664000512. Throughput: 0: 4897.3. Samples: 663998598. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 08:37:14,288][25689] Avg episode reward: [(0, '-4.209')] [2022-07-10 08:37:14,950][26022] Updated weights on worker 0-0, policy_version 648442 (0.00352) [2022-07-10 08:37:16,618][26022] Updated weights on worker 0-0, policy_version 648452 (0.00091) [2022-07-10 08:37:18,657][26022] Updated weights on worker 0-0, policy_version 648462 (0.00088) [2022-07-10 08:37:19,303][25689] Fps is (10 sec: 5494.2, 60 sec: 5502.3, 300 sec: 5505.8). Total num frames: 664029184. Throughput: 0: 5732.8. Samples: 664032062. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:19,303][25689] Avg episode reward: [(0, '-5.418')] [2022-07-10 08:37:20,477][26022] Updated weights on worker 0-0, policy_version 648472 (0.00088) [2022-07-10 08:37:22,191][26022] Updated weights on worker 0-0, policy_version 648482 (0.00092) [2022-07-10 08:37:24,095][26022] Updated weights on worker 0-0, policy_version 648492 (0.00087) [2022-07-10 08:37:24,323][25689] Fps is (10 sec: 5611.9, 60 sec: 5488.3, 300 sec: 5509.4). Total num frames: 664056832. Throughput: 0: 5843.6. Samples: 664065556. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:24,329][25689] Avg episode reward: [(0, '-5.764')] [2022-07-10 08:37:25,819][26022] Updated weights on worker 0-0, policy_version 648502 (0.00081) [2022-07-10 08:37:27,891][26022] Updated weights on worker 0-0, policy_version 648512 (0.00090) [2022-07-10 08:37:29,351][25689] Fps is (10 sec: 5503.1, 60 sec: 5492.2, 300 sec: 5506.4). Total num frames: 664084480. Throughput: 0: 4971.9. Samples: 664082088. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:29,351][25689] Avg episode reward: [(0, '-5.280')] [2022-07-10 08:37:29,623][26022] Updated weights on worker 0-0, policy_version 648522 (0.00093) [2022-07-10 08:37:31,400][26022] Updated weights on worker 0-0, policy_version 648532 (0.00095) [2022-07-10 08:37:33,363][26022] Updated weights on worker 0-0, policy_version 648542 (0.00094) [2022-07-10 08:37:34,413][25689] Fps is (10 sec: 5480.3, 60 sec: 5509.2, 300 sec: 5505.6). Total num frames: 664112128. Throughput: 0: 5812.3. Samples: 664115574. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:34,414][25689] Avg episode reward: [(0, '-5.255')] [2022-07-10 08:37:34,902][26022] Updated weights on worker 0-0, policy_version 648552 (0.00087) [2022-07-10 08:37:37,105][26022] Updated weights on worker 0-0, policy_version 648562 (0.00091) [2022-07-10 08:37:38,738][26022] Updated weights on worker 0-0, policy_version 648572 (0.00620) [2022-07-10 08:37:39,479][25689] Fps is (10 sec: 5459.4, 60 sec: 5486.6, 300 sec: 5501.1). Total num frames: 664139776. Throughput: 0: 5787.6. Samples: 664148836. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:39,480][25689] Avg episode reward: [(0, '-5.762')] [2022-07-10 08:37:40,615][26022] Updated weights on worker 0-0, policy_version 648582 (0.00087) [2022-07-10 08:37:42,504][26022] Updated weights on worker 0-0, policy_version 648592 (0.00088) [2022-07-10 08:37:44,368][26022] Updated weights on worker 0-0, policy_version 648602 (0.00091) [2022-07-10 08:37:44,556][25689] Fps is (10 sec: 5653.4, 60 sec: 5519.7, 300 sec: 5506.7). Total num frames: 664169472. Throughput: 0: 4929.3. Samples: 664165292. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:44,557][25689] Avg episode reward: [(0, '-5.924')] [2022-07-10 08:37:46,169][26022] Updated weights on worker 0-0, policy_version 648612 (0.00094) [2022-07-10 08:37:48,098][26022] Updated weights on worker 0-0, policy_version 648622 (0.00089) [2022-07-10 08:37:49,621][25689] Fps is (10 sec: 5856.3, 60 sec: 5568.0, 300 sec: 5513.4). Total num frames: 664199168. Throughput: 0: 5765.2. Samples: 664198950. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:49,621][25689] Avg episode reward: [(0, '-5.168')] [2022-07-10 08:37:49,622][26022] Updated weights on worker 0-0, policy_version 648632 (0.00071) [2022-07-10 08:37:51,768][26022] Updated weights on worker 0-0, policy_version 648642 (0.00088) [2022-07-10 08:37:53,474][26022] Updated weights on worker 0-0, policy_version 648652 (0.00087) [2022-07-10 08:37:54,692][25689] Fps is (10 sec: 5455.7, 60 sec: 5515.7, 300 sec: 5502.0). Total num frames: 664224768. Throughput: 0: 5753.0. Samples: 664232238. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:54,692][25689] Avg episode reward: [(0, '-5.150')] [2022-07-10 08:37:55,463][26022] Updated weights on worker 0-0, policy_version 648662 (0.00085) [2022-07-10 08:37:57,282][26022] Updated weights on worker 0-0, policy_version 648672 (0.00087) [2022-07-10 08:37:59,090][26022] Updated weights on worker 0-0, policy_version 648682 (0.00095) [2022-07-10 08:37:59,776][25689] Fps is (10 sec: 5445.5, 60 sec: 5525.5, 300 sec: 5517.8). Total num frames: 664254464. Throughput: 0: 4921.0. Samples: 664248714. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:37:59,776][25689] Avg episode reward: [(0, '-6.384')] [2022-07-10 08:38:01,012][26022] Updated weights on worker 0-0, policy_version 648692 (0.00088) [2022-07-10 08:38:03,059][26022] Updated weights on worker 0-0, policy_version 648702 (0.00100) [2022-07-10 08:38:04,792][25689] Fps is (10 sec: 5271.8, 60 sec: 5510.7, 300 sec: 5504.0). Total num frames: 664278016. Throughput: 0: 5692.2. Samples: 664280478. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:04,793][25689] Avg episode reward: [(0, '-7.375')] [2022-07-10 08:38:04,977][26022] Updated weights on worker 0-0, policy_version 648712 (0.00087) [2022-07-10 08:38:06,566][26022] Updated weights on worker 0-0, policy_version 648722 (0.00084) [2022-07-10 08:38:08,582][26022] Updated weights on worker 0-0, policy_version 648732 (0.00093) [2022-07-10 08:38:09,870][25689] Fps is (10 sec: 5376.3, 60 sec: 5522.7, 300 sec: 5507.1). Total num frames: 664308736. Throughput: 0: 5698.8. Samples: 664314346. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:09,871][25689] Avg episode reward: [(0, '-6.966')] [2022-07-10 08:38:10,404][26022] Updated weights on worker 0-0, policy_version 648742 (0.00089) [2022-07-10 08:38:12,237][26022] Updated weights on worker 0-0, policy_version 648752 (0.00609) [2022-07-10 08:38:14,139][26022] Updated weights on worker 0-0, policy_version 648762 (0.00091) [2022-07-10 08:38:14,924][25689] Fps is (10 sec: 5659.9, 60 sec: 5522.2, 300 sec: 5509.8). Total num frames: 664335360. Throughput: 0: 4868.8. Samples: 664330744. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:14,924][25689] Avg episode reward: [(0, '-7.198')] [2022-07-10 08:38:16,040][26022] Updated weights on worker 0-0, policy_version 648772 (0.00105) [2022-07-10 08:38:17,074][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:38:17,090][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000648778_664348672.pth [2022-07-10 08:38:17,090][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000646838_662362112.pth [2022-07-10 08:38:17,835][26022] Updated weights on worker 0-0, policy_version 648782 (0.00087) [2022-07-10 08:38:19,698][26022] Updated weights on worker 0-0, policy_version 648792 (0.00096) [2022-07-10 08:38:19,992][25689] Fps is (10 sec: 5462.9, 60 sec: 5517.4, 300 sec: 5505.2). Total num frames: 664364032. Throughput: 0: 5715.2. Samples: 664364256. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:19,993][25689] Avg episode reward: [(0, '-10.018')] [2022-07-10 08:38:21,498][26022] Updated weights on worker 0-0, policy_version 648802 (0.00088) [2022-07-10 08:38:23,390][26022] Updated weights on worker 0-0, policy_version 648812 (0.00097) [2022-07-10 08:38:24,998][25689] Fps is (10 sec: 5793.7, 60 sec: 5552.5, 300 sec: 5515.9). Total num frames: 664393728. Throughput: 0: 5817.6. Samples: 664398030. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:24,999][25689] Avg episode reward: [(0, '-11.164')] [2022-07-10 08:38:25,011][26022] Updated weights on worker 0-0, policy_version 648822 (0.00765) [2022-07-10 08:38:27,079][26022] Updated weights on worker 0-0, policy_version 648832 (0.00086) [2022-07-10 08:38:28,907][26022] Updated weights on worker 0-0, policy_version 648842 (0.00085) [2022-07-10 08:38:30,025][25689] Fps is (10 sec: 5613.6, 60 sec: 5535.7, 300 sec: 5513.0). Total num frames: 664420352. Throughput: 0: 4968.4. Samples: 664414484. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:30,026][25689] Avg episode reward: [(0, '-8.159')] [2022-07-10 08:38:30,949][26022] Updated weights on worker 0-0, policy_version 648852 (0.00090) [2022-07-10 08:38:32,586][26022] Updated weights on worker 0-0, policy_version 648862 (0.00087) [2022-07-10 08:38:34,544][26022] Updated weights on worker 0-0, policy_version 648872 (0.00089) [2022-07-10 08:38:35,115][25689] Fps is (10 sec: 5567.1, 60 sec: 5566.9, 300 sec: 5518.5). Total num frames: 664450048. Throughput: 0: 5800.3. Samples: 664447858. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:35,117][25689] Avg episode reward: [(0, '-8.260')] [2022-07-10 08:38:36,079][26022] Updated weights on worker 0-0, policy_version 648882 (0.00089) [2022-07-10 08:38:38,055][26022] Updated weights on worker 0-0, policy_version 648892 (0.00085) [2022-07-10 08:38:40,016][26022] Updated weights on worker 0-0, policy_version 648902 (0.00087) [2022-07-10 08:38:40,119][25689] Fps is (10 sec: 5579.8, 60 sec: 5555.7, 300 sec: 5515.4). Total num frames: 664476672. Throughput: 0: 5823.8. Samples: 664481468. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:40,119][25689] Avg episode reward: [(0, '-6.970')] [2022-07-10 08:38:41,531][26022] Updated weights on worker 0-0, policy_version 648912 (0.00085) [2022-07-10 08:38:43,992][26022] Updated weights on worker 0-0, policy_version 648922 (0.00096) [2022-07-10 08:38:45,159][25689] Fps is (10 sec: 5505.3, 60 sec: 5542.2, 300 sec: 5515.0). Total num frames: 664505344. Throughput: 0: 4954.4. Samples: 664497914. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:45,161][25689] Avg episode reward: [(0, '-8.275')] [2022-07-10 08:38:45,202][26022] Updated weights on worker 0-0, policy_version 648932 (0.00091) [2022-07-10 08:38:47,475][26022] Updated weights on worker 0-0, policy_version 648942 (0.00087) [2022-07-10 08:38:49,196][26022] Updated weights on worker 0-0, policy_version 648952 (0.00089) [2022-07-10 08:38:50,239][25689] Fps is (10 sec: 5464.0, 60 sec: 5490.2, 300 sec: 5514.5). Total num frames: 664531968. Throughput: 0: 5783.3. Samples: 664531386. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:50,240][25689] Avg episode reward: [(0, '-6.074')] [2022-07-10 08:38:50,876][26022] Updated weights on worker 0-0, policy_version 648962 (0.00097) [2022-07-10 08:38:52,950][26022] Updated weights on worker 0-0, policy_version 648972 (0.00089) [2022-07-10 08:38:54,496][26022] Updated weights on worker 0-0, policy_version 648982 (0.01400) [2022-07-10 08:38:55,302][25689] Fps is (10 sec: 5451.7, 60 sec: 5541.5, 300 sec: 5513.7). Total num frames: 664560640. Throughput: 0: 5774.7. Samples: 664564432. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:38:55,303][25689] Avg episode reward: [(0, '-5.228')] [2022-07-10 08:38:56,627][26022] Updated weights on worker 0-0, policy_version 648992 (0.00091) [2022-07-10 08:38:58,239][26022] Updated weights on worker 0-0, policy_version 649002 (0.00088) [2022-07-10 08:39:00,147][26022] Updated weights on worker 0-0, policy_version 649012 (0.00087) [2022-07-10 08:39:00,317][25689] Fps is (10 sec: 5689.7, 60 sec: 5530.9, 300 sec: 5523.9). Total num frames: 664589312. Throughput: 0: 5768.4. Samples: 664597982. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:00,318][25689] Avg episode reward: [(0, '-6.434')] [2022-07-10 08:39:02,629][26022] Updated weights on worker 0-0, policy_version 649022 (0.00087) [2022-07-10 08:39:04,094][26022] Updated weights on worker 0-0, policy_version 649032 (0.00085) [2022-07-10 08:39:05,330][25689] Fps is (10 sec: 5208.0, 60 sec: 5531.2, 300 sec: 5513.5). Total num frames: 664612864. Throughput: 0: 5692.1. Samples: 664612726. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:05,330][25689] Avg episode reward: [(0, '-6.724')] [2022-07-10 08:39:06,146][26022] Updated weights on worker 0-0, policy_version 649042 (0.00089) [2022-07-10 08:39:08,021][26022] Updated weights on worker 0-0, policy_version 649052 (0.00092) [2022-07-10 08:39:09,649][26022] Updated weights on worker 0-0, policy_version 649062 (0.00087) [2022-07-10 08:39:10,336][25689] Fps is (10 sec: 5417.3, 60 sec: 5537.8, 300 sec: 5518.1). Total num frames: 664643584. Throughput: 0: 5719.4. Samples: 664646328. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:10,336][25689] Avg episode reward: [(0, '-5.642')] [2022-07-10 08:39:11,779][26022] Updated weights on worker 0-0, policy_version 649072 (0.00054) [2022-07-10 08:39:13,350][26022] Updated weights on worker 0-0, policy_version 649082 (0.00088) [2022-07-10 08:39:15,444][25689] Fps is (10 sec: 5568.2, 60 sec: 5515.9, 300 sec: 5513.2). Total num frames: 664669184. Throughput: 0: 5719.9. Samples: 664679644. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:15,445][25689] Avg episode reward: [(0, '-7.121')] [2022-07-10 08:39:15,479][26022] Updated weights on worker 0-0, policy_version 649092 (0.00086) [2022-07-10 08:39:17,077][26022] Updated weights on worker 0-0, policy_version 649102 (0.00089) [2022-07-10 08:39:19,093][26022] Updated weights on worker 0-0, policy_version 649112 (0.00086) [2022-07-10 08:39:20,470][25689] Fps is (10 sec: 5456.6, 60 sec: 5536.8, 300 sec: 5516.3). Total num frames: 664698880. Throughput: 0: 4892.0. Samples: 664696566. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:20,470][25689] Avg episode reward: [(0, '-7.415')] [2022-07-10 08:39:20,696][26022] Updated weights on worker 0-0, policy_version 649122 (0.00098) [2022-07-10 08:39:22,763][26022] Updated weights on worker 0-0, policy_version 649132 (0.00091) [2022-07-10 08:39:24,361][26022] Updated weights on worker 0-0, policy_version 649142 (0.00086) [2022-07-10 08:39:25,507][25689] Fps is (10 sec: 5698.8, 60 sec: 5500.1, 300 sec: 5515.7). Total num frames: 664726528. Throughput: 0: 5819.2. Samples: 664730140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:25,507][25689] Avg episode reward: [(0, '-5.151')] [2022-07-10 08:39:26,401][26022] Updated weights on worker 0-0, policy_version 649152 (0.00095) [2022-07-10 08:39:28,217][26022] Updated weights on worker 0-0, policy_version 649162 (0.00094) [2022-07-10 08:39:30,049][26022] Updated weights on worker 0-0, policy_version 649172 (0.00084) [2022-07-10 08:39:30,538][25689] Fps is (10 sec: 5593.4, 60 sec: 5533.4, 300 sec: 5517.0). Total num frames: 664755200. Throughput: 0: 5798.8. Samples: 664763480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:30,539][25689] Avg episode reward: [(0, '-5.088')] [2022-07-10 08:39:31,698][26022] Updated weights on worker 0-0, policy_version 649182 (0.00092) [2022-07-10 08:39:33,744][26022] Updated weights on worker 0-0, policy_version 649192 (0.00086) [2022-07-10 08:39:35,483][26022] Updated weights on worker 0-0, policy_version 649202 (0.00083) [2022-07-10 08:39:35,659][25689] Fps is (10 sec: 5547.6, 60 sec: 5496.8, 300 sec: 5518.7). Total num frames: 664782848. Throughput: 0: 4973.8. Samples: 664780188. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:35,660][25689] Avg episode reward: [(0, '-5.511')] [2022-07-10 08:39:37,416][26022] Updated weights on worker 0-0, policy_version 649212 (0.00093) [2022-07-10 08:39:39,102][26022] Updated weights on worker 0-0, policy_version 649222 (0.00085) [2022-07-10 08:39:40,711][25689] Fps is (10 sec: 5536.5, 60 sec: 5526.2, 300 sec: 5517.9). Total num frames: 664811520. Throughput: 0: 5793.0. Samples: 664813824. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:40,711][25689] Avg episode reward: [(0, '-5.816')] [2022-07-10 08:39:41,287][26022] Updated weights on worker 0-0, policy_version 649232 (0.00092) [2022-07-10 08:39:42,941][26022] Updated weights on worker 0-0, policy_version 649242 (0.00094) [2022-07-10 08:39:44,936][26022] Updated weights on worker 0-0, policy_version 649252 (0.00092) [2022-07-10 08:39:45,717][25689] Fps is (10 sec: 5701.6, 60 sec: 5529.4, 300 sec: 5521.5). Total num frames: 664840192. Throughput: 0: 5782.1. Samples: 664846994. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:45,717][25689] Avg episode reward: [(0, '-5.150')] [2022-07-10 08:39:46,494][26022] Updated weights on worker 0-0, policy_version 649262 (0.00089) [2022-07-10 08:39:48,652][26022] Updated weights on worker 0-0, policy_version 649272 (0.00091) [2022-07-10 08:39:50,359][26022] Updated weights on worker 0-0, policy_version 649282 (0.00095) [2022-07-10 08:39:50,727][25689] Fps is (10 sec: 5623.1, 60 sec: 5552.7, 300 sec: 5520.3). Total num frames: 664867840. Throughput: 0: 4965.7. Samples: 664863730. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:50,735][25689] Avg episode reward: [(0, '-3.933')] [2022-07-10 08:39:52,331][26022] Updated weights on worker 0-0, policy_version 649292 (0.00094) [2022-07-10 08:39:53,849][26022] Updated weights on worker 0-0, policy_version 649302 (0.00089) [2022-07-10 08:39:55,848][25689] Fps is (10 sec: 5255.7, 60 sec: 5496.7, 300 sec: 5508.2). Total num frames: 664893440. Throughput: 0: 5800.4. Samples: 664897292. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:39:55,848][25689] Avg episode reward: [(0, '-4.693')] [2022-07-10 08:39:56,027][26022] Updated weights on worker 0-0, policy_version 649312 (0.00095) [2022-07-10 08:39:57,526][26022] Updated weights on worker 0-0, policy_version 649322 (0.00088) [2022-07-10 08:39:59,578][26022] Updated weights on worker 0-0, policy_version 649332 (0.00082) [2022-07-10 08:40:00,879][25689] Fps is (10 sec: 5547.8, 60 sec: 5529.1, 300 sec: 5531.9). Total num frames: 664924160. Throughput: 0: 5794.1. Samples: 664930678. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:40:00,879][25689] Avg episode reward: [(0, '-5.251')] [2022-07-10 08:40:01,212][26022] Updated weights on worker 0-0, policy_version 649342 (0.00090) [2022-07-10 08:40:03,487][26022] Updated weights on worker 0-0, policy_version 649352 (0.00090) [2022-07-10 08:40:05,372][26022] Updated weights on worker 0-0, policy_version 649362 (0.00094) [2022-07-10 08:40:05,907][25689] Fps is (10 sec: 5599.2, 60 sec: 5561.5, 300 sec: 5521.4). Total num frames: 664949760. Throughput: 0: 4873.1. Samples: 664945382. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:40:05,909][25689] Avg episode reward: [(0, '-4.983')] [2022-07-10 08:40:07,303][26022] Updated weights on worker 0-0, policy_version 649372 (0.00093) [2022-07-10 08:40:09,032][26022] Updated weights on worker 0-0, policy_version 649382 (0.00057) [2022-07-10 08:40:10,934][25689] Fps is (10 sec: 5193.5, 60 sec: 5491.9, 300 sec: 5518.6). Total num frames: 664976384. Throughput: 0: 5686.5. Samples: 664978640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:40:10,936][25689] Avg episode reward: [(0, '-5.702')] [2022-07-10 08:40:10,950][26022] Updated weights on worker 0-0, policy_version 649392 (0.00186) [2022-07-10 08:40:12,736][26022] Updated weights on worker 0-0, policy_version 649402 (0.00093) [2022-07-10 08:40:14,608][26022] Updated weights on worker 0-0, policy_version 649412 (0.00087) [2022-07-10 08:40:16,021][25689] Fps is (10 sec: 5467.1, 60 sec: 5544.6, 300 sec: 5517.4). Total num frames: 665005056. Throughput: 0: 5673.4. Samples: 665011740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:40:16,021][25689] Avg episode reward: [(0, '-4.238')] [2022-07-10 08:40:16,417][26022] Updated weights on worker 0-0, policy_version 649422 (0.00094) [2022-07-10 08:40:17,174][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:40:17,186][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000649425_665011200.pth [2022-07-10 08:40:17,186][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000647485_663024640.pth [2022-07-10 08:40:18,355][26022] Updated weights on worker 0-0, policy_version 649432 (0.00079) [2022-07-10 08:40:20,245][26022] Updated weights on worker 0-0, policy_version 649442 (0.00087) [2022-07-10 08:40:21,049][25689] Fps is (10 sec: 5568.2, 60 sec: 5510.5, 300 sec: 5520.7). Total num frames: 665032704. Throughput: 0: 4851.0. Samples: 665028524. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:40:21,049][25689] Avg episode reward: [(0, '-4.525')] [2022-07-10 08:40:22,163][26022] Updated weights on worker 0-0, policy_version 649452 (0.00082) [2022-07-10 08:40:23,644][26022] Updated weights on worker 0-0, policy_version 649462 (0.00083) [2022-07-10 08:40:25,792][26022] Updated weights on worker 0-0, policy_version 649472 (0.00099) [2022-07-10 08:40:26,073][25689] Fps is (10 sec: 5500.9, 60 sec: 5511.7, 300 sec: 5513.5). Total num frames: 665060352. Throughput: 0: 5784.4. Samples: 665062034. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 08:40:26,074][25689] Avg episode reward: [(0, '-5.384')] [2022-07-10 08:40:27,551][26022] Updated weights on worker 0-0, policy_version 649482 (0.00089) [2022-07-10 08:40:29,318][26022] Updated weights on worker 0-0, policy_version 649492 (0.00090) [2022-07-10 08:40:31,078][25689] Fps is (10 sec: 5513.3, 60 sec: 5497.2, 300 sec: 5514.8). Total num frames: 665088000. Throughput: 0: 5790.9. Samples: 665095292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:40:31,079][25689] Avg episode reward: [(0, '-4.369')] [2022-07-10 08:40:31,216][26022] Updated weights on worker 0-0, policy_version 649502 (0.00082) [2022-07-10 08:40:33,312][26022] Updated weights on worker 0-0, policy_version 649512 (0.00086) [2022-07-10 08:40:34,875][26022] Updated weights on worker 0-0, policy_version 649522 (0.00083) [2022-07-10 08:40:36,125][25689] Fps is (10 sec: 5501.0, 60 sec: 5503.9, 300 sec: 5517.8). Total num frames: 665115648. Throughput: 0: 4976.2. Samples: 665111782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:40:36,126][25689] Avg episode reward: [(0, '-3.177')] [2022-07-10 08:40:36,979][26022] Updated weights on worker 0-0, policy_version 649532 (0.00088) [2022-07-10 08:40:38,556][26022] Updated weights on worker 0-0, policy_version 649542 (0.00094) [2022-07-10 08:40:40,538][26022] Updated weights on worker 0-0, policy_version 649552 (0.00088) [2022-07-10 08:40:41,153][25689] Fps is (10 sec: 5692.1, 60 sec: 5523.1, 300 sec: 5517.7). Total num frames: 665145344. Throughput: 0: 5807.4. Samples: 665145274. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:40:41,156][25689] Avg episode reward: [(0, '-3.624')] [2022-07-10 08:40:42,182][26022] Updated weights on worker 0-0, policy_version 649562 (0.00084) [2022-07-10 08:40:44,247][26022] Updated weights on worker 0-0, policy_version 649572 (0.00086) [2022-07-10 08:40:46,080][26022] Updated weights on worker 0-0, policy_version 649582 (0.00095) [2022-07-10 08:40:46,160][25689] Fps is (10 sec: 5612.6, 60 sec: 5489.1, 300 sec: 5517.7). Total num frames: 665171968. Throughput: 0: 5802.2. Samples: 665178580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:40:46,160][25689] Avg episode reward: [(0, '-4.124')] [2022-07-10 08:40:47,904][26022] Updated weights on worker 0-0, policy_version 649592 (0.00091) [2022-07-10 08:40:49,758][26022] Updated weights on worker 0-0, policy_version 649602 (0.00087) [2022-07-10 08:40:51,190][25689] Fps is (10 sec: 5509.0, 60 sec: 5504.2, 300 sec: 5519.8). Total num frames: 665200640. Throughput: 0: 4955.4. Samples: 665194952. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:40:51,192][25689] Avg episode reward: [(0, '-4.471')] [2022-07-10 08:40:51,832][26022] Updated weights on worker 0-0, policy_version 649612 (0.00085) [2022-07-10 08:40:53,401][26022] Updated weights on worker 0-0, policy_version 649622 (0.00088) [2022-07-10 08:40:55,442][26022] Updated weights on worker 0-0, policy_version 649632 (0.00089) [2022-07-10 08:40:56,308][25689] Fps is (10 sec: 5449.1, 60 sec: 5521.4, 300 sec: 5508.4). Total num frames: 665227264. Throughput: 0: 5778.6. Samples: 665228408. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:40:56,309][25689] Avg episode reward: [(0, '-3.807')] [2022-07-10 08:40:57,018][26022] Updated weights on worker 0-0, policy_version 649642 (0.00087) [2022-07-10 08:40:59,167][26022] Updated weights on worker 0-0, policy_version 649652 (0.00094) [2022-07-10 08:41:00,680][26022] Updated weights on worker 0-0, policy_version 649662 (0.00090) [2022-07-10 08:41:01,325][25689] Fps is (10 sec: 5557.2, 60 sec: 5505.7, 300 sec: 5529.4). Total num frames: 665256960. Throughput: 0: 5774.9. Samples: 665261766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:01,325][25689] Avg episode reward: [(0, '-4.605')] [2022-07-10 08:41:03,057][26022] Updated weights on worker 0-0, policy_version 649672 (0.00084) [2022-07-10 08:41:04,695][26022] Updated weights on worker 0-0, policy_version 649682 (0.00105) [2022-07-10 08:41:06,376][25689] Fps is (10 sec: 5492.1, 60 sec: 5503.6, 300 sec: 5515.0). Total num frames: 665282560. Throughput: 0: 4838.5. Samples: 665276396. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:06,376][25689] Avg episode reward: [(0, '-4.516')] [2022-07-10 08:41:06,669][26022] Updated weights on worker 0-0, policy_version 649692 (0.00085) [2022-07-10 08:41:08,574][26022] Updated weights on worker 0-0, policy_version 649702 (0.00083) [2022-07-10 08:41:10,509][26022] Updated weights on worker 0-0, policy_version 649712 (0.00084) [2022-07-10 08:41:11,429][25689] Fps is (10 sec: 5269.7, 60 sec: 5518.2, 300 sec: 5518.4). Total num frames: 665310208. Throughput: 0: 5692.4. Samples: 665310162. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:11,430][25689] Avg episode reward: [(0, '-4.090')] [2022-07-10 08:41:12,123][26022] Updated weights on worker 0-0, policy_version 649722 (0.00086) [2022-07-10 08:41:14,338][26022] Updated weights on worker 0-0, policy_version 649732 (0.00100) [2022-07-10 08:41:15,797][26022] Updated weights on worker 0-0, policy_version 649742 (0.00090) [2022-07-10 08:41:16,539][25689] Fps is (10 sec: 5642.2, 60 sec: 5533.0, 300 sec: 5520.3). Total num frames: 665339904. Throughput: 0: 5706.8. Samples: 665343866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:16,541][25689] Avg episode reward: [(0, '-3.432')] [2022-07-10 08:41:17,840][26022] Updated weights on worker 0-0, policy_version 649752 (0.00091) [2022-07-10 08:41:19,440][26022] Updated weights on worker 0-0, policy_version 649762 (0.00118) [2022-07-10 08:41:21,539][26022] Updated weights on worker 0-0, policy_version 649772 (0.00085) [2022-07-10 08:41:21,628][25689] Fps is (10 sec: 5622.8, 60 sec: 5527.5, 300 sec: 5516.2). Total num frames: 665367552. Throughput: 0: 4860.5. Samples: 665360450. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:21,628][25689] Avg episode reward: [(0, '-3.434')] [2022-07-10 08:41:23,098][26022] Updated weights on worker 0-0, policy_version 649782 (0.00094) [2022-07-10 08:41:25,209][26022] Updated weights on worker 0-0, policy_version 649792 (0.00092) [2022-07-10 08:41:26,711][25689] Fps is (10 sec: 5537.0, 60 sec: 5539.0, 300 sec: 5519.5). Total num frames: 665396224. Throughput: 0: 5773.8. Samples: 665393806. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:26,712][25689] Avg episode reward: [(0, '-3.355')] [2022-07-10 08:41:26,909][26022] Updated weights on worker 0-0, policy_version 649802 (0.00089) [2022-07-10 08:41:28,943][26022] Updated weights on worker 0-0, policy_version 649812 (0.00090) [2022-07-10 08:41:30,603][26022] Updated weights on worker 0-0, policy_version 649822 (0.00082) [2022-07-10 08:41:31,787][25689] Fps is (10 sec: 5544.0, 60 sec: 5532.6, 300 sec: 5522.6). Total num frames: 665423872. Throughput: 0: 5747.0. Samples: 665427156. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:31,788][25689] Avg episode reward: [(0, '-3.713')] [2022-07-10 08:41:32,410][26022] Updated weights on worker 0-0, policy_version 649832 (0.00089) [2022-07-10 08:41:34,269][26022] Updated weights on worker 0-0, policy_version 649842 (0.00086) [2022-07-10 08:41:36,179][26022] Updated weights on worker 0-0, policy_version 649852 (0.00089) [2022-07-10 08:41:36,863][25689] Fps is (10 sec: 5446.7, 60 sec: 5529.9, 300 sec: 5517.9). Total num frames: 665451520. Throughput: 0: 5749.6. Samples: 665460720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:36,863][25689] Avg episode reward: [(0, '-3.643')] [2022-07-10 08:41:37,823][26022] Updated weights on worker 0-0, policy_version 649862 (0.00089) [2022-07-10 08:41:39,820][26022] Updated weights on worker 0-0, policy_version 649872 (0.00089) [2022-07-10 08:41:41,637][26022] Updated weights on worker 0-0, policy_version 649882 (0.00088) [2022-07-10 08:41:41,953][25689] Fps is (10 sec: 5539.5, 60 sec: 5507.3, 300 sec: 5520.9). Total num frames: 665480192. Throughput: 0: 5758.8. Samples: 665477502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:41,954][25689] Avg episode reward: [(0, '-3.135')] [2022-07-10 08:41:43,467][26022] Updated weights on worker 0-0, policy_version 649892 (0.00091) [2022-07-10 08:41:45,313][26022] Updated weights on worker 0-0, policy_version 649902 (0.00089) [2022-07-10 08:41:47,000][25689] Fps is (10 sec: 5656.6, 60 sec: 5537.4, 300 sec: 5527.6). Total num frames: 665508864. Throughput: 0: 5775.5. Samples: 665510988. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:47,001][25689] Avg episode reward: [(0, '-4.763')] [2022-07-10 08:41:47,188][26022] Updated weights on worker 0-0, policy_version 649912 (0.00089) [2022-07-10 08:41:48,959][26022] Updated weights on worker 0-0, policy_version 649922 (0.00091) [2022-07-10 08:41:50,944][26022] Updated weights on worker 0-0, policy_version 649932 (0.00097) [2022-07-10 08:41:52,003][25689] Fps is (10 sec: 5502.3, 60 sec: 5506.2, 300 sec: 5521.7). Total num frames: 665535488. Throughput: 0: 5784.0. Samples: 665544088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:52,004][25689] Avg episode reward: [(0, '-6.205')] [2022-07-10 08:41:52,636][26022] Updated weights on worker 0-0, policy_version 649942 (0.00084) [2022-07-10 08:41:54,766][26022] Updated weights on worker 0-0, policy_version 649952 (0.00082) [2022-07-10 08:41:56,307][26022] Updated weights on worker 0-0, policy_version 649962 (0.00081) [2022-07-10 08:41:57,088][25689] Fps is (10 sec: 5583.1, 60 sec: 5559.7, 300 sec: 5523.7). Total num frames: 665565184. Throughput: 0: 4946.3. Samples: 665560760. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:41:57,088][25689] Avg episode reward: [(0, '-6.099')] [2022-07-10 08:41:58,359][26022] Updated weights on worker 0-0, policy_version 649972 (0.00085) [2022-07-10 08:42:00,082][26022] Updated weights on worker 0-0, policy_version 649982 (0.00084) [2022-07-10 08:42:02,112][25689] Fps is (10 sec: 5368.6, 60 sec: 5474.8, 300 sec: 5523.9). Total num frames: 665589760. Throughput: 0: 5778.3. Samples: 665593986. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:02,113][25689] Avg episode reward: [(0, '-5.860')] [2022-07-10 08:42:02,458][26022] Updated weights on worker 0-0, policy_version 649992 (0.00090) [2022-07-10 08:42:04,156][26022] Updated weights on worker 0-0, policy_version 650002 (0.00095) [2022-07-10 08:42:06,092][26022] Updated weights on worker 0-0, policy_version 650012 (0.00089) [2022-07-10 08:42:07,154][25689] Fps is (10 sec: 5188.2, 60 sec: 5509.4, 300 sec: 5516.7). Total num frames: 665617408. Throughput: 0: 5659.4. Samples: 665625044. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:07,154][25689] Avg episode reward: [(0, '-6.290')] [2022-07-10 08:42:07,814][26022] Updated weights on worker 0-0, policy_version 650022 (0.00091) [2022-07-10 08:42:09,990][26022] Updated weights on worker 0-0, policy_version 650032 (0.00094) [2022-07-10 08:42:11,499][26022] Updated weights on worker 0-0, policy_version 650042 (0.00087) [2022-07-10 08:42:12,175][25689] Fps is (10 sec: 5495.4, 60 sec: 5512.3, 300 sec: 5520.7). Total num frames: 665645056. Throughput: 0: 4836.5. Samples: 665641648. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:12,175][25689] Avg episode reward: [(0, '-6.504')] [2022-07-10 08:42:13,559][26022] Updated weights on worker 0-0, policy_version 650052 (0.00085) [2022-07-10 08:42:15,363][26022] Updated weights on worker 0-0, policy_version 650062 (0.00084) [2022-07-10 08:42:17,143][26022] Updated weights on worker 0-0, policy_version 650072 (0.00096) [2022-07-10 08:42:17,313][25689] Fps is (10 sec: 5543.8, 60 sec: 5492.9, 300 sec: 5518.4). Total num frames: 665673728. Throughput: 0: 5655.7. Samples: 665675146. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:17,313][25689] Avg episode reward: [(0, '-4.091')] [2022-07-10 08:42:17,333][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:42:17,344][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000650073_665674752.pth [2022-07-10 08:42:17,345][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000648130_663685120.pth [2022-07-10 08:42:19,007][26022] Updated weights on worker 0-0, policy_version 650082 (0.00092) [2022-07-10 08:42:20,847][26022] Updated weights on worker 0-0, policy_version 650092 (0.00075) [2022-07-10 08:42:22,328][25689] Fps is (10 sec: 5546.7, 60 sec: 5499.5, 300 sec: 5518.5). Total num frames: 665701376. Throughput: 0: 5664.6. Samples: 665708502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:22,329][25689] Avg episode reward: [(0, '-2.831')] [2022-07-10 08:42:22,827][26022] Updated weights on worker 0-0, policy_version 650102 (0.00096) [2022-07-10 08:42:24,583][26022] Updated weights on worker 0-0, policy_version 650112 (0.00094) [2022-07-10 08:42:26,447][26022] Updated weights on worker 0-0, policy_version 650122 (0.00091) [2022-07-10 08:42:27,362][25689] Fps is (10 sec: 5604.3, 60 sec: 5504.0, 300 sec: 5521.8). Total num frames: 665730048. Throughput: 0: 4955.0. Samples: 665725176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:27,363][25689] Avg episode reward: [(0, '-2.889')] [2022-07-10 08:42:28,295][26022] Updated weights on worker 0-0, policy_version 650132 (0.00087) [2022-07-10 08:42:29,877][26022] Updated weights on worker 0-0, policy_version 650142 (0.00089) [2022-07-10 08:42:31,963][26022] Updated weights on worker 0-0, policy_version 650152 (0.00089) [2022-07-10 08:42:32,372][25689] Fps is (10 sec: 5607.6, 60 sec: 5510.0, 300 sec: 5522.8). Total num frames: 665757696. Throughput: 0: 5804.5. Samples: 665758884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:32,373][25689] Avg episode reward: [(0, '-4.407')] [2022-07-10 08:42:33,682][26022] Updated weights on worker 0-0, policy_version 650162 (0.00082) [2022-07-10 08:42:35,606][26022] Updated weights on worker 0-0, policy_version 650172 (0.00089) [2022-07-10 08:42:37,264][26022] Updated weights on worker 0-0, policy_version 650182 (0.00085) [2022-07-10 08:42:37,490][25689] Fps is (10 sec: 5560.9, 60 sec: 5523.1, 300 sec: 5525.3). Total num frames: 665786368. Throughput: 0: 5813.9. Samples: 665792456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:37,491][25689] Avg episode reward: [(0, '-5.091')] [2022-07-10 08:42:39,296][26022] Updated weights on worker 0-0, policy_version 650192 (0.00094) [2022-07-10 08:42:41,093][26022] Updated weights on worker 0-0, policy_version 650202 (0.00087) [2022-07-10 08:42:42,518][25689] Fps is (10 sec: 5652.1, 60 sec: 5528.8, 300 sec: 5522.8). Total num frames: 665815040. Throughput: 0: 4987.8. Samples: 665809202. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:42,518][25689] Avg episode reward: [(0, '-7.864')] [2022-07-10 08:42:43,003][26022] Updated weights on worker 0-0, policy_version 650212 (0.00091) [2022-07-10 08:42:44,609][26022] Updated weights on worker 0-0, policy_version 650222 (0.00087) [2022-07-10 08:42:46,807][26022] Updated weights on worker 0-0, policy_version 650232 (0.00087) [2022-07-10 08:42:47,556][25689] Fps is (10 sec: 5595.3, 60 sec: 5512.7, 300 sec: 5516.4). Total num frames: 665842688. Throughput: 0: 5827.6. Samples: 665842856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:47,557][25689] Avg episode reward: [(0, '-8.750')] [2022-07-10 08:42:48,314][26022] Updated weights on worker 0-0, policy_version 650242 (0.00086) [2022-07-10 08:42:50,317][26022] Updated weights on worker 0-0, policy_version 650252 (0.00082) [2022-07-10 08:42:52,006][26022] Updated weights on worker 0-0, policy_version 650262 (0.00093) [2022-07-10 08:42:52,579][25689] Fps is (10 sec: 5495.7, 60 sec: 5527.7, 300 sec: 5524.1). Total num frames: 665870336. Throughput: 0: 5798.6. Samples: 665876058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:52,580][25689] Avg episode reward: [(0, '-8.483')] [2022-07-10 08:42:54,026][26022] Updated weights on worker 0-0, policy_version 650272 (0.00086) [2022-07-10 08:42:55,729][26022] Updated weights on worker 0-0, policy_version 650282 (0.00085) [2022-07-10 08:42:57,658][25689] Fps is (10 sec: 5473.8, 60 sec: 5494.5, 300 sec: 5517.3). Total num frames: 665897984. Throughput: 0: 4979.9. Samples: 665892888. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:42:57,659][25689] Avg episode reward: [(0, '-8.841')] [2022-07-10 08:42:57,664][26022] Updated weights on worker 0-0, policy_version 650292 (0.00087) [2022-07-10 08:42:59,531][26022] Updated weights on worker 0-0, policy_version 650302 (0.00082) [2022-07-10 08:43:01,382][26022] Updated weights on worker 0-0, policy_version 650312 (0.00060) [2022-07-10 08:43:02,693][25689] Fps is (10 sec: 5366.3, 60 sec: 5527.3, 300 sec: 5527.3). Total num frames: 665924608. Throughput: 0: 5798.0. Samples: 665926178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:02,693][25689] Avg episode reward: [(0, '-8.677')] [2022-07-10 08:43:03,726][26022] Updated weights on worker 0-0, policy_version 650322 (0.00090) [2022-07-10 08:43:05,263][26022] Updated weights on worker 0-0, policy_version 650332 (0.00085) [2022-07-10 08:43:07,265][26022] Updated weights on worker 0-0, policy_version 650342 (0.00086) [2022-07-10 08:43:07,766][25689] Fps is (10 sec: 5470.6, 60 sec: 5541.4, 300 sec: 5520.5). Total num frames: 665953280. Throughput: 0: 5681.9. Samples: 665957686. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:07,766][25689] Avg episode reward: [(0, '-5.908')] [2022-07-10 08:43:08,966][26022] Updated weights on worker 0-0, policy_version 650352 (0.00092) [2022-07-10 08:43:10,961][26022] Updated weights on worker 0-0, policy_version 650362 (0.00090) [2022-07-10 08:43:12,787][25689] Fps is (10 sec: 5478.1, 60 sec: 5524.4, 300 sec: 5521.1). Total num frames: 665979904. Throughput: 0: 4864.5. Samples: 665974358. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:12,787][25689] Avg episode reward: [(0, '-5.200')] [2022-07-10 08:43:12,889][26022] Updated weights on worker 0-0, policy_version 650372 (0.00087) [2022-07-10 08:43:14,425][26022] Updated weights on worker 0-0, policy_version 650382 (0.00095) [2022-07-10 08:43:16,552][26022] Updated weights on worker 0-0, policy_version 650392 (0.00089) [2022-07-10 08:43:17,883][25689] Fps is (10 sec: 5566.9, 60 sec: 5545.2, 300 sec: 5524.1). Total num frames: 666009600. Throughput: 0: 5678.0. Samples: 666007724. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:17,883][25689] Avg episode reward: [(0, '-4.906')] [2022-07-10 08:43:18,333][26022] Updated weights on worker 0-0, policy_version 650402 (0.00091) [2022-07-10 08:43:20,087][26022] Updated weights on worker 0-0, policy_version 650412 (0.00082) [2022-07-10 08:43:22,119][26022] Updated weights on worker 0-0, policy_version 650422 (0.00091) [2022-07-10 08:43:22,895][25689] Fps is (10 sec: 5571.9, 60 sec: 5528.6, 300 sec: 5513.6). Total num frames: 666036224. Throughput: 0: 5681.7. Samples: 666040958. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:22,895][25689] Avg episode reward: [(0, '-5.207')] [2022-07-10 08:43:23,768][26022] Updated weights on worker 0-0, policy_version 650432 (0.00088) [2022-07-10 08:43:25,758][26022] Updated weights on worker 0-0, policy_version 650442 (0.00087) [2022-07-10 08:43:27,472][26022] Updated weights on worker 0-0, policy_version 650452 (0.00088) [2022-07-10 08:43:27,918][25689] Fps is (10 sec: 5510.2, 60 sec: 5529.6, 300 sec: 5520.6). Total num frames: 666064896. Throughput: 0: 5784.6. Samples: 666074258. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:27,918][25689] Avg episode reward: [(0, '-5.090')] [2022-07-10 08:43:29,379][26022] Updated weights on worker 0-0, policy_version 650462 (0.00088) [2022-07-10 08:43:31,263][26022] Updated weights on worker 0-0, policy_version 650472 (0.00089) [2022-07-10 08:43:32,934][25689] Fps is (10 sec: 5507.8, 60 sec: 5512.1, 300 sec: 5511.6). Total num frames: 666091520. Throughput: 0: 5783.5. Samples: 666090880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:32,935][25689] Avg episode reward: [(0, '-5.578')] [2022-07-10 08:43:33,176][26022] Updated weights on worker 0-0, policy_version 650482 (0.00088) [2022-07-10 08:43:34,872][26022] Updated weights on worker 0-0, policy_version 650492 (0.00092) [2022-07-10 08:43:36,661][26022] Updated weights on worker 0-0, policy_version 650502 (0.00077) [2022-07-10 08:43:37,989][25689] Fps is (10 sec: 5490.4, 60 sec: 5517.8, 300 sec: 5517.6). Total num frames: 666120192. Throughput: 0: 5797.8. Samples: 666124298. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:37,990][25689] Avg episode reward: [(0, '-6.356')] [2022-07-10 08:43:38,559][26022] Updated weights on worker 0-0, policy_version 650512 (0.00100) [2022-07-10 08:43:40,582][26022] Updated weights on worker 0-0, policy_version 650522 (0.00097) [2022-07-10 08:43:42,242][26022] Updated weights on worker 0-0, policy_version 650532 (0.00086) [2022-07-10 08:43:43,006][25689] Fps is (10 sec: 5795.2, 60 sec: 5535.7, 300 sec: 5521.4). Total num frames: 666149888. Throughput: 0: 5817.7. Samples: 666157960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-10 08:43:43,007][25689] Avg episode reward: [(0, '-8.161')] [2022-07-10 08:43:44,208][26022] Updated weights on worker 0-0, policy_version 650542 (0.00089) [2022-07-10 08:43:45,908][26022] Updated weights on worker 0-0, policy_version 650552 (0.00091) [2022-07-10 08:43:47,831][26022] Updated weights on worker 0-0, policy_version 650562 (0.00111) [2022-07-10 08:43:48,018][25689] Fps is (10 sec: 5616.0, 60 sec: 5521.2, 300 sec: 5522.7). Total num frames: 666176512. Throughput: 0: 4994.6. Samples: 666174648. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:43:48,018][25689] Avg episode reward: [(0, '-10.032')] [2022-07-10 08:43:49,549][26022] Updated weights on worker 0-0, policy_version 650572 (0.00084) [2022-07-10 08:43:51,619][26022] Updated weights on worker 0-0, policy_version 650582 (0.00087) [2022-07-10 08:43:53,043][25689] Fps is (10 sec: 5407.3, 60 sec: 5521.1, 300 sec: 5520.0). Total num frames: 666204160. Throughput: 0: 5792.0. Samples: 666207348. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:43:53,043][25689] Avg episode reward: [(0, '-10.688')] [2022-07-10 08:43:53,387][26022] Updated weights on worker 0-0, policy_version 650592 (0.00096) [2022-07-10 08:43:55,228][26022] Updated weights on worker 0-0, policy_version 650602 (0.00085) [2022-07-10 08:43:57,130][26022] Updated weights on worker 0-0, policy_version 650612 (0.00095) [2022-07-10 08:43:58,166][25689] Fps is (10 sec: 5549.5, 60 sec: 5533.9, 300 sec: 5518.0). Total num frames: 666232832. Throughput: 0: 5788.2. Samples: 666241088. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:43:58,167][25689] Avg episode reward: [(0, '-9.356')] [2022-07-10 08:43:58,651][26022] Updated weights on worker 0-0, policy_version 650622 (0.00087) [2022-07-10 08:44:00,780][26022] Updated weights on worker 0-0, policy_version 650632 (0.00082) [2022-07-10 08:44:02,803][26022] Updated weights on worker 0-0, policy_version 650642 (0.00085) [2022-07-10 08:44:03,205][25689] Fps is (10 sec: 5441.2, 60 sec: 5533.5, 300 sec: 5527.8). Total num frames: 666259456. Throughput: 0: 4951.2. Samples: 666257970. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:03,206][25689] Avg episode reward: [(0, '-8.906')] [2022-07-10 08:44:04,716][26022] Updated weights on worker 0-0, policy_version 650652 (0.00091) [2022-07-10 08:44:06,360][26022] Updated weights on worker 0-0, policy_version 650662 (0.00089) [2022-07-10 08:44:08,256][25689] Fps is (10 sec: 5480.7, 60 sec: 5535.6, 300 sec: 5520.1). Total num frames: 666288128. Throughput: 0: 5684.7. Samples: 666289694. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:08,256][25689] Avg episode reward: [(0, '-8.111')] [2022-07-10 08:44:08,259][26022] Updated weights on worker 0-0, policy_version 650672 (0.00085) [2022-07-10 08:44:10,113][26022] Updated weights on worker 0-0, policy_version 650682 (0.00090) [2022-07-10 08:44:12,016][26022] Updated weights on worker 0-0, policy_version 650692 (0.00085) [2022-07-10 08:44:13,291][25689] Fps is (10 sec: 5482.7, 60 sec: 5534.3, 300 sec: 5524.9). Total num frames: 666314752. Throughput: 0: 5733.7. Samples: 666323444. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:13,291][25689] Avg episode reward: [(0, '-6.790')] [2022-07-10 08:44:13,666][26022] Updated weights on worker 0-0, policy_version 650702 (0.00087) [2022-07-10 08:44:15,744][26022] Updated weights on worker 0-0, policy_version 650712 (0.00095) [2022-07-10 08:44:17,466][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:44:17,486][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000650722_666339328.pth [2022-07-10 08:44:17,486][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000648778_664348672.pth [2022-07-10 08:44:17,492][26022] Updated weights on worker 0-0, policy_version 650722 (0.00096) [2022-07-10 08:44:18,348][25689] Fps is (10 sec: 5377.5, 60 sec: 5504.0, 300 sec: 5517.4). Total num frames: 666342400. Throughput: 0: 4907.9. Samples: 666340138. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:18,349][25689] Avg episode reward: [(0, '-5.892')] [2022-07-10 08:44:19,569][26022] Updated weights on worker 0-0, policy_version 650732 (0.00051) [2022-07-10 08:44:21,296][26022] Updated weights on worker 0-0, policy_version 650742 (0.00087) [2022-07-10 08:44:23,148][26022] Updated weights on worker 0-0, policy_version 650752 (0.00626) [2022-07-10 08:44:23,376][25689] Fps is (10 sec: 5584.6, 60 sec: 5536.4, 300 sec: 5521.0). Total num frames: 666371072. Throughput: 0: 5705.2. Samples: 666373046. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:23,376][25689] Avg episode reward: [(0, '-5.201')] [2022-07-10 08:44:24,954][26022] Updated weights on worker 0-0, policy_version 650762 (0.00089) [2022-07-10 08:44:26,867][26022] Updated weights on worker 0-0, policy_version 650772 (0.00087) [2022-07-10 08:44:28,400][25689] Fps is (10 sec: 5704.5, 60 sec: 5536.3, 300 sec: 5521.2). Total num frames: 666399744. Throughput: 0: 5819.6. Samples: 666406928. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:28,401][25689] Avg episode reward: [(0, '-5.848')] [2022-07-10 08:44:28,553][26022] Updated weights on worker 0-0, policy_version 650782 (0.00091) [2022-07-10 08:44:30,458][26022] Updated weights on worker 0-0, policy_version 650792 (0.00095) [2022-07-10 08:44:32,223][26022] Updated weights on worker 0-0, policy_version 650802 (0.00096) [2022-07-10 08:44:33,436][25689] Fps is (10 sec: 5496.6, 60 sec: 5534.5, 300 sec: 5519.3). Total num frames: 666426368. Throughput: 0: 4959.4. Samples: 666423350. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:33,436][25689] Avg episode reward: [(0, '-8.123')] [2022-07-10 08:44:34,323][26022] Updated weights on worker 0-0, policy_version 650812 (0.00894) [2022-07-10 08:44:36,024][26022] Updated weights on worker 0-0, policy_version 650822 (0.00086) [2022-07-10 08:44:37,808][26022] Updated weights on worker 0-0, policy_version 650832 (0.00088) [2022-07-10 08:44:38,515][25689] Fps is (10 sec: 5467.0, 60 sec: 5532.3, 300 sec: 5518.8). Total num frames: 666455040. Throughput: 0: 5763.1. Samples: 666456362. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:38,515][25689] Avg episode reward: [(0, '-8.323')] [2022-07-10 08:44:39,677][26022] Updated weights on worker 0-0, policy_version 650842 (0.00051) [2022-07-10 08:44:41,627][26022] Updated weights on worker 0-0, policy_version 650852 (0.00087) [2022-07-10 08:44:43,535][25689] Fps is (10 sec: 5475.4, 60 sec: 5481.3, 300 sec: 5511.7). Total num frames: 666481664. Throughput: 0: 5793.6. Samples: 666489840. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:43,535][25689] Avg episode reward: [(0, '-8.423')] [2022-07-10 08:44:43,676][26022] Updated weights on worker 0-0, policy_version 650862 (0.00093) [2022-07-10 08:44:45,208][26022] Updated weights on worker 0-0, policy_version 650872 (0.00095) [2022-07-10 08:44:47,099][26022] Updated weights on worker 0-0, policy_version 650882 (0.00095) [2022-07-10 08:44:48,563][25689] Fps is (10 sec: 5605.1, 60 sec: 5530.5, 300 sec: 5518.2). Total num frames: 666511360. Throughput: 0: 4937.0. Samples: 666506470. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:48,563][25689] Avg episode reward: [(0, '-8.149')] [2022-07-10 08:44:48,909][26022] Updated weights on worker 0-0, policy_version 650892 (0.00087) [2022-07-10 08:44:50,817][26022] Updated weights on worker 0-0, policy_version 650902 (0.00098) [2022-07-10 08:44:52,690][26022] Updated weights on worker 0-0, policy_version 650912 (0.00090) [2022-07-10 08:44:53,577][25689] Fps is (10 sec: 5608.1, 60 sec: 5514.6, 300 sec: 5523.6). Total num frames: 666537984. Throughput: 0: 5788.0. Samples: 666539930. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:53,578][25689] Avg episode reward: [(0, '-7.371')] [2022-07-10 08:44:54,320][26022] Updated weights on worker 0-0, policy_version 650922 (0.00085) [2022-07-10 08:44:56,346][26022] Updated weights on worker 0-0, policy_version 650932 (0.00086) [2022-07-10 08:44:58,088][26022] Updated weights on worker 0-0, policy_version 650942 (0.00089) [2022-07-10 08:44:58,637][25689] Fps is (10 sec: 5489.0, 60 sec: 5520.4, 300 sec: 5516.2). Total num frames: 666566656. Throughput: 0: 5814.1. Samples: 666573354. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:44:58,639][25689] Avg episode reward: [(0, '-5.719')] [2022-07-10 08:45:00,047][26022] Updated weights on worker 0-0, policy_version 650952 (0.00085) [2022-07-10 08:45:01,957][26022] Updated weights on worker 0-0, policy_version 650962 (0.00103) [2022-07-10 08:45:03,651][25689] Fps is (10 sec: 5285.7, 60 sec: 5488.8, 300 sec: 5513.0). Total num frames: 666591232. Throughput: 0: 4985.5. Samples: 666590132. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:03,652][25689] Avg episode reward: [(0, '-4.057')] [2022-07-10 08:45:04,202][26022] Updated weights on worker 0-0, policy_version 650972 (0.00091) [2022-07-10 08:45:05,650][26022] Updated weights on worker 0-0, policy_version 650982 (0.00091) [2022-07-10 08:45:07,779][26022] Updated weights on worker 0-0, policy_version 650992 (0.00086) [2022-07-10 08:45:08,659][25689] Fps is (10 sec: 5517.6, 60 sec: 5526.6, 300 sec: 5527.2). Total num frames: 666621952. Throughput: 0: 5727.7. Samples: 666621572. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:08,661][25689] Avg episode reward: [(0, '-3.969')] [2022-07-10 08:45:09,622][26022] Updated weights on worker 0-0, policy_version 651002 (0.00092) [2022-07-10 08:45:11,455][26022] Updated weights on worker 0-0, policy_version 651012 (0.00094) [2022-07-10 08:45:13,343][26022] Updated weights on worker 0-0, policy_version 651022 (0.00098) [2022-07-10 08:45:13,664][25689] Fps is (10 sec: 5624.7, 60 sec: 5512.4, 300 sec: 5518.3). Total num frames: 666647552. Throughput: 0: 5731.1. Samples: 666655050. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:13,665][25689] Avg episode reward: [(0, '-3.682')] [2022-07-10 08:45:15,243][26022] Updated weights on worker 0-0, policy_version 651032 (0.00096) [2022-07-10 08:45:16,984][26022] Updated weights on worker 0-0, policy_version 651042 (0.00084) [2022-07-10 08:45:18,798][25689] Fps is (10 sec: 5352.5, 60 sec: 5522.3, 300 sec: 5519.8). Total num frames: 666676224. Throughput: 0: 4874.7. Samples: 666671634. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:18,799][25689] Avg episode reward: [(0, '-3.430')] [2022-07-10 08:45:19,035][26022] Updated weights on worker 0-0, policy_version 651052 (0.00093) [2022-07-10 08:45:20,484][26022] Updated weights on worker 0-0, policy_version 651062 (0.00092) [2022-07-10 08:45:22,466][26022] Updated weights on worker 0-0, policy_version 651072 (0.00091) [2022-07-10 08:45:23,819][25689] Fps is (10 sec: 5748.1, 60 sec: 5539.9, 300 sec: 5526.8). Total num frames: 666705920. Throughput: 0: 5704.1. Samples: 666705168. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:23,819][25689] Avg episode reward: [(0, '-4.735')] [2022-07-10 08:45:24,121][26022] Updated weights on worker 0-0, policy_version 651082 (0.00089) [2022-07-10 08:45:26,207][26022] Updated weights on worker 0-0, policy_version 651092 (0.00086) [2022-07-10 08:45:28,053][26022] Updated weights on worker 0-0, policy_version 651102 (0.00091) [2022-07-10 08:45:28,873][25689] Fps is (10 sec: 5590.0, 60 sec: 5503.3, 300 sec: 5522.4). Total num frames: 666732544. Throughput: 0: 5791.6. Samples: 666738648. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:28,874][25689] Avg episode reward: [(0, '-4.398')] [2022-07-10 08:45:29,923][26022] Updated weights on worker 0-0, policy_version 651112 (0.00095) [2022-07-10 08:45:31,647][26022] Updated weights on worker 0-0, policy_version 651122 (0.00083) [2022-07-10 08:45:33,519][26022] Updated weights on worker 0-0, policy_version 651132 (0.00102) [2022-07-10 08:45:33,881][25689] Fps is (10 sec: 5393.4, 60 sec: 5522.7, 300 sec: 5523.1). Total num frames: 666760192. Throughput: 0: 4962.5. Samples: 666755378. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:33,882][25689] Avg episode reward: [(0, '-5.305')] [2022-07-10 08:45:35,290][26022] Updated weights on worker 0-0, policy_version 651142 (0.00089) [2022-07-10 08:45:37,203][26022] Updated weights on worker 0-0, policy_version 651152 (0.00087) [2022-07-10 08:45:38,964][25689] Fps is (10 sec: 5581.7, 60 sec: 5522.4, 300 sec: 5518.7). Total num frames: 666788864. Throughput: 0: 5805.8. Samples: 666788710. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:38,964][25689] Avg episode reward: [(0, '-6.376')] [2022-07-10 08:45:39,156][26022] Updated weights on worker 0-0, policy_version 651162 (0.00094) [2022-07-10 08:45:40,984][26022] Updated weights on worker 0-0, policy_version 651172 (0.00095) [2022-07-10 08:45:42,807][26022] Updated weights on worker 0-0, policy_version 651182 (0.00083) [2022-07-10 08:45:44,023][25689] Fps is (10 sec: 5553.5, 60 sec: 5535.8, 300 sec: 5521.1). Total num frames: 666816512. Throughput: 0: 5792.1. Samples: 666822192. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:44,023][25689] Avg episode reward: [(0, '-6.106')] [2022-07-10 08:45:44,520][26022] Updated weights on worker 0-0, policy_version 651192 (0.00089) [2022-07-10 08:45:46,439][26022] Updated weights on worker 0-0, policy_version 651202 (0.00112) [2022-07-10 08:45:48,337][26022] Updated weights on worker 0-0, policy_version 651212 (0.00084) [2022-07-10 08:45:49,048][25689] Fps is (10 sec: 5483.5, 60 sec: 5502.2, 300 sec: 5517.8). Total num frames: 666844160. Throughput: 0: 4977.5. Samples: 666839066. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:49,048][25689] Avg episode reward: [(0, '-6.554')] [2022-07-10 08:45:49,994][26022] Updated weights on worker 0-0, policy_version 651222 (0.00083) [2022-07-10 08:45:52,118][26022] Updated weights on worker 0-0, policy_version 651232 (0.00088) [2022-07-10 08:45:53,752][26022] Updated weights on worker 0-0, policy_version 651242 (0.00089) [2022-07-10 08:45:54,056][25689] Fps is (10 sec: 5613.1, 60 sec: 5536.6, 300 sec: 5526.7). Total num frames: 666872832. Throughput: 0: 5797.6. Samples: 666872344. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:54,057][25689] Avg episode reward: [(0, '-6.897')] [2022-07-10 08:45:55,613][26022] Updated weights on worker 0-0, policy_version 651252 (0.00097) [2022-07-10 08:45:57,447][26022] Updated weights on worker 0-0, policy_version 651262 (0.00091) [2022-07-10 08:45:59,110][25689] Fps is (10 sec: 5699.0, 60 sec: 5537.2, 300 sec: 5522.6). Total num frames: 666901504. Throughput: 0: 5810.9. Samples: 666905778. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:45:59,111][25689] Avg episode reward: [(0, '-7.549')] [2022-07-10 08:45:59,250][26022] Updated weights on worker 0-0, policy_version 651272 (0.01059) [2022-07-10 08:46:01,536][26022] Updated weights on worker 0-0, policy_version 651282 (0.00086) [2022-07-10 08:46:03,294][26022] Updated weights on worker 0-0, policy_version 651292 (0.00085) [2022-07-10 08:46:04,133][25689] Fps is (10 sec: 5386.2, 60 sec: 5553.3, 300 sec: 5523.1). Total num frames: 666927104. Throughput: 0: 4980.3. Samples: 666922344. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:04,133][25689] Avg episode reward: [(0, '-6.363')] [2022-07-10 08:46:05,467][26022] Updated weights on worker 0-0, policy_version 651302 (0.00093) [2022-07-10 08:46:07,133][26022] Updated weights on worker 0-0, policy_version 651312 (0.00087) [2022-07-10 08:46:08,790][26022] Updated weights on worker 0-0, policy_version 651322 (0.00088) [2022-07-10 08:46:09,157][25689] Fps is (10 sec: 5401.7, 60 sec: 5517.9, 300 sec: 5527.1). Total num frames: 666955776. Throughput: 0: 5688.0. Samples: 666953448. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:09,158][25689] Avg episode reward: [(0, '-6.400')] [2022-07-10 08:46:10,751][26022] Updated weights on worker 0-0, policy_version 651332 (0.00092) [2022-07-10 08:46:12,544][26022] Updated weights on worker 0-0, policy_version 651342 (0.00091) [2022-07-10 08:46:14,188][25689] Fps is (10 sec: 5499.1, 60 sec: 5532.5, 300 sec: 5518.2). Total num frames: 666982400. Throughput: 0: 5702.0. Samples: 666987136. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:14,189][25689] Avg episode reward: [(0, '-6.352')] [2022-07-10 08:46:14,316][26022] Updated weights on worker 0-0, policy_version 651352 (0.00085) [2022-07-10 08:46:16,596][26022] Updated weights on worker 0-0, policy_version 651362 (0.00091) [2022-07-10 08:46:17,565][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:46:17,577][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000651370_667002880.pth [2022-07-10 08:46:17,578][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000649425_665011200.pth [2022-07-10 08:46:18,183][26022] Updated weights on worker 0-0, policy_version 651372 (0.00094) [2022-07-10 08:46:19,258][25689] Fps is (10 sec: 5372.9, 60 sec: 5521.4, 300 sec: 5518.6). Total num frames: 667010048. Throughput: 0: 4866.6. Samples: 667003832. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:19,259][25689] Avg episode reward: [(0, '-6.453')] [2022-07-10 08:46:20,164][26022] Updated weights on worker 0-0, policy_version 651382 (0.00092) [2022-07-10 08:46:21,862][26022] Updated weights on worker 0-0, policy_version 651392 (0.00090) [2022-07-10 08:46:23,771][26022] Updated weights on worker 0-0, policy_version 651402 (0.00093) [2022-07-10 08:46:24,268][25689] Fps is (10 sec: 5587.6, 60 sec: 5505.5, 300 sec: 5519.9). Total num frames: 667038720. Throughput: 0: 5677.5. Samples: 667036660. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:24,269][25689] Avg episode reward: [(0, '-5.946')] [2022-07-10 08:46:25,608][26022] Updated weights on worker 0-0, policy_version 651412 (0.00086) [2022-07-10 08:46:27,506][26022] Updated weights on worker 0-0, policy_version 651422 (0.00087) [2022-07-10 08:46:29,175][26022] Updated weights on worker 0-0, policy_version 651432 (0.00090) [2022-07-10 08:46:29,284][25689] Fps is (10 sec: 5617.3, 60 sec: 5525.9, 300 sec: 5521.0). Total num frames: 667066368. Throughput: 0: 5794.9. Samples: 667070084. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:29,285][25689] Avg episode reward: [(0, '-6.281')] [2022-07-10 08:46:31,055][26022] Updated weights on worker 0-0, policy_version 651442 (0.00090) [2022-07-10 08:46:32,972][26022] Updated weights on worker 0-0, policy_version 651452 (0.00088) [2022-07-10 08:46:34,288][25689] Fps is (10 sec: 5416.2, 60 sec: 5509.3, 300 sec: 5518.9). Total num frames: 667092992. Throughput: 0: 5797.8. Samples: 667103670. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:34,288][25689] Avg episode reward: [(0, '-6.893')] [2022-07-10 08:46:34,597][26022] Updated weights on worker 0-0, policy_version 651462 (0.00094) [2022-07-10 08:46:36,634][26022] Updated weights on worker 0-0, policy_version 651472 (0.00086) [2022-07-10 08:46:38,352][26022] Updated weights on worker 0-0, policy_version 651482 (0.00058) [2022-07-10 08:46:39,334][25689] Fps is (10 sec: 5400.6, 60 sec: 5495.7, 300 sec: 5516.3). Total num frames: 667120640. Throughput: 0: 5804.6. Samples: 667120362. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:39,334][25689] Avg episode reward: [(0, '-5.124')] [2022-07-10 08:46:40,503][26022] Updated weights on worker 0-0, policy_version 651492 (0.00094) [2022-07-10 08:46:42,258][26022] Updated weights on worker 0-0, policy_version 651502 (0.00094) [2022-07-10 08:46:43,947][26022] Updated weights on worker 0-0, policy_version 651512 (0.00096) [2022-07-10 08:46:44,350][25689] Fps is (10 sec: 5699.1, 60 sec: 5533.6, 300 sec: 5520.3). Total num frames: 667150336. Throughput: 0: 5824.5. Samples: 667153628. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:44,350][25689] Avg episode reward: [(0, '-4.056')] [2022-07-10 08:46:45,950][26022] Updated weights on worker 0-0, policy_version 651522 (0.00092) [2022-07-10 08:46:47,511][26022] Updated weights on worker 0-0, policy_version 651532 (0.00096) [2022-07-10 08:46:49,368][25689] Fps is (10 sec: 5612.6, 60 sec: 5517.2, 300 sec: 5520.1). Total num frames: 667176960. Throughput: 0: 5837.6. Samples: 667187326. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:49,369][25689] Avg episode reward: [(0, '-3.635')] [2022-07-10 08:46:49,632][26022] Updated weights on worker 0-0, policy_version 651542 (0.00091) [2022-07-10 08:46:51,240][26022] Updated weights on worker 0-0, policy_version 651552 (0.00087) [2022-07-10 08:46:53,384][26022] Updated weights on worker 0-0, policy_version 651562 (0.00090) [2022-07-10 08:46:54,397][25689] Fps is (10 sec: 5503.5, 60 sec: 5515.3, 300 sec: 5517.6). Total num frames: 667205632. Throughput: 0: 4991.0. Samples: 667204036. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 08:46:54,398][25689] Avg episode reward: [(0, '-4.025')] [2022-07-10 08:46:55,006][26022] Updated weights on worker 0-0, policy_version 651572 (0.00090) [2022-07-10 08:46:57,063][26022] Updated weights on worker 0-0, policy_version 651582 (0.00090) [2022-07-10 08:46:58,646][26022] Updated weights on worker 0-0, policy_version 651592 (0.00094) [2022-07-10 08:46:59,443][25689] Fps is (10 sec: 5590.3, 60 sec: 5499.1, 300 sec: 5527.6). Total num frames: 667233280. Throughput: 0: 5808.9. Samples: 667237174. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:46:59,444][25689] Avg episode reward: [(0, '-3.914')] [2022-07-10 08:47:00,858][26022] Updated weights on worker 0-0, policy_version 651602 (0.00085) [2022-07-10 08:47:02,768][26022] Updated weights on worker 0-0, policy_version 651612 (0.00089) [2022-07-10 08:47:04,459][25689] Fps is (10 sec: 5190.5, 60 sec: 5482.7, 300 sec: 5517.7). Total num frames: 667257856. Throughput: 0: 5701.2. Samples: 667268272. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:04,459][25689] Avg episode reward: [(0, '-4.202')] [2022-07-10 08:47:04,967][26022] Updated weights on worker 0-0, policy_version 651623 (0.00096) [2022-07-10 08:47:06,717][26022] Updated weights on worker 0-0, policy_version 651633 (0.00089) [2022-07-10 08:47:08,746][26022] Updated weights on worker 0-0, policy_version 651643 (0.00079) [2022-07-10 08:47:09,483][25689] Fps is (10 sec: 5303.6, 60 sec: 5482.8, 300 sec: 5521.1). Total num frames: 667286528. Throughput: 0: 4847.9. Samples: 667284840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:09,483][25689] Avg episode reward: [(0, '-5.000')] [2022-07-10 08:47:10,179][26022] Updated weights on worker 0-0, policy_version 651653 (0.00081) [2022-07-10 08:47:12,424][26022] Updated weights on worker 0-0, policy_version 651663 (0.00083) [2022-07-10 08:47:14,066][26022] Updated weights on worker 0-0, policy_version 651673 (0.00093) [2022-07-10 08:47:14,491][25689] Fps is (10 sec: 5614.1, 60 sec: 5501.9, 300 sec: 5520.1). Total num frames: 667314176. Throughput: 0: 5676.9. Samples: 667318104. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:14,491][25689] Avg episode reward: [(0, '-4.738')] [2022-07-10 08:47:16,047][26022] Updated weights on worker 0-0, policy_version 651683 (0.00087) [2022-07-10 08:47:17,874][26022] Updated weights on worker 0-0, policy_version 651693 (0.00053) [2022-07-10 08:47:19,582][25689] Fps is (10 sec: 5576.8, 60 sec: 5516.9, 300 sec: 5522.1). Total num frames: 667342848. Throughput: 0: 5656.1. Samples: 667351082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:19,582][25689] Avg episode reward: [(0, '-3.785')] [2022-07-10 08:47:19,799][26022] Updated weights on worker 0-0, policy_version 651703 (0.00087) [2022-07-10 08:47:21,608][26022] Updated weights on worker 0-0, policy_version 651713 (0.00097) [2022-07-10 08:47:23,274][26022] Updated weights on worker 0-0, policy_version 651723 (0.00086) [2022-07-10 08:47:24,616][25689] Fps is (10 sec: 5461.0, 60 sec: 5480.7, 300 sec: 5515.2). Total num frames: 667369472. Throughput: 0: 4929.7. Samples: 667367642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:24,617][25689] Avg episode reward: [(0, '-3.119')] [2022-07-10 08:47:25,347][26022] Updated weights on worker 0-0, policy_version 651733 (0.00086) [2022-07-10 08:47:27,187][26022] Updated weights on worker 0-0, policy_version 651743 (0.00096) [2022-07-10 08:47:29,043][26022] Updated weights on worker 0-0, policy_version 651753 (0.00083) [2022-07-10 08:47:29,627][25689] Fps is (10 sec: 5606.6, 60 sec: 5515.2, 300 sec: 5522.1). Total num frames: 667399168. Throughput: 0: 5761.8. Samples: 667400908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:29,627][25689] Avg episode reward: [(0, '-2.678')] [2022-07-10 08:47:30,836][26022] Updated weights on worker 0-0, policy_version 651763 (0.00084) [2022-07-10 08:47:32,601][26022] Updated weights on worker 0-0, policy_version 651773 (0.00086) [2022-07-10 08:47:34,479][26022] Updated weights on worker 0-0, policy_version 651783 (0.00058) [2022-07-10 08:47:34,650][25689] Fps is (10 sec: 5613.1, 60 sec: 5513.4, 300 sec: 5517.0). Total num frames: 667425792. Throughput: 0: 5782.5. Samples: 667434674. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:34,650][25689] Avg episode reward: [(0, '-1.987')] [2022-07-10 08:47:36,307][26022] Updated weights on worker 0-0, policy_version 651793 (0.00083) [2022-07-10 08:47:38,094][26022] Updated weights on worker 0-0, policy_version 651803 (0.00084) [2022-07-10 08:47:39,744][25689] Fps is (10 sec: 5567.0, 60 sec: 5542.9, 300 sec: 5519.2). Total num frames: 667455488. Throughput: 0: 4970.3. Samples: 667451294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:39,746][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 08:47:40,118][26022] Updated weights on worker 0-0, policy_version 651813 (0.00103) [2022-07-10 08:47:41,844][26022] Updated weights on worker 0-0, policy_version 651823 (0.00092) [2022-07-10 08:47:43,689][26022] Updated weights on worker 0-0, policy_version 651833 (0.00090) [2022-07-10 08:47:44,804][25689] Fps is (10 sec: 5647.5, 60 sec: 5505.0, 300 sec: 5518.8). Total num frames: 667483136. Throughput: 0: 5800.1. Samples: 667484732. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:44,804][25689] Avg episode reward: [(0, '-2.845')] [2022-07-10 08:47:45,604][26022] Updated weights on worker 0-0, policy_version 651843 (0.00091) [2022-07-10 08:47:47,221][26022] Updated weights on worker 0-0, policy_version 651853 (0.00094) [2022-07-10 08:47:49,200][26022] Updated weights on worker 0-0, policy_version 651863 (0.00088) [2022-07-10 08:47:49,809][25689] Fps is (10 sec: 5595.7, 60 sec: 5540.1, 300 sec: 5522.6). Total num frames: 667511808. Throughput: 0: 5817.0. Samples: 667518304. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:49,809][25689] Avg episode reward: [(0, '-2.385')] [2022-07-10 08:47:51,180][26022] Updated weights on worker 0-0, policy_version 651873 (0.00086) [2022-07-10 08:47:52,713][26022] Updated weights on worker 0-0, policy_version 651883 (0.00083) [2022-07-10 08:47:54,836][25689] Fps is (10 sec: 5409.6, 60 sec: 5489.5, 300 sec: 5516.6). Total num frames: 667537408. Throughput: 0: 4978.4. Samples: 667535170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:54,837][25689] Avg episode reward: [(0, '-3.101')] [2022-07-10 08:47:54,864][26022] Updated weights on worker 0-0, policy_version 651893 (0.00196) [2022-07-10 08:47:56,476][26022] Updated weights on worker 0-0, policy_version 651903 (0.00093) [2022-07-10 08:47:58,411][26022] Updated weights on worker 0-0, policy_version 651913 (0.00094) [2022-07-10 08:47:59,939][25689] Fps is (10 sec: 5357.2, 60 sec: 5501.1, 300 sec: 5522.3). Total num frames: 667566080. Throughput: 0: 5808.9. Samples: 667568608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:47:59,940][25689] Avg episode reward: [(0, '-2.876')] [2022-07-10 08:48:00,222][26022] Updated weights on worker 0-0, policy_version 651923 (0.00090) [2022-07-10 08:48:02,100][26022] Updated weights on worker 0-0, policy_version 651933 (0.00097) [2022-07-10 08:48:04,189][26022] Updated weights on worker 0-0, policy_version 651943 (0.00078) [2022-07-10 08:48:04,987][25689] Fps is (10 sec: 5548.4, 60 sec: 5549.0, 300 sec: 5519.3). Total num frames: 667593728. Throughput: 0: 5714.9. Samples: 667600076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:04,987][25689] Avg episode reward: [(0, '-2.832')] [2022-07-10 08:48:05,927][26022] Updated weights on worker 0-0, policy_version 651953 (0.00088) [2022-07-10 08:48:07,760][26022] Updated weights on worker 0-0, policy_version 651963 (0.00081) [2022-07-10 08:48:09,944][26022] Updated weights on worker 0-0, policy_version 651973 (0.00093) [2022-07-10 08:48:10,054][25689] Fps is (10 sec: 5365.5, 60 sec: 5511.2, 300 sec: 5518.4). Total num frames: 667620352. Throughput: 0: 4863.3. Samples: 667616764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:10,055][25689] Avg episode reward: [(0, '-1.399')] [2022-07-10 08:48:11,426][26022] Updated weights on worker 0-0, policy_version 651983 (0.00081) [2022-07-10 08:48:13,520][26022] Updated weights on worker 0-0, policy_version 651993 (0.00086) [2022-07-10 08:48:15,099][25689] Fps is (10 sec: 5569.6, 60 sec: 5541.7, 300 sec: 5519.4). Total num frames: 667650048. Throughput: 0: 5677.6. Samples: 667650214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:15,099][25689] Avg episode reward: [(0, '-1.810')] [2022-07-10 08:48:15,215][26022] Updated weights on worker 0-0, policy_version 652003 (0.00084) [2022-07-10 08:48:17,058][26022] Updated weights on worker 0-0, policy_version 652013 (0.00089) [2022-07-10 08:48:17,672][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:48:17,683][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000652017_667665408.pth [2022-07-10 08:48:17,695][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000650073_665674752.pth [2022-07-10 08:48:18,910][26022] Updated weights on worker 0-0, policy_version 652023 (0.00092) [2022-07-10 08:48:20,164][25689] Fps is (10 sec: 5671.9, 60 sec: 5527.1, 300 sec: 5521.8). Total num frames: 667677696. Throughput: 0: 5690.7. Samples: 667683704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:20,165][25689] Avg episode reward: [(0, '-1.380')] [2022-07-10 08:48:20,670][26022] Updated weights on worker 0-0, policy_version 652033 (0.00093) [2022-07-10 08:48:22,576][26022] Updated weights on worker 0-0, policy_version 652043 (0.00090) [2022-07-10 08:48:24,455][26022] Updated weights on worker 0-0, policy_version 652053 (0.00090) [2022-07-10 08:48:25,200][25689] Fps is (10 sec: 5575.5, 60 sec: 5560.8, 300 sec: 5521.6). Total num frames: 667706368. Throughput: 0: 4971.1. Samples: 667700558. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:25,202][25689] Avg episode reward: [(0, '-1.158')] [2022-07-10 08:48:26,339][26022] Updated weights on worker 0-0, policy_version 652063 (0.00096) [2022-07-10 08:48:28,285][26022] Updated weights on worker 0-0, policy_version 652073 (0.00090) [2022-07-10 08:48:29,895][26022] Updated weights on worker 0-0, policy_version 652083 (0.00093) [2022-07-10 08:48:30,285][25689] Fps is (10 sec: 5564.7, 60 sec: 5520.2, 300 sec: 5523.8). Total num frames: 667734016. Throughput: 0: 5781.8. Samples: 667733736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:30,286][25689] Avg episode reward: [(0, '-0.907')] [2022-07-10 08:48:32,018][26022] Updated weights on worker 0-0, policy_version 652093 (0.00091) [2022-07-10 08:48:33,724][26022] Updated weights on worker 0-0, policy_version 652103 (0.00089) [2022-07-10 08:48:35,292][25689] Fps is (10 sec: 5479.2, 60 sec: 5538.6, 300 sec: 5521.2). Total num frames: 667761664. Throughput: 0: 5781.4. Samples: 667766960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:35,292][25689] Avg episode reward: [(0, '-1.111')] [2022-07-10 08:48:35,689][26022] Updated weights on worker 0-0, policy_version 652113 (0.00089) [2022-07-10 08:48:37,291][26022] Updated weights on worker 0-0, policy_version 652123 (0.00109) [2022-07-10 08:48:39,229][26022] Updated weights on worker 0-0, policy_version 652133 (0.00079) [2022-07-10 08:48:40,363][25689] Fps is (10 sec: 5588.9, 60 sec: 5523.8, 300 sec: 5516.8). Total num frames: 667790336. Throughput: 0: 4943.1. Samples: 667783546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:40,366][25689] Avg episode reward: [(0, '-1.075')] [2022-07-10 08:48:41,172][26022] Updated weights on worker 0-0, policy_version 652143 (0.00087) [2022-07-10 08:48:42,854][26022] Updated weights on worker 0-0, policy_version 652153 (0.00081) [2022-07-10 08:48:44,674][26022] Updated weights on worker 0-0, policy_version 652163 (0.00082) [2022-07-10 08:48:45,473][25689] Fps is (10 sec: 5531.8, 60 sec: 5519.2, 300 sec: 5518.4). Total num frames: 667817984. Throughput: 0: 5739.4. Samples: 667816914. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:45,474][25689] Avg episode reward: [(0, '-1.095')] [2022-07-10 08:48:46,591][26022] Updated weights on worker 0-0, policy_version 652173 (0.00086) [2022-07-10 08:48:48,327][26022] Updated weights on worker 0-0, policy_version 652183 (0.00059) [2022-07-10 08:48:50,450][26022] Updated weights on worker 0-0, policy_version 652193 (0.00086) [2022-07-10 08:48:50,493][25689] Fps is (10 sec: 5559.5, 60 sec: 5517.9, 300 sec: 5521.9). Total num frames: 667846656. Throughput: 0: 5786.0. Samples: 667850656. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:50,494][25689] Avg episode reward: [(0, '-2.378')] [2022-07-10 08:48:52,053][26022] Updated weights on worker 0-0, policy_version 652203 (0.00092) [2022-07-10 08:48:54,078][26022] Updated weights on worker 0-0, policy_version 652213 (0.00085) [2022-07-10 08:48:55,527][25689] Fps is (10 sec: 5499.9, 60 sec: 5534.1, 300 sec: 5516.7). Total num frames: 667873280. Throughput: 0: 4942.3. Samples: 667866968. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:48:55,529][25689] Avg episode reward: [(0, '-3.240')] [2022-07-10 08:48:55,836][26022] Updated weights on worker 0-0, policy_version 652223 (0.00090) [2022-07-10 08:48:57,933][26022] Updated weights on worker 0-0, policy_version 652233 (0.00088) [2022-07-10 08:48:59,455][26022] Updated weights on worker 0-0, policy_version 652243 (0.00087) [2022-07-10 08:49:00,620][25689] Fps is (10 sec: 5460.0, 60 sec: 5535.0, 300 sec: 5522.6). Total num frames: 667901952. Throughput: 0: 5760.7. Samples: 667900246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:00,621][25689] Avg episode reward: [(0, '-5.024')] [2022-07-10 08:49:01,637][26022] Updated weights on worker 0-0, policy_version 652253 (0.00086) [2022-07-10 08:49:03,480][26022] Updated weights on worker 0-0, policy_version 652263 (0.00094) [2022-07-10 08:49:05,532][26022] Updated weights on worker 0-0, policy_version 652273 (0.00085) [2022-07-10 08:49:05,625][25689] Fps is (10 sec: 5374.7, 60 sec: 5505.2, 300 sec: 5513.1). Total num frames: 667927552. Throughput: 0: 5690.9. Samples: 667931596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:05,625][25689] Avg episode reward: [(0, '-4.744')] [2022-07-10 08:49:07,543][26022] Updated weights on worker 0-0, policy_version 652283 (0.00088) [2022-07-10 08:49:09,069][26022] Updated weights on worker 0-0, policy_version 652293 (0.00086) [2022-07-10 08:49:10,649][25689] Fps is (10 sec: 5411.6, 60 sec: 5542.9, 300 sec: 5520.2). Total num frames: 667956224. Throughput: 0: 5689.1. Samples: 667965328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:10,652][25689] Avg episode reward: [(0, '-4.810')] [2022-07-10 08:49:11,151][26022] Updated weights on worker 0-0, policy_version 652303 (0.00095) [2022-07-10 08:49:12,682][26022] Updated weights on worker 0-0, policy_version 652313 (0.00092) [2022-07-10 08:49:14,571][26022] Updated weights on worker 0-0, policy_version 652323 (0.00088) [2022-07-10 08:49:15,658][25689] Fps is (10 sec: 5715.5, 60 sec: 5529.3, 300 sec: 5524.5). Total num frames: 667984896. Throughput: 0: 5719.2. Samples: 667982100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:15,658][25689] Avg episode reward: [(0, '-5.086')] [2022-07-10 08:49:16,647][26022] Updated weights on worker 0-0, policy_version 652333 (0.00099) [2022-07-10 08:49:18,143][26022] Updated weights on worker 0-0, policy_version 652343 (0.00095) [2022-07-10 08:49:20,387][26022] Updated weights on worker 0-0, policy_version 652353 (0.00088) [2022-07-10 08:49:20,695][25689] Fps is (10 sec: 5402.6, 60 sec: 5498.1, 300 sec: 5514.0). Total num frames: 668010496. Throughput: 0: 5725.8. Samples: 668015188. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:20,695][25689] Avg episode reward: [(0, '-3.131')] [2022-07-10 08:49:22,019][26022] Updated weights on worker 0-0, policy_version 652363 (0.00086) [2022-07-10 08:49:23,987][26022] Updated weights on worker 0-0, policy_version 652373 (0.00088) [2022-07-10 08:49:25,636][26022] Updated weights on worker 0-0, policy_version 652383 (0.00094) [2022-07-10 08:49:25,733][25689] Fps is (10 sec: 5488.5, 60 sec: 5514.8, 300 sec: 5517.2). Total num frames: 668040192. Throughput: 0: 5805.4. Samples: 668048330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:25,733][25689] Avg episode reward: [(0, '-2.734')] [2022-07-10 08:49:27,695][26022] Updated weights on worker 0-0, policy_version 652393 (0.00087) [2022-07-10 08:49:29,382][26022] Updated weights on worker 0-0, policy_version 652403 (0.00085) [2022-07-10 08:49:30,752][25689] Fps is (10 sec: 5599.9, 60 sec: 5503.9, 300 sec: 5517.5). Total num frames: 668066816. Throughput: 0: 4956.7. Samples: 668064972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:30,753][25689] Avg episode reward: [(0, '-0.988')] [2022-07-10 08:49:31,418][26022] Updated weights on worker 0-0, policy_version 652413 (0.00085) [2022-07-10 08:49:33,047][26022] Updated weights on worker 0-0, policy_version 652423 (0.00092) [2022-07-10 08:49:35,116][26022] Updated weights on worker 0-0, policy_version 652433 (0.00093) [2022-07-10 08:49:35,787][25689] Fps is (10 sec: 5601.4, 60 sec: 5535.1, 300 sec: 5521.8). Total num frames: 668096512. Throughput: 0: 5791.1. Samples: 668098672. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:35,788][25689] Avg episode reward: [(0, '-1.253')] [2022-07-10 08:49:36,682][26022] Updated weights on worker 0-0, policy_version 652443 (0.00092) [2022-07-10 08:49:38,816][26022] Updated weights on worker 0-0, policy_version 652453 (0.01191) [2022-07-10 08:49:40,566][26022] Updated weights on worker 0-0, policy_version 652463 (0.00099) [2022-07-10 08:49:40,916][25689] Fps is (10 sec: 5541.2, 60 sec: 5496.0, 300 sec: 5519.8). Total num frames: 668123136. Throughput: 0: 5765.9. Samples: 668131782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:40,916][25689] Avg episode reward: [(0, '-2.830')] [2022-07-10 08:49:42,472][26022] Updated weights on worker 0-0, policy_version 652473 (0.00089) [2022-07-10 08:49:44,016][26022] Updated weights on worker 0-0, policy_version 652483 (0.00089) [2022-07-10 08:49:45,958][25689] Fps is (10 sec: 5336.2, 60 sec: 5502.2, 300 sec: 5512.6). Total num frames: 668150784. Throughput: 0: 4954.9. Samples: 668148544. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:45,958][25689] Avg episode reward: [(0, '-4.774')] [2022-07-10 08:49:46,269][26022] Updated weights on worker 0-0, policy_version 652493 (0.00086) [2022-07-10 08:49:47,723][26022] Updated weights on worker 0-0, policy_version 652503 (0.00093) [2022-07-10 08:49:49,771][26022] Updated weights on worker 0-0, policy_version 652513 (0.00090) [2022-07-10 08:49:50,991][25689] Fps is (10 sec: 5793.1, 60 sec: 5534.8, 300 sec: 5526.0). Total num frames: 668181504. Throughput: 0: 5795.7. Samples: 668182272. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:50,993][25689] Avg episode reward: [(0, '-6.443')] [2022-07-10 08:49:51,408][26022] Updated weights on worker 0-0, policy_version 652523 (0.00084) [2022-07-10 08:49:53,486][26022] Updated weights on worker 0-0, policy_version 652533 (0.00085) [2022-07-10 08:49:55,314][26022] Updated weights on worker 0-0, policy_version 652543 (0.00084) [2022-07-10 08:49:56,000][25689] Fps is (10 sec: 5608.3, 60 sec: 5520.2, 300 sec: 5516.6). Total num frames: 668207104. Throughput: 0: 5787.9. Samples: 668215660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:49:56,001][25689] Avg episode reward: [(0, '-7.468')] [2022-07-10 08:49:57,123][26022] Updated weights on worker 0-0, policy_version 652553 (0.00086) [2022-07-10 08:49:59,041][26022] Updated weights on worker 0-0, policy_version 652563 (0.00083) [2022-07-10 08:50:00,999][26022] Updated weights on worker 0-0, policy_version 652573 (0.00084) [2022-07-10 08:50:01,079][25689] Fps is (10 sec: 5278.7, 60 sec: 5504.6, 300 sec: 5525.8). Total num frames: 668234752. Throughput: 0: 4971.9. Samples: 668232030. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:50:01,079][25689] Avg episode reward: [(0, '-7.342')] [2022-07-10 08:50:02,874][26022] Updated weights on worker 0-0, policy_version 652583 (0.00091) [2022-07-10 08:50:05,023][26022] Updated weights on worker 0-0, policy_version 652593 (0.00084) [2022-07-10 08:50:06,115][25689] Fps is (10 sec: 5466.5, 60 sec: 5535.5, 300 sec: 5514.9). Total num frames: 668262400. Throughput: 0: 5694.4. Samples: 668263330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:50:06,117][25689] Avg episode reward: [(0, '-6.902')] [2022-07-10 08:50:06,580][26022] Updated weights on worker 0-0, policy_version 652603 (0.00086) [2022-07-10 08:50:08,703][26022] Updated weights on worker 0-0, policy_version 652613 (0.00087) [2022-07-10 08:50:10,255][26022] Updated weights on worker 0-0, policy_version 652623 (0.00090) [2022-07-10 08:50:11,139][25689] Fps is (10 sec: 5496.6, 60 sec: 5518.7, 300 sec: 5521.5). Total num frames: 668290048. Throughput: 0: 5691.1. Samples: 668296932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 08:50:11,139][25689] Avg episode reward: [(0, '-5.992')] [2022-07-10 08:50:12,290][26022] Updated weights on worker 0-0, policy_version 652633 (0.00082) [2022-07-10 08:50:13,985][26022] Updated weights on worker 0-0, policy_version 652643 (0.00093) [2022-07-10 08:50:15,989][26022] Updated weights on worker 0-0, policy_version 652653 (0.00091) [2022-07-10 08:50:16,151][25689] Fps is (10 sec: 5408.0, 60 sec: 5484.5, 300 sec: 5516.8). Total num frames: 668316672. Throughput: 0: 4865.4. Samples: 668313702. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:16,152][25689] Avg episode reward: [(0, '-4.734')] [2022-07-10 08:50:17,591][26022] Updated weights on worker 0-0, policy_version 652663 (0.00094) [2022-07-10 08:50:17,843][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:50:17,854][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000652664_668327936.pth [2022-07-10 08:50:17,855][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000650722_666339328.pth [2022-07-10 08:50:19,750][26022] Updated weights on worker 0-0, policy_version 652673 (0.00083) [2022-07-10 08:50:21,205][25689] Fps is (10 sec: 5492.9, 60 sec: 5533.7, 300 sec: 5512.8). Total num frames: 668345344. Throughput: 0: 5713.0. Samples: 668347014. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:21,206][25689] Avg episode reward: [(0, '-3.552')] [2022-07-10 08:50:21,425][26022] Updated weights on worker 0-0, policy_version 652683 (0.00088) [2022-07-10 08:50:23,447][26022] Updated weights on worker 0-0, policy_version 652693 (0.00084) [2022-07-10 08:50:24,984][26022] Updated weights on worker 0-0, policy_version 652703 (0.00089) [2022-07-10 08:50:26,208][25689] Fps is (10 sec: 5702.0, 60 sec: 5520.0, 300 sec: 5520.6). Total num frames: 668374016. Throughput: 0: 5838.1. Samples: 668380632. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:26,208][25689] Avg episode reward: [(0, '-2.937')] [2022-07-10 08:50:27,129][26022] Updated weights on worker 0-0, policy_version 652713 (0.00093) [2022-07-10 08:50:28,712][26022] Updated weights on worker 0-0, policy_version 652723 (0.00088) [2022-07-10 08:50:30,856][26022] Updated weights on worker 0-0, policy_version 652733 (0.00095) [2022-07-10 08:50:31,213][25689] Fps is (10 sec: 5525.3, 60 sec: 5521.3, 300 sec: 5517.2). Total num frames: 668400640. Throughput: 0: 4987.6. Samples: 668397056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:31,214][25689] Avg episode reward: [(0, '-2.847')] [2022-07-10 08:50:32,426][26022] Updated weights on worker 0-0, policy_version 652743 (0.00094) [2022-07-10 08:50:34,536][26022] Updated weights on worker 0-0, policy_version 652753 (0.00089) [2022-07-10 08:50:36,031][26022] Updated weights on worker 0-0, policy_version 652763 (0.00082) [2022-07-10 08:50:36,224][25689] Fps is (10 sec: 5520.9, 60 sec: 5506.6, 300 sec: 5518.6). Total num frames: 668429312. Throughput: 0: 5800.2. Samples: 668430128. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:36,224][25689] Avg episode reward: [(0, '-3.116')] [2022-07-10 08:50:38,140][26022] Updated weights on worker 0-0, policy_version 652773 (0.00093) [2022-07-10 08:50:40,005][26022] Updated weights on worker 0-0, policy_version 652783 (0.00090) [2022-07-10 08:50:41,263][25689] Fps is (10 sec: 5604.3, 60 sec: 5531.8, 300 sec: 5518.9). Total num frames: 668456960. Throughput: 0: 5803.5. Samples: 668463416. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:41,263][25689] Avg episode reward: [(0, '-3.306')] [2022-07-10 08:50:41,959][26022] Updated weights on worker 0-0, policy_version 652793 (0.00084) [2022-07-10 08:50:43,552][26022] Updated weights on worker 0-0, policy_version 652803 (0.00089) [2022-07-10 08:50:45,558][26022] Updated weights on worker 0-0, policy_version 652813 (0.00088) [2022-07-10 08:50:46,270][25689] Fps is (10 sec: 5402.1, 60 sec: 5517.9, 300 sec: 5515.8). Total num frames: 668483584. Throughput: 0: 4963.7. Samples: 668480214. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:46,271][25689] Avg episode reward: [(0, '-3.094')] [2022-07-10 08:50:47,298][26022] Updated weights on worker 0-0, policy_version 652823 (0.00087) [2022-07-10 08:50:49,200][26022] Updated weights on worker 0-0, policy_version 652833 (0.00085) [2022-07-10 08:50:51,144][26022] Updated weights on worker 0-0, policy_version 652843 (0.00086) [2022-07-10 08:50:51,290][25689] Fps is (10 sec: 5514.5, 60 sec: 5485.2, 300 sec: 5515.6). Total num frames: 668512256. Throughput: 0: 5815.8. Samples: 668513818. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:51,292][25689] Avg episode reward: [(0, '-3.042')] [2022-07-10 08:50:52,878][26022] Updated weights on worker 0-0, policy_version 652853 (0.00087) [2022-07-10 08:50:54,776][26022] Updated weights on worker 0-0, policy_version 652863 (0.00095) [2022-07-10 08:50:56,297][25689] Fps is (10 sec: 5617.1, 60 sec: 5519.4, 300 sec: 5513.0). Total num frames: 668539904. Throughput: 0: 5837.2. Samples: 668547298. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:50:56,297][25689] Avg episode reward: [(0, '-2.497')] [2022-07-10 08:50:56,664][26022] Updated weights on worker 0-0, policy_version 652873 (0.00085) [2022-07-10 08:50:58,270][26022] Updated weights on worker 0-0, policy_version 652883 (0.00089) [2022-07-10 08:51:00,447][26022] Updated weights on worker 0-0, policy_version 652893 (0.00083) [2022-07-10 08:51:01,354][25689] Fps is (10 sec: 5494.6, 60 sec: 5521.4, 300 sec: 5519.3). Total num frames: 668567552. Throughput: 0: 5000.1. Samples: 668563874. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:01,354][25689] Avg episode reward: [(0, '-3.200')] [2022-07-10 08:51:02,474][26022] Updated weights on worker 0-0, policy_version 652903 (0.00084) [2022-07-10 08:51:04,193][26022] Updated weights on worker 0-0, policy_version 652913 (0.00086) [2022-07-10 08:51:06,131][26022] Updated weights on worker 0-0, policy_version 652923 (0.00086) [2022-07-10 08:51:06,374][25689] Fps is (10 sec: 5385.6, 60 sec: 5505.9, 300 sec: 5512.5). Total num frames: 668594176. Throughput: 0: 5733.5. Samples: 668595478. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:06,374][25689] Avg episode reward: [(0, '-3.679')] [2022-07-10 08:51:07,814][26022] Updated weights on worker 0-0, policy_version 652933 (0.00093) [2022-07-10 08:51:09,779][26022] Updated weights on worker 0-0, policy_version 652943 (0.00086) [2022-07-10 08:51:11,383][25689] Fps is (10 sec: 5513.7, 60 sec: 5524.2, 300 sec: 5519.8). Total num frames: 668622848. Throughput: 0: 5730.3. Samples: 668628954. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:11,383][25689] Avg episode reward: [(0, '-3.751')] [2022-07-10 08:51:11,592][26022] Updated weights on worker 0-0, policy_version 652953 (0.00090) [2022-07-10 08:51:13,499][26022] Updated weights on worker 0-0, policy_version 652963 (0.00093) [2022-07-10 08:51:15,405][26022] Updated weights on worker 0-0, policy_version 652973 (0.00100) [2022-07-10 08:51:16,389][25689] Fps is (10 sec: 5623.6, 60 sec: 5541.8, 300 sec: 5521.0). Total num frames: 668650496. Throughput: 0: 4893.2. Samples: 668645612. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:16,392][25689] Avg episode reward: [(0, '-4.327')] [2022-07-10 08:51:17,071][26022] Updated weights on worker 0-0, policy_version 652983 (0.00095) [2022-07-10 08:51:19,140][26022] Updated weights on worker 0-0, policy_version 652993 (0.00091) [2022-07-10 08:51:20,720][26022] Updated weights on worker 0-0, policy_version 653003 (0.00086) [2022-07-10 08:51:21,470][25689] Fps is (10 sec: 5481.7, 60 sec: 5522.3, 300 sec: 5516.2). Total num frames: 668678144. Throughput: 0: 5722.9. Samples: 668678996. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:21,470][25689] Avg episode reward: [(0, '-4.996')] [2022-07-10 08:51:22,757][26022] Updated weights on worker 0-0, policy_version 653013 (0.00088) [2022-07-10 08:51:24,586][26022] Updated weights on worker 0-0, policy_version 653023 (0.00092) [2022-07-10 08:51:26,376][26022] Updated weights on worker 0-0, policy_version 653033 (0.00081) [2022-07-10 08:51:26,531][25689] Fps is (10 sec: 5553.1, 60 sec: 5517.0, 300 sec: 5518.8). Total num frames: 668706816. Throughput: 0: 5803.3. Samples: 668712454. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:26,531][25689] Avg episode reward: [(0, '-4.596')] [2022-07-10 08:51:28,125][26022] Updated weights on worker 0-0, policy_version 653043 (0.00098) [2022-07-10 08:51:30,152][26022] Updated weights on worker 0-0, policy_version 653053 (0.00100) [2022-07-10 08:51:31,540][25689] Fps is (10 sec: 5592.7, 60 sec: 5533.6, 300 sec: 5522.2). Total num frames: 668734464. Throughput: 0: 4970.3. Samples: 668729144. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:31,541][25689] Avg episode reward: [(0, '-6.130')] [2022-07-10 08:51:31,902][26022] Updated weights on worker 0-0, policy_version 653063 (0.00086) [2022-07-10 08:51:33,908][26022] Updated weights on worker 0-0, policy_version 653073 (0.00099) [2022-07-10 08:51:35,536][26022] Updated weights on worker 0-0, policy_version 653083 (0.00088) [2022-07-10 08:51:36,542][25689] Fps is (10 sec: 5318.9, 60 sec: 5483.5, 300 sec: 5516.1). Total num frames: 668760064. Throughput: 0: 5787.6. Samples: 668762250. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:36,543][25689] Avg episode reward: [(0, '-6.924')] [2022-07-10 08:51:37,438][26022] Updated weights on worker 0-0, policy_version 653093 (0.00081) [2022-07-10 08:51:39,539][26022] Updated weights on worker 0-0, policy_version 653103 (0.00086) [2022-07-10 08:51:41,145][26022] Updated weights on worker 0-0, policy_version 653113 (0.00086) [2022-07-10 08:51:41,635][25689] Fps is (10 sec: 5579.3, 60 sec: 5529.5, 300 sec: 5518.1). Total num frames: 668790784. Throughput: 0: 5778.8. Samples: 668795524. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:41,635][25689] Avg episode reward: [(0, '-6.862')] [2022-07-10 08:51:43,346][26022] Updated weights on worker 0-0, policy_version 653123 (0.00085) [2022-07-10 08:51:44,824][26022] Updated weights on worker 0-0, policy_version 653133 (0.00102) [2022-07-10 08:51:46,669][25689] Fps is (10 sec: 5662.4, 60 sec: 5527.0, 300 sec: 5517.8). Total num frames: 668817408. Throughput: 0: 4953.7. Samples: 668812214. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:46,669][25689] Avg episode reward: [(0, '-5.947')] [2022-07-10 08:51:46,750][26022] Updated weights on worker 0-0, policy_version 653143 (0.00095) [2022-07-10 08:51:48,373][26022] Updated weights on worker 0-0, policy_version 653153 (0.00079) [2022-07-10 08:51:50,243][26022] Updated weights on worker 0-0, policy_version 653163 (0.00086) [2022-07-10 08:51:51,701][25689] Fps is (10 sec: 5492.9, 60 sec: 5525.8, 300 sec: 5517.7). Total num frames: 668846080. Throughput: 0: 5796.1. Samples: 668846000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:51,702][25689] Avg episode reward: [(0, '-5.992')] [2022-07-10 08:51:52,113][26022] Updated weights on worker 0-0, policy_version 653173 (0.00087) [2022-07-10 08:51:54,189][26022] Updated weights on worker 0-0, policy_version 653183 (0.00085) [2022-07-10 08:51:55,762][26022] Updated weights on worker 0-0, policy_version 653193 (0.00097) [2022-07-10 08:51:56,724][25689] Fps is (10 sec: 5703.2, 60 sec: 5541.4, 300 sec: 5521.6). Total num frames: 668874752. Throughput: 0: 5806.1. Samples: 668879428. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:51:56,724][25689] Avg episode reward: [(0, '-6.148')] [2022-07-10 08:51:57,864][26022] Updated weights on worker 0-0, policy_version 653203 (0.00086) [2022-07-10 08:51:59,449][26022] Updated weights on worker 0-0, policy_version 653213 (0.00087) [2022-07-10 08:52:01,848][25689] Fps is (10 sec: 5147.2, 60 sec: 5467.5, 300 sec: 5516.2). Total num frames: 668898304. Throughput: 0: 4969.8. Samples: 668895978. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:01,848][25689] Avg episode reward: [(0, '-4.624')] [2022-07-10 08:52:01,897][26022] Updated weights on worker 0-0, policy_version 653223 (0.00088) [2022-07-10 08:52:03,700][26022] Updated weights on worker 0-0, policy_version 653233 (0.00094) [2022-07-10 08:52:05,459][26022] Updated weights on worker 0-0, policy_version 653243 (0.00088) [2022-07-10 08:52:06,861][25689] Fps is (10 sec: 5252.6, 60 sec: 5518.9, 300 sec: 5519.8). Total num frames: 668928000. Throughput: 0: 5693.7. Samples: 668927182. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:06,862][25689] Avg episode reward: [(0, '-3.914')] [2022-07-10 08:52:07,356][26022] Updated weights on worker 0-0, policy_version 653253 (0.00092) [2022-07-10 08:52:09,091][26022] Updated weights on worker 0-0, policy_version 653263 (0.00085) [2022-07-10 08:52:11,103][26022] Updated weights on worker 0-0, policy_version 653273 (0.00094) [2022-07-10 08:52:11,906][25689] Fps is (10 sec: 5701.0, 60 sec: 5498.7, 300 sec: 5519.1). Total num frames: 668955648. Throughput: 0: 5660.5. Samples: 668960368. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:11,912][25689] Avg episode reward: [(0, '-3.872')] [2022-07-10 08:52:13,054][26022] Updated weights on worker 0-0, policy_version 653283 (0.00096) [2022-07-10 08:52:14,682][26022] Updated weights on worker 0-0, policy_version 653293 (0.00093) [2022-07-10 08:52:16,667][26022] Updated weights on worker 0-0, policy_version 653303 (0.00088) [2022-07-10 08:52:16,937][25689] Fps is (10 sec: 5488.0, 60 sec: 5496.5, 300 sec: 5516.8). Total num frames: 668983296. Throughput: 0: 4837.2. Samples: 668977204. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:16,937][25689] Avg episode reward: [(0, '-5.388')] [2022-07-10 08:52:17,891][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:52:17,906][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000653311_668990464.pth [2022-07-10 08:52:17,907][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000651370_667002880.pth [2022-07-10 08:52:18,287][26022] Updated weights on worker 0-0, policy_version 653313 (0.00079) [2022-07-10 08:52:20,258][26022] Updated weights on worker 0-0, policy_version 653323 (0.00086) [2022-07-10 08:52:22,032][25689] Fps is (10 sec: 5562.0, 60 sec: 5512.1, 300 sec: 5522.6). Total num frames: 669011968. Throughput: 0: 5683.2. Samples: 669010690. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:22,033][25689] Avg episode reward: [(0, '-5.834')] [2022-07-10 08:52:22,190][26022] Updated weights on worker 0-0, policy_version 653333 (0.00087) [2022-07-10 08:52:23,837][26022] Updated weights on worker 0-0, policy_version 653343 (0.00092) [2022-07-10 08:52:25,776][26022] Updated weights on worker 0-0, policy_version 653353 (0.00091) [2022-07-10 08:52:27,089][25689] Fps is (10 sec: 5648.5, 60 sec: 5512.5, 300 sec: 5518.3). Total num frames: 669040640. Throughput: 0: 5777.9. Samples: 669044056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:27,089][25689] Avg episode reward: [(0, '-6.054')] [2022-07-10 08:52:27,661][26022] Updated weights on worker 0-0, policy_version 653363 (0.00086) [2022-07-10 08:52:29,382][26022] Updated weights on worker 0-0, policy_version 653373 (0.00093) [2022-07-10 08:52:31,532][26022] Updated weights on worker 0-0, policy_version 653383 (0.00088) [2022-07-10 08:52:32,096][25689] Fps is (10 sec: 5596.1, 60 sec: 5512.6, 300 sec: 5522.0). Total num frames: 669068288. Throughput: 0: 5779.8. Samples: 669077062. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:32,097][25689] Avg episode reward: [(0, '-6.137')] [2022-07-10 08:52:33,191][26022] Updated weights on worker 0-0, policy_version 653393 (0.00094) [2022-07-10 08:52:35,160][26022] Updated weights on worker 0-0, policy_version 653403 (0.00083) [2022-07-10 08:52:36,864][26022] Updated weights on worker 0-0, policy_version 653413 (0.00090) [2022-07-10 08:52:37,127][25689] Fps is (10 sec: 5508.7, 60 sec: 5543.8, 300 sec: 5516.3). Total num frames: 669095936. Throughput: 0: 5773.8. Samples: 669093776. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:37,127][25689] Avg episode reward: [(0, '-6.242')] [2022-07-10 08:52:38,849][26022] Updated weights on worker 0-0, policy_version 653423 (0.00088) [2022-07-10 08:52:40,519][26022] Updated weights on worker 0-0, policy_version 653433 (0.00092) [2022-07-10 08:52:42,197][25689] Fps is (10 sec: 5474.7, 60 sec: 5495.2, 300 sec: 5516.1). Total num frames: 669123584. Throughput: 0: 5780.8. Samples: 669127256. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:42,197][25689] Avg episode reward: [(0, '-6.235')] [2022-07-10 08:52:42,567][26022] Updated weights on worker 0-0, policy_version 653443 (0.00080) [2022-07-10 08:52:44,104][26022] Updated weights on worker 0-0, policy_version 653453 (0.00086) [2022-07-10 08:52:46,115][26022] Updated weights on worker 0-0, policy_version 653463 (0.00086) [2022-07-10 08:52:47,201][25689] Fps is (10 sec: 5590.4, 60 sec: 5531.7, 300 sec: 5516.1). Total num frames: 669152256. Throughput: 0: 5809.9. Samples: 669160906. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:47,202][25689] Avg episode reward: [(0, '-5.868')] [2022-07-10 08:52:47,797][26022] Updated weights on worker 0-0, policy_version 653473 (0.00087) [2022-07-10 08:52:49,720][26022] Updated weights on worker 0-0, policy_version 653483 (0.00086) [2022-07-10 08:52:51,547][26022] Updated weights on worker 0-0, policy_version 653493 (0.00466) [2022-07-10 08:52:52,214][25689] Fps is (10 sec: 5622.5, 60 sec: 5516.7, 300 sec: 5523.3). Total num frames: 669179904. Throughput: 0: 5014.9. Samples: 669177948. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:52,214][25689] Avg episode reward: [(0, '-4.826')] [2022-07-10 08:52:53,283][26022] Updated weights on worker 0-0, policy_version 653503 (0.00093) [2022-07-10 08:52:55,172][26022] Updated weights on worker 0-0, policy_version 653513 (0.00094) [2022-07-10 08:52:57,008][26022] Updated weights on worker 0-0, policy_version 653523 (0.00097) [2022-07-10 08:52:57,227][25689] Fps is (10 sec: 5617.6, 60 sec: 5517.5, 300 sec: 5524.9). Total num frames: 669208576. Throughput: 0: 5872.2. Samples: 669211808. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:52:57,227][25689] Avg episode reward: [(0, '-5.796')] [2022-07-10 08:52:58,715][26022] Updated weights on worker 0-0, policy_version 653533 (0.00094) [2022-07-10 08:53:00,575][26022] Updated weights on worker 0-0, policy_version 653543 (0.00098) [2022-07-10 08:53:02,307][25689] Fps is (10 sec: 5478.5, 60 sec: 5572.3, 300 sec: 5520.9). Total num frames: 669235200. Throughput: 0: 5814.0. Samples: 669244176. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:53:02,307][25689] Avg episode reward: [(0, '-5.282')] [2022-07-10 08:53:03,067][26022] Updated weights on worker 0-0, policy_version 653553 (0.00101) [2022-07-10 08:53:04,748][26022] Updated weights on worker 0-0, policy_version 653563 (0.00088) [2022-07-10 08:53:06,743][26022] Updated weights on worker 0-0, policy_version 653573 (0.00093) [2022-07-10 08:53:07,313][25689] Fps is (10 sec: 5279.1, 60 sec: 5522.1, 300 sec: 5522.0). Total num frames: 669261824. Throughput: 0: 4912.7. Samples: 669259710. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:53:07,314][25689] Avg episode reward: [(0, '-5.816')] [2022-07-10 08:53:08,339][26022] Updated weights on worker 0-0, policy_version 653583 (0.00091) [2022-07-10 08:53:10,315][26022] Updated weights on worker 0-0, policy_version 653593 (0.00081) [2022-07-10 08:53:12,120][26022] Updated weights on worker 0-0, policy_version 653603 (0.00087) [2022-07-10 08:53:12,326][25689] Fps is (10 sec: 5416.7, 60 sec: 5525.1, 300 sec: 5515.7). Total num frames: 669289472. Throughput: 0: 5731.4. Samples: 669293222. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:53:12,327][25689] Avg episode reward: [(0, '-5.786')] [2022-07-10 08:53:13,773][26022] Updated weights on worker 0-0, policy_version 653613 (0.00088) [2022-07-10 08:53:15,940][26022] Updated weights on worker 0-0, policy_version 653623 (0.00086) [2022-07-10 08:53:17,346][25689] Fps is (10 sec: 5715.9, 60 sec: 5560.0, 300 sec: 5523.5). Total num frames: 669319168. Throughput: 0: 5719.1. Samples: 669326870. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:53:17,346][25689] Avg episode reward: [(0, '-6.155')] [2022-07-10 08:53:17,490][26022] Updated weights on worker 0-0, policy_version 653633 (0.00091) [2022-07-10 08:53:19,450][26022] Updated weights on worker 0-0, policy_version 653643 (0.00086) [2022-07-10 08:53:21,411][26022] Updated weights on worker 0-0, policy_version 653653 (0.00090) [2022-07-10 08:53:22,420][25689] Fps is (10 sec: 5579.5, 60 sec: 5528.0, 300 sec: 5515.9). Total num frames: 669345792. Throughput: 0: 4943.0. Samples: 669343596. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 08:53:22,421][25689] Avg episode reward: [(0, '-6.634')] [2022-07-10 08:53:23,123][26022] Updated weights on worker 0-0, policy_version 653663 (0.00091) [2022-07-10 08:53:24,918][26022] Updated weights on worker 0-0, policy_version 653673 (0.00085) [2022-07-10 08:53:26,909][26022] Updated weights on worker 0-0, policy_version 653683 (0.01034) [2022-07-10 08:53:27,460][25689] Fps is (10 sec: 5466.9, 60 sec: 5529.5, 300 sec: 5520.1). Total num frames: 669374464. Throughput: 0: 5807.9. Samples: 669376722. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:27,461][25689] Avg episode reward: [(0, '-7.122')] [2022-07-10 08:53:28,803][26022] Updated weights on worker 0-0, policy_version 653693 (0.00088) [2022-07-10 08:53:30,575][26022] Updated weights on worker 0-0, policy_version 653703 (0.00089) [2022-07-10 08:53:32,282][26022] Updated weights on worker 0-0, policy_version 653713 (0.00086) [2022-07-10 08:53:32,500][25689] Fps is (10 sec: 5688.9, 60 sec: 5543.5, 300 sec: 5523.0). Total num frames: 669403136. Throughput: 0: 5793.0. Samples: 669410090. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:32,501][25689] Avg episode reward: [(0, '-8.126')] [2022-07-10 08:53:34,323][26022] Updated weights on worker 0-0, policy_version 653723 (0.00084) [2022-07-10 08:53:36,006][26022] Updated weights on worker 0-0, policy_version 653733 (0.00079) [2022-07-10 08:53:37,501][25689] Fps is (10 sec: 5507.1, 60 sec: 5529.2, 300 sec: 5517.4). Total num frames: 669429760. Throughput: 0: 4962.9. Samples: 669426900. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:37,502][25689] Avg episode reward: [(0, '-8.797')] [2022-07-10 08:53:37,877][26022] Updated weights on worker 0-0, policy_version 653743 (0.00093) [2022-07-10 08:53:39,549][26022] Updated weights on worker 0-0, policy_version 653753 (0.00085) [2022-07-10 08:53:41,490][26022] Updated weights on worker 0-0, policy_version 653763 (0.00088) [2022-07-10 08:53:42,627][25689] Fps is (10 sec: 5561.6, 60 sec: 5558.0, 300 sec: 5524.0). Total num frames: 669459456. Throughput: 0: 5795.5. Samples: 669460702. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:42,627][25689] Avg episode reward: [(0, '-7.137')] [2022-07-10 08:53:43,167][26022] Updated weights on worker 0-0, policy_version 653773 (0.00085) [2022-07-10 08:53:45,190][26022] Updated weights on worker 0-0, policy_version 653783 (0.00095) [2022-07-10 08:53:47,141][26022] Updated weights on worker 0-0, policy_version 653793 (0.00088) [2022-07-10 08:53:47,632][25689] Fps is (10 sec: 5559.5, 60 sec: 5524.1, 300 sec: 5517.4). Total num frames: 669486080. Throughput: 0: 5831.6. Samples: 669494352. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:47,632][25689] Avg episode reward: [(0, '-7.900')] [2022-07-10 08:53:48,723][26022] Updated weights on worker 0-0, policy_version 653803 (0.00088) [2022-07-10 08:53:50,799][26022] Updated weights on worker 0-0, policy_version 653813 (0.00092) [2022-07-10 08:53:52,413][26022] Updated weights on worker 0-0, policy_version 653823 (0.00081) [2022-07-10 08:53:52,646][25689] Fps is (10 sec: 5621.3, 60 sec: 5557.8, 300 sec: 5528.1). Total num frames: 669515776. Throughput: 0: 5029.8. Samples: 669511418. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:52,646][25689] Avg episode reward: [(0, '-8.791')] [2022-07-10 08:53:54,456][26022] Updated weights on worker 0-0, policy_version 653833 (0.00092) [2022-07-10 08:53:56,151][26022] Updated weights on worker 0-0, policy_version 653843 (0.00098) [2022-07-10 08:53:57,663][25689] Fps is (10 sec: 5716.7, 60 sec: 5540.5, 300 sec: 5526.1). Total num frames: 669543424. Throughput: 0: 5831.3. Samples: 669544468. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:53:57,663][25689] Avg episode reward: [(0, '-7.282')] [2022-07-10 08:53:58,042][26022] Updated weights on worker 0-0, policy_version 653853 (0.00085) [2022-07-10 08:53:59,870][26022] Updated weights on worker 0-0, policy_version 653863 (0.00095) [2022-07-10 08:54:01,728][26022] Updated weights on worker 0-0, policy_version 653873 (0.00088) [2022-07-10 08:54:02,719][25689] Fps is (10 sec: 5082.9, 60 sec: 5491.9, 300 sec: 5518.2). Total num frames: 669566976. Throughput: 0: 5743.2. Samples: 669576094. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:02,719][25689] Avg episode reward: [(0, '-7.072')] [2022-07-10 08:54:03,866][26022] Updated weights on worker 0-0, policy_version 653883 (0.00082) [2022-07-10 08:54:05,994][26022] Updated weights on worker 0-0, policy_version 653893 (0.00094) [2022-07-10 08:54:07,587][26022] Updated weights on worker 0-0, policy_version 653903 (0.00080) [2022-07-10 08:54:07,725][25689] Fps is (10 sec: 5393.6, 60 sec: 5559.7, 300 sec: 5525.4). Total num frames: 669597696. Throughput: 0: 4874.5. Samples: 669592296. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:07,726][25689] Avg episode reward: [(0, '-6.994')] [2022-07-10 08:54:09,443][26022] Updated weights on worker 0-0, policy_version 653913 (0.00094) [2022-07-10 08:54:11,154][26022] Updated weights on worker 0-0, policy_version 653923 (0.00092) [2022-07-10 08:54:12,777][25689] Fps is (10 sec: 5803.2, 60 sec: 5556.2, 300 sec: 5521.2). Total num frames: 669625344. Throughput: 0: 5687.0. Samples: 669625902. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:12,777][25689] Avg episode reward: [(0, '-7.641')] [2022-07-10 08:54:13,133][26022] Updated weights on worker 0-0, policy_version 653933 (0.00090) [2022-07-10 08:54:15,019][26022] Updated weights on worker 0-0, policy_version 653943 (0.00092) [2022-07-10 08:54:16,831][26022] Updated weights on worker 0-0, policy_version 653953 (0.00087) [2022-07-10 08:54:17,790][25689] Fps is (10 sec: 5391.9, 60 sec: 5505.8, 300 sec: 5525.1). Total num frames: 669651968. Throughput: 0: 5733.3. Samples: 669659866. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:17,791][25689] Avg episode reward: [(0, '-6.410')] [2022-07-10 08:54:17,914][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:54:17,924][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000653959_669654016.pth [2022-07-10 08:54:17,927][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000652017_667665408.pth [2022-07-10 08:54:18,490][26022] Updated weights on worker 0-0, policy_version 653963 (0.00094) [2022-07-10 08:54:20,663][26022] Updated weights on worker 0-0, policy_version 653973 (0.00096) [2022-07-10 08:54:22,162][26022] Updated weights on worker 0-0, policy_version 653983 (0.00085) [2022-07-10 08:54:22,932][25689] Fps is (10 sec: 5646.6, 60 sec: 5567.4, 300 sec: 5526.6). Total num frames: 669682688. Throughput: 0: 4976.7. Samples: 669676690. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:22,932][25689] Avg episode reward: [(0, '-5.399')] [2022-07-10 08:54:24,333][26022] Updated weights on worker 0-0, policy_version 653993 (0.00086) [2022-07-10 08:54:25,886][26022] Updated weights on worker 0-0, policy_version 654003 (0.00092) [2022-07-10 08:54:27,945][25689] Fps is (10 sec: 5546.4, 60 sec: 5519.2, 300 sec: 5523.3). Total num frames: 669708288. Throughput: 0: 5827.8. Samples: 669710134. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:27,945][25689] Avg episode reward: [(0, '-5.005')] [2022-07-10 08:54:28,062][26022] Updated weights on worker 0-0, policy_version 654013 (0.00091) [2022-07-10 08:54:29,664][26022] Updated weights on worker 0-0, policy_version 654023 (0.00091) [2022-07-10 08:54:31,542][26022] Updated weights on worker 0-0, policy_version 654033 (0.00091) [2022-07-10 08:54:32,966][25689] Fps is (10 sec: 5510.6, 60 sec: 5537.8, 300 sec: 5523.5). Total num frames: 669737984. Throughput: 0: 5831.2. Samples: 669743634. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:32,967][25689] Avg episode reward: [(0, '-6.141')] [2022-07-10 08:54:33,410][26022] Updated weights on worker 0-0, policy_version 654043 (0.00093) [2022-07-10 08:54:35,110][26022] Updated weights on worker 0-0, policy_version 654053 (0.00097) [2022-07-10 08:54:37,089][26022] Updated weights on worker 0-0, policy_version 654063 (0.00450) [2022-07-10 08:54:37,985][25689] Fps is (10 sec: 5711.2, 60 sec: 5553.1, 300 sec: 5529.0). Total num frames: 669765632. Throughput: 0: 4973.2. Samples: 669760300. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:37,986][25689] Avg episode reward: [(0, '-5.168')] [2022-07-10 08:54:38,812][26022] Updated weights on worker 0-0, policy_version 654073 (0.00094) [2022-07-10 08:54:40,787][26022] Updated weights on worker 0-0, policy_version 654083 (0.00088) [2022-07-10 08:54:42,445][26022] Updated weights on worker 0-0, policy_version 654093 (0.00085) [2022-07-10 08:54:43,043][25689] Fps is (10 sec: 5487.7, 60 sec: 5525.4, 300 sec: 5528.7). Total num frames: 669793280. Throughput: 0: 5824.9. Samples: 669793834. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:43,043][25689] Avg episode reward: [(0, '-5.117')] [2022-07-10 08:54:44,340][26022] Updated weights on worker 0-0, policy_version 654103 (0.00095) [2022-07-10 08:54:46,387][26022] Updated weights on worker 0-0, policy_version 654113 (0.00087) [2022-07-10 08:54:47,956][26022] Updated weights on worker 0-0, policy_version 654123 (0.00093) [2022-07-10 08:54:48,075][25689] Fps is (10 sec: 5683.8, 60 sec: 5573.8, 300 sec: 5525.3). Total num frames: 669822976. Throughput: 0: 5815.6. Samples: 669827200. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:48,075][25689] Avg episode reward: [(0, '-6.682')] [2022-07-10 08:54:50,093][26022] Updated weights on worker 0-0, policy_version 654133 (0.00064) [2022-07-10 08:54:51,650][26022] Updated weights on worker 0-0, policy_version 654143 (0.00079) [2022-07-10 08:54:53,082][25689] Fps is (10 sec: 5609.8, 60 sec: 5523.5, 300 sec: 5528.8). Total num frames: 669849600. Throughput: 0: 4989.1. Samples: 669843994. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:53,083][25689] Avg episode reward: [(0, '-6.499')] [2022-07-10 08:54:53,690][26022] Updated weights on worker 0-0, policy_version 654153 (0.00091) [2022-07-10 08:54:55,178][26022] Updated weights on worker 0-0, policy_version 654163 (0.00085) [2022-07-10 08:54:57,312][26022] Updated weights on worker 0-0, policy_version 654173 (0.00083) [2022-07-10 08:54:58,089][25689] Fps is (10 sec: 5521.7, 60 sec: 5541.4, 300 sec: 5533.5). Total num frames: 669878272. Throughput: 0: 5834.2. Samples: 669877586. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:54:58,089][25689] Avg episode reward: [(0, '-5.118')] [2022-07-10 08:54:58,866][26022] Updated weights on worker 0-0, policy_version 654183 (0.00082) [2022-07-10 08:55:00,962][26022] Updated weights on worker 0-0, policy_version 654193 (0.00089) [2022-07-10 08:55:03,091][26022] Updated weights on worker 0-0, policy_version 654203 (0.00080) [2022-07-10 08:55:03,221][25689] Fps is (10 sec: 5353.1, 60 sec: 5568.3, 300 sec: 5524.9). Total num frames: 669903872. Throughput: 0: 5703.2. Samples: 669908912. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:03,221][25689] Avg episode reward: [(0, '-4.594')] [2022-07-10 08:55:05,139][26022] Updated weights on worker 0-0, policy_version 654213 (0.00087) [2022-07-10 08:55:06,835][26022] Updated weights on worker 0-0, policy_version 654223 (0.00090) [2022-07-10 08:55:08,249][25689] Fps is (10 sec: 5241.1, 60 sec: 5515.6, 300 sec: 5524.8). Total num frames: 669931520. Throughput: 0: 4860.9. Samples: 669925264. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:08,250][25689] Avg episode reward: [(0, '-5.444')] [2022-07-10 08:55:08,684][26022] Updated weights on worker 0-0, policy_version 654233 (0.00084) [2022-07-10 08:55:10,484][26022] Updated weights on worker 0-0, policy_version 654243 (0.00093) [2022-07-10 08:55:12,451][26022] Updated weights on worker 0-0, policy_version 654253 (0.00087) [2022-07-10 08:55:13,275][25689] Fps is (10 sec: 5499.9, 60 sec: 5517.9, 300 sec: 5528.0). Total num frames: 669959168. Throughput: 0: 5671.7. Samples: 669958518. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:13,277][25689] Avg episode reward: [(0, '-6.851')] [2022-07-10 08:55:14,205][26022] Updated weights on worker 0-0, policy_version 654263 (0.00089) [2022-07-10 08:55:16,043][26022] Updated weights on worker 0-0, policy_version 654273 (0.00083) [2022-07-10 08:55:18,004][26022] Updated weights on worker 0-0, policy_version 654283 (0.00091) [2022-07-10 08:55:18,284][25689] Fps is (10 sec: 5612.2, 60 sec: 5552.2, 300 sec: 5528.8). Total num frames: 669987840. Throughput: 0: 5673.0. Samples: 669992152. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:18,284][25689] Avg episode reward: [(0, '-5.736')] [2022-07-10 08:55:19,849][26022] Updated weights on worker 0-0, policy_version 654293 (0.00090) [2022-07-10 08:55:21,619][26022] Updated weights on worker 0-0, policy_version 654303 (0.00094) [2022-07-10 08:55:23,324][25689] Fps is (10 sec: 5604.2, 60 sec: 5510.6, 300 sec: 5524.7). Total num frames: 670015488. Throughput: 0: 4969.1. Samples: 670008810. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:23,325][25689] Avg episode reward: [(0, '-4.604')] [2022-07-10 08:55:23,381][26022] Updated weights on worker 0-0, policy_version 654313 (0.00081) [2022-07-10 08:55:25,236][26022] Updated weights on worker 0-0, policy_version 654323 (0.00089) [2022-07-10 08:55:27,151][26022] Updated weights on worker 0-0, policy_version 654333 (0.00086) [2022-07-10 08:55:28,331][25689] Fps is (10 sec: 5401.9, 60 sec: 5528.2, 300 sec: 5524.7). Total num frames: 670042112. Throughput: 0: 5808.0. Samples: 670041902. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:28,331][25689] Avg episode reward: [(0, '-4.513')] [2022-07-10 08:55:29,066][26022] Updated weights on worker 0-0, policy_version 654343 (0.00092) [2022-07-10 08:55:30,780][26022] Updated weights on worker 0-0, policy_version 654353 (0.00098) [2022-07-10 08:55:32,696][26022] Updated weights on worker 0-0, policy_version 654363 (0.00084) [2022-07-10 08:55:33,351][25689] Fps is (10 sec: 5617.2, 60 sec: 5528.3, 300 sec: 5527.9). Total num frames: 670071808. Throughput: 0: 5816.8. Samples: 670075296. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:33,351][25689] Avg episode reward: [(0, '-4.424')] [2022-07-10 08:55:34,518][26022] Updated weights on worker 0-0, policy_version 654373 (0.00098) [2022-07-10 08:55:36,402][26022] Updated weights on worker 0-0, policy_version 654383 (0.00091) [2022-07-10 08:55:38,289][26022] Updated weights on worker 0-0, policy_version 654393 (0.00084) [2022-07-10 08:55:38,361][25689] Fps is (10 sec: 5615.2, 60 sec: 5512.2, 300 sec: 5525.0). Total num frames: 670098432. Throughput: 0: 4966.7. Samples: 670091868. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:38,361][25689] Avg episode reward: [(0, '-2.099')] [2022-07-10 08:55:39,975][26022] Updated weights on worker 0-0, policy_version 654403 (0.00088) [2022-07-10 08:55:42,022][26022] Updated weights on worker 0-0, policy_version 654413 (0.00081) [2022-07-10 08:55:43,474][25689] Fps is (10 sec: 5563.3, 60 sec: 5541.0, 300 sec: 5533.4). Total num frames: 670128128. Throughput: 0: 5778.8. Samples: 670125252. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:43,475][25689] Avg episode reward: [(0, '-1.838')] [2022-07-10 08:55:43,581][26022] Updated weights on worker 0-0, policy_version 654423 (0.00093) [2022-07-10 08:55:45,643][26022] Updated weights on worker 0-0, policy_version 654433 (0.00091) [2022-07-10 08:55:47,620][26022] Updated weights on worker 0-0, policy_version 654443 (0.00085) [2022-07-10 08:55:48,555][25689] Fps is (10 sec: 5524.5, 60 sec: 5485.7, 300 sec: 5525.4). Total num frames: 670154752. Throughput: 0: 5776.4. Samples: 670158726. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:48,556][25689] Avg episode reward: [(0, '-1.797')] [2022-07-10 08:55:49,452][26022] Updated weights on worker 0-0, policy_version 654453 (0.00091) [2022-07-10 08:55:51,042][26022] Updated weights on worker 0-0, policy_version 654463 (0.00087) [2022-07-10 08:55:53,076][26022] Updated weights on worker 0-0, policy_version 654473 (0.00084) [2022-07-10 08:55:53,604][25689] Fps is (10 sec: 5458.9, 60 sec: 5515.8, 300 sec: 5528.0). Total num frames: 670183424. Throughput: 0: 5762.6. Samples: 670192006. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:53,604][25689] Avg episode reward: [(0, '-3.122')] [2022-07-10 08:55:55,031][26022] Updated weights on worker 0-0, policy_version 654483 (0.00085) [2022-07-10 08:55:56,739][26022] Updated weights on worker 0-0, policy_version 654493 (0.00091) [2022-07-10 08:55:58,568][26022] Updated weights on worker 0-0, policy_version 654503 (0.00094) [2022-07-10 08:55:58,662][25689] Fps is (10 sec: 5572.4, 60 sec: 5494.2, 300 sec: 5528.0). Total num frames: 670211072. Throughput: 0: 5756.1. Samples: 670208726. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:55:58,663][25689] Avg episode reward: [(0, '-3.157')] [2022-07-10 08:56:00,258][26022] Updated weights on worker 0-0, policy_version 654513 (0.00088) [2022-07-10 08:56:02,767][26022] Updated weights on worker 0-0, policy_version 654523 (0.00087) [2022-07-10 08:56:03,706][25689] Fps is (10 sec: 5372.4, 60 sec: 5519.1, 300 sec: 5527.6). Total num frames: 670237696. Throughput: 0: 5664.4. Samples: 670239852. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:03,706][25689] Avg episode reward: [(0, '-4.839')] [2022-07-10 08:56:04,569][26022] Updated weights on worker 0-0, policy_version 654533 (0.00087) [2022-07-10 08:56:06,263][26022] Updated weights on worker 0-0, policy_version 654543 (0.00094) [2022-07-10 08:56:08,471][26022] Updated weights on worker 0-0, policy_version 654553 (0.00089) [2022-07-10 08:56:08,711][25689] Fps is (10 sec: 5298.7, 60 sec: 5504.2, 300 sec: 5520.7). Total num frames: 670264320. Throughput: 0: 5677.0. Samples: 670273154. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:08,712][25689] Avg episode reward: [(0, '-5.973')] [2022-07-10 08:56:09,959][26022] Updated weights on worker 0-0, policy_version 654563 (0.00086) [2022-07-10 08:56:12,003][26022] Updated weights on worker 0-0, policy_version 654573 (0.00088) [2022-07-10 08:56:13,599][26022] Updated weights on worker 0-0, policy_version 654583 (0.00091) [2022-07-10 08:56:13,724][25689] Fps is (10 sec: 5621.6, 60 sec: 5539.3, 300 sec: 5527.5). Total num frames: 670294016. Throughput: 0: 4854.2. Samples: 670289676. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:13,725][25689] Avg episode reward: [(0, '-7.147')] [2022-07-10 08:56:15,723][26022] Updated weights on worker 0-0, policy_version 654593 (0.00107) [2022-07-10 08:56:17,456][26022] Updated weights on worker 0-0, policy_version 654603 (0.00093) [2022-07-10 08:56:17,966][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:56:17,994][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000654605_670315520.pth [2022-07-10 08:56:17,995][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000652664_668327936.pth [2022-07-10 08:56:18,751][25689] Fps is (10 sec: 5610.0, 60 sec: 5503.8, 300 sec: 5525.1). Total num frames: 670320640. Throughput: 0: 5675.4. Samples: 670322736. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:18,751][25689] Avg episode reward: [(0, '-8.397')] [2022-07-10 08:56:19,583][26022] Updated weights on worker 0-0, policy_version 654613 (0.00096) [2022-07-10 08:56:21,248][26022] Updated weights on worker 0-0, policy_version 654623 (0.00090) [2022-07-10 08:56:23,236][26022] Updated weights on worker 0-0, policy_version 654633 (0.00096) [2022-07-10 08:56:23,860][25689] Fps is (10 sec: 5253.5, 60 sec: 5480.7, 300 sec: 5517.3). Total num frames: 670347264. Throughput: 0: 5758.4. Samples: 670355908. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:23,861][25689] Avg episode reward: [(0, '-7.295')] [2022-07-10 08:56:24,930][26022] Updated weights on worker 0-0, policy_version 654643 (0.00089) [2022-07-10 08:56:26,741][26022] Updated weights on worker 0-0, policy_version 654653 (0.00089) [2022-07-10 08:56:28,595][26022] Updated weights on worker 0-0, policy_version 654663 (0.00097) [2022-07-10 08:56:28,945][25689] Fps is (10 sec: 5424.1, 60 sec: 5507.3, 300 sec: 5519.3). Total num frames: 670375936. Throughput: 0: 4909.7. Samples: 670372496. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:28,946][25689] Avg episode reward: [(0, '-6.693')] [2022-07-10 08:56:30,398][26022] Updated weights on worker 0-0, policy_version 654673 (0.00080) [2022-07-10 08:56:32,329][26022] Updated weights on worker 0-0, policy_version 654683 (0.00084) [2022-07-10 08:56:33,998][25689] Fps is (10 sec: 5656.2, 60 sec: 5487.4, 300 sec: 5528.7). Total num frames: 670404608. Throughput: 0: 5725.2. Samples: 670405748. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:33,999][25689] Avg episode reward: [(0, '-4.033')] [2022-07-10 08:56:34,055][26022] Updated weights on worker 0-0, policy_version 654693 (0.00613) [2022-07-10 08:56:36,113][26022] Updated weights on worker 0-0, policy_version 654703 (0.00097) [2022-07-10 08:56:37,973][26022] Updated weights on worker 0-0, policy_version 654713 (0.00082) [2022-07-10 08:56:39,025][25689] Fps is (10 sec: 5587.2, 60 sec: 5502.8, 300 sec: 5519.6). Total num frames: 670432256. Throughput: 0: 5733.4. Samples: 670438980. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-10 08:56:39,026][25689] Avg episode reward: [(0, '-7.380')] [2022-07-10 08:56:39,731][26022] Updated weights on worker 0-0, policy_version 654723 (0.00110) [2022-07-10 08:56:41,758][26022] Updated weights on worker 0-0, policy_version 654733 (0.00087) [2022-07-10 08:56:43,312][26022] Updated weights on worker 0-0, policy_version 654743 (0.00085) [2022-07-10 08:56:44,086][25689] Fps is (10 sec: 5481.7, 60 sec: 5473.8, 300 sec: 5522.5). Total num frames: 670459904. Throughput: 0: 4919.9. Samples: 670455416. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:56:44,089][25689] Avg episode reward: [(0, '-7.158')] [2022-07-10 08:56:45,339][26022] Updated weights on worker 0-0, policy_version 654753 (0.00087) [2022-07-10 08:56:47,278][26022] Updated weights on worker 0-0, policy_version 654763 (0.00086) [2022-07-10 08:56:48,980][26022] Updated weights on worker 0-0, policy_version 654773 (0.00084) [2022-07-10 08:56:49,147][25689] Fps is (10 sec: 5463.2, 60 sec: 5492.5, 300 sec: 5518.6). Total num frames: 670487552. Throughput: 0: 5748.6. Samples: 670488628. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:56:49,147][25689] Avg episode reward: [(0, '-6.733')] [2022-07-10 08:56:50,935][26022] Updated weights on worker 0-0, policy_version 654783 (0.00089) [2022-07-10 08:56:52,888][26022] Updated weights on worker 0-0, policy_version 654793 (0.00087) [2022-07-10 08:56:54,197][25689] Fps is (10 sec: 5569.8, 60 sec: 5492.3, 300 sec: 5518.0). Total num frames: 670516224. Throughput: 0: 5750.1. Samples: 670521894. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:56:54,198][25689] Avg episode reward: [(0, '-7.553')] [2022-07-10 08:56:54,756][26022] Updated weights on worker 0-0, policy_version 654803 (0.00086) [2022-07-10 08:56:56,653][26022] Updated weights on worker 0-0, policy_version 654813 (0.01039) [2022-07-10 08:56:58,287][26022] Updated weights on worker 0-0, policy_version 654823 (0.00085) [2022-07-10 08:56:59,257][25689] Fps is (10 sec: 5570.5, 60 sec: 5492.2, 300 sec: 5533.0). Total num frames: 670543872. Throughput: 0: 4921.1. Samples: 670538544. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:56:59,258][25689] Avg episode reward: [(0, '-7.071')] [2022-07-10 08:57:00,320][26022] Updated weights on worker 0-0, policy_version 654833 (0.00089) [2022-07-10 08:57:01,849][26022] Updated weights on worker 0-0, policy_version 654843 (0.00087) [2022-07-10 08:57:04,260][26022] Updated weights on worker 0-0, policy_version 654853 (0.00081) [2022-07-10 08:57:04,359][25689] Fps is (10 sec: 5240.0, 60 sec: 5470.0, 300 sec: 5517.6). Total num frames: 670569472. Throughput: 0: 5660.2. Samples: 670570168. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:04,360][25689] Avg episode reward: [(0, '-8.778')] [2022-07-10 08:57:05,971][26022] Updated weights on worker 0-0, policy_version 654863 (0.00090) [2022-07-10 08:57:08,021][26022] Updated weights on worker 0-0, policy_version 654873 (0.00087) [2022-07-10 08:57:09,397][25689] Fps is (10 sec: 5352.3, 60 sec: 5500.8, 300 sec: 5521.2). Total num frames: 670598144. Throughput: 0: 5647.7. Samples: 670602996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:09,398][25689] Avg episode reward: [(0, '-4.685')] [2022-07-10 08:57:09,771][26022] Updated weights on worker 0-0, policy_version 654883 (0.00087) [2022-07-10 08:57:11,701][26022] Updated weights on worker 0-0, policy_version 654893 (0.00084) [2022-07-10 08:57:13,414][26022] Updated weights on worker 0-0, policy_version 654903 (0.00091) [2022-07-10 08:57:14,444][25689] Fps is (10 sec: 5584.7, 60 sec: 5464.1, 300 sec: 5520.9). Total num frames: 670625792. Throughput: 0: 4832.4. Samples: 670619734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:14,445][25689] Avg episode reward: [(0, '-5.316')] [2022-07-10 08:57:15,426][26022] Updated weights on worker 0-0, policy_version 654913 (0.00086) [2022-07-10 08:57:17,050][26022] Updated weights on worker 0-0, policy_version 654923 (0.00091) [2022-07-10 08:57:19,011][26022] Updated weights on worker 0-0, policy_version 654933 (0.00095) [2022-07-10 08:57:19,454][25689] Fps is (10 sec: 5600.6, 60 sec: 5499.3, 300 sec: 5522.5). Total num frames: 670654464. Throughput: 0: 5679.7. Samples: 670653252. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:19,454][25689] Avg episode reward: [(0, '-5.366')] [2022-07-10 08:57:20,767][26022] Updated weights on worker 0-0, policy_version 654943 (0.00113) [2022-07-10 08:57:22,701][26022] Updated weights on worker 0-0, policy_version 654953 (0.00084) [2022-07-10 08:57:24,503][26022] Updated weights on worker 0-0, policy_version 654963 (0.00087) [2022-07-10 08:57:24,597][25689] Fps is (10 sec: 5547.2, 60 sec: 5513.1, 300 sec: 5517.4). Total num frames: 670682112. Throughput: 0: 5763.6. Samples: 670686810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:24,597][25689] Avg episode reward: [(0, '-4.714')] [2022-07-10 08:57:26,586][26022] Updated weights on worker 0-0, policy_version 654973 (0.00086) [2022-07-10 08:57:27,955][26022] Updated weights on worker 0-0, policy_version 654983 (0.00082) [2022-07-10 08:57:29,609][25689] Fps is (10 sec: 5344.2, 60 sec: 5486.0, 300 sec: 5513.9). Total num frames: 670708736. Throughput: 0: 4970.2. Samples: 670703452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:29,609][25689] Avg episode reward: [(0, '-6.257')] [2022-07-10 08:57:30,269][26022] Updated weights on worker 0-0, policy_version 654993 (0.00087) [2022-07-10 08:57:31,547][26022] Updated weights on worker 0-0, policy_version 655003 (0.00083) [2022-07-10 08:57:33,811][26022] Updated weights on worker 0-0, policy_version 655013 (0.00097) [2022-07-10 08:57:34,617][25689] Fps is (10 sec: 5722.6, 60 sec: 5523.8, 300 sec: 5524.6). Total num frames: 670739456. Throughput: 0: 5796.9. Samples: 670736678. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:34,618][25689] Avg episode reward: [(0, '-6.689')] [2022-07-10 08:57:35,651][26022] Updated weights on worker 0-0, policy_version 655023 (0.00087) [2022-07-10 08:57:37,371][26022] Updated weights on worker 0-0, policy_version 655033 (0.00492) [2022-07-10 08:57:39,341][26022] Updated weights on worker 0-0, policy_version 655043 (0.00112) [2022-07-10 08:57:39,626][25689] Fps is (10 sec: 5724.7, 60 sec: 5508.6, 300 sec: 5522.3). Total num frames: 670766080. Throughput: 0: 5770.6. Samples: 670769658. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:39,627][25689] Avg episode reward: [(0, '-7.197')] [2022-07-10 08:57:41,235][26022] Updated weights on worker 0-0, policy_version 655053 (0.00090) [2022-07-10 08:57:43,030][26022] Updated weights on worker 0-0, policy_version 655063 (0.00092) [2022-07-10 08:57:44,721][25689] Fps is (10 sec: 5371.7, 60 sec: 5505.5, 300 sec: 5517.2). Total num frames: 670793728. Throughput: 0: 5767.1. Samples: 670802866. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:44,721][25689] Avg episode reward: [(0, '-7.693')] [2022-07-10 08:57:44,971][26022] Updated weights on worker 0-0, policy_version 655073 (0.00109) [2022-07-10 08:57:46,733][26022] Updated weights on worker 0-0, policy_version 655083 (0.00093) [2022-07-10 08:57:48,619][26022] Updated weights on worker 0-0, policy_version 655093 (0.00085) [2022-07-10 08:57:49,735][25689] Fps is (10 sec: 5470.0, 60 sec: 5509.7, 300 sec: 5517.2). Total num frames: 670821376. Throughput: 0: 5772.9. Samples: 670819638. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:49,735][25689] Avg episode reward: [(0, '-6.022')] [2022-07-10 08:57:50,347][26022] Updated weights on worker 0-0, policy_version 655103 (0.00086) [2022-07-10 08:57:52,138][26022] Updated weights on worker 0-0, policy_version 655113 (0.00089) [2022-07-10 08:57:54,005][26022] Updated weights on worker 0-0, policy_version 655123 (0.00091) [2022-07-10 08:57:54,750][25689] Fps is (10 sec: 5513.7, 60 sec: 5496.1, 300 sec: 5513.7). Total num frames: 670849024. Throughput: 0: 5791.1. Samples: 670853266. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:54,750][25689] Avg episode reward: [(0, '-4.531')] [2022-07-10 08:57:55,827][26022] Updated weights on worker 0-0, policy_version 655133 (0.00089) [2022-07-10 08:57:57,907][26022] Updated weights on worker 0-0, policy_version 655143 (0.00090) [2022-07-10 08:57:59,437][26022] Updated weights on worker 0-0, policy_version 655153 (0.00082) [2022-07-10 08:57:59,785][25689] Fps is (10 sec: 5706.0, 60 sec: 5532.2, 300 sec: 5524.8). Total num frames: 670878720. Throughput: 0: 5808.7. Samples: 670886754. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:57:59,785][25689] Avg episode reward: [(0, '-3.921')] [2022-07-10 08:58:01,421][26022] Updated weights on worker 0-0, policy_version 655163 (0.00089) [2022-07-10 08:58:03,651][26022] Updated weights on worker 0-0, policy_version 655173 (0.00096) [2022-07-10 08:58:04,909][25689] Fps is (10 sec: 5342.3, 60 sec: 5513.3, 300 sec: 5515.8). Total num frames: 670903296. Throughput: 0: 4884.8. Samples: 670901480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:04,911][25689] Avg episode reward: [(0, '-3.748')] [2022-07-10 08:58:05,487][26022] Updated weights on worker 0-0, policy_version 655183 (0.00095) [2022-07-10 08:58:07,370][26022] Updated weights on worker 0-0, policy_version 655193 (0.00086) [2022-07-10 08:58:09,059][26022] Updated weights on worker 0-0, policy_version 655203 (0.00095) [2022-07-10 08:58:09,911][25689] Fps is (10 sec: 5258.3, 60 sec: 5516.5, 300 sec: 5519.4). Total num frames: 670931968. Throughput: 0: 5709.1. Samples: 670934826. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:09,913][25689] Avg episode reward: [(0, '-4.397')] [2022-07-10 08:58:10,987][26022] Updated weights on worker 0-0, policy_version 655213 (0.00088) [2022-07-10 08:58:12,872][26022] Updated weights on worker 0-0, policy_version 655223 (0.00091) [2022-07-10 08:58:14,561][26022] Updated weights on worker 0-0, policy_version 655233 (0.00091) [2022-07-10 08:58:14,993][25689] Fps is (10 sec: 5686.3, 60 sec: 5530.2, 300 sec: 5514.8). Total num frames: 670960640. Throughput: 0: 5675.2. Samples: 670968152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:14,995][25689] Avg episode reward: [(0, '-3.610')] [2022-07-10 08:58:16,687][26022] Updated weights on worker 0-0, policy_version 655243 (0.00088) [2022-07-10 08:58:18,253][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 08:58:18,263][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000655253_670979072.pth [2022-07-10 08:58:18,264][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000653311_668990464.pth [2022-07-10 08:58:18,274][26022] Updated weights on worker 0-0, policy_version 655253 (0.00093) [2022-07-10 08:58:20,071][25689] Fps is (10 sec: 5543.4, 60 sec: 5507.1, 300 sec: 5518.2). Total num frames: 670988288. Throughput: 0: 4836.1. Samples: 670984860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:20,071][25689] Avg episode reward: [(0, '-5.437')] [2022-07-10 08:58:20,229][26022] Updated weights on worker 0-0, policy_version 655263 (0.00083) [2022-07-10 08:58:22,135][26022] Updated weights on worker 0-0, policy_version 655273 (0.00086) [2022-07-10 08:58:23,903][26022] Updated weights on worker 0-0, policy_version 655283 (0.00091) [2022-07-10 08:58:25,150][25689] Fps is (10 sec: 5443.9, 60 sec: 5512.9, 300 sec: 5514.0). Total num frames: 671015936. Throughput: 0: 5766.6. Samples: 671018208. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:25,151][25689] Avg episode reward: [(0, '-4.194')] [2022-07-10 08:58:25,790][26022] Updated weights on worker 0-0, policy_version 655293 (0.00091) [2022-07-10 08:58:27,734][26022] Updated weights on worker 0-0, policy_version 655303 (0.00094) [2022-07-10 08:58:29,519][26022] Updated weights on worker 0-0, policy_version 655313 (0.00089) [2022-07-10 08:58:30,187][25689] Fps is (10 sec: 5567.3, 60 sec: 5544.4, 300 sec: 5514.1). Total num frames: 671044608. Throughput: 0: 5747.4. Samples: 671051360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:30,193][25689] Avg episode reward: [(0, '-3.926')] [2022-07-10 08:58:31,436][26022] Updated weights on worker 0-0, policy_version 655323 (0.00098) [2022-07-10 08:58:32,888][26022] Updated weights on worker 0-0, policy_version 655333 (0.00089) [2022-07-10 08:58:35,218][25689] Fps is (10 sec: 5391.1, 60 sec: 5457.9, 300 sec: 5510.1). Total num frames: 671070208. Throughput: 0: 4940.6. Samples: 671068072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:35,218][25689] Avg episode reward: [(0, '-3.600')] [2022-07-10 08:58:35,251][26022] Updated weights on worker 0-0, policy_version 655343 (0.00088) [2022-07-10 08:58:36,718][26022] Updated weights on worker 0-0, policy_version 655353 (0.00086) [2022-07-10 08:58:38,887][26022] Updated weights on worker 0-0, policy_version 655363 (0.00082) [2022-07-10 08:58:40,227][25689] Fps is (10 sec: 5609.7, 60 sec: 5525.4, 300 sec: 5515.7). Total num frames: 671100928. Throughput: 0: 5770.7. Samples: 671101174. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:40,228][25689] Avg episode reward: [(0, '-3.753')] [2022-07-10 08:58:40,374][26022] Updated weights on worker 0-0, policy_version 655373 (0.00086) [2022-07-10 08:58:42,357][26022] Updated weights on worker 0-0, policy_version 655383 (0.00093) [2022-07-10 08:58:44,008][26022] Updated weights on worker 0-0, policy_version 655393 (0.00089) [2022-07-10 08:58:45,335][25689] Fps is (10 sec: 5769.0, 60 sec: 5524.2, 300 sec: 5517.2). Total num frames: 671128576. Throughput: 0: 5778.5. Samples: 671134844. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:45,336][25689] Avg episode reward: [(0, '-3.777')] [2022-07-10 08:58:45,919][26022] Updated weights on worker 0-0, policy_version 655403 (0.00084) [2022-07-10 08:58:47,781][26022] Updated weights on worker 0-0, policy_version 655413 (0.00105) [2022-07-10 08:58:49,565][26022] Updated weights on worker 0-0, policy_version 655423 (0.00096) [2022-07-10 08:58:50,391][25689] Fps is (10 sec: 5541.2, 60 sec: 5537.3, 300 sec: 5513.0). Total num frames: 671157248. Throughput: 0: 4965.6. Samples: 671151684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:50,392][25689] Avg episode reward: [(0, '-3.716')] [2022-07-10 08:58:51,596][26022] Updated weights on worker 0-0, policy_version 655433 (0.00090) [2022-07-10 08:58:53,452][26022] Updated weights on worker 0-0, policy_version 655443 (0.00086) [2022-07-10 08:58:55,127][26022] Updated weights on worker 0-0, policy_version 655453 (0.00088) [2022-07-10 08:58:55,439][25689] Fps is (10 sec: 5675.6, 60 sec: 5551.2, 300 sec: 5515.9). Total num frames: 671185920. Throughput: 0: 5773.8. Samples: 671184826. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:58:55,439][25689] Avg episode reward: [(0, '-4.816')] [2022-07-10 08:58:57,262][26022] Updated weights on worker 0-0, policy_version 655463 (0.00087) [2022-07-10 08:58:58,658][26022] Updated weights on worker 0-0, policy_version 655473 (0.00086) [2022-07-10 08:59:00,456][25689] Fps is (10 sec: 5391.9, 60 sec: 5485.3, 300 sec: 5523.5). Total num frames: 671211520. Throughput: 0: 5790.0. Samples: 671218302. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:00,457][25689] Avg episode reward: [(0, '-5.626')] [2022-07-10 08:59:00,978][26022] Updated weights on worker 0-0, policy_version 655483 (0.00085) [2022-07-10 08:59:03,057][26022] Updated weights on worker 0-0, policy_version 655493 (0.00082) [2022-07-10 08:59:04,919][26022] Updated weights on worker 0-0, policy_version 655503 (0.00088) [2022-07-10 08:59:05,542][25689] Fps is (10 sec: 5270.4, 60 sec: 5539.4, 300 sec: 5511.7). Total num frames: 671239168. Throughput: 0: 4848.6. Samples: 671232818. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:05,543][25689] Avg episode reward: [(0, '-5.564')] [2022-07-10 08:59:06,675][26022] Updated weights on worker 0-0, policy_version 655513 (0.00090) [2022-07-10 08:59:08,538][26022] Updated weights on worker 0-0, policy_version 655523 (0.00370) [2022-07-10 08:59:10,468][26022] Updated weights on worker 0-0, policy_version 655533 (0.00084) [2022-07-10 08:59:10,611][25689] Fps is (10 sec: 5445.6, 60 sec: 5516.5, 300 sec: 5511.4). Total num frames: 671266816. Throughput: 0: 5660.2. Samples: 671266132. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:10,611][25689] Avg episode reward: [(0, '-6.445')] [2022-07-10 08:59:12,339][26022] Updated weights on worker 0-0, policy_version 655543 (0.00091) [2022-07-10 08:59:14,087][26022] Updated weights on worker 0-0, policy_version 655553 (0.00086) [2022-07-10 08:59:15,626][25689] Fps is (10 sec: 5483.7, 60 sec: 5505.7, 300 sec: 5514.8). Total num frames: 671294464. Throughput: 0: 5689.0. Samples: 671299670. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:15,626][25689] Avg episode reward: [(0, '-6.475')] [2022-07-10 08:59:15,999][26022] Updated weights on worker 0-0, policy_version 655563 (0.00087) [2022-07-10 08:59:17,557][26022] Updated weights on worker 0-0, policy_version 655573 (0.00084) [2022-07-10 08:59:19,570][26022] Updated weights on worker 0-0, policy_version 655583 (0.00096) [2022-07-10 08:59:20,641][25689] Fps is (10 sec: 5615.2, 60 sec: 5528.3, 300 sec: 5510.2). Total num frames: 671323136. Throughput: 0: 4865.4. Samples: 671316506. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:20,641][25689] Avg episode reward: [(0, '-6.165')] [2022-07-10 08:59:21,392][26022] Updated weights on worker 0-0, policy_version 655593 (0.00084) [2022-07-10 08:59:23,214][26022] Updated weights on worker 0-0, policy_version 655603 (0.00089) [2022-07-10 08:59:25,144][26022] Updated weights on worker 0-0, policy_version 655613 (0.00089) [2022-07-10 08:59:25,703][25689] Fps is (10 sec: 5690.6, 60 sec: 5546.8, 300 sec: 5519.7). Total num frames: 671351808. Throughput: 0: 5821.0. Samples: 671350174. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:25,703][25689] Avg episode reward: [(0, '-5.605')] [2022-07-10 08:59:26,924][26022] Updated weights on worker 0-0, policy_version 655623 (0.01402) [2022-07-10 08:59:28,780][26022] Updated weights on worker 0-0, policy_version 655633 (0.00086) [2022-07-10 08:59:30,603][26022] Updated weights on worker 0-0, policy_version 655643 (0.00224) [2022-07-10 08:59:30,737][25689] Fps is (10 sec: 5476.9, 60 sec: 5513.2, 300 sec: 5509.1). Total num frames: 671378432. Throughput: 0: 5832.3. Samples: 671383514. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:30,737][25689] Avg episode reward: [(0, '-5.548')] [2022-07-10 08:59:32,536][26022] Updated weights on worker 0-0, policy_version 655653 (0.00091) [2022-07-10 08:59:34,458][26022] Updated weights on worker 0-0, policy_version 655663 (0.00084) [2022-07-10 08:59:35,762][25689] Fps is (10 sec: 5395.3, 60 sec: 5547.6, 300 sec: 5509.0). Total num frames: 671406080. Throughput: 0: 4985.1. Samples: 671400050. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:35,762][25689] Avg episode reward: [(0, '-6.132')] [2022-07-10 08:59:36,123][26022] Updated weights on worker 0-0, policy_version 655673 (0.00079) [2022-07-10 08:59:38,050][26022] Updated weights on worker 0-0, policy_version 655683 (0.00095) [2022-07-10 08:59:39,883][26022] Updated weights on worker 0-0, policy_version 655693 (0.00075) [2022-07-10 08:59:40,763][25689] Fps is (10 sec: 5617.3, 60 sec: 5514.5, 300 sec: 5513.5). Total num frames: 671434752. Throughput: 0: 5809.4. Samples: 671433404. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:40,763][25689] Avg episode reward: [(0, '-5.555')] [2022-07-10 08:59:41,792][26022] Updated weights on worker 0-0, policy_version 655703 (0.00091) [2022-07-10 08:59:43,435][26022] Updated weights on worker 0-0, policy_version 655713 (0.00086) [2022-07-10 08:59:45,431][26022] Updated weights on worker 0-0, policy_version 655723 (0.00090) [2022-07-10 08:59:45,810][25689] Fps is (10 sec: 5604.7, 60 sec: 5520.0, 300 sec: 5506.3). Total num frames: 671462400. Throughput: 0: 5799.0. Samples: 671466778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:45,811][25689] Avg episode reward: [(0, '-7.521')] [2022-07-10 08:59:47,352][26022] Updated weights on worker 0-0, policy_version 655733 (0.00090) [2022-07-10 08:59:49,026][26022] Updated weights on worker 0-0, policy_version 655743 (0.00087) [2022-07-10 08:59:50,863][25689] Fps is (10 sec: 5474.7, 60 sec: 5503.3, 300 sec: 5508.9). Total num frames: 671490048. Throughput: 0: 4968.5. Samples: 671483512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 08:59:50,865][25689] Avg episode reward: [(0, '-9.123')] [2022-07-10 08:59:51,028][26022] Updated weights on worker 0-0, policy_version 655753 (0.00093) [2022-07-10 08:59:52,683][26022] Updated weights on worker 0-0, policy_version 655763 (0.00088) [2022-07-10 08:59:54,712][26022] Updated weights on worker 0-0, policy_version 655773 (0.00087) [2022-07-10 08:59:55,911][25689] Fps is (10 sec: 5576.1, 60 sec: 5503.4, 300 sec: 5508.2). Total num frames: 671518720. Throughput: 0: 5815.0. Samples: 671517216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 08:59:55,911][25689] Avg episode reward: [(0, '-7.558')] [2022-07-10 08:59:56,224][26022] Updated weights on worker 0-0, policy_version 655783 (0.00093) [2022-07-10 08:59:58,233][26022] Updated weights on worker 0-0, policy_version 655793 (0.00095) [2022-07-10 08:59:59,994][26022] Updated weights on worker 0-0, policy_version 655803 (0.00093) [2022-07-10 09:00:00,913][25689] Fps is (10 sec: 5603.9, 60 sec: 5538.6, 300 sec: 5517.4). Total num frames: 671546368. Throughput: 0: 5820.7. Samples: 671550694. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:00,915][25689] Avg episode reward: [(0, '-7.877')] [2022-07-10 09:00:02,260][26022] Updated weights on worker 0-0, policy_version 655813 (0.00090) [2022-07-10 09:00:04,173][26022] Updated weights on worker 0-0, policy_version 655823 (0.00090) [2022-07-10 09:00:06,061][25689] Fps is (10 sec: 5347.0, 60 sec: 5516.0, 300 sec: 5511.8). Total num frames: 671572992. Throughput: 0: 5690.5. Samples: 671582012. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:06,063][25689] Avg episode reward: [(0, '-8.174')] [2022-07-10 09:00:06,069][26022] Updated weights on worker 0-0, policy_version 655833 (0.00100) [2022-07-10 09:00:07,746][26022] Updated weights on worker 0-0, policy_version 655843 (0.00088) [2022-07-10 09:00:09,879][26022] Updated weights on worker 0-0, policy_version 655853 (0.00092) [2022-07-10 09:00:11,073][25689] Fps is (10 sec: 5442.9, 60 sec: 5538.2, 300 sec: 5515.5). Total num frames: 671601664. Throughput: 0: 5687.8. Samples: 671598460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:11,073][25689] Avg episode reward: [(0, '-10.051')] [2022-07-10 09:00:11,431][26022] Updated weights on worker 0-0, policy_version 655863 (0.00089) [2022-07-10 09:00:13,682][26022] Updated weights on worker 0-0, policy_version 655873 (0.00079) [2022-07-10 09:00:15,206][26022] Updated weights on worker 0-0, policy_version 655883 (0.00087) [2022-07-10 09:00:16,098][25689] Fps is (10 sec: 5509.3, 60 sec: 5520.3, 300 sec: 5508.3). Total num frames: 671628288. Throughput: 0: 5653.8. Samples: 671631348. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:16,100][25689] Avg episode reward: [(0, '-8.409')] [2022-07-10 09:00:17,195][26022] Updated weights on worker 0-0, policy_version 655893 (0.00086) [2022-07-10 09:00:18,496][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:00:18,512][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000655900_671641600.pth [2022-07-10 09:00:18,525][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000653959_669654016.pth [2022-07-10 09:00:18,830][26022] Updated weights on worker 0-0, policy_version 655903 (0.00093) [2022-07-10 09:00:20,620][26022] Updated weights on worker 0-0, policy_version 655913 (0.00086) [2022-07-10 09:00:21,124][25689] Fps is (10 sec: 5399.5, 60 sec: 5502.3, 300 sec: 5508.6). Total num frames: 671655936. Throughput: 0: 5664.6. Samples: 671665180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:21,125][25689] Avg episode reward: [(0, '-7.519')] [2022-07-10 09:00:22,583][26022] Updated weights on worker 0-0, policy_version 655923 (0.00088) [2022-07-10 09:00:24,604][26022] Updated weights on worker 0-0, policy_version 655933 (0.00083) [2022-07-10 09:00:26,209][25689] Fps is (10 sec: 5671.2, 60 sec: 5517.2, 300 sec: 5517.4). Total num frames: 671685632. Throughput: 0: 4959.6. Samples: 671681940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:26,209][25689] Avg episode reward: [(0, '-11.373')] [2022-07-10 09:00:26,214][26022] Updated weights on worker 0-0, policy_version 655943 (0.00089) [2022-07-10 09:00:28,260][26022] Updated weights on worker 0-0, policy_version 655953 (0.00080) [2022-07-10 09:00:29,843][26022] Updated weights on worker 0-0, policy_version 655963 (0.00087) [2022-07-10 09:00:31,230][25689] Fps is (10 sec: 5674.5, 60 sec: 5535.3, 300 sec: 5510.5). Total num frames: 671713280. Throughput: 0: 5792.9. Samples: 671715228. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:31,230][25689] Avg episode reward: [(0, '-9.904')] [2022-07-10 09:00:31,734][26022] Updated weights on worker 0-0, policy_version 655973 (0.00089) [2022-07-10 09:00:33,695][26022] Updated weights on worker 0-0, policy_version 655983 (0.00085) [2022-07-10 09:00:35,453][26022] Updated weights on worker 0-0, policy_version 655993 (0.00092) [2022-07-10 09:00:36,279][25689] Fps is (10 sec: 5491.4, 60 sec: 5533.1, 300 sec: 5513.2). Total num frames: 671740928. Throughput: 0: 5815.8. Samples: 671748716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:36,279][25689] Avg episode reward: [(0, '-7.847')] [2022-07-10 09:00:37,296][26022] Updated weights on worker 0-0, policy_version 656003 (0.00086) [2022-07-10 09:00:39,146][26022] Updated weights on worker 0-0, policy_version 656013 (0.00085) [2022-07-10 09:00:40,877][26022] Updated weights on worker 0-0, policy_version 656023 (0.00083) [2022-07-10 09:00:41,303][25689] Fps is (10 sec: 5590.9, 60 sec: 5531.0, 300 sec: 5511.5). Total num frames: 671769600. Throughput: 0: 4966.1. Samples: 671765392. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:41,304][25689] Avg episode reward: [(0, '-8.030')] [2022-07-10 09:00:42,898][26022] Updated weights on worker 0-0, policy_version 656033 (0.00091) [2022-07-10 09:00:44,667][26022] Updated weights on worker 0-0, policy_version 656043 (0.00088) [2022-07-10 09:00:46,420][25689] Fps is (10 sec: 5452.6, 60 sec: 5507.8, 300 sec: 5510.8). Total num frames: 671796224. Throughput: 0: 5772.0. Samples: 671798598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:46,420][25689] Avg episode reward: [(0, '-7.314')] [2022-07-10 09:00:46,685][26022] Updated weights on worker 0-0, policy_version 656053 (0.00092) [2022-07-10 09:00:48,351][26022] Updated weights on worker 0-0, policy_version 656063 (0.00085) [2022-07-10 09:00:50,307][26022] Updated weights on worker 0-0, policy_version 656073 (0.00089) [2022-07-10 09:00:51,506][25689] Fps is (10 sec: 5420.0, 60 sec: 5521.7, 300 sec: 5510.1). Total num frames: 671824896. Throughput: 0: 5778.8. Samples: 671832398. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:51,506][25689] Avg episode reward: [(0, '-5.812')] [2022-07-10 09:00:52,136][26022] Updated weights on worker 0-0, policy_version 656083 (0.00089) [2022-07-10 09:00:53,786][26022] Updated weights on worker 0-0, policy_version 656093 (0.00094) [2022-07-10 09:00:55,771][26022] Updated weights on worker 0-0, policy_version 656103 (0.00117) [2022-07-10 09:00:56,525][25689] Fps is (10 sec: 5877.4, 60 sec: 5558.0, 300 sec: 5521.1). Total num frames: 671855616. Throughput: 0: 4965.2. Samples: 671849244. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:00:56,525][25689] Avg episode reward: [(0, '-4.972')] [2022-07-10 09:00:57,532][26022] Updated weights on worker 0-0, policy_version 656113 (0.00094) [2022-07-10 09:00:59,265][26022] Updated weights on worker 0-0, policy_version 656123 (0.00080) [2022-07-10 09:01:01,582][25689] Fps is (10 sec: 5386.0, 60 sec: 5485.5, 300 sec: 5510.6). Total num frames: 671879168. Throughput: 0: 5773.7. Samples: 671882476. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:01,583][25689] Avg episode reward: [(0, '-5.109')] [2022-07-10 09:01:02,008][26022] Updated weights on worker 0-0, policy_version 656133 (0.00095) [2022-07-10 09:01:03,306][26022] Updated weights on worker 0-0, policy_version 656143 (0.00089) [2022-07-10 09:01:05,624][26022] Updated weights on worker 0-0, policy_version 656153 (0.00094) [2022-07-10 09:01:06,733][25689] Fps is (10 sec: 5216.3, 60 sec: 5535.8, 300 sec: 5518.2). Total num frames: 671908864. Throughput: 0: 5654.2. Samples: 671913454. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:06,734][25689] Avg episode reward: [(0, '-5.794')] [2022-07-10 09:01:07,069][26022] Updated weights on worker 0-0, policy_version 656163 (0.00090) [2022-07-10 09:01:09,072][26022] Updated weights on worker 0-0, policy_version 656173 (0.00094) [2022-07-10 09:01:11,172][26022] Updated weights on worker 0-0, policy_version 656183 (0.00091) [2022-07-10 09:01:11,782][25689] Fps is (10 sec: 5521.4, 60 sec: 5498.7, 300 sec: 5507.2). Total num frames: 671935488. Throughput: 0: 4808.9. Samples: 671929906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:11,783][25689] Avg episode reward: [(0, '-5.519')] [2022-07-10 09:01:12,786][26022] Updated weights on worker 0-0, policy_version 656193 (0.00092) [2022-07-10 09:01:14,667][26022] Updated weights on worker 0-0, policy_version 656203 (0.00092) [2022-07-10 09:01:16,814][25689] Fps is (10 sec: 5180.3, 60 sec: 5481.2, 300 sec: 5503.7). Total num frames: 671961088. Throughput: 0: 5604.4. Samples: 671962954. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:16,816][25689] Avg episode reward: [(0, '-5.131')] [2022-07-10 09:01:16,938][26022] Updated weights on worker 0-0, policy_version 656213 (0.00092) [2022-07-10 09:01:18,165][26022] Updated weights on worker 0-0, policy_version 656223 (0.00091) [2022-07-10 09:01:20,520][26022] Updated weights on worker 0-0, policy_version 656233 (0.00096) [2022-07-10 09:01:21,849][25689] Fps is (10 sec: 5594.7, 60 sec: 5531.0, 300 sec: 5518.8). Total num frames: 671991808. Throughput: 0: 5625.1. Samples: 671996480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:21,849][25689] Avg episode reward: [(0, '-6.012')] [2022-07-10 09:01:21,868][26022] Updated weights on worker 0-0, policy_version 656243 (0.00091) [2022-07-10 09:01:23,983][26022] Updated weights on worker 0-0, policy_version 656253 (0.00084) [2022-07-10 09:01:25,655][26022] Updated weights on worker 0-0, policy_version 656263 (0.00087) [2022-07-10 09:01:26,988][25689] Fps is (10 sec: 5636.2, 60 sec: 5475.5, 300 sec: 5510.9). Total num frames: 672018432. Throughput: 0: 4920.8. Samples: 672013126. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:26,989][25689] Avg episode reward: [(0, '-6.018')] [2022-07-10 09:01:27,579][26022] Updated weights on worker 0-0, policy_version 656273 (0.00086) [2022-07-10 09:01:29,491][26022] Updated weights on worker 0-0, policy_version 656283 (0.00087) [2022-07-10 09:01:31,375][26022] Updated weights on worker 0-0, policy_version 656293 (0.00079) [2022-07-10 09:01:32,031][25689] Fps is (10 sec: 5430.9, 60 sec: 5490.4, 300 sec: 5511.1). Total num frames: 672047104. Throughput: 0: 5750.0. Samples: 672046334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:32,031][25689] Avg episode reward: [(0, '-6.097')] [2022-07-10 09:01:33,153][26022] Updated weights on worker 0-0, policy_version 656303 (0.00087) [2022-07-10 09:01:35,094][26022] Updated weights on worker 0-0, policy_version 656313 (0.00091) [2022-07-10 09:01:36,963][26022] Updated weights on worker 0-0, policy_version 656323 (0.00088) [2022-07-10 09:01:37,047][25689] Fps is (10 sec: 5599.4, 60 sec: 5493.4, 300 sec: 5511.3). Total num frames: 672074752. Throughput: 0: 5764.9. Samples: 672079592. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:37,047][25689] Avg episode reward: [(0, '-4.877')] [2022-07-10 09:01:38,720][26022] Updated weights on worker 0-0, policy_version 656333 (0.00094) [2022-07-10 09:01:40,731][26022] Updated weights on worker 0-0, policy_version 656343 (0.00101) [2022-07-10 09:01:42,053][25689] Fps is (10 sec: 5619.6, 60 sec: 5495.1, 300 sec: 5515.8). Total num frames: 672103424. Throughput: 0: 4934.4. Samples: 672096174. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:42,053][25689] Avg episode reward: [(0, '-5.576')] [2022-07-10 09:01:42,575][26022] Updated weights on worker 0-0, policy_version 656353 (0.00088) [2022-07-10 09:01:44,299][26022] Updated weights on worker 0-0, policy_version 656363 (0.00085) [2022-07-10 09:01:46,522][26022] Updated weights on worker 0-0, policy_version 656373 (0.00863) [2022-07-10 09:01:47,165][25689] Fps is (10 sec: 5566.4, 60 sec: 5512.4, 300 sec: 5514.8). Total num frames: 672131072. Throughput: 0: 5754.6. Samples: 672129232. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:47,165][25689] Avg episode reward: [(0, '-7.001')] [2022-07-10 09:01:48,105][26022] Updated weights on worker 0-0, policy_version 656383 (0.00092) [2022-07-10 09:01:49,999][26022] Updated weights on worker 0-0, policy_version 656393 (0.00082) [2022-07-10 09:01:51,641][26022] Updated weights on worker 0-0, policy_version 656403 (0.00084) [2022-07-10 09:01:52,177][25689] Fps is (10 sec: 5563.1, 60 sec: 5519.1, 300 sec: 5515.6). Total num frames: 672159744. Throughput: 0: 5772.1. Samples: 672162618. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:52,177][25689] Avg episode reward: [(0, '-4.719')] [2022-07-10 09:01:53,556][26022] Updated weights on worker 0-0, policy_version 656413 (0.00093) [2022-07-10 09:01:55,330][26022] Updated weights on worker 0-0, policy_version 656423 (0.00088) [2022-07-10 09:01:57,266][25689] Fps is (10 sec: 5473.9, 60 sec: 5445.2, 300 sec: 5511.6). Total num frames: 672186368. Throughput: 0: 5770.1. Samples: 672196262. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:01:57,267][25689] Avg episode reward: [(0, '-5.788')] [2022-07-10 09:01:57,289][26022] Updated weights on worker 0-0, policy_version 656433 (0.00087) [2022-07-10 09:01:59,005][26022] Updated weights on worker 0-0, policy_version 656443 (0.00092) [2022-07-10 09:02:00,961][26022] Updated weights on worker 0-0, policy_version 656453 (0.00386) [2022-07-10 09:02:02,271][25689] Fps is (10 sec: 5376.4, 60 sec: 5517.4, 300 sec: 5520.2). Total num frames: 672214016. Throughput: 0: 5771.6. Samples: 672212866. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:02,272][25689] Avg episode reward: [(0, '-6.019')] [2022-07-10 09:02:03,220][26022] Updated weights on worker 0-0, policy_version 656463 (0.00086) [2022-07-10 09:02:05,087][26022] Updated weights on worker 0-0, policy_version 656473 (0.00086) [2022-07-10 09:02:06,966][26022] Updated weights on worker 0-0, policy_version 656483 (0.00089) [2022-07-10 09:02:07,428][25689] Fps is (10 sec: 5441.5, 60 sec: 5483.1, 300 sec: 5514.6). Total num frames: 672241664. Throughput: 0: 5670.8. Samples: 672244144. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:07,429][25689] Avg episode reward: [(0, '-6.525')] [2022-07-10 09:02:08,771][26022] Updated weights on worker 0-0, policy_version 656493 (0.00084) [2022-07-10 09:02:10,608][26022] Updated weights on worker 0-0, policy_version 656503 (0.00092) [2022-07-10 09:02:12,267][26022] Updated weights on worker 0-0, policy_version 656513 (0.00052) [2022-07-10 09:02:12,463][25689] Fps is (10 sec: 5526.2, 60 sec: 5518.2, 300 sec: 5518.3). Total num frames: 672270336. Throughput: 0: 5665.0. Samples: 672277538. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:12,463][25689] Avg episode reward: [(0, '-7.062')] [2022-07-10 09:02:14,351][26022] Updated weights on worker 0-0, policy_version 656523 (0.00094) [2022-07-10 09:02:16,084][26022] Updated weights on worker 0-0, policy_version 656533 (0.00085) [2022-07-10 09:02:17,467][25689] Fps is (10 sec: 5406.5, 60 sec: 5520.8, 300 sec: 5508.1). Total num frames: 672295936. Throughput: 0: 4830.8. Samples: 672293844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:17,468][25689] Avg episode reward: [(0, '-6.728')] [2022-07-10 09:02:18,023][26022] Updated weights on worker 0-0, policy_version 656543 (0.00061) [2022-07-10 09:02:18,701][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:02:18,726][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000656547_672304128.pth [2022-07-10 09:02:18,727][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000654605_670315520.pth [2022-07-10 09:02:19,713][26022] Updated weights on worker 0-0, policy_version 656553 (0.00084) [2022-07-10 09:02:21,786][26022] Updated weights on worker 0-0, policy_version 656563 (0.00092) [2022-07-10 09:02:22,472][25689] Fps is (10 sec: 5524.2, 60 sec: 5506.5, 300 sec: 5517.5). Total num frames: 672325632. Throughput: 0: 5665.0. Samples: 672327306. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:22,473][25689] Avg episode reward: [(0, '-7.118')] [2022-07-10 09:02:23,434][26022] Updated weights on worker 0-0, policy_version 656573 (0.00088) [2022-07-10 09:02:25,371][26022] Updated weights on worker 0-0, policy_version 656583 (0.00085) [2022-07-10 09:02:27,110][26022] Updated weights on worker 0-0, policy_version 656593 (0.00096) [2022-07-10 09:02:27,551][25689] Fps is (10 sec: 5584.9, 60 sec: 5512.1, 300 sec: 5516.2). Total num frames: 672352256. Throughput: 0: 5792.3. Samples: 672360700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:27,553][25689] Avg episode reward: [(0, '-6.696')] [2022-07-10 09:02:29,220][26022] Updated weights on worker 0-0, policy_version 656603 (0.00090) [2022-07-10 09:02:30,877][26022] Updated weights on worker 0-0, policy_version 656613 (0.00082) [2022-07-10 09:02:32,565][25689] Fps is (10 sec: 5478.7, 60 sec: 5514.6, 300 sec: 5509.3). Total num frames: 672380928. Throughput: 0: 4957.9. Samples: 672377204. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:32,566][25689] Avg episode reward: [(0, '-6.919')] [2022-07-10 09:02:32,713][26022] Updated weights on worker 0-0, policy_version 656623 (0.00091) [2022-07-10 09:02:34,577][26022] Updated weights on worker 0-0, policy_version 656633 (0.00097) [2022-07-10 09:02:36,435][26022] Updated weights on worker 0-0, policy_version 656643 (0.00092) [2022-07-10 09:02:37,624][25689] Fps is (10 sec: 5591.0, 60 sec: 5510.7, 300 sec: 5511.8). Total num frames: 672408576. Throughput: 0: 5795.5. Samples: 672410668. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:37,625][25689] Avg episode reward: [(0, '-7.706')] [2022-07-10 09:02:38,382][26022] Updated weights on worker 0-0, policy_version 656653 (0.00093) [2022-07-10 09:02:40,268][26022] Updated weights on worker 0-0, policy_version 656663 (0.00102) [2022-07-10 09:02:41,930][26022] Updated weights on worker 0-0, policy_version 656673 (0.00104) [2022-07-10 09:02:42,627][25689] Fps is (10 sec: 5597.4, 60 sec: 5511.0, 300 sec: 5516.9). Total num frames: 672437248. Throughput: 0: 5784.0. Samples: 672443880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:42,629][25689] Avg episode reward: [(0, '-8.055')] [2022-07-10 09:02:43,875][26022] Updated weights on worker 0-0, policy_version 656683 (0.00085) [2022-07-10 09:02:45,741][26022] Updated weights on worker 0-0, policy_version 656693 (0.00086) [2022-07-10 09:02:47,711][25689] Fps is (10 sec: 5482.0, 60 sec: 5496.6, 300 sec: 5512.2). Total num frames: 672463872. Throughput: 0: 4937.3. Samples: 672460236. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:47,712][25689] Avg episode reward: [(0, '-8.001')] [2022-07-10 09:02:47,713][26022] Updated weights on worker 0-0, policy_version 656703 (0.00096) [2022-07-10 09:02:49,351][26022] Updated weights on worker 0-0, policy_version 656713 (0.00092) [2022-07-10 09:02:51,410][26022] Updated weights on worker 0-0, policy_version 656723 (0.00093) [2022-07-10 09:02:52,719][25689] Fps is (10 sec: 5479.4, 60 sec: 5497.0, 300 sec: 5515.7). Total num frames: 672492544. Throughput: 0: 5769.6. Samples: 672493480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:52,719][25689] Avg episode reward: [(0, '-8.475')] [2022-07-10 09:02:53,165][26022] Updated weights on worker 0-0, policy_version 656733 (0.00089) [2022-07-10 09:02:55,003][26022] Updated weights on worker 0-0, policy_version 656743 (0.00095) [2022-07-10 09:02:56,655][26022] Updated weights on worker 0-0, policy_version 656753 (0.00089) [2022-07-10 09:02:57,722][25689] Fps is (10 sec: 5523.6, 60 sec: 5504.9, 300 sec: 5506.0). Total num frames: 672519168. Throughput: 0: 5780.7. Samples: 672526846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:02:57,722][25689] Avg episode reward: [(0, '-6.781')] [2022-07-10 09:02:58,596][26022] Updated weights on worker 0-0, policy_version 656763 (0.00103) [2022-07-10 09:03:00,521][26022] Updated weights on worker 0-0, policy_version 656773 (0.00090) [2022-07-10 09:03:02,753][25689] Fps is (10 sec: 5204.5, 60 sec: 5468.6, 300 sec: 5511.2). Total num frames: 672544768. Throughput: 0: 4947.4. Samples: 672543452. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:03:02,754][25689] Avg episode reward: [(0, '-6.640')] [2022-07-10 09:03:02,823][26022] Updated weights on worker 0-0, policy_version 656783 (0.00096) [2022-07-10 09:03:04,703][26022] Updated weights on worker 0-0, policy_version 656793 (0.00088) [2022-07-10 09:03:06,362][26022] Updated weights on worker 0-0, policy_version 656803 (0.00088) [2022-07-10 09:03:07,819][25689] Fps is (10 sec: 5375.1, 60 sec: 5493.8, 300 sec: 5510.0). Total num frames: 672573440. Throughput: 0: 5693.8. Samples: 672574726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 09:03:07,819][25689] Avg episode reward: [(0, '-7.330')] [2022-07-10 09:03:08,181][26022] Updated weights on worker 0-0, policy_version 656813 (0.00089) [2022-07-10 09:03:10,244][26022] Updated weights on worker 0-0, policy_version 656823 (0.00108) [2022-07-10 09:03:11,832][26022] Updated weights on worker 0-0, policy_version 656833 (0.00089) [2022-07-10 09:03:12,865][25689] Fps is (10 sec: 5569.8, 60 sec: 5475.8, 300 sec: 5507.2). Total num frames: 672601088. Throughput: 0: 5670.2. Samples: 672607712. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:12,865][25689] Avg episode reward: [(0, '-5.934')] [2022-07-10 09:03:14,027][26022] Updated weights on worker 0-0, policy_version 656843 (0.00082) [2022-07-10 09:03:15,628][26022] Updated weights on worker 0-0, policy_version 656853 (0.00087) [2022-07-10 09:03:17,607][26022] Updated weights on worker 0-0, policy_version 656863 (0.00089) [2022-07-10 09:03:17,877][25689] Fps is (10 sec: 5497.6, 60 sec: 5509.0, 300 sec: 5508.4). Total num frames: 672628736. Throughput: 0: 4841.5. Samples: 672624428. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:17,878][25689] Avg episode reward: [(0, '-3.841')] [2022-07-10 09:03:19,577][26022] Updated weights on worker 0-0, policy_version 656873 (0.00091) [2022-07-10 09:03:21,070][26022] Updated weights on worker 0-0, policy_version 656883 (0.00096) [2022-07-10 09:03:22,879][25689] Fps is (10 sec: 5521.9, 60 sec: 5475.5, 300 sec: 5509.9). Total num frames: 672656384. Throughput: 0: 5691.9. Samples: 672658004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:22,879][25689] Avg episode reward: [(0, '-4.339')] [2022-07-10 09:03:23,140][26022] Updated weights on worker 0-0, policy_version 656893 (0.00088) [2022-07-10 09:03:24,763][26022] Updated weights on worker 0-0, policy_version 656903 (0.00084) [2022-07-10 09:03:26,775][26022] Updated weights on worker 0-0, policy_version 656913 (0.00089) [2022-07-10 09:03:27,947][25689] Fps is (10 sec: 5592.8, 60 sec: 5510.3, 300 sec: 5509.3). Total num frames: 672685056. Throughput: 0: 5801.3. Samples: 672691496. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:27,949][25689] Avg episode reward: [(0, '-4.112')] [2022-07-10 09:03:28,656][26022] Updated weights on worker 0-0, policy_version 656923 (0.00089) [2022-07-10 09:03:30,387][26022] Updated weights on worker 0-0, policy_version 656933 (0.00092) [2022-07-10 09:03:32,407][26022] Updated weights on worker 0-0, policy_version 656943 (0.00088) [2022-07-10 09:03:32,952][25689] Fps is (10 sec: 5692.9, 60 sec: 5511.2, 300 sec: 5520.1). Total num frames: 672713728. Throughput: 0: 5008.0. Samples: 672708306. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:32,953][25689] Avg episode reward: [(0, '-3.928')] [2022-07-10 09:03:33,890][26022] Updated weights on worker 0-0, policy_version 656953 (0.00100) [2022-07-10 09:03:35,853][26022] Updated weights on worker 0-0, policy_version 656963 (0.00085) [2022-07-10 09:03:37,809][26022] Updated weights on worker 0-0, policy_version 656973 (0.00051) [2022-07-10 09:03:37,970][25689] Fps is (10 sec: 5619.1, 60 sec: 5514.9, 300 sec: 5509.6). Total num frames: 672741376. Throughput: 0: 5855.8. Samples: 672742090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:37,971][25689] Avg episode reward: [(0, '-3.739')] [2022-07-10 09:03:39,675][26022] Updated weights on worker 0-0, policy_version 656983 (0.00089) [2022-07-10 09:03:41,467][26022] Updated weights on worker 0-0, policy_version 656993 (0.00091) [2022-07-10 09:03:42,987][25689] Fps is (10 sec: 5612.3, 60 sec: 5513.6, 300 sec: 5514.7). Total num frames: 672770048. Throughput: 0: 5857.1. Samples: 672775778. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:42,987][25689] Avg episode reward: [(0, '-4.136')] [2022-07-10 09:03:43,171][26022] Updated weights on worker 0-0, policy_version 657003 (0.00087) [2022-07-10 09:03:44,966][26022] Updated weights on worker 0-0, policy_version 657013 (0.00084) [2022-07-10 09:03:47,088][26022] Updated weights on worker 0-0, policy_version 657023 (0.00085) [2022-07-10 09:03:48,036][25689] Fps is (10 sec: 5696.7, 60 sec: 5550.7, 300 sec: 5514.9). Total num frames: 672798720. Throughput: 0: 5020.7. Samples: 672792358. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:48,038][25689] Avg episode reward: [(0, '-4.967')] [2022-07-10 09:03:48,882][26022] Updated weights on worker 0-0, policy_version 657033 (0.00083) [2022-07-10 09:03:50,574][26022] Updated weights on worker 0-0, policy_version 657043 (0.00097) [2022-07-10 09:03:52,421][26022] Updated weights on worker 0-0, policy_version 657053 (0.00086) [2022-07-10 09:03:53,040][25689] Fps is (10 sec: 5500.0, 60 sec: 5517.1, 300 sec: 5508.8). Total num frames: 672825344. Throughput: 0: 5834.3. Samples: 672825510. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:53,042][25689] Avg episode reward: [(0, '-3.781')] [2022-07-10 09:03:54,327][26022] Updated weights on worker 0-0, policy_version 657063 (0.00091) [2022-07-10 09:03:56,178][26022] Updated weights on worker 0-0, policy_version 657073 (0.00086) [2022-07-10 09:03:58,064][25689] Fps is (10 sec: 5412.0, 60 sec: 5532.1, 300 sec: 5515.5). Total num frames: 672852992. Throughput: 0: 5813.5. Samples: 672858908. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:03:58,065][25689] Avg episode reward: [(0, '-3.365')] [2022-07-10 09:03:58,068][26022] Updated weights on worker 0-0, policy_version 657083 (0.00102) [2022-07-10 09:03:59,835][26022] Updated weights on worker 0-0, policy_version 657093 (0.00094) [2022-07-10 09:04:01,831][26022] Updated weights on worker 0-0, policy_version 657103 (0.00098) [2022-07-10 09:04:03,097][25689] Fps is (10 sec: 5396.6, 60 sec: 5549.0, 300 sec: 5513.1). Total num frames: 672879616. Throughput: 0: 4961.7. Samples: 672875560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:03,099][25689] Avg episode reward: [(0, '-3.286')] [2022-07-10 09:04:03,995][26022] Updated weights on worker 0-0, policy_version 657113 (0.00091) [2022-07-10 09:04:05,647][26022] Updated weights on worker 0-0, policy_version 657123 (0.00084) [2022-07-10 09:04:07,577][26022] Updated weights on worker 0-0, policy_version 657133 (0.00085) [2022-07-10 09:04:08,207][25689] Fps is (10 sec: 5249.8, 60 sec: 5511.0, 300 sec: 5508.9). Total num frames: 672906240. Throughput: 0: 5684.3. Samples: 672907016. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:08,208][25689] Avg episode reward: [(0, '-4.169')] [2022-07-10 09:04:09,335][26022] Updated weights on worker 0-0, policy_version 657143 (0.00093) [2022-07-10 09:04:11,344][26022] Updated weights on worker 0-0, policy_version 657153 (0.00096) [2022-07-10 09:04:12,963][26022] Updated weights on worker 0-0, policy_version 657163 (0.00088) [2022-07-10 09:04:13,255][25689] Fps is (10 sec: 5544.6, 60 sec: 5544.8, 300 sec: 5515.2). Total num frames: 672935936. Throughput: 0: 5695.0. Samples: 672940632. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:13,255][25689] Avg episode reward: [(0, '-4.603')] [2022-07-10 09:04:14,753][26022] Updated weights on worker 0-0, policy_version 657173 (0.00084) [2022-07-10 09:04:16,702][26022] Updated weights on worker 0-0, policy_version 657183 (0.00611) [2022-07-10 09:04:18,272][25689] Fps is (10 sec: 5697.5, 60 sec: 5544.3, 300 sec: 5511.7). Total num frames: 672963584. Throughput: 0: 4874.8. Samples: 672957416. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:18,273][25689] Avg episode reward: [(0, '-4.927')] [2022-07-10 09:04:18,594][26022] Updated weights on worker 0-0, policy_version 657193 (0.00100) [2022-07-10 09:04:18,804][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:04:18,822][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000657194_672966656.pth [2022-07-10 09:04:18,822][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000655253_670979072.pth [2022-07-10 09:04:20,286][26022] Updated weights on worker 0-0, policy_version 657203 (0.00465) [2022-07-10 09:04:22,254][26022] Updated weights on worker 0-0, policy_version 657213 (0.00092) [2022-07-10 09:04:23,299][25689] Fps is (10 sec: 5505.5, 60 sec: 5542.0, 300 sec: 5508.9). Total num frames: 672991232. Throughput: 0: 5711.6. Samples: 672990942. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:23,299][25689] Avg episode reward: [(0, '-6.891')] [2022-07-10 09:04:24,008][26022] Updated weights on worker 0-0, policy_version 657223 (0.00087) [2022-07-10 09:04:26,155][26022] Updated weights on worker 0-0, policy_version 657233 (0.00084) [2022-07-10 09:04:27,727][26022] Updated weights on worker 0-0, policy_version 657243 (0.00086) [2022-07-10 09:04:28,366][25689] Fps is (10 sec: 5579.5, 60 sec: 5542.1, 300 sec: 5515.2). Total num frames: 673019904. Throughput: 0: 5814.7. Samples: 673024234. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:28,367][25689] Avg episode reward: [(0, '-7.338')] [2022-07-10 09:04:29,727][26022] Updated weights on worker 0-0, policy_version 657253 (0.00093) [2022-07-10 09:04:31,388][26022] Updated weights on worker 0-0, policy_version 657263 (0.00095) [2022-07-10 09:04:33,387][25689] Fps is (10 sec: 5481.4, 60 sec: 5506.7, 300 sec: 5511.8). Total num frames: 673046528. Throughput: 0: 4975.9. Samples: 673040806. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:33,387][25689] Avg episode reward: [(0, '-7.743')] [2022-07-10 09:04:33,475][26022] Updated weights on worker 0-0, policy_version 657273 (0.00085) [2022-07-10 09:04:35,214][26022] Updated weights on worker 0-0, policy_version 657283 (0.00084) [2022-07-10 09:04:37,108][26022] Updated weights on worker 0-0, policy_version 657293 (0.00091) [2022-07-10 09:04:38,401][25689] Fps is (10 sec: 5714.3, 60 sec: 5557.9, 300 sec: 5518.4). Total num frames: 673077248. Throughput: 0: 5800.3. Samples: 673074172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:38,402][25689] Avg episode reward: [(0, '-7.315')] [2022-07-10 09:04:38,917][26022] Updated weights on worker 0-0, policy_version 657303 (0.00093) [2022-07-10 09:04:40,725][26022] Updated weights on worker 0-0, policy_version 657313 (0.00093) [2022-07-10 09:04:42,635][26022] Updated weights on worker 0-0, policy_version 657323 (0.00087) [2022-07-10 09:04:43,404][25689] Fps is (10 sec: 5622.1, 60 sec: 5508.3, 300 sec: 5512.4). Total num frames: 673102848. Throughput: 0: 5796.2. Samples: 673107478. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:43,405][25689] Avg episode reward: [(0, '-6.712')] [2022-07-10 09:04:44,452][26022] Updated weights on worker 0-0, policy_version 657333 (0.00093) [2022-07-10 09:04:46,236][26022] Updated weights on worker 0-0, policy_version 657343 (0.00089) [2022-07-10 09:04:48,335][26022] Updated weights on worker 0-0, policy_version 657353 (0.00081) [2022-07-10 09:04:48,446][25689] Fps is (10 sec: 5301.2, 60 sec: 5492.1, 300 sec: 5512.6). Total num frames: 673130496. Throughput: 0: 4968.9. Samples: 673124008. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:48,446][25689] Avg episode reward: [(0, '-5.739')] [2022-07-10 09:04:50,012][26022] Updated weights on worker 0-0, policy_version 657363 (0.00077) [2022-07-10 09:04:51,966][26022] Updated weights on worker 0-0, policy_version 657373 (0.00091) [2022-07-10 09:04:53,449][25689] Fps is (10 sec: 5606.7, 60 sec: 5526.1, 300 sec: 5513.4). Total num frames: 673159168. Throughput: 0: 5803.5. Samples: 673157240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:53,450][25689] Avg episode reward: [(0, '-4.046')] [2022-07-10 09:04:53,591][26022] Updated weights on worker 0-0, policy_version 657383 (0.00086) [2022-07-10 09:04:55,689][26022] Updated weights on worker 0-0, policy_version 657393 (0.00092) [2022-07-10 09:04:57,128][26022] Updated weights on worker 0-0, policy_version 657403 (0.00088) [2022-07-10 09:04:58,469][25689] Fps is (10 sec: 5517.0, 60 sec: 5509.5, 300 sec: 5509.6). Total num frames: 673185792. Throughput: 0: 5820.1. Samples: 673190966. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:04:58,470][25689] Avg episode reward: [(0, '-4.405')] [2022-07-10 09:04:59,301][26022] Updated weights on worker 0-0, policy_version 657413 (0.00087) [2022-07-10 09:05:00,687][26022] Updated weights on worker 0-0, policy_version 657423 (0.00085) [2022-07-10 09:05:03,265][26022] Updated weights on worker 0-0, policy_version 657433 (0.00092) [2022-07-10 09:05:03,476][25689] Fps is (10 sec: 5208.8, 60 sec: 5495.0, 300 sec: 5508.8). Total num frames: 673211392. Throughput: 0: 5003.4. Samples: 673207902. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:03,476][25689] Avg episode reward: [(0, '-4.200')] [2022-07-10 09:05:04,959][26022] Updated weights on worker 0-0, policy_version 657443 (0.00059) [2022-07-10 09:05:06,808][26022] Updated weights on worker 0-0, policy_version 657453 (0.00090) [2022-07-10 09:05:08,593][25689] Fps is (10 sec: 5461.8, 60 sec: 5545.2, 300 sec: 5510.3). Total num frames: 673241088. Throughput: 0: 5723.6. Samples: 673239322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:08,595][25689] Avg episode reward: [(0, '-4.583')] [2022-07-10 09:05:08,781][26022] Updated weights on worker 0-0, policy_version 657463 (0.00092) [2022-07-10 09:05:10,496][26022] Updated weights on worker 0-0, policy_version 657473 (0.00091) [2022-07-10 09:05:12,430][26022] Updated weights on worker 0-0, policy_version 657483 (0.00100) [2022-07-10 09:05:13,624][25689] Fps is (10 sec: 5751.5, 60 sec: 5529.7, 300 sec: 5517.1). Total num frames: 673269760. Throughput: 0: 5727.2. Samples: 673272782. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:13,624][25689] Avg episode reward: [(0, '-4.763')] [2022-07-10 09:05:14,327][26022] Updated weights on worker 0-0, policy_version 657493 (0.00086) [2022-07-10 09:05:15,982][26022] Updated weights on worker 0-0, policy_version 657503 (0.00077) [2022-07-10 09:05:18,082][26022] Updated weights on worker 0-0, policy_version 657513 (0.00087) [2022-07-10 09:05:18,652][25689] Fps is (10 sec: 5599.0, 60 sec: 5528.7, 300 sec: 5517.0). Total num frames: 673297408. Throughput: 0: 5705.0. Samples: 673306110. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:18,654][25689] Avg episode reward: [(0, '-4.788')] [2022-07-10 09:05:19,754][26022] Updated weights on worker 0-0, policy_version 657523 (0.00091) [2022-07-10 09:05:21,572][26022] Updated weights on worker 0-0, policy_version 657533 (0.00088) [2022-07-10 09:05:23,279][26022] Updated weights on worker 0-0, policy_version 657543 (0.00090) [2022-07-10 09:05:23,662][25689] Fps is (10 sec: 5610.4, 60 sec: 5547.2, 300 sec: 5515.0). Total num frames: 673326080. Throughput: 0: 5704.5. Samples: 673323058. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:23,663][25689] Avg episode reward: [(0, '-5.506')] [2022-07-10 09:05:25,262][26022] Updated weights on worker 0-0, policy_version 657553 (0.00090) [2022-07-10 09:05:27,027][26022] Updated weights on worker 0-0, policy_version 657563 (0.00092) [2022-07-10 09:05:28,719][25689] Fps is (10 sec: 5695.9, 60 sec: 5548.1, 300 sec: 5517.7). Total num frames: 673354752. Throughput: 0: 5836.6. Samples: 673356792. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:28,720][25689] Avg episode reward: [(0, '-6.693')] [2022-07-10 09:05:28,735][26022] Updated weights on worker 0-0, policy_version 657573 (0.00091) [2022-07-10 09:05:30,548][26022] Updated weights on worker 0-0, policy_version 657583 (0.00877) [2022-07-10 09:05:32,601][26022] Updated weights on worker 0-0, policy_version 657593 (0.00090) [2022-07-10 09:05:33,726][25689] Fps is (10 sec: 5494.4, 60 sec: 5549.4, 300 sec: 5515.1). Total num frames: 673381376. Throughput: 0: 5841.9. Samples: 673390220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:33,727][25689] Avg episode reward: [(0, '-5.602')] [2022-07-10 09:05:34,255][26022] Updated weights on worker 0-0, policy_version 657603 (0.00090) [2022-07-10 09:05:36,372][26022] Updated weights on worker 0-0, policy_version 657613 (0.00094) [2022-07-10 09:05:37,952][26022] Updated weights on worker 0-0, policy_version 657623 (0.00090) [2022-07-10 09:05:38,739][25689] Fps is (10 sec: 5416.8, 60 sec: 5498.7, 300 sec: 5511.9). Total num frames: 673409024. Throughput: 0: 5015.5. Samples: 673406856. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:38,739][25689] Avg episode reward: [(0, '-7.285')] [2022-07-10 09:05:39,987][26022] Updated weights on worker 0-0, policy_version 657633 (0.00086) [2022-07-10 09:05:41,676][26022] Updated weights on worker 0-0, policy_version 657643 (0.00085) [2022-07-10 09:05:43,653][26022] Updated weights on worker 0-0, policy_version 657653 (0.00092) [2022-07-10 09:05:43,742][25689] Fps is (10 sec: 5623.1, 60 sec: 5549.5, 300 sec: 5520.8). Total num frames: 673437696. Throughput: 0: 5842.3. Samples: 673440370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:43,743][25689] Avg episode reward: [(0, '-7.540')] [2022-07-10 09:05:45,125][26022] Updated weights on worker 0-0, policy_version 657663 (0.00088) [2022-07-10 09:05:47,355][26022] Updated weights on worker 0-0, policy_version 657673 (0.00082) [2022-07-10 09:05:48,816][25689] Fps is (10 sec: 5690.2, 60 sec: 5563.5, 300 sec: 5521.1). Total num frames: 673466368. Throughput: 0: 5796.2. Samples: 673473278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:48,817][25689] Avg episode reward: [(0, '-7.667')] [2022-07-10 09:05:48,983][26022] Updated weights on worker 0-0, policy_version 657683 (0.00092) [2022-07-10 09:05:51,084][26022] Updated weights on worker 0-0, policy_version 657693 (0.00088) [2022-07-10 09:05:52,830][26022] Updated weights on worker 0-0, policy_version 657703 (0.00089) [2022-07-10 09:05:53,889][25689] Fps is (10 sec: 5449.8, 60 sec: 5523.3, 300 sec: 5506.3). Total num frames: 673492992. Throughput: 0: 4942.3. Samples: 673489868. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:53,889][25689] Avg episode reward: [(0, '-7.947')] [2022-07-10 09:05:54,794][26022] Updated weights on worker 0-0, policy_version 657713 (0.00089) [2022-07-10 09:05:56,569][26022] Updated weights on worker 0-0, policy_version 657723 (0.00086) [2022-07-10 09:05:58,423][26022] Updated weights on worker 0-0, policy_version 657733 (0.00109) [2022-07-10 09:05:58,951][25689] Fps is (10 sec: 5456.1, 60 sec: 5553.2, 300 sec: 5523.4). Total num frames: 673521664. Throughput: 0: 5767.2. Samples: 673523426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:05:58,952][25689] Avg episode reward: [(0, '-6.818')] [2022-07-10 09:06:00,000][26022] Updated weights on worker 0-0, policy_version 657743 (0.00085) [2022-07-10 09:06:02,510][26022] Updated weights on worker 0-0, policy_version 657753 (0.00095) [2022-07-10 09:06:03,962][25689] Fps is (10 sec: 5489.3, 60 sec: 5569.8, 300 sec: 5515.7). Total num frames: 673548288. Throughput: 0: 5672.8. Samples: 673555074. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:06:03,964][25689] Avg episode reward: [(0, '-7.687')] [2022-07-10 09:06:04,116][26022] Updated weights on worker 0-0, policy_version 657763 (0.00091) [2022-07-10 09:06:06,227][26022] Updated weights on worker 0-0, policy_version 657773 (0.00086) [2022-07-10 09:06:07,778][26022] Updated weights on worker 0-0, policy_version 657783 (0.00087) [2022-07-10 09:06:09,084][25689] Fps is (10 sec: 5255.2, 60 sec: 5518.6, 300 sec: 5514.4). Total num frames: 673574912. Throughput: 0: 4867.0. Samples: 673571918. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:06:09,084][25689] Avg episode reward: [(0, '-7.910')] [2022-07-10 09:06:09,819][26022] Updated weights on worker 0-0, policy_version 657793 (0.00081) [2022-07-10 09:06:11,523][26022] Updated weights on worker 0-0, policy_version 657803 (0.00091) [2022-07-10 09:06:13,543][26022] Updated weights on worker 0-0, policy_version 657813 (0.00096) [2022-07-10 09:06:14,103][25689] Fps is (10 sec: 5452.9, 60 sec: 5519.7, 300 sec: 5524.9). Total num frames: 673603584. Throughput: 0: 5730.5. Samples: 673605704. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:06:14,103][25689] Avg episode reward: [(0, '-7.282')] [2022-07-10 09:06:15,139][26022] Updated weights on worker 0-0, policy_version 657823 (0.00087) [2022-07-10 09:06:17,063][26022] Updated weights on worker 0-0, policy_version 657833 (0.00094) [2022-07-10 09:06:19,022][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:06:19,034][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000657843_673631232.pth [2022-07-10 09:06:19,034][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000655900_671641600.pth [2022-07-10 09:06:19,044][26022] Updated weights on worker 0-0, policy_version 657843 (0.00094) [2022-07-10 09:06:19,127][25689] Fps is (10 sec: 5709.8, 60 sec: 5537.0, 300 sec: 5518.2). Total num frames: 673632256. Throughput: 0: 5745.3. Samples: 673639340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2022-07-10 09:06:19,127][25689] Avg episode reward: [(0, '-7.344')] [2022-07-10 09:06:20,814][26022] Updated weights on worker 0-0, policy_version 657853 (0.00095) [2022-07-10 09:06:22,512][26022] Updated weights on worker 0-0, policy_version 657863 (0.00087) [2022-07-10 09:06:24,151][25689] Fps is (10 sec: 5604.7, 60 sec: 5518.8, 300 sec: 5523.8). Total num frames: 673659904. Throughput: 0: 5007.5. Samples: 673656172. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:24,152][25689] Avg episode reward: [(0, '-6.923')] [2022-07-10 09:06:24,465][26022] Updated weights on worker 0-0, policy_version 657873 (0.00088) [2022-07-10 09:06:26,258][26022] Updated weights on worker 0-0, policy_version 657883 (0.00094) [2022-07-10 09:06:28,115][26022] Updated weights on worker 0-0, policy_version 657893 (0.00093) [2022-07-10 09:06:29,254][25689] Fps is (10 sec: 5460.1, 60 sec: 5497.7, 300 sec: 5519.2). Total num frames: 673687552. Throughput: 0: 5833.8. Samples: 673689588. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:29,256][25689] Avg episode reward: [(0, '-6.369')] [2022-07-10 09:06:29,891][26022] Updated weights on worker 0-0, policy_version 657903 (0.00093) [2022-07-10 09:06:31,811][26022] Updated weights on worker 0-0, policy_version 657913 (0.00096) [2022-07-10 09:06:33,559][26022] Updated weights on worker 0-0, policy_version 657923 (0.00085) [2022-07-10 09:06:34,268][25689] Fps is (10 sec: 5668.3, 60 sec: 5547.8, 300 sec: 5526.2). Total num frames: 673717248. Throughput: 0: 5819.7. Samples: 673723060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:34,269][25689] Avg episode reward: [(0, '-5.355')] [2022-07-10 09:06:35,493][26022] Updated weights on worker 0-0, policy_version 657933 (0.00096) [2022-07-10 09:06:37,232][26022] Updated weights on worker 0-0, policy_version 657943 (0.00088) [2022-07-10 09:06:39,199][26022] Updated weights on worker 0-0, policy_version 657953 (0.00086) [2022-07-10 09:06:39,294][25689] Fps is (10 sec: 5609.4, 60 sec: 5529.6, 300 sec: 5518.9). Total num frames: 673743872. Throughput: 0: 4983.3. Samples: 673739842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:39,295][25689] Avg episode reward: [(0, '-4.557')] [2022-07-10 09:06:40,867][26022] Updated weights on worker 0-0, policy_version 657963 (0.00088) [2022-07-10 09:06:42,853][26022] Updated weights on worker 0-0, policy_version 657973 (0.00423) [2022-07-10 09:06:44,311][25689] Fps is (10 sec: 5506.0, 60 sec: 5528.4, 300 sec: 5524.1). Total num frames: 673772544. Throughput: 0: 5824.0. Samples: 673773580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:44,311][25689] Avg episode reward: [(0, '-3.841')] [2022-07-10 09:06:44,447][26022] Updated weights on worker 0-0, policy_version 657983 (0.00086) [2022-07-10 09:06:46,535][26022] Updated weights on worker 0-0, policy_version 657993 (0.00082) [2022-07-10 09:06:48,359][26022] Updated weights on worker 0-0, policy_version 658003 (0.00092) [2022-07-10 09:06:49,361][25689] Fps is (10 sec: 5696.7, 60 sec: 5530.6, 300 sec: 5523.4). Total num frames: 673801216. Throughput: 0: 5837.1. Samples: 673806952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:49,361][25689] Avg episode reward: [(0, '-3.629')] [2022-07-10 09:06:50,117][26022] Updated weights on worker 0-0, policy_version 658013 (0.01157) [2022-07-10 09:06:51,896][26022] Updated weights on worker 0-0, policy_version 658023 (0.00093) [2022-07-10 09:06:53,569][26022] Updated weights on worker 0-0, policy_version 658033 (0.00090) [2022-07-10 09:06:54,402][25689] Fps is (10 sec: 5479.6, 60 sec: 5533.4, 300 sec: 5524.3). Total num frames: 673827840. Throughput: 0: 5001.0. Samples: 673823750. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:54,403][25689] Avg episode reward: [(0, '-2.372')] [2022-07-10 09:06:55,463][26022] Updated weights on worker 0-0, policy_version 658043 (0.00084) [2022-07-10 09:06:57,647][26022] Updated weights on worker 0-0, policy_version 658053 (0.00084) [2022-07-10 09:06:59,085][26022] Updated weights on worker 0-0, policy_version 658063 (0.00084) [2022-07-10 09:06:59,432][25689] Fps is (10 sec: 5693.8, 60 sec: 5570.3, 300 sec: 5534.2). Total num frames: 673858560. Throughput: 0: 5844.0. Samples: 673857526. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:06:59,433][25689] Avg episode reward: [(0, '-1.666')] [2022-07-10 09:07:01,267][26022] Updated weights on worker 0-0, policy_version 658073 (0.00088) [2022-07-10 09:07:03,138][26022] Updated weights on worker 0-0, policy_version 658083 (0.00087) [2022-07-10 09:07:04,463][25689] Fps is (10 sec: 5394.4, 60 sec: 5517.7, 300 sec: 5522.7). Total num frames: 673882112. Throughput: 0: 5715.7. Samples: 673888762. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:04,463][25689] Avg episode reward: [(0, '-4.646')] [2022-07-10 09:07:05,285][26022] Updated weights on worker 0-0, policy_version 658093 (0.00086) [2022-07-10 09:07:07,044][26022] Updated weights on worker 0-0, policy_version 658103 (0.00111) [2022-07-10 09:07:08,727][26022] Updated weights on worker 0-0, policy_version 658113 (0.00087) [2022-07-10 09:07:09,526][25689] Fps is (10 sec: 5173.9, 60 sec: 5556.9, 300 sec: 5522.2). Total num frames: 673910784. Throughput: 0: 4890.1. Samples: 673905560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:09,528][25689] Avg episode reward: [(0, '-4.502')] [2022-07-10 09:07:10,674][26022] Updated weights on worker 0-0, policy_version 658123 (0.00081) [2022-07-10 09:07:12,562][26022] Updated weights on worker 0-0, policy_version 658133 (0.00087) [2022-07-10 09:07:14,213][26022] Updated weights on worker 0-0, policy_version 658143 (0.00085) [2022-07-10 09:07:14,622][25689] Fps is (10 sec: 5745.5, 60 sec: 5566.8, 300 sec: 5534.3). Total num frames: 673940480. Throughput: 0: 5698.9. Samples: 673938980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:14,623][25689] Avg episode reward: [(0, '-3.745')] [2022-07-10 09:07:16,274][26022] Updated weights on worker 0-0, policy_version 658153 (0.00091) [2022-07-10 09:07:17,771][26022] Updated weights on worker 0-0, policy_version 658163 (0.00084) [2022-07-10 09:07:19,651][25689] Fps is (10 sec: 5461.4, 60 sec: 5515.6, 300 sec: 5520.1). Total num frames: 673966080. Throughput: 0: 5675.0. Samples: 673972266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:19,652][25689] Avg episode reward: [(0, '-4.643')] [2022-07-10 09:07:19,891][26022] Updated weights on worker 0-0, policy_version 658173 (0.00097) [2022-07-10 09:07:21,671][26022] Updated weights on worker 0-0, policy_version 658183 (0.00095) [2022-07-10 09:07:23,615][26022] Updated weights on worker 0-0, policy_version 658193 (0.00083) [2022-07-10 09:07:24,731][25689] Fps is (10 sec: 5469.9, 60 sec: 5544.3, 300 sec: 5530.4). Total num frames: 673995776. Throughput: 0: 4939.0. Samples: 673988866. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:24,733][25689] Avg episode reward: [(0, '-5.894')] [2022-07-10 09:07:25,412][26022] Updated weights on worker 0-0, policy_version 658203 (0.00090) [2022-07-10 09:07:27,381][26022] Updated weights on worker 0-0, policy_version 658213 (0.00091) [2022-07-10 09:07:29,308][26022] Updated weights on worker 0-0, policy_version 658223 (0.00087) [2022-07-10 09:07:29,813][25689] Fps is (10 sec: 5642.5, 60 sec: 5546.2, 300 sec: 5525.7). Total num frames: 674023424. Throughput: 0: 5745.3. Samples: 674022118. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:29,815][25689] Avg episode reward: [(0, '-5.518')] [2022-07-10 09:07:31,003][26022] Updated weights on worker 0-0, policy_version 658233 (0.00085) [2022-07-10 09:07:32,825][26022] Updated weights on worker 0-0, policy_version 658243 (0.00091) [2022-07-10 09:07:34,578][26022] Updated weights on worker 0-0, policy_version 658253 (0.00085) [2022-07-10 09:07:34,846][25689] Fps is (10 sec: 5669.1, 60 sec: 5544.4, 300 sec: 5533.1). Total num frames: 674053120. Throughput: 0: 5764.6. Samples: 674055564. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:34,847][25689] Avg episode reward: [(0, '-4.066')] [2022-07-10 09:07:36,624][26022] Updated weights on worker 0-0, policy_version 658263 (0.00082) [2022-07-10 09:07:38,263][26022] Updated weights on worker 0-0, policy_version 658273 (0.00087) [2022-07-10 09:07:39,856][25689] Fps is (10 sec: 5608.1, 60 sec: 5546.0, 300 sec: 5526.0). Total num frames: 674079744. Throughput: 0: 5772.7. Samples: 674088906. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:39,857][25689] Avg episode reward: [(0, '-3.427')] [2022-07-10 09:07:40,129][26022] Updated weights on worker 0-0, policy_version 658283 (0.00082) [2022-07-10 09:07:41,994][26022] Updated weights on worker 0-0, policy_version 658293 (0.00091) [2022-07-10 09:07:43,903][26022] Updated weights on worker 0-0, policy_version 658303 (0.00087) [2022-07-10 09:07:44,881][25689] Fps is (10 sec: 5408.2, 60 sec: 5528.2, 300 sec: 5530.6). Total num frames: 674107392. Throughput: 0: 5799.8. Samples: 674105734. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:44,883][25689] Avg episode reward: [(0, '-4.173')] [2022-07-10 09:07:45,689][26022] Updated weights on worker 0-0, policy_version 658313 (0.00092) [2022-07-10 09:07:47,780][26022] Updated weights on worker 0-0, policy_version 658323 (0.00092) [2022-07-10 09:07:49,386][26022] Updated weights on worker 0-0, policy_version 658333 (0.00095) [2022-07-10 09:07:49,992][25689] Fps is (10 sec: 5455.1, 60 sec: 5505.8, 300 sec: 5525.2). Total num frames: 674135040. Throughput: 0: 5778.7. Samples: 674138728. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:49,993][25689] Avg episode reward: [(0, '-5.523')] [2022-07-10 09:07:51,609][26022] Updated weights on worker 0-0, policy_version 658343 (0.00080) [2022-07-10 09:07:53,122][26022] Updated weights on worker 0-0, policy_version 658353 (0.00089) [2022-07-10 09:07:55,032][25689] Fps is (10 sec: 5346.3, 60 sec: 5505.9, 300 sec: 5524.5). Total num frames: 674161664. Throughput: 0: 5756.8. Samples: 674171774. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:07:55,033][25689] Avg episode reward: [(0, '-5.028')] [2022-07-10 09:07:55,234][26022] Updated weights on worker 0-0, policy_version 658363 (0.00086) [2022-07-10 09:07:56,869][26022] Updated weights on worker 0-0, policy_version 658373 (0.00088) [2022-07-10 09:07:58,856][26022] Updated weights on worker 0-0, policy_version 658383 (0.00095) [2022-07-10 09:08:00,101][25689] Fps is (10 sec: 5571.4, 60 sec: 5485.5, 300 sec: 5537.6). Total num frames: 674191360. Throughput: 0: 4903.8. Samples: 674188186. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:00,102][25689] Avg episode reward: [(0, '-5.227')] [2022-07-10 09:08:00,504][26022] Updated weights on worker 0-0, policy_version 658393 (0.00082) [2022-07-10 09:08:02,929][26022] Updated weights on worker 0-0, policy_version 658403 (0.00078) [2022-07-10 09:08:04,547][26022] Updated weights on worker 0-0, policy_version 658413 (0.00086) [2022-07-10 09:08:05,136][25689] Fps is (10 sec: 5472.9, 60 sec: 5518.9, 300 sec: 5527.8). Total num frames: 674216960. Throughput: 0: 5620.4. Samples: 674219574. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:05,137][25689] Avg episode reward: [(0, '-4.647')] [2022-07-10 09:08:06,590][26022] Updated weights on worker 0-0, policy_version 658423 (0.00085) [2022-07-10 09:08:08,475][26022] Updated weights on worker 0-0, policy_version 658433 (0.00095) [2022-07-10 09:08:10,221][26022] Updated weights on worker 0-0, policy_version 658443 (0.00083) [2022-07-10 09:08:10,229][25689] Fps is (10 sec: 5257.5, 60 sec: 5499.2, 300 sec: 5527.0). Total num frames: 674244608. Throughput: 0: 5643.2. Samples: 674252928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:10,230][25689] Avg episode reward: [(0, '-5.262')] [2022-07-10 09:08:12,016][26022] Updated weights on worker 0-0, policy_version 658453 (0.00093) [2022-07-10 09:08:13,964][26022] Updated weights on worker 0-0, policy_version 658463 (0.00091) [2022-07-10 09:08:15,247][25689] Fps is (10 sec: 5570.5, 60 sec: 5489.5, 300 sec: 5530.3). Total num frames: 674273280. Throughput: 0: 4845.3. Samples: 674269718. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:15,247][25689] Avg episode reward: [(0, '-6.408')] [2022-07-10 09:08:15,724][26022] Updated weights on worker 0-0, policy_version 658473 (0.00091) [2022-07-10 09:08:17,657][26022] Updated weights on worker 0-0, policy_version 658483 (0.00057) [2022-07-10 09:08:19,274][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:08:19,289][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000658492_674295808.pth [2022-07-10 09:08:19,289][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000656547_672304128.pth [2022-07-10 09:08:19,382][26022] Updated weights on worker 0-0, policy_version 658493 (0.00092) [2022-07-10 09:08:20,339][25689] Fps is (10 sec: 5571.1, 60 sec: 5517.5, 300 sec: 5528.6). Total num frames: 674300928. Throughput: 0: 5681.4. Samples: 674303162. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:20,339][25689] Avg episode reward: [(0, '-4.810')] [2022-07-10 09:08:21,342][26022] Updated weights on worker 0-0, policy_version 658503 (0.00087) [2022-07-10 09:08:23,240][26022] Updated weights on worker 0-0, policy_version 658513 (0.00091) [2022-07-10 09:08:24,924][26022] Updated weights on worker 0-0, policy_version 658523 (0.00088) [2022-07-10 09:08:25,375][25689] Fps is (10 sec: 5560.7, 60 sec: 5504.7, 300 sec: 5529.2). Total num frames: 674329600. Throughput: 0: 5778.7. Samples: 674336526. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:25,375][25689] Avg episode reward: [(0, '-3.491')] [2022-07-10 09:08:26,906][26022] Updated weights on worker 0-0, policy_version 658533 (0.00111) [2022-07-10 09:08:28,523][26022] Updated weights on worker 0-0, policy_version 658543 (0.00087) [2022-07-10 09:08:30,452][25689] Fps is (10 sec: 5569.2, 60 sec: 5505.2, 300 sec: 5524.4). Total num frames: 674357248. Throughput: 0: 4944.6. Samples: 674352918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:30,452][25689] Avg episode reward: [(0, '-3.876')] [2022-07-10 09:08:30,566][26022] Updated weights on worker 0-0, policy_version 658553 (0.00093) [2022-07-10 09:08:32,544][26022] Updated weights on worker 0-0, policy_version 658563 (0.00086) [2022-07-10 09:08:34,317][26022] Updated weights on worker 0-0, policy_version 658573 (0.00616) [2022-07-10 09:08:35,460][25689] Fps is (10 sec: 5482.9, 60 sec: 5473.6, 300 sec: 5524.6). Total num frames: 674384896. Throughput: 0: 5754.5. Samples: 674386036. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:35,461][25689] Avg episode reward: [(0, '-5.351')] [2022-07-10 09:08:36,191][26022] Updated weights on worker 0-0, policy_version 658583 (0.00082) [2022-07-10 09:08:38,093][26022] Updated weights on worker 0-0, policy_version 658593 (0.00091) [2022-07-10 09:08:39,669][26022] Updated weights on worker 0-0, policy_version 658603 (0.00087) [2022-07-10 09:08:40,492][25689] Fps is (10 sec: 5507.2, 60 sec: 5488.5, 300 sec: 5520.9). Total num frames: 674412544. Throughput: 0: 5779.1. Samples: 674419630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:40,494][25689] Avg episode reward: [(0, '-4.914')] [2022-07-10 09:08:41,789][26022] Updated weights on worker 0-0, policy_version 658613 (0.00084) [2022-07-10 09:08:43,307][26022] Updated weights on worker 0-0, policy_version 658623 (0.00085) [2022-07-10 09:08:45,419][26022] Updated weights on worker 0-0, policy_version 658633 (0.00099) [2022-07-10 09:08:45,511][25689] Fps is (10 sec: 5501.7, 60 sec: 5489.1, 300 sec: 5518.0). Total num frames: 674440192. Throughput: 0: 4965.2. Samples: 674436504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:45,511][25689] Avg episode reward: [(0, '-3.870')] [2022-07-10 09:08:47,089][26022] Updated weights on worker 0-0, policy_version 658643 (0.00090) [2022-07-10 09:08:48,960][26022] Updated weights on worker 0-0, policy_version 658653 (0.00090) [2022-07-10 09:08:50,583][25689] Fps is (10 sec: 5581.5, 60 sec: 5509.5, 300 sec: 5523.7). Total num frames: 674468864. Throughput: 0: 5817.7. Samples: 674470036. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:50,583][25689] Avg episode reward: [(0, '-4.409')] [2022-07-10 09:08:50,944][26022] Updated weights on worker 0-0, policy_version 658663 (0.00095) [2022-07-10 09:08:52,907][26022] Updated weights on worker 0-0, policy_version 658673 (0.00086) [2022-07-10 09:08:54,608][26022] Updated weights on worker 0-0, policy_version 658683 (0.00083) [2022-07-10 09:08:55,615][25689] Fps is (10 sec: 5574.0, 60 sec: 5527.2, 300 sec: 5523.5). Total num frames: 674496512. Throughput: 0: 5780.2. Samples: 674502534. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:08:55,615][25689] Avg episode reward: [(0, '-5.261')] [2022-07-10 09:08:56,675][26022] Updated weights on worker 0-0, policy_version 658693 (0.00529) [2022-07-10 09:08:58,172][26022] Updated weights on worker 0-0, policy_version 658703 (0.00088) [2022-07-10 09:09:00,253][26022] Updated weights on worker 0-0, policy_version 658713 (0.00085) [2022-07-10 09:09:00,617][25689] Fps is (10 sec: 5714.9, 60 sec: 5533.3, 300 sec: 5534.4). Total num frames: 674526208. Throughput: 0: 4962.6. Samples: 674519502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:00,618][25689] Avg episode reward: [(0, '-5.992')] [2022-07-10 09:09:02,083][26022] Updated weights on worker 0-0, policy_version 658723 (0.00104) [2022-07-10 09:09:04,214][26022] Updated weights on worker 0-0, policy_version 658733 (0.00089) [2022-07-10 09:09:05,643][25689] Fps is (10 sec: 5412.0, 60 sec: 5517.2, 300 sec: 5529.1). Total num frames: 674550784. Throughput: 0: 5682.4. Samples: 674550902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:05,643][25689] Avg episode reward: [(0, '-5.073')] [2022-07-10 09:09:06,081][26022] Updated weights on worker 0-0, policy_version 658743 (0.00087) [2022-07-10 09:09:07,844][26022] Updated weights on worker 0-0, policy_version 658753 (0.00085) [2022-07-10 09:09:09,802][26022] Updated weights on worker 0-0, policy_version 658763 (0.00085) [2022-07-10 09:09:10,682][25689] Fps is (10 sec: 5086.9, 60 sec: 5505.1, 300 sec: 5518.9). Total num frames: 674577408. Throughput: 0: 5692.7. Samples: 674584454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:10,682][25689] Avg episode reward: [(0, '-7.910')] [2022-07-10 09:09:11,471][26022] Updated weights on worker 0-0, policy_version 658773 (0.00090) [2022-07-10 09:09:13,438][26022] Updated weights on worker 0-0, policy_version 658783 (0.00085) [2022-07-10 09:09:15,092][26022] Updated weights on worker 0-0, policy_version 658793 (0.00094) [2022-07-10 09:09:15,690][25689] Fps is (10 sec: 5605.6, 60 sec: 5522.9, 300 sec: 5526.0). Total num frames: 674607104. Throughput: 0: 4916.8. Samples: 674601240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:15,690][25689] Avg episode reward: [(0, '-8.465')] [2022-07-10 09:09:17,082][26022] Updated weights on worker 0-0, policy_version 658803 (0.00087) [2022-07-10 09:09:18,985][26022] Updated weights on worker 0-0, policy_version 658813 (0.00090) [2022-07-10 09:09:20,560][26022] Updated weights on worker 0-0, policy_version 658823 (0.00093) [2022-07-10 09:09:20,718][25689] Fps is (10 sec: 5714.0, 60 sec: 5528.8, 300 sec: 5525.9). Total num frames: 674634752. Throughput: 0: 5723.3. Samples: 674634544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:20,718][25689] Avg episode reward: [(0, '-7.192')] [2022-07-10 09:09:22,868][26022] Updated weights on worker 0-0, policy_version 658833 (0.00090) [2022-07-10 09:09:24,370][26022] Updated weights on worker 0-0, policy_version 658843 (0.00092) [2022-07-10 09:09:25,731][25689] Fps is (10 sec: 5303.1, 60 sec: 5480.0, 300 sec: 5516.6). Total num frames: 674660352. Throughput: 0: 5818.8. Samples: 674667790. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:25,731][25689] Avg episode reward: [(0, '-6.121')] [2022-07-10 09:09:26,383][26022] Updated weights on worker 0-0, policy_version 658853 (0.00077) [2022-07-10 09:09:28,048][26022] Updated weights on worker 0-0, policy_version 658863 (0.00089) [2022-07-10 09:09:29,872][26022] Updated weights on worker 0-0, policy_version 658873 (0.00091) [2022-07-10 09:09:30,834][25689] Fps is (10 sec: 5567.4, 60 sec: 5528.5, 300 sec: 5528.9). Total num frames: 674691072. Throughput: 0: 4972.2. Samples: 674684652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:30,834][25689] Avg episode reward: [(0, '-6.744')] [2022-07-10 09:09:31,717][26022] Updated weights on worker 0-0, policy_version 658883 (0.00089) [2022-07-10 09:09:33,582][26022] Updated weights on worker 0-0, policy_version 658893 (0.00091) [2022-07-10 09:09:35,429][26022] Updated weights on worker 0-0, policy_version 658903 (0.00084) [2022-07-10 09:09:35,851][25689] Fps is (10 sec: 5666.4, 60 sec: 5510.8, 300 sec: 5515.1). Total num frames: 674717696. Throughput: 0: 5795.4. Samples: 674718080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:09:35,851][25689] Avg episode reward: [(0, '-6.496')] [2022-07-10 09:09:37,374][26022] Updated weights on worker 0-0, policy_version 658913 (0.00080) [2022-07-10 09:09:39,183][26022] Updated weights on worker 0-0, policy_version 658923 (0.00092) [2022-07-10 09:09:40,874][25689] Fps is (10 sec: 5507.2, 60 sec: 5528.5, 300 sec: 5525.0). Total num frames: 674746368. Throughput: 0: 5784.3. Samples: 674751136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:09:40,875][25689] Avg episode reward: [(0, '-4.682')] [2022-07-10 09:09:41,209][26022] Updated weights on worker 0-0, policy_version 658933 (0.00088) [2022-07-10 09:09:42,994][26022] Updated weights on worker 0-0, policy_version 658943 (0.00375) [2022-07-10 09:09:44,940][26022] Updated weights on worker 0-0, policy_version 658953 (0.00091) [2022-07-10 09:09:45,876][25689] Fps is (10 sec: 5515.6, 60 sec: 5513.1, 300 sec: 5522.3). Total num frames: 674772992. Throughput: 0: 4962.5. Samples: 674767762. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:09:45,877][25689] Avg episode reward: [(0, '-6.204')] [2022-07-10 09:09:46,589][26022] Updated weights on worker 0-0, policy_version 658963 (0.00085) [2022-07-10 09:09:48,611][26022] Updated weights on worker 0-0, policy_version 658973 (0.00085) [2022-07-10 09:09:50,368][26022] Updated weights on worker 0-0, policy_version 658983 (0.00094) [2022-07-10 09:09:50,976][25689] Fps is (10 sec: 5474.0, 60 sec: 5510.6, 300 sec: 5520.5). Total num frames: 674801664. Throughput: 0: 5765.0. Samples: 674800772. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:09:50,976][25689] Avg episode reward: [(0, '-5.968')] [2022-07-10 09:09:52,507][26022] Updated weights on worker 0-0, policy_version 658993 (0.00087) [2022-07-10 09:09:54,250][26022] Updated weights on worker 0-0, policy_version 659003 (0.00089) [2022-07-10 09:09:56,023][25689] Fps is (10 sec: 5449.5, 60 sec: 5492.2, 300 sec: 5520.0). Total num frames: 674828288. Throughput: 0: 5720.9. Samples: 674833484. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:09:56,023][25689] Avg episode reward: [(0, '-5.976')] [2022-07-10 09:09:56,061][26022] Updated weights on worker 0-0, policy_version 659013 (0.00092) [2022-07-10 09:09:57,910][26022] Updated weights on worker 0-0, policy_version 659023 (0.00089) [2022-07-10 09:09:59,568][26022] Updated weights on worker 0-0, policy_version 659033 (0.00084) [2022-07-10 09:10:01,101][25689] Fps is (10 sec: 5562.3, 60 sec: 5485.3, 300 sec: 5532.4). Total num frames: 674857984. Throughput: 0: 5733.0. Samples: 674867096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:01,102][25689] Avg episode reward: [(0, '-6.172')] [2022-07-10 09:10:02,126][26022] Updated weights on worker 0-0, policy_version 659043 (0.00090) [2022-07-10 09:10:03,708][26022] Updated weights on worker 0-0, policy_version 659053 (0.00086) [2022-07-10 09:10:05,714][26022] Updated weights on worker 0-0, policy_version 659063 (0.00084) [2022-07-10 09:10:06,159][25689] Fps is (10 sec: 5455.4, 60 sec: 5499.3, 300 sec: 5519.8). Total num frames: 674883584. Throughput: 0: 5608.6. Samples: 674881520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:06,160][25689] Avg episode reward: [(0, '-5.661')] [2022-07-10 09:10:07,274][26022] Updated weights on worker 0-0, policy_version 659073 (0.00088) [2022-07-10 09:10:09,360][26022] Updated weights on worker 0-0, policy_version 659083 (0.00085) [2022-07-10 09:10:11,075][26022] Updated weights on worker 0-0, policy_version 659093 (0.00260) [2022-07-10 09:10:11,276][25689] Fps is (10 sec: 5233.1, 60 sec: 5509.2, 300 sec: 5514.7). Total num frames: 674911232. Throughput: 0: 5620.2. Samples: 674914866. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:11,277][25689] Avg episode reward: [(0, '-4.482')] [2022-07-10 09:10:13,043][26022] Updated weights on worker 0-0, policy_version 659103 (0.00080) [2022-07-10 09:10:14,857][26022] Updated weights on worker 0-0, policy_version 659113 (0.00099) [2022-07-10 09:10:16,354][25689] Fps is (10 sec: 5624.5, 60 sec: 5502.8, 300 sec: 5520.7). Total num frames: 674940928. Throughput: 0: 5661.0. Samples: 674948580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:16,355][25689] Avg episode reward: [(0, '-2.730')] [2022-07-10 09:10:16,555][26022] Updated weights on worker 0-0, policy_version 659123 (0.00091) [2022-07-10 09:10:18,507][26022] Updated weights on worker 0-0, policy_version 659133 (0.00087) [2022-07-10 09:10:19,396][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:10:19,417][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000659139_674958336.pth [2022-07-10 09:10:19,417][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000657194_672966656.pth [2022-07-10 09:10:20,217][26022] Updated weights on worker 0-0, policy_version 659143 (0.00087) [2022-07-10 09:10:21,384][25689] Fps is (10 sec: 5571.8, 60 sec: 5485.7, 300 sec: 5513.4). Total num frames: 674967552. Throughput: 0: 4848.3. Samples: 674965436. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:21,385][25689] Avg episode reward: [(0, '-4.053')] [2022-07-10 09:10:22,087][26022] Updated weights on worker 0-0, policy_version 659153 (0.00091) [2022-07-10 09:10:24,040][26022] Updated weights on worker 0-0, policy_version 659163 (0.00093) [2022-07-10 09:10:25,796][26022] Updated weights on worker 0-0, policy_version 659173 (0.00091) [2022-07-10 09:10:26,404][25689] Fps is (10 sec: 5400.2, 60 sec: 5518.8, 300 sec: 5510.7). Total num frames: 674995200. Throughput: 0: 5790.8. Samples: 674998756. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:26,405][25689] Avg episode reward: [(0, '-2.698')] [2022-07-10 09:10:27,699][26022] Updated weights on worker 0-0, policy_version 659183 (0.00088) [2022-07-10 09:10:29,615][26022] Updated weights on worker 0-0, policy_version 659193 (0.00081) [2022-07-10 09:10:31,217][26022] Updated weights on worker 0-0, policy_version 659203 (0.00095) [2022-07-10 09:10:31,465][25689] Fps is (10 sec: 5688.8, 60 sec: 5505.8, 300 sec: 5520.0). Total num frames: 675024896. Throughput: 0: 5816.2. Samples: 675032284. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:31,465][25689] Avg episode reward: [(0, '-3.893')] [2022-07-10 09:10:33,287][26022] Updated weights on worker 0-0, policy_version 659213 (0.00088) [2022-07-10 09:10:34,764][26022] Updated weights on worker 0-0, policy_version 659223 (0.00088) [2022-07-10 09:10:36,515][25689] Fps is (10 sec: 5671.7, 60 sec: 5519.7, 300 sec: 5519.3). Total num frames: 675052544. Throughput: 0: 4982.9. Samples: 675049038. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:36,515][25689] Avg episode reward: [(0, '-2.623')] [2022-07-10 09:10:37,028][26022] Updated weights on worker 0-0, policy_version 659233 (0.00091) [2022-07-10 09:10:38,576][26022] Updated weights on worker 0-0, policy_version 659243 (0.00094) [2022-07-10 09:10:40,635][26022] Updated weights on worker 0-0, policy_version 659253 (0.00093) [2022-07-10 09:10:41,610][25689] Fps is (10 sec: 5551.6, 60 sec: 5513.2, 300 sec: 5517.6). Total num frames: 675081216. Throughput: 0: 5791.6. Samples: 675082572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:41,610][25689] Avg episode reward: [(0, '-4.184')] [2022-07-10 09:10:42,235][26022] Updated weights on worker 0-0, policy_version 659263 (0.00083) [2022-07-10 09:10:44,176][26022] Updated weights on worker 0-0, policy_version 659273 (0.00079) [2022-07-10 09:10:45,956][26022] Updated weights on worker 0-0, policy_version 659283 (0.00091) [2022-07-10 09:10:46,618][25689] Fps is (10 sec: 5473.3, 60 sec: 5512.7, 300 sec: 5512.0). Total num frames: 675107840. Throughput: 0: 5799.8. Samples: 675115990. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:46,618][25689] Avg episode reward: [(0, '-5.887')] [2022-07-10 09:10:47,842][26022] Updated weights on worker 0-0, policy_version 659293 (0.00085) [2022-07-10 09:10:49,754][26022] Updated weights on worker 0-0, policy_version 659303 (0.00085) [2022-07-10 09:10:51,687][25689] Fps is (10 sec: 5385.4, 60 sec: 5498.6, 300 sec: 5515.5). Total num frames: 675135488. Throughput: 0: 4967.8. Samples: 675132746. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:51,688][25689] Avg episode reward: [(0, '-4.578')] [2022-07-10 09:10:51,698][26022] Updated weights on worker 0-0, policy_version 659313 (0.00087) [2022-07-10 09:10:53,120][26022] Updated weights on worker 0-0, policy_version 659323 (0.00092) [2022-07-10 09:10:55,564][26022] Updated weights on worker 0-0, policy_version 659333 (0.00086) [2022-07-10 09:10:56,723][25689] Fps is (10 sec: 5776.2, 60 sec: 5567.1, 300 sec: 5522.8). Total num frames: 675166208. Throughput: 0: 5799.2. Samples: 675166228. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:10:56,723][25689] Avg episode reward: [(0, '-4.638')] [2022-07-10 09:10:56,759][26022] Updated weights on worker 0-0, policy_version 659343 (0.00088) [2022-07-10 09:10:59,038][26022] Updated weights on worker 0-0, policy_version 659353 (0.00109) [2022-07-10 09:11:00,568][26022] Updated weights on worker 0-0, policy_version 659363 (0.00092) [2022-07-10 09:11:01,758][25689] Fps is (10 sec: 5693.9, 60 sec: 5520.4, 300 sec: 5522.4). Total num frames: 675192832. Throughput: 0: 5829.4. Samples: 675200026. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:01,759][25689] Avg episode reward: [(0, '-5.179')] [2022-07-10 09:11:03,001][26022] Updated weights on worker 0-0, policy_version 659373 (0.00089) [2022-07-10 09:11:04,710][26022] Updated weights on worker 0-0, policy_version 659383 (0.00095) [2022-07-10 09:11:06,453][26022] Updated weights on worker 0-0, policy_version 659393 (0.00086) [2022-07-10 09:11:06,765][25689] Fps is (10 sec: 5200.2, 60 sec: 5525.0, 300 sec: 5521.1). Total num frames: 675218432. Throughput: 0: 4902.5. Samples: 675214764. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:06,766][25689] Avg episode reward: [(0, '-5.736')] [2022-07-10 09:11:08,317][26022] Updated weights on worker 0-0, policy_version 659403 (0.00097) [2022-07-10 09:11:10,392][26022] Updated weights on worker 0-0, policy_version 659413 (0.00104) [2022-07-10 09:11:11,851][25689] Fps is (10 sec: 5377.3, 60 sec: 5544.8, 300 sec: 5519.8). Total num frames: 675247104. Throughput: 0: 5734.4. Samples: 675248374. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:11,851][25689] Avg episode reward: [(0, '-4.210')] [2022-07-10 09:11:11,970][26022] Updated weights on worker 0-0, policy_version 659423 (0.00088) [2022-07-10 09:11:14,000][26022] Updated weights on worker 0-0, policy_version 659433 (0.00082) [2022-07-10 09:11:15,486][26022] Updated weights on worker 0-0, policy_version 659443 (0.00086) [2022-07-10 09:11:16,852][25689] Fps is (10 sec: 5685.0, 60 sec: 5534.9, 300 sec: 5520.3). Total num frames: 675275776. Throughput: 0: 5761.0. Samples: 675282194. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:16,853][25689] Avg episode reward: [(0, '-5.661')] [2022-07-10 09:11:17,559][26022] Updated weights on worker 0-0, policy_version 659453 (0.00092) [2022-07-10 09:11:19,296][26022] Updated weights on worker 0-0, policy_version 659463 (0.00079) [2022-07-10 09:11:21,200][26022] Updated weights on worker 0-0, policy_version 659473 (0.00097) [2022-07-10 09:11:21,863][25689] Fps is (10 sec: 5625.5, 60 sec: 5553.6, 300 sec: 5520.5). Total num frames: 675303424. Throughput: 0: 4924.6. Samples: 675299032. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:21,863][25689] Avg episode reward: [(0, '-3.867')] [2022-07-10 09:11:22,959][26022] Updated weights on worker 0-0, policy_version 659483 (0.00084) [2022-07-10 09:11:25,007][26022] Updated weights on worker 0-0, policy_version 659493 (0.00084) [2022-07-10 09:11:26,533][26022] Updated weights on worker 0-0, policy_version 659503 (0.00089) [2022-07-10 09:11:26,880][25689] Fps is (10 sec: 5718.7, 60 sec: 5587.7, 300 sec: 5529.0). Total num frames: 675333120. Throughput: 0: 5846.6. Samples: 675332364. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:26,880][25689] Avg episode reward: [(0, '-6.134')] [2022-07-10 09:11:28,803][26022] Updated weights on worker 0-0, policy_version 659513 (0.00095) [2022-07-10 09:11:30,099][26022] Updated weights on worker 0-0, policy_version 659523 (0.00090) [2022-07-10 09:11:31,951][25689] Fps is (10 sec: 5481.3, 60 sec: 5519.0, 300 sec: 5514.2). Total num frames: 675358720. Throughput: 0: 5836.3. Samples: 675365682. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:31,951][25689] Avg episode reward: [(0, '-5.903')] [2022-07-10 09:11:32,408][26022] Updated weights on worker 0-0, policy_version 659533 (0.00092) [2022-07-10 09:11:33,992][26022] Updated weights on worker 0-0, policy_version 659543 (0.00085) [2022-07-10 09:11:35,932][26022] Updated weights on worker 0-0, policy_version 659553 (0.00087) [2022-07-10 09:11:36,972][25689] Fps is (10 sec: 5377.4, 60 sec: 5538.6, 300 sec: 5521.1). Total num frames: 675387392. Throughput: 0: 4975.5. Samples: 675382300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:36,973][25689] Avg episode reward: [(0, '-5.767')] [2022-07-10 09:11:37,813][26022] Updated weights on worker 0-0, policy_version 659563 (0.00085) [2022-07-10 09:11:39,617][26022] Updated weights on worker 0-0, policy_version 659573 (0.00086) [2022-07-10 09:11:41,329][26022] Updated weights on worker 0-0, policy_version 659583 (0.00088) [2022-07-10 09:11:42,004][25689] Fps is (10 sec: 5704.1, 60 sec: 5544.4, 300 sec: 5520.9). Total num frames: 675416064. Throughput: 0: 5790.8. Samples: 675415666. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:42,007][25689] Avg episode reward: [(0, '-4.301')] [2022-07-10 09:11:43,523][26022] Updated weights on worker 0-0, policy_version 659593 (0.00084) [2022-07-10 09:11:44,946][26022] Updated weights on worker 0-0, policy_version 659603 (0.00086) [2022-07-10 09:11:47,025][25689] Fps is (10 sec: 5500.6, 60 sec: 5543.2, 300 sec: 5514.5). Total num frames: 675442688. Throughput: 0: 5817.2. Samples: 675449552. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:47,025][25689] Avg episode reward: [(0, '-3.267')] [2022-07-10 09:11:47,097][26022] Updated weights on worker 0-0, policy_version 659613 (0.00087) [2022-07-10 09:11:48,520][26022] Updated weights on worker 0-0, policy_version 659623 (0.00083) [2022-07-10 09:11:50,598][26022] Updated weights on worker 0-0, policy_version 659633 (0.00093) [2022-07-10 09:11:52,083][25689] Fps is (10 sec: 5588.0, 60 sec: 5578.2, 300 sec: 5524.5). Total num frames: 675472384. Throughput: 0: 5000.1. Samples: 675466344. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:52,083][25689] Avg episode reward: [(0, '-2.585')] [2022-07-10 09:11:52,384][26022] Updated weights on worker 0-0, policy_version 659643 (0.00093) [2022-07-10 09:11:54,166][26022] Updated weights on worker 0-0, policy_version 659653 (0.00091) [2022-07-10 09:11:55,904][26022] Updated weights on worker 0-0, policy_version 659663 (0.00091) [2022-07-10 09:11:57,160][25689] Fps is (10 sec: 5657.8, 60 sec: 5523.4, 300 sec: 5513.3). Total num frames: 675500032. Throughput: 0: 5832.6. Samples: 675500050. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:11:57,161][25689] Avg episode reward: [(0, '-2.784')] [2022-07-10 09:11:58,303][26022] Updated weights on worker 0-0, policy_version 659673 (0.00084) [2022-07-10 09:11:59,495][26022] Updated weights on worker 0-0, policy_version 659683 (0.00089) [2022-07-10 09:12:01,715][26022] Updated weights on worker 0-0, policy_version 659693 (0.00095) [2022-07-10 09:12:02,174][25689] Fps is (10 sec: 5378.0, 60 sec: 5525.4, 300 sec: 5524.0). Total num frames: 675526656. Throughput: 0: 5834.3. Samples: 675533346. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:02,176][25689] Avg episode reward: [(0, '-4.180')] [2022-07-10 09:12:03,715][26022] Updated weights on worker 0-0, policy_version 659703 (0.00094) [2022-07-10 09:12:05,593][26022] Updated weights on worker 0-0, policy_version 659713 (0.00096) [2022-07-10 09:12:07,251][25689] Fps is (10 sec: 5378.7, 60 sec: 5553.0, 300 sec: 5520.3). Total num frames: 675554304. Throughput: 0: 4864.9. Samples: 675547948. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:07,253][25689] Avg episode reward: [(0, '-4.873')] [2022-07-10 09:12:07,569][26022] Updated weights on worker 0-0, policy_version 659723 (0.00088) [2022-07-10 09:12:09,151][26022] Updated weights on worker 0-0, policy_version 659733 (0.00088) [2022-07-10 09:12:11,299][26022] Updated weights on worker 0-0, policy_version 659743 (0.00088) [2022-07-10 09:12:12,340][25689] Fps is (10 sec: 5640.6, 60 sec: 5569.5, 300 sec: 5520.4). Total num frames: 675584000. Throughput: 0: 5694.3. Samples: 675581698. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:12,341][25689] Avg episode reward: [(0, '-6.176')] [2022-07-10 09:12:13,082][26022] Updated weights on worker 0-0, policy_version 659753 (0.00084) [2022-07-10 09:12:14,662][26022] Updated weights on worker 0-0, policy_version 659763 (0.00084) [2022-07-10 09:12:16,787][26022] Updated weights on worker 0-0, policy_version 659773 (0.00084) [2022-07-10 09:12:17,410][25689] Fps is (10 sec: 5543.4, 60 sec: 5529.4, 300 sec: 5523.1). Total num frames: 675610624. Throughput: 0: 5692.4. Samples: 675615322. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:17,411][25689] Avg episode reward: [(0, '-7.541')] [2022-07-10 09:12:18,292][26022] Updated weights on worker 0-0, policy_version 659783 (0.00099) [2022-07-10 09:12:19,457][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:12:19,470][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000659789_675623936.pth [2022-07-10 09:12:19,470][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000657843_673631232.pth [2022-07-10 09:12:20,317][26022] Updated weights on worker 0-0, policy_version 659793 (0.00095) [2022-07-10 09:12:22,113][26022] Updated weights on worker 0-0, policy_version 659803 (0.00091) [2022-07-10 09:12:22,464][25689] Fps is (10 sec: 5461.9, 60 sec: 5542.3, 300 sec: 5520.1). Total num frames: 675639296. Throughput: 0: 5709.0. Samples: 675649182. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:22,465][25689] Avg episode reward: [(0, '-7.875')] [2022-07-10 09:12:23,758][26022] Updated weights on worker 0-0, policy_version 659813 (0.00090) [2022-07-10 09:12:25,872][26022] Updated weights on worker 0-0, policy_version 659823 (0.00090) [2022-07-10 09:12:27,551][25689] Fps is (10 sec: 5654.9, 60 sec: 5519.0, 300 sec: 5523.5). Total num frames: 675667968. Throughput: 0: 5821.4. Samples: 675666126. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:27,553][25689] Avg episode reward: [(0, '-7.122')] [2022-07-10 09:12:27,637][26022] Updated weights on worker 0-0, policy_version 659833 (0.00090) [2022-07-10 09:12:29,568][26022] Updated weights on worker 0-0, policy_version 659843 (0.00093) [2022-07-10 09:12:31,109][26022] Updated weights on worker 0-0, policy_version 659853 (0.00084) [2022-07-10 09:12:32,609][25689] Fps is (10 sec: 5551.6, 60 sec: 5554.0, 300 sec: 5516.1). Total num frames: 675695616. Throughput: 0: 5809.1. Samples: 675699442. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:32,611][25689] Avg episode reward: [(0, '-5.782')] [2022-07-10 09:12:33,255][26022] Updated weights on worker 0-0, policy_version 659863 (0.00611) [2022-07-10 09:12:34,962][26022] Updated weights on worker 0-0, policy_version 659873 (0.00092) [2022-07-10 09:12:36,587][26022] Updated weights on worker 0-0, policy_version 659883 (0.00087) [2022-07-10 09:12:37,627][25689] Fps is (10 sec: 5589.6, 60 sec: 5554.3, 300 sec: 5522.8). Total num frames: 675724288. Throughput: 0: 5827.3. Samples: 675733130. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:37,629][25689] Avg episode reward: [(0, '-5.720')] [2022-07-10 09:12:38,679][26022] Updated weights on worker 0-0, policy_version 659893 (0.00094) [2022-07-10 09:12:40,563][26022] Updated weights on worker 0-0, policy_version 659903 (0.00091) [2022-07-10 09:12:42,192][26022] Updated weights on worker 0-0, policy_version 659913 (0.00090) [2022-07-10 09:12:42,639][25689] Fps is (10 sec: 5717.7, 60 sec: 5556.2, 300 sec: 5526.5). Total num frames: 675752960. Throughput: 0: 5000.5. Samples: 675750062. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:42,639][25689] Avg episode reward: [(0, '-5.189')] [2022-07-10 09:12:44,057][26022] Updated weights on worker 0-0, policy_version 659923 (0.00083) [2022-07-10 09:12:45,790][26022] Updated weights on worker 0-0, policy_version 659933 (0.00099) [2022-07-10 09:12:47,584][26022] Updated weights on worker 0-0, policy_version 659943 (0.00082) [2022-07-10 09:12:47,675][25689] Fps is (10 sec: 5707.0, 60 sec: 5588.5, 300 sec: 5531.4). Total num frames: 675781632. Throughput: 0: 5833.5. Samples: 675783518. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2022-07-10 09:12:47,677][25689] Avg episode reward: [(0, '-7.274')] [2022-07-10 09:12:49,673][26022] Updated weights on worker 0-0, policy_version 659953 (0.00082) [2022-07-10 09:12:51,260][26022] Updated weights on worker 0-0, policy_version 659963 (0.00090) [2022-07-10 09:12:52,757][25689] Fps is (10 sec: 5464.6, 60 sec: 5535.6, 300 sec: 5530.6). Total num frames: 675808256. Throughput: 0: 5831.5. Samples: 675816936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:12:52,758][25689] Avg episode reward: [(0, '-7.532')] [2022-07-10 09:12:53,471][26022] Updated weights on worker 0-0, policy_version 659973 (0.00090) [2022-07-10 09:12:55,109][26022] Updated weights on worker 0-0, policy_version 659983 (0.00088) [2022-07-10 09:12:56,923][26022] Updated weights on worker 0-0, policy_version 659993 (0.00092) [2022-07-10 09:12:57,810][25689] Fps is (10 sec: 5354.9, 60 sec: 5537.9, 300 sec: 5524.0). Total num frames: 675835904. Throughput: 0: 4984.6. Samples: 675833734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:12:57,811][25689] Avg episode reward: [(0, '-6.654')] [2022-07-10 09:12:58,822][26022] Updated weights on worker 0-0, policy_version 660003 (0.00077) [2022-07-10 09:13:00,635][26022] Updated weights on worker 0-0, policy_version 660013 (0.00086) [2022-07-10 09:13:02,675][26022] Updated weights on worker 0-0, policy_version 660023 (0.00095) [2022-07-10 09:13:02,859][25689] Fps is (10 sec: 5575.3, 60 sec: 5568.4, 300 sec: 5534.1). Total num frames: 675864576. Throughput: 0: 5792.7. Samples: 675867194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:02,860][25689] Avg episode reward: [(0, '-7.011')] [2022-07-10 09:13:04,968][26022] Updated weights on worker 0-0, policy_version 660033 (0.01360) [2022-07-10 09:13:06,363][26022] Updated weights on worker 0-0, policy_version 660043 (0.00087) [2022-07-10 09:13:07,879][25689] Fps is (10 sec: 5491.8, 60 sec: 5556.7, 300 sec: 5532.0). Total num frames: 675891200. Throughput: 0: 5692.7. Samples: 675898532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:07,879][25689] Avg episode reward: [(0, '-6.604')] [2022-07-10 09:13:08,615][26022] Updated weights on worker 0-0, policy_version 660053 (0.00086) [2022-07-10 09:13:10,173][26022] Updated weights on worker 0-0, policy_version 660063 (0.00091) [2022-07-10 09:13:12,244][26022] Updated weights on worker 0-0, policy_version 660073 (0.00098) [2022-07-10 09:13:12,919][25689] Fps is (10 sec: 5293.3, 60 sec: 5510.6, 300 sec: 5524.7). Total num frames: 675917824. Throughput: 0: 4864.0. Samples: 675915000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:12,919][25689] Avg episode reward: [(0, '-6.496')] [2022-07-10 09:13:13,995][26022] Updated weights on worker 0-0, policy_version 660083 (0.00087) [2022-07-10 09:13:15,885][26022] Updated weights on worker 0-0, policy_version 660093 (0.00076) [2022-07-10 09:13:17,497][26022] Updated weights on worker 0-0, policy_version 660103 (0.00082) [2022-07-10 09:13:18,016][25689] Fps is (10 sec: 5555.7, 60 sec: 5558.8, 300 sec: 5531.5). Total num frames: 675947520. Throughput: 0: 5667.4. Samples: 675948250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:18,017][25689] Avg episode reward: [(0, '-5.358')] [2022-07-10 09:13:19,630][26022] Updated weights on worker 0-0, policy_version 660113 (0.00105) [2022-07-10 09:13:21,280][26022] Updated weights on worker 0-0, policy_version 660123 (0.00090) [2022-07-10 09:13:23,031][25689] Fps is (10 sec: 5569.4, 60 sec: 5528.6, 300 sec: 5525.0). Total num frames: 675974144. Throughput: 0: 5681.4. Samples: 675981800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:23,032][25689] Avg episode reward: [(0, '-5.253')] [2022-07-10 09:13:23,158][26022] Updated weights on worker 0-0, policy_version 660133 (0.00086) [2022-07-10 09:13:24,976][26022] Updated weights on worker 0-0, policy_version 660143 (0.00093) [2022-07-10 09:13:26,786][26022] Updated weights on worker 0-0, policy_version 660153 (0.00083) [2022-07-10 09:13:28,072][25689] Fps is (10 sec: 5499.0, 60 sec: 5532.8, 300 sec: 5529.1). Total num frames: 676002816. Throughput: 0: 4960.3. Samples: 675998696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:28,072][25689] Avg episode reward: [(0, '-4.872')] [2022-07-10 09:13:28,830][26022] Updated weights on worker 0-0, policy_version 660163 (0.00090) [2022-07-10 09:13:30,792][26022] Updated weights on worker 0-0, policy_version 660173 (0.00089) [2022-07-10 09:13:32,300][26022] Updated weights on worker 0-0, policy_version 660183 (0.00085) [2022-07-10 09:13:33,196][25689] Fps is (10 sec: 5540.9, 60 sec: 5526.8, 300 sec: 5526.9). Total num frames: 676030464. Throughput: 0: 5749.3. Samples: 676031578. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:33,196][25689] Avg episode reward: [(0, '-4.367')] [2022-07-10 09:13:34,415][26022] Updated weights on worker 0-0, policy_version 660193 (0.00085) [2022-07-10 09:13:36,095][26022] Updated weights on worker 0-0, policy_version 660203 (0.00093) [2022-07-10 09:13:38,059][26022] Updated weights on worker 0-0, policy_version 660213 (0.00094) [2022-07-10 09:13:38,239][25689] Fps is (10 sec: 5640.1, 60 sec: 5541.3, 300 sec: 5533.6). Total num frames: 676060160. Throughput: 0: 5773.8. Samples: 676065014. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:38,240][25689] Avg episode reward: [(0, '-3.257')] [2022-07-10 09:13:39,877][26022] Updated weights on worker 0-0, policy_version 660223 (0.00094) [2022-07-10 09:13:41,525][26022] Updated weights on worker 0-0, policy_version 660233 (0.00082) [2022-07-10 09:13:43,314][25689] Fps is (10 sec: 5667.3, 60 sec: 5518.7, 300 sec: 5532.6). Total num frames: 676087808. Throughput: 0: 4939.1. Samples: 676081980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:43,315][25689] Avg episode reward: [(0, '-3.129')] [2022-07-10 09:13:43,498][26022] Updated weights on worker 0-0, policy_version 660243 (0.00086) [2022-07-10 09:13:45,335][26022] Updated weights on worker 0-0, policy_version 660253 (0.00230) [2022-07-10 09:13:47,093][26022] Updated weights on worker 0-0, policy_version 660263 (0.00081) [2022-07-10 09:13:48,340][25689] Fps is (10 sec: 5474.2, 60 sec: 5502.7, 300 sec: 5530.0). Total num frames: 676115456. Throughput: 0: 5755.9. Samples: 676115360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:48,341][25689] Avg episode reward: [(0, '-1.365')] [2022-07-10 09:13:49,015][26022] Updated weights on worker 0-0, policy_version 660273 (0.00094) [2022-07-10 09:13:50,748][26022] Updated weights on worker 0-0, policy_version 660283 (0.00086) [2022-07-10 09:13:52,729][26022] Updated weights on worker 0-0, policy_version 660293 (0.00091) [2022-07-10 09:13:53,382][25689] Fps is (10 sec: 5492.1, 60 sec: 5523.3, 300 sec: 5529.8). Total num frames: 676143104. Throughput: 0: 5791.8. Samples: 676148496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:53,383][25689] Avg episode reward: [(0, '-2.627')] [2022-07-10 09:13:54,615][26022] Updated weights on worker 0-0, policy_version 660303 (0.00088) [2022-07-10 09:13:56,391][26022] Updated weights on worker 0-0, policy_version 660313 (0.00083) [2022-07-10 09:13:58,388][25689] Fps is (10 sec: 5503.1, 60 sec: 5527.5, 300 sec: 5522.8). Total num frames: 676170752. Throughput: 0: 4965.4. Samples: 676165064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:13:58,389][25689] Avg episode reward: [(0, '-4.131')] [2022-07-10 09:13:58,396][26022] Updated weights on worker 0-0, policy_version 660323 (0.00093) [2022-07-10 09:14:00,187][26022] Updated weights on worker 0-0, policy_version 660333 (0.00092) [2022-07-10 09:14:02,412][26022] Updated weights on worker 0-0, policy_version 660343 (0.00092) [2022-07-10 09:14:03,416][25689] Fps is (10 sec: 5409.3, 60 sec: 5495.7, 300 sec: 5529.7). Total num frames: 676197376. Throughput: 0: 5783.8. Samples: 676198242. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:03,416][25689] Avg episode reward: [(0, '-4.624')] [2022-07-10 09:14:04,284][26022] Updated weights on worker 0-0, policy_version 660353 (0.00088) [2022-07-10 09:14:06,050][26022] Updated weights on worker 0-0, policy_version 660363 (0.00093) [2022-07-10 09:14:07,783][26022] Updated weights on worker 0-0, policy_version 660373 (0.00096) [2022-07-10 09:14:08,438][25689] Fps is (10 sec: 5400.2, 60 sec: 5512.3, 300 sec: 5533.4). Total num frames: 676225024. Throughput: 0: 5682.6. Samples: 676229570. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:08,440][25689] Avg episode reward: [(0, '-4.771')] [2022-07-10 09:14:09,771][26022] Updated weights on worker 0-0, policy_version 660383 (0.00089) [2022-07-10 09:14:11,463][26022] Updated weights on worker 0-0, policy_version 660393 (0.00093) [2022-07-10 09:14:13,514][25689] Fps is (10 sec: 5475.5, 60 sec: 5525.9, 300 sec: 5525.3). Total num frames: 676252672. Throughput: 0: 4850.1. Samples: 676246138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:13,515][25689] Avg episode reward: [(0, '-6.291')] [2022-07-10 09:14:13,519][26022] Updated weights on worker 0-0, policy_version 660403 (0.00084) [2022-07-10 09:14:15,121][26022] Updated weights on worker 0-0, policy_version 660413 (0.00093) [2022-07-10 09:14:17,289][26022] Updated weights on worker 0-0, policy_version 660423 (0.00085) [2022-07-10 09:14:18,540][25689] Fps is (10 sec: 5575.6, 60 sec: 5515.6, 300 sec: 5528.8). Total num frames: 676281344. Throughput: 0: 5681.1. Samples: 676279544. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:18,540][25689] Avg episode reward: [(0, '-5.559')] [2022-07-10 09:14:19,032][26022] Updated weights on worker 0-0, policy_version 660433 (0.00089) [2022-07-10 09:14:19,585][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:14:19,598][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000660436_676286464.pth [2022-07-10 09:14:19,599][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000658492_674295808.pth [2022-07-10 09:14:20,783][26022] Updated weights on worker 0-0, policy_version 660443 (0.00085) [2022-07-10 09:14:22,742][26022] Updated weights on worker 0-0, policy_version 660453 (0.00213) [2022-07-10 09:14:23,614][25689] Fps is (10 sec: 5576.6, 60 sec: 5527.1, 300 sec: 5534.5). Total num frames: 676308992. Throughput: 0: 5684.9. Samples: 676313066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:23,614][25689] Avg episode reward: [(0, '-7.088')] [2022-07-10 09:14:24,352][26022] Updated weights on worker 0-0, policy_version 660463 (0.00087) [2022-07-10 09:14:26,391][26022] Updated weights on worker 0-0, policy_version 660473 (0.00082) [2022-07-10 09:14:28,377][26022] Updated weights on worker 0-0, policy_version 660483 (0.00093) [2022-07-10 09:14:28,624][25689] Fps is (10 sec: 5381.6, 60 sec: 5496.1, 300 sec: 5522.5). Total num frames: 676335616. Throughput: 0: 5777.4. Samples: 676346192. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:28,625][25689] Avg episode reward: [(0, '-6.594')] [2022-07-10 09:14:29,939][26022] Updated weights on worker 0-0, policy_version 660493 (0.00084) [2022-07-10 09:14:32,044][26022] Updated weights on worker 0-0, policy_version 660503 (0.00096) [2022-07-10 09:14:33,585][26022] Updated weights on worker 0-0, policy_version 660513 (0.00093) [2022-07-10 09:14:33,679][25689] Fps is (10 sec: 5595.7, 60 sec: 5536.2, 300 sec: 5532.1). Total num frames: 676365312. Throughput: 0: 5783.3. Samples: 676362754. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:33,679][25689] Avg episode reward: [(0, '-6.687')] [2022-07-10 09:14:35,561][26022] Updated weights on worker 0-0, policy_version 660523 (0.00585) [2022-07-10 09:14:37,327][26022] Updated weights on worker 0-0, policy_version 660533 (0.00081) [2022-07-10 09:14:38,696][25689] Fps is (10 sec: 5693.6, 60 sec: 5504.8, 300 sec: 5528.8). Total num frames: 676392960. Throughput: 0: 5799.5. Samples: 676396440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:38,697][25689] Avg episode reward: [(0, '-6.401')] [2022-07-10 09:14:39,194][26022] Updated weights on worker 0-0, policy_version 660543 (0.00090) [2022-07-10 09:14:41,142][26022] Updated weights on worker 0-0, policy_version 660553 (0.00090) [2022-07-10 09:14:42,788][26022] Updated weights on worker 0-0, policy_version 660563 (0.00090) [2022-07-10 09:14:43,703][25689] Fps is (10 sec: 5414.0, 60 sec: 5494.0, 300 sec: 5528.7). Total num frames: 676419584. Throughput: 0: 5815.8. Samples: 676429900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:43,703][25689] Avg episode reward: [(0, '-7.385')] [2022-07-10 09:14:44,723][26022] Updated weights on worker 0-0, policy_version 660573 (0.00089) [2022-07-10 09:14:46,524][26022] Updated weights on worker 0-0, policy_version 660583 (0.00088) [2022-07-10 09:14:48,349][26022] Updated weights on worker 0-0, policy_version 660593 (0.00084) [2022-07-10 09:14:48,739][25689] Fps is (10 sec: 5608.1, 60 sec: 5527.0, 300 sec: 5533.3). Total num frames: 676449280. Throughput: 0: 5004.8. Samples: 676446860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:48,739][25689] Avg episode reward: [(0, '-7.024')] [2022-07-10 09:14:50,254][26022] Updated weights on worker 0-0, policy_version 660603 (0.00079) [2022-07-10 09:14:51,908][26022] Updated weights on worker 0-0, policy_version 660613 (0.00081) [2022-07-10 09:14:53,825][25689] Fps is (10 sec: 5665.3, 60 sec: 5523.0, 300 sec: 5536.0). Total num frames: 676476928. Throughput: 0: 5845.8. Samples: 676480524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:53,825][25689] Avg episode reward: [(0, '-8.050')] [2022-07-10 09:14:53,876][26022] Updated weights on worker 0-0, policy_version 660623 (0.00089) [2022-07-10 09:14:55,744][26022] Updated weights on worker 0-0, policy_version 660633 (0.00091) [2022-07-10 09:14:57,475][26022] Updated weights on worker 0-0, policy_version 660643 (0.00087) [2022-07-10 09:14:58,846][25689] Fps is (10 sec: 5471.1, 60 sec: 5521.7, 300 sec: 5530.2). Total num frames: 676504576. Throughput: 0: 5837.6. Samples: 676514066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:14:58,847][25689] Avg episode reward: [(0, '-7.068')] [2022-07-10 09:14:59,394][26022] Updated weights on worker 0-0, policy_version 660653 (0.00089) [2022-07-10 09:15:01,217][26022] Updated weights on worker 0-0, policy_version 660663 (0.00092) [2022-07-10 09:15:03,395][26022] Updated weights on worker 0-0, policy_version 660673 (0.00096) [2022-07-10 09:15:03,851][25689] Fps is (10 sec: 5413.2, 60 sec: 5523.7, 300 sec: 5534.6). Total num frames: 676531200. Throughput: 0: 4968.5. Samples: 676530004. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:03,851][25689] Avg episode reward: [(0, '-7.829')] [2022-07-10 09:15:05,184][26022] Updated weights on worker 0-0, policy_version 660683 (0.00087) [2022-07-10 09:15:06,975][26022] Updated weights on worker 0-0, policy_version 660693 (0.00088) [2022-07-10 09:15:08,873][25689] Fps is (10 sec: 5412.5, 60 sec: 5523.7, 300 sec: 5536.4). Total num frames: 676558848. Throughput: 0: 5737.2. Samples: 676562374. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:08,874][25689] Avg episode reward: [(0, '-7.600')] [2022-07-10 09:15:08,951][26022] Updated weights on worker 0-0, policy_version 660703 (0.00096) [2022-07-10 09:15:10,681][26022] Updated weights on worker 0-0, policy_version 660713 (0.00091) [2022-07-10 09:15:12,612][26022] Updated weights on worker 0-0, policy_version 660723 (0.00093) [2022-07-10 09:15:13,988][25689] Fps is (10 sec: 5555.7, 60 sec: 5537.1, 300 sec: 5532.3). Total num frames: 676587520. Throughput: 0: 5726.2. Samples: 676595982. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:13,988][25689] Avg episode reward: [(0, '-6.562')] [2022-07-10 09:15:14,519][26022] Updated weights on worker 0-0, policy_version 660733 (0.00089) [2022-07-10 09:15:16,353][26022] Updated weights on worker 0-0, policy_version 660743 (0.00095) [2022-07-10 09:15:18,045][26022] Updated weights on worker 0-0, policy_version 660753 (0.00085) [2022-07-10 09:15:19,017][25689] Fps is (10 sec: 5753.8, 60 sec: 5553.7, 300 sec: 5542.6). Total num frames: 676617216. Throughput: 0: 4898.9. Samples: 676612884. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:19,018][25689] Avg episode reward: [(0, '-5.060')] [2022-07-10 09:15:20,169][26022] Updated weights on worker 0-0, policy_version 660763 (0.00082) [2022-07-10 09:15:21,688][26022] Updated weights on worker 0-0, policy_version 660773 (0.00091) [2022-07-10 09:15:23,591][26022] Updated weights on worker 0-0, policy_version 660783 (0.00087) [2022-07-10 09:15:24,035][25689] Fps is (10 sec: 5605.7, 60 sec: 5541.9, 300 sec: 5539.2). Total num frames: 676643840. Throughput: 0: 5766.4. Samples: 676646394. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:24,035][25689] Avg episode reward: [(0, '-2.679')] [2022-07-10 09:15:25,325][26022] Updated weights on worker 0-0, policy_version 660793 (0.00081) [2022-07-10 09:15:27,319][26022] Updated weights on worker 0-0, policy_version 660803 (0.00084) [2022-07-10 09:15:29,095][25689] Fps is (10 sec: 5385.4, 60 sec: 5554.3, 300 sec: 5532.3). Total num frames: 676671488. Throughput: 0: 5815.0. Samples: 676679964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:29,095][25689] Avg episode reward: [(0, '-2.310')] [2022-07-10 09:15:29,279][26022] Updated weights on worker 0-0, policy_version 660813 (0.00090) [2022-07-10 09:15:30,946][26022] Updated weights on worker 0-0, policy_version 660823 (0.00057) [2022-07-10 09:15:32,717][26022] Updated weights on worker 0-0, policy_version 660833 (0.00091) [2022-07-10 09:15:34,197][25689] Fps is (10 sec: 5542.1, 60 sec: 5533.0, 300 sec: 5534.8). Total num frames: 676700160. Throughput: 0: 4978.2. Samples: 676696588. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:34,198][25689] Avg episode reward: [(0, '-1.574')] [2022-07-10 09:15:34,726][26022] Updated weights on worker 0-0, policy_version 660843 (0.00084) [2022-07-10 09:15:36,419][26022] Updated weights on worker 0-0, policy_version 660853 (0.00080) [2022-07-10 09:15:38,411][26022] Updated weights on worker 0-0, policy_version 660863 (0.00267) [2022-07-10 09:15:39,225][25689] Fps is (10 sec: 5458.6, 60 sec: 5515.1, 300 sec: 5529.2). Total num frames: 676726784. Throughput: 0: 5799.1. Samples: 676730070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:39,225][25689] Avg episode reward: [(0, '-1.253')] [2022-07-10 09:15:40,079][26022] Updated weights on worker 0-0, policy_version 660873 (0.00092) [2022-07-10 09:15:42,049][26022] Updated weights on worker 0-0, policy_version 660883 (0.00082) [2022-07-10 09:15:43,761][26022] Updated weights on worker 0-0, policy_version 660893 (0.00081) [2022-07-10 09:15:44,245][25689] Fps is (10 sec: 5605.4, 60 sec: 5564.7, 300 sec: 5539.3). Total num frames: 676756480. Throughput: 0: 5820.4. Samples: 676764022. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:44,245][25689] Avg episode reward: [(0, '-1.745')] [2022-07-10 09:15:45,494][26022] Updated weights on worker 0-0, policy_version 660903 (0.00084) [2022-07-10 09:15:47,402][26022] Updated weights on worker 0-0, policy_version 660913 (0.00090) [2022-07-10 09:15:49,073][26022] Updated weights on worker 0-0, policy_version 660923 (0.00090) [2022-07-10 09:15:49,249][25689] Fps is (10 sec: 5822.7, 60 sec: 5550.7, 300 sec: 5543.9). Total num frames: 676785152. Throughput: 0: 5010.7. Samples: 676780950. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:49,249][25689] Avg episode reward: [(0, '-3.362')] [2022-07-10 09:15:51,209][26022] Updated weights on worker 0-0, policy_version 660933 (0.00086) [2022-07-10 09:15:52,966][26022] Updated weights on worker 0-0, policy_version 660943 (0.00091) [2022-07-10 09:15:54,293][25689] Fps is (10 sec: 5604.8, 60 sec: 5554.5, 300 sec: 5533.5). Total num frames: 676812800. Throughput: 0: 5866.5. Samples: 676814482. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:54,294][25689] Avg episode reward: [(0, '-5.462')] [2022-07-10 09:15:54,682][26022] Updated weights on worker 0-0, policy_version 660953 (0.00086) [2022-07-10 09:15:56,628][26022] Updated weights on worker 0-0, policy_version 660963 (0.00093) [2022-07-10 09:15:58,410][26022] Updated weights on worker 0-0, policy_version 660973 (0.00086) [2022-07-10 09:15:59,373][25689] Fps is (10 sec: 5664.3, 60 sec: 5583.0, 300 sec: 5543.0). Total num frames: 676842496. Throughput: 0: 5854.9. Samples: 676848034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:15:59,373][25689] Avg episode reward: [(0, '-4.770')] [2022-07-10 09:16:00,414][26022] Updated weights on worker 0-0, policy_version 660983 (0.00082) [2022-07-10 09:16:02,509][26022] Updated weights on worker 0-0, policy_version 660993 (0.00085) [2022-07-10 09:16:04,288][26022] Updated weights on worker 0-0, policy_version 661003 (0.00083) [2022-07-10 09:16:04,397][25689] Fps is (10 sec: 5371.3, 60 sec: 5547.3, 300 sec: 5539.2). Total num frames: 676867072. Throughput: 0: 4894.2. Samples: 676862652. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 09:16:04,398][25689] Avg episode reward: [(0, '-4.578')] [2022-07-10 09:16:06,125][26022] Updated weights on worker 0-0, policy_version 661013 (0.00093) [2022-07-10 09:16:08,084][26022] Updated weights on worker 0-0, policy_version 661023 (0.00087) [2022-07-10 09:16:09,426][25689] Fps is (10 sec: 5194.9, 60 sec: 5546.8, 300 sec: 5536.8). Total num frames: 676894720. Throughput: 0: 5697.5. Samples: 676895906. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:09,426][25689] Avg episode reward: [(0, '-3.794')] [2022-07-10 09:16:09,983][26022] Updated weights on worker 0-0, policy_version 661033 (0.00095) [2022-07-10 09:16:11,671][26022] Updated weights on worker 0-0, policy_version 661043 (0.00085) [2022-07-10 09:16:13,525][26022] Updated weights on worker 0-0, policy_version 661053 (0.00084) [2022-07-10 09:16:14,556][25689] Fps is (10 sec: 5544.1, 60 sec: 5545.4, 300 sec: 5534.4). Total num frames: 676923392. Throughput: 0: 5681.9. Samples: 676929612. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:14,559][25689] Avg episode reward: [(0, '-2.242')] [2022-07-10 09:16:15,251][26022] Updated weights on worker 0-0, policy_version 661063 (0.00089) [2022-07-10 09:16:17,279][26022] Updated weights on worker 0-0, policy_version 661073 (0.00093) [2022-07-10 09:16:18,834][26022] Updated weights on worker 0-0, policy_version 661083 (0.00086) [2022-07-10 09:16:19,574][25689] Fps is (10 sec: 5650.5, 60 sec: 5529.5, 300 sec: 5537.7). Total num frames: 676952064. Throughput: 0: 4885.3. Samples: 676946724. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:19,576][25689] Avg episode reward: [(0, '-2.041')] [2022-07-10 09:16:19,700][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:16:19,725][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000661087_676953088.pth [2022-07-10 09:16:19,726][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000659139_674958336.pth [2022-07-10 09:16:20,783][26022] Updated weights on worker 0-0, policy_version 661093 (0.00087) [2022-07-10 09:16:22,545][26022] Updated weights on worker 0-0, policy_version 661103 (0.00089) [2022-07-10 09:16:24,438][26022] Updated weights on worker 0-0, policy_version 661113 (0.00090) [2022-07-10 09:16:24,638][25689] Fps is (10 sec: 5586.2, 60 sec: 5542.2, 300 sec: 5530.0). Total num frames: 676979712. Throughput: 0: 5836.2. Samples: 676980780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:24,640][25689] Avg episode reward: [(0, '-2.137')] [2022-07-10 09:16:26,176][26022] Updated weights on worker 0-0, policy_version 661123 (0.00100) [2022-07-10 09:16:27,994][26022] Updated weights on worker 0-0, policy_version 661133 (0.00106) [2022-07-10 09:16:29,641][25689] Fps is (10 sec: 5797.7, 60 sec: 5598.1, 300 sec: 5548.4). Total num frames: 677010432. Throughput: 0: 5871.5. Samples: 677014604. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:29,642][25689] Avg episode reward: [(0, '-3.054')] [2022-07-10 09:16:29,646][26022] Updated weights on worker 0-0, policy_version 661143 (0.00091) [2022-07-10 09:16:31,755][26022] Updated weights on worker 0-0, policy_version 661153 (0.00081) [2022-07-10 09:16:33,570][26022] Updated weights on worker 0-0, policy_version 661163 (0.00093) [2022-07-10 09:16:34,814][25689] Fps is (10 sec: 5635.1, 60 sec: 5557.8, 300 sec: 5538.7). Total num frames: 677037056. Throughput: 0: 5015.4. Samples: 677031214. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:34,815][25689] Avg episode reward: [(0, '-2.803')] [2022-07-10 09:16:35,447][26022] Updated weights on worker 0-0, policy_version 661173 (0.00093) [2022-07-10 09:16:37,080][26022] Updated weights on worker 0-0, policy_version 661183 (0.00092) [2022-07-10 09:16:38,817][26022] Updated weights on worker 0-0, policy_version 661193 (0.00085) [2022-07-10 09:16:39,830][25689] Fps is (10 sec: 5527.8, 60 sec: 5609.5, 300 sec: 5542.5). Total num frames: 677066752. Throughput: 0: 5830.6. Samples: 677064828. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:39,831][25689] Avg episode reward: [(0, '-3.516')] [2022-07-10 09:16:40,938][26022] Updated weights on worker 0-0, policy_version 661203 (0.00060) [2022-07-10 09:16:42,664][26022] Updated weights on worker 0-0, policy_version 661213 (0.00096) [2022-07-10 09:16:44,467][26022] Updated weights on worker 0-0, policy_version 661223 (0.00095) [2022-07-10 09:16:44,867][25689] Fps is (10 sec: 5704.2, 60 sec: 5574.2, 300 sec: 5545.6). Total num frames: 677094400. Throughput: 0: 5839.6. Samples: 677098910. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:44,867][25689] Avg episode reward: [(0, '-3.183')] [2022-07-10 09:16:46,324][26022] Updated weights on worker 0-0, policy_version 661233 (0.00054) [2022-07-10 09:16:48,153][26022] Updated weights on worker 0-0, policy_version 661243 (0.00086) [2022-07-10 09:16:49,920][25689] Fps is (10 sec: 5480.4, 60 sec: 5552.8, 300 sec: 5538.8). Total num frames: 677122048. Throughput: 0: 4974.3. Samples: 677115474. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:49,920][25689] Avg episode reward: [(0, '-3.664')] [2022-07-10 09:16:50,082][26022] Updated weights on worker 0-0, policy_version 661253 (0.00095) [2022-07-10 09:16:51,656][26022] Updated weights on worker 0-0, policy_version 661263 (0.00086) [2022-07-10 09:16:53,615][26022] Updated weights on worker 0-0, policy_version 661273 (0.00086) [2022-07-10 09:16:55,015][25689] Fps is (10 sec: 5650.7, 60 sec: 5581.9, 300 sec: 5545.4). Total num frames: 677151744. Throughput: 0: 5851.3. Samples: 677149418. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:16:55,015][25689] Avg episode reward: [(0, '-3.129')] [2022-07-10 09:16:55,440][26022] Updated weights on worker 0-0, policy_version 661283 (0.00090) [2022-07-10 09:16:57,213][26022] Updated weights on worker 0-0, policy_version 661293 (0.00087) [2022-07-10 09:16:59,063][26022] Updated weights on worker 0-0, policy_version 661303 (0.00091) [2022-07-10 09:17:00,048][25689] Fps is (10 sec: 5661.8, 60 sec: 5552.4, 300 sec: 5548.4). Total num frames: 677179392. Throughput: 0: 5858.5. Samples: 677183276. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:00,048][25689] Avg episode reward: [(0, '-4.850')] [2022-07-10 09:17:00,878][26022] Updated weights on worker 0-0, policy_version 661313 (0.00098) [2022-07-10 09:17:03,349][26022] Updated weights on worker 0-0, policy_version 661323 (0.00084) [2022-07-10 09:17:05,110][25689] Fps is (10 sec: 5173.5, 60 sec: 5549.0, 300 sec: 5538.4). Total num frames: 677203968. Throughput: 0: 5692.1. Samples: 677214134. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:05,110][25689] Avg episode reward: [(0, '-5.920')] [2022-07-10 09:17:05,122][26022] Updated weights on worker 0-0, policy_version 661333 (0.00089) [2022-07-10 09:17:06,798][26022] Updated weights on worker 0-0, policy_version 661343 (0.00086) [2022-07-10 09:17:08,691][26022] Updated weights on worker 0-0, policy_version 661353 (0.00089) [2022-07-10 09:17:10,171][25689] Fps is (10 sec: 5462.3, 60 sec: 5596.6, 300 sec: 5542.4). Total num frames: 677234688. Throughput: 0: 5713.8. Samples: 677231186. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:10,173][25689] Avg episode reward: [(0, '-5.092')] [2022-07-10 09:17:10,290][26022] Updated weights on worker 0-0, policy_version 661363 (0.00090) [2022-07-10 09:17:12,547][26022] Updated weights on worker 0-0, policy_version 661373 (0.00092) [2022-07-10 09:17:14,221][26022] Updated weights on worker 0-0, policy_version 661383 (0.00087) [2022-07-10 09:17:15,255][25689] Fps is (10 sec: 5652.5, 60 sec: 5567.2, 300 sec: 5542.1). Total num frames: 677261312. Throughput: 0: 5682.7. Samples: 677264434. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:15,256][25689] Avg episode reward: [(0, '-5.846')] [2022-07-10 09:17:15,997][26022] Updated weights on worker 0-0, policy_version 661393 (0.00082) [2022-07-10 09:17:17,891][26022] Updated weights on worker 0-0, policy_version 661403 (0.01095) [2022-07-10 09:17:19,478][26022] Updated weights on worker 0-0, policy_version 661413 (0.00088) [2022-07-10 09:17:20,258][25689] Fps is (10 sec: 5482.1, 60 sec: 5568.5, 300 sec: 5543.1). Total num frames: 677289984. Throughput: 0: 5701.1. Samples: 677298494. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:20,258][25689] Avg episode reward: [(0, '-4.673')] [2022-07-10 09:17:21,499][26022] Updated weights on worker 0-0, policy_version 661423 (0.00093) [2022-07-10 09:17:23,293][26022] Updated weights on worker 0-0, policy_version 661433 (0.00083) [2022-07-10 09:17:25,060][26022] Updated weights on worker 0-0, policy_version 661443 (0.00092) [2022-07-10 09:17:25,283][25689] Fps is (10 sec: 5718.3, 60 sec: 5589.0, 300 sec: 5544.2). Total num frames: 677318656. Throughput: 0: 5013.4. Samples: 677315270. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:25,283][25689] Avg episode reward: [(0, '-5.797')] [2022-07-10 09:17:27,037][26022] Updated weights on worker 0-0, policy_version 661453 (0.00087) [2022-07-10 09:17:28,689][26022] Updated weights on worker 0-0, policy_version 661463 (0.00083) [2022-07-10 09:17:30,295][25689] Fps is (10 sec: 5611.3, 60 sec: 5537.6, 300 sec: 5545.1). Total num frames: 677346304. Throughput: 0: 5857.5. Samples: 677349060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:30,296][25689] Avg episode reward: [(0, '-5.364')] [2022-07-10 09:17:30,734][26022] Updated weights on worker 0-0, policy_version 661473 (0.00091) [2022-07-10 09:17:32,448][26022] Updated weights on worker 0-0, policy_version 661483 (0.00088) [2022-07-10 09:17:34,363][26022] Updated weights on worker 0-0, policy_version 661493 (0.00085) [2022-07-10 09:17:35,393][25689] Fps is (10 sec: 5671.8, 60 sec: 5595.0, 300 sec: 5547.0). Total num frames: 677376000. Throughput: 0: 5879.0. Samples: 677382828. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:35,394][25689] Avg episode reward: [(0, '-6.343')] [2022-07-10 09:17:36,029][26022] Updated weights on worker 0-0, policy_version 661503 (0.00089) [2022-07-10 09:17:37,862][26022] Updated weights on worker 0-0, policy_version 661513 (0.00089) [2022-07-10 09:17:39,646][26022] Updated weights on worker 0-0, policy_version 661523 (0.00092) [2022-07-10 09:17:40,463][25689] Fps is (10 sec: 5639.6, 60 sec: 5556.3, 300 sec: 5542.5). Total num frames: 677403648. Throughput: 0: 5020.3. Samples: 677399930. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:40,463][25689] Avg episode reward: [(0, '-5.902')] [2022-07-10 09:17:41,644][26022] Updated weights on worker 0-0, policy_version 661533 (0.00101) [2022-07-10 09:17:43,239][26022] Updated weights on worker 0-0, policy_version 661543 (0.00086) [2022-07-10 09:17:45,067][26022] Updated weights on worker 0-0, policy_version 661553 (0.00096) [2022-07-10 09:17:45,467][25689] Fps is (10 sec: 5489.0, 60 sec: 5559.3, 300 sec: 5539.7). Total num frames: 677431296. Throughput: 0: 5877.6. Samples: 677433906. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:45,468][25689] Avg episode reward: [(0, '-6.470')] [2022-07-10 09:17:46,883][26022] Updated weights on worker 0-0, policy_version 661563 (0.00087) [2022-07-10 09:17:48,810][26022] Updated weights on worker 0-0, policy_version 661573 (0.00081) [2022-07-10 09:17:50,497][25689] Fps is (10 sec: 5714.7, 60 sec: 5595.2, 300 sec: 5550.9). Total num frames: 677460992. Throughput: 0: 5868.6. Samples: 677467620. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:50,498][25689] Avg episode reward: [(0, '-5.435')] [2022-07-10 09:17:50,508][26022] Updated weights on worker 0-0, policy_version 661583 (0.00081) [2022-07-10 09:17:52,401][26022] Updated weights on worker 0-0, policy_version 661593 (0.00085) [2022-07-10 09:17:54,142][26022] Updated weights on worker 0-0, policy_version 661603 (0.00088) [2022-07-10 09:17:55,588][25689] Fps is (10 sec: 5666.0, 60 sec: 5561.8, 300 sec: 5550.2). Total num frames: 677488640. Throughput: 0: 5033.6. Samples: 677484480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:17:55,588][25689] Avg episode reward: [(0, '-6.008')] [2022-07-10 09:17:56,044][26022] Updated weights on worker 0-0, policy_version 661613 (0.00156) [2022-07-10 09:17:57,926][26022] Updated weights on worker 0-0, policy_version 661623 (0.00088) [2022-07-10 09:17:59,408][26022] Updated weights on worker 0-0, policy_version 661633 (0.00095) [2022-07-10 09:18:00,634][25689] Fps is (10 sec: 5556.3, 60 sec: 5577.6, 300 sec: 5550.3). Total num frames: 677517312. Throughput: 0: 5877.5. Samples: 677518484. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:00,634][25689] Avg episode reward: [(0, '-6.899')] [2022-07-10 09:18:01,620][26022] Updated weights on worker 0-0, policy_version 661643 (0.00089) [2022-07-10 09:18:03,677][26022] Updated weights on worker 0-0, policy_version 661653 (0.00091) [2022-07-10 09:18:05,532][26022] Updated weights on worker 0-0, policy_version 661663 (0.00094) [2022-07-10 09:18:05,638][25689] Fps is (10 sec: 5400.1, 60 sec: 5599.8, 300 sec: 5547.1). Total num frames: 677542912. Throughput: 0: 5751.4. Samples: 677549918. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:05,638][25689] Avg episode reward: [(0, '-5.392')] [2022-07-10 09:18:07,474][26022] Updated weights on worker 0-0, policy_version 661673 (0.00082) [2022-07-10 09:18:09,131][26022] Updated weights on worker 0-0, policy_version 661683 (0.00092) [2022-07-10 09:18:10,726][25689] Fps is (10 sec: 5276.1, 60 sec: 5546.6, 300 sec: 5549.7). Total num frames: 677570560. Throughput: 0: 4889.2. Samples: 677566522. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:10,727][25689] Avg episode reward: [(0, '-5.824')] [2022-07-10 09:18:11,185][26022] Updated weights on worker 0-0, policy_version 661693 (0.00080) [2022-07-10 09:18:12,942][26022] Updated weights on worker 0-0, policy_version 661703 (0.00092) [2022-07-10 09:18:14,810][26022] Updated weights on worker 0-0, policy_version 661713 (0.00098) [2022-07-10 09:18:15,805][25689] Fps is (10 sec: 5640.4, 60 sec: 5597.7, 300 sec: 5550.0). Total num frames: 677600256. Throughput: 0: 5713.3. Samples: 677599984. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:15,805][25689] Avg episode reward: [(0, '-7.229')] [2022-07-10 09:18:16,541][26022] Updated weights on worker 0-0, policy_version 661723 (0.00085) [2022-07-10 09:18:18,295][26022] Updated weights on worker 0-0, policy_version 661733 (0.00083) [2022-07-10 09:18:19,735][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:18:19,746][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000661739_677620736.pth [2022-07-10 09:18:19,748][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000659789_675623936.pth [2022-07-10 09:18:20,266][26022] Updated weights on worker 0-0, policy_version 661743 (0.00083) [2022-07-10 09:18:20,823][25689] Fps is (10 sec: 5780.7, 60 sec: 5596.3, 300 sec: 5556.8). Total num frames: 677628928. Throughput: 0: 5696.9. Samples: 677633502. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:20,824][25689] Avg episode reward: [(0, '-5.929')] [2022-07-10 09:18:22,164][26022] Updated weights on worker 0-0, policy_version 661753 (0.00089) [2022-07-10 09:18:23,812][26022] Updated weights on worker 0-0, policy_version 661763 (0.00092) [2022-07-10 09:18:25,832][26022] Updated weights on worker 0-0, policy_version 661773 (0.00094) [2022-07-10 09:18:25,911][25689] Fps is (10 sec: 5471.5, 60 sec: 5556.8, 300 sec: 5549.1). Total num frames: 677655552. Throughput: 0: 4949.3. Samples: 677650254. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:25,911][25689] Avg episode reward: [(0, '-4.727')] [2022-07-10 09:18:27,460][26022] Updated weights on worker 0-0, policy_version 661783 (0.00087) [2022-07-10 09:18:29,562][26022] Updated weights on worker 0-0, policy_version 661793 (0.00089) [2022-07-10 09:18:30,931][25689] Fps is (10 sec: 5369.4, 60 sec: 5556.0, 300 sec: 5551.0). Total num frames: 677683200. Throughput: 0: 5795.3. Samples: 677683614. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:30,931][25689] Avg episode reward: [(0, '-4.335')] [2022-07-10 09:18:31,447][26022] Updated weights on worker 0-0, policy_version 661803 (0.00091) [2022-07-10 09:18:33,087][26022] Updated weights on worker 0-0, policy_version 661813 (0.00087) [2022-07-10 09:18:35,078][26022] Updated weights on worker 0-0, policy_version 661823 (0.00085) [2022-07-10 09:18:36,063][25689] Fps is (10 sec: 5749.1, 60 sec: 5569.8, 300 sec: 5552.8). Total num frames: 677713920. Throughput: 0: 5784.7. Samples: 677717174. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:36,064][25689] Avg episode reward: [(0, '-4.803')] [2022-07-10 09:18:36,813][26022] Updated weights on worker 0-0, policy_version 661833 (0.00092) [2022-07-10 09:18:38,721][26022] Updated weights on worker 0-0, policy_version 661843 (0.00088) [2022-07-10 09:18:40,600][26022] Updated weights on worker 0-0, policy_version 661853 (0.00095) [2022-07-10 09:18:41,102][25689] Fps is (10 sec: 5537.0, 60 sec: 5538.8, 300 sec: 5546.6). Total num frames: 677739520. Throughput: 0: 4959.2. Samples: 677734070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:41,104][25689] Avg episode reward: [(0, '-4.564')] [2022-07-10 09:18:42,331][26022] Updated weights on worker 0-0, policy_version 661863 (0.00092) [2022-07-10 09:18:44,304][26022] Updated weights on worker 0-0, policy_version 661873 (0.00087) [2022-07-10 09:18:45,768][26022] Updated weights on worker 0-0, policy_version 661883 (0.00089) [2022-07-10 09:18:46,117][25689] Fps is (10 sec: 5500.1, 60 sec: 5571.6, 300 sec: 5553.7). Total num frames: 677769216. Throughput: 0: 5810.9. Samples: 677767670. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:46,118][25689] Avg episode reward: [(0, '-4.070')] [2022-07-10 09:18:47,816][26022] Updated weights on worker 0-0, policy_version 661893 (0.00083) [2022-07-10 09:18:49,770][26022] Updated weights on worker 0-0, policy_version 661903 (0.00095) [2022-07-10 09:18:51,151][25689] Fps is (10 sec: 5706.8, 60 sec: 5537.6, 300 sec: 5553.8). Total num frames: 677796864. Throughput: 0: 5816.7. Samples: 677801228. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:51,152][25689] Avg episode reward: [(0, '-4.042')] [2022-07-10 09:18:51,515][26022] Updated weights on worker 0-0, policy_version 661913 (0.00087) [2022-07-10 09:18:53,414][26022] Updated weights on worker 0-0, policy_version 661923 (0.00086) [2022-07-10 09:18:55,183][26022] Updated weights on worker 0-0, policy_version 661933 (0.00089) [2022-07-10 09:18:56,229][25689] Fps is (10 sec: 5367.0, 60 sec: 5521.8, 300 sec: 5549.1). Total num frames: 677823488. Throughput: 0: 4976.9. Samples: 677817538. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:18:56,231][25689] Avg episode reward: [(0, '-5.895')] [2022-07-10 09:18:57,084][26022] Updated weights on worker 0-0, policy_version 661943 (0.00090) [2022-07-10 09:18:58,975][26022] Updated weights on worker 0-0, policy_version 661953 (0.00087) [2022-07-10 09:19:00,649][26022] Updated weights on worker 0-0, policy_version 661963 (0.00057) [2022-07-10 09:19:01,249][25689] Fps is (10 sec: 5475.6, 60 sec: 5524.1, 300 sec: 5556.1). Total num frames: 677852160. Throughput: 0: 5809.8. Samples: 677851120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:19:01,250][25689] Avg episode reward: [(0, '-5.567')] [2022-07-10 09:19:02,909][26022] Updated weights on worker 0-0, policy_version 661973 (0.00098) [2022-07-10 09:19:04,754][26022] Updated weights on worker 0-0, policy_version 661983 (0.00081) [2022-07-10 09:19:06,253][25689] Fps is (10 sec: 5516.4, 60 sec: 5541.0, 300 sec: 5553.0). Total num frames: 677878784. Throughput: 0: 5691.9. Samples: 677882284. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:19:06,256][25689] Avg episode reward: [(0, '-6.862')] [2022-07-10 09:19:06,549][26022] Updated weights on worker 0-0, policy_version 661993 (0.00096) [2022-07-10 09:19:08,463][26022] Updated weights on worker 0-0, policy_version 662003 (0.00084) [2022-07-10 09:19:10,192][26022] Updated weights on worker 0-0, policy_version 662013 (0.00090) [2022-07-10 09:19:11,271][25689] Fps is (10 sec: 5415.3, 60 sec: 5547.4, 300 sec: 5554.1). Total num frames: 677906432. Throughput: 0: 5710.7. Samples: 677916132. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:19:11,273][25689] Avg episode reward: [(0, '-7.473')] [2022-07-10 09:19:12,058][26022] Updated weights on worker 0-0, policy_version 662023 (0.00080) [2022-07-10 09:19:13,856][26022] Updated weights on worker 0-0, policy_version 662033 (0.00093) [2022-07-10 09:19:15,748][26022] Updated weights on worker 0-0, policy_version 662043 (0.00091) [2022-07-10 09:19:16,328][25689] Fps is (10 sec: 5488.6, 60 sec: 5515.6, 300 sec: 5550.0). Total num frames: 677934080. Throughput: 0: 5741.2. Samples: 677932928. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:19:16,328][25689] Avg episode reward: [(0, '-7.408')] [2022-07-10 09:19:17,489][26022] Updated weights on worker 0-0, policy_version 662053 (0.00085) [2022-07-10 09:19:19,627][26022] Updated weights on worker 0-0, policy_version 662063 (0.00096) [2022-07-10 09:19:21,123][26022] Updated weights on worker 0-0, policy_version 662073 (0.00083) [2022-07-10 09:19:21,344][25689] Fps is (10 sec: 5591.1, 60 sec: 5515.8, 300 sec: 5554.6). Total num frames: 677962752. Throughput: 0: 5737.6. Samples: 677966418. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 09:19:21,345][25689] Avg episode reward: [(0, '-7.030')] [2022-07-10 09:19:23,215][26022] Updated weights on worker 0-0, policy_version 662083 (0.00087) [2022-07-10 09:19:24,813][26022] Updated weights on worker 0-0, policy_version 662093 (0.00092) [2022-07-10 09:19:26,375][25689] Fps is (10 sec: 5503.9, 60 sec: 5521.0, 300 sec: 5554.2). Total num frames: 677989376. Throughput: 0: 5839.9. Samples: 677999792. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:26,375][25689] Avg episode reward: [(0, '-5.894')] [2022-07-10 09:19:26,761][26022] Updated weights on worker 0-0, policy_version 662103 (0.00087) [2022-07-10 09:19:28,723][26022] Updated weights on worker 0-0, policy_version 662113 (0.00093) [2022-07-10 09:19:30,404][26022] Updated weights on worker 0-0, policy_version 662123 (0.00090) [2022-07-10 09:19:31,397][25689] Fps is (10 sec: 5602.7, 60 sec: 5554.7, 300 sec: 5554.8). Total num frames: 678019072. Throughput: 0: 4981.7. Samples: 678016390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:31,397][25689] Avg episode reward: [(0, '-5.912')] [2022-07-10 09:19:32,507][26022] Updated weights on worker 0-0, policy_version 662133 (0.00084) [2022-07-10 09:19:34,288][26022] Updated weights on worker 0-0, policy_version 662143 (0.00095) [2022-07-10 09:19:36,034][26022] Updated weights on worker 0-0, policy_version 662153 (0.00087) [2022-07-10 09:19:36,532][25689] Fps is (10 sec: 5746.6, 60 sec: 5520.6, 300 sec: 5556.0). Total num frames: 678047744. Throughput: 0: 5764.3. Samples: 678049388. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:36,532][25689] Avg episode reward: [(0, '-3.025')] [2022-07-10 09:19:37,908][26022] Updated weights on worker 0-0, policy_version 662163 (0.00084) [2022-07-10 09:19:39,625][26022] Updated weights on worker 0-0, policy_version 662173 (0.00090) [2022-07-10 09:19:41,563][25689] Fps is (10 sec: 5439.2, 60 sec: 5538.3, 300 sec: 5555.6). Total num frames: 678074368. Throughput: 0: 5782.6. Samples: 678083334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:41,564][25689] Avg episode reward: [(0, '-2.583')] [2022-07-10 09:19:41,672][26022] Updated weights on worker 0-0, policy_version 662183 (0.00092) [2022-07-10 09:19:43,454][26022] Updated weights on worker 0-0, policy_version 662193 (0.00085) [2022-07-10 09:19:45,244][26022] Updated weights on worker 0-0, policy_version 662203 (0.00088) [2022-07-10 09:19:46,616][25689] Fps is (10 sec: 5382.0, 60 sec: 5501.0, 300 sec: 5548.4). Total num frames: 678102016. Throughput: 0: 4943.5. Samples: 678099852. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:46,616][25689] Avg episode reward: [(0, '-3.243')] [2022-07-10 09:19:47,197][26022] Updated weights on worker 0-0, policy_version 662213 (0.00092) [2022-07-10 09:19:48,926][26022] Updated weights on worker 0-0, policy_version 662223 (0.00084) [2022-07-10 09:19:50,891][26022] Updated weights on worker 0-0, policy_version 662233 (0.00096) [2022-07-10 09:19:51,623][25689] Fps is (10 sec: 5700.2, 60 sec: 5537.2, 300 sec: 5556.7). Total num frames: 678131712. Throughput: 0: 5809.6. Samples: 678133896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:51,624][25689] Avg episode reward: [(0, '-3.800')] [2022-07-10 09:19:52,548][26022] Updated weights on worker 0-0, policy_version 662243 (0.00081) [2022-07-10 09:19:54,424][26022] Updated weights on worker 0-0, policy_version 662253 (0.00085) [2022-07-10 09:19:56,230][26022] Updated weights on worker 0-0, policy_version 662263 (0.00094) [2022-07-10 09:19:56,662][25689] Fps is (10 sec: 5809.7, 60 sec: 5574.7, 300 sec: 5559.8). Total num frames: 678160384. Throughput: 0: 5880.2. Samples: 678167760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:19:56,663][25689] Avg episode reward: [(0, '-4.193')] [2022-07-10 09:19:57,934][26022] Updated weights on worker 0-0, policy_version 662273 (0.00085) [2022-07-10 09:19:59,834][26022] Updated weights on worker 0-0, policy_version 662283 (0.00092) [2022-07-10 09:20:01,681][25689] Fps is (10 sec: 5497.5, 60 sec: 5540.9, 300 sec: 5559.6). Total num frames: 678187008. Throughput: 0: 5025.7. Samples: 678184442. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:01,682][25689] Avg episode reward: [(0, '-4.543')] [2022-07-10 09:20:02,057][26022] Updated weights on worker 0-0, policy_version 662293 (0.00105) [2022-07-10 09:20:03,790][26022] Updated weights on worker 0-0, policy_version 662303 (0.00088) [2022-07-10 09:20:05,876][26022] Updated weights on worker 0-0, policy_version 662313 (0.00089) [2022-07-10 09:20:06,704][25689] Fps is (10 sec: 5302.7, 60 sec: 5539.2, 300 sec: 5556.1). Total num frames: 678213632. Throughput: 0: 5767.7. Samples: 678215716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:06,704][25689] Avg episode reward: [(0, '-4.725')] [2022-07-10 09:20:07,485][26022] Updated weights on worker 0-0, policy_version 662323 (0.00089) [2022-07-10 09:20:09,664][26022] Updated weights on worker 0-0, policy_version 662333 (0.00094) [2022-07-10 09:20:11,230][26022] Updated weights on worker 0-0, policy_version 662343 (0.00092) [2022-07-10 09:20:11,728][25689] Fps is (10 sec: 5503.7, 60 sec: 5555.6, 300 sec: 5557.8). Total num frames: 678242304. Throughput: 0: 5717.4. Samples: 678248846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:11,729][25689] Avg episode reward: [(0, '-4.473')] [2022-07-10 09:20:13,205][26022] Updated weights on worker 0-0, policy_version 662353 (0.00051) [2022-07-10 09:20:15,106][26022] Updated weights on worker 0-0, policy_version 662363 (0.00082) [2022-07-10 09:20:16,783][25689] Fps is (10 sec: 5486.4, 60 sec: 5538.8, 300 sec: 5547.0). Total num frames: 678268928. Throughput: 0: 4867.8. Samples: 678265702. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:16,785][25689] Avg episode reward: [(0, '-3.797')] [2022-07-10 09:20:16,938][26022] Updated weights on worker 0-0, policy_version 662373 (0.00087) [2022-07-10 09:20:18,728][26022] Updated weights on worker 0-0, policy_version 662383 (0.00096) [2022-07-10 09:20:19,752][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:20:19,762][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000662389_678286336.pth [2022-07-10 09:20:19,763][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000660436_676286464.pth [2022-07-10 09:20:20,378][26022] Updated weights on worker 0-0, policy_version 662393 (0.00091) [2022-07-10 09:20:21,799][25689] Fps is (10 sec: 5389.3, 60 sec: 5521.9, 300 sec: 5550.5). Total num frames: 678296576. Throughput: 0: 5711.1. Samples: 678299334. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:21,800][25689] Avg episode reward: [(0, '-4.434')] [2022-07-10 09:20:22,418][26022] Updated weights on worker 0-0, policy_version 662403 (0.00086) [2022-07-10 09:20:24,376][26022] Updated weights on worker 0-0, policy_version 662413 (0.00088) [2022-07-10 09:20:25,844][26022] Updated weights on worker 0-0, policy_version 662423 (0.00089) [2022-07-10 09:20:26,807][25689] Fps is (10 sec: 5618.5, 60 sec: 5557.9, 300 sec: 5554.9). Total num frames: 678325248. Throughput: 0: 5836.4. Samples: 678333044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:26,807][25689] Avg episode reward: [(0, '-5.117')] [2022-07-10 09:20:27,950][26022] Updated weights on worker 0-0, policy_version 662433 (0.00076) [2022-07-10 09:20:29,471][26022] Updated weights on worker 0-0, policy_version 662443 (0.00088) [2022-07-10 09:20:31,505][26022] Updated weights on worker 0-0, policy_version 662453 (0.00087) [2022-07-10 09:20:31,823][25689] Fps is (10 sec: 5618.5, 60 sec: 5524.5, 300 sec: 5553.1). Total num frames: 678352896. Throughput: 0: 5026.3. Samples: 678349846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:31,824][25689] Avg episode reward: [(0, '-5.186')] [2022-07-10 09:20:33,109][26022] Updated weights on worker 0-0, policy_version 662463 (0.00087) [2022-07-10 09:20:35,213][26022] Updated weights on worker 0-0, policy_version 662473 (0.00087) [2022-07-10 09:20:36,957][25689] Fps is (10 sec: 5548.6, 60 sec: 5524.6, 300 sec: 5558.0). Total num frames: 678381568. Throughput: 0: 5824.8. Samples: 678383212. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:36,957][25689] Avg episode reward: [(0, '-4.328')] [2022-07-10 09:20:37,017][26022] Updated weights on worker 0-0, policy_version 662483 (0.00081) [2022-07-10 09:20:39,040][26022] Updated weights on worker 0-0, policy_version 662493 (0.00090) [2022-07-10 09:20:40,670][26022] Updated weights on worker 0-0, policy_version 662503 (0.00066) [2022-07-10 09:20:41,963][25689] Fps is (10 sec: 5452.9, 60 sec: 5526.9, 300 sec: 5547.9). Total num frames: 678408192. Throughput: 0: 5800.5. Samples: 678416298. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:41,965][25689] Avg episode reward: [(0, '-3.963')] [2022-07-10 09:20:42,562][26022] Updated weights on worker 0-0, policy_version 662513 (0.00096) [2022-07-10 09:20:44,462][26022] Updated weights on worker 0-0, policy_version 662523 (0.00078) [2022-07-10 09:20:46,219][26022] Updated weights on worker 0-0, policy_version 662533 (0.00089) [2022-07-10 09:20:47,053][25689] Fps is (10 sec: 5680.0, 60 sec: 5574.3, 300 sec: 5553.2). Total num frames: 678438912. Throughput: 0: 4951.7. Samples: 678433296. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:47,055][25689] Avg episode reward: [(0, '-4.309')] [2022-07-10 09:20:48,180][26022] Updated weights on worker 0-0, policy_version 662543 (0.00088) [2022-07-10 09:20:49,729][26022] Updated weights on worker 0-0, policy_version 662553 (0.00974) [2022-07-10 09:20:51,909][26022] Updated weights on worker 0-0, policy_version 662563 (0.00087) [2022-07-10 09:20:52,121][25689] Fps is (10 sec: 5645.5, 60 sec: 5518.0, 300 sec: 5549.3). Total num frames: 678465536. Throughput: 0: 5773.9. Samples: 678467044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:52,121][25689] Avg episode reward: [(0, '-2.663')] [2022-07-10 09:20:53,440][26022] Updated weights on worker 0-0, policy_version 662573 (0.00087) [2022-07-10 09:20:55,406][26022] Updated weights on worker 0-0, policy_version 662583 (0.00093) [2022-07-10 09:20:57,176][25689] Fps is (10 sec: 5462.5, 60 sec: 5516.5, 300 sec: 5546.3). Total num frames: 678494208. Throughput: 0: 5784.6. Samples: 678500170. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:20:57,178][25689] Avg episode reward: [(0, '-3.144')] [2022-07-10 09:20:57,337][26022] Updated weights on worker 0-0, policy_version 662593 (0.00086) [2022-07-10 09:20:59,010][26022] Updated weights on worker 0-0, policy_version 662603 (0.00083) [2022-07-10 09:21:01,039][26022] Updated weights on worker 0-0, policy_version 662613 (0.00085) [2022-07-10 09:21:02,185][25689] Fps is (10 sec: 5392.5, 60 sec: 5500.5, 300 sec: 5550.0). Total num frames: 678519808. Throughput: 0: 4973.6. Samples: 678516874. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:02,187][25689] Avg episode reward: [(0, '-4.787')] [2022-07-10 09:21:03,068][26022] Updated weights on worker 0-0, policy_version 662623 (0.00091) [2022-07-10 09:21:04,963][26022] Updated weights on worker 0-0, policy_version 662633 (0.00093) [2022-07-10 09:21:06,807][26022] Updated weights on worker 0-0, policy_version 662643 (0.00092) [2022-07-10 09:21:07,231][25689] Fps is (10 sec: 5295.3, 60 sec: 5515.3, 300 sec: 5549.7). Total num frames: 678547456. Throughput: 0: 5707.8. Samples: 678548470. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:07,232][25689] Avg episode reward: [(0, '-4.482')] [2022-07-10 09:21:08,677][26022] Updated weights on worker 0-0, policy_version 662653 (0.00102) [2022-07-10 09:21:10,488][26022] Updated weights on worker 0-0, policy_version 662663 (0.00089) [2022-07-10 09:21:12,247][25689] Fps is (10 sec: 5597.1, 60 sec: 5516.0, 300 sec: 5551.8). Total num frames: 678576128. Throughput: 0: 5696.1. Samples: 678581688. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:12,248][25689] Avg episode reward: [(0, '-4.443')] [2022-07-10 09:21:12,430][26022] Updated weights on worker 0-0, policy_version 662673 (0.00090) [2022-07-10 09:21:14,019][26022] Updated weights on worker 0-0, policy_version 662683 (0.00095) [2022-07-10 09:21:16,216][26022] Updated weights on worker 0-0, policy_version 662693 (0.00087) [2022-07-10 09:21:17,343][25689] Fps is (10 sec: 5671.2, 60 sec: 5546.1, 300 sec: 5550.4). Total num frames: 678604800. Throughput: 0: 4862.8. Samples: 678598238. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:17,344][25689] Avg episode reward: [(0, '-4.840')] [2022-07-10 09:21:17,842][26022] Updated weights on worker 0-0, policy_version 662703 (0.00092) [2022-07-10 09:21:19,887][26022] Updated weights on worker 0-0, policy_version 662713 (0.00089) [2022-07-10 09:21:21,355][26022] Updated weights on worker 0-0, policy_version 662723 (0.00082) [2022-07-10 09:21:22,358][25689] Fps is (10 sec: 5469.0, 60 sec: 5529.2, 300 sec: 5547.9). Total num frames: 678631424. Throughput: 0: 5700.6. Samples: 678631872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:22,360][25689] Avg episode reward: [(0, '-4.556')] [2022-07-10 09:21:23,370][26022] Updated weights on worker 0-0, policy_version 662733 (0.00080) [2022-07-10 09:21:25,199][26022] Updated weights on worker 0-0, policy_version 662743 (0.00082) [2022-07-10 09:21:27,120][26022] Updated weights on worker 0-0, policy_version 662753 (0.00094) [2022-07-10 09:21:27,384][25689] Fps is (10 sec: 5506.7, 60 sec: 5527.6, 300 sec: 5540.6). Total num frames: 678660096. Throughput: 0: 5811.4. Samples: 678665588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:27,386][25689] Avg episode reward: [(0, '-4.854')] [2022-07-10 09:21:28,768][26022] Updated weights on worker 0-0, policy_version 662763 (0.00084) [2022-07-10 09:21:30,840][26022] Updated weights on worker 0-0, policy_version 662773 (0.00357) [2022-07-10 09:21:32,395][25689] Fps is (10 sec: 5713.5, 60 sec: 5545.0, 300 sec: 5550.5). Total num frames: 678688768. Throughput: 0: 4993.1. Samples: 678682286. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:32,395][25689] Avg episode reward: [(0, '-3.412')] [2022-07-10 09:21:32,466][26022] Updated weights on worker 0-0, policy_version 662783 (0.00086) [2022-07-10 09:21:34,494][26022] Updated weights on worker 0-0, policy_version 662793 (0.00085) [2022-07-10 09:21:36,223][26022] Updated weights on worker 0-0, policy_version 662803 (0.00081) [2022-07-10 09:21:37,434][25689] Fps is (10 sec: 5604.3, 60 sec: 5536.8, 300 sec: 5543.2). Total num frames: 678716416. Throughput: 0: 5870.0. Samples: 678716172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:37,436][25689] Avg episode reward: [(0, '-3.648')] [2022-07-10 09:21:37,899][26022] Updated weights on worker 0-0, policy_version 662813 (0.00097) [2022-07-10 09:21:39,915][26022] Updated weights on worker 0-0, policy_version 662823 (0.00096) [2022-07-10 09:21:41,812][26022] Updated weights on worker 0-0, policy_version 662833 (0.00088) [2022-07-10 09:21:42,455][25689] Fps is (10 sec: 5496.7, 60 sec: 5552.4, 300 sec: 5543.5). Total num frames: 678744064. Throughput: 0: 5850.3. Samples: 678749442. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:42,455][25689] Avg episode reward: [(0, '-3.655')] [2022-07-10 09:21:43,471][26022] Updated weights on worker 0-0, policy_version 662843 (0.00089) [2022-07-10 09:21:45,604][26022] Updated weights on worker 0-0, policy_version 662853 (0.00083) [2022-07-10 09:21:47,044][26022] Updated weights on worker 0-0, policy_version 662863 (0.00088) [2022-07-10 09:21:47,462][25689] Fps is (10 sec: 5718.1, 60 sec: 5543.0, 300 sec: 5551.2). Total num frames: 678773760. Throughput: 0: 5020.9. Samples: 678766396. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:47,463][25689] Avg episode reward: [(0, '-4.172')] [2022-07-10 09:21:49,155][26022] Updated weights on worker 0-0, policy_version 662873 (0.00090) [2022-07-10 09:21:50,679][26022] Updated weights on worker 0-0, policy_version 662883 (0.00089) [2022-07-10 09:21:52,474][25689] Fps is (10 sec: 5620.8, 60 sec: 5548.1, 300 sec: 5542.4). Total num frames: 678800384. Throughput: 0: 5870.9. Samples: 678800172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:52,475][25689] Avg episode reward: [(0, '-3.699')] [2022-07-10 09:21:52,662][26022] Updated weights on worker 0-0, policy_version 662893 (0.00083) [2022-07-10 09:21:54,430][26022] Updated weights on worker 0-0, policy_version 662903 (0.00089) [2022-07-10 09:21:56,373][26022] Updated weights on worker 0-0, policy_version 662913 (0.00091) [2022-07-10 09:21:57,519][25689] Fps is (10 sec: 5498.6, 60 sec: 5549.1, 300 sec: 5545.6). Total num frames: 678829056. Throughput: 0: 5853.3. Samples: 678833734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:21:57,519][25689] Avg episode reward: [(0, '-2.454')] [2022-07-10 09:21:58,131][26022] Updated weights on worker 0-0, policy_version 662923 (0.00094) [2022-07-10 09:22:00,096][26022] Updated weights on worker 0-0, policy_version 662933 (0.00091) [2022-07-10 09:22:02,160][26022] Updated weights on worker 0-0, policy_version 662943 (0.00085) [2022-07-10 09:22:02,541][25689] Fps is (10 sec: 5391.2, 60 sec: 5547.9, 300 sec: 5549.8). Total num frames: 678854656. Throughput: 0: 5029.3. Samples: 678850464. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:02,542][25689] Avg episode reward: [(0, '-3.560')] [2022-07-10 09:22:04,071][26022] Updated weights on worker 0-0, policy_version 662953 (0.00084) [2022-07-10 09:22:06,100][26022] Updated weights on worker 0-0, policy_version 662963 (0.00086) [2022-07-10 09:22:07,567][25689] Fps is (10 sec: 5401.2, 60 sec: 5566.7, 300 sec: 5543.6). Total num frames: 678883328. Throughput: 0: 5741.4. Samples: 678881824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:07,567][25689] Avg episode reward: [(0, '-3.043')] [2022-07-10 09:22:07,567][26022] Updated weights on worker 0-0, policy_version 662973 (0.00088) [2022-07-10 09:22:09,755][26022] Updated weights on worker 0-0, policy_version 662983 (0.00087) [2022-07-10 09:22:11,278][26022] Updated weights on worker 0-0, policy_version 662993 (0.00096) [2022-07-10 09:22:12,605][25689] Fps is (10 sec: 5596.1, 60 sec: 5547.7, 300 sec: 5547.9). Total num frames: 678910976. Throughput: 0: 5714.0. Samples: 678915200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:12,607][25689] Avg episode reward: [(0, '-2.934')] [2022-07-10 09:22:13,406][26022] Updated weights on worker 0-0, policy_version 663003 (0.00087) [2022-07-10 09:22:15,025][26022] Updated weights on worker 0-0, policy_version 663013 (0.00095) [2022-07-10 09:22:16,897][26022] Updated weights on worker 0-0, policy_version 663023 (0.00094) [2022-07-10 09:22:17,687][25689] Fps is (10 sec: 5565.2, 60 sec: 5549.0, 300 sec: 5546.4). Total num frames: 678939648. Throughput: 0: 5701.3. Samples: 678948718. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:17,687][25689] Avg episode reward: [(0, '-3.401')] [2022-07-10 09:22:18,657][26022] Updated weights on worker 0-0, policy_version 663033 (0.00091) [2022-07-10 09:22:19,780][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:22:19,793][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000663038_678950912.pth [2022-07-10 09:22:19,794][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000661087_676953088.pth [2022-07-10 09:22:20,547][26022] Updated weights on worker 0-0, policy_version 663043 (0.00096) [2022-07-10 09:22:22,444][26022] Updated weights on worker 0-0, policy_version 663053 (0.00087) [2022-07-10 09:22:22,700][25689] Fps is (10 sec: 5478.0, 60 sec: 5549.2, 300 sec: 5539.8). Total num frames: 678966272. Throughput: 0: 5703.4. Samples: 678965434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:22,701][25689] Avg episode reward: [(0, '-3.959')] [2022-07-10 09:22:24,268][26022] Updated weights on worker 0-0, policy_version 663063 (0.00086) [2022-07-10 09:22:26,009][26022] Updated weights on worker 0-0, policy_version 663073 (0.00090) [2022-07-10 09:22:27,706][25689] Fps is (10 sec: 5518.8, 60 sec: 5551.0, 300 sec: 5543.3). Total num frames: 678994944. Throughput: 0: 5826.6. Samples: 678999170. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:27,707][25689] Avg episode reward: [(0, '-4.509')] [2022-07-10 09:22:27,902][26022] Updated weights on worker 0-0, policy_version 663083 (0.00098) [2022-07-10 09:22:29,771][26022] Updated weights on worker 0-0, policy_version 663093 (0.00086) [2022-07-10 09:22:31,551][26022] Updated weights on worker 0-0, policy_version 663103 (0.00091) [2022-07-10 09:22:32,731][25689] Fps is (10 sec: 5614.5, 60 sec: 5532.7, 300 sec: 5537.8). Total num frames: 679022592. Throughput: 0: 5833.7. Samples: 679032604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 09:22:32,731][25689] Avg episode reward: [(0, '-4.896')] [2022-07-10 09:22:33,492][26022] Updated weights on worker 0-0, policy_version 663113 (0.00092) [2022-07-10 09:22:35,167][26022] Updated weights on worker 0-0, policy_version 663123 (0.00079) [2022-07-10 09:22:37,116][26022] Updated weights on worker 0-0, policy_version 663133 (0.00088) [2022-07-10 09:22:37,817][25689] Fps is (10 sec: 5671.5, 60 sec: 5562.3, 300 sec: 5544.4). Total num frames: 679052288. Throughput: 0: 5002.8. Samples: 679049426. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:22:37,819][25689] Avg episode reward: [(0, '-4.728')] [2022-07-10 09:22:38,891][26022] Updated weights on worker 0-0, policy_version 663143 (0.00095) [2022-07-10 09:22:40,733][26022] Updated weights on worker 0-0, policy_version 663153 (0.00086) [2022-07-10 09:22:42,654][26022] Updated weights on worker 0-0, policy_version 663163 (0.00090) [2022-07-10 09:22:42,853][25689] Fps is (10 sec: 5563.8, 60 sec: 5544.0, 300 sec: 5540.3). Total num frames: 679078912. Throughput: 0: 5840.7. Samples: 679083146. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:22:42,854][25689] Avg episode reward: [(0, '-5.394')] [2022-07-10 09:22:44,381][26022] Updated weights on worker 0-0, policy_version 663173 (0.00094) [2022-07-10 09:22:46,283][26022] Updated weights on worker 0-0, policy_version 663183 (0.00083) [2022-07-10 09:22:47,883][25689] Fps is (10 sec: 5595.3, 60 sec: 5541.9, 300 sec: 5540.3). Total num frames: 679108608. Throughput: 0: 5834.8. Samples: 679116896. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:22:47,883][25689] Avg episode reward: [(0, '-4.586')] [2022-07-10 09:22:47,929][26022] Updated weights on worker 0-0, policy_version 663193 (0.00085) [2022-07-10 09:22:50,054][26022] Updated weights on worker 0-0, policy_version 663203 (0.00086) [2022-07-10 09:22:51,706][26022] Updated weights on worker 0-0, policy_version 663213 (0.00079) [2022-07-10 09:22:52,903][25689] Fps is (10 sec: 5705.8, 60 sec: 5558.1, 300 sec: 5541.6). Total num frames: 679136256. Throughput: 0: 5027.6. Samples: 679134024. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:22:52,904][25689] Avg episode reward: [(0, '-5.034')] [2022-07-10 09:22:53,544][26022] Updated weights on worker 0-0, policy_version 663223 (0.00087) [2022-07-10 09:22:55,227][26022] Updated weights on worker 0-0, policy_version 663233 (0.00057) [2022-07-10 09:22:57,089][26022] Updated weights on worker 0-0, policy_version 663243 (0.00096) [2022-07-10 09:22:57,949][25689] Fps is (10 sec: 5696.5, 60 sec: 5574.9, 300 sec: 5545.1). Total num frames: 679165952. Throughput: 0: 5897.3. Samples: 679168152. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:22:57,950][25689] Avg episode reward: [(0, '-5.719')] [2022-07-10 09:22:59,014][26022] Updated weights on worker 0-0, policy_version 663253 (0.00088) [2022-07-10 09:23:00,648][26022] Updated weights on worker 0-0, policy_version 663263 (0.00090) [2022-07-10 09:23:02,777][26022] Updated weights on worker 0-0, policy_version 663273 (0.00087) [2022-07-10 09:23:02,961][25689] Fps is (10 sec: 5498.2, 60 sec: 5576.0, 300 sec: 5545.0). Total num frames: 679191552. Throughput: 0: 5810.7. Samples: 679199984. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:02,961][25689] Avg episode reward: [(0, '-6.475')] [2022-07-10 09:23:04,620][26022] Updated weights on worker 0-0, policy_version 663283 (0.00091) [2022-07-10 09:23:06,486][26022] Updated weights on worker 0-0, policy_version 663293 (0.00084) [2022-07-10 09:23:07,973][25689] Fps is (10 sec: 5414.2, 60 sec: 5577.1, 300 sec: 5549.8). Total num frames: 679220224. Throughput: 0: 4990.1. Samples: 679217150. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:07,974][25689] Avg episode reward: [(0, '-6.044')] [2022-07-10 09:23:08,241][26022] Updated weights on worker 0-0, policy_version 663303 (0.00087) [2022-07-10 09:23:09,952][26022] Updated weights on worker 0-0, policy_version 663313 (0.00090) [2022-07-10 09:23:11,864][26022] Updated weights on worker 0-0, policy_version 663323 (0.00063) [2022-07-10 09:23:12,979][25689] Fps is (10 sec: 5724.0, 60 sec: 5597.2, 300 sec: 5547.7). Total num frames: 679248896. Throughput: 0: 5831.9. Samples: 679251100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:12,979][25689] Avg episode reward: [(0, '-6.288')] [2022-07-10 09:23:13,614][26022] Updated weights on worker 0-0, policy_version 663333 (0.00089) [2022-07-10 09:23:15,612][26022] Updated weights on worker 0-0, policy_version 663343 (0.00085) [2022-07-10 09:23:17,324][26022] Updated weights on worker 0-0, policy_version 663353 (0.00091) [2022-07-10 09:23:18,087][25689] Fps is (10 sec: 5467.6, 60 sec: 5560.8, 300 sec: 5539.2). Total num frames: 679275520. Throughput: 0: 5791.7. Samples: 679284782. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:18,087][25689] Avg episode reward: [(0, '-5.834')] [2022-07-10 09:23:19,177][26022] Updated weights on worker 0-0, policy_version 663363 (0.00079) [2022-07-10 09:23:21,273][26022] Updated weights on worker 0-0, policy_version 663373 (0.00086) [2022-07-10 09:23:22,691][26022] Updated weights on worker 0-0, policy_version 663383 (0.00094) [2022-07-10 09:23:23,098][25689] Fps is (10 sec: 5768.0, 60 sec: 5645.7, 300 sec: 5557.8). Total num frames: 679307264. Throughput: 0: 5050.4. Samples: 679301686. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:23,098][25689] Avg episode reward: [(0, '-4.622')] [2022-07-10 09:23:24,724][26022] Updated weights on worker 0-0, policy_version 663393 (0.00090) [2022-07-10 09:23:26,440][26022] Updated weights on worker 0-0, policy_version 663403 (0.00092) [2022-07-10 09:23:28,145][25689] Fps is (10 sec: 5701.5, 60 sec: 5591.2, 300 sec: 5550.4). Total num frames: 679332864. Throughput: 0: 5857.2. Samples: 679335296. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:28,145][25689] Avg episode reward: [(0, '-4.228')] [2022-07-10 09:23:28,339][26022] Updated weights on worker 0-0, policy_version 663413 (0.00085) [2022-07-10 09:23:30,181][26022] Updated weights on worker 0-0, policy_version 663423 (0.00085) [2022-07-10 09:23:31,819][26022] Updated weights on worker 0-0, policy_version 663433 (0.00085) [2022-07-10 09:23:33,218][25689] Fps is (10 sec: 5160.2, 60 sec: 5569.7, 300 sec: 5537.7). Total num frames: 679359488. Throughput: 0: 5816.8. Samples: 679368830. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:33,219][25689] Avg episode reward: [(0, '-6.538')] [2022-07-10 09:23:33,785][26022] Updated weights on worker 0-0, policy_version 663443 (0.00085) [2022-07-10 09:23:35,758][26022] Updated weights on worker 0-0, policy_version 663453 (0.00093) [2022-07-10 09:23:37,535][26022] Updated weights on worker 0-0, policy_version 663463 (0.00087) [2022-07-10 09:23:38,276][25689] Fps is (10 sec: 5761.3, 60 sec: 5606.2, 300 sec: 5558.1). Total num frames: 679391232. Throughput: 0: 5004.0. Samples: 679385806. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:38,276][25689] Avg episode reward: [(0, '-5.998')] [2022-07-10 09:23:39,503][26022] Updated weights on worker 0-0, policy_version 663473 (0.00088) [2022-07-10 09:23:41,166][26022] Updated weights on worker 0-0, policy_version 663483 (0.00091) [2022-07-10 09:23:43,086][26022] Updated weights on worker 0-0, policy_version 663493 (0.00092) [2022-07-10 09:23:43,339][25689] Fps is (10 sec: 5767.5, 60 sec: 5603.7, 300 sec: 5546.8). Total num frames: 679417856. Throughput: 0: 5796.9. Samples: 679419020. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:43,339][25689] Avg episode reward: [(0, '-5.476')] [2022-07-10 09:23:44,883][26022] Updated weights on worker 0-0, policy_version 663503 (0.00084) [2022-07-10 09:23:46,542][26022] Updated weights on worker 0-0, policy_version 663513 (0.00089) [2022-07-10 09:23:48,352][25689] Fps is (10 sec: 5386.3, 60 sec: 5571.4, 300 sec: 5547.2). Total num frames: 679445504. Throughput: 0: 5827.7. Samples: 679453058. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:48,352][25689] Avg episode reward: [(0, '-5.171')] [2022-07-10 09:23:48,579][26022] Updated weights on worker 0-0, policy_version 663523 (0.00092) [2022-07-10 09:23:50,036][26022] Updated weights on worker 0-0, policy_version 663533 (0.00096) [2022-07-10 09:23:52,348][26022] Updated weights on worker 0-0, policy_version 663543 (0.00084) [2022-07-10 09:23:53,356][25689] Fps is (10 sec: 5724.3, 60 sec: 5606.8, 300 sec: 5558.9). Total num frames: 679475200. Throughput: 0: 5021.1. Samples: 679469944. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:53,358][25689] Avg episode reward: [(0, '-7.714')] [2022-07-10 09:23:54,337][26022] Updated weights on worker 0-0, policy_version 663553 (0.00086) [2022-07-10 09:23:55,741][26022] Updated weights on worker 0-0, policy_version 663563 (0.00084) [2022-07-10 09:23:57,673][26022] Updated weights on worker 0-0, policy_version 663573 (0.00086) [2022-07-10 09:23:58,428][25689] Fps is (10 sec: 5690.8, 60 sec: 5570.5, 300 sec: 5554.5). Total num frames: 679502848. Throughput: 0: 5837.9. Samples: 679503454. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:23:58,429][25689] Avg episode reward: [(0, '-6.019')] [2022-07-10 09:23:59,491][26022] Updated weights on worker 0-0, policy_version 663583 (0.00090) [2022-07-10 09:24:01,223][26022] Updated weights on worker 0-0, policy_version 663593 (0.00081) [2022-07-10 09:24:03,445][25689] Fps is (10 sec: 5379.3, 60 sec: 5586.9, 300 sec: 5554.3). Total num frames: 679529472. Throughput: 0: 5780.1. Samples: 679535238. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:03,446][25689] Avg episode reward: [(0, '-6.512')] [2022-07-10 09:24:03,451][26022] Updated weights on worker 0-0, policy_version 663603 (0.00082) [2022-07-10 09:24:05,211][26022] Updated weights on worker 0-0, policy_version 663613 (0.00085) [2022-07-10 09:24:07,064][26022] Updated weights on worker 0-0, policy_version 663623 (0.00082) [2022-07-10 09:24:08,473][25689] Fps is (10 sec: 5402.9, 60 sec: 5568.6, 300 sec: 5554.1). Total num frames: 679557120. Throughput: 0: 4923.5. Samples: 679552126. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:08,474][25689] Avg episode reward: [(0, '-7.111')] [2022-07-10 09:24:09,099][26022] Updated weights on worker 0-0, policy_version 663633 (0.00087) [2022-07-10 09:24:10,621][26022] Updated weights on worker 0-0, policy_version 663643 (0.00093) [2022-07-10 09:24:12,665][26022] Updated weights on worker 0-0, policy_version 663653 (0.00084) [2022-07-10 09:24:13,559][25689] Fps is (10 sec: 5568.7, 60 sec: 5561.1, 300 sec: 5557.0). Total num frames: 679585792. Throughput: 0: 5738.4. Samples: 679585876. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:13,559][25689] Avg episode reward: [(0, '-7.728')] [2022-07-10 09:24:14,195][26022] Updated weights on worker 0-0, policy_version 663663 (0.00086) [2022-07-10 09:24:16,231][26022] Updated weights on worker 0-0, policy_version 663673 (0.00087) [2022-07-10 09:24:18,028][26022] Updated weights on worker 0-0, policy_version 663683 (0.00091) [2022-07-10 09:24:18,657][25689] Fps is (10 sec: 5530.3, 60 sec: 5579.0, 300 sec: 5552.0). Total num frames: 679613440. Throughput: 0: 5740.6. Samples: 679619580. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:18,657][25689] Avg episode reward: [(0, '-5.746')] [2022-07-10 09:24:19,763][26022] Updated weights on worker 0-0, policy_version 663693 (0.00092) [2022-07-10 09:24:20,036][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:24:20,050][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000663694_679622656.pth [2022-07-10 09:24:20,051][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000661739_677620736.pth [2022-07-10 09:24:21,582][26022] Updated weights on worker 0-0, policy_version 663703 (0.00087) [2022-07-10 09:24:23,664][25689] Fps is (10 sec: 5573.7, 60 sec: 5528.7, 300 sec: 5559.4). Total num frames: 679642112. Throughput: 0: 5011.5. Samples: 679636562. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:23,664][25689] Avg episode reward: [(0, '-6.596')] [2022-07-10 09:24:23,671][26022] Updated weights on worker 0-0, policy_version 663713 (0.00092) [2022-07-10 09:24:25,251][26022] Updated weights on worker 0-0, policy_version 663723 (0.00427) [2022-07-10 09:24:27,290][26022] Updated weights on worker 0-0, policy_version 663733 (0.00096) [2022-07-10 09:24:28,756][25689] Fps is (10 sec: 5779.4, 60 sec: 5592.1, 300 sec: 5558.0). Total num frames: 679671808. Throughput: 0: 5822.2. Samples: 679670218. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:28,757][25689] Avg episode reward: [(0, '-5.956')] [2022-07-10 09:24:29,051][26022] Updated weights on worker 0-0, policy_version 663743 (0.00094) [2022-07-10 09:24:30,875][26022] Updated weights on worker 0-0, policy_version 663753 (0.00094) [2022-07-10 09:24:32,607][26022] Updated weights on worker 0-0, policy_version 663763 (0.00080) [2022-07-10 09:24:33,763][25689] Fps is (10 sec: 5576.5, 60 sec: 5598.2, 300 sec: 5553.5). Total num frames: 679698432. Throughput: 0: 5827.7. Samples: 679703620. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:33,764][25689] Avg episode reward: [(0, '-5.183')] [2022-07-10 09:24:34,626][26022] Updated weights on worker 0-0, policy_version 663773 (0.00085) [2022-07-10 09:24:36,458][26022] Updated weights on worker 0-0, policy_version 663783 (0.00092) [2022-07-10 09:24:38,413][26022] Updated weights on worker 0-0, policy_version 663793 (0.00092) [2022-07-10 09:24:38,894][25689] Fps is (10 sec: 5454.6, 60 sec: 5540.8, 300 sec: 5558.6). Total num frames: 679727104. Throughput: 0: 4980.7. Samples: 679720374. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:38,894][25689] Avg episode reward: [(0, '-4.628')] [2022-07-10 09:24:40,196][26022] Updated weights on worker 0-0, policy_version 663803 (0.00089) [2022-07-10 09:24:42,154][26022] Updated weights on worker 0-0, policy_version 663813 (0.00085) [2022-07-10 09:24:43,718][26022] Updated weights on worker 0-0, policy_version 663823 (0.00085) [2022-07-10 09:24:43,914][25689] Fps is (10 sec: 5548.3, 60 sec: 5561.6, 300 sec: 5559.2). Total num frames: 679754752. Throughput: 0: 5771.7. Samples: 679753440. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:43,914][25689] Avg episode reward: [(0, '-5.647')] [2022-07-10 09:24:45,672][26022] Updated weights on worker 0-0, policy_version 663833 (0.00096) [2022-07-10 09:24:47,432][26022] Updated weights on worker 0-0, policy_version 663843 (0.00093) [2022-07-10 09:24:48,988][25689] Fps is (10 sec: 5579.6, 60 sec: 5572.9, 300 sec: 5554.5). Total num frames: 679783424. Throughput: 0: 5775.9. Samples: 679787072. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:48,988][25689] Avg episode reward: [(0, '-6.442')] [2022-07-10 09:24:49,195][26022] Updated weights on worker 0-0, policy_version 663853 (0.00091) [2022-07-10 09:24:51,242][26022] Updated weights on worker 0-0, policy_version 663863 (0.00091) [2022-07-10 09:24:53,025][26022] Updated weights on worker 0-0, policy_version 663873 (0.00082) [2022-07-10 09:24:54,039][25689] Fps is (10 sec: 5461.3, 60 sec: 5518.0, 300 sec: 5547.4). Total num frames: 679810048. Throughput: 0: 5764.8. Samples: 679820506. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:54,041][25689] Avg episode reward: [(0, '-8.325')] [2022-07-10 09:24:54,835][26022] Updated weights on worker 0-0, policy_version 663883 (0.00086) [2022-07-10 09:24:56,491][26022] Updated weights on worker 0-0, policy_version 663893 (0.00096) [2022-07-10 09:24:58,403][26022] Updated weights on worker 0-0, policy_version 663903 (0.00082) [2022-07-10 09:24:59,142][25689] Fps is (10 sec: 5546.3, 60 sec: 5548.9, 300 sec: 5556.1). Total num frames: 679839744. Throughput: 0: 5774.6. Samples: 679837300. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:24:59,143][25689] Avg episode reward: [(0, '-7.710')] [2022-07-10 09:25:00,375][26022] Updated weights on worker 0-0, policy_version 663913 (0.00093) [2022-07-10 09:25:02,591][26022] Updated weights on worker 0-0, policy_version 663923 (0.00097) [2022-07-10 09:25:04,164][25689] Fps is (10 sec: 5461.1, 60 sec: 5531.6, 300 sec: 5552.7). Total num frames: 679865344. Throughput: 0: 5687.1. Samples: 679868608. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:04,166][25689] Avg episode reward: [(0, '-7.917')] [2022-07-10 09:25:04,539][26022] Updated weights on worker 0-0, policy_version 663933 (0.00081) [2022-07-10 09:25:06,235][26022] Updated weights on worker 0-0, policy_version 663943 (0.00090) [2022-07-10 09:25:08,193][26022] Updated weights on worker 0-0, policy_version 663953 (0.00086) [2022-07-10 09:25:09,211][25689] Fps is (10 sec: 5288.6, 60 sec: 5529.9, 300 sec: 5548.9). Total num frames: 679892992. Throughput: 0: 5698.2. Samples: 679902306. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:09,211][25689] Avg episode reward: [(0, '-7.543')] [2022-07-10 09:25:09,840][26022] Updated weights on worker 0-0, policy_version 663963 (0.00085) [2022-07-10 09:25:11,960][26022] Updated weights on worker 0-0, policy_version 663973 (0.00088) [2022-07-10 09:25:13,447][26022] Updated weights on worker 0-0, policy_version 663983 (0.00085) [2022-07-10 09:25:14,304][25689] Fps is (10 sec: 5655.7, 60 sec: 5546.1, 300 sec: 5558.5). Total num frames: 679922688. Throughput: 0: 4854.9. Samples: 679918888. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:14,305][25689] Avg episode reward: [(0, '-6.924')] [2022-07-10 09:25:15,561][26022] Updated weights on worker 0-0, policy_version 663993 (0.00091) [2022-07-10 09:25:17,222][26022] Updated weights on worker 0-0, policy_version 664003 (0.00087) [2022-07-10 09:25:18,998][26022] Updated weights on worker 0-0, policy_version 664013 (0.00086) [2022-07-10 09:25:19,390][25689] Fps is (10 sec: 5633.5, 60 sec: 5547.2, 300 sec: 5557.2). Total num frames: 679950336. Throughput: 0: 5694.4. Samples: 679952598. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:19,390][25689] Avg episode reward: [(0, '-5.728')] [2022-07-10 09:25:20,904][26022] Updated weights on worker 0-0, policy_version 664023 (0.00089) [2022-07-10 09:25:22,804][26022] Updated weights on worker 0-0, policy_version 664033 (0.00092) [2022-07-10 09:25:24,467][25689] Fps is (10 sec: 5541.3, 60 sec: 5540.7, 300 sec: 5555.9). Total num frames: 679979008. Throughput: 0: 5803.1. Samples: 679986428. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:24,468][25689] Avg episode reward: [(0, '-3.854')] [2022-07-10 09:25:24,550][26022] Updated weights on worker 0-0, policy_version 664043 (0.00085) [2022-07-10 09:25:26,463][26022] Updated weights on worker 0-0, policy_version 664053 (0.00088) [2022-07-10 09:25:28,124][26022] Updated weights on worker 0-0, policy_version 664063 (0.00085) [2022-07-10 09:25:29,497][25689] Fps is (10 sec: 5673.4, 60 sec: 5529.6, 300 sec: 5559.0). Total num frames: 680007680. Throughput: 0: 4981.6. Samples: 680003366. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:29,498][25689] Avg episode reward: [(0, '-5.397')] [2022-07-10 09:25:30,013][26022] Updated weights on worker 0-0, policy_version 664073 (0.00083) [2022-07-10 09:25:31,733][26022] Updated weights on worker 0-0, policy_version 664083 (0.00084) [2022-07-10 09:25:33,675][26022] Updated weights on worker 0-0, policy_version 664093 (0.00089) [2022-07-10 09:25:34,518][25689] Fps is (10 sec: 5603.6, 60 sec: 5545.2, 300 sec: 5557.7). Total num frames: 680035328. Throughput: 0: 5844.7. Samples: 680037034. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:34,519][25689] Avg episode reward: [(0, '-6.955')] [2022-07-10 09:25:35,491][26022] Updated weights on worker 0-0, policy_version 664103 (0.00089) [2022-07-10 09:25:37,315][26022] Updated weights on worker 0-0, policy_version 664113 (0.00085) [2022-07-10 09:25:39,069][26022] Updated weights on worker 0-0, policy_version 664123 (0.00085) [2022-07-10 09:25:39,600][25689] Fps is (10 sec: 5676.2, 60 sec: 5566.5, 300 sec: 5566.6). Total num frames: 680065024. Throughput: 0: 5852.1. Samples: 680070868. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:39,600][25689] Avg episode reward: [(0, '-6.351')] [2022-07-10 09:25:41,050][26022] Updated weights on worker 0-0, policy_version 664133 (0.00095) [2022-07-10 09:25:42,857][26022] Updated weights on worker 0-0, policy_version 664143 (0.00087) [2022-07-10 09:25:44,654][25689] Fps is (10 sec: 5556.6, 60 sec: 5546.5, 300 sec: 5553.5). Total num frames: 680091648. Throughput: 0: 5007.0. Samples: 680087502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:44,654][25689] Avg episode reward: [(0, '-5.352')] [2022-07-10 09:25:44,715][26022] Updated weights on worker 0-0, policy_version 664153 (0.00086) [2022-07-10 09:25:46,469][26022] Updated weights on worker 0-0, policy_version 664163 (0.00088) [2022-07-10 09:25:48,321][26022] Updated weights on worker 0-0, policy_version 664173 (0.00067) [2022-07-10 09:25:49,676][25689] Fps is (10 sec: 5487.7, 60 sec: 5551.2, 300 sec: 5561.3). Total num frames: 680120320. Throughput: 0: 5839.2. Samples: 680121194. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 09:25:49,677][25689] Avg episode reward: [(0, '-4.548')] [2022-07-10 09:25:50,124][26022] Updated weights on worker 0-0, policy_version 664183 (0.00086) [2022-07-10 09:25:51,923][26022] Updated weights on worker 0-0, policy_version 664193 (0.00091) [2022-07-10 09:25:53,748][26022] Updated weights on worker 0-0, policy_version 664203 (0.00092) [2022-07-10 09:25:54,694][25689] Fps is (10 sec: 5711.5, 60 sec: 5588.1, 300 sec: 5562.0). Total num frames: 680148992. Throughput: 0: 5843.1. Samples: 680154924. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:25:54,694][25689] Avg episode reward: [(0, '-5.280')] [2022-07-10 09:25:55,571][26022] Updated weights on worker 0-0, policy_version 664213 (0.00083) [2022-07-10 09:25:57,352][26022] Updated weights on worker 0-0, policy_version 664223 (0.00089) [2022-07-10 09:25:59,243][26022] Updated weights on worker 0-0, policy_version 664233 (0.00089) [2022-07-10 09:25:59,759][25689] Fps is (10 sec: 5687.1, 60 sec: 5574.7, 300 sec: 5571.2). Total num frames: 680177664. Throughput: 0: 5003.5. Samples: 680171736. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:25:59,760][25689] Avg episode reward: [(0, '-2.862')] [2022-07-10 09:26:01,001][26022] Updated weights on worker 0-0, policy_version 664243 (0.00090) [2022-07-10 09:26:03,379][26022] Updated weights on worker 0-0, policy_version 664253 (0.00082) [2022-07-10 09:26:04,771][25689] Fps is (10 sec: 5284.2, 60 sec: 5558.8, 300 sec: 5561.6). Total num frames: 680202240. Throughput: 0: 5754.0. Samples: 680203254. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:04,771][25689] Avg episode reward: [(0, '-2.902')] [2022-07-10 09:26:04,975][26022] Updated weights on worker 0-0, policy_version 664263 (0.00083) [2022-07-10 09:26:06,975][26022] Updated weights on worker 0-0, policy_version 664273 (0.00084) [2022-07-10 09:26:08,722][26022] Updated weights on worker 0-0, policy_version 664283 (0.00091) [2022-07-10 09:26:09,793][25689] Fps is (10 sec: 5307.2, 60 sec: 5577.9, 300 sec: 5561.5). Total num frames: 680230912. Throughput: 0: 5760.2. Samples: 680237068. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:09,793][25689] Avg episode reward: [(0, '-2.653')] [2022-07-10 09:26:10,625][26022] Updated weights on worker 0-0, policy_version 664293 (0.00089) [2022-07-10 09:26:12,335][26022] Updated weights on worker 0-0, policy_version 664303 (0.00096) [2022-07-10 09:26:14,210][26022] Updated weights on worker 0-0, policy_version 664313 (0.00097) [2022-07-10 09:26:14,811][25689] Fps is (10 sec: 5609.6, 60 sec: 5551.0, 300 sec: 5559.5). Total num frames: 680258560. Throughput: 0: 4919.1. Samples: 680253880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:14,811][25689] Avg episode reward: [(0, '-4.117')] [2022-07-10 09:26:16,019][26022] Updated weights on worker 0-0, policy_version 664323 (0.00090) [2022-07-10 09:26:17,914][26022] Updated weights on worker 0-0, policy_version 664333 (0.00088) [2022-07-10 09:26:19,775][26022] Updated weights on worker 0-0, policy_version 664343 (0.00090) [2022-07-10 09:26:19,942][25689] Fps is (10 sec: 5549.0, 60 sec: 5563.7, 300 sec: 5564.2). Total num frames: 680287232. Throughput: 0: 5731.2. Samples: 680287408. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:19,943][25689] Avg episode reward: [(0, '-3.769')] [2022-07-10 09:26:20,180][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:26:20,193][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000664346_680290304.pth [2022-07-10 09:26:20,194][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000662389_678286336.pth [2022-07-10 09:26:21,539][26022] Updated weights on worker 0-0, policy_version 664353 (0.00090) [2022-07-10 09:26:23,305][26022] Updated weights on worker 0-0, policy_version 664363 (0.00854) [2022-07-10 09:26:24,946][25689] Fps is (10 sec: 5657.6, 60 sec: 5570.4, 300 sec: 5564.6). Total num frames: 680315904. Throughput: 0: 5847.1. Samples: 680321224. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:24,947][25689] Avg episode reward: [(0, '-3.225')] [2022-07-10 09:26:25,073][26022] Updated weights on worker 0-0, policy_version 664373 (0.00087) [2022-07-10 09:26:27,054][26022] Updated weights on worker 0-0, policy_version 664383 (0.00091) [2022-07-10 09:26:28,707][26022] Updated weights on worker 0-0, policy_version 664393 (0.00099) [2022-07-10 09:26:29,969][25689] Fps is (10 sec: 5719.0, 60 sec: 5571.1, 300 sec: 5564.3). Total num frames: 680344576. Throughput: 0: 5009.7. Samples: 680338146. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:29,970][25689] Avg episode reward: [(0, '-5.327')] [2022-07-10 09:26:30,739][26022] Updated weights on worker 0-0, policy_version 664403 (0.00086) [2022-07-10 09:26:32,372][26022] Updated weights on worker 0-0, policy_version 664413 (0.00085) [2022-07-10 09:26:34,391][26022] Updated weights on worker 0-0, policy_version 664423 (0.00092) [2022-07-10 09:26:35,038][25689] Fps is (10 sec: 5682.6, 60 sec: 5583.6, 300 sec: 5567.2). Total num frames: 680373248. Throughput: 0: 5843.5. Samples: 680372076. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:35,039][25689] Avg episode reward: [(0, '-5.317')] [2022-07-10 09:26:35,957][26022] Updated weights on worker 0-0, policy_version 664433 (0.00089) [2022-07-10 09:26:38,002][26022] Updated weights on worker 0-0, policy_version 664443 (0.00085) [2022-07-10 09:26:39,745][26022] Updated weights on worker 0-0, policy_version 664453 (0.00081) [2022-07-10 09:26:40,083][25689] Fps is (10 sec: 5670.0, 60 sec: 5570.1, 300 sec: 5570.2). Total num frames: 680401920. Throughput: 0: 5880.8. Samples: 680405850. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:40,083][25689] Avg episode reward: [(0, '-4.889')] [2022-07-10 09:26:41,616][26022] Updated weights on worker 0-0, policy_version 664463 (0.00440) [2022-07-10 09:26:43,449][26022] Updated weights on worker 0-0, policy_version 664473 (0.00082) [2022-07-10 09:26:45,115][25689] Fps is (10 sec: 5589.0, 60 sec: 5589.1, 300 sec: 5562.9). Total num frames: 680429568. Throughput: 0: 5036.4. Samples: 680422798. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:45,115][25689] Avg episode reward: [(0, '-4.555')] [2022-07-10 09:26:45,219][26022] Updated weights on worker 0-0, policy_version 664483 (0.00088) [2022-07-10 09:26:46,996][26022] Updated weights on worker 0-0, policy_version 664493 (0.00088) [2022-07-10 09:26:48,914][26022] Updated weights on worker 0-0, policy_version 664503 (0.00095) [2022-07-10 09:26:50,136][25689] Fps is (10 sec: 5500.3, 60 sec: 5572.2, 300 sec: 5566.2). Total num frames: 680457216. Throughput: 0: 5863.6. Samples: 680456396. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:50,136][25689] Avg episode reward: [(0, '-5.452')] [2022-07-10 09:26:50,690][26022] Updated weights on worker 0-0, policy_version 664513 (0.00093) [2022-07-10 09:26:52,652][26022] Updated weights on worker 0-0, policy_version 664523 (0.00086) [2022-07-10 09:26:54,263][26022] Updated weights on worker 0-0, policy_version 664533 (0.00090) [2022-07-10 09:26:55,150][25689] Fps is (10 sec: 5612.3, 60 sec: 5572.6, 300 sec: 5566.7). Total num frames: 680485888. Throughput: 0: 5874.5. Samples: 680490224. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:26:55,150][25689] Avg episode reward: [(0, '-6.373')] [2022-07-10 09:26:56,172][26022] Updated weights on worker 0-0, policy_version 664543 (0.00099) [2022-07-10 09:26:57,875][26022] Updated weights on worker 0-0, policy_version 664553 (0.00084) [2022-07-10 09:26:59,886][26022] Updated weights on worker 0-0, policy_version 664563 (0.00090) [2022-07-10 09:27:00,214][25689] Fps is (10 sec: 5486.7, 60 sec: 5538.8, 300 sec: 5569.4). Total num frames: 680512512. Throughput: 0: 5866.8. Samples: 680523958. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:00,215][25689] Avg episode reward: [(0, '-4.608')] [2022-07-10 09:27:01,918][26022] Updated weights on worker 0-0, policy_version 664573 (0.00095) [2022-07-10 09:27:04,051][26022] Updated weights on worker 0-0, policy_version 664583 (0.00082) [2022-07-10 09:27:05,280][25689] Fps is (10 sec: 5357.4, 60 sec: 5584.6, 300 sec: 5565.2). Total num frames: 680540160. Throughput: 0: 5740.0. Samples: 680538548. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:05,280][25689] Avg episode reward: [(0, '-5.262')] [2022-07-10 09:27:05,624][26022] Updated weights on worker 0-0, policy_version 664593 (0.00094) [2022-07-10 09:27:07,820][26022] Updated weights on worker 0-0, policy_version 664603 (0.00084) [2022-07-10 09:27:09,149][26022] Updated weights on worker 0-0, policy_version 664613 (0.00089) [2022-07-10 09:27:10,310][25689] Fps is (10 sec: 5477.1, 60 sec: 5566.9, 300 sec: 5565.4). Total num frames: 680567808. Throughput: 0: 5731.9. Samples: 680572032. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:10,311][25689] Avg episode reward: [(0, '-6.132')] [2022-07-10 09:27:11,426][26022] Updated weights on worker 0-0, policy_version 664623 (0.00084) [2022-07-10 09:27:13,016][26022] Updated weights on worker 0-0, policy_version 664633 (0.00106) [2022-07-10 09:27:14,991][26022] Updated weights on worker 0-0, policy_version 664643 (0.00083) [2022-07-10 09:27:15,314][25689] Fps is (10 sec: 5613.0, 60 sec: 5585.2, 300 sec: 5566.8). Total num frames: 680596480. Throughput: 0: 5701.5. Samples: 680605190. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:15,314][25689] Avg episode reward: [(0, '-5.306')] [2022-07-10 09:27:16,911][26022] Updated weights on worker 0-0, policy_version 664653 (0.00684) [2022-07-10 09:27:18,678][26022] Updated weights on worker 0-0, policy_version 664663 (0.00090) [2022-07-10 09:27:20,437][25689] Fps is (10 sec: 5561.2, 60 sec: 5569.0, 300 sec: 5568.2). Total num frames: 680624128. Throughput: 0: 4848.1. Samples: 680622002. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:20,438][25689] Avg episode reward: [(0, '-5.033')] [2022-07-10 09:27:20,622][26022] Updated weights on worker 0-0, policy_version 664673 (0.00086) [2022-07-10 09:27:22,203][26022] Updated weights on worker 0-0, policy_version 664683 (0.00096) [2022-07-10 09:27:24,213][26022] Updated weights on worker 0-0, policy_version 664693 (0.00090) [2022-07-10 09:27:25,484][25689] Fps is (10 sec: 5537.4, 60 sec: 5565.0, 300 sec: 5567.4). Total num frames: 680652800. Throughput: 0: 5778.4. Samples: 680655298. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:25,485][25689] Avg episode reward: [(0, '-5.752')] [2022-07-10 09:27:26,087][26022] Updated weights on worker 0-0, policy_version 664703 (0.00094) [2022-07-10 09:27:27,835][26022] Updated weights on worker 0-0, policy_version 664713 (0.00097) [2022-07-10 09:27:29,776][26022] Updated weights on worker 0-0, policy_version 664723 (0.00085) [2022-07-10 09:27:30,503][25689] Fps is (10 sec: 5697.2, 60 sec: 5565.4, 300 sec: 5571.0). Total num frames: 680681472. Throughput: 0: 5793.1. Samples: 680689010. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:30,503][25689] Avg episode reward: [(0, '-5.062')] [2022-07-10 09:27:31,410][26022] Updated weights on worker 0-0, policy_version 664733 (0.00098) [2022-07-10 09:27:33,340][26022] Updated weights on worker 0-0, policy_version 664743 (0.00082) [2022-07-10 09:27:35,234][26022] Updated weights on worker 0-0, policy_version 664753 (0.00087) [2022-07-10 09:27:35,506][25689] Fps is (10 sec: 5517.7, 60 sec: 5537.6, 300 sec: 5562.2). Total num frames: 680708096. Throughput: 0: 4982.4. Samples: 680705796. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:35,508][25689] Avg episode reward: [(0, '-4.355')] [2022-07-10 09:27:36,932][26022] Updated weights on worker 0-0, policy_version 664763 (0.00091) [2022-07-10 09:27:38,816][26022] Updated weights on worker 0-0, policy_version 664773 (0.00086) [2022-07-10 09:27:40,591][25689] Fps is (10 sec: 5481.3, 60 sec: 5533.9, 300 sec: 5568.2). Total num frames: 680736768. Throughput: 0: 5828.4. Samples: 680739464. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:40,591][25689] Avg episode reward: [(0, '-5.263')] [2022-07-10 09:27:40,827][26022] Updated weights on worker 0-0, policy_version 664783 (0.00090) [2022-07-10 09:27:42,470][26022] Updated weights on worker 0-0, policy_version 664793 (0.00111) [2022-07-10 09:27:44,480][26022] Updated weights on worker 0-0, policy_version 664803 (0.00081) [2022-07-10 09:27:45,610][25689] Fps is (10 sec: 5675.3, 60 sec: 5552.0, 300 sec: 5564.9). Total num frames: 680765440. Throughput: 0: 5847.0. Samples: 680772974. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:45,612][25689] Avg episode reward: [(0, '-6.714')] [2022-07-10 09:27:45,998][26022] Updated weights on worker 0-0, policy_version 664813 (0.00084) [2022-07-10 09:27:48,168][26022] Updated weights on worker 0-0, policy_version 664823 (0.00080) [2022-07-10 09:27:49,687][26022] Updated weights on worker 0-0, policy_version 664833 (0.00088) [2022-07-10 09:27:50,679][25689] Fps is (10 sec: 5582.7, 60 sec: 5547.6, 300 sec: 5564.0). Total num frames: 680793088. Throughput: 0: 4995.1. Samples: 680789794. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:50,681][25689] Avg episode reward: [(0, '-5.866')] [2022-07-10 09:27:51,723][26022] Updated weights on worker 0-0, policy_version 664843 (0.00095) [2022-07-10 09:27:53,498][26022] Updated weights on worker 0-0, policy_version 664853 (0.00085) [2022-07-10 09:27:55,339][26022] Updated weights on worker 0-0, policy_version 664863 (0.00084) [2022-07-10 09:27:55,685][25689] Fps is (10 sec: 5488.5, 60 sec: 5531.4, 300 sec: 5557.9). Total num frames: 680820736. Throughput: 0: 5829.7. Samples: 680823434. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:27:55,686][25689] Avg episode reward: [(0, '-5.517')] [2022-07-10 09:27:57,174][26022] Updated weights on worker 0-0, policy_version 664873 (0.00096) [2022-07-10 09:27:58,989][26022] Updated weights on worker 0-0, policy_version 664883 (0.00087) [2022-07-10 09:28:00,814][25689] Fps is (10 sec: 5556.8, 60 sec: 5559.3, 300 sec: 5566.0). Total num frames: 680849408. Throughput: 0: 5816.3. Samples: 680857090. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:00,815][25689] Avg episode reward: [(0, '-5.055')] [2022-07-10 09:28:00,888][26022] Updated weights on worker 0-0, policy_version 664893 (0.00776) [2022-07-10 09:28:03,031][26022] Updated weights on worker 0-0, policy_version 664903 (0.00087) [2022-07-10 09:28:04,901][26022] Updated weights on worker 0-0, policy_version 664913 (0.01114) [2022-07-10 09:28:05,841][25689] Fps is (10 sec: 5444.6, 60 sec: 5546.0, 300 sec: 5558.9). Total num frames: 680876032. Throughput: 0: 4881.1. Samples: 680871724. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:05,842][25689] Avg episode reward: [(0, '-4.754')] [2022-07-10 09:28:06,719][26022] Updated weights on worker 0-0, policy_version 664923 (0.00389) [2022-07-10 09:28:08,672][26022] Updated weights on worker 0-0, policy_version 664933 (0.00086) [2022-07-10 09:28:10,283][26022] Updated weights on worker 0-0, policy_version 664943 (0.00088) [2022-07-10 09:28:10,847][25689] Fps is (10 sec: 5511.4, 60 sec: 5565.1, 300 sec: 5558.9). Total num frames: 680904704. Throughput: 0: 5726.2. Samples: 680905282. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:10,848][25689] Avg episode reward: [(0, '-4.040')] [2022-07-10 09:28:12,228][26022] Updated weights on worker 0-0, policy_version 664953 (0.00090) [2022-07-10 09:28:13,910][26022] Updated weights on worker 0-0, policy_version 664963 (0.00090) [2022-07-10 09:28:15,818][26022] Updated weights on worker 0-0, policy_version 664973 (0.00085) [2022-07-10 09:28:15,875][25689] Fps is (10 sec: 5613.2, 60 sec: 5546.0, 300 sec: 5563.8). Total num frames: 680932352. Throughput: 0: 5717.2. Samples: 680938862. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:15,875][25689] Avg episode reward: [(0, '-3.952')] [2022-07-10 09:28:17,612][26022] Updated weights on worker 0-0, policy_version 664983 (0.00088) [2022-07-10 09:28:19,519][26022] Updated weights on worker 0-0, policy_version 664993 (0.00090) [2022-07-10 09:28:20,324][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:28:20,338][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000664997_680956928.pth [2022-07-10 09:28:20,338][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000663038_678950912.pth [2022-07-10 09:28:21,015][25689] Fps is (10 sec: 5538.9, 60 sec: 5561.4, 300 sec: 5551.1). Total num frames: 680961024. Throughput: 0: 4878.0. Samples: 680955630. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:21,016][25689] Avg episode reward: [(0, '-4.232')] [2022-07-10 09:28:21,410][26022] Updated weights on worker 0-0, policy_version 665003 (0.00122) [2022-07-10 09:28:23,084][26022] Updated weights on worker 0-0, policy_version 665013 (0.00093) [2022-07-10 09:28:25,048][26022] Updated weights on worker 0-0, policy_version 665023 (0.00083) [2022-07-10 09:28:26,069][25689] Fps is (10 sec: 5625.0, 60 sec: 5560.7, 300 sec: 5561.2). Total num frames: 680989696. Throughput: 0: 5824.2. Samples: 680989536. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:26,070][25689] Avg episode reward: [(0, '-5.440')] [2022-07-10 09:28:26,742][26022] Updated weights on worker 0-0, policy_version 665033 (0.00086) [2022-07-10 09:28:28,475][26022] Updated weights on worker 0-0, policy_version 665043 (0.00088) [2022-07-10 09:28:30,555][26022] Updated weights on worker 0-0, policy_version 665053 (0.00087) [2022-07-10 09:28:31,074][25689] Fps is (10 sec: 5599.0, 60 sec: 5545.0, 300 sec: 5566.0). Total num frames: 681017344. Throughput: 0: 5817.6. Samples: 681022954. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:31,075][25689] Avg episode reward: [(0, '-5.831')] [2022-07-10 09:28:32,292][26022] Updated weights on worker 0-0, policy_version 665063 (0.00088) [2022-07-10 09:28:34,030][26022] Updated weights on worker 0-0, policy_version 665073 (0.00106) [2022-07-10 09:28:36,026][26022] Updated weights on worker 0-0, policy_version 665083 (0.00083) [2022-07-10 09:28:36,110][25689] Fps is (10 sec: 5507.0, 60 sec: 5559.0, 300 sec: 5552.6). Total num frames: 681044992. Throughput: 0: 4983.1. Samples: 681039696. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:36,111][25689] Avg episode reward: [(0, '-6.396')] [2022-07-10 09:28:37,715][26022] Updated weights on worker 0-0, policy_version 665093 (0.00084) [2022-07-10 09:28:39,739][26022] Updated weights on worker 0-0, policy_version 665103 (0.00091) [2022-07-10 09:28:41,167][25689] Fps is (10 sec: 5580.2, 60 sec: 5561.5, 300 sec: 5559.6). Total num frames: 681073664. Throughput: 0: 5823.2. Samples: 681072978. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:41,168][25689] Avg episode reward: [(0, '-6.056')] [2022-07-10 09:28:41,627][26022] Updated weights on worker 0-0, policy_version 665113 (0.00085) [2022-07-10 09:28:43,348][26022] Updated weights on worker 0-0, policy_version 665123 (0.00091) [2022-07-10 09:28:45,270][26022] Updated weights on worker 0-0, policy_version 665133 (0.00085) [2022-07-10 09:28:46,170][25689] Fps is (10 sec: 5395.0, 60 sec: 5512.3, 300 sec: 5552.9). Total num frames: 681099264. Throughput: 0: 5810.6. Samples: 681106334. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:46,172][25689] Avg episode reward: [(0, '-7.288')] [2022-07-10 09:28:46,916][26022] Updated weights on worker 0-0, policy_version 665143 (0.00086) [2022-07-10 09:28:48,854][26022] Updated weights on worker 0-0, policy_version 665153 (0.00085) [2022-07-10 09:28:50,548][26022] Updated weights on worker 0-0, policy_version 665163 (0.00090) [2022-07-10 09:28:51,203][25689] Fps is (10 sec: 5611.7, 60 sec: 5566.3, 300 sec: 5555.8). Total num frames: 681129984. Throughput: 0: 4986.1. Samples: 681123320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:51,205][25689] Avg episode reward: [(0, '-6.838')] [2022-07-10 09:28:52,601][26022] Updated weights on worker 0-0, policy_version 665173 (0.00084) [2022-07-10 09:28:54,227][26022] Updated weights on worker 0-0, policy_version 665183 (0.00409) [2022-07-10 09:28:56,227][25689] Fps is (10 sec: 5702.1, 60 sec: 5547.8, 300 sec: 5553.3). Total num frames: 681156608. Throughput: 0: 5815.7. Samples: 681156688. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:28:56,227][25689] Avg episode reward: [(0, '-4.575')] [2022-07-10 09:28:56,311][26022] Updated weights on worker 0-0, policy_version 665193 (0.00096) [2022-07-10 09:28:57,980][26022] Updated weights on worker 0-0, policy_version 665203 (0.00085) [2022-07-10 09:28:59,867][26022] Updated weights on worker 0-0, policy_version 665213 (0.00096) [2022-07-10 09:29:01,294][25689] Fps is (10 sec: 5480.2, 60 sec: 5553.5, 300 sec: 5559.2). Total num frames: 681185280. Throughput: 0: 5826.3. Samples: 681190240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:29:01,295][25689] Avg episode reward: [(0, '-4.453')] [2022-07-10 09:29:02,014][26022] Updated weights on worker 0-0, policy_version 665223 (0.00085) [2022-07-10 09:29:03,651][26022] Updated weights on worker 0-0, policy_version 665233 (0.00086) [2022-07-10 09:29:06,028][26022] Updated weights on worker 0-0, policy_version 665243 (0.00081) [2022-07-10 09:29:06,307][25689] Fps is (10 sec: 5383.9, 60 sec: 5537.8, 300 sec: 5552.6). Total num frames: 681210880. Throughput: 0: 4901.3. Samples: 681205032. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 09:29:06,308][25689] Avg episode reward: [(0, '-4.661')] [2022-07-10 09:29:07,307][26022] Updated weights on worker 0-0, policy_version 665253 (0.00088) [2022-07-10 09:29:09,512][26022] Updated weights on worker 0-0, policy_version 665263 (0.00088) [2022-07-10 09:29:11,336][25689] Fps is (10 sec: 5302.2, 60 sec: 5518.8, 300 sec: 5550.2). Total num frames: 681238528. Throughput: 0: 5726.4. Samples: 681238608. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:11,338][25689] Avg episode reward: [(0, '-3.521')] [2022-07-10 09:29:11,356][26022] Updated weights on worker 0-0, policy_version 665273 (0.00086) [2022-07-10 09:29:12,879][26022] Updated weights on worker 0-0, policy_version 665283 (0.00085) [2022-07-10 09:29:14,959][26022] Updated weights on worker 0-0, policy_version 665293 (0.00086) [2022-07-10 09:29:16,356][25689] Fps is (10 sec: 5706.3, 60 sec: 5553.3, 300 sec: 5558.6). Total num frames: 681268224. Throughput: 0: 5752.0. Samples: 681272474. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:16,358][25689] Avg episode reward: [(0, '-3.855')] [2022-07-10 09:29:16,819][26022] Updated weights on worker 0-0, policy_version 665303 (0.00087) [2022-07-10 09:29:18,527][26022] Updated weights on worker 0-0, policy_version 665313 (0.00084) [2022-07-10 09:29:20,490][26022] Updated weights on worker 0-0, policy_version 665323 (0.00094) [2022-07-10 09:29:21,427][25689] Fps is (10 sec: 5784.4, 60 sec: 5559.8, 300 sec: 5557.4). Total num frames: 681296896. Throughput: 0: 4914.3. Samples: 681289180. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:21,427][25689] Avg episode reward: [(0, '-6.257')] [2022-07-10 09:29:22,100][26022] Updated weights on worker 0-0, policy_version 665333 (0.00090) [2022-07-10 09:29:23,974][26022] Updated weights on worker 0-0, policy_version 665343 (0.00087) [2022-07-10 09:29:25,712][26022] Updated weights on worker 0-0, policy_version 665353 (0.00096) [2022-07-10 09:29:26,441][25689] Fps is (10 sec: 5483.0, 60 sec: 5529.4, 300 sec: 5548.5). Total num frames: 681323520. Throughput: 0: 5874.7. Samples: 681323314. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:26,442][25689] Avg episode reward: [(0, '-7.316')] [2022-07-10 09:29:27,798][26022] Updated weights on worker 0-0, policy_version 665363 (0.00104) [2022-07-10 09:29:29,467][26022] Updated weights on worker 0-0, policy_version 665373 (0.00089) [2022-07-10 09:29:31,459][25689] Fps is (10 sec: 5511.9, 60 sec: 5545.3, 300 sec: 5555.2). Total num frames: 681352192. Throughput: 0: 5883.5. Samples: 681357000. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:31,459][25689] Avg episode reward: [(0, '-9.185')] [2022-07-10 09:29:31,460][26022] Updated weights on worker 0-0, policy_version 665383 (0.00086) [2022-07-10 09:29:33,031][26022] Updated weights on worker 0-0, policy_version 665393 (0.00082) [2022-07-10 09:29:34,973][26022] Updated weights on worker 0-0, policy_version 665403 (0.00083) [2022-07-10 09:29:36,477][25689] Fps is (10 sec: 5815.8, 60 sec: 5580.8, 300 sec: 5560.7). Total num frames: 681381888. Throughput: 0: 5887.6. Samples: 681390940. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:36,478][25689] Avg episode reward: [(0, '-9.151')] [2022-07-10 09:29:36,731][26022] Updated weights on worker 0-0, policy_version 665413 (0.00088) [2022-07-10 09:29:38,570][26022] Updated weights on worker 0-0, policy_version 665423 (0.00086) [2022-07-10 09:29:40,389][26022] Updated weights on worker 0-0, policy_version 665433 (0.00085) [2022-07-10 09:29:41,541][25689] Fps is (10 sec: 5687.9, 60 sec: 5563.2, 300 sec: 5559.9). Total num frames: 681409536. Throughput: 0: 5891.7. Samples: 681407684. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:41,541][25689] Avg episode reward: [(0, '-8.371')] [2022-07-10 09:29:42,195][26022] Updated weights on worker 0-0, policy_version 665443 (0.00085) [2022-07-10 09:29:44,058][26022] Updated weights on worker 0-0, policy_version 665453 (0.00094) [2022-07-10 09:29:45,937][26022] Updated weights on worker 0-0, policy_version 665463 (0.00093) [2022-07-10 09:29:46,579][25689] Fps is (10 sec: 5474.2, 60 sec: 5593.9, 300 sec: 5557.2). Total num frames: 681437184. Throughput: 0: 5879.1. Samples: 681441704. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:46,579][25689] Avg episode reward: [(0, '-8.662')] [2022-07-10 09:29:47,412][26022] Updated weights on worker 0-0, policy_version 665473 (0.00092) [2022-07-10 09:29:49,716][26022] Updated weights on worker 0-0, policy_version 665483 (0.00088) [2022-07-10 09:29:51,241][26022] Updated weights on worker 0-0, policy_version 665493 (0.00084) [2022-07-10 09:29:51,594][25689] Fps is (10 sec: 5601.8, 60 sec: 5561.6, 300 sec: 5564.7). Total num frames: 681465856. Throughput: 0: 5871.7. Samples: 681475230. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:51,595][25689] Avg episode reward: [(0, '-6.197')] [2022-07-10 09:29:53,185][26022] Updated weights on worker 0-0, policy_version 665503 (0.00090) [2022-07-10 09:29:54,883][26022] Updated weights on worker 0-0, policy_version 665513 (0.00087) [2022-07-10 09:29:56,606][25689] Fps is (10 sec: 5718.9, 60 sec: 5596.6, 300 sec: 5563.0). Total num frames: 681494528. Throughput: 0: 5032.1. Samples: 681492228. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:29:56,608][25689] Avg episode reward: [(0, '-6.422')] [2022-07-10 09:29:56,720][26022] Updated weights on worker 0-0, policy_version 665523 (0.00095) [2022-07-10 09:29:58,463][26022] Updated weights on worker 0-0, policy_version 665533 (0.00080) [2022-07-10 09:30:00,402][26022] Updated weights on worker 0-0, policy_version 665543 (0.00085) [2022-07-10 09:30:01,671][25689] Fps is (10 sec: 5691.0, 60 sec: 5596.8, 300 sec: 5572.5). Total num frames: 681523200. Throughput: 0: 5896.7. Samples: 681526386. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:01,673][25689] Avg episode reward: [(0, '-5.348')] [2022-07-10 09:30:02,604][26022] Updated weights on worker 0-0, policy_version 665553 (0.00104) [2022-07-10 09:30:04,357][26022] Updated weights on worker 0-0, policy_version 665563 (0.00086) [2022-07-10 09:30:06,251][26022] Updated weights on worker 0-0, policy_version 665573 (0.00083) [2022-07-10 09:30:06,678][25689] Fps is (10 sec: 5490.0, 60 sec: 5614.4, 300 sec: 5569.8). Total num frames: 681549824. Throughput: 0: 5784.2. Samples: 681557962. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:06,678][25689] Avg episode reward: [(0, '-4.735')] [2022-07-10 09:30:07,908][26022] Updated weights on worker 0-0, policy_version 665583 (0.00092) [2022-07-10 09:30:09,822][26022] Updated weights on worker 0-0, policy_version 665593 (0.00085) [2022-07-10 09:30:11,719][25689] Fps is (10 sec: 5299.3, 60 sec: 5596.3, 300 sec: 5560.4). Total num frames: 681576448. Throughput: 0: 4945.5. Samples: 681574756. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:11,719][25689] Avg episode reward: [(0, '-5.203')] [2022-07-10 09:30:11,794][26022] Updated weights on worker 0-0, policy_version 665603 (0.00083) [2022-07-10 09:30:13,500][26022] Updated weights on worker 0-0, policy_version 665613 (0.00088) [2022-07-10 09:30:15,279][26022] Updated weights on worker 0-0, policy_version 665623 (0.00095) [2022-07-10 09:30:16,733][25689] Fps is (10 sec: 5499.2, 60 sec: 5579.9, 300 sec: 5565.2). Total num frames: 681605120. Throughput: 0: 5760.6. Samples: 681608174. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:16,735][25689] Avg episode reward: [(0, '-5.263')] [2022-07-10 09:30:17,284][26022] Updated weights on worker 0-0, policy_version 665633 (0.00088) [2022-07-10 09:30:19,175][26022] Updated weights on worker 0-0, policy_version 665643 (0.00078) [2022-07-10 09:30:20,358][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:30:20,375][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000665650_681625600.pth [2022-07-10 09:30:20,376][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000663694_679622656.pth [2022-07-10 09:30:21,033][26022] Updated weights on worker 0-0, policy_version 665653 (0.00093) [2022-07-10 09:30:21,789][25689] Fps is (10 sec: 5592.6, 60 sec: 5564.2, 300 sec: 5562.2). Total num frames: 681632768. Throughput: 0: 5716.8. Samples: 681641400. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:21,790][25689] Avg episode reward: [(0, '-4.672')] [2022-07-10 09:30:22,814][26022] Updated weights on worker 0-0, policy_version 665663 (0.00084) [2022-07-10 09:30:24,624][26022] Updated weights on worker 0-0, policy_version 665673 (0.00089) [2022-07-10 09:30:26,507][26022] Updated weights on worker 0-0, policy_version 665683 (0.00088) [2022-07-10 09:30:26,827][25689] Fps is (10 sec: 5478.3, 60 sec: 5579.1, 300 sec: 5558.6). Total num frames: 681660416. Throughput: 0: 4983.4. Samples: 681658372. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:26,827][25689] Avg episode reward: [(0, '-5.504')] [2022-07-10 09:30:28,086][26022] Updated weights on worker 0-0, policy_version 665693 (0.00054) [2022-07-10 09:30:30,338][26022] Updated weights on worker 0-0, policy_version 665703 (0.00111) [2022-07-10 09:30:31,807][26022] Updated weights on worker 0-0, policy_version 665713 (0.00088) [2022-07-10 09:30:31,838][25689] Fps is (10 sec: 5706.8, 60 sec: 5596.7, 300 sec: 5565.7). Total num frames: 681690112. Throughput: 0: 5815.3. Samples: 681691754. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:31,838][25689] Avg episode reward: [(0, '-5.892')] [2022-07-10 09:30:33,839][26022] Updated weights on worker 0-0, policy_version 665723 (0.00092) [2022-07-10 09:30:35,587][26022] Updated weights on worker 0-0, policy_version 665733 (0.00092) [2022-07-10 09:30:36,863][25689] Fps is (10 sec: 5611.8, 60 sec: 5545.2, 300 sec: 5556.4). Total num frames: 681716736. Throughput: 0: 5818.8. Samples: 681725304. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:36,863][25689] Avg episode reward: [(0, '-4.557')] [2022-07-10 09:30:37,465][26022] Updated weights on worker 0-0, policy_version 665743 (0.00081) [2022-07-10 09:30:39,336][26022] Updated weights on worker 0-0, policy_version 665753 (0.00085) [2022-07-10 09:30:41,063][26022] Updated weights on worker 0-0, policy_version 665763 (0.00081) [2022-07-10 09:30:41,999][25689] Fps is (10 sec: 5441.7, 60 sec: 5555.4, 300 sec: 5561.7). Total num frames: 681745408. Throughput: 0: 4976.5. Samples: 681741976. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:42,000][25689] Avg episode reward: [(0, '-4.934')] [2022-07-10 09:30:42,815][26022] Updated weights on worker 0-0, policy_version 665773 (0.00086) [2022-07-10 09:30:44,627][26022] Updated weights on worker 0-0, policy_version 665783 (0.00086) [2022-07-10 09:30:46,560][26022] Updated weights on worker 0-0, policy_version 665793 (0.00094) [2022-07-10 09:30:47,019][25689] Fps is (10 sec: 5545.2, 60 sec: 5557.1, 300 sec: 5558.3). Total num frames: 681773056. Throughput: 0: 5816.1. Samples: 681775814. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:47,020][25689] Avg episode reward: [(0, '-3.846')] [2022-07-10 09:30:48,678][26022] Updated weights on worker 0-0, policy_version 665803 (0.00059) [2022-07-10 09:30:50,366][26022] Updated weights on worker 0-0, policy_version 665813 (0.00108) [2022-07-10 09:30:52,033][25689] Fps is (10 sec: 5613.2, 60 sec: 5557.3, 300 sec: 5558.4). Total num frames: 681801728. Throughput: 0: 5797.2. Samples: 681808830. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:52,033][25689] Avg episode reward: [(0, '-2.812')] [2022-07-10 09:30:52,162][26022] Updated weights on worker 0-0, policy_version 665823 (0.00088) [2022-07-10 09:30:54,046][26022] Updated weights on worker 0-0, policy_version 665833 (0.00086) [2022-07-10 09:30:55,815][26022] Updated weights on worker 0-0, policy_version 665843 (0.00095) [2022-07-10 09:30:57,035][25689] Fps is (10 sec: 5623.0, 60 sec: 5541.1, 300 sec: 5556.2). Total num frames: 681829376. Throughput: 0: 4976.1. Samples: 681825684. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:30:57,037][25689] Avg episode reward: [(0, '-3.550')] [2022-07-10 09:30:57,792][26022] Updated weights on worker 0-0, policy_version 665853 (0.00093) [2022-07-10 09:30:59,550][26022] Updated weights on worker 0-0, policy_version 665863 (0.00086) [2022-07-10 09:31:01,518][26022] Updated weights on worker 0-0, policy_version 665873 (0.00087) [2022-07-10 09:31:02,110][25689] Fps is (10 sec: 5385.6, 60 sec: 5506.3, 300 sec: 5561.9). Total num frames: 681856000. Throughput: 0: 5831.4. Samples: 681859250. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:02,111][25689] Avg episode reward: [(0, '-3.945')] [2022-07-10 09:31:03,497][26022] Updated weights on worker 0-0, policy_version 665883 (0.00084) [2022-07-10 09:31:05,344][26022] Updated weights on worker 0-0, policy_version 665893 (0.00086) [2022-07-10 09:31:07,143][25689] Fps is (10 sec: 5369.4, 60 sec: 5520.9, 300 sec: 5558.2). Total num frames: 681883648. Throughput: 0: 5717.4. Samples: 681890868. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:07,143][25689] Avg episode reward: [(0, '-4.258')] [2022-07-10 09:31:07,214][26022] Updated weights on worker 0-0, policy_version 665903 (0.00089) [2022-07-10 09:31:09,162][26022] Updated weights on worker 0-0, policy_version 665913 (0.00089) [2022-07-10 09:31:10,972][26022] Updated weights on worker 0-0, policy_version 665923 (0.00088) [2022-07-10 09:31:12,158][25689] Fps is (10 sec: 5605.2, 60 sec: 5557.2, 300 sec: 5561.7). Total num frames: 681912320. Throughput: 0: 4916.6. Samples: 681907776. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:12,160][25689] Avg episode reward: [(0, '-4.557')] [2022-07-10 09:31:12,743][26022] Updated weights on worker 0-0, policy_version 665933 (0.00092) [2022-07-10 09:31:14,433][26022] Updated weights on worker 0-0, policy_version 665943 (0.00084) [2022-07-10 09:31:16,197][26022] Updated weights on worker 0-0, policy_version 665953 (0.00076) [2022-07-10 09:31:17,258][25689] Fps is (10 sec: 5567.7, 60 sec: 5532.4, 300 sec: 5558.9). Total num frames: 681939968. Throughput: 0: 5745.0. Samples: 681941866. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:17,259][25689] Avg episode reward: [(0, '-5.456')] [2022-07-10 09:31:18,146][26022] Updated weights on worker 0-0, policy_version 665963 (0.00088) [2022-07-10 09:31:19,970][26022] Updated weights on worker 0-0, policy_version 665973 (0.00097) [2022-07-10 09:31:21,689][26022] Updated weights on worker 0-0, policy_version 665983 (0.00093) [2022-07-10 09:31:22,401][25689] Fps is (10 sec: 5598.2, 60 sec: 5558.2, 300 sec: 5559.7). Total num frames: 681969664. Throughput: 0: 5731.1. Samples: 681975540. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:22,402][25689] Avg episode reward: [(0, '-5.772')] [2022-07-10 09:31:23,730][26022] Updated weights on worker 0-0, policy_version 665993 (0.00093) [2022-07-10 09:31:25,450][26022] Updated weights on worker 0-0, policy_version 666003 (0.00090) [2022-07-10 09:31:27,370][26022] Updated weights on worker 0-0, policy_version 666013 (0.01130) [2022-07-10 09:31:27,458][25689] Fps is (10 sec: 5622.0, 60 sec: 5556.4, 300 sec: 5555.6). Total num frames: 681997312. Throughput: 0: 4989.4. Samples: 681992230. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:27,460][25689] Avg episode reward: [(0, '-5.956')] [2022-07-10 09:31:29,093][26022] Updated weights on worker 0-0, policy_version 666023 (0.00087) [2022-07-10 09:31:30,935][26022] Updated weights on worker 0-0, policy_version 666033 (0.00095) [2022-07-10 09:31:32,463][25689] Fps is (10 sec: 5495.4, 60 sec: 5523.2, 300 sec: 5553.4). Total num frames: 682024960. Throughput: 0: 5808.7. Samples: 682025724. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:32,464][25689] Avg episode reward: [(0, '-5.656')] [2022-07-10 09:31:32,956][26022] Updated weights on worker 0-0, policy_version 666043 (0.00091) [2022-07-10 09:31:34,687][26022] Updated weights on worker 0-0, policy_version 666053 (0.00084) [2022-07-10 09:31:36,470][26022] Updated weights on worker 0-0, policy_version 666063 (0.00092) [2022-07-10 09:31:37,465][25689] Fps is (10 sec: 5628.0, 60 sec: 5559.1, 300 sec: 5554.2). Total num frames: 682053632. Throughput: 0: 5820.9. Samples: 682059490. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:37,466][25689] Avg episode reward: [(0, '-5.394')] [2022-07-10 09:31:38,273][26022] Updated weights on worker 0-0, policy_version 666073 (0.00091) [2022-07-10 09:31:40,023][26022] Updated weights on worker 0-0, policy_version 666083 (0.00097) [2022-07-10 09:31:41,846][26022] Updated weights on worker 0-0, policy_version 666093 (0.00085) [2022-07-10 09:31:42,521][25689] Fps is (10 sec: 5599.7, 60 sec: 5549.6, 300 sec: 5553.7). Total num frames: 682081280. Throughput: 0: 5003.6. Samples: 682076212. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:42,522][25689] Avg episode reward: [(0, '-5.508')] [2022-07-10 09:31:43,885][26022] Updated weights on worker 0-0, policy_version 666103 (0.00051) [2022-07-10 09:31:45,599][26022] Updated weights on worker 0-0, policy_version 666113 (0.00084) [2022-07-10 09:31:47,541][25689] Fps is (10 sec: 5589.9, 60 sec: 5566.6, 300 sec: 5557.2). Total num frames: 682109952. Throughput: 0: 5860.3. Samples: 682109920. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:47,541][25689] Avg episode reward: [(0, '-5.350')] [2022-07-10 09:31:47,548][26022] Updated weights on worker 0-0, policy_version 666123 (0.00090) [2022-07-10 09:31:49,151][26022] Updated weights on worker 0-0, policy_version 666133 (0.00085) [2022-07-10 09:31:51,331][26022] Updated weights on worker 0-0, policy_version 666143 (0.00081) [2022-07-10 09:31:52,635][25689] Fps is (10 sec: 5669.9, 60 sec: 5559.1, 300 sec: 5555.7). Total num frames: 682138624. Throughput: 0: 5828.0. Samples: 682143284. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:52,636][25689] Avg episode reward: [(0, '-5.220')] [2022-07-10 09:31:53,028][26022] Updated weights on worker 0-0, policy_version 666153 (0.00090) [2022-07-10 09:31:54,854][26022] Updated weights on worker 0-0, policy_version 666163 (0.00091) [2022-07-10 09:31:56,652][26022] Updated weights on worker 0-0, policy_version 666173 (0.00086) [2022-07-10 09:31:57,660][25689] Fps is (10 sec: 5464.3, 60 sec: 5540.2, 300 sec: 5556.4). Total num frames: 682165248. Throughput: 0: 4976.6. Samples: 682159992. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:31:57,661][25689] Avg episode reward: [(0, '-6.218')] [2022-07-10 09:31:58,540][26022] Updated weights on worker 0-0, policy_version 666183 (0.00085) [2022-07-10 09:32:00,356][26022] Updated weights on worker 0-0, policy_version 666193 (0.00085) [2022-07-10 09:32:02,530][26022] Updated weights on worker 0-0, policy_version 666203 (0.00084) [2022-07-10 09:32:02,715][25689] Fps is (10 sec: 5384.3, 60 sec: 5558.9, 300 sec: 5556.6). Total num frames: 682192896. Throughput: 0: 5816.4. Samples: 682193666. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:32:02,715][25689] Avg episode reward: [(0, '-6.628')] [2022-07-10 09:32:04,271][26022] Updated weights on worker 0-0, policy_version 666213 (0.00085) [2022-07-10 09:32:06,161][26022] Updated weights on worker 0-0, policy_version 666223 (0.00085) [2022-07-10 09:32:07,723][25689] Fps is (10 sec: 5495.2, 60 sec: 5561.2, 300 sec: 5557.0). Total num frames: 682220544. Throughput: 0: 5728.8. Samples: 682225540. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:32:07,723][25689] Avg episode reward: [(0, '-7.807')] [2022-07-10 09:32:08,170][26022] Updated weights on worker 0-0, policy_version 666233 (0.00096) [2022-07-10 09:32:09,763][26022] Updated weights on worker 0-0, policy_version 666243 (0.00091) [2022-07-10 09:32:11,655][26022] Updated weights on worker 0-0, policy_version 666253 (0.00082) [2022-07-10 09:32:12,724][25689] Fps is (10 sec: 5626.8, 60 sec: 5562.5, 300 sec: 5557.1). Total num frames: 682249216. Throughput: 0: 4935.1. Samples: 682242426. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:32:12,724][25689] Avg episode reward: [(0, '-6.920')] [2022-07-10 09:32:13,421][26022] Updated weights on worker 0-0, policy_version 666263 (0.00085) [2022-07-10 09:32:15,258][26022] Updated weights on worker 0-0, policy_version 666273 (0.00086) [2022-07-10 09:32:17,065][26022] Updated weights on worker 0-0, policy_version 666283 (0.00089) [2022-07-10 09:32:17,736][25689] Fps is (10 sec: 5727.1, 60 sec: 5587.6, 300 sec: 5562.6). Total num frames: 682277888. Throughput: 0: 5812.6. Samples: 682276682. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:32:17,736][25689] Avg episode reward: [(0, '-7.576')] [2022-07-10 09:32:18,931][26022] Updated weights on worker 0-0, policy_version 666293 (0.00088) [2022-07-10 09:32:20,380][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:32:20,395][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000666300_682291200.pth [2022-07-10 09:32:20,395][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000664346_680290304.pth [2022-07-10 09:32:20,739][26022] Updated weights on worker 0-0, policy_version 666303 (0.00096) [2022-07-10 09:32:22,496][26022] Updated weights on worker 0-0, policy_version 666313 (0.00088) [2022-07-10 09:32:22,852][25689] Fps is (10 sec: 5560.7, 60 sec: 5556.1, 300 sec: 5557.9). Total num frames: 682305536. Throughput: 0: 5774.0. Samples: 682309940. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 09:32:22,853][25689] Avg episode reward: [(0, '-6.925')] [2022-07-10 09:32:24,547][26022] Updated weights on worker 0-0, policy_version 666323 (0.00089) [2022-07-10 09:32:26,320][26022] Updated weights on worker 0-0, policy_version 666333 (0.00089) [2022-07-10 09:32:27,867][25689] Fps is (10 sec: 5458.0, 60 sec: 5560.0, 300 sec: 5554.5). Total num frames: 682333184. Throughput: 0: 5014.4. Samples: 682326550. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:27,867][25689] Avg episode reward: [(0, '-4.878')] [2022-07-10 09:32:28,271][26022] Updated weights on worker 0-0, policy_version 666343 (0.00086) [2022-07-10 09:32:29,944][26022] Updated weights on worker 0-0, policy_version 666353 (0.00090) [2022-07-10 09:32:31,813][26022] Updated weights on worker 0-0, policy_version 666363 (0.00090) [2022-07-10 09:32:32,902][25689] Fps is (10 sec: 5603.8, 60 sec: 5574.1, 300 sec: 5560.8). Total num frames: 682361856. Throughput: 0: 5825.6. Samples: 682359978. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:32,904][25689] Avg episode reward: [(0, '-5.085')] [2022-07-10 09:32:33,610][26022] Updated weights on worker 0-0, policy_version 666373 (0.00085) [2022-07-10 09:32:35,527][26022] Updated weights on worker 0-0, policy_version 666383 (0.00084) [2022-07-10 09:32:37,415][26022] Updated weights on worker 0-0, policy_version 666393 (0.00089) [2022-07-10 09:32:37,911][25689] Fps is (10 sec: 5505.5, 60 sec: 5539.7, 300 sec: 5555.3). Total num frames: 682388480. Throughput: 0: 5797.2. Samples: 682393640. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:37,911][25689] Avg episode reward: [(0, '-5.426')] [2022-07-10 09:32:39,044][26022] Updated weights on worker 0-0, policy_version 666403 (0.00090) [2022-07-10 09:32:40,967][26022] Updated weights on worker 0-0, policy_version 666413 (0.00086) [2022-07-10 09:32:42,782][26022] Updated weights on worker 0-0, policy_version 666423 (0.00092) [2022-07-10 09:32:42,974][25689] Fps is (10 sec: 5490.3, 60 sec: 5555.9, 300 sec: 5554.5). Total num frames: 682417152. Throughput: 0: 5836.3. Samples: 682427378. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:42,975][25689] Avg episode reward: [(0, '-4.668')] [2022-07-10 09:32:44,479][26022] Updated weights on worker 0-0, policy_version 666433 (0.00080) [2022-07-10 09:32:46,458][26022] Updated weights on worker 0-0, policy_version 666443 (0.00080) [2022-07-10 09:32:47,978][25689] Fps is (10 sec: 5797.5, 60 sec: 5574.3, 300 sec: 5562.6). Total num frames: 682446848. Throughput: 0: 5849.7. Samples: 682444196. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:47,979][25689] Avg episode reward: [(0, '-4.293')] [2022-07-10 09:32:48,193][26022] Updated weights on worker 0-0, policy_version 666453 (0.00089) [2022-07-10 09:32:50,011][26022] Updated weights on worker 0-0, policy_version 666463 (0.00087) [2022-07-10 09:32:52,007][26022] Updated weights on worker 0-0, policy_version 666473 (0.00085) [2022-07-10 09:32:52,990][25689] Fps is (10 sec: 5623.1, 60 sec: 5548.0, 300 sec: 5559.1). Total num frames: 682473472. Throughput: 0: 5873.8. Samples: 682477968. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:52,992][25689] Avg episode reward: [(0, '-4.691')] [2022-07-10 09:32:53,560][26022] Updated weights on worker 0-0, policy_version 666483 (0.00088) [2022-07-10 09:32:55,673][26022] Updated weights on worker 0-0, policy_version 666493 (0.00086) [2022-07-10 09:32:57,142][26022] Updated weights on worker 0-0, policy_version 666503 (0.00082) [2022-07-10 09:32:58,017][25689] Fps is (10 sec: 5508.6, 60 sec: 5581.8, 300 sec: 5561.0). Total num frames: 682502144. Throughput: 0: 5867.3. Samples: 682511608. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:32:58,019][25689] Avg episode reward: [(0, '-4.857')] [2022-07-10 09:32:59,402][26022] Updated weights on worker 0-0, policy_version 666513 (0.00089) [2022-07-10 09:33:01,020][26022] Updated weights on worker 0-0, policy_version 666523 (0.00096) [2022-07-10 09:33:03,083][25689] Fps is (10 sec: 5376.9, 60 sec: 5546.7, 300 sec: 5556.8). Total num frames: 682527744. Throughput: 0: 5019.6. Samples: 682528320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:03,084][25689] Avg episode reward: [(0, '-4.131')] [2022-07-10 09:33:03,300][26022] Updated weights on worker 0-0, policy_version 666533 (0.00092) [2022-07-10 09:33:05,114][26022] Updated weights on worker 0-0, policy_version 666543 (0.00093) [2022-07-10 09:33:07,010][26022] Updated weights on worker 0-0, policy_version 666553 (0.00088) [2022-07-10 09:33:08,095][25689] Fps is (10 sec: 5385.1, 60 sec: 5563.4, 300 sec: 5556.7). Total num frames: 682556416. Throughput: 0: 5750.0. Samples: 682559864. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:08,095][25689] Avg episode reward: [(0, '-3.700')] [2022-07-10 09:33:08,733][26022] Updated weights on worker 0-0, policy_version 666563 (0.00097) [2022-07-10 09:33:10,697][26022] Updated weights on worker 0-0, policy_version 666573 (0.00085) [2022-07-10 09:33:12,446][26022] Updated weights on worker 0-0, policy_version 666583 (0.00080) [2022-07-10 09:33:13,115][25689] Fps is (10 sec: 5614.6, 60 sec: 5544.7, 300 sec: 5556.8). Total num frames: 682584064. Throughput: 0: 5730.3. Samples: 682593288. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:13,115][25689] Avg episode reward: [(0, '-3.607')] [2022-07-10 09:33:14,146][26022] Updated weights on worker 0-0, policy_version 666593 (0.00086) [2022-07-10 09:33:16,011][26022] Updated weights on worker 0-0, policy_version 666603 (0.00060) [2022-07-10 09:33:17,876][26022] Updated weights on worker 0-0, policy_version 666613 (0.00094) [2022-07-10 09:33:18,125][25689] Fps is (10 sec: 5614.9, 60 sec: 5544.8, 300 sec: 5559.3). Total num frames: 682612736. Throughput: 0: 4901.1. Samples: 682610162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:18,126][25689] Avg episode reward: [(0, '-2.781')] [2022-07-10 09:33:19,770][26022] Updated weights on worker 0-0, policy_version 666623 (0.00088) [2022-07-10 09:33:21,452][26022] Updated weights on worker 0-0, policy_version 666633 (0.00089) [2022-07-10 09:33:23,180][25689] Fps is (10 sec: 5595.5, 60 sec: 5550.5, 300 sec: 5555.8). Total num frames: 682640384. Throughput: 0: 5742.4. Samples: 682643720. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:23,180][25689] Avg episode reward: [(0, '-3.015')] [2022-07-10 09:33:23,408][26022] Updated weights on worker 0-0, policy_version 666643 (0.00089) [2022-07-10 09:33:25,178][26022] Updated weights on worker 0-0, policy_version 666653 (0.00423) [2022-07-10 09:33:27,165][26022] Updated weights on worker 0-0, policy_version 666663 (0.00091) [2022-07-10 09:33:28,258][25689] Fps is (10 sec: 5456.9, 60 sec: 5544.6, 300 sec: 5554.4). Total num frames: 682668032. Throughput: 0: 5819.3. Samples: 682677202. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:28,259][25689] Avg episode reward: [(0, '-4.617')] [2022-07-10 09:33:28,864][26022] Updated weights on worker 0-0, policy_version 666673 (0.00092) [2022-07-10 09:33:30,972][26022] Updated weights on worker 0-0, policy_version 666683 (0.00092) [2022-07-10 09:33:32,450][26022] Updated weights on worker 0-0, policy_version 666693 (0.00094) [2022-07-10 09:33:33,352][25689] Fps is (10 sec: 5536.8, 60 sec: 5539.3, 300 sec: 5556.8). Total num frames: 682696704. Throughput: 0: 4966.0. Samples: 682693786. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:33,352][25689] Avg episode reward: [(0, '-4.634')] [2022-07-10 09:33:34,716][26022] Updated weights on worker 0-0, policy_version 666703 (0.00093) [2022-07-10 09:33:36,205][26022] Updated weights on worker 0-0, policy_version 666713 (0.00095) [2022-07-10 09:33:38,219][26022] Updated weights on worker 0-0, policy_version 666723 (0.00094) [2022-07-10 09:33:38,354][25689] Fps is (10 sec: 5680.0, 60 sec: 5573.7, 300 sec: 5557.8). Total num frames: 682725376. Throughput: 0: 5792.0. Samples: 682727328. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:38,355][25689] Avg episode reward: [(0, '-4.497')] [2022-07-10 09:33:40,452][26022] Updated weights on worker 0-0, policy_version 666733 (0.00089) [2022-07-10 09:33:42,063][26022] Updated weights on worker 0-0, policy_version 666743 (0.00092) [2022-07-10 09:33:43,419][25689] Fps is (10 sec: 5391.0, 60 sec: 5522.8, 300 sec: 5556.7). Total num frames: 682750976. Throughput: 0: 5682.3. Samples: 682758724. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:43,419][25689] Avg episode reward: [(0, '-4.747')] [2022-07-10 09:33:44,294][26022] Updated weights on worker 0-0, policy_version 666753 (0.00086) [2022-07-10 09:33:45,828][26022] Updated weights on worker 0-0, policy_version 666763 (0.00095) [2022-07-10 09:33:47,790][26022] Updated weights on worker 0-0, policy_version 666773 (0.00094) [2022-07-10 09:33:48,454][25689] Fps is (10 sec: 5272.3, 60 sec: 5486.1, 300 sec: 5546.3). Total num frames: 682778624. Throughput: 0: 4858.4. Samples: 682775318. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:48,454][25689] Avg episode reward: [(0, '-5.618')] [2022-07-10 09:33:49,935][26022] Updated weights on worker 0-0, policy_version 666783 (0.00084) [2022-07-10 09:33:51,724][26022] Updated weights on worker 0-0, policy_version 666793 (0.00086) [2022-07-10 09:33:53,488][25689] Fps is (10 sec: 5389.7, 60 sec: 5484.0, 300 sec: 5546.1). Total num frames: 682805248. Throughput: 0: 5657.6. Samples: 682807712. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:53,491][25689] Avg episode reward: [(0, '-5.595')] [2022-07-10 09:33:53,546][26022] Updated weights on worker 0-0, policy_version 666803 (0.00089) [2022-07-10 09:33:55,386][26022] Updated weights on worker 0-0, policy_version 666813 (0.00090) [2022-07-10 09:33:57,372][26022] Updated weights on worker 0-0, policy_version 666823 (0.00095) [2022-07-10 09:33:58,493][25689] Fps is (10 sec: 5304.1, 60 sec: 5452.2, 300 sec: 5540.4). Total num frames: 682831872. Throughput: 0: 5592.6. Samples: 682839954. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:33:58,493][25689] Avg episode reward: [(0, '-4.191')] [2022-07-10 09:33:59,418][26022] Updated weights on worker 0-0, policy_version 666833 (0.00092) [2022-07-10 09:34:01,242][26022] Updated weights on worker 0-0, policy_version 666843 (0.00092) [2022-07-10 09:34:03,554][26022] Updated weights on worker 0-0, policy_version 666853 (0.00094) [2022-07-10 09:34:03,594][25689] Fps is (10 sec: 5168.0, 60 sec: 5449.1, 300 sec: 5538.7). Total num frames: 682857472. Throughput: 0: 4825.1. Samples: 682856070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:03,594][25689] Avg episode reward: [(0, '-4.265')] [2022-07-10 09:34:05,578][26022] Updated weights on worker 0-0, policy_version 666863 (0.00094) [2022-07-10 09:34:07,508][26022] Updated weights on worker 0-0, policy_version 666873 (0.00099) [2022-07-10 09:34:08,605][25689] Fps is (10 sec: 5164.3, 60 sec: 5415.3, 300 sec: 5535.6). Total num frames: 682884096. Throughput: 0: 5474.7. Samples: 682885640. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:08,607][25689] Avg episode reward: [(0, '-4.239')] [2022-07-10 09:34:09,385][26022] Updated weights on worker 0-0, policy_version 666883 (0.00084) [2022-07-10 09:34:11,229][26022] Updated weights on worker 0-0, policy_version 666893 (0.00090) [2022-07-10 09:34:12,988][26022] Updated weights on worker 0-0, policy_version 666903 (0.00094) [2022-07-10 09:34:13,624][25689] Fps is (10 sec: 5308.4, 60 sec: 5398.4, 300 sec: 5525.3). Total num frames: 682910720. Throughput: 0: 5492.4. Samples: 682918308. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:13,626][25689] Avg episode reward: [(0, '-5.844')] [2022-07-10 09:34:15,074][26022] Updated weights on worker 0-0, policy_version 666913 (0.00082) [2022-07-10 09:34:16,832][26022] Updated weights on worker 0-0, policy_version 666923 (0.00091) [2022-07-10 09:34:18,531][26022] Updated weights on worker 0-0, policy_version 666933 (0.00070) [2022-07-10 09:34:18,648][25689] Fps is (10 sec: 5505.9, 60 sec: 5397.2, 300 sec: 5526.2). Total num frames: 682939392. Throughput: 0: 4730.0. Samples: 682935290. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:18,650][25689] Avg episode reward: [(0, '-4.781')] [2022-07-10 09:34:20,326][26022] Updated weights on worker 0-0, policy_version 666943 (0.00085) [2022-07-10 09:34:20,589][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:34:20,602][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000666944_682950656.pth [2022-07-10 09:34:20,603][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000664997_680956928.pth [2022-07-10 09:34:22,120][26022] Updated weights on worker 0-0, policy_version 666953 (0.00085) [2022-07-10 09:34:23,759][25689] Fps is (10 sec: 5658.2, 60 sec: 5409.1, 300 sec: 5531.3). Total num frames: 682968064. Throughput: 0: 5591.2. Samples: 682968820. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:23,760][25689] Avg episode reward: [(0, '-5.792')] [2022-07-10 09:34:24,188][26022] Updated weights on worker 0-0, policy_version 666963 (0.00092) [2022-07-10 09:34:26,095][26022] Updated weights on worker 0-0, policy_version 666973 (0.00083) [2022-07-10 09:34:27,663][26022] Updated weights on worker 0-0, policy_version 666983 (0.00085) [2022-07-10 09:34:28,792][25689] Fps is (10 sec: 5451.4, 60 sec: 5396.3, 300 sec: 5524.1). Total num frames: 682994688. Throughput: 0: 5774.2. Samples: 683002202. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:28,792][25689] Avg episode reward: [(0, '-5.854')] [2022-07-10 09:34:29,521][26022] Updated weights on worker 0-0, policy_version 666993 (0.00087) [2022-07-10 09:34:31,361][26022] Updated weights on worker 0-0, policy_version 667003 (0.00086) [2022-07-10 09:34:33,259][26022] Updated weights on worker 0-0, policy_version 667013 (0.00086) [2022-07-10 09:34:33,801][25689] Fps is (10 sec: 5608.4, 60 sec: 5420.7, 300 sec: 5524.3). Total num frames: 683024384. Throughput: 0: 4984.0. Samples: 683018870. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:33,802][25689] Avg episode reward: [(0, '-5.113')] [2022-07-10 09:34:35,037][26022] Updated weights on worker 0-0, policy_version 667023 (0.00087) [2022-07-10 09:34:37,035][26022] Updated weights on worker 0-0, policy_version 667033 (0.00082) [2022-07-10 09:34:38,596][26022] Updated weights on worker 0-0, policy_version 667043 (0.00100) [2022-07-10 09:34:38,833][25689] Fps is (10 sec: 5812.9, 60 sec: 5418.1, 300 sec: 5528.3). Total num frames: 683053056. Throughput: 0: 5811.4. Samples: 683052590. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:38,833][25689] Avg episode reward: [(0, '-4.610')] [2022-07-10 09:34:40,725][26022] Updated weights on worker 0-0, policy_version 667053 (0.00088) [2022-07-10 09:34:42,236][26022] Updated weights on worker 0-0, policy_version 667063 (0.00086) [2022-07-10 09:34:43,907][25689] Fps is (10 sec: 5471.9, 60 sec: 5434.2, 300 sec: 5524.2). Total num frames: 683079680. Throughput: 0: 5826.7. Samples: 683086214. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:43,907][25689] Avg episode reward: [(0, '-2.691')] [2022-07-10 09:34:44,265][26022] Updated weights on worker 0-0, policy_version 667073 (0.00086) [2022-07-10 09:34:46,058][26022] Updated weights on worker 0-0, policy_version 667083 (0.00082) [2022-07-10 09:34:47,750][26022] Updated weights on worker 0-0, policy_version 667093 (0.00091) [2022-07-10 09:34:48,959][25689] Fps is (10 sec: 5460.4, 60 sec: 5449.5, 300 sec: 5523.5). Total num frames: 683108352. Throughput: 0: 5008.8. Samples: 683103218. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:48,960][25689] Avg episode reward: [(0, '-3.238')] [2022-07-10 09:34:49,791][26022] Updated weights on worker 0-0, policy_version 667103 (0.00094) [2022-07-10 09:34:51,620][26022] Updated weights on worker 0-0, policy_version 667113 (0.00091) [2022-07-10 09:34:53,197][26022] Updated weights on worker 0-0, policy_version 667123 (0.00084) [2022-07-10 09:34:53,968][25689] Fps is (10 sec: 5801.0, 60 sec: 5502.6, 300 sec: 5527.0). Total num frames: 683138048. Throughput: 0: 5856.1. Samples: 683136970. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:53,969][25689] Avg episode reward: [(0, '-3.031')] [2022-07-10 09:34:55,327][26022] Updated weights on worker 0-0, policy_version 667133 (0.00086) [2022-07-10 09:34:56,929][26022] Updated weights on worker 0-0, policy_version 667143 (0.00086) [2022-07-10 09:34:58,941][26022] Updated weights on worker 0-0, policy_version 667153 (0.00088) [2022-07-10 09:34:59,007][25689] Fps is (10 sec: 5605.6, 60 sec: 5499.5, 300 sec: 5520.6). Total num frames: 683164672. Throughput: 0: 5849.1. Samples: 683170590. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:34:59,007][25689] Avg episode reward: [(0, '-3.014')] [2022-07-10 09:35:00,647][26022] Updated weights on worker 0-0, policy_version 667163 (0.00091) [2022-07-10 09:35:02,714][26022] Updated weights on worker 0-0, policy_version 667173 (0.00100) [2022-07-10 09:35:04,072][25689] Fps is (10 sec: 5168.8, 60 sec: 5502.8, 300 sec: 5516.1). Total num frames: 683190272. Throughput: 0: 5013.0. Samples: 683187302. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:04,073][25689] Avg episode reward: [(0, '-3.080')] [2022-07-10 09:35:04,716][26022] Updated weights on worker 0-0, policy_version 667183 (0.00085) [2022-07-10 09:35:06,430][26022] Updated weights on worker 0-0, policy_version 667193 (0.00069) [2022-07-10 09:35:08,326][26022] Updated weights on worker 0-0, policy_version 667203 (0.00093) [2022-07-10 09:35:09,099][25689] Fps is (10 sec: 5478.8, 60 sec: 5552.1, 300 sec: 5526.7). Total num frames: 683219968. Throughput: 0: 5747.9. Samples: 683218980. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:09,100][25689] Avg episode reward: [(0, '-4.778')] [2022-07-10 09:35:10,208][26022] Updated weights on worker 0-0, policy_version 667213 (0.00085) [2022-07-10 09:35:11,818][26022] Updated weights on worker 0-0, policy_version 667223 (0.00087) [2022-07-10 09:35:13,830][26022] Updated weights on worker 0-0, policy_version 667233 (0.00090) [2022-07-10 09:35:14,153][25689] Fps is (10 sec: 5688.6, 60 sec: 5565.9, 300 sec: 5522.5). Total num frames: 683247616. Throughput: 0: 5731.4. Samples: 683252654. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:14,153][25689] Avg episode reward: [(0, '-4.861')] [2022-07-10 09:35:15,596][26022] Updated weights on worker 0-0, policy_version 667243 (0.00091) [2022-07-10 09:35:17,587][26022] Updated weights on worker 0-0, policy_version 667253 (0.00086) [2022-07-10 09:35:19,239][25689] Fps is (10 sec: 5554.6, 60 sec: 5560.2, 300 sec: 5525.4). Total num frames: 683276288. Throughput: 0: 5727.6. Samples: 683286472. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:19,239][25689] Avg episode reward: [(0, '-6.056')] [2022-07-10 09:35:19,292][26022] Updated weights on worker 0-0, policy_version 667263 (0.00088) [2022-07-10 09:35:21,190][26022] Updated weights on worker 0-0, policy_version 667273 (0.00101) [2022-07-10 09:35:22,959][26022] Updated weights on worker 0-0, policy_version 667283 (0.00448) [2022-07-10 09:35:24,287][25689] Fps is (10 sec: 5557.1, 60 sec: 5549.0, 300 sec: 5525.2). Total num frames: 683303936. Throughput: 0: 5741.1. Samples: 683303360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:24,288][25689] Avg episode reward: [(0, '-6.702')] [2022-07-10 09:35:24,689][26022] Updated weights on worker 0-0, policy_version 667293 (0.00094) [2022-07-10 09:35:26,461][26022] Updated weights on worker 0-0, policy_version 667303 (0.00083) [2022-07-10 09:35:28,355][26022] Updated weights on worker 0-0, policy_version 667313 (0.00088) [2022-07-10 09:35:29,363][25689] Fps is (10 sec: 5664.3, 60 sec: 5595.8, 300 sec: 5524.0). Total num frames: 683333632. Throughput: 0: 5818.3. Samples: 683336878. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:29,363][25689] Avg episode reward: [(0, '-6.839')] [2022-07-10 09:35:30,265][26022] Updated weights on worker 0-0, policy_version 667323 (0.00085) [2022-07-10 09:35:32,044][26022] Updated weights on worker 0-0, policy_version 667333 (0.00085) [2022-07-10 09:35:33,994][26022] Updated weights on worker 0-0, policy_version 667343 (0.00088) [2022-07-10 09:35:34,417][25689] Fps is (10 sec: 5660.9, 60 sec: 5557.9, 300 sec: 5526.9). Total num frames: 683361280. Throughput: 0: 5826.9. Samples: 683370734. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:34,418][25689] Avg episode reward: [(0, '-7.293')] [2022-07-10 09:35:35,678][26022] Updated weights on worker 0-0, policy_version 667353 (0.00083) [2022-07-10 09:35:37,711][26022] Updated weights on worker 0-0, policy_version 667363 (0.00088) [2022-07-10 09:35:39,350][26022] Updated weights on worker 0-0, policy_version 667373 (0.00098) [2022-07-10 09:35:39,452][25689] Fps is (10 sec: 5582.1, 60 sec: 5557.6, 300 sec: 5528.8). Total num frames: 683389952. Throughput: 0: 4996.4. Samples: 683387466. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 09:35:39,453][25689] Avg episode reward: [(0, '-7.082')] [2022-07-10 09:35:41,355][26022] Updated weights on worker 0-0, policy_version 667383 (0.00096) [2022-07-10 09:35:43,072][26022] Updated weights on worker 0-0, policy_version 667393 (0.00087) [2022-07-10 09:35:44,517][25689] Fps is (10 sec: 5576.6, 60 sec: 5575.3, 300 sec: 5527.9). Total num frames: 683417600. Throughput: 0: 5802.7. Samples: 683420744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:35:44,517][25689] Avg episode reward: [(0, '-8.554')] [2022-07-10 09:35:44,946][26022] Updated weights on worker 0-0, policy_version 667403 (0.00081) [2022-07-10 09:35:46,699][26022] Updated weights on worker 0-0, policy_version 667413 (0.00083) [2022-07-10 09:35:48,605][26022] Updated weights on worker 0-0, policy_version 667423 (0.00086) [2022-07-10 09:35:49,565][25689] Fps is (10 sec: 5569.1, 60 sec: 5575.7, 300 sec: 5527.3). Total num frames: 683446272. Throughput: 0: 5817.4. Samples: 683454404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:35:49,565][25689] Avg episode reward: [(0, '-8.401')] [2022-07-10 09:35:50,427][26022] Updated weights on worker 0-0, policy_version 667433 (0.00083) [2022-07-10 09:35:52,172][26022] Updated weights on worker 0-0, policy_version 667443 (0.00085) [2022-07-10 09:35:54,022][26022] Updated weights on worker 0-0, policy_version 667453 (0.00079) [2022-07-10 09:35:54,592][25689] Fps is (10 sec: 5589.7, 60 sec: 5540.3, 300 sec: 5526.8). Total num frames: 683473920. Throughput: 0: 4977.9. Samples: 683471164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:35:54,593][25689] Avg episode reward: [(0, '-6.711')] [2022-07-10 09:35:55,772][26022] Updated weights on worker 0-0, policy_version 667463 (0.00090) [2022-07-10 09:35:57,903][26022] Updated weights on worker 0-0, policy_version 667473 (0.00084) [2022-07-10 09:35:59,601][25689] Fps is (10 sec: 5611.8, 60 sec: 5576.8, 300 sec: 5534.9). Total num frames: 683502592. Throughput: 0: 5821.2. Samples: 683504756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:35:59,601][25689] Avg episode reward: [(0, '-6.578')] [2022-07-10 09:35:59,615][26022] Updated weights on worker 0-0, policy_version 667483 (0.00091) [2022-07-10 09:36:01,397][26022] Updated weights on worker 0-0, policy_version 667493 (0.00088) [2022-07-10 09:36:03,496][26022] Updated weights on worker 0-0, policy_version 667503 (0.00086) [2022-07-10 09:36:04,659][25689] Fps is (10 sec: 5391.3, 60 sec: 5577.4, 300 sec: 5527.6). Total num frames: 683528192. Throughput: 0: 5738.3. Samples: 683536326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:04,660][25689] Avg episode reward: [(0, '-5.306')] [2022-07-10 09:36:05,500][26022] Updated weights on worker 0-0, policy_version 667513 (0.00084) [2022-07-10 09:36:07,364][26022] Updated weights on worker 0-0, policy_version 667523 (0.00091) [2022-07-10 09:36:09,011][26022] Updated weights on worker 0-0, policy_version 667533 (0.00095) [2022-07-10 09:36:09,674][25689] Fps is (10 sec: 5286.4, 60 sec: 5544.8, 300 sec: 5524.1). Total num frames: 683555840. Throughput: 0: 4908.8. Samples: 683553114. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:09,674][25689] Avg episode reward: [(0, '-3.550')] [2022-07-10 09:36:10,880][26022] Updated weights on worker 0-0, policy_version 667543 (0.00093) [2022-07-10 09:36:12,947][26022] Updated weights on worker 0-0, policy_version 667553 (0.00087) [2022-07-10 09:36:14,437][26022] Updated weights on worker 0-0, policy_version 667563 (0.00092) [2022-07-10 09:36:14,699][25689] Fps is (10 sec: 5711.7, 60 sec: 5581.2, 300 sec: 5532.4). Total num frames: 683585536. Throughput: 0: 5750.6. Samples: 683586788. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:14,699][25689] Avg episode reward: [(0, '-1.281')] [2022-07-10 09:36:16,572][26022] Updated weights on worker 0-0, policy_version 667573 (0.00089) [2022-07-10 09:36:18,044][26022] Updated weights on worker 0-0, policy_version 667583 (0.00097) [2022-07-10 09:36:19,707][25689] Fps is (10 sec: 5613.1, 60 sec: 5554.5, 300 sec: 5524.6). Total num frames: 683612160. Throughput: 0: 5751.6. Samples: 683620400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:19,708][25689] Avg episode reward: [(0, '-1.830')] [2022-07-10 09:36:20,206][26022] Updated weights on worker 0-0, policy_version 667593 (0.00087) [2022-07-10 09:36:20,636][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:36:20,645][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000667596_683618304.pth [2022-07-10 09:36:20,646][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000665650_681625600.pth [2022-07-10 09:36:21,729][26022] Updated weights on worker 0-0, policy_version 667603 (0.00091) [2022-07-10 09:36:23,655][26022] Updated weights on worker 0-0, policy_version 667613 (0.00086) [2022-07-10 09:36:24,828][25689] Fps is (10 sec: 5560.2, 60 sec: 5581.7, 300 sec: 5530.3). Total num frames: 683641856. Throughput: 0: 4999.7. Samples: 683637164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:24,828][25689] Avg episode reward: [(0, '-1.985')] [2022-07-10 09:36:25,496][26022] Updated weights on worker 0-0, policy_version 667623 (0.00084) [2022-07-10 09:36:27,482][26022] Updated weights on worker 0-0, policy_version 667633 (0.00092) [2022-07-10 09:36:29,243][26022] Updated weights on worker 0-0, policy_version 667643 (0.00090) [2022-07-10 09:36:29,871][25689] Fps is (10 sec: 5642.2, 60 sec: 5550.9, 300 sec: 5529.6). Total num frames: 683669504. Throughput: 0: 5814.4. Samples: 683670548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:29,871][25689] Avg episode reward: [(0, '-1.983')] [2022-07-10 09:36:31,232][26022] Updated weights on worker 0-0, policy_version 667653 (0.00093) [2022-07-10 09:36:32,955][26022] Updated weights on worker 0-0, policy_version 667663 (0.00093) [2022-07-10 09:36:34,764][26022] Updated weights on worker 0-0, policy_version 667673 (0.00088) [2022-07-10 09:36:34,874][25689] Fps is (10 sec: 5504.1, 60 sec: 5555.6, 300 sec: 5526.1). Total num frames: 683697152. Throughput: 0: 5803.8. Samples: 683703882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:34,875][25689] Avg episode reward: [(0, '-1.946')] [2022-07-10 09:36:36,507][26022] Updated weights on worker 0-0, policy_version 667683 (0.00089) [2022-07-10 09:36:38,617][26022] Updated weights on worker 0-0, policy_version 667693 (0.00086) [2022-07-10 09:36:39,880][25689] Fps is (10 sec: 5524.5, 60 sec: 5541.3, 300 sec: 5527.1). Total num frames: 683724800. Throughput: 0: 4978.1. Samples: 683720822. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:39,882][25689] Avg episode reward: [(0, '-2.853')] [2022-07-10 09:36:40,233][26022] Updated weights on worker 0-0, policy_version 667703 (0.00090) [2022-07-10 09:36:42,189][26022] Updated weights on worker 0-0, policy_version 667713 (0.00089) [2022-07-10 09:36:43,915][26022] Updated weights on worker 0-0, policy_version 667723 (0.00089) [2022-07-10 09:36:45,021][25689] Fps is (10 sec: 5651.1, 60 sec: 5568.1, 300 sec: 5528.2). Total num frames: 683754496. Throughput: 0: 5807.2. Samples: 683754432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:45,022][25689] Avg episode reward: [(0, '-1.995')] [2022-07-10 09:36:45,977][26022] Updated weights on worker 0-0, policy_version 667733 (0.00088) [2022-07-10 09:36:47,555][26022] Updated weights on worker 0-0, policy_version 667743 (0.00137) [2022-07-10 09:36:49,464][26022] Updated weights on worker 0-0, policy_version 667753 (0.00083) [2022-07-10 09:36:50,043][25689] Fps is (10 sec: 5541.8, 60 sec: 5536.7, 300 sec: 5522.7). Total num frames: 683781120. Throughput: 0: 5803.3. Samples: 683787612. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:50,043][25689] Avg episode reward: [(0, '-2.783')] [2022-07-10 09:36:51,135][26022] Updated weights on worker 0-0, policy_version 667763 (0.00101) [2022-07-10 09:36:53,246][26022] Updated weights on worker 0-0, policy_version 667773 (0.00096) [2022-07-10 09:36:54,987][26022] Updated weights on worker 0-0, policy_version 667783 (0.00091) [2022-07-10 09:36:55,075][25689] Fps is (10 sec: 5500.0, 60 sec: 5553.2, 300 sec: 5529.5). Total num frames: 683809792. Throughput: 0: 4977.5. Samples: 683804434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:36:55,076][25689] Avg episode reward: [(0, '-2.923')] [2022-07-10 09:36:56,906][26022] Updated weights on worker 0-0, policy_version 667793 (0.00092) [2022-07-10 09:36:58,746][26022] Updated weights on worker 0-0, policy_version 667803 (0.00092) [2022-07-10 09:37:00,140][25689] Fps is (10 sec: 5678.9, 60 sec: 5548.0, 300 sec: 5532.7). Total num frames: 683838464. Throughput: 0: 5786.6. Samples: 683838062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:00,143][25689] Avg episode reward: [(0, '-3.847')] [2022-07-10 09:37:00,417][26022] Updated weights on worker 0-0, policy_version 667813 (0.00089) [2022-07-10 09:37:02,746][26022] Updated weights on worker 0-0, policy_version 667823 (0.00090) [2022-07-10 09:37:04,689][26022] Updated weights on worker 0-0, policy_version 667833 (0.00091) [2022-07-10 09:37:05,227][25689] Fps is (10 sec: 5245.5, 60 sec: 5528.5, 300 sec: 5520.9). Total num frames: 683863040. Throughput: 0: 5704.8. Samples: 683869698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:05,227][25689] Avg episode reward: [(0, '-5.069')] [2022-07-10 09:37:06,387][26022] Updated weights on worker 0-0, policy_version 667843 (0.00085) [2022-07-10 09:37:08,456][26022] Updated weights on worker 0-0, policy_version 667853 (0.00104) [2022-07-10 09:37:10,007][26022] Updated weights on worker 0-0, policy_version 667863 (0.00107) [2022-07-10 09:37:10,247][25689] Fps is (10 sec: 5471.4, 60 sec: 5578.7, 300 sec: 5527.5). Total num frames: 683893760. Throughput: 0: 4894.1. Samples: 683886492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:10,247][25689] Avg episode reward: [(0, '-4.226')] [2022-07-10 09:37:12,003][26022] Updated weights on worker 0-0, policy_version 667873 (0.00093) [2022-07-10 09:37:13,581][26022] Updated weights on worker 0-0, policy_version 667883 (0.00080) [2022-07-10 09:37:15,257][25689] Fps is (10 sec: 5717.2, 60 sec: 5529.4, 300 sec: 5520.6). Total num frames: 683920384. Throughput: 0: 5735.1. Samples: 683920176. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:15,257][25689] Avg episode reward: [(0, '-4.223')] [2022-07-10 09:37:15,620][26022] Updated weights on worker 0-0, policy_version 667893 (0.00087) [2022-07-10 09:37:17,188][26022] Updated weights on worker 0-0, policy_version 667903 (0.00086) [2022-07-10 09:37:19,376][26022] Updated weights on worker 0-0, policy_version 667913 (0.00094) [2022-07-10 09:37:20,265][25689] Fps is (10 sec: 5519.7, 60 sec: 5563.2, 300 sec: 5526.1). Total num frames: 683949056. Throughput: 0: 5764.0. Samples: 683954058. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:20,265][25689] Avg episode reward: [(0, '-4.752')] [2022-07-10 09:37:20,802][26022] Updated weights on worker 0-0, policy_version 667923 (0.00090) [2022-07-10 09:37:23,043][26022] Updated weights on worker 0-0, policy_version 667933 (0.00080) [2022-07-10 09:37:24,552][26022] Updated weights on worker 0-0, policy_version 667943 (0.00091) [2022-07-10 09:37:25,396][25689] Fps is (10 sec: 5554.5, 60 sec: 5528.4, 300 sec: 5523.9). Total num frames: 683976704. Throughput: 0: 5014.4. Samples: 683970836. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:25,397][25689] Avg episode reward: [(0, '-5.707')] [2022-07-10 09:37:26,629][26022] Updated weights on worker 0-0, policy_version 667953 (0.00085) [2022-07-10 09:37:28,202][26022] Updated weights on worker 0-0, policy_version 667963 (0.00090) [2022-07-10 09:37:30,170][26022] Updated weights on worker 0-0, policy_version 667973 (0.00084) [2022-07-10 09:37:30,449][25689] Fps is (10 sec: 5530.0, 60 sec: 5544.4, 300 sec: 5523.6). Total num frames: 684005376. Throughput: 0: 5825.8. Samples: 684004186. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:30,450][25689] Avg episode reward: [(0, '-3.788')] [2022-07-10 09:37:32,022][26022] Updated weights on worker 0-0, policy_version 667983 (0.00087) [2022-07-10 09:37:33,835][26022] Updated weights on worker 0-0, policy_version 667993 (0.00102) [2022-07-10 09:37:35,515][25689] Fps is (10 sec: 5768.5, 60 sec: 5572.5, 300 sec: 5532.8). Total num frames: 684035072. Throughput: 0: 5806.4. Samples: 684037802. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:35,515][25689] Avg episode reward: [(0, '-4.844')] [2022-07-10 09:37:35,517][26022] Updated weights on worker 0-0, policy_version 668003 (0.00109) [2022-07-10 09:37:37,517][26022] Updated weights on worker 0-0, policy_version 668013 (0.00090) [2022-07-10 09:37:39,186][26022] Updated weights on worker 0-0, policy_version 668023 (0.00086) [2022-07-10 09:37:40,545][25689] Fps is (10 sec: 5578.2, 60 sec: 5553.3, 300 sec: 5526.6). Total num frames: 684061696. Throughput: 0: 5801.1. Samples: 684071708. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:40,546][25689] Avg episode reward: [(0, '-5.884')] [2022-07-10 09:37:41,263][26022] Updated weights on worker 0-0, policy_version 668033 (0.00086) [2022-07-10 09:37:42,903][26022] Updated weights on worker 0-0, policy_version 668043 (0.00088) [2022-07-10 09:37:44,931][26022] Updated weights on worker 0-0, policy_version 668053 (0.00086) [2022-07-10 09:37:45,627][25689] Fps is (10 sec: 5468.3, 60 sec: 5542.0, 300 sec: 5521.7). Total num frames: 684090368. Throughput: 0: 5809.8. Samples: 684088372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:45,627][25689] Avg episode reward: [(0, '-7.849')] [2022-07-10 09:37:46,673][26022] Updated weights on worker 0-0, policy_version 668063 (0.00086) [2022-07-10 09:37:48,432][26022] Updated weights on worker 0-0, policy_version 668073 (0.00090) [2022-07-10 09:37:50,285][26022] Updated weights on worker 0-0, policy_version 668083 (0.00090) [2022-07-10 09:37:50,636][25689] Fps is (10 sec: 5682.8, 60 sec: 5576.8, 300 sec: 5528.6). Total num frames: 684119040. Throughput: 0: 5856.1. Samples: 684122404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:50,638][25689] Avg episode reward: [(0, '-10.033')] [2022-07-10 09:37:52,144][26022] Updated weights on worker 0-0, policy_version 668093 (0.00087) [2022-07-10 09:37:53,865][26022] Updated weights on worker 0-0, policy_version 668103 (0.00086) [2022-07-10 09:37:55,678][25689] Fps is (10 sec: 5603.5, 60 sec: 5559.1, 300 sec: 5524.9). Total num frames: 684146688. Throughput: 0: 5876.9. Samples: 684156298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:37:55,678][25689] Avg episode reward: [(0, '-8.605')] [2022-07-10 09:37:55,732][26022] Updated weights on worker 0-0, policy_version 668113 (0.00093) [2022-07-10 09:37:57,515][26022] Updated weights on worker 0-0, policy_version 668123 (0.00083) [2022-07-10 09:37:59,255][26022] Updated weights on worker 0-0, policy_version 668133 (0.00081) [2022-07-10 09:38:00,699][25689] Fps is (10 sec: 5597.0, 60 sec: 5563.2, 300 sec: 5536.1). Total num frames: 684175360. Throughput: 0: 5017.1. Samples: 684172818. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:00,699][25689] Avg episode reward: [(0, '-8.576')] [2022-07-10 09:38:01,282][26022] Updated weights on worker 0-0, policy_version 668143 (0.00083) [2022-07-10 09:38:03,390][26022] Updated weights on worker 0-0, policy_version 668153 (0.00090) [2022-07-10 09:38:05,385][26022] Updated weights on worker 0-0, policy_version 668163 (0.00086) [2022-07-10 09:38:05,816][25689] Fps is (10 sec: 5453.9, 60 sec: 5594.1, 300 sec: 5527.2). Total num frames: 684201984. Throughput: 0: 5741.9. Samples: 684204300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:05,817][25689] Avg episode reward: [(0, '-6.885')] [2022-07-10 09:38:07,043][26022] Updated weights on worker 0-0, policy_version 668173 (0.00090) [2022-07-10 09:38:08,915][26022] Updated weights on worker 0-0, policy_version 668183 (0.00094) [2022-07-10 09:38:10,838][25689] Fps is (10 sec: 5352.7, 60 sec: 5543.3, 300 sec: 5527.2). Total num frames: 684229632. Throughput: 0: 5718.0. Samples: 684237916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:10,838][25689] Avg episode reward: [(0, '-5.927')] [2022-07-10 09:38:10,839][26022] Updated weights on worker 0-0, policy_version 668193 (0.00085) [2022-07-10 09:38:12,536][26022] Updated weights on worker 0-0, policy_version 668203 (0.00085) [2022-07-10 09:38:14,467][26022] Updated weights on worker 0-0, policy_version 668213 (0.00090) [2022-07-10 09:38:15,862][25689] Fps is (10 sec: 5504.6, 60 sec: 5558.9, 300 sec: 5523.5). Total num frames: 684257280. Throughput: 0: 4872.9. Samples: 684254654. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:15,863][25689] Avg episode reward: [(0, '-2.406')] [2022-07-10 09:38:16,375][26022] Updated weights on worker 0-0, policy_version 668223 (0.00082) [2022-07-10 09:38:17,972][26022] Updated weights on worker 0-0, policy_version 668233 (0.00089) [2022-07-10 09:38:20,153][26022] Updated weights on worker 0-0, policy_version 668243 (0.00094) [2022-07-10 09:38:20,657][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:38:20,673][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000668248_684285952.pth [2022-07-10 09:38:20,679][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000666300_682291200.pth [2022-07-10 09:38:20,893][25689] Fps is (10 sec: 5601.0, 60 sec: 5556.7, 300 sec: 5527.4). Total num frames: 684285952. Throughput: 0: 5737.7. Samples: 684288688. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:20,894][25689] Avg episode reward: [(0, '-2.832')] [2022-07-10 09:38:21,582][26022] Updated weights on worker 0-0, policy_version 668253 (0.00082) [2022-07-10 09:38:23,651][26022] Updated weights on worker 0-0, policy_version 668263 (0.00094) [2022-07-10 09:38:25,264][26022] Updated weights on worker 0-0, policy_version 668273 (0.00090) [2022-07-10 09:38:25,985][25689] Fps is (10 sec: 5765.6, 60 sec: 5594.1, 300 sec: 5534.0). Total num frames: 684315648. Throughput: 0: 5863.6. Samples: 684322562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:25,986][25689] Avg episode reward: [(0, '-4.134')] [2022-07-10 09:38:27,325][26022] Updated weights on worker 0-0, policy_version 668283 (0.00091) [2022-07-10 09:38:29,023][26022] Updated weights on worker 0-0, policy_version 668293 (0.00089) [2022-07-10 09:38:30,842][26022] Updated weights on worker 0-0, policy_version 668303 (0.00089) [2022-07-10 09:38:31,000][25689] Fps is (10 sec: 5572.7, 60 sec: 5563.9, 300 sec: 5528.6). Total num frames: 684342272. Throughput: 0: 5017.9. Samples: 684339086. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:31,000][25689] Avg episode reward: [(0, '-4.917')] [2022-07-10 09:38:32,454][26022] Updated weights on worker 0-0, policy_version 668313 (0.00096) [2022-07-10 09:38:34,590][26022] Updated weights on worker 0-0, policy_version 668323 (0.00089) [2022-07-10 09:38:36,012][25689] Fps is (10 sec: 5515.1, 60 sec: 5551.9, 300 sec: 5528.4). Total num frames: 684370944. Throughput: 0: 5858.9. Samples: 684372710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:36,012][25689] Avg episode reward: [(0, '-6.075')] [2022-07-10 09:38:36,476][26022] Updated weights on worker 0-0, policy_version 668333 (0.00095) [2022-07-10 09:38:38,231][26022] Updated weights on worker 0-0, policy_version 668343 (0.00088) [2022-07-10 09:38:40,108][26022] Updated weights on worker 0-0, policy_version 668353 (0.00084) [2022-07-10 09:38:41,035][25689] Fps is (10 sec: 5714.3, 60 sec: 5586.4, 300 sec: 5539.5). Total num frames: 684399616. Throughput: 0: 5847.5. Samples: 684406466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:41,035][25689] Avg episode reward: [(0, '-5.010')] [2022-07-10 09:38:41,910][26022] Updated weights on worker 0-0, policy_version 668363 (0.00372) [2022-07-10 09:38:43,795][26022] Updated weights on worker 0-0, policy_version 668373 (0.00095) [2022-07-10 09:38:45,631][26022] Updated weights on worker 0-0, policy_version 668383 (0.00091) [2022-07-10 09:38:46,106][25689] Fps is (10 sec: 5579.5, 60 sec: 5570.4, 300 sec: 5538.8). Total num frames: 684427264. Throughput: 0: 5004.9. Samples: 684423262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:46,106][25689] Avg episode reward: [(0, '-5.326')] [2022-07-10 09:38:47,188][26022] Updated weights on worker 0-0, policy_version 668393 (0.00084) [2022-07-10 09:38:49,329][26022] Updated weights on worker 0-0, policy_version 668403 (0.00081) [2022-07-10 09:38:50,848][26022] Updated weights on worker 0-0, policy_version 668413 (0.00093) [2022-07-10 09:38:51,120][25689] Fps is (10 sec: 5482.7, 60 sec: 5553.1, 300 sec: 5542.6). Total num frames: 684454912. Throughput: 0: 5857.2. Samples: 684456936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 09:38:51,121][25689] Avg episode reward: [(0, '-6.090')] [2022-07-10 09:38:53,019][26022] Updated weights on worker 0-0, policy_version 668423 (0.00080) [2022-07-10 09:38:54,857][26022] Updated weights on worker 0-0, policy_version 668433 (0.00092) [2022-07-10 09:38:56,122][25689] Fps is (10 sec: 5418.4, 60 sec: 5539.8, 300 sec: 5542.7). Total num frames: 684481536. Throughput: 0: 5823.3. Samples: 684489818. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:38:56,123][25689] Avg episode reward: [(0, '-5.518')] [2022-07-10 09:38:56,566][26022] Updated weights on worker 0-0, policy_version 668443 (0.00087) [2022-07-10 09:38:58,530][26022] Updated weights on worker 0-0, policy_version 668453 (0.00095) [2022-07-10 09:39:00,448][26022] Updated weights on worker 0-0, policy_version 668463 (0.00089) [2022-07-10 09:39:01,153][25689] Fps is (10 sec: 5409.6, 60 sec: 5521.9, 300 sec: 5550.9). Total num frames: 684509184. Throughput: 0: 4962.5. Samples: 684506302. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:01,155][25689] Avg episode reward: [(0, '-3.944')] [2022-07-10 09:39:02,581][26022] Updated weights on worker 0-0, policy_version 668473 (0.00084) [2022-07-10 09:39:04,419][26022] Updated weights on worker 0-0, policy_version 668483 (0.00086) [2022-07-10 09:39:05,886][26022] Updated weights on worker 0-0, policy_version 668493 (0.00090) [2022-07-10 09:39:06,235][25689] Fps is (10 sec: 5468.0, 60 sec: 5542.2, 300 sec: 5553.0). Total num frames: 684536832. Throughput: 0: 5689.5. Samples: 684537786. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:06,235][25689] Avg episode reward: [(0, '-3.992')] [2022-07-10 09:39:08,125][26022] Updated weights on worker 0-0, policy_version 668503 (0.00090) [2022-07-10 09:39:09,780][26022] Updated weights on worker 0-0, policy_version 668513 (0.00093) [2022-07-10 09:39:11,255][25689] Fps is (10 sec: 5473.9, 60 sec: 5542.3, 300 sec: 5556.4). Total num frames: 684564480. Throughput: 0: 5708.9. Samples: 684571882. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:11,257][25689] Avg episode reward: [(0, '-3.658')] [2022-07-10 09:39:11,627][26022] Updated weights on worker 0-0, policy_version 668523 (0.00621) [2022-07-10 09:39:13,563][26022] Updated weights on worker 0-0, policy_version 668533 (0.00087) [2022-07-10 09:39:15,243][26022] Updated weights on worker 0-0, policy_version 668543 (0.00090) [2022-07-10 09:39:16,259][25689] Fps is (10 sec: 5618.1, 60 sec: 5561.0, 300 sec: 5556.8). Total num frames: 684593152. Throughput: 0: 4901.6. Samples: 684588524. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:16,260][25689] Avg episode reward: [(0, '-2.666')] [2022-07-10 09:39:17,223][26022] Updated weights on worker 0-0, policy_version 668553 (0.00084) [2022-07-10 09:39:18,992][26022] Updated weights on worker 0-0, policy_version 668563 (0.00720) [2022-07-10 09:39:20,836][26022] Updated weights on worker 0-0, policy_version 668573 (0.00091) [2022-07-10 09:39:21,286][25689] Fps is (10 sec: 5716.6, 60 sec: 5561.5, 300 sec: 5558.4). Total num frames: 684621824. Throughput: 0: 5750.0. Samples: 684622066. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:21,286][25689] Avg episode reward: [(0, '-1.755')] [2022-07-10 09:39:22,755][26022] Updated weights on worker 0-0, policy_version 668583 (0.00087) [2022-07-10 09:39:24,469][26022] Updated weights on worker 0-0, policy_version 668593 (0.00087) [2022-07-10 09:39:26,383][25689] Fps is (10 sec: 5462.3, 60 sec: 5510.2, 300 sec: 5557.2). Total num frames: 684648448. Throughput: 0: 5853.6. Samples: 684655724. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:26,383][25689] Avg episode reward: [(0, '-1.913')] [2022-07-10 09:39:26,468][26022] Updated weights on worker 0-0, policy_version 668603 (0.00777) [2022-07-10 09:39:28,171][26022] Updated weights on worker 0-0, policy_version 668613 (0.00099) [2022-07-10 09:39:30,111][26022] Updated weights on worker 0-0, policy_version 668623 (0.00712) [2022-07-10 09:39:31,443][25689] Fps is (10 sec: 5444.0, 60 sec: 5539.9, 300 sec: 5552.8). Total num frames: 684677120. Throughput: 0: 4961.9. Samples: 684672056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:31,444][25689] Avg episode reward: [(0, '-2.673')] [2022-07-10 09:39:31,722][26022] Updated weights on worker 0-0, policy_version 668633 (0.00085) [2022-07-10 09:39:33,771][26022] Updated weights on worker 0-0, policy_version 668643 (0.00083) [2022-07-10 09:39:35,601][26022] Updated weights on worker 0-0, policy_version 668653 (0.00092) [2022-07-10 09:39:36,466][25689] Fps is (10 sec: 5585.5, 60 sec: 5521.9, 300 sec: 5549.5). Total num frames: 684704768. Throughput: 0: 5808.4. Samples: 684705892. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:36,466][25689] Avg episode reward: [(0, '-3.288')] [2022-07-10 09:39:37,547][26022] Updated weights on worker 0-0, policy_version 668663 (0.00051) [2022-07-10 09:39:39,002][26022] Updated weights on worker 0-0, policy_version 668673 (0.00087) [2022-07-10 09:39:41,184][26022] Updated weights on worker 0-0, policy_version 668683 (0.00099) [2022-07-10 09:39:41,527][25689] Fps is (10 sec: 5585.4, 60 sec: 5518.5, 300 sec: 5556.6). Total num frames: 684733440. Throughput: 0: 5790.6. Samples: 684739272. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:41,527][25689] Avg episode reward: [(0, '-3.224')] [2022-07-10 09:39:42,794][26022] Updated weights on worker 0-0, policy_version 668693 (0.00092) [2022-07-10 09:39:44,725][26022] Updated weights on worker 0-0, policy_version 668703 (0.00085) [2022-07-10 09:39:46,468][26022] Updated weights on worker 0-0, policy_version 668713 (0.00092) [2022-07-10 09:39:46,651][25689] Fps is (10 sec: 5630.4, 60 sec: 5530.6, 300 sec: 5555.3). Total num frames: 684762112. Throughput: 0: 4948.9. Samples: 684756028. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:46,654][25689] Avg episode reward: [(0, '-4.348')] [2022-07-10 09:39:48,325][26022] Updated weights on worker 0-0, policy_version 668723 (0.00084) [2022-07-10 09:39:50,211][26022] Updated weights on worker 0-0, policy_version 668733 (0.00078) [2022-07-10 09:39:51,718][25689] Fps is (10 sec: 5626.5, 60 sec: 5542.6, 300 sec: 5550.7). Total num frames: 684790784. Throughput: 0: 5805.3. Samples: 684789760. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:51,721][25689] Avg episode reward: [(0, '-3.112')] [2022-07-10 09:39:51,882][26022] Updated weights on worker 0-0, policy_version 668743 (0.00089) [2022-07-10 09:39:53,703][26022] Updated weights on worker 0-0, policy_version 668753 (0.00080) [2022-07-10 09:39:55,717][26022] Updated weights on worker 0-0, policy_version 668763 (0.00082) [2022-07-10 09:39:56,745][25689] Fps is (10 sec: 5680.7, 60 sec: 5574.1, 300 sec: 5557.8). Total num frames: 684819456. Throughput: 0: 5796.5. Samples: 684823440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:39:56,745][25689] Avg episode reward: [(0, '-3.456')] [2022-07-10 09:39:57,384][26022] Updated weights on worker 0-0, policy_version 668773 (0.00086) [2022-07-10 09:39:59,488][26022] Updated weights on worker 0-0, policy_version 668783 (0.00079) [2022-07-10 09:40:01,181][26022] Updated weights on worker 0-0, policy_version 668793 (0.00095) [2022-07-10 09:40:01,830][25689] Fps is (10 sec: 5468.7, 60 sec: 5552.3, 300 sec: 5560.9). Total num frames: 684846080. Throughput: 0: 4970.8. Samples: 684840196. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:01,830][25689] Avg episode reward: [(0, '-4.984')] [2022-07-10 09:40:03,527][26022] Updated weights on worker 0-0, policy_version 668803 (0.00087) [2022-07-10 09:40:05,147][26022] Updated weights on worker 0-0, policy_version 668813 (0.00111) [2022-07-10 09:40:07,007][25689] Fps is (10 sec: 5288.5, 60 sec: 5543.6, 300 sec: 5551.2). Total num frames: 684873728. Throughput: 0: 5677.9. Samples: 684871612. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:07,008][25689] Avg episode reward: [(0, '-5.316')] [2022-07-10 09:40:07,193][26022] Updated weights on worker 0-0, policy_version 668823 (0.00086) [2022-07-10 09:40:09,004][26022] Updated weights on worker 0-0, policy_version 668833 (0.00090) [2022-07-10 09:40:10,693][26022] Updated weights on worker 0-0, policy_version 668843 (0.00094) [2022-07-10 09:40:12,036][25689] Fps is (10 sec: 5417.8, 60 sec: 5542.7, 300 sec: 5551.7). Total num frames: 684901376. Throughput: 0: 5683.0. Samples: 684905226. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:12,038][25689] Avg episode reward: [(0, '-4.676')] [2022-07-10 09:40:12,413][26022] Updated weights on worker 0-0, policy_version 668853 (0.00080) [2022-07-10 09:40:14,470][26022] Updated weights on worker 0-0, policy_version 668863 (0.00094) [2022-07-10 09:40:16,165][26022] Updated weights on worker 0-0, policy_version 668873 (0.00485) [2022-07-10 09:40:17,047][25689] Fps is (10 sec: 5609.9, 60 sec: 5542.2, 300 sec: 5553.1). Total num frames: 684930048. Throughput: 0: 5699.9. Samples: 684939158. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:17,048][25689] Avg episode reward: [(0, '-4.603')] [2022-07-10 09:40:18,028][26022] Updated weights on worker 0-0, policy_version 668883 (0.00089) [2022-07-10 09:40:19,800][26022] Updated weights on worker 0-0, policy_version 668893 (0.00084) [2022-07-10 09:40:20,839][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:40:20,855][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000668898_684951552.pth [2022-07-10 09:40:20,855][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000666944_682950656.pth [2022-07-10 09:40:20,856][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000668898_684951552.pth.milestone [2022-07-10 09:40:21,647][26022] Updated weights on worker 0-0, policy_version 668903 (0.00090) [2022-07-10 09:40:22,064][25689] Fps is (10 sec: 5616.3, 60 sec: 5526.2, 300 sec: 5553.7). Total num frames: 684957696. Throughput: 0: 5725.0. Samples: 684956038. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:22,065][25689] Avg episode reward: [(0, '-5.324')] [2022-07-10 09:40:23,410][26022] Updated weights on worker 0-0, policy_version 668913 (0.00087) [2022-07-10 09:40:25,492][26022] Updated weights on worker 0-0, policy_version 668923 (0.00094) [2022-07-10 09:40:27,063][26022] Updated weights on worker 0-0, policy_version 668933 (0.00082) [2022-07-10 09:40:27,183][25689] Fps is (10 sec: 5657.2, 60 sec: 5574.7, 300 sec: 5552.9). Total num frames: 684987392. Throughput: 0: 5849.0. Samples: 684989622. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:27,184][25689] Avg episode reward: [(0, '-5.455')] [2022-07-10 09:40:29,066][26022] Updated weights on worker 0-0, policy_version 668943 (0.00082) [2022-07-10 09:40:30,892][26022] Updated weights on worker 0-0, policy_version 668953 (0.00085) [2022-07-10 09:40:32,208][25689] Fps is (10 sec: 5653.2, 60 sec: 5561.1, 300 sec: 5553.4). Total num frames: 685015040. Throughput: 0: 5837.4. Samples: 685022976. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:32,208][25689] Avg episode reward: [(0, '-4.077')] [2022-07-10 09:40:32,803][26022] Updated weights on worker 0-0, policy_version 668963 (0.00104) [2022-07-10 09:40:34,552][26022] Updated weights on worker 0-0, policy_version 668973 (0.00087) [2022-07-10 09:40:36,380][26022] Updated weights on worker 0-0, policy_version 668983 (0.00089) [2022-07-10 09:40:37,224][25689] Fps is (10 sec: 5609.3, 60 sec: 5578.6, 300 sec: 5553.8). Total num frames: 685043712. Throughput: 0: 5004.6. Samples: 685040136. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:37,224][25689] Avg episode reward: [(0, '-3.260')] [2022-07-10 09:40:38,130][26022] Updated weights on worker 0-0, policy_version 668993 (0.00088) [2022-07-10 09:40:40,088][26022] Updated weights on worker 0-0, policy_version 669003 (0.00081) [2022-07-10 09:40:41,709][26022] Updated weights on worker 0-0, policy_version 669013 (0.00097) [2022-07-10 09:40:42,275][25689] Fps is (10 sec: 5594.7, 60 sec: 5562.7, 300 sec: 5554.0). Total num frames: 685071360. Throughput: 0: 5840.6. Samples: 685074078. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:42,275][25689] Avg episode reward: [(0, '-3.755')] [2022-07-10 09:40:43,678][26022] Updated weights on worker 0-0, policy_version 669023 (0.00093) [2022-07-10 09:40:45,480][26022] Updated weights on worker 0-0, policy_version 669033 (0.01150) [2022-07-10 09:40:47,098][26022] Updated weights on worker 0-0, policy_version 669043 (0.00087) [2022-07-10 09:40:47,409][25689] Fps is (10 sec: 5629.9, 60 sec: 5578.5, 300 sec: 5555.9). Total num frames: 685101056. Throughput: 0: 5857.8. Samples: 685108102. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:47,410][25689] Avg episode reward: [(0, '-4.258')] [2022-07-10 09:40:49,143][26022] Updated weights on worker 0-0, policy_version 669053 (0.00089) [2022-07-10 09:40:50,703][26022] Updated weights on worker 0-0, policy_version 669063 (0.00089) [2022-07-10 09:40:52,501][25689] Fps is (10 sec: 5607.2, 60 sec: 5559.5, 300 sec: 5554.7). Total num frames: 685128704. Throughput: 0: 5016.0. Samples: 685124768. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:52,502][25689] Avg episode reward: [(0, '-5.217')] [2022-07-10 09:40:52,641][26022] Updated weights on worker 0-0, policy_version 669073 (0.00080) [2022-07-10 09:40:54,419][26022] Updated weights on worker 0-0, policy_version 669083 (0.00092) [2022-07-10 09:40:56,346][26022] Updated weights on worker 0-0, policy_version 669093 (0.00089) [2022-07-10 09:40:57,521][25689] Fps is (10 sec: 5569.7, 60 sec: 5560.1, 300 sec: 5554.5). Total num frames: 685157376. Throughput: 0: 5831.7. Samples: 685158506. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:40:57,522][25689] Avg episode reward: [(0, '-5.401')] [2022-07-10 09:40:58,341][26022] Updated weights on worker 0-0, policy_version 669103 (0.00088) [2022-07-10 09:40:59,959][26022] Updated weights on worker 0-0, policy_version 669113 (0.00079) [2022-07-10 09:41:02,131][26022] Updated weights on worker 0-0, policy_version 669123 (0.00104) [2022-07-10 09:41:02,528][25689] Fps is (10 sec: 5514.7, 60 sec: 5567.2, 300 sec: 5558.8). Total num frames: 685184000. Throughput: 0: 5732.2. Samples: 685190176. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:02,529][25689] Avg episode reward: [(0, '-5.172')] [2022-07-10 09:41:04,104][26022] Updated weights on worker 0-0, policy_version 669133 (0.00753) [2022-07-10 09:41:05,918][26022] Updated weights on worker 0-0, policy_version 669143 (0.00094) [2022-07-10 09:41:07,633][25689] Fps is (10 sec: 5367.5, 60 sec: 5573.9, 300 sec: 5557.2). Total num frames: 685211648. Throughput: 0: 4876.5. Samples: 685206720. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:07,633][25689] Avg episode reward: [(0, '-5.151')] [2022-07-10 09:41:07,719][26022] Updated weights on worker 0-0, policy_version 669153 (0.00081) [2022-07-10 09:41:09,676][26022] Updated weights on worker 0-0, policy_version 669163 (0.00088) [2022-07-10 09:41:11,383][26022] Updated weights on worker 0-0, policy_version 669173 (0.00084) [2022-07-10 09:41:12,684][25689] Fps is (10 sec: 5545.9, 60 sec: 5588.8, 300 sec: 5553.2). Total num frames: 685240320. Throughput: 0: 5731.5. Samples: 685240442. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:12,684][25689] Avg episode reward: [(0, '-4.523')] [2022-07-10 09:41:13,341][26022] Updated weights on worker 0-0, policy_version 669183 (0.00092) [2022-07-10 09:41:14,991][26022] Updated weights on worker 0-0, policy_version 669193 (0.00085) [2022-07-10 09:41:16,802][26022] Updated weights on worker 0-0, policy_version 669203 (0.00085) [2022-07-10 09:41:17,747][25689] Fps is (10 sec: 5669.5, 60 sec: 5583.9, 300 sec: 5559.1). Total num frames: 685268992. Throughput: 0: 5739.1. Samples: 685274584. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:17,748][25689] Avg episode reward: [(0, '-4.921')] [2022-07-10 09:41:18,511][26022] Updated weights on worker 0-0, policy_version 669213 (0.00090) [2022-07-10 09:41:20,519][26022] Updated weights on worker 0-0, policy_version 669223 (0.00082) [2022-07-10 09:41:22,211][26022] Updated weights on worker 0-0, policy_version 669233 (0.00084) [2022-07-10 09:41:22,763][25689] Fps is (10 sec: 5587.8, 60 sec: 5584.1, 300 sec: 5554.2). Total num frames: 685296640. Throughput: 0: 5004.7. Samples: 685291442. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:22,763][25689] Avg episode reward: [(0, '-3.604')] [2022-07-10 09:41:24,063][26022] Updated weights on worker 0-0, policy_version 669243 (0.00085) [2022-07-10 09:41:25,853][26022] Updated weights on worker 0-0, policy_version 669253 (0.00088) [2022-07-10 09:41:27,713][26022] Updated weights on worker 0-0, policy_version 669263 (0.00084) [2022-07-10 09:41:27,807][25689] Fps is (10 sec: 5598.3, 60 sec: 5574.0, 300 sec: 5557.6). Total num frames: 685325312. Throughput: 0: 5863.7. Samples: 685325018. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:27,809][25689] Avg episode reward: [(0, '-3.036')] [2022-07-10 09:41:29,735][26022] Updated weights on worker 0-0, policy_version 669273 (0.00092) [2022-07-10 09:41:31,518][26022] Updated weights on worker 0-0, policy_version 669283 (0.00089) [2022-07-10 09:41:32,817][25689] Fps is (10 sec: 5601.7, 60 sec: 5575.4, 300 sec: 5557.5). Total num frames: 685352960. Throughput: 0: 5871.1. Samples: 685358646. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:32,817][25689] Avg episode reward: [(0, '-4.596')] [2022-07-10 09:41:33,333][26022] Updated weights on worker 0-0, policy_version 669293 (0.00099) [2022-07-10 09:41:35,015][26022] Updated weights on worker 0-0, policy_version 669303 (0.00085) [2022-07-10 09:41:36,806][26022] Updated weights on worker 0-0, policy_version 669313 (0.00089) [2022-07-10 09:41:37,829][25689] Fps is (10 sec: 5721.8, 60 sec: 5592.6, 300 sec: 5564.2). Total num frames: 685382656. Throughput: 0: 5019.1. Samples: 685375380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:37,831][25689] Avg episode reward: [(0, '-4.757')] [2022-07-10 09:41:38,773][26022] Updated weights on worker 0-0, policy_version 669323 (0.00076) [2022-07-10 09:41:40,563][26022] Updated weights on worker 0-0, policy_version 669333 (0.00088) [2022-07-10 09:41:42,331][26022] Updated weights on worker 0-0, policy_version 669343 (0.00087) [2022-07-10 09:41:42,865][25689] Fps is (10 sec: 5604.7, 60 sec: 5577.1, 300 sec: 5555.9). Total num frames: 685409280. Throughput: 0: 5861.3. Samples: 685409270. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:42,867][25689] Avg episode reward: [(0, '-5.327')] [2022-07-10 09:41:44,043][26022] Updated weights on worker 0-0, policy_version 669353 (0.00086) [2022-07-10 09:41:46,116][26022] Updated weights on worker 0-0, policy_version 669363 (0.00086) [2022-07-10 09:41:47,692][26022] Updated weights on worker 0-0, policy_version 669373 (0.00089) [2022-07-10 09:41:47,925][25689] Fps is (10 sec: 5578.5, 60 sec: 5584.0, 300 sec: 5565.5). Total num frames: 685438976. Throughput: 0: 5864.0. Samples: 685442990. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:47,926][25689] Avg episode reward: [(0, '-5.474')] [2022-07-10 09:41:49,566][26022] Updated weights on worker 0-0, policy_version 669383 (0.00093) [2022-07-10 09:41:51,383][26022] Updated weights on worker 0-0, policy_version 669393 (0.00081) [2022-07-10 09:41:53,003][25689] Fps is (10 sec: 5656.6, 60 sec: 5585.3, 300 sec: 5561.2). Total num frames: 685466624. Throughput: 0: 5023.0. Samples: 685460042. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:53,003][25689] Avg episode reward: [(0, '-5.929')] [2022-07-10 09:41:53,199][26022] Updated weights on worker 0-0, policy_version 669403 (0.00090) [2022-07-10 09:41:55,117][26022] Updated weights on worker 0-0, policy_version 669413 (0.00086) [2022-07-10 09:41:56,934][26022] Updated weights on worker 0-0, policy_version 669423 (0.00085) [2022-07-10 09:41:58,042][25689] Fps is (10 sec: 5465.9, 60 sec: 5566.6, 300 sec: 5558.2). Total num frames: 685494272. Throughput: 0: 5869.4. Samples: 685494014. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:41:58,042][25689] Avg episode reward: [(0, '-5.763')] [2022-07-10 09:41:58,556][26022] Updated weights on worker 0-0, policy_version 669433 (0.00086) [2022-07-10 09:42:00,655][26022] Updated weights on worker 0-0, policy_version 669443 (0.00084) [2022-07-10 09:42:02,192][26022] Updated weights on worker 0-0, policy_version 669453 (0.00082) [2022-07-10 09:42:03,045][25689] Fps is (10 sec: 5404.6, 60 sec: 5567.0, 300 sec: 5566.7). Total num frames: 685520896. Throughput: 0: 5861.1. Samples: 685527544. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:42:03,045][25689] Avg episode reward: [(0, '-4.304')] [2022-07-10 09:42:04,552][26022] Updated weights on worker 0-0, policy_version 669463 (0.00089) [2022-07-10 09:42:06,685][26022] Updated weights on worker 0-0, policy_version 669473 (0.00086) [2022-07-10 09:42:08,115][25689] Fps is (10 sec: 5489.5, 60 sec: 5587.1, 300 sec: 5558.9). Total num frames: 685549568. Throughput: 0: 5742.0. Samples: 685558920. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 09:42:08,116][25689] Avg episode reward: [(0, '-5.313')] [2022-07-10 09:42:08,404][26022] Updated weights on worker 0-0, policy_version 669483 (0.00079) [2022-07-10 09:42:10,252][26022] Updated weights on worker 0-0, policy_version 669493 (0.00085) [2022-07-10 09:42:11,803][26022] Updated weights on worker 0-0, policy_version 669503 (0.00086) [2022-07-10 09:42:13,139][25689] Fps is (10 sec: 5579.5, 60 sec: 5572.6, 300 sec: 5562.0). Total num frames: 685577216. Throughput: 0: 5748.8. Samples: 685575800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:13,139][25689] Avg episode reward: [(0, '-5.011')] [2022-07-10 09:42:13,769][26022] Updated weights on worker 0-0, policy_version 669513 (0.00087) [2022-07-10 09:42:15,540][26022] Updated weights on worker 0-0, policy_version 669523 (0.00090) [2022-07-10 09:42:17,435][26022] Updated weights on worker 0-0, policy_version 669533 (0.00090) [2022-07-10 09:42:18,155][25689] Fps is (10 sec: 5609.7, 60 sec: 5577.0, 300 sec: 5561.9). Total num frames: 685605888. Throughput: 0: 5744.1. Samples: 685609544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:18,155][25689] Avg episode reward: [(0, '-4.687')] [2022-07-10 09:42:19,242][26022] Updated weights on worker 0-0, policy_version 669543 (0.00086) [2022-07-10 09:42:21,006][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:42:21,018][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000669553_685622272.pth [2022-07-10 09:42:21,018][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000667596_683618304.pth [2022-07-10 09:42:21,024][26022] Updated weights on worker 0-0, policy_version 669553 (0.00082) [2022-07-10 09:42:22,892][26022] Updated weights on worker 0-0, policy_version 669563 (0.00089) [2022-07-10 09:42:23,176][25689] Fps is (10 sec: 5611.3, 60 sec: 5576.5, 300 sec: 5564.0). Total num frames: 685633536. Throughput: 0: 5774.7. Samples: 685643794. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:23,176][25689] Avg episode reward: [(0, '-5.536')] [2022-07-10 09:42:24,515][26022] Updated weights on worker 0-0, policy_version 669573 (0.00106) [2022-07-10 09:42:26,507][26022] Updated weights on worker 0-0, policy_version 669583 (0.00076) [2022-07-10 09:42:28,289][25689] Fps is (10 sec: 5658.6, 60 sec: 5587.2, 300 sec: 5566.3). Total num frames: 685663232. Throughput: 0: 5036.4. Samples: 685660522. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:28,289][25689] Avg episode reward: [(0, '-5.129')] [2022-07-10 09:42:28,296][26022] Updated weights on worker 0-0, policy_version 669593 (0.00094) [2022-07-10 09:42:30,035][26022] Updated weights on worker 0-0, policy_version 669603 (0.00083) [2022-07-10 09:42:32,039][26022] Updated weights on worker 0-0, policy_version 669613 (0.00083) [2022-07-10 09:42:33,333][25689] Fps is (10 sec: 5746.7, 60 sec: 5600.9, 300 sec: 5563.2). Total num frames: 685691904. Throughput: 0: 5864.3. Samples: 685694222. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:33,333][25689] Avg episode reward: [(0, '-5.368')] [2022-07-10 09:42:33,727][26022] Updated weights on worker 0-0, policy_version 669623 (0.00082) [2022-07-10 09:42:35,795][26022] Updated weights on worker 0-0, policy_version 669633 (0.00092) [2022-07-10 09:42:37,296][26022] Updated weights on worker 0-0, policy_version 669643 (0.00080) [2022-07-10 09:42:38,338][25689] Fps is (10 sec: 5502.3, 60 sec: 5550.8, 300 sec: 5563.7). Total num frames: 685718528. Throughput: 0: 5868.4. Samples: 685727990. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:38,339][25689] Avg episode reward: [(0, '-5.212')] [2022-07-10 09:42:39,367][26022] Updated weights on worker 0-0, policy_version 669653 (0.00109) [2022-07-10 09:42:40,870][26022] Updated weights on worker 0-0, policy_version 669663 (0.00089) [2022-07-10 09:42:42,789][26022] Updated weights on worker 0-0, policy_version 669673 (0.00090) [2022-07-10 09:42:43,442][25689] Fps is (10 sec: 5571.3, 60 sec: 5595.3, 300 sec: 5566.7). Total num frames: 685748224. Throughput: 0: 4993.3. Samples: 685744988. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:43,442][25689] Avg episode reward: [(0, '-4.330')] [2022-07-10 09:42:44,832][26022] Updated weights on worker 0-0, policy_version 669683 (0.00099) [2022-07-10 09:42:46,407][26022] Updated weights on worker 0-0, policy_version 669693 (0.00090) [2022-07-10 09:42:48,485][25689] Fps is (10 sec: 5550.4, 60 sec: 5546.1, 300 sec: 5559.2). Total num frames: 685774848. Throughput: 0: 5842.2. Samples: 685778516. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:48,487][25689] Avg episode reward: [(0, '-3.549')] [2022-07-10 09:42:48,535][26022] Updated weights on worker 0-0, policy_version 669703 (0.00096) [2022-07-10 09:42:50,073][26022] Updated weights on worker 0-0, policy_version 669713 (0.00082) [2022-07-10 09:42:51,992][26022] Updated weights on worker 0-0, policy_version 669723 (0.00083) [2022-07-10 09:42:53,557][25689] Fps is (10 sec: 5567.5, 60 sec: 5580.4, 300 sec: 5565.5). Total num frames: 685804544. Throughput: 0: 5854.0. Samples: 685812618. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:53,558][25689] Avg episode reward: [(0, '-3.843')] [2022-07-10 09:42:53,787][26022] Updated weights on worker 0-0, policy_version 669733 (0.00101) [2022-07-10 09:42:55,333][26022] Updated weights on worker 0-0, policy_version 669743 (0.00087) [2022-07-10 09:42:57,449][26022] Updated weights on worker 0-0, policy_version 669753 (0.00086) [2022-07-10 09:42:58,609][25689] Fps is (10 sec: 5967.6, 60 sec: 5629.9, 300 sec: 5571.8). Total num frames: 685835264. Throughput: 0: 5012.9. Samples: 685829606. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:42:58,610][25689] Avg episode reward: [(0, '-4.809')] [2022-07-10 09:42:59,070][26022] Updated weights on worker 0-0, policy_version 669763 (0.00087) [2022-07-10 09:43:01,076][26022] Updated weights on worker 0-0, policy_version 669773 (0.00111) [2022-07-10 09:43:03,313][26022] Updated weights on worker 0-0, policy_version 669783 (0.00087) [2022-07-10 09:43:03,623][25689] Fps is (10 sec: 5493.7, 60 sec: 5595.2, 300 sec: 5566.9). Total num frames: 685859840. Throughput: 0: 5801.8. Samples: 685862074. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:03,625][25689] Avg episode reward: [(0, '-4.359')] [2022-07-10 09:43:05,073][26022] Updated weights on worker 0-0, policy_version 669793 (0.00114) [2022-07-10 09:43:07,012][26022] Updated weights on worker 0-0, policy_version 669803 (0.00087) [2022-07-10 09:43:08,731][25689] Fps is (10 sec: 5260.7, 60 sec: 5591.6, 300 sec: 5568.7). Total num frames: 685888512. Throughput: 0: 5744.7. Samples: 685894822. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:08,732][25689] Avg episode reward: [(0, '-4.151')] [2022-07-10 09:43:08,738][26022] Updated weights on worker 0-0, policy_version 669813 (0.00092) [2022-07-10 09:43:10,577][26022] Updated weights on worker 0-0, policy_version 669823 (0.00052) [2022-07-10 09:43:12,439][26022] Updated weights on worker 0-0, policy_version 669833 (0.00090) [2022-07-10 09:43:13,751][25689] Fps is (10 sec: 5459.5, 60 sec: 5575.1, 300 sec: 5565.3). Total num frames: 685915136. Throughput: 0: 4906.3. Samples: 685911694. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:13,751][25689] Avg episode reward: [(0, '-4.038')] [2022-07-10 09:43:14,251][26022] Updated weights on worker 0-0, policy_version 669843 (0.00609) [2022-07-10 09:43:16,041][26022] Updated weights on worker 0-0, policy_version 669853 (0.00118) [2022-07-10 09:43:17,761][26022] Updated weights on worker 0-0, policy_version 669863 (0.00084) [2022-07-10 09:43:18,821][25689] Fps is (10 sec: 5581.6, 60 sec: 5587.0, 300 sec: 5568.1). Total num frames: 685944832. Throughput: 0: 5740.9. Samples: 685945642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:18,822][25689] Avg episode reward: [(0, '-3.283')] [2022-07-10 09:43:19,737][26022] Updated weights on worker 0-0, policy_version 669873 (0.00091) [2022-07-10 09:43:21,337][26022] Updated weights on worker 0-0, policy_version 669883 (0.00085) [2022-07-10 09:43:23,431][26022] Updated weights on worker 0-0, policy_version 669893 (0.00087) [2022-07-10 09:43:23,887][25689] Fps is (10 sec: 5758.5, 60 sec: 5599.7, 300 sec: 5565.1). Total num frames: 685973504. Throughput: 0: 5798.2. Samples: 685979570. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:23,888][25689] Avg episode reward: [(0, '-2.009')] [2022-07-10 09:43:25,109][26022] Updated weights on worker 0-0, policy_version 669903 (0.00090) [2022-07-10 09:43:27,093][26022] Updated weights on worker 0-0, policy_version 669913 (0.00095) [2022-07-10 09:43:28,848][26022] Updated weights on worker 0-0, policy_version 669923 (0.00104) [2022-07-10 09:43:28,934][25689] Fps is (10 sec: 5569.3, 60 sec: 5572.1, 300 sec: 5567.9). Total num frames: 686001152. Throughput: 0: 5010.3. Samples: 685996044. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:28,935][25689] Avg episode reward: [(0, '-1.904')] [2022-07-10 09:43:30,540][26022] Updated weights on worker 0-0, policy_version 669933 (0.00092) [2022-07-10 09:43:32,592][26022] Updated weights on worker 0-0, policy_version 669943 (0.00089) [2022-07-10 09:43:33,944][25689] Fps is (10 sec: 5600.3, 60 sec: 5575.2, 300 sec: 5568.0). Total num frames: 686029824. Throughput: 0: 5843.0. Samples: 686029678. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:33,944][25689] Avg episode reward: [(0, '-1.486')] [2022-07-10 09:43:34,172][26022] Updated weights on worker 0-0, policy_version 669953 (0.00090) [2022-07-10 09:43:36,391][26022] Updated weights on worker 0-0, policy_version 669963 (0.00083) [2022-07-10 09:43:37,718][26022] Updated weights on worker 0-0, policy_version 669973 (0.00092) [2022-07-10 09:43:38,948][25689] Fps is (10 sec: 5521.6, 60 sec: 5575.3, 300 sec: 5561.5). Total num frames: 686056448. Throughput: 0: 5834.2. Samples: 686063068. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:38,949][25689] Avg episode reward: [(0, '-1.455')] [2022-07-10 09:43:39,845][26022] Updated weights on worker 0-0, policy_version 669983 (0.00089) [2022-07-10 09:43:41,574][26022] Updated weights on worker 0-0, policy_version 669993 (0.00083) [2022-07-10 09:43:43,607][26022] Updated weights on worker 0-0, policy_version 670003 (0.00084) [2022-07-10 09:43:43,950][25689] Fps is (10 sec: 5628.7, 60 sec: 5584.7, 300 sec: 5569.6). Total num frames: 686086144. Throughput: 0: 4996.6. Samples: 686079814. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:43,950][25689] Avg episode reward: [(0, '-1.801')] [2022-07-10 09:43:45,490][26022] Updated weights on worker 0-0, policy_version 670013 (0.00097) [2022-07-10 09:43:47,301][26022] Updated weights on worker 0-0, policy_version 670023 (0.00086) [2022-07-10 09:43:49,057][25689] Fps is (10 sec: 5571.3, 60 sec: 5578.8, 300 sec: 5564.5). Total num frames: 686112768. Throughput: 0: 5827.5. Samples: 686113314. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:49,058][25689] Avg episode reward: [(0, '-2.283')] [2022-07-10 09:43:49,270][26022] Updated weights on worker 0-0, policy_version 670033 (0.00869) [2022-07-10 09:43:50,955][26022] Updated weights on worker 0-0, policy_version 670043 (0.00096) [2022-07-10 09:43:52,827][26022] Updated weights on worker 0-0, policy_version 670053 (0.00087) [2022-07-10 09:43:54,084][25689] Fps is (10 sec: 5456.1, 60 sec: 5566.1, 300 sec: 5570.9). Total num frames: 686141440. Throughput: 0: 5812.1. Samples: 686146738. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:54,085][25689] Avg episode reward: [(0, '-3.369')] [2022-07-10 09:43:54,560][26022] Updated weights on worker 0-0, policy_version 670063 (0.00092) [2022-07-10 09:43:56,512][26022] Updated weights on worker 0-0, policy_version 670073 (0.00085) [2022-07-10 09:43:58,312][26022] Updated weights on worker 0-0, policy_version 670083 (0.00087) [2022-07-10 09:43:59,098][25689] Fps is (10 sec: 5609.3, 60 sec: 5518.8, 300 sec: 5571.2). Total num frames: 686169088. Throughput: 0: 4988.3. Samples: 686163580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:43:59,099][25689] Avg episode reward: [(0, '-6.164')] [2022-07-10 09:43:59,908][26022] Updated weights on worker 0-0, policy_version 670093 (0.00091) [2022-07-10 09:44:02,060][26022] Updated weights on worker 0-0, policy_version 670103 (0.00097) [2022-07-10 09:44:04,101][26022] Updated weights on worker 0-0, policy_version 670113 (0.00096) [2022-07-10 09:44:04,119][25689] Fps is (10 sec: 5408.6, 60 sec: 5552.0, 300 sec: 5568.9). Total num frames: 686195712. Throughput: 0: 5723.6. Samples: 686195254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:04,119][25689] Avg episode reward: [(0, '-6.134')] [2022-07-10 09:44:05,973][26022] Updated weights on worker 0-0, policy_version 670123 (0.00082) [2022-07-10 09:44:07,741][26022] Updated weights on worker 0-0, policy_version 670133 (0.00095) [2022-07-10 09:44:09,154][25689] Fps is (10 sec: 5498.8, 60 sec: 5558.7, 300 sec: 5572.0). Total num frames: 686224384. Throughput: 0: 5739.1. Samples: 686228650. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:09,156][25689] Avg episode reward: [(0, '-5.278')] [2022-07-10 09:44:09,627][26022] Updated weights on worker 0-0, policy_version 670143 (0.00089) [2022-07-10 09:44:11,400][26022] Updated weights on worker 0-0, policy_version 670153 (0.00092) [2022-07-10 09:44:13,356][26022] Updated weights on worker 0-0, policy_version 670163 (0.00090) [2022-07-10 09:44:14,191][25689] Fps is (10 sec: 5490.1, 60 sec: 5557.2, 300 sec: 5564.6). Total num frames: 686251008. Throughput: 0: 4910.8. Samples: 686245476. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:14,191][25689] Avg episode reward: [(0, '-6.114')] [2022-07-10 09:44:15,227][26022] Updated weights on worker 0-0, policy_version 670173 (0.00088) [2022-07-10 09:44:17,037][26022] Updated weights on worker 0-0, policy_version 670183 (0.00083) [2022-07-10 09:44:18,680][26022] Updated weights on worker 0-0, policy_version 670193 (0.00090) [2022-07-10 09:44:19,202][25689] Fps is (10 sec: 5503.1, 60 sec: 5545.6, 300 sec: 5564.8). Total num frames: 686279680. Throughput: 0: 5743.0. Samples: 686279038. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:19,203][25689] Avg episode reward: [(0, '-5.405')] [2022-07-10 09:44:20,726][26022] Updated weights on worker 0-0, policy_version 670203 (0.00094) [2022-07-10 09:44:21,100][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:44:21,114][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000670206_686290944.pth [2022-07-10 09:44:21,115][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000668248_684285952.pth [2022-07-10 09:44:22,488][26022] Updated weights on worker 0-0, policy_version 670213 (0.00110) [2022-07-10 09:44:24,223][25689] Fps is (10 sec: 5512.0, 60 sec: 5515.8, 300 sec: 5566.3). Total num frames: 686306304. Throughput: 0: 5828.6. Samples: 686312432. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:24,224][25689] Avg episode reward: [(0, '-3.682')] [2022-07-10 09:44:24,469][26022] Updated weights on worker 0-0, policy_version 670223 (0.00073) [2022-07-10 09:44:26,147][26022] Updated weights on worker 0-0, policy_version 670233 (0.00092) [2022-07-10 09:44:27,937][26022] Updated weights on worker 0-0, policy_version 670243 (0.00084) [2022-07-10 09:44:29,319][25689] Fps is (10 sec: 5566.7, 60 sec: 5545.2, 300 sec: 5569.0). Total num frames: 686336000. Throughput: 0: 4984.1. Samples: 686329156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:29,320][25689] Avg episode reward: [(0, '-3.024')] [2022-07-10 09:44:29,735][26022] Updated weights on worker 0-0, policy_version 670253 (0.00093) [2022-07-10 09:44:31,691][26022] Updated weights on worker 0-0, policy_version 670263 (0.00490) [2022-07-10 09:44:33,464][26022] Updated weights on worker 0-0, policy_version 670273 (0.00089) [2022-07-10 09:44:34,356][25689] Fps is (10 sec: 5658.8, 60 sec: 5525.7, 300 sec: 5568.8). Total num frames: 686363648. Throughput: 0: 5816.2. Samples: 686362762. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:34,358][25689] Avg episode reward: [(0, '-3.318')] [2022-07-10 09:44:35,295][26022] Updated weights on worker 0-0, policy_version 670283 (0.00088) [2022-07-10 09:44:37,229][26022] Updated weights on worker 0-0, policy_version 670293 (0.00608) [2022-07-10 09:44:38,959][26022] Updated weights on worker 0-0, policy_version 670303 (0.00087) [2022-07-10 09:44:39,375][25689] Fps is (10 sec: 5600.7, 60 sec: 5558.3, 300 sec: 5569.5). Total num frames: 686392320. Throughput: 0: 5815.0. Samples: 686396344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:39,376][25689] Avg episode reward: [(0, '-3.608')] [2022-07-10 09:44:40,902][26022] Updated weights on worker 0-0, policy_version 670313 (0.00098) [2022-07-10 09:44:42,627][26022] Updated weights on worker 0-0, policy_version 670323 (0.00103) [2022-07-10 09:44:44,393][25689] Fps is (10 sec: 5509.4, 60 sec: 5506.0, 300 sec: 5564.6). Total num frames: 686418944. Throughput: 0: 4995.8. Samples: 686413198. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:44,394][25689] Avg episode reward: [(0, '-2.914')] [2022-07-10 09:44:44,532][26022] Updated weights on worker 0-0, policy_version 670333 (0.00086) [2022-07-10 09:44:46,269][26022] Updated weights on worker 0-0, policy_version 670343 (0.00092) [2022-07-10 09:44:48,321][26022] Updated weights on worker 0-0, policy_version 670353 (0.00933) [2022-07-10 09:44:49,471][25689] Fps is (10 sec: 5680.3, 60 sec: 5576.5, 300 sec: 5571.3). Total num frames: 686449664. Throughput: 0: 5854.5. Samples: 686447130. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:49,472][25689] Avg episode reward: [(0, '-3.772')] [2022-07-10 09:44:49,764][26022] Updated weights on worker 0-0, policy_version 670363 (0.01460) [2022-07-10 09:44:51,981][26022] Updated weights on worker 0-0, policy_version 670373 (0.00083) [2022-07-10 09:44:53,423][26022] Updated weights on worker 0-0, policy_version 670383 (0.00083) [2022-07-10 09:44:54,513][25689] Fps is (10 sec: 5767.5, 60 sec: 5558.2, 300 sec: 5567.6). Total num frames: 686477312. Throughput: 0: 5861.9. Samples: 686480918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:54,515][25689] Avg episode reward: [(0, '-3.045')] [2022-07-10 09:44:55,377][26022] Updated weights on worker 0-0, policy_version 670393 (0.00084) [2022-07-10 09:44:57,145][26022] Updated weights on worker 0-0, policy_version 670403 (0.00093) [2022-07-10 09:44:59,182][26022] Updated weights on worker 0-0, policy_version 670413 (0.00080) [2022-07-10 09:44:59,535][25689] Fps is (10 sec: 5595.7, 60 sec: 5574.3, 300 sec: 5575.7). Total num frames: 686505984. Throughput: 0: 5883.3. Samples: 686514952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:44:59,536][25689] Avg episode reward: [(0, '-2.008')] [2022-07-10 09:45:00,726][26022] Updated weights on worker 0-0, policy_version 670423 (0.00088) [2022-07-10 09:45:03,080][26022] Updated weights on worker 0-0, policy_version 670433 (0.00086) [2022-07-10 09:45:04,545][25689] Fps is (10 sec: 5512.1, 60 sec: 5575.3, 300 sec: 5575.4). Total num frames: 686532608. Throughput: 0: 5780.5. Samples: 686529686. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:45:04,547][25689] Avg episode reward: [(0, '-1.874')] [2022-07-10 09:45:04,776][26022] Updated weights on worker 0-0, policy_version 670443 (0.00091) [2022-07-10 09:45:06,775][26022] Updated weights on worker 0-0, policy_version 670453 (0.00088) [2022-07-10 09:45:08,450][26022] Updated weights on worker 0-0, policy_version 670463 (0.00096) [2022-07-10 09:45:09,593][25689] Fps is (10 sec: 5396.1, 60 sec: 5557.2, 300 sec: 5575.0). Total num frames: 686560256. Throughput: 0: 5767.7. Samples: 686563190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:45:09,593][25689] Avg episode reward: [(0, '-4.631')] [2022-07-10 09:45:10,414][26022] Updated weights on worker 0-0, policy_version 670473 (0.00094) [2022-07-10 09:45:12,042][26022] Updated weights on worker 0-0, policy_version 670483 (0.00101) [2022-07-10 09:45:14,039][26022] Updated weights on worker 0-0, policy_version 670493 (0.00107) [2022-07-10 09:45:14,653][25689] Fps is (10 sec: 5369.1, 60 sec: 5555.1, 300 sec: 5567.2). Total num frames: 686586880. Throughput: 0: 5760.5. Samples: 686596934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:45:14,654][25689] Avg episode reward: [(0, '-4.916')] [2022-07-10 09:45:15,574][26022] Updated weights on worker 0-0, policy_version 670503 (0.00087) [2022-07-10 09:45:17,783][26022] Updated weights on worker 0-0, policy_version 670513 (0.00093) [2022-07-10 09:45:19,371][26022] Updated weights on worker 0-0, policy_version 670523 (0.00087) [2022-07-10 09:45:19,662][25689] Fps is (10 sec: 5593.6, 60 sec: 5572.3, 300 sec: 5574.2). Total num frames: 686616576. Throughput: 0: 4915.4. Samples: 686613880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:45:19,662][25689] Avg episode reward: [(0, '-4.869')] [2022-07-10 09:45:21,493][26022] Updated weights on worker 0-0, policy_version 670533 (0.00082) [2022-07-10 09:45:22,951][26022] Updated weights on worker 0-0, policy_version 670543 (0.00088) [2022-07-10 09:45:24,663][25689] Fps is (10 sec: 5626.2, 60 sec: 5574.0, 300 sec: 5566.1). Total num frames: 686643200. Throughput: 0: 5850.7. Samples: 686647394. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 09:45:24,664][25689] Avg episode reward: [(0, '-5.880')] [2022-07-10 09:45:24,921][26022] Updated weights on worker 0-0, policy_version 670553 (0.00087) [2022-07-10 09:45:26,743][26022] Updated weights on worker 0-0, policy_version 670563 (0.00084) [2022-07-10 09:45:28,544][26022] Updated weights on worker 0-0, policy_version 670573 (0.00356) [2022-07-10 09:45:29,744][25689] Fps is (10 sec: 5585.9, 60 sec: 5575.5, 300 sec: 5571.9). Total num frames: 686672896. Throughput: 0: 5839.7. Samples: 686680866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:29,745][25689] Avg episode reward: [(0, '-6.050')] [2022-07-10 09:45:30,482][26022] Updated weights on worker 0-0, policy_version 670583 (0.00092) [2022-07-10 09:45:32,331][26022] Updated weights on worker 0-0, policy_version 670593 (0.00085) [2022-07-10 09:45:34,244][26022] Updated weights on worker 0-0, policy_version 670603 (0.00089) [2022-07-10 09:45:34,766][25689] Fps is (10 sec: 5777.5, 60 sec: 5593.8, 300 sec: 5571.8). Total num frames: 686701568. Throughput: 0: 5010.5. Samples: 686697710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:34,766][25689] Avg episode reward: [(0, '-6.807')] [2022-07-10 09:45:36,105][26022] Updated weights on worker 0-0, policy_version 670613 (0.00088) [2022-07-10 09:45:37,831][26022] Updated weights on worker 0-0, policy_version 670623 (0.00091) [2022-07-10 09:45:39,568][26022] Updated weights on worker 0-0, policy_version 670633 (0.00084) [2022-07-10 09:45:39,820][25689] Fps is (10 sec: 5589.7, 60 sec: 5573.6, 300 sec: 5571.8). Total num frames: 686729216. Throughput: 0: 5816.9. Samples: 686731138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:39,820][25689] Avg episode reward: [(0, '-6.205')] [2022-07-10 09:45:41,603][26022] Updated weights on worker 0-0, policy_version 670643 (0.00089) [2022-07-10 09:45:43,145][26022] Updated weights on worker 0-0, policy_version 670653 (0.00082) [2022-07-10 09:45:44,827][25689] Fps is (10 sec: 5394.3, 60 sec: 5574.6, 300 sec: 5563.8). Total num frames: 686755840. Throughput: 0: 5830.8. Samples: 686764964. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:44,827][25689] Avg episode reward: [(0, '-6.804')] [2022-07-10 09:45:45,199][26022] Updated weights on worker 0-0, policy_version 670663 (0.00092) [2022-07-10 09:45:46,999][26022] Updated weights on worker 0-0, policy_version 670673 (0.00090) [2022-07-10 09:45:48,662][26022] Updated weights on worker 0-0, policy_version 670683 (0.00096) [2022-07-10 09:45:49,870][25689] Fps is (10 sec: 5603.9, 60 sec: 5560.8, 300 sec: 5571.6). Total num frames: 686785536. Throughput: 0: 5010.7. Samples: 686781710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:49,871][25689] Avg episode reward: [(0, '-7.674')] [2022-07-10 09:45:50,813][26022] Updated weights on worker 0-0, policy_version 670693 (0.00089) [2022-07-10 09:45:52,441][26022] Updated weights on worker 0-0, policy_version 670703 (0.00225) [2022-07-10 09:45:54,360][26022] Updated weights on worker 0-0, policy_version 670713 (0.00112) [2022-07-10 09:45:54,883][25689] Fps is (10 sec: 5702.1, 60 sec: 5563.5, 300 sec: 5568.3). Total num frames: 686813184. Throughput: 0: 5844.4. Samples: 686815286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:54,884][25689] Avg episode reward: [(0, '-8.707')] [2022-07-10 09:45:56,261][26022] Updated weights on worker 0-0, policy_version 670723 (0.00097) [2022-07-10 09:45:57,776][26022] Updated weights on worker 0-0, policy_version 670733 (0.00089) [2022-07-10 09:45:59,752][26022] Updated weights on worker 0-0, policy_version 670743 (0.00091) [2022-07-10 09:45:59,893][25689] Fps is (10 sec: 5516.9, 60 sec: 5547.7, 300 sec: 5571.7). Total num frames: 686840832. Throughput: 0: 5878.8. Samples: 686849144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:45:59,893][25689] Avg episode reward: [(0, '-8.448')] [2022-07-10 09:46:02,065][26022] Updated weights on worker 0-0, policy_version 670753 (0.00083) [2022-07-10 09:46:03,638][26022] Updated weights on worker 0-0, policy_version 670763 (0.00089) [2022-07-10 09:46:04,938][25689] Fps is (10 sec: 5397.7, 60 sec: 5544.5, 300 sec: 5569.4). Total num frames: 686867456. Throughput: 0: 4917.3. Samples: 686863858. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:04,939][25689] Avg episode reward: [(0, '-9.235')] [2022-07-10 09:46:05,711][26022] Updated weights on worker 0-0, policy_version 670773 (0.00091) [2022-07-10 09:46:07,155][26022] Updated weights on worker 0-0, policy_version 670783 (0.00100) [2022-07-10 09:46:09,499][26022] Updated weights on worker 0-0, policy_version 670793 (0.00088) [2022-07-10 09:46:10,054][25689] Fps is (10 sec: 5441.9, 60 sec: 5555.2, 300 sec: 5568.2). Total num frames: 686896128. Throughput: 0: 5708.7. Samples: 686896934. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:10,055][25689] Avg episode reward: [(0, '-9.258')] [2022-07-10 09:46:11,143][26022] Updated weights on worker 0-0, policy_version 670803 (0.00491) [2022-07-10 09:46:12,876][26022] Updated weights on worker 0-0, policy_version 670813 (0.00077) [2022-07-10 09:46:14,792][26022] Updated weights on worker 0-0, policy_version 670823 (0.00095) [2022-07-10 09:46:15,149][25689] Fps is (10 sec: 5415.5, 60 sec: 5552.0, 300 sec: 5560.7). Total num frames: 686922752. Throughput: 0: 5702.5. Samples: 686930850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:15,149][25689] Avg episode reward: [(0, '-6.592')] [2022-07-10 09:46:16,430][26022] Updated weights on worker 0-0, policy_version 670833 (0.00093) [2022-07-10 09:46:18,395][26022] Updated weights on worker 0-0, policy_version 670843 (0.00085) [2022-07-10 09:46:20,158][25689] Fps is (10 sec: 5472.8, 60 sec: 5535.0, 300 sec: 5564.3). Total num frames: 686951424. Throughput: 0: 4878.7. Samples: 686948010. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:20,158][25689] Avg episode reward: [(0, '-7.583')] [2022-07-10 09:46:20,363][26022] Updated weights on worker 0-0, policy_version 670853 (0.00087) [2022-07-10 09:46:21,277][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:46:21,294][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000670858_686958592.pth [2022-07-10 09:46:21,294][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000668898_684951552.pth [2022-07-10 09:46:21,986][26022] Updated weights on worker 0-0, policy_version 670863 (0.00085) [2022-07-10 09:46:24,077][26022] Updated weights on worker 0-0, policy_version 670873 (0.00085) [2022-07-10 09:46:25,173][25689] Fps is (10 sec: 5823.0, 60 sec: 5584.6, 300 sec: 5568.3). Total num frames: 686981120. Throughput: 0: 5822.7. Samples: 686981678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:25,174][25689] Avg episode reward: [(0, '-6.222')] [2022-07-10 09:46:25,619][26022] Updated weights on worker 0-0, policy_version 670883 (0.00090) [2022-07-10 09:46:27,507][26022] Updated weights on worker 0-0, policy_version 670893 (0.00089) [2022-07-10 09:46:29,447][26022] Updated weights on worker 0-0, policy_version 670903 (0.00091) [2022-07-10 09:46:30,236][25689] Fps is (10 sec: 5791.4, 60 sec: 5569.3, 300 sec: 5570.7). Total num frames: 687009792. Throughput: 0: 5860.1. Samples: 687015204. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:30,237][25689] Avg episode reward: [(0, '-4.010')] [2022-07-10 09:46:31,371][26022] Updated weights on worker 0-0, policy_version 670913 (0.00082) [2022-07-10 09:46:33,241][26022] Updated weights on worker 0-0, policy_version 670923 (0.00083) [2022-07-10 09:46:35,011][26022] Updated weights on worker 0-0, policy_version 670933 (0.00094) [2022-07-10 09:46:35,297][25689] Fps is (10 sec: 5461.7, 60 sec: 5531.8, 300 sec: 5559.5). Total num frames: 687036416. Throughput: 0: 5019.0. Samples: 687031972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:35,297][25689] Avg episode reward: [(0, '-5.644')] [2022-07-10 09:46:36,669][26022] Updated weights on worker 0-0, policy_version 670943 (0.00094) [2022-07-10 09:46:38,742][26022] Updated weights on worker 0-0, policy_version 670953 (0.00087) [2022-07-10 09:46:40,352][25689] Fps is (10 sec: 5364.8, 60 sec: 5531.7, 300 sec: 5562.5). Total num frames: 687064064. Throughput: 0: 5803.6. Samples: 687065212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:40,354][25689] Avg episode reward: [(0, '-5.504')] [2022-07-10 09:46:40,564][26022] Updated weights on worker 0-0, policy_version 670963 (0.00098) [2022-07-10 09:46:42,354][26022] Updated weights on worker 0-0, policy_version 670973 (0.00085) [2022-07-10 09:46:44,056][26022] Updated weights on worker 0-0, policy_version 670983 (0.00591) [2022-07-10 09:46:45,356][25689] Fps is (10 sec: 5700.6, 60 sec: 5582.7, 300 sec: 5563.6). Total num frames: 687093760. Throughput: 0: 5809.1. Samples: 687098926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:45,357][25689] Avg episode reward: [(0, '-4.563')] [2022-07-10 09:46:46,107][26022] Updated weights on worker 0-0, policy_version 670993 (0.00089) [2022-07-10 09:46:47,708][26022] Updated weights on worker 0-0, policy_version 671003 (0.00088) [2022-07-10 09:46:49,799][26022] Updated weights on worker 0-0, policy_version 671013 (0.00085) [2022-07-10 09:46:50,400][25689] Fps is (10 sec: 5707.3, 60 sec: 5548.9, 300 sec: 5564.2). Total num frames: 687121408. Throughput: 0: 4985.8. Samples: 687115736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:50,400][25689] Avg episode reward: [(0, '-4.732')] [2022-07-10 09:46:51,253][26022] Updated weights on worker 0-0, policy_version 671023 (0.00101) [2022-07-10 09:46:53,261][26022] Updated weights on worker 0-0, policy_version 671033 (0.00085) [2022-07-10 09:46:55,100][26022] Updated weights on worker 0-0, policy_version 671043 (0.00082) [2022-07-10 09:46:55,424][25689] Fps is (10 sec: 5593.9, 60 sec: 5564.8, 300 sec: 5567.9). Total num frames: 687150080. Throughput: 0: 5845.6. Samples: 687149628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:46:55,425][25689] Avg episode reward: [(0, '-3.872')] [2022-07-10 09:46:56,759][26022] Updated weights on worker 0-0, policy_version 671053 (0.00110) [2022-07-10 09:46:58,636][26022] Updated weights on worker 0-0, policy_version 671063 (0.00091) [2022-07-10 09:47:00,350][26022] Updated weights on worker 0-0, policy_version 671073 (0.00086) [2022-07-10 09:47:00,429][25689] Fps is (10 sec: 5717.6, 60 sec: 5582.1, 300 sec: 5574.8). Total num frames: 687178752. Throughput: 0: 5893.7. Samples: 687183538. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:00,430][25689] Avg episode reward: [(0, '-4.529')] [2022-07-10 09:47:02,666][26022] Updated weights on worker 0-0, policy_version 671083 (0.00078) [2022-07-10 09:47:04,434][26022] Updated weights on worker 0-0, policy_version 671093 (0.00083) [2022-07-10 09:47:05,449][25689] Fps is (10 sec: 5413.7, 60 sec: 5567.5, 300 sec: 5565.4). Total num frames: 687204352. Throughput: 0: 4956.4. Samples: 687198516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:05,450][25689] Avg episode reward: [(0, '-3.925')] [2022-07-10 09:47:06,404][26022] Updated weights on worker 0-0, policy_version 671103 (0.00090) [2022-07-10 09:47:08,031][26022] Updated weights on worker 0-0, policy_version 671113 (0.00080) [2022-07-10 09:47:10,028][26022] Updated weights on worker 0-0, policy_version 671123 (0.00089) [2022-07-10 09:47:10,538][25689] Fps is (10 sec: 5267.8, 60 sec: 5553.1, 300 sec: 5564.2). Total num frames: 687232000. Throughput: 0: 5774.0. Samples: 687232012. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:10,538][25689] Avg episode reward: [(0, '-1.942')] [2022-07-10 09:47:12,018][26022] Updated weights on worker 0-0, policy_version 671133 (0.00764) [2022-07-10 09:47:13,805][26022] Updated weights on worker 0-0, policy_version 671143 (0.00096) [2022-07-10 09:47:15,520][26022] Updated weights on worker 0-0, policy_version 671153 (0.00089) [2022-07-10 09:47:15,557][25689] Fps is (10 sec: 5571.7, 60 sec: 5593.9, 300 sec: 5564.1). Total num frames: 687260672. Throughput: 0: 5743.0. Samples: 687265254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:15,558][25689] Avg episode reward: [(0, '-2.910')] [2022-07-10 09:47:17,482][26022] Updated weights on worker 0-0, policy_version 671163 (0.00089) [2022-07-10 09:47:19,219][26022] Updated weights on worker 0-0, policy_version 671173 (0.00083) [2022-07-10 09:47:20,571][25689] Fps is (10 sec: 5613.0, 60 sec: 5576.5, 300 sec: 5564.3). Total num frames: 687288320. Throughput: 0: 4891.7. Samples: 687282066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:20,572][25689] Avg episode reward: [(0, '-2.783')] [2022-07-10 09:47:21,216][26022] Updated weights on worker 0-0, policy_version 671183 (0.00092) [2022-07-10 09:47:22,978][26022] Updated weights on worker 0-0, policy_version 671193 (0.00085) [2022-07-10 09:47:24,701][26022] Updated weights on worker 0-0, policy_version 671203 (0.00086) [2022-07-10 09:47:25,591][25689] Fps is (10 sec: 5613.3, 60 sec: 5559.1, 300 sec: 5562.6). Total num frames: 687316992. Throughput: 0: 5806.1. Samples: 687315460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:25,591][25689] Avg episode reward: [(0, '-4.436')] [2022-07-10 09:47:26,661][26022] Updated weights on worker 0-0, policy_version 671213 (0.00086) [2022-07-10 09:47:28,419][26022] Updated weights on worker 0-0, policy_version 671223 (0.00086) [2022-07-10 09:47:30,159][26022] Updated weights on worker 0-0, policy_version 671233 (0.00090) [2022-07-10 09:47:30,638][25689] Fps is (10 sec: 5594.6, 60 sec: 5543.7, 300 sec: 5559.0). Total num frames: 687344640. Throughput: 0: 5827.0. Samples: 687349138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:30,639][25689] Avg episode reward: [(0, '-4.621')] [2022-07-10 09:47:32,034][26022] Updated weights on worker 0-0, policy_version 671243 (0.00093) [2022-07-10 09:47:33,791][26022] Updated weights on worker 0-0, policy_version 671253 (0.00086) [2022-07-10 09:47:35,664][25689] Fps is (10 sec: 5387.9, 60 sec: 5546.9, 300 sec: 5558.7). Total num frames: 687371264. Throughput: 0: 5011.8. Samples: 687366022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:35,664][25689] Avg episode reward: [(0, '-4.746')] [2022-07-10 09:47:35,975][26022] Updated weights on worker 0-0, policy_version 671263 (0.00085) [2022-07-10 09:47:37,530][26022] Updated weights on worker 0-0, policy_version 671273 (0.00092) [2022-07-10 09:47:39,451][26022] Updated weights on worker 0-0, policy_version 671283 (0.00087) [2022-07-10 09:47:40,696][25689] Fps is (10 sec: 5497.8, 60 sec: 5566.0, 300 sec: 5556.6). Total num frames: 687399936. Throughput: 0: 5843.5. Samples: 687399664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:40,698][25689] Avg episode reward: [(0, '-5.764')] [2022-07-10 09:47:41,222][26022] Updated weights on worker 0-0, policy_version 671293 (0.00087) [2022-07-10 09:47:43,019][26022] Updated weights on worker 0-0, policy_version 671303 (0.00103) [2022-07-10 09:47:44,946][26022] Updated weights on worker 0-0, policy_version 671313 (0.00081) [2022-07-10 09:47:45,716][25689] Fps is (10 sec: 5806.2, 60 sec: 5564.5, 300 sec: 5567.3). Total num frames: 687429632. Throughput: 0: 5856.2. Samples: 687433320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:45,717][25689] Avg episode reward: [(0, '-6.273')] [2022-07-10 09:47:46,763][26022] Updated weights on worker 0-0, policy_version 671323 (0.00097) [2022-07-10 09:47:48,497][26022] Updated weights on worker 0-0, policy_version 671333 (0.00085) [2022-07-10 09:47:50,442][26022] Updated weights on worker 0-0, policy_version 671343 (0.00088) [2022-07-10 09:47:50,786][25689] Fps is (10 sec: 5683.1, 60 sec: 5562.0, 300 sec: 5560.5). Total num frames: 687457280. Throughput: 0: 5854.8. Samples: 687467102. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:50,787][25689] Avg episode reward: [(0, '-6.373')] [2022-07-10 09:47:52,155][26022] Updated weights on worker 0-0, policy_version 671353 (0.00088) [2022-07-10 09:47:53,989][26022] Updated weights on worker 0-0, policy_version 671363 (0.00086) [2022-07-10 09:47:55,701][26022] Updated weights on worker 0-0, policy_version 671373 (0.00100) [2022-07-10 09:47:55,809][25689] Fps is (10 sec: 5580.5, 60 sec: 5562.2, 300 sec: 5554.1). Total num frames: 687485952. Throughput: 0: 5869.0. Samples: 687484254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:47:55,809][25689] Avg episode reward: [(0, '-4.860')] [2022-07-10 09:47:57,648][26022] Updated weights on worker 0-0, policy_version 671383 (0.00086) [2022-07-10 09:47:59,431][26022] Updated weights on worker 0-0, policy_version 671393 (0.00089) [2022-07-10 09:48:00,822][25689] Fps is (10 sec: 5611.9, 60 sec: 5544.5, 300 sec: 5564.5). Total num frames: 687513600. Throughput: 0: 5873.6. Samples: 687517876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:00,823][25689] Avg episode reward: [(0, '-5.736')] [2022-07-10 09:48:01,168][26022] Updated weights on worker 0-0, policy_version 671403 (0.00090) [2022-07-10 09:48:03,463][26022] Updated weights on worker 0-0, policy_version 671413 (0.00085) [2022-07-10 09:48:05,361][26022] Updated weights on worker 0-0, policy_version 671423 (0.00086) [2022-07-10 09:48:05,846][25689] Fps is (10 sec: 5305.2, 60 sec: 5544.2, 300 sec: 5555.7). Total num frames: 687539200. Throughput: 0: 5758.5. Samples: 687549234. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:05,846][25689] Avg episode reward: [(0, '-4.895')] [2022-07-10 09:48:07,107][26022] Updated weights on worker 0-0, policy_version 671433 (0.00089) [2022-07-10 09:48:08,972][26022] Updated weights on worker 0-0, policy_version 671443 (0.00084) [2022-07-10 09:48:10,773][26022] Updated weights on worker 0-0, policy_version 671453 (0.00086) [2022-07-10 09:48:10,898][25689] Fps is (10 sec: 5386.1, 60 sec: 5564.4, 300 sec: 5562.0). Total num frames: 687567872. Throughput: 0: 4918.1. Samples: 687566016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:10,899][25689] Avg episode reward: [(0, '-4.686')] [2022-07-10 09:48:12,748][26022] Updated weights on worker 0-0, policy_version 671463 (0.00087) [2022-07-10 09:48:14,396][26022] Updated weights on worker 0-0, policy_version 671473 (0.00086) [2022-07-10 09:48:15,919][25689] Fps is (10 sec: 5591.1, 60 sec: 5547.4, 300 sec: 5556.1). Total num frames: 687595520. Throughput: 0: 5740.9. Samples: 687599704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:15,919][25689] Avg episode reward: [(0, '-4.029')] [2022-07-10 09:48:16,231][26022] Updated weights on worker 0-0, policy_version 671483 (0.00093) [2022-07-10 09:48:18,123][26022] Updated weights on worker 0-0, policy_version 671493 (0.00081) [2022-07-10 09:48:19,739][26022] Updated weights on worker 0-0, policy_version 671503 (0.00092) [2022-07-10 09:48:20,927][25689] Fps is (10 sec: 5513.8, 60 sec: 5547.9, 300 sec: 5553.7). Total num frames: 687623168. Throughput: 0: 5744.2. Samples: 687633362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:20,928][25689] Avg episode reward: [(0, '-3.168')] [2022-07-10 09:48:21,296][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:48:21,310][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000671510_687626240.pth [2022-07-10 09:48:21,318][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000669553_685622272.pth [2022-07-10 09:48:21,751][26022] Updated weights on worker 0-0, policy_version 671513 (0.00090) [2022-07-10 09:48:23,606][26022] Updated weights on worker 0-0, policy_version 671523 (0.00086) [2022-07-10 09:48:25,297][26022] Updated weights on worker 0-0, policy_version 671533 (0.00086) [2022-07-10 09:48:25,957][25689] Fps is (10 sec: 5712.5, 60 sec: 5563.9, 300 sec: 5560.9). Total num frames: 687652864. Throughput: 0: 5020.9. Samples: 687650212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:25,958][25689] Avg episode reward: [(0, '-2.984')] [2022-07-10 09:48:27,420][26022] Updated weights on worker 0-0, policy_version 671543 (0.00085) [2022-07-10 09:48:28,999][26022] Updated weights on worker 0-0, policy_version 671553 (0.00079) [2022-07-10 09:48:31,026][25689] Fps is (10 sec: 5678.3, 60 sec: 5561.9, 300 sec: 5556.3). Total num frames: 687680512. Throughput: 0: 5842.8. Samples: 687683616. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:31,026][25689] Avg episode reward: [(0, '-3.040')] [2022-07-10 09:48:31,032][26022] Updated weights on worker 0-0, policy_version 671563 (0.00088) [2022-07-10 09:48:32,819][26022] Updated weights on worker 0-0, policy_version 671573 (0.00096) [2022-07-10 09:48:34,624][26022] Updated weights on worker 0-0, policy_version 671583 (0.00093) [2022-07-10 09:48:36,117][25689] Fps is (10 sec: 5442.7, 60 sec: 5572.9, 300 sec: 5558.2). Total num frames: 687708160. Throughput: 0: 5810.0. Samples: 687717054. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:36,117][25689] Avg episode reward: [(0, '-3.042')] [2022-07-10 09:48:36,387][26022] Updated weights on worker 0-0, policy_version 671593 (0.00087) [2022-07-10 09:48:38,387][26022] Updated weights on worker 0-0, policy_version 671603 (0.00099) [2022-07-10 09:48:40,018][26022] Updated weights on worker 0-0, policy_version 671613 (0.00087) [2022-07-10 09:48:41,123][25689] Fps is (10 sec: 5578.0, 60 sec: 5575.3, 300 sec: 5554.6). Total num frames: 687736832. Throughput: 0: 4976.5. Samples: 687733866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:41,123][25689] Avg episode reward: [(0, '-3.208')] [2022-07-10 09:48:42,099][26022] Updated weights on worker 0-0, policy_version 671623 (0.00081) [2022-07-10 09:48:43,559][26022] Updated weights on worker 0-0, policy_version 671633 (0.00078) [2022-07-10 09:48:45,655][26022] Updated weights on worker 0-0, policy_version 671643 (0.00093) [2022-07-10 09:48:46,151][25689] Fps is (10 sec: 5612.7, 60 sec: 5540.7, 300 sec: 5559.6). Total num frames: 687764480. Throughput: 0: 5805.2. Samples: 687767442. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:46,152][25689] Avg episode reward: [(0, '-2.556')] [2022-07-10 09:48:47,495][26022] Updated weights on worker 0-0, policy_version 671653 (0.00090) [2022-07-10 09:48:49,143][26022] Updated weights on worker 0-0, policy_version 671663 (0.00089) [2022-07-10 09:48:51,247][25689] Fps is (10 sec: 5461.7, 60 sec: 5538.3, 300 sec: 5554.8). Total num frames: 687792128. Throughput: 0: 5792.0. Samples: 687800736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:51,248][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 09:48:51,313][26022] Updated weights on worker 0-0, policy_version 671673 (0.00096) [2022-07-10 09:48:52,983][26022] Updated weights on worker 0-0, policy_version 671683 (0.00093) [2022-07-10 09:48:54,927][26022] Updated weights on worker 0-0, policy_version 671693 (0.00081) [2022-07-10 09:48:56,268][25689] Fps is (10 sec: 5668.4, 60 sec: 5555.4, 300 sec: 5561.6). Total num frames: 687821824. Throughput: 0: 4981.5. Samples: 687817436. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:48:56,268][25689] Avg episode reward: [(0, '-2.777')] [2022-07-10 09:48:56,625][26022] Updated weights on worker 0-0, policy_version 671703 (0.00088) [2022-07-10 09:48:58,420][26022] Updated weights on worker 0-0, policy_version 671713 (0.00094) [2022-07-10 09:49:00,329][26022] Updated weights on worker 0-0, policy_version 671723 (0.00080) [2022-07-10 09:49:01,294][25689] Fps is (10 sec: 5707.6, 60 sec: 5554.2, 300 sec: 5564.9). Total num frames: 687849472. Throughput: 0: 5816.0. Samples: 687851182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:01,294][25689] Avg episode reward: [(0, '-2.985')] [2022-07-10 09:49:02,487][26022] Updated weights on worker 0-0, policy_version 671733 (0.00080) [2022-07-10 09:49:04,283][26022] Updated weights on worker 0-0, policy_version 671743 (0.00083) [2022-07-10 09:49:06,298][25689] Fps is (10 sec: 5308.7, 60 sec: 5556.0, 300 sec: 5555.2). Total num frames: 687875072. Throughput: 0: 5732.6. Samples: 687882936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:06,298][25689] Avg episode reward: [(0, '-3.059')] [2022-07-10 09:49:06,310][26022] Updated weights on worker 0-0, policy_version 671753 (0.00100) [2022-07-10 09:49:07,860][26022] Updated weights on worker 0-0, policy_version 671763 (0.00095) [2022-07-10 09:49:09,838][26022] Updated weights on worker 0-0, policy_version 671773 (0.00087) [2022-07-10 09:49:11,356][25689] Fps is (10 sec: 5393.6, 60 sec: 5555.5, 300 sec: 5561.7). Total num frames: 687903744. Throughput: 0: 4922.1. Samples: 687899716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:11,358][25689] Avg episode reward: [(0, '-2.782')] [2022-07-10 09:49:11,404][26022] Updated weights on worker 0-0, policy_version 671783 (0.00090) [2022-07-10 09:49:13,819][26022] Updated weights on worker 0-0, policy_version 671793 (0.00093) [2022-07-10 09:49:15,144][26022] Updated weights on worker 0-0, policy_version 671803 (0.00086) [2022-07-10 09:49:16,371][25689] Fps is (10 sec: 5489.2, 60 sec: 5539.0, 300 sec: 5554.7). Total num frames: 687930368. Throughput: 0: 5743.7. Samples: 687932908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:16,372][25689] Avg episode reward: [(0, '-3.732')] [2022-07-10 09:49:17,358][26022] Updated weights on worker 0-0, policy_version 671813 (0.00090) [2022-07-10 09:49:18,997][26022] Updated weights on worker 0-0, policy_version 671823 (0.00100) [2022-07-10 09:49:20,872][26022] Updated weights on worker 0-0, policy_version 671833 (0.00084) [2022-07-10 09:49:21,403][25689] Fps is (10 sec: 5503.9, 60 sec: 5553.8, 300 sec: 5561.4). Total num frames: 687959040. Throughput: 0: 5723.8. Samples: 687966282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:21,403][25689] Avg episode reward: [(0, '-3.586')] [2022-07-10 09:49:22,680][26022] Updated weights on worker 0-0, policy_version 671843 (0.00088) [2022-07-10 09:49:24,655][26022] Updated weights on worker 0-0, policy_version 671853 (0.00084) [2022-07-10 09:49:26,419][25689] Fps is (10 sec: 5605.5, 60 sec: 5521.2, 300 sec: 5556.0). Total num frames: 687986688. Throughput: 0: 4972.7. Samples: 687982992. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:26,419][25689] Avg episode reward: [(0, '-3.672')] [2022-07-10 09:49:26,425][26022] Updated weights on worker 0-0, policy_version 671863 (0.00094) [2022-07-10 09:49:28,465][26022] Updated weights on worker 0-0, policy_version 671873 (0.00088) [2022-07-10 09:49:30,123][26022] Updated weights on worker 0-0, policy_version 671883 (0.00087) [2022-07-10 09:49:31,531][25689] Fps is (10 sec: 5560.7, 60 sec: 5534.2, 300 sec: 5558.0). Total num frames: 688015360. Throughput: 0: 5781.7. Samples: 688016362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:31,532][25689] Avg episode reward: [(0, '-4.577')] [2022-07-10 09:49:31,994][26022] Updated weights on worker 0-0, policy_version 671893 (0.00059) [2022-07-10 09:49:33,735][26022] Updated weights on worker 0-0, policy_version 671903 (0.00086) [2022-07-10 09:49:35,687][26022] Updated weights on worker 0-0, policy_version 671913 (0.00083) [2022-07-10 09:49:36,567][25689] Fps is (10 sec: 5448.6, 60 sec: 5522.3, 300 sec: 5550.8). Total num frames: 688041984. Throughput: 0: 5802.0. Samples: 688050084. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:36,568][25689] Avg episode reward: [(0, '-5.726')] [2022-07-10 09:49:37,326][26022] Updated weights on worker 0-0, policy_version 671923 (0.00083) [2022-07-10 09:49:39,371][26022] Updated weights on worker 0-0, policy_version 671933 (0.00098) [2022-07-10 09:49:41,006][26022] Updated weights on worker 0-0, policy_version 671943 (0.00092) [2022-07-10 09:49:41,593][25689] Fps is (10 sec: 5597.1, 60 sec: 5537.4, 300 sec: 5561.0). Total num frames: 688071680. Throughput: 0: 4973.5. Samples: 688066700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:41,594][25689] Avg episode reward: [(0, '-7.302')] [2022-07-10 09:49:43,110][26022] Updated weights on worker 0-0, policy_version 671953 (0.00094) [2022-07-10 09:49:44,718][26022] Updated weights on worker 0-0, policy_version 671963 (0.00088) [2022-07-10 09:49:46,615][25689] Fps is (10 sec: 5809.0, 60 sec: 5554.9, 300 sec: 5555.2). Total num frames: 688100352. Throughput: 0: 5804.4. Samples: 688100220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:46,616][25689] Avg episode reward: [(0, '-6.017')] [2022-07-10 09:49:46,620][26022] Updated weights on worker 0-0, policy_version 671973 (0.00110) [2022-07-10 09:49:48,427][26022] Updated weights on worker 0-0, policy_version 671983 (0.00083) [2022-07-10 09:49:50,303][26022] Updated weights on worker 0-0, policy_version 671993 (0.00101) [2022-07-10 09:49:51,712][25689] Fps is (10 sec: 5566.2, 60 sec: 5554.8, 300 sec: 5554.2). Total num frames: 688128000. Throughput: 0: 5823.3. Samples: 688133880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:51,714][25689] Avg episode reward: [(0, '-6.889')] [2022-07-10 09:49:52,043][26022] Updated weights on worker 0-0, policy_version 672003 (0.00098) [2022-07-10 09:49:54,214][26022] Updated weights on worker 0-0, policy_version 672013 (0.00088) [2022-07-10 09:49:55,775][26022] Updated weights on worker 0-0, policy_version 672023 (0.00071) [2022-07-10 09:49:56,779][25689] Fps is (10 sec: 5440.4, 60 sec: 5516.7, 300 sec: 5549.9). Total num frames: 688155648. Throughput: 0: 4959.7. Samples: 688150328. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:49:56,780][25689] Avg episode reward: [(0, '-5.846')] [2022-07-10 09:49:57,792][26022] Updated weights on worker 0-0, policy_version 672033 (0.00090) [2022-07-10 09:49:59,692][26022] Updated weights on worker 0-0, policy_version 672043 (0.00092) [2022-07-10 09:50:01,336][26022] Updated weights on worker 0-0, policy_version 672053 (0.00081) [2022-07-10 09:50:01,807][25689] Fps is (10 sec: 5477.5, 60 sec: 5516.6, 300 sec: 5553.0). Total num frames: 688183296. Throughput: 0: 5799.1. Samples: 688183920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:01,807][25689] Avg episode reward: [(0, '-4.991')] [2022-07-10 09:50:03,603][26022] Updated weights on worker 0-0, policy_version 672063 (0.00090) [2022-07-10 09:50:05,472][26022] Updated weights on worker 0-0, policy_version 672073 (0.00094) [2022-07-10 09:50:06,839][25689] Fps is (10 sec: 5496.9, 60 sec: 5547.9, 300 sec: 5553.3). Total num frames: 688210944. Throughput: 0: 5706.1. Samples: 688215616. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:06,839][25689] Avg episode reward: [(0, '-5.907')] [2022-07-10 09:50:07,171][26022] Updated weights on worker 0-0, policy_version 672083 (0.00087) [2022-07-10 09:50:09,046][26022] Updated weights on worker 0-0, policy_version 672093 (0.00082) [2022-07-10 09:50:10,584][26022] Updated weights on worker 0-0, policy_version 672103 (0.00084) [2022-07-10 09:50:11,946][25689] Fps is (10 sec: 5453.6, 60 sec: 5526.5, 300 sec: 5555.8). Total num frames: 688238592. Throughput: 0: 4872.4. Samples: 688232472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:11,947][25689] Avg episode reward: [(0, '-5.832')] [2022-07-10 09:50:12,775][26022] Updated weights on worker 0-0, policy_version 672113 (0.00265) [2022-07-10 09:50:14,444][26022] Updated weights on worker 0-0, policy_version 672123 (0.00090) [2022-07-10 09:50:16,333][26022] Updated weights on worker 0-0, policy_version 672133 (0.00094) [2022-07-10 09:50:16,960][25689] Fps is (10 sec: 5564.3, 60 sec: 5560.4, 300 sec: 5552.3). Total num frames: 688267264. Throughput: 0: 5734.9. Samples: 688266066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:16,968][25689] Avg episode reward: [(0, '-5.392')] [2022-07-10 09:50:18,179][26022] Updated weights on worker 0-0, policy_version 672143 (0.00086) [2022-07-10 09:50:19,962][26022] Updated weights on worker 0-0, policy_version 672153 (0.00086) [2022-07-10 09:50:21,472][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:50:21,486][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000672161_688292864.pth [2022-07-10 09:50:21,486][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000670206_686290944.pth [2022-07-10 09:50:21,839][26022] Updated weights on worker 0-0, policy_version 672163 (0.00083) [2022-07-10 09:50:21,975][25689] Fps is (10 sec: 5718.2, 60 sec: 5561.9, 300 sec: 5558.9). Total num frames: 688295936. Throughput: 0: 5746.0. Samples: 688299804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:21,975][25689] Avg episode reward: [(0, '-5.184')] [2022-07-10 09:50:23,708][26022] Updated weights on worker 0-0, policy_version 672173 (0.00090) [2022-07-10 09:50:25,292][26022] Updated weights on worker 0-0, policy_version 672183 (0.00090) [2022-07-10 09:50:26,983][25689] Fps is (10 sec: 5517.3, 60 sec: 5545.7, 300 sec: 5550.0). Total num frames: 688322560. Throughput: 0: 5011.0. Samples: 688316558. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:26,983][25689] Avg episode reward: [(0, '-5.899')] [2022-07-10 09:50:27,462][26022] Updated weights on worker 0-0, policy_version 672193 (0.00092) [2022-07-10 09:50:29,143][26022] Updated weights on worker 0-0, policy_version 672203 (0.00088) [2022-07-10 09:50:31,136][26022] Updated weights on worker 0-0, policy_version 672213 (0.00098) [2022-07-10 09:50:32,040][25689] Fps is (10 sec: 5493.7, 60 sec: 5550.8, 300 sec: 5549.3). Total num frames: 688351232. Throughput: 0: 5834.9. Samples: 688349716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:32,041][25689] Avg episode reward: [(0, '-5.978')] [2022-07-10 09:50:32,816][26022] Updated weights on worker 0-0, policy_version 672223 (0.00088) [2022-07-10 09:50:34,759][26022] Updated weights on worker 0-0, policy_version 672233 (0.00063) [2022-07-10 09:50:36,457][26022] Updated weights on worker 0-0, policy_version 672243 (0.00093) [2022-07-10 09:50:37,047][25689] Fps is (10 sec: 5799.7, 60 sec: 5604.3, 300 sec: 5557.1). Total num frames: 688380928. Throughput: 0: 5839.2. Samples: 688383354. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:37,047][25689] Avg episode reward: [(0, '-5.563')] [2022-07-10 09:50:38,434][26022] Updated weights on worker 0-0, policy_version 672253 (0.00088) [2022-07-10 09:50:40,199][26022] Updated weights on worker 0-0, policy_version 672263 (0.00086) [2022-07-10 09:50:42,002][26022] Updated weights on worker 0-0, policy_version 672273 (0.00082) [2022-07-10 09:50:42,052][25689] Fps is (10 sec: 5624.9, 60 sec: 5555.4, 300 sec: 5557.1). Total num frames: 688407552. Throughput: 0: 5001.9. Samples: 688400232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:42,053][25689] Avg episode reward: [(0, '-4.349')] [2022-07-10 09:50:43,773][26022] Updated weights on worker 0-0, policy_version 672283 (0.00087) [2022-07-10 09:50:45,498][26022] Updated weights on worker 0-0, policy_version 672293 (0.00092) [2022-07-10 09:50:47,063][25689] Fps is (10 sec: 5418.3, 60 sec: 5539.4, 300 sec: 5550.8). Total num frames: 688435200. Throughput: 0: 5843.4. Samples: 688433896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:47,064][25689] Avg episode reward: [(0, '-4.522')] [2022-07-10 09:50:47,446][26022] Updated weights on worker 0-0, policy_version 672303 (0.00084) [2022-07-10 09:50:49,341][26022] Updated weights on worker 0-0, policy_version 672313 (0.00093) [2022-07-10 09:50:50,936][26022] Updated weights on worker 0-0, policy_version 672323 (0.00092) [2022-07-10 09:50:52,116][25689] Fps is (10 sec: 5494.8, 60 sec: 5543.5, 300 sec: 5550.1). Total num frames: 688462848. Throughput: 0: 5857.1. Samples: 688467302. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:52,117][25689] Avg episode reward: [(0, '-3.401')] [2022-07-10 09:50:53,031][26022] Updated weights on worker 0-0, policy_version 672333 (0.00090) [2022-07-10 09:50:54,763][26022] Updated weights on worker 0-0, policy_version 672343 (0.00088) [2022-07-10 09:50:56,539][26022] Updated weights on worker 0-0, policy_version 672353 (0.00086) [2022-07-10 09:50:57,121][25689] Fps is (10 sec: 5701.2, 60 sec: 5583.1, 300 sec: 5557.0). Total num frames: 688492544. Throughput: 0: 5873.7. Samples: 688501266. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:50:57,122][25689] Avg episode reward: [(0, '-3.479')] [2022-07-10 09:50:58,475][26022] Updated weights on worker 0-0, policy_version 672363 (0.00091) [2022-07-10 09:51:00,145][26022] Updated weights on worker 0-0, policy_version 672373 (0.00081) [2022-07-10 09:51:02,159][25689] Fps is (10 sec: 5403.7, 60 sec: 5531.2, 300 sec: 5550.3). Total num frames: 688517120. Throughput: 0: 5877.3. Samples: 688518406. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:02,160][25689] Avg episode reward: [(0, '-2.680')] [2022-07-10 09:51:02,510][26022] Updated weights on worker 0-0, policy_version 672383 (0.00093) [2022-07-10 09:51:04,134][26022] Updated weights on worker 0-0, policy_version 672393 (0.00340) [2022-07-10 09:51:06,010][26022] Updated weights on worker 0-0, policy_version 672403 (0.00086) [2022-07-10 09:51:07,165][25689] Fps is (10 sec: 5505.8, 60 sec: 5584.6, 300 sec: 5559.2). Total num frames: 688547840. Throughput: 0: 5780.2. Samples: 688550086. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:07,165][25689] Avg episode reward: [(0, '-4.827')] [2022-07-10 09:51:07,901][26022] Updated weights on worker 0-0, policy_version 672413 (0.00083) [2022-07-10 09:51:09,719][26022] Updated weights on worker 0-0, policy_version 672423 (0.00397) [2022-07-10 09:51:11,574][26022] Updated weights on worker 0-0, policy_version 672433 (0.00082) [2022-07-10 09:51:12,278][25689] Fps is (10 sec: 5667.2, 60 sec: 5567.1, 300 sec: 5558.9). Total num frames: 688574464. Throughput: 0: 5776.8. Samples: 688583774. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:12,279][25689] Avg episode reward: [(0, '-4.697')] [2022-07-10 09:51:13,402][26022] Updated weights on worker 0-0, policy_version 672443 (0.00089) [2022-07-10 09:51:15,252][26022] Updated weights on worker 0-0, policy_version 672453 (0.00085) [2022-07-10 09:51:17,099][26022] Updated weights on worker 0-0, policy_version 672463 (0.00086) [2022-07-10 09:51:17,291][25689] Fps is (10 sec: 5359.3, 60 sec: 5550.2, 300 sec: 5555.4). Total num frames: 688602112. Throughput: 0: 4921.3. Samples: 688600528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:17,292][25689] Avg episode reward: [(0, '-5.433')] [2022-07-10 09:51:18,738][26022] Updated weights on worker 0-0, policy_version 672473 (0.00087) [2022-07-10 09:51:20,667][26022] Updated weights on worker 0-0, policy_version 672483 (0.00096) [2022-07-10 09:51:22,324][25689] Fps is (10 sec: 5606.3, 60 sec: 5548.5, 300 sec: 5551.6). Total num frames: 688630784. Throughput: 0: 5730.5. Samples: 688633958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:22,324][25689] Avg episode reward: [(0, '-5.852')] [2022-07-10 09:51:22,564][26022] Updated weights on worker 0-0, policy_version 672493 (0.00086) [2022-07-10 09:51:24,341][26022] Updated weights on worker 0-0, policy_version 672503 (0.00085) [2022-07-10 09:51:26,216][26022] Updated weights on worker 0-0, policy_version 672513 (0.00095) [2022-07-10 09:51:27,333][25689] Fps is (10 sec: 5710.5, 60 sec: 5582.3, 300 sec: 5552.6). Total num frames: 688659456. Throughput: 0: 5839.8. Samples: 688667866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:27,335][25689] Avg episode reward: [(0, '-5.663')] [2022-07-10 09:51:28,042][26022] Updated weights on worker 0-0, policy_version 672523 (0.00086) [2022-07-10 09:51:29,754][26022] Updated weights on worker 0-0, policy_version 672533 (0.00082) [2022-07-10 09:51:31,707][26022] Updated weights on worker 0-0, policy_version 672543 (0.00086) [2022-07-10 09:51:32,430][25689] Fps is (10 sec: 5673.8, 60 sec: 5578.6, 300 sec: 5558.8). Total num frames: 688688128. Throughput: 0: 5005.5. Samples: 688684648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:32,431][25689] Avg episode reward: [(0, '-4.499')] [2022-07-10 09:51:33,556][26022] Updated weights on worker 0-0, policy_version 672553 (0.00085) [2022-07-10 09:51:35,367][26022] Updated weights on worker 0-0, policy_version 672563 (0.00101) [2022-07-10 09:51:37,185][26022] Updated weights on worker 0-0, policy_version 672573 (0.00090) [2022-07-10 09:51:37,472][25689] Fps is (10 sec: 5554.9, 60 sec: 5541.5, 300 sec: 5559.1). Total num frames: 688715776. Throughput: 0: 5830.6. Samples: 688718192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:37,473][25689] Avg episode reward: [(0, '-3.056')] [2022-07-10 09:51:38,938][26022] Updated weights on worker 0-0, policy_version 672583 (0.00085) [2022-07-10 09:51:40,855][26022] Updated weights on worker 0-0, policy_version 672593 (0.00380) [2022-07-10 09:51:42,510][25689] Fps is (10 sec: 5587.7, 60 sec: 5572.5, 300 sec: 5555.0). Total num frames: 688744448. Throughput: 0: 5869.1. Samples: 688752432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:42,510][25689] Avg episode reward: [(0, '-3.852')] [2022-07-10 09:51:42,606][26022] Updated weights on worker 0-0, policy_version 672603 (0.00081) [2022-07-10 09:51:44,385][26022] Updated weights on worker 0-0, policy_version 672613 (0.00099) [2022-07-10 09:51:46,289][26022] Updated weights on worker 0-0, policy_version 672623 (0.00080) [2022-07-10 09:51:47,523][25689] Fps is (10 sec: 5705.5, 60 sec: 5589.2, 300 sec: 5559.0). Total num frames: 688773120. Throughput: 0: 5023.4. Samples: 688769286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:47,523][25689] Avg episode reward: [(0, '-2.404')] [2022-07-10 09:51:48,076][26022] Updated weights on worker 0-0, policy_version 672633 (0.00094) [2022-07-10 09:51:49,969][26022] Updated weights on worker 0-0, policy_version 672643 (0.00092) [2022-07-10 09:51:51,663][26022] Updated weights on worker 0-0, policy_version 672653 (0.00086) [2022-07-10 09:51:52,567][25689] Fps is (10 sec: 5498.3, 60 sec: 5573.1, 300 sec: 5551.8). Total num frames: 688799744. Throughput: 0: 5858.3. Samples: 688802612. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:52,567][25689] Avg episode reward: [(0, '-2.399')] [2022-07-10 09:51:53,612][26022] Updated weights on worker 0-0, policy_version 672663 (0.00092) [2022-07-10 09:51:55,431][26022] Updated weights on worker 0-0, policy_version 672673 (0.00091) [2022-07-10 09:51:57,165][26022] Updated weights on worker 0-0, policy_version 672683 (0.00090) [2022-07-10 09:51:57,571][25689] Fps is (10 sec: 5706.9, 60 sec: 5590.1, 300 sec: 5558.7). Total num frames: 688830464. Throughput: 0: 5876.8. Samples: 688836308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 09:51:57,571][25689] Avg episode reward: [(0, '-3.886')] [2022-07-10 09:51:59,234][26022] Updated weights on worker 0-0, policy_version 672693 (0.00081) [2022-07-10 09:52:00,639][26022] Updated weights on worker 0-0, policy_version 672703 (0.00085) [2022-07-10 09:52:02,588][25689] Fps is (10 sec: 5518.0, 60 sec: 5592.1, 300 sec: 5555.3). Total num frames: 688855040. Throughput: 0: 5026.8. Samples: 688853360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:02,588][25689] Avg episode reward: [(0, '-4.530')] [2022-07-10 09:52:03,256][26022] Updated weights on worker 0-0, policy_version 672713 (0.00084) [2022-07-10 09:52:04,757][26022] Updated weights on worker 0-0, policy_version 672723 (0.01102) [2022-07-10 09:52:06,959][26022] Updated weights on worker 0-0, policy_version 672733 (0.00090) [2022-07-10 09:52:07,617][25689] Fps is (10 sec: 5198.4, 60 sec: 5539.1, 300 sec: 5556.4). Total num frames: 688882688. Throughput: 0: 5754.2. Samples: 688884912. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:07,617][25689] Avg episode reward: [(0, '-3.851')] [2022-07-10 09:52:08,597][26022] Updated weights on worker 0-0, policy_version 672743 (0.00086) [2022-07-10 09:52:10,413][26022] Updated weights on worker 0-0, policy_version 672753 (0.00091) [2022-07-10 09:52:12,168][26022] Updated weights on worker 0-0, policy_version 672763 (0.00082) [2022-07-10 09:52:12,669][25689] Fps is (10 sec: 5586.8, 60 sec: 5578.6, 300 sec: 5555.8). Total num frames: 688911360. Throughput: 0: 5771.5. Samples: 688918630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:12,669][25689] Avg episode reward: [(0, '-4.730')] [2022-07-10 09:52:13,899][26022] Updated weights on worker 0-0, policy_version 672773 (0.00090) [2022-07-10 09:52:15,863][26022] Updated weights on worker 0-0, policy_version 672783 (0.00093) [2022-07-10 09:52:17,505][26022] Updated weights on worker 0-0, policy_version 672793 (0.00085) [2022-07-10 09:52:17,745][25689] Fps is (10 sec: 5661.9, 60 sec: 5589.7, 300 sec: 5558.1). Total num frames: 688940032. Throughput: 0: 4917.5. Samples: 688935514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:17,745][25689] Avg episode reward: [(0, '-5.512')] [2022-07-10 09:52:19,542][26022] Updated weights on worker 0-0, policy_version 672803 (0.00083) [2022-07-10 09:52:21,350][26022] Updated weights on worker 0-0, policy_version 672813 (0.00087) [2022-07-10 09:52:21,642][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:52:21,655][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000672814_688961536.pth [2022-07-10 09:52:21,655][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000670858_686958592.pth [2022-07-10 09:52:22,837][25689] Fps is (10 sec: 5538.7, 60 sec: 5567.3, 300 sec: 5553.3). Total num frames: 688967680. Throughput: 0: 5721.3. Samples: 688969212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:22,837][25689] Avg episode reward: [(0, '-4.630')] [2022-07-10 09:52:23,133][26022] Updated weights on worker 0-0, policy_version 672823 (0.00466) [2022-07-10 09:52:25,076][26022] Updated weights on worker 0-0, policy_version 672833 (0.00086) [2022-07-10 09:52:26,808][26022] Updated weights on worker 0-0, policy_version 672843 (0.00089) [2022-07-10 09:52:27,919][25689] Fps is (10 sec: 5435.2, 60 sec: 5543.8, 300 sec: 5552.6). Total num frames: 688995328. Throughput: 0: 5800.6. Samples: 689002674. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:27,919][25689] Avg episode reward: [(0, '-4.580')] [2022-07-10 09:52:28,628][26022] Updated weights on worker 0-0, policy_version 672853 (0.00095) [2022-07-10 09:52:30,630][26022] Updated weights on worker 0-0, policy_version 672863 (0.00097) [2022-07-10 09:52:32,223][26022] Updated weights on worker 0-0, policy_version 672873 (0.00096) [2022-07-10 09:52:33,011][25689] Fps is (10 sec: 5737.1, 60 sec: 5578.1, 300 sec: 5565.2). Total num frames: 689026048. Throughput: 0: 4957.4. Samples: 689019474. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:33,011][25689] Avg episode reward: [(0, '-4.308')] [2022-07-10 09:52:34,223][26022] Updated weights on worker 0-0, policy_version 672883 (0.00086) [2022-07-10 09:52:35,914][26022] Updated weights on worker 0-0, policy_version 672893 (0.00084) [2022-07-10 09:52:37,831][26022] Updated weights on worker 0-0, policy_version 672903 (0.00084) [2022-07-10 09:52:38,016][25689] Fps is (10 sec: 5780.8, 60 sec: 5581.4, 300 sec: 5562.2). Total num frames: 689053696. Throughput: 0: 5814.2. Samples: 689053372. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:38,016][25689] Avg episode reward: [(0, '-3.637')] [2022-07-10 09:52:39,731][26022] Updated weights on worker 0-0, policy_version 672913 (0.00094) [2022-07-10 09:52:41,425][26022] Updated weights on worker 0-0, policy_version 672923 (0.00093) [2022-07-10 09:52:43,034][25689] Fps is (10 sec: 5516.7, 60 sec: 5566.3, 300 sec: 5555.4). Total num frames: 689081344. Throughput: 0: 5828.5. Samples: 689086932. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:43,035][25689] Avg episode reward: [(0, '-4.413')] [2022-07-10 09:52:43,343][26022] Updated weights on worker 0-0, policy_version 672933 (0.00082) [2022-07-10 09:52:45,001][26022] Updated weights on worker 0-0, policy_version 672943 (0.00086) [2022-07-10 09:52:47,371][26022] Updated weights on worker 0-0, policy_version 672953 (0.00051) [2022-07-10 09:52:48,075][25689] Fps is (10 sec: 5497.1, 60 sec: 5546.8, 300 sec: 5555.9). Total num frames: 689108992. Throughput: 0: 5013.0. Samples: 689103716. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:48,075][25689] Avg episode reward: [(0, '-2.135')] [2022-07-10 09:52:48,582][26022] Updated weights on worker 0-0, policy_version 672963 (0.01142) [2022-07-10 09:52:50,958][26022] Updated weights on worker 0-0, policy_version 672973 (0.00651) [2022-07-10 09:52:52,239][26022] Updated weights on worker 0-0, policy_version 672983 (0.00081) [2022-07-10 09:52:53,179][25689] Fps is (10 sec: 5551.8, 60 sec: 5575.1, 300 sec: 5554.4). Total num frames: 689137664. Throughput: 0: 5844.4. Samples: 689137344. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:53,179][25689] Avg episode reward: [(0, '-2.157')] [2022-07-10 09:52:54,474][26022] Updated weights on worker 0-0, policy_version 672993 (0.00094) [2022-07-10 09:52:55,843][26022] Updated weights on worker 0-0, policy_version 673003 (0.00098) [2022-07-10 09:52:57,990][26022] Updated weights on worker 0-0, policy_version 673013 (0.00082) [2022-07-10 09:52:58,187][25689] Fps is (10 sec: 5670.5, 60 sec: 5540.9, 300 sec: 5557.9). Total num frames: 689166336. Throughput: 0: 5824.5. Samples: 689170862. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:52:58,188][25689] Avg episode reward: [(0, '-3.257')] [2022-07-10 09:52:59,791][26022] Updated weights on worker 0-0, policy_version 673023 (0.00089) [2022-07-10 09:53:01,460][26022] Updated weights on worker 0-0, policy_version 673033 (0.00085) [2022-07-10 09:53:03,218][25689] Fps is (10 sec: 5508.1, 60 sec: 5573.5, 300 sec: 5561.2). Total num frames: 689192960. Throughput: 0: 4997.3. Samples: 689187796. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:03,220][25689] Avg episode reward: [(0, '-3.053')] [2022-07-10 09:53:03,816][26022] Updated weights on worker 0-0, policy_version 673043 (0.00090) [2022-07-10 09:53:05,606][26022] Updated weights on worker 0-0, policy_version 673053 (0.00053) [2022-07-10 09:53:07,469][26022] Updated weights on worker 0-0, policy_version 673063 (0.00085) [2022-07-10 09:53:08,232][25689] Fps is (10 sec: 5403.2, 60 sec: 5574.9, 300 sec: 5558.5). Total num frames: 689220608. Throughput: 0: 5731.7. Samples: 689219250. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:08,234][25689] Avg episode reward: [(0, '-3.879')] [2022-07-10 09:53:09,554][26022] Updated weights on worker 0-0, policy_version 673073 (0.00083) [2022-07-10 09:53:11,061][26022] Updated weights on worker 0-0, policy_version 673083 (0.00085) [2022-07-10 09:53:13,179][26022] Updated weights on worker 0-0, policy_version 673093 (0.00084) [2022-07-10 09:53:13,352][25689] Fps is (10 sec: 5355.4, 60 sec: 5534.8, 300 sec: 5553.2). Total num frames: 689247232. Throughput: 0: 5722.8. Samples: 689252790. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:13,352][25689] Avg episode reward: [(0, '-3.079')] [2022-07-10 09:53:14,697][26022] Updated weights on worker 0-0, policy_version 673103 (0.00088) [2022-07-10 09:53:16,707][26022] Updated weights on worker 0-0, policy_version 673113 (0.00091) [2022-07-10 09:53:18,445][25689] Fps is (10 sec: 5514.5, 60 sec: 5550.2, 300 sec: 5558.5). Total num frames: 689276928. Throughput: 0: 5697.9. Samples: 689286286. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:18,445][25689] Avg episode reward: [(0, '-3.993')] [2022-07-10 09:53:18,655][26022] Updated weights on worker 0-0, policy_version 673123 (0.00088) [2022-07-10 09:53:20,284][26022] Updated weights on worker 0-0, policy_version 673133 (0.00079) [2022-07-10 09:53:22,009][26022] Updated weights on worker 0-0, policy_version 673143 (0.00090) [2022-07-10 09:53:23,448][25689] Fps is (10 sec: 5781.3, 60 sec: 5575.2, 300 sec: 5555.6). Total num frames: 689305600. Throughput: 0: 5702.4. Samples: 689303156. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:23,448][25689] Avg episode reward: [(0, '-3.035')] [2022-07-10 09:53:23,888][26022] Updated weights on worker 0-0, policy_version 673153 (0.00089) [2022-07-10 09:53:25,803][26022] Updated weights on worker 0-0, policy_version 673163 (0.00096) [2022-07-10 09:53:27,830][26022] Updated weights on worker 0-0, policy_version 673173 (0.01132) [2022-07-10 09:53:28,542][25689] Fps is (10 sec: 5578.0, 60 sec: 5574.1, 300 sec: 5555.1). Total num frames: 689333248. Throughput: 0: 5780.8. Samples: 689336658. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:28,542][25689] Avg episode reward: [(0, '-2.862')] [2022-07-10 09:53:29,437][26022] Updated weights on worker 0-0, policy_version 673183 (0.00606) [2022-07-10 09:53:31,423][26022] Updated weights on worker 0-0, policy_version 673193 (0.00089) [2022-07-10 09:53:33,106][26022] Updated weights on worker 0-0, policy_version 673203 (0.00082) [2022-07-10 09:53:33,596][25689] Fps is (10 sec: 5549.7, 60 sec: 5543.8, 300 sec: 5559.2). Total num frames: 689361920. Throughput: 0: 5818.7. Samples: 689370586. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:33,596][25689] Avg episode reward: [(0, '-3.033')] [2022-07-10 09:53:35,125][26022] Updated weights on worker 0-0, policy_version 673213 (0.00086) [2022-07-10 09:53:36,621][26022] Updated weights on worker 0-0, policy_version 673223 (0.00083) [2022-07-10 09:53:38,608][25689] Fps is (10 sec: 5493.3, 60 sec: 5526.3, 300 sec: 5552.3). Total num frames: 689388544. Throughput: 0: 5025.4. Samples: 689387612. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:38,608][25689] Avg episode reward: [(0, '-2.733')] [2022-07-10 09:53:38,807][26022] Updated weights on worker 0-0, policy_version 673233 (0.00110) [2022-07-10 09:53:40,234][26022] Updated weights on worker 0-0, policy_version 673243 (0.00084) [2022-07-10 09:53:42,228][26022] Updated weights on worker 0-0, policy_version 673253 (0.00084) [2022-07-10 09:53:43,635][25689] Fps is (10 sec: 5712.2, 60 sec: 5576.2, 300 sec: 5562.6). Total num frames: 689419264. Throughput: 0: 5858.2. Samples: 689421416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:43,636][25689] Avg episode reward: [(0, '-3.939')] [2022-07-10 09:53:43,982][26022] Updated weights on worker 0-0, policy_version 673263 (0.00083) [2022-07-10 09:53:45,806][26022] Updated weights on worker 0-0, policy_version 673273 (0.00084) [2022-07-10 09:53:47,707][26022] Updated weights on worker 0-0, policy_version 673283 (0.00085) [2022-07-10 09:53:48,659][25689] Fps is (10 sec: 5807.3, 60 sec: 5577.7, 300 sec: 5564.0). Total num frames: 689446912. Throughput: 0: 5893.7. Samples: 689455222. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:48,659][25689] Avg episode reward: [(0, '-3.921')] [2022-07-10 09:53:49,443][26022] Updated weights on worker 0-0, policy_version 673293 (0.00083) [2022-07-10 09:53:51,384][26022] Updated weights on worker 0-0, policy_version 673303 (0.00070) [2022-07-10 09:53:53,390][26022] Updated weights on worker 0-0, policy_version 673313 (0.00095) [2022-07-10 09:53:53,767][25689] Fps is (10 sec: 5356.4, 60 sec: 5543.5, 300 sec: 5552.0). Total num frames: 689473536. Throughput: 0: 5012.8. Samples: 689471700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:53,769][25689] Avg episode reward: [(0, '-3.601')] [2022-07-10 09:53:55,063][26022] Updated weights on worker 0-0, policy_version 673323 (0.00085) [2022-07-10 09:53:57,047][26022] Updated weights on worker 0-0, policy_version 673333 (0.00084) [2022-07-10 09:53:58,670][26022] Updated weights on worker 0-0, policy_version 673343 (0.00089) [2022-07-10 09:53:58,786][25689] Fps is (10 sec: 5662.0, 60 sec: 5576.3, 300 sec: 5562.4). Total num frames: 689504256. Throughput: 0: 5826.7. Samples: 689505188. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:53:58,795][25689] Avg episode reward: [(0, '-3.631')] [2022-07-10 09:54:00,740][26022] Updated weights on worker 0-0, policy_version 673353 (0.00087) [2022-07-10 09:54:02,791][26022] Updated weights on worker 0-0, policy_version 673363 (0.00079) [2022-07-10 09:54:03,869][25689] Fps is (10 sec: 5575.2, 60 sec: 5554.6, 300 sec: 5561.0). Total num frames: 689529856. Throughput: 0: 5698.7. Samples: 689536724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:03,870][25689] Avg episode reward: [(0, '-3.247')] [2022-07-10 09:54:04,875][26022] Updated weights on worker 0-0, policy_version 673373 (0.00086) [2022-07-10 09:54:06,251][26022] Updated weights on worker 0-0, policy_version 673383 (0.00098) [2022-07-10 09:54:08,532][26022] Updated weights on worker 0-0, policy_version 673393 (0.00094) [2022-07-10 09:54:08,938][25689] Fps is (10 sec: 5144.3, 60 sec: 5532.7, 300 sec: 5553.9). Total num frames: 689556480. Throughput: 0: 4845.1. Samples: 689553478. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:08,940][25689] Avg episode reward: [(0, '-3.148')] [2022-07-10 09:54:09,824][26022] Updated weights on worker 0-0, policy_version 673403 (0.00087) [2022-07-10 09:54:12,017][26022] Updated weights on worker 0-0, policy_version 673413 (0.00086) [2022-07-10 09:54:13,773][26022] Updated weights on worker 0-0, policy_version 673423 (0.00092) [2022-07-10 09:54:14,041][25689] Fps is (10 sec: 5536.8, 60 sec: 5584.9, 300 sec: 5562.6). Total num frames: 689586176. Throughput: 0: 5687.4. Samples: 689587006. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:14,042][25689] Avg episode reward: [(0, '-2.784')] [2022-07-10 09:54:15,523][26022] Updated weights on worker 0-0, policy_version 673433 (0.00085) [2022-07-10 09:54:17,565][26022] Updated weights on worker 0-0, policy_version 673443 (0.00093) [2022-07-10 09:54:19,094][25689] Fps is (10 sec: 5747.5, 60 sec: 5571.7, 300 sec: 5562.2). Total num frames: 689614848. Throughput: 0: 5687.1. Samples: 689620678. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:19,095][25689] Avg episode reward: [(0, '-2.261')] [2022-07-10 09:54:19,181][26022] Updated weights on worker 0-0, policy_version 673453 (0.00089) [2022-07-10 09:54:21,142][26022] Updated weights on worker 0-0, policy_version 673463 (0.00086) [2022-07-10 09:54:21,740][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:54:21,754][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000673467_689630208.pth [2022-07-10 09:54:21,754][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000671510_687626240.pth [2022-07-10 09:54:23,059][26022] Updated weights on worker 0-0, policy_version 673473 (0.00086) [2022-07-10 09:54:24,128][25689] Fps is (10 sec: 5583.5, 60 sec: 5552.0, 300 sec: 5561.8). Total num frames: 689642496. Throughput: 0: 4971.7. Samples: 689637444. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:24,129][25689] Avg episode reward: [(0, '-2.976')] [2022-07-10 09:54:24,600][26022] Updated weights on worker 0-0, policy_version 673483 (0.00107) [2022-07-10 09:54:26,763][26022] Updated weights on worker 0-0, policy_version 673493 (0.00086) [2022-07-10 09:54:28,682][26022] Updated weights on worker 0-0, policy_version 673503 (0.00085) [2022-07-10 09:54:29,133][25689] Fps is (10 sec: 5609.7, 60 sec: 5577.0, 300 sec: 5563.8). Total num frames: 689671168. Throughput: 0: 5812.6. Samples: 689670864. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:29,134][25689] Avg episode reward: [(0, '-3.736')] [2022-07-10 09:54:30,299][26022] Updated weights on worker 0-0, policy_version 673513 (0.00098) [2022-07-10 09:54:32,171][26022] Updated weights on worker 0-0, policy_version 673523 (0.00094) [2022-07-10 09:54:34,044][26022] Updated weights on worker 0-0, policy_version 673533 (0.00095) [2022-07-10 09:54:34,204][25689] Fps is (10 sec: 5487.8, 60 sec: 5541.7, 300 sec: 5563.2). Total num frames: 689697792. Throughput: 0: 5819.9. Samples: 689704352. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:34,205][25689] Avg episode reward: [(0, '-4.400')] [2022-07-10 09:54:35,737][26022] Updated weights on worker 0-0, policy_version 673543 (0.00081) [2022-07-10 09:54:37,806][26022] Updated weights on worker 0-0, policy_version 673553 (0.00093) [2022-07-10 09:54:39,214][25689] Fps is (10 sec: 5485.3, 60 sec: 5575.6, 300 sec: 5560.0). Total num frames: 689726464. Throughput: 0: 4991.4. Samples: 689721108. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:39,215][25689] Avg episode reward: [(0, '-4.764')] [2022-07-10 09:54:39,439][26022] Updated weights on worker 0-0, policy_version 673563 (0.00089) [2022-07-10 09:54:41,564][26022] Updated weights on worker 0-0, policy_version 673573 (0.00239) [2022-07-10 09:54:43,057][26022] Updated weights on worker 0-0, policy_version 673583 (0.00092) [2022-07-10 09:54:44,220][25689] Fps is (10 sec: 5725.1, 60 sec: 5543.8, 300 sec: 5560.3). Total num frames: 689755136. Throughput: 0: 5836.6. Samples: 689754716. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:44,221][25689] Avg episode reward: [(0, '-4.887')] [2022-07-10 09:54:45,081][26022] Updated weights on worker 0-0, policy_version 673593 (0.00081) [2022-07-10 09:54:46,861][26022] Updated weights on worker 0-0, policy_version 673603 (0.00085) [2022-07-10 09:54:48,554][26022] Updated weights on worker 0-0, policy_version 673613 (0.00095) [2022-07-10 09:54:49,238][25689] Fps is (10 sec: 5618.9, 60 sec: 5544.4, 300 sec: 5561.8). Total num frames: 689782784. Throughput: 0: 5862.2. Samples: 689788718. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:49,238][25689] Avg episode reward: [(0, '-4.659')] [2022-07-10 09:54:50,557][26022] Updated weights on worker 0-0, policy_version 673623 (0.00084) [2022-07-10 09:54:52,445][26022] Updated weights on worker 0-0, policy_version 673633 (0.00093) [2022-07-10 09:54:54,140][26022] Updated weights on worker 0-0, policy_version 673643 (0.00082) [2022-07-10 09:54:54,345][25689] Fps is (10 sec: 5562.5, 60 sec: 5578.3, 300 sec: 5564.5). Total num frames: 689811456. Throughput: 0: 5010.5. Samples: 689805270. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:54,346][25689] Avg episode reward: [(0, '-5.843')] [2022-07-10 09:54:55,906][26022] Updated weights on worker 0-0, policy_version 673653 (0.00083) [2022-07-10 09:54:57,808][26022] Updated weights on worker 0-0, policy_version 673663 (0.00079) [2022-07-10 09:54:59,359][25689] Fps is (10 sec: 5665.5, 60 sec: 5544.9, 300 sec: 5568.2). Total num frames: 689840128. Throughput: 0: 5860.3. Samples: 689839164. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:54:59,361][25689] Avg episode reward: [(0, '-7.291')] [2022-07-10 09:54:59,556][26022] Updated weights on worker 0-0, policy_version 673673 (0.00085) [2022-07-10 09:55:01,388][26022] Updated weights on worker 0-0, policy_version 673683 (0.00091) [2022-07-10 09:55:03,419][26022] Updated weights on worker 0-0, policy_version 673693 (0.00086) [2022-07-10 09:55:04,396][25689] Fps is (10 sec: 5298.2, 60 sec: 5532.3, 300 sec: 5557.8). Total num frames: 689864704. Throughput: 0: 5763.5. Samples: 689870996. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:55:04,398][25689] Avg episode reward: [(0, '-7.893')] [2022-07-10 09:55:05,351][26022] Updated weights on worker 0-0, policy_version 673703 (0.00091) [2022-07-10 09:55:07,216][26022] Updated weights on worker 0-0, policy_version 673713 (0.00080) [2022-07-10 09:55:08,789][26022] Updated weights on worker 0-0, policy_version 673723 (0.00084) [2022-07-10 09:55:09,408][25689] Fps is (10 sec: 5401.0, 60 sec: 5588.3, 300 sec: 5566.5). Total num frames: 689894400. Throughput: 0: 4921.0. Samples: 689887976. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:55:09,409][25689] Avg episode reward: [(0, '-7.524')] [2022-07-10 09:55:11,002][26022] Updated weights on worker 0-0, policy_version 673733 (0.00090) [2022-07-10 09:55:12,579][26022] Updated weights on worker 0-0, policy_version 673743 (0.00095) [2022-07-10 09:55:14,540][25689] Fps is (10 sec: 5652.8, 60 sec: 5551.7, 300 sec: 5560.8). Total num frames: 689922048. Throughput: 0: 5751.4. Samples: 689921418. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 09:55:14,540][25689] Avg episode reward: [(0, '-7.760')] [2022-07-10 09:55:14,594][26022] Updated weights on worker 0-0, policy_version 673753 (0.00088) [2022-07-10 09:55:16,227][26022] Updated weights on worker 0-0, policy_version 673763 (0.00086) [2022-07-10 09:55:18,275][26022] Updated weights on worker 0-0, policy_version 673773 (0.00089) [2022-07-10 09:55:19,546][25689] Fps is (10 sec: 5555.3, 60 sec: 5556.0, 300 sec: 5561.0). Total num frames: 689950720. Throughput: 0: 5753.0. Samples: 689955298. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:19,546][25689] Avg episode reward: [(0, '-7.386')] [2022-07-10 09:55:20,100][26022] Updated weights on worker 0-0, policy_version 673783 (0.00087) [2022-07-10 09:55:21,773][26022] Updated weights on worker 0-0, policy_version 673793 (0.00091) [2022-07-10 09:55:23,661][26022] Updated weights on worker 0-0, policy_version 673803 (0.00088) [2022-07-10 09:55:24,571][25689] Fps is (10 sec: 5818.6, 60 sec: 5590.7, 300 sec: 5571.0). Total num frames: 689980416. Throughput: 0: 5018.5. Samples: 689972248. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:24,573][25689] Avg episode reward: [(0, '-6.085')] [2022-07-10 09:55:25,419][26022] Updated weights on worker 0-0, policy_version 673813 (0.00091) [2022-07-10 09:55:27,303][26022] Updated weights on worker 0-0, policy_version 673823 (0.00090) [2022-07-10 09:55:28,968][26022] Updated weights on worker 0-0, policy_version 673833 (0.00086) [2022-07-10 09:55:29,582][25689] Fps is (10 sec: 5611.7, 60 sec: 5556.3, 300 sec: 5565.0). Total num frames: 690007040. Throughput: 0: 5851.7. Samples: 690006030. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:29,584][25689] Avg episode reward: [(0, '-5.075')] [2022-07-10 09:55:30,935][26022] Updated weights on worker 0-0, policy_version 673843 (0.00443) [2022-07-10 09:55:32,954][26022] Updated weights on worker 0-0, policy_version 673853 (0.00104) [2022-07-10 09:55:34,364][26022] Updated weights on worker 0-0, policy_version 673863 (0.00090) [2022-07-10 09:55:34,638][25689] Fps is (10 sec: 5492.6, 60 sec: 5591.5, 300 sec: 5560.6). Total num frames: 690035712. Throughput: 0: 5883.2. Samples: 690039662. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:34,639][25689] Avg episode reward: [(0, '-3.331')] [2022-07-10 09:55:36,657][26022] Updated weights on worker 0-0, policy_version 673873 (0.00050) [2022-07-10 09:55:38,268][26022] Updated weights on worker 0-0, policy_version 673883 (0.00093) [2022-07-10 09:55:39,662][25689] Fps is (10 sec: 5587.5, 60 sec: 5573.4, 300 sec: 5563.7). Total num frames: 690063360. Throughput: 0: 5861.4. Samples: 690073206. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:39,662][25689] Avg episode reward: [(0, '-2.805')] [2022-07-10 09:55:40,026][26022] Updated weights on worker 0-0, policy_version 673893 (0.00094) [2022-07-10 09:55:42,144][26022] Updated weights on worker 0-0, policy_version 673903 (0.00086) [2022-07-10 09:55:43,602][26022] Updated weights on worker 0-0, policy_version 673913 (0.00090) [2022-07-10 09:55:44,681][25689] Fps is (10 sec: 5506.2, 60 sec: 5555.2, 300 sec: 5563.5). Total num frames: 690091008. Throughput: 0: 5845.3. Samples: 690089796. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:44,681][25689] Avg episode reward: [(0, '-1.227')] [2022-07-10 09:55:45,695][26022] Updated weights on worker 0-0, policy_version 673923 (0.00081) [2022-07-10 09:55:47,459][26022] Updated weights on worker 0-0, policy_version 673933 (0.00085) [2022-07-10 09:55:49,371][26022] Updated weights on worker 0-0, policy_version 673943 (0.00096) [2022-07-10 09:55:49,735][25689] Fps is (10 sec: 5692.8, 60 sec: 5585.7, 300 sec: 5570.4). Total num frames: 690120704. Throughput: 0: 5819.4. Samples: 690123306. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:49,735][25689] Avg episode reward: [(0, '-0.935')] [2022-07-10 09:55:51,164][26022] Updated weights on worker 0-0, policy_version 673953 (0.00101) [2022-07-10 09:55:52,741][26022] Updated weights on worker 0-0, policy_version 673963 (0.00088) [2022-07-10 09:55:54,807][25689] Fps is (10 sec: 5562.1, 60 sec: 5555.2, 300 sec: 5558.8). Total num frames: 690147328. Throughput: 0: 5801.3. Samples: 690156664. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:54,807][25689] Avg episode reward: [(0, '-1.743')] [2022-07-10 09:55:54,871][26022] Updated weights on worker 0-0, policy_version 673973 (0.00096) [2022-07-10 09:55:56,676][26022] Updated weights on worker 0-0, policy_version 673983 (0.00080) [2022-07-10 09:55:58,415][26022] Updated weights on worker 0-0, policy_version 673993 (0.00088) [2022-07-10 09:55:59,841][25689] Fps is (10 sec: 5471.3, 60 sec: 5553.2, 300 sec: 5572.7). Total num frames: 690176000. Throughput: 0: 4966.2. Samples: 690173422. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:55:59,842][25689] Avg episode reward: [(0, '-2.742')] [2022-07-10 09:56:00,519][26022] Updated weights on worker 0-0, policy_version 674003 (0.00088) [2022-07-10 09:56:02,460][26022] Updated weights on worker 0-0, policy_version 674013 (0.00087) [2022-07-10 09:56:04,534][26022] Updated weights on worker 0-0, policy_version 674023 (0.00087) [2022-07-10 09:56:04,880][25689] Fps is (10 sec: 5387.7, 60 sec: 5570.0, 300 sec: 5554.8). Total num frames: 690201600. Throughput: 0: 5693.8. Samples: 690204804. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:04,880][25689] Avg episode reward: [(0, '-2.425')] [2022-07-10 09:56:06,069][26022] Updated weights on worker 0-0, policy_version 674033 (0.00089) [2022-07-10 09:56:08,148][26022] Updated weights on worker 0-0, policy_version 674043 (0.00086) [2022-07-10 09:56:09,926][25689] Fps is (10 sec: 5280.2, 60 sec: 5533.0, 300 sec: 5559.5). Total num frames: 690229248. Throughput: 0: 5689.3. Samples: 690238180. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:09,926][25689] Avg episode reward: [(0, '-3.469')] [2022-07-10 09:56:09,976][26022] Updated weights on worker 0-0, policy_version 674053 (0.00084) [2022-07-10 09:56:11,749][26022] Updated weights on worker 0-0, policy_version 674063 (0.00078) [2022-07-10 09:56:13,431][26022] Updated weights on worker 0-0, policy_version 674073 (0.00090) [2022-07-10 09:56:15,011][25689] Fps is (10 sec: 5761.3, 60 sec: 5588.1, 300 sec: 5568.5). Total num frames: 690259968. Throughput: 0: 4863.7. Samples: 690254932. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:15,011][25689] Avg episode reward: [(0, '-3.373')] [2022-07-10 09:56:15,238][26022] Updated weights on worker 0-0, policy_version 674083 (0.00093) [2022-07-10 09:56:17,225][26022] Updated weights on worker 0-0, policy_version 674093 (0.00087) [2022-07-10 09:56:18,844][26022] Updated weights on worker 0-0, policy_version 674103 (0.00092) [2022-07-10 09:56:20,049][25689] Fps is (10 sec: 5664.7, 60 sec: 5551.3, 300 sec: 5561.5). Total num frames: 690286592. Throughput: 0: 5719.1. Samples: 690288990. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:20,051][25689] Avg episode reward: [(0, '-4.900')] [2022-07-10 09:56:20,887][26022] Updated weights on worker 0-0, policy_version 674113 (0.00089) [2022-07-10 09:56:21,869][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:56:21,884][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000674118_690296832.pth [2022-07-10 09:56:21,884][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000672161_688292864.pth [2022-07-10 09:56:22,598][26022] Updated weights on worker 0-0, policy_version 674123 (0.00081) [2022-07-10 09:56:24,336][26022] Updated weights on worker 0-0, policy_version 674133 (0.00096) [2022-07-10 09:56:25,077][25689] Fps is (10 sec: 5391.6, 60 sec: 5517.2, 300 sec: 5557.7). Total num frames: 690314240. Throughput: 0: 5833.8. Samples: 690322630. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:25,079][25689] Avg episode reward: [(0, '-5.634')] [2022-07-10 09:56:26,537][26022] Updated weights on worker 0-0, policy_version 674143 (0.00089) [2022-07-10 09:56:28,266][26022] Updated weights on worker 0-0, policy_version 674153 (0.00087) [2022-07-10 09:56:29,941][26022] Updated weights on worker 0-0, policy_version 674163 (0.00090) [2022-07-10 09:56:30,108][25689] Fps is (10 sec: 5700.8, 60 sec: 5566.2, 300 sec: 5562.4). Total num frames: 690343936. Throughput: 0: 5012.1. Samples: 690339334. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:30,109][25689] Avg episode reward: [(0, '-5.399')] [2022-07-10 09:56:32,031][26022] Updated weights on worker 0-0, policy_version 674173 (0.00082) [2022-07-10 09:56:33,597][26022] Updated weights on worker 0-0, policy_version 674183 (0.00080) [2022-07-10 09:56:35,238][25689] Fps is (10 sec: 5542.5, 60 sec: 5525.5, 300 sec: 5557.3). Total num frames: 690370560. Throughput: 0: 5820.4. Samples: 690372664. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:35,239][25689] Avg episode reward: [(0, '-4.330')] [2022-07-10 09:56:35,718][26022] Updated weights on worker 0-0, policy_version 674193 (0.00090) [2022-07-10 09:56:37,284][26022] Updated weights on worker 0-0, policy_version 674203 (0.00088) [2022-07-10 09:56:39,188][26022] Updated weights on worker 0-0, policy_version 674213 (0.00094) [2022-07-10 09:56:40,244][25689] Fps is (10 sec: 5455.5, 60 sec: 5544.1, 300 sec: 5557.9). Total num frames: 690399232. Throughput: 0: 5798.8. Samples: 690406096. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:40,245][25689] Avg episode reward: [(0, '-5.235')] [2022-07-10 09:56:41,079][26022] Updated weights on worker 0-0, policy_version 674223 (0.00082) [2022-07-10 09:56:42,907][26022] Updated weights on worker 0-0, policy_version 674233 (0.00083) [2022-07-10 09:56:44,647][26022] Updated weights on worker 0-0, policy_version 674243 (0.00088) [2022-07-10 09:56:45,314][25689] Fps is (10 sec: 5691.5, 60 sec: 5556.3, 300 sec: 5556.9). Total num frames: 690427904. Throughput: 0: 4963.6. Samples: 690423078. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:45,314][25689] Avg episode reward: [(0, '-5.152')] [2022-07-10 09:56:46,626][26022] Updated weights on worker 0-0, policy_version 674253 (0.00094) [2022-07-10 09:56:48,349][26022] Updated weights on worker 0-0, policy_version 674263 (0.00088) [2022-07-10 09:56:50,191][26022] Updated weights on worker 0-0, policy_version 674273 (0.00087) [2022-07-10 09:56:50,330][25689] Fps is (10 sec: 5685.4, 60 sec: 5542.9, 300 sec: 5564.3). Total num frames: 690456576. Throughput: 0: 5801.1. Samples: 690456644. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:50,339][25689] Avg episode reward: [(0, '-4.639')] [2022-07-10 09:56:52,010][26022] Updated weights on worker 0-0, policy_version 674283 (0.00096) [2022-07-10 09:56:53,910][26022] Updated weights on worker 0-0, policy_version 674293 (0.00085) [2022-07-10 09:56:55,403][25689] Fps is (10 sec: 5581.9, 60 sec: 5559.6, 300 sec: 5552.6). Total num frames: 690484224. Throughput: 0: 5825.9. Samples: 690490144. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:56:55,404][25689] Avg episode reward: [(0, '-4.099')] [2022-07-10 09:56:56,056][26022] Updated weights on worker 0-0, policy_version 674303 (0.00080) [2022-07-10 09:56:57,541][26022] Updated weights on worker 0-0, policy_version 674313 (0.00088) [2022-07-10 09:56:59,300][26022] Updated weights on worker 0-0, policy_version 674323 (0.00092) [2022-07-10 09:57:00,423][25689] Fps is (10 sec: 5580.2, 60 sec: 5561.1, 300 sec: 5566.4). Total num frames: 690512896. Throughput: 0: 4996.3. Samples: 690506916. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:00,423][25689] Avg episode reward: [(0, '-3.655')] [2022-07-10 09:57:01,205][26022] Updated weights on worker 0-0, policy_version 674333 (0.00085) [2022-07-10 09:57:03,485][26022] Updated weights on worker 0-0, policy_version 674343 (0.00082) [2022-07-10 09:57:05,348][26022] Updated weights on worker 0-0, policy_version 674353 (0.00091) [2022-07-10 09:57:05,427][25689] Fps is (10 sec: 5312.1, 60 sec: 5547.2, 300 sec: 5556.5). Total num frames: 690537472. Throughput: 0: 5754.1. Samples: 690538814. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:05,428][25689] Avg episode reward: [(0, '-3.014')] [2022-07-10 09:57:07,073][26022] Updated weights on worker 0-0, policy_version 674363 (0.00085) [2022-07-10 09:57:09,001][26022] Updated weights on worker 0-0, policy_version 674373 (0.00089) [2022-07-10 09:57:10,460][25689] Fps is (10 sec: 5304.8, 60 sec: 5565.4, 300 sec: 5556.8). Total num frames: 690566144. Throughput: 0: 5756.8. Samples: 690572532. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:10,461][25689] Avg episode reward: [(0, '-1.206')] [2022-07-10 09:57:10,715][26022] Updated weights on worker 0-0, policy_version 674383 (0.00094) [2022-07-10 09:57:12,521][26022] Updated weights on worker 0-0, policy_version 674393 (0.00085) [2022-07-10 09:57:14,499][26022] Updated weights on worker 0-0, policy_version 674403 (0.00090) [2022-07-10 09:57:15,535][25689] Fps is (10 sec: 5673.5, 60 sec: 5532.5, 300 sec: 5556.9). Total num frames: 690594816. Throughput: 0: 4924.5. Samples: 690589280. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:15,535][25689] Avg episode reward: [(0, '-2.296')] [2022-07-10 09:57:16,226][26022] Updated weights on worker 0-0, policy_version 674413 (0.00095) [2022-07-10 09:57:17,931][26022] Updated weights on worker 0-0, policy_version 674423 (0.00091) [2022-07-10 09:57:20,011][26022] Updated weights on worker 0-0, policy_version 674433 (0.00089) [2022-07-10 09:57:20,549][25689] Fps is (10 sec: 5683.8, 60 sec: 5568.5, 300 sec: 5561.8). Total num frames: 690623488. Throughput: 0: 5770.4. Samples: 690623054. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:20,550][25689] Avg episode reward: [(0, '-3.169')] [2022-07-10 09:57:21,648][26022] Updated weights on worker 0-0, policy_version 674443 (0.00083) [2022-07-10 09:57:23,750][26022] Updated weights on worker 0-0, policy_version 674453 (0.00098) [2022-07-10 09:57:25,174][26022] Updated weights on worker 0-0, policy_version 674463 (0.00090) [2022-07-10 09:57:25,565][25689] Fps is (10 sec: 5614.9, 60 sec: 5569.6, 300 sec: 5563.0). Total num frames: 690651136. Throughput: 0: 5853.8. Samples: 690656696. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:25,566][25689] Avg episode reward: [(0, '-2.871')] [2022-07-10 09:57:27,200][26022] Updated weights on worker 0-0, policy_version 674473 (0.00087) [2022-07-10 09:57:28,995][26022] Updated weights on worker 0-0, policy_version 674483 (0.00087) [2022-07-10 09:57:30,576][25689] Fps is (10 sec: 5616.9, 60 sec: 5554.5, 300 sec: 5557.6). Total num frames: 690679808. Throughput: 0: 5016.5. Samples: 690673442. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:30,577][25689] Avg episode reward: [(0, '-2.549')] [2022-07-10 09:57:30,783][26022] Updated weights on worker 0-0, policy_version 674493 (0.00089) [2022-07-10 09:57:32,726][26022] Updated weights on worker 0-0, policy_version 674503 (0.00102) [2022-07-10 09:57:34,575][26022] Updated weights on worker 0-0, policy_version 674513 (0.00392) [2022-07-10 09:57:35,632][25689] Fps is (10 sec: 5594.6, 60 sec: 5578.3, 300 sec: 5556.7). Total num frames: 690707456. Throughput: 0: 5869.6. Samples: 690707244. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:35,633][25689] Avg episode reward: [(0, '-2.537')] [2022-07-10 09:57:36,098][26022] Updated weights on worker 0-0, policy_version 674523 (0.00082) [2022-07-10 09:57:38,095][26022] Updated weights on worker 0-0, policy_version 674533 (0.00098) [2022-07-10 09:57:39,735][26022] Updated weights on worker 0-0, policy_version 674543 (0.00093) [2022-07-10 09:57:40,637][25689] Fps is (10 sec: 5495.9, 60 sec: 5561.4, 300 sec: 5556.9). Total num frames: 690735104. Throughput: 0: 5875.4. Samples: 690741080. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:40,638][25689] Avg episode reward: [(0, '-3.065')] [2022-07-10 09:57:41,704][26022] Updated weights on worker 0-0, policy_version 674553 (0.00092) [2022-07-10 09:57:43,708][26022] Updated weights on worker 0-0, policy_version 674563 (0.00089) [2022-07-10 09:57:45,273][26022] Updated weights on worker 0-0, policy_version 674573 (0.00086) [2022-07-10 09:57:45,644][25689] Fps is (10 sec: 5625.5, 60 sec: 5567.3, 300 sec: 5561.0). Total num frames: 690763776. Throughput: 0: 5049.2. Samples: 690758076. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:45,644][25689] Avg episode reward: [(0, '-3.726')] [2022-07-10 09:57:47,293][26022] Updated weights on worker 0-0, policy_version 674583 (0.00086) [2022-07-10 09:57:49,168][26022] Updated weights on worker 0-0, policy_version 674593 (0.00087) [2022-07-10 09:57:50,645][25689] Fps is (10 sec: 5627.5, 60 sec: 5551.6, 300 sec: 5559.5). Total num frames: 690791424. Throughput: 0: 5903.0. Samples: 690791912. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:50,646][25689] Avg episode reward: [(0, '-4.469')] [2022-07-10 09:57:50,717][26022] Updated weights on worker 0-0, policy_version 674603 (0.00118) [2022-07-10 09:57:52,787][26022] Updated weights on worker 0-0, policy_version 674613 (0.00093) [2022-07-10 09:57:54,332][26022] Updated weights on worker 0-0, policy_version 674623 (0.00091) [2022-07-10 09:57:55,699][25689] Fps is (10 sec: 5498.9, 60 sec: 5553.4, 300 sec: 5555.2). Total num frames: 690819072. Throughput: 0: 5881.8. Samples: 690825276. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:57:55,700][25689] Avg episode reward: [(0, '-4.803')] [2022-07-10 09:57:56,341][26022] Updated weights on worker 0-0, policy_version 674633 (0.00102) [2022-07-10 09:57:58,067][26022] Updated weights on worker 0-0, policy_version 674643 (0.00094) [2022-07-10 09:57:59,966][26022] Updated weights on worker 0-0, policy_version 674653 (0.00098) [2022-07-10 09:58:00,716][25689] Fps is (10 sec: 5592.6, 60 sec: 5553.7, 300 sec: 5562.3). Total num frames: 690847744. Throughput: 0: 5042.7. Samples: 690842330. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:00,718][25689] Avg episode reward: [(0, '-4.746')] [2022-07-10 09:58:01,787][26022] Updated weights on worker 0-0, policy_version 674663 (0.00090) [2022-07-10 09:58:04,036][26022] Updated weights on worker 0-0, policy_version 674673 (0.00086) [2022-07-10 09:58:05,724][25689] Fps is (10 sec: 5516.2, 60 sec: 5587.3, 300 sec: 5559.0). Total num frames: 690874368. Throughput: 0: 5763.5. Samples: 690873808. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:05,726][25689] Avg episode reward: [(0, '-4.078')] [2022-07-10 09:58:05,768][26022] Updated weights on worker 0-0, policy_version 674683 (0.00085) [2022-07-10 09:58:07,787][26022] Updated weights on worker 0-0, policy_version 674693 (0.00086) [2022-07-10 09:58:09,251][26022] Updated weights on worker 0-0, policy_version 674703 (0.00094) [2022-07-10 09:58:10,735][25689] Fps is (10 sec: 5314.4, 60 sec: 5555.3, 300 sec: 5561.0). Total num frames: 690900992. Throughput: 0: 5754.8. Samples: 690907526. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:10,737][25689] Avg episode reward: [(0, '-4.873')] [2022-07-10 09:58:11,462][26022] Updated weights on worker 0-0, policy_version 674713 (0.00094) [2022-07-10 09:58:13,105][26022] Updated weights on worker 0-0, policy_version 674723 (0.00093) [2022-07-10 09:58:15,012][26022] Updated weights on worker 0-0, policy_version 674733 (0.00089) [2022-07-10 09:58:15,867][25689] Fps is (10 sec: 5552.6, 60 sec: 5567.1, 300 sec: 5560.3). Total num frames: 690930688. Throughput: 0: 4895.5. Samples: 690924006. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:15,867][25689] Avg episode reward: [(0, '-3.965')] [2022-07-10 09:58:16,904][26022] Updated weights on worker 0-0, policy_version 674743 (0.00093) [2022-07-10 09:58:18,675][26022] Updated weights on worker 0-0, policy_version 674753 (0.00190) [2022-07-10 09:58:20,487][26022] Updated weights on worker 0-0, policy_version 674763 (0.00097) [2022-07-10 09:58:20,930][25689] Fps is (10 sec: 5725.4, 60 sec: 5562.6, 300 sec: 5559.2). Total num frames: 690959360. Throughput: 0: 5705.2. Samples: 690957656. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:20,931][25689] Avg episode reward: [(0, '-1.594')] [2022-07-10 09:58:21,950][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 09:58:21,969][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000674772_690966528.pth [2022-07-10 09:58:21,970][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000672814_688961536.pth [2022-07-10 09:58:22,347][26022] Updated weights on worker 0-0, policy_version 674773 (0.00093) [2022-07-10 09:58:24,063][26022] Updated weights on worker 0-0, policy_version 674783 (0.00086) [2022-07-10 09:58:25,963][25689] Fps is (10 sec: 5578.6, 60 sec: 5561.0, 300 sec: 5560.3). Total num frames: 690987008. Throughput: 0: 5819.7. Samples: 690991592. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:25,963][25689] Avg episode reward: [(0, '-1.451')] [2022-07-10 09:58:26,107][26022] Updated weights on worker 0-0, policy_version 674793 (0.00095) [2022-07-10 09:58:27,662][26022] Updated weights on worker 0-0, policy_version 674803 (0.00088) [2022-07-10 09:58:29,812][26022] Updated weights on worker 0-0, policy_version 674813 (0.00091) [2022-07-10 09:58:30,992][25689] Fps is (10 sec: 5597.0, 60 sec: 5559.3, 300 sec: 5560.8). Total num frames: 691015680. Throughput: 0: 4961.3. Samples: 691008028. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 09:58:30,993][25689] Avg episode reward: [(0, '-2.330')] [2022-07-10 09:58:31,438][26022] Updated weights on worker 0-0, policy_version 674823 (0.00089) [2022-07-10 09:58:33,437][26022] Updated weights on worker 0-0, policy_version 674833 (0.00089) [2022-07-10 09:58:35,130][26022] Updated weights on worker 0-0, policy_version 674843 (0.00097) [2022-07-10 09:58:36,063][25689] Fps is (10 sec: 5474.7, 60 sec: 5541.0, 300 sec: 5559.7). Total num frames: 691042304. Throughput: 0: 5814.8. Samples: 691041442. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:58:36,064][25689] Avg episode reward: [(0, '-2.710')] [2022-07-10 09:58:37,331][26022] Updated weights on worker 0-0, policy_version 674853 (0.00089) [2022-07-10 09:58:38,896][26022] Updated weights on worker 0-0, policy_version 674863 (0.00096) [2022-07-10 09:58:41,017][26022] Updated weights on worker 0-0, policy_version 674873 (0.00085) [2022-07-10 09:58:41,147][25689] Fps is (10 sec: 5344.7, 60 sec: 5533.8, 300 sec: 5548.3). Total num frames: 691069952. Throughput: 0: 5788.8. Samples: 691074688. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:58:41,148][25689] Avg episode reward: [(0, '-2.194')] [2022-07-10 09:58:42,494][26022] Updated weights on worker 0-0, policy_version 674883 (0.00088) [2022-07-10 09:58:44,580][26022] Updated weights on worker 0-0, policy_version 674893 (0.00598) [2022-07-10 09:58:46,148][26022] Updated weights on worker 0-0, policy_version 674903 (0.00088) [2022-07-10 09:58:46,162][25689] Fps is (10 sec: 5779.5, 60 sec: 5566.8, 300 sec: 5558.8). Total num frames: 691100672. Throughput: 0: 5785.9. Samples: 691108464. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:58:46,163][25689] Avg episode reward: [(0, '-2.838')] [2022-07-10 09:58:48,131][26022] Updated weights on worker 0-0, policy_version 674913 (0.00087) [2022-07-10 09:58:50,022][26022] Updated weights on worker 0-0, policy_version 674923 (0.00093) [2022-07-10 09:58:51,180][25689] Fps is (10 sec: 5511.4, 60 sec: 5514.6, 300 sec: 5553.6). Total num frames: 691125248. Throughput: 0: 5811.9. Samples: 691125356. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:58:51,180][25689] Avg episode reward: [(0, '-3.530')] [2022-07-10 09:58:51,787][26022] Updated weights on worker 0-0, policy_version 674933 (0.00099) [2022-07-10 09:58:53,742][26022] Updated weights on worker 0-0, policy_version 674943 (0.00088) [2022-07-10 09:58:55,539][26022] Updated weights on worker 0-0, policy_version 674953 (0.00102) [2022-07-10 09:58:56,337][25689] Fps is (10 sec: 5434.5, 60 sec: 5555.9, 300 sec: 5551.0). Total num frames: 691155968. Throughput: 0: 5768.2. Samples: 691158388. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:58:56,339][25689] Avg episode reward: [(0, '-3.909')] [2022-07-10 09:58:57,428][26022] Updated weights on worker 0-0, policy_version 674963 (0.00103) [2022-07-10 09:58:59,008][26022] Updated weights on worker 0-0, policy_version 674973 (0.00090) [2022-07-10 09:59:01,170][26022] Updated weights on worker 0-0, policy_version 674983 (0.00093) [2022-07-10 09:59:01,403][25689] Fps is (10 sec: 5609.2, 60 sec: 5517.6, 300 sec: 5554.8). Total num frames: 691182592. Throughput: 0: 5793.4. Samples: 691192042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:01,404][25689] Avg episode reward: [(0, '-4.143')] [2022-07-10 09:59:03,022][26022] Updated weights on worker 0-0, policy_version 674993 (0.00085) [2022-07-10 09:59:05,040][26022] Updated weights on worker 0-0, policy_version 675003 (0.00079) [2022-07-10 09:59:06,483][25689] Fps is (10 sec: 5450.0, 60 sec: 5544.8, 300 sec: 5561.5). Total num frames: 691211264. Throughput: 0: 4846.3. Samples: 691206946. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:06,484][25689] Avg episode reward: [(0, '-4.554')] [2022-07-10 09:59:06,720][26022] Updated weights on worker 0-0, policy_version 675013 (0.00093) [2022-07-10 09:59:08,598][26022] Updated weights on worker 0-0, policy_version 675023 (0.00092) [2022-07-10 09:59:10,608][26022] Updated weights on worker 0-0, policy_version 675033 (0.00093) [2022-07-10 09:59:11,535][25689] Fps is (10 sec: 5558.6, 60 sec: 5557.9, 300 sec: 5555.5). Total num frames: 691238912. Throughput: 0: 5664.9. Samples: 691240668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:11,536][25689] Avg episode reward: [(0, '-6.749')] [2022-07-10 09:59:12,335][26022] Updated weights on worker 0-0, policy_version 675043 (0.00090) [2022-07-10 09:59:14,114][26022] Updated weights on worker 0-0, policy_version 675053 (0.00838) [2022-07-10 09:59:15,931][26022] Updated weights on worker 0-0, policy_version 675063 (0.00087) [2022-07-10 09:59:16,628][25689] Fps is (10 sec: 5551.7, 60 sec: 5544.6, 300 sec: 5554.8). Total num frames: 691267584. Throughput: 0: 5706.4. Samples: 691274176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:16,629][25689] Avg episode reward: [(0, '-5.883')] [2022-07-10 09:59:17,923][26022] Updated weights on worker 0-0, policy_version 675073 (0.00087) [2022-07-10 09:59:19,716][26022] Updated weights on worker 0-0, policy_version 675083 (0.00081) [2022-07-10 09:59:21,422][26022] Updated weights on worker 0-0, policy_version 675093 (0.00089) [2022-07-10 09:59:21,669][25689] Fps is (10 sec: 5658.6, 60 sec: 5546.6, 300 sec: 5558.1). Total num frames: 691296256. Throughput: 0: 4875.2. Samples: 691290842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:21,670][25689] Avg episode reward: [(0, '-5.535')] [2022-07-10 09:59:23,234][26022] Updated weights on worker 0-0, policy_version 675103 (0.00086) [2022-07-10 09:59:25,142][26022] Updated weights on worker 0-0, policy_version 675113 (0.00089) [2022-07-10 09:59:26,694][25689] Fps is (10 sec: 5594.8, 60 sec: 5547.3, 300 sec: 5554.3). Total num frames: 691323904. Throughput: 0: 5817.4. Samples: 691324522. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:26,695][25689] Avg episode reward: [(0, '-4.872')] [2022-07-10 09:59:27,047][26022] Updated weights on worker 0-0, policy_version 675123 (0.00096) [2022-07-10 09:59:28,956][26022] Updated weights on worker 0-0, policy_version 675133 (0.00090) [2022-07-10 09:59:30,547][26022] Updated weights on worker 0-0, policy_version 675143 (0.00085) [2022-07-10 09:59:31,712][25689] Fps is (10 sec: 5302.3, 60 sec: 5497.8, 300 sec: 5551.8). Total num frames: 691349504. Throughput: 0: 5804.7. Samples: 691357786. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:31,713][25689] Avg episode reward: [(0, '-5.049')] [2022-07-10 09:59:32,480][26022] Updated weights on worker 0-0, policy_version 675153 (0.00091) [2022-07-10 09:59:34,623][26022] Updated weights on worker 0-0, policy_version 675163 (0.00081) [2022-07-10 09:59:36,147][26022] Updated weights on worker 0-0, policy_version 675173 (0.00080) [2022-07-10 09:59:36,820][25689] Fps is (10 sec: 5562.2, 60 sec: 5561.8, 300 sec: 5556.9). Total num frames: 691380224. Throughput: 0: 4962.6. Samples: 691374380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:36,820][25689] Avg episode reward: [(0, '-4.049')] [2022-07-10 09:59:38,268][26022] Updated weights on worker 0-0, policy_version 675183 (0.00094) [2022-07-10 09:59:39,763][26022] Updated weights on worker 0-0, policy_version 675193 (0.00095) [2022-07-10 09:59:41,839][25689] Fps is (10 sec: 5763.1, 60 sec: 5567.7, 300 sec: 5553.2). Total num frames: 691407872. Throughput: 0: 5792.3. Samples: 691407674. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:41,840][25689] Avg episode reward: [(0, '-3.958')] [2022-07-10 09:59:41,846][26022] Updated weights on worker 0-0, policy_version 675203 (0.00630) [2022-07-10 09:59:43,535][26022] Updated weights on worker 0-0, policy_version 675213 (0.00089) [2022-07-10 09:59:45,418][26022] Updated weights on worker 0-0, policy_version 675223 (0.00094) [2022-07-10 09:59:46,880][25689] Fps is (10 sec: 5598.5, 60 sec: 5531.7, 300 sec: 5556.2). Total num frames: 691436544. Throughput: 0: 5783.0. Samples: 691441254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:46,880][25689] Avg episode reward: [(0, '-3.607')] [2022-07-10 09:59:47,399][26022] Updated weights on worker 0-0, policy_version 675233 (0.00091) [2022-07-10 09:59:49,351][26022] Updated weights on worker 0-0, policy_version 675243 (0.00097) [2022-07-10 09:59:51,035][26022] Updated weights on worker 0-0, policy_version 675253 (0.00091) [2022-07-10 09:59:51,935][25689] Fps is (10 sec: 5578.5, 60 sec: 5578.8, 300 sec: 5553.7). Total num frames: 691464192. Throughput: 0: 4947.7. Samples: 691457846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:51,937][25689] Avg episode reward: [(0, '-4.436')] [2022-07-10 09:59:53,117][26022] Updated weights on worker 0-0, policy_version 675263 (0.00084) [2022-07-10 09:59:54,568][26022] Updated weights on worker 0-0, policy_version 675273 (0.00084) [2022-07-10 09:59:56,854][26022] Updated weights on worker 0-0, policy_version 675283 (0.00092) [2022-07-10 09:59:56,986][25689] Fps is (10 sec: 5471.2, 60 sec: 5537.9, 300 sec: 5549.6). Total num frames: 691491840. Throughput: 0: 5775.0. Samples: 691490842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 09:59:56,987][25689] Avg episode reward: [(0, '-4.271')] [2022-07-10 09:59:58,260][26022] Updated weights on worker 0-0, policy_version 675293 (0.00083) [2022-07-10 10:00:00,252][26022] Updated weights on worker 0-0, policy_version 675303 (0.00081) [2022-07-10 10:00:01,991][25689] Fps is (10 sec: 5295.5, 60 sec: 5526.7, 300 sec: 5553.6). Total num frames: 691517440. Throughput: 0: 5810.0. Samples: 691524752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:01,991][25689] Avg episode reward: [(0, '-3.316')] [2022-07-10 10:00:02,365][26022] Updated weights on worker 0-0, policy_version 675313 (0.00088) [2022-07-10 10:00:04,144][26022] Updated weights on worker 0-0, policy_version 675323 (0.00050) [2022-07-10 10:00:06,056][26022] Updated weights on worker 0-0, policy_version 675333 (0.00440) [2022-07-10 10:00:07,046][25689] Fps is (10 sec: 5395.1, 60 sec: 5528.9, 300 sec: 5549.4). Total num frames: 691546112. Throughput: 0: 4868.6. Samples: 691539434. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:07,048][25689] Avg episode reward: [(0, '-2.465')] [2022-07-10 10:00:07,975][26022] Updated weights on worker 0-0, policy_version 675343 (0.00088) [2022-07-10 10:00:09,690][26022] Updated weights on worker 0-0, policy_version 675353 (0.00086) [2022-07-10 10:00:11,599][26022] Updated weights on worker 0-0, policy_version 675363 (0.00081) [2022-07-10 10:00:12,121][25689] Fps is (10 sec: 5660.3, 60 sec: 5543.7, 300 sec: 5553.9). Total num frames: 691574784. Throughput: 0: 5730.4. Samples: 691573520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:12,122][25689] Avg episode reward: [(0, '-3.104')] [2022-07-10 10:00:13,291][26022] Updated weights on worker 0-0, policy_version 675373 (0.00086) [2022-07-10 10:00:15,123][26022] Updated weights on worker 0-0, policy_version 675383 (0.00082) [2022-07-10 10:00:17,093][26022] Updated weights on worker 0-0, policy_version 675393 (0.00099) [2022-07-10 10:00:17,185][25689] Fps is (10 sec: 5656.0, 60 sec: 5546.4, 300 sec: 5552.8). Total num frames: 691603456. Throughput: 0: 5762.2. Samples: 691607228. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:17,185][25689] Avg episode reward: [(0, '-3.744')] [2022-07-10 10:00:18,808][26022] Updated weights on worker 0-0, policy_version 675403 (0.00297) [2022-07-10 10:00:20,673][26022] Updated weights on worker 0-0, policy_version 675413 (0.00092) [2022-07-10 10:00:21,980][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:00:21,990][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000675420_691630080.pth [2022-07-10 10:00:21,990][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000673467_689630208.pth [2022-07-10 10:00:22,200][25689] Fps is (10 sec: 5689.7, 60 sec: 5548.8, 300 sec: 5549.6). Total num frames: 691632128. Throughput: 0: 4918.2. Samples: 691624146. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:22,201][25689] Avg episode reward: [(0, '-3.413')] [2022-07-10 10:00:22,349][26022] Updated weights on worker 0-0, policy_version 675423 (0.00096) [2022-07-10 10:00:24,184][26022] Updated weights on worker 0-0, policy_version 675433 (0.00089) [2022-07-10 10:00:26,076][26022] Updated weights on worker 0-0, policy_version 675443 (0.00116) [2022-07-10 10:00:27,231][25689] Fps is (10 sec: 5708.1, 60 sec: 5565.2, 300 sec: 5556.1). Total num frames: 691660800. Throughput: 0: 5871.1. Samples: 691657940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:27,232][25689] Avg episode reward: [(0, '-3.415')] [2022-07-10 10:00:27,955][26022] Updated weights on worker 0-0, policy_version 675453 (0.00093) [2022-07-10 10:00:29,892][26022] Updated weights on worker 0-0, policy_version 675463 (0.00089) [2022-07-10 10:00:31,403][26022] Updated weights on worker 0-0, policy_version 675473 (0.00091) [2022-07-10 10:00:32,251][25689] Fps is (10 sec: 5501.5, 60 sec: 5581.8, 300 sec: 5549.9). Total num frames: 691687424. Throughput: 0: 5857.4. Samples: 691691428. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:32,253][25689] Avg episode reward: [(0, '-3.045')] [2022-07-10 10:00:33,483][26022] Updated weights on worker 0-0, policy_version 675483 (0.00094) [2022-07-10 10:00:35,068][26022] Updated weights on worker 0-0, policy_version 675493 (0.00090) [2022-07-10 10:00:37,064][26022] Updated weights on worker 0-0, policy_version 675503 (0.00094) [2022-07-10 10:00:37,349][25689] Fps is (10 sec: 5566.4, 60 sec: 5565.9, 300 sec: 5555.4). Total num frames: 691717120. Throughput: 0: 5853.7. Samples: 691725262. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:37,349][25689] Avg episode reward: [(0, '-1.640')] [2022-07-10 10:00:38,827][26022] Updated weights on worker 0-0, policy_version 675513 (0.00101) [2022-07-10 10:00:40,719][26022] Updated weights on worker 0-0, policy_version 675523 (0.00059) [2022-07-10 10:00:42,362][25689] Fps is (10 sec: 5671.4, 60 sec: 5566.4, 300 sec: 5555.5). Total num frames: 691744768. Throughput: 0: 5845.4. Samples: 691742002. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:42,363][25689] Avg episode reward: [(0, '-3.158')] [2022-07-10 10:00:42,471][26022] Updated weights on worker 0-0, policy_version 675533 (0.00088) [2022-07-10 10:00:44,427][26022] Updated weights on worker 0-0, policy_version 675543 (0.00097) [2022-07-10 10:00:46,209][26022] Updated weights on worker 0-0, policy_version 675553 (0.00086) [2022-07-10 10:00:47,402][25689] Fps is (10 sec: 5500.4, 60 sec: 5549.6, 300 sec: 5548.9). Total num frames: 691772416. Throughput: 0: 5834.1. Samples: 691775620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:47,402][25689] Avg episode reward: [(0, '-2.205')] [2022-07-10 10:00:48,064][26022] Updated weights on worker 0-0, policy_version 675563 (0.00090) [2022-07-10 10:00:49,927][26022] Updated weights on worker 0-0, policy_version 675573 (0.00097) [2022-07-10 10:00:51,747][26022] Updated weights on worker 0-0, policy_version 675583 (0.00088) [2022-07-10 10:00:52,410][25689] Fps is (10 sec: 5605.2, 60 sec: 5570.9, 300 sec: 5556.9). Total num frames: 691801088. Throughput: 0: 5856.5. Samples: 691809488. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:52,411][25689] Avg episode reward: [(0, '-2.704')] [2022-07-10 10:00:53,602][26022] Updated weights on worker 0-0, policy_version 675593 (0.00097) [2022-07-10 10:00:55,311][26022] Updated weights on worker 0-0, policy_version 675603 (0.00086) [2022-07-10 10:00:57,006][26022] Updated weights on worker 0-0, policy_version 675613 (0.00088) [2022-07-10 10:00:57,548][25689] Fps is (10 sec: 5551.0, 60 sec: 5562.9, 300 sec: 5551.6). Total num frames: 691828736. Throughput: 0: 5001.4. Samples: 691826286. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:00:57,548][25689] Avg episode reward: [(0, '-3.686')] [2022-07-10 10:00:58,996][26022] Updated weights on worker 0-0, policy_version 675623 (0.00086) [2022-07-10 10:01:00,906][26022] Updated weights on worker 0-0, policy_version 675633 (0.00090) [2022-07-10 10:01:02,571][25689] Fps is (10 sec: 5240.6, 60 sec: 5561.2, 300 sec: 5551.9). Total num frames: 691854336. Throughput: 0: 5809.4. Samples: 691859400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:02,571][25689] Avg episode reward: [(0, '-3.191')] [2022-07-10 10:01:03,047][26022] Updated weights on worker 0-0, policy_version 675643 (0.00085) [2022-07-10 10:01:05,180][26022] Updated weights on worker 0-0, policy_version 675653 (0.00099) [2022-07-10 10:01:06,642][26022] Updated weights on worker 0-0, policy_version 675663 (0.00090) [2022-07-10 10:01:07,614][25689] Fps is (10 sec: 5391.5, 60 sec: 5562.3, 300 sec: 5555.3). Total num frames: 691883008. Throughput: 0: 5691.5. Samples: 691890658. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:07,614][25689] Avg episode reward: [(0, '-3.101')] [2022-07-10 10:01:08,846][26022] Updated weights on worker 0-0, policy_version 675673 (0.00090) [2022-07-10 10:01:10,427][26022] Updated weights on worker 0-0, policy_version 675683 (0.00616) [2022-07-10 10:01:12,262][26022] Updated weights on worker 0-0, policy_version 675693 (0.00094) [2022-07-10 10:01:12,641][25689] Fps is (10 sec: 5694.1, 60 sec: 5566.7, 300 sec: 5549.5). Total num frames: 691911680. Throughput: 0: 4845.4. Samples: 691907518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:12,642][25689] Avg episode reward: [(0, '-2.260')] [2022-07-10 10:01:14,391][26022] Updated weights on worker 0-0, policy_version 675703 (0.00086) [2022-07-10 10:01:15,922][26022] Updated weights on worker 0-0, policy_version 675713 (0.00085) [2022-07-10 10:01:17,751][25689] Fps is (10 sec: 5454.6, 60 sec: 5528.6, 300 sec: 5548.2). Total num frames: 691938304. Throughput: 0: 5672.8. Samples: 691940898. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:17,752][25689] Avg episode reward: [(0, '-2.401')] [2022-07-10 10:01:18,003][26022] Updated weights on worker 0-0, policy_version 675723 (0.00084) [2022-07-10 10:01:19,631][26022] Updated weights on worker 0-0, policy_version 675733 (0.00094) [2022-07-10 10:01:21,394][26022] Updated weights on worker 0-0, policy_version 675743 (0.00093) [2022-07-10 10:01:22,818][25689] Fps is (10 sec: 5534.1, 60 sec: 5540.8, 300 sec: 5554.3). Total num frames: 691968000. Throughput: 0: 5663.5. Samples: 691974072. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:22,819][25689] Avg episode reward: [(0, '-2.486')] [2022-07-10 10:01:23,403][26022] Updated weights on worker 0-0, policy_version 675753 (0.00097) [2022-07-10 10:01:25,193][26022] Updated weights on worker 0-0, policy_version 675763 (0.00093) [2022-07-10 10:01:27,254][26022] Updated weights on worker 0-0, policy_version 675773 (0.00084) [2022-07-10 10:01:27,837][25689] Fps is (10 sec: 5584.0, 60 sec: 5508.0, 300 sec: 5544.3). Total num frames: 691994624. Throughput: 0: 4951.6. Samples: 691990796. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:27,838][25689] Avg episode reward: [(0, '-2.427')] [2022-07-10 10:01:29,024][26022] Updated weights on worker 0-0, policy_version 675783 (0.00092) [2022-07-10 10:01:30,964][26022] Updated weights on worker 0-0, policy_version 675793 (0.00089) [2022-07-10 10:01:32,733][26022] Updated weights on worker 0-0, policy_version 675803 (0.00093) [2022-07-10 10:01:32,934][25689] Fps is (10 sec: 5365.3, 60 sec: 5518.0, 300 sec: 5548.3). Total num frames: 692022272. Throughput: 0: 5717.8. Samples: 692023546. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:32,934][25689] Avg episode reward: [(0, '-3.584')] [2022-07-10 10:01:34,472][26022] Updated weights on worker 0-0, policy_version 675813 (0.00092) [2022-07-10 10:01:36,583][26022] Updated weights on worker 0-0, policy_version 675823 (0.00090) [2022-07-10 10:01:38,046][25689] Fps is (10 sec: 5617.0, 60 sec: 5516.6, 300 sec: 5549.8). Total num frames: 692051968. Throughput: 0: 5720.0. Samples: 692056986. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:38,047][25689] Avg episode reward: [(0, '-3.048')] [2022-07-10 10:01:38,228][26022] Updated weights on worker 0-0, policy_version 675833 (0.00092) [2022-07-10 10:01:40,198][26022] Updated weights on worker 0-0, policy_version 675843 (0.00092) [2022-07-10 10:01:41,985][26022] Updated weights on worker 0-0, policy_version 675853 (0.00086) [2022-07-10 10:01:43,059][25689] Fps is (10 sec: 5461.4, 60 sec: 5483.0, 300 sec: 5540.5). Total num frames: 692077568. Throughput: 0: 4919.3. Samples: 692073640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:43,060][25689] Avg episode reward: [(0, '-3.275')] [2022-07-10 10:01:43,647][26022] Updated weights on worker 0-0, policy_version 675863 (0.00108) [2022-07-10 10:01:45,734][26022] Updated weights on worker 0-0, policy_version 675873 (0.00426) [2022-07-10 10:01:47,275][26022] Updated weights on worker 0-0, policy_version 675883 (0.00084) [2022-07-10 10:01:48,061][25689] Fps is (10 sec: 5521.3, 60 sec: 5520.1, 300 sec: 5544.2). Total num frames: 692107264. Throughput: 0: 5763.1. Samples: 692107348. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 10:01:48,062][25689] Avg episode reward: [(0, '-3.340')] [2022-07-10 10:01:49,447][26022] Updated weights on worker 0-0, policy_version 675893 (0.00087) [2022-07-10 10:01:50,951][26022] Updated weights on worker 0-0, policy_version 675903 (0.00088) [2022-07-10 10:01:52,820][26022] Updated weights on worker 0-0, policy_version 675913 (0.00092) [2022-07-10 10:01:53,130][25689] Fps is (10 sec: 5897.3, 60 sec: 5531.5, 300 sec: 5551.2). Total num frames: 692136960. Throughput: 0: 5794.2. Samples: 692140566. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:01:53,131][25689] Avg episode reward: [(0, '-3.893')] [2022-07-10 10:01:54,908][26022] Updated weights on worker 0-0, policy_version 675923 (0.00083) [2022-07-10 10:01:56,575][26022] Updated weights on worker 0-0, policy_version 675933 (0.00394) [2022-07-10 10:01:58,188][25689] Fps is (10 sec: 5460.5, 60 sec: 5505.0, 300 sec: 5540.2). Total num frames: 692162560. Throughput: 0: 4977.3. Samples: 692157236. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:01:58,188][25689] Avg episode reward: [(0, '-4.263')] [2022-07-10 10:01:58,510][26022] Updated weights on worker 0-0, policy_version 675943 (0.00099) [2022-07-10 10:02:00,435][26022] Updated weights on worker 0-0, policy_version 675953 (0.00090) [2022-07-10 10:02:02,538][26022] Updated weights on worker 0-0, policy_version 675963 (0.00087) [2022-07-10 10:02:03,232][25689] Fps is (10 sec: 5068.2, 60 sec: 5503.1, 300 sec: 5542.9). Total num frames: 692188160. Throughput: 0: 5750.0. Samples: 692189636. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:03,232][25689] Avg episode reward: [(0, '-4.456')] [2022-07-10 10:02:04,422][26022] Updated weights on worker 0-0, policy_version 675973 (0.00089) [2022-07-10 10:02:06,146][26022] Updated weights on worker 0-0, policy_version 675983 (0.00085) [2022-07-10 10:02:07,923][26022] Updated weights on worker 0-0, policy_version 675993 (0.00095) [2022-07-10 10:02:08,244][25689] Fps is (10 sec: 5498.8, 60 sec: 5522.8, 300 sec: 5546.7). Total num frames: 692217856. Throughput: 0: 5695.6. Samples: 692222298. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:08,244][25689] Avg episode reward: [(0, '-4.785')] [2022-07-10 10:02:10,197][26022] Updated weights on worker 0-0, policy_version 676003 (0.00092) [2022-07-10 10:02:11,299][26022] Updated weights on worker 0-0, policy_version 676013 (0.00052) [2022-07-10 10:02:13,250][25689] Fps is (10 sec: 5724.1, 60 sec: 5507.9, 300 sec: 5544.5). Total num frames: 692245504. Throughput: 0: 4904.7. Samples: 692239250. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:13,250][25689] Avg episode reward: [(0, '-5.131')] [2022-07-10 10:02:13,557][26022] Updated weights on worker 0-0, policy_version 676023 (0.00086) [2022-07-10 10:02:15,185][26022] Updated weights on worker 0-0, policy_version 676033 (0.00090) [2022-07-10 10:02:17,122][26022] Updated weights on worker 0-0, policy_version 676043 (0.00081) [2022-07-10 10:02:18,361][25689] Fps is (10 sec: 5566.6, 60 sec: 5541.5, 300 sec: 5542.7). Total num frames: 692274176. Throughput: 0: 5748.6. Samples: 692273204. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:18,362][25689] Avg episode reward: [(0, '-4.488')] [2022-07-10 10:02:19,031][26022] Updated weights on worker 0-0, policy_version 676053 (0.00081) [2022-07-10 10:02:20,917][26022] Updated weights on worker 0-0, policy_version 676063 (0.00093) [2022-07-10 10:02:22,055][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:02:22,078][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000676069_692294656.pth [2022-07-10 10:02:22,078][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000674118_690296832.pth [2022-07-10 10:02:22,605][26022] Updated weights on worker 0-0, policy_version 676073 (0.00093) [2022-07-10 10:02:23,385][25689] Fps is (10 sec: 5658.0, 60 sec: 5528.6, 300 sec: 5546.0). Total num frames: 692302848. Throughput: 0: 5814.4. Samples: 692306814. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:23,386][25689] Avg episode reward: [(0, '-4.354')] [2022-07-10 10:02:24,507][26022] Updated weights on worker 0-0, policy_version 676083 (0.00091) [2022-07-10 10:02:26,302][26022] Updated weights on worker 0-0, policy_version 676093 (0.00083) [2022-07-10 10:02:28,196][26022] Updated weights on worker 0-0, policy_version 676103 (0.00093) [2022-07-10 10:02:28,393][25689] Fps is (10 sec: 5614.1, 60 sec: 5546.5, 300 sec: 5542.6). Total num frames: 692330496. Throughput: 0: 5025.5. Samples: 692323558. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:28,394][25689] Avg episode reward: [(0, '-3.553')] [2022-07-10 10:02:30,139][26022] Updated weights on worker 0-0, policy_version 676113 (0.00084) [2022-07-10 10:02:31,666][26022] Updated weights on worker 0-0, policy_version 676123 (0.00092) [2022-07-10 10:02:33,445][25689] Fps is (10 sec: 5394.6, 60 sec: 5533.6, 300 sec: 5539.3). Total num frames: 692357120. Throughput: 0: 5825.7. Samples: 692356902. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:33,446][25689] Avg episode reward: [(0, '-3.032')] [2022-07-10 10:02:33,955][26022] Updated weights on worker 0-0, policy_version 676133 (0.00097) [2022-07-10 10:02:35,428][26022] Updated weights on worker 0-0, policy_version 676143 (0.00088) [2022-07-10 10:02:37,420][26022] Updated weights on worker 0-0, policy_version 676153 (0.00092) [2022-07-10 10:02:38,515][25689] Fps is (10 sec: 5564.3, 60 sec: 5537.6, 300 sec: 5544.9). Total num frames: 692386816. Throughput: 0: 5805.9. Samples: 692390214. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:38,515][25689] Avg episode reward: [(0, '-2.982')] [2022-07-10 10:02:39,565][26022] Updated weights on worker 0-0, policy_version 676163 (0.00093) [2022-07-10 10:02:41,015][26022] Updated weights on worker 0-0, policy_version 676173 (0.00109) [2022-07-10 10:02:43,226][26022] Updated weights on worker 0-0, policy_version 676183 (0.00086) [2022-07-10 10:02:43,536][25689] Fps is (10 sec: 5784.7, 60 sec: 5587.6, 300 sec: 5544.7). Total num frames: 692415488. Throughput: 0: 5805.0. Samples: 692423788. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:43,536][25689] Avg episode reward: [(0, '-1.590')] [2022-07-10 10:02:44,627][26022] Updated weights on worker 0-0, policy_version 676193 (0.00092) [2022-07-10 10:02:46,659][26022] Updated weights on worker 0-0, policy_version 676203 (0.00082) [2022-07-10 10:02:48,503][26022] Updated weights on worker 0-0, policy_version 676213 (0.00103) [2022-07-10 10:02:48,558][25689] Fps is (10 sec: 5607.6, 60 sec: 5551.9, 300 sec: 5544.3). Total num frames: 692443136. Throughput: 0: 5805.5. Samples: 692440628. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:48,568][25689] Avg episode reward: [(0, '-1.637')] [2022-07-10 10:02:50,031][26022] Updated weights on worker 0-0, policy_version 676223 (0.00084) [2022-07-10 10:02:52,072][26022] Updated weights on worker 0-0, policy_version 676233 (0.00093) [2022-07-10 10:02:53,601][25689] Fps is (10 sec: 5493.7, 60 sec: 5520.4, 300 sec: 5544.5). Total num frames: 692470784. Throughput: 0: 5825.4. Samples: 692474316. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:53,603][25689] Avg episode reward: [(0, '-0.819')] [2022-07-10 10:02:54,032][26022] Updated weights on worker 0-0, policy_version 676243 (0.00080) [2022-07-10 10:02:55,708][26022] Updated weights on worker 0-0, policy_version 676253 (0.00092) [2022-07-10 10:02:57,646][26022] Updated weights on worker 0-0, policy_version 676263 (0.00085) [2022-07-10 10:02:58,731][25689] Fps is (10 sec: 5536.5, 60 sec: 5564.6, 300 sec: 5542.4). Total num frames: 692499456. Throughput: 0: 5840.5. Samples: 692508286. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:02:58,731][25689] Avg episode reward: [(0, '-2.311')] [2022-07-10 10:02:59,194][26022] Updated weights on worker 0-0, policy_version 676273 (0.00093) [2022-07-10 10:03:01,307][26022] Updated weights on worker 0-0, policy_version 676283 (0.00088) [2022-07-10 10:03:03,368][26022] Updated weights on worker 0-0, policy_version 676293 (0.00088) [2022-07-10 10:03:03,799][25689] Fps is (10 sec: 5422.4, 60 sec: 5579.3, 300 sec: 5541.3). Total num frames: 692526080. Throughput: 0: 4991.3. Samples: 692524928. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:03,799][25689] Avg episode reward: [(0, '-2.820')] [2022-07-10 10:03:05,333][26022] Updated weights on worker 0-0, policy_version 676303 (0.00091) [2022-07-10 10:03:07,104][26022] Updated weights on worker 0-0, policy_version 676313 (0.00087) [2022-07-10 10:03:08,818][25689] Fps is (10 sec: 5380.3, 60 sec: 5544.8, 300 sec: 5544.6). Total num frames: 692553728. Throughput: 0: 5709.4. Samples: 692556298. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:08,820][25689] Avg episode reward: [(0, '-2.365')] [2022-07-10 10:03:08,950][26022] Updated weights on worker 0-0, policy_version 676323 (0.00093) [2022-07-10 10:03:10,758][26022] Updated weights on worker 0-0, policy_version 676333 (0.00085) [2022-07-10 10:03:12,475][26022] Updated weights on worker 0-0, policy_version 676343 (0.00087) [2022-07-10 10:03:13,838][25689] Fps is (10 sec: 5507.9, 60 sec: 5543.5, 300 sec: 5539.8). Total num frames: 692581376. Throughput: 0: 5724.9. Samples: 692590172. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:13,838][25689] Avg episode reward: [(0, '-2.613')] [2022-07-10 10:03:14,456][26022] Updated weights on worker 0-0, policy_version 676353 (0.00081) [2022-07-10 10:03:16,103][26022] Updated weights on worker 0-0, policy_version 676363 (0.00089) [2022-07-10 10:03:18,204][26022] Updated weights on worker 0-0, policy_version 676373 (0.00090) [2022-07-10 10:03:18,943][25689] Fps is (10 sec: 5764.6, 60 sec: 5577.9, 300 sec: 5545.8). Total num frames: 692612096. Throughput: 0: 4886.0. Samples: 692607044. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:18,944][25689] Avg episode reward: [(0, '-4.806')] [2022-07-10 10:03:19,840][26022] Updated weights on worker 0-0, policy_version 676383 (0.00089) [2022-07-10 10:03:21,671][26022] Updated weights on worker 0-0, policy_version 676393 (0.00085) [2022-07-10 10:03:23,369][26022] Updated weights on worker 0-0, policy_version 676403 (0.00086) [2022-07-10 10:03:23,979][25689] Fps is (10 sec: 5654.8, 60 sec: 5543.0, 300 sec: 5542.4). Total num frames: 692638720. Throughput: 0: 5743.1. Samples: 692640824. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:23,979][25689] Avg episode reward: [(0, '-5.269')] [2022-07-10 10:03:25,420][26022] Updated weights on worker 0-0, policy_version 676413 (0.00091) [2022-07-10 10:03:27,040][26022] Updated weights on worker 0-0, policy_version 676423 (0.00086) [2022-07-10 10:03:28,989][25689] Fps is (10 sec: 5402.8, 60 sec: 5542.9, 300 sec: 5539.3). Total num frames: 692666368. Throughput: 0: 5846.3. Samples: 692674220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:28,991][25689] Avg episode reward: [(0, '-5.122')] [2022-07-10 10:03:29,077][26022] Updated weights on worker 0-0, policy_version 676433 (0.00089) [2022-07-10 10:03:30,916][26022] Updated weights on worker 0-0, policy_version 676443 (0.00083) [2022-07-10 10:03:32,791][26022] Updated weights on worker 0-0, policy_version 676453 (0.00107) [2022-07-10 10:03:33,992][25689] Fps is (10 sec: 5522.4, 60 sec: 5564.3, 300 sec: 5544.0). Total num frames: 692694016. Throughput: 0: 4992.4. Samples: 692690788. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:33,993][25689] Avg episode reward: [(0, '-5.125')] [2022-07-10 10:03:34,664][26022] Updated weights on worker 0-0, policy_version 676463 (0.00090) [2022-07-10 10:03:36,384][26022] Updated weights on worker 0-0, policy_version 676473 (0.00092) [2022-07-10 10:03:38,332][26022] Updated weights on worker 0-0, policy_version 676483 (0.00102) [2022-07-10 10:03:39,089][25689] Fps is (10 sec: 5576.2, 60 sec: 5544.8, 300 sec: 5547.2). Total num frames: 692722688. Throughput: 0: 5812.0. Samples: 692724128. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:39,089][25689] Avg episode reward: [(0, '-5.477')] [2022-07-10 10:03:40,267][26022] Updated weights on worker 0-0, policy_version 676493 (0.00098) [2022-07-10 10:03:41,900][26022] Updated weights on worker 0-0, policy_version 676503 (0.00093) [2022-07-10 10:03:43,931][26022] Updated weights on worker 0-0, policy_version 676513 (0.00090) [2022-07-10 10:03:44,102][25689] Fps is (10 sec: 5570.9, 60 sec: 5528.6, 300 sec: 5536.9). Total num frames: 692750336. Throughput: 0: 5809.1. Samples: 692757720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:44,102][25689] Avg episode reward: [(0, '-5.022')] [2022-07-10 10:03:45,459][26022] Updated weights on worker 0-0, policy_version 676523 (0.00089) [2022-07-10 10:03:47,451][26022] Updated weights on worker 0-0, policy_version 676533 (0.00089) [2022-07-10 10:03:49,122][25689] Fps is (10 sec: 5613.4, 60 sec: 5545.8, 300 sec: 5550.6). Total num frames: 692779008. Throughput: 0: 4979.0. Samples: 692774464. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:49,122][25689] Avg episode reward: [(0, '-4.089')] [2022-07-10 10:03:49,304][26022] Updated weights on worker 0-0, policy_version 676543 (0.00086) [2022-07-10 10:03:51,413][26022] Updated weights on worker 0-0, policy_version 676553 (0.00090) [2022-07-10 10:03:53,029][26022] Updated weights on worker 0-0, policy_version 676563 (0.00086) [2022-07-10 10:03:54,133][25689] Fps is (10 sec: 5614.5, 60 sec: 5548.7, 300 sec: 5543.0). Total num frames: 692806656. Throughput: 0: 5806.2. Samples: 692807730. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:54,133][25689] Avg episode reward: [(0, '-3.449')] [2022-07-10 10:03:54,759][26022] Updated weights on worker 0-0, policy_version 676573 (0.00084) [2022-07-10 10:03:56,767][26022] Updated weights on worker 0-0, policy_version 676583 (0.00091) [2022-07-10 10:03:58,610][26022] Updated weights on worker 0-0, policy_version 676593 (0.00091) [2022-07-10 10:03:59,167][25689] Fps is (10 sec: 5504.9, 60 sec: 5540.6, 300 sec: 5547.1). Total num frames: 692834304. Throughput: 0: 5819.6. Samples: 692840974. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:03:59,167][25689] Avg episode reward: [(0, '-2.384')] [2022-07-10 10:04:00,426][26022] Updated weights on worker 0-0, policy_version 676603 (0.00095) [2022-07-10 10:04:02,664][26022] Updated weights on worker 0-0, policy_version 676613 (0.00086) [2022-07-10 10:04:04,184][25689] Fps is (10 sec: 5195.9, 60 sec: 5511.3, 300 sec: 5534.5). Total num frames: 692858880. Throughput: 0: 4962.6. Samples: 692857380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:04,184][25689] Avg episode reward: [(0, '-2.157')] [2022-07-10 10:04:04,501][26022] Updated weights on worker 0-0, policy_version 676623 (0.00087) [2022-07-10 10:04:06,370][26022] Updated weights on worker 0-0, policy_version 676633 (0.00095) [2022-07-10 10:04:08,093][26022] Updated weights on worker 0-0, policy_version 676643 (0.00090) [2022-07-10 10:04:09,186][25689] Fps is (10 sec: 5416.7, 60 sec: 5546.8, 300 sec: 5542.3). Total num frames: 692888576. Throughput: 0: 5702.5. Samples: 692888880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:09,188][25689] Avg episode reward: [(0, '-1.588')] [2022-07-10 10:04:10,185][26022] Updated weights on worker 0-0, policy_version 676653 (0.01333) [2022-07-10 10:04:11,807][26022] Updated weights on worker 0-0, policy_version 676663 (0.00080) [2022-07-10 10:04:13,567][26022] Updated weights on worker 0-0, policy_version 676673 (0.00081) [2022-07-10 10:04:14,200][25689] Fps is (10 sec: 5725.0, 60 sec: 5547.3, 300 sec: 5540.3). Total num frames: 692916224. Throughput: 0: 5739.9. Samples: 692922914. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:14,201][25689] Avg episode reward: [(0, '-0.962')] [2022-07-10 10:04:15,459][26022] Updated weights on worker 0-0, policy_version 676683 (0.00090) [2022-07-10 10:04:17,355][26022] Updated weights on worker 0-0, policy_version 676693 (0.00091) [2022-07-10 10:04:19,167][26022] Updated weights on worker 0-0, policy_version 676703 (0.00082) [2022-07-10 10:04:19,264][25689] Fps is (10 sec: 5486.9, 60 sec: 5500.2, 300 sec: 5536.4). Total num frames: 692943872. Throughput: 0: 4915.5. Samples: 692939762. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:19,266][25689] Avg episode reward: [(0, '-1.095')] [2022-07-10 10:04:20,878][26022] Updated weights on worker 0-0, policy_version 676713 (0.00087) [2022-07-10 10:04:22,241][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:04:22,254][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000676720_692961280.pth [2022-07-10 10:04:22,255][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000674772_690966528.pth [2022-07-10 10:04:22,852][26022] Updated weights on worker 0-0, policy_version 676723 (0.00092) [2022-07-10 10:04:24,281][25689] Fps is (10 sec: 5586.6, 60 sec: 5535.8, 300 sec: 5540.0). Total num frames: 692972544. Throughput: 0: 5759.2. Samples: 692973128. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:24,282][25689] Avg episode reward: [(0, '-2.071')] [2022-07-10 10:04:24,406][26022] Updated weights on worker 0-0, policy_version 676733 (0.00091) [2022-07-10 10:04:26,526][26022] Updated weights on worker 0-0, policy_version 676743 (0.00087) [2022-07-10 10:04:28,145][26022] Updated weights on worker 0-0, policy_version 676753 (0.00086) [2022-07-10 10:04:29,299][25689] Fps is (10 sec: 5714.5, 60 sec: 5552.1, 300 sec: 5550.3). Total num frames: 693001216. Throughput: 0: 5866.6. Samples: 693006872. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:29,299][25689] Avg episode reward: [(0, '-2.906')] [2022-07-10 10:04:30,193][26022] Updated weights on worker 0-0, policy_version 676763 (0.00088) [2022-07-10 10:04:31,900][26022] Updated weights on worker 0-0, policy_version 676773 (0.00086) [2022-07-10 10:04:33,840][26022] Updated weights on worker 0-0, policy_version 676783 (0.00093) [2022-07-10 10:04:34,323][25689] Fps is (10 sec: 5506.9, 60 sec: 5533.2, 300 sec: 5538.2). Total num frames: 693027840. Throughput: 0: 4986.5. Samples: 693023254. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:34,323][25689] Avg episode reward: [(0, '-2.731')] [2022-07-10 10:04:35,778][26022] Updated weights on worker 0-0, policy_version 676793 (0.00622) [2022-07-10 10:04:37,428][26022] Updated weights on worker 0-0, policy_version 676803 (0.00084) [2022-07-10 10:04:39,388][26022] Updated weights on worker 0-0, policy_version 676813 (0.00092) [2022-07-10 10:04:39,430][25689] Fps is (10 sec: 5458.0, 60 sec: 5532.3, 300 sec: 5540.0). Total num frames: 693056512. Throughput: 0: 5800.3. Samples: 693056730. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:39,431][25689] Avg episode reward: [(0, '-3.282')] [2022-07-10 10:04:41,215][26022] Updated weights on worker 0-0, policy_version 676823 (0.00083) [2022-07-10 10:04:42,807][26022] Updated weights on worker 0-0, policy_version 676833 (0.00102) [2022-07-10 10:04:44,498][25689] Fps is (10 sec: 5534.7, 60 sec: 5527.2, 300 sec: 5536.0). Total num frames: 693084160. Throughput: 0: 5785.1. Samples: 693090084. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:44,499][25689] Avg episode reward: [(0, '-4.918')] [2022-07-10 10:04:44,996][26022] Updated weights on worker 0-0, policy_version 676843 (0.00084) [2022-07-10 10:04:46,628][26022] Updated weights on worker 0-0, policy_version 676853 (0.00089) [2022-07-10 10:04:48,514][26022] Updated weights on worker 0-0, policy_version 676863 (0.00081) [2022-07-10 10:04:49,515][25689] Fps is (10 sec: 5584.3, 60 sec: 5527.5, 300 sec: 5540.2). Total num frames: 693112832. Throughput: 0: 4933.9. Samples: 693106618. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:49,516][25689] Avg episode reward: [(0, '-5.072')] [2022-07-10 10:04:50,470][26022] Updated weights on worker 0-0, policy_version 676873 (0.00098) [2022-07-10 10:04:52,066][26022] Updated weights on worker 0-0, policy_version 676883 (0.00087) [2022-07-10 10:04:54,252][26022] Updated weights on worker 0-0, policy_version 676893 (0.00093) [2022-07-10 10:04:54,536][25689] Fps is (10 sec: 5611.1, 60 sec: 5526.6, 300 sec: 5540.7). Total num frames: 693140480. Throughput: 0: 5776.2. Samples: 693140008. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:54,537][25689] Avg episode reward: [(0, '-6.330')] [2022-07-10 10:04:55,952][26022] Updated weights on worker 0-0, policy_version 676903 (0.00088) [2022-07-10 10:04:57,837][26022] Updated weights on worker 0-0, policy_version 676913 (0.00091) [2022-07-10 10:04:59,623][25689] Fps is (10 sec: 5470.6, 60 sec: 5521.7, 300 sec: 5546.1). Total num frames: 693168128. Throughput: 0: 5774.3. Samples: 693173332. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:04:59,624][25689] Avg episode reward: [(0, '-7.158')] [2022-07-10 10:04:59,768][26022] Updated weights on worker 0-0, policy_version 676923 (0.00083) [2022-07-10 10:05:01,355][26022] Updated weights on worker 0-0, policy_version 676933 (0.00094) [2022-07-10 10:05:03,763][26022] Updated weights on worker 0-0, policy_version 676943 (0.00154) [2022-07-10 10:05:04,644][25689] Fps is (10 sec: 5368.8, 60 sec: 5555.2, 300 sec: 5539.8). Total num frames: 693194752. Throughput: 0: 4930.6. Samples: 693189416. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-10 10:05:04,646][25689] Avg episode reward: [(0, '-6.975')] [2022-07-10 10:05:05,540][26022] Updated weights on worker 0-0, policy_version 676953 (0.00089) [2022-07-10 10:05:07,384][26022] Updated weights on worker 0-0, policy_version 676963 (0.00096) [2022-07-10 10:05:09,184][26022] Updated weights on worker 0-0, policy_version 676973 (0.00090) [2022-07-10 10:05:09,660][25689] Fps is (10 sec: 5509.2, 60 sec: 5537.0, 300 sec: 5540.9). Total num frames: 693223424. Throughput: 0: 5694.8. Samples: 693221340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:09,662][25689] Avg episode reward: [(0, '-7.069')] [2022-07-10 10:05:11,085][26022] Updated weights on worker 0-0, policy_version 676983 (0.00054) [2022-07-10 10:05:12,674][26022] Updated weights on worker 0-0, policy_version 676993 (0.00087) [2022-07-10 10:05:14,689][25689] Fps is (10 sec: 5403.0, 60 sec: 5501.8, 300 sec: 5531.2). Total num frames: 693249024. Throughput: 0: 5706.5. Samples: 693255014. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:14,690][25689] Avg episode reward: [(0, '-5.980')] [2022-07-10 10:05:15,000][26022] Updated weights on worker 0-0, policy_version 677003 (0.00092) [2022-07-10 10:05:16,476][26022] Updated weights on worker 0-0, policy_version 677013 (0.00086) [2022-07-10 10:05:18,394][26022] Updated weights on worker 0-0, policy_version 677023 (0.00093) [2022-07-10 10:05:19,760][25689] Fps is (10 sec: 5373.6, 60 sec: 5518.1, 300 sec: 5530.2). Total num frames: 693277696. Throughput: 0: 5710.1. Samples: 693288314. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:19,761][25689] Avg episode reward: [(0, '-5.460')] [2022-07-10 10:05:20,235][26022] Updated weights on worker 0-0, policy_version 677033 (0.00096) [2022-07-10 10:05:22,027][26022] Updated weights on worker 0-0, policy_version 677043 (0.00090) [2022-07-10 10:05:23,952][26022] Updated weights on worker 0-0, policy_version 677053 (0.00090) [2022-07-10 10:05:24,790][25689] Fps is (10 sec: 5677.3, 60 sec: 5517.0, 300 sec: 5530.2). Total num frames: 693306368. Throughput: 0: 5746.8. Samples: 693305188. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:24,790][25689] Avg episode reward: [(0, '-3.870')] [2022-07-10 10:05:25,616][26022] Updated weights on worker 0-0, policy_version 677063 (0.00090) [2022-07-10 10:05:27,490][26022] Updated weights on worker 0-0, policy_version 677073 (0.00099) [2022-07-10 10:05:29,347][26022] Updated weights on worker 0-0, policy_version 677083 (0.00086) [2022-07-10 10:05:29,799][25689] Fps is (10 sec: 5711.9, 60 sec: 5517.7, 300 sec: 5537.3). Total num frames: 693335040. Throughput: 0: 5848.7. Samples: 693339128. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:29,800][25689] Avg episode reward: [(0, '-2.487')] [2022-07-10 10:05:31,186][26022] Updated weights on worker 0-0, policy_version 677093 (0.00085) [2022-07-10 10:05:33,021][26022] Updated weights on worker 0-0, policy_version 677103 (0.00083) [2022-07-10 10:05:34,806][25689] Fps is (10 sec: 5622.7, 60 sec: 5536.2, 300 sec: 5532.1). Total num frames: 693362688. Throughput: 0: 5856.0. Samples: 693372820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:34,808][25689] Avg episode reward: [(0, '-2.985')] [2022-07-10 10:05:34,859][26022] Updated weights on worker 0-0, policy_version 677113 (0.00086) [2022-07-10 10:05:36,512][26022] Updated weights on worker 0-0, policy_version 677123 (0.00088) [2022-07-10 10:05:38,597][26022] Updated weights on worker 0-0, policy_version 677133 (0.00093) [2022-07-10 10:05:39,953][25689] Fps is (10 sec: 5546.6, 60 sec: 5532.5, 300 sec: 5533.1). Total num frames: 693391360. Throughput: 0: 4997.8. Samples: 693389236. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:39,954][25689] Avg episode reward: [(0, '-4.022')] [2022-07-10 10:05:40,447][26022] Updated weights on worker 0-0, policy_version 677143 (0.00094) [2022-07-10 10:05:42,391][26022] Updated weights on worker 0-0, policy_version 677153 (0.00088) [2022-07-10 10:05:44,139][26022] Updated weights on worker 0-0, policy_version 677163 (0.00060) [2022-07-10 10:05:44,975][25689] Fps is (10 sec: 5438.1, 60 sec: 5519.9, 300 sec: 5530.0). Total num frames: 693417984. Throughput: 0: 5792.2. Samples: 693422102. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:44,975][25689] Avg episode reward: [(0, '-4.242')] [2022-07-10 10:05:46,195][26022] Updated weights on worker 0-0, policy_version 677173 (0.00096) [2022-07-10 10:05:47,968][26022] Updated weights on worker 0-0, policy_version 677183 (0.00095) [2022-07-10 10:05:49,899][26022] Updated weights on worker 0-0, policy_version 677193 (0.00087) [2022-07-10 10:05:50,011][25689] Fps is (10 sec: 5396.4, 60 sec: 5501.3, 300 sec: 5526.0). Total num frames: 693445632. Throughput: 0: 5715.6. Samples: 693454646. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:50,011][25689] Avg episode reward: [(0, '-3.679')] [2022-07-10 10:05:51,615][26022] Updated weights on worker 0-0, policy_version 677203 (0.00083) [2022-07-10 10:05:53,499][26022] Updated weights on worker 0-0, policy_version 677213 (0.00081) [2022-07-10 10:05:55,089][25689] Fps is (10 sec: 5568.1, 60 sec: 5512.9, 300 sec: 5530.5). Total num frames: 693474304. Throughput: 0: 4862.9. Samples: 693471454. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:05:55,090][25689] Avg episode reward: [(0, '-3.880')] [2022-07-10 10:05:55,328][26022] Updated weights on worker 0-0, policy_version 677223 (0.00086) [2022-07-10 10:05:57,170][26022] Updated weights on worker 0-0, policy_version 677233 (0.00079) [2022-07-10 10:05:58,971][26022] Updated weights on worker 0-0, policy_version 677243 (0.00091) [2022-07-10 10:06:00,180][25689] Fps is (10 sec: 5638.9, 60 sec: 5529.5, 300 sec: 5539.6). Total num frames: 693502976. Throughput: 0: 5743.4. Samples: 693505404. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:00,180][25689] Avg episode reward: [(0, '-4.500')] [2022-07-10 10:06:00,732][26022] Updated weights on worker 0-0, policy_version 677253 (0.00087) [2022-07-10 10:06:03,205][26022] Updated weights on worker 0-0, policy_version 677263 (0.00085) [2022-07-10 10:06:04,891][26022] Updated weights on worker 0-0, policy_version 677273 (0.00091) [2022-07-10 10:06:05,219][25689] Fps is (10 sec: 5357.9, 60 sec: 5511.0, 300 sec: 5529.4). Total num frames: 693528576. Throughput: 0: 5668.4. Samples: 693536854. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:05,219][25689] Avg episode reward: [(0, '-3.430')] [2022-07-10 10:06:06,693][26022] Updated weights on worker 0-0, policy_version 677283 (0.00084) [2022-07-10 10:06:08,679][26022] Updated weights on worker 0-0, policy_version 677293 (0.00084) [2022-07-10 10:06:10,233][25689] Fps is (10 sec: 5398.8, 60 sec: 5511.2, 300 sec: 5529.6). Total num frames: 693557248. Throughput: 0: 4895.5. Samples: 693553642. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:10,233][25689] Avg episode reward: [(0, '-3.929')] [2022-07-10 10:06:10,294][26022] Updated weights on worker 0-0, policy_version 677303 (0.00093) [2022-07-10 10:06:12,506][26022] Updated weights on worker 0-0, policy_version 677313 (0.00094) [2022-07-10 10:06:13,923][26022] Updated weights on worker 0-0, policy_version 677323 (0.00086) [2022-07-10 10:06:15,248][25689] Fps is (10 sec: 5513.5, 60 sec: 5529.3, 300 sec: 5531.4). Total num frames: 693583872. Throughput: 0: 5737.3. Samples: 693587110. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:15,249][25689] Avg episode reward: [(0, '-5.255')] [2022-07-10 10:06:15,977][26022] Updated weights on worker 0-0, policy_version 677333 (0.00085) [2022-07-10 10:06:17,839][26022] Updated weights on worker 0-0, policy_version 677343 (0.00095) [2022-07-10 10:06:19,562][26022] Updated weights on worker 0-0, policy_version 677353 (0.00087) [2022-07-10 10:06:20,356][25689] Fps is (10 sec: 5563.3, 60 sec: 5542.8, 300 sec: 5530.6). Total num frames: 693613568. Throughput: 0: 5703.9. Samples: 693620486. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:20,357][25689] Avg episode reward: [(0, '-5.516')] [2022-07-10 10:06:21,711][26022] Updated weights on worker 0-0, policy_version 677363 (0.00080) [2022-07-10 10:06:22,363][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:06:22,375][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000677368_693624832.pth [2022-07-10 10:06:22,376][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000675420_691630080.pth [2022-07-10 10:06:23,342][26022] Updated weights on worker 0-0, policy_version 677373 (0.00092) [2022-07-10 10:06:25,166][26022] Updated weights on worker 0-0, policy_version 677383 (0.00093) [2022-07-10 10:06:25,378][25689] Fps is (10 sec: 5660.8, 60 sec: 5526.6, 300 sec: 5534.0). Total num frames: 693641216. Throughput: 0: 4981.8. Samples: 693637284. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:25,379][25689] Avg episode reward: [(0, '-5.663')] [2022-07-10 10:06:27,018][26022] Updated weights on worker 0-0, policy_version 677393 (0.00090) [2022-07-10 10:06:28,751][26022] Updated weights on worker 0-0, policy_version 677403 (0.00096) [2022-07-10 10:06:30,392][25689] Fps is (10 sec: 5407.7, 60 sec: 5492.4, 300 sec: 5532.1). Total num frames: 693667840. Throughput: 0: 5809.9. Samples: 693670768. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:30,393][25689] Avg episode reward: [(0, '-6.314')] [2022-07-10 10:06:30,717][26022] Updated weights on worker 0-0, policy_version 677413 (0.00095) [2022-07-10 10:06:32,425][26022] Updated weights on worker 0-0, policy_version 677423 (0.00081) [2022-07-10 10:06:34,377][26022] Updated weights on worker 0-0, policy_version 677433 (0.00092) [2022-07-10 10:06:35,419][25689] Fps is (10 sec: 5609.1, 60 sec: 5524.4, 300 sec: 5533.7). Total num frames: 693697536. Throughput: 0: 5809.8. Samples: 693704298. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:35,419][25689] Avg episode reward: [(0, '-5.213')] [2022-07-10 10:06:36,274][26022] Updated weights on worker 0-0, policy_version 677443 (0.00088) [2022-07-10 10:06:37,894][26022] Updated weights on worker 0-0, policy_version 677453 (0.00090) [2022-07-10 10:06:39,943][26022] Updated weights on worker 0-0, policy_version 677463 (0.00088) [2022-07-10 10:06:40,527][25689] Fps is (10 sec: 5759.0, 60 sec: 5527.9, 300 sec: 5542.2). Total num frames: 693726208. Throughput: 0: 5000.0. Samples: 693721342. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:40,529][25689] Avg episode reward: [(0, '-4.577')] [2022-07-10 10:06:41,695][26022] Updated weights on worker 0-0, policy_version 677473 (0.00092) [2022-07-10 10:06:43,479][26022] Updated weights on worker 0-0, policy_version 677483 (0.00087) [2022-07-10 10:06:45,314][26022] Updated weights on worker 0-0, policy_version 677493 (0.00090) [2022-07-10 10:06:45,606][25689] Fps is (10 sec: 5528.5, 60 sec: 5539.6, 300 sec: 5533.9). Total num frames: 693753856. Throughput: 0: 5828.4. Samples: 693755182. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:45,608][25689] Avg episode reward: [(0, '-3.727')] [2022-07-10 10:06:47,055][26022] Updated weights on worker 0-0, policy_version 677503 (0.00084) [2022-07-10 10:06:48,916][26022] Updated weights on worker 0-0, policy_version 677513 (0.00084) [2022-07-10 10:06:50,678][25689] Fps is (10 sec: 5649.3, 60 sec: 5570.0, 300 sec: 5533.9). Total num frames: 693783552. Throughput: 0: 5830.5. Samples: 693789044. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:50,679][25689] Avg episode reward: [(0, '-3.722')] [2022-07-10 10:06:50,680][26022] Updated weights on worker 0-0, policy_version 677523 (0.00094) [2022-07-10 10:06:52,529][26022] Updated weights on worker 0-0, policy_version 677533 (0.00100) [2022-07-10 10:06:54,392][26022] Updated weights on worker 0-0, policy_version 677543 (0.00087) [2022-07-10 10:06:55,729][25689] Fps is (10 sec: 5664.8, 60 sec: 5555.7, 300 sec: 5540.9). Total num frames: 693811200. Throughput: 0: 4997.0. Samples: 693805788. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:06:55,730][25689] Avg episode reward: [(0, '-2.921')] [2022-07-10 10:06:56,332][26022] Updated weights on worker 0-0, policy_version 677553 (0.00097) [2022-07-10 10:06:58,008][26022] Updated weights on worker 0-0, policy_version 677563 (0.00095) [2022-07-10 10:06:59,928][26022] Updated weights on worker 0-0, policy_version 677573 (0.00090) [2022-07-10 10:07:00,836][25689] Fps is (10 sec: 5544.2, 60 sec: 5554.2, 300 sec: 5550.0). Total num frames: 693839872. Throughput: 0: 5812.1. Samples: 693839380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:00,837][25689] Avg episode reward: [(0, '-3.107')] [2022-07-10 10:07:02,120][26022] Updated weights on worker 0-0, policy_version 677583 (0.00085) [2022-07-10 10:07:03,883][26022] Updated weights on worker 0-0, policy_version 677593 (0.00096) [2022-07-10 10:07:05,843][25689] Fps is (10 sec: 5264.8, 60 sec: 5540.2, 300 sec: 5532.9). Total num frames: 693864448. Throughput: 0: 5710.0. Samples: 693870734. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:05,843][25689] Avg episode reward: [(0, '-3.470')] [2022-07-10 10:07:05,926][26022] Updated weights on worker 0-0, policy_version 677603 (0.00083) [2022-07-10 10:07:07,471][26022] Updated weights on worker 0-0, policy_version 677613 (0.00057) [2022-07-10 10:07:09,419][26022] Updated weights on worker 0-0, policy_version 677623 (0.00086) [2022-07-10 10:07:10,859][25689] Fps is (10 sec: 5415.1, 60 sec: 5556.9, 300 sec: 5539.6). Total num frames: 693894144. Throughput: 0: 4886.7. Samples: 693887662. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:10,859][25689] Avg episode reward: [(0, '-2.262')] [2022-07-10 10:07:11,131][26022] Updated weights on worker 0-0, policy_version 677633 (0.00079) [2022-07-10 10:07:12,946][26022] Updated weights on worker 0-0, policy_version 677643 (0.00084) [2022-07-10 10:07:14,972][26022] Updated weights on worker 0-0, policy_version 677653 (0.00083) [2022-07-10 10:07:15,872][25689] Fps is (10 sec: 5819.8, 60 sec: 5590.9, 300 sec: 5541.4). Total num frames: 693922816. Throughput: 0: 5745.3. Samples: 693921516. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:15,873][25689] Avg episode reward: [(0, '-2.197')] [2022-07-10 10:07:16,772][26022] Updated weights on worker 0-0, policy_version 677663 (0.00089) [2022-07-10 10:07:18,690][26022] Updated weights on worker 0-0, policy_version 677673 (0.00089) [2022-07-10 10:07:20,491][26022] Updated weights on worker 0-0, policy_version 677683 (0.00096) [2022-07-10 10:07:20,920][25689] Fps is (10 sec: 5597.5, 60 sec: 5562.6, 300 sec: 5537.5). Total num frames: 693950464. Throughput: 0: 5758.6. Samples: 693955036. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:20,921][25689] Avg episode reward: [(0, '-2.623')] [2022-07-10 10:07:22,209][26022] Updated weights on worker 0-0, policy_version 677693 (0.00079) [2022-07-10 10:07:24,121][26022] Updated weights on worker 0-0, policy_version 677703 (0.00093) [2022-07-10 10:07:25,753][26022] Updated weights on worker 0-0, policy_version 677713 (0.00104) [2022-07-10 10:07:25,934][25689] Fps is (10 sec: 5495.8, 60 sec: 5563.4, 300 sec: 5537.5). Total num frames: 693978112. Throughput: 0: 5038.1. Samples: 693971952. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:25,934][25689] Avg episode reward: [(0, '-2.502')] [2022-07-10 10:07:27,867][26022] Updated weights on worker 0-0, policy_version 677723 (0.00090) [2022-07-10 10:07:29,419][26022] Updated weights on worker 0-0, policy_version 677733 (0.00292) [2022-07-10 10:07:30,941][25689] Fps is (10 sec: 5620.2, 60 sec: 5597.8, 300 sec: 5545.2). Total num frames: 694006784. Throughput: 0: 5871.0. Samples: 694005566. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:30,942][25689] Avg episode reward: [(0, '-1.917')] [2022-07-10 10:07:31,511][26022] Updated weights on worker 0-0, policy_version 677743 (0.00086) [2022-07-10 10:07:33,087][26022] Updated weights on worker 0-0, policy_version 677753 (0.00086) [2022-07-10 10:07:35,285][26022] Updated weights on worker 0-0, policy_version 677763 (0.00084) [2022-07-10 10:07:35,972][25689] Fps is (10 sec: 5610.4, 60 sec: 5563.6, 300 sec: 5539.0). Total num frames: 694034432. Throughput: 0: 5857.0. Samples: 694039242. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:35,973][25689] Avg episode reward: [(0, '-1.467')] [2022-07-10 10:07:36,775][26022] Updated weights on worker 0-0, policy_version 677773 (0.00079) [2022-07-10 10:07:38,710][26022] Updated weights on worker 0-0, policy_version 677783 (0.00096) [2022-07-10 10:07:40,377][26022] Updated weights on worker 0-0, policy_version 677793 (0.00088) [2022-07-10 10:07:41,028][25689] Fps is (10 sec: 5583.6, 60 sec: 5568.5, 300 sec: 5538.3). Total num frames: 694063104. Throughput: 0: 5021.7. Samples: 694056010. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:41,029][25689] Avg episode reward: [(0, '-2.228')] [2022-07-10 10:07:42,427][26022] Updated weights on worker 0-0, policy_version 677803 (0.00091) [2022-07-10 10:07:44,204][26022] Updated weights on worker 0-0, policy_version 677813 (0.00083) [2022-07-10 10:07:45,980][26022] Updated weights on worker 0-0, policy_version 677823 (0.00103) [2022-07-10 10:07:46,034][25689] Fps is (10 sec: 5597.4, 60 sec: 5575.2, 300 sec: 5538.7). Total num frames: 694090752. Throughput: 0: 5842.5. Samples: 694089388. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:46,034][25689] Avg episode reward: [(0, '-3.502')] [2022-07-10 10:07:48,001][26022] Updated weights on worker 0-0, policy_version 677833 (0.00085) [2022-07-10 10:07:49,743][26022] Updated weights on worker 0-0, policy_version 677843 (0.00095) [2022-07-10 10:07:51,047][25689] Fps is (10 sec: 5519.3, 60 sec: 5546.7, 300 sec: 5539.2). Total num frames: 694118400. Throughput: 0: 5841.0. Samples: 694123002. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:51,048][25689] Avg episode reward: [(0, '-4.390')] [2022-07-10 10:07:51,537][26022] Updated weights on worker 0-0, policy_version 677853 (0.00085) [2022-07-10 10:07:53,404][26022] Updated weights on worker 0-0, policy_version 677863 (0.00089) [2022-07-10 10:07:55,170][26022] Updated weights on worker 0-0, policy_version 677873 (0.00107) [2022-07-10 10:07:56,055][25689] Fps is (10 sec: 5620.2, 60 sec: 5567.6, 300 sec: 5541.5). Total num frames: 694147072. Throughput: 0: 5007.2. Samples: 694139800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:07:56,055][25689] Avg episode reward: [(0, '-4.852')] [2022-07-10 10:07:57,222][26022] Updated weights on worker 0-0, policy_version 677883 (0.00081) [2022-07-10 10:07:58,682][26022] Updated weights on worker 0-0, policy_version 677893 (0.00081) [2022-07-10 10:08:00,810][26022] Updated weights on worker 0-0, policy_version 677903 (0.00090) [2022-07-10 10:08:01,123][25689] Fps is (10 sec: 5487.9, 60 sec: 5537.3, 300 sec: 5541.5). Total num frames: 694173696. Throughput: 0: 5844.9. Samples: 694173462. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:08:01,124][25689] Avg episode reward: [(0, '-5.997')] [2022-07-10 10:08:02,974][26022] Updated weights on worker 0-0, policy_version 677913 (0.00092) [2022-07-10 10:08:04,831][26022] Updated weights on worker 0-0, policy_version 677923 (0.00095) [2022-07-10 10:08:06,147][25689] Fps is (10 sec: 5276.5, 60 sec: 5569.7, 300 sec: 5538.0). Total num frames: 694200320. Throughput: 0: 5730.7. Samples: 694204646. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:08:06,147][25689] Avg episode reward: [(0, '-5.904')] [2022-07-10 10:08:06,667][26022] Updated weights on worker 0-0, policy_version 677933 (0.00083) [2022-07-10 10:08:08,245][26022] Updated weights on worker 0-0, policy_version 677943 (0.00087) [2022-07-10 10:08:10,278][26022] Updated weights on worker 0-0, policy_version 677953 (0.00096) [2022-07-10 10:08:11,160][25689] Fps is (10 sec: 5508.8, 60 sec: 5552.9, 300 sec: 5541.5). Total num frames: 694228992. Throughput: 0: 4897.1. Samples: 694221500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:08:11,162][25689] Avg episode reward: [(0, '-6.285')] [2022-07-10 10:08:11,987][26022] Updated weights on worker 0-0, policy_version 677963 (0.00084) [2022-07-10 10:08:13,856][26022] Updated weights on worker 0-0, policy_version 677973 (0.00100) [2022-07-10 10:08:15,823][26022] Updated weights on worker 0-0, policy_version 677983 (0.00092) [2022-07-10 10:08:16,167][25689] Fps is (10 sec: 5518.5, 60 sec: 5519.6, 300 sec: 5529.6). Total num frames: 694255616. Throughput: 0: 5742.8. Samples: 694255298. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:08:16,168][25689] Avg episode reward: [(0, '-6.147')] [2022-07-10 10:08:17,424][26022] Updated weights on worker 0-0, policy_version 677993 (0.00083) [2022-07-10 10:08:19,288][26022] Updated weights on worker 0-0, policy_version 678003 (0.00091) [2022-07-10 10:08:20,801][26022] Updated weights on worker 0-0, policy_version 678013 (0.00088) [2022-07-10 10:08:21,229][25689] Fps is (10 sec: 5593.5, 60 sec: 5552.2, 300 sec: 5539.4). Total num frames: 694285312. Throughput: 0: 5768.3. Samples: 694289442. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 10:08:21,230][25689] Avg episode reward: [(0, '-6.407')] [2022-07-10 10:08:22,460][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:08:22,476][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000678021_694293504.pth [2022-07-10 10:08:22,479][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000676069_692294656.pth [2022-07-10 10:08:23,019][26022] Updated weights on worker 0-0, policy_version 678023 (0.00084) [2022-07-10 10:08:24,767][26022] Updated weights on worker 0-0, policy_version 678033 (0.00089) [2022-07-10 10:08:26,273][25689] Fps is (10 sec: 5775.3, 60 sec: 5566.4, 300 sec: 5542.2). Total num frames: 694313984. Throughput: 0: 5881.5. Samples: 694323020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:26,274][25689] Avg episode reward: [(0, '-7.028')] [2022-07-10 10:08:26,483][26022] Updated weights on worker 0-0, policy_version 678043 (0.00099) [2022-07-10 10:08:28,367][26022] Updated weights on worker 0-0, policy_version 678053 (0.00082) [2022-07-10 10:08:30,103][26022] Updated weights on worker 0-0, policy_version 678063 (0.00087) [2022-07-10 10:08:31,282][25689] Fps is (10 sec: 5602.1, 60 sec: 5549.2, 300 sec: 5542.1). Total num frames: 694341632. Throughput: 0: 5888.6. Samples: 694339992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:31,283][25689] Avg episode reward: [(0, '-7.305')] [2022-07-10 10:08:32,150][26022] Updated weights on worker 0-0, policy_version 678073 (0.00094) [2022-07-10 10:08:33,847][26022] Updated weights on worker 0-0, policy_version 678083 (0.00109) [2022-07-10 10:08:35,734][26022] Updated weights on worker 0-0, policy_version 678093 (0.00086) [2022-07-10 10:08:36,371][25689] Fps is (10 sec: 5577.6, 60 sec: 5560.9, 300 sec: 5542.3). Total num frames: 694370304. Throughput: 0: 5864.1. Samples: 694373776. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:36,371][25689] Avg episode reward: [(0, '-7.227')] [2022-07-10 10:08:37,469][26022] Updated weights on worker 0-0, policy_version 678103 (0.00058) [2022-07-10 10:08:39,375][26022] Updated weights on worker 0-0, policy_version 678113 (0.00088) [2022-07-10 10:08:41,293][26022] Updated weights on worker 0-0, policy_version 678123 (0.00086) [2022-07-10 10:08:41,486][25689] Fps is (10 sec: 5519.5, 60 sec: 5538.5, 300 sec: 5540.4). Total num frames: 694397952. Throughput: 0: 5816.2. Samples: 694407262. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:41,487][25689] Avg episode reward: [(0, '-5.755')] [2022-07-10 10:08:43,078][26022] Updated weights on worker 0-0, policy_version 678133 (0.00096) [2022-07-10 10:08:45,001][26022] Updated weights on worker 0-0, policy_version 678143 (0.00108) [2022-07-10 10:08:46,557][25689] Fps is (10 sec: 5629.5, 60 sec: 5566.4, 300 sec: 5542.9). Total num frames: 694427648. Throughput: 0: 4991.1. Samples: 694424258. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:46,558][25689] Avg episode reward: [(0, '-5.396')] [2022-07-10 10:08:46,632][26022] Updated weights on worker 0-0, policy_version 678153 (0.00082) [2022-07-10 10:08:48,550][26022] Updated weights on worker 0-0, policy_version 678163 (0.00086) [2022-07-10 10:08:50,357][26022] Updated weights on worker 0-0, policy_version 678173 (0.00090) [2022-07-10 10:08:51,559][25689] Fps is (10 sec: 5692.8, 60 sec: 5567.4, 300 sec: 5543.0). Total num frames: 694455296. Throughput: 0: 5813.0. Samples: 694457862. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:51,560][25689] Avg episode reward: [(0, '-4.404')] [2022-07-10 10:08:52,339][26022] Updated weights on worker 0-0, policy_version 678183 (0.00087) [2022-07-10 10:08:54,058][26022] Updated weights on worker 0-0, policy_version 678193 (0.00091) [2022-07-10 10:08:55,951][26022] Updated weights on worker 0-0, policy_version 678203 (0.00104) [2022-07-10 10:08:56,572][25689] Fps is (10 sec: 5623.3, 60 sec: 5566.9, 300 sec: 5546.8). Total num frames: 694483968. Throughput: 0: 5811.0. Samples: 694491168. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:08:56,573][25689] Avg episode reward: [(0, '-3.324')] [2022-07-10 10:08:57,766][26022] Updated weights on worker 0-0, policy_version 678213 (0.00089) [2022-07-10 10:08:59,366][26022] Updated weights on worker 0-0, policy_version 678223 (0.00088) [2022-07-10 10:09:01,522][26022] Updated weights on worker 0-0, policy_version 678233 (0.00083) [2022-07-10 10:09:01,698][25689] Fps is (10 sec: 5554.9, 60 sec: 5578.5, 300 sec: 5555.1). Total num frames: 694511616. Throughput: 0: 4987.2. Samples: 694508062. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:01,699][25689] Avg episode reward: [(0, '-3.018')] [2022-07-10 10:09:03,554][26022] Updated weights on worker 0-0, policy_version 678243 (0.00091) [2022-07-10 10:09:05,543][26022] Updated weights on worker 0-0, policy_version 678253 (0.00094) [2022-07-10 10:09:06,731][25689] Fps is (10 sec: 5241.6, 60 sec: 5560.7, 300 sec: 5540.8). Total num frames: 694537216. Throughput: 0: 5701.0. Samples: 694539270. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:06,732][25689] Avg episode reward: [(0, '-3.271')] [2022-07-10 10:09:07,383][26022] Updated weights on worker 0-0, policy_version 678263 (0.00097) [2022-07-10 10:09:09,291][26022] Updated weights on worker 0-0, policy_version 678273 (0.00094) [2022-07-10 10:09:11,173][26022] Updated weights on worker 0-0, policy_version 678283 (0.00087) [2022-07-10 10:09:11,770][25689] Fps is (10 sec: 5286.8, 60 sec: 5541.6, 300 sec: 5540.3). Total num frames: 694564864. Throughput: 0: 5677.2. Samples: 694572602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:11,770][25689] Avg episode reward: [(0, '-3.721')] [2022-07-10 10:09:13,038][26022] Updated weights on worker 0-0, policy_version 678293 (0.00090) [2022-07-10 10:09:14,605][26022] Updated weights on worker 0-0, policy_version 678303 (0.00080) [2022-07-10 10:09:16,632][26022] Updated weights on worker 0-0, policy_version 678313 (0.00084) [2022-07-10 10:09:16,809][25689] Fps is (10 sec: 5486.9, 60 sec: 5555.5, 300 sec: 5540.8). Total num frames: 694592512. Throughput: 0: 4842.8. Samples: 694589170. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:16,810][25689] Avg episode reward: [(0, '-3.713')] [2022-07-10 10:09:18,274][26022] Updated weights on worker 0-0, policy_version 678323 (0.00078) [2022-07-10 10:09:20,231][26022] Updated weights on worker 0-0, policy_version 678333 (0.00095) [2022-07-10 10:09:21,871][25689] Fps is (10 sec: 5676.8, 60 sec: 5555.5, 300 sec: 5543.4). Total num frames: 694622208. Throughput: 0: 5675.3. Samples: 694622552. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:21,872][25689] Avg episode reward: [(0, '-3.943')] [2022-07-10 10:09:21,924][26022] Updated weights on worker 0-0, policy_version 678343 (0.00091) [2022-07-10 10:09:23,975][26022] Updated weights on worker 0-0, policy_version 678353 (0.00105) [2022-07-10 10:09:25,740][26022] Updated weights on worker 0-0, policy_version 678363 (0.00095) [2022-07-10 10:09:26,893][25689] Fps is (10 sec: 5585.4, 60 sec: 5523.8, 300 sec: 5536.5). Total num frames: 694648832. Throughput: 0: 5797.4. Samples: 694656154. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:26,893][25689] Avg episode reward: [(0, '-3.138')] [2022-07-10 10:09:27,441][26022] Updated weights on worker 0-0, policy_version 678373 (0.00086) [2022-07-10 10:09:29,334][26022] Updated weights on worker 0-0, policy_version 678383 (0.00085) [2022-07-10 10:09:31,255][26022] Updated weights on worker 0-0, policy_version 678393 (0.00078) [2022-07-10 10:09:31,902][25689] Fps is (10 sec: 5512.9, 60 sec: 5540.7, 300 sec: 5543.6). Total num frames: 694677504. Throughput: 0: 4990.5. Samples: 694673070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:31,902][25689] Avg episode reward: [(0, '-1.723')] [2022-07-10 10:09:33,239][26022] Updated weights on worker 0-0, policy_version 678403 (0.00101) [2022-07-10 10:09:34,971][26022] Updated weights on worker 0-0, policy_version 678413 (0.00086) [2022-07-10 10:09:36,775][26022] Updated weights on worker 0-0, policy_version 678423 (0.00081) [2022-07-10 10:09:36,924][25689] Fps is (10 sec: 5716.2, 60 sec: 5546.7, 300 sec: 5545.2). Total num frames: 694706176. Throughput: 0: 5840.1. Samples: 694706646. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:36,925][25689] Avg episode reward: [(0, '-1.220')] [2022-07-10 10:09:38,566][26022] Updated weights on worker 0-0, policy_version 678433 (0.00088) [2022-07-10 10:09:40,310][26022] Updated weights on worker 0-0, policy_version 678443 (0.00088) [2022-07-10 10:09:41,973][25689] Fps is (10 sec: 5490.4, 60 sec: 5535.9, 300 sec: 5542.1). Total num frames: 694732800. Throughput: 0: 5849.1. Samples: 694740128. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:41,974][25689] Avg episode reward: [(0, '-2.274')] [2022-07-10 10:09:42,118][26022] Updated weights on worker 0-0, policy_version 678453 (0.00089) [2022-07-10 10:09:44,032][26022] Updated weights on worker 0-0, policy_version 678463 (0.00090) [2022-07-10 10:09:45,944][26022] Updated weights on worker 0-0, policy_version 678473 (0.00090) [2022-07-10 10:09:46,977][25689] Fps is (10 sec: 5602.5, 60 sec: 5542.0, 300 sec: 5545.8). Total num frames: 694762496. Throughput: 0: 5018.8. Samples: 694756954. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:46,978][25689] Avg episode reward: [(0, '-3.355')] [2022-07-10 10:09:47,836][26022] Updated weights on worker 0-0, policy_version 678483 (0.00086) [2022-07-10 10:09:49,655][26022] Updated weights on worker 0-0, policy_version 678493 (0.00099) [2022-07-10 10:09:51,527][26022] Updated weights on worker 0-0, policy_version 678503 (0.00087) [2022-07-10 10:09:51,992][25689] Fps is (10 sec: 5723.4, 60 sec: 5540.8, 300 sec: 5545.9). Total num frames: 694790144. Throughput: 0: 5837.5. Samples: 694790348. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:51,993][25689] Avg episode reward: [(0, '-3.769')] [2022-07-10 10:09:53,307][26022] Updated weights on worker 0-0, policy_version 678513 (0.00090) [2022-07-10 10:09:55,110][26022] Updated weights on worker 0-0, policy_version 678523 (0.00096) [2022-07-10 10:09:56,977][26022] Updated weights on worker 0-0, policy_version 678533 (0.00089) [2022-07-10 10:09:57,008][25689] Fps is (10 sec: 5512.8, 60 sec: 5523.7, 300 sec: 5547.2). Total num frames: 694817792. Throughput: 0: 5841.5. Samples: 694823962. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:09:57,008][25689] Avg episode reward: [(0, '-3.685')] [2022-07-10 10:09:58,731][26022] Updated weights on worker 0-0, policy_version 678543 (0.00082) [2022-07-10 10:10:00,618][26022] Updated weights on worker 0-0, policy_version 678553 (0.00088) [2022-07-10 10:10:02,103][25689] Fps is (10 sec: 5468.9, 60 sec: 5526.4, 300 sec: 5549.3). Total num frames: 694845440. Throughput: 0: 4991.4. Samples: 694840606. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:02,107][25689] Avg episode reward: [(0, '-4.377')] [2022-07-10 10:10:02,927][26022] Updated weights on worker 0-0, policy_version 678563 (0.00088) [2022-07-10 10:10:04,675][26022] Updated weights on worker 0-0, policy_version 678573 (0.00096) [2022-07-10 10:10:06,557][26022] Updated weights on worker 0-0, policy_version 678583 (0.00086) [2022-07-10 10:10:07,157][25689] Fps is (10 sec: 5246.4, 60 sec: 5524.5, 300 sec: 5538.3). Total num frames: 694871040. Throughput: 0: 5703.5. Samples: 694872050. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:07,158][25689] Avg episode reward: [(0, '-3.025')] [2022-07-10 10:10:08,299][26022] Updated weights on worker 0-0, policy_version 678593 (0.00086) [2022-07-10 10:10:10,316][26022] Updated weights on worker 0-0, policy_version 678603 (0.00097) [2022-07-10 10:10:12,035][26022] Updated weights on worker 0-0, policy_version 678613 (0.00094) [2022-07-10 10:10:12,173][25689] Fps is (10 sec: 5389.3, 60 sec: 5543.5, 300 sec: 5548.8). Total num frames: 694899712. Throughput: 0: 5700.7. Samples: 694905394. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:12,174][25689] Avg episode reward: [(0, '-1.247')] [2022-07-10 10:10:14,032][26022] Updated weights on worker 0-0, policy_version 678623 (0.00086) [2022-07-10 10:10:15,785][26022] Updated weights on worker 0-0, policy_version 678633 (0.00081) [2022-07-10 10:10:17,208][25689] Fps is (10 sec: 5705.4, 60 sec: 5560.9, 300 sec: 5549.5). Total num frames: 694928384. Throughput: 0: 4870.3. Samples: 694922344. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:17,210][25689] Avg episode reward: [(0, '-1.362')] [2022-07-10 10:10:17,377][26022] Updated weights on worker 0-0, policy_version 678643 (0.00086) [2022-07-10 10:10:19,416][26022] Updated weights on worker 0-0, policy_version 678653 (0.00090) [2022-07-10 10:10:21,100][26022] Updated weights on worker 0-0, policy_version 678663 (0.00092) [2022-07-10 10:10:22,261][25689] Fps is (10 sec: 5684.4, 60 sec: 5544.8, 300 sec: 5549.1). Total num frames: 694957056. Throughput: 0: 5720.4. Samples: 694955920. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:22,262][25689] Avg episode reward: [(0, '-1.700')] [2022-07-10 10:10:22,518][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:10:22,530][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000678670_694958080.pth [2022-07-10 10:10:22,531][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000676720_692961280.pth [2022-07-10 10:10:23,003][26022] Updated weights on worker 0-0, policy_version 678673 (0.00096) [2022-07-10 10:10:24,899][26022] Updated weights on worker 0-0, policy_version 678683 (0.00087) [2022-07-10 10:10:26,557][26022] Updated weights on worker 0-0, policy_version 678693 (0.00213) [2022-07-10 10:10:27,331][25689] Fps is (10 sec: 5462.4, 60 sec: 5540.3, 300 sec: 5541.1). Total num frames: 694983680. Throughput: 0: 5819.7. Samples: 694989456. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:27,332][25689] Avg episode reward: [(0, '-2.229')] [2022-07-10 10:10:28,691][26022] Updated weights on worker 0-0, policy_version 678703 (0.00089) [2022-07-10 10:10:30,451][26022] Updated weights on worker 0-0, policy_version 678713 (0.00094) [2022-07-10 10:10:32,134][26022] Updated weights on worker 0-0, policy_version 678723 (0.00081) [2022-07-10 10:10:32,374][25689] Fps is (10 sec: 5569.5, 60 sec: 5554.2, 300 sec: 5547.3). Total num frames: 695013376. Throughput: 0: 5000.1. Samples: 695006398. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:32,374][25689] Avg episode reward: [(0, '-4.954')] [2022-07-10 10:10:34,136][26022] Updated weights on worker 0-0, policy_version 678733 (0.00092) [2022-07-10 10:10:35,754][26022] Updated weights on worker 0-0, policy_version 678743 (0.00849) [2022-07-10 10:10:37,449][25689] Fps is (10 sec: 5667.4, 60 sec: 5532.4, 300 sec: 5545.2). Total num frames: 695041024. Throughput: 0: 5820.7. Samples: 695040162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:37,450][25689] Avg episode reward: [(0, '-4.795')] [2022-07-10 10:10:37,655][26022] Updated weights on worker 0-0, policy_version 678753 (0.00082) [2022-07-10 10:10:39,594][26022] Updated weights on worker 0-0, policy_version 678763 (0.00092) [2022-07-10 10:10:41,263][26022] Updated weights on worker 0-0, policy_version 678773 (0.00087) [2022-07-10 10:10:42,589][25689] Fps is (10 sec: 5513.5, 60 sec: 5557.9, 300 sec: 5549.8). Total num frames: 695069696. Throughput: 0: 5786.6. Samples: 695073546. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:42,589][25689] Avg episode reward: [(0, '-5.931')] [2022-07-10 10:10:43,347][26022] Updated weights on worker 0-0, policy_version 678783 (0.00823) [2022-07-10 10:10:44,842][26022] Updated weights on worker 0-0, policy_version 678793 (0.00086) [2022-07-10 10:10:46,731][26022] Updated weights on worker 0-0, policy_version 678803 (0.00092) [2022-07-10 10:10:47,661][25689] Fps is (10 sec: 5615.4, 60 sec: 5534.8, 300 sec: 5552.6). Total num frames: 695098368. Throughput: 0: 5793.8. Samples: 695107244. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:47,661][25689] Avg episode reward: [(0, '-6.018')] [2022-07-10 10:10:48,680][26022] Updated weights on worker 0-0, policy_version 678813 (0.00091) [2022-07-10 10:10:50,506][26022] Updated weights on worker 0-0, policy_version 678823 (0.00092) [2022-07-10 10:10:52,254][26022] Updated weights on worker 0-0, policy_version 678833 (0.00087) [2022-07-10 10:10:52,766][25689] Fps is (10 sec: 5735.0, 60 sec: 5560.3, 300 sec: 5555.6). Total num frames: 695128064. Throughput: 0: 5772.2. Samples: 695124108. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:52,767][25689] Avg episode reward: [(0, '-5.923')] [2022-07-10 10:10:54,283][26022] Updated weights on worker 0-0, policy_version 678843 (0.00089) [2022-07-10 10:10:55,845][26022] Updated weights on worker 0-0, policy_version 678853 (0.00091) [2022-07-10 10:10:57,799][25689] Fps is (10 sec: 5454.1, 60 sec: 5525.0, 300 sec: 5546.3). Total num frames: 695153664. Throughput: 0: 5787.8. Samples: 695157946. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:10:57,800][25689] Avg episode reward: [(0, '-4.891')] [2022-07-10 10:10:57,937][26022] Updated weights on worker 0-0, policy_version 678863 (0.00078) [2022-07-10 10:10:59,540][26022] Updated weights on worker 0-0, policy_version 678873 (0.00090) [2022-07-10 10:11:01,408][26022] Updated weights on worker 0-0, policy_version 678883 (0.00091) [2022-07-10 10:11:02,844][25689] Fps is (10 sec: 5385.5, 60 sec: 5546.5, 300 sec: 5556.5). Total num frames: 695182336. Throughput: 0: 5732.0. Samples: 695189648. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:02,844][25689] Avg episode reward: [(0, '-3.528')] [2022-07-10 10:11:03,510][26022] Updated weights on worker 0-0, policy_version 678893 (0.00089) [2022-07-10 10:11:05,245][26022] Updated weights on worker 0-0, policy_version 678903 (0.00087) [2022-07-10 10:11:07,091][26022] Updated weights on worker 0-0, policy_version 678913 (0.00085) [2022-07-10 10:11:07,911][25689] Fps is (10 sec: 5671.4, 60 sec: 5595.9, 300 sec: 5555.5). Total num frames: 695211008. Throughput: 0: 4904.4. Samples: 695206556. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:07,911][25689] Avg episode reward: [(0, '-2.094')] [2022-07-10 10:11:09,196][26022] Updated weights on worker 0-0, policy_version 678923 (0.00085) [2022-07-10 10:11:10,780][26022] Updated weights on worker 0-0, policy_version 678933 (0.00089) [2022-07-10 10:11:12,741][26022] Updated weights on worker 0-0, policy_version 678943 (0.00088) [2022-07-10 10:11:12,987][25689] Fps is (10 sec: 5451.5, 60 sec: 5556.7, 300 sec: 5554.4). Total num frames: 695237632. Throughput: 0: 5737.1. Samples: 695240116. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:12,987][25689] Avg episode reward: [(0, '-1.866')] [2022-07-10 10:11:14,255][26022] Updated weights on worker 0-0, policy_version 678953 (0.00095) [2022-07-10 10:11:16,507][26022] Updated weights on worker 0-0, policy_version 678963 (0.00089) [2022-07-10 10:11:18,016][25689] Fps is (10 sec: 5573.3, 60 sec: 5574.0, 300 sec: 5555.9). Total num frames: 695267328. Throughput: 0: 5743.4. Samples: 695274058. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:18,016][25689] Avg episode reward: [(0, '-3.285')] [2022-07-10 10:11:18,300][26022] Updated weights on worker 0-0, policy_version 678973 (0.00048) [2022-07-10 10:11:19,991][26022] Updated weights on worker 0-0, policy_version 678983 (0.00092) [2022-07-10 10:11:21,900][26022] Updated weights on worker 0-0, policy_version 678993 (0.00092) [2022-07-10 10:11:23,097][25689] Fps is (10 sec: 5773.3, 60 sec: 5571.5, 300 sec: 5558.2). Total num frames: 695296000. Throughput: 0: 4999.3. Samples: 695290900. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:23,097][25689] Avg episode reward: [(0, '-3.429')] [2022-07-10 10:11:23,583][26022] Updated weights on worker 0-0, policy_version 679003 (0.00090) [2022-07-10 10:11:25,572][26022] Updated weights on worker 0-0, policy_version 679013 (0.00083) [2022-07-10 10:11:27,386][26022] Updated weights on worker 0-0, policy_version 679023 (0.00086) [2022-07-10 10:11:28,112][25689] Fps is (10 sec: 5578.4, 60 sec: 5593.3, 300 sec: 5561.6). Total num frames: 695323648. Throughput: 0: 5846.9. Samples: 695324672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:28,112][25689] Avg episode reward: [(0, '-3.300')] [2022-07-10 10:11:29,103][26022] Updated weights on worker 0-0, policy_version 679033 (0.00088) [2022-07-10 10:11:30,967][26022] Updated weights on worker 0-0, policy_version 679043 (0.00512) [2022-07-10 10:11:32,604][26022] Updated weights on worker 0-0, policy_version 679053 (0.00087) [2022-07-10 10:11:33,116][25689] Fps is (10 sec: 5723.5, 60 sec: 5596.9, 300 sec: 5562.1). Total num frames: 695353344. Throughput: 0: 5868.3. Samples: 695358240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:33,116][25689] Avg episode reward: [(0, '-4.486')] [2022-07-10 10:11:34,691][26022] Updated weights on worker 0-0, policy_version 679063 (0.01253) [2022-07-10 10:11:36,283][26022] Updated weights on worker 0-0, policy_version 679073 (0.00089) [2022-07-10 10:11:38,125][25689] Fps is (10 sec: 5624.8, 60 sec: 5586.2, 300 sec: 5557.0). Total num frames: 695379968. Throughput: 0: 5028.8. Samples: 695375182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 10:11:38,125][25689] Avg episode reward: [(0, '-4.732')] [2022-07-10 10:11:38,151][26022] Updated weights on worker 0-0, policy_version 679083 (0.00086) [2022-07-10 10:11:40,124][26022] Updated weights on worker 0-0, policy_version 679093 (0.00092) [2022-07-10 10:11:41,899][26022] Updated weights on worker 0-0, policy_version 679103 (0.00089) [2022-07-10 10:11:43,171][25689] Fps is (10 sec: 5397.5, 60 sec: 5577.9, 300 sec: 5557.6). Total num frames: 695407616. Throughput: 0: 5842.8. Samples: 695408188. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:11:43,172][25689] Avg episode reward: [(0, '-5.695')] [2022-07-10 10:11:43,840][26022] Updated weights on worker 0-0, policy_version 679113 (0.00085) [2022-07-10 10:11:45,574][26022] Updated weights on worker 0-0, policy_version 679123 (0.00088) [2022-07-10 10:11:47,384][26022] Updated weights on worker 0-0, policy_version 679133 (0.00095) [2022-07-10 10:11:48,175][25689] Fps is (10 sec: 5400.0, 60 sec: 5550.3, 300 sec: 5548.6). Total num frames: 695434240. Throughput: 0: 5843.0. Samples: 695441900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:11:48,175][25689] Avg episode reward: [(0, '-6.326')] [2022-07-10 10:11:49,404][26022] Updated weights on worker 0-0, policy_version 679143 (0.00085) [2022-07-10 10:11:51,186][26022] Updated weights on worker 0-0, policy_version 679153 (0.00084) [2022-07-10 10:11:53,031][26022] Updated weights on worker 0-0, policy_version 679163 (0.00089) [2022-07-10 10:11:53,275][25689] Fps is (10 sec: 5573.8, 60 sec: 5550.7, 300 sec: 5554.5). Total num frames: 695463936. Throughput: 0: 4985.6. Samples: 695458746. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:11:53,276][25689] Avg episode reward: [(0, '-6.202')] [2022-07-10 10:11:54,697][26022] Updated weights on worker 0-0, policy_version 679173 (0.00108) [2022-07-10 10:11:56,902][26022] Updated weights on worker 0-0, policy_version 679183 (0.00092) [2022-07-10 10:11:58,287][25689] Fps is (10 sec: 5771.9, 60 sec: 5603.5, 300 sec: 5556.3). Total num frames: 695492608. Throughput: 0: 5814.4. Samples: 695492416. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:11:58,288][25689] Avg episode reward: [(0, '-4.902')] [2022-07-10 10:11:58,437][26022] Updated weights on worker 0-0, policy_version 679193 (0.00087) [2022-07-10 10:12:00,604][26022] Updated weights on worker 0-0, policy_version 679203 (0.00064) [2022-07-10 10:12:02,233][26022] Updated weights on worker 0-0, policy_version 679213 (0.00055) [2022-07-10 10:12:03,325][25689] Fps is (10 sec: 5400.4, 60 sec: 5553.3, 300 sec: 5559.2). Total num frames: 695518208. Throughput: 0: 5735.9. Samples: 695523790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:03,326][25689] Avg episode reward: [(0, '-4.854')] [2022-07-10 10:12:04,579][26022] Updated weights on worker 0-0, policy_version 679223 (0.00084) [2022-07-10 10:12:05,900][26022] Updated weights on worker 0-0, policy_version 679233 (0.00085) [2022-07-10 10:12:08,170][26022] Updated weights on worker 0-0, policy_version 679243 (0.00091) [2022-07-10 10:12:08,344][25689] Fps is (10 sec: 5294.7, 60 sec: 5540.7, 300 sec: 5552.2). Total num frames: 695545856. Throughput: 0: 4894.4. Samples: 695540616. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:08,345][25689] Avg episode reward: [(0, '-5.700')] [2022-07-10 10:12:09,522][26022] Updated weights on worker 0-0, policy_version 679253 (0.00081) [2022-07-10 10:12:11,756][26022] Updated weights on worker 0-0, policy_version 679263 (0.00091) [2022-07-10 10:12:13,372][25689] Fps is (10 sec: 5605.6, 60 sec: 5579.1, 300 sec: 5552.0). Total num frames: 695574528. Throughput: 0: 5730.8. Samples: 695573916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:13,373][25689] Avg episode reward: [(0, '-3.523')] [2022-07-10 10:12:13,444][26022] Updated weights on worker 0-0, policy_version 679273 (0.00092) [2022-07-10 10:12:15,450][26022] Updated weights on worker 0-0, policy_version 679283 (0.00093) [2022-07-10 10:12:17,132][26022] Updated weights on worker 0-0, policy_version 679293 (0.00082) [2022-07-10 10:12:18,386][25689] Fps is (10 sec: 5608.7, 60 sec: 5546.6, 300 sec: 5552.6). Total num frames: 695602176. Throughput: 0: 5743.0. Samples: 695607840. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:18,386][25689] Avg episode reward: [(0, '-4.662')] [2022-07-10 10:12:19,079][26022] Updated weights on worker 0-0, policy_version 679303 (0.00083) [2022-07-10 10:12:20,891][26022] Updated weights on worker 0-0, policy_version 679313 (0.00094) [2022-07-10 10:12:22,567][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:12:22,579][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000679322_695625728.pth [2022-07-10 10:12:22,580][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000677368_693624832.pth [2022-07-10 10:12:22,813][26022] Updated weights on worker 0-0, policy_version 679323 (0.00086) [2022-07-10 10:12:23,519][25689] Fps is (10 sec: 5449.7, 60 sec: 5524.9, 300 sec: 5550.4). Total num frames: 695629824. Throughput: 0: 4994.9. Samples: 695624656. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:23,519][25689] Avg episode reward: [(0, '-5.166')] [2022-07-10 10:12:24,508][26022] Updated weights on worker 0-0, policy_version 679333 (0.00090) [2022-07-10 10:12:26,335][26022] Updated weights on worker 0-0, policy_version 679343 (0.00087) [2022-07-10 10:12:28,059][26022] Updated weights on worker 0-0, policy_version 679353 (0.00083) [2022-07-10 10:12:28,521][25689] Fps is (10 sec: 5556.6, 60 sec: 5543.0, 300 sec: 5550.5). Total num frames: 695658496. Throughput: 0: 5818.2. Samples: 695658010. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:28,522][25689] Avg episode reward: [(0, '-4.702')] [2022-07-10 10:12:29,967][26022] Updated weights on worker 0-0, policy_version 679363 (0.00087) [2022-07-10 10:12:32,071][26022] Updated weights on worker 0-0, policy_version 679373 (0.00085) [2022-07-10 10:12:33,601][25689] Fps is (10 sec: 5687.5, 60 sec: 5519.1, 300 sec: 5553.0). Total num frames: 695687168. Throughput: 0: 5806.4. Samples: 695691374. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:33,602][25689] Avg episode reward: [(0, '-4.189')] [2022-07-10 10:12:33,673][26022] Updated weights on worker 0-0, policy_version 679383 (0.00083) [2022-07-10 10:12:35,819][26022] Updated weights on worker 0-0, policy_version 679393 (0.00082) [2022-07-10 10:12:37,382][26022] Updated weights on worker 0-0, policy_version 679403 (0.00082) [2022-07-10 10:12:38,684][25689] Fps is (10 sec: 5541.8, 60 sec: 5529.2, 300 sec: 5549.1). Total num frames: 695714816. Throughput: 0: 5775.4. Samples: 695725072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:38,685][25689] Avg episode reward: [(0, '-4.710')] [2022-07-10 10:12:39,382][26022] Updated weights on worker 0-0, policy_version 679413 (0.00084) [2022-07-10 10:12:40,990][26022] Updated weights on worker 0-0, policy_version 679423 (0.00094) [2022-07-10 10:12:43,077][26022] Updated weights on worker 0-0, policy_version 679433 (0.00089) [2022-07-10 10:12:43,749][25689] Fps is (10 sec: 5651.0, 60 sec: 5561.4, 300 sec: 5554.8). Total num frames: 695744512. Throughput: 0: 5793.6. Samples: 695741862. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:43,751][25689] Avg episode reward: [(0, '-3.737')] [2022-07-10 10:12:44,893][26022] Updated weights on worker 0-0, policy_version 679443 (0.00084) [2022-07-10 10:12:46,619][26022] Updated weights on worker 0-0, policy_version 679453 (0.00097) [2022-07-10 10:12:48,357][26022] Updated weights on worker 0-0, policy_version 679463 (0.00085) [2022-07-10 10:12:48,799][25689] Fps is (10 sec: 5669.5, 60 sec: 5574.1, 300 sec: 5554.2). Total num frames: 695772160. Throughput: 0: 5798.1. Samples: 695775580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:48,800][25689] Avg episode reward: [(0, '-3.973')] [2022-07-10 10:12:50,328][26022] Updated weights on worker 0-0, policy_version 679473 (0.00086) [2022-07-10 10:12:51,918][26022] Updated weights on worker 0-0, policy_version 679483 (0.00090) [2022-07-10 10:12:53,823][25689] Fps is (10 sec: 5489.1, 60 sec: 5547.3, 300 sec: 5550.4). Total num frames: 695799808. Throughput: 0: 5832.2. Samples: 695809308. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:53,823][25689] Avg episode reward: [(0, '-3.455')] [2022-07-10 10:12:53,992][26022] Updated weights on worker 0-0, policy_version 679493 (0.00091) [2022-07-10 10:12:55,746][26022] Updated weights on worker 0-0, policy_version 679503 (0.00095) [2022-07-10 10:12:57,652][26022] Updated weights on worker 0-0, policy_version 679513 (0.00119) [2022-07-10 10:12:58,859][25689] Fps is (10 sec: 5598.4, 60 sec: 5545.1, 300 sec: 5557.9). Total num frames: 695828480. Throughput: 0: 5011.2. Samples: 695826170. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:12:58,861][25689] Avg episode reward: [(0, '-4.048')] [2022-07-10 10:12:59,239][26022] Updated weights on worker 0-0, policy_version 679523 (0.00091) [2022-07-10 10:13:01,599][26022] Updated weights on worker 0-0, policy_version 679533 (0.00081) [2022-07-10 10:13:03,271][26022] Updated weights on worker 0-0, policy_version 679543 (0.00088) [2022-07-10 10:13:03,940][25689] Fps is (10 sec: 5566.4, 60 sec: 5574.8, 300 sec: 5560.2). Total num frames: 695856128. Throughput: 0: 5758.5. Samples: 695858134. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:03,941][25689] Avg episode reward: [(0, '-3.522')] [2022-07-10 10:13:05,167][26022] Updated weights on worker 0-0, policy_version 679553 (0.00084) [2022-07-10 10:13:06,894][26022] Updated weights on worker 0-0, policy_version 679563 (0.00093) [2022-07-10 10:13:08,966][25689] Fps is (10 sec: 5268.3, 60 sec: 5540.5, 300 sec: 5549.7). Total num frames: 695881728. Throughput: 0: 5745.7. Samples: 695891456. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:08,966][25689] Avg episode reward: [(0, '-5.525')] [2022-07-10 10:13:09,051][26022] Updated weights on worker 0-0, policy_version 679573 (0.00100) [2022-07-10 10:13:10,554][26022] Updated weights on worker 0-0, policy_version 679583 (0.00088) [2022-07-10 10:13:12,554][26022] Updated weights on worker 0-0, policy_version 679593 (0.00096) [2022-07-10 10:13:13,979][25689] Fps is (10 sec: 5406.4, 60 sec: 5541.8, 300 sec: 5556.5). Total num frames: 695910400. Throughput: 0: 4911.8. Samples: 695908312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:13,980][25689] Avg episode reward: [(0, '-5.455')] [2022-07-10 10:13:14,358][26022] Updated weights on worker 0-0, policy_version 679603 (0.00095) [2022-07-10 10:13:16,066][26022] Updated weights on worker 0-0, policy_version 679613 (0.00088) [2022-07-10 10:13:18,088][26022] Updated weights on worker 0-0, policy_version 679623 (0.00085) [2022-07-10 10:13:19,063][25689] Fps is (10 sec: 5780.7, 60 sec: 5569.1, 300 sec: 5556.1). Total num frames: 695940096. Throughput: 0: 5727.4. Samples: 695941888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:19,063][25689] Avg episode reward: [(0, '-6.323')] [2022-07-10 10:13:19,771][26022] Updated weights on worker 0-0, policy_version 679633 (0.00091) [2022-07-10 10:13:21,571][26022] Updated weights on worker 0-0, policy_version 679643 (0.00097) [2022-07-10 10:13:23,653][26022] Updated weights on worker 0-0, policy_version 679653 (0.00089) [2022-07-10 10:13:24,189][25689] Fps is (10 sec: 5516.2, 60 sec: 5552.9, 300 sec: 5547.7). Total num frames: 695966720. Throughput: 0: 5801.9. Samples: 695975612. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:24,189][25689] Avg episode reward: [(0, '-6.071')] [2022-07-10 10:13:25,067][26022] Updated weights on worker 0-0, policy_version 679663 (0.00089) [2022-07-10 10:13:27,375][26022] Updated weights on worker 0-0, policy_version 679673 (0.00085) [2022-07-10 10:13:28,912][26022] Updated weights on worker 0-0, policy_version 679683 (0.00088) [2022-07-10 10:13:29,194][25689] Fps is (10 sec: 5559.4, 60 sec: 5569.6, 300 sec: 5554.6). Total num frames: 695996416. Throughput: 0: 4985.5. Samples: 695992300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:29,194][25689] Avg episode reward: [(0, '-7.800')] [2022-07-10 10:13:31,039][26022] Updated weights on worker 0-0, policy_version 679693 (0.00099) [2022-07-10 10:13:32,703][26022] Updated weights on worker 0-0, policy_version 679703 (0.00088) [2022-07-10 10:13:34,223][25689] Fps is (10 sec: 5613.0, 60 sec: 5540.5, 300 sec: 5548.9). Total num frames: 696023040. Throughput: 0: 5790.8. Samples: 696025540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:34,224][25689] Avg episode reward: [(0, '-7.598')] [2022-07-10 10:13:34,654][26022] Updated weights on worker 0-0, policy_version 679713 (0.00083) [2022-07-10 10:13:36,389][26022] Updated weights on worker 0-0, policy_version 679723 (0.00086) [2022-07-10 10:13:38,323][26022] Updated weights on worker 0-0, policy_version 679733 (0.00085) [2022-07-10 10:13:39,240][25689] Fps is (10 sec: 5504.1, 60 sec: 5563.4, 300 sec: 5554.1). Total num frames: 696051712. Throughput: 0: 5793.2. Samples: 696058776. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:39,242][25689] Avg episode reward: [(0, '-6.953')] [2022-07-10 10:13:40,168][26022] Updated weights on worker 0-0, policy_version 679743 (0.00079) [2022-07-10 10:13:42,047][26022] Updated weights on worker 0-0, policy_version 679753 (0.00091) [2022-07-10 10:13:43,782][26022] Updated weights on worker 0-0, policy_version 679763 (0.00090) [2022-07-10 10:13:44,281][25689] Fps is (10 sec: 5599.1, 60 sec: 5531.7, 300 sec: 5547.8). Total num frames: 696079360. Throughput: 0: 4973.7. Samples: 696075546. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:44,283][25689] Avg episode reward: [(0, '-7.933')] [2022-07-10 10:13:45,615][26022] Updated weights on worker 0-0, policy_version 679773 (0.00088) [2022-07-10 10:13:47,389][26022] Updated weights on worker 0-0, policy_version 679783 (0.00092) [2022-07-10 10:13:49,360][25689] Fps is (10 sec: 5565.0, 60 sec: 5546.0, 300 sec: 5549.8). Total num frames: 696108032. Throughput: 0: 5808.3. Samples: 696109434. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:49,361][25689] Avg episode reward: [(0, '-7.543')] [2022-07-10 10:13:49,373][26022] Updated weights on worker 0-0, policy_version 679793 (0.01140) [2022-07-10 10:13:50,939][26022] Updated weights on worker 0-0, policy_version 679803 (0.00087) [2022-07-10 10:13:52,955][26022] Updated weights on worker 0-0, policy_version 679813 (0.00089) [2022-07-10 10:13:54,385][25689] Fps is (10 sec: 5675.8, 60 sec: 5562.8, 300 sec: 5549.6). Total num frames: 696136704. Throughput: 0: 5828.3. Samples: 696143050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:54,385][25689] Avg episode reward: [(0, '-6.589')] [2022-07-10 10:13:54,705][26022] Updated weights on worker 0-0, policy_version 679823 (0.00091) [2022-07-10 10:13:56,626][26022] Updated weights on worker 0-0, policy_version 679833 (0.00722) [2022-07-10 10:13:58,380][26022] Updated weights on worker 0-0, policy_version 679843 (0.00097) [2022-07-10 10:13:59,416][25689] Fps is (10 sec: 5600.8, 60 sec: 5546.3, 300 sec: 5551.3). Total num frames: 696164352. Throughput: 0: 5013.3. Samples: 696159926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:13:59,417][25689] Avg episode reward: [(0, '-6.399')] [2022-07-10 10:14:00,191][26022] Updated weights on worker 0-0, policy_version 679853 (0.00088) [2022-07-10 10:14:02,450][26022] Updated weights on worker 0-0, policy_version 679863 (0.00087) [2022-07-10 10:14:04,050][26022] Updated weights on worker 0-0, policy_version 679873 (0.00085) [2022-07-10 10:14:04,473][25689] Fps is (10 sec: 5379.8, 60 sec: 5531.7, 300 sec: 5554.3). Total num frames: 696190976. Throughput: 0: 5753.6. Samples: 696191718. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:04,475][25689] Avg episode reward: [(0, '-6.545')] [2022-07-10 10:14:06,199][26022] Updated weights on worker 0-0, policy_version 679883 (0.00097) [2022-07-10 10:14:07,786][26022] Updated weights on worker 0-0, policy_version 679893 (0.00085) [2022-07-10 10:14:09,460][26022] Updated weights on worker 0-0, policy_version 679903 (0.00079) [2022-07-10 10:14:09,554][25689] Fps is (10 sec: 5555.2, 60 sec: 5594.2, 300 sec: 5560.4). Total num frames: 696220672. Throughput: 0: 5754.3. Samples: 696225636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:09,555][25689] Avg episode reward: [(0, '-4.730')] [2022-07-10 10:14:11,553][26022] Updated weights on worker 0-0, policy_version 679913 (0.00085) [2022-07-10 10:14:13,269][26022] Updated weights on worker 0-0, policy_version 679923 (0.00091) [2022-07-10 10:14:14,565][25689] Fps is (10 sec: 5682.2, 60 sec: 5577.5, 300 sec: 5561.0). Total num frames: 696248320. Throughput: 0: 4934.8. Samples: 696242634. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:14,565][25689] Avg episode reward: [(0, '-3.333')] [2022-07-10 10:14:15,212][26022] Updated weights on worker 0-0, policy_version 679933 (0.00083) [2022-07-10 10:14:17,066][26022] Updated weights on worker 0-0, policy_version 679943 (0.00094) [2022-07-10 10:14:18,743][26022] Updated weights on worker 0-0, policy_version 679953 (0.00089) [2022-07-10 10:14:19,600][25689] Fps is (10 sec: 5402.5, 60 sec: 5531.3, 300 sec: 5551.1). Total num frames: 696274944. Throughput: 0: 5758.9. Samples: 696276164. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:19,602][25689] Avg episode reward: [(0, '-2.550')] [2022-07-10 10:14:20,763][26022] Updated weights on worker 0-0, policy_version 679963 (0.00080) [2022-07-10 10:14:22,279][26022] Updated weights on worker 0-0, policy_version 679973 (0.00095) [2022-07-10 10:14:22,606][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:14:22,616][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000679974_696293376.pth [2022-07-10 10:14:22,618][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000678021_694293504.pth [2022-07-10 10:14:24,196][26022] Updated weights on worker 0-0, policy_version 679983 (0.00080) [2022-07-10 10:14:24,651][25689] Fps is (10 sec: 5584.0, 60 sec: 5589.0, 300 sec: 5560.9). Total num frames: 696304640. Throughput: 0: 5853.0. Samples: 696309820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:24,651][25689] Avg episode reward: [(0, '-3.526')] [2022-07-10 10:14:26,254][26022] Updated weights on worker 0-0, policy_version 679993 (0.00107) [2022-07-10 10:14:27,931][26022] Updated weights on worker 0-0, policy_version 680003 (0.00082) [2022-07-10 10:14:29,656][25689] Fps is (10 sec: 5702.8, 60 sec: 5555.1, 300 sec: 5557.6). Total num frames: 696332288. Throughput: 0: 5016.3. Samples: 696326470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:29,656][25689] Avg episode reward: [(0, '-4.429')] [2022-07-10 10:14:29,752][26022] Updated weights on worker 0-0, policy_version 680013 (0.00091) [2022-07-10 10:14:31,587][26022] Updated weights on worker 0-0, policy_version 680023 (0.00093) [2022-07-10 10:14:33,452][26022] Updated weights on worker 0-0, policy_version 680033 (0.00084) [2022-07-10 10:14:34,674][25689] Fps is (10 sec: 5618.8, 60 sec: 5589.9, 300 sec: 5557.6). Total num frames: 696360960. Throughput: 0: 5832.0. Samples: 696359912. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:34,675][25689] Avg episode reward: [(0, '-4.676')] [2022-07-10 10:14:35,257][26022] Updated weights on worker 0-0, policy_version 680043 (0.01104) [2022-07-10 10:14:37,163][26022] Updated weights on worker 0-0, policy_version 680053 (0.00057) [2022-07-10 10:14:38,762][26022] Updated weights on worker 0-0, policy_version 680063 (0.00087) [2022-07-10 10:14:39,690][25689] Fps is (10 sec: 5510.4, 60 sec: 5556.2, 300 sec: 5558.2). Total num frames: 696387584. Throughput: 0: 5844.0. Samples: 696393572. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:39,691][25689] Avg episode reward: [(0, '-4.903')] [2022-07-10 10:14:40,789][26022] Updated weights on worker 0-0, policy_version 680073 (0.00091) [2022-07-10 10:14:42,529][26022] Updated weights on worker 0-0, policy_version 680083 (0.00091) [2022-07-10 10:14:44,499][26022] Updated weights on worker 0-0, policy_version 680093 (0.00094) [2022-07-10 10:14:44,742][25689] Fps is (10 sec: 5492.5, 60 sec: 5572.2, 300 sec: 5553.9). Total num frames: 696416256. Throughput: 0: 4999.3. Samples: 696410260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:44,742][25689] Avg episode reward: [(0, '-4.895')] [2022-07-10 10:14:46,427][26022] Updated weights on worker 0-0, policy_version 680103 (0.00082) [2022-07-10 10:14:48,234][26022] Updated weights on worker 0-0, policy_version 680113 (0.00082) [2022-07-10 10:14:49,780][25689] Fps is (10 sec: 5582.1, 60 sec: 5559.0, 300 sec: 5553.5). Total num frames: 696443904. Throughput: 0: 5818.5. Samples: 696443564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:49,780][25689] Avg episode reward: [(0, '-3.760')] [2022-07-10 10:14:50,008][26022] Updated weights on worker 0-0, policy_version 680123 (0.00090) [2022-07-10 10:14:51,830][26022] Updated weights on worker 0-0, policy_version 680133 (0.00099) [2022-07-10 10:14:53,709][26022] Updated weights on worker 0-0, policy_version 680143 (0.00085) [2022-07-10 10:14:54,783][25689] Fps is (10 sec: 5506.9, 60 sec: 5544.0, 300 sec: 5553.7). Total num frames: 696471552. Throughput: 0: 5824.5. Samples: 696477036. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 10:14:54,784][25689] Avg episode reward: [(0, '-2.222')] [2022-07-10 10:14:55,491][26022] Updated weights on worker 0-0, policy_version 680153 (0.00084) [2022-07-10 10:14:57,430][26022] Updated weights on worker 0-0, policy_version 680163 (0.00087) [2022-07-10 10:14:59,347][26022] Updated weights on worker 0-0, policy_version 680173 (0.00086) [2022-07-10 10:14:59,802][25689] Fps is (10 sec: 5517.5, 60 sec: 5545.2, 300 sec: 5555.1). Total num frames: 696499200. Throughput: 0: 4978.6. Samples: 696493698. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:14:59,804][25689] Avg episode reward: [(0, '-1.551')] [2022-07-10 10:15:00,990][26022] Updated weights on worker 0-0, policy_version 680183 (0.00095) [2022-07-10 10:15:03,286][26022] Updated weights on worker 0-0, policy_version 680193 (0.00089) [2022-07-10 10:15:04,863][25689] Fps is (10 sec: 5383.8, 60 sec: 5544.7, 300 sec: 5558.4). Total num frames: 696525824. Throughput: 0: 5696.2. Samples: 696524878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:04,866][25689] Avg episode reward: [(0, '-1.569')] [2022-07-10 10:15:05,136][26022] Updated weights on worker 0-0, policy_version 680203 (0.00091) [2022-07-10 10:15:06,911][26022] Updated weights on worker 0-0, policy_version 680213 (0.00099) [2022-07-10 10:15:08,978][26022] Updated weights on worker 0-0, policy_version 680223 (0.00086) [2022-07-10 10:15:09,884][25689] Fps is (10 sec: 5383.0, 60 sec: 5516.4, 300 sec: 5554.9). Total num frames: 696553472. Throughput: 0: 5709.1. Samples: 696558340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:09,885][25689] Avg episode reward: [(0, '-3.064')] [2022-07-10 10:15:10,638][26022] Updated weights on worker 0-0, policy_version 680233 (0.00093) [2022-07-10 10:15:12,463][26022] Updated weights on worker 0-0, policy_version 680243 (0.00083) [2022-07-10 10:15:14,419][26022] Updated weights on worker 0-0, policy_version 680253 (0.00096) [2022-07-10 10:15:14,919][25689] Fps is (10 sec: 5499.3, 60 sec: 5514.1, 300 sec: 5551.5). Total num frames: 696581120. Throughput: 0: 4870.3. Samples: 696575104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:14,921][25689] Avg episode reward: [(0, '-3.542')] [2022-07-10 10:15:16,203][26022] Updated weights on worker 0-0, policy_version 680263 (0.00087) [2022-07-10 10:15:18,219][26022] Updated weights on worker 0-0, policy_version 680273 (0.00094) [2022-07-10 10:15:19,848][26022] Updated weights on worker 0-0, policy_version 680283 (0.00084) [2022-07-10 10:15:19,936][25689] Fps is (10 sec: 5602.7, 60 sec: 5549.7, 300 sec: 5552.2). Total num frames: 696609792. Throughput: 0: 5703.2. Samples: 696608528. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:19,938][25689] Avg episode reward: [(0, '-3.809')] [2022-07-10 10:15:21,899][26022] Updated weights on worker 0-0, policy_version 680293 (0.00089) [2022-07-10 10:15:23,572][26022] Updated weights on worker 0-0, policy_version 680303 (0.00086) [2022-07-10 10:15:25,054][25689] Fps is (10 sec: 5556.8, 60 sec: 5509.6, 300 sec: 5554.7). Total num frames: 696637440. Throughput: 0: 5803.7. Samples: 696642058. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:25,056][25689] Avg episode reward: [(0, '-4.457')] [2022-07-10 10:15:25,475][26022] Updated weights on worker 0-0, policy_version 680313 (0.00082) [2022-07-10 10:15:27,247][26022] Updated weights on worker 0-0, policy_version 680323 (0.00089) [2022-07-10 10:15:28,945][26022] Updated weights on worker 0-0, policy_version 680333 (0.00093) [2022-07-10 10:15:30,067][25689] Fps is (10 sec: 5559.5, 60 sec: 5525.9, 300 sec: 5551.8). Total num frames: 696666112. Throughput: 0: 4981.9. Samples: 696658888. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:30,067][25689] Avg episode reward: [(0, '-4.689')] [2022-07-10 10:15:30,971][26022] Updated weights on worker 0-0, policy_version 680343 (0.00079) [2022-07-10 10:15:32,855][26022] Updated weights on worker 0-0, policy_version 680353 (0.00099) [2022-07-10 10:15:34,473][26022] Updated weights on worker 0-0, policy_version 680363 (0.00094) [2022-07-10 10:15:35,076][25689] Fps is (10 sec: 5721.6, 60 sec: 5526.7, 300 sec: 5556.5). Total num frames: 696694784. Throughput: 0: 5834.4. Samples: 696692712. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:35,078][25689] Avg episode reward: [(0, '-2.973')] [2022-07-10 10:15:36,608][26022] Updated weights on worker 0-0, policy_version 680373 (0.00082) [2022-07-10 10:15:37,953][26022] Updated weights on worker 0-0, policy_version 680383 (0.00093) [2022-07-10 10:15:40,042][26022] Updated weights on worker 0-0, policy_version 680393 (0.00089) [2022-07-10 10:15:40,090][25689] Fps is (10 sec: 5619.0, 60 sec: 5543.9, 300 sec: 5555.4). Total num frames: 696722432. Throughput: 0: 5849.8. Samples: 696726422. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:40,090][25689] Avg episode reward: [(0, '-2.700')] [2022-07-10 10:15:41,829][26022] Updated weights on worker 0-0, policy_version 680403 (0.00077) [2022-07-10 10:15:43,675][26022] Updated weights on worker 0-0, policy_version 680413 (0.00086) [2022-07-10 10:15:45,159][25689] Fps is (10 sec: 5586.0, 60 sec: 5542.3, 300 sec: 5555.5). Total num frames: 696751104. Throughput: 0: 5868.1. Samples: 696760034. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:45,159][25689] Avg episode reward: [(0, '-3.676')] [2022-07-10 10:15:45,523][26022] Updated weights on worker 0-0, policy_version 680423 (0.00098) [2022-07-10 10:15:47,309][26022] Updated weights on worker 0-0, policy_version 680433 (0.00090) [2022-07-10 10:15:49,062][26022] Updated weights on worker 0-0, policy_version 680443 (0.00090) [2022-07-10 10:15:50,179][25689] Fps is (10 sec: 5480.7, 60 sec: 5527.0, 300 sec: 5546.7). Total num frames: 696777728. Throughput: 0: 5863.1. Samples: 696776808. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:50,179][25689] Avg episode reward: [(0, '-4.568')] [2022-07-10 10:15:51,075][26022] Updated weights on worker 0-0, policy_version 680453 (0.00085) [2022-07-10 10:15:52,744][26022] Updated weights on worker 0-0, policy_version 680463 (0.00090) [2022-07-10 10:15:54,662][26022] Updated weights on worker 0-0, policy_version 680473 (0.00086) [2022-07-10 10:15:55,203][25689] Fps is (10 sec: 5607.4, 60 sec: 5559.0, 300 sec: 5560.7). Total num frames: 696807424. Throughput: 0: 5862.0. Samples: 696810692. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:15:55,203][25689] Avg episode reward: [(0, '-4.968')] [2022-07-10 10:15:56,443][26022] Updated weights on worker 0-0, policy_version 680483 (0.00089) [2022-07-10 10:15:58,403][26022] Updated weights on worker 0-0, policy_version 680493 (0.00079) [2022-07-10 10:16:00,053][26022] Updated weights on worker 0-0, policy_version 680503 (0.00093) [2022-07-10 10:16:00,208][25689] Fps is (10 sec: 5717.8, 60 sec: 5560.2, 300 sec: 5557.9). Total num frames: 696835072. Throughput: 0: 5869.8. Samples: 696844512. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:00,209][25689] Avg episode reward: [(0, '-5.154')] [2022-07-10 10:16:02,435][26022] Updated weights on worker 0-0, policy_version 680513 (0.00089) [2022-07-10 10:16:04,184][26022] Updated weights on worker 0-0, policy_version 680523 (0.00088) [2022-07-10 10:16:05,311][25689] Fps is (10 sec: 5267.8, 60 sec: 5539.5, 300 sec: 5546.9). Total num frames: 696860672. Throughput: 0: 4919.5. Samples: 696859172. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:05,311][25689] Avg episode reward: [(0, '-5.940')] [2022-07-10 10:16:06,090][26022] Updated weights on worker 0-0, policy_version 680533 (0.00098) [2022-07-10 10:16:07,566][26022] Updated weights on worker 0-0, policy_version 680543 (0.00051) [2022-07-10 10:16:09,700][26022] Updated weights on worker 0-0, policy_version 680553 (0.00084) [2022-07-10 10:16:10,316][25689] Fps is (10 sec: 5369.0, 60 sec: 5557.8, 300 sec: 5555.2). Total num frames: 696889344. Throughput: 0: 5760.3. Samples: 696892806. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:10,318][25689] Avg episode reward: [(0, '-5.895')] [2022-07-10 10:16:11,387][26022] Updated weights on worker 0-0, policy_version 680563 (0.00892) [2022-07-10 10:16:13,290][26022] Updated weights on worker 0-0, policy_version 680573 (0.00086) [2022-07-10 10:16:15,207][26022] Updated weights on worker 0-0, policy_version 680583 (0.00102) [2022-07-10 10:16:15,342][25689] Fps is (10 sec: 5614.7, 60 sec: 5558.7, 300 sec: 5548.3). Total num frames: 696916992. Throughput: 0: 5734.9. Samples: 696926188. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:15,342][25689] Avg episode reward: [(0, '-4.310')] [2022-07-10 10:16:17,261][26022] Updated weights on worker 0-0, policy_version 680593 (0.00089) [2022-07-10 10:16:18,935][26022] Updated weights on worker 0-0, policy_version 680603 (0.00088) [2022-07-10 10:16:20,353][25689] Fps is (10 sec: 5509.7, 60 sec: 5542.3, 300 sec: 5546.2). Total num frames: 696944640. Throughput: 0: 4877.4. Samples: 696942764. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:20,353][25689] Avg episode reward: [(0, '-1.864')] [2022-07-10 10:16:20,731][26022] Updated weights on worker 0-0, policy_version 680613 (0.00088) [2022-07-10 10:16:22,652][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:16:22,666][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000680623_696957952.pth [2022-07-10 10:16:22,666][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000678670_694958080.pth [2022-07-10 10:16:22,669][26022] Updated weights on worker 0-0, policy_version 680623 (0.00091) [2022-07-10 10:16:24,403][26022] Updated weights on worker 0-0, policy_version 680633 (0.00084) [2022-07-10 10:16:25,465][25689] Fps is (10 sec: 5563.4, 60 sec: 5559.8, 300 sec: 5547.8). Total num frames: 696973312. Throughput: 0: 5803.2. Samples: 696976130. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:25,466][25689] Avg episode reward: [(0, '-2.852')] [2022-07-10 10:16:26,422][26022] Updated weights on worker 0-0, policy_version 680643 (0.00088) [2022-07-10 10:16:28,141][26022] Updated weights on worker 0-0, policy_version 680653 (0.00573) [2022-07-10 10:16:29,971][26022] Updated weights on worker 0-0, policy_version 680663 (0.00101) [2022-07-10 10:16:30,496][25689] Fps is (10 sec: 5653.1, 60 sec: 5558.1, 300 sec: 5543.9). Total num frames: 697001984. Throughput: 0: 5787.3. Samples: 697009592. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:30,497][25689] Avg episode reward: [(0, '-1.680')] [2022-07-10 10:16:31,869][26022] Updated weights on worker 0-0, policy_version 680673 (0.00088) [2022-07-10 10:16:33,600][26022] Updated weights on worker 0-0, policy_version 680683 (0.00090) [2022-07-10 10:16:35,445][26022] Updated weights on worker 0-0, policy_version 680693 (0.00083) [2022-07-10 10:16:35,541][25689] Fps is (10 sec: 5589.5, 60 sec: 5537.9, 300 sec: 5546.6). Total num frames: 697029632. Throughput: 0: 4954.3. Samples: 697026262. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:35,543][25689] Avg episode reward: [(0, '-1.815')] [2022-07-10 10:16:37,322][26022] Updated weights on worker 0-0, policy_version 680703 (0.00082) [2022-07-10 10:16:39,115][26022] Updated weights on worker 0-0, policy_version 680713 (0.00091) [2022-07-10 10:16:40,544][25689] Fps is (10 sec: 5503.4, 60 sec: 5538.9, 300 sec: 5547.5). Total num frames: 697057280. Throughput: 0: 5779.3. Samples: 697059456. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:40,544][25689] Avg episode reward: [(0, '-2.938')] [2022-07-10 10:16:41,117][26022] Updated weights on worker 0-0, policy_version 680723 (0.00093) [2022-07-10 10:16:42,746][26022] Updated weights on worker 0-0, policy_version 680733 (0.00093) [2022-07-10 10:16:44,656][26022] Updated weights on worker 0-0, policy_version 680743 (0.00103) [2022-07-10 10:16:45,598][25689] Fps is (10 sec: 5600.3, 60 sec: 5540.3, 300 sec: 5553.4). Total num frames: 697085952. Throughput: 0: 5798.5. Samples: 697092870. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:45,598][25689] Avg episode reward: [(0, '-2.655')] [2022-07-10 10:16:46,691][26022] Updated weights on worker 0-0, policy_version 680753 (0.00092) [2022-07-10 10:16:48,401][26022] Updated weights on worker 0-0, policy_version 680763 (0.00096) [2022-07-10 10:16:50,354][26022] Updated weights on worker 0-0, policy_version 680773 (0.00086) [2022-07-10 10:16:50,621][25689] Fps is (10 sec: 5487.1, 60 sec: 5540.0, 300 sec: 5544.5). Total num frames: 697112576. Throughput: 0: 4954.9. Samples: 697109310. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:50,622][25689] Avg episode reward: [(0, '-2.635')] [2022-07-10 10:16:51,998][26022] Updated weights on worker 0-0, policy_version 680783 (0.00092) [2022-07-10 10:16:54,176][26022] Updated weights on worker 0-0, policy_version 680793 (0.00055) [2022-07-10 10:16:55,667][25689] Fps is (10 sec: 5389.6, 60 sec: 5504.0, 300 sec: 5540.4). Total num frames: 697140224. Throughput: 0: 5771.4. Samples: 697142420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:16:55,669][25689] Avg episode reward: [(0, '-1.704')] [2022-07-10 10:16:55,977][26022] Updated weights on worker 0-0, policy_version 680803 (0.00090) [2022-07-10 10:16:57,773][26022] Updated weights on worker 0-0, policy_version 680813 (0.00103) [2022-07-10 10:16:59,677][26022] Updated weights on worker 0-0, policy_version 680823 (0.00091) [2022-07-10 10:17:00,673][25689] Fps is (10 sec: 5500.9, 60 sec: 5504.0, 300 sec: 5547.9). Total num frames: 697167872. Throughput: 0: 5780.5. Samples: 697175814. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:00,674][25689] Avg episode reward: [(0, '-3.645')] [2022-07-10 10:17:01,828][26022] Updated weights on worker 0-0, policy_version 680833 (0.00100) [2022-07-10 10:17:03,711][26022] Updated weights on worker 0-0, policy_version 680843 (0.00084) [2022-07-10 10:17:05,314][26022] Updated weights on worker 0-0, policy_version 680853 (0.00067) [2022-07-10 10:17:05,739][25689] Fps is (10 sec: 5388.5, 60 sec: 5524.3, 300 sec: 5543.6). Total num frames: 697194496. Throughput: 0: 4851.4. Samples: 697190586. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:05,740][25689] Avg episode reward: [(0, '-3.736')] [2022-07-10 10:17:07,519][26022] Updated weights on worker 0-0, policy_version 680863 (0.00083) [2022-07-10 10:17:09,028][26022] Updated weights on worker 0-0, policy_version 680873 (0.00084) [2022-07-10 10:17:10,831][25689] Fps is (10 sec: 5343.1, 60 sec: 5499.5, 300 sec: 5539.0). Total num frames: 697222144. Throughput: 0: 5676.0. Samples: 697224020. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:10,831][25689] Avg episode reward: [(0, '-3.360')] [2022-07-10 10:17:11,208][26022] Updated weights on worker 0-0, policy_version 680883 (0.00088) [2022-07-10 10:17:12,536][26022] Updated weights on worker 0-0, policy_version 680893 (0.00083) [2022-07-10 10:17:14,603][26022] Updated weights on worker 0-0, policy_version 680903 (0.00086) [2022-07-10 10:17:15,873][25689] Fps is (10 sec: 5658.8, 60 sec: 5531.8, 300 sec: 5545.3). Total num frames: 697251840. Throughput: 0: 5725.9. Samples: 697258114. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:15,873][25689] Avg episode reward: [(0, '-4.944')] [2022-07-10 10:17:16,323][26022] Updated weights on worker 0-0, policy_version 680913 (0.00084) [2022-07-10 10:17:18,163][26022] Updated weights on worker 0-0, policy_version 680923 (0.00090) [2022-07-10 10:17:20,095][26022] Updated weights on worker 0-0, policy_version 680933 (0.00085) [2022-07-10 10:17:20,927][25689] Fps is (10 sec: 5578.3, 60 sec: 5511.0, 300 sec: 5543.4). Total num frames: 697278464. Throughput: 0: 4892.3. Samples: 697274900. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:20,928][25689] Avg episode reward: [(0, '-4.988')] [2022-07-10 10:17:21,955][26022] Updated weights on worker 0-0, policy_version 680943 (0.00093) [2022-07-10 10:17:23,832][26022] Updated weights on worker 0-0, policy_version 680953 (0.00096) [2022-07-10 10:17:25,755][26022] Updated weights on worker 0-0, policy_version 680963 (0.00086) [2022-07-10 10:17:26,002][25689] Fps is (10 sec: 5560.3, 60 sec: 5531.3, 300 sec: 5545.5). Total num frames: 697308160. Throughput: 0: 5821.9. Samples: 697308552. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:26,002][25689] Avg episode reward: [(0, '-4.681')] [2022-07-10 10:17:27,247][26022] Updated weights on worker 0-0, policy_version 680973 (0.00082) [2022-07-10 10:17:29,410][26022] Updated weights on worker 0-0, policy_version 680983 (0.00090) [2022-07-10 10:17:30,828][26022] Updated weights on worker 0-0, policy_version 680993 (0.00092) [2022-07-10 10:17:31,019][25689] Fps is (10 sec: 5783.3, 60 sec: 5532.6, 300 sec: 5546.6). Total num frames: 697336832. Throughput: 0: 5846.2. Samples: 697342046. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:31,020][25689] Avg episode reward: [(0, '-3.410')] [2022-07-10 10:17:32,917][26022] Updated weights on worker 0-0, policy_version 681003 (0.00092) [2022-07-10 10:17:34,656][26022] Updated weights on worker 0-0, policy_version 681013 (0.00052) [2022-07-10 10:17:36,043][25689] Fps is (10 sec: 5609.1, 60 sec: 5534.5, 300 sec: 5547.7). Total num frames: 697364480. Throughput: 0: 5016.9. Samples: 697359304. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:36,044][25689] Avg episode reward: [(0, '-4.189')] [2022-07-10 10:17:36,474][26022] Updated weights on worker 0-0, policy_version 681023 (0.00092) [2022-07-10 10:17:38,371][26022] Updated weights on worker 0-0, policy_version 681033 (0.00410) [2022-07-10 10:17:40,072][26022] Updated weights on worker 0-0, policy_version 681043 (0.00080) [2022-07-10 10:17:41,045][25689] Fps is (10 sec: 5515.5, 60 sec: 5534.6, 300 sec: 5542.0). Total num frames: 697392128. Throughput: 0: 5879.1. Samples: 697393176. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:41,046][25689] Avg episode reward: [(0, '-3.577')] [2022-07-10 10:17:41,780][26022] Updated weights on worker 0-0, policy_version 681053 (0.00083) [2022-07-10 10:17:43,995][26022] Updated weights on worker 0-0, policy_version 681063 (0.00093) [2022-07-10 10:17:45,543][26022] Updated weights on worker 0-0, policy_version 681073 (0.00098) [2022-07-10 10:17:46,163][25689] Fps is (10 sec: 5564.9, 60 sec: 5528.7, 300 sec: 5544.2). Total num frames: 697420800. Throughput: 0: 5863.6. Samples: 697426770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:46,164][25689] Avg episode reward: [(0, '-2.306')] [2022-07-10 10:17:47,502][26022] Updated weights on worker 0-0, policy_version 681083 (0.00103) [2022-07-10 10:17:49,239][26022] Updated weights on worker 0-0, policy_version 681093 (0.00080) [2022-07-10 10:17:51,172][26022] Updated weights on worker 0-0, policy_version 681103 (0.00093) [2022-07-10 10:17:51,224][25689] Fps is (10 sec: 5734.0, 60 sec: 5576.0, 300 sec: 5550.4). Total num frames: 697450496. Throughput: 0: 5013.5. Samples: 697443344. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:51,225][25689] Avg episode reward: [(0, '-2.413')] [2022-07-10 10:17:52,936][26022] Updated weights on worker 0-0, policy_version 681113 (0.00086) [2022-07-10 10:17:54,901][26022] Updated weights on worker 0-0, policy_version 681123 (0.00093) [2022-07-10 10:17:56,287][25689] Fps is (10 sec: 5563.3, 60 sec: 5557.6, 300 sec: 5543.0). Total num frames: 697477120. Throughput: 0: 5810.6. Samples: 697476934. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:17:56,287][25689] Avg episode reward: [(0, '-2.822')] [2022-07-10 10:17:56,597][26022] Updated weights on worker 0-0, policy_version 681133 (0.00097) [2022-07-10 10:17:58,487][26022] Updated weights on worker 0-0, policy_version 681143 (0.00086) [2022-07-10 10:18:00,232][26022] Updated weights on worker 0-0, policy_version 681153 (0.00091) [2022-07-10 10:18:01,362][25689] Fps is (10 sec: 5353.8, 60 sec: 5551.3, 300 sec: 5543.1). Total num frames: 697504768. Throughput: 0: 5761.2. Samples: 697510224. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:18:01,362][25689] Avg episode reward: [(0, '-1.909')] [2022-07-10 10:18:02,552][26022] Updated weights on worker 0-0, policy_version 681163 (0.00084) [2022-07-10 10:18:04,278][26022] Updated weights on worker 0-0, policy_version 681173 (0.00089) [2022-07-10 10:18:06,209][26022] Updated weights on worker 0-0, policy_version 681183 (0.00088) [2022-07-10 10:18:06,488][25689] Fps is (10 sec: 5420.6, 60 sec: 5562.6, 300 sec: 5548.1). Total num frames: 697532416. Throughput: 0: 5652.2. Samples: 697541650. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:18:06,488][25689] Avg episode reward: [(0, '-2.483')] [2022-07-10 10:18:08,039][26022] Updated weights on worker 0-0, policy_version 681193 (0.00070) [2022-07-10 10:18:09,904][26022] Updated weights on worker 0-0, policy_version 681203 (0.00088) [2022-07-10 10:18:11,503][25689] Fps is (10 sec: 5452.7, 60 sec: 5569.6, 300 sec: 5544.6). Total num frames: 697560064. Throughput: 0: 5678.4. Samples: 697558492. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:18:11,504][25689] Avg episode reward: [(0, '-2.496')] [2022-07-10 10:18:11,753][26022] Updated weights on worker 0-0, policy_version 681213 (0.00088) [2022-07-10 10:18:13,666][26022] Updated weights on worker 0-0, policy_version 681223 (0.00089) [2022-07-10 10:18:15,260][26022] Updated weights on worker 0-0, policy_version 681233 (0.00086) [2022-07-10 10:18:16,507][25689] Fps is (10 sec: 5519.2, 60 sec: 5539.3, 300 sec: 5539.3). Total num frames: 697587712. Throughput: 0: 5697.6. Samples: 697592140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 10:18:16,507][25689] Avg episode reward: [(0, '-2.566')] [2022-07-10 10:18:17,259][26022] Updated weights on worker 0-0, policy_version 681243 (0.00086) [2022-07-10 10:18:18,907][26022] Updated weights on worker 0-0, policy_version 681253 (0.00086) [2022-07-10 10:18:20,951][26022] Updated weights on worker 0-0, policy_version 681263 (0.00084) [2022-07-10 10:18:21,532][25689] Fps is (10 sec: 5615.3, 60 sec: 5575.7, 300 sec: 5548.0). Total num frames: 697616384. Throughput: 0: 5740.7. Samples: 697626018. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:21,533][25689] Avg episode reward: [(0, '-2.999')] [2022-07-10 10:18:22,660][26022] Updated weights on worker 0-0, policy_version 681273 (0.00085) [2022-07-10 10:18:22,823][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:18:22,837][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000681274_697624576.pth [2022-07-10 10:18:22,837][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000679322_695625728.pth [2022-07-10 10:18:24,609][26022] Updated weights on worker 0-0, policy_version 681283 (0.00089) [2022-07-10 10:18:26,238][26022] Updated weights on worker 0-0, policy_version 681293 (0.00053) [2022-07-10 10:18:26,613][25689] Fps is (10 sec: 5775.5, 60 sec: 5575.2, 300 sec: 5546.6). Total num frames: 697646080. Throughput: 0: 5038.2. Samples: 697643044. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:26,613][25689] Avg episode reward: [(0, '-2.248')] [2022-07-10 10:18:28,190][26022] Updated weights on worker 0-0, policy_version 681303 (0.00096) [2022-07-10 10:18:30,003][26022] Updated weights on worker 0-0, policy_version 681313 (0.00084) [2022-07-10 10:18:31,698][25689] Fps is (10 sec: 5641.0, 60 sec: 5552.2, 300 sec: 5549.0). Total num frames: 697673728. Throughput: 0: 5825.3. Samples: 697676136. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:31,698][25689] Avg episode reward: [(0, '-2.556')] [2022-07-10 10:18:31,732][26022] Updated weights on worker 0-0, policy_version 681323 (0.00096) [2022-07-10 10:18:33,783][26022] Updated weights on worker 0-0, policy_version 681333 (0.00085) [2022-07-10 10:18:35,225][26022] Updated weights on worker 0-0, policy_version 681343 (0.00091) [2022-07-10 10:18:36,726][25689] Fps is (10 sec: 5467.8, 60 sec: 5551.7, 300 sec: 5545.3). Total num frames: 697701376. Throughput: 0: 5818.9. Samples: 697709792. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:36,726][25689] Avg episode reward: [(0, '-3.352')] [2022-07-10 10:18:37,422][26022] Updated weights on worker 0-0, policy_version 681353 (0.00096) [2022-07-10 10:18:39,274][26022] Updated weights on worker 0-0, policy_version 681363 (0.00089) [2022-07-10 10:18:41,003][26022] Updated weights on worker 0-0, policy_version 681373 (0.00094) [2022-07-10 10:18:41,778][25689] Fps is (10 sec: 5587.0, 60 sec: 5564.0, 300 sec: 5548.6). Total num frames: 697730048. Throughput: 0: 4961.5. Samples: 697726468. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:41,779][25689] Avg episode reward: [(0, '-3.165')] [2022-07-10 10:18:43,020][26022] Updated weights on worker 0-0, policy_version 681383 (0.00091) [2022-07-10 10:18:44,497][26022] Updated weights on worker 0-0, policy_version 681393 (0.00083) [2022-07-10 10:18:46,680][26022] Updated weights on worker 0-0, policy_version 681403 (0.00083) [2022-07-10 10:18:46,895][25689] Fps is (10 sec: 5538.4, 60 sec: 5547.3, 300 sec: 5544.4). Total num frames: 697757696. Throughput: 0: 5764.5. Samples: 697759960. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:46,895][25689] Avg episode reward: [(0, '-2.843')] [2022-07-10 10:18:48,313][26022] Updated weights on worker 0-0, policy_version 681413 (0.00086) [2022-07-10 10:18:50,095][26022] Updated weights on worker 0-0, policy_version 681423 (0.00082) [2022-07-10 10:18:51,952][25689] Fps is (10 sec: 5636.8, 60 sec: 5547.7, 300 sec: 5547.3). Total num frames: 697787392. Throughput: 0: 5779.8. Samples: 697793198. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:51,952][25689] Avg episode reward: [(0, '-2.794')] [2022-07-10 10:18:51,963][26022] Updated weights on worker 0-0, policy_version 681433 (0.00060) [2022-07-10 10:18:53,718][26022] Updated weights on worker 0-0, policy_version 681443 (0.00085) [2022-07-10 10:18:55,631][26022] Updated weights on worker 0-0, policy_version 681453 (0.00094) [2022-07-10 10:18:56,986][25689] Fps is (10 sec: 5581.1, 60 sec: 5550.3, 300 sec: 5543.8). Total num frames: 697814016. Throughput: 0: 4958.0. Samples: 697810242. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:18:56,986][25689] Avg episode reward: [(0, '-2.515')] [2022-07-10 10:18:57,543][26022] Updated weights on worker 0-0, policy_version 681463 (0.00088) [2022-07-10 10:18:59,299][26022] Updated weights on worker 0-0, policy_version 681473 (0.00087) [2022-07-10 10:19:01,064][26022] Updated weights on worker 0-0, policy_version 681483 (0.00090) [2022-07-10 10:19:02,019][25689] Fps is (10 sec: 5391.0, 60 sec: 5554.1, 300 sec: 5547.7). Total num frames: 697841664. Throughput: 0: 5803.7. Samples: 697843936. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:02,019][25689] Avg episode reward: [(0, '-3.860')] [2022-07-10 10:19:03,343][26022] Updated weights on worker 0-0, policy_version 681493 (0.00083) [2022-07-10 10:19:05,151][26022] Updated weights on worker 0-0, policy_version 681503 (0.00084) [2022-07-10 10:19:07,106][26022] Updated weights on worker 0-0, policy_version 681513 (0.00094) [2022-07-10 10:19:07,199][25689] Fps is (10 sec: 5414.2, 60 sec: 5549.2, 300 sec: 5539.0). Total num frames: 697869312. Throughput: 0: 5707.1. Samples: 697875838. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:07,200][25689] Avg episode reward: [(0, '-3.777')] [2022-07-10 10:19:08,845][26022] Updated weights on worker 0-0, policy_version 681523 (0.00089) [2022-07-10 10:19:10,767][26022] Updated weights on worker 0-0, policy_version 681533 (0.00085) [2022-07-10 10:19:12,218][25689] Fps is (10 sec: 5521.8, 60 sec: 5565.6, 300 sec: 5542.2). Total num frames: 697897984. Throughput: 0: 4908.2. Samples: 697892670. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:12,219][25689] Avg episode reward: [(0, '-4.725')] [2022-07-10 10:19:12,378][26022] Updated weights on worker 0-0, policy_version 681543 (0.00088) [2022-07-10 10:19:14,432][26022] Updated weights on worker 0-0, policy_version 681553 (0.00089) [2022-07-10 10:19:16,023][26022] Updated weights on worker 0-0, policy_version 681563 (0.00095) [2022-07-10 10:19:17,267][25689] Fps is (10 sec: 5695.9, 60 sec: 5578.4, 300 sec: 5548.9). Total num frames: 697926656. Throughput: 0: 5720.1. Samples: 697926250. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:17,267][25689] Avg episode reward: [(0, '-5.958')] [2022-07-10 10:19:18,124][26022] Updated weights on worker 0-0, policy_version 681573 (0.00091) [2022-07-10 10:19:19,804][26022] Updated weights on worker 0-0, policy_version 681583 (0.00103) [2022-07-10 10:19:21,620][26022] Updated weights on worker 0-0, policy_version 681593 (0.00086) [2022-07-10 10:19:22,328][25689] Fps is (10 sec: 5570.7, 60 sec: 5558.3, 300 sec: 5541.8). Total num frames: 697954304. Throughput: 0: 5708.1. Samples: 697959866. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:22,330][25689] Avg episode reward: [(0, '-6.222')] [2022-07-10 10:19:23,527][26022] Updated weights on worker 0-0, policy_version 681603 (0.00082) [2022-07-10 10:19:25,243][26022] Updated weights on worker 0-0, policy_version 681613 (0.00089) [2022-07-10 10:19:27,185][26022] Updated weights on worker 0-0, policy_version 681623 (0.00096) [2022-07-10 10:19:27,460][25689] Fps is (10 sec: 5525.1, 60 sec: 5536.7, 300 sec: 5542.9). Total num frames: 697982976. Throughput: 0: 4982.7. Samples: 697976794. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:27,461][25689] Avg episode reward: [(0, '-4.135')] [2022-07-10 10:19:28,966][26022] Updated weights on worker 0-0, policy_version 681633 (0.00183) [2022-07-10 10:19:30,749][26022] Updated weights on worker 0-0, policy_version 681643 (0.00086) [2022-07-10 10:19:32,467][25689] Fps is (10 sec: 5656.2, 60 sec: 5560.8, 300 sec: 5543.1). Total num frames: 698011648. Throughput: 0: 5812.5. Samples: 698010364. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:32,467][25689] Avg episode reward: [(0, '-3.650')] [2022-07-10 10:19:32,620][26022] Updated weights on worker 0-0, policy_version 681653 (0.00084) [2022-07-10 10:19:34,438][26022] Updated weights on worker 0-0, policy_version 681663 (0.00087) [2022-07-10 10:19:36,203][26022] Updated weights on worker 0-0, policy_version 681673 (0.00094) [2022-07-10 10:19:37,485][25689] Fps is (10 sec: 5618.2, 60 sec: 5561.7, 300 sec: 5546.5). Total num frames: 698039296. Throughput: 0: 5841.7. Samples: 698044358. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:37,485][25689] Avg episode reward: [(0, '-2.648')] [2022-07-10 10:19:38,121][26022] Updated weights on worker 0-0, policy_version 681683 (0.00087) [2022-07-10 10:19:39,713][26022] Updated weights on worker 0-0, policy_version 681693 (0.00085) [2022-07-10 10:19:41,520][26022] Updated weights on worker 0-0, policy_version 681703 (0.00090) [2022-07-10 10:19:42,563][25689] Fps is (10 sec: 5679.7, 60 sec: 5576.2, 300 sec: 5549.4). Total num frames: 698068992. Throughput: 0: 5018.8. Samples: 698061418. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:42,563][25689] Avg episode reward: [(0, '-1.261')] [2022-07-10 10:19:43,643][26022] Updated weights on worker 0-0, policy_version 681713 (0.00091) [2022-07-10 10:19:45,040][26022] Updated weights on worker 0-0, policy_version 681723 (0.00085) [2022-07-10 10:19:47,119][26022] Updated weights on worker 0-0, policy_version 681733 (0.00082) [2022-07-10 10:19:47,639][25689] Fps is (10 sec: 5748.1, 60 sec: 5596.7, 300 sec: 5552.2). Total num frames: 698097664. Throughput: 0: 5893.7. Samples: 698095722. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:47,640][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 10:19:48,681][26022] Updated weights on worker 0-0, policy_version 681743 (0.00100) [2022-07-10 10:19:50,711][26022] Updated weights on worker 0-0, policy_version 681753 (0.00086) [2022-07-10 10:19:52,392][26022] Updated weights on worker 0-0, policy_version 681763 (0.00087) [2022-07-10 10:19:52,680][25689] Fps is (10 sec: 5667.8, 60 sec: 5581.3, 300 sec: 5554.9). Total num frames: 698126336. Throughput: 0: 5898.2. Samples: 698129590. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:52,681][25689] Avg episode reward: [(0, '-1.677')] [2022-07-10 10:19:54,391][26022] Updated weights on worker 0-0, policy_version 681773 (0.00087) [2022-07-10 10:19:56,120][26022] Updated weights on worker 0-0, policy_version 681783 (0.00070) [2022-07-10 10:19:57,776][25689] Fps is (10 sec: 5657.0, 60 sec: 5609.3, 300 sec: 5556.9). Total num frames: 698155008. Throughput: 0: 5861.6. Samples: 698163296. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:19:57,777][25689] Avg episode reward: [(0, '-2.607')] [2022-07-10 10:19:57,982][26022] Updated weights on worker 0-0, policy_version 681793 (0.00097) [2022-07-10 10:20:00,003][26022] Updated weights on worker 0-0, policy_version 681803 (0.00098) [2022-07-10 10:20:01,575][26022] Updated weights on worker 0-0, policy_version 681813 (0.00076) [2022-07-10 10:20:02,827][25689] Fps is (10 sec: 5349.0, 60 sec: 5574.0, 300 sec: 5553.7). Total num frames: 698180608. Throughput: 0: 5856.5. Samples: 698180092. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:02,829][25689] Avg episode reward: [(0, '-2.559')] [2022-07-10 10:20:03,939][26022] Updated weights on worker 0-0, policy_version 681823 (0.00082) [2022-07-10 10:20:05,622][26022] Updated weights on worker 0-0, policy_version 681833 (0.00096) [2022-07-10 10:20:07,454][26022] Updated weights on worker 0-0, policy_version 681843 (0.00082) [2022-07-10 10:20:07,882][25689] Fps is (10 sec: 5370.0, 60 sec: 5602.3, 300 sec: 5556.4). Total num frames: 698209280. Throughput: 0: 5726.2. Samples: 698211640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:07,883][25689] Avg episode reward: [(0, '-3.227')] [2022-07-10 10:20:09,288][26022] Updated weights on worker 0-0, policy_version 681853 (0.00092) [2022-07-10 10:20:11,197][26022] Updated weights on worker 0-0, policy_version 681863 (0.00089) [2022-07-10 10:20:12,911][25689] Fps is (10 sec: 5585.1, 60 sec: 5584.6, 300 sec: 5556.6). Total num frames: 698236928. Throughput: 0: 5701.2. Samples: 698244926. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:12,911][25689] Avg episode reward: [(0, '-2.881')] [2022-07-10 10:20:13,115][26022] Updated weights on worker 0-0, policy_version 681873 (0.00088) [2022-07-10 10:20:14,914][26022] Updated weights on worker 0-0, policy_version 681883 (0.00082) [2022-07-10 10:20:16,782][26022] Updated weights on worker 0-0, policy_version 681893 (0.00087) [2022-07-10 10:20:17,923][25689] Fps is (10 sec: 5405.2, 60 sec: 5554.2, 300 sec: 5549.8). Total num frames: 698263552. Throughput: 0: 4884.3. Samples: 698261700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:17,925][25689] Avg episode reward: [(0, '-3.648')] [2022-07-10 10:20:18,538][26022] Updated weights on worker 0-0, policy_version 681903 (0.00086) [2022-07-10 10:20:20,335][26022] Updated weights on worker 0-0, policy_version 681913 (0.00087) [2022-07-10 10:20:22,280][26022] Updated weights on worker 0-0, policy_version 681923 (0.00085) [2022-07-10 10:20:22,855][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:20:22,864][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000681926_698292224.pth [2022-07-10 10:20:22,873][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000679974_696293376.pth [2022-07-10 10:20:22,935][25689] Fps is (10 sec: 5516.1, 60 sec: 5575.6, 300 sec: 5555.2). Total num frames: 698292224. Throughput: 0: 5709.0. Samples: 698294892. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:22,936][25689] Avg episode reward: [(0, '-3.347')] [2022-07-10 10:20:24,256][26022] Updated weights on worker 0-0, policy_version 681933 (0.00097) [2022-07-10 10:20:26,027][26022] Updated weights on worker 0-0, policy_version 681943 (0.00096) [2022-07-10 10:20:27,830][26022] Updated weights on worker 0-0, policy_version 681953 (0.00083) [2022-07-10 10:20:27,982][25689] Fps is (10 sec: 5701.1, 60 sec: 5583.5, 300 sec: 5554.6). Total num frames: 698320896. Throughput: 0: 5799.4. Samples: 698328202. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:27,982][25689] Avg episode reward: [(0, '-3.730')] [2022-07-10 10:20:29,732][26022] Updated weights on worker 0-0, policy_version 681963 (0.00085) [2022-07-10 10:20:31,585][26022] Updated weights on worker 0-0, policy_version 681973 (0.00085) [2022-07-10 10:20:32,985][25689] Fps is (10 sec: 5502.0, 60 sec: 5549.9, 300 sec: 5547.8). Total num frames: 698347520. Throughput: 0: 4978.4. Samples: 698344866. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:32,986][25689] Avg episode reward: [(0, '-4.169')] [2022-07-10 10:20:33,346][26022] Updated weights on worker 0-0, policy_version 681983 (0.00086) [2022-07-10 10:20:35,114][26022] Updated weights on worker 0-0, policy_version 681993 (0.00081) [2022-07-10 10:20:36,983][26022] Updated weights on worker 0-0, policy_version 682003 (0.00089) [2022-07-10 10:20:37,995][25689] Fps is (10 sec: 5522.0, 60 sec: 5567.6, 300 sec: 5551.3). Total num frames: 698376192. Throughput: 0: 5808.7. Samples: 698378292. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:37,996][25689] Avg episode reward: [(0, '-4.693')] [2022-07-10 10:20:38,886][26022] Updated weights on worker 0-0, policy_version 682013 (0.00081) [2022-07-10 10:20:40,767][26022] Updated weights on worker 0-0, policy_version 682023 (0.00086) [2022-07-10 10:20:42,816][26022] Updated weights on worker 0-0, policy_version 682033 (0.00083) [2022-07-10 10:20:42,998][25689] Fps is (10 sec: 5522.5, 60 sec: 5523.7, 300 sec: 5545.6). Total num frames: 698402816. Throughput: 0: 5830.3. Samples: 698411866. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:42,999][25689] Avg episode reward: [(0, '-5.255')] [2022-07-10 10:20:44,220][26022] Updated weights on worker 0-0, policy_version 682043 (0.00100) [2022-07-10 10:20:46,471][26022] Updated weights on worker 0-0, policy_version 682053 (0.00092) [2022-07-10 10:20:47,944][26022] Updated weights on worker 0-0, policy_version 682063 (0.00090) [2022-07-10 10:20:48,115][25689] Fps is (10 sec: 5565.2, 60 sec: 5536.9, 300 sec: 5554.2). Total num frames: 698432512. Throughput: 0: 4988.5. Samples: 698428640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:48,116][25689] Avg episode reward: [(0, '-4.890')] [2022-07-10 10:20:49,973][26022] Updated weights on worker 0-0, policy_version 682073 (0.00091) [2022-07-10 10:20:51,778][26022] Updated weights on worker 0-0, policy_version 682083 (0.00086) [2022-07-10 10:20:53,119][25689] Fps is (10 sec: 5767.3, 60 sec: 5540.3, 300 sec: 5551.1). Total num frames: 698461184. Throughput: 0: 5832.2. Samples: 698462288. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:53,119][25689] Avg episode reward: [(0, '-4.575')] [2022-07-10 10:20:53,428][26022] Updated weights on worker 0-0, policy_version 682093 (0.00092) [2022-07-10 10:20:55,533][26022] Updated weights on worker 0-0, policy_version 682103 (0.00082) [2022-07-10 10:20:57,200][26022] Updated weights on worker 0-0, policy_version 682113 (0.00081) [2022-07-10 10:20:58,193][25689] Fps is (10 sec: 5487.2, 60 sec: 5508.4, 300 sec: 5546.4). Total num frames: 698487808. Throughput: 0: 5797.8. Samples: 698495392. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:20:58,193][25689] Avg episode reward: [(0, '-3.924')] [2022-07-10 10:20:59,140][26022] Updated weights on worker 0-0, policy_version 682123 (0.00086) [2022-07-10 10:21:00,960][26022] Updated weights on worker 0-0, policy_version 682133 (0.00085) [2022-07-10 10:21:03,099][26022] Updated weights on worker 0-0, policy_version 682143 (0.00085) [2022-07-10 10:21:03,198][25689] Fps is (10 sec: 5283.0, 60 sec: 5529.5, 300 sec: 5551.6). Total num frames: 698514432. Throughput: 0: 4974.8. Samples: 698512356. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:03,198][25689] Avg episode reward: [(0, '-4.449')] [2022-07-10 10:21:05,064][26022] Updated weights on worker 0-0, policy_version 682153 (0.00085) [2022-07-10 10:21:06,954][26022] Updated weights on worker 0-0, policy_version 682163 (0.00082) [2022-07-10 10:21:08,320][25689] Fps is (10 sec: 5359.1, 60 sec: 5506.5, 300 sec: 5546.0). Total num frames: 698542080. Throughput: 0: 5699.5. Samples: 698543794. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:08,320][25689] Avg episode reward: [(0, '-3.242')] [2022-07-10 10:21:08,780][26022] Updated weights on worker 0-0, policy_version 682173 (0.00090) [2022-07-10 10:21:10,562][26022] Updated weights on worker 0-0, policy_version 682183 (0.00083) [2022-07-10 10:21:12,417][26022] Updated weights on worker 0-0, policy_version 682193 (0.00086) [2022-07-10 10:21:13,355][25689] Fps is (10 sec: 5444.0, 60 sec: 5505.9, 300 sec: 5545.8). Total num frames: 698569728. Throughput: 0: 5667.9. Samples: 698576986. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:13,356][25689] Avg episode reward: [(0, '-2.658')] [2022-07-10 10:21:14,283][26022] Updated weights on worker 0-0, policy_version 682203 (0.00093) [2022-07-10 10:21:16,053][26022] Updated weights on worker 0-0, policy_version 682213 (0.00088) [2022-07-10 10:21:17,871][26022] Updated weights on worker 0-0, policy_version 682223 (0.00084) [2022-07-10 10:21:18,373][25689] Fps is (10 sec: 5805.6, 60 sec: 5573.1, 300 sec: 5556.0). Total num frames: 698600448. Throughput: 0: 4878.2. Samples: 698593838. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:18,374][25689] Avg episode reward: [(0, '-1.676')] [2022-07-10 10:21:19,795][26022] Updated weights on worker 0-0, policy_version 682233 (0.00079) [2022-07-10 10:21:21,540][26022] Updated weights on worker 0-0, policy_version 682243 (0.00084) [2022-07-10 10:21:23,243][26022] Updated weights on worker 0-0, policy_version 682253 (0.00091) [2022-07-10 10:21:23,411][25689] Fps is (10 sec: 5702.6, 60 sec: 5536.9, 300 sec: 5550.5). Total num frames: 698627072. Throughput: 0: 5685.3. Samples: 698627272. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:23,411][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 10:21:25,192][26022] Updated weights on worker 0-0, policy_version 682263 (0.00089) [2022-07-10 10:21:27,159][26022] Updated weights on worker 0-0, policy_version 682273 (0.00087) [2022-07-10 10:21:28,475][25689] Fps is (10 sec: 5271.4, 60 sec: 5501.4, 300 sec: 5543.1). Total num frames: 698653696. Throughput: 0: 5789.7. Samples: 698660486. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:28,476][25689] Avg episode reward: [(0, '-1.250')] [2022-07-10 10:21:29,024][26022] Updated weights on worker 0-0, policy_version 682283 (0.00090) [2022-07-10 10:21:30,643][26022] Updated weights on worker 0-0, policy_version 682293 (0.00088) [2022-07-10 10:21:32,652][26022] Updated weights on worker 0-0, policy_version 682303 (0.00081) [2022-07-10 10:21:33,490][25689] Fps is (10 sec: 5485.9, 60 sec: 5534.2, 300 sec: 5547.0). Total num frames: 698682368. Throughput: 0: 4982.7. Samples: 698677316. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:21:33,491][25689] Avg episode reward: [(0, '-1.963')] [2022-07-10 10:21:34,403][26022] Updated weights on worker 0-0, policy_version 682313 (0.00086) [2022-07-10 10:21:36,241][26022] Updated weights on worker 0-0, policy_version 682323 (0.00085) [2022-07-10 10:21:38,132][26022] Updated weights on worker 0-0, policy_version 682333 (0.00089) [2022-07-10 10:21:38,503][25689] Fps is (10 sec: 5820.1, 60 sec: 5550.8, 300 sec: 5553.7). Total num frames: 698712064. Throughput: 0: 5823.8. Samples: 698711072. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:21:38,504][25689] Avg episode reward: [(0, '-2.504')] [2022-07-10 10:21:39,917][26022] Updated weights on worker 0-0, policy_version 682343 (0.00090) [2022-07-10 10:21:41,677][26022] Updated weights on worker 0-0, policy_version 682353 (0.00092) [2022-07-10 10:21:43,513][25689] Fps is (10 sec: 5517.3, 60 sec: 5533.3, 300 sec: 5544.2). Total num frames: 698737664. Throughput: 0: 5812.8. Samples: 698744122. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:21:43,523][25689] Avg episode reward: [(0, '-2.228')] [2022-07-10 10:21:43,755][26022] Updated weights on worker 0-0, policy_version 682363 (0.00092) [2022-07-10 10:21:45,530][26022] Updated weights on worker 0-0, policy_version 682373 (0.00094) [2022-07-10 10:21:47,212][26022] Updated weights on worker 0-0, policy_version 682383 (0.00090) [2022-07-10 10:21:48,640][25689] Fps is (10 sec: 5454.9, 60 sec: 5532.4, 300 sec: 5552.6). Total num frames: 698767360. Throughput: 0: 4977.1. Samples: 698760852. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:21:48,642][25689] Avg episode reward: [(0, '-2.965')] [2022-07-10 10:21:49,398][26022] Updated weights on worker 0-0, policy_version 682393 (0.00092) [2022-07-10 10:21:50,756][26022] Updated weights on worker 0-0, policy_version 682403 (0.00091) [2022-07-10 10:21:53,020][26022] Updated weights on worker 0-0, policy_version 682413 (0.00092) [2022-07-10 10:21:53,644][25689] Fps is (10 sec: 5761.2, 60 sec: 5532.4, 300 sec: 5556.8). Total num frames: 698796032. Throughput: 0: 5817.2. Samples: 698794554. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:21:53,655][25689] Avg episode reward: [(0, '-3.992')] [2022-07-10 10:21:54,355][26022] Updated weights on worker 0-0, policy_version 682423 (0.00090) [2022-07-10 10:21:56,576][26022] Updated weights on worker 0-0, policy_version 682433 (0.00098) [2022-07-10 10:21:58,571][26022] Updated weights on worker 0-0, policy_version 682443 (0.00081) [2022-07-10 10:21:58,692][25689] Fps is (10 sec: 5399.3, 60 sec: 5517.8, 300 sec: 5549.2). Total num frames: 698821632. Throughput: 0: 5780.1. Samples: 698827764. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:21:58,693][25689] Avg episode reward: [(0, '-4.093')] [2022-07-10 10:22:00,013][26022] Updated weights on worker 0-0, policy_version 682453 (0.00097) [2022-07-10 10:22:02,409][26022] Updated weights on worker 0-0, policy_version 682463 (0.00090) [2022-07-10 10:22:03,707][25689] Fps is (10 sec: 5393.5, 60 sec: 5550.8, 300 sec: 5557.0). Total num frames: 698850304. Throughput: 0: 4982.9. Samples: 698844746. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:03,707][25689] Avg episode reward: [(0, '-3.234')] [2022-07-10 10:22:04,375][26022] Updated weights on worker 0-0, policy_version 682473 (0.00086) [2022-07-10 10:22:05,874][26022] Updated weights on worker 0-0, policy_version 682483 (0.00083) [2022-07-10 10:22:08,048][26022] Updated weights on worker 0-0, policy_version 682493 (0.00083) [2022-07-10 10:22:08,756][25689] Fps is (10 sec: 5596.2, 60 sec: 5557.5, 300 sec: 5557.8). Total num frames: 698877952. Throughput: 0: 5750.7. Samples: 698876530. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:08,756][25689] Avg episode reward: [(0, '-3.517')] [2022-07-10 10:22:09,470][26022] Updated weights on worker 0-0, policy_version 682503 (0.00089) [2022-07-10 10:22:11,527][26022] Updated weights on worker 0-0, policy_version 682513 (0.00090) [2022-07-10 10:22:13,300][26022] Updated weights on worker 0-0, policy_version 682523 (0.00091) [2022-07-10 10:22:13,776][25689] Fps is (10 sec: 5593.2, 60 sec: 5575.8, 300 sec: 5554.8). Total num frames: 698906624. Throughput: 0: 5761.0. Samples: 698910532. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:13,776][25689] Avg episode reward: [(0, '-3.975')] [2022-07-10 10:22:14,962][26022] Updated weights on worker 0-0, policy_version 682533 (0.00084) [2022-07-10 10:22:16,958][26022] Updated weights on worker 0-0, policy_version 682543 (0.00094) [2022-07-10 10:22:18,611][26022] Updated weights on worker 0-0, policy_version 682553 (0.00087) [2022-07-10 10:22:18,835][25689] Fps is (10 sec: 5689.2, 60 sec: 5538.1, 300 sec: 5561.5). Total num frames: 698935296. Throughput: 0: 4958.1. Samples: 698927638. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:18,836][25689] Avg episode reward: [(0, '-3.345')] [2022-07-10 10:22:20,402][26022] Updated weights on worker 0-0, policy_version 682563 (0.00087) [2022-07-10 10:22:22,391][26022] Updated weights on worker 0-0, policy_version 682573 (0.00093) [2022-07-10 10:22:22,992][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:22:23,012][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000682576_698957824.pth [2022-07-10 10:22:23,015][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000680623_696957952.pth [2022-07-10 10:22:23,837][25689] Fps is (10 sec: 5699.3, 60 sec: 5575.3, 300 sec: 5559.5). Total num frames: 698963968. Throughput: 0: 5810.7. Samples: 698961720. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:23,838][25689] Avg episode reward: [(0, '-2.602')] [2022-07-10 10:22:24,077][26022] Updated weights on worker 0-0, policy_version 682583 (0.00086) [2022-07-10 10:22:26,036][26022] Updated weights on worker 0-0, policy_version 682593 (0.00094) [2022-07-10 10:22:27,874][26022] Updated weights on worker 0-0, policy_version 682603 (0.00084) [2022-07-10 10:22:28,937][25689] Fps is (10 sec: 5474.3, 60 sec: 5572.0, 300 sec: 5551.1). Total num frames: 698990592. Throughput: 0: 5865.6. Samples: 698994902. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:28,937][25689] Avg episode reward: [(0, '-2.813')] [2022-07-10 10:22:29,709][26022] Updated weights on worker 0-0, policy_version 682613 (0.00095) [2022-07-10 10:22:31,465][26022] Updated weights on worker 0-0, policy_version 682623 (0.00081) [2022-07-10 10:22:33,390][26022] Updated weights on worker 0-0, policy_version 682633 (0.00092) [2022-07-10 10:22:33,957][25689] Fps is (10 sec: 5464.5, 60 sec: 5571.7, 300 sec: 5554.6). Total num frames: 699019264. Throughput: 0: 5842.8. Samples: 699028444. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:33,957][25689] Avg episode reward: [(0, '-1.929')] [2022-07-10 10:22:35,023][26022] Updated weights on worker 0-0, policy_version 682643 (0.00409) [2022-07-10 10:22:37,021][26022] Updated weights on worker 0-0, policy_version 682653 (0.00086) [2022-07-10 10:22:38,837][26022] Updated weights on worker 0-0, policy_version 682663 (0.00087) [2022-07-10 10:22:38,958][25689] Fps is (10 sec: 5619.7, 60 sec: 5538.8, 300 sec: 5554.6). Total num frames: 699046912. Throughput: 0: 5845.8. Samples: 699045272. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:38,959][25689] Avg episode reward: [(0, '-1.815')] [2022-07-10 10:22:40,537][26022] Updated weights on worker 0-0, policy_version 682673 (0.00091) [2022-07-10 10:22:42,674][26022] Updated weights on worker 0-0, policy_version 682683 (0.00094) [2022-07-10 10:22:43,971][25689] Fps is (10 sec: 5624.0, 60 sec: 5589.3, 300 sec: 5556.6). Total num frames: 699075584. Throughput: 0: 5815.2. Samples: 699078800. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:43,971][25689] Avg episode reward: [(0, '-2.625')] [2022-07-10 10:22:44,102][26022] Updated weights on worker 0-0, policy_version 682693 (0.00089) [2022-07-10 10:22:46,366][26022] Updated weights on worker 0-0, policy_version 682703 (0.00087) [2022-07-10 10:22:47,974][26022] Updated weights on worker 0-0, policy_version 682713 (0.00096) [2022-07-10 10:22:49,073][25689] Fps is (10 sec: 5466.8, 60 sec: 5540.8, 300 sec: 5545.5). Total num frames: 699102208. Throughput: 0: 5825.5. Samples: 699112208. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:49,074][25689] Avg episode reward: [(0, '-3.483')] [2022-07-10 10:22:49,940][26022] Updated weights on worker 0-0, policy_version 682723 (0.00086) [2022-07-10 10:22:51,684][26022] Updated weights on worker 0-0, policy_version 682733 (0.00094) [2022-07-10 10:22:53,444][26022] Updated weights on worker 0-0, policy_version 682743 (0.00080) [2022-07-10 10:22:54,142][25689] Fps is (10 sec: 5436.7, 60 sec: 5534.9, 300 sec: 5552.2). Total num frames: 699130880. Throughput: 0: 4979.6. Samples: 699128958. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:54,143][25689] Avg episode reward: [(0, '-2.739')] [2022-07-10 10:22:55,299][26022] Updated weights on worker 0-0, policy_version 682753 (0.00086) [2022-07-10 10:22:57,404][26022] Updated weights on worker 0-0, policy_version 682763 (0.00088) [2022-07-10 10:22:59,010][26022] Updated weights on worker 0-0, policy_version 682773 (0.00091) [2022-07-10 10:22:59,241][25689] Fps is (10 sec: 5740.6, 60 sec: 5597.9, 300 sec: 5558.7). Total num frames: 699160576. Throughput: 0: 5763.3. Samples: 699162166. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:22:59,242][25689] Avg episode reward: [(0, '-2.679')] [2022-07-10 10:23:01,076][26022] Updated weights on worker 0-0, policy_version 682783 (0.00073) [2022-07-10 10:23:02,985][26022] Updated weights on worker 0-0, policy_version 682793 (0.00094) [2022-07-10 10:23:04,270][25689] Fps is (10 sec: 5257.3, 60 sec: 5512.0, 300 sec: 5546.7). Total num frames: 699184128. Throughput: 0: 5669.5. Samples: 699193886. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:04,272][25689] Avg episode reward: [(0, '-2.980')] [2022-07-10 10:23:04,989][26022] Updated weights on worker 0-0, policy_version 682803 (0.00081) [2022-07-10 10:23:06,709][26022] Updated weights on worker 0-0, policy_version 682813 (0.00083) [2022-07-10 10:23:08,633][26022] Updated weights on worker 0-0, policy_version 682823 (0.00090) [2022-07-10 10:23:09,353][25689] Fps is (10 sec: 5367.1, 60 sec: 5559.6, 300 sec: 5555.8). Total num frames: 699214848. Throughput: 0: 4858.9. Samples: 699210744. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:09,353][25689] Avg episode reward: [(0, '-2.947')] [2022-07-10 10:23:10,523][26022] Updated weights on worker 0-0, policy_version 682833 (0.00086) [2022-07-10 10:23:12,250][26022] Updated weights on worker 0-0, policy_version 682843 (0.00084) [2022-07-10 10:23:14,317][26022] Updated weights on worker 0-0, policy_version 682853 (0.00083) [2022-07-10 10:23:14,383][25689] Fps is (10 sec: 5670.5, 60 sec: 5524.9, 300 sec: 5551.8). Total num frames: 699241472. Throughput: 0: 5688.6. Samples: 699244102. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:14,383][25689] Avg episode reward: [(0, '-3.133')] [2022-07-10 10:23:15,836][26022] Updated weights on worker 0-0, policy_version 682863 (0.00084) [2022-07-10 10:23:17,911][26022] Updated weights on worker 0-0, policy_version 682873 (0.00091) [2022-07-10 10:23:19,395][25689] Fps is (10 sec: 5608.4, 60 sec: 5546.2, 300 sec: 5555.5). Total num frames: 699271168. Throughput: 0: 5730.3. Samples: 699277654. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:19,395][25689] Avg episode reward: [(0, '-2.579')] [2022-07-10 10:23:19,635][26022] Updated weights on worker 0-0, policy_version 682883 (0.00414) [2022-07-10 10:23:21,428][26022] Updated weights on worker 0-0, policy_version 682893 (0.00088) [2022-07-10 10:23:23,454][26022] Updated weights on worker 0-0, policy_version 682903 (0.00091) [2022-07-10 10:23:24,408][25689] Fps is (10 sec: 5617.8, 60 sec: 5511.3, 300 sec: 5546.5). Total num frames: 699297792. Throughput: 0: 4995.1. Samples: 699294478. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:24,408][25689] Avg episode reward: [(0, '-3.017')] [2022-07-10 10:23:25,206][26022] Updated weights on worker 0-0, policy_version 682913 (0.00086) [2022-07-10 10:23:27,095][26022] Updated weights on worker 0-0, policy_version 682923 (0.00080) [2022-07-10 10:23:28,875][26022] Updated weights on worker 0-0, policy_version 682933 (0.00098) [2022-07-10 10:23:29,482][25689] Fps is (10 sec: 5583.5, 60 sec: 5564.4, 300 sec: 5553.6). Total num frames: 699327488. Throughput: 0: 5805.2. Samples: 699327596. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:29,482][25689] Avg episode reward: [(0, '-2.807')] [2022-07-10 10:23:30,716][26022] Updated weights on worker 0-0, policy_version 682943 (0.00091) [2022-07-10 10:23:32,631][26022] Updated weights on worker 0-0, policy_version 682953 (0.00068) [2022-07-10 10:23:34,351][26022] Updated weights on worker 0-0, policy_version 682963 (0.00088) [2022-07-10 10:23:34,577][25689] Fps is (10 sec: 5538.2, 60 sec: 5523.6, 300 sec: 5548.9). Total num frames: 699354112. Throughput: 0: 5795.6. Samples: 699361142. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:34,578][25689] Avg episode reward: [(0, '-1.989')] [2022-07-10 10:23:36,227][26022] Updated weights on worker 0-0, policy_version 682973 (0.00087) [2022-07-10 10:23:38,107][26022] Updated weights on worker 0-0, policy_version 682983 (0.00088) [2022-07-10 10:23:39,597][25689] Fps is (10 sec: 5567.5, 60 sec: 5555.8, 300 sec: 5552.9). Total num frames: 699383808. Throughput: 0: 4978.4. Samples: 699378230. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:39,598][25689] Avg episode reward: [(0, '-1.838')] [2022-07-10 10:23:39,624][26022] Updated weights on worker 0-0, policy_version 682993 (0.00087) [2022-07-10 10:23:41,834][26022] Updated weights on worker 0-0, policy_version 683003 (0.00085) [2022-07-10 10:23:43,347][26022] Updated weights on worker 0-0, policy_version 683013 (0.00089) [2022-07-10 10:23:44,637][25689] Fps is (10 sec: 5598.6, 60 sec: 5519.5, 300 sec: 5550.9). Total num frames: 699410432. Throughput: 0: 5819.6. Samples: 699412202. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:44,637][25689] Avg episode reward: [(0, '-1.299')] [2022-07-10 10:23:45,299][26022] Updated weights on worker 0-0, policy_version 683023 (0.00088) [2022-07-10 10:23:47,018][26022] Updated weights on worker 0-0, policy_version 683033 (0.00095) [2022-07-10 10:23:48,890][26022] Updated weights on worker 0-0, policy_version 683043 (0.00089) [2022-07-10 10:23:49,698][25689] Fps is (10 sec: 5575.7, 60 sec: 5573.9, 300 sec: 5550.8). Total num frames: 699440128. Throughput: 0: 5846.4. Samples: 699445790. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:49,703][25689] Avg episode reward: [(0, '-1.991')] [2022-07-10 10:23:50,897][26022] Updated weights on worker 0-0, policy_version 683053 (0.00092) [2022-07-10 10:23:52,689][26022] Updated weights on worker 0-0, policy_version 683063 (0.00083) [2022-07-10 10:23:54,530][26022] Updated weights on worker 0-0, policy_version 683073 (0.00091) [2022-07-10 10:23:54,785][25689] Fps is (10 sec: 5650.6, 60 sec: 5555.3, 300 sec: 5553.3). Total num frames: 699467776. Throughput: 0: 4993.0. Samples: 699462040. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:54,786][25689] Avg episode reward: [(0, '-0.856')] [2022-07-10 10:23:56,491][26022] Updated weights on worker 0-0, policy_version 683083 (0.00088) [2022-07-10 10:23:58,226][26022] Updated weights on worker 0-0, policy_version 683093 (0.00086) [2022-07-10 10:23:59,802][25689] Fps is (10 sec: 5473.1, 60 sec: 5529.1, 300 sec: 5553.6). Total num frames: 699495424. Throughput: 0: 5802.0. Samples: 699495454. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:23:59,803][25689] Avg episode reward: [(0, '-1.619')] [2022-07-10 10:24:00,142][26022] Updated weights on worker 0-0, policy_version 683103 (0.00081) [2022-07-10 10:24:02,417][26022] Updated weights on worker 0-0, policy_version 683113 (0.00086) [2022-07-10 10:24:04,030][26022] Updated weights on worker 0-0, policy_version 683123 (0.00084) [2022-07-10 10:24:04,807][25689] Fps is (10 sec: 5313.1, 60 sec: 5565.1, 300 sec: 5550.0). Total num frames: 699521024. Throughput: 0: 5682.2. Samples: 699526814. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:04,808][25689] Avg episode reward: [(0, '-3.169')] [2022-07-10 10:24:06,146][26022] Updated weights on worker 0-0, policy_version 683133 (0.00093) [2022-07-10 10:24:07,635][26022] Updated weights on worker 0-0, policy_version 683143 (0.00085) [2022-07-10 10:24:09,686][26022] Updated weights on worker 0-0, policy_version 683153 (0.00084) [2022-07-10 10:24:09,889][25689] Fps is (10 sec: 5482.0, 60 sec: 5548.3, 300 sec: 5552.2). Total num frames: 699550720. Throughput: 0: 4842.3. Samples: 699543556. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:09,889][25689] Avg episode reward: [(0, '-2.998')] [2022-07-10 10:24:11,522][26022] Updated weights on worker 0-0, policy_version 683163 (0.00090) [2022-07-10 10:24:13,288][26022] Updated weights on worker 0-0, policy_version 683173 (0.00088) [2022-07-10 10:24:14,969][25689] Fps is (10 sec: 5542.6, 60 sec: 5543.7, 300 sec: 5544.8). Total num frames: 699577344. Throughput: 0: 5712.0. Samples: 699577328. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:14,969][25689] Avg episode reward: [(0, '-4.085')] [2022-07-10 10:24:15,250][26022] Updated weights on worker 0-0, policy_version 683183 (0.00081) [2022-07-10 10:24:17,004][26022] Updated weights on worker 0-0, policy_version 683193 (0.00091) [2022-07-10 10:24:18,715][26022] Updated weights on worker 0-0, policy_version 683203 (0.00084) [2022-07-10 10:24:19,990][25689] Fps is (10 sec: 5474.3, 60 sec: 5526.0, 300 sec: 5549.0). Total num frames: 699606016. Throughput: 0: 5731.8. Samples: 699611168. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:19,990][25689] Avg episode reward: [(0, '-3.700')] [2022-07-10 10:24:20,633][26022] Updated weights on worker 0-0, policy_version 683213 (0.00092) [2022-07-10 10:24:22,499][26022] Updated weights on worker 0-0, policy_version 683223 (0.00087) [2022-07-10 10:24:23,071][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:24:23,085][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000683226_699623424.pth [2022-07-10 10:24:23,086][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000681274_697624576.pth [2022-07-10 10:24:24,127][26022] Updated weights on worker 0-0, policy_version 683233 (0.00087) [2022-07-10 10:24:25,014][25689] Fps is (10 sec: 5810.5, 60 sec: 5575.7, 300 sec: 5554.4). Total num frames: 699635712. Throughput: 0: 5004.3. Samples: 699627934. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:25,015][25689] Avg episode reward: [(0, '-3.692')] [2022-07-10 10:24:26,360][26022] Updated weights on worker 0-0, policy_version 683243 (0.00096) [2022-07-10 10:24:27,869][26022] Updated weights on worker 0-0, policy_version 683253 (0.00090) [2022-07-10 10:24:29,856][26022] Updated weights on worker 0-0, policy_version 683263 (0.00094) [2022-07-10 10:24:30,075][25689] Fps is (10 sec: 5482.6, 60 sec: 5509.2, 300 sec: 5543.1). Total num frames: 699661312. Throughput: 0: 5835.0. Samples: 699661346. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:30,076][25689] Avg episode reward: [(0, '-4.237')] [2022-07-10 10:24:31,438][26022] Updated weights on worker 0-0, policy_version 683273 (0.00083) [2022-07-10 10:24:33,426][26022] Updated weights on worker 0-0, policy_version 683283 (0.00091) [2022-07-10 10:24:35,098][25689] Fps is (10 sec: 5382.1, 60 sec: 5549.7, 300 sec: 5546.4). Total num frames: 699689984. Throughput: 0: 5847.9. Samples: 699695042. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:35,098][25689] Avg episode reward: [(0, '-3.369')] [2022-07-10 10:24:35,296][26022] Updated weights on worker 0-0, policy_version 683293 (0.00083) [2022-07-10 10:24:37,043][26022] Updated weights on worker 0-0, policy_version 683303 (0.00094) [2022-07-10 10:24:38,850][26022] Updated weights on worker 0-0, policy_version 683313 (0.00093) [2022-07-10 10:24:40,135][25689] Fps is (10 sec: 5700.6, 60 sec: 5531.3, 300 sec: 5543.8). Total num frames: 699718656. Throughput: 0: 5002.9. Samples: 699711952. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:40,135][25689] Avg episode reward: [(0, '-3.946')] [2022-07-10 10:24:40,907][26022] Updated weights on worker 0-0, policy_version 683323 (0.00099) [2022-07-10 10:24:42,502][26022] Updated weights on worker 0-0, policy_version 683333 (0.00084) [2022-07-10 10:24:44,552][26022] Updated weights on worker 0-0, policy_version 683343 (0.00085) [2022-07-10 10:24:45,181][25689] Fps is (10 sec: 5585.5, 60 sec: 5547.6, 300 sec: 5540.9). Total num frames: 699746304. Throughput: 0: 5823.8. Samples: 699745382. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:45,182][25689] Avg episode reward: [(0, '-2.926')] [2022-07-10 10:24:46,203][26022] Updated weights on worker 0-0, policy_version 683353 (0.00091) [2022-07-10 10:24:48,135][26022] Updated weights on worker 0-0, policy_version 683363 (0.00087) [2022-07-10 10:24:49,892][26022] Updated weights on worker 0-0, policy_version 683373 (0.00086) [2022-07-10 10:24:50,240][25689] Fps is (10 sec: 5674.8, 60 sec: 5547.8, 300 sec: 5544.0). Total num frames: 699776000. Throughput: 0: 5835.3. Samples: 699779010. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-10 10:24:50,240][25689] Avg episode reward: [(0, '-3.479')] [2022-07-10 10:24:51,735][26022] Updated weights on worker 0-0, policy_version 683383 (0.00092) [2022-07-10 10:24:53,513][26022] Updated weights on worker 0-0, policy_version 683393 (0.00083) [2022-07-10 10:24:55,249][25689] Fps is (10 sec: 5593.9, 60 sec: 5538.0, 300 sec: 5538.7). Total num frames: 699802624. Throughput: 0: 5838.6. Samples: 699812698. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:24:55,250][25689] Avg episode reward: [(0, '-1.987')] [2022-07-10 10:24:55,483][26022] Updated weights on worker 0-0, policy_version 683403 (0.00099) [2022-07-10 10:24:57,211][26022] Updated weights on worker 0-0, policy_version 683413 (0.00085) [2022-07-10 10:24:59,306][26022] Updated weights on worker 0-0, policy_version 683423 (0.00082) [2022-07-10 10:25:00,307][25689] Fps is (10 sec: 5492.8, 60 sec: 5551.1, 300 sec: 5548.9). Total num frames: 699831296. Throughput: 0: 5822.4. Samples: 699829402. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:00,307][25689] Avg episode reward: [(0, '-2.037')] [2022-07-10 10:25:00,856][26022] Updated weights on worker 0-0, policy_version 683433 (0.00093) [2022-07-10 10:25:03,278][26022] Updated weights on worker 0-0, policy_version 683443 (0.00092) [2022-07-10 10:25:04,971][26022] Updated weights on worker 0-0, policy_version 683453 (0.00110) [2022-07-10 10:25:05,315][25689] Fps is (10 sec: 5594.9, 60 sec: 5584.7, 300 sec: 5546.3). Total num frames: 699858944. Throughput: 0: 5726.2. Samples: 699860676. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:05,316][25689] Avg episode reward: [(0, '-2.283')] [2022-07-10 10:25:06,943][26022] Updated weights on worker 0-0, policy_version 683463 (0.00093) [2022-07-10 10:25:08,614][26022] Updated weights on worker 0-0, policy_version 683473 (0.00098) [2022-07-10 10:25:10,345][25689] Fps is (10 sec: 5406.5, 60 sec: 5538.7, 300 sec: 5542.9). Total num frames: 699885568. Throughput: 0: 5727.3. Samples: 699894160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:10,345][25689] Avg episode reward: [(0, '-2.332')] [2022-07-10 10:25:10,776][26022] Updated weights on worker 0-0, policy_version 683483 (0.00090) [2022-07-10 10:25:12,243][26022] Updated weights on worker 0-0, policy_version 683493 (0.00090) [2022-07-10 10:25:14,293][26022] Updated weights on worker 0-0, policy_version 683503 (0.00087) [2022-07-10 10:25:15,375][25689] Fps is (10 sec: 5496.8, 60 sec: 5577.2, 300 sec: 5549.4). Total num frames: 699914240. Throughput: 0: 4881.2. Samples: 699910936. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:15,376][25689] Avg episode reward: [(0, '-3.214')] [2022-07-10 10:25:15,857][26022] Updated weights on worker 0-0, policy_version 683513 (0.00087) [2022-07-10 10:25:17,782][26022] Updated weights on worker 0-0, policy_version 683523 (0.00090) [2022-07-10 10:25:19,617][26022] Updated weights on worker 0-0, policy_version 683533 (0.00081) [2022-07-10 10:25:20,383][25689] Fps is (10 sec: 5610.8, 60 sec: 5561.4, 300 sec: 5546.1). Total num frames: 699941888. Throughput: 0: 5758.3. Samples: 699945006. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:20,384][25689] Avg episode reward: [(0, '-2.590')] [2022-07-10 10:25:21,515][26022] Updated weights on worker 0-0, policy_version 683543 (0.00084) [2022-07-10 10:25:23,287][26022] Updated weights on worker 0-0, policy_version 683553 (0.00084) [2022-07-10 10:25:25,128][26022] Updated weights on worker 0-0, policy_version 683563 (0.00093) [2022-07-10 10:25:25,393][25689] Fps is (10 sec: 5519.9, 60 sec: 5528.8, 300 sec: 5543.3). Total num frames: 699969536. Throughput: 0: 5865.7. Samples: 699978442. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:25,393][25689] Avg episode reward: [(0, '-2.587')] [2022-07-10 10:25:26,916][26022] Updated weights on worker 0-0, policy_version 683573 (0.00101) [2022-07-10 10:25:28,912][26022] Updated weights on worker 0-0, policy_version 683583 (0.00088) [2022-07-10 10:25:30,507][25689] Fps is (10 sec: 5461.7, 60 sec: 5557.9, 300 sec: 5544.7). Total num frames: 699997184. Throughput: 0: 4998.5. Samples: 699994940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:30,508][25689] Avg episode reward: [(0, '-3.012')] [2022-07-10 10:25:30,769][26022] Updated weights on worker 0-0, policy_version 683593 (0.00086) [2022-07-10 10:25:32,584][26022] Updated weights on worker 0-0, policy_version 683603 (0.00083) [2022-07-10 10:25:34,302][26022] Updated weights on worker 0-0, policy_version 683613 (0.00081) [2022-07-10 10:25:35,513][25689] Fps is (10 sec: 5463.6, 60 sec: 5542.4, 300 sec: 5541.3). Total num frames: 700024832. Throughput: 0: 5833.1. Samples: 700028406. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:35,514][25689] Avg episode reward: [(0, '-4.293')] [2022-07-10 10:25:36,279][26022] Updated weights on worker 0-0, policy_version 683623 (0.00095) [2022-07-10 10:25:37,951][26022] Updated weights on worker 0-0, policy_version 683633 (0.00088) [2022-07-10 10:25:39,950][26022] Updated weights on worker 0-0, policy_version 683643 (0.00101) [2022-07-10 10:25:40,593][25689] Fps is (10 sec: 5685.7, 60 sec: 5555.5, 300 sec: 5550.2). Total num frames: 700054528. Throughput: 0: 5784.4. Samples: 700061908. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:40,593][25689] Avg episode reward: [(0, '-4.025')] [2022-07-10 10:25:41,587][26022] Updated weights on worker 0-0, policy_version 683653 (0.00094) [2022-07-10 10:25:43,719][26022] Updated weights on worker 0-0, policy_version 683663 (0.00081) [2022-07-10 10:25:45,186][26022] Updated weights on worker 0-0, policy_version 683673 (0.00087) [2022-07-10 10:25:45,647][25689] Fps is (10 sec: 5659.0, 60 sec: 5554.8, 300 sec: 5544.5). Total num frames: 700082176. Throughput: 0: 4958.5. Samples: 700078872. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:45,647][25689] Avg episode reward: [(0, '-3.865')] [2022-07-10 10:25:47,418][26022] Updated weights on worker 0-0, policy_version 683683 (0.00085) [2022-07-10 10:25:48,924][26022] Updated weights on worker 0-0, policy_version 683693 (0.00083) [2022-07-10 10:25:50,789][25689] Fps is (10 sec: 5523.4, 60 sec: 5530.1, 300 sec: 5541.9). Total num frames: 700110848. Throughput: 0: 5797.3. Samples: 700112522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:50,790][25689] Avg episode reward: [(0, '-3.307')] [2022-07-10 10:25:50,845][26022] Updated weights on worker 0-0, policy_version 683703 (0.00089) [2022-07-10 10:25:52,615][26022] Updated weights on worker 0-0, policy_version 683713 (0.00436) [2022-07-10 10:25:54,522][26022] Updated weights on worker 0-0, policy_version 683723 (0.00084) [2022-07-10 10:25:55,799][25689] Fps is (10 sec: 5648.3, 60 sec: 5563.9, 300 sec: 5550.0). Total num frames: 700139520. Throughput: 0: 5813.0. Samples: 700146328. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:25:55,800][25689] Avg episode reward: [(0, '-4.461')] [2022-07-10 10:25:56,194][26022] Updated weights on worker 0-0, policy_version 683733 (0.00093) [2022-07-10 10:25:58,187][26022] Updated weights on worker 0-0, policy_version 683743 (0.00092) [2022-07-10 10:25:59,977][26022] Updated weights on worker 0-0, policy_version 683753 (0.00087) [2022-07-10 10:26:00,848][25689] Fps is (10 sec: 5497.5, 60 sec: 5530.9, 300 sec: 5549.2). Total num frames: 700166144. Throughput: 0: 4989.6. Samples: 700162972. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:00,848][25689] Avg episode reward: [(0, '-3.380')] [2022-07-10 10:26:02,294][26022] Updated weights on worker 0-0, policy_version 683763 (0.00086) [2022-07-10 10:26:04,180][26022] Updated weights on worker 0-0, policy_version 683773 (0.00080) [2022-07-10 10:26:05,859][25689] Fps is (10 sec: 5293.3, 60 sec: 5513.8, 300 sec: 5547.8). Total num frames: 700192768. Throughput: 0: 5705.0. Samples: 700194182. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:05,859][25689] Avg episode reward: [(0, '-3.390')] [2022-07-10 10:26:05,923][26022] Updated weights on worker 0-0, policy_version 683783 (0.00165) [2022-07-10 10:26:08,024][26022] Updated weights on worker 0-0, policy_version 683793 (0.00085) [2022-07-10 10:26:09,402][26022] Updated weights on worker 0-0, policy_version 683803 (0.00097) [2022-07-10 10:26:10,941][25689] Fps is (10 sec: 5377.3, 60 sec: 5525.9, 300 sec: 5546.9). Total num frames: 700220416. Throughput: 0: 5722.4. Samples: 700227836. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:10,941][25689] Avg episode reward: [(0, '-2.451')] [2022-07-10 10:26:11,495][26022] Updated weights on worker 0-0, policy_version 683813 (0.00093) [2022-07-10 10:26:13,043][26022] Updated weights on worker 0-0, policy_version 683823 (0.00096) [2022-07-10 10:26:15,179][26022] Updated weights on worker 0-0, policy_version 683833 (0.00065) [2022-07-10 10:26:15,956][25689] Fps is (10 sec: 5780.6, 60 sec: 5561.1, 300 sec: 5547.0). Total num frames: 700251136. Throughput: 0: 4887.0. Samples: 700244834. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:15,956][25689] Avg episode reward: [(0, '-3.355')] [2022-07-10 10:26:16,815][26022] Updated weights on worker 0-0, policy_version 683843 (0.00089) [2022-07-10 10:26:18,528][26022] Updated weights on worker 0-0, policy_version 683853 (0.00094) [2022-07-10 10:26:20,641][26022] Updated weights on worker 0-0, policy_version 683863 (0.00088) [2022-07-10 10:26:21,005][25689] Fps is (10 sec: 5596.0, 60 sec: 5523.5, 300 sec: 5543.3). Total num frames: 700276736. Throughput: 0: 5736.9. Samples: 700278612. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:21,005][25689] Avg episode reward: [(0, '-2.230')] [2022-07-10 10:26:22,241][26022] Updated weights on worker 0-0, policy_version 683873 (0.00092) [2022-07-10 10:26:23,265][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:26:23,285][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000683878_700291072.pth [2022-07-10 10:26:23,285][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000681926_698292224.pth [2022-07-10 10:26:24,301][26022] Updated weights on worker 0-0, policy_version 683883 (0.00077) [2022-07-10 10:26:26,019][25689] Fps is (10 sec: 5393.4, 60 sec: 5540.1, 300 sec: 5551.2). Total num frames: 700305408. Throughput: 0: 5847.8. Samples: 700312072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:26,019][25689] Avg episode reward: [(0, '-2.520')] [2022-07-10 10:26:26,170][26022] Updated weights on worker 0-0, policy_version 683893 (0.00083) [2022-07-10 10:26:27,819][26022] Updated weights on worker 0-0, policy_version 683903 (0.00090) [2022-07-10 10:26:29,829][26022] Updated weights on worker 0-0, policy_version 683913 (0.00095) [2022-07-10 10:26:31,149][25689] Fps is (10 sec: 5652.9, 60 sec: 5555.5, 300 sec: 5549.0). Total num frames: 700334080. Throughput: 0: 4985.6. Samples: 700328588. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:31,150][25689] Avg episode reward: [(0, '-1.428')] [2022-07-10 10:26:31,516][26022] Updated weights on worker 0-0, policy_version 683923 (0.00084) [2022-07-10 10:26:33,491][26022] Updated weights on worker 0-0, policy_version 683933 (0.00089) [2022-07-10 10:26:35,434][26022] Updated weights on worker 0-0, policy_version 683943 (0.00079) [2022-07-10 10:26:36,162][25689] Fps is (10 sec: 5552.4, 60 sec: 5554.9, 300 sec: 5542.1). Total num frames: 700361728. Throughput: 0: 5777.6. Samples: 700361576. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:36,164][25689] Avg episode reward: [(0, '-3.880')] [2022-07-10 10:26:37,336][26022] Updated weights on worker 0-0, policy_version 683953 (0.00093) [2022-07-10 10:26:39,035][26022] Updated weights on worker 0-0, policy_version 683963 (0.00083) [2022-07-10 10:26:41,065][26022] Updated weights on worker 0-0, policy_version 683973 (0.00085) [2022-07-10 10:26:41,164][25689] Fps is (10 sec: 5418.9, 60 sec: 5511.2, 300 sec: 5545.7). Total num frames: 700388352. Throughput: 0: 5770.9. Samples: 700394950. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:41,165][25689] Avg episode reward: [(0, '-2.743')] [2022-07-10 10:26:42,658][26022] Updated weights on worker 0-0, policy_version 683983 (0.00087) [2022-07-10 10:26:44,527][26022] Updated weights on worker 0-0, policy_version 683993 (0.00092) [2022-07-10 10:26:46,179][25689] Fps is (10 sec: 5622.6, 60 sec: 5548.7, 300 sec: 5547.8). Total num frames: 700418048. Throughput: 0: 4941.6. Samples: 700411694. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:46,179][25689] Avg episode reward: [(0, '-4.383')] [2022-07-10 10:26:46,254][26022] Updated weights on worker 0-0, policy_version 684003 (0.00084) [2022-07-10 10:26:48,439][26022] Updated weights on worker 0-0, policy_version 684013 (0.00105) [2022-07-10 10:26:49,981][26022] Updated weights on worker 0-0, policy_version 684023 (0.00083) [2022-07-10 10:26:51,256][25689] Fps is (10 sec: 5682.3, 60 sec: 5537.7, 300 sec: 5543.0). Total num frames: 700445696. Throughput: 0: 5790.8. Samples: 700445024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:51,256][25689] Avg episode reward: [(0, '-5.359')] [2022-07-10 10:26:52,158][26022] Updated weights on worker 0-0, policy_version 684033 (0.00085) [2022-07-10 10:26:53,397][26022] Updated weights on worker 0-0, policy_version 684043 (0.00086) [2022-07-10 10:26:55,819][26022] Updated weights on worker 0-0, policy_version 684053 (0.00086) [2022-07-10 10:26:56,266][25689] Fps is (10 sec: 5583.5, 60 sec: 5537.8, 300 sec: 5554.0). Total num frames: 700474368. Throughput: 0: 5821.2. Samples: 700478604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:26:56,266][25689] Avg episode reward: [(0, '-5.190')] [2022-07-10 10:26:57,358][26022] Updated weights on worker 0-0, policy_version 684063 (0.00092) [2022-07-10 10:26:59,380][26022] Updated weights on worker 0-0, policy_version 684073 (0.00092) [2022-07-10 10:27:00,943][26022] Updated weights on worker 0-0, policy_version 684083 (0.00094) [2022-07-10 10:27:01,293][25689] Fps is (10 sec: 5611.0, 60 sec: 5556.6, 300 sec: 5550.4). Total num frames: 700502016. Throughput: 0: 4983.3. Samples: 700495260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:01,295][25689] Avg episode reward: [(0, '-6.414')] [2022-07-10 10:27:03,284][26022] Updated weights on worker 0-0, policy_version 684093 (0.00092) [2022-07-10 10:27:05,141][26022] Updated weights on worker 0-0, policy_version 684103 (0.00086) [2022-07-10 10:27:06,303][25689] Fps is (10 sec: 5305.1, 60 sec: 5539.8, 300 sec: 5544.2). Total num frames: 700527616. Throughput: 0: 5712.3. Samples: 700526648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:06,303][25689] Avg episode reward: [(0, '-4.532')] [2022-07-10 10:27:07,035][26022] Updated weights on worker 0-0, policy_version 684113 (0.00112) [2022-07-10 10:27:08,654][26022] Updated weights on worker 0-0, policy_version 684123 (0.00088) [2022-07-10 10:27:10,564][26022] Updated weights on worker 0-0, policy_version 684133 (0.00087) [2022-07-10 10:27:11,429][25689] Fps is (10 sec: 5253.4, 60 sec: 5535.7, 300 sec: 5538.8). Total num frames: 700555264. Throughput: 0: 5723.8. Samples: 700560492. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:11,430][25689] Avg episode reward: [(0, '-4.400')] [2022-07-10 10:27:12,296][26022] Updated weights on worker 0-0, policy_version 684143 (0.00080) [2022-07-10 10:27:14,206][26022] Updated weights on worker 0-0, policy_version 684153 (0.00085) [2022-07-10 10:27:16,084][26022] Updated weights on worker 0-0, policy_version 684163 (0.00089) [2022-07-10 10:27:16,446][25689] Fps is (10 sec: 5552.7, 60 sec: 5501.8, 300 sec: 5539.6). Total num frames: 700583936. Throughput: 0: 5720.1. Samples: 700594036. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:16,446][25689] Avg episode reward: [(0, '-2.519')] [2022-07-10 10:27:18,001][26022] Updated weights on worker 0-0, policy_version 684173 (0.00093) [2022-07-10 10:27:19,777][26022] Updated weights on worker 0-0, policy_version 684183 (0.00088) [2022-07-10 10:27:21,495][25689] Fps is (10 sec: 5697.0, 60 sec: 5552.5, 300 sec: 5538.7). Total num frames: 700612608. Throughput: 0: 5730.1. Samples: 700611016. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:21,495][25689] Avg episode reward: [(0, '-2.280')] [2022-07-10 10:27:21,604][26022] Updated weights on worker 0-0, policy_version 684193 (0.00089) [2022-07-10 10:27:23,387][26022] Updated weights on worker 0-0, policy_version 684203 (0.00087) [2022-07-10 10:27:25,392][26022] Updated weights on worker 0-0, policy_version 684213 (0.00091) [2022-07-10 10:27:26,539][25689] Fps is (10 sec: 5579.7, 60 sec: 5532.8, 300 sec: 5543.2). Total num frames: 700640256. Throughput: 0: 5829.9. Samples: 700644626. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:26,540][25689] Avg episode reward: [(0, '-2.103')] [2022-07-10 10:27:27,170][26022] Updated weights on worker 0-0, policy_version 684223 (0.00094) [2022-07-10 10:27:28,894][26022] Updated weights on worker 0-0, policy_version 684233 (0.00058) [2022-07-10 10:27:30,915][26022] Updated weights on worker 0-0, policy_version 684243 (0.00086) [2022-07-10 10:27:31,594][25689] Fps is (10 sec: 5678.0, 60 sec: 5556.7, 300 sec: 5546.0). Total num frames: 700669952. Throughput: 0: 5827.4. Samples: 700678000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:31,595][25689] Avg episode reward: [(0, '-1.851')] [2022-07-10 10:27:32,614][26022] Updated weights on worker 0-0, policy_version 684253 (0.00089) [2022-07-10 10:27:34,371][26022] Updated weights on worker 0-0, policy_version 684263 (0.00086) [2022-07-10 10:27:36,273][26022] Updated weights on worker 0-0, policy_version 684273 (0.00083) [2022-07-10 10:27:36,647][25689] Fps is (10 sec: 5572.2, 60 sec: 5536.1, 300 sec: 5541.6). Total num frames: 700696576. Throughput: 0: 4985.8. Samples: 700694754. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:36,647][25689] Avg episode reward: [(0, '-3.081')] [2022-07-10 10:27:38,036][26022] Updated weights on worker 0-0, policy_version 684283 (0.00095) [2022-07-10 10:27:40,011][26022] Updated weights on worker 0-0, policy_version 684293 (0.00086) [2022-07-10 10:27:41,679][25689] Fps is (10 sec: 5483.3, 60 sec: 5567.2, 300 sec: 5541.2). Total num frames: 700725248. Throughput: 0: 5806.7. Samples: 700728218. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:41,679][25689] Avg episode reward: [(0, '-1.853')] [2022-07-10 10:27:41,711][26022] Updated weights on worker 0-0, policy_version 684303 (0.00085) [2022-07-10 10:27:43,697][26022] Updated weights on worker 0-0, policy_version 684313 (0.00087) [2022-07-10 10:27:45,454][26022] Updated weights on worker 0-0, policy_version 684323 (0.00091) [2022-07-10 10:27:46,737][25689] Fps is (10 sec: 5581.7, 60 sec: 5529.4, 300 sec: 5545.5). Total num frames: 700752896. Throughput: 0: 5807.0. Samples: 700761912. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:46,737][25689] Avg episode reward: [(0, '-2.244')] [2022-07-10 10:27:47,264][26022] Updated weights on worker 0-0, policy_version 684333 (0.00086) [2022-07-10 10:27:49,179][26022] Updated weights on worker 0-0, policy_version 684343 (0.00085) [2022-07-10 10:27:50,919][26022] Updated weights on worker 0-0, policy_version 684353 (0.00091) [2022-07-10 10:27:51,839][25689] Fps is (10 sec: 5442.5, 60 sec: 5527.1, 300 sec: 5541.4). Total num frames: 700780544. Throughput: 0: 4966.1. Samples: 700778536. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:51,840][25689] Avg episode reward: [(0, '-3.826')] [2022-07-10 10:27:52,763][26022] Updated weights on worker 0-0, policy_version 684363 (0.00094) [2022-07-10 10:27:54,753][26022] Updated weights on worker 0-0, policy_version 684373 (0.00472) [2022-07-10 10:27:56,403][26022] Updated weights on worker 0-0, policy_version 684383 (0.00090) [2022-07-10 10:27:56,908][25689] Fps is (10 sec: 5738.5, 60 sec: 5555.4, 300 sec: 5545.4). Total num frames: 700811264. Throughput: 0: 5788.1. Samples: 700812030. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:27:56,908][25689] Avg episode reward: [(0, '-2.959')] [2022-07-10 10:27:58,339][26022] Updated weights on worker 0-0, policy_version 684393 (0.00086) [2022-07-10 10:28:00,000][26022] Updated weights on worker 0-0, policy_version 684403 (0.00083) [2022-07-10 10:28:01,964][25689] Fps is (10 sec: 5663.2, 60 sec: 5536.0, 300 sec: 5555.2). Total num frames: 700837888. Throughput: 0: 5777.4. Samples: 700845416. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:28:01,965][25689] Avg episode reward: [(0, '-2.399')] [2022-07-10 10:28:02,340][26022] Updated weights on worker 0-0, policy_version 684413 (0.00092) [2022-07-10 10:28:04,363][26022] Updated weights on worker 0-0, policy_version 684423 (0.00090) [2022-07-10 10:28:06,163][26022] Updated weights on worker 0-0, policy_version 684433 (0.00081) [2022-07-10 10:28:07,007][25689] Fps is (10 sec: 5272.5, 60 sec: 5549.8, 300 sec: 5542.2). Total num frames: 700864512. Throughput: 0: 4840.0. Samples: 700860022. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 10:28:07,008][25689] Avg episode reward: [(0, '-2.497')] [2022-07-10 10:28:07,887][26022] Updated weights on worker 0-0, policy_version 684443 (0.00086) [2022-07-10 10:28:09,955][26022] Updated weights on worker 0-0, policy_version 684453 (0.00080) [2022-07-10 10:28:11,645][26022] Updated weights on worker 0-0, policy_version 684463 (0.00082) [2022-07-10 10:28:12,111][25689] Fps is (10 sec: 5348.9, 60 sec: 5551.9, 300 sec: 5544.3). Total num frames: 700892160. Throughput: 0: 5673.8. Samples: 700893556. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:12,111][25689] Avg episode reward: [(0, '-2.022')] [2022-07-10 10:28:13,555][26022] Updated weights on worker 0-0, policy_version 684473 (0.00081) [2022-07-10 10:28:15,300][26022] Updated weights on worker 0-0, policy_version 684483 (0.00090) [2022-07-10 10:28:17,101][26022] Updated weights on worker 0-0, policy_version 684493 (0.00092) [2022-07-10 10:28:17,182][25689] Fps is (10 sec: 5535.5, 60 sec: 5546.9, 300 sec: 5539.8). Total num frames: 700920832. Throughput: 0: 5680.0. Samples: 700927184. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:17,182][25689] Avg episode reward: [(0, '-2.233')] [2022-07-10 10:28:19,039][26022] Updated weights on worker 0-0, policy_version 684503 (0.00091) [2022-07-10 10:28:20,881][26022] Updated weights on worker 0-0, policy_version 684513 (0.00086) [2022-07-10 10:28:22,245][25689] Fps is (10 sec: 5557.3, 60 sec: 5528.7, 300 sec: 5542.2). Total num frames: 700948480. Throughput: 0: 4847.2. Samples: 700943722. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:22,247][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 10:28:22,674][26022] Updated weights on worker 0-0, policy_version 684523 (0.00083) [2022-07-10 10:28:23,409][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:28:23,423][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000684527_700955648.pth [2022-07-10 10:28:23,424][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000682576_698957824.pth [2022-07-10 10:28:24,449][26022] Updated weights on worker 0-0, policy_version 684533 (0.00093) [2022-07-10 10:28:26,351][26022] Updated weights on worker 0-0, policy_version 684543 (0.00089) [2022-07-10 10:28:27,267][25689] Fps is (10 sec: 5584.4, 60 sec: 5547.7, 300 sec: 5539.8). Total num frames: 700977152. Throughput: 0: 5795.3. Samples: 700977434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:27,269][25689] Avg episode reward: [(0, '-1.810')] [2022-07-10 10:28:28,212][26022] Updated weights on worker 0-0, policy_version 684553 (0.00080) [2022-07-10 10:28:29,996][26022] Updated weights on worker 0-0, policy_version 684563 (0.00093) [2022-07-10 10:28:31,871][26022] Updated weights on worker 0-0, policy_version 684573 (0.00085) [2022-07-10 10:28:32,313][25689] Fps is (10 sec: 5594.0, 60 sec: 5514.7, 300 sec: 5544.1). Total num frames: 701004800. Throughput: 0: 5790.1. Samples: 701010530. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:32,315][25689] Avg episode reward: [(0, '-2.673')] [2022-07-10 10:28:33,683][26022] Updated weights on worker 0-0, policy_version 684583 (0.00057) [2022-07-10 10:28:35,430][26022] Updated weights on worker 0-0, policy_version 684593 (0.00092) [2022-07-10 10:28:37,316][25689] Fps is (10 sec: 5400.4, 60 sec: 5519.2, 300 sec: 5534.1). Total num frames: 701031424. Throughput: 0: 4981.8. Samples: 701027492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:37,317][25689] Avg episode reward: [(0, '-2.763')] [2022-07-10 10:28:37,543][26022] Updated weights on worker 0-0, policy_version 684603 (0.00083) [2022-07-10 10:28:39,029][26022] Updated weights on worker 0-0, policy_version 684613 (0.00097) [2022-07-10 10:28:41,059][26022] Updated weights on worker 0-0, policy_version 684623 (0.00082) [2022-07-10 10:28:42,332][25689] Fps is (10 sec: 5621.2, 60 sec: 5537.5, 300 sec: 5544.9). Total num frames: 701061120. Throughput: 0: 5854.6. Samples: 701061324. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:42,333][25689] Avg episode reward: [(0, '-3.002')] [2022-07-10 10:28:42,834][26022] Updated weights on worker 0-0, policy_version 684633 (0.00087) [2022-07-10 10:28:44,686][26022] Updated weights on worker 0-0, policy_version 684643 (0.00088) [2022-07-10 10:28:46,546][26022] Updated weights on worker 0-0, policy_version 684653 (0.00083) [2022-07-10 10:28:47,388][25689] Fps is (10 sec: 5795.2, 60 sec: 5554.6, 300 sec: 5541.6). Total num frames: 701089792. Throughput: 0: 5853.5. Samples: 701095214. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:47,389][25689] Avg episode reward: [(0, '-3.381')] [2022-07-10 10:28:48,428][26022] Updated weights on worker 0-0, policy_version 684663 (0.00086) [2022-07-10 10:28:50,045][26022] Updated weights on worker 0-0, policy_version 684673 (0.00093) [2022-07-10 10:28:51,992][26022] Updated weights on worker 0-0, policy_version 684683 (0.00082) [2022-07-10 10:28:52,493][25689] Fps is (10 sec: 5442.2, 60 sec: 5537.5, 300 sec: 5537.8). Total num frames: 701116416. Throughput: 0: 5030.8. Samples: 701112050. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:52,495][25689] Avg episode reward: [(0, '-4.386')] [2022-07-10 10:28:53,782][26022] Updated weights on worker 0-0, policy_version 684693 (0.00089) [2022-07-10 10:28:55,803][26022] Updated weights on worker 0-0, policy_version 684703 (0.00618) [2022-07-10 10:28:57,147][26022] Updated weights on worker 0-0, policy_version 684713 (0.00087) [2022-07-10 10:28:57,509][25689] Fps is (10 sec: 5665.6, 60 sec: 5542.3, 300 sec: 5548.1). Total num frames: 701147136. Throughput: 0: 5853.0. Samples: 701145682. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:28:57,511][25689] Avg episode reward: [(0, '-3.687')] [2022-07-10 10:28:59,356][26022] Updated weights on worker 0-0, policy_version 684723 (0.00091) [2022-07-10 10:29:01,075][26022] Updated weights on worker 0-0, policy_version 684733 (0.00082) [2022-07-10 10:29:02,563][25689] Fps is (10 sec: 5694.7, 60 sec: 5542.6, 300 sec: 5550.6). Total num frames: 701173760. Throughput: 0: 5823.6. Samples: 701179138. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:02,563][25689] Avg episode reward: [(0, '-3.657')] [2022-07-10 10:29:03,438][26022] Updated weights on worker 0-0, policy_version 684743 (0.00083) [2022-07-10 10:29:05,198][26022] Updated weights on worker 0-0, policy_version 684753 (0.00093) [2022-07-10 10:29:06,947][26022] Updated weights on worker 0-0, policy_version 684763 (0.00088) [2022-07-10 10:29:07,575][25689] Fps is (10 sec: 5290.1, 60 sec: 5545.4, 300 sec: 5541.6). Total num frames: 701200384. Throughput: 0: 4889.4. Samples: 701193916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:07,577][25689] Avg episode reward: [(0, '-4.579')] [2022-07-10 10:29:08,702][26022] Updated weights on worker 0-0, policy_version 684773 (0.00091) [2022-07-10 10:29:10,694][26022] Updated weights on worker 0-0, policy_version 684783 (0.00085) [2022-07-10 10:29:12,404][26022] Updated weights on worker 0-0, policy_version 684793 (0.00094) [2022-07-10 10:29:12,674][25689] Fps is (10 sec: 5570.1, 60 sec: 5579.6, 300 sec: 5551.6). Total num frames: 701230080. Throughput: 0: 5725.3. Samples: 701227592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:12,674][25689] Avg episode reward: [(0, '-5.508')] [2022-07-10 10:29:14,300][26022] Updated weights on worker 0-0, policy_version 684803 (0.00088) [2022-07-10 10:29:16,191][26022] Updated weights on worker 0-0, policy_version 684813 (0.00104) [2022-07-10 10:29:17,691][25689] Fps is (10 sec: 5567.7, 60 sec: 5550.8, 300 sec: 5544.8). Total num frames: 701256704. Throughput: 0: 5724.0. Samples: 701261198. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:17,692][25689] Avg episode reward: [(0, '-4.781')] [2022-07-10 10:29:17,850][26022] Updated weights on worker 0-0, policy_version 684823 (0.00089) [2022-07-10 10:29:19,757][26022] Updated weights on worker 0-0, policy_version 684833 (0.00090) [2022-07-10 10:29:21,646][26022] Updated weights on worker 0-0, policy_version 684843 (0.00093) [2022-07-10 10:29:22,699][25689] Fps is (10 sec: 5413.5, 60 sec: 5555.8, 300 sec: 5538.2). Total num frames: 701284352. Throughput: 0: 5735.4. Samples: 701294630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:22,700][25689] Avg episode reward: [(0, '-4.176')] [2022-07-10 10:29:23,376][26022] Updated weights on worker 0-0, policy_version 684853 (0.00090) [2022-07-10 10:29:25,307][26022] Updated weights on worker 0-0, policy_version 684863 (0.00080) [2022-07-10 10:29:27,092][26022] Updated weights on worker 0-0, policy_version 684873 (0.00093) [2022-07-10 10:29:27,752][25689] Fps is (10 sec: 5597.7, 60 sec: 5553.0, 300 sec: 5548.7). Total num frames: 701313024. Throughput: 0: 5827.7. Samples: 701311500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:27,752][25689] Avg episode reward: [(0, '-4.190')] [2022-07-10 10:29:29,147][26022] Updated weights on worker 0-0, policy_version 684883 (0.00090) [2022-07-10 10:29:30,805][26022] Updated weights on worker 0-0, policy_version 684893 (0.00091) [2022-07-10 10:29:32,723][26022] Updated weights on worker 0-0, policy_version 684903 (0.00087) [2022-07-10 10:29:32,823][25689] Fps is (10 sec: 5664.3, 60 sec: 5567.6, 300 sec: 5547.8). Total num frames: 701341696. Throughput: 0: 5813.6. Samples: 701344730. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:32,823][25689] Avg episode reward: [(0, '-3.459')] [2022-07-10 10:29:34,506][26022] Updated weights on worker 0-0, policy_version 684913 (0.00092) [2022-07-10 10:29:36,405][26022] Updated weights on worker 0-0, policy_version 684923 (0.01139) [2022-07-10 10:29:37,841][25689] Fps is (10 sec: 5582.0, 60 sec: 5583.1, 300 sec: 5544.7). Total num frames: 701369344. Throughput: 0: 5804.8. Samples: 701378170. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:37,842][25689] Avg episode reward: [(0, '-3.693')] [2022-07-10 10:29:38,270][26022] Updated weights on worker 0-0, policy_version 684933 (0.00089) [2022-07-10 10:29:40,067][26022] Updated weights on worker 0-0, policy_version 684943 (0.00085) [2022-07-10 10:29:41,741][26022] Updated weights on worker 0-0, policy_version 684953 (0.00080) [2022-07-10 10:29:42,874][25689] Fps is (10 sec: 5501.4, 60 sec: 5547.8, 300 sec: 5544.9). Total num frames: 701396992. Throughput: 0: 4974.0. Samples: 701394980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:42,875][25689] Avg episode reward: [(0, '-2.526')] [2022-07-10 10:29:43,696][26022] Updated weights on worker 0-0, policy_version 684963 (0.00093) [2022-07-10 10:29:45,373][26022] Updated weights on worker 0-0, policy_version 684973 (0.00086) [2022-07-10 10:29:47,556][26022] Updated weights on worker 0-0, policy_version 684983 (0.00092) [2022-07-10 10:29:47,906][25689] Fps is (10 sec: 5596.0, 60 sec: 5550.0, 300 sec: 5542.0). Total num frames: 701425664. Throughput: 0: 5820.0. Samples: 701428796. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:47,907][25689] Avg episode reward: [(0, '-1.923')] [2022-07-10 10:29:48,984][26022] Updated weights on worker 0-0, policy_version 684993 (0.00083) [2022-07-10 10:29:51,115][26022] Updated weights on worker 0-0, policy_version 685003 (0.00099) [2022-07-10 10:29:52,732][26022] Updated weights on worker 0-0, policy_version 685013 (0.00089) [2022-07-10 10:29:52,954][25689] Fps is (10 sec: 5689.1, 60 sec: 5589.1, 300 sec: 5548.2). Total num frames: 701454336. Throughput: 0: 5844.9. Samples: 701462394. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:52,954][25689] Avg episode reward: [(0, '-2.336')] [2022-07-10 10:29:54,683][26022] Updated weights on worker 0-0, policy_version 685023 (0.00084) [2022-07-10 10:29:56,371][26022] Updated weights on worker 0-0, policy_version 685033 (0.00087) [2022-07-10 10:29:57,995][25689] Fps is (10 sec: 5683.5, 60 sec: 5552.9, 300 sec: 5548.5). Total num frames: 701483008. Throughput: 0: 5009.7. Samples: 701479136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:29:57,996][25689] Avg episode reward: [(0, '-2.451')] [2022-07-10 10:29:58,230][26022] Updated weights on worker 0-0, policy_version 685043 (0.00088) [2022-07-10 10:30:00,085][26022] Updated weights on worker 0-0, policy_version 685053 (0.00093) [2022-07-10 10:30:02,583][26022] Updated weights on worker 0-0, policy_version 685063 (0.00082) [2022-07-10 10:30:03,064][25689] Fps is (10 sec: 5266.5, 60 sec: 5517.6, 300 sec: 5537.0). Total num frames: 701507584. Throughput: 0: 5813.0. Samples: 701512346. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:03,065][25689] Avg episode reward: [(0, '-3.748')] [2022-07-10 10:30:04,104][26022] Updated weights on worker 0-0, policy_version 685073 (0.00090) [2022-07-10 10:30:06,212][26022] Updated weights on worker 0-0, policy_version 685083 (0.00089) [2022-07-10 10:30:07,995][26022] Updated weights on worker 0-0, policy_version 685093 (0.00088) [2022-07-10 10:30:08,089][25689] Fps is (10 sec: 5173.9, 60 sec: 5533.4, 300 sec: 5540.5). Total num frames: 701535232. Throughput: 0: 5687.5. Samples: 701543590. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:08,090][25689] Avg episode reward: [(0, '-4.381')] [2022-07-10 10:30:09,791][26022] Updated weights on worker 0-0, policy_version 685103 (0.00091) [2022-07-10 10:30:11,852][26022] Updated weights on worker 0-0, policy_version 685114 (0.00088) [2022-07-10 10:30:13,148][25689] Fps is (10 sec: 5585.3, 60 sec: 5520.1, 300 sec: 5540.0). Total num frames: 701563904. Throughput: 0: 4839.1. Samples: 701560116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:13,149][25689] Avg episode reward: [(0, '-4.393')] [2022-07-10 10:30:13,795][26022] Updated weights on worker 0-0, policy_version 685124 (0.00080) [2022-07-10 10:30:15,481][26022] Updated weights on worker 0-0, policy_version 685134 (0.00093) [2022-07-10 10:30:17,495][26022] Updated weights on worker 0-0, policy_version 685144 (0.00093) [2022-07-10 10:30:18,152][25689] Fps is (10 sec: 5597.1, 60 sec: 5538.2, 300 sec: 5540.1). Total num frames: 701591552. Throughput: 0: 5694.8. Samples: 701593922. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:18,153][25689] Avg episode reward: [(0, '-5.149')] [2022-07-10 10:30:19,010][26022] Updated weights on worker 0-0, policy_version 685154 (0.00087) [2022-07-10 10:30:20,996][26022] Updated weights on worker 0-0, policy_version 685164 (0.00109) [2022-07-10 10:30:22,770][26022] Updated weights on worker 0-0, policy_version 685174 (0.00093) [2022-07-10 10:30:23,162][25689] Fps is (10 sec: 5624.4, 60 sec: 5555.0, 300 sec: 5543.5). Total num frames: 701620224. Throughput: 0: 5738.3. Samples: 701627670. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:23,163][25689] Avg episode reward: [(0, '-6.245')] [2022-07-10 10:30:23,493][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:30:23,506][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000685177_701621248.pth [2022-07-10 10:30:23,507][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000683226_699623424.pth [2022-07-10 10:30:24,544][26022] Updated weights on worker 0-0, policy_version 685184 (0.00088) [2022-07-10 10:30:26,461][26022] Updated weights on worker 0-0, policy_version 685194 (0.00087) [2022-07-10 10:30:28,166][25689] Fps is (10 sec: 5624.1, 60 sec: 5542.6, 300 sec: 5545.6). Total num frames: 701647872. Throughput: 0: 5029.4. Samples: 701644562. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:28,167][25689] Avg episode reward: [(0, '-5.445')] [2022-07-10 10:30:28,221][26022] Updated weights on worker 0-0, policy_version 685204 (0.00093) [2022-07-10 10:30:30,056][26022] Updated weights on worker 0-0, policy_version 685214 (0.00087) [2022-07-10 10:30:32,076][26022] Updated weights on worker 0-0, policy_version 685224 (0.00090) [2022-07-10 10:30:33,221][25689] Fps is (10 sec: 5598.8, 60 sec: 5544.0, 300 sec: 5548.1). Total num frames: 701676544. Throughput: 0: 5879.9. Samples: 701678142. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:33,223][25689] Avg episode reward: [(0, '-4.313')] [2022-07-10 10:30:33,725][26022] Updated weights on worker 0-0, policy_version 685234 (0.00087) [2022-07-10 10:30:35,914][26022] Updated weights on worker 0-0, policy_version 685244 (0.00087) [2022-07-10 10:30:37,356][26022] Updated weights on worker 0-0, policy_version 685254 (0.00084) [2022-07-10 10:30:38,235][25689] Fps is (10 sec: 5593.2, 60 sec: 5544.4, 300 sec: 5542.4). Total num frames: 701704192. Throughput: 0: 5849.8. Samples: 701711406. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:38,237][25689] Avg episode reward: [(0, '-4.290')] [2022-07-10 10:30:39,318][26022] Updated weights on worker 0-0, policy_version 685264 (0.00090) [2022-07-10 10:30:41,064][26022] Updated weights on worker 0-0, policy_version 685274 (0.00091) [2022-07-10 10:30:42,974][26022] Updated weights on worker 0-0, policy_version 685284 (0.00084) [2022-07-10 10:30:43,256][25689] Fps is (10 sec: 5510.1, 60 sec: 5545.5, 300 sec: 5543.0). Total num frames: 701731840. Throughput: 0: 5003.2. Samples: 701728208. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:43,258][25689] Avg episode reward: [(0, '-5.243')] [2022-07-10 10:30:44,958][26022] Updated weights on worker 0-0, policy_version 685294 (0.00082) [2022-07-10 10:30:46,635][26022] Updated weights on worker 0-0, policy_version 685304 (0.00098) [2022-07-10 10:30:48,276][25689] Fps is (10 sec: 5405.4, 60 sec: 5512.7, 300 sec: 5538.5). Total num frames: 701758464. Throughput: 0: 5824.7. Samples: 701761696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:48,276][25689] Avg episode reward: [(0, '-3.406')] [2022-07-10 10:30:48,519][26022] Updated weights on worker 0-0, policy_version 685314 (0.00097) [2022-07-10 10:30:50,446][26022] Updated weights on worker 0-0, policy_version 685324 (0.00091) [2022-07-10 10:30:52,067][26022] Updated weights on worker 0-0, policy_version 685334 (0.00084) [2022-07-10 10:30:53,389][25689] Fps is (10 sec: 5558.5, 60 sec: 5523.7, 300 sec: 5540.0). Total num frames: 701788160. Throughput: 0: 5812.1. Samples: 701795358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:53,389][25689] Avg episode reward: [(0, '-2.957')] [2022-07-10 10:30:54,230][26022] Updated weights on worker 0-0, policy_version 685344 (0.00099) [2022-07-10 10:30:55,827][26022] Updated weights on worker 0-0, policy_version 685354 (0.00083) [2022-07-10 10:30:57,894][26022] Updated weights on worker 0-0, policy_version 685364 (0.00091) [2022-07-10 10:30:58,405][25689] Fps is (10 sec: 5762.0, 60 sec: 5526.0, 300 sec: 5547.5). Total num frames: 701816832. Throughput: 0: 4993.6. Samples: 701812128. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:30:58,406][25689] Avg episode reward: [(0, '-2.740')] [2022-07-10 10:30:59,399][26022] Updated weights on worker 0-0, policy_version 685374 (0.00083) [2022-07-10 10:31:01,292][26022] Updated weights on worker 0-0, policy_version 685384 (0.00612) [2022-07-10 10:31:03,459][25689] Fps is (10 sec: 5389.3, 60 sec: 5544.3, 300 sec: 5543.2). Total num frames: 701842432. Throughput: 0: 5767.0. Samples: 701844716. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:31:03,460][25689] Avg episode reward: [(0, '-2.537')] [2022-07-10 10:31:03,540][26022] Updated weights on worker 0-0, policy_version 685394 (0.00090) [2022-07-10 10:31:05,284][26022] Updated weights on worker 0-0, policy_version 685404 (0.00094) [2022-07-10 10:31:07,238][26022] Updated weights on worker 0-0, policy_version 685414 (0.00085) [2022-07-10 10:31:08,534][25689] Fps is (10 sec: 5459.4, 60 sec: 5573.6, 300 sec: 5550.2). Total num frames: 701872128. Throughput: 0: 5698.5. Samples: 701877138. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:31:08,534][25689] Avg episode reward: [(0, '-1.888')] [2022-07-10 10:31:09,088][26022] Updated weights on worker 0-0, policy_version 685424 (0.00098) [2022-07-10 10:31:10,932][26022] Updated weights on worker 0-0, policy_version 685434 (0.00089) [2022-07-10 10:31:12,948][26022] Updated weights on worker 0-0, policy_version 685444 (0.00082) [2022-07-10 10:31:13,591][25689] Fps is (10 sec: 5558.6, 60 sec: 5539.9, 300 sec: 5535.7). Total num frames: 701898752. Throughput: 0: 4873.5. Samples: 701893814. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:31:13,591][25689] Avg episode reward: [(0, '-2.697')] [2022-07-10 10:31:14,455][26022] Updated weights on worker 0-0, policy_version 685454 (0.00080) [2022-07-10 10:31:16,610][26022] Updated weights on worker 0-0, policy_version 685464 (0.00088) [2022-07-10 10:31:18,108][26022] Updated weights on worker 0-0, policy_version 685474 (0.00083) [2022-07-10 10:31:18,598][25689] Fps is (10 sec: 5596.3, 60 sec: 5573.5, 300 sec: 5550.2). Total num frames: 701928448. Throughput: 0: 5723.5. Samples: 701927698. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:31:18,598][25689] Avg episode reward: [(0, '-2.807')] [2022-07-10 10:31:20,238][26022] Updated weights on worker 0-0, policy_version 685484 (0.00091) [2022-07-10 10:31:21,863][26022] Updated weights on worker 0-0, policy_version 685494 (0.00087) [2022-07-10 10:31:23,609][25689] Fps is (10 sec: 5622.0, 60 sec: 5539.5, 300 sec: 5543.4). Total num frames: 701955072. Throughput: 0: 5794.9. Samples: 701961482. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:31:23,609][25689] Avg episode reward: [(0, '-2.699')] [2022-07-10 10:31:23,686][26022] Updated weights on worker 0-0, policy_version 685504 (0.00085) [2022-07-10 10:31:25,362][26022] Updated weights on worker 0-0, policy_version 685514 (0.00084) [2022-07-10 10:31:27,386][26022] Updated weights on worker 0-0, policy_version 685524 (0.00094) [2022-07-10 10:31:28,612][25689] Fps is (10 sec: 5521.7, 60 sec: 5556.5, 300 sec: 5545.8). Total num frames: 701983744. Throughput: 0: 5048.5. Samples: 701978502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:31:28,613][25689] Avg episode reward: [(0, '-2.729')] [2022-07-10 10:31:28,927][26022] Updated weights on worker 0-0, policy_version 685534 (0.00089) [2022-07-10 10:31:30,982][26022] Updated weights on worker 0-0, policy_version 685544 (0.00091) [2022-07-10 10:31:32,532][26022] Updated weights on worker 0-0, policy_version 685554 (0.00089) [2022-07-10 10:31:33,739][25689] Fps is (10 sec: 5559.5, 60 sec: 5533.0, 300 sec: 5543.6). Total num frames: 702011392. Throughput: 0: 5876.5. Samples: 702012214. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:31:33,741][25689] Avg episode reward: [(0, '-2.529')] [2022-07-10 10:31:34,615][26022] Updated weights on worker 0-0, policy_version 685564 (0.00085) [2022-07-10 10:31:36,280][26022] Updated weights on worker 0-0, policy_version 685574 (0.00090) [2022-07-10 10:31:38,371][26022] Updated weights on worker 0-0, policy_version 685584 (0.00093) [2022-07-10 10:31:38,762][25689] Fps is (10 sec: 5549.0, 60 sec: 5549.2, 300 sec: 5550.2). Total num frames: 702040064. Throughput: 0: 5853.9. Samples: 702045736. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:31:38,763][25689] Avg episode reward: [(0, '-4.113')] [2022-07-10 10:31:40,077][26022] Updated weights on worker 0-0, policy_version 685594 (0.00061) [2022-07-10 10:31:42,059][26022] Updated weights on worker 0-0, policy_version 685604 (0.00095) [2022-07-10 10:31:43,767][25689] Fps is (10 sec: 5616.3, 60 sec: 5550.6, 300 sec: 5543.4). Total num frames: 702067712. Throughput: 0: 5010.9. Samples: 702062494. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:31:43,769][25689] Avg episode reward: [(0, '-3.651')] [2022-07-10 10:31:43,807][26022] Updated weights on worker 0-0, policy_version 685614 (0.00580) [2022-07-10 10:31:45,873][26022] Updated weights on worker 0-0, policy_version 685624 (0.00090) [2022-07-10 10:31:47,399][26022] Updated weights on worker 0-0, policy_version 685634 (0.00093) [2022-07-10 10:31:48,775][25689] Fps is (10 sec: 5420.0, 60 sec: 5551.7, 300 sec: 5541.3). Total num frames: 702094336. Throughput: 0: 5815.9. Samples: 702095766. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:31:48,775][25689] Avg episode reward: [(0, '-3.235')] [2022-07-10 10:31:49,474][26022] Updated weights on worker 0-0, policy_version 685644 (0.00086) [2022-07-10 10:31:51,220][26022] Updated weights on worker 0-0, policy_version 685654 (0.00085) [2022-07-10 10:31:53,095][26022] Updated weights on worker 0-0, policy_version 685664 (0.00084) [2022-07-10 10:31:53,839][25689] Fps is (10 sec: 5592.0, 60 sec: 5556.2, 300 sec: 5543.7). Total num frames: 702124032. Throughput: 0: 5829.5. Samples: 702129384. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:31:53,840][25689] Avg episode reward: [(0, '-4.985')] [2022-07-10 10:31:54,792][26022] Updated weights on worker 0-0, policy_version 685674 (0.00095) [2022-07-10 10:31:56,635][26022] Updated weights on worker 0-0, policy_version 685684 (0.00090) [2022-07-10 10:31:58,380][26022] Updated weights on worker 0-0, policy_version 685694 (0.00090) [2022-07-10 10:31:58,872][25689] Fps is (10 sec: 5780.7, 60 sec: 5554.7, 300 sec: 5547.1). Total num frames: 702152704. Throughput: 0: 4994.2. Samples: 702146170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:31:58,879][25689] Avg episode reward: [(0, '-4.518')] [2022-07-10 10:32:00,464][26022] Updated weights on worker 0-0, policy_version 685704 (0.00083) [2022-07-10 10:32:02,358][26022] Updated weights on worker 0-0, policy_version 685714 (0.00092) [2022-07-10 10:32:03,902][25689] Fps is (10 sec: 5393.0, 60 sec: 5556.8, 300 sec: 5546.7). Total num frames: 702178304. Throughput: 0: 5741.7. Samples: 702178102. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:03,915][25689] Avg episode reward: [(0, '-2.726')] [2022-07-10 10:32:04,350][26022] Updated weights on worker 0-0, policy_version 685724 (0.00081) [2022-07-10 10:32:06,006][26022] Updated weights on worker 0-0, policy_version 685734 (0.00099) [2022-07-10 10:32:08,154][26022] Updated weights on worker 0-0, policy_version 685744 (0.00083) [2022-07-10 10:32:08,920][25689] Fps is (10 sec: 5401.4, 60 sec: 5545.1, 300 sec: 5552.2). Total num frames: 702206976. Throughput: 0: 5735.8. Samples: 702211312. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:08,920][25689] Avg episode reward: [(0, '-4.464')] [2022-07-10 10:32:09,866][26022] Updated weights on worker 0-0, policy_version 685754 (0.00082) [2022-07-10 10:32:11,832][26022] Updated weights on worker 0-0, policy_version 685764 (0.00089) [2022-07-10 10:32:13,564][26022] Updated weights on worker 0-0, policy_version 685774 (0.00090) [2022-07-10 10:32:14,040][25689] Fps is (10 sec: 5454.1, 60 sec: 5539.3, 300 sec: 5543.3). Total num frames: 702233600. Throughput: 0: 4883.7. Samples: 702228044. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:14,041][25689] Avg episode reward: [(0, '-3.894')] [2022-07-10 10:32:15,433][26022] Updated weights on worker 0-0, policy_version 685784 (0.00080) [2022-07-10 10:32:17,326][26022] Updated weights on worker 0-0, policy_version 685794 (0.00089) [2022-07-10 10:32:19,050][25689] Fps is (10 sec: 5559.6, 60 sec: 5539.0, 300 sec: 5547.5). Total num frames: 702263296. Throughput: 0: 5710.2. Samples: 702261392. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:19,050][25689] Avg episode reward: [(0, '-4.647')] [2022-07-10 10:32:19,059][26022] Updated weights on worker 0-0, policy_version 685804 (0.00097) [2022-07-10 10:32:21,010][26022] Updated weights on worker 0-0, policy_version 685814 (0.00093) [2022-07-10 10:32:22,745][26022] Updated weights on worker 0-0, policy_version 685824 (0.00093) [2022-07-10 10:32:23,628][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:32:23,639][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000685829_702288896.pth [2022-07-10 10:32:23,639][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000683878_700291072.pth [2022-07-10 10:32:24,110][25689] Fps is (10 sec: 5593.4, 60 sec: 5534.6, 300 sec: 5543.8). Total num frames: 702289920. Throughput: 0: 5773.5. Samples: 702294770. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:24,110][25689] Avg episode reward: [(0, '-2.462')] [2022-07-10 10:32:24,504][26022] Updated weights on worker 0-0, policy_version 685834 (0.00081) [2022-07-10 10:32:26,503][26022] Updated weights on worker 0-0, policy_version 685844 (0.00085) [2022-07-10 10:32:28,233][26022] Updated weights on worker 0-0, policy_version 685854 (0.00086) [2022-07-10 10:32:29,136][25689] Fps is (10 sec: 5380.7, 60 sec: 5515.5, 300 sec: 5537.4). Total num frames: 702317568. Throughput: 0: 5783.2. Samples: 702328230. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:29,137][25689] Avg episode reward: [(0, '-2.280')] [2022-07-10 10:32:30,143][26022] Updated weights on worker 0-0, policy_version 685864 (0.00092) [2022-07-10 10:32:32,095][26022] Updated weights on worker 0-0, policy_version 685874 (0.00087) [2022-07-10 10:32:33,750][26022] Updated weights on worker 0-0, policy_version 685884 (0.00085) [2022-07-10 10:32:34,185][25689] Fps is (10 sec: 5691.4, 60 sec: 5556.6, 300 sec: 5547.8). Total num frames: 702347264. Throughput: 0: 5797.3. Samples: 702344828. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:34,186][25689] Avg episode reward: [(0, '-2.924')] [2022-07-10 10:32:35,779][26022] Updated weights on worker 0-0, policy_version 685894 (0.00085) [2022-07-10 10:32:37,395][26022] Updated weights on worker 0-0, policy_version 685904 (0.00091) [2022-07-10 10:32:39,205][25689] Fps is (10 sec: 5593.6, 60 sec: 5522.9, 300 sec: 5541.2). Total num frames: 702373888. Throughput: 0: 5785.2. Samples: 702377994. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:39,207][25689] Avg episode reward: [(0, '-1.468')] [2022-07-10 10:32:39,324][26022] Updated weights on worker 0-0, policy_version 685914 (0.00091) [2022-07-10 10:32:41,313][26022] Updated weights on worker 0-0, policy_version 685924 (0.00086) [2022-07-10 10:32:43,000][26022] Updated weights on worker 0-0, policy_version 685934 (0.00086) [2022-07-10 10:32:44,238][25689] Fps is (10 sec: 5500.3, 60 sec: 5537.3, 300 sec: 5545.1). Total num frames: 702402560. Throughput: 0: 5795.7. Samples: 702411430. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:44,239][25689] Avg episode reward: [(0, '-1.282')] [2022-07-10 10:32:45,007][26022] Updated weights on worker 0-0, policy_version 685944 (0.00088) [2022-07-10 10:32:46,524][26022] Updated weights on worker 0-0, policy_version 685954 (0.00080) [2022-07-10 10:32:48,759][26022] Updated weights on worker 0-0, policy_version 685964 (0.00084) [2022-07-10 10:32:49,249][25689] Fps is (10 sec: 5607.3, 60 sec: 5554.0, 300 sec: 5546.8). Total num frames: 702430208. Throughput: 0: 4964.0. Samples: 702428070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:49,250][25689] Avg episode reward: [(0, '-1.635')] [2022-07-10 10:32:50,491][26022] Updated weights on worker 0-0, policy_version 685974 (0.00090) [2022-07-10 10:32:52,354][26022] Updated weights on worker 0-0, policy_version 685984 (0.00091) [2022-07-10 10:32:54,312][25689] Fps is (10 sec: 5387.6, 60 sec: 5503.3, 300 sec: 5533.1). Total num frames: 702456832. Throughput: 0: 5780.2. Samples: 702461164. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:54,312][25689] Avg episode reward: [(0, '-2.038')] [2022-07-10 10:32:54,358][26022] Updated weights on worker 0-0, policy_version 685994 (0.00090) [2022-07-10 10:32:55,851][26022] Updated weights on worker 0-0, policy_version 686004 (0.00081) [2022-07-10 10:32:57,830][26022] Updated weights on worker 0-0, policy_version 686014 (0.00094) [2022-07-10 10:32:59,318][25689] Fps is (10 sec: 5593.5, 60 sec: 5522.7, 300 sec: 5544.4). Total num frames: 702486528. Throughput: 0: 5794.8. Samples: 702494544. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:32:59,318][25689] Avg episode reward: [(0, '-1.950')] [2022-07-10 10:32:59,616][26022] Updated weights on worker 0-0, policy_version 686024 (0.00087) [2022-07-10 10:33:02,074][26022] Updated weights on worker 0-0, policy_version 686034 (0.00093) [2022-07-10 10:33:03,862][26022] Updated weights on worker 0-0, policy_version 686044 (0.00092) [2022-07-10 10:33:04,334][25689] Fps is (10 sec: 5517.5, 60 sec: 5524.0, 300 sec: 5541.4). Total num frames: 702512128. Throughput: 0: 4858.8. Samples: 702509068. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:04,335][25689] Avg episode reward: [(0, '-2.632')] [2022-07-10 10:33:05,705][26022] Updated weights on worker 0-0, policy_version 686054 (0.00083) [2022-07-10 10:33:07,427][26022] Updated weights on worker 0-0, policy_version 686064 (0.00088) [2022-07-10 10:33:09,348][25689] Fps is (10 sec: 5206.6, 60 sec: 5490.4, 300 sec: 5539.7). Total num frames: 702538752. Throughput: 0: 5697.7. Samples: 702542590. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:09,349][25689] Avg episode reward: [(0, '-2.616')] [2022-07-10 10:33:09,488][26022] Updated weights on worker 0-0, policy_version 686074 (0.00091) [2022-07-10 10:33:11,188][26022] Updated weights on worker 0-0, policy_version 686084 (0.00110) [2022-07-10 10:33:13,134][26022] Updated weights on worker 0-0, policy_version 686094 (0.00092) [2022-07-10 10:33:14,437][25689] Fps is (10 sec: 5473.2, 60 sec: 5527.2, 300 sec: 5539.3). Total num frames: 702567424. Throughput: 0: 5706.3. Samples: 702576004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:14,437][25689] Avg episode reward: [(0, '-3.644')] [2022-07-10 10:33:14,866][26022] Updated weights on worker 0-0, policy_version 686104 (0.00088) [2022-07-10 10:33:16,705][26022] Updated weights on worker 0-0, policy_version 686114 (0.00092) [2022-07-10 10:33:18,549][26022] Updated weights on worker 0-0, policy_version 686124 (0.00091) [2022-07-10 10:33:19,501][25689] Fps is (10 sec: 5648.4, 60 sec: 5505.3, 300 sec: 5542.8). Total num frames: 702596096. Throughput: 0: 4859.5. Samples: 702592622. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:19,501][25689] Avg episode reward: [(0, '-4.935')] [2022-07-10 10:33:20,463][26022] Updated weights on worker 0-0, policy_version 686134 (0.00087) [2022-07-10 10:33:22,088][26022] Updated weights on worker 0-0, policy_version 686144 (0.00088) [2022-07-10 10:33:24,236][26022] Updated weights on worker 0-0, policy_version 686154 (0.00095) [2022-07-10 10:33:24,564][25689] Fps is (10 sec: 5561.0, 60 sec: 5521.9, 300 sec: 5538.5). Total num frames: 702623744. Throughput: 0: 5777.1. Samples: 702625944. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:24,573][25689] Avg episode reward: [(0, '-5.179')] [2022-07-10 10:33:25,829][26022] Updated weights on worker 0-0, policy_version 686164 (0.00091) [2022-07-10 10:33:27,777][26022] Updated weights on worker 0-0, policy_version 686174 (0.00093) [2022-07-10 10:33:29,576][25689] Fps is (10 sec: 5488.0, 60 sec: 5523.2, 300 sec: 5539.2). Total num frames: 702651392. Throughput: 0: 5757.0. Samples: 702659044. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:29,577][25689] Avg episode reward: [(0, '-3.464')] [2022-07-10 10:33:29,795][26022] Updated weights on worker 0-0, policy_version 686184 (0.00087) [2022-07-10 10:33:31,443][26022] Updated weights on worker 0-0, policy_version 686194 (0.00627) [2022-07-10 10:33:33,343][26022] Updated weights on worker 0-0, policy_version 686204 (0.00089) [2022-07-10 10:33:34,636][25689] Fps is (10 sec: 5591.8, 60 sec: 5505.3, 300 sec: 5545.0). Total num frames: 702680064. Throughput: 0: 4927.0. Samples: 702675526. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:34,637][25689] Avg episode reward: [(0, '-3.919')] [2022-07-10 10:33:35,131][26022] Updated weights on worker 0-0, policy_version 686214 (0.00096) [2022-07-10 10:33:37,108][26022] Updated weights on worker 0-0, policy_version 686224 (0.00085) [2022-07-10 10:33:39,112][26022] Updated weights on worker 0-0, policy_version 686234 (0.00099) [2022-07-10 10:33:39,706][25689] Fps is (10 sec: 5459.0, 60 sec: 5500.8, 300 sec: 5533.7). Total num frames: 702706688. Throughput: 0: 5753.2. Samples: 702708866. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:39,706][25689] Avg episode reward: [(0, '-4.211')] [2022-07-10 10:33:40,704][26022] Updated weights on worker 0-0, policy_version 686244 (0.00086) [2022-07-10 10:33:42,849][26022] Updated weights on worker 0-0, policy_version 686254 (0.00094) [2022-07-10 10:33:44,415][26022] Updated weights on worker 0-0, policy_version 686264 (0.00085) [2022-07-10 10:33:44,760][25689] Fps is (10 sec: 5461.7, 60 sec: 5498.8, 300 sec: 5533.7). Total num frames: 702735360. Throughput: 0: 5756.0. Samples: 702742192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:44,761][25689] Avg episode reward: [(0, '-3.100')] [2022-07-10 10:33:46,443][26022] Updated weights on worker 0-0, policy_version 686274 (0.00088) [2022-07-10 10:33:47,987][26022] Updated weights on worker 0-0, policy_version 686284 (0.00083) [2022-07-10 10:33:49,775][25689] Fps is (10 sec: 5593.2, 60 sec: 5498.5, 300 sec: 5538.8). Total num frames: 702763008. Throughput: 0: 4961.6. Samples: 702759262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:49,776][25689] Avg episode reward: [(0, '-2.001')] [2022-07-10 10:33:49,912][26022] Updated weights on worker 0-0, policy_version 686294 (0.00085) [2022-07-10 10:33:51,955][26022] Updated weights on worker 0-0, policy_version 686304 (0.00084) [2022-07-10 10:33:53,672][26022] Updated weights on worker 0-0, policy_version 686314 (0.00082) [2022-07-10 10:33:54,826][25689] Fps is (10 sec: 5595.5, 60 sec: 5533.4, 300 sec: 5531.3). Total num frames: 702791680. Throughput: 0: 5793.1. Samples: 702792486. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:54,826][25689] Avg episode reward: [(0, '-0.913')] [2022-07-10 10:33:55,407][26022] Updated weights on worker 0-0, policy_version 686324 (0.00627) [2022-07-10 10:33:57,421][26022] Updated weights on worker 0-0, policy_version 686334 (0.00087) [2022-07-10 10:33:59,089][26022] Updated weights on worker 0-0, policy_version 686344 (0.00091) [2022-07-10 10:33:59,904][25689] Fps is (10 sec: 5560.3, 60 sec: 5493.0, 300 sec: 5534.3). Total num frames: 702819328. Throughput: 0: 5810.6. Samples: 702826230. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:33:59,904][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 10:34:00,934][26022] Updated weights on worker 0-0, policy_version 686354 (0.00083) [2022-07-10 10:34:03,097][26022] Updated weights on worker 0-0, policy_version 686364 (0.00213) [2022-07-10 10:34:04,909][25689] Fps is (10 sec: 5484.1, 60 sec: 5527.8, 300 sec: 5537.9). Total num frames: 702846976. Throughput: 0: 4966.2. Samples: 702842252. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:04,909][25689] Avg episode reward: [(0, '0.226')] [2022-07-10 10:34:04,914][26022] Updated weights on worker 0-0, policy_version 686374 (0.00088) [2022-07-10 10:34:06,939][26022] Updated weights on worker 0-0, policy_version 686384 (0.00088) [2022-07-10 10:34:08,575][26022] Updated weights on worker 0-0, policy_version 686394 (0.00083) [2022-07-10 10:34:09,992][25689] Fps is (10 sec: 5481.2, 60 sec: 5538.4, 300 sec: 5531.3). Total num frames: 702874624. Throughput: 0: 5715.6. Samples: 702874814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:09,993][25689] Avg episode reward: [(0, '0.390')] [2022-07-10 10:34:10,601][26022] Updated weights on worker 0-0, policy_version 686404 (0.00096) [2022-07-10 10:34:12,147][26022] Updated weights on worker 0-0, policy_version 686414 (0.00090) [2022-07-10 10:34:13,925][26022] Updated weights on worker 0-0, policy_version 686424 (0.00081) [2022-07-10 10:34:15,119][25689] Fps is (10 sec: 5515.8, 60 sec: 5534.9, 300 sec: 5536.1). Total num frames: 702903296. Throughput: 0: 5726.0. Samples: 702908686. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:15,121][25689] Avg episode reward: [(0, '-0.944')] [2022-07-10 10:34:15,911][26022] Updated weights on worker 0-0, policy_version 686434 (0.00559) [2022-07-10 10:34:17,632][26022] Updated weights on worker 0-0, policy_version 686444 (0.00085) [2022-07-10 10:34:19,570][26022] Updated weights on worker 0-0, policy_version 686454 (0.00095) [2022-07-10 10:34:20,131][25689] Fps is (10 sec: 5656.0, 60 sec: 5539.7, 300 sec: 5539.5). Total num frames: 702931968. Throughput: 0: 4909.5. Samples: 702925534. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:20,131][25689] Avg episode reward: [(0, '-1.766')] [2022-07-10 10:34:21,426][26022] Updated weights on worker 0-0, policy_version 686464 (0.00085) [2022-07-10 10:34:23,135][26022] Updated weights on worker 0-0, policy_version 686474 (0.00087) [2022-07-10 10:34:23,898][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:34:23,916][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000686477_702952448.pth [2022-07-10 10:34:23,917][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000684527_700955648.pth [2022-07-10 10:34:24,963][26022] Updated weights on worker 0-0, policy_version 686484 (0.00081) [2022-07-10 10:34:25,140][25689] Fps is (10 sec: 5620.2, 60 sec: 5544.7, 300 sec: 5536.8). Total num frames: 702959616. Throughput: 0: 5774.5. Samples: 702959080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:25,142][25689] Avg episode reward: [(0, '-1.719')] [2022-07-10 10:34:26,809][26022] Updated weights on worker 0-0, policy_version 686494 (0.00097) [2022-07-10 10:34:28,759][26022] Updated weights on worker 0-0, policy_version 686504 (0.00088) [2022-07-10 10:34:30,150][25689] Fps is (10 sec: 5519.1, 60 sec: 5544.9, 300 sec: 5534.5). Total num frames: 702987264. Throughput: 0: 5845.8. Samples: 702992652. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:30,150][25689] Avg episode reward: [(0, '-1.721')] [2022-07-10 10:34:30,729][26022] Updated weights on worker 0-0, policy_version 686514 (0.00089) [2022-07-10 10:34:32,269][26022] Updated weights on worker 0-0, policy_version 686524 (0.00094) [2022-07-10 10:34:34,079][26022] Updated weights on worker 0-0, policy_version 686534 (0.00095) [2022-07-10 10:34:35,257][25689] Fps is (10 sec: 5465.5, 60 sec: 5523.6, 300 sec: 5532.9). Total num frames: 703014912. Throughput: 0: 4997.0. Samples: 703009316. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:35,258][25689] Avg episode reward: [(0, '-2.762')] [2022-07-10 10:34:36,187][26022] Updated weights on worker 0-0, policy_version 686544 (0.00090) [2022-07-10 10:34:37,705][26022] Updated weights on worker 0-0, policy_version 686554 (0.00422) [2022-07-10 10:34:39,841][26022] Updated weights on worker 0-0, policy_version 686564 (0.00087) [2022-07-10 10:34:40,274][25689] Fps is (10 sec: 5562.6, 60 sec: 5562.2, 300 sec: 5536.6). Total num frames: 703043584. Throughput: 0: 5829.2. Samples: 703042956. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:40,275][25689] Avg episode reward: [(0, '-4.290')] [2022-07-10 10:34:41,491][26022] Updated weights on worker 0-0, policy_version 686574 (0.00089) [2022-07-10 10:34:43,431][26022] Updated weights on worker 0-0, policy_version 686584 (0.00090) [2022-07-10 10:34:45,292][25689] Fps is (10 sec: 5612.5, 60 sec: 5548.7, 300 sec: 5533.4). Total num frames: 703071232. Throughput: 0: 5814.9. Samples: 703076262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 10:34:45,293][25689] Avg episode reward: [(0, '-3.847')] [2022-07-10 10:34:45,354][26022] Updated weights on worker 0-0, policy_version 686594 (0.00052) [2022-07-10 10:34:47,029][26022] Updated weights on worker 0-0, policy_version 686604 (0.00103) [2022-07-10 10:34:49,159][26022] Updated weights on worker 0-0, policy_version 686614 (0.00107) [2022-07-10 10:34:50,359][25689] Fps is (10 sec: 5584.7, 60 sec: 5560.8, 300 sec: 5533.1). Total num frames: 703099904. Throughput: 0: 5797.5. Samples: 703109816. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:34:50,360][25689] Avg episode reward: [(0, '-3.451')] [2022-07-10 10:34:50,588][26022] Updated weights on worker 0-0, policy_version 686624 (0.00085) [2022-07-10 10:34:52,661][26022] Updated weights on worker 0-0, policy_version 686634 (0.00085) [2022-07-10 10:34:54,372][26022] Updated weights on worker 0-0, policy_version 686644 (0.00084) [2022-07-10 10:34:55,495][25689] Fps is (10 sec: 5620.6, 60 sec: 5553.0, 300 sec: 5531.3). Total num frames: 703128576. Throughput: 0: 5788.1. Samples: 703126450. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:34:55,495][25689] Avg episode reward: [(0, '-3.141')] [2022-07-10 10:34:56,300][26022] Updated weights on worker 0-0, policy_version 686654 (0.00092) [2022-07-10 10:34:58,070][26022] Updated weights on worker 0-0, policy_version 686664 (0.00099) [2022-07-10 10:35:00,002][26022] Updated weights on worker 0-0, policy_version 686674 (0.00086) [2022-07-10 10:35:00,512][25689] Fps is (10 sec: 5547.4, 60 sec: 5558.6, 300 sec: 5542.6). Total num frames: 703156224. Throughput: 0: 5787.5. Samples: 703160078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:00,512][25689] Avg episode reward: [(0, '-2.563')] [2022-07-10 10:35:01,806][26022] Updated weights on worker 0-0, policy_version 686684 (0.00076) [2022-07-10 10:35:04,179][26022] Updated weights on worker 0-0, policy_version 686694 (0.00083) [2022-07-10 10:35:05,534][25689] Fps is (10 sec: 5507.6, 60 sec: 5557.0, 300 sec: 5542.7). Total num frames: 703183872. Throughput: 0: 5709.1. Samples: 703191828. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:05,535][25689] Avg episode reward: [(0, '-2.974')] [2022-07-10 10:35:05,778][26022] Updated weights on worker 0-0, policy_version 686704 (0.00086) [2022-07-10 10:35:07,742][26022] Updated weights on worker 0-0, policy_version 686714 (0.00088) [2022-07-10 10:35:09,453][26022] Updated weights on worker 0-0, policy_version 686724 (0.00078) [2022-07-10 10:35:10,568][25689] Fps is (10 sec: 5396.9, 60 sec: 5544.7, 300 sec: 5536.3). Total num frames: 703210496. Throughput: 0: 4884.0. Samples: 703208516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:10,568][25689] Avg episode reward: [(0, '-1.928')] [2022-07-10 10:35:11,319][26022] Updated weights on worker 0-0, policy_version 686734 (0.00089) [2022-07-10 10:35:13,307][26022] Updated weights on worker 0-0, policy_version 686744 (0.00089) [2022-07-10 10:35:15,067][26022] Updated weights on worker 0-0, policy_version 686754 (0.00095) [2022-07-10 10:35:15,635][25689] Fps is (10 sec: 5373.3, 60 sec: 5533.3, 300 sec: 5535.1). Total num frames: 703238144. Throughput: 0: 5724.9. Samples: 703241750. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:15,635][25689] Avg episode reward: [(0, '-2.539')] [2022-07-10 10:35:16,783][26022] Updated weights on worker 0-0, policy_version 686764 (0.00083) [2022-07-10 10:35:18,861][26022] Updated weights on worker 0-0, policy_version 686774 (0.00088) [2022-07-10 10:35:20,356][26022] Updated weights on worker 0-0, policy_version 686784 (0.00087) [2022-07-10 10:35:20,647][25689] Fps is (10 sec: 5587.8, 60 sec: 5533.2, 300 sec: 5535.0). Total num frames: 703266816. Throughput: 0: 5712.1. Samples: 703275092. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:20,649][25689] Avg episode reward: [(0, '-1.760')] [2022-07-10 10:35:22,459][26022] Updated weights on worker 0-0, policy_version 686794 (0.00089) [2022-07-10 10:35:24,250][26022] Updated weights on worker 0-0, policy_version 686804 (0.00085) [2022-07-10 10:35:25,670][25689] Fps is (10 sec: 5612.2, 60 sec: 5531.9, 300 sec: 5534.7). Total num frames: 703294464. Throughput: 0: 4964.3. Samples: 703291786. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:25,670][25689] Avg episode reward: [(0, '-2.946')] [2022-07-10 10:35:26,076][26022] Updated weights on worker 0-0, policy_version 686814 (0.00094) [2022-07-10 10:35:27,999][26022] Updated weights on worker 0-0, policy_version 686824 (0.00048) [2022-07-10 10:35:29,640][26022] Updated weights on worker 0-0, policy_version 686834 (0.00079) [2022-07-10 10:35:30,685][25689] Fps is (10 sec: 5610.7, 60 sec: 5548.4, 300 sec: 5535.4). Total num frames: 703323136. Throughput: 0: 5791.3. Samples: 703325020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:30,687][25689] Avg episode reward: [(0, '-2.097')] [2022-07-10 10:35:31,796][26022] Updated weights on worker 0-0, policy_version 686844 (0.00090) [2022-07-10 10:35:33,505][26022] Updated weights on worker 0-0, policy_version 686854 (0.00089) [2022-07-10 10:35:35,279][26022] Updated weights on worker 0-0, policy_version 686864 (0.00097) [2022-07-10 10:35:35,792][25689] Fps is (10 sec: 5665.2, 60 sec: 5565.3, 300 sec: 5537.1). Total num frames: 703351808. Throughput: 0: 5794.7. Samples: 703358556. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:35,792][25689] Avg episode reward: [(0, '-2.028')] [2022-07-10 10:35:37,288][26022] Updated weights on worker 0-0, policy_version 686874 (0.00130) [2022-07-10 10:35:39,005][26022] Updated weights on worker 0-0, policy_version 686884 (0.00087) [2022-07-10 10:35:40,801][25689] Fps is (10 sec: 5365.1, 60 sec: 5515.4, 300 sec: 5530.5). Total num frames: 703377408. Throughput: 0: 4971.6. Samples: 703375288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:40,801][25689] Avg episode reward: [(0, '-2.027')] [2022-07-10 10:35:40,980][26022] Updated weights on worker 0-0, policy_version 686894 (0.00089) [2022-07-10 10:35:42,545][26022] Updated weights on worker 0-0, policy_version 686904 (0.00080) [2022-07-10 10:35:44,567][26022] Updated weights on worker 0-0, policy_version 686914 (0.00092) [2022-07-10 10:35:45,811][25689] Fps is (10 sec: 5519.2, 60 sec: 5549.9, 300 sec: 5541.0). Total num frames: 703407104. Throughput: 0: 5797.2. Samples: 703408548. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:45,812][25689] Avg episode reward: [(0, '-3.268')] [2022-07-10 10:35:46,346][26022] Updated weights on worker 0-0, policy_version 686924 (0.00082) [2022-07-10 10:35:48,291][26022] Updated weights on worker 0-0, policy_version 686934 (0.00090) [2022-07-10 10:35:50,153][26022] Updated weights on worker 0-0, policy_version 686944 (0.00086) [2022-07-10 10:35:50,825][25689] Fps is (10 sec: 5618.0, 60 sec: 5520.8, 300 sec: 5532.5). Total num frames: 703433728. Throughput: 0: 5811.1. Samples: 703442060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:50,826][25689] Avg episode reward: [(0, '-3.246')] [2022-07-10 10:35:51,941][26022] Updated weights on worker 0-0, policy_version 686954 (0.00086) [2022-07-10 10:35:53,955][26022] Updated weights on worker 0-0, policy_version 686964 (0.00088) [2022-07-10 10:35:55,632][26022] Updated weights on worker 0-0, policy_version 686974 (0.00090) [2022-07-10 10:35:55,899][25689] Fps is (10 sec: 5481.4, 60 sec: 5526.5, 300 sec: 5531.4). Total num frames: 703462400. Throughput: 0: 4971.8. Samples: 703458524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:35:55,899][25689] Avg episode reward: [(0, '-4.675')] [2022-07-10 10:35:57,632][26022] Updated weights on worker 0-0, policy_version 686984 (0.00092) [2022-07-10 10:35:59,186][26022] Updated weights on worker 0-0, policy_version 686994 (0.00486) [2022-07-10 10:36:00,911][25689] Fps is (10 sec: 5584.3, 60 sec: 5527.0, 300 sec: 5539.1). Total num frames: 703490048. Throughput: 0: 5803.1. Samples: 703491990. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:00,911][25689] Avg episode reward: [(0, '-3.572')] [2022-07-10 10:36:01,076][26022] Updated weights on worker 0-0, policy_version 687004 (0.00095) [2022-07-10 10:36:03,324][26022] Updated weights on worker 0-0, policy_version 687014 (0.00096) [2022-07-10 10:36:05,090][26022] Updated weights on worker 0-0, policy_version 687024 (0.00098) [2022-07-10 10:36:05,975][25689] Fps is (10 sec: 5284.4, 60 sec: 5489.3, 300 sec: 5525.5). Total num frames: 703515648. Throughput: 0: 5709.3. Samples: 703523674. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:05,976][25689] Avg episode reward: [(0, '-3.540')] [2022-07-10 10:36:07,108][26022] Updated weights on worker 0-0, policy_version 687034 (0.00086) [2022-07-10 10:36:08,884][26022] Updated weights on worker 0-0, policy_version 687044 (0.00085) [2022-07-10 10:36:10,770][26022] Updated weights on worker 0-0, policy_version 687054 (0.00095) [2022-07-10 10:36:11,061][25689] Fps is (10 sec: 5347.1, 60 sec: 5518.4, 300 sec: 5531.9). Total num frames: 703544320. Throughput: 0: 4856.8. Samples: 703540338. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:11,061][25689] Avg episode reward: [(0, '-3.517')] [2022-07-10 10:36:12,707][26022] Updated weights on worker 0-0, policy_version 687064 (0.00091) [2022-07-10 10:36:14,492][26022] Updated weights on worker 0-0, policy_version 687074 (0.00085) [2022-07-10 10:36:16,180][25689] Fps is (10 sec: 5619.7, 60 sec: 5530.6, 300 sec: 5526.4). Total num frames: 703572992. Throughput: 0: 5652.3. Samples: 703573156. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:16,180][25689] Avg episode reward: [(0, '-1.853')] [2022-07-10 10:36:16,402][26022] Updated weights on worker 0-0, policy_version 687084 (0.00085) [2022-07-10 10:36:18,071][26022] Updated weights on worker 0-0, policy_version 687094 (0.00096) [2022-07-10 10:36:19,969][26022] Updated weights on worker 0-0, policy_version 687104 (0.00083) [2022-07-10 10:36:21,220][25689] Fps is (10 sec: 5543.7, 60 sec: 5511.1, 300 sec: 5529.3). Total num frames: 703600640. Throughput: 0: 5655.8. Samples: 703606854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:21,220][25689] Avg episode reward: [(0, '-1.218')] [2022-07-10 10:36:21,878][26022] Updated weights on worker 0-0, policy_version 687114 (0.00091) [2022-07-10 10:36:23,618][26022] Updated weights on worker 0-0, policy_version 687124 (0.00084) [2022-07-10 10:36:24,077][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:36:24,089][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000687127_703618048.pth [2022-07-10 10:36:24,091][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000685177_701621248.pth [2022-07-10 10:36:25,419][26022] Updated weights on worker 0-0, policy_version 687134 (0.00093) [2022-07-10 10:36:26,298][25689] Fps is (10 sec: 5667.3, 60 sec: 5539.9, 300 sec: 5531.3). Total num frames: 703630336. Throughput: 0: 4918.5. Samples: 703623630. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:26,298][25689] Avg episode reward: [(0, '-1.758')] [2022-07-10 10:36:27,226][26022] Updated weights on worker 0-0, policy_version 687144 (0.00090) [2022-07-10 10:36:29,228][26022] Updated weights on worker 0-0, policy_version 687154 (0.00092) [2022-07-10 10:36:30,929][26022] Updated weights on worker 0-0, policy_version 687164 (0.00092) [2022-07-10 10:36:31,301][25689] Fps is (10 sec: 5688.2, 60 sec: 5524.1, 300 sec: 5533.6). Total num frames: 703657984. Throughput: 0: 5767.9. Samples: 703657082. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:31,302][25689] Avg episode reward: [(0, '-1.329')] [2022-07-10 10:36:32,871][26022] Updated weights on worker 0-0, policy_version 687174 (0.00096) [2022-07-10 10:36:34,572][26022] Updated weights on worker 0-0, policy_version 687184 (0.00087) [2022-07-10 10:36:36,374][25689] Fps is (10 sec: 5488.1, 60 sec: 5510.3, 300 sec: 5529.2). Total num frames: 703685632. Throughput: 0: 5819.4. Samples: 703690674. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:36,374][25689] Avg episode reward: [(0, '-1.344')] [2022-07-10 10:36:36,512][26022] Updated weights on worker 0-0, policy_version 687194 (0.00093) [2022-07-10 10:36:38,346][26022] Updated weights on worker 0-0, policy_version 687204 (0.00088) [2022-07-10 10:36:40,205][26022] Updated weights on worker 0-0, policy_version 687214 (0.00081) [2022-07-10 10:36:41,387][25689] Fps is (10 sec: 5482.4, 60 sec: 5543.7, 300 sec: 5529.1). Total num frames: 703713280. Throughput: 0: 4977.3. Samples: 703707232. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:41,388][25689] Avg episode reward: [(0, '-3.661')] [2022-07-10 10:36:42,175][26022] Updated weights on worker 0-0, policy_version 687224 (0.00087) [2022-07-10 10:36:43,828][26022] Updated weights on worker 0-0, policy_version 687234 (0.00090) [2022-07-10 10:36:45,857][26022] Updated weights on worker 0-0, policy_version 687244 (0.00092) [2022-07-10 10:36:46,419][25689] Fps is (10 sec: 5504.6, 60 sec: 5507.9, 300 sec: 5532.1). Total num frames: 703740928. Throughput: 0: 5805.8. Samples: 703740448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:46,420][25689] Avg episode reward: [(0, '-3.647')] [2022-07-10 10:36:47,660][26022] Updated weights on worker 0-0, policy_version 687254 (0.00082) [2022-07-10 10:36:49,426][26022] Updated weights on worker 0-0, policy_version 687264 (0.00087) [2022-07-10 10:36:51,298][26022] Updated weights on worker 0-0, policy_version 687274 (0.00088) [2022-07-10 10:36:51,430][25689] Fps is (10 sec: 5607.8, 60 sec: 5542.0, 300 sec: 5529.6). Total num frames: 703769600. Throughput: 0: 5821.4. Samples: 703774262. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:51,431][25689] Avg episode reward: [(0, '-3.142')] [2022-07-10 10:36:52,969][26022] Updated weights on worker 0-0, policy_version 687284 (0.00086) [2022-07-10 10:36:54,962][26022] Updated weights on worker 0-0, policy_version 687294 (0.00086) [2022-07-10 10:36:56,480][25689] Fps is (10 sec: 5699.8, 60 sec: 5544.2, 300 sec: 5529.3). Total num frames: 703798272. Throughput: 0: 4989.5. Samples: 703790994. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:36:56,480][25689] Avg episode reward: [(0, '-5.497')] [2022-07-10 10:36:56,734][26022] Updated weights on worker 0-0, policy_version 687304 (0.00093) [2022-07-10 10:36:58,542][26022] Updated weights on worker 0-0, policy_version 687314 (0.00089) [2022-07-10 10:37:00,363][26022] Updated weights on worker 0-0, policy_version 687324 (0.00099) [2022-07-10 10:37:01,515][25689] Fps is (10 sec: 5584.9, 60 sec: 5542.1, 300 sec: 5536.1). Total num frames: 703825920. Throughput: 0: 5845.7. Samples: 703824892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:01,515][25689] Avg episode reward: [(0, '-5.129')] [2022-07-10 10:37:02,541][26022] Updated weights on worker 0-0, policy_version 687334 (0.00081) [2022-07-10 10:37:04,416][26022] Updated weights on worker 0-0, policy_version 687344 (0.00090) [2022-07-10 10:37:06,236][26022] Updated weights on worker 0-0, policy_version 687354 (0.00092) [2022-07-10 10:37:06,567][25689] Fps is (10 sec: 5482.0, 60 sec: 5577.0, 300 sec: 5532.0). Total num frames: 703853568. Throughput: 0: 5751.0. Samples: 703856318. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:06,569][25689] Avg episode reward: [(0, '-4.911')] [2022-07-10 10:37:08,013][26022] Updated weights on worker 0-0, policy_version 687364 (0.00083) [2022-07-10 10:37:09,844][26022] Updated weights on worker 0-0, policy_version 687374 (0.00082) [2022-07-10 10:37:11,603][25689] Fps is (10 sec: 5379.4, 60 sec: 5547.7, 300 sec: 5533.6). Total num frames: 703880192. Throughput: 0: 5734.2. Samples: 703889940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:11,604][25689] Avg episode reward: [(0, '-4.581')] [2022-07-10 10:37:11,680][26022] Updated weights on worker 0-0, policy_version 687384 (0.00090) [2022-07-10 10:37:13,554][26022] Updated weights on worker 0-0, policy_version 687394 (0.00094) [2022-07-10 10:37:15,397][26022] Updated weights on worker 0-0, policy_version 687404 (0.00085) [2022-07-10 10:37:16,655][25689] Fps is (10 sec: 5278.3, 60 sec: 5520.0, 300 sec: 5522.5). Total num frames: 703906816. Throughput: 0: 5715.4. Samples: 703906302. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:16,655][25689] Avg episode reward: [(0, '-4.440')] [2022-07-10 10:37:17,272][26022] Updated weights on worker 0-0, policy_version 687414 (0.00086) [2022-07-10 10:37:18,950][26022] Updated weights on worker 0-0, policy_version 687424 (0.00084) [2022-07-10 10:37:20,931][26022] Updated weights on worker 0-0, policy_version 687434 (0.00087) [2022-07-10 10:37:21,674][25689] Fps is (10 sec: 5694.0, 60 sec: 5572.7, 300 sec: 5537.0). Total num frames: 703937536. Throughput: 0: 5720.9. Samples: 703940224. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:21,675][25689] Avg episode reward: [(0, '-4.315')] [2022-07-10 10:37:22,809][26022] Updated weights on worker 0-0, policy_version 687444 (0.00089) [2022-07-10 10:37:24,394][26022] Updated weights on worker 0-0, policy_version 687454 (0.00090) [2022-07-10 10:37:26,411][26022] Updated weights on worker 0-0, policy_version 687464 (0.00082) [2022-07-10 10:37:26,698][25689] Fps is (10 sec: 5709.9, 60 sec: 5526.9, 300 sec: 5533.6). Total num frames: 703964160. Throughput: 0: 5839.5. Samples: 703973872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:26,698][25689] Avg episode reward: [(0, '-3.481')] [2022-07-10 10:37:28,205][26022] Updated weights on worker 0-0, policy_version 687474 (0.00087) [2022-07-10 10:37:30,029][26022] Updated weights on worker 0-0, policy_version 687484 (0.00087) [2022-07-10 10:37:31,703][25689] Fps is (10 sec: 5514.0, 60 sec: 5543.7, 300 sec: 5531.0). Total num frames: 703992832. Throughput: 0: 5012.9. Samples: 703990696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:31,703][25689] Avg episode reward: [(0, '-4.485')] [2022-07-10 10:37:31,740][26022] Updated weights on worker 0-0, policy_version 687494 (0.00082) [2022-07-10 10:37:33,646][26022] Updated weights on worker 0-0, policy_version 687504 (0.00085) [2022-07-10 10:37:35,334][26022] Updated weights on worker 0-0, policy_version 687514 (0.00078) [2022-07-10 10:37:36,736][25689] Fps is (10 sec: 5508.7, 60 sec: 5530.4, 300 sec: 5530.8). Total num frames: 704019456. Throughput: 0: 5880.9. Samples: 704024396. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:36,736][25689] Avg episode reward: [(0, '-3.657')] [2022-07-10 10:37:37,268][26022] Updated weights on worker 0-0, policy_version 687524 (0.00085) [2022-07-10 10:37:38,991][26022] Updated weights on worker 0-0, policy_version 687534 (0.00083) [2022-07-10 10:37:41,048][26022] Updated weights on worker 0-0, policy_version 687544 (0.00089) [2022-07-10 10:37:41,742][25689] Fps is (10 sec: 5508.3, 60 sec: 5548.0, 300 sec: 5531.3). Total num frames: 704048128. Throughput: 0: 5878.5. Samples: 704058188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:41,742][25689] Avg episode reward: [(0, '-3.079')] [2022-07-10 10:37:42,622][26022] Updated weights on worker 0-0, policy_version 687554 (0.00088) [2022-07-10 10:37:44,530][26022] Updated weights on worker 0-0, policy_version 687564 (0.00087) [2022-07-10 10:37:46,431][26022] Updated weights on worker 0-0, policy_version 687574 (0.00093) [2022-07-10 10:37:46,744][25689] Fps is (10 sec: 5729.7, 60 sec: 5567.7, 300 sec: 5534.9). Total num frames: 704076800. Throughput: 0: 5037.7. Samples: 704074858. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:46,744][25689] Avg episode reward: [(0, '-3.931')] [2022-07-10 10:37:48,296][26022] Updated weights on worker 0-0, policy_version 687584 (0.00084) [2022-07-10 10:37:50,116][26022] Updated weights on worker 0-0, policy_version 687594 (0.00091) [2022-07-10 10:37:51,763][25689] Fps is (10 sec: 5620.1, 60 sec: 5550.1, 300 sec: 5539.1). Total num frames: 704104448. Throughput: 0: 5871.2. Samples: 704108472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:51,764][25689] Avg episode reward: [(0, '-3.997')] [2022-07-10 10:37:52,040][26022] Updated weights on worker 0-0, policy_version 687604 (0.00093) [2022-07-10 10:37:53,877][26022] Updated weights on worker 0-0, policy_version 687614 (0.00086) [2022-07-10 10:37:55,749][26022] Updated weights on worker 0-0, policy_version 687624 (0.00087) [2022-07-10 10:37:56,824][25689] Fps is (10 sec: 5587.2, 60 sec: 5549.0, 300 sec: 5534.6). Total num frames: 704133120. Throughput: 0: 5855.7. Samples: 704142028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:37:56,825][25689] Avg episode reward: [(0, '-4.943')] [2022-07-10 10:37:57,552][26022] Updated weights on worker 0-0, policy_version 687634 (0.00094) [2022-07-10 10:37:59,236][26022] Updated weights on worker 0-0, policy_version 687644 (0.00094) [2022-07-10 10:38:01,204][26022] Updated weights on worker 0-0, policy_version 687654 (0.00086) [2022-07-10 10:38:01,851][25689] Fps is (10 sec: 5481.4, 60 sec: 5532.7, 300 sec: 5537.9). Total num frames: 704159744. Throughput: 0: 5005.9. Samples: 704158852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-10 10:38:01,851][25689] Avg episode reward: [(0, '-5.200')] [2022-07-10 10:38:03,532][26022] Updated weights on worker 0-0, policy_version 687664 (0.00090) [2022-07-10 10:38:05,170][26022] Updated weights on worker 0-0, policy_version 687674 (0.00091) [2022-07-10 10:38:06,861][25689] Fps is (10 sec: 5305.0, 60 sec: 5519.6, 300 sec: 5538.0). Total num frames: 704186368. Throughput: 0: 5757.4. Samples: 704190682. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:06,863][25689] Avg episode reward: [(0, '-5.558')] [2022-07-10 10:38:07,170][26022] Updated weights on worker 0-0, policy_version 687684 (0.00089) [2022-07-10 10:38:08,618][26022] Updated weights on worker 0-0, policy_version 687694 (0.00087) [2022-07-10 10:38:10,664][26022] Updated weights on worker 0-0, policy_version 687704 (0.00089) [2022-07-10 10:38:11,881][25689] Fps is (10 sec: 5615.0, 60 sec: 5572.1, 300 sec: 5542.7). Total num frames: 704216064. Throughput: 0: 5771.7. Samples: 704224588. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:11,882][25689] Avg episode reward: [(0, '-6.262')] [2022-07-10 10:38:12,203][26022] Updated weights on worker 0-0, policy_version 687714 (0.00093) [2022-07-10 10:38:14,434][26022] Updated weights on worker 0-0, policy_version 687724 (0.00090) [2022-07-10 10:38:16,078][26022] Updated weights on worker 0-0, policy_version 687734 (0.00090) [2022-07-10 10:38:17,015][25689] Fps is (10 sec: 5647.7, 60 sec: 5581.4, 300 sec: 5537.9). Total num frames: 704243712. Throughput: 0: 4899.1. Samples: 704240944. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:17,015][25689] Avg episode reward: [(0, '-5.022')] [2022-07-10 10:38:17,984][26022] Updated weights on worker 0-0, policy_version 687744 (0.00094) [2022-07-10 10:38:19,747][26022] Updated weights on worker 0-0, policy_version 687754 (0.00087) [2022-07-10 10:38:21,773][26022] Updated weights on worker 0-0, policy_version 687764 (0.00085) [2022-07-10 10:38:22,045][25689] Fps is (10 sec: 5440.1, 60 sec: 5529.5, 300 sec: 5538.6). Total num frames: 704271360. Throughput: 0: 5738.5. Samples: 704274740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:22,046][25689] Avg episode reward: [(0, '-4.564')] [2022-07-10 10:38:23,297][26022] Updated weights on worker 0-0, policy_version 687774 (0.00083) [2022-07-10 10:38:24,403][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:38:24,414][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000687778_704284672.pth [2022-07-10 10:38:24,417][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000685829_702288896.pth [2022-07-10 10:38:25,387][26022] Updated weights on worker 0-0, policy_version 687784 (0.00093) [2022-07-10 10:38:27,038][26022] Updated weights on worker 0-0, policy_version 687794 (0.00089) [2022-07-10 10:38:27,133][25689] Fps is (10 sec: 5667.2, 60 sec: 5574.5, 300 sec: 5544.0). Total num frames: 704301056. Throughput: 0: 5789.4. Samples: 704308044. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:27,134][25689] Avg episode reward: [(0, '-3.256')] [2022-07-10 10:38:29,188][26022] Updated weights on worker 0-0, policy_version 687804 (0.00102) [2022-07-10 10:38:30,991][26022] Updated weights on worker 0-0, policy_version 687814 (0.00093) [2022-07-10 10:38:32,139][25689] Fps is (10 sec: 5580.0, 60 sec: 5540.6, 300 sec: 5538.2). Total num frames: 704327680. Throughput: 0: 4931.8. Samples: 704324496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:32,139][25689] Avg episode reward: [(0, '-2.709')] [2022-07-10 10:38:32,672][26022] Updated weights on worker 0-0, policy_version 687824 (0.00088) [2022-07-10 10:38:34,694][26022] Updated weights on worker 0-0, policy_version 687834 (0.00099) [2022-07-10 10:38:36,293][26022] Updated weights on worker 0-0, policy_version 687844 (0.00089) [2022-07-10 10:38:37,265][25689] Fps is (10 sec: 5457.9, 60 sec: 5565.9, 300 sec: 5544.0). Total num frames: 704356352. Throughput: 0: 5782.6. Samples: 704358040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:37,265][25689] Avg episode reward: [(0, '-2.511')] [2022-07-10 10:38:38,300][26022] Updated weights on worker 0-0, policy_version 687854 (0.00090) [2022-07-10 10:38:40,086][26022] Updated weights on worker 0-0, policy_version 687864 (0.00085) [2022-07-10 10:38:41,813][26022] Updated weights on worker 0-0, policy_version 687874 (0.00088) [2022-07-10 10:38:42,364][25689] Fps is (10 sec: 5608.1, 60 sec: 5557.3, 300 sec: 5543.2). Total num frames: 704385024. Throughput: 0: 5753.3. Samples: 704391638. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:42,365][25689] Avg episode reward: [(0, '-2.663')] [2022-07-10 10:38:43,856][26022] Updated weights on worker 0-0, policy_version 687884 (0.00095) [2022-07-10 10:38:45,610][26022] Updated weights on worker 0-0, policy_version 687894 (0.00069) [2022-07-10 10:38:47,299][26022] Updated weights on worker 0-0, policy_version 687904 (0.00082) [2022-07-10 10:38:47,390][25689] Fps is (10 sec: 5663.2, 60 sec: 5555.1, 300 sec: 5546.4). Total num frames: 704413696. Throughput: 0: 4945.1. Samples: 704408218. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:47,391][25689] Avg episode reward: [(0, '-3.342')] [2022-07-10 10:38:49,290][26022] Updated weights on worker 0-0, policy_version 687914 (0.00094) [2022-07-10 10:38:51,135][26022] Updated weights on worker 0-0, policy_version 687924 (0.00090) [2022-07-10 10:38:52,393][25689] Fps is (10 sec: 5615.8, 60 sec: 5556.6, 300 sec: 5543.8). Total num frames: 704441344. Throughput: 0: 5786.8. Samples: 704441702. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:52,393][25689] Avg episode reward: [(0, '-2.917')] [2022-07-10 10:38:52,927][26022] Updated weights on worker 0-0, policy_version 687934 (0.00078) [2022-07-10 10:38:55,055][26022] Updated weights on worker 0-0, policy_version 687944 (0.00074) [2022-07-10 10:38:56,535][26022] Updated weights on worker 0-0, policy_version 687954 (0.00087) [2022-07-10 10:38:57,462][25689] Fps is (10 sec: 5490.3, 60 sec: 5539.0, 300 sec: 5544.0). Total num frames: 704468992. Throughput: 0: 5800.8. Samples: 704475200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:38:57,462][25689] Avg episode reward: [(0, '-2.089')] [2022-07-10 10:38:58,421][26022] Updated weights on worker 0-0, policy_version 687964 (0.00085) [2022-07-10 10:39:00,256][26022] Updated weights on worker 0-0, policy_version 687974 (0.00086) [2022-07-10 10:39:02,422][26022] Updated weights on worker 0-0, policy_version 687984 (0.00086) [2022-07-10 10:39:02,518][25689] Fps is (10 sec: 5360.1, 60 sec: 5536.3, 300 sec: 5539.6). Total num frames: 704495616. Throughput: 0: 4980.4. Samples: 704492012. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:02,518][25689] Avg episode reward: [(0, '-2.443')] [2022-07-10 10:39:04,452][26022] Updated weights on worker 0-0, policy_version 687994 (0.00084) [2022-07-10 10:39:06,057][26022] Updated weights on worker 0-0, policy_version 688004 (0.00083) [2022-07-10 10:39:07,525][25689] Fps is (10 sec: 5393.4, 60 sec: 5553.5, 300 sec: 5541.0). Total num frames: 704523264. Throughput: 0: 5741.9. Samples: 704523826. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:07,525][25689] Avg episode reward: [(0, '-2.055')] [2022-07-10 10:39:08,051][26022] Updated weights on worker 0-0, policy_version 688014 (0.00086) [2022-07-10 10:39:09,850][26022] Updated weights on worker 0-0, policy_version 688024 (0.00090) [2022-07-10 10:39:11,600][26022] Updated weights on worker 0-0, policy_version 688034 (0.00086) [2022-07-10 10:39:12,603][25689] Fps is (10 sec: 5584.4, 60 sec: 5531.3, 300 sec: 5541.9). Total num frames: 704551936. Throughput: 0: 5725.9. Samples: 704557424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:12,604][25689] Avg episode reward: [(0, '-3.057')] [2022-07-10 10:39:13,537][26022] Updated weights on worker 0-0, policy_version 688044 (0.00088) [2022-07-10 10:39:15,416][26022] Updated weights on worker 0-0, policy_version 688054 (0.00090) [2022-07-10 10:39:17,196][26022] Updated weights on worker 0-0, policy_version 688064 (0.00091) [2022-07-10 10:39:17,726][25689] Fps is (10 sec: 5621.2, 60 sec: 5549.1, 300 sec: 5539.9). Total num frames: 704580608. Throughput: 0: 4869.7. Samples: 704573880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:17,729][25689] Avg episode reward: [(0, '-2.311')] [2022-07-10 10:39:18,934][26022] Updated weights on worker 0-0, policy_version 688074 (0.00055) [2022-07-10 10:39:20,579][26022] Updated weights on worker 0-0, policy_version 688084 (0.00083) [2022-07-10 10:39:22,544][26022] Updated weights on worker 0-0, policy_version 688094 (0.00091) [2022-07-10 10:39:22,791][25689] Fps is (10 sec: 5628.7, 60 sec: 5562.9, 300 sec: 5542.3). Total num frames: 704609280. Throughput: 0: 5713.7. Samples: 704607846. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:22,792][25689] Avg episode reward: [(0, '-2.079')] [2022-07-10 10:39:24,238][26022] Updated weights on worker 0-0, policy_version 688104 (0.00087) [2022-07-10 10:39:26,173][26022] Updated weights on worker 0-0, policy_version 688114 (0.00090) [2022-07-10 10:39:27,818][25689] Fps is (10 sec: 5682.3, 60 sec: 5551.6, 300 sec: 5545.4). Total num frames: 704637952. Throughput: 0: 5797.4. Samples: 704641474. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:27,818][25689] Avg episode reward: [(0, '-2.793')] [2022-07-10 10:39:28,054][26022] Updated weights on worker 0-0, policy_version 688124 (0.00082) [2022-07-10 10:39:29,847][26022] Updated weights on worker 0-0, policy_version 688134 (0.00095) [2022-07-10 10:39:32,013][26022] Updated weights on worker 0-0, policy_version 688144 (0.00092) [2022-07-10 10:39:32,827][25689] Fps is (10 sec: 5611.9, 60 sec: 5568.1, 300 sec: 5547.2). Total num frames: 704665600. Throughput: 0: 4984.9. Samples: 704658236. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:32,827][25689] Avg episode reward: [(0, '-3.002')] [2022-07-10 10:39:33,542][26022] Updated weights on worker 0-0, policy_version 688154 (0.00094) [2022-07-10 10:39:35,527][26022] Updated weights on worker 0-0, policy_version 688164 (0.00093) [2022-07-10 10:39:37,243][26022] Updated weights on worker 0-0, policy_version 688174 (0.00087) [2022-07-10 10:39:37,945][25689] Fps is (10 sec: 5460.3, 60 sec: 5552.0, 300 sec: 5541.9). Total num frames: 704693248. Throughput: 0: 5832.6. Samples: 704691806. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:37,945][25689] Avg episode reward: [(0, '-3.626')] [2022-07-10 10:39:38,967][26022] Updated weights on worker 0-0, policy_version 688184 (0.00091) [2022-07-10 10:39:41,032][26022] Updated weights on worker 0-0, policy_version 688194 (0.00091) [2022-07-10 10:39:42,471][26022] Updated weights on worker 0-0, policy_version 688204 (0.00089) [2022-07-10 10:39:42,963][25689] Fps is (10 sec: 5556.5, 60 sec: 5559.5, 300 sec: 5545.3). Total num frames: 704721920. Throughput: 0: 5827.2. Samples: 704725392. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:42,963][25689] Avg episode reward: [(0, '-3.780')] [2022-07-10 10:39:44,676][26022] Updated weights on worker 0-0, policy_version 688214 (0.00092) [2022-07-10 10:39:46,556][26022] Updated weights on worker 0-0, policy_version 688224 (0.00082) [2022-07-10 10:39:47,996][25689] Fps is (10 sec: 5603.1, 60 sec: 5541.9, 300 sec: 5542.5). Total num frames: 704749568. Throughput: 0: 5820.4. Samples: 704758920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:47,997][25689] Avg episode reward: [(0, '-3.686')] [2022-07-10 10:39:48,192][26022] Updated weights on worker 0-0, policy_version 688234 (0.00096) [2022-07-10 10:39:50,277][26022] Updated weights on worker 0-0, policy_version 688244 (0.00091) [2022-07-10 10:39:51,898][26022] Updated weights on worker 0-0, policy_version 688254 (0.00083) [2022-07-10 10:39:53,015][25689] Fps is (10 sec: 5602.9, 60 sec: 5557.3, 300 sec: 5544.7). Total num frames: 704778240. Throughput: 0: 5816.4. Samples: 704775658. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:53,015][25689] Avg episode reward: [(0, '-3.761')] [2022-07-10 10:39:53,791][26022] Updated weights on worker 0-0, policy_version 688264 (0.00087) [2022-07-10 10:39:55,625][26022] Updated weights on worker 0-0, policy_version 688274 (0.00090) [2022-07-10 10:39:57,375][26022] Updated weights on worker 0-0, policy_version 688284 (0.00093) [2022-07-10 10:39:58,059][25689] Fps is (10 sec: 5495.2, 60 sec: 5542.7, 300 sec: 5540.8). Total num frames: 704804864. Throughput: 0: 5838.6. Samples: 704809244. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:39:58,059][25689] Avg episode reward: [(0, '-3.040')] [2022-07-10 10:39:59,244][26022] Updated weights on worker 0-0, policy_version 688294 (0.00091) [2022-07-10 10:40:01,089][26022] Updated weights on worker 0-0, policy_version 688304 (0.00089) [2022-07-10 10:40:03,061][25689] Fps is (10 sec: 5402.4, 60 sec: 5564.6, 300 sec: 5541.2). Total num frames: 704832512. Throughput: 0: 5727.9. Samples: 704840510. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:03,061][25689] Avg episode reward: [(0, '-4.863')] [2022-07-10 10:40:03,307][26022] Updated weights on worker 0-0, policy_version 688314 (0.00090) [2022-07-10 10:40:05,099][26022] Updated weights on worker 0-0, policy_version 688324 (0.00088) [2022-07-10 10:40:06,912][26022] Updated weights on worker 0-0, policy_version 688334 (0.00091) [2022-07-10 10:40:08,085][25689] Fps is (10 sec: 5413.2, 60 sec: 5546.1, 300 sec: 5541.3). Total num frames: 704859136. Throughput: 0: 4889.6. Samples: 704857144. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:08,085][25689] Avg episode reward: [(0, '-4.009')] [2022-07-10 10:40:08,871][26022] Updated weights on worker 0-0, policy_version 688344 (0.00088) [2022-07-10 10:40:10,751][26022] Updated weights on worker 0-0, policy_version 688354 (0.00080) [2022-07-10 10:40:12,471][26022] Updated weights on worker 0-0, policy_version 688364 (0.00098) [2022-07-10 10:40:13,097][25689] Fps is (10 sec: 5407.5, 60 sec: 5535.2, 300 sec: 5542.3). Total num frames: 704886784. Throughput: 0: 5721.3. Samples: 704890554. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:13,098][25689] Avg episode reward: [(0, '-3.541')] [2022-07-10 10:40:14,481][26022] Updated weights on worker 0-0, policy_version 688374 (0.00096) [2022-07-10 10:40:16,411][26022] Updated weights on worker 0-0, policy_version 688384 (0.00087) [2022-07-10 10:40:17,952][26022] Updated weights on worker 0-0, policy_version 688394 (0.00089) [2022-07-10 10:40:18,165][25689] Fps is (10 sec: 5688.4, 60 sec: 5557.2, 300 sec: 5544.7). Total num frames: 704916480. Throughput: 0: 5698.6. Samples: 704923824. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:18,166][25689] Avg episode reward: [(0, '-3.976')] [2022-07-10 10:40:20,063][26022] Updated weights on worker 0-0, policy_version 688404 (0.00087) [2022-07-10 10:40:21,730][26022] Updated weights on worker 0-0, policy_version 688414 (0.00088) [2022-07-10 10:40:23,173][25689] Fps is (10 sec: 5589.5, 60 sec: 5528.5, 300 sec: 5541.6). Total num frames: 704943104. Throughput: 0: 4985.3. Samples: 704940776. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:23,174][25689] Avg episode reward: [(0, '-4.572')] [2022-07-10 10:40:23,641][26022] Updated weights on worker 0-0, policy_version 688424 (0.00093) [2022-07-10 10:40:24,427][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:40:24,438][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000688428_704950272.pth [2022-07-10 10:40:24,441][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000686477_702952448.pth [2022-07-10 10:40:25,531][26022] Updated weights on worker 0-0, policy_version 688434 (0.00088) [2022-07-10 10:40:27,287][26022] Updated weights on worker 0-0, policy_version 688444 (0.00099) [2022-07-10 10:40:28,213][25689] Fps is (10 sec: 5401.4, 60 sec: 5510.3, 300 sec: 5537.7). Total num frames: 704970752. Throughput: 0: 5825.1. Samples: 704974396. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:28,215][25689] Avg episode reward: [(0, '-3.642')] [2022-07-10 10:40:29,183][26022] Updated weights on worker 0-0, policy_version 688454 (0.00091) [2022-07-10 10:40:31,027][26022] Updated weights on worker 0-0, policy_version 688464 (0.00087) [2022-07-10 10:40:32,698][26022] Updated weights on worker 0-0, policy_version 688474 (0.00089) [2022-07-10 10:40:33,221][25689] Fps is (10 sec: 5707.1, 60 sec: 5544.4, 300 sec: 5543.0). Total num frames: 705000448. Throughput: 0: 5831.2. Samples: 705007900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:33,221][25689] Avg episode reward: [(0, '-2.250')] [2022-07-10 10:40:34,676][26022] Updated weights on worker 0-0, policy_version 688484 (0.00089) [2022-07-10 10:40:36,421][26022] Updated weights on worker 0-0, policy_version 688494 (0.00089) [2022-07-10 10:40:38,233][26022] Updated weights on worker 0-0, policy_version 688504 (0.00086) [2022-07-10 10:40:38,305][25689] Fps is (10 sec: 5682.1, 60 sec: 5547.4, 300 sec: 5548.4). Total num frames: 705028096. Throughput: 0: 5011.6. Samples: 705024758. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:38,306][25689] Avg episode reward: [(0, '-3.283')] [2022-07-10 10:40:40,167][26022] Updated weights on worker 0-0, policy_version 688514 (0.00084) [2022-07-10 10:40:41,925][26022] Updated weights on worker 0-0, policy_version 688524 (0.00092) [2022-07-10 10:40:43,324][25689] Fps is (10 sec: 5372.0, 60 sec: 5513.5, 300 sec: 5538.0). Total num frames: 705054720. Throughput: 0: 5833.3. Samples: 705058320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:43,324][25689] Avg episode reward: [(0, '-2.832')] [2022-07-10 10:40:43,803][26022] Updated weights on worker 0-0, policy_version 688534 (0.00083) [2022-07-10 10:40:45,532][26022] Updated weights on worker 0-0, policy_version 688544 (0.00086) [2022-07-10 10:40:47,620][26022] Updated weights on worker 0-0, policy_version 688554 (0.00092) [2022-07-10 10:40:48,326][25689] Fps is (10 sec: 5620.6, 60 sec: 5550.3, 300 sec: 5548.5). Total num frames: 705084416. Throughput: 0: 5840.5. Samples: 705091862. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:48,326][25689] Avg episode reward: [(0, '-1.762')] [2022-07-10 10:40:49,295][26022] Updated weights on worker 0-0, policy_version 688564 (0.00088) [2022-07-10 10:40:51,211][26022] Updated weights on worker 0-0, policy_version 688574 (0.00083) [2022-07-10 10:40:52,914][26022] Updated weights on worker 0-0, policy_version 688584 (0.00094) [2022-07-10 10:40:53,366][25689] Fps is (10 sec: 5608.4, 60 sec: 5514.4, 300 sec: 5542.3). Total num frames: 705111040. Throughput: 0: 4994.2. Samples: 705108508. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:53,366][25689] Avg episode reward: [(0, '-2.318')] [2022-07-10 10:40:54,852][26022] Updated weights on worker 0-0, policy_version 688594 (0.00090) [2022-07-10 10:40:56,661][26022] Updated weights on worker 0-0, policy_version 688604 (0.00090) [2022-07-10 10:40:58,418][25689] Fps is (10 sec: 5479.0, 60 sec: 5547.6, 300 sec: 5545.0). Total num frames: 705139712. Throughput: 0: 5823.2. Samples: 705141878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:40:58,419][25689] Avg episode reward: [(0, '-3.312')] [2022-07-10 10:40:58,571][26022] Updated weights on worker 0-0, policy_version 688614 (0.00081) [2022-07-10 10:41:00,368][26022] Updated weights on worker 0-0, policy_version 688624 (0.00088) [2022-07-10 10:41:02,628][26022] Updated weights on worker 0-0, policy_version 688634 (0.00089) [2022-07-10 10:41:03,438][25689] Fps is (10 sec: 5489.7, 60 sec: 5528.9, 300 sec: 5549.2). Total num frames: 705166336. Throughput: 0: 5714.0. Samples: 705173256. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:41:03,439][25689] Avg episode reward: [(0, '-2.400')] [2022-07-10 10:41:04,404][26022] Updated weights on worker 0-0, policy_version 688644 (0.00088) [2022-07-10 10:41:06,385][26022] Updated weights on worker 0-0, policy_version 688654 (0.00091) [2022-07-10 10:41:08,141][26022] Updated weights on worker 0-0, policy_version 688664 (0.00096) [2022-07-10 10:41:08,527][25689] Fps is (10 sec: 5267.6, 60 sec: 5523.0, 300 sec: 5542.3). Total num frames: 705192960. Throughput: 0: 4859.4. Samples: 705190026. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:41:08,527][25689] Avg episode reward: [(0, '-2.266')] [2022-07-10 10:41:09,882][26022] Updated weights on worker 0-0, policy_version 688674 (0.00090) [2022-07-10 10:41:11,711][26022] Updated weights on worker 0-0, policy_version 688684 (0.00084) [2022-07-10 10:41:13,575][25689] Fps is (10 sec: 5455.2, 60 sec: 5536.7, 300 sec: 5543.6). Total num frames: 705221632. Throughput: 0: 5696.6. Samples: 705223630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:41:13,575][25689] Avg episode reward: [(0, '-2.624')] [2022-07-10 10:41:13,687][26022] Updated weights on worker 0-0, policy_version 688694 (0.00086) [2022-07-10 10:41:15,375][26022] Updated weights on worker 0-0, policy_version 688704 (0.00077) [2022-07-10 10:41:17,387][26022] Updated weights on worker 0-0, policy_version 688714 (0.00092) [2022-07-10 10:41:18,677][25689] Fps is (10 sec: 5548.5, 60 sec: 5499.8, 300 sec: 5542.4). Total num frames: 705249280. Throughput: 0: 5691.0. Samples: 705257172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:41:18,678][25689] Avg episode reward: [(0, '-3.057')] [2022-07-10 10:41:19,060][26022] Updated weights on worker 0-0, policy_version 688724 (0.00082) [2022-07-10 10:41:21,068][26022] Updated weights on worker 0-0, policy_version 688734 (0.00092) [2022-07-10 10:41:22,688][26022] Updated weights on worker 0-0, policy_version 688744 (0.00082) [2022-07-10 10:41:23,680][25689] Fps is (10 sec: 5674.6, 60 sec: 5551.0, 300 sec: 5543.8). Total num frames: 705278976. Throughput: 0: 4962.7. Samples: 705273710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-10 10:41:23,681][25689] Avg episode reward: [(0, '-2.944')] [2022-07-10 10:41:24,767][26022] Updated weights on worker 0-0, policy_version 688754 (0.00097) [2022-07-10 10:41:26,448][26022] Updated weights on worker 0-0, policy_version 688764 (0.00086) [2022-07-10 10:41:28,306][26022] Updated weights on worker 0-0, policy_version 688774 (0.00086) [2022-07-10 10:41:28,722][25689] Fps is (10 sec: 5810.7, 60 sec: 5567.7, 300 sec: 5546.6). Total num frames: 705307648. Throughput: 0: 5807.6. Samples: 705307312. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:28,723][25689] Avg episode reward: [(0, '-3.212')] [2022-07-10 10:41:30,084][26022] Updated weights on worker 0-0, policy_version 688784 (0.00089) [2022-07-10 10:41:31,827][26022] Updated weights on worker 0-0, policy_version 688794 (0.00085) [2022-07-10 10:41:33,767][25689] Fps is (10 sec: 5482.0, 60 sec: 5513.6, 300 sec: 5543.6). Total num frames: 705334272. Throughput: 0: 5808.7. Samples: 705340920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:33,767][25689] Avg episode reward: [(0, '-4.004')] [2022-07-10 10:41:33,944][26022] Updated weights on worker 0-0, policy_version 688804 (0.00087) [2022-07-10 10:41:35,503][26022] Updated weights on worker 0-0, policy_version 688814 (0.00089) [2022-07-10 10:41:37,326][26022] Updated weights on worker 0-0, policy_version 688824 (0.00090) [2022-07-10 10:41:38,838][25689] Fps is (10 sec: 5466.5, 60 sec: 5531.7, 300 sec: 5546.0). Total num frames: 705362944. Throughput: 0: 4988.9. Samples: 705357750. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:38,838][25689] Avg episode reward: [(0, '-4.123')] [2022-07-10 10:41:39,156][26022] Updated weights on worker 0-0, policy_version 688834 (0.00093) [2022-07-10 10:41:41,235][26022] Updated weights on worker 0-0, policy_version 688844 (0.00089) [2022-07-10 10:41:42,813][26022] Updated weights on worker 0-0, policy_version 688854 (0.00086) [2022-07-10 10:41:43,864][25689] Fps is (10 sec: 5577.9, 60 sec: 5547.9, 300 sec: 5546.1). Total num frames: 705390592. Throughput: 0: 5826.3. Samples: 705391306. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:43,864][25689] Avg episode reward: [(0, '-3.694')] [2022-07-10 10:41:44,763][26022] Updated weights on worker 0-0, policy_version 688864 (0.00838) [2022-07-10 10:41:46,563][26022] Updated weights on worker 0-0, policy_version 688874 (0.00086) [2022-07-10 10:41:48,459][26022] Updated weights on worker 0-0, policy_version 688884 (0.00095) [2022-07-10 10:41:48,911][25689] Fps is (10 sec: 5591.1, 60 sec: 5526.9, 300 sec: 5545.5). Total num frames: 705419264. Throughput: 0: 5825.2. Samples: 705424914. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:48,911][25689] Avg episode reward: [(0, '-3.758')] [2022-07-10 10:41:50,343][26022] Updated weights on worker 0-0, policy_version 688894 (0.00087) [2022-07-10 10:41:52,089][26022] Updated weights on worker 0-0, policy_version 688904 (0.00097) [2022-07-10 10:41:53,935][25689] Fps is (10 sec: 5592.1, 60 sec: 5545.2, 300 sec: 5542.5). Total num frames: 705446912. Throughput: 0: 4995.5. Samples: 705441670. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:53,936][25689] Avg episode reward: [(0, '-2.785')] [2022-07-10 10:41:54,017][26022] Updated weights on worker 0-0, policy_version 688914 (0.00627) [2022-07-10 10:41:55,995][26022] Updated weights on worker 0-0, policy_version 688924 (0.00095) [2022-07-10 10:41:57,591][26022] Updated weights on worker 0-0, policy_version 688934 (0.00085) [2022-07-10 10:41:58,970][25689] Fps is (10 sec: 5497.0, 60 sec: 5529.9, 300 sec: 5542.5). Total num frames: 705474560. Throughput: 0: 5820.8. Samples: 705474936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:41:58,971][25689] Avg episode reward: [(0, '-2.619')] [2022-07-10 10:41:59,415][26022] Updated weights on worker 0-0, policy_version 688944 (0.00107) [2022-07-10 10:42:01,258][26022] Updated weights on worker 0-0, policy_version 688954 (0.00087) [2022-07-10 10:42:03,467][26022] Updated weights on worker 0-0, policy_version 688964 (0.00099) [2022-07-10 10:42:03,979][25689] Fps is (10 sec: 5403.7, 60 sec: 5531.0, 300 sec: 5539.8). Total num frames: 705501184. Throughput: 0: 5723.0. Samples: 705506422. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:03,979][25689] Avg episode reward: [(0, '-2.256')] [2022-07-10 10:42:05,374][26022] Updated weights on worker 0-0, policy_version 688974 (0.00090) [2022-07-10 10:42:07,205][26022] Updated weights on worker 0-0, policy_version 688984 (0.00087) [2022-07-10 10:42:08,981][25689] Fps is (10 sec: 5523.6, 60 sec: 5572.8, 300 sec: 5547.4). Total num frames: 705529856. Throughput: 0: 4901.5. Samples: 705523288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:08,981][25689] Avg episode reward: [(0, '-1.640')] [2022-07-10 10:42:08,985][26022] Updated weights on worker 0-0, policy_version 688994 (0.00087) [2022-07-10 10:42:10,757][26022] Updated weights on worker 0-0, policy_version 689004 (0.00095) [2022-07-10 10:42:12,657][26022] Updated weights on worker 0-0, policy_version 689014 (0.00087) [2022-07-10 10:42:14,056][25689] Fps is (10 sec: 5690.1, 60 sec: 5570.2, 300 sec: 5553.8). Total num frames: 705558528. Throughput: 0: 5728.8. Samples: 705556942. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:14,057][25689] Avg episode reward: [(0, '-1.981')] [2022-07-10 10:42:14,424][26022] Updated weights on worker 0-0, policy_version 689024 (0.00090) [2022-07-10 10:42:16,407][26022] Updated weights on worker 0-0, policy_version 689034 (0.00091) [2022-07-10 10:42:18,288][26022] Updated weights on worker 0-0, policy_version 689044 (0.00090) [2022-07-10 10:42:19,129][25689] Fps is (10 sec: 5347.9, 60 sec: 5539.1, 300 sec: 5535.6). Total num frames: 705584128. Throughput: 0: 5730.2. Samples: 705590452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:19,130][25689] Avg episode reward: [(0, '-2.236')] [2022-07-10 10:42:20,100][26022] Updated weights on worker 0-0, policy_version 689054 (0.00091) [2022-07-10 10:42:22,114][26022] Updated weights on worker 0-0, policy_version 689064 (0.00090) [2022-07-10 10:42:23,530][26022] Updated weights on worker 0-0, policy_version 689074 (0.00055) [2022-07-10 10:42:24,161][25689] Fps is (10 sec: 5472.4, 60 sec: 5536.4, 300 sec: 5545.8). Total num frames: 705613824. Throughput: 0: 4982.3. Samples: 705606976. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:24,162][25689] Avg episode reward: [(0, '-2.197')] [2022-07-10 10:42:24,756][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:42:24,771][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000689079_705616896.pth [2022-07-10 10:42:24,772][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000687127_703618048.pth [2022-07-10 10:42:25,655][26022] Updated weights on worker 0-0, policy_version 689084 (0.00086) [2022-07-10 10:42:27,428][26022] Updated weights on worker 0-0, policy_version 689094 (0.00084) [2022-07-10 10:42:29,174][25689] Fps is (10 sec: 5708.8, 60 sec: 5522.2, 300 sec: 5542.2). Total num frames: 705641472. Throughput: 0: 5788.0. Samples: 705640166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:29,176][25689] Avg episode reward: [(0, '-1.933')] [2022-07-10 10:42:29,311][26022] Updated weights on worker 0-0, policy_version 689104 (0.00092) [2022-07-10 10:42:31,221][26022] Updated weights on worker 0-0, policy_version 689114 (0.00091) [2022-07-10 10:42:33,083][26022] Updated weights on worker 0-0, policy_version 689124 (0.00094) [2022-07-10 10:42:34,217][25689] Fps is (10 sec: 5397.2, 60 sec: 5522.3, 300 sec: 5542.0). Total num frames: 705668096. Throughput: 0: 5781.0. Samples: 705673488. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:34,218][25689] Avg episode reward: [(0, '-2.297')] [2022-07-10 10:42:34,831][26022] Updated weights on worker 0-0, policy_version 689134 (0.00084) [2022-07-10 10:42:36,654][26022] Updated weights on worker 0-0, policy_version 689144 (0.00100) [2022-07-10 10:42:38,519][26022] Updated weights on worker 0-0, policy_version 689154 (0.00092) [2022-07-10 10:42:39,247][25689] Fps is (10 sec: 5489.7, 60 sec: 5526.1, 300 sec: 5541.6). Total num frames: 705696768. Throughput: 0: 4970.5. Samples: 705690446. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:39,249][25689] Avg episode reward: [(0, '-2.469')] [2022-07-10 10:42:40,361][26022] Updated weights on worker 0-0, policy_version 689164 (0.00087) [2022-07-10 10:42:42,235][26022] Updated weights on worker 0-0, policy_version 689174 (0.00092) [2022-07-10 10:42:43,819][26022] Updated weights on worker 0-0, policy_version 689184 (0.00078) [2022-07-10 10:42:44,252][25689] Fps is (10 sec: 5816.3, 60 sec: 5561.9, 300 sec: 5544.9). Total num frames: 705726464. Throughput: 0: 5816.8. Samples: 705723840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:44,254][25689] Avg episode reward: [(0, '-2.426')] [2022-07-10 10:42:45,852][26022] Updated weights on worker 0-0, policy_version 689194 (0.00087) [2022-07-10 10:42:47,752][26022] Updated weights on worker 0-0, policy_version 689204 (0.00087) [2022-07-10 10:42:49,273][25689] Fps is (10 sec: 5616.9, 60 sec: 5530.3, 300 sec: 5541.4). Total num frames: 705753088. Throughput: 0: 5823.3. Samples: 705757212. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:49,275][25689] Avg episode reward: [(0, '-2.810')] [2022-07-10 10:42:49,498][26022] Updated weights on worker 0-0, policy_version 689214 (0.00082) [2022-07-10 10:42:51,488][26022] Updated weights on worker 0-0, policy_version 689224 (0.00086) [2022-07-10 10:42:53,083][26022] Updated weights on worker 0-0, policy_version 689234 (0.00079) [2022-07-10 10:42:54,288][25689] Fps is (10 sec: 5509.8, 60 sec: 5548.2, 300 sec: 5542.3). Total num frames: 705781760. Throughput: 0: 5847.1. Samples: 705790846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:54,289][25689] Avg episode reward: [(0, '-2.665')] [2022-07-10 10:42:55,184][26022] Updated weights on worker 0-0, policy_version 689244 (0.00092) [2022-07-10 10:42:56,940][26022] Updated weights on worker 0-0, policy_version 689254 (0.00092) [2022-07-10 10:42:58,692][26022] Updated weights on worker 0-0, policy_version 689264 (0.00087) [2022-07-10 10:42:59,369][25689] Fps is (10 sec: 5578.6, 60 sec: 5543.9, 300 sec: 5544.7). Total num frames: 705809408. Throughput: 0: 5815.7. Samples: 705807472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:42:59,370][25689] Avg episode reward: [(0, '-2.199')] [2022-07-10 10:43:00,597][26022] Updated weights on worker 0-0, policy_version 689274 (0.00088) [2022-07-10 10:43:02,699][26022] Updated weights on worker 0-0, policy_version 689284 (0.00088) [2022-07-10 10:43:04,390][25689] Fps is (10 sec: 5271.0, 60 sec: 5525.9, 300 sec: 5541.1). Total num frames: 705835008. Throughput: 0: 5710.3. Samples: 705838834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:04,392][25689] Avg episode reward: [(0, '-3.255')] [2022-07-10 10:43:04,603][26022] Updated weights on worker 0-0, policy_version 689294 (0.00083) [2022-07-10 10:43:06,490][26022] Updated weights on worker 0-0, policy_version 689304 (0.00092) [2022-07-10 10:43:08,209][26022] Updated weights on worker 0-0, policy_version 689314 (0.00091) [2022-07-10 10:43:09,446][25689] Fps is (10 sec: 5385.6, 60 sec: 5520.9, 300 sec: 5537.0). Total num frames: 705863680. Throughput: 0: 5717.1. Samples: 705872542. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:09,447][25689] Avg episode reward: [(0, '-2.789')] [2022-07-10 10:43:10,263][26022] Updated weights on worker 0-0, policy_version 689324 (0.00087) [2022-07-10 10:43:11,943][26022] Updated weights on worker 0-0, policy_version 689334 (0.00089) [2022-07-10 10:43:13,747][26022] Updated weights on worker 0-0, policy_version 689344 (0.00091) [2022-07-10 10:43:14,464][25689] Fps is (10 sec: 5590.6, 60 sec: 5509.2, 300 sec: 5539.2). Total num frames: 705891328. Throughput: 0: 4891.5. Samples: 705889538. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:14,465][25689] Avg episode reward: [(0, '-4.228')] [2022-07-10 10:43:15,636][26022] Updated weights on worker 0-0, policy_version 689354 (0.00089) [2022-07-10 10:43:17,547][26022] Updated weights on worker 0-0, policy_version 689364 (0.00087) [2022-07-10 10:43:19,290][26022] Updated weights on worker 0-0, policy_version 689374 (0.00086) [2022-07-10 10:43:19,572][25689] Fps is (10 sec: 5663.0, 60 sec: 5573.7, 300 sec: 5544.6). Total num frames: 705921024. Throughput: 0: 5710.6. Samples: 705922844. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:19,573][25689] Avg episode reward: [(0, '-5.143')] [2022-07-10 10:43:21,300][26022] Updated weights on worker 0-0, policy_version 689384 (0.00085) [2022-07-10 10:43:22,928][26022] Updated weights on worker 0-0, policy_version 689394 (0.00089) [2022-07-10 10:43:24,619][25689] Fps is (10 sec: 5546.2, 60 sec: 5521.6, 300 sec: 5535.0). Total num frames: 705947648. Throughput: 0: 5798.0. Samples: 705956122. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:24,620][25689] Avg episode reward: [(0, '-5.708')] [2022-07-10 10:43:24,841][26022] Updated weights on worker 0-0, policy_version 689404 (0.00084) [2022-07-10 10:43:26,616][26022] Updated weights on worker 0-0, policy_version 689414 (0.00089) [2022-07-10 10:43:28,417][26022] Updated weights on worker 0-0, policy_version 689424 (0.00089) [2022-07-10 10:43:29,648][25689] Fps is (10 sec: 5386.4, 60 sec: 5520.1, 300 sec: 5538.0). Total num frames: 705975296. Throughput: 0: 4953.9. Samples: 705972620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:29,649][25689] Avg episode reward: [(0, '-5.971')] [2022-07-10 10:43:30,448][26022] Updated weights on worker 0-0, policy_version 689434 (0.00086) [2022-07-10 10:43:32,156][26022] Updated weights on worker 0-0, policy_version 689444 (0.00086) [2022-07-10 10:43:34,072][26022] Updated weights on worker 0-0, policy_version 689454 (0.00087) [2022-07-10 10:43:34,674][25689] Fps is (10 sec: 5601.3, 60 sec: 5555.6, 300 sec: 5539.9). Total num frames: 706003968. Throughput: 0: 5767.3. Samples: 706006092. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:34,674][25689] Avg episode reward: [(0, '-4.709')] [2022-07-10 10:43:35,919][26022] Updated weights on worker 0-0, policy_version 689464 (0.00085) [2022-07-10 10:43:37,686][26022] Updated weights on worker 0-0, policy_version 689474 (0.00090) [2022-07-10 10:43:39,630][26022] Updated weights on worker 0-0, policy_version 689484 (0.00084) [2022-07-10 10:43:39,720][25689] Fps is (10 sec: 5592.1, 60 sec: 5537.2, 300 sec: 5537.5). Total num frames: 706031616. Throughput: 0: 5804.5. Samples: 706039788. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:39,720][25689] Avg episode reward: [(0, '-4.668')] [2022-07-10 10:43:41,334][26022] Updated weights on worker 0-0, policy_version 689494 (0.00091) [2022-07-10 10:43:43,146][26022] Updated weights on worker 0-0, policy_version 689504 (0.00089) [2022-07-10 10:43:44,770][25689] Fps is (10 sec: 5578.2, 60 sec: 5516.1, 300 sec: 5537.0). Total num frames: 706060288. Throughput: 0: 4988.9. Samples: 706056654. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:44,770][25689] Avg episode reward: [(0, '-2.747')] [2022-07-10 10:43:45,141][26022] Updated weights on worker 0-0, policy_version 689514 (0.00093) [2022-07-10 10:43:46,654][26022] Updated weights on worker 0-0, policy_version 689524 (0.00088) [2022-07-10 10:43:48,832][26022] Updated weights on worker 0-0, policy_version 689534 (0.00091) [2022-07-10 10:43:49,783][25689] Fps is (10 sec: 5800.2, 60 sec: 5567.7, 300 sec: 5543.7). Total num frames: 706089984. Throughput: 0: 5850.8. Samples: 706090422. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:49,783][25689] Avg episode reward: [(0, '-3.821')] [2022-07-10 10:43:50,360][26022] Updated weights on worker 0-0, policy_version 689544 (0.00094) [2022-07-10 10:43:52,510][26022] Updated weights on worker 0-0, policy_version 689554 (0.00083) [2022-07-10 10:43:54,162][26022] Updated weights on worker 0-0, policy_version 689564 (0.00096) [2022-07-10 10:43:54,792][25689] Fps is (10 sec: 5517.4, 60 sec: 5517.4, 300 sec: 5537.9). Total num frames: 706115584. Throughput: 0: 5841.4. Samples: 706123612. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:54,793][25689] Avg episode reward: [(0, '-1.854')] [2022-07-10 10:43:56,004][26022] Updated weights on worker 0-0, policy_version 689574 (0.00094) [2022-07-10 10:43:57,886][26022] Updated weights on worker 0-0, policy_version 689584 (0.00087) [2022-07-10 10:43:59,772][26022] Updated weights on worker 0-0, policy_version 689594 (0.00079) [2022-07-10 10:43:59,850][25689] Fps is (10 sec: 5391.0, 60 sec: 5536.5, 300 sec: 5544.8). Total num frames: 706144256. Throughput: 0: 4982.4. Samples: 706140084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:43:59,850][25689] Avg episode reward: [(0, '-1.849')] [2022-07-10 10:44:01,898][26022] Updated weights on worker 0-0, policy_version 689604 (0.00092) [2022-07-10 10:44:03,780][26022] Updated weights on worker 0-0, policy_version 689614 (0.00087) [2022-07-10 10:44:04,855][25689] Fps is (10 sec: 5291.6, 60 sec: 5521.0, 300 sec: 5534.5). Total num frames: 706168832. Throughput: 0: 5723.0. Samples: 706171600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:04,855][25689] Avg episode reward: [(0, '-2.844')] [2022-07-10 10:44:05,782][26022] Updated weights on worker 0-0, policy_version 689624 (0.00085) [2022-07-10 10:44:07,610][26022] Updated weights on worker 0-0, policy_version 689634 (0.00088) [2022-07-10 10:44:09,418][26022] Updated weights on worker 0-0, policy_version 689644 (0.00082) [2022-07-10 10:44:09,871][25689] Fps is (10 sec: 5415.8, 60 sec: 5541.6, 300 sec: 5539.1). Total num frames: 706198528. Throughput: 0: 5717.7. Samples: 706205280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:09,871][25689] Avg episode reward: [(0, '-4.363')] [2022-07-10 10:44:11,253][26022] Updated weights on worker 0-0, policy_version 689654 (0.00087) [2022-07-10 10:44:12,863][26022] Updated weights on worker 0-0, policy_version 689664 (0.00089) [2022-07-10 10:44:14,891][25689] Fps is (10 sec: 5509.9, 60 sec: 5507.5, 300 sec: 5530.7). Total num frames: 706224128. Throughput: 0: 4899.8. Samples: 706222090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:14,893][25689] Avg episode reward: [(0, '-4.074')] [2022-07-10 10:44:15,063][26022] Updated weights on worker 0-0, policy_version 689674 (0.00085) [2022-07-10 10:44:16,626][26022] Updated weights on worker 0-0, policy_version 689684 (0.00088) [2022-07-10 10:44:18,501][26022] Updated weights on worker 0-0, policy_version 689694 (0.00082) [2022-07-10 10:44:19,943][25689] Fps is (10 sec: 5489.9, 60 sec: 5512.6, 300 sec: 5534.4). Total num frames: 706253824. Throughput: 0: 5741.8. Samples: 706255456. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:19,944][25689] Avg episode reward: [(0, '-4.730')] [2022-07-10 10:44:20,298][26022] Updated weights on worker 0-0, policy_version 689704 (0.00090) [2022-07-10 10:44:22,253][26022] Updated weights on worker 0-0, policy_version 689714 (0.00077) [2022-07-10 10:44:24,153][26022] Updated weights on worker 0-0, policy_version 689724 (0.00090) [2022-07-10 10:44:24,898][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:44:24,910][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000689729_706282496.pth [2022-07-10 10:44:24,911][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000687778_704284672.pth [2022-07-10 10:44:24,970][25689] Fps is (10 sec: 5790.6, 60 sec: 5548.3, 300 sec: 5534.3). Total num frames: 706282496. Throughput: 0: 5837.5. Samples: 706289024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:24,972][25689] Avg episode reward: [(0, '-5.288')] [2022-07-10 10:44:25,980][26022] Updated weights on worker 0-0, policy_version 689734 (0.00114) [2022-07-10 10:44:27,677][26022] Updated weights on worker 0-0, policy_version 689744 (0.00087) [2022-07-10 10:44:29,690][26022] Updated weights on worker 0-0, policy_version 689754 (0.00094) [2022-07-10 10:44:29,983][25689] Fps is (10 sec: 5609.8, 60 sec: 5549.9, 300 sec: 5534.3). Total num frames: 706310144. Throughput: 0: 4993.7. Samples: 706305714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:29,983][25689] Avg episode reward: [(0, '-5.928')] [2022-07-10 10:44:31,179][26022] Updated weights on worker 0-0, policy_version 689764 (0.00084) [2022-07-10 10:44:33,371][26022] Updated weights on worker 0-0, policy_version 689774 (0.00087) [2022-07-10 10:44:35,005][25689] Fps is (10 sec: 5510.2, 60 sec: 5533.1, 300 sec: 5536.1). Total num frames: 706337792. Throughput: 0: 5829.0. Samples: 706339340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:35,006][25689] Avg episode reward: [(0, '-6.070')] [2022-07-10 10:44:35,047][26022] Updated weights on worker 0-0, policy_version 689784 (0.00084) [2022-07-10 10:44:36,880][26022] Updated weights on worker 0-0, policy_version 689794 (0.00373) [2022-07-10 10:44:38,668][26022] Updated weights on worker 0-0, policy_version 689804 (0.00090) [2022-07-10 10:44:40,142][25689] Fps is (10 sec: 5644.6, 60 sec: 5558.7, 300 sec: 5537.3). Total num frames: 706367488. Throughput: 0: 5807.9. Samples: 706372768. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-10 10:44:40,142][25689] Avg episode reward: [(0, '-4.977')] [2022-07-10 10:44:40,616][26022] Updated weights on worker 0-0, policy_version 689814 (0.00088) [2022-07-10 10:44:42,289][26022] Updated weights on worker 0-0, policy_version 689824 (0.00088) [2022-07-10 10:44:44,247][26022] Updated weights on worker 0-0, policy_version 689834 (0.00087) [2022-07-10 10:44:45,152][25689] Fps is (10 sec: 5550.9, 60 sec: 5528.6, 300 sec: 5534.3). Total num frames: 706394112. Throughput: 0: 4984.8. Samples: 706389624. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:44:45,152][25689] Avg episode reward: [(0, '-3.311')] [2022-07-10 10:44:46,007][26022] Updated weights on worker 0-0, policy_version 689844 (0.00104) [2022-07-10 10:44:47,960][26022] Updated weights on worker 0-0, policy_version 689854 (0.00085) [2022-07-10 10:44:49,657][26022] Updated weights on worker 0-0, policy_version 689864 (0.00091) [2022-07-10 10:44:50,176][25689] Fps is (10 sec: 5510.9, 60 sec: 5510.6, 300 sec: 5534.2). Total num frames: 706422784. Throughput: 0: 5808.7. Samples: 706423010. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:44:50,176][25689] Avg episode reward: [(0, '-2.777')] [2022-07-10 10:44:51,588][26022] Updated weights on worker 0-0, policy_version 689874 (0.00086) [2022-07-10 10:44:53,395][26022] Updated weights on worker 0-0, policy_version 689884 (0.00094) [2022-07-10 10:44:55,192][25689] Fps is (10 sec: 5609.2, 60 sec: 5543.8, 300 sec: 5538.1). Total num frames: 706450432. Throughput: 0: 5791.0. Samples: 706456242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:44:55,193][25689] Avg episode reward: [(0, '-4.275')] [2022-07-10 10:44:55,322][26022] Updated weights on worker 0-0, policy_version 689894 (0.00086) [2022-07-10 10:44:57,149][26022] Updated weights on worker 0-0, policy_version 689904 (0.00091) [2022-07-10 10:44:59,263][26022] Updated weights on worker 0-0, policy_version 689914 (0.00088) [2022-07-10 10:45:00,289][25689] Fps is (10 sec: 5467.9, 60 sec: 5523.3, 300 sec: 5536.4). Total num frames: 706478080. Throughput: 0: 4971.6. Samples: 706472930. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:00,289][25689] Avg episode reward: [(0, '-3.920')] [2022-07-10 10:45:00,743][26022] Updated weights on worker 0-0, policy_version 689924 (0.00088) [2022-07-10 10:45:03,138][26022] Updated weights on worker 0-0, policy_version 689934 (0.00086) [2022-07-10 10:45:04,637][26022] Updated weights on worker 0-0, policy_version 689944 (0.00092) [2022-07-10 10:45:05,308][25689] Fps is (10 sec: 5466.5, 60 sec: 5572.9, 300 sec: 5539.9). Total num frames: 706505728. Throughput: 0: 5703.4. Samples: 706504582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:05,308][25689] Avg episode reward: [(0, '-4.439')] [2022-07-10 10:45:06,654][26022] Updated weights on worker 0-0, policy_version 689954 (0.00088) [2022-07-10 10:45:08,342][26022] Updated weights on worker 0-0, policy_version 689964 (0.00095) [2022-07-10 10:45:10,279][26022] Updated weights on worker 0-0, policy_version 689974 (0.00092) [2022-07-10 10:45:10,313][25689] Fps is (10 sec: 5516.4, 60 sec: 5540.0, 300 sec: 5540.1). Total num frames: 706533376. Throughput: 0: 5733.3. Samples: 706538460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:10,313][25689] Avg episode reward: [(0, '-4.597')] [2022-07-10 10:45:12,051][26022] Updated weights on worker 0-0, policy_version 689984 (0.00901) [2022-07-10 10:45:14,031][26022] Updated weights on worker 0-0, policy_version 689994 (0.00085) [2022-07-10 10:45:15,345][25689] Fps is (10 sec: 5407.2, 60 sec: 5555.8, 300 sec: 5530.4). Total num frames: 706560000. Throughput: 0: 4920.6. Samples: 706555404. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:15,345][25689] Avg episode reward: [(0, '-4.999')] [2022-07-10 10:45:15,756][26022] Updated weights on worker 0-0, policy_version 690004 (0.00088) [2022-07-10 10:45:17,600][26022] Updated weights on worker 0-0, policy_version 690014 (0.00087) [2022-07-10 10:45:19,411][26022] Updated weights on worker 0-0, policy_version 690024 (0.00089) [2022-07-10 10:45:20,402][25689] Fps is (10 sec: 5581.9, 60 sec: 5555.3, 300 sec: 5539.8). Total num frames: 706589696. Throughput: 0: 5767.9. Samples: 706588944. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:20,403][25689] Avg episode reward: [(0, '-7.012')] [2022-07-10 10:45:21,446][26022] Updated weights on worker 0-0, policy_version 690034 (0.00084) [2022-07-10 10:45:22,996][26022] Updated weights on worker 0-0, policy_version 690044 (0.00103) [2022-07-10 10:45:24,957][26022] Updated weights on worker 0-0, policy_version 690054 (0.00090) [2022-07-10 10:45:25,416][25689] Fps is (10 sec: 5693.6, 60 sec: 5539.6, 300 sec: 5540.3). Total num frames: 706617344. Throughput: 0: 5858.9. Samples: 706622396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:25,417][25689] Avg episode reward: [(0, '-6.613')] [2022-07-10 10:45:26,725][26022] Updated weights on worker 0-0, policy_version 690064 (0.00095) [2022-07-10 10:45:28,550][26022] Updated weights on worker 0-0, policy_version 690074 (0.00093) [2022-07-10 10:45:30,426][25689] Fps is (10 sec: 5414.6, 60 sec: 5523.0, 300 sec: 5529.9). Total num frames: 706643968. Throughput: 0: 5001.1. Samples: 706639046. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:30,426][25689] Avg episode reward: [(0, '-6.804')] [2022-07-10 10:45:30,575][26022] Updated weights on worker 0-0, policy_version 690084 (0.00111) [2022-07-10 10:45:32,283][26022] Updated weights on worker 0-0, policy_version 690094 (0.00085) [2022-07-10 10:45:34,205][26022] Updated weights on worker 0-0, policy_version 690104 (0.00085) [2022-07-10 10:45:35,454][25689] Fps is (10 sec: 5610.9, 60 sec: 5556.3, 300 sec: 5537.9). Total num frames: 706673664. Throughput: 0: 5818.9. Samples: 706672418. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:35,454][25689] Avg episode reward: [(0, '-6.532')] [2022-07-10 10:45:36,020][26022] Updated weights on worker 0-0, policy_version 690114 (0.00086) [2022-07-10 10:45:37,884][26022] Updated weights on worker 0-0, policy_version 690124 (0.00093) [2022-07-10 10:45:39,726][26022] Updated weights on worker 0-0, policy_version 690134 (0.00105) [2022-07-10 10:45:40,498][25689] Fps is (10 sec: 5692.9, 60 sec: 5530.9, 300 sec: 5540.8). Total num frames: 706701312. Throughput: 0: 5831.2. Samples: 706706128. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:40,500][25689] Avg episode reward: [(0, '-5.876')] [2022-07-10 10:45:41,453][26022] Updated weights on worker 0-0, policy_version 690144 (0.00098) [2022-07-10 10:45:43,387][26022] Updated weights on worker 0-0, policy_version 690154 (0.00084) [2022-07-10 10:45:45,208][26022] Updated weights on worker 0-0, policy_version 690164 (0.00087) [2022-07-10 10:45:45,525][25689] Fps is (10 sec: 5592.2, 60 sec: 5563.2, 300 sec: 5536.9). Total num frames: 706729984. Throughput: 0: 4998.1. Samples: 706722902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:45,526][25689] Avg episode reward: [(0, '-3.712')] [2022-07-10 10:45:46,977][26022] Updated weights on worker 0-0, policy_version 690174 (0.00087) [2022-07-10 10:45:48,848][26022] Updated weights on worker 0-0, policy_version 690184 (0.00085) [2022-07-10 10:45:50,533][25689] Fps is (10 sec: 5612.6, 60 sec: 5547.8, 300 sec: 5541.0). Total num frames: 706757632. Throughput: 0: 5829.4. Samples: 706756260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:50,533][25689] Avg episode reward: [(0, '-3.686')] [2022-07-10 10:45:50,718][26022] Updated weights on worker 0-0, policy_version 690194 (0.00087) [2022-07-10 10:45:52,519][26022] Updated weights on worker 0-0, policy_version 690204 (0.00091) [2022-07-10 10:45:54,363][26022] Updated weights on worker 0-0, policy_version 690214 (0.00085) [2022-07-10 10:45:55,535][25689] Fps is (10 sec: 5422.0, 60 sec: 5532.2, 300 sec: 5535.0). Total num frames: 706784256. Throughput: 0: 5846.3. Samples: 706789816. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:45:55,536][25689] Avg episode reward: [(0, '-1.813')] [2022-07-10 10:45:56,247][26022] Updated weights on worker 0-0, policy_version 690224 (0.00079) [2022-07-10 10:45:58,086][26022] Updated weights on worker 0-0, policy_version 690234 (0.00088) [2022-07-10 10:45:59,866][26022] Updated weights on worker 0-0, policy_version 690244 (0.00080) [2022-07-10 10:46:00,646][25689] Fps is (10 sec: 5568.9, 60 sec: 5564.7, 300 sec: 5543.6). Total num frames: 706813952. Throughput: 0: 5808.9. Samples: 706823164. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:00,647][25689] Avg episode reward: [(0, '0.025')] [2022-07-10 10:46:02,081][26022] Updated weights on worker 0-0, policy_version 690254 (0.00087) [2022-07-10 10:46:03,896][26022] Updated weights on worker 0-0, policy_version 690264 (0.00097) [2022-07-10 10:46:05,654][25689] Fps is (10 sec: 5464.5, 60 sec: 5531.8, 300 sec: 5541.7). Total num frames: 706839552. Throughput: 0: 5696.9. Samples: 706837574. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:05,654][25689] Avg episode reward: [(0, '-0.068')] [2022-07-10 10:46:05,878][26022] Updated weights on worker 0-0, policy_version 690274 (0.00085) [2022-07-10 10:46:07,587][26022] Updated weights on worker 0-0, policy_version 690284 (0.00101) [2022-07-10 10:46:09,523][26022] Updated weights on worker 0-0, policy_version 690294 (0.00084) [2022-07-10 10:46:10,676][25689] Fps is (10 sec: 5308.8, 60 sec: 5530.2, 300 sec: 5538.7). Total num frames: 706867200. Throughput: 0: 5715.5. Samples: 706871390. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:10,677][25689] Avg episode reward: [(0, '-0.058')] [2022-07-10 10:46:11,192][26022] Updated weights on worker 0-0, policy_version 690304 (0.00093) [2022-07-10 10:46:13,238][26022] Updated weights on worker 0-0, policy_version 690314 (0.00091) [2022-07-10 10:46:14,675][26022] Updated weights on worker 0-0, policy_version 690324 (0.00085) [2022-07-10 10:46:15,767][25689] Fps is (10 sec: 5569.1, 60 sec: 5558.8, 300 sec: 5542.4). Total num frames: 706895872. Throughput: 0: 5686.1. Samples: 706904858. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:15,768][25689] Avg episode reward: [(0, '-0.129')] [2022-07-10 10:46:16,924][26022] Updated weights on worker 0-0, policy_version 690334 (0.00083) [2022-07-10 10:46:18,679][26022] Updated weights on worker 0-0, policy_version 690344 (0.00085) [2022-07-10 10:46:20,507][26022] Updated weights on worker 0-0, policy_version 690354 (0.00106) [2022-07-10 10:46:20,879][25689] Fps is (10 sec: 5620.6, 60 sec: 5536.8, 300 sec: 5536.9). Total num frames: 706924544. Throughput: 0: 4861.2. Samples: 706921518. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:20,879][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 10:46:22,447][26022] Updated weights on worker 0-0, policy_version 690364 (0.00091) [2022-07-10 10:46:24,003][26022] Updated weights on worker 0-0, policy_version 690374 (0.00106) [2022-07-10 10:46:24,926][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:46:24,936][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000690377_706946048.pth [2022-07-10 10:46:24,937][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000688428_704950272.pth [2022-07-10 10:46:25,910][25689] Fps is (10 sec: 5552.4, 60 sec: 5535.2, 300 sec: 5533.7). Total num frames: 706952192. Throughput: 0: 5806.4. Samples: 706955190. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:25,911][25689] Avg episode reward: [(0, '-2.016')] [2022-07-10 10:46:26,061][26022] Updated weights on worker 0-0, policy_version 690384 (0.00092) [2022-07-10 10:46:27,979][26022] Updated weights on worker 0-0, policy_version 690394 (0.00082) [2022-07-10 10:46:29,513][26022] Updated weights on worker 0-0, policy_version 690404 (0.00084) [2022-07-10 10:46:30,963][25689] Fps is (10 sec: 5584.8, 60 sec: 5565.1, 300 sec: 5540.4). Total num frames: 706980864. Throughput: 0: 5790.9. Samples: 706988870. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:30,964][25689] Avg episode reward: [(0, '-2.019')] [2022-07-10 10:46:31,563][26022] Updated weights on worker 0-0, policy_version 690414 (0.00090) [2022-07-10 10:46:33,151][26022] Updated weights on worker 0-0, policy_version 690424 (0.00095) [2022-07-10 10:46:35,221][26022] Updated weights on worker 0-0, policy_version 690434 (0.00092) [2022-07-10 10:46:36,002][25689] Fps is (10 sec: 5580.5, 60 sec: 5530.2, 300 sec: 5537.6). Total num frames: 707008512. Throughput: 0: 4980.4. Samples: 707005640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:36,003][25689] Avg episode reward: [(0, '-1.990')] [2022-07-10 10:46:36,909][26022] Updated weights on worker 0-0, policy_version 690444 (0.00081) [2022-07-10 10:46:38,881][26022] Updated weights on worker 0-0, policy_version 690454 (0.00093) [2022-07-10 10:46:40,563][26022] Updated weights on worker 0-0, policy_version 690464 (0.00092) [2022-07-10 10:46:41,161][25689] Fps is (10 sec: 5522.7, 60 sec: 5536.7, 300 sec: 5538.5). Total num frames: 707037184. Throughput: 0: 5804.9. Samples: 707039256. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:41,163][25689] Avg episode reward: [(0, '-2.464')] [2022-07-10 10:46:42,603][26022] Updated weights on worker 0-0, policy_version 690474 (0.00094) [2022-07-10 10:46:44,371][26022] Updated weights on worker 0-0, policy_version 690484 (0.00096) [2022-07-10 10:46:46,138][26022] Updated weights on worker 0-0, policy_version 690494 (0.00095) [2022-07-10 10:46:46,237][25689] Fps is (10 sec: 5603.0, 60 sec: 5532.2, 300 sec: 5538.0). Total num frames: 707065856. Throughput: 0: 5775.6. Samples: 707072588. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:46,237][25689] Avg episode reward: [(0, '-1.925')] [2022-07-10 10:46:47,927][26022] Updated weights on worker 0-0, policy_version 690504 (0.00085) [2022-07-10 10:46:49,928][26022] Updated weights on worker 0-0, policy_version 690514 (0.00084) [2022-07-10 10:46:51,317][25689] Fps is (10 sec: 5646.4, 60 sec: 5542.5, 300 sec: 5540.4). Total num frames: 707094528. Throughput: 0: 4933.1. Samples: 707089278. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:51,318][25689] Avg episode reward: [(0, '-1.595')] [2022-07-10 10:46:51,740][26022] Updated weights on worker 0-0, policy_version 690524 (0.00091) [2022-07-10 10:46:53,691][26022] Updated weights on worker 0-0, policy_version 690534 (0.00092) [2022-07-10 10:46:55,223][26022] Updated weights on worker 0-0, policy_version 690544 (0.00098) [2022-07-10 10:46:56,390][25689] Fps is (10 sec: 5547.3, 60 sec: 5552.9, 300 sec: 5539.7). Total num frames: 707122176. Throughput: 0: 5750.8. Samples: 707122882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:46:56,391][25689] Avg episode reward: [(0, '-1.133')] [2022-07-10 10:46:57,443][26022] Updated weights on worker 0-0, policy_version 690554 (0.00086) [2022-07-10 10:46:58,844][26022] Updated weights on worker 0-0, policy_version 690564 (0.00086) [2022-07-10 10:47:01,024][26022] Updated weights on worker 0-0, policy_version 690574 (0.00090) [2022-07-10 10:47:01,463][25689] Fps is (10 sec: 5550.9, 60 sec: 5539.5, 300 sec: 5545.4). Total num frames: 707150848. Throughput: 0: 5765.5. Samples: 707156306. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:01,464][25689] Avg episode reward: [(0, '-1.548')] [2022-07-10 10:47:02,999][26022] Updated weights on worker 0-0, policy_version 690584 (0.00088) [2022-07-10 10:47:05,007][26022] Updated weights on worker 0-0, policy_version 690594 (0.00084) [2022-07-10 10:47:06,503][25689] Fps is (10 sec: 5467.7, 60 sec: 5553.4, 300 sec: 5537.8). Total num frames: 707177472. Throughput: 0: 4848.5. Samples: 707170846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:06,504][25689] Avg episode reward: [(0, '-1.501')] [2022-07-10 10:47:06,595][26022] Updated weights on worker 0-0, policy_version 690604 (0.00085) [2022-07-10 10:47:08,656][26022] Updated weights on worker 0-0, policy_version 690614 (0.00088) [2022-07-10 10:47:10,366][26022] Updated weights on worker 0-0, policy_version 690624 (0.00097) [2022-07-10 10:47:11,528][25689] Fps is (10 sec: 5290.4, 60 sec: 5536.3, 300 sec: 5531.8). Total num frames: 707204096. Throughput: 0: 5710.6. Samples: 707204696. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:11,529][25689] Avg episode reward: [(0, '-1.104')] [2022-07-10 10:47:12,268][26022] Updated weights on worker 0-0, policy_version 690634 (0.00086) [2022-07-10 10:47:14,169][26022] Updated weights on worker 0-0, policy_version 690644 (0.00089) [2022-07-10 10:47:15,934][26022] Updated weights on worker 0-0, policy_version 690654 (0.00222) [2022-07-10 10:47:16,537][25689] Fps is (10 sec: 5510.9, 60 sec: 5543.8, 300 sec: 5543.4). Total num frames: 707232768. Throughput: 0: 5715.9. Samples: 707238040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:16,537][25689] Avg episode reward: [(0, '-1.694')] [2022-07-10 10:47:17,802][26022] Updated weights on worker 0-0, policy_version 690664 (0.00085) [2022-07-10 10:47:19,630][26022] Updated weights on worker 0-0, policy_version 690674 (0.00089) [2022-07-10 10:47:21,211][26022] Updated weights on worker 0-0, policy_version 690684 (0.00086) [2022-07-10 10:47:21,647][25689] Fps is (10 sec: 5565.9, 60 sec: 5527.1, 300 sec: 5535.0). Total num frames: 707260416. Throughput: 0: 4885.6. Samples: 707254916. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:21,647][25689] Avg episode reward: [(0, '-1.297')] [2022-07-10 10:47:23,307][26022] Updated weights on worker 0-0, policy_version 690694 (0.00082) [2022-07-10 10:47:24,953][26022] Updated weights on worker 0-0, policy_version 690704 (0.00095) [2022-07-10 10:47:26,667][25689] Fps is (10 sec: 5559.7, 60 sec: 5545.0, 300 sec: 5538.3). Total num frames: 707289088. Throughput: 0: 5841.7. Samples: 707288638. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:26,667][25689] Avg episode reward: [(0, '-1.566')] [2022-07-10 10:47:26,942][26022] Updated weights on worker 0-0, policy_version 690714 (0.00086) [2022-07-10 10:47:28,911][26022] Updated weights on worker 0-0, policy_version 690724 (0.00524) [2022-07-10 10:47:30,489][26022] Updated weights on worker 0-0, policy_version 690734 (0.00088) [2022-07-10 10:47:31,676][25689] Fps is (10 sec: 5615.4, 60 sec: 5532.1, 300 sec: 5542.4). Total num frames: 707316736. Throughput: 0: 5824.5. Samples: 707322050. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:31,677][25689] Avg episode reward: [(0, '-2.002')] [2022-07-10 10:47:32,575][26022] Updated weights on worker 0-0, policy_version 690744 (0.00087) [2022-07-10 10:47:34,253][26022] Updated weights on worker 0-0, policy_version 690754 (0.00084) [2022-07-10 10:47:36,219][26022] Updated weights on worker 0-0, policy_version 690764 (0.00080) [2022-07-10 10:47:36,691][25689] Fps is (10 sec: 5618.2, 60 sec: 5551.2, 300 sec: 5542.7). Total num frames: 707345408. Throughput: 0: 4996.2. Samples: 707338736. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:36,692][25689] Avg episode reward: [(0, '-1.832')] [2022-07-10 10:47:38,023][26022] Updated weights on worker 0-0, policy_version 690774 (0.00081) [2022-07-10 10:47:39,887][26022] Updated weights on worker 0-0, policy_version 690784 (0.00087) [2022-07-10 10:47:41,753][25689] Fps is (10 sec: 5589.4, 60 sec: 5543.2, 300 sec: 5534.8). Total num frames: 707373056. Throughput: 0: 5823.8. Samples: 707372008. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:41,753][25689] Avg episode reward: [(0, '-1.635')] [2022-07-10 10:47:41,755][26022] Updated weights on worker 0-0, policy_version 690794 (0.00088) [2022-07-10 10:47:43,411][26022] Updated weights on worker 0-0, policy_version 690804 (0.00092) [2022-07-10 10:47:45,385][26022] Updated weights on worker 0-0, policy_version 690814 (0.00086) [2022-07-10 10:47:46,820][25689] Fps is (10 sec: 5560.2, 60 sec: 5544.0, 300 sec: 5540.8). Total num frames: 707401728. Throughput: 0: 5818.5. Samples: 707405902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:46,822][25689] Avg episode reward: [(0, '-1.217')] [2022-07-10 10:47:47,070][26022] Updated weights on worker 0-0, policy_version 690824 (0.00096) [2022-07-10 10:47:48,857][26022] Updated weights on worker 0-0, policy_version 690834 (0.00083) [2022-07-10 10:47:50,725][26022] Updated weights on worker 0-0, policy_version 690844 (0.00083) [2022-07-10 10:47:51,843][25689] Fps is (10 sec: 5683.2, 60 sec: 5549.2, 300 sec: 5540.6). Total num frames: 707430400. Throughput: 0: 5843.6. Samples: 707439894. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:51,844][25689] Avg episode reward: [(0, '-1.502')] [2022-07-10 10:47:52,396][26022] Updated weights on worker 0-0, policy_version 690854 (0.00085) [2022-07-10 10:47:54,447][26022] Updated weights on worker 0-0, policy_version 690864 (0.00087) [2022-07-10 10:47:56,365][26022] Updated weights on worker 0-0, policy_version 690874 (0.00084) [2022-07-10 10:47:56,859][25689] Fps is (10 sec: 5610.1, 60 sec: 5554.4, 300 sec: 5541.8). Total num frames: 707458048. Throughput: 0: 5849.6. Samples: 707456712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:47:56,861][25689] Avg episode reward: [(0, '-2.949')] [2022-07-10 10:47:57,967][26022] Updated weights on worker 0-0, policy_version 690884 (0.00088) [2022-07-10 10:47:59,875][26022] Updated weights on worker 0-0, policy_version 690894 (0.00078) [2022-07-10 10:48:01,747][26022] Updated weights on worker 0-0, policy_version 690904 (0.00089) [2022-07-10 10:48:01,943][25689] Fps is (10 sec: 5474.7, 60 sec: 5536.5, 300 sec: 5547.5). Total num frames: 707485696. Throughput: 0: 5859.5. Samples: 707490314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 10:48:01,947][25689] Avg episode reward: [(0, '-2.242')] [2022-07-10 10:48:03,849][26022] Updated weights on worker 0-0, policy_version 690914 (0.00088) [2022-07-10 10:48:05,833][26022] Updated weights on worker 0-0, policy_version 690924 (0.00087) [2022-07-10 10:48:06,952][25689] Fps is (10 sec: 5478.9, 60 sec: 5556.3, 300 sec: 5545.0). Total num frames: 707513344. Throughput: 0: 5752.2. Samples: 707521704. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:06,952][25689] Avg episode reward: [(0, '-2.435')] [2022-07-10 10:48:07,533][26022] Updated weights on worker 0-0, policy_version 690934 (0.00083) [2022-07-10 10:48:09,608][26022] Updated weights on worker 0-0, policy_version 690944 (0.00092) [2022-07-10 10:48:11,181][26022] Updated weights on worker 0-0, policy_version 690954 (0.00088) [2022-07-10 10:48:11,979][25689] Fps is (10 sec: 5305.6, 60 sec: 5539.2, 300 sec: 5537.9). Total num frames: 707538944. Throughput: 0: 4901.1. Samples: 707538584. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:11,980][25689] Avg episode reward: [(0, '-2.355')] [2022-07-10 10:48:13,036][26022] Updated weights on worker 0-0, policy_version 690964 (0.00090) [2022-07-10 10:48:15,046][26022] Updated weights on worker 0-0, policy_version 690974 (0.00086) [2022-07-10 10:48:16,743][26022] Updated weights on worker 0-0, policy_version 690984 (0.00096) [2022-07-10 10:48:17,000][25689] Fps is (10 sec: 5503.3, 60 sec: 5555.0, 300 sec: 5539.6). Total num frames: 707568640. Throughput: 0: 5746.4. Samples: 707572448. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:17,000][25689] Avg episode reward: [(0, '-3.152')] [2022-07-10 10:48:18,484][26022] Updated weights on worker 0-0, policy_version 690994 (0.00083) [2022-07-10 10:48:20,445][26022] Updated weights on worker 0-0, policy_version 691004 (0.00083) [2022-07-10 10:48:21,901][26022] Updated weights on worker 0-0, policy_version 691014 (0.00097) [2022-07-10 10:48:22,141][25689] Fps is (10 sec: 5945.5, 60 sec: 5603.0, 300 sec: 5551.6). Total num frames: 707599360. Throughput: 0: 5721.2. Samples: 707605870. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:22,141][25689] Avg episode reward: [(0, '-3.701')] [2022-07-10 10:48:24,158][26022] Updated weights on worker 0-0, policy_version 691024 (0.00086) [2022-07-10 10:48:24,968][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:48:24,978][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000691031_707615744.pth [2022-07-10 10:48:24,978][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000689079_705616896.pth [2022-07-10 10:48:25,536][26022] Updated weights on worker 0-0, policy_version 691034 (0.00086) [2022-07-10 10:48:27,232][25689] Fps is (10 sec: 5503.9, 60 sec: 5545.6, 300 sec: 5543.5). Total num frames: 707624960. Throughput: 0: 4981.8. Samples: 707622738. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:27,233][25689] Avg episode reward: [(0, '-2.527')] [2022-07-10 10:48:27,719][26022] Updated weights on worker 0-0, policy_version 691044 (0.00094) [2022-07-10 10:48:29,920][26022] Updated weights on worker 0-0, policy_version 691054 (0.00084) [2022-07-10 10:48:31,238][26022] Updated weights on worker 0-0, policy_version 691064 (0.00090) [2022-07-10 10:48:32,327][25689] Fps is (10 sec: 5327.9, 60 sec: 5554.7, 300 sec: 5542.2). Total num frames: 707653632. Throughput: 0: 5771.6. Samples: 707656024. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:32,327][25689] Avg episode reward: [(0, '-2.434')] [2022-07-10 10:48:33,438][26022] Updated weights on worker 0-0, policy_version 691074 (0.00092) [2022-07-10 10:48:35,001][26022] Updated weights on worker 0-0, policy_version 691084 (0.00085) [2022-07-10 10:48:36,844][26022] Updated weights on worker 0-0, policy_version 691094 (0.00088) [2022-07-10 10:48:37,372][25689] Fps is (10 sec: 5655.3, 60 sec: 5551.9, 300 sec: 5545.7). Total num frames: 707682304. Throughput: 0: 5764.3. Samples: 707689882. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:37,373][25689] Avg episode reward: [(0, '-2.444')] [2022-07-10 10:48:38,936][26022] Updated weights on worker 0-0, policy_version 691104 (0.00088) [2022-07-10 10:48:40,501][26022] Updated weights on worker 0-0, policy_version 691114 (0.00089) [2022-07-10 10:48:42,416][25689] Fps is (10 sec: 5683.7, 60 sec: 5570.4, 300 sec: 5545.8). Total num frames: 707710976. Throughput: 0: 4973.9. Samples: 707706718. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:42,417][25689] Avg episode reward: [(0, '-2.003')] [2022-07-10 10:48:42,426][26022] Updated weights on worker 0-0, policy_version 691124 (0.00093) [2022-07-10 10:48:44,147][26022] Updated weights on worker 0-0, policy_version 691134 (0.00090) [2022-07-10 10:48:46,137][26022] Updated weights on worker 0-0, policy_version 691144 (0.00093) [2022-07-10 10:48:47,455][25689] Fps is (10 sec: 5585.9, 60 sec: 5556.2, 300 sec: 5538.4). Total num frames: 707738624. Throughput: 0: 5816.7. Samples: 707740366. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:47,455][25689] Avg episode reward: [(0, '-1.282')] [2022-07-10 10:48:47,801][26022] Updated weights on worker 0-0, policy_version 691154 (0.00086) [2022-07-10 10:48:49,939][26022] Updated weights on worker 0-0, policy_version 691164 (0.00080) [2022-07-10 10:48:51,478][26022] Updated weights on worker 0-0, policy_version 691174 (0.00092) [2022-07-10 10:48:52,464][25689] Fps is (10 sec: 5605.2, 60 sec: 5557.4, 300 sec: 5548.8). Total num frames: 707767296. Throughput: 0: 5844.9. Samples: 707773724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:52,465][25689] Avg episode reward: [(0, '-0.635')] [2022-07-10 10:48:53,651][26022] Updated weights on worker 0-0, policy_version 691184 (0.00093) [2022-07-10 10:48:55,088][26022] Updated weights on worker 0-0, policy_version 691194 (0.00092) [2022-07-10 10:48:57,272][26022] Updated weights on worker 0-0, policy_version 691204 (0.00087) [2022-07-10 10:48:57,476][25689] Fps is (10 sec: 5620.2, 60 sec: 5557.8, 300 sec: 5546.2). Total num frames: 707794944. Throughput: 0: 5012.0. Samples: 707790640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:48:57,476][25689] Avg episode reward: [(0, '-1.604')] [2022-07-10 10:48:58,951][26022] Updated weights on worker 0-0, policy_version 691214 (0.00089) [2022-07-10 10:49:00,885][26022] Updated weights on worker 0-0, policy_version 691224 (0.00084) [2022-07-10 10:49:02,534][25689] Fps is (10 sec: 5287.6, 60 sec: 5526.3, 300 sec: 5548.6). Total num frames: 707820544. Throughput: 0: 5839.4. Samples: 707824196. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:02,535][25689] Avg episode reward: [(0, '-3.425')] [2022-07-10 10:49:02,896][26022] Updated weights on worker 0-0, policy_version 691234 (0.00096) [2022-07-10 10:49:04,881][26022] Updated weights on worker 0-0, policy_version 691244 (0.00085) [2022-07-10 10:49:06,528][26022] Updated weights on worker 0-0, policy_version 691254 (0.00086) [2022-07-10 10:49:07,545][25689] Fps is (10 sec: 5390.1, 60 sec: 5543.1, 300 sec: 5545.3). Total num frames: 707849216. Throughput: 0: 5746.2. Samples: 707855806. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:07,545][25689] Avg episode reward: [(0, '-4.237')] [2022-07-10 10:49:08,530][26022] Updated weights on worker 0-0, policy_version 691264 (0.00094) [2022-07-10 10:49:10,286][26022] Updated weights on worker 0-0, policy_version 691274 (0.00084) [2022-07-10 10:49:12,149][26022] Updated weights on worker 0-0, policy_version 691284 (0.00092) [2022-07-10 10:49:12,613][25689] Fps is (10 sec: 5689.8, 60 sec: 5590.0, 300 sec: 5554.7). Total num frames: 707877888. Throughput: 0: 4908.2. Samples: 707872618. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:12,613][25689] Avg episode reward: [(0, '-4.710')] [2022-07-10 10:49:13,975][26022] Updated weights on worker 0-0, policy_version 691294 (0.00089) [2022-07-10 10:49:15,578][26022] Updated weights on worker 0-0, policy_version 691304 (0.00092) [2022-07-10 10:49:17,630][25689] Fps is (10 sec: 5482.6, 60 sec: 5539.6, 300 sec: 5545.0). Total num frames: 707904512. Throughput: 0: 5718.3. Samples: 707905890. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:17,632][25689] Avg episode reward: [(0, '-4.760')] [2022-07-10 10:49:17,751][26022] Updated weights on worker 0-0, policy_version 691314 (0.00091) [2022-07-10 10:49:19,515][26022] Updated weights on worker 0-0, policy_version 691324 (0.00092) [2022-07-10 10:49:21,330][26022] Updated weights on worker 0-0, policy_version 691334 (0.00090) [2022-07-10 10:49:22,690][25689] Fps is (10 sec: 5487.3, 60 sec: 5513.3, 300 sec: 5544.4). Total num frames: 707933184. Throughput: 0: 5708.7. Samples: 707939258. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:22,690][25689] Avg episode reward: [(0, '-5.884')] [2022-07-10 10:49:23,207][26022] Updated weights on worker 0-0, policy_version 691344 (0.00087) [2022-07-10 10:49:24,897][26022] Updated weights on worker 0-0, policy_version 691354 (0.00082) [2022-07-10 10:49:27,163][26022] Updated weights on worker 0-0, policy_version 691364 (0.00089) [2022-07-10 10:49:27,711][25689] Fps is (10 sec: 5688.7, 60 sec: 5570.5, 300 sec: 5547.7). Total num frames: 707961856. Throughput: 0: 4964.8. Samples: 707955926. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:27,711][25689] Avg episode reward: [(0, '-4.121')] [2022-07-10 10:49:28,740][26022] Updated weights on worker 0-0, policy_version 691374 (0.00085) [2022-07-10 10:49:30,484][26022] Updated weights on worker 0-0, policy_version 691384 (0.00079) [2022-07-10 10:49:32,373][26022] Updated weights on worker 0-0, policy_version 691394 (0.00087) [2022-07-10 10:49:32,724][25689] Fps is (10 sec: 5612.9, 60 sec: 5561.1, 300 sec: 5547.9). Total num frames: 707989504. Throughput: 0: 5800.1. Samples: 707989264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:32,724][25689] Avg episode reward: [(0, '-2.120')] [2022-07-10 10:49:34,249][26022] Updated weights on worker 0-0, policy_version 691404 (0.00090) [2022-07-10 10:49:36,035][26022] Updated weights on worker 0-0, policy_version 691414 (0.00090) [2022-07-10 10:49:37,754][25689] Fps is (10 sec: 5403.5, 60 sec: 5528.5, 300 sec: 5539.5). Total num frames: 708016128. Throughput: 0: 5815.7. Samples: 708022926. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:37,755][25689] Avg episode reward: [(0, '-2.226')] [2022-07-10 10:49:37,991][26022] Updated weights on worker 0-0, policy_version 691424 (0.00098) [2022-07-10 10:49:39,686][26022] Updated weights on worker 0-0, policy_version 691434 (0.00082) [2022-07-10 10:49:41,816][26022] Updated weights on worker 0-0, policy_version 691444 (0.00094) [2022-07-10 10:49:42,875][25689] Fps is (10 sec: 5648.8, 60 sec: 5555.4, 300 sec: 5551.2). Total num frames: 708046848. Throughput: 0: 4970.6. Samples: 708039592. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:42,876][25689] Avg episode reward: [(0, '-2.589')] [2022-07-10 10:49:43,302][26022] Updated weights on worker 0-0, policy_version 691454 (0.00078) [2022-07-10 10:49:45,293][26022] Updated weights on worker 0-0, policy_version 691464 (0.00087) [2022-07-10 10:49:46,941][26022] Updated weights on worker 0-0, policy_version 691474 (0.00084) [2022-07-10 10:49:47,887][25689] Fps is (10 sec: 5558.4, 60 sec: 5524.0, 300 sec: 5541.2). Total num frames: 708072448. Throughput: 0: 5820.0. Samples: 708073352. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:47,887][25689] Avg episode reward: [(0, '-3.448')] [2022-07-10 10:49:48,757][26022] Updated weights on worker 0-0, policy_version 691484 (0.00083) [2022-07-10 10:49:50,906][26022] Updated weights on worker 0-0, policy_version 691494 (0.00089) [2022-07-10 10:49:52,603][26022] Updated weights on worker 0-0, policy_version 691504 (0.00049) [2022-07-10 10:49:52,956][25689] Fps is (10 sec: 5485.1, 60 sec: 5535.4, 300 sec: 5547.0). Total num frames: 708102144. Throughput: 0: 5809.4. Samples: 708106804. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:52,957][25689] Avg episode reward: [(0, '-2.816')] [2022-07-10 10:49:54,524][26022] Updated weights on worker 0-0, policy_version 691514 (0.00091) [2022-07-10 10:49:56,219][26022] Updated weights on worker 0-0, policy_version 691524 (0.00083) [2022-07-10 10:49:57,985][25689] Fps is (10 sec: 5678.8, 60 sec: 5533.9, 300 sec: 5548.3). Total num frames: 708129792. Throughput: 0: 4981.3. Samples: 708123698. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:49:57,985][25689] Avg episode reward: [(0, '-4.432')] [2022-07-10 10:49:58,130][26022] Updated weights on worker 0-0, policy_version 691534 (0.00086) [2022-07-10 10:49:59,892][26022] Updated weights on worker 0-0, policy_version 691544 (0.00081) [2022-07-10 10:50:02,008][26022] Updated weights on worker 0-0, policy_version 691554 (0.00088) [2022-07-10 10:50:03,041][25689] Fps is (10 sec: 5381.4, 60 sec: 5551.0, 300 sec: 5544.2). Total num frames: 708156416. Throughput: 0: 5833.8. Samples: 708157238. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:03,042][25689] Avg episode reward: [(0, '-3.837')] [2022-07-10 10:50:04,024][26022] Updated weights on worker 0-0, policy_version 691564 (0.00089) [2022-07-10 10:50:05,679][26022] Updated weights on worker 0-0, policy_version 691574 (0.00093) [2022-07-10 10:50:07,876][26022] Updated weights on worker 0-0, policy_version 691584 (0.00093) [2022-07-10 10:50:08,072][25689] Fps is (10 sec: 5278.6, 60 sec: 5515.2, 300 sec: 5540.3). Total num frames: 708183040. Throughput: 0: 5715.6. Samples: 708188724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:08,073][25689] Avg episode reward: [(0, '-2.798')] [2022-07-10 10:50:09,286][26022] Updated weights on worker 0-0, policy_version 691594 (0.00084) [2022-07-10 10:50:11,356][26022] Updated weights on worker 0-0, policy_version 691604 (0.00092) [2022-07-10 10:50:13,109][26022] Updated weights on worker 0-0, policy_version 691614 (0.00088) [2022-07-10 10:50:13,111][25689] Fps is (10 sec: 5491.6, 60 sec: 5517.9, 300 sec: 5547.0). Total num frames: 708211712. Throughput: 0: 4893.3. Samples: 708205426. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:13,111][25689] Avg episode reward: [(0, '-2.089')] [2022-07-10 10:50:14,784][26022] Updated weights on worker 0-0, policy_version 691624 (0.00082) [2022-07-10 10:50:17,048][26022] Updated weights on worker 0-0, policy_version 691634 (0.00082) [2022-07-10 10:50:18,116][25689] Fps is (10 sec: 5709.2, 60 sec: 5552.9, 300 sec: 5544.5). Total num frames: 708240384. Throughput: 0: 5726.4. Samples: 708238980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:18,117][25689] Avg episode reward: [(0, '-2.323')] [2022-07-10 10:50:18,488][26022] Updated weights on worker 0-0, policy_version 691644 (0.00089) [2022-07-10 10:50:20,558][26022] Updated weights on worker 0-0, policy_version 691654 (0.00079) [2022-07-10 10:50:22,348][26022] Updated weights on worker 0-0, policy_version 691664 (0.00088) [2022-07-10 10:50:23,171][25689] Fps is (10 sec: 5700.0, 60 sec: 5553.3, 300 sec: 5547.2). Total num frames: 708269056. Throughput: 0: 5722.3. Samples: 708272426. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:23,175][25689] Avg episode reward: [(0, '-1.817')] [2022-07-10 10:50:24,255][26022] Updated weights on worker 0-0, policy_version 691674 (0.00090) [2022-07-10 10:50:25,175][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:50:25,186][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000691680_708280320.pth [2022-07-10 10:50:25,187][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000689729_706282496.pth [2022-07-10 10:50:25,962][26022] Updated weights on worker 0-0, policy_version 691684 (0.00091) [2022-07-10 10:50:27,782][26022] Updated weights on worker 0-0, policy_version 691694 (0.00089) [2022-07-10 10:50:28,179][25689] Fps is (10 sec: 5597.2, 60 sec: 5537.6, 300 sec: 5550.7). Total num frames: 708296704. Throughput: 0: 5833.2. Samples: 708306008. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:28,179][25689] Avg episode reward: [(0, '-0.738')] [2022-07-10 10:50:29,625][26022] Updated weights on worker 0-0, policy_version 691704 (0.00083) [2022-07-10 10:50:31,584][26022] Updated weights on worker 0-0, policy_version 691714 (0.00094) [2022-07-10 10:50:33,187][25689] Fps is (10 sec: 5520.6, 60 sec: 5538.0, 300 sec: 5544.2). Total num frames: 708324352. Throughput: 0: 5844.8. Samples: 708322770. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:33,189][25689] Avg episode reward: [(0, '-0.489')] [2022-07-10 10:50:33,247][26022] Updated weights on worker 0-0, policy_version 691724 (0.00088) [2022-07-10 10:50:35,177][26022] Updated weights on worker 0-0, policy_version 691734 (0.00085) [2022-07-10 10:50:36,726][26022] Updated weights on worker 0-0, policy_version 691744 (0.00098) [2022-07-10 10:50:38,215][25689] Fps is (10 sec: 5407.4, 60 sec: 5538.3, 300 sec: 5541.0). Total num frames: 708350976. Throughput: 0: 5860.5. Samples: 708356770. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:38,217][25689] Avg episode reward: [(0, '-0.577')] [2022-07-10 10:50:38,747][26022] Updated weights on worker 0-0, policy_version 691754 (0.00093) [2022-07-10 10:50:40,521][26022] Updated weights on worker 0-0, policy_version 691764 (0.00087) [2022-07-10 10:50:42,358][26022] Updated weights on worker 0-0, policy_version 691774 (0.00086) [2022-07-10 10:50:43,286][25689] Fps is (10 sec: 5678.2, 60 sec: 5542.8, 300 sec: 5547.1). Total num frames: 708381696. Throughput: 0: 5859.0. Samples: 708390282. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:43,288][25689] Avg episode reward: [(0, '-0.313')] [2022-07-10 10:50:44,301][26022] Updated weights on worker 0-0, policy_version 691784 (0.00091) [2022-07-10 10:50:46,155][26022] Updated weights on worker 0-0, policy_version 691794 (0.00088) [2022-07-10 10:50:47,801][26022] Updated weights on worker 0-0, policy_version 691804 (0.00087) [2022-07-10 10:50:48,308][25689] Fps is (10 sec: 5783.2, 60 sec: 5575.8, 300 sec: 5546.8). Total num frames: 708409344. Throughput: 0: 5017.9. Samples: 708407014. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:48,308][25689] Avg episode reward: [(0, '-1.643')] [2022-07-10 10:50:49,735][26022] Updated weights on worker 0-0, policy_version 691814 (0.00091) [2022-07-10 10:50:51,280][26022] Updated weights on worker 0-0, policy_version 691824 (0.00087) [2022-07-10 10:50:53,311][25689] Fps is (10 sec: 5414.0, 60 sec: 5531.1, 300 sec: 5546.8). Total num frames: 708435968. Throughput: 0: 5851.2. Samples: 708440516. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:53,311][25689] Avg episode reward: [(0, '-1.902')] [2022-07-10 10:50:53,566][26022] Updated weights on worker 0-0, policy_version 691834 (0.00091) [2022-07-10 10:50:55,258][26022] Updated weights on worker 0-0, policy_version 691844 (0.00096) [2022-07-10 10:50:57,136][26022] Updated weights on worker 0-0, policy_version 691854 (0.00861) [2022-07-10 10:50:58,325][25689] Fps is (10 sec: 5520.2, 60 sec: 5549.3, 300 sec: 5545.2). Total num frames: 708464640. Throughput: 0: 5835.0. Samples: 708474110. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:50:58,327][25689] Avg episode reward: [(0, '-4.757')] [2022-07-10 10:50:58,883][26022] Updated weights on worker 0-0, policy_version 691864 (0.00091) [2022-07-10 10:51:00,880][26022] Updated weights on worker 0-0, policy_version 691874 (0.00092) [2022-07-10 10:51:02,975][26022] Updated weights on worker 0-0, policy_version 691884 (0.00090) [2022-07-10 10:51:03,401][25689] Fps is (10 sec: 5581.7, 60 sec: 5564.5, 300 sec: 5550.8). Total num frames: 708492288. Throughput: 0: 4996.8. Samples: 708490790. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:51:03,401][25689] Avg episode reward: [(0, '-6.081')] [2022-07-10 10:51:04,959][26022] Updated weights on worker 0-0, policy_version 691894 (0.00082) [2022-07-10 10:51:06,702][26022] Updated weights on worker 0-0, policy_version 691904 (0.00087) [2022-07-10 10:51:08,456][25689] Fps is (10 sec: 5357.0, 60 sec: 5562.3, 300 sec: 5546.8). Total num frames: 708518912. Throughput: 0: 5717.5. Samples: 708522210. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:51:08,456][25689] Avg episode reward: [(0, '-6.237')] [2022-07-10 10:51:08,594][26022] Updated weights on worker 0-0, policy_version 691914 (0.01236) [2022-07-10 10:51:10,264][26022] Updated weights on worker 0-0, policy_version 691924 (0.00088) [2022-07-10 10:51:12,202][26022] Updated weights on worker 0-0, policy_version 691934 (0.00101) [2022-07-10 10:51:13,478][25689] Fps is (10 sec: 5385.4, 60 sec: 5546.8, 300 sec: 5544.6). Total num frames: 708546560. Throughput: 0: 5717.7. Samples: 708555828. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:51:13,479][25689] Avg episode reward: [(0, '-6.529')] [2022-07-10 10:51:14,028][26022] Updated weights on worker 0-0, policy_version 691944 (0.00096) [2022-07-10 10:51:15,955][26022] Updated weights on worker 0-0, policy_version 691954 (0.00089) [2022-07-10 10:51:17,452][26022] Updated weights on worker 0-0, policy_version 691964 (0.00084) [2022-07-10 10:51:18,501][25689] Fps is (10 sec: 5606.4, 60 sec: 5545.2, 300 sec: 5546.2). Total num frames: 708575232. Throughput: 0: 4883.0. Samples: 708572632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 10:51:18,502][25689] Avg episode reward: [(0, '-4.871')] [2022-07-10 10:51:19,542][26022] Updated weights on worker 0-0, policy_version 691974 (0.00093) [2022-07-10 10:51:21,369][26022] Updated weights on worker 0-0, policy_version 691984 (0.00090) [2022-07-10 10:51:23,205][26022] Updated weights on worker 0-0, policy_version 691994 (0.00087) [2022-07-10 10:51:23,615][25689] Fps is (10 sec: 5657.1, 60 sec: 5539.8, 300 sec: 5548.2). Total num frames: 708603904. Throughput: 0: 5690.5. Samples: 708605820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:23,616][25689] Avg episode reward: [(0, '-6.061')] [2022-07-10 10:51:25,210][26022] Updated weights on worker 0-0, policy_version 692004 (0.00091) [2022-07-10 10:51:26,718][26022] Updated weights on worker 0-0, policy_version 692014 (0.00087) [2022-07-10 10:51:28,655][25689] Fps is (10 sec: 5546.6, 60 sec: 5536.8, 300 sec: 5544.9). Total num frames: 708631552. Throughput: 0: 5788.9. Samples: 708639144. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:28,656][25689] Avg episode reward: [(0, '-3.820')] [2022-07-10 10:51:28,720][26022] Updated weights on worker 0-0, policy_version 692024 (0.00092) [2022-07-10 10:51:30,452][26022] Updated weights on worker 0-0, policy_version 692034 (0.00086) [2022-07-10 10:51:32,431][26022] Updated weights on worker 0-0, policy_version 692044 (0.00084) [2022-07-10 10:51:33,727][25689] Fps is (10 sec: 5468.5, 60 sec: 5531.1, 300 sec: 5544.3). Total num frames: 708659200. Throughput: 0: 4942.8. Samples: 708655912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:33,728][25689] Avg episode reward: [(0, '-3.765')] [2022-07-10 10:51:34,145][26022] Updated weights on worker 0-0, policy_version 692054 (0.00086) [2022-07-10 10:51:36,046][26022] Updated weights on worker 0-0, policy_version 692064 (0.00081) [2022-07-10 10:51:38,111][26022] Updated weights on worker 0-0, policy_version 692074 (0.00102) [2022-07-10 10:51:38,737][25689] Fps is (10 sec: 5484.9, 60 sec: 5549.6, 300 sec: 5543.7). Total num frames: 708686848. Throughput: 0: 5767.4. Samples: 708689338. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:38,739][25689] Avg episode reward: [(0, '-4.006')] [2022-07-10 10:51:39,715][26022] Updated weights on worker 0-0, policy_version 692084 (0.00087) [2022-07-10 10:51:41,639][26022] Updated weights on worker 0-0, policy_version 692094 (0.00096) [2022-07-10 10:51:43,376][26022] Updated weights on worker 0-0, policy_version 692104 (0.00085) [2022-07-10 10:51:43,806][25689] Fps is (10 sec: 5689.6, 60 sec: 5532.9, 300 sec: 5547.2). Total num frames: 708716544. Throughput: 0: 5808.0. Samples: 708723086. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:43,806][25689] Avg episode reward: [(0, '-3.511')] [2022-07-10 10:51:45,294][26022] Updated weights on worker 0-0, policy_version 692114 (0.00087) [2022-07-10 10:51:47,219][26022] Updated weights on worker 0-0, policy_version 692124 (0.00089) [2022-07-10 10:51:48,899][25689] Fps is (10 sec: 5743.6, 60 sec: 5543.2, 300 sec: 5547.0). Total num frames: 708745216. Throughput: 0: 4971.5. Samples: 708739786. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:48,901][25689] Avg episode reward: [(0, '-1.796')] [2022-07-10 10:51:48,903][26022] Updated weights on worker 0-0, policy_version 692134 (0.00089) [2022-07-10 10:51:50,634][26022] Updated weights on worker 0-0, policy_version 692144 (0.00091) [2022-07-10 10:51:52,719][26022] Updated weights on worker 0-0, policy_version 692154 (0.00088) [2022-07-10 10:51:53,934][25689] Fps is (10 sec: 5661.7, 60 sec: 5574.1, 300 sec: 5551.1). Total num frames: 708773888. Throughput: 0: 5826.2. Samples: 708773642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:53,934][25689] Avg episode reward: [(0, '-2.434')] [2022-07-10 10:51:54,159][26022] Updated weights on worker 0-0, policy_version 692164 (0.00080) [2022-07-10 10:51:56,364][26022] Updated weights on worker 0-0, policy_version 692174 (0.00088) [2022-07-10 10:51:57,954][26022] Updated weights on worker 0-0, policy_version 692184 (0.00087) [2022-07-10 10:51:58,943][25689] Fps is (10 sec: 5505.6, 60 sec: 5540.8, 300 sec: 5545.5). Total num frames: 708800512. Throughput: 0: 5850.7. Samples: 708807556. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:51:58,945][25689] Avg episode reward: [(0, '-2.858')] [2022-07-10 10:51:59,988][26022] Updated weights on worker 0-0, policy_version 692194 (0.00105) [2022-07-10 10:52:01,631][26022] Updated weights on worker 0-0, policy_version 692204 (0.00096) [2022-07-10 10:52:03,803][26022] Updated weights on worker 0-0, policy_version 692214 (0.00080) [2022-07-10 10:52:04,019][25689] Fps is (10 sec: 5381.4, 60 sec: 5540.8, 300 sec: 5548.2). Total num frames: 708828160. Throughput: 0: 5010.0. Samples: 708824356. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:04,020][25689] Avg episode reward: [(0, '-2.327')] [2022-07-10 10:52:05,632][26022] Updated weights on worker 0-0, policy_version 692224 (0.00096) [2022-07-10 10:52:07,625][26022] Updated weights on worker 0-0, policy_version 692234 (0.00087) [2022-07-10 10:52:09,027][25689] Fps is (10 sec: 5382.0, 60 sec: 5545.1, 300 sec: 5548.6). Total num frames: 708854784. Throughput: 0: 5758.8. Samples: 708855696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:09,027][25689] Avg episode reward: [(0, '-2.881')] [2022-07-10 10:52:09,335][26022] Updated weights on worker 0-0, policy_version 692244 (0.00086) [2022-07-10 10:52:11,384][26022] Updated weights on worker 0-0, policy_version 692254 (0.00086) [2022-07-10 10:52:13,025][26022] Updated weights on worker 0-0, policy_version 692264 (0.00089) [2022-07-10 10:52:14,031][25689] Fps is (10 sec: 5523.3, 60 sec: 5563.7, 300 sec: 5548.6). Total num frames: 708883456. Throughput: 0: 5737.6. Samples: 708888948. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:14,031][25689] Avg episode reward: [(0, '-3.418')] [2022-07-10 10:52:14,931][26022] Updated weights on worker 0-0, policy_version 692274 (0.00086) [2022-07-10 10:52:16,815][26022] Updated weights on worker 0-0, policy_version 692284 (0.00083) [2022-07-10 10:52:18,572][26022] Updated weights on worker 0-0, policy_version 692294 (0.00088) [2022-07-10 10:52:19,061][25689] Fps is (10 sec: 5714.9, 60 sec: 5563.1, 300 sec: 5553.6). Total num frames: 708912128. Throughput: 0: 4883.1. Samples: 708905794. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:19,062][25689] Avg episode reward: [(0, '-4.281')] [2022-07-10 10:52:20,455][26022] Updated weights on worker 0-0, policy_version 692304 (0.00088) [2022-07-10 10:52:22,260][26022] Updated weights on worker 0-0, policy_version 692314 (0.00081) [2022-07-10 10:52:24,069][26022] Updated weights on worker 0-0, policy_version 692324 (0.00083) [2022-07-10 10:52:24,119][25689] Fps is (10 sec: 5582.5, 60 sec: 5551.2, 300 sec: 5549.4). Total num frames: 708939776. Throughput: 0: 5720.3. Samples: 708939334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:24,120][25689] Avg episode reward: [(0, '-3.445')] [2022-07-10 10:52:25,195][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:52:25,214][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000692330_708945920.pth [2022-07-10 10:52:25,215][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000690377_706946048.pth [2022-07-10 10:52:25,972][26022] Updated weights on worker 0-0, policy_version 692334 (0.00098) [2022-07-10 10:52:27,953][26022] Updated weights on worker 0-0, policy_version 692344 (0.00090) [2022-07-10 10:52:29,122][25689] Fps is (10 sec: 5496.3, 60 sec: 5554.7, 300 sec: 5549.6). Total num frames: 708967424. Throughput: 0: 5826.8. Samples: 708972784. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:29,122][25689] Avg episode reward: [(0, '-2.820')] [2022-07-10 10:52:29,586][26022] Updated weights on worker 0-0, policy_version 692354 (0.00088) [2022-07-10 10:52:31,585][26022] Updated weights on worker 0-0, policy_version 692364 (0.00085) [2022-07-10 10:52:33,375][26022] Updated weights on worker 0-0, policy_version 692374 (0.00082) [2022-07-10 10:52:34,140][25689] Fps is (10 sec: 5416.0, 60 sec: 5542.7, 300 sec: 5542.6). Total num frames: 708994048. Throughput: 0: 4987.7. Samples: 708989248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:34,140][25689] Avg episode reward: [(0, '-3.724')] [2022-07-10 10:52:35,199][26022] Updated weights on worker 0-0, policy_version 692384 (0.00087) [2022-07-10 10:52:36,885][26022] Updated weights on worker 0-0, policy_version 692394 (0.00092) [2022-07-10 10:52:38,801][26022] Updated weights on worker 0-0, policy_version 692404 (0.00087) [2022-07-10 10:52:39,147][25689] Fps is (10 sec: 5617.8, 60 sec: 5576.8, 300 sec: 5550.5). Total num frames: 709023744. Throughput: 0: 5841.1. Samples: 709023118. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:39,147][25689] Avg episode reward: [(0, '-2.268')] [2022-07-10 10:52:40,623][26022] Updated weights on worker 0-0, policy_version 692414 (0.00084) [2022-07-10 10:52:42,440][26022] Updated weights on worker 0-0, policy_version 692424 (0.00087) [2022-07-10 10:52:44,055][26022] Updated weights on worker 0-0, policy_version 692434 (0.00091) [2022-07-10 10:52:44,281][25689] Fps is (10 sec: 5755.5, 60 sec: 5553.9, 300 sec: 5549.3). Total num frames: 709052416. Throughput: 0: 5841.8. Samples: 709057116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:44,282][25689] Avg episode reward: [(0, '-3.203')] [2022-07-10 10:52:46,211][26022] Updated weights on worker 0-0, policy_version 692444 (0.00089) [2022-07-10 10:52:47,788][26022] Updated weights on worker 0-0, policy_version 692454 (0.00109) [2022-07-10 10:52:49,312][25689] Fps is (10 sec: 5540.6, 60 sec: 5542.7, 300 sec: 5545.7). Total num frames: 709080064. Throughput: 0: 5016.6. Samples: 709074070. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:49,312][25689] Avg episode reward: [(0, '-3.076')] [2022-07-10 10:52:49,763][26022] Updated weights on worker 0-0, policy_version 692464 (0.00092) [2022-07-10 10:52:51,592][26022] Updated weights on worker 0-0, policy_version 692474 (0.00086) [2022-07-10 10:52:53,407][26022] Updated weights on worker 0-0, policy_version 692484 (0.00087) [2022-07-10 10:52:54,339][25689] Fps is (10 sec: 5599.4, 60 sec: 5543.3, 300 sec: 5548.9). Total num frames: 709108736. Throughput: 0: 5868.9. Samples: 709107798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:54,341][25689] Avg episode reward: [(0, '-3.168')] [2022-07-10 10:52:55,309][26022] Updated weights on worker 0-0, policy_version 692494 (0.00082) [2022-07-10 10:52:57,023][26022] Updated weights on worker 0-0, policy_version 692504 (0.00093) [2022-07-10 10:52:58,864][26022] Updated weights on worker 0-0, policy_version 692514 (0.00097) [2022-07-10 10:52:59,351][25689] Fps is (10 sec: 5610.2, 60 sec: 5560.1, 300 sec: 5550.3). Total num frames: 709136384. Throughput: 0: 5850.6. Samples: 709141322. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:52:59,353][25689] Avg episode reward: [(0, '-3.964')] [2022-07-10 10:53:00,694][26022] Updated weights on worker 0-0, policy_version 692524 (0.00088) [2022-07-10 10:53:02,900][26022] Updated weights on worker 0-0, policy_version 692534 (0.00090) [2022-07-10 10:53:04,445][25689] Fps is (10 sec: 5269.2, 60 sec: 5524.5, 300 sec: 5541.8). Total num frames: 709161984. Throughput: 0: 5718.8. Samples: 709172428. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:04,447][25689] Avg episode reward: [(0, '-2.666')] [2022-07-10 10:53:04,763][26022] Updated weights on worker 0-0, policy_version 692544 (0.00088) [2022-07-10 10:53:06,401][26022] Updated weights on worker 0-0, policy_version 692554 (0.00088) [2022-07-10 10:53:08,602][26022] Updated weights on worker 0-0, policy_version 692564 (0.00079) [2022-07-10 10:53:09,472][25689] Fps is (10 sec: 5362.2, 60 sec: 5556.7, 300 sec: 5552.1). Total num frames: 709190656. Throughput: 0: 5718.5. Samples: 709189356. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:09,473][25689] Avg episode reward: [(0, '-2.837')] [2022-07-10 10:53:10,170][26022] Updated weights on worker 0-0, policy_version 692574 (0.00103) [2022-07-10 10:53:12,259][26022] Updated weights on worker 0-0, policy_version 692584 (0.00084) [2022-07-10 10:53:13,899][26022] Updated weights on worker 0-0, policy_version 692594 (0.00087) [2022-07-10 10:53:14,495][25689] Fps is (10 sec: 5604.0, 60 sec: 5537.9, 300 sec: 5545.2). Total num frames: 709218304. Throughput: 0: 5712.2. Samples: 709222930. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:14,496][25689] Avg episode reward: [(0, '-2.284')] [2022-07-10 10:53:15,759][26022] Updated weights on worker 0-0, policy_version 692604 (0.00098) [2022-07-10 10:53:17,482][26022] Updated weights on worker 0-0, policy_version 692614 (0.00079) [2022-07-10 10:53:19,257][26022] Updated weights on worker 0-0, policy_version 692624 (0.00084) [2022-07-10 10:53:19,569][25689] Fps is (10 sec: 5679.5, 60 sec: 5550.9, 300 sec: 5543.0). Total num frames: 709248000. Throughput: 0: 5721.8. Samples: 709257004. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:19,569][25689] Avg episode reward: [(0, '-3.177')] [2022-07-10 10:53:21,166][26022] Updated weights on worker 0-0, policy_version 692634 (0.00092) [2022-07-10 10:53:22,984][26022] Updated weights on worker 0-0, policy_version 692644 (0.00082) [2022-07-10 10:53:24,659][25689] Fps is (10 sec: 5642.0, 60 sec: 5548.0, 300 sec: 5549.9). Total num frames: 709275648. Throughput: 0: 5008.0. Samples: 709273660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:24,659][25689] Avg episode reward: [(0, '-2.709')] [2022-07-10 10:53:24,906][26022] Updated weights on worker 0-0, policy_version 692654 (0.00085) [2022-07-10 10:53:26,674][26022] Updated weights on worker 0-0, policy_version 692664 (0.00091) [2022-07-10 10:53:28,524][26022] Updated weights on worker 0-0, policy_version 692674 (0.00090) [2022-07-10 10:53:29,669][25689] Fps is (10 sec: 5575.9, 60 sec: 5564.2, 300 sec: 5551.5). Total num frames: 709304320. Throughput: 0: 5830.5. Samples: 709307116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:29,670][25689] Avg episode reward: [(0, '-3.275')] [2022-07-10 10:53:30,624][26022] Updated weights on worker 0-0, policy_version 692684 (0.00088) [2022-07-10 10:53:32,167][26022] Updated weights on worker 0-0, policy_version 692694 (0.00091) [2022-07-10 10:53:34,287][26022] Updated weights on worker 0-0, policy_version 692704 (0.00086) [2022-07-10 10:53:34,673][25689] Fps is (10 sec: 5623.7, 60 sec: 5582.4, 300 sec: 5548.8). Total num frames: 709331968. Throughput: 0: 5823.7. Samples: 709340444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:34,674][25689] Avg episode reward: [(0, '-4.030')] [2022-07-10 10:53:35,851][26022] Updated weights on worker 0-0, policy_version 692714 (0.00083) [2022-07-10 10:53:37,835][26022] Updated weights on worker 0-0, policy_version 692724 (0.00083) [2022-07-10 10:53:39,487][26022] Updated weights on worker 0-0, policy_version 692734 (0.00087) [2022-07-10 10:53:39,704][25689] Fps is (10 sec: 5510.6, 60 sec: 5546.4, 300 sec: 5545.6). Total num frames: 709359616. Throughput: 0: 4978.9. Samples: 709357254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:39,704][25689] Avg episode reward: [(0, '-4.029')] [2022-07-10 10:53:41,414][26022] Updated weights on worker 0-0, policy_version 692744 (0.00089) [2022-07-10 10:53:43,404][26022] Updated weights on worker 0-0, policy_version 692754 (0.00094) [2022-07-10 10:53:44,764][25689] Fps is (10 sec: 5581.5, 60 sec: 5553.2, 300 sec: 5548.6). Total num frames: 709388288. Throughput: 0: 5826.0. Samples: 709390792. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:44,764][25689] Avg episode reward: [(0, '-3.148')] [2022-07-10 10:53:44,903][26022] Updated weights on worker 0-0, policy_version 692764 (0.00086) [2022-07-10 10:53:46,922][26022] Updated weights on worker 0-0, policy_version 692774 (0.00082) [2022-07-10 10:53:48,887][26022] Updated weights on worker 0-0, policy_version 692784 (0.00094) [2022-07-10 10:53:49,809][25689] Fps is (10 sec: 5674.4, 60 sec: 5568.8, 300 sec: 5548.0). Total num frames: 709416960. Throughput: 0: 5835.6. Samples: 709424646. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:49,810][25689] Avg episode reward: [(0, '-2.824')] [2022-07-10 10:53:50,464][26022] Updated weights on worker 0-0, policy_version 692794 (0.00086) [2022-07-10 10:53:52,722][26022] Updated weights on worker 0-0, policy_version 692804 (0.00090) [2022-07-10 10:53:54,054][26022] Updated weights on worker 0-0, policy_version 692814 (0.00094) [2022-07-10 10:53:54,833][25689] Fps is (10 sec: 5491.5, 60 sec: 5535.2, 300 sec: 5544.3). Total num frames: 709443584. Throughput: 0: 4997.4. Samples: 709441192. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:54,834][25689] Avg episode reward: [(0, '-1.919')] [2022-07-10 10:53:56,150][26022] Updated weights on worker 0-0, policy_version 692824 (0.00089) [2022-07-10 10:53:57,931][26022] Updated weights on worker 0-0, policy_version 692834 (0.00089) [2022-07-10 10:53:59,524][26022] Updated weights on worker 0-0, policy_version 692844 (0.00088) [2022-07-10 10:53:59,858][25689] Fps is (10 sec: 5604.7, 60 sec: 5567.8, 300 sec: 5558.7). Total num frames: 709473280. Throughput: 0: 5855.7. Samples: 709475272. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:53:59,859][25689] Avg episode reward: [(0, '-1.293')] [2022-07-10 10:54:02,130][26022] Updated weights on worker 0-0, policy_version 692854 (0.00088) [2022-07-10 10:54:03,585][26022] Updated weights on worker 0-0, policy_version 692864 (0.00091) [2022-07-10 10:54:04,919][25689] Fps is (10 sec: 5482.4, 60 sec: 5570.9, 300 sec: 5547.4). Total num frames: 709498880. Throughput: 0: 5762.3. Samples: 709506934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:04,920][25689] Avg episode reward: [(0, '-1.597')] [2022-07-10 10:54:05,752][26022] Updated weights on worker 0-0, policy_version 692874 (0.00088) [2022-07-10 10:54:07,451][26022] Updated weights on worker 0-0, policy_version 692884 (0.00093) [2022-07-10 10:54:09,311][26022] Updated weights on worker 0-0, policy_version 692894 (0.00086) [2022-07-10 10:54:09,997][25689] Fps is (10 sec: 5353.1, 60 sec: 5566.3, 300 sec: 5547.3). Total num frames: 709527552. Throughput: 0: 4911.2. Samples: 709523788. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:09,998][25689] Avg episode reward: [(0, '-2.525')] [2022-07-10 10:54:11,184][26022] Updated weights on worker 0-0, policy_version 692904 (0.00091) [2022-07-10 10:54:12,871][26022] Updated weights on worker 0-0, policy_version 692914 (0.00089) [2022-07-10 10:54:14,620][26022] Updated weights on worker 0-0, policy_version 692924 (0.00087) [2022-07-10 10:54:15,011][25689] Fps is (10 sec: 5682.4, 60 sec: 5584.0, 300 sec: 5554.2). Total num frames: 709556224. Throughput: 0: 5770.3. Samples: 709557622. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:15,012][25689] Avg episode reward: [(0, '-2.467')] [2022-07-10 10:54:16,675][26022] Updated weights on worker 0-0, policy_version 692934 (0.00084) [2022-07-10 10:54:18,123][26022] Updated weights on worker 0-0, policy_version 692944 (0.00090) [2022-07-10 10:54:20,016][25689] Fps is (10 sec: 5621.3, 60 sec: 5556.5, 300 sec: 5551.8). Total num frames: 709583872. Throughput: 0: 5782.0. Samples: 709591822. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:20,016][25689] Avg episode reward: [(0, '-2.454')] [2022-07-10 10:54:20,163][26022] Updated weights on worker 0-0, policy_version 692954 (0.00098) [2022-07-10 10:54:21,850][26022] Updated weights on worker 0-0, policy_version 692964 (0.00089) [2022-07-10 10:54:23,808][26022] Updated weights on worker 0-0, policy_version 692974 (0.00087) [2022-07-10 10:54:25,103][25689] Fps is (10 sec: 5580.7, 60 sec: 5573.7, 300 sec: 5550.5). Total num frames: 709612544. Throughput: 0: 5037.7. Samples: 709608612. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:25,103][25689] Avg episode reward: [(0, '-3.128')] [2022-07-10 10:54:25,365][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:54:25,381][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000692982_709613568.pth [2022-07-10 10:54:25,382][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000691031_707615744.pth [2022-07-10 10:54:25,670][26022] Updated weights on worker 0-0, policy_version 692984 (0.00090) [2022-07-10 10:54:27,488][26022] Updated weights on worker 0-0, policy_version 692994 (0.00085) [2022-07-10 10:54:29,171][26022] Updated weights on worker 0-0, policy_version 693004 (0.00095) [2022-07-10 10:54:30,115][25689] Fps is (10 sec: 5677.8, 60 sec: 5573.5, 300 sec: 5554.0). Total num frames: 709641216. Throughput: 0: 5893.7. Samples: 709642362. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:30,117][25689] Avg episode reward: [(0, '-4.520')] [2022-07-10 10:54:31,137][26022] Updated weights on worker 0-0, policy_version 693014 (0.00086) [2022-07-10 10:54:32,948][26022] Updated weights on worker 0-0, policy_version 693024 (0.00091) [2022-07-10 10:54:34,766][26022] Updated weights on worker 0-0, policy_version 693034 (0.00090) [2022-07-10 10:54:35,124][25689] Fps is (10 sec: 5518.0, 60 sec: 5556.2, 300 sec: 5554.4). Total num frames: 709667840. Throughput: 0: 5877.0. Samples: 709675826. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:35,124][25689] Avg episode reward: [(0, '-3.935')] [2022-07-10 10:54:36,676][26022] Updated weights on worker 0-0, policy_version 693044 (0.00088) [2022-07-10 10:54:38,428][26022] Updated weights on worker 0-0, policy_version 693054 (0.00095) [2022-07-10 10:54:40,171][25689] Fps is (10 sec: 5295.4, 60 sec: 5537.7, 300 sec: 5542.0). Total num frames: 709694464. Throughput: 0: 4990.2. Samples: 709692400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 10:54:40,171][25689] Avg episode reward: [(0, '-4.714')] [2022-07-10 10:54:40,483][26022] Updated weights on worker 0-0, policy_version 693064 (0.00089) [2022-07-10 10:54:42,159][26022] Updated weights on worker 0-0, policy_version 693074 (0.00093) [2022-07-10 10:54:43,991][26022] Updated weights on worker 0-0, policy_version 693084 (0.00087) [2022-07-10 10:54:45,298][25689] Fps is (10 sec: 5636.1, 60 sec: 5565.4, 300 sec: 5557.1). Total num frames: 709725184. Throughput: 0: 5799.2. Samples: 709725728. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:54:45,299][25689] Avg episode reward: [(0, '-5.093')] [2022-07-10 10:54:45,975][26022] Updated weights on worker 0-0, policy_version 693094 (0.00096) [2022-07-10 10:54:47,598][26022] Updated weights on worker 0-0, policy_version 693104 (0.00081) [2022-07-10 10:54:49,637][26022] Updated weights on worker 0-0, policy_version 693114 (0.00086) [2022-07-10 10:54:50,314][25689] Fps is (10 sec: 5754.2, 60 sec: 5551.2, 300 sec: 5551.2). Total num frames: 709752832. Throughput: 0: 5796.7. Samples: 709759450. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:54:50,315][25689] Avg episode reward: [(0, '-4.587')] [2022-07-10 10:54:51,211][26022] Updated weights on worker 0-0, policy_version 693124 (0.00085) [2022-07-10 10:54:53,164][26022] Updated weights on worker 0-0, policy_version 693134 (0.00083) [2022-07-10 10:54:54,732][26022] Updated weights on worker 0-0, policy_version 693144 (0.00079) [2022-07-10 10:54:55,321][25689] Fps is (10 sec: 5619.2, 60 sec: 5586.6, 300 sec: 5555.0). Total num frames: 709781504. Throughput: 0: 4989.1. Samples: 709776592. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:54:55,321][25689] Avg episode reward: [(0, '-3.646')] [2022-07-10 10:54:57,188][26022] Updated weights on worker 0-0, policy_version 693155 (0.00089) [2022-07-10 10:54:58,673][26022] Updated weights on worker 0-0, policy_version 693165 (0.00082) [2022-07-10 10:55:00,369][25689] Fps is (10 sec: 5499.5, 60 sec: 5533.7, 300 sec: 5555.2). Total num frames: 709808128. Throughput: 0: 5820.7. Samples: 709809968. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:00,369][25689] Avg episode reward: [(0, '-4.020')] [2022-07-10 10:55:00,598][26022] Updated weights on worker 0-0, policy_version 693175 (0.00085) [2022-07-10 10:55:02,672][26022] Updated weights on worker 0-0, policy_version 693185 (0.00086) [2022-07-10 10:55:04,657][26022] Updated weights on worker 0-0, policy_version 693195 (0.00090) [2022-07-10 10:55:05,429][25689] Fps is (10 sec: 5470.3, 60 sec: 5584.6, 300 sec: 5561.5). Total num frames: 709836800. Throughput: 0: 5757.9. Samples: 709841640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:05,430][25689] Avg episode reward: [(0, '-3.534')] [2022-07-10 10:55:06,670][26022] Updated weights on worker 0-0, policy_version 693205 (0.00093) [2022-07-10 10:55:08,367][26022] Updated weights on worker 0-0, policy_version 693215 (0.00097) [2022-07-10 10:55:10,218][26022] Updated weights on worker 0-0, policy_version 693225 (0.00092) [2022-07-10 10:55:10,447][25689] Fps is (10 sec: 5486.6, 60 sec: 5556.1, 300 sec: 5555.0). Total num frames: 709863424. Throughput: 0: 4903.6. Samples: 709858176. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:10,448][25689] Avg episode reward: [(0, '-2.989')] [2022-07-10 10:55:12,108][26022] Updated weights on worker 0-0, policy_version 693235 (0.00472) [2022-07-10 10:55:13,781][26022] Updated weights on worker 0-0, policy_version 693245 (0.00088) [2022-07-10 10:55:15,517][25689] Fps is (10 sec: 5380.0, 60 sec: 5534.1, 300 sec: 5550.4). Total num frames: 709891072. Throughput: 0: 5708.2. Samples: 709891876. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:15,517][25689] Avg episode reward: [(0, '-3.204')] [2022-07-10 10:55:15,773][26022] Updated weights on worker 0-0, policy_version 693255 (0.00081) [2022-07-10 10:55:17,501][26022] Updated weights on worker 0-0, policy_version 693265 (0.00089) [2022-07-10 10:55:19,392][26022] Updated weights on worker 0-0, policy_version 693275 (0.00090) [2022-07-10 10:55:20,527][25689] Fps is (10 sec: 5790.7, 60 sec: 5584.4, 300 sec: 5558.1). Total num frames: 709921792. Throughput: 0: 5734.9. Samples: 709925572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:20,527][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 10:55:21,238][26022] Updated weights on worker 0-0, policy_version 693285 (0.00086) [2022-07-10 10:55:22,961][26022] Updated weights on worker 0-0, policy_version 693295 (0.00264) [2022-07-10 10:55:24,961][26022] Updated weights on worker 0-0, policy_version 693305 (0.00093) [2022-07-10 10:55:25,634][25689] Fps is (10 sec: 5566.8, 60 sec: 5531.8, 300 sec: 5549.4). Total num frames: 709947392. Throughput: 0: 4990.0. Samples: 709942462. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:25,634][25689] Avg episode reward: [(0, '-3.865')] [2022-07-10 10:55:26,501][26022] Updated weights on worker 0-0, policy_version 693315 (0.00091) [2022-07-10 10:55:28,580][26022] Updated weights on worker 0-0, policy_version 693325 (0.00091) [2022-07-10 10:55:30,247][26022] Updated weights on worker 0-0, policy_version 693335 (0.00073) [2022-07-10 10:55:30,650][25689] Fps is (10 sec: 5361.2, 60 sec: 5531.5, 300 sec: 5552.7). Total num frames: 709976064. Throughput: 0: 5838.5. Samples: 709976130. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:30,651][25689] Avg episode reward: [(0, '-3.339')] [2022-07-10 10:55:32,172][26022] Updated weights on worker 0-0, policy_version 693345 (0.00084) [2022-07-10 10:55:34,087][26022] Updated weights on worker 0-0, policy_version 693355 (0.00096) [2022-07-10 10:55:35,688][25689] Fps is (10 sec: 5703.3, 60 sec: 5562.6, 300 sec: 5559.4). Total num frames: 710004736. Throughput: 0: 5846.7. Samples: 710009814. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:35,689][25689] Avg episode reward: [(0, '-3.812')] [2022-07-10 10:55:35,758][26022] Updated weights on worker 0-0, policy_version 693365 (0.00096) [2022-07-10 10:55:37,765][26022] Updated weights on worker 0-0, policy_version 693375 (0.00090) [2022-07-10 10:55:39,238][26022] Updated weights on worker 0-0, policy_version 693385 (0.00086) [2022-07-10 10:55:40,691][25689] Fps is (10 sec: 5609.2, 60 sec: 5583.6, 300 sec: 5550.3). Total num frames: 710032384. Throughput: 0: 5015.7. Samples: 710026708. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:40,692][25689] Avg episode reward: [(0, '-4.138')] [2022-07-10 10:55:41,267][26022] Updated weights on worker 0-0, policy_version 693395 (0.00085) [2022-07-10 10:55:43,077][26022] Updated weights on worker 0-0, policy_version 693405 (0.00093) [2022-07-10 10:55:44,915][26022] Updated weights on worker 0-0, policy_version 693415 (0.00100) [2022-07-10 10:55:45,841][25689] Fps is (10 sec: 5648.2, 60 sec: 5564.6, 300 sec: 5554.8). Total num frames: 710062080. Throughput: 0: 5841.9. Samples: 710060510. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:45,842][25689] Avg episode reward: [(0, '-4.948')] [2022-07-10 10:55:46,745][26022] Updated weights on worker 0-0, policy_version 693425 (0.00077) [2022-07-10 10:55:48,405][26022] Updated weights on worker 0-0, policy_version 693435 (0.00088) [2022-07-10 10:55:50,457][26022] Updated weights on worker 0-0, policy_version 693445 (0.00093) [2022-07-10 10:55:50,865][25689] Fps is (10 sec: 5736.8, 60 sec: 5580.8, 300 sec: 5561.3). Total num frames: 710090752. Throughput: 0: 5861.4. Samples: 710094618. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:50,867][25689] Avg episode reward: [(0, '-5.833')] [2022-07-10 10:55:52,069][26022] Updated weights on worker 0-0, policy_version 693455 (0.00092) [2022-07-10 10:55:54,062][26022] Updated weights on worker 0-0, policy_version 693465 (0.00083) [2022-07-10 10:55:55,666][26022] Updated weights on worker 0-0, policy_version 693475 (0.00081) [2022-07-10 10:55:55,887][25689] Fps is (10 sec: 5708.4, 60 sec: 5579.4, 300 sec: 5561.1). Total num frames: 710119424. Throughput: 0: 5873.0. Samples: 710128438. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:55:55,888][25689] Avg episode reward: [(0, '-5.750')] [2022-07-10 10:55:57,475][26022] Updated weights on worker 0-0, policy_version 693485 (0.00103) [2022-07-10 10:55:59,199][26022] Updated weights on worker 0-0, policy_version 693495 (0.00087) [2022-07-10 10:56:00,898][25689] Fps is (10 sec: 5613.6, 60 sec: 5599.7, 300 sec: 5562.4). Total num frames: 710147072. Throughput: 0: 5885.1. Samples: 710145628. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:00,898][25689] Avg episode reward: [(0, '-5.208')] [2022-07-10 10:56:01,411][26022] Updated weights on worker 0-0, policy_version 693505 (0.00087) [2022-07-10 10:56:03,312][26022] Updated weights on worker 0-0, policy_version 693515 (0.00085) [2022-07-10 10:56:05,186][26022] Updated weights on worker 0-0, policy_version 693525 (0.00343) [2022-07-10 10:56:06,024][25689] Fps is (10 sec: 5252.8, 60 sec: 5542.9, 300 sec: 5557.6). Total num frames: 710172672. Throughput: 0: 5787.6. Samples: 710177320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:06,024][25689] Avg episode reward: [(0, '-4.265')] [2022-07-10 10:56:06,913][26022] Updated weights on worker 0-0, policy_version 693535 (0.00087) [2022-07-10 10:56:09,237][26022] Updated weights on worker 0-0, policy_version 693545 (0.00084) [2022-07-10 10:56:10,543][26022] Updated weights on worker 0-0, policy_version 693555 (0.00089) [2022-07-10 10:56:11,074][25689] Fps is (10 sec: 5433.7, 60 sec: 5590.7, 300 sec: 5564.0). Total num frames: 710202368. Throughput: 0: 5754.6. Samples: 710210914. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:11,075][25689] Avg episode reward: [(0, '-3.985')] [2022-07-10 10:56:12,637][26022] Updated weights on worker 0-0, policy_version 693565 (0.00085) [2022-07-10 10:56:14,358][26022] Updated weights on worker 0-0, policy_version 693575 (0.00087) [2022-07-10 10:56:16,084][25689] Fps is (10 sec: 5699.8, 60 sec: 5596.1, 300 sec: 5560.7). Total num frames: 710230016. Throughput: 0: 4912.2. Samples: 710227654. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:16,085][25689] Avg episode reward: [(0, '-2.393')] [2022-07-10 10:56:16,183][26022] Updated weights on worker 0-0, policy_version 693585 (0.00092) [2022-07-10 10:56:17,979][26022] Updated weights on worker 0-0, policy_version 693595 (0.00084) [2022-07-10 10:56:19,801][26022] Updated weights on worker 0-0, policy_version 693605 (0.00086) [2022-07-10 10:56:21,108][25689] Fps is (10 sec: 5511.1, 60 sec: 5544.2, 300 sec: 5559.0). Total num frames: 710257664. Throughput: 0: 5731.4. Samples: 710261460. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:21,108][25689] Avg episode reward: [(0, '-1.783')] [2022-07-10 10:56:21,619][26022] Updated weights on worker 0-0, policy_version 693615 (0.00093) [2022-07-10 10:56:23,644][26022] Updated weights on worker 0-0, policy_version 693625 (0.00091) [2022-07-10 10:56:25,380][26022] Updated weights on worker 0-0, policy_version 693635 (0.00083) [2022-07-10 10:56:25,580][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:56:25,593][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000693636_710283264.pth [2022-07-10 10:56:25,593][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000691680_708280320.pth [2022-07-10 10:56:26,216][25689] Fps is (10 sec: 5558.7, 60 sec: 5594.8, 300 sec: 5561.2). Total num frames: 710286336. Throughput: 0: 5805.9. Samples: 710294556. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:26,217][25689] Avg episode reward: [(0, '-3.273')] [2022-07-10 10:56:27,292][26022] Updated weights on worker 0-0, policy_version 693645 (0.00095) [2022-07-10 10:56:29,188][26022] Updated weights on worker 0-0, policy_version 693655 (0.00102) [2022-07-10 10:56:30,897][26022] Updated weights on worker 0-0, policy_version 693665 (0.00088) [2022-07-10 10:56:31,254][25689] Fps is (10 sec: 5551.0, 60 sec: 5575.9, 300 sec: 5561.8). Total num frames: 710313984. Throughput: 0: 4967.3. Samples: 710311152. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:31,254][25689] Avg episode reward: [(0, '-4.112')] [2022-07-10 10:56:32,798][26022] Updated weights on worker 0-0, policy_version 693675 (0.00087) [2022-07-10 10:56:34,732][26022] Updated weights on worker 0-0, policy_version 693685 (0.00086) [2022-07-10 10:56:36,264][25689] Fps is (10 sec: 5503.1, 60 sec: 5561.5, 300 sec: 5561.8). Total num frames: 710341632. Throughput: 0: 5792.5. Samples: 710344546. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:36,265][25689] Avg episode reward: [(0, '-5.405')] [2022-07-10 10:56:36,483][26022] Updated weights on worker 0-0, policy_version 693695 (0.00085) [2022-07-10 10:56:38,475][26022] Updated weights on worker 0-0, policy_version 693705 (0.00953) [2022-07-10 10:56:40,060][26022] Updated weights on worker 0-0, policy_version 693715 (0.00086) [2022-07-10 10:56:41,288][25689] Fps is (10 sec: 5510.6, 60 sec: 5559.6, 300 sec: 5555.7). Total num frames: 710369280. Throughput: 0: 5770.3. Samples: 710377906. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:41,289][25689] Avg episode reward: [(0, '-4.887')] [2022-07-10 10:56:42,287][26022] Updated weights on worker 0-0, policy_version 693725 (0.00091) [2022-07-10 10:56:43,888][26022] Updated weights on worker 0-0, policy_version 693735 (0.00086) [2022-07-10 10:56:45,698][26022] Updated weights on worker 0-0, policy_version 693745 (0.00088) [2022-07-10 10:56:46,414][25689] Fps is (10 sec: 5750.7, 60 sec: 5578.7, 300 sec: 5562.0). Total num frames: 710400000. Throughput: 0: 4960.2. Samples: 710394742. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:46,415][25689] Avg episode reward: [(0, '-4.876')] [2022-07-10 10:56:47,449][26022] Updated weights on worker 0-0, policy_version 693755 (0.00089) [2022-07-10 10:56:49,253][26022] Updated weights on worker 0-0, policy_version 693765 (0.00089) [2022-07-10 10:56:51,066][26022] Updated weights on worker 0-0, policy_version 693775 (0.00095) [2022-07-10 10:56:51,441][25689] Fps is (10 sec: 5547.4, 60 sec: 5527.8, 300 sec: 5551.8). Total num frames: 710425600. Throughput: 0: 5800.4. Samples: 710428244. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:51,441][25689] Avg episode reward: [(0, '-3.417')] [2022-07-10 10:56:53,158][26022] Updated weights on worker 0-0, policy_version 693785 (0.00086) [2022-07-10 10:56:54,814][26022] Updated weights on worker 0-0, policy_version 693795 (0.00095) [2022-07-10 10:56:56,484][25689] Fps is (10 sec: 5287.7, 60 sec: 5508.9, 300 sec: 5554.6). Total num frames: 710453248. Throughput: 0: 5779.6. Samples: 710461408. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:56:56,485][25689] Avg episode reward: [(0, '-2.558')] [2022-07-10 10:56:56,663][26022] Updated weights on worker 0-0, policy_version 693805 (0.00086) [2022-07-10 10:56:58,511][26022] Updated weights on worker 0-0, policy_version 693815 (0.00091) [2022-07-10 10:57:00,537][26022] Updated weights on worker 0-0, policy_version 693826 (0.00084) [2022-07-10 10:57:01,512][25689] Fps is (10 sec: 5693.7, 60 sec: 5541.1, 300 sec: 5562.4). Total num frames: 710482944. Throughput: 0: 4966.1. Samples: 710478340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:01,513][25689] Avg episode reward: [(0, '-2.458')] [2022-07-10 10:57:02,695][26022] Updated weights on worker 0-0, policy_version 693836 (0.00087) [2022-07-10 10:57:04,668][26022] Updated weights on worker 0-0, policy_version 693846 (0.00093) [2022-07-10 10:57:06,449][26022] Updated weights on worker 0-0, policy_version 693856 (0.00085) [2022-07-10 10:57:06,598][25689] Fps is (10 sec: 5569.0, 60 sec: 5561.7, 300 sec: 5561.0). Total num frames: 710509568. Throughput: 0: 5712.8. Samples: 710510046. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:06,598][25689] Avg episode reward: [(0, '-2.301')] [2022-07-10 10:57:08,316][26022] Updated weights on worker 0-0, policy_version 693866 (0.00085) [2022-07-10 10:57:10,202][26022] Updated weights on worker 0-0, policy_version 693876 (0.00088) [2022-07-10 10:57:11,631][25689] Fps is (10 sec: 5363.4, 60 sec: 5529.5, 300 sec: 5557.0). Total num frames: 710537216. Throughput: 0: 5705.2. Samples: 710543434. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:11,632][25689] Avg episode reward: [(0, '-3.763')] [2022-07-10 10:57:12,004][26022] Updated weights on worker 0-0, policy_version 693886 (0.00090) [2022-07-10 10:57:13,849][26022] Updated weights on worker 0-0, policy_version 693896 (0.00088) [2022-07-10 10:57:15,473][26022] Updated weights on worker 0-0, policy_version 693906 (0.00084) [2022-07-10 10:57:16,640][25689] Fps is (10 sec: 5608.3, 60 sec: 5546.5, 300 sec: 5557.4). Total num frames: 710565888. Throughput: 0: 4916.3. Samples: 710560498. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:16,640][25689] Avg episode reward: [(0, '-3.518')] [2022-07-10 10:57:17,562][26022] Updated weights on worker 0-0, policy_version 693916 (0.00094) [2022-07-10 10:57:19,095][26022] Updated weights on worker 0-0, policy_version 693926 (0.00089) [2022-07-10 10:57:21,187][26022] Updated weights on worker 0-0, policy_version 693936 (0.00085) [2022-07-10 10:57:21,703][25689] Fps is (10 sec: 5490.3, 60 sec: 5526.0, 300 sec: 5553.9). Total num frames: 710592512. Throughput: 0: 5736.1. Samples: 710594154. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:21,703][25689] Avg episode reward: [(0, '-2.857')] [2022-07-10 10:57:22,739][26022] Updated weights on worker 0-0, policy_version 693946 (0.00093) [2022-07-10 10:57:24,827][26022] Updated weights on worker 0-0, policy_version 693956 (0.00085) [2022-07-10 10:57:26,437][26022] Updated weights on worker 0-0, policy_version 693966 (0.00087) [2022-07-10 10:57:26,795][25689] Fps is (10 sec: 5545.9, 60 sec: 5544.4, 300 sec: 5559.1). Total num frames: 710622208. Throughput: 0: 5824.1. Samples: 710627678. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:26,796][25689] Avg episode reward: [(0, '-3.494')] [2022-07-10 10:57:28,476][26022] Updated weights on worker 0-0, policy_version 693976 (0.00106) [2022-07-10 10:57:30,213][26022] Updated weights on worker 0-0, policy_version 693986 (0.00089) [2022-07-10 10:57:31,827][25689] Fps is (10 sec: 5663.9, 60 sec: 5544.9, 300 sec: 5562.2). Total num frames: 710649856. Throughput: 0: 5003.3. Samples: 710644480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:31,827][25689] Avg episode reward: [(0, '-2.410')] [2022-07-10 10:57:32,035][26022] Updated weights on worker 0-0, policy_version 693996 (0.00065) [2022-07-10 10:57:33,992][26022] Updated weights on worker 0-0, policy_version 694006 (0.00086) [2022-07-10 10:57:35,903][26022] Updated weights on worker 0-0, policy_version 694016 (0.00085) [2022-07-10 10:57:36,863][25689] Fps is (10 sec: 5492.4, 60 sec: 5542.6, 300 sec: 5554.8). Total num frames: 710677504. Throughput: 0: 5810.3. Samples: 710678000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:36,863][25689] Avg episode reward: [(0, '-2.704')] [2022-07-10 10:57:37,447][26022] Updated weights on worker 0-0, policy_version 694026 (0.00084) [2022-07-10 10:57:39,700][26022] Updated weights on worker 0-0, policy_version 694036 (0.00084) [2022-07-10 10:57:40,996][26022] Updated weights on worker 0-0, policy_version 694046 (0.00095) [2022-07-10 10:57:41,880][25689] Fps is (10 sec: 5602.3, 60 sec: 5560.1, 300 sec: 5557.0). Total num frames: 710706176. Throughput: 0: 5795.1. Samples: 710711084. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:41,881][25689] Avg episode reward: [(0, '-2.454')] [2022-07-10 10:57:43,275][26022] Updated weights on worker 0-0, policy_version 694056 (0.00086) [2022-07-10 10:57:44,888][26022] Updated weights on worker 0-0, policy_version 694066 (0.00083) [2022-07-10 10:57:46,886][26022] Updated weights on worker 0-0, policy_version 694076 (0.00092) [2022-07-10 10:57:46,953][25689] Fps is (10 sec: 5683.2, 60 sec: 5531.1, 300 sec: 5559.7). Total num frames: 710734848. Throughput: 0: 4974.6. Samples: 710727956. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:46,954][25689] Avg episode reward: [(0, '-3.330')] [2022-07-10 10:57:48,815][26022] Updated weights on worker 0-0, policy_version 694086 (0.00084) [2022-07-10 10:57:50,415][26022] Updated weights on worker 0-0, policy_version 694096 (0.00084) [2022-07-10 10:57:52,035][25689] Fps is (10 sec: 5545.9, 60 sec: 5559.8, 300 sec: 5555.2). Total num frames: 710762496. Throughput: 0: 5789.9. Samples: 710761484. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:52,036][25689] Avg episode reward: [(0, '-2.346')] [2022-07-10 10:57:52,364][26022] Updated weights on worker 0-0, policy_version 694106 (0.00092) [2022-07-10 10:57:54,240][26022] Updated weights on worker 0-0, policy_version 694116 (0.01012) [2022-07-10 10:57:56,031][26022] Updated weights on worker 0-0, policy_version 694126 (0.00091) [2022-07-10 10:57:57,053][25689] Fps is (10 sec: 5576.0, 60 sec: 5579.1, 300 sec: 5558.5). Total num frames: 710791168. Throughput: 0: 5794.5. Samples: 710794994. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 10:57:57,054][25689] Avg episode reward: [(0, '-2.917')] [2022-07-10 10:57:57,915][26022] Updated weights on worker 0-0, policy_version 694136 (0.00076) [2022-07-10 10:57:59,643][26022] Updated weights on worker 0-0, policy_version 694146 (0.00096) [2022-07-10 10:58:01,500][26022] Updated weights on worker 0-0, policy_version 694156 (0.00094) [2022-07-10 10:58:02,059][25689] Fps is (10 sec: 5414.3, 60 sec: 5513.5, 300 sec: 5560.2). Total num frames: 710816768. Throughput: 0: 4993.1. Samples: 710811840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:02,061][25689] Avg episode reward: [(0, '-2.397')] [2022-07-10 10:58:03,732][26022] Updated weights on worker 0-0, policy_version 694166 (0.00055) [2022-07-10 10:58:05,830][26022] Updated weights on worker 0-0, policy_version 694176 (0.00090) [2022-07-10 10:58:07,152][25689] Fps is (10 sec: 5374.5, 60 sec: 5546.7, 300 sec: 5558.9). Total num frames: 710845440. Throughput: 0: 5704.7. Samples: 710843182. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:07,152][25689] Avg episode reward: [(0, '-2.497')] [2022-07-10 10:58:07,309][26022] Updated weights on worker 0-0, policy_version 694186 (0.00078) [2022-07-10 10:58:09,215][26022] Updated weights on worker 0-0, policy_version 694196 (0.00086) [2022-07-10 10:58:11,184][26022] Updated weights on worker 0-0, policy_version 694206 (0.00088) [2022-07-10 10:58:12,227][25689] Fps is (10 sec: 5438.1, 60 sec: 5525.9, 300 sec: 5554.5). Total num frames: 710872064. Throughput: 0: 5700.7. Samples: 710876592. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:12,228][25689] Avg episode reward: [(0, '-1.528')] [2022-07-10 10:58:12,903][26022] Updated weights on worker 0-0, policy_version 694216 (0.00088) [2022-07-10 10:58:14,835][26022] Updated weights on worker 0-0, policy_version 694226 (0.00089) [2022-07-10 10:58:16,465][26022] Updated weights on worker 0-0, policy_version 694236 (0.00086) [2022-07-10 10:58:17,259][25689] Fps is (10 sec: 5470.7, 60 sec: 5523.8, 300 sec: 5551.9). Total num frames: 710900736. Throughput: 0: 5707.7. Samples: 710910322. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:17,260][25689] Avg episode reward: [(0, '-1.232')] [2022-07-10 10:58:18,511][26022] Updated weights on worker 0-0, policy_version 694246 (0.00083) [2022-07-10 10:58:20,264][26022] Updated weights on worker 0-0, policy_version 694256 (0.00095) [2022-07-10 10:58:21,926][26022] Updated weights on worker 0-0, policy_version 694266 (0.00090) [2022-07-10 10:58:22,304][25689] Fps is (10 sec: 5792.5, 60 sec: 5576.1, 300 sec: 5559.6). Total num frames: 710930432. Throughput: 0: 5690.4. Samples: 710927040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:22,305][25689] Avg episode reward: [(0, '-2.445')] [2022-07-10 10:58:23,915][26022] Updated weights on worker 0-0, policy_version 694276 (0.00090) [2022-07-10 10:58:25,495][26022] Updated weights on worker 0-0, policy_version 694286 (0.00091) [2022-07-10 10:58:25,862][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 10:58:25,874][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000694287_710949888.pth [2022-07-10 10:58:25,875][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000692330_708945920.pth [2022-07-10 10:58:27,370][25689] Fps is (10 sec: 5570.2, 60 sec: 5527.8, 300 sec: 5551.7). Total num frames: 710957056. Throughput: 0: 5807.6. Samples: 710960604. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:27,371][25689] Avg episode reward: [(0, '-3.727')] [2022-07-10 10:58:27,554][26022] Updated weights on worker 0-0, policy_version 694296 (0.00092) [2022-07-10 10:58:29,230][26022] Updated weights on worker 0-0, policy_version 694306 (0.00085) [2022-07-10 10:58:31,237][26022] Updated weights on worker 0-0, policy_version 694316 (0.00082) [2022-07-10 10:58:32,431][25689] Fps is (10 sec: 5561.5, 60 sec: 5559.0, 300 sec: 5557.5). Total num frames: 710986752. Throughput: 0: 5815.2. Samples: 710994078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:32,432][25689] Avg episode reward: [(0, '-4.136')] [2022-07-10 10:58:33,004][26022] Updated weights on worker 0-0, policy_version 694326 (0.00087) [2022-07-10 10:58:34,872][26022] Updated weights on worker 0-0, policy_version 694336 (0.00086) [2022-07-10 10:58:36,692][26022] Updated weights on worker 0-0, policy_version 694346 (0.00088) [2022-07-10 10:58:37,493][25689] Fps is (10 sec: 5563.4, 60 sec: 5539.6, 300 sec: 5553.4). Total num frames: 711013376. Throughput: 0: 4981.1. Samples: 711011110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:37,494][25689] Avg episode reward: [(0, '-4.656')] [2022-07-10 10:58:38,483][26022] Updated weights on worker 0-0, policy_version 694356 (0.00080) [2022-07-10 10:58:40,331][26022] Updated weights on worker 0-0, policy_version 694366 (0.00083) [2022-07-10 10:58:42,183][26022] Updated weights on worker 0-0, policy_version 694376 (0.00083) [2022-07-10 10:58:42,505][25689] Fps is (10 sec: 5590.6, 60 sec: 5557.1, 300 sec: 5557.8). Total num frames: 711043072. Throughput: 0: 5828.5. Samples: 711044780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:42,505][25689] Avg episode reward: [(0, '-5.607')] [2022-07-10 10:58:43,911][26022] Updated weights on worker 0-0, policy_version 694386 (0.00083) [2022-07-10 10:58:45,768][26022] Updated weights on worker 0-0, policy_version 694396 (0.00089) [2022-07-10 10:58:47,546][25689] Fps is (10 sec: 5704.6, 60 sec: 5543.1, 300 sec: 5554.4). Total num frames: 711070720. Throughput: 0: 5848.6. Samples: 711078602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:47,546][25689] Avg episode reward: [(0, '-4.286')] [2022-07-10 10:58:47,667][26022] Updated weights on worker 0-0, policy_version 694406 (0.00086) [2022-07-10 10:58:49,435][26022] Updated weights on worker 0-0, policy_version 694416 (0.00083) [2022-07-10 10:58:51,207][26022] Updated weights on worker 0-0, policy_version 694426 (0.00090) [2022-07-10 10:58:52,574][25689] Fps is (10 sec: 5491.6, 60 sec: 5548.1, 300 sec: 5557.8). Total num frames: 711098368. Throughput: 0: 5040.5. Samples: 711095610. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:52,574][25689] Avg episode reward: [(0, '-4.237')] [2022-07-10 10:58:53,270][26022] Updated weights on worker 0-0, policy_version 694436 (0.00091) [2022-07-10 10:58:55,015][26022] Updated weights on worker 0-0, policy_version 694446 (0.00089) [2022-07-10 10:58:56,890][26022] Updated weights on worker 0-0, policy_version 694456 (0.00091) [2022-07-10 10:58:57,592][25689] Fps is (10 sec: 5606.3, 60 sec: 5548.1, 300 sec: 5554.5). Total num frames: 711127040. Throughput: 0: 5869.7. Samples: 711129080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:58:57,592][25689] Avg episode reward: [(0, '-2.671')] [2022-07-10 10:58:58,569][26022] Updated weights on worker 0-0, policy_version 694466 (0.00086) [2022-07-10 10:59:00,498][26022] Updated weights on worker 0-0, policy_version 694476 (0.00626) [2022-07-10 10:59:02,611][25689] Fps is (10 sec: 5406.9, 60 sec: 5546.8, 300 sec: 5555.3). Total num frames: 711152640. Throughput: 0: 5747.9. Samples: 711160352. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:02,612][25689] Avg episode reward: [(0, '-2.488')] [2022-07-10 10:59:02,680][26022] Updated weights on worker 0-0, policy_version 694486 (0.00084) [2022-07-10 10:59:04,587][26022] Updated weights on worker 0-0, policy_version 694496 (0.00089) [2022-07-10 10:59:06,409][26022] Updated weights on worker 0-0, policy_version 694506 (0.01135) [2022-07-10 10:59:07,650][25689] Fps is (10 sec: 5294.0, 60 sec: 5534.8, 300 sec: 5552.6). Total num frames: 711180288. Throughput: 0: 4899.0. Samples: 711177092. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:07,651][25689] Avg episode reward: [(0, '-2.302')] [2022-07-10 10:59:08,217][26022] Updated weights on worker 0-0, policy_version 694516 (0.00091) [2022-07-10 10:59:10,079][26022] Updated weights on worker 0-0, policy_version 694526 (0.00087) [2022-07-10 10:59:12,079][26022] Updated weights on worker 0-0, policy_version 694536 (0.00085) [2022-07-10 10:59:12,652][25689] Fps is (10 sec: 5507.3, 60 sec: 5558.5, 300 sec: 5549.3). Total num frames: 711207936. Throughput: 0: 5703.2. Samples: 711210120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:12,652][25689] Avg episode reward: [(0, '-1.716')] [2022-07-10 10:59:13,657][26022] Updated weights on worker 0-0, policy_version 694546 (0.00082) [2022-07-10 10:59:15,835][26022] Updated weights on worker 0-0, policy_version 694556 (0.00084) [2022-07-10 10:59:17,312][26022] Updated weights on worker 0-0, policy_version 694566 (0.00085) [2022-07-10 10:59:17,681][25689] Fps is (10 sec: 5614.6, 60 sec: 5558.8, 300 sec: 5552.3). Total num frames: 711236608. Throughput: 0: 5712.1. Samples: 711243832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:17,681][25689] Avg episode reward: [(0, '-2.903')] [2022-07-10 10:59:19,364][26022] Updated weights on worker 0-0, policy_version 694576 (0.00095) [2022-07-10 10:59:21,336][26022] Updated weights on worker 0-0, policy_version 694586 (0.00087) [2022-07-10 10:59:22,708][25689] Fps is (10 sec: 5498.9, 60 sec: 5509.6, 300 sec: 5546.6). Total num frames: 711263232. Throughput: 0: 4990.6. Samples: 711260644. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:22,709][25689] Avg episode reward: [(0, '-3.495')] [2022-07-10 10:59:22,963][26022] Updated weights on worker 0-0, policy_version 694596 (0.00086) [2022-07-10 10:59:24,948][26022] Updated weights on worker 0-0, policy_version 694606 (0.00090) [2022-07-10 10:59:26,548][26022] Updated weights on worker 0-0, policy_version 694616 (0.00085) [2022-07-10 10:59:27,829][25689] Fps is (10 sec: 5549.7, 60 sec: 5555.4, 300 sec: 5548.0). Total num frames: 711292928. Throughput: 0: 5793.2. Samples: 711293994. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:27,830][25689] Avg episode reward: [(0, '-3.880')] [2022-07-10 10:59:28,556][26022] Updated weights on worker 0-0, policy_version 694626 (0.00085) [2022-07-10 10:59:30,392][26022] Updated weights on worker 0-0, policy_version 694636 (0.00087) [2022-07-10 10:59:32,176][26022] Updated weights on worker 0-0, policy_version 694646 (0.00089) [2022-07-10 10:59:32,840][25689] Fps is (10 sec: 5659.3, 60 sec: 5526.0, 300 sec: 5551.4). Total num frames: 711320576. Throughput: 0: 5803.4. Samples: 711327282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:32,841][25689] Avg episode reward: [(0, '-3.287')] [2022-07-10 10:59:34,109][26022] Updated weights on worker 0-0, policy_version 694656 (0.00086) [2022-07-10 10:59:35,878][26022] Updated weights on worker 0-0, policy_version 694666 (0.00084) [2022-07-10 10:59:37,841][26022] Updated weights on worker 0-0, policy_version 694676 (0.00089) [2022-07-10 10:59:37,939][25689] Fps is (10 sec: 5469.3, 60 sec: 5539.6, 300 sec: 5553.8). Total num frames: 711348224. Throughput: 0: 4933.8. Samples: 711343784. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:37,940][25689] Avg episode reward: [(0, '-3.779')] [2022-07-10 10:59:39,582][26022] Updated weights on worker 0-0, policy_version 694686 (0.00081) [2022-07-10 10:59:41,613][26022] Updated weights on worker 0-0, policy_version 694696 (0.00092) [2022-07-10 10:59:42,967][25689] Fps is (10 sec: 5561.8, 60 sec: 5521.2, 300 sec: 5548.8). Total num frames: 711376896. Throughput: 0: 5753.0. Samples: 711377194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:42,967][25689] Avg episode reward: [(0, '-3.965')] [2022-07-10 10:59:43,195][26022] Updated weights on worker 0-0, policy_version 694706 (0.00086) [2022-07-10 10:59:45,401][26022] Updated weights on worker 0-0, policy_version 694716 (0.00086) [2022-07-10 10:59:46,797][26022] Updated weights on worker 0-0, policy_version 694726 (0.00090) [2022-07-10 10:59:48,052][25689] Fps is (10 sec: 5670.4, 60 sec: 5534.1, 300 sec: 5551.0). Total num frames: 711405568. Throughput: 0: 5773.3. Samples: 711410748. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:48,053][25689] Avg episode reward: [(0, '-2.302')] [2022-07-10 10:59:49,044][26022] Updated weights on worker 0-0, policy_version 694736 (0.00100) [2022-07-10 10:59:50,443][26022] Updated weights on worker 0-0, policy_version 694746 (0.00082) [2022-07-10 10:59:52,645][26022] Updated weights on worker 0-0, policy_version 694756 (0.00084) [2022-07-10 10:59:53,153][25689] Fps is (10 sec: 5428.6, 60 sec: 5510.6, 300 sec: 5542.3). Total num frames: 711432192. Throughput: 0: 4939.6. Samples: 711427616. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:53,153][25689] Avg episode reward: [(0, '-2.834')] [2022-07-10 10:59:54,128][26022] Updated weights on worker 0-0, policy_version 694766 (0.00087) [2022-07-10 10:59:56,277][26022] Updated weights on worker 0-0, policy_version 694776 (0.00088) [2022-07-10 10:59:58,026][26022] Updated weights on worker 0-0, policy_version 694786 (0.00086) [2022-07-10 10:59:58,208][25689] Fps is (10 sec: 5545.7, 60 sec: 5524.0, 300 sec: 5552.5). Total num frames: 711461888. Throughput: 0: 5788.6. Samples: 711461110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 10:59:58,209][25689] Avg episode reward: [(0, '-2.985')] [2022-07-10 11:00:00,051][26022] Updated weights on worker 0-0, policy_version 694796 (0.00085) [2022-07-10 11:00:01,940][26022] Updated weights on worker 0-0, policy_version 694806 (0.00082) [2022-07-10 11:00:03,226][25689] Fps is (10 sec: 5387.6, 60 sec: 5507.3, 300 sec: 5539.6). Total num frames: 711486464. Throughput: 0: 5697.8. Samples: 711492630. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:03,227][25689] Avg episode reward: [(0, '-3.401')] [2022-07-10 11:00:03,894][26022] Updated weights on worker 0-0, policy_version 694816 (0.00085) [2022-07-10 11:00:05,646][26022] Updated weights on worker 0-0, policy_version 694826 (0.00089) [2022-07-10 11:00:07,595][26022] Updated weights on worker 0-0, policy_version 694836 (0.00085) [2022-07-10 11:00:08,337][25689] Fps is (10 sec: 5560.4, 60 sec: 5568.3, 300 sec: 5555.0). Total num frames: 711518208. Throughput: 0: 4861.2. Samples: 711509354. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:08,337][25689] Avg episode reward: [(0, '-3.940')] [2022-07-10 11:00:09,448][26022] Updated weights on worker 0-0, policy_version 694846 (0.00087) [2022-07-10 11:00:11,351][26022] Updated weights on worker 0-0, policy_version 694856 (0.00086) [2022-07-10 11:00:13,099][26022] Updated weights on worker 0-0, policy_version 694866 (0.00090) [2022-07-10 11:00:13,370][25689] Fps is (10 sec: 5653.0, 60 sec: 5531.6, 300 sec: 5548.8). Total num frames: 711543808. Throughput: 0: 5701.4. Samples: 711542884. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:13,371][25689] Avg episode reward: [(0, '-3.205')] [2022-07-10 11:00:14,919][26022] Updated weights on worker 0-0, policy_version 694876 (0.00089) [2022-07-10 11:00:16,726][26022] Updated weights on worker 0-0, policy_version 694886 (0.00082) [2022-07-10 11:00:18,400][25689] Fps is (10 sec: 5494.9, 60 sec: 5548.4, 300 sec: 5545.0). Total num frames: 711573504. Throughput: 0: 5727.7. Samples: 711576762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:18,400][25689] Avg episode reward: [(0, '-3.778')] [2022-07-10 11:00:18,403][26022] Updated weights on worker 0-0, policy_version 694896 (0.00086) [2022-07-10 11:00:20,374][26022] Updated weights on worker 0-0, policy_version 694906 (0.00095) [2022-07-10 11:00:22,215][26022] Updated weights on worker 0-0, policy_version 694916 (0.00101) [2022-07-10 11:00:23,439][25689] Fps is (10 sec: 5695.3, 60 sec: 5564.2, 300 sec: 5553.2). Total num frames: 711601152. Throughput: 0: 4993.0. Samples: 711593552. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:23,439][25689] Avg episode reward: [(0, '-2.431')] [2022-07-10 11:00:24,085][26022] Updated weights on worker 0-0, policy_version 694926 (0.00090) [2022-07-10 11:00:25,891][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:00:25,911][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000694936_711614464.pth [2022-07-10 11:00:25,911][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000692982_709613568.pth [2022-07-10 11:00:25,919][26022] Updated weights on worker 0-0, policy_version 694936 (0.00083) [2022-07-10 11:00:27,780][26022] Updated weights on worker 0-0, policy_version 694946 (0.00090) [2022-07-10 11:00:28,494][25689] Fps is (10 sec: 5376.5, 60 sec: 5519.6, 300 sec: 5545.6). Total num frames: 711627776. Throughput: 0: 5827.9. Samples: 711626830. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:28,495][25689] Avg episode reward: [(0, '-3.868')] [2022-07-10 11:00:29,552][26022] Updated weights on worker 0-0, policy_version 694956 (0.00088) [2022-07-10 11:00:31,433][26022] Updated weights on worker 0-0, policy_version 694966 (0.00086) [2022-07-10 11:00:33,364][26022] Updated weights on worker 0-0, policy_version 694976 (0.00092) [2022-07-10 11:00:33,504][25689] Fps is (10 sec: 5493.8, 60 sec: 5536.6, 300 sec: 5546.1). Total num frames: 711656448. Throughput: 0: 5818.2. Samples: 711660026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:33,504][25689] Avg episode reward: [(0, '-4.155')] [2022-07-10 11:00:35,002][26022] Updated weights on worker 0-0, policy_version 694986 (0.00092) [2022-07-10 11:00:37,021][26022] Updated weights on worker 0-0, policy_version 694996 (0.00094) [2022-07-10 11:00:38,515][25689] Fps is (10 sec: 5722.4, 60 sec: 5561.6, 300 sec: 5549.4). Total num frames: 711685120. Throughput: 0: 5813.9. Samples: 711693710. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:38,516][25689] Avg episode reward: [(0, '-3.062')] [2022-07-10 11:00:38,795][26022] Updated weights on worker 0-0, policy_version 695006 (0.00086) [2022-07-10 11:00:40,711][26022] Updated weights on worker 0-0, policy_version 695016 (0.00091) [2022-07-10 11:00:42,298][26022] Updated weights on worker 0-0, policy_version 695026 (0.00086) [2022-07-10 11:00:43,521][25689] Fps is (10 sec: 5418.2, 60 sec: 5512.8, 300 sec: 5538.3). Total num frames: 711710720. Throughput: 0: 5828.4. Samples: 711710596. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:43,522][25689] Avg episode reward: [(0, '-4.888')] [2022-07-10 11:00:44,305][26022] Updated weights on worker 0-0, policy_version 695036 (0.00085) [2022-07-10 11:00:46,225][26022] Updated weights on worker 0-0, policy_version 695046 (0.00088) [2022-07-10 11:00:47,931][26022] Updated weights on worker 0-0, policy_version 695056 (0.00079) [2022-07-10 11:00:48,573][25689] Fps is (10 sec: 5396.1, 60 sec: 5515.9, 300 sec: 5537.8). Total num frames: 711739392. Throughput: 0: 5835.7. Samples: 711744004. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:48,573][25689] Avg episode reward: [(0, '-4.304')] [2022-07-10 11:00:49,799][26022] Updated weights on worker 0-0, policy_version 695066 (0.00083) [2022-07-10 11:00:51,539][26022] Updated weights on worker 0-0, policy_version 695076 (0.00090) [2022-07-10 11:00:53,540][26022] Updated weights on worker 0-0, policy_version 695086 (0.00085) [2022-07-10 11:00:53,578][25689] Fps is (10 sec: 5701.6, 60 sec: 5558.5, 300 sec: 5538.1). Total num frames: 711768064. Throughput: 0: 5872.1. Samples: 711777904. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:53,580][25689] Avg episode reward: [(0, '-4.349')] [2022-07-10 11:00:55,183][26022] Updated weights on worker 0-0, policy_version 695096 (0.00083) [2022-07-10 11:00:57,013][26022] Updated weights on worker 0-0, policy_version 695106 (0.00079) [2022-07-10 11:00:58,588][25689] Fps is (10 sec: 5828.1, 60 sec: 5562.7, 300 sec: 5545.0). Total num frames: 711797760. Throughput: 0: 5034.7. Samples: 711794770. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:00:58,589][25689] Avg episode reward: [(0, '-3.069')] [2022-07-10 11:00:58,724][26022] Updated weights on worker 0-0, policy_version 695116 (0.00090) [2022-07-10 11:01:00,877][26022] Updated weights on worker 0-0, policy_version 695126 (0.00085) [2022-07-10 11:01:02,761][26022] Updated weights on worker 0-0, policy_version 695136 (0.00080) [2022-07-10 11:01:03,601][25689] Fps is (10 sec: 5414.6, 60 sec: 5563.1, 300 sec: 5543.6). Total num frames: 711822336. Throughput: 0: 5762.5. Samples: 711826312. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:01:03,602][25689] Avg episode reward: [(0, '-3.019')] [2022-07-10 11:01:04,883][26022] Updated weights on worker 0-0, policy_version 695146 (0.00694) [2022-07-10 11:01:06,567][26022] Updated weights on worker 0-0, policy_version 695156 (0.00095) [2022-07-10 11:01:08,533][26022] Updated weights on worker 0-0, policy_version 695166 (0.00085) [2022-07-10 11:01:08,660][25689] Fps is (10 sec: 5286.8, 60 sec: 5517.0, 300 sec: 5540.1). Total num frames: 711851008. Throughput: 0: 5770.9. Samples: 711859922. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:01:08,660][25689] Avg episode reward: [(0, '-2.017')] [2022-07-10 11:01:10,351][26022] Updated weights on worker 0-0, policy_version 695176 (0.00093) [2022-07-10 11:01:12,083][26022] Updated weights on worker 0-0, policy_version 695186 (0.00090) [2022-07-10 11:01:13,751][25689] Fps is (10 sec: 5549.1, 60 sec: 5545.6, 300 sec: 5538.6). Total num frames: 711878656. Throughput: 0: 4902.4. Samples: 711876798. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:01:13,751][25689] Avg episode reward: [(0, '-2.378')] [2022-07-10 11:01:13,897][26022] Updated weights on worker 0-0, policy_version 695196 (0.00094) [2022-07-10 11:01:15,862][26022] Updated weights on worker 0-0, policy_version 695206 (0.00092) [2022-07-10 11:01:17,403][26022] Updated weights on worker 0-0, policy_version 695216 (0.00092) [2022-07-10 11:01:18,831][25689] Fps is (10 sec: 5537.2, 60 sec: 5524.0, 300 sec: 5540.9). Total num frames: 711907328. Throughput: 0: 5706.4. Samples: 711910286. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 11:01:18,831][25689] Avg episode reward: [(0, '-3.067')] [2022-07-10 11:01:19,567][26022] Updated weights on worker 0-0, policy_version 695226 (0.00080) [2022-07-10 11:01:21,295][26022] Updated weights on worker 0-0, policy_version 695236 (0.00082) [2022-07-10 11:01:23,192][26022] Updated weights on worker 0-0, policy_version 695246 (0.00104) [2022-07-10 11:01:23,835][25689] Fps is (10 sec: 5788.2, 60 sec: 5561.2, 300 sec: 5546.3). Total num frames: 711937024. Throughput: 0: 5807.4. Samples: 711943816. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:23,835][25689] Avg episode reward: [(0, '-2.442')] [2022-07-10 11:01:25,016][26022] Updated weights on worker 0-0, policy_version 695256 (0.00084) [2022-07-10 11:01:26,820][26022] Updated weights on worker 0-0, policy_version 695266 (0.00090) [2022-07-10 11:01:28,641][26022] Updated weights on worker 0-0, policy_version 695276 (0.00082) [2022-07-10 11:01:28,894][25689] Fps is (10 sec: 5596.7, 60 sec: 5560.8, 300 sec: 5542.5). Total num frames: 711963648. Throughput: 0: 4971.5. Samples: 711960518. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:28,896][25689] Avg episode reward: [(0, '-3.508')] [2022-07-10 11:01:30,382][26022] Updated weights on worker 0-0, policy_version 695286 (0.00095) [2022-07-10 11:01:32,256][26022] Updated weights on worker 0-0, policy_version 695296 (0.00086) [2022-07-10 11:01:33,991][25689] Fps is (10 sec: 5444.6, 60 sec: 5552.8, 300 sec: 5544.3). Total num frames: 711992320. Throughput: 0: 5806.4. Samples: 711994320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:33,994][25689] Avg episode reward: [(0, '-3.655')] [2022-07-10 11:01:34,091][26022] Updated weights on worker 0-0, policy_version 695306 (0.00095) [2022-07-10 11:01:35,747][26022] Updated weights on worker 0-0, policy_version 695316 (0.00054) [2022-07-10 11:01:37,823][26022] Updated weights on worker 0-0, policy_version 695326 (0.00088) [2022-07-10 11:01:39,007][25689] Fps is (10 sec: 5670.2, 60 sec: 5552.3, 300 sec: 5547.9). Total num frames: 712020992. Throughput: 0: 5832.0. Samples: 712027954. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:39,008][25689] Avg episode reward: [(0, '-4.578')] [2022-07-10 11:01:39,592][26022] Updated weights on worker 0-0, policy_version 695336 (0.00094) [2022-07-10 11:01:41,491][26022] Updated weights on worker 0-0, policy_version 695346 (0.00096) [2022-07-10 11:01:43,213][26022] Updated weights on worker 0-0, policy_version 695356 (0.00081) [2022-07-10 11:01:44,009][25689] Fps is (10 sec: 5519.4, 60 sec: 5569.6, 300 sec: 5536.4). Total num frames: 712047616. Throughput: 0: 5000.2. Samples: 712044692. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:44,010][25689] Avg episode reward: [(0, '-3.169')] [2022-07-10 11:01:45,181][26022] Updated weights on worker 0-0, policy_version 695366 (0.00087) [2022-07-10 11:01:46,881][26022] Updated weights on worker 0-0, policy_version 695376 (0.00091) [2022-07-10 11:01:48,812][26022] Updated weights on worker 0-0, policy_version 695386 (0.00085) [2022-07-10 11:01:49,055][25689] Fps is (10 sec: 5503.5, 60 sec: 5570.2, 300 sec: 5546.4). Total num frames: 712076288. Throughput: 0: 5834.3. Samples: 712078140. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:49,059][25689] Avg episode reward: [(0, '-4.603')] [2022-07-10 11:01:50,589][26022] Updated weights on worker 0-0, policy_version 695396 (0.00085) [2022-07-10 11:01:52,530][26022] Updated weights on worker 0-0, policy_version 695406 (0.00089) [2022-07-10 11:01:54,080][25689] Fps is (10 sec: 5694.3, 60 sec: 5568.4, 300 sec: 5550.2). Total num frames: 712104960. Throughput: 0: 5852.2. Samples: 712111884. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:54,080][25689] Avg episode reward: [(0, '-4.309')] [2022-07-10 11:01:54,088][26022] Updated weights on worker 0-0, policy_version 695416 (0.00083) [2022-07-10 11:01:56,038][26022] Updated weights on worker 0-0, policy_version 695426 (0.00091) [2022-07-10 11:01:57,859][26022] Updated weights on worker 0-0, policy_version 695436 (0.00103) [2022-07-10 11:01:59,099][25689] Fps is (10 sec: 5607.3, 60 sec: 5533.7, 300 sec: 5543.5). Total num frames: 712132608. Throughput: 0: 5016.8. Samples: 712128748. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:01:59,099][25689] Avg episode reward: [(0, '-5.486')] [2022-07-10 11:01:59,774][26022] Updated weights on worker 0-0, policy_version 695446 (0.00083) [2022-07-10 11:02:01,620][26022] Updated weights on worker 0-0, policy_version 695456 (0.00087) [2022-07-10 11:02:03,793][26022] Updated weights on worker 0-0, policy_version 695466 (0.00091) [2022-07-10 11:02:04,110][25689] Fps is (10 sec: 5308.9, 60 sec: 5550.8, 300 sec: 5541.4). Total num frames: 712158208. Throughput: 0: 5752.7. Samples: 712160324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:04,110][25689] Avg episode reward: [(0, '-4.415')] [2022-07-10 11:02:05,470][26022] Updated weights on worker 0-0, policy_version 695476 (0.00093) [2022-07-10 11:02:07,427][26022] Updated weights on worker 0-0, policy_version 695486 (0.00087) [2022-07-10 11:02:09,126][26022] Updated weights on worker 0-0, policy_version 695496 (0.00080) [2022-07-10 11:02:09,160][25689] Fps is (10 sec: 5495.7, 60 sec: 5568.5, 300 sec: 5548.0). Total num frames: 712187904. Throughput: 0: 5770.8. Samples: 712194166. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:09,161][25689] Avg episode reward: [(0, '-3.769')] [2022-07-10 11:02:11,046][26022] Updated weights on worker 0-0, policy_version 695506 (0.00085) [2022-07-10 11:02:12,672][26022] Updated weights on worker 0-0, policy_version 695516 (0.00084) [2022-07-10 11:02:14,178][25689] Fps is (10 sec: 5593.9, 60 sec: 5558.3, 300 sec: 5540.9). Total num frames: 712214528. Throughput: 0: 4927.8. Samples: 712210924. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:14,179][25689] Avg episode reward: [(0, '-4.087')] [2022-07-10 11:02:14,695][26022] Updated weights on worker 0-0, policy_version 695526 (0.00096) [2022-07-10 11:02:16,390][26022] Updated weights on worker 0-0, policy_version 695536 (0.00092) [2022-07-10 11:02:18,181][26022] Updated weights on worker 0-0, policy_version 695546 (0.00081) [2022-07-10 11:02:19,195][25689] Fps is (10 sec: 5510.6, 60 sec: 5564.1, 300 sec: 5548.7). Total num frames: 712243200. Throughput: 0: 5767.2. Samples: 712244646. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:19,196][25689] Avg episode reward: [(0, '-4.323')] [2022-07-10 11:02:20,077][26022] Updated weights on worker 0-0, policy_version 695556 (0.00094) [2022-07-10 11:02:22,049][26022] Updated weights on worker 0-0, policy_version 695566 (0.00086) [2022-07-10 11:02:23,791][26022] Updated weights on worker 0-0, policy_version 695576 (0.00084) [2022-07-10 11:02:24,199][25689] Fps is (10 sec: 5824.4, 60 sec: 5564.1, 300 sec: 5550.3). Total num frames: 712272896. Throughput: 0: 5881.2. Samples: 712278474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:24,200][25689] Avg episode reward: [(0, '-2.041')] [2022-07-10 11:02:25,594][26022] Updated weights on worker 0-0, policy_version 695586 (0.00088) [2022-07-10 11:02:26,035][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:02:26,062][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000695588_712282112.pth [2022-07-10 11:02:26,063][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000693636_710283264.pth [2022-07-10 11:02:27,321][26022] Updated weights on worker 0-0, policy_version 695596 (0.00085) [2022-07-10 11:02:29,248][25689] Fps is (10 sec: 5602.1, 60 sec: 5565.0, 300 sec: 5546.6). Total num frames: 712299520. Throughput: 0: 5033.9. Samples: 712295286. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:29,249][25689] Avg episode reward: [(0, '-1.139')] [2022-07-10 11:02:29,313][26022] Updated weights on worker 0-0, policy_version 695606 (0.00061) [2022-07-10 11:02:31,055][26022] Updated weights on worker 0-0, policy_version 695616 (0.00104) [2022-07-10 11:02:32,903][26022] Updated weights on worker 0-0, policy_version 695626 (0.00096) [2022-07-10 11:02:34,262][25689] Fps is (10 sec: 5495.0, 60 sec: 5572.7, 300 sec: 5550.4). Total num frames: 712328192. Throughput: 0: 5875.6. Samples: 712328932. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:34,264][25689] Avg episode reward: [(0, '-2.415')] [2022-07-10 11:02:34,867][26022] Updated weights on worker 0-0, policy_version 695636 (0.00085) [2022-07-10 11:02:36,534][26022] Updated weights on worker 0-0, policy_version 695646 (0.00091) [2022-07-10 11:02:38,439][26022] Updated weights on worker 0-0, policy_version 695656 (0.00085) [2022-07-10 11:02:39,268][25689] Fps is (10 sec: 5825.3, 60 sec: 5590.6, 300 sec: 5554.1). Total num frames: 712357888. Throughput: 0: 5885.6. Samples: 712362790. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:39,269][25689] Avg episode reward: [(0, '-2.621')] [2022-07-10 11:02:40,392][26022] Updated weights on worker 0-0, policy_version 695666 (0.00092) [2022-07-10 11:02:41,938][26022] Updated weights on worker 0-0, policy_version 695676 (0.00085) [2022-07-10 11:02:43,817][26022] Updated weights on worker 0-0, policy_version 695686 (0.00094) [2022-07-10 11:02:44,270][25689] Fps is (10 sec: 5627.5, 60 sec: 5590.6, 300 sec: 5548.5). Total num frames: 712384512. Throughput: 0: 5054.9. Samples: 712379930. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:44,272][25689] Avg episode reward: [(0, '-3.774')] [2022-07-10 11:02:45,704][26022] Updated weights on worker 0-0, policy_version 695696 (0.00090) [2022-07-10 11:02:47,522][26022] Updated weights on worker 0-0, policy_version 695706 (0.00085) [2022-07-10 11:02:49,322][25689] Fps is (10 sec: 5398.2, 60 sec: 5573.0, 300 sec: 5549.1). Total num frames: 712412160. Throughput: 0: 5873.4. Samples: 712413186. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:49,322][25689] Avg episode reward: [(0, '-4.758')] [2022-07-10 11:02:49,411][26022] Updated weights on worker 0-0, policy_version 695716 (0.00090) [2022-07-10 11:02:51,087][26022] Updated weights on worker 0-0, policy_version 695726 (0.00095) [2022-07-10 11:02:52,959][26022] Updated weights on worker 0-0, policy_version 695736 (0.00095) [2022-07-10 11:02:54,340][25689] Fps is (10 sec: 5491.3, 60 sec: 5556.7, 300 sec: 5545.6). Total num frames: 712439808. Throughput: 0: 5876.8. Samples: 712446924. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:54,340][25689] Avg episode reward: [(0, '-6.532')] [2022-07-10 11:02:54,838][26022] Updated weights on worker 0-0, policy_version 695746 (0.00112) [2022-07-10 11:02:56,650][26022] Updated weights on worker 0-0, policy_version 695756 (0.00085) [2022-07-10 11:02:58,685][26022] Updated weights on worker 0-0, policy_version 695766 (0.00089) [2022-07-10 11:02:59,359][25689] Fps is (10 sec: 5713.2, 60 sec: 5590.7, 300 sec: 5559.2). Total num frames: 712469504. Throughput: 0: 5021.3. Samples: 712463672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:02:59,361][25689] Avg episode reward: [(0, '-5.426')] [2022-07-10 11:03:00,336][26022] Updated weights on worker 0-0, policy_version 695776 (0.00089) [2022-07-10 11:03:02,626][26022] Updated weights on worker 0-0, policy_version 695786 (0.00091) [2022-07-10 11:03:04,380][25689] Fps is (10 sec: 5303.4, 60 sec: 5555.8, 300 sec: 5543.3). Total num frames: 712493056. Throughput: 0: 5733.9. Samples: 712495238. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:04,382][25689] Avg episode reward: [(0, '-5.727')] [2022-07-10 11:03:04,547][26022] Updated weights on worker 0-0, policy_version 695796 (0.00083) [2022-07-10 11:03:06,054][26022] Updated weights on worker 0-0, policy_version 695806 (0.00085) [2022-07-10 11:03:08,172][26022] Updated weights on worker 0-0, policy_version 695816 (0.00086) [2022-07-10 11:03:09,463][25689] Fps is (10 sec: 5270.0, 60 sec: 5552.8, 300 sec: 5553.5). Total num frames: 712522752. Throughput: 0: 5739.4. Samples: 712528784. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:09,463][25689] Avg episode reward: [(0, '-7.385')] [2022-07-10 11:03:09,851][26022] Updated weights on worker 0-0, policy_version 695826 (0.00095) [2022-07-10 11:03:11,692][26022] Updated weights on worker 0-0, policy_version 695836 (0.00092) [2022-07-10 11:03:13,740][26022] Updated weights on worker 0-0, policy_version 695846 (0.00088) [2022-07-10 11:03:14,491][25689] Fps is (10 sec: 5773.0, 60 sec: 5585.8, 300 sec: 5553.5). Total num frames: 712551424. Throughput: 0: 4887.5. Samples: 712545412. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:14,491][25689] Avg episode reward: [(0, '-4.589')] [2022-07-10 11:03:15,474][26022] Updated weights on worker 0-0, policy_version 695856 (0.00090) [2022-07-10 11:03:17,279][26022] Updated weights on worker 0-0, policy_version 695866 (0.01289) [2022-07-10 11:03:19,280][26022] Updated weights on worker 0-0, policy_version 695876 (0.00092) [2022-07-10 11:03:19,591][25689] Fps is (10 sec: 5459.8, 60 sec: 5544.2, 300 sec: 5542.2). Total num frames: 712578048. Throughput: 0: 5685.2. Samples: 712578694. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:19,591][25689] Avg episode reward: [(0, '-4.503')] [2022-07-10 11:03:20,903][26022] Updated weights on worker 0-0, policy_version 695886 (0.00080) [2022-07-10 11:03:22,785][26022] Updated weights on worker 0-0, policy_version 695896 (0.00083) [2022-07-10 11:03:24,534][26022] Updated weights on worker 0-0, policy_version 695906 (0.00084) [2022-07-10 11:03:24,631][25689] Fps is (10 sec: 5554.2, 60 sec: 5540.9, 300 sec: 5553.0). Total num frames: 712607744. Throughput: 0: 5804.6. Samples: 712612784. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:24,632][25689] Avg episode reward: [(0, '-5.677')] [2022-07-10 11:03:26,406][26022] Updated weights on worker 0-0, policy_version 695916 (0.00086) [2022-07-10 11:03:28,250][26022] Updated weights on worker 0-0, policy_version 695926 (0.00088) [2022-07-10 11:03:29,755][25689] Fps is (10 sec: 5742.4, 60 sec: 5567.9, 300 sec: 5548.4). Total num frames: 712636416. Throughput: 0: 4960.4. Samples: 712629442. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:29,755][25689] Avg episode reward: [(0, '-5.343')] [2022-07-10 11:03:30,118][26022] Updated weights on worker 0-0, policy_version 695936 (0.00086) [2022-07-10 11:03:31,883][26022] Updated weights on worker 0-0, policy_version 695946 (0.00624) [2022-07-10 11:03:33,781][26022] Updated weights on worker 0-0, policy_version 695956 (0.00086) [2022-07-10 11:03:34,786][25689] Fps is (10 sec: 5545.9, 60 sec: 5549.4, 300 sec: 5552.4). Total num frames: 712664064. Throughput: 0: 5788.4. Samples: 712662890. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:34,787][25689] Avg episode reward: [(0, '-4.304')] [2022-07-10 11:03:35,659][26022] Updated weights on worker 0-0, policy_version 695966 (0.00087) [2022-07-10 11:03:37,428][26022] Updated weights on worker 0-0, policy_version 695976 (0.00086) [2022-07-10 11:03:39,317][26022] Updated weights on worker 0-0, policy_version 695986 (0.00072) [2022-07-10 11:03:39,808][25689] Fps is (10 sec: 5500.3, 60 sec: 5514.1, 300 sec: 5545.3). Total num frames: 712691712. Throughput: 0: 5834.1. Samples: 712696646. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:39,809][25689] Avg episode reward: [(0, '-4.213')] [2022-07-10 11:03:40,948][26022] Updated weights on worker 0-0, policy_version 695996 (0.00094) [2022-07-10 11:03:43,040][26022] Updated weights on worker 0-0, policy_version 696006 (0.00091) [2022-07-10 11:03:44,662][26022] Updated weights on worker 0-0, policy_version 696016 (0.00093) [2022-07-10 11:03:44,827][25689] Fps is (10 sec: 5609.1, 60 sec: 5546.4, 300 sec: 5549.2). Total num frames: 712720384. Throughput: 0: 4983.5. Samples: 712713430. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:44,827][25689] Avg episode reward: [(0, '-3.657')] [2022-07-10 11:03:46,602][26022] Updated weights on worker 0-0, policy_version 696026 (0.00086) [2022-07-10 11:03:48,559][26022] Updated weights on worker 0-0, policy_version 696036 (0.00056) [2022-07-10 11:03:49,909][25689] Fps is (10 sec: 5677.3, 60 sec: 5560.5, 300 sec: 5551.6). Total num frames: 712749056. Throughput: 0: 5813.9. Samples: 712746616. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:49,909][25689] Avg episode reward: [(0, '-4.135')] [2022-07-10 11:03:50,302][26022] Updated weights on worker 0-0, policy_version 696046 (0.00090) [2022-07-10 11:03:52,198][26022] Updated weights on worker 0-0, policy_version 696056 (0.00083) [2022-07-10 11:03:54,122][26022] Updated weights on worker 0-0, policy_version 696066 (0.00082) [2022-07-10 11:03:54,961][25689] Fps is (10 sec: 5658.3, 60 sec: 5574.3, 300 sec: 5551.0). Total num frames: 712777728. Throughput: 0: 5800.0. Samples: 712779906. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:03:54,962][25689] Avg episode reward: [(0, '-2.120')] [2022-07-10 11:03:55,833][26022] Updated weights on worker 0-0, policy_version 696076 (0.00095) [2022-07-10 11:03:57,704][26022] Updated weights on worker 0-0, policy_version 696086 (0.00087) [2022-07-10 11:03:59,527][26022] Updated weights on worker 0-0, policy_version 696096 (0.00094) [2022-07-10 11:04:00,009][25689] Fps is (10 sec: 5474.6, 60 sec: 5521.0, 300 sec: 5553.9). Total num frames: 712804352. Throughput: 0: 5775.3. Samples: 712813312. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:00,010][25689] Avg episode reward: [(0, '-2.561')] [2022-07-10 11:04:01,519][26022] Updated weights on worker 0-0, policy_version 696106 (0.00087) [2022-07-10 11:04:03,505][26022] Updated weights on worker 0-0, policy_version 696116 (0.00085) [2022-07-10 11:04:05,032][25689] Fps is (10 sec: 5185.8, 60 sec: 5554.6, 300 sec: 5547.3). Total num frames: 712829952. Throughput: 0: 5666.8. Samples: 712827928. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:05,033][25689] Avg episode reward: [(0, '-2.123')] [2022-07-10 11:04:05,468][26022] Updated weights on worker 0-0, policy_version 696126 (0.00082) [2022-07-10 11:04:07,163][26022] Updated weights on worker 0-0, policy_version 696136 (0.00097) [2022-07-10 11:04:09,200][26022] Updated weights on worker 0-0, policy_version 696146 (0.00090) [2022-07-10 11:04:10,120][25689] Fps is (10 sec: 5469.0, 60 sec: 5554.1, 300 sec: 5552.6). Total num frames: 712859648. Throughput: 0: 5692.8. Samples: 712861674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:10,121][25689] Avg episode reward: [(0, '-1.311')] [2022-07-10 11:04:10,826][26022] Updated weights on worker 0-0, policy_version 696156 (0.00087) [2022-07-10 11:04:12,774][26022] Updated weights on worker 0-0, policy_version 696166 (0.00088) [2022-07-10 11:04:14,661][26022] Updated weights on worker 0-0, policy_version 696176 (0.00090) [2022-07-10 11:04:15,134][25689] Fps is (10 sec: 5575.3, 60 sec: 5521.6, 300 sec: 5546.0). Total num frames: 712886272. Throughput: 0: 5730.9. Samples: 712895512. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:15,134][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 11:04:16,472][26022] Updated weights on worker 0-0, policy_version 696186 (0.00086) [2022-07-10 11:04:18,468][26022] Updated weights on worker 0-0, policy_version 696196 (0.00621) [2022-07-10 11:04:20,151][25689] Fps is (10 sec: 5410.7, 60 sec: 5546.1, 300 sec: 5549.6). Total num frames: 712913920. Throughput: 0: 4903.2. Samples: 712912070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:20,152][25689] Avg episode reward: [(0, '-1.631')] [2022-07-10 11:04:20,264][26022] Updated weights on worker 0-0, policy_version 696206 (0.00089) [2022-07-10 11:04:21,849][26022] Updated weights on worker 0-0, policy_version 696216 (0.00084) [2022-07-10 11:04:23,767][26022] Updated weights on worker 0-0, policy_version 696226 (0.00091) [2022-07-10 11:04:25,180][25689] Fps is (10 sec: 5707.8, 60 sec: 5547.1, 300 sec: 5551.3). Total num frames: 712943616. Throughput: 0: 5846.0. Samples: 712945714. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:25,182][25689] Avg episode reward: [(0, '-1.667')] [2022-07-10 11:04:25,483][26022] Updated weights on worker 0-0, policy_version 696236 (0.00082) [2022-07-10 11:04:26,133][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:04:26,153][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000696239_712948736.pth [2022-07-10 11:04:26,154][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000694287_710949888.pth [2022-07-10 11:04:27,527][26022] Updated weights on worker 0-0, policy_version 696246 (0.00088) [2022-07-10 11:04:29,398][26022] Updated weights on worker 0-0, policy_version 696256 (0.00091) [2022-07-10 11:04:30,307][25689] Fps is (10 sec: 5545.0, 60 sec: 5513.0, 300 sec: 5545.7). Total num frames: 712970240. Throughput: 0: 5810.3. Samples: 712978968. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:30,308][25689] Avg episode reward: [(0, '-1.140')] [2022-07-10 11:04:31,087][26022] Updated weights on worker 0-0, policy_version 696266 (0.00088) [2022-07-10 11:04:33,031][26022] Updated weights on worker 0-0, policy_version 696276 (0.00096) [2022-07-10 11:04:34,897][26022] Updated weights on worker 0-0, policy_version 696286 (0.00091) [2022-07-10 11:04:35,343][25689] Fps is (10 sec: 5440.8, 60 sec: 5529.4, 300 sec: 5550.3). Total num frames: 712998912. Throughput: 0: 4950.3. Samples: 712995556. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 11:04:35,345][25689] Avg episode reward: [(0, '-0.984')] [2022-07-10 11:04:36,606][26022] Updated weights on worker 0-0, policy_version 696296 (0.00100) [2022-07-10 11:04:38,449][26022] Updated weights on worker 0-0, policy_version 696306 (0.00078) [2022-07-10 11:04:40,242][26022] Updated weights on worker 0-0, policy_version 696316 (0.00113) [2022-07-10 11:04:40,372][25689] Fps is (10 sec: 5799.1, 60 sec: 5562.6, 300 sec: 5553.7). Total num frames: 713028608. Throughput: 0: 5793.5. Samples: 713029226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:04:40,373][25689] Avg episode reward: [(0, '-1.378')] [2022-07-10 11:04:42,278][26022] Updated weights on worker 0-0, policy_version 696326 (0.00087) [2022-07-10 11:04:43,776][26022] Updated weights on worker 0-0, policy_version 696336 (0.00093) [2022-07-10 11:04:45,384][25689] Fps is (10 sec: 5609.3, 60 sec: 5529.5, 300 sec: 5548.2). Total num frames: 713055232. Throughput: 0: 5809.9. Samples: 713063096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:04:45,384][25689] Avg episode reward: [(0, '-2.114')] [2022-07-10 11:04:45,861][26022] Updated weights on worker 0-0, policy_version 696346 (0.00092) [2022-07-10 11:04:47,466][26022] Updated weights on worker 0-0, policy_version 696356 (0.00098) [2022-07-10 11:04:49,704][26022] Updated weights on worker 0-0, policy_version 696366 (0.00087) [2022-07-10 11:04:50,510][25689] Fps is (10 sec: 5555.8, 60 sec: 5542.4, 300 sec: 5558.1). Total num frames: 713084928. Throughput: 0: 4972.1. Samples: 713079416. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:04:50,510][25689] Avg episode reward: [(0, '-2.601')] [2022-07-10 11:04:51,207][26022] Updated weights on worker 0-0, policy_version 696376 (0.00092) [2022-07-10 11:04:53,272][26022] Updated weights on worker 0-0, policy_version 696386 (0.00083) [2022-07-10 11:04:55,036][26022] Updated weights on worker 0-0, policy_version 696396 (0.00092) [2022-07-10 11:04:55,534][25689] Fps is (10 sec: 5447.8, 60 sec: 5494.2, 300 sec: 5544.9). Total num frames: 713110528. Throughput: 0: 5815.7. Samples: 713112978. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:04:55,536][25689] Avg episode reward: [(0, '-3.489')] [2022-07-10 11:04:56,767][26022] Updated weights on worker 0-0, policy_version 696406 (0.00086) [2022-07-10 11:04:58,758][26022] Updated weights on worker 0-0, policy_version 696416 (0.00092) [2022-07-10 11:05:00,480][26022] Updated weights on worker 0-0, policy_version 696426 (0.00084) [2022-07-10 11:05:00,579][25689] Fps is (10 sec: 5491.6, 60 sec: 5545.2, 300 sec: 5561.6). Total num frames: 713140224. Throughput: 0: 5805.9. Samples: 713146542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:00,579][25689] Avg episode reward: [(0, '-4.117')] [2022-07-10 11:05:02,888][26022] Updated weights on worker 0-0, policy_version 696436 (0.00088) [2022-07-10 11:05:04,544][26022] Updated weights on worker 0-0, policy_version 696446 (0.00091) [2022-07-10 11:05:05,602][25689] Fps is (10 sec: 5492.5, 60 sec: 5545.2, 300 sec: 5542.6). Total num frames: 713165824. Throughput: 0: 4854.0. Samples: 713161232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:05,602][25689] Avg episode reward: [(0, '-4.416')] [2022-07-10 11:05:06,473][26022] Updated weights on worker 0-0, policy_version 696456 (0.00090) [2022-07-10 11:05:08,448][26022] Updated weights on worker 0-0, policy_version 696466 (0.00084) [2022-07-10 11:05:09,983][26022] Updated weights on worker 0-0, policy_version 696476 (0.00093) [2022-07-10 11:05:10,678][25689] Fps is (10 sec: 5373.9, 60 sec: 5529.4, 300 sec: 5552.1). Total num frames: 713194496. Throughput: 0: 5719.3. Samples: 713194766. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:10,679][25689] Avg episode reward: [(0, '-3.636')] [2022-07-10 11:05:11,957][26022] Updated weights on worker 0-0, policy_version 696486 (0.00081) [2022-07-10 11:05:13,675][26022] Updated weights on worker 0-0, policy_version 696496 (0.00090) [2022-07-10 11:05:15,681][25689] Fps is (10 sec: 5587.8, 60 sec: 5547.3, 300 sec: 5545.7). Total num frames: 713222144. Throughput: 0: 5737.3. Samples: 713228566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:15,681][25689] Avg episode reward: [(0, '-3.609')] [2022-07-10 11:05:15,687][26022] Updated weights on worker 0-0, policy_version 696506 (0.00079) [2022-07-10 11:05:17,355][26022] Updated weights on worker 0-0, policy_version 696516 (0.00083) [2022-07-10 11:05:19,150][26022] Updated weights on worker 0-0, policy_version 696526 (0.00089) [2022-07-10 11:05:20,683][25689] Fps is (10 sec: 5629.4, 60 sec: 5565.6, 300 sec: 5549.9). Total num frames: 713250816. Throughput: 0: 4918.8. Samples: 713245428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:20,683][25689] Avg episode reward: [(0, '-2.515')] [2022-07-10 11:05:21,165][26022] Updated weights on worker 0-0, policy_version 696536 (0.00085) [2022-07-10 11:05:22,943][26022] Updated weights on worker 0-0, policy_version 696546 (0.00083) [2022-07-10 11:05:24,554][26022] Updated weights on worker 0-0, policy_version 696556 (0.00093) [2022-07-10 11:05:25,727][25689] Fps is (10 sec: 5606.1, 60 sec: 5530.4, 300 sec: 5553.5). Total num frames: 713278464. Throughput: 0: 5864.5. Samples: 713279256. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:25,727][25689] Avg episode reward: [(0, '-2.676')] [2022-07-10 11:05:26,677][26022] Updated weights on worker 0-0, policy_version 696566 (0.00090) [2022-07-10 11:05:28,375][26022] Updated weights on worker 0-0, policy_version 696576 (0.00088) [2022-07-10 11:05:30,274][26022] Updated weights on worker 0-0, policy_version 696586 (0.00091) [2022-07-10 11:05:30,781][25689] Fps is (10 sec: 5577.2, 60 sec: 5571.0, 300 sec: 5552.7). Total num frames: 713307136. Throughput: 0: 5861.4. Samples: 713312598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:30,781][25689] Avg episode reward: [(0, '-2.729')] [2022-07-10 11:05:32,047][26022] Updated weights on worker 0-0, policy_version 696596 (0.00087) [2022-07-10 11:05:33,986][26022] Updated weights on worker 0-0, policy_version 696606 (0.00087) [2022-07-10 11:05:35,666][26022] Updated weights on worker 0-0, policy_version 696616 (0.00078) [2022-07-10 11:05:35,822][25689] Fps is (10 sec: 5579.0, 60 sec: 5553.6, 300 sec: 5548.7). Total num frames: 713334784. Throughput: 0: 4995.6. Samples: 713329184. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:35,822][25689] Avg episode reward: [(0, '-3.776')] [2022-07-10 11:05:37,619][26022] Updated weights on worker 0-0, policy_version 696626 (0.00080) [2022-07-10 11:05:39,303][26022] Updated weights on worker 0-0, policy_version 696636 (0.00095) [2022-07-10 11:05:40,852][25689] Fps is (10 sec: 5388.8, 60 sec: 5502.7, 300 sec: 5551.7). Total num frames: 713361408. Throughput: 0: 5817.0. Samples: 713362754. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:40,853][25689] Avg episode reward: [(0, '-3.560')] [2022-07-10 11:05:41,289][26022] Updated weights on worker 0-0, policy_version 696646 (0.00092) [2022-07-10 11:05:42,974][26022] Updated weights on worker 0-0, policy_version 696656 (0.00093) [2022-07-10 11:05:44,895][26022] Updated weights on worker 0-0, policy_version 696666 (0.00085) [2022-07-10 11:05:45,884][25689] Fps is (10 sec: 5597.0, 60 sec: 5551.6, 300 sec: 5555.5). Total num frames: 713391104. Throughput: 0: 5825.5. Samples: 713396684. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:45,886][25689] Avg episode reward: [(0, '-2.936')] [2022-07-10 11:05:46,582][26022] Updated weights on worker 0-0, policy_version 696676 (0.00087) [2022-07-10 11:05:48,554][26022] Updated weights on worker 0-0, policy_version 696686 (0.00598) [2022-07-10 11:05:50,423][26022] Updated weights on worker 0-0, policy_version 696696 (0.00087) [2022-07-10 11:05:50,972][25689] Fps is (10 sec: 5767.6, 60 sec: 5538.1, 300 sec: 5554.0). Total num frames: 713419776. Throughput: 0: 4987.3. Samples: 713413298. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:50,974][25689] Avg episode reward: [(0, '-3.265')] [2022-07-10 11:05:52,358][26022] Updated weights on worker 0-0, policy_version 696706 (0.00087) [2022-07-10 11:05:54,055][26022] Updated weights on worker 0-0, policy_version 696716 (0.00093) [2022-07-10 11:05:55,995][25689] Fps is (10 sec: 5468.9, 60 sec: 5555.2, 300 sec: 5543.4). Total num frames: 713446400. Throughput: 0: 5812.8. Samples: 713446448. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:05:56,003][25689] Avg episode reward: [(0, '-2.772')] [2022-07-10 11:05:56,030][26022] Updated weights on worker 0-0, policy_version 696726 (0.00086) [2022-07-10 11:05:57,543][26022] Updated weights on worker 0-0, policy_version 696736 (0.00086) [2022-07-10 11:05:59,723][26022] Updated weights on worker 0-0, policy_version 696746 (0.00095) [2022-07-10 11:06:01,003][25689] Fps is (10 sec: 5614.6, 60 sec: 5558.5, 300 sec: 5560.7). Total num frames: 713476096. Throughput: 0: 5835.1. Samples: 713480338. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:01,004][25689] Avg episode reward: [(0, '-2.271')] [2022-07-10 11:06:01,355][26022] Updated weights on worker 0-0, policy_version 696756 (0.00089) [2022-07-10 11:06:03,595][26022] Updated weights on worker 0-0, policy_version 696766 (0.00099) [2022-07-10 11:06:05,429][26022] Updated weights on worker 0-0, policy_version 696776 (0.00090) [2022-07-10 11:06:06,021][25689] Fps is (10 sec: 5515.3, 60 sec: 5559.0, 300 sec: 5551.1). Total num frames: 713501696. Throughput: 0: 4893.3. Samples: 713495220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:06,022][25689] Avg episode reward: [(0, '-2.386')] [2022-07-10 11:06:07,202][26022] Updated weights on worker 0-0, policy_version 696786 (0.00086) [2022-07-10 11:06:09,013][26022] Updated weights on worker 0-0, policy_version 696796 (0.00095) [2022-07-10 11:06:10,970][26022] Updated weights on worker 0-0, policy_version 696806 (0.00096) [2022-07-10 11:06:11,175][25689] Fps is (10 sec: 5335.6, 60 sec: 5551.9, 300 sec: 5553.4). Total num frames: 713530368. Throughput: 0: 5717.2. Samples: 713528802. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:11,176][25689] Avg episode reward: [(0, '-2.686')] [2022-07-10 11:06:12,680][26022] Updated weights on worker 0-0, policy_version 696816 (0.00100) [2022-07-10 11:06:14,687][26022] Updated weights on worker 0-0, policy_version 696826 (0.00084) [2022-07-10 11:06:16,254][25689] Fps is (10 sec: 5604.2, 60 sec: 5561.8, 300 sec: 5553.4). Total num frames: 713559040. Throughput: 0: 5737.0. Samples: 713562670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:16,254][25689] Avg episode reward: [(0, '-2.620')] [2022-07-10 11:06:16,346][26022] Updated weights on worker 0-0, policy_version 696836 (0.00093) [2022-07-10 11:06:18,215][26022] Updated weights on worker 0-0, policy_version 696846 (0.00096) [2022-07-10 11:06:20,106][26022] Updated weights on worker 0-0, policy_version 696856 (0.00090) [2022-07-10 11:06:21,286][25689] Fps is (10 sec: 5570.3, 60 sec: 5542.2, 300 sec: 5546.0). Total num frames: 713586688. Throughput: 0: 5713.0. Samples: 713596212. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:21,286][25689] Avg episode reward: [(0, '-2.663')] [2022-07-10 11:06:21,669][26022] Updated weights on worker 0-0, policy_version 696866 (0.00089) [2022-07-10 11:06:24,039][26022] Updated weights on worker 0-0, policy_version 696876 (0.00087) [2022-07-10 11:06:25,406][26022] Updated weights on worker 0-0, policy_version 696886 (0.00090) [2022-07-10 11:06:26,307][25689] Fps is (10 sec: 5602.3, 60 sec: 5561.2, 300 sec: 5553.6). Total num frames: 713615360. Throughput: 0: 5809.6. Samples: 713613070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:26,307][25689] Avg episode reward: [(0, '-4.283')] [2022-07-10 11:06:26,358][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:06:26,377][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000696891_713616384.pth [2022-07-10 11:06:26,378][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000694936_711614464.pth [2022-07-10 11:06:27,475][26022] Updated weights on worker 0-0, policy_version 696896 (0.00093) [2022-07-10 11:06:29,075][26022] Updated weights on worker 0-0, policy_version 696906 (0.00090) [2022-07-10 11:06:30,966][26022] Updated weights on worker 0-0, policy_version 696916 (0.00090) [2022-07-10 11:06:31,383][25689] Fps is (10 sec: 5577.8, 60 sec: 5542.3, 300 sec: 5550.5). Total num frames: 713643008. Throughput: 0: 5823.5. Samples: 713646484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:31,384][25689] Avg episode reward: [(0, '-3.731')] [2022-07-10 11:06:33,002][26022] Updated weights on worker 0-0, policy_version 696926 (0.00092) [2022-07-10 11:06:34,826][26022] Updated weights on worker 0-0, policy_version 696936 (0.00082) [2022-07-10 11:06:36,394][25689] Fps is (10 sec: 5481.7, 60 sec: 5545.0, 300 sec: 5547.2). Total num frames: 713670656. Throughput: 0: 5826.3. Samples: 713680014. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:36,394][25689] Avg episode reward: [(0, '-4.396')] [2022-07-10 11:06:36,635][26022] Updated weights on worker 0-0, policy_version 696946 (0.00089) [2022-07-10 11:06:38,321][26022] Updated weights on worker 0-0, policy_version 696956 (0.00089) [2022-07-10 11:06:40,113][26022] Updated weights on worker 0-0, policy_version 696966 (0.00083) [2022-07-10 11:06:41,425][25689] Fps is (10 sec: 5608.4, 60 sec: 5578.7, 300 sec: 5553.5). Total num frames: 713699328. Throughput: 0: 4998.0. Samples: 713696868. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:41,427][25689] Avg episode reward: [(0, '-4.097')] [2022-07-10 11:06:42,049][26022] Updated weights on worker 0-0, policy_version 696976 (0.00054) [2022-07-10 11:06:43,875][26022] Updated weights on worker 0-0, policy_version 696986 (0.00101) [2022-07-10 11:06:45,639][26022] Updated weights on worker 0-0, policy_version 696996 (0.00083) [2022-07-10 11:06:46,436][25689] Fps is (10 sec: 5608.3, 60 sec: 5546.9, 300 sec: 5550.7). Total num frames: 713726976. Throughput: 0: 5837.6. Samples: 713730578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:46,437][25689] Avg episode reward: [(0, '-4.272')] [2022-07-10 11:06:47,572][26022] Updated weights on worker 0-0, policy_version 697006 (0.00092) [2022-07-10 11:06:49,343][26022] Updated weights on worker 0-0, policy_version 697016 (0.00105) [2022-07-10 11:06:51,265][26022] Updated weights on worker 0-0, policy_version 697026 (0.00093) [2022-07-10 11:06:51,481][25689] Fps is (10 sec: 5600.7, 60 sec: 5550.8, 300 sec: 5550.4). Total num frames: 713755648. Throughput: 0: 5837.2. Samples: 713763800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:51,481][25689] Avg episode reward: [(0, '-3.435')] [2022-07-10 11:06:52,986][26022] Updated weights on worker 0-0, policy_version 697036 (0.00099) [2022-07-10 11:06:54,851][26022] Updated weights on worker 0-0, policy_version 697046 (0.00085) [2022-07-10 11:06:56,570][25689] Fps is (10 sec: 5658.9, 60 sec: 5578.6, 300 sec: 5552.5). Total num frames: 713784320. Throughput: 0: 4981.0. Samples: 713780508. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:06:56,579][25689] Avg episode reward: [(0, '-1.998')] [2022-07-10 11:06:56,669][26022] Updated weights on worker 0-0, policy_version 697056 (0.00090) [2022-07-10 11:06:58,802][26022] Updated weights on worker 0-0, policy_version 697066 (0.00096) [2022-07-10 11:07:00,439][26022] Updated weights on worker 0-0, policy_version 697076 (0.00095) [2022-07-10 11:07:01,595][25689] Fps is (10 sec: 5568.7, 60 sec: 5543.2, 300 sec: 5559.1). Total num frames: 713811968. Throughput: 0: 5808.6. Samples: 713814024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:01,595][25689] Avg episode reward: [(0, '-1.286')] [2022-07-10 11:07:02,837][26022] Updated weights on worker 0-0, policy_version 697086 (0.00095) [2022-07-10 11:07:04,459][26022] Updated weights on worker 0-0, policy_version 697096 (0.00424) [2022-07-10 11:07:06,349][26022] Updated weights on worker 0-0, policy_version 697106 (0.00088) [2022-07-10 11:07:06,606][25689] Fps is (10 sec: 5305.4, 60 sec: 5543.8, 300 sec: 5546.1). Total num frames: 713837568. Throughput: 0: 5687.9. Samples: 713845302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:06,606][25689] Avg episode reward: [(0, '-0.956')] [2022-07-10 11:07:08,123][26022] Updated weights on worker 0-0, policy_version 697116 (0.00089) [2022-07-10 11:07:09,988][26022] Updated weights on worker 0-0, policy_version 697126 (0.00090) [2022-07-10 11:07:11,687][25689] Fps is (10 sec: 5377.6, 60 sec: 5550.5, 300 sec: 5551.8). Total num frames: 713866240. Throughput: 0: 4849.4. Samples: 713861786. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:11,687][25689] Avg episode reward: [(0, '-0.327')] [2022-07-10 11:07:11,938][26022] Updated weights on worker 0-0, policy_version 697136 (0.00091) [2022-07-10 11:07:13,752][26022] Updated weights on worker 0-0, policy_version 697146 (0.00108) [2022-07-10 11:07:15,870][26022] Updated weights on worker 0-0, policy_version 697156 (0.00092) [2022-07-10 11:07:16,715][25689] Fps is (10 sec: 5571.0, 60 sec: 5538.2, 300 sec: 5548.1). Total num frames: 713893888. Throughput: 0: 5697.7. Samples: 713895294. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:16,716][25689] Avg episode reward: [(0, '-0.633')] [2022-07-10 11:07:17,249][26022] Updated weights on worker 0-0, policy_version 697166 (0.00086) [2022-07-10 11:07:19,345][26022] Updated weights on worker 0-0, policy_version 697176 (0.00081) [2022-07-10 11:07:21,007][26022] Updated weights on worker 0-0, policy_version 697186 (0.00079) [2022-07-10 11:07:21,754][25689] Fps is (10 sec: 5391.1, 60 sec: 5520.7, 300 sec: 5537.2). Total num frames: 713920512. Throughput: 0: 5676.9. Samples: 713928466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:21,754][25689] Avg episode reward: [(0, '-0.639')] [2022-07-10 11:07:22,965][26022] Updated weights on worker 0-0, policy_version 697196 (0.00082) [2022-07-10 11:07:24,865][26022] Updated weights on worker 0-0, policy_version 697206 (0.00087) [2022-07-10 11:07:26,499][26022] Updated weights on worker 0-0, policy_version 697216 (0.00087) [2022-07-10 11:07:26,836][25689] Fps is (10 sec: 5565.1, 60 sec: 5532.0, 300 sec: 5546.9). Total num frames: 713950208. Throughput: 0: 4938.7. Samples: 713945210. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:26,836][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 11:07:28,628][26022] Updated weights on worker 0-0, policy_version 697226 (0.00094) [2022-07-10 11:07:30,278][26022] Updated weights on worker 0-0, policy_version 697236 (0.00090) [2022-07-10 11:07:31,896][25689] Fps is (10 sec: 5654.2, 60 sec: 5533.5, 300 sec: 5542.6). Total num frames: 713977856. Throughput: 0: 5780.1. Samples: 713978594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:31,896][25689] Avg episode reward: [(0, '-2.084')] [2022-07-10 11:07:32,123][26022] Updated weights on worker 0-0, policy_version 697246 (0.00088) [2022-07-10 11:07:33,906][26022] Updated weights on worker 0-0, policy_version 697256 (0.00095) [2022-07-10 11:07:36,037][26022] Updated weights on worker 0-0, policy_version 697266 (0.00085) [2022-07-10 11:07:36,913][25689] Fps is (10 sec: 5385.8, 60 sec: 5516.1, 300 sec: 5532.1). Total num frames: 714004480. Throughput: 0: 5787.5. Samples: 714012186. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:36,913][25689] Avg episode reward: [(0, '-2.071')] [2022-07-10 11:07:37,488][26022] Updated weights on worker 0-0, policy_version 697276 (0.00089) [2022-07-10 11:07:39,656][26022] Updated weights on worker 0-0, policy_version 697286 (0.00085) [2022-07-10 11:07:41,042][26022] Updated weights on worker 0-0, policy_version 697296 (0.00086) [2022-07-10 11:07:41,996][25689] Fps is (10 sec: 5575.8, 60 sec: 5528.2, 300 sec: 5540.9). Total num frames: 714034176. Throughput: 0: 4982.8. Samples: 714029330. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:41,997][25689] Avg episode reward: [(0, '-2.295')] [2022-07-10 11:07:43,177][26022] Updated weights on worker 0-0, policy_version 697306 (0.00091) [2022-07-10 11:07:44,893][26022] Updated weights on worker 0-0, policy_version 697316 (0.00094) [2022-07-10 11:07:46,754][26022] Updated weights on worker 0-0, policy_version 697326 (0.00090) [2022-07-10 11:07:47,064][25689] Fps is (10 sec: 5749.6, 60 sec: 5539.9, 300 sec: 5544.0). Total num frames: 714062848. Throughput: 0: 5827.1. Samples: 714063084. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:47,065][25689] Avg episode reward: [(0, '-1.971')] [2022-07-10 11:07:48,760][26022] Updated weights on worker 0-0, policy_version 697336 (0.00806) [2022-07-10 11:07:50,536][26022] Updated weights on worker 0-0, policy_version 697346 (0.00084) [2022-07-10 11:07:52,136][25689] Fps is (10 sec: 5655.1, 60 sec: 5537.4, 300 sec: 5546.4). Total num frames: 714091520. Throughput: 0: 5814.3. Samples: 714096280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:52,137][25689] Avg episode reward: [(0, '-2.213')] [2022-07-10 11:07:52,216][26022] Updated weights on worker 0-0, policy_version 697356 (0.00091) [2022-07-10 11:07:54,231][26022] Updated weights on worker 0-0, policy_version 697366 (0.00101) [2022-07-10 11:07:55,774][26022] Updated weights on worker 0-0, policy_version 697376 (0.00092) [2022-07-10 11:07:57,237][25689] Fps is (10 sec: 5637.1, 60 sec: 5536.3, 300 sec: 5541.5). Total num frames: 714120192. Throughput: 0: 4972.0. Samples: 714113238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-10 11:07:57,237][25689] Avg episode reward: [(0, '-3.460')] [2022-07-10 11:07:57,768][26022] Updated weights on worker 0-0, policy_version 697386 (0.00090) [2022-07-10 11:07:59,699][26022] Updated weights on worker 0-0, policy_version 697396 (0.00090) [2022-07-10 11:08:01,656][26022] Updated weights on worker 0-0, policy_version 697406 (0.00096) [2022-07-10 11:08:02,275][25689] Fps is (10 sec: 5353.1, 60 sec: 5501.4, 300 sec: 5548.1). Total num frames: 714145792. Throughput: 0: 5794.9. Samples: 714146842. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:02,277][25689] Avg episode reward: [(0, '-2.166')] [2022-07-10 11:08:03,640][26022] Updated weights on worker 0-0, policy_version 697416 (0.00083) [2022-07-10 11:08:05,316][26022] Updated weights on worker 0-0, policy_version 697426 (0.00082) [2022-07-10 11:08:07,294][25689] Fps is (10 sec: 5396.3, 60 sec: 5551.3, 300 sec: 5545.8). Total num frames: 714174464. Throughput: 0: 5708.2. Samples: 714178560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:07,296][25689] Avg episode reward: [(0, '-2.098')] [2022-07-10 11:08:07,300][26022] Updated weights on worker 0-0, policy_version 697436 (0.00085) [2022-07-10 11:08:09,042][26022] Updated weights on worker 0-0, policy_version 697446 (0.00091) [2022-07-10 11:08:11,046][26022] Updated weights on worker 0-0, policy_version 697456 (0.00082) [2022-07-10 11:08:12,356][25689] Fps is (10 sec: 5586.6, 60 sec: 5536.1, 300 sec: 5541.7). Total num frames: 714202112. Throughput: 0: 4904.3. Samples: 714195444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:12,357][25689] Avg episode reward: [(0, '-2.030')] [2022-07-10 11:08:12,816][26022] Updated weights on worker 0-0, policy_version 697466 (0.00084) [2022-07-10 11:08:14,778][26022] Updated weights on worker 0-0, policy_version 697476 (0.00083) [2022-07-10 11:08:16,316][26022] Updated weights on worker 0-0, policy_version 697486 (0.00092) [2022-07-10 11:08:17,400][25689] Fps is (10 sec: 5573.1, 60 sec: 5551.6, 300 sec: 5549.6). Total num frames: 714230784. Throughput: 0: 5733.6. Samples: 714228844. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:17,400][25689] Avg episode reward: [(0, '-2.713')] [2022-07-10 11:08:18,113][26022] Updated weights on worker 0-0, policy_version 697496 (0.00087) [2022-07-10 11:08:20,150][26022] Updated weights on worker 0-0, policy_version 697506 (0.00090) [2022-07-10 11:08:21,875][26022] Updated weights on worker 0-0, policy_version 697516 (0.00085) [2022-07-10 11:08:22,420][25689] Fps is (10 sec: 5596.2, 60 sec: 5570.1, 300 sec: 5543.1). Total num frames: 714258432. Throughput: 0: 5742.7. Samples: 714262530. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:22,422][25689] Avg episode reward: [(0, '-3.157')] [2022-07-10 11:08:23,898][26022] Updated weights on worker 0-0, policy_version 697526 (0.00089) [2022-07-10 11:08:25,547][26022] Updated weights on worker 0-0, policy_version 697536 (0.00085) [2022-07-10 11:08:26,499][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:08:26,517][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000697540_714280960.pth [2022-07-10 11:08:26,518][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000695588_712282112.pth [2022-07-10 11:08:27,423][25689] Fps is (10 sec: 5517.0, 60 sec: 5543.6, 300 sec: 5542.0). Total num frames: 714286080. Throughput: 0: 5835.5. Samples: 714296020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:27,423][25689] Avg episode reward: [(0, '-2.116')] [2022-07-10 11:08:27,494][26022] Updated weights on worker 0-0, policy_version 697546 (0.00094) [2022-07-10 11:08:29,381][26022] Updated weights on worker 0-0, policy_version 697556 (0.00085) [2022-07-10 11:08:31,239][26022] Updated weights on worker 0-0, policy_version 697566 (0.00084) [2022-07-10 11:08:32,513][25689] Fps is (10 sec: 5478.7, 60 sec: 5540.8, 300 sec: 5540.9). Total num frames: 714313728. Throughput: 0: 5821.3. Samples: 714312784. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:32,514][25689] Avg episode reward: [(0, '-1.737')] [2022-07-10 11:08:33,070][26022] Updated weights on worker 0-0, policy_version 697576 (0.00097) [2022-07-10 11:08:34,861][26022] Updated weights on worker 0-0, policy_version 697586 (0.00090) [2022-07-10 11:08:36,657][26022] Updated weights on worker 0-0, policy_version 697596 (0.00085) [2022-07-10 11:08:37,539][25689] Fps is (10 sec: 5668.6, 60 sec: 5590.7, 300 sec: 5547.7). Total num frames: 714343424. Throughput: 0: 5833.0. Samples: 714346314. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:37,539][25689] Avg episode reward: [(0, '-3.486')] [2022-07-10 11:08:38,672][26022] Updated weights on worker 0-0, policy_version 697606 (0.00080) [2022-07-10 11:08:40,207][26022] Updated weights on worker 0-0, policy_version 697616 (0.00093) [2022-07-10 11:08:42,212][26022] Updated weights on worker 0-0, policy_version 697626 (0.00076) [2022-07-10 11:08:42,544][25689] Fps is (10 sec: 5716.6, 60 sec: 5564.1, 300 sec: 5544.5). Total num frames: 714371072. Throughput: 0: 5840.4. Samples: 714380062. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:42,545][25689] Avg episode reward: [(0, '-3.853')] [2022-07-10 11:08:44,063][26022] Updated weights on worker 0-0, policy_version 697636 (0.00081) [2022-07-10 11:08:45,854][26022] Updated weights on worker 0-0, policy_version 697646 (0.00088) [2022-07-10 11:08:47,542][26022] Updated weights on worker 0-0, policy_version 697656 (0.00087) [2022-07-10 11:08:47,563][25689] Fps is (10 sec: 5618.4, 60 sec: 5568.7, 300 sec: 5545.7). Total num frames: 714399744. Throughput: 0: 5011.9. Samples: 714396962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:47,563][25689] Avg episode reward: [(0, '-4.320')] [2022-07-10 11:08:49,437][26022] Updated weights on worker 0-0, policy_version 697666 (0.00052) [2022-07-10 11:08:51,167][26022] Updated weights on worker 0-0, policy_version 697676 (0.00111) [2022-07-10 11:08:52,612][25689] Fps is (10 sec: 5492.3, 60 sec: 5536.9, 300 sec: 5538.8). Total num frames: 714426368. Throughput: 0: 5845.6. Samples: 714430274. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:52,613][25689] Avg episode reward: [(0, '-3.655')] [2022-07-10 11:08:53,326][26022] Updated weights on worker 0-0, policy_version 697686 (0.00483) [2022-07-10 11:08:54,856][26022] Updated weights on worker 0-0, policy_version 697696 (0.00092) [2022-07-10 11:08:56,852][26022] Updated weights on worker 0-0, policy_version 697706 (0.00088) [2022-07-10 11:08:57,659][25689] Fps is (10 sec: 5578.4, 60 sec: 5558.8, 300 sec: 5549.2). Total num frames: 714456064. Throughput: 0: 5875.4. Samples: 714464528. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:08:57,659][25689] Avg episode reward: [(0, '-3.540')] [2022-07-10 11:08:58,331][26022] Updated weights on worker 0-0, policy_version 697716 (0.00096) [2022-07-10 11:09:00,356][26022] Updated weights on worker 0-0, policy_version 697726 (0.00081) [2022-07-10 11:09:02,466][26022] Updated weights on worker 0-0, policy_version 697736 (0.00092) [2022-07-10 11:09:02,685][25689] Fps is (10 sec: 5591.1, 60 sec: 5576.8, 300 sec: 5552.6). Total num frames: 714482688. Throughput: 0: 5036.9. Samples: 714481510. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:02,686][25689] Avg episode reward: [(0, '-3.310')] [2022-07-10 11:09:04,382][26022] Updated weights on worker 0-0, policy_version 697746 (0.00086) [2022-07-10 11:09:06,159][26022] Updated weights on worker 0-0, policy_version 697756 (0.00085) [2022-07-10 11:09:07,704][25689] Fps is (10 sec: 5403.0, 60 sec: 5559.9, 300 sec: 5547.0). Total num frames: 714510336. Throughput: 0: 5789.3. Samples: 714513564. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:07,704][25689] Avg episode reward: [(0, '-2.632')] [2022-07-10 11:09:07,963][26022] Updated weights on worker 0-0, policy_version 697766 (0.00085) [2022-07-10 11:09:09,678][26022] Updated weights on worker 0-0, policy_version 697776 (0.00087) [2022-07-10 11:09:11,576][26022] Updated weights on worker 0-0, policy_version 697786 (0.00086) [2022-07-10 11:09:12,755][25689] Fps is (10 sec: 5593.2, 60 sec: 5577.9, 300 sec: 5553.2). Total num frames: 714539008. Throughput: 0: 5826.1. Samples: 714547626. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:12,755][25689] Avg episode reward: [(0, '-3.403')] [2022-07-10 11:09:13,319][26022] Updated weights on worker 0-0, policy_version 697796 (0.00081) [2022-07-10 11:09:15,105][26022] Updated weights on worker 0-0, policy_version 697806 (0.00084) [2022-07-10 11:09:16,980][26022] Updated weights on worker 0-0, policy_version 697816 (0.00085) [2022-07-10 11:09:17,758][25689] Fps is (10 sec: 5703.5, 60 sec: 5581.6, 300 sec: 5556.9). Total num frames: 714567680. Throughput: 0: 4989.5. Samples: 714564812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:17,759][25689] Avg episode reward: [(0, '-2.622')] [2022-07-10 11:09:18,618][26022] Updated weights on worker 0-0, policy_version 697826 (0.00087) [2022-07-10 11:09:20,389][26022] Updated weights on worker 0-0, policy_version 697836 (0.00084) [2022-07-10 11:09:22,461][26022] Updated weights on worker 0-0, policy_version 697846 (0.00088) [2022-07-10 11:09:22,783][25689] Fps is (10 sec: 5616.3, 60 sec: 5581.2, 300 sec: 5550.1). Total num frames: 714595328. Throughput: 0: 5832.8. Samples: 714598734. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:22,785][25689] Avg episode reward: [(0, '-2.433')] [2022-07-10 11:09:24,227][26022] Updated weights on worker 0-0, policy_version 697856 (0.00087) [2022-07-10 11:09:26,246][26022] Updated weights on worker 0-0, policy_version 697866 (0.00084) [2022-07-10 11:09:27,791][25689] Fps is (10 sec: 5613.6, 60 sec: 5597.7, 300 sec: 5559.2). Total num frames: 714624000. Throughput: 0: 5912.7. Samples: 714632332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:27,791][25689] Avg episode reward: [(0, '-2.839')] [2022-07-10 11:09:27,879][26022] Updated weights on worker 0-0, policy_version 697876 (0.00080) [2022-07-10 11:09:29,792][26022] Updated weights on worker 0-0, policy_version 697886 (0.00090) [2022-07-10 11:09:31,577][26022] Updated weights on worker 0-0, policy_version 697896 (0.00089) [2022-07-10 11:09:32,884][25689] Fps is (10 sec: 5677.1, 60 sec: 5614.4, 300 sec: 5558.1). Total num frames: 714652672. Throughput: 0: 5052.7. Samples: 714649332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:32,884][25689] Avg episode reward: [(0, '-3.678')] [2022-07-10 11:09:33,338][26022] Updated weights on worker 0-0, policy_version 697906 (0.00085) [2022-07-10 11:09:35,164][26022] Updated weights on worker 0-0, policy_version 697916 (0.00087) [2022-07-10 11:09:36,991][26022] Updated weights on worker 0-0, policy_version 697926 (0.00086) [2022-07-10 11:09:37,891][25689] Fps is (10 sec: 5677.6, 60 sec: 5599.2, 300 sec: 5555.1). Total num frames: 714681344. Throughput: 0: 5882.2. Samples: 714683238. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:37,891][25689] Avg episode reward: [(0, '-3.314')] [2022-07-10 11:09:38,691][26022] Updated weights on worker 0-0, policy_version 697936 (0.00080) [2022-07-10 11:09:40,740][26022] Updated weights on worker 0-0, policy_version 697946 (0.00082) [2022-07-10 11:09:42,162][26022] Updated weights on worker 0-0, policy_version 697956 (0.00461) [2022-07-10 11:09:42,902][25689] Fps is (10 sec: 5723.8, 60 sec: 5615.6, 300 sec: 5562.0). Total num frames: 714710016. Throughput: 0: 5890.1. Samples: 714717240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:42,903][25689] Avg episode reward: [(0, '-4.195')] [2022-07-10 11:09:44,273][26022] Updated weights on worker 0-0, policy_version 697966 (0.00090) [2022-07-10 11:09:45,706][26022] Updated weights on worker 0-0, policy_version 697976 (0.00085) [2022-07-10 11:09:47,914][25689] Fps is (10 sec: 5618.7, 60 sec: 5599.2, 300 sec: 5557.2). Total num frames: 714737664. Throughput: 0: 5064.2. Samples: 714734242. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:47,916][25689] Avg episode reward: [(0, '-4.362')] [2022-07-10 11:09:47,916][26022] Updated weights on worker 0-0, policy_version 697986 (0.00090) [2022-07-10 11:09:49,531][26022] Updated weights on worker 0-0, policy_version 697996 (0.00086) [2022-07-10 11:09:51,532][26022] Updated weights on worker 0-0, policy_version 698006 (0.00097) [2022-07-10 11:09:53,007][25689] Fps is (10 sec: 5573.5, 60 sec: 5629.1, 300 sec: 5566.2). Total num frames: 714766336. Throughput: 0: 5901.5. Samples: 714768090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:53,008][25689] Avg episode reward: [(0, '-4.404')] [2022-07-10 11:09:53,204][26022] Updated weights on worker 0-0, policy_version 698016 (0.00087) [2022-07-10 11:09:55,237][26022] Updated weights on worker 0-0, policy_version 698026 (0.00093) [2022-07-10 11:09:57,073][26022] Updated weights on worker 0-0, policy_version 698036 (0.00089) [2022-07-10 11:09:58,022][25689] Fps is (10 sec: 5572.0, 60 sec: 5598.1, 300 sec: 5559.9). Total num frames: 714793984. Throughput: 0: 5900.6. Samples: 714802024. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:09:58,022][25689] Avg episode reward: [(0, '-3.635')] [2022-07-10 11:09:58,712][26022] Updated weights on worker 0-0, policy_version 698046 (0.00082) [2022-07-10 11:10:00,439][26022] Updated weights on worker 0-0, policy_version 698056 (0.00082) [2022-07-10 11:10:02,524][26022] Updated weights on worker 0-0, policy_version 698066 (0.00100) [2022-07-10 11:10:03,043][25689] Fps is (10 sec: 5509.9, 60 sec: 5615.6, 300 sec: 5566.8). Total num frames: 714821632. Throughput: 0: 5045.4. Samples: 714818858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:03,043][25689] Avg episode reward: [(0, '-3.075')] [2022-07-10 11:10:04,543][26022] Updated weights on worker 0-0, policy_version 698076 (0.00082) [2022-07-10 11:10:06,540][26022] Updated weights on worker 0-0, policy_version 698086 (0.00510) [2022-07-10 11:10:08,084][25689] Fps is (10 sec: 5495.6, 60 sec: 5613.5, 300 sec: 5564.1). Total num frames: 714849280. Throughput: 0: 5786.2. Samples: 714850948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:08,084][25689] Avg episode reward: [(0, '-2.026')] [2022-07-10 11:10:08,123][26022] Updated weights on worker 0-0, policy_version 698096 (0.00092) [2022-07-10 11:10:10,094][26022] Updated weights on worker 0-0, policy_version 698106 (0.00085) [2022-07-10 11:10:11,855][26022] Updated weights on worker 0-0, policy_version 698116 (0.00090) [2022-07-10 11:10:13,121][25689] Fps is (10 sec: 5486.8, 60 sec: 5597.8, 300 sec: 5563.4). Total num frames: 714876928. Throughput: 0: 5816.7. Samples: 714885086. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:13,121][25689] Avg episode reward: [(0, '-1.439')] [2022-07-10 11:10:13,648][26022] Updated weights on worker 0-0, policy_version 698126 (0.00086) [2022-07-10 11:10:15,446][26022] Updated weights on worker 0-0, policy_version 698136 (0.00088) [2022-07-10 11:10:16,978][26022] Updated weights on worker 0-0, policy_version 698146 (0.00082) [2022-07-10 11:10:18,132][25689] Fps is (10 sec: 5706.9, 60 sec: 5614.1, 300 sec: 5566.7). Total num frames: 714906624. Throughput: 0: 4984.9. Samples: 714902270. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:18,133][25689] Avg episode reward: [(0, '-0.720')] [2022-07-10 11:10:19,081][26022] Updated weights on worker 0-0, policy_version 698156 (0.00082) [2022-07-10 11:10:20,773][26022] Updated weights on worker 0-0, policy_version 698166 (0.00088) [2022-07-10 11:10:22,621][26022] Updated weights on worker 0-0, policy_version 698176 (0.00093) [2022-07-10 11:10:23,193][25689] Fps is (10 sec: 5795.0, 60 sec: 5627.7, 300 sec: 5569.8). Total num frames: 714935296. Throughput: 0: 5823.5. Samples: 714936202. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:23,193][25689] Avg episode reward: [(0, '-1.353')] [2022-07-10 11:10:24,431][26022] Updated weights on worker 0-0, policy_version 698186 (0.00088) [2022-07-10 11:10:26,294][26022] Updated weights on worker 0-0, policy_version 698196 (0.00086) [2022-07-10 11:10:26,653][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:10:26,670][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000698198_714954752.pth [2022-07-10 11:10:26,671][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000696239_712948736.pth [2022-07-10 11:10:28,176][26022] Updated weights on worker 0-0, policy_version 698206 (0.00096) [2022-07-10 11:10:28,218][25689] Fps is (10 sec: 5583.9, 60 sec: 5609.1, 300 sec: 5566.9). Total num frames: 714962944. Throughput: 0: 5907.3. Samples: 714969888. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:28,219][25689] Avg episode reward: [(0, '-1.283')] [2022-07-10 11:10:29,855][26022] Updated weights on worker 0-0, policy_version 698216 (0.00096) [2022-07-10 11:10:31,487][26022] Updated weights on worker 0-0, policy_version 698226 (0.00085) [2022-07-10 11:10:33,293][25689] Fps is (10 sec: 5576.0, 60 sec: 5610.8, 300 sec: 5569.7). Total num frames: 714991616. Throughput: 0: 5046.2. Samples: 714986880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:33,295][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 11:10:33,467][26022] Updated weights on worker 0-0, policy_version 698236 (0.00089) [2022-07-10 11:10:35,446][26022] Updated weights on worker 0-0, policy_version 698246 (0.00086) [2022-07-10 11:10:37,041][26022] Updated weights on worker 0-0, policy_version 698256 (0.00086) [2022-07-10 11:10:38,352][25689] Fps is (10 sec: 5658.8, 60 sec: 5606.0, 300 sec: 5576.1). Total num frames: 715020288. Throughput: 0: 5863.8. Samples: 715020834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:38,352][25689] Avg episode reward: [(0, '-2.791')] [2022-07-10 11:10:39,126][26022] Updated weights on worker 0-0, policy_version 698266 (0.00087) [2022-07-10 11:10:40,408][26022] Updated weights on worker 0-0, policy_version 698276 (0.00790) [2022-07-10 11:10:42,570][26022] Updated weights on worker 0-0, policy_version 698286 (0.00088) [2022-07-10 11:10:43,388][25689] Fps is (10 sec: 5985.0, 60 sec: 5654.5, 300 sec: 5582.9). Total num frames: 715052032. Throughput: 0: 5897.8. Samples: 715055308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:43,390][25689] Avg episode reward: [(0, '-3.168')] [2022-07-10 11:10:44,363][26022] Updated weights on worker 0-0, policy_version 698296 (0.00081) [2022-07-10 11:10:46,037][26022] Updated weights on worker 0-0, policy_version 698306 (0.00085) [2022-07-10 11:10:48,003][26022] Updated weights on worker 0-0, policy_version 698316 (0.00086) [2022-07-10 11:10:48,425][25689] Fps is (10 sec: 5692.7, 60 sec: 5618.3, 300 sec: 5573.5). Total num frames: 715077632. Throughput: 0: 5075.1. Samples: 715072440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:48,427][25689] Avg episode reward: [(0, '-3.497')] [2022-07-10 11:10:49,682][26022] Updated weights on worker 0-0, policy_version 698326 (0.00089) [2022-07-10 11:10:51,597][26022] Updated weights on worker 0-0, policy_version 698336 (0.00085) [2022-07-10 11:10:53,435][26022] Updated weights on worker 0-0, policy_version 698346 (0.00091) [2022-07-10 11:10:53,547][25689] Fps is (10 sec: 5342.2, 60 sec: 5615.6, 300 sec: 5578.5). Total num frames: 715106304. Throughput: 0: 5888.9. Samples: 715106152. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:53,547][25689] Avg episode reward: [(0, '-3.945')] [2022-07-10 11:10:55,120][26022] Updated weights on worker 0-0, policy_version 698356 (0.00087) [2022-07-10 11:10:57,035][26022] Updated weights on worker 0-0, policy_version 698366 (0.00088) [2022-07-10 11:10:58,568][25689] Fps is (10 sec: 5653.8, 60 sec: 5632.0, 300 sec: 5574.9). Total num frames: 715134976. Throughput: 0: 5873.3. Samples: 715139568. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:10:58,568][25689] Avg episode reward: [(0, '-3.726')] [2022-07-10 11:10:58,859][26022] Updated weights on worker 0-0, policy_version 698376 (0.00090) [2022-07-10 11:11:00,635][26022] Updated weights on worker 0-0, policy_version 698386 (0.00092) [2022-07-10 11:11:02,965][26022] Updated weights on worker 0-0, policy_version 698396 (0.00089) [2022-07-10 11:11:03,588][25689] Fps is (10 sec: 5506.9, 60 sec: 5615.1, 300 sec: 5578.3). Total num frames: 715161600. Throughput: 0: 5016.2. Samples: 715156636. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:11:03,589][25689] Avg episode reward: [(0, '-2.434')] [2022-07-10 11:11:04,442][26022] Updated weights on worker 0-0, policy_version 698406 (0.00086) [2022-07-10 11:11:06,384][26022] Updated weights on worker 0-0, policy_version 698416 (0.00086) [2022-07-10 11:11:08,271][26022] Updated weights on worker 0-0, policy_version 698426 (0.00096) [2022-07-10 11:11:08,615][25689] Fps is (10 sec: 5401.8, 60 sec: 5616.4, 300 sec: 5577.2). Total num frames: 715189248. Throughput: 0: 5753.0. Samples: 715188590. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:11:08,615][25689] Avg episode reward: [(0, '-2.312')] [2022-07-10 11:11:10,079][26022] Updated weights on worker 0-0, policy_version 698436 (0.00101) [2022-07-10 11:11:11,937][26022] Updated weights on worker 0-0, policy_version 698446 (0.00088) [2022-07-10 11:11:13,641][26022] Updated weights on worker 0-0, policy_version 698456 (0.00080) [2022-07-10 11:11:13,713][25689] Fps is (10 sec: 5663.6, 60 sec: 5644.5, 300 sec: 5580.3). Total num frames: 715218944. Throughput: 0: 5764.7. Samples: 715222404. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 11:11:13,714][25689] Avg episode reward: [(0, '-2.783')] [2022-07-10 11:11:15,436][26022] Updated weights on worker 0-0, policy_version 698466 (0.00089) [2022-07-10 11:11:17,499][26022] Updated weights on worker 0-0, policy_version 698476 (0.00092) [2022-07-10 11:11:18,730][25689] Fps is (10 sec: 5668.8, 60 sec: 5610.2, 300 sec: 5580.5). Total num frames: 715246592. Throughput: 0: 5782.2. Samples: 715256152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:18,731][25689] Avg episode reward: [(0, '-2.496')] [2022-07-10 11:11:19,217][26022] Updated weights on worker 0-0, policy_version 698486 (0.00084) [2022-07-10 11:11:21,018][26022] Updated weights on worker 0-0, policy_version 698496 (0.00089) [2022-07-10 11:11:22,826][26022] Updated weights on worker 0-0, policy_version 698506 (0.00084) [2022-07-10 11:11:23,741][25689] Fps is (10 sec: 5514.2, 60 sec: 5597.9, 300 sec: 5577.3). Total num frames: 715274240. Throughput: 0: 5778.5. Samples: 715273090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:23,742][25689] Avg episode reward: [(0, '-3.253')] [2022-07-10 11:11:24,487][26022] Updated weights on worker 0-0, policy_version 698516 (0.00091) [2022-07-10 11:11:26,640][26022] Updated weights on worker 0-0, policy_version 698526 (0.00092) [2022-07-10 11:11:28,344][26022] Updated weights on worker 0-0, policy_version 698536 (0.00090) [2022-07-10 11:11:28,775][25689] Fps is (10 sec: 5505.1, 60 sec: 5597.1, 300 sec: 5578.1). Total num frames: 715301888. Throughput: 0: 5867.8. Samples: 715306886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:28,775][25689] Avg episode reward: [(0, '-4.207')] [2022-07-10 11:11:30,099][26022] Updated weights on worker 0-0, policy_version 698546 (0.00087) [2022-07-10 11:11:32,344][26022] Updated weights on worker 0-0, policy_version 698556 (0.00085) [2022-07-10 11:11:33,578][26022] Updated weights on worker 0-0, policy_version 698566 (0.00086) [2022-07-10 11:11:33,833][25689] Fps is (10 sec: 5885.5, 60 sec: 5649.5, 300 sec: 5591.0). Total num frames: 715333632. Throughput: 0: 5866.3. Samples: 715340430. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:33,833][25689] Avg episode reward: [(0, '-5.921')] [2022-07-10 11:11:35,857][26022] Updated weights on worker 0-0, policy_version 698576 (0.00076) [2022-07-10 11:11:37,303][26022] Updated weights on worker 0-0, policy_version 698586 (0.00087) [2022-07-10 11:11:38,861][25689] Fps is (10 sec: 5685.5, 60 sec: 5601.5, 300 sec: 5580.7). Total num frames: 715359232. Throughput: 0: 5037.7. Samples: 715357564. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:38,861][25689] Avg episode reward: [(0, '-5.844')] [2022-07-10 11:11:39,239][26022] Updated weights on worker 0-0, policy_version 698596 (0.00088) [2022-07-10 11:11:40,983][26022] Updated weights on worker 0-0, policy_version 698606 (0.00089) [2022-07-10 11:11:42,825][26022] Updated weights on worker 0-0, policy_version 698616 (0.00085) [2022-07-10 11:11:43,878][25689] Fps is (10 sec: 5300.6, 60 sec: 5535.6, 300 sec: 5580.6). Total num frames: 715386880. Throughput: 0: 5889.4. Samples: 715391684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:43,879][25689] Avg episode reward: [(0, '-4.794')] [2022-07-10 11:11:44,562][26022] Updated weights on worker 0-0, policy_version 698626 (0.00079) [2022-07-10 11:11:46,499][26022] Updated weights on worker 0-0, policy_version 698636 (0.00075) [2022-07-10 11:11:48,243][26022] Updated weights on worker 0-0, policy_version 698646 (0.00088) [2022-07-10 11:11:48,891][25689] Fps is (10 sec: 5819.4, 60 sec: 5622.5, 300 sec: 5588.1). Total num frames: 715417600. Throughput: 0: 5929.6. Samples: 715426166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:48,891][25689] Avg episode reward: [(0, '-4.887')] [2022-07-10 11:11:50,083][26022] Updated weights on worker 0-0, policy_version 698656 (0.00086) [2022-07-10 11:11:51,867][26022] Updated weights on worker 0-0, policy_version 698666 (0.00096) [2022-07-10 11:11:53,639][26022] Updated weights on worker 0-0, policy_version 698676 (0.00103) [2022-07-10 11:11:53,962][25689] Fps is (10 sec: 5788.1, 60 sec: 5610.2, 300 sec: 5585.0). Total num frames: 715445248. Throughput: 0: 5108.6. Samples: 715443266. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:53,963][25689] Avg episode reward: [(0, '-5.126')] [2022-07-10 11:11:55,485][26022] Updated weights on worker 0-0, policy_version 698686 (0.00103) [2022-07-10 11:11:57,329][26022] Updated weights on worker 0-0, policy_version 698696 (0.00093) [2022-07-10 11:11:58,982][25689] Fps is (10 sec: 5479.9, 60 sec: 5593.4, 300 sec: 5585.1). Total num frames: 715472896. Throughput: 0: 5929.6. Samples: 715476872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:11:58,982][25689] Avg episode reward: [(0, '-4.091')] [2022-07-10 11:11:59,160][26022] Updated weights on worker 0-0, policy_version 698706 (0.00086) [2022-07-10 11:12:00,892][26022] Updated weights on worker 0-0, policy_version 698716 (0.00084) [2022-07-10 11:12:03,126][26022] Updated weights on worker 0-0, policy_version 698726 (0.00090) [2022-07-10 11:12:04,005][25689] Fps is (10 sec: 5404.2, 60 sec: 5593.2, 300 sec: 5588.3). Total num frames: 715499520. Throughput: 0: 5789.7. Samples: 715508212. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:04,006][25689] Avg episode reward: [(0, '-2.502')] [2022-07-10 11:12:05,010][26022] Updated weights on worker 0-0, policy_version 698736 (0.00094) [2022-07-10 11:12:06,948][26022] Updated weights on worker 0-0, policy_version 698746 (0.00090) [2022-07-10 11:12:08,490][26022] Updated weights on worker 0-0, policy_version 698756 (0.00116) [2022-07-10 11:12:09,019][25689] Fps is (10 sec: 5509.2, 60 sec: 5611.3, 300 sec: 5589.6). Total num frames: 715528192. Throughput: 0: 4903.9. Samples: 715524870. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:09,019][25689] Avg episode reward: [(0, '-3.224')] [2022-07-10 11:12:10,639][26022] Updated weights on worker 0-0, policy_version 698766 (0.00063) [2022-07-10 11:12:12,359][26022] Updated weights on worker 0-0, policy_version 698776 (0.00085) [2022-07-10 11:12:14,157][25689] Fps is (10 sec: 5547.8, 60 sec: 5573.7, 300 sec: 5587.5). Total num frames: 715555840. Throughput: 0: 5709.1. Samples: 715558558. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:14,157][25689] Avg episode reward: [(0, '-3.122')] [2022-07-10 11:12:14,221][26022] Updated weights on worker 0-0, policy_version 698786 (0.00086) [2022-07-10 11:12:15,840][26022] Updated weights on worker 0-0, policy_version 698796 (0.00099) [2022-07-10 11:12:17,760][26022] Updated weights on worker 0-0, policy_version 698806 (0.00085) [2022-07-10 11:12:19,234][25689] Fps is (10 sec: 5613.3, 60 sec: 5602.0, 300 sec: 5597.1). Total num frames: 715585536. Throughput: 0: 5717.3. Samples: 715592664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:19,235][25689] Avg episode reward: [(0, '-3.060')] [2022-07-10 11:12:19,835][26022] Updated weights on worker 0-0, policy_version 698816 (0.00084) [2022-07-10 11:12:21,315][26022] Updated weights on worker 0-0, policy_version 698826 (0.00083) [2022-07-10 11:12:23,371][26022] Updated weights on worker 0-0, policy_version 698836 (0.00085) [2022-07-10 11:12:24,237][25689] Fps is (10 sec: 5587.4, 60 sec: 5585.9, 300 sec: 5588.3). Total num frames: 715612160. Throughput: 0: 5011.0. Samples: 715609592. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:24,237][25689] Avg episode reward: [(0, '-3.024')] [2022-07-10 11:12:24,870][26022] Updated weights on worker 0-0, policy_version 698846 (0.01092) [2022-07-10 11:12:26,963][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:12:26,974][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000698856_715628544.pth [2022-07-10 11:12:26,974][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000696891_713616384.pth [2022-07-10 11:12:26,978][26022] Updated weights on worker 0-0, policy_version 698856 (0.00101) [2022-07-10 11:12:28,670][26022] Updated weights on worker 0-0, policy_version 698866 (0.00095) [2022-07-10 11:12:29,246][25689] Fps is (10 sec: 5727.7, 60 sec: 5638.9, 300 sec: 5599.5). Total num frames: 715642880. Throughput: 0: 5874.5. Samples: 715643696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:29,253][25689] Avg episode reward: [(0, '-3.138')] [2022-07-10 11:12:30,491][26022] Updated weights on worker 0-0, policy_version 698876 (0.00434) [2022-07-10 11:12:32,239][26022] Updated weights on worker 0-0, policy_version 698886 (0.00091) [2022-07-10 11:12:34,177][26022] Updated weights on worker 0-0, policy_version 698896 (0.00086) [2022-07-10 11:12:34,349][25689] Fps is (10 sec: 5670.5, 60 sec: 5550.1, 300 sec: 5597.9). Total num frames: 715669504. Throughput: 0: 5895.3. Samples: 715677600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:34,350][25689] Avg episode reward: [(0, '-2.493')] [2022-07-10 11:12:35,960][26022] Updated weights on worker 0-0, policy_version 698906 (0.00092) [2022-07-10 11:12:37,763][26022] Updated weights on worker 0-0, policy_version 698916 (0.00092) [2022-07-10 11:12:39,355][25689] Fps is (10 sec: 5470.1, 60 sec: 5602.9, 300 sec: 5595.9). Total num frames: 715698176. Throughput: 0: 5058.8. Samples: 715694450. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:39,355][25689] Avg episode reward: [(0, '-2.681')] [2022-07-10 11:12:39,416][26022] Updated weights on worker 0-0, policy_version 698926 (0.00057) [2022-07-10 11:12:41,363][26022] Updated weights on worker 0-0, policy_version 698936 (0.00081) [2022-07-10 11:12:43,095][26022] Updated weights on worker 0-0, policy_version 698946 (0.00083) [2022-07-10 11:12:44,361][25689] Fps is (10 sec: 5727.8, 60 sec: 5620.9, 300 sec: 5597.1). Total num frames: 715726848. Throughput: 0: 5903.1. Samples: 715728388. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:44,361][25689] Avg episode reward: [(0, '-2.245')] [2022-07-10 11:12:45,069][26022] Updated weights on worker 0-0, policy_version 698956 (0.00087) [2022-07-10 11:12:46,845][26022] Updated weights on worker 0-0, policy_version 698966 (0.00084) [2022-07-10 11:12:48,655][26022] Updated weights on worker 0-0, policy_version 698976 (0.00083) [2022-07-10 11:12:49,372][25689] Fps is (10 sec: 5724.4, 60 sec: 5587.2, 300 sec: 5598.2). Total num frames: 715755520. Throughput: 0: 5888.7. Samples: 715762214. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:49,373][25689] Avg episode reward: [(0, '-3.764')] [2022-07-10 11:12:50,543][26022] Updated weights on worker 0-0, policy_version 698986 (0.00092) [2022-07-10 11:12:52,237][26022] Updated weights on worker 0-0, policy_version 698996 (0.00079) [2022-07-10 11:12:54,107][26022] Updated weights on worker 0-0, policy_version 699006 (0.00083) [2022-07-10 11:12:54,457][25689] Fps is (10 sec: 5578.4, 60 sec: 5586.0, 300 sec: 5595.1). Total num frames: 715783168. Throughput: 0: 5037.3. Samples: 715778888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:54,457][25689] Avg episode reward: [(0, '-4.778')] [2022-07-10 11:12:55,947][26022] Updated weights on worker 0-0, policy_version 699016 (0.00079) [2022-07-10 11:12:57,697][26022] Updated weights on worker 0-0, policy_version 699026 (0.00085) [2022-07-10 11:12:59,250][26022] Updated weights on worker 0-0, policy_version 699036 (0.00085) [2022-07-10 11:12:59,476][25689] Fps is (10 sec: 5675.8, 60 sec: 5619.9, 300 sec: 5609.2). Total num frames: 715812864. Throughput: 0: 5895.2. Samples: 715813066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:12:59,476][25689] Avg episode reward: [(0, '-4.921')] [2022-07-10 11:13:01,611][26022] Updated weights on worker 0-0, policy_version 699046 (0.00091) [2022-07-10 11:13:03,459][26022] Updated weights on worker 0-0, policy_version 699056 (0.00088) [2022-07-10 11:13:04,478][25689] Fps is (10 sec: 5415.7, 60 sec: 5587.9, 300 sec: 5595.8). Total num frames: 715837440. Throughput: 0: 5783.5. Samples: 715844736. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:04,480][25689] Avg episode reward: [(0, '-5.129')] [2022-07-10 11:13:05,355][26022] Updated weights on worker 0-0, policy_version 699066 (0.00084) [2022-07-10 11:13:07,119][26022] Updated weights on worker 0-0, policy_version 699076 (0.00081) [2022-07-10 11:13:08,819][26022] Updated weights on worker 0-0, policy_version 699086 (0.00087) [2022-07-10 11:13:09,580][25689] Fps is (10 sec: 5269.9, 60 sec: 5579.8, 300 sec: 5598.5). Total num frames: 715866112. Throughput: 0: 4931.4. Samples: 715861864. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:09,582][25689] Avg episode reward: [(0, '-4.328')] [2022-07-10 11:13:10,948][26022] Updated weights on worker 0-0, policy_version 699096 (0.00092) [2022-07-10 11:13:12,595][26022] Updated weights on worker 0-0, policy_version 699106 (0.00086) [2022-07-10 11:13:14,306][26022] Updated weights on worker 0-0, policy_version 699116 (0.00083) [2022-07-10 11:13:14,634][25689] Fps is (10 sec: 5747.5, 60 sec: 5621.5, 300 sec: 5601.7). Total num frames: 715895808. Throughput: 0: 5791.2. Samples: 715895736. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:14,634][25689] Avg episode reward: [(0, '-2.697')] [2022-07-10 11:13:16,286][26022] Updated weights on worker 0-0, policy_version 699126 (0.00051) [2022-07-10 11:13:17,947][26022] Updated weights on worker 0-0, policy_version 699136 (0.00088) [2022-07-10 11:13:19,643][25689] Fps is (10 sec: 5596.5, 60 sec: 5576.9, 300 sec: 5598.5). Total num frames: 715922432. Throughput: 0: 5788.7. Samples: 715929810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:19,644][25689] Avg episode reward: [(0, '-1.963')] [2022-07-10 11:13:19,868][26022] Updated weights on worker 0-0, policy_version 699146 (0.00083) [2022-07-10 11:13:21,643][26022] Updated weights on worker 0-0, policy_version 699156 (0.00087) [2022-07-10 11:13:23,419][26022] Updated weights on worker 0-0, policy_version 699166 (0.00090) [2022-07-10 11:13:24,672][25689] Fps is (10 sec: 5712.6, 60 sec: 5642.3, 300 sec: 5608.3). Total num frames: 715953152. Throughput: 0: 5066.9. Samples: 715947054. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:24,674][25689] Avg episode reward: [(0, '-0.895')] [2022-07-10 11:13:25,058][26022] Updated weights on worker 0-0, policy_version 699176 (0.00086) [2022-07-10 11:13:27,070][26022] Updated weights on worker 0-0, policy_version 699186 (0.00091) [2022-07-10 11:13:28,798][26022] Updated weights on worker 0-0, policy_version 699196 (0.00090) [2022-07-10 11:13:29,686][25689] Fps is (10 sec: 5914.0, 60 sec: 5608.0, 300 sec: 5613.2). Total num frames: 715981824. Throughput: 0: 5941.6. Samples: 715981326. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:29,686][25689] Avg episode reward: [(0, '-1.270')] [2022-07-10 11:13:30,697][26022] Updated weights on worker 0-0, policy_version 699206 (0.00094) [2022-07-10 11:13:32,437][26022] Updated weights on worker 0-0, policy_version 699216 (0.00085) [2022-07-10 11:13:34,147][26022] Updated weights on worker 0-0, policy_version 699226 (0.00589) [2022-07-10 11:13:34,833][25689] Fps is (10 sec: 5542.6, 60 sec: 5620.8, 300 sec: 5604.0). Total num frames: 716009472. Throughput: 0: 5907.0. Samples: 716015054. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:34,834][25689] Avg episode reward: [(0, '-1.225')] [2022-07-10 11:13:35,946][26022] Updated weights on worker 0-0, policy_version 699236 (0.00082) [2022-07-10 11:13:37,934][26022] Updated weights on worker 0-0, policy_version 699246 (0.00089) [2022-07-10 11:13:39,655][26022] Updated weights on worker 0-0, policy_version 699256 (0.00083) [2022-07-10 11:13:39,901][25689] Fps is (10 sec: 5613.6, 60 sec: 5631.9, 300 sec: 5609.7). Total num frames: 716039168. Throughput: 0: 5051.8. Samples: 716032148. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:39,901][25689] Avg episode reward: [(0, '-1.676')] [2022-07-10 11:13:41,491][26022] Updated weights on worker 0-0, policy_version 699266 (0.00082) [2022-07-10 11:13:43,174][26022] Updated weights on worker 0-0, policy_version 699276 (0.00092) [2022-07-10 11:13:44,937][25689] Fps is (10 sec: 5574.2, 60 sec: 5595.4, 300 sec: 5602.5). Total num frames: 716065792. Throughput: 0: 5883.5. Samples: 716066284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:44,938][25689] Avg episode reward: [(0, '-1.691')] [2022-07-10 11:13:45,263][26022] Updated weights on worker 0-0, policy_version 699286 (0.00091) [2022-07-10 11:13:46,961][26022] Updated weights on worker 0-0, policy_version 699296 (0.00089) [2022-07-10 11:13:48,674][26022] Updated weights on worker 0-0, policy_version 699306 (0.00051) [2022-07-10 11:13:49,942][25689] Fps is (10 sec: 5609.0, 60 sec: 5612.9, 300 sec: 5613.7). Total num frames: 716095488. Throughput: 0: 5872.2. Samples: 716100274. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:49,942][25689] Avg episode reward: [(0, '-2.032')] [2022-07-10 11:13:50,583][26022] Updated weights on worker 0-0, policy_version 699316 (0.00096) [2022-07-10 11:13:52,450][26022] Updated weights on worker 0-0, policy_version 699326 (0.00097) [2022-07-10 11:13:54,270][26022] Updated weights on worker 0-0, policy_version 699336 (0.00098) [2022-07-10 11:13:54,982][25689] Fps is (10 sec: 5810.3, 60 sec: 5633.9, 300 sec: 5610.4). Total num frames: 716124160. Throughput: 0: 5055.3. Samples: 716116914. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:13:54,983][25689] Avg episode reward: [(0, '-2.161')] [2022-07-10 11:13:55,920][26022] Updated weights on worker 0-0, policy_version 699346 (0.00082) [2022-07-10 11:13:57,669][26022] Updated weights on worker 0-0, policy_version 699356 (0.00082) [2022-07-10 11:13:59,694][26022] Updated weights on worker 0-0, policy_version 699366 (0.00089) [2022-07-10 11:14:00,053][25689] Fps is (10 sec: 5570.3, 60 sec: 5595.3, 300 sec: 5613.0). Total num frames: 716151808. Throughput: 0: 5901.4. Samples: 716151070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:00,060][25689] Avg episode reward: [(0, '-1.337')] [2022-07-10 11:14:01,910][26022] Updated weights on worker 0-0, policy_version 699376 (0.00094) [2022-07-10 11:14:03,739][26022] Updated weights on worker 0-0, policy_version 699386 (0.00082) [2022-07-10 11:14:05,122][25689] Fps is (10 sec: 5453.7, 60 sec: 5639.8, 300 sec: 5612.0). Total num frames: 716179456. Throughput: 0: 5765.8. Samples: 716182666. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:05,122][25689] Avg episode reward: [(0, '-1.475')] [2022-07-10 11:14:05,344][26022] Updated weights on worker 0-0, policy_version 699396 (0.00090) [2022-07-10 11:14:07,320][26022] Updated weights on worker 0-0, policy_version 699406 (0.00081) [2022-07-10 11:14:09,131][26022] Updated weights on worker 0-0, policy_version 699416 (0.00086) [2022-07-10 11:14:10,180][25689] Fps is (10 sec: 5459.9, 60 sec: 5626.9, 300 sec: 5608.4). Total num frames: 716207104. Throughput: 0: 5755.6. Samples: 716216758. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:10,181][25689] Avg episode reward: [(0, '-1.864')] [2022-07-10 11:14:10,943][26022] Updated weights on worker 0-0, policy_version 699426 (0.00080) [2022-07-10 11:14:12,552][26022] Updated weights on worker 0-0, policy_version 699436 (0.00089) [2022-07-10 11:14:14,481][26022] Updated weights on worker 0-0, policy_version 699446 (0.00088) [2022-07-10 11:14:15,252][25689] Fps is (10 sec: 5660.7, 60 sec: 5625.2, 300 sec: 5610.6). Total num frames: 716236800. Throughput: 0: 5768.4. Samples: 716233834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:15,253][25689] Avg episode reward: [(0, '-3.172')] [2022-07-10 11:14:16,146][26022] Updated weights on worker 0-0, policy_version 699456 (0.00105) [2022-07-10 11:14:18,029][26022] Updated weights on worker 0-0, policy_version 699466 (0.00085) [2022-07-10 11:14:19,792][26022] Updated weights on worker 0-0, policy_version 699476 (0.00085) [2022-07-10 11:14:20,279][25689] Fps is (10 sec: 5779.8, 60 sec: 5657.4, 300 sec: 5614.0). Total num frames: 716265472. Throughput: 0: 5806.2. Samples: 716268508. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:20,279][25689] Avg episode reward: [(0, '-3.025')] [2022-07-10 11:14:21,662][26022] Updated weights on worker 0-0, policy_version 699486 (0.00483) [2022-07-10 11:14:23,400][26022] Updated weights on worker 0-0, policy_version 699496 (0.00086) [2022-07-10 11:14:25,331][25689] Fps is (10 sec: 5587.9, 60 sec: 5604.6, 300 sec: 5609.7). Total num frames: 716293120. Throughput: 0: 5945.8. Samples: 716302824. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:25,332][25689] Avg episode reward: [(0, '-2.882')] [2022-07-10 11:14:25,365][26022] Updated weights on worker 0-0, policy_version 699506 (0.00083) [2022-07-10 11:14:27,115][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:14:27,124][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000699516_716304384.pth [2022-07-10 11:14:27,125][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000697540_714280960.pth [2022-07-10 11:14:27,126][26022] Updated weights on worker 0-0, policy_version 699516 (0.00088) [2022-07-10 11:14:28,922][26022] Updated weights on worker 0-0, policy_version 699526 (0.00087) [2022-07-10 11:14:30,335][25689] Fps is (10 sec: 5702.3, 60 sec: 5622.3, 300 sec: 5614.8). Total num frames: 716322816. Throughput: 0: 5112.0. Samples: 716319790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:30,337][25689] Avg episode reward: [(0, '-2.835')] [2022-07-10 11:14:30,762][26022] Updated weights on worker 0-0, policy_version 699536 (0.00087) [2022-07-10 11:14:32,597][26022] Updated weights on worker 0-0, policy_version 699546 (0.00087) [2022-07-10 11:14:34,300][26022] Updated weights on worker 0-0, policy_version 699556 (0.00093) [2022-07-10 11:14:35,383][25689] Fps is (10 sec: 5704.6, 60 sec: 5631.6, 300 sec: 5610.6). Total num frames: 716350464. Throughput: 0: 5960.9. Samples: 716353834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:14:35,383][25689] Avg episode reward: [(0, '-3.030')] [2022-07-10 11:14:36,073][26022] Updated weights on worker 0-0, policy_version 699566 (0.00087) [2022-07-10 11:14:37,933][26022] Updated weights on worker 0-0, policy_version 699576 (0.00084) [2022-07-10 11:14:39,872][26022] Updated weights on worker 0-0, policy_version 699586 (0.00083) [2022-07-10 11:14:40,395][25689] Fps is (10 sec: 5496.5, 60 sec: 5602.9, 300 sec: 5607.2). Total num frames: 716378112. Throughput: 0: 5915.6. Samples: 716387510. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:14:40,396][25689] Avg episode reward: [(0, '-2.946')] [2022-07-10 11:14:41,450][26022] Updated weights on worker 0-0, policy_version 699596 (0.00081) [2022-07-10 11:14:43,671][26022] Updated weights on worker 0-0, policy_version 699606 (0.00086) [2022-07-10 11:14:45,071][26022] Updated weights on worker 0-0, policy_version 699616 (0.00088) [2022-07-10 11:14:45,427][25689] Fps is (10 sec: 5709.1, 60 sec: 5654.1, 300 sec: 5613.7). Total num frames: 716407808. Throughput: 0: 5056.2. Samples: 716404436. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:14:45,428][25689] Avg episode reward: [(0, '-2.288')] [2022-07-10 11:14:47,238][26022] Updated weights on worker 0-0, policy_version 699626 (0.00377) [2022-07-10 11:14:48,764][26022] Updated weights on worker 0-0, policy_version 699636 (0.00082) [2022-07-10 11:14:50,458][25689] Fps is (10 sec: 5698.5, 60 sec: 5617.8, 300 sec: 5611.4). Total num frames: 716435456. Throughput: 0: 5894.7. Samples: 716438410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:14:50,459][25689] Avg episode reward: [(0, '-3.774')] [2022-07-10 11:14:50,577][26022] Updated weights on worker 0-0, policy_version 699646 (0.00081) [2022-07-10 11:14:52,548][26022] Updated weights on worker 0-0, policy_version 699656 (0.00087) [2022-07-10 11:14:54,217][26022] Updated weights on worker 0-0, policy_version 699666 (0.00089) [2022-07-10 11:14:55,580][25689] Fps is (10 sec: 5648.3, 60 sec: 5627.2, 300 sec: 5616.3). Total num frames: 716465152. Throughput: 0: 5868.8. Samples: 716472364. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:14:55,581][25689] Avg episode reward: [(0, '-5.726')] [2022-07-10 11:14:56,020][26022] Updated weights on worker 0-0, policy_version 699676 (0.00095) [2022-07-10 11:14:57,972][26022] Updated weights on worker 0-0, policy_version 699686 (0.00094) [2022-07-10 11:14:59,670][26022] Updated weights on worker 0-0, policy_version 699696 (0.00086) [2022-07-10 11:15:00,591][25689] Fps is (10 sec: 5760.4, 60 sec: 5649.6, 300 sec: 5619.9). Total num frames: 716493824. Throughput: 0: 5040.2. Samples: 716489300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:00,591][25689] Avg episode reward: [(0, '-5.859')] [2022-07-10 11:15:01,747][26022] Updated weights on worker 0-0, policy_version 699706 (0.00088) [2022-07-10 11:15:03,645][26022] Updated weights on worker 0-0, policy_version 699716 (0.00104) [2022-07-10 11:15:05,642][25689] Fps is (10 sec: 5292.0, 60 sec: 5600.5, 300 sec: 5609.4). Total num frames: 716518400. Throughput: 0: 5754.2. Samples: 716520754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:05,642][25689] Avg episode reward: [(0, '-5.437')] [2022-07-10 11:15:05,770][26022] Updated weights on worker 0-0, policy_version 699726 (0.00089) [2022-07-10 11:15:07,296][26022] Updated weights on worker 0-0, policy_version 699736 (0.00084) [2022-07-10 11:15:09,463][26022] Updated weights on worker 0-0, policy_version 699746 (0.00087) [2022-07-10 11:15:10,667][25689] Fps is (10 sec: 5487.8, 60 sec: 5654.4, 300 sec: 5619.9). Total num frames: 716549120. Throughput: 0: 5742.1. Samples: 716554450. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:10,668][25689] Avg episode reward: [(0, '-6.344')] [2022-07-10 11:15:11,058][26022] Updated weights on worker 0-0, policy_version 699756 (0.00085) [2022-07-10 11:15:13,004][26022] Updated weights on worker 0-0, policy_version 699766 (0.00086) [2022-07-10 11:15:14,608][26022] Updated weights on worker 0-0, policy_version 699776 (0.00082) [2022-07-10 11:15:15,766][25689] Fps is (10 sec: 5765.4, 60 sec: 5618.0, 300 sec: 5611.4). Total num frames: 716576768. Throughput: 0: 4903.3. Samples: 716571342. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:15,766][25689] Avg episode reward: [(0, '-6.320')] [2022-07-10 11:15:16,608][26022] Updated weights on worker 0-0, policy_version 699786 (0.00092) [2022-07-10 11:15:18,184][26022] Updated weights on worker 0-0, policy_version 699796 (0.00094) [2022-07-10 11:15:20,253][26022] Updated weights on worker 0-0, policy_version 699806 (0.00916) [2022-07-10 11:15:20,860][25689] Fps is (10 sec: 5525.4, 60 sec: 5611.8, 300 sec: 5610.8). Total num frames: 716605440. Throughput: 0: 5735.0. Samples: 716605544. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:20,861][25689] Avg episode reward: [(0, '-3.937')] [2022-07-10 11:15:21,948][26022] Updated weights on worker 0-0, policy_version 699816 (0.00085) [2022-07-10 11:15:23,685][26022] Updated weights on worker 0-0, policy_version 699826 (0.00088) [2022-07-10 11:15:25,505][26022] Updated weights on worker 0-0, policy_version 699836 (0.00093) [2022-07-10 11:15:25,927][25689] Fps is (10 sec: 5441.6, 60 sec: 5593.4, 300 sec: 5606.5). Total num frames: 716632064. Throughput: 0: 5867.9. Samples: 716639788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:25,929][25689] Avg episode reward: [(0, '-2.277')] [2022-07-10 11:15:27,255][26022] Updated weights on worker 0-0, policy_version 699846 (0.00080) [2022-07-10 11:15:29,215][26022] Updated weights on worker 0-0, policy_version 699856 (0.00085) [2022-07-10 11:15:30,904][26022] Updated weights on worker 0-0, policy_version 699866 (0.00089) [2022-07-10 11:15:30,955][25689] Fps is (10 sec: 5680.6, 60 sec: 5608.2, 300 sec: 5614.3). Total num frames: 716662784. Throughput: 0: 5038.2. Samples: 716656664. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:30,955][25689] Avg episode reward: [(0, '-2.341')] [2022-07-10 11:15:32,816][26022] Updated weights on worker 0-0, policy_version 699876 (0.00085) [2022-07-10 11:15:34,646][26022] Updated weights on worker 0-0, policy_version 699886 (0.00089) [2022-07-10 11:15:35,983][25689] Fps is (10 sec: 5804.6, 60 sec: 5610.0, 300 sec: 5611.5). Total num frames: 716690432. Throughput: 0: 5885.2. Samples: 716690324. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:35,985][25689] Avg episode reward: [(0, '-3.367')] [2022-07-10 11:15:36,566][26022] Updated weights on worker 0-0, policy_version 699896 (0.00092) [2022-07-10 11:15:38,175][26022] Updated weights on worker 0-0, policy_version 699906 (0.00086) [2022-07-10 11:15:40,214][26022] Updated weights on worker 0-0, policy_version 699916 (0.00087) [2022-07-10 11:15:41,023][25689] Fps is (10 sec: 5390.6, 60 sec: 5590.6, 300 sec: 5594.2). Total num frames: 716717056. Throughput: 0: 5891.2. Samples: 716724326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:41,023][25689] Avg episode reward: [(0, '-2.485')] [2022-07-10 11:15:41,582][26022] Updated weights on worker 0-0, policy_version 699926 (0.00082) [2022-07-10 11:15:43,748][26022] Updated weights on worker 0-0, policy_version 699936 (0.00084) [2022-07-10 11:15:45,175][26022] Updated weights on worker 0-0, policy_version 699946 (0.00078) [2022-07-10 11:15:46,030][25689] Fps is (10 sec: 5707.2, 60 sec: 5609.7, 300 sec: 5611.9). Total num frames: 716747776. Throughput: 0: 5048.6. Samples: 716741280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:46,031][25689] Avg episode reward: [(0, '-3.478')] [2022-07-10 11:15:47,307][26022] Updated weights on worker 0-0, policy_version 699956 (0.00085) [2022-07-10 11:15:49,035][26022] Updated weights on worker 0-0, policy_version 699966 (0.00099) [2022-07-10 11:15:50,901][26022] Updated weights on worker 0-0, policy_version 699976 (0.00089) [2022-07-10 11:15:51,042][25689] Fps is (10 sec: 5927.8, 60 sec: 5628.5, 300 sec: 5614.0). Total num frames: 716776448. Throughput: 0: 5899.0. Samples: 716775158. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:51,044][25689] Avg episode reward: [(0, '-3.913')] [2022-07-10 11:15:52,719][26022] Updated weights on worker 0-0, policy_version 699986 (0.00086) [2022-07-10 11:15:54,539][26022] Updated weights on worker 0-0, policy_version 699996 (0.00087) [2022-07-10 11:15:56,185][25689] Fps is (10 sec: 5546.1, 60 sec: 5592.6, 300 sec: 5608.3). Total num frames: 716804096. Throughput: 0: 5887.7. Samples: 716809270. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:15:56,186][25689] Avg episode reward: [(0, '-3.139')] [2022-07-10 11:15:56,403][26022] Updated weights on worker 0-0, policy_version 700006 (0.00092) [2022-07-10 11:15:58,189][26022] Updated weights on worker 0-0, policy_version 700016 (0.00085) [2022-07-10 11:15:59,996][26022] Updated weights on worker 0-0, policy_version 700026 (0.00098) [2022-07-10 11:16:01,195][25689] Fps is (10 sec: 5546.9, 60 sec: 5592.7, 300 sec: 5615.3). Total num frames: 716832768. Throughput: 0: 5055.2. Samples: 716826302. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:01,196][25689] Avg episode reward: [(0, '-3.165')] [2022-07-10 11:16:01,734][26022] Updated weights on worker 0-0, policy_version 700036 (0.00077) [2022-07-10 11:16:03,921][26022] Updated weights on worker 0-0, policy_version 700046 (0.00086) [2022-07-10 11:16:05,687][26022] Updated weights on worker 0-0, policy_version 700056 (0.00090) [2022-07-10 11:16:06,227][25689] Fps is (10 sec: 5506.8, 60 sec: 5628.3, 300 sec: 5611.8). Total num frames: 716859392. Throughput: 0: 5786.7. Samples: 716858150. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:06,227][25689] Avg episode reward: [(0, '-3.139')] [2022-07-10 11:16:07,695][26022] Updated weights on worker 0-0, policy_version 700066 (0.00121) [2022-07-10 11:16:09,521][26022] Updated weights on worker 0-0, policy_version 700076 (0.00086) [2022-07-10 11:16:11,092][26022] Updated weights on worker 0-0, policy_version 700086 (0.00095) [2022-07-10 11:16:11,300][25689] Fps is (10 sec: 5573.7, 60 sec: 5607.0, 300 sec: 5612.3). Total num frames: 716889088. Throughput: 0: 5765.9. Samples: 716891964. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:11,300][25689] Avg episode reward: [(0, '-2.806')] [2022-07-10 11:16:13,227][26022] Updated weights on worker 0-0, policy_version 700096 (0.00091) [2022-07-10 11:16:14,794][26022] Updated weights on worker 0-0, policy_version 700106 (0.00095) [2022-07-10 11:16:16,415][25689] Fps is (10 sec: 5527.7, 60 sec: 5588.5, 300 sec: 5607.0). Total num frames: 716915712. Throughput: 0: 5774.1. Samples: 716926080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:16,416][25689] Avg episode reward: [(0, '-0.867')] [2022-07-10 11:16:16,756][26022] Updated weights on worker 0-0, policy_version 700116 (0.00083) [2022-07-10 11:16:18,391][26022] Updated weights on worker 0-0, policy_version 700126 (0.00096) [2022-07-10 11:16:20,459][26022] Updated weights on worker 0-0, policy_version 700136 (0.00097) [2022-07-10 11:16:21,478][25689] Fps is (10 sec: 5432.6, 60 sec: 5591.5, 300 sec: 5609.5). Total num frames: 716944384. Throughput: 0: 5758.4. Samples: 716943100. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:21,479][25689] Avg episode reward: [(0, '-1.378')] [2022-07-10 11:16:22,122][26022] Updated weights on worker 0-0, policy_version 700146 (0.00092) [2022-07-10 11:16:23,998][26022] Updated weights on worker 0-0, policy_version 700156 (0.00089) [2022-07-10 11:16:25,567][26022] Updated weights on worker 0-0, policy_version 700166 (0.00092) [2022-07-10 11:16:26,507][25689] Fps is (10 sec: 5682.5, 60 sec: 5628.8, 300 sec: 5613.0). Total num frames: 716973056. Throughput: 0: 5867.6. Samples: 716977144. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:26,507][25689] Avg episode reward: [(0, '-1.352')] [2022-07-10 11:16:27,208][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:16:27,217][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000700174_716978176.pth [2022-07-10 11:16:27,227][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000698198_714954752.pth [2022-07-10 11:16:27,541][26022] Updated weights on worker 0-0, policy_version 700176 (0.00086) [2022-07-10 11:16:29,236][26022] Updated weights on worker 0-0, policy_version 700186 (0.00092) [2022-07-10 11:16:31,089][26022] Updated weights on worker 0-0, policy_version 700196 (0.00096) [2022-07-10 11:16:31,529][25689] Fps is (10 sec: 5807.7, 60 sec: 5612.5, 300 sec: 5606.8). Total num frames: 717002752. Throughput: 0: 5875.3. Samples: 717010814. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:31,529][25689] Avg episode reward: [(0, '-1.966')] [2022-07-10 11:16:33,202][26022] Updated weights on worker 0-0, policy_version 700206 (0.00085) [2022-07-10 11:16:34,767][26022] Updated weights on worker 0-0, policy_version 700216 (0.00089) [2022-07-10 11:16:36,626][25689] Fps is (10 sec: 5666.8, 60 sec: 5606.0, 300 sec: 5612.4). Total num frames: 717030400. Throughput: 0: 5004.5. Samples: 717027224. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:36,627][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 11:16:36,902][26022] Updated weights on worker 0-0, policy_version 700226 (0.00090) [2022-07-10 11:16:38,590][26022] Updated weights on worker 0-0, policy_version 700236 (0.00097) [2022-07-10 11:16:40,301][26022] Updated weights on worker 0-0, policy_version 700246 (0.00093) [2022-07-10 11:16:41,667][25689] Fps is (10 sec: 5555.3, 60 sec: 5639.7, 300 sec: 5615.4). Total num frames: 717059072. Throughput: 0: 5831.4. Samples: 717060826. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:41,668][25689] Avg episode reward: [(0, '-2.447')] [2022-07-10 11:16:42,326][26022] Updated weights on worker 0-0, policy_version 700256 (0.00087) [2022-07-10 11:16:44,056][26022] Updated weights on worker 0-0, policy_version 700266 (0.00080) [2022-07-10 11:16:45,785][26022] Updated weights on worker 0-0, policy_version 700276 (0.00082) [2022-07-10 11:16:46,711][25689] Fps is (10 sec: 5686.3, 60 sec: 5602.6, 300 sec: 5607.9). Total num frames: 717087744. Throughput: 0: 5826.4. Samples: 717094860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:46,712][25689] Avg episode reward: [(0, '-2.059')] [2022-07-10 11:16:47,859][26022] Updated weights on worker 0-0, policy_version 700286 (0.00087) [2022-07-10 11:16:49,169][26022] Updated weights on worker 0-0, policy_version 700296 (0.00092) [2022-07-10 11:16:51,498][26022] Updated weights on worker 0-0, policy_version 700306 (0.00090) [2022-07-10 11:16:51,726][25689] Fps is (10 sec: 5598.7, 60 sec: 5585.4, 300 sec: 5609.0). Total num frames: 717115392. Throughput: 0: 5012.4. Samples: 717112052. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:51,727][25689] Avg episode reward: [(0, '-1.740')] [2022-07-10 11:16:52,995][26022] Updated weights on worker 0-0, policy_version 700316 (0.00084) [2022-07-10 11:16:55,022][26022] Updated weights on worker 0-0, policy_version 700326 (0.00090) [2022-07-10 11:16:56,707][26022] Updated weights on worker 0-0, policy_version 700336 (0.00084) [2022-07-10 11:16:56,857][25689] Fps is (10 sec: 5550.7, 60 sec: 5603.4, 300 sec: 5610.3). Total num frames: 717144064. Throughput: 0: 5875.3. Samples: 717146088. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:16:56,858][25689] Avg episode reward: [(0, '-2.896')] [2022-07-10 11:16:58,276][26022] Updated weights on worker 0-0, policy_version 700346 (0.00081) [2022-07-10 11:17:00,231][26022] Updated weights on worker 0-0, policy_version 700356 (0.00080) [2022-07-10 11:17:01,918][25689] Fps is (10 sec: 5626.6, 60 sec: 5598.7, 300 sec: 5616.5). Total num frames: 717172736. Throughput: 0: 5894.3. Samples: 717180192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:01,918][25689] Avg episode reward: [(0, '-2.120')] [2022-07-10 11:17:02,365][26022] Updated weights on worker 0-0, policy_version 700366 (0.00089) [2022-07-10 11:17:04,257][26022] Updated weights on worker 0-0, policy_version 700376 (0.00086) [2022-07-10 11:17:06,374][26022] Updated weights on worker 0-0, policy_version 700386 (0.00084) [2022-07-10 11:17:06,927][25689] Fps is (10 sec: 5491.5, 60 sec: 5600.8, 300 sec: 5609.7). Total num frames: 717199360. Throughput: 0: 4934.7. Samples: 717194620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:06,929][25689] Avg episode reward: [(0, '-2.625')] [2022-07-10 11:17:07,933][26022] Updated weights on worker 0-0, policy_version 700396 (0.00089) [2022-07-10 11:17:09,912][26022] Updated weights on worker 0-0, policy_version 700406 (0.00087) [2022-07-10 11:17:11,704][26022] Updated weights on worker 0-0, policy_version 700416 (0.00379) [2022-07-10 11:17:11,937][25689] Fps is (10 sec: 5417.0, 60 sec: 5572.9, 300 sec: 5612.1). Total num frames: 717227008. Throughput: 0: 5748.9. Samples: 717228240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:11,939][25689] Avg episode reward: [(0, '-3.559')] [2022-07-10 11:17:13,623][26022] Updated weights on worker 0-0, policy_version 700426 (0.00091) [2022-07-10 11:17:15,267][26022] Updated weights on worker 0-0, policy_version 700436 (0.00093) [2022-07-10 11:17:16,984][25689] Fps is (10 sec: 5599.9, 60 sec: 5613.0, 300 sec: 5609.2). Total num frames: 717255680. Throughput: 0: 5767.9. Samples: 717262176. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:16,984][25689] Avg episode reward: [(0, '-4.821')] [2022-07-10 11:17:17,082][26022] Updated weights on worker 0-0, policy_version 700446 (0.00092) [2022-07-10 11:17:18,911][26022] Updated weights on worker 0-0, policy_version 700456 (0.00483) [2022-07-10 11:17:20,927][26022] Updated weights on worker 0-0, policy_version 700466 (0.00086) [2022-07-10 11:17:21,995][25689] Fps is (10 sec: 5599.6, 60 sec: 5600.9, 300 sec: 5612.5). Total num frames: 717283328. Throughput: 0: 4921.3. Samples: 717278996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:21,995][25689] Avg episode reward: [(0, '-4.031')] [2022-07-10 11:17:22,344][26022] Updated weights on worker 0-0, policy_version 700476 (0.00425) [2022-07-10 11:17:24,483][26022] Updated weights on worker 0-0, policy_version 700486 (0.00097) [2022-07-10 11:17:26,144][26022] Updated weights on worker 0-0, policy_version 700496 (0.00090) [2022-07-10 11:17:27,009][25689] Fps is (10 sec: 5515.9, 60 sec: 5585.3, 300 sec: 5602.1). Total num frames: 717310976. Throughput: 0: 5895.6. Samples: 717313016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:27,009][25689] Avg episode reward: [(0, '-4.866')] [2022-07-10 11:17:28,009][26022] Updated weights on worker 0-0, policy_version 700506 (0.00087) [2022-07-10 11:17:29,737][26022] Updated weights on worker 0-0, policy_version 700516 (0.00084) [2022-07-10 11:17:31,624][26022] Updated weights on worker 0-0, policy_version 700526 (0.00091) [2022-07-10 11:17:32,032][25689] Fps is (10 sec: 5712.9, 60 sec: 5585.1, 300 sec: 5613.9). Total num frames: 717340672. Throughput: 0: 5913.2. Samples: 717347068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:32,033][25689] Avg episode reward: [(0, '-5.367')] [2022-07-10 11:17:33,416][26022] Updated weights on worker 0-0, policy_version 700536 (0.00094) [2022-07-10 11:17:35,423][26022] Updated weights on worker 0-0, policy_version 700546 (0.00088) [2022-07-10 11:17:37,047][26022] Updated weights on worker 0-0, policy_version 700556 (0.00081) [2022-07-10 11:17:37,157][25689] Fps is (10 sec: 5751.2, 60 sec: 5599.5, 300 sec: 5611.7). Total num frames: 717369344. Throughput: 0: 5039.7. Samples: 717363844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:37,158][25689] Avg episode reward: [(0, '-4.348')] [2022-07-10 11:17:38,871][26022] Updated weights on worker 0-0, policy_version 700566 (0.00095) [2022-07-10 11:17:40,625][26022] Updated weights on worker 0-0, policy_version 700576 (0.00092) [2022-07-10 11:17:42,217][25689] Fps is (10 sec: 5630.2, 60 sec: 5597.7, 300 sec: 5610.6). Total num frames: 717398016. Throughput: 0: 5864.0. Samples: 717397582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:42,219][25689] Avg episode reward: [(0, '-2.593')] [2022-07-10 11:17:42,469][26022] Updated weights on worker 0-0, policy_version 700586 (0.00083) [2022-07-10 11:17:44,453][26022] Updated weights on worker 0-0, policy_version 700596 (0.00090) [2022-07-10 11:17:46,285][26022] Updated weights on worker 0-0, policy_version 700606 (0.00089) [2022-07-10 11:17:47,224][25689] Fps is (10 sec: 5594.8, 60 sec: 5584.2, 300 sec: 5607.3). Total num frames: 717425664. Throughput: 0: 5851.2. Samples: 717431300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:47,226][25689] Avg episode reward: [(0, '-1.561')] [2022-07-10 11:17:47,914][26022] Updated weights on worker 0-0, policy_version 700616 (0.00087) [2022-07-10 11:17:49,887][26022] Updated weights on worker 0-0, policy_version 700626 (0.00090) [2022-07-10 11:17:51,664][26022] Updated weights on worker 0-0, policy_version 700636 (0.00087) [2022-07-10 11:17:52,237][25689] Fps is (10 sec: 5518.7, 60 sec: 5584.5, 300 sec: 5608.6). Total num frames: 717453312. Throughput: 0: 5004.7. Samples: 717448186. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:52,238][25689] Avg episode reward: [(0, '-0.992')] [2022-07-10 11:17:53,584][26022] Updated weights on worker 0-0, policy_version 700646 (0.00081) [2022-07-10 11:17:55,250][26022] Updated weights on worker 0-0, policy_version 700656 (0.00099) [2022-07-10 11:17:57,028][26022] Updated weights on worker 0-0, policy_version 700666 (0.00087) [2022-07-10 11:17:57,343][25689] Fps is (10 sec: 5667.2, 60 sec: 5603.7, 300 sec: 5607.0). Total num frames: 717483008. Throughput: 0: 5858.5. Samples: 717482098. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:17:57,344][25689] Avg episode reward: [(0, '-0.317')] [2022-07-10 11:17:59,317][26022] Updated weights on worker 0-0, policy_version 700677 (0.00092) [2022-07-10 11:18:00,964][26022] Updated weights on worker 0-0, policy_version 700687 (0.00085) [2022-07-10 11:18:02,365][25689] Fps is (10 sec: 5459.5, 60 sec: 5556.4, 300 sec: 5610.1). Total num frames: 717508608. Throughput: 0: 5841.7. Samples: 717515282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:02,367][25689] Avg episode reward: [(0, '-0.821')] [2022-07-10 11:18:03,174][26022] Updated weights on worker 0-0, policy_version 700697 (0.00085) [2022-07-10 11:18:05,031][26022] Updated weights on worker 0-0, policy_version 700707 (0.00100) [2022-07-10 11:18:06,826][26022] Updated weights on worker 0-0, policy_version 700717 (0.00092) [2022-07-10 11:18:07,382][25689] Fps is (10 sec: 5405.7, 60 sec: 5589.5, 300 sec: 5611.7). Total num frames: 717537280. Throughput: 0: 4923.8. Samples: 717530556. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:07,383][25689] Avg episode reward: [(0, '-2.748')] [2022-07-10 11:18:08,619][26022] Updated weights on worker 0-0, policy_version 700727 (0.00084) [2022-07-10 11:18:10,443][26022] Updated weights on worker 0-0, policy_version 700737 (0.00081) [2022-07-10 11:18:12,182][26022] Updated weights on worker 0-0, policy_version 700747 (0.00082) [2022-07-10 11:18:12,399][25689] Fps is (10 sec: 5715.4, 60 sec: 5605.9, 300 sec: 5608.9). Total num frames: 717565952. Throughput: 0: 5756.7. Samples: 717564250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:12,399][25689] Avg episode reward: [(0, '-3.326')] [2022-07-10 11:18:14,111][26022] Updated weights on worker 0-0, policy_version 700757 (0.00088) [2022-07-10 11:18:15,868][26022] Updated weights on worker 0-0, policy_version 700767 (0.00090) [2022-07-10 11:18:17,515][25689] Fps is (10 sec: 5558.4, 60 sec: 5582.6, 300 sec: 5610.4). Total num frames: 717593600. Throughput: 0: 5763.1. Samples: 717598352. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:17,515][25689] Avg episode reward: [(0, '-4.037')] [2022-07-10 11:18:17,710][26022] Updated weights on worker 0-0, policy_version 700777 (0.00087) [2022-07-10 11:18:19,512][26022] Updated weights on worker 0-0, policy_version 700787 (0.00088) [2022-07-10 11:18:21,195][26022] Updated weights on worker 0-0, policy_version 700797 (0.00084) [2022-07-10 11:18:22,531][25689] Fps is (10 sec: 5558.5, 60 sec: 5599.0, 300 sec: 5603.7). Total num frames: 717622272. Throughput: 0: 4961.3. Samples: 717615328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:22,531][25689] Avg episode reward: [(0, '-3.698')] [2022-07-10 11:18:23,226][26022] Updated weights on worker 0-0, policy_version 700807 (0.00081) [2022-07-10 11:18:25,037][26022] Updated weights on worker 0-0, policy_version 700817 (0.00087) [2022-07-10 11:18:26,759][26022] Updated weights on worker 0-0, policy_version 700827 (0.00077) [2022-07-10 11:18:27,320][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:18:27,334][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000700830_717649920.pth [2022-07-10 11:18:27,334][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000698856_715628544.pth [2022-07-10 11:18:27,553][25689] Fps is (10 sec: 5610.4, 60 sec: 5598.2, 300 sec: 5600.1). Total num frames: 717649920. Throughput: 0: 5882.7. Samples: 717649216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:27,554][25689] Avg episode reward: [(0, '-4.754')] [2022-07-10 11:18:28,795][26022] Updated weights on worker 0-0, policy_version 700837 (0.00089) [2022-07-10 11:18:30,537][26022] Updated weights on worker 0-0, policy_version 700847 (0.00084) [2022-07-10 11:18:32,380][26022] Updated weights on worker 0-0, policy_version 700857 (0.00086) [2022-07-10 11:18:32,634][25689] Fps is (10 sec: 5676.1, 60 sec: 5593.0, 300 sec: 5608.3). Total num frames: 717679616. Throughput: 0: 5857.5. Samples: 717682776. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:32,635][25689] Avg episode reward: [(0, '-3.140')] [2022-07-10 11:18:34,097][26022] Updated weights on worker 0-0, policy_version 700867 (0.00091) [2022-07-10 11:18:35,851][26022] Updated weights on worker 0-0, policy_version 700877 (0.00088) [2022-07-10 11:18:37,737][25689] Fps is (10 sec: 5631.0, 60 sec: 5578.1, 300 sec: 5600.7). Total num frames: 717707264. Throughput: 0: 5843.8. Samples: 717716526. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:37,738][25689] Avg episode reward: [(0, '-2.207')] [2022-07-10 11:18:37,842][26022] Updated weights on worker 0-0, policy_version 700887 (0.00089) [2022-07-10 11:18:39,471][26022] Updated weights on worker 0-0, policy_version 700897 (0.00085) [2022-07-10 11:18:41,630][26022] Updated weights on worker 0-0, policy_version 700907 (0.00095) [2022-07-10 11:18:42,822][25689] Fps is (10 sec: 5628.6, 60 sec: 5592.7, 300 sec: 5610.1). Total num frames: 717736960. Throughput: 0: 5821.8. Samples: 717733456. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:42,822][25689] Avg episode reward: [(0, '-1.862')] [2022-07-10 11:18:43,364][26022] Updated weights on worker 0-0, policy_version 700917 (0.00084) [2022-07-10 11:18:45,009][26022] Updated weights on worker 0-0, policy_version 700927 (0.00089) [2022-07-10 11:18:46,920][26022] Updated weights on worker 0-0, policy_version 700937 (0.00090) [2022-07-10 11:18:47,907][25689] Fps is (10 sec: 5739.2, 60 sec: 5602.4, 300 sec: 5605.2). Total num frames: 717765632. Throughput: 0: 5811.9. Samples: 717767510. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:47,908][25689] Avg episode reward: [(0, '-1.278')] [2022-07-10 11:18:48,656][26022] Updated weights on worker 0-0, policy_version 700947 (0.00081) [2022-07-10 11:18:50,438][26022] Updated weights on worker 0-0, policy_version 700957 (0.00092) [2022-07-10 11:18:52,635][26022] Updated weights on worker 0-0, policy_version 700967 (0.00100) [2022-07-10 11:18:52,955][25689] Fps is (10 sec: 5355.9, 60 sec: 5565.4, 300 sec: 5594.7). Total num frames: 717791232. Throughput: 0: 5821.8. Samples: 717801082. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:52,955][25689] Avg episode reward: [(0, '-1.539')] [2022-07-10 11:18:54,002][26022] Updated weights on worker 0-0, policy_version 700977 (0.00086) [2022-07-10 11:18:56,239][26022] Updated weights on worker 0-0, policy_version 700987 (0.00088) [2022-07-10 11:18:57,646][26022] Updated weights on worker 0-0, policy_version 700997 (0.00093) [2022-07-10 11:18:58,040][25689] Fps is (10 sec: 5558.1, 60 sec: 5584.2, 300 sec: 5604.7). Total num frames: 717821952. Throughput: 0: 5001.3. Samples: 717818068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:18:58,041][25689] Avg episode reward: [(0, '-1.102')] [2022-07-10 11:18:59,604][26022] Updated weights on worker 0-0, policy_version 701007 (0.00094) [2022-07-10 11:19:01,932][26022] Updated weights on worker 0-0, policy_version 701017 (0.00087) [2022-07-10 11:19:03,048][25689] Fps is (10 sec: 5681.1, 60 sec: 5602.4, 300 sec: 5602.4). Total num frames: 717848576. Throughput: 0: 5767.9. Samples: 717850122. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:03,049][25689] Avg episode reward: [(0, '-1.309')] [2022-07-10 11:19:03,769][26022] Updated weights on worker 0-0, policy_version 701027 (0.00093) [2022-07-10 11:19:05,642][26022] Updated weights on worker 0-0, policy_version 701037 (0.00084) [2022-07-10 11:19:07,533][26022] Updated weights on worker 0-0, policy_version 701047 (0.00088) [2022-07-10 11:19:08,078][25689] Fps is (10 sec: 5304.7, 60 sec: 5567.5, 300 sec: 5599.6). Total num frames: 717875200. Throughput: 0: 5732.6. Samples: 717883142. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:08,079][25689] Avg episode reward: [(0, '-1.665')] [2022-07-10 11:19:09,072][26022] Updated weights on worker 0-0, policy_version 701057 (0.00090) [2022-07-10 11:19:11,225][26022] Updated weights on worker 0-0, policy_version 701067 (0.00084) [2022-07-10 11:19:12,692][26022] Updated weights on worker 0-0, policy_version 701077 (0.00091) [2022-07-10 11:19:13,099][25689] Fps is (10 sec: 5501.9, 60 sec: 5567.0, 300 sec: 5597.1). Total num frames: 717903872. Throughput: 0: 4910.5. Samples: 717900000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:13,099][25689] Avg episode reward: [(0, '-1.850')] [2022-07-10 11:19:14,719][26022] Updated weights on worker 0-0, policy_version 701087 (0.00082) [2022-07-10 11:19:16,584][26022] Updated weights on worker 0-0, policy_version 701097 (0.00088) [2022-07-10 11:19:18,141][25689] Fps is (10 sec: 5800.5, 60 sec: 5607.7, 300 sec: 5600.2). Total num frames: 717933568. Throughput: 0: 5751.2. Samples: 717933670. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:18,141][25689] Avg episode reward: [(0, '-2.115')] [2022-07-10 11:19:18,147][26022] Updated weights on worker 0-0, policy_version 701107 (0.00090) [2022-07-10 11:19:20,218][26022] Updated weights on worker 0-0, policy_version 701117 (0.00088) [2022-07-10 11:19:21,936][26022] Updated weights on worker 0-0, policy_version 701127 (0.00076) [2022-07-10 11:19:23,182][25689] Fps is (10 sec: 5585.5, 60 sec: 5571.5, 300 sec: 5597.0). Total num frames: 717960192. Throughput: 0: 5839.4. Samples: 717967690. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:23,182][25689] Avg episode reward: [(0, '-2.583')] [2022-07-10 11:19:23,866][26022] Updated weights on worker 0-0, policy_version 701137 (0.00087) [2022-07-10 11:19:25,563][26022] Updated weights on worker 0-0, policy_version 701147 (0.00084) [2022-07-10 11:19:27,268][26022] Updated weights on worker 0-0, policy_version 701157 (0.00074) [2022-07-10 11:19:28,191][25689] Fps is (10 sec: 5501.6, 60 sec: 5589.6, 300 sec: 5593.5). Total num frames: 717988864. Throughput: 0: 5044.5. Samples: 717984604. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:28,192][25689] Avg episode reward: [(0, '-2.680')] [2022-07-10 11:19:29,294][26022] Updated weights on worker 0-0, policy_version 701167 (0.00086) [2022-07-10 11:19:31,013][26022] Updated weights on worker 0-0, policy_version 701177 (0.00087) [2022-07-10 11:19:32,881][26022] Updated weights on worker 0-0, policy_version 701187 (0.00090) [2022-07-10 11:19:33,200][25689] Fps is (10 sec: 5724.2, 60 sec: 5579.4, 300 sec: 5597.6). Total num frames: 718017536. Throughput: 0: 5886.3. Samples: 718018320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:33,200][25689] Avg episode reward: [(0, '-2.569')] [2022-07-10 11:19:34,801][26022] Updated weights on worker 0-0, policy_version 701197 (0.00092) [2022-07-10 11:19:36,593][26022] Updated weights on worker 0-0, policy_version 701207 (0.00088) [2022-07-10 11:19:38,255][25689] Fps is (10 sec: 5698.2, 60 sec: 5600.7, 300 sec: 5600.3). Total num frames: 718046208. Throughput: 0: 5881.1. Samples: 718051966. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:38,257][25689] Avg episode reward: [(0, '-2.648')] [2022-07-10 11:19:38,259][26022] Updated weights on worker 0-0, policy_version 701217 (0.00082) [2022-07-10 11:19:40,211][26022] Updated weights on worker 0-0, policy_version 701227 (0.00085) [2022-07-10 11:19:42,016][26022] Updated weights on worker 0-0, policy_version 701237 (0.00090) [2022-07-10 11:19:43,294][25689] Fps is (10 sec: 5579.4, 60 sec: 5571.1, 300 sec: 5593.3). Total num frames: 718073856. Throughput: 0: 5028.3. Samples: 718068820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:43,294][25689] Avg episode reward: [(0, '-2.784')] [2022-07-10 11:19:44,082][26022] Updated weights on worker 0-0, policy_version 701247 (0.00086) [2022-07-10 11:19:45,434][26022] Updated weights on worker 0-0, policy_version 701257 (0.00085) [2022-07-10 11:19:47,647][26022] Updated weights on worker 0-0, policy_version 701267 (0.00089) [2022-07-10 11:19:48,322][25689] Fps is (10 sec: 5492.6, 60 sec: 5559.4, 300 sec: 5593.3). Total num frames: 718101504. Throughput: 0: 5871.0. Samples: 718102792. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:48,323][25689] Avg episode reward: [(0, '-4.005')] [2022-07-10 11:19:49,101][26022] Updated weights on worker 0-0, policy_version 701277 (0.00089) [2022-07-10 11:19:51,207][26022] Updated weights on worker 0-0, policy_version 701287 (0.00090) [2022-07-10 11:19:52,759][26022] Updated weights on worker 0-0, policy_version 701297 (0.00100) [2022-07-10 11:19:53,327][25689] Fps is (10 sec: 5613.6, 60 sec: 5614.3, 300 sec: 5592.1). Total num frames: 718130176. Throughput: 0: 5886.0. Samples: 718136788. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:53,327][25689] Avg episode reward: [(0, '-3.757')] [2022-07-10 11:19:54,664][26022] Updated weights on worker 0-0, policy_version 701307 (0.00088) [2022-07-10 11:19:56,487][26022] Updated weights on worker 0-0, policy_version 701317 (0.00092) [2022-07-10 11:19:58,343][26022] Updated weights on worker 0-0, policy_version 701327 (0.00084) [2022-07-10 11:19:58,444][25689] Fps is (10 sec: 5665.7, 60 sec: 5577.4, 300 sec: 5590.1). Total num frames: 718158848. Throughput: 0: 5034.9. Samples: 718153614. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:19:58,444][25689] Avg episode reward: [(0, '-4.344')] [2022-07-10 11:20:00,082][26022] Updated weights on worker 0-0, policy_version 701337 (0.00088) [2022-07-10 11:20:02,443][26022] Updated weights on worker 0-0, policy_version 701347 (0.00088) [2022-07-10 11:20:03,460][25689] Fps is (10 sec: 5457.0, 60 sec: 5576.7, 300 sec: 5597.6). Total num frames: 718185472. Throughput: 0: 5883.4. Samples: 718187464. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:03,460][25689] Avg episode reward: [(0, '-4.532')] [2022-07-10 11:20:03,908][26022] Updated weights on worker 0-0, policy_version 701357 (0.00079) [2022-07-10 11:20:06,142][26022] Updated weights on worker 0-0, policy_version 701367 (0.00085) [2022-07-10 11:20:07,810][26022] Updated weights on worker 0-0, policy_version 701377 (0.00092) [2022-07-10 11:20:08,494][25689] Fps is (10 sec: 5400.2, 60 sec: 5593.2, 300 sec: 5587.1). Total num frames: 718213120. Throughput: 0: 5757.9. Samples: 718218936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:08,494][25689] Avg episode reward: [(0, '-3.925')] [2022-07-10 11:20:09,747][26022] Updated weights on worker 0-0, policy_version 701387 (0.00081) [2022-07-10 11:20:11,455][26022] Updated weights on worker 0-0, policy_version 701397 (0.00076) [2022-07-10 11:20:13,347][26022] Updated weights on worker 0-0, policy_version 701407 (0.00092) [2022-07-10 11:20:13,513][25689] Fps is (10 sec: 5602.3, 60 sec: 5593.4, 300 sec: 5592.1). Total num frames: 718241792. Throughput: 0: 4903.7. Samples: 718235774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:13,513][25689] Avg episode reward: [(0, '-4.286')] [2022-07-10 11:20:15,082][26022] Updated weights on worker 0-0, policy_version 701417 (0.00092) [2022-07-10 11:20:17,148][26022] Updated weights on worker 0-0, policy_version 701427 (0.00089) [2022-07-10 11:20:18,590][25689] Fps is (10 sec: 5679.4, 60 sec: 5573.1, 300 sec: 5592.4). Total num frames: 718270464. Throughput: 0: 5750.8. Samples: 718269474. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:18,591][25689] Avg episode reward: [(0, '-2.071')] [2022-07-10 11:20:18,698][26022] Updated weights on worker 0-0, policy_version 701437 (0.00093) [2022-07-10 11:20:20,799][26022] Updated weights on worker 0-0, policy_version 701447 (0.00086) [2022-07-10 11:20:22,444][26022] Updated weights on worker 0-0, policy_version 701457 (0.00093) [2022-07-10 11:20:23,598][25689] Fps is (10 sec: 5584.6, 60 sec: 5593.2, 300 sec: 5597.0). Total num frames: 718298112. Throughput: 0: 5752.3. Samples: 718303304. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:23,598][25689] Avg episode reward: [(0, '-1.012')] [2022-07-10 11:20:24,410][26022] Updated weights on worker 0-0, policy_version 701467 (0.00088) [2022-07-10 11:20:26,063][26022] Updated weights on worker 0-0, policy_version 701477 (0.00091) [2022-07-10 11:20:27,454][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:20:27,469][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000701485_718320640.pth [2022-07-10 11:20:27,469][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000699516_716304384.pth [2022-07-10 11:20:28,172][26022] Updated weights on worker 0-0, policy_version 701487 (0.00092) [2022-07-10 11:20:28,674][25689] Fps is (10 sec: 5382.4, 60 sec: 5553.2, 300 sec: 5582.3). Total num frames: 718324736. Throughput: 0: 5009.6. Samples: 718320030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:28,675][25689] Avg episode reward: [(0, '-1.210')] [2022-07-10 11:20:29,630][26022] Updated weights on worker 0-0, policy_version 701497 (0.00094) [2022-07-10 11:20:31,850][26022] Updated weights on worker 0-0, policy_version 701507 (0.00096) [2022-07-10 11:20:33,364][26022] Updated weights on worker 0-0, policy_version 701517 (0.00088) [2022-07-10 11:20:33,723][25689] Fps is (10 sec: 5663.6, 60 sec: 5583.3, 300 sec: 5592.2). Total num frames: 718355456. Throughput: 0: 5839.6. Samples: 718353792. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:33,723][25689] Avg episode reward: [(0, '-1.467')] [2022-07-10 11:20:35,467][26022] Updated weights on worker 0-0, policy_version 701527 (0.00100) [2022-07-10 11:20:36,952][26022] Updated weights on worker 0-0, policy_version 701537 (0.00096) [2022-07-10 11:20:38,841][25689] Fps is (10 sec: 5640.0, 60 sec: 5543.7, 300 sec: 5590.7). Total num frames: 718382080. Throughput: 0: 5806.7. Samples: 718387062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:38,841][25689] Avg episode reward: [(0, '-1.378')] [2022-07-10 11:20:39,160][26022] Updated weights on worker 0-0, policy_version 701547 (0.00101) [2022-07-10 11:20:40,920][26022] Updated weights on worker 0-0, policy_version 701557 (0.00096) [2022-07-10 11:20:42,700][26022] Updated weights on worker 0-0, policy_version 701567 (0.00088) [2022-07-10 11:20:43,905][25689] Fps is (10 sec: 5329.8, 60 sec: 5541.4, 300 sec: 5579.4). Total num frames: 718409728. Throughput: 0: 5784.7. Samples: 718420778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:43,906][25689] Avg episode reward: [(0, '-1.692')] [2022-07-10 11:20:44,436][26022] Updated weights on worker 0-0, policy_version 701577 (0.00091) [2022-07-10 11:20:46,573][26022] Updated weights on worker 0-0, policy_version 701587 (0.00086) [2022-07-10 11:20:48,156][26022] Updated weights on worker 0-0, policy_version 701597 (0.01234) [2022-07-10 11:20:48,920][25689] Fps is (10 sec: 5689.3, 60 sec: 5576.4, 300 sec: 5582.7). Total num frames: 718439424. Throughput: 0: 5795.3. Samples: 718437366. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:48,921][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 11:20:50,170][26022] Updated weights on worker 0-0, policy_version 701607 (0.00097) [2022-07-10 11:20:51,752][26022] Updated weights on worker 0-0, policy_version 701617 (0.00089) [2022-07-10 11:20:53,633][26022] Updated weights on worker 0-0, policy_version 701627 (0.00087) [2022-07-10 11:20:53,927][25689] Fps is (10 sec: 5722.3, 60 sec: 5559.4, 300 sec: 5585.3). Total num frames: 718467072. Throughput: 0: 5803.8. Samples: 718471052. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:53,927][25689] Avg episode reward: [(0, '-2.831')] [2022-07-10 11:20:55,526][26022] Updated weights on worker 0-0, policy_version 701637 (0.00086) [2022-07-10 11:20:57,275][26022] Updated weights on worker 0-0, policy_version 701647 (0.00086) [2022-07-10 11:20:58,991][25689] Fps is (10 sec: 5694.0, 60 sec: 5581.1, 300 sec: 5587.7). Total num frames: 718496768. Throughput: 0: 5848.5. Samples: 718504910. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:20:58,992][25689] Avg episode reward: [(0, '-2.932')] [2022-07-10 11:20:59,002][26022] Updated weights on worker 0-0, policy_version 701657 (0.00149) [2022-07-10 11:21:00,967][26022] Updated weights on worker 0-0, policy_version 701667 (0.00085) [2022-07-10 11:21:03,103][26022] Updated weights on worker 0-0, policy_version 701677 (0.00084) [2022-07-10 11:21:04,015][25689] Fps is (10 sec: 5379.4, 60 sec: 5546.5, 300 sec: 5581.0). Total num frames: 718521344. Throughput: 0: 4999.5. Samples: 718521318. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:21:04,016][25689] Avg episode reward: [(0, '-2.842')] [2022-07-10 11:21:04,980][26022] Updated weights on worker 0-0, policy_version 701687 (0.00082) [2022-07-10 11:21:06,851][26022] Updated weights on worker 0-0, policy_version 701697 (0.00095) [2022-07-10 11:21:08,604][26022] Updated weights on worker 0-0, policy_version 701707 (0.00089) [2022-07-10 11:21:09,035][25689] Fps is (10 sec: 5301.5, 60 sec: 5564.7, 300 sec: 5578.5). Total num frames: 718550016. Throughput: 0: 5764.5. Samples: 718553318. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:21:09,036][25689] Avg episode reward: [(0, '-2.901')] [2022-07-10 11:21:10,585][26022] Updated weights on worker 0-0, policy_version 701717 (0.00080) [2022-07-10 11:21:12,282][26022] Updated weights on worker 0-0, policy_version 701727 (0.00089) [2022-07-10 11:21:14,065][25689] Fps is (10 sec: 5604.4, 60 sec: 5546.8, 300 sec: 5583.6). Total num frames: 718577664. Throughput: 0: 5765.1. Samples: 718587150. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 11:21:14,066][25689] Avg episode reward: [(0, '-2.325')] [2022-07-10 11:21:14,219][26022] Updated weights on worker 0-0, policy_version 701737 (0.00082) [2022-07-10 11:21:15,870][26022] Updated weights on worker 0-0, policy_version 701747 (0.00093) [2022-07-10 11:21:17,735][26022] Updated weights on worker 0-0, policy_version 701757 (0.00078) [2022-07-10 11:21:19,132][25689] Fps is (10 sec: 5578.0, 60 sec: 5547.8, 300 sec: 5583.5). Total num frames: 718606336. Throughput: 0: 4931.1. Samples: 718604226. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:19,133][25689] Avg episode reward: [(0, '-2.042')] [2022-07-10 11:21:19,354][26022] Updated weights on worker 0-0, policy_version 701767 (0.00087) [2022-07-10 11:21:21,379][26022] Updated weights on worker 0-0, policy_version 701777 (0.00089) [2022-07-10 11:21:23,206][26022] Updated weights on worker 0-0, policy_version 701787 (0.00087) [2022-07-10 11:21:24,160][25689] Fps is (10 sec: 5680.3, 60 sec: 5562.8, 300 sec: 5583.5). Total num frames: 718635008. Throughput: 0: 5801.2. Samples: 718638180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:24,160][25689] Avg episode reward: [(0, '-2.903')] [2022-07-10 11:21:24,925][26022] Updated weights on worker 0-0, policy_version 701797 (0.00087) [2022-07-10 11:21:26,733][26022] Updated weights on worker 0-0, policy_version 701807 (0.00082) [2022-07-10 11:21:28,618][26022] Updated weights on worker 0-0, policy_version 701817 (0.00095) [2022-07-10 11:21:29,199][25689] Fps is (10 sec: 5798.2, 60 sec: 5617.0, 300 sec: 5583.2). Total num frames: 718664704. Throughput: 0: 5896.3. Samples: 718672208. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:29,199][25689] Avg episode reward: [(0, '-3.924')] [2022-07-10 11:21:30,481][26022] Updated weights on worker 0-0, policy_version 701827 (0.00082) [2022-07-10 11:21:32,196][26022] Updated weights on worker 0-0, policy_version 701837 (0.00087) [2022-07-10 11:21:34,117][26022] Updated weights on worker 0-0, policy_version 701847 (0.00083) [2022-07-10 11:21:34,202][25689] Fps is (10 sec: 5710.5, 60 sec: 5570.5, 300 sec: 5585.0). Total num frames: 718692352. Throughput: 0: 5069.5. Samples: 718689234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:34,202][25689] Avg episode reward: [(0, '-3.549')] [2022-07-10 11:21:35,703][26022] Updated weights on worker 0-0, policy_version 701857 (0.00087) [2022-07-10 11:21:37,642][26022] Updated weights on worker 0-0, policy_version 701867 (0.00083) [2022-07-10 11:21:39,247][25689] Fps is (10 sec: 5604.9, 60 sec: 5611.1, 300 sec: 5584.9). Total num frames: 718721024. Throughput: 0: 5936.1. Samples: 718723630. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:39,248][25689] Avg episode reward: [(0, '-4.090')] [2022-07-10 11:21:39,345][26022] Updated weights on worker 0-0, policy_version 701877 (0.00081) [2022-07-10 11:21:41,207][26022] Updated weights on worker 0-0, policy_version 701887 (0.00097) [2022-07-10 11:21:43,027][26022] Updated weights on worker 0-0, policy_version 701897 (0.00088) [2022-07-10 11:21:44,343][25689] Fps is (10 sec: 5553.5, 60 sec: 5608.2, 300 sec: 5580.5). Total num frames: 718748672. Throughput: 0: 5900.5. Samples: 718757270. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:44,344][25689] Avg episode reward: [(0, '-5.809')] [2022-07-10 11:21:45,055][26022] Updated weights on worker 0-0, policy_version 701907 (0.00094) [2022-07-10 11:21:46,565][26022] Updated weights on worker 0-0, policy_version 701917 (0.00088) [2022-07-10 11:21:48,578][26022] Updated weights on worker 0-0, policy_version 701927 (0.00088) [2022-07-10 11:21:49,364][25689] Fps is (10 sec: 5567.1, 60 sec: 5590.7, 300 sec: 5583.8). Total num frames: 718777344. Throughput: 0: 5060.8. Samples: 718774258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:49,364][25689] Avg episode reward: [(0, '-4.615')] [2022-07-10 11:21:50,150][26022] Updated weights on worker 0-0, policy_version 701937 (0.00082) [2022-07-10 11:21:52,047][26022] Updated weights on worker 0-0, policy_version 701947 (0.00081) [2022-07-10 11:21:53,823][26022] Updated weights on worker 0-0, policy_version 701957 (0.00084) [2022-07-10 11:21:54,411][25689] Fps is (10 sec: 5797.6, 60 sec: 5620.8, 300 sec: 5588.8). Total num frames: 718807040. Throughput: 0: 5902.4. Samples: 718808516. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:54,411][25689] Avg episode reward: [(0, '-5.019')] [2022-07-10 11:21:55,682][26022] Updated weights on worker 0-0, policy_version 701967 (0.00082) [2022-07-10 11:21:57,583][26022] Updated weights on worker 0-0, policy_version 701977 (0.00085) [2022-07-10 11:21:59,533][25689] Fps is (10 sec: 5538.2, 60 sec: 5564.7, 300 sec: 5580.8). Total num frames: 718833664. Throughput: 0: 5847.6. Samples: 718842254. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:21:59,533][25689] Avg episode reward: [(0, '-4.247')] [2022-07-10 11:21:59,562][26022] Updated weights on worker 0-0, policy_version 701987 (0.00764) [2022-07-10 11:22:01,151][26022] Updated weights on worker 0-0, policy_version 701997 (0.00089) [2022-07-10 11:22:03,318][26022] Updated weights on worker 0-0, policy_version 702007 (0.00087) [2022-07-10 11:22:04,541][25689] Fps is (10 sec: 5458.6, 60 sec: 5633.9, 300 sec: 5587.7). Total num frames: 718862336. Throughput: 0: 4986.0. Samples: 718857976. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:04,543][25689] Avg episode reward: [(0, '-4.098')] [2022-07-10 11:22:05,180][26022] Updated weights on worker 0-0, policy_version 702017 (0.00088) [2022-07-10 11:22:06,894][26022] Updated weights on worker 0-0, policy_version 702027 (0.00094) [2022-07-10 11:22:09,034][26022] Updated weights on worker 0-0, policy_version 702037 (0.00090) [2022-07-10 11:22:09,563][25689] Fps is (10 sec: 5513.1, 60 sec: 5599.8, 300 sec: 5584.0). Total num frames: 718888960. Throughput: 0: 5787.9. Samples: 718891170. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:09,564][25689] Avg episode reward: [(0, '-3.806')] [2022-07-10 11:22:10,498][26022] Updated weights on worker 0-0, policy_version 702047 (0.00089) [2022-07-10 11:22:12,507][26022] Updated weights on worker 0-0, policy_version 702057 (0.00081) [2022-07-10 11:22:14,143][26022] Updated weights on worker 0-0, policy_version 702067 (0.00055) [2022-07-10 11:22:14,594][25689] Fps is (10 sec: 5398.5, 60 sec: 5599.7, 300 sec: 5580.9). Total num frames: 718916608. Throughput: 0: 5768.7. Samples: 718924946. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:14,595][25689] Avg episode reward: [(0, '-2.108')] [2022-07-10 11:22:15,941][26022] Updated weights on worker 0-0, policy_version 702077 (0.00081) [2022-07-10 11:22:18,154][26022] Updated weights on worker 0-0, policy_version 702087 (0.00087) [2022-07-10 11:22:19,661][25689] Fps is (10 sec: 5678.7, 60 sec: 5616.6, 300 sec: 5586.7). Total num frames: 718946304. Throughput: 0: 4942.1. Samples: 718941728. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:19,663][25689] Avg episode reward: [(0, '-1.430')] [2022-07-10 11:22:19,674][26022] Updated weights on worker 0-0, policy_version 702097 (0.00090) [2022-07-10 11:22:21,666][26022] Updated weights on worker 0-0, policy_version 702107 (0.00082) [2022-07-10 11:22:23,331][26022] Updated weights on worker 0-0, policy_version 702117 (0.00083) [2022-07-10 11:22:24,692][25689] Fps is (10 sec: 5678.6, 60 sec: 5599.4, 300 sec: 5586.4). Total num frames: 718973952. Throughput: 0: 5836.6. Samples: 718975592. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:24,693][25689] Avg episode reward: [(0, '-1.583')] [2022-07-10 11:22:25,178][26022] Updated weights on worker 0-0, policy_version 702127 (0.00113) [2022-07-10 11:22:27,010][26022] Updated weights on worker 0-0, policy_version 702137 (0.00089) [2022-07-10 11:22:27,486][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:22:27,511][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000702139_718990336.pth [2022-07-10 11:22:27,512][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000700174_716978176.pth [2022-07-10 11:22:28,934][26022] Updated weights on worker 0-0, policy_version 702147 (0.00083) [2022-07-10 11:22:29,721][25689] Fps is (10 sec: 5700.1, 60 sec: 5600.3, 300 sec: 5586.3). Total num frames: 719003648. Throughput: 0: 5862.2. Samples: 719009342. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:29,722][25689] Avg episode reward: [(0, '-3.052')] [2022-07-10 11:22:30,547][26022] Updated weights on worker 0-0, policy_version 702157 (0.00087) [2022-07-10 11:22:32,641][26022] Updated weights on worker 0-0, policy_version 702167 (0.00087) [2022-07-10 11:22:34,059][26022] Updated weights on worker 0-0, policy_version 702177 (0.00085) [2022-07-10 11:22:34,786][25689] Fps is (10 sec: 5681.1, 60 sec: 5594.6, 300 sec: 5584.0). Total num frames: 719031296. Throughput: 0: 5020.5. Samples: 719026320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:34,786][25689] Avg episode reward: [(0, '-4.001')] [2022-07-10 11:22:36,161][26022] Updated weights on worker 0-0, policy_version 702187 (0.00081) [2022-07-10 11:22:37,791][26022] Updated weights on worker 0-0, policy_version 702197 (0.00090) [2022-07-10 11:22:39,731][26022] Updated weights on worker 0-0, policy_version 702207 (0.00090) [2022-07-10 11:22:39,883][25689] Fps is (10 sec: 5542.5, 60 sec: 5589.8, 300 sec: 5583.3). Total num frames: 719059968. Throughput: 0: 5869.7. Samples: 719060424. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:39,883][25689] Avg episode reward: [(0, '-3.769')] [2022-07-10 11:22:41,694][26022] Updated weights on worker 0-0, policy_version 702217 (0.00089) [2022-07-10 11:22:43,378][26022] Updated weights on worker 0-0, policy_version 702227 (0.00085) [2022-07-10 11:22:44,923][25689] Fps is (10 sec: 5657.1, 60 sec: 5611.9, 300 sec: 5586.1). Total num frames: 719088640. Throughput: 0: 5863.3. Samples: 719094210. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:44,923][25689] Avg episode reward: [(0, '-4.414')] [2022-07-10 11:22:45,110][26022] Updated weights on worker 0-0, policy_version 702237 (0.00097) [2022-07-10 11:22:47,043][26022] Updated weights on worker 0-0, policy_version 702247 (0.00087) [2022-07-10 11:22:48,649][26022] Updated weights on worker 0-0, policy_version 702257 (0.00099) [2022-07-10 11:22:49,935][25689] Fps is (10 sec: 5602.8, 60 sec: 5595.8, 300 sec: 5586.1). Total num frames: 719116288. Throughput: 0: 5023.1. Samples: 719110876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:49,935][25689] Avg episode reward: [(0, '-3.933')] [2022-07-10 11:22:50,722][26022] Updated weights on worker 0-0, policy_version 702267 (0.00089) [2022-07-10 11:22:52,459][26022] Updated weights on worker 0-0, policy_version 702277 (0.00085) [2022-07-10 11:22:54,418][26022] Updated weights on worker 0-0, policy_version 702287 (0.00608) [2022-07-10 11:22:54,938][25689] Fps is (10 sec: 5725.5, 60 sec: 5599.8, 300 sec: 5588.0). Total num frames: 719145984. Throughput: 0: 5882.2. Samples: 719144860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:22:54,939][25689] Avg episode reward: [(0, '-3.364')] [2022-07-10 11:22:56,104][26022] Updated weights on worker 0-0, policy_version 702297 (0.00093) [2022-07-10 11:22:57,938][26022] Updated weights on worker 0-0, policy_version 702307 (0.00075) [2022-07-10 11:22:59,673][26022] Updated weights on worker 0-0, policy_version 702317 (0.00094) [2022-07-10 11:23:00,027][25689] Fps is (10 sec: 5681.8, 60 sec: 5619.8, 300 sec: 5593.7). Total num frames: 719173632. Throughput: 0: 5877.3. Samples: 719178820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:00,028][25689] Avg episode reward: [(0, '-3.772')] [2022-07-10 11:23:01,763][26022] Updated weights on worker 0-0, policy_version 702327 (0.00088) [2022-07-10 11:23:03,816][26022] Updated weights on worker 0-0, policy_version 702337 (0.00085) [2022-07-10 11:23:05,049][25689] Fps is (10 sec: 5266.3, 60 sec: 5567.7, 300 sec: 5583.3). Total num frames: 719199232. Throughput: 0: 4949.7. Samples: 719193830. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:05,050][25689] Avg episode reward: [(0, '-3.059')] [2022-07-10 11:23:05,491][26022] Updated weights on worker 0-0, policy_version 702347 (0.00087) [2022-07-10 11:23:07,323][26022] Updated weights on worker 0-0, policy_version 702357 (0.00081) [2022-07-10 11:23:09,103][26022] Updated weights on worker 0-0, policy_version 702367 (0.00086) [2022-07-10 11:23:10,089][25689] Fps is (10 sec: 5495.5, 60 sec: 5616.8, 300 sec: 5586.3). Total num frames: 719228928. Throughput: 0: 5788.3. Samples: 719227538. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:10,090][25689] Avg episode reward: [(0, '-2.905')] [2022-07-10 11:23:10,972][26022] Updated weights on worker 0-0, policy_version 702377 (0.00096) [2022-07-10 11:23:12,648][26022] Updated weights on worker 0-0, policy_version 702387 (0.00079) [2022-07-10 11:23:14,610][26022] Updated weights on worker 0-0, policy_version 702397 (0.00097) [2022-07-10 11:23:15,121][25689] Fps is (10 sec: 5591.8, 60 sec: 5599.9, 300 sec: 5584.4). Total num frames: 719255552. Throughput: 0: 5779.3. Samples: 719261504. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:15,122][25689] Avg episode reward: [(0, '-2.284')] [2022-07-10 11:23:16,391][26022] Updated weights on worker 0-0, policy_version 702407 (0.00084) [2022-07-10 11:23:18,212][26022] Updated weights on worker 0-0, policy_version 702417 (0.00088) [2022-07-10 11:23:19,971][26022] Updated weights on worker 0-0, policy_version 702427 (0.00089) [2022-07-10 11:23:20,216][25689] Fps is (10 sec: 5561.7, 60 sec: 5597.3, 300 sec: 5586.4). Total num frames: 719285248. Throughput: 0: 5776.1. Samples: 719295430. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:20,219][25689] Avg episode reward: [(0, '-2.436')] [2022-07-10 11:23:21,720][26022] Updated weights on worker 0-0, policy_version 702437 (0.00085) [2022-07-10 11:23:23,732][26022] Updated weights on worker 0-0, policy_version 702447 (0.00095) [2022-07-10 11:23:25,301][25689] Fps is (10 sec: 5733.7, 60 sec: 5609.2, 300 sec: 5588.6). Total num frames: 719313920. Throughput: 0: 5848.7. Samples: 719312276. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:25,301][25689] Avg episode reward: [(0, '-1.919')] [2022-07-10 11:23:25,498][26022] Updated weights on worker 0-0, policy_version 702457 (0.00089) [2022-07-10 11:23:27,357][26022] Updated weights on worker 0-0, policy_version 702467 (0.00084) [2022-07-10 11:23:29,136][26022] Updated weights on worker 0-0, policy_version 702477 (0.00086) [2022-07-10 11:23:30,350][25689] Fps is (10 sec: 5658.6, 60 sec: 5590.5, 300 sec: 5585.8). Total num frames: 719342592. Throughput: 0: 5839.4. Samples: 719345846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:30,350][25689] Avg episode reward: [(0, '-2.314')] [2022-07-10 11:23:31,139][26022] Updated weights on worker 0-0, policy_version 702487 (0.00082) [2022-07-10 11:23:32,822][26022] Updated weights on worker 0-0, policy_version 702497 (0.00081) [2022-07-10 11:23:34,708][26022] Updated weights on worker 0-0, policy_version 702507 (0.00089) [2022-07-10 11:23:35,420][25689] Fps is (10 sec: 5666.8, 60 sec: 5606.9, 300 sec: 5589.8). Total num frames: 719371264. Throughput: 0: 5836.9. Samples: 719379988. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:35,421][25689] Avg episode reward: [(0, '-3.052')] [2022-07-10 11:23:36,310][26022] Updated weights on worker 0-0, policy_version 702517 (0.00089) [2022-07-10 11:23:38,095][26022] Updated weights on worker 0-0, policy_version 702527 (0.00083) [2022-07-10 11:23:40,018][26022] Updated weights on worker 0-0, policy_version 702537 (0.00091) [2022-07-10 11:23:40,479][25689] Fps is (10 sec: 5661.3, 60 sec: 5610.4, 300 sec: 5586.9). Total num frames: 719399936. Throughput: 0: 5024.9. Samples: 719397250. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:40,479][25689] Avg episode reward: [(0, '-3.646')] [2022-07-10 11:23:41,810][26022] Updated weights on worker 0-0, policy_version 702547 (0.00094) [2022-07-10 11:23:43,612][26022] Updated weights on worker 0-0, policy_version 702557 (0.00083) [2022-07-10 11:23:45,482][25689] Fps is (10 sec: 5597.6, 60 sec: 5596.9, 300 sec: 5585.0). Total num frames: 719427584. Throughput: 0: 5889.8. Samples: 719431138. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:45,482][25689] Avg episode reward: [(0, '-3.792')] [2022-07-10 11:23:45,537][26022] Updated weights on worker 0-0, policy_version 702567 (0.00093) [2022-07-10 11:23:47,295][26022] Updated weights on worker 0-0, policy_version 702577 (0.00089) [2022-07-10 11:23:49,005][26022] Updated weights on worker 0-0, policy_version 702587 (0.00086) [2022-07-10 11:23:50,487][25689] Fps is (10 sec: 5729.8, 60 sec: 5631.4, 300 sec: 5599.6). Total num frames: 719457280. Throughput: 0: 5945.8. Samples: 719465578. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:50,487][25689] Avg episode reward: [(0, '-3.852')] [2022-07-10 11:23:50,827][26022] Updated weights on worker 0-0, policy_version 702597 (0.00095) [2022-07-10 11:23:52,637][26022] Updated weights on worker 0-0, policy_version 702607 (0.00088) [2022-07-10 11:23:54,511][26022] Updated weights on worker 0-0, policy_version 702617 (0.00083) [2022-07-10 11:23:55,491][25689] Fps is (10 sec: 5729.3, 60 sec: 5597.5, 300 sec: 5590.8). Total num frames: 719484928. Throughput: 0: 5093.0. Samples: 719482206. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:23:55,491][25689] Avg episode reward: [(0, '-3.733')] [2022-07-10 11:23:56,215][26022] Updated weights on worker 0-0, policy_version 702627 (0.00094) [2022-07-10 11:23:57,989][26022] Updated weights on worker 0-0, policy_version 702637 (0.00086) [2022-07-10 11:23:59,931][26022] Updated weights on worker 0-0, policy_version 702647 (0.00092) [2022-07-10 11:24:00,562][25689] Fps is (10 sec: 5589.8, 60 sec: 5616.1, 300 sec: 5596.5). Total num frames: 719513600. Throughput: 0: 5927.9. Samples: 719516304. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:00,563][25689] Avg episode reward: [(0, '-2.254')] [2022-07-10 11:24:02,159][26022] Updated weights on worker 0-0, policy_version 702657 (0.00081) [2022-07-10 11:24:03,897][26022] Updated weights on worker 0-0, policy_version 702667 (0.00090) [2022-07-10 11:24:05,587][25689] Fps is (10 sec: 5477.0, 60 sec: 5632.7, 300 sec: 5596.6). Total num frames: 719540224. Throughput: 0: 5819.3. Samples: 719548134. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:05,587][25689] Avg episode reward: [(0, '-2.434')] [2022-07-10 11:24:05,658][26022] Updated weights on worker 0-0, policy_version 702677 (0.00090) [2022-07-10 11:24:07,694][26022] Updated weights on worker 0-0, policy_version 702687 (0.00085) [2022-07-10 11:24:09,339][26022] Updated weights on worker 0-0, policy_version 702697 (0.00087) [2022-07-10 11:24:10,596][25689] Fps is (10 sec: 5511.3, 60 sec: 5618.7, 300 sec: 5596.8). Total num frames: 719568896. Throughput: 0: 4954.3. Samples: 719565204. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:10,596][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 11:24:10,947][26022] Updated weights on worker 0-0, policy_version 702707 (0.00091) [2022-07-10 11:24:13,008][26022] Updated weights on worker 0-0, policy_version 702717 (0.00085) [2022-07-10 11:24:14,615][26022] Updated weights on worker 0-0, policy_version 702727 (0.00091) [2022-07-10 11:24:15,613][25689] Fps is (10 sec: 5617.5, 60 sec: 5637.0, 300 sec: 5590.4). Total num frames: 719596544. Throughput: 0: 5817.8. Samples: 719599272. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:15,613][25689] Avg episode reward: [(0, '-0.407')] [2022-07-10 11:24:16,666][26022] Updated weights on worker 0-0, policy_version 702737 (0.00087) [2022-07-10 11:24:18,302][26022] Updated weights on worker 0-0, policy_version 702747 (0.00086) [2022-07-10 11:24:20,220][26022] Updated weights on worker 0-0, policy_version 702757 (0.00085) [2022-07-10 11:24:20,656][25689] Fps is (10 sec: 5598.3, 60 sec: 5624.9, 300 sec: 5597.2). Total num frames: 719625216. Throughput: 0: 5831.1. Samples: 719633472. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:20,656][25689] Avg episode reward: [(0, '-0.688')] [2022-07-10 11:24:22,084][26022] Updated weights on worker 0-0, policy_version 702767 (0.00087) [2022-07-10 11:24:23,799][26022] Updated weights on worker 0-0, policy_version 702777 (0.00082) [2022-07-10 11:24:25,665][25689] Fps is (10 sec: 5602.4, 60 sec: 5615.0, 300 sec: 5593.8). Total num frames: 719652864. Throughput: 0: 5095.5. Samples: 719650446. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:25,666][25689] Avg episode reward: [(0, '-0.944')] [2022-07-10 11:24:25,747][26022] Updated weights on worker 0-0, policy_version 702787 (0.00081) [2022-07-10 11:24:27,230][26022] Updated weights on worker 0-0, policy_version 702797 (0.00091) [2022-07-10 11:24:27,539][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:24:27,551][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000702799_719666176.pth [2022-07-10 11:24:27,551][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000700830_717649920.pth [2022-07-10 11:24:29,030][26022] Updated weights on worker 0-0, policy_version 702807 (0.00066) [2022-07-10 11:24:30,683][25689] Fps is (10 sec: 5820.8, 60 sec: 5651.8, 300 sec: 5600.5). Total num frames: 719683584. Throughput: 0: 5960.3. Samples: 719684932. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:30,684][25689] Avg episode reward: [(0, '-1.637')] [2022-07-10 11:24:30,830][26022] Updated weights on worker 0-0, policy_version 702817 (0.00091) [2022-07-10 11:24:32,673][26022] Updated weights on worker 0-0, policy_version 702827 (0.00083) [2022-07-10 11:24:34,714][26022] Updated weights on worker 0-0, policy_version 702837 (0.00084) [2022-07-10 11:24:35,723][25689] Fps is (10 sec: 5803.6, 60 sec: 5637.7, 300 sec: 5597.3). Total num frames: 719711232. Throughput: 0: 5958.0. Samples: 719719088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 11:24:35,723][25689] Avg episode reward: [(0, '-1.394')] [2022-07-10 11:24:36,306][26022] Updated weights on worker 0-0, policy_version 702847 (0.00083) [2022-07-10 11:24:38,207][26022] Updated weights on worker 0-0, policy_version 702857 (0.00082) [2022-07-10 11:24:39,935][26022] Updated weights on worker 0-0, policy_version 702867 (0.00085) [2022-07-10 11:24:40,778][25689] Fps is (10 sec: 5680.6, 60 sec: 5655.0, 300 sec: 5603.9). Total num frames: 719740928. Throughput: 0: 5093.2. Samples: 719735958. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:24:40,779][25689] Avg episode reward: [(0, '-1.746')] [2022-07-10 11:24:41,889][26022] Updated weights on worker 0-0, policy_version 702877 (0.00091) [2022-07-10 11:24:43,605][26022] Updated weights on worker 0-0, policy_version 702887 (0.00083) [2022-07-10 11:24:45,444][26022] Updated weights on worker 0-0, policy_version 702897 (0.00088) [2022-07-10 11:24:45,789][25689] Fps is (10 sec: 5696.8, 60 sec: 5654.3, 300 sec: 5604.2). Total num frames: 719768576. Throughput: 0: 5935.8. Samples: 719769894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:24:45,789][25689] Avg episode reward: [(0, '-1.679')] [2022-07-10 11:24:47,157][26022] Updated weights on worker 0-0, policy_version 702907 (0.00085) [2022-07-10 11:24:49,197][26022] Updated weights on worker 0-0, policy_version 702917 (0.00090) [2022-07-10 11:24:50,699][26022] Updated weights on worker 0-0, policy_version 702927 (0.00095) [2022-07-10 11:24:50,855][25689] Fps is (10 sec: 5589.1, 60 sec: 5631.6, 300 sec: 5603.1). Total num frames: 719797248. Throughput: 0: 5905.6. Samples: 719804058. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:24:50,855][25689] Avg episode reward: [(0, '-1.547')] [2022-07-10 11:24:52,709][26022] Updated weights on worker 0-0, policy_version 702937 (0.00085) [2022-07-10 11:24:54,282][26022] Updated weights on worker 0-0, policy_version 702947 (0.00095) [2022-07-10 11:24:55,886][25689] Fps is (10 sec: 5577.8, 60 sec: 5629.1, 300 sec: 5601.3). Total num frames: 719824896. Throughput: 0: 5057.5. Samples: 719821062. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:24:55,887][25689] Avg episode reward: [(0, '-0.727')] [2022-07-10 11:24:56,239][26022] Updated weights on worker 0-0, policy_version 702957 (0.00084) [2022-07-10 11:24:58,275][26022] Updated weights on worker 0-0, policy_version 702967 (0.00087) [2022-07-10 11:24:59,890][26022] Updated weights on worker 0-0, policy_version 702977 (0.00097) [2022-07-10 11:25:01,003][25689] Fps is (10 sec: 5550.0, 60 sec: 5624.9, 300 sec: 5606.2). Total num frames: 719853568. Throughput: 0: 5879.9. Samples: 719854876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:01,003][25689] Avg episode reward: [(0, '-2.255')] [2022-07-10 11:25:02,185][26022] Updated weights on worker 0-0, policy_version 702987 (0.00088) [2022-07-10 11:25:04,130][26022] Updated weights on worker 0-0, policy_version 702997 (0.00086) [2022-07-10 11:25:05,846][26022] Updated weights on worker 0-0, policy_version 703007 (0.00085) [2022-07-10 11:25:06,056][25689] Fps is (10 sec: 5537.6, 60 sec: 5639.1, 300 sec: 5605.9). Total num frames: 719881216. Throughput: 0: 5748.8. Samples: 719886410. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:06,057][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 11:25:07,764][26022] Updated weights on worker 0-0, policy_version 703017 (0.00086) [2022-07-10 11:25:09,386][26022] Updated weights on worker 0-0, policy_version 703027 (0.00113) [2022-07-10 11:25:11,098][25689] Fps is (10 sec: 5375.8, 60 sec: 5602.1, 300 sec: 5598.6). Total num frames: 719907840. Throughput: 0: 4894.9. Samples: 719903146. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:11,099][25689] Avg episode reward: [(0, '-1.352')] [2022-07-10 11:25:11,266][26022] Updated weights on worker 0-0, policy_version 703037 (0.00092) [2022-07-10 11:25:13,283][26022] Updated weights on worker 0-0, policy_version 703047 (0.00099) [2022-07-10 11:25:14,884][26022] Updated weights on worker 0-0, policy_version 703057 (0.00095) [2022-07-10 11:25:16,156][25689] Fps is (10 sec: 5475.2, 60 sec: 5615.3, 300 sec: 5598.9). Total num frames: 719936512. Throughput: 0: 5705.4. Samples: 719936710. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:16,156][25689] Avg episode reward: [(0, '-1.486')] [2022-07-10 11:25:16,983][26022] Updated weights on worker 0-0, policy_version 703067 (0.00094) [2022-07-10 11:25:18,813][26022] Updated weights on worker 0-0, policy_version 703077 (0.00095) [2022-07-10 11:25:20,290][26022] Updated weights on worker 0-0, policy_version 703087 (0.00086) [2022-07-10 11:25:21,217][25689] Fps is (10 sec: 5667.1, 60 sec: 5613.6, 300 sec: 5601.4). Total num frames: 719965184. Throughput: 0: 5704.2. Samples: 719970182. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:21,217][25689] Avg episode reward: [(0, '-1.871')] [2022-07-10 11:25:22,432][26022] Updated weights on worker 0-0, policy_version 703097 (0.00086) [2022-07-10 11:25:23,903][26022] Updated weights on worker 0-0, policy_version 703107 (0.00082) [2022-07-10 11:25:25,889][26022] Updated weights on worker 0-0, policy_version 703117 (0.00100) [2022-07-10 11:25:26,241][25689] Fps is (10 sec: 5584.3, 60 sec: 5612.3, 300 sec: 5605.8). Total num frames: 719992832. Throughput: 0: 4986.7. Samples: 719987066. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:26,242][25689] Avg episode reward: [(0, '-2.049')] [2022-07-10 11:25:27,704][26022] Updated weights on worker 0-0, policy_version 703127 (0.00090) [2022-07-10 11:25:29,470][26022] Updated weights on worker 0-0, policy_version 703137 (0.00086) [2022-07-10 11:25:31,252][25689] Fps is (10 sec: 5510.3, 60 sec: 5562.2, 300 sec: 5596.2). Total num frames: 720020480. Throughput: 0: 5846.5. Samples: 720020974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:31,252][25689] Avg episode reward: [(0, '-0.371')] [2022-07-10 11:25:31,430][26022] Updated weights on worker 0-0, policy_version 703147 (0.00083) [2022-07-10 11:25:33,345][26022] Updated weights on worker 0-0, policy_version 703157 (0.00120) [2022-07-10 11:25:34,973][26022] Updated weights on worker 0-0, policy_version 703167 (0.00083) [2022-07-10 11:25:36,317][25689] Fps is (10 sec: 5589.7, 60 sec: 5576.8, 300 sec: 5604.1). Total num frames: 720049152. Throughput: 0: 5865.2. Samples: 720054960. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:36,318][25689] Avg episode reward: [(0, '-0.625')] [2022-07-10 11:25:37,082][26022] Updated weights on worker 0-0, policy_version 703177 (0.00084) [2022-07-10 11:25:38,395][26022] Updated weights on worker 0-0, policy_version 703187 (0.00094) [2022-07-10 11:25:40,586][26022] Updated weights on worker 0-0, policy_version 703197 (0.00080) [2022-07-10 11:25:41,370][25689] Fps is (10 sec: 5869.8, 60 sec: 5593.9, 300 sec: 5614.6). Total num frames: 720079872. Throughput: 0: 5041.9. Samples: 720071794. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:41,371][25689] Avg episode reward: [(0, '-0.571')] [2022-07-10 11:25:42,279][26022] Updated weights on worker 0-0, policy_version 703207 (0.00085) [2022-07-10 11:25:44,207][26022] Updated weights on worker 0-0, policy_version 703217 (0.00095) [2022-07-10 11:25:45,910][26022] Updated weights on worker 0-0, policy_version 703227 (0.00088) [2022-07-10 11:25:46,396][25689] Fps is (10 sec: 5689.4, 60 sec: 5575.6, 300 sec: 5604.1). Total num frames: 720106496. Throughput: 0: 5869.6. Samples: 720105366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:46,398][25689] Avg episode reward: [(0, '-0.990')] [2022-07-10 11:25:47,808][26022] Updated weights on worker 0-0, policy_version 703237 (0.00084) [2022-07-10 11:25:49,581][26022] Updated weights on worker 0-0, policy_version 703247 (0.00088) [2022-07-10 11:25:51,482][25689] Fps is (10 sec: 5367.3, 60 sec: 5556.9, 300 sec: 5602.6). Total num frames: 720134144. Throughput: 0: 5833.0. Samples: 720138974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:51,484][25689] Avg episode reward: [(0, '-1.703')] [2022-07-10 11:25:51,643][26022] Updated weights on worker 0-0, policy_version 703257 (0.00094) [2022-07-10 11:25:53,383][26022] Updated weights on worker 0-0, policy_version 703267 (0.00089) [2022-07-10 11:25:55,036][26022] Updated weights on worker 0-0, policy_version 703277 (0.00078) [2022-07-10 11:25:56,513][25689] Fps is (10 sec: 5668.0, 60 sec: 5590.6, 300 sec: 5603.2). Total num frames: 720163840. Throughput: 0: 5842.3. Samples: 720172952. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:25:56,514][25689] Avg episode reward: [(0, '-1.707')] [2022-07-10 11:25:57,046][26022] Updated weights on worker 0-0, policy_version 703287 (0.00093) [2022-07-10 11:25:58,680][26022] Updated weights on worker 0-0, policy_version 703297 (0.00088) [2022-07-10 11:26:00,688][26022] Updated weights on worker 0-0, policy_version 703307 (0.00091) [2022-07-10 11:26:01,642][25689] Fps is (10 sec: 5845.5, 60 sec: 5606.4, 300 sec: 5618.4). Total num frames: 720193536. Throughput: 0: 5831.2. Samples: 720190004. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:01,642][25689] Avg episode reward: [(0, '-2.311')] [2022-07-10 11:26:02,658][26022] Updated weights on worker 0-0, policy_version 703317 (0.00051) [2022-07-10 11:26:04,569][26022] Updated weights on worker 0-0, policy_version 703327 (0.00987) [2022-07-10 11:26:06,342][26022] Updated weights on worker 0-0, policy_version 703337 (0.00085) [2022-07-10 11:26:06,650][25689] Fps is (10 sec: 5353.8, 60 sec: 5559.9, 300 sec: 5604.9). Total num frames: 720218112. Throughput: 0: 5751.7. Samples: 720221862. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:06,650][25689] Avg episode reward: [(0, '-2.002')] [2022-07-10 11:26:08,070][26022] Updated weights on worker 0-0, policy_version 703347 (0.00088) [2022-07-10 11:26:10,258][26022] Updated weights on worker 0-0, policy_version 703357 (0.00083) [2022-07-10 11:26:11,591][26022] Updated weights on worker 0-0, policy_version 703367 (0.00090) [2022-07-10 11:26:11,666][25689] Fps is (10 sec: 5414.3, 60 sec: 5613.0, 300 sec: 5612.0). Total num frames: 720247808. Throughput: 0: 5792.7. Samples: 720255894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:11,666][25689] Avg episode reward: [(0, '-3.097')] [2022-07-10 11:26:13,805][26022] Updated weights on worker 0-0, policy_version 703377 (0.00085) [2022-07-10 11:26:15,366][26022] Updated weights on worker 0-0, policy_version 703387 (0.00085) [2022-07-10 11:26:16,688][25689] Fps is (10 sec: 5610.9, 60 sec: 5582.5, 300 sec: 5606.0). Total num frames: 720274432. Throughput: 0: 4946.5. Samples: 720272746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:16,688][25689] Avg episode reward: [(0, '-2.664')] [2022-07-10 11:26:17,294][26022] Updated weights on worker 0-0, policy_version 703397 (0.00085) [2022-07-10 11:26:19,083][26022] Updated weights on worker 0-0, policy_version 703407 (0.00098) [2022-07-10 11:26:20,989][26022] Updated weights on worker 0-0, policy_version 703417 (0.00092) [2022-07-10 11:26:21,735][25689] Fps is (10 sec: 5389.7, 60 sec: 5566.8, 300 sec: 5602.2). Total num frames: 720302080. Throughput: 0: 5789.8. Samples: 720306340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:21,736][25689] Avg episode reward: [(0, '-2.215')] [2022-07-10 11:26:22,664][26022] Updated weights on worker 0-0, policy_version 703427 (0.00086) [2022-07-10 11:26:24,786][26022] Updated weights on worker 0-0, policy_version 703437 (0.00086) [2022-07-10 11:26:26,383][26022] Updated weights on worker 0-0, policy_version 703447 (0.00085) [2022-07-10 11:26:26,757][25689] Fps is (10 sec: 5593.2, 60 sec: 5584.0, 300 sec: 5599.1). Total num frames: 720330752. Throughput: 0: 5869.4. Samples: 720339878. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:26,757][25689] Avg episode reward: [(0, '-1.273')] [2022-07-10 11:26:27,648][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:26:27,665][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000703453_720335872.pth [2022-07-10 11:26:27,665][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000701485_718320640.pth [2022-07-10 11:26:28,306][26022] Updated weights on worker 0-0, policy_version 703457 (0.00082) [2022-07-10 11:26:30,005][26022] Updated weights on worker 0-0, policy_version 703467 (0.00079) [2022-07-10 11:26:31,766][25689] Fps is (10 sec: 5717.1, 60 sec: 5601.1, 300 sec: 5602.4). Total num frames: 720359424. Throughput: 0: 5022.2. Samples: 720356840. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:31,766][25689] Avg episode reward: [(0, '-1.149')] [2022-07-10 11:26:31,947][26022] Updated weights on worker 0-0, policy_version 703477 (0.00084) [2022-07-10 11:26:33,664][26022] Updated weights on worker 0-0, policy_version 703487 (0.00084) [2022-07-10 11:26:35,435][26022] Updated weights on worker 0-0, policy_version 703497 (0.00086) [2022-07-10 11:26:36,784][25689] Fps is (10 sec: 5616.7, 60 sec: 5588.5, 300 sec: 5599.5). Total num frames: 720387072. Throughput: 0: 5886.0. Samples: 720391034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:36,785][25689] Avg episode reward: [(0, '-0.410')] [2022-07-10 11:26:37,219][26022] Updated weights on worker 0-0, policy_version 703507 (0.00086) [2022-07-10 11:26:39,144][26022] Updated weights on worker 0-0, policy_version 703517 (0.00084) [2022-07-10 11:26:40,947][26022] Updated weights on worker 0-0, policy_version 703527 (0.00082) [2022-07-10 11:26:41,916][25689] Fps is (10 sec: 5649.7, 60 sec: 5564.4, 300 sec: 5605.7). Total num frames: 720416768. Throughput: 0: 5876.9. Samples: 720424936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:41,917][25689] Avg episode reward: [(0, '-0.283')] [2022-07-10 11:26:42,755][26022] Updated weights on worker 0-0, policy_version 703537 (0.00088) [2022-07-10 11:26:44,599][26022] Updated weights on worker 0-0, policy_version 703547 (0.00081) [2022-07-10 11:26:46,242][26022] Updated weights on worker 0-0, policy_version 703557 (0.00081) [2022-07-10 11:26:46,918][25689] Fps is (10 sec: 5658.5, 60 sec: 5583.4, 300 sec: 5602.6). Total num frames: 720444416. Throughput: 0: 5075.2. Samples: 720442200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:46,919][25689] Avg episode reward: [(0, '-0.022')] [2022-07-10 11:26:48,081][26022] Updated weights on worker 0-0, policy_version 703567 (0.00091) [2022-07-10 11:26:50,019][26022] Updated weights on worker 0-0, policy_version 703577 (0.00083) [2022-07-10 11:26:51,775][26022] Updated weights on worker 0-0, policy_version 703587 (0.00091) [2022-07-10 11:26:51,949][25689] Fps is (10 sec: 5715.1, 60 sec: 5622.3, 300 sec: 5602.9). Total num frames: 720474112. Throughput: 0: 5905.3. Samples: 720476032. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:51,953][25689] Avg episode reward: [(0, '-0.258')] [2022-07-10 11:26:53,615][26022] Updated weights on worker 0-0, policy_version 703597 (0.00105) [2022-07-10 11:26:55,235][26022] Updated weights on worker 0-0, policy_version 703607 (0.00088) [2022-07-10 11:26:56,975][25689] Fps is (10 sec: 5804.1, 60 sec: 5605.9, 300 sec: 5611.6). Total num frames: 720502784. Throughput: 0: 5911.5. Samples: 720510390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:26:56,975][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 11:26:57,232][26022] Updated weights on worker 0-0, policy_version 703617 (0.00089) [2022-07-10 11:26:58,971][26022] Updated weights on worker 0-0, policy_version 703627 (0.00086) [2022-07-10 11:27:00,872][26022] Updated weights on worker 0-0, policy_version 703637 (0.00086) [2022-07-10 11:27:02,105][25689] Fps is (10 sec: 5545.6, 60 sec: 5571.9, 300 sec: 5605.8). Total num frames: 720530432. Throughput: 0: 5083.3. Samples: 720527566. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:02,107][25689] Avg episode reward: [(0, '-2.401')] [2022-07-10 11:27:02,839][26022] Updated weights on worker 0-0, policy_version 703647 (0.00102) [2022-07-10 11:27:04,791][26022] Updated weights on worker 0-0, policy_version 703657 (0.00091) [2022-07-10 11:27:06,405][26022] Updated weights on worker 0-0, policy_version 703667 (0.00091) [2022-07-10 11:27:07,130][25689] Fps is (10 sec: 5445.1, 60 sec: 5621.2, 300 sec: 5609.2). Total num frames: 720558080. Throughput: 0: 5804.5. Samples: 720559518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:07,132][25689] Avg episode reward: [(0, '-2.757')] [2022-07-10 11:27:08,214][26022] Updated weights on worker 0-0, policy_version 703677 (0.00091) [2022-07-10 11:27:09,927][26022] Updated weights on worker 0-0, policy_version 703687 (0.00086) [2022-07-10 11:27:11,772][26022] Updated weights on worker 0-0, policy_version 703697 (0.00091) [2022-07-10 11:27:12,151][25689] Fps is (10 sec: 5606.3, 60 sec: 5603.7, 300 sec: 5612.8). Total num frames: 720586752. Throughput: 0: 5828.4. Samples: 720593776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:12,153][25689] Avg episode reward: [(0, '-3.287')] [2022-07-10 11:27:13,711][26022] Updated weights on worker 0-0, policy_version 703707 (0.00092) [2022-07-10 11:27:15,459][26022] Updated weights on worker 0-0, policy_version 703717 (0.00092) [2022-07-10 11:27:17,165][25689] Fps is (10 sec: 5612.4, 60 sec: 5621.4, 300 sec: 5607.0). Total num frames: 720614400. Throughput: 0: 4973.1. Samples: 720610798. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:17,165][25689] Avg episode reward: [(0, '-3.000')] [2022-07-10 11:27:17,304][26022] Updated weights on worker 0-0, policy_version 703727 (0.00093) [2022-07-10 11:27:19,280][26022] Updated weights on worker 0-0, policy_version 703737 (0.00079) [2022-07-10 11:27:21,007][26022] Updated weights on worker 0-0, policy_version 703747 (0.00091) [2022-07-10 11:27:22,282][25689] Fps is (10 sec: 5559.1, 60 sec: 5631.9, 300 sec: 5608.8). Total num frames: 720643072. Throughput: 0: 5784.3. Samples: 720644276. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:22,283][25689] Avg episode reward: [(0, '-2.223')] [2022-07-10 11:27:22,927][26022] Updated weights on worker 0-0, policy_version 703757 (0.00091) [2022-07-10 11:27:24,715][26022] Updated weights on worker 0-0, policy_version 703767 (0.00083) [2022-07-10 11:27:26,558][26022] Updated weights on worker 0-0, policy_version 703777 (0.00081) [2022-07-10 11:27:27,301][25689] Fps is (10 sec: 5657.2, 60 sec: 5632.1, 300 sec: 5605.5). Total num frames: 720671744. Throughput: 0: 5873.7. Samples: 720678000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:27,302][25689] Avg episode reward: [(0, '-2.474')] [2022-07-10 11:27:28,326][26022] Updated weights on worker 0-0, policy_version 703787 (0.00082) [2022-07-10 11:27:30,090][26022] Updated weights on worker 0-0, policy_version 703797 (0.00097) [2022-07-10 11:27:32,039][26022] Updated weights on worker 0-0, policy_version 703807 (0.00085) [2022-07-10 11:27:32,331][25689] Fps is (10 sec: 5604.8, 60 sec: 5613.3, 300 sec: 5606.2). Total num frames: 720699392. Throughput: 0: 5014.6. Samples: 720694968. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:32,331][25689] Avg episode reward: [(0, '-1.657')] [2022-07-10 11:27:33,835][26022] Updated weights on worker 0-0, policy_version 703817 (0.00081) [2022-07-10 11:27:35,774][26022] Updated weights on worker 0-0, policy_version 703827 (0.00093) [2022-07-10 11:27:37,338][25689] Fps is (10 sec: 5611.7, 60 sec: 5631.3, 300 sec: 5607.9). Total num frames: 720728064. Throughput: 0: 5843.3. Samples: 720728674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:37,338][25689] Avg episode reward: [(0, '-2.617')] [2022-07-10 11:27:37,443][26022] Updated weights on worker 0-0, policy_version 703837 (0.00089) [2022-07-10 11:27:39,320][26022] Updated weights on worker 0-0, policy_version 703847 (0.00087) [2022-07-10 11:27:41,168][26022] Updated weights on worker 0-0, policy_version 703857 (0.00085) [2022-07-10 11:27:42,463][25689] Fps is (10 sec: 5659.6, 60 sec: 5614.9, 300 sec: 5606.3). Total num frames: 720756736. Throughput: 0: 5856.3. Samples: 720762460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:42,463][25689] Avg episode reward: [(0, '-2.433')] [2022-07-10 11:27:42,811][26022] Updated weights on worker 0-0, policy_version 703867 (0.00090) [2022-07-10 11:27:44,847][26022] Updated weights on worker 0-0, policy_version 703877 (0.00093) [2022-07-10 11:27:46,535][26022] Updated weights on worker 0-0, policy_version 703887 (0.00087) [2022-07-10 11:27:47,495][25689] Fps is (10 sec: 5444.0, 60 sec: 5595.3, 300 sec: 5602.5). Total num frames: 720783360. Throughput: 0: 5016.0. Samples: 720779290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:47,495][25689] Avg episode reward: [(0, '-3.940')] [2022-07-10 11:27:48,341][26022] Updated weights on worker 0-0, policy_version 703897 (0.00091) [2022-07-10 11:27:50,342][26022] Updated weights on worker 0-0, policy_version 703907 (0.00093) [2022-07-10 11:27:51,955][26022] Updated weights on worker 0-0, policy_version 703917 (0.00085) [2022-07-10 11:27:52,498][25689] Fps is (10 sec: 5816.1, 60 sec: 5631.7, 300 sec: 5609.4). Total num frames: 720815104. Throughput: 0: 5854.7. Samples: 720813044. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:52,499][25689] Avg episode reward: [(0, '-5.641')] [2022-07-10 11:27:54,036][26022] Updated weights on worker 0-0, policy_version 703927 (0.00084) [2022-07-10 11:27:55,424][26022] Updated weights on worker 0-0, policy_version 703937 (0.00087) [2022-07-10 11:27:57,501][25689] Fps is (10 sec: 5731.1, 60 sec: 5583.1, 300 sec: 5604.1). Total num frames: 720840704. Throughput: 0: 5879.0. Samples: 720847212. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 11:27:57,501][25689] Avg episode reward: [(0, '-4.839')] [2022-07-10 11:27:57,638][26022] Updated weights on worker 0-0, policy_version 703947 (0.00089) [2022-07-10 11:27:59,244][26022] Updated weights on worker 0-0, policy_version 703957 (0.00088) [2022-07-10 11:28:01,192][26022] Updated weights on worker 0-0, policy_version 703967 (0.00090) [2022-07-10 11:28:02,570][25689] Fps is (10 sec: 5286.9, 60 sec: 5588.7, 300 sec: 5610.1). Total num frames: 720868352. Throughput: 0: 5047.6. Samples: 720863954. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:02,571][25689] Avg episode reward: [(0, '-4.343')] [2022-07-10 11:28:03,262][26022] Updated weights on worker 0-0, policy_version 703977 (0.00086) [2022-07-10 11:28:05,002][26022] Updated weights on worker 0-0, policy_version 703987 (0.00090) [2022-07-10 11:28:06,937][26022] Updated weights on worker 0-0, policy_version 703997 (0.00087) [2022-07-10 11:28:07,583][25689] Fps is (10 sec: 5484.6, 60 sec: 5589.8, 300 sec: 5603.7). Total num frames: 720896000. Throughput: 0: 5816.3. Samples: 720896128. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:07,583][25689] Avg episode reward: [(0, '-3.159')] [2022-07-10 11:28:08,710][26022] Updated weights on worker 0-0, policy_version 704007 (0.00099) [2022-07-10 11:28:10,551][26022] Updated weights on worker 0-0, policy_version 704017 (0.00090) [2022-07-10 11:28:12,388][26022] Updated weights on worker 0-0, policy_version 704027 (0.00086) [2022-07-10 11:28:12,596][25689] Fps is (10 sec: 5719.5, 60 sec: 5607.5, 300 sec: 5614.4). Total num frames: 720925696. Throughput: 0: 5827.6. Samples: 720930166. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:12,598][25689] Avg episode reward: [(0, '-2.839')] [2022-07-10 11:28:13,990][26022] Updated weights on worker 0-0, policy_version 704037 (0.00086) [2022-07-10 11:28:15,757][26022] Updated weights on worker 0-0, policy_version 704047 (0.00086) [2022-07-10 11:28:17,616][25689] Fps is (10 sec: 5613.5, 60 sec: 5590.0, 300 sec: 5605.5). Total num frames: 720952320. Throughput: 0: 4983.3. Samples: 720947450. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:17,616][25689] Avg episode reward: [(0, '-2.156')] [2022-07-10 11:28:17,879][26022] Updated weights on worker 0-0, policy_version 704057 (0.00094) [2022-07-10 11:28:19,379][26022] Updated weights on worker 0-0, policy_version 704067 (0.00094) [2022-07-10 11:28:21,511][26022] Updated weights on worker 0-0, policy_version 704077 (0.00086) [2022-07-10 11:28:22,686][25689] Fps is (10 sec: 5683.2, 60 sec: 5628.2, 300 sec: 5612.7). Total num frames: 720983040. Throughput: 0: 5818.7. Samples: 720981004. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:22,688][25689] Avg episode reward: [(0, '-0.714')] [2022-07-10 11:28:23,231][26022] Updated weights on worker 0-0, policy_version 704087 (0.00091) [2022-07-10 11:28:25,009][26022] Updated weights on worker 0-0, policy_version 704097 (0.00086) [2022-07-10 11:28:26,936][26022] Updated weights on worker 0-0, policy_version 704107 (0.00090) [2022-07-10 11:28:27,759][25689] Fps is (10 sec: 5754.2, 60 sec: 5606.3, 300 sec: 5608.8). Total num frames: 721010688. Throughput: 0: 5873.4. Samples: 721014632. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:27,760][25689] Avg episode reward: [(0, '-0.726')] [2022-07-10 11:28:27,981][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:28:28,000][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000704113_721011712.pth [2022-07-10 11:28:28,000][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000702139_718990336.pth [2022-07-10 11:28:28,677][26022] Updated weights on worker 0-0, policy_version 704117 (0.00084) [2022-07-10 11:28:30,454][26022] Updated weights on worker 0-0, policy_version 704127 (0.00088) [2022-07-10 11:28:32,540][26022] Updated weights on worker 0-0, policy_version 704137 (0.00092) [2022-07-10 11:28:32,840][25689] Fps is (10 sec: 5445.8, 60 sec: 5601.5, 300 sec: 5605.1). Total num frames: 721038336. Throughput: 0: 5843.6. Samples: 721048464. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:32,841][25689] Avg episode reward: [(0, '-0.905')] [2022-07-10 11:28:34,066][26022] Updated weights on worker 0-0, policy_version 704147 (0.00081) [2022-07-10 11:28:36,052][26022] Updated weights on worker 0-0, policy_version 704157 (0.00083) [2022-07-10 11:28:37,775][26022] Updated weights on worker 0-0, policy_version 704167 (0.00083) [2022-07-10 11:28:37,871][25689] Fps is (10 sec: 5569.4, 60 sec: 5599.2, 300 sec: 5605.6). Total num frames: 721067008. Throughput: 0: 5828.4. Samples: 721065510. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:37,872][25689] Avg episode reward: [(0, '-1.610')] [2022-07-10 11:28:39,596][26022] Updated weights on worker 0-0, policy_version 704177 (0.00081) [2022-07-10 11:28:41,304][26022] Updated weights on worker 0-0, policy_version 704187 (0.00081) [2022-07-10 11:28:42,966][25689] Fps is (10 sec: 5663.4, 60 sec: 5602.1, 300 sec: 5607.4). Total num frames: 721095680. Throughput: 0: 5845.8. Samples: 721099552. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:42,966][25689] Avg episode reward: [(0, '-1.914')] [2022-07-10 11:28:43,183][26022] Updated weights on worker 0-0, policy_version 704197 (0.00089) [2022-07-10 11:28:45,056][26022] Updated weights on worker 0-0, policy_version 704207 (0.00089) [2022-07-10 11:28:47,120][26022] Updated weights on worker 0-0, policy_version 704217 (0.00092) [2022-07-10 11:28:47,982][25689] Fps is (10 sec: 5671.5, 60 sec: 5637.4, 300 sec: 5603.7). Total num frames: 721124352. Throughput: 0: 5869.6. Samples: 721133334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:47,983][25689] Avg episode reward: [(0, '-1.496')] [2022-07-10 11:28:48,610][26022] Updated weights on worker 0-0, policy_version 704227 (0.00077) [2022-07-10 11:28:50,739][26022] Updated weights on worker 0-0, policy_version 704237 (0.00093) [2022-07-10 11:28:52,094][26022] Updated weights on worker 0-0, policy_version 704247 (0.00093) [2022-07-10 11:28:52,985][25689] Fps is (10 sec: 5723.0, 60 sec: 5586.6, 300 sec: 5607.1). Total num frames: 721153024. Throughput: 0: 5050.2. Samples: 721150202. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:52,986][25689] Avg episode reward: [(0, '-1.039')] [2022-07-10 11:28:54,342][26022] Updated weights on worker 0-0, policy_version 704257 (0.00092) [2022-07-10 11:28:55,825][26022] Updated weights on worker 0-0, policy_version 704267 (0.00086) [2022-07-10 11:28:57,823][26022] Updated weights on worker 0-0, policy_version 704277 (0.00085) [2022-07-10 11:28:57,992][25689] Fps is (10 sec: 5728.7, 60 sec: 5637.0, 300 sec: 5608.4). Total num frames: 721181696. Throughput: 0: 5884.3. Samples: 721183908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:28:57,993][25689] Avg episode reward: [(0, '-1.075')] [2022-07-10 11:28:59,723][26022] Updated weights on worker 0-0, policy_version 704287 (0.00081) [2022-07-10 11:29:01,756][26022] Updated weights on worker 0-0, policy_version 704297 (0.00089) [2022-07-10 11:29:03,096][25689] Fps is (10 sec: 5266.4, 60 sec: 5583.0, 300 sec: 5600.0). Total num frames: 721206272. Throughput: 0: 5774.4. Samples: 721215796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:03,097][25689] Avg episode reward: [(0, '-1.114')] [2022-07-10 11:29:03,615][26022] Updated weights on worker 0-0, policy_version 704307 (0.00623) [2022-07-10 11:29:05,544][26022] Updated weights on worker 0-0, policy_version 704317 (0.00087) [2022-07-10 11:29:07,196][26022] Updated weights on worker 0-0, policy_version 704327 (0.00086) [2022-07-10 11:29:08,129][25689] Fps is (10 sec: 5252.9, 60 sec: 5598.1, 300 sec: 5599.5). Total num frames: 721234944. Throughput: 0: 4927.0. Samples: 721232598. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:08,130][25689] Avg episode reward: [(0, '-0.069')] [2022-07-10 11:29:09,347][26022] Updated weights on worker 0-0, policy_version 704337 (0.00089) [2022-07-10 11:29:10,700][26022] Updated weights on worker 0-0, policy_version 704347 (0.00086) [2022-07-10 11:29:12,809][26022] Updated weights on worker 0-0, policy_version 704357 (0.00089) [2022-07-10 11:29:13,142][25689] Fps is (10 sec: 5810.4, 60 sec: 5598.1, 300 sec: 5606.5). Total num frames: 721264640. Throughput: 0: 5763.2. Samples: 721266368. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:13,142][25689] Avg episode reward: [(0, '-0.516')] [2022-07-10 11:29:14,764][26022] Updated weights on worker 0-0, policy_version 704367 (0.00090) [2022-07-10 11:29:16,218][26022] Updated weights on worker 0-0, policy_version 704377 (0.00609) [2022-07-10 11:29:18,159][25689] Fps is (10 sec: 5513.4, 60 sec: 5581.5, 300 sec: 5596.7). Total num frames: 721290240. Throughput: 0: 5744.6. Samples: 721299756. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:18,159][25689] Avg episode reward: [(0, '-1.048')] [2022-07-10 11:29:18,461][26022] Updated weights on worker 0-0, policy_version 704387 (0.00082) [2022-07-10 11:29:20,067][26022] Updated weights on worker 0-0, policy_version 704397 (0.00094) [2022-07-10 11:29:21,941][26022] Updated weights on worker 0-0, policy_version 704407 (0.00092) [2022-07-10 11:29:23,236][25689] Fps is (10 sec: 5478.2, 60 sec: 5564.0, 300 sec: 5602.3). Total num frames: 721319936. Throughput: 0: 4995.7. Samples: 721316406. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:23,236][25689] Avg episode reward: [(0, '-1.461')] [2022-07-10 11:29:23,966][26022] Updated weights on worker 0-0, policy_version 704417 (0.00093) [2022-07-10 11:29:25,537][26022] Updated weights on worker 0-0, policy_version 704427 (0.00089) [2022-07-10 11:29:27,633][26022] Updated weights on worker 0-0, policy_version 704437 (0.00084) [2022-07-10 11:29:28,262][25689] Fps is (10 sec: 5675.5, 60 sec: 5568.2, 300 sec: 5591.8). Total num frames: 721347584. Throughput: 0: 5822.3. Samples: 721349820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:28,264][25689] Avg episode reward: [(0, '-1.472')] [2022-07-10 11:29:29,187][26022] Updated weights on worker 0-0, policy_version 704447 (0.00096) [2022-07-10 11:29:31,177][26022] Updated weights on worker 0-0, policy_version 704457 (0.00085) [2022-07-10 11:29:33,108][26022] Updated weights on worker 0-0, policy_version 704467 (0.00093) [2022-07-10 11:29:33,287][25689] Fps is (10 sec: 5399.5, 60 sec: 5556.5, 300 sec: 5588.6). Total num frames: 721374208. Throughput: 0: 5812.4. Samples: 721383460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:33,289][25689] Avg episode reward: [(0, '-1.931')] [2022-07-10 11:29:34,654][26022] Updated weights on worker 0-0, policy_version 704477 (0.00090) [2022-07-10 11:29:36,733][26022] Updated weights on worker 0-0, policy_version 704487 (0.00086) [2022-07-10 11:29:38,293][25689] Fps is (10 sec: 5512.8, 60 sec: 5558.8, 300 sec: 5586.1). Total num frames: 721402880. Throughput: 0: 4990.8. Samples: 721400242. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:38,295][25689] Avg episode reward: [(0, '-1.694')] [2022-07-10 11:29:38,606][26022] Updated weights on worker 0-0, policy_version 704497 (0.00094) [2022-07-10 11:29:40,380][26022] Updated weights on worker 0-0, policy_version 704507 (0.00092) [2022-07-10 11:29:42,185][26022] Updated weights on worker 0-0, policy_version 704517 (0.00094) [2022-07-10 11:29:43,391][25689] Fps is (10 sec: 5675.5, 60 sec: 5558.5, 300 sec: 5587.9). Total num frames: 721431552. Throughput: 0: 5815.9. Samples: 721433626. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:43,393][25689] Avg episode reward: [(0, '-2.288')] [2022-07-10 11:29:44,070][26022] Updated weights on worker 0-0, policy_version 704527 (0.00091) [2022-07-10 11:29:45,818][26022] Updated weights on worker 0-0, policy_version 704537 (0.00094) [2022-07-10 11:29:47,747][26022] Updated weights on worker 0-0, policy_version 704547 (0.00088) [2022-07-10 11:29:48,411][25689] Fps is (10 sec: 5566.4, 60 sec: 5541.2, 300 sec: 5585.3). Total num frames: 721459200. Throughput: 0: 5825.6. Samples: 721467196. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:48,411][25689] Avg episode reward: [(0, '-2.096')] [2022-07-10 11:29:49,483][26022] Updated weights on worker 0-0, policy_version 704557 (0.00093) [2022-07-10 11:29:51,374][26022] Updated weights on worker 0-0, policy_version 704567 (0.00084) [2022-07-10 11:29:53,154][26022] Updated weights on worker 0-0, policy_version 704577 (0.00087) [2022-07-10 11:29:53,464][25689] Fps is (10 sec: 5591.0, 60 sec: 5536.6, 300 sec: 5588.4). Total num frames: 721487872. Throughput: 0: 4982.0. Samples: 721483982. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:53,467][25689] Avg episode reward: [(0, '-1.716')] [2022-07-10 11:29:54,991][26022] Updated weights on worker 0-0, policy_version 704587 (0.00090) [2022-07-10 11:29:56,749][26022] Updated weights on worker 0-0, policy_version 704597 (0.00090) [2022-07-10 11:29:58,477][25689] Fps is (10 sec: 5696.7, 60 sec: 5536.1, 300 sec: 5590.3). Total num frames: 721516544. Throughput: 0: 5833.4. Samples: 721517984. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:29:58,477][25689] Avg episode reward: [(0, '-1.711')] [2022-07-10 11:29:58,877][26022] Updated weights on worker 0-0, policy_version 704607 (0.00087) [2022-07-10 11:30:00,418][26022] Updated weights on worker 0-0, policy_version 704617 (0.00097) [2022-07-10 11:30:02,936][26022] Updated weights on worker 0-0, policy_version 704627 (0.00099) [2022-07-10 11:30:03,527][25689] Fps is (10 sec: 5393.3, 60 sec: 5558.0, 300 sec: 5583.5). Total num frames: 721542144. Throughput: 0: 5740.6. Samples: 721549218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:03,527][25689] Avg episode reward: [(0, '-2.001')] [2022-07-10 11:30:04,500][26022] Updated weights on worker 0-0, policy_version 704637 (0.00089) [2022-07-10 11:30:06,555][26022] Updated weights on worker 0-0, policy_version 704647 (0.00084) [2022-07-10 11:30:08,219][26022] Updated weights on worker 0-0, policy_version 704657 (0.00385) [2022-07-10 11:30:08,605][25689] Fps is (10 sec: 5257.2, 60 sec: 5536.9, 300 sec: 5586.3). Total num frames: 721569792. Throughput: 0: 4877.0. Samples: 721565686. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:08,606][25689] Avg episode reward: [(0, '-2.059')] [2022-07-10 11:30:09,964][26022] Updated weights on worker 0-0, policy_version 704667 (0.00080) [2022-07-10 11:30:12,020][26022] Updated weights on worker 0-0, policy_version 704677 (0.00086) [2022-07-10 11:30:13,599][26022] Updated weights on worker 0-0, policy_version 704687 (0.00082) [2022-07-10 11:30:13,693][25689] Fps is (10 sec: 5640.7, 60 sec: 5530.0, 300 sec: 5589.1). Total num frames: 721599488. Throughput: 0: 5708.4. Samples: 721599458. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:13,693][25689] Avg episode reward: [(0, '-1.147')] [2022-07-10 11:30:15,595][26022] Updated weights on worker 0-0, policy_version 704697 (0.00091) [2022-07-10 11:30:17,480][26022] Updated weights on worker 0-0, policy_version 704707 (0.00090) [2022-07-10 11:30:18,743][25689] Fps is (10 sec: 5656.6, 60 sec: 5560.8, 300 sec: 5585.9). Total num frames: 721627136. Throughput: 0: 5681.4. Samples: 721633124. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:18,743][25689] Avg episode reward: [(0, '-0.506')] [2022-07-10 11:30:19,398][26022] Updated weights on worker 0-0, policy_version 704717 (0.00083) [2022-07-10 11:30:21,103][26022] Updated weights on worker 0-0, policy_version 704727 (0.00090) [2022-07-10 11:30:23,000][26022] Updated weights on worker 0-0, policy_version 704737 (0.00091) [2022-07-10 11:30:23,884][25689] Fps is (10 sec: 5526.4, 60 sec: 5538.0, 300 sec: 5587.2). Total num frames: 721655808. Throughput: 0: 4939.1. Samples: 721649764. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:23,885][25689] Avg episode reward: [(0, '-0.101')] [2022-07-10 11:30:24,755][26022] Updated weights on worker 0-0, policy_version 704747 (0.00086) [2022-07-10 11:30:26,608][26022] Updated weights on worker 0-0, policy_version 704757 (0.00088) [2022-07-10 11:30:28,006][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:30:28,017][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000704764_721678336.pth [2022-07-10 11:30:28,018][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000702799_719666176.pth [2022-07-10 11:30:28,494][26022] Updated weights on worker 0-0, policy_version 704767 (0.00085) [2022-07-10 11:30:28,925][25689] Fps is (10 sec: 5531.4, 60 sec: 5536.7, 300 sec: 5586.6). Total num frames: 721683456. Throughput: 0: 5774.2. Samples: 721683014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:28,926][25689] Avg episode reward: [(0, '0.095')] [2022-07-10 11:30:30,178][26022] Updated weights on worker 0-0, policy_version 704777 (0.00084) [2022-07-10 11:30:32,327][26022] Updated weights on worker 0-0, policy_version 704787 (0.00089) [2022-07-10 11:30:33,898][26022] Updated weights on worker 0-0, policy_version 704797 (0.00086) [2022-07-10 11:30:33,956][25689] Fps is (10 sec: 5693.5, 60 sec: 5586.8, 300 sec: 5590.7). Total num frames: 721713152. Throughput: 0: 5784.9. Samples: 721716678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:33,956][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 11:30:35,775][26022] Updated weights on worker 0-0, policy_version 704807 (0.00086) [2022-07-10 11:30:37,554][26022] Updated weights on worker 0-0, policy_version 704817 (0.00083) [2022-07-10 11:30:38,997][25689] Fps is (10 sec: 5693.2, 60 sec: 5566.7, 300 sec: 5580.6). Total num frames: 721740800. Throughput: 0: 4959.2. Samples: 721733570. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:38,998][25689] Avg episode reward: [(0, '-0.565')] [2022-07-10 11:30:39,275][26022] Updated weights on worker 0-0, policy_version 704827 (0.00087) [2022-07-10 11:30:41,333][26022] Updated weights on worker 0-0, policy_version 704837 (0.00106) [2022-07-10 11:30:43,055][26022] Updated weights on worker 0-0, policy_version 704847 (0.00083) [2022-07-10 11:30:44,111][25689] Fps is (10 sec: 5344.4, 60 sec: 5531.5, 300 sec: 5578.9). Total num frames: 721767424. Throughput: 0: 5803.9. Samples: 721767158. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:44,113][25689] Avg episode reward: [(0, '-0.622')] [2022-07-10 11:30:45,123][26022] Updated weights on worker 0-0, policy_version 704857 (0.00388) [2022-07-10 11:30:46,796][26022] Updated weights on worker 0-0, policy_version 704867 (0.00082) [2022-07-10 11:30:48,553][26022] Updated weights on worker 0-0, policy_version 704877 (0.00089) [2022-07-10 11:30:49,129][25689] Fps is (10 sec: 5558.9, 60 sec: 5565.4, 300 sec: 5587.1). Total num frames: 721797120. Throughput: 0: 5822.2. Samples: 721800644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:49,129][25689] Avg episode reward: [(0, '-1.446')] [2022-07-10 11:30:50,573][26022] Updated weights on worker 0-0, policy_version 704887 (0.00084) [2022-07-10 11:30:52,330][26022] Updated weights on worker 0-0, policy_version 704897 (0.00089) [2022-07-10 11:30:54,042][26022] Updated weights on worker 0-0, policy_version 704907 (0.00094) [2022-07-10 11:30:54,143][25689] Fps is (10 sec: 5716.3, 60 sec: 5552.1, 300 sec: 5580.5). Total num frames: 721824768. Throughput: 0: 4993.2. Samples: 721817474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:54,145][25689] Avg episode reward: [(0, '-0.631')] [2022-07-10 11:30:55,891][26022] Updated weights on worker 0-0, policy_version 704917 (0.00095) [2022-07-10 11:30:57,909][26022] Updated weights on worker 0-0, policy_version 704927 (0.00090) [2022-07-10 11:30:59,219][25689] Fps is (10 sec: 5581.4, 60 sec: 5546.3, 300 sec: 5578.1). Total num frames: 721853440. Throughput: 0: 5805.6. Samples: 721850972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:30:59,221][25689] Avg episode reward: [(0, '-1.610')] [2022-07-10 11:30:59,576][26022] Updated weights on worker 0-0, policy_version 704937 (0.00091) [2022-07-10 11:31:01,550][26022] Updated weights on worker 0-0, policy_version 704947 (0.00098) [2022-07-10 11:31:03,718][26022] Updated weights on worker 0-0, policy_version 704957 (0.01236) [2022-07-10 11:31:04,266][25689] Fps is (10 sec: 5158.9, 60 sec: 5512.9, 300 sec: 5573.9). Total num frames: 721876992. Throughput: 0: 5710.8. Samples: 721882258. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:31:04,266][25689] Avg episode reward: [(0, '-1.929')] [2022-07-10 11:31:05,551][26022] Updated weights on worker 0-0, policy_version 704967 (0.00085) [2022-07-10 11:31:07,484][26022] Updated weights on worker 0-0, policy_version 704977 (0.00088) [2022-07-10 11:31:09,306][25689] Fps is (10 sec: 5177.8, 60 sec: 5533.3, 300 sec: 5570.0). Total num frames: 721905664. Throughput: 0: 5711.6. Samples: 721915886. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:31:09,306][25689] Avg episode reward: [(0, '-2.047')] [2022-07-10 11:31:09,330][26022] Updated weights on worker 0-0, policy_version 704987 (0.00093) [2022-07-10 11:31:11,119][26022] Updated weights on worker 0-0, policy_version 704997 (0.00083) [2022-07-10 11:31:13,014][26022] Updated weights on worker 0-0, policy_version 705007 (0.00109) [2022-07-10 11:31:14,343][25689] Fps is (10 sec: 5791.8, 60 sec: 5537.8, 300 sec: 5580.0). Total num frames: 721935360. Throughput: 0: 5708.9. Samples: 721932798. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:31:14,344][25689] Avg episode reward: [(0, '-1.763')] [2022-07-10 11:31:14,694][26022] Updated weights on worker 0-0, policy_version 705017 (0.00091) [2022-07-10 11:31:16,521][26022] Updated weights on worker 0-0, policy_version 705027 (0.00543) [2022-07-10 11:31:18,120][26022] Updated weights on worker 0-0, policy_version 705037 (0.00087) [2022-07-10 11:31:19,438][25689] Fps is (10 sec: 5760.3, 60 sec: 5550.6, 300 sec: 5582.6). Total num frames: 721964032. Throughput: 0: 5717.9. Samples: 721966582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 11:31:19,439][25689] Avg episode reward: [(0, '-2.714')] [2022-07-10 11:31:20,367][26022] Updated weights on worker 0-0, policy_version 705047 (0.00090) [2022-07-10 11:31:21,750][26022] Updated weights on worker 0-0, policy_version 705057 (0.00099) [2022-07-10 11:31:23,947][26022] Updated weights on worker 0-0, policy_version 705067 (0.00086) [2022-07-10 11:31:24,511][25689] Fps is (10 sec: 5640.0, 60 sec: 5556.9, 300 sec: 5581.6). Total num frames: 721992704. Throughput: 0: 5808.3. Samples: 721999848. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:24,511][25689] Avg episode reward: [(0, '-3.163')] [2022-07-10 11:31:25,941][26022] Updated weights on worker 0-0, policy_version 705077 (0.00089) [2022-07-10 11:31:27,427][26022] Updated weights on worker 0-0, policy_version 705087 (0.00086) [2022-07-10 11:31:29,526][25689] Fps is (10 sec: 5380.0, 60 sec: 5525.4, 300 sec: 5571.2). Total num frames: 722018304. Throughput: 0: 4981.8. Samples: 722016620. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:29,527][25689] Avg episode reward: [(0, '-3.652')] [2022-07-10 11:31:29,547][26022] Updated weights on worker 0-0, policy_version 705097 (0.00084) [2022-07-10 11:31:31,057][26022] Updated weights on worker 0-0, policy_version 705107 (0.00086) [2022-07-10 11:31:33,115][26022] Updated weights on worker 0-0, policy_version 705117 (0.00096) [2022-07-10 11:31:34,540][25689] Fps is (10 sec: 5513.4, 60 sec: 5527.0, 300 sec: 5578.2). Total num frames: 722048000. Throughput: 0: 5805.2. Samples: 722050044. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:34,541][25689] Avg episode reward: [(0, '-3.370')] [2022-07-10 11:31:35,122][26022] Updated weights on worker 0-0, policy_version 705127 (0.00080) [2022-07-10 11:31:36,616][26022] Updated weights on worker 0-0, policy_version 705137 (0.00095) [2022-07-10 11:31:38,662][26022] Updated weights on worker 0-0, policy_version 705147 (0.00086) [2022-07-10 11:31:39,593][25689] Fps is (10 sec: 5797.8, 60 sec: 5542.8, 300 sec: 5576.2). Total num frames: 722076672. Throughput: 0: 5798.2. Samples: 722083446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:39,594][25689] Avg episode reward: [(0, '-3.772')] [2022-07-10 11:31:40,460][26022] Updated weights on worker 0-0, policy_version 705157 (0.00088) [2022-07-10 11:31:42,276][26022] Updated weights on worker 0-0, policy_version 705167 (0.00086) [2022-07-10 11:31:44,254][26022] Updated weights on worker 0-0, policy_version 705177 (0.00088) [2022-07-10 11:31:44,704][25689] Fps is (10 sec: 5440.3, 60 sec: 5543.1, 300 sec: 5570.7). Total num frames: 722103296. Throughput: 0: 4968.2. Samples: 722100174. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:44,706][25689] Avg episode reward: [(0, '-2.964')] [2022-07-10 11:31:45,925][26022] Updated weights on worker 0-0, policy_version 705187 (0.00084) [2022-07-10 11:31:47,873][26022] Updated weights on worker 0-0, policy_version 705197 (0.00087) [2022-07-10 11:31:49,553][26022] Updated weights on worker 0-0, policy_version 705207 (0.00086) [2022-07-10 11:31:49,719][25689] Fps is (10 sec: 5461.2, 60 sec: 5526.4, 300 sec: 5567.6). Total num frames: 722131968. Throughput: 0: 5793.1. Samples: 722133598. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:49,723][25689] Avg episode reward: [(0, '-2.835')] [2022-07-10 11:31:51,332][26022] Updated weights on worker 0-0, policy_version 705217 (0.00898) [2022-07-10 11:31:53,400][26022] Updated weights on worker 0-0, policy_version 705227 (0.00086) [2022-07-10 11:31:54,743][25689] Fps is (10 sec: 5610.3, 60 sec: 5525.5, 300 sec: 5564.2). Total num frames: 722159616. Throughput: 0: 5795.5. Samples: 722167130. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:54,745][25689] Avg episode reward: [(0, '-1.681')] [2022-07-10 11:31:55,128][26022] Updated weights on worker 0-0, policy_version 705237 (0.00085) [2022-07-10 11:31:57,132][26022] Updated weights on worker 0-0, policy_version 705247 (0.00091) [2022-07-10 11:31:58,834][26022] Updated weights on worker 0-0, policy_version 705257 (0.00082) [2022-07-10 11:31:59,824][25689] Fps is (10 sec: 5573.5, 60 sec: 5525.1, 300 sec: 5568.5). Total num frames: 722188288. Throughput: 0: 4963.3. Samples: 722183856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:31:59,826][25689] Avg episode reward: [(0, '-1.164')] [2022-07-10 11:32:00,656][26022] Updated weights on worker 0-0, policy_version 705267 (0.00085) [2022-07-10 11:32:02,827][26022] Updated weights on worker 0-0, policy_version 705277 (0.00089) [2022-07-10 11:32:04,818][26022] Updated weights on worker 0-0, policy_version 705287 (0.00091) [2022-07-10 11:32:04,924][25689] Fps is (10 sec: 5431.3, 60 sec: 5570.9, 300 sec: 5563.7). Total num frames: 722214912. Throughput: 0: 5699.6. Samples: 722215418. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:04,925][25689] Avg episode reward: [(0, '-1.311')] [2022-07-10 11:32:06,548][26022] Updated weights on worker 0-0, policy_version 705297 (0.00098) [2022-07-10 11:32:08,530][26022] Updated weights on worker 0-0, policy_version 705307 (0.00080) [2022-07-10 11:32:09,961][25689] Fps is (10 sec: 5353.6, 60 sec: 5554.2, 300 sec: 5559.9). Total num frames: 722242560. Throughput: 0: 5700.0. Samples: 722248980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:09,962][25689] Avg episode reward: [(0, '-1.010')] [2022-07-10 11:32:10,241][26022] Updated weights on worker 0-0, policy_version 705317 (0.00087) [2022-07-10 11:32:12,088][26022] Updated weights on worker 0-0, policy_version 705327 (0.00087) [2022-07-10 11:32:13,941][26022] Updated weights on worker 0-0, policy_version 705337 (0.00092) [2022-07-10 11:32:15,033][25689] Fps is (10 sec: 5469.9, 60 sec: 5517.4, 300 sec: 5558.9). Total num frames: 722270208. Throughput: 0: 4854.7. Samples: 722265632. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:15,034][25689] Avg episode reward: [(0, '-1.224')] [2022-07-10 11:32:15,743][26022] Updated weights on worker 0-0, policy_version 705347 (0.00086) [2022-07-10 11:32:17,671][26022] Updated weights on worker 0-0, policy_version 705357 (0.00094) [2022-07-10 11:32:19,347][26022] Updated weights on worker 0-0, policy_version 705367 (0.00086) [2022-07-10 11:32:20,060][25689] Fps is (10 sec: 5779.9, 60 sec: 5557.4, 300 sec: 5567.4). Total num frames: 722300928. Throughput: 0: 5699.7. Samples: 722299194. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:20,060][25689] Avg episode reward: [(0, '-1.795')] [2022-07-10 11:32:21,494][26022] Updated weights on worker 0-0, policy_version 705377 (0.00084) [2022-07-10 11:32:22,988][26022] Updated weights on worker 0-0, policy_version 705387 (0.00057) [2022-07-10 11:32:25,052][26022] Updated weights on worker 0-0, policy_version 705397 (0.00087) [2022-07-10 11:32:25,129][25689] Fps is (10 sec: 5679.8, 60 sec: 5523.8, 300 sec: 5559.6). Total num frames: 722327552. Throughput: 0: 5810.1. Samples: 722332812. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:25,130][25689] Avg episode reward: [(0, '-1.685')] [2022-07-10 11:32:26,611][26022] Updated weights on worker 0-0, policy_version 705407 (0.00085) [2022-07-10 11:32:28,144][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:32:28,158][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000705415_722344960.pth [2022-07-10 11:32:28,159][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000703453_720335872.pth [2022-07-10 11:32:28,563][26022] Updated weights on worker 0-0, policy_version 705417 (0.00089) [2022-07-10 11:32:30,131][25689] Fps is (10 sec: 5389.0, 60 sec: 5558.9, 300 sec: 5560.1). Total num frames: 722355200. Throughput: 0: 4991.7. Samples: 722349660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:30,131][25689] Avg episode reward: [(0, '-2.268')] [2022-07-10 11:32:30,505][26022] Updated weights on worker 0-0, policy_version 705427 (0.00083) [2022-07-10 11:32:32,156][26022] Updated weights on worker 0-0, policy_version 705437 (0.00089) [2022-07-10 11:32:33,908][26022] Updated weights on worker 0-0, policy_version 705447 (0.00083) [2022-07-10 11:32:35,140][25689] Fps is (10 sec: 5626.0, 60 sec: 5542.5, 300 sec: 5560.1). Total num frames: 722383872. Throughput: 0: 5861.0. Samples: 722383478. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:35,140][25689] Avg episode reward: [(0, '-3.323')] [2022-07-10 11:32:36,005][26022] Updated weights on worker 0-0, policy_version 705457 (0.00091) [2022-07-10 11:32:37,623][26022] Updated weights on worker 0-0, policy_version 705467 (0.00060) [2022-07-10 11:32:39,668][26022] Updated weights on worker 0-0, policy_version 705477 (0.00088) [2022-07-10 11:32:40,171][25689] Fps is (10 sec: 5609.5, 60 sec: 5527.6, 300 sec: 5558.4). Total num frames: 722411520. Throughput: 0: 5846.7. Samples: 722416778. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:40,171][25689] Avg episode reward: [(0, '-2.385')] [2022-07-10 11:32:41,409][26022] Updated weights on worker 0-0, policy_version 705487 (0.00090) [2022-07-10 11:32:43,150][26022] Updated weights on worker 0-0, policy_version 705497 (0.00104) [2022-07-10 11:32:45,299][25689] Fps is (10 sec: 5342.3, 60 sec: 5526.0, 300 sec: 5556.6). Total num frames: 722438144. Throughput: 0: 4972.2. Samples: 722433098. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:45,299][25689] Avg episode reward: [(0, '-2.936')] [2022-07-10 11:32:45,447][26022] Updated weights on worker 0-0, policy_version 705507 (0.00086) [2022-07-10 11:32:46,746][26022] Updated weights on worker 0-0, policy_version 705517 (0.00093) [2022-07-10 11:32:48,991][26022] Updated weights on worker 0-0, policy_version 705527 (0.00094) [2022-07-10 11:32:50,333][25689] Fps is (10 sec: 5542.1, 60 sec: 5541.2, 300 sec: 5549.2). Total num frames: 722467840. Throughput: 0: 5774.7. Samples: 722466322. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:50,333][25689] Avg episode reward: [(0, '-2.578')] [2022-07-10 11:32:50,570][26022] Updated weights on worker 0-0, policy_version 705537 (0.00083) [2022-07-10 11:32:52,475][26022] Updated weights on worker 0-0, policy_version 705547 (0.00085) [2022-07-10 11:32:54,559][26022] Updated weights on worker 0-0, policy_version 705557 (0.00098) [2022-07-10 11:32:55,345][25689] Fps is (10 sec: 5606.0, 60 sec: 5525.3, 300 sec: 5552.4). Total num frames: 722494464. Throughput: 0: 5743.6. Samples: 722499530. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:32:55,346][25689] Avg episode reward: [(0, '-2.700')] [2022-07-10 11:32:56,135][26022] Updated weights on worker 0-0, policy_version 705567 (0.00087) [2022-07-10 11:32:58,135][26022] Updated weights on worker 0-0, policy_version 705577 (0.00092) [2022-07-10 11:32:59,833][26022] Updated weights on worker 0-0, policy_version 705587 (0.00087) [2022-07-10 11:33:00,373][25689] Fps is (10 sec: 5405.2, 60 sec: 5513.2, 300 sec: 5553.2). Total num frames: 722522112. Throughput: 0: 4928.7. Samples: 722516352. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:00,374][25689] Avg episode reward: [(0, '-0.951')] [2022-07-10 11:33:02,255][26022] Updated weights on worker 0-0, policy_version 705597 (0.00083) [2022-07-10 11:33:04,052][26022] Updated weights on worker 0-0, policy_version 705607 (0.00089) [2022-07-10 11:33:05,420][25689] Fps is (10 sec: 5386.7, 60 sec: 5518.1, 300 sec: 5549.1). Total num frames: 722548736. Throughput: 0: 5690.7. Samples: 722547604. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:05,421][25689] Avg episode reward: [(0, '-1.117')] [2022-07-10 11:33:06,118][26022] Updated weights on worker 0-0, policy_version 705617 (0.00082) [2022-07-10 11:33:07,562][26022] Updated weights on worker 0-0, policy_version 705627 (0.00088) [2022-07-10 11:33:09,807][26022] Updated weights on worker 0-0, policy_version 705637 (0.00089) [2022-07-10 11:33:10,424][25689] Fps is (10 sec: 5501.9, 60 sec: 5538.1, 300 sec: 5545.9). Total num frames: 722577408. Throughput: 0: 5698.1. Samples: 722580802. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:10,425][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 11:33:11,187][26022] Updated weights on worker 0-0, policy_version 705647 (0.01094) [2022-07-10 11:33:13,226][26022] Updated weights on worker 0-0, policy_version 705657 (0.00085) [2022-07-10 11:33:15,058][26022] Updated weights on worker 0-0, policy_version 705667 (0.00095) [2022-07-10 11:33:15,433][25689] Fps is (10 sec: 5420.3, 60 sec: 5509.9, 300 sec: 5542.6). Total num frames: 722603008. Throughput: 0: 4886.0. Samples: 722597680. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:15,435][25689] Avg episode reward: [(0, '-0.391')] [2022-07-10 11:33:16,924][26022] Updated weights on worker 0-0, policy_version 705677 (0.00081) [2022-07-10 11:33:18,849][26022] Updated weights on worker 0-0, policy_version 705687 (0.00087) [2022-07-10 11:33:20,451][25689] Fps is (10 sec: 5514.8, 60 sec: 5493.8, 300 sec: 5540.2). Total num frames: 722632704. Throughput: 0: 5716.3. Samples: 722631120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:20,451][25689] Avg episode reward: [(0, '-1.787')] [2022-07-10 11:33:20,596][26022] Updated weights on worker 0-0, policy_version 705697 (0.00099) [2022-07-10 11:33:22,427][26022] Updated weights on worker 0-0, policy_version 705707 (0.00394) [2022-07-10 11:33:24,528][26022] Updated weights on worker 0-0, policy_version 705717 (0.00090) [2022-07-10 11:33:25,553][25689] Fps is (10 sec: 5666.4, 60 sec: 5507.8, 300 sec: 5539.6). Total num frames: 722660352. Throughput: 0: 5785.0. Samples: 722664072. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:25,554][25689] Avg episode reward: [(0, '-2.256')] [2022-07-10 11:33:26,262][26022] Updated weights on worker 0-0, policy_version 705727 (0.00094) [2022-07-10 11:33:28,196][26022] Updated weights on worker 0-0, policy_version 705737 (0.00088) [2022-07-10 11:33:29,879][26022] Updated weights on worker 0-0, policy_version 705747 (0.00084) [2022-07-10 11:33:30,559][25689] Fps is (10 sec: 5369.2, 60 sec: 5490.4, 300 sec: 5537.6). Total num frames: 722686976. Throughput: 0: 5800.1. Samples: 722697586. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:30,559][25689] Avg episode reward: [(0, '-2.220')] [2022-07-10 11:33:31,636][26022] Updated weights on worker 0-0, policy_version 705757 (0.00091) [2022-07-10 11:33:33,609][26022] Updated weights on worker 0-0, policy_version 705767 (0.00088) [2022-07-10 11:33:35,319][26022] Updated weights on worker 0-0, policy_version 705777 (0.00093) [2022-07-10 11:33:35,584][25689] Fps is (10 sec: 5614.4, 60 sec: 5505.9, 300 sec: 5541.1). Total num frames: 722716672. Throughput: 0: 5790.5. Samples: 722714366. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:35,585][25689] Avg episode reward: [(0, '-1.990')] [2022-07-10 11:33:37,194][26022] Updated weights on worker 0-0, policy_version 705787 (0.00052) [2022-07-10 11:33:38,927][26022] Updated weights on worker 0-0, policy_version 705797 (0.00094) [2022-07-10 11:33:40,608][25689] Fps is (10 sec: 5706.1, 60 sec: 5506.5, 300 sec: 5539.0). Total num frames: 722744320. Throughput: 0: 5799.6. Samples: 722748026. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:40,609][25689] Avg episode reward: [(0, '-2.466')] [2022-07-10 11:33:40,824][26022] Updated weights on worker 0-0, policy_version 705807 (0.00092) [2022-07-10 11:33:42,702][26022] Updated weights on worker 0-0, policy_version 705817 (0.00094) [2022-07-10 11:33:44,600][26022] Updated weights on worker 0-0, policy_version 705827 (0.00091) [2022-07-10 11:33:45,661][25689] Fps is (10 sec: 5385.9, 60 sec: 5513.4, 300 sec: 5531.4). Total num frames: 722770944. Throughput: 0: 5831.2. Samples: 722781326. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:45,662][25689] Avg episode reward: [(0, '-2.604')] [2022-07-10 11:33:46,454][26022] Updated weights on worker 0-0, policy_version 705837 (0.00088) [2022-07-10 11:33:48,352][26022] Updated weights on worker 0-0, policy_version 705847 (0.00092) [2022-07-10 11:33:50,063][26022] Updated weights on worker 0-0, policy_version 705857 (0.00094) [2022-07-10 11:33:50,665][25689] Fps is (10 sec: 5600.0, 60 sec: 5516.1, 300 sec: 5534.9). Total num frames: 722800640. Throughput: 0: 4986.0. Samples: 722797838. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:50,666][25689] Avg episode reward: [(0, '-2.464')] [2022-07-10 11:33:51,910][26022] Updated weights on worker 0-0, policy_version 705867 (0.00090) [2022-07-10 11:33:53,971][26022] Updated weights on worker 0-0, policy_version 705877 (0.00084) [2022-07-10 11:33:55,680][25689] Fps is (10 sec: 5621.2, 60 sec: 5515.8, 300 sec: 5527.8). Total num frames: 722827264. Throughput: 0: 5796.0. Samples: 722830842. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:33:55,681][25689] Avg episode reward: [(0, '-1.924')] [2022-07-10 11:33:55,735][26022] Updated weights on worker 0-0, policy_version 705887 (0.00078) [2022-07-10 11:33:57,560][26022] Updated weights on worker 0-0, policy_version 705897 (0.00088) [2022-07-10 11:33:59,371][26022] Updated weights on worker 0-0, policy_version 705907 (0.00090) [2022-07-10 11:34:00,691][25689] Fps is (10 sec: 5311.3, 60 sec: 5500.5, 300 sec: 5536.4). Total num frames: 722853888. Throughput: 0: 5794.9. Samples: 722864402. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:00,691][25689] Avg episode reward: [(0, '-2.037')] [2022-07-10 11:34:01,256][26022] Updated weights on worker 0-0, policy_version 705917 (0.00085) [2022-07-10 11:34:03,459][26022] Updated weights on worker 0-0, policy_version 705927 (0.00092) [2022-07-10 11:34:05,338][26022] Updated weights on worker 0-0, policy_version 705937 (0.00073) [2022-07-10 11:34:05,781][25689] Fps is (10 sec: 5474.5, 60 sec: 5530.5, 300 sec: 5535.4). Total num frames: 722882560. Throughput: 0: 4861.1. Samples: 722879130. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:05,782][25689] Avg episode reward: [(0, '-2.836')] [2022-07-10 11:34:07,287][26022] Updated weights on worker 0-0, policy_version 705947 (0.00086) [2022-07-10 11:34:08,883][26022] Updated weights on worker 0-0, policy_version 705957 (0.00095) [2022-07-10 11:34:10,776][26022] Updated weights on worker 0-0, policy_version 705967 (0.00081) [2022-07-10 11:34:10,793][25689] Fps is (10 sec: 5575.0, 60 sec: 5512.7, 300 sec: 5528.5). Total num frames: 722910208. Throughput: 0: 5714.0. Samples: 722912848. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:10,794][25689] Avg episode reward: [(0, '-2.019')] [2022-07-10 11:34:12,667][26022] Updated weights on worker 0-0, policy_version 705977 (0.00094) [2022-07-10 11:34:14,551][26022] Updated weights on worker 0-0, policy_version 705987 (0.00087) [2022-07-10 11:34:15,832][25689] Fps is (10 sec: 5501.3, 60 sec: 5543.9, 300 sec: 5535.0). Total num frames: 722937856. Throughput: 0: 5735.9. Samples: 722946432. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:15,833][25689] Avg episode reward: [(0, '-1.722')] [2022-07-10 11:34:16,175][26022] Updated weights on worker 0-0, policy_version 705997 (0.00084) [2022-07-10 11:34:18,226][26022] Updated weights on worker 0-0, policy_version 706007 (0.00085) [2022-07-10 11:34:19,862][26022] Updated weights on worker 0-0, policy_version 706017 (0.00087) [2022-07-10 11:34:20,837][25689] Fps is (10 sec: 5505.2, 60 sec: 5511.1, 300 sec: 5529.4). Total num frames: 722965504. Throughput: 0: 4918.5. Samples: 722963492. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:20,838][25689] Avg episode reward: [(0, '-2.480')] [2022-07-10 11:34:21,713][26022] Updated weights on worker 0-0, policy_version 706027 (0.00081) [2022-07-10 11:34:23,549][26022] Updated weights on worker 0-0, policy_version 706037 (0.00083) [2022-07-10 11:34:25,564][26022] Updated weights on worker 0-0, policy_version 706047 (0.00093) [2022-07-10 11:34:25,893][25689] Fps is (10 sec: 5699.8, 60 sec: 5549.3, 300 sec: 5535.8). Total num frames: 722995200. Throughput: 0: 5868.5. Samples: 722997158. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:25,894][25689] Avg episode reward: [(0, '-2.577')] [2022-07-10 11:34:27,463][26022] Updated weights on worker 0-0, policy_version 706057 (0.00085) [2022-07-10 11:34:28,367][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:34:28,378][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000706063_723008512.pth [2022-07-10 11:34:28,379][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000704113_721011712.pth [2022-07-10 11:34:29,219][26022] Updated weights on worker 0-0, policy_version 706067 (0.00081) [2022-07-10 11:34:30,911][25689] Fps is (10 sec: 5591.0, 60 sec: 5548.3, 300 sec: 5535.9). Total num frames: 723021824. Throughput: 0: 5831.9. Samples: 723030172. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:30,911][25689] Avg episode reward: [(0, '-2.462')] [2022-07-10 11:34:31,054][26022] Updated weights on worker 0-0, policy_version 706077 (0.00093) [2022-07-10 11:34:32,824][26022] Updated weights on worker 0-0, policy_version 706087 (0.00089) [2022-07-10 11:34:34,782][26022] Updated weights on worker 0-0, policy_version 706097 (0.00094) [2022-07-10 11:34:36,011][25689] Fps is (10 sec: 5465.3, 60 sec: 5524.5, 300 sec: 5534.2). Total num frames: 723050496. Throughput: 0: 4966.2. Samples: 723046642. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:36,011][25689] Avg episode reward: [(0, '-1.648')] [2022-07-10 11:34:36,454][26022] Updated weights on worker 0-0, policy_version 706107 (0.00087) [2022-07-10 11:34:38,483][26022] Updated weights on worker 0-0, policy_version 706117 (0.00085) [2022-07-10 11:34:40,236][26022] Updated weights on worker 0-0, policy_version 706127 (0.00083) [2022-07-10 11:34:41,036][25689] Fps is (10 sec: 5562.4, 60 sec: 5524.4, 300 sec: 5532.1). Total num frames: 723078144. Throughput: 0: 5777.5. Samples: 723080188. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 11:34:41,038][25689] Avg episode reward: [(0, '-2.457')] [2022-07-10 11:34:42,227][26022] Updated weights on worker 0-0, policy_version 706137 (0.00092) [2022-07-10 11:34:43,927][26022] Updated weights on worker 0-0, policy_version 706147 (0.00093) [2022-07-10 11:34:45,740][26022] Updated weights on worker 0-0, policy_version 706157 (0.00090) [2022-07-10 11:34:46,175][25689] Fps is (10 sec: 5540.9, 60 sec: 5550.3, 300 sec: 5533.3). Total num frames: 723106816. Throughput: 0: 5742.1. Samples: 723113620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:34:46,176][25689] Avg episode reward: [(0, '-2.822')] [2022-07-10 11:34:47,622][26022] Updated weights on worker 0-0, policy_version 706167 (0.00091) [2022-07-10 11:34:49,428][26022] Updated weights on worker 0-0, policy_version 706177 (0.00095) [2022-07-10 11:34:51,201][25689] Fps is (10 sec: 5540.6, 60 sec: 5514.5, 300 sec: 5530.4). Total num frames: 723134464. Throughput: 0: 4946.7. Samples: 723130542. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:34:51,201][25689] Avg episode reward: [(0, '-1.873')] [2022-07-10 11:34:51,302][26022] Updated weights on worker 0-0, policy_version 706187 (0.00094) [2022-07-10 11:34:53,027][26022] Updated weights on worker 0-0, policy_version 706197 (0.00098) [2022-07-10 11:34:54,906][26022] Updated weights on worker 0-0, policy_version 706207 (0.00086) [2022-07-10 11:34:56,227][25689] Fps is (10 sec: 5501.1, 60 sec: 5530.4, 300 sec: 5526.7). Total num frames: 723162112. Throughput: 0: 5803.5. Samples: 723163966. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:34:56,228][25689] Avg episode reward: [(0, '-2.224')] [2022-07-10 11:34:56,954][26022] Updated weights on worker 0-0, policy_version 706217 (0.00092) [2022-07-10 11:34:58,631][26022] Updated weights on worker 0-0, policy_version 706227 (0.00087) [2022-07-10 11:35:00,566][26022] Updated weights on worker 0-0, policy_version 706237 (0.00087) [2022-07-10 11:35:01,250][25689] Fps is (10 sec: 5706.3, 60 sec: 5580.0, 300 sec: 5540.9). Total num frames: 723191808. Throughput: 0: 5782.9. Samples: 723197084. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:01,251][25689] Avg episode reward: [(0, '-2.080')] [2022-07-10 11:35:02,642][26022] Updated weights on worker 0-0, policy_version 706247 (0.00087) [2022-07-10 11:35:04,731][26022] Updated weights on worker 0-0, policy_version 706257 (0.00085) [2022-07-10 11:35:06,351][25689] Fps is (10 sec: 5360.8, 60 sec: 5511.4, 300 sec: 5530.2). Total num frames: 723216384. Throughput: 0: 4853.1. Samples: 723211534. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:06,352][25689] Avg episode reward: [(0, '-2.084')] [2022-07-10 11:35:06,494][26022] Updated weights on worker 0-0, policy_version 706267 (0.00093) [2022-07-10 11:35:08,347][26022] Updated weights on worker 0-0, policy_version 706277 (0.00091) [2022-07-10 11:35:10,118][26022] Updated weights on worker 0-0, policy_version 706287 (0.00085) [2022-07-10 11:35:11,410][25689] Fps is (10 sec: 5140.4, 60 sec: 5507.1, 300 sec: 5523.9). Total num frames: 723244032. Throughput: 0: 5642.9. Samples: 723244578. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:11,410][25689] Avg episode reward: [(0, '-2.469')] [2022-07-10 11:35:12,049][26022] Updated weights on worker 0-0, policy_version 706297 (0.00084) [2022-07-10 11:35:13,771][26022] Updated weights on worker 0-0, policy_version 706307 (0.00088) [2022-07-10 11:35:15,785][26022] Updated weights on worker 0-0, policy_version 706317 (0.00103) [2022-07-10 11:35:16,414][25689] Fps is (10 sec: 5596.9, 60 sec: 5527.3, 300 sec: 5528.2). Total num frames: 723272704. Throughput: 0: 5661.8. Samples: 723278258. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:16,414][25689] Avg episode reward: [(0, '-1.092')] [2022-07-10 11:35:17,410][26022] Updated weights on worker 0-0, policy_version 706327 (0.00083) [2022-07-10 11:35:19,511][26022] Updated weights on worker 0-0, policy_version 706337 (0.00088) [2022-07-10 11:35:20,973][26022] Updated weights on worker 0-0, policy_version 706347 (0.00089) [2022-07-10 11:35:21,425][25689] Fps is (10 sec: 5623.2, 60 sec: 5526.6, 300 sec: 5527.1). Total num frames: 723300352. Throughput: 0: 4857.7. Samples: 723295090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:21,426][25689] Avg episode reward: [(0, '-1.105')] [2022-07-10 11:35:23,261][26022] Updated weights on worker 0-0, policy_version 706357 (0.00085) [2022-07-10 11:35:24,588][26022] Updated weights on worker 0-0, policy_version 706367 (0.00110) [2022-07-10 11:35:26,555][25689] Fps is (10 sec: 5351.6, 60 sec: 5469.3, 300 sec: 5522.0). Total num frames: 723326976. Throughput: 0: 5776.2. Samples: 723328236. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:26,555][25689] Avg episode reward: [(0, '-1.647')] [2022-07-10 11:35:26,908][26022] Updated weights on worker 0-0, policy_version 706377 (0.00088) [2022-07-10 11:35:28,594][26022] Updated weights on worker 0-0, policy_version 706387 (0.00087) [2022-07-10 11:35:30,297][26022] Updated weights on worker 0-0, policy_version 706397 (0.00091) [2022-07-10 11:35:31,572][25689] Fps is (10 sec: 5651.7, 60 sec: 5536.9, 300 sec: 5525.7). Total num frames: 723357696. Throughput: 0: 5805.4. Samples: 723361626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:31,572][25689] Avg episode reward: [(0, '-1.548')] [2022-07-10 11:35:32,455][26022] Updated weights on worker 0-0, policy_version 706407 (0.00089) [2022-07-10 11:35:34,129][26022] Updated weights on worker 0-0, policy_version 706417 (0.00096) [2022-07-10 11:35:35,890][26022] Updated weights on worker 0-0, policy_version 706427 (0.00093) [2022-07-10 11:35:36,589][25689] Fps is (10 sec: 5817.1, 60 sec: 5527.6, 300 sec: 5526.2). Total num frames: 723385344. Throughput: 0: 4965.2. Samples: 723378432. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:36,591][25689] Avg episode reward: [(0, '-1.373')] [2022-07-10 11:35:37,876][26022] Updated weights on worker 0-0, policy_version 706437 (0.00107) [2022-07-10 11:35:39,551][26022] Updated weights on worker 0-0, policy_version 706447 (0.00096) [2022-07-10 11:35:41,610][25689] Fps is (10 sec: 5304.8, 60 sec: 5494.1, 300 sec: 5524.5). Total num frames: 723410944. Throughput: 0: 5779.9. Samples: 723411752. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:41,612][25689] Avg episode reward: [(0, '-0.825')] [2022-07-10 11:35:41,635][26022] Updated weights on worker 0-0, policy_version 706457 (0.00080) [2022-07-10 11:35:43,335][26022] Updated weights on worker 0-0, policy_version 706467 (0.00103) [2022-07-10 11:35:45,295][26022] Updated weights on worker 0-0, policy_version 706477 (0.00094) [2022-07-10 11:35:46,664][25689] Fps is (10 sec: 5386.5, 60 sec: 5501.8, 300 sec: 5520.3). Total num frames: 723439616. Throughput: 0: 5801.7. Samples: 723444906. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:46,665][25689] Avg episode reward: [(0, '-0.967')] [2022-07-10 11:35:47,037][26022] Updated weights on worker 0-0, policy_version 706487 (0.00089) [2022-07-10 11:35:48,888][26022] Updated weights on worker 0-0, policy_version 706497 (0.00089) [2022-07-10 11:35:50,570][26022] Updated weights on worker 0-0, policy_version 706507 (0.00089) [2022-07-10 11:35:51,691][25689] Fps is (10 sec: 5688.1, 60 sec: 5518.7, 300 sec: 5523.5). Total num frames: 723468288. Throughput: 0: 4974.5. Samples: 723461708. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:51,691][25689] Avg episode reward: [(0, '-0.843')] [2022-07-10 11:35:52,634][26022] Updated weights on worker 0-0, policy_version 706517 (0.00099) [2022-07-10 11:35:54,292][26022] Updated weights on worker 0-0, policy_version 706527 (0.00614) [2022-07-10 11:35:56,175][26022] Updated weights on worker 0-0, policy_version 706537 (0.00083) [2022-07-10 11:35:56,696][25689] Fps is (10 sec: 5614.3, 60 sec: 5520.6, 300 sec: 5521.4). Total num frames: 723495936. Throughput: 0: 5799.7. Samples: 723495048. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:35:56,696][25689] Avg episode reward: [(0, '-0.200')] [2022-07-10 11:35:58,224][26022] Updated weights on worker 0-0, policy_version 706547 (0.00083) [2022-07-10 11:35:59,896][26022] Updated weights on worker 0-0, policy_version 706557 (0.00083) [2022-07-10 11:36:01,686][26022] Updated weights on worker 0-0, policy_version 706567 (0.00085) [2022-07-10 11:36:01,789][25689] Fps is (10 sec: 5577.4, 60 sec: 5497.3, 300 sec: 5537.8). Total num frames: 723524608. Throughput: 0: 5796.2. Samples: 723528714. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:01,789][25689] Avg episode reward: [(0, '-0.110')] [2022-07-10 11:36:03,939][26022] Updated weights on worker 0-0, policy_version 706577 (0.00092) [2022-07-10 11:36:05,498][26022] Updated weights on worker 0-0, policy_version 706587 (0.00091) [2022-07-10 11:36:06,916][25689] Fps is (10 sec: 5310.4, 60 sec: 5511.9, 300 sec: 5525.8). Total num frames: 723550208. Throughput: 0: 5698.4. Samples: 723560306. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:06,927][25689] Avg episode reward: [(0, '-0.617')] [2022-07-10 11:36:07,704][26022] Updated weights on worker 0-0, policy_version 706597 (0.00088) [2022-07-10 11:36:09,286][26022] Updated weights on worker 0-0, policy_version 706607 (0.00086) [2022-07-10 11:36:11,283][26022] Updated weights on worker 0-0, policy_version 706617 (0.00078) [2022-07-10 11:36:11,931][25689] Fps is (10 sec: 5553.2, 60 sec: 5566.6, 300 sec: 5529.7). Total num frames: 723580928. Throughput: 0: 5700.5. Samples: 723577084. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:11,931][25689] Avg episode reward: [(0, '-0.679')] [2022-07-10 11:36:13,191][26022] Updated weights on worker 0-0, policy_version 706627 (0.00085) [2022-07-10 11:36:14,812][26022] Updated weights on worker 0-0, policy_version 706637 (0.00087) [2022-07-10 11:36:16,727][26022] Updated weights on worker 0-0, policy_version 706647 (0.00084) [2022-07-10 11:36:16,993][25689] Fps is (10 sec: 5690.4, 60 sec: 5527.4, 300 sec: 5523.4). Total num frames: 723607552. Throughput: 0: 5694.0. Samples: 723610620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:16,993][25689] Avg episode reward: [(0, '-0.830')] [2022-07-10 11:36:18,651][26022] Updated weights on worker 0-0, policy_version 706657 (0.00090) [2022-07-10 11:36:20,399][26022] Updated weights on worker 0-0, policy_version 706667 (0.00083) [2022-07-10 11:36:22,031][25689] Fps is (10 sec: 5373.3, 60 sec: 5525.1, 300 sec: 5520.6). Total num frames: 723635200. Throughput: 0: 5719.6. Samples: 723644492. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:22,031][25689] Avg episode reward: [(0, '-0.750')] [2022-07-10 11:36:22,267][26022] Updated weights on worker 0-0, policy_version 706677 (0.00085) [2022-07-10 11:36:23,868][26022] Updated weights on worker 0-0, policy_version 706687 (0.00093) [2022-07-10 11:36:26,127][26022] Updated weights on worker 0-0, policy_version 706697 (0.00086) [2022-07-10 11:36:27,112][25689] Fps is (10 sec: 5565.8, 60 sec: 5563.3, 300 sec: 5529.7). Total num frames: 723663872. Throughput: 0: 4993.8. Samples: 723661162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:27,112][25689] Avg episode reward: [(0, '-1.669')] [2022-07-10 11:36:27,723][26022] Updated weights on worker 0-0, policy_version 706707 (0.00081) [2022-07-10 11:36:28,496][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:36:28,517][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000706711_723672064.pth [2022-07-10 11:36:28,518][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000704764_721678336.pth [2022-07-10 11:36:29,742][26022] Updated weights on worker 0-0, policy_version 706717 (0.00087) [2022-07-10 11:36:31,352][26022] Updated weights on worker 0-0, policy_version 706727 (0.00052) [2022-07-10 11:36:32,140][25689] Fps is (10 sec: 5571.2, 60 sec: 5511.5, 300 sec: 5522.6). Total num frames: 723691520. Throughput: 0: 5797.5. Samples: 723694248. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:32,141][25689] Avg episode reward: [(0, '-2.545')] [2022-07-10 11:36:33,416][26022] Updated weights on worker 0-0, policy_version 706737 (0.00086) [2022-07-10 11:36:35,194][26022] Updated weights on worker 0-0, policy_version 706747 (0.00086) [2022-07-10 11:36:36,951][26022] Updated weights on worker 0-0, policy_version 706757 (0.00094) [2022-07-10 11:36:37,147][25689] Fps is (10 sec: 5612.4, 60 sec: 5529.4, 300 sec: 5523.4). Total num frames: 723720192. Throughput: 0: 5818.2. Samples: 723727878. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:37,147][25689] Avg episode reward: [(0, '-3.340')] [2022-07-10 11:36:38,745][26022] Updated weights on worker 0-0, policy_version 706767 (0.00087) [2022-07-10 11:36:40,753][26022] Updated weights on worker 0-0, policy_version 706777 (0.00105) [2022-07-10 11:36:42,155][25689] Fps is (10 sec: 5623.3, 60 sec: 5564.3, 300 sec: 5528.8). Total num frames: 723747840. Throughput: 0: 4983.5. Samples: 723744784. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:42,156][25689] Avg episode reward: [(0, '-4.523')] [2022-07-10 11:36:42,304][26022] Updated weights on worker 0-0, policy_version 706787 (0.00081) [2022-07-10 11:36:44,290][26022] Updated weights on worker 0-0, policy_version 706797 (0.00096) [2022-07-10 11:36:46,038][26022] Updated weights on worker 0-0, policy_version 706807 (0.00086) [2022-07-10 11:36:47,275][25689] Fps is (10 sec: 5560.8, 60 sec: 5558.4, 300 sec: 5526.8). Total num frames: 723776512. Throughput: 0: 5812.8. Samples: 723778366. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:47,275][25689] Avg episode reward: [(0, '-4.975')] [2022-07-10 11:36:48,031][26022] Updated weights on worker 0-0, policy_version 706817 (0.00081) [2022-07-10 11:36:49,776][26022] Updated weights on worker 0-0, policy_version 706827 (0.00088) [2022-07-10 11:36:51,867][26022] Updated weights on worker 0-0, policy_version 706837 (0.00089) [2022-07-10 11:36:52,289][25689] Fps is (10 sec: 5456.7, 60 sec: 5525.7, 300 sec: 5523.6). Total num frames: 723803136. Throughput: 0: 5825.1. Samples: 723811618. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:52,289][25689] Avg episode reward: [(0, '-3.795')] [2022-07-10 11:36:53,326][26022] Updated weights on worker 0-0, policy_version 706847 (0.00088) [2022-07-10 11:36:55,477][26022] Updated weights on worker 0-0, policy_version 706857 (0.00090) [2022-07-10 11:36:57,007][26022] Updated weights on worker 0-0, policy_version 706867 (0.00079) [2022-07-10 11:36:57,368][25689] Fps is (10 sec: 5579.8, 60 sec: 5552.7, 300 sec: 5527.1). Total num frames: 723832832. Throughput: 0: 4969.7. Samples: 723828374. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:36:57,369][25689] Avg episode reward: [(0, '-4.022')] [2022-07-10 11:36:59,083][26022] Updated weights on worker 0-0, policy_version 706877 (0.00091) [2022-07-10 11:37:00,913][26022] Updated weights on worker 0-0, policy_version 706887 (0.00087) [2022-07-10 11:37:02,375][25689] Fps is (10 sec: 5381.1, 60 sec: 5493.0, 300 sec: 5521.9). Total num frames: 723857408. Throughput: 0: 5790.8. Samples: 723861870. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:02,375][25689] Avg episode reward: [(0, '-3.556')] [2022-07-10 11:37:03,048][26022] Updated weights on worker 0-0, policy_version 706897 (0.00091) [2022-07-10 11:37:05,049][26022] Updated weights on worker 0-0, policy_version 706907 (0.00096) [2022-07-10 11:37:06,801][26022] Updated weights on worker 0-0, policy_version 706917 (0.00085) [2022-07-10 11:37:07,443][25689] Fps is (10 sec: 5387.2, 60 sec: 5566.1, 300 sec: 5528.2). Total num frames: 723887104. Throughput: 0: 5700.5. Samples: 723893332. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:07,443][25689] Avg episode reward: [(0, '-2.821')] [2022-07-10 11:37:08,532][26022] Updated weights on worker 0-0, policy_version 706927 (0.00084) [2022-07-10 11:37:10,387][26022] Updated weights on worker 0-0, policy_version 706937 (0.00089) [2022-07-10 11:37:12,174][26022] Updated weights on worker 0-0, policy_version 706947 (0.00089) [2022-07-10 11:37:12,454][25689] Fps is (10 sec: 5689.5, 60 sec: 5515.6, 300 sec: 5529.4). Total num frames: 723914752. Throughput: 0: 4887.0. Samples: 723910162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:12,455][25689] Avg episode reward: [(0, '-3.137')] [2022-07-10 11:37:14,188][26022] Updated weights on worker 0-0, policy_version 706957 (0.00088) [2022-07-10 11:37:15,828][26022] Updated weights on worker 0-0, policy_version 706967 (0.00085) [2022-07-10 11:37:17,511][25689] Fps is (10 sec: 5492.0, 60 sec: 5533.0, 300 sec: 5518.5). Total num frames: 723942400. Throughput: 0: 5735.3. Samples: 723943898. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:17,513][25689] Avg episode reward: [(0, '-2.355')] [2022-07-10 11:37:17,670][26022] Updated weights on worker 0-0, policy_version 706977 (0.00088) [2022-07-10 11:37:19,659][26022] Updated weights on worker 0-0, policy_version 706987 (0.00089) [2022-07-10 11:37:21,134][26022] Updated weights on worker 0-0, policy_version 706997 (0.00088) [2022-07-10 11:37:22,515][25689] Fps is (10 sec: 5597.7, 60 sec: 5553.1, 300 sec: 5526.6). Total num frames: 723971072. Throughput: 0: 5745.1. Samples: 723977578. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:22,515][25689] Avg episode reward: [(0, '-2.438')] [2022-07-10 11:37:23,350][26022] Updated weights on worker 0-0, policy_version 707007 (0.00089) [2022-07-10 11:37:24,807][26022] Updated weights on worker 0-0, policy_version 707017 (0.00088) [2022-07-10 11:37:27,144][26022] Updated weights on worker 0-0, policy_version 707027 (0.00087) [2022-07-10 11:37:27,550][25689] Fps is (10 sec: 5711.9, 60 sec: 5557.2, 300 sec: 5529.4). Total num frames: 723999744. Throughput: 0: 5032.2. Samples: 723994518. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:27,551][25689] Avg episode reward: [(0, '-2.179')] [2022-07-10 11:37:28,456][26022] Updated weights on worker 0-0, policy_version 707037 (0.00093) [2022-07-10 11:37:30,704][26022] Updated weights on worker 0-0, policy_version 707047 (0.00091) [2022-07-10 11:37:32,431][26022] Updated weights on worker 0-0, policy_version 707057 (0.00085) [2022-07-10 11:37:32,565][25689] Fps is (10 sec: 5603.7, 60 sec: 5558.5, 300 sec: 5525.8). Total num frames: 724027392. Throughput: 0: 5850.9. Samples: 724027834. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:32,566][25689] Avg episode reward: [(0, '-3.427')] [2022-07-10 11:37:34,083][26022] Updated weights on worker 0-0, policy_version 707067 (0.00084) [2022-07-10 11:37:36,052][26022] Updated weights on worker 0-0, policy_version 707077 (0.00091) [2022-07-10 11:37:37,576][25689] Fps is (10 sec: 5617.7, 60 sec: 5558.1, 300 sec: 5529.6). Total num frames: 724056064. Throughput: 0: 5860.0. Samples: 724061478. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:37,576][25689] Avg episode reward: [(0, '-2.427')] [2022-07-10 11:37:37,760][26022] Updated weights on worker 0-0, policy_version 707087 (0.00483) [2022-07-10 11:37:39,853][26022] Updated weights on worker 0-0, policy_version 707097 (0.00086) [2022-07-10 11:37:41,746][26022] Updated weights on worker 0-0, policy_version 707107 (0.00091) [2022-07-10 11:37:42,670][25689] Fps is (10 sec: 5371.2, 60 sec: 5516.5, 300 sec: 5526.8). Total num frames: 724081664. Throughput: 0: 4978.9. Samples: 724077926. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:42,670][25689] Avg episode reward: [(0, '-1.993')] [2022-07-10 11:37:43,466][26022] Updated weights on worker 0-0, policy_version 707117 (0.00089) [2022-07-10 11:37:45,483][26022] Updated weights on worker 0-0, policy_version 707127 (0.00087) [2022-07-10 11:37:47,000][26022] Updated weights on worker 0-0, policy_version 707137 (0.00090) [2022-07-10 11:37:47,713][25689] Fps is (10 sec: 5454.7, 60 sec: 5540.3, 300 sec: 5526.7). Total num frames: 724111360. Throughput: 0: 5794.2. Samples: 724111344. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:47,714][25689] Avg episode reward: [(0, '-2.778')] [2022-07-10 11:37:49,098][26022] Updated weights on worker 0-0, policy_version 707147 (0.00093) [2022-07-10 11:37:50,757][26022] Updated weights on worker 0-0, policy_version 707157 (0.00088) [2022-07-10 11:37:52,758][25689] Fps is (10 sec: 5582.5, 60 sec: 5537.5, 300 sec: 5526.1). Total num frames: 724137984. Throughput: 0: 5783.9. Samples: 724144628. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:52,759][25689] Avg episode reward: [(0, '-2.788')] [2022-07-10 11:37:52,760][26022] Updated weights on worker 0-0, policy_version 707167 (0.00082) [2022-07-10 11:37:54,264][26022] Updated weights on worker 0-0, policy_version 707177 (0.00098) [2022-07-10 11:37:56,404][26022] Updated weights on worker 0-0, policy_version 707187 (0.00620) [2022-07-10 11:37:57,760][25689] Fps is (10 sec: 5605.5, 60 sec: 5544.6, 300 sec: 5533.4). Total num frames: 724167680. Throughput: 0: 4951.8. Samples: 724161428. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:37:57,761][25689] Avg episode reward: [(0, '-2.874')] [2022-07-10 11:37:58,332][26022] Updated weights on worker 0-0, policy_version 707197 (0.00094) [2022-07-10 11:38:00,199][26022] Updated weights on worker 0-0, policy_version 707207 (0.00078) [2022-07-10 11:38:02,178][26022] Updated weights on worker 0-0, policy_version 707217 (0.00090) [2022-07-10 11:38:02,784][25689] Fps is (10 sec: 5515.6, 60 sec: 5560.0, 300 sec: 5530.4). Total num frames: 724193280. Throughput: 0: 5792.9. Samples: 724194444. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 11:38:02,784][25689] Avg episode reward: [(0, '-1.604')] [2022-07-10 11:38:04,252][26022] Updated weights on worker 0-0, policy_version 707227 (0.00094) [2022-07-10 11:38:05,997][26022] Updated weights on worker 0-0, policy_version 707237 (0.00088) [2022-07-10 11:38:07,735][26022] Updated weights on worker 0-0, policy_version 707247 (0.00076) [2022-07-10 11:38:07,883][25689] Fps is (10 sec: 5260.2, 60 sec: 5523.2, 300 sec: 5525.2). Total num frames: 724220928. Throughput: 0: 5672.4. Samples: 724225758. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:07,884][25689] Avg episode reward: [(0, '-1.588')] [2022-07-10 11:38:09,721][26022] Updated weights on worker 0-0, policy_version 707257 (0.00085) [2022-07-10 11:38:11,694][26022] Updated weights on worker 0-0, policy_version 707267 (0.00082) [2022-07-10 11:38:12,934][25689] Fps is (10 sec: 5447.5, 60 sec: 5519.5, 300 sec: 5531.3). Total num frames: 724248576. Throughput: 0: 4853.6. Samples: 724242554. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:12,935][25689] Avg episode reward: [(0, '-0.919')] [2022-07-10 11:38:13,309][26022] Updated weights on worker 0-0, policy_version 707277 (0.00089) [2022-07-10 11:38:15,332][26022] Updated weights on worker 0-0, policy_version 707287 (0.00088) [2022-07-10 11:38:16,927][26022] Updated weights on worker 0-0, policy_version 707297 (0.00089) [2022-07-10 11:38:17,956][25689] Fps is (10 sec: 5591.4, 60 sec: 5539.7, 300 sec: 5527.8). Total num frames: 724277248. Throughput: 0: 5682.8. Samples: 724276194. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:17,956][25689] Avg episode reward: [(0, '-0.778')] [2022-07-10 11:38:18,897][26022] Updated weights on worker 0-0, policy_version 707307 (0.00088) [2022-07-10 11:38:20,765][26022] Updated weights on worker 0-0, policy_version 707317 (0.00089) [2022-07-10 11:38:22,591][26022] Updated weights on worker 0-0, policy_version 707327 (0.00086) [2022-07-10 11:38:22,959][25689] Fps is (10 sec: 5618.3, 60 sec: 5522.9, 300 sec: 5529.6). Total num frames: 724304896. Throughput: 0: 5721.3. Samples: 724309872. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:22,959][25689] Avg episode reward: [(0, '-0.636')] [2022-07-10 11:38:24,347][26022] Updated weights on worker 0-0, policy_version 707337 (0.00090) [2022-07-10 11:38:26,127][26022] Updated weights on worker 0-0, policy_version 707347 (0.00096) [2022-07-10 11:38:27,992][26022] Updated weights on worker 0-0, policy_version 707357 (0.00087) [2022-07-10 11:38:28,012][25689] Fps is (10 sec: 5600.8, 60 sec: 5521.3, 300 sec: 5535.6). Total num frames: 724333568. Throughput: 0: 5015.4. Samples: 724326710. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:28,012][25689] Avg episode reward: [(0, '-2.301')] [2022-07-10 11:38:28,599][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:38:28,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000707360_724336640.pth [2022-07-10 11:38:28,619][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000705415_722344960.pth [2022-07-10 11:38:29,824][26022] Updated weights on worker 0-0, policy_version 707367 (0.00088) [2022-07-10 11:38:31,786][26022] Updated weights on worker 0-0, policy_version 707377 (0.00086) [2022-07-10 11:38:33,039][25689] Fps is (10 sec: 5587.4, 60 sec: 5520.2, 300 sec: 5528.7). Total num frames: 724361216. Throughput: 0: 5838.5. Samples: 724359934. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:33,039][25689] Avg episode reward: [(0, '-2.657')] [2022-07-10 11:38:33,624][26022] Updated weights on worker 0-0, policy_version 707387 (0.00089) [2022-07-10 11:38:35,247][26022] Updated weights on worker 0-0, policy_version 707397 (0.00088) [2022-07-10 11:38:37,402][26022] Updated weights on worker 0-0, policy_version 707407 (0.00098) [2022-07-10 11:38:38,059][25689] Fps is (10 sec: 5605.6, 60 sec: 5519.3, 300 sec: 5532.2). Total num frames: 724389888. Throughput: 0: 5844.7. Samples: 724393688. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:38,059][25689] Avg episode reward: [(0, '-2.863')] [2022-07-10 11:38:38,932][26022] Updated weights on worker 0-0, policy_version 707417 (0.00092) [2022-07-10 11:38:40,940][26022] Updated weights on worker 0-0, policy_version 707427 (0.00095) [2022-07-10 11:38:42,792][26022] Updated weights on worker 0-0, policy_version 707437 (0.00083) [2022-07-10 11:38:43,099][25689] Fps is (10 sec: 5496.4, 60 sec: 5541.1, 300 sec: 5532.5). Total num frames: 724416512. Throughput: 0: 4994.7. Samples: 724410466. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:43,100][25689] Avg episode reward: [(0, '-5.021')] [2022-07-10 11:38:44,447][26022] Updated weights on worker 0-0, policy_version 707447 (0.00083) [2022-07-10 11:38:46,451][26022] Updated weights on worker 0-0, policy_version 707457 (0.00089) [2022-07-10 11:38:48,091][26022] Updated weights on worker 0-0, policy_version 707467 (0.00097) [2022-07-10 11:38:48,166][25689] Fps is (10 sec: 5572.2, 60 sec: 5539.0, 300 sec: 5531.3). Total num frames: 724446208. Throughput: 0: 5824.9. Samples: 724444108. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:48,167][25689] Avg episode reward: [(0, '-4.831')] [2022-07-10 11:38:49,900][26022] Updated weights on worker 0-0, policy_version 707477 (0.00091) [2022-07-10 11:38:51,849][26022] Updated weights on worker 0-0, policy_version 707487 (0.00051) [2022-07-10 11:38:53,189][25689] Fps is (10 sec: 5582.0, 60 sec: 5541.0, 300 sec: 5531.2). Total num frames: 724472832. Throughput: 0: 5830.1. Samples: 724477410. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:53,189][25689] Avg episode reward: [(0, '-5.726')] [2022-07-10 11:38:53,696][26022] Updated weights on worker 0-0, policy_version 707497 (0.00219) [2022-07-10 11:38:55,586][26022] Updated weights on worker 0-0, policy_version 707507 (0.00081) [2022-07-10 11:38:57,177][26022] Updated weights on worker 0-0, policy_version 707517 (0.00087) [2022-07-10 11:38:58,212][25689] Fps is (10 sec: 5402.4, 60 sec: 5505.2, 300 sec: 5534.4). Total num frames: 724500480. Throughput: 0: 5821.2. Samples: 724511004. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:38:58,212][25689] Avg episode reward: [(0, '-7.345')] [2022-07-10 11:38:59,324][26022] Updated weights on worker 0-0, policy_version 707527 (0.00090) [2022-07-10 11:39:01,194][26022] Updated weights on worker 0-0, policy_version 707537 (0.00095) [2022-07-10 11:39:03,231][25689] Fps is (10 sec: 5404.4, 60 sec: 5522.6, 300 sec: 5528.8). Total num frames: 724527104. Throughput: 0: 5804.9. Samples: 724527328. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:03,231][25689] Avg episode reward: [(0, '-7.075')] [2022-07-10 11:39:03,450][26022] Updated weights on worker 0-0, policy_version 707547 (0.00097) [2022-07-10 11:39:05,273][26022] Updated weights on worker 0-0, policy_version 707557 (0.00086) [2022-07-10 11:39:07,035][26022] Updated weights on worker 0-0, policy_version 707567 (0.00091) [2022-07-10 11:39:08,357][25689] Fps is (10 sec: 5450.7, 60 sec: 5537.1, 300 sec: 5530.1). Total num frames: 724555776. Throughput: 0: 5691.9. Samples: 724559032. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:08,357][25689] Avg episode reward: [(0, '-5.628')] [2022-07-10 11:39:08,732][26022] Updated weights on worker 0-0, policy_version 707577 (0.00085) [2022-07-10 11:39:10,634][26022] Updated weights on worker 0-0, policy_version 707587 (0.00092) [2022-07-10 11:39:12,419][26022] Updated weights on worker 0-0, policy_version 707597 (0.00090) [2022-07-10 11:39:13,362][25689] Fps is (10 sec: 5559.0, 60 sec: 5541.3, 300 sec: 5530.8). Total num frames: 724583424. Throughput: 0: 5706.3. Samples: 724592526. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:13,363][25689] Avg episode reward: [(0, '-4.989')] [2022-07-10 11:39:14,322][26022] Updated weights on worker 0-0, policy_version 707607 (0.00088) [2022-07-10 11:39:16,200][26022] Updated weights on worker 0-0, policy_version 707617 (0.00087) [2022-07-10 11:39:17,980][26022] Updated weights on worker 0-0, policy_version 707627 (0.00081) [2022-07-10 11:39:18,386][25689] Fps is (10 sec: 5615.3, 60 sec: 5541.0, 300 sec: 5533.9). Total num frames: 724612096. Throughput: 0: 4874.8. Samples: 724609350. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:18,387][25689] Avg episode reward: [(0, '-4.853')] [2022-07-10 11:39:19,719][26022] Updated weights on worker 0-0, policy_version 707637 (0.00085) [2022-07-10 11:39:21,687][26022] Updated weights on worker 0-0, policy_version 707647 (0.00092) [2022-07-10 11:39:23,390][26022] Updated weights on worker 0-0, policy_version 707657 (0.00099) [2022-07-10 11:39:23,407][25689] Fps is (10 sec: 5708.7, 60 sec: 5556.3, 300 sec: 5531.1). Total num frames: 724640768. Throughput: 0: 5728.2. Samples: 724642902. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:23,408][25689] Avg episode reward: [(0, '-4.495')] [2022-07-10 11:39:25,519][26022] Updated weights on worker 0-0, policy_version 707667 (0.00083) [2022-07-10 11:39:27,079][26022] Updated weights on worker 0-0, policy_version 707677 (0.00100) [2022-07-10 11:39:28,487][25689] Fps is (10 sec: 5474.3, 60 sec: 5519.9, 300 sec: 5529.9). Total num frames: 724667392. Throughput: 0: 5813.7. Samples: 724676066. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:28,488][25689] Avg episode reward: [(0, '-1.213')] [2022-07-10 11:39:29,083][26022] Updated weights on worker 0-0, policy_version 707687 (0.00085) [2022-07-10 11:39:31,132][26022] Updated weights on worker 0-0, policy_version 707697 (0.00092) [2022-07-10 11:39:32,716][26022] Updated weights on worker 0-0, policy_version 707707 (0.00096) [2022-07-10 11:39:33,532][25689] Fps is (10 sec: 5461.3, 60 sec: 5535.3, 300 sec: 5530.9). Total num frames: 724696064. Throughput: 0: 4960.5. Samples: 724692580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:33,533][25689] Avg episode reward: [(0, '-2.856')] [2022-07-10 11:39:34,687][26022] Updated weights on worker 0-0, policy_version 707717 (0.00095) [2022-07-10 11:39:36,251][26022] Updated weights on worker 0-0, policy_version 707727 (0.00086) [2022-07-10 11:39:38,354][26022] Updated weights on worker 0-0, policy_version 707737 (0.00092) [2022-07-10 11:39:38,553][25689] Fps is (10 sec: 5595.5, 60 sec: 5518.3, 300 sec: 5531.0). Total num frames: 724723712. Throughput: 0: 5798.9. Samples: 724726292. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:38,554][25689] Avg episode reward: [(0, '-1.904')] [2022-07-10 11:39:40,310][26022] Updated weights on worker 0-0, policy_version 707747 (0.00088) [2022-07-10 11:39:41,939][26022] Updated weights on worker 0-0, policy_version 707757 (0.00084) [2022-07-10 11:39:43,598][25689] Fps is (10 sec: 5391.8, 60 sec: 5517.9, 300 sec: 5525.9). Total num frames: 724750336. Throughput: 0: 5789.8. Samples: 724759802. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:43,600][25689] Avg episode reward: [(0, '-2.441')] [2022-07-10 11:39:43,814][26022] Updated weights on worker 0-0, policy_version 707767 (0.00082) [2022-07-10 11:39:45,596][26022] Updated weights on worker 0-0, policy_version 707777 (0.00085) [2022-07-10 11:39:47,555][26022] Updated weights on worker 0-0, policy_version 707787 (0.00099) [2022-07-10 11:39:48,686][25689] Fps is (10 sec: 5557.7, 60 sec: 5515.9, 300 sec: 5531.6). Total num frames: 724780032. Throughput: 0: 4968.0. Samples: 724776414. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:48,688][25689] Avg episode reward: [(0, '-1.871')] [2022-07-10 11:39:49,637][26022] Updated weights on worker 0-0, policy_version 707797 (0.00093) [2022-07-10 11:39:51,067][26022] Updated weights on worker 0-0, policy_version 707807 (0.00071) [2022-07-10 11:39:53,059][26022] Updated weights on worker 0-0, policy_version 707817 (0.00090) [2022-07-10 11:39:53,784][25689] Fps is (10 sec: 5730.0, 60 sec: 5542.9, 300 sec: 5533.7). Total num frames: 724808704. Throughput: 0: 5786.3. Samples: 724809764. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:53,785][25689] Avg episode reward: [(0, '-1.917')] [2022-07-10 11:39:54,983][26022] Updated weights on worker 0-0, policy_version 707827 (0.00472) [2022-07-10 11:39:56,733][26022] Updated weights on worker 0-0, policy_version 707837 (0.00079) [2022-07-10 11:39:58,638][26022] Updated weights on worker 0-0, policy_version 707847 (0.00078) [2022-07-10 11:39:58,793][25689] Fps is (10 sec: 5572.5, 60 sec: 5544.1, 300 sec: 5527.1). Total num frames: 724836352. Throughput: 0: 5769.9. Samples: 724843078. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:39:58,795][25689] Avg episode reward: [(0, '-1.676')] [2022-07-10 11:40:00,320][26022] Updated weights on worker 0-0, policy_version 707857 (0.00092) [2022-07-10 11:40:02,629][26022] Updated weights on worker 0-0, policy_version 707867 (0.00087) [2022-07-10 11:40:03,841][25689] Fps is (10 sec: 5396.5, 60 sec: 5541.5, 300 sec: 5534.9). Total num frames: 724862976. Throughput: 0: 4929.8. Samples: 724859598. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:03,842][25689] Avg episode reward: [(0, '-1.486')] [2022-07-10 11:40:04,417][26022] Updated weights on worker 0-0, policy_version 707877 (0.00094) [2022-07-10 11:40:06,336][26022] Updated weights on worker 0-0, policy_version 707887 (0.00084) [2022-07-10 11:40:08,227][26022] Updated weights on worker 0-0, policy_version 707897 (0.00086) [2022-07-10 11:40:08,955][25689] Fps is (10 sec: 5240.0, 60 sec: 5508.8, 300 sec: 5530.5). Total num frames: 724889600. Throughput: 0: 5651.5. Samples: 724890962. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:08,957][25689] Avg episode reward: [(0, '-2.051')] [2022-07-10 11:40:09,933][26022] Updated weights on worker 0-0, policy_version 707907 (0.00084) [2022-07-10 11:40:11,867][26022] Updated weights on worker 0-0, policy_version 707917 (0.00085) [2022-07-10 11:40:13,608][26022] Updated weights on worker 0-0, policy_version 707927 (0.00089) [2022-07-10 11:40:13,963][25689] Fps is (10 sec: 5463.2, 60 sec: 5525.5, 300 sec: 5530.4). Total num frames: 724918272. Throughput: 0: 5688.6. Samples: 724924552. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:13,963][25689] Avg episode reward: [(0, '-1.996')] [2022-07-10 11:40:15,526][26022] Updated weights on worker 0-0, policy_version 707937 (0.00092) [2022-07-10 11:40:17,326][26022] Updated weights on worker 0-0, policy_version 707947 (0.00498) [2022-07-10 11:40:18,988][25689] Fps is (10 sec: 5613.4, 60 sec: 5508.5, 300 sec: 5530.2). Total num frames: 724945920. Throughput: 0: 4849.2. Samples: 724941008. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:18,989][25689] Avg episode reward: [(0, '-2.802')] [2022-07-10 11:40:19,372][26022] Updated weights on worker 0-0, policy_version 707957 (0.00089) [2022-07-10 11:40:20,906][26022] Updated weights on worker 0-0, policy_version 707967 (0.00084) [2022-07-10 11:40:22,848][26022] Updated weights on worker 0-0, policy_version 707977 (0.00094) [2022-07-10 11:40:24,017][25689] Fps is (10 sec: 5601.4, 60 sec: 5507.7, 300 sec: 5538.9). Total num frames: 724974592. Throughput: 0: 5701.6. Samples: 724974634. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:24,018][25689] Avg episode reward: [(0, '-2.967')] [2022-07-10 11:40:24,765][26022] Updated weights on worker 0-0, policy_version 707987 (0.00090) [2022-07-10 11:40:26,665][26022] Updated weights on worker 0-0, policy_version 707997 (0.00082) [2022-07-10 11:40:28,410][26022] Updated weights on worker 0-0, policy_version 708007 (0.00086) [2022-07-10 11:40:28,751][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:40:28,763][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000708009_725001216.pth [2022-07-10 11:40:28,764][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000706063_723008512.pth [2022-07-10 11:40:29,116][25689] Fps is (10 sec: 5561.0, 60 sec: 5522.9, 300 sec: 5527.1). Total num frames: 725002240. Throughput: 0: 5813.4. Samples: 725008166. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:29,117][25689] Avg episode reward: [(0, '-2.183')] [2022-07-10 11:40:30,255][26022] Updated weights on worker 0-0, policy_version 708017 (0.00099) [2022-07-10 11:40:32,132][26022] Updated weights on worker 0-0, policy_version 708027 (0.00097) [2022-07-10 11:40:34,022][26022] Updated weights on worker 0-0, policy_version 708037 (0.00092) [2022-07-10 11:40:34,180][25689] Fps is (10 sec: 5541.8, 60 sec: 5521.2, 300 sec: 5529.6). Total num frames: 725030912. Throughput: 0: 4950.5. Samples: 725024638. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:34,182][25689] Avg episode reward: [(0, '-2.707')] [2022-07-10 11:40:35,785][26022] Updated weights on worker 0-0, policy_version 708047 (0.00087) [2022-07-10 11:40:37,782][26022] Updated weights on worker 0-0, policy_version 708057 (0.00090) [2022-07-10 11:40:39,212][25689] Fps is (10 sec: 5679.7, 60 sec: 5537.0, 300 sec: 5539.7). Total num frames: 725059584. Throughput: 0: 5778.9. Samples: 725057882. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:39,219][25689] Avg episode reward: [(0, '-2.410')] [2022-07-10 11:40:39,431][26022] Updated weights on worker 0-0, policy_version 708067 (0.00087) [2022-07-10 11:40:41,489][26022] Updated weights on worker 0-0, policy_version 708077 (0.00082) [2022-07-10 11:40:43,104][26022] Updated weights on worker 0-0, policy_version 708087 (0.00095) [2022-07-10 11:40:44,261][25689] Fps is (10 sec: 5485.0, 60 sec: 5536.6, 300 sec: 5533.0). Total num frames: 725086208. Throughput: 0: 5766.7. Samples: 725091374. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:44,262][25689] Avg episode reward: [(0, '-2.619')] [2022-07-10 11:40:45,088][26022] Updated weights on worker 0-0, policy_version 708097 (0.00092) [2022-07-10 11:40:46,846][26022] Updated weights on worker 0-0, policy_version 708107 (0.00089) [2022-07-10 11:40:48,941][26022] Updated weights on worker 0-0, policy_version 708117 (0.00085) [2022-07-10 11:40:49,329][25689] Fps is (10 sec: 5364.4, 60 sec: 5504.7, 300 sec: 5528.7). Total num frames: 725113856. Throughput: 0: 4934.9. Samples: 725107918. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:49,330][25689] Avg episode reward: [(0, '-4.141')] [2022-07-10 11:40:50,513][26022] Updated weights on worker 0-0, policy_version 708127 (0.00089) [2022-07-10 11:40:52,648][26022] Updated weights on worker 0-0, policy_version 708137 (0.00086) [2022-07-10 11:40:54,120][26022] Updated weights on worker 0-0, policy_version 708147 (0.00094) [2022-07-10 11:40:54,359][25689] Fps is (10 sec: 5678.8, 60 sec: 5527.8, 300 sec: 5535.2). Total num frames: 725143552. Throughput: 0: 5771.8. Samples: 725141108. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:54,361][25689] Avg episode reward: [(0, '-3.535')] [2022-07-10 11:40:56,395][26022] Updated weights on worker 0-0, policy_version 708157 (0.00092) [2022-07-10 11:40:57,745][26022] Updated weights on worker 0-0, policy_version 708167 (0.00085) [2022-07-10 11:40:59,367][25689] Fps is (10 sec: 5611.1, 60 sec: 5511.1, 300 sec: 5529.9). Total num frames: 725170176. Throughput: 0: 5787.5. Samples: 725174526. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:40:59,367][25689] Avg episode reward: [(0, '-3.248')] [2022-07-10 11:40:59,963][26022] Updated weights on worker 0-0, policy_version 708177 (0.00084) [2022-07-10 11:41:01,583][26022] Updated weights on worker 0-0, policy_version 708187 (0.00094) [2022-07-10 11:41:03,955][26022] Updated weights on worker 0-0, policy_version 708197 (0.00089) [2022-07-10 11:41:04,375][25689] Fps is (10 sec: 5214.4, 60 sec: 5497.8, 300 sec: 5532.1). Total num frames: 725195776. Throughput: 0: 5686.2. Samples: 725205742. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:41:04,375][25689] Avg episode reward: [(0, '-5.228')] [2022-07-10 11:41:05,654][26022] Updated weights on worker 0-0, policy_version 708207 (0.00091) [2022-07-10 11:41:07,835][26022] Updated weights on worker 0-0, policy_version 708217 (0.00088) [2022-07-10 11:41:09,360][26022] Updated weights on worker 0-0, policy_version 708227 (0.00082) [2022-07-10 11:41:09,451][25689] Fps is (10 sec: 5382.0, 60 sec: 5535.1, 300 sec: 5524.1). Total num frames: 725224448. Throughput: 0: 5681.7. Samples: 725222242. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:41:09,451][25689] Avg episode reward: [(0, '-5.081')] [2022-07-10 11:41:11,357][26022] Updated weights on worker 0-0, policy_version 708237 (0.00613) [2022-07-10 11:41:12,996][26022] Updated weights on worker 0-0, policy_version 708247 (0.00091) [2022-07-10 11:41:14,486][25689] Fps is (10 sec: 5570.0, 60 sec: 5515.6, 300 sec: 5528.0). Total num frames: 725252096. Throughput: 0: 5690.6. Samples: 725255640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:41:14,486][25689] Avg episode reward: [(0, '-4.751')] [2022-07-10 11:41:14,989][26022] Updated weights on worker 0-0, policy_version 708257 (0.00095) [2022-07-10 11:41:16,835][26022] Updated weights on worker 0-0, policy_version 708267 (0.00096) [2022-07-10 11:41:18,679][26022] Updated weights on worker 0-0, policy_version 708277 (0.00081) [2022-07-10 11:41:19,494][25689] Fps is (10 sec: 5403.7, 60 sec: 5500.3, 300 sec: 5525.1). Total num frames: 725278720. Throughput: 0: 5685.3. Samples: 725288958. Policy #0 lag: (min: 0.0, avg: 10.4, max: 26.0) [2022-07-10 11:41:19,495][25689] Avg episode reward: [(0, '-2.546')] [2022-07-10 11:41:20,415][26022] Updated weights on worker 0-0, policy_version 708287 (0.00088) [2022-07-10 11:41:22,530][26022] Updated weights on worker 0-0, policy_version 708297 (0.00096) [2022-07-10 11:41:24,260][26022] Updated weights on worker 0-0, policy_version 708307 (0.00090) [2022-07-10 11:41:24,524][25689] Fps is (10 sec: 5610.5, 60 sec: 5517.1, 300 sec: 5529.5). Total num frames: 725308416. Throughput: 0: 4963.7. Samples: 725305758. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:24,525][25689] Avg episode reward: [(0, '-2.210')] [2022-07-10 11:41:26,227][26022] Updated weights on worker 0-0, policy_version 708317 (0.00084) [2022-07-10 11:41:27,810][26022] Updated weights on worker 0-0, policy_version 708327 (0.00091) [2022-07-10 11:41:29,655][25689] Fps is (10 sec: 5543.0, 60 sec: 5497.3, 300 sec: 5524.2). Total num frames: 725335040. Throughput: 0: 5772.9. Samples: 725338878. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:29,655][25689] Avg episode reward: [(0, '-2.266')] [2022-07-10 11:41:29,951][26022] Updated weights on worker 0-0, policy_version 708337 (0.00089) [2022-07-10 11:41:31,360][26022] Updated weights on worker 0-0, policy_version 708347 (0.00088) [2022-07-10 11:41:33,558][26022] Updated weights on worker 0-0, policy_version 708357 (0.00094) [2022-07-10 11:41:34,681][25689] Fps is (10 sec: 5545.3, 60 sec: 5517.7, 300 sec: 5527.2). Total num frames: 725364736. Throughput: 0: 5772.8. Samples: 725372220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:34,681][25689] Avg episode reward: [(0, '-0.321')] [2022-07-10 11:41:35,272][26022] Updated weights on worker 0-0, policy_version 708367 (0.00090) [2022-07-10 11:41:37,176][26022] Updated weights on worker 0-0, policy_version 708377 (0.00084) [2022-07-10 11:41:39,099][26022] Updated weights on worker 0-0, policy_version 708387 (0.00094) [2022-07-10 11:41:39,709][25689] Fps is (10 sec: 5601.6, 60 sec: 5484.2, 300 sec: 5523.4). Total num frames: 725391360. Throughput: 0: 4933.0. Samples: 725388678. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:39,710][25689] Avg episode reward: [(0, '-0.741')] [2022-07-10 11:41:41,057][26022] Updated weights on worker 0-0, policy_version 708397 (0.00082) [2022-07-10 11:41:42,625][26022] Updated weights on worker 0-0, policy_version 708407 (0.00090) [2022-07-10 11:41:44,715][25689] Fps is (10 sec: 5408.8, 60 sec: 5505.1, 300 sec: 5522.1). Total num frames: 725419008. Throughput: 0: 5765.5. Samples: 725422166. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:44,715][25689] Avg episode reward: [(0, '-1.188')] [2022-07-10 11:41:44,716][26022] Updated weights on worker 0-0, policy_version 708417 (0.00077) [2022-07-10 11:41:46,405][26022] Updated weights on worker 0-0, policy_version 708427 (0.00082) [2022-07-10 11:41:48,172][26022] Updated weights on worker 0-0, policy_version 708437 (0.00112) [2022-07-10 11:41:49,823][25689] Fps is (10 sec: 5568.7, 60 sec: 5518.4, 300 sec: 5527.3). Total num frames: 725447680. Throughput: 0: 5806.2. Samples: 725455976. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:49,823][25689] Avg episode reward: [(0, '-1.339')] [2022-07-10 11:41:50,032][26022] Updated weights on worker 0-0, policy_version 708447 (0.00099) [2022-07-10 11:41:51,787][26022] Updated weights on worker 0-0, policy_version 708457 (0.00086) [2022-07-10 11:41:53,726][26022] Updated weights on worker 0-0, policy_version 708467 (0.00090) [2022-07-10 11:41:54,847][25689] Fps is (10 sec: 5659.3, 60 sec: 5501.9, 300 sec: 5524.8). Total num frames: 725476352. Throughput: 0: 4983.7. Samples: 725472724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:54,848][25689] Avg episode reward: [(0, '-1.465')] [2022-07-10 11:41:55,713][26022] Updated weights on worker 0-0, policy_version 708477 (0.00091) [2022-07-10 11:41:57,394][26022] Updated weights on worker 0-0, policy_version 708487 (0.00083) [2022-07-10 11:41:59,606][26022] Updated weights on worker 0-0, policy_version 708497 (0.00090) [2022-07-10 11:41:59,865][25689] Fps is (10 sec: 5404.2, 60 sec: 5484.0, 300 sec: 5528.1). Total num frames: 725501952. Throughput: 0: 5802.3. Samples: 725505630. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:41:59,866][25689] Avg episode reward: [(0, '-2.105')] [2022-07-10 11:42:00,963][26022] Updated weights on worker 0-0, policy_version 708507 (0.00091) [2022-07-10 11:42:03,363][26022] Updated weights on worker 0-0, policy_version 708517 (0.00080) [2022-07-10 11:42:04,887][25689] Fps is (10 sec: 5304.0, 60 sec: 5516.7, 300 sec: 5522.0). Total num frames: 725529600. Throughput: 0: 5696.4. Samples: 725537074. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:04,889][25689] Avg episode reward: [(0, '-2.064')] [2022-07-10 11:42:05,219][26022] Updated weights on worker 0-0, policy_version 708527 (0.00094) [2022-07-10 11:42:07,084][26022] Updated weights on worker 0-0, policy_version 708537 (0.00083) [2022-07-10 11:42:09,007][26022] Updated weights on worker 0-0, policy_version 708547 (0.00096) [2022-07-10 11:42:09,958][25689] Fps is (10 sec: 5580.3, 60 sec: 5517.1, 300 sec: 5524.4). Total num frames: 725558272. Throughput: 0: 4848.2. Samples: 725553596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:09,958][25689] Avg episode reward: [(0, '-2.197')] [2022-07-10 11:42:10,773][26022] Updated weights on worker 0-0, policy_version 708557 (0.00091) [2022-07-10 11:42:12,731][26022] Updated weights on worker 0-0, policy_version 708567 (0.00089) [2022-07-10 11:42:14,687][26022] Updated weights on worker 0-0, policy_version 708577 (0.00086) [2022-07-10 11:42:14,991][25689] Fps is (10 sec: 5371.3, 60 sec: 5483.5, 300 sec: 5517.9). Total num frames: 725583872. Throughput: 0: 5652.7. Samples: 725586588. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:14,991][25689] Avg episode reward: [(0, '-1.606')] [2022-07-10 11:42:16,227][26022] Updated weights on worker 0-0, policy_version 708587 (0.00083) [2022-07-10 11:42:18,198][26022] Updated weights on worker 0-0, policy_version 708597 (0.00083) [2022-07-10 11:42:19,962][26022] Updated weights on worker 0-0, policy_version 708607 (0.00084) [2022-07-10 11:42:20,002][25689] Fps is (10 sec: 5505.1, 60 sec: 5533.9, 300 sec: 5521.2). Total num frames: 725613568. Throughput: 0: 5704.2. Samples: 725620496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:20,003][25689] Avg episode reward: [(0, '-0.631')] [2022-07-10 11:42:21,796][26022] Updated weights on worker 0-0, policy_version 708617 (0.00099) [2022-07-10 11:42:23,618][26022] Updated weights on worker 0-0, policy_version 708627 (0.00086) [2022-07-10 11:42:25,019][25689] Fps is (10 sec: 5718.4, 60 sec: 5501.3, 300 sec: 5518.1). Total num frames: 725641216. Throughput: 0: 4981.8. Samples: 725637368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:25,019][25689] Avg episode reward: [(0, '0.213')] [2022-07-10 11:42:25,385][26022] Updated weights on worker 0-0, policy_version 708637 (0.00085) [2022-07-10 11:42:27,180][26022] Updated weights on worker 0-0, policy_version 708647 (0.00093) [2022-07-10 11:42:28,871][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:42:28,882][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000708656_725663744.pth [2022-07-10 11:42:28,882][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000706711_723672064.pth [2022-07-10 11:42:29,023][26022] Updated weights on worker 0-0, policy_version 708657 (0.00089) [2022-07-10 11:42:30,161][25689] Fps is (10 sec: 5645.2, 60 sec: 5551.1, 300 sec: 5522.7). Total num frames: 725670912. Throughput: 0: 5813.1. Samples: 725671036. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:30,161][25689] Avg episode reward: [(0, '0.572')] [2022-07-10 11:42:30,785][26022] Updated weights on worker 0-0, policy_version 708667 (0.00089) [2022-07-10 11:42:32,781][26022] Updated weights on worker 0-0, policy_version 708677 (0.00085) [2022-07-10 11:42:34,859][26022] Updated weights on worker 0-0, policy_version 708687 (0.00091) [2022-07-10 11:42:35,179][25689] Fps is (10 sec: 5643.7, 60 sec: 5517.9, 300 sec: 5519.1). Total num frames: 725698560. Throughput: 0: 5840.5. Samples: 725704500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:35,180][25689] Avg episode reward: [(0, '0.764')] [2022-07-10 11:42:36,419][26022] Updated weights on worker 0-0, policy_version 708697 (0.00091) [2022-07-10 11:42:38,310][26022] Updated weights on worker 0-0, policy_version 708707 (0.00088) [2022-07-10 11:42:40,220][25689] Fps is (10 sec: 5496.9, 60 sec: 5533.7, 300 sec: 5526.9). Total num frames: 725726208. Throughput: 0: 4967.2. Samples: 725720920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:40,220][25689] Avg episode reward: [(0, '0.363')] [2022-07-10 11:42:40,222][26022] Updated weights on worker 0-0, policy_version 708717 (0.00093) [2022-07-10 11:42:41,962][26022] Updated weights on worker 0-0, policy_version 708727 (0.00083) [2022-07-10 11:42:43,929][26022] Updated weights on worker 0-0, policy_version 708737 (0.00086) [2022-07-10 11:42:45,320][25689] Fps is (10 sec: 5453.1, 60 sec: 5525.1, 300 sec: 5519.0). Total num frames: 725753856. Throughput: 0: 5755.8. Samples: 725754216. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:45,321][25689] Avg episode reward: [(0, '0.154')] [2022-07-10 11:42:45,644][26022] Updated weights on worker 0-0, policy_version 708747 (0.00093) [2022-07-10 11:42:47,627][26022] Updated weights on worker 0-0, policy_version 708757 (0.00499) [2022-07-10 11:42:49,313][26022] Updated weights on worker 0-0, policy_version 708767 (0.00093) [2022-07-10 11:42:50,411][25689] Fps is (10 sec: 5426.1, 60 sec: 5509.7, 300 sec: 5521.6). Total num frames: 725781504. Throughput: 0: 5748.2. Samples: 725787438. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:50,411][25689] Avg episode reward: [(0, '-1.420')] [2022-07-10 11:42:51,303][26022] Updated weights on worker 0-0, policy_version 708777 (0.00091) [2022-07-10 11:42:53,043][26022] Updated weights on worker 0-0, policy_version 708787 (0.00085) [2022-07-10 11:42:55,006][26022] Updated weights on worker 0-0, policy_version 708797 (0.00085) [2022-07-10 11:42:55,428][25689] Fps is (10 sec: 5571.4, 60 sec: 5510.4, 300 sec: 5517.9). Total num frames: 725810176. Throughput: 0: 4926.5. Samples: 725804256. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:42:55,429][25689] Avg episode reward: [(0, '-2.536')] [2022-07-10 11:42:56,669][26022] Updated weights on worker 0-0, policy_version 708807 (0.00087) [2022-07-10 11:42:58,769][26022] Updated weights on worker 0-0, policy_version 708817 (0.00088) [2022-07-10 11:43:00,444][25689] Fps is (10 sec: 5613.5, 60 sec: 5544.4, 300 sec: 5524.9). Total num frames: 725837824. Throughput: 0: 5767.9. Samples: 725837568. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:00,454][25689] Avg episode reward: [(0, '-2.535')] [2022-07-10 11:43:00,457][26022] Updated weights on worker 0-0, policy_version 708827 (0.00086) [2022-07-10 11:43:02,863][26022] Updated weights on worker 0-0, policy_version 708837 (0.00976) [2022-07-10 11:43:04,548][26022] Updated weights on worker 0-0, policy_version 708847 (0.00097) [2022-07-10 11:43:05,472][25689] Fps is (10 sec: 5301.7, 60 sec: 5510.0, 300 sec: 5519.3). Total num frames: 725863424. Throughput: 0: 5672.2. Samples: 725868524. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:05,473][25689] Avg episode reward: [(0, '-3.297')] [2022-07-10 11:43:06,572][26022] Updated weights on worker 0-0, policy_version 708857 (0.00089) [2022-07-10 11:43:08,303][26022] Updated weights on worker 0-0, policy_version 708867 (0.00061) [2022-07-10 11:43:10,301][26022] Updated weights on worker 0-0, policy_version 708877 (0.00087) [2022-07-10 11:43:10,516][25689] Fps is (10 sec: 5185.0, 60 sec: 5478.6, 300 sec: 5516.0). Total num frames: 725890048. Throughput: 0: 4855.4. Samples: 725885056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:10,516][25689] Avg episode reward: [(0, '-3.295')] [2022-07-10 11:43:11,892][26022] Updated weights on worker 0-0, policy_version 708887 (0.00089) [2022-07-10 11:43:13,796][26022] Updated weights on worker 0-0, policy_version 708897 (0.00091) [2022-07-10 11:43:15,524][25689] Fps is (10 sec: 5705.0, 60 sec: 5565.5, 300 sec: 5523.2). Total num frames: 725920768. Throughput: 0: 5691.7. Samples: 725918632. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:15,524][25689] Avg episode reward: [(0, '-3.516')] [2022-07-10 11:43:15,531][26022] Updated weights on worker 0-0, policy_version 708907 (0.00087) [2022-07-10 11:43:17,484][26022] Updated weights on worker 0-0, policy_version 708917 (0.00089) [2022-07-10 11:43:19,270][26022] Updated weights on worker 0-0, policy_version 708927 (0.00088) [2022-07-10 11:43:20,542][25689] Fps is (10 sec: 5821.5, 60 sec: 5531.0, 300 sec: 5522.9). Total num frames: 725948416. Throughput: 0: 5725.1. Samples: 725952634. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:20,544][25689] Avg episode reward: [(0, '-2.269')] [2022-07-10 11:43:21,081][26022] Updated weights on worker 0-0, policy_version 708937 (0.00090) [2022-07-10 11:43:22,906][26022] Updated weights on worker 0-0, policy_version 708947 (0.00091) [2022-07-10 11:43:24,641][26022] Updated weights on worker 0-0, policy_version 708957 (0.00087) [2022-07-10 11:43:25,559][25689] Fps is (10 sec: 5510.2, 60 sec: 5531.0, 300 sec: 5520.1). Total num frames: 725976064. Throughput: 0: 5041.0. Samples: 725969782. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:25,559][25689] Avg episode reward: [(0, '-2.295')] [2022-07-10 11:43:26,647][26022] Updated weights on worker 0-0, policy_version 708967 (0.00095) [2022-07-10 11:43:28,523][26022] Updated weights on worker 0-0, policy_version 708977 (0.00096) [2022-07-10 11:43:30,179][26022] Updated weights on worker 0-0, policy_version 708987 (0.00081) [2022-07-10 11:43:30,671][25689] Fps is (10 sec: 5560.5, 60 sec: 5516.8, 300 sec: 5522.0). Total num frames: 726004736. Throughput: 0: 5864.1. Samples: 726003246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:30,672][25689] Avg episode reward: [(0, '-1.428')] [2022-07-10 11:43:32,295][26022] Updated weights on worker 0-0, policy_version 708997 (0.00086) [2022-07-10 11:43:33,799][26022] Updated weights on worker 0-0, policy_version 709007 (0.00091) [2022-07-10 11:43:35,674][25689] Fps is (10 sec: 5466.8, 60 sec: 5501.3, 300 sec: 5515.4). Total num frames: 726031360. Throughput: 0: 5860.6. Samples: 726036724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:35,674][25689] Avg episode reward: [(0, '-1.476')] [2022-07-10 11:43:35,925][26022] Updated weights on worker 0-0, policy_version 709017 (0.00088) [2022-07-10 11:43:37,495][26022] Updated weights on worker 0-0, policy_version 709027 (0.00088) [2022-07-10 11:43:39,469][26022] Updated weights on worker 0-0, policy_version 709037 (0.00088) [2022-07-10 11:43:40,707][25689] Fps is (10 sec: 5611.6, 60 sec: 5535.9, 300 sec: 5525.9). Total num frames: 726061056. Throughput: 0: 5003.1. Samples: 726053520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:40,708][25689] Avg episode reward: [(0, '-1.799')] [2022-07-10 11:43:41,241][26022] Updated weights on worker 0-0, policy_version 709047 (0.00083) [2022-07-10 11:43:43,171][26022] Updated weights on worker 0-0, policy_version 709057 (0.00085) [2022-07-10 11:43:44,796][26022] Updated weights on worker 0-0, policy_version 709067 (0.00089) [2022-07-10 11:43:45,751][25689] Fps is (10 sec: 5792.2, 60 sec: 5557.9, 300 sec: 5522.9). Total num frames: 726089728. Throughput: 0: 5837.0. Samples: 726087642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:45,751][25689] Avg episode reward: [(0, '-1.773')] [2022-07-10 11:43:46,797][26022] Updated weights on worker 0-0, policy_version 709077 (0.00083) [2022-07-10 11:43:48,320][26022] Updated weights on worker 0-0, policy_version 709087 (0.00108) [2022-07-10 11:43:50,403][26022] Updated weights on worker 0-0, policy_version 709097 (0.00093) [2022-07-10 11:43:50,883][25689] Fps is (10 sec: 5534.4, 60 sec: 5554.1, 300 sec: 5524.3). Total num frames: 726117376. Throughput: 0: 5841.1. Samples: 726121310. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:50,884][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 11:43:51,982][26022] Updated weights on worker 0-0, policy_version 709107 (0.00086) [2022-07-10 11:43:54,140][26022] Updated weights on worker 0-0, policy_version 709117 (0.00093) [2022-07-10 11:43:55,578][26022] Updated weights on worker 0-0, policy_version 709127 (0.00086) [2022-07-10 11:43:55,973][25689] Fps is (10 sec: 5610.0, 60 sec: 5564.5, 300 sec: 5529.9). Total num frames: 726147072. Throughput: 0: 5820.6. Samples: 726154876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:43:55,973][25689] Avg episode reward: [(0, '-2.481')] [2022-07-10 11:43:57,735][26022] Updated weights on worker 0-0, policy_version 709137 (0.00086) [2022-07-10 11:43:59,264][26022] Updated weights on worker 0-0, policy_version 709147 (0.00085) [2022-07-10 11:44:00,997][25689] Fps is (10 sec: 5568.5, 60 sec: 5546.7, 300 sec: 5529.8). Total num frames: 726173696. Throughput: 0: 5827.3. Samples: 726171758. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:00,998][25689] Avg episode reward: [(0, '-2.627')] [2022-07-10 11:44:01,875][26022] Updated weights on worker 0-0, policy_version 709157 (0.00089) [2022-07-10 11:44:03,381][26022] Updated weights on worker 0-0, policy_version 709167 (0.00089) [2022-07-10 11:44:05,328][26022] Updated weights on worker 0-0, policy_version 709177 (0.00100) [2022-07-10 11:44:06,035][25689] Fps is (10 sec: 5393.7, 60 sec: 5579.7, 300 sec: 5528.0). Total num frames: 726201344. Throughput: 0: 5710.3. Samples: 726203470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:06,035][25689] Avg episode reward: [(0, '-2.970')] [2022-07-10 11:44:07,034][26022] Updated weights on worker 0-0, policy_version 709187 (0.00091) [2022-07-10 11:44:09,187][26022] Updated weights on worker 0-0, policy_version 709197 (0.00082) [2022-07-10 11:44:10,860][26022] Updated weights on worker 0-0, policy_version 709207 (0.00087) [2022-07-10 11:44:11,086][25689] Fps is (10 sec: 5481.0, 60 sec: 5595.9, 300 sec: 5527.2). Total num frames: 726228992. Throughput: 0: 5726.2. Samples: 726236994. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:11,087][25689] Avg episode reward: [(0, '-2.570')] [2022-07-10 11:44:12,512][26022] Updated weights on worker 0-0, policy_version 709217 (0.00085) [2022-07-10 11:44:14,360][26022] Updated weights on worker 0-0, policy_version 709227 (0.00089) [2022-07-10 11:44:16,099][25689] Fps is (10 sec: 5494.0, 60 sec: 5544.7, 300 sec: 5523.9). Total num frames: 726256640. Throughput: 0: 4926.5. Samples: 726254032. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:16,100][25689] Avg episode reward: [(0, '-3.937')] [2022-07-10 11:44:16,311][26022] Updated weights on worker 0-0, policy_version 709237 (0.00087) [2022-07-10 11:44:18,034][26022] Updated weights on worker 0-0, policy_version 709247 (0.00082) [2022-07-10 11:44:19,971][26022] Updated weights on worker 0-0, policy_version 709257 (0.00093) [2022-07-10 11:44:21,124][25689] Fps is (10 sec: 5610.9, 60 sec: 5561.0, 300 sec: 5523.9). Total num frames: 726285312. Throughput: 0: 5772.4. Samples: 726287936. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:21,124][25689] Avg episode reward: [(0, '-4.718')] [2022-07-10 11:44:21,563][26022] Updated weights on worker 0-0, policy_version 709267 (0.00087) [2022-07-10 11:44:23,535][26022] Updated weights on worker 0-0, policy_version 709277 (0.00100) [2022-07-10 11:44:25,192][26022] Updated weights on worker 0-0, policy_version 709287 (0.00085) [2022-07-10 11:44:26,141][25689] Fps is (10 sec: 5608.8, 60 sec: 5561.0, 300 sec: 5528.5). Total num frames: 726312960. Throughput: 0: 5875.2. Samples: 726321598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:26,141][25689] Avg episode reward: [(0, '-4.214')] [2022-07-10 11:44:27,213][26022] Updated weights on worker 0-0, policy_version 709297 (0.00088) [2022-07-10 11:44:28,993][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:44:29,007][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000709307_726330368.pth [2022-07-10 11:44:29,007][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000707360_724336640.pth [2022-07-10 11:44:29,014][26022] Updated weights on worker 0-0, policy_version 709307 (0.00087) [2022-07-10 11:44:30,690][26022] Updated weights on worker 0-0, policy_version 709317 (0.00087) [2022-07-10 11:44:31,181][25689] Fps is (10 sec: 5599.8, 60 sec: 5567.6, 300 sec: 5528.6). Total num frames: 726341632. Throughput: 0: 5059.6. Samples: 726338668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:31,182][25689] Avg episode reward: [(0, '-3.977')] [2022-07-10 11:44:32,716][26022] Updated weights on worker 0-0, policy_version 709327 (0.00089) [2022-07-10 11:44:34,516][26022] Updated weights on worker 0-0, policy_version 709337 (0.00086) [2022-07-10 11:44:36,221][25689] Fps is (10 sec: 5790.3, 60 sec: 5614.9, 300 sec: 5535.1). Total num frames: 726371328. Throughput: 0: 5871.9. Samples: 726372184. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:36,222][25689] Avg episode reward: [(0, '-3.679')] [2022-07-10 11:44:36,235][26022] Updated weights on worker 0-0, policy_version 709347 (0.00094) [2022-07-10 11:44:38,290][26022] Updated weights on worker 0-0, policy_version 709357 (0.00097) [2022-07-10 11:44:39,789][26022] Updated weights on worker 0-0, policy_version 709367 (0.00089) [2022-07-10 11:44:41,254][25689] Fps is (10 sec: 5591.3, 60 sec: 5564.2, 300 sec: 5535.3). Total num frames: 726397952. Throughput: 0: 5841.6. Samples: 726405528. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 11:44:41,256][25689] Avg episode reward: [(0, '-2.614')] [2022-07-10 11:44:41,926][26022] Updated weights on worker 0-0, policy_version 709377 (0.00090) [2022-07-10 11:44:43,814][26022] Updated weights on worker 0-0, policy_version 709387 (0.00084) [2022-07-10 11:44:45,410][26022] Updated weights on worker 0-0, policy_version 709397 (0.00064) [2022-07-10 11:44:46,321][25689] Fps is (10 sec: 5576.2, 60 sec: 5578.9, 300 sec: 5535.7). Total num frames: 726427648. Throughput: 0: 5003.1. Samples: 726422562. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:44:46,322][25689] Avg episode reward: [(0, '-1.468')] [2022-07-10 11:44:47,512][26022] Updated weights on worker 0-0, policy_version 709407 (0.00088) [2022-07-10 11:44:49,216][26022] Updated weights on worker 0-0, policy_version 709417 (0.00090) [2022-07-10 11:44:50,951][26022] Updated weights on worker 0-0, policy_version 709427 (0.00103) [2022-07-10 11:44:51,448][25689] Fps is (10 sec: 5625.3, 60 sec: 5579.5, 300 sec: 5531.7). Total num frames: 726455296. Throughput: 0: 5792.2. Samples: 726456056. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:44:51,449][25689] Avg episode reward: [(0, '-0.857')] [2022-07-10 11:44:53,033][26022] Updated weights on worker 0-0, policy_version 709437 (0.00090) [2022-07-10 11:44:54,396][26022] Updated weights on worker 0-0, policy_version 709447 (0.00085) [2022-07-10 11:44:56,470][25689] Fps is (10 sec: 5347.9, 60 sec: 5534.9, 300 sec: 5528.0). Total num frames: 726481920. Throughput: 0: 5792.7. Samples: 726489476. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:44:56,470][25689] Avg episode reward: [(0, '-0.812')] [2022-07-10 11:44:56,665][26022] Updated weights on worker 0-0, policy_version 709457 (0.00092) [2022-07-10 11:44:58,389][26022] Updated weights on worker 0-0, policy_version 709467 (0.00085) [2022-07-10 11:45:00,440][26022] Updated weights on worker 0-0, policy_version 709477 (0.00091) [2022-07-10 11:45:01,492][25689] Fps is (10 sec: 5607.5, 60 sec: 5586.0, 300 sec: 5538.8). Total num frames: 726511616. Throughput: 0: 4976.4. Samples: 726506238. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:01,492][25689] Avg episode reward: [(0, '-0.756')] [2022-07-10 11:45:02,357][26022] Updated weights on worker 0-0, policy_version 709487 (0.00087) [2022-07-10 11:45:04,428][26022] Updated weights on worker 0-0, policy_version 709497 (0.00085) [2022-07-10 11:45:06,024][26022] Updated weights on worker 0-0, policy_version 709507 (0.00091) [2022-07-10 11:45:06,531][25689] Fps is (10 sec: 5496.0, 60 sec: 5552.0, 300 sec: 5536.8). Total num frames: 726537216. Throughput: 0: 5689.6. Samples: 726537546. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:06,531][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 11:45:08,292][26022] Updated weights on worker 0-0, policy_version 709517 (0.00091) [2022-07-10 11:45:09,725][26022] Updated weights on worker 0-0, policy_version 709527 (0.00096) [2022-07-10 11:45:11,586][25689] Fps is (10 sec: 5072.2, 60 sec: 5517.8, 300 sec: 5525.6). Total num frames: 726562816. Throughput: 0: 5694.3. Samples: 726570726. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:11,586][25689] Avg episode reward: [(0, '0.083')] [2022-07-10 11:45:11,932][26022] Updated weights on worker 0-0, policy_version 709537 (0.00088) [2022-07-10 11:45:13,259][26022] Updated weights on worker 0-0, policy_version 709547 (0.00088) [2022-07-10 11:45:15,474][26022] Updated weights on worker 0-0, policy_version 709557 (0.00091) [2022-07-10 11:45:16,631][25689] Fps is (10 sec: 5576.2, 60 sec: 5565.7, 300 sec: 5535.6). Total num frames: 726593536. Throughput: 0: 4866.5. Samples: 726587590. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:16,631][25689] Avg episode reward: [(0, '0.348')] [2022-07-10 11:45:16,959][26022] Updated weights on worker 0-0, policy_version 709567 (0.00080) [2022-07-10 11:45:19,080][26022] Updated weights on worker 0-0, policy_version 709577 (0.00085) [2022-07-10 11:45:20,707][26022] Updated weights on worker 0-0, policy_version 709587 (0.00091) [2022-07-10 11:45:21,656][25689] Fps is (10 sec: 5796.0, 60 sec: 5548.6, 300 sec: 5532.2). Total num frames: 726621184. Throughput: 0: 5709.4. Samples: 726621366. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:21,657][25689] Avg episode reward: [(0, '-0.471')] [2022-07-10 11:45:22,568][26022] Updated weights on worker 0-0, policy_version 709597 (0.00089) [2022-07-10 11:45:24,464][26022] Updated weights on worker 0-0, policy_version 709607 (0.00090) [2022-07-10 11:45:26,252][26022] Updated weights on worker 0-0, policy_version 709617 (0.00083) [2022-07-10 11:45:26,687][25689] Fps is (10 sec: 5600.7, 60 sec: 5564.3, 300 sec: 5536.9). Total num frames: 726649856. Throughput: 0: 5839.3. Samples: 726655242. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:26,687][25689] Avg episode reward: [(0, '-0.337')] [2022-07-10 11:45:28,126][26022] Updated weights on worker 0-0, policy_version 709627 (0.00087) [2022-07-10 11:45:29,847][26022] Updated weights on worker 0-0, policy_version 709637 (0.00092) [2022-07-10 11:45:31,724][26022] Updated weights on worker 0-0, policy_version 709647 (0.00082) [2022-07-10 11:45:31,731][25689] Fps is (10 sec: 5692.0, 60 sec: 5564.0, 300 sec: 5537.3). Total num frames: 726678528. Throughput: 0: 5036.6. Samples: 726672188. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:31,731][25689] Avg episode reward: [(0, '-1.458')] [2022-07-10 11:45:33,498][26022] Updated weights on worker 0-0, policy_version 709657 (0.00092) [2022-07-10 11:45:35,313][26022] Updated weights on worker 0-0, policy_version 709667 (0.00093) [2022-07-10 11:45:36,735][25689] Fps is (10 sec: 5605.0, 60 sec: 5533.4, 300 sec: 5534.3). Total num frames: 726706176. Throughput: 0: 5886.7. Samples: 726705936. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:36,735][25689] Avg episode reward: [(0, '-1.372')] [2022-07-10 11:45:37,112][26022] Updated weights on worker 0-0, policy_version 709677 (0.00096) [2022-07-10 11:45:38,981][26022] Updated weights on worker 0-0, policy_version 709687 (0.00093) [2022-07-10 11:45:40,811][26022] Updated weights on worker 0-0, policy_version 709697 (0.00085) [2022-07-10 11:45:41,755][25689] Fps is (10 sec: 5618.2, 60 sec: 5568.4, 300 sec: 5541.8). Total num frames: 726734848. Throughput: 0: 5876.7. Samples: 726739482. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:41,757][25689] Avg episode reward: [(0, '-1.170')] [2022-07-10 11:45:42,548][26022] Updated weights on worker 0-0, policy_version 709707 (0.00090) [2022-07-10 11:45:44,293][26022] Updated weights on worker 0-0, policy_version 709717 (0.00092) [2022-07-10 11:45:46,319][26022] Updated weights on worker 0-0, policy_version 709727 (0.00089) [2022-07-10 11:45:46,767][25689] Fps is (10 sec: 5512.0, 60 sec: 5522.7, 300 sec: 5539.4). Total num frames: 726761472. Throughput: 0: 5036.1. Samples: 726756366. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:46,767][25689] Avg episode reward: [(0, '-1.381')] [2022-07-10 11:45:48,163][26022] Updated weights on worker 0-0, policy_version 709737 (0.00090) [2022-07-10 11:45:49,945][26022] Updated weights on worker 0-0, policy_version 709747 (0.00089) [2022-07-10 11:45:51,828][25689] Fps is (10 sec: 5489.5, 60 sec: 5545.6, 300 sec: 5535.3). Total num frames: 726790144. Throughput: 0: 5856.7. Samples: 726789894. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:51,830][25689] Avg episode reward: [(0, '-1.518')] [2022-07-10 11:45:51,972][26022] Updated weights on worker 0-0, policy_version 709757 (0.00086) [2022-07-10 11:45:53,424][26022] Updated weights on worker 0-0, policy_version 709767 (0.00085) [2022-07-10 11:45:55,716][26022] Updated weights on worker 0-0, policy_version 709777 (0.00087) [2022-07-10 11:45:56,851][25689] Fps is (10 sec: 5686.3, 60 sec: 5579.4, 300 sec: 5541.9). Total num frames: 726818816. Throughput: 0: 5835.9. Samples: 726823334. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:45:56,853][25689] Avg episode reward: [(0, '-0.891')] [2022-07-10 11:45:57,305][26022] Updated weights on worker 0-0, policy_version 709787 (0.00088) [2022-07-10 11:45:59,171][26022] Updated weights on worker 0-0, policy_version 709797 (0.00084) [2022-07-10 11:46:01,055][26022] Updated weights on worker 0-0, policy_version 709807 (0.00097) [2022-07-10 11:46:01,854][25689] Fps is (10 sec: 5617.7, 60 sec: 5547.3, 300 sec: 5548.9). Total num frames: 726846464. Throughput: 0: 5020.2. Samples: 726840380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:01,854][25689] Avg episode reward: [(0, '-0.078')] [2022-07-10 11:46:03,182][26022] Updated weights on worker 0-0, policy_version 709817 (0.00087) [2022-07-10 11:46:05,107][26022] Updated weights on worker 0-0, policy_version 709827 (0.00085) [2022-07-10 11:46:06,891][25689] Fps is (10 sec: 5303.8, 60 sec: 5547.5, 300 sec: 5539.3). Total num frames: 726872064. Throughput: 0: 5755.4. Samples: 726872188. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:06,891][25689] Avg episode reward: [(0, '-0.280')] [2022-07-10 11:46:06,914][26022] Updated weights on worker 0-0, policy_version 709837 (0.00085) [2022-07-10 11:46:08,655][26022] Updated weights on worker 0-0, policy_version 709847 (0.00087) [2022-07-10 11:46:10,599][26022] Updated weights on worker 0-0, policy_version 709857 (0.00104) [2022-07-10 11:46:11,999][25689] Fps is (10 sec: 5551.4, 60 sec: 5627.4, 300 sec: 5548.3). Total num frames: 726902784. Throughput: 0: 5742.1. Samples: 726905716. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:12,000][25689] Avg episode reward: [(0, '-0.924')] [2022-07-10 11:46:12,241][26022] Updated weights on worker 0-0, policy_version 709867 (0.00091) [2022-07-10 11:46:14,188][26022] Updated weights on worker 0-0, policy_version 709877 (0.00086) [2022-07-10 11:46:15,978][26022] Updated weights on worker 0-0, policy_version 709887 (0.00096) [2022-07-10 11:46:17,037][25689] Fps is (10 sec: 5651.5, 60 sec: 5560.2, 300 sec: 5547.7). Total num frames: 726929408. Throughput: 0: 5740.9. Samples: 726939222. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:17,043][25689] Avg episode reward: [(0, '-0.782')] [2022-07-10 11:46:17,859][26022] Updated weights on worker 0-0, policy_version 709897 (0.00082) [2022-07-10 11:46:19,856][26022] Updated weights on worker 0-0, policy_version 709907 (0.00095) [2022-07-10 11:46:21,455][26022] Updated weights on worker 0-0, policy_version 709917 (0.00087) [2022-07-10 11:46:22,095][25689] Fps is (10 sec: 5477.1, 60 sec: 5574.2, 300 sec: 5543.8). Total num frames: 726958080. Throughput: 0: 5720.2. Samples: 726956164. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:22,095][25689] Avg episode reward: [(0, '-0.693')] [2022-07-10 11:46:23,284][26022] Updated weights on worker 0-0, policy_version 709927 (0.00087) [2022-07-10 11:46:25,279][26022] Updated weights on worker 0-0, policy_version 709937 (0.00089) [2022-07-10 11:46:26,856][26022] Updated weights on worker 0-0, policy_version 709947 (0.00096) [2022-07-10 11:46:27,102][25689] Fps is (10 sec: 5595.9, 60 sec: 5559.4, 300 sec: 5549.5). Total num frames: 726985728. Throughput: 0: 5834.6. Samples: 726990112. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:27,102][25689] Avg episode reward: [(0, '-0.760')] [2022-07-10 11:46:28,928][26022] Updated weights on worker 0-0, policy_version 709957 (0.00094) [2022-07-10 11:46:29,069][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:46:29,084][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000709958_726996992.pth [2022-07-10 11:46:29,084][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000708009_725001216.pth [2022-07-10 11:46:30,662][26022] Updated weights on worker 0-0, policy_version 709967 (0.00091) [2022-07-10 11:46:32,209][25689] Fps is (10 sec: 5568.4, 60 sec: 5553.6, 300 sec: 5544.6). Total num frames: 727014400. Throughput: 0: 5837.7. Samples: 727023696. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:32,210][25689] Avg episode reward: [(0, '-1.526')] [2022-07-10 11:46:32,622][26022] Updated weights on worker 0-0, policy_version 709977 (0.00091) [2022-07-10 11:46:34,217][26022] Updated weights on worker 0-0, policy_version 709987 (0.00050) [2022-07-10 11:46:36,146][26022] Updated weights on worker 0-0, policy_version 709997 (0.00090) [2022-07-10 11:46:37,236][25689] Fps is (10 sec: 5557.4, 60 sec: 5551.5, 300 sec: 5548.0). Total num frames: 727042048. Throughput: 0: 5011.0. Samples: 727040438. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:37,237][25689] Avg episode reward: [(0, '-1.917')] [2022-07-10 11:46:37,914][26022] Updated weights on worker 0-0, policy_version 710007 (0.00091) [2022-07-10 11:46:39,973][26022] Updated weights on worker 0-0, policy_version 710017 (0.00084) [2022-07-10 11:46:41,791][26022] Updated weights on worker 0-0, policy_version 710027 (0.00087) [2022-07-10 11:46:42,263][25689] Fps is (10 sec: 5601.7, 60 sec: 5550.9, 300 sec: 5551.1). Total num frames: 727070720. Throughput: 0: 5823.2. Samples: 727073608. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:42,264][25689] Avg episode reward: [(0, '-1.922')] [2022-07-10 11:46:43,586][26022] Updated weights on worker 0-0, policy_version 710037 (0.00085) [2022-07-10 11:46:45,450][26022] Updated weights on worker 0-0, policy_version 710047 (0.00092) [2022-07-10 11:46:46,925][26022] Updated weights on worker 0-0, policy_version 710057 (0.00084) [2022-07-10 11:46:47,290][25689] Fps is (10 sec: 5703.5, 60 sec: 5583.3, 300 sec: 5552.6). Total num frames: 727099392. Throughput: 0: 5814.2. Samples: 727107490. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:47,291][25689] Avg episode reward: [(0, '-2.990')] [2022-07-10 11:46:49,081][26022] Updated weights on worker 0-0, policy_version 710067 (0.00080) [2022-07-10 11:46:50,947][26022] Updated weights on worker 0-0, policy_version 710077 (0.00059) [2022-07-10 11:46:52,407][25689] Fps is (10 sec: 5552.4, 60 sec: 5561.3, 300 sec: 5547.4). Total num frames: 727127040. Throughput: 0: 4976.7. Samples: 727124214. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:52,407][25689] Avg episode reward: [(0, '-3.208')] [2022-07-10 11:46:52,649][26022] Updated weights on worker 0-0, policy_version 710087 (0.00080) [2022-07-10 11:46:54,471][26022] Updated weights on worker 0-0, policy_version 710097 (0.00088) [2022-07-10 11:46:56,216][26022] Updated weights on worker 0-0, policy_version 710107 (0.00091) [2022-07-10 11:46:57,415][25689] Fps is (10 sec: 5461.7, 60 sec: 5545.8, 300 sec: 5554.5). Total num frames: 727154688. Throughput: 0: 5826.3. Samples: 727158004. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:46:57,415][25689] Avg episode reward: [(0, '-2.890')] [2022-07-10 11:46:58,035][26022] Updated weights on worker 0-0, policy_version 710117 (0.00092) [2022-07-10 11:46:59,882][26022] Updated weights on worker 0-0, policy_version 710127 (0.00095) [2022-07-10 11:47:01,830][26022] Updated weights on worker 0-0, policy_version 710137 (0.00094) [2022-07-10 11:47:02,439][25689] Fps is (10 sec: 5409.7, 60 sec: 5526.9, 300 sec: 5551.0). Total num frames: 727181312. Throughput: 0: 5738.6. Samples: 727189388. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:02,440][25689] Avg episode reward: [(0, '-2.325')] [2022-07-10 11:47:04,159][26022] Updated weights on worker 0-0, policy_version 710147 (0.00094) [2022-07-10 11:47:05,964][26022] Updated weights on worker 0-0, policy_version 710157 (0.00056) [2022-07-10 11:47:07,510][25689] Fps is (10 sec: 5376.1, 60 sec: 5557.6, 300 sec: 5547.6). Total num frames: 727208960. Throughput: 0: 4867.1. Samples: 727205900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:07,510][25689] Avg episode reward: [(0, '-3.263')] [2022-07-10 11:47:07,817][26022] Updated weights on worker 0-0, policy_version 710167 (0.00294) [2022-07-10 11:47:09,612][26022] Updated weights on worker 0-0, policy_version 710177 (0.00089) [2022-07-10 11:47:11,358][26022] Updated weights on worker 0-0, policy_version 710187 (0.00085) [2022-07-10 11:47:12,594][25689] Fps is (10 sec: 5546.2, 60 sec: 5526.0, 300 sec: 5556.9). Total num frames: 727237632. Throughput: 0: 5679.8. Samples: 727238872. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:12,594][25689] Avg episode reward: [(0, '-2.351')] [2022-07-10 11:47:13,439][26022] Updated weights on worker 0-0, policy_version 710197 (0.00089) [2022-07-10 11:47:15,087][26022] Updated weights on worker 0-0, policy_version 710207 (0.00099) [2022-07-10 11:47:17,052][26022] Updated weights on worker 0-0, policy_version 710217 (0.00092) [2022-07-10 11:47:17,602][25689] Fps is (10 sec: 5580.7, 60 sec: 5545.7, 300 sec: 5550.1). Total num frames: 727265280. Throughput: 0: 5670.1. Samples: 727272466. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:17,603][25689] Avg episode reward: [(0, '-2.068')] [2022-07-10 11:47:18,723][26022] Updated weights on worker 0-0, policy_version 710227 (0.00087) [2022-07-10 11:47:20,696][26022] Updated weights on worker 0-0, policy_version 710237 (0.00090) [2022-07-10 11:47:22,443][26022] Updated weights on worker 0-0, policy_version 710247 (0.00091) [2022-07-10 11:47:22,650][25689] Fps is (10 sec: 5600.7, 60 sec: 5546.6, 300 sec: 5553.0). Total num frames: 727293952. Throughput: 0: 4941.1. Samples: 727289250. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:22,650][25689] Avg episode reward: [(0, '-1.916')] [2022-07-10 11:47:24,383][26022] Updated weights on worker 0-0, policy_version 710257 (0.00054) [2022-07-10 11:47:26,068][26022] Updated weights on worker 0-0, policy_version 710267 (0.00096) [2022-07-10 11:47:27,685][25689] Fps is (10 sec: 5484.0, 60 sec: 5527.1, 300 sec: 5544.6). Total num frames: 727320576. Throughput: 0: 5794.5. Samples: 727322804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:27,685][25689] Avg episode reward: [(0, '-2.102')] [2022-07-10 11:47:27,966][26022] Updated weights on worker 0-0, policy_version 710277 (0.00084) [2022-07-10 11:47:29,745][26022] Updated weights on worker 0-0, policy_version 710287 (0.00088) [2022-07-10 11:47:31,737][26022] Updated weights on worker 0-0, policy_version 710297 (0.00085) [2022-07-10 11:47:32,776][25689] Fps is (10 sec: 5561.7, 60 sec: 5545.5, 300 sec: 5550.2). Total num frames: 727350272. Throughput: 0: 5818.6. Samples: 727356304. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:32,777][25689] Avg episode reward: [(0, '-2.581')] [2022-07-10 11:47:33,377][26022] Updated weights on worker 0-0, policy_version 710307 (0.00091) [2022-07-10 11:47:35,432][26022] Updated weights on worker 0-0, policy_version 710317 (0.00080) [2022-07-10 11:47:37,257][26022] Updated weights on worker 0-0, policy_version 710327 (0.00097) [2022-07-10 11:47:37,818][25689] Fps is (10 sec: 5659.2, 60 sec: 5544.1, 300 sec: 5550.1). Total num frames: 727377920. Throughput: 0: 4976.4. Samples: 727373070. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:37,818][25689] Avg episode reward: [(0, '-1.864')] [2022-07-10 11:47:38,983][26022] Updated weights on worker 0-0, policy_version 710337 (0.00098) [2022-07-10 11:47:40,763][26022] Updated weights on worker 0-0, policy_version 710347 (0.00084) [2022-07-10 11:47:42,496][26022] Updated weights on worker 0-0, policy_version 710357 (0.00089) [2022-07-10 11:47:42,843][25689] Fps is (10 sec: 5594.4, 60 sec: 5544.3, 300 sec: 5555.0). Total num frames: 727406592. Throughput: 0: 5813.5. Samples: 727406646. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:42,844][25689] Avg episode reward: [(0, '-1.160')] [2022-07-10 11:47:44,517][26022] Updated weights on worker 0-0, policy_version 710367 (0.00084) [2022-07-10 11:47:46,347][26022] Updated weights on worker 0-0, policy_version 710377 (0.00088) [2022-07-10 11:47:47,902][25689] Fps is (10 sec: 5585.0, 60 sec: 5524.5, 300 sec: 5555.6). Total num frames: 727434240. Throughput: 0: 5820.5. Samples: 727440478. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:47,902][25689] Avg episode reward: [(0, '-0.493')] [2022-07-10 11:47:48,125][26022] Updated weights on worker 0-0, policy_version 710387 (0.00082) [2022-07-10 11:47:50,035][26022] Updated weights on worker 0-0, policy_version 710397 (0.00092) [2022-07-10 11:47:51,786][26022] Updated weights on worker 0-0, policy_version 710407 (0.00081) [2022-07-10 11:47:52,975][25689] Fps is (10 sec: 5457.8, 60 sec: 5528.4, 300 sec: 5551.1). Total num frames: 727461888. Throughput: 0: 5000.0. Samples: 727457298. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:52,975][25689] Avg episode reward: [(0, '-0.937')] [2022-07-10 11:47:53,661][26022] Updated weights on worker 0-0, policy_version 710417 (0.00090) [2022-07-10 11:47:55,312][26022] Updated weights on worker 0-0, policy_version 710427 (0.00090) [2022-07-10 11:47:57,261][26022] Updated weights on worker 0-0, policy_version 710437 (0.00093) [2022-07-10 11:47:57,997][25689] Fps is (10 sec: 5578.9, 60 sec: 5544.1, 300 sec: 5554.4). Total num frames: 727490560. Throughput: 0: 5846.3. Samples: 727491044. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:47:57,997][25689] Avg episode reward: [(0, '-1.088')] [2022-07-10 11:47:59,161][26022] Updated weights on worker 0-0, policy_version 710447 (0.00105) [2022-07-10 11:48:00,899][26022] Updated weights on worker 0-0, policy_version 710457 (0.00088) [2022-07-10 11:48:02,972][26022] Updated weights on worker 0-0, policy_version 710467 (0.00085) [2022-07-10 11:48:03,071][25689] Fps is (10 sec: 5578.2, 60 sec: 5556.4, 300 sec: 5560.4). Total num frames: 727518208. Throughput: 0: 5725.5. Samples: 727522462. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 11:48:03,072][25689] Avg episode reward: [(0, '-1.167')] [2022-07-10 11:48:05,204][26022] Updated weights on worker 0-0, policy_version 710477 (0.00089) [2022-07-10 11:48:06,772][26022] Updated weights on worker 0-0, policy_version 710487 (0.00091) [2022-07-10 11:48:08,134][25689] Fps is (10 sec: 5353.6, 60 sec: 5540.2, 300 sec: 5560.1). Total num frames: 727544832. Throughput: 0: 4874.1. Samples: 727539094. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:08,135][25689] Avg episode reward: [(0, '-2.708')] [2022-07-10 11:48:08,897][26022] Updated weights on worker 0-0, policy_version 710497 (0.00049) [2022-07-10 11:48:10,583][26022] Updated weights on worker 0-0, policy_version 710507 (0.00082) [2022-07-10 11:48:12,455][26022] Updated weights on worker 0-0, policy_version 710517 (0.00091) [2022-07-10 11:48:13,221][25689] Fps is (10 sec: 5548.9, 60 sec: 5556.8, 300 sec: 5555.2). Total num frames: 727574528. Throughput: 0: 5697.5. Samples: 727572650. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:13,222][25689] Avg episode reward: [(0, '-2.535')] [2022-07-10 11:48:14,210][26022] Updated weights on worker 0-0, policy_version 710527 (0.00097) [2022-07-10 11:48:16,142][26022] Updated weights on worker 0-0, policy_version 710537 (0.00092) [2022-07-10 11:48:17,897][26022] Updated weights on worker 0-0, policy_version 710547 (0.00087) [2022-07-10 11:48:18,275][25689] Fps is (10 sec: 5554.0, 60 sec: 5535.8, 300 sec: 5551.1). Total num frames: 727601152. Throughput: 0: 5667.3. Samples: 727605964. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:18,275][25689] Avg episode reward: [(0, '-3.508')] [2022-07-10 11:48:19,740][26022] Updated weights on worker 0-0, policy_version 710557 (0.00084) [2022-07-10 11:48:21,527][26022] Updated weights on worker 0-0, policy_version 710567 (0.00085) [2022-07-10 11:48:23,294][25689] Fps is (10 sec: 5489.9, 60 sec: 5538.4, 300 sec: 5554.5). Total num frames: 727629824. Throughput: 0: 4958.4. Samples: 727622732. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:23,294][25689] Avg episode reward: [(0, '-4.335')] [2022-07-10 11:48:23,537][26022] Updated weights on worker 0-0, policy_version 710577 (0.00400) [2022-07-10 11:48:25,232][26022] Updated weights on worker 0-0, policy_version 710587 (0.00090) [2022-07-10 11:48:26,975][26022] Updated weights on worker 0-0, policy_version 710597 (0.00092) [2022-07-10 11:48:28,302][25689] Fps is (10 sec: 5718.8, 60 sec: 5574.7, 300 sec: 5556.4). Total num frames: 727658496. Throughput: 0: 5804.2. Samples: 727656152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:28,311][25689] Avg episode reward: [(0, '-4.814')] [2022-07-10 11:48:29,215][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:48:29,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000710607_727661568.pth [2022-07-10 11:48:29,226][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000708656_725663744.pth [2022-07-10 11:48:29,232][26022] Updated weights on worker 0-0, policy_version 710607 (0.00091) [2022-07-10 11:48:30,775][26022] Updated weights on worker 0-0, policy_version 710617 (0.00094) [2022-07-10 11:48:32,832][26022] Updated weights on worker 0-0, policy_version 710627 (0.00092) [2022-07-10 11:48:33,451][25689] Fps is (10 sec: 5544.7, 60 sec: 5535.6, 300 sec: 5557.1). Total num frames: 727686144. Throughput: 0: 5796.6. Samples: 727689914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:33,451][25689] Avg episode reward: [(0, '-5.293')] [2022-07-10 11:48:34,320][26022] Updated weights on worker 0-0, policy_version 710637 (0.00081) [2022-07-10 11:48:36,219][26022] Updated weights on worker 0-0, policy_version 710647 (0.00097) [2022-07-10 11:48:38,070][26022] Updated weights on worker 0-0, policy_version 710657 (0.00094) [2022-07-10 11:48:38,473][25689] Fps is (10 sec: 5436.8, 60 sec: 5537.4, 300 sec: 5550.5). Total num frames: 727713792. Throughput: 0: 5843.4. Samples: 727723988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:38,473][25689] Avg episode reward: [(0, '-6.289')] [2022-07-10 11:48:39,770][26022] Updated weights on worker 0-0, policy_version 710667 (0.00085) [2022-07-10 11:48:41,875][26022] Updated weights on worker 0-0, policy_version 710677 (0.00087) [2022-07-10 11:48:43,519][25689] Fps is (10 sec: 5593.9, 60 sec: 5535.5, 300 sec: 5550.4). Total num frames: 727742464. Throughput: 0: 5826.1. Samples: 727740568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:43,520][25689] Avg episode reward: [(0, '-6.049')] [2022-07-10 11:48:43,550][26022] Updated weights on worker 0-0, policy_version 710687 (0.00092) [2022-07-10 11:48:45,487][26022] Updated weights on worker 0-0, policy_version 710697 (0.00085) [2022-07-10 11:48:47,131][26022] Updated weights on worker 0-0, policy_version 710707 (0.00959) [2022-07-10 11:48:48,556][25689] Fps is (10 sec: 5585.3, 60 sec: 5537.5, 300 sec: 5552.2). Total num frames: 727770112. Throughput: 0: 5813.1. Samples: 727773892. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:48,558][25689] Avg episode reward: [(0, '-4.783')] [2022-07-10 11:48:49,096][26022] Updated weights on worker 0-0, policy_version 710717 (0.00089) [2022-07-10 11:48:50,963][26022] Updated weights on worker 0-0, policy_version 710727 (0.00089) [2022-07-10 11:48:52,856][26022] Updated weights on worker 0-0, policy_version 710737 (0.00081) [2022-07-10 11:48:53,621][25689] Fps is (10 sec: 5676.6, 60 sec: 5572.0, 300 sec: 5552.6). Total num frames: 727799808. Throughput: 0: 5809.2. Samples: 727807086. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:53,621][25689] Avg episode reward: [(0, '-4.632')] [2022-07-10 11:48:54,736][26022] Updated weights on worker 0-0, policy_version 710747 (0.00087) [2022-07-10 11:48:56,579][26022] Updated weights on worker 0-0, policy_version 710757 (0.00100) [2022-07-10 11:48:58,389][26022] Updated weights on worker 0-0, policy_version 710767 (0.00096) [2022-07-10 11:48:58,714][25689] Fps is (10 sec: 5544.6, 60 sec: 5531.8, 300 sec: 5551.4). Total num frames: 727826432. Throughput: 0: 4936.8. Samples: 727823914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:48:58,714][25689] Avg episode reward: [(0, '-3.736')] [2022-07-10 11:49:00,175][26022] Updated weights on worker 0-0, policy_version 710777 (0.00088) [2022-07-10 11:49:02,419][26022] Updated weights on worker 0-0, policy_version 710787 (0.00085) [2022-07-10 11:49:03,767][25689] Fps is (10 sec: 5248.4, 60 sec: 5516.9, 300 sec: 5547.6). Total num frames: 727853056. Throughput: 0: 5667.8. Samples: 727855326. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:03,768][25689] Avg episode reward: [(0, '-3.160')] [2022-07-10 11:49:04,162][26022] Updated weights on worker 0-0, policy_version 710797 (0.00090) [2022-07-10 11:49:05,829][26022] Updated weights on worker 0-0, policy_version 710807 (0.00090) [2022-07-10 11:49:07,846][26022] Updated weights on worker 0-0, policy_version 710817 (0.00088) [2022-07-10 11:49:08,856][25689] Fps is (10 sec: 5452.2, 60 sec: 5548.2, 300 sec: 5550.4). Total num frames: 727881728. Throughput: 0: 5690.0. Samples: 727889396. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:08,857][25689] Avg episode reward: [(0, '-2.655')] [2022-07-10 11:49:09,733][26022] Updated weights on worker 0-0, policy_version 710827 (0.00093) [2022-07-10 11:49:11,537][26022] Updated weights on worker 0-0, policy_version 710837 (0.00077) [2022-07-10 11:49:13,243][26022] Updated weights on worker 0-0, policy_version 710847 (0.00093) [2022-07-10 11:49:13,940][25689] Fps is (10 sec: 5536.0, 60 sec: 5514.7, 300 sec: 5549.1). Total num frames: 727909376. Throughput: 0: 4881.3. Samples: 727906270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:13,942][25689] Avg episode reward: [(0, '-2.651')] [2022-07-10 11:49:15,150][26022] Updated weights on worker 0-0, policy_version 710857 (0.00088) [2022-07-10 11:49:17,092][26022] Updated weights on worker 0-0, policy_version 710867 (0.00086) [2022-07-10 11:49:18,880][26022] Updated weights on worker 0-0, policy_version 710877 (0.00093) [2022-07-10 11:49:18,976][25689] Fps is (10 sec: 5564.9, 60 sec: 5550.1, 300 sec: 5548.8). Total num frames: 727938048. Throughput: 0: 5705.8. Samples: 727939528. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:18,978][25689] Avg episode reward: [(0, '-3.131')] [2022-07-10 11:49:20,805][26022] Updated weights on worker 0-0, policy_version 710887 (0.00086) [2022-07-10 11:49:22,475][26022] Updated weights on worker 0-0, policy_version 710897 (0.00093) [2022-07-10 11:49:24,012][25689] Fps is (10 sec: 5693.3, 60 sec: 5548.5, 300 sec: 5551.9). Total num frames: 727966720. Throughput: 0: 5830.1. Samples: 727973360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:24,014][25689] Avg episode reward: [(0, '-3.994')] [2022-07-10 11:49:24,303][26022] Updated weights on worker 0-0, policy_version 710907 (0.00086) [2022-07-10 11:49:26,226][26022] Updated weights on worker 0-0, policy_version 710917 (0.00086) [2022-07-10 11:49:27,838][26022] Updated weights on worker 0-0, policy_version 710927 (0.00091) [2022-07-10 11:49:29,036][25689] Fps is (10 sec: 5598.6, 60 sec: 5530.2, 300 sec: 5548.8). Total num frames: 727994368. Throughput: 0: 4981.9. Samples: 727989932. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:29,036][25689] Avg episode reward: [(0, '-3.118')] [2022-07-10 11:49:29,904][26022] Updated weights on worker 0-0, policy_version 710937 (0.00083) [2022-07-10 11:49:31,536][26022] Updated weights on worker 0-0, policy_version 710947 (0.00096) [2022-07-10 11:49:33,640][26022] Updated weights on worker 0-0, policy_version 710957 (0.00085) [2022-07-10 11:49:34,095][25689] Fps is (10 sec: 5585.5, 60 sec: 5555.3, 300 sec: 5545.0). Total num frames: 728023040. Throughput: 0: 5814.4. Samples: 728023462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:34,096][25689] Avg episode reward: [(0, '-3.366')] [2022-07-10 11:49:35,348][26022] Updated weights on worker 0-0, policy_version 710967 (0.00108) [2022-07-10 11:49:37,336][26022] Updated weights on worker 0-0, policy_version 710977 (0.00084) [2022-07-10 11:49:38,868][26022] Updated weights on worker 0-0, policy_version 710987 (0.00093) [2022-07-10 11:49:39,146][25689] Fps is (10 sec: 5671.8, 60 sec: 5569.5, 300 sec: 5551.5). Total num frames: 728051712. Throughput: 0: 5834.0. Samples: 728057200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:39,147][25689] Avg episode reward: [(0, '-1.504')] [2022-07-10 11:49:40,749][26022] Updated weights on worker 0-0, policy_version 710997 (0.00108) [2022-07-10 11:49:42,644][26022] Updated weights on worker 0-0, policy_version 711007 (0.00091) [2022-07-10 11:49:44,183][25689] Fps is (10 sec: 5583.2, 60 sec: 5553.5, 300 sec: 5545.2). Total num frames: 728079360. Throughput: 0: 4995.2. Samples: 728074116. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:44,184][25689] Avg episode reward: [(0, '-1.161')] [2022-07-10 11:49:44,706][26022] Updated weights on worker 0-0, policy_version 711017 (0.00085) [2022-07-10 11:49:46,342][26022] Updated weights on worker 0-0, policy_version 711027 (0.00090) [2022-07-10 11:49:48,298][26022] Updated weights on worker 0-0, policy_version 711037 (0.00088) [2022-07-10 11:49:49,191][25689] Fps is (10 sec: 5403.0, 60 sec: 5539.3, 300 sec: 5544.0). Total num frames: 728105984. Throughput: 0: 5822.6. Samples: 728107288. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:49,193][25689] Avg episode reward: [(0, '-0.689')] [2022-07-10 11:49:50,031][26022] Updated weights on worker 0-0, policy_version 711047 (0.00092) [2022-07-10 11:49:51,852][26022] Updated weights on worker 0-0, policy_version 711057 (0.00093) [2022-07-10 11:49:53,697][26022] Updated weights on worker 0-0, policy_version 711067 (0.00085) [2022-07-10 11:49:54,248][25689] Fps is (10 sec: 5595.3, 60 sec: 5539.9, 300 sec: 5553.7). Total num frames: 728135680. Throughput: 0: 5819.0. Samples: 728140732. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:54,249][25689] Avg episode reward: [(0, '-1.793')] [2022-07-10 11:49:55,575][26022] Updated weights on worker 0-0, policy_version 711077 (0.00083) [2022-07-10 11:49:57,494][26022] Updated weights on worker 0-0, policy_version 711087 (0.00086) [2022-07-10 11:49:59,196][26022] Updated weights on worker 0-0, policy_version 711097 (0.00091) [2022-07-10 11:49:59,292][25689] Fps is (10 sec: 5676.9, 60 sec: 5561.3, 300 sec: 5546.4). Total num frames: 728163328. Throughput: 0: 4977.1. Samples: 728157470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:49:59,293][25689] Avg episode reward: [(0, '-1.624')] [2022-07-10 11:50:01,103][26022] Updated weights on worker 0-0, policy_version 711107 (0.00093) [2022-07-10 11:50:03,378][26022] Updated weights on worker 0-0, policy_version 711117 (0.00082) [2022-07-10 11:50:04,308][25689] Fps is (10 sec: 5293.3, 60 sec: 5547.8, 300 sec: 5546.8). Total num frames: 728188928. Throughput: 0: 5723.3. Samples: 728189298. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:04,308][25689] Avg episode reward: [(0, '-1.441')] [2022-07-10 11:50:05,153][26022] Updated weights on worker 0-0, policy_version 711127 (0.00092) [2022-07-10 11:50:07,010][26022] Updated weights on worker 0-0, policy_version 711137 (0.00085) [2022-07-10 11:50:09,009][26022] Updated weights on worker 0-0, policy_version 711147 (0.00092) [2022-07-10 11:50:09,347][25689] Fps is (10 sec: 5295.9, 60 sec: 5535.5, 300 sec: 5554.0). Total num frames: 728216576. Throughput: 0: 5702.5. Samples: 728222226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:09,347][25689] Avg episode reward: [(0, '-1.582')] [2022-07-10 11:50:10,839][26022] Updated weights on worker 0-0, policy_version 711157 (0.00083) [2022-07-10 11:50:12,652][26022] Updated weights on worker 0-0, policy_version 711167 (0.00092) [2022-07-10 11:50:14,495][25689] Fps is (10 sec: 5428.2, 60 sec: 5529.7, 300 sec: 5541.8). Total num frames: 728244224. Throughput: 0: 4849.5. Samples: 728238916. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:14,495][25689] Avg episode reward: [(0, '-2.345')] [2022-07-10 11:50:14,712][26022] Updated weights on worker 0-0, policy_version 711177 (0.00104) [2022-07-10 11:50:16,274][26022] Updated weights on worker 0-0, policy_version 711187 (0.00091) [2022-07-10 11:50:18,401][26022] Updated weights on worker 0-0, policy_version 711197 (0.00094) [2022-07-10 11:50:19,536][25689] Fps is (10 sec: 5627.9, 60 sec: 5546.1, 300 sec: 5548.3). Total num frames: 728273920. Throughput: 0: 5644.2. Samples: 728271730. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:19,536][25689] Avg episode reward: [(0, '-1.482')] [2022-07-10 11:50:19,823][26022] Updated weights on worker 0-0, policy_version 711207 (0.00091) [2022-07-10 11:50:22,013][26022] Updated weights on worker 0-0, policy_version 711217 (0.00083) [2022-07-10 11:50:23,635][26022] Updated weights on worker 0-0, policy_version 711227 (0.00095) [2022-07-10 11:50:24,568][25689] Fps is (10 sec: 5692.7, 60 sec: 5529.6, 300 sec: 5544.9). Total num frames: 728301568. Throughput: 0: 5723.5. Samples: 728305256. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:24,568][25689] Avg episode reward: [(0, '-1.252')] [2022-07-10 11:50:25,671][26022] Updated weights on worker 0-0, policy_version 711237 (0.00083) [2022-07-10 11:50:27,297][26022] Updated weights on worker 0-0, policy_version 711247 (0.00630) [2022-07-10 11:50:29,285][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:50:29,303][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000711257_728327168.pth [2022-07-10 11:50:29,303][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000709307_726330368.pth [2022-07-10 11:50:29,307][26022] Updated weights on worker 0-0, policy_version 711257 (0.00091) [2022-07-10 11:50:29,575][25689] Fps is (10 sec: 5406.2, 60 sec: 5514.2, 300 sec: 5538.7). Total num frames: 728328192. Throughput: 0: 4926.2. Samples: 728321874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:29,575][25689] Avg episode reward: [(0, '-0.803')] [2022-07-10 11:50:31,065][26022] Updated weights on worker 0-0, policy_version 711267 (0.00085) [2022-07-10 11:50:32,988][26022] Updated weights on worker 0-0, policy_version 711277 (0.00084) [2022-07-10 11:50:34,625][25689] Fps is (10 sec: 5498.3, 60 sec: 5515.1, 300 sec: 5541.3). Total num frames: 728356864. Throughput: 0: 5786.2. Samples: 728355394. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:34,625][25689] Avg episode reward: [(0, '-0.767')] [2022-07-10 11:50:34,745][26022] Updated weights on worker 0-0, policy_version 711287 (0.00096) [2022-07-10 11:50:36,671][26022] Updated weights on worker 0-0, policy_version 711297 (0.00088) [2022-07-10 11:50:38,475][26022] Updated weights on worker 0-0, policy_version 711307 (0.00082) [2022-07-10 11:50:39,631][25689] Fps is (10 sec: 5600.4, 60 sec: 5502.2, 300 sec: 5538.1). Total num frames: 728384512. Throughput: 0: 5839.2. Samples: 728389070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:39,632][25689] Avg episode reward: [(0, '-2.191')] [2022-07-10 11:50:40,247][26022] Updated weights on worker 0-0, policy_version 711317 (0.00075) [2022-07-10 11:50:42,085][26022] Updated weights on worker 0-0, policy_version 711327 (0.00093) [2022-07-10 11:50:43,940][26022] Updated weights on worker 0-0, policy_version 711337 (0.00084) [2022-07-10 11:50:44,643][25689] Fps is (10 sec: 5519.5, 60 sec: 5504.4, 300 sec: 5541.5). Total num frames: 728412160. Throughput: 0: 5004.0. Samples: 728405712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:44,645][25689] Avg episode reward: [(0, '-2.879')] [2022-07-10 11:50:45,748][26022] Updated weights on worker 0-0, policy_version 711347 (0.00092) [2022-07-10 11:50:47,691][26022] Updated weights on worker 0-0, policy_version 711357 (0.00081) [2022-07-10 11:50:49,423][26022] Updated weights on worker 0-0, policy_version 711367 (0.00087) [2022-07-10 11:50:49,659][25689] Fps is (10 sec: 5616.4, 60 sec: 5537.6, 300 sec: 5542.4). Total num frames: 728440832. Throughput: 0: 5827.5. Samples: 728438914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:49,659][25689] Avg episode reward: [(0, '-2.846')] [2022-07-10 11:50:51,375][26022] Updated weights on worker 0-0, policy_version 711377 (0.00082) [2022-07-10 11:50:53,295][26022] Updated weights on worker 0-0, policy_version 711387 (0.00990) [2022-07-10 11:50:54,696][25689] Fps is (10 sec: 5602.1, 60 sec: 5505.5, 300 sec: 5538.7). Total num frames: 728468480. Throughput: 0: 5816.1. Samples: 728472134. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:54,697][25689] Avg episode reward: [(0, '-2.276')] [2022-07-10 11:50:55,071][26022] Updated weights on worker 0-0, policy_version 711397 (0.00088) [2022-07-10 11:50:57,023][26022] Updated weights on worker 0-0, policy_version 711407 (0.00092) [2022-07-10 11:50:58,762][26022] Updated weights on worker 0-0, policy_version 711417 (0.00087) [2022-07-10 11:50:59,721][25689] Fps is (10 sec: 5495.6, 60 sec: 5507.3, 300 sec: 5538.3). Total num frames: 728496128. Throughput: 0: 5786.6. Samples: 728505320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:50:59,721][25689] Avg episode reward: [(0, '-2.962')] [2022-07-10 11:51:00,699][26022] Updated weights on worker 0-0, policy_version 711427 (0.00090) [2022-07-10 11:51:02,855][26022] Updated weights on worker 0-0, policy_version 711437 (0.00093) [2022-07-10 11:51:04,629][26022] Updated weights on worker 0-0, policy_version 711447 (0.00084) [2022-07-10 11:51:04,725][25689] Fps is (10 sec: 5309.6, 60 sec: 5508.3, 300 sec: 5538.9). Total num frames: 728521728. Throughput: 0: 5686.6. Samples: 728519912. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:51:04,727][25689] Avg episode reward: [(0, '-2.687')] [2022-07-10 11:51:06,420][26022] Updated weights on worker 0-0, policy_version 711457 (0.00090) [2022-07-10 11:51:08,295][26022] Updated weights on worker 0-0, policy_version 711467 (0.00089) [2022-07-10 11:51:09,761][25689] Fps is (10 sec: 5303.5, 60 sec: 5508.6, 300 sec: 5529.9). Total num frames: 728549376. Throughput: 0: 5702.3. Samples: 728553542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:51:09,762][25689] Avg episode reward: [(0, '-3.004')] [2022-07-10 11:51:10,123][26022] Updated weights on worker 0-0, policy_version 711477 (0.00090) [2022-07-10 11:51:12,010][26022] Updated weights on worker 0-0, policy_version 711487 (0.00086) [2022-07-10 11:51:13,800][26022] Updated weights on worker 0-0, policy_version 711497 (0.00088) [2022-07-10 11:51:14,862][25689] Fps is (10 sec: 5556.0, 60 sec: 5529.9, 300 sec: 5535.6). Total num frames: 728578048. Throughput: 0: 5710.4. Samples: 728587288. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:51:14,862][25689] Avg episode reward: [(0, '-1.984')] [2022-07-10 11:51:15,660][26022] Updated weights on worker 0-0, policy_version 711507 (0.00090) [2022-07-10 11:51:17,483][26022] Updated weights on worker 0-0, policy_version 711517 (0.00092) [2022-07-10 11:51:19,366][26022] Updated weights on worker 0-0, policy_version 711527 (0.00084) [2022-07-10 11:51:19,935][25689] Fps is (10 sec: 5636.0, 60 sec: 5510.0, 300 sec: 5535.3). Total num frames: 728606720. Throughput: 0: 4878.3. Samples: 728603934. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:51:19,936][25689] Avg episode reward: [(0, '-3.284')] [2022-07-10 11:51:21,224][26022] Updated weights on worker 0-0, policy_version 711537 (0.00091) [2022-07-10 11:51:23,044][26022] Updated weights on worker 0-0, policy_version 711547 (0.00085) [2022-07-10 11:51:24,908][26022] Updated weights on worker 0-0, policy_version 711557 (0.00089) [2022-07-10 11:51:24,977][25689] Fps is (10 sec: 5567.7, 60 sec: 5509.0, 300 sec: 5534.7). Total num frames: 728634368. Throughput: 0: 5799.1. Samples: 728637356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 11:51:24,978][25689] Avg episode reward: [(0, '-3.874')] [2022-07-10 11:51:26,620][26022] Updated weights on worker 0-0, policy_version 711567 (0.00086) [2022-07-10 11:51:28,547][26022] Updated weights on worker 0-0, policy_version 711577 (0.00084) [2022-07-10 11:51:30,066][25689] Fps is (10 sec: 5559.3, 60 sec: 5535.5, 300 sec: 5535.0). Total num frames: 728663040. Throughput: 0: 5772.4. Samples: 728670752. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:51:30,067][25689] Avg episode reward: [(0, '-2.647')] [2022-07-10 11:51:30,300][26022] Updated weights on worker 0-0, policy_version 711587 (0.00084) [2022-07-10 11:51:32,334][26022] Updated weights on worker 0-0, policy_version 711597 (0.00084) [2022-07-10 11:51:34,096][26022] Updated weights on worker 0-0, policy_version 711607 (0.00097) [2022-07-10 11:51:35,153][25689] Fps is (10 sec: 5534.7, 60 sec: 5515.2, 300 sec: 5533.9). Total num frames: 728690688. Throughput: 0: 4933.2. Samples: 728687388. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:51:35,154][25689] Avg episode reward: [(0, '-1.851')] [2022-07-10 11:51:35,903][26022] Updated weights on worker 0-0, policy_version 711617 (0.00091) [2022-07-10 11:51:37,666][26022] Updated weights on worker 0-0, policy_version 711627 (0.00088) [2022-07-10 11:51:39,675][26022] Updated weights on worker 0-0, policy_version 711637 (0.00083) [2022-07-10 11:51:40,171][25689] Fps is (10 sec: 5573.7, 60 sec: 5531.1, 300 sec: 5534.1). Total num frames: 728719360. Throughput: 0: 5788.1. Samples: 728721058. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:51:40,171][25689] Avg episode reward: [(0, '-1.705')] [2022-07-10 11:51:41,210][26022] Updated weights on worker 0-0, policy_version 711647 (0.00090) [2022-07-10 11:51:43,351][26022] Updated weights on worker 0-0, policy_version 711657 (0.00089) [2022-07-10 11:51:44,774][26022] Updated weights on worker 0-0, policy_version 711667 (0.00428) [2022-07-10 11:51:45,203][25689] Fps is (10 sec: 5706.1, 60 sec: 5546.1, 300 sec: 5534.0). Total num frames: 728748032. Throughput: 0: 5821.2. Samples: 728755092. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:51:45,203][25689] Avg episode reward: [(0, '-2.648')] [2022-07-10 11:51:46,948][26022] Updated weights on worker 0-0, policy_version 711677 (0.00084) [2022-07-10 11:51:48,678][26022] Updated weights on worker 0-0, policy_version 711687 (0.00090) [2022-07-10 11:51:50,205][25689] Fps is (10 sec: 5510.7, 60 sec: 5513.6, 300 sec: 5532.7). Total num frames: 728774656. Throughput: 0: 5020.2. Samples: 728771852. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:51:50,206][25689] Avg episode reward: [(0, '-1.363')] [2022-07-10 11:51:50,463][26022] Updated weights on worker 0-0, policy_version 711697 (0.00087) [2022-07-10 11:51:52,322][26022] Updated weights on worker 0-0, policy_version 711707 (0.00084) [2022-07-10 11:51:54,284][26022] Updated weights on worker 0-0, policy_version 711717 (0.00093) [2022-07-10 11:51:55,247][25689] Fps is (10 sec: 5607.0, 60 sec: 5546.9, 300 sec: 5538.9). Total num frames: 728804352. Throughput: 0: 5862.4. Samples: 728805188. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:51:55,248][25689] Avg episode reward: [(0, '-1.551')] [2022-07-10 11:51:55,939][26022] Updated weights on worker 0-0, policy_version 711727 (0.00088) [2022-07-10 11:51:58,066][26022] Updated weights on worker 0-0, policy_version 711737 (0.00085) [2022-07-10 11:51:59,536][26022] Updated weights on worker 0-0, policy_version 711747 (0.00089) [2022-07-10 11:52:00,313][25689] Fps is (10 sec: 5571.7, 60 sec: 5526.2, 300 sec: 5538.1). Total num frames: 728830976. Throughput: 0: 5844.7. Samples: 728838784. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:00,323][25689] Avg episode reward: [(0, '-3.044')] [2022-07-10 11:52:01,824][26022] Updated weights on worker 0-0, policy_version 711757 (0.00089) [2022-07-10 11:52:03,771][26022] Updated weights on worker 0-0, policy_version 711767 (0.00084) [2022-07-10 11:52:05,343][25689] Fps is (10 sec: 5173.0, 60 sec: 5523.9, 300 sec: 5532.0). Total num frames: 728856576. Throughput: 0: 4877.1. Samples: 728853318. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:05,344][25689] Avg episode reward: [(0, '-2.739')] [2022-07-10 11:52:05,631][26022] Updated weights on worker 0-0, policy_version 711777 (0.00522) [2022-07-10 11:52:07,646][26022] Updated weights on worker 0-0, policy_version 711787 (0.00295) [2022-07-10 11:52:09,215][26022] Updated weights on worker 0-0, policy_version 711797 (0.00080) [2022-07-10 11:52:10,360][25689] Fps is (10 sec: 5401.8, 60 sec: 5542.5, 300 sec: 5533.3). Total num frames: 728885248. Throughput: 0: 5682.5. Samples: 728886384. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:10,361][25689] Avg episode reward: [(0, '-1.917')] [2022-07-10 11:52:11,378][26022] Updated weights on worker 0-0, policy_version 711807 (0.00084) [2022-07-10 11:52:13,024][26022] Updated weights on worker 0-0, policy_version 711817 (0.00090) [2022-07-10 11:52:14,745][26022] Updated weights on worker 0-0, policy_version 711827 (0.00090) [2022-07-10 11:52:15,434][25689] Fps is (10 sec: 5682.6, 60 sec: 5545.0, 300 sec: 5535.5). Total num frames: 728913920. Throughput: 0: 5686.9. Samples: 728919988. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:15,435][25689] Avg episode reward: [(0, '-1.754')] [2022-07-10 11:52:16,755][26022] Updated weights on worker 0-0, policy_version 711837 (0.00093) [2022-07-10 11:52:18,374][26022] Updated weights on worker 0-0, policy_version 711847 (0.00088) [2022-07-10 11:52:20,447][25689] Fps is (10 sec: 5482.2, 60 sec: 5516.7, 300 sec: 5529.2). Total num frames: 728940544. Throughput: 0: 4858.5. Samples: 728936602. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:20,447][25689] Avg episode reward: [(0, '-1.237')] [2022-07-10 11:52:20,602][26022] Updated weights on worker 0-0, policy_version 711857 (0.00092) [2022-07-10 11:52:22,214][26022] Updated weights on worker 0-0, policy_version 711867 (0.00089) [2022-07-10 11:52:24,069][26022] Updated weights on worker 0-0, policy_version 711877 (0.00086) [2022-07-10 11:52:25,450][25689] Fps is (10 sec: 5623.2, 60 sec: 5554.1, 300 sec: 5540.2). Total num frames: 728970240. Throughput: 0: 5798.4. Samples: 728969906. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:25,450][25689] Avg episode reward: [(0, '-0.802')] [2022-07-10 11:52:25,886][26022] Updated weights on worker 0-0, policy_version 711887 (0.00081) [2022-07-10 11:52:27,783][26022] Updated weights on worker 0-0, policy_version 711897 (0.00093) [2022-07-10 11:52:29,630][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:52:29,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000711907_728992768.pth [2022-07-10 11:52:29,647][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000709958_726996992.pth [2022-07-10 11:52:29,649][26022] Updated weights on worker 0-0, policy_version 711907 (0.00090) [2022-07-10 11:52:30,478][25689] Fps is (10 sec: 5716.6, 60 sec: 5542.8, 300 sec: 5534.5). Total num frames: 728997888. Throughput: 0: 5818.4. Samples: 729003436. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:30,478][25689] Avg episode reward: [(0, '0.053')] [2022-07-10 11:52:31,554][26022] Updated weights on worker 0-0, policy_version 711917 (0.00092) [2022-07-10 11:52:33,094][26022] Updated weights on worker 0-0, policy_version 711927 (0.00082) [2022-07-10 11:52:35,026][26022] Updated weights on worker 0-0, policy_version 711937 (0.00087) [2022-07-10 11:52:35,551][25689] Fps is (10 sec: 5474.3, 60 sec: 5544.1, 300 sec: 5533.9). Total num frames: 729025536. Throughput: 0: 4988.1. Samples: 729020332. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:35,551][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 11:52:36,836][26022] Updated weights on worker 0-0, policy_version 711947 (0.00087) [2022-07-10 11:52:38,814][26022] Updated weights on worker 0-0, policy_version 711957 (0.00094) [2022-07-10 11:52:40,511][26022] Updated weights on worker 0-0, policy_version 711967 (0.00092) [2022-07-10 11:52:40,587][25689] Fps is (10 sec: 5571.3, 60 sec: 5542.4, 300 sec: 5533.7). Total num frames: 729054208. Throughput: 0: 5824.4. Samples: 729053906. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:40,587][25689] Avg episode reward: [(0, '0.038')] [2022-07-10 11:52:42,296][26022] Updated weights on worker 0-0, policy_version 711977 (0.00083) [2022-07-10 11:52:44,000][26022] Updated weights on worker 0-0, policy_version 711987 (0.00087) [2022-07-10 11:52:45,596][25689] Fps is (10 sec: 5504.9, 60 sec: 5510.6, 300 sec: 5531.2). Total num frames: 729080832. Throughput: 0: 5840.3. Samples: 729087564. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:45,596][25689] Avg episode reward: [(0, '-0.613')] [2022-07-10 11:52:46,218][26022] Updated weights on worker 0-0, policy_version 711997 (0.00083) [2022-07-10 11:52:47,787][26022] Updated weights on worker 0-0, policy_version 712007 (0.00087) [2022-07-10 11:52:49,891][26022] Updated weights on worker 0-0, policy_version 712017 (0.00097) [2022-07-10 11:52:50,623][25689] Fps is (10 sec: 5509.6, 60 sec: 5542.2, 300 sec: 5535.5). Total num frames: 729109504. Throughput: 0: 5004.9. Samples: 729104260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:50,623][25689] Avg episode reward: [(0, '-1.081')] [2022-07-10 11:52:51,525][26022] Updated weights on worker 0-0, policy_version 712027 (0.00090) [2022-07-10 11:52:53,330][26022] Updated weights on worker 0-0, policy_version 712037 (0.00094) [2022-07-10 11:52:55,207][26022] Updated weights on worker 0-0, policy_version 712047 (0.00084) [2022-07-10 11:52:55,745][25689] Fps is (10 sec: 5650.1, 60 sec: 5518.0, 300 sec: 5533.6). Total num frames: 729138176. Throughput: 0: 5826.6. Samples: 729137996. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:52:55,746][25689] Avg episode reward: [(0, '-3.194')] [2022-07-10 11:52:56,954][26022] Updated weights on worker 0-0, policy_version 712057 (0.00092) [2022-07-10 11:52:58,955][26022] Updated weights on worker 0-0, policy_version 712067 (0.00082) [2022-07-10 11:53:00,533][26022] Updated weights on worker 0-0, policy_version 712077 (0.00091) [2022-07-10 11:53:00,773][25689] Fps is (10 sec: 5649.8, 60 sec: 5555.3, 300 sec: 5537.9). Total num frames: 729166848. Throughput: 0: 5840.5. Samples: 729171802. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:00,773][25689] Avg episode reward: [(0, '-3.466')] [2022-07-10 11:53:02,985][26022] Updated weights on worker 0-0, policy_version 712087 (0.00089) [2022-07-10 11:53:04,742][26022] Updated weights on worker 0-0, policy_version 712097 (0.00087) [2022-07-10 11:53:05,864][25689] Fps is (10 sec: 5363.2, 60 sec: 5549.6, 300 sec: 5533.9). Total num frames: 729192448. Throughput: 0: 4869.9. Samples: 729186270. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:05,865][25689] Avg episode reward: [(0, '-4.182')] [2022-07-10 11:53:06,603][26022] Updated weights on worker 0-0, policy_version 712107 (0.00086) [2022-07-10 11:53:08,507][26022] Updated weights on worker 0-0, policy_version 712117 (0.00097) [2022-07-10 11:53:10,619][26022] Updated weights on worker 0-0, policy_version 712127 (0.00093) [2022-07-10 11:53:10,888][25689] Fps is (10 sec: 5163.0, 60 sec: 5515.2, 300 sec: 5524.8). Total num frames: 729219072. Throughput: 0: 5696.5. Samples: 729219698. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:10,888][25689] Avg episode reward: [(0, '-5.364')] [2022-07-10 11:53:11,971][26022] Updated weights on worker 0-0, policy_version 712137 (0.00087) [2022-07-10 11:53:14,343][26022] Updated weights on worker 0-0, policy_version 712147 (0.00089) [2022-07-10 11:53:15,450][26022] Updated weights on worker 0-0, policy_version 712157 (0.00094) [2022-07-10 11:53:15,925][25689] Fps is (10 sec: 5700.0, 60 sec: 5552.5, 300 sec: 5538.9). Total num frames: 729249792. Throughput: 0: 5693.0. Samples: 729252878. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:15,925][25689] Avg episode reward: [(0, '-5.952')] [2022-07-10 11:53:18,140][26022] Updated weights on worker 0-0, policy_version 712167 (0.00093) [2022-07-10 11:53:19,458][26022] Updated weights on worker 0-0, policy_version 712177 (0.00093) [2022-07-10 11:53:20,934][25689] Fps is (10 sec: 5605.8, 60 sec: 5535.8, 300 sec: 5528.7). Total num frames: 729275392. Throughput: 0: 4846.5. Samples: 729269516. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:20,935][25689] Avg episode reward: [(0, '-5.352')] [2022-07-10 11:53:21,462][26022] Updated weights on worker 0-0, policy_version 712187 (0.00090) [2022-07-10 11:53:23,406][26022] Updated weights on worker 0-0, policy_version 712197 (0.00091) [2022-07-10 11:53:25,146][26022] Updated weights on worker 0-0, policy_version 712207 (0.00091) [2022-07-10 11:53:25,963][25689] Fps is (10 sec: 5406.6, 60 sec: 5516.6, 300 sec: 5528.3). Total num frames: 729304064. Throughput: 0: 5797.2. Samples: 729302786. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:25,963][25689] Avg episode reward: [(0, '-4.774')] [2022-07-10 11:53:27,055][26022] Updated weights on worker 0-0, policy_version 712217 (0.00094) [2022-07-10 11:53:28,909][26022] Updated weights on worker 0-0, policy_version 712227 (0.00092) [2022-07-10 11:53:30,673][26022] Updated weights on worker 0-0, policy_version 712237 (0.00085) [2022-07-10 11:53:30,972][25689] Fps is (10 sec: 5611.1, 60 sec: 5518.3, 300 sec: 5530.9). Total num frames: 729331712. Throughput: 0: 5776.1. Samples: 729335704. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:30,972][25689] Avg episode reward: [(0, '-3.567')] [2022-07-10 11:53:32,653][26022] Updated weights on worker 0-0, policy_version 712247 (0.00084) [2022-07-10 11:53:34,339][26022] Updated weights on worker 0-0, policy_version 712257 (0.00093) [2022-07-10 11:53:36,033][25689] Fps is (10 sec: 5491.1, 60 sec: 5519.4, 300 sec: 5530.2). Total num frames: 729359360. Throughput: 0: 5786.6. Samples: 729369236. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:36,033][25689] Avg episode reward: [(0, '-4.889')] [2022-07-10 11:53:36,179][26022] Updated weights on worker 0-0, policy_version 712267 (0.00087) [2022-07-10 11:53:38,115][26022] Updated weights on worker 0-0, policy_version 712277 (0.00084) [2022-07-10 11:53:39,820][26022] Updated weights on worker 0-0, policy_version 712287 (0.00091) [2022-07-10 11:53:41,060][25689] Fps is (10 sec: 5582.8, 60 sec: 5520.2, 300 sec: 5530.6). Total num frames: 729388032. Throughput: 0: 5795.5. Samples: 729386152. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:41,060][25689] Avg episode reward: [(0, '-3.719')] [2022-07-10 11:53:41,787][26022] Updated weights on worker 0-0, policy_version 712297 (0.00087) [2022-07-10 11:53:43,522][26022] Updated weights on worker 0-0, policy_version 712307 (0.00088) [2022-07-10 11:53:45,320][26022] Updated weights on worker 0-0, policy_version 712317 (0.00085) [2022-07-10 11:53:46,066][25689] Fps is (10 sec: 5613.6, 60 sec: 5537.4, 300 sec: 5531.1). Total num frames: 729415680. Throughput: 0: 5827.9. Samples: 729419944. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:46,066][25689] Avg episode reward: [(0, '-2.600')] [2022-07-10 11:53:47,261][26022] Updated weights on worker 0-0, policy_version 712327 (0.00091) [2022-07-10 11:53:49,155][26022] Updated weights on worker 0-0, policy_version 712337 (0.00096) [2022-07-10 11:53:50,812][26022] Updated weights on worker 0-0, policy_version 712347 (0.00090) [2022-07-10 11:53:51,097][25689] Fps is (10 sec: 5611.3, 60 sec: 5537.1, 300 sec: 5528.3). Total num frames: 729444352. Throughput: 0: 5820.6. Samples: 729452844. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:51,097][25689] Avg episode reward: [(0, '-2.418')] [2022-07-10 11:53:52,958][26022] Updated weights on worker 0-0, policy_version 712357 (0.00087) [2022-07-10 11:53:54,376][26022] Updated weights on worker 0-0, policy_version 712367 (0.00085) [2022-07-10 11:53:56,144][25689] Fps is (10 sec: 5486.7, 60 sec: 5510.0, 300 sec: 5529.2). Total num frames: 729470976. Throughput: 0: 4991.0. Samples: 729469610. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:53:56,144][25689] Avg episode reward: [(0, '-2.448')] [2022-07-10 11:53:56,505][26022] Updated weights on worker 0-0, policy_version 712377 (0.00086) [2022-07-10 11:53:58,396][26022] Updated weights on worker 0-0, policy_version 712387 (0.00090) [2022-07-10 11:54:00,111][26022] Updated weights on worker 0-0, policy_version 712397 (0.00091) [2022-07-10 11:54:01,158][25689] Fps is (10 sec: 5495.6, 60 sec: 5511.2, 300 sec: 5536.8). Total num frames: 729499648. Throughput: 0: 5806.4. Samples: 729502854. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:01,159][25689] Avg episode reward: [(0, '-3.340')] [2022-07-10 11:54:02,614][26022] Updated weights on worker 0-0, policy_version 712407 (0.00093) [2022-07-10 11:54:04,065][26022] Updated weights on worker 0-0, policy_version 712417 (0.00086) [2022-07-10 11:54:06,183][25689] Fps is (10 sec: 5303.8, 60 sec: 5500.3, 300 sec: 5524.2). Total num frames: 729524224. Throughput: 0: 5692.6. Samples: 729534466. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:06,184][25689] Avg episode reward: [(0, '-3.393')] [2022-07-10 11:54:06,209][26022] Updated weights on worker 0-0, policy_version 712427 (0.00111) [2022-07-10 11:54:07,830][26022] Updated weights on worker 0-0, policy_version 712437 (0.00092) [2022-07-10 11:54:09,567][26022] Updated weights on worker 0-0, policy_version 712447 (0.00087) [2022-07-10 11:54:11,201][25689] Fps is (10 sec: 5404.3, 60 sec: 5551.8, 300 sec: 5532.3). Total num frames: 729553920. Throughput: 0: 4885.6. Samples: 729551066. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:11,201][25689] Avg episode reward: [(0, '-3.981')] [2022-07-10 11:54:11,616][26022] Updated weights on worker 0-0, policy_version 712457 (0.00085) [2022-07-10 11:54:13,551][26022] Updated weights on worker 0-0, policy_version 712467 (0.00093) [2022-07-10 11:54:15,363][26022] Updated weights on worker 0-0, policy_version 712477 (0.00085) [2022-07-10 11:54:16,275][25689] Fps is (10 sec: 5580.7, 60 sec: 5480.5, 300 sec: 5524.7). Total num frames: 729580544. Throughput: 0: 5705.3. Samples: 729584466. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:16,276][25689] Avg episode reward: [(0, '-3.950')] [2022-07-10 11:54:17,091][26022] Updated weights on worker 0-0, policy_version 712487 (0.00088) [2022-07-10 11:54:18,986][26022] Updated weights on worker 0-0, policy_version 712497 (0.00085) [2022-07-10 11:54:20,858][26022] Updated weights on worker 0-0, policy_version 712507 (0.00092) [2022-07-10 11:54:21,362][25689] Fps is (10 sec: 5542.7, 60 sec: 5541.3, 300 sec: 5527.2). Total num frames: 729610240. Throughput: 0: 5689.3. Samples: 729617798. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:21,362][25689] Avg episode reward: [(0, '-4.696')] [2022-07-10 11:54:22,838][26022] Updated weights on worker 0-0, policy_version 712517 (0.00087) [2022-07-10 11:54:24,494][26022] Updated weights on worker 0-0, policy_version 712527 (0.00087) [2022-07-10 11:54:26,387][25689] Fps is (10 sec: 5569.8, 60 sec: 5507.7, 300 sec: 5523.8). Total num frames: 729636864. Throughput: 0: 4949.4. Samples: 729634462. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:26,387][25689] Avg episode reward: [(0, '-5.426')] [2022-07-10 11:54:26,448][26022] Updated weights on worker 0-0, policy_version 712537 (0.00089) [2022-07-10 11:54:28,392][26022] Updated weights on worker 0-0, policy_version 712547 (0.00083) [2022-07-10 11:54:29,827][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:54:29,837][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000712555_729656320.pth [2022-07-10 11:54:29,837][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000710607_727661568.pth [2022-07-10 11:54:30,070][26022] Updated weights on worker 0-0, policy_version 712557 (0.00094) [2022-07-10 11:54:31,407][25689] Fps is (10 sec: 5402.9, 60 sec: 5506.7, 300 sec: 5521.0). Total num frames: 729664512. Throughput: 0: 5776.5. Samples: 729667786. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:31,407][25689] Avg episode reward: [(0, '-2.984')] [2022-07-10 11:54:32,011][26022] Updated weights on worker 0-0, policy_version 712567 (0.00092) [2022-07-10 11:54:33,863][26022] Updated weights on worker 0-0, policy_version 712577 (0.00085) [2022-07-10 11:54:35,351][26022] Updated weights on worker 0-0, policy_version 712587 (0.00087) [2022-07-10 11:54:36,530][25689] Fps is (10 sec: 5552.4, 60 sec: 5518.0, 300 sec: 5519.7). Total num frames: 729693184. Throughput: 0: 5761.5. Samples: 729701166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:36,534][25689] Avg episode reward: [(0, '-3.109')] [2022-07-10 11:54:37,582][26022] Updated weights on worker 0-0, policy_version 712597 (0.00085) [2022-07-10 11:54:39,080][26022] Updated weights on worker 0-0, policy_version 712607 (0.00085) [2022-07-10 11:54:41,147][26022] Updated weights on worker 0-0, policy_version 712617 (0.00095) [2022-07-10 11:54:41,584][25689] Fps is (10 sec: 5735.2, 60 sec: 5532.5, 300 sec: 5526.3). Total num frames: 729722880. Throughput: 0: 4957.6. Samples: 729718050. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:41,585][25689] Avg episode reward: [(0, '-3.118')] [2022-07-10 11:54:42,878][26022] Updated weights on worker 0-0, policy_version 712627 (0.00098) [2022-07-10 11:54:44,668][26022] Updated weights on worker 0-0, policy_version 712637 (0.00091) [2022-07-10 11:54:46,396][26022] Updated weights on worker 0-0, policy_version 712647 (0.00094) [2022-07-10 11:54:46,629][25689] Fps is (10 sec: 5678.1, 60 sec: 5528.9, 300 sec: 5529.0). Total num frames: 729750528. Throughput: 0: 5788.4. Samples: 729751632. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-10 11:54:46,630][25689] Avg episode reward: [(0, '-3.361')] [2022-07-10 11:54:48,277][26022] Updated weights on worker 0-0, policy_version 712657 (0.00095) [2022-07-10 11:54:50,268][26022] Updated weights on worker 0-0, policy_version 712667 (0.00091) [2022-07-10 11:54:51,668][25689] Fps is (10 sec: 5483.7, 60 sec: 5511.3, 300 sec: 5522.5). Total num frames: 729778176. Throughput: 0: 5787.7. Samples: 729785050. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:54:51,668][25689] Avg episode reward: [(0, '-2.190')] [2022-07-10 11:54:52,213][26022] Updated weights on worker 0-0, policy_version 712677 (0.00090) [2022-07-10 11:54:53,777][26022] Updated weights on worker 0-0, policy_version 712687 (0.00090) [2022-07-10 11:54:55,682][26022] Updated weights on worker 0-0, policy_version 712697 (0.00090) [2022-07-10 11:54:56,794][25689] Fps is (10 sec: 5540.8, 60 sec: 5537.9, 300 sec: 5524.4). Total num frames: 729806848. Throughput: 0: 4955.6. Samples: 729801584. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:54:56,794][25689] Avg episode reward: [(0, '-4.655')] [2022-07-10 11:54:57,799][26022] Updated weights on worker 0-0, policy_version 712707 (0.00085) [2022-07-10 11:54:59,370][26022] Updated weights on worker 0-0, policy_version 712717 (0.00092) [2022-07-10 11:55:01,270][26022] Updated weights on worker 0-0, policy_version 712727 (0.00085) [2022-07-10 11:55:01,815][25689] Fps is (10 sec: 5550.2, 60 sec: 5520.4, 300 sec: 5531.2). Total num frames: 729834496. Throughput: 0: 5787.1. Samples: 729835130. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:01,816][25689] Avg episode reward: [(0, '-5.249')] [2022-07-10 11:55:03,491][26022] Updated weights on worker 0-0, policy_version 712737 (0.00081) [2022-07-10 11:55:05,375][26022] Updated weights on worker 0-0, policy_version 712747 (0.00088) [2022-07-10 11:55:06,817][25689] Fps is (10 sec: 5312.5, 60 sec: 5539.4, 300 sec: 5525.0). Total num frames: 729860096. Throughput: 0: 5682.3. Samples: 729866344. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:06,817][25689] Avg episode reward: [(0, '-5.178')] [2022-07-10 11:55:07,182][26022] Updated weights on worker 0-0, policy_version 712757 (0.00087) [2022-07-10 11:55:09,138][26022] Updated weights on worker 0-0, policy_version 712767 (0.00084) [2022-07-10 11:55:10,776][26022] Updated weights on worker 0-0, policy_version 712777 (0.00089) [2022-07-10 11:55:11,906][25689] Fps is (10 sec: 5276.8, 60 sec: 5499.1, 300 sec: 5526.1). Total num frames: 729887744. Throughput: 0: 4842.6. Samples: 729883056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:11,907][25689] Avg episode reward: [(0, '-5.400')] [2022-07-10 11:55:12,862][26022] Updated weights on worker 0-0, policy_version 712787 (0.00087) [2022-07-10 11:55:14,659][26022] Updated weights on worker 0-0, policy_version 712797 (0.00087) [2022-07-10 11:55:16,431][26022] Updated weights on worker 0-0, policy_version 712807 (0.00079) [2022-07-10 11:55:16,981][25689] Fps is (10 sec: 5641.6, 60 sec: 5549.6, 300 sec: 5525.4). Total num frames: 729917440. Throughput: 0: 5698.4. Samples: 729916622. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:16,982][25689] Avg episode reward: [(0, '-4.969')] [2022-07-10 11:55:18,274][26022] Updated weights on worker 0-0, policy_version 712817 (0.00084) [2022-07-10 11:55:19,987][26022] Updated weights on worker 0-0, policy_version 712827 (0.00089) [2022-07-10 11:55:21,858][26022] Updated weights on worker 0-0, policy_version 712837 (0.00085) [2022-07-10 11:55:22,007][25689] Fps is (10 sec: 5677.2, 60 sec: 5521.5, 300 sec: 5525.6). Total num frames: 729945088. Throughput: 0: 5710.9. Samples: 729950442. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:22,007][25689] Avg episode reward: [(0, '-3.080')] [2022-07-10 11:55:23,743][26022] Updated weights on worker 0-0, policy_version 712847 (0.00091) [2022-07-10 11:55:25,545][26022] Updated weights on worker 0-0, policy_version 712857 (0.00081) [2022-07-10 11:55:27,041][25689] Fps is (10 sec: 5496.7, 60 sec: 5537.5, 300 sec: 5528.5). Total num frames: 729972736. Throughput: 0: 4977.6. Samples: 729967012. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:27,042][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 11:55:27,580][26022] Updated weights on worker 0-0, policy_version 712867 (0.00091) [2022-07-10 11:55:29,287][26022] Updated weights on worker 0-0, policy_version 712877 (0.00084) [2022-07-10 11:55:31,163][26022] Updated weights on worker 0-0, policy_version 712887 (0.00083) [2022-07-10 11:55:32,086][25689] Fps is (10 sec: 5587.7, 60 sec: 5552.1, 300 sec: 5528.6). Total num frames: 730001408. Throughput: 0: 5797.8. Samples: 730000056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:32,087][25689] Avg episode reward: [(0, '-2.841')] [2022-07-10 11:55:33,028][26022] Updated weights on worker 0-0, policy_version 712897 (0.00089) [2022-07-10 11:55:34,890][26022] Updated weights on worker 0-0, policy_version 712907 (0.00097) [2022-07-10 11:55:36,815][26022] Updated weights on worker 0-0, policy_version 712917 (0.00081) [2022-07-10 11:55:37,126][25689] Fps is (10 sec: 5584.8, 60 sec: 5542.9, 300 sec: 5528.0). Total num frames: 730029056. Throughput: 0: 5789.6. Samples: 730033248. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:37,126][25689] Avg episode reward: [(0, '-2.589')] [2022-07-10 11:55:38,640][26022] Updated weights on worker 0-0, policy_version 712927 (0.00094) [2022-07-10 11:55:40,298][26022] Updated weights on worker 0-0, policy_version 712937 (0.00081) [2022-07-10 11:55:42,118][26022] Updated weights on worker 0-0, policy_version 712947 (0.00622) [2022-07-10 11:55:42,145][25689] Fps is (10 sec: 5599.0, 60 sec: 5529.1, 300 sec: 5531.3). Total num frames: 730057728. Throughput: 0: 4942.8. Samples: 730049980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:42,145][25689] Avg episode reward: [(0, '-2.861')] [2022-07-10 11:55:44,355][26022] Updated weights on worker 0-0, policy_version 712957 (0.00097) [2022-07-10 11:55:45,961][26022] Updated weights on worker 0-0, policy_version 712967 (0.00089) [2022-07-10 11:55:47,218][25689] Fps is (10 sec: 5377.4, 60 sec: 5492.8, 300 sec: 5519.9). Total num frames: 730083328. Throughput: 0: 5753.3. Samples: 730083096. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:47,219][25689] Avg episode reward: [(0, '-2.442')] [2022-07-10 11:55:47,916][26022] Updated weights on worker 0-0, policy_version 712977 (0.00104) [2022-07-10 11:55:49,646][26022] Updated weights on worker 0-0, policy_version 712987 (0.00089) [2022-07-10 11:55:51,604][26022] Updated weights on worker 0-0, policy_version 712997 (0.00092) [2022-07-10 11:55:52,239][25689] Fps is (10 sec: 5478.3, 60 sec: 5528.2, 300 sec: 5527.1). Total num frames: 730113024. Throughput: 0: 5768.6. Samples: 730116308. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:52,240][25689] Avg episode reward: [(0, '-2.974')] [2022-07-10 11:55:53,427][26022] Updated weights on worker 0-0, policy_version 713007 (0.00085) [2022-07-10 11:55:55,195][26022] Updated weights on worker 0-0, policy_version 713017 (0.00117) [2022-07-10 11:55:57,212][26022] Updated weights on worker 0-0, policy_version 713027 (0.00102) [2022-07-10 11:55:57,299][25689] Fps is (10 sec: 5587.0, 60 sec: 5500.4, 300 sec: 5523.0). Total num frames: 730139648. Throughput: 0: 5763.6. Samples: 730149518. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:55:57,299][25689] Avg episode reward: [(0, '-2.772')] [2022-07-10 11:55:58,847][26022] Updated weights on worker 0-0, policy_version 713037 (0.00069) [2022-07-10 11:56:00,792][26022] Updated weights on worker 0-0, policy_version 713047 (0.00091) [2022-07-10 11:56:02,315][25689] Fps is (10 sec: 5284.2, 60 sec: 5483.9, 300 sec: 5526.2). Total num frames: 730166272. Throughput: 0: 5769.6. Samples: 730166356. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:02,317][25689] Avg episode reward: [(0, '-2.776')] [2022-07-10 11:56:02,991][26022] Updated weights on worker 0-0, policy_version 713057 (0.00096) [2022-07-10 11:56:04,673][26022] Updated weights on worker 0-0, policy_version 713067 (0.00089) [2022-07-10 11:56:06,682][26022] Updated weights on worker 0-0, policy_version 713077 (0.00076) [2022-07-10 11:56:07,322][25689] Fps is (10 sec: 5414.8, 60 sec: 5517.3, 300 sec: 5526.8). Total num frames: 730193920. Throughput: 0: 5723.3. Samples: 730198154. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:07,323][25689] Avg episode reward: [(0, '-1.436')] [2022-07-10 11:56:08,270][26022] Updated weights on worker 0-0, policy_version 713087 (0.00088) [2022-07-10 11:56:10,231][26022] Updated weights on worker 0-0, policy_version 713097 (0.00091) [2022-07-10 11:56:12,042][26022] Updated weights on worker 0-0, policy_version 713107 (0.00091) [2022-07-10 11:56:12,337][25689] Fps is (10 sec: 5619.7, 60 sec: 5541.0, 300 sec: 5528.3). Total num frames: 730222592. Throughput: 0: 5748.2. Samples: 730231838. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:12,339][25689] Avg episode reward: [(0, '-2.042')] [2022-07-10 11:56:13,791][26022] Updated weights on worker 0-0, policy_version 713117 (0.00080) [2022-07-10 11:56:15,773][26022] Updated weights on worker 0-0, policy_version 713127 (0.00097) [2022-07-10 11:56:17,395][25689] Fps is (10 sec: 5591.1, 60 sec: 5508.7, 300 sec: 5525.2). Total num frames: 730250240. Throughput: 0: 4923.6. Samples: 730248462. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:17,395][25689] Avg episode reward: [(0, '-1.650')] [2022-07-10 11:56:17,523][26022] Updated weights on worker 0-0, policy_version 713137 (0.00087) [2022-07-10 11:56:19,513][26022] Updated weights on worker 0-0, policy_version 713147 (0.00094) [2022-07-10 11:56:21,380][26022] Updated weights on worker 0-0, policy_version 713157 (0.00087) [2022-07-10 11:56:22,407][25689] Fps is (10 sec: 5491.2, 60 sec: 5509.9, 300 sec: 5525.7). Total num frames: 730277888. Throughput: 0: 5745.8. Samples: 730281800. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:22,408][25689] Avg episode reward: [(0, '-1.264')] [2022-07-10 11:56:23,051][26022] Updated weights on worker 0-0, policy_version 713167 (0.00086) [2022-07-10 11:56:25,284][26022] Updated weights on worker 0-0, policy_version 713177 (0.00093) [2022-07-10 11:56:26,772][26022] Updated weights on worker 0-0, policy_version 713187 (0.00091) [2022-07-10 11:56:27,432][25689] Fps is (10 sec: 5611.1, 60 sec: 5527.7, 300 sec: 5526.9). Total num frames: 730306560. Throughput: 0: 5802.3. Samples: 730314840. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:27,433][25689] Avg episode reward: [(0, '-1.238')] [2022-07-10 11:56:28,914][26022] Updated weights on worker 0-0, policy_version 713197 (0.00089) [2022-07-10 11:56:29,974][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:56:29,992][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000713203_730319872.pth [2022-07-10 11:56:29,993][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000711257_728327168.pth [2022-07-10 11:56:30,567][26022] Updated weights on worker 0-0, policy_version 713207 (0.00094) [2022-07-10 11:56:32,458][25689] Fps is (10 sec: 5501.7, 60 sec: 5495.5, 300 sec: 5524.6). Total num frames: 730333184. Throughput: 0: 4951.2. Samples: 730331458. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:32,458][25689] Avg episode reward: [(0, '-2.080')] [2022-07-10 11:56:32,619][26022] Updated weights on worker 0-0, policy_version 713217 (0.00088) [2022-07-10 11:56:34,320][26022] Updated weights on worker 0-0, policy_version 713227 (0.00084) [2022-07-10 11:56:36,145][26022] Updated weights on worker 0-0, policy_version 713237 (0.01100) [2022-07-10 11:56:37,546][25689] Fps is (10 sec: 5568.3, 60 sec: 5525.0, 300 sec: 5526.7). Total num frames: 730362880. Throughput: 0: 5784.8. Samples: 730365036. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:37,547][25689] Avg episode reward: [(0, '-2.328')] [2022-07-10 11:56:37,827][26022] Updated weights on worker 0-0, policy_version 713247 (0.00088) [2022-07-10 11:56:39,754][26022] Updated weights on worker 0-0, policy_version 713257 (0.00088) [2022-07-10 11:56:41,467][26022] Updated weights on worker 0-0, policy_version 713267 (0.00091) [2022-07-10 11:56:42,595][25689] Fps is (10 sec: 5455.0, 60 sec: 5471.5, 300 sec: 5516.1). Total num frames: 730388480. Throughput: 0: 5791.1. Samples: 730398708. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:42,595][25689] Avg episode reward: [(0, '-2.649')] [2022-07-10 11:56:43,804][26022] Updated weights on worker 0-0, policy_version 713278 (0.00084) [2022-07-10 11:56:45,620][26022] Updated weights on worker 0-0, policy_version 713288 (0.00090) [2022-07-10 11:56:47,304][26022] Updated weights on worker 0-0, policy_version 713298 (0.00091) [2022-07-10 11:56:47,623][25689] Fps is (10 sec: 5487.7, 60 sec: 5543.4, 300 sec: 5526.0). Total num frames: 730418176. Throughput: 0: 4985.9. Samples: 730415508. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:47,623][25689] Avg episode reward: [(0, '-1.876')] [2022-07-10 11:56:49,028][26022] Updated weights on worker 0-0, policy_version 713308 (0.00080) [2022-07-10 11:56:50,959][26022] Updated weights on worker 0-0, policy_version 713318 (0.00091) [2022-07-10 11:56:52,625][25689] Fps is (10 sec: 5819.0, 60 sec: 5528.1, 300 sec: 5523.3). Total num frames: 730446848. Throughput: 0: 5835.6. Samples: 730449146. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:52,626][25689] Avg episode reward: [(0, '-2.717')] [2022-07-10 11:56:52,811][26022] Updated weights on worker 0-0, policy_version 713328 (0.00100) [2022-07-10 11:56:54,571][26022] Updated weights on worker 0-0, policy_version 713338 (0.00085) [2022-07-10 11:56:56,602][26022] Updated weights on worker 0-0, policy_version 713348 (0.00091) [2022-07-10 11:56:57,714][25689] Fps is (10 sec: 5580.9, 60 sec: 5542.4, 300 sec: 5526.3). Total num frames: 730474496. Throughput: 0: 5824.9. Samples: 730482512. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:56:57,715][25689] Avg episode reward: [(0, '-2.285')] [2022-07-10 11:56:58,179][26022] Updated weights on worker 0-0, policy_version 713358 (0.00090) [2022-07-10 11:57:00,232][26022] Updated weights on worker 0-0, policy_version 713368 (0.00089) [2022-07-10 11:57:02,184][26022] Updated weights on worker 0-0, policy_version 713378 (0.00091) [2022-07-10 11:57:02,735][25689] Fps is (10 sec: 5368.5, 60 sec: 5542.1, 300 sec: 5529.9). Total num frames: 730501120. Throughput: 0: 5007.2. Samples: 730499552. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:02,736][25689] Avg episode reward: [(0, '-2.493')] [2022-07-10 11:57:04,131][26022] Updated weights on worker 0-0, policy_version 713388 (0.00078) [2022-07-10 11:57:06,048][26022] Updated weights on worker 0-0, policy_version 713398 (0.00086) [2022-07-10 11:57:07,715][26022] Updated weights on worker 0-0, policy_version 713408 (0.00095) [2022-07-10 11:57:07,796][25689] Fps is (10 sec: 5484.9, 60 sec: 5554.0, 300 sec: 5529.1). Total num frames: 730529792. Throughput: 0: 5738.8. Samples: 730531276. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:07,797][25689] Avg episode reward: [(0, '-2.934')] [2022-07-10 11:57:09,758][26022] Updated weights on worker 0-0, policy_version 713418 (0.00089) [2022-07-10 11:57:11,281][26022] Updated weights on worker 0-0, policy_version 713428 (0.00084) [2022-07-10 11:57:12,806][25689] Fps is (10 sec: 5591.9, 60 sec: 5537.5, 300 sec: 5526.8). Total num frames: 730557440. Throughput: 0: 5727.6. Samples: 730564736. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:12,808][25689] Avg episode reward: [(0, '-2.756')] [2022-07-10 11:57:13,424][26022] Updated weights on worker 0-0, policy_version 713438 (0.00092) [2022-07-10 11:57:14,963][26022] Updated weights on worker 0-0, policy_version 713448 (0.00088) [2022-07-10 11:57:17,053][26022] Updated weights on worker 0-0, policy_version 713458 (0.00091) [2022-07-10 11:57:17,893][25689] Fps is (10 sec: 5577.6, 60 sec: 5551.7, 300 sec: 5532.3). Total num frames: 730586112. Throughput: 0: 4905.1. Samples: 730581492. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:17,895][25689] Avg episode reward: [(0, '-2.637')] [2022-07-10 11:57:18,784][26022] Updated weights on worker 0-0, policy_version 713468 (0.00374) [2022-07-10 11:57:20,684][26022] Updated weights on worker 0-0, policy_version 713478 (0.00085) [2022-07-10 11:57:22,281][26022] Updated weights on worker 0-0, policy_version 713488 (0.00094) [2022-07-10 11:57:22,897][25689] Fps is (10 sec: 5581.3, 60 sec: 5552.5, 300 sec: 5525.4). Total num frames: 730613760. Throughput: 0: 5740.6. Samples: 730615296. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:22,899][25689] Avg episode reward: [(0, '-2.296')] [2022-07-10 11:57:24,283][26022] Updated weights on worker 0-0, policy_version 713498 (0.00084) [2022-07-10 11:57:26,139][26022] Updated weights on worker 0-0, policy_version 713508 (0.00091) [2022-07-10 11:57:27,870][26022] Updated weights on worker 0-0, policy_version 713518 (0.00091) [2022-07-10 11:57:27,909][25689] Fps is (10 sec: 5623.2, 60 sec: 5553.7, 300 sec: 5529.2). Total num frames: 730642432. Throughput: 0: 5842.7. Samples: 730648792. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:27,911][25689] Avg episode reward: [(0, '-2.271')] [2022-07-10 11:57:29,798][26022] Updated weights on worker 0-0, policy_version 713528 (0.00094) [2022-07-10 11:57:31,609][26022] Updated weights on worker 0-0, policy_version 713538 (0.00085) [2022-07-10 11:57:32,926][25689] Fps is (10 sec: 5513.5, 60 sec: 5554.5, 300 sec: 5526.7). Total num frames: 730669056. Throughput: 0: 5011.6. Samples: 730665570. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:32,928][25689] Avg episode reward: [(0, '-1.498')] [2022-07-10 11:57:33,383][26022] Updated weights on worker 0-0, policy_version 713548 (0.00081) [2022-07-10 11:57:35,223][26022] Updated weights on worker 0-0, policy_version 713558 (0.00080) [2022-07-10 11:57:37,098][26022] Updated weights on worker 0-0, policy_version 713568 (0.00088) [2022-07-10 11:57:38,023][25689] Fps is (10 sec: 5568.6, 60 sec: 5553.8, 300 sec: 5529.1). Total num frames: 730698752. Throughput: 0: 5856.8. Samples: 730699386. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:38,023][25689] Avg episode reward: [(0, '-1.145')] [2022-07-10 11:57:38,978][26022] Updated weights on worker 0-0, policy_version 713578 (0.00086) [2022-07-10 11:57:40,844][26022] Updated weights on worker 0-0, policy_version 713588 (0.00092) [2022-07-10 11:57:42,540][26022] Updated weights on worker 0-0, policy_version 713598 (0.00052) [2022-07-10 11:57:43,053][25689] Fps is (10 sec: 5561.5, 60 sec: 5572.4, 300 sec: 5528.7). Total num frames: 730725376. Throughput: 0: 5818.9. Samples: 730732582. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:43,054][25689] Avg episode reward: [(0, '-0.630')] [2022-07-10 11:57:44,476][26022] Updated weights on worker 0-0, policy_version 713608 (0.00093) [2022-07-10 11:57:46,409][26022] Updated weights on worker 0-0, policy_version 713618 (0.00087) [2022-07-10 11:57:48,075][25689] Fps is (10 sec: 5398.7, 60 sec: 5539.0, 300 sec: 5525.3). Total num frames: 730753024. Throughput: 0: 4978.7. Samples: 730749196. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:48,076][25689] Avg episode reward: [(0, '-0.978')] [2022-07-10 11:57:48,191][26022] Updated weights on worker 0-0, policy_version 713628 (0.00096) [2022-07-10 11:57:50,080][26022] Updated weights on worker 0-0, policy_version 713638 (0.00100) [2022-07-10 11:57:51,749][26022] Updated weights on worker 0-0, policy_version 713648 (0.00090) [2022-07-10 11:57:53,088][25689] Fps is (10 sec: 5714.6, 60 sec: 5555.0, 300 sec: 5530.8). Total num frames: 730782720. Throughput: 0: 5803.2. Samples: 730782572. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:53,088][25689] Avg episode reward: [(0, '-0.650')] [2022-07-10 11:57:53,958][26022] Updated weights on worker 0-0, policy_version 713658 (0.01082) [2022-07-10 11:57:55,407][26022] Updated weights on worker 0-0, policy_version 713668 (0.00085) [2022-07-10 11:57:57,479][26022] Updated weights on worker 0-0, policy_version 713678 (0.00089) [2022-07-10 11:57:58,185][25689] Fps is (10 sec: 5672.0, 60 sec: 5554.2, 300 sec: 5526.1). Total num frames: 730810368. Throughput: 0: 5788.7. Samples: 730816102. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:57:58,186][25689] Avg episode reward: [(0, '-1.123')] [2022-07-10 11:57:59,092][26022] Updated weights on worker 0-0, policy_version 713688 (0.00093) [2022-07-10 11:58:01,232][26022] Updated weights on worker 0-0, policy_version 713698 (0.00096) [2022-07-10 11:58:03,195][25689] Fps is (10 sec: 5268.4, 60 sec: 5538.3, 300 sec: 5527.6). Total num frames: 730835968. Throughput: 0: 4981.9. Samples: 730832926. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:58:03,195][25689] Avg episode reward: [(0, '-1.839')] [2022-07-10 11:58:03,353][26022] Updated weights on worker 0-0, policy_version 713708 (0.00091) [2022-07-10 11:58:05,273][26022] Updated weights on worker 0-0, policy_version 713718 (0.00103) [2022-07-10 11:58:07,020][26022] Updated weights on worker 0-0, policy_version 713728 (0.00094) [2022-07-10 11:58:08,222][25689] Fps is (10 sec: 5203.2, 60 sec: 5507.5, 300 sec: 5527.5). Total num frames: 730862592. Throughput: 0: 5688.8. Samples: 730863808. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 11:58:08,224][25689] Avg episode reward: [(0, '-1.914')] [2022-07-10 11:58:08,877][26022] Updated weights on worker 0-0, policy_version 713738 (0.00087) [2022-07-10 11:58:10,672][26022] Updated weights on worker 0-0, policy_version 713748 (0.00088) [2022-07-10 11:58:12,650][26022] Updated weights on worker 0-0, policy_version 713758 (0.00097) [2022-07-10 11:58:13,257][25689] Fps is (10 sec: 5393.5, 60 sec: 5505.3, 300 sec: 5517.2). Total num frames: 730890240. Throughput: 0: 5675.6. Samples: 730897046. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:13,259][25689] Avg episode reward: [(0, '-1.695')] [2022-07-10 11:58:14,530][26022] Updated weights on worker 0-0, policy_version 713768 (0.00887) [2022-07-10 11:58:16,491][26022] Updated weights on worker 0-0, policy_version 713778 (0.00098) [2022-07-10 11:58:18,123][26022] Updated weights on worker 0-0, policy_version 713788 (0.00102) [2022-07-10 11:58:18,321][25689] Fps is (10 sec: 5678.4, 60 sec: 5524.3, 300 sec: 5530.0). Total num frames: 730919936. Throughput: 0: 4838.2. Samples: 730913522. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:18,322][25689] Avg episode reward: [(0, '-1.232')] [2022-07-10 11:58:19,987][26022] Updated weights on worker 0-0, policy_version 713798 (0.00088) [2022-07-10 11:58:21,849][26022] Updated weights on worker 0-0, policy_version 713808 (0.00080) [2022-07-10 11:58:23,419][25689] Fps is (10 sec: 5542.4, 60 sec: 5498.8, 300 sec: 5521.8). Total num frames: 730946560. Throughput: 0: 5619.7. Samples: 730946580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:23,420][25689] Avg episode reward: [(0, '-1.817')] [2022-07-10 11:58:23,745][26022] Updated weights on worker 0-0, policy_version 713818 (0.00085) [2022-07-10 11:58:25,585][26022] Updated weights on worker 0-0, policy_version 713828 (0.00090) [2022-07-10 11:58:27,646][26022] Updated weights on worker 0-0, policy_version 713838 (0.00115) [2022-07-10 11:58:28,426][25689] Fps is (10 sec: 5370.7, 60 sec: 5482.3, 300 sec: 5521.8). Total num frames: 730974208. Throughput: 0: 5726.2. Samples: 730979498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:28,428][25689] Avg episode reward: [(0, '-1.106')] [2022-07-10 11:58:29,291][26022] Updated weights on worker 0-0, policy_version 713848 (0.00091) [2022-07-10 11:58:30,132][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 11:58:30,146][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000713853_730985472.pth [2022-07-10 11:58:30,146][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000711907_728992768.pth [2022-07-10 11:58:31,494][26022] Updated weights on worker 0-0, policy_version 713858 (0.00090) [2022-07-10 11:58:33,042][26022] Updated weights on worker 0-0, policy_version 713868 (0.00086) [2022-07-10 11:58:33,512][25689] Fps is (10 sec: 5681.6, 60 sec: 5526.8, 300 sec: 5528.3). Total num frames: 731003904. Throughput: 0: 4877.3. Samples: 730995834. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:33,513][25689] Avg episode reward: [(0, '-1.169')] [2022-07-10 11:58:35,103][26022] Updated weights on worker 0-0, policy_version 713878 (0.00094) [2022-07-10 11:58:36,652][26022] Updated weights on worker 0-0, policy_version 713888 (0.00086) [2022-07-10 11:58:38,550][25689] Fps is (10 sec: 5461.9, 60 sec: 5464.5, 300 sec: 5517.7). Total num frames: 731029504. Throughput: 0: 5706.7. Samples: 731028964. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:38,552][25689] Avg episode reward: [(0, '-1.638')] [2022-07-10 11:58:38,787][26022] Updated weights on worker 0-0, policy_version 713898 (0.00091) [2022-07-10 11:58:40,543][26022] Updated weights on worker 0-0, policy_version 713908 (0.00096) [2022-07-10 11:58:42,265][26022] Updated weights on worker 0-0, policy_version 713918 (0.00098) [2022-07-10 11:58:43,558][25689] Fps is (10 sec: 5402.3, 60 sec: 5500.4, 300 sec: 5521.1). Total num frames: 731058176. Throughput: 0: 5740.1. Samples: 731062180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:43,559][25689] Avg episode reward: [(0, '-2.514')] [2022-07-10 11:58:44,238][26022] Updated weights on worker 0-0, policy_version 713928 (0.00088) [2022-07-10 11:58:46,183][26022] Updated weights on worker 0-0, policy_version 713938 (0.00097) [2022-07-10 11:58:47,972][26022] Updated weights on worker 0-0, policy_version 713948 (0.00090) [2022-07-10 11:58:48,572][25689] Fps is (10 sec: 5722.2, 60 sec: 5518.1, 300 sec: 5521.5). Total num frames: 731086848. Throughput: 0: 4935.1. Samples: 731078918. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:48,572][25689] Avg episode reward: [(0, '-2.474')] [2022-07-10 11:58:49,801][26022] Updated weights on worker 0-0, policy_version 713958 (0.00087) [2022-07-10 11:58:51,579][26022] Updated weights on worker 0-0, policy_version 713968 (0.00093) [2022-07-10 11:58:53,475][26022] Updated weights on worker 0-0, policy_version 713978 (0.00086) [2022-07-10 11:58:53,573][25689] Fps is (10 sec: 5521.3, 60 sec: 5468.3, 300 sec: 5522.3). Total num frames: 731113472. Throughput: 0: 5808.4. Samples: 731112356. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:53,574][25689] Avg episode reward: [(0, '-2.429')] [2022-07-10 11:58:55,332][26022] Updated weights on worker 0-0, policy_version 713988 (0.00095) [2022-07-10 11:58:57,175][26022] Updated weights on worker 0-0, policy_version 713998 (0.00089) [2022-07-10 11:58:58,635][25689] Fps is (10 sec: 5596.3, 60 sec: 5505.4, 300 sec: 5524.9). Total num frames: 731143168. Throughput: 0: 5834.3. Samples: 731146146. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:58:58,636][25689] Avg episode reward: [(0, '-2.336')] [2022-07-10 11:58:58,758][26022] Updated weights on worker 0-0, policy_version 714008 (0.00098) [2022-07-10 11:59:00,705][26022] Updated weights on worker 0-0, policy_version 714018 (0.00087) [2022-07-10 11:59:02,942][26022] Updated weights on worker 0-0, policy_version 714028 (0.00086) [2022-07-10 11:59:03,719][25689] Fps is (10 sec: 5450.0, 60 sec: 5498.6, 300 sec: 5527.2). Total num frames: 731168768. Throughput: 0: 5745.5. Samples: 731178016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:03,720][25689] Avg episode reward: [(0, '-2.214')] [2022-07-10 11:59:04,663][26022] Updated weights on worker 0-0, policy_version 714038 (0.00090) [2022-07-10 11:59:06,598][26022] Updated weights on worker 0-0, policy_version 714048 (0.00088) [2022-07-10 11:59:08,327][26022] Updated weights on worker 0-0, policy_version 714058 (0.00094) [2022-07-10 11:59:08,768][25689] Fps is (10 sec: 5356.0, 60 sec: 5530.5, 300 sec: 5523.2). Total num frames: 731197440. Throughput: 0: 5740.0. Samples: 731194846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:08,769][25689] Avg episode reward: [(0, '-1.626')] [2022-07-10 11:59:10,144][26022] Updated weights on worker 0-0, policy_version 714068 (0.00091) [2022-07-10 11:59:12,279][26022] Updated weights on worker 0-0, policy_version 714078 (0.00096) [2022-07-10 11:59:13,796][25689] Fps is (10 sec: 5588.9, 60 sec: 5531.1, 300 sec: 5527.5). Total num frames: 731225088. Throughput: 0: 5735.8. Samples: 731228352. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:13,797][25689] Avg episode reward: [(0, '-0.769')] [2022-07-10 11:59:13,988][26022] Updated weights on worker 0-0, policy_version 714088 (0.00095) [2022-07-10 11:59:15,693][26022] Updated weights on worker 0-0, policy_version 714098 (0.00092) [2022-07-10 11:59:17,682][26022] Updated weights on worker 0-0, policy_version 714108 (0.00070) [2022-07-10 11:59:18,855][25689] Fps is (10 sec: 5482.1, 60 sec: 5497.8, 300 sec: 5521.1). Total num frames: 731252736. Throughput: 0: 5725.9. Samples: 731261922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:18,855][25689] Avg episode reward: [(0, '-0.421')] [2022-07-10 11:59:19,304][26022] Updated weights on worker 0-0, policy_version 714118 (0.00088) [2022-07-10 11:59:21,312][26022] Updated weights on worker 0-0, policy_version 714128 (0.00083) [2022-07-10 11:59:23,085][26022] Updated weights on worker 0-0, policy_version 714138 (0.00083) [2022-07-10 11:59:23,903][25689] Fps is (10 sec: 5572.5, 60 sec: 5536.1, 300 sec: 5527.6). Total num frames: 731281408. Throughput: 0: 4993.3. Samples: 731278800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:23,904][25689] Avg episode reward: [(0, '-1.022')] [2022-07-10 11:59:25,046][26022] Updated weights on worker 0-0, policy_version 714148 (0.00082) [2022-07-10 11:59:26,837][26022] Updated weights on worker 0-0, policy_version 714158 (0.00085) [2022-07-10 11:59:28,875][26022] Updated weights on worker 0-0, policy_version 714168 (0.00091) [2022-07-10 11:59:28,912][25689] Fps is (10 sec: 5599.9, 60 sec: 5536.0, 300 sec: 5527.8). Total num frames: 731309056. Throughput: 0: 5812.8. Samples: 731311938. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:28,913][25689] Avg episode reward: [(0, '-1.742')] [2022-07-10 11:59:30,509][26022] Updated weights on worker 0-0, policy_version 714178 (0.00097) [2022-07-10 11:59:32,397][26022] Updated weights on worker 0-0, policy_version 714188 (0.00085) [2022-07-10 11:59:33,934][25689] Fps is (10 sec: 5512.8, 60 sec: 5508.0, 300 sec: 5526.2). Total num frames: 731336704. Throughput: 0: 5828.0. Samples: 731345712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:33,934][25689] Avg episode reward: [(0, '-1.285')] [2022-07-10 11:59:33,952][26022] Updated weights on worker 0-0, policy_version 714198 (0.00082) [2022-07-10 11:59:36,081][26022] Updated weights on worker 0-0, policy_version 714208 (0.00090) [2022-07-10 11:59:37,783][26022] Updated weights on worker 0-0, policy_version 714218 (0.00083) [2022-07-10 11:59:39,005][25689] Fps is (10 sec: 5478.5, 60 sec: 5538.8, 300 sec: 5519.0). Total num frames: 731364352. Throughput: 0: 4993.9. Samples: 731362552. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:39,007][25689] Avg episode reward: [(0, '-1.253')] [2022-07-10 11:59:39,583][26022] Updated weights on worker 0-0, policy_version 714228 (0.00082) [2022-07-10 11:59:41,402][26022] Updated weights on worker 0-0, policy_version 714238 (0.00053) [2022-07-10 11:59:43,223][26022] Updated weights on worker 0-0, policy_version 714248 (0.00081) [2022-07-10 11:59:44,029][25689] Fps is (10 sec: 5680.0, 60 sec: 5554.2, 300 sec: 5526.3). Total num frames: 731394048. Throughput: 0: 5827.8. Samples: 731396092. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:44,030][25689] Avg episode reward: [(0, '-1.070')] [2022-07-10 11:59:45,212][26022] Updated weights on worker 0-0, policy_version 714258 (0.00094) [2022-07-10 11:59:46,908][26022] Updated weights on worker 0-0, policy_version 714268 (0.00085) [2022-07-10 11:59:48,578][26022] Updated weights on worker 0-0, policy_version 714278 (0.00086) [2022-07-10 11:59:49,126][25689] Fps is (10 sec: 5767.0, 60 sec: 5546.6, 300 sec: 5528.6). Total num frames: 731422720. Throughput: 0: 5834.5. Samples: 731429878. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:49,127][25689] Avg episode reward: [(0, '-1.594')] [2022-07-10 11:59:50,621][26022] Updated weights on worker 0-0, policy_version 714288 (0.00089) [2022-07-10 11:59:52,415][26022] Updated weights on worker 0-0, policy_version 714298 (0.00083) [2022-07-10 11:59:54,163][25689] Fps is (10 sec: 5557.9, 60 sec: 5560.3, 300 sec: 5526.9). Total num frames: 731450368. Throughput: 0: 4991.0. Samples: 731446676. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:54,163][25689] Avg episode reward: [(0, '-1.187')] [2022-07-10 11:59:54,212][26022] Updated weights on worker 0-0, policy_version 714308 (0.00085) [2022-07-10 11:59:56,148][26022] Updated weights on worker 0-0, policy_version 714318 (0.00088) [2022-07-10 11:59:58,014][26022] Updated weights on worker 0-0, policy_version 714328 (0.00086) [2022-07-10 11:59:59,216][25689] Fps is (10 sec: 5581.8, 60 sec: 5544.2, 300 sec: 5529.7). Total num frames: 731479040. Throughput: 0: 5803.0. Samples: 731479836. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 11:59:59,217][25689] Avg episode reward: [(0, '-1.527')] [2022-07-10 12:00:00,007][26022] Updated weights on worker 0-0, policy_version 714338 (0.00089) [2022-07-10 12:00:01,711][26022] Updated weights on worker 0-0, policy_version 714348 (0.00094) [2022-07-10 12:00:03,935][26022] Updated weights on worker 0-0, policy_version 714358 (0.00085) [2022-07-10 12:00:04,308][25689] Fps is (10 sec: 5349.6, 60 sec: 5543.5, 300 sec: 5528.0). Total num frames: 731504640. Throughput: 0: 5680.9. Samples: 731511292. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:04,308][25689] Avg episode reward: [(0, '-3.395')] [2022-07-10 12:00:05,776][26022] Updated weights on worker 0-0, policy_version 714368 (0.00085) [2022-07-10 12:00:07,395][26022] Updated weights on worker 0-0, policy_version 714378 (0.00078) [2022-07-10 12:00:09,320][25689] Fps is (10 sec: 5371.6, 60 sec: 5546.9, 300 sec: 5532.9). Total num frames: 731533312. Throughput: 0: 4865.5. Samples: 731528128. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:09,320][25689] Avg episode reward: [(0, '-5.264')] [2022-07-10 12:00:09,323][26022] Updated weights on worker 0-0, policy_version 714388 (0.00085) [2022-07-10 12:00:11,253][26022] Updated weights on worker 0-0, policy_version 714398 (0.00052) [2022-07-10 12:00:13,055][26022] Updated weights on worker 0-0, policy_version 714408 (0.00088) [2022-07-10 12:00:14,326][25689] Fps is (10 sec: 5519.4, 60 sec: 5531.9, 300 sec: 5523.9). Total num frames: 731559936. Throughput: 0: 5689.4. Samples: 731561396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:14,327][25689] Avg episode reward: [(0, '-5.026')] [2022-07-10 12:00:15,078][26022] Updated weights on worker 0-0, policy_version 714418 (0.00091) [2022-07-10 12:00:16,560][26022] Updated weights on worker 0-0, policy_version 714428 (0.00081) [2022-07-10 12:00:18,714][26022] Updated weights on worker 0-0, policy_version 714438 (0.00091) [2022-07-10 12:00:19,403][25689] Fps is (10 sec: 5585.7, 60 sec: 5564.1, 300 sec: 5529.8). Total num frames: 731589632. Throughput: 0: 5710.8. Samples: 731595116. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:19,403][25689] Avg episode reward: [(0, '-5.223')] [2022-07-10 12:00:20,378][26022] Updated weights on worker 0-0, policy_version 714448 (0.00084) [2022-07-10 12:00:22,291][26022] Updated weights on worker 0-0, policy_version 714458 (0.00082) [2022-07-10 12:00:23,827][26022] Updated weights on worker 0-0, policy_version 714468 (0.00086) [2022-07-10 12:00:24,418][25689] Fps is (10 sec: 5581.0, 60 sec: 5533.3, 300 sec: 5526.7). Total num frames: 731616256. Throughput: 0: 5014.4. Samples: 731612130. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:24,423][25689] Avg episode reward: [(0, '-6.244')] [2022-07-10 12:00:25,907][26022] Updated weights on worker 0-0, policy_version 714478 (0.00093) [2022-07-10 12:00:27,936][26022] Updated weights on worker 0-0, policy_version 714488 (0.00095) [2022-07-10 12:00:29,455][25689] Fps is (10 sec: 5399.1, 60 sec: 5530.8, 300 sec: 5523.4). Total num frames: 731643904. Throughput: 0: 5834.9. Samples: 731645614. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:29,455][25689] Avg episode reward: [(0, '-6.201')] [2022-07-10 12:00:29,662][26022] Updated weights on worker 0-0, policy_version 714498 (0.00091) [2022-07-10 12:00:30,149][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:00:30,178][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000714502_731650048.pth [2022-07-10 12:00:30,178][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000712555_729656320.pth [2022-07-10 12:00:31,553][26022] Updated weights on worker 0-0, policy_version 714508 (0.00085) [2022-07-10 12:00:33,239][26022] Updated weights on worker 0-0, policy_version 714518 (0.00087) [2022-07-10 12:00:34,466][25689] Fps is (10 sec: 5502.8, 60 sec: 5531.7, 300 sec: 5524.0). Total num frames: 731671552. Throughput: 0: 5829.3. Samples: 731678798. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:34,467][25689] Avg episode reward: [(0, '-6.065')] [2022-07-10 12:00:35,232][26022] Updated weights on worker 0-0, policy_version 714528 (0.00095) [2022-07-10 12:00:37,258][26022] Updated weights on worker 0-0, policy_version 714538 (0.00083) [2022-07-10 12:00:38,767][26022] Updated weights on worker 0-0, policy_version 714548 (0.00090) [2022-07-10 12:00:39,575][25689] Fps is (10 sec: 5666.3, 60 sec: 5562.1, 300 sec: 5525.7). Total num frames: 731701248. Throughput: 0: 4979.9. Samples: 731695572. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:39,575][25689] Avg episode reward: [(0, '-5.278')] [2022-07-10 12:00:40,819][26022] Updated weights on worker 0-0, policy_version 714558 (0.00083) [2022-07-10 12:00:42,322][26022] Updated weights on worker 0-0, policy_version 714568 (0.00092) [2022-07-10 12:00:44,318][26022] Updated weights on worker 0-0, policy_version 714578 (0.00083) [2022-07-10 12:00:44,605][25689] Fps is (10 sec: 5655.8, 60 sec: 5527.7, 300 sec: 5533.4). Total num frames: 731728896. Throughput: 0: 5782.1. Samples: 731728858. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:44,606][25689] Avg episode reward: [(0, '-3.487')] [2022-07-10 12:00:46,208][26022] Updated weights on worker 0-0, policy_version 714588 (0.00092) [2022-07-10 12:00:48,154][26022] Updated weights on worker 0-0, policy_version 714598 (0.00088) [2022-07-10 12:00:49,631][25689] Fps is (10 sec: 5498.6, 60 sec: 5517.3, 300 sec: 5526.4). Total num frames: 731756544. Throughput: 0: 5790.5. Samples: 731762446. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:49,633][25689] Avg episode reward: [(0, '-2.969')] [2022-07-10 12:00:49,891][26022] Updated weights on worker 0-0, policy_version 714608 (0.00095) [2022-07-10 12:00:51,708][26022] Updated weights on worker 0-0, policy_version 714618 (0.00092) [2022-07-10 12:00:53,486][26022] Updated weights on worker 0-0, policy_version 714628 (0.00400) [2022-07-10 12:00:54,729][25689] Fps is (10 sec: 5563.4, 60 sec: 5528.7, 300 sec: 5532.6). Total num frames: 731785216. Throughput: 0: 4951.2. Samples: 731779120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:54,730][25689] Avg episode reward: [(0, '-2.114')] [2022-07-10 12:00:55,407][26022] Updated weights on worker 0-0, policy_version 714638 (0.00089) [2022-07-10 12:00:57,191][26022] Updated weights on worker 0-0, policy_version 714648 (0.00095) [2022-07-10 12:00:59,153][26022] Updated weights on worker 0-0, policy_version 714658 (0.00091) [2022-07-10 12:00:59,796][25689] Fps is (10 sec: 5641.6, 60 sec: 5527.4, 300 sec: 5538.6). Total num frames: 731813888. Throughput: 0: 5788.9. Samples: 731812628. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:00:59,798][25689] Avg episode reward: [(0, '-0.966')] [2022-07-10 12:01:01,049][26022] Updated weights on worker 0-0, policy_version 714668 (0.00085) [2022-07-10 12:01:03,348][26022] Updated weights on worker 0-0, policy_version 714678 (0.00089) [2022-07-10 12:01:04,821][25689] Fps is (10 sec: 5276.0, 60 sec: 5516.6, 300 sec: 5527.9). Total num frames: 731838464. Throughput: 0: 5650.9. Samples: 731843094. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:01:04,823][25689] Avg episode reward: [(0, '-0.707')] [2022-07-10 12:01:04,959][26022] Updated weights on worker 0-0, policy_version 714688 (0.00087) [2022-07-10 12:01:06,995][26022] Updated weights on worker 0-0, policy_version 714698 (0.00088) [2022-07-10 12:01:08,624][26022] Updated weights on worker 0-0, policy_version 714708 (0.00093) [2022-07-10 12:01:09,843][25689] Fps is (10 sec: 5198.0, 60 sec: 5498.7, 300 sec: 5524.3). Total num frames: 731866112. Throughput: 0: 4838.4. Samples: 731860238. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:01:09,843][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 12:01:10,567][26022] Updated weights on worker 0-0, policy_version 714718 (0.00089) [2022-07-10 12:01:12,151][26022] Updated weights on worker 0-0, policy_version 714728 (0.00092) [2022-07-10 12:01:14,323][26022] Updated weights on worker 0-0, policy_version 714738 (0.00091) [2022-07-10 12:01:14,846][25689] Fps is (10 sec: 5617.9, 60 sec: 5532.9, 300 sec: 5528.8). Total num frames: 731894784. Throughput: 0: 5701.4. Samples: 731893818. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:01:14,847][25689] Avg episode reward: [(0, '-1.863')] [2022-07-10 12:01:16,001][26022] Updated weights on worker 0-0, policy_version 714748 (0.00086) [2022-07-10 12:01:17,799][26022] Updated weights on worker 0-0, policy_version 714758 (0.00087) [2022-07-10 12:01:19,883][26022] Updated weights on worker 0-0, policy_version 714768 (0.00092) [2022-07-10 12:01:19,926][25689] Fps is (10 sec: 5585.3, 60 sec: 5498.7, 300 sec: 5527.5). Total num frames: 731922432. Throughput: 0: 5699.4. Samples: 731927360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:01:19,927][25689] Avg episode reward: [(0, '-2.275')] [2022-07-10 12:01:21,467][26022] Updated weights on worker 0-0, policy_version 714778 (0.00089) [2022-07-10 12:01:23,486][26022] Updated weights on worker 0-0, policy_version 714788 (0.00084) [2022-07-10 12:01:24,939][25689] Fps is (10 sec: 5681.8, 60 sec: 5549.7, 300 sec: 5531.2). Total num frames: 731952128. Throughput: 0: 5018.8. Samples: 731944060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:01:24,939][25689] Avg episode reward: [(0, '-2.117')] [2022-07-10 12:01:25,062][26022] Updated weights on worker 0-0, policy_version 714798 (0.00090) [2022-07-10 12:01:27,198][26022] Updated weights on worker 0-0, policy_version 714808 (0.00087) [2022-07-10 12:01:28,897][26022] Updated weights on worker 0-0, policy_version 714818 (0.00089) [2022-07-10 12:01:29,956][25689] Fps is (10 sec: 5615.1, 60 sec: 5534.5, 300 sec: 5531.3). Total num frames: 731978752. Throughput: 0: 5829.2. Samples: 731977484. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-10 12:01:29,957][25689] Avg episode reward: [(0, '-0.991')] [2022-07-10 12:01:30,827][26022] Updated weights on worker 0-0, policy_version 714828 (0.00108) [2022-07-10 12:01:32,579][26022] Updated weights on worker 0-0, policy_version 714838 (0.00091) [2022-07-10 12:01:34,702][26022] Updated weights on worker 0-0, policy_version 714848 (0.00086) [2022-07-10 12:01:34,967][25689] Fps is (10 sec: 5412.1, 60 sec: 5534.6, 300 sec: 5525.9). Total num frames: 732006400. Throughput: 0: 5796.6. Samples: 732010450. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:01:34,967][25689] Avg episode reward: [(0, '-1.289')] [2022-07-10 12:01:36,336][26022] Updated weights on worker 0-0, policy_version 714858 (0.00084) [2022-07-10 12:01:38,262][26022] Updated weights on worker 0-0, policy_version 714868 (0.00091) [2022-07-10 12:01:40,003][26022] Updated weights on worker 0-0, policy_version 714878 (0.00094) [2022-07-10 12:01:40,034][25689] Fps is (10 sec: 5588.9, 60 sec: 5521.5, 300 sec: 5535.9). Total num frames: 732035072. Throughput: 0: 5791.4. Samples: 732043810. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:01:40,034][25689] Avg episode reward: [(0, '-1.646')] [2022-07-10 12:01:41,776][26022] Updated weights on worker 0-0, policy_version 714888 (0.00090) [2022-07-10 12:01:43,866][26022] Updated weights on worker 0-0, policy_version 714898 (0.00084) [2022-07-10 12:01:45,087][25689] Fps is (10 sec: 5565.3, 60 sec: 5519.5, 300 sec: 5528.6). Total num frames: 732062720. Throughput: 0: 5781.8. Samples: 732060550. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:01:45,087][25689] Avg episode reward: [(0, '-1.353')] [2022-07-10 12:01:45,499][26022] Updated weights on worker 0-0, policy_version 714908 (0.00083) [2022-07-10 12:01:47,584][26022] Updated weights on worker 0-0, policy_version 714918 (0.00090) [2022-07-10 12:01:49,184][26022] Updated weights on worker 0-0, policy_version 714928 (0.00086) [2022-07-10 12:01:50,132][25689] Fps is (10 sec: 5374.4, 60 sec: 5500.8, 300 sec: 5520.9). Total num frames: 732089344. Throughput: 0: 5768.5. Samples: 732093866. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:01:50,132][25689] Avg episode reward: [(0, '-1.605')] [2022-07-10 12:01:51,136][26022] Updated weights on worker 0-0, policy_version 714938 (0.00085) [2022-07-10 12:01:53,244][26022] Updated weights on worker 0-0, policy_version 714948 (0.00095) [2022-07-10 12:01:54,772][26022] Updated weights on worker 0-0, policy_version 714958 (0.00091) [2022-07-10 12:01:55,166][25689] Fps is (10 sec: 5587.4, 60 sec: 5523.4, 300 sec: 5528.8). Total num frames: 732119040. Throughput: 0: 5795.2. Samples: 732127510. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:01:55,168][25689] Avg episode reward: [(0, '-2.231')] [2022-07-10 12:01:56,899][26022] Updated weights on worker 0-0, policy_version 714968 (0.00083) [2022-07-10 12:01:58,507][26022] Updated weights on worker 0-0, policy_version 714978 (0.00084) [2022-07-10 12:02:00,251][25689] Fps is (10 sec: 5666.6, 60 sec: 5504.9, 300 sec: 5531.0). Total num frames: 732146688. Throughput: 0: 4967.2. Samples: 732144236. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:00,253][25689] Avg episode reward: [(0, '-2.113')] [2022-07-10 12:02:00,411][26022] Updated weights on worker 0-0, policy_version 714988 (0.00052) [2022-07-10 12:02:02,501][26022] Updated weights on worker 0-0, policy_version 714998 (0.00084) [2022-07-10 12:02:04,573][26022] Updated weights on worker 0-0, policy_version 715008 (0.00090) [2022-07-10 12:02:05,258][25689] Fps is (10 sec: 5174.9, 60 sec: 5506.5, 300 sec: 5518.2). Total num frames: 732171264. Throughput: 0: 5694.4. Samples: 732175416. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:05,259][25689] Avg episode reward: [(0, '-2.859')] [2022-07-10 12:02:06,194][26022] Updated weights on worker 0-0, policy_version 715018 (0.00087) [2022-07-10 12:02:08,278][26022] Updated weights on worker 0-0, policy_version 715028 (0.00092) [2022-07-10 12:02:09,996][26022] Updated weights on worker 0-0, policy_version 715038 (0.00092) [2022-07-10 12:02:10,260][25689] Fps is (10 sec: 5422.3, 60 sec: 5542.2, 300 sec: 5525.3). Total num frames: 732200960. Throughput: 0: 5697.0. Samples: 732208538. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:10,262][25689] Avg episode reward: [(0, '-2.075')] [2022-07-10 12:02:11,927][26022] Updated weights on worker 0-0, policy_version 715048 (0.00094) [2022-07-10 12:02:13,684][26022] Updated weights on worker 0-0, policy_version 715058 (0.00087) [2022-07-10 12:02:15,289][25689] Fps is (10 sec: 5614.5, 60 sec: 5506.0, 300 sec: 5519.5). Total num frames: 732227584. Throughput: 0: 4856.5. Samples: 732225238. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:15,291][25689] Avg episode reward: [(0, '-1.980')] [2022-07-10 12:02:15,669][26022] Updated weights on worker 0-0, policy_version 715068 (0.00089) [2022-07-10 12:02:17,326][26022] Updated weights on worker 0-0, policy_version 715078 (0.00105) [2022-07-10 12:02:19,336][26022] Updated weights on worker 0-0, policy_version 715088 (0.00086) [2022-07-10 12:02:20,398][25689] Fps is (10 sec: 5454.3, 60 sec: 5520.3, 300 sec: 5521.0). Total num frames: 732256256. Throughput: 0: 5666.7. Samples: 732258404. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:20,400][25689] Avg episode reward: [(0, '-1.669')] [2022-07-10 12:02:20,971][26022] Updated weights on worker 0-0, policy_version 715098 (0.00093) [2022-07-10 12:02:23,019][26022] Updated weights on worker 0-0, policy_version 715108 (0.00085) [2022-07-10 12:02:24,717][26022] Updated weights on worker 0-0, policy_version 715118 (0.00085) [2022-07-10 12:02:25,424][25689] Fps is (10 sec: 5456.1, 60 sec: 5468.3, 300 sec: 5513.8). Total num frames: 732282880. Throughput: 0: 5749.0. Samples: 732291350. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:25,425][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 12:02:26,819][26022] Updated weights on worker 0-0, policy_version 715128 (0.00093) [2022-07-10 12:02:28,552][26022] Updated weights on worker 0-0, policy_version 715138 (0.00086) [2022-07-10 12:02:30,218][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:02:30,230][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000715147_732310528.pth [2022-07-10 12:02:30,236][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000713203_730319872.pth [2022-07-10 12:02:30,362][26022] Updated weights on worker 0-0, policy_version 715148 (0.00092) [2022-07-10 12:02:30,438][25689] Fps is (10 sec: 5507.5, 60 sec: 5502.5, 300 sec: 5520.8). Total num frames: 732311552. Throughput: 0: 4926.0. Samples: 732307936. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:30,440][25689] Avg episode reward: [(0, '-1.643')] [2022-07-10 12:02:32,520][26022] Updated weights on worker 0-0, policy_version 715158 (0.00087) [2022-07-10 12:02:34,094][26022] Updated weights on worker 0-0, policy_version 715168 (0.00084) [2022-07-10 12:02:35,446][25689] Fps is (10 sec: 5619.7, 60 sec: 5502.7, 300 sec: 5515.5). Total num frames: 732339200. Throughput: 0: 5740.3. Samples: 732340942. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:35,447][25689] Avg episode reward: [(0, '-1.337')] [2022-07-10 12:02:36,202][26022] Updated weights on worker 0-0, policy_version 715178 (0.00086) [2022-07-10 12:02:37,701][26022] Updated weights on worker 0-0, policy_version 715188 (0.00086) [2022-07-10 12:02:39,797][26022] Updated weights on worker 0-0, policy_version 715198 (0.00087) [2022-07-10 12:02:40,574][25689] Fps is (10 sec: 5556.6, 60 sec: 5497.1, 300 sec: 5520.6). Total num frames: 732367872. Throughput: 0: 5744.4. Samples: 732374302. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:40,575][25689] Avg episode reward: [(0, '-1.618')] [2022-07-10 12:02:41,719][26022] Updated weights on worker 0-0, policy_version 715208 (0.00084) [2022-07-10 12:02:43,200][26022] Updated weights on worker 0-0, policy_version 715218 (0.00089) [2022-07-10 12:02:45,548][26022] Updated weights on worker 0-0, policy_version 715228 (0.00086) [2022-07-10 12:02:45,644][25689] Fps is (10 sec: 5321.7, 60 sec: 5461.8, 300 sec: 5512.8). Total num frames: 732393472. Throughput: 0: 4928.9. Samples: 732391012. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:45,645][25689] Avg episode reward: [(0, '-1.879')] [2022-07-10 12:02:46,763][26022] Updated weights on worker 0-0, policy_version 715238 (0.00091) [2022-07-10 12:02:48,983][26022] Updated weights on worker 0-0, policy_version 715248 (0.00090) [2022-07-10 12:02:50,588][26022] Updated weights on worker 0-0, policy_version 715258 (0.00090) [2022-07-10 12:02:50,672][25689] Fps is (10 sec: 5577.7, 60 sec: 5531.0, 300 sec: 5516.0). Total num frames: 732424192. Throughput: 0: 5765.0. Samples: 732424580. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:50,672][25689] Avg episode reward: [(0, '-3.664')] [2022-07-10 12:02:52,495][26022] Updated weights on worker 0-0, policy_version 715268 (0.00091) [2022-07-10 12:02:54,071][26022] Updated weights on worker 0-0, policy_version 715278 (0.00090) [2022-07-10 12:02:55,674][25689] Fps is (10 sec: 5921.9, 60 sec: 5517.1, 300 sec: 5521.2). Total num frames: 732452864. Throughput: 0: 5799.2. Samples: 732458246. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:02:55,674][25689] Avg episode reward: [(0, '-2.800')] [2022-07-10 12:02:56,282][26022] Updated weights on worker 0-0, policy_version 715288 (0.00091) [2022-07-10 12:02:57,953][26022] Updated weights on worker 0-0, policy_version 715298 (0.00088) [2022-07-10 12:03:00,016][26022] Updated weights on worker 0-0, policy_version 715308 (0.00091) [2022-07-10 12:03:00,795][25689] Fps is (10 sec: 5462.3, 60 sec: 5496.9, 300 sec: 5522.6). Total num frames: 732479488. Throughput: 0: 4985.3. Samples: 732475108. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:00,795][25689] Avg episode reward: [(0, '-2.996')] [2022-07-10 12:03:01,473][26022] Updated weights on worker 0-0, policy_version 715318 (0.00085) [2022-07-10 12:03:04,132][26022] Updated weights on worker 0-0, policy_version 715328 (0.00104) [2022-07-10 12:03:05,641][26022] Updated weights on worker 0-0, policy_version 715338 (0.00089) [2022-07-10 12:03:05,804][25689] Fps is (10 sec: 5256.3, 60 sec: 5530.5, 300 sec: 5522.9). Total num frames: 732506112. Throughput: 0: 5709.6. Samples: 732506114. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:05,804][25689] Avg episode reward: [(0, '-3.680')] [2022-07-10 12:03:07,643][26022] Updated weights on worker 0-0, policy_version 715348 (0.00087) [2022-07-10 12:03:09,519][26022] Updated weights on worker 0-0, policy_version 715358 (0.00085) [2022-07-10 12:03:10,825][25689] Fps is (10 sec: 5309.0, 60 sec: 5478.1, 300 sec: 5519.7). Total num frames: 732532736. Throughput: 0: 5708.0. Samples: 732539614. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:10,827][25689] Avg episode reward: [(0, '-3.460')] [2022-07-10 12:03:11,262][26022] Updated weights on worker 0-0, policy_version 715368 (0.00087) [2022-07-10 12:03:13,115][26022] Updated weights on worker 0-0, policy_version 715378 (0.00085) [2022-07-10 12:03:14,884][26022] Updated weights on worker 0-0, policy_version 715388 (0.00084) [2022-07-10 12:03:15,855][25689] Fps is (10 sec: 5501.5, 60 sec: 5511.8, 300 sec: 5516.9). Total num frames: 732561408. Throughput: 0: 4861.0. Samples: 732556348. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:15,857][25689] Avg episode reward: [(0, '-4.251')] [2022-07-10 12:03:16,722][26022] Updated weights on worker 0-0, policy_version 715398 (0.00083) [2022-07-10 12:03:18,467][26022] Updated weights on worker 0-0, policy_version 715408 (0.00087) [2022-07-10 12:03:20,329][26022] Updated weights on worker 0-0, policy_version 715418 (0.00088) [2022-07-10 12:03:20,924][25689] Fps is (10 sec: 5677.8, 60 sec: 5515.4, 300 sec: 5524.3). Total num frames: 732590080. Throughput: 0: 5711.7. Samples: 732590082. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:20,925][25689] Avg episode reward: [(0, '-4.545')] [2022-07-10 12:03:22,360][26022] Updated weights on worker 0-0, policy_version 715428 (0.00087) [2022-07-10 12:03:24,051][26022] Updated weights on worker 0-0, policy_version 715438 (0.00083) [2022-07-10 12:03:25,925][26022] Updated weights on worker 0-0, policy_version 715448 (0.00084) [2022-07-10 12:03:25,927][25689] Fps is (10 sec: 5693.4, 60 sec: 5551.4, 300 sec: 5527.9). Total num frames: 732618752. Throughput: 0: 5831.0. Samples: 732623452. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:25,927][25689] Avg episode reward: [(0, '-4.299')] [2022-07-10 12:03:27,776][26022] Updated weights on worker 0-0, policy_version 715458 (0.00084) [2022-07-10 12:03:29,705][26022] Updated weights on worker 0-0, policy_version 715468 (0.00095) [2022-07-10 12:03:30,945][25689] Fps is (10 sec: 5518.3, 60 sec: 5517.2, 300 sec: 5518.8). Total num frames: 732645376. Throughput: 0: 4999.4. Samples: 732640204. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:30,946][25689] Avg episode reward: [(0, '-4.041')] [2022-07-10 12:03:31,624][26022] Updated weights on worker 0-0, policy_version 715478 (0.00093) [2022-07-10 12:03:33,348][26022] Updated weights on worker 0-0, policy_version 715488 (0.00097) [2022-07-10 12:03:35,331][26022] Updated weights on worker 0-0, policy_version 715498 (0.00092) [2022-07-10 12:03:35,967][25689] Fps is (10 sec: 5507.6, 60 sec: 5532.8, 300 sec: 5529.4). Total num frames: 732674048. Throughput: 0: 5830.2. Samples: 732673604. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:35,967][25689] Avg episode reward: [(0, '-5.032')] [2022-07-10 12:03:36,991][26022] Updated weights on worker 0-0, policy_version 715508 (0.00095) [2022-07-10 12:03:38,852][26022] Updated weights on worker 0-0, policy_version 715518 (0.00086) [2022-07-10 12:03:40,768][26022] Updated weights on worker 0-0, policy_version 715528 (0.00091) [2022-07-10 12:03:41,019][25689] Fps is (10 sec: 5590.7, 60 sec: 5522.8, 300 sec: 5525.2). Total num frames: 732701696. Throughput: 0: 5811.8. Samples: 732706866. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:41,019][25689] Avg episode reward: [(0, '-5.520')] [2022-07-10 12:03:42,731][26022] Updated weights on worker 0-0, policy_version 715538 (0.00099) [2022-07-10 12:03:44,319][26022] Updated weights on worker 0-0, policy_version 715548 (0.00065) [2022-07-10 12:03:46,038][25689] Fps is (10 sec: 5490.5, 60 sec: 5561.4, 300 sec: 5521.6). Total num frames: 732729344. Throughput: 0: 4982.0. Samples: 732723646. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:46,040][25689] Avg episode reward: [(0, '-6.115')] [2022-07-10 12:03:46,413][26022] Updated weights on worker 0-0, policy_version 715558 (0.00083) [2022-07-10 12:03:47,889][26022] Updated weights on worker 0-0, policy_version 715568 (0.00093) [2022-07-10 12:03:49,966][26022] Updated weights on worker 0-0, policy_version 715578 (0.00085) [2022-07-10 12:03:51,047][25689] Fps is (10 sec: 5514.2, 60 sec: 5512.2, 300 sec: 5524.9). Total num frames: 732756992. Throughput: 0: 5811.2. Samples: 732757020. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:51,047][25689] Avg episode reward: [(0, '-4.499')] [2022-07-10 12:03:51,901][26022] Updated weights on worker 0-0, policy_version 715588 (0.00094) [2022-07-10 12:03:53,682][26022] Updated weights on worker 0-0, policy_version 715598 (0.00088) [2022-07-10 12:03:55,447][26022] Updated weights on worker 0-0, policy_version 715608 (0.00087) [2022-07-10 12:03:56,058][25689] Fps is (10 sec: 5518.5, 60 sec: 5494.4, 300 sec: 5519.0). Total num frames: 732784640. Throughput: 0: 5795.3. Samples: 732790040. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:03:56,059][25689] Avg episode reward: [(0, '-4.507')] [2022-07-10 12:03:57,416][26022] Updated weights on worker 0-0, policy_version 715618 (0.00080) [2022-07-10 12:03:59,164][26022] Updated weights on worker 0-0, policy_version 715628 (0.00088) [2022-07-10 12:04:01,034][26022] Updated weights on worker 0-0, policy_version 715638 (0.00090) [2022-07-10 12:04:01,187][25689] Fps is (10 sec: 5554.2, 60 sec: 5527.6, 300 sec: 5528.5). Total num frames: 732813312. Throughput: 0: 4949.5. Samples: 732806688. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:01,187][25689] Avg episode reward: [(0, '-3.202')] [2022-07-10 12:04:03,171][26022] Updated weights on worker 0-0, policy_version 715648 (0.00087) [2022-07-10 12:04:05,129][26022] Updated weights on worker 0-0, policy_version 715658 (0.00085) [2022-07-10 12:04:06,206][25689] Fps is (10 sec: 5348.1, 60 sec: 5509.7, 300 sec: 5518.7). Total num frames: 732838912. Throughput: 0: 5667.8. Samples: 732837954. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:06,206][25689] Avg episode reward: [(0, '-2.715')] [2022-07-10 12:04:07,088][26022] Updated weights on worker 0-0, policy_version 715668 (0.00088) [2022-07-10 12:04:08,700][26022] Updated weights on worker 0-0, policy_version 715678 (0.00099) [2022-07-10 12:04:10,739][26022] Updated weights on worker 0-0, policy_version 715688 (0.00089) [2022-07-10 12:04:11,243][25689] Fps is (10 sec: 5397.0, 60 sec: 5542.2, 300 sec: 5522.0). Total num frames: 732867584. Throughput: 0: 5665.3. Samples: 732871436. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:11,243][25689] Avg episode reward: [(0, '-2.515')] [2022-07-10 12:04:12,371][26022] Updated weights on worker 0-0, policy_version 715698 (0.00085) [2022-07-10 12:04:14,348][26022] Updated weights on worker 0-0, policy_version 715708 (0.00085) [2022-07-10 12:04:16,235][26022] Updated weights on worker 0-0, policy_version 715718 (0.00096) [2022-07-10 12:04:16,258][25689] Fps is (10 sec: 5602.6, 60 sec: 5526.6, 300 sec: 5522.8). Total num frames: 732895232. Throughput: 0: 5684.8. Samples: 732904876. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:16,259][25689] Avg episode reward: [(0, '-3.646')] [2022-07-10 12:04:17,945][26022] Updated weights on worker 0-0, policy_version 715728 (0.00089) [2022-07-10 12:04:19,878][26022] Updated weights on worker 0-0, policy_version 715738 (0.00088) [2022-07-10 12:04:21,393][25689] Fps is (10 sec: 5548.7, 60 sec: 5520.6, 300 sec: 5521.2). Total num frames: 732923904. Throughput: 0: 5689.5. Samples: 732921650. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:21,393][25689] Avg episode reward: [(0, '-2.487')] [2022-07-10 12:04:21,747][26022] Updated weights on worker 0-0, policy_version 715748 (0.00082) [2022-07-10 12:04:23,437][26022] Updated weights on worker 0-0, policy_version 715758 (0.00089) [2022-07-10 12:04:25,555][26022] Updated weights on worker 0-0, policy_version 715768 (0.00092) [2022-07-10 12:04:26,408][25689] Fps is (10 sec: 5549.1, 60 sec: 5502.6, 300 sec: 5521.1). Total num frames: 732951552. Throughput: 0: 5784.4. Samples: 732954810. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:26,408][25689] Avg episode reward: [(0, '-2.695')] [2022-07-10 12:04:27,238][26022] Updated weights on worker 0-0, policy_version 715778 (0.00086) [2022-07-10 12:04:29,143][26022] Updated weights on worker 0-0, policy_version 715788 (0.00086) [2022-07-10 12:04:30,274][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:04:30,290][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000715794_732973056.pth [2022-07-10 12:04:30,291][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000713853_730985472.pth [2022-07-10 12:04:31,104][26022] Updated weights on worker 0-0, policy_version 715798 (0.00091) [2022-07-10 12:04:31,478][25689] Fps is (10 sec: 5482.8, 60 sec: 5514.7, 300 sec: 5520.2). Total num frames: 732979200. Throughput: 0: 5751.4. Samples: 732987818. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:31,479][25689] Avg episode reward: [(0, '-2.694')] [2022-07-10 12:04:32,762][26022] Updated weights on worker 0-0, policy_version 715808 (0.00087) [2022-07-10 12:04:34,790][26022] Updated weights on worker 0-0, policy_version 715818 (0.00088) [2022-07-10 12:04:36,398][26022] Updated weights on worker 0-0, policy_version 715828 (0.00078) [2022-07-10 12:04:36,494][25689] Fps is (10 sec: 5583.8, 60 sec: 5515.3, 300 sec: 5524.6). Total num frames: 733007872. Throughput: 0: 4921.5. Samples: 733004466. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:36,496][25689] Avg episode reward: [(0, '-2.677')] [2022-07-10 12:04:38,361][26022] Updated weights on worker 0-0, policy_version 715838 (0.00090) [2022-07-10 12:04:40,211][26022] Updated weights on worker 0-0, policy_version 715848 (0.00093) [2022-07-10 12:04:41,569][25689] Fps is (10 sec: 5479.9, 60 sec: 5496.3, 300 sec: 5513.4). Total num frames: 733034496. Throughput: 0: 5768.6. Samples: 733038038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:41,570][25689] Avg episode reward: [(0, '-2.256')] [2022-07-10 12:04:42,015][26022] Updated weights on worker 0-0, policy_version 715858 (0.00084) [2022-07-10 12:04:43,753][26022] Updated weights on worker 0-0, policy_version 715868 (0.00872) [2022-07-10 12:04:45,895][26022] Updated weights on worker 0-0, policy_version 715878 (0.00090) [2022-07-10 12:04:46,571][25689] Fps is (10 sec: 5589.0, 60 sec: 5531.7, 300 sec: 5518.6). Total num frames: 733064192. Throughput: 0: 5783.1. Samples: 733071416. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:46,571][25689] Avg episode reward: [(0, '-0.556')] [2022-07-10 12:04:47,472][26022] Updated weights on worker 0-0, policy_version 715888 (0.00091) [2022-07-10 12:04:49,410][26022] Updated weights on worker 0-0, policy_version 715898 (0.00090) [2022-07-10 12:04:51,189][26022] Updated weights on worker 0-0, policy_version 715908 (0.00089) [2022-07-10 12:04:51,573][25689] Fps is (10 sec: 5629.4, 60 sec: 5515.3, 300 sec: 5515.8). Total num frames: 733090816. Throughput: 0: 4984.8. Samples: 733087990. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:51,574][25689] Avg episode reward: [(0, '0.050')] [2022-07-10 12:04:52,963][26022] Updated weights on worker 0-0, policy_version 715918 (0.00088) [2022-07-10 12:04:55,062][26022] Updated weights on worker 0-0, policy_version 715928 (0.00080) [2022-07-10 12:04:56,577][25689] Fps is (10 sec: 5526.0, 60 sec: 5532.9, 300 sec: 5516.7). Total num frames: 733119488. Throughput: 0: 5847.2. Samples: 733121898. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-10 12:04:56,578][25689] Avg episode reward: [(0, '0.474')] [2022-07-10 12:04:56,679][26022] Updated weights on worker 0-0, policy_version 715938 (0.00091) [2022-07-10 12:04:58,578][26022] Updated weights on worker 0-0, policy_version 715948 (0.00090) [2022-07-10 12:05:00,353][26022] Updated weights on worker 0-0, policy_version 715958 (0.00091) [2022-07-10 12:05:01,639][25689] Fps is (10 sec: 5595.5, 60 sec: 5522.1, 300 sec: 5524.1). Total num frames: 733147136. Throughput: 0: 5840.0. Samples: 733155246. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:01,639][25689] Avg episode reward: [(0, '-0.263')] [2022-07-10 12:05:02,725][26022] Updated weights on worker 0-0, policy_version 715968 (0.00086) [2022-07-10 12:05:04,463][26022] Updated weights on worker 0-0, policy_version 715978 (0.00084) [2022-07-10 12:05:06,327][26022] Updated weights on worker 0-0, policy_version 715988 (0.00405) [2022-07-10 12:05:06,671][25689] Fps is (10 sec: 5377.1, 60 sec: 5537.9, 300 sec: 5516.9). Total num frames: 733173760. Throughput: 0: 4896.3. Samples: 733169830. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:06,671][25689] Avg episode reward: [(0, '-0.564')] [2022-07-10 12:05:08,039][26022] Updated weights on worker 0-0, policy_version 715998 (0.00082) [2022-07-10 12:05:10,097][26022] Updated weights on worker 0-0, policy_version 716008 (0.00090) [2022-07-10 12:05:11,630][26022] Updated weights on worker 0-0, policy_version 716018 (0.00087) [2022-07-10 12:05:11,677][25689] Fps is (10 sec: 5508.3, 60 sec: 5540.7, 300 sec: 5523.8). Total num frames: 733202432. Throughput: 0: 5724.2. Samples: 733203070. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:11,678][25689] Avg episode reward: [(0, '-0.963')] [2022-07-10 12:05:13,781][26022] Updated weights on worker 0-0, policy_version 716028 (0.00085) [2022-07-10 12:05:15,438][26022] Updated weights on worker 0-0, policy_version 716038 (0.00079) [2022-07-10 12:05:16,692][25689] Fps is (10 sec: 5415.7, 60 sec: 5506.9, 300 sec: 5511.1). Total num frames: 733228032. Throughput: 0: 5704.2. Samples: 733236634. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:16,692][25689] Avg episode reward: [(0, '-1.306')] [2022-07-10 12:05:17,479][26022] Updated weights on worker 0-0, policy_version 716048 (0.00090) [2022-07-10 12:05:18,963][26022] Updated weights on worker 0-0, policy_version 716058 (0.00092) [2022-07-10 12:05:21,139][26022] Updated weights on worker 0-0, policy_version 716068 (0.00829) [2022-07-10 12:05:21,735][25689] Fps is (10 sec: 5498.1, 60 sec: 5532.2, 300 sec: 5520.9). Total num frames: 733257728. Throughput: 0: 4873.4. Samples: 733253182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:21,735][25689] Avg episode reward: [(0, '-1.224')] [2022-07-10 12:05:22,820][26022] Updated weights on worker 0-0, policy_version 716078 (0.00086) [2022-07-10 12:05:24,774][26022] Updated weights on worker 0-0, policy_version 716088 (0.00088) [2022-07-10 12:05:26,475][26022] Updated weights on worker 0-0, policy_version 716098 (0.00082) [2022-07-10 12:05:26,741][25689] Fps is (10 sec: 5808.2, 60 sec: 5550.0, 300 sec: 5525.0). Total num frames: 733286400. Throughput: 0: 5842.2. Samples: 733287086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:26,742][25689] Avg episode reward: [(0, '-1.543')] [2022-07-10 12:05:28,292][26022] Updated weights on worker 0-0, policy_version 716108 (0.00086) [2022-07-10 12:05:30,045][26022] Updated weights on worker 0-0, policy_version 716118 (0.00095) [2022-07-10 12:05:31,787][25689] Fps is (10 sec: 5501.0, 60 sec: 5535.3, 300 sec: 5520.9). Total num frames: 733313024. Throughput: 0: 5842.3. Samples: 733320554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:31,787][25689] Avg episode reward: [(0, '-0.524')] [2022-07-10 12:05:32,181][26022] Updated weights on worker 0-0, policy_version 716128 (0.00091) [2022-07-10 12:05:33,847][26022] Updated weights on worker 0-0, policy_version 716138 (0.00089) [2022-07-10 12:05:35,862][26022] Updated weights on worker 0-0, policy_version 716148 (0.00069) [2022-07-10 12:05:36,802][25689] Fps is (10 sec: 5394.5, 60 sec: 5518.4, 300 sec: 5515.7). Total num frames: 733340672. Throughput: 0: 5008.6. Samples: 733337356. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:36,802][25689] Avg episode reward: [(0, '-0.228')] [2022-07-10 12:05:37,313][26022] Updated weights on worker 0-0, policy_version 716158 (0.00438) [2022-07-10 12:05:39,581][26022] Updated weights on worker 0-0, policy_version 716168 (0.00086) [2022-07-10 12:05:41,060][26022] Updated weights on worker 0-0, policy_version 716178 (0.00087) [2022-07-10 12:05:41,930][25689] Fps is (10 sec: 5653.4, 60 sec: 5564.4, 300 sec: 5520.8). Total num frames: 733370368. Throughput: 0: 5836.0. Samples: 733371042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:41,931][25689] Avg episode reward: [(0, '-0.389')] [2022-07-10 12:05:43,226][26022] Updated weights on worker 0-0, policy_version 716188 (0.00090) [2022-07-10 12:05:44,578][26022] Updated weights on worker 0-0, policy_version 716198 (0.00094) [2022-07-10 12:05:46,958][25689] Fps is (10 sec: 5444.5, 60 sec: 5494.1, 300 sec: 5513.9). Total num frames: 733395968. Throughput: 0: 5809.2. Samples: 733404530. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:46,959][25689] Avg episode reward: [(0, '-0.417')] [2022-07-10 12:05:46,999][26022] Updated weights on worker 0-0, policy_version 716208 (0.00088) [2022-07-10 12:05:48,515][26022] Updated weights on worker 0-0, policy_version 716218 (0.00084) [2022-07-10 12:05:50,592][26022] Updated weights on worker 0-0, policy_version 716228 (0.00091) [2022-07-10 12:05:52,019][25689] Fps is (10 sec: 5582.2, 60 sec: 5556.6, 300 sec: 5521.4). Total num frames: 733426688. Throughput: 0: 4975.8. Samples: 733421226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:52,020][25689] Avg episode reward: [(0, '-0.968')] [2022-07-10 12:05:52,094][26022] Updated weights on worker 0-0, policy_version 716238 (0.00088) [2022-07-10 12:05:54,308][26022] Updated weights on worker 0-0, policy_version 716248 (0.00083) [2022-07-10 12:05:55,849][26022] Updated weights on worker 0-0, policy_version 716258 (0.00093) [2022-07-10 12:05:57,074][25689] Fps is (10 sec: 5770.0, 60 sec: 5535.0, 300 sec: 5518.2). Total num frames: 733454336. Throughput: 0: 5791.2. Samples: 733454756. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:05:57,074][25689] Avg episode reward: [(0, '-2.822')] [2022-07-10 12:05:57,764][26022] Updated weights on worker 0-0, policy_version 716268 (0.00087) [2022-07-10 12:05:59,549][26022] Updated weights on worker 0-0, policy_version 716278 (0.00083) [2022-07-10 12:06:01,560][26022] Updated weights on worker 0-0, policy_version 716288 (0.00103) [2022-07-10 12:06:02,206][25689] Fps is (10 sec: 5227.3, 60 sec: 5494.7, 300 sec: 5519.7). Total num frames: 733479936. Throughput: 0: 5776.9. Samples: 733488172. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:02,206][25689] Avg episode reward: [(0, '-2.887')] [2022-07-10 12:06:03,520][26022] Updated weights on worker 0-0, policy_version 716298 (0.00091) [2022-07-10 12:06:05,579][26022] Updated weights on worker 0-0, policy_version 716308 (0.00094) [2022-07-10 12:06:07,237][25689] Fps is (10 sec: 5340.0, 60 sec: 5528.6, 300 sec: 5523.0). Total num frames: 733508608. Throughput: 0: 4856.5. Samples: 733503016. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:07,239][25689] Avg episode reward: [(0, '-3.349')] [2022-07-10 12:06:07,388][26022] Updated weights on worker 0-0, policy_version 716318 (0.00085) [2022-07-10 12:06:09,267][26022] Updated weights on worker 0-0, policy_version 716328 (0.00094) [2022-07-10 12:06:11,043][26022] Updated weights on worker 0-0, policy_version 716338 (0.00099) [2022-07-10 12:06:12,259][25689] Fps is (10 sec: 5602.1, 60 sec: 5510.3, 300 sec: 5519.2). Total num frames: 733536256. Throughput: 0: 5669.1. Samples: 733535968. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:12,261][25689] Avg episode reward: [(0, '-3.133')] [2022-07-10 12:06:12,958][26022] Updated weights on worker 0-0, policy_version 716348 (0.00090) [2022-07-10 12:06:14,841][26022] Updated weights on worker 0-0, policy_version 716358 (0.00084) [2022-07-10 12:06:16,537][26022] Updated weights on worker 0-0, policy_version 716368 (0.00085) [2022-07-10 12:06:17,352][25689] Fps is (10 sec: 5466.6, 60 sec: 5536.9, 300 sec: 5518.9). Total num frames: 733563904. Throughput: 0: 5653.0. Samples: 733569390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:17,354][25689] Avg episode reward: [(0, '-3.191')] [2022-07-10 12:06:18,371][26022] Updated weights on worker 0-0, policy_version 716378 (0.00098) [2022-07-10 12:06:20,127][26022] Updated weights on worker 0-0, policy_version 716388 (0.00090) [2022-07-10 12:06:21,999][26022] Updated weights on worker 0-0, policy_version 716398 (0.00084) [2022-07-10 12:06:22,473][25689] Fps is (10 sec: 5614.2, 60 sec: 5529.8, 300 sec: 5516.9). Total num frames: 733593600. Throughput: 0: 4842.4. Samples: 733586316. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:22,474][25689] Avg episode reward: [(0, '-4.471')] [2022-07-10 12:06:24,096][26022] Updated weights on worker 0-0, policy_version 716408 (0.00089) [2022-07-10 12:06:25,534][26022] Updated weights on worker 0-0, policy_version 716418 (0.00082) [2022-07-10 12:06:27,485][25689] Fps is (10 sec: 5659.1, 60 sec: 5512.4, 300 sec: 5520.5). Total num frames: 733621248. Throughput: 0: 5781.1. Samples: 733620072. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:27,486][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 12:06:27,545][26022] Updated weights on worker 0-0, policy_version 716428 (0.00091) [2022-07-10 12:06:29,198][26022] Updated weights on worker 0-0, policy_version 716438 (0.00605) [2022-07-10 12:06:30,469][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:06:30,497][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000716443_733637632.pth [2022-07-10 12:06:30,498][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000714502_731650048.pth [2022-07-10 12:06:31,410][26022] Updated weights on worker 0-0, policy_version 716448 (0.00093) [2022-07-10 12:06:32,518][25689] Fps is (10 sec: 5606.9, 60 sec: 5547.3, 300 sec: 5523.5). Total num frames: 733649920. Throughput: 0: 5793.5. Samples: 733653338. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:32,519][25689] Avg episode reward: [(0, '-1.679')] [2022-07-10 12:06:33,063][26022] Updated weights on worker 0-0, policy_version 716458 (0.00084) [2022-07-10 12:06:34,903][26022] Updated weights on worker 0-0, policy_version 716468 (0.00086) [2022-07-10 12:06:36,738][26022] Updated weights on worker 0-0, policy_version 716478 (0.00084) [2022-07-10 12:06:37,621][25689] Fps is (10 sec: 5556.7, 60 sec: 5539.3, 300 sec: 5519.4). Total num frames: 733677568. Throughput: 0: 5798.3. Samples: 733686914. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:37,621][25689] Avg episode reward: [(0, '-2.453')] [2022-07-10 12:06:38,881][26022] Updated weights on worker 0-0, policy_version 716488 (0.00093) [2022-07-10 12:06:40,319][26022] Updated weights on worker 0-0, policy_version 716498 (0.00081) [2022-07-10 12:06:42,518][26022] Updated weights on worker 0-0, policy_version 716508 (0.00095) [2022-07-10 12:06:42,664][25689] Fps is (10 sec: 5450.1, 60 sec: 5513.3, 300 sec: 5519.6). Total num frames: 733705216. Throughput: 0: 5799.6. Samples: 733703414. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:42,665][25689] Avg episode reward: [(0, '-1.985')] [2022-07-10 12:06:44,069][26022] Updated weights on worker 0-0, policy_version 716518 (0.00089) [2022-07-10 12:06:46,028][26022] Updated weights on worker 0-0, policy_version 716528 (0.00093) [2022-07-10 12:06:47,698][25689] Fps is (10 sec: 5589.0, 60 sec: 5563.4, 300 sec: 5526.6). Total num frames: 733733888. Throughput: 0: 5788.0. Samples: 733737062. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:47,698][25689] Avg episode reward: [(0, '-2.332')] [2022-07-10 12:06:47,789][26022] Updated weights on worker 0-0, policy_version 716538 (0.00086) [2022-07-10 12:06:49,645][26022] Updated weights on worker 0-0, policy_version 716548 (0.00091) [2022-07-10 12:06:51,468][26022] Updated weights on worker 0-0, policy_version 716558 (0.00096) [2022-07-10 12:06:52,735][25689] Fps is (10 sec: 5592.4, 60 sec: 5515.0, 300 sec: 5519.7). Total num frames: 733761536. Throughput: 0: 5808.4. Samples: 733770766. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:52,735][25689] Avg episode reward: [(0, '-0.284')] [2022-07-10 12:06:53,323][26022] Updated weights on worker 0-0, policy_version 716568 (0.00090) [2022-07-10 12:06:55,090][26022] Updated weights on worker 0-0, policy_version 716578 (0.00097) [2022-07-10 12:06:56,979][26022] Updated weights on worker 0-0, policy_version 716588 (0.00103) [2022-07-10 12:06:57,806][25689] Fps is (10 sec: 5571.7, 60 sec: 5530.4, 300 sec: 5523.4). Total num frames: 733790208. Throughput: 0: 4981.2. Samples: 733787460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:06:57,808][25689] Avg episode reward: [(0, '-0.719')] [2022-07-10 12:06:58,709][26022] Updated weights on worker 0-0, policy_version 716598 (0.00089) [2022-07-10 12:07:00,651][26022] Updated weights on worker 0-0, policy_version 716608 (0.00084) [2022-07-10 12:07:02,756][26022] Updated weights on worker 0-0, policy_version 716618 (0.00090) [2022-07-10 12:07:02,938][25689] Fps is (10 sec: 5419.3, 60 sec: 5547.2, 300 sec: 5528.0). Total num frames: 733816832. Throughput: 0: 5760.1. Samples: 733820196. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:02,939][25689] Avg episode reward: [(0, '-0.837')] [2022-07-10 12:07:04,535][26022] Updated weights on worker 0-0, policy_version 716628 (0.00087) [2022-07-10 12:07:06,448][26022] Updated weights on worker 0-0, policy_version 716638 (0.00091) [2022-07-10 12:07:08,002][25689] Fps is (10 sec: 5423.2, 60 sec: 5544.2, 300 sec: 5523.4). Total num frames: 733845504. Throughput: 0: 5696.3. Samples: 733852722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:08,004][25689] Avg episode reward: [(0, '-1.185')] [2022-07-10 12:07:08,171][26022] Updated weights on worker 0-0, policy_version 716648 (0.00079) [2022-07-10 12:07:10,084][26022] Updated weights on worker 0-0, policy_version 716658 (0.00080) [2022-07-10 12:07:11,879][26022] Updated weights on worker 0-0, policy_version 716668 (0.00088) [2022-07-10 12:07:13,048][25689] Fps is (10 sec: 5570.8, 60 sec: 5542.0, 300 sec: 5526.5). Total num frames: 733873152. Throughput: 0: 4869.5. Samples: 733869682. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:13,049][25689] Avg episode reward: [(0, '-0.850')] [2022-07-10 12:07:13,642][26022] Updated weights on worker 0-0, policy_version 716678 (0.00087) [2022-07-10 12:07:15,604][26022] Updated weights on worker 0-0, policy_version 716688 (0.00093) [2022-07-10 12:07:17,298][26022] Updated weights on worker 0-0, policy_version 716698 (0.00092) [2022-07-10 12:07:18,076][25689] Fps is (10 sec: 5590.8, 60 sec: 5564.9, 300 sec: 5528.0). Total num frames: 733901824. Throughput: 0: 5729.2. Samples: 733903590. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:18,076][25689] Avg episode reward: [(0, '-1.283')] [2022-07-10 12:07:19,322][26022] Updated weights on worker 0-0, policy_version 716708 (0.00086) [2022-07-10 12:07:20,994][26022] Updated weights on worker 0-0, policy_version 716718 (0.00093) [2022-07-10 12:07:22,946][26022] Updated weights on worker 0-0, policy_version 716728 (0.00084) [2022-07-10 12:07:23,148][25689] Fps is (10 sec: 5677.8, 60 sec: 5552.5, 300 sec: 5534.0). Total num frames: 733930496. Throughput: 0: 5776.9. Samples: 733936944. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:23,148][25689] Avg episode reward: [(0, '-1.315')] [2022-07-10 12:07:24,493][26022] Updated weights on worker 0-0, policy_version 716738 (0.00087) [2022-07-10 12:07:26,751][26022] Updated weights on worker 0-0, policy_version 716748 (0.00391) [2022-07-10 12:07:28,214][25689] Fps is (10 sec: 5656.1, 60 sec: 5564.4, 300 sec: 5533.1). Total num frames: 733959168. Throughput: 0: 5000.6. Samples: 733953794. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:28,216][25689] Avg episode reward: [(0, '-1.056')] [2022-07-10 12:07:28,277][26022] Updated weights on worker 0-0, policy_version 716758 (0.00087) [2022-07-10 12:07:30,453][26022] Updated weights on worker 0-0, policy_version 716768 (0.00086) [2022-07-10 12:07:32,190][26022] Updated weights on worker 0-0, policy_version 716778 (0.00093) [2022-07-10 12:07:33,302][25689] Fps is (10 sec: 5445.6, 60 sec: 5525.7, 300 sec: 5528.1). Total num frames: 733985792. Throughput: 0: 5794.7. Samples: 733987048. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:33,303][25689] Avg episode reward: [(0, '-0.936')] [2022-07-10 12:07:34,035][26022] Updated weights on worker 0-0, policy_version 716788 (0.00087) [2022-07-10 12:07:35,836][26022] Updated weights on worker 0-0, policy_version 716798 (0.00093) [2022-07-10 12:07:37,743][26022] Updated weights on worker 0-0, policy_version 716808 (0.00090) [2022-07-10 12:07:38,368][25689] Fps is (10 sec: 5445.7, 60 sec: 5545.8, 300 sec: 5529.3). Total num frames: 734014464. Throughput: 0: 5744.8. Samples: 734020166. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:38,369][25689] Avg episode reward: [(0, '-0.124')] [2022-07-10 12:07:39,671][26022] Updated weights on worker 0-0, policy_version 716818 (0.00090) [2022-07-10 12:07:41,621][26022] Updated weights on worker 0-0, policy_version 716828 (0.00089) [2022-07-10 12:07:43,113][26022] Updated weights on worker 0-0, policy_version 716838 (0.00092) [2022-07-10 12:07:43,464][25689] Fps is (10 sec: 5743.7, 60 sec: 5574.7, 300 sec: 5542.6). Total num frames: 734044160. Throughput: 0: 4912.5. Samples: 734036748. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:43,465][25689] Avg episode reward: [(0, '-1.516')] [2022-07-10 12:07:45,088][26022] Updated weights on worker 0-0, policy_version 716848 (0.00087) [2022-07-10 12:07:46,727][26022] Updated weights on worker 0-0, policy_version 716858 (0.00094) [2022-07-10 12:07:48,546][25689] Fps is (10 sec: 5533.9, 60 sec: 5536.7, 300 sec: 5527.8). Total num frames: 734070784. Throughput: 0: 5743.6. Samples: 734070570. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:48,546][25689] Avg episode reward: [(0, '-1.757')] [2022-07-10 12:07:48,985][26022] Updated weights on worker 0-0, policy_version 716868 (0.00098) [2022-07-10 12:07:50,768][26022] Updated weights on worker 0-0, policy_version 716878 (0.00090) [2022-07-10 12:07:52,564][26022] Updated weights on worker 0-0, policy_version 716888 (0.00094) [2022-07-10 12:07:53,557][25689] Fps is (10 sec: 5377.3, 60 sec: 5539.0, 300 sec: 5524.2). Total num frames: 734098432. Throughput: 0: 5765.3. Samples: 734103824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:53,558][25689] Avg episode reward: [(0, '-1.432')] [2022-07-10 12:07:54,243][26022] Updated weights on worker 0-0, policy_version 716898 (0.00094) [2022-07-10 12:07:56,219][26022] Updated weights on worker 0-0, policy_version 716908 (0.00449) [2022-07-10 12:07:57,833][26022] Updated weights on worker 0-0, policy_version 716918 (0.00088) [2022-07-10 12:07:58,603][25689] Fps is (10 sec: 5599.7, 60 sec: 5541.3, 300 sec: 5532.5). Total num frames: 734127104. Throughput: 0: 4972.3. Samples: 734120780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:07:58,604][25689] Avg episode reward: [(0, '-2.393')] [2022-07-10 12:07:59,815][26022] Updated weights on worker 0-0, policy_version 716928 (0.00087) [2022-07-10 12:08:01,694][26022] Updated weights on worker 0-0, policy_version 716938 (0.00090) [2022-07-10 12:08:03,726][25689] Fps is (10 sec: 5438.0, 60 sec: 5542.2, 300 sec: 5530.4). Total num frames: 734153728. Throughput: 0: 5806.3. Samples: 734154392. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:08:03,726][25689] Avg episode reward: [(0, '-2.821')] [2022-07-10 12:08:03,818][26022] Updated weights on worker 0-0, policy_version 716948 (0.00093) [2022-07-10 12:08:05,551][26022] Updated weights on worker 0-0, policy_version 716958 (0.00087) [2022-07-10 12:08:07,489][26022] Updated weights on worker 0-0, policy_version 716968 (0.00085) [2022-07-10 12:08:08,728][25689] Fps is (10 sec: 5461.3, 60 sec: 5547.8, 300 sec: 5537.6). Total num frames: 734182400. Throughput: 0: 5730.0. Samples: 734186218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:08:08,729][25689] Avg episode reward: [(0, '-2.395')] [2022-07-10 12:08:09,223][26022] Updated weights on worker 0-0, policy_version 716978 (0.00082) [2022-07-10 12:08:11,339][26022] Updated weights on worker 0-0, policy_version 716988 (0.00097) [2022-07-10 12:08:13,002][26022] Updated weights on worker 0-0, policy_version 716998 (0.00096) [2022-07-10 12:08:13,819][25689] Fps is (10 sec: 5579.8, 60 sec: 5543.7, 300 sec: 5533.0). Total num frames: 734210048. Throughput: 0: 5728.8. Samples: 734219900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:08:13,819][25689] Avg episode reward: [(0, '-2.133')] [2022-07-10 12:08:14,724][26022] Updated weights on worker 0-0, policy_version 717008 (0.00086) [2022-07-10 12:08:16,631][26022] Updated weights on worker 0-0, policy_version 717018 (0.00087) [2022-07-10 12:08:18,173][26022] Updated weights on worker 0-0, policy_version 717028 (0.00087) [2022-07-10 12:08:18,877][25689] Fps is (10 sec: 5650.6, 60 sec: 5557.8, 300 sec: 5536.7). Total num frames: 734239744. Throughput: 0: 5726.5. Samples: 734236876. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-10 12:08:18,877][25689] Avg episode reward: [(0, '-1.999')] [2022-07-10 12:08:20,371][26022] Updated weights on worker 0-0, policy_version 717038 (0.00086) [2022-07-10 12:08:21,885][26022] Updated weights on worker 0-0, policy_version 717048 (0.00095) [2022-07-10 12:08:23,877][26022] Updated weights on worker 0-0, policy_version 717058 (0.00086) [2022-07-10 12:08:23,975][25689] Fps is (10 sec: 5646.5, 60 sec: 5538.6, 300 sec: 5531.5). Total num frames: 734267392. Throughput: 0: 5739.8. Samples: 734270618. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:23,975][25689] Avg episode reward: [(0, '-1.790')] [2022-07-10 12:08:25,579][26022] Updated weights on worker 0-0, policy_version 717068 (0.00087) [2022-07-10 12:08:27,477][26022] Updated weights on worker 0-0, policy_version 717078 (0.00092) [2022-07-10 12:08:29,018][25689] Fps is (10 sec: 5553.4, 60 sec: 5540.7, 300 sec: 5537.9). Total num frames: 734296064. Throughput: 0: 5822.3. Samples: 734304352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:29,019][25689] Avg episode reward: [(0, '-0.763')] [2022-07-10 12:08:29,348][26022] Updated weights on worker 0-0, policy_version 717088 (0.00089) [2022-07-10 12:08:30,665][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:08:30,677][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000717095_734305280.pth [2022-07-10 12:08:30,677][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000715147_732310528.pth [2022-07-10 12:08:31,209][26022] Updated weights on worker 0-0, policy_version 717098 (0.00093) [2022-07-10 12:08:32,819][26022] Updated weights on worker 0-0, policy_version 717108 (0.00084) [2022-07-10 12:08:34,052][25689] Fps is (10 sec: 5487.5, 60 sec: 5545.7, 300 sec: 5530.8). Total num frames: 734322688. Throughput: 0: 5006.9. Samples: 734321202. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:34,054][25689] Avg episode reward: [(0, '-1.661')] [2022-07-10 12:08:34,633][26022] Updated weights on worker 0-0, policy_version 717118 (0.00088) [2022-07-10 12:08:36,619][26022] Updated weights on worker 0-0, policy_version 717128 (0.00085) [2022-07-10 12:08:38,332][26022] Updated weights on worker 0-0, policy_version 717138 (0.00085) [2022-07-10 12:08:39,068][25689] Fps is (10 sec: 5705.9, 60 sec: 5583.9, 300 sec: 5541.7). Total num frames: 734353408. Throughput: 0: 5847.0. Samples: 734354936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:39,069][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 12:08:40,344][26022] Updated weights on worker 0-0, policy_version 717148 (0.00086) [2022-07-10 12:08:42,059][26022] Updated weights on worker 0-0, policy_version 717158 (0.00081) [2022-07-10 12:08:43,992][26022] Updated weights on worker 0-0, policy_version 717168 (0.00090) [2022-07-10 12:08:44,179][25689] Fps is (10 sec: 5763.5, 60 sec: 5548.9, 300 sec: 5540.0). Total num frames: 734381056. Throughput: 0: 5839.3. Samples: 734388596. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:44,179][25689] Avg episode reward: [(0, '-0.915')] [2022-07-10 12:08:45,892][26022] Updated weights on worker 0-0, policy_version 717178 (0.00091) [2022-07-10 12:08:47,608][26022] Updated weights on worker 0-0, policy_version 717188 (0.00094) [2022-07-10 12:08:49,194][25689] Fps is (10 sec: 5460.7, 60 sec: 5571.8, 300 sec: 5539.9). Total num frames: 734408704. Throughput: 0: 5000.8. Samples: 734405250. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:49,195][25689] Avg episode reward: [(0, '-2.459')] [2022-07-10 12:08:49,610][26022] Updated weights on worker 0-0, policy_version 717198 (0.00099) [2022-07-10 12:08:51,554][26022] Updated weights on worker 0-0, policy_version 717208 (0.00089) [2022-07-10 12:08:53,234][26022] Updated weights on worker 0-0, policy_version 717218 (0.00086) [2022-07-10 12:08:54,203][25689] Fps is (10 sec: 5516.4, 60 sec: 5572.0, 300 sec: 5540.0). Total num frames: 734436352. Throughput: 0: 5799.2. Samples: 734438062. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:54,203][25689] Avg episode reward: [(0, '-2.768')] [2022-07-10 12:08:55,139][26022] Updated weights on worker 0-0, policy_version 717228 (0.00086) [2022-07-10 12:08:56,969][26022] Updated weights on worker 0-0, policy_version 717238 (0.00091) [2022-07-10 12:08:58,801][26022] Updated weights on worker 0-0, policy_version 717248 (0.00095) [2022-07-10 12:08:59,208][25689] Fps is (10 sec: 5624.3, 60 sec: 5575.8, 300 sec: 5542.3). Total num frames: 734465024. Throughput: 0: 5777.6. Samples: 734471296. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:08:59,209][25689] Avg episode reward: [(0, '-3.636')] [2022-07-10 12:09:00,879][26022] Updated weights on worker 0-0, policy_version 717258 (0.00088) [2022-07-10 12:09:02,725][26022] Updated weights on worker 0-0, policy_version 717268 (0.00083) [2022-07-10 12:09:04,350][25689] Fps is (10 sec: 5146.4, 60 sec: 5523.3, 300 sec: 5533.1). Total num frames: 734488576. Throughput: 0: 4919.1. Samples: 734487822. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:04,351][25689] Avg episode reward: [(0, '-3.249')] [2022-07-10 12:09:04,864][26022] Updated weights on worker 0-0, policy_version 717278 (0.00085) [2022-07-10 12:09:06,415][26022] Updated weights on worker 0-0, policy_version 717288 (0.00082) [2022-07-10 12:09:08,617][26022] Updated weights on worker 0-0, policy_version 717298 (0.00085) [2022-07-10 12:09:09,368][25689] Fps is (10 sec: 5241.1, 60 sec: 5538.9, 300 sec: 5536.9). Total num frames: 734518272. Throughput: 0: 5646.2. Samples: 734519152. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:09,368][25689] Avg episode reward: [(0, '-3.146')] [2022-07-10 12:09:10,316][26022] Updated weights on worker 0-0, policy_version 717308 (0.00088) [2022-07-10 12:09:12,215][26022] Updated weights on worker 0-0, policy_version 717318 (0.00087) [2022-07-10 12:09:14,104][26022] Updated weights on worker 0-0, policy_version 717328 (0.00086) [2022-07-10 12:09:14,373][25689] Fps is (10 sec: 5721.5, 60 sec: 5546.7, 300 sec: 5537.1). Total num frames: 734545920. Throughput: 0: 5670.9. Samples: 734552446. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:14,373][25689] Avg episode reward: [(0, '-2.684')] [2022-07-10 12:09:15,893][26022] Updated weights on worker 0-0, policy_version 717338 (0.00088) [2022-07-10 12:09:17,546][26022] Updated weights on worker 0-0, policy_version 717348 (0.00088) [2022-07-10 12:09:19,380][25689] Fps is (10 sec: 5420.5, 60 sec: 5500.5, 300 sec: 5532.6). Total num frames: 734572544. Throughput: 0: 4867.9. Samples: 734569492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:19,382][25689] Avg episode reward: [(0, '-1.560')] [2022-07-10 12:09:19,758][26022] Updated weights on worker 0-0, policy_version 717358 (0.00087) [2022-07-10 12:09:21,105][26022] Updated weights on worker 0-0, policy_version 717368 (0.00083) [2022-07-10 12:09:23,392][26022] Updated weights on worker 0-0, policy_version 717378 (0.00086) [2022-07-10 12:09:24,450][25689] Fps is (10 sec: 5589.0, 60 sec: 5537.0, 300 sec: 5538.4). Total num frames: 734602240. Throughput: 0: 5717.1. Samples: 734602732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:24,450][25689] Avg episode reward: [(0, '-1.691')] [2022-07-10 12:09:24,742][26022] Updated weights on worker 0-0, policy_version 717388 (0.00086) [2022-07-10 12:09:26,916][26022] Updated weights on worker 0-0, policy_version 717398 (0.00084) [2022-07-10 12:09:28,707][26022] Updated weights on worker 0-0, policy_version 717408 (0.00092) [2022-07-10 12:09:29,492][25689] Fps is (10 sec: 5569.4, 60 sec: 5503.2, 300 sec: 5535.5). Total num frames: 734628864. Throughput: 0: 5812.9. Samples: 734636134. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:29,493][25689] Avg episode reward: [(0, '0.012')] [2022-07-10 12:09:30,403][26022] Updated weights on worker 0-0, policy_version 717418 (0.00086) [2022-07-10 12:09:32,393][26022] Updated weights on worker 0-0, policy_version 717428 (0.00086) [2022-07-10 12:09:34,085][26022] Updated weights on worker 0-0, policy_version 717438 (0.00083) [2022-07-10 12:09:34,496][25689] Fps is (10 sec: 5606.2, 60 sec: 5556.7, 300 sec: 5539.2). Total num frames: 734658560. Throughput: 0: 4989.5. Samples: 734652850. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:34,497][25689] Avg episode reward: [(0, '0.412')] [2022-07-10 12:09:36,157][26022] Updated weights on worker 0-0, policy_version 717448 (0.00088) [2022-07-10 12:09:37,793][26022] Updated weights on worker 0-0, policy_version 717458 (0.00085) [2022-07-10 12:09:39,507][25689] Fps is (10 sec: 5828.5, 60 sec: 5523.4, 300 sec: 5547.3). Total num frames: 734687232. Throughput: 0: 5823.8. Samples: 734686704. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:39,507][25689] Avg episode reward: [(0, '-0.533')] [2022-07-10 12:09:39,516][26022] Updated weights on worker 0-0, policy_version 717468 (0.00083) [2022-07-10 12:09:41,616][26022] Updated weights on worker 0-0, policy_version 717478 (0.00099) [2022-07-10 12:09:43,310][26022] Updated weights on worker 0-0, policy_version 717488 (0.00095) [2022-07-10 12:09:44,574][25689] Fps is (10 sec: 5588.4, 60 sec: 5527.3, 300 sec: 5539.2). Total num frames: 734714880. Throughput: 0: 5832.8. Samples: 734720112. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:44,575][25689] Avg episode reward: [(0, '-0.477')] [2022-07-10 12:09:45,110][26022] Updated weights on worker 0-0, policy_version 717498 (0.00089) [2022-07-10 12:09:46,765][26022] Updated weights on worker 0-0, policy_version 717508 (0.00080) [2022-07-10 12:09:48,962][26022] Updated weights on worker 0-0, policy_version 717518 (0.00090) [2022-07-10 12:09:49,613][25689] Fps is (10 sec: 5471.2, 60 sec: 5525.2, 300 sec: 5541.9). Total num frames: 734742528. Throughput: 0: 5017.0. Samples: 734737080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:49,614][25689] Avg episode reward: [(0, '-0.301')] [2022-07-10 12:09:50,460][26022] Updated weights on worker 0-0, policy_version 717528 (0.00089) [2022-07-10 12:09:52,542][26022] Updated weights on worker 0-0, policy_version 717538 (0.00083) [2022-07-10 12:09:54,125][26022] Updated weights on worker 0-0, policy_version 717548 (0.00060) [2022-07-10 12:09:54,696][25689] Fps is (10 sec: 5564.3, 60 sec: 5535.3, 300 sec: 5540.5). Total num frames: 734771200. Throughput: 0: 5853.2. Samples: 734771084. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:54,698][25689] Avg episode reward: [(0, '-1.213')] [2022-07-10 12:09:56,147][26022] Updated weights on worker 0-0, policy_version 717558 (0.00087) [2022-07-10 12:09:57,898][26022] Updated weights on worker 0-0, policy_version 717568 (0.00082) [2022-07-10 12:09:59,531][26022] Updated weights on worker 0-0, policy_version 717578 (0.00089) [2022-07-10 12:09:59,707][25689] Fps is (10 sec: 5681.4, 60 sec: 5534.8, 300 sec: 5544.8). Total num frames: 734799872. Throughput: 0: 5852.1. Samples: 734804918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:09:59,707][25689] Avg episode reward: [(0, '-2.555')] [2022-07-10 12:10:01,664][26022] Updated weights on worker 0-0, policy_version 717588 (0.00093) [2022-07-10 12:10:03,467][26022] Updated weights on worker 0-0, policy_version 717598 (0.00088) [2022-07-10 12:10:04,753][25689] Fps is (10 sec: 5497.9, 60 sec: 5594.4, 300 sec: 5544.6). Total num frames: 734826496. Throughput: 0: 4968.5. Samples: 734820374. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:04,754][25689] Avg episode reward: [(0, '-2.721')] [2022-07-10 12:10:05,557][26022] Updated weights on worker 0-0, policy_version 717608 (0.00085) [2022-07-10 12:10:07,436][26022] Updated weights on worker 0-0, policy_version 717618 (0.00092) [2022-07-10 12:10:09,173][26022] Updated weights on worker 0-0, policy_version 717628 (0.00084) [2022-07-10 12:10:09,795][25689] Fps is (10 sec: 5379.7, 60 sec: 5558.3, 300 sec: 5540.5). Total num frames: 734854144. Throughput: 0: 5756.9. Samples: 734853264. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:09,795][25689] Avg episode reward: [(0, '-2.428')] [2022-07-10 12:10:11,260][26022] Updated weights on worker 0-0, policy_version 717638 (0.00082) [2022-07-10 12:10:12,728][26022] Updated weights on worker 0-0, policy_version 717648 (0.00091) [2022-07-10 12:10:14,677][26022] Updated weights on worker 0-0, policy_version 717658 (0.00084) [2022-07-10 12:10:14,879][25689] Fps is (10 sec: 5461.0, 60 sec: 5551.0, 300 sec: 5546.0). Total num frames: 734881792. Throughput: 0: 5735.5. Samples: 734886846. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:14,880][25689] Avg episode reward: [(0, '-2.510')] [2022-07-10 12:10:16,521][26022] Updated weights on worker 0-0, policy_version 717668 (0.00083) [2022-07-10 12:10:18,383][26022] Updated weights on worker 0-0, policy_version 717678 (0.00084) [2022-07-10 12:10:19,907][25689] Fps is (10 sec: 5670.9, 60 sec: 5599.9, 300 sec: 5546.3). Total num frames: 734911488. Throughput: 0: 4896.7. Samples: 734903832. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:19,907][25689] Avg episode reward: [(0, '-2.593')] [2022-07-10 12:10:20,162][26022] Updated weights on worker 0-0, policy_version 717688 (0.01031) [2022-07-10 12:10:22,018][26022] Updated weights on worker 0-0, policy_version 717698 (0.00082) [2022-07-10 12:10:23,627][26022] Updated weights on worker 0-0, policy_version 717708 (0.00088) [2022-07-10 12:10:24,982][25689] Fps is (10 sec: 5574.6, 60 sec: 5548.7, 300 sec: 5538.2). Total num frames: 734938112. Throughput: 0: 5775.6. Samples: 734937206. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:24,983][25689] Avg episode reward: [(0, '-3.353')] [2022-07-10 12:10:25,719][26022] Updated weights on worker 0-0, policy_version 717718 (0.00096) [2022-07-10 12:10:27,515][26022] Updated weights on worker 0-0, policy_version 717728 (0.00083) [2022-07-10 12:10:29,491][26022] Updated weights on worker 0-0, policy_version 717738 (0.00089) [2022-07-10 12:10:30,026][25689] Fps is (10 sec: 5464.5, 60 sec: 5582.3, 300 sec: 5545.1). Total num frames: 734966784. Throughput: 0: 5806.7. Samples: 734970740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:30,027][25689] Avg episode reward: [(0, '-2.008')] [2022-07-10 12:10:30,682][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:10:30,691][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000717746_734971904.pth [2022-07-10 12:10:30,692][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000715794_732973056.pth [2022-07-10 12:10:31,260][26022] Updated weights on worker 0-0, policy_version 717748 (0.00084) [2022-07-10 12:10:33,079][26022] Updated weights on worker 0-0, policy_version 717758 (0.00129) [2022-07-10 12:10:34,788][26022] Updated weights on worker 0-0, policy_version 717768 (0.00095) [2022-07-10 12:10:35,049][25689] Fps is (10 sec: 5594.4, 60 sec: 5546.7, 300 sec: 5544.9). Total num frames: 734994432. Throughput: 0: 5832.6. Samples: 735004492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:35,050][25689] Avg episode reward: [(0, '-1.722')] [2022-07-10 12:10:36,759][26022] Updated weights on worker 0-0, policy_version 717778 (0.00089) [2022-07-10 12:10:38,475][26022] Updated weights on worker 0-0, policy_version 717788 (0.00098) [2022-07-10 12:10:40,051][25689] Fps is (10 sec: 5515.8, 60 sec: 5530.6, 300 sec: 5540.4). Total num frames: 735022080. Throughput: 0: 5818.3. Samples: 735021038. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:40,052][25689] Avg episode reward: [(0, '-1.877')] [2022-07-10 12:10:40,434][26022] Updated weights on worker 0-0, policy_version 717798 (0.00052) [2022-07-10 12:10:42,417][26022] Updated weights on worker 0-0, policy_version 717808 (0.00085) [2022-07-10 12:10:44,079][26022] Updated weights on worker 0-0, policy_version 717818 (0.00085) [2022-07-10 12:10:45,187][25689] Fps is (10 sec: 5555.6, 60 sec: 5541.3, 300 sec: 5548.7). Total num frames: 735050752. Throughput: 0: 5777.1. Samples: 735053932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:45,187][25689] Avg episode reward: [(0, '-1.084')] [2022-07-10 12:10:46,048][26022] Updated weights on worker 0-0, policy_version 717828 (0.00097) [2022-07-10 12:10:47,741][26022] Updated weights on worker 0-0, policy_version 717838 (0.00090) [2022-07-10 12:10:49,900][26022] Updated weights on worker 0-0, policy_version 717848 (0.00090) [2022-07-10 12:10:50,217][25689] Fps is (10 sec: 5641.0, 60 sec: 5559.0, 300 sec: 5542.4). Total num frames: 735079424. Throughput: 0: 5769.7. Samples: 735087234. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:50,217][25689] Avg episode reward: [(0, '-0.937')] [2022-07-10 12:10:51,605][26022] Updated weights on worker 0-0, policy_version 717858 (0.00086) [2022-07-10 12:10:53,413][26022] Updated weights on worker 0-0, policy_version 717868 (0.00079) [2022-07-10 12:10:55,183][26022] Updated weights on worker 0-0, policy_version 717878 (0.00083) [2022-07-10 12:10:55,243][25689] Fps is (10 sec: 5600.7, 60 sec: 5547.3, 300 sec: 5543.0). Total num frames: 735107072. Throughput: 0: 4924.2. Samples: 735103928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:10:55,243][25689] Avg episode reward: [(0, '-0.949')] [2022-07-10 12:10:57,031][26022] Updated weights on worker 0-0, policy_version 717888 (0.00085) [2022-07-10 12:10:58,938][26022] Updated weights on worker 0-0, policy_version 717898 (0.00083) [2022-07-10 12:11:00,283][25689] Fps is (10 sec: 5493.3, 60 sec: 5527.7, 300 sec: 5551.5). Total num frames: 735134720. Throughput: 0: 5766.7. Samples: 735137708. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:00,283][25689] Avg episode reward: [(0, '-1.320')] [2022-07-10 12:11:00,838][26022] Updated weights on worker 0-0, policy_version 717908 (0.00086) [2022-07-10 12:11:02,903][26022] Updated weights on worker 0-0, policy_version 717918 (0.00089) [2022-07-10 12:11:04,773][26022] Updated weights on worker 0-0, policy_version 717928 (0.00092) [2022-07-10 12:11:05,386][25689] Fps is (10 sec: 5451.3, 60 sec: 5539.4, 300 sec: 5546.8). Total num frames: 735162368. Throughput: 0: 5696.7. Samples: 735169002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:05,387][25689] Avg episode reward: [(0, '-1.786')] [2022-07-10 12:11:06,610][26022] Updated weights on worker 0-0, policy_version 717938 (0.00095) [2022-07-10 12:11:08,386][26022] Updated weights on worker 0-0, policy_version 717948 (0.00091) [2022-07-10 12:11:10,195][26022] Updated weights on worker 0-0, policy_version 717958 (0.00087) [2022-07-10 12:11:10,396][25689] Fps is (10 sec: 5467.5, 60 sec: 5542.3, 300 sec: 5547.0). Total num frames: 735190016. Throughput: 0: 4890.3. Samples: 735185918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:10,397][25689] Avg episode reward: [(0, '-1.774')] [2022-07-10 12:11:11,964][26022] Updated weights on worker 0-0, policy_version 717968 (0.00083) [2022-07-10 12:11:13,740][26022] Updated weights on worker 0-0, policy_version 717978 (0.00092) [2022-07-10 12:11:15,452][25689] Fps is (10 sec: 5391.7, 60 sec: 5528.0, 300 sec: 5544.2). Total num frames: 735216640. Throughput: 0: 5731.4. Samples: 735219756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:15,454][25689] Avg episode reward: [(0, '-2.369')] [2022-07-10 12:11:15,759][26022] Updated weights on worker 0-0, policy_version 717988 (0.00080) [2022-07-10 12:11:17,283][26022] Updated weights on worker 0-0, policy_version 717998 (0.00085) [2022-07-10 12:11:19,391][26022] Updated weights on worker 0-0, policy_version 718008 (0.00086) [2022-07-10 12:11:20,483][25689] Fps is (10 sec: 5685.2, 60 sec: 5544.6, 300 sec: 5549.4). Total num frames: 735247360. Throughput: 0: 5730.8. Samples: 735253470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:20,483][25689] Avg episode reward: [(0, '-2.130')] [2022-07-10 12:11:20,823][26022] Updated weights on worker 0-0, policy_version 718018 (0.00087) [2022-07-10 12:11:23,103][26022] Updated weights on worker 0-0, policy_version 718028 (0.00086) [2022-07-10 12:11:24,868][26022] Updated weights on worker 0-0, policy_version 718038 (0.00091) [2022-07-10 12:11:25,597][25689] Fps is (10 sec: 5652.5, 60 sec: 5541.1, 300 sec: 5544.0). Total num frames: 735273984. Throughput: 0: 5008.7. Samples: 735270232. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:25,598][25689] Avg episode reward: [(0, '-2.230')] [2022-07-10 12:11:26,699][26022] Updated weights on worker 0-0, policy_version 718048 (0.00093) [2022-07-10 12:11:28,648][26022] Updated weights on worker 0-0, policy_version 718058 (0.00095) [2022-07-10 12:11:30,456][26022] Updated weights on worker 0-0, policy_version 718068 (0.00090) [2022-07-10 12:11:30,629][25689] Fps is (10 sec: 5450.2, 60 sec: 5542.2, 300 sec: 5544.0). Total num frames: 735302656. Throughput: 0: 5811.8. Samples: 735303504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:30,629][25689] Avg episode reward: [(0, '-2.152')] [2022-07-10 12:11:32,356][26022] Updated weights on worker 0-0, policy_version 718078 (0.00083) [2022-07-10 12:11:34,139][26022] Updated weights on worker 0-0, policy_version 718088 (0.00088) [2022-07-10 12:11:35,675][25689] Fps is (10 sec: 5689.9, 60 sec: 5556.9, 300 sec: 5548.5). Total num frames: 735331328. Throughput: 0: 5799.7. Samples: 735337044. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:35,676][25689] Avg episode reward: [(0, '-1.750')] [2022-07-10 12:11:35,827][26022] Updated weights on worker 0-0, policy_version 718098 (0.00084) [2022-07-10 12:11:37,757][26022] Updated weights on worker 0-0, policy_version 718108 (0.00079) [2022-07-10 12:11:39,432][26022] Updated weights on worker 0-0, policy_version 718118 (0.00091) [2022-07-10 12:11:40,686][25689] Fps is (10 sec: 5599.7, 60 sec: 5556.1, 300 sec: 5549.1). Total num frames: 735358976. Throughput: 0: 4963.5. Samples: 735353750. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 12:11:40,687][25689] Avg episode reward: [(0, '-2.110')] [2022-07-10 12:11:41,627][26022] Updated weights on worker 0-0, policy_version 718128 (0.00085) [2022-07-10 12:11:43,313][26022] Updated weights on worker 0-0, policy_version 718138 (0.00092) [2022-07-10 12:11:45,078][26022] Updated weights on worker 0-0, policy_version 718148 (0.00084) [2022-07-10 12:11:45,793][25689] Fps is (10 sec: 5465.2, 60 sec: 5541.9, 300 sec: 5544.3). Total num frames: 735386624. Throughput: 0: 5772.9. Samples: 735386822. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:11:45,794][25689] Avg episode reward: [(0, '-2.604')] [2022-07-10 12:11:47,029][26022] Updated weights on worker 0-0, policy_version 718158 (0.00084) [2022-07-10 12:11:48,825][26022] Updated weights on worker 0-0, policy_version 718168 (0.00087) [2022-07-10 12:11:50,570][26022] Updated weights on worker 0-0, policy_version 718178 (0.00088) [2022-07-10 12:11:50,806][25689] Fps is (10 sec: 5666.6, 60 sec: 5560.3, 300 sec: 5551.6). Total num frames: 735416320. Throughput: 0: 5800.3. Samples: 735420538. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:11:50,806][25689] Avg episode reward: [(0, '-1.387')] [2022-07-10 12:11:52,519][26022] Updated weights on worker 0-0, policy_version 718188 (0.00084) [2022-07-10 12:11:54,277][26022] Updated weights on worker 0-0, policy_version 718198 (0.00086) [2022-07-10 12:11:55,838][25689] Fps is (10 sec: 5607.0, 60 sec: 5542.9, 300 sec: 5545.5). Total num frames: 735442944. Throughput: 0: 4981.2. Samples: 735437478. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:11:55,839][25689] Avg episode reward: [(0, '-1.566')] [2022-07-10 12:11:56,243][26022] Updated weights on worker 0-0, policy_version 718208 (0.00087) [2022-07-10 12:11:57,979][26022] Updated weights on worker 0-0, policy_version 718218 (0.00087) [2022-07-10 12:11:59,757][26022] Updated weights on worker 0-0, policy_version 718228 (0.00086) [2022-07-10 12:12:00,916][25689] Fps is (10 sec: 5368.0, 60 sec: 5539.4, 300 sec: 5549.9). Total num frames: 735470592. Throughput: 0: 5801.8. Samples: 735471120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:00,917][25689] Avg episode reward: [(0, '-1.634')] [2022-07-10 12:12:01,875][26022] Updated weights on worker 0-0, policy_version 718238 (0.00098) [2022-07-10 12:12:03,870][26022] Updated weights on worker 0-0, policy_version 718248 (0.00084) [2022-07-10 12:12:05,628][26022] Updated weights on worker 0-0, policy_version 718258 (0.00090) [2022-07-10 12:12:06,006][25689] Fps is (10 sec: 5437.9, 60 sec: 5540.6, 300 sec: 5546.0). Total num frames: 735498240. Throughput: 0: 5721.5. Samples: 735502472. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:06,007][25689] Avg episode reward: [(0, '-1.124')] [2022-07-10 12:12:07,505][26022] Updated weights on worker 0-0, policy_version 718268 (0.00091) [2022-07-10 12:12:09,425][26022] Updated weights on worker 0-0, policy_version 718278 (0.00089) [2022-07-10 12:12:11,031][25689] Fps is (10 sec: 5466.8, 60 sec: 5539.3, 300 sec: 5546.4). Total num frames: 735525888. Throughput: 0: 4875.6. Samples: 735519148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:11,031][25689] Avg episode reward: [(0, '-0.784')] [2022-07-10 12:12:11,121][26022] Updated weights on worker 0-0, policy_version 718288 (0.00095) [2022-07-10 12:12:13,160][26022] Updated weights on worker 0-0, policy_version 718298 (0.00086) [2022-07-10 12:12:14,691][26022] Updated weights on worker 0-0, policy_version 718308 (0.00087) [2022-07-10 12:12:16,139][25689] Fps is (10 sec: 5356.3, 60 sec: 5534.5, 300 sec: 5538.1). Total num frames: 735552512. Throughput: 0: 5693.7. Samples: 735553064. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:16,141][25689] Avg episode reward: [(0, '-0.045')] [2022-07-10 12:12:16,761][26022] Updated weights on worker 0-0, policy_version 718318 (0.00089) [2022-07-10 12:12:18,296][26022] Updated weights on worker 0-0, policy_version 718328 (0.00086) [2022-07-10 12:12:20,347][26022] Updated weights on worker 0-0, policy_version 718338 (0.00085) [2022-07-10 12:12:21,157][25689] Fps is (10 sec: 5663.0, 60 sec: 5535.7, 300 sec: 5545.9). Total num frames: 735583232. Throughput: 0: 5706.8. Samples: 735586630. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:21,157][25689] Avg episode reward: [(0, '-0.392')] [2022-07-10 12:12:22,145][26022] Updated weights on worker 0-0, policy_version 718348 (0.00087) [2022-07-10 12:12:23,716][26022] Updated weights on worker 0-0, policy_version 718358 (0.00087) [2022-07-10 12:12:26,007][26022] Updated weights on worker 0-0, policy_version 718368 (0.00083) [2022-07-10 12:12:26,255][25689] Fps is (10 sec: 5668.2, 60 sec: 5537.1, 300 sec: 5538.5). Total num frames: 735609856. Throughput: 0: 5814.5. Samples: 735620210. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:26,256][25689] Avg episode reward: [(0, '-1.593')] [2022-07-10 12:12:27,343][26022] Updated weights on worker 0-0, policy_version 718378 (0.00078) [2022-07-10 12:12:29,491][26022] Updated weights on worker 0-0, policy_version 718388 (0.00098) [2022-07-10 12:12:30,801][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:12:30,817][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000718396_735637504.pth [2022-07-10 12:12:30,818][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000716443_733637632.pth [2022-07-10 12:12:31,270][25689] Fps is (10 sec: 5467.7, 60 sec: 5538.6, 300 sec: 5546.7). Total num frames: 735638528. Throughput: 0: 5833.5. Samples: 735637214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:31,272][25689] Avg episode reward: [(0, '-2.211')] [2022-07-10 12:12:31,352][26022] Updated weights on worker 0-0, policy_version 718398 (0.00090) [2022-07-10 12:12:33,026][26022] Updated weights on worker 0-0, policy_version 718408 (0.00087) [2022-07-10 12:12:35,039][26022] Updated weights on worker 0-0, policy_version 718418 (0.00087) [2022-07-10 12:12:36,280][25689] Fps is (10 sec: 5822.4, 60 sec: 5558.9, 300 sec: 5551.2). Total num frames: 735668224. Throughput: 0: 5857.2. Samples: 735671036. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:36,281][25689] Avg episode reward: [(0, '-1.842')] [2022-07-10 12:12:36,625][26022] Updated weights on worker 0-0, policy_version 718428 (0.00087) [2022-07-10 12:12:38,581][26022] Updated weights on worker 0-0, policy_version 718438 (0.00084) [2022-07-10 12:12:40,556][26022] Updated weights on worker 0-0, policy_version 718448 (0.00048) [2022-07-10 12:12:41,307][25689] Fps is (10 sec: 5509.6, 60 sec: 5523.7, 300 sec: 5538.7). Total num frames: 735693824. Throughput: 0: 5854.8. Samples: 735704602. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:41,307][25689] Avg episode reward: [(0, '-1.951')] [2022-07-10 12:12:42,163][26022] Updated weights on worker 0-0, policy_version 718458 (0.00082) [2022-07-10 12:12:44,291][26022] Updated weights on worker 0-0, policy_version 718468 (0.00087) [2022-07-10 12:12:45,634][26022] Updated weights on worker 0-0, policy_version 718478 (0.00095) [2022-07-10 12:12:46,360][25689] Fps is (10 sec: 5587.4, 60 sec: 5579.3, 300 sec: 5553.0). Total num frames: 735724544. Throughput: 0: 5032.9. Samples: 735721394. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:46,360][25689] Avg episode reward: [(0, '-2.203')] [2022-07-10 12:12:47,980][26022] Updated weights on worker 0-0, policy_version 718488 (0.00085) [2022-07-10 12:12:49,474][26022] Updated weights on worker 0-0, policy_version 718498 (0.00086) [2022-07-10 12:12:51,415][25689] Fps is (10 sec: 5673.1, 60 sec: 5524.7, 300 sec: 5548.7). Total num frames: 735751168. Throughput: 0: 5832.9. Samples: 735754714. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:51,415][25689] Avg episode reward: [(0, '-2.300')] [2022-07-10 12:12:51,460][26022] Updated weights on worker 0-0, policy_version 718508 (0.00088) [2022-07-10 12:12:53,222][26022] Updated weights on worker 0-0, policy_version 718518 (0.00089) [2022-07-10 12:12:55,206][26022] Updated weights on worker 0-0, policy_version 718528 (0.00083) [2022-07-10 12:12:56,428][25689] Fps is (10 sec: 5492.0, 60 sec: 5560.2, 300 sec: 5549.4). Total num frames: 735779840. Throughput: 0: 5833.4. Samples: 735788568. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:12:56,429][25689] Avg episode reward: [(0, '-2.316')] [2022-07-10 12:12:56,859][26022] Updated weights on worker 0-0, policy_version 718538 (0.00096) [2022-07-10 12:12:59,031][26022] Updated weights on worker 0-0, policy_version 718548 (0.00095) [2022-07-10 12:13:00,364][26022] Updated weights on worker 0-0, policy_version 718558 (0.00089) [2022-07-10 12:13:01,454][25689] Fps is (10 sec: 5609.7, 60 sec: 5565.0, 300 sec: 5554.6). Total num frames: 735807488. Throughput: 0: 5005.4. Samples: 735805450. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:01,455][25689] Avg episode reward: [(0, '-2.569')] [2022-07-10 12:13:02,922][26022] Updated weights on worker 0-0, policy_version 718568 (0.00084) [2022-07-10 12:13:04,614][26022] Updated weights on worker 0-0, policy_version 718578 (0.00096) [2022-07-10 12:13:06,383][26022] Updated weights on worker 0-0, policy_version 718588 (0.00091) [2022-07-10 12:13:06,488][25689] Fps is (10 sec: 5394.9, 60 sec: 5553.3, 300 sec: 5547.1). Total num frames: 735834112. Throughput: 0: 5743.7. Samples: 735837004. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:06,489][25689] Avg episode reward: [(0, '-3.628')] [2022-07-10 12:13:08,244][26022] Updated weights on worker 0-0, policy_version 718598 (0.00087) [2022-07-10 12:13:09,996][26022] Updated weights on worker 0-0, policy_version 718608 (0.00084) [2022-07-10 12:13:11,491][25689] Fps is (10 sec: 5407.6, 60 sec: 5555.3, 300 sec: 5548.8). Total num frames: 735861760. Throughput: 0: 5776.9. Samples: 735870692. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:11,491][25689] Avg episode reward: [(0, '-3.875')] [2022-07-10 12:13:11,726][26022] Updated weights on worker 0-0, policy_version 718618 (0.00083) [2022-07-10 12:13:13,795][26022] Updated weights on worker 0-0, policy_version 718628 (0.00095) [2022-07-10 12:13:15,420][26022] Updated weights on worker 0-0, policy_version 718638 (0.00092) [2022-07-10 12:13:16,497][25689] Fps is (10 sec: 5627.1, 60 sec: 5598.6, 300 sec: 5546.3). Total num frames: 735890432. Throughput: 0: 4946.0. Samples: 735887826. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:16,497][25689] Avg episode reward: [(0, '-5.067')] [2022-07-10 12:13:17,495][26022] Updated weights on worker 0-0, policy_version 718648 (0.00083) [2022-07-10 12:13:19,295][26022] Updated weights on worker 0-0, policy_version 718658 (0.00094) [2022-07-10 12:13:21,058][26022] Updated weights on worker 0-0, policy_version 718668 (0.00084) [2022-07-10 12:13:21,514][25689] Fps is (10 sec: 5721.1, 60 sec: 5564.8, 300 sec: 5551.2). Total num frames: 735919104. Throughput: 0: 5770.7. Samples: 735921208. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:21,515][25689] Avg episode reward: [(0, '-4.793')] [2022-07-10 12:13:23,092][26022] Updated weights on worker 0-0, policy_version 718678 (0.00095) [2022-07-10 12:13:24,763][26022] Updated weights on worker 0-0, policy_version 718688 (0.00087) [2022-07-10 12:13:26,570][25689] Fps is (10 sec: 5489.3, 60 sec: 5568.6, 300 sec: 5544.1). Total num frames: 735945728. Throughput: 0: 5842.2. Samples: 735954328. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:26,571][25689] Avg episode reward: [(0, '-5.229')] [2022-07-10 12:13:26,650][26022] Updated weights on worker 0-0, policy_version 718698 (0.00087) [2022-07-10 12:13:28,614][26022] Updated weights on worker 0-0, policy_version 718708 (0.00093) [2022-07-10 12:13:30,117][26022] Updated weights on worker 0-0, policy_version 718718 (0.00090) [2022-07-10 12:13:31,573][25689] Fps is (10 sec: 5496.9, 60 sec: 5569.7, 300 sec: 5551.6). Total num frames: 735974400. Throughput: 0: 5010.3. Samples: 735971312. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:31,574][25689] Avg episode reward: [(0, '-5.618')] [2022-07-10 12:13:32,133][26022] Updated weights on worker 0-0, policy_version 718728 (0.00085) [2022-07-10 12:13:33,861][26022] Updated weights on worker 0-0, policy_version 718738 (0.00096) [2022-07-10 12:13:35,716][26022] Updated weights on worker 0-0, policy_version 718748 (0.00081) [2022-07-10 12:13:36,577][25689] Fps is (10 sec: 5628.0, 60 sec: 5536.3, 300 sec: 5541.5). Total num frames: 736002048. Throughput: 0: 5838.5. Samples: 736005066. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:36,579][25689] Avg episode reward: [(0, '-4.396')] [2022-07-10 12:13:37,513][26022] Updated weights on worker 0-0, policy_version 718758 (0.00085) [2022-07-10 12:13:39,264][26022] Updated weights on worker 0-0, policy_version 718768 (0.00088) [2022-07-10 12:13:41,159][26022] Updated weights on worker 0-0, policy_version 718778 (0.00084) [2022-07-10 12:13:41,593][25689] Fps is (10 sec: 5620.7, 60 sec: 5588.2, 300 sec: 5546.7). Total num frames: 736030720. Throughput: 0: 5850.7. Samples: 736038688. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:41,594][25689] Avg episode reward: [(0, '-4.073')] [2022-07-10 12:13:43,047][26022] Updated weights on worker 0-0, policy_version 718788 (0.00086) [2022-07-10 12:13:44,781][26022] Updated weights on worker 0-0, policy_version 718798 (0.00091) [2022-07-10 12:13:46,692][25689] Fps is (10 sec: 5568.0, 60 sec: 5533.1, 300 sec: 5545.1). Total num frames: 736058368. Throughput: 0: 5037.5. Samples: 736055694. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:46,693][25689] Avg episode reward: [(0, '-3.824')] [2022-07-10 12:13:46,788][26022] Updated weights on worker 0-0, policy_version 718808 (0.00083) [2022-07-10 12:13:48,337][26022] Updated weights on worker 0-0, policy_version 718818 (0.00085) [2022-07-10 12:13:50,219][26022] Updated weights on worker 0-0, policy_version 718828 (0.00088) [2022-07-10 12:13:51,730][25689] Fps is (10 sec: 5657.2, 60 sec: 5585.6, 300 sec: 5551.5). Total num frames: 736088064. Throughput: 0: 5883.8. Samples: 736089908. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:51,730][25689] Avg episode reward: [(0, '-4.648')] [2022-07-10 12:13:52,039][26022] Updated weights on worker 0-0, policy_version 718838 (0.00086) [2022-07-10 12:13:54,161][26022] Updated weights on worker 0-0, policy_version 718848 (0.00099) [2022-07-10 12:13:55,715][26022] Updated weights on worker 0-0, policy_version 718858 (0.00085) [2022-07-10 12:13:56,776][25689] Fps is (10 sec: 5686.9, 60 sec: 5565.6, 300 sec: 5547.3). Total num frames: 736115712. Throughput: 0: 5835.5. Samples: 736122934. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:13:56,776][25689] Avg episode reward: [(0, '-3.440')] [2022-07-10 12:13:57,843][26022] Updated weights on worker 0-0, policy_version 718868 (0.00086) [2022-07-10 12:13:59,463][26022] Updated weights on worker 0-0, policy_version 718878 (0.00085) [2022-07-10 12:14:01,547][26022] Updated weights on worker 0-0, policy_version 718888 (0.00081) [2022-07-10 12:14:01,825][25689] Fps is (10 sec: 5477.3, 60 sec: 5563.5, 300 sec: 5562.8). Total num frames: 736143360. Throughput: 0: 4992.4. Samples: 736139694. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:01,826][25689] Avg episode reward: [(0, '-3.465')] [2022-07-10 12:14:03,463][26022] Updated weights on worker 0-0, policy_version 718898 (0.00090) [2022-07-10 12:14:05,635][26022] Updated weights on worker 0-0, policy_version 718908 (0.00092) [2022-07-10 12:14:06,919][25689] Fps is (10 sec: 5451.7, 60 sec: 5574.9, 300 sec: 5554.5). Total num frames: 736171008. Throughput: 0: 5700.6. Samples: 736170996. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:06,919][25689] Avg episode reward: [(0, '-3.723')] [2022-07-10 12:14:07,254][26022] Updated weights on worker 0-0, policy_version 718918 (0.00089) [2022-07-10 12:14:09,297][26022] Updated weights on worker 0-0, policy_version 718928 (0.00084) [2022-07-10 12:14:10,768][26022] Updated weights on worker 0-0, policy_version 718938 (0.00094) [2022-07-10 12:14:11,938][25689] Fps is (10 sec: 5265.5, 60 sec: 5539.5, 300 sec: 5547.3). Total num frames: 736196608. Throughput: 0: 5644.1. Samples: 736203964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:11,939][25689] Avg episode reward: [(0, '-3.844')] [2022-07-10 12:14:12,956][26022] Updated weights on worker 0-0, policy_version 718948 (0.00094) [2022-07-10 12:14:14,725][26022] Updated weights on worker 0-0, policy_version 718958 (0.00091) [2022-07-10 12:14:16,716][26022] Updated weights on worker 0-0, policy_version 718968 (0.00094) [2022-07-10 12:14:16,962][25689] Fps is (10 sec: 5301.9, 60 sec: 5520.9, 300 sec: 5550.4). Total num frames: 736224256. Throughput: 0: 4838.7. Samples: 736220608. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:16,962][25689] Avg episode reward: [(0, '-3.518')] [2022-07-10 12:14:18,463][26022] Updated weights on worker 0-0, policy_version 718978 (0.00089) [2022-07-10 12:14:20,325][26022] Updated weights on worker 0-0, policy_version 718988 (0.00091) [2022-07-10 12:14:21,975][25689] Fps is (10 sec: 5611.0, 60 sec: 5521.3, 300 sec: 5548.1). Total num frames: 736252928. Throughput: 0: 5665.6. Samples: 736253856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:21,976][25689] Avg episode reward: [(0, '-1.791')] [2022-07-10 12:14:22,070][26022] Updated weights on worker 0-0, policy_version 718998 (0.00099) [2022-07-10 12:14:24,210][26022] Updated weights on worker 0-0, policy_version 719008 (0.00089) [2022-07-10 12:14:25,688][26022] Updated weights on worker 0-0, policy_version 719018 (0.00093) [2022-07-10 12:14:27,106][25689] Fps is (10 sec: 5552.0, 60 sec: 5531.4, 300 sec: 5549.9). Total num frames: 736280576. Throughput: 0: 5751.4. Samples: 736287100. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:27,107][25689] Avg episode reward: [(0, '-1.680')] [2022-07-10 12:14:27,632][26022] Updated weights on worker 0-0, policy_version 719028 (0.00091) [2022-07-10 12:14:29,607][26022] Updated weights on worker 0-0, policy_version 719038 (0.00097) [2022-07-10 12:14:30,824][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:14:30,845][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000719045_736302080.pth [2022-07-10 12:14:30,846][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000717095_734305280.pth [2022-07-10 12:14:31,299][26022] Updated weights on worker 0-0, policy_version 719048 (0.00089) [2022-07-10 12:14:32,111][25689] Fps is (10 sec: 5556.9, 60 sec: 5531.3, 300 sec: 5546.4). Total num frames: 736309248. Throughput: 0: 4940.2. Samples: 736303620. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:32,111][25689] Avg episode reward: [(0, '-3.519')] [2022-07-10 12:14:33,274][26022] Updated weights on worker 0-0, policy_version 719058 (0.00084) [2022-07-10 12:14:34,994][26022] Updated weights on worker 0-0, policy_version 719068 (0.00088) [2022-07-10 12:14:36,843][26022] Updated weights on worker 0-0, policy_version 719078 (0.00091) [2022-07-10 12:14:37,148][25689] Fps is (10 sec: 5710.2, 60 sec: 5545.1, 300 sec: 5545.9). Total num frames: 736337920. Throughput: 0: 5777.2. Samples: 736337230. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:37,149][25689] Avg episode reward: [(0, '-2.753')] [2022-07-10 12:14:38,851][26022] Updated weights on worker 0-0, policy_version 719088 (0.00092) [2022-07-10 12:14:40,318][26022] Updated weights on worker 0-0, policy_version 719098 (0.00092) [2022-07-10 12:14:42,160][25689] Fps is (10 sec: 5400.5, 60 sec: 5494.8, 300 sec: 5540.0). Total num frames: 736363520. Throughput: 0: 5792.0. Samples: 736370766. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:42,161][25689] Avg episode reward: [(0, '-2.226')] [2022-07-10 12:14:42,528][26022] Updated weights on worker 0-0, policy_version 719108 (0.00089) [2022-07-10 12:14:44,314][26022] Updated weights on worker 0-0, policy_version 719118 (0.00086) [2022-07-10 12:14:46,136][26022] Updated weights on worker 0-0, policy_version 719128 (0.00088) [2022-07-10 12:14:47,224][25689] Fps is (10 sec: 5488.0, 60 sec: 5531.8, 300 sec: 5546.5). Total num frames: 736393216. Throughput: 0: 4992.5. Samples: 736387540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:47,226][25689] Avg episode reward: [(0, '-1.617')] [2022-07-10 12:14:48,010][26022] Updated weights on worker 0-0, policy_version 719138 (0.00083) [2022-07-10 12:14:49,645][26022] Updated weights on worker 0-0, policy_version 719148 (0.00088) [2022-07-10 12:14:51,555][26022] Updated weights on worker 0-0, policy_version 719158 (0.00086) [2022-07-10 12:14:52,228][25689] Fps is (10 sec: 5593.9, 60 sec: 5484.1, 300 sec: 5541.0). Total num frames: 736419840. Throughput: 0: 5831.8. Samples: 736420942. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:52,230][25689] Avg episode reward: [(0, '-1.725')] [2022-07-10 12:14:53,295][26022] Updated weights on worker 0-0, policy_version 719168 (0.00085) [2022-07-10 12:14:55,108][26022] Updated weights on worker 0-0, policy_version 719178 (0.00084) [2022-07-10 12:14:56,974][26022] Updated weights on worker 0-0, policy_version 719188 (0.00090) [2022-07-10 12:14:57,246][25689] Fps is (10 sec: 5619.7, 60 sec: 5520.5, 300 sec: 5544.4). Total num frames: 736449536. Throughput: 0: 5840.6. Samples: 736454612. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:14:57,247][25689] Avg episode reward: [(0, '0.204')] [2022-07-10 12:14:58,856][26022] Updated weights on worker 0-0, policy_version 719198 (0.00088) [2022-07-10 12:15:00,603][26022] Updated weights on worker 0-0, policy_version 719208 (0.00084) [2022-07-10 12:15:02,283][25689] Fps is (10 sec: 5601.4, 60 sec: 5504.7, 300 sec: 5544.5). Total num frames: 736476160. Throughput: 0: 5009.1. Samples: 736471562. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 12:15:02,284][25689] Avg episode reward: [(0, '-0.360')] [2022-07-10 12:15:03,027][26022] Updated weights on worker 0-0, policy_version 719218 (0.00079) [2022-07-10 12:15:04,505][26022] Updated weights on worker 0-0, policy_version 719228 (0.00087) [2022-07-10 12:15:06,619][26022] Updated weights on worker 0-0, policy_version 719238 (0.00085) [2022-07-10 12:15:07,415][25689] Fps is (10 sec: 5336.9, 60 sec: 5501.1, 300 sec: 5542.8). Total num frames: 736503808. Throughput: 0: 5722.2. Samples: 736503078. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:07,416][25689] Avg episode reward: [(0, '-0.609')] [2022-07-10 12:15:08,360][26022] Updated weights on worker 0-0, policy_version 719248 (0.00096) [2022-07-10 12:15:10,326][26022] Updated weights on worker 0-0, policy_version 719258 (0.01149) [2022-07-10 12:15:12,032][26022] Updated weights on worker 0-0, policy_version 719268 (0.00093) [2022-07-10 12:15:12,434][25689] Fps is (10 sec: 5648.9, 60 sec: 5568.9, 300 sec: 5550.9). Total num frames: 736533504. Throughput: 0: 5736.4. Samples: 736536852. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:12,435][25689] Avg episode reward: [(0, '-0.610')] [2022-07-10 12:15:13,859][26022] Updated weights on worker 0-0, policy_version 719278 (0.00098) [2022-07-10 12:15:15,586][26022] Updated weights on worker 0-0, policy_version 719288 (0.00092) [2022-07-10 12:15:17,477][25689] Fps is (10 sec: 5597.6, 60 sec: 5550.2, 300 sec: 5540.3). Total num frames: 736560128. Throughput: 0: 4888.2. Samples: 736553504. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:17,477][25689] Avg episode reward: [(0, '-0.656')] [2022-07-10 12:15:17,719][26022] Updated weights on worker 0-0, policy_version 719298 (0.00098) [2022-07-10 12:15:19,264][26022] Updated weights on worker 0-0, policy_version 719308 (0.00081) [2022-07-10 12:15:21,071][26022] Updated weights on worker 0-0, policy_version 719318 (0.00087) [2022-07-10 12:15:22,488][25689] Fps is (10 sec: 5601.7, 60 sec: 5567.4, 300 sec: 5551.9). Total num frames: 736589824. Throughput: 0: 5736.7. Samples: 736587474. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:22,489][25689] Avg episode reward: [(0, '-1.491')] [2022-07-10 12:15:22,814][26022] Updated weights on worker 0-0, policy_version 719328 (0.00086) [2022-07-10 12:15:24,622][26022] Updated weights on worker 0-0, policy_version 719338 (0.00090) [2022-07-10 12:15:26,704][26022] Updated weights on worker 0-0, policy_version 719348 (0.00090) [2022-07-10 12:15:27,557][25689] Fps is (10 sec: 5688.9, 60 sec: 5573.1, 300 sec: 5548.0). Total num frames: 736617472. Throughput: 0: 5862.1. Samples: 736621148. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:27,557][25689] Avg episode reward: [(0, '-1.645')] [2022-07-10 12:15:28,392][26022] Updated weights on worker 0-0, policy_version 719358 (0.00095) [2022-07-10 12:15:30,310][26022] Updated weights on worker 0-0, policy_version 719368 (0.00087) [2022-07-10 12:15:32,192][26022] Updated weights on worker 0-0, policy_version 719378 (0.00091) [2022-07-10 12:15:32,583][25689] Fps is (10 sec: 5477.8, 60 sec: 5554.2, 300 sec: 5547.9). Total num frames: 736645120. Throughput: 0: 5847.3. Samples: 736654666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:32,583][25689] Avg episode reward: [(0, '-1.802')] [2022-07-10 12:15:33,849][26022] Updated weights on worker 0-0, policy_version 719388 (0.00086) [2022-07-10 12:15:35,856][26022] Updated weights on worker 0-0, policy_version 719398 (0.00089) [2022-07-10 12:15:37,408][26022] Updated weights on worker 0-0, policy_version 719408 (0.00095) [2022-07-10 12:15:37,604][25689] Fps is (10 sec: 5707.3, 60 sec: 5572.6, 300 sec: 5554.4). Total num frames: 736674816. Throughput: 0: 5866.5. Samples: 736671582. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:37,605][25689] Avg episode reward: [(0, '-2.381')] [2022-07-10 12:15:39,559][26022] Updated weights on worker 0-0, policy_version 719418 (0.00089) [2022-07-10 12:15:40,956][26022] Updated weights on worker 0-0, policy_version 719428 (0.00083) [2022-07-10 12:15:42,627][25689] Fps is (10 sec: 5505.4, 60 sec: 5571.6, 300 sec: 5546.2). Total num frames: 736700416. Throughput: 0: 5851.6. Samples: 736705316. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:42,627][25689] Avg episode reward: [(0, '-4.602')] [2022-07-10 12:15:43,253][26022] Updated weights on worker 0-0, policy_version 719438 (0.00093) [2022-07-10 12:15:44,757][26022] Updated weights on worker 0-0, policy_version 719448 (0.00094) [2022-07-10 12:15:46,749][26022] Updated weights on worker 0-0, policy_version 719458 (0.00086) [2022-07-10 12:15:47,689][25689] Fps is (10 sec: 5381.7, 60 sec: 5554.9, 300 sec: 5545.6). Total num frames: 736729088. Throughput: 0: 5841.1. Samples: 736738742. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:47,689][25689] Avg episode reward: [(0, '-4.730')] [2022-07-10 12:15:48,495][26022] Updated weights on worker 0-0, policy_version 719468 (0.00087) [2022-07-10 12:15:50,522][26022] Updated weights on worker 0-0, policy_version 719478 (0.00081) [2022-07-10 12:15:52,207][26022] Updated weights on worker 0-0, policy_version 719488 (0.00056) [2022-07-10 12:15:52,755][25689] Fps is (10 sec: 5661.9, 60 sec: 5583.0, 300 sec: 5548.3). Total num frames: 736757760. Throughput: 0: 4988.8. Samples: 736755300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:52,755][25689] Avg episode reward: [(0, '-3.132')] [2022-07-10 12:15:54,087][26022] Updated weights on worker 0-0, policy_version 719498 (0.00087) [2022-07-10 12:15:55,743][26022] Updated weights on worker 0-0, policy_version 719508 (0.00093) [2022-07-10 12:15:57,752][26022] Updated weights on worker 0-0, policy_version 719518 (0.00099) [2022-07-10 12:15:57,777][25689] Fps is (10 sec: 5684.3, 60 sec: 5565.7, 300 sec: 5552.1). Total num frames: 736786432. Throughput: 0: 5821.7. Samples: 736789022. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:15:57,777][25689] Avg episode reward: [(0, '-3.675')] [2022-07-10 12:15:59,393][26022] Updated weights on worker 0-0, policy_version 719528 (0.00092) [2022-07-10 12:16:01,418][26022] Updated weights on worker 0-0, policy_version 719538 (0.00087) [2022-07-10 12:16:02,792][25689] Fps is (10 sec: 5406.9, 60 sec: 5550.7, 300 sec: 5546.8). Total num frames: 736812032. Throughput: 0: 5744.2. Samples: 736821152. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:02,793][25689] Avg episode reward: [(0, '-4.595')] [2022-07-10 12:16:03,515][26022] Updated weights on worker 0-0, policy_version 719548 (0.00091) [2022-07-10 12:16:05,513][26022] Updated weights on worker 0-0, policy_version 719558 (0.00088) [2022-07-10 12:16:07,000][26022] Updated weights on worker 0-0, policy_version 719568 (0.00086) [2022-07-10 12:16:07,869][25689] Fps is (10 sec: 5479.0, 60 sec: 5589.7, 300 sec: 5552.5). Total num frames: 736841728. Throughput: 0: 4916.3. Samples: 736837956. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:07,870][25689] Avg episode reward: [(0, '-3.105')] [2022-07-10 12:16:09,063][26022] Updated weights on worker 0-0, policy_version 719578 (0.00084) [2022-07-10 12:16:10,819][26022] Updated weights on worker 0-0, policy_version 719588 (0.00099) [2022-07-10 12:16:12,607][26022] Updated weights on worker 0-0, policy_version 719598 (0.00084) [2022-07-10 12:16:12,928][25689] Fps is (10 sec: 5657.4, 60 sec: 5552.1, 300 sec: 5555.9). Total num frames: 736869376. Throughput: 0: 5762.3. Samples: 736871546. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:12,929][25689] Avg episode reward: [(0, '-2.297')] [2022-07-10 12:16:14,496][26022] Updated weights on worker 0-0, policy_version 719608 (0.00088) [2022-07-10 12:16:16,344][26022] Updated weights on worker 0-0, policy_version 719618 (0.00098) [2022-07-10 12:16:17,992][25689] Fps is (10 sec: 5462.3, 60 sec: 5567.1, 300 sec: 5544.9). Total num frames: 736897024. Throughput: 0: 5739.7. Samples: 736905054. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:17,994][25689] Avg episode reward: [(0, '-2.224')] [2022-07-10 12:16:18,144][26022] Updated weights on worker 0-0, policy_version 719628 (0.00088) [2022-07-10 12:16:20,152][26022] Updated weights on worker 0-0, policy_version 719638 (0.00097) [2022-07-10 12:16:21,778][26022] Updated weights on worker 0-0, policy_version 719648 (0.00088) [2022-07-10 12:16:23,044][25689] Fps is (10 sec: 5567.5, 60 sec: 5546.5, 300 sec: 5553.0). Total num frames: 736925696. Throughput: 0: 4957.9. Samples: 736921560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:23,046][25689] Avg episode reward: [(0, '-2.350')] [2022-07-10 12:16:23,833][26022] Updated weights on worker 0-0, policy_version 719658 (0.00086) [2022-07-10 12:16:25,462][26022] Updated weights on worker 0-0, policy_version 719668 (0.00090) [2022-07-10 12:16:27,520][26022] Updated weights on worker 0-0, policy_version 719678 (0.00086) [2022-07-10 12:16:28,080][25689] Fps is (10 sec: 5583.1, 60 sec: 5549.5, 300 sec: 5549.4). Total num frames: 736953344. Throughput: 0: 5795.9. Samples: 736955096. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:28,081][25689] Avg episode reward: [(0, '-2.518')] [2022-07-10 12:16:29,291][26022] Updated weights on worker 0-0, policy_version 719688 (0.00089) [2022-07-10 12:16:30,861][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:16:30,879][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000719697_736969728.pth [2022-07-10 12:16:30,880][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000717746_734971904.pth [2022-07-10 12:16:31,162][26022] Updated weights on worker 0-0, policy_version 719698 (0.00094) [2022-07-10 12:16:33,003][26022] Updated weights on worker 0-0, policy_version 719708 (0.00086) [2022-07-10 12:16:33,098][25689] Fps is (10 sec: 5499.9, 60 sec: 5550.2, 300 sec: 5546.5). Total num frames: 736980992. Throughput: 0: 5788.8. Samples: 736988306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:33,099][25689] Avg episode reward: [(0, '-2.779')] [2022-07-10 12:16:34,767][26022] Updated weights on worker 0-0, policy_version 719718 (0.00088) [2022-07-10 12:16:36,624][26022] Updated weights on worker 0-0, policy_version 719728 (0.00084) [2022-07-10 12:16:38,106][25689] Fps is (10 sec: 5617.4, 60 sec: 5534.5, 300 sec: 5550.0). Total num frames: 737009664. Throughput: 0: 4965.0. Samples: 737004918. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:38,106][25689] Avg episode reward: [(0, '-3.747')] [2022-07-10 12:16:38,494][26022] Updated weights on worker 0-0, policy_version 719738 (0.00100) [2022-07-10 12:16:40,214][26022] Updated weights on worker 0-0, policy_version 719748 (0.00093) [2022-07-10 12:16:42,278][26022] Updated weights on worker 0-0, policy_version 719758 (0.00091) [2022-07-10 12:16:43,109][25689] Fps is (10 sec: 5523.8, 60 sec: 5553.2, 300 sec: 5548.5). Total num frames: 737036288. Throughput: 0: 5839.3. Samples: 737038722. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:43,109][25689] Avg episode reward: [(0, '-3.402')] [2022-07-10 12:16:43,760][26022] Updated weights on worker 0-0, policy_version 719768 (0.00082) [2022-07-10 12:16:45,899][26022] Updated weights on worker 0-0, policy_version 719778 (0.00095) [2022-07-10 12:16:47,419][26022] Updated weights on worker 0-0, policy_version 719788 (0.00087) [2022-07-10 12:16:48,187][25689] Fps is (10 sec: 5586.9, 60 sec: 5568.7, 300 sec: 5547.3). Total num frames: 737065984. Throughput: 0: 5825.4. Samples: 737072226. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:48,187][25689] Avg episode reward: [(0, '-2.773')] [2022-07-10 12:16:49,608][26022] Updated weights on worker 0-0, policy_version 719798 (0.00395) [2022-07-10 12:16:51,172][26022] Updated weights on worker 0-0, policy_version 719808 (0.00089) [2022-07-10 12:16:53,156][26022] Updated weights on worker 0-0, policy_version 719818 (0.00084) [2022-07-10 12:16:53,214][25689] Fps is (10 sec: 5674.5, 60 sec: 5555.3, 300 sec: 5550.8). Total num frames: 737093632. Throughput: 0: 5005.4. Samples: 737088994. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:53,215][25689] Avg episode reward: [(0, '-1.990')] [2022-07-10 12:16:54,927][26022] Updated weights on worker 0-0, policy_version 719828 (0.00092) [2022-07-10 12:16:56,772][26022] Updated weights on worker 0-0, policy_version 719838 (0.00085) [2022-07-10 12:16:58,230][25689] Fps is (10 sec: 5505.8, 60 sec: 5539.0, 300 sec: 5552.0). Total num frames: 737121280. Throughput: 0: 5830.6. Samples: 737122254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:16:58,231][25689] Avg episode reward: [(0, '-3.152')] [2022-07-10 12:16:58,660][26022] Updated weights on worker 0-0, policy_version 719848 (0.00085) [2022-07-10 12:17:00,520][26022] Updated weights on worker 0-0, policy_version 719858 (0.00090) [2022-07-10 12:17:02,797][26022] Updated weights on worker 0-0, policy_version 719868 (0.00085) [2022-07-10 12:17:03,259][25689] Fps is (10 sec: 5301.0, 60 sec: 5537.7, 300 sec: 5546.3). Total num frames: 737146880. Throughput: 0: 5708.6. Samples: 737153752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:03,260][25689] Avg episode reward: [(0, '-3.993')] [2022-07-10 12:17:04,475][26022] Updated weights on worker 0-0, policy_version 719878 (0.00202) [2022-07-10 12:17:06,577][26022] Updated weights on worker 0-0, policy_version 719888 (0.00087) [2022-07-10 12:17:08,254][26022] Updated weights on worker 0-0, policy_version 719898 (0.00091) [2022-07-10 12:17:08,388][25689] Fps is (10 sec: 5342.6, 60 sec: 5516.0, 300 sec: 5547.8). Total num frames: 737175552. Throughput: 0: 4863.2. Samples: 737170470. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:08,389][25689] Avg episode reward: [(0, '-3.957')] [2022-07-10 12:17:10,196][26022] Updated weights on worker 0-0, policy_version 719908 (0.00085) [2022-07-10 12:17:12,024][26022] Updated weights on worker 0-0, policy_version 719918 (0.00110) [2022-07-10 12:17:13,397][25689] Fps is (10 sec: 5555.5, 60 sec: 5520.6, 300 sec: 5553.0). Total num frames: 737203200. Throughput: 0: 5680.8. Samples: 737203646. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:13,399][25689] Avg episode reward: [(0, '-5.559')] [2022-07-10 12:17:13,680][26022] Updated weights on worker 0-0, policy_version 719928 (0.00090) [2022-07-10 12:17:15,627][26022] Updated weights on worker 0-0, policy_version 719938 (0.00090) [2022-07-10 12:17:17,555][26022] Updated weights on worker 0-0, policy_version 719948 (0.00085) [2022-07-10 12:17:18,412][25689] Fps is (10 sec: 5618.8, 60 sec: 5542.1, 300 sec: 5546.2). Total num frames: 737231872. Throughput: 0: 5703.3. Samples: 737237358. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:18,414][25689] Avg episode reward: [(0, '-7.919')] [2022-07-10 12:17:19,242][26022] Updated weights on worker 0-0, policy_version 719958 (0.00090) [2022-07-10 12:17:21,199][26022] Updated weights on worker 0-0, policy_version 719968 (0.00074) [2022-07-10 12:17:22,839][26022] Updated weights on worker 0-0, policy_version 719978 (0.00088) [2022-07-10 12:17:23,431][25689] Fps is (10 sec: 5715.0, 60 sec: 5545.1, 300 sec: 5554.6). Total num frames: 737260544. Throughput: 0: 4966.5. Samples: 737253934. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:23,431][25689] Avg episode reward: [(0, '-7.885')] [2022-07-10 12:17:24,833][26022] Updated weights on worker 0-0, policy_version 719988 (0.00086) [2022-07-10 12:17:26,350][26022] Updated weights on worker 0-0, policy_version 719998 (0.00087) [2022-07-10 12:17:28,447][26022] Updated weights on worker 0-0, policy_version 720008 (0.00085) [2022-07-10 12:17:28,542][25689] Fps is (10 sec: 5559.7, 60 sec: 5538.2, 300 sec: 5549.4). Total num frames: 737288192. Throughput: 0: 5826.0. Samples: 737287884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:28,542][25689] Avg episode reward: [(0, '-6.075')] [2022-07-10 12:17:30,467][26022] Updated weights on worker 0-0, policy_version 720018 (0.00089) [2022-07-10 12:17:32,019][26022] Updated weights on worker 0-0, policy_version 720028 (0.00084) [2022-07-10 12:17:33,617][25689] Fps is (10 sec: 5529.3, 60 sec: 5549.9, 300 sec: 5544.7). Total num frames: 737316864. Throughput: 0: 5840.9. Samples: 737321748. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:33,619][25689] Avg episode reward: [(0, '-6.456')] [2022-07-10 12:17:33,970][26022] Updated weights on worker 0-0, policy_version 720038 (0.00084) [2022-07-10 12:17:35,511][26022] Updated weights on worker 0-0, policy_version 720048 (0.00086) [2022-07-10 12:17:37,680][26022] Updated weights on worker 0-0, policy_version 720058 (0.00817) [2022-07-10 12:17:38,695][25689] Fps is (10 sec: 5647.7, 60 sec: 5543.4, 300 sec: 5554.1). Total num frames: 737345536. Throughput: 0: 4990.6. Samples: 737338586. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:38,697][25689] Avg episode reward: [(0, '-4.843')] [2022-07-10 12:17:39,163][26022] Updated weights on worker 0-0, policy_version 720068 (0.00100) [2022-07-10 12:17:41,315][26022] Updated weights on worker 0-0, policy_version 720078 (0.00086) [2022-07-10 12:17:43,028][26022] Updated weights on worker 0-0, policy_version 720088 (0.00092) [2022-07-10 12:17:43,702][25689] Fps is (10 sec: 5686.1, 60 sec: 5576.9, 300 sec: 5548.0). Total num frames: 737374208. Throughput: 0: 5838.8. Samples: 737372290. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:43,702][25689] Avg episode reward: [(0, '-3.557')] [2022-07-10 12:17:44,924][26022] Updated weights on worker 0-0, policy_version 720098 (0.00089) [2022-07-10 12:17:46,505][26022] Updated weights on worker 0-0, policy_version 720108 (0.00095) [2022-07-10 12:17:48,623][26022] Updated weights on worker 0-0, policy_version 720118 (0.00097) [2022-07-10 12:17:48,753][25689] Fps is (10 sec: 5600.0, 60 sec: 5545.6, 300 sec: 5551.6). Total num frames: 737401856. Throughput: 0: 5844.2. Samples: 737406000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:48,753][25689] Avg episode reward: [(0, '-2.642')] [2022-07-10 12:17:50,447][26022] Updated weights on worker 0-0, policy_version 720128 (0.00092) [2022-07-10 12:17:52,255][26022] Updated weights on worker 0-0, policy_version 720138 (0.00087) [2022-07-10 12:17:53,767][25689] Fps is (10 sec: 5493.9, 60 sec: 5546.8, 300 sec: 5548.1). Total num frames: 737429504. Throughput: 0: 5001.9. Samples: 737422536. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:53,767][25689] Avg episode reward: [(0, '-2.520')] [2022-07-10 12:17:54,203][26022] Updated weights on worker 0-0, policy_version 720148 (0.00086) [2022-07-10 12:17:55,859][26022] Updated weights on worker 0-0, policy_version 720158 (0.00092) [2022-07-10 12:17:57,811][26022] Updated weights on worker 0-0, policy_version 720168 (0.00085) [2022-07-10 12:17:58,862][25689] Fps is (10 sec: 5571.0, 60 sec: 5556.4, 300 sec: 5550.3). Total num frames: 737458176. Throughput: 0: 5821.4. Samples: 737455984. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:17:58,863][25689] Avg episode reward: [(0, '-2.332')] [2022-07-10 12:17:59,576][26022] Updated weights on worker 0-0, policy_version 720178 (0.00084) [2022-07-10 12:18:01,338][26022] Updated weights on worker 0-0, policy_version 720188 (0.00094) [2022-07-10 12:18:03,799][26022] Updated weights on worker 0-0, policy_version 720198 (0.00092) [2022-07-10 12:18:03,911][25689] Fps is (10 sec: 5249.2, 60 sec: 5537.7, 300 sec: 5543.1). Total num frames: 737482752. Throughput: 0: 5707.8. Samples: 737487638. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:18:03,911][25689] Avg episode reward: [(0, '-2.856')] [2022-07-10 12:18:05,192][26022] Updated weights on worker 0-0, policy_version 720208 (0.00103) [2022-07-10 12:18:07,382][26022] Updated weights on worker 0-0, policy_version 720218 (0.00090) [2022-07-10 12:18:09,001][25689] Fps is (10 sec: 5352.9, 60 sec: 5558.2, 300 sec: 5548.3). Total num frames: 737512448. Throughput: 0: 5695.2. Samples: 737521318. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:18:09,001][25689] Avg episode reward: [(0, '-3.510')] [2022-07-10 12:18:09,092][26022] Updated weights on worker 0-0, policy_version 720228 (0.00092) [2022-07-10 12:18:10,892][26022] Updated weights on worker 0-0, policy_version 720238 (0.00086) [2022-07-10 12:18:12,903][26022] Updated weights on worker 0-0, policy_version 720248 (0.00101) [2022-07-10 12:18:14,016][25689] Fps is (10 sec: 5775.8, 60 sec: 5574.4, 300 sec: 5548.2). Total num frames: 737541120. Throughput: 0: 5713.6. Samples: 737538234. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:18:14,017][25689] Avg episode reward: [(0, '-4.102')] [2022-07-10 12:18:14,552][26022] Updated weights on worker 0-0, policy_version 720258 (0.00092) [2022-07-10 12:18:16,350][26022] Updated weights on worker 0-0, policy_version 720268 (0.00117) [2022-07-10 12:18:18,395][26022] Updated weights on worker 0-0, policy_version 720278 (0.00087) [2022-07-10 12:18:19,035][25689] Fps is (10 sec: 5510.9, 60 sec: 5540.3, 300 sec: 5541.3). Total num frames: 737567744. Throughput: 0: 5725.6. Samples: 737571484. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:18:19,035][25689] Avg episode reward: [(0, '-4.624')] [2022-07-10 12:18:19,847][26022] Updated weights on worker 0-0, policy_version 720288 (0.00092) [2022-07-10 12:18:21,999][26022] Updated weights on worker 0-0, policy_version 720298 (0.00089) [2022-07-10 12:18:23,853][26022] Updated weights on worker 0-0, policy_version 720308 (0.00086) [2022-07-10 12:18:24,044][25689] Fps is (10 sec: 5412.1, 60 sec: 5524.3, 300 sec: 5545.6). Total num frames: 737595392. Throughput: 0: 5827.9. Samples: 737604974. Policy #0 lag: (min: 0.0, avg: 9.1, max: 18.0) [2022-07-10 12:18:24,045][25689] Avg episode reward: [(0, '-4.662')] [2022-07-10 12:18:25,542][26022] Updated weights on worker 0-0, policy_version 720318 (0.00089) [2022-07-10 12:18:27,721][26022] Updated weights on worker 0-0, policy_version 720328 (0.00092) [2022-07-10 12:18:29,133][25689] Fps is (10 sec: 5678.3, 60 sec: 5560.1, 300 sec: 5547.4). Total num frames: 737625088. Throughput: 0: 4970.6. Samples: 737621390. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:29,134][25689] Avg episode reward: [(0, '-3.365')] [2022-07-10 12:18:29,292][26022] Updated weights on worker 0-0, policy_version 720338 (0.00096) [2022-07-10 12:18:31,067][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:18:31,077][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000720346_737634304.pth [2022-07-10 12:18:31,079][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000718396_735637504.pth [2022-07-10 12:18:31,311][26022] Updated weights on worker 0-0, policy_version 720348 (0.00087) [2022-07-10 12:18:33,019][26022] Updated weights on worker 0-0, policy_version 720358 (0.00092) [2022-07-10 12:18:34,143][25689] Fps is (10 sec: 5475.6, 60 sec: 5515.4, 300 sec: 5540.4). Total num frames: 737650688. Throughput: 0: 5781.0. Samples: 737654586. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:34,143][25689] Avg episode reward: [(0, '-2.569')] [2022-07-10 12:18:34,829][26022] Updated weights on worker 0-0, policy_version 720368 (0.00087) [2022-07-10 12:18:36,838][26022] Updated weights on worker 0-0, policy_version 720378 (0.00094) [2022-07-10 12:18:38,507][26022] Updated weights on worker 0-0, policy_version 720388 (0.00088) [2022-07-10 12:18:39,147][25689] Fps is (10 sec: 5521.9, 60 sec: 5539.1, 300 sec: 5544.1). Total num frames: 737680384. Throughput: 0: 5808.8. Samples: 737688316. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:39,148][25689] Avg episode reward: [(0, '-3.116')] [2022-07-10 12:18:40,398][26022] Updated weights on worker 0-0, policy_version 720398 (0.00089) [2022-07-10 12:18:42,315][26022] Updated weights on worker 0-0, policy_version 720408 (0.00086) [2022-07-10 12:18:43,980][26022] Updated weights on worker 0-0, policy_version 720418 (0.00093) [2022-07-10 12:18:44,167][25689] Fps is (10 sec: 5720.4, 60 sec: 5520.9, 300 sec: 5545.6). Total num frames: 737708032. Throughput: 0: 4980.7. Samples: 737705202. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:44,168][25689] Avg episode reward: [(0, '-3.067')] [2022-07-10 12:18:45,984][26022] Updated weights on worker 0-0, policy_version 720428 (0.00086) [2022-07-10 12:18:47,843][26022] Updated weights on worker 0-0, policy_version 720438 (0.00097) [2022-07-10 12:18:49,212][25689] Fps is (10 sec: 5493.9, 60 sec: 5521.4, 300 sec: 5538.5). Total num frames: 737735680. Throughput: 0: 5828.7. Samples: 737738424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:49,213][25689] Avg episode reward: [(0, '-2.366')] [2022-07-10 12:18:49,583][26022] Updated weights on worker 0-0, policy_version 720448 (0.00088) [2022-07-10 12:18:51,694][26022] Updated weights on worker 0-0, policy_version 720458 (0.00083) [2022-07-10 12:18:53,362][26022] Updated weights on worker 0-0, policy_version 720468 (0.00087) [2022-07-10 12:18:54,214][25689] Fps is (10 sec: 5503.8, 60 sec: 5522.5, 300 sec: 5539.4). Total num frames: 737763328. Throughput: 0: 5814.6. Samples: 737771294. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:54,215][25689] Avg episode reward: [(0, '-2.254')] [2022-07-10 12:18:55,187][26022] Updated weights on worker 0-0, policy_version 720478 (0.00092) [2022-07-10 12:18:57,165][26022] Updated weights on worker 0-0, policy_version 720488 (0.00835) [2022-07-10 12:18:58,898][26022] Updated weights on worker 0-0, policy_version 720498 (0.00089) [2022-07-10 12:18:59,254][25689] Fps is (10 sec: 5506.6, 60 sec: 5510.6, 300 sec: 5539.5). Total num frames: 737790976. Throughput: 0: 4966.1. Samples: 737788168. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:18:59,255][25689] Avg episode reward: [(0, '-2.072')] [2022-07-10 12:19:00,728][26022] Updated weights on worker 0-0, policy_version 720508 (0.00086) [2022-07-10 12:19:03,038][26022] Updated weights on worker 0-0, policy_version 720518 (0.00090) [2022-07-10 12:19:04,260][25689] Fps is (10 sec: 5300.5, 60 sec: 5531.5, 300 sec: 5534.3). Total num frames: 737816576. Throughput: 0: 5707.2. Samples: 737819874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:04,261][25689] Avg episode reward: [(0, '-3.628')] [2022-07-10 12:19:04,703][26022] Updated weights on worker 0-0, policy_version 720528 (0.00095) [2022-07-10 12:19:06,875][26022] Updated weights on worker 0-0, policy_version 720538 (0.00086) [2022-07-10 12:19:08,304][26022] Updated weights on worker 0-0, policy_version 720548 (0.00090) [2022-07-10 12:19:09,303][25689] Fps is (10 sec: 5502.5, 60 sec: 5535.8, 300 sec: 5547.6). Total num frames: 737846272. Throughput: 0: 5711.3. Samples: 737853170. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:09,304][25689] Avg episode reward: [(0, '-4.335')] [2022-07-10 12:19:10,436][26022] Updated weights on worker 0-0, policy_version 720558 (0.00085) [2022-07-10 12:19:12,141][26022] Updated weights on worker 0-0, policy_version 720568 (0.00088) [2022-07-10 12:19:14,207][26022] Updated weights on worker 0-0, policy_version 720578 (0.00083) [2022-07-10 12:19:14,335][25689] Fps is (10 sec: 5488.6, 60 sec: 5483.4, 300 sec: 5540.6). Total num frames: 737871872. Throughput: 0: 4902.7. Samples: 737869938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:14,335][25689] Avg episode reward: [(0, '-4.344')] [2022-07-10 12:19:15,839][26022] Updated weights on worker 0-0, policy_version 720588 (0.00090) [2022-07-10 12:19:17,751][26022] Updated weights on worker 0-0, policy_version 720598 (0.00092) [2022-07-10 12:19:19,355][25689] Fps is (10 sec: 5501.4, 60 sec: 5534.2, 300 sec: 5543.9). Total num frames: 737901568. Throughput: 0: 5730.2. Samples: 737903348. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:19,355][25689] Avg episode reward: [(0, '-4.756')] [2022-07-10 12:19:19,470][26022] Updated weights on worker 0-0, policy_version 720608 (0.00092) [2022-07-10 12:19:21,407][26022] Updated weights on worker 0-0, policy_version 720618 (0.00091) [2022-07-10 12:19:23,377][26022] Updated weights on worker 0-0, policy_version 720628 (0.00092) [2022-07-10 12:19:24,360][25689] Fps is (10 sec: 5617.8, 60 sec: 5517.6, 300 sec: 5542.8). Total num frames: 737928192. Throughput: 0: 5817.4. Samples: 737936804. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:24,362][25689] Avg episode reward: [(0, '-5.596')] [2022-07-10 12:19:24,946][26022] Updated weights on worker 0-0, policy_version 720638 (0.00084) [2022-07-10 12:19:26,975][26022] Updated weights on worker 0-0, policy_version 720648 (0.00091) [2022-07-10 12:19:28,599][26022] Updated weights on worker 0-0, policy_version 720658 (0.00081) [2022-07-10 12:19:29,465][25689] Fps is (10 sec: 5469.1, 60 sec: 5499.2, 300 sec: 5540.9). Total num frames: 737956864. Throughput: 0: 4976.8. Samples: 737953512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:29,466][25689] Avg episode reward: [(0, '-5.570')] [2022-07-10 12:19:30,572][26022] Updated weights on worker 0-0, policy_version 720668 (0.00084) [2022-07-10 12:19:32,320][26022] Updated weights on worker 0-0, policy_version 720678 (0.00085) [2022-07-10 12:19:34,221][26022] Updated weights on worker 0-0, policy_version 720688 (0.00087) [2022-07-10 12:19:34,535][25689] Fps is (10 sec: 5635.9, 60 sec: 5544.6, 300 sec: 5540.3). Total num frames: 737985536. Throughput: 0: 5800.1. Samples: 737987100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:34,535][25689] Avg episode reward: [(0, '-3.296')] [2022-07-10 12:19:36,062][26022] Updated weights on worker 0-0, policy_version 720698 (0.00088) [2022-07-10 12:19:37,913][26022] Updated weights on worker 0-0, policy_version 720708 (0.00094) [2022-07-10 12:19:39,551][25689] Fps is (10 sec: 5584.3, 60 sec: 5509.6, 300 sec: 5547.1). Total num frames: 738013184. Throughput: 0: 5813.2. Samples: 738020752. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:39,551][25689] Avg episode reward: [(0, '-1.801')] [2022-07-10 12:19:39,722][26022] Updated weights on worker 0-0, policy_version 720718 (0.00086) [2022-07-10 12:19:41,449][26022] Updated weights on worker 0-0, policy_version 720728 (0.00079) [2022-07-10 12:19:43,510][26022] Updated weights on worker 0-0, policy_version 720738 (0.00094) [2022-07-10 12:19:44,559][25689] Fps is (10 sec: 5618.6, 60 sec: 5527.7, 300 sec: 5544.7). Total num frames: 738041856. Throughput: 0: 4977.2. Samples: 738037336. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:44,559][25689] Avg episode reward: [(0, '-3.349')] [2022-07-10 12:19:45,407][26022] Updated weights on worker 0-0, policy_version 720748 (0.00094) [2022-07-10 12:19:47,169][26022] Updated weights on worker 0-0, policy_version 720758 (0.00200) [2022-07-10 12:19:48,965][26022] Updated weights on worker 0-0, policy_version 720768 (0.00083) [2022-07-10 12:19:49,658][25689] Fps is (10 sec: 5673.6, 60 sec: 5539.7, 300 sec: 5549.8). Total num frames: 738070528. Throughput: 0: 5803.3. Samples: 738070696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:49,658][25689] Avg episode reward: [(0, '-4.150')] [2022-07-10 12:19:50,725][26022] Updated weights on worker 0-0, policy_version 720778 (0.00091) [2022-07-10 12:19:52,608][26022] Updated weights on worker 0-0, policy_version 720788 (0.00102) [2022-07-10 12:19:54,493][26022] Updated weights on worker 0-0, policy_version 720798 (0.00625) [2022-07-10 12:19:54,676][25689] Fps is (10 sec: 5566.7, 60 sec: 5538.2, 300 sec: 5543.0). Total num frames: 738098176. Throughput: 0: 5809.1. Samples: 738104102. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:54,676][25689] Avg episode reward: [(0, '-4.665')] [2022-07-10 12:19:56,235][26022] Updated weights on worker 0-0, policy_version 720808 (0.00093) [2022-07-10 12:19:58,188][26022] Updated weights on worker 0-0, policy_version 720818 (0.00094) [2022-07-10 12:19:59,727][25689] Fps is (10 sec: 5593.5, 60 sec: 5554.1, 300 sec: 5549.6). Total num frames: 738126848. Throughput: 0: 4962.3. Samples: 738120872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:19:59,729][25689] Avg episode reward: [(0, '-6.403')] [2022-07-10 12:19:59,857][26022] Updated weights on worker 0-0, policy_version 720828 (0.00088) [2022-07-10 12:20:01,986][26022] Updated weights on worker 0-0, policy_version 720838 (0.00077) [2022-07-10 12:20:03,781][26022] Updated weights on worker 0-0, policy_version 720848 (0.00085) [2022-07-10 12:20:04,762][25689] Fps is (10 sec: 5381.0, 60 sec: 5551.4, 300 sec: 5544.5). Total num frames: 738152448. Throughput: 0: 5709.6. Samples: 738152688. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:04,764][25689] Avg episode reward: [(0, '-6.428')] [2022-07-10 12:20:05,843][26022] Updated weights on worker 0-0, policy_version 720858 (0.00087) [2022-07-10 12:20:07,670][26022] Updated weights on worker 0-0, policy_version 720868 (0.00100) [2022-07-10 12:20:09,422][26022] Updated weights on worker 0-0, policy_version 720878 (0.00081) [2022-07-10 12:20:09,892][25689] Fps is (10 sec: 5238.0, 60 sec: 5509.7, 300 sec: 5535.5). Total num frames: 738180096. Throughput: 0: 5705.7. Samples: 738186148. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:09,893][25689] Avg episode reward: [(0, '-6.037')] [2022-07-10 12:20:11,213][26022] Updated weights on worker 0-0, policy_version 720888 (0.00095) [2022-07-10 12:20:13,181][26022] Updated weights on worker 0-0, policy_version 720898 (0.00099) [2022-07-10 12:20:14,900][25689] Fps is (10 sec: 5555.2, 60 sec: 5562.5, 300 sec: 5543.1). Total num frames: 738208768. Throughput: 0: 4887.4. Samples: 738202946. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:14,901][25689] Avg episode reward: [(0, '-5.427')] [2022-07-10 12:20:14,914][26022] Updated weights on worker 0-0, policy_version 720908 (0.00092) [2022-07-10 12:20:16,744][26022] Updated weights on worker 0-0, policy_version 720918 (0.00084) [2022-07-10 12:20:18,519][26022] Updated weights on worker 0-0, policy_version 720928 (0.00085) [2022-07-10 12:20:19,907][25689] Fps is (10 sec: 5828.5, 60 sec: 5563.8, 300 sec: 5543.2). Total num frames: 738238464. Throughput: 0: 5733.9. Samples: 738236582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:19,907][25689] Avg episode reward: [(0, '-4.083')] [2022-07-10 12:20:20,498][26022] Updated weights on worker 0-0, policy_version 720938 (0.00097) [2022-07-10 12:20:22,296][26022] Updated weights on worker 0-0, policy_version 720948 (0.00087) [2022-07-10 12:20:24,154][26022] Updated weights on worker 0-0, policy_version 720958 (0.00081) [2022-07-10 12:20:24,945][25689] Fps is (10 sec: 5607.0, 60 sec: 5560.7, 300 sec: 5540.3). Total num frames: 738265088. Throughput: 0: 5811.2. Samples: 738269974. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:24,945][25689] Avg episode reward: [(0, '-3.286')] [2022-07-10 12:20:25,859][26022] Updated weights on worker 0-0, policy_version 720968 (0.00100) [2022-07-10 12:20:27,899][26022] Updated weights on worker 0-0, policy_version 720978 (0.00084) [2022-07-10 12:20:29,592][26022] Updated weights on worker 0-0, policy_version 720988 (0.00088) [2022-07-10 12:20:29,991][25689] Fps is (10 sec: 5483.3, 60 sec: 5566.2, 300 sec: 5543.4). Total num frames: 738293760. Throughput: 0: 5010.2. Samples: 738286846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:29,992][25689] Avg episode reward: [(0, '-2.273')] [2022-07-10 12:20:31,147][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:20:31,168][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000720996_738299904.pth [2022-07-10 12:20:31,169][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000719045_736302080.pth [2022-07-10 12:20:31,433][26022] Updated weights on worker 0-0, policy_version 720998 (0.00079) [2022-07-10 12:20:33,300][26022] Updated weights on worker 0-0, policy_version 721008 (0.00082) [2022-07-10 12:20:35,027][25689] Fps is (10 sec: 5585.9, 60 sec: 5552.3, 300 sec: 5536.2). Total num frames: 738321408. Throughput: 0: 5846.1. Samples: 738320612. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:35,029][25689] Avg episode reward: [(0, '-3.551')] [2022-07-10 12:20:35,052][26022] Updated weights on worker 0-0, policy_version 721018 (0.00092) [2022-07-10 12:20:36,984][26022] Updated weights on worker 0-0, policy_version 721028 (0.00094) [2022-07-10 12:20:38,465][26022] Updated weights on worker 0-0, policy_version 721038 (0.00087) [2022-07-10 12:20:40,099][25689] Fps is (10 sec: 5571.9, 60 sec: 5564.1, 300 sec: 5545.6). Total num frames: 738350080. Throughput: 0: 5848.1. Samples: 738354668. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:40,101][25689] Avg episode reward: [(0, '-4.811')] [2022-07-10 12:20:40,479][26022] Updated weights on worker 0-0, policy_version 721048 (0.00100) [2022-07-10 12:20:42,224][26022] Updated weights on worker 0-0, policy_version 721058 (0.00084) [2022-07-10 12:20:44,294][26022] Updated weights on worker 0-0, policy_version 721068 (0.00082) [2022-07-10 12:20:45,122][25689] Fps is (10 sec: 5680.4, 60 sec: 5562.7, 300 sec: 5546.3). Total num frames: 738378752. Throughput: 0: 5037.6. Samples: 738371620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:45,123][25689] Avg episode reward: [(0, '-3.674')] [2022-07-10 12:20:45,933][26022] Updated weights on worker 0-0, policy_version 721078 (0.00095) [2022-07-10 12:20:47,919][26022] Updated weights on worker 0-0, policy_version 721088 (0.00098) [2022-07-10 12:20:49,494][26022] Updated weights on worker 0-0, policy_version 721098 (0.00087) [2022-07-10 12:20:50,163][25689] Fps is (10 sec: 5697.9, 60 sec: 5568.1, 300 sec: 5546.8). Total num frames: 738407424. Throughput: 0: 5868.8. Samples: 738405230. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:50,163][25689] Avg episode reward: [(0, '-3.848')] [2022-07-10 12:20:51,726][26022] Updated weights on worker 0-0, policy_version 721108 (0.00085) [2022-07-10 12:20:53,234][26022] Updated weights on worker 0-0, policy_version 721118 (0.00093) [2022-07-10 12:20:55,175][25689] Fps is (10 sec: 5500.8, 60 sec: 5551.7, 300 sec: 5540.1). Total num frames: 738434048. Throughput: 0: 5849.3. Samples: 738438460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:20:55,175][25689] Avg episode reward: [(0, '-4.328')] [2022-07-10 12:20:55,225][26022] Updated weights on worker 0-0, policy_version 721128 (0.00085) [2022-07-10 12:20:57,149][26022] Updated weights on worker 0-0, policy_version 721138 (0.00053) [2022-07-10 12:20:58,834][26022] Updated weights on worker 0-0, policy_version 721148 (0.00094) [2022-07-10 12:21:00,199][25689] Fps is (10 sec: 5407.8, 60 sec: 5537.2, 300 sec: 5546.8). Total num frames: 738461696. Throughput: 0: 5821.6. Samples: 738471682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:00,200][25689] Avg episode reward: [(0, '-4.794')] [2022-07-10 12:21:00,719][26022] Updated weights on worker 0-0, policy_version 721158 (0.00098) [2022-07-10 12:21:02,959][26022] Updated weights on worker 0-0, policy_version 721168 (0.00090) [2022-07-10 12:21:04,646][26022] Updated weights on worker 0-0, policy_version 721178 (0.00080) [2022-07-10 12:21:05,219][25689] Fps is (10 sec: 5403.2, 60 sec: 5555.5, 300 sec: 5537.6). Total num frames: 738488320. Throughput: 0: 5711.4. Samples: 738486400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:05,221][25689] Avg episode reward: [(0, '-2.254')] [2022-07-10 12:21:06,791][26022] Updated weights on worker 0-0, policy_version 721188 (0.00098) [2022-07-10 12:21:08,460][26022] Updated weights on worker 0-0, policy_version 721198 (0.00089) [2022-07-10 12:21:10,323][25689] Fps is (10 sec: 5461.5, 60 sec: 5574.9, 300 sec: 5540.2). Total num frames: 738516992. Throughput: 0: 5685.6. Samples: 738519854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:10,324][25689] Avg episode reward: [(0, '-2.297')] [2022-07-10 12:21:10,325][26022] Updated weights on worker 0-0, policy_version 721208 (0.00091) [2022-07-10 12:21:12,230][26022] Updated weights on worker 0-0, policy_version 721218 (0.00094) [2022-07-10 12:21:14,047][26022] Updated weights on worker 0-0, policy_version 721228 (0.00095) [2022-07-10 12:21:15,376][25689] Fps is (10 sec: 5544.6, 60 sec: 5553.8, 300 sec: 5540.4). Total num frames: 738544640. Throughput: 0: 5683.0. Samples: 738553268. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:15,378][25689] Avg episode reward: [(0, '-3.076')] [2022-07-10 12:21:15,736][26022] Updated weights on worker 0-0, policy_version 721238 (0.00084) [2022-07-10 12:21:17,780][26022] Updated weights on worker 0-0, policy_version 721248 (0.00086) [2022-07-10 12:21:19,450][26022] Updated weights on worker 0-0, policy_version 721258 (0.00080) [2022-07-10 12:21:20,407][25689] Fps is (10 sec: 5483.5, 60 sec: 5517.7, 300 sec: 5537.3). Total num frames: 738572288. Throughput: 0: 4871.7. Samples: 738570132. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:20,409][25689] Avg episode reward: [(0, '-3.262')] [2022-07-10 12:21:21,330][26022] Updated weights on worker 0-0, policy_version 721268 (0.00084) [2022-07-10 12:21:23,060][26022] Updated weights on worker 0-0, policy_version 721278 (0.00085) [2022-07-10 12:21:24,916][26022] Updated weights on worker 0-0, policy_version 721288 (0.00096) [2022-07-10 12:21:25,417][25689] Fps is (10 sec: 5609.0, 60 sec: 5554.1, 300 sec: 5541.2). Total num frames: 738600960. Throughput: 0: 5823.5. Samples: 738604026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:25,420][25689] Avg episode reward: [(0, '-2.378')] [2022-07-10 12:21:27,026][26022] Updated weights on worker 0-0, policy_version 721298 (0.00090) [2022-07-10 12:21:28,606][26022] Updated weights on worker 0-0, policy_version 721308 (0.00086) [2022-07-10 12:21:30,520][25689] Fps is (10 sec: 5569.1, 60 sec: 5532.0, 300 sec: 5539.7). Total num frames: 738628608. Throughput: 0: 5809.5. Samples: 738637188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:30,521][25689] Avg episode reward: [(0, '-1.907')] [2022-07-10 12:21:30,755][26022] Updated weights on worker 0-0, policy_version 721318 (0.00097) [2022-07-10 12:21:32,351][26022] Updated weights on worker 0-0, policy_version 721328 (0.00095) [2022-07-10 12:21:34,250][26022] Updated weights on worker 0-0, policy_version 721338 (0.00088) [2022-07-10 12:21:35,537][25689] Fps is (10 sec: 5565.3, 60 sec: 5550.7, 300 sec: 5539.5). Total num frames: 738657280. Throughput: 0: 4993.7. Samples: 738653944. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:35,538][25689] Avg episode reward: [(0, '-2.372')] [2022-07-10 12:21:36,031][26022] Updated weights on worker 0-0, policy_version 721348 (0.00082) [2022-07-10 12:21:37,935][26022] Updated weights on worker 0-0, policy_version 721358 (0.00091) [2022-07-10 12:21:39,714][26022] Updated weights on worker 0-0, policy_version 721368 (0.00093) [2022-07-10 12:21:40,540][25689] Fps is (10 sec: 5620.9, 60 sec: 5540.1, 300 sec: 5542.9). Total num frames: 738684928. Throughput: 0: 5816.9. Samples: 738687242. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:40,541][25689] Avg episode reward: [(0, '-2.397')] [2022-07-10 12:21:41,633][26022] Updated weights on worker 0-0, policy_version 721378 (0.00099) [2022-07-10 12:21:43,554][26022] Updated weights on worker 0-0, policy_version 721388 (0.00093) [2022-07-10 12:21:45,217][26022] Updated weights on worker 0-0, policy_version 721398 (0.00053) [2022-07-10 12:21:45,576][25689] Fps is (10 sec: 5610.0, 60 sec: 5538.9, 300 sec: 5540.3). Total num frames: 738713600. Throughput: 0: 5808.4. Samples: 738721118. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:45,577][25689] Avg episode reward: [(0, '-0.811')] [2022-07-10 12:21:47,159][26022] Updated weights on worker 0-0, policy_version 721408 (0.00094) [2022-07-10 12:21:48,841][26022] Updated weights on worker 0-0, policy_version 721418 (0.00080) [2022-07-10 12:21:50,636][25689] Fps is (10 sec: 5578.3, 60 sec: 5520.2, 300 sec: 5539.7). Total num frames: 738741248. Throughput: 0: 5008.2. Samples: 738737934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:50,637][25689] Avg episode reward: [(0, '-1.045')] [2022-07-10 12:21:50,783][26022] Updated weights on worker 0-0, policy_version 721428 (0.00092) [2022-07-10 12:21:52,502][26022] Updated weights on worker 0-0, policy_version 721438 (0.00088) [2022-07-10 12:21:54,266][26022] Updated weights on worker 0-0, policy_version 721448 (0.00088) [2022-07-10 12:21:55,737][25689] Fps is (10 sec: 5543.2, 60 sec: 5545.9, 300 sec: 5541.5). Total num frames: 738769920. Throughput: 0: 5837.3. Samples: 738771854. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:21:55,737][25689] Avg episode reward: [(0, '-1.399')] [2022-07-10 12:21:56,292][26022] Updated weights on worker 0-0, policy_version 721458 (0.00104) [2022-07-10 12:21:57,851][26022] Updated weights on worker 0-0, policy_version 721468 (0.00091) [2022-07-10 12:21:59,930][26022] Updated weights on worker 0-0, policy_version 721478 (0.00090) [2022-07-10 12:22:00,835][25689] Fps is (10 sec: 5723.3, 60 sec: 5572.9, 300 sec: 5554.0). Total num frames: 738799616. Throughput: 0: 5818.7. Samples: 738805330. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:00,835][25689] Avg episode reward: [(0, '-2.073')] [2022-07-10 12:22:01,580][26022] Updated weights on worker 0-0, policy_version 721488 (0.00090) [2022-07-10 12:22:03,847][26022] Updated weights on worker 0-0, policy_version 721498 (0.00086) [2022-07-10 12:22:05,636][26022] Updated weights on worker 0-0, policy_version 721508 (0.00094) [2022-07-10 12:22:05,847][25689] Fps is (10 sec: 5469.1, 60 sec: 5556.7, 300 sec: 5545.9). Total num frames: 738825216. Throughput: 0: 4883.5. Samples: 738820110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:05,848][25689] Avg episode reward: [(0, '-1.482')] [2022-07-10 12:22:07,513][26022] Updated weights on worker 0-0, policy_version 721518 (0.00091) [2022-07-10 12:22:09,279][26022] Updated weights on worker 0-0, policy_version 721528 (0.00078) [2022-07-10 12:22:10,955][25689] Fps is (10 sec: 5261.7, 60 sec: 5539.6, 300 sec: 5544.0). Total num frames: 738852864. Throughput: 0: 5708.4. Samples: 738853918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:10,955][25689] Avg episode reward: [(0, '-2.006')] [2022-07-10 12:22:11,323][26022] Updated weights on worker 0-0, policy_version 721538 (0.00088) [2022-07-10 12:22:12,844][26022] Updated weights on worker 0-0, policy_version 721548 (0.00095) [2022-07-10 12:22:15,002][26022] Updated weights on worker 0-0, policy_version 721558 (0.00090) [2022-07-10 12:22:15,978][25689] Fps is (10 sec: 5559.3, 60 sec: 5559.2, 300 sec: 5543.9). Total num frames: 738881536. Throughput: 0: 5718.0. Samples: 738887594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:15,979][25689] Avg episode reward: [(0, '-2.108')] [2022-07-10 12:22:16,540][26022] Updated weights on worker 0-0, policy_version 721568 (0.00091) [2022-07-10 12:22:18,627][26022] Updated weights on worker 0-0, policy_version 721578 (0.00092) [2022-07-10 12:22:20,359][26022] Updated weights on worker 0-0, policy_version 721588 (0.00088) [2022-07-10 12:22:21,075][25689] Fps is (10 sec: 5564.8, 60 sec: 5553.1, 300 sec: 5539.0). Total num frames: 738909184. Throughput: 0: 4886.4. Samples: 738904234. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:21,076][25689] Avg episode reward: [(0, '-2.351')] [2022-07-10 12:22:22,368][26022] Updated weights on worker 0-0, policy_version 721598 (0.00093) [2022-07-10 12:22:24,014][26022] Updated weights on worker 0-0, policy_version 721608 (0.00084) [2022-07-10 12:22:25,712][26022] Updated weights on worker 0-0, policy_version 721618 (0.00085) [2022-07-10 12:22:26,127][25689] Fps is (10 sec: 5549.1, 60 sec: 5549.3, 300 sec: 5543.5). Total num frames: 738937856. Throughput: 0: 5801.1. Samples: 738937756. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:26,128][25689] Avg episode reward: [(0, '-3.845')] [2022-07-10 12:22:27,713][26022] Updated weights on worker 0-0, policy_version 721628 (0.00088) [2022-07-10 12:22:29,628][26022] Updated weights on worker 0-0, policy_version 721638 (0.00091) [2022-07-10 12:22:31,185][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:22:31,203][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000721647_738966528.pth [2022-07-10 12:22:31,204][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000719697_736969728.pth [2022-07-10 12:22:31,205][25689] Fps is (10 sec: 5661.1, 60 sec: 5568.5, 300 sec: 5543.4). Total num frames: 738966528. Throughput: 0: 5784.9. Samples: 738971062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:31,205][25689] Avg episode reward: [(0, '-4.128')] [2022-07-10 12:22:31,309][26022] Updated weights on worker 0-0, policy_version 721648 (0.00086) [2022-07-10 12:22:33,118][26022] Updated weights on worker 0-0, policy_version 721658 (0.00092) [2022-07-10 12:22:34,987][26022] Updated weights on worker 0-0, policy_version 721668 (0.00087) [2022-07-10 12:22:36,232][25689] Fps is (10 sec: 5573.6, 60 sec: 5550.7, 300 sec: 5541.0). Total num frames: 738994176. Throughput: 0: 4956.6. Samples: 738987986. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:36,233][25689] Avg episode reward: [(0, '-4.295')] [2022-07-10 12:22:36,612][26022] Updated weights on worker 0-0, policy_version 721678 (0.00086) [2022-07-10 12:22:38,827][26022] Updated weights on worker 0-0, policy_version 721688 (0.00088) [2022-07-10 12:22:40,415][26022] Updated weights on worker 0-0, policy_version 721698 (0.00087) [2022-07-10 12:22:41,304][25689] Fps is (10 sec: 5576.5, 60 sec: 5561.2, 300 sec: 5539.7). Total num frames: 739022848. Throughput: 0: 5813.7. Samples: 739021838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:41,305][25689] Avg episode reward: [(0, '-3.389')] [2022-07-10 12:22:42,379][26022] Updated weights on worker 0-0, policy_version 721708 (0.00085) [2022-07-10 12:22:44,035][26022] Updated weights on worker 0-0, policy_version 721718 (0.00099) [2022-07-10 12:22:46,015][26022] Updated weights on worker 0-0, policy_version 721728 (0.00085) [2022-07-10 12:22:46,330][25689] Fps is (10 sec: 5475.8, 60 sec: 5528.4, 300 sec: 5536.8). Total num frames: 739049472. Throughput: 0: 5810.1. Samples: 739055138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:46,331][25689] Avg episode reward: [(0, '-4.021')] [2022-07-10 12:22:48,028][26022] Updated weights on worker 0-0, policy_version 721738 (0.00092) [2022-07-10 12:22:49,838][26022] Updated weights on worker 0-0, policy_version 721748 (0.00374) [2022-07-10 12:22:51,418][25689] Fps is (10 sec: 5568.7, 60 sec: 5559.6, 300 sec: 5542.3). Total num frames: 739079168. Throughput: 0: 4986.1. Samples: 739071848. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:51,419][25689] Avg episode reward: [(0, '-4.881')] [2022-07-10 12:22:51,488][26022] Updated weights on worker 0-0, policy_version 721758 (0.00121) [2022-07-10 12:22:53,601][26022] Updated weights on worker 0-0, policy_version 721768 (0.00096) [2022-07-10 12:22:55,039][26022] Updated weights on worker 0-0, policy_version 721778 (0.00087) [2022-07-10 12:22:56,488][25689] Fps is (10 sec: 5645.5, 60 sec: 5545.5, 300 sec: 5539.3). Total num frames: 739106816. Throughput: 0: 5792.0. Samples: 739105308. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:22:56,488][25689] Avg episode reward: [(0, '-3.389')] [2022-07-10 12:22:57,239][26022] Updated weights on worker 0-0, policy_version 721788 (0.00108) [2022-07-10 12:22:58,959][26022] Updated weights on worker 0-0, policy_version 721798 (0.00087) [2022-07-10 12:23:00,749][26022] Updated weights on worker 0-0, policy_version 721808 (0.00086) [2022-07-10 12:23:01,572][25689] Fps is (10 sec: 5445.8, 60 sec: 5513.1, 300 sec: 5549.0). Total num frames: 739134464. Throughput: 0: 5756.0. Samples: 739138498. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:01,572][25689] Avg episode reward: [(0, '-2.371')] [2022-07-10 12:23:03,075][26022] Updated weights on worker 0-0, policy_version 721818 (0.00078) [2022-07-10 12:23:04,829][26022] Updated weights on worker 0-0, policy_version 721828 (0.00088) [2022-07-10 12:23:06,588][25689] Fps is (10 sec: 5474.7, 60 sec: 5546.4, 300 sec: 5543.5). Total num frames: 739162112. Throughput: 0: 5659.6. Samples: 739169788. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:06,589][25689] Avg episode reward: [(0, '-2.538')] [2022-07-10 12:23:06,596][26022] Updated weights on worker 0-0, policy_version 721838 (0.00092) [2022-07-10 12:23:08,388][26022] Updated weights on worker 0-0, policy_version 721848 (0.00089) [2022-07-10 12:23:10,303][26022] Updated weights on worker 0-0, policy_version 721858 (0.00096) [2022-07-10 12:23:11,651][25689] Fps is (10 sec: 5384.4, 60 sec: 5533.6, 300 sec: 5535.7). Total num frames: 739188736. Throughput: 0: 5670.0. Samples: 739186572. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:11,653][25689] Avg episode reward: [(0, '-3.130')] [2022-07-10 12:23:12,334][26022] Updated weights on worker 0-0, policy_version 721868 (0.00087) [2022-07-10 12:23:14,091][26022] Updated weights on worker 0-0, policy_version 721878 (0.00083) [2022-07-10 12:23:16,047][26022] Updated weights on worker 0-0, policy_version 721888 (0.00092) [2022-07-10 12:23:16,686][25689] Fps is (10 sec: 5374.8, 60 sec: 5515.7, 300 sec: 5538.8). Total num frames: 739216384. Throughput: 0: 5688.4. Samples: 739220202. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:16,688][25689] Avg episode reward: [(0, '-1.758')] [2022-07-10 12:23:17,741][26022] Updated weights on worker 0-0, policy_version 721898 (0.00090) [2022-07-10 12:23:19,576][26022] Updated weights on worker 0-0, policy_version 721908 (0.00087) [2022-07-10 12:23:21,404][26022] Updated weights on worker 0-0, policy_version 721918 (0.00093) [2022-07-10 12:23:21,709][25689] Fps is (10 sec: 5599.7, 60 sec: 5539.3, 300 sec: 5542.0). Total num frames: 739245056. Throughput: 0: 5717.4. Samples: 739253630. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:21,711][25689] Avg episode reward: [(0, '-0.484')] [2022-07-10 12:23:23,384][26022] Updated weights on worker 0-0, policy_version 721928 (0.00084) [2022-07-10 12:23:25,173][26022] Updated weights on worker 0-0, policy_version 721938 (0.00091) [2022-07-10 12:23:26,715][25689] Fps is (10 sec: 5615.7, 60 sec: 5526.7, 300 sec: 5536.7). Total num frames: 739272704. Throughput: 0: 4997.0. Samples: 739270360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:26,715][25689] Avg episode reward: [(0, '-2.119')] [2022-07-10 12:23:27,185][26022] Updated weights on worker 0-0, policy_version 721948 (0.00090) [2022-07-10 12:23:28,776][26022] Updated weights on worker 0-0, policy_version 721958 (0.00092) [2022-07-10 12:23:30,749][26022] Updated weights on worker 0-0, policy_version 721968 (0.00053) [2022-07-10 12:23:31,769][25689] Fps is (10 sec: 5496.8, 60 sec: 5511.9, 300 sec: 5542.7). Total num frames: 739300352. Throughput: 0: 5802.4. Samples: 739303302. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:31,771][25689] Avg episode reward: [(0, '-2.572')] [2022-07-10 12:23:32,422][26022] Updated weights on worker 0-0, policy_version 721978 (0.00089) [2022-07-10 12:23:34,569][26022] Updated weights on worker 0-0, policy_version 721988 (0.00086) [2022-07-10 12:23:36,144][26022] Updated weights on worker 0-0, policy_version 721998 (0.00089) [2022-07-10 12:23:36,803][25689] Fps is (10 sec: 5481.3, 60 sec: 5511.3, 300 sec: 5535.3). Total num frames: 739328000. Throughput: 0: 5771.8. Samples: 739336314. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:36,803][25689] Avg episode reward: [(0, '-2.620')] [2022-07-10 12:23:38,272][26022] Updated weights on worker 0-0, policy_version 722008 (0.00050) [2022-07-10 12:23:39,910][26022] Updated weights on worker 0-0, policy_version 722018 (0.00080) [2022-07-10 12:23:41,813][25689] Fps is (10 sec: 5505.1, 60 sec: 5500.0, 300 sec: 5535.5). Total num frames: 739355648. Throughput: 0: 4956.8. Samples: 739353282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:41,815][25689] Avg episode reward: [(0, '-2.887')] [2022-07-10 12:23:41,919][26022] Updated weights on worker 0-0, policy_version 722028 (0.00085) [2022-07-10 12:23:43,602][26022] Updated weights on worker 0-0, policy_version 722038 (0.00088) [2022-07-10 12:23:45,444][26022] Updated weights on worker 0-0, policy_version 722048 (0.00087) [2022-07-10 12:23:46,825][25689] Fps is (10 sec: 5619.7, 60 sec: 5535.2, 300 sec: 5539.6). Total num frames: 739384320. Throughput: 0: 5800.9. Samples: 739387016. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:46,826][25689] Avg episode reward: [(0, '-3.600')] [2022-07-10 12:23:47,264][26022] Updated weights on worker 0-0, policy_version 722058 (0.00085) [2022-07-10 12:23:49,075][26022] Updated weights on worker 0-0, policy_version 722068 (0.00090) [2022-07-10 12:23:51,164][26022] Updated weights on worker 0-0, policy_version 722078 (0.00093) [2022-07-10 12:23:51,952][25689] Fps is (10 sec: 5656.1, 60 sec: 5514.7, 300 sec: 5540.7). Total num frames: 739412992. Throughput: 0: 5782.1. Samples: 739420002. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:51,952][25689] Avg episode reward: [(0, '-3.710')] [2022-07-10 12:23:52,918][26022] Updated weights on worker 0-0, policy_version 722088 (0.00092) [2022-07-10 12:23:54,601][26022] Updated weights on worker 0-0, policy_version 722098 (0.00085) [2022-07-10 12:23:56,661][26022] Updated weights on worker 0-0, policy_version 722108 (0.00094) [2022-07-10 12:23:56,954][25689] Fps is (10 sec: 5559.9, 60 sec: 5520.8, 300 sec: 5541.4). Total num frames: 739440640. Throughput: 0: 4988.0. Samples: 739436830. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:23:56,955][25689] Avg episode reward: [(0, '-2.495')] [2022-07-10 12:23:58,422][26022] Updated weights on worker 0-0, policy_version 722118 (0.00086) [2022-07-10 12:24:00,371][26022] Updated weights on worker 0-0, policy_version 722128 (0.00084) [2022-07-10 12:24:01,964][25689] Fps is (10 sec: 5318.5, 60 sec: 5493.7, 300 sec: 5541.3). Total num frames: 739466240. Throughput: 0: 5791.1. Samples: 739469974. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:01,964][25689] Avg episode reward: [(0, '-4.024')] [2022-07-10 12:24:02,419][26022] Updated weights on worker 0-0, policy_version 722138 (0.00089) [2022-07-10 12:24:04,319][26022] Updated weights on worker 0-0, policy_version 722148 (0.00090) [2022-07-10 12:24:06,285][26022] Updated weights on worker 0-0, policy_version 722158 (0.00094) [2022-07-10 12:24:07,005][25689] Fps is (10 sec: 5196.0, 60 sec: 5474.5, 300 sec: 5531.0). Total num frames: 739492864. Throughput: 0: 5664.5. Samples: 739501328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:07,006][25689] Avg episode reward: [(0, '-4.326')] [2022-07-10 12:24:08,026][26022] Updated weights on worker 0-0, policy_version 722168 (0.00424) [2022-07-10 12:24:09,933][26022] Updated weights on worker 0-0, policy_version 722178 (0.00084) [2022-07-10 12:24:11,478][26022] Updated weights on worker 0-0, policy_version 722188 (0.00083) [2022-07-10 12:24:12,103][25689] Fps is (10 sec: 5554.4, 60 sec: 5522.1, 300 sec: 5543.5). Total num frames: 739522560. Throughput: 0: 4867.1. Samples: 739518082. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:12,104][25689] Avg episode reward: [(0, '-2.406')] [2022-07-10 12:24:13,368][26022] Updated weights on worker 0-0, policy_version 722198 (0.00087) [2022-07-10 12:24:15,244][26022] Updated weights on worker 0-0, policy_version 722208 (0.00088) [2022-07-10 12:24:17,015][26022] Updated weights on worker 0-0, policy_version 722218 (0.00131) [2022-07-10 12:24:17,124][25689] Fps is (10 sec: 5869.7, 60 sec: 5557.3, 300 sec: 5543.5). Total num frames: 739552256. Throughput: 0: 5713.7. Samples: 739552072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:17,124][25689] Avg episode reward: [(0, '-2.534')] [2022-07-10 12:24:19,048][26022] Updated weights on worker 0-0, policy_version 722228 (0.00092) [2022-07-10 12:24:20,636][26022] Updated weights on worker 0-0, policy_version 722238 (0.00077) [2022-07-10 12:24:22,132][25689] Fps is (10 sec: 5615.7, 60 sec: 5524.8, 300 sec: 5543.5). Total num frames: 739578880. Throughput: 0: 5737.2. Samples: 739585686. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:22,133][25689] Avg episode reward: [(0, '-2.314')] [2022-07-10 12:24:22,704][26022] Updated weights on worker 0-0, policy_version 722248 (0.00113) [2022-07-10 12:24:24,304][26022] Updated weights on worker 0-0, policy_version 722258 (0.00081) [2022-07-10 12:24:26,337][26022] Updated weights on worker 0-0, policy_version 722268 (0.00083) [2022-07-10 12:24:27,138][25689] Fps is (10 sec: 5419.2, 60 sec: 5524.7, 300 sec: 5541.9). Total num frames: 739606528. Throughput: 0: 5017.7. Samples: 739602350. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:27,139][25689] Avg episode reward: [(0, '-1.545')] [2022-07-10 12:24:27,967][26022] Updated weights on worker 0-0, policy_version 722278 (0.00083) [2022-07-10 12:24:29,924][26022] Updated weights on worker 0-0, policy_version 722288 (0.00085) [2022-07-10 12:24:31,322][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:24:31,343][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000722294_739629056.pth [2022-07-10 12:24:31,343][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000720346_737634304.pth [2022-07-10 12:24:31,883][26022] Updated weights on worker 0-0, policy_version 722298 (0.00089) [2022-07-10 12:24:32,221][25689] Fps is (10 sec: 5481.0, 60 sec: 5522.1, 300 sec: 5538.2). Total num frames: 739634176. Throughput: 0: 5827.9. Samples: 739635326. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:32,221][25689] Avg episode reward: [(0, '-1.817')] [2022-07-10 12:24:33,876][26022] Updated weights on worker 0-0, policy_version 722308 (0.00083) [2022-07-10 12:24:35,608][26022] Updated weights on worker 0-0, policy_version 722318 (0.00094) [2022-07-10 12:24:37,223][25689] Fps is (10 sec: 5584.6, 60 sec: 5542.0, 300 sec: 5541.9). Total num frames: 739662848. Throughput: 0: 5807.6. Samples: 739668802. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:37,225][25689] Avg episode reward: [(0, '-0.627')] [2022-07-10 12:24:37,482][26022] Updated weights on worker 0-0, policy_version 722328 (0.00053) [2022-07-10 12:24:39,127][26022] Updated weights on worker 0-0, policy_version 722338 (0.00085) [2022-07-10 12:24:41,264][26022] Updated weights on worker 0-0, policy_version 722348 (0.00090) [2022-07-10 12:24:42,226][25689] Fps is (10 sec: 5628.9, 60 sec: 5542.6, 300 sec: 5538.5). Total num frames: 739690496. Throughput: 0: 4961.3. Samples: 739685380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:42,228][25689] Avg episode reward: [(0, '-4.083')] [2022-07-10 12:24:42,836][26022] Updated weights on worker 0-0, policy_version 722358 (0.00085) [2022-07-10 12:24:44,622][26022] Updated weights on worker 0-0, policy_version 722368 (0.00085) [2022-07-10 12:24:46,627][26022] Updated weights on worker 0-0, policy_version 722378 (0.00085) [2022-07-10 12:24:47,313][25689] Fps is (10 sec: 5480.3, 60 sec: 5518.8, 300 sec: 5535.3). Total num frames: 739718144. Throughput: 0: 5785.8. Samples: 739719078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:47,314][25689] Avg episode reward: [(0, '-4.655')] [2022-07-10 12:24:48,455][26022] Updated weights on worker 0-0, policy_version 722388 (0.00088) [2022-07-10 12:24:50,294][26022] Updated weights on worker 0-0, policy_version 722398 (0.00087) [2022-07-10 12:24:52,076][26022] Updated weights on worker 0-0, policy_version 722408 (0.00091) [2022-07-10 12:24:52,401][25689] Fps is (10 sec: 5434.4, 60 sec: 5505.4, 300 sec: 5534.0). Total num frames: 739745792. Throughput: 0: 5811.4. Samples: 739752604. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:52,403][25689] Avg episode reward: [(0, '-4.618')] [2022-07-10 12:24:53,850][26022] Updated weights on worker 0-0, policy_version 722418 (0.00098) [2022-07-10 12:24:55,902][26022] Updated weights on worker 0-0, policy_version 722428 (0.00088) [2022-07-10 12:24:57,442][25689] Fps is (10 sec: 5661.5, 60 sec: 5535.8, 300 sec: 5537.6). Total num frames: 739775488. Throughput: 0: 4972.1. Samples: 739769332. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:24:57,443][25689] Avg episode reward: [(0, '-4.779')] [2022-07-10 12:24:57,633][26022] Updated weights on worker 0-0, policy_version 722438 (0.00083) [2022-07-10 12:24:59,455][26022] Updated weights on worker 0-0, policy_version 722448 (0.00091) [2022-07-10 12:25:01,432][26022] Updated weights on worker 0-0, policy_version 722458 (0.00095) [2022-07-10 12:25:02,499][25689] Fps is (10 sec: 5476.3, 60 sec: 5531.5, 300 sec: 5537.2). Total num frames: 739801088. Throughput: 0: 5769.8. Samples: 739802348. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:25:02,499][25689] Avg episode reward: [(0, '-5.605')] [2022-07-10 12:25:03,489][26022] Updated weights on worker 0-0, policy_version 722468 (0.00089) [2022-07-10 12:25:05,448][26022] Updated weights on worker 0-0, policy_version 722478 (0.00083) [2022-07-10 12:25:07,342][26022] Updated weights on worker 0-0, policy_version 722488 (0.00089) [2022-07-10 12:25:07,506][25689] Fps is (10 sec: 5290.9, 60 sec: 5551.5, 300 sec: 5539.5). Total num frames: 739828736. Throughput: 0: 5681.7. Samples: 739833808. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:25:07,508][25689] Avg episode reward: [(0, '-5.425')] [2022-07-10 12:25:08,942][26022] Updated weights on worker 0-0, policy_version 722498 (0.00084) [2022-07-10 12:25:11,122][26022] Updated weights on worker 0-0, policy_version 722508 (0.00100) [2022-07-10 12:25:12,615][25689] Fps is (10 sec: 5567.5, 60 sec: 5533.6, 300 sec: 5537.7). Total num frames: 739857408. Throughput: 0: 4835.1. Samples: 739850336. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-10 12:25:12,615][25689] Avg episode reward: [(0, '-4.217')] [2022-07-10 12:25:12,777][26022] Updated weights on worker 0-0, policy_version 722518 (0.00089) [2022-07-10 12:25:14,702][26022] Updated weights on worker 0-0, policy_version 722528 (0.00097) [2022-07-10 12:25:16,599][26022] Updated weights on worker 0-0, policy_version 722538 (0.00084) [2022-07-10 12:25:17,617][25689] Fps is (10 sec: 5570.1, 60 sec: 5501.4, 300 sec: 5530.8). Total num frames: 739885056. Throughput: 0: 5669.5. Samples: 739883716. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:17,618][25689] Avg episode reward: [(0, '-2.960')] [2022-07-10 12:25:18,324][26022] Updated weights on worker 0-0, policy_version 722548 (0.00088) [2022-07-10 12:25:20,255][26022] Updated weights on worker 0-0, policy_version 722558 (0.00081) [2022-07-10 12:25:22,150][26022] Updated weights on worker 0-0, policy_version 722568 (0.00087) [2022-07-10 12:25:22,661][25689] Fps is (10 sec: 5605.8, 60 sec: 5532.0, 300 sec: 5537.6). Total num frames: 739913728. Throughput: 0: 5694.4. Samples: 739917162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:22,662][25689] Avg episode reward: [(0, '-3.239')] [2022-07-10 12:25:23,576][26022] Updated weights on worker 0-0, policy_version 722578 (0.00088) [2022-07-10 12:25:25,935][26022] Updated weights on worker 0-0, policy_version 722588 (0.00087) [2022-07-10 12:25:27,270][26022] Updated weights on worker 0-0, policy_version 722598 (0.00096) [2022-07-10 12:25:27,677][25689] Fps is (10 sec: 5598.5, 60 sec: 5531.1, 300 sec: 5534.7). Total num frames: 739941376. Throughput: 0: 4960.4. Samples: 739933864. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:27,678][25689] Avg episode reward: [(0, '-3.945')] [2022-07-10 12:25:29,592][26022] Updated weights on worker 0-0, policy_version 722608 (0.00086) [2022-07-10 12:25:31,340][26022] Updated weights on worker 0-0, policy_version 722618 (0.00086) [2022-07-10 12:25:32,736][25689] Fps is (10 sec: 5387.0, 60 sec: 5516.4, 300 sec: 5530.9). Total num frames: 739968000. Throughput: 0: 5779.1. Samples: 739966618. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:32,736][25689] Avg episode reward: [(0, '-2.522')] [2022-07-10 12:25:33,190][26022] Updated weights on worker 0-0, policy_version 722628 (0.00088) [2022-07-10 12:25:34,975][26022] Updated weights on worker 0-0, policy_version 722638 (0.00083) [2022-07-10 12:25:36,938][26022] Updated weights on worker 0-0, policy_version 722648 (0.00098) [2022-07-10 12:25:37,753][25689] Fps is (10 sec: 5487.7, 60 sec: 5515.0, 300 sec: 5531.9). Total num frames: 739996672. Throughput: 0: 5782.6. Samples: 740000156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:37,754][25689] Avg episode reward: [(0, '-1.282')] [2022-07-10 12:25:38,565][26022] Updated weights on worker 0-0, policy_version 722658 (0.00091) [2022-07-10 12:25:40,779][26022] Updated weights on worker 0-0, policy_version 722668 (0.00091) [2022-07-10 12:25:42,010][26022] Updated weights on worker 0-0, policy_version 722678 (0.00087) [2022-07-10 12:25:42,819][25689] Fps is (10 sec: 5585.8, 60 sec: 5509.3, 300 sec: 5527.7). Total num frames: 740024320. Throughput: 0: 5792.9. Samples: 740033932. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:42,819][25689] Avg episode reward: [(0, '-0.557')] [2022-07-10 12:25:44,191][26022] Updated weights on worker 0-0, policy_version 722688 (0.00087) [2022-07-10 12:25:45,973][26022] Updated weights on worker 0-0, policy_version 722698 (0.00092) [2022-07-10 12:25:47,783][26022] Updated weights on worker 0-0, policy_version 722709 (0.00085) [2022-07-10 12:25:47,880][25689] Fps is (10 sec: 5662.8, 60 sec: 5545.5, 300 sec: 5530.7). Total num frames: 740054016. Throughput: 0: 5785.5. Samples: 740050748. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:47,881][25689] Avg episode reward: [(0, '-1.420')] [2022-07-10 12:25:50,110][26022] Updated weights on worker 0-0, policy_version 722719 (0.00087) [2022-07-10 12:25:51,493][26022] Updated weights on worker 0-0, policy_version 722729 (0.00089) [2022-07-10 12:25:52,998][25689] Fps is (10 sec: 5432.0, 60 sec: 5508.9, 300 sec: 5525.3). Total num frames: 740079616. Throughput: 0: 5803.3. Samples: 740084206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:52,999][25689] Avg episode reward: [(0, '-0.306')] [2022-07-10 12:25:53,444][26022] Updated weights on worker 0-0, policy_version 722739 (0.00089) [2022-07-10 12:25:55,191][26022] Updated weights on worker 0-0, policy_version 722749 (0.00092) [2022-07-10 12:25:57,237][26022] Updated weights on worker 0-0, policy_version 722759 (0.00075) [2022-07-10 12:25:58,003][25689] Fps is (10 sec: 5462.2, 60 sec: 5512.2, 300 sec: 5532.5). Total num frames: 740109312. Throughput: 0: 5815.0. Samples: 740117908. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:25:58,004][25689] Avg episode reward: [(0, '-1.368')] [2022-07-10 12:25:58,910][26022] Updated weights on worker 0-0, policy_version 722769 (0.00096) [2022-07-10 12:26:01,126][26022] Updated weights on worker 0-0, policy_version 722779 (0.00093) [2022-07-10 12:26:02,833][26022] Updated weights on worker 0-0, policy_version 722789 (0.00083) [2022-07-10 12:26:03,029][25689] Fps is (10 sec: 5716.9, 60 sec: 5548.8, 300 sec: 5535.9). Total num frames: 740136960. Throughput: 0: 4981.4. Samples: 740134608. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:03,030][25689] Avg episode reward: [(0, '-2.084')] [2022-07-10 12:26:05,105][26022] Updated weights on worker 0-0, policy_version 722799 (0.00448) [2022-07-10 12:26:06,430][26022] Updated weights on worker 0-0, policy_version 722809 (0.00086) [2022-07-10 12:26:08,044][25689] Fps is (10 sec: 5405.1, 60 sec: 5531.2, 300 sec: 5530.7). Total num frames: 740163584. Throughput: 0: 5730.1. Samples: 740166290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:08,045][25689] Avg episode reward: [(0, '-2.215')] [2022-07-10 12:26:08,575][26022] Updated weights on worker 0-0, policy_version 722819 (0.00091) [2022-07-10 12:26:10,327][26022] Updated weights on worker 0-0, policy_version 722829 (0.00107) [2022-07-10 12:26:12,353][26022] Updated weights on worker 0-0, policy_version 722839 (0.00081) [2022-07-10 12:26:13,127][25689] Fps is (10 sec: 5374.7, 60 sec: 5516.6, 300 sec: 5530.1). Total num frames: 740191232. Throughput: 0: 5738.0. Samples: 740199702. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:13,127][25689] Avg episode reward: [(0, '-2.697')] [2022-07-10 12:26:13,959][26022] Updated weights on worker 0-0, policy_version 722849 (0.00088) [2022-07-10 12:26:15,776][26022] Updated weights on worker 0-0, policy_version 722859 (0.00085) [2022-07-10 12:26:17,586][26022] Updated weights on worker 0-0, policy_version 722869 (0.00088) [2022-07-10 12:26:18,158][25689] Fps is (10 sec: 5568.5, 60 sec: 5531.0, 300 sec: 5533.5). Total num frames: 740219904. Throughput: 0: 4899.3. Samples: 740216652. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:18,159][25689] Avg episode reward: [(0, '-2.628')] [2022-07-10 12:26:19,527][26022] Updated weights on worker 0-0, policy_version 722879 (0.00089) [2022-07-10 12:26:21,276][26022] Updated weights on worker 0-0, policy_version 722889 (0.00082) [2022-07-10 12:26:23,170][25689] Fps is (10 sec: 5608.0, 60 sec: 5517.0, 300 sec: 5530.1). Total num frames: 740247552. Throughput: 0: 5739.8. Samples: 740250210. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:23,172][25689] Avg episode reward: [(0, '-2.547')] [2022-07-10 12:26:23,314][26022] Updated weights on worker 0-0, policy_version 722899 (0.00088) [2022-07-10 12:26:25,079][26022] Updated weights on worker 0-0, policy_version 722909 (0.00092) [2022-07-10 12:26:26,837][26022] Updated weights on worker 0-0, policy_version 722919 (0.00086) [2022-07-10 12:26:28,194][25689] Fps is (10 sec: 5509.8, 60 sec: 5516.2, 300 sec: 5531.5). Total num frames: 740275200. Throughput: 0: 5820.8. Samples: 740283580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:28,196][25689] Avg episode reward: [(0, '-2.436')] [2022-07-10 12:26:28,728][26022] Updated weights on worker 0-0, policy_version 722929 (0.00089) [2022-07-10 12:26:30,458][26022] Updated weights on worker 0-0, policy_version 722939 (0.00085) [2022-07-10 12:26:31,419][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:26:31,434][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000722942_740292608.pth [2022-07-10 12:26:31,435][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000720996_738299904.pth [2022-07-10 12:26:32,549][26022] Updated weights on worker 0-0, policy_version 722949 (0.00092) [2022-07-10 12:26:33,273][25689] Fps is (10 sec: 5676.0, 60 sec: 5565.2, 300 sec: 5533.8). Total num frames: 740304896. Throughput: 0: 4983.9. Samples: 740300106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:33,273][25689] Avg episode reward: [(0, '-1.768')] [2022-07-10 12:26:34,507][26022] Updated weights on worker 0-0, policy_version 722959 (0.00081) [2022-07-10 12:26:36,227][26022] Updated weights on worker 0-0, policy_version 722969 (0.00091) [2022-07-10 12:26:38,198][26022] Updated weights on worker 0-0, policy_version 722979 (0.00085) [2022-07-10 12:26:38,323][25689] Fps is (10 sec: 5459.4, 60 sec: 5511.4, 300 sec: 5526.1). Total num frames: 740330496. Throughput: 0: 5777.7. Samples: 740333156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:38,323][25689] Avg episode reward: [(0, '-2.581')] [2022-07-10 12:26:39,835][26022] Updated weights on worker 0-0, policy_version 722989 (0.00091) [2022-07-10 12:26:42,020][26022] Updated weights on worker 0-0, policy_version 722999 (0.00085) [2022-07-10 12:26:43,355][25689] Fps is (10 sec: 5484.6, 60 sec: 5548.3, 300 sec: 5529.6). Total num frames: 740360192. Throughput: 0: 5752.9. Samples: 740366332. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:43,355][25689] Avg episode reward: [(0, '-1.773')] [2022-07-10 12:26:43,475][26022] Updated weights on worker 0-0, policy_version 723009 (0.00088) [2022-07-10 12:26:45,480][26022] Updated weights on worker 0-0, policy_version 723019 (0.00096) [2022-07-10 12:26:47,135][26022] Updated weights on worker 0-0, policy_version 723029 (0.00089) [2022-07-10 12:26:48,367][25689] Fps is (10 sec: 5709.3, 60 sec: 5519.0, 300 sec: 5530.5). Total num frames: 740387840. Throughput: 0: 4935.2. Samples: 740383136. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:48,368][25689] Avg episode reward: [(0, '-2.119')] [2022-07-10 12:26:49,155][26022] Updated weights on worker 0-0, policy_version 723039 (0.00080) [2022-07-10 12:26:50,971][26022] Updated weights on worker 0-0, policy_version 723049 (0.00091) [2022-07-10 12:26:52,715][26022] Updated weights on worker 0-0, policy_version 723059 (0.00088) [2022-07-10 12:26:53,407][25689] Fps is (10 sec: 5501.0, 60 sec: 5560.0, 300 sec: 5528.2). Total num frames: 740415488. Throughput: 0: 5791.5. Samples: 740416712. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:53,407][25689] Avg episode reward: [(0, '-1.834')] [2022-07-10 12:26:54,445][26022] Updated weights on worker 0-0, policy_version 723069 (0.00093) [2022-07-10 12:26:56,471][26022] Updated weights on worker 0-0, policy_version 723079 (0.00087) [2022-07-10 12:26:58,234][26022] Updated weights on worker 0-0, policy_version 723089 (0.00088) [2022-07-10 12:26:58,423][25689] Fps is (10 sec: 5498.8, 60 sec: 5525.1, 300 sec: 5522.8). Total num frames: 740443136. Throughput: 0: 5820.0. Samples: 740450138. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:26:58,423][25689] Avg episode reward: [(0, '-1.533')] [2022-07-10 12:27:00,137][26022] Updated weights on worker 0-0, policy_version 723099 (0.00087) [2022-07-10 12:27:02,618][26022] Updated weights on worker 0-0, policy_version 723109 (0.00097) [2022-07-10 12:27:03,461][25689] Fps is (10 sec: 5295.8, 60 sec: 5490.0, 300 sec: 5522.3). Total num frames: 740468736. Throughput: 0: 4996.6. Samples: 740466798. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:03,464][25689] Avg episode reward: [(0, '-1.499')] [2022-07-10 12:27:04,058][26022] Updated weights on worker 0-0, policy_version 723119 (0.00088) [2022-07-10 12:27:06,182][26022] Updated weights on worker 0-0, policy_version 723129 (0.00094) [2022-07-10 12:27:07,777][26022] Updated weights on worker 0-0, policy_version 723139 (0.00092) [2022-07-10 12:27:08,478][25689] Fps is (10 sec: 5295.4, 60 sec: 5506.8, 300 sec: 5524.0). Total num frames: 740496384. Throughput: 0: 5704.1. Samples: 740497854. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:08,479][25689] Avg episode reward: [(0, '-1.564')] [2022-07-10 12:27:09,819][26022] Updated weights on worker 0-0, policy_version 723149 (0.00094) [2022-07-10 12:27:11,734][26022] Updated weights on worker 0-0, policy_version 723159 (0.00087) [2022-07-10 12:27:13,417][26022] Updated weights on worker 0-0, policy_version 723169 (0.00090) [2022-07-10 12:27:13,536][25689] Fps is (10 sec: 5590.2, 60 sec: 5526.0, 300 sec: 5523.4). Total num frames: 740525056. Throughput: 0: 5684.2. Samples: 740531132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:13,537][25689] Avg episode reward: [(0, '-2.228')] [2022-07-10 12:27:15,327][26022] Updated weights on worker 0-0, policy_version 723179 (0.00095) [2022-07-10 12:27:17,176][26022] Updated weights on worker 0-0, policy_version 723189 (0.00086) [2022-07-10 12:27:18,569][25689] Fps is (10 sec: 5682.6, 60 sec: 5525.9, 300 sec: 5528.0). Total num frames: 740553728. Throughput: 0: 4861.4. Samples: 740548080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:18,570][25689] Avg episode reward: [(0, '-2.783')] [2022-07-10 12:27:18,681][26022] Updated weights on worker 0-0, policy_version 723199 (0.00088) [2022-07-10 12:27:20,875][26022] Updated weights on worker 0-0, policy_version 723209 (0.00086) [2022-07-10 12:27:22,618][26022] Updated weights on worker 0-0, policy_version 723219 (0.00085) [2022-07-10 12:27:23,589][25689] Fps is (10 sec: 5500.5, 60 sec: 5508.2, 300 sec: 5521.7). Total num frames: 740580352. Throughput: 0: 5716.0. Samples: 740581848. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:23,590][25689] Avg episode reward: [(0, '-4.361')] [2022-07-10 12:27:24,319][26022] Updated weights on worker 0-0, policy_version 723229 (0.00084) [2022-07-10 12:27:26,403][26022] Updated weights on worker 0-0, policy_version 723239 (0.00088) [2022-07-10 12:27:28,000][26022] Updated weights on worker 0-0, policy_version 723249 (0.00088) [2022-07-10 12:27:28,617][25689] Fps is (10 sec: 5605.3, 60 sec: 5541.7, 300 sec: 5526.1). Total num frames: 740610048. Throughput: 0: 5845.2. Samples: 740615568. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:28,618][25689] Avg episode reward: [(0, '-3.244')] [2022-07-10 12:27:29,996][26022] Updated weights on worker 0-0, policy_version 723259 (0.00084) [2022-07-10 12:27:31,603][26022] Updated weights on worker 0-0, policy_version 723269 (0.00088) [2022-07-10 12:27:33,647][26022] Updated weights on worker 0-0, policy_version 723279 (0.00091) [2022-07-10 12:27:33,655][25689] Fps is (10 sec: 5696.9, 60 sec: 5511.5, 300 sec: 5525.9). Total num frames: 740637696. Throughput: 0: 5038.6. Samples: 740632502. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:33,655][25689] Avg episode reward: [(0, '-2.927')] [2022-07-10 12:27:35,243][26022] Updated weights on worker 0-0, policy_version 723289 (0.00087) [2022-07-10 12:27:37,347][26022] Updated weights on worker 0-0, policy_version 723299 (0.00090) [2022-07-10 12:27:38,663][25689] Fps is (10 sec: 5606.1, 60 sec: 5566.3, 300 sec: 5527.1). Total num frames: 740666368. Throughput: 0: 5867.7. Samples: 740665984. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:38,665][25689] Avg episode reward: [(0, '-3.896')] [2022-07-10 12:27:38,927][26022] Updated weights on worker 0-0, policy_version 723309 (0.00086) [2022-07-10 12:27:41,100][26022] Updated weights on worker 0-0, policy_version 723319 (0.00087) [2022-07-10 12:27:42,519][26022] Updated weights on worker 0-0, policy_version 723329 (0.00088) [2022-07-10 12:27:43,679][25689] Fps is (10 sec: 5516.1, 60 sec: 5516.8, 300 sec: 5527.2). Total num frames: 740692992. Throughput: 0: 5859.4. Samples: 740699566. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:43,680][25689] Avg episode reward: [(0, '-3.650')] [2022-07-10 12:27:44,554][26022] Updated weights on worker 0-0, policy_version 723339 (0.00085) [2022-07-10 12:27:46,450][26022] Updated weights on worker 0-0, policy_version 723349 (0.00093) [2022-07-10 12:27:48,070][26022] Updated weights on worker 0-0, policy_version 723359 (0.00092) [2022-07-10 12:27:48,684][25689] Fps is (10 sec: 5620.3, 60 sec: 5551.4, 300 sec: 5528.8). Total num frames: 740722688. Throughput: 0: 5024.3. Samples: 740716390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:48,684][25689] Avg episode reward: [(0, '-2.028')] [2022-07-10 12:27:50,097][26022] Updated weights on worker 0-0, policy_version 723369 (0.00087) [2022-07-10 12:27:51,794][26022] Updated weights on worker 0-0, policy_version 723379 (0.00086) [2022-07-10 12:27:53,757][25689] Fps is (10 sec: 5690.2, 60 sec: 5548.4, 300 sec: 5528.7). Total num frames: 740750336. Throughput: 0: 5846.9. Samples: 740750038. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:53,758][25689] Avg episode reward: [(0, '-3.496')] [2022-07-10 12:27:53,763][26022] Updated weights on worker 0-0, policy_version 723389 (0.00082) [2022-07-10 12:27:55,392][26022] Updated weights on worker 0-0, policy_version 723399 (0.00090) [2022-07-10 12:27:57,455][26022] Updated weights on worker 0-0, policy_version 723409 (0.00086) [2022-07-10 12:27:58,789][25689] Fps is (10 sec: 5573.5, 60 sec: 5563.9, 300 sec: 5533.1). Total num frames: 740779008. Throughput: 0: 5851.8. Samples: 740783756. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:27:58,790][25689] Avg episode reward: [(0, '-4.067')] [2022-07-10 12:27:59,140][26022] Updated weights on worker 0-0, policy_version 723419 (0.00083) [2022-07-10 12:28:01,094][26022] Updated weights on worker 0-0, policy_version 723429 (0.00082) [2022-07-10 12:28:03,275][26022] Updated weights on worker 0-0, policy_version 723439 (0.00093) [2022-07-10 12:28:03,815][25689] Fps is (10 sec: 5396.1, 60 sec: 5565.1, 300 sec: 5526.1). Total num frames: 740804608. Throughput: 0: 5014.6. Samples: 740800536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:03,815][25689] Avg episode reward: [(0, '-4.589')] [2022-07-10 12:28:05,148][26022] Updated weights on worker 0-0, policy_version 723449 (0.00086) [2022-07-10 12:28:07,007][26022] Updated weights on worker 0-0, policy_version 723459 (0.00090) [2022-07-10 12:28:08,821][25689] Fps is (10 sec: 5205.8, 60 sec: 5549.1, 300 sec: 5527.1). Total num frames: 740831232. Throughput: 0: 5730.1. Samples: 740831776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:08,822][25689] Avg episode reward: [(0, '-3.211')] [2022-07-10 12:28:09,029][26022] Updated weights on worker 0-0, policy_version 723469 (0.00097) [2022-07-10 12:28:10,829][26022] Updated weights on worker 0-0, policy_version 723479 (0.00095) [2022-07-10 12:28:12,574][26022] Updated weights on worker 0-0, policy_version 723489 (0.00084) [2022-07-10 12:28:13,956][25689] Fps is (10 sec: 5452.5, 60 sec: 5542.0, 300 sec: 5528.7). Total num frames: 740859904. Throughput: 0: 5680.9. Samples: 740864788. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:13,957][25689] Avg episode reward: [(0, '-3.737')] [2022-07-10 12:28:14,495][26022] Updated weights on worker 0-0, policy_version 723499 (0.00051) [2022-07-10 12:28:16,154][26022] Updated weights on worker 0-0, policy_version 723509 (0.00084) [2022-07-10 12:28:17,983][26022] Updated weights on worker 0-0, policy_version 723519 (0.00094) [2022-07-10 12:28:19,011][25689] Fps is (10 sec: 5727.6, 60 sec: 5556.9, 300 sec: 5531.6). Total num frames: 740889600. Throughput: 0: 4840.1. Samples: 740881634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:19,012][25689] Avg episode reward: [(0, '-4.032')] [2022-07-10 12:28:20,085][26022] Updated weights on worker 0-0, policy_version 723529 (0.00093) [2022-07-10 12:28:21,728][26022] Updated weights on worker 0-0, policy_version 723539 (0.00090) [2022-07-10 12:28:23,643][26022] Updated weights on worker 0-0, policy_version 723549 (0.00085) [2022-07-10 12:28:24,078][25689] Fps is (10 sec: 5462.8, 60 sec: 5535.6, 300 sec: 5523.5). Total num frames: 740915200. Throughput: 0: 5648.8. Samples: 740915002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:24,079][25689] Avg episode reward: [(0, '-1.759')] [2022-07-10 12:28:25,352][26022] Updated weights on worker 0-0, policy_version 723559 (0.00049) [2022-07-10 12:28:27,338][26022] Updated weights on worker 0-0, policy_version 723569 (0.00092) [2022-07-10 12:28:29,094][25689] Fps is (10 sec: 5382.8, 60 sec: 5519.9, 300 sec: 5527.7). Total num frames: 740943872. Throughput: 0: 5741.5. Samples: 740948174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:29,094][25689] Avg episode reward: [(0, '-1.331')] [2022-07-10 12:28:29,155][26022] Updated weights on worker 0-0, policy_version 723579 (0.00086) [2022-07-10 12:28:30,889][26022] Updated weights on worker 0-0, policy_version 723589 (0.00090) [2022-07-10 12:28:31,552][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:28:31,561][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000723591_740957184.pth [2022-07-10 12:28:31,562][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000721647_738966528.pth [2022-07-10 12:28:31,562][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000723591_740957184.pth.milestone [2022-07-10 12:28:33,010][26022] Updated weights on worker 0-0, policy_version 723599 (0.00089) [2022-07-10 12:28:34,211][25689] Fps is (10 sec: 5659.5, 60 sec: 5529.6, 300 sec: 5529.6). Total num frames: 740972544. Throughput: 0: 5747.2. Samples: 740981196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 12:28:34,211][25689] Avg episode reward: [(0, '-0.578')] [2022-07-10 12:28:34,843][26022] Updated weights on worker 0-0, policy_version 723609 (0.00092) [2022-07-10 12:28:36,528][26022] Updated weights on worker 0-0, policy_version 723619 (0.00092) [2022-07-10 12:28:38,448][26022] Updated weights on worker 0-0, policy_version 723629 (0.00094) [2022-07-10 12:28:39,294][25689] Fps is (10 sec: 5421.3, 60 sec: 5489.0, 300 sec: 5524.8). Total num frames: 740999168. Throughput: 0: 5736.6. Samples: 740997986. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:28:39,297][25689] Avg episode reward: [(0, '-0.376')] [2022-07-10 12:28:40,161][26022] Updated weights on worker 0-0, policy_version 723639 (0.00096) [2022-07-10 12:28:42,384][26022] Updated weights on worker 0-0, policy_version 723649 (0.00080) [2022-07-10 12:28:43,783][26022] Updated weights on worker 0-0, policy_version 723659 (0.00092) [2022-07-10 12:28:44,346][25689] Fps is (10 sec: 5657.9, 60 sec: 5553.2, 300 sec: 5530.9). Total num frames: 741029888. Throughput: 0: 5747.8. Samples: 741031496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:28:44,347][25689] Avg episode reward: [(0, '0.044')] [2022-07-10 12:28:45,883][26022] Updated weights on worker 0-0, policy_version 723669 (0.00096) [2022-07-10 12:28:47,389][26022] Updated weights on worker 0-0, policy_version 723679 (0.00633) [2022-07-10 12:28:49,354][25689] Fps is (10 sec: 5598.1, 60 sec: 5485.4, 300 sec: 5522.8). Total num frames: 741055488. Throughput: 0: 5772.3. Samples: 741065122. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:28:49,356][25689] Avg episode reward: [(0, '-0.296')] [2022-07-10 12:28:49,534][26022] Updated weights on worker 0-0, policy_version 723689 (0.00089) [2022-07-10 12:28:51,136][26022] Updated weights on worker 0-0, policy_version 723699 (0.00087) [2022-07-10 12:28:53,193][26022] Updated weights on worker 0-0, policy_version 723709 (0.00084) [2022-07-10 12:28:54,436][25689] Fps is (10 sec: 5480.0, 60 sec: 5518.3, 300 sec: 5528.2). Total num frames: 741085184. Throughput: 0: 4960.8. Samples: 741081532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:28:54,437][25689] Avg episode reward: [(0, '-1.563')] [2022-07-10 12:28:54,859][26022] Updated weights on worker 0-0, policy_version 723719 (0.00089) [2022-07-10 12:28:56,892][26022] Updated weights on worker 0-0, policy_version 723729 (0.00096) [2022-07-10 12:28:58,731][26022] Updated weights on worker 0-0, policy_version 723739 (0.00092) [2022-07-10 12:28:59,499][25689] Fps is (10 sec: 5753.7, 60 sec: 5515.6, 300 sec: 5537.6). Total num frames: 741113856. Throughput: 0: 5807.9. Samples: 741115334. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:28:59,499][25689] Avg episode reward: [(0, '-2.000')] [2022-07-10 12:29:00,477][26022] Updated weights on worker 0-0, policy_version 723749 (0.00093) [2022-07-10 12:29:02,692][26022] Updated weights on worker 0-0, policy_version 723759 (0.00093) [2022-07-10 12:29:04,511][25689] Fps is (10 sec: 5285.6, 60 sec: 5499.9, 300 sec: 5531.2). Total num frames: 741138432. Throughput: 0: 5703.2. Samples: 741146500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:04,511][25689] Avg episode reward: [(0, '-3.512')] [2022-07-10 12:29:04,532][26022] Updated weights on worker 0-0, policy_version 723769 (0.00094) [2022-07-10 12:29:06,342][26022] Updated weights on worker 0-0, policy_version 723779 (0.00090) [2022-07-10 12:29:08,284][26022] Updated weights on worker 0-0, policy_version 723789 (0.00093) [2022-07-10 12:29:09,514][25689] Fps is (10 sec: 5214.2, 60 sec: 5517.0, 300 sec: 5526.1). Total num frames: 741166080. Throughput: 0: 4856.4. Samples: 741163028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:09,515][25689] Avg episode reward: [(0, '-3.506')] [2022-07-10 12:29:10,087][26022] Updated weights on worker 0-0, policy_version 723799 (0.00093) [2022-07-10 12:29:11,851][26022] Updated weights on worker 0-0, policy_version 723809 (0.00088) [2022-07-10 12:29:13,946][26022] Updated weights on worker 0-0, policy_version 723819 (0.00088) [2022-07-10 12:29:14,651][25689] Fps is (10 sec: 5554.2, 60 sec: 5516.9, 300 sec: 5520.5). Total num frames: 741194752. Throughput: 0: 5677.2. Samples: 741196292. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:14,651][25689] Avg episode reward: [(0, '-3.811')] [2022-07-10 12:29:15,573][26022] Updated weights on worker 0-0, policy_version 723829 (0.00089) [2022-07-10 12:29:17,417][26022] Updated weights on worker 0-0, policy_version 723839 (0.00087) [2022-07-10 12:29:19,346][26022] Updated weights on worker 0-0, policy_version 723849 (0.00081) [2022-07-10 12:29:19,670][25689] Fps is (10 sec: 5444.7, 60 sec: 5469.5, 300 sec: 5520.3). Total num frames: 741221376. Throughput: 0: 5675.2. Samples: 741229812. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:19,671][25689] Avg episode reward: [(0, '-2.771')] [2022-07-10 12:29:21,147][26022] Updated weights on worker 0-0, policy_version 723859 (0.00090) [2022-07-10 12:29:23,102][26022] Updated weights on worker 0-0, policy_version 723869 (0.00086) [2022-07-10 12:29:24,694][25689] Fps is (10 sec: 5607.8, 60 sec: 5541.0, 300 sec: 5526.9). Total num frames: 741251072. Throughput: 0: 4961.3. Samples: 741246634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:24,695][25689] Avg episode reward: [(0, '-2.369')] [2022-07-10 12:29:24,840][26022] Updated weights on worker 0-0, policy_version 723879 (0.00089) [2022-07-10 12:29:26,688][26022] Updated weights on worker 0-0, policy_version 723889 (0.00086) [2022-07-10 12:29:28,549][26022] Updated weights on worker 0-0, policy_version 723899 (0.00516) [2022-07-10 12:29:29,697][25689] Fps is (10 sec: 5821.2, 60 sec: 5542.1, 300 sec: 5531.8). Total num frames: 741279744. Throughput: 0: 5809.1. Samples: 741280272. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:29,699][25689] Avg episode reward: [(0, '-2.261')] [2022-07-10 12:29:30,163][26022] Updated weights on worker 0-0, policy_version 723909 (0.00096) [2022-07-10 12:29:32,238][26022] Updated weights on worker 0-0, policy_version 723919 (0.00083) [2022-07-10 12:29:34,104][26022] Updated weights on worker 0-0, policy_version 723929 (0.00083) [2022-07-10 12:29:34,787][25689] Fps is (10 sec: 5478.7, 60 sec: 5510.8, 300 sec: 5523.3). Total num frames: 741306368. Throughput: 0: 5825.9. Samples: 741313602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:34,788][25689] Avg episode reward: [(0, '-0.311')] [2022-07-10 12:29:35,908][26022] Updated weights on worker 0-0, policy_version 723939 (0.00090) [2022-07-10 12:29:37,991][26022] Updated weights on worker 0-0, policy_version 723949 (0.00088) [2022-07-10 12:29:39,534][26022] Updated weights on worker 0-0, policy_version 723959 (0.00087) [2022-07-10 12:29:39,826][25689] Fps is (10 sec: 5459.3, 60 sec: 5548.6, 300 sec: 5526.0). Total num frames: 741335040. Throughput: 0: 4972.0. Samples: 741330026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:39,828][25689] Avg episode reward: [(0, '-0.467')] [2022-07-10 12:29:41,562][26022] Updated weights on worker 0-0, policy_version 723969 (0.00096) [2022-07-10 12:29:43,317][26022] Updated weights on worker 0-0, policy_version 723979 (0.00085) [2022-07-10 12:29:44,831][25689] Fps is (10 sec: 5607.3, 60 sec: 5502.2, 300 sec: 5527.6). Total num frames: 741362688. Throughput: 0: 5799.8. Samples: 741363424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:44,832][25689] Avg episode reward: [(0, '0.609')] [2022-07-10 12:29:45,099][26022] Updated weights on worker 0-0, policy_version 723989 (0.00085) [2022-07-10 12:29:47,112][26022] Updated weights on worker 0-0, policy_version 723999 (0.00085) [2022-07-10 12:29:48,700][26022] Updated weights on worker 0-0, policy_version 724009 (0.00081) [2022-07-10 12:29:49,858][25689] Fps is (10 sec: 5409.9, 60 sec: 5517.4, 300 sec: 5525.2). Total num frames: 741389312. Throughput: 0: 5791.9. Samples: 741397042. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:49,860][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 12:29:50,682][26022] Updated weights on worker 0-0, policy_version 724019 (0.00114) [2022-07-10 12:29:52,408][26022] Updated weights on worker 0-0, policy_version 724029 (0.00094) [2022-07-10 12:29:54,147][26022] Updated weights on worker 0-0, policy_version 724039 (0.00086) [2022-07-10 12:29:54,935][25689] Fps is (10 sec: 5573.9, 60 sec: 5517.9, 300 sec: 5524.6). Total num frames: 741419008. Throughput: 0: 4978.6. Samples: 741413912. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:54,935][25689] Avg episode reward: [(0, '-0.966')] [2022-07-10 12:29:56,301][26022] Updated weights on worker 0-0, policy_version 724049 (0.00083) [2022-07-10 12:29:57,831][26022] Updated weights on worker 0-0, policy_version 724059 (0.00090) [2022-07-10 12:29:59,808][26022] Updated weights on worker 0-0, policy_version 724069 (0.00090) [2022-07-10 12:29:59,939][25689] Fps is (10 sec: 5790.1, 60 sec: 5523.2, 300 sec: 5535.9). Total num frames: 741447680. Throughput: 0: 5847.8. Samples: 741447640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:29:59,939][25689] Avg episode reward: [(0, '-1.396')] [2022-07-10 12:30:01,521][26022] Updated weights on worker 0-0, policy_version 724079 (0.00083) [2022-07-10 12:30:03,796][26022] Updated weights on worker 0-0, policy_version 724089 (0.00088) [2022-07-10 12:30:04,969][25689] Fps is (10 sec: 5408.9, 60 sec: 5538.5, 300 sec: 5528.6). Total num frames: 741473280. Throughput: 0: 5753.3. Samples: 741479284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:04,971][25689] Avg episode reward: [(0, '-1.707')] [2022-07-10 12:30:05,709][26022] Updated weights on worker 0-0, policy_version 724099 (0.00091) [2022-07-10 12:30:07,632][26022] Updated weights on worker 0-0, policy_version 724109 (0.00097) [2022-07-10 12:30:09,367][26022] Updated weights on worker 0-0, policy_version 724120 (0.00087) [2022-07-10 12:30:10,070][25689] Fps is (10 sec: 5357.0, 60 sec: 5546.5, 300 sec: 5528.7). Total num frames: 741501952. Throughput: 0: 4888.0. Samples: 741495838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:10,071][25689] Avg episode reward: [(0, '-1.124')] [2022-07-10 12:30:11,476][26022] Updated weights on worker 0-0, policy_version 724130 (0.00092) [2022-07-10 12:30:13,005][26022] Updated weights on worker 0-0, policy_version 724140 (0.00085) [2022-07-10 12:30:15,127][26022] Updated weights on worker 0-0, policy_version 724150 (0.00091) [2022-07-10 12:30:15,203][25689] Fps is (10 sec: 5603.2, 60 sec: 5546.8, 300 sec: 5529.7). Total num frames: 741530624. Throughput: 0: 5680.0. Samples: 741529034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:15,203][25689] Avg episode reward: [(0, '-2.059')] [2022-07-10 12:30:16,802][26022] Updated weights on worker 0-0, policy_version 724160 (0.00081) [2022-07-10 12:30:18,751][26022] Updated weights on worker 0-0, policy_version 724170 (0.00090) [2022-07-10 12:30:20,289][25689] Fps is (10 sec: 5511.3, 60 sec: 5557.7, 300 sec: 5525.5). Total num frames: 741558272. Throughput: 0: 5656.5. Samples: 741562750. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:20,289][25689] Avg episode reward: [(0, '-2.807')] [2022-07-10 12:30:20,528][26022] Updated weights on worker 0-0, policy_version 724180 (0.00619) [2022-07-10 12:30:22,512][26022] Updated weights on worker 0-0, policy_version 724190 (0.00095) [2022-07-10 12:30:24,072][26022] Updated weights on worker 0-0, policy_version 724200 (0.00084) [2022-07-10 12:30:25,291][25689] Fps is (10 sec: 5582.5, 60 sec: 5542.7, 300 sec: 5529.2). Total num frames: 741586944. Throughput: 0: 5749.0. Samples: 741596120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:25,292][25689] Avg episode reward: [(0, '-3.345')] [2022-07-10 12:30:26,354][26022] Updated weights on worker 0-0, policy_version 724210 (0.00088) [2022-07-10 12:30:27,505][26022] Updated weights on worker 0-0, policy_version 724220 (0.00085) [2022-07-10 12:30:29,885][26022] Updated weights on worker 0-0, policy_version 724230 (0.00084) [2022-07-10 12:30:30,319][25689] Fps is (10 sec: 5717.3, 60 sec: 5540.5, 300 sec: 5536.7). Total num frames: 741615616. Throughput: 0: 5779.5. Samples: 741612866. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:30,319][25689] Avg episode reward: [(0, '-3.737')] [2022-07-10 12:30:31,566][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:30:31,583][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000724240_741621760.pth [2022-07-10 12:30:31,584][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000722294_739629056.pth [2022-07-10 12:30:31,587][26022] Updated weights on worker 0-0, policy_version 724240 (0.00085) [2022-07-10 12:30:33,331][26022] Updated weights on worker 0-0, policy_version 724250 (0.00097) [2022-07-10 12:30:35,388][25689] Fps is (10 sec: 5375.2, 60 sec: 5525.4, 300 sec: 5525.4). Total num frames: 741641216. Throughput: 0: 5815.8. Samples: 741646428. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:35,389][25689] Avg episode reward: [(0, '-3.831')] [2022-07-10 12:30:35,432][26022] Updated weights on worker 0-0, policy_version 724260 (0.00086) [2022-07-10 12:30:37,022][26022] Updated weights on worker 0-0, policy_version 724270 (0.00087) [2022-07-10 12:30:38,932][26022] Updated weights on worker 0-0, policy_version 724280 (0.00102) [2022-07-10 12:30:40,419][25689] Fps is (10 sec: 5474.8, 60 sec: 5543.1, 300 sec: 5532.9). Total num frames: 741670912. Throughput: 0: 5804.5. Samples: 741679594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:40,419][25689] Avg episode reward: [(0, '-5.152')] [2022-07-10 12:30:40,917][26022] Updated weights on worker 0-0, policy_version 724290 (0.00091) [2022-07-10 12:30:42,571][26022] Updated weights on worker 0-0, policy_version 724300 (0.00095) [2022-07-10 12:30:44,609][26022] Updated weights on worker 0-0, policy_version 724310 (0.00084) [2022-07-10 12:30:45,421][25689] Fps is (10 sec: 5817.7, 60 sec: 5560.2, 300 sec: 5530.6). Total num frames: 741699584. Throughput: 0: 4983.3. Samples: 741696432. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:45,422][25689] Avg episode reward: [(0, '-5.424')] [2022-07-10 12:30:46,251][26022] Updated weights on worker 0-0, policy_version 724320 (0.00089) [2022-07-10 12:30:48,039][26022] Updated weights on worker 0-0, policy_version 724330 (0.00088) [2022-07-10 12:30:50,076][26022] Updated weights on worker 0-0, policy_version 724340 (0.00085) [2022-07-10 12:30:50,508][25689] Fps is (10 sec: 5480.7, 60 sec: 5554.8, 300 sec: 5534.6). Total num frames: 741726208. Throughput: 0: 5794.4. Samples: 741729852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:50,508][25689] Avg episode reward: [(0, '-5.591')] [2022-07-10 12:30:51,930][26022] Updated weights on worker 0-0, policy_version 724350 (0.00091) [2022-07-10 12:30:53,666][26022] Updated weights on worker 0-0, policy_version 724360 (0.00085) [2022-07-10 12:30:55,567][25689] Fps is (10 sec: 5449.8, 60 sec: 5539.5, 300 sec: 5530.1). Total num frames: 741754880. Throughput: 0: 5800.2. Samples: 741763472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:30:55,568][25689] Avg episode reward: [(0, '-4.611')] [2022-07-10 12:30:55,573][26022] Updated weights on worker 0-0, policy_version 724370 (0.00083) [2022-07-10 12:30:57,247][26022] Updated weights on worker 0-0, policy_version 724380 (0.00094) [2022-07-10 12:30:59,241][26022] Updated weights on worker 0-0, policy_version 724390 (0.00110) [2022-07-10 12:31:00,623][25689] Fps is (10 sec: 5669.0, 60 sec: 5534.7, 300 sec: 5533.0). Total num frames: 741783552. Throughput: 0: 4978.6. Samples: 741780186. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:00,625][25689] Avg episode reward: [(0, '-3.872')] [2022-07-10 12:31:00,970][26022] Updated weights on worker 0-0, policy_version 724400 (0.00094) [2022-07-10 12:31:03,215][26022] Updated weights on worker 0-0, policy_version 724410 (0.00086) [2022-07-10 12:31:04,987][26022] Updated weights on worker 0-0, policy_version 724420 (0.00083) [2022-07-10 12:31:05,704][25689] Fps is (10 sec: 5354.2, 60 sec: 5530.1, 300 sec: 5528.4). Total num frames: 741809152. Throughput: 0: 5679.2. Samples: 741811622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:05,706][25689] Avg episode reward: [(0, '-4.812')] [2022-07-10 12:31:06,769][26022] Updated weights on worker 0-0, policy_version 724430 (0.00086) [2022-07-10 12:31:08,678][26022] Updated weights on worker 0-0, policy_version 724440 (0.00094) [2022-07-10 12:31:10,639][26022] Updated weights on worker 0-0, policy_version 724450 (0.00092) [2022-07-10 12:31:10,738][25689] Fps is (10 sec: 5264.3, 60 sec: 5519.3, 300 sec: 5529.2). Total num frames: 741836800. Throughput: 0: 5690.7. Samples: 741844978. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:10,739][25689] Avg episode reward: [(0, '-4.326')] [2022-07-10 12:31:12,541][26022] Updated weights on worker 0-0, policy_version 724460 (0.00086) [2022-07-10 12:31:14,204][26022] Updated weights on worker 0-0, policy_version 724470 (0.00093) [2022-07-10 12:31:15,835][25689] Fps is (10 sec: 5559.2, 60 sec: 5522.6, 300 sec: 5528.0). Total num frames: 741865472. Throughput: 0: 4850.4. Samples: 741861776. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:15,839][25689] Avg episode reward: [(0, '-3.278')] [2022-07-10 12:31:16,031][26022] Updated weights on worker 0-0, policy_version 724480 (0.00087) [2022-07-10 12:31:17,795][26022] Updated weights on worker 0-0, policy_version 724490 (0.00090) [2022-07-10 12:31:19,786][26022] Updated weights on worker 0-0, policy_version 724500 (0.00094) [2022-07-10 12:31:20,846][25689] Fps is (10 sec: 5673.5, 60 sec: 5546.4, 300 sec: 5531.5). Total num frames: 741894144. Throughput: 0: 5695.0. Samples: 741895352. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:20,847][25689] Avg episode reward: [(0, '-2.556')] [2022-07-10 12:31:21,611][26022] Updated weights on worker 0-0, policy_version 724510 (0.00089) [2022-07-10 12:31:23,472][26022] Updated weights on worker 0-0, policy_version 724520 (0.00092) [2022-07-10 12:31:25,231][26022] Updated weights on worker 0-0, policy_version 724530 (0.00084) [2022-07-10 12:31:25,858][25689] Fps is (10 sec: 5618.7, 60 sec: 5528.6, 300 sec: 5531.7). Total num frames: 741921792. Throughput: 0: 5813.5. Samples: 741928790. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:25,860][25689] Avg episode reward: [(0, '-2.086')] [2022-07-10 12:31:27,282][26022] Updated weights on worker 0-0, policy_version 724540 (0.00094) [2022-07-10 12:31:28,961][26022] Updated weights on worker 0-0, policy_version 724550 (0.00083) [2022-07-10 12:31:30,865][25689] Fps is (10 sec: 5518.9, 60 sec: 5513.5, 300 sec: 5526.2). Total num frames: 741949440. Throughput: 0: 4983.2. Samples: 741945274. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:30,867][25689] Avg episode reward: [(0, '-2.000')] [2022-07-10 12:31:30,868][26022] Updated weights on worker 0-0, policy_version 724560 (0.00092) [2022-07-10 12:31:32,700][26022] Updated weights on worker 0-0, policy_version 724570 (0.00084) [2022-07-10 12:31:34,349][26022] Updated weights on worker 0-0, policy_version 724580 (0.00084) [2022-07-10 12:31:35,925][25689] Fps is (10 sec: 5391.2, 60 sec: 5531.3, 300 sec: 5529.4). Total num frames: 741976064. Throughput: 0: 5833.2. Samples: 741978968. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:35,926][25689] Avg episode reward: [(0, '-0.882')] [2022-07-10 12:31:36,423][26022] Updated weights on worker 0-0, policy_version 724590 (0.00087) [2022-07-10 12:31:38,059][26022] Updated weights on worker 0-0, policy_version 724600 (0.00089) [2022-07-10 12:31:40,052][26022] Updated weights on worker 0-0, policy_version 724610 (0.00086) [2022-07-10 12:31:40,929][25689] Fps is (10 sec: 5799.7, 60 sec: 5567.6, 300 sec: 5536.8). Total num frames: 742007808. Throughput: 0: 5822.4. Samples: 742012286. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:40,930][25689] Avg episode reward: [(0, '-0.813')] [2022-07-10 12:31:41,981][26022] Updated weights on worker 0-0, policy_version 724620 (0.00090) [2022-07-10 12:31:43,477][26022] Updated weights on worker 0-0, policy_version 724630 (0.00053) [2022-07-10 12:31:45,661][26022] Updated weights on worker 0-0, policy_version 724640 (0.00092) [2022-07-10 12:31:45,971][25689] Fps is (10 sec: 5708.4, 60 sec: 5513.2, 300 sec: 5529.4). Total num frames: 742033408. Throughput: 0: 4994.2. Samples: 742029234. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:45,971][25689] Avg episode reward: [(0, '-0.959')] [2022-07-10 12:31:46,981][26022] Updated weights on worker 0-0, policy_version 724650 (0.00096) [2022-07-10 12:31:49,151][26022] Updated weights on worker 0-0, policy_version 724660 (0.00907) [2022-07-10 12:31:50,829][26022] Updated weights on worker 0-0, policy_version 724670 (0.00087) [2022-07-10 12:31:50,988][25689] Fps is (10 sec: 5395.2, 60 sec: 5553.4, 300 sec: 5533.2). Total num frames: 742062080. Throughput: 0: 5859.5. Samples: 742063188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:50,989][25689] Avg episode reward: [(0, '-1.145')] [2022-07-10 12:31:52,746][26022] Updated weights on worker 0-0, policy_version 724680 (0.00106) [2022-07-10 12:31:54,592][26022] Updated weights on worker 0-0, policy_version 724690 (0.00084) [2022-07-10 12:31:56,031][25689] Fps is (10 sec: 5598.4, 60 sec: 5538.0, 300 sec: 5532.8). Total num frames: 742089728. Throughput: 0: 5839.4. Samples: 742096372. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 12:31:56,031][25689] Avg episode reward: [(0, '-2.934')] [2022-07-10 12:31:56,496][26022] Updated weights on worker 0-0, policy_version 724700 (0.00085) [2022-07-10 12:31:58,223][26022] Updated weights on worker 0-0, policy_version 724710 (0.00083) [2022-07-10 12:32:00,376][26022] Updated weights on worker 0-0, policy_version 724720 (0.00089) [2022-07-10 12:32:01,051][25689] Fps is (10 sec: 5495.1, 60 sec: 5524.3, 300 sec: 5540.0). Total num frames: 742117376. Throughput: 0: 5009.6. Samples: 742113090. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:01,052][25689] Avg episode reward: [(0, '-2.763')] [2022-07-10 12:32:02,105][26022] Updated weights on worker 0-0, policy_version 724730 (0.00070) [2022-07-10 12:32:04,235][26022] Updated weights on worker 0-0, policy_version 724740 (0.00101) [2022-07-10 12:32:06,056][25689] Fps is (10 sec: 5311.3, 60 sec: 5531.3, 300 sec: 5533.3). Total num frames: 742142976. Throughput: 0: 5738.6. Samples: 742144496. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:06,057][25689] Avg episode reward: [(0, '-3.647')] [2022-07-10 12:32:06,121][26022] Updated weights on worker 0-0, policy_version 724750 (0.00082) [2022-07-10 12:32:07,949][26022] Updated weights on worker 0-0, policy_version 724760 (0.00087) [2022-07-10 12:32:09,854][26022] Updated weights on worker 0-0, policy_version 724770 (0.00092) [2022-07-10 12:32:11,084][25689] Fps is (10 sec: 5409.3, 60 sec: 5548.8, 300 sec: 5533.9). Total num frames: 742171648. Throughput: 0: 5702.4. Samples: 742177782. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:11,085][25689] Avg episode reward: [(0, '-3.885')] [2022-07-10 12:32:11,511][26022] Updated weights on worker 0-0, policy_version 724780 (0.00088) [2022-07-10 12:32:13,383][26022] Updated weights on worker 0-0, policy_version 724790 (0.00086) [2022-07-10 12:32:15,351][26022] Updated weights on worker 0-0, policy_version 724800 (0.00084) [2022-07-10 12:32:16,177][25689] Fps is (10 sec: 5666.1, 60 sec: 5549.2, 300 sec: 5532.8). Total num frames: 742200320. Throughput: 0: 4879.5. Samples: 742194674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:16,177][25689] Avg episode reward: [(0, '-3.756')] [2022-07-10 12:32:16,955][26022] Updated weights on worker 0-0, policy_version 724810 (0.00090) [2022-07-10 12:32:18,868][26022] Updated weights on worker 0-0, policy_version 724820 (0.00093) [2022-07-10 12:32:20,483][26022] Updated weights on worker 0-0, policy_version 724830 (0.00084) [2022-07-10 12:32:21,187][25689] Fps is (10 sec: 5473.5, 60 sec: 5515.3, 300 sec: 5532.9). Total num frames: 742226944. Throughput: 0: 5726.7. Samples: 742228400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:21,188][25689] Avg episode reward: [(0, '-2.617')] [2022-07-10 12:32:22,615][26022] Updated weights on worker 0-0, policy_version 724840 (0.00092) [2022-07-10 12:32:24,507][26022] Updated weights on worker 0-0, policy_version 724850 (0.00095) [2022-07-10 12:32:26,199][25689] Fps is (10 sec: 5517.1, 60 sec: 5532.3, 300 sec: 5529.8). Total num frames: 742255616. Throughput: 0: 5829.7. Samples: 742261924. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:26,201][25689] Avg episode reward: [(0, '-1.414')] [2022-07-10 12:32:26,282][26022] Updated weights on worker 0-0, policy_version 724860 (0.00085) [2022-07-10 12:32:27,922][26022] Updated weights on worker 0-0, policy_version 724870 (0.00087) [2022-07-10 12:32:30,035][26022] Updated weights on worker 0-0, policy_version 724880 (0.00085) [2022-07-10 12:32:31,203][25689] Fps is (10 sec: 5725.3, 60 sec: 5549.6, 300 sec: 5533.9). Total num frames: 742284288. Throughput: 0: 5022.6. Samples: 742278828. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:31,204][25689] Avg episode reward: [(0, '-1.693')] [2022-07-10 12:32:31,650][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:32:31,677][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000724890_742287360.pth [2022-07-10 12:32:31,678][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000722942_740292608.pth [2022-07-10 12:32:31,682][26022] Updated weights on worker 0-0, policy_version 724890 (0.00093) [2022-07-10 12:32:33,573][26022] Updated weights on worker 0-0, policy_version 724900 (0.00083) [2022-07-10 12:32:35,357][26022] Updated weights on worker 0-0, policy_version 724910 (0.00081) [2022-07-10 12:32:36,311][25689] Fps is (10 sec: 5569.9, 60 sec: 5562.1, 300 sec: 5528.6). Total num frames: 742311936. Throughput: 0: 5849.1. Samples: 742312440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:36,311][25689] Avg episode reward: [(0, '-0.849')] [2022-07-10 12:32:37,230][26022] Updated weights on worker 0-0, policy_version 724920 (0.00088) [2022-07-10 12:32:38,914][26022] Updated weights on worker 0-0, policy_version 724930 (0.00093) [2022-07-10 12:32:40,890][26022] Updated weights on worker 0-0, policy_version 724940 (0.00097) [2022-07-10 12:32:41,339][25689] Fps is (10 sec: 5657.5, 60 sec: 5526.0, 300 sec: 5538.7). Total num frames: 742341632. Throughput: 0: 5836.6. Samples: 742346018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:41,339][25689] Avg episode reward: [(0, '-1.095')] [2022-07-10 12:32:42,647][26022] Updated weights on worker 0-0, policy_version 724950 (0.00083) [2022-07-10 12:32:44,487][26022] Updated weights on worker 0-0, policy_version 724960 (0.00091) [2022-07-10 12:32:46,294][26022] Updated weights on worker 0-0, policy_version 724970 (0.00090) [2022-07-10 12:32:46,344][25689] Fps is (10 sec: 5715.5, 60 sec: 5563.3, 300 sec: 5531.8). Total num frames: 742369280. Throughput: 0: 5016.3. Samples: 742362974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:46,344][25689] Avg episode reward: [(0, '-2.374')] [2022-07-10 12:32:48,247][26022] Updated weights on worker 0-0, policy_version 724980 (0.00090) [2022-07-10 12:32:49,969][26022] Updated weights on worker 0-0, policy_version 724990 (0.00080) [2022-07-10 12:32:51,346][25689] Fps is (10 sec: 5627.9, 60 sec: 5564.7, 300 sec: 5536.6). Total num frames: 742397952. Throughput: 0: 5857.4. Samples: 742396814. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:51,346][25689] Avg episode reward: [(0, '-2.740')] [2022-07-10 12:32:51,687][26022] Updated weights on worker 0-0, policy_version 725000 (0.00094) [2022-07-10 12:32:53,614][26022] Updated weights on worker 0-0, policy_version 725010 (0.00625) [2022-07-10 12:32:55,423][26022] Updated weights on worker 0-0, policy_version 725020 (0.00085) [2022-07-10 12:32:56,395][25689] Fps is (10 sec: 5603.5, 60 sec: 5564.1, 300 sec: 5532.8). Total num frames: 742425600. Throughput: 0: 5882.0. Samples: 742430574. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:32:56,395][25689] Avg episode reward: [(0, '-4.017')] [2022-07-10 12:32:57,201][26022] Updated weights on worker 0-0, policy_version 725030 (0.00081) [2022-07-10 12:32:58,999][26022] Updated weights on worker 0-0, policy_version 725040 (0.00081) [2022-07-10 12:33:00,740][26022] Updated weights on worker 0-0, policy_version 725050 (0.00092) [2022-07-10 12:33:01,401][25689] Fps is (10 sec: 5499.5, 60 sec: 5565.4, 300 sec: 5540.0). Total num frames: 742453248. Throughput: 0: 5063.1. Samples: 742447594. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:01,407][25689] Avg episode reward: [(0, '-3.224')] [2022-07-10 12:33:03,071][26022] Updated weights on worker 0-0, policy_version 725060 (0.00082) [2022-07-10 12:33:04,888][26022] Updated weights on worker 0-0, policy_version 725070 (0.00088) [2022-07-10 12:33:06,410][25689] Fps is (10 sec: 5316.7, 60 sec: 5565.1, 300 sec: 5536.5). Total num frames: 742478848. Throughput: 0: 5790.2. Samples: 742479162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:06,410][25689] Avg episode reward: [(0, '-2.863')] [2022-07-10 12:33:06,724][26022] Updated weights on worker 0-0, policy_version 725080 (0.00094) [2022-07-10 12:33:08,764][26022] Updated weights on worker 0-0, policy_version 725090 (0.00091) [2022-07-10 12:33:10,243][26022] Updated weights on worker 0-0, policy_version 725100 (0.00090) [2022-07-10 12:33:11,447][25689] Fps is (10 sec: 5504.2, 60 sec: 5581.2, 300 sec: 5541.8). Total num frames: 742508544. Throughput: 0: 5763.2. Samples: 742512660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:11,447][25689] Avg episode reward: [(0, '-3.260')] [2022-07-10 12:33:12,417][26022] Updated weights on worker 0-0, policy_version 725110 (0.00058) [2022-07-10 12:33:14,059][26022] Updated weights on worker 0-0, policy_version 725120 (0.00097) [2022-07-10 12:33:16,139][26022] Updated weights on worker 0-0, policy_version 725130 (0.00090) [2022-07-10 12:33:16,491][25689] Fps is (10 sec: 5586.7, 60 sec: 5551.7, 300 sec: 5531.7). Total num frames: 742535168. Throughput: 0: 4899.8. Samples: 742529042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:16,491][25689] Avg episode reward: [(0, '-1.927')] [2022-07-10 12:33:17,741][26022] Updated weights on worker 0-0, policy_version 725140 (0.00084) [2022-07-10 12:33:19,762][26022] Updated weights on worker 0-0, policy_version 725150 (0.00086) [2022-07-10 12:33:21,508][25689] Fps is (10 sec: 5394.0, 60 sec: 5568.0, 300 sec: 5539.5). Total num frames: 742562816. Throughput: 0: 5708.3. Samples: 742562374. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:21,509][25689] Avg episode reward: [(0, '-0.452')] [2022-07-10 12:33:21,592][26022] Updated weights on worker 0-0, policy_version 725160 (0.00094) [2022-07-10 12:33:23,289][26022] Updated weights on worker 0-0, policy_version 725170 (0.00093) [2022-07-10 12:33:25,096][26022] Updated weights on worker 0-0, policy_version 725180 (0.00094) [2022-07-10 12:33:26,509][25689] Fps is (10 sec: 5519.4, 60 sec: 5552.1, 300 sec: 5536.3). Total num frames: 742590464. Throughput: 0: 5798.9. Samples: 742595718. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:26,510][25689] Avg episode reward: [(0, '-1.076')] [2022-07-10 12:33:27,259][26022] Updated weights on worker 0-0, policy_version 725190 (0.00082) [2022-07-10 12:33:28,745][26022] Updated weights on worker 0-0, policy_version 725200 (0.00093) [2022-07-10 12:33:30,779][26022] Updated weights on worker 0-0, policy_version 725210 (0.00087) [2022-07-10 12:33:31,514][25689] Fps is (10 sec: 5526.2, 60 sec: 5535.0, 300 sec: 5535.0). Total num frames: 742618112. Throughput: 0: 4977.0. Samples: 742612536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:31,515][25689] Avg episode reward: [(0, '-1.288')] [2022-07-10 12:33:32,383][26022] Updated weights on worker 0-0, policy_version 725220 (0.00088) [2022-07-10 12:33:34,587][26022] Updated weights on worker 0-0, policy_version 725230 (0.00086) [2022-07-10 12:33:36,186][26022] Updated weights on worker 0-0, policy_version 725240 (0.00088) [2022-07-10 12:33:36,592][25689] Fps is (10 sec: 5687.5, 60 sec: 5571.8, 300 sec: 5545.4). Total num frames: 742647808. Throughput: 0: 5826.6. Samples: 742646162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:36,592][25689] Avg episode reward: [(0, '-1.665')] [2022-07-10 12:33:38,114][26022] Updated weights on worker 0-0, policy_version 725250 (0.00089) [2022-07-10 12:33:39,667][26022] Updated weights on worker 0-0, policy_version 725260 (0.00087) [2022-07-10 12:33:41,621][25689] Fps is (10 sec: 5572.5, 60 sec: 5520.7, 300 sec: 5532.1). Total num frames: 742674432. Throughput: 0: 5844.5. Samples: 742679924. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:41,622][25689] Avg episode reward: [(0, '-1.274')] [2022-07-10 12:33:41,886][26022] Updated weights on worker 0-0, policy_version 725270 (0.00087) [2022-07-10 12:33:43,463][26022] Updated weights on worker 0-0, policy_version 725280 (0.00086) [2022-07-10 12:33:45,442][26022] Updated weights on worker 0-0, policy_version 725290 (0.00088) [2022-07-10 12:33:46,639][25689] Fps is (10 sec: 5605.4, 60 sec: 5553.4, 300 sec: 5545.7). Total num frames: 742704128. Throughput: 0: 5003.3. Samples: 742696434. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:46,640][25689] Avg episode reward: [(0, '-1.996')] [2022-07-10 12:33:47,194][26022] Updated weights on worker 0-0, policy_version 725300 (0.00088) [2022-07-10 12:33:49,038][26022] Updated weights on worker 0-0, policy_version 725310 (0.00087) [2022-07-10 12:33:50,935][26022] Updated weights on worker 0-0, policy_version 725320 (0.00086) [2022-07-10 12:33:51,683][25689] Fps is (10 sec: 5699.3, 60 sec: 5532.7, 300 sec: 5539.5). Total num frames: 742731776. Throughput: 0: 5824.5. Samples: 742730006. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:51,684][25689] Avg episode reward: [(0, '-1.644')] [2022-07-10 12:33:52,667][26022] Updated weights on worker 0-0, policy_version 725330 (0.00089) [2022-07-10 12:33:54,492][26022] Updated weights on worker 0-0, policy_version 725340 (0.00085) [2022-07-10 12:33:56,694][26022] Updated weights on worker 0-0, policy_version 725350 (0.00092) [2022-07-10 12:33:56,770][25689] Fps is (10 sec: 5357.1, 60 sec: 5512.2, 300 sec: 5532.1). Total num frames: 742758400. Throughput: 0: 5807.6. Samples: 742763350. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:33:56,771][25689] Avg episode reward: [(0, '-1.195')] [2022-07-10 12:33:58,172][26022] Updated weights on worker 0-0, policy_version 725360 (0.00094) [2022-07-10 12:34:00,304][26022] Updated weights on worker 0-0, policy_version 725370 (0.00089) [2022-07-10 12:34:01,728][26022] Updated weights on worker 0-0, policy_version 725380 (0.00095) [2022-07-10 12:34:01,792][25689] Fps is (10 sec: 5672.5, 60 sec: 5561.6, 300 sec: 5552.6). Total num frames: 742789120. Throughput: 0: 5786.3. Samples: 742796638. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:01,792][25689] Avg episode reward: [(0, '-1.171')] [2022-07-10 12:34:04,313][26022] Updated weights on worker 0-0, policy_version 725390 (0.00087) [2022-07-10 12:34:06,028][26022] Updated weights on worker 0-0, policy_version 725400 (0.00096) [2022-07-10 12:34:06,821][25689] Fps is (10 sec: 5501.7, 60 sec: 5542.8, 300 sec: 5541.8). Total num frames: 742813696. Throughput: 0: 5695.8. Samples: 742811384. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:06,821][25689] Avg episode reward: [(0, '-0.909')] [2022-07-10 12:34:07,898][26022] Updated weights on worker 0-0, policy_version 725410 (0.00088) [2022-07-10 12:34:09,662][26022] Updated weights on worker 0-0, policy_version 725420 (0.00138) [2022-07-10 12:34:11,607][26022] Updated weights on worker 0-0, policy_version 725430 (0.00092) [2022-07-10 12:34:11,844][25689] Fps is (10 sec: 5195.5, 60 sec: 5510.2, 300 sec: 5540.5). Total num frames: 742841344. Throughput: 0: 5701.1. Samples: 742844946. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:11,844][25689] Avg episode reward: [(0, '0.032')] [2022-07-10 12:34:13,346][26022] Updated weights on worker 0-0, policy_version 725440 (0.00087) [2022-07-10 12:34:15,430][26022] Updated weights on worker 0-0, policy_version 725450 (0.00090) [2022-07-10 12:34:16,960][25689] Fps is (10 sec: 5655.5, 60 sec: 5554.4, 300 sec: 5549.0). Total num frames: 742871040. Throughput: 0: 5692.6. Samples: 742878284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:16,961][25689] Avg episode reward: [(0, '-0.098')] [2022-07-10 12:34:16,962][26022] Updated weights on worker 0-0, policy_version 725460 (0.00096) [2022-07-10 12:34:18,914][26022] Updated weights on worker 0-0, policy_version 725470 (0.00053) [2022-07-10 12:34:20,516][26022] Updated weights on worker 0-0, policy_version 725480 (0.00088) [2022-07-10 12:34:22,007][25689] Fps is (10 sec: 5541.7, 60 sec: 5534.8, 300 sec: 5538.2). Total num frames: 742897664. Throughput: 0: 4875.9. Samples: 742895202. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:22,007][25689] Avg episode reward: [(0, '-0.952')] [2022-07-10 12:34:22,718][26022] Updated weights on worker 0-0, policy_version 725490 (0.00091) [2022-07-10 12:34:24,361][26022] Updated weights on worker 0-0, policy_version 725500 (0.00092) [2022-07-10 12:34:26,301][26022] Updated weights on worker 0-0, policy_version 725510 (0.00086) [2022-07-10 12:34:27,008][25689] Fps is (10 sec: 5503.2, 60 sec: 5551.7, 300 sec: 5538.3). Total num frames: 742926336. Throughput: 0: 5801.2. Samples: 742928496. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:27,009][25689] Avg episode reward: [(0, '-1.139')] [2022-07-10 12:34:27,949][26022] Updated weights on worker 0-0, policy_version 725520 (0.00088) [2022-07-10 12:34:29,959][26022] Updated weights on worker 0-0, policy_version 725530 (0.00085) [2022-07-10 12:34:31,853][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:34:31,862][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000725540_742952960.pth [2022-07-10 12:34:31,863][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000723591_740957184.pth [2022-07-10 12:34:31,868][26022] Updated weights on worker 0-0, policy_version 725540 (0.00089) [2022-07-10 12:34:32,026][25689] Fps is (10 sec: 5518.7, 60 sec: 5533.6, 300 sec: 5539.6). Total num frames: 742952960. Throughput: 0: 5796.6. Samples: 742961936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:32,027][25689] Avg episode reward: [(0, '-2.104')] [2022-07-10 12:34:33,627][26022] Updated weights on worker 0-0, policy_version 725550 (0.00084) [2022-07-10 12:34:35,442][26022] Updated weights on worker 0-0, policy_version 725560 (0.00087) [2022-07-10 12:34:37,133][25689] Fps is (10 sec: 5562.8, 60 sec: 5530.9, 300 sec: 5541.8). Total num frames: 742982656. Throughput: 0: 4975.5. Samples: 742978652. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:37,135][25689] Avg episode reward: [(0, '-2.007')] [2022-07-10 12:34:37,168][26022] Updated weights on worker 0-0, policy_version 725570 (0.00093) [2022-07-10 12:34:39,192][26022] Updated weights on worker 0-0, policy_version 725580 (0.00091) [2022-07-10 12:34:40,995][26022] Updated weights on worker 0-0, policy_version 725590 (0.00089) [2022-07-10 12:34:42,151][25689] Fps is (10 sec: 5562.9, 60 sec: 5532.0, 300 sec: 5538.1). Total num frames: 743009280. Throughput: 0: 5811.2. Samples: 743012262. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:42,152][25689] Avg episode reward: [(0, '-4.317')] [2022-07-10 12:34:42,863][26022] Updated weights on worker 0-0, policy_version 725600 (0.00089) [2022-07-10 12:34:44,706][26022] Updated weights on worker 0-0, policy_version 725610 (0.00086) [2022-07-10 12:34:46,483][26022] Updated weights on worker 0-0, policy_version 725620 (0.00088) [2022-07-10 12:34:47,185][25689] Fps is (10 sec: 5500.7, 60 sec: 5513.6, 300 sec: 5544.9). Total num frames: 743037952. Throughput: 0: 5785.6. Samples: 743045232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:47,186][25689] Avg episode reward: [(0, '-3.512')] [2022-07-10 12:34:48,408][26022] Updated weights on worker 0-0, policy_version 725630 (0.00091) [2022-07-10 12:34:50,084][26022] Updated weights on worker 0-0, policy_version 725640 (0.00089) [2022-07-10 12:34:51,948][26022] Updated weights on worker 0-0, policy_version 725650 (0.00084) [2022-07-10 12:34:52,199][25689] Fps is (10 sec: 5604.9, 60 sec: 5516.3, 300 sec: 5539.2). Total num frames: 743065600. Throughput: 0: 4961.0. Samples: 743062012. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:52,201][25689] Avg episode reward: [(0, '-3.456')] [2022-07-10 12:34:53,643][26022] Updated weights on worker 0-0, policy_version 725660 (0.00081) [2022-07-10 12:34:55,759][26022] Updated weights on worker 0-0, policy_version 725670 (0.00085) [2022-07-10 12:34:57,297][25689] Fps is (10 sec: 5569.7, 60 sec: 5549.1, 300 sec: 5537.4). Total num frames: 743094272. Throughput: 0: 5819.5. Samples: 743096000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:34:57,299][25689] Avg episode reward: [(0, '-3.167')] [2022-07-10 12:34:57,509][26022] Updated weights on worker 0-0, policy_version 725680 (0.00098) [2022-07-10 12:34:59,419][26022] Updated weights on worker 0-0, policy_version 725690 (0.00092) [2022-07-10 12:35:01,007][26022] Updated weights on worker 0-0, policy_version 725700 (0.00087) [2022-07-10 12:35:02,355][25689] Fps is (10 sec: 5343.8, 60 sec: 5461.3, 300 sec: 5536.9). Total num frames: 743119872. Throughput: 0: 5783.3. Samples: 743129110. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:35:02,356][25689] Avg episode reward: [(0, '-2.952')] [2022-07-10 12:35:03,244][26022] Updated weights on worker 0-0, policy_version 725710 (0.00090) [2022-07-10 12:35:05,051][26022] Updated weights on worker 0-0, policy_version 725720 (0.00087) [2022-07-10 12:35:07,173][26022] Updated weights on worker 0-0, policy_version 725730 (0.00091) [2022-07-10 12:35:07,384][25689] Fps is (10 sec: 5380.6, 60 sec: 5528.9, 300 sec: 5538.2). Total num frames: 743148544. Throughput: 0: 4900.7. Samples: 743144218. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:35:07,384][25689] Avg episode reward: [(0, '-3.218')] [2022-07-10 12:35:08,914][26022] Updated weights on worker 0-0, policy_version 725740 (0.00094) [2022-07-10 12:35:10,589][26022] Updated weights on worker 0-0, policy_version 725750 (0.00094) [2022-07-10 12:35:12,391][25689] Fps is (10 sec: 5816.0, 60 sec: 5564.2, 300 sec: 5544.0). Total num frames: 743178240. Throughput: 0: 5735.6. Samples: 743177824. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:35:12,391][25689] Avg episode reward: [(0, '-1.764')] [2022-07-10 12:35:12,399][26022] Updated weights on worker 0-0, policy_version 725760 (0.00082) [2022-07-10 12:35:14,398][26022] Updated weights on worker 0-0, policy_version 725770 (0.00085) [2022-07-10 12:35:16,294][26022] Updated weights on worker 0-0, policy_version 725780 (0.00086) [2022-07-10 12:35:17,474][25689] Fps is (10 sec: 5581.4, 60 sec: 5516.5, 300 sec: 5540.6). Total num frames: 743204864. Throughput: 0: 5706.9. Samples: 743211150. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:35:17,475][25689] Avg episode reward: [(0, '-1.780')] [2022-07-10 12:35:18,042][26022] Updated weights on worker 0-0, policy_version 725790 (0.00086) [2022-07-10 12:35:19,778][26022] Updated weights on worker 0-0, policy_version 725800 (0.00086) [2022-07-10 12:35:21,703][26022] Updated weights on worker 0-0, policy_version 725810 (0.00087) [2022-07-10 12:35:22,514][25689] Fps is (10 sec: 5462.1, 60 sec: 5550.9, 300 sec: 5539.9). Total num frames: 743233536. Throughput: 0: 4915.3. Samples: 743228198. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 12:35:22,515][25689] Avg episode reward: [(0, '-1.895')] [2022-07-10 12:35:23,411][26022] Updated weights on worker 0-0, policy_version 725820 (0.00086) [2022-07-10 12:35:25,294][26022] Updated weights on worker 0-0, policy_version 725830 (0.00079) [2022-07-10 12:35:27,123][26022] Updated weights on worker 0-0, policy_version 725840 (0.00086) [2022-07-10 12:35:27,551][25689] Fps is (10 sec: 5589.0, 60 sec: 5530.7, 300 sec: 5536.3). Total num frames: 743261184. Throughput: 0: 5820.1. Samples: 743261594. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:27,552][25689] Avg episode reward: [(0, '-1.734')] [2022-07-10 12:35:29,043][26022] Updated weights on worker 0-0, policy_version 725850 (0.00088) [2022-07-10 12:35:30,955][26022] Updated weights on worker 0-0, policy_version 725860 (0.00087) [2022-07-10 12:35:32,563][25689] Fps is (10 sec: 5604.9, 60 sec: 5565.2, 300 sec: 5547.7). Total num frames: 743289856. Throughput: 0: 5826.0. Samples: 743295346. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:32,563][25689] Avg episode reward: [(0, '-1.801')] [2022-07-10 12:35:32,803][26022] Updated weights on worker 0-0, policy_version 725870 (0.00084) [2022-07-10 12:35:34,487][26022] Updated weights on worker 0-0, policy_version 725880 (0.00089) [2022-07-10 12:35:36,442][26022] Updated weights on worker 0-0, policy_version 725890 (0.00086) [2022-07-10 12:35:37,681][25689] Fps is (10 sec: 5559.7, 60 sec: 5530.2, 300 sec: 5539.2). Total num frames: 743317504. Throughput: 0: 4993.4. Samples: 743312052. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:37,682][25689] Avg episode reward: [(0, '-2.570')] [2022-07-10 12:35:38,174][26022] Updated weights on worker 0-0, policy_version 725900 (0.00095) [2022-07-10 12:35:40,147][26022] Updated weights on worker 0-0, policy_version 725910 (0.00091) [2022-07-10 12:35:41,867][26022] Updated weights on worker 0-0, policy_version 725920 (0.00085) [2022-07-10 12:35:42,691][25689] Fps is (10 sec: 5560.8, 60 sec: 5564.8, 300 sec: 5539.1). Total num frames: 743346176. Throughput: 0: 5816.6. Samples: 743345556. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:42,691][25689] Avg episode reward: [(0, '-2.440')] [2022-07-10 12:35:43,759][26022] Updated weights on worker 0-0, policy_version 725930 (0.00447) [2022-07-10 12:35:45,621][26022] Updated weights on worker 0-0, policy_version 725940 (0.00088) [2022-07-10 12:35:47,466][26022] Updated weights on worker 0-0, policy_version 725950 (0.00091) [2022-07-10 12:35:47,718][25689] Fps is (10 sec: 5509.6, 60 sec: 5531.7, 300 sec: 5540.2). Total num frames: 743372800. Throughput: 0: 5819.9. Samples: 743378960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:47,718][25689] Avg episode reward: [(0, '-3.406')] [2022-07-10 12:35:49,233][26022] Updated weights on worker 0-0, policy_version 725960 (0.00085) [2022-07-10 12:35:51,069][26022] Updated weights on worker 0-0, policy_version 725970 (0.00096) [2022-07-10 12:35:52,730][25689] Fps is (10 sec: 5507.8, 60 sec: 5548.7, 300 sec: 5541.0). Total num frames: 743401472. Throughput: 0: 4982.9. Samples: 743395838. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:52,732][25689] Avg episode reward: [(0, '-2.933')] [2022-07-10 12:35:52,928][26022] Updated weights on worker 0-0, policy_version 725980 (0.00093) [2022-07-10 12:35:54,799][26022] Updated weights on worker 0-0, policy_version 725990 (0.00079) [2022-07-10 12:35:56,647][26022] Updated weights on worker 0-0, policy_version 726000 (0.00091) [2022-07-10 12:35:57,836][25689] Fps is (10 sec: 5667.3, 60 sec: 5548.0, 300 sec: 5540.1). Total num frames: 743430144. Throughput: 0: 5824.3. Samples: 743429440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:35:57,837][25689] Avg episode reward: [(0, '-3.716')] [2022-07-10 12:35:58,439][26022] Updated weights on worker 0-0, policy_version 726010 (0.00078) [2022-07-10 12:36:00,291][26022] Updated weights on worker 0-0, policy_version 726020 (0.00097) [2022-07-10 12:36:02,541][26022] Updated weights on worker 0-0, policy_version 726030 (0.00092) [2022-07-10 12:36:02,870][25689] Fps is (10 sec: 5352.3, 60 sec: 5550.2, 300 sec: 5541.0). Total num frames: 743455744. Throughput: 0: 5750.9. Samples: 743461606. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:02,871][25689] Avg episode reward: [(0, '-3.136')] [2022-07-10 12:36:04,143][26022] Updated weights on worker 0-0, policy_version 726040 (0.00078) [2022-07-10 12:36:05,991][26022] Updated weights on worker 0-0, policy_version 726050 (0.00083) [2022-07-10 12:36:07,791][26022] Updated weights on worker 0-0, policy_version 726060 (0.00095) [2022-07-10 12:36:07,882][25689] Fps is (10 sec: 5504.4, 60 sec: 5568.7, 300 sec: 5548.3). Total num frames: 743485440. Throughput: 0: 4916.7. Samples: 743478102. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:07,882][25689] Avg episode reward: [(0, '-1.714')] [2022-07-10 12:36:09,881][26022] Updated weights on worker 0-0, policy_version 726070 (0.00084) [2022-07-10 12:36:11,714][26022] Updated weights on worker 0-0, policy_version 726080 (0.00086) [2022-07-10 12:36:12,890][25689] Fps is (10 sec: 5722.8, 60 sec: 5534.7, 300 sec: 5546.5). Total num frames: 743513088. Throughput: 0: 5743.4. Samples: 743511626. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:12,892][25689] Avg episode reward: [(0, '-0.771')] [2022-07-10 12:36:13,371][26022] Updated weights on worker 0-0, policy_version 726090 (0.00093) [2022-07-10 12:36:15,233][26022] Updated weights on worker 0-0, policy_version 726100 (0.00095) [2022-07-10 12:36:17,358][26022] Updated weights on worker 0-0, policy_version 726110 (0.00089) [2022-07-10 12:36:17,972][25689] Fps is (10 sec: 5480.4, 60 sec: 5551.8, 300 sec: 5541.7). Total num frames: 743540736. Throughput: 0: 5731.2. Samples: 743544842. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:17,972][25689] Avg episode reward: [(0, '-1.384')] [2022-07-10 12:36:19,046][26022] Updated weights on worker 0-0, policy_version 726120 (0.00086) [2022-07-10 12:36:21,013][26022] Updated weights on worker 0-0, policy_version 726130 (0.00093) [2022-07-10 12:36:22,762][26022] Updated weights on worker 0-0, policy_version 726140 (0.00087) [2022-07-10 12:36:23,001][25689] Fps is (10 sec: 5469.0, 60 sec: 5535.9, 300 sec: 5541.4). Total num frames: 743568384. Throughput: 0: 5789.4. Samples: 743578152. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:23,002][25689] Avg episode reward: [(0, '-0.366')] [2022-07-10 12:36:24,584][26022] Updated weights on worker 0-0, policy_version 726150 (0.00087) [2022-07-10 12:36:26,348][26022] Updated weights on worker 0-0, policy_version 726160 (0.00084) [2022-07-10 12:36:28,021][25689] Fps is (10 sec: 5604.5, 60 sec: 5554.4, 300 sec: 5544.6). Total num frames: 743597056. Throughput: 0: 5800.0. Samples: 743594908. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:28,022][25689] Avg episode reward: [(0, '-0.894')] [2022-07-10 12:36:28,315][26022] Updated weights on worker 0-0, policy_version 726170 (0.00085) [2022-07-10 12:36:30,034][26022] Updated weights on worker 0-0, policy_version 726180 (0.00081) [2022-07-10 12:36:31,912][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:36:31,930][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000726190_743618560.pth [2022-07-10 12:36:31,930][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000724240_741621760.pth [2022-07-10 12:36:31,934][26022] Updated weights on worker 0-0, policy_version 726190 (0.00097) [2022-07-10 12:36:33,073][25689] Fps is (10 sec: 5490.2, 60 sec: 5516.8, 300 sec: 5544.8). Total num frames: 743623680. Throughput: 0: 5778.3. Samples: 743628248. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:33,074][25689] Avg episode reward: [(0, '-1.743')] [2022-07-10 12:36:33,788][26022] Updated weights on worker 0-0, policy_version 726200 (0.00087) [2022-07-10 12:36:35,489][26022] Updated weights on worker 0-0, policy_version 726210 (0.00083) [2022-07-10 12:36:37,537][26022] Updated weights on worker 0-0, policy_version 726220 (0.00642) [2022-07-10 12:36:38,182][25689] Fps is (10 sec: 5441.7, 60 sec: 5534.6, 300 sec: 5532.5). Total num frames: 743652352. Throughput: 0: 5784.5. Samples: 743661750. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:38,183][25689] Avg episode reward: [(0, '-2.384')] [2022-07-10 12:36:39,204][26022] Updated weights on worker 0-0, policy_version 726230 (0.00086) [2022-07-10 12:36:41,200][26022] Updated weights on worker 0-0, policy_version 726240 (0.00089) [2022-07-10 12:36:42,865][26022] Updated weights on worker 0-0, policy_version 726250 (0.00087) [2022-07-10 12:36:43,195][25689] Fps is (10 sec: 5665.4, 60 sec: 5534.3, 300 sec: 5543.3). Total num frames: 743681024. Throughput: 0: 4966.7. Samples: 743678448. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:43,195][25689] Avg episode reward: [(0, '-2.834')] [2022-07-10 12:36:44,871][26022] Updated weights on worker 0-0, policy_version 726260 (0.00088) [2022-07-10 12:36:46,647][26022] Updated weights on worker 0-0, policy_version 726270 (0.00657) [2022-07-10 12:36:48,214][25689] Fps is (10 sec: 5512.5, 60 sec: 5535.1, 300 sec: 5536.4). Total num frames: 743707648. Throughput: 0: 5798.9. Samples: 743712004. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:48,214][25689] Avg episode reward: [(0, '-2.463')] [2022-07-10 12:36:48,355][26022] Updated weights on worker 0-0, policy_version 726280 (0.00093) [2022-07-10 12:36:50,295][26022] Updated weights on worker 0-0, policy_version 726290 (0.00086) [2022-07-10 12:36:52,040][26022] Updated weights on worker 0-0, policy_version 726300 (0.00081) [2022-07-10 12:36:53,229][25689] Fps is (10 sec: 5612.6, 60 sec: 5551.7, 300 sec: 5543.8). Total num frames: 743737344. Throughput: 0: 5836.8. Samples: 743745896. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:53,239][25689] Avg episode reward: [(0, '-3.824')] [2022-07-10 12:36:54,040][26022] Updated weights on worker 0-0, policy_version 726310 (0.00083) [2022-07-10 12:36:55,864][26022] Updated weights on worker 0-0, policy_version 726320 (0.00088) [2022-07-10 12:36:57,602][26022] Updated weights on worker 0-0, policy_version 726330 (0.00092) [2022-07-10 12:36:58,336][25689] Fps is (10 sec: 5867.6, 60 sec: 5568.6, 300 sec: 5549.1). Total num frames: 743767040. Throughput: 0: 5017.4. Samples: 743762866. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:36:58,336][25689] Avg episode reward: [(0, '-3.127')] [2022-07-10 12:36:59,463][26022] Updated weights on worker 0-0, policy_version 726340 (0.00090) [2022-07-10 12:37:01,101][26022] Updated weights on worker 0-0, policy_version 726350 (0.00085) [2022-07-10 12:37:03,290][26022] Updated weights on worker 0-0, policy_version 726360 (0.00095) [2022-07-10 12:37:03,388][25689] Fps is (10 sec: 5443.3, 60 sec: 5566.9, 300 sec: 5548.2). Total num frames: 743792640. Throughput: 0: 5765.4. Samples: 743794868. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:03,388][25689] Avg episode reward: [(0, '-3.978')] [2022-07-10 12:37:05,102][26022] Updated weights on worker 0-0, policy_version 726370 (0.00051) [2022-07-10 12:37:07,178][26022] Updated weights on worker 0-0, policy_version 726380 (0.00087) [2022-07-10 12:37:08,440][25689] Fps is (10 sec: 5371.0, 60 sec: 5546.3, 300 sec: 5547.8). Total num frames: 743821312. Throughput: 0: 5759.3. Samples: 743828496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:08,441][25689] Avg episode reward: [(0, '-2.562')] [2022-07-10 12:37:08,753][26022] Updated weights on worker 0-0, policy_version 726390 (0.00084) [2022-07-10 12:37:10,637][26022] Updated weights on worker 0-0, policy_version 726400 (0.00091) [2022-07-10 12:37:12,440][26022] Updated weights on worker 0-0, policy_version 726410 (0.00089) [2022-07-10 12:37:13,447][25689] Fps is (10 sec: 5598.8, 60 sec: 5546.4, 300 sec: 5545.9). Total num frames: 743848960. Throughput: 0: 4911.8. Samples: 743845202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:13,448][25689] Avg episode reward: [(0, '-3.450')] [2022-07-10 12:37:14,301][26022] Updated weights on worker 0-0, policy_version 726420 (0.00087) [2022-07-10 12:37:16,388][26022] Updated weights on worker 0-0, policy_version 726430 (0.00095) [2022-07-10 12:37:18,008][26022] Updated weights on worker 0-0, policy_version 726440 (0.00092) [2022-07-10 12:37:18,513][25689] Fps is (10 sec: 5489.6, 60 sec: 5547.8, 300 sec: 5548.3). Total num frames: 743876608. Throughput: 0: 5731.1. Samples: 743878504. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:18,513][25689] Avg episode reward: [(0, '-4.117')] [2022-07-10 12:37:20,049][26022] Updated weights on worker 0-0, policy_version 726450 (0.00091) [2022-07-10 12:37:21,778][26022] Updated weights on worker 0-0, policy_version 726460 (0.00088) [2022-07-10 12:37:23,587][25689] Fps is (10 sec: 5453.5, 60 sec: 5543.8, 300 sec: 5543.7). Total num frames: 743904256. Throughput: 0: 5794.0. Samples: 743911900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:23,587][25689] Avg episode reward: [(0, '-3.467')] [2022-07-10 12:37:23,595][26022] Updated weights on worker 0-0, policy_version 726470 (0.00087) [2022-07-10 12:37:25,621][26022] Updated weights on worker 0-0, policy_version 726480 (0.00624) [2022-07-10 12:37:27,237][26022] Updated weights on worker 0-0, policy_version 726490 (0.00094) [2022-07-10 12:37:28,642][25689] Fps is (10 sec: 5459.1, 60 sec: 5523.6, 300 sec: 5539.3). Total num frames: 743931904. Throughput: 0: 4944.4. Samples: 743928382. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:28,642][25689] Avg episode reward: [(0, '-2.730')] [2022-07-10 12:37:29,411][26022] Updated weights on worker 0-0, policy_version 726500 (0.00099) [2022-07-10 12:37:30,950][26022] Updated weights on worker 0-0, policy_version 726510 (0.00094) [2022-07-10 12:37:32,872][26022] Updated weights on worker 0-0, policy_version 726520 (0.00088) [2022-07-10 12:37:33,650][25689] Fps is (10 sec: 5596.5, 60 sec: 5561.4, 300 sec: 5544.6). Total num frames: 743960576. Throughput: 0: 5770.2. Samples: 743961776. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:33,653][25689] Avg episode reward: [(0, '-2.706')] [2022-07-10 12:37:34,656][26022] Updated weights on worker 0-0, policy_version 726530 (0.00101) [2022-07-10 12:37:36,377][26022] Updated weights on worker 0-0, policy_version 726540 (0.00090) [2022-07-10 12:37:38,451][26022] Updated weights on worker 0-0, policy_version 726550 (0.00092) [2022-07-10 12:37:38,777][25689] Fps is (10 sec: 5657.8, 60 sec: 5559.8, 300 sec: 5539.3). Total num frames: 743989248. Throughput: 0: 5769.3. Samples: 743995416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:38,778][25689] Avg episode reward: [(0, '-3.313')] [2022-07-10 12:37:40,178][26022] Updated weights on worker 0-0, policy_version 726560 (0.00089) [2022-07-10 12:37:42,117][26022] Updated weights on worker 0-0, policy_version 726570 (0.00089) [2022-07-10 12:37:43,840][25689] Fps is (10 sec: 5527.2, 60 sec: 5538.3, 300 sec: 5538.3). Total num frames: 744016896. Throughput: 0: 4944.5. Samples: 744012042. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:43,840][25689] Avg episode reward: [(0, '-2.025')] [2022-07-10 12:37:43,954][26022] Updated weights on worker 0-0, policy_version 726580 (0.00090) [2022-07-10 12:37:45,819][26022] Updated weights on worker 0-0, policy_version 726590 (0.00094) [2022-07-10 12:37:47,753][26022] Updated weights on worker 0-0, policy_version 726600 (0.00093) [2022-07-10 12:37:48,867][25689] Fps is (10 sec: 5480.3, 60 sec: 5554.4, 300 sec: 5534.4). Total num frames: 744044544. Throughput: 0: 5763.4. Samples: 744044948. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:48,868][25689] Avg episode reward: [(0, '-1.023')] [2022-07-10 12:37:49,484][26022] Updated weights on worker 0-0, policy_version 726610 (0.00087) [2022-07-10 12:37:51,373][26022] Updated weights on worker 0-0, policy_version 726620 (0.00085) [2022-07-10 12:37:53,265][26022] Updated weights on worker 0-0, policy_version 726630 (0.00086) [2022-07-10 12:37:53,959][25689] Fps is (10 sec: 5565.9, 60 sec: 5530.7, 300 sec: 5537.0). Total num frames: 744073216. Throughput: 0: 5750.2. Samples: 744078554. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:53,959][25689] Avg episode reward: [(0, '-1.616')] [2022-07-10 12:37:55,035][26022] Updated weights on worker 0-0, policy_version 726640 (0.00078) [2022-07-10 12:37:56,805][26022] Updated weights on worker 0-0, policy_version 726650 (0.00089) [2022-07-10 12:37:58,629][26022] Updated weights on worker 0-0, policy_version 726660 (0.00093) [2022-07-10 12:37:59,046][25689] Fps is (10 sec: 5633.8, 60 sec: 5515.5, 300 sec: 5538.9). Total num frames: 744101888. Throughput: 0: 4937.9. Samples: 744095500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:37:59,046][25689] Avg episode reward: [(0, '-2.786')] [2022-07-10 12:38:00,427][26022] Updated weights on worker 0-0, policy_version 726670 (0.00084) [2022-07-10 12:38:02,755][26022] Updated weights on worker 0-0, policy_version 726680 (0.00090) [2022-07-10 12:38:04,081][25689] Fps is (10 sec: 5260.5, 60 sec: 5500.2, 300 sec: 5535.0). Total num frames: 744126464. Throughput: 0: 5754.0. Samples: 744128508. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:04,081][25689] Avg episode reward: [(0, '-2.527')] [2022-07-10 12:38:04,415][26022] Updated weights on worker 0-0, policy_version 726690 (0.00089) [2022-07-10 12:38:06,355][26022] Updated weights on worker 0-0, policy_version 726700 (0.00086) [2022-07-10 12:38:07,998][26022] Updated weights on worker 0-0, policy_version 726710 (0.00087) [2022-07-10 12:38:09,087][25689] Fps is (10 sec: 5405.0, 60 sec: 5521.3, 300 sec: 5535.6). Total num frames: 744156160. Throughput: 0: 5715.9. Samples: 744160520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:09,087][25689] Avg episode reward: [(0, '-0.827')] [2022-07-10 12:38:10,186][26022] Updated weights on worker 0-0, policy_version 726720 (0.00092) [2022-07-10 12:38:11,842][26022] Updated weights on worker 0-0, policy_version 726730 (0.00085) [2022-07-10 12:38:13,695][26022] Updated weights on worker 0-0, policy_version 726740 (0.00086) [2022-07-10 12:38:14,111][25689] Fps is (10 sec: 5717.3, 60 sec: 5519.8, 300 sec: 5539.4). Total num frames: 744183808. Throughput: 0: 4895.4. Samples: 744177206. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:14,112][25689] Avg episode reward: [(0, '-2.417')] [2022-07-10 12:38:15,493][26022] Updated weights on worker 0-0, policy_version 726750 (0.00087) [2022-07-10 12:38:17,415][26022] Updated weights on worker 0-0, policy_version 726760 (0.00088) [2022-07-10 12:38:19,173][25689] Fps is (10 sec: 5482.2, 60 sec: 5520.0, 300 sec: 5538.6). Total num frames: 744211456. Throughput: 0: 5727.1. Samples: 744210772. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:19,174][25689] Avg episode reward: [(0, '-3.621')] [2022-07-10 12:38:19,290][26022] Updated weights on worker 0-0, policy_version 726770 (0.00095) [2022-07-10 12:38:21,102][26022] Updated weights on worker 0-0, policy_version 726780 (0.00086) [2022-07-10 12:38:22,787][26022] Updated weights on worker 0-0, policy_version 726790 (0.00085) [2022-07-10 12:38:24,188][25689] Fps is (10 sec: 5487.1, 60 sec: 5525.4, 300 sec: 5538.3). Total num frames: 744239104. Throughput: 0: 5767.8. Samples: 744244482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:24,189][25689] Avg episode reward: [(0, '-2.144')] [2022-07-10 12:38:24,728][26022] Updated weights on worker 0-0, policy_version 726800 (0.00089) [2022-07-10 12:38:26,465][26022] Updated weights on worker 0-0, policy_version 726810 (0.00082) [2022-07-10 12:38:28,453][26022] Updated weights on worker 0-0, policy_version 726820 (0.00095) [2022-07-10 12:38:29,204][25689] Fps is (10 sec: 5717.0, 60 sec: 5562.9, 300 sec: 5545.0). Total num frames: 744268800. Throughput: 0: 5003.8. Samples: 744261180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:29,205][25689] Avg episode reward: [(0, '-3.157')] [2022-07-10 12:38:30,199][26022] Updated weights on worker 0-0, policy_version 726830 (0.00090) [2022-07-10 12:38:31,993][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:38:32,002][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000726839_744283136.pth [2022-07-10 12:38:32,002][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000724890_742287360.pth [2022-07-10 12:38:32,224][26022] Updated weights on worker 0-0, policy_version 726840 (0.00090) [2022-07-10 12:38:33,856][26022] Updated weights on worker 0-0, policy_version 726850 (0.00090) [2022-07-10 12:38:34,243][25689] Fps is (10 sec: 5703.3, 60 sec: 5543.1, 300 sec: 5538.8). Total num frames: 744296448. Throughput: 0: 5840.8. Samples: 744294792. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:34,245][25689] Avg episode reward: [(0, '-3.329')] [2022-07-10 12:38:35,780][26022] Updated weights on worker 0-0, policy_version 726860 (0.00092) [2022-07-10 12:38:37,413][26022] Updated weights on worker 0-0, policy_version 726870 (0.00097) [2022-07-10 12:38:39,291][25689] Fps is (10 sec: 5481.9, 60 sec: 5533.5, 300 sec: 5541.9). Total num frames: 744324096. Throughput: 0: 5848.9. Samples: 744328436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:39,291][25689] Avg episode reward: [(0, '-3.219')] [2022-07-10 12:38:39,341][26022] Updated weights on worker 0-0, policy_version 726880 (0.00095) [2022-07-10 12:38:41,125][26022] Updated weights on worker 0-0, policy_version 726890 (0.00081) [2022-07-10 12:38:43,047][26022] Updated weights on worker 0-0, policy_version 726900 (0.00083) [2022-07-10 12:38:44,305][25689] Fps is (10 sec: 5597.2, 60 sec: 5554.8, 300 sec: 5538.6). Total num frames: 744352768. Throughput: 0: 5013.3. Samples: 744345334. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-10 12:38:44,305][25689] Avg episode reward: [(0, '-1.549')] [2022-07-10 12:38:44,854][26022] Updated weights on worker 0-0, policy_version 726910 (0.00091) [2022-07-10 12:38:46,825][26022] Updated weights on worker 0-0, policy_version 726920 (0.01150) [2022-07-10 12:38:48,658][26022] Updated weights on worker 0-0, policy_version 726930 (0.00075) [2022-07-10 12:38:49,324][25689] Fps is (10 sec: 5511.4, 60 sec: 5538.7, 300 sec: 5535.6). Total num frames: 744379392. Throughput: 0: 5822.0. Samples: 744378318. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:38:49,324][25689] Avg episode reward: [(0, '-1.996')] [2022-07-10 12:38:50,402][26022] Updated weights on worker 0-0, policy_version 726940 (0.00086) [2022-07-10 12:38:52,490][26022] Updated weights on worker 0-0, policy_version 726950 (0.00088) [2022-07-10 12:38:54,099][26022] Updated weights on worker 0-0, policy_version 726960 (0.00086) [2022-07-10 12:38:54,383][25689] Fps is (10 sec: 5486.9, 60 sec: 5541.6, 300 sec: 5543.0). Total num frames: 744408064. Throughput: 0: 5799.5. Samples: 744411594. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:38:54,383][25689] Avg episode reward: [(0, '-3.418')] [2022-07-10 12:38:56,264][26022] Updated weights on worker 0-0, policy_version 726970 (0.00086) [2022-07-10 12:38:57,697][26022] Updated weights on worker 0-0, policy_version 726980 (0.00097) [2022-07-10 12:38:59,487][25689] Fps is (10 sec: 5440.4, 60 sec: 5506.1, 300 sec: 5527.7). Total num frames: 744434688. Throughput: 0: 5771.5. Samples: 744445002. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:38:59,488][25689] Avg episode reward: [(0, '-2.808')] [2022-07-10 12:38:59,856][26022] Updated weights on worker 0-0, policy_version 726990 (0.00096) [2022-07-10 12:39:01,518][26022] Updated weights on worker 0-0, policy_version 727000 (0.00093) [2022-07-10 12:39:03,727][26022] Updated weights on worker 0-0, policy_version 727010 (0.00095) [2022-07-10 12:39:04,516][25689] Fps is (10 sec: 5355.8, 60 sec: 5557.6, 300 sec: 5538.0). Total num frames: 744462336. Throughput: 0: 5670.7. Samples: 744459946. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:04,516][25689] Avg episode reward: [(0, '-3.110')] [2022-07-10 12:39:05,608][26022] Updated weights on worker 0-0, policy_version 727020 (0.00089) [2022-07-10 12:39:07,520][26022] Updated weights on worker 0-0, policy_version 727030 (0.00091) [2022-07-10 12:39:09,207][26022] Updated weights on worker 0-0, policy_version 727040 (0.00079) [2022-07-10 12:39:09,524][25689] Fps is (10 sec: 5611.5, 60 sec: 5540.4, 300 sec: 5541.7). Total num frames: 744491008. Throughput: 0: 5687.2. Samples: 744493202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:09,524][25689] Avg episode reward: [(0, '-3.227')] [2022-07-10 12:39:11,293][26022] Updated weights on worker 0-0, policy_version 727050 (0.00094) [2022-07-10 12:39:12,798][26022] Updated weights on worker 0-0, policy_version 727060 (0.00088) [2022-07-10 12:39:14,531][25689] Fps is (10 sec: 5419.1, 60 sec: 5508.1, 300 sec: 5530.0). Total num frames: 744516608. Throughput: 0: 5711.6. Samples: 744526672. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:14,531][25689] Avg episode reward: [(0, '-3.678')] [2022-07-10 12:39:14,971][26022] Updated weights on worker 0-0, policy_version 727070 (0.00090) [2022-07-10 12:39:16,466][26022] Updated weights on worker 0-0, policy_version 727080 (0.00095) [2022-07-10 12:39:18,527][26022] Updated weights on worker 0-0, policy_version 727090 (0.00083) [2022-07-10 12:39:19,580][25689] Fps is (10 sec: 5396.8, 60 sec: 5526.2, 300 sec: 5536.8). Total num frames: 744545280. Throughput: 0: 4889.8. Samples: 744543252. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:19,581][25689] Avg episode reward: [(0, '-4.509')] [2022-07-10 12:39:20,400][26022] Updated weights on worker 0-0, policy_version 727100 (0.00091) [2022-07-10 12:39:22,206][26022] Updated weights on worker 0-0, policy_version 727110 (0.00095) [2022-07-10 12:39:24,005][26022] Updated weights on worker 0-0, policy_version 727120 (0.00091) [2022-07-10 12:39:24,603][25689] Fps is (10 sec: 5693.1, 60 sec: 5542.4, 300 sec: 5536.4). Total num frames: 744573952. Throughput: 0: 5805.7. Samples: 744576568. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:24,604][25689] Avg episode reward: [(0, '-3.231')] [2022-07-10 12:39:25,794][26022] Updated weights on worker 0-0, policy_version 727130 (0.00098) [2022-07-10 12:39:27,630][26022] Updated weights on worker 0-0, policy_version 727140 (0.00094) [2022-07-10 12:39:29,619][25689] Fps is (10 sec: 5610.5, 60 sec: 5508.6, 300 sec: 5539.9). Total num frames: 744601600. Throughput: 0: 5819.0. Samples: 744610134. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:29,619][25689] Avg episode reward: [(0, '-4.656')] [2022-07-10 12:39:29,619][26022] Updated weights on worker 0-0, policy_version 727150 (0.00083) [2022-07-10 12:39:31,339][26022] Updated weights on worker 0-0, policy_version 727160 (0.00086) [2022-07-10 12:39:33,117][26022] Updated weights on worker 0-0, policy_version 727170 (0.00092) [2022-07-10 12:39:34,664][25689] Fps is (10 sec: 5496.4, 60 sec: 5508.0, 300 sec: 5534.2). Total num frames: 744629248. Throughput: 0: 4974.4. Samples: 744626822. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:34,665][25689] Avg episode reward: [(0, '-5.629')] [2022-07-10 12:39:35,199][26022] Updated weights on worker 0-0, policy_version 727180 (0.00084) [2022-07-10 12:39:36,911][26022] Updated weights on worker 0-0, policy_version 727190 (0.00091) [2022-07-10 12:39:38,778][26022] Updated weights on worker 0-0, policy_version 727200 (0.00093) [2022-07-10 12:39:39,723][25689] Fps is (10 sec: 5675.3, 60 sec: 5540.9, 300 sec: 5543.7). Total num frames: 744658944. Throughput: 0: 5821.5. Samples: 744660512. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:39,725][25689] Avg episode reward: [(0, '-5.086')] [2022-07-10 12:39:40,593][26022] Updated weights on worker 0-0, policy_version 727210 (0.00089) [2022-07-10 12:39:42,295][26022] Updated weights on worker 0-0, policy_version 727220 (0.00094) [2022-07-10 12:39:44,424][26022] Updated weights on worker 0-0, policy_version 727230 (0.00087) [2022-07-10 12:39:44,726][25689] Fps is (10 sec: 5495.4, 60 sec: 5491.1, 300 sec: 5534.0). Total num frames: 744684544. Throughput: 0: 5838.6. Samples: 744694056. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:44,728][25689] Avg episode reward: [(0, '-5.603')] [2022-07-10 12:39:45,975][26022] Updated weights on worker 0-0, policy_version 727240 (0.00087) [2022-07-10 12:39:48,007][26022] Updated weights on worker 0-0, policy_version 727250 (0.00090) [2022-07-10 12:39:49,721][26022] Updated weights on worker 0-0, policy_version 727260 (0.01149) [2022-07-10 12:39:49,766][25689] Fps is (10 sec: 5505.7, 60 sec: 5539.9, 300 sec: 5540.4). Total num frames: 744714240. Throughput: 0: 4978.7. Samples: 744710442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:49,767][25689] Avg episode reward: [(0, '-4.099')] [2022-07-10 12:39:51,634][26022] Updated weights on worker 0-0, policy_version 727270 (0.00092) [2022-07-10 12:39:53,464][26022] Updated weights on worker 0-0, policy_version 727280 (0.00088) [2022-07-10 12:39:54,798][25689] Fps is (10 sec: 5795.4, 60 sec: 5542.5, 300 sec: 5541.6). Total num frames: 744742912. Throughput: 0: 5813.0. Samples: 744743860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:54,798][25689] Avg episode reward: [(0, '-4.872')] [2022-07-10 12:39:55,306][26022] Updated weights on worker 0-0, policy_version 727290 (0.00089) [2022-07-10 12:39:57,171][26022] Updated weights on worker 0-0, policy_version 727300 (0.00099) [2022-07-10 12:39:58,982][26022] Updated weights on worker 0-0, policy_version 727310 (0.00084) [2022-07-10 12:39:59,893][25689] Fps is (10 sec: 5561.5, 60 sec: 5560.3, 300 sec: 5547.8). Total num frames: 744770560. Throughput: 0: 5789.2. Samples: 744777282. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:39:59,894][25689] Avg episode reward: [(0, '-3.733')] [2022-07-10 12:40:00,878][26022] Updated weights on worker 0-0, policy_version 727320 (0.00092) [2022-07-10 12:40:03,132][26022] Updated weights on worker 0-0, policy_version 727330 (0.00084) [2022-07-10 12:40:04,774][26022] Updated weights on worker 0-0, policy_version 727340 (0.00085) [2022-07-10 12:40:04,991][25689] Fps is (10 sec: 5324.1, 60 sec: 5536.9, 300 sec: 5539.6). Total num frames: 744797184. Throughput: 0: 4819.9. Samples: 744791732. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:04,992][25689] Avg episode reward: [(0, '-3.293')] [2022-07-10 12:40:06,715][26022] Updated weights on worker 0-0, policy_version 727350 (0.00082) [2022-07-10 12:40:08,544][26022] Updated weights on worker 0-0, policy_version 727360 (0.00088) [2022-07-10 12:40:10,072][25689] Fps is (10 sec: 5331.6, 60 sec: 5513.3, 300 sec: 5531.4). Total num frames: 744824832. Throughput: 0: 5649.2. Samples: 744825156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:10,073][25689] Avg episode reward: [(0, '-3.175')] [2022-07-10 12:40:10,320][26022] Updated weights on worker 0-0, policy_version 727370 (0.00091) [2022-07-10 12:40:12,305][26022] Updated weights on worker 0-0, policy_version 727380 (0.00090) [2022-07-10 12:40:14,002][26022] Updated weights on worker 0-0, policy_version 727390 (0.00091) [2022-07-10 12:40:15,081][25689] Fps is (10 sec: 5480.5, 60 sec: 5547.0, 300 sec: 5536.2). Total num frames: 744852480. Throughput: 0: 5667.7. Samples: 744858820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:15,082][25689] Avg episode reward: [(0, '-3.442')] [2022-07-10 12:40:16,039][26022] Updated weights on worker 0-0, policy_version 727400 (0.00088) [2022-07-10 12:40:17,601][26022] Updated weights on worker 0-0, policy_version 727410 (0.00087) [2022-07-10 12:40:19,760][26022] Updated weights on worker 0-0, policy_version 727420 (0.00092) [2022-07-10 12:40:20,140][25689] Fps is (10 sec: 5492.2, 60 sec: 5529.2, 300 sec: 5532.4). Total num frames: 744880128. Throughput: 0: 4856.1. Samples: 744875606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:20,141][25689] Avg episode reward: [(0, '-3.343')] [2022-07-10 12:40:21,556][26022] Updated weights on worker 0-0, policy_version 727430 (0.00092) [2022-07-10 12:40:23,365][26022] Updated weights on worker 0-0, policy_version 727440 (0.00094) [2022-07-10 12:40:25,149][25689] Fps is (10 sec: 5492.0, 60 sec: 5513.6, 300 sec: 5532.9). Total num frames: 744907776. Throughput: 0: 5803.5. Samples: 744908720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:25,150][25689] Avg episode reward: [(0, '-3.976')] [2022-07-10 12:40:25,370][26022] Updated weights on worker 0-0, policy_version 727450 (0.00085) [2022-07-10 12:40:27,019][26022] Updated weights on worker 0-0, policy_version 727460 (0.00084) [2022-07-10 12:40:28,887][26022] Updated weights on worker 0-0, policy_version 727470 (0.00087) [2022-07-10 12:40:30,192][25689] Fps is (10 sec: 5603.4, 60 sec: 5528.0, 300 sec: 5532.4). Total num frames: 744936448. Throughput: 0: 5829.6. Samples: 744942442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:30,192][25689] Avg episode reward: [(0, '-3.742')] [2022-07-10 12:40:30,736][26022] Updated weights on worker 0-0, policy_version 727480 (0.00086) [2022-07-10 12:40:32,016][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:40:32,033][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000727487_744946688.pth [2022-07-10 12:40:32,033][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000725540_742952960.pth [2022-07-10 12:40:32,587][26022] Updated weights on worker 0-0, policy_version 727490 (0.00086) [2022-07-10 12:40:34,345][26022] Updated weights on worker 0-0, policy_version 727500 (0.00088) [2022-07-10 12:40:35,225][25689] Fps is (10 sec: 5691.1, 60 sec: 5546.0, 300 sec: 5537.4). Total num frames: 744965120. Throughput: 0: 4975.8. Samples: 744959056. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:35,226][25689] Avg episode reward: [(0, '-4.011')] [2022-07-10 12:40:36,167][26022] Updated weights on worker 0-0, policy_version 727510 (0.00085) [2022-07-10 12:40:38,006][26022] Updated weights on worker 0-0, policy_version 727520 (0.00106) [2022-07-10 12:40:40,016][26022] Updated weights on worker 0-0, policy_version 727530 (0.00092) [2022-07-10 12:40:40,299][25689] Fps is (10 sec: 5572.4, 60 sec: 5510.9, 300 sec: 5532.8). Total num frames: 744992768. Throughput: 0: 5789.7. Samples: 744992316. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:40,299][25689] Avg episode reward: [(0, '-3.452')] [2022-07-10 12:40:41,606][26022] Updated weights on worker 0-0, policy_version 727540 (0.00086) [2022-07-10 12:40:43,670][26022] Updated weights on worker 0-0, policy_version 727550 (0.00085) [2022-07-10 12:40:45,343][25689] Fps is (10 sec: 5465.3, 60 sec: 5540.9, 300 sec: 5535.9). Total num frames: 745020416. Throughput: 0: 5794.8. Samples: 745025738. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:45,344][25689] Avg episode reward: [(0, '-2.754')] [2022-07-10 12:40:45,487][26022] Updated weights on worker 0-0, policy_version 727560 (0.00093) [2022-07-10 12:40:47,293][26022] Updated weights on worker 0-0, policy_version 727570 (0.00091) [2022-07-10 12:40:49,227][26022] Updated weights on worker 0-0, policy_version 727580 (0.00093) [2022-07-10 12:40:50,409][25689] Fps is (10 sec: 5368.1, 60 sec: 5487.9, 300 sec: 5528.0). Total num frames: 745047040. Throughput: 0: 4939.2. Samples: 745042298. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:50,409][25689] Avg episode reward: [(0, '-1.981')] [2022-07-10 12:40:50,992][26022] Updated weights on worker 0-0, policy_version 727590 (0.00092) [2022-07-10 12:40:52,935][26022] Updated weights on worker 0-0, policy_version 727600 (0.00532) [2022-07-10 12:40:54,796][26022] Updated weights on worker 0-0, policy_version 727610 (0.00103) [2022-07-10 12:40:55,435][25689] Fps is (10 sec: 5580.5, 60 sec: 5505.2, 300 sec: 5532.9). Total num frames: 745076736. Throughput: 0: 5745.0. Samples: 745075162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:40:55,436][25689] Avg episode reward: [(0, '-1.437')] [2022-07-10 12:40:56,730][26022] Updated weights on worker 0-0, policy_version 727620 (0.00094) [2022-07-10 12:40:58,422][26022] Updated weights on worker 0-0, policy_version 727630 (0.00089) [2022-07-10 12:41:00,455][26022] Updated weights on worker 0-0, policy_version 727640 (0.00080) [2022-07-10 12:41:00,547][25689] Fps is (10 sec: 5555.1, 60 sec: 5486.9, 300 sec: 5534.9). Total num frames: 745103360. Throughput: 0: 5735.1. Samples: 745108442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:00,547][25689] Avg episode reward: [(0, '-0.841')] [2022-07-10 12:41:02,458][26022] Updated weights on worker 0-0, policy_version 727650 (0.00090) [2022-07-10 12:41:04,723][26022] Updated weights on worker 0-0, policy_version 727660 (0.00094) [2022-07-10 12:41:05,574][25689] Fps is (10 sec: 5252.0, 60 sec: 5493.3, 300 sec: 5524.3). Total num frames: 745129984. Throughput: 0: 4808.3. Samples: 745123014. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:05,576][25689] Avg episode reward: [(0, '-0.982')] [2022-07-10 12:41:05,972][26022] Updated weights on worker 0-0, policy_version 727670 (0.00082) [2022-07-10 12:41:08,317][26022] Updated weights on worker 0-0, policy_version 727680 (0.00084) [2022-07-10 12:41:09,980][26022] Updated weights on worker 0-0, policy_version 727690 (0.00061) [2022-07-10 12:41:10,597][25689] Fps is (10 sec: 5400.2, 60 sec: 5498.6, 300 sec: 5524.0). Total num frames: 745157632. Throughput: 0: 5664.8. Samples: 745156662. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:10,597][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 12:41:11,755][26022] Updated weights on worker 0-0, policy_version 727700 (0.00084) [2022-07-10 12:41:13,742][26022] Updated weights on worker 0-0, policy_version 727710 (0.00086) [2022-07-10 12:41:15,264][26022] Updated weights on worker 0-0, policy_version 727720 (0.00084) [2022-07-10 12:41:15,604][25689] Fps is (10 sec: 5717.0, 60 sec: 5532.5, 300 sec: 5532.3). Total num frames: 745187328. Throughput: 0: 5706.5. Samples: 745190256. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:15,606][25689] Avg episode reward: [(0, '-1.625')] [2022-07-10 12:41:17,425][26022] Updated weights on worker 0-0, policy_version 727730 (0.00091) [2022-07-10 12:41:19,156][26022] Updated weights on worker 0-0, policy_version 727740 (0.00091) [2022-07-10 12:41:20,741][25689] Fps is (10 sec: 5451.1, 60 sec: 5491.7, 300 sec: 5523.4). Total num frames: 745212928. Throughput: 0: 5692.4. Samples: 745223394. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:20,751][25689] Avg episode reward: [(0, '-2.710')] [2022-07-10 12:41:21,069][26022] Updated weights on worker 0-0, policy_version 727750 (0.00089) [2022-07-10 12:41:22,931][26022] Updated weights on worker 0-0, policy_version 727760 (0.00088) [2022-07-10 12:41:24,885][26022] Updated weights on worker 0-0, policy_version 727770 (0.00085) [2022-07-10 12:41:25,762][25689] Fps is (10 sec: 5343.0, 60 sec: 5507.5, 300 sec: 5523.4). Total num frames: 745241600. Throughput: 0: 5781.1. Samples: 745239724. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:25,762][25689] Avg episode reward: [(0, '-3.072')] [2022-07-10 12:41:26,642][26022] Updated weights on worker 0-0, policy_version 727780 (0.00093) [2022-07-10 12:41:28,638][26022] Updated weights on worker 0-0, policy_version 727790 (0.00086) [2022-07-10 12:41:30,316][26022] Updated weights on worker 0-0, policy_version 727800 (0.00088) [2022-07-10 12:41:30,775][25689] Fps is (10 sec: 5510.6, 60 sec: 5476.3, 300 sec: 5524.1). Total num frames: 745268224. Throughput: 0: 5751.6. Samples: 745272720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:30,776][25689] Avg episode reward: [(0, '-2.205')] [2022-07-10 12:41:32,275][26022] Updated weights on worker 0-0, policy_version 727810 (0.00087) [2022-07-10 12:41:34,244][26022] Updated weights on worker 0-0, policy_version 727820 (0.00087) [2022-07-10 12:41:35,779][25689] Fps is (10 sec: 5520.1, 60 sec: 5479.0, 300 sec: 5526.1). Total num frames: 745296896. Throughput: 0: 5745.4. Samples: 745306168. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:35,780][25689] Avg episode reward: [(0, '-2.835')] [2022-07-10 12:41:35,807][26022] Updated weights on worker 0-0, policy_version 727830 (0.00087) [2022-07-10 12:41:37,874][26022] Updated weights on worker 0-0, policy_version 727840 (0.00089) [2022-07-10 12:41:39,457][26022] Updated weights on worker 0-0, policy_version 727850 (0.00092) [2022-07-10 12:41:40,871][25689] Fps is (10 sec: 5680.1, 60 sec: 5494.2, 300 sec: 5524.6). Total num frames: 745325568. Throughput: 0: 4942.3. Samples: 745322884. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:40,872][25689] Avg episode reward: [(0, '-3.887')] [2022-07-10 12:41:41,403][26022] Updated weights on worker 0-0, policy_version 727860 (0.00093) [2022-07-10 12:41:43,303][26022] Updated weights on worker 0-0, policy_version 727870 (0.00096) [2022-07-10 12:41:45,196][26022] Updated weights on worker 0-0, policy_version 727880 (0.00096) [2022-07-10 12:41:45,929][25689] Fps is (10 sec: 5548.8, 60 sec: 5493.0, 300 sec: 5527.3). Total num frames: 745353216. Throughput: 0: 5774.3. Samples: 745356174. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:45,929][25689] Avg episode reward: [(0, '-3.558')] [2022-07-10 12:41:46,952][26022] Updated weights on worker 0-0, policy_version 727890 (0.00092) [2022-07-10 12:41:48,802][26022] Updated weights on worker 0-0, policy_version 727900 (0.00091) [2022-07-10 12:41:50,505][26022] Updated weights on worker 0-0, policy_version 727910 (0.00097) [2022-07-10 12:41:50,968][25689] Fps is (10 sec: 5577.9, 60 sec: 5529.2, 300 sec: 5523.4). Total num frames: 745381888. Throughput: 0: 5794.0. Samples: 745389716. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:50,969][25689] Avg episode reward: [(0, '-3.841')] [2022-07-10 12:41:52,543][26022] Updated weights on worker 0-0, policy_version 727920 (0.00051) [2022-07-10 12:41:54,251][26022] Updated weights on worker 0-0, policy_version 727930 (0.00095) [2022-07-10 12:41:56,044][25689] Fps is (10 sec: 5466.6, 60 sec: 5474.1, 300 sec: 5513.7). Total num frames: 745408512. Throughput: 0: 4947.8. Samples: 745406436. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:41:56,045][25689] Avg episode reward: [(0, '-4.586')] [2022-07-10 12:41:56,223][26022] Updated weights on worker 0-0, policy_version 727940 (0.00092) [2022-07-10 12:41:58,118][26022] Updated weights on worker 0-0, policy_version 727950 (0.00096) [2022-07-10 12:41:59,677][26022] Updated weights on worker 0-0, policy_version 727960 (0.00092) [2022-07-10 12:42:01,147][25689] Fps is (10 sec: 5432.6, 60 sec: 5508.7, 300 sec: 5523.1). Total num frames: 745437184. Throughput: 0: 5772.7. Samples: 745439928. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:42:01,147][25689] Avg episode reward: [(0, '-6.096')] [2022-07-10 12:42:02,122][26022] Updated weights on worker 0-0, policy_version 727970 (0.00085) [2022-07-10 12:42:03,785][26022] Updated weights on worker 0-0, policy_version 727980 (0.00091) [2022-07-10 12:42:05,715][26022] Updated weights on worker 0-0, policy_version 727990 (0.00084) [2022-07-10 12:42:06,225][25689] Fps is (10 sec: 5531.9, 60 sec: 5520.9, 300 sec: 5519.2). Total num frames: 745464832. Throughput: 0: 5683.3. Samples: 745471524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-10 12:42:06,226][25689] Avg episode reward: [(0, '-6.060')] [2022-07-10 12:42:07,651][26022] Updated weights on worker 0-0, policy_version 728000 (0.00087) [2022-07-10 12:42:09,354][26022] Updated weights on worker 0-0, policy_version 728010 (0.00113) [2022-07-10 12:42:11,247][25689] Fps is (10 sec: 5373.5, 60 sec: 5504.1, 300 sec: 5515.4). Total num frames: 745491456. Throughput: 0: 4861.6. Samples: 745488302. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:11,247][25689] Avg episode reward: [(0, '-4.533')] [2022-07-10 12:42:11,477][26022] Updated weights on worker 0-0, policy_version 728020 (0.00089) [2022-07-10 12:42:13,058][26022] Updated weights on worker 0-0, policy_version 728030 (0.00086) [2022-07-10 12:42:15,043][26022] Updated weights on worker 0-0, policy_version 728040 (0.00086) [2022-07-10 12:42:16,292][25689] Fps is (10 sec: 5594.6, 60 sec: 5500.7, 300 sec: 5522.7). Total num frames: 745521152. Throughput: 0: 5697.8. Samples: 745521804. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:16,292][25689] Avg episode reward: [(0, '-4.098')] [2022-07-10 12:42:16,683][26022] Updated weights on worker 0-0, policy_version 728050 (0.00086) [2022-07-10 12:42:18,607][26022] Updated weights on worker 0-0, policy_version 728060 (0.00084) [2022-07-10 12:42:20,444][26022] Updated weights on worker 0-0, policy_version 728070 (0.00088) [2022-07-10 12:42:21,394][25689] Fps is (10 sec: 5651.3, 60 sec: 5537.6, 300 sec: 5522.2). Total num frames: 745548800. Throughput: 0: 5698.5. Samples: 745555306. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:21,394][25689] Avg episode reward: [(0, '-3.691')] [2022-07-10 12:42:22,411][26022] Updated weights on worker 0-0, policy_version 728080 (0.00102) [2022-07-10 12:42:24,082][26022] Updated weights on worker 0-0, policy_version 728090 (0.00092) [2022-07-10 12:42:26,126][26022] Updated weights on worker 0-0, policy_version 728100 (0.00095) [2022-07-10 12:42:26,418][25689] Fps is (10 sec: 5460.8, 60 sec: 5520.5, 300 sec: 5522.8). Total num frames: 745576448. Throughput: 0: 4972.1. Samples: 745571924. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:26,418][25689] Avg episode reward: [(0, '-4.339')] [2022-07-10 12:42:27,680][26022] Updated weights on worker 0-0, policy_version 728110 (0.00085) [2022-07-10 12:42:29,750][26022] Updated weights on worker 0-0, policy_version 728120 (0.00095) [2022-07-10 12:42:31,313][26022] Updated weights on worker 0-0, policy_version 728130 (0.00094) [2022-07-10 12:42:31,507][25689] Fps is (10 sec: 5569.0, 60 sec: 5547.3, 300 sec: 5521.3). Total num frames: 745605120. Throughput: 0: 5781.7. Samples: 745605440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:31,509][25689] Avg episode reward: [(0, '-3.116')] [2022-07-10 12:42:32,111][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:42:32,122][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000728134_745609216.pth [2022-07-10 12:42:32,122][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000726190_743618560.pth [2022-07-10 12:42:33,389][26022] Updated weights on worker 0-0, policy_version 728140 (0.00087) [2022-07-10 12:42:35,063][26022] Updated weights on worker 0-0, policy_version 728150 (0.00085) [2022-07-10 12:42:36,569][25689] Fps is (10 sec: 5548.3, 60 sec: 5525.1, 300 sec: 5519.0). Total num frames: 745632768. Throughput: 0: 5764.7. Samples: 745638694. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:36,570][25689] Avg episode reward: [(0, '-3.853')] [2022-07-10 12:42:37,223][26022] Updated weights on worker 0-0, policy_version 728160 (0.00085) [2022-07-10 12:42:38,732][26022] Updated weights on worker 0-0, policy_version 728170 (0.00100) [2022-07-10 12:42:40,924][26022] Updated weights on worker 0-0, policy_version 728180 (0.00098) [2022-07-10 12:42:41,660][25689] Fps is (10 sec: 5547.2, 60 sec: 5525.2, 300 sec: 5521.9). Total num frames: 745661440. Throughput: 0: 4924.8. Samples: 745655112. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:41,660][25689] Avg episode reward: [(0, '-3.598')] [2022-07-10 12:42:42,428][26022] Updated weights on worker 0-0, policy_version 728190 (0.00083) [2022-07-10 12:42:44,551][26022] Updated weights on worker 0-0, policy_version 728200 (0.00090) [2022-07-10 12:42:46,241][26022] Updated weights on worker 0-0, policy_version 728210 (0.00086) [2022-07-10 12:42:46,677][25689] Fps is (10 sec: 5571.7, 60 sec: 5528.9, 300 sec: 5522.1). Total num frames: 745689088. Throughput: 0: 5765.4. Samples: 745688726. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:46,677][25689] Avg episode reward: [(0, '-3.568')] [2022-07-10 12:42:48,084][26022] Updated weights on worker 0-0, policy_version 728220 (0.00086) [2022-07-10 12:42:49,931][26022] Updated weights on worker 0-0, policy_version 728230 (0.00092) [2022-07-10 12:42:51,522][26022] Updated weights on worker 0-0, policy_version 728240 (0.00089) [2022-07-10 12:42:51,691][25689] Fps is (10 sec: 5716.8, 60 sec: 5548.1, 300 sec: 5527.0). Total num frames: 745718784. Throughput: 0: 5785.2. Samples: 745722206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:51,691][25689] Avg episode reward: [(0, '-3.646')] [2022-07-10 12:42:53,582][26022] Updated weights on worker 0-0, policy_version 728250 (0.00087) [2022-07-10 12:42:55,299][26022] Updated weights on worker 0-0, policy_version 728260 (0.00087) [2022-07-10 12:42:56,738][25689] Fps is (10 sec: 5496.2, 60 sec: 5533.9, 300 sec: 5517.4). Total num frames: 745744384. Throughput: 0: 4969.1. Samples: 745738916. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:42:56,738][25689] Avg episode reward: [(0, '-3.139')] [2022-07-10 12:42:57,152][26022] Updated weights on worker 0-0, policy_version 728270 (0.00636) [2022-07-10 12:42:59,251][26022] Updated weights on worker 0-0, policy_version 728280 (0.00085) [2022-07-10 12:43:00,859][26022] Updated weights on worker 0-0, policy_version 728290 (0.00089) [2022-07-10 12:43:01,890][25689] Fps is (10 sec: 5421.3, 60 sec: 5546.2, 300 sec: 5532.4). Total num frames: 745774080. Throughput: 0: 5796.4. Samples: 745772376. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:01,891][25689] Avg episode reward: [(0, '-3.770')] [2022-07-10 12:43:03,105][26022] Updated weights on worker 0-0, policy_version 728300 (0.00088) [2022-07-10 12:43:04,835][26022] Updated weights on worker 0-0, policy_version 728310 (0.00095) [2022-07-10 12:43:06,740][26022] Updated weights on worker 0-0, policy_version 728320 (0.00088) [2022-07-10 12:43:06,905][25689] Fps is (10 sec: 5539.5, 60 sec: 5535.2, 300 sec: 5522.0). Total num frames: 745800704. Throughput: 0: 5688.8. Samples: 745803798. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:06,905][25689] Avg episode reward: [(0, '-4.722')] [2022-07-10 12:43:08,600][26022] Updated weights on worker 0-0, policy_version 728330 (0.00088) [2022-07-10 12:43:10,476][26022] Updated weights on worker 0-0, policy_version 728340 (0.00095) [2022-07-10 12:43:11,939][25689] Fps is (10 sec: 5299.2, 60 sec: 5534.0, 300 sec: 5518.4). Total num frames: 745827328. Throughput: 0: 5670.9. Samples: 745837030. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:11,940][25689] Avg episode reward: [(0, '-6.107')] [2022-07-10 12:43:12,225][26022] Updated weights on worker 0-0, policy_version 728350 (0.00092) [2022-07-10 12:43:14,193][26022] Updated weights on worker 0-0, policy_version 728360 (0.00090) [2022-07-10 12:43:15,844][26022] Updated weights on worker 0-0, policy_version 728370 (0.00086) [2022-07-10 12:43:16,980][25689] Fps is (10 sec: 5488.4, 60 sec: 5517.5, 300 sec: 5522.2). Total num frames: 745856000. Throughput: 0: 5683.2. Samples: 745853956. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:16,980][25689] Avg episode reward: [(0, '-5.330')] [2022-07-10 12:43:17,862][26022] Updated weights on worker 0-0, policy_version 728380 (0.00092) [2022-07-10 12:43:19,542][26022] Updated weights on worker 0-0, policy_version 728390 (0.00086) [2022-07-10 12:43:21,375][26022] Updated weights on worker 0-0, policy_version 728400 (0.00084) [2022-07-10 12:43:22,082][25689] Fps is (10 sec: 5552.5, 60 sec: 5517.5, 300 sec: 5520.6). Total num frames: 745883648. Throughput: 0: 5705.9. Samples: 745887586. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:22,082][25689] Avg episode reward: [(0, '-6.823')] [2022-07-10 12:43:23,191][26022] Updated weights on worker 0-0, policy_version 728410 (0.00087) [2022-07-10 12:43:25,144][26022] Updated weights on worker 0-0, policy_version 728420 (0.00083) [2022-07-10 12:43:27,011][26022] Updated weights on worker 0-0, policy_version 728430 (0.00083) [2022-07-10 12:43:27,088][25689] Fps is (10 sec: 5571.4, 60 sec: 5536.0, 300 sec: 5517.3). Total num frames: 745912320. Throughput: 0: 5806.9. Samples: 745921002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:27,089][25689] Avg episode reward: [(0, '-6.277')] [2022-07-10 12:43:28,755][26022] Updated weights on worker 0-0, policy_version 728440 (0.00086) [2022-07-10 12:43:30,711][26022] Updated weights on worker 0-0, policy_version 728450 (0.00095) [2022-07-10 12:43:32,108][25689] Fps is (10 sec: 5719.2, 60 sec: 5542.3, 300 sec: 5521.1). Total num frames: 745940992. Throughput: 0: 4983.0. Samples: 745937536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:32,110][25689] Avg episode reward: [(0, '-5.234')] [2022-07-10 12:43:32,651][26022] Updated weights on worker 0-0, policy_version 728460 (0.00083) [2022-07-10 12:43:34,314][26022] Updated weights on worker 0-0, policy_version 728470 (0.00085) [2022-07-10 12:43:36,461][26022] Updated weights on worker 0-0, policy_version 728480 (0.00092) [2022-07-10 12:43:37,112][25689] Fps is (10 sec: 5516.6, 60 sec: 5530.7, 300 sec: 5518.5). Total num frames: 745967616. Throughput: 0: 5810.4. Samples: 745970932. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:37,112][25689] Avg episode reward: [(0, '-5.110')] [2022-07-10 12:43:37,752][26022] Updated weights on worker 0-0, policy_version 728490 (0.00085) [2022-07-10 12:43:39,989][26022] Updated weights on worker 0-0, policy_version 728500 (0.00089) [2022-07-10 12:43:41,666][26022] Updated weights on worker 0-0, policy_version 728510 (0.00080) [2022-07-10 12:43:42,179][25689] Fps is (10 sec: 5490.9, 60 sec: 5532.9, 300 sec: 5517.5). Total num frames: 745996288. Throughput: 0: 5821.2. Samples: 746004574. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:42,180][25689] Avg episode reward: [(0, '-4.340')] [2022-07-10 12:43:43,527][26022] Updated weights on worker 0-0, policy_version 728520 (0.00086) [2022-07-10 12:43:45,310][26022] Updated weights on worker 0-0, policy_version 728530 (0.00086) [2022-07-10 12:43:47,214][25689] Fps is (10 sec: 5676.4, 60 sec: 5548.2, 300 sec: 5524.1). Total num frames: 746024960. Throughput: 0: 4987.3. Samples: 746021376. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:47,215][25689] Avg episode reward: [(0, '-4.210')] [2022-07-10 12:43:47,219][26022] Updated weights on worker 0-0, policy_version 728540 (0.00085) [2022-07-10 12:43:49,070][26022] Updated weights on worker 0-0, policy_version 728550 (0.00084) [2022-07-10 12:43:51,012][26022] Updated weights on worker 0-0, policy_version 728560 (0.00151) [2022-07-10 12:43:52,221][25689] Fps is (10 sec: 5404.5, 60 sec: 5481.2, 300 sec: 5514.7). Total num frames: 746050560. Throughput: 0: 5812.4. Samples: 746054438. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:52,222][25689] Avg episode reward: [(0, '-3.055')] [2022-07-10 12:43:52,829][26022] Updated weights on worker 0-0, policy_version 728570 (0.00617) [2022-07-10 12:43:54,809][26022] Updated weights on worker 0-0, policy_version 728580 (0.00087) [2022-07-10 12:43:56,408][26022] Updated weights on worker 0-0, policy_version 728590 (0.00091) [2022-07-10 12:43:57,267][25689] Fps is (10 sec: 5500.8, 60 sec: 5548.9, 300 sec: 5526.1). Total num frames: 746080256. Throughput: 0: 5781.4. Samples: 746087454. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:43:57,267][25689] Avg episode reward: [(0, '-3.926')] [2022-07-10 12:43:58,507][26022] Updated weights on worker 0-0, policy_version 728600 (0.00089) [2022-07-10 12:44:00,238][26022] Updated weights on worker 0-0, policy_version 728610 (0.00084) [2022-07-10 12:44:02,381][25689] Fps is (10 sec: 5442.1, 60 sec: 5484.7, 300 sec: 5517.7). Total num frames: 746105856. Throughput: 0: 4908.9. Samples: 746103746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:02,382][25689] Avg episode reward: [(0, '-5.002')] [2022-07-10 12:44:02,723][26022] Updated weights on worker 0-0, policy_version 728620 (0.00087) [2022-07-10 12:44:04,497][26022] Updated weights on worker 0-0, policy_version 728630 (0.00090) [2022-07-10 12:44:06,280][26022] Updated weights on worker 0-0, policy_version 728640 (0.00091) [2022-07-10 12:44:07,383][25689] Fps is (10 sec: 5162.4, 60 sec: 5485.9, 300 sec: 5510.9). Total num frames: 746132480. Throughput: 0: 5631.3. Samples: 746134952. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:07,383][25689] Avg episode reward: [(0, '-3.952')] [2022-07-10 12:44:08,160][26022] Updated weights on worker 0-0, policy_version 728650 (0.00092) [2022-07-10 12:44:09,786][26022] Updated weights on worker 0-0, policy_version 728660 (0.00085) [2022-07-10 12:44:11,799][26022] Updated weights on worker 0-0, policy_version 728670 (0.00088) [2022-07-10 12:44:12,425][25689] Fps is (10 sec: 5505.6, 60 sec: 5519.0, 300 sec: 5520.6). Total num frames: 746161152. Throughput: 0: 5633.4. Samples: 746168256. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:12,426][25689] Avg episode reward: [(0, '-4.388')] [2022-07-10 12:44:13,668][26022] Updated weights on worker 0-0, policy_version 728680 (0.00086) [2022-07-10 12:44:15,283][26022] Updated weights on worker 0-0, policy_version 728690 (0.00086) [2022-07-10 12:44:17,428][26022] Updated weights on worker 0-0, policy_version 728700 (0.00092) [2022-07-10 12:44:17,521][25689] Fps is (10 sec: 5555.4, 60 sec: 5497.1, 300 sec: 5516.3). Total num frames: 746188800. Throughput: 0: 4818.8. Samples: 746185054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:17,521][25689] Avg episode reward: [(0, '-3.355')] [2022-07-10 12:44:19,032][26022] Updated weights on worker 0-0, policy_version 728710 (0.00090) [2022-07-10 12:44:21,032][26022] Updated weights on worker 0-0, policy_version 728720 (0.00090) [2022-07-10 12:44:22,576][25689] Fps is (10 sec: 5649.1, 60 sec: 5535.2, 300 sec: 5519.1). Total num frames: 746218496. Throughput: 0: 5686.5. Samples: 746218582. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:22,576][25689] Avg episode reward: [(0, '-3.592')] [2022-07-10 12:44:22,780][26022] Updated weights on worker 0-0, policy_version 728730 (0.00089) [2022-07-10 12:44:24,637][26022] Updated weights on worker 0-0, policy_version 728740 (0.00092) [2022-07-10 12:44:26,505][26022] Updated weights on worker 0-0, policy_version 728750 (0.00086) [2022-07-10 12:44:27,596][25689] Fps is (10 sec: 5590.3, 60 sec: 5500.2, 300 sec: 5515.6). Total num frames: 746245120. Throughput: 0: 5800.2. Samples: 746252190. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:27,596][25689] Avg episode reward: [(0, '-2.911')] [2022-07-10 12:44:28,130][26022] Updated weights on worker 0-0, policy_version 728760 (0.00087) [2022-07-10 12:44:30,107][26022] Updated weights on worker 0-0, policy_version 728770 (0.00086) [2022-07-10 12:44:32,073][26022] Updated weights on worker 0-0, policy_version 728780 (0.00089) [2022-07-10 12:44:32,254][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:44:32,265][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000728781_746271744.pth [2022-07-10 12:44:32,265][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000726839_744283136.pth [2022-07-10 12:44:32,607][25689] Fps is (10 sec: 5512.3, 60 sec: 5500.9, 300 sec: 5519.7). Total num frames: 746273792. Throughput: 0: 4988.5. Samples: 746268936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:32,608][25689] Avg episode reward: [(0, '-2.274')] [2022-07-10 12:44:33,915][26022] Updated weights on worker 0-0, policy_version 728790 (0.00085) [2022-07-10 12:44:35,791][26022] Updated weights on worker 0-0, policy_version 728800 (0.00095) [2022-07-10 12:44:37,444][26022] Updated weights on worker 0-0, policy_version 728810 (0.00097) [2022-07-10 12:44:37,639][25689] Fps is (10 sec: 5709.5, 60 sec: 5532.2, 300 sec: 5516.7). Total num frames: 746302464. Throughput: 0: 5823.4. Samples: 746302210. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:37,640][25689] Avg episode reward: [(0, '-2.756')] [2022-07-10 12:44:39,435][26022] Updated weights on worker 0-0, policy_version 728820 (0.00084) [2022-07-10 12:44:41,273][26022] Updated weights on worker 0-0, policy_version 728830 (0.00086) [2022-07-10 12:44:42,773][25689] Fps is (10 sec: 5439.2, 60 sec: 5492.2, 300 sec: 5517.7). Total num frames: 746329088. Throughput: 0: 5788.0. Samples: 746335484. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:42,774][25689] Avg episode reward: [(0, '-2.470')] [2022-07-10 12:44:43,096][26022] Updated weights on worker 0-0, policy_version 728840 (0.00092) [2022-07-10 12:44:44,993][26022] Updated weights on worker 0-0, policy_version 728850 (0.00094) [2022-07-10 12:44:46,772][26022] Updated weights on worker 0-0, policy_version 728860 (0.00093) [2022-07-10 12:44:47,796][25689] Fps is (10 sec: 5545.0, 60 sec: 5510.3, 300 sec: 5518.1). Total num frames: 746358784. Throughput: 0: 4956.8. Samples: 746352320. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:47,797][25689] Avg episode reward: [(0, '-1.612')] [2022-07-10 12:44:48,638][26022] Updated weights on worker 0-0, policy_version 728870 (0.00087) [2022-07-10 12:44:50,453][26022] Updated weights on worker 0-0, policy_version 728880 (0.00090) [2022-07-10 12:44:52,194][26022] Updated weights on worker 0-0, policy_version 728890 (0.00091) [2022-07-10 12:44:52,841][25689] Fps is (10 sec: 5695.8, 60 sec: 5540.6, 300 sec: 5514.4). Total num frames: 746386432. Throughput: 0: 5780.0. Samples: 746385888. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:52,842][25689] Avg episode reward: [(0, '-2.534')] [2022-07-10 12:44:53,930][26022] Updated weights on worker 0-0, policy_version 728900 (0.00086) [2022-07-10 12:44:55,932][26022] Updated weights on worker 0-0, policy_version 728910 (0.00084) [2022-07-10 12:44:57,691][26022] Updated weights on worker 0-0, policy_version 728920 (0.00086) [2022-07-10 12:44:57,850][25689] Fps is (10 sec: 5499.9, 60 sec: 5510.2, 300 sec: 5516.0). Total num frames: 746414080. Throughput: 0: 5786.8. Samples: 746419166. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:44:57,851][25689] Avg episode reward: [(0, '-1.619')] [2022-07-10 12:44:59,692][26022] Updated weights on worker 0-0, policy_version 728930 (0.00096) [2022-07-10 12:45:01,512][26022] Updated weights on worker 0-0, policy_version 728940 (0.00090) [2022-07-10 12:45:02,904][25689] Fps is (10 sec: 5291.5, 60 sec: 5515.7, 300 sec: 5513.3). Total num frames: 746439680. Throughput: 0: 4988.6. Samples: 746435908. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:02,905][25689] Avg episode reward: [(0, '-2.607')] [2022-07-10 12:45:03,735][26022] Updated weights on worker 0-0, policy_version 728950 (0.00065) [2022-07-10 12:45:05,610][26022] Updated weights on worker 0-0, policy_version 728960 (0.00099) [2022-07-10 12:45:07,536][26022] Updated weights on worker 0-0, policy_version 728970 (0.00087) [2022-07-10 12:45:07,906][25689] Fps is (10 sec: 5193.3, 60 sec: 5515.6, 300 sec: 5511.4). Total num frames: 746466304. Throughput: 0: 5697.3. Samples: 746466894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:07,907][25689] Avg episode reward: [(0, '-3.923')] [2022-07-10 12:45:09,217][26022] Updated weights on worker 0-0, policy_version 728980 (0.00092) [2022-07-10 12:45:11,270][26022] Updated weights on worker 0-0, policy_version 728990 (0.00096) [2022-07-10 12:45:12,912][25689] Fps is (10 sec: 5525.4, 60 sec: 5519.0, 300 sec: 5514.9). Total num frames: 746494976. Throughput: 0: 5672.0. Samples: 746499728. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:12,912][25689] Avg episode reward: [(0, '-3.489')] [2022-07-10 12:45:13,061][26022] Updated weights on worker 0-0, policy_version 729000 (0.00082) [2022-07-10 12:45:15,098][26022] Updated weights on worker 0-0, policy_version 729010 (0.00089) [2022-07-10 12:45:16,823][26022] Updated weights on worker 0-0, policy_version 729020 (0.00084) [2022-07-10 12:45:17,927][25689] Fps is (10 sec: 5416.1, 60 sec: 5492.5, 300 sec: 5508.8). Total num frames: 746520576. Throughput: 0: 4840.0. Samples: 746516336. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:17,927][25689] Avg episode reward: [(0, '-4.453')] [2022-07-10 12:45:18,615][26022] Updated weights on worker 0-0, policy_version 729030 (0.00096) [2022-07-10 12:45:20,782][26022] Updated weights on worker 0-0, policy_version 729040 (0.00084) [2022-07-10 12:45:22,342][26022] Updated weights on worker 0-0, policy_version 729050 (0.00052) [2022-07-10 12:45:23,067][25689] Fps is (10 sec: 5445.1, 60 sec: 5484.7, 300 sec: 5513.3). Total num frames: 746550272. Throughput: 0: 5634.3. Samples: 746549512. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:23,067][25689] Avg episode reward: [(0, '-3.581')] [2022-07-10 12:45:24,283][26022] Updated weights on worker 0-0, policy_version 729060 (0.00091) [2022-07-10 12:45:26,096][26022] Updated weights on worker 0-0, policy_version 729070 (0.00088) [2022-07-10 12:45:28,038][26022] Updated weights on worker 0-0, policy_version 729080 (0.00081) [2022-07-10 12:45:28,135][25689] Fps is (10 sec: 5617.3, 60 sec: 5497.2, 300 sec: 5509.3). Total num frames: 746577920. Throughput: 0: 5714.6. Samples: 746582494. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:28,136][25689] Avg episode reward: [(0, '-2.790')] [2022-07-10 12:45:29,869][26022] Updated weights on worker 0-0, policy_version 729090 (0.00079) [2022-07-10 12:45:31,595][26022] Updated weights on worker 0-0, policy_version 729100 (0.00086) [2022-07-10 12:45:33,154][25689] Fps is (10 sec: 5482.0, 60 sec: 5479.7, 300 sec: 5506.2). Total num frames: 746605568. Throughput: 0: 4907.4. Samples: 746599064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 12:45:33,154][25689] Avg episode reward: [(0, '-1.157')] [2022-07-10 12:45:33,616][26022] Updated weights on worker 0-0, policy_version 729110 (0.00091) [2022-07-10 12:45:35,540][26022] Updated weights on worker 0-0, policy_version 729120 (0.00083) [2022-07-10 12:45:37,298][26022] Updated weights on worker 0-0, policy_version 729130 (0.00092) [2022-07-10 12:45:38,225][25689] Fps is (10 sec: 5582.0, 60 sec: 5476.2, 300 sec: 5509.7). Total num frames: 746634240. Throughput: 0: 5712.5. Samples: 746632288. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:45:38,225][25689] Avg episode reward: [(0, '-1.145')] [2022-07-10 12:45:39,137][26022] Updated weights on worker 0-0, policy_version 729140 (0.00088) [2022-07-10 12:45:40,895][26022] Updated weights on worker 0-0, policy_version 729150 (0.00096) [2022-07-10 12:45:42,759][26022] Updated weights on worker 0-0, policy_version 729160 (0.00097) [2022-07-10 12:45:43,329][25689] Fps is (10 sec: 5635.7, 60 sec: 5512.7, 300 sec: 5512.0). Total num frames: 746662912. Throughput: 0: 5736.7. Samples: 746665748. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:45:43,329][25689] Avg episode reward: [(0, '-2.671')] [2022-07-10 12:45:44,688][26022] Updated weights on worker 0-0, policy_version 729170 (0.00089) [2022-07-10 12:45:46,407][26022] Updated weights on worker 0-0, policy_version 729180 (0.00086) [2022-07-10 12:45:48,354][25689] Fps is (10 sec: 5459.0, 60 sec: 5461.7, 300 sec: 5512.7). Total num frames: 746689536. Throughput: 0: 5782.8. Samples: 746699416. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:45:48,355][25689] Avg episode reward: [(0, '-1.806')] [2022-07-10 12:45:48,390][26022] Updated weights on worker 0-0, policy_version 729190 (0.00102) [2022-07-10 12:45:50,203][26022] Updated weights on worker 0-0, policy_version 729200 (0.00101) [2022-07-10 12:45:51,957][26022] Updated weights on worker 0-0, policy_version 729210 (0.00089) [2022-07-10 12:45:53,418][25689] Fps is (10 sec: 5480.4, 60 sec: 5476.9, 300 sec: 5508.6). Total num frames: 746718208. Throughput: 0: 5764.7. Samples: 746715884. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:45:53,419][25689] Avg episode reward: [(0, '-2.012')] [2022-07-10 12:45:53,912][26022] Updated weights on worker 0-0, policy_version 729220 (0.00091) [2022-07-10 12:45:55,576][26022] Updated weights on worker 0-0, policy_version 729230 (0.00084) [2022-07-10 12:45:57,481][26022] Updated weights on worker 0-0, policy_version 729240 (0.00236) [2022-07-10 12:45:58,472][25689] Fps is (10 sec: 5667.6, 60 sec: 5489.8, 300 sec: 5516.6). Total num frames: 746746880. Throughput: 0: 5777.6. Samples: 746749268. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:45:58,473][25689] Avg episode reward: [(0, '-2.007')] [2022-07-10 12:45:59,455][26022] Updated weights on worker 0-0, policy_version 729250 (0.00086) [2022-07-10 12:46:01,151][26022] Updated weights on worker 0-0, policy_version 729260 (0.00092) [2022-07-10 12:46:03,437][26022] Updated weights on worker 0-0, policy_version 729270 (0.00099) [2022-07-10 12:46:03,557][25689] Fps is (10 sec: 5352.8, 60 sec: 5486.9, 300 sec: 5512.0). Total num frames: 746772480. Throughput: 0: 5677.6. Samples: 746780598. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:03,558][25689] Avg episode reward: [(0, '-1.402')] [2022-07-10 12:46:05,342][26022] Updated weights on worker 0-0, policy_version 729280 (0.00088) [2022-07-10 12:46:07,111][26022] Updated weights on worker 0-0, policy_version 729290 (0.00091) [2022-07-10 12:46:08,571][25689] Fps is (10 sec: 5171.1, 60 sec: 5485.9, 300 sec: 5508.8). Total num frames: 746799104. Throughput: 0: 4835.8. Samples: 746797182. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:08,572][25689] Avg episode reward: [(0, '-1.429')] [2022-07-10 12:46:09,025][26022] Updated weights on worker 0-0, policy_version 729300 (0.00087) [2022-07-10 12:46:11,009][26022] Updated weights on worker 0-0, policy_version 729310 (0.00087) [2022-07-10 12:46:12,658][26022] Updated weights on worker 0-0, policy_version 729320 (0.00086) [2022-07-10 12:46:13,579][25689] Fps is (10 sec: 5619.7, 60 sec: 5502.5, 300 sec: 5508.8). Total num frames: 746828800. Throughput: 0: 5684.0. Samples: 746830478. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:13,580][25689] Avg episode reward: [(0, '-1.903')] [2022-07-10 12:46:14,503][26022] Updated weights on worker 0-0, policy_version 729330 (0.00084) [2022-07-10 12:46:16,235][26022] Updated weights on worker 0-0, policy_version 729340 (0.00081) [2022-07-10 12:46:18,327][26022] Updated weights on worker 0-0, policy_version 729350 (0.00087) [2022-07-10 12:46:18,599][25689] Fps is (10 sec: 5616.5, 60 sec: 5519.0, 300 sec: 5514.4). Total num frames: 746855424. Throughput: 0: 5690.9. Samples: 746863808. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:18,599][25689] Avg episode reward: [(0, '-3.015')] [2022-07-10 12:46:20,088][26022] Updated weights on worker 0-0, policy_version 729360 (0.00084) [2022-07-10 12:46:21,973][26022] Updated weights on worker 0-0, policy_version 729370 (0.00081) [2022-07-10 12:46:23,725][25689] Fps is (10 sec: 5449.9, 60 sec: 5503.3, 300 sec: 5512.4). Total num frames: 746884096. Throughput: 0: 4950.9. Samples: 746880448. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:23,726][25689] Avg episode reward: [(0, '-2.756')] [2022-07-10 12:46:23,854][26022] Updated weights on worker 0-0, policy_version 729380 (0.00098) [2022-07-10 12:46:25,740][26022] Updated weights on worker 0-0, policy_version 729390 (0.00096) [2022-07-10 12:46:27,462][26022] Updated weights on worker 0-0, policy_version 729400 (0.00089) [2022-07-10 12:46:28,746][25689] Fps is (10 sec: 5550.0, 60 sec: 5507.6, 300 sec: 5515.7). Total num frames: 746911744. Throughput: 0: 5767.0. Samples: 746913534. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:28,747][25689] Avg episode reward: [(0, '-2.924')] [2022-07-10 12:46:29,488][26022] Updated weights on worker 0-0, policy_version 729410 (0.00093) [2022-07-10 12:46:31,127][26022] Updated weights on worker 0-0, policy_version 729420 (0.00099) [2022-07-10 12:46:32,329][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:46:32,342][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000729426_746932224.pth [2022-07-10 12:46:32,342][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000727487_744946688.pth [2022-07-10 12:46:33,126][26022] Updated weights on worker 0-0, policy_version 729430 (0.00050) [2022-07-10 12:46:33,767][25689] Fps is (10 sec: 5608.8, 60 sec: 5524.3, 300 sec: 5515.4). Total num frames: 746940416. Throughput: 0: 5766.6. Samples: 746946892. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:33,769][25689] Avg episode reward: [(0, '-2.916')] [2022-07-10 12:46:34,819][26022] Updated weights on worker 0-0, policy_version 729440 (0.00088) [2022-07-10 12:46:36,731][26022] Updated weights on worker 0-0, policy_version 729450 (0.00101) [2022-07-10 12:46:38,570][26022] Updated weights on worker 0-0, policy_version 729460 (0.00085) [2022-07-10 12:46:38,787][25689] Fps is (10 sec: 5609.1, 60 sec: 5512.0, 300 sec: 5513.3). Total num frames: 746968064. Throughput: 0: 4940.9. Samples: 746963558. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:38,788][25689] Avg episode reward: [(0, '-2.962')] [2022-07-10 12:46:40,613][26022] Updated weights on worker 0-0, policy_version 729470 (0.00089) [2022-07-10 12:46:42,271][26022] Updated weights on worker 0-0, policy_version 729480 (0.00088) [2022-07-10 12:46:43,873][25689] Fps is (10 sec: 5471.3, 60 sec: 5496.8, 300 sec: 5512.8). Total num frames: 746995712. Throughput: 0: 5772.0. Samples: 746996742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:43,874][25689] Avg episode reward: [(0, '-0.514')] [2022-07-10 12:46:44,303][26022] Updated weights on worker 0-0, policy_version 729490 (0.00086) [2022-07-10 12:46:45,795][26022] Updated weights on worker 0-0, policy_version 729500 (0.00083) [2022-07-10 12:46:47,984][26022] Updated weights on worker 0-0, policy_version 729510 (0.00089) [2022-07-10 12:46:48,888][25689] Fps is (10 sec: 5576.0, 60 sec: 5531.6, 300 sec: 5513.2). Total num frames: 747024384. Throughput: 0: 5779.6. Samples: 747029942. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:48,888][25689] Avg episode reward: [(0, '0.354')] [2022-07-10 12:46:49,741][26022] Updated weights on worker 0-0, policy_version 729520 (0.00088) [2022-07-10 12:46:51,548][26022] Updated weights on worker 0-0, policy_version 729530 (0.00087) [2022-07-10 12:46:53,423][26022] Updated weights on worker 0-0, policy_version 729540 (0.00090) [2022-07-10 12:46:53,943][25689] Fps is (10 sec: 5491.4, 60 sec: 5498.6, 300 sec: 5513.6). Total num frames: 747051008. Throughput: 0: 4942.1. Samples: 747046604. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:53,943][25689] Avg episode reward: [(0, '0.079')] [2022-07-10 12:46:55,162][26022] Updated weights on worker 0-0, policy_version 729550 (0.00093) [2022-07-10 12:46:57,150][26022] Updated weights on worker 0-0, policy_version 729560 (0.00090) [2022-07-10 12:46:58,971][25689] Fps is (10 sec: 5382.7, 60 sec: 5484.0, 300 sec: 5511.6). Total num frames: 747078656. Throughput: 0: 5766.5. Samples: 747079944. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:46:58,971][25689] Avg episode reward: [(0, '-0.159')] [2022-07-10 12:46:59,034][26022] Updated weights on worker 0-0, policy_version 729570 (0.00086) [2022-07-10 12:47:00,679][26022] Updated weights on worker 0-0, policy_version 729580 (0.00092) [2022-07-10 12:47:02,953][26022] Updated weights on worker 0-0, policy_version 729590 (0.00090) [2022-07-10 12:47:04,095][25689] Fps is (10 sec: 5446.9, 60 sec: 5514.3, 300 sec: 5510.7). Total num frames: 747106304. Throughput: 0: 5658.2. Samples: 747111160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:04,097][25689] Avg episode reward: [(0, '-1.602')] [2022-07-10 12:47:04,854][26022] Updated weights on worker 0-0, policy_version 729600 (0.00086) [2022-07-10 12:47:06,660][26022] Updated weights on worker 0-0, policy_version 729610 (0.00099) [2022-07-10 12:47:08,644][26022] Updated weights on worker 0-0, policy_version 729620 (0.00085) [2022-07-10 12:47:09,160][25689] Fps is (10 sec: 5326.5, 60 sec: 5509.7, 300 sec: 5509.9). Total num frames: 747132928. Throughput: 0: 4824.4. Samples: 747127742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:09,161][25689] Avg episode reward: [(0, '-1.820')] [2022-07-10 12:47:10,262][26022] Updated weights on worker 0-0, policy_version 729630 (0.00092) [2022-07-10 12:47:12,387][26022] Updated weights on worker 0-0, policy_version 729640 (0.00096) [2022-07-10 12:47:13,963][26022] Updated weights on worker 0-0, policy_version 729650 (0.00095) [2022-07-10 12:47:14,255][25689] Fps is (10 sec: 5543.3, 60 sec: 5501.8, 300 sec: 5509.0). Total num frames: 747162624. Throughput: 0: 5630.0. Samples: 747160960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:14,256][25689] Avg episode reward: [(0, '-2.255')] [2022-07-10 12:47:15,931][26022] Updated weights on worker 0-0, policy_version 729660 (0.00080) [2022-07-10 12:47:17,808][26022] Updated weights on worker 0-0, policy_version 729670 (0.00094) [2022-07-10 12:47:19,274][25689] Fps is (10 sec: 5568.2, 60 sec: 5501.8, 300 sec: 5507.1). Total num frames: 747189248. Throughput: 0: 5645.2. Samples: 747194562. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:19,275][25689] Avg episode reward: [(0, '-3.159')] [2022-07-10 12:47:19,686][26022] Updated weights on worker 0-0, policy_version 729680 (0.00089) [2022-07-10 12:47:21,378][26022] Updated weights on worker 0-0, policy_version 729690 (0.00095) [2022-07-10 12:47:23,286][26022] Updated weights on worker 0-0, policy_version 729700 (0.00083) [2022-07-10 12:47:24,347][25689] Fps is (10 sec: 5479.0, 60 sec: 5506.7, 300 sec: 5509.6). Total num frames: 747217920. Throughput: 0: 4935.6. Samples: 747211118. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:24,349][25689] Avg episode reward: [(0, '-3.517')] [2022-07-10 12:47:25,115][26022] Updated weights on worker 0-0, policy_version 729710 (0.00094) [2022-07-10 12:47:27,031][26022] Updated weights on worker 0-0, policy_version 729720 (0.00114) [2022-07-10 12:47:28,862][26022] Updated weights on worker 0-0, policy_version 729730 (0.00094) [2022-07-10 12:47:29,433][25689] Fps is (10 sec: 5543.8, 60 sec: 5500.8, 300 sec: 5506.2). Total num frames: 747245568. Throughput: 0: 5768.9. Samples: 747244698. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:29,434][25689] Avg episode reward: [(0, '-4.643')] [2022-07-10 12:47:30,608][26022] Updated weights on worker 0-0, policy_version 729740 (0.00086) [2022-07-10 12:47:32,529][26022] Updated weights on worker 0-0, policy_version 729750 (0.00085) [2022-07-10 12:47:34,318][26022] Updated weights on worker 0-0, policy_version 729760 (0.00086) [2022-07-10 12:47:34,450][25689] Fps is (10 sec: 5676.2, 60 sec: 5518.0, 300 sec: 5513.9). Total num frames: 747275264. Throughput: 0: 5797.6. Samples: 747278042. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:34,452][25689] Avg episode reward: [(0, '-3.136')] [2022-07-10 12:47:36,275][26022] Updated weights on worker 0-0, policy_version 729770 (0.00087) [2022-07-10 12:47:37,982][26022] Updated weights on worker 0-0, policy_version 729780 (0.00087) [2022-07-10 12:47:39,491][25689] Fps is (10 sec: 5599.8, 60 sec: 5499.3, 300 sec: 5508.0). Total num frames: 747301888. Throughput: 0: 5787.7. Samples: 747311570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:39,491][25689] Avg episode reward: [(0, '-4.365')] [2022-07-10 12:47:40,079][26022] Updated weights on worker 0-0, policy_version 729790 (0.00095) [2022-07-10 12:47:41,639][26022] Updated weights on worker 0-0, policy_version 729800 (0.00089) [2022-07-10 12:47:43,568][26022] Updated weights on worker 0-0, policy_version 729810 (0.00091) [2022-07-10 12:47:44,607][25689] Fps is (10 sec: 5544.9, 60 sec: 5530.3, 300 sec: 5513.0). Total num frames: 747331584. Throughput: 0: 5787.1. Samples: 747328362. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:44,607][25689] Avg episode reward: [(0, '-3.520')] [2022-07-10 12:47:45,362][26022] Updated weights on worker 0-0, policy_version 729820 (0.00085) [2022-07-10 12:47:47,195][26022] Updated weights on worker 0-0, policy_version 729830 (0.00094) [2022-07-10 12:47:48,856][26022] Updated weights on worker 0-0, policy_version 729840 (0.00095) [2022-07-10 12:47:49,621][25689] Fps is (10 sec: 5559.4, 60 sec: 5496.5, 300 sec: 5502.7). Total num frames: 747358208. Throughput: 0: 5822.0. Samples: 747362232. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:49,622][25689] Avg episode reward: [(0, '-3.395')] [2022-07-10 12:47:50,712][26022] Updated weights on worker 0-0, policy_version 729850 (0.00085) [2022-07-10 12:47:52,520][26022] Updated weights on worker 0-0, policy_version 729860 (0.00082) [2022-07-10 12:47:54,485][26022] Updated weights on worker 0-0, policy_version 729870 (0.00090) [2022-07-10 12:47:54,636][25689] Fps is (10 sec: 5513.4, 60 sec: 5533.9, 300 sec: 5513.6). Total num frames: 747386880. Throughput: 0: 5845.1. Samples: 747396034. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:54,637][25689] Avg episode reward: [(0, '-2.875')] [2022-07-10 12:47:56,512][26022] Updated weights on worker 0-0, policy_version 729880 (0.00085) [2022-07-10 12:47:58,056][26022] Updated weights on worker 0-0, policy_version 729890 (0.00095) [2022-07-10 12:47:59,651][25689] Fps is (10 sec: 5615.4, 60 sec: 5535.1, 300 sec: 5509.3). Total num frames: 747414528. Throughput: 0: 5030.2. Samples: 747412980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:47:59,652][25689] Avg episode reward: [(0, '-3.171')] [2022-07-10 12:48:00,015][26022] Updated weights on worker 0-0, policy_version 729900 (0.00087) [2022-07-10 12:48:01,790][26022] Updated weights on worker 0-0, policy_version 729910 (0.00096) [2022-07-10 12:48:04,240][26022] Updated weights on worker 0-0, policy_version 729920 (0.00092) [2022-07-10 12:48:04,704][25689] Fps is (10 sec: 5390.5, 60 sec: 5524.7, 300 sec: 5508.6). Total num frames: 747441152. Throughput: 0: 5752.1. Samples: 747443964. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:04,707][25689] Avg episode reward: [(0, '-2.756')] [2022-07-10 12:48:05,928][26022] Updated weights on worker 0-0, policy_version 729930 (0.00084) [2022-07-10 12:48:07,687][26022] Updated weights on worker 0-0, policy_version 729940 (0.00389) [2022-07-10 12:48:09,614][26022] Updated weights on worker 0-0, policy_version 729950 (0.00092) [2022-07-10 12:48:09,723][25689] Fps is (10 sec: 5388.3, 60 sec: 5545.8, 300 sec: 5512.3). Total num frames: 747468800. Throughput: 0: 5724.6. Samples: 747477306. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:09,723][25689] Avg episode reward: [(0, '-2.725')] [2022-07-10 12:48:11,410][26022] Updated weights on worker 0-0, policy_version 729960 (0.00091) [2022-07-10 12:48:13,256][26022] Updated weights on worker 0-0, policy_version 729970 (0.00092) [2022-07-10 12:48:14,772][25689] Fps is (10 sec: 5593.9, 60 sec: 5533.1, 300 sec: 5512.1). Total num frames: 747497472. Throughput: 0: 4876.2. Samples: 747494222. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:14,773][25689] Avg episode reward: [(0, '-3.207')] [2022-07-10 12:48:15,131][26022] Updated weights on worker 0-0, policy_version 729980 (0.00088) [2022-07-10 12:48:16,814][26022] Updated weights on worker 0-0, policy_version 729990 (0.00086) [2022-07-10 12:48:18,735][26022] Updated weights on worker 0-0, policy_version 730000 (0.00087) [2022-07-10 12:48:19,830][25689] Fps is (10 sec: 5572.6, 60 sec: 5546.5, 300 sec: 5513.0). Total num frames: 747525120. Throughput: 0: 5707.6. Samples: 747528152. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:19,831][25689] Avg episode reward: [(0, '-4.355')] [2022-07-10 12:48:20,485][26022] Updated weights on worker 0-0, policy_version 730010 (0.00106) [2022-07-10 12:48:22,279][26022] Updated weights on worker 0-0, policy_version 730020 (0.00088) [2022-07-10 12:48:24,284][26022] Updated weights on worker 0-0, policy_version 730030 (0.00089) [2022-07-10 12:48:24,898][25689] Fps is (10 sec: 5764.6, 60 sec: 5580.8, 300 sec: 5518.7). Total num frames: 747555840. Throughput: 0: 5848.1. Samples: 747562056. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:24,899][25689] Avg episode reward: [(0, '-5.096')] [2022-07-10 12:48:26,000][26022] Updated weights on worker 0-0, policy_version 730040 (0.00114) [2022-07-10 12:48:27,813][26022] Updated weights on worker 0-0, policy_version 730050 (0.00085) [2022-07-10 12:48:29,688][26022] Updated weights on worker 0-0, policy_version 730060 (0.00094) [2022-07-10 12:48:29,968][25689] Fps is (10 sec: 5656.0, 60 sec: 5565.3, 300 sec: 5510.9). Total num frames: 747582464. Throughput: 0: 5016.4. Samples: 747578864. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:29,969][25689] Avg episode reward: [(0, '-4.252')] [2022-07-10 12:48:31,490][26022] Updated weights on worker 0-0, policy_version 730070 (0.00086) [2022-07-10 12:48:32,520][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:48:32,535][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000730077_747598848.pth [2022-07-10 12:48:32,536][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000728134_745609216.pth [2022-07-10 12:48:33,308][26022] Updated weights on worker 0-0, policy_version 730080 (0.00077) [2022-07-10 12:48:34,979][25689] Fps is (10 sec: 5484.9, 60 sec: 5548.9, 300 sec: 5517.6). Total num frames: 747611136. Throughput: 0: 5841.3. Samples: 747612256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:34,981][25689] Avg episode reward: [(0, '-4.681')] [2022-07-10 12:48:35,138][26022] Updated weights on worker 0-0, policy_version 730090 (0.00086) [2022-07-10 12:48:37,023][26022] Updated weights on worker 0-0, policy_version 730100 (0.00090) [2022-07-10 12:48:38,755][26022] Updated weights on worker 0-0, policy_version 730110 (0.00086) [2022-07-10 12:48:40,036][25689] Fps is (10 sec: 5695.7, 60 sec: 5581.3, 300 sec: 5517.8). Total num frames: 747639808. Throughput: 0: 5820.2. Samples: 747645758. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:40,037][25689] Avg episode reward: [(0, '-3.819')] [2022-07-10 12:48:40,944][26022] Updated weights on worker 0-0, policy_version 730120 (0.00092) [2022-07-10 12:48:42,291][26022] Updated weights on worker 0-0, policy_version 730130 (0.00092) [2022-07-10 12:48:44,409][26022] Updated weights on worker 0-0, policy_version 730140 (0.00086) [2022-07-10 12:48:45,099][25689] Fps is (10 sec: 5464.4, 60 sec: 5535.4, 300 sec: 5510.4). Total num frames: 747666432. Throughput: 0: 4975.3. Samples: 747662562. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:45,099][25689] Avg episode reward: [(0, '-3.886')] [2022-07-10 12:48:46,179][26022] Updated weights on worker 0-0, policy_version 730150 (0.00086) [2022-07-10 12:48:47,939][26022] Updated weights on worker 0-0, policy_version 730160 (0.00091) [2022-07-10 12:48:49,786][26022] Updated weights on worker 0-0, policy_version 730170 (0.00093) [2022-07-10 12:48:50,115][25689] Fps is (10 sec: 5486.5, 60 sec: 5569.1, 300 sec: 5520.6). Total num frames: 747695104. Throughput: 0: 5819.1. Samples: 747696098. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:50,115][25689] Avg episode reward: [(0, '-3.943')] [2022-07-10 12:48:51,656][26022] Updated weights on worker 0-0, policy_version 730180 (0.00088) [2022-07-10 12:48:53,480][26022] Updated weights on worker 0-0, policy_version 730190 (0.00089) [2022-07-10 12:48:55,123][25689] Fps is (10 sec: 5618.1, 60 sec: 5552.8, 300 sec: 5514.4). Total num frames: 747722752. Throughput: 0: 5802.9. Samples: 747729150. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 12:48:55,124][25689] Avg episode reward: [(0, '-3.744')] [2022-07-10 12:48:55,427][26022] Updated weights on worker 0-0, policy_version 730200 (0.00088) [2022-07-10 12:48:57,303][26022] Updated weights on worker 0-0, policy_version 730210 (0.00089) [2022-07-10 12:48:59,188][26022] Updated weights on worker 0-0, policy_version 730220 (0.00090) [2022-07-10 12:49:00,135][25689] Fps is (10 sec: 5518.5, 60 sec: 5553.1, 300 sec: 5523.2). Total num frames: 747750400. Throughput: 0: 4978.9. Samples: 747745828. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:00,135][25689] Avg episode reward: [(0, '-2.791')] [2022-07-10 12:49:01,057][26022] Updated weights on worker 0-0, policy_version 730230 (0.00083) [2022-07-10 12:49:03,022][26022] Updated weights on worker 0-0, policy_version 730240 (0.00084) [2022-07-10 12:49:05,188][25689] Fps is (10 sec: 5188.9, 60 sec: 5519.3, 300 sec: 5515.3). Total num frames: 747774976. Throughput: 0: 5686.6. Samples: 747776802. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:05,189][25689] Avg episode reward: [(0, '-3.328')] [2022-07-10 12:49:05,232][26022] Updated weights on worker 0-0, policy_version 730250 (0.00086) [2022-07-10 12:49:06,828][26022] Updated weights on worker 0-0, policy_version 730260 (0.00088) [2022-07-10 12:49:08,895][26022] Updated weights on worker 0-0, policy_version 730270 (0.00085) [2022-07-10 12:49:10,215][25689] Fps is (10 sec: 5282.6, 60 sec: 5535.4, 300 sec: 5515.6). Total num frames: 747803648. Throughput: 0: 5675.4. Samples: 747810174. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:10,215][25689] Avg episode reward: [(0, '-3.259')] [2022-07-10 12:49:10,639][26022] Updated weights on worker 0-0, policy_version 730280 (0.00492) [2022-07-10 12:49:12,398][26022] Updated weights on worker 0-0, policy_version 730290 (0.00092) [2022-07-10 12:49:14,391][26022] Updated weights on worker 0-0, policy_version 730300 (0.00080) [2022-07-10 12:49:15,233][25689] Fps is (10 sec: 5606.9, 60 sec: 5521.4, 300 sec: 5517.1). Total num frames: 747831296. Throughput: 0: 4842.8. Samples: 747826536. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:15,233][25689] Avg episode reward: [(0, '-3.274')] [2022-07-10 12:49:16,206][26022] Updated weights on worker 0-0, policy_version 730310 (0.00090) [2022-07-10 12:49:17,999][26022] Updated weights on worker 0-0, policy_version 730320 (0.00606) [2022-07-10 12:49:19,834][26022] Updated weights on worker 0-0, policy_version 730330 (0.00089) [2022-07-10 12:49:20,269][25689] Fps is (10 sec: 5499.6, 60 sec: 5523.3, 300 sec: 5510.5). Total num frames: 747858944. Throughput: 0: 5669.7. Samples: 747859984. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:20,270][25689] Avg episode reward: [(0, '-1.361')] [2022-07-10 12:49:21,731][26022] Updated weights on worker 0-0, policy_version 730340 (0.00086) [2022-07-10 12:49:23,499][26022] Updated weights on worker 0-0, policy_version 730350 (0.00081) [2022-07-10 12:49:25,386][25689] Fps is (10 sec: 5546.8, 60 sec: 5484.9, 300 sec: 5515.6). Total num frames: 747887616. Throughput: 0: 5769.6. Samples: 747893340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:25,387][25689] Avg episode reward: [(0, '-1.158')] [2022-07-10 12:49:25,494][26022] Updated weights on worker 0-0, policy_version 730360 (0.00089) [2022-07-10 12:49:27,087][26022] Updated weights on worker 0-0, policy_version 730370 (0.00091) [2022-07-10 12:49:29,288][26022] Updated weights on worker 0-0, policy_version 730380 (0.00095) [2022-07-10 12:49:30,392][25689] Fps is (10 sec: 5664.8, 60 sec: 5524.7, 300 sec: 5515.7). Total num frames: 747916288. Throughput: 0: 4942.6. Samples: 747909902. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:30,392][25689] Avg episode reward: [(0, '-0.905')] [2022-07-10 12:49:31,095][26022] Updated weights on worker 0-0, policy_version 730390 (0.00095) [2022-07-10 12:49:32,857][26022] Updated weights on worker 0-0, policy_version 730400 (0.00091) [2022-07-10 12:49:34,648][26022] Updated weights on worker 0-0, policy_version 730410 (0.00106) [2022-07-10 12:49:35,411][25689] Fps is (10 sec: 5414.1, 60 sec: 5473.2, 300 sec: 5505.6). Total num frames: 747941888. Throughput: 0: 5769.3. Samples: 747942948. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:35,411][25689] Avg episode reward: [(0, '-0.530')] [2022-07-10 12:49:36,397][26022] Updated weights on worker 0-0, policy_version 730420 (0.00081) [2022-07-10 12:49:38,443][26022] Updated weights on worker 0-0, policy_version 730430 (0.00095) [2022-07-10 12:49:40,111][26022] Updated weights on worker 0-0, policy_version 730440 (0.00086) [2022-07-10 12:49:40,428][25689] Fps is (10 sec: 5509.7, 60 sec: 5493.7, 300 sec: 5518.1). Total num frames: 747971584. Throughput: 0: 5780.2. Samples: 747976506. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:40,429][25689] Avg episode reward: [(0, '-0.981')] [2022-07-10 12:49:42,175][26022] Updated weights on worker 0-0, policy_version 730450 (0.00095) [2022-07-10 12:49:43,694][26022] Updated weights on worker 0-0, policy_version 730460 (0.00087) [2022-07-10 12:49:45,530][25689] Fps is (10 sec: 5565.5, 60 sec: 5490.1, 300 sec: 5506.3). Total num frames: 747998208. Throughput: 0: 4950.5. Samples: 747993062. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:45,531][25689] Avg episode reward: [(0, '-1.083')] [2022-07-10 12:49:45,950][26022] Updated weights on worker 0-0, policy_version 730470 (0.00088) [2022-07-10 12:49:47,462][26022] Updated weights on worker 0-0, policy_version 730480 (0.00085) [2022-07-10 12:49:49,456][26022] Updated weights on worker 0-0, policy_version 730490 (0.00080) [2022-07-10 12:49:50,580][25689] Fps is (10 sec: 5547.9, 60 sec: 5504.0, 300 sec: 5513.1). Total num frames: 748027904. Throughput: 0: 5773.5. Samples: 748026456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:50,580][25689] Avg episode reward: [(0, '-0.668')] [2022-07-10 12:49:51,180][26022] Updated weights on worker 0-0, policy_version 730500 (0.00101) [2022-07-10 12:49:53,152][26022] Updated weights on worker 0-0, policy_version 730510 (0.00085) [2022-07-10 12:49:55,055][26022] Updated weights on worker 0-0, policy_version 730520 (0.00085) [2022-07-10 12:49:55,625][25689] Fps is (10 sec: 5680.2, 60 sec: 5500.6, 300 sec: 5512.4). Total num frames: 748055552. Throughput: 0: 5804.1. Samples: 748060278. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:49:55,626][25689] Avg episode reward: [(0, '-0.015')] [2022-07-10 12:49:56,831][26022] Updated weights on worker 0-0, policy_version 730530 (0.00092) [2022-07-10 12:49:58,515][26022] Updated weights on worker 0-0, policy_version 730540 (0.00092) [2022-07-10 12:50:00,371][26022] Updated weights on worker 0-0, policy_version 730550 (0.00098) [2022-07-10 12:50:00,633][25689] Fps is (10 sec: 5602.4, 60 sec: 5518.0, 300 sec: 5523.6). Total num frames: 748084224. Throughput: 0: 4976.0. Samples: 748077044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:00,633][25689] Avg episode reward: [(0, '-0.337')] [2022-07-10 12:50:02,580][26022] Updated weights on worker 0-0, policy_version 730560 (0.00088) [2022-07-10 12:50:04,525][26022] Updated weights on worker 0-0, policy_version 730570 (0.00086) [2022-07-10 12:50:05,709][25689] Fps is (10 sec: 5483.9, 60 sec: 5549.7, 300 sec: 5522.3). Total num frames: 748110848. Throughput: 0: 5717.6. Samples: 748108436. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:05,710][25689] Avg episode reward: [(0, '-2.734')] [2022-07-10 12:50:06,303][26022] Updated weights on worker 0-0, policy_version 730580 (0.00087) [2022-07-10 12:50:08,115][26022] Updated weights on worker 0-0, policy_version 730590 (0.00095) [2022-07-10 12:50:10,108][26022] Updated weights on worker 0-0, policy_version 730600 (0.00081) [2022-07-10 12:50:10,727][25689] Fps is (10 sec: 5275.1, 60 sec: 5516.7, 300 sec: 5515.2). Total num frames: 748137472. Throughput: 0: 5730.4. Samples: 748141906. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:10,727][25689] Avg episode reward: [(0, '-3.833')] [2022-07-10 12:50:11,690][26022] Updated weights on worker 0-0, policy_version 730610 (0.00096) [2022-07-10 12:50:13,792][26022] Updated weights on worker 0-0, policy_version 730620 (0.00092) [2022-07-10 12:50:15,307][26022] Updated weights on worker 0-0, policy_version 730630 (0.00081) [2022-07-10 12:50:15,730][25689] Fps is (10 sec: 5518.0, 60 sec: 5534.9, 300 sec: 5525.7). Total num frames: 748166144. Throughput: 0: 5729.2. Samples: 748175460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:15,730][25689] Avg episode reward: [(0, '-3.576')] [2022-07-10 12:50:17,435][26022] Updated weights on worker 0-0, policy_version 730640 (0.00090) [2022-07-10 12:50:19,103][26022] Updated weights on worker 0-0, policy_version 730650 (0.00087) [2022-07-10 12:50:20,736][25689] Fps is (10 sec: 5524.5, 60 sec: 5520.8, 300 sec: 5517.9). Total num frames: 748192768. Throughput: 0: 5716.6. Samples: 748191968. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:20,737][25689] Avg episode reward: [(0, '-4.297')] [2022-07-10 12:50:21,207][26022] Updated weights on worker 0-0, policy_version 730660 (0.00084) [2022-07-10 12:50:22,778][26022] Updated weights on worker 0-0, policy_version 730670 (0.00087) [2022-07-10 12:50:24,843][26022] Updated weights on worker 0-0, policy_version 730680 (0.00085) [2022-07-10 12:50:25,789][25689] Fps is (10 sec: 5598.8, 60 sec: 5543.6, 300 sec: 5525.0). Total num frames: 748222464. Throughput: 0: 5818.9. Samples: 748225282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:25,790][25689] Avg episode reward: [(0, '-4.216')] [2022-07-10 12:50:26,603][26022] Updated weights on worker 0-0, policy_version 730690 (0.00087) [2022-07-10 12:50:28,482][26022] Updated weights on worker 0-0, policy_version 730700 (0.00094) [2022-07-10 12:50:30,176][26022] Updated weights on worker 0-0, policy_version 730710 (0.00088) [2022-07-10 12:50:30,839][25689] Fps is (10 sec: 5777.2, 60 sec: 5539.5, 300 sec: 5527.9). Total num frames: 748251136. Throughput: 0: 5809.4. Samples: 748258748. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:30,840][25689] Avg episode reward: [(0, '-4.711')] [2022-07-10 12:50:32,232][26022] Updated weights on worker 0-0, policy_version 730720 (0.00091) [2022-07-10 12:50:32,556][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:50:32,570][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000730721_748258304.pth [2022-07-10 12:50:32,580][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000728781_746271744.pth [2022-07-10 12:50:33,862][26022] Updated weights on worker 0-0, policy_version 730730 (0.00091) [2022-07-10 12:50:35,906][25689] Fps is (10 sec: 5364.4, 60 sec: 5535.1, 300 sec: 5517.6). Total num frames: 748276736. Throughput: 0: 4950.6. Samples: 748275346. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:35,907][25689] Avg episode reward: [(0, '-2.876')] [2022-07-10 12:50:36,031][26022] Updated weights on worker 0-0, policy_version 730740 (0.00108) [2022-07-10 12:50:37,364][26022] Updated weights on worker 0-0, policy_version 730750 (0.00100) [2022-07-10 12:50:39,527][26022] Updated weights on worker 0-0, policy_version 730760 (0.00086) [2022-07-10 12:50:40,930][25689] Fps is (10 sec: 5581.4, 60 sec: 5551.5, 300 sec: 5526.0). Total num frames: 748307456. Throughput: 0: 5811.3. Samples: 748309320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:40,930][25689] Avg episode reward: [(0, '-3.253')] [2022-07-10 12:50:41,002][26022] Updated weights on worker 0-0, policy_version 730770 (0.00090) [2022-07-10 12:50:43,091][26022] Updated weights on worker 0-0, policy_version 730780 (0.00109) [2022-07-10 12:50:45,063][26022] Updated weights on worker 0-0, policy_version 730790 (0.00094) [2022-07-10 12:50:46,002][25689] Fps is (10 sec: 5781.5, 60 sec: 5571.1, 300 sec: 5528.6). Total num frames: 748335104. Throughput: 0: 5808.5. Samples: 748342688. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:46,002][25689] Avg episode reward: [(0, '-3.660')] [2022-07-10 12:50:46,596][26022] Updated weights on worker 0-0, policy_version 730800 (0.00095) [2022-07-10 12:50:48,503][26022] Updated weights on worker 0-0, policy_version 730810 (0.00087) [2022-07-10 12:50:50,532][26022] Updated weights on worker 0-0, policy_version 730820 (0.00091) [2022-07-10 12:50:51,024][25689] Fps is (10 sec: 5477.8, 60 sec: 5539.8, 300 sec: 5525.9). Total num frames: 748362752. Throughput: 0: 4989.6. Samples: 748359464. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:51,025][25689] Avg episode reward: [(0, '-2.989')] [2022-07-10 12:50:52,239][26022] Updated weights on worker 0-0, policy_version 730830 (0.00092) [2022-07-10 12:50:54,086][26022] Updated weights on worker 0-0, policy_version 730840 (0.00090) [2022-07-10 12:50:55,980][26022] Updated weights on worker 0-0, policy_version 730850 (0.00092) [2022-07-10 12:50:56,039][25689] Fps is (10 sec: 5509.2, 60 sec: 5542.6, 300 sec: 5523.2). Total num frames: 748390400. Throughput: 0: 5843.7. Samples: 748392996. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:50:56,039][25689] Avg episode reward: [(0, '-3.626')] [2022-07-10 12:50:57,759][26022] Updated weights on worker 0-0, policy_version 730860 (0.00086) [2022-07-10 12:50:59,648][26022] Updated weights on worker 0-0, policy_version 730870 (0.00088) [2022-07-10 12:51:01,058][25689] Fps is (10 sec: 5511.1, 60 sec: 5524.6, 300 sec: 5531.3). Total num frames: 748418048. Throughput: 0: 5818.4. Samples: 748426434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:01,058][25689] Avg episode reward: [(0, '-4.092')] [2022-07-10 12:51:01,647][26022] Updated weights on worker 0-0, policy_version 730880 (0.00110) [2022-07-10 12:51:03,746][26022] Updated weights on worker 0-0, policy_version 730890 (0.00090) [2022-07-10 12:51:05,486][26022] Updated weights on worker 0-0, policy_version 730900 (0.00089) [2022-07-10 12:51:06,140][25689] Fps is (10 sec: 5372.7, 60 sec: 5524.0, 300 sec: 5530.0). Total num frames: 748444672. Throughput: 0: 4887.5. Samples: 748441114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:06,141][25689] Avg episode reward: [(0, '-3.955')] [2022-07-10 12:51:07,365][26022] Updated weights on worker 0-0, policy_version 730910 (0.00082) [2022-07-10 12:51:09,127][26022] Updated weights on worker 0-0, policy_version 730920 (0.00079) [2022-07-10 12:51:11,126][26022] Updated weights on worker 0-0, policy_version 730930 (0.00086) [2022-07-10 12:51:11,150][25689] Fps is (10 sec: 5377.7, 60 sec: 5541.7, 300 sec: 5523.1). Total num frames: 748472320. Throughput: 0: 5718.7. Samples: 748474556. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:11,150][25689] Avg episode reward: [(0, '-3.908')] [2022-07-10 12:51:12,716][26022] Updated weights on worker 0-0, policy_version 730940 (0.00087) [2022-07-10 12:51:14,688][26022] Updated weights on worker 0-0, policy_version 730950 (0.00095) [2022-07-10 12:51:16,240][25689] Fps is (10 sec: 5677.5, 60 sec: 5550.6, 300 sec: 5532.1). Total num frames: 748502016. Throughput: 0: 5695.0. Samples: 748508044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:16,241][25689] Avg episode reward: [(0, '-3.996')] [2022-07-10 12:51:16,528][26022] Updated weights on worker 0-0, policy_version 730960 (0.00092) [2022-07-10 12:51:18,288][26022] Updated weights on worker 0-0, policy_version 730970 (0.00090) [2022-07-10 12:51:20,052][26022] Updated weights on worker 0-0, policy_version 730980 (0.00100) [2022-07-10 12:51:21,270][25689] Fps is (10 sec: 5565.2, 60 sec: 5548.5, 300 sec: 5527.1). Total num frames: 748528640. Throughput: 0: 4877.0. Samples: 748525008. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:21,271][25689] Avg episode reward: [(0, '-6.091')] [2022-07-10 12:51:21,831][26022] Updated weights on worker 0-0, policy_version 730990 (0.00083) [2022-07-10 12:51:23,901][26022] Updated weights on worker 0-0, policy_version 731000 (0.00087) [2022-07-10 12:51:25,497][26022] Updated weights on worker 0-0, policy_version 731010 (0.00088) [2022-07-10 12:51:26,396][25689] Fps is (10 sec: 5444.9, 60 sec: 5524.9, 300 sec: 5528.5). Total num frames: 748557312. Throughput: 0: 5797.9. Samples: 748558554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:26,396][25689] Avg episode reward: [(0, '-4.725')] [2022-07-10 12:51:27,569][26022] Updated weights on worker 0-0, policy_version 731020 (0.00089) [2022-07-10 12:51:29,347][26022] Updated weights on worker 0-0, policy_version 731030 (0.00098) [2022-07-10 12:51:31,231][26022] Updated weights on worker 0-0, policy_version 731040 (0.00085) [2022-07-10 12:51:31,432][25689] Fps is (10 sec: 5642.8, 60 sec: 5526.2, 300 sec: 5528.2). Total num frames: 748585984. Throughput: 0: 5785.7. Samples: 748591904. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:31,433][25689] Avg episode reward: [(0, '-4.662')] [2022-07-10 12:51:33,125][26022] Updated weights on worker 0-0, policy_version 731050 (0.00093) [2022-07-10 12:51:34,744][26022] Updated weights on worker 0-0, policy_version 731060 (0.00095) [2022-07-10 12:51:36,534][25689] Fps is (10 sec: 5555.5, 60 sec: 5556.8, 300 sec: 5526.7). Total num frames: 748613632. Throughput: 0: 4960.2. Samples: 748608702. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:36,534][25689] Avg episode reward: [(0, '-5.258')] [2022-07-10 12:51:36,795][26022] Updated weights on worker 0-0, policy_version 731070 (0.00091) [2022-07-10 12:51:38,554][26022] Updated weights on worker 0-0, policy_version 731080 (0.00088) [2022-07-10 12:51:40,517][26022] Updated weights on worker 0-0, policy_version 731090 (0.00090) [2022-07-10 12:51:41,566][25689] Fps is (10 sec: 5557.5, 60 sec: 5522.2, 300 sec: 5531.2). Total num frames: 748642304. Throughput: 0: 5761.9. Samples: 748641954. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:41,568][25689] Avg episode reward: [(0, '-4.415')] [2022-07-10 12:51:42,321][26022] Updated weights on worker 0-0, policy_version 731100 (0.00087) [2022-07-10 12:51:44,221][26022] Updated weights on worker 0-0, policy_version 731110 (0.00092) [2022-07-10 12:51:45,915][26022] Updated weights on worker 0-0, policy_version 731120 (0.00088) [2022-07-10 12:51:46,661][25689] Fps is (10 sec: 5459.9, 60 sec: 5503.2, 300 sec: 5522.8). Total num frames: 748668928. Throughput: 0: 5768.0. Samples: 748675444. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:46,662][25689] Avg episode reward: [(0, '-2.345')] [2022-07-10 12:51:47,964][26022] Updated weights on worker 0-0, policy_version 731130 (0.00091) [2022-07-10 12:51:49,581][26022] Updated weights on worker 0-0, policy_version 731140 (0.00084) [2022-07-10 12:51:51,448][26022] Updated weights on worker 0-0, policy_version 731150 (0.00092) [2022-07-10 12:51:51,664][25689] Fps is (10 sec: 5577.5, 60 sec: 5538.8, 300 sec: 5534.1). Total num frames: 748698624. Throughput: 0: 4963.6. Samples: 748692326. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:51,665][25689] Avg episode reward: [(0, '-3.553')] [2022-07-10 12:51:53,287][26022] Updated weights on worker 0-0, policy_version 731160 (0.00086) [2022-07-10 12:51:55,211][26022] Updated weights on worker 0-0, policy_version 731170 (0.00092) [2022-07-10 12:51:56,694][25689] Fps is (10 sec: 5715.9, 60 sec: 5537.4, 300 sec: 5534.1). Total num frames: 748726272. Throughput: 0: 5819.0. Samples: 748726014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:51:56,694][25689] Avg episode reward: [(0, '-3.722')] [2022-07-10 12:51:56,899][26022] Updated weights on worker 0-0, policy_version 731180 (0.00087) [2022-07-10 12:51:58,796][26022] Updated weights on worker 0-0, policy_version 731190 (0.00086) [2022-07-10 12:52:00,567][26022] Updated weights on worker 0-0, policy_version 731200 (0.00094) [2022-07-10 12:52:01,727][25689] Fps is (10 sec: 5393.1, 60 sec: 5519.2, 300 sec: 5532.3). Total num frames: 748752896. Throughput: 0: 5825.1. Samples: 748759394. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:52:01,728][25689] Avg episode reward: [(0, '-4.594')] [2022-07-10 12:52:02,891][26022] Updated weights on worker 0-0, policy_version 731210 (0.00097) [2022-07-10 12:52:04,626][26022] Updated weights on worker 0-0, policy_version 731220 (0.00087) [2022-07-10 12:52:06,432][26022] Updated weights on worker 0-0, policy_version 731230 (0.00081) [2022-07-10 12:52:06,838][25689] Fps is (10 sec: 5451.1, 60 sec: 5550.4, 300 sec: 5538.3). Total num frames: 748781568. Throughput: 0: 4886.8. Samples: 748774042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:52:06,838][25689] Avg episode reward: [(0, '-4.285')] [2022-07-10 12:52:08,489][26022] Updated weights on worker 0-0, policy_version 731240 (0.00094) [2022-07-10 12:52:10,150][26022] Updated weights on worker 0-0, policy_version 731250 (0.00097) [2022-07-10 12:52:11,843][25689] Fps is (10 sec: 5567.8, 60 sec: 5550.8, 300 sec: 5533.1). Total num frames: 748809216. Throughput: 0: 5697.0. Samples: 748807284. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:52:11,843][25689] Avg episode reward: [(0, '-4.111')] [2022-07-10 12:52:11,919][26022] Updated weights on worker 0-0, policy_version 731260 (0.00085) [2022-07-10 12:52:13,920][26022] Updated weights on worker 0-0, policy_version 731270 (0.00089) [2022-07-10 12:52:15,483][26022] Updated weights on worker 0-0, policy_version 731280 (0.00098) [2022-07-10 12:52:16,868][25689] Fps is (10 sec: 5410.9, 60 sec: 5506.2, 300 sec: 5533.0). Total num frames: 748835840. Throughput: 0: 5691.0. Samples: 748840824. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:52:16,868][25689] Avg episode reward: [(0, '-4.284')] [2022-07-10 12:52:17,618][26022] Updated weights on worker 0-0, policy_version 731290 (0.00097) [2022-07-10 12:52:19,288][26022] Updated weights on worker 0-0, policy_version 731300 (0.00083) [2022-07-10 12:52:21,231][26022] Updated weights on worker 0-0, policy_version 731310 (0.00083) [2022-07-10 12:52:21,890][25689] Fps is (10 sec: 5605.4, 60 sec: 5557.5, 300 sec: 5537.4). Total num frames: 748865536. Throughput: 0: 4869.5. Samples: 748857576. Policy #0 lag: (min: 0.0, avg: 8.8, max: 18.0) [2022-07-10 12:52:21,891][25689] Avg episode reward: [(0, '-2.687')] [2022-07-10 12:52:23,196][26022] Updated weights on worker 0-0, policy_version 731320 (0.00093) [2022-07-10 12:52:24,840][26022] Updated weights on worker 0-0, policy_version 731330 (0.00086) [2022-07-10 12:52:26,765][26022] Updated weights on worker 0-0, policy_version 731340 (0.00083) [2022-07-10 12:52:26,959][25689] Fps is (10 sec: 5682.6, 60 sec: 5545.8, 300 sec: 5537.7). Total num frames: 748893184. Throughput: 0: 5831.4. Samples: 748891376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:26,959][25689] Avg episode reward: [(0, '-2.111')] [2022-07-10 12:52:28,498][26022] Updated weights on worker 0-0, policy_version 731350 (0.00086) [2022-07-10 12:52:30,345][26022] Updated weights on worker 0-0, policy_version 731360 (0.00098) [2022-07-10 12:52:31,990][25689] Fps is (10 sec: 5475.0, 60 sec: 5529.4, 300 sec: 5530.6). Total num frames: 748920832. Throughput: 0: 5821.1. Samples: 748924564. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:31,990][25689] Avg episode reward: [(0, '-2.175')] [2022-07-10 12:52:32,323][26022] Updated weights on worker 0-0, policy_version 731370 (0.00090) [2022-07-10 12:52:32,676][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:52:32,690][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000731372_748924928.pth [2022-07-10 12:52:32,690][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000729426_746932224.pth [2022-07-10 12:52:34,017][26022] Updated weights on worker 0-0, policy_version 731380 (0.00086) [2022-07-10 12:52:35,973][26022] Updated weights on worker 0-0, policy_version 731390 (0.00095) [2022-07-10 12:52:37,007][25689] Fps is (10 sec: 5503.3, 60 sec: 5537.1, 300 sec: 5534.5). Total num frames: 748948480. Throughput: 0: 5802.8. Samples: 748957686. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:37,007][25689] Avg episode reward: [(0, '-2.371')] [2022-07-10 12:52:37,877][26022] Updated weights on worker 0-0, policy_version 731400 (0.00083) [2022-07-10 12:52:39,571][26022] Updated weights on worker 0-0, policy_version 731410 (0.00084) [2022-07-10 12:52:41,481][26022] Updated weights on worker 0-0, policy_version 731420 (0.00093) [2022-07-10 12:52:42,031][25689] Fps is (10 sec: 5507.1, 60 sec: 5521.0, 300 sec: 5529.3). Total num frames: 748976128. Throughput: 0: 5809.9. Samples: 748974590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:42,031][25689] Avg episode reward: [(0, '-1.746')] [2022-07-10 12:52:43,405][26022] Updated weights on worker 0-0, policy_version 731430 (0.00110) [2022-07-10 12:52:45,167][26022] Updated weights on worker 0-0, policy_version 731440 (0.00085) [2022-07-10 12:52:47,084][25689] Fps is (10 sec: 5487.4, 60 sec: 5541.8, 300 sec: 5532.0). Total num frames: 749003776. Throughput: 0: 5786.8. Samples: 749007834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:47,084][25689] Avg episode reward: [(0, '-1.698')] [2022-07-10 12:52:47,159][26022] Updated weights on worker 0-0, policy_version 731450 (0.00088) [2022-07-10 12:52:48,856][26022] Updated weights on worker 0-0, policy_version 731460 (0.00093) [2022-07-10 12:52:50,634][26022] Updated weights on worker 0-0, policy_version 731470 (0.00094) [2022-07-10 12:52:52,097][25689] Fps is (10 sec: 5594.7, 60 sec: 5523.8, 300 sec: 5532.0). Total num frames: 749032448. Throughput: 0: 5828.5. Samples: 749041760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:52,098][25689] Avg episode reward: [(0, '-1.453')] [2022-07-10 12:52:52,717][26022] Updated weights on worker 0-0, policy_version 731480 (0.00083) [2022-07-10 12:52:54,136][26022] Updated weights on worker 0-0, policy_version 731490 (0.00096) [2022-07-10 12:52:56,139][26022] Updated weights on worker 0-0, policy_version 731500 (0.00514) [2022-07-10 12:52:57,121][25689] Fps is (10 sec: 5713.0, 60 sec: 5541.3, 300 sec: 5535.3). Total num frames: 749061120. Throughput: 0: 5020.5. Samples: 749058670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:52:57,122][25689] Avg episode reward: [(0, '-1.230')] [2022-07-10 12:52:57,894][26022] Updated weights on worker 0-0, policy_version 731510 (0.00090) [2022-07-10 12:52:59,787][26022] Updated weights on worker 0-0, policy_version 731520 (0.00097) [2022-07-10 12:53:01,645][26022] Updated weights on worker 0-0, policy_version 731530 (0.00087) [2022-07-10 12:53:02,133][25689] Fps is (10 sec: 5714.0, 60 sec: 5577.2, 300 sec: 5543.0). Total num frames: 749089792. Throughput: 0: 5871.9. Samples: 749092628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:02,133][25689] Avg episode reward: [(0, '-0.599')] [2022-07-10 12:53:03,856][26022] Updated weights on worker 0-0, policy_version 731540 (0.00094) [2022-07-10 12:53:05,514][26022] Updated weights on worker 0-0, policy_version 731550 (0.00082) [2022-07-10 12:53:07,231][25689] Fps is (10 sec: 5368.2, 60 sec: 5527.5, 300 sec: 5534.6). Total num frames: 749115392. Throughput: 0: 5773.9. Samples: 749124162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:07,231][25689] Avg episode reward: [(0, '-1.041')] [2022-07-10 12:53:07,716][26022] Updated weights on worker 0-0, policy_version 731560 (0.00089) [2022-07-10 12:53:09,207][26022] Updated weights on worker 0-0, policy_version 731570 (0.00096) [2022-07-10 12:53:11,282][26022] Updated weights on worker 0-0, policy_version 731580 (0.00087) [2022-07-10 12:53:12,239][25689] Fps is (10 sec: 5370.4, 60 sec: 5544.2, 300 sec: 5535.4). Total num frames: 749144064. Throughput: 0: 4921.0. Samples: 749140876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:12,239][25689] Avg episode reward: [(0, '-0.578')] [2022-07-10 12:53:12,983][26022] Updated weights on worker 0-0, policy_version 731590 (0.00088) [2022-07-10 12:53:14,848][26022] Updated weights on worker 0-0, policy_version 731600 (0.00090) [2022-07-10 12:53:16,584][26022] Updated weights on worker 0-0, policy_version 731610 (0.00089) [2022-07-10 12:53:17,247][25689] Fps is (10 sec: 5520.6, 60 sec: 5545.7, 300 sec: 5532.8). Total num frames: 749170688. Throughput: 0: 5754.1. Samples: 749174478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:17,248][25689] Avg episode reward: [(0, '-1.455')] [2022-07-10 12:53:18,478][26022] Updated weights on worker 0-0, policy_version 731620 (0.00096) [2022-07-10 12:53:20,169][26022] Updated weights on worker 0-0, policy_version 731630 (0.00100) [2022-07-10 12:53:22,143][26022] Updated weights on worker 0-0, policy_version 731640 (0.00093) [2022-07-10 12:53:22,293][25689] Fps is (10 sec: 5499.6, 60 sec: 5526.6, 300 sec: 5526.4). Total num frames: 749199360. Throughput: 0: 5731.7. Samples: 749208180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:22,294][25689] Avg episode reward: [(0, '-2.348')] [2022-07-10 12:53:23,850][26022] Updated weights on worker 0-0, policy_version 731650 (0.00092) [2022-07-10 12:53:25,762][26022] Updated weights on worker 0-0, policy_version 731660 (0.00090) [2022-07-10 12:53:27,392][25689] Fps is (10 sec: 5753.4, 60 sec: 5557.7, 300 sec: 5536.2). Total num frames: 749229056. Throughput: 0: 5007.3. Samples: 749225116. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:27,394][25689] Avg episode reward: [(0, '-1.642')] [2022-07-10 12:53:27,517][26022] Updated weights on worker 0-0, policy_version 731670 (0.00091) [2022-07-10 12:53:29,527][26022] Updated weights on worker 0-0, policy_version 731680 (0.00097) [2022-07-10 12:53:31,269][26022] Updated weights on worker 0-0, policy_version 731690 (0.00084) [2022-07-10 12:53:32,494][25689] Fps is (10 sec: 5621.5, 60 sec: 5551.2, 300 sec: 5531.0). Total num frames: 749256704. Throughput: 0: 5803.4. Samples: 749258426. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:32,496][25689] Avg episode reward: [(0, '-2.097')] [2022-07-10 12:53:33,249][26022] Updated weights on worker 0-0, policy_version 731700 (0.00089) [2022-07-10 12:53:34,771][26022] Updated weights on worker 0-0, policy_version 731710 (0.00089) [2022-07-10 12:53:37,028][26022] Updated weights on worker 0-0, policy_version 731720 (0.00094) [2022-07-10 12:53:37,562][25689] Fps is (10 sec: 5538.2, 60 sec: 5563.5, 300 sec: 5530.8). Total num frames: 749285376. Throughput: 0: 5785.5. Samples: 749292006. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:37,562][25689] Avg episode reward: [(0, '-0.654')] [2022-07-10 12:53:38,416][26022] Updated weights on worker 0-0, policy_version 731730 (0.00090) [2022-07-10 12:53:40,537][26022] Updated weights on worker 0-0, policy_version 731740 (0.00081) [2022-07-10 12:53:42,247][26022] Updated weights on worker 0-0, policy_version 731750 (0.00102) [2022-07-10 12:53:42,568][25689] Fps is (10 sec: 5590.9, 60 sec: 5565.1, 300 sec: 5535.3). Total num frames: 749313024. Throughput: 0: 4967.3. Samples: 749308888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:42,568][25689] Avg episode reward: [(0, '-1.304')] [2022-07-10 12:53:44,135][26022] Updated weights on worker 0-0, policy_version 731760 (0.00086) [2022-07-10 12:53:45,947][26022] Updated weights on worker 0-0, policy_version 731770 (0.00091) [2022-07-10 12:53:47,629][25689] Fps is (10 sec: 5594.5, 60 sec: 5581.3, 300 sec: 5534.5). Total num frames: 749341696. Throughput: 0: 5791.7. Samples: 749342318. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:47,629][25689] Avg episode reward: [(0, '-0.851')] [2022-07-10 12:53:47,971][26022] Updated weights on worker 0-0, policy_version 731780 (0.00087) [2022-07-10 12:53:49,570][26022] Updated weights on worker 0-0, policy_version 731790 (0.00091) [2022-07-10 12:53:51,694][26022] Updated weights on worker 0-0, policy_version 731800 (0.00093) [2022-07-10 12:53:52,650][25689] Fps is (10 sec: 5585.7, 60 sec: 5563.7, 300 sec: 5534.3). Total num frames: 749369344. Throughput: 0: 5814.7. Samples: 749375628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:52,651][25689] Avg episode reward: [(0, '-1.318')] [2022-07-10 12:53:53,230][26022] Updated weights on worker 0-0, policy_version 731810 (0.00102) [2022-07-10 12:53:55,285][26022] Updated weights on worker 0-0, policy_version 731820 (0.00091) [2022-07-10 12:53:56,907][26022] Updated weights on worker 0-0, policy_version 731830 (0.00087) [2022-07-10 12:53:57,667][25689] Fps is (10 sec: 5406.6, 60 sec: 5530.5, 300 sec: 5530.7). Total num frames: 749395968. Throughput: 0: 4996.3. Samples: 749392458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:53:57,667][25689] Avg episode reward: [(0, '-1.890')] [2022-07-10 12:53:58,744][26022] Updated weights on worker 0-0, policy_version 731840 (0.00084) [2022-07-10 12:54:00,546][26022] Updated weights on worker 0-0, policy_version 731850 (0.00090) [2022-07-10 12:54:02,682][25689] Fps is (10 sec: 5410.3, 60 sec: 5513.3, 300 sec: 5541.8). Total num frames: 749423616. Throughput: 0: 5816.5. Samples: 749425882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:02,684][25689] Avg episode reward: [(0, '-2.315')] [2022-07-10 12:54:02,844][26022] Updated weights on worker 0-0, policy_version 731860 (0.00087) [2022-07-10 12:54:04,763][26022] Updated weights on worker 0-0, policy_version 731870 (0.00088) [2022-07-10 12:54:06,579][26022] Updated weights on worker 0-0, policy_version 731880 (0.00087) [2022-07-10 12:54:07,755][25689] Fps is (10 sec: 5278.4, 60 sec: 5515.6, 300 sec: 5530.6). Total num frames: 749449216. Throughput: 0: 5697.2. Samples: 749456980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:07,755][25689] Avg episode reward: [(0, '-2.088')] [2022-07-10 12:54:08,446][26022] Updated weights on worker 0-0, policy_version 731890 (0.00086) [2022-07-10 12:54:10,449][26022] Updated weights on worker 0-0, policy_version 731900 (0.00079) [2022-07-10 12:54:12,287][26022] Updated weights on worker 0-0, policy_version 731910 (0.00093) [2022-07-10 12:54:12,766][25689] Fps is (10 sec: 5381.8, 60 sec: 5515.3, 300 sec: 5534.1). Total num frames: 749477888. Throughput: 0: 4851.7. Samples: 749473222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:12,766][25689] Avg episode reward: [(0, '-2.090')] [2022-07-10 12:54:14,046][26022] Updated weights on worker 0-0, policy_version 731920 (0.00084) [2022-07-10 12:54:15,929][26022] Updated weights on worker 0-0, policy_version 731930 (0.00085) [2022-07-10 12:54:17,647][26022] Updated weights on worker 0-0, policy_version 731940 (0.00085) [2022-07-10 12:54:17,788][25689] Fps is (10 sec: 5817.6, 60 sec: 5564.8, 300 sec: 5541.3). Total num frames: 749507584. Throughput: 0: 5682.7. Samples: 749506800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:17,788][25689] Avg episode reward: [(0, '-2.414')] [2022-07-10 12:54:19,579][26022] Updated weights on worker 0-0, policy_version 731950 (0.00086) [2022-07-10 12:54:21,434][26022] Updated weights on worker 0-0, policy_version 731960 (0.00091) [2022-07-10 12:54:22,802][25689] Fps is (10 sec: 5611.5, 60 sec: 5533.8, 300 sec: 5536.3). Total num frames: 749534208. Throughput: 0: 5700.4. Samples: 749540578. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:22,803][25689] Avg episode reward: [(0, '-1.741')] [2022-07-10 12:54:23,092][26022] Updated weights on worker 0-0, policy_version 731970 (0.00088) [2022-07-10 12:54:25,199][26022] Updated weights on worker 0-0, policy_version 731980 (0.00089) [2022-07-10 12:54:26,726][26022] Updated weights on worker 0-0, policy_version 731990 (0.00084) [2022-07-10 12:54:27,874][25689] Fps is (10 sec: 5380.5, 60 sec: 5502.4, 300 sec: 5531.6). Total num frames: 749561856. Throughput: 0: 4992.8. Samples: 749557434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:27,875][25689] Avg episode reward: [(0, '-1.309')] [2022-07-10 12:54:28,698][26022] Updated weights on worker 0-0, policy_version 732000 (0.00094) [2022-07-10 12:54:30,594][26022] Updated weights on worker 0-0, policy_version 732010 (0.00105) [2022-07-10 12:54:32,367][26022] Updated weights on worker 0-0, policy_version 732020 (0.00090) [2022-07-10 12:54:32,837][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:54:32,851][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000732023_749591552.pth [2022-07-10 12:54:32,851][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000730077_747598848.pth [2022-07-10 12:54:32,925][25689] Fps is (10 sec: 5665.2, 60 sec: 5541.1, 300 sec: 5544.8). Total num frames: 749591552. Throughput: 0: 5838.1. Samples: 749590910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:32,925][25689] Avg episode reward: [(0, '-1.260')] [2022-07-10 12:54:34,260][26022] Updated weights on worker 0-0, policy_version 732030 (0.00105) [2022-07-10 12:54:36,084][26022] Updated weights on worker 0-0, policy_version 732040 (0.00093) [2022-07-10 12:54:37,759][26022] Updated weights on worker 0-0, policy_version 732050 (0.00062) [2022-07-10 12:54:37,942][25689] Fps is (10 sec: 5797.4, 60 sec: 5545.6, 300 sec: 5541.4). Total num frames: 749620224. Throughput: 0: 5834.9. Samples: 749624402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:37,943][25689] Avg episode reward: [(0, '-0.638')] [2022-07-10 12:54:39,800][26022] Updated weights on worker 0-0, policy_version 732060 (0.00085) [2022-07-10 12:54:41,464][26022] Updated weights on worker 0-0, policy_version 732070 (0.00088) [2022-07-10 12:54:42,981][25689] Fps is (10 sec: 5498.7, 60 sec: 5525.7, 300 sec: 5542.6). Total num frames: 749646848. Throughput: 0: 4974.9. Samples: 749640964. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:42,981][25689] Avg episode reward: [(0, '-2.168')] [2022-07-10 12:54:43,471][26022] Updated weights on worker 0-0, policy_version 732080 (0.00095) [2022-07-10 12:54:45,274][26022] Updated weights on worker 0-0, policy_version 732090 (0.00089) [2022-07-10 12:54:47,194][26022] Updated weights on worker 0-0, policy_version 732100 (0.00087) [2022-07-10 12:54:48,021][25689] Fps is (10 sec: 5486.5, 60 sec: 5527.6, 300 sec: 5539.3). Total num frames: 749675520. Throughput: 0: 5810.4. Samples: 749674492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:48,023][25689] Avg episode reward: [(0, '-1.678')] [2022-07-10 12:54:48,810][26022] Updated weights on worker 0-0, policy_version 732110 (0.00098) [2022-07-10 12:54:50,860][26022] Updated weights on worker 0-0, policy_version 732120 (0.00087) [2022-07-10 12:54:52,471][26022] Updated weights on worker 0-0, policy_version 732130 (0.00088) [2022-07-10 12:54:53,028][25689] Fps is (10 sec: 5605.7, 60 sec: 5529.0, 300 sec: 5540.0). Total num frames: 749703168. Throughput: 0: 5832.3. Samples: 749708158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:53,030][25689] Avg episode reward: [(0, '-1.698')] [2022-07-10 12:54:54,391][26022] Updated weights on worker 0-0, policy_version 732140 (0.00085) [2022-07-10 12:54:56,236][26022] Updated weights on worker 0-0, policy_version 732150 (0.00090) [2022-07-10 12:54:58,004][26022] Updated weights on worker 0-0, policy_version 732160 (0.00087) [2022-07-10 12:54:58,059][25689] Fps is (10 sec: 5610.8, 60 sec: 5561.5, 300 sec: 5539.6). Total num frames: 749731840. Throughput: 0: 5002.0. Samples: 749725024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:54:58,060][25689] Avg episode reward: [(0, '-1.171')] [2022-07-10 12:54:59,780][26022] Updated weights on worker 0-0, policy_version 732170 (0.00090) [2022-07-10 12:55:01,607][26022] Updated weights on worker 0-0, policy_version 732180 (0.00084) [2022-07-10 12:55:03,086][25689] Fps is (10 sec: 5395.6, 60 sec: 5526.5, 300 sec: 5537.0). Total num frames: 749757440. Throughput: 0: 5861.0. Samples: 749758804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:03,088][25689] Avg episode reward: [(0, '-1.600')] [2022-07-10 12:55:03,803][26022] Updated weights on worker 0-0, policy_version 732190 (0.00095) [2022-07-10 12:55:05,720][26022] Updated weights on worker 0-0, policy_version 732200 (0.00082) [2022-07-10 12:55:07,601][26022] Updated weights on worker 0-0, policy_version 732210 (0.00430) [2022-07-10 12:55:08,199][25689] Fps is (10 sec: 5251.2, 60 sec: 5556.7, 300 sec: 5538.7). Total num frames: 749785088. Throughput: 0: 5733.2. Samples: 749790176. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:08,199][25689] Avg episode reward: [(0, '-1.719')] [2022-07-10 12:55:09,435][26022] Updated weights on worker 0-0, policy_version 732220 (0.00091) [2022-07-10 12:55:11,281][26022] Updated weights on worker 0-0, policy_version 732230 (0.00093) [2022-07-10 12:55:13,225][25689] Fps is (10 sec: 5454.1, 60 sec: 5538.4, 300 sec: 5534.9). Total num frames: 749812736. Throughput: 0: 5707.4. Samples: 749823430. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:13,226][25689] Avg episode reward: [(0, '-1.268')] [2022-07-10 12:55:13,231][26022] Updated weights on worker 0-0, policy_version 732240 (0.00083) [2022-07-10 12:55:15,013][26022] Updated weights on worker 0-0, policy_version 732250 (0.00084) [2022-07-10 12:55:16,768][26022] Updated weights on worker 0-0, policy_version 732260 (0.00088) [2022-07-10 12:55:18,242][25689] Fps is (10 sec: 5709.5, 60 sec: 5538.8, 300 sec: 5545.0). Total num frames: 749842432. Throughput: 0: 5720.8. Samples: 749840490. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:18,244][25689] Avg episode reward: [(0, '-2.412')] [2022-07-10 12:55:18,413][26022] Updated weights on worker 0-0, policy_version 732270 (0.00082) [2022-07-10 12:55:20,375][26022] Updated weights on worker 0-0, policy_version 732280 (0.00095) [2022-07-10 12:55:22,355][26022] Updated weights on worker 0-0, policy_version 732290 (0.00095) [2022-07-10 12:55:23,280][25689] Fps is (10 sec: 5601.2, 60 sec: 5536.8, 300 sec: 5534.9). Total num frames: 749869056. Throughput: 0: 5688.5. Samples: 749873674. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:23,280][25689] Avg episode reward: [(0, '-3.085')] [2022-07-10 12:55:23,976][26022] Updated weights on worker 0-0, policy_version 732300 (0.00094) [2022-07-10 12:55:26,140][26022] Updated weights on worker 0-0, policy_version 732310 (0.00088) [2022-07-10 12:55:27,751][26022] Updated weights on worker 0-0, policy_version 732320 (0.00087) [2022-07-10 12:55:28,353][25689] Fps is (10 sec: 5367.6, 60 sec: 5536.6, 300 sec: 5531.1). Total num frames: 749896704. Throughput: 0: 5791.2. Samples: 749906896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:28,354][25689] Avg episode reward: [(0, '-3.151')] [2022-07-10 12:55:29,757][26022] Updated weights on worker 0-0, policy_version 732330 (0.00087) [2022-07-10 12:55:31,851][26022] Updated weights on worker 0-0, policy_version 732340 (0.00091) [2022-07-10 12:55:33,336][26022] Updated weights on worker 0-0, policy_version 732350 (0.00094) [2022-07-10 12:55:33,407][25689] Fps is (10 sec: 5662.4, 60 sec: 5536.3, 300 sec: 5545.1). Total num frames: 749926400. Throughput: 0: 4947.4. Samples: 749923278. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:33,407][25689] Avg episode reward: [(0, '-2.529')] [2022-07-10 12:55:35,331][26022] Updated weights on worker 0-0, policy_version 732360 (0.00089) [2022-07-10 12:55:37,015][26022] Updated weights on worker 0-0, policy_version 732370 (0.00110) [2022-07-10 12:55:38,441][25689] Fps is (10 sec: 5684.6, 60 sec: 5517.9, 300 sec: 5534.6). Total num frames: 749954048. Throughput: 0: 5749.8. Samples: 749956630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:38,442][25689] Avg episode reward: [(0, '-2.860')] [2022-07-10 12:55:39,141][26022] Updated weights on worker 0-0, policy_version 732380 (0.00094) [2022-07-10 12:55:41,024][26022] Updated weights on worker 0-0, policy_version 732390 (0.00082) [2022-07-10 12:55:42,602][26022] Updated weights on worker 0-0, policy_version 732400 (0.00085) [2022-07-10 12:55:43,466][25689] Fps is (10 sec: 5497.0, 60 sec: 5536.0, 300 sec: 5535.4). Total num frames: 749981696. Throughput: 0: 5763.4. Samples: 749990016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 12:55:43,467][25689] Avg episode reward: [(0, '-2.483')] [2022-07-10 12:55:44,737][26022] Updated weights on worker 0-0, policy_version 732410 (0.00085) [2022-07-10 12:55:46,220][26022] Updated weights on worker 0-0, policy_version 732420 (0.00094) [2022-07-10 12:55:48,379][26022] Updated weights on worker 0-0, policy_version 732430 (0.00495) [2022-07-10 12:55:48,547][25689] Fps is (10 sec: 5370.5, 60 sec: 5498.5, 300 sec: 5530.9). Total num frames: 750008320. Throughput: 0: 4951.5. Samples: 750006882. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:55:48,547][25689] Avg episode reward: [(0, '-1.982')] [2022-07-10 12:55:49,862][26022] Updated weights on worker 0-0, policy_version 732440 (0.00088) [2022-07-10 12:55:51,876][26022] Updated weights on worker 0-0, policy_version 732450 (0.00084) [2022-07-10 12:55:53,571][25689] Fps is (10 sec: 5573.5, 60 sec: 5530.7, 300 sec: 5537.6). Total num frames: 750038016. Throughput: 0: 5804.8. Samples: 750040326. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:55:53,573][25689] Avg episode reward: [(0, '-0.675')] [2022-07-10 12:55:53,871][26022] Updated weights on worker 0-0, policy_version 732460 (0.00089) [2022-07-10 12:55:55,569][26022] Updated weights on worker 0-0, policy_version 732470 (0.00084) [2022-07-10 12:55:57,547][26022] Updated weights on worker 0-0, policy_version 732480 (0.00088) [2022-07-10 12:55:58,581][25689] Fps is (10 sec: 5714.9, 60 sec: 5515.7, 300 sec: 5537.8). Total num frames: 750065664. Throughput: 0: 5817.1. Samples: 750073784. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:55:58,581][25689] Avg episode reward: [(0, '-0.767')] [2022-07-10 12:55:59,348][26022] Updated weights on worker 0-0, policy_version 732490 (0.00088) [2022-07-10 12:56:01,087][26022] Updated weights on worker 0-0, policy_version 732500 (0.00097) [2022-07-10 12:56:03,422][26022] Updated weights on worker 0-0, policy_version 732510 (0.00089) [2022-07-10 12:56:03,604][25689] Fps is (10 sec: 5307.1, 60 sec: 5516.1, 300 sec: 5535.4). Total num frames: 750091264. Throughput: 0: 4985.6. Samples: 750090416. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:03,606][25689] Avg episode reward: [(0, '-1.751')] [2022-07-10 12:56:05,042][26022] Updated weights on worker 0-0, policy_version 732520 (0.00088) [2022-07-10 12:56:07,078][26022] Updated weights on worker 0-0, policy_version 732530 (0.00093) [2022-07-10 12:56:08,726][25689] Fps is (10 sec: 5349.3, 60 sec: 5532.2, 300 sec: 5536.8). Total num frames: 750119936. Throughput: 0: 5686.5. Samples: 750121634. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:08,727][25689] Avg episode reward: [(0, '-2.063')] [2022-07-10 12:56:08,840][26022] Updated weights on worker 0-0, policy_version 732540 (0.00093) [2022-07-10 12:56:10,788][26022] Updated weights on worker 0-0, policy_version 732550 (0.00089) [2022-07-10 12:56:12,739][26022] Updated weights on worker 0-0, policy_version 732560 (0.00087) [2022-07-10 12:56:13,811][25689] Fps is (10 sec: 5518.0, 60 sec: 5526.8, 300 sec: 5530.0). Total num frames: 750147584. Throughput: 0: 5649.2. Samples: 750154664. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:13,812][25689] Avg episode reward: [(0, '-2.443')] [2022-07-10 12:56:14,404][26022] Updated weights on worker 0-0, policy_version 732570 (0.00097) [2022-07-10 12:56:16,205][26022] Updated weights on worker 0-0, policy_version 732580 (0.00088) [2022-07-10 12:56:18,047][26022] Updated weights on worker 0-0, policy_version 732590 (0.00084) [2022-07-10 12:56:18,834][25689] Fps is (10 sec: 5470.7, 60 sec: 5492.5, 300 sec: 5533.6). Total num frames: 750175232. Throughput: 0: 4821.7. Samples: 750171440. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:18,836][25689] Avg episode reward: [(0, '-3.063')] [2022-07-10 12:56:19,909][26022] Updated weights on worker 0-0, policy_version 732600 (0.00090) [2022-07-10 12:56:21,776][26022] Updated weights on worker 0-0, policy_version 732610 (0.00085) [2022-07-10 12:56:23,481][26022] Updated weights on worker 0-0, policy_version 732620 (0.00084) [2022-07-10 12:56:23,840][25689] Fps is (10 sec: 5717.9, 60 sec: 5546.1, 300 sec: 5539.3). Total num frames: 750204928. Throughput: 0: 5669.3. Samples: 750205136. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:23,841][25689] Avg episode reward: [(0, '-3.172')] [2022-07-10 12:56:25,475][26022] Updated weights on worker 0-0, policy_version 732630 (0.00089) [2022-07-10 12:56:27,051][26022] Updated weights on worker 0-0, policy_version 732640 (0.00089) [2022-07-10 12:56:28,906][25689] Fps is (10 sec: 5592.0, 60 sec: 5529.9, 300 sec: 5531.8). Total num frames: 750231552. Throughput: 0: 5801.6. Samples: 750238706. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:28,906][25689] Avg episode reward: [(0, '-2.780')] [2022-07-10 12:56:29,154][26022] Updated weights on worker 0-0, policy_version 732650 (0.00091) [2022-07-10 12:56:30,796][26022] Updated weights on worker 0-0, policy_version 732660 (0.00088) [2022-07-10 12:56:32,819][26022] Updated weights on worker 0-0, policy_version 732670 (0.00091) [2022-07-10 12:56:32,997][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:56:33,006][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000732671_750255104.pth [2022-07-10 12:56:33,007][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000730721_748258304.pth [2022-07-10 12:56:33,967][25689] Fps is (10 sec: 5561.2, 60 sec: 5529.2, 300 sec: 5539.4). Total num frames: 750261248. Throughput: 0: 5010.3. Samples: 750255652. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:33,967][25689] Avg episode reward: [(0, '-2.582')] [2022-07-10 12:56:34,257][26022] Updated weights on worker 0-0, policy_version 732680 (0.00085) [2022-07-10 12:56:36,564][26022] Updated weights on worker 0-0, policy_version 732690 (0.00049) [2022-07-10 12:56:38,107][26022] Updated weights on worker 0-0, policy_version 732700 (0.00087) [2022-07-10 12:56:39,039][25689] Fps is (10 sec: 5658.8, 60 sec: 5525.7, 300 sec: 5535.3). Total num frames: 750288896. Throughput: 0: 5842.0. Samples: 750289478. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:39,040][25689] Avg episode reward: [(0, '-1.379')] [2022-07-10 12:56:40,222][26022] Updated weights on worker 0-0, policy_version 732710 (0.00095) [2022-07-10 12:56:41,767][26022] Updated weights on worker 0-0, policy_version 732720 (0.00084) [2022-07-10 12:56:43,922][26022] Updated weights on worker 0-0, policy_version 732730 (0.00084) [2022-07-10 12:56:44,052][25689] Fps is (10 sec: 5483.0, 60 sec: 5526.9, 300 sec: 5540.2). Total num frames: 750316544. Throughput: 0: 5822.7. Samples: 750322826. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:44,052][25689] Avg episode reward: [(0, '-0.200')] [2022-07-10 12:56:45,455][26022] Updated weights on worker 0-0, policy_version 732740 (0.00090) [2022-07-10 12:56:47,476][26022] Updated weights on worker 0-0, policy_version 732750 (0.00086) [2022-07-10 12:56:49,187][25689] Fps is (10 sec: 5549.7, 60 sec: 5555.6, 300 sec: 5534.3). Total num frames: 750345216. Throughput: 0: 4967.6. Samples: 750339458. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:49,188][25689] Avg episode reward: [(0, '-0.645')] [2022-07-10 12:56:49,209][26022] Updated weights on worker 0-0, policy_version 732760 (0.00087) [2022-07-10 12:56:51,120][26022] Updated weights on worker 0-0, policy_version 732770 (0.00084) [2022-07-10 12:56:52,772][26022] Updated weights on worker 0-0, policy_version 732780 (0.00082) [2022-07-10 12:56:54,236][25689] Fps is (10 sec: 5630.5, 60 sec: 5536.5, 300 sec: 5537.4). Total num frames: 750373888. Throughput: 0: 5799.5. Samples: 750373204. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:54,237][25689] Avg episode reward: [(0, '-1.381')] [2022-07-10 12:56:54,711][26022] Updated weights on worker 0-0, policy_version 732790 (0.00084) [2022-07-10 12:56:56,540][26022] Updated weights on worker 0-0, policy_version 732800 (0.00084) [2022-07-10 12:56:58,547][26022] Updated weights on worker 0-0, policy_version 732810 (0.00092) [2022-07-10 12:56:59,324][25689] Fps is (10 sec: 5556.2, 60 sec: 5529.4, 300 sec: 5539.8). Total num frames: 750401536. Throughput: 0: 5777.6. Samples: 750406674. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:56:59,324][25689] Avg episode reward: [(0, '-3.908')] [2022-07-10 12:57:00,087][26022] Updated weights on worker 0-0, policy_version 732820 (0.00088) [2022-07-10 12:57:02,558][26022] Updated weights on worker 0-0, policy_version 732830 (0.00086) [2022-07-10 12:57:04,243][26022] Updated weights on worker 0-0, policy_version 732840 (0.00085) [2022-07-10 12:57:04,326][25689] Fps is (10 sec: 5480.5, 60 sec: 5565.1, 300 sec: 5538.4). Total num frames: 750429184. Throughput: 0: 4971.5. Samples: 750423618. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:04,328][25689] Avg episode reward: [(0, '-3.295')] [2022-07-10 12:57:06,168][26022] Updated weights on worker 0-0, policy_version 732850 (0.00091) [2022-07-10 12:57:07,905][26022] Updated weights on worker 0-0, policy_version 732860 (0.00089) [2022-07-10 12:57:09,369][25689] Fps is (10 sec: 5300.8, 60 sec: 5521.6, 300 sec: 5530.8). Total num frames: 750454784. Throughput: 0: 5727.4. Samples: 750455048. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:09,370][25689] Avg episode reward: [(0, '-3.567')] [2022-07-10 12:57:09,869][26022] Updated weights on worker 0-0, policy_version 732870 (0.00093) [2022-07-10 12:57:11,653][26022] Updated weights on worker 0-0, policy_version 732880 (0.00084) [2022-07-10 12:57:13,495][26022] Updated weights on worker 0-0, policy_version 732890 (0.00091) [2022-07-10 12:57:14,403][25689] Fps is (10 sec: 5385.9, 60 sec: 5543.2, 300 sec: 5537.6). Total num frames: 750483456. Throughput: 0: 5709.0. Samples: 750488334. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:14,404][25689] Avg episode reward: [(0, '-4.383')] [2022-07-10 12:57:15,489][26022] Updated weights on worker 0-0, policy_version 732900 (0.00087) [2022-07-10 12:57:17,196][26022] Updated weights on worker 0-0, policy_version 732910 (0.00088) [2022-07-10 12:57:19,053][26022] Updated weights on worker 0-0, policy_version 732920 (0.00086) [2022-07-10 12:57:19,409][25689] Fps is (10 sec: 5711.4, 60 sec: 5561.6, 300 sec: 5534.4). Total num frames: 750512128. Throughput: 0: 4897.2. Samples: 750505038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:19,411][25689] Avg episode reward: [(0, '-3.596')] [2022-07-10 12:57:20,928][26022] Updated weights on worker 0-0, policy_version 732930 (0.00091) [2022-07-10 12:57:22,763][26022] Updated weights on worker 0-0, policy_version 732940 (0.00092) [2022-07-10 12:57:24,435][25689] Fps is (10 sec: 5614.0, 60 sec: 5526.0, 300 sec: 5535.2). Total num frames: 750539776. Throughput: 0: 5722.3. Samples: 750538686. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:24,437][25689] Avg episode reward: [(0, '-1.419')] [2022-07-10 12:57:24,476][26022] Updated weights on worker 0-0, policy_version 732950 (0.00105) [2022-07-10 12:57:26,421][26022] Updated weights on worker 0-0, policy_version 732960 (0.00084) [2022-07-10 12:57:28,295][26022] Updated weights on worker 0-0, policy_version 732970 (0.00087) [2022-07-10 12:57:29,579][25689] Fps is (10 sec: 5437.3, 60 sec: 5535.7, 300 sec: 5533.1). Total num frames: 750567424. Throughput: 0: 5778.1. Samples: 750571824. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:29,580][25689] Avg episode reward: [(0, '-0.376')] [2022-07-10 12:57:30,140][26022] Updated weights on worker 0-0, policy_version 732980 (0.00054) [2022-07-10 12:57:31,935][26022] Updated weights on worker 0-0, policy_version 732990 (0.00089) [2022-07-10 12:57:33,659][26022] Updated weights on worker 0-0, policy_version 733000 (0.00113) [2022-07-10 12:57:34,624][25689] Fps is (10 sec: 5527.5, 60 sec: 5520.4, 300 sec: 5536.0). Total num frames: 750596096. Throughput: 0: 5778.0. Samples: 750605172. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:34,624][25689] Avg episode reward: [(0, '-0.052')] [2022-07-10 12:57:35,842][26022] Updated weights on worker 0-0, policy_version 733010 (0.00090) [2022-07-10 12:57:37,307][26022] Updated weights on worker 0-0, policy_version 733020 (0.00088) [2022-07-10 12:57:39,384][26022] Updated weights on worker 0-0, policy_version 733030 (0.00086) [2022-07-10 12:57:39,659][25689] Fps is (10 sec: 5689.1, 60 sec: 5540.6, 300 sec: 5539.3). Total num frames: 750624768. Throughput: 0: 5766.9. Samples: 750621814. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:39,661][25689] Avg episode reward: [(0, '-0.409')] [2022-07-10 12:57:40,992][26022] Updated weights on worker 0-0, policy_version 733040 (0.00091) [2022-07-10 12:57:43,043][26022] Updated weights on worker 0-0, policy_version 733050 (0.00083) [2022-07-10 12:57:44,740][25689] Fps is (10 sec: 5567.5, 60 sec: 5534.4, 300 sec: 5538.7). Total num frames: 750652416. Throughput: 0: 5743.2. Samples: 750655302. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:44,740][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 12:57:44,797][26022] Updated weights on worker 0-0, policy_version 733060 (0.00092) [2022-07-10 12:57:46,687][26022] Updated weights on worker 0-0, policy_version 733070 (0.00081) [2022-07-10 12:57:48,401][26022] Updated weights on worker 0-0, policy_version 733080 (0.00090) [2022-07-10 12:57:49,826][25689] Fps is (10 sec: 5438.7, 60 sec: 5522.0, 300 sec: 5533.9). Total num frames: 750680064. Throughput: 0: 5777.0. Samples: 750688790. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:49,828][25689] Avg episode reward: [(0, '-0.593')] [2022-07-10 12:57:50,597][26022] Updated weights on worker 0-0, policy_version 733090 (0.00086) [2022-07-10 12:57:52,180][26022] Updated weights on worker 0-0, policy_version 733100 (0.00104) [2022-07-10 12:57:54,222][26022] Updated weights on worker 0-0, policy_version 733110 (0.00087) [2022-07-10 12:57:54,919][25689] Fps is (10 sec: 5532.6, 60 sec: 5518.0, 300 sec: 5532.7). Total num frames: 750708736. Throughput: 0: 4957.6. Samples: 750705788. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:54,920][25689] Avg episode reward: [(0, '-0.721')] [2022-07-10 12:57:55,708][26022] Updated weights on worker 0-0, policy_version 733120 (0.00085) [2022-07-10 12:57:57,819][26022] Updated weights on worker 0-0, policy_version 733130 (0.00089) [2022-07-10 12:57:59,446][26022] Updated weights on worker 0-0, policy_version 733140 (0.00081) [2022-07-10 12:57:59,939][25689] Fps is (10 sec: 5670.2, 60 sec: 5541.0, 300 sec: 5532.5). Total num frames: 750737408. Throughput: 0: 5809.2. Samples: 750739626. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:57:59,940][25689] Avg episode reward: [(0, '-3.592')] [2022-07-10 12:58:01,596][26022] Updated weights on worker 0-0, policy_version 733150 (0.00088) [2022-07-10 12:58:03,440][26022] Updated weights on worker 0-0, policy_version 733160 (0.00086) [2022-07-10 12:58:04,965][25689] Fps is (10 sec: 5504.3, 60 sec: 5522.0, 300 sec: 5537.3). Total num frames: 750764032. Throughput: 0: 5733.1. Samples: 750771258. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:04,966][25689] Avg episode reward: [(0, '-2.861')] [2022-07-10 12:58:05,320][26022] Updated weights on worker 0-0, policy_version 733170 (0.00086) [2022-07-10 12:58:07,144][26022] Updated weights on worker 0-0, policy_version 733180 (0.00093) [2022-07-10 12:58:09,044][26022] Updated weights on worker 0-0, policy_version 733190 (0.00089) [2022-07-10 12:58:10,060][25689] Fps is (10 sec: 5362.4, 60 sec: 5551.0, 300 sec: 5532.2). Total num frames: 750791680. Throughput: 0: 4910.8. Samples: 750788154. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:10,061][25689] Avg episode reward: [(0, '-3.475')] [2022-07-10 12:58:10,724][26022] Updated weights on worker 0-0, policy_version 733200 (0.00090) [2022-07-10 12:58:12,801][26022] Updated weights on worker 0-0, policy_version 733210 (0.00086) [2022-07-10 12:58:14,493][26022] Updated weights on worker 0-0, policy_version 733220 (0.00089) [2022-07-10 12:58:15,135][25689] Fps is (10 sec: 5538.2, 60 sec: 5547.3, 300 sec: 5537.9). Total num frames: 750820352. Throughput: 0: 5721.0. Samples: 750821440. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:15,135][25689] Avg episode reward: [(0, '-3.882')] [2022-07-10 12:58:16,432][26022] Updated weights on worker 0-0, policy_version 733230 (0.00085) [2022-07-10 12:58:18,083][26022] Updated weights on worker 0-0, policy_version 733240 (0.00086) [2022-07-10 12:58:19,911][26022] Updated weights on worker 0-0, policy_version 733250 (0.00086) [2022-07-10 12:58:20,233][25689] Fps is (10 sec: 5636.7, 60 sec: 5538.9, 300 sec: 5536.9). Total num frames: 750849024. Throughput: 0: 5697.9. Samples: 750855260. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:20,234][25689] Avg episode reward: [(0, '-5.458')] [2022-07-10 12:58:21,798][26022] Updated weights on worker 0-0, policy_version 733260 (0.00081) [2022-07-10 12:58:23,607][26022] Updated weights on worker 0-0, policy_version 733270 (0.00088) [2022-07-10 12:58:25,240][25689] Fps is (10 sec: 5573.4, 60 sec: 5540.6, 300 sec: 5531.7). Total num frames: 750876672. Throughput: 0: 4979.5. Samples: 750872216. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:25,240][25689] Avg episode reward: [(0, '-6.844')] [2022-07-10 12:58:25,487][26022] Updated weights on worker 0-0, policy_version 733280 (0.00091) [2022-07-10 12:58:27,316][26022] Updated weights on worker 0-0, policy_version 733290 (0.00086) [2022-07-10 12:58:29,100][26022] Updated weights on worker 0-0, policy_version 733300 (0.00089) [2022-07-10 12:58:30,363][25689] Fps is (10 sec: 5559.9, 60 sec: 5559.4, 300 sec: 5534.8). Total num frames: 750905344. Throughput: 0: 5790.8. Samples: 750905726. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:30,364][25689] Avg episode reward: [(0, '-4.883')] [2022-07-10 12:58:31,089][26022] Updated weights on worker 0-0, policy_version 733310 (0.00085) [2022-07-10 12:58:32,610][26022] Updated weights on worker 0-0, policy_version 733320 (0.00088) [2022-07-10 12:58:33,098][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 12:58:33,110][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000733322_750921728.pth [2022-07-10 12:58:33,110][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000731372_748924928.pth [2022-07-10 12:58:34,800][26022] Updated weights on worker 0-0, policy_version 733330 (0.00089) [2022-07-10 12:58:35,390][25689] Fps is (10 sec: 5750.7, 60 sec: 5577.9, 300 sec: 5539.0). Total num frames: 750935040. Throughput: 0: 5839.1. Samples: 750939710. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:35,390][25689] Avg episode reward: [(0, '-5.830')] [2022-07-10 12:58:36,393][26022] Updated weights on worker 0-0, policy_version 733340 (0.00084) [2022-07-10 12:58:38,246][26022] Updated weights on worker 0-0, policy_version 733350 (0.00103) [2022-07-10 12:58:40,150][26022] Updated weights on worker 0-0, policy_version 733360 (0.00087) [2022-07-10 12:58:40,404][25689] Fps is (10 sec: 5609.0, 60 sec: 5546.0, 300 sec: 5535.4). Total num frames: 750961664. Throughput: 0: 5019.6. Samples: 750956508. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:40,405][25689] Avg episode reward: [(0, '-7.328')] [2022-07-10 12:58:41,828][26022] Updated weights on worker 0-0, policy_version 733370 (0.00084) [2022-07-10 12:58:43,764][26022] Updated weights on worker 0-0, policy_version 733380 (0.00089) [2022-07-10 12:58:45,351][26022] Updated weights on worker 0-0, policy_version 733390 (0.00095) [2022-07-10 12:58:45,418][25689] Fps is (10 sec: 5616.1, 60 sec: 5585.9, 300 sec: 5539.7). Total num frames: 750991360. Throughput: 0: 5841.9. Samples: 750990094. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:45,418][25689] Avg episode reward: [(0, '-5.670')] [2022-07-10 12:58:47,437][26022] Updated weights on worker 0-0, policy_version 733400 (0.00094) [2022-07-10 12:58:49,072][26022] Updated weights on worker 0-0, policy_version 733410 (0.00089) [2022-07-10 12:58:50,475][25689] Fps is (10 sec: 5592.3, 60 sec: 5571.7, 300 sec: 5535.6). Total num frames: 751017984. Throughput: 0: 5862.0. Samples: 751023622. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:50,476][25689] Avg episode reward: [(0, '-4.464')] [2022-07-10 12:58:51,228][26022] Updated weights on worker 0-0, policy_version 733420 (0.00090) [2022-07-10 12:58:52,850][26022] Updated weights on worker 0-0, policy_version 733430 (0.00123) [2022-07-10 12:58:54,952][26022] Updated weights on worker 0-0, policy_version 733440 (0.00092) [2022-07-10 12:58:55,512][25689] Fps is (10 sec: 5478.1, 60 sec: 5576.9, 300 sec: 5542.1). Total num frames: 751046656. Throughput: 0: 4978.9. Samples: 751039896. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:58:55,514][25689] Avg episode reward: [(0, '-4.514')] [2022-07-10 12:58:56,627][26022] Updated weights on worker 0-0, policy_version 733450 (0.00091) [2022-07-10 12:58:58,520][26022] Updated weights on worker 0-0, policy_version 733460 (0.00087) [2022-07-10 12:59:00,217][26022] Updated weights on worker 0-0, policy_version 733470 (0.00098) [2022-07-10 12:59:00,556][25689] Fps is (10 sec: 5586.8, 60 sec: 5557.8, 300 sec: 5541.6). Total num frames: 751074304. Throughput: 0: 5804.2. Samples: 751073474. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:59:00,558][25689] Avg episode reward: [(0, '-4.136')] [2022-07-10 12:59:02,119][26022] Updated weights on worker 0-0, policy_version 733480 (0.00612) [2022-07-10 12:59:04,411][26022] Updated weights on worker 0-0, policy_version 733490 (0.00085) [2022-07-10 12:59:05,586][25689] Fps is (10 sec: 5285.5, 60 sec: 5540.5, 300 sec: 5542.4). Total num frames: 751099904. Throughput: 0: 5683.8. Samples: 751104726. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:59:05,587][25689] Avg episode reward: [(0, '-5.315')] [2022-07-10 12:59:06,063][26022] Updated weights on worker 0-0, policy_version 733500 (0.00092) [2022-07-10 12:59:08,131][26022] Updated weights on worker 0-0, policy_version 733510 (0.00086) [2022-07-10 12:59:10,054][26022] Updated weights on worker 0-0, policy_version 733520 (0.00087) [2022-07-10 12:59:10,697][25689] Fps is (10 sec: 5250.5, 60 sec: 5539.0, 300 sec: 5537.1). Total num frames: 751127552. Throughput: 0: 5650.8. Samples: 751137894. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-10 12:59:10,698][25689] Avg episode reward: [(0, '-3.237')] [2022-07-10 12:59:11,685][26022] Updated weights on worker 0-0, policy_version 733530 (0.00086) [2022-07-10 12:59:13,701][26022] Updated weights on worker 0-0, policy_version 733540 (0.00094) [2022-07-10 12:59:15,459][26022] Updated weights on worker 0-0, policy_version 733550 (0.00090) [2022-07-10 12:59:15,711][25689] Fps is (10 sec: 5461.4, 60 sec: 5527.7, 300 sec: 5530.3). Total num frames: 751155200. Throughput: 0: 5659.2. Samples: 751154208. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:15,712][25689] Avg episode reward: [(0, '-2.533')] [2022-07-10 12:59:17,262][26022] Updated weights on worker 0-0, policy_version 733560 (0.00085) [2022-07-10 12:59:19,272][26022] Updated weights on worker 0-0, policy_version 733570 (0.00089) [2022-07-10 12:59:20,771][25689] Fps is (10 sec: 5692.7, 60 sec: 5548.2, 300 sec: 5539.8). Total num frames: 751184896. Throughput: 0: 5664.8. Samples: 751187986. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:20,771][25689] Avg episode reward: [(0, '-2.417')] [2022-07-10 12:59:20,929][26022] Updated weights on worker 0-0, policy_version 733580 (0.00090) [2022-07-10 12:59:22,836][26022] Updated weights on worker 0-0, policy_version 733590 (0.00092) [2022-07-10 12:59:24,451][26022] Updated weights on worker 0-0, policy_version 733600 (0.00093) [2022-07-10 12:59:25,797][25689] Fps is (10 sec: 5584.2, 60 sec: 5529.5, 300 sec: 5537.2). Total num frames: 751211520. Throughput: 0: 5779.3. Samples: 751221528. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:25,797][25689] Avg episode reward: [(0, '-2.297')] [2022-07-10 12:59:26,451][26022] Updated weights on worker 0-0, policy_version 733610 (0.00088) [2022-07-10 12:59:28,332][26022] Updated weights on worker 0-0, policy_version 733620 (0.00087) [2022-07-10 12:59:30,127][26022] Updated weights on worker 0-0, policy_version 733630 (0.00611) [2022-07-10 12:59:30,875][25689] Fps is (10 sec: 5472.6, 60 sec: 5533.6, 300 sec: 5533.3). Total num frames: 751240192. Throughput: 0: 4973.6. Samples: 751238248. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:30,875][25689] Avg episode reward: [(0, '-1.124')] [2022-07-10 12:59:32,024][26022] Updated weights on worker 0-0, policy_version 733640 (0.00108) [2022-07-10 12:59:33,721][26022] Updated weights on worker 0-0, policy_version 733650 (0.01094) [2022-07-10 12:59:35,815][26022] Updated weights on worker 0-0, policy_version 733660 (0.00085) [2022-07-10 12:59:35,905][25689] Fps is (10 sec: 5571.9, 60 sec: 5499.5, 300 sec: 5529.6). Total num frames: 751267840. Throughput: 0: 5822.8. Samples: 751271790. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:35,905][25689] Avg episode reward: [(0, '-0.083')] [2022-07-10 12:59:37,447][26022] Updated weights on worker 0-0, policy_version 733670 (0.00085) [2022-07-10 12:59:39,438][26022] Updated weights on worker 0-0, policy_version 733680 (0.00095) [2022-07-10 12:59:40,912][25689] Fps is (10 sec: 5611.0, 60 sec: 5533.9, 300 sec: 5537.1). Total num frames: 751296512. Throughput: 0: 5824.4. Samples: 751305298. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:40,913][25689] Avg episode reward: [(0, '-0.840')] [2022-07-10 12:59:41,032][26022] Updated weights on worker 0-0, policy_version 733690 (0.00084) [2022-07-10 12:59:43,103][26022] Updated weights on worker 0-0, policy_version 733700 (0.00092) [2022-07-10 12:59:44,896][26022] Updated weights on worker 0-0, policy_version 733710 (0.00082) [2022-07-10 12:59:45,965][25689] Fps is (10 sec: 5700.1, 60 sec: 5513.5, 300 sec: 5536.8). Total num frames: 751325184. Throughput: 0: 4980.1. Samples: 751321968. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:45,965][25689] Avg episode reward: [(0, '-0.675')] [2022-07-10 12:59:46,715][26022] Updated weights on worker 0-0, policy_version 733720 (0.00387) [2022-07-10 12:59:48,476][26022] Updated weights on worker 0-0, policy_version 733730 (0.00088) [2022-07-10 12:59:50,268][26022] Updated weights on worker 0-0, policy_version 733740 (0.00089) [2022-07-10 12:59:51,053][25689] Fps is (10 sec: 5554.0, 60 sec: 5527.6, 300 sec: 5535.3). Total num frames: 751352832. Throughput: 0: 5819.1. Samples: 751355664. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:51,053][25689] Avg episode reward: [(0, '-0.243')] [2022-07-10 12:59:52,093][26022] Updated weights on worker 0-0, policy_version 733750 (0.00099) [2022-07-10 12:59:54,069][26022] Updated weights on worker 0-0, policy_version 733760 (0.00088) [2022-07-10 12:59:55,879][26022] Updated weights on worker 0-0, policy_version 733770 (0.00087) [2022-07-10 12:59:56,054][25689] Fps is (10 sec: 5683.8, 60 sec: 5547.7, 300 sec: 5539.3). Total num frames: 751382528. Throughput: 0: 5831.5. Samples: 751389290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 12:59:56,054][25689] Avg episode reward: [(0, '-0.864')] [2022-07-10 12:59:57,731][26022] Updated weights on worker 0-0, policy_version 733780 (0.00089) [2022-07-10 12:59:59,548][26022] Updated weights on worker 0-0, policy_version 733790 (0.00088) [2022-07-10 13:00:01,092][25689] Fps is (10 sec: 5712.1, 60 sec: 5548.3, 300 sec: 5546.0). Total num frames: 751410176. Throughput: 0: 4993.2. Samples: 751406058. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:01,092][25689] Avg episode reward: [(0, '-1.580')] [2022-07-10 13:00:01,495][26022] Updated weights on worker 0-0, policy_version 733800 (0.00092) [2022-07-10 13:00:03,674][26022] Updated weights on worker 0-0, policy_version 733810 (0.00090) [2022-07-10 13:00:05,363][26022] Updated weights on worker 0-0, policy_version 733820 (0.00083) [2022-07-10 13:00:06,166][25689] Fps is (10 sec: 5265.6, 60 sec: 5544.3, 300 sec: 5539.8). Total num frames: 751435776. Throughput: 0: 5736.2. Samples: 751437848. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:06,168][25689] Avg episode reward: [(0, '-1.358')] [2022-07-10 13:00:07,508][26022] Updated weights on worker 0-0, policy_version 733830 (0.00085) [2022-07-10 13:00:08,940][26022] Updated weights on worker 0-0, policy_version 733840 (0.00083) [2022-07-10 13:00:11,051][26022] Updated weights on worker 0-0, policy_version 733850 (0.00097) [2022-07-10 13:00:11,212][25689] Fps is (10 sec: 5261.3, 60 sec: 5550.2, 300 sec: 5539.4). Total num frames: 751463424. Throughput: 0: 5740.1. Samples: 751471384. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:11,213][25689] Avg episode reward: [(0, '-0.738')] [2022-07-10 13:00:12,492][26022] Updated weights on worker 0-0, policy_version 733860 (0.00090) [2022-07-10 13:00:14,717][26022] Updated weights on worker 0-0, policy_version 733870 (0.00091) [2022-07-10 13:00:16,214][25689] Fps is (10 sec: 5605.1, 60 sec: 5568.2, 300 sec: 5536.3). Total num frames: 751492096. Throughput: 0: 4902.0. Samples: 751488116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:16,215][25689] Avg episode reward: [(0, '-0.772')] [2022-07-10 13:00:16,514][26022] Updated weights on worker 0-0, policy_version 733880 (0.00104) [2022-07-10 13:00:18,340][26022] Updated weights on worker 0-0, policy_version 733890 (0.00089) [2022-07-10 13:00:20,220][26022] Updated weights on worker 0-0, policy_version 733900 (0.00089) [2022-07-10 13:00:21,237][25689] Fps is (10 sec: 5515.9, 60 sec: 5520.8, 300 sec: 5536.6). Total num frames: 751518720. Throughput: 0: 5739.0. Samples: 751521674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:21,238][25689] Avg episode reward: [(0, '-1.745')] [2022-07-10 13:00:21,892][26022] Updated weights on worker 0-0, policy_version 733910 (0.00091) [2022-07-10 13:00:23,832][26022] Updated weights on worker 0-0, policy_version 733920 (0.00085) [2022-07-10 13:00:25,447][26022] Updated weights on worker 0-0, policy_version 733930 (0.00095) [2022-07-10 13:00:26,260][25689] Fps is (10 sec: 5504.4, 60 sec: 5555.0, 300 sec: 5541.0). Total num frames: 751547392. Throughput: 0: 5837.4. Samples: 751555144. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:26,261][25689] Avg episode reward: [(0, '-2.593')] [2022-07-10 13:00:27,563][26022] Updated weights on worker 0-0, policy_version 733940 (0.00065) [2022-07-10 13:00:29,255][26022] Updated weights on worker 0-0, policy_version 733950 (0.00097) [2022-07-10 13:00:31,209][26022] Updated weights on worker 0-0, policy_version 733960 (0.00081) [2022-07-10 13:00:31,386][25689] Fps is (10 sec: 5650.6, 60 sec: 5550.6, 300 sec: 5536.2). Total num frames: 751576064. Throughput: 0: 4962.8. Samples: 751571500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:31,386][25689] Avg episode reward: [(0, '-2.403')] [2022-07-10 13:00:32,999][26022] Updated weights on worker 0-0, policy_version 733970 (0.00083) [2022-07-10 13:00:33,187][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:00:33,200][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000733971_751586304.pth [2022-07-10 13:00:33,201][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000732023_749591552.pth [2022-07-10 13:00:34,703][26022] Updated weights on worker 0-0, policy_version 733980 (0.00093) [2022-07-10 13:00:36,424][25689] Fps is (10 sec: 5642.0, 60 sec: 5566.8, 300 sec: 5539.5). Total num frames: 751604736. Throughput: 0: 5801.6. Samples: 751605364. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:36,425][25689] Avg episode reward: [(0, '-2.007')] [2022-07-10 13:00:36,845][26022] Updated weights on worker 0-0, policy_version 733990 (0.00089) [2022-07-10 13:00:38,371][26022] Updated weights on worker 0-0, policy_version 734000 (0.00095) [2022-07-10 13:00:40,575][26022] Updated weights on worker 0-0, policy_version 734010 (0.00094) [2022-07-10 13:00:41,463][25689] Fps is (10 sec: 5588.9, 60 sec: 5547.0, 300 sec: 5539.3). Total num frames: 751632384. Throughput: 0: 5772.3. Samples: 751638422. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:41,463][25689] Avg episode reward: [(0, '-2.048')] [2022-07-10 13:00:42,011][26022] Updated weights on worker 0-0, policy_version 734020 (0.00086) [2022-07-10 13:00:44,047][26022] Updated weights on worker 0-0, policy_version 734030 (0.00086) [2022-07-10 13:00:46,242][26022] Updated weights on worker 0-0, policy_version 734040 (0.00088) [2022-07-10 13:00:46,515][25689] Fps is (10 sec: 5378.1, 60 sec: 5513.2, 300 sec: 5539.8). Total num frames: 751659008. Throughput: 0: 4940.1. Samples: 751655206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:46,516][25689] Avg episode reward: [(0, '-2.656')] [2022-07-10 13:00:47,701][26022] Updated weights on worker 0-0, policy_version 734050 (0.00085) [2022-07-10 13:00:49,708][26022] Updated weights on worker 0-0, policy_version 734060 (0.00088) [2022-07-10 13:00:51,212][26022] Updated weights on worker 0-0, policy_version 734070 (0.00081) [2022-07-10 13:00:51,587][25689] Fps is (10 sec: 5563.0, 60 sec: 5548.5, 300 sec: 5538.9). Total num frames: 751688704. Throughput: 0: 5795.8. Samples: 751688582. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:51,587][25689] Avg episode reward: [(0, '-1.858')] [2022-07-10 13:00:53,332][26022] Updated weights on worker 0-0, policy_version 734080 (0.00087) [2022-07-10 13:00:55,012][26022] Updated weights on worker 0-0, policy_version 734090 (0.00087) [2022-07-10 13:00:56,613][25689] Fps is (10 sec: 5678.6, 60 sec: 5512.3, 300 sec: 5538.6). Total num frames: 751716352. Throughput: 0: 5790.5. Samples: 751722272. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:00:56,614][25689] Avg episode reward: [(0, '-0.944')] [2022-07-10 13:00:57,032][26022] Updated weights on worker 0-0, policy_version 734100 (0.00099) [2022-07-10 13:00:58,712][26022] Updated weights on worker 0-0, policy_version 734110 (0.00085) [2022-07-10 13:01:00,679][26022] Updated weights on worker 0-0, policy_version 734120 (0.00088) [2022-07-10 13:01:01,626][25689] Fps is (10 sec: 5610.1, 60 sec: 5531.6, 300 sec: 5549.1). Total num frames: 751745024. Throughput: 0: 4996.0. Samples: 751739156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:01,626][25689] Avg episode reward: [(0, '-0.357')] [2022-07-10 13:01:02,768][26022] Updated weights on worker 0-0, policy_version 734130 (0.00091) [2022-07-10 13:01:04,828][26022] Updated weights on worker 0-0, policy_version 734140 (0.00096) [2022-07-10 13:01:06,224][26022] Updated weights on worker 0-0, policy_version 734150 (0.00091) [2022-07-10 13:01:06,651][25689] Fps is (10 sec: 5407.0, 60 sec: 5536.1, 300 sec: 5540.6). Total num frames: 751770624. Throughput: 0: 5724.5. Samples: 751770472. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:06,651][25689] Avg episode reward: [(0, '-1.271')] [2022-07-10 13:01:08,609][26022] Updated weights on worker 0-0, policy_version 734160 (0.00088) [2022-07-10 13:01:10,103][26022] Updated weights on worker 0-0, policy_version 734170 (0.00088) [2022-07-10 13:01:11,773][25689] Fps is (10 sec: 5146.5, 60 sec: 5512.2, 300 sec: 5536.5). Total num frames: 751797248. Throughput: 0: 5702.6. Samples: 751803698. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:11,774][25689] Avg episode reward: [(0, '-2.253')] [2022-07-10 13:01:12,126][26022] Updated weights on worker 0-0, policy_version 734180 (0.00086) [2022-07-10 13:01:13,826][26022] Updated weights on worker 0-0, policy_version 734190 (0.00089) [2022-07-10 13:01:15,749][26022] Updated weights on worker 0-0, policy_version 734200 (0.00088) [2022-07-10 13:01:16,776][25689] Fps is (10 sec: 5663.4, 60 sec: 5545.9, 300 sec: 5547.2). Total num frames: 751827968. Throughput: 0: 4873.6. Samples: 751820536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:16,777][25689] Avg episode reward: [(0, '-2.654')] [2022-07-10 13:01:17,459][26022] Updated weights on worker 0-0, policy_version 734210 (0.00088) [2022-07-10 13:01:19,452][26022] Updated weights on worker 0-0, policy_version 734220 (0.00082) [2022-07-10 13:01:21,282][26022] Updated weights on worker 0-0, policy_version 734230 (0.00083) [2022-07-10 13:01:21,847][25689] Fps is (10 sec: 5489.2, 60 sec: 5507.8, 300 sec: 5528.8). Total num frames: 751852544. Throughput: 0: 5662.3. Samples: 751853654. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:21,847][25689] Avg episode reward: [(0, '-2.576')] [2022-07-10 13:01:23,112][26022] Updated weights on worker 0-0, policy_version 734240 (0.00096) [2022-07-10 13:01:25,153][26022] Updated weights on worker 0-0, policy_version 734250 (0.00093) [2022-07-10 13:01:26,598][26022] Updated weights on worker 0-0, policy_version 734260 (0.00093) [2022-07-10 13:01:26,867][25689] Fps is (10 sec: 5581.1, 60 sec: 5558.7, 300 sec: 5546.8). Total num frames: 751884288. Throughput: 0: 5781.9. Samples: 751887362. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:26,868][25689] Avg episode reward: [(0, '-2.664')] [2022-07-10 13:01:28,883][26022] Updated weights on worker 0-0, policy_version 734270 (0.00084) [2022-07-10 13:01:30,336][26022] Updated weights on worker 0-0, policy_version 734280 (0.00091) [2022-07-10 13:01:31,914][25689] Fps is (10 sec: 5696.2, 60 sec: 5515.2, 300 sec: 5533.3). Total num frames: 751909888. Throughput: 0: 4970.9. Samples: 751903816. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:31,914][25689] Avg episode reward: [(0, '-3.090')] [2022-07-10 13:01:32,397][26022] Updated weights on worker 0-0, policy_version 734290 (0.00083) [2022-07-10 13:01:34,160][26022] Updated weights on worker 0-0, policy_version 734300 (0.00086) [2022-07-10 13:01:36,040][26022] Updated weights on worker 0-0, policy_version 734310 (0.00090) [2022-07-10 13:01:36,948][25689] Fps is (10 sec: 5282.0, 60 sec: 5498.6, 300 sec: 5534.0). Total num frames: 751937536. Throughput: 0: 5775.3. Samples: 751937036. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:36,949][25689] Avg episode reward: [(0, '-1.559')] [2022-07-10 13:01:37,947][26022] Updated weights on worker 0-0, policy_version 734320 (0.00087) [2022-07-10 13:01:39,886][26022] Updated weights on worker 0-0, policy_version 734330 (0.00085) [2022-07-10 13:01:41,663][26022] Updated weights on worker 0-0, policy_version 734340 (0.00089) [2022-07-10 13:01:41,956][25689] Fps is (10 sec: 5506.6, 60 sec: 5501.5, 300 sec: 5534.1). Total num frames: 751965184. Throughput: 0: 5798.8. Samples: 751970262. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:41,956][25689] Avg episode reward: [(0, '-1.349')] [2022-07-10 13:01:43,442][26022] Updated weights on worker 0-0, policy_version 734350 (0.00088) [2022-07-10 13:01:45,319][26022] Updated weights on worker 0-0, policy_version 734360 (0.00085) [2022-07-10 13:01:46,973][25689] Fps is (10 sec: 5618.0, 60 sec: 5538.5, 300 sec: 5536.3). Total num frames: 751993856. Throughput: 0: 4956.2. Samples: 751987010. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:46,974][25689] Avg episode reward: [(0, '-2.456')] [2022-07-10 13:01:47,012][26022] Updated weights on worker 0-0, policy_version 734370 (0.00080) [2022-07-10 13:01:49,115][26022] Updated weights on worker 0-0, policy_version 734380 (0.00086) [2022-07-10 13:01:50,830][26022] Updated weights on worker 0-0, policy_version 734390 (0.00090) [2022-07-10 13:01:52,053][25689] Fps is (10 sec: 5577.6, 60 sec: 5503.9, 300 sec: 5532.3). Total num frames: 752021504. Throughput: 0: 5788.9. Samples: 752020400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:52,054][25689] Avg episode reward: [(0, '-2.305')] [2022-07-10 13:01:52,666][26022] Updated weights on worker 0-0, policy_version 734400 (0.00093) [2022-07-10 13:01:54,421][26022] Updated weights on worker 0-0, policy_version 734410 (0.00296) [2022-07-10 13:01:56,241][26022] Updated weights on worker 0-0, policy_version 734420 (0.00083) [2022-07-10 13:01:57,065][25689] Fps is (10 sec: 5580.5, 60 sec: 5522.1, 300 sec: 5537.2). Total num frames: 752050176. Throughput: 0: 5819.0. Samples: 752054098. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:01:57,066][25689] Avg episode reward: [(0, '-3.296')] [2022-07-10 13:01:58,094][26022] Updated weights on worker 0-0, policy_version 734430 (0.00394) [2022-07-10 13:02:00,046][26022] Updated weights on worker 0-0, policy_version 734440 (0.00080) [2022-07-10 13:02:02,052][26022] Updated weights on worker 0-0, policy_version 734450 (0.00093) [2022-07-10 13:02:02,119][25689] Fps is (10 sec: 5493.6, 60 sec: 5484.5, 300 sec: 5532.8). Total num frames: 752076800. Throughput: 0: 5739.7. Samples: 752085992. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:02,119][25689] Avg episode reward: [(0, '-4.737')] [2022-07-10 13:02:03,977][26022] Updated weights on worker 0-0, policy_version 734460 (0.00086) [2022-07-10 13:02:05,763][26022] Updated weights on worker 0-0, policy_version 734470 (0.00088) [2022-07-10 13:02:07,123][25689] Fps is (10 sec: 5396.3, 60 sec: 5520.3, 300 sec: 5540.4). Total num frames: 752104448. Throughput: 0: 5721.2. Samples: 752102290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:07,123][25689] Avg episode reward: [(0, '-5.231')] [2022-07-10 13:02:07,768][26022] Updated weights on worker 0-0, policy_version 734480 (0.00090) [2022-07-10 13:02:09,479][26022] Updated weights on worker 0-0, policy_version 734490 (0.00090) [2022-07-10 13:02:11,499][26022] Updated weights on worker 0-0, policy_version 734500 (0.00087) [2022-07-10 13:02:12,244][25689] Fps is (10 sec: 5562.1, 60 sec: 5554.3, 300 sec: 5538.7). Total num frames: 752133120. Throughput: 0: 5714.7. Samples: 752135786. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:12,245][25689] Avg episode reward: [(0, '-4.763')] [2022-07-10 13:02:13,078][26022] Updated weights on worker 0-0, policy_version 734510 (0.00056) [2022-07-10 13:02:15,148][26022] Updated weights on worker 0-0, policy_version 734520 (0.00086) [2022-07-10 13:02:16,654][26022] Updated weights on worker 0-0, policy_version 734530 (0.00089) [2022-07-10 13:02:17,263][25689] Fps is (10 sec: 5554.2, 60 sec: 5502.0, 300 sec: 5535.1). Total num frames: 752160768. Throughput: 0: 5711.1. Samples: 752169448. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:17,263][25689] Avg episode reward: [(0, '-3.322')] [2022-07-10 13:02:18,696][26022] Updated weights on worker 0-0, policy_version 734540 (0.00080) [2022-07-10 13:02:20,500][26022] Updated weights on worker 0-0, policy_version 734550 (0.00052) [2022-07-10 13:02:22,332][25689] Fps is (10 sec: 5582.9, 60 sec: 5569.9, 300 sec: 5537.7). Total num frames: 752189440. Throughput: 0: 4952.8. Samples: 752186106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:22,333][25689] Avg episode reward: [(0, '-3.594')] [2022-07-10 13:02:22,335][26022] Updated weights on worker 0-0, policy_version 734560 (0.00089) [2022-07-10 13:02:24,246][26022] Updated weights on worker 0-0, policy_version 734570 (0.00099) [2022-07-10 13:02:26,048][26022] Updated weights on worker 0-0, policy_version 734580 (0.00094) [2022-07-10 13:02:27,377][25689] Fps is (10 sec: 5467.1, 60 sec: 5483.0, 300 sec: 5536.1). Total num frames: 752216064. Throughput: 0: 5774.3. Samples: 752219244. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:27,378][25689] Avg episode reward: [(0, '-3.418')] [2022-07-10 13:02:27,835][26022] Updated weights on worker 0-0, policy_version 734590 (0.00088) [2022-07-10 13:02:30,204][26022] Updated weights on worker 0-0, policy_version 734600 (0.00102) [2022-07-10 13:02:31,434][26022] Updated weights on worker 0-0, policy_version 734610 (0.00094) [2022-07-10 13:02:32,479][25689] Fps is (10 sec: 5449.7, 60 sec: 5528.7, 300 sec: 5535.0). Total num frames: 752244736. Throughput: 0: 5757.9. Samples: 752252294. Policy #0 lag: (min: 0.0, avg: 8.6, max: 18.0) [2022-07-10 13:02:32,479][25689] Avg episode reward: [(0, '-3.021')] [2022-07-10 13:02:33,320][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:02:33,332][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000734618_752248832.pth [2022-07-10 13:02:33,332][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000732671_750255104.pth [2022-07-10 13:02:33,811][26022] Updated weights on worker 0-0, policy_version 734620 (0.00084) [2022-07-10 13:02:35,053][26022] Updated weights on worker 0-0, policy_version 734630 (0.00095) [2022-07-10 13:02:37,357][26022] Updated weights on worker 0-0, policy_version 734640 (0.00082) [2022-07-10 13:02:37,534][25689] Fps is (10 sec: 5545.0, 60 sec: 5526.8, 300 sec: 5531.2). Total num frames: 752272384. Throughput: 0: 4909.5. Samples: 752268968. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:02:37,535][25689] Avg episode reward: [(0, '-4.441')] [2022-07-10 13:02:38,960][26022] Updated weights on worker 0-0, policy_version 734650 (0.00086) [2022-07-10 13:02:40,954][26022] Updated weights on worker 0-0, policy_version 734660 (0.00088) [2022-07-10 13:02:42,537][25689] Fps is (10 sec: 5497.7, 60 sec: 5527.3, 300 sec: 5532.7). Total num frames: 752300032. Throughput: 0: 5746.2. Samples: 752302206. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:02:42,537][25689] Avg episode reward: [(0, '-4.382')] [2022-07-10 13:02:42,850][26022] Updated weights on worker 0-0, policy_version 734670 (0.00082) [2022-07-10 13:02:44,577][26022] Updated weights on worker 0-0, policy_version 734680 (0.00084) [2022-07-10 13:02:46,550][26022] Updated weights on worker 0-0, policy_version 734690 (0.00092) [2022-07-10 13:02:47,541][25689] Fps is (10 sec: 5628.1, 60 sec: 5528.5, 300 sec: 5537.7). Total num frames: 752328704. Throughput: 0: 5765.4. Samples: 752335496. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:02:47,541][25689] Avg episode reward: [(0, '-5.876')] [2022-07-10 13:02:48,297][26022] Updated weights on worker 0-0, policy_version 734700 (0.00084) [2022-07-10 13:02:50,328][26022] Updated weights on worker 0-0, policy_version 734710 (0.00093) [2022-07-10 13:02:52,095][26022] Updated weights on worker 0-0, policy_version 734720 (0.00081) [2022-07-10 13:02:52,659][25689] Fps is (10 sec: 5462.9, 60 sec: 5508.1, 300 sec: 5530.3). Total num frames: 752355328. Throughput: 0: 4937.0. Samples: 752351924. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:02:52,660][25689] Avg episode reward: [(0, '-7.468')] [2022-07-10 13:02:53,927][26022] Updated weights on worker 0-0, policy_version 734730 (0.00102) [2022-07-10 13:02:56,022][26022] Updated weights on worker 0-0, policy_version 734740 (0.00091) [2022-07-10 13:02:57,534][26022] Updated weights on worker 0-0, policy_version 734750 (0.00085) [2022-07-10 13:02:57,670][25689] Fps is (10 sec: 5560.5, 60 sec: 5525.2, 300 sec: 5533.9). Total num frames: 752385024. Throughput: 0: 5759.9. Samples: 752384948. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:02:57,670][25689] Avg episode reward: [(0, '-5.686')] [2022-07-10 13:02:59,435][26022] Updated weights on worker 0-0, policy_version 734760 (0.00083) [2022-07-10 13:03:01,431][26022] Updated weights on worker 0-0, policy_version 734770 (0.00105) [2022-07-10 13:03:02,748][25689] Fps is (10 sec: 5278.1, 60 sec: 5472.2, 300 sec: 5522.6). Total num frames: 752408576. Throughput: 0: 5640.6. Samples: 752416206. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:02,748][25689] Avg episode reward: [(0, '-4.302')] [2022-07-10 13:03:03,440][26022] Updated weights on worker 0-0, policy_version 734780 (0.00091) [2022-07-10 13:03:05,462][26022] Updated weights on worker 0-0, policy_version 734790 (0.00090) [2022-07-10 13:03:07,111][26022] Updated weights on worker 0-0, policy_version 734800 (0.00087) [2022-07-10 13:03:07,750][25689] Fps is (10 sec: 5282.5, 60 sec: 5506.2, 300 sec: 5531.2). Total num frames: 752438272. Throughput: 0: 4818.4. Samples: 752432870. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:07,750][25689] Avg episode reward: [(0, '-4.831')] [2022-07-10 13:03:09,294][26022] Updated weights on worker 0-0, policy_version 734810 (0.00090) [2022-07-10 13:03:10,832][26022] Updated weights on worker 0-0, policy_version 734820 (0.00086) [2022-07-10 13:03:12,625][26022] Updated weights on worker 0-0, policy_version 734830 (0.00086) [2022-07-10 13:03:12,802][25689] Fps is (10 sec: 5805.3, 60 sec: 5512.5, 300 sec: 5531.7). Total num frames: 752466944. Throughput: 0: 5672.2. Samples: 752466178. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:12,802][25689] Avg episode reward: [(0, '-3.293')] [2022-07-10 13:03:14,659][26022] Updated weights on worker 0-0, policy_version 734840 (0.00091) [2022-07-10 13:03:16,391][26022] Updated weights on worker 0-0, policy_version 734850 (0.00095) [2022-07-10 13:03:17,846][25689] Fps is (10 sec: 5477.0, 60 sec: 5493.3, 300 sec: 5525.8). Total num frames: 752493568. Throughput: 0: 5695.0. Samples: 752499852. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:17,846][25689] Avg episode reward: [(0, '-3.617')] [2022-07-10 13:03:18,362][26022] Updated weights on worker 0-0, policy_version 734860 (0.00884) [2022-07-10 13:03:20,196][26022] Updated weights on worker 0-0, policy_version 734870 (0.00092) [2022-07-10 13:03:21,834][26022] Updated weights on worker 0-0, policy_version 734880 (0.00087) [2022-07-10 13:03:22,862][25689] Fps is (10 sec: 5394.5, 60 sec: 5481.2, 300 sec: 5525.6). Total num frames: 752521216. Throughput: 0: 4987.9. Samples: 752516536. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:22,863][25689] Avg episode reward: [(0, '-3.009')] [2022-07-10 13:03:23,967][26022] Updated weights on worker 0-0, policy_version 734890 (0.00086) [2022-07-10 13:03:25,547][26022] Updated weights on worker 0-0, policy_version 734900 (0.00089) [2022-07-10 13:03:27,522][26022] Updated weights on worker 0-0, policy_version 734910 (0.00086) [2022-07-10 13:03:27,885][25689] Fps is (10 sec: 5609.7, 60 sec: 5517.0, 300 sec: 5527.5). Total num frames: 752549888. Throughput: 0: 5814.0. Samples: 752549940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:27,886][25689] Avg episode reward: [(0, '-2.883')] [2022-07-10 13:03:29,197][26022] Updated weights on worker 0-0, policy_version 734920 (0.00089) [2022-07-10 13:03:31,231][26022] Updated weights on worker 0-0, policy_version 734930 (0.00083) [2022-07-10 13:03:32,927][26022] Updated weights on worker 0-0, policy_version 734940 (0.00087) [2022-07-10 13:03:33,002][25689] Fps is (10 sec: 5655.5, 60 sec: 5515.7, 300 sec: 5522.4). Total num frames: 752578560. Throughput: 0: 5811.0. Samples: 752583560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:33,002][25689] Avg episode reward: [(0, '-1.879')] [2022-07-10 13:03:34,702][26022] Updated weights on worker 0-0, policy_version 734950 (0.00085) [2022-07-10 13:03:36,572][26022] Updated weights on worker 0-0, policy_version 734960 (0.00092) [2022-07-10 13:03:38,046][25689] Fps is (10 sec: 5643.5, 60 sec: 5533.5, 300 sec: 5528.7). Total num frames: 752607232. Throughput: 0: 4982.5. Samples: 752600502. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:38,047][25689] Avg episode reward: [(0, '-1.329')] [2022-07-10 13:03:38,567][26022] Updated weights on worker 0-0, policy_version 734970 (0.00083) [2022-07-10 13:03:40,303][26022] Updated weights on worker 0-0, policy_version 734980 (0.00090) [2022-07-10 13:03:42,118][26022] Updated weights on worker 0-0, policy_version 734990 (0.00094) [2022-07-10 13:03:43,094][25689] Fps is (10 sec: 5580.8, 60 sec: 5529.5, 300 sec: 5521.2). Total num frames: 752634880. Throughput: 0: 5809.1. Samples: 752634062. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:43,096][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 13:03:43,971][26022] Updated weights on worker 0-0, policy_version 735000 (0.00090) [2022-07-10 13:03:45,828][26022] Updated weights on worker 0-0, policy_version 735010 (0.00092) [2022-07-10 13:03:47,626][26022] Updated weights on worker 0-0, policy_version 735020 (0.00091) [2022-07-10 13:03:48,102][25689] Fps is (10 sec: 5499.3, 60 sec: 5512.2, 300 sec: 5525.5). Total num frames: 752662528. Throughput: 0: 5822.5. Samples: 752667648. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:48,102][25689] Avg episode reward: [(0, '-0.967')] [2022-07-10 13:03:49,455][26022] Updated weights on worker 0-0, policy_version 735030 (0.00095) [2022-07-10 13:03:51,370][26022] Updated weights on worker 0-0, policy_version 735040 (0.00099) [2022-07-10 13:03:53,207][25689] Fps is (10 sec: 5568.7, 60 sec: 5547.2, 300 sec: 5524.2). Total num frames: 752691200. Throughput: 0: 4985.7. Samples: 752684298. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:53,208][25689] Avg episode reward: [(0, '-0.384')] [2022-07-10 13:03:53,210][26022] Updated weights on worker 0-0, policy_version 735050 (0.00093) [2022-07-10 13:03:55,198][26022] Updated weights on worker 0-0, policy_version 735060 (0.00087) [2022-07-10 13:03:56,960][26022] Updated weights on worker 0-0, policy_version 735070 (0.00088) [2022-07-10 13:03:58,296][25689] Fps is (10 sec: 5524.6, 60 sec: 5506.2, 300 sec: 5523.4). Total num frames: 752718848. Throughput: 0: 5769.1. Samples: 752717324. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:03:58,296][25689] Avg episode reward: [(0, '-0.856')] [2022-07-10 13:03:58,856][26022] Updated weights on worker 0-0, policy_version 735080 (0.00109) [2022-07-10 13:04:00,527][26022] Updated weights on worker 0-0, policy_version 735090 (0.00157) [2022-07-10 13:04:02,665][26022] Updated weights on worker 0-0, policy_version 735100 (0.00091) [2022-07-10 13:04:03,301][25689] Fps is (10 sec: 5377.1, 60 sec: 5563.7, 300 sec: 5527.3). Total num frames: 752745472. Throughput: 0: 5774.8. Samples: 752750754. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:03,301][25689] Avg episode reward: [(0, '-0.738')] [2022-07-10 13:04:04,552][26022] Updated weights on worker 0-0, policy_version 735110 (0.00090) [2022-07-10 13:04:06,287][26022] Updated weights on worker 0-0, policy_version 735120 (0.00090) [2022-07-10 13:04:08,341][25689] Fps is (10 sec: 5402.9, 60 sec: 5526.3, 300 sec: 5528.7). Total num frames: 752773120. Throughput: 0: 5706.8. Samples: 752783150. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:08,341][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 13:04:08,342][26022] Updated weights on worker 0-0, policy_version 735130 (0.00087) [2022-07-10 13:04:09,972][26022] Updated weights on worker 0-0, policy_version 735140 (0.00100) [2022-07-10 13:04:12,021][26022] Updated weights on worker 0-0, policy_version 735150 (0.00082) [2022-07-10 13:04:13,439][25689] Fps is (10 sec: 5656.3, 60 sec: 5539.1, 300 sec: 5534.0). Total num frames: 752802816. Throughput: 0: 5690.8. Samples: 752799430. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:13,439][25689] Avg episode reward: [(0, '-1.482')] [2022-07-10 13:04:13,638][26022] Updated weights on worker 0-0, policy_version 735160 (0.00087) [2022-07-10 13:04:15,602][26022] Updated weights on worker 0-0, policy_version 735170 (0.00088) [2022-07-10 13:04:17,355][26022] Updated weights on worker 0-0, policy_version 735180 (0.00092) [2022-07-10 13:04:18,524][25689] Fps is (10 sec: 5631.5, 60 sec: 5552.2, 300 sec: 5526.6). Total num frames: 752830464. Throughput: 0: 5717.4. Samples: 752832974. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:18,524][25689] Avg episode reward: [(0, '-2.079')] [2022-07-10 13:04:19,358][26022] Updated weights on worker 0-0, policy_version 735190 (0.00095) [2022-07-10 13:04:21,113][26022] Updated weights on worker 0-0, policy_version 735200 (0.00086) [2022-07-10 13:04:23,056][26022] Updated weights on worker 0-0, policy_version 735210 (0.00081) [2022-07-10 13:04:23,578][25689] Fps is (10 sec: 5352.6, 60 sec: 5531.9, 300 sec: 5526.1). Total num frames: 752857088. Throughput: 0: 5697.2. Samples: 752866278. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:23,578][25689] Avg episode reward: [(0, '-1.715')] [2022-07-10 13:04:24,723][26022] Updated weights on worker 0-0, policy_version 735220 (0.00467) [2022-07-10 13:04:26,812][26022] Updated weights on worker 0-0, policy_version 735230 (0.00110) [2022-07-10 13:04:28,466][26022] Updated weights on worker 0-0, policy_version 735240 (0.00090) [2022-07-10 13:04:28,648][25689] Fps is (10 sec: 5562.7, 60 sec: 5544.4, 300 sec: 5529.7). Total num frames: 752886784. Throughput: 0: 4912.3. Samples: 752882906. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:28,649][25689] Avg episode reward: [(0, '-1.184')] [2022-07-10 13:04:30,358][26022] Updated weights on worker 0-0, policy_version 735250 (0.00415) [2022-07-10 13:04:32,055][26022] Updated weights on worker 0-0, policy_version 735260 (0.00091) [2022-07-10 13:04:33,511][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:04:33,520][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000735268_752914432.pth [2022-07-10 13:04:33,522][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000733322_750921728.pth [2022-07-10 13:04:33,723][25689] Fps is (10 sec: 5652.4, 60 sec: 5531.4, 300 sec: 5528.8). Total num frames: 752914432. Throughput: 0: 5781.4. Samples: 752916704. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:33,724][25689] Avg episode reward: [(0, '-1.751')] [2022-07-10 13:04:34,145][26022] Updated weights on worker 0-0, policy_version 735270 (0.00095) [2022-07-10 13:04:35,710][26022] Updated weights on worker 0-0, policy_version 735280 (0.00086) [2022-07-10 13:04:37,698][26022] Updated weights on worker 0-0, policy_version 735290 (0.00077) [2022-07-10 13:04:38,759][25689] Fps is (10 sec: 5570.4, 60 sec: 5532.2, 300 sec: 5528.3). Total num frames: 752943104. Throughput: 0: 5814.3. Samples: 752950626. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:38,759][25689] Avg episode reward: [(0, '-5.020')] [2022-07-10 13:04:39,386][26022] Updated weights on worker 0-0, policy_version 735300 (0.00093) [2022-07-10 13:04:41,279][26022] Updated weights on worker 0-0, policy_version 735310 (0.00086) [2022-07-10 13:04:43,317][26022] Updated weights on worker 0-0, policy_version 735320 (0.00087) [2022-07-10 13:04:43,851][25689] Fps is (10 sec: 5661.9, 60 sec: 5544.9, 300 sec: 5527.6). Total num frames: 752971776. Throughput: 0: 4989.9. Samples: 752967438. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:43,852][25689] Avg episode reward: [(0, '-5.163')] [2022-07-10 13:04:44,797][26022] Updated weights on worker 0-0, policy_version 735330 (0.00084) [2022-07-10 13:04:46,812][26022] Updated weights on worker 0-0, policy_version 735340 (0.00090) [2022-07-10 13:04:48,519][26022] Updated weights on worker 0-0, policy_version 735350 (0.00086) [2022-07-10 13:04:48,879][25689] Fps is (10 sec: 5565.2, 60 sec: 5543.1, 300 sec: 5528.7). Total num frames: 752999424. Throughput: 0: 5846.2. Samples: 753001178. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:48,879][25689] Avg episode reward: [(0, '-6.150')] [2022-07-10 13:04:50,492][26022] Updated weights on worker 0-0, policy_version 735360 (0.00109) [2022-07-10 13:04:52,359][26022] Updated weights on worker 0-0, policy_version 735370 (0.00094) [2022-07-10 13:04:53,945][25689] Fps is (10 sec: 5579.7, 60 sec: 5546.7, 300 sec: 5524.1). Total num frames: 753028096. Throughput: 0: 5826.0. Samples: 753034516. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:53,945][25689] Avg episode reward: [(0, '-7.702')] [2022-07-10 13:04:54,073][26022] Updated weights on worker 0-0, policy_version 735380 (0.00086) [2022-07-10 13:04:55,988][26022] Updated weights on worker 0-0, policy_version 735390 (0.00091) [2022-07-10 13:04:57,988][26022] Updated weights on worker 0-0, policy_version 735400 (0.00621) [2022-07-10 13:04:58,960][25689] Fps is (10 sec: 5586.6, 60 sec: 5553.5, 300 sec: 5524.5). Total num frames: 753055744. Throughput: 0: 4978.5. Samples: 753051198. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:04:58,961][25689] Avg episode reward: [(0, '-7.822')] [2022-07-10 13:04:59,548][26022] Updated weights on worker 0-0, policy_version 735410 (0.00090) [2022-07-10 13:05:01,638][26022] Updated weights on worker 0-0, policy_version 735420 (0.00095) [2022-07-10 13:05:03,472][26022] Updated weights on worker 0-0, policy_version 735430 (0.00080) [2022-07-10 13:05:03,980][25689] Fps is (10 sec: 5408.4, 60 sec: 5552.1, 300 sec: 5528.9). Total num frames: 753082368. Throughput: 0: 5744.3. Samples: 753083062. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:03,980][25689] Avg episode reward: [(0, '-8.741')] [2022-07-10 13:05:05,554][26022] Updated weights on worker 0-0, policy_version 735440 (0.00088) [2022-07-10 13:05:07,384][26022] Updated weights on worker 0-0, policy_version 735450 (0.00086) [2022-07-10 13:05:08,986][25689] Fps is (10 sec: 5413.2, 60 sec: 5555.2, 300 sec: 5529.7). Total num frames: 753110016. Throughput: 0: 5722.3. Samples: 753116238. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:08,987][25689] Avg episode reward: [(0, '-7.342')] [2022-07-10 13:05:09,029][26022] Updated weights on worker 0-0, policy_version 735460 (0.00093) [2022-07-10 13:05:11,119][26022] Updated weights on worker 0-0, policy_version 735470 (0.00093) [2022-07-10 13:05:12,819][26022] Updated weights on worker 0-0, policy_version 735480 (0.00092) [2022-07-10 13:05:14,059][25689] Fps is (10 sec: 5486.1, 60 sec: 5523.7, 300 sec: 5524.9). Total num frames: 753137664. Throughput: 0: 4881.2. Samples: 753132696. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:14,060][25689] Avg episode reward: [(0, '-5.785')] [2022-07-10 13:05:14,810][26022] Updated weights on worker 0-0, policy_version 735490 (0.00088) [2022-07-10 13:05:16,840][26022] Updated weights on worker 0-0, policy_version 735500 (0.00095) [2022-07-10 13:05:18,497][26022] Updated weights on worker 0-0, policy_version 735510 (0.00099) [2022-07-10 13:05:19,121][25689] Fps is (10 sec: 5456.2, 60 sec: 5525.8, 300 sec: 5527.6). Total num frames: 753165312. Throughput: 0: 5685.4. Samples: 753165818. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:19,121][25689] Avg episode reward: [(0, '-4.717')] [2022-07-10 13:05:20,425][26022] Updated weights on worker 0-0, policy_version 735520 (0.00091) [2022-07-10 13:05:22,254][26022] Updated weights on worker 0-0, policy_version 735530 (0.00086) [2022-07-10 13:05:23,931][26022] Updated weights on worker 0-0, policy_version 735540 (0.00087) [2022-07-10 13:05:24,145][25689] Fps is (10 sec: 5584.1, 60 sec: 5562.4, 300 sec: 5527.6). Total num frames: 753193984. Throughput: 0: 5766.2. Samples: 753199338. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:24,146][25689] Avg episode reward: [(0, '-2.846')] [2022-07-10 13:05:25,953][26022] Updated weights on worker 0-0, policy_version 735550 (0.00082) [2022-07-10 13:05:27,689][26022] Updated weights on worker 0-0, policy_version 735560 (0.00088) [2022-07-10 13:05:29,162][25689] Fps is (10 sec: 5507.1, 60 sec: 5516.5, 300 sec: 5522.8). Total num frames: 753220608. Throughput: 0: 4948.2. Samples: 753216070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:29,162][25689] Avg episode reward: [(0, '-2.622')] [2022-07-10 13:05:29,503][26022] Updated weights on worker 0-0, policy_version 735570 (0.00088) [2022-07-10 13:05:31,438][26022] Updated weights on worker 0-0, policy_version 735580 (0.00086) [2022-07-10 13:05:33,033][26022] Updated weights on worker 0-0, policy_version 735590 (0.00080) [2022-07-10 13:05:34,230][25689] Fps is (10 sec: 5483.0, 60 sec: 5534.1, 300 sec: 5522.2). Total num frames: 753249280. Throughput: 0: 5789.1. Samples: 753249466. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:34,230][25689] Avg episode reward: [(0, '-0.381')] [2022-07-10 13:05:35,299][26022] Updated weights on worker 0-0, policy_version 735600 (0.00088) [2022-07-10 13:05:36,531][26022] Updated weights on worker 0-0, policy_version 735610 (0.00096) [2022-07-10 13:05:38,873][26022] Updated weights on worker 0-0, policy_version 735620 (0.00091) [2022-07-10 13:05:39,246][25689] Fps is (10 sec: 5584.8, 60 sec: 5518.9, 300 sec: 5522.6). Total num frames: 753276928. Throughput: 0: 5820.3. Samples: 753282954. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:39,247][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 13:05:40,759][26022] Updated weights on worker 0-0, policy_version 735630 (0.00089) [2022-07-10 13:05:42,338][26022] Updated weights on worker 0-0, policy_version 735640 (0.00087) [2022-07-10 13:05:44,254][25689] Fps is (10 sec: 5516.4, 60 sec: 5509.7, 300 sec: 5526.9). Total num frames: 753304576. Throughput: 0: 4976.7. Samples: 753299414. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:44,254][25689] Avg episode reward: [(0, '0.057')] [2022-07-10 13:05:44,302][26022] Updated weights on worker 0-0, policy_version 735650 (0.00089) [2022-07-10 13:05:46,076][26022] Updated weights on worker 0-0, policy_version 735660 (0.00088) [2022-07-10 13:05:47,991][26022] Updated weights on worker 0-0, policy_version 735670 (0.00092) [2022-07-10 13:05:49,259][25689] Fps is (10 sec: 5522.5, 60 sec: 5511.7, 300 sec: 5521.2). Total num frames: 753332224. Throughput: 0: 5810.3. Samples: 753332842. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:49,260][25689] Avg episode reward: [(0, '0.013')] [2022-07-10 13:05:49,883][26022] Updated weights on worker 0-0, policy_version 735680 (0.00084) [2022-07-10 13:05:51,554][26022] Updated weights on worker 0-0, policy_version 735690 (0.00106) [2022-07-10 13:05:53,595][26022] Updated weights on worker 0-0, policy_version 735700 (0.00092) [2022-07-10 13:05:54,399][25689] Fps is (10 sec: 5652.3, 60 sec: 5521.9, 300 sec: 5526.0). Total num frames: 753361920. Throughput: 0: 5765.3. Samples: 753365748. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:54,400][25689] Avg episode reward: [(0, '-1.999')] [2022-07-10 13:05:55,300][26022] Updated weights on worker 0-0, policy_version 735710 (0.00089) [2022-07-10 13:05:57,214][26022] Updated weights on worker 0-0, policy_version 735720 (0.00090) [2022-07-10 13:05:59,209][26022] Updated weights on worker 0-0, policy_version 735730 (0.00088) [2022-07-10 13:05:59,416][25689] Fps is (10 sec: 5443.9, 60 sec: 5487.9, 300 sec: 5515.6). Total num frames: 753387520. Throughput: 0: 4926.8. Samples: 753382330. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-10 13:05:59,417][25689] Avg episode reward: [(0, '-1.465')] [2022-07-10 13:06:00,739][26022] Updated weights on worker 0-0, policy_version 735740 (0.00084) [2022-07-10 13:06:03,332][26022] Updated weights on worker 0-0, policy_version 735750 (0.00102) [2022-07-10 13:06:04,423][25689] Fps is (10 sec: 5414.3, 60 sec: 5522.9, 300 sec: 5526.3). Total num frames: 753416192. Throughput: 0: 5680.2. Samples: 753413978. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:04,423][25689] Avg episode reward: [(0, '-3.119')] [2022-07-10 13:06:04,919][26022] Updated weights on worker 0-0, policy_version 735760 (0.00091) [2022-07-10 13:06:06,739][26022] Updated weights on worker 0-0, policy_version 735770 (0.00091) [2022-07-10 13:06:08,669][26022] Updated weights on worker 0-0, policy_version 735780 (0.00084) [2022-07-10 13:06:09,521][25689] Fps is (10 sec: 5472.4, 60 sec: 5497.7, 300 sec: 5526.7). Total num frames: 753442816. Throughput: 0: 5654.9. Samples: 753447420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:09,521][25689] Avg episode reward: [(0, '-4.328')] [2022-07-10 13:06:10,477][26022] Updated weights on worker 0-0, policy_version 735790 (0.00096) [2022-07-10 13:06:12,394][26022] Updated weights on worker 0-0, policy_version 735800 (0.00104) [2022-07-10 13:06:14,034][26022] Updated weights on worker 0-0, policy_version 735810 (0.00092) [2022-07-10 13:06:14,646][25689] Fps is (10 sec: 5508.9, 60 sec: 5526.7, 300 sec: 5521.0). Total num frames: 753472512. Throughput: 0: 5695.3. Samples: 753481062. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:14,647][25689] Avg episode reward: [(0, '-5.588')] [2022-07-10 13:06:15,947][26022] Updated weights on worker 0-0, policy_version 735820 (0.00094) [2022-07-10 13:06:17,616][26022] Updated weights on worker 0-0, policy_version 735830 (0.00085) [2022-07-10 13:06:19,549][26022] Updated weights on worker 0-0, policy_version 735840 (0.00110) [2022-07-10 13:06:19,703][25689] Fps is (10 sec: 5631.8, 60 sec: 5527.1, 300 sec: 5531.6). Total num frames: 753500160. Throughput: 0: 5700.4. Samples: 753497972. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:19,704][25689] Avg episode reward: [(0, '-4.219')] [2022-07-10 13:06:21,287][26022] Updated weights on worker 0-0, policy_version 735850 (0.00056) [2022-07-10 13:06:23,343][26022] Updated weights on worker 0-0, policy_version 735860 (0.00087) [2022-07-10 13:06:24,740][25689] Fps is (10 sec: 5579.5, 60 sec: 5526.0, 300 sec: 5521.0). Total num frames: 753528832. Throughput: 0: 5780.7. Samples: 753531428. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:24,741][25689] Avg episode reward: [(0, '-3.901')] [2022-07-10 13:06:25,053][26022] Updated weights on worker 0-0, policy_version 735870 (0.00087) [2022-07-10 13:06:27,147][26022] Updated weights on worker 0-0, policy_version 735880 (0.00089) [2022-07-10 13:06:28,816][26022] Updated weights on worker 0-0, policy_version 735890 (0.00089) [2022-07-10 13:06:29,836][25689] Fps is (10 sec: 5558.1, 60 sec: 5535.6, 300 sec: 5526.9). Total num frames: 753556480. Throughput: 0: 5784.9. Samples: 753564942. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:29,837][25689] Avg episode reward: [(0, '-4.941')] [2022-07-10 13:06:30,679][26022] Updated weights on worker 0-0, policy_version 735900 (0.00085) [2022-07-10 13:06:32,417][26022] Updated weights on worker 0-0, policy_version 735910 (0.00096) [2022-07-10 13:06:33,536][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:06:33,550][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000735915_753576960.pth [2022-07-10 13:06:33,550][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000733971_751586304.pth [2022-07-10 13:06:34,211][26022] Updated weights on worker 0-0, policy_version 735920 (0.00089) [2022-07-10 13:06:34,922][25689] Fps is (10 sec: 5531.4, 60 sec: 5534.0, 300 sec: 5529.4). Total num frames: 753585152. Throughput: 0: 4960.5. Samples: 753581640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:34,924][25689] Avg episode reward: [(0, '-4.987')] [2022-07-10 13:06:36,200][26022] Updated weights on worker 0-0, policy_version 735930 (0.00091) [2022-07-10 13:06:37,871][26022] Updated weights on worker 0-0, policy_version 735940 (0.00085) [2022-07-10 13:06:39,947][25689] Fps is (10 sec: 5469.1, 60 sec: 5516.4, 300 sec: 5525.6). Total num frames: 753611776. Throughput: 0: 5785.3. Samples: 753615088. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:39,947][25689] Avg episode reward: [(0, '-2.813')] [2022-07-10 13:06:40,062][26022] Updated weights on worker 0-0, policy_version 735950 (0.00091) [2022-07-10 13:06:41,692][26022] Updated weights on worker 0-0, policy_version 735960 (0.00113) [2022-07-10 13:06:43,624][26022] Updated weights on worker 0-0, policy_version 735970 (0.00101) [2022-07-10 13:06:44,955][25689] Fps is (10 sec: 5613.8, 60 sec: 5550.1, 300 sec: 5529.3). Total num frames: 753641472. Throughput: 0: 5803.5. Samples: 753648742. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:44,955][25689] Avg episode reward: [(0, '-1.319')] [2022-07-10 13:06:45,143][26022] Updated weights on worker 0-0, policy_version 735980 (0.00095) [2022-07-10 13:06:47,268][26022] Updated weights on worker 0-0, policy_version 735990 (0.00085) [2022-07-10 13:06:48,979][26022] Updated weights on worker 0-0, policy_version 736000 (0.00095) [2022-07-10 13:06:49,961][25689] Fps is (10 sec: 5623.9, 60 sec: 5533.1, 300 sec: 5527.2). Total num frames: 753668096. Throughput: 0: 4996.8. Samples: 753665502. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:49,962][25689] Avg episode reward: [(0, '-2.094')] [2022-07-10 13:06:50,864][26022] Updated weights on worker 0-0, policy_version 736010 (0.00087) [2022-07-10 13:06:52,792][26022] Updated weights on worker 0-0, policy_version 736020 (0.00090) [2022-07-10 13:06:54,423][26022] Updated weights on worker 0-0, policy_version 736030 (0.00084) [2022-07-10 13:06:55,057][25689] Fps is (10 sec: 5473.6, 60 sec: 5520.3, 300 sec: 5525.6). Total num frames: 753696768. Throughput: 0: 5825.8. Samples: 753698942. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:06:55,058][25689] Avg episode reward: [(0, '-0.939')] [2022-07-10 13:06:56,091][26022] Updated weights on worker 0-0, policy_version 736040 (0.00095) [2022-07-10 13:06:58,262][26022] Updated weights on worker 0-0, policy_version 736050 (0.00083) [2022-07-10 13:06:59,894][26022] Updated weights on worker 0-0, policy_version 736060 (0.00088) [2022-07-10 13:07:00,066][25689] Fps is (10 sec: 5674.6, 60 sec: 5571.6, 300 sec: 5533.3). Total num frames: 753725440. Throughput: 0: 5840.4. Samples: 753732596. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:00,067][25689] Avg episode reward: [(0, '-1.274')] [2022-07-10 13:07:02,228][26022] Updated weights on worker 0-0, policy_version 736070 (0.00093) [2022-07-10 13:07:03,899][26022] Updated weights on worker 0-0, policy_version 736080 (0.00122) [2022-07-10 13:07:05,079][25689] Fps is (10 sec: 5313.3, 60 sec: 5503.6, 300 sec: 5522.9). Total num frames: 753750016. Throughput: 0: 4897.8. Samples: 753747306. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:05,079][25689] Avg episode reward: [(0, '-1.223')] [2022-07-10 13:07:05,819][26022] Updated weights on worker 0-0, policy_version 736090 (0.00093) [2022-07-10 13:07:07,691][26022] Updated weights on worker 0-0, policy_version 736100 (0.00085) [2022-07-10 13:07:09,539][26022] Updated weights on worker 0-0, policy_version 736110 (0.00089) [2022-07-10 13:07:10,089][25689] Fps is (10 sec: 5312.9, 60 sec: 5545.4, 300 sec: 5524.9). Total num frames: 753778688. Throughput: 0: 5726.6. Samples: 753780766. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:10,089][25689] Avg episode reward: [(0, '-1.273')] [2022-07-10 13:07:11,486][26022] Updated weights on worker 0-0, policy_version 736120 (0.00088) [2022-07-10 13:07:13,344][26022] Updated weights on worker 0-0, policy_version 736130 (0.00092) [2022-07-10 13:07:14,959][26022] Updated weights on worker 0-0, policy_version 736140 (0.00096) [2022-07-10 13:07:15,180][25689] Fps is (10 sec: 5778.5, 60 sec: 5548.5, 300 sec: 5530.5). Total num frames: 753808384. Throughput: 0: 5732.2. Samples: 753814288. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:15,180][25689] Avg episode reward: [(0, '-1.971')] [2022-07-10 13:07:17,093][26022] Updated weights on worker 0-0, policy_version 736150 (0.00367) [2022-07-10 13:07:18,474][26022] Updated weights on worker 0-0, policy_version 736160 (0.00088) [2022-07-10 13:07:20,195][25689] Fps is (10 sec: 5673.9, 60 sec: 5552.3, 300 sec: 5528.0). Total num frames: 753836032. Throughput: 0: 4901.5. Samples: 753831258. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:20,196][25689] Avg episode reward: [(0, '-1.711')] [2022-07-10 13:07:20,520][26022] Updated weights on worker 0-0, policy_version 736170 (0.00094) [2022-07-10 13:07:22,310][26022] Updated weights on worker 0-0, policy_version 736180 (0.00081) [2022-07-10 13:07:24,205][26022] Updated weights on worker 0-0, policy_version 736190 (0.00094) [2022-07-10 13:07:25,227][25689] Fps is (10 sec: 5503.4, 60 sec: 5535.9, 300 sec: 5531.7). Total num frames: 753863680. Throughput: 0: 5833.9. Samples: 753864850. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:25,228][25689] Avg episode reward: [(0, '-2.024')] [2022-07-10 13:07:26,194][26022] Updated weights on worker 0-0, policy_version 736200 (0.00086) [2022-07-10 13:07:27,708][26022] Updated weights on worker 0-0, policy_version 736210 (0.00092) [2022-07-10 13:07:29,685][26022] Updated weights on worker 0-0, policy_version 736220 (0.00092) [2022-07-10 13:07:30,243][25689] Fps is (10 sec: 5503.6, 60 sec: 5543.2, 300 sec: 5529.9). Total num frames: 753891328. Throughput: 0: 5822.3. Samples: 753898108. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:30,243][25689] Avg episode reward: [(0, '-2.378')] [2022-07-10 13:07:31,588][26022] Updated weights on worker 0-0, policy_version 736230 (0.00087) [2022-07-10 13:07:33,399][26022] Updated weights on worker 0-0, policy_version 736240 (0.00089) [2022-07-10 13:07:35,204][26022] Updated weights on worker 0-0, policy_version 736250 (0.00086) [2022-07-10 13:07:35,354][25689] Fps is (10 sec: 5561.3, 60 sec: 5540.9, 300 sec: 5532.2). Total num frames: 753920000. Throughput: 0: 4981.7. Samples: 753914792. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:35,355][25689] Avg episode reward: [(0, '-3.002')] [2022-07-10 13:07:36,890][26022] Updated weights on worker 0-0, policy_version 736260 (0.00091) [2022-07-10 13:07:38,845][26022] Updated weights on worker 0-0, policy_version 736270 (0.00088) [2022-07-10 13:07:40,454][25689] Fps is (10 sec: 5716.3, 60 sec: 5584.8, 300 sec: 5537.3). Total num frames: 753949696. Throughput: 0: 5800.8. Samples: 753948772. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:40,454][25689] Avg episode reward: [(0, '-3.302')] [2022-07-10 13:07:40,756][26022] Updated weights on worker 0-0, policy_version 736280 (0.00083) [2022-07-10 13:07:42,449][26022] Updated weights on worker 0-0, policy_version 736290 (0.00090) [2022-07-10 13:07:44,286][26022] Updated weights on worker 0-0, policy_version 736300 (0.00090) [2022-07-10 13:07:45,521][25689] Fps is (10 sec: 5640.4, 60 sec: 5545.5, 300 sec: 5532.7). Total num frames: 753977344. Throughput: 0: 5790.5. Samples: 753982362. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:45,521][25689] Avg episode reward: [(0, '-3.141')] [2022-07-10 13:07:46,120][26022] Updated weights on worker 0-0, policy_version 736310 (0.00090) [2022-07-10 13:07:47,923][26022] Updated weights on worker 0-0, policy_version 736320 (0.00095) [2022-07-10 13:07:49,763][26022] Updated weights on worker 0-0, policy_version 736330 (0.00088) [2022-07-10 13:07:50,617][25689] Fps is (10 sec: 5440.7, 60 sec: 5554.2, 300 sec: 5536.6). Total num frames: 754004992. Throughput: 0: 4959.0. Samples: 753999144. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:50,617][25689] Avg episode reward: [(0, '-3.566')] [2022-07-10 13:07:51,638][26022] Updated weights on worker 0-0, policy_version 736340 (0.00081) [2022-07-10 13:07:53,489][26022] Updated weights on worker 0-0, policy_version 736350 (0.00095) [2022-07-10 13:07:55,419][26022] Updated weights on worker 0-0, policy_version 736360 (0.00094) [2022-07-10 13:07:55,667][25689] Fps is (10 sec: 5550.8, 60 sec: 5558.4, 300 sec: 5532.4). Total num frames: 754033664. Throughput: 0: 5805.4. Samples: 754032716. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:07:55,668][25689] Avg episode reward: [(0, '-3.282')] [2022-07-10 13:07:57,156][26022] Updated weights on worker 0-0, policy_version 736370 (0.00093) [2022-07-10 13:07:59,140][26022] Updated weights on worker 0-0, policy_version 736380 (0.00080) [2022-07-10 13:08:00,668][25689] Fps is (10 sec: 5603.1, 60 sec: 5542.3, 300 sec: 5547.6). Total num frames: 754061312. Throughput: 0: 5790.5. Samples: 754065826. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:00,669][25689] Avg episode reward: [(0, '-2.623')] [2022-07-10 13:08:01,068][26022] Updated weights on worker 0-0, policy_version 736390 (0.00090) [2022-07-10 13:08:03,141][26022] Updated weights on worker 0-0, policy_version 736400 (0.00093) [2022-07-10 13:08:04,949][26022] Updated weights on worker 0-0, policy_version 736410 (0.00087) [2022-07-10 13:08:05,696][25689] Fps is (10 sec: 5309.7, 60 sec: 5557.8, 300 sec: 5533.4). Total num frames: 754086912. Throughput: 0: 5701.7. Samples: 754097392. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:05,696][25689] Avg episode reward: [(0, '-2.719')] [2022-07-10 13:08:06,638][26022] Updated weights on worker 0-0, policy_version 736420 (0.00088) [2022-07-10 13:08:08,829][26022] Updated weights on worker 0-0, policy_version 736430 (0.00084) [2022-07-10 13:08:10,437][26022] Updated weights on worker 0-0, policy_version 736440 (0.00081) [2022-07-10 13:08:10,722][25689] Fps is (10 sec: 5296.4, 60 sec: 5539.4, 300 sec: 5530.4). Total num frames: 754114560. Throughput: 0: 5722.5. Samples: 754114196. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:10,723][25689] Avg episode reward: [(0, '-3.537')] [2022-07-10 13:08:12,305][26022] Updated weights on worker 0-0, policy_version 736450 (0.00085) [2022-07-10 13:08:14,076][26022] Updated weights on worker 0-0, policy_version 736460 (0.00085) [2022-07-10 13:08:15,741][26022] Updated weights on worker 0-0, policy_version 736470 (0.00084) [2022-07-10 13:08:15,780][25689] Fps is (10 sec: 5788.1, 60 sec: 5559.3, 300 sec: 5543.9). Total num frames: 754145280. Throughput: 0: 5731.6. Samples: 754147994. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:15,780][25689] Avg episode reward: [(0, '-2.295')] [2022-07-10 13:08:17,960][26022] Updated weights on worker 0-0, policy_version 736480 (0.00087) [2022-07-10 13:08:19,458][26022] Updated weights on worker 0-0, policy_version 736490 (0.00085) [2022-07-10 13:08:20,791][25689] Fps is (10 sec: 5695.3, 60 sec: 5542.9, 300 sec: 5540.6). Total num frames: 754171904. Throughput: 0: 5760.2. Samples: 754181734. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:20,791][25689] Avg episode reward: [(0, '-2.245')] [2022-07-10 13:08:21,409][26022] Updated weights on worker 0-0, policy_version 736500 (0.00085) [2022-07-10 13:08:23,050][26022] Updated weights on worker 0-0, policy_version 736510 (0.00081) [2022-07-10 13:08:25,182][26022] Updated weights on worker 0-0, policy_version 736520 (0.00084) [2022-07-10 13:08:25,795][25689] Fps is (10 sec: 5521.2, 60 sec: 5562.3, 300 sec: 5540.9). Total num frames: 754200576. Throughput: 0: 5038.0. Samples: 754198652. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:25,796][25689] Avg episode reward: [(0, '-5.069')] [2022-07-10 13:08:26,715][26022] Updated weights on worker 0-0, policy_version 736530 (0.00085) [2022-07-10 13:08:28,799][26022] Updated weights on worker 0-0, policy_version 736540 (0.00091) [2022-07-10 13:08:30,593][26022] Updated weights on worker 0-0, policy_version 736550 (0.00094) [2022-07-10 13:08:30,842][25689] Fps is (10 sec: 5603.5, 60 sec: 5559.4, 300 sec: 5538.8). Total num frames: 754228224. Throughput: 0: 5837.9. Samples: 754231652. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:30,842][25689] Avg episode reward: [(0, '-5.962')] [2022-07-10 13:08:32,513][26022] Updated weights on worker 0-0, policy_version 736560 (0.00087) [2022-07-10 13:08:33,645][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:08:33,660][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000736567_754244608.pth [2022-07-10 13:08:33,660][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000734618_752248832.pth [2022-07-10 13:08:34,275][26022] Updated weights on worker 0-0, policy_version 736570 (0.00082) [2022-07-10 13:08:35,920][25689] Fps is (10 sec: 5461.6, 60 sec: 5545.6, 300 sec: 5534.7). Total num frames: 754255872. Throughput: 0: 5822.3. Samples: 754265254. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:35,920][25689] Avg episode reward: [(0, '-4.573')] [2022-07-10 13:08:36,113][26022] Updated weights on worker 0-0, policy_version 736580 (0.00087) [2022-07-10 13:08:37,748][26022] Updated weights on worker 0-0, policy_version 736590 (0.00086) [2022-07-10 13:08:39,785][26022] Updated weights on worker 0-0, policy_version 736600 (0.00088) [2022-07-10 13:08:40,943][25689] Fps is (10 sec: 5575.7, 60 sec: 5535.7, 300 sec: 5538.6). Total num frames: 754284544. Throughput: 0: 4976.5. Samples: 754282020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:40,943][25689] Avg episode reward: [(0, '-5.994')] [2022-07-10 13:08:41,636][26022] Updated weights on worker 0-0, policy_version 736610 (0.00087) [2022-07-10 13:08:43,529][26022] Updated weights on worker 0-0, policy_version 736620 (0.00092) [2022-07-10 13:08:45,183][26022] Updated weights on worker 0-0, policy_version 736630 (0.00090) [2022-07-10 13:08:45,958][25689] Fps is (10 sec: 5610.2, 60 sec: 5540.4, 300 sec: 5538.4). Total num frames: 754312192. Throughput: 0: 5793.6. Samples: 754315470. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:45,959][25689] Avg episode reward: [(0, '-5.216')] [2022-07-10 13:08:47,194][26022] Updated weights on worker 0-0, policy_version 736640 (0.00971) [2022-07-10 13:08:48,750][26022] Updated weights on worker 0-0, policy_version 736650 (0.00087) [2022-07-10 13:08:50,893][26022] Updated weights on worker 0-0, policy_version 736660 (0.00094) [2022-07-10 13:08:51,028][25689] Fps is (10 sec: 5482.5, 60 sec: 5542.8, 300 sec: 5535.7). Total num frames: 754339840. Throughput: 0: 5814.0. Samples: 754349018. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:51,029][25689] Avg episode reward: [(0, '-4.766')] [2022-07-10 13:08:52,586][26022] Updated weights on worker 0-0, policy_version 736670 (0.00095) [2022-07-10 13:08:54,698][26022] Updated weights on worker 0-0, policy_version 736680 (0.00094) [2022-07-10 13:08:56,102][25689] Fps is (10 sec: 5653.2, 60 sec: 5557.6, 300 sec: 5542.8). Total num frames: 754369536. Throughput: 0: 4959.4. Samples: 754365348. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:08:56,103][25689] Avg episode reward: [(0, '-4.814')] [2022-07-10 13:08:56,366][26022] Updated weights on worker 0-0, policy_version 736690 (0.00088) [2022-07-10 13:08:58,313][26022] Updated weights on worker 0-0, policy_version 736700 (0.00094) [2022-07-10 13:08:59,902][26022] Updated weights on worker 0-0, policy_version 736710 (0.00085) [2022-07-10 13:09:01,159][25689] Fps is (10 sec: 5458.5, 60 sec: 5518.6, 300 sec: 5538.4). Total num frames: 754395136. Throughput: 0: 5773.5. Samples: 754398738. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:09:01,160][25689] Avg episode reward: [(0, '-1.831')] [2022-07-10 13:09:01,968][26022] Updated weights on worker 0-0, policy_version 736720 (0.00214) [2022-07-10 13:09:04,058][26022] Updated weights on worker 0-0, policy_version 736730 (0.00061) [2022-07-10 13:09:05,967][26022] Updated weights on worker 0-0, policy_version 736740 (0.00087) [2022-07-10 13:09:06,171][25689] Fps is (10 sec: 5186.8, 60 sec: 5537.0, 300 sec: 5535.5). Total num frames: 754421760. Throughput: 0: 5675.9. Samples: 754430194. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:09:06,171][25689] Avg episode reward: [(0, '-3.098')] [2022-07-10 13:09:07,529][26022] Updated weights on worker 0-0, policy_version 736750 (0.00089) [2022-07-10 13:09:09,739][26022] Updated weights on worker 0-0, policy_version 736760 (0.00086) [2022-07-10 13:09:11,179][25689] Fps is (10 sec: 5620.6, 60 sec: 5572.5, 300 sec: 5537.2). Total num frames: 754451456. Throughput: 0: 4858.0. Samples: 754446908. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:09:11,180][25689] Avg episode reward: [(0, '-3.311')] [2022-07-10 13:09:11,200][26022] Updated weights on worker 0-0, policy_version 736770 (0.00088) [2022-07-10 13:09:13,422][26022] Updated weights on worker 0-0, policy_version 736780 (0.00086) [2022-07-10 13:09:15,009][26022] Updated weights on worker 0-0, policy_version 736790 (0.00090) [2022-07-10 13:09:16,223][25689] Fps is (10 sec: 5501.1, 60 sec: 5489.1, 300 sec: 5531.1). Total num frames: 754477056. Throughput: 0: 5722.4. Samples: 754480486. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:09:16,223][25689] Avg episode reward: [(0, '-4.879')] [2022-07-10 13:09:16,958][26022] Updated weights on worker 0-0, policy_version 736800 (0.00091) [2022-07-10 13:09:18,949][26022] Updated weights on worker 0-0, policy_version 736810 (0.00101) [2022-07-10 13:09:20,711][26022] Updated weights on worker 0-0, policy_version 736820 (0.00091) [2022-07-10 13:09:21,237][25689] Fps is (10 sec: 5599.7, 60 sec: 5556.6, 300 sec: 5545.6). Total num frames: 754507776. Throughput: 0: 5733.0. Samples: 754513844. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:09:21,237][25689] Avg episode reward: [(0, '-4.531')] [2022-07-10 13:09:22,582][26022] Updated weights on worker 0-0, policy_version 736830 (0.00088) [2022-07-10 13:09:24,385][26022] Updated weights on worker 0-0, policy_version 736840 (0.00086) [2022-07-10 13:09:26,154][26022] Updated weights on worker 0-0, policy_version 736850 (0.00092) [2022-07-10 13:09:26,248][25689] Fps is (10 sec: 5720.1, 60 sec: 5522.1, 300 sec: 5536.4). Total num frames: 754534400. Throughput: 0: 5007.8. Samples: 754530734. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 13:09:26,248][25689] Avg episode reward: [(0, '-4.608')] [2022-07-10 13:09:28,196][26022] Updated weights on worker 0-0, policy_version 736860 (0.00094) [2022-07-10 13:09:29,811][26022] Updated weights on worker 0-0, policy_version 736870 (0.00092) [2022-07-10 13:09:31,261][25689] Fps is (10 sec: 5413.8, 60 sec: 5525.1, 300 sec: 5537.5). Total num frames: 754562048. Throughput: 0: 5829.6. Samples: 754563980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:09:31,262][25689] Avg episode reward: [(0, '-4.002')] [2022-07-10 13:09:31,786][26022] Updated weights on worker 0-0, policy_version 736880 (0.00066) [2022-07-10 13:09:33,607][26022] Updated weights on worker 0-0, policy_version 736890 (0.00091) [2022-07-10 13:09:35,436][26022] Updated weights on worker 0-0, policy_version 736900 (0.00086) [2022-07-10 13:09:36,322][25689] Fps is (10 sec: 5590.5, 60 sec: 5543.6, 300 sec: 5537.0). Total num frames: 754590720. Throughput: 0: 5823.2. Samples: 754597528. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:09:36,322][25689] Avg episode reward: [(0, '-3.349')] [2022-07-10 13:09:37,231][26022] Updated weights on worker 0-0, policy_version 736910 (0.00097) [2022-07-10 13:09:39,143][26022] Updated weights on worker 0-0, policy_version 736920 (0.00091) [2022-07-10 13:09:41,003][26022] Updated weights on worker 0-0, policy_version 736930 (0.00079) [2022-07-10 13:09:41,327][25689] Fps is (10 sec: 5493.7, 60 sec: 5511.4, 300 sec: 5531.8). Total num frames: 754617344. Throughput: 0: 4988.0. Samples: 754614054. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:09:41,327][25689] Avg episode reward: [(0, '-1.985')] [2022-07-10 13:09:42,639][26022] Updated weights on worker 0-0, policy_version 736940 (0.00086) [2022-07-10 13:09:44,707][26022] Updated weights on worker 0-0, policy_version 736950 (0.00088) [2022-07-10 13:09:46,268][26022] Updated weights on worker 0-0, policy_version 736960 (0.00085) [2022-07-10 13:09:46,330][25689] Fps is (10 sec: 5627.2, 60 sec: 5546.4, 300 sec: 5539.1). Total num frames: 754647040. Throughput: 0: 5820.6. Samples: 754647628. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:09:46,331][25689] Avg episode reward: [(0, '-2.289')] [2022-07-10 13:09:48,392][26022] Updated weights on worker 0-0, policy_version 736970 (0.00094) [2022-07-10 13:09:50,185][26022] Updated weights on worker 0-0, policy_version 736980 (0.00085) [2022-07-10 13:09:51,339][25689] Fps is (10 sec: 5727.5, 60 sec: 5552.1, 300 sec: 5536.7). Total num frames: 754674688. Throughput: 0: 5835.3. Samples: 754681138. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:09:51,340][25689] Avg episode reward: [(0, '-3.050')] [2022-07-10 13:09:52,002][26022] Updated weights on worker 0-0, policy_version 736990 (0.00086) [2022-07-10 13:09:54,025][26022] Updated weights on worker 0-0, policy_version 737000 (0.00085) [2022-07-10 13:09:55,615][26022] Updated weights on worker 0-0, policy_version 737010 (0.00095) [2022-07-10 13:09:56,383][25689] Fps is (10 sec: 5398.9, 60 sec: 5503.9, 300 sec: 5532.8). Total num frames: 754701312. Throughput: 0: 5001.7. Samples: 754697866. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:09:56,383][25689] Avg episode reward: [(0, '-3.381')] [2022-07-10 13:09:57,618][26022] Updated weights on worker 0-0, policy_version 737020 (0.00092) [2022-07-10 13:09:59,217][26022] Updated weights on worker 0-0, policy_version 737030 (0.00079) [2022-07-10 13:10:01,203][26022] Updated weights on worker 0-0, policy_version 737040 (0.00087) [2022-07-10 13:10:01,391][25689] Fps is (10 sec: 5501.0, 60 sec: 5559.3, 300 sec: 5539.9). Total num frames: 754729984. Throughput: 0: 5845.7. Samples: 754731342. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:01,391][25689] Avg episode reward: [(0, '-3.542')] [2022-07-10 13:10:03,524][26022] Updated weights on worker 0-0, policy_version 737050 (0.00086) [2022-07-10 13:10:05,014][26022] Updated weights on worker 0-0, policy_version 737060 (0.00092) [2022-07-10 13:10:06,408][25689] Fps is (10 sec: 5515.8, 60 sec: 5558.8, 300 sec: 5536.2). Total num frames: 754756608. Throughput: 0: 5757.8. Samples: 754763230. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:06,408][25689] Avg episode reward: [(0, '-3.493')] [2022-07-10 13:10:07,071][26022] Updated weights on worker 0-0, policy_version 737070 (0.00091) [2022-07-10 13:10:08,645][26022] Updated weights on worker 0-0, policy_version 737080 (0.01102) [2022-07-10 13:10:10,757][26022] Updated weights on worker 0-0, policy_version 737090 (0.00090) [2022-07-10 13:10:11,426][25689] Fps is (10 sec: 5407.9, 60 sec: 5523.9, 300 sec: 5537.2). Total num frames: 754784256. Throughput: 0: 4920.9. Samples: 754779988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:11,427][25689] Avg episode reward: [(0, '-3.552')] [2022-07-10 13:10:12,448][26022] Updated weights on worker 0-0, policy_version 737100 (0.00088) [2022-07-10 13:10:14,372][26022] Updated weights on worker 0-0, policy_version 737110 (0.00090) [2022-07-10 13:10:16,096][26022] Updated weights on worker 0-0, policy_version 737120 (0.00088) [2022-07-10 13:10:16,474][25689] Fps is (10 sec: 5696.6, 60 sec: 5591.5, 300 sec: 5544.4). Total num frames: 754813952. Throughput: 0: 5751.6. Samples: 754813424. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:16,476][25689] Avg episode reward: [(0, '-1.418')] [2022-07-10 13:10:18,014][26022] Updated weights on worker 0-0, policy_version 737130 (0.00769) [2022-07-10 13:10:19,722][26022] Updated weights on worker 0-0, policy_version 737140 (0.00087) [2022-07-10 13:10:21,501][25689] Fps is (10 sec: 5590.3, 60 sec: 5522.3, 300 sec: 5537.4). Total num frames: 754840576. Throughput: 0: 5744.8. Samples: 754846872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:21,503][25689] Avg episode reward: [(0, '-0.950')] [2022-07-10 13:10:21,563][26022] Updated weights on worker 0-0, policy_version 737150 (0.00087) [2022-07-10 13:10:23,540][26022] Updated weights on worker 0-0, policy_version 737160 (0.00089) [2022-07-10 13:10:25,192][26022] Updated weights on worker 0-0, policy_version 737170 (0.00087) [2022-07-10 13:10:26,529][25689] Fps is (10 sec: 5295.8, 60 sec: 5520.8, 300 sec: 5537.2). Total num frames: 754867200. Throughput: 0: 4992.2. Samples: 754863678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:26,529][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 13:10:27,228][26022] Updated weights on worker 0-0, policy_version 737180 (0.00092) [2022-07-10 13:10:29,008][26022] Updated weights on worker 0-0, policy_version 737190 (0.00090) [2022-07-10 13:10:30,987][26022] Updated weights on worker 0-0, policy_version 737200 (0.00096) [2022-07-10 13:10:31,566][25689] Fps is (10 sec: 5494.0, 60 sec: 5535.7, 300 sec: 5537.8). Total num frames: 754895872. Throughput: 0: 5803.8. Samples: 754896872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:31,568][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 13:10:32,654][26022] Updated weights on worker 0-0, policy_version 737210 (0.00088) [2022-07-10 13:10:33,691][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:10:33,699][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000737215_754908160.pth [2022-07-10 13:10:33,700][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000735268_752914432.pth [2022-07-10 13:10:34,503][26022] Updated weights on worker 0-0, policy_version 737220 (0.00089) [2022-07-10 13:10:36,531][26022] Updated weights on worker 0-0, policy_version 737230 (0.00095) [2022-07-10 13:10:36,704][25689] Fps is (10 sec: 5535.2, 60 sec: 5511.6, 300 sec: 5535.5). Total num frames: 754923520. Throughput: 0: 5781.8. Samples: 754930388. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:36,704][25689] Avg episode reward: [(0, '-1.376')] [2022-07-10 13:10:38,226][26022] Updated weights on worker 0-0, policy_version 737240 (0.00094) [2022-07-10 13:10:40,143][26022] Updated weights on worker 0-0, policy_version 737250 (0.00089) [2022-07-10 13:10:41,741][25689] Fps is (10 sec: 5534.7, 60 sec: 5542.5, 300 sec: 5538.4). Total num frames: 754952192. Throughput: 0: 4954.1. Samples: 754947146. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:41,742][25689] Avg episode reward: [(0, '-1.621')] [2022-07-10 13:10:42,022][26022] Updated weights on worker 0-0, policy_version 737260 (0.00086) [2022-07-10 13:10:43,740][26022] Updated weights on worker 0-0, policy_version 737270 (0.00055) [2022-07-10 13:10:45,666][26022] Updated weights on worker 0-0, policy_version 737280 (0.00086) [2022-07-10 13:10:46,753][25689] Fps is (10 sec: 5808.5, 60 sec: 5541.8, 300 sec: 5545.2). Total num frames: 754981888. Throughput: 0: 5782.7. Samples: 754980626. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:46,753][25689] Avg episode reward: [(0, '-2.611')] [2022-07-10 13:10:47,465][26022] Updated weights on worker 0-0, policy_version 737290 (0.00088) [2022-07-10 13:10:49,335][26022] Updated weights on worker 0-0, policy_version 737300 (0.00092) [2022-07-10 13:10:51,223][26022] Updated weights on worker 0-0, policy_version 737310 (0.00086) [2022-07-10 13:10:51,782][25689] Fps is (10 sec: 5711.5, 60 sec: 5539.9, 300 sec: 5540.4). Total num frames: 755009536. Throughput: 0: 5802.1. Samples: 755014166. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:51,782][25689] Avg episode reward: [(0, '-3.508')] [2022-07-10 13:10:53,029][26022] Updated weights on worker 0-0, policy_version 737320 (0.00094) [2022-07-10 13:10:54,756][26022] Updated weights on worker 0-0, policy_version 737330 (0.00083) [2022-07-10 13:10:56,627][26022] Updated weights on worker 0-0, policy_version 737340 (0.00088) [2022-07-10 13:10:56,870][25689] Fps is (10 sec: 5465.5, 60 sec: 5552.8, 300 sec: 5546.0). Total num frames: 755037184. Throughput: 0: 4980.9. Samples: 755030832. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:10:56,870][25689] Avg episode reward: [(0, '-5.146')] [2022-07-10 13:10:58,413][26022] Updated weights on worker 0-0, policy_version 737350 (0.00083) [2022-07-10 13:11:00,508][26022] Updated weights on worker 0-0, policy_version 737360 (0.00090) [2022-07-10 13:11:01,891][25689] Fps is (10 sec: 5368.7, 60 sec: 5517.8, 300 sec: 5538.8). Total num frames: 755063808. Throughput: 0: 5790.6. Samples: 755063822. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:01,891][25689] Avg episode reward: [(0, '-4.946')] [2022-07-10 13:11:02,513][26022] Updated weights on worker 0-0, policy_version 737370 (0.00083) [2022-07-10 13:11:04,459][26022] Updated weights on worker 0-0, policy_version 737380 (0.00103) [2022-07-10 13:11:06,232][26022] Updated weights on worker 0-0, policy_version 737390 (0.00087) [2022-07-10 13:11:06,918][25689] Fps is (10 sec: 5197.4, 60 sec: 5499.9, 300 sec: 5536.7). Total num frames: 755089408. Throughput: 0: 5689.4. Samples: 755095356. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:06,918][25689] Avg episode reward: [(0, '-4.063')] [2022-07-10 13:11:08,116][26022] Updated weights on worker 0-0, policy_version 737400 (0.00086) [2022-07-10 13:11:10,054][26022] Updated weights on worker 0-0, policy_version 737410 (0.00085) [2022-07-10 13:11:11,923][25689] Fps is (10 sec: 5409.6, 60 sec: 5518.0, 300 sec: 5535.5). Total num frames: 755118080. Throughput: 0: 4864.8. Samples: 755112150. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:11,924][25689] Avg episode reward: [(0, '-3.782')] [2022-07-10 13:11:11,927][26022] Updated weights on worker 0-0, policy_version 737420 (0.00094) [2022-07-10 13:11:13,633][26022] Updated weights on worker 0-0, policy_version 737430 (0.00087) [2022-07-10 13:11:15,549][26022] Updated weights on worker 0-0, policy_version 737440 (0.00087) [2022-07-10 13:11:16,968][25689] Fps is (10 sec: 5706.0, 60 sec: 5501.4, 300 sec: 5539.1). Total num frames: 755146752. Throughput: 0: 5716.3. Samples: 755145718. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:16,968][25689] Avg episode reward: [(0, '-3.820')] [2022-07-10 13:11:17,267][26022] Updated weights on worker 0-0, policy_version 737450 (0.00051) [2022-07-10 13:11:19,163][26022] Updated weights on worker 0-0, policy_version 737460 (0.00083) [2022-07-10 13:11:20,958][26022] Updated weights on worker 0-0, policy_version 737470 (0.00088) [2022-07-10 13:11:21,976][25689] Fps is (10 sec: 5602.4, 60 sec: 5520.0, 300 sec: 5536.2). Total num frames: 755174400. Throughput: 0: 5742.4. Samples: 755179160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:21,977][25689] Avg episode reward: [(0, '-4.396')] [2022-07-10 13:11:22,698][26022] Updated weights on worker 0-0, policy_version 737480 (0.00092) [2022-07-10 13:11:24,747][26022] Updated weights on worker 0-0, policy_version 737490 (0.00080) [2022-07-10 13:11:26,543][26022] Updated weights on worker 0-0, policy_version 737500 (0.00082) [2022-07-10 13:11:27,018][25689] Fps is (10 sec: 5604.0, 60 sec: 5552.6, 300 sec: 5540.7). Total num frames: 755203072. Throughput: 0: 5005.5. Samples: 755195966. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:27,018][25689] Avg episode reward: [(0, '-2.908')] [2022-07-10 13:11:28,368][26022] Updated weights on worker 0-0, policy_version 737510 (0.00096) [2022-07-10 13:11:30,270][26022] Updated weights on worker 0-0, policy_version 737520 (0.00092) [2022-07-10 13:11:32,019][26022] Updated weights on worker 0-0, policy_version 737530 (0.00086) [2022-07-10 13:11:32,027][25689] Fps is (10 sec: 5603.4, 60 sec: 5538.2, 300 sec: 5538.7). Total num frames: 755230720. Throughput: 0: 5812.4. Samples: 755229002. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:32,028][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 13:11:33,891][26022] Updated weights on worker 0-0, policy_version 737540 (0.00096) [2022-07-10 13:11:35,782][26022] Updated weights on worker 0-0, policy_version 737550 (0.00090) [2022-07-10 13:11:37,147][25689] Fps is (10 sec: 5458.9, 60 sec: 5539.9, 300 sec: 5540.3). Total num frames: 755258368. Throughput: 0: 5788.3. Samples: 755262524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:37,148][25689] Avg episode reward: [(0, '-2.277')] [2022-07-10 13:11:37,459][26022] Updated weights on worker 0-0, policy_version 737560 (0.00089) [2022-07-10 13:11:39,500][26022] Updated weights on worker 0-0, policy_version 737570 (0.00091) [2022-07-10 13:11:41,295][26022] Updated weights on worker 0-0, policy_version 737580 (0.00086) [2022-07-10 13:11:42,166][25689] Fps is (10 sec: 5655.9, 60 sec: 5558.6, 300 sec: 5540.1). Total num frames: 755288064. Throughput: 0: 5793.2. Samples: 755296124. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:42,166][25689] Avg episode reward: [(0, '-0.752')] [2022-07-10 13:11:43,047][26022] Updated weights on worker 0-0, policy_version 737590 (0.00085) [2022-07-10 13:11:44,883][26022] Updated weights on worker 0-0, policy_version 737600 (0.00090) [2022-07-10 13:11:46,858][26022] Updated weights on worker 0-0, policy_version 737610 (0.00088) [2022-07-10 13:11:47,204][25689] Fps is (10 sec: 5498.4, 60 sec: 5488.3, 300 sec: 5536.1). Total num frames: 755313664. Throughput: 0: 5805.5. Samples: 755313158. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:47,205][25689] Avg episode reward: [(0, '-0.717')] [2022-07-10 13:11:48,405][26022] Updated weights on worker 0-0, policy_version 737620 (0.00088) [2022-07-10 13:11:50,500][26022] Updated weights on worker 0-0, policy_version 737630 (0.00093) [2022-07-10 13:11:52,088][26022] Updated weights on worker 0-0, policy_version 737640 (0.00086) [2022-07-10 13:11:52,259][25689] Fps is (10 sec: 5580.2, 60 sec: 5536.8, 300 sec: 5543.8). Total num frames: 755344384. Throughput: 0: 5827.9. Samples: 755346910. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:52,259][25689] Avg episode reward: [(0, '0.127')] [2022-07-10 13:11:53,939][26022] Updated weights on worker 0-0, policy_version 737650 (0.00095) [2022-07-10 13:11:55,697][26022] Updated weights on worker 0-0, policy_version 737660 (0.00081) [2022-07-10 13:11:57,383][25689] Fps is (10 sec: 5734.1, 60 sec: 5533.5, 300 sec: 5538.2). Total num frames: 755372032. Throughput: 0: 5836.7. Samples: 755380634. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:11:57,384][25689] Avg episode reward: [(0, '-0.651')] [2022-07-10 13:11:57,497][26022] Updated weights on worker 0-0, policy_version 737670 (0.00086) [2022-07-10 13:11:59,373][26022] Updated weights on worker 0-0, policy_version 737680 (0.00091) [2022-07-10 13:12:01,360][26022] Updated weights on worker 0-0, policy_version 737690 (0.00088) [2022-07-10 13:12:02,443][25689] Fps is (10 sec: 5228.5, 60 sec: 5513.0, 300 sec: 5540.7). Total num frames: 755397632. Throughput: 0: 5006.2. Samples: 755397636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:02,444][25689] Avg episode reward: [(0, '-2.529')] [2022-07-10 13:12:03,310][26022] Updated weights on worker 0-0, policy_version 737700 (0.00089) [2022-07-10 13:12:05,503][26022] Updated weights on worker 0-0, policy_version 737710 (0.00086) [2022-07-10 13:12:07,109][26022] Updated weights on worker 0-0, policy_version 737720 (0.00095) [2022-07-10 13:12:07,470][25689] Fps is (10 sec: 5380.5, 60 sec: 5563.8, 300 sec: 5540.4). Total num frames: 755426304. Throughput: 0: 5710.8. Samples: 755428894. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:07,471][25689] Avg episode reward: [(0, '-3.170')] [2022-07-10 13:12:08,978][26022] Updated weights on worker 0-0, policy_version 737730 (0.00086) [2022-07-10 13:12:10,731][26022] Updated weights on worker 0-0, policy_version 737740 (0.00080) [2022-07-10 13:12:12,520][25689] Fps is (10 sec: 5792.0, 60 sec: 5576.5, 300 sec: 5541.2). Total num frames: 755456000. Throughput: 0: 5706.4. Samples: 755462530. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:12,521][25689] Avg episode reward: [(0, '-3.068')] [2022-07-10 13:12:12,536][26022] Updated weights on worker 0-0, policy_version 737750 (0.00167) [2022-07-10 13:12:14,436][26022] Updated weights on worker 0-0, policy_version 737760 (0.00086) [2022-07-10 13:12:16,287][26022] Updated weights on worker 0-0, policy_version 737770 (0.00088) [2022-07-10 13:12:17,589][25689] Fps is (10 sec: 5565.5, 60 sec: 5540.5, 300 sec: 5536.7). Total num frames: 755482624. Throughput: 0: 4886.7. Samples: 755479376. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:17,592][25689] Avg episode reward: [(0, '-3.200')] [2022-07-10 13:12:18,226][26022] Updated weights on worker 0-0, policy_version 737780 (0.00089) [2022-07-10 13:12:20,067][26022] Updated weights on worker 0-0, policy_version 737790 (0.00094) [2022-07-10 13:12:21,620][26022] Updated weights on worker 0-0, policy_version 737800 (0.00085) [2022-07-10 13:12:22,600][25689] Fps is (10 sec: 5485.9, 60 sec: 5557.1, 300 sec: 5540.6). Total num frames: 755511296. Throughput: 0: 5731.8. Samples: 755513172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:22,600][25689] Avg episode reward: [(0, '-3.615')] [2022-07-10 13:12:23,483][26022] Updated weights on worker 0-0, policy_version 737810 (0.00083) [2022-07-10 13:12:25,302][26022] Updated weights on worker 0-0, policy_version 737820 (0.00091) [2022-07-10 13:12:27,327][26022] Updated weights on worker 0-0, policy_version 737830 (0.00087) [2022-07-10 13:12:27,620][25689] Fps is (10 sec: 5614.7, 60 sec: 5542.2, 300 sec: 5540.5). Total num frames: 755538944. Throughput: 0: 5857.1. Samples: 755546916. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:27,621][25689] Avg episode reward: [(0, '-2.599')] [2022-07-10 13:12:29,036][26022] Updated weights on worker 0-0, policy_version 737840 (0.00092) [2022-07-10 13:12:31,053][26022] Updated weights on worker 0-0, policy_version 737850 (0.00089) [2022-07-10 13:12:32,627][25689] Fps is (10 sec: 5617.0, 60 sec: 5559.4, 300 sec: 5542.4). Total num frames: 755567616. Throughput: 0: 5035.4. Samples: 755563774. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:32,628][25689] Avg episode reward: [(0, '-2.116')] [2022-07-10 13:12:32,651][26022] Updated weights on worker 0-0, policy_version 737860 (0.00114) [2022-07-10 13:12:33,775][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:12:33,789][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000737865_755573760.pth [2022-07-10 13:12:33,790][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000735915_753576960.pth [2022-07-10 13:12:34,588][26022] Updated weights on worker 0-0, policy_version 737870 (0.00089) [2022-07-10 13:12:36,112][26022] Updated weights on worker 0-0, policy_version 737880 (0.00085) [2022-07-10 13:12:37,707][25689] Fps is (10 sec: 5685.1, 60 sec: 5579.9, 300 sec: 5539.4). Total num frames: 755596288. Throughput: 0: 5875.3. Samples: 755597572. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:37,707][25689] Avg episode reward: [(0, '-2.772')] [2022-07-10 13:12:38,208][26022] Updated weights on worker 0-0, policy_version 737890 (0.00089) [2022-07-10 13:12:39,912][26022] Updated weights on worker 0-0, policy_version 737900 (0.00078) [2022-07-10 13:12:41,975][26022] Updated weights on worker 0-0, policy_version 737910 (0.00098) [2022-07-10 13:12:42,728][25689] Fps is (10 sec: 5677.0, 60 sec: 5562.8, 300 sec: 5543.7). Total num frames: 755624960. Throughput: 0: 5861.9. Samples: 755631158. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:42,728][25689] Avg episode reward: [(0, '-4.999')] [2022-07-10 13:12:43,757][26022] Updated weights on worker 0-0, policy_version 737920 (0.00086) [2022-07-10 13:12:45,512][26022] Updated weights on worker 0-0, policy_version 737930 (0.00086) [2022-07-10 13:12:47,314][26022] Updated weights on worker 0-0, policy_version 737940 (0.00092) [2022-07-10 13:12:47,735][25689] Fps is (10 sec: 5514.0, 60 sec: 5582.6, 300 sec: 5541.9). Total num frames: 755651584. Throughput: 0: 5025.6. Samples: 755648006. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 13:12:47,736][25689] Avg episode reward: [(0, '-6.227')] [2022-07-10 13:12:49,251][26022] Updated weights on worker 0-0, policy_version 737950 (0.00088) [2022-07-10 13:12:51,009][26022] Updated weights on worker 0-0, policy_version 737960 (0.00366) [2022-07-10 13:12:52,756][25689] Fps is (10 sec: 5412.0, 60 sec: 5535.0, 300 sec: 5539.0). Total num frames: 755679232. Throughput: 0: 5860.9. Samples: 755681748. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:12:52,757][25689] Avg episode reward: [(0, '-6.634')] [2022-07-10 13:12:52,880][26022] Updated weights on worker 0-0, policy_version 737970 (0.00577) [2022-07-10 13:12:54,546][26022] Updated weights on worker 0-0, policy_version 737980 (0.00087) [2022-07-10 13:12:56,496][26022] Updated weights on worker 0-0, policy_version 737990 (0.00084) [2022-07-10 13:12:57,859][25689] Fps is (10 sec: 5765.7, 60 sec: 5587.7, 300 sec: 5547.4). Total num frames: 755709952. Throughput: 0: 5852.8. Samples: 755715516. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:12:57,859][25689] Avg episode reward: [(0, '-7.336')] [2022-07-10 13:12:58,388][26022] Updated weights on worker 0-0, policy_version 738000 (0.00116) [2022-07-10 13:13:00,147][26022] Updated weights on worker 0-0, policy_version 738010 (0.00092) [2022-07-10 13:13:02,284][26022] Updated weights on worker 0-0, policy_version 738020 (0.00088) [2022-07-10 13:13:02,890][25689] Fps is (10 sec: 5557.3, 60 sec: 5590.3, 300 sec: 5547.3). Total num frames: 755735552. Throughput: 0: 5019.8. Samples: 755732370. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:02,891][25689] Avg episode reward: [(0, '-6.063')] [2022-07-10 13:13:04,186][26022] Updated weights on worker 0-0, policy_version 738030 (0.00095) [2022-07-10 13:13:05,918][26022] Updated weights on worker 0-0, policy_version 738040 (0.00085) [2022-07-10 13:13:07,836][26022] Updated weights on worker 0-0, policy_version 738050 (0.00085) [2022-07-10 13:13:07,908][25689] Fps is (10 sec: 5298.5, 60 sec: 5574.2, 300 sec: 5547.5). Total num frames: 755763200. Throughput: 0: 5743.9. Samples: 755763878. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:07,908][25689] Avg episode reward: [(0, '-5.243')] [2022-07-10 13:13:09,560][26022] Updated weights on worker 0-0, policy_version 738060 (0.00094) [2022-07-10 13:13:11,509][26022] Updated weights on worker 0-0, policy_version 738070 (0.00090) [2022-07-10 13:13:12,916][25689] Fps is (10 sec: 5515.5, 60 sec: 5544.3, 300 sec: 5538.1). Total num frames: 755790848. Throughput: 0: 5716.1. Samples: 755796984. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:12,916][25689] Avg episode reward: [(0, '-4.508')] [2022-07-10 13:13:13,394][26022] Updated weights on worker 0-0, policy_version 738080 (0.00089) [2022-07-10 13:13:15,147][26022] Updated weights on worker 0-0, policy_version 738090 (0.00093) [2022-07-10 13:13:17,073][26022] Updated weights on worker 0-0, policy_version 738100 (0.00094) [2022-07-10 13:13:18,018][25689] Fps is (10 sec: 5469.6, 60 sec: 5558.2, 300 sec: 5539.8). Total num frames: 755818496. Throughput: 0: 4874.1. Samples: 755813774. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:18,018][25689] Avg episode reward: [(0, '-4.337')] [2022-07-10 13:13:18,755][26022] Updated weights on worker 0-0, policy_version 738110 (0.00087) [2022-07-10 13:13:20,825][26022] Updated weights on worker 0-0, policy_version 738120 (0.00091) [2022-07-10 13:13:22,485][26022] Updated weights on worker 0-0, policy_version 738130 (0.00093) [2022-07-10 13:13:23,027][25689] Fps is (10 sec: 5570.1, 60 sec: 5558.3, 300 sec: 5539.8). Total num frames: 755847168. Throughput: 0: 5714.5. Samples: 755847442. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:23,027][25689] Avg episode reward: [(0, '-2.152')] [2022-07-10 13:13:24,524][26022] Updated weights on worker 0-0, policy_version 738140 (0.00090) [2022-07-10 13:13:26,204][26022] Updated weights on worker 0-0, policy_version 738150 (0.00092) [2022-07-10 13:13:28,039][25689] Fps is (10 sec: 5517.7, 60 sec: 5542.1, 300 sec: 5536.9). Total num frames: 755873792. Throughput: 0: 5790.8. Samples: 755880454. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:28,040][25689] Avg episode reward: [(0, '-1.191')] [2022-07-10 13:13:28,299][26022] Updated weights on worker 0-0, policy_version 738160 (0.00086) [2022-07-10 13:13:30,019][26022] Updated weights on worker 0-0, policy_version 738170 (0.00084) [2022-07-10 13:13:31,976][26022] Updated weights on worker 0-0, policy_version 738180 (0.00094) [2022-07-10 13:13:33,125][25689] Fps is (10 sec: 5577.4, 60 sec: 5551.8, 300 sec: 5543.7). Total num frames: 755903488. Throughput: 0: 4948.8. Samples: 755896994. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:33,125][25689] Avg episode reward: [(0, '-2.909')] [2022-07-10 13:13:33,775][26022] Updated weights on worker 0-0, policy_version 738190 (0.00089) [2022-07-10 13:13:35,827][26022] Updated weights on worker 0-0, policy_version 738200 (0.00090) [2022-07-10 13:13:37,461][26022] Updated weights on worker 0-0, policy_version 738210 (0.00093) [2022-07-10 13:13:38,255][25689] Fps is (10 sec: 5613.6, 60 sec: 5530.3, 300 sec: 5538.3). Total num frames: 755931136. Throughput: 0: 5755.5. Samples: 755930248. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:38,255][25689] Avg episode reward: [(0, '-2.861')] [2022-07-10 13:13:39,652][26022] Updated weights on worker 0-0, policy_version 738220 (0.00090) [2022-07-10 13:13:40,905][26022] Updated weights on worker 0-0, policy_version 738230 (0.00090) [2022-07-10 13:13:43,279][25689] Fps is (10 sec: 5243.7, 60 sec: 5479.2, 300 sec: 5531.2). Total num frames: 755956736. Throughput: 0: 5720.8. Samples: 755963302. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:43,280][25689] Avg episode reward: [(0, '-1.736')] [2022-07-10 13:13:43,305][26022] Updated weights on worker 0-0, policy_version 738240 (0.00089) [2022-07-10 13:13:44,771][26022] Updated weights on worker 0-0, policy_version 738250 (0.00093) [2022-07-10 13:13:46,755][26022] Updated weights on worker 0-0, policy_version 738260 (0.00097) [2022-07-10 13:13:48,350][25689] Fps is (10 sec: 5477.5, 60 sec: 5524.2, 300 sec: 5538.1). Total num frames: 755986432. Throughput: 0: 5727.3. Samples: 755996776. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:48,350][25689] Avg episode reward: [(0, '-2.024')] [2022-07-10 13:13:48,867][26022] Updated weights on worker 0-0, policy_version 738270 (0.00096) [2022-07-10 13:13:50,223][26022] Updated weights on worker 0-0, policy_version 738280 (0.00088) [2022-07-10 13:13:52,459][26022] Updated weights on worker 0-0, policy_version 738290 (0.00082) [2022-07-10 13:13:53,413][25689] Fps is (10 sec: 5860.8, 60 sec: 5554.1, 300 sec: 5538.3). Total num frames: 756016128. Throughput: 0: 5725.7. Samples: 756013158. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:53,414][25689] Avg episode reward: [(0, '-2.348')] [2022-07-10 13:13:54,082][26022] Updated weights on worker 0-0, policy_version 738300 (0.00109) [2022-07-10 13:13:55,980][26022] Updated weights on worker 0-0, policy_version 738310 (0.00350) [2022-07-10 13:13:57,975][26022] Updated weights on worker 0-0, policy_version 738320 (0.00090) [2022-07-10 13:13:58,472][25689] Fps is (10 sec: 5665.1, 60 sec: 5507.4, 300 sec: 5545.1). Total num frames: 756043776. Throughput: 0: 5772.1. Samples: 756046944. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:13:58,473][25689] Avg episode reward: [(0, '-3.236')] [2022-07-10 13:13:59,391][26022] Updated weights on worker 0-0, policy_version 738330 (0.00088) [2022-07-10 13:14:01,471][26022] Updated weights on worker 0-0, policy_version 738340 (0.00092) [2022-07-10 13:14:03,486][25689] Fps is (10 sec: 5286.1, 60 sec: 5509.0, 300 sec: 5541.6). Total num frames: 756069376. Throughput: 0: 5693.5. Samples: 756078350. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:03,487][25689] Avg episode reward: [(0, '-1.242')] [2022-07-10 13:14:03,535][26022] Updated weights on worker 0-0, policy_version 738350 (0.00086) [2022-07-10 13:14:05,577][26022] Updated weights on worker 0-0, policy_version 738360 (0.00085) [2022-07-10 13:14:07,276][26022] Updated weights on worker 0-0, policy_version 738370 (0.00086) [2022-07-10 13:14:08,509][25689] Fps is (10 sec: 5203.3, 60 sec: 5491.7, 300 sec: 5531.0). Total num frames: 756096000. Throughput: 0: 4877.5. Samples: 756095102. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:08,510][25689] Avg episode reward: [(0, '-2.346')] [2022-07-10 13:14:09,157][26022] Updated weights on worker 0-0, policy_version 738380 (0.00183) [2022-07-10 13:14:10,939][26022] Updated weights on worker 0-0, policy_version 738390 (0.00096) [2022-07-10 13:14:12,914][26022] Updated weights on worker 0-0, policy_version 738400 (0.00088) [2022-07-10 13:14:13,564][25689] Fps is (10 sec: 5486.9, 60 sec: 5504.2, 300 sec: 5541.2). Total num frames: 756124672. Throughput: 0: 5716.0. Samples: 756128342. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:13,565][25689] Avg episode reward: [(0, '-2.910')] [2022-07-10 13:14:14,671][26022] Updated weights on worker 0-0, policy_version 738410 (0.00088) [2022-07-10 13:14:16,593][26022] Updated weights on worker 0-0, policy_version 738420 (0.00083) [2022-07-10 13:14:18,193][26022] Updated weights on worker 0-0, policy_version 738430 (0.00081) [2022-07-10 13:14:18,636][25689] Fps is (10 sec: 5662.3, 60 sec: 5523.9, 300 sec: 5533.2). Total num frames: 756153344. Throughput: 0: 5701.9. Samples: 756161918. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:18,637][25689] Avg episode reward: [(0, '-2.274')] [2022-07-10 13:14:20,105][26022] Updated weights on worker 0-0, policy_version 738440 (0.00091) [2022-07-10 13:14:22,016][26022] Updated weights on worker 0-0, policy_version 738450 (0.00084) [2022-07-10 13:14:23,650][25689] Fps is (10 sec: 5685.5, 60 sec: 5523.4, 300 sec: 5540.0). Total num frames: 756182016. Throughput: 0: 4981.0. Samples: 756178786. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:23,651][25689] Avg episode reward: [(0, '-1.905')] [2022-07-10 13:14:23,808][26022] Updated weights on worker 0-0, policy_version 738460 (0.00081) [2022-07-10 13:14:25,604][26022] Updated weights on worker 0-0, policy_version 738470 (0.00090) [2022-07-10 13:14:27,611][26022] Updated weights on worker 0-0, policy_version 738480 (0.00089) [2022-07-10 13:14:28,737][25689] Fps is (10 sec: 5676.9, 60 sec: 5550.4, 300 sec: 5542.1). Total num frames: 756210688. Throughput: 0: 5798.7. Samples: 756212402. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:28,738][25689] Avg episode reward: [(0, '-1.930')] [2022-07-10 13:14:29,150][26022] Updated weights on worker 0-0, policy_version 738490 (0.00085) [2022-07-10 13:14:31,344][26022] Updated weights on worker 0-0, policy_version 738500 (0.00082) [2022-07-10 13:14:32,836][26022] Updated weights on worker 0-0, policy_version 738510 (0.00095) [2022-07-10 13:14:33,835][25689] Fps is (10 sec: 5429.6, 60 sec: 5498.7, 300 sec: 5534.5). Total num frames: 756237312. Throughput: 0: 5785.4. Samples: 756245614. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:33,835][25689] Avg episode reward: [(0, '-0.701')] [2022-07-10 13:14:33,898][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:14:33,911][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000738514_756238336.pth [2022-07-10 13:14:33,911][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000736567_754244608.pth [2022-07-10 13:14:34,966][26022] Updated weights on worker 0-0, policy_version 738520 (0.00086) [2022-07-10 13:14:36,799][26022] Updated weights on worker 0-0, policy_version 738530 (0.00081) [2022-07-10 13:14:38,399][26022] Updated weights on worker 0-0, policy_version 738540 (0.00089) [2022-07-10 13:14:38,938][25689] Fps is (10 sec: 5521.4, 60 sec: 5534.9, 300 sec: 5543.0). Total num frames: 756267008. Throughput: 0: 4943.0. Samples: 756262268. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:38,938][25689] Avg episode reward: [(0, '-0.749')] [2022-07-10 13:14:40,575][26022] Updated weights on worker 0-0, policy_version 738550 (0.00084) [2022-07-10 13:14:42,108][26022] Updated weights on worker 0-0, policy_version 738560 (0.00093) [2022-07-10 13:14:43,940][25689] Fps is (10 sec: 5775.7, 60 sec: 5587.5, 300 sec: 5539.6). Total num frames: 756295680. Throughput: 0: 5780.9. Samples: 756296082. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:43,941][25689] Avg episode reward: [(0, '-0.420')] [2022-07-10 13:14:43,945][26022] Updated weights on worker 0-0, policy_version 738570 (0.00085) [2022-07-10 13:14:45,919][26022] Updated weights on worker 0-0, policy_version 738580 (0.00087) [2022-07-10 13:14:47,699][26022] Updated weights on worker 0-0, policy_version 738590 (0.00086) [2022-07-10 13:14:48,980][25689] Fps is (10 sec: 5608.1, 60 sec: 5556.5, 300 sec: 5539.0). Total num frames: 756323328. Throughput: 0: 5789.1. Samples: 756329590. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:48,981][25689] Avg episode reward: [(0, '0.069')] [2022-07-10 13:14:49,675][26022] Updated weights on worker 0-0, policy_version 738600 (0.00086) [2022-07-10 13:14:51,470][26022] Updated weights on worker 0-0, policy_version 738610 (0.00089) [2022-07-10 13:14:53,111][26022] Updated weights on worker 0-0, policy_version 738620 (0.00141) [2022-07-10 13:14:54,029][25689] Fps is (10 sec: 5481.1, 60 sec: 5524.1, 300 sec: 5542.4). Total num frames: 756350976. Throughput: 0: 5002.4. Samples: 756346634. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:54,030][25689] Avg episode reward: [(0, '-0.290')] [2022-07-10 13:14:55,007][26022] Updated weights on worker 0-0, policy_version 738630 (0.00083) [2022-07-10 13:14:56,846][26022] Updated weights on worker 0-0, policy_version 738640 (0.00087) [2022-07-10 13:14:58,778][26022] Updated weights on worker 0-0, policy_version 738650 (0.00093) [2022-07-10 13:14:59,116][25689] Fps is (10 sec: 5556.4, 60 sec: 5538.4, 300 sec: 5540.9). Total num frames: 756379648. Throughput: 0: 5830.4. Samples: 756379918. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:14:59,117][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 13:15:00,491][26022] Updated weights on worker 0-0, policy_version 738660 (0.00083) [2022-07-10 13:15:02,822][26022] Updated weights on worker 0-0, policy_version 738670 (0.00087) [2022-07-10 13:15:04,129][25689] Fps is (10 sec: 5373.2, 60 sec: 5538.5, 300 sec: 5537.5). Total num frames: 756405248. Throughput: 0: 5691.5. Samples: 756410988. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:04,130][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 13:15:04,702][26022] Updated weights on worker 0-0, policy_version 738680 (0.00086) [2022-07-10 13:15:06,620][26022] Updated weights on worker 0-0, policy_version 738690 (0.00112) [2022-07-10 13:15:08,450][26022] Updated weights on worker 0-0, policy_version 738700 (0.00052) [2022-07-10 13:15:09,133][25689] Fps is (10 sec: 5213.6, 60 sec: 5540.2, 300 sec: 5534.4). Total num frames: 756431872. Throughput: 0: 4855.8. Samples: 756427448. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:09,142][25689] Avg episode reward: [(0, '-0.081')] [2022-07-10 13:15:10,185][26022] Updated weights on worker 0-0, policy_version 738710 (0.00084) [2022-07-10 13:15:12,035][26022] Updated weights on worker 0-0, policy_version 738720 (0.00084) [2022-07-10 13:15:13,953][26022] Updated weights on worker 0-0, policy_version 738730 (0.00086) [2022-07-10 13:15:14,228][25689] Fps is (10 sec: 5475.3, 60 sec: 5536.6, 300 sec: 5530.0). Total num frames: 756460544. Throughput: 0: 5655.2. Samples: 756460866. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:14,229][25689] Avg episode reward: [(0, '-0.973')] [2022-07-10 13:15:15,859][26022] Updated weights on worker 0-0, policy_version 738740 (0.00088) [2022-07-10 13:15:17,594][26022] Updated weights on worker 0-0, policy_version 738750 (0.00084) [2022-07-10 13:15:19,361][25689] Fps is (10 sec: 5606.8, 60 sec: 5531.0, 300 sec: 5534.9). Total num frames: 756489216. Throughput: 0: 5654.5. Samples: 756494390. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:19,361][25689] Avg episode reward: [(0, '-1.205')] [2022-07-10 13:15:19,399][26022] Updated weights on worker 0-0, policy_version 738760 (0.00084) [2022-07-10 13:15:21,169][26022] Updated weights on worker 0-0, policy_version 738770 (0.00085) [2022-07-10 13:15:23,024][26022] Updated weights on worker 0-0, policy_version 738780 (0.00089) [2022-07-10 13:15:24,400][25689] Fps is (10 sec: 5537.1, 60 sec: 5511.9, 300 sec: 5538.2). Total num frames: 756516864. Throughput: 0: 4947.6. Samples: 756511278. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:24,401][25689] Avg episode reward: [(0, '-1.262')] [2022-07-10 13:15:25,048][26022] Updated weights on worker 0-0, policy_version 738790 (0.00089) [2022-07-10 13:15:26,503][26022] Updated weights on worker 0-0, policy_version 738800 (0.00080) [2022-07-10 13:15:28,691][26022] Updated weights on worker 0-0, policy_version 738810 (0.00089) [2022-07-10 13:15:29,427][25689] Fps is (10 sec: 5696.9, 60 sec: 5534.3, 300 sec: 5541.8). Total num frames: 756546560. Throughput: 0: 5805.7. Samples: 756545266. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:29,427][25689] Avg episode reward: [(0, '-1.083')] [2022-07-10 13:15:30,155][26022] Updated weights on worker 0-0, policy_version 738820 (0.00084) [2022-07-10 13:15:32,273][26022] Updated weights on worker 0-0, policy_version 738830 (0.00092) [2022-07-10 13:15:34,068][26022] Updated weights on worker 0-0, policy_version 738840 (0.00091) [2022-07-10 13:15:34,435][25689] Fps is (10 sec: 5612.5, 60 sec: 5542.4, 300 sec: 5540.8). Total num frames: 756573184. Throughput: 0: 5834.6. Samples: 756578760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:34,435][25689] Avg episode reward: [(0, '-0.500')] [2022-07-10 13:15:35,936][26022] Updated weights on worker 0-0, policy_version 738850 (0.00084) [2022-07-10 13:15:37,673][26022] Updated weights on worker 0-0, policy_version 738860 (0.00094) [2022-07-10 13:15:39,475][25689] Fps is (10 sec: 5502.9, 60 sec: 5531.2, 300 sec: 5540.7). Total num frames: 756601856. Throughput: 0: 5018.6. Samples: 756595336. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:39,476][25689] Avg episode reward: [(0, '-0.354')] [2022-07-10 13:15:39,625][26022] Updated weights on worker 0-0, policy_version 738870 (0.00091) [2022-07-10 13:15:41,319][26022] Updated weights on worker 0-0, policy_version 738880 (0.00092) [2022-07-10 13:15:43,302][26022] Updated weights on worker 0-0, policy_version 738890 (0.00089) [2022-07-10 13:15:44,575][25689] Fps is (10 sec: 5756.2, 60 sec: 5539.3, 300 sec: 5539.1). Total num frames: 756631552. Throughput: 0: 5847.0. Samples: 756629240. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:44,575][25689] Avg episode reward: [(0, '-0.949')] [2022-07-10 13:15:44,781][26022] Updated weights on worker 0-0, policy_version 738900 (0.00089) [2022-07-10 13:15:47,000][26022] Updated weights on worker 0-0, policy_version 738910 (0.00090) [2022-07-10 13:15:48,583][26022] Updated weights on worker 0-0, policy_version 738920 (0.00088) [2022-07-10 13:15:49,587][25689] Fps is (10 sec: 5569.6, 60 sec: 5524.9, 300 sec: 5536.0). Total num frames: 756658176. Throughput: 0: 5832.6. Samples: 756662854. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:49,588][25689] Avg episode reward: [(0, '-1.117')] [2022-07-10 13:15:50,525][26022] Updated weights on worker 0-0, policy_version 738930 (0.00090) [2022-07-10 13:15:52,312][26022] Updated weights on worker 0-0, policy_version 738940 (0.00084) [2022-07-10 13:15:54,203][26022] Updated weights on worker 0-0, policy_version 738950 (0.00088) [2022-07-10 13:15:54,618][25689] Fps is (10 sec: 5403.9, 60 sec: 5526.6, 300 sec: 5537.0). Total num frames: 756685824. Throughput: 0: 4997.9. Samples: 756679634. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:54,618][25689] Avg episode reward: [(0, '-1.842')] [2022-07-10 13:15:56,024][26022] Updated weights on worker 0-0, policy_version 738960 (0.00089) [2022-07-10 13:15:57,905][26022] Updated weights on worker 0-0, policy_version 738970 (0.00095) [2022-07-10 13:15:59,722][25689] Fps is (10 sec: 5557.2, 60 sec: 5525.0, 300 sec: 5542.4). Total num frames: 756714496. Throughput: 0: 5809.2. Samples: 756712954. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:15:59,722][25689] Avg episode reward: [(0, '-2.070')] [2022-07-10 13:15:59,866][26022] Updated weights on worker 0-0, policy_version 738980 (0.00092) [2022-07-10 13:16:01,781][26022] Updated weights on worker 0-0, policy_version 738990 (0.00086) [2022-07-10 13:16:03,846][26022] Updated weights on worker 0-0, policy_version 739000 (0.00099) [2022-07-10 13:16:04,756][25689] Fps is (10 sec: 5353.0, 60 sec: 5523.1, 300 sec: 5542.2). Total num frames: 756740096. Throughput: 0: 5689.3. Samples: 756744060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:16:04,758][25689] Avg episode reward: [(0, '-1.427')] [2022-07-10 13:16:05,599][26022] Updated weights on worker 0-0, policy_version 739010 (0.00096) [2022-07-10 13:16:07,619][26022] Updated weights on worker 0-0, policy_version 739020 (0.00088) [2022-07-10 13:16:09,576][26022] Updated weights on worker 0-0, policy_version 739030 (0.00093) [2022-07-10 13:16:09,769][25689] Fps is (10 sec: 5197.9, 60 sec: 5522.3, 300 sec: 5535.2). Total num frames: 756766720. Throughput: 0: 5656.1. Samples: 756777004. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:16:09,770][25689] Avg episode reward: [(0, '-2.387')] [2022-07-10 13:16:11,241][26022] Updated weights on worker 0-0, policy_version 739040 (0.00088) [2022-07-10 13:16:13,346][26022] Updated weights on worker 0-0, policy_version 739050 (0.00086) [2022-07-10 13:16:14,779][25689] Fps is (10 sec: 5517.3, 60 sec: 5530.1, 300 sec: 5535.9). Total num frames: 756795392. Throughput: 0: 5647.5. Samples: 756793492. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 13:16:14,779][25689] Avg episode reward: [(0, '-1.786')] [2022-07-10 13:16:14,966][26022] Updated weights on worker 0-0, policy_version 739060 (0.01133) [2022-07-10 13:16:16,877][26022] Updated weights on worker 0-0, policy_version 739070 (0.00087) [2022-07-10 13:16:18,683][26022] Updated weights on worker 0-0, policy_version 739080 (0.00087) [2022-07-10 13:16:19,897][25689] Fps is (10 sec: 5561.0, 60 sec: 5514.5, 300 sec: 5533.8). Total num frames: 756823040. Throughput: 0: 5648.1. Samples: 756826904. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:19,899][25689] Avg episode reward: [(0, '-2.378')] [2022-07-10 13:16:20,467][26022] Updated weights on worker 0-0, policy_version 739090 (0.00093) [2022-07-10 13:16:22,390][26022] Updated weights on worker 0-0, policy_version 739100 (0.00109) [2022-07-10 13:16:24,143][26022] Updated weights on worker 0-0, policy_version 739110 (0.00084) [2022-07-10 13:16:24,911][25689] Fps is (10 sec: 5558.4, 60 sec: 5533.7, 300 sec: 5534.3). Total num frames: 756851712. Throughput: 0: 5769.0. Samples: 756860332. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:24,914][25689] Avg episode reward: [(0, '-1.432')] [2022-07-10 13:16:26,089][26022] Updated weights on worker 0-0, policy_version 739120 (0.00098) [2022-07-10 13:16:27,945][26022] Updated weights on worker 0-0, policy_version 739130 (0.00088) [2022-07-10 13:16:29,656][26022] Updated weights on worker 0-0, policy_version 739140 (0.00088) [2022-07-10 13:16:29,936][25689] Fps is (10 sec: 5711.8, 60 sec: 5516.9, 300 sec: 5537.5). Total num frames: 756880384. Throughput: 0: 4955.1. Samples: 756876934. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:29,937][25689] Avg episode reward: [(0, '-1.376')] [2022-07-10 13:16:31,626][26022] Updated weights on worker 0-0, policy_version 739150 (0.00092) [2022-07-10 13:16:33,387][26022] Updated weights on worker 0-0, policy_version 739160 (0.00098) [2022-07-10 13:16:34,067][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:16:34,082][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000739163_756902912.pth [2022-07-10 13:16:34,085][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000737215_754908160.pth [2022-07-10 13:16:34,991][25689] Fps is (10 sec: 5587.6, 60 sec: 5529.6, 300 sec: 5538.7). Total num frames: 756908032. Throughput: 0: 5799.4. Samples: 756910710. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:34,991][25689] Avg episode reward: [(0, '-1.558')] [2022-07-10 13:16:35,317][26022] Updated weights on worker 0-0, policy_version 739170 (0.00090) [2022-07-10 13:16:37,132][26022] Updated weights on worker 0-0, policy_version 739180 (0.00089) [2022-07-10 13:16:38,948][26022] Updated weights on worker 0-0, policy_version 739190 (0.00099) [2022-07-10 13:16:40,039][25689] Fps is (10 sec: 5574.9, 60 sec: 5528.9, 300 sec: 5534.7). Total num frames: 756936704. Throughput: 0: 5810.8. Samples: 756943944. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:40,039][25689] Avg episode reward: [(0, '-1.281')] [2022-07-10 13:16:40,739][26022] Updated weights on worker 0-0, policy_version 739200 (0.00093) [2022-07-10 13:16:42,740][26022] Updated weights on worker 0-0, policy_version 739210 (0.00087) [2022-07-10 13:16:44,441][26022] Updated weights on worker 0-0, policy_version 739220 (0.00088) [2022-07-10 13:16:45,040][25689] Fps is (10 sec: 5604.2, 60 sec: 5504.0, 300 sec: 5542.3). Total num frames: 756964352. Throughput: 0: 4983.4. Samples: 756960644. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:45,041][25689] Avg episode reward: [(0, '-1.226')] [2022-07-10 13:16:46,227][26022] Updated weights on worker 0-0, policy_version 739230 (0.00084) [2022-07-10 13:16:48,014][26022] Updated weights on worker 0-0, policy_version 739240 (0.00091) [2022-07-10 13:16:50,043][25689] Fps is (10 sec: 5425.0, 60 sec: 5504.9, 300 sec: 5529.5). Total num frames: 756990976. Throughput: 0: 5828.8. Samples: 756994130. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:50,043][25689] Avg episode reward: [(0, '-1.138')] [2022-07-10 13:16:50,172][26022] Updated weights on worker 0-0, policy_version 739250 (0.00084) [2022-07-10 13:16:51,756][26022] Updated weights on worker 0-0, policy_version 739260 (0.00087) [2022-07-10 13:16:53,747][26022] Updated weights on worker 0-0, policy_version 739270 (0.00088) [2022-07-10 13:16:55,049][25689] Fps is (10 sec: 5627.2, 60 sec: 5541.0, 300 sec: 5538.6). Total num frames: 757020672. Throughput: 0: 5852.9. Samples: 757028108. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:16:55,049][25689] Avg episode reward: [(0, '-1.554')] [2022-07-10 13:16:55,352][26022] Updated weights on worker 0-0, policy_version 739280 (0.00093) [2022-07-10 13:16:57,378][26022] Updated weights on worker 0-0, policy_version 739290 (0.00077) [2022-07-10 13:16:59,055][26022] Updated weights on worker 0-0, policy_version 739300 (0.00088) [2022-07-10 13:17:00,084][25689] Fps is (10 sec: 5710.6, 60 sec: 5530.3, 300 sec: 5545.9). Total num frames: 757048320. Throughput: 0: 5039.8. Samples: 757044970. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:00,085][25689] Avg episode reward: [(0, '-1.110')] [2022-07-10 13:17:00,850][26022] Updated weights on worker 0-0, policy_version 739310 (0.00089) [2022-07-10 13:17:03,031][26022] Updated weights on worker 0-0, policy_version 739320 (0.00093) [2022-07-10 13:17:05,024][26022] Updated weights on worker 0-0, policy_version 739330 (0.00091) [2022-07-10 13:17:05,112][25689] Fps is (10 sec: 5393.0, 60 sec: 5547.9, 300 sec: 5539.0). Total num frames: 757074944. Throughput: 0: 5791.0. Samples: 757076880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:05,113][25689] Avg episode reward: [(0, '-1.231')] [2022-07-10 13:17:06,748][26022] Updated weights on worker 0-0, policy_version 739340 (0.00084) [2022-07-10 13:17:08,607][26022] Updated weights on worker 0-0, policy_version 739350 (0.00084) [2022-07-10 13:17:10,151][25689] Fps is (10 sec: 5493.2, 60 sec: 5579.4, 300 sec: 5535.8). Total num frames: 757103616. Throughput: 0: 5769.5. Samples: 757110142. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:10,151][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 13:17:10,320][26022] Updated weights on worker 0-0, policy_version 739360 (0.00093) [2022-07-10 13:17:12,386][26022] Updated weights on worker 0-0, policy_version 739370 (0.00086) [2022-07-10 13:17:14,079][26022] Updated weights on worker 0-0, policy_version 739380 (0.00085) [2022-07-10 13:17:15,162][25689] Fps is (10 sec: 5502.1, 60 sec: 5545.3, 300 sec: 5536.8). Total num frames: 757130240. Throughput: 0: 4907.3. Samples: 757126810. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:15,163][25689] Avg episode reward: [(0, '-2.015')] [2022-07-10 13:17:15,939][26022] Updated weights on worker 0-0, policy_version 739390 (0.00084) [2022-07-10 13:17:17,686][26022] Updated weights on worker 0-0, policy_version 739400 (0.00092) [2022-07-10 13:17:19,573][26022] Updated weights on worker 0-0, policy_version 739410 (0.00090) [2022-07-10 13:17:20,226][25689] Fps is (10 sec: 5386.6, 60 sec: 5550.3, 300 sec: 5532.4). Total num frames: 757157888. Throughput: 0: 5735.1. Samples: 757160484. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:20,227][25689] Avg episode reward: [(0, '-2.111')] [2022-07-10 13:17:21,294][26022] Updated weights on worker 0-0, policy_version 739420 (0.00091) [2022-07-10 13:17:23,303][26022] Updated weights on worker 0-0, policy_version 739430 (0.00089) [2022-07-10 13:17:25,043][26022] Updated weights on worker 0-0, policy_version 739440 (0.00098) [2022-07-10 13:17:25,231][25689] Fps is (10 sec: 5695.0, 60 sec: 5568.1, 300 sec: 5539.6). Total num frames: 757187584. Throughput: 0: 5817.3. Samples: 757193920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:25,232][25689] Avg episode reward: [(0, '-2.031')] [2022-07-10 13:17:26,883][26022] Updated weights on worker 0-0, policy_version 739450 (0.00090) [2022-07-10 13:17:28,847][26022] Updated weights on worker 0-0, policy_version 739460 (0.00083) [2022-07-10 13:17:30,259][25689] Fps is (10 sec: 5715.7, 60 sec: 5551.0, 300 sec: 5535.7). Total num frames: 757215232. Throughput: 0: 4989.8. Samples: 757210478. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:30,259][25689] Avg episode reward: [(0, '-2.752')] [2022-07-10 13:17:30,574][26022] Updated weights on worker 0-0, policy_version 739470 (0.00098) [2022-07-10 13:17:32,479][26022] Updated weights on worker 0-0, policy_version 739480 (0.00087) [2022-07-10 13:17:34,389][26022] Updated weights on worker 0-0, policy_version 739490 (0.00089) [2022-07-10 13:17:35,263][25689] Fps is (10 sec: 5512.4, 60 sec: 5555.6, 300 sec: 5533.7). Total num frames: 757242880. Throughput: 0: 5837.8. Samples: 757244152. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:35,263][25689] Avg episode reward: [(0, '-3.670')] [2022-07-10 13:17:35,972][26022] Updated weights on worker 0-0, policy_version 739500 (0.00092) [2022-07-10 13:17:38,064][26022] Updated weights on worker 0-0, policy_version 739510 (0.00095) [2022-07-10 13:17:39,644][26022] Updated weights on worker 0-0, policy_version 739520 (0.00093) [2022-07-10 13:17:40,380][25689] Fps is (10 sec: 5463.3, 60 sec: 5532.2, 300 sec: 5528.5). Total num frames: 757270528. Throughput: 0: 5801.1. Samples: 757277400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:40,382][25689] Avg episode reward: [(0, '-2.351')] [2022-07-10 13:17:41,635][26022] Updated weights on worker 0-0, policy_version 739530 (0.00082) [2022-07-10 13:17:43,638][26022] Updated weights on worker 0-0, policy_version 739540 (0.00092) [2022-07-10 13:17:45,393][25689] Fps is (10 sec: 5458.3, 60 sec: 5531.2, 300 sec: 5531.8). Total num frames: 757298176. Throughput: 0: 4967.2. Samples: 757294068. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:45,395][25689] Avg episode reward: [(0, '-3.074')] [2022-07-10 13:17:45,510][26022] Updated weights on worker 0-0, policy_version 739550 (0.00098) [2022-07-10 13:17:47,272][26022] Updated weights on worker 0-0, policy_version 739560 (0.00090) [2022-07-10 13:17:48,980][26022] Updated weights on worker 0-0, policy_version 739570 (0.00081) [2022-07-10 13:17:50,420][25689] Fps is (10 sec: 5609.5, 60 sec: 5562.9, 300 sec: 5535.1). Total num frames: 757326848. Throughput: 0: 5807.0. Samples: 757327556. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:50,422][25689] Avg episode reward: [(0, '-3.470')] [2022-07-10 13:17:50,747][26022] Updated weights on worker 0-0, policy_version 739580 (0.00091) [2022-07-10 13:17:52,828][26022] Updated weights on worker 0-0, policy_version 739590 (0.00080) [2022-07-10 13:17:54,650][26022] Updated weights on worker 0-0, policy_version 739600 (0.00091) [2022-07-10 13:17:55,440][25689] Fps is (10 sec: 5504.0, 60 sec: 5510.8, 300 sec: 5522.9). Total num frames: 757353472. Throughput: 0: 5796.1. Samples: 757361100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:17:55,440][25689] Avg episode reward: [(0, '-3.794')] [2022-07-10 13:17:56,340][26022] Updated weights on worker 0-0, policy_version 739610 (0.00092) [2022-07-10 13:17:58,272][26022] Updated weights on worker 0-0, policy_version 739620 (0.00771) [2022-07-10 13:17:59,962][26022] Updated weights on worker 0-0, policy_version 739630 (0.00089) [2022-07-10 13:18:00,572][25689] Fps is (10 sec: 5547.7, 60 sec: 5535.8, 300 sec: 5534.8). Total num frames: 757383168. Throughput: 0: 4980.5. Samples: 757377966. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:00,573][25689] Avg episode reward: [(0, '-3.092')] [2022-07-10 13:18:02,376][26022] Updated weights on worker 0-0, policy_version 739640 (0.00095) [2022-07-10 13:18:04,230][26022] Updated weights on worker 0-0, policy_version 739650 (0.00089) [2022-07-10 13:18:05,615][25689] Fps is (10 sec: 5535.2, 60 sec: 5534.5, 300 sec: 5530.9). Total num frames: 757409792. Throughput: 0: 5697.4. Samples: 757409278. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:05,615][25689] Avg episode reward: [(0, '-3.589')] [2022-07-10 13:18:05,691][26022] Updated weights on worker 0-0, policy_version 739660 (0.00082) [2022-07-10 13:18:07,974][26022] Updated weights on worker 0-0, policy_version 739670 (0.00082) [2022-07-10 13:18:09,652][26022] Updated weights on worker 0-0, policy_version 739680 (0.00093) [2022-07-10 13:18:10,621][25689] Fps is (10 sec: 5299.2, 60 sec: 5503.6, 300 sec: 5527.5). Total num frames: 757436416. Throughput: 0: 5685.1. Samples: 757442396. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:10,621][25689] Avg episode reward: [(0, '-4.206')] [2022-07-10 13:18:11,551][26022] Updated weights on worker 0-0, policy_version 739690 (0.00088) [2022-07-10 13:18:13,347][26022] Updated weights on worker 0-0, policy_version 739700 (0.00094) [2022-07-10 13:18:14,922][26022] Updated weights on worker 0-0, policy_version 739710 (0.00088) [2022-07-10 13:18:15,647][25689] Fps is (10 sec: 5511.8, 60 sec: 5536.1, 300 sec: 5532.4). Total num frames: 757465088. Throughput: 0: 4865.2. Samples: 757459412. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:15,647][25689] Avg episode reward: [(0, '-4.122')] [2022-07-10 13:18:17,291][26022] Updated weights on worker 0-0, policy_version 739720 (0.00094) [2022-07-10 13:18:18,635][26022] Updated weights on worker 0-0, policy_version 739730 (0.00087) [2022-07-10 13:18:20,719][25689] Fps is (10 sec: 5577.4, 60 sec: 5535.4, 300 sec: 5527.8). Total num frames: 757492736. Throughput: 0: 5699.0. Samples: 757492780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:20,719][25689] Avg episode reward: [(0, '-4.330')] [2022-07-10 13:18:20,735][26022] Updated weights on worker 0-0, policy_version 739740 (0.00087) [2022-07-10 13:18:22,499][26022] Updated weights on worker 0-0, policy_version 739750 (0.00050) [2022-07-10 13:18:24,166][26022] Updated weights on worker 0-0, policy_version 739760 (0.00088) [2022-07-10 13:18:25,727][25689] Fps is (10 sec: 5587.1, 60 sec: 5518.1, 300 sec: 5534.7). Total num frames: 757521408. Throughput: 0: 5820.6. Samples: 757526346. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:25,728][25689] Avg episode reward: [(0, '-5.329')] [2022-07-10 13:18:26,368][26022] Updated weights on worker 0-0, policy_version 739770 (0.00096) [2022-07-10 13:18:28,013][26022] Updated weights on worker 0-0, policy_version 739780 (0.00085) [2022-07-10 13:18:29,809][26022] Updated weights on worker 0-0, policy_version 739790 (0.00090) [2022-07-10 13:18:30,766][25689] Fps is (10 sec: 5707.4, 60 sec: 5534.0, 300 sec: 5532.1). Total num frames: 757550080. Throughput: 0: 4986.2. Samples: 757542844. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:30,766][25689] Avg episode reward: [(0, '-5.318')] [2022-07-10 13:18:31,685][26022] Updated weights on worker 0-0, policy_version 739800 (0.00087) [2022-07-10 13:18:33,284][26022] Updated weights on worker 0-0, policy_version 739810 (0.00077) [2022-07-10 13:18:34,106][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:18:34,126][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000739814_757569536.pth [2022-07-10 13:18:34,126][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000737865_755573760.pth [2022-07-10 13:18:35,583][26022] Updated weights on worker 0-0, policy_version 739820 (0.00085) [2022-07-10 13:18:35,776][25689] Fps is (10 sec: 5502.7, 60 sec: 5516.5, 300 sec: 5530.9). Total num frames: 757576704. Throughput: 0: 5826.0. Samples: 757576684. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:35,777][25689] Avg episode reward: [(0, '-4.786')] [2022-07-10 13:18:36,950][26022] Updated weights on worker 0-0, policy_version 739830 (0.00088) [2022-07-10 13:18:39,041][26022] Updated weights on worker 0-0, policy_version 739840 (0.00085) [2022-07-10 13:18:40,829][26022] Updated weights on worker 0-0, policy_version 739850 (0.00061) [2022-07-10 13:18:40,923][25689] Fps is (10 sec: 5544.9, 60 sec: 5547.7, 300 sec: 5542.4). Total num frames: 757606400. Throughput: 0: 5803.4. Samples: 757610034. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:40,924][25689] Avg episode reward: [(0, '-4.584')] [2022-07-10 13:18:42,815][26022] Updated weights on worker 0-0, policy_version 739860 (0.00085) [2022-07-10 13:18:44,517][26022] Updated weights on worker 0-0, policy_version 739870 (0.00086) [2022-07-10 13:18:45,989][25689] Fps is (10 sec: 5615.0, 60 sec: 5542.8, 300 sec: 5535.6). Total num frames: 757634048. Throughput: 0: 5784.9. Samples: 757643556. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:45,989][25689] Avg episode reward: [(0, '-3.155')] [2022-07-10 13:18:46,586][26022] Updated weights on worker 0-0, policy_version 739880 (0.00079) [2022-07-10 13:18:48,144][26022] Updated weights on worker 0-0, policy_version 739890 (0.00090) [2022-07-10 13:18:50,108][26022] Updated weights on worker 0-0, policy_version 739900 (0.00086) [2022-07-10 13:18:51,013][25689] Fps is (10 sec: 5480.1, 60 sec: 5526.2, 300 sec: 5529.5). Total num frames: 757661696. Throughput: 0: 5806.5. Samples: 757660410. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:51,016][25689] Avg episode reward: [(0, '-1.644')] [2022-07-10 13:18:51,839][26022] Updated weights on worker 0-0, policy_version 739910 (0.00092) [2022-07-10 13:18:53,886][26022] Updated weights on worker 0-0, policy_version 739920 (0.00085) [2022-07-10 13:18:55,551][26022] Updated weights on worker 0-0, policy_version 739930 (0.00067) [2022-07-10 13:18:56,017][25689] Fps is (10 sec: 5718.0, 60 sec: 5578.3, 300 sec: 5537.4). Total num frames: 757691392. Throughput: 0: 5779.0. Samples: 757693658. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:18:56,018][25689] Avg episode reward: [(0, '-1.140')] [2022-07-10 13:18:57,442][26022] Updated weights on worker 0-0, policy_version 739940 (0.00094) [2022-07-10 13:18:59,149][26022] Updated weights on worker 0-0, policy_version 739950 (0.00085) [2022-07-10 13:19:01,114][25689] Fps is (10 sec: 5677.1, 60 sec: 5547.8, 300 sec: 5542.7). Total num frames: 757719040. Throughput: 0: 5811.0. Samples: 757727366. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:01,115][25689] Avg episode reward: [(0, '-1.213')] [2022-07-10 13:19:01,125][26022] Updated weights on worker 0-0, policy_version 739960 (0.00086) [2022-07-10 13:19:03,044][26022] Updated weights on worker 0-0, policy_version 739970 (0.00087) [2022-07-10 13:19:05,288][26022] Updated weights on worker 0-0, policy_version 739980 (0.00091) [2022-07-10 13:19:06,222][25689] Fps is (10 sec: 5218.3, 60 sec: 5524.9, 300 sec: 5537.7). Total num frames: 757744640. Throughput: 0: 4866.8. Samples: 757742024. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:06,222][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 13:19:06,798][26022] Updated weights on worker 0-0, policy_version 739990 (0.00088) [2022-07-10 13:19:08,802][26022] Updated weights on worker 0-0, policy_version 740000 (0.00090) [2022-07-10 13:19:10,750][26022] Updated weights on worker 0-0, policy_version 740010 (0.00094) [2022-07-10 13:19:11,235][25689] Fps is (10 sec: 5261.5, 60 sec: 5541.2, 300 sec: 5535.0). Total num frames: 757772288. Throughput: 0: 5678.7. Samples: 757775240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:11,236][25689] Avg episode reward: [(0, '-0.013')] [2022-07-10 13:19:12,524][26022] Updated weights on worker 0-0, policy_version 740020 (0.00103) [2022-07-10 13:19:14,518][26022] Updated weights on worker 0-0, policy_version 740030 (0.00089) [2022-07-10 13:19:16,082][26022] Updated weights on worker 0-0, policy_version 740040 (0.00095) [2022-07-10 13:19:16,250][25689] Fps is (10 sec: 5616.1, 60 sec: 5542.2, 300 sec: 5536.1). Total num frames: 757800960. Throughput: 0: 5678.8. Samples: 757808554. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:16,252][25689] Avg episode reward: [(0, '-1.051')] [2022-07-10 13:19:18,115][26022] Updated weights on worker 0-0, policy_version 740050 (0.00089) [2022-07-10 13:19:19,998][26022] Updated weights on worker 0-0, policy_version 740060 (0.00088) [2022-07-10 13:19:21,339][25689] Fps is (10 sec: 5574.0, 60 sec: 5540.6, 300 sec: 5531.3). Total num frames: 757828608. Throughput: 0: 4842.3. Samples: 757825298. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:21,339][25689] Avg episode reward: [(0, '-1.496')] [2022-07-10 13:19:21,730][26022] Updated weights on worker 0-0, policy_version 740070 (0.00089) [2022-07-10 13:19:23,512][26022] Updated weights on worker 0-0, policy_version 740080 (0.00086) [2022-07-10 13:19:25,470][26022] Updated weights on worker 0-0, policy_version 740090 (0.00090) [2022-07-10 13:19:26,355][25689] Fps is (10 sec: 5573.4, 60 sec: 5539.9, 300 sec: 5532.6). Total num frames: 757857280. Throughput: 0: 5809.1. Samples: 757858980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:26,355][25689] Avg episode reward: [(0, '-1.177')] [2022-07-10 13:19:27,051][26022] Updated weights on worker 0-0, policy_version 740100 (0.00088) [2022-07-10 13:19:29,172][26022] Updated weights on worker 0-0, policy_version 740110 (0.00086) [2022-07-10 13:19:30,941][26022] Updated weights on worker 0-0, policy_version 740120 (0.00087) [2022-07-10 13:19:31,387][25689] Fps is (10 sec: 5503.2, 60 sec: 5506.8, 300 sec: 5533.8). Total num frames: 757883904. Throughput: 0: 5823.6. Samples: 757892598. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:31,387][25689] Avg episode reward: [(0, '-1.697')] [2022-07-10 13:19:32,555][26022] Updated weights on worker 0-0, policy_version 740130 (0.00093) [2022-07-10 13:19:34,820][26022] Updated weights on worker 0-0, policy_version 740140 (0.00119) [2022-07-10 13:19:36,404][25689] Fps is (10 sec: 5502.6, 60 sec: 5539.9, 300 sec: 5531.9). Total num frames: 757912576. Throughput: 0: 5000.2. Samples: 757909328. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:36,405][25689] Avg episode reward: [(0, '-1.888')] [2022-07-10 13:19:36,487][26022] Updated weights on worker 0-0, policy_version 740150 (0.00085) [2022-07-10 13:19:38,343][26022] Updated weights on worker 0-0, policy_version 740160 (0.00096) [2022-07-10 13:19:40,226][26022] Updated weights on worker 0-0, policy_version 740170 (0.00094) [2022-07-10 13:19:41,513][25689] Fps is (10 sec: 5561.5, 60 sec: 5509.5, 300 sec: 5526.5). Total num frames: 757940224. Throughput: 0: 5813.0. Samples: 757942574. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-10 13:19:41,514][25689] Avg episode reward: [(0, '-2.362')] [2022-07-10 13:19:41,887][26022] Updated weights on worker 0-0, policy_version 740180 (0.00088) [2022-07-10 13:19:43,906][26022] Updated weights on worker 0-0, policy_version 740190 (0.01028) [2022-07-10 13:19:45,556][26022] Updated weights on worker 0-0, policy_version 740200 (0.00090) [2022-07-10 13:19:46,531][25689] Fps is (10 sec: 5561.3, 60 sec: 5530.8, 300 sec: 5530.4). Total num frames: 757968896. Throughput: 0: 5804.0. Samples: 757976082. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:19:46,532][25689] Avg episode reward: [(0, '-2.434')] [2022-07-10 13:19:47,575][26022] Updated weights on worker 0-0, policy_version 740210 (0.00091) [2022-07-10 13:19:49,277][26022] Updated weights on worker 0-0, policy_version 740220 (0.00084) [2022-07-10 13:19:51,199][26022] Updated weights on worker 0-0, policy_version 740230 (0.00377) [2022-07-10 13:19:51,604][25689] Fps is (10 sec: 5682.7, 60 sec: 5543.3, 300 sec: 5533.4). Total num frames: 757997568. Throughput: 0: 4954.3. Samples: 757992762. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:19:51,605][25689] Avg episode reward: [(0, '-3.018')] [2022-07-10 13:19:52,877][26022] Updated weights on worker 0-0, policy_version 740240 (0.00085) [2022-07-10 13:19:54,967][26022] Updated weights on worker 0-0, policy_version 740250 (0.00085) [2022-07-10 13:19:56,631][25689] Fps is (10 sec: 5475.0, 60 sec: 5490.5, 300 sec: 5527.6). Total num frames: 758024192. Throughput: 0: 5776.7. Samples: 758026172. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:19:56,632][25689] Avg episode reward: [(0, '-4.728')] [2022-07-10 13:19:56,933][26022] Updated weights on worker 0-0, policy_version 740260 (0.00091) [2022-07-10 13:19:58,544][26022] Updated weights on worker 0-0, policy_version 740270 (0.00089) [2022-07-10 13:20:00,587][26022] Updated weights on worker 0-0, policy_version 740280 (0.00085) [2022-07-10 13:20:01,728][25689] Fps is (10 sec: 5563.4, 60 sec: 5524.3, 300 sec: 5539.8). Total num frames: 758053888. Throughput: 0: 5774.8. Samples: 758059308. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:01,729][25689] Avg episode reward: [(0, '-4.197')] [2022-07-10 13:20:02,767][26022] Updated weights on worker 0-0, policy_version 740290 (0.00095) [2022-07-10 13:20:04,467][26022] Updated weights on worker 0-0, policy_version 740300 (0.00097) [2022-07-10 13:20:06,513][26022] Updated weights on worker 0-0, policy_version 740310 (0.00086) [2022-07-10 13:20:06,739][25689] Fps is (10 sec: 5368.9, 60 sec: 5516.1, 300 sec: 5532.8). Total num frames: 758078464. Throughput: 0: 4831.9. Samples: 758073726. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:06,740][25689] Avg episode reward: [(0, '-6.561')] [2022-07-10 13:20:08,039][26022] Updated weights on worker 0-0, policy_version 740320 (0.00085) [2022-07-10 13:20:10,161][26022] Updated weights on worker 0-0, policy_version 740330 (0.00086) [2022-07-10 13:20:11,808][25689] Fps is (10 sec: 5282.1, 60 sec: 5527.9, 300 sec: 5533.3). Total num frames: 758107136. Throughput: 0: 5646.5. Samples: 758106844. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:11,809][25689] Avg episode reward: [(0, '-6.164')] [2022-07-10 13:20:12,066][26022] Updated weights on worker 0-0, policy_version 740340 (0.00089) [2022-07-10 13:20:13,764][26022] Updated weights on worker 0-0, policy_version 740350 (0.00090) [2022-07-10 13:20:15,795][26022] Updated weights on worker 0-0, policy_version 740360 (0.00085) [2022-07-10 13:20:16,835][25689] Fps is (10 sec: 5578.6, 60 sec: 5510.0, 300 sec: 5531.8). Total num frames: 758134784. Throughput: 0: 5638.2. Samples: 758140088. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:16,836][25689] Avg episode reward: [(0, '-6.745')] [2022-07-10 13:20:17,387][26022] Updated weights on worker 0-0, policy_version 740370 (0.00095) [2022-07-10 13:20:19,417][26022] Updated weights on worker 0-0, policy_version 740380 (0.00084) [2022-07-10 13:20:21,267][26022] Updated weights on worker 0-0, policy_version 740390 (0.00094) [2022-07-10 13:20:21,907][25689] Fps is (10 sec: 5374.2, 60 sec: 5494.6, 300 sec: 5527.7). Total num frames: 758161408. Throughput: 0: 4830.5. Samples: 758156784. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:21,908][25689] Avg episode reward: [(0, '-5.831')] [2022-07-10 13:20:23,054][26022] Updated weights on worker 0-0, policy_version 740400 (0.00080) [2022-07-10 13:20:25,072][26022] Updated weights on worker 0-0, policy_version 740410 (0.00093) [2022-07-10 13:20:26,635][26022] Updated weights on worker 0-0, policy_version 740420 (0.00090) [2022-07-10 13:20:26,914][25689] Fps is (10 sec: 5587.9, 60 sec: 5512.4, 300 sec: 5528.1). Total num frames: 758191104. Throughput: 0: 5770.6. Samples: 758190146. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:26,914][25689] Avg episode reward: [(0, '-4.915')] [2022-07-10 13:20:28,796][26022] Updated weights on worker 0-0, policy_version 740430 (0.00090) [2022-07-10 13:20:30,401][26022] Updated weights on worker 0-0, policy_version 740440 (0.00093) [2022-07-10 13:20:31,945][25689] Fps is (10 sec: 5508.8, 60 sec: 5495.5, 300 sec: 5524.2). Total num frames: 758216704. Throughput: 0: 5784.1. Samples: 758223314. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:31,946][25689] Avg episode reward: [(0, '-5.005')] [2022-07-10 13:20:32,515][26022] Updated weights on worker 0-0, policy_version 740450 (0.00085) [2022-07-10 13:20:33,927][26022] Updated weights on worker 0-0, policy_version 740460 (0.00082) [2022-07-10 13:20:34,328][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:20:34,342][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000740462_758233088.pth [2022-07-10 13:20:34,346][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000738514_756238336.pth [2022-07-10 13:20:36,060][26022] Updated weights on worker 0-0, policy_version 740470 (0.00083) [2022-07-10 13:20:36,976][25689] Fps is (10 sec: 5495.3, 60 sec: 5511.2, 300 sec: 5527.9). Total num frames: 758246400. Throughput: 0: 4969.6. Samples: 758240182. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:36,977][25689] Avg episode reward: [(0, '-2.081')] [2022-07-10 13:20:37,608][26022] Updated weights on worker 0-0, policy_version 740480 (0.00093) [2022-07-10 13:20:39,790][26022] Updated weights on worker 0-0, policy_version 740490 (0.00083) [2022-07-10 13:20:41,475][26022] Updated weights on worker 0-0, policy_version 740500 (0.00080) [2022-07-10 13:20:42,107][25689] Fps is (10 sec: 5743.4, 60 sec: 5526.1, 300 sec: 5523.8). Total num frames: 758275072. Throughput: 0: 5791.8. Samples: 758273780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:42,108][25689] Avg episode reward: [(0, '-2.263')] [2022-07-10 13:20:43,417][26022] Updated weights on worker 0-0, policy_version 740510 (0.00088) [2022-07-10 13:20:45,018][26022] Updated weights on worker 0-0, policy_version 740520 (0.00087) [2022-07-10 13:20:47,118][25689] Fps is (10 sec: 5553.1, 60 sec: 5509.8, 300 sec: 5527.3). Total num frames: 758302720. Throughput: 0: 5782.5. Samples: 758306978. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:47,119][25689] Avg episode reward: [(0, '-3.123')] [2022-07-10 13:20:47,126][26022] Updated weights on worker 0-0, policy_version 740530 (0.00368) [2022-07-10 13:20:48,698][26022] Updated weights on worker 0-0, policy_version 740540 (0.00087) [2022-07-10 13:20:50,927][26022] Updated weights on worker 0-0, policy_version 740550 (0.00086) [2022-07-10 13:20:52,154][25689] Fps is (10 sec: 5605.7, 60 sec: 5513.2, 300 sec: 5530.6). Total num frames: 758331392. Throughput: 0: 4968.1. Samples: 758323718. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:52,155][25689] Avg episode reward: [(0, '-2.366')] [2022-07-10 13:20:52,493][26022] Updated weights on worker 0-0, policy_version 740560 (0.00090) [2022-07-10 13:20:54,625][26022] Updated weights on worker 0-0, policy_version 740570 (0.00092) [2022-07-10 13:20:56,361][26022] Updated weights on worker 0-0, policy_version 740580 (0.00089) [2022-07-10 13:20:57,160][25689] Fps is (10 sec: 5506.7, 60 sec: 5515.1, 300 sec: 5525.6). Total num frames: 758358016. Throughput: 0: 5784.7. Samples: 758356938. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:20:57,161][25689] Avg episode reward: [(0, '-2.507')] [2022-07-10 13:20:58,111][26022] Updated weights on worker 0-0, policy_version 740590 (0.00095) [2022-07-10 13:20:59,933][26022] Updated weights on worker 0-0, policy_version 740600 (0.00091) [2022-07-10 13:21:02,299][25689] Fps is (10 sec: 5148.1, 60 sec: 5443.7, 300 sec: 5523.7). Total num frames: 758383616. Throughput: 0: 5770.0. Samples: 758390284. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:02,299][25689] Avg episode reward: [(0, '-2.239')] [2022-07-10 13:21:02,403][26022] Updated weights on worker 0-0, policy_version 740610 (0.00089) [2022-07-10 13:21:04,034][26022] Updated weights on worker 0-0, policy_version 740620 (0.00091) [2022-07-10 13:21:06,023][26022] Updated weights on worker 0-0, policy_version 740630 (0.00082) [2022-07-10 13:21:07,309][25689] Fps is (10 sec: 5448.2, 60 sec: 5528.3, 300 sec: 5534.0). Total num frames: 758413312. Throughput: 0: 5684.8. Samples: 758421760. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:07,310][25689] Avg episode reward: [(0, '-4.013')] [2022-07-10 13:21:07,607][26022] Updated weights on worker 0-0, policy_version 740640 (0.00082) [2022-07-10 13:21:09,698][26022] Updated weights on worker 0-0, policy_version 740650 (0.00078) [2022-07-10 13:21:11,312][26022] Updated weights on worker 0-0, policy_version 740660 (0.00093) [2022-07-10 13:21:12,324][25689] Fps is (10 sec: 5617.5, 60 sec: 5499.4, 300 sec: 5527.0). Total num frames: 758439936. Throughput: 0: 5684.5. Samples: 758438374. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:12,325][25689] Avg episode reward: [(0, '-3.141')] [2022-07-10 13:21:13,287][26022] Updated weights on worker 0-0, policy_version 740670 (0.00084) [2022-07-10 13:21:15,029][26022] Updated weights on worker 0-0, policy_version 740680 (0.00088) [2022-07-10 13:21:17,106][26022] Updated weights on worker 0-0, policy_version 740690 (0.00085) [2022-07-10 13:21:17,338][25689] Fps is (10 sec: 5513.9, 60 sec: 5517.5, 300 sec: 5532.4). Total num frames: 758468608. Throughput: 0: 5676.9. Samples: 758471486. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:17,338][25689] Avg episode reward: [(0, '-3.172')] [2022-07-10 13:21:18,618][26022] Updated weights on worker 0-0, policy_version 740700 (0.00084) [2022-07-10 13:21:20,674][26022] Updated weights on worker 0-0, policy_version 740710 (0.00085) [2022-07-10 13:21:22,411][25689] Fps is (10 sec: 5583.5, 60 sec: 5534.3, 300 sec: 5527.9). Total num frames: 758496256. Throughput: 0: 5692.2. Samples: 758504768. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:22,412][25689] Avg episode reward: [(0, '-2.901')] [2022-07-10 13:21:22,598][26022] Updated weights on worker 0-0, policy_version 740720 (0.00087) [2022-07-10 13:21:24,298][26022] Updated weights on worker 0-0, policy_version 740730 (0.00087) [2022-07-10 13:21:26,212][26022] Updated weights on worker 0-0, policy_version 740740 (0.00095) [2022-07-10 13:21:27,461][25689] Fps is (10 sec: 5563.6, 60 sec: 5513.5, 300 sec: 5527.4). Total num frames: 758524928. Throughput: 0: 4958.0. Samples: 758521670. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:27,462][25689] Avg episode reward: [(0, '-3.144')] [2022-07-10 13:21:27,966][26022] Updated weights on worker 0-0, policy_version 740750 (0.00091) [2022-07-10 13:21:29,811][26022] Updated weights on worker 0-0, policy_version 740760 (0.00087) [2022-07-10 13:21:31,754][26022] Updated weights on worker 0-0, policy_version 740770 (0.00092) [2022-07-10 13:21:32,509][25689] Fps is (10 sec: 5475.9, 60 sec: 5528.8, 300 sec: 5524.1). Total num frames: 758551552. Throughput: 0: 5766.6. Samples: 758554772. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:32,511][25689] Avg episode reward: [(0, '-1.041')] [2022-07-10 13:21:33,519][26022] Updated weights on worker 0-0, policy_version 740780 (0.00087) [2022-07-10 13:21:35,605][26022] Updated weights on worker 0-0, policy_version 740790 (0.00084) [2022-07-10 13:21:37,083][26022] Updated weights on worker 0-0, policy_version 740800 (0.00088) [2022-07-10 13:21:37,527][25689] Fps is (10 sec: 5493.4, 60 sec: 5513.2, 300 sec: 5524.7). Total num frames: 758580224. Throughput: 0: 5792.1. Samples: 758588422. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:37,527][25689] Avg episode reward: [(0, '-0.882')] [2022-07-10 13:21:39,076][26022] Updated weights on worker 0-0, policy_version 740810 (0.00084) [2022-07-10 13:21:40,791][26022] Updated weights on worker 0-0, policy_version 740820 (0.00096) [2022-07-10 13:21:42,655][25689] Fps is (10 sec: 5551.3, 60 sec: 5496.5, 300 sec: 5522.3). Total num frames: 758607872. Throughput: 0: 4966.0. Samples: 758605296. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:42,655][25689] Avg episode reward: [(0, '-0.637')] [2022-07-10 13:21:42,780][26022] Updated weights on worker 0-0, policy_version 740830 (0.00103) [2022-07-10 13:21:44,628][26022] Updated weights on worker 0-0, policy_version 740840 (0.00083) [2022-07-10 13:21:46,508][26022] Updated weights on worker 0-0, policy_version 740850 (0.00095) [2022-07-10 13:21:47,662][25689] Fps is (10 sec: 5556.8, 60 sec: 5513.8, 300 sec: 5529.1). Total num frames: 758636544. Throughput: 0: 5800.0. Samples: 758638838. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:47,662][25689] Avg episode reward: [(0, '-0.057')] [2022-07-10 13:21:48,271][26022] Updated weights on worker 0-0, policy_version 740860 (0.00084) [2022-07-10 13:21:50,019][26022] Updated weights on worker 0-0, policy_version 740870 (0.00094) [2022-07-10 13:21:52,006][26022] Updated weights on worker 0-0, policy_version 740880 (0.00090) [2022-07-10 13:21:52,698][25689] Fps is (10 sec: 5811.5, 60 sec: 5530.7, 300 sec: 5528.5). Total num frames: 758666240. Throughput: 0: 5834.6. Samples: 758672566. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:52,699][25689] Avg episode reward: [(0, '-1.077')] [2022-07-10 13:21:53,594][26022] Updated weights on worker 0-0, policy_version 740890 (0.00090) [2022-07-10 13:21:55,519][26022] Updated weights on worker 0-0, policy_version 740900 (0.00098) [2022-07-10 13:21:57,441][26022] Updated weights on worker 0-0, policy_version 740910 (0.00080) [2022-07-10 13:21:57,717][25689] Fps is (10 sec: 5601.1, 60 sec: 5529.5, 300 sec: 5525.4). Total num frames: 758692864. Throughput: 0: 5005.9. Samples: 758689494. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:21:57,718][25689] Avg episode reward: [(0, '-1.581')] [2022-07-10 13:21:59,283][26022] Updated weights on worker 0-0, policy_version 740920 (0.00104) [2022-07-10 13:22:00,982][26022] Updated weights on worker 0-0, policy_version 740930 (0.00089) [2022-07-10 13:22:02,857][25689] Fps is (10 sec: 5241.7, 60 sec: 5546.3, 300 sec: 5523.3). Total num frames: 758719488. Throughput: 0: 5831.3. Samples: 758723100. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:02,857][25689] Avg episode reward: [(0, '-1.754')] [2022-07-10 13:22:03,294][26022] Updated weights on worker 0-0, policy_version 740940 (0.00087) [2022-07-10 13:22:05,052][26022] Updated weights on worker 0-0, policy_version 740950 (0.00084) [2022-07-10 13:22:07,172][26022] Updated weights on worker 0-0, policy_version 740960 (0.00087) [2022-07-10 13:22:07,875][25689] Fps is (10 sec: 5443.5, 60 sec: 5528.6, 300 sec: 5523.7). Total num frames: 758748160. Throughput: 0: 5729.1. Samples: 758754642. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:07,876][25689] Avg episode reward: [(0, '-1.519')] [2022-07-10 13:22:08,626][26022] Updated weights on worker 0-0, policy_version 740970 (0.00093) [2022-07-10 13:22:10,592][26022] Updated weights on worker 0-0, policy_version 740980 (0.00092) [2022-07-10 13:22:12,248][26022] Updated weights on worker 0-0, policy_version 740990 (0.00099) [2022-07-10 13:22:12,915][25689] Fps is (10 sec: 5599.2, 60 sec: 5543.3, 300 sec: 5526.6). Total num frames: 758775808. Throughput: 0: 4889.0. Samples: 758771408. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:12,916][25689] Avg episode reward: [(0, '-2.965')] [2022-07-10 13:22:14,259][26022] Updated weights on worker 0-0, policy_version 741000 (0.00079) [2022-07-10 13:22:16,089][26022] Updated weights on worker 0-0, policy_version 741010 (0.00086) [2022-07-10 13:22:17,919][25689] Fps is (10 sec: 5505.4, 60 sec: 5527.2, 300 sec: 5527.7). Total num frames: 758803456. Throughput: 0: 5714.8. Samples: 758804946. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:17,920][25689] Avg episode reward: [(0, '-3.611')] [2022-07-10 13:22:18,033][26022] Updated weights on worker 0-0, policy_version 741020 (0.00087) [2022-07-10 13:22:19,620][26022] Updated weights on worker 0-0, policy_version 741030 (0.00091) [2022-07-10 13:22:21,706][26022] Updated weights on worker 0-0, policy_version 741040 (0.00089) [2022-07-10 13:22:22,990][25689] Fps is (10 sec: 5691.9, 60 sec: 5561.3, 300 sec: 5526.5). Total num frames: 758833152. Throughput: 0: 5718.1. Samples: 758838224. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:22,990][25689] Avg episode reward: [(0, '-3.760')] [2022-07-10 13:22:23,364][26022] Updated weights on worker 0-0, policy_version 741050 (0.00089) [2022-07-10 13:22:25,319][26022] Updated weights on worker 0-0, policy_version 741060 (0.00087) [2022-07-10 13:22:27,087][26022] Updated weights on worker 0-0, policy_version 741070 (0.00086) [2022-07-10 13:22:27,993][25689] Fps is (10 sec: 5590.8, 60 sec: 5531.7, 300 sec: 5523.5). Total num frames: 758859776. Throughput: 0: 4993.8. Samples: 758855106. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:27,994][25689] Avg episode reward: [(0, '-2.579')] [2022-07-10 13:22:28,743][26022] Updated weights on worker 0-0, policy_version 741080 (0.00093) [2022-07-10 13:22:30,771][26022] Updated weights on worker 0-0, policy_version 741090 (0.00086) [2022-07-10 13:22:32,494][26022] Updated weights on worker 0-0, policy_version 741100 (0.01232) [2022-07-10 13:22:33,075][25689] Fps is (10 sec: 5381.3, 60 sec: 5545.6, 300 sec: 5522.1). Total num frames: 758887424. Throughput: 0: 5818.9. Samples: 758888716. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:33,076][25689] Avg episode reward: [(0, '-4.285')] [2022-07-10 13:22:34,472][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:22:34,485][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000741110_758896640.pth [2022-07-10 13:22:34,489][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000739163_756902912.pth [2022-07-10 13:22:34,491][26022] Updated weights on worker 0-0, policy_version 741110 (0.00090) [2022-07-10 13:22:36,553][26022] Updated weights on worker 0-0, policy_version 741120 (0.00086) [2022-07-10 13:22:37,891][26022] Updated weights on worker 0-0, policy_version 741130 (0.00088) [2022-07-10 13:22:38,097][25689] Fps is (10 sec: 5776.7, 60 sec: 5579.0, 300 sec: 5534.2). Total num frames: 758918144. Throughput: 0: 5817.1. Samples: 758922322. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:38,099][25689] Avg episode reward: [(0, '-4.561')] [2022-07-10 13:22:40,040][26022] Updated weights on worker 0-0, policy_version 741140 (0.00091) [2022-07-10 13:22:41,751][26022] Updated weights on worker 0-0, policy_version 741150 (0.00085) [2022-07-10 13:22:43,147][25689] Fps is (10 sec: 5693.3, 60 sec: 5569.2, 300 sec: 5530.0). Total num frames: 758944768. Throughput: 0: 5005.0. Samples: 758939112. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:43,148][25689] Avg episode reward: [(0, '-3.617')] [2022-07-10 13:22:43,489][26022] Updated weights on worker 0-0, policy_version 741160 (0.00091) [2022-07-10 13:22:45,517][26022] Updated weights on worker 0-0, policy_version 741170 (0.00090) [2022-07-10 13:22:47,134][26022] Updated weights on worker 0-0, policy_version 741180 (0.00083) [2022-07-10 13:22:48,151][25689] Fps is (10 sec: 5398.2, 60 sec: 5552.6, 300 sec: 5527.0). Total num frames: 758972416. Throughput: 0: 5827.4. Samples: 758972572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:48,151][25689] Avg episode reward: [(0, '-3.581')] [2022-07-10 13:22:49,275][26022] Updated weights on worker 0-0, policy_version 741190 (0.00085) [2022-07-10 13:22:50,983][26022] Updated weights on worker 0-0, policy_version 741200 (0.00091) [2022-07-10 13:22:52,668][26022] Updated weights on worker 0-0, policy_version 741210 (0.00094) [2022-07-10 13:22:53,174][25689] Fps is (10 sec: 5617.1, 60 sec: 5536.9, 300 sec: 5533.9). Total num frames: 759001088. Throughput: 0: 5845.8. Samples: 759006210. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:53,183][25689] Avg episode reward: [(0, '-2.864')] [2022-07-10 13:22:54,749][26022] Updated weights on worker 0-0, policy_version 741220 (0.00086) [2022-07-10 13:22:56,488][26022] Updated weights on worker 0-0, policy_version 741230 (0.00085) [2022-07-10 13:22:58,203][25689] Fps is (10 sec: 5603.0, 60 sec: 5552.9, 300 sec: 5528.9). Total num frames: 759028736. Throughput: 0: 4999.4. Samples: 759022838. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:22:58,203][25689] Avg episode reward: [(0, '-4.034')] [2022-07-10 13:22:58,256][26022] Updated weights on worker 0-0, policy_version 741240 (0.00079) [2022-07-10 13:23:00,191][26022] Updated weights on worker 0-0, policy_version 741250 (0.00106) [2022-07-10 13:23:02,308][26022] Updated weights on worker 0-0, policy_version 741260 (0.00087) [2022-07-10 13:23:03,272][25689] Fps is (10 sec: 5272.9, 60 sec: 5542.4, 300 sec: 5525.0). Total num frames: 759054336. Throughput: 0: 5800.2. Samples: 759055840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:23:03,273][25689] Avg episode reward: [(0, '-3.544')] [2022-07-10 13:23:04,332][26022] Updated weights on worker 0-0, policy_version 741270 (0.00096) [2022-07-10 13:23:05,895][26022] Updated weights on worker 0-0, policy_version 741280 (0.00094) [2022-07-10 13:23:07,903][26022] Updated weights on worker 0-0, policy_version 741290 (0.00087) [2022-07-10 13:23:08,299][25689] Fps is (10 sec: 5375.6, 60 sec: 5541.7, 300 sec: 5531.5). Total num frames: 759083008. Throughput: 0: 5721.8. Samples: 759087854. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 13:23:08,299][25689] Avg episode reward: [(0, '-3.281')] [2022-07-10 13:23:09,695][26022] Updated weights on worker 0-0, policy_version 741300 (0.00097) [2022-07-10 13:23:11,485][26022] Updated weights on worker 0-0, policy_version 741310 (0.00084) [2022-07-10 13:23:13,325][25689] Fps is (10 sec: 5602.3, 60 sec: 5542.9, 300 sec: 5528.0). Total num frames: 759110656. Throughput: 0: 4874.8. Samples: 759104444. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:13,326][25689] Avg episode reward: [(0, '-3.312')] [2022-07-10 13:23:13,588][26022] Updated weights on worker 0-0, policy_version 741320 (0.00088) [2022-07-10 13:23:15,301][26022] Updated weights on worker 0-0, policy_version 741330 (0.00089) [2022-07-10 13:23:17,128][26022] Updated weights on worker 0-0, policy_version 741340 (0.00091) [2022-07-10 13:23:18,359][25689] Fps is (10 sec: 5496.3, 60 sec: 5540.2, 300 sec: 5528.7). Total num frames: 759138304. Throughput: 0: 5694.6. Samples: 759137622. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:18,360][25689] Avg episode reward: [(0, '-3.261')] [2022-07-10 13:23:18,993][26022] Updated weights on worker 0-0, policy_version 741350 (0.00089) [2022-07-10 13:23:21,029][26022] Updated weights on worker 0-0, policy_version 741360 (0.00065) [2022-07-10 13:23:22,598][26022] Updated weights on worker 0-0, policy_version 741370 (0.00090) [2022-07-10 13:23:23,454][25689] Fps is (10 sec: 5560.4, 60 sec: 5521.0, 300 sec: 5527.1). Total num frames: 759166976. Throughput: 0: 5708.6. Samples: 759171050. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:23,454][25689] Avg episode reward: [(0, '-3.193')] [2022-07-10 13:23:24,659][26022] Updated weights on worker 0-0, policy_version 741380 (0.00083) [2022-07-10 13:23:26,246][26022] Updated weights on worker 0-0, policy_version 741390 (0.00090) [2022-07-10 13:23:28,455][25689] Fps is (10 sec: 5477.0, 60 sec: 5521.2, 300 sec: 5520.9). Total num frames: 759193600. Throughput: 0: 4955.7. Samples: 759187744. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:28,456][25689] Avg episode reward: [(0, '-2.995')] [2022-07-10 13:23:28,459][26022] Updated weights on worker 0-0, policy_version 741400 (0.00091) [2022-07-10 13:23:29,811][26022] Updated weights on worker 0-0, policy_version 741410 (0.00085) [2022-07-10 13:23:32,118][26022] Updated weights on worker 0-0, policy_version 741420 (0.00093) [2022-07-10 13:23:33,472][25689] Fps is (10 sec: 5621.8, 60 sec: 5561.1, 300 sec: 5531.1). Total num frames: 759223296. Throughput: 0: 5783.5. Samples: 759220964. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:33,473][25689] Avg episode reward: [(0, '-2.598')] [2022-07-10 13:23:33,587][26022] Updated weights on worker 0-0, policy_version 741430 (0.00082) [2022-07-10 13:23:35,499][26022] Updated weights on worker 0-0, policy_version 741440 (0.00088) [2022-07-10 13:23:37,343][26022] Updated weights on worker 0-0, policy_version 741450 (0.00083) [2022-07-10 13:23:38,480][25689] Fps is (10 sec: 5720.3, 60 sec: 5511.5, 300 sec: 5526.8). Total num frames: 759250944. Throughput: 0: 5820.4. Samples: 759254732. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:38,480][25689] Avg episode reward: [(0, '-2.548')] [2022-07-10 13:23:39,234][26022] Updated weights on worker 0-0, policy_version 741460 (0.00090) [2022-07-10 13:23:41,068][26022] Updated weights on worker 0-0, policy_version 741470 (0.00093) [2022-07-10 13:23:42,968][26022] Updated weights on worker 0-0, policy_version 741480 (0.00086) [2022-07-10 13:23:43,607][25689] Fps is (10 sec: 5354.9, 60 sec: 5504.5, 300 sec: 5522.2). Total num frames: 759277568. Throughput: 0: 5812.2. Samples: 759288184. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:43,608][25689] Avg episode reward: [(0, '-2.607')] [2022-07-10 13:23:44,478][26022] Updated weights on worker 0-0, policy_version 741490 (0.00113) [2022-07-10 13:23:46,835][26022] Updated weights on worker 0-0, policy_version 741500 (0.00533) [2022-07-10 13:23:48,277][26022] Updated weights on worker 0-0, policy_version 741510 (0.00085) [2022-07-10 13:23:48,617][25689] Fps is (10 sec: 5556.0, 60 sec: 5537.8, 300 sec: 5529.4). Total num frames: 759307264. Throughput: 0: 5798.4. Samples: 759304648. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:48,617][25689] Avg episode reward: [(0, '-3.069')] [2022-07-10 13:23:50,307][26022] Updated weights on worker 0-0, policy_version 741520 (0.00087) [2022-07-10 13:23:52,318][26022] Updated weights on worker 0-0, policy_version 741530 (0.00099) [2022-07-10 13:23:53,652][25689] Fps is (10 sec: 5810.6, 60 sec: 5536.7, 300 sec: 5525.4). Total num frames: 759335936. Throughput: 0: 5790.9. Samples: 759337824. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:53,653][25689] Avg episode reward: [(0, '-3.136')] [2022-07-10 13:23:53,797][26022] Updated weights on worker 0-0, policy_version 741540 (0.00085) [2022-07-10 13:23:56,009][26022] Updated weights on worker 0-0, policy_version 741550 (0.00068) [2022-07-10 13:23:57,495][26022] Updated weights on worker 0-0, policy_version 741560 (0.00095) [2022-07-10 13:23:58,656][25689] Fps is (10 sec: 5406.2, 60 sec: 5505.1, 300 sec: 5520.2). Total num frames: 759361536. Throughput: 0: 5789.0. Samples: 759371530. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:23:58,656][25689] Avg episode reward: [(0, '-2.705')] [2022-07-10 13:23:59,505][26022] Updated weights on worker 0-0, policy_version 741570 (0.00083) [2022-07-10 13:24:01,959][26022] Updated weights on worker 0-0, policy_version 741580 (0.00083) [2022-07-10 13:24:03,478][26022] Updated weights on worker 0-0, policy_version 741590 (0.00093) [2022-07-10 13:24:03,702][25689] Fps is (10 sec: 5298.5, 60 sec: 5541.1, 300 sec: 5528.2). Total num frames: 759389184. Throughput: 0: 4902.4. Samples: 759386696. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:03,702][25689] Avg episode reward: [(0, '-2.618')] [2022-07-10 13:24:05,685][26022] Updated weights on worker 0-0, policy_version 741600 (0.00087) [2022-07-10 13:24:07,264][26022] Updated weights on worker 0-0, policy_version 741610 (0.00088) [2022-07-10 13:24:08,720][25689] Fps is (10 sec: 5392.6, 60 sec: 5508.0, 300 sec: 5524.7). Total num frames: 759415808. Throughput: 0: 5711.7. Samples: 759419470. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:08,721][25689] Avg episode reward: [(0, '-2.410')] [2022-07-10 13:24:09,280][26022] Updated weights on worker 0-0, policy_version 741620 (0.00084) [2022-07-10 13:24:10,952][26022] Updated weights on worker 0-0, policy_version 741630 (0.00079) [2022-07-10 13:24:12,852][26022] Updated weights on worker 0-0, policy_version 741640 (0.00081) [2022-07-10 13:24:13,729][25689] Fps is (10 sec: 5514.6, 60 sec: 5526.5, 300 sec: 5524.8). Total num frames: 759444480. Throughput: 0: 5740.1. Samples: 759453068. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:13,731][25689] Avg episode reward: [(0, '-2.110')] [2022-07-10 13:24:14,535][26022] Updated weights on worker 0-0, policy_version 741650 (0.00086) [2022-07-10 13:24:16,634][26022] Updated weights on worker 0-0, policy_version 741660 (0.00087) [2022-07-10 13:24:17,903][26022] Updated weights on worker 0-0, policy_version 741670 (0.00090) [2022-07-10 13:24:18,770][25689] Fps is (10 sec: 5705.9, 60 sec: 5542.9, 300 sec: 5529.2). Total num frames: 759473152. Throughput: 0: 4884.7. Samples: 759469782. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:18,771][25689] Avg episode reward: [(0, '-3.224')] [2022-07-10 13:24:20,311][26022] Updated weights on worker 0-0, policy_version 741680 (0.00085) [2022-07-10 13:24:21,652][26022] Updated weights on worker 0-0, policy_version 741690 (0.00093) [2022-07-10 13:24:23,764][26022] Updated weights on worker 0-0, policy_version 741700 (0.00085) [2022-07-10 13:24:23,824][25689] Fps is (10 sec: 5579.0, 60 sec: 5529.6, 300 sec: 5525.0). Total num frames: 759500800. Throughput: 0: 5813.1. Samples: 759503668. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:23,824][25689] Avg episode reward: [(0, '-3.814')] [2022-07-10 13:24:25,460][26022] Updated weights on worker 0-0, policy_version 741710 (0.00088) [2022-07-10 13:24:27,377][26022] Updated weights on worker 0-0, policy_version 741720 (0.00087) [2022-07-10 13:24:28,905][25689] Fps is (10 sec: 5455.7, 60 sec: 5539.3, 300 sec: 5527.5). Total num frames: 759528448. Throughput: 0: 5819.2. Samples: 759536934. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:28,906][25689] Avg episode reward: [(0, '-3.342')] [2022-07-10 13:24:29,352][26022] Updated weights on worker 0-0, policy_version 741730 (0.00120) [2022-07-10 13:24:31,068][26022] Updated weights on worker 0-0, policy_version 741740 (0.00092) [2022-07-10 13:24:32,870][26022] Updated weights on worker 0-0, policy_version 741750 (0.00089) [2022-07-10 13:24:33,943][25689] Fps is (10 sec: 5464.8, 60 sec: 5503.5, 300 sec: 5523.7). Total num frames: 759556096. Throughput: 0: 4981.0. Samples: 759553752. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:33,943][25689] Avg episode reward: [(0, '-2.872')] [2022-07-10 13:24:34,674][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:24:34,690][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000741759_759561216.pth [2022-07-10 13:24:34,690][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000739814_757569536.pth [2022-07-10 13:24:34,900][26022] Updated weights on worker 0-0, policy_version 741760 (0.00082) [2022-07-10 13:24:36,475][26022] Updated weights on worker 0-0, policy_version 741770 (0.00085) [2022-07-10 13:24:38,507][26022] Updated weights on worker 0-0, policy_version 741780 (0.00093) [2022-07-10 13:24:38,975][25689] Fps is (10 sec: 5694.4, 60 sec: 5535.1, 300 sec: 5532.0). Total num frames: 759585792. Throughput: 0: 5819.2. Samples: 759587364. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:38,976][25689] Avg episode reward: [(0, '-2.634')] [2022-07-10 13:24:40,141][26022] Updated weights on worker 0-0, policy_version 741790 (0.00091) [2022-07-10 13:24:42,256][26022] Updated weights on worker 0-0, policy_version 741800 (0.00086) [2022-07-10 13:24:43,972][26022] Updated weights on worker 0-0, policy_version 741810 (0.00096) [2022-07-10 13:24:44,068][25689] Fps is (10 sec: 5663.6, 60 sec: 5555.2, 300 sec: 5527.2). Total num frames: 759613440. Throughput: 0: 5782.3. Samples: 759620724. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:44,068][25689] Avg episode reward: [(0, '-1.699')] [2022-07-10 13:24:45,768][26022] Updated weights on worker 0-0, policy_version 741820 (0.00089) [2022-07-10 13:24:47,658][26022] Updated weights on worker 0-0, policy_version 741830 (0.00089) [2022-07-10 13:24:49,081][25689] Fps is (10 sec: 5370.2, 60 sec: 5504.0, 300 sec: 5521.4). Total num frames: 759640064. Throughput: 0: 4970.7. Samples: 759637224. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:49,082][25689] Avg episode reward: [(0, '-2.565')] [2022-07-10 13:24:49,596][26022] Updated weights on worker 0-0, policy_version 741840 (0.00092) [2022-07-10 13:24:51,147][26022] Updated weights on worker 0-0, policy_version 741850 (0.00089) [2022-07-10 13:24:53,320][26022] Updated weights on worker 0-0, policy_version 741860 (0.00087) [2022-07-10 13:24:54,097][25689] Fps is (10 sec: 5615.6, 60 sec: 5522.8, 300 sec: 5532.0). Total num frames: 759669760. Throughput: 0: 5813.7. Samples: 759670922. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:54,097][25689] Avg episode reward: [(0, '-2.605')] [2022-07-10 13:24:54,855][26022] Updated weights on worker 0-0, policy_version 741870 (0.00089) [2022-07-10 13:24:56,788][26022] Updated weights on worker 0-0, policy_version 741880 (0.00091) [2022-07-10 13:24:58,647][26022] Updated weights on worker 0-0, policy_version 741890 (0.00086) [2022-07-10 13:24:59,110][25689] Fps is (10 sec: 5718.3, 60 sec: 5555.8, 300 sec: 5526.6). Total num frames: 759697408. Throughput: 0: 5818.1. Samples: 759704508. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:24:59,110][25689] Avg episode reward: [(0, '-3.448')] [2022-07-10 13:25:00,339][26022] Updated weights on worker 0-0, policy_version 741900 (0.00086) [2022-07-10 13:25:02,611][26022] Updated weights on worker 0-0, policy_version 741910 (0.00092) [2022-07-10 13:25:04,239][25689] Fps is (10 sec: 5351.3, 60 sec: 5531.3, 300 sec: 5531.3). Total num frames: 759724032. Throughput: 0: 4986.2. Samples: 759721300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:04,239][25689] Avg episode reward: [(0, '-4.978')] [2022-07-10 13:25:04,655][26022] Updated weights on worker 0-0, policy_version 741920 (0.00095) [2022-07-10 13:25:06,301][26022] Updated weights on worker 0-0, policy_version 741930 (0.00091) [2022-07-10 13:25:08,189][26022] Updated weights on worker 0-0, policy_version 741940 (0.00086) [2022-07-10 13:25:09,282][25689] Fps is (10 sec: 5335.4, 60 sec: 5545.9, 300 sec: 5528.4). Total num frames: 759751680. Throughput: 0: 5721.7. Samples: 759752804. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:09,282][25689] Avg episode reward: [(0, '-6.510')] [2022-07-10 13:25:10,069][26022] Updated weights on worker 0-0, policy_version 741950 (0.00093) [2022-07-10 13:25:11,914][26022] Updated weights on worker 0-0, policy_version 741960 (0.00086) [2022-07-10 13:25:13,747][26022] Updated weights on worker 0-0, policy_version 741970 (0.00082) [2022-07-10 13:25:14,288][25689] Fps is (10 sec: 5502.5, 60 sec: 5529.3, 300 sec: 5528.8). Total num frames: 759779328. Throughput: 0: 5708.8. Samples: 759786188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:14,290][25689] Avg episode reward: [(0, '-6.317')] [2022-07-10 13:25:15,548][26022] Updated weights on worker 0-0, policy_version 741980 (0.00086) [2022-07-10 13:25:17,411][26022] Updated weights on worker 0-0, policy_version 741990 (0.00091) [2022-07-10 13:25:19,222][26022] Updated weights on worker 0-0, policy_version 742000 (0.00088) [2022-07-10 13:25:19,304][25689] Fps is (10 sec: 5619.7, 60 sec: 5531.6, 300 sec: 5536.7). Total num frames: 759808000. Throughput: 0: 4871.7. Samples: 759802888. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:19,304][25689] Avg episode reward: [(0, '-4.713')] [2022-07-10 13:25:21,149][26022] Updated weights on worker 0-0, policy_version 742010 (0.00089) [2022-07-10 13:25:23,101][26022] Updated weights on worker 0-0, policy_version 742020 (0.00083) [2022-07-10 13:25:24,363][25689] Fps is (10 sec: 5590.0, 60 sec: 5531.1, 300 sec: 5528.8). Total num frames: 759835648. Throughput: 0: 5715.4. Samples: 759836318. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:24,363][25689] Avg episode reward: [(0, '-4.669')] [2022-07-10 13:25:24,773][26022] Updated weights on worker 0-0, policy_version 742030 (0.00090) [2022-07-10 13:25:26,829][26022] Updated weights on worker 0-0, policy_version 742040 (0.00087) [2022-07-10 13:25:28,436][26022] Updated weights on worker 0-0, policy_version 742050 (0.00095) [2022-07-10 13:25:29,367][25689] Fps is (10 sec: 5495.0, 60 sec: 5538.2, 300 sec: 5536.2). Total num frames: 759863296. Throughput: 0: 5802.1. Samples: 759869338. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:29,367][25689] Avg episode reward: [(0, '-4.400')] [2022-07-10 13:25:30,384][26022] Updated weights on worker 0-0, policy_version 742060 (0.00092) [2022-07-10 13:25:32,160][26022] Updated weights on worker 0-0, policy_version 742070 (0.00092) [2022-07-10 13:25:34,091][26022] Updated weights on worker 0-0, policy_version 742080 (0.00087) [2022-07-10 13:25:34,373][25689] Fps is (10 sec: 5728.7, 60 sec: 5574.9, 300 sec: 5536.7). Total num frames: 759892992. Throughput: 0: 4978.1. Samples: 759886174. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:34,373][25689] Avg episode reward: [(0, '-2.447')] [2022-07-10 13:25:36,067][26022] Updated weights on worker 0-0, policy_version 742090 (0.00085) [2022-07-10 13:25:37,823][26022] Updated weights on worker 0-0, policy_version 742100 (0.00084) [2022-07-10 13:25:39,375][25689] Fps is (10 sec: 5525.0, 60 sec: 5510.0, 300 sec: 5528.8). Total num frames: 759918592. Throughput: 0: 5825.5. Samples: 759919814. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:39,376][25689] Avg episode reward: [(0, '-3.570')] [2022-07-10 13:25:39,541][26022] Updated weights on worker 0-0, policy_version 742110 (0.00087) [2022-07-10 13:25:41,556][26022] Updated weights on worker 0-0, policy_version 742120 (0.00091) [2022-07-10 13:25:43,394][26022] Updated weights on worker 0-0, policy_version 742130 (0.00095) [2022-07-10 13:25:44,439][25689] Fps is (10 sec: 5290.0, 60 sec: 5512.6, 300 sec: 5527.8). Total num frames: 759946240. Throughput: 0: 5806.8. Samples: 759952894. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:44,439][25689] Avg episode reward: [(0, '-2.134')] [2022-07-10 13:25:45,111][26022] Updated weights on worker 0-0, policy_version 742140 (0.00095) [2022-07-10 13:25:47,134][26022] Updated weights on worker 0-0, policy_version 742150 (0.00086) [2022-07-10 13:25:48,832][26022] Updated weights on worker 0-0, policy_version 742160 (0.00084) [2022-07-10 13:25:49,440][25689] Fps is (10 sec: 5493.6, 60 sec: 5530.6, 300 sec: 5525.0). Total num frames: 759973888. Throughput: 0: 4980.4. Samples: 759969314. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:49,441][25689] Avg episode reward: [(0, '-1.919')] [2022-07-10 13:25:50,737][26022] Updated weights on worker 0-0, policy_version 742170 (0.00086) [2022-07-10 13:25:52,614][26022] Updated weights on worker 0-0, policy_version 742180 (0.00086) [2022-07-10 13:25:54,455][25689] Fps is (10 sec: 5520.8, 60 sec: 5496.8, 300 sec: 5528.3). Total num frames: 760001536. Throughput: 0: 5786.2. Samples: 760002370. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:54,455][25689] Avg episode reward: [(0, '-1.959')] [2022-07-10 13:25:54,468][26022] Updated weights on worker 0-0, policy_version 742190 (0.00103) [2022-07-10 13:25:56,289][26022] Updated weights on worker 0-0, policy_version 742200 (0.00097) [2022-07-10 13:25:58,247][26022] Updated weights on worker 0-0, policy_version 742210 (0.00083) [2022-07-10 13:25:59,464][25689] Fps is (10 sec: 5618.9, 60 sec: 5514.1, 300 sec: 5541.0). Total num frames: 760030208. Throughput: 0: 5766.8. Samples: 760035662. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:25:59,464][25689] Avg episode reward: [(0, '-3.396')] [2022-07-10 13:25:59,939][26022] Updated weights on worker 0-0, policy_version 742220 (0.00107) [2022-07-10 13:26:02,272][26022] Updated weights on worker 0-0, policy_version 742230 (0.00093) [2022-07-10 13:26:04,122][26022] Updated weights on worker 0-0, policy_version 742240 (0.00088) [2022-07-10 13:26:04,505][25689] Fps is (10 sec: 5298.0, 60 sec: 5488.2, 300 sec: 5523.2). Total num frames: 760054784. Throughput: 0: 4927.9. Samples: 760051780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:04,506][25689] Avg episode reward: [(0, '-3.121')] [2022-07-10 13:26:05,988][26022] Updated weights on worker 0-0, policy_version 742250 (0.00092) [2022-07-10 13:26:07,841][26022] Updated weights on worker 0-0, policy_version 742260 (0.00091) [2022-07-10 13:26:09,507][25689] Fps is (10 sec: 5200.1, 60 sec: 5491.9, 300 sec: 5526.9). Total num frames: 760082432. Throughput: 0: 5696.7. Samples: 760083626. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:09,507][25689] Avg episode reward: [(0, '-1.145')] [2022-07-10 13:26:09,673][26022] Updated weights on worker 0-0, policy_version 742270 (0.00092) [2022-07-10 13:26:11,391][26022] Updated weights on worker 0-0, policy_version 742280 (0.00791) [2022-07-10 13:26:13,372][26022] Updated weights on worker 0-0, policy_version 742290 (0.00084) [2022-07-10 13:26:14,522][25689] Fps is (10 sec: 5724.5, 60 sec: 5525.1, 300 sec: 5530.3). Total num frames: 760112128. Throughput: 0: 5698.3. Samples: 760116724. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:14,523][25689] Avg episode reward: [(0, '-1.639')] [2022-07-10 13:26:15,088][26022] Updated weights on worker 0-0, policy_version 742300 (0.00096) [2022-07-10 13:26:17,111][26022] Updated weights on worker 0-0, policy_version 742310 (0.00086) [2022-07-10 13:26:18,900][26022] Updated weights on worker 0-0, policy_version 742320 (0.00086) [2022-07-10 13:26:19,542][25689] Fps is (10 sec: 5510.4, 60 sec: 5473.8, 300 sec: 5524.4). Total num frames: 760137728. Throughput: 0: 4881.5. Samples: 760133672. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:19,542][25689] Avg episode reward: [(0, '-1.484')] [2022-07-10 13:26:20,741][26022] Updated weights on worker 0-0, policy_version 742330 (0.00095) [2022-07-10 13:26:22,758][26022] Updated weights on worker 0-0, policy_version 742340 (0.00108) [2022-07-10 13:26:24,407][26022] Updated weights on worker 0-0, policy_version 742350 (0.00094) [2022-07-10 13:26:24,594][25689] Fps is (10 sec: 5490.6, 60 sec: 5508.4, 300 sec: 5527.8). Total num frames: 760167424. Throughput: 0: 5730.6. Samples: 760166898. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:24,594][25689] Avg episode reward: [(0, '-1.226')] [2022-07-10 13:26:26,381][26022] Updated weights on worker 0-0, policy_version 742360 (0.00066) [2022-07-10 13:26:28,120][26022] Updated weights on worker 0-0, policy_version 742370 (0.00088) [2022-07-10 13:26:29,595][25689] Fps is (10 sec: 5601.9, 60 sec: 5491.6, 300 sec: 5528.7). Total num frames: 760194048. Throughput: 0: 5799.6. Samples: 760200132. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:29,596][25689] Avg episode reward: [(0, '-0.092')] [2022-07-10 13:26:30,018][26022] Updated weights on worker 0-0, policy_version 742380 (0.00089) [2022-07-10 13:26:31,897][26022] Updated weights on worker 0-0, policy_version 742390 (0.00081) [2022-07-10 13:26:33,819][26022] Updated weights on worker 0-0, policy_version 742400 (0.00092) [2022-07-10 13:26:34,609][25689] Fps is (10 sec: 5418.6, 60 sec: 5456.9, 300 sec: 5525.3). Total num frames: 760221696. Throughput: 0: 4975.4. Samples: 760216664. Policy #0 lag: (min: 0.0, avg: 9.8, max: 18.0) [2022-07-10 13:26:34,610][25689] Avg episode reward: [(0, '-0.506')] [2022-07-10 13:26:34,766][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:26:34,776][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000742405_760222720.pth [2022-07-10 13:26:34,779][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000740462_758233088.pth [2022-07-10 13:26:35,795][26022] Updated weights on worker 0-0, policy_version 742410 (0.00089) [2022-07-10 13:26:37,514][26022] Updated weights on worker 0-0, policy_version 742420 (0.00086) [2022-07-10 13:26:39,151][26022] Updated weights on worker 0-0, policy_version 742430 (0.00090) [2022-07-10 13:26:39,631][25689] Fps is (10 sec: 5612.2, 60 sec: 5506.1, 300 sec: 5530.7). Total num frames: 760250368. Throughput: 0: 5775.4. Samples: 760249694. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:26:39,631][25689] Avg episode reward: [(0, '-0.573')] [2022-07-10 13:26:41,308][26022] Updated weights on worker 0-0, policy_version 742440 (0.00091) [2022-07-10 13:26:42,976][26022] Updated weights on worker 0-0, policy_version 742450 (0.00088) [2022-07-10 13:26:44,664][25689] Fps is (10 sec: 5397.5, 60 sec: 5474.9, 300 sec: 5519.9). Total num frames: 760275968. Throughput: 0: 5757.1. Samples: 760282448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:26:44,665][25689] Avg episode reward: [(0, '-0.224')] [2022-07-10 13:26:45,042][26022] Updated weights on worker 0-0, policy_version 742460 (0.00092) [2022-07-10 13:26:46,819][26022] Updated weights on worker 0-0, policy_version 742470 (0.00089) [2022-07-10 13:26:48,626][26022] Updated weights on worker 0-0, policy_version 742480 (0.00084) [2022-07-10 13:26:49,671][25689] Fps is (10 sec: 5405.6, 60 sec: 5491.5, 300 sec: 5517.0). Total num frames: 760304640. Throughput: 0: 4934.3. Samples: 760299190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:26:49,671][25689] Avg episode reward: [(0, '-0.091')] [2022-07-10 13:26:50,432][26022] Updated weights on worker 0-0, policy_version 742490 (0.00091) [2022-07-10 13:26:52,414][26022] Updated weights on worker 0-0, policy_version 742500 (0.00085) [2022-07-10 13:26:53,922][26022] Updated weights on worker 0-0, policy_version 742510 (0.00081) [2022-07-10 13:26:54,687][25689] Fps is (10 sec: 5619.4, 60 sec: 5491.3, 300 sec: 5520.5). Total num frames: 760332288. Throughput: 0: 5768.5. Samples: 760332480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:26:54,687][25689] Avg episode reward: [(0, '0.143')] [2022-07-10 13:26:56,028][26022] Updated weights on worker 0-0, policy_version 742520 (0.00092) [2022-07-10 13:26:57,783][26022] Updated weights on worker 0-0, policy_version 742530 (0.00089) [2022-07-10 13:26:59,703][25689] Fps is (10 sec: 5613.9, 60 sec: 5490.6, 300 sec: 5529.7). Total num frames: 760360960. Throughput: 0: 5790.6. Samples: 760365924. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:26:59,703][25689] Avg episode reward: [(0, '-0.509')] [2022-07-10 13:26:59,717][26022] Updated weights on worker 0-0, policy_version 742540 (0.00099) [2022-07-10 13:27:01,382][26022] Updated weights on worker 0-0, policy_version 742550 (0.00089) [2022-07-10 13:27:03,688][26022] Updated weights on worker 0-0, policy_version 742560 (0.00086) [2022-07-10 13:27:04,775][25689] Fps is (10 sec: 5481.2, 60 sec: 5521.8, 300 sec: 5521.8). Total num frames: 760387584. Throughput: 0: 4913.8. Samples: 760381268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:04,775][25689] Avg episode reward: [(0, '-1.541')] [2022-07-10 13:27:05,385][26022] Updated weights on worker 0-0, policy_version 742570 (0.00087) [2022-07-10 13:27:07,429][26022] Updated weights on worker 0-0, policy_version 742580 (0.00091) [2022-07-10 13:27:09,149][26022] Updated weights on worker 0-0, policy_version 742590 (0.00087) [2022-07-10 13:27:09,786][25689] Fps is (10 sec: 5280.8, 60 sec: 5504.0, 300 sec: 5518.9). Total num frames: 760414208. Throughput: 0: 5728.4. Samples: 760414420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:09,787][25689] Avg episode reward: [(0, '-2.212')] [2022-07-10 13:27:10,975][26022] Updated weights on worker 0-0, policy_version 742600 (0.00084) [2022-07-10 13:27:12,946][26022] Updated weights on worker 0-0, policy_version 742610 (0.00090) [2022-07-10 13:27:14,692][26022] Updated weights on worker 0-0, policy_version 742620 (0.00087) [2022-07-10 13:27:14,791][25689] Fps is (10 sec: 5622.9, 60 sec: 5505.0, 300 sec: 5525.8). Total num frames: 760443904. Throughput: 0: 5749.5. Samples: 760448070. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:14,792][25689] Avg episode reward: [(0, '-2.382')] [2022-07-10 13:27:16,640][26022] Updated weights on worker 0-0, policy_version 742630 (0.00087) [2022-07-10 13:27:18,236][26022] Updated weights on worker 0-0, policy_version 742640 (0.00085) [2022-07-10 13:27:19,808][25689] Fps is (10 sec: 5721.8, 60 sec: 5539.1, 300 sec: 5519.9). Total num frames: 760471552. Throughput: 0: 4928.0. Samples: 760465002. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:19,809][25689] Avg episode reward: [(0, '-2.431')] [2022-07-10 13:27:20,044][26022] Updated weights on worker 0-0, policy_version 742650 (0.00091) [2022-07-10 13:27:21,936][26022] Updated weights on worker 0-0, policy_version 742660 (0.01128) [2022-07-10 13:27:23,852][26022] Updated weights on worker 0-0, policy_version 742670 (0.00085) [2022-07-10 13:27:24,856][25689] Fps is (10 sec: 5494.2, 60 sec: 5505.5, 300 sec: 5522.5). Total num frames: 760499200. Throughput: 0: 5833.5. Samples: 760498408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:24,856][25689] Avg episode reward: [(0, '-3.122')] [2022-07-10 13:27:25,662][26022] Updated weights on worker 0-0, policy_version 742680 (0.00091) [2022-07-10 13:27:27,418][26022] Updated weights on worker 0-0, policy_version 742690 (0.00092) [2022-07-10 13:27:29,583][26022] Updated weights on worker 0-0, policy_version 742700 (0.00094) [2022-07-10 13:27:29,867][25689] Fps is (10 sec: 5497.4, 60 sec: 5521.7, 300 sec: 5523.8). Total num frames: 760526848. Throughput: 0: 5850.6. Samples: 760531904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:29,867][25689] Avg episode reward: [(0, '-3.482')] [2022-07-10 13:27:31,243][26022] Updated weights on worker 0-0, policy_version 742710 (0.00085) [2022-07-10 13:27:33,040][26022] Updated weights on worker 0-0, policy_version 742720 (0.00088) [2022-07-10 13:27:34,627][26022] Updated weights on worker 0-0, policy_version 742730 (0.00086) [2022-07-10 13:27:34,887][25689] Fps is (10 sec: 5716.7, 60 sec: 5555.1, 300 sec: 5520.4). Total num frames: 760556544. Throughput: 0: 5006.4. Samples: 760548676. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:34,887][25689] Avg episode reward: [(0, '-2.077')] [2022-07-10 13:27:36,608][26022] Updated weights on worker 0-0, policy_version 742740 (0.00105) [2022-07-10 13:27:38,529][26022] Updated weights on worker 0-0, policy_version 742750 (0.00087) [2022-07-10 13:27:39,892][25689] Fps is (10 sec: 5720.0, 60 sec: 5539.6, 300 sec: 5524.7). Total num frames: 760584192. Throughput: 0: 5864.7. Samples: 760582788. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:39,892][25689] Avg episode reward: [(0, '-1.817')] [2022-07-10 13:27:40,317][26022] Updated weights on worker 0-0, policy_version 742760 (0.00085) [2022-07-10 13:27:42,149][26022] Updated weights on worker 0-0, policy_version 742770 (0.00084) [2022-07-10 13:27:43,664][26022] Updated weights on worker 0-0, policy_version 742780 (0.00082) [2022-07-10 13:27:44,932][25689] Fps is (10 sec: 5402.8, 60 sec: 5556.0, 300 sec: 5520.6). Total num frames: 760610816. Throughput: 0: 5882.0. Samples: 760616496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:44,932][25689] Avg episode reward: [(0, '-1.691')] [2022-07-10 13:27:45,761][26022] Updated weights on worker 0-0, policy_version 742790 (0.00095) [2022-07-10 13:27:47,786][26022] Updated weights on worker 0-0, policy_version 742800 (0.00093) [2022-07-10 13:27:49,225][26022] Updated weights on worker 0-0, policy_version 742810 (0.00094) [2022-07-10 13:27:49,939][25689] Fps is (10 sec: 5605.8, 60 sec: 5573.0, 300 sec: 5524.3). Total num frames: 760640512. Throughput: 0: 5049.0. Samples: 760633248. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:49,940][25689] Avg episode reward: [(0, '-1.721')] [2022-07-10 13:27:51,446][26022] Updated weights on worker 0-0, policy_version 742820 (0.00089) [2022-07-10 13:27:52,953][26022] Updated weights on worker 0-0, policy_version 742830 (0.00086) [2022-07-10 13:27:54,967][25689] Fps is (10 sec: 5510.1, 60 sec: 5537.8, 300 sec: 5517.4). Total num frames: 760666112. Throughput: 0: 5866.1. Samples: 760666470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:54,969][25689] Avg episode reward: [(0, '-1.350')] [2022-07-10 13:27:55,053][26022] Updated weights on worker 0-0, policy_version 742840 (0.00091) [2022-07-10 13:27:56,656][26022] Updated weights on worker 0-0, policy_version 742850 (0.00091) [2022-07-10 13:27:58,653][26022] Updated weights on worker 0-0, policy_version 742860 (0.00094) [2022-07-10 13:27:59,994][25689] Fps is (10 sec: 5499.1, 60 sec: 5553.9, 300 sec: 5532.0). Total num frames: 760695808. Throughput: 0: 5824.1. Samples: 760699864. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:27:59,996][25689] Avg episode reward: [(0, '-1.680')] [2022-07-10 13:28:00,571][26022] Updated weights on worker 0-0, policy_version 742870 (0.00094) [2022-07-10 13:28:02,854][26022] Updated weights on worker 0-0, policy_version 742880 (0.00087) [2022-07-10 13:28:04,643][26022] Updated weights on worker 0-0, policy_version 742890 (0.00089) [2022-07-10 13:28:05,034][25689] Fps is (10 sec: 5594.4, 60 sec: 5556.8, 300 sec: 5524.9). Total num frames: 760722432. Throughput: 0: 5705.6. Samples: 760731194. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:05,035][25689] Avg episode reward: [(0, '-2.555')] [2022-07-10 13:28:06,496][26022] Updated weights on worker 0-0, policy_version 742900 (0.00093) [2022-07-10 13:28:08,287][26022] Updated weights on worker 0-0, policy_version 742910 (0.00059) [2022-07-10 13:28:10,040][25689] Fps is (10 sec: 5300.6, 60 sec: 5557.3, 300 sec: 5521.8). Total num frames: 760749056. Throughput: 0: 5705.0. Samples: 760747924. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:10,040][25689] Avg episode reward: [(0, '-3.225')] [2022-07-10 13:28:10,171][26022] Updated weights on worker 0-0, policy_version 742920 (0.00092) [2022-07-10 13:28:12,046][26022] Updated weights on worker 0-0, policy_version 742930 (0.00084) [2022-07-10 13:28:13,882][26022] Updated weights on worker 0-0, policy_version 742940 (0.00085) [2022-07-10 13:28:15,063][25689] Fps is (10 sec: 5411.5, 60 sec: 5521.6, 300 sec: 5522.0). Total num frames: 760776704. Throughput: 0: 5694.0. Samples: 760780898. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:15,064][25689] Avg episode reward: [(0, '-3.976')] [2022-07-10 13:28:15,687][26022] Updated weights on worker 0-0, policy_version 742950 (0.00100) [2022-07-10 13:28:17,576][26022] Updated weights on worker 0-0, policy_version 742960 (0.00087) [2022-07-10 13:28:19,320][26022] Updated weights on worker 0-0, policy_version 742970 (0.00047) [2022-07-10 13:28:20,070][25689] Fps is (10 sec: 5614.9, 60 sec: 5539.6, 300 sec: 5523.6). Total num frames: 760805376. Throughput: 0: 5715.0. Samples: 760814598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:20,071][25689] Avg episode reward: [(0, '-4.435')] [2022-07-10 13:28:21,268][26022] Updated weights on worker 0-0, policy_version 742980 (0.00095) [2022-07-10 13:28:23,052][26022] Updated weights on worker 0-0, policy_version 742990 (0.00101) [2022-07-10 13:28:25,071][26022] Updated weights on worker 0-0, policy_version 743000 (0.00097) [2022-07-10 13:28:25,130][25689] Fps is (10 sec: 5492.8, 60 sec: 5521.4, 300 sec: 5522.5). Total num frames: 760832000. Throughput: 0: 4977.4. Samples: 760831220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:25,131][25689] Avg episode reward: [(0, '-3.574')] [2022-07-10 13:28:26,658][26022] Updated weights on worker 0-0, policy_version 743010 (0.00084) [2022-07-10 13:28:28,645][26022] Updated weights on worker 0-0, policy_version 743020 (0.00086) [2022-07-10 13:28:30,132][25689] Fps is (10 sec: 5495.3, 60 sec: 5539.2, 300 sec: 5519.4). Total num frames: 760860672. Throughput: 0: 5793.8. Samples: 760864338. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:30,133][25689] Avg episode reward: [(0, '-2.410')] [2022-07-10 13:28:30,372][26022] Updated weights on worker 0-0, policy_version 743030 (0.00093) [2022-07-10 13:28:32,434][26022] Updated weights on worker 0-0, policy_version 743040 (0.00088) [2022-07-10 13:28:34,095][26022] Updated weights on worker 0-0, policy_version 743050 (0.00086) [2022-07-10 13:28:35,071][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:28:35,082][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000743055_760888320.pth [2022-07-10 13:28:35,082][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000741110_758896640.pth [2022-07-10 13:28:35,238][25689] Fps is (10 sec: 5572.2, 60 sec: 5497.4, 300 sec: 5517.5). Total num frames: 760888320. Throughput: 0: 5793.9. Samples: 760897786. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:35,238][25689] Avg episode reward: [(0, '-1.624')] [2022-07-10 13:28:36,013][26022] Updated weights on worker 0-0, policy_version 743060 (0.00087) [2022-07-10 13:28:37,551][26022] Updated weights on worker 0-0, policy_version 743070 (0.00085) [2022-07-10 13:28:39,891][26022] Updated weights on worker 0-0, policy_version 743080 (0.00093) [2022-07-10 13:28:40,287][25689] Fps is (10 sec: 5546.3, 60 sec: 5510.4, 300 sec: 5525.9). Total num frames: 760916992. Throughput: 0: 4951.5. Samples: 760914702. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:40,288][25689] Avg episode reward: [(0, '-1.080')] [2022-07-10 13:28:41,305][26022] Updated weights on worker 0-0, policy_version 743090 (0.00091) [2022-07-10 13:28:43,453][26022] Updated weights on worker 0-0, policy_version 743100 (0.00091) [2022-07-10 13:28:44,987][26022] Updated weights on worker 0-0, policy_version 743110 (0.00086) [2022-07-10 13:28:45,351][25689] Fps is (10 sec: 5670.3, 60 sec: 5542.1, 300 sec: 5521.4). Total num frames: 760945664. Throughput: 0: 5792.2. Samples: 760948340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:45,351][25689] Avg episode reward: [(0, '-1.805')] [2022-07-10 13:28:47,087][26022] Updated weights on worker 0-0, policy_version 743120 (0.00103) [2022-07-10 13:28:48,830][26022] Updated weights on worker 0-0, policy_version 743130 (0.00088) [2022-07-10 13:28:50,379][25689] Fps is (10 sec: 5580.9, 60 sec: 5506.3, 300 sec: 5518.1). Total num frames: 760973312. Throughput: 0: 5806.1. Samples: 760981890. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:50,379][25689] Avg episode reward: [(0, '-2.905')] [2022-07-10 13:28:50,545][26022] Updated weights on worker 0-0, policy_version 743140 (0.00096) [2022-07-10 13:28:52,528][26022] Updated weights on worker 0-0, policy_version 743150 (0.00097) [2022-07-10 13:28:54,214][26022] Updated weights on worker 0-0, policy_version 743160 (0.00086) [2022-07-10 13:28:55,395][25689] Fps is (10 sec: 5505.4, 60 sec: 5541.3, 300 sec: 5524.8). Total num frames: 761000960. Throughput: 0: 4977.5. Samples: 760998118. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:28:55,396][25689] Avg episode reward: [(0, '-2.898')] [2022-07-10 13:28:56,234][26022] Updated weights on worker 0-0, policy_version 743170 (0.00839) [2022-07-10 13:28:58,152][26022] Updated weights on worker 0-0, policy_version 743180 (0.00093) [2022-07-10 13:28:59,951][26022] Updated weights on worker 0-0, policy_version 743190 (0.00087) [2022-07-10 13:29:00,435][25689] Fps is (10 sec: 5600.6, 60 sec: 5523.2, 300 sec: 5528.4). Total num frames: 761029632. Throughput: 0: 5804.8. Samples: 761031654. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:00,436][25689] Avg episode reward: [(0, '-2.770')] [2022-07-10 13:29:02,124][26022] Updated weights on worker 0-0, policy_version 743200 (0.00085) [2022-07-10 13:29:03,948][26022] Updated weights on worker 0-0, policy_version 743210 (0.00050) [2022-07-10 13:29:05,566][25689] Fps is (10 sec: 5436.6, 60 sec: 5514.9, 300 sec: 5526.3). Total num frames: 761056256. Throughput: 0: 5672.7. Samples: 761063014. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:05,566][25689] Avg episode reward: [(0, '-1.555')] [2022-07-10 13:29:05,801][26022] Updated weights on worker 0-0, policy_version 743220 (0.00088) [2022-07-10 13:29:07,584][26022] Updated weights on worker 0-0, policy_version 743230 (0.00089) [2022-07-10 13:29:09,273][26022] Updated weights on worker 0-0, policy_version 743240 (0.00091) [2022-07-10 13:29:10,656][25689] Fps is (10 sec: 5209.8, 60 sec: 5507.2, 300 sec: 5517.9). Total num frames: 761082880. Throughput: 0: 4824.6. Samples: 761079718. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:10,656][25689] Avg episode reward: [(0, '-1.225')] [2022-07-10 13:29:11,371][26022] Updated weights on worker 0-0, policy_version 743250 (0.00087) [2022-07-10 13:29:13,005][26022] Updated weights on worker 0-0, policy_version 743260 (0.00093) [2022-07-10 13:29:14,847][26022] Updated weights on worker 0-0, policy_version 743270 (0.00084) [2022-07-10 13:29:15,721][25689] Fps is (10 sec: 5546.0, 60 sec: 5537.2, 300 sec: 5520.9). Total num frames: 761112576. Throughput: 0: 5669.9. Samples: 761113366. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:15,723][25689] Avg episode reward: [(0, '-2.754')] [2022-07-10 13:29:16,713][26022] Updated weights on worker 0-0, policy_version 743280 (0.00097) [2022-07-10 13:29:18,688][26022] Updated weights on worker 0-0, policy_version 743290 (0.00091) [2022-07-10 13:29:20,435][26022] Updated weights on worker 0-0, policy_version 743300 (0.00080) [2022-07-10 13:29:20,724][25689] Fps is (10 sec: 5797.2, 60 sec: 5537.5, 300 sec: 5525.3). Total num frames: 761141248. Throughput: 0: 5687.1. Samples: 761147040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:20,724][25689] Avg episode reward: [(0, '-2.155')] [2022-07-10 13:29:22,271][26022] Updated weights on worker 0-0, policy_version 743310 (0.00099) [2022-07-10 13:29:24,074][26022] Updated weights on worker 0-0, policy_version 743320 (0.00096) [2022-07-10 13:29:25,789][25689] Fps is (10 sec: 5593.8, 60 sec: 5553.9, 300 sec: 5525.6). Total num frames: 761168896. Throughput: 0: 4963.1. Samples: 761163386. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:25,790][25689] Avg episode reward: [(0, '-2.419')] [2022-07-10 13:29:26,133][26022] Updated weights on worker 0-0, policy_version 743330 (0.00090) [2022-07-10 13:29:27,764][26022] Updated weights on worker 0-0, policy_version 743340 (0.00057) [2022-07-10 13:29:29,652][26022] Updated weights on worker 0-0, policy_version 743350 (0.00089) [2022-07-10 13:29:30,802][25689] Fps is (10 sec: 5588.2, 60 sec: 5552.9, 300 sec: 5529.5). Total num frames: 761197568. Throughput: 0: 5819.0. Samples: 761196952. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:30,803][25689] Avg episode reward: [(0, '-3.308')] [2022-07-10 13:29:31,542][26022] Updated weights on worker 0-0, policy_version 743360 (0.00086) [2022-07-10 13:29:33,328][26022] Updated weights on worker 0-0, policy_version 743370 (0.00091) [2022-07-10 13:29:35,238][26022] Updated weights on worker 0-0, policy_version 743380 (0.00088) [2022-07-10 13:29:35,831][25689] Fps is (10 sec: 5506.4, 60 sec: 5543.0, 300 sec: 5519.2). Total num frames: 761224192. Throughput: 0: 5832.6. Samples: 761230662. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:35,832][25689] Avg episode reward: [(0, '-3.769')] [2022-07-10 13:29:36,703][26022] Updated weights on worker 0-0, policy_version 743390 (0.00090) [2022-07-10 13:29:38,947][26022] Updated weights on worker 0-0, policy_version 743400 (0.00049) [2022-07-10 13:29:40,540][26022] Updated weights on worker 0-0, policy_version 743410 (0.00088) [2022-07-10 13:29:40,856][25689] Fps is (10 sec: 5499.9, 60 sec: 5545.3, 300 sec: 5523.9). Total num frames: 761252864. Throughput: 0: 4983.7. Samples: 761247374. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:40,857][25689] Avg episode reward: [(0, '-3.365')] [2022-07-10 13:29:42,419][26022] Updated weights on worker 0-0, policy_version 743420 (0.00089) [2022-07-10 13:29:44,415][26022] Updated weights on worker 0-0, policy_version 743430 (0.00084) [2022-07-10 13:29:45,934][25689] Fps is (10 sec: 5676.0, 60 sec: 5543.9, 300 sec: 5529.6). Total num frames: 761281536. Throughput: 0: 5820.8. Samples: 761280646. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:45,935][25689] Avg episode reward: [(0, '-2.918')] [2022-07-10 13:29:46,232][26022] Updated weights on worker 0-0, policy_version 743440 (0.00087) [2022-07-10 13:29:48,010][26022] Updated weights on worker 0-0, policy_version 743450 (0.00095) [2022-07-10 13:29:49,820][26022] Updated weights on worker 0-0, policy_version 743460 (0.00104) [2022-07-10 13:29:50,947][25689] Fps is (10 sec: 5480.1, 60 sec: 5528.5, 300 sec: 5519.3). Total num frames: 761308160. Throughput: 0: 5812.3. Samples: 761314038. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:50,947][25689] Avg episode reward: [(0, '-4.177')] [2022-07-10 13:29:51,875][26022] Updated weights on worker 0-0, policy_version 743470 (0.00083) [2022-07-10 13:29:53,569][26022] Updated weights on worker 0-0, policy_version 743480 (0.00090) [2022-07-10 13:29:55,325][26022] Updated weights on worker 0-0, policy_version 743490 (0.00094) [2022-07-10 13:29:55,975][25689] Fps is (10 sec: 5405.5, 60 sec: 5527.4, 300 sec: 5519.0). Total num frames: 761335808. Throughput: 0: 4962.3. Samples: 761330618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 13:29:55,975][25689] Avg episode reward: [(0, '-3.649')] [2022-07-10 13:29:57,102][26022] Updated weights on worker 0-0, policy_version 743500 (0.00086) [2022-07-10 13:29:59,209][26022] Updated weights on worker 0-0, policy_version 743510 (0.00087) [2022-07-10 13:30:00,968][26022] Updated weights on worker 0-0, policy_version 743520 (0.00091) [2022-07-10 13:30:00,994][25689] Fps is (10 sec: 5605.4, 60 sec: 5529.2, 300 sec: 5527.9). Total num frames: 761364480. Throughput: 0: 5806.9. Samples: 761364312. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:00,995][25689] Avg episode reward: [(0, '-3.201')] [2022-07-10 13:30:03,117][26022] Updated weights on worker 0-0, policy_version 743530 (0.00089) [2022-07-10 13:30:04,930][26022] Updated weights on worker 0-0, policy_version 743540 (0.00082) [2022-07-10 13:30:06,058][25689] Fps is (10 sec: 5484.1, 60 sec: 5535.4, 300 sec: 5524.1). Total num frames: 761391104. Throughput: 0: 5730.1. Samples: 761395954. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:06,058][25689] Avg episode reward: [(0, '-4.190')] [2022-07-10 13:30:06,713][26022] Updated weights on worker 0-0, policy_version 743550 (0.00083) [2022-07-10 13:30:08,607][26022] Updated weights on worker 0-0, policy_version 743560 (0.00091) [2022-07-10 13:30:10,488][26022] Updated weights on worker 0-0, policy_version 743570 (0.00095) [2022-07-10 13:30:11,084][25689] Fps is (10 sec: 5378.9, 60 sec: 5558.2, 300 sec: 5523.7). Total num frames: 761418752. Throughput: 0: 4902.7. Samples: 761412766. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:11,085][25689] Avg episode reward: [(0, '-3.055')] [2022-07-10 13:30:12,223][26022] Updated weights on worker 0-0, policy_version 743580 (0.00081) [2022-07-10 13:30:14,073][26022] Updated weights on worker 0-0, policy_version 743590 (0.00081) [2022-07-10 13:30:16,092][25689] Fps is (10 sec: 5408.6, 60 sec: 5512.6, 300 sec: 5517.0). Total num frames: 761445376. Throughput: 0: 5764.1. Samples: 761446576. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:16,092][25689] Avg episode reward: [(0, '-0.899')] [2022-07-10 13:30:16,120][26022] Updated weights on worker 0-0, policy_version 743600 (0.00093) [2022-07-10 13:30:17,683][26022] Updated weights on worker 0-0, policy_version 743610 (0.00087) [2022-07-10 13:30:19,803][26022] Updated weights on worker 0-0, policy_version 743620 (0.00088) [2022-07-10 13:30:21,140][25689] Fps is (10 sec: 5600.6, 60 sec: 5525.4, 300 sec: 5524.1). Total num frames: 761475072. Throughput: 0: 5750.1. Samples: 761480152. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:21,141][25689] Avg episode reward: [(0, '-1.616')] [2022-07-10 13:30:21,275][26022] Updated weights on worker 0-0, policy_version 743630 (0.00087) [2022-07-10 13:30:23,441][26022] Updated weights on worker 0-0, policy_version 743640 (0.00089) [2022-07-10 13:30:25,246][26022] Updated weights on worker 0-0, policy_version 743650 (0.00085) [2022-07-10 13:30:26,248][25689] Fps is (10 sec: 5847.9, 60 sec: 5555.4, 300 sec: 5529.1). Total num frames: 761504768. Throughput: 0: 5007.6. Samples: 761497060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:26,248][25689] Avg episode reward: [(0, '-2.878')] [2022-07-10 13:30:27,099][26022] Updated weights on worker 0-0, policy_version 743660 (0.00093) [2022-07-10 13:30:28,816][26022] Updated weights on worker 0-0, policy_version 743670 (0.00088) [2022-07-10 13:30:30,778][26022] Updated weights on worker 0-0, policy_version 743680 (0.00090) [2022-07-10 13:30:31,264][25689] Fps is (10 sec: 5664.4, 60 sec: 5538.2, 300 sec: 5522.0). Total num frames: 761532416. Throughput: 0: 5819.8. Samples: 761530208. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:31,265][25689] Avg episode reward: [(0, '-3.053')] [2022-07-10 13:30:32,335][26022] Updated weights on worker 0-0, policy_version 743690 (0.00092) [2022-07-10 13:30:34,367][26022] Updated weights on worker 0-0, policy_version 743700 (0.00088) [2022-07-10 13:30:35,125][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:30:35,138][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000743705_761553920.pth [2022-07-10 13:30:35,138][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000741759_759561216.pth [2022-07-10 13:30:36,040][26022] Updated weights on worker 0-0, policy_version 743710 (0.00090) [2022-07-10 13:30:36,272][25689] Fps is (10 sec: 5413.9, 60 sec: 5540.1, 300 sec: 5525.3). Total num frames: 761559040. Throughput: 0: 5799.0. Samples: 761563604. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:36,273][25689] Avg episode reward: [(0, '-2.112')] [2022-07-10 13:30:38,060][26022] Updated weights on worker 0-0, policy_version 743720 (0.00093) [2022-07-10 13:30:39,804][26022] Updated weights on worker 0-0, policy_version 743730 (0.00109) [2022-07-10 13:30:41,316][25689] Fps is (10 sec: 5398.8, 60 sec: 5521.4, 300 sec: 5525.7). Total num frames: 761586688. Throughput: 0: 5799.0. Samples: 761597154. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:41,317][25689] Avg episode reward: [(0, '-3.185')] [2022-07-10 13:30:41,541][26022] Updated weights on worker 0-0, policy_version 743740 (0.00090) [2022-07-10 13:30:43,419][26022] Updated weights on worker 0-0, policy_version 743750 (0.00083) [2022-07-10 13:30:45,314][26022] Updated weights on worker 0-0, policy_version 743760 (0.00087) [2022-07-10 13:30:46,457][25689] Fps is (10 sec: 5731.3, 60 sec: 5549.6, 300 sec: 5533.4). Total num frames: 761617408. Throughput: 0: 5773.2. Samples: 761613728. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:46,457][25689] Avg episode reward: [(0, '-2.551')] [2022-07-10 13:30:47,232][26022] Updated weights on worker 0-0, policy_version 743770 (0.00104) [2022-07-10 13:30:49,131][26022] Updated weights on worker 0-0, policy_version 743780 (0.00431) [2022-07-10 13:30:50,760][26022] Updated weights on worker 0-0, policy_version 743790 (0.00095) [2022-07-10 13:30:51,481][25689] Fps is (10 sec: 5641.7, 60 sec: 5548.5, 300 sec: 5529.8). Total num frames: 761644032. Throughput: 0: 5791.9. Samples: 761647304. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:51,481][25689] Avg episode reward: [(0, '-2.096')] [2022-07-10 13:30:53,023][26022] Updated weights on worker 0-0, policy_version 743800 (0.00083) [2022-07-10 13:30:54,492][26022] Updated weights on worker 0-0, policy_version 743810 (0.00085) [2022-07-10 13:30:56,499][25689] Fps is (10 sec: 5302.1, 60 sec: 5532.4, 300 sec: 5522.7). Total num frames: 761670656. Throughput: 0: 5783.2. Samples: 761680580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:30:56,500][25689] Avg episode reward: [(0, '-1.952')] [2022-07-10 13:30:56,645][26022] Updated weights on worker 0-0, policy_version 743820 (0.00093) [2022-07-10 13:30:58,113][26022] Updated weights on worker 0-0, policy_version 743830 (0.00088) [2022-07-10 13:31:00,256][26022] Updated weights on worker 0-0, policy_version 743840 (0.00087) [2022-07-10 13:31:01,508][25689] Fps is (10 sec: 5718.8, 60 sec: 5567.3, 300 sec: 5544.0). Total num frames: 761701376. Throughput: 0: 4950.7. Samples: 761697116. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:01,510][25689] Avg episode reward: [(0, '-2.535')] [2022-07-10 13:31:02,288][26022] Updated weights on worker 0-0, policy_version 743850 (0.00107) [2022-07-10 13:31:04,334][26022] Updated weights on worker 0-0, policy_version 743860 (0.00088) [2022-07-10 13:31:05,983][26022] Updated weights on worker 0-0, policy_version 743870 (0.00092) [2022-07-10 13:31:06,565][25689] Fps is (10 sec: 5493.2, 60 sec: 5534.0, 300 sec: 5532.6). Total num frames: 761725952. Throughput: 0: 5703.1. Samples: 761728412. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:06,567][25689] Avg episode reward: [(0, '-3.100')] [2022-07-10 13:31:07,889][26022] Updated weights on worker 0-0, policy_version 743880 (0.00085) [2022-07-10 13:31:09,523][26022] Updated weights on worker 0-0, policy_version 743890 (0.00090) [2022-07-10 13:31:11,611][25689] Fps is (10 sec: 5067.8, 60 sec: 5515.3, 300 sec: 5521.8). Total num frames: 761752576. Throughput: 0: 5681.1. Samples: 761761666. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:11,612][25689] Avg episode reward: [(0, '-2.238')] [2022-07-10 13:31:11,694][26022] Updated weights on worker 0-0, policy_version 743900 (0.00088) [2022-07-10 13:31:13,628][26022] Updated weights on worker 0-0, policy_version 743910 (0.00086) [2022-07-10 13:31:15,316][26022] Updated weights on worker 0-0, policy_version 743920 (0.00094) [2022-07-10 13:31:16,640][25689] Fps is (10 sec: 5488.9, 60 sec: 5547.2, 300 sec: 5531.9). Total num frames: 761781248. Throughput: 0: 4845.5. Samples: 761778170. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:16,640][25689] Avg episode reward: [(0, '-2.359')] [2022-07-10 13:31:17,095][26022] Updated weights on worker 0-0, policy_version 743930 (0.00506) [2022-07-10 13:31:19,091][26022] Updated weights on worker 0-0, policy_version 743940 (0.00092) [2022-07-10 13:31:20,678][26022] Updated weights on worker 0-0, policy_version 743950 (0.00105) [2022-07-10 13:31:21,660][25689] Fps is (10 sec: 5604.5, 60 sec: 5515.9, 300 sec: 5525.6). Total num frames: 761808896. Throughput: 0: 5684.2. Samples: 761811664. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:21,660][25689] Avg episode reward: [(0, '-3.380')] [2022-07-10 13:31:22,827][26022] Updated weights on worker 0-0, policy_version 743960 (0.00093) [2022-07-10 13:31:24,471][26022] Updated weights on worker 0-0, policy_version 743970 (0.00080) [2022-07-10 13:31:26,469][26022] Updated weights on worker 0-0, policy_version 743980 (0.00086) [2022-07-10 13:31:26,771][25689] Fps is (10 sec: 5559.0, 60 sec: 5498.7, 300 sec: 5530.5). Total num frames: 761837568. Throughput: 0: 5764.8. Samples: 761844892. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:26,771][25689] Avg episode reward: [(0, '-2.639')] [2022-07-10 13:31:28,467][26022] Updated weights on worker 0-0, policy_version 743990 (0.00094) [2022-07-10 13:31:30,158][26022] Updated weights on worker 0-0, policy_version 744000 (0.00082) [2022-07-10 13:31:31,803][25689] Fps is (10 sec: 5451.5, 60 sec: 5480.3, 300 sec: 5526.7). Total num frames: 761864192. Throughput: 0: 4939.4. Samples: 761861404. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:31,805][25689] Avg episode reward: [(0, '-3.063')] [2022-07-10 13:31:32,027][26022] Updated weights on worker 0-0, policy_version 744010 (0.00088) [2022-07-10 13:31:33,630][26022] Updated weights on worker 0-0, policy_version 744020 (0.00092) [2022-07-10 13:31:35,760][26022] Updated weights on worker 0-0, policy_version 744030 (0.00084) [2022-07-10 13:31:36,813][25689] Fps is (10 sec: 5608.2, 60 sec: 5530.9, 300 sec: 5530.3). Total num frames: 761893888. Throughput: 0: 5791.7. Samples: 761895010. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:36,816][25689] Avg episode reward: [(0, '-3.305')] [2022-07-10 13:31:37,295][26022] Updated weights on worker 0-0, policy_version 744040 (0.00082) [2022-07-10 13:31:39,436][26022] Updated weights on worker 0-0, policy_version 744050 (0.00087) [2022-07-10 13:31:41,074][26022] Updated weights on worker 0-0, policy_version 744060 (0.00091) [2022-07-10 13:31:41,838][25689] Fps is (10 sec: 5612.7, 60 sec: 5515.8, 300 sec: 5534.0). Total num frames: 761920512. Throughput: 0: 5787.6. Samples: 761928444. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:41,838][25689] Avg episode reward: [(0, '-3.495')] [2022-07-10 13:31:43,085][26022] Updated weights on worker 0-0, policy_version 744070 (0.00095) [2022-07-10 13:31:44,843][26022] Updated weights on worker 0-0, policy_version 744080 (0.00085) [2022-07-10 13:31:46,545][26022] Updated weights on worker 0-0, policy_version 744090 (0.00084) [2022-07-10 13:31:46,972][25689] Fps is (10 sec: 5644.9, 60 sec: 5516.3, 300 sec: 5538.5). Total num frames: 761951232. Throughput: 0: 4975.9. Samples: 761945410. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:46,972][25689] Avg episode reward: [(0, '-2.341')] [2022-07-10 13:31:48,467][26022] Updated weights on worker 0-0, policy_version 744100 (0.00083) [2022-07-10 13:31:50,120][26022] Updated weights on worker 0-0, policy_version 744110 (0.00082) [2022-07-10 13:31:52,020][25689] Fps is (10 sec: 5631.5, 60 sec: 5514.1, 300 sec: 5534.4). Total num frames: 761977856. Throughput: 0: 5818.8. Samples: 761979044. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:52,022][25689] Avg episode reward: [(0, '-3.004')] [2022-07-10 13:31:52,143][26022] Updated weights on worker 0-0, policy_version 744120 (0.00084) [2022-07-10 13:31:54,103][26022] Updated weights on worker 0-0, policy_version 744130 (0.00087) [2022-07-10 13:31:55,671][26022] Updated weights on worker 0-0, policy_version 744140 (0.00086) [2022-07-10 13:31:57,060][25689] Fps is (10 sec: 5481.4, 60 sec: 5546.0, 300 sec: 5534.0). Total num frames: 762006528. Throughput: 0: 5812.9. Samples: 762012700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:31:57,060][25689] Avg episode reward: [(0, '-3.971')] [2022-07-10 13:31:57,565][26022] Updated weights on worker 0-0, policy_version 744150 (0.00095) [2022-07-10 13:31:59,390][26022] Updated weights on worker 0-0, policy_version 744160 (0.00089) [2022-07-10 13:32:01,303][26022] Updated weights on worker 0-0, policy_version 744170 (0.00083) [2022-07-10 13:32:02,083][25689] Fps is (10 sec: 5393.1, 60 sec: 5460.1, 300 sec: 5531.4). Total num frames: 762032128. Throughput: 0: 4985.0. Samples: 762029370. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:02,084][25689] Avg episode reward: [(0, '-3.192')] [2022-07-10 13:32:03,622][26022] Updated weights on worker 0-0, policy_version 744180 (0.00084) [2022-07-10 13:32:05,166][26022] Updated weights on worker 0-0, policy_version 744190 (0.00090) [2022-07-10 13:32:07,177][25689] Fps is (10 sec: 5262.9, 60 sec: 5507.6, 300 sec: 5533.3). Total num frames: 762059776. Throughput: 0: 5710.2. Samples: 762060788. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:07,179][25689] Avg episode reward: [(0, '-2.857')] [2022-07-10 13:32:07,239][26022] Updated weights on worker 0-0, policy_version 744200 (0.00089) [2022-07-10 13:32:09,204][26022] Updated weights on worker 0-0, policy_version 744210 (0.00090) [2022-07-10 13:32:10,779][26022] Updated weights on worker 0-0, policy_version 744220 (0.00099) [2022-07-10 13:32:12,198][25689] Fps is (10 sec: 5568.2, 60 sec: 5543.6, 300 sec: 5529.6). Total num frames: 762088448. Throughput: 0: 5711.8. Samples: 762094296. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:12,199][25689] Avg episode reward: [(0, '-2.551')] [2022-07-10 13:32:12,918][26022] Updated weights on worker 0-0, policy_version 744230 (0.00093) [2022-07-10 13:32:14,494][26022] Updated weights on worker 0-0, policy_version 744240 (0.00085) [2022-07-10 13:32:16,363][26022] Updated weights on worker 0-0, policy_version 744250 (0.00088) [2022-07-10 13:32:17,205][25689] Fps is (10 sec: 5718.6, 60 sec: 5545.6, 300 sec: 5533.2). Total num frames: 762117120. Throughput: 0: 4874.1. Samples: 762110890. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:17,214][25689] Avg episode reward: [(0, '-2.579')] [2022-07-10 13:32:18,104][26022] Updated weights on worker 0-0, policy_version 744260 (0.00088) [2022-07-10 13:32:19,797][26022] Updated weights on worker 0-0, policy_version 744270 (0.00089) [2022-07-10 13:32:21,990][26022] Updated weights on worker 0-0, policy_version 744280 (0.00091) [2022-07-10 13:32:22,268][25689] Fps is (10 sec: 5491.2, 60 sec: 5524.8, 300 sec: 5529.5). Total num frames: 762143744. Throughput: 0: 5720.8. Samples: 762144842. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:22,269][25689] Avg episode reward: [(0, '-1.207')] [2022-07-10 13:32:23,504][26022] Updated weights on worker 0-0, policy_version 744290 (0.00090) [2022-07-10 13:32:25,232][26022] Updated weights on worker 0-0, policy_version 744300 (0.00086) [2022-07-10 13:32:27,315][25689] Fps is (10 sec: 5368.3, 60 sec: 5513.8, 300 sec: 5528.8). Total num frames: 762171392. Throughput: 0: 5837.6. Samples: 762178340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:27,315][25689] Avg episode reward: [(0, '-1.205')] [2022-07-10 13:32:27,435][26022] Updated weights on worker 0-0, policy_version 744310 (0.00089) [2022-07-10 13:32:28,983][26022] Updated weights on worker 0-0, policy_version 744320 (0.00094) [2022-07-10 13:32:31,061][26022] Updated weights on worker 0-0, policy_version 744330 (0.00092) [2022-07-10 13:32:32,347][25689] Fps is (10 sec: 5791.3, 60 sec: 5581.4, 300 sec: 5532.1). Total num frames: 762202112. Throughput: 0: 4981.7. Samples: 762194670. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:32,347][25689] Avg episode reward: [(0, '-1.148')] [2022-07-10 13:32:33,002][26022] Updated weights on worker 0-0, policy_version 744340 (0.00089) [2022-07-10 13:32:34,715][26022] Updated weights on worker 0-0, policy_version 744350 (0.00088) [2022-07-10 13:32:35,198][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:32:35,207][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000744354_762218496.pth [2022-07-10 13:32:35,211][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000742405_760222720.pth [2022-07-10 13:32:36,643][26022] Updated weights on worker 0-0, policy_version 744360 (0.00089) [2022-07-10 13:32:37,351][25689] Fps is (10 sec: 5713.5, 60 sec: 5531.2, 300 sec: 5528.6). Total num frames: 762228736. Throughput: 0: 5834.2. Samples: 762228426. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:37,352][25689] Avg episode reward: [(0, '-0.965')] [2022-07-10 13:32:38,223][26022] Updated weights on worker 0-0, policy_version 744370 (0.00084) [2022-07-10 13:32:40,320][26022] Updated weights on worker 0-0, policy_version 744380 (0.00091) [2022-07-10 13:32:41,972][26022] Updated weights on worker 0-0, policy_version 744390 (0.00092) [2022-07-10 13:32:42,358][25689] Fps is (10 sec: 5420.9, 60 sec: 5549.7, 300 sec: 5532.7). Total num frames: 762256384. Throughput: 0: 5828.0. Samples: 762261928. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:42,360][25689] Avg episode reward: [(0, '-0.270')] [2022-07-10 13:32:43,849][26022] Updated weights on worker 0-0, policy_version 744400 (0.00085) [2022-07-10 13:32:45,683][26022] Updated weights on worker 0-0, policy_version 744410 (0.00080) [2022-07-10 13:32:47,389][26022] Updated weights on worker 0-0, policy_version 744420 (0.00125) [2022-07-10 13:32:47,482][25689] Fps is (10 sec: 5660.6, 60 sec: 5533.8, 300 sec: 5530.5). Total num frames: 762286080. Throughput: 0: 4981.6. Samples: 762278804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:47,482][25689] Avg episode reward: [(0, '0.145')] [2022-07-10 13:32:49,388][26022] Updated weights on worker 0-0, policy_version 744430 (0.00098) [2022-07-10 13:32:51,252][26022] Updated weights on worker 0-0, policy_version 744440 (0.00089) [2022-07-10 13:32:52,486][25689] Fps is (10 sec: 5561.0, 60 sec: 5537.8, 300 sec: 5534.4). Total num frames: 762312704. Throughput: 0: 5832.8. Samples: 762312138. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:52,487][25689] Avg episode reward: [(0, '-0.086')] [2022-07-10 13:32:53,091][26022] Updated weights on worker 0-0, policy_version 744450 (0.00109) [2022-07-10 13:32:54,795][26022] Updated weights on worker 0-0, policy_version 744460 (0.00087) [2022-07-10 13:32:56,647][26022] Updated weights on worker 0-0, policy_version 744470 (0.00088) [2022-07-10 13:32:57,530][25689] Fps is (10 sec: 5503.1, 60 sec: 5537.4, 300 sec: 5530.7). Total num frames: 762341376. Throughput: 0: 5819.5. Samples: 762345854. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:32:57,530][25689] Avg episode reward: [(0, '-0.178')] [2022-07-10 13:32:58,420][26022] Updated weights on worker 0-0, policy_version 744480 (0.00088) [2022-07-10 13:33:00,547][26022] Updated weights on worker 0-0, policy_version 744490 (0.00058) [2022-07-10 13:33:02,544][25689] Fps is (10 sec: 5497.7, 60 sec: 5555.2, 300 sec: 5531.1). Total num frames: 762368000. Throughput: 0: 4982.9. Samples: 762362510. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:33:02,546][25689] Avg episode reward: [(0, '-0.247')] [2022-07-10 13:33:02,548][26022] Updated weights on worker 0-0, policy_version 744500 (0.00082) [2022-07-10 13:33:04,532][26022] Updated weights on worker 0-0, policy_version 744510 (0.00084) [2022-07-10 13:33:06,135][26022] Updated weights on worker 0-0, policy_version 744520 (0.00090) [2022-07-10 13:33:07,606][25689] Fps is (10 sec: 5386.1, 60 sec: 5558.2, 300 sec: 5533.5). Total num frames: 762395648. Throughput: 0: 5720.3. Samples: 762393922. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:33:07,607][25689] Avg episode reward: [(0, '-0.478')] [2022-07-10 13:33:08,223][26022] Updated weights on worker 0-0, policy_version 744530 (0.00103) [2022-07-10 13:33:09,815][26022] Updated weights on worker 0-0, policy_version 744540 (0.00112) [2022-07-10 13:33:11,956][26022] Updated weights on worker 0-0, policy_version 744550 (0.00087) [2022-07-10 13:33:12,680][25689] Fps is (10 sec: 5354.5, 60 sec: 5519.4, 300 sec: 5529.2). Total num frames: 762422272. Throughput: 0: 5695.8. Samples: 762427158. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:33:12,680][25689] Avg episode reward: [(0, '-0.850')] [2022-07-10 13:33:13,492][26022] Updated weights on worker 0-0, policy_version 744560 (0.00076) [2022-07-10 13:33:15,554][26022] Updated weights on worker 0-0, policy_version 744570 (0.00093) [2022-07-10 13:33:17,233][26022] Updated weights on worker 0-0, policy_version 744580 (0.00085) [2022-07-10 13:33:17,712][25689] Fps is (10 sec: 5573.2, 60 sec: 5534.1, 300 sec: 5532.1). Total num frames: 762451968. Throughput: 0: 5699.2. Samples: 762460874. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:33:17,712][25689] Avg episode reward: [(0, '-0.424')] [2022-07-10 13:33:19,116][26022] Updated weights on worker 0-0, policy_version 744590 (0.00061) [2022-07-10 13:33:20,912][26022] Updated weights on worker 0-0, policy_version 744600 (0.00086) [2022-07-10 13:33:22,722][25689] Fps is (10 sec: 5710.0, 60 sec: 5555.8, 300 sec: 5536.5). Total num frames: 762479616. Throughput: 0: 5709.4. Samples: 762477718. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 13:33:22,723][25689] Avg episode reward: [(0, '-1.174')] [2022-07-10 13:33:22,955][26022] Updated weights on worker 0-0, policy_version 744610 (0.00093) [2022-07-10 13:33:24,596][26022] Updated weights on worker 0-0, policy_version 744620 (0.00096) [2022-07-10 13:33:26,672][26022] Updated weights on worker 0-0, policy_version 744630 (0.00087) [2022-07-10 13:33:27,855][25689] Fps is (10 sec: 5451.5, 60 sec: 5547.9, 300 sec: 5530.6). Total num frames: 762507264. Throughput: 0: 5772.8. Samples: 762510814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:27,856][25689] Avg episode reward: [(0, '-0.230')] [2022-07-10 13:33:28,304][26022] Updated weights on worker 0-0, policy_version 744640 (0.00091) [2022-07-10 13:33:30,420][26022] Updated weights on worker 0-0, policy_version 744650 (0.00088) [2022-07-10 13:33:32,105][26022] Updated weights on worker 0-0, policy_version 744660 (0.00092) [2022-07-10 13:33:32,874][25689] Fps is (10 sec: 5447.3, 60 sec: 5498.4, 300 sec: 5532.2). Total num frames: 762534912. Throughput: 0: 5770.2. Samples: 762543680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:32,874][25689] Avg episode reward: [(0, '0.006')] [2022-07-10 13:33:34,188][26022] Updated weights on worker 0-0, policy_version 744670 (0.00084) [2022-07-10 13:33:35,870][26022] Updated weights on worker 0-0, policy_version 744680 (0.00090) [2022-07-10 13:33:37,649][26022] Updated weights on worker 0-0, policy_version 744690 (0.00083) [2022-07-10 13:33:37,929][25689] Fps is (10 sec: 5590.3, 60 sec: 5527.5, 300 sec: 5532.1). Total num frames: 762563584. Throughput: 0: 4926.0. Samples: 762560468. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:37,930][25689] Avg episode reward: [(0, '-0.859')] [2022-07-10 13:33:39,478][26022] Updated weights on worker 0-0, policy_version 744700 (0.00087) [2022-07-10 13:33:41,512][26022] Updated weights on worker 0-0, policy_version 744710 (0.00087) [2022-07-10 13:33:43,018][25689] Fps is (10 sec: 5551.9, 60 sec: 5520.1, 300 sec: 5528.2). Total num frames: 762591232. Throughput: 0: 5725.8. Samples: 762593924. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:43,018][25689] Avg episode reward: [(0, '-0.395')] [2022-07-10 13:33:43,348][26022] Updated weights on worker 0-0, policy_version 744720 (0.00105) [2022-07-10 13:33:45,121][26022] Updated weights on worker 0-0, policy_version 744730 (0.00088) [2022-07-10 13:33:46,850][26022] Updated weights on worker 0-0, policy_version 744740 (0.00096) [2022-07-10 13:33:48,133][25689] Fps is (10 sec: 5519.4, 60 sec: 5503.9, 300 sec: 5530.0). Total num frames: 762619904. Throughput: 0: 5739.8. Samples: 762627208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:48,136][25689] Avg episode reward: [(0, '-0.189')] [2022-07-10 13:33:48,746][26022] Updated weights on worker 0-0, policy_version 744750 (0.00089) [2022-07-10 13:33:50,694][26022] Updated weights on worker 0-0, policy_version 744760 (0.00088) [2022-07-10 13:33:52,479][26022] Updated weights on worker 0-0, policy_version 744770 (0.00087) [2022-07-10 13:33:53,149][25689] Fps is (10 sec: 5559.1, 60 sec: 5519.8, 300 sec: 5530.0). Total num frames: 762647552. Throughput: 0: 4939.9. Samples: 762643838. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:53,149][25689] Avg episode reward: [(0, '-0.486')] [2022-07-10 13:33:54,379][26022] Updated weights on worker 0-0, policy_version 744780 (0.00083) [2022-07-10 13:33:56,200][26022] Updated weights on worker 0-0, policy_version 744790 (0.00091) [2022-07-10 13:33:58,030][26022] Updated weights on worker 0-0, policy_version 744800 (0.00206) [2022-07-10 13:33:58,222][25689] Fps is (10 sec: 5480.8, 60 sec: 5500.2, 300 sec: 5526.0). Total num frames: 762675200. Throughput: 0: 5747.3. Samples: 762677098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:33:58,223][25689] Avg episode reward: [(0, '-2.544')] [2022-07-10 13:34:00,129][26022] Updated weights on worker 0-0, policy_version 744810 (0.00097) [2022-07-10 13:34:02,022][26022] Updated weights on worker 0-0, policy_version 744820 (0.00087) [2022-07-10 13:34:03,287][25689] Fps is (10 sec: 5252.5, 60 sec: 5478.9, 300 sec: 5523.8). Total num frames: 762700800. Throughput: 0: 5638.1. Samples: 762708202. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:03,287][25689] Avg episode reward: [(0, '-2.766')] [2022-07-10 13:34:03,911][26022] Updated weights on worker 0-0, policy_version 744830 (0.00088) [2022-07-10 13:34:05,968][26022] Updated weights on worker 0-0, policy_version 744840 (0.00619) [2022-07-10 13:34:07,615][26022] Updated weights on worker 0-0, policy_version 744850 (0.00087) [2022-07-10 13:34:08,362][25689] Fps is (10 sec: 5352.4, 60 sec: 5494.5, 300 sec: 5530.9). Total num frames: 762729472. Throughput: 0: 4831.2. Samples: 762724938. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:08,363][25689] Avg episode reward: [(0, '-1.860')] [2022-07-10 13:34:09,579][26022] Updated weights on worker 0-0, policy_version 744860 (0.00088) [2022-07-10 13:34:11,294][26022] Updated weights on worker 0-0, policy_version 744870 (0.00090) [2022-07-10 13:34:13,039][26022] Updated weights on worker 0-0, policy_version 744880 (0.00097) [2022-07-10 13:34:13,400][25689] Fps is (10 sec: 5771.1, 60 sec: 5548.3, 300 sec: 5531.4). Total num frames: 762759168. Throughput: 0: 5677.9. Samples: 762758824. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:13,401][25689] Avg episode reward: [(0, '-2.182')] [2022-07-10 13:34:15,184][26022] Updated weights on worker 0-0, policy_version 744890 (0.00086) [2022-07-10 13:34:16,710][26022] Updated weights on worker 0-0, policy_version 744900 (0.00086) [2022-07-10 13:34:18,494][25689] Fps is (10 sec: 5558.6, 60 sec: 5492.1, 300 sec: 5522.9). Total num frames: 762785792. Throughput: 0: 5663.2. Samples: 762791902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:18,496][25689] Avg episode reward: [(0, '-2.026')] [2022-07-10 13:34:18,699][26022] Updated weights on worker 0-0, policy_version 744910 (0.00094) [2022-07-10 13:34:20,393][26022] Updated weights on worker 0-0, policy_version 744920 (0.00087) [2022-07-10 13:34:22,501][26022] Updated weights on worker 0-0, policy_version 744930 (0.00086) [2022-07-10 13:34:23,504][25689] Fps is (10 sec: 5574.4, 60 sec: 5525.9, 300 sec: 5530.8). Total num frames: 762815488. Throughput: 0: 4974.5. Samples: 762808772. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:23,504][25689] Avg episode reward: [(0, '-1.176')] [2022-07-10 13:34:24,011][26022] Updated weights on worker 0-0, policy_version 744940 (0.00088) [2022-07-10 13:34:26,065][26022] Updated weights on worker 0-0, policy_version 744950 (0.00092) [2022-07-10 13:34:27,823][26022] Updated weights on worker 0-0, policy_version 744960 (0.00087) [2022-07-10 13:34:28,563][25689] Fps is (10 sec: 5593.7, 60 sec: 5515.7, 300 sec: 5523.0). Total num frames: 762842112. Throughput: 0: 5806.8. Samples: 762842238. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:28,563][25689] Avg episode reward: [(0, '-0.805')] [2022-07-10 13:34:29,705][26022] Updated weights on worker 0-0, policy_version 744970 (0.00093) [2022-07-10 13:34:31,714][26022] Updated weights on worker 0-0, policy_version 744980 (0.00090) [2022-07-10 13:34:33,549][26022] Updated weights on worker 0-0, policy_version 744990 (0.00087) [2022-07-10 13:34:33,588][25689] Fps is (10 sec: 5381.7, 60 sec: 5515.1, 300 sec: 5526.5). Total num frames: 762869760. Throughput: 0: 5760.2. Samples: 762875110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:33,589][25689] Avg episode reward: [(0, '-1.218')] [2022-07-10 13:34:35,177][26022] Updated weights on worker 0-0, policy_version 745000 (0.00086) [2022-07-10 13:34:35,448][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:34:35,462][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000745001_762881024.pth [2022-07-10 13:34:35,462][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000743055_760888320.pth [2022-07-10 13:34:37,092][26022] Updated weights on worker 0-0, policy_version 745010 (0.00088) [2022-07-10 13:34:38,625][25689] Fps is (10 sec: 5495.2, 60 sec: 5499.9, 300 sec: 5522.9). Total num frames: 762897408. Throughput: 0: 4963.1. Samples: 762891816. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:38,626][25689] Avg episode reward: [(0, '-1.694')] [2022-07-10 13:34:39,105][26022] Updated weights on worker 0-0, policy_version 745020 (0.00085) [2022-07-10 13:34:40,745][26022] Updated weights on worker 0-0, policy_version 745030 (0.00088) [2022-07-10 13:34:42,773][26022] Updated weights on worker 0-0, policy_version 745040 (0.00097) [2022-07-10 13:34:43,635][25689] Fps is (10 sec: 5606.0, 60 sec: 5524.0, 300 sec: 5524.2). Total num frames: 762926080. Throughput: 0: 5774.8. Samples: 762925024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:43,635][25689] Avg episode reward: [(0, '-1.953')] [2022-07-10 13:34:44,426][26022] Updated weights on worker 0-0, policy_version 745050 (0.00091) [2022-07-10 13:34:46,453][26022] Updated weights on worker 0-0, policy_version 745060 (0.00085) [2022-07-10 13:34:48,069][26022] Updated weights on worker 0-0, policy_version 745070 (0.00080) [2022-07-10 13:34:48,800][25689] Fps is (10 sec: 5635.7, 60 sec: 5519.4, 300 sec: 5528.2). Total num frames: 762954752. Throughput: 0: 5747.7. Samples: 762958558. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:48,801][25689] Avg episode reward: [(0, '-1.832')] [2022-07-10 13:34:49,959][26022] Updated weights on worker 0-0, policy_version 745080 (0.00089) [2022-07-10 13:34:52,034][26022] Updated weights on worker 0-0, policy_version 745090 (0.00088) [2022-07-10 13:34:53,699][26022] Updated weights on worker 0-0, policy_version 745100 (0.00093) [2022-07-10 13:34:53,819][25689] Fps is (10 sec: 5630.7, 60 sec: 5536.0, 300 sec: 5531.8). Total num frames: 762983424. Throughput: 0: 5782.3. Samples: 762992088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:53,819][25689] Avg episode reward: [(0, '-1.562')] [2022-07-10 13:34:55,524][26022] Updated weights on worker 0-0, policy_version 745110 (0.00087) [2022-07-10 13:34:57,416][26022] Updated weights on worker 0-0, policy_version 745120 (0.00086) [2022-07-10 13:34:58,875][25689] Fps is (10 sec: 5590.4, 60 sec: 5537.6, 300 sec: 5527.7). Total num frames: 763011072. Throughput: 0: 5786.1. Samples: 763008982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:34:58,875][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 13:34:59,159][26022] Updated weights on worker 0-0, policy_version 745130 (0.00092) [2022-07-10 13:35:00,854][26022] Updated weights on worker 0-0, policy_version 745140 (0.00094) [2022-07-10 13:35:03,188][26022] Updated weights on worker 0-0, policy_version 745150 (0.00090) [2022-07-10 13:35:03,936][25689] Fps is (10 sec: 5364.3, 60 sec: 5554.8, 300 sec: 5527.7). Total num frames: 763037696. Throughput: 0: 5684.6. Samples: 763040430. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:03,937][25689] Avg episode reward: [(0, '-0.113')] [2022-07-10 13:35:05,148][26022] Updated weights on worker 0-0, policy_version 745160 (0.00081) [2022-07-10 13:35:06,804][26022] Updated weights on worker 0-0, policy_version 745170 (0.00093) [2022-07-10 13:35:08,889][26022] Updated weights on worker 0-0, policy_version 745180 (0.00101) [2022-07-10 13:35:09,007][25689] Fps is (10 sec: 5356.2, 60 sec: 5538.3, 300 sec: 5526.9). Total num frames: 763065344. Throughput: 0: 5716.1. Samples: 763074062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:09,008][25689] Avg episode reward: [(0, '0.624')] [2022-07-10 13:35:10,437][26022] Updated weights on worker 0-0, policy_version 745190 (0.00078) [2022-07-10 13:35:12,485][26022] Updated weights on worker 0-0, policy_version 745200 (0.00081) [2022-07-10 13:35:14,048][25689] Fps is (10 sec: 5569.6, 60 sec: 5521.2, 300 sec: 5533.2). Total num frames: 763094016. Throughput: 0: 4882.1. Samples: 763090850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:14,049][25689] Avg episode reward: [(0, '-0.404')] [2022-07-10 13:35:14,115][26022] Updated weights on worker 0-0, policy_version 745210 (0.00086) [2022-07-10 13:35:16,198][26022] Updated weights on worker 0-0, policy_version 745220 (0.00089) [2022-07-10 13:35:17,867][26022] Updated weights on worker 0-0, policy_version 745230 (0.00090) [2022-07-10 13:35:19,111][25689] Fps is (10 sec: 5472.7, 60 sec: 5524.0, 300 sec: 5522.6). Total num frames: 763120640. Throughput: 0: 5688.7. Samples: 763124102. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:19,112][25689] Avg episode reward: [(0, '-0.746')] [2022-07-10 13:35:19,879][26022] Updated weights on worker 0-0, policy_version 745240 (0.00085) [2022-07-10 13:35:21,628][26022] Updated weights on worker 0-0, policy_version 745250 (0.00090) [2022-07-10 13:35:23,508][26022] Updated weights on worker 0-0, policy_version 745260 (0.00105) [2022-07-10 13:35:24,119][25689] Fps is (10 sec: 5490.5, 60 sec: 5507.2, 300 sec: 5521.0). Total num frames: 763149312. Throughput: 0: 5816.6. Samples: 763157828. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:24,120][25689] Avg episode reward: [(0, '-1.836')] [2022-07-10 13:35:25,142][26022] Updated weights on worker 0-0, policy_version 745270 (0.00090) [2022-07-10 13:35:27,221][26022] Updated weights on worker 0-0, policy_version 745280 (0.00086) [2022-07-10 13:35:28,804][26022] Updated weights on worker 0-0, policy_version 745290 (0.00095) [2022-07-10 13:35:29,206][25689] Fps is (10 sec: 5680.7, 60 sec: 5538.5, 300 sec: 5523.1). Total num frames: 763177984. Throughput: 0: 4972.4. Samples: 763174498. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:29,206][25689] Avg episode reward: [(0, '-1.559')] [2022-07-10 13:35:30,802][26022] Updated weights on worker 0-0, policy_version 745300 (0.00081) [2022-07-10 13:35:32,533][26022] Updated weights on worker 0-0, policy_version 745310 (0.00091) [2022-07-10 13:35:34,214][25689] Fps is (10 sec: 5579.4, 60 sec: 5540.1, 300 sec: 5526.6). Total num frames: 763205632. Throughput: 0: 5810.6. Samples: 763208024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:34,214][25689] Avg episode reward: [(0, '-2.015')] [2022-07-10 13:35:34,440][26022] Updated weights on worker 0-0, policy_version 745320 (0.00088) [2022-07-10 13:35:36,378][26022] Updated weights on worker 0-0, policy_version 745330 (0.00087) [2022-07-10 13:35:37,983][26022] Updated weights on worker 0-0, policy_version 745340 (0.00086) [2022-07-10 13:35:39,234][25689] Fps is (10 sec: 5514.1, 60 sec: 5541.6, 300 sec: 5527.0). Total num frames: 763233280. Throughput: 0: 5846.9. Samples: 763241758. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:39,235][25689] Avg episode reward: [(0, '-2.625')] [2022-07-10 13:35:40,060][26022] Updated weights on worker 0-0, policy_version 745350 (0.00082) [2022-07-10 13:35:41,627][26022] Updated weights on worker 0-0, policy_version 745360 (0.00089) [2022-07-10 13:35:43,697][26022] Updated weights on worker 0-0, policy_version 745370 (0.00090) [2022-07-10 13:35:44,266][25689] Fps is (10 sec: 5602.6, 60 sec: 5539.6, 300 sec: 5522.1). Total num frames: 763261952. Throughput: 0: 5000.8. Samples: 763258578. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:44,268][25689] Avg episode reward: [(0, '-1.458')] [2022-07-10 13:35:45,272][26022] Updated weights on worker 0-0, policy_version 745380 (0.00090) [2022-07-10 13:35:47,341][26022] Updated weights on worker 0-0, policy_version 745390 (0.00087) [2022-07-10 13:35:48,978][26022] Updated weights on worker 0-0, policy_version 745400 (0.00087) [2022-07-10 13:35:49,316][25689] Fps is (10 sec: 5687.6, 60 sec: 5550.1, 300 sec: 5528.5). Total num frames: 763290624. Throughput: 0: 5858.2. Samples: 763292308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:49,317][25689] Avg episode reward: [(0, '-1.474')] [2022-07-10 13:35:50,967][26022] Updated weights on worker 0-0, policy_version 745410 (0.00096) [2022-07-10 13:35:52,778][26022] Updated weights on worker 0-0, policy_version 745420 (0.00086) [2022-07-10 13:35:54,333][25689] Fps is (10 sec: 5594.4, 60 sec: 5533.4, 300 sec: 5532.0). Total num frames: 763318272. Throughput: 0: 5857.8. Samples: 763325880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:54,334][25689] Avg episode reward: [(0, '-1.666')] [2022-07-10 13:35:54,564][26022] Updated weights on worker 0-0, policy_version 745430 (0.00093) [2022-07-10 13:35:56,459][26022] Updated weights on worker 0-0, policy_version 745440 (0.00086) [2022-07-10 13:35:58,229][26022] Updated weights on worker 0-0, policy_version 745450 (0.00105) [2022-07-10 13:35:59,335][25689] Fps is (10 sec: 5417.2, 60 sec: 5521.4, 300 sec: 5518.4). Total num frames: 763344896. Throughput: 0: 5019.9. Samples: 763342662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:35:59,335][25689] Avg episode reward: [(0, '-1.213')] [2022-07-10 13:36:00,246][26022] Updated weights on worker 0-0, policy_version 745460 (0.00091) [2022-07-10 13:36:01,880][26022] Updated weights on worker 0-0, policy_version 745470 (0.00094) [2022-07-10 13:36:04,150][26022] Updated weights on worker 0-0, policy_version 745480 (0.00088) [2022-07-10 13:36:04,337][25689] Fps is (10 sec: 5425.1, 60 sec: 5543.8, 300 sec: 5529.7). Total num frames: 763372544. Throughput: 0: 5750.9. Samples: 763374006. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:04,338][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 13:36:05,987][26022] Updated weights on worker 0-0, policy_version 745490 (0.00091) [2022-07-10 13:36:07,909][26022] Updated weights on worker 0-0, policy_version 745500 (0.00092) [2022-07-10 13:36:09,403][25689] Fps is (10 sec: 5492.3, 60 sec: 5544.3, 300 sec: 5532.8). Total num frames: 763400192. Throughput: 0: 5750.8. Samples: 763407820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:09,403][25689] Avg episode reward: [(0, '-0.327')] [2022-07-10 13:36:09,700][26022] Updated weights on worker 0-0, policy_version 745510 (0.00092) [2022-07-10 13:36:11,543][26022] Updated weights on worker 0-0, policy_version 745520 (0.00110) [2022-07-10 13:36:13,303][26022] Updated weights on worker 0-0, policy_version 745530 (0.00092) [2022-07-10 13:36:14,438][25689] Fps is (10 sec: 5474.1, 60 sec: 5527.8, 300 sec: 5529.2). Total num frames: 763427840. Throughput: 0: 4914.0. Samples: 763424678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:14,439][25689] Avg episode reward: [(0, '-0.581')] [2022-07-10 13:36:15,113][26022] Updated weights on worker 0-0, policy_version 745540 (0.00093) [2022-07-10 13:36:16,912][26022] Updated weights on worker 0-0, policy_version 745550 (0.00085) [2022-07-10 13:36:19,038][26022] Updated weights on worker 0-0, policy_version 745560 (0.00091) [2022-07-10 13:36:19,447][25689] Fps is (10 sec: 5607.3, 60 sec: 5566.8, 300 sec: 5532.9). Total num frames: 763456512. Throughput: 0: 5744.6. Samples: 763458198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:19,447][25689] Avg episode reward: [(0, '-0.579')] [2022-07-10 13:36:20,527][26022] Updated weights on worker 0-0, policy_version 745570 (0.00082) [2022-07-10 13:36:22,710][26022] Updated weights on worker 0-0, policy_version 745580 (0.00098) [2022-07-10 13:36:24,165][26022] Updated weights on worker 0-0, policy_version 745590 (0.00085) [2022-07-10 13:36:24,471][25689] Fps is (10 sec: 5716.0, 60 sec: 5565.3, 300 sec: 5534.5). Total num frames: 763485184. Throughput: 0: 5842.8. Samples: 763491642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:24,471][25689] Avg episode reward: [(0, '0.086')] [2022-07-10 13:36:26,454][26022] Updated weights on worker 0-0, policy_version 745600 (0.00085) [2022-07-10 13:36:28,002][26022] Updated weights on worker 0-0, policy_version 745610 (0.00084) [2022-07-10 13:36:29,527][25689] Fps is (10 sec: 5586.9, 60 sec: 5551.1, 300 sec: 5537.5). Total num frames: 763512832. Throughput: 0: 4985.5. Samples: 763508152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:29,528][25689] Avg episode reward: [(0, '-0.540')] [2022-07-10 13:36:30,055][26022] Updated weights on worker 0-0, policy_version 745620 (0.00082) [2022-07-10 13:36:31,668][26022] Updated weights on worker 0-0, policy_version 745630 (0.00090) [2022-07-10 13:36:33,716][26022] Updated weights on worker 0-0, policy_version 745640 (0.00083) [2022-07-10 13:36:34,620][25689] Fps is (10 sec: 5448.1, 60 sec: 5543.2, 300 sec: 5529.1). Total num frames: 763540480. Throughput: 0: 5775.2. Samples: 763541232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:34,621][25689] Avg episode reward: [(0, '-0.488')] [2022-07-10 13:36:35,498][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:36:35,510][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000745650_763545600.pth [2022-07-10 13:36:35,510][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000743705_761553920.pth [2022-07-10 13:36:35,516][26022] Updated weights on worker 0-0, policy_version 745650 (0.00097) [2022-07-10 13:36:37,267][26022] Updated weights on worker 0-0, policy_version 745660 (0.00087) [2022-07-10 13:36:38,958][26022] Updated weights on worker 0-0, policy_version 745670 (0.00096) [2022-07-10 13:36:39,666][25689] Fps is (10 sec: 5656.0, 60 sec: 5574.8, 300 sec: 5539.0). Total num frames: 763570176. Throughput: 0: 5778.5. Samples: 763575036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:39,667][25689] Avg episode reward: [(0, '-1.420')] [2022-07-10 13:36:41,151][26022] Updated weights on worker 0-0, policy_version 745680 (0.00092) [2022-07-10 13:36:42,703][26022] Updated weights on worker 0-0, policy_version 745690 (0.00086) [2022-07-10 13:36:44,559][26022] Updated weights on worker 0-0, policy_version 745700 (0.00064) [2022-07-10 13:36:44,705][25689] Fps is (10 sec: 5584.5, 60 sec: 5540.2, 300 sec: 5527.0). Total num frames: 763596800. Throughput: 0: 4948.9. Samples: 763591782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:44,706][25689] Avg episode reward: [(0, '-2.109')] [2022-07-10 13:36:46,392][26022] Updated weights on worker 0-0, policy_version 745710 (0.00083) [2022-07-10 13:36:48,307][26022] Updated weights on worker 0-0, policy_version 745720 (0.00081) [2022-07-10 13:36:49,783][25689] Fps is (10 sec: 5465.6, 60 sec: 5537.7, 300 sec: 5533.3). Total num frames: 763625472. Throughput: 0: 5787.3. Samples: 763625378. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 13:36:49,785][25689] Avg episode reward: [(0, '-2.499')] [2022-07-10 13:36:50,050][26022] Updated weights on worker 0-0, policy_version 745730 (0.00095) [2022-07-10 13:36:52,028][26022] Updated weights on worker 0-0, policy_version 745740 (0.00091) [2022-07-10 13:36:53,641][26022] Updated weights on worker 0-0, policy_version 745750 (0.00093) [2022-07-10 13:36:54,830][25689] Fps is (10 sec: 5562.8, 60 sec: 5535.0, 300 sec: 5529.7). Total num frames: 763653120. Throughput: 0: 5817.5. Samples: 763658800. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:36:54,830][25689] Avg episode reward: [(0, '-2.037')] [2022-07-10 13:36:55,787][26022] Updated weights on worker 0-0, policy_version 745760 (0.00093) [2022-07-10 13:36:57,366][26022] Updated weights on worker 0-0, policy_version 745770 (0.00081) [2022-07-10 13:36:59,331][26022] Updated weights on worker 0-0, policy_version 745780 (0.00083) [2022-07-10 13:36:59,893][25689] Fps is (10 sec: 5570.8, 60 sec: 5563.2, 300 sec: 5539.3). Total num frames: 763681792. Throughput: 0: 4962.4. Samples: 763675410. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:36:59,893][25689] Avg episode reward: [(0, '-1.678')] [2022-07-10 13:37:01,095][26022] Updated weights on worker 0-0, policy_version 745790 (0.00087) [2022-07-10 13:37:03,455][26022] Updated weights on worker 0-0, policy_version 745800 (0.00095) [2022-07-10 13:37:04,983][25689] Fps is (10 sec: 5345.4, 60 sec: 5521.4, 300 sec: 5532.5). Total num frames: 763707392. Throughput: 0: 5677.0. Samples: 763706896. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:04,983][25689] Avg episode reward: [(0, '-1.946')] [2022-07-10 13:37:05,211][26022] Updated weights on worker 0-0, policy_version 745810 (0.00086) [2022-07-10 13:37:07,053][26022] Updated weights on worker 0-0, policy_version 745820 (0.00092) [2022-07-10 13:37:08,822][26022] Updated weights on worker 0-0, policy_version 745830 (0.00086) [2022-07-10 13:37:10,050][25689] Fps is (10 sec: 5343.5, 60 sec: 5538.2, 300 sec: 5531.7). Total num frames: 763736064. Throughput: 0: 5665.2. Samples: 763740192. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:10,050][25689] Avg episode reward: [(0, '-1.944')] [2022-07-10 13:37:10,749][26022] Updated weights on worker 0-0, policy_version 745840 (0.00083) [2022-07-10 13:37:12,640][26022] Updated weights on worker 0-0, policy_version 745850 (0.00087) [2022-07-10 13:37:14,535][26022] Updated weights on worker 0-0, policy_version 745860 (0.00088) [2022-07-10 13:37:15,104][25689] Fps is (10 sec: 5463.3, 60 sec: 5519.6, 300 sec: 5523.9). Total num frames: 763762688. Throughput: 0: 4843.5. Samples: 763757000. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:15,105][25689] Avg episode reward: [(0, '-1.066')] [2022-07-10 13:37:16,132][26022] Updated weights on worker 0-0, policy_version 745870 (0.00085) [2022-07-10 13:37:18,261][26022] Updated weights on worker 0-0, policy_version 745880 (0.00085) [2022-07-10 13:37:19,842][26022] Updated weights on worker 0-0, policy_version 745890 (0.00091) [2022-07-10 13:37:20,105][25689] Fps is (10 sec: 5600.7, 60 sec: 5537.1, 300 sec: 5535.4). Total num frames: 763792384. Throughput: 0: 5688.5. Samples: 763790388. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:20,106][25689] Avg episode reward: [(0, '-0.912')] [2022-07-10 13:37:21,783][26022] Updated weights on worker 0-0, policy_version 745900 (0.00082) [2022-07-10 13:37:23,488][26022] Updated weights on worker 0-0, policy_version 745910 (0.00085) [2022-07-10 13:37:25,107][25689] Fps is (10 sec: 5630.4, 60 sec: 5505.4, 300 sec: 5532.8). Total num frames: 763819008. Throughput: 0: 5827.2. Samples: 763824164. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:25,107][25689] Avg episode reward: [(0, '-0.814')] [2022-07-10 13:37:25,506][26022] Updated weights on worker 0-0, policy_version 745920 (0.00088) [2022-07-10 13:37:27,199][26022] Updated weights on worker 0-0, policy_version 745930 (0.00085) [2022-07-10 13:37:29,125][26022] Updated weights on worker 0-0, policy_version 745940 (0.00089) [2022-07-10 13:37:30,209][25689] Fps is (10 sec: 5574.2, 60 sec: 5535.0, 300 sec: 5528.0). Total num frames: 763848704. Throughput: 0: 5825.6. Samples: 763857634. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:30,209][25689] Avg episode reward: [(0, '-1.893')] [2022-07-10 13:37:30,797][26022] Updated weights on worker 0-0, policy_version 745950 (0.00084) [2022-07-10 13:37:32,971][26022] Updated weights on worker 0-0, policy_version 745960 (0.00091) [2022-07-10 13:37:34,506][26022] Updated weights on worker 0-0, policy_version 745970 (0.00087) [2022-07-10 13:37:35,276][25689] Fps is (10 sec: 5639.0, 60 sec: 5537.3, 300 sec: 5530.3). Total num frames: 763876352. Throughput: 0: 5821.6. Samples: 763874434. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:35,277][25689] Avg episode reward: [(0, '-2.460')] [2022-07-10 13:37:36,585][26022] Updated weights on worker 0-0, policy_version 745980 (0.00086) [2022-07-10 13:37:38,132][26022] Updated weights on worker 0-0, policy_version 745990 (0.00085) [2022-07-10 13:37:40,227][26022] Updated weights on worker 0-0, policy_version 746000 (0.00087) [2022-07-10 13:37:40,338][25689] Fps is (10 sec: 5459.0, 60 sec: 5502.1, 300 sec: 5529.3). Total num frames: 763904000. Throughput: 0: 5808.8. Samples: 763907918. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:40,339][25689] Avg episode reward: [(0, '-3.541')] [2022-07-10 13:37:41,792][26022] Updated weights on worker 0-0, policy_version 746010 (0.00090) [2022-07-10 13:37:43,606][26022] Updated weights on worker 0-0, policy_version 746020 (0.00093) [2022-07-10 13:37:45,408][25689] Fps is (10 sec: 5659.8, 60 sec: 5550.0, 300 sec: 5530.3). Total num frames: 763933696. Throughput: 0: 5798.0. Samples: 763941868. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:45,408][25689] Avg episode reward: [(0, '-3.486')] [2022-07-10 13:37:45,634][26022] Updated weights on worker 0-0, policy_version 746030 (0.00088) [2022-07-10 13:37:47,450][26022] Updated weights on worker 0-0, policy_version 746040 (0.00098) [2022-07-10 13:37:49,126][26022] Updated weights on worker 0-0, policy_version 746050 (0.00085) [2022-07-10 13:37:50,507][25689] Fps is (10 sec: 5639.3, 60 sec: 5531.1, 300 sec: 5532.0). Total num frames: 763961344. Throughput: 0: 4977.2. Samples: 763958662. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:50,507][25689] Avg episode reward: [(0, '-4.018')] [2022-07-10 13:37:50,955][26022] Updated weights on worker 0-0, policy_version 746060 (0.00088) [2022-07-10 13:37:52,792][26022] Updated weights on worker 0-0, policy_version 746070 (0.00091) [2022-07-10 13:37:54,516][26022] Updated weights on worker 0-0, policy_version 746080 (0.00091) [2022-07-10 13:37:55,511][25689] Fps is (10 sec: 5574.6, 60 sec: 5551.9, 300 sec: 5532.7). Total num frames: 763990016. Throughput: 0: 5835.7. Samples: 763992518. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:37:55,511][25689] Avg episode reward: [(0, '-3.029')] [2022-07-10 13:37:56,668][26022] Updated weights on worker 0-0, policy_version 746090 (0.00091) [2022-07-10 13:37:58,279][26022] Updated weights on worker 0-0, policy_version 746100 (0.00086) [2022-07-10 13:38:00,139][26022] Updated weights on worker 0-0, policy_version 746110 (0.00086) [2022-07-10 13:38:00,542][25689] Fps is (10 sec: 5612.5, 60 sec: 5538.0, 300 sec: 5535.8). Total num frames: 764017664. Throughput: 0: 5850.6. Samples: 764026120. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:00,542][25689] Avg episode reward: [(0, '-4.197')] [2022-07-10 13:38:02,404][26022] Updated weights on worker 0-0, policy_version 746120 (0.00087) [2022-07-10 13:38:04,174][26022] Updated weights on worker 0-0, policy_version 746130 (0.00085) [2022-07-10 13:38:05,555][25689] Fps is (10 sec: 5403.4, 60 sec: 5561.9, 300 sec: 5533.3). Total num frames: 764044288. Throughput: 0: 4911.8. Samples: 764040826. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:05,555][25689] Avg episode reward: [(0, '-3.380')] [2022-07-10 13:38:06,076][26022] Updated weights on worker 0-0, policy_version 746140 (0.00083) [2022-07-10 13:38:07,878][26022] Updated weights on worker 0-0, policy_version 746150 (0.00091) [2022-07-10 13:38:09,595][26022] Updated weights on worker 0-0, policy_version 746160 (0.00083) [2022-07-10 13:38:10,622][25689] Fps is (10 sec: 5485.3, 60 sec: 5561.8, 300 sec: 5540.3). Total num frames: 764072960. Throughput: 0: 5762.0. Samples: 764074568. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:10,623][25689] Avg episode reward: [(0, '-1.988')] [2022-07-10 13:38:11,628][26022] Updated weights on worker 0-0, policy_version 746170 (0.00095) [2022-07-10 13:38:13,274][26022] Updated weights on worker 0-0, policy_version 746180 (0.00094) [2022-07-10 13:38:15,211][26022] Updated weights on worker 0-0, policy_version 746190 (0.00094) [2022-07-10 13:38:15,645][25689] Fps is (10 sec: 5581.4, 60 sec: 5581.6, 300 sec: 5533.6). Total num frames: 764100608. Throughput: 0: 5743.3. Samples: 764108158. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:15,646][25689] Avg episode reward: [(0, '-3.158')] [2022-07-10 13:38:16,983][26022] Updated weights on worker 0-0, policy_version 746200 (0.00090) [2022-07-10 13:38:18,940][26022] Updated weights on worker 0-0, policy_version 746210 (0.00091) [2022-07-10 13:38:20,655][25689] Fps is (10 sec: 5511.5, 60 sec: 5547.0, 300 sec: 5533.6). Total num frames: 764128256. Throughput: 0: 4915.8. Samples: 764124994. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:20,656][25689] Avg episode reward: [(0, '-3.562')] [2022-07-10 13:38:20,727][26022] Updated weights on worker 0-0, policy_version 746220 (0.00087) [2022-07-10 13:38:22,520][26022] Updated weights on worker 0-0, policy_version 746230 (0.00090) [2022-07-10 13:38:24,364][26022] Updated weights on worker 0-0, policy_version 746240 (0.00089) [2022-07-10 13:38:25,659][25689] Fps is (10 sec: 5624.5, 60 sec: 5580.7, 300 sec: 5539.4). Total num frames: 764156928. Throughput: 0: 5852.3. Samples: 764158480. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:25,660][25689] Avg episode reward: [(0, '-3.976')] [2022-07-10 13:38:26,248][26022] Updated weights on worker 0-0, policy_version 746250 (0.00080) [2022-07-10 13:38:27,991][26022] Updated weights on worker 0-0, policy_version 746260 (0.00090) [2022-07-10 13:38:29,873][26022] Updated weights on worker 0-0, policy_version 746270 (0.00084) [2022-07-10 13:38:30,707][25689] Fps is (10 sec: 5399.4, 60 sec: 5517.9, 300 sec: 5532.0). Total num frames: 764182528. Throughput: 0: 5845.5. Samples: 764191972. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:30,707][25689] Avg episode reward: [(0, '-2.767')] [2022-07-10 13:38:31,582][26022] Updated weights on worker 0-0, policy_version 746280 (0.00083) [2022-07-10 13:38:33,702][26022] Updated weights on worker 0-0, policy_version 746290 (0.00085) [2022-07-10 13:38:35,239][26022] Updated weights on worker 0-0, policy_version 746300 (0.00087) [2022-07-10 13:38:35,574][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:38:35,586][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000746302_764213248.pth [2022-07-10 13:38:35,586][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000744354_762218496.pth [2022-07-10 13:38:35,741][25689] Fps is (10 sec: 5687.8, 60 sec: 5588.7, 300 sec: 5542.7). Total num frames: 764214272. Throughput: 0: 5008.0. Samples: 764208796. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:35,741][25689] Avg episode reward: [(0, '-2.483')] [2022-07-10 13:38:37,333][26022] Updated weights on worker 0-0, policy_version 746310 (0.00091) [2022-07-10 13:38:38,860][26022] Updated weights on worker 0-0, policy_version 746320 (0.00092) [2022-07-10 13:38:40,778][25689] Fps is (10 sec: 5795.5, 60 sec: 5574.1, 300 sec: 5540.2). Total num frames: 764240896. Throughput: 0: 5843.6. Samples: 764242584. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:40,779][25689] Avg episode reward: [(0, '-2.528')] [2022-07-10 13:38:40,871][26022] Updated weights on worker 0-0, policy_version 746330 (0.00081) [2022-07-10 13:38:42,631][26022] Updated weights on worker 0-0, policy_version 746340 (0.00087) [2022-07-10 13:38:44,450][26022] Updated weights on worker 0-0, policy_version 746350 (0.00083) [2022-07-10 13:38:45,799][25689] Fps is (10 sec: 5395.9, 60 sec: 5544.7, 300 sec: 5538.6). Total num frames: 764268544. Throughput: 0: 5855.1. Samples: 764276402. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:45,799][25689] Avg episode reward: [(0, '-1.799')] [2022-07-10 13:38:46,235][26022] Updated weights on worker 0-0, policy_version 746360 (0.00084) [2022-07-10 13:38:48,175][26022] Updated weights on worker 0-0, policy_version 746370 (0.00085) [2022-07-10 13:38:49,786][26022] Updated weights on worker 0-0, policy_version 746380 (0.00082) [2022-07-10 13:38:50,857][25689] Fps is (10 sec: 5689.7, 60 sec: 5582.4, 300 sec: 5544.7). Total num frames: 764298240. Throughput: 0: 5025.7. Samples: 764293238. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:50,857][25689] Avg episode reward: [(0, '-1.373')] [2022-07-10 13:38:52,050][26022] Updated weights on worker 0-0, policy_version 746390 (0.00093) [2022-07-10 13:38:53,584][26022] Updated weights on worker 0-0, policy_version 746400 (0.00086) [2022-07-10 13:38:55,683][26022] Updated weights on worker 0-0, policy_version 746410 (0.00088) [2022-07-10 13:38:55,875][25689] Fps is (10 sec: 5690.9, 60 sec: 5564.1, 300 sec: 5545.7). Total num frames: 764325888. Throughput: 0: 5853.6. Samples: 764326656. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:38:55,876][25689] Avg episode reward: [(0, '-0.780')] [2022-07-10 13:38:57,284][26022] Updated weights on worker 0-0, policy_version 746420 (0.00083) [2022-07-10 13:38:59,291][26022] Updated weights on worker 0-0, policy_version 746430 (0.00090) [2022-07-10 13:39:00,870][26022] Updated weights on worker 0-0, policy_version 746440 (0.00088) [2022-07-10 13:39:00,890][25689] Fps is (10 sec: 5613.0, 60 sec: 5582.5, 300 sec: 5556.9). Total num frames: 764354560. Throughput: 0: 5858.1. Samples: 764360404. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:00,891][25689] Avg episode reward: [(0, '-0.852')] [2022-07-10 13:39:03,009][26022] Updated weights on worker 0-0, policy_version 746450 (0.00106) [2022-07-10 13:39:04,871][26022] Updated weights on worker 0-0, policy_version 746460 (0.00089) [2022-07-10 13:39:05,920][25689] Fps is (10 sec: 5301.0, 60 sec: 5547.0, 300 sec: 5544.0). Total num frames: 764379136. Throughput: 0: 4920.0. Samples: 764375400. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:05,921][25689] Avg episode reward: [(0, '-0.643')] [2022-07-10 13:39:06,874][26022] Updated weights on worker 0-0, policy_version 746470 (0.00095) [2022-07-10 13:39:08,641][26022] Updated weights on worker 0-0, policy_version 746480 (0.00081) [2022-07-10 13:39:10,432][26022] Updated weights on worker 0-0, policy_version 746490 (0.00084) [2022-07-10 13:39:11,035][25689] Fps is (10 sec: 5249.1, 60 sec: 5542.7, 300 sec: 5539.1). Total num frames: 764407808. Throughput: 0: 5735.1. Samples: 764408960. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:11,035][25689] Avg episode reward: [(0, '-1.813')] [2022-07-10 13:39:12,090][26022] Updated weights on worker 0-0, policy_version 746500 (0.00100) [2022-07-10 13:39:14,143][26022] Updated weights on worker 0-0, policy_version 746510 (0.00100) [2022-07-10 13:39:15,929][26022] Updated weights on worker 0-0, policy_version 746520 (0.00084) [2022-07-10 13:39:16,079][25689] Fps is (10 sec: 5644.8, 60 sec: 5557.7, 300 sec: 5547.0). Total num frames: 764436480. Throughput: 0: 5736.3. Samples: 764442550. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:16,079][25689] Avg episode reward: [(0, '-1.546')] [2022-07-10 13:39:17,826][26022] Updated weights on worker 0-0, policy_version 746530 (0.00089) [2022-07-10 13:39:19,624][26022] Updated weights on worker 0-0, policy_version 746540 (0.00089) [2022-07-10 13:39:21,151][25689] Fps is (10 sec: 5668.7, 60 sec: 5569.0, 300 sec: 5542.4). Total num frames: 764465152. Throughput: 0: 4881.3. Samples: 764459300. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:21,151][25689] Avg episode reward: [(0, '-3.248')] [2022-07-10 13:39:21,382][26022] Updated weights on worker 0-0, policy_version 746550 (0.00084) [2022-07-10 13:39:23,344][26022] Updated weights on worker 0-0, policy_version 746560 (0.00094) [2022-07-10 13:39:25,045][26022] Updated weights on worker 0-0, policy_version 746570 (0.00090) [2022-07-10 13:39:26,165][25689] Fps is (10 sec: 5583.7, 60 sec: 5551.0, 300 sec: 5546.6). Total num frames: 764492800. Throughput: 0: 5809.1. Samples: 764493006. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:26,166][25689] Avg episode reward: [(0, '-3.452')] [2022-07-10 13:39:26,894][26022] Updated weights on worker 0-0, policy_version 746580 (0.00087) [2022-07-10 13:39:28,683][26022] Updated weights on worker 0-0, policy_version 746590 (0.00089) [2022-07-10 13:39:30,675][26022] Updated weights on worker 0-0, policy_version 746600 (0.00090) [2022-07-10 13:39:31,222][25689] Fps is (10 sec: 5592.1, 60 sec: 5601.0, 300 sec: 5549.5). Total num frames: 764521472. Throughput: 0: 5827.4. Samples: 764526598. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:31,222][25689] Avg episode reward: [(0, '-2.771')] [2022-07-10 13:39:32,549][26022] Updated weights on worker 0-0, policy_version 746610 (0.00087) [2022-07-10 13:39:34,392][26022] Updated weights on worker 0-0, policy_version 746620 (0.00085) [2022-07-10 13:39:36,151][26022] Updated weights on worker 0-0, policy_version 746630 (0.00089) [2022-07-10 13:39:36,309][25689] Fps is (10 sec: 5552.1, 60 sec: 5528.4, 300 sec: 5548.5). Total num frames: 764549120. Throughput: 0: 4980.6. Samples: 764543310. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:36,310][25689] Avg episode reward: [(0, '-2.181')] [2022-07-10 13:39:37,994][26022] Updated weights on worker 0-0, policy_version 746640 (0.00086) [2022-07-10 13:39:39,973][26022] Updated weights on worker 0-0, policy_version 746650 (0.00087) [2022-07-10 13:39:41,387][25689] Fps is (10 sec: 5540.3, 60 sec: 5558.5, 300 sec: 5547.3). Total num frames: 764577792. Throughput: 0: 5801.0. Samples: 764576694. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:41,388][25689] Avg episode reward: [(0, '-1.754')] [2022-07-10 13:39:41,598][26022] Updated weights on worker 0-0, policy_version 746660 (0.00080) [2022-07-10 13:39:43,587][26022] Updated weights on worker 0-0, policy_version 746670 (0.00105) [2022-07-10 13:39:45,243][26022] Updated weights on worker 0-0, policy_version 746680 (0.00083) [2022-07-10 13:39:46,411][25689] Fps is (10 sec: 5575.3, 60 sec: 5558.2, 300 sec: 5546.5). Total num frames: 764605440. Throughput: 0: 5790.3. Samples: 764610234. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:46,411][25689] Avg episode reward: [(0, '-2.240')] [2022-07-10 13:39:47,181][26022] Updated weights on worker 0-0, policy_version 746690 (0.00086) [2022-07-10 13:39:49,004][26022] Updated weights on worker 0-0, policy_version 746700 (0.00083) [2022-07-10 13:39:50,735][26022] Updated weights on worker 0-0, policy_version 746710 (0.00089) [2022-07-10 13:39:51,498][25689] Fps is (10 sec: 5468.9, 60 sec: 5521.8, 300 sec: 5541.7). Total num frames: 764633088. Throughput: 0: 5787.6. Samples: 764643950. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:51,499][25689] Avg episode reward: [(0, '-1.692')] [2022-07-10 13:39:52,478][26022] Updated weights on worker 0-0, policy_version 746720 (0.00088) [2022-07-10 13:39:54,631][26022] Updated weights on worker 0-0, policy_version 746730 (0.00091) [2022-07-10 13:39:56,311][26022] Updated weights on worker 0-0, policy_version 746740 (0.00090) [2022-07-10 13:39:56,512][25689] Fps is (10 sec: 5677.0, 60 sec: 5556.0, 300 sec: 5549.4). Total num frames: 764662784. Throughput: 0: 5808.1. Samples: 764660650. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:39:56,512][25689] Avg episode reward: [(0, '-2.669')] [2022-07-10 13:39:58,233][26022] Updated weights on worker 0-0, policy_version 746750 (0.00092) [2022-07-10 13:39:59,727][26022] Updated weights on worker 0-0, policy_version 746760 (0.00096) [2022-07-10 13:40:01,516][25689] Fps is (10 sec: 5621.9, 60 sec: 5523.2, 300 sec: 5550.5). Total num frames: 764689408. Throughput: 0: 5845.4. Samples: 764694356. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:40:01,517][25689] Avg episode reward: [(0, '-3.630')] [2022-07-10 13:40:02,167][26022] Updated weights on worker 0-0, policy_version 746770 (0.00092) [2022-07-10 13:40:04,040][26022] Updated weights on worker 0-0, policy_version 746780 (0.00092) [2022-07-10 13:40:05,843][26022] Updated weights on worker 0-0, policy_version 746790 (0.00099) [2022-07-10 13:40:06,521][25689] Fps is (10 sec: 5320.1, 60 sec: 5559.3, 300 sec: 5548.3). Total num frames: 764716032. Throughput: 0: 5737.8. Samples: 764725620. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:40:06,521][25689] Avg episode reward: [(0, '-3.242')] [2022-07-10 13:40:07,891][26022] Updated weights on worker 0-0, policy_version 746800 (0.00088) [2022-07-10 13:40:09,581][26022] Updated weights on worker 0-0, policy_version 746810 (0.00084) [2022-07-10 13:40:11,592][25689] Fps is (10 sec: 5284.9, 60 sec: 5529.5, 300 sec: 5540.8). Total num frames: 764742656. Throughput: 0: 4895.3. Samples: 764742314. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:40:11,592][25689] Avg episode reward: [(0, '-3.359')] [2022-07-10 13:40:11,629][26022] Updated weights on worker 0-0, policy_version 746820 (0.00090) [2022-07-10 13:40:13,128][26022] Updated weights on worker 0-0, policy_version 746830 (0.00092) [2022-07-10 13:40:15,108][26022] Updated weights on worker 0-0, policy_version 746840 (0.00086) [2022-07-10 13:40:16,610][25689] Fps is (10 sec: 5582.0, 60 sec: 5548.7, 300 sec: 5552.0). Total num frames: 764772352. Throughput: 0: 5748.1. Samples: 764776176. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2022-07-10 13:40:16,611][25689] Avg episode reward: [(0, '-2.288')] [2022-07-10 13:40:17,088][26022] Updated weights on worker 0-0, policy_version 746850 (0.00101) [2022-07-10 13:40:18,680][26022] Updated weights on worker 0-0, policy_version 746860 (0.00095) [2022-07-10 13:40:20,535][26022] Updated weights on worker 0-0, policy_version 746870 (0.00089) [2022-07-10 13:40:21,698][25689] Fps is (10 sec: 5775.4, 60 sec: 5547.2, 300 sec: 5550.5). Total num frames: 764801024. Throughput: 0: 5720.4. Samples: 764809804. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:21,699][25689] Avg episode reward: [(0, '-2.096')] [2022-07-10 13:40:22,324][26022] Updated weights on worker 0-0, policy_version 746880 (0.00094) [2022-07-10 13:40:24,451][26022] Updated weights on worker 0-0, policy_version 746890 (0.00089) [2022-07-10 13:40:26,054][26022] Updated weights on worker 0-0, policy_version 746900 (0.00094) [2022-07-10 13:40:26,768][25689] Fps is (10 sec: 5544.6, 60 sec: 5542.2, 300 sec: 5547.4). Total num frames: 764828672. Throughput: 0: 4984.3. Samples: 764826538. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:26,768][25689] Avg episode reward: [(0, '-2.391')] [2022-07-10 13:40:28,037][26022] Updated weights on worker 0-0, policy_version 746910 (0.00086) [2022-07-10 13:40:29,670][26022] Updated weights on worker 0-0, policy_version 746920 (0.00096) [2022-07-10 13:40:31,580][26022] Updated weights on worker 0-0, policy_version 746930 (0.00084) [2022-07-10 13:40:31,896][25689] Fps is (10 sec: 5522.5, 60 sec: 5535.6, 300 sec: 5548.6). Total num frames: 764857344. Throughput: 0: 5789.8. Samples: 764859874. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:31,897][25689] Avg episode reward: [(0, '-2.601')] [2022-07-10 13:40:33,601][26022] Updated weights on worker 0-0, policy_version 746940 (0.00091) [2022-07-10 13:40:35,151][26022] Updated weights on worker 0-0, policy_version 746950 (0.00095) [2022-07-10 13:40:35,604][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:40:35,621][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000746953_764879872.pth [2022-07-10 13:40:35,622][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000745001_762881024.pth [2022-07-10 13:40:36,924][25689] Fps is (10 sec: 5545.6, 60 sec: 5541.1, 300 sec: 5548.4). Total num frames: 764884992. Throughput: 0: 5776.9. Samples: 764893524. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:36,926][25689] Avg episode reward: [(0, '-2.858')] [2022-07-10 13:40:37,112][26022] Updated weights on worker 0-0, policy_version 746960 (0.00086) [2022-07-10 13:40:38,827][26022] Updated weights on worker 0-0, policy_version 746970 (0.00087) [2022-07-10 13:40:40,801][26022] Updated weights on worker 0-0, policy_version 746980 (0.00100) [2022-07-10 13:40:41,966][25689] Fps is (10 sec: 5593.2, 60 sec: 5544.4, 300 sec: 5548.2). Total num frames: 764913664. Throughput: 0: 4948.9. Samples: 764910106. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:41,966][25689] Avg episode reward: [(0, '-3.083')] [2022-07-10 13:40:42,659][26022] Updated weights on worker 0-0, policy_version 746990 (0.00078) [2022-07-10 13:40:44,270][26022] Updated weights on worker 0-0, policy_version 747000 (0.00088) [2022-07-10 13:40:46,295][26022] Updated weights on worker 0-0, policy_version 747010 (0.00099) [2022-07-10 13:40:46,970][25689] Fps is (10 sec: 5708.1, 60 sec: 5563.1, 300 sec: 5549.1). Total num frames: 764942336. Throughput: 0: 5820.5. Samples: 764944124. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:46,970][25689] Avg episode reward: [(0, '-3.468')] [2022-07-10 13:40:48,018][26022] Updated weights on worker 0-0, policy_version 747020 (0.00085) [2022-07-10 13:40:49,909][26022] Updated weights on worker 0-0, policy_version 747030 (0.00088) [2022-07-10 13:40:51,725][26022] Updated weights on worker 0-0, policy_version 747040 (0.00093) [2022-07-10 13:40:52,008][25689] Fps is (10 sec: 5710.3, 60 sec: 5584.6, 300 sec: 5552.2). Total num frames: 764971008. Throughput: 0: 5853.5. Samples: 764977598. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:52,009][25689] Avg episode reward: [(0, '-4.112')] [2022-07-10 13:40:53,485][26022] Updated weights on worker 0-0, policy_version 747050 (0.00094) [2022-07-10 13:40:55,440][26022] Updated weights on worker 0-0, policy_version 747060 (0.00092) [2022-07-10 13:40:57,025][25689] Fps is (10 sec: 5499.1, 60 sec: 5533.5, 300 sec: 5551.9). Total num frames: 764997632. Throughput: 0: 5022.0. Samples: 764994474. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:40:57,026][25689] Avg episode reward: [(0, '-3.990')] [2022-07-10 13:40:57,376][26022] Updated weights on worker 0-0, policy_version 747070 (0.00087) [2022-07-10 13:40:58,940][26022] Updated weights on worker 0-0, policy_version 747080 (0.00086) [2022-07-10 13:41:00,871][26022] Updated weights on worker 0-0, policy_version 747090 (0.00081) [2022-07-10 13:41:02,049][25689] Fps is (10 sec: 5302.9, 60 sec: 5531.7, 300 sec: 5548.0). Total num frames: 765024256. Throughput: 0: 5883.5. Samples: 765028266. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:02,050][25689] Avg episode reward: [(0, '-3.608')] [2022-07-10 13:41:03,021][26022] Updated weights on worker 0-0, policy_version 747100 (0.00105) [2022-07-10 13:41:05,067][26022] Updated weights on worker 0-0, policy_version 747110 (0.00098) [2022-07-10 13:41:06,751][26022] Updated weights on worker 0-0, policy_version 747120 (0.00089) [2022-07-10 13:41:07,066][25689] Fps is (10 sec: 5405.3, 60 sec: 5547.5, 300 sec: 5548.9). Total num frames: 765051904. Throughput: 0: 5760.5. Samples: 765059888. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:07,067][25689] Avg episode reward: [(0, '-2.915')] [2022-07-10 13:41:08,676][26022] Updated weights on worker 0-0, policy_version 747130 (0.00085) [2022-07-10 13:41:10,499][26022] Updated weights on worker 0-0, policy_version 747140 (0.00095) [2022-07-10 13:41:12,141][25689] Fps is (10 sec: 5580.9, 60 sec: 5580.9, 300 sec: 5551.6). Total num frames: 765080576. Throughput: 0: 4914.6. Samples: 765076542. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:12,141][25689] Avg episode reward: [(0, '-4.004')] [2022-07-10 13:41:12,171][26022] Updated weights on worker 0-0, policy_version 747150 (0.00087) [2022-07-10 13:41:14,034][26022] Updated weights on worker 0-0, policy_version 747160 (0.00087) [2022-07-10 13:41:15,883][26022] Updated weights on worker 0-0, policy_version 747170 (0.00082) [2022-07-10 13:41:17,157][25689] Fps is (10 sec: 5682.7, 60 sec: 5564.2, 300 sec: 5551.5). Total num frames: 765109248. Throughput: 0: 5752.9. Samples: 765110288. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:17,157][25689] Avg episode reward: [(0, '-3.777')] [2022-07-10 13:41:17,682][26022] Updated weights on worker 0-0, policy_version 747180 (0.00090) [2022-07-10 13:41:19,500][26022] Updated weights on worker 0-0, policy_version 747190 (0.00089) [2022-07-10 13:41:21,269][26022] Updated weights on worker 0-0, policy_version 747200 (0.00085) [2022-07-10 13:41:22,172][25689] Fps is (10 sec: 5512.5, 60 sec: 5537.1, 300 sec: 5544.8). Total num frames: 765135872. Throughput: 0: 5746.0. Samples: 765143890. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:22,172][25689] Avg episode reward: [(0, '-2.794')] [2022-07-10 13:41:23,115][26022] Updated weights on worker 0-0, policy_version 747210 (0.00092) [2022-07-10 13:41:25,129][26022] Updated weights on worker 0-0, policy_version 747220 (0.00086) [2022-07-10 13:41:26,878][26022] Updated weights on worker 0-0, policy_version 747230 (0.00081) [2022-07-10 13:41:27,199][25689] Fps is (10 sec: 5710.2, 60 sec: 5591.8, 300 sec: 5555.6). Total num frames: 765166592. Throughput: 0: 5013.5. Samples: 765160826. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:27,200][25689] Avg episode reward: [(0, '-3.261')] [2022-07-10 13:41:28,791][26022] Updated weights on worker 0-0, policy_version 747240 (0.00097) [2022-07-10 13:41:30,564][26022] Updated weights on worker 0-0, policy_version 747250 (0.00092) [2022-07-10 13:41:32,264][25689] Fps is (10 sec: 5682.2, 60 sec: 5563.8, 300 sec: 5552.7). Total num frames: 765193216. Throughput: 0: 5835.7. Samples: 765193972. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:32,264][25689] Avg episode reward: [(0, '-3.421')] [2022-07-10 13:41:32,347][26022] Updated weights on worker 0-0, policy_version 747260 (0.00092) [2022-07-10 13:41:34,372][26022] Updated weights on worker 0-0, policy_version 747270 (0.00090) [2022-07-10 13:41:35,995][26022] Updated weights on worker 0-0, policy_version 747280 (0.00087) [2022-07-10 13:41:37,283][25689] Fps is (10 sec: 5382.1, 60 sec: 5564.5, 300 sec: 5546.4). Total num frames: 765220864. Throughput: 0: 5813.2. Samples: 765227286. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:37,284][25689] Avg episode reward: [(0, '-2.164')] [2022-07-10 13:41:38,000][26022] Updated weights on worker 0-0, policy_version 747290 (0.00084) [2022-07-10 13:41:39,716][26022] Updated weights on worker 0-0, policy_version 747300 (0.00089) [2022-07-10 13:41:41,595][26022] Updated weights on worker 0-0, policy_version 747310 (0.00090) [2022-07-10 13:41:42,290][25689] Fps is (10 sec: 5515.0, 60 sec: 5550.8, 300 sec: 5550.4). Total num frames: 765248512. Throughput: 0: 4977.5. Samples: 765244028. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:42,292][25689] Avg episode reward: [(0, '-2.569')] [2022-07-10 13:41:43,459][26022] Updated weights on worker 0-0, policy_version 747320 (0.00102) [2022-07-10 13:41:45,175][26022] Updated weights on worker 0-0, policy_version 747330 (0.00094) [2022-07-10 13:41:46,943][26022] Updated weights on worker 0-0, policy_version 747340 (0.00087) [2022-07-10 13:41:47,312][25689] Fps is (10 sec: 5717.8, 60 sec: 5566.1, 300 sec: 5554.9). Total num frames: 765278208. Throughput: 0: 5820.7. Samples: 765277896. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:47,312][25689] Avg episode reward: [(0, '-2.284')] [2022-07-10 13:41:48,974][26022] Updated weights on worker 0-0, policy_version 747350 (0.00087) [2022-07-10 13:41:50,632][26022] Updated weights on worker 0-0, policy_version 747360 (0.00088) [2022-07-10 13:41:52,387][25689] Fps is (10 sec: 5679.5, 60 sec: 5545.8, 300 sec: 5554.3). Total num frames: 765305856. Throughput: 0: 5835.8. Samples: 765311406. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:52,387][25689] Avg episode reward: [(0, '-2.606')] [2022-07-10 13:41:52,488][26022] Updated weights on worker 0-0, policy_version 747370 (0.00089) [2022-07-10 13:41:54,382][26022] Updated weights on worker 0-0, policy_version 747380 (0.00086) [2022-07-10 13:41:56,311][26022] Updated weights on worker 0-0, policy_version 747390 (0.00090) [2022-07-10 13:41:57,441][25689] Fps is (10 sec: 5459.2, 60 sec: 5559.4, 300 sec: 5551.1). Total num frames: 765333504. Throughput: 0: 4997.5. Samples: 765328026. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:41:57,441][25689] Avg episode reward: [(0, '-1.746')] [2022-07-10 13:41:58,251][26022] Updated weights on worker 0-0, policy_version 747400 (0.00418) [2022-07-10 13:41:59,829][26022] Updated weights on worker 0-0, policy_version 747410 (0.00099) [2022-07-10 13:42:02,267][26022] Updated weights on worker 0-0, policy_version 747420 (0.00086) [2022-07-10 13:42:02,455][25689] Fps is (10 sec: 5187.1, 60 sec: 5526.4, 300 sec: 5549.1). Total num frames: 765358080. Throughput: 0: 5816.2. Samples: 765361310. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:02,455][25689] Avg episode reward: [(0, '-1.098')] [2022-07-10 13:42:03,992][26022] Updated weights on worker 0-0, policy_version 747430 (0.00801) [2022-07-10 13:42:05,918][26022] Updated weights on worker 0-0, policy_version 747440 (0.00086) [2022-07-10 13:42:07,457][25689] Fps is (10 sec: 5316.1, 60 sec: 5544.6, 300 sec: 5550.3). Total num frames: 765386752. Throughput: 0: 5690.3. Samples: 765392528. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:07,458][25689] Avg episode reward: [(0, '-0.715')] [2022-07-10 13:42:07,635][26022] Updated weights on worker 0-0, policy_version 747450 (0.00090) [2022-07-10 13:42:09,699][26022] Updated weights on worker 0-0, policy_version 747460 (0.00091) [2022-07-10 13:42:11,437][26022] Updated weights on worker 0-0, policy_version 747470 (0.00090) [2022-07-10 13:42:12,552][25689] Fps is (10 sec: 5577.5, 60 sec: 5525.8, 300 sec: 5552.9). Total num frames: 765414400. Throughput: 0: 4850.0. Samples: 765409210. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:12,553][25689] Avg episode reward: [(0, '-1.694')] [2022-07-10 13:42:13,239][26022] Updated weights on worker 0-0, policy_version 747480 (0.00091) [2022-07-10 13:42:15,045][26022] Updated weights on worker 0-0, policy_version 747490 (0.00079) [2022-07-10 13:42:16,820][26022] Updated weights on worker 0-0, policy_version 747500 (0.00103) [2022-07-10 13:42:17,554][25689] Fps is (10 sec: 5578.1, 60 sec: 5527.1, 300 sec: 5549.5). Total num frames: 765443072. Throughput: 0: 5706.0. Samples: 765442790. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:17,554][25689] Avg episode reward: [(0, '-2.453')] [2022-07-10 13:42:18,607][26022] Updated weights on worker 0-0, policy_version 747510 (0.00098) [2022-07-10 13:42:20,494][26022] Updated weights on worker 0-0, policy_version 747520 (0.00083) [2022-07-10 13:42:22,211][26022] Updated weights on worker 0-0, policy_version 747530 (0.00092) [2022-07-10 13:42:22,598][25689] Fps is (10 sec: 5708.5, 60 sec: 5558.4, 300 sec: 5555.6). Total num frames: 765471744. Throughput: 0: 5737.2. Samples: 765476874. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:22,598][25689] Avg episode reward: [(0, '-3.510')] [2022-07-10 13:42:24,275][26022] Updated weights on worker 0-0, policy_version 747540 (0.00084) [2022-07-10 13:42:25,958][26022] Updated weights on worker 0-0, policy_version 747550 (0.00088) [2022-07-10 13:42:27,616][25689] Fps is (10 sec: 5699.2, 60 sec: 5525.4, 300 sec: 5553.7). Total num frames: 765500416. Throughput: 0: 5017.6. Samples: 765493674. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:27,618][25689] Avg episode reward: [(0, '-4.075')] [2022-07-10 13:42:28,024][26022] Updated weights on worker 0-0, policy_version 747560 (0.00092) [2022-07-10 13:42:29,648][26022] Updated weights on worker 0-0, policy_version 747570 (0.00089) [2022-07-10 13:42:31,600][26022] Updated weights on worker 0-0, policy_version 747580 (0.00087) [2022-07-10 13:42:32,703][25689] Fps is (10 sec: 5573.5, 60 sec: 5540.3, 300 sec: 5553.3). Total num frames: 765528064. Throughput: 0: 5854.8. Samples: 765527186. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:32,703][25689] Avg episode reward: [(0, '-3.992')] [2022-07-10 13:42:33,100][26022] Updated weights on worker 0-0, policy_version 747590 (0.00083) [2022-07-10 13:42:35,336][26022] Updated weights on worker 0-0, policy_version 747600 (0.00087) [2022-07-10 13:42:35,670][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:42:35,682][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000747602_765544448.pth [2022-07-10 13:42:35,683][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000745650_763545600.pth [2022-07-10 13:42:36,996][26022] Updated weights on worker 0-0, policy_version 747610 (0.00094) [2022-07-10 13:42:37,710][25689] Fps is (10 sec: 5477.7, 60 sec: 5541.3, 300 sec: 5554.4). Total num frames: 765555712. Throughput: 0: 5843.3. Samples: 765560570. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:37,711][25689] Avg episode reward: [(0, '-4.784')] [2022-07-10 13:42:38,869][26022] Updated weights on worker 0-0, policy_version 747620 (0.00103) [2022-07-10 13:42:40,604][26022] Updated weights on worker 0-0, policy_version 747630 (0.00092) [2022-07-10 13:42:42,456][26022] Updated weights on worker 0-0, policy_version 747640 (0.00081) [2022-07-10 13:42:42,763][25689] Fps is (10 sec: 5496.4, 60 sec: 5537.1, 300 sec: 5547.8). Total num frames: 765583360. Throughput: 0: 4990.6. Samples: 765577512. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:42,764][25689] Avg episode reward: [(0, '-2.477')] [2022-07-10 13:42:44,489][26022] Updated weights on worker 0-0, policy_version 747650 (0.00095) [2022-07-10 13:42:46,239][26022] Updated weights on worker 0-0, policy_version 747660 (0.00086) [2022-07-10 13:42:47,820][25689] Fps is (10 sec: 5672.4, 60 sec: 5534.0, 300 sec: 5555.5). Total num frames: 765613056. Throughput: 0: 5800.7. Samples: 765610872. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:47,821][25689] Avg episode reward: [(0, '-1.010')] [2022-07-10 13:42:47,933][26022] Updated weights on worker 0-0, policy_version 747670 (0.00087) [2022-07-10 13:42:49,942][26022] Updated weights on worker 0-0, policy_version 747680 (0.00091) [2022-07-10 13:42:51,430][26022] Updated weights on worker 0-0, policy_version 747690 (0.00092) [2022-07-10 13:42:52,861][25689] Fps is (10 sec: 5678.5, 60 sec: 5537.0, 300 sec: 5551.3). Total num frames: 765640704. Throughput: 0: 5827.7. Samples: 765644666. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:52,863][25689] Avg episode reward: [(0, '-2.725')] [2022-07-10 13:42:53,458][26022] Updated weights on worker 0-0, policy_version 747700 (0.00093) [2022-07-10 13:42:55,246][26022] Updated weights on worker 0-0, policy_version 747710 (0.00093) [2022-07-10 13:42:57,300][26022] Updated weights on worker 0-0, policy_version 747720 (0.00090) [2022-07-10 13:42:57,868][25689] Fps is (10 sec: 5502.8, 60 sec: 5541.3, 300 sec: 5551.8). Total num frames: 765668352. Throughput: 0: 5833.7. Samples: 765678166. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:42:57,869][25689] Avg episode reward: [(0, '-3.003')] [2022-07-10 13:42:58,876][26022] Updated weights on worker 0-0, policy_version 747730 (0.00098) [2022-07-10 13:43:00,759][26022] Updated weights on worker 0-0, policy_version 747740 (0.00082) [2022-07-10 13:43:02,888][25689] Fps is (10 sec: 5412.9, 60 sec: 5574.7, 300 sec: 5551.6). Total num frames: 765694976. Throughput: 0: 5833.5. Samples: 765694910. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:02,888][25689] Avg episode reward: [(0, '-2.194')] [2022-07-10 13:43:03,176][26022] Updated weights on worker 0-0, policy_version 747750 (0.00077) [2022-07-10 13:43:04,827][26022] Updated weights on worker 0-0, policy_version 747760 (0.00091) [2022-07-10 13:43:06,746][26022] Updated weights on worker 0-0, policy_version 747770 (0.00082) [2022-07-10 13:43:07,906][25689] Fps is (10 sec: 5406.8, 60 sec: 5556.3, 300 sec: 5549.1). Total num frames: 765722624. Throughput: 0: 5760.0. Samples: 765726570. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:07,908][25689] Avg episode reward: [(0, '-2.196')] [2022-07-10 13:43:08,511][26022] Updated weights on worker 0-0, policy_version 747780 (0.00087) [2022-07-10 13:43:10,303][26022] Updated weights on worker 0-0, policy_version 747790 (0.00095) [2022-07-10 13:43:12,121][26022] Updated weights on worker 0-0, policy_version 747800 (0.00084) [2022-07-10 13:43:12,961][25689] Fps is (10 sec: 5387.6, 60 sec: 5543.0, 300 sec: 5545.1). Total num frames: 765749248. Throughput: 0: 5726.4. Samples: 765759766. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:12,962][25689] Avg episode reward: [(0, '-3.126')] [2022-07-10 13:43:14,001][26022] Updated weights on worker 0-0, policy_version 747810 (0.00091) [2022-07-10 13:43:15,989][26022] Updated weights on worker 0-0, policy_version 747820 (0.00086) [2022-07-10 13:43:17,760][26022] Updated weights on worker 0-0, policy_version 747830 (0.00086) [2022-07-10 13:43:17,963][25689] Fps is (10 sec: 5600.1, 60 sec: 5559.9, 300 sec: 5552.1). Total num frames: 765778944. Throughput: 0: 4892.5. Samples: 765776480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:17,964][25689] Avg episode reward: [(0, '-3.422')] [2022-07-10 13:43:19,683][26022] Updated weights on worker 0-0, policy_version 747840 (0.00085) [2022-07-10 13:43:21,470][26022] Updated weights on worker 0-0, policy_version 747850 (0.00095) [2022-07-10 13:43:22,978][25689] Fps is (10 sec: 5826.8, 60 sec: 5562.6, 300 sec: 5551.9). Total num frames: 765807616. Throughput: 0: 5719.4. Samples: 765809816. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:22,979][25689] Avg episode reward: [(0, '-2.464')] [2022-07-10 13:43:23,283][26022] Updated weights on worker 0-0, policy_version 747860 (0.00092) [2022-07-10 13:43:25,121][26022] Updated weights on worker 0-0, policy_version 747870 (0.00086) [2022-07-10 13:43:27,059][26022] Updated weights on worker 0-0, policy_version 747880 (0.00087) [2022-07-10 13:43:28,011][25689] Fps is (10 sec: 5503.2, 60 sec: 5527.3, 300 sec: 5555.6). Total num frames: 765834240. Throughput: 0: 5821.6. Samples: 765843614. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:28,012][25689] Avg episode reward: [(0, '-2.228')] [2022-07-10 13:43:28,730][26022] Updated weights on worker 0-0, policy_version 747890 (0.00090) [2022-07-10 13:43:30,511][26022] Updated weights on worker 0-0, policy_version 747900 (0.00091) [2022-07-10 13:43:32,412][26022] Updated weights on worker 0-0, policy_version 747910 (0.00088) [2022-07-10 13:43:33,112][25689] Fps is (10 sec: 5355.8, 60 sec: 5526.1, 300 sec: 5540.6). Total num frames: 765861888. Throughput: 0: 4993.4. Samples: 765860386. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:33,112][25689] Avg episode reward: [(0, '-1.671')] [2022-07-10 13:43:34,183][26022] Updated weights on worker 0-0, policy_version 747920 (0.00093) [2022-07-10 13:43:36,204][26022] Updated weights on worker 0-0, policy_version 747930 (0.00090) [2022-07-10 13:43:37,817][26022] Updated weights on worker 0-0, policy_version 747940 (0.00094) [2022-07-10 13:43:38,145][25689] Fps is (10 sec: 5658.9, 60 sec: 5557.7, 300 sec: 5551.0). Total num frames: 765891584. Throughput: 0: 5829.9. Samples: 765894136. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:38,145][25689] Avg episode reward: [(0, '-1.572')] [2022-07-10 13:43:39,719][26022] Updated weights on worker 0-0, policy_version 747950 (0.00084) [2022-07-10 13:43:41,660][26022] Updated weights on worker 0-0, policy_version 747960 (0.00095) [2022-07-10 13:43:43,162][25689] Fps is (10 sec: 5807.5, 60 sec: 5577.8, 300 sec: 5554.5). Total num frames: 765920256. Throughput: 0: 5836.0. Samples: 765927608. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2022-07-10 13:43:43,163][25689] Avg episode reward: [(0, '-1.648')] [2022-07-10 13:43:43,288][26022] Updated weights on worker 0-0, policy_version 747970 (0.00082) [2022-07-10 13:43:45,424][26022] Updated weights on worker 0-0, policy_version 747980 (0.00088) [2022-07-10 13:43:46,951][26022] Updated weights on worker 0-0, policy_version 747990 (0.00093) [2022-07-10 13:43:48,182][25689] Fps is (10 sec: 5406.8, 60 sec: 5513.3, 300 sec: 5541.5). Total num frames: 765945856. Throughput: 0: 4988.1. Samples: 765944230. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:43:48,183][25689] Avg episode reward: [(0, '-1.098')] [2022-07-10 13:43:48,908][26022] Updated weights on worker 0-0, policy_version 748000 (0.00093) [2022-07-10 13:43:50,923][26022] Updated weights on worker 0-0, policy_version 748010 (0.00101) [2022-07-10 13:43:52,574][26022] Updated weights on worker 0-0, policy_version 748020 (0.00089) [2022-07-10 13:43:53,302][25689] Fps is (10 sec: 5453.5, 60 sec: 5540.2, 300 sec: 5546.5). Total num frames: 765975552. Throughput: 0: 5819.7. Samples: 765977886. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:43:53,302][25689] Avg episode reward: [(0, '-1.660')] [2022-07-10 13:43:54,550][26022] Updated weights on worker 0-0, policy_version 748030 (0.00082) [2022-07-10 13:43:56,111][26022] Updated weights on worker 0-0, policy_version 748040 (0.00088) [2022-07-10 13:43:58,129][26022] Updated weights on worker 0-0, policy_version 748050 (0.00094) [2022-07-10 13:43:58,305][25689] Fps is (10 sec: 5664.9, 60 sec: 5540.5, 300 sec: 5543.2). Total num frames: 766003200. Throughput: 0: 5829.7. Samples: 766011666. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:43:58,305][25689] Avg episode reward: [(0, '-3.517')] [2022-07-10 13:43:59,920][26022] Updated weights on worker 0-0, policy_version 748060 (0.00089) [2022-07-10 13:44:01,752][26022] Updated weights on worker 0-0, policy_version 748070 (0.00082) [2022-07-10 13:44:03,315][25689] Fps is (10 sec: 5522.4, 60 sec: 5558.3, 300 sec: 5553.9). Total num frames: 766030848. Throughput: 0: 5013.8. Samples: 766028648. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:03,317][25689] Avg episode reward: [(0, '-3.989')] [2022-07-10 13:44:03,874][26022] Updated weights on worker 0-0, policy_version 748080 (0.00083) [2022-07-10 13:44:05,909][26022] Updated weights on worker 0-0, policy_version 748090 (0.00086) [2022-07-10 13:44:07,569][26022] Updated weights on worker 0-0, policy_version 748100 (0.00086) [2022-07-10 13:44:08,330][25689] Fps is (10 sec: 5413.8, 60 sec: 5541.7, 300 sec: 5548.9). Total num frames: 766057472. Throughput: 0: 5750.4. Samples: 766060086. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:08,332][25689] Avg episode reward: [(0, '-3.541')] [2022-07-10 13:44:09,504][26022] Updated weights on worker 0-0, policy_version 748110 (0.00085) [2022-07-10 13:44:11,295][26022] Updated weights on worker 0-0, policy_version 748120 (0.00088) [2022-07-10 13:44:13,096][26022] Updated weights on worker 0-0, policy_version 748130 (0.00094) [2022-07-10 13:44:13,446][25689] Fps is (10 sec: 5458.1, 60 sec: 5570.0, 300 sec: 5547.6). Total num frames: 766086144. Throughput: 0: 5737.8. Samples: 766093468. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:13,448][25689] Avg episode reward: [(0, '-3.352')] [2022-07-10 13:44:15,028][26022] Updated weights on worker 0-0, policy_version 748140 (0.00091) [2022-07-10 13:44:16,902][26022] Updated weights on worker 0-0, policy_version 748150 (0.00093) [2022-07-10 13:44:18,451][25689] Fps is (10 sec: 5564.7, 60 sec: 5535.8, 300 sec: 5545.4). Total num frames: 766113792. Throughput: 0: 4887.3. Samples: 766110124. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:18,452][25689] Avg episode reward: [(0, '-3.161')] [2022-07-10 13:44:18,811][26022] Updated weights on worker 0-0, policy_version 748160 (0.00090) [2022-07-10 13:44:20,605][26022] Updated weights on worker 0-0, policy_version 748170 (0.00084) [2022-07-10 13:44:22,333][26022] Updated weights on worker 0-0, policy_version 748180 (0.00090) [2022-07-10 13:44:23,479][25689] Fps is (10 sec: 5613.5, 60 sec: 5534.7, 300 sec: 5548.6). Total num frames: 766142464. Throughput: 0: 5692.3. Samples: 766143426. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:23,479][25689] Avg episode reward: [(0, '-1.072')] [2022-07-10 13:44:24,390][26022] Updated weights on worker 0-0, policy_version 748190 (0.00095) [2022-07-10 13:44:26,174][26022] Updated weights on worker 0-0, policy_version 748200 (0.00084) [2022-07-10 13:44:27,963][26022] Updated weights on worker 0-0, policy_version 748210 (0.00088) [2022-07-10 13:44:28,522][25689] Fps is (10 sec: 5490.3, 60 sec: 5533.7, 300 sec: 5541.9). Total num frames: 766169088. Throughput: 0: 5777.6. Samples: 766176750. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:28,523][25689] Avg episode reward: [(0, '1.162')] [2022-07-10 13:44:29,759][26022] Updated weights on worker 0-0, policy_version 748220 (0.00099) [2022-07-10 13:44:31,675][26022] Updated weights on worker 0-0, policy_version 748230 (0.00086) [2022-07-10 13:44:33,624][25689] Fps is (10 sec: 5349.5, 60 sec: 5533.6, 300 sec: 5541.7). Total num frames: 766196736. Throughput: 0: 4944.5. Samples: 766193240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:33,624][25689] Avg episode reward: [(0, '0.018')] [2022-07-10 13:44:33,684][26022] Updated weights on worker 0-0, policy_version 748240 (0.00080) [2022-07-10 13:44:35,243][26022] Updated weights on worker 0-0, policy_version 748250 (0.00084) [2022-07-10 13:44:35,916][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:44:35,930][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000748253_766211072.pth [2022-07-10 13:44:35,930][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000746302_764213248.pth [2022-07-10 13:44:37,307][26022] Updated weights on worker 0-0, policy_version 748260 (0.00087) [2022-07-10 13:44:38,649][25689] Fps is (10 sec: 5662.8, 60 sec: 5534.3, 300 sec: 5546.1). Total num frames: 766226432. Throughput: 0: 5794.3. Samples: 766227158. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:38,649][25689] Avg episode reward: [(0, '-0.617')] [2022-07-10 13:44:39,117][26022] Updated weights on worker 0-0, policy_version 748270 (0.00096) [2022-07-10 13:44:40,884][26022] Updated weights on worker 0-0, policy_version 748280 (0.00567) [2022-07-10 13:44:42,711][26022] Updated weights on worker 0-0, policy_version 748290 (0.00083) [2022-07-10 13:44:43,667][25689] Fps is (10 sec: 5710.0, 60 sec: 5517.3, 300 sec: 5546.2). Total num frames: 766254080. Throughput: 0: 5804.9. Samples: 766260616. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:43,667][25689] Avg episode reward: [(0, '-1.374')] [2022-07-10 13:44:44,486][26022] Updated weights on worker 0-0, policy_version 748300 (0.00089) [2022-07-10 13:44:46,519][26022] Updated weights on worker 0-0, policy_version 748310 (0.00085) [2022-07-10 13:44:47,964][26022] Updated weights on worker 0-0, policy_version 748320 (0.00091) [2022-07-10 13:44:48,687][25689] Fps is (10 sec: 5508.7, 60 sec: 5551.2, 300 sec: 5547.5). Total num frames: 766281728. Throughput: 0: 4981.6. Samples: 766277202. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:48,687][25689] Avg episode reward: [(0, '-2.214')] [2022-07-10 13:44:50,022][26022] Updated weights on worker 0-0, policy_version 748330 (0.00095) [2022-07-10 13:44:51,947][26022] Updated weights on worker 0-0, policy_version 748340 (0.00085) [2022-07-10 13:44:53,649][26022] Updated weights on worker 0-0, policy_version 748350 (0.00088) [2022-07-10 13:44:53,779][25689] Fps is (10 sec: 5670.5, 60 sec: 5553.7, 300 sec: 5546.0). Total num frames: 766311424. Throughput: 0: 5825.5. Samples: 766310656. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:53,780][25689] Avg episode reward: [(0, '-2.387')] [2022-07-10 13:44:55,591][26022] Updated weights on worker 0-0, policy_version 748360 (0.00087) [2022-07-10 13:44:57,224][26022] Updated weights on worker 0-0, policy_version 748370 (0.00090) [2022-07-10 13:44:58,832][25689] Fps is (10 sec: 5551.1, 60 sec: 5532.2, 300 sec: 5545.1). Total num frames: 766338048. Throughput: 0: 5815.9. Samples: 766344544. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:44:58,833][25689] Avg episode reward: [(0, '-3.077')] [2022-07-10 13:44:59,124][26022] Updated weights on worker 0-0, policy_version 748380 (0.00090) [2022-07-10 13:45:00,948][26022] Updated weights on worker 0-0, policy_version 748390 (0.00085) [2022-07-10 13:45:03,076][26022] Updated weights on worker 0-0, policy_version 748400 (0.00084) [2022-07-10 13:45:03,851][25689] Fps is (10 sec: 5388.8, 60 sec: 5531.4, 300 sec: 5548.3). Total num frames: 766365696. Throughput: 0: 5000.8. Samples: 766361552. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:03,851][25689] Avg episode reward: [(0, '-2.846')] [2022-07-10 13:45:05,030][26022] Updated weights on worker 0-0, policy_version 748410 (0.00084) [2022-07-10 13:45:06,707][26022] Updated weights on worker 0-0, policy_version 748420 (0.00093) [2022-07-10 13:45:08,704][26022] Updated weights on worker 0-0, policy_version 748430 (0.00099) [2022-07-10 13:45:08,884][25689] Fps is (10 sec: 5500.9, 60 sec: 5546.6, 300 sec: 5552.4). Total num frames: 766393344. Throughput: 0: 5741.6. Samples: 766393170. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:08,885][25689] Avg episode reward: [(0, '-2.744')] [2022-07-10 13:45:10,437][26022] Updated weights on worker 0-0, policy_version 748440 (0.00110) [2022-07-10 13:45:12,352][26022] Updated weights on worker 0-0, policy_version 748450 (0.00102) [2022-07-10 13:45:14,003][25689] Fps is (10 sec: 5547.6, 60 sec: 5546.4, 300 sec: 5547.1). Total num frames: 766422016. Throughput: 0: 5754.9. Samples: 766427040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:14,004][25689] Avg episode reward: [(0, '-1.260')] [2022-07-10 13:45:14,064][26022] Updated weights on worker 0-0, policy_version 748460 (0.00093) [2022-07-10 13:45:16,102][26022] Updated weights on worker 0-0, policy_version 748470 (0.00084) [2022-07-10 13:45:17,595][26022] Updated weights on worker 0-0, policy_version 748480 (0.00082) [2022-07-10 13:45:19,053][25689] Fps is (10 sec: 5437.8, 60 sec: 5525.3, 300 sec: 5540.9). Total num frames: 766448640. Throughput: 0: 4898.4. Samples: 766443594. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:19,054][25689] Avg episode reward: [(0, '-1.651')] [2022-07-10 13:45:19,726][26022] Updated weights on worker 0-0, policy_version 748490 (0.00084) [2022-07-10 13:45:21,440][26022] Updated weights on worker 0-0, policy_version 748500 (0.00100) [2022-07-10 13:45:23,244][26022] Updated weights on worker 0-0, policy_version 748510 (0.00083) [2022-07-10 13:45:24,070][25689] Fps is (10 sec: 5696.1, 60 sec: 5560.1, 300 sec: 5552.2). Total num frames: 766479360. Throughput: 0: 5708.8. Samples: 766476982. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:24,071][25689] Avg episode reward: [(0, '-1.480')] [2022-07-10 13:45:25,311][26022] Updated weights on worker 0-0, policy_version 748520 (0.00087) [2022-07-10 13:45:26,960][26022] Updated weights on worker 0-0, policy_version 748530 (0.00086) [2022-07-10 13:45:28,886][26022] Updated weights on worker 0-0, policy_version 748540 (0.00086) [2022-07-10 13:45:29,156][25689] Fps is (10 sec: 5676.2, 60 sec: 5556.3, 300 sec: 5546.1). Total num frames: 766505984. Throughput: 0: 5791.5. Samples: 766510570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:29,156][25689] Avg episode reward: [(0, '-2.004')] [2022-07-10 13:45:30,813][26022] Updated weights on worker 0-0, policy_version 748550 (0.00099) [2022-07-10 13:45:32,392][26022] Updated weights on worker 0-0, policy_version 748560 (0.00085) [2022-07-10 13:45:34,298][25689] Fps is (10 sec: 5406.6, 60 sec: 5569.5, 300 sec: 5547.4). Total num frames: 766534656. Throughput: 0: 5778.7. Samples: 766544318. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:34,298][25689] Avg episode reward: [(0, '-1.323')] [2022-07-10 13:45:34,351][26022] Updated weights on worker 0-0, policy_version 748570 (0.00086) [2022-07-10 13:45:35,974][26022] Updated weights on worker 0-0, policy_version 748580 (0.00087) [2022-07-10 13:45:38,005][26022] Updated weights on worker 0-0, policy_version 748590 (0.00079) [2022-07-10 13:45:39,301][25689] Fps is (10 sec: 5652.1, 60 sec: 5554.5, 300 sec: 5548.2). Total num frames: 766563328. Throughput: 0: 5813.4. Samples: 766561304. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:39,302][25689] Avg episode reward: [(0, '-0.905')] [2022-07-10 13:45:39,655][26022] Updated weights on worker 0-0, policy_version 748600 (0.00094) [2022-07-10 13:45:41,651][26022] Updated weights on worker 0-0, policy_version 748610 (0.00097) [2022-07-10 13:45:43,340][26022] Updated weights on worker 0-0, policy_version 748620 (0.00086) [2022-07-10 13:45:44,391][25689] Fps is (10 sec: 5681.4, 60 sec: 5564.8, 300 sec: 5546.6). Total num frames: 766592000. Throughput: 0: 5801.9. Samples: 766594882. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:44,391][25689] Avg episode reward: [(0, '-1.153')] [2022-07-10 13:45:45,278][26022] Updated weights on worker 0-0, policy_version 748630 (0.00093) [2022-07-10 13:45:46,956][26022] Updated weights on worker 0-0, policy_version 748640 (0.00083) [2022-07-10 13:45:48,904][26022] Updated weights on worker 0-0, policy_version 748650 (0.00084) [2022-07-10 13:45:49,466][25689] Fps is (10 sec: 5641.6, 60 sec: 5576.6, 300 sec: 5545.9). Total num frames: 766620672. Throughput: 0: 5803.5. Samples: 766628440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:49,466][25689] Avg episode reward: [(0, '-0.633')] [2022-07-10 13:45:50,801][26022] Updated weights on worker 0-0, policy_version 748660 (0.00079) [2022-07-10 13:45:52,660][26022] Updated weights on worker 0-0, policy_version 748670 (0.00084) [2022-07-10 13:45:54,531][25689] Fps is (10 sec: 5453.0, 60 sec: 5528.6, 300 sec: 5545.0). Total num frames: 766647296. Throughput: 0: 4991.0. Samples: 766645304. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:54,533][25689] Avg episode reward: [(0, '-0.016')] [2022-07-10 13:45:54,588][26022] Updated weights on worker 0-0, policy_version 748680 (0.00090) [2022-07-10 13:45:56,216][26022] Updated weights on worker 0-0, policy_version 748690 (0.00090) [2022-07-10 13:45:58,116][26022] Updated weights on worker 0-0, policy_version 748700 (0.00093) [2022-07-10 13:45:59,538][25689] Fps is (10 sec: 5591.6, 60 sec: 5583.4, 300 sec: 5555.6). Total num frames: 766676992. Throughput: 0: 5794.4. Samples: 766678564. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:45:59,539][25689] Avg episode reward: [(0, '0.831')] [2022-07-10 13:46:00,172][26022] Updated weights on worker 0-0, policy_version 748710 (0.00093) [2022-07-10 13:46:02,215][26022] Updated weights on worker 0-0, policy_version 748720 (0.00081) [2022-07-10 13:46:03,991][26022] Updated weights on worker 0-0, policy_version 748730 (0.00089) [2022-07-10 13:46:04,587][25689] Fps is (10 sec: 5499.2, 60 sec: 5546.9, 300 sec: 5548.1). Total num frames: 766702592. Throughput: 0: 5688.4. Samples: 766709764. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:04,587][25689] Avg episode reward: [(0, '0.706')] [2022-07-10 13:46:06,039][26022] Updated weights on worker 0-0, policy_version 748740 (0.00094) [2022-07-10 13:46:07,733][26022] Updated weights on worker 0-0, policy_version 748750 (0.00069) [2022-07-10 13:46:09,594][25689] Fps is (10 sec: 5193.5, 60 sec: 5532.5, 300 sec: 5542.5). Total num frames: 766729216. Throughput: 0: 4874.9. Samples: 766726560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:09,594][25689] Avg episode reward: [(0, '0.355')] [2022-07-10 13:46:09,720][26022] Updated weights on worker 0-0, policy_version 748760 (0.00090) [2022-07-10 13:46:11,309][26022] Updated weights on worker 0-0, policy_version 748770 (0.00089) [2022-07-10 13:46:13,305][26022] Updated weights on worker 0-0, policy_version 748780 (0.00085) [2022-07-10 13:46:14,668][25689] Fps is (10 sec: 5586.5, 60 sec: 5553.4, 300 sec: 5544.9). Total num frames: 766758912. Throughput: 0: 5699.8. Samples: 766760080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:14,669][25689] Avg episode reward: [(0, '-0.330')] [2022-07-10 13:46:14,989][26022] Updated weights on worker 0-0, policy_version 748790 (0.00088) [2022-07-10 13:46:17,067][26022] Updated weights on worker 0-0, policy_version 748800 (0.00088) [2022-07-10 13:46:18,665][26022] Updated weights on worker 0-0, policy_version 748810 (0.00093) [2022-07-10 13:46:19,676][25689] Fps is (10 sec: 5687.6, 60 sec: 5574.2, 300 sec: 5548.5). Total num frames: 766786560. Throughput: 0: 5711.7. Samples: 766793584. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:19,676][25689] Avg episode reward: [(0, '-1.995')] [2022-07-10 13:46:20,667][26022] Updated weights on worker 0-0, policy_version 748820 (0.00084) [2022-07-10 13:46:22,419][26022] Updated weights on worker 0-0, policy_version 748830 (0.00085) [2022-07-10 13:46:24,332][26022] Updated weights on worker 0-0, policy_version 748840 (0.00090) [2022-07-10 13:46:24,712][25689] Fps is (10 sec: 5403.4, 60 sec: 5504.8, 300 sec: 5534.5). Total num frames: 766813184. Throughput: 0: 5006.4. Samples: 766810520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:24,714][25689] Avg episode reward: [(0, '-2.811')] [2022-07-10 13:46:26,011][26022] Updated weights on worker 0-0, policy_version 748850 (0.00097) [2022-07-10 13:46:27,788][26022] Updated weights on worker 0-0, policy_version 748860 (0.00089) [2022-07-10 13:46:29,673][26022] Updated weights on worker 0-0, policy_version 748870 (0.00092) [2022-07-10 13:46:29,743][25689] Fps is (10 sec: 5594.4, 60 sec: 5560.5, 300 sec: 5545.5). Total num frames: 766842880. Throughput: 0: 5815.0. Samples: 766843728. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:29,744][25689] Avg episode reward: [(0, '-3.427')] [2022-07-10 13:46:31,784][26022] Updated weights on worker 0-0, policy_version 748880 (0.00084) [2022-07-10 13:46:33,347][26022] Updated weights on worker 0-0, policy_version 748890 (0.00084) [2022-07-10 13:46:34,797][25689] Fps is (10 sec: 5686.2, 60 sec: 5551.7, 300 sec: 5544.8). Total num frames: 766870528. Throughput: 0: 5833.4. Samples: 766877500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:34,798][25689] Avg episode reward: [(0, '-3.160')] [2022-07-10 13:46:35,464][26022] Updated weights on worker 0-0, policy_version 748900 (0.00085) [2022-07-10 13:46:36,119][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:46:36,128][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000748904_766877696.pth [2022-07-10 13:46:36,128][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000746953_764879872.pth [2022-07-10 13:46:37,001][26022] Updated weights on worker 0-0, policy_version 748910 (0.00088) [2022-07-10 13:46:38,975][26022] Updated weights on worker 0-0, policy_version 748920 (0.00087) [2022-07-10 13:46:39,836][25689] Fps is (10 sec: 5478.5, 60 sec: 5531.5, 300 sec: 5544.2). Total num frames: 766898176. Throughput: 0: 4989.8. Samples: 766894180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:39,837][25689] Avg episode reward: [(0, '-2.259')] [2022-07-10 13:46:40,573][26022] Updated weights on worker 0-0, policy_version 748930 (0.00084) [2022-07-10 13:46:42,595][26022] Updated weights on worker 0-0, policy_version 748940 (0.00083) [2022-07-10 13:46:44,344][26022] Updated weights on worker 0-0, policy_version 748950 (0.00090) [2022-07-10 13:46:44,839][25689] Fps is (10 sec: 5608.3, 60 sec: 5539.4, 300 sec: 5541.1). Total num frames: 766926848. Throughput: 0: 5828.9. Samples: 766927838. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:44,840][25689] Avg episode reward: [(0, '-0.378')] [2022-07-10 13:46:46,270][26022] Updated weights on worker 0-0, policy_version 748960 (0.00082) [2022-07-10 13:46:48,143][26022] Updated weights on worker 0-0, policy_version 748970 (0.00084) [2022-07-10 13:46:49,824][26022] Updated weights on worker 0-0, policy_version 748980 (0.00089) [2022-07-10 13:46:49,871][25689] Fps is (10 sec: 5714.8, 60 sec: 5543.4, 300 sec: 5545.4). Total num frames: 766955520. Throughput: 0: 5828.2. Samples: 766961036. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:49,871][25689] Avg episode reward: [(0, '0.061')] [2022-07-10 13:46:51,934][26022] Updated weights on worker 0-0, policy_version 748990 (0.00084) [2022-07-10 13:46:53,685][26022] Updated weights on worker 0-0, policy_version 749000 (0.00087) [2022-07-10 13:46:54,951][25689] Fps is (10 sec: 5468.6, 60 sec: 5542.1, 300 sec: 5541.5). Total num frames: 766982144. Throughput: 0: 4981.8. Samples: 766977902. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:54,951][25689] Avg episode reward: [(0, '-0.485')] [2022-07-10 13:46:55,420][26022] Updated weights on worker 0-0, policy_version 749010 (0.00083) [2022-07-10 13:46:57,401][26022] Updated weights on worker 0-0, policy_version 749020 (0.00091) [2022-07-10 13:46:59,054][26022] Updated weights on worker 0-0, policy_version 749030 (0.00089) [2022-07-10 13:46:59,960][25689] Fps is (10 sec: 5480.9, 60 sec: 5524.9, 300 sec: 5555.3). Total num frames: 767010816. Throughput: 0: 5825.5. Samples: 767011408. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:46:59,960][25689] Avg episode reward: [(0, '0.249')] [2022-07-10 13:47:01,154][26022] Updated weights on worker 0-0, policy_version 749040 (0.00090) [2022-07-10 13:47:03,115][26022] Updated weights on worker 0-0, policy_version 749050 (0.00092) [2022-07-10 13:47:04,966][25689] Fps is (10 sec: 5418.9, 60 sec: 5528.8, 300 sec: 5544.9). Total num frames: 767036416. Throughput: 0: 5709.1. Samples: 767042744. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:47:04,968][25689] Avg episode reward: [(0, '0.603')] [2022-07-10 13:47:05,106][26022] Updated weights on worker 0-0, policy_version 749060 (0.00086) [2022-07-10 13:47:06,945][26022] Updated weights on worker 0-0, policy_version 749070 (0.00096) [2022-07-10 13:47:08,982][26022] Updated weights on worker 0-0, policy_version 749080 (0.00104) [2022-07-10 13:47:10,004][25689] Fps is (10 sec: 5403.2, 60 sec: 5559.9, 300 sec: 5549.4). Total num frames: 767065088. Throughput: 0: 4878.0. Samples: 767059246. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 13:47:10,004][25689] Avg episode reward: [(0, '0.179')] [2022-07-10 13:47:10,657][26022] Updated weights on worker 0-0, policy_version 749090 (0.00087) [2022-07-10 13:47:12,743][26022] Updated weights on worker 0-0, policy_version 749100 (0.00084) [2022-07-10 13:47:14,378][26022] Updated weights on worker 0-0, policy_version 749110 (0.00537) [2022-07-10 13:47:15,065][25689] Fps is (10 sec: 5475.7, 60 sec: 5510.3, 300 sec: 5541.4). Total num frames: 767091712. Throughput: 0: 5683.4. Samples: 767092218. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:15,066][25689] Avg episode reward: [(0, '-0.004')] [2022-07-10 13:47:16,396][26022] Updated weights on worker 0-0, policy_version 749120 (0.00086) [2022-07-10 13:47:18,130][26022] Updated weights on worker 0-0, policy_version 749130 (0.00094) [2022-07-10 13:47:20,023][26022] Updated weights on worker 0-0, policy_version 749140 (0.00097) [2022-07-10 13:47:20,089][25689] Fps is (10 sec: 5381.4, 60 sec: 5508.7, 300 sec: 5538.4). Total num frames: 767119360. Throughput: 0: 5665.3. Samples: 767125448. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:20,091][25689] Avg episode reward: [(0, '-0.180')] [2022-07-10 13:47:21,907][26022] Updated weights on worker 0-0, policy_version 749150 (0.00082) [2022-07-10 13:47:23,851][26022] Updated weights on worker 0-0, policy_version 749160 (0.00082) [2022-07-10 13:47:25,106][25689] Fps is (10 sec: 5608.6, 60 sec: 5544.4, 300 sec: 5538.4). Total num frames: 767148032. Throughput: 0: 4928.3. Samples: 767142000. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:25,107][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 13:47:25,367][26022] Updated weights on worker 0-0, policy_version 749170 (0.00091) [2022-07-10 13:47:27,575][26022] Updated weights on worker 0-0, policy_version 749180 (0.00079) [2022-07-10 13:47:29,147][26022] Updated weights on worker 0-0, policy_version 749190 (0.00091) [2022-07-10 13:47:30,162][25689] Fps is (10 sec: 5591.2, 60 sec: 5508.2, 300 sec: 5539.0). Total num frames: 767175680. Throughput: 0: 5768.8. Samples: 767175534. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:30,164][25689] Avg episode reward: [(0, '-0.100')] [2022-07-10 13:47:31,044][26022] Updated weights on worker 0-0, policy_version 749200 (0.00113) [2022-07-10 13:47:33,025][26022] Updated weights on worker 0-0, policy_version 749210 (0.00095) [2022-07-10 13:47:34,636][26022] Updated weights on worker 0-0, policy_version 749220 (0.00082) [2022-07-10 13:47:35,243][25689] Fps is (10 sec: 5556.1, 60 sec: 5522.7, 300 sec: 5541.0). Total num frames: 767204352. Throughput: 0: 5787.7. Samples: 767209004. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:35,245][25689] Avg episode reward: [(0, '-0.951')] [2022-07-10 13:47:36,534][26022] Updated weights on worker 0-0, policy_version 749230 (0.00091) [2022-07-10 13:47:38,265][26022] Updated weights on worker 0-0, policy_version 749240 (0.00088) [2022-07-10 13:47:40,176][26022] Updated weights on worker 0-0, policy_version 749250 (0.00089) [2022-07-10 13:47:40,285][25689] Fps is (10 sec: 5563.4, 60 sec: 5522.4, 300 sec: 5541.2). Total num frames: 767232000. Throughput: 0: 4969.2. Samples: 767225806. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:40,286][25689] Avg episode reward: [(0, '-1.270')] [2022-07-10 13:47:42,160][26022] Updated weights on worker 0-0, policy_version 749260 (0.00086) [2022-07-10 13:47:43,889][26022] Updated weights on worker 0-0, policy_version 749270 (0.00090) [2022-07-10 13:47:45,303][25689] Fps is (10 sec: 5598.1, 60 sec: 5521.0, 300 sec: 5538.5). Total num frames: 767260672. Throughput: 0: 5807.4. Samples: 767259292. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:45,304][25689] Avg episode reward: [(0, '-1.059')] [2022-07-10 13:47:45,612][26022] Updated weights on worker 0-0, policy_version 749280 (0.00084) [2022-07-10 13:47:47,678][26022] Updated weights on worker 0-0, policy_version 749290 (0.00090) [2022-07-10 13:47:49,228][26022] Updated weights on worker 0-0, policy_version 749300 (0.00085) [2022-07-10 13:47:50,327][25689] Fps is (10 sec: 5608.4, 60 sec: 5504.8, 300 sec: 5538.8). Total num frames: 767288320. Throughput: 0: 5818.3. Samples: 767292862. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:50,328][25689] Avg episode reward: [(0, '-1.200')] [2022-07-10 13:47:51,280][26022] Updated weights on worker 0-0, policy_version 749310 (0.00092) [2022-07-10 13:47:53,107][26022] Updated weights on worker 0-0, policy_version 749320 (0.00090) [2022-07-10 13:47:54,848][26022] Updated weights on worker 0-0, policy_version 749330 (0.00084) [2022-07-10 13:47:55,388][25689] Fps is (10 sec: 5584.4, 60 sec: 5540.4, 300 sec: 5541.3). Total num frames: 767316992. Throughput: 0: 5000.9. Samples: 767309750. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:47:55,389][25689] Avg episode reward: [(0, '-0.650')] [2022-07-10 13:47:56,868][26022] Updated weights on worker 0-0, policy_version 749340 (0.00091) [2022-07-10 13:47:58,440][26022] Updated weights on worker 0-0, policy_version 749350 (0.00088) [2022-07-10 13:48:00,392][25689] Fps is (10 sec: 5595.7, 60 sec: 5523.9, 300 sec: 5545.0). Total num frames: 767344640. Throughput: 0: 5854.4. Samples: 767343520. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:00,393][25689] Avg episode reward: [(0, '-1.185')] [2022-07-10 13:48:00,410][26022] Updated weights on worker 0-0, policy_version 749360 (0.00096) [2022-07-10 13:48:02,360][26022] Updated weights on worker 0-0, policy_version 749370 (0.00089) [2022-07-10 13:48:04,486][26022] Updated weights on worker 0-0, policy_version 749380 (0.00098) [2022-07-10 13:48:05,441][25689] Fps is (10 sec: 5296.9, 60 sec: 5520.0, 300 sec: 5537.6). Total num frames: 767370240. Throughput: 0: 5753.9. Samples: 767375162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:05,442][25689] Avg episode reward: [(0, '-0.676')] [2022-07-10 13:48:06,128][26022] Updated weights on worker 0-0, policy_version 749390 (0.00091) [2022-07-10 13:48:08,043][26022] Updated weights on worker 0-0, policy_version 749400 (0.00085) [2022-07-10 13:48:09,766][26022] Updated weights on worker 0-0, policy_version 749410 (0.00090) [2022-07-10 13:48:10,446][25689] Fps is (10 sec: 5397.9, 60 sec: 5523.0, 300 sec: 5545.4). Total num frames: 767398912. Throughput: 0: 4924.1. Samples: 767391932. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:10,447][25689] Avg episode reward: [(0, '-0.928')] [2022-07-10 13:48:11,659][26022] Updated weights on worker 0-0, policy_version 749420 (0.00093) [2022-07-10 13:48:13,508][26022] Updated weights on worker 0-0, policy_version 749430 (0.00085) [2022-07-10 13:48:15,267][26022] Updated weights on worker 0-0, policy_version 749440 (0.00086) [2022-07-10 13:48:15,536][25689] Fps is (10 sec: 5680.7, 60 sec: 5554.3, 300 sec: 5540.3). Total num frames: 767427584. Throughput: 0: 5751.8. Samples: 767425632. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:15,536][25689] Avg episode reward: [(0, '-2.627')] [2022-07-10 13:48:17,092][26022] Updated weights on worker 0-0, policy_version 749450 (0.00091) [2022-07-10 13:48:18,975][26022] Updated weights on worker 0-0, policy_version 749460 (0.00087) [2022-07-10 13:48:20,551][25689] Fps is (10 sec: 5573.8, 60 sec: 5555.1, 300 sec: 5536.9). Total num frames: 767455232. Throughput: 0: 5754.9. Samples: 767459532. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:20,551][25689] Avg episode reward: [(0, '-3.292')] [2022-07-10 13:48:20,745][26022] Updated weights on worker 0-0, policy_version 749470 (0.00089) [2022-07-10 13:48:22,599][26022] Updated weights on worker 0-0, policy_version 749480 (0.00054) [2022-07-10 13:48:24,246][26022] Updated weights on worker 0-0, policy_version 749490 (0.00076) [2022-07-10 13:48:25,616][25689] Fps is (10 sec: 5485.7, 60 sec: 5533.8, 300 sec: 5539.7). Total num frames: 767482880. Throughput: 0: 5856.5. Samples: 767493314. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:25,616][25689] Avg episode reward: [(0, '-3.948')] [2022-07-10 13:48:26,425][26022] Updated weights on worker 0-0, policy_version 749500 (0.00081) [2022-07-10 13:48:28,085][26022] Updated weights on worker 0-0, policy_version 749510 (0.00088) [2022-07-10 13:48:30,081][26022] Updated weights on worker 0-0, policy_version 749520 (0.00091) [2022-07-10 13:48:30,621][25689] Fps is (10 sec: 5796.1, 60 sec: 5589.2, 300 sec: 5551.8). Total num frames: 767513600. Throughput: 0: 5850.8. Samples: 767509970. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:30,622][25689] Avg episode reward: [(0, '-3.105')] [2022-07-10 13:48:31,804][26022] Updated weights on worker 0-0, policy_version 749530 (0.00088) [2022-07-10 13:48:33,612][26022] Updated weights on worker 0-0, policy_version 749540 (0.00090) [2022-07-10 13:48:35,472][26022] Updated weights on worker 0-0, policy_version 749550 (0.00092) [2022-07-10 13:48:35,766][25689] Fps is (10 sec: 5548.6, 60 sec: 5532.6, 300 sec: 5535.9). Total num frames: 767539200. Throughput: 0: 5813.7. Samples: 767543246. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:35,769][25689] Avg episode reward: [(0, '-5.196')] [2022-07-10 13:48:36,159][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:48:36,170][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000749553_767542272.pth [2022-07-10 13:48:36,171][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000747602_765544448.pth [2022-07-10 13:48:37,307][26022] Updated weights on worker 0-0, policy_version 749560 (0.00088) [2022-07-10 13:48:39,311][26022] Updated weights on worker 0-0, policy_version 749570 (0.00096) [2022-07-10 13:48:40,812][25689] Fps is (10 sec: 5426.2, 60 sec: 5566.1, 300 sec: 5538.9). Total num frames: 767568896. Throughput: 0: 5781.4. Samples: 767576668. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:40,812][25689] Avg episode reward: [(0, '-6.443')] [2022-07-10 13:48:41,007][26022] Updated weights on worker 0-0, policy_version 749580 (0.00087) [2022-07-10 13:48:42,863][26022] Updated weights on worker 0-0, policy_version 749590 (0.00098) [2022-07-10 13:48:44,606][26022] Updated weights on worker 0-0, policy_version 749600 (0.00087) [2022-07-10 13:48:45,848][25689] Fps is (10 sec: 5687.5, 60 sec: 5547.5, 300 sec: 5545.4). Total num frames: 767596544. Throughput: 0: 4945.3. Samples: 767593372. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:45,849][25689] Avg episode reward: [(0, '-4.909')] [2022-07-10 13:48:46,595][26022] Updated weights on worker 0-0, policy_version 749610 (0.00616) [2022-07-10 13:48:48,306][26022] Updated weights on worker 0-0, policy_version 749620 (0.00082) [2022-07-10 13:48:50,310][26022] Updated weights on worker 0-0, policy_version 749630 (0.01134) [2022-07-10 13:48:50,862][25689] Fps is (10 sec: 5501.9, 60 sec: 5548.4, 300 sec: 5540.5). Total num frames: 767624192. Throughput: 0: 5762.6. Samples: 767626610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:50,862][25689] Avg episode reward: [(0, '-3.732')] [2022-07-10 13:48:52,295][26022] Updated weights on worker 0-0, policy_version 749640 (0.00434) [2022-07-10 13:48:53,990][26022] Updated weights on worker 0-0, policy_version 749650 (0.00083) [2022-07-10 13:48:55,902][25689] Fps is (10 sec: 5398.3, 60 sec: 5516.5, 300 sec: 5536.4). Total num frames: 767650816. Throughput: 0: 5786.1. Samples: 767659754. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:48:55,904][25689] Avg episode reward: [(0, '-3.467')] [2022-07-10 13:48:55,923][26022] Updated weights on worker 0-0, policy_version 749660 (0.00086) [2022-07-10 13:48:57,467][26022] Updated weights on worker 0-0, policy_version 749670 (0.00093) [2022-07-10 13:48:59,602][26022] Updated weights on worker 0-0, policy_version 749680 (0.00090) [2022-07-10 13:49:00,907][25689] Fps is (10 sec: 5606.9, 60 sec: 5550.3, 300 sec: 5543.4). Total num frames: 767680512. Throughput: 0: 4966.7. Samples: 767676474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:00,907][25689] Avg episode reward: [(0, '-3.775')] [2022-07-10 13:49:01,256][26022] Updated weights on worker 0-0, policy_version 749690 (0.00090) [2022-07-10 13:49:03,626][26022] Updated weights on worker 0-0, policy_version 749700 (0.00087) [2022-07-10 13:49:05,531][26022] Updated weights on worker 0-0, policy_version 749710 (0.00092) [2022-07-10 13:49:05,926][25689] Fps is (10 sec: 5414.5, 60 sec: 5536.1, 300 sec: 5536.4). Total num frames: 767705088. Throughput: 0: 5694.1. Samples: 767707692. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:05,927][25689] Avg episode reward: [(0, '-0.917')] [2022-07-10 13:49:07,348][26022] Updated weights on worker 0-0, policy_version 749720 (0.00093) [2022-07-10 13:49:09,279][26022] Updated weights on worker 0-0, policy_version 749730 (0.00093) [2022-07-10 13:49:10,931][25689] Fps is (10 sec: 5209.6, 60 sec: 5519.1, 300 sec: 5535.0). Total num frames: 767732736. Throughput: 0: 5692.3. Samples: 767740850. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:10,933][25689] Avg episode reward: [(0, '-0.680')] [2022-07-10 13:49:10,959][26022] Updated weights on worker 0-0, policy_version 749740 (0.00086) [2022-07-10 13:49:12,951][26022] Updated weights on worker 0-0, policy_version 749750 (0.00092) [2022-07-10 13:49:14,748][26022] Updated weights on worker 0-0, policy_version 749760 (0.00086) [2022-07-10 13:49:16,035][25689] Fps is (10 sec: 5570.9, 60 sec: 5517.8, 300 sec: 5536.6). Total num frames: 767761408. Throughput: 0: 4862.0. Samples: 767757642. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:16,038][25689] Avg episode reward: [(0, '-0.759')] [2022-07-10 13:49:16,617][26022] Updated weights on worker 0-0, policy_version 749770 (0.00085) [2022-07-10 13:49:18,272][26022] Updated weights on worker 0-0, policy_version 749780 (0.00094) [2022-07-10 13:49:20,138][26022] Updated weights on worker 0-0, policy_version 749790 (0.00090) [2022-07-10 13:49:21,039][25689] Fps is (10 sec: 5369.5, 60 sec: 5485.0, 300 sec: 5526.8). Total num frames: 767787008. Throughput: 0: 5689.8. Samples: 767791022. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:21,041][25689] Avg episode reward: [(0, '-1.181')] [2022-07-10 13:49:21,916][26022] Updated weights on worker 0-0, policy_version 749800 (0.00082) [2022-07-10 13:49:24,116][26022] Updated weights on worker 0-0, policy_version 749810 (0.00090) [2022-07-10 13:49:25,595][26022] Updated weights on worker 0-0, policy_version 749820 (0.00088) [2022-07-10 13:49:26,043][25689] Fps is (10 sec: 5627.4, 60 sec: 5541.3, 300 sec: 5541.3). Total num frames: 767817728. Throughput: 0: 5798.9. Samples: 767824354. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:26,045][25689] Avg episode reward: [(0, '-1.662')] [2022-07-10 13:49:27,674][26022] Updated weights on worker 0-0, policy_version 749830 (0.00086) [2022-07-10 13:49:29,353][26022] Updated weights on worker 0-0, policy_version 749840 (0.00087) [2022-07-10 13:49:31,055][25689] Fps is (10 sec: 5725.3, 60 sec: 5473.0, 300 sec: 5539.5). Total num frames: 767844352. Throughput: 0: 4979.1. Samples: 767841046. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:31,055][25689] Avg episode reward: [(0, '-0.994')] [2022-07-10 13:49:31,377][26022] Updated weights on worker 0-0, policy_version 749850 (0.00091) [2022-07-10 13:49:33,054][26022] Updated weights on worker 0-0, policy_version 749860 (0.00095) [2022-07-10 13:49:35,088][26022] Updated weights on worker 0-0, policy_version 749870 (0.00091) [2022-07-10 13:49:36,135][25689] Fps is (10 sec: 5580.6, 60 sec: 5546.6, 300 sec: 5538.5). Total num frames: 767874048. Throughput: 0: 5832.4. Samples: 767874874. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:36,139][25689] Avg episode reward: [(0, '-0.715')] [2022-07-10 13:49:36,557][26022] Updated weights on worker 0-0, policy_version 749880 (0.00084) [2022-07-10 13:49:38,736][26022] Updated weights on worker 0-0, policy_version 749890 (0.00088) [2022-07-10 13:49:40,326][26022] Updated weights on worker 0-0, policy_version 749900 (0.00091) [2022-07-10 13:49:41,224][25689] Fps is (10 sec: 5639.1, 60 sec: 5508.8, 300 sec: 5537.1). Total num frames: 767901696. Throughput: 0: 5825.5. Samples: 767908610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:41,224][25689] Avg episode reward: [(0, '-1.616')] [2022-07-10 13:49:42,154][26022] Updated weights on worker 0-0, policy_version 749910 (0.00091) [2022-07-10 13:49:44,041][26022] Updated weights on worker 0-0, policy_version 749920 (0.00100) [2022-07-10 13:49:45,751][26022] Updated weights on worker 0-0, policy_version 749930 (0.00093) [2022-07-10 13:49:46,247][25689] Fps is (10 sec: 5468.9, 60 sec: 5510.1, 300 sec: 5537.1). Total num frames: 767929344. Throughput: 0: 4989.1. Samples: 767925150. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:46,247][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 13:49:47,724][26022] Updated weights on worker 0-0, policy_version 749940 (0.00091) [2022-07-10 13:49:49,509][26022] Updated weights on worker 0-0, policy_version 749950 (0.00085) [2022-07-10 13:49:51,275][25689] Fps is (10 sec: 5501.5, 60 sec: 5508.8, 300 sec: 5531.4). Total num frames: 767956992. Throughput: 0: 5818.6. Samples: 767958700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:51,275][25689] Avg episode reward: [(0, '-0.088')] [2022-07-10 13:49:51,518][26022] Updated weights on worker 0-0, policy_version 749960 (0.00083) [2022-07-10 13:49:53,078][26022] Updated weights on worker 0-0, policy_version 749970 (0.00084) [2022-07-10 13:49:55,007][26022] Updated weights on worker 0-0, policy_version 749980 (0.00083) [2022-07-10 13:49:56,326][25689] Fps is (10 sec: 5587.6, 60 sec: 5541.6, 300 sec: 5538.3). Total num frames: 767985664. Throughput: 0: 5822.2. Samples: 767992428. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:49:56,327][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 13:49:56,914][26022] Updated weights on worker 0-0, policy_version 749990 (0.00093) [2022-07-10 13:49:58,752][26022] Updated weights on worker 0-0, policy_version 750000 (0.00090) [2022-07-10 13:50:00,682][26022] Updated weights on worker 0-0, policy_version 750010 (0.00086) [2022-07-10 13:50:01,376][25689] Fps is (10 sec: 5575.4, 60 sec: 5503.6, 300 sec: 5537.7). Total num frames: 768013312. Throughput: 0: 4996.6. Samples: 768009304. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:01,377][25689] Avg episode reward: [(0, '-1.152')] [2022-07-10 13:50:02,708][26022] Updated weights on worker 0-0, policy_version 750020 (0.00092) [2022-07-10 13:50:04,822][26022] Updated weights on worker 0-0, policy_version 750030 (0.00086) [2022-07-10 13:50:06,420][25689] Fps is (10 sec: 5376.9, 60 sec: 5535.2, 300 sec: 5534.1). Total num frames: 768039936. Throughput: 0: 5723.2. Samples: 768040604. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:06,420][25689] Avg episode reward: [(0, '-1.141')] [2022-07-10 13:50:06,426][26022] Updated weights on worker 0-0, policy_version 750040 (0.00089) [2022-07-10 13:50:08,268][26022] Updated weights on worker 0-0, policy_version 750050 (0.00093) [2022-07-10 13:50:10,112][26022] Updated weights on worker 0-0, policy_version 750060 (0.00088) [2022-07-10 13:50:11,497][25689] Fps is (10 sec: 5463.8, 60 sec: 5545.6, 300 sec: 5534.9). Total num frames: 768068608. Throughput: 0: 5701.8. Samples: 768074002. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:11,497][25689] Avg episode reward: [(0, '-0.941')] [2022-07-10 13:50:12,206][26022] Updated weights on worker 0-0, policy_version 750070 (0.00087) [2022-07-10 13:50:13,826][26022] Updated weights on worker 0-0, policy_version 750080 (0.00087) [2022-07-10 13:50:15,822][26022] Updated weights on worker 0-0, policy_version 750090 (0.00094) [2022-07-10 13:50:16,604][25689] Fps is (10 sec: 5529.9, 60 sec: 5528.4, 300 sec: 5537.3). Total num frames: 768096256. Throughput: 0: 4835.9. Samples: 768090490. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:16,606][25689] Avg episode reward: [(0, '-3.275')] [2022-07-10 13:50:17,495][26022] Updated weights on worker 0-0, policy_version 750100 (0.00091) [2022-07-10 13:50:19,526][26022] Updated weights on worker 0-0, policy_version 750110 (0.00086) [2022-07-10 13:50:21,136][26022] Updated weights on worker 0-0, policy_version 750120 (0.00091) [2022-07-10 13:50:21,668][25689] Fps is (10 sec: 5637.8, 60 sec: 5590.4, 300 sec: 5532.9). Total num frames: 768125952. Throughput: 0: 5638.2. Samples: 768123714. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:21,669][25689] Avg episode reward: [(0, '-3.532')] [2022-07-10 13:50:23,226][26022] Updated weights on worker 0-0, policy_version 750130 (0.00086) [2022-07-10 13:50:24,775][26022] Updated weights on worker 0-0, policy_version 750140 (0.00091) [2022-07-10 13:50:26,675][25689] Fps is (10 sec: 5490.9, 60 sec: 5505.8, 300 sec: 5531.0). Total num frames: 768151552. Throughput: 0: 5739.1. Samples: 768156850. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:26,675][25689] Avg episode reward: [(0, '-3.328')] [2022-07-10 13:50:26,910][26022] Updated weights on worker 0-0, policy_version 750150 (0.00095) [2022-07-10 13:50:28,720][26022] Updated weights on worker 0-0, policy_version 750160 (0.00094) [2022-07-10 13:50:30,663][26022] Updated weights on worker 0-0, policy_version 750170 (0.00084) [2022-07-10 13:50:31,686][25689] Fps is (10 sec: 5417.4, 60 sec: 5539.5, 300 sec: 5533.4). Total num frames: 768180224. Throughput: 0: 4929.9. Samples: 768173534. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:31,687][25689] Avg episode reward: [(0, '-3.687')] [2022-07-10 13:50:32,619][26022] Updated weights on worker 0-0, policy_version 750180 (0.00093) [2022-07-10 13:50:34,408][26022] Updated weights on worker 0-0, policy_version 750190 (0.00082) [2022-07-10 13:50:36,205][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:50:36,220][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000750200_768204800.pth [2022-07-10 13:50:36,220][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000748253_766211072.pth [2022-07-10 13:50:36,233][26022] Updated weights on worker 0-0, policy_version 750200 (0.00093) [2022-07-10 13:50:36,735][25689] Fps is (10 sec: 5496.6, 60 sec: 5491.8, 300 sec: 5525.7). Total num frames: 768206848. Throughput: 0: 5778.7. Samples: 768206818. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 13:50:36,737][25689] Avg episode reward: [(0, '-3.731')] [2022-07-10 13:50:37,794][26022] Updated weights on worker 0-0, policy_version 750210 (0.00093) [2022-07-10 13:50:39,853][26022] Updated weights on worker 0-0, policy_version 750220 (0.00085) [2022-07-10 13:50:41,573][26022] Updated weights on worker 0-0, policy_version 750230 (0.00087) [2022-07-10 13:50:41,739][25689] Fps is (10 sec: 5602.6, 60 sec: 5533.3, 300 sec: 5530.7). Total num frames: 768236544. Throughput: 0: 5811.0. Samples: 768240344. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:50:41,739][25689] Avg episode reward: [(0, '-1.621')] [2022-07-10 13:50:43,632][26022] Updated weights on worker 0-0, policy_version 750240 (0.00089) [2022-07-10 13:50:45,270][26022] Updated weights on worker 0-0, policy_version 750250 (0.00093) [2022-07-10 13:50:46,751][25689] Fps is (10 sec: 5623.0, 60 sec: 5517.4, 300 sec: 5525.0). Total num frames: 768263168. Throughput: 0: 4991.0. Samples: 768257048. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:50:46,751][25689] Avg episode reward: [(0, '-0.707')] [2022-07-10 13:50:47,101][26022] Updated weights on worker 0-0, policy_version 750260 (0.00086) [2022-07-10 13:50:48,917][26022] Updated weights on worker 0-0, policy_version 750270 (0.00089) [2022-07-10 13:50:50,773][26022] Updated weights on worker 0-0, policy_version 750280 (0.00085) [2022-07-10 13:50:51,820][25689] Fps is (10 sec: 5586.6, 60 sec: 5547.5, 300 sec: 5535.3). Total num frames: 768292864. Throughput: 0: 5835.7. Samples: 768291028. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:50:51,821][25689] Avg episode reward: [(0, '-0.731')] [2022-07-10 13:50:52,744][26022] Updated weights on worker 0-0, policy_version 750290 (0.00091) [2022-07-10 13:50:54,499][26022] Updated weights on worker 0-0, policy_version 750300 (0.00091) [2022-07-10 13:50:56,268][26022] Updated weights on worker 0-0, policy_version 750310 (0.00087) [2022-07-10 13:50:56,880][25689] Fps is (10 sec: 5660.9, 60 sec: 5529.7, 300 sec: 5527.4). Total num frames: 768320512. Throughput: 0: 5856.0. Samples: 768324792. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:50:56,881][25689] Avg episode reward: [(0, '-1.503')] [2022-07-10 13:50:57,977][26022] Updated weights on worker 0-0, policy_version 750320 (0.00373) [2022-07-10 13:50:59,943][26022] Updated weights on worker 0-0, policy_version 750330 (0.00091) [2022-07-10 13:51:01,912][25689] Fps is (10 sec: 5479.1, 60 sec: 5531.4, 300 sec: 5534.6). Total num frames: 768348160. Throughput: 0: 5015.7. Samples: 768341528. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:01,913][25689] Avg episode reward: [(0, '-2.912')] [2022-07-10 13:51:01,914][26022] Updated weights on worker 0-0, policy_version 750340 (0.00084) [2022-07-10 13:51:03,929][26022] Updated weights on worker 0-0, policy_version 750350 (0.00086) [2022-07-10 13:51:05,678][26022] Updated weights on worker 0-0, policy_version 750360 (0.00090) [2022-07-10 13:51:06,953][25689] Fps is (10 sec: 5388.2, 60 sec: 5531.6, 300 sec: 5533.9). Total num frames: 768374784. Throughput: 0: 5745.5. Samples: 768373120. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:06,953][25689] Avg episode reward: [(0, '-5.392')] [2022-07-10 13:51:07,741][26022] Updated weights on worker 0-0, policy_version 750370 (0.00091) [2022-07-10 13:51:09,514][26022] Updated weights on worker 0-0, policy_version 750380 (0.00078) [2022-07-10 13:51:11,440][26022] Updated weights on worker 0-0, policy_version 750390 (0.00085) [2022-07-10 13:51:12,022][25689] Fps is (10 sec: 5469.6, 60 sec: 5532.4, 300 sec: 5530.6). Total num frames: 768403456. Throughput: 0: 5714.1. Samples: 768406464. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:12,022][25689] Avg episode reward: [(0, '-7.503')] [2022-07-10 13:51:13,135][26022] Updated weights on worker 0-0, policy_version 750400 (0.00096) [2022-07-10 13:51:14,899][26022] Updated weights on worker 0-0, policy_version 750410 (0.00092) [2022-07-10 13:51:17,079][25689] Fps is (10 sec: 5460.6, 60 sec: 5520.0, 300 sec: 5526.2). Total num frames: 768430080. Throughput: 0: 5702.4. Samples: 768439974. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:17,080][26022] Updated weights on worker 0-0, policy_version 750420 (0.00085) [2022-07-10 13:51:17,080][25689] Avg episode reward: [(0, '-7.283')] [2022-07-10 13:51:18,544][26022] Updated weights on worker 0-0, policy_version 750430 (0.00096) [2022-07-10 13:51:20,668][26022] Updated weights on worker 0-0, policy_version 750440 (0.00091) [2022-07-10 13:51:22,141][25689] Fps is (10 sec: 5565.7, 60 sec: 5520.2, 300 sec: 5536.1). Total num frames: 768459776. Throughput: 0: 5699.5. Samples: 768456822. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:22,143][25689] Avg episode reward: [(0, '-5.524')] [2022-07-10 13:51:22,314][26022] Updated weights on worker 0-0, policy_version 750450 (0.00093) [2022-07-10 13:51:24,361][26022] Updated weights on worker 0-0, policy_version 750460 (0.00095) [2022-07-10 13:51:26,031][26022] Updated weights on worker 0-0, policy_version 750470 (0.00091) [2022-07-10 13:51:27,219][25689] Fps is (10 sec: 5554.5, 60 sec: 5530.6, 300 sec: 5524.9). Total num frames: 768486400. Throughput: 0: 5749.5. Samples: 768489638. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:27,221][25689] Avg episode reward: [(0, '-5.753')] [2022-07-10 13:51:28,034][26022] Updated weights on worker 0-0, policy_version 750480 (0.00085) [2022-07-10 13:51:29,956][26022] Updated weights on worker 0-0, policy_version 750490 (0.00090) [2022-07-10 13:51:31,604][26022] Updated weights on worker 0-0, policy_version 750500 (0.00279) [2022-07-10 13:51:32,306][25689] Fps is (10 sec: 5339.4, 60 sec: 5506.9, 300 sec: 5524.3). Total num frames: 768514048. Throughput: 0: 5742.7. Samples: 768522946. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:32,306][25689] Avg episode reward: [(0, '-2.827')] [2022-07-10 13:51:33,584][26022] Updated weights on worker 0-0, policy_version 750510 (0.00082) [2022-07-10 13:51:35,227][26022] Updated weights on worker 0-0, policy_version 750520 (0.00085) [2022-07-10 13:51:37,159][26022] Updated weights on worker 0-0, policy_version 750530 (0.00086) [2022-07-10 13:51:37,349][25689] Fps is (10 sec: 5761.7, 60 sec: 5574.9, 300 sec: 5534.5). Total num frames: 768544768. Throughput: 0: 4928.3. Samples: 768539870. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:37,351][25689] Avg episode reward: [(0, '-2.590')] [2022-07-10 13:51:39,112][26022] Updated weights on worker 0-0, policy_version 750540 (0.00086) [2022-07-10 13:51:40,773][26022] Updated weights on worker 0-0, policy_version 750550 (0.00093) [2022-07-10 13:51:42,359][25689] Fps is (10 sec: 5602.2, 60 sec: 5506.8, 300 sec: 5524.1). Total num frames: 768570368. Throughput: 0: 5770.9. Samples: 768573496. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:42,359][25689] Avg episode reward: [(0, '-0.938')] [2022-07-10 13:51:42,776][26022] Updated weights on worker 0-0, policy_version 750560 (0.00096) [2022-07-10 13:51:44,420][26022] Updated weights on worker 0-0, policy_version 750570 (0.00381) [2022-07-10 13:51:46,306][26022] Updated weights on worker 0-0, policy_version 750580 (0.00087) [2022-07-10 13:51:47,379][25689] Fps is (10 sec: 5411.1, 60 sec: 5539.9, 300 sec: 5524.3). Total num frames: 768599040. Throughput: 0: 5821.7. Samples: 768607004. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:47,381][25689] Avg episode reward: [(0, '-2.078')] [2022-07-10 13:51:48,204][26022] Updated weights on worker 0-0, policy_version 750590 (0.00086) [2022-07-10 13:51:50,000][26022] Updated weights on worker 0-0, policy_version 750600 (0.00084) [2022-07-10 13:51:51,789][26022] Updated weights on worker 0-0, policy_version 750610 (0.00089) [2022-07-10 13:51:52,398][25689] Fps is (10 sec: 5609.7, 60 sec: 5510.6, 300 sec: 5528.9). Total num frames: 768626688. Throughput: 0: 5013.3. Samples: 768623678. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:52,400][25689] Avg episode reward: [(0, '-2.331')] [2022-07-10 13:51:53,751][26022] Updated weights on worker 0-0, policy_version 750620 (0.00089) [2022-07-10 13:51:55,513][26022] Updated weights on worker 0-0, policy_version 750630 (0.00080) [2022-07-10 13:51:57,447][25689] Fps is (10 sec: 5594.1, 60 sec: 5528.6, 300 sec: 5528.1). Total num frames: 768655360. Throughput: 0: 5828.5. Samples: 768657008. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:51:57,447][25689] Avg episode reward: [(0, '-2.612')] [2022-07-10 13:51:57,450][26022] Updated weights on worker 0-0, policy_version 750640 (0.00090) [2022-07-10 13:51:59,311][26022] Updated weights on worker 0-0, policy_version 750650 (0.00100) [2022-07-10 13:52:01,055][26022] Updated weights on worker 0-0, policy_version 750660 (0.00091) [2022-07-10 13:52:02,534][25689] Fps is (10 sec: 5354.5, 60 sec: 5489.8, 300 sec: 5526.6). Total num frames: 768680960. Throughput: 0: 5686.7. Samples: 768688228. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:02,535][25689] Avg episode reward: [(0, '-1.333')] [2022-07-10 13:52:03,467][26022] Updated weights on worker 0-0, policy_version 750670 (0.00090) [2022-07-10 13:52:05,155][26022] Updated weights on worker 0-0, policy_version 750680 (0.00087) [2022-07-10 13:52:07,158][26022] Updated weights on worker 0-0, policy_version 750690 (0.00090) [2022-07-10 13:52:07,583][25689] Fps is (10 sec: 5354.4, 60 sec: 5522.8, 300 sec: 5526.4). Total num frames: 768709632. Throughput: 0: 4838.7. Samples: 768704764. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:07,583][25689] Avg episode reward: [(0, '-1.185')] [2022-07-10 13:52:09,001][26022] Updated weights on worker 0-0, policy_version 750700 (0.00092) [2022-07-10 13:52:10,709][26022] Updated weights on worker 0-0, policy_version 750710 (0.00091) [2022-07-10 13:52:12,619][25689] Fps is (10 sec: 5483.3, 60 sec: 5492.1, 300 sec: 5526.9). Total num frames: 768736256. Throughput: 0: 5662.9. Samples: 768738182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:12,619][25689] Avg episode reward: [(0, '-3.615')] [2022-07-10 13:52:12,680][26022] Updated weights on worker 0-0, policy_version 750720 (0.00088) [2022-07-10 13:52:14,366][26022] Updated weights on worker 0-0, policy_version 750730 (0.00086) [2022-07-10 13:52:16,407][26022] Updated weights on worker 0-0, policy_version 750740 (0.00090) [2022-07-10 13:52:17,678][25689] Fps is (10 sec: 5375.8, 60 sec: 5508.7, 300 sec: 5526.2). Total num frames: 768763904. Throughput: 0: 5644.4. Samples: 768771204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:17,679][25689] Avg episode reward: [(0, '-3.885')] [2022-07-10 13:52:18,202][26022] Updated weights on worker 0-0, policy_version 750750 (0.00092) [2022-07-10 13:52:19,972][26022] Updated weights on worker 0-0, policy_version 750760 (0.00094) [2022-07-10 13:52:21,964][26022] Updated weights on worker 0-0, policy_version 750770 (0.00092) [2022-07-10 13:52:22,692][25689] Fps is (10 sec: 5590.7, 60 sec: 5496.2, 300 sec: 5526.3). Total num frames: 768792576. Throughput: 0: 4936.8. Samples: 768787746. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:22,693][25689] Avg episode reward: [(0, '-4.455')] [2022-07-10 13:52:23,755][26022] Updated weights on worker 0-0, policy_version 750780 (0.00096) [2022-07-10 13:52:25,663][26022] Updated weights on worker 0-0, policy_version 750790 (0.00067) [2022-07-10 13:52:27,582][26022] Updated weights on worker 0-0, policy_version 750800 (0.00088) [2022-07-10 13:52:27,719][25689] Fps is (10 sec: 5507.1, 60 sec: 5500.8, 300 sec: 5523.4). Total num frames: 768819200. Throughput: 0: 5746.5. Samples: 768820478. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:27,719][25689] Avg episode reward: [(0, '-5.201')] [2022-07-10 13:52:29,334][26022] Updated weights on worker 0-0, policy_version 750810 (0.00086) [2022-07-10 13:52:31,231][26022] Updated weights on worker 0-0, policy_version 750820 (0.00084) [2022-07-10 13:52:32,816][25689] Fps is (10 sec: 5462.3, 60 sec: 5516.8, 300 sec: 5523.1). Total num frames: 768847872. Throughput: 0: 5727.1. Samples: 768853852. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:32,823][25689] Avg episode reward: [(0, '-6.407')] [2022-07-10 13:52:33,043][26022] Updated weights on worker 0-0, policy_version 750830 (0.00096) [2022-07-10 13:52:35,081][26022] Updated weights on worker 0-0, policy_version 750840 (0.00089) [2022-07-10 13:52:36,387][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:52:36,394][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000750848_768868352.pth [2022-07-10 13:52:36,394][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000748904_766877696.pth [2022-07-10 13:52:36,641][26022] Updated weights on worker 0-0, policy_version 750850 (0.00087) [2022-07-10 13:52:37,943][25689] Fps is (10 sec: 5508.7, 60 sec: 5458.5, 300 sec: 5521.5). Total num frames: 768875520. Throughput: 0: 4899.7. Samples: 768870496. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:37,943][25689] Avg episode reward: [(0, '-6.440')] [2022-07-10 13:52:38,639][26022] Updated weights on worker 0-0, policy_version 750860 (0.00587) [2022-07-10 13:52:40,365][26022] Updated weights on worker 0-0, policy_version 750870 (0.00086) [2022-07-10 13:52:42,235][26022] Updated weights on worker 0-0, policy_version 750880 (0.00088) [2022-07-10 13:52:43,017][25689] Fps is (10 sec: 5621.2, 60 sec: 5520.2, 300 sec: 5523.9). Total num frames: 768905216. Throughput: 0: 5716.0. Samples: 768903922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:43,017][25689] Avg episode reward: [(0, '-5.571')] [2022-07-10 13:52:44,308][26022] Updated weights on worker 0-0, policy_version 750890 (0.00084) [2022-07-10 13:52:45,882][26022] Updated weights on worker 0-0, policy_version 750900 (0.00086) [2022-07-10 13:52:47,890][26022] Updated weights on worker 0-0, policy_version 750910 (0.00101) [2022-07-10 13:52:48,057][25689] Fps is (10 sec: 5669.8, 60 sec: 5501.6, 300 sec: 5523.6). Total num frames: 768932864. Throughput: 0: 5744.0. Samples: 768937298. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:48,057][25689] Avg episode reward: [(0, '-4.495')] [2022-07-10 13:52:49,785][26022] Updated weights on worker 0-0, policy_version 750920 (0.00086) [2022-07-10 13:52:51,409][26022] Updated weights on worker 0-0, policy_version 750930 (0.00088) [2022-07-10 13:52:53,132][25689] Fps is (10 sec: 5466.7, 60 sec: 5496.5, 300 sec: 5519.9). Total num frames: 768960512. Throughput: 0: 4933.8. Samples: 768954090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:53,133][25689] Avg episode reward: [(0, '-4.052')] [2022-07-10 13:52:53,358][26022] Updated weights on worker 0-0, policy_version 750940 (0.00089) [2022-07-10 13:52:55,293][26022] Updated weights on worker 0-0, policy_version 750950 (0.00085) [2022-07-10 13:52:57,020][26022] Updated weights on worker 0-0, policy_version 750960 (0.00092) [2022-07-10 13:52:58,171][25689] Fps is (10 sec: 5568.6, 60 sec: 5497.4, 300 sec: 5522.7). Total num frames: 768989184. Throughput: 0: 5771.3. Samples: 768987236. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:52:58,171][25689] Avg episode reward: [(0, '-2.684')] [2022-07-10 13:52:59,116][26022] Updated weights on worker 0-0, policy_version 750970 (0.00090) [2022-07-10 13:53:00,529][26022] Updated weights on worker 0-0, policy_version 750980 (0.00092) [2022-07-10 13:53:03,139][26022] Updated weights on worker 0-0, policy_version 750990 (0.00526) [2022-07-10 13:53:03,182][25689] Fps is (10 sec: 5298.5, 60 sec: 5487.5, 300 sec: 5520.0). Total num frames: 769013760. Throughput: 0: 5764.0. Samples: 769020150. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:03,182][25689] Avg episode reward: [(0, '-2.170')] [2022-07-10 13:53:04,745][26022] Updated weights on worker 0-0, policy_version 751000 (0.00093) [2022-07-10 13:53:06,676][26022] Updated weights on worker 0-0, policy_version 751010 (0.00092) [2022-07-10 13:53:08,214][25689] Fps is (10 sec: 5302.0, 60 sec: 5488.9, 300 sec: 5519.5). Total num frames: 769042432. Throughput: 0: 5716.9. Samples: 769052532. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:08,214][25689] Avg episode reward: [(0, '-2.892')] [2022-07-10 13:53:08,527][26022] Updated weights on worker 0-0, policy_version 751020 (0.00089) [2022-07-10 13:53:10,252][26022] Updated weights on worker 0-0, policy_version 751030 (0.00082) [2022-07-10 13:53:11,997][26022] Updated weights on worker 0-0, policy_version 751040 (0.00088) [2022-07-10 13:53:13,219][25689] Fps is (10 sec: 5713.0, 60 sec: 5525.5, 300 sec: 5521.0). Total num frames: 769071104. Throughput: 0: 5742.2. Samples: 769069434. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:13,220][25689] Avg episode reward: [(0, '-2.197')] [2022-07-10 13:53:14,101][26022] Updated weights on worker 0-0, policy_version 751050 (0.00092) [2022-07-10 13:53:15,618][26022] Updated weights on worker 0-0, policy_version 751060 (0.00091) [2022-07-10 13:53:17,656][26022] Updated weights on worker 0-0, policy_version 751070 (0.00088) [2022-07-10 13:53:18,259][25689] Fps is (10 sec: 5504.8, 60 sec: 5510.4, 300 sec: 5517.1). Total num frames: 769097728. Throughput: 0: 5745.8. Samples: 769102658. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:18,259][25689] Avg episode reward: [(0, '-3.445')] [2022-07-10 13:53:19,333][26022] Updated weights on worker 0-0, policy_version 751080 (0.00091) [2022-07-10 13:53:21,444][26022] Updated weights on worker 0-0, policy_version 751090 (0.00085) [2022-07-10 13:53:22,955][26022] Updated weights on worker 0-0, policy_version 751100 (0.00091) [2022-07-10 13:53:23,277][25689] Fps is (10 sec: 5599.3, 60 sec: 5526.9, 300 sec: 5524.9). Total num frames: 769127424. Throughput: 0: 5783.5. Samples: 769136374. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:23,278][25689] Avg episode reward: [(0, '-4.871')] [2022-07-10 13:53:24,823][26022] Updated weights on worker 0-0, policy_version 751110 (0.00095) [2022-07-10 13:53:26,722][26022] Updated weights on worker 0-0, policy_version 751120 (0.00088) [2022-07-10 13:53:28,283][25689] Fps is (10 sec: 5720.7, 60 sec: 5545.8, 300 sec: 5514.6). Total num frames: 769155072. Throughput: 0: 4992.5. Samples: 769152726. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:28,283][25689] Avg episode reward: [(0, '-3.794')] [2022-07-10 13:53:28,639][26022] Updated weights on worker 0-0, policy_version 751130 (0.00080) [2022-07-10 13:53:30,619][26022] Updated weights on worker 0-0, policy_version 751140 (0.00087) [2022-07-10 13:53:32,416][26022] Updated weights on worker 0-0, policy_version 751150 (0.00092) [2022-07-10 13:53:33,302][25689] Fps is (10 sec: 5413.9, 60 sec: 5519.0, 300 sec: 5520.3). Total num frames: 769181696. Throughput: 0: 5803.2. Samples: 769185980. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:33,302][25689] Avg episode reward: [(0, '-4.014')] [2022-07-10 13:53:34,314][26022] Updated weights on worker 0-0, policy_version 751160 (0.00082) [2022-07-10 13:53:36,141][26022] Updated weights on worker 0-0, policy_version 751170 (0.00086) [2022-07-10 13:53:37,988][26022] Updated weights on worker 0-0, policy_version 751180 (0.00094) [2022-07-10 13:53:38,348][25689] Fps is (10 sec: 5493.8, 60 sec: 5543.4, 300 sec: 5516.9). Total num frames: 769210368. Throughput: 0: 5794.5. Samples: 769219064. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:38,348][25689] Avg episode reward: [(0, '-5.272')] [2022-07-10 13:53:39,746][26022] Updated weights on worker 0-0, policy_version 751190 (0.00091) [2022-07-10 13:53:41,549][26022] Updated weights on worker 0-0, policy_version 751200 (0.00095) [2022-07-10 13:53:43,414][25689] Fps is (10 sec: 5569.8, 60 sec: 5510.2, 300 sec: 5516.3). Total num frames: 769238016. Throughput: 0: 4935.3. Samples: 769235752. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:43,414][25689] Avg episode reward: [(0, '-3.570')] [2022-07-10 13:53:43,445][26022] Updated weights on worker 0-0, policy_version 751210 (0.00609) [2022-07-10 13:53:45,300][26022] Updated weights on worker 0-0, policy_version 751220 (0.00087) [2022-07-10 13:53:47,162][26022] Updated weights on worker 0-0, policy_version 751230 (0.00101) [2022-07-10 13:53:48,420][25689] Fps is (10 sec: 5489.9, 60 sec: 5513.3, 300 sec: 5516.5). Total num frames: 769265664. Throughput: 0: 5798.0. Samples: 769269484. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:48,423][25689] Avg episode reward: [(0, '-2.085')] [2022-07-10 13:53:48,811][26022] Updated weights on worker 0-0, policy_version 751240 (0.00082) [2022-07-10 13:53:50,879][26022] Updated weights on worker 0-0, policy_version 751250 (0.00095) [2022-07-10 13:53:52,623][26022] Updated weights on worker 0-0, policy_version 751260 (0.00090) [2022-07-10 13:53:53,454][25689] Fps is (10 sec: 5609.7, 60 sec: 5534.1, 300 sec: 5523.5). Total num frames: 769294336. Throughput: 0: 5802.7. Samples: 769302914. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:53,454][25689] Avg episode reward: [(0, '-1.073')] [2022-07-10 13:53:54,671][26022] Updated weights on worker 0-0, policy_version 751270 (0.00082) [2022-07-10 13:53:56,432][26022] Updated weights on worker 0-0, policy_version 751280 (0.00085) [2022-07-10 13:53:58,259][26022] Updated weights on worker 0-0, policy_version 751290 (0.00081) [2022-07-10 13:53:58,553][25689] Fps is (10 sec: 5659.1, 60 sec: 5528.5, 300 sec: 5518.3). Total num frames: 769323008. Throughput: 0: 4966.8. Samples: 769319420. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:53:58,554][25689] Avg episode reward: [(0, '-2.077')] [2022-07-10 13:54:00,243][26022] Updated weights on worker 0-0, policy_version 751300 (0.00077) [2022-07-10 13:54:02,172][26022] Updated weights on worker 0-0, policy_version 751310 (0.00092) [2022-07-10 13:54:03,571][25689] Fps is (10 sec: 5262.8, 60 sec: 5527.8, 300 sec: 5518.3). Total num frames: 769347584. Throughput: 0: 5727.5. Samples: 769351204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 13:54:03,572][25689] Avg episode reward: [(0, '-3.353')] [2022-07-10 13:54:04,062][26022] Updated weights on worker 0-0, policy_version 751320 (0.00081) [2022-07-10 13:54:05,955][26022] Updated weights on worker 0-0, policy_version 751330 (0.00089) [2022-07-10 13:54:07,663][26022] Updated weights on worker 0-0, policy_version 751340 (0.00088) [2022-07-10 13:54:08,576][25689] Fps is (10 sec: 5312.8, 60 sec: 5530.3, 300 sec: 5521.7). Total num frames: 769376256. Throughput: 0: 5714.4. Samples: 769384662. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:08,577][25689] Avg episode reward: [(0, '-1.043')] [2022-07-10 13:54:09,513][26022] Updated weights on worker 0-0, policy_version 751350 (0.00094) [2022-07-10 13:54:11,399][26022] Updated weights on worker 0-0, policy_version 751360 (0.00086) [2022-07-10 13:54:13,196][26022] Updated weights on worker 0-0, policy_version 751370 (0.00090) [2022-07-10 13:54:13,599][25689] Fps is (10 sec: 5718.4, 60 sec: 5528.7, 300 sec: 5523.2). Total num frames: 769404928. Throughput: 0: 4898.4. Samples: 769401596. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:13,600][25689] Avg episode reward: [(0, '-0.607')] [2022-07-10 13:54:15,051][26022] Updated weights on worker 0-0, policy_version 751380 (0.00097) [2022-07-10 13:54:16,914][26022] Updated weights on worker 0-0, policy_version 751390 (0.00090) [2022-07-10 13:54:18,601][26022] Updated weights on worker 0-0, policy_version 751400 (0.00090) [2022-07-10 13:54:18,696][25689] Fps is (10 sec: 5767.3, 60 sec: 5574.3, 300 sec: 5535.3). Total num frames: 769434624. Throughput: 0: 5759.7. Samples: 769435438. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:18,697][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 13:54:20,575][26022] Updated weights on worker 0-0, policy_version 751410 (0.00089) [2022-07-10 13:54:22,180][26022] Updated weights on worker 0-0, policy_version 751420 (0.00085) [2022-07-10 13:54:23,756][25689] Fps is (10 sec: 5444.2, 60 sec: 5502.7, 300 sec: 5517.0). Total num frames: 769460224. Throughput: 0: 5843.8. Samples: 769469160. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:23,758][25689] Avg episode reward: [(0, '-1.467')] [2022-07-10 13:54:24,089][26022] Updated weights on worker 0-0, policy_version 751430 (0.00088) [2022-07-10 13:54:25,916][26022] Updated weights on worker 0-0, policy_version 751440 (0.00097) [2022-07-10 13:54:27,751][26022] Updated weights on worker 0-0, policy_version 751450 (0.00086) [2022-07-10 13:54:28,776][25689] Fps is (10 sec: 5384.4, 60 sec: 5518.4, 300 sec: 5523.8). Total num frames: 769488896. Throughput: 0: 4997.4. Samples: 769485610. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:28,778][25689] Avg episode reward: [(0, '-1.468')] [2022-07-10 13:54:29,789][26022] Updated weights on worker 0-0, policy_version 751460 (0.00082) [2022-07-10 13:54:31,474][26022] Updated weights on worker 0-0, policy_version 751470 (0.00092) [2022-07-10 13:54:33,332][26022] Updated weights on worker 0-0, policy_version 751480 (0.00112) [2022-07-10 13:54:33,793][25689] Fps is (10 sec: 5713.5, 60 sec: 5552.4, 300 sec: 5521.5). Total num frames: 769517568. Throughput: 0: 5820.3. Samples: 769519128. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:33,793][25689] Avg episode reward: [(0, '-1.533')] [2022-07-10 13:54:35,322][26022] Updated weights on worker 0-0, policy_version 751490 (0.00085) [2022-07-10 13:54:36,473][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:54:36,488][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000751497_769532928.pth [2022-07-10 13:54:36,495][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000749553_767542272.pth [2022-07-10 13:54:36,926][26022] Updated weights on worker 0-0, policy_version 751500 (0.00087) [2022-07-10 13:54:38,908][25689] Fps is (10 sec: 5558.5, 60 sec: 5529.2, 300 sec: 5521.0). Total num frames: 769545216. Throughput: 0: 5789.4. Samples: 769552452. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:38,909][25689] Avg episode reward: [(0, '-1.871')] [2022-07-10 13:54:39,050][26022] Updated weights on worker 0-0, policy_version 751510 (0.00096) [2022-07-10 13:54:40,778][26022] Updated weights on worker 0-0, policy_version 751520 (0.00084) [2022-07-10 13:54:42,655][26022] Updated weights on worker 0-0, policy_version 751530 (0.00083) [2022-07-10 13:54:43,911][25689] Fps is (10 sec: 5465.3, 60 sec: 5535.0, 300 sec: 5521.4). Total num frames: 769572864. Throughput: 0: 4960.8. Samples: 769569142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:43,912][25689] Avg episode reward: [(0, '-2.235')] [2022-07-10 13:54:44,456][26022] Updated weights on worker 0-0, policy_version 751540 (0.00090) [2022-07-10 13:54:46,325][26022] Updated weights on worker 0-0, policy_version 751550 (0.00083) [2022-07-10 13:54:47,993][26022] Updated weights on worker 0-0, policy_version 751560 (0.00087) [2022-07-10 13:54:48,939][25689] Fps is (10 sec: 5614.8, 60 sec: 5549.9, 300 sec: 5524.8). Total num frames: 769601536. Throughput: 0: 5796.9. Samples: 769602492. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:48,940][25689] Avg episode reward: [(0, '-2.708')] [2022-07-10 13:54:50,057][26022] Updated weights on worker 0-0, policy_version 751570 (0.00087) [2022-07-10 13:54:51,703][26022] Updated weights on worker 0-0, policy_version 751580 (0.00087) [2022-07-10 13:54:53,743][26022] Updated weights on worker 0-0, policy_version 751590 (0.00095) [2022-07-10 13:54:53,948][25689] Fps is (10 sec: 5508.8, 60 sec: 5518.2, 300 sec: 5518.7). Total num frames: 769628160. Throughput: 0: 5802.9. Samples: 769636088. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:53,950][25689] Avg episode reward: [(0, '-2.238')] [2022-07-10 13:54:55,299][26022] Updated weights on worker 0-0, policy_version 751600 (0.00085) [2022-07-10 13:54:57,507][26022] Updated weights on worker 0-0, policy_version 751610 (0.00091) [2022-07-10 13:54:59,070][25689] Fps is (10 sec: 5458.0, 60 sec: 5516.2, 300 sec: 5520.8). Total num frames: 769656832. Throughput: 0: 4965.0. Samples: 769652554. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:54:59,070][25689] Avg episode reward: [(0, '-0.755')] [2022-07-10 13:54:59,247][26022] Updated weights on worker 0-0, policy_version 751620 (0.00084) [2022-07-10 13:55:01,207][26022] Updated weights on worker 0-0, policy_version 751630 (0.00085) [2022-07-10 13:55:03,080][26022] Updated weights on worker 0-0, policy_version 751640 (0.00089) [2022-07-10 13:55:04,150][25689] Fps is (10 sec: 5420.4, 60 sec: 5544.4, 300 sec: 5520.2). Total num frames: 769683456. Throughput: 0: 5674.0. Samples: 769683980. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:04,150][25689] Avg episode reward: [(0, '-1.579')] [2022-07-10 13:55:05,070][26022] Updated weights on worker 0-0, policy_version 751650 (0.00092) [2022-07-10 13:55:06,903][26022] Updated weights on worker 0-0, policy_version 751660 (0.00087) [2022-07-10 13:55:08,575][26022] Updated weights on worker 0-0, policy_version 751670 (0.00094) [2022-07-10 13:55:09,185][25689] Fps is (10 sec: 5466.6, 60 sec: 5541.6, 300 sec: 5520.9). Total num frames: 769712128. Throughput: 0: 5695.6. Samples: 769717808. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:09,187][25689] Avg episode reward: [(0, '-1.934')] [2022-07-10 13:55:10,638][26022] Updated weights on worker 0-0, policy_version 751680 (0.00090) [2022-07-10 13:55:12,225][26022] Updated weights on worker 0-0, policy_version 751690 (0.00082) [2022-07-10 13:55:14,170][26022] Updated weights on worker 0-0, policy_version 751700 (0.00088) [2022-07-10 13:55:14,218][25689] Fps is (10 sec: 5695.7, 60 sec: 5540.8, 300 sec: 5525.8). Total num frames: 769740800. Throughput: 0: 4853.4. Samples: 769734464. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:14,218][25689] Avg episode reward: [(0, '-2.305')] [2022-07-10 13:55:16,116][26022] Updated weights on worker 0-0, policy_version 751710 (0.00087) [2022-07-10 13:55:17,904][26022] Updated weights on worker 0-0, policy_version 751720 (0.00091) [2022-07-10 13:55:19,307][25689] Fps is (10 sec: 5665.6, 60 sec: 5524.6, 300 sec: 5521.9). Total num frames: 769769472. Throughput: 0: 5704.1. Samples: 769767986. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:19,308][25689] Avg episode reward: [(0, '-2.189')] [2022-07-10 13:55:19,530][26022] Updated weights on worker 0-0, policy_version 751730 (0.00090) [2022-07-10 13:55:21,759][26022] Updated weights on worker 0-0, policy_version 751740 (0.00082) [2022-07-10 13:55:23,303][26022] Updated weights on worker 0-0, policy_version 751750 (0.00090) [2022-07-10 13:55:24,361][25689] Fps is (10 sec: 5350.6, 60 sec: 5525.1, 300 sec: 5521.0). Total num frames: 769795072. Throughput: 0: 5811.3. Samples: 769801432. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:24,362][25689] Avg episode reward: [(0, '-2.453')] [2022-07-10 13:55:25,504][26022] Updated weights on worker 0-0, policy_version 751760 (0.00090) [2022-07-10 13:55:27,211][26022] Updated weights on worker 0-0, policy_version 751770 (0.00081) [2022-07-10 13:55:29,062][26022] Updated weights on worker 0-0, policy_version 751780 (0.00085) [2022-07-10 13:55:29,392][25689] Fps is (10 sec: 5584.4, 60 sec: 5557.9, 300 sec: 5527.5). Total num frames: 769825792. Throughput: 0: 4964.1. Samples: 769818116. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:29,392][25689] Avg episode reward: [(0, '-3.266')] [2022-07-10 13:55:30,917][26022] Updated weights on worker 0-0, policy_version 751790 (0.00090) [2022-07-10 13:55:32,551][26022] Updated weights on worker 0-0, policy_version 751800 (0.00085) [2022-07-10 13:55:34,421][25689] Fps is (10 sec: 5700.0, 60 sec: 5523.0, 300 sec: 5527.9). Total num frames: 769852416. Throughput: 0: 5810.2. Samples: 769851850. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:34,423][25689] Avg episode reward: [(0, '-2.357')] [2022-07-10 13:55:34,468][26022] Updated weights on worker 0-0, policy_version 751810 (0.00086) [2022-07-10 13:55:36,375][26022] Updated weights on worker 0-0, policy_version 751820 (0.00087) [2022-07-10 13:55:38,058][26022] Updated weights on worker 0-0, policy_version 751830 (0.00091) [2022-07-10 13:55:39,482][25689] Fps is (10 sec: 5480.3, 60 sec: 5544.9, 300 sec: 5523.4). Total num frames: 769881088. Throughput: 0: 5819.3. Samples: 769885392. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:39,482][25689] Avg episode reward: [(0, '-3.116')] [2022-07-10 13:55:40,016][26022] Updated weights on worker 0-0, policy_version 751840 (0.00083) [2022-07-10 13:55:41,714][26022] Updated weights on worker 0-0, policy_version 751850 (0.00085) [2022-07-10 13:55:43,658][26022] Updated weights on worker 0-0, policy_version 751860 (0.00073) [2022-07-10 13:55:44,486][25689] Fps is (10 sec: 5595.6, 60 sec: 5544.7, 300 sec: 5526.9). Total num frames: 769908736. Throughput: 0: 5012.4. Samples: 769902308. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:44,487][25689] Avg episode reward: [(0, '-2.462')] [2022-07-10 13:55:45,443][26022] Updated weights on worker 0-0, policy_version 751870 (0.00087) [2022-07-10 13:55:47,280][26022] Updated weights on worker 0-0, policy_version 751880 (0.00090) [2022-07-10 13:55:48,944][26022] Updated weights on worker 0-0, policy_version 751890 (0.00085) [2022-07-10 13:55:49,537][25689] Fps is (10 sec: 5601.1, 60 sec: 5542.6, 300 sec: 5523.9). Total num frames: 769937408. Throughput: 0: 5866.2. Samples: 769936292. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:49,537][25689] Avg episode reward: [(0, '-3.568')] [2022-07-10 13:55:50,919][26022] Updated weights on worker 0-0, policy_version 751900 (0.00094) [2022-07-10 13:55:52,619][26022] Updated weights on worker 0-0, policy_version 751910 (0.00087) [2022-07-10 13:55:54,494][26022] Updated weights on worker 0-0, policy_version 751920 (0.00085) [2022-07-10 13:55:54,593][25689] Fps is (10 sec: 5674.1, 60 sec: 5572.2, 300 sec: 5527.4). Total num frames: 769966080. Throughput: 0: 5847.3. Samples: 769969798. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:54,593][25689] Avg episode reward: [(0, '-3.663')] [2022-07-10 13:55:56,284][26022] Updated weights on worker 0-0, policy_version 751930 (0.00106) [2022-07-10 13:55:58,181][26022] Updated weights on worker 0-0, policy_version 751940 (0.00087) [2022-07-10 13:55:59,711][25689] Fps is (10 sec: 5535.3, 60 sec: 5555.5, 300 sec: 5525.8). Total num frames: 769993728. Throughput: 0: 5824.2. Samples: 770003214. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:55:59,712][25689] Avg episode reward: [(0, '-2.938')] [2022-07-10 13:56:00,020][26022] Updated weights on worker 0-0, policy_version 751950 (0.00087) [2022-07-10 13:56:02,321][26022] Updated weights on worker 0-0, policy_version 751960 (0.00094) [2022-07-10 13:56:03,982][26022] Updated weights on worker 0-0, policy_version 751970 (0.00091) [2022-07-10 13:56:04,799][25689] Fps is (10 sec: 5317.5, 60 sec: 5554.8, 300 sec: 5524.9). Total num frames: 770020352. Throughput: 0: 5695.3. Samples: 770017994. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:04,800][25689] Avg episode reward: [(0, '-2.591')] [2022-07-10 13:56:05,852][26022] Updated weights on worker 0-0, policy_version 751980 (0.00089) [2022-07-10 13:56:07,587][26022] Updated weights on worker 0-0, policy_version 751990 (0.00088) [2022-07-10 13:56:09,579][26022] Updated weights on worker 0-0, policy_version 752000 (0.00085) [2022-07-10 13:56:09,868][25689] Fps is (10 sec: 5444.3, 60 sec: 5551.7, 300 sec: 5524.9). Total num frames: 770049024. Throughput: 0: 5682.4. Samples: 770051822. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:09,869][25689] Avg episode reward: [(0, '-2.076')] [2022-07-10 13:56:11,382][26022] Updated weights on worker 0-0, policy_version 752010 (0.00094) [2022-07-10 13:56:13,252][26022] Updated weights on worker 0-0, policy_version 752020 (0.00094) [2022-07-10 13:56:14,904][25689] Fps is (10 sec: 5674.6, 60 sec: 5551.4, 300 sec: 5532.2). Total num frames: 770077696. Throughput: 0: 5702.3. Samples: 770085622. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:14,906][25689] Avg episode reward: [(0, '-2.253')] [2022-07-10 13:56:15,096][26022] Updated weights on worker 0-0, policy_version 752030 (0.00089) [2022-07-10 13:56:16,837][26022] Updated weights on worker 0-0, policy_version 752040 (0.00086) [2022-07-10 13:56:18,539][26022] Updated weights on worker 0-0, policy_version 752050 (0.00085) [2022-07-10 13:56:19,978][25689] Fps is (10 sec: 5570.8, 60 sec: 5535.9, 300 sec: 5525.1). Total num frames: 770105344. Throughput: 0: 4891.3. Samples: 770102344. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:19,980][25689] Avg episode reward: [(0, '-2.954')] [2022-07-10 13:56:20,578][26022] Updated weights on worker 0-0, policy_version 752060 (0.00092) [2022-07-10 13:56:22,223][26022] Updated weights on worker 0-0, policy_version 752070 (0.00088) [2022-07-10 13:56:24,005][26022] Updated weights on worker 0-0, policy_version 752080 (0.00089) [2022-07-10 13:56:24,995][25689] Fps is (10 sec: 5784.6, 60 sec: 5623.8, 300 sec: 5540.0). Total num frames: 770136064. Throughput: 0: 5866.9. Samples: 770136480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:24,996][25689] Avg episode reward: [(0, '-2.363')] [2022-07-10 13:56:25,839][26022] Updated weights on worker 0-0, policy_version 752090 (0.00089) [2022-07-10 13:56:27,760][26022] Updated weights on worker 0-0, policy_version 752100 (0.00087) [2022-07-10 13:56:29,579][26022] Updated weights on worker 0-0, policy_version 752110 (0.00083) [2022-07-10 13:56:30,004][25689] Fps is (10 sec: 5617.5, 60 sec: 5541.3, 300 sec: 5534.5). Total num frames: 770161664. Throughput: 0: 5867.5. Samples: 770169968. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:30,004][25689] Avg episode reward: [(0, '-3.454')] [2022-07-10 13:56:31,636][26022] Updated weights on worker 0-0, policy_version 752120 (0.00089) [2022-07-10 13:56:33,189][26022] Updated weights on worker 0-0, policy_version 752130 (0.00086) [2022-07-10 13:56:35,027][25689] Fps is (10 sec: 5307.3, 60 sec: 5558.8, 300 sec: 5524.6). Total num frames: 770189312. Throughput: 0: 5017.0. Samples: 770186580. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:35,028][25689] Avg episode reward: [(0, '-2.590')] [2022-07-10 13:56:35,337][26022] Updated weights on worker 0-0, policy_version 752140 (0.00083) [2022-07-10 13:56:36,533][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:56:36,546][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000752149_770200576.pth [2022-07-10 13:56:36,546][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000750200_768204800.pth [2022-07-10 13:56:36,811][26022] Updated weights on worker 0-0, policy_version 752150 (0.00098) [2022-07-10 13:56:38,931][26022] Updated weights on worker 0-0, policy_version 752160 (0.00097) [2022-07-10 13:56:40,126][25689] Fps is (10 sec: 5766.6, 60 sec: 5589.1, 300 sec: 5540.1). Total num frames: 770220032. Throughput: 0: 5826.6. Samples: 770219736. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:40,126][25689] Avg episode reward: [(0, '-2.613')] [2022-07-10 13:56:40,787][26022] Updated weights on worker 0-0, policy_version 752170 (0.00087) [2022-07-10 13:56:42,487][26022] Updated weights on worker 0-0, policy_version 752180 (0.00466) [2022-07-10 13:56:44,380][26022] Updated weights on worker 0-0, policy_version 752190 (0.00303) [2022-07-10 13:56:45,150][25689] Fps is (10 sec: 5665.2, 60 sec: 5570.4, 300 sec: 5533.2). Total num frames: 770246656. Throughput: 0: 5797.6. Samples: 770253330. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:45,154][25689] Avg episode reward: [(0, '-1.933')] [2022-07-10 13:56:46,184][26022] Updated weights on worker 0-0, policy_version 752200 (0.00093) [2022-07-10 13:56:47,932][26022] Updated weights on worker 0-0, policy_version 752210 (0.00091) [2022-07-10 13:56:50,167][26022] Updated weights on worker 0-0, policy_version 752220 (0.00084) [2022-07-10 13:56:50,168][25689] Fps is (10 sec: 5200.3, 60 sec: 5522.7, 300 sec: 5526.3). Total num frames: 770272256. Throughput: 0: 4963.0. Samples: 770270044. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:50,169][25689] Avg episode reward: [(0, '-1.764')] [2022-07-10 13:56:51,711][26022] Updated weights on worker 0-0, policy_version 752230 (0.00088) [2022-07-10 13:56:53,721][26022] Updated weights on worker 0-0, policy_version 752240 (0.00086) [2022-07-10 13:56:55,187][25689] Fps is (10 sec: 5611.2, 60 sec: 5559.9, 300 sec: 5533.7). Total num frames: 770302976. Throughput: 0: 5799.7. Samples: 770303498. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:56:55,188][25689] Avg episode reward: [(0, '-2.949')] [2022-07-10 13:56:55,241][26022] Updated weights on worker 0-0, policy_version 752250 (0.00086) [2022-07-10 13:56:57,295][26022] Updated weights on worker 0-0, policy_version 752260 (0.00090) [2022-07-10 13:56:59,044][26022] Updated weights on worker 0-0, policy_version 752270 (0.00092) [2022-07-10 13:57:00,299][25689] Fps is (10 sec: 5660.6, 60 sec: 5543.6, 300 sec: 5536.7). Total num frames: 770329600. Throughput: 0: 5794.9. Samples: 770336636. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:00,299][25689] Avg episode reward: [(0, '-2.869')] [2022-07-10 13:57:01,064][26022] Updated weights on worker 0-0, policy_version 752280 (0.00095) [2022-07-10 13:57:03,078][26022] Updated weights on worker 0-0, policy_version 752290 (0.00090) [2022-07-10 13:57:05,158][26022] Updated weights on worker 0-0, policy_version 752300 (0.00090) [2022-07-10 13:57:05,367][25689] Fps is (10 sec: 5230.4, 60 sec: 5545.3, 300 sec: 5529.5). Total num frames: 770356224. Throughput: 0: 4843.0. Samples: 770351244. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:05,368][25689] Avg episode reward: [(0, '-2.491')] [2022-07-10 13:57:06,837][26022] Updated weights on worker 0-0, policy_version 752310 (0.00093) [2022-07-10 13:57:08,816][26022] Updated weights on worker 0-0, policy_version 752320 (0.00090) [2022-07-10 13:57:10,427][25689] Fps is (10 sec: 5358.3, 60 sec: 5529.3, 300 sec: 5532.5). Total num frames: 770383872. Throughput: 0: 5660.3. Samples: 770384716. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:10,428][25689] Avg episode reward: [(0, '-3.708')] [2022-07-10 13:57:10,646][26022] Updated weights on worker 0-0, policy_version 752330 (0.00090) [2022-07-10 13:57:12,419][26022] Updated weights on worker 0-0, policy_version 752340 (0.00083) [2022-07-10 13:57:14,371][26022] Updated weights on worker 0-0, policy_version 752350 (0.00087) [2022-07-10 13:57:15,439][25689] Fps is (10 sec: 5591.9, 60 sec: 5531.5, 300 sec: 5536.8). Total num frames: 770412544. Throughput: 0: 5664.6. Samples: 770418220. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:15,439][25689] Avg episode reward: [(0, '-3.559')] [2022-07-10 13:57:16,218][26022] Updated weights on worker 0-0, policy_version 752360 (0.00089) [2022-07-10 13:57:17,972][26022] Updated weights on worker 0-0, policy_version 752370 (0.00085) [2022-07-10 13:57:19,869][26022] Updated weights on worker 0-0, policy_version 752380 (0.00089) [2022-07-10 13:57:20,531][25689] Fps is (10 sec: 5675.6, 60 sec: 5546.8, 300 sec: 5535.4). Total num frames: 770441216. Throughput: 0: 4852.1. Samples: 770434804. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:20,531][25689] Avg episode reward: [(0, '-1.782')] [2022-07-10 13:57:21,522][26022] Updated weights on worker 0-0, policy_version 752390 (0.00083) [2022-07-10 13:57:23,655][26022] Updated weights on worker 0-0, policy_version 752400 (0.00090) [2022-07-10 13:57:25,335][26022] Updated weights on worker 0-0, policy_version 752410 (0.00080) [2022-07-10 13:57:25,562][25689] Fps is (10 sec: 5563.8, 60 sec: 5494.7, 300 sec: 5538.7). Total num frames: 770468864. Throughput: 0: 5788.1. Samples: 770468130. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:25,562][25689] Avg episode reward: [(0, '-2.212')] [2022-07-10 13:57:27,296][26022] Updated weights on worker 0-0, policy_version 752420 (0.00091) [2022-07-10 13:57:28,804][26022] Updated weights on worker 0-0, policy_version 752430 (0.00091) [2022-07-10 13:57:30,628][25689] Fps is (10 sec: 5374.8, 60 sec: 5506.4, 300 sec: 5532.4). Total num frames: 770495488. Throughput: 0: 5771.3. Samples: 770501304. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 13:57:30,629][25689] Avg episode reward: [(0, '-2.016')] [2022-07-10 13:57:31,055][26022] Updated weights on worker 0-0, policy_version 752440 (0.00076) [2022-07-10 13:57:32,674][26022] Updated weights on worker 0-0, policy_version 752450 (0.00270) [2022-07-10 13:57:34,725][26022] Updated weights on worker 0-0, policy_version 752460 (0.00084) [2022-07-10 13:57:35,634][25689] Fps is (10 sec: 5489.8, 60 sec: 5524.9, 300 sec: 5538.1). Total num frames: 770524160. Throughput: 0: 4938.5. Samples: 770517954. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:57:35,636][25689] Avg episode reward: [(0, '-2.960')] [2022-07-10 13:57:36,327][26022] Updated weights on worker 0-0, policy_version 752470 (0.00088) [2022-07-10 13:57:38,436][26022] Updated weights on worker 0-0, policy_version 752480 (0.00088) [2022-07-10 13:57:39,975][26022] Updated weights on worker 0-0, policy_version 752490 (0.00081) [2022-07-10 13:57:40,768][25689] Fps is (10 sec: 5554.6, 60 sec: 5471.1, 300 sec: 5530.1). Total num frames: 770551808. Throughput: 0: 5758.6. Samples: 770551340. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:57:40,770][25689] Avg episode reward: [(0, '-2.960')] [2022-07-10 13:57:42,183][26022] Updated weights on worker 0-0, policy_version 752500 (0.00090) [2022-07-10 13:57:43,722][26022] Updated weights on worker 0-0, policy_version 752510 (0.00088) [2022-07-10 13:57:45,752][26022] Updated weights on worker 0-0, policy_version 752520 (0.00093) [2022-07-10 13:57:45,818][25689] Fps is (10 sec: 5530.4, 60 sec: 5502.4, 300 sec: 5533.3). Total num frames: 770580480. Throughput: 0: 5763.7. Samples: 770584882. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:57:45,818][25689] Avg episode reward: [(0, '-2.771')] [2022-07-10 13:57:47,580][26022] Updated weights on worker 0-0, policy_version 752530 (0.00087) [2022-07-10 13:57:49,396][26022] Updated weights on worker 0-0, policy_version 752540 (0.00086) [2022-07-10 13:57:50,825][25689] Fps is (10 sec: 5701.8, 60 sec: 5554.1, 300 sec: 5538.1). Total num frames: 770609152. Throughput: 0: 5803.0. Samples: 770618504. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:57:50,825][25689] Avg episode reward: [(0, '-3.320')] [2022-07-10 13:57:51,115][26022] Updated weights on worker 0-0, policy_version 752550 (0.00093) [2022-07-10 13:57:53,110][26022] Updated weights on worker 0-0, policy_version 752560 (0.00097) [2022-07-10 13:57:54,817][26022] Updated weights on worker 0-0, policy_version 752570 (0.00087) [2022-07-10 13:57:55,842][25689] Fps is (10 sec: 5516.2, 60 sec: 5486.7, 300 sec: 5531.6). Total num frames: 770635776. Throughput: 0: 5810.4. Samples: 770635370. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:57:55,844][25689] Avg episode reward: [(0, '-3.263')] [2022-07-10 13:57:56,870][26022] Updated weights on worker 0-0, policy_version 752580 (0.00104) [2022-07-10 13:57:58,507][26022] Updated weights on worker 0-0, policy_version 752590 (0.00082) [2022-07-10 13:58:00,447][26022] Updated weights on worker 0-0, policy_version 752600 (0.00085) [2022-07-10 13:58:00,909][25689] Fps is (10 sec: 5483.7, 60 sec: 5524.6, 300 sec: 5544.3). Total num frames: 770664448. Throughput: 0: 5828.3. Samples: 770668728. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:00,909][25689] Avg episode reward: [(0, '-4.404')] [2022-07-10 13:58:02,659][26022] Updated weights on worker 0-0, policy_version 752610 (0.00092) [2022-07-10 13:58:04,393][26022] Updated weights on worker 0-0, policy_version 752620 (0.00084) [2022-07-10 13:58:05,911][25689] Fps is (10 sec: 5390.3, 60 sec: 5513.8, 300 sec: 5534.5). Total num frames: 770690048. Throughput: 0: 5708.1. Samples: 770699574. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:05,911][25689] Avg episode reward: [(0, '-3.571')] [2022-07-10 13:58:06,407][26022] Updated weights on worker 0-0, policy_version 752630 (0.00094) [2022-07-10 13:58:08,253][26022] Updated weights on worker 0-0, policy_version 752640 (0.00086) [2022-07-10 13:58:10,109][26022] Updated weights on worker 0-0, policy_version 752650 (0.00091) [2022-07-10 13:58:10,935][25689] Fps is (10 sec: 5413.0, 60 sec: 5534.0, 300 sec: 5534.2). Total num frames: 770718720. Throughput: 0: 4853.3. Samples: 770716104. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:10,935][25689] Avg episode reward: [(0, '-2.531')] [2022-07-10 13:58:11,965][26022] Updated weights on worker 0-0, policy_version 752660 (0.00572) [2022-07-10 13:58:13,688][26022] Updated weights on worker 0-0, policy_version 752670 (0.00086) [2022-07-10 13:58:15,675][26022] Updated weights on worker 0-0, policy_version 752680 (0.00089) [2022-07-10 13:58:15,950][25689] Fps is (10 sec: 5508.1, 60 sec: 5499.9, 300 sec: 5534.7). Total num frames: 770745344. Throughput: 0: 5684.5. Samples: 770749670. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:15,950][25689] Avg episode reward: [(0, '-3.450')] [2022-07-10 13:58:17,407][26022] Updated weights on worker 0-0, policy_version 752690 (0.00082) [2022-07-10 13:58:19,161][26022] Updated weights on worker 0-0, policy_version 752700 (0.00082) [2022-07-10 13:58:21,061][25689] Fps is (10 sec: 5460.6, 60 sec: 5498.1, 300 sec: 5529.5). Total num frames: 770774016. Throughput: 0: 5689.7. Samples: 770783390. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:21,062][25689] Avg episode reward: [(0, '-2.735')] [2022-07-10 13:58:21,212][26022] Updated weights on worker 0-0, policy_version 752710 (0.00090) [2022-07-10 13:58:22,847][26022] Updated weights on worker 0-0, policy_version 752720 (0.00072) [2022-07-10 13:58:24,767][26022] Updated weights on worker 0-0, policy_version 752730 (0.00085) [2022-07-10 13:58:26,084][25689] Fps is (10 sec: 5557.4, 60 sec: 5498.8, 300 sec: 5529.2). Total num frames: 770801664. Throughput: 0: 4984.2. Samples: 770800120. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:26,084][25689] Avg episode reward: [(0, '-1.727')] [2022-07-10 13:58:26,542][26022] Updated weights on worker 0-0, policy_version 752740 (0.00086) [2022-07-10 13:58:28,503][26022] Updated weights on worker 0-0, policy_version 752750 (0.00092) [2022-07-10 13:58:30,447][26022] Updated weights on worker 0-0, policy_version 752760 (0.00087) [2022-07-10 13:58:31,126][25689] Fps is (10 sec: 5494.2, 60 sec: 5518.0, 300 sec: 5532.2). Total num frames: 770829312. Throughput: 0: 5808.6. Samples: 770833384. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:31,126][25689] Avg episode reward: [(0, '-1.186')] [2022-07-10 13:58:32,065][26022] Updated weights on worker 0-0, policy_version 752770 (0.00090) [2022-07-10 13:58:34,216][26022] Updated weights on worker 0-0, policy_version 752780 (0.00088) [2022-07-10 13:58:35,810][26022] Updated weights on worker 0-0, policy_version 752790 (0.00086) [2022-07-10 13:58:36,138][25689] Fps is (10 sec: 5601.6, 60 sec: 5517.4, 300 sec: 5532.8). Total num frames: 770857984. Throughput: 0: 5784.6. Samples: 770866452. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:36,138][25689] Avg episode reward: [(0, '-1.377')] [2022-07-10 13:58:36,569][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 13:58:36,584][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000752794_770861056.pth [2022-07-10 13:58:36,585][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000750848_768868352.pth [2022-07-10 13:58:37,822][26022] Updated weights on worker 0-0, policy_version 752800 (0.00091) [2022-07-10 13:58:39,572][26022] Updated weights on worker 0-0, policy_version 752810 (0.00092) [2022-07-10 13:58:41,241][25689] Fps is (10 sec: 5567.7, 60 sec: 5520.2, 300 sec: 5532.1). Total num frames: 770885632. Throughput: 0: 4932.7. Samples: 770882932. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:41,242][25689] Avg episode reward: [(0, '-0.670')] [2022-07-10 13:58:41,623][26022] Updated weights on worker 0-0, policy_version 752820 (0.00086) [2022-07-10 13:58:43,052][26022] Updated weights on worker 0-0, policy_version 752830 (0.00092) [2022-07-10 13:58:45,205][26022] Updated weights on worker 0-0, policy_version 752840 (0.00090) [2022-07-10 13:58:46,263][25689] Fps is (10 sec: 5461.1, 60 sec: 5505.8, 300 sec: 5531.8). Total num frames: 770913280. Throughput: 0: 5761.5. Samples: 770916384. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:46,264][25689] Avg episode reward: [(0, '-0.105')] [2022-07-10 13:58:46,955][26022] Updated weights on worker 0-0, policy_version 752850 (0.00095) [2022-07-10 13:58:49,026][26022] Updated weights on worker 0-0, policy_version 752860 (0.00088) [2022-07-10 13:58:50,589][26022] Updated weights on worker 0-0, policy_version 752870 (0.00098) [2022-07-10 13:58:51,304][25689] Fps is (10 sec: 5495.0, 60 sec: 5485.8, 300 sec: 5528.2). Total num frames: 770940928. Throughput: 0: 5758.8. Samples: 770949588. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:51,305][25689] Avg episode reward: [(0, '-0.649')] [2022-07-10 13:58:52,631][26022] Updated weights on worker 0-0, policy_version 752880 (0.00093) [2022-07-10 13:58:54,416][26022] Updated weights on worker 0-0, policy_version 752890 (0.00085) [2022-07-10 13:58:56,315][25689] Fps is (10 sec: 5501.3, 60 sec: 5503.3, 300 sec: 5526.5). Total num frames: 770968576. Throughput: 0: 4937.8. Samples: 770966082. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:58:56,315][25689] Avg episode reward: [(0, '-2.683')] [2022-07-10 13:58:56,481][26022] Updated weights on worker 0-0, policy_version 752900 (0.00087) [2022-07-10 13:58:57,967][26022] Updated weights on worker 0-0, policy_version 752910 (0.00094) [2022-07-10 13:59:00,147][26022] Updated weights on worker 0-0, policy_version 752920 (0.00094) [2022-07-10 13:59:01,406][25689] Fps is (10 sec: 5575.3, 60 sec: 5501.1, 300 sec: 5538.9). Total num frames: 770997248. Throughput: 0: 5783.6. Samples: 770999556. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:01,408][25689] Avg episode reward: [(0, '-2.249')] [2022-07-10 13:59:02,152][26022] Updated weights on worker 0-0, policy_version 752930 (0.00090) [2022-07-10 13:59:04,097][26022] Updated weights on worker 0-0, policy_version 752940 (0.00094) [2022-07-10 13:59:06,053][26022] Updated weights on worker 0-0, policy_version 752950 (0.00086) [2022-07-10 13:59:06,446][25689] Fps is (10 sec: 5356.8, 60 sec: 5497.6, 300 sec: 5527.9). Total num frames: 771022848. Throughput: 0: 5665.6. Samples: 771030732. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:06,448][25689] Avg episode reward: [(0, '-2.190')] [2022-07-10 13:59:07,651][26022] Updated weights on worker 0-0, policy_version 752960 (0.00088) [2022-07-10 13:59:09,568][26022] Updated weights on worker 0-0, policy_version 752970 (0.00093) [2022-07-10 13:59:11,451][25689] Fps is (10 sec: 5402.7, 60 sec: 5499.3, 300 sec: 5528.2). Total num frames: 771051520. Throughput: 0: 4855.9. Samples: 771047420. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:11,453][25689] Avg episode reward: [(0, '-2.901')] [2022-07-10 13:59:11,453][26022] Updated weights on worker 0-0, policy_version 752980 (0.00118) [2022-07-10 13:59:13,231][26022] Updated weights on worker 0-0, policy_version 752990 (0.00087) [2022-07-10 13:59:15,171][26022] Updated weights on worker 0-0, policy_version 753000 (0.00085) [2022-07-10 13:59:16,483][25689] Fps is (10 sec: 5611.6, 60 sec: 5514.8, 300 sec: 5522.6). Total num frames: 771079168. Throughput: 0: 5706.6. Samples: 771081170. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:16,483][25689] Avg episode reward: [(0, '-2.458')] [2022-07-10 13:59:16,897][26022] Updated weights on worker 0-0, policy_version 753010 (0.00084) [2022-07-10 13:59:18,904][26022] Updated weights on worker 0-0, policy_version 753020 (0.00092) [2022-07-10 13:59:20,626][26022] Updated weights on worker 0-0, policy_version 753030 (0.00087) [2022-07-10 13:59:21,514][25689] Fps is (10 sec: 5597.1, 60 sec: 5522.1, 300 sec: 5533.4). Total num frames: 771107840. Throughput: 0: 5721.1. Samples: 771114594. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:21,514][25689] Avg episode reward: [(0, '-1.897')] [2022-07-10 13:59:22,666][26022] Updated weights on worker 0-0, policy_version 753040 (0.00085) [2022-07-10 13:59:24,015][26022] Updated weights on worker 0-0, policy_version 753050 (0.00083) [2022-07-10 13:59:26,383][26022] Updated weights on worker 0-0, policy_version 753060 (0.00088) [2022-07-10 13:59:26,517][25689] Fps is (10 sec: 5408.3, 60 sec: 5489.9, 300 sec: 5523.4). Total num frames: 771133440. Throughput: 0: 5014.8. Samples: 771131386. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:26,518][25689] Avg episode reward: [(0, '-1.768')] [2022-07-10 13:59:27,845][26022] Updated weights on worker 0-0, policy_version 753070 (0.00083) [2022-07-10 13:59:29,863][26022] Updated weights on worker 0-0, policy_version 753080 (0.00083) [2022-07-10 13:59:31,525][25689] Fps is (10 sec: 5523.6, 60 sec: 5527.0, 300 sec: 5527.0). Total num frames: 771163136. Throughput: 0: 5852.2. Samples: 771164892. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:31,526][25689] Avg episode reward: [(0, '-2.070')] [2022-07-10 13:59:31,538][26022] Updated weights on worker 0-0, policy_version 753090 (0.00088) [2022-07-10 13:59:33,555][26022] Updated weights on worker 0-0, policy_version 753100 (0.00089) [2022-07-10 13:59:35,439][26022] Updated weights on worker 0-0, policy_version 753110 (0.00095) [2022-07-10 13:59:36,527][25689] Fps is (10 sec: 5728.9, 60 sec: 5510.9, 300 sec: 5529.1). Total num frames: 771190784. Throughput: 0: 5840.6. Samples: 771198242. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:36,528][25689] Avg episode reward: [(0, '-1.984')] [2022-07-10 13:59:37,157][26022] Updated weights on worker 0-0, policy_version 753120 (0.00088) [2022-07-10 13:59:39,068][26022] Updated weights on worker 0-0, policy_version 753130 (0.00051) [2022-07-10 13:59:40,820][26022] Updated weights on worker 0-0, policy_version 753140 (0.00092) [2022-07-10 13:59:41,618][25689] Fps is (10 sec: 5579.9, 60 sec: 5529.0, 300 sec: 5530.9). Total num frames: 771219456. Throughput: 0: 4988.2. Samples: 771214878. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:41,618][25689] Avg episode reward: [(0, '-1.203')] [2022-07-10 13:59:42,691][26022] Updated weights on worker 0-0, policy_version 753150 (0.00086) [2022-07-10 13:59:44,591][26022] Updated weights on worker 0-0, policy_version 753160 (0.00303) [2022-07-10 13:59:46,259][26022] Updated weights on worker 0-0, policy_version 753170 (0.00094) [2022-07-10 13:59:46,655][25689] Fps is (10 sec: 5662.1, 60 sec: 5544.6, 300 sec: 5530.7). Total num frames: 771248128. Throughput: 0: 5824.1. Samples: 771248668. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:46,655][25689] Avg episode reward: [(0, '-2.136')] [2022-07-10 13:59:48,261][26022] Updated weights on worker 0-0, policy_version 753180 (0.00085) [2022-07-10 13:59:49,927][26022] Updated weights on worker 0-0, policy_version 753190 (0.00612) [2022-07-10 13:59:51,681][25689] Fps is (10 sec: 5596.7, 60 sec: 5546.0, 300 sec: 5533.9). Total num frames: 771275776. Throughput: 0: 5823.7. Samples: 771282276. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:51,683][25689] Avg episode reward: [(0, '-0.457')] [2022-07-10 13:59:51,869][26022] Updated weights on worker 0-0, policy_version 753200 (0.00090) [2022-07-10 13:59:53,702][26022] Updated weights on worker 0-0, policy_version 753210 (0.00087) [2022-07-10 13:59:55,471][26022] Updated weights on worker 0-0, policy_version 753220 (0.00095) [2022-07-10 13:59:56,708][25689] Fps is (10 sec: 5500.6, 60 sec: 5544.5, 300 sec: 5532.2). Total num frames: 771303424. Throughput: 0: 4992.5. Samples: 771298994. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 13:59:56,708][25689] Avg episode reward: [(0, '-0.530')] [2022-07-10 13:59:57,392][26022] Updated weights on worker 0-0, policy_version 753230 (0.00091) [2022-07-10 13:59:59,052][26022] Updated weights on worker 0-0, policy_version 753240 (0.00089) [2022-07-10 14:00:01,102][26022] Updated weights on worker 0-0, policy_version 753250 (0.00090) [2022-07-10 14:00:01,815][25689] Fps is (10 sec: 5456.6, 60 sec: 5526.1, 300 sec: 5535.1). Total num frames: 771331072. Throughput: 0: 5825.5. Samples: 771332534. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:01,815][25689] Avg episode reward: [(0, '-0.224')] [2022-07-10 14:00:03,446][26022] Updated weights on worker 0-0, policy_version 753260 (0.00084) [2022-07-10 14:00:05,015][26022] Updated weights on worker 0-0, policy_version 753270 (0.00078) [2022-07-10 14:00:06,829][25689] Fps is (10 sec: 5159.6, 60 sec: 5511.5, 300 sec: 5521.8). Total num frames: 771355648. Throughput: 0: 5692.1. Samples: 771363504. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:06,830][25689] Avg episode reward: [(0, '-0.562')] [2022-07-10 14:00:07,151][26022] Updated weights on worker 0-0, policy_version 753280 (0.00087) [2022-07-10 14:00:08,602][26022] Updated weights on worker 0-0, policy_version 753290 (0.00091) [2022-07-10 14:00:10,723][26022] Updated weights on worker 0-0, policy_version 753300 (0.00087) [2022-07-10 14:00:11,832][25689] Fps is (10 sec: 5520.0, 60 sec: 5545.6, 300 sec: 5529.2). Total num frames: 771386368. Throughput: 0: 4869.3. Samples: 771380398. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:11,833][25689] Avg episode reward: [(0, '-2.479')] [2022-07-10 14:00:12,355][26022] Updated weights on worker 0-0, policy_version 753310 (0.00082) [2022-07-10 14:00:14,184][26022] Updated weights on worker 0-0, policy_version 753320 (0.00092) [2022-07-10 14:00:16,191][26022] Updated weights on worker 0-0, policy_version 753330 (0.00082) [2022-07-10 14:00:16,861][25689] Fps is (10 sec: 5818.4, 60 sec: 5545.8, 300 sec: 5526.9). Total num frames: 771414016. Throughput: 0: 5712.3. Samples: 771414116. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:16,861][25689] Avg episode reward: [(0, '-1.928')] [2022-07-10 14:00:17,812][26022] Updated weights on worker 0-0, policy_version 753340 (0.00083) [2022-07-10 14:00:19,802][26022] Updated weights on worker 0-0, policy_version 753350 (0.00089) [2022-07-10 14:00:21,690][26022] Updated weights on worker 0-0, policy_version 753360 (0.00092) [2022-07-10 14:00:21,991][25689] Fps is (10 sec: 5544.0, 60 sec: 5536.8, 300 sec: 5535.8). Total num frames: 771442688. Throughput: 0: 5706.3. Samples: 771447664. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:21,991][25689] Avg episode reward: [(0, '-2.693')] [2022-07-10 14:00:23,237][26022] Updated weights on worker 0-0, policy_version 753370 (0.00086) [2022-07-10 14:00:25,334][26022] Updated weights on worker 0-0, policy_version 753380 (0.00098) [2022-07-10 14:00:26,951][26022] Updated weights on worker 0-0, policy_version 753390 (0.00090) [2022-07-10 14:00:27,070][25689] Fps is (10 sec: 5617.2, 60 sec: 5580.7, 300 sec: 5528.0). Total num frames: 771471360. Throughput: 0: 5002.4. Samples: 771464754. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:27,070][25689] Avg episode reward: [(0, '-2.970')] [2022-07-10 14:00:28,819][26022] Updated weights on worker 0-0, policy_version 753400 (0.00095) [2022-07-10 14:00:31,000][26022] Updated weights on worker 0-0, policy_version 753410 (0.00096) [2022-07-10 14:00:32,143][25689] Fps is (10 sec: 5749.1, 60 sec: 5574.5, 300 sec: 5537.5). Total num frames: 771501056. Throughput: 0: 5802.5. Samples: 771498254. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:32,144][25689] Avg episode reward: [(0, '-3.515')] [2022-07-10 14:00:32,472][26022] Updated weights on worker 0-0, policy_version 753420 (0.00099) [2022-07-10 14:00:34,446][26022] Updated weights on worker 0-0, policy_version 753430 (0.00086) [2022-07-10 14:00:36,399][26022] Updated weights on worker 0-0, policy_version 753440 (0.00085) [2022-07-10 14:00:36,734][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:00:36,758][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000753442_771524608.pth [2022-07-10 14:00:36,758][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000751497_769532928.pth [2022-07-10 14:00:37,159][25689] Fps is (10 sec: 5581.8, 60 sec: 5556.4, 300 sec: 5531.5). Total num frames: 771527680. Throughput: 0: 5787.0. Samples: 771531584. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:37,160][25689] Avg episode reward: [(0, '-2.815')] [2022-07-10 14:00:37,994][26022] Updated weights on worker 0-0, policy_version 753450 (0.00091) [2022-07-10 14:00:39,989][26022] Updated weights on worker 0-0, policy_version 753460 (0.00090) [2022-07-10 14:00:41,779][26022] Updated weights on worker 0-0, policy_version 753470 (0.00085) [2022-07-10 14:00:42,315][25689] Fps is (10 sec: 5335.7, 60 sec: 5533.6, 300 sec: 5528.7). Total num frames: 771555328. Throughput: 0: 5770.2. Samples: 771564938. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:42,315][25689] Avg episode reward: [(0, '-2.159')] [2022-07-10 14:00:43,655][26022] Updated weights on worker 0-0, policy_version 753480 (0.00089) [2022-07-10 14:00:45,581][26022] Updated weights on worker 0-0, policy_version 753490 (0.00093) [2022-07-10 14:00:47,163][26022] Updated weights on worker 0-0, policy_version 753500 (0.00089) [2022-07-10 14:00:47,354][25689] Fps is (10 sec: 5625.2, 60 sec: 5550.3, 300 sec: 5532.3). Total num frames: 771585024. Throughput: 0: 5769.7. Samples: 771581786. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:47,354][25689] Avg episode reward: [(0, '-2.110')] [2022-07-10 14:00:49,095][26022] Updated weights on worker 0-0, policy_version 753510 (0.00083) [2022-07-10 14:00:50,959][26022] Updated weights on worker 0-0, policy_version 753520 (0.00083) [2022-07-10 14:00:52,396][25689] Fps is (10 sec: 5688.1, 60 sec: 5548.8, 300 sec: 5529.1). Total num frames: 771612672. Throughput: 0: 5772.2. Samples: 771615158. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:52,397][25689] Avg episode reward: [(0, '-1.633')] [2022-07-10 14:00:52,827][26022] Updated weights on worker 0-0, policy_version 753530 (0.00086) [2022-07-10 14:00:54,603][26022] Updated weights on worker 0-0, policy_version 753540 (0.00082) [2022-07-10 14:00:56,461][26022] Updated weights on worker 0-0, policy_version 753550 (0.00089) [2022-07-10 14:00:57,428][25689] Fps is (10 sec: 5387.0, 60 sec: 5531.4, 300 sec: 5527.3). Total num frames: 771639296. Throughput: 0: 5796.7. Samples: 771649076. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-10 14:00:57,429][25689] Avg episode reward: [(0, '-1.417')] [2022-07-10 14:00:58,159][26022] Updated weights on worker 0-0, policy_version 753560 (0.00081) [2022-07-10 14:01:00,252][26022] Updated weights on worker 0-0, policy_version 753570 (0.00082) [2022-07-10 14:01:02,196][26022] Updated weights on worker 0-0, policy_version 753580 (0.00092) [2022-07-10 14:01:02,490][25689] Fps is (10 sec: 5376.7, 60 sec: 5535.6, 300 sec: 5531.2). Total num frames: 771666944. Throughput: 0: 5007.3. Samples: 771665962. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:02,491][25689] Avg episode reward: [(0, '-0.569')] [2022-07-10 14:01:04,189][26022] Updated weights on worker 0-0, policy_version 753590 (0.00086) [2022-07-10 14:01:05,886][26022] Updated weights on worker 0-0, policy_version 753600 (0.00089) [2022-07-10 14:01:07,498][25689] Fps is (10 sec: 5491.5, 60 sec: 5586.8, 300 sec: 5528.9). Total num frames: 771694592. Throughput: 0: 5716.0. Samples: 771696930. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:07,498][25689] Avg episode reward: [(0, '-3.335')] [2022-07-10 14:01:07,759][26022] Updated weights on worker 0-0, policy_version 753610 (0.00514) [2022-07-10 14:01:09,706][26022] Updated weights on worker 0-0, policy_version 753620 (0.00087) [2022-07-10 14:01:11,513][26022] Updated weights on worker 0-0, policy_version 753630 (0.00085) [2022-07-10 14:01:12,511][25689] Fps is (10 sec: 5415.8, 60 sec: 5518.4, 300 sec: 5522.5). Total num frames: 771721216. Throughput: 0: 5738.2. Samples: 771730582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:12,512][25689] Avg episode reward: [(0, '-3.173')] [2022-07-10 14:01:13,336][26022] Updated weights on worker 0-0, policy_version 753640 (0.00091) [2022-07-10 14:01:15,307][26022] Updated weights on worker 0-0, policy_version 753650 (0.00082) [2022-07-10 14:01:16,975][26022] Updated weights on worker 0-0, policy_version 753660 (0.00086) [2022-07-10 14:01:17,535][25689] Fps is (10 sec: 5611.2, 60 sec: 5552.6, 300 sec: 5530.3). Total num frames: 771750912. Throughput: 0: 4887.7. Samples: 771747350. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:17,535][25689] Avg episode reward: [(0, '-3.207')] [2022-07-10 14:01:18,928][26022] Updated weights on worker 0-0, policy_version 753670 (0.00085) [2022-07-10 14:01:20,697][26022] Updated weights on worker 0-0, policy_version 753680 (0.00096) [2022-07-10 14:01:22,586][25689] Fps is (10 sec: 5590.2, 60 sec: 5526.0, 300 sec: 5515.9). Total num frames: 771777536. Throughput: 0: 5718.4. Samples: 771780878. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:22,586][25689] Avg episode reward: [(0, '-3.225')] [2022-07-10 14:01:22,661][26022] Updated weights on worker 0-0, policy_version 753690 (0.00090) [2022-07-10 14:01:24,450][26022] Updated weights on worker 0-0, policy_version 753700 (0.00301) [2022-07-10 14:01:26,211][26022] Updated weights on worker 0-0, policy_version 753710 (0.00086) [2022-07-10 14:01:27,618][25689] Fps is (10 sec: 5382.2, 60 sec: 5513.3, 300 sec: 5522.3). Total num frames: 771805184. Throughput: 0: 5832.6. Samples: 771814286. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:27,619][25689] Avg episode reward: [(0, '-2.764')] [2022-07-10 14:01:28,129][26022] Updated weights on worker 0-0, policy_version 753720 (0.00091) [2022-07-10 14:01:29,701][26022] Updated weights on worker 0-0, policy_version 753730 (0.00088) [2022-07-10 14:01:31,702][26022] Updated weights on worker 0-0, policy_version 753740 (0.00092) [2022-07-10 14:01:32,626][25689] Fps is (10 sec: 5711.5, 60 sec: 5519.3, 300 sec: 5529.5). Total num frames: 771834880. Throughput: 0: 5000.0. Samples: 771831156. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:32,626][25689] Avg episode reward: [(0, '-2.886')] [2022-07-10 14:01:33,585][26022] Updated weights on worker 0-0, policy_version 753750 (0.00086) [2022-07-10 14:01:35,452][26022] Updated weights on worker 0-0, policy_version 753760 (0.00087) [2022-07-10 14:01:37,338][26022] Updated weights on worker 0-0, policy_version 753770 (0.00090) [2022-07-10 14:01:37,642][25689] Fps is (10 sec: 5721.0, 60 sec: 5536.3, 300 sec: 5520.7). Total num frames: 771862528. Throughput: 0: 5826.7. Samples: 771864508. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:37,642][25689] Avg episode reward: [(0, '0.191')] [2022-07-10 14:01:39,120][26022] Updated weights on worker 0-0, policy_version 753780 (0.00093) [2022-07-10 14:01:40,730][26022] Updated weights on worker 0-0, policy_version 753790 (0.00089) [2022-07-10 14:01:42,721][25689] Fps is (10 sec: 5376.0, 60 sec: 5526.3, 300 sec: 5519.7). Total num frames: 771889152. Throughput: 0: 5826.1. Samples: 771898188. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:42,722][25689] Avg episode reward: [(0, '0.139')] [2022-07-10 14:01:43,004][26022] Updated weights on worker 0-0, policy_version 753800 (0.00086) [2022-07-10 14:01:44,659][26022] Updated weights on worker 0-0, policy_version 753810 (0.00095) [2022-07-10 14:01:46,459][26022] Updated weights on worker 0-0, policy_version 753820 (0.00094) [2022-07-10 14:01:47,743][25689] Fps is (10 sec: 5575.7, 60 sec: 5527.9, 300 sec: 5533.4). Total num frames: 771918848. Throughput: 0: 5005.1. Samples: 771915010. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:47,744][25689] Avg episode reward: [(0, '-1.326')] [2022-07-10 14:01:48,211][26022] Updated weights on worker 0-0, policy_version 753830 (0.00090) [2022-07-10 14:01:50,007][26022] Updated weights on worker 0-0, policy_version 753840 (0.00098) [2022-07-10 14:01:51,910][26022] Updated weights on worker 0-0, policy_version 753850 (0.00084) [2022-07-10 14:01:52,745][25689] Fps is (10 sec: 5822.7, 60 sec: 5548.5, 300 sec: 5526.8). Total num frames: 771947520. Throughput: 0: 5837.5. Samples: 771948604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:52,747][25689] Avg episode reward: [(0, '-1.208')] [2022-07-10 14:01:53,792][26022] Updated weights on worker 0-0, policy_version 753860 (0.00093) [2022-07-10 14:01:55,463][26022] Updated weights on worker 0-0, policy_version 753870 (0.00099) [2022-07-10 14:01:57,661][26022] Updated weights on worker 0-0, policy_version 753880 (0.00089) [2022-07-10 14:01:57,797][25689] Fps is (10 sec: 5499.9, 60 sec: 5546.7, 300 sec: 5527.9). Total num frames: 771974144. Throughput: 0: 5846.8. Samples: 771982352. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:01:57,798][25689] Avg episode reward: [(0, '-2.537')] [2022-07-10 14:01:59,060][26022] Updated weights on worker 0-0, policy_version 753890 (0.00097) [2022-07-10 14:02:01,420][26022] Updated weights on worker 0-0, policy_version 753900 (0.00084) [2022-07-10 14:02:02,857][25689] Fps is (10 sec: 5468.9, 60 sec: 5563.9, 300 sec: 5535.0). Total num frames: 772002816. Throughput: 0: 5012.7. Samples: 771999120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:02,857][25689] Avg episode reward: [(0, '-2.514')] [2022-07-10 14:02:03,032][26022] Updated weights on worker 0-0, policy_version 753910 (0.00087) [2022-07-10 14:02:05,134][26022] Updated weights on worker 0-0, policy_version 753920 (0.00084) [2022-07-10 14:02:06,945][26022] Updated weights on worker 0-0, policy_version 753930 (0.00092) [2022-07-10 14:02:07,858][25689] Fps is (10 sec: 5292.2, 60 sec: 5513.5, 300 sec: 5525.7). Total num frames: 772027392. Throughput: 0: 5741.1. Samples: 772030496. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:07,859][25689] Avg episode reward: [(0, '-2.706')] [2022-07-10 14:02:08,744][26022] Updated weights on worker 0-0, policy_version 753940 (0.00095) [2022-07-10 14:02:10,514][26022] Updated weights on worker 0-0, policy_version 753950 (0.00085) [2022-07-10 14:02:12,505][26022] Updated weights on worker 0-0, policy_version 753960 (0.00083) [2022-07-10 14:02:12,863][25689] Fps is (10 sec: 5321.5, 60 sec: 5548.3, 300 sec: 5525.9). Total num frames: 772056064. Throughput: 0: 5747.9. Samples: 772064236. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:12,863][25689] Avg episode reward: [(0, '-3.175')] [2022-07-10 14:02:14,112][26022] Updated weights on worker 0-0, policy_version 753970 (0.00086) [2022-07-10 14:02:16,187][26022] Updated weights on worker 0-0, policy_version 753980 (0.00089) [2022-07-10 14:02:17,593][26022] Updated weights on worker 0-0, policy_version 753990 (0.00094) [2022-07-10 14:02:17,871][25689] Fps is (10 sec: 5829.5, 60 sec: 5549.7, 300 sec: 5530.9). Total num frames: 772085760. Throughput: 0: 4918.0. Samples: 772081076. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:17,871][25689] Avg episode reward: [(0, '-2.038')] [2022-07-10 14:02:19,862][26022] Updated weights on worker 0-0, policy_version 754000 (0.00092) [2022-07-10 14:02:21,391][26022] Updated weights on worker 0-0, policy_version 754010 (0.00086) [2022-07-10 14:02:22,935][25689] Fps is (10 sec: 5591.7, 60 sec: 5548.5, 300 sec: 5526.8). Total num frames: 772112384. Throughput: 0: 5743.5. Samples: 772114438. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:22,935][25689] Avg episode reward: [(0, '-4.526')] [2022-07-10 14:02:23,570][26022] Updated weights on worker 0-0, policy_version 754020 (0.00085) [2022-07-10 14:02:25,216][26022] Updated weights on worker 0-0, policy_version 754030 (0.00089) [2022-07-10 14:02:27,139][26022] Updated weights on worker 0-0, policy_version 754040 (0.00085) [2022-07-10 14:02:27,948][25689] Fps is (10 sec: 5487.1, 60 sec: 5567.2, 300 sec: 5534.7). Total num frames: 772141056. Throughput: 0: 5873.8. Samples: 772148500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:27,949][25689] Avg episode reward: [(0, '-4.286')] [2022-07-10 14:02:28,729][26022] Updated weights on worker 0-0, policy_version 754050 (0.00088) [2022-07-10 14:02:30,759][26022] Updated weights on worker 0-0, policy_version 754060 (0.00082) [2022-07-10 14:02:32,401][26022] Updated weights on worker 0-0, policy_version 754070 (0.00088) [2022-07-10 14:02:32,967][25689] Fps is (10 sec: 5716.1, 60 sec: 5549.3, 300 sec: 5534.4). Total num frames: 772169728. Throughput: 0: 5031.0. Samples: 772165380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:32,967][25689] Avg episode reward: [(0, '-5.140')] [2022-07-10 14:02:34,407][26022] Updated weights on worker 0-0, policy_version 754080 (0.00092) [2022-07-10 14:02:36,156][26022] Updated weights on worker 0-0, policy_version 754090 (0.00090) [2022-07-10 14:02:36,940][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:02:36,953][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000754094_772192256.pth [2022-07-10 14:02:36,953][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000752149_770200576.pth [2022-07-10 14:02:37,993][25689] Fps is (10 sec: 5505.1, 60 sec: 5531.3, 300 sec: 5533.0). Total num frames: 772196352. Throughput: 0: 5866.1. Samples: 772199114. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:37,993][25689] Avg episode reward: [(0, '-4.883')] [2022-07-10 14:02:38,102][26022] Updated weights on worker 0-0, policy_version 754100 (0.00090) [2022-07-10 14:02:39,685][26022] Updated weights on worker 0-0, policy_version 754110 (0.00085) [2022-07-10 14:02:41,878][26022] Updated weights on worker 0-0, policy_version 754120 (0.00089) [2022-07-10 14:02:43,070][25689] Fps is (10 sec: 5676.0, 60 sec: 5599.5, 300 sec: 5539.4). Total num frames: 772227072. Throughput: 0: 5846.8. Samples: 772232162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:43,070][25689] Avg episode reward: [(0, '-4.345')] [2022-07-10 14:02:43,467][26022] Updated weights on worker 0-0, policy_version 754130 (0.00088) [2022-07-10 14:02:45,626][26022] Updated weights on worker 0-0, policy_version 754140 (0.00093) [2022-07-10 14:02:47,185][26022] Updated weights on worker 0-0, policy_version 754150 (0.00086) [2022-07-10 14:02:48,130][25689] Fps is (10 sec: 5657.1, 60 sec: 5545.0, 300 sec: 5531.5). Total num frames: 772253696. Throughput: 0: 4972.5. Samples: 772248852. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:48,130][25689] Avg episode reward: [(0, '-2.518')] [2022-07-10 14:02:49,361][26022] Updated weights on worker 0-0, policy_version 754160 (0.00089) [2022-07-10 14:02:50,885][26022] Updated weights on worker 0-0, policy_version 754170 (0.00086) [2022-07-10 14:02:52,856][26022] Updated weights on worker 0-0, policy_version 754180 (0.00091) [2022-07-10 14:02:53,203][25689] Fps is (10 sec: 5457.0, 60 sec: 5538.6, 300 sec: 5537.4). Total num frames: 772282368. Throughput: 0: 5773.7. Samples: 772282216. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:53,205][25689] Avg episode reward: [(0, '-2.578')] [2022-07-10 14:02:54,694][26022] Updated weights on worker 0-0, policy_version 754190 (0.00085) [2022-07-10 14:02:56,468][26022] Updated weights on worker 0-0, policy_version 754200 (0.00095) [2022-07-10 14:02:58,208][25689] Fps is (10 sec: 5588.0, 60 sec: 5559.7, 300 sec: 5535.0). Total num frames: 772310016. Throughput: 0: 5753.0. Samples: 772315414. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:02:58,209][25689] Avg episode reward: [(0, '-2.506')] [2022-07-10 14:02:58,298][26022] Updated weights on worker 0-0, policy_version 754210 (0.00087) [2022-07-10 14:03:00,267][26022] Updated weights on worker 0-0, policy_version 754220 (0.00084) [2022-07-10 14:03:02,228][26022] Updated weights on worker 0-0, policy_version 754230 (0.00095) [2022-07-10 14:03:03,257][25689] Fps is (10 sec: 5296.2, 60 sec: 5509.9, 300 sec: 5534.2). Total num frames: 772335616. Throughput: 0: 4959.1. Samples: 772332272. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:03,257][25689] Avg episode reward: [(0, '-1.519')] [2022-07-10 14:03:04,191][26022] Updated weights on worker 0-0, policy_version 754240 (0.00085) [2022-07-10 14:03:05,820][26022] Updated weights on worker 0-0, policy_version 754250 (0.00085) [2022-07-10 14:03:08,000][26022] Updated weights on worker 0-0, policy_version 754260 (0.00085) [2022-07-10 14:03:08,308][25689] Fps is (10 sec: 5373.6, 60 sec: 5573.2, 300 sec: 5533.7). Total num frames: 772364288. Throughput: 0: 5695.3. Samples: 772363774. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:08,309][25689] Avg episode reward: [(0, '-1.480')] [2022-07-10 14:03:09,625][26022] Updated weights on worker 0-0, policy_version 754270 (0.00092) [2022-07-10 14:03:11,681][26022] Updated weights on worker 0-0, policy_version 754280 (0.00087) [2022-07-10 14:03:13,278][26022] Updated weights on worker 0-0, policy_version 754290 (0.00090) [2022-07-10 14:03:13,374][25689] Fps is (10 sec: 5667.7, 60 sec: 5567.5, 300 sec: 5539.6). Total num frames: 772392960. Throughput: 0: 5698.0. Samples: 772397154. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:13,375][25689] Avg episode reward: [(0, '-2.414')] [2022-07-10 14:03:15,161][26022] Updated weights on worker 0-0, policy_version 754300 (0.00097) [2022-07-10 14:03:17,027][26022] Updated weights on worker 0-0, policy_version 754310 (0.00084) [2022-07-10 14:03:18,400][25689] Fps is (10 sec: 5479.4, 60 sec: 5515.1, 300 sec: 5534.3). Total num frames: 772419584. Throughput: 0: 4882.4. Samples: 772413996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:18,400][25689] Avg episode reward: [(0, '-1.509')] [2022-07-10 14:03:18,780][26022] Updated weights on worker 0-0, policy_version 754320 (0.00091) [2022-07-10 14:03:20,719][26022] Updated weights on worker 0-0, policy_version 754330 (0.00094) [2022-07-10 14:03:22,606][26022] Updated weights on worker 0-0, policy_version 754340 (0.00100) [2022-07-10 14:03:23,504][25689] Fps is (10 sec: 5559.8, 60 sec: 5562.1, 300 sec: 5539.7). Total num frames: 772449280. Throughput: 0: 5692.2. Samples: 772447526. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:23,505][25689] Avg episode reward: [(0, '-2.286')] [2022-07-10 14:03:24,382][26022] Updated weights on worker 0-0, policy_version 754350 (0.00088) [2022-07-10 14:03:26,186][26022] Updated weights on worker 0-0, policy_version 754360 (0.00092) [2022-07-10 14:03:28,066][26022] Updated weights on worker 0-0, policy_version 754370 (0.00084) [2022-07-10 14:03:28,517][25689] Fps is (10 sec: 5566.6, 60 sec: 5528.4, 300 sec: 5536.8). Total num frames: 772475904. Throughput: 0: 5789.4. Samples: 772480774. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:28,518][25689] Avg episode reward: [(0, '-2.504')] [2022-07-10 14:03:29,943][26022] Updated weights on worker 0-0, policy_version 754380 (0.00083) [2022-07-10 14:03:31,713][26022] Updated weights on worker 0-0, policy_version 754390 (0.00084) [2022-07-10 14:03:33,520][25689] Fps is (10 sec: 5521.0, 60 sec: 5529.8, 300 sec: 5537.0). Total num frames: 772504576. Throughput: 0: 4984.7. Samples: 772497572. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:33,520][25689] Avg episode reward: [(0, '-3.498')] [2022-07-10 14:03:33,575][26022] Updated weights on worker 0-0, policy_version 754400 (0.00089) [2022-07-10 14:03:35,326][26022] Updated weights on worker 0-0, policy_version 754410 (0.00089) [2022-07-10 14:03:37,200][26022] Updated weights on worker 0-0, policy_version 754420 (0.00098) [2022-07-10 14:03:38,535][25689] Fps is (10 sec: 5724.5, 60 sec: 5564.7, 300 sec: 5542.1). Total num frames: 772533248. Throughput: 0: 5832.2. Samples: 772531426. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:38,535][25689] Avg episode reward: [(0, '-3.483')] [2022-07-10 14:03:39,146][26022] Updated weights on worker 0-0, policy_version 754430 (0.00050) [2022-07-10 14:03:40,943][26022] Updated weights on worker 0-0, policy_version 754440 (0.00093) [2022-07-10 14:03:42,877][26022] Updated weights on worker 0-0, policy_version 754450 (0.00080) [2022-07-10 14:03:43,673][25689] Fps is (10 sec: 5547.3, 60 sec: 5508.4, 300 sec: 5539.9). Total num frames: 772560896. Throughput: 0: 5816.1. Samples: 772564826. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:43,673][25689] Avg episode reward: [(0, '-2.187')] [2022-07-10 14:03:44,609][26022] Updated weights on worker 0-0, policy_version 754460 (0.00071) [2022-07-10 14:03:46,475][26022] Updated weights on worker 0-0, policy_version 754470 (0.00088) [2022-07-10 14:03:48,363][26022] Updated weights on worker 0-0, policy_version 754480 (0.00086) [2022-07-10 14:03:48,702][25689] Fps is (10 sec: 5539.3, 60 sec: 5545.0, 300 sec: 5543.6). Total num frames: 772589568. Throughput: 0: 5835.5. Samples: 772598562. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:48,703][25689] Avg episode reward: [(0, '-2.118')] [2022-07-10 14:03:50,174][26022] Updated weights on worker 0-0, policy_version 754490 (0.00087) [2022-07-10 14:03:52,085][26022] Updated weights on worker 0-0, policy_version 754500 (0.00087) [2022-07-10 14:03:53,733][25689] Fps is (10 sec: 5598.1, 60 sec: 5531.9, 300 sec: 5543.2). Total num frames: 772617216. Throughput: 0: 5802.4. Samples: 772614858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:53,734][25689] Avg episode reward: [(0, '-2.233')] [2022-07-10 14:03:53,906][26022] Updated weights on worker 0-0, policy_version 754510 (0.00089) [2022-07-10 14:03:55,624][26022] Updated weights on worker 0-0, policy_version 754520 (0.00087) [2022-07-10 14:03:57,699][26022] Updated weights on worker 0-0, policy_version 754530 (0.00093) [2022-07-10 14:03:58,761][25689] Fps is (10 sec: 5497.2, 60 sec: 5529.9, 300 sec: 5540.9). Total num frames: 772644864. Throughput: 0: 5776.8. Samples: 772648268. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:03:58,762][25689] Avg episode reward: [(0, '-0.764')] [2022-07-10 14:03:59,271][26022] Updated weights on worker 0-0, policy_version 754540 (0.00094) [2022-07-10 14:04:01,163][26022] Updated weights on worker 0-0, policy_version 754550 (0.00091) [2022-07-10 14:04:03,270][26022] Updated weights on worker 0-0, policy_version 754560 (0.00737) [2022-07-10 14:04:03,816][25689] Fps is (10 sec: 5484.4, 60 sec: 5563.1, 300 sec: 5547.5). Total num frames: 772672512. Throughput: 0: 5720.3. Samples: 772680050. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:04:03,816][25689] Avg episode reward: [(0, '-1.527')] [2022-07-10 14:04:05,285][26022] Updated weights on worker 0-0, policy_version 754570 (0.00082) [2022-07-10 14:04:06,834][26022] Updated weights on worker 0-0, policy_version 754580 (0.00087) [2022-07-10 14:04:08,832][25689] Fps is (10 sec: 5388.7, 60 sec: 5532.5, 300 sec: 5540.4). Total num frames: 772699136. Throughput: 0: 4892.7. Samples: 772697054. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:04:08,833][25689] Avg episode reward: [(0, '-2.381')] [2022-07-10 14:04:08,975][26022] Updated weights on worker 0-0, policy_version 754590 (0.00099) [2022-07-10 14:04:10,552][26022] Updated weights on worker 0-0, policy_version 754600 (0.00089) [2022-07-10 14:04:12,720][26022] Updated weights on worker 0-0, policy_version 754610 (0.00090) [2022-07-10 14:04:13,867][25689] Fps is (10 sec: 5399.6, 60 sec: 5518.5, 300 sec: 5540.4). Total num frames: 772726784. Throughput: 0: 5739.5. Samples: 772730414. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:04:13,867][25689] Avg episode reward: [(0, '-3.554')] [2022-07-10 14:04:14,159][26022] Updated weights on worker 0-0, policy_version 754620 (0.00095) [2022-07-10 14:04:16,369][26022] Updated weights on worker 0-0, policy_version 754630 (0.00081) [2022-07-10 14:04:17,899][26022] Updated weights on worker 0-0, policy_version 754640 (0.00110) [2022-07-10 14:04:18,893][25689] Fps is (10 sec: 5699.9, 60 sec: 5569.2, 300 sec: 5543.9). Total num frames: 772756480. Throughput: 0: 5753.3. Samples: 772764094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:04:18,894][25689] Avg episode reward: [(0, '-4.696')] [2022-07-10 14:04:19,834][26022] Updated weights on worker 0-0, policy_version 754650 (0.00091) [2022-07-10 14:04:21,474][26022] Updated weights on worker 0-0, policy_version 754660 (0.00087) [2022-07-10 14:04:23,435][26022] Updated weights on worker 0-0, policy_version 754670 (0.00079) [2022-07-10 14:04:23,950][25689] Fps is (10 sec: 5788.9, 60 sec: 5556.6, 300 sec: 5553.2). Total num frames: 772785152. Throughput: 0: 5022.5. Samples: 772781172. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-10 14:04:23,950][25689] Avg episode reward: [(0, '-4.496')] [2022-07-10 14:04:25,283][26022] Updated weights on worker 0-0, policy_version 754680 (0.00084) [2022-07-10 14:04:27,072][26022] Updated weights on worker 0-0, policy_version 754690 (0.00080) [2022-07-10 14:04:28,906][26022] Updated weights on worker 0-0, policy_version 754700 (0.00087) [2022-07-10 14:04:28,967][25689] Fps is (10 sec: 5692.1, 60 sec: 5590.1, 300 sec: 5549.6). Total num frames: 772813824. Throughput: 0: 5855.7. Samples: 772814958. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:28,968][25689] Avg episode reward: [(0, '-4.132')] [2022-07-10 14:04:30,964][26022] Updated weights on worker 0-0, policy_version 754710 (0.00087) [2022-07-10 14:04:32,405][26022] Updated weights on worker 0-0, policy_version 754720 (0.00086) [2022-07-10 14:04:33,978][25689] Fps is (10 sec: 5411.8, 60 sec: 5538.5, 300 sec: 5542.6). Total num frames: 772839424. Throughput: 0: 5865.5. Samples: 772848376. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:33,979][25689] Avg episode reward: [(0, '-3.615')] [2022-07-10 14:04:34,467][26022] Updated weights on worker 0-0, policy_version 754730 (0.00091) [2022-07-10 14:04:36,175][26022] Updated weights on worker 0-0, policy_version 754740 (0.00089) [2022-07-10 14:04:37,075][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:04:37,088][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000754744_772857856.pth [2022-07-10 14:04:37,089][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000752794_770861056.pth [2022-07-10 14:04:38,077][26022] Updated weights on worker 0-0, policy_version 754750 (0.00082) [2022-07-10 14:04:38,995][25689] Fps is (10 sec: 5412.4, 60 sec: 5538.4, 300 sec: 5543.9). Total num frames: 772868096. Throughput: 0: 5036.6. Samples: 772865338. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:38,995][25689] Avg episode reward: [(0, '-2.895')] [2022-07-10 14:04:40,238][26022] Updated weights on worker 0-0, policy_version 754760 (0.00089) [2022-07-10 14:04:41,647][26022] Updated weights on worker 0-0, policy_version 754770 (0.00084) [2022-07-10 14:04:43,504][26022] Updated weights on worker 0-0, policy_version 754780 (0.00086) [2022-07-10 14:04:44,053][25689] Fps is (10 sec: 5793.1, 60 sec: 5579.5, 300 sec: 5547.0). Total num frames: 772897792. Throughput: 0: 5848.4. Samples: 772898748. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:44,054][25689] Avg episode reward: [(0, '-3.257')] [2022-07-10 14:04:45,480][26022] Updated weights on worker 0-0, policy_version 754790 (0.00087) [2022-07-10 14:04:47,210][26022] Updated weights on worker 0-0, policy_version 754800 (0.00101) [2022-07-10 14:04:49,055][25689] Fps is (10 sec: 5496.5, 60 sec: 5531.2, 300 sec: 5540.6). Total num frames: 772923392. Throughput: 0: 5846.6. Samples: 772932402. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:49,055][25689] Avg episode reward: [(0, '-2.163')] [2022-07-10 14:04:49,224][26022] Updated weights on worker 0-0, policy_version 754810 (0.00085) [2022-07-10 14:04:50,940][26022] Updated weights on worker 0-0, policy_version 754820 (0.00088) [2022-07-10 14:04:52,811][26022] Updated weights on worker 0-0, policy_version 754830 (0.00088) [2022-07-10 14:04:54,062][25689] Fps is (10 sec: 5422.5, 60 sec: 5550.4, 300 sec: 5544.4). Total num frames: 772952064. Throughput: 0: 5012.9. Samples: 772949056. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:54,065][25689] Avg episode reward: [(0, '-2.676')] [2022-07-10 14:04:54,544][26022] Updated weights on worker 0-0, policy_version 754840 (0.00090) [2022-07-10 14:04:56,553][26022] Updated weights on worker 0-0, policy_version 754850 (0.00089) [2022-07-10 14:04:58,233][26022] Updated weights on worker 0-0, policy_version 754860 (0.00086) [2022-07-10 14:04:59,106][25689] Fps is (10 sec: 5806.9, 60 sec: 5582.8, 300 sec: 5552.4). Total num frames: 772981760. Throughput: 0: 5831.9. Samples: 772982628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:04:59,107][25689] Avg episode reward: [(0, '-2.029')] [2022-07-10 14:05:00,343][26022] Updated weights on worker 0-0, policy_version 754870 (0.00098) [2022-07-10 14:05:02,259][26022] Updated weights on worker 0-0, policy_version 754880 (0.00088) [2022-07-10 14:05:04,151][25689] Fps is (10 sec: 5278.0, 60 sec: 5515.9, 300 sec: 5548.4). Total num frames: 773005312. Throughput: 0: 5738.2. Samples: 773014070. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:04,151][25689] Avg episode reward: [(0, '-1.368')] [2022-07-10 14:05:04,346][26022] Updated weights on worker 0-0, policy_version 754890 (0.00808) [2022-07-10 14:05:05,679][26022] Updated weights on worker 0-0, policy_version 754900 (0.00078) [2022-07-10 14:05:07,974][26022] Updated weights on worker 0-0, policy_version 754910 (0.00087) [2022-07-10 14:05:09,159][25689] Fps is (10 sec: 5296.8, 60 sec: 5567.6, 300 sec: 5544.9). Total num frames: 773035008. Throughput: 0: 4894.3. Samples: 773030798. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:09,159][25689] Avg episode reward: [(0, '-0.539')] [2022-07-10 14:05:09,767][26022] Updated weights on worker 0-0, policy_version 754920 (0.00085) [2022-07-10 14:05:11,613][26022] Updated weights on worker 0-0, policy_version 754930 (0.00087) [2022-07-10 14:05:13,298][26022] Updated weights on worker 0-0, policy_version 754940 (0.00092) [2022-07-10 14:05:14,175][25689] Fps is (10 sec: 5822.9, 60 sec: 5586.3, 300 sec: 5548.6). Total num frames: 773063680. Throughput: 0: 5733.0. Samples: 773064362. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:14,175][25689] Avg episode reward: [(0, '0.475')] [2022-07-10 14:05:15,359][26022] Updated weights on worker 0-0, policy_version 754950 (0.00083) [2022-07-10 14:05:16,917][26022] Updated weights on worker 0-0, policy_version 754960 (0.00087) [2022-07-10 14:05:19,012][26022] Updated weights on worker 0-0, policy_version 754970 (0.00091) [2022-07-10 14:05:19,189][25689] Fps is (10 sec: 5513.2, 60 sec: 5536.4, 300 sec: 5543.8). Total num frames: 773090304. Throughput: 0: 5751.7. Samples: 773098136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:19,189][25689] Avg episode reward: [(0, '0.648')] [2022-07-10 14:05:20,463][26022] Updated weights on worker 0-0, policy_version 754980 (0.00083) [2022-07-10 14:05:22,548][26022] Updated weights on worker 0-0, policy_version 754990 (0.00091) [2022-07-10 14:05:24,168][26022] Updated weights on worker 0-0, policy_version 755000 (0.00088) [2022-07-10 14:05:24,232][25689] Fps is (10 sec: 5599.8, 60 sec: 5554.6, 300 sec: 5547.9). Total num frames: 773120000. Throughput: 0: 5025.7. Samples: 773114992. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:24,233][25689] Avg episode reward: [(0, '0.815')] [2022-07-10 14:05:26,154][26022] Updated weights on worker 0-0, policy_version 755011 (0.00089) [2022-07-10 14:05:27,987][26022] Updated weights on worker 0-0, policy_version 755021 (0.00381) [2022-07-10 14:05:29,248][25689] Fps is (10 sec: 5802.5, 60 sec: 5554.8, 300 sec: 5545.6). Total num frames: 773148672. Throughput: 0: 5870.1. Samples: 773148722. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:29,250][25689] Avg episode reward: [(0, '0.851')] [2022-07-10 14:05:30,061][26022] Updated weights on worker 0-0, policy_version 755031 (0.00090) [2022-07-10 14:05:31,788][26022] Updated weights on worker 0-0, policy_version 755041 (0.00087) [2022-07-10 14:05:33,955][26022] Updated weights on worker 0-0, policy_version 755051 (0.00086) [2022-07-10 14:05:34,263][25689] Fps is (10 sec: 5410.3, 60 sec: 5554.4, 300 sec: 5542.1). Total num frames: 773174272. Throughput: 0: 5865.2. Samples: 773182186. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:34,265][25689] Avg episode reward: [(0, '0.255')] [2022-07-10 14:05:35,372][26022] Updated weights on worker 0-0, policy_version 755061 (0.00095) [2022-07-10 14:05:37,486][26022] Updated weights on worker 0-0, policy_version 755071 (0.00084) [2022-07-10 14:05:39,082][26022] Updated weights on worker 0-0, policy_version 755081 (0.00088) [2022-07-10 14:05:39,276][25689] Fps is (10 sec: 5514.0, 60 sec: 5571.7, 300 sec: 5551.7). Total num frames: 773203968. Throughput: 0: 5013.0. Samples: 773198834. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:39,278][25689] Avg episode reward: [(0, '-0.273')] [2022-07-10 14:05:41,023][26022] Updated weights on worker 0-0, policy_version 755091 (0.00091) [2022-07-10 14:05:42,880][26022] Updated weights on worker 0-0, policy_version 755101 (0.00087) [2022-07-10 14:05:44,315][25689] Fps is (10 sec: 5704.7, 60 sec: 5539.6, 300 sec: 5544.8). Total num frames: 773231616. Throughput: 0: 5825.5. Samples: 773231986. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:44,316][25689] Avg episode reward: [(0, '-1.206')] [2022-07-10 14:05:44,960][26022] Updated weights on worker 0-0, policy_version 755111 (0.00090) [2022-07-10 14:05:46,507][26022] Updated weights on worker 0-0, policy_version 755121 (0.00104) [2022-07-10 14:05:48,558][26022] Updated weights on worker 0-0, policy_version 755131 (0.00085) [2022-07-10 14:05:49,317][25689] Fps is (10 sec: 5405.2, 60 sec: 5556.5, 300 sec: 5542.1). Total num frames: 773258240. Throughput: 0: 5811.2. Samples: 773265346. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:49,318][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 14:05:50,192][26022] Updated weights on worker 0-0, policy_version 755141 (0.00088) [2022-07-10 14:05:52,392][26022] Updated weights on worker 0-0, policy_version 755151 (0.00094) [2022-07-10 14:05:54,114][26022] Updated weights on worker 0-0, policy_version 755161 (0.00095) [2022-07-10 14:05:54,356][25689] Fps is (10 sec: 5404.9, 60 sec: 5536.6, 300 sec: 5545.4). Total num frames: 773285888. Throughput: 0: 4960.0. Samples: 773281848. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:54,357][25689] Avg episode reward: [(0, '-3.115')] [2022-07-10 14:05:55,982][26022] Updated weights on worker 0-0, policy_version 755171 (0.00091) [2022-07-10 14:05:57,696][26022] Updated weights on worker 0-0, policy_version 755181 (0.00080) [2022-07-10 14:05:59,366][25689] Fps is (10 sec: 5604.5, 60 sec: 5522.8, 300 sec: 5549.9). Total num frames: 773314560. Throughput: 0: 5786.4. Samples: 773315080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:05:59,367][25689] Avg episode reward: [(0, '-2.921')] [2022-07-10 14:05:59,657][26022] Updated weights on worker 0-0, policy_version 755191 (0.00053) [2022-07-10 14:06:01,600][26022] Updated weights on worker 0-0, policy_version 755201 (0.00091) [2022-07-10 14:06:03,847][26022] Updated weights on worker 0-0, policy_version 755211 (0.00097) [2022-07-10 14:06:04,412][25689] Fps is (10 sec: 5397.2, 60 sec: 5556.6, 300 sec: 5542.2). Total num frames: 773340160. Throughput: 0: 5696.8. Samples: 773346474. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:04,413][25689] Avg episode reward: [(0, '-2.426')] [2022-07-10 14:06:05,397][26022] Updated weights on worker 0-0, policy_version 755221 (0.00090) [2022-07-10 14:06:07,424][26022] Updated weights on worker 0-0, policy_version 755231 (0.00082) [2022-07-10 14:06:09,215][26022] Updated weights on worker 0-0, policy_version 755241 (0.00091) [2022-07-10 14:06:09,429][25689] Fps is (10 sec: 5189.5, 60 sec: 5504.8, 300 sec: 5542.2). Total num frames: 773366784. Throughput: 0: 4861.1. Samples: 773363114. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:09,430][25689] Avg episode reward: [(0, '-1.727')] [2022-07-10 14:06:11,367][26022] Updated weights on worker 0-0, policy_version 755252 (0.00087) [2022-07-10 14:06:12,999][26022] Updated weights on worker 0-0, policy_version 755262 (0.00088) [2022-07-10 14:06:14,455][25689] Fps is (10 sec: 5506.1, 60 sec: 5503.9, 300 sec: 5538.7). Total num frames: 773395456. Throughput: 0: 5694.5. Samples: 773396296. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:14,455][25689] Avg episode reward: [(0, '-1.587')] [2022-07-10 14:06:14,888][26022] Updated weights on worker 0-0, policy_version 755272 (0.00085) [2022-07-10 14:06:16,589][26022] Updated weights on worker 0-0, policy_version 755282 (0.00094) [2022-07-10 14:06:18,754][26022] Updated weights on worker 0-0, policy_version 755292 (0.00109) [2022-07-10 14:06:19,460][25689] Fps is (10 sec: 5717.0, 60 sec: 5538.7, 300 sec: 5546.4). Total num frames: 773424128. Throughput: 0: 5726.3. Samples: 773430144. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:19,462][25689] Avg episode reward: [(0, '-2.075')] [2022-07-10 14:06:20,439][26022] Updated weights on worker 0-0, policy_version 755302 (0.00083) [2022-07-10 14:06:22,282][26022] Updated weights on worker 0-0, policy_version 755312 (0.00090) [2022-07-10 14:06:24,142][26022] Updated weights on worker 0-0, policy_version 755322 (0.00088) [2022-07-10 14:06:24,531][25689] Fps is (10 sec: 5589.5, 60 sec: 5502.2, 300 sec: 5545.7). Total num frames: 773451776. Throughput: 0: 4997.0. Samples: 773447006. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:24,531][25689] Avg episode reward: [(0, '-0.235')] [2022-07-10 14:06:25,799][26022] Updated weights on worker 0-0, policy_version 755332 (0.00100) [2022-07-10 14:06:27,769][26022] Updated weights on worker 0-0, policy_version 755342 (0.00086) [2022-07-10 14:06:29,570][25689] Fps is (10 sec: 5469.2, 60 sec: 5483.1, 300 sec: 5538.2). Total num frames: 773479424. Throughput: 0: 5822.1. Samples: 773480376. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:29,571][25689] Avg episode reward: [(0, '-0.470')] [2022-07-10 14:06:29,701][26022] Updated weights on worker 0-0, policy_version 755352 (0.00086) [2022-07-10 14:06:31,467][26022] Updated weights on worker 0-0, policy_version 755362 (0.00087) [2022-07-10 14:06:33,365][26022] Updated weights on worker 0-0, policy_version 755372 (0.00091) [2022-07-10 14:06:34,583][25689] Fps is (10 sec: 5602.8, 60 sec: 5534.3, 300 sec: 5541.7). Total num frames: 773508096. Throughput: 0: 5832.0. Samples: 773513682. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:34,583][25689] Avg episode reward: [(0, '-1.082')] [2022-07-10 14:06:35,008][26022] Updated weights on worker 0-0, policy_version 755382 (0.00090) [2022-07-10 14:06:36,998][26022] Updated weights on worker 0-0, policy_version 755392 (0.00080) [2022-07-10 14:06:37,106][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:06:37,118][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000755393_773522432.pth [2022-07-10 14:06:37,119][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000753442_771524608.pth [2022-07-10 14:06:38,705][26022] Updated weights on worker 0-0, policy_version 755402 (0.00088) [2022-07-10 14:06:39,591][25689] Fps is (10 sec: 5518.0, 60 sec: 5483.7, 300 sec: 5543.1). Total num frames: 773534720. Throughput: 0: 4987.1. Samples: 773530542. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:39,592][25689] Avg episode reward: [(0, '-0.969')] [2022-07-10 14:06:40,761][26022] Updated weights on worker 0-0, policy_version 755412 (0.00087) [2022-07-10 14:06:42,428][26022] Updated weights on worker 0-0, policy_version 755422 (0.00092) [2022-07-10 14:06:44,314][26022] Updated weights on worker 0-0, policy_version 755432 (0.00086) [2022-07-10 14:06:44,713][25689] Fps is (10 sec: 5458.3, 60 sec: 5493.1, 300 sec: 5537.7). Total num frames: 773563392. Throughput: 0: 5790.0. Samples: 773563864. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:44,714][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 14:06:46,050][26022] Updated weights on worker 0-0, policy_version 755442 (0.00082) [2022-07-10 14:06:48,002][26022] Updated weights on worker 0-0, policy_version 755452 (0.00079) [2022-07-10 14:06:49,733][25689] Fps is (10 sec: 5654.0, 60 sec: 5525.4, 300 sec: 5537.4). Total num frames: 773592064. Throughput: 0: 5800.3. Samples: 773597328. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:49,735][25689] Avg episode reward: [(0, '-1.775')] [2022-07-10 14:06:49,787][26022] Updated weights on worker 0-0, policy_version 755462 (0.00093) [2022-07-10 14:06:51,695][26022] Updated weights on worker 0-0, policy_version 755472 (0.00089) [2022-07-10 14:06:53,367][26022] Updated weights on worker 0-0, policy_version 755482 (0.01133) [2022-07-10 14:06:54,768][25689] Fps is (10 sec: 5703.5, 60 sec: 5542.8, 300 sec: 5544.6). Total num frames: 773620736. Throughput: 0: 4983.4. Samples: 773614272. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:54,768][25689] Avg episode reward: [(0, '-2.714')] [2022-07-10 14:06:55,240][26022] Updated weights on worker 0-0, policy_version 755492 (0.00088) [2022-07-10 14:06:57,224][26022] Updated weights on worker 0-0, policy_version 755502 (0.00087) [2022-07-10 14:06:58,873][26022] Updated weights on worker 0-0, policy_version 755512 (0.00078) [2022-07-10 14:06:59,798][25689] Fps is (10 sec: 5494.2, 60 sec: 5507.0, 300 sec: 5538.3). Total num frames: 773647360. Throughput: 0: 5809.8. Samples: 773647938. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:06:59,799][25689] Avg episode reward: [(0, '-2.579')] [2022-07-10 14:07:00,828][26022] Updated weights on worker 0-0, policy_version 755522 (0.00082) [2022-07-10 14:07:02,995][26022] Updated weights on worker 0-0, policy_version 755532 (0.00092) [2022-07-10 14:07:04,848][25689] Fps is (10 sec: 5282.1, 60 sec: 5523.6, 300 sec: 5544.3). Total num frames: 773673984. Throughput: 0: 5748.2. Samples: 773679604. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:04,849][25689] Avg episode reward: [(0, '-2.999')] [2022-07-10 14:07:04,906][26022] Updated weights on worker 0-0, policy_version 755542 (0.00086) [2022-07-10 14:07:06,383][26022] Updated weights on worker 0-0, policy_version 755552 (0.00086) [2022-07-10 14:07:08,410][26022] Updated weights on worker 0-0, policy_version 755562 (0.00083) [2022-07-10 14:07:09,916][25689] Fps is (10 sec: 5667.7, 60 sec: 5586.7, 300 sec: 5550.0). Total num frames: 773704704. Throughput: 0: 4915.9. Samples: 773696542. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:09,916][25689] Avg episode reward: [(0, '-3.098')] [2022-07-10 14:07:10,125][26022] Updated weights on worker 0-0, policy_version 755572 (0.00089) [2022-07-10 14:07:12,431][26022] Updated weights on worker 0-0, policy_version 755582 (0.00091) [2022-07-10 14:07:13,853][26022] Updated weights on worker 0-0, policy_version 755592 (0.00089) [2022-07-10 14:07:14,944][25689] Fps is (10 sec: 5578.4, 60 sec: 5535.6, 300 sec: 5535.8). Total num frames: 773730304. Throughput: 0: 5725.8. Samples: 773729800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:14,945][25689] Avg episode reward: [(0, '-2.917')] [2022-07-10 14:07:15,855][26022] Updated weights on worker 0-0, policy_version 755602 (0.00094) [2022-07-10 14:07:17,721][26022] Updated weights on worker 0-0, policy_version 755612 (0.00091) [2022-07-10 14:07:19,447][26022] Updated weights on worker 0-0, policy_version 755622 (0.00085) [2022-07-10 14:07:19,992][25689] Fps is (10 sec: 5487.7, 60 sec: 5548.7, 300 sec: 5546.5). Total num frames: 773760000. Throughput: 0: 5717.6. Samples: 773763400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:19,992][25689] Avg episode reward: [(0, '-2.606')] [2022-07-10 14:07:21,327][26022] Updated weights on worker 0-0, policy_version 755632 (0.00084) [2022-07-10 14:07:23,001][26022] Updated weights on worker 0-0, policy_version 755642 (0.00090) [2022-07-10 14:07:25,026][26022] Updated weights on worker 0-0, policy_version 755652 (0.00088) [2022-07-10 14:07:25,043][25689] Fps is (10 sec: 5678.6, 60 sec: 5550.5, 300 sec: 5542.3). Total num frames: 773787648. Throughput: 0: 5798.5. Samples: 773796700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:25,043][25689] Avg episode reward: [(0, '-2.491')] [2022-07-10 14:07:26,901][26022] Updated weights on worker 0-0, policy_version 755662 (0.00093) [2022-07-10 14:07:28,618][26022] Updated weights on worker 0-0, policy_version 755672 (0.00089) [2022-07-10 14:07:30,068][25689] Fps is (10 sec: 5386.4, 60 sec: 5534.9, 300 sec: 5535.3). Total num frames: 773814272. Throughput: 0: 5792.4. Samples: 773813272. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:30,068][25689] Avg episode reward: [(0, '-2.646')] [2022-07-10 14:07:30,661][26022] Updated weights on worker 0-0, policy_version 755682 (0.00095) [2022-07-10 14:07:32,515][26022] Updated weights on worker 0-0, policy_version 755692 (0.00084) [2022-07-10 14:07:34,231][26022] Updated weights on worker 0-0, policy_version 755702 (0.00081) [2022-07-10 14:07:35,097][25689] Fps is (10 sec: 5601.8, 60 sec: 5550.3, 300 sec: 5545.6). Total num frames: 773843968. Throughput: 0: 5796.7. Samples: 773846616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:35,097][25689] Avg episode reward: [(0, '-2.065')] [2022-07-10 14:07:36,144][26022] Updated weights on worker 0-0, policy_version 755712 (0.00088) [2022-07-10 14:07:37,813][26022] Updated weights on worker 0-0, policy_version 755722 (0.00089) [2022-07-10 14:07:39,974][26022] Updated weights on worker 0-0, policy_version 755732 (0.00086) [2022-07-10 14:07:40,108][25689] Fps is (10 sec: 5609.2, 60 sec: 5550.0, 300 sec: 5533.0). Total num frames: 773870592. Throughput: 0: 5820.1. Samples: 773880480. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:40,109][25689] Avg episode reward: [(0, '-3.595')] [2022-07-10 14:07:41,496][26022] Updated weights on worker 0-0, policy_version 755742 (0.00096) [2022-07-10 14:07:43,581][26022] Updated weights on worker 0-0, policy_version 755752 (0.00091) [2022-07-10 14:07:45,246][25689] Fps is (10 sec: 5448.2, 60 sec: 5548.6, 300 sec: 5538.5). Total num frames: 773899264. Throughput: 0: 4967.8. Samples: 773897066. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:45,247][25689] Avg episode reward: [(0, '-2.960')] [2022-07-10 14:07:45,251][26022] Updated weights on worker 0-0, policy_version 755762 (0.00090) [2022-07-10 14:07:47,119][26022] Updated weights on worker 0-0, policy_version 755772 (0.00091) [2022-07-10 14:07:48,970][26022] Updated weights on worker 0-0, policy_version 755782 (0.00094) [2022-07-10 14:07:50,309][25689] Fps is (10 sec: 5721.7, 60 sec: 5561.5, 300 sec: 5542.1). Total num frames: 773928960. Throughput: 0: 5796.4. Samples: 773930600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:50,310][25689] Avg episode reward: [(0, '-3.068')] [2022-07-10 14:07:50,818][26022] Updated weights on worker 0-0, policy_version 755792 (0.00083) [2022-07-10 14:07:52,502][26022] Updated weights on worker 0-0, policy_version 755802 (0.00092) [2022-07-10 14:07:54,572][26022] Updated weights on worker 0-0, policy_version 755812 (0.00096) [2022-07-10 14:07:55,366][25689] Fps is (10 sec: 5565.4, 60 sec: 5525.7, 300 sec: 5537.7). Total num frames: 773955584. Throughput: 0: 5798.0. Samples: 773964136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:07:55,367][25689] Avg episode reward: [(0, '-2.261')] [2022-07-10 14:07:56,259][26022] Updated weights on worker 0-0, policy_version 755822 (0.00085) [2022-07-10 14:07:58,229][26022] Updated weights on worker 0-0, policy_version 755832 (0.00096) [2022-07-10 14:07:59,993][26022] Updated weights on worker 0-0, policy_version 755842 (0.00093) [2022-07-10 14:08:00,443][25689] Fps is (10 sec: 5355.8, 60 sec: 5538.3, 300 sec: 5544.1). Total num frames: 773983232. Throughput: 0: 4930.9. Samples: 773980748. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:00,444][25689] Avg episode reward: [(0, '-1.560')] [2022-07-10 14:08:01,859][26022] Updated weights on worker 0-0, policy_version 755852 (0.00090) [2022-07-10 14:08:04,099][26022] Updated weights on worker 0-0, policy_version 755862 (0.00093) [2022-07-10 14:08:05,501][25689] Fps is (10 sec: 5456.0, 60 sec: 5554.5, 300 sec: 5540.5). Total num frames: 774010880. Throughput: 0: 5683.0. Samples: 774012174. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:05,501][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 14:08:05,952][26022] Updated weights on worker 0-0, policy_version 755872 (0.00086) [2022-07-10 14:08:07,775][26022] Updated weights on worker 0-0, policy_version 755882 (0.00087) [2022-07-10 14:08:09,549][26022] Updated weights on worker 0-0, policy_version 755892 (0.00090) [2022-07-10 14:08:10,514][25689] Fps is (10 sec: 5388.6, 60 sec: 5491.9, 300 sec: 5534.6). Total num frames: 774037504. Throughput: 0: 5693.1. Samples: 774045628. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:10,516][25689] Avg episode reward: [(0, '-0.634')] [2022-07-10 14:08:11,355][26022] Updated weights on worker 0-0, policy_version 755902 (0.00086) [2022-07-10 14:08:13,052][26022] Updated weights on worker 0-0, policy_version 755912 (0.00097) [2022-07-10 14:08:15,114][26022] Updated weights on worker 0-0, policy_version 755922 (0.00093) [2022-07-10 14:08:15,520][25689] Fps is (10 sec: 5518.9, 60 sec: 5544.7, 300 sec: 5541.8). Total num frames: 774066176. Throughput: 0: 4892.0. Samples: 774062732. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:15,521][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 14:08:16,867][26022] Updated weights on worker 0-0, policy_version 755932 (0.00084) [2022-07-10 14:08:18,823][26022] Updated weights on worker 0-0, policy_version 755942 (0.00086) [2022-07-10 14:08:20,538][25689] Fps is (10 sec: 5721.0, 60 sec: 5530.5, 300 sec: 5540.0). Total num frames: 774094848. Throughput: 0: 5741.1. Samples: 774096116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:20,538][25689] Avg episode reward: [(0, '-0.963')] [2022-07-10 14:08:20,539][26022] Updated weights on worker 0-0, policy_version 755952 (0.00087) [2022-07-10 14:08:22,286][26022] Updated weights on worker 0-0, policy_version 755962 (0.00086) [2022-07-10 14:08:24,140][26022] Updated weights on worker 0-0, policy_version 755972 (0.00088) [2022-07-10 14:08:25,590][25689] Fps is (10 sec: 5694.7, 60 sec: 5547.3, 300 sec: 5546.2). Total num frames: 774123520. Throughput: 0: 5856.4. Samples: 774129824. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:25,590][25689] Avg episode reward: [(0, '-1.420')] [2022-07-10 14:08:26,299][26022] Updated weights on worker 0-0, policy_version 755982 (0.00088) [2022-07-10 14:08:27,766][26022] Updated weights on worker 0-0, policy_version 755992 (0.00093) [2022-07-10 14:08:29,916][26022] Updated weights on worker 0-0, policy_version 756002 (0.00092) [2022-07-10 14:08:30,654][25689] Fps is (10 sec: 5567.3, 60 sec: 5560.6, 300 sec: 5541.6). Total num frames: 774151168. Throughput: 0: 5007.9. Samples: 774146486. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:30,655][25689] Avg episode reward: [(0, '-1.402')] [2022-07-10 14:08:31,596][26022] Updated weights on worker 0-0, policy_version 756012 (0.00105) [2022-07-10 14:08:33,519][26022] Updated weights on worker 0-0, policy_version 756022 (0.00083) [2022-07-10 14:08:35,343][26022] Updated weights on worker 0-0, policy_version 756032 (0.00088) [2022-07-10 14:08:35,706][25689] Fps is (10 sec: 5567.3, 60 sec: 5541.6, 300 sec: 5540.9). Total num frames: 774179840. Throughput: 0: 5809.4. Samples: 774180000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:35,707][25689] Avg episode reward: [(0, '-2.220')] [2022-07-10 14:08:37,174][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:08:37,185][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000756042_774187008.pth [2022-07-10 14:08:37,186][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000754094_772192256.pth [2022-07-10 14:08:37,197][26022] Updated weights on worker 0-0, policy_version 756042 (0.00095) [2022-07-10 14:08:38,803][26022] Updated weights on worker 0-0, policy_version 756052 (0.00082) [2022-07-10 14:08:40,795][25689] Fps is (10 sec: 5452.6, 60 sec: 5534.5, 300 sec: 5538.3). Total num frames: 774206464. Throughput: 0: 5813.6. Samples: 774213884. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:40,796][25689] Avg episode reward: [(0, '-2.068')] [2022-07-10 14:08:40,801][26022] Updated weights on worker 0-0, policy_version 756062 (0.00088) [2022-07-10 14:08:42,331][26022] Updated weights on worker 0-0, policy_version 756072 (0.00380) [2022-07-10 14:08:44,591][26022] Updated weights on worker 0-0, policy_version 756082 (0.00094) [2022-07-10 14:08:45,907][25689] Fps is (10 sec: 5621.6, 60 sec: 5570.7, 300 sec: 5543.7). Total num frames: 774237184. Throughput: 0: 4964.4. Samples: 774230684. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:45,907][25689] Avg episode reward: [(0, '-2.210')] [2022-07-10 14:08:46,403][26022] Updated weights on worker 0-0, policy_version 756092 (0.00080) [2022-07-10 14:08:47,947][26022] Updated weights on worker 0-0, policy_version 756102 (0.00089) [2022-07-10 14:08:50,011][26022] Updated weights on worker 0-0, policy_version 756112 (0.00095) [2022-07-10 14:08:50,932][25689] Fps is (10 sec: 5657.1, 60 sec: 5523.5, 300 sec: 5540.4). Total num frames: 774263808. Throughput: 0: 5802.0. Samples: 774264138. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:50,933][25689] Avg episode reward: [(0, '-2.746')] [2022-07-10 14:08:51,648][26022] Updated weights on worker 0-0, policy_version 756122 (0.00092) [2022-07-10 14:08:53,818][26022] Updated weights on worker 0-0, policy_version 756132 (0.00087) [2022-07-10 14:08:55,508][26022] Updated weights on worker 0-0, policy_version 756142 (0.00092) [2022-07-10 14:08:55,977][25689] Fps is (10 sec: 5287.8, 60 sec: 5524.5, 300 sec: 5536.6). Total num frames: 774290432. Throughput: 0: 5772.5. Samples: 774297012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:08:55,977][25689] Avg episode reward: [(0, '-2.195')] [2022-07-10 14:08:57,403][26022] Updated weights on worker 0-0, policy_version 756152 (0.00085) [2022-07-10 14:08:59,240][26022] Updated weights on worker 0-0, policy_version 756162 (0.00085) [2022-07-10 14:09:00,974][26022] Updated weights on worker 0-0, policy_version 756172 (0.00092) [2022-07-10 14:09:01,037][25689] Fps is (10 sec: 5573.4, 60 sec: 5559.9, 300 sec: 5543.4). Total num frames: 774320128. Throughput: 0: 4937.5. Samples: 774313828. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:01,038][25689] Avg episode reward: [(0, '-2.126')] [2022-07-10 14:09:03,204][26022] Updated weights on worker 0-0, policy_version 756182 (0.00086) [2022-07-10 14:09:05,060][26022] Updated weights on worker 0-0, policy_version 756192 (0.00098) [2022-07-10 14:09:06,162][25689] Fps is (10 sec: 5529.6, 60 sec: 5536.8, 300 sec: 5541.4). Total num frames: 774346752. Throughput: 0: 5664.0. Samples: 774345410. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:06,164][25689] Avg episode reward: [(0, '-0.722')] [2022-07-10 14:09:06,964][26022] Updated weights on worker 0-0, policy_version 756202 (0.00090) [2022-07-10 14:09:08,790][26022] Updated weights on worker 0-0, policy_version 756212 (0.00095) [2022-07-10 14:09:10,611][26022] Updated weights on worker 0-0, policy_version 756222 (0.00088) [2022-07-10 14:09:11,183][25689] Fps is (10 sec: 5349.0, 60 sec: 5553.0, 300 sec: 5541.6). Total num frames: 774374400. Throughput: 0: 5665.2. Samples: 774378866. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:11,185][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 14:09:12,399][26022] Updated weights on worker 0-0, policy_version 756232 (0.00087) [2022-07-10 14:09:14,283][26022] Updated weights on worker 0-0, policy_version 756242 (0.00093) [2022-07-10 14:09:16,162][26022] Updated weights on worker 0-0, policy_version 756252 (0.00085) [2022-07-10 14:09:16,255][25689] Fps is (10 sec: 5478.7, 60 sec: 5530.1, 300 sec: 5533.9). Total num frames: 774402048. Throughput: 0: 4873.4. Samples: 774395836. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:16,255][25689] Avg episode reward: [(0, '-0.818')] [2022-07-10 14:09:17,825][26022] Updated weights on worker 0-0, policy_version 756262 (0.00082) [2022-07-10 14:09:19,631][26022] Updated weights on worker 0-0, policy_version 756272 (0.00092) [2022-07-10 14:09:21,343][25689] Fps is (10 sec: 5543.5, 60 sec: 5523.8, 300 sec: 5533.3). Total num frames: 774430720. Throughput: 0: 5696.8. Samples: 774429506. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:21,343][25689] Avg episode reward: [(0, '0.601')] [2022-07-10 14:09:21,578][26022] Updated weights on worker 0-0, policy_version 756282 (0.00093) [2022-07-10 14:09:23,324][26022] Updated weights on worker 0-0, policy_version 756292 (0.00085) [2022-07-10 14:09:25,263][26022] Updated weights on worker 0-0, policy_version 756302 (0.00081) [2022-07-10 14:09:26,398][25689] Fps is (10 sec: 5754.4, 60 sec: 5540.3, 300 sec: 5536.0). Total num frames: 774460416. Throughput: 0: 5803.0. Samples: 774462840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:26,399][25689] Avg episode reward: [(0, '-0.170')] [2022-07-10 14:09:26,935][26022] Updated weights on worker 0-0, policy_version 756312 (0.00086) [2022-07-10 14:09:28,963][26022] Updated weights on worker 0-0, policy_version 756322 (0.00089) [2022-07-10 14:09:30,847][26022] Updated weights on worker 0-0, policy_version 756332 (0.00089) [2022-07-10 14:09:31,411][25689] Fps is (10 sec: 5594.0, 60 sec: 5528.1, 300 sec: 5539.5). Total num frames: 774487040. Throughput: 0: 5798.9. Samples: 774496164. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:31,411][25689] Avg episode reward: [(0, '-0.221')] [2022-07-10 14:09:32,575][26022] Updated weights on worker 0-0, policy_version 756342 (0.00091) [2022-07-10 14:09:34,482][26022] Updated weights on worker 0-0, policy_version 756352 (0.00085) [2022-07-10 14:09:36,267][26022] Updated weights on worker 0-0, policy_version 756362 (0.00096) [2022-07-10 14:09:36,413][25689] Fps is (10 sec: 5419.1, 60 sec: 5515.8, 300 sec: 5536.3). Total num frames: 774514688. Throughput: 0: 5808.8. Samples: 774512930. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:36,415][25689] Avg episode reward: [(0, '-1.989')] [2022-07-10 14:09:37,966][26022] Updated weights on worker 0-0, policy_version 756372 (0.00087) [2022-07-10 14:09:40,058][26022] Updated weights on worker 0-0, policy_version 756382 (0.00089) [2022-07-10 14:09:41,435][25689] Fps is (10 sec: 5618.5, 60 sec: 5555.7, 300 sec: 5533.5). Total num frames: 774543360. Throughput: 0: 5824.4. Samples: 774546530. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:41,435][25689] Avg episode reward: [(0, '-1.715')] [2022-07-10 14:09:41,787][26022] Updated weights on worker 0-0, policy_version 756392 (0.00091) [2022-07-10 14:09:43,830][26022] Updated weights on worker 0-0, policy_version 756402 (0.00093) [2022-07-10 14:09:45,595][26022] Updated weights on worker 0-0, policy_version 756412 (0.00092) [2022-07-10 14:09:46,497][25689] Fps is (10 sec: 5585.0, 60 sec: 5509.5, 300 sec: 5539.3). Total num frames: 774571008. Throughput: 0: 5815.2. Samples: 774579720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:46,498][25689] Avg episode reward: [(0, '-1.032')] [2022-07-10 14:09:47,451][26022] Updated weights on worker 0-0, policy_version 756422 (0.00073) [2022-07-10 14:09:49,207][26022] Updated weights on worker 0-0, policy_version 756432 (0.00092) [2022-07-10 14:09:51,040][26022] Updated weights on worker 0-0, policy_version 756442 (0.00089) [2022-07-10 14:09:51,506][25689] Fps is (10 sec: 5592.1, 60 sec: 5544.8, 300 sec: 5539.2). Total num frames: 774599680. Throughput: 0: 5002.3. Samples: 774596688. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:51,507][25689] Avg episode reward: [(0, '-2.083')] [2022-07-10 14:09:52,770][26022] Updated weights on worker 0-0, policy_version 756452 (0.00089) [2022-07-10 14:09:54,718][26022] Updated weights on worker 0-0, policy_version 756462 (0.00085) [2022-07-10 14:09:56,431][26022] Updated weights on worker 0-0, policy_version 756472 (0.00090) [2022-07-10 14:09:56,564][25689] Fps is (10 sec: 5594.8, 60 sec: 5560.5, 300 sec: 5532.1). Total num frames: 774627328. Throughput: 0: 5818.4. Samples: 774630174. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:09:56,568][25689] Avg episode reward: [(0, '-2.658')] [2022-07-10 14:09:58,413][26022] Updated weights on worker 0-0, policy_version 756482 (0.00088) [2022-07-10 14:10:00,196][26022] Updated weights on worker 0-0, policy_version 756492 (0.00091) [2022-07-10 14:10:01,579][25689] Fps is (10 sec: 5489.7, 60 sec: 5530.9, 300 sec: 5546.4). Total num frames: 774654976. Throughput: 0: 5820.3. Samples: 774663774. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:01,579][25689] Avg episode reward: [(0, '-2.762')] [2022-07-10 14:10:02,338][26022] Updated weights on worker 0-0, policy_version 756502 (0.00085) [2022-07-10 14:10:04,224][26022] Updated weights on worker 0-0, policy_version 756512 (0.00093) [2022-07-10 14:10:06,015][26022] Updated weights on worker 0-0, policy_version 756522 (0.00088) [2022-07-10 14:10:06,728][25689] Fps is (10 sec: 5238.8, 60 sec: 5511.8, 300 sec: 5530.0). Total num frames: 774680576. Throughput: 0: 4872.1. Samples: 774678284. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:06,728][25689] Avg episode reward: [(0, '-1.971')] [2022-07-10 14:10:07,927][26022] Updated weights on worker 0-0, policy_version 756532 (0.00091) [2022-07-10 14:10:09,845][26022] Updated weights on worker 0-0, policy_version 756542 (0.00088) [2022-07-10 14:10:11,432][26022] Updated weights on worker 0-0, policy_version 756552 (0.00084) [2022-07-10 14:10:11,791][25689] Fps is (10 sec: 5514.8, 60 sec: 5558.6, 300 sec: 5536.0). Total num frames: 774711296. Throughput: 0: 5666.8. Samples: 774711640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:11,793][25689] Avg episode reward: [(0, '-2.037')] [2022-07-10 14:10:13,564][26022] Updated weights on worker 0-0, policy_version 756562 (0.00089) [2022-07-10 14:10:15,095][26022] Updated weights on worker 0-0, policy_version 756572 (0.00091) [2022-07-10 14:10:16,820][25689] Fps is (10 sec: 5682.1, 60 sec: 5545.7, 300 sec: 5535.8). Total num frames: 774737920. Throughput: 0: 5664.0. Samples: 774744904. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:16,820][25689] Avg episode reward: [(0, '-3.828')] [2022-07-10 14:10:17,320][26022] Updated weights on worker 0-0, policy_version 756582 (0.00087) [2022-07-10 14:10:18,886][26022] Updated weights on worker 0-0, policy_version 756592 (0.00087) [2022-07-10 14:10:20,834][26022] Updated weights on worker 0-0, policy_version 756602 (0.00088) [2022-07-10 14:10:21,840][25689] Fps is (10 sec: 5299.0, 60 sec: 5518.0, 300 sec: 5525.9). Total num frames: 774764544. Throughput: 0: 4822.8. Samples: 774761490. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:21,842][25689] Avg episode reward: [(0, '-2.857')] [2022-07-10 14:10:22,533][26022] Updated weights on worker 0-0, policy_version 756612 (0.00089) [2022-07-10 14:10:24,483][26022] Updated weights on worker 0-0, policy_version 756622 (0.00080) [2022-07-10 14:10:25,967][26022] Updated weights on worker 0-0, policy_version 756632 (0.00086) [2022-07-10 14:10:26,914][25689] Fps is (10 sec: 5579.5, 60 sec: 5516.3, 300 sec: 5528.2). Total num frames: 774794240. Throughput: 0: 5794.2. Samples: 774795246. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:26,914][25689] Avg episode reward: [(0, '-2.556')] [2022-07-10 14:10:28,325][26022] Updated weights on worker 0-0, policy_version 756642 (0.00090) [2022-07-10 14:10:29,855][26022] Updated weights on worker 0-0, policy_version 756652 (0.00096) [2022-07-10 14:10:31,819][26022] Updated weights on worker 0-0, policy_version 756662 (0.00091) [2022-07-10 14:10:31,971][25689] Fps is (10 sec: 5761.5, 60 sec: 5546.1, 300 sec: 5537.8). Total num frames: 774822912. Throughput: 0: 5802.8. Samples: 774828736. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:31,972][25689] Avg episode reward: [(0, '-1.678')] [2022-07-10 14:10:33,675][26022] Updated weights on worker 0-0, policy_version 756672 (0.00097) [2022-07-10 14:10:35,401][26022] Updated weights on worker 0-0, policy_version 756682 (0.00085) [2022-07-10 14:10:36,992][25689] Fps is (10 sec: 5689.7, 60 sec: 5561.2, 300 sec: 5534.2). Total num frames: 774851584. Throughput: 0: 4995.4. Samples: 774845674. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:36,995][25689] Avg episode reward: [(0, '-1.812')] [2022-07-10 14:10:37,273][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:10:37,287][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000756692_774852608.pth [2022-07-10 14:10:37,289][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000754744_772857856.pth [2022-07-10 14:10:37,294][26022] Updated weights on worker 0-0, policy_version 756692 (0.00087) [2022-07-10 14:10:39,099][26022] Updated weights on worker 0-0, policy_version 756702 (0.00088) [2022-07-10 14:10:40,889][26022] Updated weights on worker 0-0, policy_version 756712 (0.00090) [2022-07-10 14:10:42,016][25689] Fps is (10 sec: 5606.7, 60 sec: 5544.2, 300 sec: 5534.5). Total num frames: 774879232. Throughput: 0: 5850.6. Samples: 774879530. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:42,018][25689] Avg episode reward: [(0, '-1.511')] [2022-07-10 14:10:42,825][26022] Updated weights on worker 0-0, policy_version 756722 (0.00085) [2022-07-10 14:10:44,460][26022] Updated weights on worker 0-0, policy_version 756732 (0.00095) [2022-07-10 14:10:46,507][26022] Updated weights on worker 0-0, policy_version 756742 (0.00093) [2022-07-10 14:10:47,064][25689] Fps is (10 sec: 5490.3, 60 sec: 5545.5, 300 sec: 5537.0). Total num frames: 774906880. Throughput: 0: 5852.9. Samples: 774913182. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:47,066][25689] Avg episode reward: [(0, '-0.035')] [2022-07-10 14:10:48,286][26022] Updated weights on worker 0-0, policy_version 756752 (0.00106) [2022-07-10 14:10:49,974][26022] Updated weights on worker 0-0, policy_version 756762 (0.00101) [2022-07-10 14:10:51,791][26022] Updated weights on worker 0-0, policy_version 756772 (0.00094) [2022-07-10 14:10:52,105][25689] Fps is (10 sec: 5683.9, 60 sec: 5559.5, 300 sec: 5543.9). Total num frames: 774936576. Throughput: 0: 5023.6. Samples: 774929882. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:52,105][25689] Avg episode reward: [(0, '-1.672')] [2022-07-10 14:10:53,815][26022] Updated weights on worker 0-0, policy_version 756782 (0.00088) [2022-07-10 14:10:55,404][26022] Updated weights on worker 0-0, policy_version 756792 (0.00088) [2022-07-10 14:10:57,125][25689] Fps is (10 sec: 5597.9, 60 sec: 5546.0, 300 sec: 5536.8). Total num frames: 774963200. Throughput: 0: 5843.7. Samples: 774963322. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:10:57,125][25689] Avg episode reward: [(0, '-2.146')] [2022-07-10 14:10:57,643][26022] Updated weights on worker 0-0, policy_version 756802 (0.00094) [2022-07-10 14:10:59,126][26022] Updated weights on worker 0-0, policy_version 756812 (0.00087) [2022-07-10 14:11:01,174][26022] Updated weights on worker 0-0, policy_version 756822 (0.00089) [2022-07-10 14:11:02,158][25689] Fps is (10 sec: 5093.0, 60 sec: 5493.6, 300 sec: 5533.6). Total num frames: 774987776. Throughput: 0: 5803.0. Samples: 774996414. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:11:02,158][25689] Avg episode reward: [(0, '-2.140')] [2022-07-10 14:11:03,039][26022] Updated weights on worker 0-0, policy_version 756832 (0.00084) [2022-07-10 14:11:05,142][26022] Updated weights on worker 0-0, policy_version 756842 (0.00085) [2022-07-10 14:11:06,757][26022] Updated weights on worker 0-0, policy_version 756852 (0.00086) [2022-07-10 14:11:07,247][25689] Fps is (10 sec: 5463.0, 60 sec: 5583.7, 300 sec: 5546.1). Total num frames: 775018496. Throughput: 0: 4881.5. Samples: 775011704. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:11:07,247][25689] Avg episode reward: [(0, '-1.881')] [2022-07-10 14:11:08,781][26022] Updated weights on worker 0-0, policy_version 756862 (0.00094) [2022-07-10 14:11:10,631][26022] Updated weights on worker 0-0, policy_version 756872 (0.00092) [2022-07-10 14:11:12,255][25689] Fps is (10 sec: 5780.8, 60 sec: 5538.0, 300 sec: 5542.9). Total num frames: 775046144. Throughput: 0: 5748.1. Samples: 775045706. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:11:12,255][25689] Avg episode reward: [(0, '-1.913')] [2022-07-10 14:11:12,386][26022] Updated weights on worker 0-0, policy_version 756882 (0.00110) [2022-07-10 14:11:14,159][26022] Updated weights on worker 0-0, policy_version 756892 (0.00095) [2022-07-10 14:11:15,970][26022] Updated weights on worker 0-0, policy_version 756902 (0.00094) [2022-07-10 14:11:17,261][25689] Fps is (10 sec: 5521.9, 60 sec: 5557.0, 300 sec: 5539.5). Total num frames: 775073792. Throughput: 0: 5761.5. Samples: 775079334. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:11:17,261][25689] Avg episode reward: [(0, '-1.552')] [2022-07-10 14:11:17,858][26022] Updated weights on worker 0-0, policy_version 756912 (0.00091) [2022-07-10 14:11:19,742][26022] Updated weights on worker 0-0, policy_version 756922 (0.00086) [2022-07-10 14:11:21,471][26022] Updated weights on worker 0-0, policy_version 756932 (0.00086) [2022-07-10 14:11:22,266][25689] Fps is (10 sec: 5625.4, 60 sec: 5592.3, 300 sec: 5544.1). Total num frames: 775102464. Throughput: 0: 4942.1. Samples: 775095792. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-10 14:11:22,267][25689] Avg episode reward: [(0, '-0.837')] [2022-07-10 14:11:23,516][26022] Updated weights on worker 0-0, policy_version 756942 (0.00088) [2022-07-10 14:11:25,138][26022] Updated weights on worker 0-0, policy_version 756952 (0.00091) [2022-07-10 14:11:27,116][26022] Updated weights on worker 0-0, policy_version 756962 (0.00087) [2022-07-10 14:11:27,367][25689] Fps is (10 sec: 5572.9, 60 sec: 5555.9, 300 sec: 5543.0). Total num frames: 775130112. Throughput: 0: 5837.7. Samples: 775129158. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:27,367][25689] Avg episode reward: [(0, '-0.631')] [2022-07-10 14:11:28,863][26022] Updated weights on worker 0-0, policy_version 756972 (0.00089) [2022-07-10 14:11:30,763][26022] Updated weights on worker 0-0, policy_version 756982 (0.00084) [2022-07-10 14:11:32,395][25689] Fps is (10 sec: 5459.6, 60 sec: 5541.6, 300 sec: 5539.3). Total num frames: 775157760. Throughput: 0: 5807.3. Samples: 775162666. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:32,395][25689] Avg episode reward: [(0, '-1.053')] [2022-07-10 14:11:32,638][26022] Updated weights on worker 0-0, policy_version 756992 (0.00099) [2022-07-10 14:11:34,583][26022] Updated weights on worker 0-0, policy_version 757002 (0.00091) [2022-07-10 14:11:36,178][26022] Updated weights on worker 0-0, policy_version 757012 (0.00087) [2022-07-10 14:11:37,399][25689] Fps is (10 sec: 5511.8, 60 sec: 5526.3, 300 sec: 5542.8). Total num frames: 775185408. Throughput: 0: 4962.7. Samples: 775179274. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:37,403][25689] Avg episode reward: [(0, '-0.970')] [2022-07-10 14:11:38,405][26022] Updated weights on worker 0-0, policy_version 757022 (0.00091) [2022-07-10 14:11:39,959][26022] Updated weights on worker 0-0, policy_version 757032 (0.00087) [2022-07-10 14:11:41,940][26022] Updated weights on worker 0-0, policy_version 757042 (0.00086) [2022-07-10 14:11:42,442][25689] Fps is (10 sec: 5503.7, 60 sec: 5524.5, 300 sec: 5540.8). Total num frames: 775213056. Throughput: 0: 5791.5. Samples: 775212638. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:42,442][25689] Avg episode reward: [(0, '-1.500')] [2022-07-10 14:11:43,509][26022] Updated weights on worker 0-0, policy_version 757052 (0.00091) [2022-07-10 14:11:45,415][26022] Updated weights on worker 0-0, policy_version 757062 (0.00084) [2022-07-10 14:11:47,202][26022] Updated weights on worker 0-0, policy_version 757072 (0.00095) [2022-07-10 14:11:47,570][25689] Fps is (10 sec: 5638.2, 60 sec: 5551.1, 300 sec: 5542.3). Total num frames: 775242752. Throughput: 0: 5807.1. Samples: 775246478. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:47,570][25689] Avg episode reward: [(0, '-1.508')] [2022-07-10 14:11:49,168][26022] Updated weights on worker 0-0, policy_version 757082 (0.00092) [2022-07-10 14:11:50,888][26022] Updated weights on worker 0-0, policy_version 757092 (0.00087) [2022-07-10 14:11:52,596][25689] Fps is (10 sec: 5647.3, 60 sec: 5518.5, 300 sec: 5539.0). Total num frames: 775270400. Throughput: 0: 5812.9. Samples: 775280094. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:52,597][25689] Avg episode reward: [(0, '-1.143')] [2022-07-10 14:11:52,984][26022] Updated weights on worker 0-0, policy_version 757102 (0.00090) [2022-07-10 14:11:54,611][26022] Updated weights on worker 0-0, policy_version 757112 (0.00083) [2022-07-10 14:11:56,594][26022] Updated weights on worker 0-0, policy_version 757122 (0.00087) [2022-07-10 14:11:57,638][25689] Fps is (10 sec: 5695.6, 60 sec: 5567.3, 300 sec: 5549.1). Total num frames: 775300096. Throughput: 0: 5817.5. Samples: 775297014. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:11:57,638][25689] Avg episode reward: [(0, '-0.976')] [2022-07-10 14:11:58,203][26022] Updated weights on worker 0-0, policy_version 757132 (0.00091) [2022-07-10 14:12:00,091][26022] Updated weights on worker 0-0, policy_version 757142 (0.00563) [2022-07-10 14:12:02,301][26022] Updated weights on worker 0-0, policy_version 757152 (0.00080) [2022-07-10 14:12:02,663][25689] Fps is (10 sec: 5391.2, 60 sec: 5568.1, 300 sec: 5542.7). Total num frames: 775324672. Throughput: 0: 5764.2. Samples: 775329196. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:02,663][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 14:12:04,255][26022] Updated weights on worker 0-0, policy_version 757162 (0.00084) [2022-07-10 14:12:06,087][26022] Updated weights on worker 0-0, policy_version 757172 (0.00097) [2022-07-10 14:12:07,736][25689] Fps is (10 sec: 5374.2, 60 sec: 5552.6, 300 sec: 5539.1). Total num frames: 775354368. Throughput: 0: 5731.6. Samples: 775362064. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:07,737][25689] Avg episode reward: [(0, '-2.969')] [2022-07-10 14:12:07,739][26022] Updated weights on worker 0-0, policy_version 757182 (0.00084) [2022-07-10 14:12:09,699][26022] Updated weights on worker 0-0, policy_version 757192 (0.00084) [2022-07-10 14:12:11,387][26022] Updated weights on worker 0-0, policy_version 757202 (0.00091) [2022-07-10 14:12:12,774][25689] Fps is (10 sec: 5570.2, 60 sec: 5532.9, 300 sec: 5542.4). Total num frames: 775380992. Throughput: 0: 4883.4. Samples: 775378630. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:12,774][25689] Avg episode reward: [(0, '-2.680')] [2022-07-10 14:12:13,397][26022] Updated weights on worker 0-0, policy_version 757212 (0.00086) [2022-07-10 14:12:15,218][26022] Updated weights on worker 0-0, policy_version 757222 (0.00094) [2022-07-10 14:12:16,819][26022] Updated weights on worker 0-0, policy_version 757232 (0.00088) [2022-07-10 14:12:17,791][25689] Fps is (10 sec: 5397.9, 60 sec: 5531.9, 300 sec: 5536.1). Total num frames: 775408640. Throughput: 0: 5711.2. Samples: 775412112. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:17,801][25689] Avg episode reward: [(0, '-3.694')] [2022-07-10 14:12:18,934][26022] Updated weights on worker 0-0, policy_version 757242 (0.00080) [2022-07-10 14:12:20,866][26022] Updated weights on worker 0-0, policy_version 757252 (0.00091) [2022-07-10 14:12:22,646][26022] Updated weights on worker 0-0, policy_version 757262 (0.00100) [2022-07-10 14:12:22,804][25689] Fps is (10 sec: 5615.3, 60 sec: 5531.3, 300 sec: 5540.2). Total num frames: 775437312. Throughput: 0: 5770.8. Samples: 775445424. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:22,804][25689] Avg episode reward: [(0, '-3.838')] [2022-07-10 14:12:24,591][26022] Updated weights on worker 0-0, policy_version 757272 (0.00082) [2022-07-10 14:12:26,204][26022] Updated weights on worker 0-0, policy_version 757282 (0.00089) [2022-07-10 14:12:27,847][25689] Fps is (10 sec: 5600.5, 60 sec: 5536.5, 300 sec: 5543.3). Total num frames: 775464960. Throughput: 0: 4980.0. Samples: 775462214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:27,847][25689] Avg episode reward: [(0, '-3.699')] [2022-07-10 14:12:28,159][26022] Updated weights on worker 0-0, policy_version 757292 (0.00085) [2022-07-10 14:12:29,927][26022] Updated weights on worker 0-0, policy_version 757302 (0.00088) [2022-07-10 14:12:31,719][26022] Updated weights on worker 0-0, policy_version 757312 (0.00084) [2022-07-10 14:12:32,870][25689] Fps is (10 sec: 5493.1, 60 sec: 5536.9, 300 sec: 5536.6). Total num frames: 775492608. Throughput: 0: 5830.3. Samples: 775495798. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:32,871][25689] Avg episode reward: [(0, '-3.603')] [2022-07-10 14:12:33,590][26022] Updated weights on worker 0-0, policy_version 757322 (0.00085) [2022-07-10 14:12:35,470][26022] Updated weights on worker 0-0, policy_version 757332 (0.00093) [2022-07-10 14:12:37,306][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:12:37,320][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000757342_775518208.pth [2022-07-10 14:12:37,320][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000755393_773522432.pth [2022-07-10 14:12:37,323][26022] Updated weights on worker 0-0, policy_version 757342 (0.00087) [2022-07-10 14:12:37,872][25689] Fps is (10 sec: 5617.7, 60 sec: 5554.1, 300 sec: 5543.6). Total num frames: 775521280. Throughput: 0: 5847.5. Samples: 775529540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:37,873][25689] Avg episode reward: [(0, '-3.196')] [2022-07-10 14:12:38,969][26022] Updated weights on worker 0-0, policy_version 757352 (0.00084) [2022-07-10 14:12:40,894][26022] Updated weights on worker 0-0, policy_version 757362 (0.00089) [2022-07-10 14:12:42,878][25689] Fps is (10 sec: 5524.9, 60 sec: 5540.5, 300 sec: 5539.2). Total num frames: 775547904. Throughput: 0: 5029.4. Samples: 775546388. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:42,879][25689] Avg episode reward: [(0, '-3.154')] [2022-07-10 14:12:42,919][26022] Updated weights on worker 0-0, policy_version 757372 (0.00081) [2022-07-10 14:12:44,448][26022] Updated weights on worker 0-0, policy_version 757382 (0.00089) [2022-07-10 14:12:46,401][26022] Updated weights on worker 0-0, policy_version 757392 (0.00091) [2022-07-10 14:12:47,919][25689] Fps is (10 sec: 5707.4, 60 sec: 5565.4, 300 sec: 5543.0). Total num frames: 775578624. Throughput: 0: 5870.8. Samples: 775580056. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:47,920][25689] Avg episode reward: [(0, '-2.567')] [2022-07-10 14:12:48,248][26022] Updated weights on worker 0-0, policy_version 757402 (0.00093) [2022-07-10 14:12:49,993][26022] Updated weights on worker 0-0, policy_version 757412 (0.00083) [2022-07-10 14:12:51,985][26022] Updated weights on worker 0-0, policy_version 757422 (0.00083) [2022-07-10 14:12:52,942][25689] Fps is (10 sec: 5698.4, 60 sec: 5548.8, 300 sec: 5543.7). Total num frames: 775605248. Throughput: 0: 5861.8. Samples: 775613454. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:52,942][25689] Avg episode reward: [(0, '-2.271')] [2022-07-10 14:12:53,827][26022] Updated weights on worker 0-0, policy_version 757432 (0.00091) [2022-07-10 14:12:55,627][26022] Updated weights on worker 0-0, policy_version 757442 (0.00093) [2022-07-10 14:12:57,467][26022] Updated weights on worker 0-0, policy_version 757452 (0.00093) [2022-07-10 14:12:58,024][25689] Fps is (10 sec: 5573.8, 60 sec: 5545.1, 300 sec: 5550.5). Total num frames: 775634944. Throughput: 0: 4993.4. Samples: 775630166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:12:58,024][25689] Avg episode reward: [(0, '-1.688')] [2022-07-10 14:12:59,218][26022] Updated weights on worker 0-0, policy_version 757462 (0.00092) [2022-07-10 14:13:01,087][26022] Updated weights on worker 0-0, policy_version 757472 (0.00086) [2022-07-10 14:13:03,069][25689] Fps is (10 sec: 5460.3, 60 sec: 5560.3, 300 sec: 5543.8). Total num frames: 775660544. Throughput: 0: 5802.1. Samples: 775663534. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:03,069][25689] Avg episode reward: [(0, '0.252')] [2022-07-10 14:13:03,556][26022] Updated weights on worker 0-0, policy_version 757482 (0.00083) [2022-07-10 14:13:05,209][26022] Updated weights on worker 0-0, policy_version 757492 (0.00092) [2022-07-10 14:13:07,304][26022] Updated weights on worker 0-0, policy_version 757502 (0.00087) [2022-07-10 14:13:08,127][25689] Fps is (10 sec: 5169.2, 60 sec: 5510.8, 300 sec: 5543.0). Total num frames: 775687168. Throughput: 0: 5661.4. Samples: 775694458. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:08,127][25689] Avg episode reward: [(0, '-0.183')] [2022-07-10 14:13:08,850][26022] Updated weights on worker 0-0, policy_version 757512 (0.00084) [2022-07-10 14:13:10,963][26022] Updated weights on worker 0-0, policy_version 757522 (0.00084) [2022-07-10 14:13:12,604][26022] Updated weights on worker 0-0, policy_version 757532 (0.00089) [2022-07-10 14:13:13,143][25689] Fps is (10 sec: 5386.8, 60 sec: 5529.7, 300 sec: 5539.4). Total num frames: 775714816. Throughput: 0: 4828.5. Samples: 775710998. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:13,144][25689] Avg episode reward: [(0, '-0.400')] [2022-07-10 14:13:14,479][26022] Updated weights on worker 0-0, policy_version 757542 (0.00086) [2022-07-10 14:13:16,416][26022] Updated weights on worker 0-0, policy_version 757552 (0.00087) [2022-07-10 14:13:18,183][25689] Fps is (10 sec: 5498.6, 60 sec: 5527.6, 300 sec: 5535.5). Total num frames: 775742464. Throughput: 0: 5677.0. Samples: 775744610. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:18,184][25689] Avg episode reward: [(0, '-0.613')] [2022-07-10 14:13:18,275][26022] Updated weights on worker 0-0, policy_version 757562 (0.00083) [2022-07-10 14:13:20,234][26022] Updated weights on worker 0-0, policy_version 757572 (0.00082) [2022-07-10 14:13:21,667][26022] Updated weights on worker 0-0, policy_version 757582 (0.00088) [2022-07-10 14:13:23,185][25689] Fps is (10 sec: 5506.7, 60 sec: 5511.6, 300 sec: 5533.0). Total num frames: 775770112. Throughput: 0: 5688.9. Samples: 775777976. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:23,187][25689] Avg episode reward: [(0, '-1.724')] [2022-07-10 14:13:23,899][26022] Updated weights on worker 0-0, policy_version 757592 (0.00090) [2022-07-10 14:13:25,480][26022] Updated weights on worker 0-0, policy_version 757602 (0.00092) [2022-07-10 14:13:27,451][26022] Updated weights on worker 0-0, policy_version 757612 (0.00089) [2022-07-10 14:13:28,282][25689] Fps is (10 sec: 5678.4, 60 sec: 5540.6, 300 sec: 5539.3). Total num frames: 775799808. Throughput: 0: 4980.8. Samples: 775794846. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:28,282][25689] Avg episode reward: [(0, '-1.618')] [2022-07-10 14:13:29,238][26022] Updated weights on worker 0-0, policy_version 757622 (0.00100) [2022-07-10 14:13:30,995][26022] Updated weights on worker 0-0, policy_version 757632 (0.00089) [2022-07-10 14:13:33,043][26022] Updated weights on worker 0-0, policy_version 757642 (0.00081) [2022-07-10 14:13:33,310][25689] Fps is (10 sec: 5562.7, 60 sec: 5523.3, 300 sec: 5532.8). Total num frames: 775826432. Throughput: 0: 5816.3. Samples: 775828290. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:33,310][25689] Avg episode reward: [(0, '-3.599')] [2022-07-10 14:13:34,705][26022] Updated weights on worker 0-0, policy_version 757652 (0.00086) [2022-07-10 14:13:36,536][26022] Updated weights on worker 0-0, policy_version 757662 (0.00087) [2022-07-10 14:13:38,405][25689] Fps is (10 sec: 5462.0, 60 sec: 5514.7, 300 sec: 5539.6). Total num frames: 775855104. Throughput: 0: 5818.5. Samples: 775862274. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:38,406][25689] Avg episode reward: [(0, '-3.419')] [2022-07-10 14:13:38,441][26022] Updated weights on worker 0-0, policy_version 757672 (0.01087) [2022-07-10 14:13:39,982][26022] Updated weights on worker 0-0, policy_version 757682 (0.00094) [2022-07-10 14:13:42,100][26022] Updated weights on worker 0-0, policy_version 757692 (0.00084) [2022-07-10 14:13:43,419][25689] Fps is (10 sec: 5773.9, 60 sec: 5564.8, 300 sec: 5538.0). Total num frames: 775884800. Throughput: 0: 5006.0. Samples: 775879264. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:43,419][25689] Avg episode reward: [(0, '-3.325')] [2022-07-10 14:13:43,706][26022] Updated weights on worker 0-0, policy_version 757702 (0.00089) [2022-07-10 14:13:45,718][26022] Updated weights on worker 0-0, policy_version 757712 (0.00087) [2022-07-10 14:13:47,303][26022] Updated weights on worker 0-0, policy_version 757722 (0.00096) [2022-07-10 14:13:48,522][25689] Fps is (10 sec: 5567.2, 60 sec: 5491.5, 300 sec: 5536.5). Total num frames: 775911424. Throughput: 0: 5825.5. Samples: 775912754. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:48,522][25689] Avg episode reward: [(0, '-1.672')] [2022-07-10 14:13:49,231][26022] Updated weights on worker 0-0, policy_version 757732 (0.00092) [2022-07-10 14:13:51,052][26022] Updated weights on worker 0-0, policy_version 757742 (0.00086) [2022-07-10 14:13:53,114][26022] Updated weights on worker 0-0, policy_version 757752 (0.00094) [2022-07-10 14:13:53,557][25689] Fps is (10 sec: 5454.1, 60 sec: 5524.1, 300 sec: 5543.6). Total num frames: 775940096. Throughput: 0: 5821.7. Samples: 775946164. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:53,558][25689] Avg episode reward: [(0, '-1.760')] [2022-07-10 14:13:54,714][26022] Updated weights on worker 0-0, policy_version 757762 (0.00084) [2022-07-10 14:13:56,669][26022] Updated weights on worker 0-0, policy_version 757772 (0.00110) [2022-07-10 14:13:58,363][26022] Updated weights on worker 0-0, policy_version 757782 (0.00089) [2022-07-10 14:13:58,644][25689] Fps is (10 sec: 5665.3, 60 sec: 5506.8, 300 sec: 5539.7). Total num frames: 775968768. Throughput: 0: 4971.9. Samples: 775962898. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:13:58,644][25689] Avg episode reward: [(0, '-2.092')] [2022-07-10 14:14:00,419][26022] Updated weights on worker 0-0, policy_version 757792 (0.00085) [2022-07-10 14:14:02,793][26022] Updated weights on worker 0-0, policy_version 757802 (0.00089) [2022-07-10 14:14:03,659][25689] Fps is (10 sec: 5474.0, 60 sec: 5526.5, 300 sec: 5541.7). Total num frames: 775995392. Throughput: 0: 5749.5. Samples: 775995634. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:03,659][25689] Avg episode reward: [(0, '-0.229')] [2022-07-10 14:14:04,454][26022] Updated weights on worker 0-0, policy_version 757812 (0.00091) [2022-07-10 14:14:06,353][26022] Updated weights on worker 0-0, policy_version 757822 (0.00092) [2022-07-10 14:14:08,132][26022] Updated weights on worker 0-0, policy_version 757832 (0.00090) [2022-07-10 14:14:08,722][25689] Fps is (10 sec: 5385.3, 60 sec: 5542.9, 300 sec: 5540.9). Total num frames: 776023040. Throughput: 0: 5684.9. Samples: 776027588. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:08,722][25689] Avg episode reward: [(0, '-0.060')] [2022-07-10 14:14:10,028][26022] Updated weights on worker 0-0, policy_version 757842 (0.00089) [2022-07-10 14:14:11,695][26022] Updated weights on worker 0-0, policy_version 757852 (0.00083) [2022-07-10 14:14:13,779][25689] Fps is (10 sec: 5463.8, 60 sec: 5539.2, 300 sec: 5541.2). Total num frames: 776050688. Throughput: 0: 4848.4. Samples: 776044208. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:13,780][25689] Avg episode reward: [(0, '-0.271')] [2022-07-10 14:14:13,786][26022] Updated weights on worker 0-0, policy_version 757862 (0.00087) [2022-07-10 14:14:15,346][26022] Updated weights on worker 0-0, policy_version 757872 (0.00087) [2022-07-10 14:14:17,382][26022] Updated weights on worker 0-0, policy_version 757882 (0.00088) [2022-07-10 14:14:18,838][25689] Fps is (10 sec: 5567.2, 60 sec: 5554.3, 300 sec: 5541.7). Total num frames: 776079360. Throughput: 0: 5693.8. Samples: 776077880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:18,839][25689] Avg episode reward: [(0, '-0.826')] [2022-07-10 14:14:19,139][26022] Updated weights on worker 0-0, policy_version 757892 (0.00089) [2022-07-10 14:14:20,862][26022] Updated weights on worker 0-0, policy_version 757902 (0.00092) [2022-07-10 14:14:22,972][26022] Updated weights on worker 0-0, policy_version 757912 (0.00088) [2022-07-10 14:14:23,885][25689] Fps is (10 sec: 5471.6, 60 sec: 5533.3, 300 sec: 5531.6). Total num frames: 776105984. Throughput: 0: 5716.0. Samples: 776111248. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:23,886][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 14:14:24,658][26022] Updated weights on worker 0-0, policy_version 757922 (0.00388) [2022-07-10 14:14:26,547][26022] Updated weights on worker 0-0, policy_version 757932 (0.00092) [2022-07-10 14:14:28,420][26022] Updated weights on worker 0-0, policy_version 757942 (0.00086) [2022-07-10 14:14:28,951][25689] Fps is (10 sec: 5468.1, 60 sec: 5519.3, 300 sec: 5537.5). Total num frames: 776134656. Throughput: 0: 5788.6. Samples: 776144684. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:28,951][25689] Avg episode reward: [(0, '-0.297')] [2022-07-10 14:14:30,117][26022] Updated weights on worker 0-0, policy_version 757952 (0.00088) [2022-07-10 14:14:32,111][26022] Updated weights on worker 0-0, policy_version 757962 (0.00089) [2022-07-10 14:14:33,952][26022] Updated weights on worker 0-0, policy_version 757972 (0.00090) [2022-07-10 14:14:33,962][25689] Fps is (10 sec: 5690.9, 60 sec: 5554.6, 300 sec: 5540.7). Total num frames: 776163328. Throughput: 0: 5806.4. Samples: 776161396. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:33,962][25689] Avg episode reward: [(0, '-0.537')] [2022-07-10 14:14:35,721][26022] Updated weights on worker 0-0, policy_version 757982 (0.00087) [2022-07-10 14:14:37,526][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:14:37,541][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000757991_776182784.pth [2022-07-10 14:14:37,541][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000756042_774187008.pth [2022-07-10 14:14:37,771][26022] Updated weights on worker 0-0, policy_version 757992 (0.00092) [2022-07-10 14:14:38,994][25689] Fps is (10 sec: 5607.4, 60 sec: 5543.5, 300 sec: 5537.1). Total num frames: 776190976. Throughput: 0: 5795.1. Samples: 776194688. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:38,995][25689] Avg episode reward: [(0, '-0.876')] [2022-07-10 14:14:39,468][26022] Updated weights on worker 0-0, policy_version 758002 (0.00093) [2022-07-10 14:14:41,306][26022] Updated weights on worker 0-0, policy_version 758012 (0.00085) [2022-07-10 14:14:43,262][26022] Updated weights on worker 0-0, policy_version 758022 (0.00082) [2022-07-10 14:14:44,028][25689] Fps is (10 sec: 5493.3, 60 sec: 5507.8, 300 sec: 5537.6). Total num frames: 776218624. Throughput: 0: 5801.1. Samples: 776228096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:44,028][25689] Avg episode reward: [(0, '-1.937')] [2022-07-10 14:14:44,991][26022] Updated weights on worker 0-0, policy_version 758032 (0.00473) [2022-07-10 14:14:47,042][26022] Updated weights on worker 0-0, policy_version 758042 (0.00081) [2022-07-10 14:14:48,575][26022] Updated weights on worker 0-0, policy_version 758052 (0.00088) [2022-07-10 14:14:49,161][25689] Fps is (10 sec: 5640.3, 60 sec: 5555.7, 300 sec: 5538.8). Total num frames: 776248320. Throughput: 0: 4954.4. Samples: 776244816. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 14:14:49,162][25689] Avg episode reward: [(0, '-1.440')] [2022-07-10 14:14:50,480][26022] Updated weights on worker 0-0, policy_version 758062 (0.00085) [2022-07-10 14:14:52,311][26022] Updated weights on worker 0-0, policy_version 758072 (0.00096) [2022-07-10 14:14:53,980][26022] Updated weights on worker 0-0, policy_version 758082 (0.00081) [2022-07-10 14:14:54,215][25689] Fps is (10 sec: 5628.8, 60 sec: 5537.1, 300 sec: 5538.8). Total num frames: 776275968. Throughput: 0: 5780.6. Samples: 776278474. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:14:54,216][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 14:14:56,102][26022] Updated weights on worker 0-0, policy_version 758092 (0.00080) [2022-07-10 14:14:57,840][26022] Updated weights on worker 0-0, policy_version 758102 (0.00100) [2022-07-10 14:14:59,229][25689] Fps is (10 sec: 5492.2, 60 sec: 5526.9, 300 sec: 5538.8). Total num frames: 776303616. Throughput: 0: 5806.4. Samples: 776312180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:14:59,230][25689] Avg episode reward: [(0, '-1.523')] [2022-07-10 14:14:59,601][26022] Updated weights on worker 0-0, policy_version 758112 (0.00088) [2022-07-10 14:15:01,606][26022] Updated weights on worker 0-0, policy_version 758122 (0.00084) [2022-07-10 14:15:03,662][26022] Updated weights on worker 0-0, policy_version 758132 (0.00085) [2022-07-10 14:15:04,241][25689] Fps is (10 sec: 5413.6, 60 sec: 5527.2, 300 sec: 5544.9). Total num frames: 776330240. Throughput: 0: 4918.2. Samples: 776327506. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:04,241][25689] Avg episode reward: [(0, '-2.020')] [2022-07-10 14:15:05,631][26022] Updated weights on worker 0-0, policy_version 758142 (0.00091) [2022-07-10 14:15:07,310][26022] Updated weights on worker 0-0, policy_version 758152 (0.00094) [2022-07-10 14:15:09,137][26022] Updated weights on worker 0-0, policy_version 758162 (0.00080) [2022-07-10 14:15:09,329][25689] Fps is (10 sec: 5474.9, 60 sec: 5541.8, 300 sec: 5537.5). Total num frames: 776358912. Throughput: 0: 5731.0. Samples: 776360400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:09,330][25689] Avg episode reward: [(0, '-1.796')] [2022-07-10 14:15:10,951][26022] Updated weights on worker 0-0, policy_version 758172 (0.00109) [2022-07-10 14:15:12,772][26022] Updated weights on worker 0-0, policy_version 758182 (0.00090) [2022-07-10 14:15:14,342][25689] Fps is (10 sec: 5575.6, 60 sec: 5545.9, 300 sec: 5541.2). Total num frames: 776386560. Throughput: 0: 5736.0. Samples: 776393920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:14,343][25689] Avg episode reward: [(0, '-0.777')] [2022-07-10 14:15:14,726][26022] Updated weights on worker 0-0, policy_version 758192 (0.00088) [2022-07-10 14:15:16,618][26022] Updated weights on worker 0-0, policy_version 758202 (0.00087) [2022-07-10 14:15:18,354][26022] Updated weights on worker 0-0, policy_version 758212 (0.00084) [2022-07-10 14:15:19,423][25689] Fps is (10 sec: 5376.8, 60 sec: 5510.0, 300 sec: 5540.1). Total num frames: 776413184. Throughput: 0: 4877.2. Samples: 776410668. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:19,423][25689] Avg episode reward: [(0, '-1.267')] [2022-07-10 14:15:20,190][26022] Updated weights on worker 0-0, policy_version 758222 (0.00094) [2022-07-10 14:15:21,991][26022] Updated weights on worker 0-0, policy_version 758232 (0.00092) [2022-07-10 14:15:24,020][26022] Updated weights on worker 0-0, policy_version 758242 (0.00080) [2022-07-10 14:15:24,469][25689] Fps is (10 sec: 5561.4, 60 sec: 5560.9, 300 sec: 5540.6). Total num frames: 776442880. Throughput: 0: 5765.1. Samples: 776444124. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:24,469][25689] Avg episode reward: [(0, '-1.739')] [2022-07-10 14:15:25,883][26022] Updated weights on worker 0-0, policy_version 758252 (0.00089) [2022-07-10 14:15:27,518][26022] Updated weights on worker 0-0, policy_version 758262 (0.00087) [2022-07-10 14:15:29,539][25689] Fps is (10 sec: 5567.4, 60 sec: 5526.6, 300 sec: 5533.5). Total num frames: 776469504. Throughput: 0: 5775.1. Samples: 776477112. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:29,539][25689] Avg episode reward: [(0, '-1.001')] [2022-07-10 14:15:29,564][26022] Updated weights on worker 0-0, policy_version 758272 (0.00088) [2022-07-10 14:15:31,355][26022] Updated weights on worker 0-0, policy_version 758282 (0.00101) [2022-07-10 14:15:33,190][26022] Updated weights on worker 0-0, policy_version 758292 (0.00055) [2022-07-10 14:15:34,546][25689] Fps is (10 sec: 5588.6, 60 sec: 5543.8, 300 sec: 5537.2). Total num frames: 776499200. Throughput: 0: 4949.5. Samples: 776493922. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:34,547][25689] Avg episode reward: [(0, '-0.481')] [2022-07-10 14:15:35,043][26022] Updated weights on worker 0-0, policy_version 758302 (0.00087) [2022-07-10 14:15:36,819][26022] Updated weights on worker 0-0, policy_version 758312 (0.00086) [2022-07-10 14:15:38,857][26022] Updated weights on worker 0-0, policy_version 758322 (0.00087) [2022-07-10 14:15:39,624][25689] Fps is (10 sec: 5685.7, 60 sec: 5539.7, 300 sec: 5536.2). Total num frames: 776526848. Throughput: 0: 5779.5. Samples: 776527424. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:39,625][25689] Avg episode reward: [(0, '-0.380')] [2022-07-10 14:15:40,494][26022] Updated weights on worker 0-0, policy_version 758332 (0.00084) [2022-07-10 14:15:42,298][26022] Updated weights on worker 0-0, policy_version 758342 (0.00087) [2022-07-10 14:15:44,225][26022] Updated weights on worker 0-0, policy_version 758352 (0.00090) [2022-07-10 14:15:44,721][25689] Fps is (10 sec: 5434.4, 60 sec: 5533.9, 300 sec: 5535.3). Total num frames: 776554496. Throughput: 0: 5766.4. Samples: 776560910. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:44,722][25689] Avg episode reward: [(0, '0.233')] [2022-07-10 14:15:46,050][26022] Updated weights on worker 0-0, policy_version 758362 (0.00096) [2022-07-10 14:15:47,869][26022] Updated weights on worker 0-0, policy_version 758372 (0.00087) [2022-07-10 14:15:49,768][25689] Fps is (10 sec: 5451.4, 60 sec: 5508.0, 300 sec: 5528.3). Total num frames: 776582144. Throughput: 0: 4960.5. Samples: 776577458. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:49,768][25689] Avg episode reward: [(0, '-2.168')] [2022-07-10 14:15:49,834][26022] Updated weights on worker 0-0, policy_version 758382 (0.00092) [2022-07-10 14:15:51,482][26022] Updated weights on worker 0-0, policy_version 758392 (0.00086) [2022-07-10 14:15:53,551][26022] Updated weights on worker 0-0, policy_version 758402 (0.00089) [2022-07-10 14:15:54,778][25689] Fps is (10 sec: 5498.3, 60 sec: 5512.0, 300 sec: 5531.9). Total num frames: 776609792. Throughput: 0: 5785.2. Samples: 776610970. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:54,779][25689] Avg episode reward: [(0, '-2.317')] [2022-07-10 14:15:55,118][26022] Updated weights on worker 0-0, policy_version 758412 (0.00087) [2022-07-10 14:15:56,923][26022] Updated weights on worker 0-0, policy_version 758422 (0.00087) [2022-07-10 14:15:59,071][26022] Updated weights on worker 0-0, policy_version 758432 (0.00092) [2022-07-10 14:15:59,780][25689] Fps is (10 sec: 5625.0, 60 sec: 5530.0, 300 sec: 5546.2). Total num frames: 776638464. Throughput: 0: 5796.9. Samples: 776644266. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:15:59,781][25689] Avg episode reward: [(0, '-2.770')] [2022-07-10 14:16:00,784][26022] Updated weights on worker 0-0, policy_version 758442 (0.00090) [2022-07-10 14:16:03,107][26022] Updated weights on worker 0-0, policy_version 758452 (0.00087) [2022-07-10 14:16:04,601][26022] Updated weights on worker 0-0, policy_version 758462 (0.00085) [2022-07-10 14:16:04,804][25689] Fps is (10 sec: 5515.7, 60 sec: 5528.9, 300 sec: 5533.7). Total num frames: 776665088. Throughput: 0: 4875.1. Samples: 776658812. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:04,804][25689] Avg episode reward: [(0, '-3.181')] [2022-07-10 14:16:06,719][26022] Updated weights on worker 0-0, policy_version 758472 (0.00097) [2022-07-10 14:16:08,576][26022] Updated weights on worker 0-0, policy_version 758482 (0.00093) [2022-07-10 14:16:09,862][25689] Fps is (10 sec: 5383.1, 60 sec: 5514.7, 300 sec: 5532.8). Total num frames: 776692736. Throughput: 0: 5713.4. Samples: 776692264. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:09,863][25689] Avg episode reward: [(0, '-3.915')] [2022-07-10 14:16:10,339][26022] Updated weights on worker 0-0, policy_version 758492 (0.00092) [2022-07-10 14:16:12,108][26022] Updated weights on worker 0-0, policy_version 758502 (0.00085) [2022-07-10 14:16:14,139][26022] Updated weights on worker 0-0, policy_version 758512 (0.00085) [2022-07-10 14:16:14,891][25689] Fps is (10 sec: 5481.8, 60 sec: 5513.3, 300 sec: 5532.3). Total num frames: 776720384. Throughput: 0: 5708.3. Samples: 776725778. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:14,891][25689] Avg episode reward: [(0, '-3.957')] [2022-07-10 14:16:15,844][26022] Updated weights on worker 0-0, policy_version 758522 (0.00083) [2022-07-10 14:16:17,754][26022] Updated weights on worker 0-0, policy_version 758532 (0.00089) [2022-07-10 14:16:19,524][26022] Updated weights on worker 0-0, policy_version 758542 (0.00092) [2022-07-10 14:16:19,932][25689] Fps is (10 sec: 5592.7, 60 sec: 5550.7, 300 sec: 5531.7). Total num frames: 776749056. Throughput: 0: 4888.1. Samples: 776742772. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:19,933][25689] Avg episode reward: [(0, '-3.065')] [2022-07-10 14:16:21,302][26022] Updated weights on worker 0-0, policy_version 758552 (0.00092) [2022-07-10 14:16:23,157][26022] Updated weights on worker 0-0, policy_version 758562 (0.00094) [2022-07-10 14:16:24,901][26022] Updated weights on worker 0-0, policy_version 758572 (0.00086) [2022-07-10 14:16:24,942][25689] Fps is (10 sec: 5705.2, 60 sec: 5537.1, 300 sec: 5536.8). Total num frames: 776777728. Throughput: 0: 5830.4. Samples: 776776226. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:24,943][25689] Avg episode reward: [(0, '-0.858')] [2022-07-10 14:16:26,999][26022] Updated weights on worker 0-0, policy_version 758582 (0.00091) [2022-07-10 14:16:28,656][26022] Updated weights on worker 0-0, policy_version 758592 (0.00089) [2022-07-10 14:16:30,027][25689] Fps is (10 sec: 5477.6, 60 sec: 5535.7, 300 sec: 5532.3). Total num frames: 776804352. Throughput: 0: 5799.9. Samples: 776809220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:30,029][25689] Avg episode reward: [(0, '-0.845')] [2022-07-10 14:16:30,691][26022] Updated weights on worker 0-0, policy_version 758602 (0.00097) [2022-07-10 14:16:32,536][26022] Updated weights on worker 0-0, policy_version 758612 (0.00082) [2022-07-10 14:16:34,431][26022] Updated weights on worker 0-0, policy_version 758622 (0.00084) [2022-07-10 14:16:35,044][25689] Fps is (10 sec: 5372.4, 60 sec: 5501.0, 300 sec: 5532.1). Total num frames: 776832000. Throughput: 0: 4962.4. Samples: 776825786. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:35,045][25689] Avg episode reward: [(0, '-0.097')] [2022-07-10 14:16:36,096][26022] Updated weights on worker 0-0, policy_version 758632 (0.00086) [2022-07-10 14:16:37,739][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:16:37,754][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000758640_776847360.pth [2022-07-10 14:16:37,755][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000756692_774852608.pth [2022-07-10 14:16:38,223][26022] Updated weights on worker 0-0, policy_version 758642 (0.00092) [2022-07-10 14:16:39,764][26022] Updated weights on worker 0-0, policy_version 758652 (0.00091) [2022-07-10 14:16:40,071][25689] Fps is (10 sec: 5607.7, 60 sec: 5522.7, 300 sec: 5535.8). Total num frames: 776860672. Throughput: 0: 5782.0. Samples: 776859212. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:40,071][25689] Avg episode reward: [(0, '-0.195')] [2022-07-10 14:16:41,929][26022] Updated weights on worker 0-0, policy_version 758662 (0.00087) [2022-07-10 14:16:43,411][26022] Updated weights on worker 0-0, policy_version 758672 (0.00087) [2022-07-10 14:16:45,095][25689] Fps is (10 sec: 5501.7, 60 sec: 5512.4, 300 sec: 5527.4). Total num frames: 776887296. Throughput: 0: 5780.9. Samples: 776892728. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:45,096][25689] Avg episode reward: [(0, '-0.212')] [2022-07-10 14:16:45,579][26022] Updated weights on worker 0-0, policy_version 758682 (0.00086) [2022-07-10 14:16:47,122][26022] Updated weights on worker 0-0, policy_version 758692 (0.00090) [2022-07-10 14:16:49,299][26022] Updated weights on worker 0-0, policy_version 758702 (0.00094) [2022-07-10 14:16:50,143][25689] Fps is (10 sec: 5489.9, 60 sec: 5529.2, 300 sec: 5530.4). Total num frames: 776915968. Throughput: 0: 5811.7. Samples: 776926126. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:50,144][25689] Avg episode reward: [(0, '-0.428')] [2022-07-10 14:16:50,794][26022] Updated weights on worker 0-0, policy_version 758712 (0.00085) [2022-07-10 14:16:52,846][26022] Updated weights on worker 0-0, policy_version 758722 (0.00087) [2022-07-10 14:16:54,422][26022] Updated weights on worker 0-0, policy_version 758732 (0.00049) [2022-07-10 14:16:55,205][25689] Fps is (10 sec: 5672.1, 60 sec: 5541.4, 300 sec: 5526.6). Total num frames: 776944640. Throughput: 0: 5808.6. Samples: 776942890. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:16:55,205][25689] Avg episode reward: [(0, '-0.128')] [2022-07-10 14:16:56,560][26022] Updated weights on worker 0-0, policy_version 758742 (0.00090) [2022-07-10 14:16:58,091][26022] Updated weights on worker 0-0, policy_version 758752 (0.00093) [2022-07-10 14:17:00,210][25689] Fps is (10 sec: 5493.0, 60 sec: 5507.2, 300 sec: 5533.9). Total num frames: 776971264. Throughput: 0: 5815.2. Samples: 776976324. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:00,213][25689] Avg episode reward: [(0, '-0.657')] [2022-07-10 14:17:00,231][26022] Updated weights on worker 0-0, policy_version 758762 (0.00084) [2022-07-10 14:17:01,984][26022] Updated weights on worker 0-0, policy_version 758772 (0.00137) [2022-07-10 14:17:04,124][26022] Updated weights on worker 0-0, policy_version 758782 (0.00082) [2022-07-10 14:17:05,223][25689] Fps is (10 sec: 5315.2, 60 sec: 5508.2, 300 sec: 5524.7). Total num frames: 776997888. Throughput: 0: 5719.9. Samples: 777007858. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:05,224][25689] Avg episode reward: [(0, '-1.379')] [2022-07-10 14:17:05,920][26022] Updated weights on worker 0-0, policy_version 758792 (0.00082) [2022-07-10 14:17:07,967][26022] Updated weights on worker 0-0, policy_version 758802 (0.00083) [2022-07-10 14:17:09,611][26022] Updated weights on worker 0-0, policy_version 758812 (0.00079) [2022-07-10 14:17:10,354][25689] Fps is (10 sec: 5451.1, 60 sec: 5518.5, 300 sec: 5529.8). Total num frames: 777026560. Throughput: 0: 4863.6. Samples: 777024424. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:10,357][25689] Avg episode reward: [(0, '-0.813')] [2022-07-10 14:17:11,694][26022] Updated weights on worker 0-0, policy_version 758822 (0.00087) [2022-07-10 14:17:13,332][26022] Updated weights on worker 0-0, policy_version 758832 (0.00090) [2022-07-10 14:17:15,269][26022] Updated weights on worker 0-0, policy_version 758842 (0.00097) [2022-07-10 14:17:15,373][25689] Fps is (10 sec: 5548.5, 60 sec: 5519.4, 300 sec: 5529.8). Total num frames: 777054208. Throughput: 0: 5693.1. Samples: 777057712. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:15,374][25689] Avg episode reward: [(0, '-1.205')] [2022-07-10 14:17:17,032][26022] Updated weights on worker 0-0, policy_version 758852 (0.00099) [2022-07-10 14:17:18,896][26022] Updated weights on worker 0-0, policy_version 758862 (0.00092) [2022-07-10 14:17:20,436][25689] Fps is (10 sec: 5484.3, 60 sec: 5500.5, 300 sec: 5525.4). Total num frames: 777081856. Throughput: 0: 5678.3. Samples: 777091178. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:20,437][25689] Avg episode reward: [(0, '-1.060')] [2022-07-10 14:17:20,737][26022] Updated weights on worker 0-0, policy_version 758872 (0.00085) [2022-07-10 14:17:22,660][26022] Updated weights on worker 0-0, policy_version 758882 (0.00087) [2022-07-10 14:17:24,421][26022] Updated weights on worker 0-0, policy_version 758892 (0.00082) [2022-07-10 14:17:25,531][25689] Fps is (10 sec: 5544.7, 60 sec: 5492.8, 300 sec: 5527.9). Total num frames: 777110528. Throughput: 0: 4936.2. Samples: 777108102. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:25,531][25689] Avg episode reward: [(0, '-0.568')] [2022-07-10 14:17:26,363][26022] Updated weights on worker 0-0, policy_version 758902 (0.00090) [2022-07-10 14:17:28,040][26022] Updated weights on worker 0-0, policy_version 758912 (0.00085) [2022-07-10 14:17:30,027][26022] Updated weights on worker 0-0, policy_version 758922 (0.00095) [2022-07-10 14:17:30,660][25689] Fps is (10 sec: 5609.1, 60 sec: 5522.6, 300 sec: 5529.4). Total num frames: 777139200. Throughput: 0: 5765.3. Samples: 777141494. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:30,660][25689] Avg episode reward: [(0, '-0.845')] [2022-07-10 14:17:31,763][26022] Updated weights on worker 0-0, policy_version 758932 (0.00084) [2022-07-10 14:17:33,744][26022] Updated weights on worker 0-0, policy_version 758942 (0.00091) [2022-07-10 14:17:35,455][26022] Updated weights on worker 0-0, policy_version 758952 (0.00091) [2022-07-10 14:17:35,688][25689] Fps is (10 sec: 5645.5, 60 sec: 5538.4, 300 sec: 5528.9). Total num frames: 777167872. Throughput: 0: 5788.9. Samples: 777175312. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:35,689][25689] Avg episode reward: [(0, '-0.146')] [2022-07-10 14:17:37,320][26022] Updated weights on worker 0-0, policy_version 758962 (0.00082) [2022-07-10 14:17:39,091][26022] Updated weights on worker 0-0, policy_version 758972 (0.00096) [2022-07-10 14:17:40,699][25689] Fps is (10 sec: 5609.9, 60 sec: 5522.9, 300 sec: 5532.2). Total num frames: 777195520. Throughput: 0: 4986.2. Samples: 777192212. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:40,700][25689] Avg episode reward: [(0, '0.360')] [2022-07-10 14:17:40,910][26022] Updated weights on worker 0-0, policy_version 758982 (0.00086) [2022-07-10 14:17:42,572][26022] Updated weights on worker 0-0, policy_version 758992 (0.00628) [2022-07-10 14:17:44,660][26022] Updated weights on worker 0-0, policy_version 759002 (0.00080) [2022-07-10 14:17:45,710][25689] Fps is (10 sec: 5721.9, 60 sec: 5574.8, 300 sec: 5529.4). Total num frames: 777225216. Throughput: 0: 5847.7. Samples: 777226104. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:45,710][25689] Avg episode reward: [(0, '-1.458')] [2022-07-10 14:17:46,376][26022] Updated weights on worker 0-0, policy_version 759012 (0.00092) [2022-07-10 14:17:48,255][26022] Updated weights on worker 0-0, policy_version 759022 (0.00083) [2022-07-10 14:17:49,904][26022] Updated weights on worker 0-0, policy_version 759032 (0.00090) [2022-07-10 14:17:50,823][25689] Fps is (10 sec: 5664.6, 60 sec: 5552.1, 300 sec: 5531.1). Total num frames: 777252864. Throughput: 0: 5864.8. Samples: 777259744. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:50,823][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 14:17:51,835][26022] Updated weights on worker 0-0, policy_version 759042 (0.00085) [2022-07-10 14:17:53,667][26022] Updated weights on worker 0-0, policy_version 759052 (0.00093) [2022-07-10 14:17:55,457][26022] Updated weights on worker 0-0, policy_version 759062 (0.00080) [2022-07-10 14:17:55,849][25689] Fps is (10 sec: 5554.7, 60 sec: 5555.3, 300 sec: 5528.7). Total num frames: 777281536. Throughput: 0: 5034.5. Samples: 777276812. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:17:55,850][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 14:17:57,255][26022] Updated weights on worker 0-0, policy_version 759072 (0.00096) [2022-07-10 14:17:59,266][26022] Updated weights on worker 0-0, policy_version 759082 (0.00090) [2022-07-10 14:18:00,863][25689] Fps is (10 sec: 5609.2, 60 sec: 5571.3, 300 sec: 5536.2). Total num frames: 777309184. Throughput: 0: 5848.4. Samples: 777310138. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:18:00,864][25689] Avg episode reward: [(0, '-1.395')] [2022-07-10 14:18:00,908][26022] Updated weights on worker 0-0, policy_version 759092 (0.00083) [2022-07-10 14:18:03,231][26022] Updated weights on worker 0-0, policy_version 759102 (0.00098) [2022-07-10 14:18:05,066][26022] Updated weights on worker 0-0, policy_version 759112 (0.00107) [2022-07-10 14:18:05,879][25689] Fps is (10 sec: 5309.2, 60 sec: 5554.2, 300 sec: 5533.5). Total num frames: 777334784. Throughput: 0: 5720.3. Samples: 777341474. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:18:05,879][25689] Avg episode reward: [(0, '-2.568')] [2022-07-10 14:18:06,884][26022] Updated weights on worker 0-0, policy_version 759122 (0.00081) [2022-07-10 14:18:08,852][26022] Updated weights on worker 0-0, policy_version 759132 (0.00089) [2022-07-10 14:18:10,388][26022] Updated weights on worker 0-0, policy_version 759142 (0.00086) [2022-07-10 14:18:10,954][25689] Fps is (10 sec: 5276.6, 60 sec: 5542.4, 300 sec: 5532.4). Total num frames: 777362432. Throughput: 0: 4888.5. Samples: 777358158. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:18:10,955][25689] Avg episode reward: [(0, '-2.988')] [2022-07-10 14:18:12,321][26022] Updated weights on worker 0-0, policy_version 759152 (0.00084) [2022-07-10 14:18:14,112][26022] Updated weights on worker 0-0, policy_version 759162 (0.00094) [2022-07-10 14:18:15,982][25689] Fps is (10 sec: 5574.5, 60 sec: 5558.6, 300 sec: 5536.1). Total num frames: 777391104. Throughput: 0: 5696.3. Samples: 777391492. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 14:18:15,983][25689] Avg episode reward: [(0, '-3.831')] [2022-07-10 14:18:16,172][26022] Updated weights on worker 0-0, policy_version 759172 (0.00086) [2022-07-10 14:18:17,932][26022] Updated weights on worker 0-0, policy_version 759182 (0.00093) [2022-07-10 14:18:19,663][26022] Updated weights on worker 0-0, policy_version 759192 (0.00086) [2022-07-10 14:18:20,997][25689] Fps is (10 sec: 5710.0, 60 sec: 5579.9, 300 sec: 5539.3). Total num frames: 777419776. Throughput: 0: 5688.5. Samples: 777424670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:20,998][25689] Avg episode reward: [(0, '-4.203')] [2022-07-10 14:18:21,704][26022] Updated weights on worker 0-0, policy_version 759202 (0.00090) [2022-07-10 14:18:23,483][26022] Updated weights on worker 0-0, policy_version 759212 (0.00089) [2022-07-10 14:18:25,311][26022] Updated weights on worker 0-0, policy_version 759222 (0.00088) [2022-07-10 14:18:26,039][25689] Fps is (10 sec: 5497.9, 60 sec: 5550.8, 300 sec: 5530.0). Total num frames: 777446400. Throughput: 0: 4958.8. Samples: 777441448. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:26,040][25689] Avg episode reward: [(0, '-4.379')] [2022-07-10 14:18:27,147][26022] Updated weights on worker 0-0, policy_version 759232 (0.00098) [2022-07-10 14:18:29,136][26022] Updated weights on worker 0-0, policy_version 759242 (0.00090) [2022-07-10 14:18:30,913][26022] Updated weights on worker 0-0, policy_version 759252 (0.00092) [2022-07-10 14:18:31,125][25689] Fps is (10 sec: 5459.5, 60 sec: 5554.8, 300 sec: 5535.8). Total num frames: 777475072. Throughput: 0: 5776.4. Samples: 777474674. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:31,126][25689] Avg episode reward: [(0, '-4.462')] [2022-07-10 14:18:32,899][26022] Updated weights on worker 0-0, policy_version 759262 (0.00096) [2022-07-10 14:18:34,647][26022] Updated weights on worker 0-0, policy_version 759272 (0.00097) [2022-07-10 14:18:36,155][25689] Fps is (10 sec: 5567.8, 60 sec: 5537.7, 300 sec: 5533.6). Total num frames: 777502720. Throughput: 0: 5780.7. Samples: 777508106. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:36,155][25689] Avg episode reward: [(0, '-3.523')] [2022-07-10 14:18:36,393][26022] Updated weights on worker 0-0, policy_version 759282 (0.00086) [2022-07-10 14:18:37,936][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:18:37,950][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000759290_777512960.pth [2022-07-10 14:18:37,955][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000757342_775518208.pth [2022-07-10 14:18:38,408][26022] Updated weights on worker 0-0, policy_version 759292 (0.00093) [2022-07-10 14:18:40,199][26022] Updated weights on worker 0-0, policy_version 759302 (0.00093) [2022-07-10 14:18:41,248][25689] Fps is (10 sec: 5563.9, 60 sec: 5547.2, 300 sec: 5528.6). Total num frames: 777531392. Throughput: 0: 4946.8. Samples: 777524846. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:41,248][25689] Avg episode reward: [(0, '-4.513')] [2022-07-10 14:18:42,180][26022] Updated weights on worker 0-0, policy_version 759312 (0.00093) [2022-07-10 14:18:43,694][26022] Updated weights on worker 0-0, policy_version 759322 (0.00088) [2022-07-10 14:18:45,722][26022] Updated weights on worker 0-0, policy_version 759332 (0.00086) [2022-07-10 14:18:46,307][25689] Fps is (10 sec: 5446.8, 60 sec: 5492.1, 300 sec: 5529.5). Total num frames: 777558016. Throughput: 0: 5763.1. Samples: 777558250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:46,307][25689] Avg episode reward: [(0, '-2.363')] [2022-07-10 14:18:47,441][26022] Updated weights on worker 0-0, policy_version 759342 (0.00092) [2022-07-10 14:18:49,481][26022] Updated weights on worker 0-0, policy_version 759352 (0.00091) [2022-07-10 14:18:51,181][26022] Updated weights on worker 0-0, policy_version 759362 (0.00092) [2022-07-10 14:18:51,398][25689] Fps is (10 sec: 5548.8, 60 sec: 5527.8, 300 sec: 5531.9). Total num frames: 777587712. Throughput: 0: 5777.3. Samples: 777591794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:51,399][25689] Avg episode reward: [(0, '-2.737')] [2022-07-10 14:18:52,951][26022] Updated weights on worker 0-0, policy_version 759372 (0.00091) [2022-07-10 14:18:54,832][26022] Updated weights on worker 0-0, policy_version 759382 (0.00091) [2022-07-10 14:18:56,402][25689] Fps is (10 sec: 5680.2, 60 sec: 5512.9, 300 sec: 5530.0). Total num frames: 777615360. Throughput: 0: 5785.5. Samples: 777625248. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:18:56,403][25689] Avg episode reward: [(0, '-2.161')] [2022-07-10 14:18:56,831][26022] Updated weights on worker 0-0, policy_version 759392 (0.00094) [2022-07-10 14:18:58,567][26022] Updated weights on worker 0-0, policy_version 759402 (0.00088) [2022-07-10 14:19:00,360][26022] Updated weights on worker 0-0, policy_version 759412 (0.00089) [2022-07-10 14:19:01,440][25689] Fps is (10 sec: 5506.1, 60 sec: 5510.7, 300 sec: 5533.0). Total num frames: 777643008. Throughput: 0: 5788.9. Samples: 777641738. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:01,441][25689] Avg episode reward: [(0, '-1.968')] [2022-07-10 14:19:02,503][26022] Updated weights on worker 0-0, policy_version 759422 (0.00091) [2022-07-10 14:19:04,501][26022] Updated weights on worker 0-0, policy_version 759432 (0.00088) [2022-07-10 14:19:06,392][26022] Updated weights on worker 0-0, policy_version 759442 (0.00087) [2022-07-10 14:19:06,463][25689] Fps is (10 sec: 5394.5, 60 sec: 5527.0, 300 sec: 5530.3). Total num frames: 777669632. Throughput: 0: 5712.5. Samples: 777673392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:06,463][25689] Avg episode reward: [(0, '-0.207')] [2022-07-10 14:19:08,096][26022] Updated weights on worker 0-0, policy_version 759452 (0.00087) [2022-07-10 14:19:09,816][26022] Updated weights on worker 0-0, policy_version 759462 (0.00088) [2022-07-10 14:19:11,504][25689] Fps is (10 sec: 5392.9, 60 sec: 5530.1, 300 sec: 5530.6). Total num frames: 777697280. Throughput: 0: 5717.4. Samples: 777706750. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:11,505][25689] Avg episode reward: [(0, '-0.053')] [2022-07-10 14:19:11,764][26022] Updated weights on worker 0-0, policy_version 759472 (0.00093) [2022-07-10 14:19:13,766][26022] Updated weights on worker 0-0, policy_version 759482 (0.00090) [2022-07-10 14:19:15,346][26022] Updated weights on worker 0-0, policy_version 759492 (0.00090) [2022-07-10 14:19:16,523][25689] Fps is (10 sec: 5394.9, 60 sec: 5497.1, 300 sec: 5524.5). Total num frames: 777723904. Throughput: 0: 4888.2. Samples: 777723602. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:16,523][25689] Avg episode reward: [(0, '-0.030')] [2022-07-10 14:19:17,264][26022] Updated weights on worker 0-0, policy_version 759502 (0.00327) [2022-07-10 14:19:18,943][26022] Updated weights on worker 0-0, policy_version 759512 (0.00096) [2022-07-10 14:19:21,046][26022] Updated weights on worker 0-0, policy_version 759522 (0.00087) [2022-07-10 14:19:21,549][25689] Fps is (10 sec: 5607.0, 60 sec: 5513.1, 300 sec: 5535.2). Total num frames: 777753600. Throughput: 0: 5723.1. Samples: 777756820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:21,549][25689] Avg episode reward: [(0, '0.086')] [2022-07-10 14:19:23,042][26022] Updated weights on worker 0-0, policy_version 759532 (0.00091) [2022-07-10 14:19:24,428][26022] Updated weights on worker 0-0, policy_version 759542 (0.00091) [2022-07-10 14:19:26,536][26022] Updated weights on worker 0-0, policy_version 759552 (0.00087) [2022-07-10 14:19:26,575][25689] Fps is (10 sec: 5704.7, 60 sec: 5531.4, 300 sec: 5532.5). Total num frames: 777781248. Throughput: 0: 5821.9. Samples: 777790482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:26,576][25689] Avg episode reward: [(0, '-1.422')] [2022-07-10 14:19:28,200][26022] Updated weights on worker 0-0, policy_version 759562 (0.00092) [2022-07-10 14:19:30,060][26022] Updated weights on worker 0-0, policy_version 759572 (0.00086) [2022-07-10 14:19:31,627][25689] Fps is (10 sec: 5486.7, 60 sec: 5517.6, 300 sec: 5528.3). Total num frames: 777808896. Throughput: 0: 4982.3. Samples: 777807008. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:31,628][25689] Avg episode reward: [(0, '-2.170')] [2022-07-10 14:19:32,066][26022] Updated weights on worker 0-0, policy_version 759582 (0.00086) [2022-07-10 14:19:33,858][26022] Updated weights on worker 0-0, policy_version 759592 (0.00080) [2022-07-10 14:19:35,731][26022] Updated weights on worker 0-0, policy_version 759602 (0.00097) [2022-07-10 14:19:36,721][25689] Fps is (10 sec: 5651.6, 60 sec: 5545.5, 300 sec: 5534.0). Total num frames: 777838592. Throughput: 0: 5797.2. Samples: 777840696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:36,723][25689] Avg episode reward: [(0, '-2.405')] [2022-07-10 14:19:37,527][26022] Updated weights on worker 0-0, policy_version 759612 (0.00088) [2022-07-10 14:19:39,396][26022] Updated weights on worker 0-0, policy_version 759622 (0.00087) [2022-07-10 14:19:41,294][26022] Updated weights on worker 0-0, policy_version 759632 (0.00091) [2022-07-10 14:19:41,813][25689] Fps is (10 sec: 5629.4, 60 sec: 5528.7, 300 sec: 5532.9). Total num frames: 777866240. Throughput: 0: 5801.8. Samples: 777874392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:41,814][25689] Avg episode reward: [(0, '-2.503')] [2022-07-10 14:19:42,906][26022] Updated weights on worker 0-0, policy_version 759642 (0.00086) [2022-07-10 14:19:44,751][26022] Updated weights on worker 0-0, policy_version 759652 (0.00084) [2022-07-10 14:19:46,488][26022] Updated weights on worker 0-0, policy_version 759662 (0.00081) [2022-07-10 14:19:46,842][25689] Fps is (10 sec: 5565.0, 60 sec: 5565.3, 300 sec: 5531.4). Total num frames: 777894912. Throughput: 0: 4979.6. Samples: 777891402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:46,842][25689] Avg episode reward: [(0, '-1.909')] [2022-07-10 14:19:48,387][26022] Updated weights on worker 0-0, policy_version 759672 (0.00087) [2022-07-10 14:19:50,198][26022] Updated weights on worker 0-0, policy_version 759682 (0.00084) [2022-07-10 14:19:51,907][25689] Fps is (10 sec: 5681.4, 60 sec: 5550.8, 300 sec: 5534.6). Total num frames: 777923584. Throughput: 0: 5846.3. Samples: 777925570. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:51,907][25689] Avg episode reward: [(0, '-0.966')] [2022-07-10 14:19:52,043][26022] Updated weights on worker 0-0, policy_version 759692 (0.00087) [2022-07-10 14:19:53,671][26022] Updated weights on worker 0-0, policy_version 759702 (0.00089) [2022-07-10 14:19:55,771][26022] Updated weights on worker 0-0, policy_version 759712 (0.00087) [2022-07-10 14:19:56,922][25689] Fps is (10 sec: 5790.4, 60 sec: 5583.6, 300 sec: 5541.5). Total num frames: 777953280. Throughput: 0: 5872.1. Samples: 777959316. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:19:56,922][25689] Avg episode reward: [(0, '-1.462')] [2022-07-10 14:19:57,392][26022] Updated weights on worker 0-0, policy_version 759722 (0.00090) [2022-07-10 14:19:59,548][26022] Updated weights on worker 0-0, policy_version 759732 (0.00867) [2022-07-10 14:20:01,092][26022] Updated weights on worker 0-0, policy_version 759742 (0.00089) [2022-07-10 14:20:01,956][25689] Fps is (10 sec: 5502.2, 60 sec: 5550.1, 300 sec: 5537.6). Total num frames: 777978880. Throughput: 0: 5044.0. Samples: 777975994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:01,958][25689] Avg episode reward: [(0, '-2.048')] [2022-07-10 14:20:03,310][26022] Updated weights on worker 0-0, policy_version 759752 (0.00084) [2022-07-10 14:20:05,118][26022] Updated weights on worker 0-0, policy_version 759762 (0.00088) [2022-07-10 14:20:06,791][26022] Updated weights on worker 0-0, policy_version 759772 (0.00083) [2022-07-10 14:20:06,976][25689] Fps is (10 sec: 5296.1, 60 sec: 5567.3, 300 sec: 5535.5). Total num frames: 778006528. Throughput: 0: 5767.7. Samples: 778007530. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:06,976][25689] Avg episode reward: [(0, '-2.404')] [2022-07-10 14:20:08,830][26022] Updated weights on worker 0-0, policy_version 759782 (0.00089) [2022-07-10 14:20:10,655][26022] Updated weights on worker 0-0, policy_version 759792 (0.00090) [2022-07-10 14:20:12,110][25689] Fps is (10 sec: 5546.7, 60 sec: 5575.7, 300 sec: 5536.7). Total num frames: 778035200. Throughput: 0: 5719.8. Samples: 778041130. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:12,111][25689] Avg episode reward: [(0, '-3.108')] [2022-07-10 14:20:12,240][26022] Updated weights on worker 0-0, policy_version 759802 (0.00092) [2022-07-10 14:20:14,512][26022] Updated weights on worker 0-0, policy_version 759812 (0.00089) [2022-07-10 14:20:15,858][26022] Updated weights on worker 0-0, policy_version 759822 (0.00084) [2022-07-10 14:20:17,133][25689] Fps is (10 sec: 5545.0, 60 sec: 5592.2, 300 sec: 5541.2). Total num frames: 778062848. Throughput: 0: 4882.0. Samples: 778057986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:17,133][25689] Avg episode reward: [(0, '-3.604')] [2022-07-10 14:20:18,114][26022] Updated weights on worker 0-0, policy_version 759832 (0.00083) [2022-07-10 14:20:19,882][26022] Updated weights on worker 0-0, policy_version 759842 (0.00087) [2022-07-10 14:20:21,703][26022] Updated weights on worker 0-0, policy_version 759852 (0.00095) [2022-07-10 14:20:22,143][25689] Fps is (10 sec: 5511.7, 60 sec: 5559.9, 300 sec: 5535.0). Total num frames: 778090496. Throughput: 0: 5701.7. Samples: 778091090. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:22,143][25689] Avg episode reward: [(0, '-3.481')] [2022-07-10 14:20:23,510][26022] Updated weights on worker 0-0, policy_version 759862 (0.00089) [2022-07-10 14:20:25,503][26022] Updated weights on worker 0-0, policy_version 759872 (0.00088) [2022-07-10 14:20:27,230][25689] Fps is (10 sec: 5476.3, 60 sec: 5554.3, 300 sec: 5538.1). Total num frames: 778118144. Throughput: 0: 5782.6. Samples: 778124652. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:27,232][25689] Avg episode reward: [(0, '-2.962')] [2022-07-10 14:20:27,239][26022] Updated weights on worker 0-0, policy_version 759882 (0.00087) [2022-07-10 14:20:29,352][26022] Updated weights on worker 0-0, policy_version 759892 (0.00094) [2022-07-10 14:20:30,939][26022] Updated weights on worker 0-0, policy_version 759902 (0.00095) [2022-07-10 14:20:32,352][25689] Fps is (10 sec: 5516.7, 60 sec: 5564.8, 300 sec: 5532.5). Total num frames: 778146816. Throughput: 0: 4946.7. Samples: 778141258. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:32,359][25689] Avg episode reward: [(0, '-2.262')] [2022-07-10 14:20:32,812][26022] Updated weights on worker 0-0, policy_version 759912 (0.00089) [2022-07-10 14:20:34,615][26022] Updated weights on worker 0-0, policy_version 759922 (0.00085) [2022-07-10 14:20:36,519][26022] Updated weights on worker 0-0, policy_version 759932 (0.00096) [2022-07-10 14:20:37,370][25689] Fps is (10 sec: 5655.1, 60 sec: 5554.8, 300 sec: 5537.1). Total num frames: 778175488. Throughput: 0: 5768.0. Samples: 778174716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:37,371][25689] Avg episode reward: [(0, '-1.575')] [2022-07-10 14:20:38,040][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:20:38,048][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000759941_778179584.pth [2022-07-10 14:20:38,049][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000757991_776182784.pth [2022-07-10 14:20:38,249][26022] Updated weights on worker 0-0, policy_version 759942 (0.00089) [2022-07-10 14:20:40,137][26022] Updated weights on worker 0-0, policy_version 759952 (0.00090) [2022-07-10 14:20:41,878][26022] Updated weights on worker 0-0, policy_version 759962 (0.00091) [2022-07-10 14:20:42,392][25689] Fps is (10 sec: 5711.5, 60 sec: 5578.2, 300 sec: 5541.9). Total num frames: 778204160. Throughput: 0: 5791.6. Samples: 778208364. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:42,392][25689] Avg episode reward: [(0, '-2.223')] [2022-07-10 14:20:44,003][26022] Updated weights on worker 0-0, policy_version 759972 (0.00085) [2022-07-10 14:20:45,546][26022] Updated weights on worker 0-0, policy_version 759982 (0.00093) [2022-07-10 14:20:47,449][25689] Fps is (10 sec: 5486.4, 60 sec: 5541.7, 300 sec: 5538.3). Total num frames: 778230784. Throughput: 0: 4969.6. Samples: 778225132. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:47,451][25689] Avg episode reward: [(0, '-1.741')] [2022-07-10 14:20:47,501][26022] Updated weights on worker 0-0, policy_version 759992 (0.00099) [2022-07-10 14:20:49,200][26022] Updated weights on worker 0-0, policy_version 760002 (0.00098) [2022-07-10 14:20:51,023][26022] Updated weights on worker 0-0, policy_version 760012 (0.00086) [2022-07-10 14:20:52,580][25689] Fps is (10 sec: 5528.2, 60 sec: 5552.7, 300 sec: 5542.9). Total num frames: 778260480. Throughput: 0: 5807.1. Samples: 778258724. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:52,580][25689] Avg episode reward: [(0, '-1.680')] [2022-07-10 14:20:52,863][26022] Updated weights on worker 0-0, policy_version 760022 (0.01253) [2022-07-10 14:20:54,883][26022] Updated weights on worker 0-0, policy_version 760032 (0.00088) [2022-07-10 14:20:56,675][26022] Updated weights on worker 0-0, policy_version 760042 (0.00085) [2022-07-10 14:20:57,602][25689] Fps is (10 sec: 5648.1, 60 sec: 5518.2, 300 sec: 5539.1). Total num frames: 778288128. Throughput: 0: 5793.1. Samples: 778291920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:20:57,603][25689] Avg episode reward: [(0, '-1.938')] [2022-07-10 14:20:58,529][26022] Updated weights on worker 0-0, policy_version 760052 (0.00089) [2022-07-10 14:21:00,311][26022] Updated weights on worker 0-0, policy_version 760062 (0.00088) [2022-07-10 14:21:02,613][25689] Fps is (10 sec: 5204.8, 60 sec: 5503.5, 300 sec: 5532.4). Total num frames: 778312704. Throughput: 0: 5670.5. Samples: 778323030. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:02,614][25689] Avg episode reward: [(0, '-0.825')] [2022-07-10 14:21:02,767][26022] Updated weights on worker 0-0, policy_version 760072 (0.00090) [2022-07-10 14:21:04,336][26022] Updated weights on worker 0-0, policy_version 760082 (0.00091) [2022-07-10 14:21:06,524][26022] Updated weights on worker 0-0, policy_version 760092 (0.00088) [2022-07-10 14:21:07,680][25689] Fps is (10 sec: 5385.0, 60 sec: 5532.9, 300 sec: 5539.2). Total num frames: 778342400. Throughput: 0: 5668.9. Samples: 778339820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:07,681][25689] Avg episode reward: [(0, '-1.125')] [2022-07-10 14:21:07,967][26022] Updated weights on worker 0-0, policy_version 760102 (0.00092) [2022-07-10 14:21:09,895][26022] Updated weights on worker 0-0, policy_version 760112 (0.00081) [2022-07-10 14:21:11,829][26022] Updated weights on worker 0-0, policy_version 760122 (0.00087) [2022-07-10 14:21:12,801][25689] Fps is (10 sec: 5729.5, 60 sec: 5534.2, 300 sec: 5540.9). Total num frames: 778371072. Throughput: 0: 5689.2. Samples: 778373766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:12,801][25689] Avg episode reward: [(0, '-1.509')] [2022-07-10 14:21:13,548][26022] Updated weights on worker 0-0, policy_version 760132 (0.00069) [2022-07-10 14:21:15,487][26022] Updated weights on worker 0-0, policy_version 760142 (0.00091) [2022-07-10 14:21:17,007][26022] Updated weights on worker 0-0, policy_version 760152 (0.00085) [2022-07-10 14:21:17,823][25689] Fps is (10 sec: 5552.7, 60 sec: 5534.2, 300 sec: 5537.8). Total num frames: 778398720. Throughput: 0: 5722.8. Samples: 778407642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:17,824][25689] Avg episode reward: [(0, '-1.586')] [2022-07-10 14:21:18,959][26022] Updated weights on worker 0-0, policy_version 760162 (0.00088) [2022-07-10 14:21:20,852][26022] Updated weights on worker 0-0, policy_version 760172 (0.00093) [2022-07-10 14:21:22,635][26022] Updated weights on worker 0-0, policy_version 760182 (0.00085) [2022-07-10 14:21:22,838][25689] Fps is (10 sec: 5610.9, 60 sec: 5550.6, 300 sec: 5537.7). Total num frames: 778427392. Throughput: 0: 5016.1. Samples: 778424480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:22,843][25689] Avg episode reward: [(0, '-0.789')] [2022-07-10 14:21:24,665][26022] Updated weights on worker 0-0, policy_version 760192 (0.00091) [2022-07-10 14:21:26,116][26022] Updated weights on worker 0-0, policy_version 760202 (0.00087) [2022-07-10 14:21:27,853][25689] Fps is (10 sec: 5513.3, 60 sec: 5540.4, 300 sec: 5539.0). Total num frames: 778454016. Throughput: 0: 5871.9. Samples: 778458270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:27,853][25689] Avg episode reward: [(0, '-0.632')] [2022-07-10 14:21:28,289][26022] Updated weights on worker 0-0, policy_version 760212 (0.00092) [2022-07-10 14:21:29,816][26022] Updated weights on worker 0-0, policy_version 760222 (0.00087) [2022-07-10 14:21:31,854][26022] Updated weights on worker 0-0, policy_version 760232 (0.00085) [2022-07-10 14:21:32,941][25689] Fps is (10 sec: 5574.7, 60 sec: 5560.3, 300 sec: 5544.6). Total num frames: 778483712. Throughput: 0: 5837.2. Samples: 778491330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:32,943][25689] Avg episode reward: [(0, '-0.643')] [2022-07-10 14:21:33,749][26022] Updated weights on worker 0-0, policy_version 760242 (0.00087) [2022-07-10 14:21:35,389][26022] Updated weights on worker 0-0, policy_version 760252 (0.00087) [2022-07-10 14:21:37,664][26022] Updated weights on worker 0-0, policy_version 760262 (0.00091) [2022-07-10 14:21:37,964][25689] Fps is (10 sec: 5570.2, 60 sec: 5526.1, 300 sec: 5537.8). Total num frames: 778510336. Throughput: 0: 4980.4. Samples: 778507950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:37,964][25689] Avg episode reward: [(0, '-0.856')] [2022-07-10 14:21:39,019][26022] Updated weights on worker 0-0, policy_version 760272 (0.00084) [2022-07-10 14:21:41,041][26022] Updated weights on worker 0-0, policy_version 760282 (0.00086) [2022-07-10 14:21:42,791][26022] Updated weights on worker 0-0, policy_version 760292 (0.00086) [2022-07-10 14:21:42,988][25689] Fps is (10 sec: 5503.6, 60 sec: 5525.9, 300 sec: 5544.6). Total num frames: 778539008. Throughput: 0: 5814.6. Samples: 778541646. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 14:21:42,989][25689] Avg episode reward: [(0, '-0.310')] [2022-07-10 14:21:44,707][26022] Updated weights on worker 0-0, policy_version 760302 (0.00090) [2022-07-10 14:21:46,676][26022] Updated weights on worker 0-0, policy_version 760312 (0.00094) [2022-07-10 14:21:47,992][25689] Fps is (10 sec: 5718.6, 60 sec: 5564.6, 300 sec: 5545.5). Total num frames: 778567680. Throughput: 0: 5815.8. Samples: 778575392. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:21:47,993][25689] Avg episode reward: [(0, '0.318')] [2022-07-10 14:21:48,261][26022] Updated weights on worker 0-0, policy_version 760322 (0.00090) [2022-07-10 14:21:50,179][26022] Updated weights on worker 0-0, policy_version 760332 (0.00084) [2022-07-10 14:21:51,940][26022] Updated weights on worker 0-0, policy_version 760342 (0.00088) [2022-07-10 14:21:53,035][25689] Fps is (10 sec: 5503.7, 60 sec: 5521.8, 300 sec: 5538.9). Total num frames: 778594304. Throughput: 0: 5011.8. Samples: 778592040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:21:53,036][25689] Avg episode reward: [(0, '0.125')] [2022-07-10 14:21:54,005][26022] Updated weights on worker 0-0, policy_version 760352 (0.00089) [2022-07-10 14:21:55,800][26022] Updated weights on worker 0-0, policy_version 760362 (0.00091) [2022-07-10 14:21:57,624][26022] Updated weights on worker 0-0, policy_version 760372 (0.00091) [2022-07-10 14:21:58,054][25689] Fps is (10 sec: 5495.5, 60 sec: 5539.1, 300 sec: 5545.6). Total num frames: 778622976. Throughput: 0: 5843.7. Samples: 778625348. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:21:58,055][25689] Avg episode reward: [(0, '-0.524')] [2022-07-10 14:21:59,436][26022] Updated weights on worker 0-0, policy_version 760382 (0.00082) [2022-07-10 14:22:01,765][26022] Updated weights on worker 0-0, policy_version 760392 (0.00103) [2022-07-10 14:22:03,067][25689] Fps is (10 sec: 5410.4, 60 sec: 5555.9, 300 sec: 5542.1). Total num frames: 778648576. Throughput: 0: 5717.6. Samples: 778656444. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:03,069][25689] Avg episode reward: [(0, '-0.899')] [2022-07-10 14:22:03,762][26022] Updated weights on worker 0-0, policy_version 760402 (0.00097) [2022-07-10 14:22:05,388][26022] Updated weights on worker 0-0, policy_version 760412 (0.00097) [2022-07-10 14:22:07,299][26022] Updated weights on worker 0-0, policy_version 760422 (0.00093) [2022-07-10 14:22:08,097][25689] Fps is (10 sec: 5302.0, 60 sec: 5525.4, 300 sec: 5540.6). Total num frames: 778676224. Throughput: 0: 4857.7. Samples: 778673056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:08,097][25689] Avg episode reward: [(0, '-0.931')] [2022-07-10 14:22:08,895][26022] Updated weights on worker 0-0, policy_version 760432 (0.00089) [2022-07-10 14:22:10,995][26022] Updated weights on worker 0-0, policy_version 760442 (0.00095) [2022-07-10 14:22:12,883][26022] Updated weights on worker 0-0, policy_version 760452 (0.00081) [2022-07-10 14:22:13,138][25689] Fps is (10 sec: 5592.3, 60 sec: 5532.7, 300 sec: 5543.6). Total num frames: 778704896. Throughput: 0: 5695.4. Samples: 778706530. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:13,138][25689] Avg episode reward: [(0, '-1.087')] [2022-07-10 14:22:14,538][26022] Updated weights on worker 0-0, policy_version 760462 (0.00100) [2022-07-10 14:22:16,650][26022] Updated weights on worker 0-0, policy_version 760472 (0.00099) [2022-07-10 14:22:18,099][26022] Updated weights on worker 0-0, policy_version 760482 (0.00090) [2022-07-10 14:22:18,197][25689] Fps is (10 sec: 5677.6, 60 sec: 5546.3, 300 sec: 5547.1). Total num frames: 778733568. Throughput: 0: 5703.5. Samples: 778740234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:18,197][25689] Avg episode reward: [(0, '-0.587')] [2022-07-10 14:22:20,272][26022] Updated weights on worker 0-0, policy_version 760492 (0.00085) [2022-07-10 14:22:21,970][26022] Updated weights on worker 0-0, policy_version 760502 (0.00090) [2022-07-10 14:22:23,201][25689] Fps is (10 sec: 5495.0, 60 sec: 5513.4, 300 sec: 5541.9). Total num frames: 778760192. Throughput: 0: 4987.8. Samples: 778756868. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:23,201][25689] Avg episode reward: [(0, '-1.154')] [2022-07-10 14:22:23,678][26022] Updated weights on worker 0-0, policy_version 760512 (0.00085) [2022-07-10 14:22:25,762][26022] Updated weights on worker 0-0, policy_version 760522 (0.00094) [2022-07-10 14:22:27,480][26022] Updated weights on worker 0-0, policy_version 760532 (0.00089) [2022-07-10 14:22:28,251][25689] Fps is (10 sec: 5398.0, 60 sec: 5527.1, 300 sec: 5540.0). Total num frames: 778787840. Throughput: 0: 5825.8. Samples: 778790470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:28,251][25689] Avg episode reward: [(0, '-1.447')] [2022-07-10 14:22:29,268][26022] Updated weights on worker 0-0, policy_version 760542 (0.00092) [2022-07-10 14:22:31,259][26022] Updated weights on worker 0-0, policy_version 760552 (0.00090) [2022-07-10 14:22:32,897][26022] Updated weights on worker 0-0, policy_version 760562 (0.00093) [2022-07-10 14:22:33,347][25689] Fps is (10 sec: 5651.8, 60 sec: 5526.4, 300 sec: 5542.1). Total num frames: 778817536. Throughput: 0: 5799.8. Samples: 778823738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:33,347][25689] Avg episode reward: [(0, '-3.344')] [2022-07-10 14:22:35,092][26022] Updated weights on worker 0-0, policy_version 760572 (0.00083) [2022-07-10 14:22:36,619][26022] Updated weights on worker 0-0, policy_version 760582 (0.00083) [2022-07-10 14:22:38,276][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:22:38,294][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000760590_778844160.pth [2022-07-10 14:22:38,294][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000758640_776847360.pth [2022-07-10 14:22:38,358][25689] Fps is (10 sec: 5572.0, 60 sec: 5527.4, 300 sec: 5538.7). Total num frames: 778844160. Throughput: 0: 4959.3. Samples: 778840222. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:38,359][25689] Avg episode reward: [(0, '-3.663')] [2022-07-10 14:22:38,782][26022] Updated weights on worker 0-0, policy_version 760592 (0.00089) [2022-07-10 14:22:40,385][26022] Updated weights on worker 0-0, policy_version 760602 (0.00088) [2022-07-10 14:22:42,271][26022] Updated weights on worker 0-0, policy_version 760612 (0.00097) [2022-07-10 14:22:43,368][25689] Fps is (10 sec: 5517.9, 60 sec: 5528.8, 300 sec: 5535.3). Total num frames: 778872832. Throughput: 0: 5811.4. Samples: 778874068. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:43,368][25689] Avg episode reward: [(0, '-4.516')] [2022-07-10 14:22:43,926][26022] Updated weights on worker 0-0, policy_version 760622 (0.00089) [2022-07-10 14:22:45,856][26022] Updated weights on worker 0-0, policy_version 760632 (0.00111) [2022-07-10 14:22:47,649][26022] Updated weights on worker 0-0, policy_version 760642 (0.00088) [2022-07-10 14:22:48,385][25689] Fps is (10 sec: 5616.9, 60 sec: 5510.5, 300 sec: 5537.0). Total num frames: 778900480. Throughput: 0: 5835.7. Samples: 778907968. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:48,386][25689] Avg episode reward: [(0, '-3.663')] [2022-07-10 14:22:49,617][26022] Updated weights on worker 0-0, policy_version 760652 (0.00092) [2022-07-10 14:22:51,315][26022] Updated weights on worker 0-0, policy_version 760662 (0.00091) [2022-07-10 14:22:53,210][26022] Updated weights on worker 0-0, policy_version 760672 (0.00091) [2022-07-10 14:22:53,505][25689] Fps is (10 sec: 5656.9, 60 sec: 5554.4, 300 sec: 5538.7). Total num frames: 778930176. Throughput: 0: 5009.5. Samples: 778924720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:53,505][25689] Avg episode reward: [(0, '-3.570')] [2022-07-10 14:22:54,972][26022] Updated weights on worker 0-0, policy_version 760682 (0.00062) [2022-07-10 14:22:56,795][26022] Updated weights on worker 0-0, policy_version 760692 (0.00085) [2022-07-10 14:22:58,534][25689] Fps is (10 sec: 5650.4, 60 sec: 5536.5, 300 sec: 5538.5). Total num frames: 778957824. Throughput: 0: 5870.9. Samples: 778958670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:22:58,535][25689] Avg episode reward: [(0, '-3.500')] [2022-07-10 14:22:58,686][26022] Updated weights on worker 0-0, policy_version 760702 (0.00081) [2022-07-10 14:23:00,507][26022] Updated weights on worker 0-0, policy_version 760712 (0.00084) [2022-07-10 14:23:02,542][26022] Updated weights on worker 0-0, policy_version 760722 (0.00088) [2022-07-10 14:23:03,551][25689] Fps is (10 sec: 5300.1, 60 sec: 5536.1, 300 sec: 5538.4). Total num frames: 778983424. Throughput: 0: 5763.2. Samples: 778990390. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:03,554][25689] Avg episode reward: [(0, '-1.401')] [2022-07-10 14:23:04,517][26022] Updated weights on worker 0-0, policy_version 760732 (0.00087) [2022-07-10 14:23:06,336][26022] Updated weights on worker 0-0, policy_version 760742 (0.00091) [2022-07-10 14:23:08,136][26022] Updated weights on worker 0-0, policy_version 760752 (0.00094) [2022-07-10 14:23:08,565][25689] Fps is (10 sec: 5410.5, 60 sec: 5554.5, 300 sec: 5543.0). Total num frames: 779012096. Throughput: 0: 4900.6. Samples: 779006860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:08,566][25689] Avg episode reward: [(0, '-1.253')] [2022-07-10 14:23:09,900][26022] Updated weights on worker 0-0, policy_version 760762 (0.00078) [2022-07-10 14:23:11,769][26022] Updated weights on worker 0-0, policy_version 760772 (0.00099) [2022-07-10 14:23:13,619][25689] Fps is (10 sec: 5695.8, 60 sec: 5553.3, 300 sec: 5542.5). Total num frames: 779040768. Throughput: 0: 5753.0. Samples: 779040438. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:13,621][25689] Avg episode reward: [(0, '-0.653')] [2022-07-10 14:23:13,623][26022] Updated weights on worker 0-0, policy_version 760782 (0.00094) [2022-07-10 14:23:15,432][26022] Updated weights on worker 0-0, policy_version 760792 (0.00091) [2022-07-10 14:23:17,283][26022] Updated weights on worker 0-0, policy_version 760802 (0.00083) [2022-07-10 14:23:18,637][25689] Fps is (10 sec: 5693.1, 60 sec: 5557.0, 300 sec: 5542.5). Total num frames: 779069440. Throughput: 0: 5747.4. Samples: 779074212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:18,638][25689] Avg episode reward: [(0, '-0.704')] [2022-07-10 14:23:18,987][26022] Updated weights on worker 0-0, policy_version 760812 (0.00083) [2022-07-10 14:23:20,939][26022] Updated weights on worker 0-0, policy_version 760822 (0.00083) [2022-07-10 14:23:22,911][26022] Updated weights on worker 0-0, policy_version 760832 (0.00308) [2022-07-10 14:23:23,657][25689] Fps is (10 sec: 5508.8, 60 sec: 5555.6, 300 sec: 5542.9). Total num frames: 779096064. Throughput: 0: 4999.6. Samples: 779090910. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:23,657][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 14:23:24,499][26022] Updated weights on worker 0-0, policy_version 760842 (0.00101) [2022-07-10 14:23:26,590][26022] Updated weights on worker 0-0, policy_version 760852 (0.00094) [2022-07-10 14:23:28,211][26022] Updated weights on worker 0-0, policy_version 760862 (0.00051) [2022-07-10 14:23:28,679][25689] Fps is (10 sec: 5506.6, 60 sec: 5575.1, 300 sec: 5544.1). Total num frames: 779124736. Throughput: 0: 5824.6. Samples: 779124018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:28,680][25689] Avg episode reward: [(0, '1.085')] [2022-07-10 14:23:30,192][26022] Updated weights on worker 0-0, policy_version 760872 (0.00090) [2022-07-10 14:23:31,994][26022] Updated weights on worker 0-0, policy_version 760882 (0.00087) [2022-07-10 14:23:33,735][25689] Fps is (10 sec: 5588.2, 60 sec: 5544.9, 300 sec: 5543.6). Total num frames: 779152384. Throughput: 0: 5801.8. Samples: 779157148. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:33,736][25689] Avg episode reward: [(0, '-0.355')] [2022-07-10 14:23:33,832][26022] Updated weights on worker 0-0, policy_version 760892 (0.00097) [2022-07-10 14:23:35,638][26022] Updated weights on worker 0-0, policy_version 760902 (0.00097) [2022-07-10 14:23:37,577][26022] Updated weights on worker 0-0, policy_version 760912 (0.00081) [2022-07-10 14:23:38,764][25689] Fps is (10 sec: 5381.3, 60 sec: 5543.3, 300 sec: 5537.9). Total num frames: 779179008. Throughput: 0: 4949.6. Samples: 779173832. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:38,765][25689] Avg episode reward: [(0, '-0.183')] [2022-07-10 14:23:39,374][26022] Updated weights on worker 0-0, policy_version 760922 (0.00628) [2022-07-10 14:23:41,375][26022] Updated weights on worker 0-0, policy_version 760932 (0.00084) [2022-07-10 14:23:42,937][26022] Updated weights on worker 0-0, policy_version 760942 (0.00083) [2022-07-10 14:23:43,803][25689] Fps is (10 sec: 5492.5, 60 sec: 5540.6, 300 sec: 5545.2). Total num frames: 779207680. Throughput: 0: 5784.1. Samples: 779207436. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:43,803][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 14:23:44,963][26022] Updated weights on worker 0-0, policy_version 760952 (0.00090) [2022-07-10 14:23:46,826][26022] Updated weights on worker 0-0, policy_version 760962 (0.00104) [2022-07-10 14:23:48,340][26022] Updated weights on worker 0-0, policy_version 760972 (0.00089) [2022-07-10 14:23:48,830][25689] Fps is (10 sec: 5798.6, 60 sec: 5573.6, 300 sec: 5546.4). Total num frames: 779237376. Throughput: 0: 5834.9. Samples: 779241598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:48,831][25689] Avg episode reward: [(0, '-0.950')] [2022-07-10 14:23:50,392][26022] Updated weights on worker 0-0, policy_version 760982 (0.00085) [2022-07-10 14:23:52,107][26022] Updated weights on worker 0-0, policy_version 760992 (0.00086) [2022-07-10 14:23:53,963][25689] Fps is (10 sec: 5543.4, 60 sec: 5521.6, 300 sec: 5540.5). Total num frames: 779264000. Throughput: 0: 4984.1. Samples: 779257964. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:53,963][25689] Avg episode reward: [(0, '-1.057')] [2022-07-10 14:23:54,146][26022] Updated weights on worker 0-0, policy_version 761002 (0.00090) [2022-07-10 14:23:55,981][26022] Updated weights on worker 0-0, policy_version 761012 (0.00089) [2022-07-10 14:23:57,850][26022] Updated weights on worker 0-0, policy_version 761022 (0.00086) [2022-07-10 14:23:58,965][25689] Fps is (10 sec: 5557.3, 60 sec: 5558.0, 300 sec: 5548.1). Total num frames: 779293696. Throughput: 0: 5827.6. Samples: 779291552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:23:58,965][25689] Avg episode reward: [(0, '-2.054')] [2022-07-10 14:23:59,375][26022] Updated weights on worker 0-0, policy_version 761032 (0.00092) [2022-07-10 14:24:01,667][26022] Updated weights on worker 0-0, policy_version 761042 (0.00159) [2022-07-10 14:24:03,446][26022] Updated weights on worker 0-0, policy_version 761052 (0.00090) [2022-07-10 14:24:03,999][25689] Fps is (10 sec: 5509.6, 60 sec: 5556.4, 300 sec: 5544.4). Total num frames: 779319296. Throughput: 0: 5728.7. Samples: 779323134. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:04,001][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 14:24:05,564][26022] Updated weights on worker 0-0, policy_version 761062 (0.00094) [2022-07-10 14:24:07,272][26022] Updated weights on worker 0-0, policy_version 761072 (0.00085) [2022-07-10 14:24:09,024][25689] Fps is (10 sec: 5293.1, 60 sec: 5538.4, 300 sec: 5544.7). Total num frames: 779346944. Throughput: 0: 5680.2. Samples: 779356306. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:09,025][25689] Avg episode reward: [(0, '-1.608')] [2022-07-10 14:24:09,145][26022] Updated weights on worker 0-0, policy_version 761082 (0.00088) [2022-07-10 14:24:10,984][26022] Updated weights on worker 0-0, policy_version 761092 (0.00087) [2022-07-10 14:24:12,823][26022] Updated weights on worker 0-0, policy_version 761102 (0.00103) [2022-07-10 14:24:14,126][25689] Fps is (10 sec: 5460.2, 60 sec: 5517.1, 300 sec: 5546.6). Total num frames: 779374592. Throughput: 0: 5711.5. Samples: 779373128. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:14,126][25689] Avg episode reward: [(0, '-2.194')] [2022-07-10 14:24:14,485][26022] Updated weights on worker 0-0, policy_version 761112 (0.00088) [2022-07-10 14:24:16,739][26022] Updated weights on worker 0-0, policy_version 761122 (0.00087) [2022-07-10 14:24:18,222][26022] Updated weights on worker 0-0, policy_version 761132 (0.00085) [2022-07-10 14:24:19,143][25689] Fps is (10 sec: 5566.1, 60 sec: 5517.2, 300 sec: 5543.3). Total num frames: 779403264. Throughput: 0: 5711.6. Samples: 779406802. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:19,145][25689] Avg episode reward: [(0, '-3.098')] [2022-07-10 14:24:20,234][26022] Updated weights on worker 0-0, policy_version 761142 (0.00086) [2022-07-10 14:24:21,843][26022] Updated weights on worker 0-0, policy_version 761152 (0.00087) [2022-07-10 14:24:23,871][26022] Updated weights on worker 0-0, policy_version 761162 (0.00085) [2022-07-10 14:24:24,207][25689] Fps is (10 sec: 5587.0, 60 sec: 5530.1, 300 sec: 5542.6). Total num frames: 779430912. Throughput: 0: 5793.0. Samples: 779440198. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:24,209][25689] Avg episode reward: [(0, '-2.448')] [2022-07-10 14:24:25,509][26022] Updated weights on worker 0-0, policy_version 761172 (0.00078) [2022-07-10 14:24:27,654][26022] Updated weights on worker 0-0, policy_version 761182 (0.00093) [2022-07-10 14:24:29,048][26022] Updated weights on worker 0-0, policy_version 761192 (0.00092) [2022-07-10 14:24:29,242][25689] Fps is (10 sec: 5678.3, 60 sec: 5545.9, 300 sec: 5549.8). Total num frames: 779460608. Throughput: 0: 4982.2. Samples: 779457032. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:29,243][25689] Avg episode reward: [(0, '-4.098')] [2022-07-10 14:24:31,402][26022] Updated weights on worker 0-0, policy_version 761202 (0.00093) [2022-07-10 14:24:32,866][26022] Updated weights on worker 0-0, policy_version 761212 (0.00101) [2022-07-10 14:24:34,362][25689] Fps is (10 sec: 5445.2, 60 sec: 5506.3, 300 sec: 5535.6). Total num frames: 779486208. Throughput: 0: 5798.2. Samples: 779490458. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:34,362][25689] Avg episode reward: [(0, '-4.739')] [2022-07-10 14:24:34,840][26022] Updated weights on worker 0-0, policy_version 761222 (0.00088) [2022-07-10 14:24:36,603][26022] Updated weights on worker 0-0, policy_version 761232 (0.00085) [2022-07-10 14:24:38,334][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:24:38,347][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000761241_779510784.pth [2022-07-10 14:24:38,348][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000759290_777512960.pth [2022-07-10 14:24:38,475][26022] Updated weights on worker 0-0, policy_version 761242 (0.00081) [2022-07-10 14:24:39,385][25689] Fps is (10 sec: 5653.1, 60 sec: 5591.2, 300 sec: 5550.6). Total num frames: 779517952. Throughput: 0: 5787.2. Samples: 779523950. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:39,387][25689] Avg episode reward: [(0, '-4.497')] [2022-07-10 14:24:40,391][26022] Updated weights on worker 0-0, policy_version 761252 (0.00083) [2022-07-10 14:24:42,263][26022] Updated weights on worker 0-0, policy_version 761262 (0.00098) [2022-07-10 14:24:43,843][26022] Updated weights on worker 0-0, policy_version 761272 (0.00080) [2022-07-10 14:24:44,435][25689] Fps is (10 sec: 5794.5, 60 sec: 5556.4, 300 sec: 5543.4). Total num frames: 779544576. Throughput: 0: 4963.3. Samples: 779540596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:44,435][25689] Avg episode reward: [(0, '-3.474')] [2022-07-10 14:24:46,099][26022] Updated weights on worker 0-0, policy_version 761282 (0.00084) [2022-07-10 14:24:47,343][26022] Updated weights on worker 0-0, policy_version 761292 (0.00086) [2022-07-10 14:24:49,444][25689] Fps is (10 sec: 5191.8, 60 sec: 5490.5, 300 sec: 5534.1). Total num frames: 779570176. Throughput: 0: 5797.1. Samples: 779574150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:49,445][25689] Avg episode reward: [(0, '-2.762')] [2022-07-10 14:24:49,671][26022] Updated weights on worker 0-0, policy_version 761302 (0.00080) [2022-07-10 14:24:51,411][26022] Updated weights on worker 0-0, policy_version 761312 (0.00094) [2022-07-10 14:24:53,295][26022] Updated weights on worker 0-0, policy_version 761322 (0.00497) [2022-07-10 14:24:54,548][25689] Fps is (10 sec: 5670.3, 60 sec: 5577.6, 300 sec: 5539.3). Total num frames: 779601920. Throughput: 0: 5816.3. Samples: 779607866. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:54,550][25689] Avg episode reward: [(0, '-2.959')] [2022-07-10 14:24:54,928][26022] Updated weights on worker 0-0, policy_version 761332 (0.00088) [2022-07-10 14:24:57,000][26022] Updated weights on worker 0-0, policy_version 761342 (0.00090) [2022-07-10 14:24:58,479][26022] Updated weights on worker 0-0, policy_version 761352 (0.00083) [2022-07-10 14:24:59,626][25689] Fps is (10 sec: 5933.9, 60 sec: 5553.7, 300 sec: 5548.8). Total num frames: 779630592. Throughput: 0: 4983.6. Samples: 779624820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:24:59,628][25689] Avg episode reward: [(0, '-1.255')] [2022-07-10 14:25:00,743][26022] Updated weights on worker 0-0, policy_version 761362 (0.00091) [2022-07-10 14:25:02,504][26022] Updated weights on worker 0-0, policy_version 761372 (0.00087) [2022-07-10 14:25:04,636][25689] Fps is (10 sec: 5278.3, 60 sec: 5539.1, 300 sec: 5538.7). Total num frames: 779655168. Throughput: 0: 5724.1. Samples: 779656228. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:25:04,638][25689] Avg episode reward: [(0, '-0.694')] [2022-07-10 14:25:04,646][26022] Updated weights on worker 0-0, policy_version 761382 (0.00082) [2022-07-10 14:25:06,303][26022] Updated weights on worker 0-0, policy_version 761392 (0.00090) [2022-07-10 14:25:08,322][26022] Updated weights on worker 0-0, policy_version 761402 (0.00094) [2022-07-10 14:25:09,651][25689] Fps is (10 sec: 5107.2, 60 sec: 5523.1, 300 sec: 5534.0). Total num frames: 779681792. Throughput: 0: 5718.0. Samples: 779689690. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:25:09,651][25689] Avg episode reward: [(0, '-0.613')] [2022-07-10 14:25:10,019][26022] Updated weights on worker 0-0, policy_version 761412 (0.00091) [2022-07-10 14:25:11,970][26022] Updated weights on worker 0-0, policy_version 761422 (0.00089) [2022-07-10 14:25:13,515][26022] Updated weights on worker 0-0, policy_version 761432 (0.00085) [2022-07-10 14:25:14,711][25689] Fps is (10 sec: 5589.9, 60 sec: 5560.7, 300 sec: 5540.2). Total num frames: 779711488. Throughput: 0: 4892.0. Samples: 779706506. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-10 14:25:14,712][25689] Avg episode reward: [(0, '-1.186')] [2022-07-10 14:25:15,799][26022] Updated weights on worker 0-0, policy_version 761442 (0.00093) [2022-07-10 14:25:17,231][26022] Updated weights on worker 0-0, policy_version 761452 (0.00099) [2022-07-10 14:25:19,364][26022] Updated weights on worker 0-0, policy_version 761462 (0.00096) [2022-07-10 14:25:19,738][25689] Fps is (10 sec: 5786.5, 60 sec: 5559.8, 300 sec: 5543.3). Total num frames: 779740160. Throughput: 0: 5732.3. Samples: 779740108. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:19,738][25689] Avg episode reward: [(0, '-0.837')] [2022-07-10 14:25:21,083][26022] Updated weights on worker 0-0, policy_version 761472 (0.00093) [2022-07-10 14:25:22,868][26022] Updated weights on worker 0-0, policy_version 761482 (0.00090) [2022-07-10 14:25:24,745][25689] Fps is (10 sec: 5510.7, 60 sec: 5548.1, 300 sec: 5541.4). Total num frames: 779766784. Throughput: 0: 5848.5. Samples: 779773838. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:24,746][25689] Avg episode reward: [(0, '-1.445')] [2022-07-10 14:25:24,763][26022] Updated weights on worker 0-0, policy_version 761492 (0.00086) [2022-07-10 14:25:26,506][26022] Updated weights on worker 0-0, policy_version 761502 (0.00087) [2022-07-10 14:25:28,354][26022] Updated weights on worker 0-0, policy_version 761512 (0.00085) [2022-07-10 14:25:29,753][25689] Fps is (10 sec: 5521.3, 60 sec: 5533.7, 300 sec: 5543.5). Total num frames: 779795456. Throughput: 0: 5023.4. Samples: 779790670. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:29,753][25689] Avg episode reward: [(0, '-2.245')] [2022-07-10 14:25:30,354][26022] Updated weights on worker 0-0, policy_version 761522 (0.00094) [2022-07-10 14:25:31,952][26022] Updated weights on worker 0-0, policy_version 761532 (0.00090) [2022-07-10 14:25:33,927][26022] Updated weights on worker 0-0, policy_version 761542 (0.00091) [2022-07-10 14:25:34,809][25689] Fps is (10 sec: 5799.6, 60 sec: 5607.2, 300 sec: 5546.2). Total num frames: 779825152. Throughput: 0: 5864.2. Samples: 779824368. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:34,810][25689] Avg episode reward: [(0, '-2.466')] [2022-07-10 14:25:35,536][26022] Updated weights on worker 0-0, policy_version 761552 (0.00093) [2022-07-10 14:25:37,557][26022] Updated weights on worker 0-0, policy_version 761562 (0.00088) [2022-07-10 14:25:39,282][26022] Updated weights on worker 0-0, policy_version 761572 (0.00083) [2022-07-10 14:25:39,911][25689] Fps is (10 sec: 5544.6, 60 sec: 5515.5, 300 sec: 5537.9). Total num frames: 779851776. Throughput: 0: 5830.3. Samples: 779857720. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:39,912][25689] Avg episode reward: [(0, '-3.203')] [2022-07-10 14:25:41,225][26022] Updated weights on worker 0-0, policy_version 761582 (0.00089) [2022-07-10 14:25:42,882][26022] Updated weights on worker 0-0, policy_version 761592 (0.00082) [2022-07-10 14:25:44,847][26022] Updated weights on worker 0-0, policy_version 761602 (0.00103) [2022-07-10 14:25:44,933][25689] Fps is (10 sec: 5461.9, 60 sec: 5551.7, 300 sec: 5545.4). Total num frames: 779880448. Throughput: 0: 4992.9. Samples: 779874634. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:44,935][25689] Avg episode reward: [(0, '-3.170')] [2022-07-10 14:25:46,442][26022] Updated weights on worker 0-0, policy_version 761612 (0.00085) [2022-07-10 14:25:48,618][26022] Updated weights on worker 0-0, policy_version 761622 (0.00094) [2022-07-10 14:25:49,946][25689] Fps is (10 sec: 5713.9, 60 sec: 5602.2, 300 sec: 5544.1). Total num frames: 779909120. Throughput: 0: 5833.0. Samples: 779908460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:49,948][25689] Avg episode reward: [(0, '-3.272')] [2022-07-10 14:25:50,172][26022] Updated weights on worker 0-0, policy_version 761632 (0.00087) [2022-07-10 14:25:52,079][26022] Updated weights on worker 0-0, policy_version 761642 (0.00081) [2022-07-10 14:25:53,939][26022] Updated weights on worker 0-0, policy_version 761652 (0.00086) [2022-07-10 14:25:55,012][25689] Fps is (10 sec: 5486.4, 60 sec: 5521.0, 300 sec: 5539.9). Total num frames: 779935744. Throughput: 0: 5832.6. Samples: 779942202. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:25:55,012][25689] Avg episode reward: [(0, '-1.213')] [2022-07-10 14:25:55,693][26022] Updated weights on worker 0-0, policy_version 761662 (0.00085) [2022-07-10 14:25:57,700][26022] Updated weights on worker 0-0, policy_version 761672 (0.00086) [2022-07-10 14:25:59,351][26022] Updated weights on worker 0-0, policy_version 761682 (0.00083) [2022-07-10 14:26:00,107][25689] Fps is (10 sec: 5643.8, 60 sec: 5553.3, 300 sec: 5559.0). Total num frames: 779966464. Throughput: 0: 5014.8. Samples: 779959000. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:00,107][25689] Avg episode reward: [(0, '-1.887')] [2022-07-10 14:26:01,428][26022] Updated weights on worker 0-0, policy_version 761692 (0.00087) [2022-07-10 14:26:03,415][26022] Updated weights on worker 0-0, policy_version 761702 (0.00094) [2022-07-10 14:26:05,204][25689] Fps is (10 sec: 5525.8, 60 sec: 5562.2, 300 sec: 5544.6). Total num frames: 779992064. Throughput: 0: 5718.2. Samples: 779990548. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:05,205][25689] Avg episode reward: [(0, '-1.609')] [2022-07-10 14:26:05,335][26022] Updated weights on worker 0-0, policy_version 761712 (0.00086) [2022-07-10 14:26:07,158][26022] Updated weights on worker 0-0, policy_version 761722 (0.00095) [2022-07-10 14:26:08,913][26022] Updated weights on worker 0-0, policy_version 761732 (0.00091) [2022-07-10 14:26:10,210][25689] Fps is (10 sec: 5372.0, 60 sec: 5596.9, 300 sec: 5546.8). Total num frames: 780020736. Throughput: 0: 5685.9. Samples: 780023676. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:10,210][25689] Avg episode reward: [(0, '-1.099')] [2022-07-10 14:26:10,930][26022] Updated weights on worker 0-0, policy_version 761742 (0.00090) [2022-07-10 14:26:12,806][26022] Updated weights on worker 0-0, policy_version 761752 (0.00091) [2022-07-10 14:26:14,510][26022] Updated weights on worker 0-0, policy_version 761762 (0.00101) [2022-07-10 14:26:15,293][25689] Fps is (10 sec: 5683.9, 60 sec: 5577.9, 300 sec: 5549.1). Total num frames: 780049408. Throughput: 0: 5681.1. Samples: 780057420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:15,294][25689] Avg episode reward: [(0, '-1.000')] [2022-07-10 14:26:16,485][26022] Updated weights on worker 0-0, policy_version 761772 (0.00085) [2022-07-10 14:26:18,260][26022] Updated weights on worker 0-0, policy_version 761782 (0.00090) [2022-07-10 14:26:20,180][26022] Updated weights on worker 0-0, policy_version 761792 (0.00090) [2022-07-10 14:26:20,349][25689] Fps is (10 sec: 5554.8, 60 sec: 5558.3, 300 sec: 5544.9). Total num frames: 780077056. Throughput: 0: 5692.8. Samples: 780074232. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:20,349][25689] Avg episode reward: [(0, '-0.803')] [2022-07-10 14:26:21,866][26022] Updated weights on worker 0-0, policy_version 761802 (0.00090) [2022-07-10 14:26:23,788][26022] Updated weights on worker 0-0, policy_version 761812 (0.00094) [2022-07-10 14:26:25,363][25689] Fps is (10 sec: 5491.3, 60 sec: 5574.6, 300 sec: 5548.3). Total num frames: 780104704. Throughput: 0: 5801.1. Samples: 780107490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:25,363][25689] Avg episode reward: [(0, '-2.234')] [2022-07-10 14:26:25,766][26022] Updated weights on worker 0-0, policy_version 761822 (0.00084) [2022-07-10 14:26:27,251][26022] Updated weights on worker 0-0, policy_version 761832 (0.00084) [2022-07-10 14:26:29,422][26022] Updated weights on worker 0-0, policy_version 761842 (0.00091) [2022-07-10 14:26:30,391][25689] Fps is (10 sec: 5608.5, 60 sec: 5572.8, 300 sec: 5546.0). Total num frames: 780133376. Throughput: 0: 5812.9. Samples: 780140986. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:30,391][25689] Avg episode reward: [(0, '-2.481')] [2022-07-10 14:26:31,126][26022] Updated weights on worker 0-0, policy_version 761852 (0.00092) [2022-07-10 14:26:33,040][26022] Updated weights on worker 0-0, policy_version 761862 (0.00091) [2022-07-10 14:26:35,001][26022] Updated weights on worker 0-0, policy_version 761872 (0.00085) [2022-07-10 14:26:35,493][25689] Fps is (10 sec: 5458.5, 60 sec: 5517.9, 300 sec: 5544.5). Total num frames: 780160000. Throughput: 0: 4958.8. Samples: 780157588. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:35,494][25689] Avg episode reward: [(0, '-2.357')] [2022-07-10 14:26:36,520][26022] Updated weights on worker 0-0, policy_version 761882 (0.00090) [2022-07-10 14:26:38,448][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:26:38,461][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000761891_780176384.pth [2022-07-10 14:26:38,461][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000759941_778179584.pth [2022-07-10 14:26:38,680][26022] Updated weights on worker 0-0, policy_version 761892 (0.00086) [2022-07-10 14:26:40,230][26022] Updated weights on worker 0-0, policy_version 761902 (0.00087) [2022-07-10 14:26:40,547][25689] Fps is (10 sec: 5545.5, 60 sec: 5572.9, 300 sec: 5547.4). Total num frames: 780189696. Throughput: 0: 5784.7. Samples: 780191070. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:40,547][25689] Avg episode reward: [(0, '-3.011')] [2022-07-10 14:26:42,313][26022] Updated weights on worker 0-0, policy_version 761912 (0.00101) [2022-07-10 14:26:43,835][26022] Updated weights on worker 0-0, policy_version 761922 (0.00087) [2022-07-10 14:26:45,551][25689] Fps is (10 sec: 5599.8, 60 sec: 5540.8, 300 sec: 5540.5). Total num frames: 780216320. Throughput: 0: 5807.6. Samples: 780224732. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:45,551][25689] Avg episode reward: [(0, '-3.865')] [2022-07-10 14:26:45,942][26022] Updated weights on worker 0-0, policy_version 761932 (0.00102) [2022-07-10 14:26:47,615][26022] Updated weights on worker 0-0, policy_version 761942 (0.00095) [2022-07-10 14:26:49,609][26022] Updated weights on worker 0-0, policy_version 761952 (0.00084) [2022-07-10 14:26:50,565][25689] Fps is (10 sec: 5519.5, 60 sec: 5540.7, 300 sec: 5548.0). Total num frames: 780244992. Throughput: 0: 4980.3. Samples: 780241460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:50,566][25689] Avg episode reward: [(0, '-3.000')] [2022-07-10 14:26:51,349][26022] Updated weights on worker 0-0, policy_version 761962 (0.00092) [2022-07-10 14:26:53,099][26022] Updated weights on worker 0-0, policy_version 761972 (0.00085) [2022-07-10 14:26:54,927][26022] Updated weights on worker 0-0, policy_version 761982 (0.00089) [2022-07-10 14:26:55,610][25689] Fps is (10 sec: 5497.1, 60 sec: 5542.6, 300 sec: 5540.6). Total num frames: 780271616. Throughput: 0: 5839.5. Samples: 780275060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:26:55,611][25689] Avg episode reward: [(0, '-2.274')] [2022-07-10 14:26:56,730][26022] Updated weights on worker 0-0, policy_version 761992 (0.00091) [2022-07-10 14:26:58,744][26022] Updated weights on worker 0-0, policy_version 762002 (0.00094) [2022-07-10 14:27:00,614][25689] Fps is (10 sec: 5400.7, 60 sec: 5500.2, 300 sec: 5547.6). Total num frames: 780299264. Throughput: 0: 5853.9. Samples: 780308544. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:00,615][25689] Avg episode reward: [(0, '-1.706')] [2022-07-10 14:27:00,625][26022] Updated weights on worker 0-0, policy_version 762012 (0.00089) [2022-07-10 14:27:02,623][26022] Updated weights on worker 0-0, policy_version 762022 (0.00101) [2022-07-10 14:27:04,567][26022] Updated weights on worker 0-0, policy_version 762032 (0.00334) [2022-07-10 14:27:05,637][25689] Fps is (10 sec: 5514.6, 60 sec: 5540.8, 300 sec: 5547.8). Total num frames: 780326912. Throughput: 0: 4895.5. Samples: 780323064. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:05,638][25689] Avg episode reward: [(0, '-2.014')] [2022-07-10 14:27:06,325][26022] Updated weights on worker 0-0, policy_version 762042 (0.00086) [2022-07-10 14:27:08,401][26022] Updated weights on worker 0-0, policy_version 762052 (0.01031) [2022-07-10 14:27:10,195][26022] Updated weights on worker 0-0, policy_version 762062 (0.00086) [2022-07-10 14:27:10,657][25689] Fps is (10 sec: 5505.8, 60 sec: 5522.5, 300 sec: 5544.7). Total num frames: 780354560. Throughput: 0: 5721.8. Samples: 780356424. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:10,658][25689] Avg episode reward: [(0, '-2.700')] [2022-07-10 14:27:11,812][26022] Updated weights on worker 0-0, policy_version 762072 (0.00084) [2022-07-10 14:27:13,783][26022] Updated weights on worker 0-0, policy_version 762082 (0.00085) [2022-07-10 14:27:15,564][26022] Updated weights on worker 0-0, policy_version 762092 (0.00092) [2022-07-10 14:27:15,765][25689] Fps is (10 sec: 5561.1, 60 sec: 5520.4, 300 sec: 5543.8). Total num frames: 780383232. Throughput: 0: 5707.6. Samples: 780390094. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:15,765][25689] Avg episode reward: [(0, '-1.963')] [2022-07-10 14:27:17,354][26022] Updated weights on worker 0-0, policy_version 762102 (0.00062) [2022-07-10 14:27:19,242][26022] Updated weights on worker 0-0, policy_version 762112 (0.00086) [2022-07-10 14:27:20,807][25689] Fps is (10 sec: 5650.0, 60 sec: 5538.5, 300 sec: 5550.0). Total num frames: 780411904. Throughput: 0: 4878.5. Samples: 780407054. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:20,807][25689] Avg episode reward: [(0, '-2.101')] [2022-07-10 14:27:21,035][26022] Updated weights on worker 0-0, policy_version 762122 (0.00081) [2022-07-10 14:27:22,901][26022] Updated weights on worker 0-0, policy_version 762132 (0.00077) [2022-07-10 14:27:24,849][26022] Updated weights on worker 0-0, policy_version 762142 (0.00096) [2022-07-10 14:27:25,824][25689] Fps is (10 sec: 5496.9, 60 sec: 5521.3, 300 sec: 5547.2). Total num frames: 780438528. Throughput: 0: 5825.9. Samples: 780440672. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:25,825][25689] Avg episode reward: [(0, '-3.239')] [2022-07-10 14:27:26,495][26022] Updated weights on worker 0-0, policy_version 762152 (0.00088) [2022-07-10 14:27:28,427][26022] Updated weights on worker 0-0, policy_version 762162 (0.00090) [2022-07-10 14:27:30,201][26022] Updated weights on worker 0-0, policy_version 762172 (0.00087) [2022-07-10 14:27:30,874][25689] Fps is (10 sec: 5492.6, 60 sec: 5519.3, 300 sec: 5544.6). Total num frames: 780467200. Throughput: 0: 5825.4. Samples: 780474194. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:30,875][25689] Avg episode reward: [(0, '-1.303')] [2022-07-10 14:27:32,054][26022] Updated weights on worker 0-0, policy_version 762182 (0.00095) [2022-07-10 14:27:34,138][26022] Updated weights on worker 0-0, policy_version 762192 (0.00087) [2022-07-10 14:27:35,654][26022] Updated weights on worker 0-0, policy_version 762202 (0.00091) [2022-07-10 14:27:35,987][25689] Fps is (10 sec: 5642.6, 60 sec: 5552.2, 300 sec: 5549.6). Total num frames: 780495872. Throughput: 0: 4987.1. Samples: 780490944. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:35,987][25689] Avg episode reward: [(0, '-1.149')] [2022-07-10 14:27:37,487][26022] Updated weights on worker 0-0, policy_version 762212 (0.00085) [2022-07-10 14:27:39,454][26022] Updated weights on worker 0-0, policy_version 762222 (0.00097) [2022-07-10 14:27:41,006][25689] Fps is (10 sec: 5659.5, 60 sec: 5538.4, 300 sec: 5549.4). Total num frames: 780524544. Throughput: 0: 5817.6. Samples: 780524566. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:41,007][25689] Avg episode reward: [(0, '-1.823')] [2022-07-10 14:27:41,135][26022] Updated weights on worker 0-0, policy_version 762232 (0.00084) [2022-07-10 14:27:43,227][26022] Updated weights on worker 0-0, policy_version 762242 (0.00092) [2022-07-10 14:27:44,594][26022] Updated weights on worker 0-0, policy_version 762252 (0.00088) [2022-07-10 14:27:46,016][25689] Fps is (10 sec: 5513.3, 60 sec: 5537.8, 300 sec: 5546.1). Total num frames: 780551168. Throughput: 0: 5833.2. Samples: 780558458. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:46,017][25689] Avg episode reward: [(0, '-2.106')] [2022-07-10 14:27:46,661][26022] Updated weights on worker 0-0, policy_version 762262 (0.00085) [2022-07-10 14:27:48,190][26022] Updated weights on worker 0-0, policy_version 762272 (0.00087) [2022-07-10 14:27:50,354][26022] Updated weights on worker 0-0, policy_version 762282 (0.00087) [2022-07-10 14:27:51,027][25689] Fps is (10 sec: 5620.4, 60 sec: 5555.1, 300 sec: 5548.1). Total num frames: 780580864. Throughput: 0: 5027.2. Samples: 780575506. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:51,027][25689] Avg episode reward: [(0, '-2.144')] [2022-07-10 14:27:52,233][26022] Updated weights on worker 0-0, policy_version 762292 (0.00086) [2022-07-10 14:27:54,015][26022] Updated weights on worker 0-0, policy_version 762302 (0.00054) [2022-07-10 14:27:55,640][26022] Updated weights on worker 0-0, policy_version 762312 (0.00091) [2022-07-10 14:27:56,167][25689] Fps is (10 sec: 5649.5, 60 sec: 5563.3, 300 sec: 5546.1). Total num frames: 780608512. Throughput: 0: 5852.5. Samples: 780609046. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:27:56,167][25689] Avg episode reward: [(0, '-0.859')] [2022-07-10 14:27:57,739][26022] Updated weights on worker 0-0, policy_version 762322 (0.00084) [2022-07-10 14:27:59,517][26022] Updated weights on worker 0-0, policy_version 762332 (0.00088) [2022-07-10 14:28:01,197][25689] Fps is (10 sec: 5537.9, 60 sec: 5577.8, 300 sec: 5556.1). Total num frames: 780637184. Throughput: 0: 5857.7. Samples: 780642836. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:01,199][25689] Avg episode reward: [(0, '-1.702')] [2022-07-10 14:28:01,220][26022] Updated weights on worker 0-0, policy_version 762342 (0.00083) [2022-07-10 14:28:03,309][26022] Updated weights on worker 0-0, policy_version 762352 (0.00093) [2022-07-10 14:28:05,235][26022] Updated weights on worker 0-0, policy_version 762362 (0.00088) [2022-07-10 14:28:06,229][25689] Fps is (10 sec: 5597.3, 60 sec: 5577.0, 300 sec: 5552.4). Total num frames: 780664832. Throughput: 0: 4904.4. Samples: 780657586. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:06,231][25689] Avg episode reward: [(0, '-0.788')] [2022-07-10 14:28:07,102][26022] Updated weights on worker 0-0, policy_version 762372 (0.00088) [2022-07-10 14:28:09,036][26022] Updated weights on worker 0-0, policy_version 762382 (0.00089) [2022-07-10 14:28:10,794][26022] Updated weights on worker 0-0, policy_version 762392 (0.00092) [2022-07-10 14:28:11,258][25689] Fps is (10 sec: 5394.3, 60 sec: 5559.3, 300 sec: 5545.9). Total num frames: 780691456. Throughput: 0: 5699.2. Samples: 780690806. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:11,259][25689] Avg episode reward: [(0, '-0.142')] [2022-07-10 14:28:12,546][26022] Updated weights on worker 0-0, policy_version 762402 (0.00083) [2022-07-10 14:28:14,503][26022] Updated weights on worker 0-0, policy_version 762412 (0.00094) [2022-07-10 14:28:16,259][26022] Updated weights on worker 0-0, policy_version 762422 (0.00083) [2022-07-10 14:28:16,350][25689] Fps is (10 sec: 5463.2, 60 sec: 5560.7, 300 sec: 5544.6). Total num frames: 780720128. Throughput: 0: 5721.0. Samples: 780724516. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:16,351][25689] Avg episode reward: [(0, '0.333')] [2022-07-10 14:28:18,191][26022] Updated weights on worker 0-0, policy_version 762432 (0.00089) [2022-07-10 14:28:19,948][26022] Updated weights on worker 0-0, policy_version 762442 (0.00067) [2022-07-10 14:28:21,413][25689] Fps is (10 sec: 5646.8, 60 sec: 5558.8, 300 sec: 5550.6). Total num frames: 780748800. Throughput: 0: 4874.7. Samples: 780741384. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:21,414][25689] Avg episode reward: [(0, '-0.542')] [2022-07-10 14:28:21,786][26022] Updated weights on worker 0-0, policy_version 762452 (0.00094) [2022-07-10 14:28:23,816][26022] Updated weights on worker 0-0, policy_version 762462 (0.00092) [2022-07-10 14:28:25,458][26022] Updated weights on worker 0-0, policy_version 762472 (0.00087) [2022-07-10 14:28:26,444][25689] Fps is (10 sec: 5579.6, 60 sec: 5574.4, 300 sec: 5547.0). Total num frames: 780776448. Throughput: 0: 5799.6. Samples: 780774828. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:26,446][25689] Avg episode reward: [(0, '-0.998')] [2022-07-10 14:28:27,460][26022] Updated weights on worker 0-0, policy_version 762482 (0.01023) [2022-07-10 14:28:29,380][26022] Updated weights on worker 0-0, policy_version 762492 (0.00085) [2022-07-10 14:28:31,094][26022] Updated weights on worker 0-0, policy_version 762502 (0.00085) [2022-07-10 14:28:31,455][25689] Fps is (10 sec: 5404.6, 60 sec: 5544.2, 300 sec: 5544.4). Total num frames: 780803072. Throughput: 0: 5791.5. Samples: 780807776. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:31,457][25689] Avg episode reward: [(0, '-1.562')] [2022-07-10 14:28:32,924][26022] Updated weights on worker 0-0, policy_version 762512 (0.00090) [2022-07-10 14:28:34,705][26022] Updated weights on worker 0-0, policy_version 762522 (0.00084) [2022-07-10 14:28:36,552][25689] Fps is (10 sec: 5470.6, 60 sec: 5545.7, 300 sec: 5550.0). Total num frames: 780831744. Throughput: 0: 5760.5. Samples: 780840888. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:36,552][25689] Avg episode reward: [(0, '-3.939')] [2022-07-10 14:28:36,584][26022] Updated weights on worker 0-0, policy_version 762532 (0.00097) [2022-07-10 14:28:38,512][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:28:38,533][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000762542_780843008.pth [2022-07-10 14:28:38,534][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000760590_778844160.pth [2022-07-10 14:28:38,540][26022] Updated weights on worker 0-0, policy_version 762542 (0.00091) [2022-07-10 14:28:40,377][26022] Updated weights on worker 0-0, policy_version 762552 (0.00084) [2022-07-10 14:28:41,558][25689] Fps is (10 sec: 5473.4, 60 sec: 5513.1, 300 sec: 5543.8). Total num frames: 780858368. Throughput: 0: 5769.1. Samples: 780857600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 14:28:41,558][25689] Avg episode reward: [(0, '-4.014')] [2022-07-10 14:28:42,159][26022] Updated weights on worker 0-0, policy_version 762562 (0.00090) [2022-07-10 14:28:44,076][26022] Updated weights on worker 0-0, policy_version 762572 (0.00094) [2022-07-10 14:28:45,754][26022] Updated weights on worker 0-0, policy_version 762582 (0.00080) [2022-07-10 14:28:46,586][25689] Fps is (10 sec: 5612.6, 60 sec: 5562.1, 300 sec: 5543.8). Total num frames: 780888064. Throughput: 0: 5763.2. Samples: 780890912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:28:46,587][25689] Avg episode reward: [(0, '-3.615')] [2022-07-10 14:28:47,795][26022] Updated weights on worker 0-0, policy_version 762592 (0.00083) [2022-07-10 14:28:49,556][26022] Updated weights on worker 0-0, policy_version 762602 (0.00088) [2022-07-10 14:28:51,226][26022] Updated weights on worker 0-0, policy_version 762612 (0.00084) [2022-07-10 14:28:51,669][25689] Fps is (10 sec: 5670.9, 60 sec: 5521.7, 300 sec: 5548.1). Total num frames: 780915712. Throughput: 0: 5781.8. Samples: 780924652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:28:51,670][25689] Avg episode reward: [(0, '-2.450')] [2022-07-10 14:28:53,192][26022] Updated weights on worker 0-0, policy_version 762622 (0.00091) [2022-07-10 14:28:54,993][26022] Updated weights on worker 0-0, policy_version 762632 (0.00088) [2022-07-10 14:28:56,699][26022] Updated weights on worker 0-0, policy_version 762642 (0.00091) [2022-07-10 14:28:56,793][25689] Fps is (10 sec: 5618.3, 60 sec: 5556.9, 300 sec: 5545.9). Total num frames: 780945408. Throughput: 0: 4977.9. Samples: 780941646. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:28:56,793][25689] Avg episode reward: [(0, '-2.286')] [2022-07-10 14:28:58,694][26022] Updated weights on worker 0-0, policy_version 762652 (0.00090) [2022-07-10 14:29:00,619][26022] Updated weights on worker 0-0, policy_version 762662 (0.00087) [2022-07-10 14:29:01,876][25689] Fps is (10 sec: 5618.4, 60 sec: 5535.2, 300 sec: 5551.8). Total num frames: 780973056. Throughput: 0: 5805.0. Samples: 780975548. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:01,879][25689] Avg episode reward: [(0, '-1.666')] [2022-07-10 14:29:02,799][26022] Updated weights on worker 0-0, policy_version 762672 (0.00088) [2022-07-10 14:29:04,460][26022] Updated weights on worker 0-0, policy_version 762682 (0.00081) [2022-07-10 14:29:06,243][26022] Updated weights on worker 0-0, policy_version 762692 (0.00082) [2022-07-10 14:29:06,966][25689] Fps is (10 sec: 5335.1, 60 sec: 5513.1, 300 sec: 5547.2). Total num frames: 780999680. Throughput: 0: 5691.6. Samples: 781006902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:06,966][25689] Avg episode reward: [(0, '-0.514')] [2022-07-10 14:29:08,207][26022] Updated weights on worker 0-0, policy_version 762702 (0.00089) [2022-07-10 14:29:10,100][26022] Updated weights on worker 0-0, policy_version 762712 (0.00093) [2022-07-10 14:29:11,923][26022] Updated weights on worker 0-0, policy_version 762722 (0.00094) [2022-07-10 14:29:12,007][25689] Fps is (10 sec: 5357.1, 60 sec: 5528.9, 300 sec: 5548.3). Total num frames: 781027328. Throughput: 0: 4871.9. Samples: 781023734. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:12,008][25689] Avg episode reward: [(0, '-2.020')] [2022-07-10 14:29:13,669][26022] Updated weights on worker 0-0, policy_version 762732 (0.00091) [2022-07-10 14:29:15,509][26022] Updated weights on worker 0-0, policy_version 762742 (0.00089) [2022-07-10 14:29:17,117][25689] Fps is (10 sec: 5446.9, 60 sec: 5510.3, 300 sec: 5543.1). Total num frames: 781054976. Throughput: 0: 5678.3. Samples: 781057054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:17,118][25689] Avg episode reward: [(0, '-2.999')] [2022-07-10 14:29:17,254][26022] Updated weights on worker 0-0, policy_version 762752 (0.00083) [2022-07-10 14:29:19,040][26022] Updated weights on worker 0-0, policy_version 762762 (0.00093) [2022-07-10 14:29:21,180][26022] Updated weights on worker 0-0, policy_version 762772 (0.00087) [2022-07-10 14:29:22,176][25689] Fps is (10 sec: 5638.9, 60 sec: 5527.6, 300 sec: 5550.1). Total num frames: 781084672. Throughput: 0: 5662.0. Samples: 781090488. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:22,178][25689] Avg episode reward: [(0, '-3.135')] [2022-07-10 14:29:22,657][26022] Updated weights on worker 0-0, policy_version 762782 (0.00088) [2022-07-10 14:29:24,943][26022] Updated weights on worker 0-0, policy_version 762792 (0.00085) [2022-07-10 14:29:26,616][26022] Updated weights on worker 0-0, policy_version 762802 (0.00089) [2022-07-10 14:29:27,179][25689] Fps is (10 sec: 5699.2, 60 sec: 5530.1, 300 sec: 5543.8). Total num frames: 781112320. Throughput: 0: 4969.8. Samples: 781107360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:27,180][25689] Avg episode reward: [(0, '-1.534')] [2022-07-10 14:29:28,530][26022] Updated weights on worker 0-0, policy_version 762812 (0.00083) [2022-07-10 14:29:30,231][26022] Updated weights on worker 0-0, policy_version 762822 (0.00086) [2022-07-10 14:29:32,206][25689] Fps is (10 sec: 5411.2, 60 sec: 5528.7, 300 sec: 5549.0). Total num frames: 781138944. Throughput: 0: 5789.4. Samples: 781140674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:32,206][25689] Avg episode reward: [(0, '-2.377')] [2022-07-10 14:29:32,318][26022] Updated weights on worker 0-0, policy_version 762832 (0.00085) [2022-07-10 14:29:33,798][26022] Updated weights on worker 0-0, policy_version 762842 (0.00089) [2022-07-10 14:29:35,950][26022] Updated weights on worker 0-0, policy_version 762852 (0.00086) [2022-07-10 14:29:37,308][25689] Fps is (10 sec: 5661.2, 60 sec: 5561.9, 300 sec: 5544.1). Total num frames: 781169664. Throughput: 0: 5793.1. Samples: 781174022. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:37,309][25689] Avg episode reward: [(0, '-1.382')] [2022-07-10 14:29:37,579][26022] Updated weights on worker 0-0, policy_version 762862 (0.00088) [2022-07-10 14:29:39,516][26022] Updated weights on worker 0-0, policy_version 762872 (0.00090) [2022-07-10 14:29:41,316][26022] Updated weights on worker 0-0, policy_version 762882 (0.00090) [2022-07-10 14:29:42,323][25689] Fps is (10 sec: 5566.8, 60 sec: 5544.2, 300 sec: 5541.3). Total num frames: 781195264. Throughput: 0: 4983.4. Samples: 781190888. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:42,323][25689] Avg episode reward: [(0, '-0.633')] [2022-07-10 14:29:43,171][26022] Updated weights on worker 0-0, policy_version 762892 (0.00082) [2022-07-10 14:29:45,027][26022] Updated weights on worker 0-0, policy_version 762902 (0.00090) [2022-07-10 14:29:46,763][26022] Updated weights on worker 0-0, policy_version 762912 (0.00090) [2022-07-10 14:29:47,338][25689] Fps is (10 sec: 5513.0, 60 sec: 5545.4, 300 sec: 5554.9). Total num frames: 781224960. Throughput: 0: 5795.8. Samples: 781224200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:47,339][25689] Avg episode reward: [(0, '0.049')] [2022-07-10 14:29:48,581][26022] Updated weights on worker 0-0, policy_version 762922 (0.00087) [2022-07-10 14:29:50,448][26022] Updated weights on worker 0-0, policy_version 762932 (0.00093) [2022-07-10 14:29:52,263][26022] Updated weights on worker 0-0, policy_version 762942 (0.00090) [2022-07-10 14:29:52,356][25689] Fps is (10 sec: 5715.5, 60 sec: 5551.4, 300 sec: 5542.8). Total num frames: 781252608. Throughput: 0: 5796.9. Samples: 781257482. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:52,356][25689] Avg episode reward: [(0, '0.121')] [2022-07-10 14:29:54,391][26022] Updated weights on worker 0-0, policy_version 762952 (0.00086) [2022-07-10 14:29:55,835][26022] Updated weights on worker 0-0, policy_version 762962 (0.00086) [2022-07-10 14:29:57,411][25689] Fps is (10 sec: 5388.3, 60 sec: 5507.0, 300 sec: 5536.3). Total num frames: 781279232. Throughput: 0: 4988.5. Samples: 781274302. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:29:57,411][25689] Avg episode reward: [(0, '-0.376')] [2022-07-10 14:29:57,888][26022] Updated weights on worker 0-0, policy_version 762972 (0.00083) [2022-07-10 14:29:59,705][26022] Updated weights on worker 0-0, policy_version 762982 (0.00082) [2022-07-10 14:30:01,696][26022] Updated weights on worker 0-0, policy_version 762992 (0.00095) [2022-07-10 14:30:02,419][25689] Fps is (10 sec: 5291.7, 60 sec: 5497.0, 300 sec: 5543.3). Total num frames: 781305856. Throughput: 0: 5818.2. Samples: 781307808. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:02,421][25689] Avg episode reward: [(0, '0.487')] [2022-07-10 14:30:03,874][26022] Updated weights on worker 0-0, policy_version 763002 (0.00091) [2022-07-10 14:30:05,857][26022] Updated weights on worker 0-0, policy_version 763012 (0.00089) [2022-07-10 14:30:07,279][26022] Updated weights on worker 0-0, policy_version 763022 (0.00084) [2022-07-10 14:30:07,432][25689] Fps is (10 sec: 5620.0, 60 sec: 5554.7, 300 sec: 5553.6). Total num frames: 781335552. Throughput: 0: 5723.2. Samples: 781339200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:07,433][25689] Avg episode reward: [(0, '0.305')] [2022-07-10 14:30:09,414][26022] Updated weights on worker 0-0, policy_version 763032 (0.00092) [2022-07-10 14:30:10,833][26022] Updated weights on worker 0-0, policy_version 763042 (0.00095) [2022-07-10 14:30:12,461][25689] Fps is (10 sec: 5506.4, 60 sec: 5522.0, 300 sec: 5540.4). Total num frames: 781361152. Throughput: 0: 4903.2. Samples: 781356060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:12,463][25689] Avg episode reward: [(0, '0.587')] [2022-07-10 14:30:13,216][26022] Updated weights on worker 0-0, policy_version 763052 (0.00083) [2022-07-10 14:30:14,641][26022] Updated weights on worker 0-0, policy_version 763062 (0.00100) [2022-07-10 14:30:16,723][26022] Updated weights on worker 0-0, policy_version 763072 (0.00092) [2022-07-10 14:30:17,518][25689] Fps is (10 sec: 5482.5, 60 sec: 5560.7, 300 sec: 5543.3). Total num frames: 781390848. Throughput: 0: 5711.0. Samples: 781389136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:17,519][25689] Avg episode reward: [(0, '0.354')] [2022-07-10 14:30:18,625][26022] Updated weights on worker 0-0, policy_version 763082 (0.00088) [2022-07-10 14:30:20,203][26022] Updated weights on worker 0-0, policy_version 763092 (0.00086) [2022-07-10 14:30:22,425][26022] Updated weights on worker 0-0, policy_version 763102 (0.00084) [2022-07-10 14:30:22,532][25689] Fps is (10 sec: 5490.3, 60 sec: 5497.0, 300 sec: 5539.7). Total num frames: 781416448. Throughput: 0: 5694.2. Samples: 781422340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:22,534][25689] Avg episode reward: [(0, '-0.696')] [2022-07-10 14:30:23,829][26022] Updated weights on worker 0-0, policy_version 763112 (0.00093) [2022-07-10 14:30:26,020][26022] Updated weights on worker 0-0, policy_version 763122 (0.00089) [2022-07-10 14:30:27,552][25689] Fps is (10 sec: 5408.7, 60 sec: 5512.4, 300 sec: 5539.5). Total num frames: 781445120. Throughput: 0: 4962.9. Samples: 781439054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:27,553][25689] Avg episode reward: [(0, '-0.819')] [2022-07-10 14:30:27,904][26022] Updated weights on worker 0-0, policy_version 763132 (0.00086) [2022-07-10 14:30:29,438][26022] Updated weights on worker 0-0, policy_version 763142 (0.00087) [2022-07-10 14:30:31,639][26022] Updated weights on worker 0-0, policy_version 763152 (0.00096) [2022-07-10 14:30:32,573][25689] Fps is (10 sec: 5711.4, 60 sec: 5546.9, 300 sec: 5536.7). Total num frames: 781473792. Throughput: 0: 5787.5. Samples: 781472458. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:32,574][25689] Avg episode reward: [(0, '-2.148')] [2022-07-10 14:30:33,112][26022] Updated weights on worker 0-0, policy_version 763162 (0.00087) [2022-07-10 14:30:35,279][26022] Updated weights on worker 0-0, policy_version 763172 (0.01494) [2022-07-10 14:30:36,960][26022] Updated weights on worker 0-0, policy_version 763182 (0.00093) [2022-07-10 14:30:37,674][25689] Fps is (10 sec: 5463.1, 60 sec: 5479.1, 300 sec: 5536.7). Total num frames: 781500416. Throughput: 0: 5788.0. Samples: 781505798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:37,676][25689] Avg episode reward: [(0, '-2.782')] [2022-07-10 14:30:38,578][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:30:38,594][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000763190_781506560.pth [2022-07-10 14:30:38,594][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000761241_779510784.pth [2022-07-10 14:30:38,811][26022] Updated weights on worker 0-0, policy_version 763192 (0.00088) [2022-07-10 14:30:40,728][26022] Updated weights on worker 0-0, policy_version 763202 (0.00082) [2022-07-10 14:30:42,514][26022] Updated weights on worker 0-0, policy_version 763212 (0.00087) [2022-07-10 14:30:42,677][25689] Fps is (10 sec: 5472.5, 60 sec: 5531.1, 300 sec: 5537.1). Total num frames: 781529088. Throughput: 0: 5810.2. Samples: 781539384. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:42,679][25689] Avg episode reward: [(0, '-2.961')] [2022-07-10 14:30:44,107][26022] Updated weights on worker 0-0, policy_version 763222 (0.00083) [2022-07-10 14:30:46,277][26022] Updated weights on worker 0-0, policy_version 763232 (0.00084) [2022-07-10 14:30:47,743][25689] Fps is (10 sec: 5695.2, 60 sec: 5509.5, 300 sec: 5536.1). Total num frames: 781557760. Throughput: 0: 5788.2. Samples: 781555920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:47,743][25689] Avg episode reward: [(0, '-2.975')] [2022-07-10 14:30:48,148][26022] Updated weights on worker 0-0, policy_version 763242 (0.00096) [2022-07-10 14:30:50,071][26022] Updated weights on worker 0-0, policy_version 763252 (0.00092) [2022-07-10 14:30:51,574][26022] Updated weights on worker 0-0, policy_version 763262 (0.00088) [2022-07-10 14:30:52,795][25689] Fps is (10 sec: 5667.7, 60 sec: 5523.4, 300 sec: 5543.3). Total num frames: 781586432. Throughput: 0: 5789.1. Samples: 781589526. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:52,796][25689] Avg episode reward: [(0, '-1.560')] [2022-07-10 14:30:53,563][26022] Updated weights on worker 0-0, policy_version 763272 (0.00084) [2022-07-10 14:30:55,088][26022] Updated weights on worker 0-0, policy_version 763282 (0.00089) [2022-07-10 14:30:57,211][26022] Updated weights on worker 0-0, policy_version 763292 (0.00096) [2022-07-10 14:30:57,914][25689] Fps is (10 sec: 5537.5, 60 sec: 5534.4, 300 sec: 5532.5). Total num frames: 781614080. Throughput: 0: 5798.4. Samples: 781623154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:30:57,914][25689] Avg episode reward: [(0, '-1.854')] [2022-07-10 14:30:58,887][26022] Updated weights on worker 0-0, policy_version 763302 (0.00093) [2022-07-10 14:31:00,996][26022] Updated weights on worker 0-0, policy_version 763312 (0.00090) [2022-07-10 14:31:02,841][26022] Updated weights on worker 0-0, policy_version 763322 (0.00089) [2022-07-10 14:31:02,936][25689] Fps is (10 sec: 5452.4, 60 sec: 5550.0, 300 sec: 5540.8). Total num frames: 781641728. Throughput: 0: 4964.6. Samples: 781639962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:02,937][25689] Avg episode reward: [(0, '-1.130')] [2022-07-10 14:31:05,000][26022] Updated weights on worker 0-0, policy_version 763332 (0.00089) [2022-07-10 14:31:06,572][26022] Updated weights on worker 0-0, policy_version 763342 (0.00085) [2022-07-10 14:31:08,004][25689] Fps is (10 sec: 5378.7, 60 sec: 5494.3, 300 sec: 5532.7). Total num frames: 781668352. Throughput: 0: 5714.3. Samples: 781671696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:08,005][25689] Avg episode reward: [(0, '-0.967')] [2022-07-10 14:31:08,514][26022] Updated weights on worker 0-0, policy_version 763352 (0.00089) [2022-07-10 14:31:10,318][26022] Updated weights on worker 0-0, policy_version 763362 (0.00093) [2022-07-10 14:31:12,160][26022] Updated weights on worker 0-0, policy_version 763372 (0.00092) [2022-07-10 14:31:13,017][25689] Fps is (10 sec: 5383.4, 60 sec: 5529.5, 300 sec: 5530.6). Total num frames: 781696000. Throughput: 0: 5715.2. Samples: 781705104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:13,018][25689] Avg episode reward: [(0, '-1.196')] [2022-07-10 14:31:14,147][26022] Updated weights on worker 0-0, policy_version 763382 (0.00087) [2022-07-10 14:31:15,935][26022] Updated weights on worker 0-0, policy_version 763392 (0.00086) [2022-07-10 14:31:17,723][26022] Updated weights on worker 0-0, policy_version 763402 (0.00097) [2022-07-10 14:31:18,079][25689] Fps is (10 sec: 5590.1, 60 sec: 5512.2, 300 sec: 5533.9). Total num frames: 781724672. Throughput: 0: 4879.2. Samples: 781721544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:18,079][25689] Avg episode reward: [(0, '-1.036')] [2022-07-10 14:31:19,536][26022] Updated weights on worker 0-0, policy_version 763412 (0.00092) [2022-07-10 14:31:21,370][26022] Updated weights on worker 0-0, policy_version 763422 (0.00085) [2022-07-10 14:31:23,102][25689] Fps is (10 sec: 5584.9, 60 sec: 5545.2, 300 sec: 5533.8). Total num frames: 781752320. Throughput: 0: 5703.0. Samples: 781754966. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:23,102][25689] Avg episode reward: [(0, '-1.696')] [2022-07-10 14:31:23,384][26022] Updated weights on worker 0-0, policy_version 763432 (0.00093) [2022-07-10 14:31:25,211][26022] Updated weights on worker 0-0, policy_version 763442 (0.00083) [2022-07-10 14:31:26,872][26022] Updated weights on worker 0-0, policy_version 763452 (0.00047) [2022-07-10 14:31:28,104][25689] Fps is (10 sec: 5617.7, 60 sec: 5546.9, 300 sec: 5534.2). Total num frames: 781780992. Throughput: 0: 5808.8. Samples: 781788456. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:28,104][25689] Avg episode reward: [(0, '-3.144')] [2022-07-10 14:31:28,952][26022] Updated weights on worker 0-0, policy_version 763462 (0.00090) [2022-07-10 14:31:30,623][26022] Updated weights on worker 0-0, policy_version 763472 (0.00089) [2022-07-10 14:31:32,486][26022] Updated weights on worker 0-0, policy_version 763482 (0.00096) [2022-07-10 14:31:33,106][25689] Fps is (10 sec: 5629.5, 60 sec: 5531.6, 300 sec: 5539.5). Total num frames: 781808640. Throughput: 0: 4975.0. Samples: 781805046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:33,108][25689] Avg episode reward: [(0, '-2.903')] [2022-07-10 14:31:34,435][26022] Updated weights on worker 0-0, policy_version 763492 (0.00077) [2022-07-10 14:31:36,243][26022] Updated weights on worker 0-0, policy_version 763502 (0.00500) [2022-07-10 14:31:38,168][26022] Updated weights on worker 0-0, policy_version 763512 (0.00087) [2022-07-10 14:31:38,179][25689] Fps is (10 sec: 5387.0, 60 sec: 5534.3, 300 sec: 5528.9). Total num frames: 781835264. Throughput: 0: 5797.1. Samples: 781838066. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:38,179][25689] Avg episode reward: [(0, '-2.490')] [2022-07-10 14:31:39,884][26022] Updated weights on worker 0-0, policy_version 763522 (0.00086) [2022-07-10 14:31:41,871][26022] Updated weights on worker 0-0, policy_version 763532 (0.00097) [2022-07-10 14:31:43,188][25689] Fps is (10 sec: 5484.5, 60 sec: 5533.6, 300 sec: 5535.6). Total num frames: 781863936. Throughput: 0: 5800.1. Samples: 781871470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:43,190][25689] Avg episode reward: [(0, '-3.130')] [2022-07-10 14:31:43,636][26022] Updated weights on worker 0-0, policy_version 763542 (0.00094) [2022-07-10 14:31:45,547][26022] Updated weights on worker 0-0, policy_version 763552 (0.00106) [2022-07-10 14:31:47,282][26022] Updated weights on worker 0-0, policy_version 763562 (0.00089) [2022-07-10 14:31:48,232][25689] Fps is (10 sec: 5602.3, 60 sec: 5518.8, 300 sec: 5531.7). Total num frames: 781891584. Throughput: 0: 4945.5. Samples: 781887996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:48,232][25689] Avg episode reward: [(0, '-2.205')] [2022-07-10 14:31:49,348][26022] Updated weights on worker 0-0, policy_version 763572 (0.00084) [2022-07-10 14:31:51,071][26022] Updated weights on worker 0-0, policy_version 763582 (0.00086) [2022-07-10 14:31:52,946][26022] Updated weights on worker 0-0, policy_version 763592 (0.00087) [2022-07-10 14:31:53,259][25689] Fps is (10 sec: 5592.6, 60 sec: 5521.0, 300 sec: 5538.9). Total num frames: 781920256. Throughput: 0: 5788.4. Samples: 781921698. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:53,259][25689] Avg episode reward: [(0, '-2.973')] [2022-07-10 14:31:54,741][26022] Updated weights on worker 0-0, policy_version 763602 (0.00092) [2022-07-10 14:31:56,685][26022] Updated weights on worker 0-0, policy_version 763612 (0.00086) [2022-07-10 14:31:58,393][25689] Fps is (10 sec: 5542.3, 60 sec: 5519.6, 300 sec: 5536.5). Total num frames: 781947904. Throughput: 0: 5807.5. Samples: 781955462. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:31:58,394][25689] Avg episode reward: [(0, '-1.734')] [2022-07-10 14:31:58,447][26022] Updated weights on worker 0-0, policy_version 763622 (0.00091) [2022-07-10 14:32:00,283][26022] Updated weights on worker 0-0, policy_version 763632 (0.00092) [2022-07-10 14:32:01,858][26022] Updated weights on worker 0-0, policy_version 763642 (0.00084) [2022-07-10 14:32:03,477][25689] Fps is (10 sec: 5311.3, 60 sec: 5497.1, 300 sec: 5531.9). Total num frames: 781974528. Throughput: 0: 4963.3. Samples: 781972168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:32:03,478][25689] Avg episode reward: [(0, '-1.407')] [2022-07-10 14:32:04,383][26022] Updated weights on worker 0-0, policy_version 763652 (0.00081) [2022-07-10 14:32:05,948][26022] Updated weights on worker 0-0, policy_version 763662 (0.00085) [2022-07-10 14:32:07,815][26022] Updated weights on worker 0-0, policy_version 763672 (0.00090) [2022-07-10 14:32:08,508][25689] Fps is (10 sec: 5669.2, 60 sec: 5568.1, 300 sec: 5542.0). Total num frames: 782005248. Throughput: 0: 5724.9. Samples: 782004078. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 14:32:08,509][25689] Avg episode reward: [(0, '-1.525')] [2022-07-10 14:32:09,483][26022] Updated weights on worker 0-0, policy_version 763682 (0.00085) [2022-07-10 14:32:11,460][26022] Updated weights on worker 0-0, policy_version 763692 (0.00083) [2022-07-10 14:32:13,315][26022] Updated weights on worker 0-0, policy_version 763702 (0.00088) [2022-07-10 14:32:13,533][25689] Fps is (10 sec: 5702.2, 60 sec: 5550.1, 300 sec: 5536.6). Total num frames: 782031872. Throughput: 0: 5727.7. Samples: 782037826. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:13,535][25689] Avg episode reward: [(0, '-1.983')] [2022-07-10 14:32:15,205][26022] Updated weights on worker 0-0, policy_version 763712 (0.00088) [2022-07-10 14:32:16,942][26022] Updated weights on worker 0-0, policy_version 763722 (0.00089) [2022-07-10 14:32:18,617][25689] Fps is (10 sec: 5368.6, 60 sec: 5531.1, 300 sec: 5532.4). Total num frames: 782059520. Throughput: 0: 4889.6. Samples: 782054350. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:18,618][25689] Avg episode reward: [(0, '-2.472')] [2022-07-10 14:32:18,890][26022] Updated weights on worker 0-0, policy_version 763732 (0.00094) [2022-07-10 14:32:20,480][26022] Updated weights on worker 0-0, policy_version 763742 (0.00086) [2022-07-10 14:32:22,495][26022] Updated weights on worker 0-0, policy_version 763752 (0.00089) [2022-07-10 14:32:23,641][25689] Fps is (10 sec: 5673.4, 60 sec: 5564.9, 300 sec: 5542.6). Total num frames: 782089216. Throughput: 0: 5753.8. Samples: 782088188. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:23,642][25689] Avg episode reward: [(0, '-1.711')] [2022-07-10 14:32:24,154][26022] Updated weights on worker 0-0, policy_version 763762 (0.00086) [2022-07-10 14:32:26,146][26022] Updated weights on worker 0-0, policy_version 763772 (0.00091) [2022-07-10 14:32:27,987][26022] Updated weights on worker 0-0, policy_version 763782 (0.00086) [2022-07-10 14:32:28,728][25689] Fps is (10 sec: 5671.7, 60 sec: 5540.3, 300 sec: 5538.5). Total num frames: 782116864. Throughput: 0: 5830.8. Samples: 782121974. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:28,729][25689] Avg episode reward: [(0, '-1.667')] [2022-07-10 14:32:29,770][26022] Updated weights on worker 0-0, policy_version 763792 (0.00090) [2022-07-10 14:32:31,604][26022] Updated weights on worker 0-0, policy_version 763802 (0.00095) [2022-07-10 14:32:33,406][26022] Updated weights on worker 0-0, policy_version 763812 (0.00088) [2022-07-10 14:32:33,764][25689] Fps is (10 sec: 5462.2, 60 sec: 5537.1, 300 sec: 5536.5). Total num frames: 782144512. Throughput: 0: 4974.2. Samples: 782138458. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:33,765][25689] Avg episode reward: [(0, '-1.490')] [2022-07-10 14:32:35,250][26022] Updated weights on worker 0-0, policy_version 763822 (0.00098) [2022-07-10 14:32:37,017][26022] Updated weights on worker 0-0, policy_version 763832 (0.00091) [2022-07-10 14:32:38,688][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:32:38,719][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000763841_782173184.pth [2022-07-10 14:32:38,725][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000761891_780176384.pth [2022-07-10 14:32:38,853][25689] Fps is (10 sec: 5562.3, 60 sec: 5569.4, 300 sec: 5535.2). Total num frames: 782173184. Throughput: 0: 5815.9. Samples: 782172040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:38,854][25689] Avg episode reward: [(0, '-0.844')] [2022-07-10 14:32:38,867][26022] Updated weights on worker 0-0, policy_version 763842 (0.00095) [2022-07-10 14:32:40,803][26022] Updated weights on worker 0-0, policy_version 763852 (0.00086) [2022-07-10 14:32:42,732][26022] Updated weights on worker 0-0, policy_version 763862 (0.00087) [2022-07-10 14:32:43,887][25689] Fps is (10 sec: 5563.6, 60 sec: 5550.3, 300 sec: 5538.2). Total num frames: 782200832. Throughput: 0: 5794.3. Samples: 782205502. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:43,888][25689] Avg episode reward: [(0, '-0.876')] [2022-07-10 14:32:44,538][26022] Updated weights on worker 0-0, policy_version 763872 (0.00085) [2022-07-10 14:32:46,335][26022] Updated weights on worker 0-0, policy_version 763882 (0.00107) [2022-07-10 14:32:48,262][26022] Updated weights on worker 0-0, policy_version 763892 (0.00086) [2022-07-10 14:32:48,892][25689] Fps is (10 sec: 5712.2, 60 sec: 5587.6, 300 sec: 5538.3). Total num frames: 782230528. Throughput: 0: 4967.3. Samples: 782222136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:48,893][25689] Avg episode reward: [(0, '-0.480')] [2022-07-10 14:32:50,094][26022] Updated weights on worker 0-0, policy_version 763902 (0.00089) [2022-07-10 14:32:51,779][26022] Updated weights on worker 0-0, policy_version 763912 (0.00091) [2022-07-10 14:32:53,898][25689] Fps is (10 sec: 5421.6, 60 sec: 5522.0, 300 sec: 5530.5). Total num frames: 782255104. Throughput: 0: 5823.2. Samples: 782255698. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:53,898][25689] Avg episode reward: [(0, '-0.113')] [2022-07-10 14:32:53,964][26022] Updated weights on worker 0-0, policy_version 763922 (0.00094) [2022-07-10 14:32:55,433][26022] Updated weights on worker 0-0, policy_version 763932 (0.00087) [2022-07-10 14:32:57,473][26022] Updated weights on worker 0-0, policy_version 763942 (0.00504) [2022-07-10 14:32:58,982][25689] Fps is (10 sec: 5480.2, 60 sec: 5577.3, 300 sec: 5536.3). Total num frames: 782285824. Throughput: 0: 5807.0. Samples: 782288928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:32:58,983][25689] Avg episode reward: [(0, '0.076')] [2022-07-10 14:32:59,079][26022] Updated weights on worker 0-0, policy_version 763952 (0.00086) [2022-07-10 14:33:01,203][26022] Updated weights on worker 0-0, policy_version 763962 (0.00094) [2022-07-10 14:33:03,252][26022] Updated weights on worker 0-0, policy_version 763972 (0.00088) [2022-07-10 14:33:04,053][25689] Fps is (10 sec: 5546.1, 60 sec: 5561.6, 300 sec: 5528.7). Total num frames: 782311424. Throughput: 0: 5693.1. Samples: 782320304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:04,053][25689] Avg episode reward: [(0, '-0.895')] [2022-07-10 14:33:04,924][26022] Updated weights on worker 0-0, policy_version 763982 (0.00083) [2022-07-10 14:33:07,036][26022] Updated weights on worker 0-0, policy_version 763992 (0.00085) [2022-07-10 14:33:08,559][26022] Updated weights on worker 0-0, policy_version 764002 (0.00087) [2022-07-10 14:33:09,064][25689] Fps is (10 sec: 5180.3, 60 sec: 5495.8, 300 sec: 5529.1). Total num frames: 782338048. Throughput: 0: 5691.1. Samples: 782336934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:09,064][25689] Avg episode reward: [(0, '-0.673')] [2022-07-10 14:33:10,714][26022] Updated weights on worker 0-0, policy_version 764012 (0.00267) [2022-07-10 14:33:12,439][26022] Updated weights on worker 0-0, policy_version 764022 (0.00083) [2022-07-10 14:33:14,078][25689] Fps is (10 sec: 5617.6, 60 sec: 5547.5, 300 sec: 5534.0). Total num frames: 782367744. Throughput: 0: 5702.4. Samples: 782370774. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:14,079][25689] Avg episode reward: [(0, '-0.546')] [2022-07-10 14:33:14,220][26022] Updated weights on worker 0-0, policy_version 764032 (0.00084) [2022-07-10 14:33:16,279][26022] Updated weights on worker 0-0, policy_version 764042 (0.00089) [2022-07-10 14:33:18,008][26022] Updated weights on worker 0-0, policy_version 764052 (0.00090) [2022-07-10 14:33:19,187][25689] Fps is (10 sec: 5664.5, 60 sec: 5545.2, 300 sec: 5529.7). Total num frames: 782395392. Throughput: 0: 5700.3. Samples: 782404102. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:19,188][25689] Avg episode reward: [(0, '-1.689')] [2022-07-10 14:33:19,847][26022] Updated weights on worker 0-0, policy_version 764062 (0.00095) [2022-07-10 14:33:21,662][26022] Updated weights on worker 0-0, policy_version 764072 (0.00098) [2022-07-10 14:33:23,513][26022] Updated weights on worker 0-0, policy_version 764082 (0.00087) [2022-07-10 14:33:24,223][25689] Fps is (10 sec: 5450.8, 60 sec: 5510.3, 300 sec: 5529.6). Total num frames: 782423040. Throughput: 0: 4989.7. Samples: 782420946. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:24,223][25689] Avg episode reward: [(0, '-3.075')] [2022-07-10 14:33:25,380][26022] Updated weights on worker 0-0, policy_version 764092 (0.00091) [2022-07-10 14:33:27,290][26022] Updated weights on worker 0-0, policy_version 764102 (0.00084) [2022-07-10 14:33:28,993][26022] Updated weights on worker 0-0, policy_version 764112 (0.00084) [2022-07-10 14:33:29,228][25689] Fps is (10 sec: 5608.9, 60 sec: 5534.7, 300 sec: 5536.6). Total num frames: 782451712. Throughput: 0: 5829.6. Samples: 782454486. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:29,230][25689] Avg episode reward: [(0, '-3.128')] [2022-07-10 14:33:30,869][26022] Updated weights on worker 0-0, policy_version 764122 (0.00098) [2022-07-10 14:33:32,769][26022] Updated weights on worker 0-0, policy_version 764132 (0.00085) [2022-07-10 14:33:34,239][25689] Fps is (10 sec: 5622.9, 60 sec: 5537.0, 300 sec: 5534.7). Total num frames: 782479360. Throughput: 0: 5810.0. Samples: 782487908. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:34,239][25689] Avg episode reward: [(0, '-1.830')] [2022-07-10 14:33:34,489][26022] Updated weights on worker 0-0, policy_version 764142 (0.00089) [2022-07-10 14:33:36,513][26022] Updated weights on worker 0-0, policy_version 764152 (0.00090) [2022-07-10 14:33:38,260][26022] Updated weights on worker 0-0, policy_version 764162 (0.00085) [2022-07-10 14:33:39,368][25689] Fps is (10 sec: 5554.3, 60 sec: 5533.3, 300 sec: 5539.3). Total num frames: 782508032. Throughput: 0: 4981.9. Samples: 782504642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:39,369][25689] Avg episode reward: [(0, '-1.965')] [2022-07-10 14:33:40,123][26022] Updated weights on worker 0-0, policy_version 764172 (0.00084) [2022-07-10 14:33:41,850][26022] Updated weights on worker 0-0, policy_version 764182 (0.00093) [2022-07-10 14:33:43,687][26022] Updated weights on worker 0-0, policy_version 764192 (0.00081) [2022-07-10 14:33:44,405][25689] Fps is (10 sec: 5640.8, 60 sec: 5550.0, 300 sec: 5535.7). Total num frames: 782536704. Throughput: 0: 5819.9. Samples: 782538404. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:44,406][25689] Avg episode reward: [(0, '-0.560')] [2022-07-10 14:33:45,651][26022] Updated weights on worker 0-0, policy_version 764202 (0.00083) [2022-07-10 14:33:47,253][26022] Updated weights on worker 0-0, policy_version 764212 (0.00094) [2022-07-10 14:33:49,239][26022] Updated weights on worker 0-0, policy_version 764222 (0.00088) [2022-07-10 14:33:49,419][25689] Fps is (10 sec: 5501.4, 60 sec: 5498.4, 300 sec: 5533.6). Total num frames: 782563328. Throughput: 0: 5796.0. Samples: 782571514. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:49,420][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 14:33:51,069][26022] Updated weights on worker 0-0, policy_version 764232 (0.00095) [2022-07-10 14:33:52,928][26022] Updated weights on worker 0-0, policy_version 764242 (0.00089) [2022-07-10 14:33:54,442][25689] Fps is (10 sec: 5407.3, 60 sec: 5547.6, 300 sec: 5528.6). Total num frames: 782590976. Throughput: 0: 4942.9. Samples: 782587768. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:54,442][25689] Avg episode reward: [(0, '-0.795')] [2022-07-10 14:33:54,915][26022] Updated weights on worker 0-0, policy_version 764252 (0.00092) [2022-07-10 14:33:56,654][26022] Updated weights on worker 0-0, policy_version 764262 (0.00087) [2022-07-10 14:33:58,508][26022] Updated weights on worker 0-0, policy_version 764272 (0.00089) [2022-07-10 14:33:59,519][25689] Fps is (10 sec: 5576.6, 60 sec: 5514.4, 300 sec: 5532.1). Total num frames: 782619648. Throughput: 0: 5796.8. Samples: 782621450. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:33:59,520][25689] Avg episode reward: [(0, '-0.656')] [2022-07-10 14:34:00,273][26022] Updated weights on worker 0-0, policy_version 764282 (0.00079) [2022-07-10 14:34:02,415][26022] Updated weights on worker 0-0, policy_version 764292 (0.00087) [2022-07-10 14:34:04,248][26022] Updated weights on worker 0-0, policy_version 764302 (0.00087) [2022-07-10 14:34:04,562][25689] Fps is (10 sec: 5362.6, 60 sec: 5516.9, 300 sec: 5529.5). Total num frames: 782645248. Throughput: 0: 5668.6. Samples: 782652668. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:04,563][25689] Avg episode reward: [(0, '-0.767')] [2022-07-10 14:34:06,242][26022] Updated weights on worker 0-0, policy_version 764312 (0.00095) [2022-07-10 14:34:08,128][26022] Updated weights on worker 0-0, policy_version 764322 (0.00092) [2022-07-10 14:34:09,573][25689] Fps is (10 sec: 5397.8, 60 sec: 5550.7, 300 sec: 5533.5). Total num frames: 782673920. Throughput: 0: 4857.8. Samples: 782669418. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:09,575][25689] Avg episode reward: [(0, '-1.208')] [2022-07-10 14:34:09,895][26022] Updated weights on worker 0-0, policy_version 764332 (0.00087) [2022-07-10 14:34:11,838][26022] Updated weights on worker 0-0, policy_version 764342 (0.00086) [2022-07-10 14:34:13,520][26022] Updated weights on worker 0-0, policy_version 764352 (0.00088) [2022-07-10 14:34:14,582][25689] Fps is (10 sec: 5621.2, 60 sec: 5517.4, 300 sec: 5535.4). Total num frames: 782701568. Throughput: 0: 5724.8. Samples: 782703064. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:14,582][25689] Avg episode reward: [(0, '-0.744')] [2022-07-10 14:34:15,576][26022] Updated weights on worker 0-0, policy_version 764362 (0.00096) [2022-07-10 14:34:17,248][26022] Updated weights on worker 0-0, policy_version 764372 (0.00099) [2022-07-10 14:34:19,110][26022] Updated weights on worker 0-0, policy_version 764382 (0.00089) [2022-07-10 14:34:19,714][25689] Fps is (10 sec: 5553.8, 60 sec: 5532.2, 300 sec: 5530.6). Total num frames: 782730240. Throughput: 0: 5693.2. Samples: 782736426. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:19,716][25689] Avg episode reward: [(0, '0.059')] [2022-07-10 14:34:20,995][26022] Updated weights on worker 0-0, policy_version 764392 (0.00083) [2022-07-10 14:34:22,769][26022] Updated weights on worker 0-0, policy_version 764402 (0.00094) [2022-07-10 14:34:24,632][26022] Updated weights on worker 0-0, policy_version 764412 (0.00082) [2022-07-10 14:34:24,731][25689] Fps is (10 sec: 5549.2, 60 sec: 5533.9, 300 sec: 5530.4). Total num frames: 782757888. Throughput: 0: 4993.0. Samples: 782753370. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:24,732][25689] Avg episode reward: [(0, '0.233')] [2022-07-10 14:34:26,311][26022] Updated weights on worker 0-0, policy_version 764422 (0.00095) [2022-07-10 14:34:28,246][26022] Updated weights on worker 0-0, policy_version 764432 (0.00093) [2022-07-10 14:34:29,782][25689] Fps is (10 sec: 5593.9, 60 sec: 5529.8, 300 sec: 5536.8). Total num frames: 782786560. Throughput: 0: 5811.1. Samples: 782786854. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:29,784][25689] Avg episode reward: [(0, '0.744')] [2022-07-10 14:34:30,188][26022] Updated weights on worker 0-0, policy_version 764442 (0.00084) [2022-07-10 14:34:31,975][26022] Updated weights on worker 0-0, policy_version 764452 (0.00083) [2022-07-10 14:34:33,793][26022] Updated weights on worker 0-0, policy_version 764462 (0.00090) [2022-07-10 14:34:34,797][25689] Fps is (10 sec: 5595.3, 60 sec: 5529.4, 300 sec: 5528.1). Total num frames: 782814208. Throughput: 0: 5793.9. Samples: 782820188. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:34,798][25689] Avg episode reward: [(0, '0.362')] [2022-07-10 14:34:35,591][26022] Updated weights on worker 0-0, policy_version 764472 (0.00092) [2022-07-10 14:34:37,468][26022] Updated weights on worker 0-0, policy_version 764482 (0.00085) [2022-07-10 14:34:38,743][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:34:38,752][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000764489_782836736.pth [2022-07-10 14:34:38,753][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000762542_780843008.pth [2022-07-10 14:34:39,385][26022] Updated weights on worker 0-0, policy_version 764492 (0.00091) [2022-07-10 14:34:39,854][25689] Fps is (10 sec: 5592.0, 60 sec: 5536.0, 300 sec: 5537.6). Total num frames: 782842880. Throughput: 0: 4995.5. Samples: 782837034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:39,854][25689] Avg episode reward: [(0, '0.521')] [2022-07-10 14:34:41,203][26022] Updated weights on worker 0-0, policy_version 764502 (0.00070) [2022-07-10 14:34:43,019][26022] Updated weights on worker 0-0, policy_version 764512 (0.00087) [2022-07-10 14:34:44,877][25689] Fps is (10 sec: 5587.0, 60 sec: 5520.3, 300 sec: 5530.6). Total num frames: 782870528. Throughput: 0: 5801.0. Samples: 782870238. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:44,878][25689] Avg episode reward: [(0, '0.264')] [2022-07-10 14:34:44,882][26022] Updated weights on worker 0-0, policy_version 764522 (0.00088) [2022-07-10 14:34:46,866][26022] Updated weights on worker 0-0, policy_version 764532 (0.00093) [2022-07-10 14:34:48,560][26022] Updated weights on worker 0-0, policy_version 764542 (0.00094) [2022-07-10 14:34:49,916][25689] Fps is (10 sec: 5495.4, 60 sec: 5535.0, 300 sec: 5530.2). Total num frames: 782898176. Throughput: 0: 5793.7. Samples: 782903504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:49,916][25689] Avg episode reward: [(0, '-0.779')] [2022-07-10 14:34:50,495][26022] Updated weights on worker 0-0, policy_version 764552 (0.00085) [2022-07-10 14:34:52,259][26022] Updated weights on worker 0-0, policy_version 764562 (0.00093) [2022-07-10 14:34:54,095][26022] Updated weights on worker 0-0, policy_version 764572 (0.00084) [2022-07-10 14:34:55,009][25689] Fps is (10 sec: 5558.6, 60 sec: 5545.4, 300 sec: 5536.4). Total num frames: 782926848. Throughput: 0: 4965.3. Samples: 782920550. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:34:55,010][25689] Avg episode reward: [(0, '-1.363')] [2022-07-10 14:34:55,830][26022] Updated weights on worker 0-0, policy_version 764582 (0.00088) [2022-07-10 14:34:57,597][26022] Updated weights on worker 0-0, policy_version 764592 (0.00085) [2022-07-10 14:34:59,755][26022] Updated weights on worker 0-0, policy_version 764602 (0.00091) [2022-07-10 14:35:00,084][25689] Fps is (10 sec: 5538.7, 60 sec: 5528.7, 300 sec: 5538.6). Total num frames: 782954496. Throughput: 0: 5789.4. Samples: 782954158. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:00,085][25689] Avg episode reward: [(0, '-2.532')] [2022-07-10 14:35:01,597][26022] Updated weights on worker 0-0, policy_version 764612 (0.00087) [2022-07-10 14:35:03,530][26022] Updated weights on worker 0-0, policy_version 764622 (0.00095) [2022-07-10 14:35:05,167][25689] Fps is (10 sec: 5343.1, 60 sec: 5542.1, 300 sec: 5527.0). Total num frames: 782981120. Throughput: 0: 5669.8. Samples: 782985276. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:05,167][25689] Avg episode reward: [(0, '-2.579')] [2022-07-10 14:35:05,462][26022] Updated weights on worker 0-0, policy_version 764632 (0.00089) [2022-07-10 14:35:07,515][26022] Updated weights on worker 0-0, policy_version 764642 (0.00088) [2022-07-10 14:35:09,067][26022] Updated weights on worker 0-0, policy_version 764652 (0.00084) [2022-07-10 14:35:10,176][25689] Fps is (10 sec: 5378.1, 60 sec: 5525.3, 300 sec: 5534.2). Total num frames: 783008768. Throughput: 0: 4874.4. Samples: 783002258. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:10,176][25689] Avg episode reward: [(0, '-2.398')] [2022-07-10 14:35:11,240][26022] Updated weights on worker 0-0, policy_version 764662 (0.00082) [2022-07-10 14:35:12,733][26022] Updated weights on worker 0-0, policy_version 764672 (0.00093) [2022-07-10 14:35:14,815][26022] Updated weights on worker 0-0, policy_version 764682 (0.00089) [2022-07-10 14:35:15,186][25689] Fps is (10 sec: 5518.8, 60 sec: 5525.1, 300 sec: 5528.2). Total num frames: 783036416. Throughput: 0: 5692.2. Samples: 783035402. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:15,187][25689] Avg episode reward: [(0, '-2.142')] [2022-07-10 14:35:16,578][26022] Updated weights on worker 0-0, policy_version 764692 (0.00092) [2022-07-10 14:35:18,399][26022] Updated weights on worker 0-0, policy_version 764702 (0.00090) [2022-07-10 14:35:20,239][25689] Fps is (10 sec: 5494.7, 60 sec: 5515.5, 300 sec: 5534.4). Total num frames: 783064064. Throughput: 0: 5675.4. Samples: 783068544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:20,240][25689] Avg episode reward: [(0, '-2.004')] [2022-07-10 14:35:20,370][26022] Updated weights on worker 0-0, policy_version 764712 (0.00091) [2022-07-10 14:35:22,099][26022] Updated weights on worker 0-0, policy_version 764722 (0.00086) [2022-07-10 14:35:24,029][26022] Updated weights on worker 0-0, policy_version 764732 (0.00083) [2022-07-10 14:35:25,241][25689] Fps is (10 sec: 5703.1, 60 sec: 5550.7, 300 sec: 5538.1). Total num frames: 783093760. Throughput: 0: 5812.5. Samples: 783101960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:25,242][25689] Avg episode reward: [(0, '-2.161')] [2022-07-10 14:35:25,848][26022] Updated weights on worker 0-0, policy_version 764742 (0.00090) [2022-07-10 14:35:27,639][26022] Updated weights on worker 0-0, policy_version 764752 (0.00092) [2022-07-10 14:35:29,712][26022] Updated weights on worker 0-0, policy_version 764762 (0.00084) [2022-07-10 14:35:30,251][25689] Fps is (10 sec: 5625.5, 60 sec: 5520.6, 300 sec: 5531.5). Total num frames: 783120384. Throughput: 0: 5793.3. Samples: 783118560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:30,252][25689] Avg episode reward: [(0, '-1.049')] [2022-07-10 14:35:31,269][26022] Updated weights on worker 0-0, policy_version 764772 (0.00083) [2022-07-10 14:35:33,214][26022] Updated weights on worker 0-0, policy_version 764782 (0.00093) [2022-07-10 14:35:35,038][26022] Updated weights on worker 0-0, policy_version 764792 (0.00078) [2022-07-10 14:35:35,293][25689] Fps is (10 sec: 5501.3, 60 sec: 5535.0, 300 sec: 5539.4). Total num frames: 783149056. Throughput: 0: 5804.6. Samples: 783152112. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:35,293][25689] Avg episode reward: [(0, '-2.185')] [2022-07-10 14:35:36,876][26022] Updated weights on worker 0-0, policy_version 764802 (0.00090) [2022-07-10 14:35:38,659][26022] Updated weights on worker 0-0, policy_version 764812 (0.00090) [2022-07-10 14:35:40,418][25689] Fps is (10 sec: 5539.6, 60 sec: 5511.9, 300 sec: 5533.7). Total num frames: 783176704. Throughput: 0: 5805.2. Samples: 783185684. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 14:35:40,418][25689] Avg episode reward: [(0, '-1.834')] [2022-07-10 14:35:40,440][26022] Updated weights on worker 0-0, policy_version 764822 (0.00095) [2022-07-10 14:35:42,183][26022] Updated weights on worker 0-0, policy_version 764832 (0.00080) [2022-07-10 14:35:44,309][26022] Updated weights on worker 0-0, policy_version 764842 (0.00089) [2022-07-10 14:35:45,454][25689] Fps is (10 sec: 5643.3, 60 sec: 5544.5, 300 sec: 5537.7). Total num frames: 783206400. Throughput: 0: 4981.0. Samples: 783202642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:35:45,455][25689] Avg episode reward: [(0, '-1.778')] [2022-07-10 14:35:45,905][26022] Updated weights on worker 0-0, policy_version 764852 (0.00077) [2022-07-10 14:35:47,959][26022] Updated weights on worker 0-0, policy_version 764862 (0.00091) [2022-07-10 14:35:49,567][26022] Updated weights on worker 0-0, policy_version 764872 (0.00091) [2022-07-10 14:35:50,480][25689] Fps is (10 sec: 5495.4, 60 sec: 5511.9, 300 sec: 5527.9). Total num frames: 783232000. Throughput: 0: 5820.4. Samples: 783236302. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:35:50,480][25689] Avg episode reward: [(0, '-1.670')] [2022-07-10 14:35:51,466][26022] Updated weights on worker 0-0, policy_version 764882 (0.00096) [2022-07-10 14:35:53,346][26022] Updated weights on worker 0-0, policy_version 764892 (0.00089) [2022-07-10 14:35:55,080][26022] Updated weights on worker 0-0, policy_version 764902 (0.00080) [2022-07-10 14:35:55,518][25689] Fps is (10 sec: 5596.0, 60 sec: 5550.7, 300 sec: 5539.7). Total num frames: 783262720. Throughput: 0: 5811.3. Samples: 783269650. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:35:55,519][25689] Avg episode reward: [(0, '-1.160')] [2022-07-10 14:35:57,143][26022] Updated weights on worker 0-0, policy_version 764912 (0.00085) [2022-07-10 14:35:58,660][26022] Updated weights on worker 0-0, policy_version 764922 (0.00097) [2022-07-10 14:36:00,587][25689] Fps is (10 sec: 5673.7, 60 sec: 5534.4, 300 sec: 5535.4). Total num frames: 783289344. Throughput: 0: 4998.4. Samples: 783286498. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:00,587][25689] Avg episode reward: [(0, '-0.386')] [2022-07-10 14:36:00,796][26022] Updated weights on worker 0-0, policy_version 764932 (0.00085) [2022-07-10 14:36:02,781][26022] Updated weights on worker 0-0, policy_version 764942 (0.00089) [2022-07-10 14:36:04,820][26022] Updated weights on worker 0-0, policy_version 764952 (0.00092) [2022-07-10 14:36:05,651][25689] Fps is (10 sec: 5154.2, 60 sec: 5519.2, 300 sec: 5532.0). Total num frames: 783314944. Throughput: 0: 5688.7. Samples: 783317534. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:05,651][25689] Avg episode reward: [(0, '0.401')] [2022-07-10 14:36:06,527][26022] Updated weights on worker 0-0, policy_version 764962 (0.00089) [2022-07-10 14:36:08,466][26022] Updated weights on worker 0-0, policy_version 764972 (0.00100) [2022-07-10 14:36:10,256][26022] Updated weights on worker 0-0, policy_version 764982 (0.00085) [2022-07-10 14:36:10,652][25689] Fps is (10 sec: 5493.6, 60 sec: 5553.7, 300 sec: 5539.1). Total num frames: 783344640. Throughput: 0: 5692.0. Samples: 783351124. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:10,653][25689] Avg episode reward: [(0, '-0.841')] [2022-07-10 14:36:12,103][26022] Updated weights on worker 0-0, policy_version 764992 (0.00091) [2022-07-10 14:36:13,887][26022] Updated weights on worker 0-0, policy_version 765002 (0.00086) [2022-07-10 14:36:15,672][25689] Fps is (10 sec: 5517.7, 60 sec: 5519.0, 300 sec: 5529.6). Total num frames: 783370240. Throughput: 0: 4887.4. Samples: 783368150. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:15,673][25689] Avg episode reward: [(0, '-0.997')] [2022-07-10 14:36:15,815][26022] Updated weights on worker 0-0, policy_version 765012 (0.00085) [2022-07-10 14:36:17,499][26022] Updated weights on worker 0-0, policy_version 765022 (0.00085) [2022-07-10 14:36:19,596][26022] Updated weights on worker 0-0, policy_version 765032 (0.00089) [2022-07-10 14:36:20,761][25689] Fps is (10 sec: 5571.5, 60 sec: 5566.5, 300 sec: 5538.7). Total num frames: 783400960. Throughput: 0: 5713.5. Samples: 783401764. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:20,762][25689] Avg episode reward: [(0, '-1.512')] [2022-07-10 14:36:21,060][26022] Updated weights on worker 0-0, policy_version 765042 (0.00094) [2022-07-10 14:36:23,133][26022] Updated weights on worker 0-0, policy_version 765052 (0.00080) [2022-07-10 14:36:24,911][26022] Updated weights on worker 0-0, policy_version 765062 (0.00089) [2022-07-10 14:36:25,773][25689] Fps is (10 sec: 5677.5, 60 sec: 5514.9, 300 sec: 5531.6). Total num frames: 783427584. Throughput: 0: 5859.9. Samples: 783435446. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:25,773][25689] Avg episode reward: [(0, '-1.508')] [2022-07-10 14:36:26,775][26022] Updated weights on worker 0-0, policy_version 765072 (0.00385) [2022-07-10 14:36:28,750][26022] Updated weights on worker 0-0, policy_version 765082 (0.00089) [2022-07-10 14:36:30,193][26022] Updated weights on worker 0-0, policy_version 765092 (0.00101) [2022-07-10 14:36:30,789][25689] Fps is (10 sec: 5412.0, 60 sec: 5531.2, 300 sec: 5531.4). Total num frames: 783455232. Throughput: 0: 5010.7. Samples: 783452022. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:30,790][25689] Avg episode reward: [(0, '-1.942')] [2022-07-10 14:36:32,300][26022] Updated weights on worker 0-0, policy_version 765102 (0.00091) [2022-07-10 14:36:34,014][26022] Updated weights on worker 0-0, policy_version 765112 (0.00085) [2022-07-10 14:36:35,804][25689] Fps is (10 sec: 5614.3, 60 sec: 5533.6, 300 sec: 5539.3). Total num frames: 783483904. Throughput: 0: 5828.5. Samples: 783485488. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:35,804][25689] Avg episode reward: [(0, '-1.935')] [2022-07-10 14:36:35,920][26022] Updated weights on worker 0-0, policy_version 765122 (0.00089) [2022-07-10 14:36:37,659][26022] Updated weights on worker 0-0, policy_version 765132 (0.00085) [2022-07-10 14:36:38,816][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:36:38,824][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000765137_783500288.pth [2022-07-10 14:36:38,824][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000763190_781506560.pth [2022-07-10 14:36:39,615][26022] Updated weights on worker 0-0, policy_version 765142 (0.00091) [2022-07-10 14:36:40,863][25689] Fps is (10 sec: 5590.9, 60 sec: 5539.7, 300 sec: 5535.0). Total num frames: 783511552. Throughput: 0: 5827.7. Samples: 783518910. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:40,863][25689] Avg episode reward: [(0, '-1.161')] [2022-07-10 14:36:41,414][26022] Updated weights on worker 0-0, policy_version 765152 (0.00086) [2022-07-10 14:36:43,155][26022] Updated weights on worker 0-0, policy_version 765162 (0.00058) [2022-07-10 14:36:45,202][26022] Updated weights on worker 0-0, policy_version 765172 (0.00092) [2022-07-10 14:36:45,890][25689] Fps is (10 sec: 5482.7, 60 sec: 5506.7, 300 sec: 5535.3). Total num frames: 783539200. Throughput: 0: 4994.7. Samples: 783535924. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:45,890][25689] Avg episode reward: [(0, '-0.155')] [2022-07-10 14:36:46,747][26022] Updated weights on worker 0-0, policy_version 765182 (0.00082) [2022-07-10 14:36:48,859][26022] Updated weights on worker 0-0, policy_version 765192 (0.00098) [2022-07-10 14:36:50,534][26022] Updated weights on worker 0-0, policy_version 765202 (0.00092) [2022-07-10 14:36:50,919][25689] Fps is (10 sec: 5702.4, 60 sec: 5574.2, 300 sec: 5538.7). Total num frames: 783568896. Throughput: 0: 5831.6. Samples: 783569410. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:50,919][25689] Avg episode reward: [(0, '-0.829')] [2022-07-10 14:36:52,443][26022] Updated weights on worker 0-0, policy_version 765212 (0.00093) [2022-07-10 14:36:54,199][26022] Updated weights on worker 0-0, policy_version 765222 (0.00067) [2022-07-10 14:36:55,941][25689] Fps is (10 sec: 5704.9, 60 sec: 5524.8, 300 sec: 5540.8). Total num frames: 783596544. Throughput: 0: 5837.0. Samples: 783603030. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:36:55,942][25689] Avg episode reward: [(0, '-2.212')] [2022-07-10 14:36:56,108][26022] Updated weights on worker 0-0, policy_version 765232 (0.00095) [2022-07-10 14:36:57,856][26022] Updated weights on worker 0-0, policy_version 765242 (0.00097) [2022-07-10 14:36:59,752][26022] Updated weights on worker 0-0, policy_version 765252 (0.00095) [2022-07-10 14:37:01,000][25689] Fps is (10 sec: 5586.5, 60 sec: 5559.6, 300 sec: 5548.1). Total num frames: 783625216. Throughput: 0: 5007.3. Samples: 783619744. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:01,001][25689] Avg episode reward: [(0, '-1.504')] [2022-07-10 14:37:01,582][26022] Updated weights on worker 0-0, policy_version 765262 (0.00083) [2022-07-10 14:37:04,051][26022] Updated weights on worker 0-0, policy_version 765272 (0.00098) [2022-07-10 14:37:05,528][26022] Updated weights on worker 0-0, policy_version 765282 (0.00089) [2022-07-10 14:37:06,030][25689] Fps is (10 sec: 5379.6, 60 sec: 5562.7, 300 sec: 5530.9). Total num frames: 783650816. Throughput: 0: 5708.8. Samples: 783650900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:06,030][25689] Avg episode reward: [(0, '-1.355')] [2022-07-10 14:37:07,605][26022] Updated weights on worker 0-0, policy_version 765292 (0.00083) [2022-07-10 14:37:09,107][26022] Updated weights on worker 0-0, policy_version 765302 (0.00083) [2022-07-10 14:37:11,078][25689] Fps is (10 sec: 5283.7, 60 sec: 5524.6, 300 sec: 5533.9). Total num frames: 783678464. Throughput: 0: 5716.9. Samples: 783684658. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:11,078][25689] Avg episode reward: [(0, '-1.121')] [2022-07-10 14:37:11,299][26022] Updated weights on worker 0-0, policy_version 765312 (0.00089) [2022-07-10 14:37:12,945][26022] Updated weights on worker 0-0, policy_version 765322 (0.00088) [2022-07-10 14:37:14,958][26022] Updated weights on worker 0-0, policy_version 765332 (0.00087) [2022-07-10 14:37:16,088][25689] Fps is (10 sec: 5599.2, 60 sec: 5576.3, 300 sec: 5538.7). Total num frames: 783707136. Throughput: 0: 4886.7. Samples: 783701482. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:16,089][25689] Avg episode reward: [(0, '-1.117')] [2022-07-10 14:37:16,566][26022] Updated weights on worker 0-0, policy_version 765342 (0.00089) [2022-07-10 14:37:18,444][26022] Updated weights on worker 0-0, policy_version 765352 (0.00090) [2022-07-10 14:37:20,470][26022] Updated weights on worker 0-0, policy_version 765362 (0.00091) [2022-07-10 14:37:21,237][25689] Fps is (10 sec: 5644.9, 60 sec: 5536.9, 300 sec: 5533.0). Total num frames: 783735808. Throughput: 0: 5694.4. Samples: 783734978. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:21,237][25689] Avg episode reward: [(0, '-1.337')] [2022-07-10 14:37:22,387][26022] Updated weights on worker 0-0, policy_version 765372 (0.00077) [2022-07-10 14:37:24,080][26022] Updated weights on worker 0-0, policy_version 765382 (0.00081) [2022-07-10 14:37:26,103][26022] Updated weights on worker 0-0, policy_version 765392 (0.00088) [2022-07-10 14:37:26,251][25689] Fps is (10 sec: 5340.3, 60 sec: 5519.7, 300 sec: 5527.5). Total num frames: 783761408. Throughput: 0: 5801.5. Samples: 783768214. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:26,252][25689] Avg episode reward: [(0, '-1.583')] [2022-07-10 14:37:27,692][26022] Updated weights on worker 0-0, policy_version 765402 (0.00085) [2022-07-10 14:37:29,880][26022] Updated weights on worker 0-0, policy_version 765412 (0.00088) [2022-07-10 14:37:31,206][26022] Updated weights on worker 0-0, policy_version 765422 (0.00091) [2022-07-10 14:37:31,305][25689] Fps is (10 sec: 5593.5, 60 sec: 5567.0, 300 sec: 5537.4). Total num frames: 783792128. Throughput: 0: 4954.0. Samples: 783784864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:31,306][25689] Avg episode reward: [(0, '-2.272')] [2022-07-10 14:37:33,517][26022] Updated weights on worker 0-0, policy_version 765432 (0.00107) [2022-07-10 14:37:34,860][26022] Updated weights on worker 0-0, policy_version 765442 (0.00087) [2022-07-10 14:37:36,331][25689] Fps is (10 sec: 5587.4, 60 sec: 5515.3, 300 sec: 5528.3). Total num frames: 783817728. Throughput: 0: 5785.1. Samples: 783818588. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:36,331][25689] Avg episode reward: [(0, '-2.806')] [2022-07-10 14:37:36,951][26022] Updated weights on worker 0-0, policy_version 765452 (0.00081) [2022-07-10 14:37:38,628][26022] Updated weights on worker 0-0, policy_version 765462 (0.00093) [2022-07-10 14:37:40,461][26022] Updated weights on worker 0-0, policy_version 765472 (0.00092) [2022-07-10 14:37:41,408][25689] Fps is (10 sec: 5574.9, 60 sec: 5564.4, 300 sec: 5537.8). Total num frames: 783848448. Throughput: 0: 5813.4. Samples: 783852242. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:41,408][25689] Avg episode reward: [(0, '-3.260')] [2022-07-10 14:37:42,280][26022] Updated weights on worker 0-0, policy_version 765482 (0.00086) [2022-07-10 14:37:43,915][26022] Updated weights on worker 0-0, policy_version 765492 (0.00092) [2022-07-10 14:37:46,071][26022] Updated weights on worker 0-0, policy_version 765502 (0.00094) [2022-07-10 14:37:46,474][25689] Fps is (10 sec: 5754.2, 60 sec: 5560.7, 300 sec: 5529.8). Total num frames: 783876096. Throughput: 0: 4995.0. Samples: 783869230. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:46,475][25689] Avg episode reward: [(0, '-3.606')] [2022-07-10 14:37:47,648][26022] Updated weights on worker 0-0, policy_version 765512 (0.00092) [2022-07-10 14:37:49,701][26022] Updated weights on worker 0-0, policy_version 765522 (0.00090) [2022-07-10 14:37:51,503][25689] Fps is (10 sec: 5578.9, 60 sec: 5543.9, 300 sec: 5543.1). Total num frames: 783904768. Throughput: 0: 5840.8. Samples: 783902834. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:51,503][25689] Avg episode reward: [(0, '-1.475')] [2022-07-10 14:37:51,504][26022] Updated weights on worker 0-0, policy_version 765532 (0.00091) [2022-07-10 14:37:53,303][26022] Updated weights on worker 0-0, policy_version 765542 (0.00088) [2022-07-10 14:37:55,369][26022] Updated weights on worker 0-0, policy_version 765552 (0.00087) [2022-07-10 14:37:56,599][25689] Fps is (10 sec: 5765.1, 60 sec: 5571.0, 300 sec: 5539.5). Total num frames: 783934464. Throughput: 0: 5820.0. Samples: 783936546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:37:56,599][25689] Avg episode reward: [(0, '-2.679')] [2022-07-10 14:37:56,766][26022] Updated weights on worker 0-0, policy_version 765562 (0.00084) [2022-07-10 14:37:58,883][26022] Updated weights on worker 0-0, policy_version 765572 (0.00081) [2022-07-10 14:38:00,616][26022] Updated weights on worker 0-0, policy_version 765582 (0.00089) [2022-07-10 14:38:01,675][25689] Fps is (10 sec: 5536.8, 60 sec: 5535.6, 300 sec: 5542.8). Total num frames: 783961088. Throughput: 0: 5824.7. Samples: 783970292. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:01,675][25689] Avg episode reward: [(0, '-2.224')] [2022-07-10 14:38:02,856][26022] Updated weights on worker 0-0, policy_version 765592 (0.00097) [2022-07-10 14:38:04,776][26022] Updated weights on worker 0-0, policy_version 765602 (0.00083) [2022-07-10 14:38:06,602][26022] Updated weights on worker 0-0, policy_version 765612 (0.00086) [2022-07-10 14:38:06,691][25689] Fps is (10 sec: 5174.6, 60 sec: 5536.8, 300 sec: 5539.3). Total num frames: 783986688. Throughput: 0: 5718.1. Samples: 783984830. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:06,692][25689] Avg episode reward: [(0, '-2.107')] [2022-07-10 14:38:08,137][26022] Updated weights on worker 0-0, policy_version 765622 (0.00091) [2022-07-10 14:38:10,323][26022] Updated weights on worker 0-0, policy_version 765632 (0.00085) [2022-07-10 14:38:11,707][25689] Fps is (10 sec: 5512.0, 60 sec: 5573.6, 300 sec: 5539.3). Total num frames: 784016384. Throughput: 0: 5723.0. Samples: 784018460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:11,707][25689] Avg episode reward: [(0, '-2.447')] [2022-07-10 14:38:11,711][26022] Updated weights on worker 0-0, policy_version 765642 (0.00090) [2022-07-10 14:38:13,984][26022] Updated weights on worker 0-0, policy_version 765652 (0.00091) [2022-07-10 14:38:15,467][26022] Updated weights on worker 0-0, policy_version 765662 (0.00094) [2022-07-10 14:38:16,722][25689] Fps is (10 sec: 5614.8, 60 sec: 5539.4, 300 sec: 5537.6). Total num frames: 784043008. Throughput: 0: 5738.5. Samples: 784052022. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:16,723][25689] Avg episode reward: [(0, '-3.862')] [2022-07-10 14:38:17,580][26022] Updated weights on worker 0-0, policy_version 765672 (0.00088) [2022-07-10 14:38:19,284][26022] Updated weights on worker 0-0, policy_version 765682 (0.00093) [2022-07-10 14:38:21,180][26022] Updated weights on worker 0-0, policy_version 765692 (0.00084) [2022-07-10 14:38:21,856][25689] Fps is (10 sec: 5347.1, 60 sec: 5523.7, 300 sec: 5535.7). Total num frames: 784070656. Throughput: 0: 4872.0. Samples: 784068616. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:21,858][25689] Avg episode reward: [(0, '-3.808')] [2022-07-10 14:38:22,988][26022] Updated weights on worker 0-0, policy_version 765702 (0.00097) [2022-07-10 14:38:25,060][26022] Updated weights on worker 0-0, policy_version 765712 (0.00093) [2022-07-10 14:38:26,721][26022] Updated weights on worker 0-0, policy_version 765722 (0.00081) [2022-07-10 14:38:26,877][25689] Fps is (10 sec: 5545.9, 60 sec: 5573.8, 300 sec: 5535.4). Total num frames: 784099328. Throughput: 0: 5794.7. Samples: 784101802. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:26,879][25689] Avg episode reward: [(0, '-2.355')] [2022-07-10 14:38:28,910][26022] Updated weights on worker 0-0, policy_version 765732 (0.00094) [2022-07-10 14:38:30,435][26022] Updated weights on worker 0-0, policy_version 765742 (0.00086) [2022-07-10 14:38:31,900][25689] Fps is (10 sec: 5505.4, 60 sec: 5509.1, 300 sec: 5531.8). Total num frames: 784125952. Throughput: 0: 5779.8. Samples: 784135176. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:31,902][25689] Avg episode reward: [(0, '-2.434')] [2022-07-10 14:38:32,496][26022] Updated weights on worker 0-0, policy_version 765752 (0.00089) [2022-07-10 14:38:33,926][26022] Updated weights on worker 0-0, policy_version 765762 (0.00081) [2022-07-10 14:38:36,183][26022] Updated weights on worker 0-0, policy_version 765772 (0.00092) [2022-07-10 14:38:36,912][25689] Fps is (10 sec: 5510.2, 60 sec: 5561.1, 300 sec: 5533.9). Total num frames: 784154624. Throughput: 0: 4955.8. Samples: 784152082. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:36,913][25689] Avg episode reward: [(0, '-2.134')] [2022-07-10 14:38:37,527][26022] Updated weights on worker 0-0, policy_version 765782 (0.00080) [2022-07-10 14:38:38,886][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:38:38,900][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000765788_784166912.pth [2022-07-10 14:38:38,901][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000763841_782173184.pth [2022-07-10 14:38:39,693][26022] Updated weights on worker 0-0, policy_version 765792 (0.00092) [2022-07-10 14:38:41,271][26022] Updated weights on worker 0-0, policy_version 765802 (0.00096) [2022-07-10 14:38:41,994][25689] Fps is (10 sec: 5782.4, 60 sec: 5543.7, 300 sec: 5536.5). Total num frames: 784184320. Throughput: 0: 5818.9. Samples: 784185798. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:41,995][25689] Avg episode reward: [(0, '-0.669')] [2022-07-10 14:38:43,149][26022] Updated weights on worker 0-0, policy_version 765812 (0.00082) [2022-07-10 14:38:44,963][26022] Updated weights on worker 0-0, policy_version 765822 (0.00085) [2022-07-10 14:38:46,763][26022] Updated weights on worker 0-0, policy_version 765832 (0.00091) [2022-07-10 14:38:47,042][25689] Fps is (10 sec: 5661.1, 60 sec: 5545.4, 300 sec: 5539.4). Total num frames: 784211968. Throughput: 0: 5845.3. Samples: 784219670. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:47,042][25689] Avg episode reward: [(0, '-0.409')] [2022-07-10 14:38:48,694][26022] Updated weights on worker 0-0, policy_version 765842 (0.00089) [2022-07-10 14:38:50,609][26022] Updated weights on worker 0-0, policy_version 765852 (0.00091) [2022-07-10 14:38:52,070][25689] Fps is (10 sec: 5488.4, 60 sec: 5528.6, 300 sec: 5539.3). Total num frames: 784239616. Throughput: 0: 5017.6. Samples: 784236378. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:52,070][25689] Avg episode reward: [(0, '-0.315')] [2022-07-10 14:38:52,600][26022] Updated weights on worker 0-0, policy_version 765862 (0.00089) [2022-07-10 14:38:54,315][26022] Updated weights on worker 0-0, policy_version 765872 (0.00086) [2022-07-10 14:38:56,306][26022] Updated weights on worker 0-0, policy_version 765882 (0.00083) [2022-07-10 14:38:57,102][25689] Fps is (10 sec: 5598.1, 60 sec: 5517.4, 300 sec: 5540.1). Total num frames: 784268288. Throughput: 0: 5795.5. Samples: 784269094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:38:57,103][25689] Avg episode reward: [(0, '-0.877')] [2022-07-10 14:38:57,979][26022] Updated weights on worker 0-0, policy_version 765892 (0.00082) [2022-07-10 14:38:59,885][26022] Updated weights on worker 0-0, policy_version 765902 (0.00087) [2022-07-10 14:39:01,911][26022] Updated weights on worker 0-0, policy_version 765912 (0.00082) [2022-07-10 14:39:02,191][25689] Fps is (10 sec: 5362.2, 60 sec: 5499.4, 300 sec: 5539.3). Total num frames: 784293888. Throughput: 0: 5759.2. Samples: 784302114. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:39:02,191][25689] Avg episode reward: [(0, '-1.106')] [2022-07-10 14:39:03,807][26022] Updated weights on worker 0-0, policy_version 765922 (0.00084) [2022-07-10 14:39:05,820][26022] Updated weights on worker 0-0, policy_version 765932 (0.00094) [2022-07-10 14:39:07,264][25689] Fps is (10 sec: 5340.6, 60 sec: 5544.9, 300 sec: 5538.1). Total num frames: 784322560. Throughput: 0: 4837.5. Samples: 784317496. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 14:39:07,265][25689] Avg episode reward: [(0, '-3.031')] [2022-07-10 14:39:07,454][26022] Updated weights on worker 0-0, policy_version 765942 (0.00091) [2022-07-10 14:39:09,422][26022] Updated weights on worker 0-0, policy_version 765952 (0.00093) [2022-07-10 14:39:11,235][26022] Updated weights on worker 0-0, policy_version 765962 (0.00084) [2022-07-10 14:39:12,322][25689] Fps is (10 sec: 5559.3, 60 sec: 5507.3, 300 sec: 5537.2). Total num frames: 784350208. Throughput: 0: 5654.2. Samples: 784350888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:12,322][25689] Avg episode reward: [(0, '-2.647')] [2022-07-10 14:39:13,043][26022] Updated weights on worker 0-0, policy_version 765972 (0.00091) [2022-07-10 14:39:14,968][26022] Updated weights on worker 0-0, policy_version 765982 (0.00087) [2022-07-10 14:39:16,698][26022] Updated weights on worker 0-0, policy_version 765992 (0.00082) [2022-07-10 14:39:17,340][25689] Fps is (10 sec: 5589.7, 60 sec: 5540.8, 300 sec: 5539.3). Total num frames: 784378880. Throughput: 0: 5706.4. Samples: 784384578. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:17,341][25689] Avg episode reward: [(0, '-2.157')] [2022-07-10 14:39:18,775][26022] Updated weights on worker 0-0, policy_version 766002 (0.00092) [2022-07-10 14:39:20,494][26022] Updated weights on worker 0-0, policy_version 766012 (0.00092) [2022-07-10 14:39:22,259][26022] Updated weights on worker 0-0, policy_version 766022 (0.00079) [2022-07-10 14:39:22,398][25689] Fps is (10 sec: 5589.1, 60 sec: 5547.8, 300 sec: 5538.5). Total num frames: 784406528. Throughput: 0: 4904.0. Samples: 784401212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:22,399][25689] Avg episode reward: [(0, '-1.763')] [2022-07-10 14:39:24,107][26022] Updated weights on worker 0-0, policy_version 766032 (0.00083) [2022-07-10 14:39:25,967][26022] Updated weights on worker 0-0, policy_version 766042 (0.00081) [2022-07-10 14:39:27,399][25689] Fps is (10 sec: 5598.8, 60 sec: 5549.6, 300 sec: 5539.5). Total num frames: 784435200. Throughput: 0: 5815.1. Samples: 784434582. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:27,401][25689] Avg episode reward: [(0, '-2.563')] [2022-07-10 14:39:27,986][26022] Updated weights on worker 0-0, policy_version 766052 (0.00089) [2022-07-10 14:39:29,704][26022] Updated weights on worker 0-0, policy_version 766062 (0.00092) [2022-07-10 14:39:31,499][26022] Updated weights on worker 0-0, policy_version 766072 (0.00088) [2022-07-10 14:39:32,404][25689] Fps is (10 sec: 5526.3, 60 sec: 5551.2, 300 sec: 5536.2). Total num frames: 784461824. Throughput: 0: 5829.8. Samples: 784467966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:32,406][25689] Avg episode reward: [(0, '-1.598')] [2022-07-10 14:39:33,310][26022] Updated weights on worker 0-0, policy_version 766082 (0.00088) [2022-07-10 14:39:35,183][26022] Updated weights on worker 0-0, policy_version 766092 (0.00087) [2022-07-10 14:39:37,150][26022] Updated weights on worker 0-0, policy_version 766102 (0.00083) [2022-07-10 14:39:37,457][25689] Fps is (10 sec: 5395.8, 60 sec: 5530.5, 300 sec: 5532.8). Total num frames: 784489472. Throughput: 0: 4983.4. Samples: 784484832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:37,458][25689] Avg episode reward: [(0, '-0.077')] [2022-07-10 14:39:38,878][26022] Updated weights on worker 0-0, policy_version 766112 (0.00090) [2022-07-10 14:39:40,842][26022] Updated weights on worker 0-0, policy_version 766122 (0.00084) [2022-07-10 14:39:42,507][25689] Fps is (10 sec: 5574.7, 60 sec: 5516.6, 300 sec: 5535.8). Total num frames: 784518144. Throughput: 0: 5812.8. Samples: 784518100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:42,507][25689] Avg episode reward: [(0, '0.216')] [2022-07-10 14:39:42,611][26022] Updated weights on worker 0-0, policy_version 766132 (0.00097) [2022-07-10 14:39:44,429][26022] Updated weights on worker 0-0, policy_version 766142 (0.00086) [2022-07-10 14:39:46,384][26022] Updated weights on worker 0-0, policy_version 766152 (0.00093) [2022-07-10 14:39:47,527][25689] Fps is (10 sec: 5694.7, 60 sec: 5536.0, 300 sec: 5539.6). Total num frames: 784546816. Throughput: 0: 5824.3. Samples: 784551812. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:47,528][25689] Avg episode reward: [(0, '-1.087')] [2022-07-10 14:39:48,003][26022] Updated weights on worker 0-0, policy_version 766162 (0.00088) [2022-07-10 14:39:50,019][26022] Updated weights on worker 0-0, policy_version 766172 (0.00079) [2022-07-10 14:39:51,619][26022] Updated weights on worker 0-0, policy_version 766182 (0.00079) [2022-07-10 14:39:52,543][25689] Fps is (10 sec: 5611.6, 60 sec: 5537.1, 300 sec: 5537.6). Total num frames: 784574464. Throughput: 0: 4992.1. Samples: 784568502. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:52,544][25689] Avg episode reward: [(0, '-0.708')] [2022-07-10 14:39:53,798][26022] Updated weights on worker 0-0, policy_version 766192 (0.00091) [2022-07-10 14:39:55,452][26022] Updated weights on worker 0-0, policy_version 766202 (0.00085) [2022-07-10 14:39:57,309][26022] Updated weights on worker 0-0, policy_version 766212 (0.00617) [2022-07-10 14:39:57,571][25689] Fps is (10 sec: 5607.5, 60 sec: 5537.6, 300 sec: 5541.9). Total num frames: 784603136. Throughput: 0: 5828.0. Samples: 784602052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:39:57,571][25689] Avg episode reward: [(0, '-0.933')] [2022-07-10 14:39:59,048][26022] Updated weights on worker 0-0, policy_version 766222 (0.00084) [2022-07-10 14:40:00,822][26022] Updated weights on worker 0-0, policy_version 766232 (0.00091) [2022-07-10 14:40:02,707][25689] Fps is (10 sec: 5440.4, 60 sec: 5550.1, 300 sec: 5540.9). Total num frames: 784629760. Throughput: 0: 5749.6. Samples: 784634244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:02,708][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 14:40:03,027][26022] Updated weights on worker 0-0, policy_version 766242 (0.00083) [2022-07-10 14:40:05,078][26022] Updated weights on worker 0-0, policy_version 766252 (0.00095) [2022-07-10 14:40:06,683][26022] Updated weights on worker 0-0, policy_version 766262 (0.00084) [2022-07-10 14:40:07,723][25689] Fps is (10 sec: 5345.9, 60 sec: 5538.5, 300 sec: 5540.8). Total num frames: 784657408. Throughput: 0: 4883.1. Samples: 784650430. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:07,723][25689] Avg episode reward: [(0, '-1.471')] [2022-07-10 14:40:08,733][26022] Updated weights on worker 0-0, policy_version 766272 (0.00086) [2022-07-10 14:40:10,236][26022] Updated weights on worker 0-0, policy_version 766282 (0.00080) [2022-07-10 14:40:12,273][26022] Updated weights on worker 0-0, policy_version 766292 (0.00084) [2022-07-10 14:40:12,750][25689] Fps is (10 sec: 5505.8, 60 sec: 5541.2, 300 sec: 5540.4). Total num frames: 784685056. Throughput: 0: 5711.1. Samples: 784683906. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:12,751][25689] Avg episode reward: [(0, '-1.158')] [2022-07-10 14:40:14,157][26022] Updated weights on worker 0-0, policy_version 766302 (0.00093) [2022-07-10 14:40:15,890][26022] Updated weights on worker 0-0, policy_version 766312 (0.00097) [2022-07-10 14:40:17,766][25689] Fps is (10 sec: 5505.7, 60 sec: 5524.5, 300 sec: 5541.1). Total num frames: 784712704. Throughput: 0: 5716.4. Samples: 784717496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:17,767][25689] Avg episode reward: [(0, '-0.912')] [2022-07-10 14:40:17,983][26022] Updated weights on worker 0-0, policy_version 766322 (0.00079) [2022-07-10 14:40:19,472][26022] Updated weights on worker 0-0, policy_version 766332 (0.00093) [2022-07-10 14:40:21,592][26022] Updated weights on worker 0-0, policy_version 766342 (0.00090) [2022-07-10 14:40:22,913][25689] Fps is (10 sec: 5642.7, 60 sec: 5550.3, 300 sec: 5538.4). Total num frames: 784742400. Throughput: 0: 5772.5. Samples: 784750882. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:22,914][25689] Avg episode reward: [(0, '-0.754')] [2022-07-10 14:40:23,268][26022] Updated weights on worker 0-0, policy_version 766352 (0.00097) [2022-07-10 14:40:25,080][26022] Updated weights on worker 0-0, policy_version 766362 (0.00090) [2022-07-10 14:40:27,157][26022] Updated weights on worker 0-0, policy_version 766372 (0.00086) [2022-07-10 14:40:27,945][25689] Fps is (10 sec: 5532.8, 60 sec: 5513.6, 300 sec: 5538.0). Total num frames: 784769024. Throughput: 0: 5780.6. Samples: 784767330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:27,946][25689] Avg episode reward: [(0, '-0.602')] [2022-07-10 14:40:28,853][26022] Updated weights on worker 0-0, policy_version 766382 (0.00088) [2022-07-10 14:40:30,808][26022] Updated weights on worker 0-0, policy_version 766392 (0.00086) [2022-07-10 14:40:32,620][26022] Updated weights on worker 0-0, policy_version 766402 (0.00083) [2022-07-10 14:40:32,949][25689] Fps is (10 sec: 5407.7, 60 sec: 5530.6, 300 sec: 5535.3). Total num frames: 784796672. Throughput: 0: 5776.5. Samples: 784800584. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:32,951][25689] Avg episode reward: [(0, '-1.336')] [2022-07-10 14:40:34,387][26022] Updated weights on worker 0-0, policy_version 766412 (0.00416) [2022-07-10 14:40:36,312][26022] Updated weights on worker 0-0, policy_version 766422 (0.00094) [2022-07-10 14:40:38,033][25689] Fps is (10 sec: 5583.1, 60 sec: 5544.7, 300 sec: 5539.5). Total num frames: 784825344. Throughput: 0: 5733.6. Samples: 784833698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:38,033][25689] Avg episode reward: [(0, '-1.236')] [2022-07-10 14:40:38,127][26022] Updated weights on worker 0-0, policy_version 766432 (0.00356) [2022-07-10 14:40:38,912][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:40:38,925][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000766435_784829440.pth [2022-07-10 14:40:38,925][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000764489_782836736.pth [2022-07-10 14:40:39,938][26022] Updated weights on worker 0-0, policy_version 766442 (0.00084) [2022-07-10 14:40:42,078][26022] Updated weights on worker 0-0, policy_version 766452 (0.00088) [2022-07-10 14:40:43,114][25689] Fps is (10 sec: 5540.7, 60 sec: 5524.9, 300 sec: 5531.8). Total num frames: 784852992. Throughput: 0: 4924.5. Samples: 784850360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:43,114][25689] Avg episode reward: [(0, '-2.209')] [2022-07-10 14:40:43,580][26022] Updated weights on worker 0-0, policy_version 766462 (0.00095) [2022-07-10 14:40:45,631][26022] Updated weights on worker 0-0, policy_version 766472 (0.00092) [2022-07-10 14:40:47,236][26022] Updated weights on worker 0-0, policy_version 766482 (0.00089) [2022-07-10 14:40:48,118][25689] Fps is (10 sec: 5482.7, 60 sec: 5509.4, 300 sec: 5539.1). Total num frames: 784880640. Throughput: 0: 5779.5. Samples: 784883920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:48,119][25689] Avg episode reward: [(0, '-2.507')] [2022-07-10 14:40:49,122][26022] Updated weights on worker 0-0, policy_version 766492 (0.00093) [2022-07-10 14:40:51,142][26022] Updated weights on worker 0-0, policy_version 766502 (0.00086) [2022-07-10 14:40:52,819][26022] Updated weights on worker 0-0, policy_version 766512 (0.00091) [2022-07-10 14:40:53,141][25689] Fps is (10 sec: 5616.5, 60 sec: 5525.7, 300 sec: 5532.5). Total num frames: 784909312. Throughput: 0: 5800.5. Samples: 784917710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:53,143][25689] Avg episode reward: [(0, '-3.113')] [2022-07-10 14:40:54,610][26022] Updated weights on worker 0-0, policy_version 766522 (0.00094) [2022-07-10 14:40:56,615][26022] Updated weights on worker 0-0, policy_version 766532 (0.00085) [2022-07-10 14:40:58,171][25689] Fps is (10 sec: 5704.3, 60 sec: 5525.5, 300 sec: 5540.1). Total num frames: 784937984. Throughput: 0: 5002.0. Samples: 784934430. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:40:58,172][25689] Avg episode reward: [(0, '-3.382')] [2022-07-10 14:40:58,332][26022] Updated weights on worker 0-0, policy_version 766542 (0.00089) [2022-07-10 14:41:00,244][26022] Updated weights on worker 0-0, policy_version 766552 (0.00089) [2022-07-10 14:41:02,533][26022] Updated weights on worker 0-0, policy_version 766562 (0.00464) [2022-07-10 14:41:03,227][25689] Fps is (10 sec: 5279.6, 60 sec: 5499.0, 300 sec: 5536.8). Total num frames: 784962560. Throughput: 0: 5735.5. Samples: 784965718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:03,231][25689] Avg episode reward: [(0, '-2.561')] [2022-07-10 14:41:04,147][26022] Updated weights on worker 0-0, policy_version 766572 (0.00085) [2022-07-10 14:41:06,064][26022] Updated weights on worker 0-0, policy_version 766582 (0.00089) [2022-07-10 14:41:07,898][26022] Updated weights on worker 0-0, policy_version 766592 (0.00086) [2022-07-10 14:41:08,242][25689] Fps is (10 sec: 5185.9, 60 sec: 5499.1, 300 sec: 5529.7). Total num frames: 784990208. Throughput: 0: 5730.3. Samples: 784999232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:08,242][25689] Avg episode reward: [(0, '-2.782')] [2022-07-10 14:41:09,642][26022] Updated weights on worker 0-0, policy_version 766602 (0.00086) [2022-07-10 14:41:11,710][26022] Updated weights on worker 0-0, policy_version 766612 (0.00086) [2022-07-10 14:41:13,218][26022] Updated weights on worker 0-0, policy_version 766622 (0.00092) [2022-07-10 14:41:13,271][25689] Fps is (10 sec: 5811.4, 60 sec: 5549.7, 300 sec: 5546.7). Total num frames: 785020928. Throughput: 0: 4884.5. Samples: 785016032. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:13,271][25689] Avg episode reward: [(0, '-2.212')] [2022-07-10 14:41:15,299][26022] Updated weights on worker 0-0, policy_version 766632 (0.00086) [2022-07-10 14:41:16,910][26022] Updated weights on worker 0-0, policy_version 766642 (0.00095) [2022-07-10 14:41:18,275][25689] Fps is (10 sec: 5817.6, 60 sec: 5550.8, 300 sec: 5538.0). Total num frames: 785048576. Throughput: 0: 5743.6. Samples: 785049896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:18,275][25689] Avg episode reward: [(0, '-1.785')] [2022-07-10 14:41:18,895][26022] Updated weights on worker 0-0, policy_version 766652 (0.00092) [2022-07-10 14:41:20,664][26022] Updated weights on worker 0-0, policy_version 766662 (0.00506) [2022-07-10 14:41:22,687][26022] Updated weights on worker 0-0, policy_version 766672 (0.00089) [2022-07-10 14:41:23,399][25689] Fps is (10 sec: 5459.8, 60 sec: 5519.1, 300 sec: 5539.3). Total num frames: 785076224. Throughput: 0: 5834.8. Samples: 785083416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:23,399][25689] Avg episode reward: [(0, '-0.748')] [2022-07-10 14:41:24,359][26022] Updated weights on worker 0-0, policy_version 766682 (0.00080) [2022-07-10 14:41:26,128][26022] Updated weights on worker 0-0, policy_version 766692 (0.00087) [2022-07-10 14:41:27,847][26022] Updated weights on worker 0-0, policy_version 766702 (0.00084) [2022-07-10 14:41:28,463][25689] Fps is (10 sec: 5528.0, 60 sec: 5550.0, 300 sec: 5541.9). Total num frames: 785104896. Throughput: 0: 4995.9. Samples: 785100254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:28,463][25689] Avg episode reward: [(0, '-0.617')] [2022-07-10 14:41:29,888][26022] Updated weights on worker 0-0, policy_version 766712 (0.00084) [2022-07-10 14:41:31,613][26022] Updated weights on worker 0-0, policy_version 766722 (0.00084) [2022-07-10 14:41:33,534][25689] Fps is (10 sec: 5556.6, 60 sec: 5543.8, 300 sec: 5537.4). Total num frames: 785132544. Throughput: 0: 5799.4. Samples: 785133548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:33,535][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 14:41:33,558][26022] Updated weights on worker 0-0, policy_version 766732 (0.00085) [2022-07-10 14:41:35,402][26022] Updated weights on worker 0-0, policy_version 766742 (0.00090) [2022-07-10 14:41:37,328][26022] Updated weights on worker 0-0, policy_version 766752 (0.00085) [2022-07-10 14:41:38,616][25689] Fps is (10 sec: 5546.8, 60 sec: 5544.0, 300 sec: 5540.4). Total num frames: 785161216. Throughput: 0: 5776.2. Samples: 785167394. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:38,617][25689] Avg episode reward: [(0, '-0.103')] [2022-07-10 14:41:38,984][26022] Updated weights on worker 0-0, policy_version 766762 (0.00089) [2022-07-10 14:41:41,042][26022] Updated weights on worker 0-0, policy_version 766772 (0.00095) [2022-07-10 14:41:42,625][26022] Updated weights on worker 0-0, policy_version 766782 (0.00088) [2022-07-10 14:41:43,689][25689] Fps is (10 sec: 5546.4, 60 sec: 5544.7, 300 sec: 5539.5). Total num frames: 785188864. Throughput: 0: 4968.5. Samples: 785184228. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:43,689][25689] Avg episode reward: [(0, '-0.005')] [2022-07-10 14:41:44,459][26022] Updated weights on worker 0-0, policy_version 766792 (0.00087) [2022-07-10 14:41:46,323][26022] Updated weights on worker 0-0, policy_version 766802 (0.00078) [2022-07-10 14:41:48,013][26022] Updated weights on worker 0-0, policy_version 766812 (0.00090) [2022-07-10 14:41:48,706][25689] Fps is (10 sec: 5581.8, 60 sec: 5560.5, 300 sec: 5536.3). Total num frames: 785217536. Throughput: 0: 5828.9. Samples: 785218252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:48,707][25689] Avg episode reward: [(0, '0.195')] [2022-07-10 14:41:49,883][26022] Updated weights on worker 0-0, policy_version 766822 (0.00085) [2022-07-10 14:41:51,903][26022] Updated weights on worker 0-0, policy_version 766832 (0.00091) [2022-07-10 14:41:53,604][26022] Updated weights on worker 0-0, policy_version 766842 (0.00085) [2022-07-10 14:41:53,714][25689] Fps is (10 sec: 5719.7, 60 sec: 5561.8, 300 sec: 5540.0). Total num frames: 785246208. Throughput: 0: 5867.3. Samples: 785251950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:53,716][25689] Avg episode reward: [(0, '0.234')] [2022-07-10 14:41:55,527][26022] Updated weights on worker 0-0, policy_version 766852 (0.00097) [2022-07-10 14:41:57,073][26022] Updated weights on worker 0-0, policy_version 766862 (0.00086) [2022-07-10 14:41:58,745][25689] Fps is (10 sec: 5610.1, 60 sec: 5544.8, 300 sec: 5537.1). Total num frames: 785273856. Throughput: 0: 5036.4. Samples: 785268772. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:41:58,747][25689] Avg episode reward: [(0, '-0.694')] [2022-07-10 14:41:59,210][26022] Updated weights on worker 0-0, policy_version 766872 (0.00088) [2022-07-10 14:42:01,059][26022] Updated weights on worker 0-0, policy_version 766882 (0.00095) [2022-07-10 14:42:03,155][26022] Updated weights on worker 0-0, policy_version 766892 (0.00092) [2022-07-10 14:42:03,874][25689] Fps is (10 sec: 5442.9, 60 sec: 5588.8, 300 sec: 5542.1). Total num frames: 785301504. Throughput: 0: 5834.2. Samples: 785301992. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:03,876][25689] Avg episode reward: [(0, '-0.404')] [2022-07-10 14:42:04,968][26022] Updated weights on worker 0-0, policy_version 766902 (0.00088) [2022-07-10 14:42:06,646][26022] Updated weights on worker 0-0, policy_version 766912 (0.00090) [2022-07-10 14:42:08,509][26022] Updated weights on worker 0-0, policy_version 766922 (0.00091) [2022-07-10 14:42:08,899][25689] Fps is (10 sec: 5446.1, 60 sec: 5587.9, 300 sec: 5542.6). Total num frames: 785329152. Throughput: 0: 5747.6. Samples: 785334310. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:08,901][25689] Avg episode reward: [(0, '-2.073')] [2022-07-10 14:42:10,313][26022] Updated weights on worker 0-0, policy_version 766932 (0.00089) [2022-07-10 14:42:12,192][26022] Updated weights on worker 0-0, policy_version 766942 (0.00086) [2022-07-10 14:42:13,945][25689] Fps is (10 sec: 5490.5, 60 sec: 5535.7, 300 sec: 5538.5). Total num frames: 785356800. Throughput: 0: 4900.8. Samples: 785351096. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:13,945][25689] Avg episode reward: [(0, '-2.543')] [2022-07-10 14:42:14,180][26022] Updated weights on worker 0-0, policy_version 766952 (0.00088) [2022-07-10 14:42:15,827][26022] Updated weights on worker 0-0, policy_version 766962 (0.00088) [2022-07-10 14:42:17,807][26022] Updated weights on worker 0-0, policy_version 766972 (0.00080) [2022-07-10 14:42:18,969][25689] Fps is (10 sec: 5592.8, 60 sec: 5550.7, 300 sec: 5540.8). Total num frames: 785385472. Throughput: 0: 5751.2. Samples: 785385082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:18,970][25689] Avg episode reward: [(0, '-2.091')] [2022-07-10 14:42:19,342][26022] Updated weights on worker 0-0, policy_version 766982 (0.00085) [2022-07-10 14:42:21,203][26022] Updated weights on worker 0-0, policy_version 766992 (0.00083) [2022-07-10 14:42:23,367][26022] Updated weights on worker 0-0, policy_version 767002 (0.00083) [2022-07-10 14:42:24,043][25689] Fps is (10 sec: 5780.2, 60 sec: 5589.1, 300 sec: 5553.4). Total num frames: 785415168. Throughput: 0: 5786.5. Samples: 785418700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:24,043][25689] Avg episode reward: [(0, '-2.205')] [2022-07-10 14:42:25,102][26022] Updated weights on worker 0-0, policy_version 767012 (0.00081) [2022-07-10 14:42:26,915][26022] Updated weights on worker 0-0, policy_version 767022 (0.00088) [2022-07-10 14:42:28,765][26022] Updated weights on worker 0-0, policy_version 767032 (0.00087) [2022-07-10 14:42:29,069][25689] Fps is (10 sec: 5677.2, 60 sec: 5575.6, 300 sec: 5543.6). Total num frames: 785442816. Throughput: 0: 5848.7. Samples: 785452284. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:29,070][25689] Avg episode reward: [(0, '-2.287')] [2022-07-10 14:42:30,548][26022] Updated weights on worker 0-0, policy_version 767042 (0.00092) [2022-07-10 14:42:32,480][26022] Updated weights on worker 0-0, policy_version 767052 (0.00085) [2022-07-10 14:42:34,104][25689] Fps is (10 sec: 5496.0, 60 sec: 5579.0, 300 sec: 5550.3). Total num frames: 785470464. Throughput: 0: 5842.2. Samples: 785468870. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:34,105][25689] Avg episode reward: [(0, '-2.433')] [2022-07-10 14:42:34,188][26022] Updated weights on worker 0-0, policy_version 767062 (0.00096) [2022-07-10 14:42:36,105][26022] Updated weights on worker 0-0, policy_version 767072 (0.00086) [2022-07-10 14:42:37,977][26022] Updated weights on worker 0-0, policy_version 767082 (0.00087) [2022-07-10 14:42:39,050][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:42:39,059][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000767088_785498112.pth [2022-07-10 14:42:39,059][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000765137_783500288.pth [2022-07-10 14:42:39,137][25689] Fps is (10 sec: 5594.3, 60 sec: 5583.6, 300 sec: 5544.3). Total num frames: 785499136. Throughput: 0: 5831.6. Samples: 785502694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 14:42:39,137][25689] Avg episode reward: [(0, '-1.706')] [2022-07-10 14:42:39,613][26022] Updated weights on worker 0-0, policy_version 767092 (0.00086) [2022-07-10 14:42:41,594][26022] Updated weights on worker 0-0, policy_version 767102 (0.00097) [2022-07-10 14:42:43,363][26022] Updated weights on worker 0-0, policy_version 767112 (0.00091) [2022-07-10 14:42:44,218][25689] Fps is (10 sec: 5568.5, 60 sec: 5582.7, 300 sec: 5544.0). Total num frames: 785526784. Throughput: 0: 5830.2. Samples: 785536326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:42:44,219][25689] Avg episode reward: [(0, '-1.630')] [2022-07-10 14:42:45,264][26022] Updated weights on worker 0-0, policy_version 767122 (0.00089) [2022-07-10 14:42:47,002][26022] Updated weights on worker 0-0, policy_version 767132 (0.00086) [2022-07-10 14:42:48,767][26022] Updated weights on worker 0-0, policy_version 767142 (0.00090) [2022-07-10 14:42:49,233][25689] Fps is (10 sec: 5578.3, 60 sec: 5583.0, 300 sec: 5544.2). Total num frames: 785555456. Throughput: 0: 4998.7. Samples: 785553076. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:42:49,233][25689] Avg episode reward: [(0, '-2.303')] [2022-07-10 14:42:50,694][26022] Updated weights on worker 0-0, policy_version 767152 (0.00083) [2022-07-10 14:42:52,330][26022] Updated weights on worker 0-0, policy_version 767162 (0.00104) [2022-07-10 14:42:54,104][26022] Updated weights on worker 0-0, policy_version 767172 (0.00085) [2022-07-10 14:42:54,250][25689] Fps is (10 sec: 5716.0, 60 sec: 5582.1, 300 sec: 5542.3). Total num frames: 785584128. Throughput: 0: 5858.8. Samples: 785586902. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:42:54,251][25689] Avg episode reward: [(0, '-2.361')] [2022-07-10 14:42:56,116][26022] Updated weights on worker 0-0, policy_version 767182 (0.00085) [2022-07-10 14:42:57,787][26022] Updated weights on worker 0-0, policy_version 767192 (0.00083) [2022-07-10 14:42:59,268][25689] Fps is (10 sec: 5510.4, 60 sec: 5566.4, 300 sec: 5543.4). Total num frames: 785610752. Throughput: 0: 5873.6. Samples: 785620936. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:42:59,268][25689] Avg episode reward: [(0, '-1.942')] [2022-07-10 14:42:59,717][26022] Updated weights on worker 0-0, policy_version 767202 (0.00087) [2022-07-10 14:43:01,790][26022] Updated weights on worker 0-0, policy_version 767212 (0.00086) [2022-07-10 14:43:03,693][26022] Updated weights on worker 0-0, policy_version 767222 (0.00092) [2022-07-10 14:43:04,383][25689] Fps is (10 sec: 5356.3, 60 sec: 5567.7, 300 sec: 5548.4). Total num frames: 785638400. Throughput: 0: 4944.6. Samples: 785636032. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:04,383][25689] Avg episode reward: [(0, '-1.641')] [2022-07-10 14:43:05,870][26022] Updated weights on worker 0-0, policy_version 767232 (0.00091) [2022-07-10 14:43:07,362][26022] Updated weights on worker 0-0, policy_version 767242 (0.00086) [2022-07-10 14:43:09,333][26022] Updated weights on worker 0-0, policy_version 767252 (0.00092) [2022-07-10 14:43:09,427][25689] Fps is (10 sec: 5544.0, 60 sec: 5582.9, 300 sec: 5544.4). Total num frames: 785667072. Throughput: 0: 5747.2. Samples: 785669132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:09,427][25689] Avg episode reward: [(0, '-2.490')] [2022-07-10 14:43:11,179][26022] Updated weights on worker 0-0, policy_version 767262 (0.00083) [2022-07-10 14:43:12,834][26022] Updated weights on worker 0-0, policy_version 767272 (0.00093) [2022-07-10 14:43:14,464][25689] Fps is (10 sec: 5484.8, 60 sec: 5566.7, 300 sec: 5544.0). Total num frames: 785693696. Throughput: 0: 5724.1. Samples: 785702610. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:14,465][25689] Avg episode reward: [(0, '-3.034')] [2022-07-10 14:43:14,776][26022] Updated weights on worker 0-0, policy_version 767282 (0.00088) [2022-07-10 14:43:16,525][26022] Updated weights on worker 0-0, policy_version 767292 (0.00093) [2022-07-10 14:43:18,428][26022] Updated weights on worker 0-0, policy_version 767302 (0.00089) [2022-07-10 14:43:19,522][25689] Fps is (10 sec: 5578.7, 60 sec: 5580.5, 300 sec: 5552.3). Total num frames: 785723392. Throughput: 0: 4869.5. Samples: 785719564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:19,523][25689] Avg episode reward: [(0, '-2.035')] [2022-07-10 14:43:20,356][26022] Updated weights on worker 0-0, policy_version 767312 (0.00089) [2022-07-10 14:43:22,125][26022] Updated weights on worker 0-0, policy_version 767322 (0.00100) [2022-07-10 14:43:23,908][26022] Updated weights on worker 0-0, policy_version 767332 (0.00087) [2022-07-10 14:43:24,624][25689] Fps is (10 sec: 5543.5, 60 sec: 5527.3, 300 sec: 5543.9). Total num frames: 785750016. Throughput: 0: 5775.8. Samples: 785752942. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:24,624][25689] Avg episode reward: [(0, '-1.207')] [2022-07-10 14:43:25,677][26022] Updated weights on worker 0-0, policy_version 767342 (0.00050) [2022-07-10 14:43:27,620][26022] Updated weights on worker 0-0, policy_version 767352 (0.00086) [2022-07-10 14:43:29,368][26022] Updated weights on worker 0-0, policy_version 767362 (0.00090) [2022-07-10 14:43:29,679][25689] Fps is (10 sec: 5444.0, 60 sec: 5541.5, 300 sec: 5550.2). Total num frames: 785778688. Throughput: 0: 5797.7. Samples: 785786552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:29,680][25689] Avg episode reward: [(0, '-1.710')] [2022-07-10 14:43:31,372][26022] Updated weights on worker 0-0, policy_version 767372 (0.00085) [2022-07-10 14:43:33,275][26022] Updated weights on worker 0-0, policy_version 767382 (0.00086) [2022-07-10 14:43:34,708][25689] Fps is (10 sec: 5686.7, 60 sec: 5559.0, 300 sec: 5549.9). Total num frames: 785807360. Throughput: 0: 4956.5. Samples: 785802948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:34,708][25689] Avg episode reward: [(0, '-0.479')] [2022-07-10 14:43:35,056][26022] Updated weights on worker 0-0, policy_version 767392 (0.00089) [2022-07-10 14:43:36,799][26022] Updated weights on worker 0-0, policy_version 767402 (0.00086) [2022-07-10 14:43:38,747][26022] Updated weights on worker 0-0, policy_version 767412 (0.00097) [2022-07-10 14:43:39,794][25689] Fps is (10 sec: 5568.0, 60 sec: 5537.2, 300 sec: 5542.9). Total num frames: 785835008. Throughput: 0: 5780.0. Samples: 785836738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:39,796][25689] Avg episode reward: [(0, '-0.880')] [2022-07-10 14:43:40,455][26022] Updated weights on worker 0-0, policy_version 767422 (0.00095) [2022-07-10 14:43:42,368][26022] Updated weights on worker 0-0, policy_version 767432 (0.00095) [2022-07-10 14:43:44,080][26022] Updated weights on worker 0-0, policy_version 767442 (0.00090) [2022-07-10 14:43:44,834][25689] Fps is (10 sec: 5662.8, 60 sec: 5574.7, 300 sec: 5550.0). Total num frames: 785864704. Throughput: 0: 5812.3. Samples: 785870410. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:44,835][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 14:43:46,137][26022] Updated weights on worker 0-0, policy_version 767452 (0.00091) [2022-07-10 14:43:47,683][26022] Updated weights on worker 0-0, policy_version 767462 (0.00095) [2022-07-10 14:43:49,872][25689] Fps is (10 sec: 5487.0, 60 sec: 5522.0, 300 sec: 5542.9). Total num frames: 785890304. Throughput: 0: 4990.0. Samples: 785887312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:49,872][25689] Avg episode reward: [(0, '-1.139')] [2022-07-10 14:43:49,926][26022] Updated weights on worker 0-0, policy_version 767472 (0.00083) [2022-07-10 14:43:51,272][26022] Updated weights on worker 0-0, policy_version 767482 (0.00094) [2022-07-10 14:43:53,621][26022] Updated weights on worker 0-0, policy_version 767492 (0.00084) [2022-07-10 14:43:54,882][25689] Fps is (10 sec: 5605.2, 60 sec: 5556.4, 300 sec: 5550.2). Total num frames: 785921024. Throughput: 0: 5834.2. Samples: 785920648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:54,883][25689] Avg episode reward: [(0, '-1.702')] [2022-07-10 14:43:55,202][26022] Updated weights on worker 0-0, policy_version 767502 (0.00089) [2022-07-10 14:43:57,183][26022] Updated weights on worker 0-0, policy_version 767512 (0.00088) [2022-07-10 14:43:58,946][26022] Updated weights on worker 0-0, policy_version 767522 (0.00089) [2022-07-10 14:43:59,915][25689] Fps is (10 sec: 5709.8, 60 sec: 5555.0, 300 sec: 5554.7). Total num frames: 785947648. Throughput: 0: 5836.9. Samples: 785954180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:43:59,915][25689] Avg episode reward: [(0, '-1.158')] [2022-07-10 14:44:00,733][26022] Updated weights on worker 0-0, policy_version 767532 (0.00091) [2022-07-10 14:44:02,879][26022] Updated weights on worker 0-0, policy_version 767542 (0.00100) [2022-07-10 14:44:04,929][26022] Updated weights on worker 0-0, policy_version 767552 (0.00091) [2022-07-10 14:44:05,024][25689] Fps is (10 sec: 5149.2, 60 sec: 5521.8, 300 sec: 5543.7). Total num frames: 785973248. Throughput: 0: 4862.3. Samples: 785968580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:05,025][25689] Avg episode reward: [(0, '-1.150')] [2022-07-10 14:44:06,647][26022] Updated weights on worker 0-0, policy_version 767562 (0.00089) [2022-07-10 14:44:08,720][26022] Updated weights on worker 0-0, policy_version 767572 (0.00089) [2022-07-10 14:44:10,045][25689] Fps is (10 sec: 5357.7, 60 sec: 5523.9, 300 sec: 5547.8). Total num frames: 786001920. Throughput: 0: 5675.1. Samples: 786001794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:10,047][25689] Avg episode reward: [(0, '-1.173')] [2022-07-10 14:44:10,300][26022] Updated weights on worker 0-0, policy_version 767582 (0.00091) [2022-07-10 14:44:12,328][26022] Updated weights on worker 0-0, policy_version 767592 (0.00088) [2022-07-10 14:44:13,935][26022] Updated weights on worker 0-0, policy_version 767602 (0.00086) [2022-07-10 14:44:15,067][25689] Fps is (10 sec: 5710.1, 60 sec: 5559.1, 300 sec: 5547.7). Total num frames: 786030592. Throughput: 0: 5688.1. Samples: 786035460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:15,067][25689] Avg episode reward: [(0, '-1.656')] [2022-07-10 14:44:15,971][26022] Updated weights on worker 0-0, policy_version 767612 (0.00094) [2022-07-10 14:44:17,623][26022] Updated weights on worker 0-0, policy_version 767622 (0.00088) [2022-07-10 14:44:19,604][26022] Updated weights on worker 0-0, policy_version 767632 (0.00095) [2022-07-10 14:44:20,086][25689] Fps is (10 sec: 5506.7, 60 sec: 5511.9, 300 sec: 5545.0). Total num frames: 786057216. Throughput: 0: 4865.3. Samples: 786052320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:20,087][25689] Avg episode reward: [(0, '-2.041')] [2022-07-10 14:44:21,315][26022] Updated weights on worker 0-0, policy_version 767642 (0.00080) [2022-07-10 14:44:23,283][26022] Updated weights on worker 0-0, policy_version 767652 (0.00080) [2022-07-10 14:44:24,951][26022] Updated weights on worker 0-0, policy_version 767662 (0.00092) [2022-07-10 14:44:25,234][25689] Fps is (10 sec: 5640.3, 60 sec: 5575.3, 300 sec: 5549.2). Total num frames: 786087936. Throughput: 0: 5801.5. Samples: 786085826. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:25,234][25689] Avg episode reward: [(0, '-1.652')] [2022-07-10 14:44:27,076][26022] Updated weights on worker 0-0, policy_version 767672 (0.00086) [2022-07-10 14:44:28,594][26022] Updated weights on worker 0-0, policy_version 767682 (0.00080) [2022-07-10 14:44:30,271][25689] Fps is (10 sec: 5630.7, 60 sec: 5543.2, 300 sec: 5548.6). Total num frames: 786114560. Throughput: 0: 5822.8. Samples: 786119566. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:30,271][25689] Avg episode reward: [(0, '-2.160')] [2022-07-10 14:44:30,653][26022] Updated weights on worker 0-0, policy_version 767692 (0.00088) [2022-07-10 14:44:32,252][26022] Updated weights on worker 0-0, policy_version 767702 (0.00090) [2022-07-10 14:44:34,270][26022] Updated weights on worker 0-0, policy_version 767712 (0.00087) [2022-07-10 14:44:35,275][25689] Fps is (10 sec: 5507.0, 60 sec: 5545.5, 300 sec: 5552.9). Total num frames: 786143232. Throughput: 0: 4982.2. Samples: 786136140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:35,276][25689] Avg episode reward: [(0, '-2.073')] [2022-07-10 14:44:35,941][26022] Updated weights on worker 0-0, policy_version 767722 (0.00089) [2022-07-10 14:44:38,050][26022] Updated weights on worker 0-0, policy_version 767732 (0.00090) [2022-07-10 14:44:39,212][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:44:39,227][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000767740_786165760.pth [2022-07-10 14:44:39,228][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000765788_784166912.pth [2022-07-10 14:44:39,573][26022] Updated weights on worker 0-0, policy_version 767742 (0.00089) [2022-07-10 14:44:40,286][25689] Fps is (10 sec: 5520.9, 60 sec: 5535.4, 300 sec: 5546.8). Total num frames: 786169856. Throughput: 0: 5806.4. Samples: 786169610. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:40,288][25689] Avg episode reward: [(0, '-0.956')] [2022-07-10 14:44:41,675][26022] Updated weights on worker 0-0, policy_version 767752 (0.00097) [2022-07-10 14:44:43,398][26022] Updated weights on worker 0-0, policy_version 767762 (0.00095) [2022-07-10 14:44:45,275][26022] Updated weights on worker 0-0, policy_version 767772 (0.00083) [2022-07-10 14:44:45,331][25689] Fps is (10 sec: 5600.4, 60 sec: 5535.0, 300 sec: 5549.7). Total num frames: 786199552. Throughput: 0: 5842.2. Samples: 786203240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:45,332][25689] Avg episode reward: [(0, '-2.110')] [2022-07-10 14:44:47,029][26022] Updated weights on worker 0-0, policy_version 767782 (0.00104) [2022-07-10 14:44:48,916][26022] Updated weights on worker 0-0, policy_version 767792 (0.00095) [2022-07-10 14:44:50,398][25689] Fps is (10 sec: 5569.6, 60 sec: 5549.2, 300 sec: 5545.4). Total num frames: 786226176. Throughput: 0: 5830.6. Samples: 786236924. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:50,399][25689] Avg episode reward: [(0, '-1.979')] [2022-07-10 14:44:50,786][26022] Updated weights on worker 0-0, policy_version 767802 (0.00086) [2022-07-10 14:44:52,467][26022] Updated weights on worker 0-0, policy_version 767812 (0.00079) [2022-07-10 14:44:54,291][26022] Updated weights on worker 0-0, policy_version 767822 (0.00087) [2022-07-10 14:44:55,476][25689] Fps is (10 sec: 5551.6, 60 sec: 5526.1, 300 sec: 5547.9). Total num frames: 786255872. Throughput: 0: 5815.8. Samples: 786253628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:44:55,476][25689] Avg episode reward: [(0, '-1.975')] [2022-07-10 14:44:56,295][26022] Updated weights on worker 0-0, policy_version 767832 (0.00088) [2022-07-10 14:44:58,001][26022] Updated weights on worker 0-0, policy_version 767842 (0.00083) [2022-07-10 14:44:59,900][26022] Updated weights on worker 0-0, policy_version 767852 (0.00082) [2022-07-10 14:45:00,540][25689] Fps is (10 sec: 5654.1, 60 sec: 5540.1, 300 sec: 5552.6). Total num frames: 786283520. Throughput: 0: 5818.1. Samples: 786287450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:00,541][25689] Avg episode reward: [(0, '-1.890')] [2022-07-10 14:45:02,064][26022] Updated weights on worker 0-0, policy_version 767862 (0.00089) [2022-07-10 14:45:04,022][26022] Updated weights on worker 0-0, policy_version 767872 (0.00086) [2022-07-10 14:45:05,636][25689] Fps is (10 sec: 5240.8, 60 sec: 5541.3, 300 sec: 5544.3). Total num frames: 786309120. Throughput: 0: 5685.2. Samples: 786318680. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:05,637][25689] Avg episode reward: [(0, '-1.710')] [2022-07-10 14:45:05,835][26022] Updated weights on worker 0-0, policy_version 767882 (0.00083) [2022-07-10 14:45:07,567][26022] Updated weights on worker 0-0, policy_version 767892 (0.00085) [2022-07-10 14:45:09,447][26022] Updated weights on worker 0-0, policy_version 767902 (0.00093) [2022-07-10 14:45:10,691][25689] Fps is (10 sec: 5447.5, 60 sec: 5555.1, 300 sec: 5550.6). Total num frames: 786338816. Throughput: 0: 4856.2. Samples: 786335470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:10,692][25689] Avg episode reward: [(0, '-1.124')] [2022-07-10 14:45:11,315][26022] Updated weights on worker 0-0, policy_version 767912 (0.00087) [2022-07-10 14:45:13,067][26022] Updated weights on worker 0-0, policy_version 767922 (0.00087) [2022-07-10 14:45:14,935][26022] Updated weights on worker 0-0, policy_version 767932 (0.00094) [2022-07-10 14:45:15,720][25689] Fps is (10 sec: 5686.9, 60 sec: 5537.6, 300 sec: 5550.4). Total num frames: 786366464. Throughput: 0: 5714.6. Samples: 786369314. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:15,720][25689] Avg episode reward: [(0, '-0.310')] [2022-07-10 14:45:16,593][26022] Updated weights on worker 0-0, policy_version 767942 (0.00090) [2022-07-10 14:45:18,574][26022] Updated weights on worker 0-0, policy_version 767952 (0.00086) [2022-07-10 14:45:20,309][26022] Updated weights on worker 0-0, policy_version 767962 (0.00089) [2022-07-10 14:45:20,762][25689] Fps is (10 sec: 5592.2, 60 sec: 5569.2, 300 sec: 5548.9). Total num frames: 786395136. Throughput: 0: 5711.4. Samples: 786402946. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:20,763][25689] Avg episode reward: [(0, '-0.151')] [2022-07-10 14:45:22,123][26022] Updated weights on worker 0-0, policy_version 767972 (0.00092) [2022-07-10 14:45:24,091][26022] Updated weights on worker 0-0, policy_version 767982 (0.00538) [2022-07-10 14:45:25,748][26022] Updated weights on worker 0-0, policy_version 767992 (0.00095) [2022-07-10 14:45:25,823][25689] Fps is (10 sec: 5676.0, 60 sec: 5543.4, 300 sec: 5555.3). Total num frames: 786423808. Throughput: 0: 4991.5. Samples: 786419440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:25,823][25689] Avg episode reward: [(0, '-0.173')] [2022-07-10 14:45:27,692][26022] Updated weights on worker 0-0, policy_version 768002 (0.00088) [2022-07-10 14:45:29,480][26022] Updated weights on worker 0-0, policy_version 768012 (0.00092) [2022-07-10 14:45:30,842][25689] Fps is (10 sec: 5486.1, 60 sec: 5545.1, 300 sec: 5551.5). Total num frames: 786450432. Throughput: 0: 5838.5. Samples: 786453118. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:30,842][25689] Avg episode reward: [(0, '-1.570')] [2022-07-10 14:45:31,393][26022] Updated weights on worker 0-0, policy_version 768022 (0.00088) [2022-07-10 14:45:33,282][26022] Updated weights on worker 0-0, policy_version 768032 (0.00081) [2022-07-10 14:45:35,057][26022] Updated weights on worker 0-0, policy_version 768042 (0.00087) [2022-07-10 14:45:35,849][25689] Fps is (10 sec: 5412.9, 60 sec: 5527.9, 300 sec: 5549.5). Total num frames: 786478080. Throughput: 0: 5828.6. Samples: 786486640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:35,853][25689] Avg episode reward: [(0, '-0.952')] [2022-07-10 14:45:36,830][26022] Updated weights on worker 0-0, policy_version 768052 (0.00095) [2022-07-10 14:45:38,623][26022] Updated weights on worker 0-0, policy_version 768062 (0.00091) [2022-07-10 14:45:40,450][26022] Updated weights on worker 0-0, policy_version 768072 (0.00083) [2022-07-10 14:45:40,890][25689] Fps is (10 sec: 5706.8, 60 sec: 5575.9, 300 sec: 5557.1). Total num frames: 786507776. Throughput: 0: 4999.1. Samples: 786503566. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:40,890][25689] Avg episode reward: [(0, '-1.226')] [2022-07-10 14:45:42,300][26022] Updated weights on worker 0-0, policy_version 768082 (0.00085) [2022-07-10 14:45:43,997][26022] Updated weights on worker 0-0, policy_version 768092 (0.00104) [2022-07-10 14:45:45,956][25689] Fps is (10 sec: 5774.9, 60 sec: 5557.0, 300 sec: 5559.4). Total num frames: 786536448. Throughput: 0: 5880.6. Samples: 786537838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:45,958][25689] Avg episode reward: [(0, '-0.749')] [2022-07-10 14:45:45,963][26022] Updated weights on worker 0-0, policy_version 768102 (0.00085) [2022-07-10 14:45:47,585][26022] Updated weights on worker 0-0, policy_version 768112 (0.00087) [2022-07-10 14:45:49,702][26022] Updated weights on worker 0-0, policy_version 768122 (0.00087) [2022-07-10 14:45:51,025][25689] Fps is (10 sec: 5657.8, 60 sec: 5590.7, 300 sec: 5558.6). Total num frames: 786565120. Throughput: 0: 5861.3. Samples: 786571420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:51,026][25689] Avg episode reward: [(0, '-0.519')] [2022-07-10 14:45:51,381][26022] Updated weights on worker 0-0, policy_version 768132 (0.00082) [2022-07-10 14:45:53,409][26022] Updated weights on worker 0-0, policy_version 768142 (0.00090) [2022-07-10 14:45:55,003][26022] Updated weights on worker 0-0, policy_version 768152 (0.00081) [2022-07-10 14:45:56,042][25689] Fps is (10 sec: 5482.4, 60 sec: 5545.5, 300 sec: 5551.9). Total num frames: 786591744. Throughput: 0: 5026.6. Samples: 786588144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:45:56,043][25689] Avg episode reward: [(0, '-0.779')] [2022-07-10 14:45:57,098][26022] Updated weights on worker 0-0, policy_version 768162 (0.00096) [2022-07-10 14:45:58,811][26022] Updated weights on worker 0-0, policy_version 768172 (0.00087) [2022-07-10 14:46:00,730][26022] Updated weights on worker 0-0, policy_version 768182 (0.00095) [2022-07-10 14:46:01,091][25689] Fps is (10 sec: 5493.8, 60 sec: 5563.9, 300 sec: 5565.8). Total num frames: 786620416. Throughput: 0: 5857.0. Samples: 786621882. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:46:01,091][25689] Avg episode reward: [(0, '0.127')] [2022-07-10 14:46:02,663][26022] Updated weights on worker 0-0, policy_version 768192 (0.00093) [2022-07-10 14:46:04,765][26022] Updated weights on worker 0-0, policy_version 768202 (0.00080) [2022-07-10 14:46:06,209][25689] Fps is (10 sec: 5439.3, 60 sec: 5578.8, 300 sec: 5560.5). Total num frames: 786647040. Throughput: 0: 5698.5. Samples: 786653246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 14:46:06,209][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 14:46:06,439][26022] Updated weights on worker 0-0, policy_version 768212 (0.00091) [2022-07-10 14:46:08,495][26022] Updated weights on worker 0-0, policy_version 768222 (0.00088) [2022-07-10 14:46:10,148][26022] Updated weights on worker 0-0, policy_version 768232 (0.00092) [2022-07-10 14:46:11,289][25689] Fps is (10 sec: 5221.4, 60 sec: 5525.8, 300 sec: 5545.8). Total num frames: 786673664. Throughput: 0: 4860.2. Samples: 786669904. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:11,289][25689] Avg episode reward: [(0, '-0.632')] [2022-07-10 14:46:11,890][26022] Updated weights on worker 0-0, policy_version 768242 (0.00095) [2022-07-10 14:46:13,767][26022] Updated weights on worker 0-0, policy_version 768252 (0.00091) [2022-07-10 14:46:15,347][26022] Updated weights on worker 0-0, policy_version 768262 (0.00092) [2022-07-10 14:46:16,313][25689] Fps is (10 sec: 5573.8, 60 sec: 5560.0, 300 sec: 5552.2). Total num frames: 786703360. Throughput: 0: 5695.3. Samples: 786703590. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:16,314][25689] Avg episode reward: [(0, '-2.315')] [2022-07-10 14:46:17,590][26022] Updated weights on worker 0-0, policy_version 768272 (0.00778) [2022-07-10 14:46:19,267][26022] Updated weights on worker 0-0, policy_version 768282 (0.00094) [2022-07-10 14:46:21,171][26022] Updated weights on worker 0-0, policy_version 768292 (0.00092) [2022-07-10 14:46:21,410][25689] Fps is (10 sec: 5767.0, 60 sec: 5555.0, 300 sec: 5556.2). Total num frames: 786732032. Throughput: 0: 5674.6. Samples: 786737184. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:21,410][25689] Avg episode reward: [(0, '-2.648')] [2022-07-10 14:46:23,066][26022] Updated weights on worker 0-0, policy_version 768302 (0.00083) [2022-07-10 14:46:24,678][26022] Updated weights on worker 0-0, policy_version 768312 (0.00088) [2022-07-10 14:46:26,492][25689] Fps is (10 sec: 5533.1, 60 sec: 5536.1, 300 sec: 5552.4). Total num frames: 786759680. Throughput: 0: 4956.5. Samples: 786753780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:26,492][25689] Avg episode reward: [(0, '-2.120')] [2022-07-10 14:46:26,872][26022] Updated weights on worker 0-0, policy_version 768322 (0.00093) [2022-07-10 14:46:28,368][26022] Updated weights on worker 0-0, policy_version 768332 (0.00093) [2022-07-10 14:46:30,564][26022] Updated weights on worker 0-0, policy_version 768342 (0.00088) [2022-07-10 14:46:31,593][25689] Fps is (10 sec: 5631.1, 60 sec: 5579.2, 300 sec: 5558.7). Total num frames: 786789376. Throughput: 0: 5774.0. Samples: 786787142. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:31,594][25689] Avg episode reward: [(0, '-2.109')] [2022-07-10 14:46:32,243][26022] Updated weights on worker 0-0, policy_version 768352 (0.00095) [2022-07-10 14:46:34,098][26022] Updated weights on worker 0-0, policy_version 768362 (0.00083) [2022-07-10 14:46:35,985][26022] Updated weights on worker 0-0, policy_version 768372 (0.00089) [2022-07-10 14:46:36,651][25689] Fps is (10 sec: 5543.5, 60 sec: 5557.7, 300 sec: 5552.3). Total num frames: 786816000. Throughput: 0: 5748.6. Samples: 786820508. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:36,652][25689] Avg episode reward: [(0, '-1.519')] [2022-07-10 14:46:37,776][26022] Updated weights on worker 0-0, policy_version 768382 (0.00084) [2022-07-10 14:46:39,265][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:46:39,279][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000768391_786832384.pth [2022-07-10 14:46:39,280][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000766435_784829440.pth [2022-07-10 14:46:39,667][26022] Updated weights on worker 0-0, policy_version 768392 (0.00088) [2022-07-10 14:46:41,395][26022] Updated weights on worker 0-0, policy_version 768402 (0.00084) [2022-07-10 14:46:41,740][25689] Fps is (10 sec: 5550.6, 60 sec: 5553.3, 300 sec: 5558.9). Total num frames: 786845696. Throughput: 0: 5760.6. Samples: 786854298. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:41,740][25689] Avg episode reward: [(0, '-1.374')] [2022-07-10 14:46:43,068][26022] Updated weights on worker 0-0, policy_version 768412 (0.00087) [2022-07-10 14:46:44,995][26022] Updated weights on worker 0-0, policy_version 768422 (0.00085) [2022-07-10 14:46:46,822][25689] Fps is (10 sec: 5537.5, 60 sec: 5518.2, 300 sec: 5550.8). Total num frames: 786872320. Throughput: 0: 5777.4. Samples: 786871236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:46,823][25689] Avg episode reward: [(0, '0.291')] [2022-07-10 14:46:46,967][26022] Updated weights on worker 0-0, policy_version 768432 (0.00087) [2022-07-10 14:46:48,651][26022] Updated weights on worker 0-0, policy_version 768442 (0.00630) [2022-07-10 14:46:50,575][26022] Updated weights on worker 0-0, policy_version 768452 (0.00090) [2022-07-10 14:46:51,839][25689] Fps is (10 sec: 5576.4, 60 sec: 5539.8, 300 sec: 5554.1). Total num frames: 786902016. Throughput: 0: 5826.8. Samples: 786905114. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:51,840][25689] Avg episode reward: [(0, '0.123')] [2022-07-10 14:46:52,202][26022] Updated weights on worker 0-0, policy_version 768462 (0.00089) [2022-07-10 14:46:54,043][26022] Updated weights on worker 0-0, policy_version 768472 (0.00093) [2022-07-10 14:46:55,942][26022] Updated weights on worker 0-0, policy_version 768482 (0.00084) [2022-07-10 14:46:56,865][25689] Fps is (10 sec: 5709.9, 60 sec: 5555.8, 300 sec: 5554.2). Total num frames: 786929664. Throughput: 0: 5863.7. Samples: 786939034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:46:56,865][25689] Avg episode reward: [(0, '-1.268')] [2022-07-10 14:46:57,699][26022] Updated weights on worker 0-0, policy_version 768492 (0.00074) [2022-07-10 14:46:59,657][26022] Updated weights on worker 0-0, policy_version 768502 (0.00088) [2022-07-10 14:47:01,480][26022] Updated weights on worker 0-0, policy_version 768512 (0.00084) [2022-07-10 14:47:01,872][25689] Fps is (10 sec: 5613.8, 60 sec: 5559.6, 300 sec: 5559.9). Total num frames: 786958336. Throughput: 0: 5045.1. Samples: 786955866. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:01,872][25689] Avg episode reward: [(0, '-2.376')] [2022-07-10 14:47:03,696][26022] Updated weights on worker 0-0, policy_version 768522 (0.00101) [2022-07-10 14:47:05,380][26022] Updated weights on worker 0-0, policy_version 768532 (0.00089) [2022-07-10 14:47:06,979][25689] Fps is (10 sec: 5467.4, 60 sec: 5560.6, 300 sec: 5554.9). Total num frames: 786984960. Throughput: 0: 5728.4. Samples: 786986702. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:06,979][25689] Avg episode reward: [(0, '-3.471')] [2022-07-10 14:47:07,346][26022] Updated weights on worker 0-0, policy_version 768542 (0.00093) [2022-07-10 14:47:09,166][26022] Updated weights on worker 0-0, policy_version 768552 (0.00087) [2022-07-10 14:47:11,218][26022] Updated weights on worker 0-0, policy_version 768562 (0.00092) [2022-07-10 14:47:12,003][25689] Fps is (10 sec: 5256.0, 60 sec: 5565.8, 300 sec: 5551.9). Total num frames: 787011584. Throughput: 0: 5690.1. Samples: 787019844. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:12,003][25689] Avg episode reward: [(0, '-3.186')] [2022-07-10 14:47:12,831][26022] Updated weights on worker 0-0, policy_version 768572 (0.00055) [2022-07-10 14:47:14,764][26022] Updated weights on worker 0-0, policy_version 768582 (0.00092) [2022-07-10 14:47:16,619][26022] Updated weights on worker 0-0, policy_version 768592 (0.00100) [2022-07-10 14:47:17,035][25689] Fps is (10 sec: 5396.9, 60 sec: 5531.3, 300 sec: 5548.3). Total num frames: 787039232. Throughput: 0: 4846.5. Samples: 787036788. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:17,035][25689] Avg episode reward: [(0, '-3.845')] [2022-07-10 14:47:18,247][26022] Updated weights on worker 0-0, policy_version 768602 (0.00090) [2022-07-10 14:47:20,432][26022] Updated weights on worker 0-0, policy_version 768612 (0.00094) [2022-07-10 14:47:21,923][26022] Updated weights on worker 0-0, policy_version 768622 (0.01121) [2022-07-10 14:47:22,042][25689] Fps is (10 sec: 5712.1, 60 sec: 5556.4, 300 sec: 5549.5). Total num frames: 787068928. Throughput: 0: 5665.9. Samples: 787070146. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:22,042][25689] Avg episode reward: [(0, '-5.231')] [2022-07-10 14:47:24,036][26022] Updated weights on worker 0-0, policy_version 768632 (0.00087) [2022-07-10 14:47:25,623][26022] Updated weights on worker 0-0, policy_version 768642 (0.00103) [2022-07-10 14:47:27,131][25689] Fps is (10 sec: 5578.6, 60 sec: 5538.9, 300 sec: 5545.0). Total num frames: 787095552. Throughput: 0: 5791.5. Samples: 787103412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:27,131][25689] Avg episode reward: [(0, '-3.208')] [2022-07-10 14:47:27,699][26022] Updated weights on worker 0-0, policy_version 768652 (0.00432) [2022-07-10 14:47:29,429][26022] Updated weights on worker 0-0, policy_version 768662 (0.00090) [2022-07-10 14:47:31,454][26022] Updated weights on worker 0-0, policy_version 768672 (0.00096) [2022-07-10 14:47:32,163][25689] Fps is (10 sec: 5362.3, 60 sec: 5511.4, 300 sec: 5545.0). Total num frames: 787123200. Throughput: 0: 4970.7. Samples: 787120054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:32,163][25689] Avg episode reward: [(0, '-3.292')] [2022-07-10 14:47:33,109][26022] Updated weights on worker 0-0, policy_version 768682 (0.00087) [2022-07-10 14:47:35,224][26022] Updated weights on worker 0-0, policy_version 768692 (0.00086) [2022-07-10 14:47:36,763][26022] Updated weights on worker 0-0, policy_version 768702 (0.00091) [2022-07-10 14:47:37,183][25689] Fps is (10 sec: 5704.7, 60 sec: 5565.6, 300 sec: 5548.7). Total num frames: 787152896. Throughput: 0: 5780.3. Samples: 787153248. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:37,183][25689] Avg episode reward: [(0, '-2.811')] [2022-07-10 14:47:38,736][26022] Updated weights on worker 0-0, policy_version 768712 (0.00086) [2022-07-10 14:47:40,564][26022] Updated weights on worker 0-0, policy_version 768722 (0.00090) [2022-07-10 14:47:42,191][25689] Fps is (10 sec: 5616.4, 60 sec: 5522.2, 300 sec: 5546.6). Total num frames: 787179520. Throughput: 0: 5796.3. Samples: 787186934. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:42,191][25689] Avg episode reward: [(0, '-2.960')] [2022-07-10 14:47:42,437][26022] Updated weights on worker 0-0, policy_version 768732 (0.00097) [2022-07-10 14:47:44,339][26022] Updated weights on worker 0-0, policy_version 768742 (0.00089) [2022-07-10 14:47:46,045][26022] Updated weights on worker 0-0, policy_version 768752 (0.00093) [2022-07-10 14:47:47,328][25689] Fps is (10 sec: 5450.6, 60 sec: 5551.1, 300 sec: 5544.3). Total num frames: 787208192. Throughput: 0: 4958.1. Samples: 787203552. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:47,328][25689] Avg episode reward: [(0, '-2.270')] [2022-07-10 14:47:47,932][26022] Updated weights on worker 0-0, policy_version 768762 (0.00092) [2022-07-10 14:47:49,767][26022] Updated weights on worker 0-0, policy_version 768772 (0.00091) [2022-07-10 14:47:51,644][26022] Updated weights on worker 0-0, policy_version 768782 (0.00090) [2022-07-10 14:47:52,367][25689] Fps is (10 sec: 5534.5, 60 sec: 5515.3, 300 sec: 5540.5). Total num frames: 787235840. Throughput: 0: 5790.0. Samples: 787237036. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:52,369][25689] Avg episode reward: [(0, '0.431')] [2022-07-10 14:47:53,541][26022] Updated weights on worker 0-0, policy_version 768792 (0.00080) [2022-07-10 14:47:55,203][26022] Updated weights on worker 0-0, policy_version 768802 (0.00089) [2022-07-10 14:47:57,020][26022] Updated weights on worker 0-0, policy_version 768812 (0.00089) [2022-07-10 14:47:57,386][25689] Fps is (10 sec: 5701.0, 60 sec: 5549.6, 300 sec: 5550.8). Total num frames: 787265536. Throughput: 0: 5810.0. Samples: 787270630. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:47:57,388][25689] Avg episode reward: [(0, '0.344')] [2022-07-10 14:47:59,177][26022] Updated weights on worker 0-0, policy_version 768822 (0.00091) [2022-07-10 14:48:00,795][26022] Updated weights on worker 0-0, policy_version 768832 (0.00085) [2022-07-10 14:48:02,408][25689] Fps is (10 sec: 5303.2, 60 sec: 5463.7, 300 sec: 5538.8). Total num frames: 787289088. Throughput: 0: 4967.0. Samples: 787287352. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:02,408][25689] Avg episode reward: [(0, '-0.155')] [2022-07-10 14:48:03,100][26022] Updated weights on worker 0-0, policy_version 768842 (0.00096) [2022-07-10 14:48:04,954][26022] Updated weights on worker 0-0, policy_version 768852 (0.00091) [2022-07-10 14:48:06,667][26022] Updated weights on worker 0-0, policy_version 768862 (0.00089) [2022-07-10 14:48:07,507][25689] Fps is (10 sec: 5261.1, 60 sec: 5515.1, 300 sec: 5541.2). Total num frames: 787318784. Throughput: 0: 5697.6. Samples: 787318528. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:07,508][25689] Avg episode reward: [(0, '0.050')] [2022-07-10 14:48:08,922][26022] Updated weights on worker 0-0, policy_version 768872 (0.00087) [2022-07-10 14:48:10,317][26022] Updated weights on worker 0-0, policy_version 768882 (0.00092) [2022-07-10 14:48:12,227][26022] Updated weights on worker 0-0, policy_version 768892 (0.00082) [2022-07-10 14:48:12,606][25689] Fps is (10 sec: 5723.2, 60 sec: 5542.1, 300 sec: 5546.9). Total num frames: 787347456. Throughput: 0: 5690.4. Samples: 787352206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:12,607][25689] Avg episode reward: [(0, '-0.286')] [2022-07-10 14:48:14,137][26022] Updated weights on worker 0-0, policy_version 768902 (0.00087) [2022-07-10 14:48:15,870][26022] Updated weights on worker 0-0, policy_version 768912 (0.00087) [2022-07-10 14:48:17,644][25689] Fps is (10 sec: 5455.2, 60 sec: 5524.7, 300 sec: 5537.0). Total num frames: 787374080. Throughput: 0: 4856.4. Samples: 787369008. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:17,644][25689] Avg episode reward: [(0, '-0.303')] [2022-07-10 14:48:17,741][26022] Updated weights on worker 0-0, policy_version 768922 (0.00092) [2022-07-10 14:48:19,466][26022] Updated weights on worker 0-0, policy_version 768932 (0.00093) [2022-07-10 14:48:21,346][26022] Updated weights on worker 0-0, policy_version 768942 (0.00103) [2022-07-10 14:48:22,737][25689] Fps is (10 sec: 5559.3, 60 sec: 5516.9, 300 sec: 5547.4). Total num frames: 787403776. Throughput: 0: 5672.1. Samples: 787402662. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:22,737][25689] Avg episode reward: [(0, '0.189')] [2022-07-10 14:48:23,348][26022] Updated weights on worker 0-0, policy_version 768952 (0.00083) [2022-07-10 14:48:24,993][26022] Updated weights on worker 0-0, policy_version 768962 (0.00087) [2022-07-10 14:48:27,152][26022] Updated weights on worker 0-0, policy_version 768972 (0.00092) [2022-07-10 14:48:27,842][25689] Fps is (10 sec: 5522.3, 60 sec: 5515.4, 300 sec: 5539.6). Total num frames: 787430400. Throughput: 0: 5763.7. Samples: 787435732. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:27,843][25689] Avg episode reward: [(0, '-1.152')] [2022-07-10 14:48:28,635][26022] Updated weights on worker 0-0, policy_version 768982 (0.00085) [2022-07-10 14:48:30,891][26022] Updated weights on worker 0-0, policy_version 768992 (0.00094) [2022-07-10 14:48:32,483][26022] Updated weights on worker 0-0, policy_version 769002 (0.00086) [2022-07-10 14:48:32,870][25689] Fps is (10 sec: 5558.1, 60 sec: 5549.5, 300 sec: 5543.1). Total num frames: 787460096. Throughput: 0: 4949.5. Samples: 787452500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:32,870][25689] Avg episode reward: [(0, '-1.598')] [2022-07-10 14:48:34,283][26022] Updated weights on worker 0-0, policy_version 769012 (0.00090) [2022-07-10 14:48:36,127][26022] Updated weights on worker 0-0, policy_version 769022 (0.00095) [2022-07-10 14:48:37,887][25689] Fps is (10 sec: 5607.1, 60 sec: 5499.2, 300 sec: 5540.9). Total num frames: 787486720. Throughput: 0: 5790.5. Samples: 787486226. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:37,887][25689] Avg episode reward: [(0, '-1.576')] [2022-07-10 14:48:38,202][26022] Updated weights on worker 0-0, policy_version 769032 (0.00090) [2022-07-10 14:48:39,532][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:48:39,556][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000769040_787496960.pth [2022-07-10 14:48:39,556][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000767088_785498112.pth [2022-07-10 14:48:39,813][26022] Updated weights on worker 0-0, policy_version 769042 (0.00094) [2022-07-10 14:48:41,601][26022] Updated weights on worker 0-0, policy_version 769052 (0.00097) [2022-07-10 14:48:42,897][25689] Fps is (10 sec: 5514.3, 60 sec: 5532.7, 300 sec: 5538.0). Total num frames: 787515392. Throughput: 0: 5802.8. Samples: 787519650. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:42,898][25689] Avg episode reward: [(0, '-1.657')] [2022-07-10 14:48:43,532][26022] Updated weights on worker 0-0, policy_version 769062 (0.00082) [2022-07-10 14:48:45,372][26022] Updated weights on worker 0-0, policy_version 769072 (0.00088) [2022-07-10 14:48:47,197][26022] Updated weights on worker 0-0, policy_version 769082 (0.00086) [2022-07-10 14:48:47,959][25689] Fps is (10 sec: 5794.9, 60 sec: 5556.5, 300 sec: 5551.3). Total num frames: 787545088. Throughput: 0: 5857.4. Samples: 787553564. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:47,960][25689] Avg episode reward: [(0, '-2.425')] [2022-07-10 14:48:49,037][26022] Updated weights on worker 0-0, policy_version 769092 (0.00088) [2022-07-10 14:48:50,860][26022] Updated weights on worker 0-0, policy_version 769102 (0.00103) [2022-07-10 14:48:52,832][26022] Updated weights on worker 0-0, policy_version 769112 (0.00087) [2022-07-10 14:48:52,972][25689] Fps is (10 sec: 5590.3, 60 sec: 5542.0, 300 sec: 5537.5). Total num frames: 787571712. Throughput: 0: 5861.7. Samples: 787570334. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:52,973][25689] Avg episode reward: [(0, '-2.246')] [2022-07-10 14:48:54,457][26022] Updated weights on worker 0-0, policy_version 769122 (0.00095) [2022-07-10 14:48:56,528][26022] Updated weights on worker 0-0, policy_version 769132 (0.00088) [2022-07-10 14:48:57,973][25689] Fps is (10 sec: 5521.7, 60 sec: 5526.7, 300 sec: 5545.0). Total num frames: 787600384. Throughput: 0: 5843.5. Samples: 787603602. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:48:57,974][25689] Avg episode reward: [(0, '-1.927')] [2022-07-10 14:48:58,068][26022] Updated weights on worker 0-0, policy_version 769142 (0.00093) [2022-07-10 14:49:00,185][26022] Updated weights on worker 0-0, policy_version 769152 (0.00086) [2022-07-10 14:49:01,758][26022] Updated weights on worker 0-0, policy_version 769162 (0.00090) [2022-07-10 14:49:03,026][25689] Fps is (10 sec: 5296.0, 60 sec: 5540.7, 300 sec: 5542.6). Total num frames: 787624960. Throughput: 0: 5744.8. Samples: 787635288. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:03,027][25689] Avg episode reward: [(0, '-2.387')] [2022-07-10 14:49:04,155][26022] Updated weights on worker 0-0, policy_version 769172 (0.00095) [2022-07-10 14:49:05,719][26022] Updated weights on worker 0-0, policy_version 769182 (0.00083) [2022-07-10 14:49:07,820][26022] Updated weights on worker 0-0, policy_version 769192 (0.00084) [2022-07-10 14:49:08,102][25689] Fps is (10 sec: 5257.1, 60 sec: 5526.0, 300 sec: 5541.6). Total num frames: 787653632. Throughput: 0: 4881.5. Samples: 787651892. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:08,104][25689] Avg episode reward: [(0, '-2.478')] [2022-07-10 14:49:09,621][26022] Updated weights on worker 0-0, policy_version 769202 (0.00089) [2022-07-10 14:49:11,592][26022] Updated weights on worker 0-0, policy_version 769212 (0.00087) [2022-07-10 14:49:13,125][25689] Fps is (10 sec: 5678.5, 60 sec: 5533.0, 300 sec: 5541.6). Total num frames: 787682304. Throughput: 0: 5703.0. Samples: 787685266. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:13,126][25689] Avg episode reward: [(0, '-2.462')] [2022-07-10 14:49:13,179][26022] Updated weights on worker 0-0, policy_version 769222 (0.00608) [2022-07-10 14:49:15,464][26022] Updated weights on worker 0-0, policy_version 769232 (0.00081) [2022-07-10 14:49:16,955][26022] Updated weights on worker 0-0, policy_version 769242 (0.00088) [2022-07-10 14:49:18,160][25689] Fps is (10 sec: 5599.6, 60 sec: 5550.1, 300 sec: 5544.7). Total num frames: 787709952. Throughput: 0: 5696.3. Samples: 787718590. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:18,160][25689] Avg episode reward: [(0, '-1.864')] [2022-07-10 14:49:19,038][26022] Updated weights on worker 0-0, policy_version 769252 (0.00091) [2022-07-10 14:49:20,648][26022] Updated weights on worker 0-0, policy_version 769262 (0.00092) [2022-07-10 14:49:22,711][26022] Updated weights on worker 0-0, policy_version 769272 (0.00083) [2022-07-10 14:49:23,165][25689] Fps is (10 sec: 5507.4, 60 sec: 5524.3, 300 sec: 5537.1). Total num frames: 787737600. Throughput: 0: 4953.5. Samples: 787735046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:23,166][25689] Avg episode reward: [(0, '-1.607')] [2022-07-10 14:49:24,363][26022] Updated weights on worker 0-0, policy_version 769282 (0.00080) [2022-07-10 14:49:26,259][26022] Updated weights on worker 0-0, policy_version 769292 (0.00091) [2022-07-10 14:49:27,906][26022] Updated weights on worker 0-0, policy_version 769302 (0.00093) [2022-07-10 14:49:28,212][25689] Fps is (10 sec: 5602.9, 60 sec: 5563.6, 300 sec: 5543.7). Total num frames: 787766272. Throughput: 0: 5810.3. Samples: 787768736. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:28,214][25689] Avg episode reward: [(0, '-0.562')] [2022-07-10 14:49:29,976][26022] Updated weights on worker 0-0, policy_version 769312 (0.00094) [2022-07-10 14:49:31,745][26022] Updated weights on worker 0-0, policy_version 769322 (0.00087) [2022-07-10 14:49:33,309][25689] Fps is (10 sec: 5552.1, 60 sec: 5523.3, 300 sec: 5538.6). Total num frames: 787793920. Throughput: 0: 5794.7. Samples: 787802228. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 14:49:33,313][25689] Avg episode reward: [(0, '0.434')] [2022-07-10 14:49:33,687][26022] Updated weights on worker 0-0, policy_version 769332 (0.00089) [2022-07-10 14:49:35,255][26022] Updated weights on worker 0-0, policy_version 769342 (0.00089) [2022-07-10 14:49:37,318][26022] Updated weights on worker 0-0, policy_version 769352 (0.00093) [2022-07-10 14:49:38,347][25689] Fps is (10 sec: 5658.1, 60 sec: 5572.2, 300 sec: 5548.4). Total num frames: 787823616. Throughput: 0: 4981.0. Samples: 787819140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:49:38,347][25689] Avg episode reward: [(0, '0.636')] [2022-07-10 14:49:38,957][26022] Updated weights on worker 0-0, policy_version 769362 (0.00089) [2022-07-10 14:49:40,933][26022] Updated weights on worker 0-0, policy_version 769372 (0.00089) [2022-07-10 14:49:42,693][26022] Updated weights on worker 0-0, policy_version 769382 (0.00082) [2022-07-10 14:49:43,391][25689] Fps is (10 sec: 5484.7, 60 sec: 5518.3, 300 sec: 5534.7). Total num frames: 787849216. Throughput: 0: 5804.3. Samples: 787852444. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:49:43,392][25689] Avg episode reward: [(0, '0.094')] [2022-07-10 14:49:44,436][26022] Updated weights on worker 0-0, policy_version 769392 (0.00086) [2022-07-10 14:49:46,517][26022] Updated weights on worker 0-0, policy_version 769402 (0.00088) [2022-07-10 14:49:48,208][26022] Updated weights on worker 0-0, policy_version 769412 (0.00091) [2022-07-10 14:49:48,459][25689] Fps is (10 sec: 5468.2, 60 sec: 5517.8, 300 sec: 5545.0). Total num frames: 787878912. Throughput: 0: 5785.2. Samples: 787885870. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:49:48,460][25689] Avg episode reward: [(0, '0.195')] [2022-07-10 14:49:50,173][26022] Updated weights on worker 0-0, policy_version 769422 (0.00081) [2022-07-10 14:49:52,043][26022] Updated weights on worker 0-0, policy_version 769432 (0.00088) [2022-07-10 14:49:53,538][25689] Fps is (10 sec: 5651.5, 60 sec: 5528.7, 300 sec: 5538.1). Total num frames: 787906560. Throughput: 0: 4959.6. Samples: 787902554. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:49:53,538][25689] Avg episode reward: [(0, '-1.252')] [2022-07-10 14:49:53,807][26022] Updated weights on worker 0-0, policy_version 769442 (0.00093) [2022-07-10 14:49:55,858][26022] Updated weights on worker 0-0, policy_version 769452 (0.00097) [2022-07-10 14:49:57,606][26022] Updated weights on worker 0-0, policy_version 769462 (0.00087) [2022-07-10 14:49:58,623][25689] Fps is (10 sec: 5339.7, 60 sec: 5487.3, 300 sec: 5534.3). Total num frames: 787933184. Throughput: 0: 5750.9. Samples: 787935746. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:49:58,623][25689] Avg episode reward: [(0, '-1.666')] [2022-07-10 14:49:59,384][26022] Updated weights on worker 0-0, policy_version 769472 (0.00102) [2022-07-10 14:50:01,207][26022] Updated weights on worker 0-0, policy_version 769482 (0.00097) [2022-07-10 14:50:03,362][26022] Updated weights on worker 0-0, policy_version 769492 (0.00084) [2022-07-10 14:50:03,628][25689] Fps is (10 sec: 5378.9, 60 sec: 5542.3, 300 sec: 5542.8). Total num frames: 787960832. Throughput: 0: 5674.3. Samples: 787967272. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:03,628][25689] Avg episode reward: [(0, '-1.730')] [2022-07-10 14:50:05,235][26022] Updated weights on worker 0-0, policy_version 769502 (0.00079) [2022-07-10 14:50:07,008][26022] Updated weights on worker 0-0, policy_version 769512 (0.00092) [2022-07-10 14:50:08,747][25689] Fps is (10 sec: 5360.8, 60 sec: 5504.6, 300 sec: 5531.3). Total num frames: 787987456. Throughput: 0: 4842.0. Samples: 787984104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:08,747][25689] Avg episode reward: [(0, '-2.540')] [2022-07-10 14:50:08,991][26022] Updated weights on worker 0-0, policy_version 769522 (0.00093) [2022-07-10 14:50:10,827][26022] Updated weights on worker 0-0, policy_version 769532 (0.00085) [2022-07-10 14:50:12,604][26022] Updated weights on worker 0-0, policy_version 769542 (0.00088) [2022-07-10 14:50:13,823][25689] Fps is (10 sec: 5523.8, 60 sec: 5516.6, 300 sec: 5537.3). Total num frames: 788017152. Throughput: 0: 5655.1. Samples: 788017272. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:13,824][25689] Avg episode reward: [(0, '-2.590')] [2022-07-10 14:50:14,675][26022] Updated weights on worker 0-0, policy_version 769552 (0.00088) [2022-07-10 14:50:16,219][26022] Updated weights on worker 0-0, policy_version 769562 (0.00087) [2022-07-10 14:50:18,263][26022] Updated weights on worker 0-0, policy_version 769572 (0.00089) [2022-07-10 14:50:18,871][25689] Fps is (10 sec: 5765.2, 60 sec: 5532.3, 300 sec: 5537.2). Total num frames: 788045824. Throughput: 0: 5689.9. Samples: 788050956. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:18,872][25689] Avg episode reward: [(0, '-2.464')] [2022-07-10 14:50:19,984][26022] Updated weights on worker 0-0, policy_version 769582 (0.00083) [2022-07-10 14:50:21,740][26022] Updated weights on worker 0-0, policy_version 769592 (0.00346) [2022-07-10 14:50:23,681][26022] Updated weights on worker 0-0, policy_version 769602 (0.00086) [2022-07-10 14:50:23,873][25689] Fps is (10 sec: 5604.5, 60 sec: 5532.6, 300 sec: 5534.9). Total num frames: 788073472. Throughput: 0: 4957.3. Samples: 788067632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:23,873][25689] Avg episode reward: [(0, '-1.043')] [2022-07-10 14:50:25,303][26022] Updated weights on worker 0-0, policy_version 769612 (0.00090) [2022-07-10 14:50:27,326][26022] Updated weights on worker 0-0, policy_version 769622 (0.00093) [2022-07-10 14:50:28,955][25689] Fps is (10 sec: 5585.3, 60 sec: 5529.4, 300 sec: 5540.6). Total num frames: 788102144. Throughput: 0: 5807.5. Samples: 788101462. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:28,955][25689] Avg episode reward: [(0, '-0.603')] [2022-07-10 14:50:29,087][26022] Updated weights on worker 0-0, policy_version 769632 (0.00878) [2022-07-10 14:50:30,970][26022] Updated weights on worker 0-0, policy_version 769642 (0.00097) [2022-07-10 14:50:32,715][26022] Updated weights on worker 0-0, policy_version 769652 (0.00080) [2022-07-10 14:50:33,959][25689] Fps is (10 sec: 5380.5, 60 sec: 5504.1, 300 sec: 5533.7). Total num frames: 788127744. Throughput: 0: 5851.2. Samples: 788135092. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:33,960][25689] Avg episode reward: [(0, '-0.959')] [2022-07-10 14:50:34,547][26022] Updated weights on worker 0-0, policy_version 769662 (0.00089) [2022-07-10 14:50:36,467][26022] Updated weights on worker 0-0, policy_version 769672 (0.00086) [2022-07-10 14:50:38,358][26022] Updated weights on worker 0-0, policy_version 769682 (0.00123) [2022-07-10 14:50:38,965][25689] Fps is (10 sec: 5523.8, 60 sec: 5507.0, 300 sec: 5534.4). Total num frames: 788157440. Throughput: 0: 5017.3. Samples: 788151774. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:38,966][25689] Avg episode reward: [(0, '-0.110')] [2022-07-10 14:50:39,574][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:50:39,589][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000769688_788160512.pth [2022-07-10 14:50:39,590][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000767740_786165760.pth [2022-07-10 14:50:40,068][26022] Updated weights on worker 0-0, policy_version 769692 (0.00090) [2022-07-10 14:50:42,023][26022] Updated weights on worker 0-0, policy_version 769702 (0.00090) [2022-07-10 14:50:43,693][26022] Updated weights on worker 0-0, policy_version 769712 (0.00091) [2022-07-10 14:50:43,980][25689] Fps is (10 sec: 5927.0, 60 sec: 5577.3, 300 sec: 5538.8). Total num frames: 788187136. Throughput: 0: 5849.5. Samples: 788185250. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:43,980][25689] Avg episode reward: [(0, '0.662')] [2022-07-10 14:50:45,959][26022] Updated weights on worker 0-0, policy_version 769722 (0.00096) [2022-07-10 14:50:47,300][26022] Updated weights on worker 0-0, policy_version 769732 (0.00086) [2022-07-10 14:50:49,084][25689] Fps is (10 sec: 5464.5, 60 sec: 5506.3, 300 sec: 5527.8). Total num frames: 788212736. Throughput: 0: 5824.0. Samples: 788218698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:49,085][25689] Avg episode reward: [(0, '0.777')] [2022-07-10 14:50:49,602][26022] Updated weights on worker 0-0, policy_version 769742 (0.00096) [2022-07-10 14:50:50,848][26022] Updated weights on worker 0-0, policy_version 769752 (0.00386) [2022-07-10 14:50:53,159][26022] Updated weights on worker 0-0, policy_version 769762 (0.00085) [2022-07-10 14:50:54,105][25689] Fps is (10 sec: 5461.5, 60 sec: 5545.5, 300 sec: 5538.1). Total num frames: 788242432. Throughput: 0: 4978.9. Samples: 788235392. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:54,106][25689] Avg episode reward: [(0, '0.127')] [2022-07-10 14:50:54,833][26022] Updated weights on worker 0-0, policy_version 769772 (0.00085) [2022-07-10 14:50:56,897][26022] Updated weights on worker 0-0, policy_version 769782 (0.00097) [2022-07-10 14:50:58,463][26022] Updated weights on worker 0-0, policy_version 769792 (0.00085) [2022-07-10 14:50:59,117][25689] Fps is (10 sec: 5715.6, 60 sec: 5569.0, 300 sec: 5535.3). Total num frames: 788270080. Throughput: 0: 5807.0. Samples: 788268796. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:50:59,118][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 14:51:00,515][26022] Updated weights on worker 0-0, policy_version 769802 (0.00085) [2022-07-10 14:51:02,475][26022] Updated weights on worker 0-0, policy_version 769812 (0.00094) [2022-07-10 14:51:04,133][25689] Fps is (10 sec: 5207.7, 60 sec: 5517.3, 300 sec: 5530.3). Total num frames: 788294656. Throughput: 0: 5704.3. Samples: 788300208. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:04,135][25689] Avg episode reward: [(0, '-0.173')] [2022-07-10 14:51:04,462][26022] Updated weights on worker 0-0, policy_version 769822 (0.00090) [2022-07-10 14:51:06,118][26022] Updated weights on worker 0-0, policy_version 769832 (0.00615) [2022-07-10 14:51:08,223][26022] Updated weights on worker 0-0, policy_version 769842 (0.00093) [2022-07-10 14:51:09,202][25689] Fps is (10 sec: 5381.5, 60 sec: 5572.6, 300 sec: 5540.8). Total num frames: 788324352. Throughput: 0: 5728.6. Samples: 788333944. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:09,203][25689] Avg episode reward: [(0, '-0.301')] [2022-07-10 14:51:09,853][26022] Updated weights on worker 0-0, policy_version 769852 (0.00086) [2022-07-10 14:51:11,923][26022] Updated weights on worker 0-0, policy_version 769862 (0.00082) [2022-07-10 14:51:13,350][26022] Updated weights on worker 0-0, policy_version 769872 (0.00084) [2022-07-10 14:51:14,240][25689] Fps is (10 sec: 5673.6, 60 sec: 5542.3, 300 sec: 5533.7). Total num frames: 788352000. Throughput: 0: 5713.9. Samples: 788350444. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:14,240][25689] Avg episode reward: [(0, '-2.030')] [2022-07-10 14:51:15,535][26022] Updated weights on worker 0-0, policy_version 769882 (0.00088) [2022-07-10 14:51:17,038][26022] Updated weights on worker 0-0, policy_version 769892 (0.00087) [2022-07-10 14:51:19,161][26022] Updated weights on worker 0-0, policy_version 769902 (0.00086) [2022-07-10 14:51:19,251][25689] Fps is (10 sec: 5502.6, 60 sec: 5528.7, 300 sec: 5531.9). Total num frames: 788379648. Throughput: 0: 5730.1. Samples: 788384166. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:19,252][25689] Avg episode reward: [(0, '-2.116')] [2022-07-10 14:51:20,958][26022] Updated weights on worker 0-0, policy_version 769912 (0.00086) [2022-07-10 14:51:22,732][26022] Updated weights on worker 0-0, policy_version 769922 (0.00089) [2022-07-10 14:51:24,272][25689] Fps is (10 sec: 5512.1, 60 sec: 5527.0, 300 sec: 5533.0). Total num frames: 788407296. Throughput: 0: 5834.4. Samples: 788417706. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:24,272][25689] Avg episode reward: [(0, '-1.602')] [2022-07-10 14:51:24,688][26022] Updated weights on worker 0-0, policy_version 769932 (0.00092) [2022-07-10 14:51:26,351][26022] Updated weights on worker 0-0, policy_version 769942 (0.00092) [2022-07-10 14:51:28,166][26022] Updated weights on worker 0-0, policy_version 769952 (0.00091) [2022-07-10 14:51:29,377][25689] Fps is (10 sec: 5663.4, 60 sec: 5541.8, 300 sec: 5532.9). Total num frames: 788436992. Throughput: 0: 4974.6. Samples: 788434304. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:29,377][25689] Avg episode reward: [(0, '-1.807')] [2022-07-10 14:51:30,292][26022] Updated weights on worker 0-0, policy_version 769962 (0.00105) [2022-07-10 14:51:31,700][26022] Updated weights on worker 0-0, policy_version 769972 (0.00052) [2022-07-10 14:51:34,109][26022] Updated weights on worker 0-0, policy_version 769982 (0.00090) [2022-07-10 14:51:34,444][25689] Fps is (10 sec: 5536.7, 60 sec: 5553.0, 300 sec: 5532.8). Total num frames: 788463616. Throughput: 0: 5820.2. Samples: 788468034. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:34,444][25689] Avg episode reward: [(0, '-1.722')] [2022-07-10 14:51:35,458][26022] Updated weights on worker 0-0, policy_version 769992 (0.00093) [2022-07-10 14:51:37,570][26022] Updated weights on worker 0-0, policy_version 770002 (0.00102) [2022-07-10 14:51:39,030][26022] Updated weights on worker 0-0, policy_version 770012 (0.00088) [2022-07-10 14:51:39,458][25689] Fps is (10 sec: 5586.7, 60 sec: 5552.3, 300 sec: 5534.1). Total num frames: 788493312. Throughput: 0: 5808.2. Samples: 788501530. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:39,458][25689] Avg episode reward: [(0, '-2.402')] [2022-07-10 14:51:41,331][26022] Updated weights on worker 0-0, policy_version 770022 (0.00085) [2022-07-10 14:51:42,957][26022] Updated weights on worker 0-0, policy_version 770032 (0.00089) [2022-07-10 14:51:44,469][25689] Fps is (10 sec: 5617.9, 60 sec: 5501.8, 300 sec: 5535.5). Total num frames: 788519936. Throughput: 0: 4960.0. Samples: 788517884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:44,469][25689] Avg episode reward: [(0, '-1.311')] [2022-07-10 14:51:45,037][26022] Updated weights on worker 0-0, policy_version 770042 (0.00286) [2022-07-10 14:51:46,596][26022] Updated weights on worker 0-0, policy_version 770052 (0.00083) [2022-07-10 14:51:48,680][26022] Updated weights on worker 0-0, policy_version 770062 (0.00087) [2022-07-10 14:51:49,511][25689] Fps is (10 sec: 5500.1, 60 sec: 5558.3, 300 sec: 5531.6). Total num frames: 788548608. Throughput: 0: 5804.0. Samples: 788551166. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:49,512][25689] Avg episode reward: [(0, '-1.379')] [2022-07-10 14:51:50,293][26022] Updated weights on worker 0-0, policy_version 770072 (0.00085) [2022-07-10 14:51:52,083][26022] Updated weights on worker 0-0, policy_version 770082 (0.00093) [2022-07-10 14:51:54,231][26022] Updated weights on worker 0-0, policy_version 770092 (0.00095) [2022-07-10 14:51:54,519][25689] Fps is (10 sec: 5604.1, 60 sec: 5525.6, 300 sec: 5531.9). Total num frames: 788576256. Throughput: 0: 5809.9. Samples: 788584666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:54,519][25689] Avg episode reward: [(0, '-2.099')] [2022-07-10 14:51:55,933][26022] Updated weights on worker 0-0, policy_version 770102 (0.00086) [2022-07-10 14:51:57,751][26022] Updated weights on worker 0-0, policy_version 770112 (0.00089) [2022-07-10 14:51:59,531][25689] Fps is (10 sec: 5518.9, 60 sec: 5525.7, 300 sec: 5528.4). Total num frames: 788603904. Throughput: 0: 4973.0. Samples: 788601354. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:51:59,531][25689] Avg episode reward: [(0, '-2.171')] [2022-07-10 14:51:59,669][26022] Updated weights on worker 0-0, policy_version 770122 (0.00082) [2022-07-10 14:52:01,447][26022] Updated weights on worker 0-0, policy_version 770132 (0.00085) [2022-07-10 14:52:03,575][26022] Updated weights on worker 0-0, policy_version 770142 (0.00086) [2022-07-10 14:52:04,538][25689] Fps is (10 sec: 5416.9, 60 sec: 5560.4, 300 sec: 5530.2). Total num frames: 788630528. Throughput: 0: 5720.1. Samples: 788632680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:04,540][25689] Avg episode reward: [(0, '-1.445')] [2022-07-10 14:52:05,574][26022] Updated weights on worker 0-0, policy_version 770152 (0.00090) [2022-07-10 14:52:07,426][26022] Updated weights on worker 0-0, policy_version 770162 (0.00072) [2022-07-10 14:52:09,114][26022] Updated weights on worker 0-0, policy_version 770172 (0.00087) [2022-07-10 14:52:09,649][25689] Fps is (10 sec: 5464.9, 60 sec: 5539.5, 300 sec: 5535.5). Total num frames: 788659200. Throughput: 0: 5711.8. Samples: 788666188. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:09,651][25689] Avg episode reward: [(0, '-1.722')] [2022-07-10 14:52:11,123][26022] Updated weights on worker 0-0, policy_version 770182 (0.00090) [2022-07-10 14:52:12,773][26022] Updated weights on worker 0-0, policy_version 770192 (0.00092) [2022-07-10 14:52:14,675][25689] Fps is (10 sec: 5454.9, 60 sec: 5523.7, 300 sec: 5532.2). Total num frames: 788685824. Throughput: 0: 4861.3. Samples: 788682650. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:14,677][25689] Avg episode reward: [(0, '-1.709')] [2022-07-10 14:52:14,792][26022] Updated weights on worker 0-0, policy_version 770202 (0.00085) [2022-07-10 14:52:16,411][26022] Updated weights on worker 0-0, policy_version 770212 (0.00093) [2022-07-10 14:52:18,697][26022] Updated weights on worker 0-0, policy_version 770222 (0.00096) [2022-07-10 14:52:19,690][25689] Fps is (10 sec: 5405.1, 60 sec: 5523.4, 300 sec: 5525.1). Total num frames: 788713472. Throughput: 0: 5680.8. Samples: 788715874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:19,692][25689] Avg episode reward: [(0, '-1.496')] [2022-07-10 14:52:20,190][26022] Updated weights on worker 0-0, policy_version 770232 (0.00086) [2022-07-10 14:52:22,263][26022] Updated weights on worker 0-0, policy_version 770242 (0.00080) [2022-07-10 14:52:23,832][26022] Updated weights on worker 0-0, policy_version 770252 (0.00092) [2022-07-10 14:52:24,713][25689] Fps is (10 sec: 5508.2, 60 sec: 5523.1, 300 sec: 5529.8). Total num frames: 788741120. Throughput: 0: 5788.2. Samples: 788749462. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:24,714][25689] Avg episode reward: [(0, '-0.969')] [2022-07-10 14:52:25,992][26022] Updated weights on worker 0-0, policy_version 770262 (0.00100) [2022-07-10 14:52:27,564][26022] Updated weights on worker 0-0, policy_version 770272 (0.00092) [2022-07-10 14:52:29,646][26022] Updated weights on worker 0-0, policy_version 770282 (0.00083) [2022-07-10 14:52:29,821][25689] Fps is (10 sec: 5558.8, 60 sec: 5505.8, 300 sec: 5531.8). Total num frames: 788769792. Throughput: 0: 4952.3. Samples: 788766088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:29,822][25689] Avg episode reward: [(0, '-0.036')] [2022-07-10 14:52:31,166][26022] Updated weights on worker 0-0, policy_version 770292 (0.00089) [2022-07-10 14:52:33,301][26022] Updated weights on worker 0-0, policy_version 770302 (0.00093) [2022-07-10 14:52:34,839][25689] Fps is (10 sec: 5663.5, 60 sec: 5544.3, 300 sec: 5528.4). Total num frames: 788798464. Throughput: 0: 5799.4. Samples: 788799588. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:34,839][25689] Avg episode reward: [(0, '0.410')] [2022-07-10 14:52:34,951][26022] Updated weights on worker 0-0, policy_version 770312 (0.00556) [2022-07-10 14:52:36,974][26022] Updated weights on worker 0-0, policy_version 770322 (0.00091) [2022-07-10 14:52:38,774][26022] Updated weights on worker 0-0, policy_version 770332 (0.00089) [2022-07-10 14:52:39,679][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:52:39,694][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000770337_788825088.pth [2022-07-10 14:52:39,694][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000768391_786832384.pth [2022-07-10 14:52:39,871][25689] Fps is (10 sec: 5604.3, 60 sec: 5508.7, 300 sec: 5531.4). Total num frames: 788826112. Throughput: 0: 5821.8. Samples: 788833364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:39,871][25689] Avg episode reward: [(0, '1.314')] [2022-07-10 14:52:40,490][26022] Updated weights on worker 0-0, policy_version 770342 (0.00094) [2022-07-10 14:52:42,507][26022] Updated weights on worker 0-0, policy_version 770352 (0.00086) [2022-07-10 14:52:44,152][26022] Updated weights on worker 0-0, policy_version 770362 (0.00091) [2022-07-10 14:52:44,882][25689] Fps is (10 sec: 5505.9, 60 sec: 5525.7, 300 sec: 5530.3). Total num frames: 788853760. Throughput: 0: 4983.5. Samples: 788849970. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:44,882][25689] Avg episode reward: [(0, '0.788')] [2022-07-10 14:52:46,169][26022] Updated weights on worker 0-0, policy_version 770372 (0.00094) [2022-07-10 14:52:48,026][26022] Updated weights on worker 0-0, policy_version 770382 (0.00087) [2022-07-10 14:52:49,764][26022] Updated weights on worker 0-0, policy_version 770392 (0.00093) [2022-07-10 14:52:49,926][25689] Fps is (10 sec: 5601.2, 60 sec: 5525.6, 300 sec: 5533.7). Total num frames: 788882432. Throughput: 0: 5815.6. Samples: 788883008. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:49,926][25689] Avg episode reward: [(0, '0.753')] [2022-07-10 14:52:51,688][26022] Updated weights on worker 0-0, policy_version 770402 (0.00089) [2022-07-10 14:52:53,537][26022] Updated weights on worker 0-0, policy_version 770412 (0.00064) [2022-07-10 14:52:54,935][25689] Fps is (10 sec: 5500.4, 60 sec: 5508.5, 300 sec: 5523.5). Total num frames: 788909056. Throughput: 0: 5814.0. Samples: 788916426. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:54,935][25689] Avg episode reward: [(0, '0.952')] [2022-07-10 14:52:55,336][26022] Updated weights on worker 0-0, policy_version 770422 (0.00088) [2022-07-10 14:52:57,063][26022] Updated weights on worker 0-0, policy_version 770432 (0.00091) [2022-07-10 14:52:59,093][26022] Updated weights on worker 0-0, policy_version 770442 (0.00090) [2022-07-10 14:52:59,959][25689] Fps is (10 sec: 5511.6, 60 sec: 5524.3, 300 sec: 5540.7). Total num frames: 788937728. Throughput: 0: 4969.3. Samples: 788933186. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:52:59,959][25689] Avg episode reward: [(0, '-0.048')] [2022-07-10 14:53:00,812][26022] Updated weights on worker 0-0, policy_version 770452 (0.00092) [2022-07-10 14:53:03,055][26022] Updated weights on worker 0-0, policy_version 770462 (0.00095) [2022-07-10 14:53:04,872][26022] Updated weights on worker 0-0, policy_version 770472 (0.00086) [2022-07-10 14:53:04,969][25689] Fps is (10 sec: 5408.8, 60 sec: 5507.1, 300 sec: 5528.6). Total num frames: 788963328. Throughput: 0: 5699.4. Samples: 788964454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 14:53:04,969][25689] Avg episode reward: [(0, '-0.868')] [2022-07-10 14:53:06,891][26022] Updated weights on worker 0-0, policy_version 770482 (0.00096) [2022-07-10 14:53:08,631][26022] Updated weights on worker 0-0, policy_version 770492 (0.00093) [2022-07-10 14:53:10,023][25689] Fps is (10 sec: 5290.8, 60 sec: 5495.3, 300 sec: 5526.0). Total num frames: 788990976. Throughput: 0: 5691.3. Samples: 788997386. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:10,024][25689] Avg episode reward: [(0, '-1.160')] [2022-07-10 14:53:10,679][26022] Updated weights on worker 0-0, policy_version 770502 (0.00096) [2022-07-10 14:53:12,384][26022] Updated weights on worker 0-0, policy_version 770512 (0.00093) [2022-07-10 14:53:14,363][26022] Updated weights on worker 0-0, policy_version 770522 (0.00092) [2022-07-10 14:53:15,027][25689] Fps is (10 sec: 5497.7, 60 sec: 5514.3, 300 sec: 5530.1). Total num frames: 789018624. Throughput: 0: 4851.6. Samples: 789013904. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:15,027][25689] Avg episode reward: [(0, '-1.632')] [2022-07-10 14:53:16,089][26022] Updated weights on worker 0-0, policy_version 770532 (0.00097) [2022-07-10 14:53:18,146][26022] Updated weights on worker 0-0, policy_version 770542 (0.00088) [2022-07-10 14:53:19,915][26022] Updated weights on worker 0-0, policy_version 770552 (0.00092) [2022-07-10 14:53:20,031][25689] Fps is (10 sec: 5422.9, 60 sec: 5498.4, 300 sec: 5521.4). Total num frames: 789045248. Throughput: 0: 5650.6. Samples: 789046606. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:20,031][25689] Avg episode reward: [(0, '-2.067')] [2022-07-10 14:53:21,716][26022] Updated weights on worker 0-0, policy_version 770562 (0.00087) [2022-07-10 14:53:23,597][26022] Updated weights on worker 0-0, policy_version 770572 (0.00354) [2022-07-10 14:53:25,056][25689] Fps is (10 sec: 5513.3, 60 sec: 5515.2, 300 sec: 5529.8). Total num frames: 789073920. Throughput: 0: 5758.6. Samples: 789080130. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:25,058][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 14:53:25,432][26022] Updated weights on worker 0-0, policy_version 770582 (0.00087) [2022-07-10 14:53:27,620][26022] Updated weights on worker 0-0, policy_version 770592 (0.00082) [2022-07-10 14:53:29,061][26022] Updated weights on worker 0-0, policy_version 770602 (0.00086) [2022-07-10 14:53:30,111][25689] Fps is (10 sec: 5587.4, 60 sec: 5503.1, 300 sec: 5522.4). Total num frames: 789101568. Throughput: 0: 4939.5. Samples: 789096608. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:30,112][25689] Avg episode reward: [(0, '-1.416')] [2022-07-10 14:53:31,188][26022] Updated weights on worker 0-0, policy_version 770612 (0.00089) [2022-07-10 14:53:32,556][26022] Updated weights on worker 0-0, policy_version 770622 (0.00081) [2022-07-10 14:53:34,764][26022] Updated weights on worker 0-0, policy_version 770632 (0.00090) [2022-07-10 14:53:35,178][25689] Fps is (10 sec: 5361.9, 60 sec: 5464.5, 300 sec: 5521.4). Total num frames: 789128192. Throughput: 0: 5761.1. Samples: 789129998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:35,180][25689] Avg episode reward: [(0, '-1.306')] [2022-07-10 14:53:36,403][26022] Updated weights on worker 0-0, policy_version 770642 (0.00083) [2022-07-10 14:53:38,384][26022] Updated weights on worker 0-0, policy_version 770652 (0.00098) [2022-07-10 14:53:40,185][25689] Fps is (10 sec: 5488.9, 60 sec: 5483.9, 300 sec: 5521.5). Total num frames: 789156864. Throughput: 0: 5794.5. Samples: 789163388. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:40,185][25689] Avg episode reward: [(0, '-1.598')] [2022-07-10 14:53:40,395][26022] Updated weights on worker 0-0, policy_version 770662 (0.00086) [2022-07-10 14:53:41,965][26022] Updated weights on worker 0-0, policy_version 770672 (0.00079) [2022-07-10 14:53:44,001][26022] Updated weights on worker 0-0, policy_version 770682 (0.00090) [2022-07-10 14:53:45,213][25689] Fps is (10 sec: 5714.4, 60 sec: 5499.2, 300 sec: 5518.7). Total num frames: 789185536. Throughput: 0: 4954.5. Samples: 789179996. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:45,214][25689] Avg episode reward: [(0, '-1.412')] [2022-07-10 14:53:45,929][26022] Updated weights on worker 0-0, policy_version 770692 (0.00084) [2022-07-10 14:53:47,431][26022] Updated weights on worker 0-0, policy_version 770702 (0.00084) [2022-07-10 14:53:49,540][26022] Updated weights on worker 0-0, policy_version 770712 (0.00093) [2022-07-10 14:53:50,261][25689] Fps is (10 sec: 5589.4, 60 sec: 5481.9, 300 sec: 5521.5). Total num frames: 789213184. Throughput: 0: 5782.4. Samples: 789213126. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:50,261][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 14:53:51,273][26022] Updated weights on worker 0-0, policy_version 770722 (0.00083) [2022-07-10 14:53:53,406][26022] Updated weights on worker 0-0, policy_version 770732 (0.00085) [2022-07-10 14:53:55,055][26022] Updated weights on worker 0-0, policy_version 770742 (0.00082) [2022-07-10 14:53:55,275][25689] Fps is (10 sec: 5495.7, 60 sec: 5498.4, 300 sec: 5517.8). Total num frames: 789240832. Throughput: 0: 5785.0. Samples: 789246256. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:53:55,275][25689] Avg episode reward: [(0, '-1.229')] [2022-07-10 14:53:56,773][26022] Updated weights on worker 0-0, policy_version 770752 (0.00085) [2022-07-10 14:53:58,831][26022] Updated weights on worker 0-0, policy_version 770762 (0.00085) [2022-07-10 14:54:00,287][25689] Fps is (10 sec: 5617.5, 60 sec: 5499.5, 300 sec: 5532.3). Total num frames: 789269504. Throughput: 0: 4961.7. Samples: 789263128. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:00,287][25689] Avg episode reward: [(0, '-0.962')] [2022-07-10 14:54:00,463][26022] Updated weights on worker 0-0, policy_version 770772 (0.00084) [2022-07-10 14:54:02,879][26022] Updated weights on worker 0-0, policy_version 770782 (0.00086) [2022-07-10 14:54:04,500][26022] Updated weights on worker 0-0, policy_version 770792 (0.00090) [2022-07-10 14:54:05,397][25689] Fps is (10 sec: 5361.8, 60 sec: 5490.4, 300 sec: 5521.4). Total num frames: 789295104. Throughput: 0: 5680.1. Samples: 789294640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:05,397][25689] Avg episode reward: [(0, '-0.288')] [2022-07-10 14:54:06,412][26022] Updated weights on worker 0-0, policy_version 770802 (0.00086) [2022-07-10 14:54:08,366][26022] Updated weights on worker 0-0, policy_version 770812 (0.00096) [2022-07-10 14:54:10,064][26022] Updated weights on worker 0-0, policy_version 770822 (0.00091) [2022-07-10 14:54:10,502][25689] Fps is (10 sec: 5413.3, 60 sec: 5519.7, 300 sec: 5523.3). Total num frames: 789324800. Throughput: 0: 5682.5. Samples: 789328144. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:10,502][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 14:54:11,956][26022] Updated weights on worker 0-0, policy_version 770832 (0.00086) [2022-07-10 14:54:13,811][26022] Updated weights on worker 0-0, policy_version 770842 (0.00092) [2022-07-10 14:54:15,495][26022] Updated weights on worker 0-0, policy_version 770852 (0.00094) [2022-07-10 14:54:15,569][25689] Fps is (10 sec: 5637.2, 60 sec: 5513.9, 300 sec: 5522.7). Total num frames: 789352448. Throughput: 0: 5662.8. Samples: 789361178. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:15,569][25689] Avg episode reward: [(0, '-0.201')] [2022-07-10 14:54:17,663][26022] Updated weights on worker 0-0, policy_version 770862 (0.00084) [2022-07-10 14:54:19,233][26022] Updated weights on worker 0-0, policy_version 770872 (0.00101) [2022-07-10 14:54:20,617][25689] Fps is (10 sec: 5263.9, 60 sec: 5493.0, 300 sec: 5515.0). Total num frames: 789378048. Throughput: 0: 5631.4. Samples: 789377618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:20,618][25689] Avg episode reward: [(0, '-1.237')] [2022-07-10 14:54:21,244][26022] Updated weights on worker 0-0, policy_version 770882 (0.00794) [2022-07-10 14:54:23,064][26022] Updated weights on worker 0-0, policy_version 770892 (0.00083) [2022-07-10 14:54:24,893][26022] Updated weights on worker 0-0, policy_version 770902 (0.00087) [2022-07-10 14:54:25,651][25689] Fps is (10 sec: 5484.7, 60 sec: 5509.1, 300 sec: 5518.7). Total num frames: 789407744. Throughput: 0: 5751.7. Samples: 789411136. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:25,652][25689] Avg episode reward: [(0, '-2.343')] [2022-07-10 14:54:26,923][26022] Updated weights on worker 0-0, policy_version 770912 (0.00083) [2022-07-10 14:54:28,453][26022] Updated weights on worker 0-0, policy_version 770922 (0.00092) [2022-07-10 14:54:30,617][26022] Updated weights on worker 0-0, policy_version 770932 (0.00087) [2022-07-10 14:54:30,706][25689] Fps is (10 sec: 5582.3, 60 sec: 5492.2, 300 sec: 5516.0). Total num frames: 789434368. Throughput: 0: 5738.5. Samples: 789444088. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:30,707][25689] Avg episode reward: [(0, '-2.238')] [2022-07-10 14:54:32,461][26022] Updated weights on worker 0-0, policy_version 770942 (0.00086) [2022-07-10 14:54:34,142][26022] Updated weights on worker 0-0, policy_version 770952 (0.00092) [2022-07-10 14:54:35,756][25689] Fps is (10 sec: 5370.8, 60 sec: 5510.7, 300 sec: 5508.9). Total num frames: 789462016. Throughput: 0: 4928.8. Samples: 789460676. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:35,758][25689] Avg episode reward: [(0, '-2.531')] [2022-07-10 14:54:36,267][26022] Updated weights on worker 0-0, policy_version 770962 (0.00087) [2022-07-10 14:54:37,571][26022] Updated weights on worker 0-0, policy_version 770972 (0.00358) [2022-07-10 14:54:39,757][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:54:39,771][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000770981_789484544.pth [2022-07-10 14:54:39,772][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000769040_787496960.pth [2022-07-10 14:54:39,872][26022] Updated weights on worker 0-0, policy_version 770982 (0.00090) [2022-07-10 14:54:40,760][25689] Fps is (10 sec: 5703.6, 60 sec: 5527.8, 300 sec: 5523.4). Total num frames: 789491712. Throughput: 0: 5771.8. Samples: 789493880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:40,762][25689] Avg episode reward: [(0, '-2.307')] [2022-07-10 14:54:41,443][26022] Updated weights on worker 0-0, policy_version 770992 (0.00085) [2022-07-10 14:54:43,420][26022] Updated weights on worker 0-0, policy_version 771002 (0.00092) [2022-07-10 14:54:45,446][26022] Updated weights on worker 0-0, policy_version 771012 (0.00092) [2022-07-10 14:54:45,781][25689] Fps is (10 sec: 5617.8, 60 sec: 5494.6, 300 sec: 5514.0). Total num frames: 789518336. Throughput: 0: 5761.9. Samples: 789527124. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:45,783][25689] Avg episode reward: [(0, '-2.572')] [2022-07-10 14:54:47,234][26022] Updated weights on worker 0-0, policy_version 771022 (0.00748) [2022-07-10 14:54:49,013][26022] Updated weights on worker 0-0, policy_version 771032 (0.00094) [2022-07-10 14:54:50,897][25689] Fps is (10 sec: 5354.2, 60 sec: 5488.5, 300 sec: 5513.3). Total num frames: 789545984. Throughput: 0: 4935.5. Samples: 789543738. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:50,897][25689] Avg episode reward: [(0, '0.580')] [2022-07-10 14:54:51,050][26022] Updated weights on worker 0-0, policy_version 771042 (0.00091) [2022-07-10 14:54:52,680][26022] Updated weights on worker 0-0, policy_version 771052 (0.00090) [2022-07-10 14:54:54,743][26022] Updated weights on worker 0-0, policy_version 771062 (0.00090) [2022-07-10 14:54:55,907][25689] Fps is (10 sec: 5663.1, 60 sec: 5522.6, 300 sec: 5525.0). Total num frames: 789575680. Throughput: 0: 5788.4. Samples: 789577318. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:54:55,908][25689] Avg episode reward: [(0, '0.298')] [2022-07-10 14:54:56,306][26022] Updated weights on worker 0-0, policy_version 771072 (0.00094) [2022-07-10 14:54:58,311][26022] Updated weights on worker 0-0, policy_version 771082 (0.00087) [2022-07-10 14:55:00,044][26022] Updated weights on worker 0-0, policy_version 771092 (0.00093) [2022-07-10 14:55:00,920][25689] Fps is (10 sec: 5516.6, 60 sec: 5471.8, 300 sec: 5518.0). Total num frames: 789601280. Throughput: 0: 5784.1. Samples: 789610486. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:00,921][25689] Avg episode reward: [(0, '0.007')] [2022-07-10 14:55:02,282][26022] Updated weights on worker 0-0, policy_version 771102 (0.00092) [2022-07-10 14:55:04,265][26022] Updated weights on worker 0-0, policy_version 771112 (0.00085) [2022-07-10 14:55:05,961][25689] Fps is (10 sec: 5194.2, 60 sec: 5494.9, 300 sec: 5519.4). Total num frames: 789627904. Throughput: 0: 4851.1. Samples: 789625020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:05,962][25689] Avg episode reward: [(0, '-0.504')] [2022-07-10 14:55:06,119][26022] Updated weights on worker 0-0, policy_version 771122 (0.00082) [2022-07-10 14:55:07,914][26022] Updated weights on worker 0-0, policy_version 771132 (0.00091) [2022-07-10 14:55:09,780][26022] Updated weights on worker 0-0, policy_version 771142 (0.00086) [2022-07-10 14:55:11,029][25689] Fps is (10 sec: 5571.6, 60 sec: 5498.4, 300 sec: 5519.6). Total num frames: 789657600. Throughput: 0: 5720.6. Samples: 789658906. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:11,029][25689] Avg episode reward: [(0, '-0.896')] [2022-07-10 14:55:11,495][26022] Updated weights on worker 0-0, policy_version 771152 (0.00086) [2022-07-10 14:55:13,405][26022] Updated weights on worker 0-0, policy_version 771162 (0.00092) [2022-07-10 14:55:15,220][26022] Updated weights on worker 0-0, policy_version 771172 (0.00083) [2022-07-10 14:55:16,031][25689] Fps is (10 sec: 5491.6, 60 sec: 5470.4, 300 sec: 5510.1). Total num frames: 789683200. Throughput: 0: 5712.3. Samples: 789692270. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:16,031][25689] Avg episode reward: [(0, '-0.701')] [2022-07-10 14:55:17,019][26022] Updated weights on worker 0-0, policy_version 771182 (0.00089) [2022-07-10 14:55:19,019][26022] Updated weights on worker 0-0, policy_version 771192 (0.00088) [2022-07-10 14:55:20,768][26022] Updated weights on worker 0-0, policy_version 771202 (0.00081) [2022-07-10 14:55:21,046][25689] Fps is (10 sec: 5417.8, 60 sec: 5524.3, 300 sec: 5513.3). Total num frames: 789711872. Throughput: 0: 4895.5. Samples: 789709012. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:21,046][25689] Avg episode reward: [(0, '-0.859')] [2022-07-10 14:55:22,567][26022] Updated weights on worker 0-0, policy_version 771212 (0.00088) [2022-07-10 14:55:24,402][26022] Updated weights on worker 0-0, policy_version 771222 (0.00091) [2022-07-10 14:55:25,946][26022] Updated weights on worker 0-0, policy_version 771232 (0.00090) [2022-07-10 14:55:26,107][25689] Fps is (10 sec: 5894.5, 60 sec: 5538.7, 300 sec: 5520.6). Total num frames: 789742592. Throughput: 0: 5851.7. Samples: 789742904. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:26,107][25689] Avg episode reward: [(0, '-1.502')] [2022-07-10 14:55:28,099][26022] Updated weights on worker 0-0, policy_version 771242 (0.00084) [2022-07-10 14:55:29,827][26022] Updated weights on worker 0-0, policy_version 771252 (0.00092) [2022-07-10 14:55:31,241][25689] Fps is (10 sec: 5624.6, 60 sec: 5531.5, 300 sec: 5521.6). Total num frames: 789769216. Throughput: 0: 5821.4. Samples: 789776570. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:31,241][25689] Avg episode reward: [(0, '-0.622')] [2022-07-10 14:55:31,703][26022] Updated weights on worker 0-0, policy_version 771262 (0.00084) [2022-07-10 14:55:33,524][26022] Updated weights on worker 0-0, policy_version 771272 (0.00088) [2022-07-10 14:55:35,100][26022] Updated weights on worker 0-0, policy_version 771282 (0.00089) [2022-07-10 14:55:36,319][25689] Fps is (10 sec: 5515.1, 60 sec: 5562.8, 300 sec: 5520.3). Total num frames: 789798912. Throughput: 0: 4988.2. Samples: 789793470. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:36,319][25689] Avg episode reward: [(0, '-0.955')] [2022-07-10 14:55:37,247][26022] Updated weights on worker 0-0, policy_version 771292 (0.00089) [2022-07-10 14:55:38,900][26022] Updated weights on worker 0-0, policy_version 771302 (0.00088) [2022-07-10 14:55:40,744][26022] Updated weights on worker 0-0, policy_version 771312 (0.00086) [2022-07-10 14:55:41,367][25689] Fps is (10 sec: 5764.1, 60 sec: 5541.8, 300 sec: 5516.2). Total num frames: 789827584. Throughput: 0: 5824.7. Samples: 789827378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:41,368][25689] Avg episode reward: [(0, '-0.383')] [2022-07-10 14:55:42,751][26022] Updated weights on worker 0-0, policy_version 771322 (0.00088) [2022-07-10 14:55:44,255][26022] Updated weights on worker 0-0, policy_version 771332 (0.00089) [2022-07-10 14:55:46,373][25689] Fps is (10 sec: 5499.5, 60 sec: 5543.2, 300 sec: 5521.5). Total num frames: 789854208. Throughput: 0: 5818.3. Samples: 789860822. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:46,374][25689] Avg episode reward: [(0, '-0.337')] [2022-07-10 14:55:46,379][26022] Updated weights on worker 0-0, policy_version 771342 (0.00084) [2022-07-10 14:55:48,030][26022] Updated weights on worker 0-0, policy_version 771352 (0.00093) [2022-07-10 14:55:49,908][26022] Updated weights on worker 0-0, policy_version 771362 (0.00082) [2022-07-10 14:55:51,438][25689] Fps is (10 sec: 5490.5, 60 sec: 5564.7, 300 sec: 5517.2). Total num frames: 789882880. Throughput: 0: 4995.4. Samples: 789877464. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:51,439][25689] Avg episode reward: [(0, '-0.034')] [2022-07-10 14:55:52,000][26022] Updated weights on worker 0-0, policy_version 771372 (0.00091) [2022-07-10 14:55:53,651][26022] Updated weights on worker 0-0, policy_version 771382 (0.00094) [2022-07-10 14:55:55,692][26022] Updated weights on worker 0-0, policy_version 771392 (0.00094) [2022-07-10 14:55:56,446][25689] Fps is (10 sec: 5489.4, 60 sec: 5514.2, 300 sec: 5513.8). Total num frames: 789909504. Throughput: 0: 5829.1. Samples: 789910798. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:55:56,447][25689] Avg episode reward: [(0, '0.840')] [2022-07-10 14:55:57,228][26022] Updated weights on worker 0-0, policy_version 771402 (0.00091) [2022-07-10 14:55:59,139][26022] Updated weights on worker 0-0, policy_version 771412 (0.00088) [2022-07-10 14:56:01,004][26022] Updated weights on worker 0-0, policy_version 771422 (0.00093) [2022-07-10 14:56:01,472][25689] Fps is (10 sec: 5511.2, 60 sec: 5563.8, 300 sec: 5527.4). Total num frames: 789938176. Throughput: 0: 5822.0. Samples: 789944428. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:01,472][25689] Avg episode reward: [(0, '0.847')] [2022-07-10 14:56:03,364][26022] Updated weights on worker 0-0, policy_version 771432 (0.00092) [2022-07-10 14:56:04,974][26022] Updated weights on worker 0-0, policy_version 771442 (0.00088) [2022-07-10 14:56:06,496][25689] Fps is (10 sec: 5298.7, 60 sec: 5531.5, 300 sec: 5511.0). Total num frames: 789962752. Throughput: 0: 4872.6. Samples: 789958872. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:06,496][25689] Avg episode reward: [(0, '0.746')] [2022-07-10 14:56:07,023][26022] Updated weights on worker 0-0, policy_version 771452 (0.00087) [2022-07-10 14:56:08,676][26022] Updated weights on worker 0-0, policy_version 771462 (0.00091) [2022-07-10 14:56:10,612][26022] Updated weights on worker 0-0, policy_version 771472 (0.00085) [2022-07-10 14:56:11,548][25689] Fps is (10 sec: 5284.6, 60 sec: 5516.0, 300 sec: 5514.2). Total num frames: 789991424. Throughput: 0: 5693.0. Samples: 789991948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:11,548][25689] Avg episode reward: [(0, '1.137')] [2022-07-10 14:56:12,389][26022] Updated weights on worker 0-0, policy_version 771482 (0.00085) [2022-07-10 14:56:14,298][26022] Updated weights on worker 0-0, policy_version 771492 (0.00081) [2022-07-10 14:56:16,224][26022] Updated weights on worker 0-0, policy_version 771502 (0.00094) [2022-07-10 14:56:16,560][25689] Fps is (10 sec: 5596.0, 60 sec: 5548.9, 300 sec: 5514.2). Total num frames: 790019072. Throughput: 0: 5697.3. Samples: 790025392. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:16,561][25689] Avg episode reward: [(0, '1.194')] [2022-07-10 14:56:17,996][26022] Updated weights on worker 0-0, policy_version 771512 (0.00095) [2022-07-10 14:56:19,976][26022] Updated weights on worker 0-0, policy_version 771522 (0.00086) [2022-07-10 14:56:21,586][25689] Fps is (10 sec: 5508.8, 60 sec: 5531.0, 300 sec: 5514.1). Total num frames: 790046720. Throughput: 0: 4845.1. Samples: 790041882. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:21,586][25689] Avg episode reward: [(0, '0.771')] [2022-07-10 14:56:21,774][26022] Updated weights on worker 0-0, policy_version 771532 (0.00088) [2022-07-10 14:56:23,595][26022] Updated weights on worker 0-0, policy_version 771542 (0.00087) [2022-07-10 14:56:25,491][26022] Updated weights on worker 0-0, policy_version 771552 (0.00089) [2022-07-10 14:56:26,599][25689] Fps is (10 sec: 5712.3, 60 sec: 5518.5, 300 sec: 5515.8). Total num frames: 790076416. Throughput: 0: 5789.9. Samples: 790075266. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:26,600][25689] Avg episode reward: [(0, '-0.359')] [2022-07-10 14:56:27,369][26022] Updated weights on worker 0-0, policy_version 771562 (0.00088) [2022-07-10 14:56:29,185][26022] Updated weights on worker 0-0, policy_version 771572 (0.00084) [2022-07-10 14:56:30,956][26022] Updated weights on worker 0-0, policy_version 771582 (0.00082) [2022-07-10 14:56:31,727][25689] Fps is (10 sec: 5452.6, 60 sec: 5502.1, 300 sec: 5511.2). Total num frames: 790102016. Throughput: 0: 5772.2. Samples: 790108424. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:31,727][25689] Avg episode reward: [(0, '-0.378')] [2022-07-10 14:56:32,855][26022] Updated weights on worker 0-0, policy_version 771592 (0.00095) [2022-07-10 14:56:34,818][26022] Updated weights on worker 0-0, policy_version 771602 (0.00098) [2022-07-10 14:56:36,483][26022] Updated weights on worker 0-0, policy_version 771612 (0.00088) [2022-07-10 14:56:36,753][25689] Fps is (10 sec: 5445.8, 60 sec: 5506.8, 300 sec: 5511.0). Total num frames: 790131712. Throughput: 0: 4940.2. Samples: 790125146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-10 14:56:36,753][25689] Avg episode reward: [(0, '-0.959')] [2022-07-10 14:56:38,444][26022] Updated weights on worker 0-0, policy_version 771622 (0.00096) [2022-07-10 14:56:39,959][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:56:39,974][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000771630_790149120.pth [2022-07-10 14:56:39,974][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000769688_788160512.pth [2022-07-10 14:56:40,416][26022] Updated weights on worker 0-0, policy_version 771632 (0.00092) [2022-07-10 14:56:41,839][25689] Fps is (10 sec: 5670.5, 60 sec: 5486.4, 300 sec: 5513.1). Total num frames: 790159360. Throughput: 0: 5749.8. Samples: 790158338. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:56:41,840][25689] Avg episode reward: [(0, '-2.049')] [2022-07-10 14:56:42,165][26022] Updated weights on worker 0-0, policy_version 771642 (0.00085) [2022-07-10 14:56:43,762][26022] Updated weights on worker 0-0, policy_version 771652 (0.00099) [2022-07-10 14:56:45,741][26022] Updated weights on worker 0-0, policy_version 771662 (0.00096) [2022-07-10 14:56:46,943][25689] Fps is (10 sec: 5527.0, 60 sec: 5511.4, 300 sec: 5511.9). Total num frames: 790188032. Throughput: 0: 5735.9. Samples: 790191956. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:56:46,943][25689] Avg episode reward: [(0, '-2.153')] [2022-07-10 14:56:47,656][26022] Updated weights on worker 0-0, policy_version 771672 (0.00094) [2022-07-10 14:56:49,474][26022] Updated weights on worker 0-0, policy_version 771682 (0.00086) [2022-07-10 14:56:51,246][26022] Updated weights on worker 0-0, policy_version 771692 (0.00089) [2022-07-10 14:56:52,063][25689] Fps is (10 sec: 5508.8, 60 sec: 5489.5, 300 sec: 5509.8). Total num frames: 790215680. Throughput: 0: 5739.7. Samples: 790225148. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:56:52,064][25689] Avg episode reward: [(0, '-1.048')] [2022-07-10 14:56:53,154][26022] Updated weights on worker 0-0, policy_version 771702 (0.00091) [2022-07-10 14:56:54,962][26022] Updated weights on worker 0-0, policy_version 771712 (0.00089) [2022-07-10 14:56:56,869][26022] Updated weights on worker 0-0, policy_version 771722 (0.00086) [2022-07-10 14:56:57,076][25689] Fps is (10 sec: 5659.0, 60 sec: 5539.7, 300 sec: 5516.7). Total num frames: 790245376. Throughput: 0: 5739.2. Samples: 790241786. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:56:57,077][25689] Avg episode reward: [(0, '-1.664')] [2022-07-10 14:56:58,724][26022] Updated weights on worker 0-0, policy_version 771732 (0.00094) [2022-07-10 14:57:00,506][26022] Updated weights on worker 0-0, policy_version 771742 (0.00085) [2022-07-10 14:57:02,125][25689] Fps is (10 sec: 5495.8, 60 sec: 5487.0, 300 sec: 5512.5). Total num frames: 790270976. Throughput: 0: 5752.7. Samples: 790275032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:02,135][25689] Avg episode reward: [(0, '-1.459')] [2022-07-10 14:57:02,939][26022] Updated weights on worker 0-0, policy_version 771752 (0.00088) [2022-07-10 14:57:04,632][26022] Updated weights on worker 0-0, policy_version 771762 (0.00080) [2022-07-10 14:57:06,551][26022] Updated weights on worker 0-0, policy_version 771772 (0.00086) [2022-07-10 14:57:07,177][25689] Fps is (10 sec: 5271.5, 60 sec: 5535.0, 300 sec: 5510.1). Total num frames: 790298624. Throughput: 0: 5658.3. Samples: 790306448. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:07,179][25689] Avg episode reward: [(0, '-0.587')] [2022-07-10 14:57:08,259][26022] Updated weights on worker 0-0, policy_version 771782 (0.00085) [2022-07-10 14:57:10,223][26022] Updated weights on worker 0-0, policy_version 771792 (0.00095) [2022-07-10 14:57:12,031][26022] Updated weights on worker 0-0, policy_version 771802 (0.00116) [2022-07-10 14:57:12,294][25689] Fps is (10 sec: 5437.6, 60 sec: 5512.3, 300 sec: 5511.9). Total num frames: 790326272. Throughput: 0: 4842.3. Samples: 790323112. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:12,294][25689] Avg episode reward: [(0, '0.111')] [2022-07-10 14:57:13,731][26022] Updated weights on worker 0-0, policy_version 771812 (0.00095) [2022-07-10 14:57:15,767][26022] Updated weights on worker 0-0, policy_version 771822 (0.00093) [2022-07-10 14:57:17,353][25689] Fps is (10 sec: 5434.0, 60 sec: 5508.0, 300 sec: 5511.1). Total num frames: 790353920. Throughput: 0: 5662.3. Samples: 790356600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:17,354][25689] Avg episode reward: [(0, '-0.726')] [2022-07-10 14:57:17,475][26022] Updated weights on worker 0-0, policy_version 771832 (0.00102) [2022-07-10 14:57:19,448][26022] Updated weights on worker 0-0, policy_version 771842 (0.00089) [2022-07-10 14:57:21,151][26022] Updated weights on worker 0-0, policy_version 771852 (0.00093) [2022-07-10 14:57:22,383][25689] Fps is (10 sec: 5582.4, 60 sec: 5524.5, 300 sec: 5514.4). Total num frames: 790382592. Throughput: 0: 5658.2. Samples: 790389656. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:22,383][25689] Avg episode reward: [(0, '-1.374')] [2022-07-10 14:57:23,118][26022] Updated weights on worker 0-0, policy_version 771862 (0.00778) [2022-07-10 14:57:24,765][26022] Updated weights on worker 0-0, policy_version 771872 (0.00086) [2022-07-10 14:57:26,710][26022] Updated weights on worker 0-0, policy_version 771882 (0.00090) [2022-07-10 14:57:27,411][25689] Fps is (10 sec: 5396.0, 60 sec: 5455.8, 300 sec: 5505.6). Total num frames: 790408192. Throughput: 0: 4943.8. Samples: 790406480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:27,411][25689] Avg episode reward: [(0, '-0.798')] [2022-07-10 14:57:28,508][26022] Updated weights on worker 0-0, policy_version 771892 (0.00094) [2022-07-10 14:57:30,423][26022] Updated weights on worker 0-0, policy_version 771902 (0.00444) [2022-07-10 14:57:32,026][26022] Updated weights on worker 0-0, policy_version 771912 (0.00085) [2022-07-10 14:57:32,471][25689] Fps is (10 sec: 5582.8, 60 sec: 5546.2, 300 sec: 5511.7). Total num frames: 790438912. Throughput: 0: 5781.7. Samples: 790439768. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:32,471][25689] Avg episode reward: [(0, '-1.513')] [2022-07-10 14:57:34,175][26022] Updated weights on worker 0-0, policy_version 771922 (0.00083) [2022-07-10 14:57:35,791][26022] Updated weights on worker 0-0, policy_version 771932 (0.00084) [2022-07-10 14:57:37,515][25689] Fps is (10 sec: 5776.6, 60 sec: 5510.8, 300 sec: 5511.4). Total num frames: 790466560. Throughput: 0: 5815.8. Samples: 790473858. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:37,520][25689] Avg episode reward: [(0, '-1.737')] [2022-07-10 14:57:37,706][26022] Updated weights on worker 0-0, policy_version 771942 (0.00083) [2022-07-10 14:57:39,419][26022] Updated weights on worker 0-0, policy_version 771952 (0.00091) [2022-07-10 14:57:41,370][26022] Updated weights on worker 0-0, policy_version 771962 (0.00094) [2022-07-10 14:57:42,579][25689] Fps is (10 sec: 5571.7, 60 sec: 5529.7, 300 sec: 5513.9). Total num frames: 790495232. Throughput: 0: 5005.0. Samples: 790490738. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:42,579][25689] Avg episode reward: [(0, '-3.000')] [2022-07-10 14:57:43,318][26022] Updated weights on worker 0-0, policy_version 771972 (0.00085) [2022-07-10 14:57:44,909][26022] Updated weights on worker 0-0, policy_version 771982 (0.00087) [2022-07-10 14:57:46,979][26022] Updated weights on worker 0-0, policy_version 771992 (0.00086) [2022-07-10 14:57:47,660][25689] Fps is (10 sec: 5753.2, 60 sec: 5548.6, 300 sec: 5516.7). Total num frames: 790524928. Throughput: 0: 5825.2. Samples: 790524438. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:47,661][25689] Avg episode reward: [(0, '-2.743')] [2022-07-10 14:57:48,651][26022] Updated weights on worker 0-0, policy_version 772002 (0.00087) [2022-07-10 14:57:50,563][26022] Updated weights on worker 0-0, policy_version 772012 (0.00083) [2022-07-10 14:57:52,380][26022] Updated weights on worker 0-0, policy_version 772022 (0.00091) [2022-07-10 14:57:52,720][25689] Fps is (10 sec: 5654.4, 60 sec: 5554.1, 300 sec: 5519.1). Total num frames: 790552576. Throughput: 0: 5845.7. Samples: 790558142. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:52,722][25689] Avg episode reward: [(0, '-1.892')] [2022-07-10 14:57:54,127][26022] Updated weights on worker 0-0, policy_version 772032 (0.00088) [2022-07-10 14:57:55,742][26022] Updated weights on worker 0-0, policy_version 772042 (0.00089) [2022-07-10 14:57:57,736][25689] Fps is (10 sec: 5589.5, 60 sec: 5536.9, 300 sec: 5519.3). Total num frames: 790581248. Throughput: 0: 5015.5. Samples: 790575278. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:57:57,737][25689] Avg episode reward: [(0, '-2.096')] [2022-07-10 14:57:57,738][26022] Updated weights on worker 0-0, policy_version 772052 (0.00085) [2022-07-10 14:57:59,398][26022] Updated weights on worker 0-0, policy_version 772062 (0.00083) [2022-07-10 14:58:01,530][26022] Updated weights on worker 0-0, policy_version 772072 (0.00090) [2022-07-10 14:58:02,767][25689] Fps is (10 sec: 5401.7, 60 sec: 5538.5, 300 sec: 5518.9). Total num frames: 790606848. Throughput: 0: 5850.5. Samples: 790608852. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:02,768][25689] Avg episode reward: [(0, '-1.823')] [2022-07-10 14:58:03,575][26022] Updated weights on worker 0-0, policy_version 772082 (0.00090) [2022-07-10 14:58:05,554][26022] Updated weights on worker 0-0, policy_version 772092 (0.00093) [2022-07-10 14:58:07,227][26022] Updated weights on worker 0-0, policy_version 772102 (0.00076) [2022-07-10 14:58:07,772][25689] Fps is (10 sec: 5305.8, 60 sec: 5542.9, 300 sec: 5519.8). Total num frames: 790634496. Throughput: 0: 5766.7. Samples: 790640416. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:07,773][25689] Avg episode reward: [(0, '-1.672')] [2022-07-10 14:58:09,086][26022] Updated weights on worker 0-0, policy_version 772112 (0.00071) [2022-07-10 14:58:10,913][26022] Updated weights on worker 0-0, policy_version 772122 (0.00093) [2022-07-10 14:58:12,555][26022] Updated weights on worker 0-0, policy_version 772132 (0.00081) [2022-07-10 14:58:12,909][25689] Fps is (10 sec: 5553.4, 60 sec: 5557.9, 300 sec: 5520.8). Total num frames: 790663168. Throughput: 0: 4896.4. Samples: 790656994. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:12,910][25689] Avg episode reward: [(0, '-2.107')] [2022-07-10 14:58:14,486][26022] Updated weights on worker 0-0, policy_version 772142 (0.00087) [2022-07-10 14:58:16,652][26022] Updated weights on worker 0-0, policy_version 772152 (0.00092) [2022-07-10 14:58:17,964][25689] Fps is (10 sec: 5726.6, 60 sec: 5592.1, 300 sec: 5530.1). Total num frames: 790692864. Throughput: 0: 5712.0. Samples: 790690822. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:17,966][25689] Avg episode reward: [(0, '-2.360')] [2022-07-10 14:58:18,198][26022] Updated weights on worker 0-0, policy_version 772162 (0.00103) [2022-07-10 14:58:20,224][26022] Updated weights on worker 0-0, policy_version 772172 (0.00086) [2022-07-10 14:58:21,831][26022] Updated weights on worker 0-0, policy_version 772182 (0.00086) [2022-07-10 14:58:23,002][25689] Fps is (10 sec: 5579.9, 60 sec: 5557.5, 300 sec: 5523.0). Total num frames: 790719488. Throughput: 0: 5688.9. Samples: 790723966. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:23,004][25689] Avg episode reward: [(0, '-2.591')] [2022-07-10 14:58:23,905][26022] Updated weights on worker 0-0, policy_version 772192 (0.00086) [2022-07-10 14:58:25,714][26022] Updated weights on worker 0-0, policy_version 772202 (0.00092) [2022-07-10 14:58:27,523][26022] Updated weights on worker 0-0, policy_version 772212 (0.00092) [2022-07-10 14:58:28,019][25689] Fps is (10 sec: 5499.4, 60 sec: 5609.2, 300 sec: 5527.2). Total num frames: 790748160. Throughput: 0: 4954.2. Samples: 790740726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:28,020][25689] Avg episode reward: [(0, '-1.810')] [2022-07-10 14:58:29,514][26022] Updated weights on worker 0-0, policy_version 772222 (0.00088) [2022-07-10 14:58:31,126][26022] Updated weights on worker 0-0, policy_version 772232 (0.00083) [2022-07-10 14:58:32,881][26022] Updated weights on worker 0-0, policy_version 772242 (0.00087) [2022-07-10 14:58:33,098][25689] Fps is (10 sec: 5578.8, 60 sec: 5556.8, 300 sec: 5530.4). Total num frames: 790775808. Throughput: 0: 5794.0. Samples: 790773968. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:33,100][25689] Avg episode reward: [(0, '-1.446')] [2022-07-10 14:58:34,856][26022] Updated weights on worker 0-0, policy_version 772252 (0.00095) [2022-07-10 14:58:36,537][26022] Updated weights on worker 0-0, policy_version 772262 (0.00091) [2022-07-10 14:58:38,168][25689] Fps is (10 sec: 5448.8, 60 sec: 5554.5, 300 sec: 5525.8). Total num frames: 790803456. Throughput: 0: 5775.4. Samples: 790807504. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:38,168][25689] Avg episode reward: [(0, '-0.435')] [2022-07-10 14:58:38,527][26022] Updated weights on worker 0-0, policy_version 772272 (0.00094) [2022-07-10 14:58:40,290][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 14:58:40,302][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000772281_790815744.pth [2022-07-10 14:58:40,302][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000770337_788825088.pth [2022-07-10 14:58:40,399][26022] Updated weights on worker 0-0, policy_version 772282 (0.00088) [2022-07-10 14:58:42,057][26022] Updated weights on worker 0-0, policy_version 772292 (0.00088) [2022-07-10 14:58:43,187][25689] Fps is (10 sec: 5480.9, 60 sec: 5541.7, 300 sec: 5522.5). Total num frames: 790831104. Throughput: 0: 5809.9. Samples: 790841236. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:43,187][25689] Avg episode reward: [(0, '0.551')] [2022-07-10 14:58:44,012][26022] Updated weights on worker 0-0, policy_version 772302 (0.00088) [2022-07-10 14:58:45,832][26022] Updated weights on worker 0-0, policy_version 772312 (0.00088) [2022-07-10 14:58:47,601][26022] Updated weights on worker 0-0, policy_version 772322 (0.00090) [2022-07-10 14:58:48,206][25689] Fps is (10 sec: 5610.3, 60 sec: 5530.5, 300 sec: 5526.5). Total num frames: 790859776. Throughput: 0: 5808.9. Samples: 790857990. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:48,207][25689] Avg episode reward: [(0, '0.735')] [2022-07-10 14:58:49,510][26022] Updated weights on worker 0-0, policy_version 772332 (0.00089) [2022-07-10 14:58:51,422][26022] Updated weights on worker 0-0, policy_version 772342 (0.00086) [2022-07-10 14:58:53,216][26022] Updated weights on worker 0-0, policy_version 772352 (0.00093) [2022-07-10 14:58:53,285][25689] Fps is (10 sec: 5779.9, 60 sec: 5562.5, 300 sec: 5532.2). Total num frames: 790889472. Throughput: 0: 5788.7. Samples: 790890826. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:53,286][25689] Avg episode reward: [(0, '-0.416')] [2022-07-10 14:58:55,255][26022] Updated weights on worker 0-0, policy_version 772362 (0.00087) [2022-07-10 14:58:56,764][26022] Updated weights on worker 0-0, policy_version 772372 (0.00086) [2022-07-10 14:58:58,339][25689] Fps is (10 sec: 5457.4, 60 sec: 5508.4, 300 sec: 5521.1). Total num frames: 790915072. Throughput: 0: 5791.7. Samples: 790924328. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:58:58,339][25689] Avg episode reward: [(0, '-1.047')] [2022-07-10 14:58:59,056][26022] Updated weights on worker 0-0, policy_version 772382 (0.00084) [2022-07-10 14:59:00,707][26022] Updated weights on worker 0-0, policy_version 772393 (0.00210) [2022-07-10 14:59:03,079][26022] Updated weights on worker 0-0, policy_version 772403 (0.00083) [2022-07-10 14:59:03,399][25689] Fps is (10 sec: 5264.7, 60 sec: 5539.5, 300 sec: 5528.9). Total num frames: 790942720. Throughput: 0: 4946.9. Samples: 790941226. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:03,400][25689] Avg episode reward: [(0, '-1.741')] [2022-07-10 14:59:04,903][26022] Updated weights on worker 0-0, policy_version 772413 (0.00085) [2022-07-10 14:59:06,618][26022] Updated weights on worker 0-0, policy_version 772423 (0.00085) [2022-07-10 14:59:08,471][25689] Fps is (10 sec: 5356.4, 60 sec: 5516.5, 300 sec: 5519.2). Total num frames: 790969344. Throughput: 0: 5669.0. Samples: 790972870. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:08,471][25689] Avg episode reward: [(0, '-2.012')] [2022-07-10 14:59:08,595][26022] Updated weights on worker 0-0, policy_version 772433 (0.00081) [2022-07-10 14:59:10,155][26022] Updated weights on worker 0-0, policy_version 772443 (0.00094) [2022-07-10 14:59:12,369][26022] Updated weights on worker 0-0, policy_version 772453 (0.00082) [2022-07-10 14:59:13,534][25689] Fps is (10 sec: 5658.2, 60 sec: 5557.0, 300 sec: 5529.6). Total num frames: 791000064. Throughput: 0: 5715.2. Samples: 791006552. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:13,535][25689] Avg episode reward: [(0, '-2.092')] [2022-07-10 14:59:13,784][26022] Updated weights on worker 0-0, policy_version 772463 (0.00088) [2022-07-10 14:59:16,041][26022] Updated weights on worker 0-0, policy_version 772473 (0.00088) [2022-07-10 14:59:17,511][26022] Updated weights on worker 0-0, policy_version 772483 (0.00084) [2022-07-10 14:59:18,623][25689] Fps is (10 sec: 5749.5, 60 sec: 5520.2, 300 sec: 5535.7). Total num frames: 791027712. Throughput: 0: 4886.7. Samples: 791023450. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:18,623][25689] Avg episode reward: [(0, '-1.940')] [2022-07-10 14:59:19,692][26022] Updated weights on worker 0-0, policy_version 772493 (0.00091) [2022-07-10 14:59:21,286][26022] Updated weights on worker 0-0, policy_version 772503 (0.00080) [2022-07-10 14:59:23,295][26022] Updated weights on worker 0-0, policy_version 772513 (0.00088) [2022-07-10 14:59:23,644][25689] Fps is (10 sec: 5368.0, 60 sec: 5521.7, 300 sec: 5525.6). Total num frames: 791054336. Throughput: 0: 5700.8. Samples: 791056638. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:23,645][25689] Avg episode reward: [(0, '-1.091')] [2022-07-10 14:59:24,948][26022] Updated weights on worker 0-0, policy_version 772523 (0.00086) [2022-07-10 14:59:26,878][26022] Updated weights on worker 0-0, policy_version 772533 (0.00089) [2022-07-10 14:59:28,643][26022] Updated weights on worker 0-0, policy_version 772543 (0.00084) [2022-07-10 14:59:28,649][25689] Fps is (10 sec: 5617.3, 60 sec: 5539.7, 300 sec: 5536.8). Total num frames: 791084032. Throughput: 0: 5818.8. Samples: 791090282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:28,650][25689] Avg episode reward: [(0, '-1.762')] [2022-07-10 14:59:30,778][26022] Updated weights on worker 0-0, policy_version 772553 (0.00084) [2022-07-10 14:59:32,355][26022] Updated weights on worker 0-0, policy_version 772563 (0.00089) [2022-07-10 14:59:33,738][25689] Fps is (10 sec: 5681.0, 60 sec: 5538.7, 300 sec: 5536.1). Total num frames: 791111680. Throughput: 0: 4974.4. Samples: 791107056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:33,739][25689] Avg episode reward: [(0, '-1.617')] [2022-07-10 14:59:34,356][26022] Updated weights on worker 0-0, policy_version 772573 (0.00085) [2022-07-10 14:59:35,934][26022] Updated weights on worker 0-0, policy_version 772583 (0.00093) [2022-07-10 14:59:38,053][26022] Updated weights on worker 0-0, policy_version 772593 (0.00083) [2022-07-10 14:59:38,792][25689] Fps is (10 sec: 5653.8, 60 sec: 5574.0, 300 sec: 5535.2). Total num frames: 791141376. Throughput: 0: 5810.4. Samples: 791140638. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:38,792][25689] Avg episode reward: [(0, '-1.542')] [2022-07-10 14:59:39,867][26022] Updated weights on worker 0-0, policy_version 772603 (0.00087) [2022-07-10 14:59:41,558][26022] Updated weights on worker 0-0, policy_version 772613 (0.00090) [2022-07-10 14:59:43,594][26022] Updated weights on worker 0-0, policy_version 772623 (0.00085) [2022-07-10 14:59:43,833][25689] Fps is (10 sec: 5579.3, 60 sec: 5555.1, 300 sec: 5534.8). Total num frames: 791168000. Throughput: 0: 5817.3. Samples: 791174080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:43,833][25689] Avg episode reward: [(0, '-1.568')] [2022-07-10 14:59:45,190][26022] Updated weights on worker 0-0, policy_version 772633 (0.00090) [2022-07-10 14:59:47,091][26022] Updated weights on worker 0-0, policy_version 772643 (0.00079) [2022-07-10 14:59:48,883][25689] Fps is (10 sec: 5378.0, 60 sec: 5535.4, 300 sec: 5536.0). Total num frames: 791195648. Throughput: 0: 4976.7. Samples: 791190980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:48,884][25689] Avg episode reward: [(0, '-2.041')] [2022-07-10 14:59:48,938][26022] Updated weights on worker 0-0, policy_version 772653 (0.00077) [2022-07-10 14:59:50,756][26022] Updated weights on worker 0-0, policy_version 772663 (0.00092) [2022-07-10 14:59:52,581][26022] Updated weights on worker 0-0, policy_version 772673 (0.00093) [2022-07-10 14:59:53,934][25689] Fps is (10 sec: 5575.5, 60 sec: 5521.1, 300 sec: 5531.8). Total num frames: 791224320. Throughput: 0: 5813.9. Samples: 791224472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:53,935][25689] Avg episode reward: [(0, '-1.359')] [2022-07-10 14:59:54,398][26022] Updated weights on worker 0-0, policy_version 772683 (0.00092) [2022-07-10 14:59:56,295][26022] Updated weights on worker 0-0, policy_version 772693 (0.00090) [2022-07-10 14:59:57,922][26022] Updated weights on worker 0-0, policy_version 772703 (0.00087) [2022-07-10 14:59:58,972][25689] Fps is (10 sec: 5684.0, 60 sec: 5573.2, 300 sec: 5541.7). Total num frames: 791252992. Throughput: 0: 5815.3. Samples: 791257992. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 14:59:58,972][25689] Avg episode reward: [(0, '-1.138')] [2022-07-10 15:00:00,042][26022] Updated weights on worker 0-0, policy_version 772713 (0.00089) [2022-07-10 15:00:02,081][26022] Updated weights on worker 0-0, policy_version 772723 (0.00096) [2022-07-10 15:00:03,990][25689] Fps is (10 sec: 5295.5, 60 sec: 5526.5, 300 sec: 5535.3). Total num frames: 791277568. Throughput: 0: 4989.0. Samples: 791274642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:00:03,990][25689] Avg episode reward: [(0, '-0.685')] [2022-07-10 15:00:04,008][26022] Updated weights on worker 0-0, policy_version 772733 (0.00082) [2022-07-10 15:00:05,622][26022] Updated weights on worker 0-0, policy_version 772743 (0.00089) [2022-07-10 15:00:07,683][26022] Updated weights on worker 0-0, policy_version 772753 (0.00085) [2022-07-10 15:00:08,999][25689] Fps is (10 sec: 5412.5, 60 sec: 5582.9, 300 sec: 5536.3). Total num frames: 791307264. Throughput: 0: 5733.6. Samples: 791306316. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:08,999][25689] Avg episode reward: [(0, '-0.545')] [2022-07-10 15:00:09,505][26022] Updated weights on worker 0-0, policy_version 772763 (0.00087) [2022-07-10 15:00:11,223][26022] Updated weights on worker 0-0, policy_version 772773 (0.00083) [2022-07-10 15:00:12,991][26022] Updated weights on worker 0-0, policy_version 772783 (0.00098) [2022-07-10 15:00:14,137][25689] Fps is (10 sec: 5550.4, 60 sec: 5508.4, 300 sec: 5537.3). Total num frames: 791333888. Throughput: 0: 5728.8. Samples: 791340208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:14,137][25689] Avg episode reward: [(0, '-0.151')] [2022-07-10 15:00:14,773][26022] Updated weights on worker 0-0, policy_version 772793 (0.00102) [2022-07-10 15:00:16,853][26022] Updated weights on worker 0-0, policy_version 772803 (0.00103) [2022-07-10 15:00:18,536][26022] Updated weights on worker 0-0, policy_version 772813 (0.00098) [2022-07-10 15:00:19,178][25689] Fps is (10 sec: 5532.9, 60 sec: 5546.5, 300 sec: 5540.2). Total num frames: 791363584. Throughput: 0: 4901.8. Samples: 791357040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:19,179][25689] Avg episode reward: [(0, '0.085')] [2022-07-10 15:00:20,275][26022] Updated weights on worker 0-0, policy_version 772823 (0.00090) [2022-07-10 15:00:22,300][26022] Updated weights on worker 0-0, policy_version 772833 (0.00084) [2022-07-10 15:00:24,185][26022] Updated weights on worker 0-0, policy_version 772843 (0.00086) [2022-07-10 15:00:24,222][25689] Fps is (10 sec: 5686.1, 60 sec: 5561.4, 300 sec: 5530.2). Total num frames: 791391232. Throughput: 0: 5715.1. Samples: 791390270. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:24,222][25689] Avg episode reward: [(0, '0.350')] [2022-07-10 15:00:25,935][26022] Updated weights on worker 0-0, policy_version 772853 (0.00095) [2022-07-10 15:00:27,779][26022] Updated weights on worker 0-0, policy_version 772863 (0.00082) [2022-07-10 15:00:29,237][25689] Fps is (10 sec: 5599.0, 60 sec: 5543.5, 300 sec: 5539.3). Total num frames: 791419904. Throughput: 0: 5818.6. Samples: 791424074. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:29,238][25689] Avg episode reward: [(0, '0.525')] [2022-07-10 15:00:29,543][26022] Updated weights on worker 0-0, policy_version 772873 (0.00955) [2022-07-10 15:00:31,420][26022] Updated weights on worker 0-0, policy_version 772883 (0.00092) [2022-07-10 15:00:33,358][26022] Updated weights on worker 0-0, policy_version 772893 (0.00087) [2022-07-10 15:00:34,310][25689] Fps is (10 sec: 5684.0, 60 sec: 5561.9, 300 sec: 5535.9). Total num frames: 791448576. Throughput: 0: 5820.0. Samples: 791457618. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:34,311][25689] Avg episode reward: [(0, '0.611')] [2022-07-10 15:00:35,055][26022] Updated weights on worker 0-0, policy_version 772903 (0.00108) [2022-07-10 15:00:36,939][26022] Updated weights on worker 0-0, policy_version 772913 (0.00087) [2022-07-10 15:00:38,655][26022] Updated weights on worker 0-0, policy_version 772923 (0.00090) [2022-07-10 15:00:39,378][25689] Fps is (10 sec: 5452.8, 60 sec: 5509.9, 300 sec: 5528.7). Total num frames: 791475200. Throughput: 0: 5820.0. Samples: 791474604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:39,378][25689] Avg episode reward: [(0, '0.358')] [2022-07-10 15:00:40,435][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:00:40,448][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000772932_791482368.pth [2022-07-10 15:00:40,451][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000770981_789484544.pth [2022-07-10 15:00:40,692][26022] Updated weights on worker 0-0, policy_version 772933 (0.00093) [2022-07-10 15:00:42,578][26022] Updated weights on worker 0-0, policy_version 772943 (0.00085) [2022-07-10 15:00:44,327][26022] Updated weights on worker 0-0, policy_version 772953 (0.00089) [2022-07-10 15:00:44,424][25689] Fps is (10 sec: 5467.1, 60 sec: 5543.2, 300 sec: 5534.8). Total num frames: 791503872. Throughput: 0: 5827.3. Samples: 791507998. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:44,425][25689] Avg episode reward: [(0, '0.503')] [2022-07-10 15:00:46,019][26022] Updated weights on worker 0-0, policy_version 772963 (0.00095) [2022-07-10 15:00:47,955][26022] Updated weights on worker 0-0, policy_version 772973 (0.00088) [2022-07-10 15:00:49,428][25689] Fps is (10 sec: 5705.9, 60 sec: 5564.4, 300 sec: 5536.0). Total num frames: 791532544. Throughput: 0: 5811.6. Samples: 791541416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:49,429][25689] Avg episode reward: [(0, '0.674')] [2022-07-10 15:00:49,788][26022] Updated weights on worker 0-0, policy_version 772983 (0.00093) [2022-07-10 15:00:51,603][26022] Updated weights on worker 0-0, policy_version 772993 (0.00086) [2022-07-10 15:00:53,567][26022] Updated weights on worker 0-0, policy_version 773003 (0.00098) [2022-07-10 15:00:54,493][25689] Fps is (10 sec: 5492.0, 60 sec: 5529.3, 300 sec: 5534.9). Total num frames: 791559168. Throughput: 0: 4979.0. Samples: 791558108. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:54,493][25689] Avg episode reward: [(0, '0.704')] [2022-07-10 15:00:55,179][26022] Updated weights on worker 0-0, policy_version 773013 (0.00062) [2022-07-10 15:00:57,247][26022] Updated weights on worker 0-0, policy_version 773023 (0.00087) [2022-07-10 15:00:58,815][26022] Updated weights on worker 0-0, policy_version 773033 (0.00084) [2022-07-10 15:00:59,512][25689] Fps is (10 sec: 5585.0, 60 sec: 5547.9, 300 sec: 5538.5). Total num frames: 791588864. Throughput: 0: 5809.6. Samples: 791591576. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:00:59,513][25689] Avg episode reward: [(0, '0.806')] [2022-07-10 15:01:00,896][26022] Updated weights on worker 0-0, policy_version 773043 (0.00095) [2022-07-10 15:01:03,101][26022] Updated weights on worker 0-0, policy_version 773053 (0.00088) [2022-07-10 15:01:04,525][25689] Fps is (10 sec: 5307.7, 60 sec: 5531.4, 300 sec: 5535.2). Total num frames: 791612416. Throughput: 0: 5694.4. Samples: 791622460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:04,527][25689] Avg episode reward: [(0, '0.733')] [2022-07-10 15:01:04,984][26022] Updated weights on worker 0-0, policy_version 773063 (0.00089) [2022-07-10 15:01:07,039][26022] Updated weights on worker 0-0, policy_version 773073 (0.00087) [2022-07-10 15:01:08,559][26022] Updated weights on worker 0-0, policy_version 773083 (0.00089) [2022-07-10 15:01:09,544][25689] Fps is (10 sec: 5307.7, 60 sec: 5530.5, 300 sec: 5539.3). Total num frames: 791642112. Throughput: 0: 4868.4. Samples: 791639350. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:09,546][25689] Avg episode reward: [(0, '0.906')] [2022-07-10 15:01:10,706][26022] Updated weights on worker 0-0, policy_version 773093 (0.00085) [2022-07-10 15:01:12,391][26022] Updated weights on worker 0-0, policy_version 773103 (0.00087) [2022-07-10 15:01:14,176][26022] Updated weights on worker 0-0, policy_version 773113 (0.00085) [2022-07-10 15:01:14,599][25689] Fps is (10 sec: 5692.5, 60 sec: 5555.1, 300 sec: 5538.5). Total num frames: 791669760. Throughput: 0: 5699.0. Samples: 791672690. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:14,599][25689] Avg episode reward: [(0, '0.604')] [2022-07-10 15:01:16,051][26022] Updated weights on worker 0-0, policy_version 773123 (0.00092) [2022-07-10 15:01:17,803][26022] Updated weights on worker 0-0, policy_version 773133 (0.00092) [2022-07-10 15:01:19,588][26022] Updated weights on worker 0-0, policy_version 773143 (0.00091) [2022-07-10 15:01:19,601][25689] Fps is (10 sec: 5599.9, 60 sec: 5541.7, 300 sec: 5542.4). Total num frames: 791698432. Throughput: 0: 5711.6. Samples: 791706318. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:19,602][25689] Avg episode reward: [(0, '0.565')] [2022-07-10 15:01:21,524][26022] Updated weights on worker 0-0, policy_version 773153 (0.00086) [2022-07-10 15:01:23,343][26022] Updated weights on worker 0-0, policy_version 773163 (0.00099) [2022-07-10 15:01:24,611][25689] Fps is (10 sec: 5523.0, 60 sec: 5527.9, 300 sec: 5532.1). Total num frames: 791725056. Throughput: 0: 5010.3. Samples: 791723094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:24,611][25689] Avg episode reward: [(0, '0.022')] [2022-07-10 15:01:25,337][26022] Updated weights on worker 0-0, policy_version 773173 (0.00085) [2022-07-10 15:01:26,987][26022] Updated weights on worker 0-0, policy_version 773183 (0.00086) [2022-07-10 15:01:29,012][26022] Updated weights on worker 0-0, policy_version 773193 (0.00089) [2022-07-10 15:01:29,618][25689] Fps is (10 sec: 5520.6, 60 sec: 5528.6, 300 sec: 5544.7). Total num frames: 791753728. Throughput: 0: 5824.6. Samples: 791756270. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:29,618][25689] Avg episode reward: [(0, '-0.355')] [2022-07-10 15:01:30,806][26022] Updated weights on worker 0-0, policy_version 773203 (0.00101) [2022-07-10 15:01:32,689][26022] Updated weights on worker 0-0, policy_version 773213 (0.00091) [2022-07-10 15:01:34,355][26022] Updated weights on worker 0-0, policy_version 773223 (0.00727) [2022-07-10 15:01:34,701][25689] Fps is (10 sec: 5581.6, 60 sec: 5510.8, 300 sec: 5536.7). Total num frames: 791781376. Throughput: 0: 5812.4. Samples: 791789532. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:34,701][25689] Avg episode reward: [(0, '-0.201')] [2022-07-10 15:01:36,314][26022] Updated weights on worker 0-0, policy_version 773233 (0.00901) [2022-07-10 15:01:38,075][26022] Updated weights on worker 0-0, policy_version 773243 (0.00087) [2022-07-10 15:01:39,786][25689] Fps is (10 sec: 5438.0, 60 sec: 5526.1, 300 sec: 5536.8). Total num frames: 791809024. Throughput: 0: 4957.1. Samples: 791806376. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:39,787][25689] Avg episode reward: [(0, '-0.099')] [2022-07-10 15:01:39,927][26022] Updated weights on worker 0-0, policy_version 773253 (0.00087) [2022-07-10 15:01:41,831][26022] Updated weights on worker 0-0, policy_version 773263 (0.00088) [2022-07-10 15:01:43,494][26022] Updated weights on worker 0-0, policy_version 773273 (0.00083) [2022-07-10 15:01:44,809][25689] Fps is (10 sec: 5572.0, 60 sec: 5528.3, 300 sec: 5538.3). Total num frames: 791837696. Throughput: 0: 5789.1. Samples: 791840022. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:44,809][25689] Avg episode reward: [(0, '0.471')] [2022-07-10 15:01:45,589][26022] Updated weights on worker 0-0, policy_version 773283 (0.00096) [2022-07-10 15:01:47,102][26022] Updated weights on worker 0-0, policy_version 773293 (0.00089) [2022-07-10 15:01:49,205][26022] Updated weights on worker 0-0, policy_version 773303 (0.00078) [2022-07-10 15:01:49,868][25689] Fps is (10 sec: 5687.9, 60 sec: 5523.2, 300 sec: 5542.8). Total num frames: 791866368. Throughput: 0: 5787.6. Samples: 791873470. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:49,868][25689] Avg episode reward: [(0, '0.954')] [2022-07-10 15:01:50,979][26022] Updated weights on worker 0-0, policy_version 773313 (0.00476) [2022-07-10 15:01:52,892][26022] Updated weights on worker 0-0, policy_version 773323 (0.00089) [2022-07-10 15:01:54,630][26022] Updated weights on worker 0-0, policy_version 773333 (0.00096) [2022-07-10 15:01:54,931][25689] Fps is (10 sec: 5665.1, 60 sec: 5557.3, 300 sec: 5538.5). Total num frames: 791895040. Throughput: 0: 4978.7. Samples: 791890250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:54,931][25689] Avg episode reward: [(0, '0.442')] [2022-07-10 15:01:56,588][26022] Updated weights on worker 0-0, policy_version 773343 (0.00544) [2022-07-10 15:01:58,356][26022] Updated weights on worker 0-0, policy_version 773353 (0.00079) [2022-07-10 15:01:59,964][25689] Fps is (10 sec: 5477.1, 60 sec: 5505.2, 300 sec: 5542.2). Total num frames: 791921664. Throughput: 0: 5805.5. Samples: 791923516. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:01:59,964][25689] Avg episode reward: [(0, '0.633')] [2022-07-10 15:02:00,187][26022] Updated weights on worker 0-0, policy_version 773363 (0.00536) [2022-07-10 15:02:02,308][26022] Updated weights on worker 0-0, policy_version 773373 (0.00092) [2022-07-10 15:02:04,271][26022] Updated weights on worker 0-0, policy_version 773383 (0.00089) [2022-07-10 15:02:04,987][25689] Fps is (10 sec: 5294.9, 60 sec: 5555.1, 300 sec: 5539.3). Total num frames: 791948288. Throughput: 0: 5697.3. Samples: 791954986. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:04,988][25689] Avg episode reward: [(0, '-0.889')] [2022-07-10 15:02:05,985][26022] Updated weights on worker 0-0, policy_version 773393 (0.00085) [2022-07-10 15:02:07,974][26022] Updated weights on worker 0-0, policy_version 773403 (0.00085) [2022-07-10 15:02:09,699][26022] Updated weights on worker 0-0, policy_version 773413 (0.00092) [2022-07-10 15:02:09,989][25689] Fps is (10 sec: 5311.5, 60 sec: 5505.9, 300 sec: 5538.0). Total num frames: 791974912. Throughput: 0: 4879.3. Samples: 791971646. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:09,990][25689] Avg episode reward: [(0, '-1.096')] [2022-07-10 15:02:11,712][26022] Updated weights on worker 0-0, policy_version 773423 (0.00085) [2022-07-10 15:02:13,398][26022] Updated weights on worker 0-0, policy_version 773433 (0.00084) [2022-07-10 15:02:15,068][25689] Fps is (10 sec: 5485.1, 60 sec: 5520.5, 300 sec: 5541.1). Total num frames: 792003584. Throughput: 0: 5714.1. Samples: 792005318. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:15,071][25689] Avg episode reward: [(0, '-0.871')] [2022-07-10 15:02:15,228][26022] Updated weights on worker 0-0, policy_version 773443 (0.00095) [2022-07-10 15:02:16,995][26022] Updated weights on worker 0-0, policy_version 773453 (0.00095) [2022-07-10 15:02:18,923][26022] Updated weights on worker 0-0, policy_version 773463 (0.00085) [2022-07-10 15:02:20,129][25689] Fps is (10 sec: 5655.2, 60 sec: 5515.2, 300 sec: 5540.5). Total num frames: 792032256. Throughput: 0: 5722.6. Samples: 792038914. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:20,130][25689] Avg episode reward: [(0, '-0.877')] [2022-07-10 15:02:20,746][26022] Updated weights on worker 0-0, policy_version 773473 (0.00095) [2022-07-10 15:02:22,774][26022] Updated weights on worker 0-0, policy_version 773483 (0.00088) [2022-07-10 15:02:24,462][26022] Updated weights on worker 0-0, policy_version 773493 (0.00480) [2022-07-10 15:02:25,179][25689] Fps is (10 sec: 5570.5, 60 sec: 5528.4, 300 sec: 5547.0). Total num frames: 792059904. Throughput: 0: 4984.3. Samples: 792055626. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:25,179][25689] Avg episode reward: [(0, '-0.374')] [2022-07-10 15:02:26,316][26022] Updated weights on worker 0-0, policy_version 773503 (0.00090) [2022-07-10 15:02:28,218][26022] Updated weights on worker 0-0, policy_version 773513 (0.00088) [2022-07-10 15:02:29,976][26022] Updated weights on worker 0-0, policy_version 773523 (0.00091) [2022-07-10 15:02:30,200][25689] Fps is (10 sec: 5694.0, 60 sec: 5544.1, 300 sec: 5544.3). Total num frames: 792089600. Throughput: 0: 5803.3. Samples: 792088938. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:30,201][25689] Avg episode reward: [(0, '0.190')] [2022-07-10 15:02:31,796][26022] Updated weights on worker 0-0, policy_version 773533 (0.00086) [2022-07-10 15:02:33,912][26022] Updated weights on worker 0-0, policy_version 773543 (0.00092) [2022-07-10 15:02:35,320][25689] Fps is (10 sec: 5553.8, 60 sec: 5523.8, 300 sec: 5539.4). Total num frames: 792116224. Throughput: 0: 5761.4. Samples: 792121994. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:35,320][25689] Avg episode reward: [(0, '0.950')] [2022-07-10 15:02:35,415][26022] Updated weights on worker 0-0, policy_version 773553 (0.00089) [2022-07-10 15:02:37,443][26022] Updated weights on worker 0-0, policy_version 773563 (0.00081) [2022-07-10 15:02:39,180][26022] Updated weights on worker 0-0, policy_version 773573 (0.00089) [2022-07-10 15:02:40,321][25689] Fps is (10 sec: 5362.4, 60 sec: 5531.5, 300 sec: 5537.1). Total num frames: 792143872. Throughput: 0: 4955.8. Samples: 792138982. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:40,321][25689] Avg episode reward: [(0, '1.132')] [2022-07-10 15:02:40,458][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:02:40,470][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000773579_792144896.pth [2022-07-10 15:02:40,470][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000771630_790149120.pth [2022-07-10 15:02:40,929][26022] Updated weights on worker 0-0, policy_version 773583 (0.00089) [2022-07-10 15:02:43,104][26022] Updated weights on worker 0-0, policy_version 773593 (0.00091) [2022-07-10 15:02:44,812][26022] Updated weights on worker 0-0, policy_version 773603 (0.00085) [2022-07-10 15:02:45,375][25689] Fps is (10 sec: 5600.9, 60 sec: 5528.6, 300 sec: 5534.2). Total num frames: 792172544. Throughput: 0: 5765.5. Samples: 792172068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:45,376][25689] Avg episode reward: [(0, '-0.005')] [2022-07-10 15:02:46,655][26022] Updated weights on worker 0-0, policy_version 773613 (0.00091) [2022-07-10 15:02:48,539][26022] Updated weights on worker 0-0, policy_version 773623 (0.00089) [2022-07-10 15:02:50,378][25689] Fps is (10 sec: 5498.5, 60 sec: 5500.0, 300 sec: 5531.8). Total num frames: 792199168. Throughput: 0: 5778.4. Samples: 792205532. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:50,378][25689] Avg episode reward: [(0, '-0.370')] [2022-07-10 15:02:50,566][26022] Updated weights on worker 0-0, policy_version 773633 (0.00091) [2022-07-10 15:02:52,234][26022] Updated weights on worker 0-0, policy_version 773643 (0.00087) [2022-07-10 15:02:54,083][26022] Updated weights on worker 0-0, policy_version 773653 (0.00100) [2022-07-10 15:02:55,488][25689] Fps is (10 sec: 5568.8, 60 sec: 5512.5, 300 sec: 5533.5). Total num frames: 792228864. Throughput: 0: 4968.8. Samples: 792222210. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:02:55,489][25689] Avg episode reward: [(0, '-1.310')] [2022-07-10 15:02:55,901][26022] Updated weights on worker 0-0, policy_version 773663 (0.00094) [2022-07-10 15:02:57,880][26022] Updated weights on worker 0-0, policy_version 773673 (0.00085) [2022-07-10 15:02:59,721][26022] Updated weights on worker 0-0, policy_version 773683 (0.00086) [2022-07-10 15:03:00,519][25689] Fps is (10 sec: 5553.5, 60 sec: 5512.7, 300 sec: 5537.0). Total num frames: 792255488. Throughput: 0: 5764.7. Samples: 792255416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:00,519][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 15:03:01,422][26022] Updated weights on worker 0-0, policy_version 773693 (0.00083) [2022-07-10 15:03:03,796][26022] Updated weights on worker 0-0, policy_version 773703 (0.00094) [2022-07-10 15:03:05,478][26022] Updated weights on worker 0-0, policy_version 773713 (0.00085) [2022-07-10 15:03:05,579][25689] Fps is (10 sec: 5277.2, 60 sec: 5509.4, 300 sec: 5532.5). Total num frames: 792282112. Throughput: 0: 5679.2. Samples: 792286808. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:05,579][25689] Avg episode reward: [(0, '-1.922')] [2022-07-10 15:03:07,520][26022] Updated weights on worker 0-0, policy_version 773723 (0.00085) [2022-07-10 15:03:09,172][26022] Updated weights on worker 0-0, policy_version 773733 (0.00091) [2022-07-10 15:03:10,596][25689] Fps is (10 sec: 5284.0, 60 sec: 5508.0, 300 sec: 5527.8). Total num frames: 792308736. Throughput: 0: 4850.2. Samples: 792303596. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:10,597][25689] Avg episode reward: [(0, '-2.454')] [2022-07-10 15:03:11,093][26022] Updated weights on worker 0-0, policy_version 773743 (0.00128) [2022-07-10 15:03:12,901][26022] Updated weights on worker 0-0, policy_version 773753 (0.00090) [2022-07-10 15:03:14,767][26022] Updated weights on worker 0-0, policy_version 773763 (0.00088) [2022-07-10 15:03:15,644][25689] Fps is (10 sec: 5595.5, 60 sec: 5527.8, 300 sec: 5528.0). Total num frames: 792338432. Throughput: 0: 5693.1. Samples: 792336956. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:15,644][25689] Avg episode reward: [(0, '-1.543')] [2022-07-10 15:03:16,538][26022] Updated weights on worker 0-0, policy_version 773773 (0.00090) [2022-07-10 15:03:18,418][26022] Updated weights on worker 0-0, policy_version 773783 (0.00088) [2022-07-10 15:03:20,140][26022] Updated weights on worker 0-0, policy_version 773793 (0.00086) [2022-07-10 15:03:20,667][25689] Fps is (10 sec: 5693.8, 60 sec: 5514.3, 300 sec: 5531.7). Total num frames: 792366080. Throughput: 0: 5719.1. Samples: 792370648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:20,668][25689] Avg episode reward: [(0, '-1.075')] [2022-07-10 15:03:22,015][26022] Updated weights on worker 0-0, policy_version 773803 (0.00098) [2022-07-10 15:03:23,914][26022] Updated weights on worker 0-0, policy_version 773813 (0.00091) [2022-07-10 15:03:25,677][25689] Fps is (10 sec: 5511.1, 60 sec: 5517.9, 300 sec: 5528.4). Total num frames: 792393728. Throughput: 0: 5006.6. Samples: 792387434. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:25,678][25689] Avg episode reward: [(0, '-0.557')] [2022-07-10 15:03:25,728][26022] Updated weights on worker 0-0, policy_version 773823 (0.00095) [2022-07-10 15:03:27,459][26022] Updated weights on worker 0-0, policy_version 773833 (0.00087) [2022-07-10 15:03:29,523][26022] Updated weights on worker 0-0, policy_version 773843 (0.00086) [2022-07-10 15:03:30,717][25689] Fps is (10 sec: 5705.9, 60 sec: 5516.2, 300 sec: 5536.0). Total num frames: 792423424. Throughput: 0: 5826.6. Samples: 792420834. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:30,719][25689] Avg episode reward: [(0, '-0.779')] [2022-07-10 15:03:31,153][26022] Updated weights on worker 0-0, policy_version 773853 (0.01090) [2022-07-10 15:03:33,095][26022] Updated weights on worker 0-0, policy_version 773863 (0.00088) [2022-07-10 15:03:34,956][26022] Updated weights on worker 0-0, policy_version 773873 (0.00088) [2022-07-10 15:03:35,777][25689] Fps is (10 sec: 5576.6, 60 sec: 5521.7, 300 sec: 5532.7). Total num frames: 792450048. Throughput: 0: 5821.6. Samples: 792454162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:03:35,777][25689] Avg episode reward: [(0, '-1.119')] [2022-07-10 15:03:36,742][26022] Updated weights on worker 0-0, policy_version 773883 (0.00085) [2022-07-10 15:03:38,655][26022] Updated weights on worker 0-0, policy_version 773893 (0.00086) [2022-07-10 15:03:40,200][26022] Updated weights on worker 0-0, policy_version 773903 (0.00098) [2022-07-10 15:03:40,791][25689] Fps is (10 sec: 5489.3, 60 sec: 5537.4, 300 sec: 5536.3). Total num frames: 792478720. Throughput: 0: 5835.7. Samples: 792488082. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:03:40,791][25689] Avg episode reward: [(0, '-0.988')] [2022-07-10 15:03:42,291][26022] Updated weights on worker 0-0, policy_version 773913 (0.00086) [2022-07-10 15:03:44,035][26022] Updated weights on worker 0-0, policy_version 773923 (0.00093) [2022-07-10 15:03:45,821][25689] Fps is (10 sec: 5607.0, 60 sec: 5522.7, 300 sec: 5532.6). Total num frames: 792506368. Throughput: 0: 5826.9. Samples: 792504810. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:03:45,822][25689] Avg episode reward: [(0, '-0.970')] [2022-07-10 15:03:45,844][26022] Updated weights on worker 0-0, policy_version 773933 (0.00094) [2022-07-10 15:03:47,742][26022] Updated weights on worker 0-0, policy_version 773943 (0.00086) [2022-07-10 15:03:49,513][26022] Updated weights on worker 0-0, policy_version 773953 (0.00087) [2022-07-10 15:03:50,835][25689] Fps is (10 sec: 5505.4, 60 sec: 5538.6, 300 sec: 5527.0). Total num frames: 792534016. Throughput: 0: 5846.1. Samples: 792538442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:03:50,835][25689] Avg episode reward: [(0, '-1.012')] [2022-07-10 15:03:51,440][26022] Updated weights on worker 0-0, policy_version 773963 (0.00084) [2022-07-10 15:03:53,463][26022] Updated weights on worker 0-0, policy_version 773973 (0.00096) [2022-07-10 15:03:54,943][26022] Updated weights on worker 0-0, policy_version 773983 (0.00090) [2022-07-10 15:03:55,884][25689] Fps is (10 sec: 5597.0, 60 sec: 5527.3, 300 sec: 5537.4). Total num frames: 792562688. Throughput: 0: 5826.8. Samples: 792571322. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:03:55,884][25689] Avg episode reward: [(0, '-0.624')] [2022-07-10 15:03:57,201][26022] Updated weights on worker 0-0, policy_version 773993 (0.00091) [2022-07-10 15:03:59,007][26022] Updated weights on worker 0-0, policy_version 774003 (0.00093) [2022-07-10 15:04:00,764][26022] Updated weights on worker 0-0, policy_version 774013 (0.00087) [2022-07-10 15:04:00,888][25689] Fps is (10 sec: 5500.2, 60 sec: 5529.7, 300 sec: 5535.0). Total num frames: 792589312. Throughput: 0: 4960.2. Samples: 792587770. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:00,889][25689] Avg episode reward: [(0, '0.452')] [2022-07-10 15:04:03,069][26022] Updated weights on worker 0-0, policy_version 774023 (0.00087) [2022-07-10 15:04:04,779][26022] Updated weights on worker 0-0, policy_version 774033 (0.00086) [2022-07-10 15:04:05,919][25689] Fps is (10 sec: 5204.2, 60 sec: 5515.4, 300 sec: 5532.3). Total num frames: 792614912. Throughput: 0: 5699.7. Samples: 792619360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:05,919][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 15:04:06,454][26022] Updated weights on worker 0-0, policy_version 774043 (0.00106) [2022-07-10 15:04:08,513][26022] Updated weights on worker 0-0, policy_version 774053 (0.00084) [2022-07-10 15:04:10,228][26022] Updated weights on worker 0-0, policy_version 774063 (0.00097) [2022-07-10 15:04:10,931][25689] Fps is (10 sec: 5404.3, 60 sec: 5549.9, 300 sec: 5526.4). Total num frames: 792643584. Throughput: 0: 5677.4. Samples: 792652534. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:10,931][25689] Avg episode reward: [(0, '-0.162')] [2022-07-10 15:04:12,215][26022] Updated weights on worker 0-0, policy_version 774073 (0.00083) [2022-07-10 15:04:13,849][26022] Updated weights on worker 0-0, policy_version 774083 (0.00085) [2022-07-10 15:04:15,771][26022] Updated weights on worker 0-0, policy_version 774093 (0.00089) [2022-07-10 15:04:16,039][25689] Fps is (10 sec: 5767.7, 60 sec: 5544.3, 300 sec: 5532.9). Total num frames: 792673280. Throughput: 0: 4860.2. Samples: 792669278. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:16,039][25689] Avg episode reward: [(0, '-1.624')] [2022-07-10 15:04:17,803][26022] Updated weights on worker 0-0, policy_version 774103 (0.00090) [2022-07-10 15:04:19,240][26022] Updated weights on worker 0-0, policy_version 774113 (0.00085) [2022-07-10 15:04:21,059][25689] Fps is (10 sec: 5661.8, 60 sec: 5544.6, 300 sec: 5536.4). Total num frames: 792700928. Throughput: 0: 5729.0. Samples: 792703328. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:21,059][25689] Avg episode reward: [(0, '-1.972')] [2022-07-10 15:04:21,142][26022] Updated weights on worker 0-0, policy_version 774123 (0.00091) [2022-07-10 15:04:23,147][26022] Updated weights on worker 0-0, policy_version 774133 (0.00080) [2022-07-10 15:04:24,840][26022] Updated weights on worker 0-0, policy_version 774143 (0.00089) [2022-07-10 15:04:26,083][25689] Fps is (10 sec: 5403.3, 60 sec: 5526.4, 300 sec: 5525.7). Total num frames: 792727552. Throughput: 0: 5827.7. Samples: 792736870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:26,083][25689] Avg episode reward: [(0, '-2.903')] [2022-07-10 15:04:26,936][26022] Updated weights on worker 0-0, policy_version 774153 (0.00091) [2022-07-10 15:04:28,411][26022] Updated weights on worker 0-0, policy_version 774163 (0.00089) [2022-07-10 15:04:30,558][26022] Updated weights on worker 0-0, policy_version 774173 (0.00103) [2022-07-10 15:04:31,085][25689] Fps is (10 sec: 5617.3, 60 sec: 5529.8, 300 sec: 5534.2). Total num frames: 792757248. Throughput: 0: 5009.5. Samples: 792753498. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:31,086][25689] Avg episode reward: [(0, '-2.907')] [2022-07-10 15:04:32,407][26022] Updated weights on worker 0-0, policy_version 774183 (0.00084) [2022-07-10 15:04:34,110][26022] Updated weights on worker 0-0, policy_version 774193 (0.00090) [2022-07-10 15:04:36,100][26022] Updated weights on worker 0-0, policy_version 774203 (0.00093) [2022-07-10 15:04:36,209][25689] Fps is (10 sec: 5561.8, 60 sec: 5523.9, 300 sec: 5522.5). Total num frames: 792783872. Throughput: 0: 5837.1. Samples: 792787016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:36,210][25689] Avg episode reward: [(0, '-3.623')] [2022-07-10 15:04:37,663][26022] Updated weights on worker 0-0, policy_version 774213 (0.00097) [2022-07-10 15:04:39,684][26022] Updated weights on worker 0-0, policy_version 774223 (0.00087) [2022-07-10 15:04:40,488][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:04:40,497][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000774227_792808448.pth [2022-07-10 15:04:40,498][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000772281_790815744.pth [2022-07-10 15:04:41,279][25689] Fps is (10 sec: 5424.6, 60 sec: 5518.8, 300 sec: 5528.9). Total num frames: 792812544. Throughput: 0: 5792.0. Samples: 792820442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:41,279][25689] Avg episode reward: [(0, '-3.312')] [2022-07-10 15:04:41,503][26022] Updated weights on worker 0-0, policy_version 774233 (0.00080) [2022-07-10 15:04:43,099][26022] Updated weights on worker 0-0, policy_version 774243 (0.00081) [2022-07-10 15:04:45,174][26022] Updated weights on worker 0-0, policy_version 774253 (0.00093) [2022-07-10 15:04:46,330][25689] Fps is (10 sec: 5665.8, 60 sec: 5533.9, 300 sec: 5532.3). Total num frames: 792841216. Throughput: 0: 4958.0. Samples: 792837256. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:46,331][25689] Avg episode reward: [(0, '-1.380')] [2022-07-10 15:04:46,887][26022] Updated weights on worker 0-0, policy_version 774263 (0.00094) [2022-07-10 15:04:48,962][26022] Updated weights on worker 0-0, policy_version 774273 (0.00089) [2022-07-10 15:04:50,546][26022] Updated weights on worker 0-0, policy_version 774283 (0.00097) [2022-07-10 15:04:51,343][25689] Fps is (10 sec: 5698.0, 60 sec: 5550.9, 300 sec: 5533.0). Total num frames: 792869888. Throughput: 0: 5786.5. Samples: 792870718. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:51,343][25689] Avg episode reward: [(0, '-0.974')] [2022-07-10 15:04:52,694][26022] Updated weights on worker 0-0, policy_version 774293 (0.00107) [2022-07-10 15:04:54,083][26022] Updated weights on worker 0-0, policy_version 774303 (0.00085) [2022-07-10 15:04:56,452][25689] Fps is (10 sec: 5462.9, 60 sec: 5511.5, 300 sec: 5524.8). Total num frames: 792896512. Throughput: 0: 5800.8. Samples: 792904442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:04:56,453][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 15:04:56,460][26022] Updated weights on worker 0-0, policy_version 774313 (0.00091) [2022-07-10 15:04:58,046][26022] Updated weights on worker 0-0, policy_version 774323 (0.00090) [2022-07-10 15:04:59,880][26022] Updated weights on worker 0-0, policy_version 774333 (0.00088) [2022-07-10 15:05:01,507][25689] Fps is (10 sec: 5440.4, 60 sec: 5540.8, 300 sec: 5537.9). Total num frames: 792925184. Throughput: 0: 4972.2. Samples: 792921020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:01,507][25689] Avg episode reward: [(0, '-0.300')] [2022-07-10 15:05:01,929][26022] Updated weights on worker 0-0, policy_version 774343 (0.00079) [2022-07-10 15:05:04,108][26022] Updated weights on worker 0-0, policy_version 774353 (0.00092) [2022-07-10 15:05:05,775][26022] Updated weights on worker 0-0, policy_version 774363 (0.00087) [2022-07-10 15:05:06,546][25689] Fps is (10 sec: 5478.4, 60 sec: 5556.9, 300 sec: 5527.0). Total num frames: 792951808. Throughput: 0: 5688.9. Samples: 792952262. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:06,546][25689] Avg episode reward: [(0, '-0.477')] [2022-07-10 15:05:07,986][26022] Updated weights on worker 0-0, policy_version 774373 (0.00087) [2022-07-10 15:05:09,417][26022] Updated weights on worker 0-0, policy_version 774383 (0.00054) [2022-07-10 15:05:11,563][25689] Fps is (10 sec: 5193.2, 60 sec: 5505.7, 300 sec: 5525.8). Total num frames: 792977408. Throughput: 0: 5660.1. Samples: 792985166. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:11,563][25689] Avg episode reward: [(0, '-0.329')] [2022-07-10 15:05:11,621][26022] Updated weights on worker 0-0, policy_version 774393 (0.00079) [2022-07-10 15:05:13,146][26022] Updated weights on worker 0-0, policy_version 774403 (0.00057) [2022-07-10 15:05:15,066][26022] Updated weights on worker 0-0, policy_version 774413 (0.00077) [2022-07-10 15:05:16,596][25689] Fps is (10 sec: 5603.7, 60 sec: 5529.4, 300 sec: 5529.4). Total num frames: 793008128. Throughput: 0: 4849.4. Samples: 793002128. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:16,597][25689] Avg episode reward: [(0, '-1.437')] [2022-07-10 15:05:16,755][26022] Updated weights on worker 0-0, policy_version 774423 (0.00090) [2022-07-10 15:05:18,848][26022] Updated weights on worker 0-0, policy_version 774433 (0.00089) [2022-07-10 15:05:20,429][26022] Updated weights on worker 0-0, policy_version 774443 (0.00096) [2022-07-10 15:05:21,628][25689] Fps is (10 sec: 5798.9, 60 sec: 5528.3, 300 sec: 5529.6). Total num frames: 793035776. Throughput: 0: 5693.7. Samples: 793035586. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:21,629][25689] Avg episode reward: [(0, '-1.227')] [2022-07-10 15:05:22,480][26022] Updated weights on worker 0-0, policy_version 774453 (0.00087) [2022-07-10 15:05:24,178][26022] Updated weights on worker 0-0, policy_version 774463 (0.00097) [2022-07-10 15:05:26,055][26022] Updated weights on worker 0-0, policy_version 774473 (0.00083) [2022-07-10 15:05:26,645][25689] Fps is (10 sec: 5400.8, 60 sec: 5529.0, 300 sec: 5522.7). Total num frames: 793062400. Throughput: 0: 5796.1. Samples: 793068760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:26,646][25689] Avg episode reward: [(0, '-0.781')] [2022-07-10 15:05:28,208][26022] Updated weights on worker 0-0, policy_version 774483 (0.00092) [2022-07-10 15:05:29,818][26022] Updated weights on worker 0-0, policy_version 774493 (0.00090) [2022-07-10 15:05:31,670][25689] Fps is (10 sec: 5404.7, 60 sec: 5493.1, 300 sec: 5520.2). Total num frames: 793090048. Throughput: 0: 4986.9. Samples: 793085438. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:31,671][25689] Avg episode reward: [(0, '-0.845')] [2022-07-10 15:05:31,774][26022] Updated weights on worker 0-0, policy_version 774503 (0.00094) [2022-07-10 15:05:33,372][26022] Updated weights on worker 0-0, policy_version 774513 (0.00084) [2022-07-10 15:05:35,336][26022] Updated weights on worker 0-0, policy_version 774523 (0.00092) [2022-07-10 15:05:36,801][25689] Fps is (10 sec: 5545.8, 60 sec: 5526.3, 300 sec: 5525.9). Total num frames: 793118720. Throughput: 0: 5791.7. Samples: 793119144. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:36,802][25689] Avg episode reward: [(0, '-0.856')] [2022-07-10 15:05:37,027][26022] Updated weights on worker 0-0, policy_version 774533 (0.00086) [2022-07-10 15:05:38,953][26022] Updated weights on worker 0-0, policy_version 774543 (0.00081) [2022-07-10 15:05:40,939][26022] Updated weights on worker 0-0, policy_version 774553 (0.00084) [2022-07-10 15:05:41,823][25689] Fps is (10 sec: 5547.0, 60 sec: 5513.7, 300 sec: 5522.9). Total num frames: 793146368. Throughput: 0: 5783.3. Samples: 793152380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:41,824][25689] Avg episode reward: [(0, '-0.128')] [2022-07-10 15:05:42,823][26022] Updated weights on worker 0-0, policy_version 774563 (0.00087) [2022-07-10 15:05:44,506][26022] Updated weights on worker 0-0, policy_version 774573 (0.01517) [2022-07-10 15:05:46,467][26022] Updated weights on worker 0-0, policy_version 774583 (0.00086) [2022-07-10 15:05:46,839][25689] Fps is (10 sec: 5610.6, 60 sec: 5517.0, 300 sec: 5522.7). Total num frames: 793175040. Throughput: 0: 4967.8. Samples: 793169078. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:46,839][25689] Avg episode reward: [(0, '-0.505')] [2022-07-10 15:05:48,223][26022] Updated weights on worker 0-0, policy_version 774593 (0.00106) [2022-07-10 15:05:50,153][26022] Updated weights on worker 0-0, policy_version 774603 (0.00090) [2022-07-10 15:05:51,862][25689] Fps is (10 sec: 5610.5, 60 sec: 5499.1, 300 sec: 5526.9). Total num frames: 793202688. Throughput: 0: 5807.6. Samples: 793202702. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:51,862][25689] Avg episode reward: [(0, '-0.491')] [2022-07-10 15:05:52,060][26022] Updated weights on worker 0-0, policy_version 774613 (0.00086) [2022-07-10 15:05:53,792][26022] Updated weights on worker 0-0, policy_version 774623 (0.00091) [2022-07-10 15:05:55,783][26022] Updated weights on worker 0-0, policy_version 774633 (0.00087) [2022-07-10 15:05:56,952][25689] Fps is (10 sec: 5670.3, 60 sec: 5551.7, 300 sec: 5525.6). Total num frames: 793232384. Throughput: 0: 5797.5. Samples: 793235970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:05:56,952][25689] Avg episode reward: [(0, '-0.105')] [2022-07-10 15:05:57,300][26022] Updated weights on worker 0-0, policy_version 774643 (0.00093) [2022-07-10 15:05:59,471][26022] Updated weights on worker 0-0, policy_version 774653 (0.00089) [2022-07-10 15:06:01,026][26022] Updated weights on worker 0-0, policy_version 774663 (0.00087) [2022-07-10 15:06:01,978][25689] Fps is (10 sec: 5364.7, 60 sec: 5486.5, 300 sec: 5528.8). Total num frames: 793256960. Throughput: 0: 4969.5. Samples: 793252540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:01,979][25689] Avg episode reward: [(0, '-0.058')] [2022-07-10 15:06:03,333][26022] Updated weights on worker 0-0, policy_version 774673 (0.00087) [2022-07-10 15:06:05,172][26022] Updated weights on worker 0-0, policy_version 774683 (0.00092) [2022-07-10 15:06:06,837][26022] Updated weights on worker 0-0, policy_version 774693 (0.00092) [2022-07-10 15:06:06,989][25689] Fps is (10 sec: 5304.8, 60 sec: 5522.9, 300 sec: 5525.5). Total num frames: 793285632. Throughput: 0: 5709.1. Samples: 793284120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:06,992][25689] Avg episode reward: [(0, '0.240')] [2022-07-10 15:06:09,009][26022] Updated weights on worker 0-0, policy_version 774703 (0.00089) [2022-07-10 15:06:10,601][26022] Updated weights on worker 0-0, policy_version 774713 (0.00084) [2022-07-10 15:06:12,022][25689] Fps is (10 sec: 5505.2, 60 sec: 5538.4, 300 sec: 5522.4). Total num frames: 793312256. Throughput: 0: 5687.3. Samples: 793317362. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:12,024][25689] Avg episode reward: [(0, '0.331')] [2022-07-10 15:06:12,671][26022] Updated weights on worker 0-0, policy_version 774723 (0.00084) [2022-07-10 15:06:14,358][26022] Updated weights on worker 0-0, policy_version 774733 (0.00082) [2022-07-10 15:06:16,143][26022] Updated weights on worker 0-0, policy_version 774743 (0.00090) [2022-07-10 15:06:17,062][25689] Fps is (10 sec: 5591.3, 60 sec: 5520.9, 300 sec: 5525.2). Total num frames: 793341952. Throughput: 0: 5709.4. Samples: 793350788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:17,063][25689] Avg episode reward: [(0, '0.613')] [2022-07-10 15:06:18,312][26022] Updated weights on worker 0-0, policy_version 774753 (0.00057) [2022-07-10 15:06:19,771][26022] Updated weights on worker 0-0, policy_version 774763 (0.00084) [2022-07-10 15:06:21,861][26022] Updated weights on worker 0-0, policy_version 774773 (0.00092) [2022-07-10 15:06:22,158][25689] Fps is (10 sec: 5556.9, 60 sec: 5498.2, 300 sec: 5523.6). Total num frames: 793368576. Throughput: 0: 5698.8. Samples: 793367538. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:22,158][25689] Avg episode reward: [(0, '-0.387')] [2022-07-10 15:06:23,457][26022] Updated weights on worker 0-0, policy_version 774783 (0.00086) [2022-07-10 15:06:25,429][26022] Updated weights on worker 0-0, policy_version 774793 (0.00295) [2022-07-10 15:06:27,177][25689] Fps is (10 sec: 5466.9, 60 sec: 5531.8, 300 sec: 5523.3). Total num frames: 793397248. Throughput: 0: 5790.4. Samples: 793401014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:27,177][25689] Avg episode reward: [(0, '-0.623')] [2022-07-10 15:06:27,322][26022] Updated weights on worker 0-0, policy_version 774803 (0.00083) [2022-07-10 15:06:29,188][26022] Updated weights on worker 0-0, policy_version 774813 (0.00086) [2022-07-10 15:06:30,984][26022] Updated weights on worker 0-0, policy_version 774823 (0.00093) [2022-07-10 15:06:32,204][25689] Fps is (10 sec: 5708.0, 60 sec: 5548.5, 300 sec: 5527.8). Total num frames: 793425920. Throughput: 0: 5795.1. Samples: 793434316. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:32,205][25689] Avg episode reward: [(0, '-1.369')] [2022-07-10 15:06:32,827][26022] Updated weights on worker 0-0, policy_version 774833 (0.00089) [2022-07-10 15:06:34,759][26022] Updated weights on worker 0-0, policy_version 774843 (0.00088) [2022-07-10 15:06:36,310][26022] Updated weights on worker 0-0, policy_version 774853 (0.00100) [2022-07-10 15:06:37,340][25689] Fps is (10 sec: 5541.5, 60 sec: 5531.1, 300 sec: 5526.9). Total num frames: 793453568. Throughput: 0: 4947.5. Samples: 793451114. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:37,341][25689] Avg episode reward: [(0, '-1.980')] [2022-07-10 15:06:38,318][26022] Updated weights on worker 0-0, policy_version 774863 (0.00086) [2022-07-10 15:06:40,115][26022] Updated weights on worker 0-0, policy_version 774873 (0.00091) [2022-07-10 15:06:40,657][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:06:40,677][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000774876_793473024.pth [2022-07-10 15:06:40,682][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000772932_791482368.pth [2022-07-10 15:06:42,172][26022] Updated weights on worker 0-0, policy_version 774883 (0.00069) [2022-07-10 15:06:42,440][25689] Fps is (10 sec: 5402.2, 60 sec: 5524.0, 300 sec: 5522.0). Total num frames: 793481216. Throughput: 0: 5765.3. Samples: 793484472. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:42,440][25689] Avg episode reward: [(0, '-2.450')] [2022-07-10 15:06:43,801][26022] Updated weights on worker 0-0, policy_version 774893 (0.00090) [2022-07-10 15:06:45,953][26022] Updated weights on worker 0-0, policy_version 774903 (0.00085) [2022-07-10 15:06:47,294][26022] Updated weights on worker 0-0, policy_version 774913 (0.00087) [2022-07-10 15:06:47,446][25689] Fps is (10 sec: 5674.5, 60 sec: 5541.8, 300 sec: 5526.4). Total num frames: 793510912. Throughput: 0: 5770.1. Samples: 793517968. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:47,446][25689] Avg episode reward: [(0, '-2.213')] [2022-07-10 15:06:49,305][26022] Updated weights on worker 0-0, policy_version 774923 (0.00086) [2022-07-10 15:06:51,005][26022] Updated weights on worker 0-0, policy_version 774933 (0.00088) [2022-07-10 15:06:52,498][25689] Fps is (10 sec: 5599.3, 60 sec: 5522.2, 300 sec: 5519.7). Total num frames: 793537536. Throughput: 0: 4960.9. Samples: 793534994. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:52,499][25689] Avg episode reward: [(0, '-1.428')] [2022-07-10 15:06:52,978][26022] Updated weights on worker 0-0, policy_version 774943 (0.00088) [2022-07-10 15:06:54,788][26022] Updated weights on worker 0-0, policy_version 774953 (0.00088) [2022-07-10 15:06:56,606][26022] Updated weights on worker 0-0, policy_version 774963 (0.00097) [2022-07-10 15:06:57,555][25689] Fps is (10 sec: 5571.1, 60 sec: 5525.3, 300 sec: 5529.6). Total num frames: 793567232. Throughput: 0: 5806.8. Samples: 793568498. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:06:57,555][25689] Avg episode reward: [(0, '-1.314')] [2022-07-10 15:06:58,466][26022] Updated weights on worker 0-0, policy_version 774973 (0.00100) [2022-07-10 15:07:00,374][26022] Updated weights on worker 0-0, policy_version 774983 (0.00094) [2022-07-10 15:07:02,591][25689] Fps is (10 sec: 5478.8, 60 sec: 5541.3, 300 sec: 5526.0). Total num frames: 793592832. Throughput: 0: 5712.4. Samples: 793599584. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 15:07:02,591][25689] Avg episode reward: [(0, '0.071')] [2022-07-10 15:07:02,592][26022] Updated weights on worker 0-0, policy_version 774993 (0.00110) [2022-07-10 15:07:04,321][26022] Updated weights on worker 0-0, policy_version 775003 (0.00088) [2022-07-10 15:07:06,198][26022] Updated weights on worker 0-0, policy_version 775013 (0.00086) [2022-07-10 15:07:07,636][25689] Fps is (10 sec: 5281.9, 60 sec: 5521.3, 300 sec: 5528.6). Total num frames: 793620480. Throughput: 0: 4872.6. Samples: 793616348. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:07,637][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 15:07:07,911][26022] Updated weights on worker 0-0, policy_version 775023 (0.00097) [2022-07-10 15:07:09,911][26022] Updated weights on worker 0-0, policy_version 775033 (0.00093) [2022-07-10 15:07:11,528][26022] Updated weights on worker 0-0, policy_version 775043 (0.00087) [2022-07-10 15:07:12,672][25689] Fps is (10 sec: 5485.3, 60 sec: 5537.9, 300 sec: 5526.0). Total num frames: 793648128. Throughput: 0: 5704.5. Samples: 793650074. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:12,673][25689] Avg episode reward: [(0, '0.086')] [2022-07-10 15:07:13,538][26022] Updated weights on worker 0-0, policy_version 775053 (0.00098) [2022-07-10 15:07:15,443][26022] Updated weights on worker 0-0, policy_version 775063 (0.00086) [2022-07-10 15:07:17,120][26022] Updated weights on worker 0-0, policy_version 775073 (0.00092) [2022-07-10 15:07:17,741][25689] Fps is (10 sec: 5674.6, 60 sec: 5535.2, 300 sec: 5529.2). Total num frames: 793677824. Throughput: 0: 5720.4. Samples: 793683972. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:17,742][25689] Avg episode reward: [(0, '-0.446')] [2022-07-10 15:07:18,935][26022] Updated weights on worker 0-0, policy_version 775083 (0.00090) [2022-07-10 15:07:20,627][26022] Updated weights on worker 0-0, policy_version 775093 (0.00086) [2022-07-10 15:07:22,451][26022] Updated weights on worker 0-0, policy_version 775103 (0.00096) [2022-07-10 15:07:22,780][25689] Fps is (10 sec: 5875.8, 60 sec: 5591.1, 300 sec: 5536.3). Total num frames: 793707520. Throughput: 0: 5017.4. Samples: 793700878. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:22,780][25689] Avg episode reward: [(0, '-0.715')] [2022-07-10 15:07:24,588][26022] Updated weights on worker 0-0, policy_version 775113 (0.00085) [2022-07-10 15:07:26,108][26022] Updated weights on worker 0-0, policy_version 775123 (0.00092) [2022-07-10 15:07:27,881][25689] Fps is (10 sec: 5453.5, 60 sec: 5532.9, 300 sec: 5521.1). Total num frames: 793733120. Throughput: 0: 5829.4. Samples: 793734364. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:27,881][25689] Avg episode reward: [(0, '-1.246')] [2022-07-10 15:07:28,134][26022] Updated weights on worker 0-0, policy_version 775133 (0.00089) [2022-07-10 15:07:29,982][26022] Updated weights on worker 0-0, policy_version 775143 (0.00082) [2022-07-10 15:07:31,751][26022] Updated weights on worker 0-0, policy_version 775153 (0.00095) [2022-07-10 15:07:32,884][25689] Fps is (10 sec: 5472.6, 60 sec: 5552.0, 300 sec: 5533.6). Total num frames: 793762816. Throughput: 0: 5824.0. Samples: 793767788. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:32,884][25689] Avg episode reward: [(0, '-1.520')] [2022-07-10 15:07:33,899][26022] Updated weights on worker 0-0, policy_version 775163 (0.00092) [2022-07-10 15:07:35,299][26022] Updated weights on worker 0-0, policy_version 775173 (0.00090) [2022-07-10 15:07:37,495][26022] Updated weights on worker 0-0, policy_version 775183 (0.00078) [2022-07-10 15:07:37,933][25689] Fps is (10 sec: 5603.0, 60 sec: 5543.1, 300 sec: 5529.2). Total num frames: 793789440. Throughput: 0: 4981.9. Samples: 793784564. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:37,934][25689] Avg episode reward: [(0, '-2.298')] [2022-07-10 15:07:38,887][26022] Updated weights on worker 0-0, policy_version 775193 (0.00084) [2022-07-10 15:07:41,132][26022] Updated weights on worker 0-0, policy_version 775203 (0.00087) [2022-07-10 15:07:42,851][26022] Updated weights on worker 0-0, policy_version 775213 (0.00086) [2022-07-10 15:07:42,943][25689] Fps is (10 sec: 5497.2, 60 sec: 5568.3, 300 sec: 5530.1). Total num frames: 793818112. Throughput: 0: 5814.2. Samples: 793818110. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:42,943][25689] Avg episode reward: [(0, '-1.810')] [2022-07-10 15:07:44,760][26022] Updated weights on worker 0-0, policy_version 775223 (0.00092) [2022-07-10 15:07:46,591][26022] Updated weights on worker 0-0, policy_version 775233 (0.00090) [2022-07-10 15:07:47,948][25689] Fps is (10 sec: 5623.5, 60 sec: 5534.5, 300 sec: 5533.5). Total num frames: 793845760. Throughput: 0: 5834.9. Samples: 793851452. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:47,948][25689] Avg episode reward: [(0, '-0.852')] [2022-07-10 15:07:48,325][26022] Updated weights on worker 0-0, policy_version 775243 (0.00080) [2022-07-10 15:07:50,148][26022] Updated weights on worker 0-0, policy_version 775253 (0.00080) [2022-07-10 15:07:52,053][26022] Updated weights on worker 0-0, policy_version 775263 (0.00093) [2022-07-10 15:07:52,969][25689] Fps is (10 sec: 5617.0, 60 sec: 5571.2, 300 sec: 5531.7). Total num frames: 793874432. Throughput: 0: 5006.7. Samples: 793868350. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:52,970][25689] Avg episode reward: [(0, '-0.269')] [2022-07-10 15:07:53,743][26022] Updated weights on worker 0-0, policy_version 775273 (0.00083) [2022-07-10 15:07:55,639][26022] Updated weights on worker 0-0, policy_version 775283 (0.00082) [2022-07-10 15:07:57,539][26022] Updated weights on worker 0-0, policy_version 775293 (0.00090) [2022-07-10 15:07:58,103][25689] Fps is (10 sec: 5546.1, 60 sec: 5530.3, 300 sec: 5533.2). Total num frames: 793902080. Throughput: 0: 5813.7. Samples: 793901826. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:07:58,104][25689] Avg episode reward: [(0, '-1.607')] [2022-07-10 15:07:59,299][26022] Updated weights on worker 0-0, policy_version 775303 (0.00087) [2022-07-10 15:08:01,277][26022] Updated weights on worker 0-0, policy_version 775313 (0.00093) [2022-07-10 15:08:03,147][25689] Fps is (10 sec: 5433.0, 60 sec: 5563.4, 300 sec: 5536.9). Total num frames: 793929728. Throughput: 0: 5737.2. Samples: 793934028. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:03,148][25689] Avg episode reward: [(0, '-0.856')] [2022-07-10 15:08:03,387][26022] Updated weights on worker 0-0, policy_version 775323 (0.00088) [2022-07-10 15:08:05,222][26022] Updated weights on worker 0-0, policy_version 775333 (0.00099) [2022-07-10 15:08:07,039][26022] Updated weights on worker 0-0, policy_version 775343 (0.00089) [2022-07-10 15:08:08,227][25689] Fps is (10 sec: 5360.7, 60 sec: 5543.3, 300 sec: 5535.8). Total num frames: 793956352. Throughput: 0: 4857.6. Samples: 793949960. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:08,227][25689] Avg episode reward: [(0, '-0.737')] [2022-07-10 15:08:08,882][26022] Updated weights on worker 0-0, policy_version 775353 (0.00091) [2022-07-10 15:08:10,830][26022] Updated weights on worker 0-0, policy_version 775363 (0.00091) [2022-07-10 15:08:12,689][26022] Updated weights on worker 0-0, policy_version 775373 (0.00613) [2022-07-10 15:08:13,234][25689] Fps is (10 sec: 5380.6, 60 sec: 5545.9, 300 sec: 5529.7). Total num frames: 793984000. Throughput: 0: 5675.5. Samples: 793983360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:13,234][25689] Avg episode reward: [(0, '-0.740')] [2022-07-10 15:08:14,499][26022] Updated weights on worker 0-0, policy_version 775383 (0.00082) [2022-07-10 15:08:16,496][26022] Updated weights on worker 0-0, policy_version 775393 (0.00091) [2022-07-10 15:08:18,148][26022] Updated weights on worker 0-0, policy_version 775403 (0.00085) [2022-07-10 15:08:18,372][25689] Fps is (10 sec: 5652.7, 60 sec: 5539.7, 300 sec: 5534.4). Total num frames: 794013696. Throughput: 0: 5676.1. Samples: 794016874. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:18,372][25689] Avg episode reward: [(0, '-1.228')] [2022-07-10 15:08:19,941][26022] Updated weights on worker 0-0, policy_version 775413 (0.00084) [2022-07-10 15:08:21,972][26022] Updated weights on worker 0-0, policy_version 775423 (0.00091) [2022-07-10 15:08:23,374][25689] Fps is (10 sec: 5655.3, 60 sec: 5509.2, 300 sec: 5534.6). Total num frames: 794041344. Throughput: 0: 5759.2. Samples: 794050516. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:23,374][25689] Avg episode reward: [(0, '-1.326')] [2022-07-10 15:08:23,639][26022] Updated weights on worker 0-0, policy_version 775433 (0.00088) [2022-07-10 15:08:25,615][26022] Updated weights on worker 0-0, policy_version 775443 (0.00093) [2022-07-10 15:08:27,397][26022] Updated weights on worker 0-0, policy_version 775453 (0.00082) [2022-07-10 15:08:28,416][25689] Fps is (10 sec: 5505.3, 60 sec: 5548.4, 300 sec: 5527.6). Total num frames: 794068992. Throughput: 0: 5801.3. Samples: 794067080. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:28,416][25689] Avg episode reward: [(0, '0.031')] [2022-07-10 15:08:29,283][26022] Updated weights on worker 0-0, policy_version 775463 (0.00078) [2022-07-10 15:08:30,975][26022] Updated weights on worker 0-0, policy_version 775473 (0.00564) [2022-07-10 15:08:32,928][26022] Updated weights on worker 0-0, policy_version 775483 (0.00091) [2022-07-10 15:08:33,423][25689] Fps is (10 sec: 5502.3, 60 sec: 5514.2, 300 sec: 5532.0). Total num frames: 794096640. Throughput: 0: 5792.6. Samples: 794100310. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:33,424][25689] Avg episode reward: [(0, '-1.329')] [2022-07-10 15:08:34,883][26022] Updated weights on worker 0-0, policy_version 775493 (0.00092) [2022-07-10 15:08:36,711][26022] Updated weights on worker 0-0, policy_version 775503 (0.00086) [2022-07-10 15:08:38,490][25689] Fps is (10 sec: 5590.6, 60 sec: 5546.4, 300 sec: 5531.1). Total num frames: 794125312. Throughput: 0: 5814.9. Samples: 794133858. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:38,490][25689] Avg episode reward: [(0, '-1.165')] [2022-07-10 15:08:38,491][26022] Updated weights on worker 0-0, policy_version 775513 (0.00095) [2022-07-10 15:08:40,319][26022] Updated weights on worker 0-0, policy_version 775523 (0.00089) [2022-07-10 15:08:40,697][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:08:40,706][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000775526_794138624.pth [2022-07-10 15:08:40,706][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000773579_792144896.pth [2022-07-10 15:08:42,177][26022] Updated weights on worker 0-0, policy_version 775533 (0.00086) [2022-07-10 15:08:43,573][25689] Fps is (10 sec: 5548.6, 60 sec: 5522.7, 300 sec: 5530.1). Total num frames: 794152960. Throughput: 0: 4949.5. Samples: 794150496. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:43,574][25689] Avg episode reward: [(0, '-1.304')] [2022-07-10 15:08:43,875][26022] Updated weights on worker 0-0, policy_version 775543 (0.00081) [2022-07-10 15:08:45,885][26022] Updated weights on worker 0-0, policy_version 775553 (0.00092) [2022-07-10 15:08:47,473][26022] Updated weights on worker 0-0, policy_version 775563 (0.00090) [2022-07-10 15:08:48,584][25689] Fps is (10 sec: 5579.3, 60 sec: 5539.1, 300 sec: 5533.6). Total num frames: 794181632. Throughput: 0: 5805.7. Samples: 794184172. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:48,585][25689] Avg episode reward: [(0, '-1.300')] [2022-07-10 15:08:49,511][26022] Updated weights on worker 0-0, policy_version 775573 (0.00088) [2022-07-10 15:08:51,212][26022] Updated weights on worker 0-0, policy_version 775583 (0.01151) [2022-07-10 15:08:53,266][26022] Updated weights on worker 0-0, policy_version 775593 (0.00087) [2022-07-10 15:08:53,603][25689] Fps is (10 sec: 5615.6, 60 sec: 5522.5, 300 sec: 5530.7). Total num frames: 794209280. Throughput: 0: 5802.3. Samples: 794217398. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:53,603][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 15:08:55,058][26022] Updated weights on worker 0-0, policy_version 775603 (0.00093) [2022-07-10 15:08:56,864][26022] Updated weights on worker 0-0, policy_version 775613 (0.00087) [2022-07-10 15:08:58,653][25689] Fps is (10 sec: 5491.5, 60 sec: 5530.0, 300 sec: 5533.3). Total num frames: 794236928. Throughput: 0: 4959.6. Samples: 794233862. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:08:58,654][25689] Avg episode reward: [(0, '-1.788')] [2022-07-10 15:08:58,654][26022] Updated weights on worker 0-0, policy_version 775623 (0.00081) [2022-07-10 15:09:00,719][26022] Updated weights on worker 0-0, policy_version 775633 (0.00084) [2022-07-10 15:09:02,842][26022] Updated weights on worker 0-0, policy_version 775643 (0.00081) [2022-07-10 15:09:03,663][25689] Fps is (10 sec: 5394.7, 60 sec: 5516.3, 300 sec: 5537.1). Total num frames: 794263552. Throughput: 0: 5711.5. Samples: 794265236. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:03,664][25689] Avg episode reward: [(0, '-1.467')] [2022-07-10 15:09:04,671][26022] Updated weights on worker 0-0, policy_version 775653 (0.00092) [2022-07-10 15:09:06,543][26022] Updated weights on worker 0-0, policy_version 775663 (0.00088) [2022-07-10 15:09:08,239][26022] Updated weights on worker 0-0, policy_version 775673 (0.00092) [2022-07-10 15:09:08,724][25689] Fps is (10 sec: 5491.0, 60 sec: 5551.9, 300 sec: 5536.2). Total num frames: 794292224. Throughput: 0: 5681.6. Samples: 794298596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:08,724][25689] Avg episode reward: [(0, '-1.475')] [2022-07-10 15:09:10,247][26022] Updated weights on worker 0-0, policy_version 775683 (0.00093) [2022-07-10 15:09:11,799][26022] Updated weights on worker 0-0, policy_version 775693 (0.00087) [2022-07-10 15:09:13,778][25689] Fps is (10 sec: 5466.9, 60 sec: 5530.7, 300 sec: 5526.9). Total num frames: 794318848. Throughput: 0: 4844.6. Samples: 794315138. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:13,780][25689] Avg episode reward: [(0, '-1.651')] [2022-07-10 15:09:13,843][26022] Updated weights on worker 0-0, policy_version 775703 (0.00086) [2022-07-10 15:09:15,702][26022] Updated weights on worker 0-0, policy_version 775713 (0.00088) [2022-07-10 15:09:17,453][26022] Updated weights on worker 0-0, policy_version 775723 (0.00097) [2022-07-10 15:09:18,852][25689] Fps is (10 sec: 5459.3, 60 sec: 5519.5, 300 sec: 5529.3). Total num frames: 794347520. Throughput: 0: 5694.3. Samples: 794348880. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:18,854][25689] Avg episode reward: [(0, '-1.357')] [2022-07-10 15:09:19,423][26022] Updated weights on worker 0-0, policy_version 775733 (0.00099) [2022-07-10 15:09:21,066][26022] Updated weights on worker 0-0, policy_version 775743 (0.00086) [2022-07-10 15:09:23,151][26022] Updated weights on worker 0-0, policy_version 775753 (0.00089) [2022-07-10 15:09:23,862][25689] Fps is (10 sec: 5584.7, 60 sec: 5518.8, 300 sec: 5533.0). Total num frames: 794375168. Throughput: 0: 5795.9. Samples: 794382310. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:23,863][25689] Avg episode reward: [(0, '-1.258')] [2022-07-10 15:09:24,919][26022] Updated weights on worker 0-0, policy_version 775763 (0.00095) [2022-07-10 15:09:26,845][26022] Updated weights on worker 0-0, policy_version 775773 (0.00088) [2022-07-10 15:09:28,571][26022] Updated weights on worker 0-0, policy_version 775783 (0.00088) [2022-07-10 15:09:28,869][25689] Fps is (10 sec: 5622.7, 60 sec: 5539.0, 300 sec: 5529.5). Total num frames: 794403840. Throughput: 0: 4980.8. Samples: 794398936. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:28,869][25689] Avg episode reward: [(0, '-0.857')] [2022-07-10 15:09:30,501][26022] Updated weights on worker 0-0, policy_version 775793 (0.00090) [2022-07-10 15:09:32,138][26022] Updated weights on worker 0-0, policy_version 775803 (0.00084) [2022-07-10 15:09:33,878][25689] Fps is (10 sec: 5418.7, 60 sec: 5504.9, 300 sec: 5528.2). Total num frames: 794429440. Throughput: 0: 5836.4. Samples: 794432452. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:33,880][25689] Avg episode reward: [(0, '-0.574')] [2022-07-10 15:09:34,301][26022] Updated weights on worker 0-0, policy_version 775813 (0.00093) [2022-07-10 15:09:35,863][26022] Updated weights on worker 0-0, policy_version 775823 (0.00083) [2022-07-10 15:09:38,063][26022] Updated weights on worker 0-0, policy_version 775833 (0.00083) [2022-07-10 15:09:39,000][25689] Fps is (10 sec: 5458.0, 60 sec: 5516.8, 300 sec: 5530.6). Total num frames: 794459136. Throughput: 0: 5784.7. Samples: 794465428. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:39,001][25689] Avg episode reward: [(0, '-1.425')] [2022-07-10 15:09:39,497][26022] Updated weights on worker 0-0, policy_version 775843 (0.00082) [2022-07-10 15:09:41,750][26022] Updated weights on worker 0-0, policy_version 775853 (0.00096) [2022-07-10 15:09:43,258][26022] Updated weights on worker 0-0, policy_version 775863 (0.00087) [2022-07-10 15:09:44,008][25689] Fps is (10 sec: 5660.8, 60 sec: 5523.7, 300 sec: 5528.0). Total num frames: 794486784. Throughput: 0: 4960.9. Samples: 794482250. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:44,009][25689] Avg episode reward: [(0, '-0.698')] [2022-07-10 15:09:45,161][26022] Updated weights on worker 0-0, policy_version 775873 (0.00092) [2022-07-10 15:09:47,057][26022] Updated weights on worker 0-0, policy_version 775883 (0.00095) [2022-07-10 15:09:48,650][26022] Updated weights on worker 0-0, policy_version 775893 (0.00091) [2022-07-10 15:09:49,023][25689] Fps is (10 sec: 5517.2, 60 sec: 5506.4, 300 sec: 5524.5). Total num frames: 794514432. Throughput: 0: 5797.4. Samples: 794515776. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:49,023][25689] Avg episode reward: [(0, '-0.985')] [2022-07-10 15:09:50,764][26022] Updated weights on worker 0-0, policy_version 775903 (0.00086) [2022-07-10 15:09:52,592][26022] Updated weights on worker 0-0, policy_version 775913 (0.00084) [2022-07-10 15:09:54,043][25689] Fps is (10 sec: 5510.2, 60 sec: 5506.2, 300 sec: 5529.6). Total num frames: 794542080. Throughput: 0: 5782.1. Samples: 794549050. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:54,045][25689] Avg episode reward: [(0, '-0.858')] [2022-07-10 15:09:54,483][26022] Updated weights on worker 0-0, policy_version 775923 (0.00087) [2022-07-10 15:09:56,392][26022] Updated weights on worker 0-0, policy_version 775933 (0.00092) [2022-07-10 15:09:58,112][26022] Updated weights on worker 0-0, policy_version 775943 (0.00091) [2022-07-10 15:09:59,136][25689] Fps is (10 sec: 5568.9, 60 sec: 5519.3, 300 sec: 5528.9). Total num frames: 794570752. Throughput: 0: 4957.7. Samples: 794565256. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:09:59,137][25689] Avg episode reward: [(0, '-1.239')] [2022-07-10 15:10:00,057][26022] Updated weights on worker 0-0, policy_version 775953 (0.00088) [2022-07-10 15:10:02,291][26022] Updated weights on worker 0-0, policy_version 775963 (0.00091) [2022-07-10 15:10:04,155][25689] Fps is (10 sec: 5164.9, 60 sec: 5467.7, 300 sec: 5519.0). Total num frames: 794594304. Throughput: 0: 5660.7. Samples: 794596294. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:04,155][25689] Avg episode reward: [(0, '-0.989')] [2022-07-10 15:10:04,248][26022] Updated weights on worker 0-0, policy_version 775973 (0.00093) [2022-07-10 15:10:06,110][26022] Updated weights on worker 0-0, policy_version 775983 (0.00090) [2022-07-10 15:10:07,911][26022] Updated weights on worker 0-0, policy_version 775993 (0.00085) [2022-07-10 15:10:09,164][25689] Fps is (10 sec: 5310.2, 60 sec: 5489.4, 300 sec: 5532.9). Total num frames: 794624000. Throughput: 0: 5653.6. Samples: 794629644. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:09,164][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 15:10:09,575][26022] Updated weights on worker 0-0, policy_version 776003 (0.00087) [2022-07-10 15:10:11,525][26022] Updated weights on worker 0-0, policy_version 776013 (0.00090) [2022-07-10 15:10:13,343][26022] Updated weights on worker 0-0, policy_version 776023 (0.00092) [2022-07-10 15:10:14,188][25689] Fps is (10 sec: 5715.3, 60 sec: 5509.0, 300 sec: 5522.7). Total num frames: 794651648. Throughput: 0: 4830.6. Samples: 794646362. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:14,189][25689] Avg episode reward: [(0, '-0.122')] [2022-07-10 15:10:15,269][26022] Updated weights on worker 0-0, policy_version 776033 (0.00080) [2022-07-10 15:10:17,101][26022] Updated weights on worker 0-0, policy_version 776043 (0.00088) [2022-07-10 15:10:18,879][26022] Updated weights on worker 0-0, policy_version 776053 (0.00103) [2022-07-10 15:10:19,235][25689] Fps is (10 sec: 5592.1, 60 sec: 5511.5, 300 sec: 5525.9). Total num frames: 794680320. Throughput: 0: 5709.7. Samples: 794680014. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:19,235][25689] Avg episode reward: [(0, '-0.981')] [2022-07-10 15:10:20,722][26022] Updated weights on worker 0-0, policy_version 776063 (0.00435) [2022-07-10 15:10:22,541][26022] Updated weights on worker 0-0, policy_version 776073 (0.00082) [2022-07-10 15:10:24,311][25689] Fps is (10 sec: 5563.8, 60 sec: 5505.5, 300 sec: 5528.2). Total num frames: 794707968. Throughput: 0: 5817.0. Samples: 794713542. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:24,312][25689] Avg episode reward: [(0, '-0.298')] [2022-07-10 15:10:24,334][26022] Updated weights on worker 0-0, policy_version 776083 (0.00094) [2022-07-10 15:10:26,226][26022] Updated weights on worker 0-0, policy_version 776093 (0.00095) [2022-07-10 15:10:28,243][26022] Updated weights on worker 0-0, policy_version 776103 (0.00087) [2022-07-10 15:10:29,353][25689] Fps is (10 sec: 5465.0, 60 sec: 5485.3, 300 sec: 5527.9). Total num frames: 794735616. Throughput: 0: 4965.8. Samples: 794729900. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:29,355][25689] Avg episode reward: [(0, '0.050')] [2022-07-10 15:10:29,981][26022] Updated weights on worker 0-0, policy_version 776113 (0.00082) [2022-07-10 15:10:32,064][26022] Updated weights on worker 0-0, policy_version 776123 (0.00090) [2022-07-10 15:10:33,605][26022] Updated weights on worker 0-0, policy_version 776133 (0.00095) [2022-07-10 15:10:34,409][25689] Fps is (10 sec: 5577.1, 60 sec: 5531.8, 300 sec: 5529.3). Total num frames: 794764288. Throughput: 0: 5772.8. Samples: 794763094. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-10 15:10:34,410][25689] Avg episode reward: [(0, '-0.867')] [2022-07-10 15:10:35,804][26022] Updated weights on worker 0-0, policy_version 776143 (0.00091) [2022-07-10 15:10:37,238][26022] Updated weights on worker 0-0, policy_version 776153 (0.00087) [2022-07-10 15:10:39,332][26022] Updated weights on worker 0-0, policy_version 776163 (0.00083) [2022-07-10 15:10:39,473][25689] Fps is (10 sec: 5565.0, 60 sec: 5503.2, 300 sec: 5528.5). Total num frames: 794791936. Throughput: 0: 5745.3. Samples: 794796290. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:10:39,478][25689] Avg episode reward: [(0, '-0.718')] [2022-07-10 15:10:40,903][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:10:40,912][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000776172_794800128.pth [2022-07-10 15:10:40,913][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000774227_792808448.pth [2022-07-10 15:10:41,122][26022] Updated weights on worker 0-0, policy_version 776173 (0.00084) [2022-07-10 15:10:42,794][26022] Updated weights on worker 0-0, policy_version 776183 (0.00091) [2022-07-10 15:10:44,528][25689] Fps is (10 sec: 5464.8, 60 sec: 5499.0, 300 sec: 5524.4). Total num frames: 794819584. Throughput: 0: 5742.2. Samples: 794829632. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:10:44,528][25689] Avg episode reward: [(0, '-0.215')] [2022-07-10 15:10:44,829][26022] Updated weights on worker 0-0, policy_version 776193 (0.00094) [2022-07-10 15:10:46,502][26022] Updated weights on worker 0-0, policy_version 776203 (0.00083) [2022-07-10 15:10:48,487][26022] Updated weights on worker 0-0, policy_version 776213 (0.00092) [2022-07-10 15:10:49,580][25689] Fps is (10 sec: 5572.6, 60 sec: 5512.5, 300 sec: 5527.3). Total num frames: 794848256. Throughput: 0: 5744.7. Samples: 794846098. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:10:49,581][25689] Avg episode reward: [(0, '-0.620')] [2022-07-10 15:10:50,354][26022] Updated weights on worker 0-0, policy_version 776223 (0.00085) [2022-07-10 15:10:52,049][26022] Updated weights on worker 0-0, policy_version 776233 (0.00090) [2022-07-10 15:10:53,970][26022] Updated weights on worker 0-0, policy_version 776243 (0.00094) [2022-07-10 15:10:54,618][25689] Fps is (10 sec: 5581.6, 60 sec: 5510.9, 300 sec: 5521.3). Total num frames: 794875904. Throughput: 0: 5773.3. Samples: 794879766. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:10:54,619][25689] Avg episode reward: [(0, '-2.130')] [2022-07-10 15:10:55,874][26022] Updated weights on worker 0-0, policy_version 776253 (0.00089) [2022-07-10 15:10:57,563][26022] Updated weights on worker 0-0, policy_version 776263 (0.00086) [2022-07-10 15:10:59,607][26022] Updated weights on worker 0-0, policy_version 776273 (0.00098) [2022-07-10 15:10:59,686][25689] Fps is (10 sec: 5573.0, 60 sec: 5513.2, 300 sec: 5534.3). Total num frames: 794904576. Throughput: 0: 5796.1. Samples: 794913444. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:10:59,688][25689] Avg episode reward: [(0, '-2.234')] [2022-07-10 15:11:01,403][26022] Updated weights on worker 0-0, policy_version 776283 (0.00088) [2022-07-10 15:11:03,570][26022] Updated weights on worker 0-0, policy_version 776293 (0.00091) [2022-07-10 15:11:04,715][25689] Fps is (10 sec: 5375.3, 60 sec: 5546.1, 300 sec: 5523.7). Total num frames: 794930176. Throughput: 0: 4875.7. Samples: 794928058. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:04,715][25689] Avg episode reward: [(0, '-1.738')] [2022-07-10 15:11:05,147][26022] Updated weights on worker 0-0, policy_version 776303 (0.00086) [2022-07-10 15:11:07,271][26022] Updated weights on worker 0-0, policy_version 776313 (0.00096) [2022-07-10 15:11:09,012][26022] Updated weights on worker 0-0, policy_version 776323 (0.00085) [2022-07-10 15:11:09,721][25689] Fps is (10 sec: 5306.3, 60 sec: 5512.6, 300 sec: 5527.6). Total num frames: 794957824. Throughput: 0: 5727.1. Samples: 794961446. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:09,721][25689] Avg episode reward: [(0, '-1.855')] [2022-07-10 15:11:10,872][26022] Updated weights on worker 0-0, policy_version 776333 (0.00085) [2022-07-10 15:11:12,686][26022] Updated weights on worker 0-0, policy_version 776343 (0.00086) [2022-07-10 15:11:14,448][26022] Updated weights on worker 0-0, policy_version 776353 (0.00082) [2022-07-10 15:11:14,779][25689] Fps is (10 sec: 5697.8, 60 sec: 5543.3, 300 sec: 5527.3). Total num frames: 794987520. Throughput: 0: 5719.0. Samples: 794995066. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:14,779][25689] Avg episode reward: [(0, '-1.993')] [2022-07-10 15:11:16,488][26022] Updated weights on worker 0-0, policy_version 776363 (0.00103) [2022-07-10 15:11:18,099][26022] Updated weights on worker 0-0, policy_version 776373 (0.00086) [2022-07-10 15:11:19,882][25689] Fps is (10 sec: 5542.7, 60 sec: 5504.4, 300 sec: 5527.2). Total num frames: 795014144. Throughput: 0: 4867.1. Samples: 795011740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:19,882][25689] Avg episode reward: [(0, '-1.874')] [2022-07-10 15:11:20,216][26022] Updated weights on worker 0-0, policy_version 776383 (0.00116) [2022-07-10 15:11:21,752][26022] Updated weights on worker 0-0, policy_version 776393 (0.00091) [2022-07-10 15:11:23,754][26022] Updated weights on worker 0-0, policy_version 776403 (0.00093) [2022-07-10 15:11:24,904][25689] Fps is (10 sec: 5562.6, 60 sec: 5543.1, 300 sec: 5530.5). Total num frames: 795043840. Throughput: 0: 5814.2. Samples: 795045440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:24,904][25689] Avg episode reward: [(0, '-2.445')] [2022-07-10 15:11:25,520][26022] Updated weights on worker 0-0, policy_version 776413 (0.00095) [2022-07-10 15:11:27,475][26022] Updated weights on worker 0-0, policy_version 776423 (0.00087) [2022-07-10 15:11:29,156][26022] Updated weights on worker 0-0, policy_version 776433 (0.00100) [2022-07-10 15:11:29,923][25689] Fps is (10 sec: 5608.9, 60 sec: 5528.3, 300 sec: 5523.8). Total num frames: 795070464. Throughput: 0: 5817.5. Samples: 795078972. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:29,923][25689] Avg episode reward: [(0, '-3.042')] [2022-07-10 15:11:31,073][26022] Updated weights on worker 0-0, policy_version 776443 (0.00084) [2022-07-10 15:11:32,706][26022] Updated weights on worker 0-0, policy_version 776453 (0.00087) [2022-07-10 15:11:34,744][26022] Updated weights on worker 0-0, policy_version 776463 (0.00096) [2022-07-10 15:11:34,939][25689] Fps is (10 sec: 5510.3, 60 sec: 5532.0, 300 sec: 5529.5). Total num frames: 795099136. Throughput: 0: 4994.3. Samples: 795095750. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:34,939][25689] Avg episode reward: [(0, '-3.405')] [2022-07-10 15:11:36,573][26022] Updated weights on worker 0-0, policy_version 776473 (0.00087) [2022-07-10 15:11:38,283][26022] Updated weights on worker 0-0, policy_version 776483 (0.00090) [2022-07-10 15:11:39,980][25689] Fps is (10 sec: 5599.8, 60 sec: 5534.0, 300 sec: 5530.6). Total num frames: 795126784. Throughput: 0: 5840.8. Samples: 795129132. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:39,981][25689] Avg episode reward: [(0, '-3.128')] [2022-07-10 15:11:40,127][26022] Updated weights on worker 0-0, policy_version 776493 (0.00086) [2022-07-10 15:11:41,912][26022] Updated weights on worker 0-0, policy_version 776503 (0.00086) [2022-07-10 15:11:43,884][26022] Updated weights on worker 0-0, policy_version 776513 (0.00089) [2022-07-10 15:11:44,988][25689] Fps is (10 sec: 5706.3, 60 sec: 5572.2, 300 sec: 5530.5). Total num frames: 795156480. Throughput: 0: 5835.6. Samples: 795162644. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:44,988][25689] Avg episode reward: [(0, '-3.471')] [2022-07-10 15:11:45,964][26022] Updated weights on worker 0-0, policy_version 776523 (0.00092) [2022-07-10 15:11:47,647][26022] Updated weights on worker 0-0, policy_version 776533 (0.00085) [2022-07-10 15:11:49,492][26022] Updated weights on worker 0-0, policy_version 776543 (0.00097) [2022-07-10 15:11:49,991][25689] Fps is (10 sec: 5523.5, 60 sec: 5525.8, 300 sec: 5528.0). Total num frames: 795182080. Throughput: 0: 4990.5. Samples: 795179124. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:49,992][25689] Avg episode reward: [(0, '-1.654')] [2022-07-10 15:11:51,290][26022] Updated weights on worker 0-0, policy_version 776553 (0.00089) [2022-07-10 15:11:53,126][26022] Updated weights on worker 0-0, policy_version 776563 (0.00099) [2022-07-10 15:11:54,999][25689] Fps is (10 sec: 5319.1, 60 sec: 5528.7, 300 sec: 5522.0). Total num frames: 795209728. Throughput: 0: 5839.4. Samples: 795212888. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:11:55,009][25689] Avg episode reward: [(0, '-1.088')] [2022-07-10 15:11:55,013][26022] Updated weights on worker 0-0, policy_version 776573 (0.00084) [2022-07-10 15:11:56,869][26022] Updated weights on worker 0-0, policy_version 776583 (0.00086) [2022-07-10 15:11:58,756][26022] Updated weights on worker 0-0, policy_version 776593 (0.00086) [2022-07-10 15:12:00,123][25689] Fps is (10 sec: 5659.9, 60 sec: 5540.4, 300 sec: 5534.2). Total num frames: 795239424. Throughput: 0: 5808.9. Samples: 795246138. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:00,125][25689] Avg episode reward: [(0, '-0.949')] [2022-07-10 15:12:00,706][26022] Updated weights on worker 0-0, policy_version 776603 (0.00088) [2022-07-10 15:12:02,776][26022] Updated weights on worker 0-0, policy_version 776613 (0.00089) [2022-07-10 15:12:04,633][26022] Updated weights on worker 0-0, policy_version 776623 (0.00095) [2022-07-10 15:12:05,134][25689] Fps is (10 sec: 5354.7, 60 sec: 5525.1, 300 sec: 5524.5). Total num frames: 795264000. Throughput: 0: 4869.1. Samples: 795260738. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:05,136][25689] Avg episode reward: [(0, '0.162')] [2022-07-10 15:12:06,445][26022] Updated weights on worker 0-0, policy_version 776633 (0.00089) [2022-07-10 15:12:08,455][26022] Updated weights on worker 0-0, policy_version 776643 (0.00090) [2022-07-10 15:12:10,160][25689] Fps is (10 sec: 5203.1, 60 sec: 5523.3, 300 sec: 5524.7). Total num frames: 795291648. Throughput: 0: 5677.5. Samples: 795293632. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:10,161][25689] Avg episode reward: [(0, '-0.022')] [2022-07-10 15:12:10,231][26022] Updated weights on worker 0-0, policy_version 776653 (0.00093) [2022-07-10 15:12:11,969][26022] Updated weights on worker 0-0, policy_version 776663 (0.00086) [2022-07-10 15:12:13,865][26022] Updated weights on worker 0-0, policy_version 776673 (0.00093) [2022-07-10 15:12:15,184][25689] Fps is (10 sec: 5604.0, 60 sec: 5509.5, 300 sec: 5522.1). Total num frames: 795320320. Throughput: 0: 5669.2. Samples: 795327322. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:15,185][25689] Avg episode reward: [(0, '-0.075')] [2022-07-10 15:12:15,762][26022] Updated weights on worker 0-0, policy_version 776683 (0.00106) [2022-07-10 15:12:17,301][26022] Updated weights on worker 0-0, policy_version 776693 (0.00085) [2022-07-10 15:12:19,308][26022] Updated weights on worker 0-0, policy_version 776703 (0.00084) [2022-07-10 15:12:20,250][25689] Fps is (10 sec: 5785.0, 60 sec: 5563.7, 300 sec: 5521.5). Total num frames: 795350016. Throughput: 0: 4872.9. Samples: 795344214. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:20,250][25689] Avg episode reward: [(0, '-0.623')] [2022-07-10 15:12:20,968][26022] Updated weights on worker 0-0, policy_version 776713 (0.00086) [2022-07-10 15:12:22,910][26022] Updated weights on worker 0-0, policy_version 776723 (0.00096) [2022-07-10 15:12:24,638][26022] Updated weights on worker 0-0, policy_version 776733 (0.00083) [2022-07-10 15:12:25,323][25689] Fps is (10 sec: 5453.9, 60 sec: 5491.2, 300 sec: 5522.1). Total num frames: 795375616. Throughput: 0: 5815.6. Samples: 795378148. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:25,323][25689] Avg episode reward: [(0, '-0.730')] [2022-07-10 15:12:26,535][26022] Updated weights on worker 0-0, policy_version 776743 (0.00084) [2022-07-10 15:12:28,418][26022] Updated weights on worker 0-0, policy_version 776753 (0.00086) [2022-07-10 15:12:30,279][26022] Updated weights on worker 0-0, policy_version 776763 (0.00086) [2022-07-10 15:12:30,367][25689] Fps is (10 sec: 5465.5, 60 sec: 5539.8, 300 sec: 5521.3). Total num frames: 795405312. Throughput: 0: 5836.7. Samples: 795411574. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:30,368][25689] Avg episode reward: [(0, '-1.022')] [2022-07-10 15:12:32,155][26022] Updated weights on worker 0-0, policy_version 776773 (0.00086) [2022-07-10 15:12:33,935][26022] Updated weights on worker 0-0, policy_version 776783 (0.00094) [2022-07-10 15:12:35,394][25689] Fps is (10 sec: 5795.8, 60 sec: 5538.8, 300 sec: 5528.6). Total num frames: 795433984. Throughput: 0: 5002.0. Samples: 795428410. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:35,395][25689] Avg episode reward: [(0, '-1.989')] [2022-07-10 15:12:35,895][26022] Updated weights on worker 0-0, policy_version 776793 (0.00086) [2022-07-10 15:12:37,595][26022] Updated weights on worker 0-0, policy_version 776803 (0.00083) [2022-07-10 15:12:39,559][26022] Updated weights on worker 0-0, policy_version 776813 (0.00087) [2022-07-10 15:12:40,463][25689] Fps is (10 sec: 5578.5, 60 sec: 5536.3, 300 sec: 5524.1). Total num frames: 795461632. Throughput: 0: 5821.6. Samples: 795461888. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:40,464][25689] Avg episode reward: [(0, '-2.004')] [2022-07-10 15:12:41,078][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:12:41,097][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000776822_795465728.pth [2022-07-10 15:12:41,097][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000774876_793473024.pth [2022-07-10 15:12:41,328][26022] Updated weights on worker 0-0, policy_version 776823 (0.00091) [2022-07-10 15:12:43,237][26022] Updated weights on worker 0-0, policy_version 776833 (0.00090) [2022-07-10 15:12:44,924][26022] Updated weights on worker 0-0, policy_version 776843 (0.00091) [2022-07-10 15:12:45,529][25689] Fps is (10 sec: 5455.9, 60 sec: 5497.1, 300 sec: 5522.9). Total num frames: 795489280. Throughput: 0: 5810.8. Samples: 795495560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:45,529][25689] Avg episode reward: [(0, '-1.016')] [2022-07-10 15:12:46,743][26022] Updated weights on worker 0-0, policy_version 776853 (0.00084) [2022-07-10 15:12:48,753][26022] Updated weights on worker 0-0, policy_version 776863 (0.00372) [2022-07-10 15:12:50,378][26022] Updated weights on worker 0-0, policy_version 776873 (0.00109) [2022-07-10 15:12:50,532][25689] Fps is (10 sec: 5695.3, 60 sec: 5564.8, 300 sec: 5526.7). Total num frames: 795518976. Throughput: 0: 4997.8. Samples: 795512352. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:50,533][25689] Avg episode reward: [(0, '-0.894')] [2022-07-10 15:12:52,265][26022] Updated weights on worker 0-0, policy_version 776883 (0.00089) [2022-07-10 15:12:53,877][26022] Updated weights on worker 0-0, policy_version 776893 (0.00088) [2022-07-10 15:12:55,586][25689] Fps is (10 sec: 5701.5, 60 sec: 5560.5, 300 sec: 5528.2). Total num frames: 795546624. Throughput: 0: 5836.7. Samples: 795546270. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:12:55,587][25689] Avg episode reward: [(0, '-0.792')] [2022-07-10 15:12:55,975][26022] Updated weights on worker 0-0, policy_version 776903 (0.00090) [2022-07-10 15:12:57,562][26022] Updated weights on worker 0-0, policy_version 776913 (0.00084) [2022-07-10 15:12:59,705][26022] Updated weights on worker 0-0, policy_version 776923 (0.00084) [2022-07-10 15:13:00,671][25689] Fps is (10 sec: 5655.9, 60 sec: 5564.2, 300 sec: 5534.3). Total num frames: 795576320. Throughput: 0: 5846.0. Samples: 795580022. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:00,671][25689] Avg episode reward: [(0, '-0.140')] [2022-07-10 15:13:01,315][26022] Updated weights on worker 0-0, policy_version 776933 (0.00088) [2022-07-10 15:13:03,729][26022] Updated weights on worker 0-0, policy_version 776943 (0.00093) [2022-07-10 15:13:05,386][26022] Updated weights on worker 0-0, policy_version 776953 (0.00096) [2022-07-10 15:13:05,698][25689] Fps is (10 sec: 5468.4, 60 sec: 5579.5, 300 sec: 5531.9). Total num frames: 795601920. Throughput: 0: 4913.3. Samples: 795594660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:05,699][25689] Avg episode reward: [(0, '1.022')] [2022-07-10 15:13:07,342][26022] Updated weights on worker 0-0, policy_version 776963 (0.00090) [2022-07-10 15:13:09,091][26022] Updated weights on worker 0-0, policy_version 776973 (0.00085) [2022-07-10 15:13:10,760][25689] Fps is (10 sec: 5176.4, 60 sec: 5559.4, 300 sec: 5527.4). Total num frames: 795628544. Throughput: 0: 5721.3. Samples: 795628080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:10,760][25689] Avg episode reward: [(0, '1.094')] [2022-07-10 15:13:10,907][26022] Updated weights on worker 0-0, policy_version 776983 (0.00085) [2022-07-10 15:13:12,777][26022] Updated weights on worker 0-0, policy_version 776993 (0.00087) [2022-07-10 15:13:14,484][26022] Updated weights on worker 0-0, policy_version 777003 (0.00085) [2022-07-10 15:13:15,811][25689] Fps is (10 sec: 5569.3, 60 sec: 5573.8, 300 sec: 5529.0). Total num frames: 795658240. Throughput: 0: 5697.8. Samples: 795661504. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:15,811][25689] Avg episode reward: [(0, '0.420')] [2022-07-10 15:13:16,387][26022] Updated weights on worker 0-0, policy_version 777013 (0.00087) [2022-07-10 15:13:18,109][26022] Updated weights on worker 0-0, policy_version 777023 (0.00088) [2022-07-10 15:13:20,314][26022] Updated weights on worker 0-0, policy_version 777033 (0.00080) [2022-07-10 15:13:20,905][25689] Fps is (10 sec: 5652.3, 60 sec: 5537.4, 300 sec: 5527.3). Total num frames: 795685888. Throughput: 0: 5690.6. Samples: 795695166. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:20,905][25689] Avg episode reward: [(0, '0.378')] [2022-07-10 15:13:21,677][26022] Updated weights on worker 0-0, policy_version 777043 (0.00091) [2022-07-10 15:13:23,856][26022] Updated weights on worker 0-0, policy_version 777053 (0.00093) [2022-07-10 15:13:25,311][26022] Updated weights on worker 0-0, policy_version 777063 (0.00099) [2022-07-10 15:13:25,928][25689] Fps is (10 sec: 5465.8, 60 sec: 5575.8, 300 sec: 5527.7). Total num frames: 795713536. Throughput: 0: 5811.4. Samples: 795712220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:25,929][25689] Avg episode reward: [(0, '0.363')] [2022-07-10 15:13:27,446][26022] Updated weights on worker 0-0, policy_version 777073 (0.00083) [2022-07-10 15:13:29,151][26022] Updated weights on worker 0-0, policy_version 777083 (0.00096) [2022-07-10 15:13:30,974][25689] Fps is (10 sec: 5593.3, 60 sec: 5558.7, 300 sec: 5530.4). Total num frames: 795742208. Throughput: 0: 5807.3. Samples: 795745472. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:30,976][25689] Avg episode reward: [(0, '-1.278')] [2022-07-10 15:13:31,287][26022] Updated weights on worker 0-0, policy_version 777093 (0.00084) [2022-07-10 15:13:32,763][26022] Updated weights on worker 0-0, policy_version 777103 (0.00096) [2022-07-10 15:13:34,848][26022] Updated weights on worker 0-0, policy_version 777113 (0.00092) [2022-07-10 15:13:35,979][25689] Fps is (10 sec: 5705.0, 60 sec: 5560.7, 300 sec: 5531.5). Total num frames: 795770880. Throughput: 0: 5827.1. Samples: 795779026. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:35,980][25689] Avg episode reward: [(0, '-1.833')] [2022-07-10 15:13:36,545][26022] Updated weights on worker 0-0, policy_version 777123 (0.00087) [2022-07-10 15:13:38,468][26022] Updated weights on worker 0-0, policy_version 777133 (0.00091) [2022-07-10 15:13:40,368][26022] Updated weights on worker 0-0, policy_version 777143 (0.00087) [2022-07-10 15:13:41,084][25689] Fps is (10 sec: 5469.6, 60 sec: 5540.5, 300 sec: 5527.7). Total num frames: 795797504. Throughput: 0: 4983.1. Samples: 795795720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:41,085][25689] Avg episode reward: [(0, '-3.248')] [2022-07-10 15:13:42,232][26022] Updated weights on worker 0-0, policy_version 777153 (0.00092) [2022-07-10 15:13:44,131][26022] Updated weights on worker 0-0, policy_version 777163 (0.00090) [2022-07-10 15:13:45,774][26022] Updated weights on worker 0-0, policy_version 777173 (0.00083) [2022-07-10 15:13:46,130][25689] Fps is (10 sec: 5548.0, 60 sec: 5576.1, 300 sec: 5530.5). Total num frames: 795827200. Throughput: 0: 5771.3. Samples: 795828816. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:46,131][25689] Avg episode reward: [(0, '-2.601')] [2022-07-10 15:13:47,749][26022] Updated weights on worker 0-0, policy_version 777183 (0.00087) [2022-07-10 15:13:49,626][26022] Updated weights on worker 0-0, policy_version 777193 (0.00079) [2022-07-10 15:13:51,194][25689] Fps is (10 sec: 5570.5, 60 sec: 5519.9, 300 sec: 5526.2). Total num frames: 795853824. Throughput: 0: 5768.8. Samples: 795862118. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:51,195][25689] Avg episode reward: [(0, '-2.983')] [2022-07-10 15:13:51,474][26022] Updated weights on worker 0-0, policy_version 777203 (0.00099) [2022-07-10 15:13:53,348][26022] Updated weights on worker 0-0, policy_version 777213 (0.00078) [2022-07-10 15:13:55,316][26022] Updated weights on worker 0-0, policy_version 777223 (0.00088) [2022-07-10 15:13:56,210][25689] Fps is (10 sec: 5384.1, 60 sec: 5523.4, 300 sec: 5526.8). Total num frames: 795881472. Throughput: 0: 4935.8. Samples: 795878884. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:13:56,211][25689] Avg episode reward: [(0, '-2.353')] [2022-07-10 15:13:56,855][26022] Updated weights on worker 0-0, policy_version 777233 (0.00105) [2022-07-10 15:13:58,925][26022] Updated weights on worker 0-0, policy_version 777243 (0.00084) [2022-07-10 15:14:00,249][26022] Updated weights on worker 0-0, policy_version 777253 (0.00082) [2022-07-10 15:14:01,264][25689] Fps is (10 sec: 5694.4, 60 sec: 5526.1, 300 sec: 5536.3). Total num frames: 795911168. Throughput: 0: 5788.7. Samples: 795912540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:14:01,265][25689] Avg episode reward: [(0, '-1.269')] [2022-07-10 15:14:02,989][26022] Updated weights on worker 0-0, policy_version 777263 (0.00092) [2022-07-10 15:14:04,650][26022] Updated weights on worker 0-0, policy_version 777273 (0.00095) [2022-07-10 15:14:06,302][25689] Fps is (10 sec: 5378.1, 60 sec: 5508.3, 300 sec: 5523.0). Total num frames: 795935744. Throughput: 0: 5712.0. Samples: 795944034. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 15:14:06,304][25689] Avg episode reward: [(0, '-0.698')] [2022-07-10 15:14:06,556][26022] Updated weights on worker 0-0, policy_version 777283 (0.00086) [2022-07-10 15:14:08,319][26022] Updated weights on worker 0-0, policy_version 777293 (0.00082) [2022-07-10 15:14:10,155][26022] Updated weights on worker 0-0, policy_version 777303 (0.00092) [2022-07-10 15:14:11,320][25689] Fps is (10 sec: 5193.8, 60 sec: 5529.2, 300 sec: 5527.1). Total num frames: 795963392. Throughput: 0: 4900.5. Samples: 795960742. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:11,320][25689] Avg episode reward: [(0, '-0.032')] [2022-07-10 15:14:12,008][26022] Updated weights on worker 0-0, policy_version 777313 (0.00082) [2022-07-10 15:14:13,922][26022] Updated weights on worker 0-0, policy_version 777323 (0.00510) [2022-07-10 15:14:15,672][26022] Updated weights on worker 0-0, policy_version 777333 (0.00076) [2022-07-10 15:14:16,344][25689] Fps is (10 sec: 5608.2, 60 sec: 5514.7, 300 sec: 5528.0). Total num frames: 795992064. Throughput: 0: 5720.4. Samples: 795994058. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:16,345][25689] Avg episode reward: [(0, '0.140')] [2022-07-10 15:14:17,653][26022] Updated weights on worker 0-0, policy_version 777343 (0.00098) [2022-07-10 15:14:19,325][26022] Updated weights on worker 0-0, policy_version 777353 (0.00091) [2022-07-10 15:14:21,220][26022] Updated weights on worker 0-0, policy_version 777363 (0.00090) [2022-07-10 15:14:21,434][25689] Fps is (10 sec: 5669.3, 60 sec: 5532.0, 300 sec: 5530.0). Total num frames: 796020736. Throughput: 0: 5696.0. Samples: 796027428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:21,435][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 15:14:22,947][26022] Updated weights on worker 0-0, policy_version 777373 (0.00091) [2022-07-10 15:14:24,992][26022] Updated weights on worker 0-0, policy_version 777383 (0.00092) [2022-07-10 15:14:26,481][25689] Fps is (10 sec: 5555.9, 60 sec: 5529.8, 300 sec: 5525.8). Total num frames: 796048384. Throughput: 0: 4967.5. Samples: 796044276. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:26,482][25689] Avg episode reward: [(0, '-0.426')] [2022-07-10 15:14:26,695][26022] Updated weights on worker 0-0, policy_version 777393 (0.00091) [2022-07-10 15:14:28,775][26022] Updated weights on worker 0-0, policy_version 777403 (0.00097) [2022-07-10 15:14:30,515][26022] Updated weights on worker 0-0, policy_version 777413 (0.00098) [2022-07-10 15:14:31,538][25689] Fps is (10 sec: 5472.8, 60 sec: 5511.9, 300 sec: 5531.8). Total num frames: 796076032. Throughput: 0: 5762.8. Samples: 796077258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:31,539][25689] Avg episode reward: [(0, '-0.717')] [2022-07-10 15:14:32,365][26022] Updated weights on worker 0-0, policy_version 777423 (0.00086) [2022-07-10 15:14:34,167][26022] Updated weights on worker 0-0, policy_version 777433 (0.00097) [2022-07-10 15:14:35,942][26022] Updated weights on worker 0-0, policy_version 777443 (0.00087) [2022-07-10 15:14:36,562][25689] Fps is (10 sec: 5586.9, 60 sec: 5510.2, 300 sec: 5530.2). Total num frames: 796104704. Throughput: 0: 5782.2. Samples: 796110962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:36,563][25689] Avg episode reward: [(0, '-0.594')] [2022-07-10 15:14:37,795][26022] Updated weights on worker 0-0, policy_version 777453 (0.00085) [2022-07-10 15:14:39,779][26022] Updated weights on worker 0-0, policy_version 777463 (0.00222) [2022-07-10 15:14:41,222][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:14:41,242][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000777471_796130304.pth [2022-07-10 15:14:41,242][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000775526_794138624.pth [2022-07-10 15:14:41,491][26022] Updated weights on worker 0-0, policy_version 777473 (0.00090) [2022-07-10 15:14:41,647][25689] Fps is (10 sec: 5672.8, 60 sec: 5545.8, 300 sec: 5532.2). Total num frames: 796133376. Throughput: 0: 4958.9. Samples: 796127660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:41,647][25689] Avg episode reward: [(0, '-0.622')] [2022-07-10 15:14:43,584][26022] Updated weights on worker 0-0, policy_version 777483 (0.00092) [2022-07-10 15:14:45,095][26022] Updated weights on worker 0-0, policy_version 777493 (0.00092) [2022-07-10 15:14:46,666][25689] Fps is (10 sec: 5472.8, 60 sec: 5497.6, 300 sec: 5528.7). Total num frames: 796160000. Throughput: 0: 5789.0. Samples: 796161124. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:46,666][25689] Avg episode reward: [(0, '-0.496')] [2022-07-10 15:14:47,155][26022] Updated weights on worker 0-0, policy_version 777503 (0.00076) [2022-07-10 15:14:48,636][26022] Updated weights on worker 0-0, policy_version 777513 (0.00085) [2022-07-10 15:14:50,696][26022] Updated weights on worker 0-0, policy_version 777523 (0.00087) [2022-07-10 15:14:51,682][25689] Fps is (10 sec: 5612.1, 60 sec: 5552.7, 300 sec: 5535.6). Total num frames: 796189696. Throughput: 0: 5845.6. Samples: 796195012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:51,684][25689] Avg episode reward: [(0, '0.018')] [2022-07-10 15:14:52,361][26022] Updated weights on worker 0-0, policy_version 777533 (0.00086) [2022-07-10 15:14:54,392][26022] Updated weights on worker 0-0, policy_version 777543 (0.00086) [2022-07-10 15:14:56,201][26022] Updated weights on worker 0-0, policy_version 777553 (0.00083) [2022-07-10 15:14:56,692][25689] Fps is (10 sec: 5719.5, 60 sec: 5553.3, 300 sec: 5533.7). Total num frames: 796217344. Throughput: 0: 5010.8. Samples: 796211828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:14:56,693][25689] Avg episode reward: [(0, '-1.144')] [2022-07-10 15:14:57,999][26022] Updated weights on worker 0-0, policy_version 777563 (0.00089) [2022-07-10 15:14:59,730][26022] Updated weights on worker 0-0, policy_version 777573 (0.00103) [2022-07-10 15:15:01,822][25689] Fps is (10 sec: 5352.3, 60 sec: 5495.6, 300 sec: 5542.0). Total num frames: 796243968. Throughput: 0: 5839.7. Samples: 796245480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:01,823][25689] Avg episode reward: [(0, '-1.599')] [2022-07-10 15:15:01,992][26022] Updated weights on worker 0-0, policy_version 777583 (0.00103) [2022-07-10 15:15:03,755][26022] Updated weights on worker 0-0, policy_version 777593 (0.00083) [2022-07-10 15:15:05,636][26022] Updated weights on worker 0-0, policy_version 777603 (0.00093) [2022-07-10 15:15:06,864][25689] Fps is (10 sec: 5335.3, 60 sec: 5545.9, 300 sec: 5534.5). Total num frames: 796271616. Throughput: 0: 5749.9. Samples: 796277262. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:06,865][25689] Avg episode reward: [(0, '-1.212')] [2022-07-10 15:15:07,401][26022] Updated weights on worker 0-0, policy_version 777613 (0.00087) [2022-07-10 15:15:09,237][26022] Updated weights on worker 0-0, policy_version 777623 (0.00089) [2022-07-10 15:15:11,156][26022] Updated weights on worker 0-0, policy_version 777633 (0.00085) [2022-07-10 15:15:11,874][25689] Fps is (10 sec: 5603.1, 60 sec: 5563.5, 300 sec: 5538.2). Total num frames: 796300288. Throughput: 0: 4902.7. Samples: 796294006. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:11,874][25689] Avg episode reward: [(0, '-2.994')] [2022-07-10 15:15:12,921][26022] Updated weights on worker 0-0, policy_version 777643 (0.00083) [2022-07-10 15:15:14,766][26022] Updated weights on worker 0-0, policy_version 777653 (0.00088) [2022-07-10 15:15:16,684][26022] Updated weights on worker 0-0, policy_version 777663 (0.00090) [2022-07-10 15:15:16,950][25689] Fps is (10 sec: 5482.3, 60 sec: 5525.0, 300 sec: 5530.8). Total num frames: 796326912. Throughput: 0: 5698.1. Samples: 796327262. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:16,951][25689] Avg episode reward: [(0, '-3.239')] [2022-07-10 15:15:18,327][26022] Updated weights on worker 0-0, policy_version 777673 (0.00080) [2022-07-10 15:15:20,377][26022] Updated weights on worker 0-0, policy_version 777683 (0.00085) [2022-07-10 15:15:22,036][25689] Fps is (10 sec: 5542.2, 60 sec: 5542.3, 300 sec: 5537.5). Total num frames: 796356608. Throughput: 0: 5699.3. Samples: 796360682. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:22,036][25689] Avg episode reward: [(0, '-3.837')] [2022-07-10 15:15:22,143][26022] Updated weights on worker 0-0, policy_version 777693 (0.00088) [2022-07-10 15:15:24,154][26022] Updated weights on worker 0-0, policy_version 777703 (0.00086) [2022-07-10 15:15:25,992][26022] Updated weights on worker 0-0, policy_version 777713 (0.00090) [2022-07-10 15:15:27,099][25689] Fps is (10 sec: 5650.4, 60 sec: 5540.8, 300 sec: 5537.1). Total num frames: 796384256. Throughput: 0: 5787.3. Samples: 796394366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:27,100][25689] Avg episode reward: [(0, '-2.724')] [2022-07-10 15:15:27,681][26022] Updated weights on worker 0-0, policy_version 777723 (0.00093) [2022-07-10 15:15:29,595][26022] Updated weights on worker 0-0, policy_version 777733 (0.00084) [2022-07-10 15:15:31,295][26022] Updated weights on worker 0-0, policy_version 777743 (0.00051) [2022-07-10 15:15:32,154][25689] Fps is (10 sec: 5465.0, 60 sec: 5541.0, 300 sec: 5533.6). Total num frames: 796411904. Throughput: 0: 5762.7. Samples: 796410874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:32,155][25689] Avg episode reward: [(0, '-2.481')] [2022-07-10 15:15:33,262][26022] Updated weights on worker 0-0, policy_version 777753 (0.00091) [2022-07-10 15:15:35,055][26022] Updated weights on worker 0-0, policy_version 777763 (0.00089) [2022-07-10 15:15:36,986][26022] Updated weights on worker 0-0, policy_version 777773 (0.00090) [2022-07-10 15:15:37,226][25689] Fps is (10 sec: 5561.5, 60 sec: 5536.6, 300 sec: 5537.0). Total num frames: 796440576. Throughput: 0: 5769.1. Samples: 796444232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:37,226][25689] Avg episode reward: [(0, '-0.773')] [2022-07-10 15:15:38,627][26022] Updated weights on worker 0-0, policy_version 777783 (0.00063) [2022-07-10 15:15:40,576][26022] Updated weights on worker 0-0, policy_version 777793 (0.00097) [2022-07-10 15:15:42,275][25689] Fps is (10 sec: 5665.9, 60 sec: 5539.9, 300 sec: 5540.5). Total num frames: 796469248. Throughput: 0: 5777.6. Samples: 796477616. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:42,276][25689] Avg episode reward: [(0, '-0.622')] [2022-07-10 15:15:42,514][26022] Updated weights on worker 0-0, policy_version 777803 (0.00080) [2022-07-10 15:15:44,263][26022] Updated weights on worker 0-0, policy_version 777813 (0.00090) [2022-07-10 15:15:46,088][26022] Updated weights on worker 0-0, policy_version 777823 (0.00087) [2022-07-10 15:15:47,326][25689] Fps is (10 sec: 5474.5, 60 sec: 5536.9, 300 sec: 5533.6). Total num frames: 796495872. Throughput: 0: 4947.6. Samples: 796494442. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:47,327][25689] Avg episode reward: [(0, '-1.067')] [2022-07-10 15:15:47,893][26022] Updated weights on worker 0-0, policy_version 777833 (0.00090) [2022-07-10 15:15:49,884][26022] Updated weights on worker 0-0, policy_version 777843 (0.00080) [2022-07-10 15:15:51,436][26022] Updated weights on worker 0-0, policy_version 777853 (0.00086) [2022-07-10 15:15:52,372][25689] Fps is (10 sec: 5577.7, 60 sec: 5534.2, 300 sec: 5540.4). Total num frames: 796525568. Throughput: 0: 5783.1. Samples: 796527798. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:52,373][25689] Avg episode reward: [(0, '0.057')] [2022-07-10 15:15:53,683][26022] Updated weights on worker 0-0, policy_version 777863 (0.00087) [2022-07-10 15:15:54,950][26022] Updated weights on worker 0-0, policy_version 777873 (0.00092) [2022-07-10 15:15:57,283][26022] Updated weights on worker 0-0, policy_version 777883 (0.00093) [2022-07-10 15:15:57,383][25689] Fps is (10 sec: 5702.3, 60 sec: 5534.2, 300 sec: 5538.0). Total num frames: 796553216. Throughput: 0: 5816.6. Samples: 796561476. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:15:57,383][25689] Avg episode reward: [(0, '0.461')] [2022-07-10 15:15:58,951][26022] Updated weights on worker 0-0, policy_version 777893 (0.00087) [2022-07-10 15:16:00,743][26022] Updated weights on worker 0-0, policy_version 777903 (0.00081) [2022-07-10 15:16:02,437][25689] Fps is (10 sec: 5392.5, 60 sec: 5541.1, 300 sec: 5541.0). Total num frames: 796579840. Throughput: 0: 4978.2. Samples: 796577984. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:02,437][25689] Avg episode reward: [(0, '-0.276')] [2022-07-10 15:16:02,982][26022] Updated weights on worker 0-0, policy_version 777913 (0.00087) [2022-07-10 15:16:04,865][26022] Updated weights on worker 0-0, policy_version 777923 (0.00094) [2022-07-10 15:16:06,619][26022] Updated weights on worker 0-0, policy_version 777933 (0.00091) [2022-07-10 15:16:07,467][25689] Fps is (10 sec: 5381.9, 60 sec: 5542.2, 300 sec: 5540.5). Total num frames: 796607488. Throughput: 0: 5722.3. Samples: 796609692. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:07,467][25689] Avg episode reward: [(0, '-0.736')] [2022-07-10 15:16:08,657][26022] Updated weights on worker 0-0, policy_version 777943 (0.00090) [2022-07-10 15:16:10,423][26022] Updated weights on worker 0-0, policy_version 777953 (0.00088) [2022-07-10 15:16:12,350][26022] Updated weights on worker 0-0, policy_version 777963 (0.00093) [2022-07-10 15:16:12,474][25689] Fps is (10 sec: 5508.8, 60 sec: 5525.5, 300 sec: 5534.6). Total num frames: 796635136. Throughput: 0: 5730.6. Samples: 796642996. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:12,475][25689] Avg episode reward: [(0, '-2.186')] [2022-07-10 15:16:14,054][26022] Updated weights on worker 0-0, policy_version 777973 (0.00085) [2022-07-10 15:16:15,854][26022] Updated weights on worker 0-0, policy_version 777983 (0.00078) [2022-07-10 15:16:17,480][25689] Fps is (10 sec: 5522.1, 60 sec: 5548.9, 300 sec: 5539.8). Total num frames: 796662784. Throughput: 0: 4881.2. Samples: 796659576. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:17,481][25689] Avg episode reward: [(0, '-1.550')] [2022-07-10 15:16:17,951][26022] Updated weights on worker 0-0, policy_version 777993 (0.00086) [2022-07-10 15:16:19,611][26022] Updated weights on worker 0-0, policy_version 778003 (0.00085) [2022-07-10 15:16:21,343][26022] Updated weights on worker 0-0, policy_version 778013 (0.00087) [2022-07-10 15:16:22,541][25689] Fps is (10 sec: 5696.2, 60 sec: 5551.1, 300 sec: 5539.1). Total num frames: 796692480. Throughput: 0: 5731.8. Samples: 796693220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:22,542][25689] Avg episode reward: [(0, '-2.650')] [2022-07-10 15:16:23,160][26022] Updated weights on worker 0-0, policy_version 778023 (0.00100) [2022-07-10 15:16:25,025][26022] Updated weights on worker 0-0, policy_version 778033 (0.00091) [2022-07-10 15:16:26,923][26022] Updated weights on worker 0-0, policy_version 778043 (0.00086) [2022-07-10 15:16:27,543][25689] Fps is (10 sec: 5495.1, 60 sec: 5522.9, 300 sec: 5536.0). Total num frames: 796718080. Throughput: 0: 5845.5. Samples: 796727048. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:27,544][25689] Avg episode reward: [(0, '-2.999')] [2022-07-10 15:16:28,514][26022] Updated weights on worker 0-0, policy_version 778053 (0.00089) [2022-07-10 15:16:30,655][26022] Updated weights on worker 0-0, policy_version 778063 (0.00078) [2022-07-10 15:16:32,227][26022] Updated weights on worker 0-0, policy_version 778073 (0.00082) [2022-07-10 15:16:32,567][25689] Fps is (10 sec: 5515.7, 60 sec: 5559.6, 300 sec: 5539.3). Total num frames: 796747776. Throughput: 0: 5018.2. Samples: 796743822. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:32,570][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 15:16:34,095][26022] Updated weights on worker 0-0, policy_version 778083 (0.00086) [2022-07-10 15:16:35,949][26022] Updated weights on worker 0-0, policy_version 778093 (0.00094) [2022-07-10 15:16:37,593][25689] Fps is (10 sec: 5604.2, 60 sec: 5529.9, 300 sec: 5536.1). Total num frames: 796774400. Throughput: 0: 5856.3. Samples: 796777360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:37,593][25689] Avg episode reward: [(0, '-2.400')] [2022-07-10 15:16:37,927][26022] Updated weights on worker 0-0, policy_version 778103 (0.00349) [2022-07-10 15:16:39,749][26022] Updated weights on worker 0-0, policy_version 778113 (0.00087) [2022-07-10 15:16:41,326][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:16:41,335][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000778122_796796928.pth [2022-07-10 15:16:41,335][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000776172_794800128.pth [2022-07-10 15:16:41,335][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000778122_796796928.pth.milestone [2022-07-10 15:16:41,562][26022] Updated weights on worker 0-0, policy_version 778123 (0.00086) [2022-07-10 15:16:42,723][25689] Fps is (10 sec: 5545.4, 60 sec: 5539.4, 300 sec: 5533.8). Total num frames: 796804096. Throughput: 0: 5834.8. Samples: 796810974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:42,724][25689] Avg episode reward: [(0, '-0.686')] [2022-07-10 15:16:43,634][26022] Updated weights on worker 0-0, policy_version 778133 (0.00309) [2022-07-10 15:16:45,143][26022] Updated weights on worker 0-0, policy_version 778143 (0.00089) [2022-07-10 15:16:47,291][26022] Updated weights on worker 0-0, policy_version 778153 (0.00088) [2022-07-10 15:16:47,791][25689] Fps is (10 sec: 5622.8, 60 sec: 5554.8, 300 sec: 5539.5). Total num frames: 796831744. Throughput: 0: 4954.4. Samples: 796827366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:47,792][25689] Avg episode reward: [(0, '-1.499')] [2022-07-10 15:16:48,763][26022] Updated weights on worker 0-0, policy_version 778163 (0.00083) [2022-07-10 15:16:50,778][26022] Updated weights on worker 0-0, policy_version 778173 (0.00091) [2022-07-10 15:16:52,542][26022] Updated weights on worker 0-0, policy_version 778183 (0.00089) [2022-07-10 15:16:52,798][25689] Fps is (10 sec: 5590.0, 60 sec: 5541.5, 300 sec: 5543.0). Total num frames: 796860416. Throughput: 0: 5799.4. Samples: 796861152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:52,799][25689] Avg episode reward: [(0, '-1.358')] [2022-07-10 15:16:54,411][26022] Updated weights on worker 0-0, policy_version 778193 (0.00085) [2022-07-10 15:16:56,204][26022] Updated weights on worker 0-0, policy_version 778203 (0.00089) [2022-07-10 15:16:57,883][25689] Fps is (10 sec: 5682.0, 60 sec: 5551.5, 300 sec: 5540.2). Total num frames: 796889088. Throughput: 0: 5790.9. Samples: 796894862. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:16:57,884][25689] Avg episode reward: [(0, '-1.045')] [2022-07-10 15:16:57,951][26022] Updated weights on worker 0-0, policy_version 778213 (0.00084) [2022-07-10 15:16:59,870][26022] Updated weights on worker 0-0, policy_version 778223 (0.00092) [2022-07-10 15:17:01,823][26022] Updated weights on worker 0-0, policy_version 778233 (0.00083) [2022-07-10 15:17:02,935][25689] Fps is (10 sec: 5353.6, 60 sec: 5534.8, 300 sec: 5542.9). Total num frames: 796914688. Throughput: 0: 4980.5. Samples: 796911642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:02,936][25689] Avg episode reward: [(0, '-1.059')] [2022-07-10 15:17:03,875][26022] Updated weights on worker 0-0, policy_version 778243 (0.00093) [2022-07-10 15:17:05,762][26022] Updated weights on worker 0-0, policy_version 778253 (0.00089) [2022-07-10 15:17:07,539][26022] Updated weights on worker 0-0, policy_version 778263 (0.00083) [2022-07-10 15:17:08,031][25689] Fps is (10 sec: 5348.1, 60 sec: 5545.7, 300 sec: 5545.1). Total num frames: 796943360. Throughput: 0: 5732.3. Samples: 796943388. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:08,032][25689] Avg episode reward: [(0, '-1.294')] [2022-07-10 15:17:09,289][26022] Updated weights on worker 0-0, policy_version 778273 (0.00093) [2022-07-10 15:17:11,287][26022] Updated weights on worker 0-0, policy_version 778283 (0.00081) [2022-07-10 15:17:12,883][26022] Updated weights on worker 0-0, policy_version 778293 (0.00089) [2022-07-10 15:17:13,051][25689] Fps is (10 sec: 5770.0, 60 sec: 5578.4, 300 sec: 5548.6). Total num frames: 796973056. Throughput: 0: 5727.2. Samples: 796977146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:13,052][25689] Avg episode reward: [(0, '-1.468')] [2022-07-10 15:17:14,957][26022] Updated weights on worker 0-0, policy_version 778303 (0.00097) [2022-07-10 15:17:16,586][26022] Updated weights on worker 0-0, policy_version 778313 (0.00089) [2022-07-10 15:17:18,101][25689] Fps is (10 sec: 5592.8, 60 sec: 5557.4, 300 sec: 5538.5). Total num frames: 796999680. Throughput: 0: 4892.5. Samples: 796993778. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:18,102][25689] Avg episode reward: [(0, '-1.533')] [2022-07-10 15:17:18,535][26022] Updated weights on worker 0-0, policy_version 778323 (0.00094) [2022-07-10 15:17:20,362][26022] Updated weights on worker 0-0, policy_version 778333 (0.00091) [2022-07-10 15:17:22,214][26022] Updated weights on worker 0-0, policy_version 778343 (0.00096) [2022-07-10 15:17:23,151][25689] Fps is (10 sec: 5475.0, 60 sec: 5541.6, 300 sec: 5549.3). Total num frames: 797028352. Throughput: 0: 5728.1. Samples: 797027438. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:23,152][25689] Avg episode reward: [(0, '-1.947')] [2022-07-10 15:17:24,133][26022] Updated weights on worker 0-0, policy_version 778353 (0.00092) [2022-07-10 15:17:25,943][26022] Updated weights on worker 0-0, policy_version 778363 (0.00081) [2022-07-10 15:17:27,857][26022] Updated weights on worker 0-0, policy_version 778373 (0.00095) [2022-07-10 15:17:28,189][25689] Fps is (10 sec: 5481.5, 60 sec: 5555.1, 300 sec: 5539.1). Total num frames: 797054976. Throughput: 0: 5805.7. Samples: 797060418. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:28,191][25689] Avg episode reward: [(0, '-3.797')] [2022-07-10 15:17:29,567][26022] Updated weights on worker 0-0, policy_version 778383 (0.00090) [2022-07-10 15:17:31,643][26022] Updated weights on worker 0-0, policy_version 778393 (0.00090) [2022-07-10 15:17:33,217][25689] Fps is (10 sec: 5493.3, 60 sec: 5537.8, 300 sec: 5539.1). Total num frames: 797083648. Throughput: 0: 4952.6. Samples: 797077018. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:33,218][25689] Avg episode reward: [(0, '-4.242')] [2022-07-10 15:17:33,320][26022] Updated weights on worker 0-0, policy_version 778403 (0.00097) [2022-07-10 15:17:35,222][26022] Updated weights on worker 0-0, policy_version 778413 (0.00089) [2022-07-10 15:17:37,036][26022] Updated weights on worker 0-0, policy_version 778423 (0.00087) [2022-07-10 15:17:38,220][25689] Fps is (10 sec: 5614.7, 60 sec: 5556.8, 300 sec: 5540.3). Total num frames: 797111296. Throughput: 0: 5814.0. Samples: 797110748. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 15:17:38,220][25689] Avg episode reward: [(0, '-4.241')] [2022-07-10 15:17:38,883][26022] Updated weights on worker 0-0, policy_version 778433 (0.00096) [2022-07-10 15:17:40,657][26022] Updated weights on worker 0-0, policy_version 778443 (0.00092) [2022-07-10 15:17:42,570][26022] Updated weights on worker 0-0, policy_version 778453 (0.00086) [2022-07-10 15:17:43,346][25689] Fps is (10 sec: 5459.4, 60 sec: 5523.5, 300 sec: 5539.2). Total num frames: 797138944. Throughput: 0: 5780.4. Samples: 797144170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:17:43,346][25689] Avg episode reward: [(0, '-3.975')] [2022-07-10 15:17:44,380][26022] Updated weights on worker 0-0, policy_version 778463 (0.00091) [2022-07-10 15:17:46,267][26022] Updated weights on worker 0-0, policy_version 778473 (0.00086) [2022-07-10 15:17:48,054][26022] Updated weights on worker 0-0, policy_version 778483 (0.00091) [2022-07-10 15:17:48,360][25689] Fps is (10 sec: 5554.3, 60 sec: 5545.3, 300 sec: 5535.5). Total num frames: 797167616. Throughput: 0: 5801.7. Samples: 797177442. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:17:48,360][25689] Avg episode reward: [(0, '-4.406')] [2022-07-10 15:17:50,126][26022] Updated weights on worker 0-0, policy_version 778493 (0.00084) [2022-07-10 15:17:51,820][26022] Updated weights on worker 0-0, policy_version 778503 (0.00090) [2022-07-10 15:17:53,429][25689] Fps is (10 sec: 5687.0, 60 sec: 5539.6, 300 sec: 5538.7). Total num frames: 797196288. Throughput: 0: 5800.8. Samples: 797194264. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:17:53,430][25689] Avg episode reward: [(0, '-2.650')] [2022-07-10 15:17:53,607][26022] Updated weights on worker 0-0, policy_version 778513 (0.00084) [2022-07-10 15:17:55,384][26022] Updated weights on worker 0-0, policy_version 778523 (0.00095) [2022-07-10 15:17:57,390][26022] Updated weights on worker 0-0, policy_version 778533 (0.00081) [2022-07-10 15:17:58,432][25689] Fps is (10 sec: 5591.6, 60 sec: 5530.2, 300 sec: 5533.3). Total num frames: 797223936. Throughput: 0: 5804.8. Samples: 797228076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:17:58,433][25689] Avg episode reward: [(0, '-1.230')] [2022-07-10 15:17:58,912][26022] Updated weights on worker 0-0, policy_version 778543 (0.00090) [2022-07-10 15:18:01,095][26022] Updated weights on worker 0-0, policy_version 778553 (0.00085) [2022-07-10 15:18:02,989][26022] Updated weights on worker 0-0, policy_version 778563 (0.00091) [2022-07-10 15:18:03,549][25689] Fps is (10 sec: 5463.9, 60 sec: 5558.1, 300 sec: 5538.6). Total num frames: 797251584. Throughput: 0: 5705.1. Samples: 797259434. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:03,550][25689] Avg episode reward: [(0, '-1.054')] [2022-07-10 15:18:05,061][26022] Updated weights on worker 0-0, policy_version 778573 (0.00085) [2022-07-10 15:18:06,771][26022] Updated weights on worker 0-0, policy_version 778583 (0.00088) [2022-07-10 15:18:08,565][25689] Fps is (10 sec: 5356.1, 60 sec: 5531.6, 300 sec: 5539.4). Total num frames: 797278208. Throughput: 0: 4887.3. Samples: 797276190. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:08,565][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 15:18:08,655][26022] Updated weights on worker 0-0, policy_version 778593 (0.00088) [2022-07-10 15:18:10,599][26022] Updated weights on worker 0-0, policy_version 778603 (0.00458) [2022-07-10 15:18:12,228][26022] Updated weights on worker 0-0, policy_version 778613 (0.00087) [2022-07-10 15:18:13,617][25689] Fps is (10 sec: 5390.5, 60 sec: 5494.8, 300 sec: 5532.5). Total num frames: 797305856. Throughput: 0: 5720.2. Samples: 797309744. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:13,618][25689] Avg episode reward: [(0, '-0.070')] [2022-07-10 15:18:14,194][26022] Updated weights on worker 0-0, policy_version 778623 (0.00089) [2022-07-10 15:18:16,204][26022] Updated weights on worker 0-0, policy_version 778633 (0.00086) [2022-07-10 15:18:17,691][26022] Updated weights on worker 0-0, policy_version 778643 (0.00088) [2022-07-10 15:18:18,665][25689] Fps is (10 sec: 5677.6, 60 sec: 5545.8, 300 sec: 5540.2). Total num frames: 797335552. Throughput: 0: 5686.8. Samples: 797343136. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:18,665][25689] Avg episode reward: [(0, '0.391')] [2022-07-10 15:18:19,856][26022] Updated weights on worker 0-0, policy_version 778653 (0.00092) [2022-07-10 15:18:21,244][26022] Updated weights on worker 0-0, policy_version 778663 (0.00083) [2022-07-10 15:18:23,315][26022] Updated weights on worker 0-0, policy_version 778673 (0.00087) [2022-07-10 15:18:23,731][25689] Fps is (10 sec: 5670.1, 60 sec: 5527.4, 300 sec: 5539.4). Total num frames: 797363200. Throughput: 0: 4976.9. Samples: 797359876. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:23,731][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 15:18:25,181][26022] Updated weights on worker 0-0, policy_version 778683 (0.00087) [2022-07-10 15:18:27,081][26022] Updated weights on worker 0-0, policy_version 778693 (0.00087) [2022-07-10 15:18:28,768][25689] Fps is (10 sec: 5371.7, 60 sec: 5527.5, 300 sec: 5532.7). Total num frames: 797389824. Throughput: 0: 5786.9. Samples: 797393106. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:28,769][25689] Avg episode reward: [(0, '0.430')] [2022-07-10 15:18:28,955][26022] Updated weights on worker 0-0, policy_version 778703 (0.00084) [2022-07-10 15:18:30,600][26022] Updated weights on worker 0-0, policy_version 778713 (0.00093) [2022-07-10 15:18:32,700][26022] Updated weights on worker 0-0, policy_version 778723 (0.00086) [2022-07-10 15:18:33,779][25689] Fps is (10 sec: 5503.1, 60 sec: 5529.0, 300 sec: 5532.6). Total num frames: 797418496. Throughput: 0: 5778.8. Samples: 797426254. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:33,781][25689] Avg episode reward: [(0, '0.585')] [2022-07-10 15:18:34,365][26022] Updated weights on worker 0-0, policy_version 778733 (0.00082) [2022-07-10 15:18:36,481][26022] Updated weights on worker 0-0, policy_version 778743 (0.00092) [2022-07-10 15:18:38,023][26022] Updated weights on worker 0-0, policy_version 778753 (0.00090) [2022-07-10 15:18:38,788][25689] Fps is (10 sec: 5723.2, 60 sec: 5545.4, 300 sec: 5541.3). Total num frames: 797447168. Throughput: 0: 4963.5. Samples: 797443016. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:38,788][25689] Avg episode reward: [(0, '-0.193')] [2022-07-10 15:18:40,037][26022] Updated weights on worker 0-0, policy_version 778763 (0.00096) [2022-07-10 15:18:41,361][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:18:41,375][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000778770_797460480.pth [2022-07-10 15:18:41,376][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000776822_795465728.pth [2022-07-10 15:18:41,768][26022] Updated weights on worker 0-0, policy_version 778773 (0.00095) [2022-07-10 15:18:43,608][26022] Updated weights on worker 0-0, policy_version 778783 (0.00091) [2022-07-10 15:18:43,853][25689] Fps is (10 sec: 5590.6, 60 sec: 5550.9, 300 sec: 5534.1). Total num frames: 797474816. Throughput: 0: 5786.8. Samples: 797476320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:43,853][25689] Avg episode reward: [(0, '-0.814')] [2022-07-10 15:18:45,622][26022] Updated weights on worker 0-0, policy_version 778793 (0.00051) [2022-07-10 15:18:47,203][26022] Updated weights on worker 0-0, policy_version 778803 (0.00084) [2022-07-10 15:18:48,876][25689] Fps is (10 sec: 5379.8, 60 sec: 5516.3, 300 sec: 5534.8). Total num frames: 797501440. Throughput: 0: 5810.1. Samples: 797509934. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:48,876][25689] Avg episode reward: [(0, '-0.855')] [2022-07-10 15:18:49,423][26022] Updated weights on worker 0-0, policy_version 778813 (0.00084) [2022-07-10 15:18:50,873][26022] Updated weights on worker 0-0, policy_version 778823 (0.00152) [2022-07-10 15:18:53,122][26022] Updated weights on worker 0-0, policy_version 778833 (0.00088) [2022-07-10 15:18:53,907][25689] Fps is (10 sec: 5601.8, 60 sec: 5536.7, 300 sec: 5541.4). Total num frames: 797531136. Throughput: 0: 4974.2. Samples: 797526374. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:53,907][25689] Avg episode reward: [(0, '-0.891')] [2022-07-10 15:18:54,599][26022] Updated weights on worker 0-0, policy_version 778843 (0.00091) [2022-07-10 15:18:56,671][26022] Updated weights on worker 0-0, policy_version 778853 (0.00086) [2022-07-10 15:18:58,483][26022] Updated weights on worker 0-0, policy_version 778863 (0.00097) [2022-07-10 15:18:58,920][25689] Fps is (10 sec: 5505.2, 60 sec: 5501.9, 300 sec: 5528.4). Total num frames: 797556736. Throughput: 0: 5795.1. Samples: 797559686. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:18:58,921][25689] Avg episode reward: [(0, '-1.677')] [2022-07-10 15:19:00,314][26022] Updated weights on worker 0-0, policy_version 778873 (0.00085) [2022-07-10 15:19:02,555][26022] Updated weights on worker 0-0, policy_version 778883 (0.00085) [2022-07-10 15:19:04,033][25689] Fps is (10 sec: 5258.2, 60 sec: 5502.2, 300 sec: 5537.3). Total num frames: 797584384. Throughput: 0: 5673.3. Samples: 797590810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:04,034][25689] Avg episode reward: [(0, '-1.818')] [2022-07-10 15:19:04,498][26022] Updated weights on worker 0-0, policy_version 778893 (0.00085) [2022-07-10 15:19:06,339][26022] Updated weights on worker 0-0, policy_version 778903 (0.00088) [2022-07-10 15:19:08,190][26022] Updated weights on worker 0-0, policy_version 778913 (0.00086) [2022-07-10 15:19:09,040][25689] Fps is (10 sec: 5362.7, 60 sec: 5503.0, 300 sec: 5534.1). Total num frames: 797611008. Throughput: 0: 4844.4. Samples: 797607620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:09,041][25689] Avg episode reward: [(0, '-1.475')] [2022-07-10 15:19:09,681][26022] Updated weights on worker 0-0, policy_version 778923 (0.00089) [2022-07-10 15:19:11,945][26022] Updated weights on worker 0-0, policy_version 778933 (0.00090) [2022-07-10 15:19:13,463][26022] Updated weights on worker 0-0, policy_version 778943 (0.00138) [2022-07-10 15:19:14,112][25689] Fps is (10 sec: 5486.7, 60 sec: 5518.3, 300 sec: 5533.2). Total num frames: 797639680. Throughput: 0: 5673.7. Samples: 797641010. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:14,112][25689] Avg episode reward: [(0, '-0.773')] [2022-07-10 15:19:15,599][26022] Updated weights on worker 0-0, policy_version 778953 (0.00093) [2022-07-10 15:19:17,229][26022] Updated weights on worker 0-0, policy_version 778963 (0.00094) [2022-07-10 15:19:19,080][26022] Updated weights on worker 0-0, policy_version 778973 (0.00091) [2022-07-10 15:19:19,117][25689] Fps is (10 sec: 5792.5, 60 sec: 5522.1, 300 sec: 5538.2). Total num frames: 797669376. Throughput: 0: 5671.7. Samples: 797674236. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:19,118][25689] Avg episode reward: [(0, '-0.546')] [2022-07-10 15:19:21,049][26022] Updated weights on worker 0-0, policy_version 778983 (0.00083) [2022-07-10 15:19:22,720][26022] Updated weights on worker 0-0, policy_version 778993 (0.00087) [2022-07-10 15:19:24,271][25689] Fps is (10 sec: 5644.7, 60 sec: 5514.1, 300 sec: 5536.3). Total num frames: 797697024. Throughput: 0: 4943.1. Samples: 797690844. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:24,271][25689] Avg episode reward: [(0, '-0.469')] [2022-07-10 15:19:24,879][26022] Updated weights on worker 0-0, policy_version 779003 (0.00117) [2022-07-10 15:19:26,455][26022] Updated weights on worker 0-0, policy_version 779013 (0.00088) [2022-07-10 15:19:28,475][26022] Updated weights on worker 0-0, policy_version 779023 (0.00083) [2022-07-10 15:19:29,288][25689] Fps is (10 sec: 5436.9, 60 sec: 5532.9, 300 sec: 5537.0). Total num frames: 797724672. Throughput: 0: 5769.8. Samples: 797724442. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:29,288][25689] Avg episode reward: [(0, '-0.744')] [2022-07-10 15:19:30,136][26022] Updated weights on worker 0-0, policy_version 779033 (0.00055) [2022-07-10 15:19:31,882][26022] Updated weights on worker 0-0, policy_version 779043 (0.00089) [2022-07-10 15:19:33,770][26022] Updated weights on worker 0-0, policy_version 779053 (0.00081) [2022-07-10 15:19:34,292][25689] Fps is (10 sec: 5620.2, 60 sec: 5533.5, 300 sec: 5537.4). Total num frames: 797753344. Throughput: 0: 5810.6. Samples: 797758268. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:34,292][25689] Avg episode reward: [(0, '-0.292')] [2022-07-10 15:19:35,688][26022] Updated weights on worker 0-0, policy_version 779063 (0.00095) [2022-07-10 15:19:37,530][26022] Updated weights on worker 0-0, policy_version 779073 (0.00085) [2022-07-10 15:19:39,312][25689] Fps is (10 sec: 5516.0, 60 sec: 5498.6, 300 sec: 5531.7). Total num frames: 797779968. Throughput: 0: 4994.5. Samples: 797775102. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:39,314][25689] Avg episode reward: [(0, '-0.327')] [2022-07-10 15:19:39,423][26022] Updated weights on worker 0-0, policy_version 779083 (0.00091) [2022-07-10 15:19:40,979][26022] Updated weights on worker 0-0, policy_version 779093 (0.00100) [2022-07-10 15:19:42,841][26022] Updated weights on worker 0-0, policy_version 779103 (0.00089) [2022-07-10 15:19:44,448][25689] Fps is (10 sec: 5444.3, 60 sec: 5509.1, 300 sec: 5536.4). Total num frames: 797808640. Throughput: 0: 5847.6. Samples: 797808834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:44,449][25689] Avg episode reward: [(0, '-2.500')] [2022-07-10 15:19:44,665][26022] Updated weights on worker 0-0, policy_version 779113 (0.00085) [2022-07-10 15:19:46,682][26022] Updated weights on worker 0-0, policy_version 779123 (0.00086) [2022-07-10 15:19:48,550][26022] Updated weights on worker 0-0, policy_version 779133 (0.00096) [2022-07-10 15:19:49,478][25689] Fps is (10 sec: 5641.0, 60 sec: 5542.3, 300 sec: 5532.7). Total num frames: 797837312. Throughput: 0: 5842.8. Samples: 797842410. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:49,478][25689] Avg episode reward: [(0, '-2.466')] [2022-07-10 15:19:50,226][26022] Updated weights on worker 0-0, policy_version 779143 (0.00079) [2022-07-10 15:19:52,416][26022] Updated weights on worker 0-0, policy_version 779153 (0.00087) [2022-07-10 15:19:53,943][26022] Updated weights on worker 0-0, policy_version 779163 (0.00094) [2022-07-10 15:19:54,578][25689] Fps is (10 sec: 5661.2, 60 sec: 5519.1, 300 sec: 5534.5). Total num frames: 797865984. Throughput: 0: 4975.4. Samples: 797859198. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:54,578][25689] Avg episode reward: [(0, '-2.056')] [2022-07-10 15:19:55,950][26022] Updated weights on worker 0-0, policy_version 779173 (0.00085) [2022-07-10 15:19:57,482][26022] Updated weights on worker 0-0, policy_version 779183 (0.00086) [2022-07-10 15:19:59,495][26022] Updated weights on worker 0-0, policy_version 779193 (0.00081) [2022-07-10 15:19:59,639][25689] Fps is (10 sec: 5643.3, 60 sec: 5565.3, 300 sec: 5542.6). Total num frames: 797894656. Throughput: 0: 5792.2. Samples: 797892840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:19:59,640][25689] Avg episode reward: [(0, '-1.955')] [2022-07-10 15:20:01,129][26022] Updated weights on worker 0-0, policy_version 779203 (0.00083) [2022-07-10 15:20:03,485][26022] Updated weights on worker 0-0, policy_version 779213 (0.00089) [2022-07-10 15:20:04,692][25689] Fps is (10 sec: 5467.2, 60 sec: 5554.0, 300 sec: 5539.0). Total num frames: 797921280. Throughput: 0: 5707.7. Samples: 797924376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:04,692][25689] Avg episode reward: [(0, '-1.567')] [2022-07-10 15:20:05,340][26022] Updated weights on worker 0-0, policy_version 779223 (0.00086) [2022-07-10 15:20:07,211][26022] Updated weights on worker 0-0, policy_version 779233 (0.00097) [2022-07-10 15:20:08,868][26022] Updated weights on worker 0-0, policy_version 779243 (0.00089) [2022-07-10 15:20:09,697][25689] Fps is (10 sec: 5294.0, 60 sec: 5554.1, 300 sec: 5532.2). Total num frames: 797947904. Throughput: 0: 5716.6. Samples: 797957998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:09,698][25689] Avg episode reward: [(0, '-1.588')] [2022-07-10 15:20:11,035][26022] Updated weights on worker 0-0, policy_version 779253 (0.00091) [2022-07-10 15:20:12,604][26022] Updated weights on worker 0-0, policy_version 779263 (0.00084) [2022-07-10 15:20:14,581][26022] Updated weights on worker 0-0, policy_version 779273 (0.00087) [2022-07-10 15:20:14,700][25689] Fps is (10 sec: 5422.7, 60 sec: 5543.5, 300 sec: 5537.0). Total num frames: 797975552. Throughput: 0: 5732.5. Samples: 797974550. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:14,700][25689] Avg episode reward: [(0, '-0.459')] [2022-07-10 15:20:16,429][26022] Updated weights on worker 0-0, policy_version 779283 (0.00095) [2022-07-10 15:20:18,306][26022] Updated weights on worker 0-0, policy_version 779293 (0.00088) [2022-07-10 15:20:19,714][25689] Fps is (10 sec: 5520.7, 60 sec: 5509.0, 300 sec: 5531.5). Total num frames: 798003200. Throughput: 0: 5709.2. Samples: 798007448. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:19,714][25689] Avg episode reward: [(0, '-1.102')] [2022-07-10 15:20:20,273][26022] Updated weights on worker 0-0, policy_version 779303 (0.00094) [2022-07-10 15:20:22,059][26022] Updated weights on worker 0-0, policy_version 779313 (0.00087) [2022-07-10 15:20:23,872][26022] Updated weights on worker 0-0, policy_version 779323 (0.00090) [2022-07-10 15:20:24,772][25689] Fps is (10 sec: 5591.6, 60 sec: 5534.5, 300 sec: 5535.0). Total num frames: 798031872. Throughput: 0: 5790.2. Samples: 798040646. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:24,773][25689] Avg episode reward: [(0, '-1.358')] [2022-07-10 15:20:25,907][26022] Updated weights on worker 0-0, policy_version 779333 (0.00557) [2022-07-10 15:20:27,395][26022] Updated weights on worker 0-0, policy_version 779343 (0.00093) [2022-07-10 15:20:29,555][26022] Updated weights on worker 0-0, policy_version 779353 (0.00084) [2022-07-10 15:20:29,811][25689] Fps is (10 sec: 5476.4, 60 sec: 5515.6, 300 sec: 5531.9). Total num frames: 798058496. Throughput: 0: 4939.9. Samples: 798057356. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:29,811][25689] Avg episode reward: [(0, '-1.187')] [2022-07-10 15:20:31,245][26022] Updated weights on worker 0-0, policy_version 779363 (0.00086) [2022-07-10 15:20:33,160][26022] Updated weights on worker 0-0, policy_version 779373 (0.00889) [2022-07-10 15:20:34,837][25689] Fps is (10 sec: 5595.8, 60 sec: 5530.5, 300 sec: 5536.1). Total num frames: 798088192. Throughput: 0: 5791.1. Samples: 798091166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:34,837][25689] Avg episode reward: [(0, '-2.057')] [2022-07-10 15:20:34,844][26022] Updated weights on worker 0-0, policy_version 779383 (0.00089) [2022-07-10 15:20:36,682][26022] Updated weights on worker 0-0, policy_version 779393 (0.00103) [2022-07-10 15:20:38,453][26022] Updated weights on worker 0-0, policy_version 779403 (0.00090) [2022-07-10 15:20:39,861][25689] Fps is (10 sec: 5705.5, 60 sec: 5547.1, 300 sec: 5533.2). Total num frames: 798115840. Throughput: 0: 5836.3. Samples: 798125036. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:39,862][25689] Avg episode reward: [(0, '-1.952')] [2022-07-10 15:20:40,523][26022] Updated weights on worker 0-0, policy_version 779413 (0.00098) [2022-07-10 15:20:41,402][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:20:41,416][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000779419_798125056.pth [2022-07-10 15:20:41,417][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000777471_796130304.pth [2022-07-10 15:20:41,999][26022] Updated weights on worker 0-0, policy_version 779423 (0.00085) [2022-07-10 15:20:44,152][26022] Updated weights on worker 0-0, policy_version 779433 (0.00099) [2022-07-10 15:20:44,946][25689] Fps is (10 sec: 5571.4, 60 sec: 5551.8, 300 sec: 5539.4). Total num frames: 798144512. Throughput: 0: 5017.4. Samples: 798141864. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:44,946][25689] Avg episode reward: [(0, '-1.443')] [2022-07-10 15:20:45,665][26022] Updated weights on worker 0-0, policy_version 779443 (0.00083) [2022-07-10 15:20:47,763][26022] Updated weights on worker 0-0, policy_version 779453 (0.00079) [2022-07-10 15:20:49,213][26022] Updated weights on worker 0-0, policy_version 779463 (0.00094) [2022-07-10 15:20:49,950][25689] Fps is (10 sec: 5684.2, 60 sec: 5554.2, 300 sec: 5536.8). Total num frames: 798173184. Throughput: 0: 5877.1. Samples: 798175714. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:49,950][25689] Avg episode reward: [(0, '-1.336')] [2022-07-10 15:20:51,270][26022] Updated weights on worker 0-0, policy_version 779473 (0.00093) [2022-07-10 15:20:52,936][26022] Updated weights on worker 0-0, policy_version 779483 (0.00086) [2022-07-10 15:20:54,969][25689] Fps is (10 sec: 5516.6, 60 sec: 5527.6, 300 sec: 5533.1). Total num frames: 798199808. Throughput: 0: 5857.4. Samples: 798209090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:54,970][25689] Avg episode reward: [(0, '-2.097')] [2022-07-10 15:20:55,149][26022] Updated weights on worker 0-0, policy_version 779493 (0.00087) [2022-07-10 15:20:56,660][26022] Updated weights on worker 0-0, policy_version 779503 (0.00087) [2022-07-10 15:20:58,592][26022] Updated weights on worker 0-0, policy_version 779513 (0.00085) [2022-07-10 15:20:59,984][25689] Fps is (10 sec: 5612.6, 60 sec: 5548.9, 300 sec: 5544.2). Total num frames: 798229504. Throughput: 0: 5012.7. Samples: 798225906. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:20:59,985][25689] Avg episode reward: [(0, '-2.258')] [2022-07-10 15:21:00,455][26022] Updated weights on worker 0-0, policy_version 779523 (0.00088) [2022-07-10 15:21:02,560][26022] Updated weights on worker 0-0, policy_version 779533 (0.00092) [2022-07-10 15:21:04,535][26022] Updated weights on worker 0-0, policy_version 779543 (0.00086) [2022-07-10 15:21:05,089][25689] Fps is (10 sec: 5362.9, 60 sec: 5510.2, 300 sec: 5532.5). Total num frames: 798254080. Throughput: 0: 5721.6. Samples: 798257116. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 15:21:05,090][25689] Avg episode reward: [(0, '-3.044')] [2022-07-10 15:21:06,461][26022] Updated weights on worker 0-0, policy_version 779553 (0.00092) [2022-07-10 15:21:08,094][26022] Updated weights on worker 0-0, policy_version 779563 (0.00089) [2022-07-10 15:21:10,045][26022] Updated weights on worker 0-0, policy_version 779573 (0.00098) [2022-07-10 15:21:10,107][25689] Fps is (10 sec: 5260.4, 60 sec: 5543.0, 300 sec: 5535.7). Total num frames: 798282752. Throughput: 0: 5721.4. Samples: 798291040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:10,110][25689] Avg episode reward: [(0, '-4.444')] [2022-07-10 15:21:11,763][26022] Updated weights on worker 0-0, policy_version 779583 (0.00083) [2022-07-10 15:21:13,654][26022] Updated weights on worker 0-0, policy_version 779593 (0.00083) [2022-07-10 15:21:15,134][25689] Fps is (10 sec: 5708.7, 60 sec: 5557.6, 300 sec: 5538.8). Total num frames: 798311424. Throughput: 0: 4898.8. Samples: 798307874. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:15,136][25689] Avg episode reward: [(0, '-4.008')] [2022-07-10 15:21:15,433][26022] Updated weights on worker 0-0, policy_version 779603 (0.00748) [2022-07-10 15:21:17,187][26022] Updated weights on worker 0-0, policy_version 779613 (0.00085) [2022-07-10 15:21:18,945][26022] Updated weights on worker 0-0, policy_version 779623 (0.00083) [2022-07-10 15:21:20,150][25689] Fps is (10 sec: 5607.6, 60 sec: 5557.4, 300 sec: 5532.7). Total num frames: 798339072. Throughput: 0: 5765.8. Samples: 798342180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:20,151][25689] Avg episode reward: [(0, '-4.439')] [2022-07-10 15:21:21,034][26022] Updated weights on worker 0-0, policy_version 779633 (0.00084) [2022-07-10 15:21:22,774][26022] Updated weights on worker 0-0, policy_version 779643 (0.00086) [2022-07-10 15:21:24,563][26022] Updated weights on worker 0-0, policy_version 779653 (0.00093) [2022-07-10 15:21:25,281][25689] Fps is (10 sec: 5550.3, 60 sec: 5550.8, 300 sec: 5540.6). Total num frames: 798367744. Throughput: 0: 5858.5. Samples: 798375412. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:25,282][25689] Avg episode reward: [(0, '-4.575')] [2022-07-10 15:21:26,379][26022] Updated weights on worker 0-0, policy_version 779663 (0.00081) [2022-07-10 15:21:28,251][26022] Updated weights on worker 0-0, policy_version 779673 (0.00097) [2022-07-10 15:21:30,201][26022] Updated weights on worker 0-0, policy_version 779683 (0.00090) [2022-07-10 15:21:30,289][25689] Fps is (10 sec: 5554.8, 60 sec: 5570.5, 300 sec: 5534.1). Total num frames: 798395392. Throughput: 0: 5007.4. Samples: 798392102. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:30,290][25689] Avg episode reward: [(0, '-4.667')] [2022-07-10 15:21:31,992][26022] Updated weights on worker 0-0, policy_version 779693 (0.00087) [2022-07-10 15:21:33,672][26022] Updated weights on worker 0-0, policy_version 779703 (0.00087) [2022-07-10 15:21:35,319][25689] Fps is (10 sec: 5712.7, 60 sec: 5570.1, 300 sec: 5544.3). Total num frames: 798425088. Throughput: 0: 5832.1. Samples: 798425596. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:35,320][25689] Avg episode reward: [(0, '-4.053')] [2022-07-10 15:21:35,619][26022] Updated weights on worker 0-0, policy_version 779713 (0.00097) [2022-07-10 15:21:37,326][26022] Updated weights on worker 0-0, policy_version 779723 (0.00084) [2022-07-10 15:21:39,294][26022] Updated weights on worker 0-0, policy_version 779733 (0.00088) [2022-07-10 15:21:40,352][25689] Fps is (10 sec: 5597.0, 60 sec: 5552.5, 300 sec: 5535.8). Total num frames: 798451712. Throughput: 0: 5804.3. Samples: 798459436. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:40,352][25689] Avg episode reward: [(0, '-3.337')] [2022-07-10 15:21:40,938][26022] Updated weights on worker 0-0, policy_version 779743 (0.00093) [2022-07-10 15:21:43,066][26022] Updated weights on worker 0-0, policy_version 779753 (0.00086) [2022-07-10 15:21:44,551][26022] Updated weights on worker 0-0, policy_version 779763 (0.00094) [2022-07-10 15:21:45,475][25689] Fps is (10 sec: 5444.9, 60 sec: 5548.9, 300 sec: 5538.2). Total num frames: 798480384. Throughput: 0: 4987.6. Samples: 798476128. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:45,475][25689] Avg episode reward: [(0, '-4.443')] [2022-07-10 15:21:46,643][26022] Updated weights on worker 0-0, policy_version 779773 (0.00090) [2022-07-10 15:21:48,375][26022] Updated weights on worker 0-0, policy_version 779783 (0.00098) [2022-07-10 15:21:50,311][26022] Updated weights on worker 0-0, policy_version 779793 (0.00089) [2022-07-10 15:21:50,516][25689] Fps is (10 sec: 5642.0, 60 sec: 5545.6, 300 sec: 5537.6). Total num frames: 798509056. Throughput: 0: 5806.3. Samples: 798509542. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:50,516][25689] Avg episode reward: [(0, '-3.279')] [2022-07-10 15:21:52,291][26022] Updated weights on worker 0-0, policy_version 779803 (0.00084) [2022-07-10 15:21:53,807][26022] Updated weights on worker 0-0, policy_version 779813 (0.00086) [2022-07-10 15:21:55,550][25689] Fps is (10 sec: 5590.2, 60 sec: 5561.1, 300 sec: 5535.1). Total num frames: 798536704. Throughput: 0: 5829.2. Samples: 798543522. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:21:55,550][25689] Avg episode reward: [(0, '-3.023')] [2022-07-10 15:21:55,777][26022] Updated weights on worker 0-0, policy_version 779823 (0.00082) [2022-07-10 15:21:57,706][26022] Updated weights on worker 0-0, policy_version 779833 (0.00089) [2022-07-10 15:21:59,215][26022] Updated weights on worker 0-0, policy_version 779843 (0.00095) [2022-07-10 15:22:00,559][25689] Fps is (10 sec: 5607.6, 60 sec: 5544.7, 300 sec: 5546.2). Total num frames: 798565376. Throughput: 0: 4990.4. Samples: 798560278. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:00,560][25689] Avg episode reward: [(0, '-2.190')] [2022-07-10 15:22:01,413][26022] Updated weights on worker 0-0, policy_version 779853 (0.00087) [2022-07-10 15:22:03,200][26022] Updated weights on worker 0-0, policy_version 779863 (0.00091) [2022-07-10 15:22:05,267][26022] Updated weights on worker 0-0, policy_version 779873 (0.00084) [2022-07-10 15:22:05,620][25689] Fps is (10 sec: 5592.9, 60 sec: 5599.6, 300 sec: 5543.4). Total num frames: 798593024. Throughput: 0: 5748.2. Samples: 798591924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:05,620][25689] Avg episode reward: [(0, '-2.061')] [2022-07-10 15:22:07,018][26022] Updated weights on worker 0-0, policy_version 779883 (0.00088) [2022-07-10 15:22:08,736][26022] Updated weights on worker 0-0, policy_version 779893 (0.00091) [2022-07-10 15:22:10,626][25689] Fps is (10 sec: 5391.3, 60 sec: 5566.8, 300 sec: 5533.4). Total num frames: 798619648. Throughput: 0: 5777.4. Samples: 798625726. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:10,626][25689] Avg episode reward: [(0, '-2.845')] [2022-07-10 15:22:10,795][26022] Updated weights on worker 0-0, policy_version 779903 (0.00094) [2022-07-10 15:22:12,391][26022] Updated weights on worker 0-0, policy_version 779913 (0.00084) [2022-07-10 15:22:14,326][26022] Updated weights on worker 0-0, policy_version 779923 (0.00084) [2022-07-10 15:22:15,710][25689] Fps is (10 sec: 5479.9, 60 sec: 5561.5, 300 sec: 5539.6). Total num frames: 798648320. Throughput: 0: 4908.0. Samples: 798642470. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:15,711][25689] Avg episode reward: [(0, '-2.154')] [2022-07-10 15:22:16,233][26022] Updated weights on worker 0-0, policy_version 779933 (0.00092) [2022-07-10 15:22:18,031][26022] Updated weights on worker 0-0, policy_version 779943 (0.00065) [2022-07-10 15:22:19,878][26022] Updated weights on worker 0-0, policy_version 779953 (0.00083) [2022-07-10 15:22:20,732][25689] Fps is (10 sec: 5775.1, 60 sec: 5594.8, 300 sec: 5543.6). Total num frames: 798678016. Throughput: 0: 5747.2. Samples: 798676218. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:20,733][25689] Avg episode reward: [(0, '-3.144')] [2022-07-10 15:22:21,774][26022] Updated weights on worker 0-0, policy_version 779963 (0.00098) [2022-07-10 15:22:23,457][26022] Updated weights on worker 0-0, policy_version 779973 (0.00088) [2022-07-10 15:22:25,335][26022] Updated weights on worker 0-0, policy_version 779983 (0.00092) [2022-07-10 15:22:25,835][25689] Fps is (10 sec: 5562.8, 60 sec: 5563.6, 300 sec: 5542.4). Total num frames: 798704640. Throughput: 0: 5823.4. Samples: 798709646. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:25,835][25689] Avg episode reward: [(0, '-2.524')] [2022-07-10 15:22:27,220][26022] Updated weights on worker 0-0, policy_version 779993 (0.00095) [2022-07-10 15:22:28,959][26022] Updated weights on worker 0-0, policy_version 780003 (0.00086) [2022-07-10 15:22:30,849][25689] Fps is (10 sec: 5263.6, 60 sec: 5546.2, 300 sec: 5535.8). Total num frames: 798731264. Throughput: 0: 4978.5. Samples: 798726406. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:30,850][25689] Avg episode reward: [(0, '-4.189')] [2022-07-10 15:22:30,908][26022] Updated weights on worker 0-0, policy_version 780013 (0.00089) [2022-07-10 15:22:32,597][26022] Updated weights on worker 0-0, policy_version 780023 (0.00091) [2022-07-10 15:22:34,562][26022] Updated weights on worker 0-0, policy_version 780033 (0.00094) [2022-07-10 15:22:35,859][25689] Fps is (10 sec: 5720.6, 60 sec: 5564.9, 300 sec: 5545.9). Total num frames: 798761984. Throughput: 0: 5834.6. Samples: 798760030. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:35,860][25689] Avg episode reward: [(0, '-3.952')] [2022-07-10 15:22:36,442][26022] Updated weights on worker 0-0, policy_version 780043 (0.00079) [2022-07-10 15:22:38,074][26022] Updated weights on worker 0-0, policy_version 780053 (0.00082) [2022-07-10 15:22:40,018][26022] Updated weights on worker 0-0, policy_version 780063 (0.00095) [2022-07-10 15:22:40,905][25689] Fps is (10 sec: 5804.1, 60 sec: 5580.5, 300 sec: 5547.4). Total num frames: 798789632. Throughput: 0: 5843.5. Samples: 798794096. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:40,906][25689] Avg episode reward: [(0, '-3.258')] [2022-07-10 15:22:41,464][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:22:41,471][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000780072_798793728.pth [2022-07-10 15:22:41,472][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000778122_796796928.pth [2022-07-10 15:22:41,652][26022] Updated weights on worker 0-0, policy_version 780073 (0.00093) [2022-07-10 15:22:43,735][26022] Updated weights on worker 0-0, policy_version 780083 (0.01297) [2022-07-10 15:22:45,327][26022] Updated weights on worker 0-0, policy_version 780093 (0.00089) [2022-07-10 15:22:45,955][25689] Fps is (10 sec: 5578.3, 60 sec: 5587.3, 300 sec: 5546.7). Total num frames: 798818304. Throughput: 0: 5860.4. Samples: 798827560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:45,956][25689] Avg episode reward: [(0, '-2.403')] [2022-07-10 15:22:47,375][26022] Updated weights on worker 0-0, policy_version 780103 (0.00088) [2022-07-10 15:22:49,165][26022] Updated weights on worker 0-0, policy_version 780113 (0.00084) [2022-07-10 15:22:50,958][25689] Fps is (10 sec: 5500.5, 60 sec: 5556.9, 300 sec: 5541.1). Total num frames: 798844928. Throughput: 0: 5861.6. Samples: 798844278. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:50,958][25689] Avg episode reward: [(0, '-2.736')] [2022-07-10 15:22:50,972][26022] Updated weights on worker 0-0, policy_version 780123 (0.00112) [2022-07-10 15:22:52,891][26022] Updated weights on worker 0-0, policy_version 780133 (0.00096) [2022-07-10 15:22:54,500][26022] Updated weights on worker 0-0, policy_version 780143 (0.00090) [2022-07-10 15:22:55,967][25689] Fps is (10 sec: 5523.3, 60 sec: 5576.2, 300 sec: 5544.4). Total num frames: 798873600. Throughput: 0: 5844.9. Samples: 798877558. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:22:55,967][25689] Avg episode reward: [(0, '-2.454')] [2022-07-10 15:22:56,627][26022] Updated weights on worker 0-0, policy_version 780153 (0.00086) [2022-07-10 15:22:58,425][26022] Updated weights on worker 0-0, policy_version 780163 (0.00090) [2022-07-10 15:22:59,962][26022] Updated weights on worker 0-0, policy_version 780173 (0.00096) [2022-07-10 15:23:00,984][25689] Fps is (10 sec: 5719.7, 60 sec: 5575.5, 300 sec: 5549.8). Total num frames: 798902272. Throughput: 0: 5838.5. Samples: 798911326. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:00,984][25689] Avg episode reward: [(0, '-0.796')] [2022-07-10 15:23:02,488][26022] Updated weights on worker 0-0, policy_version 780183 (0.00080) [2022-07-10 15:23:04,064][26022] Updated weights on worker 0-0, policy_version 780193 (0.00087) [2022-07-10 15:23:06,038][26022] Updated weights on worker 0-0, policy_version 780203 (0.00106) [2022-07-10 15:23:06,042][25689] Fps is (10 sec: 5284.8, 60 sec: 5524.8, 300 sec: 5542.1). Total num frames: 798926848. Throughput: 0: 4904.5. Samples: 798926078. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:06,043][25689] Avg episode reward: [(0, '-0.634')] [2022-07-10 15:23:07,689][26022] Updated weights on worker 0-0, policy_version 780213 (0.00088) [2022-07-10 15:23:09,838][26022] Updated weights on worker 0-0, policy_version 780223 (0.00087) [2022-07-10 15:23:11,055][25689] Fps is (10 sec: 5388.8, 60 sec: 5575.0, 300 sec: 5549.7). Total num frames: 798956544. Throughput: 0: 5754.1. Samples: 798959918. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:11,057][25689] Avg episode reward: [(0, '-1.393')] [2022-07-10 15:23:11,295][26022] Updated weights on worker 0-0, policy_version 780233 (0.00090) [2022-07-10 15:23:13,498][26022] Updated weights on worker 0-0, policy_version 780243 (0.00089) [2022-07-10 15:23:14,786][26022] Updated weights on worker 0-0, policy_version 780253 (0.00085) [2022-07-10 15:23:16,082][25689] Fps is (10 sec: 5711.5, 60 sec: 5563.4, 300 sec: 5543.2). Total num frames: 798984192. Throughput: 0: 5777.2. Samples: 798993770. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:16,083][25689] Avg episode reward: [(0, '-0.024')] [2022-07-10 15:23:17,013][26022] Updated weights on worker 0-0, policy_version 780263 (0.00088) [2022-07-10 15:23:18,601][26022] Updated weights on worker 0-0, policy_version 780273 (0.00086) [2022-07-10 15:23:20,503][26022] Updated weights on worker 0-0, policy_version 780283 (0.00085) [2022-07-10 15:23:21,122][25689] Fps is (10 sec: 5594.7, 60 sec: 5544.8, 300 sec: 5547.1). Total num frames: 799012864. Throughput: 0: 4940.2. Samples: 799010812. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:21,123][25689] Avg episode reward: [(0, '-0.925')] [2022-07-10 15:23:22,353][26022] Updated weights on worker 0-0, policy_version 780293 (0.00089) [2022-07-10 15:23:24,132][26022] Updated weights on worker 0-0, policy_version 780303 (0.00084) [2022-07-10 15:23:25,975][26022] Updated weights on worker 0-0, policy_version 780313 (0.00097) [2022-07-10 15:23:26,175][25689] Fps is (10 sec: 5681.7, 60 sec: 5583.3, 300 sec: 5553.7). Total num frames: 799041536. Throughput: 0: 5885.7. Samples: 799044574. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:26,176][25689] Avg episode reward: [(0, '-1.061')] [2022-07-10 15:23:27,903][26022] Updated weights on worker 0-0, policy_version 780323 (0.00091) [2022-07-10 15:23:29,575][26022] Updated weights on worker 0-0, policy_version 780333 (0.00086) [2022-07-10 15:23:31,183][25689] Fps is (10 sec: 5496.0, 60 sec: 5583.8, 300 sec: 5546.9). Total num frames: 799068160. Throughput: 0: 5870.0. Samples: 799078068. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:31,183][25689] Avg episode reward: [(0, '-0.758')] [2022-07-10 15:23:31,592][26022] Updated weights on worker 0-0, policy_version 780343 (0.00085) [2022-07-10 15:23:33,267][26022] Updated weights on worker 0-0, policy_version 780353 (0.00091) [2022-07-10 15:23:35,268][26022] Updated weights on worker 0-0, policy_version 780363 (0.00088) [2022-07-10 15:23:36,185][25689] Fps is (10 sec: 5524.4, 60 sec: 5550.7, 300 sec: 5547.0). Total num frames: 799096832. Throughput: 0: 5025.2. Samples: 799094786. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:36,185][25689] Avg episode reward: [(0, '-1.377')] [2022-07-10 15:23:36,871][26022] Updated weights on worker 0-0, policy_version 780373 (0.00088) [2022-07-10 15:23:38,677][26022] Updated weights on worker 0-0, policy_version 780383 (0.00078) [2022-07-10 15:23:40,721][26022] Updated weights on worker 0-0, policy_version 780393 (0.00084) [2022-07-10 15:23:41,206][25689] Fps is (10 sec: 5619.2, 60 sec: 5553.0, 300 sec: 5547.8). Total num frames: 799124480. Throughput: 0: 5867.0. Samples: 799128644. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:41,207][25689] Avg episode reward: [(0, '-1.618')] [2022-07-10 15:23:42,380][26022] Updated weights on worker 0-0, policy_version 780403 (0.00086) [2022-07-10 15:23:44,295][26022] Updated weights on worker 0-0, policy_version 780413 (0.00088) [2022-07-10 15:23:46,097][26022] Updated weights on worker 0-0, policy_version 780423 (0.00086) [2022-07-10 15:23:46,327][25689] Fps is (10 sec: 5654.0, 60 sec: 5563.4, 300 sec: 5556.3). Total num frames: 799154176. Throughput: 0: 5844.6. Samples: 799162352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:46,327][25689] Avg episode reward: [(0, '-1.475')] [2022-07-10 15:23:47,873][26022] Updated weights on worker 0-0, policy_version 780433 (0.00086) [2022-07-10 15:23:49,722][26022] Updated weights on worker 0-0, policy_version 780443 (0.00095) [2022-07-10 15:23:51,392][25689] Fps is (10 sec: 5629.2, 60 sec: 5574.6, 300 sec: 5548.8). Total num frames: 799181824. Throughput: 0: 5004.8. Samples: 799179214. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:51,393][25689] Avg episode reward: [(0, '-1.309')] [2022-07-10 15:23:51,643][26022] Updated weights on worker 0-0, policy_version 780453 (0.00084) [2022-07-10 15:23:53,219][26022] Updated weights on worker 0-0, policy_version 780463 (0.00081) [2022-07-10 15:23:55,328][26022] Updated weights on worker 0-0, policy_version 780473 (0.00083) [2022-07-10 15:23:56,469][25689] Fps is (10 sec: 5552.9, 60 sec: 5568.3, 300 sec: 5557.9). Total num frames: 799210496. Throughput: 0: 5815.4. Samples: 799212750. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:23:56,470][25689] Avg episode reward: [(0, '-1.318')] [2022-07-10 15:23:57,193][26022] Updated weights on worker 0-0, policy_version 780483 (0.00801) [2022-07-10 15:23:58,803][26022] Updated weights on worker 0-0, policy_version 780493 (0.00091) [2022-07-10 15:24:00,955][26022] Updated weights on worker 0-0, policy_version 780503 (0.00517) [2022-07-10 15:24:01,527][25689] Fps is (10 sec: 5557.4, 60 sec: 5547.7, 300 sec: 5559.0). Total num frames: 799238144. Throughput: 0: 5819.7. Samples: 799246906. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:01,527][25689] Avg episode reward: [(0, '-1.135')] [2022-07-10 15:24:02,721][26022] Updated weights on worker 0-0, policy_version 780513 (0.00202) [2022-07-10 15:24:04,970][26022] Updated weights on worker 0-0, policy_version 780523 (0.00091) [2022-07-10 15:24:06,255][26022] Updated weights on worker 0-0, policy_version 780533 (0.00087) [2022-07-10 15:24:06,591][25689] Fps is (10 sec: 5564.0, 60 sec: 5614.8, 300 sec: 5564.8). Total num frames: 799266816. Throughput: 0: 4896.5. Samples: 799261578. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:06,592][25689] Avg episode reward: [(0, '-0.647')] [2022-07-10 15:24:08,413][26022] Updated weights on worker 0-0, policy_version 780543 (0.00081) [2022-07-10 15:24:10,191][26022] Updated weights on worker 0-0, policy_version 780553 (0.00620) [2022-07-10 15:24:11,682][25689] Fps is (10 sec: 5545.6, 60 sec: 5573.7, 300 sec: 5561.0). Total num frames: 799294464. Throughput: 0: 5707.7. Samples: 799295024. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:11,689][25689] Avg episode reward: [(0, '-0.535')] [2022-07-10 15:24:12,032][26022] Updated weights on worker 0-0, policy_version 780563 (0.00090) [2022-07-10 15:24:13,904][26022] Updated weights on worker 0-0, policy_version 780573 (0.00083) [2022-07-10 15:24:15,580][26022] Updated weights on worker 0-0, policy_version 780583 (0.00086) [2022-07-10 15:24:16,783][25689] Fps is (10 sec: 5425.7, 60 sec: 5567.0, 300 sec: 5552.3). Total num frames: 799322112. Throughput: 0: 5712.8. Samples: 799328798. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:16,783][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 15:24:17,482][26022] Updated weights on worker 0-0, policy_version 780593 (0.00087) [2022-07-10 15:24:19,373][26022] Updated weights on worker 0-0, policy_version 780603 (0.00086) [2022-07-10 15:24:21,095][26022] Updated weights on worker 0-0, policy_version 780613 (0.00088) [2022-07-10 15:24:21,787][25689] Fps is (10 sec: 5675.1, 60 sec: 5587.1, 300 sec: 5562.0). Total num frames: 799351808. Throughput: 0: 4874.8. Samples: 799345670. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:21,787][25689] Avg episode reward: [(0, '-0.553')] [2022-07-10 15:24:22,987][26022] Updated weights on worker 0-0, policy_version 780623 (0.00096) [2022-07-10 15:24:24,839][26022] Updated weights on worker 0-0, policy_version 780633 (0.00084) [2022-07-10 15:24:26,695][26022] Updated weights on worker 0-0, policy_version 780643 (0.00086) [2022-07-10 15:24:26,896][25689] Fps is (10 sec: 5670.3, 60 sec: 5565.1, 300 sec: 5560.3). Total num frames: 799379456. Throughput: 0: 5800.6. Samples: 799379358. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:26,897][25689] Avg episode reward: [(0, '-2.414')] [2022-07-10 15:24:28,543][26022] Updated weights on worker 0-0, policy_version 780653 (0.00082) [2022-07-10 15:24:30,446][26022] Updated weights on worker 0-0, policy_version 780663 (0.00093) [2022-07-10 15:24:31,905][25689] Fps is (10 sec: 5465.1, 60 sec: 5581.9, 300 sec: 5556.7). Total num frames: 799407104. Throughput: 0: 5836.1. Samples: 799413044. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:31,905][25689] Avg episode reward: [(0, '-2.023')] [2022-07-10 15:24:32,149][26022] Updated weights on worker 0-0, policy_version 780673 (0.00087) [2022-07-10 15:24:33,977][26022] Updated weights on worker 0-0, policy_version 780683 (0.00094) [2022-07-10 15:24:35,919][26022] Updated weights on worker 0-0, policy_version 780693 (0.00088) [2022-07-10 15:24:36,976][25689] Fps is (10 sec: 5587.3, 60 sec: 5575.5, 300 sec: 5562.7). Total num frames: 799435776. Throughput: 0: 5008.3. Samples: 799429930. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-10 15:24:36,976][25689] Avg episode reward: [(0, '-2.470')] [2022-07-10 15:24:37,519][26022] Updated weights on worker 0-0, policy_version 780703 (0.00088) [2022-07-10 15:24:39,399][26022] Updated weights on worker 0-0, policy_version 780713 (0.00090) [2022-07-10 15:24:41,195][26022] Updated weights on worker 0-0, policy_version 780723 (0.00086) [2022-07-10 15:24:41,610][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:24:41,632][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000780725_799462400.pth [2022-07-10 15:24:41,633][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000778770_797460480.pth [2022-07-10 15:24:42,014][25689] Fps is (10 sec: 5571.4, 60 sec: 5574.0, 300 sec: 5561.1). Total num frames: 799463424. Throughput: 0: 5823.2. Samples: 799463454. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:24:42,015][25689] Avg episode reward: [(0, '-2.260')] [2022-07-10 15:24:43,043][26022] Updated weights on worker 0-0, policy_version 780733 (0.00089) [2022-07-10 15:24:44,932][26022] Updated weights on worker 0-0, policy_version 780743 (0.00087) [2022-07-10 15:24:46,750][26022] Updated weights on worker 0-0, policy_version 780753 (0.00089) [2022-07-10 15:24:47,116][25689] Fps is (10 sec: 5554.4, 60 sec: 5558.9, 300 sec: 5559.7). Total num frames: 799492096. Throughput: 0: 5821.4. Samples: 799497064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:24:47,116][25689] Avg episode reward: [(0, '-3.152')] [2022-07-10 15:24:48,616][26022] Updated weights on worker 0-0, policy_version 780763 (0.00081) [2022-07-10 15:24:50,191][26022] Updated weights on worker 0-0, policy_version 780773 (0.00082) [2022-07-10 15:24:52,117][25689] Fps is (10 sec: 5675.8, 60 sec: 5581.7, 300 sec: 5561.6). Total num frames: 799520768. Throughput: 0: 4999.1. Samples: 799514086. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:24:52,119][25689] Avg episode reward: [(0, '-1.597')] [2022-07-10 15:24:52,236][26022] Updated weights on worker 0-0, policy_version 780783 (0.00086) [2022-07-10 15:24:54,034][26022] Updated weights on worker 0-0, policy_version 780793 (0.00087) [2022-07-10 15:24:56,044][26022] Updated weights on worker 0-0, policy_version 780803 (0.00090) [2022-07-10 15:24:57,143][25689] Fps is (10 sec: 5616.5, 60 sec: 5569.4, 300 sec: 5558.8). Total num frames: 799548416. Throughput: 0: 5839.2. Samples: 799547690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:24:57,144][25689] Avg episode reward: [(0, '-1.671')] [2022-07-10 15:24:57,670][26022] Updated weights on worker 0-0, policy_version 780813 (0.00083) [2022-07-10 15:24:59,671][26022] Updated weights on worker 0-0, policy_version 780823 (0.00092) [2022-07-10 15:25:01,891][26022] Updated weights on worker 0-0, policy_version 780833 (0.00096) [2022-07-10 15:25:02,152][25689] Fps is (10 sec: 5306.2, 60 sec: 5540.1, 300 sec: 5556.2). Total num frames: 799574016. Throughput: 0: 5842.6. Samples: 799581114. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:02,152][25689] Avg episode reward: [(0, '-1.891')] [2022-07-10 15:25:03,696][26022] Updated weights on worker 0-0, policy_version 780843 (0.00085) [2022-07-10 15:25:05,507][26022] Updated weights on worker 0-0, policy_version 780853 (0.00093) [2022-07-10 15:25:07,209][25689] Fps is (10 sec: 5493.6, 60 sec: 5557.8, 300 sec: 5565.5). Total num frames: 799603712. Throughput: 0: 4919.1. Samples: 799595904. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:07,209][25689] Avg episode reward: [(0, '-1.633')] [2022-07-10 15:25:07,210][26022] Updated weights on worker 0-0, policy_version 780863 (0.00085) [2022-07-10 15:25:09,238][26022] Updated weights on worker 0-0, policy_version 780873 (0.00090) [2022-07-10 15:25:10,839][26022] Updated weights on worker 0-0, policy_version 780883 (0.00091) [2022-07-10 15:25:12,269][25689] Fps is (10 sec: 5566.8, 60 sec: 5543.7, 300 sec: 5561.0). Total num frames: 799630336. Throughput: 0: 5715.5. Samples: 799629266. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:12,269][25689] Avg episode reward: [(0, '-1.447')] [2022-07-10 15:25:12,878][26022] Updated weights on worker 0-0, policy_version 780893 (0.00084) [2022-07-10 15:25:14,599][26022] Updated weights on worker 0-0, policy_version 780903 (0.00093) [2022-07-10 15:25:16,366][26022] Updated weights on worker 0-0, policy_version 780913 (0.00079) [2022-07-10 15:25:17,291][25689] Fps is (10 sec: 5484.6, 60 sec: 5567.8, 300 sec: 5564.3). Total num frames: 799659008. Throughput: 0: 5709.9. Samples: 799662730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:17,291][25689] Avg episode reward: [(0, '-0.979')] [2022-07-10 15:25:18,415][26022] Updated weights on worker 0-0, policy_version 780923 (0.00089) [2022-07-10 15:25:20,078][26022] Updated weights on worker 0-0, policy_version 780933 (0.00088) [2022-07-10 15:25:22,006][26022] Updated weights on worker 0-0, policy_version 780943 (0.00083) [2022-07-10 15:25:22,316][25689] Fps is (10 sec: 5707.5, 60 sec: 5548.9, 300 sec: 5564.9). Total num frames: 799687680. Throughput: 0: 5713.7. Samples: 799696326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:22,317][25689] Avg episode reward: [(0, '0.363')] [2022-07-10 15:25:23,899][26022] Updated weights on worker 0-0, policy_version 780953 (0.00090) [2022-07-10 15:25:25,511][26022] Updated weights on worker 0-0, policy_version 780963 (0.00542) [2022-07-10 15:25:27,366][25689] Fps is (10 sec: 5488.5, 60 sec: 5537.5, 300 sec: 5564.7). Total num frames: 799714304. Throughput: 0: 5797.1. Samples: 799712756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:27,366][25689] Avg episode reward: [(0, '-0.584')] [2022-07-10 15:25:27,671][26022] Updated weights on worker 0-0, policy_version 780973 (0.00094) [2022-07-10 15:25:29,380][26022] Updated weights on worker 0-0, policy_version 780983 (0.00084) [2022-07-10 15:25:31,171][26022] Updated weights on worker 0-0, policy_version 780993 (0.00096) [2022-07-10 15:25:32,388][25689] Fps is (10 sec: 5490.3, 60 sec: 5553.2, 300 sec: 5561.3). Total num frames: 799742976. Throughput: 0: 5808.2. Samples: 799746120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:32,388][25689] Avg episode reward: [(0, '-1.248')] [2022-07-10 15:25:33,247][26022] Updated weights on worker 0-0, policy_version 781003 (0.00088) [2022-07-10 15:25:34,727][26022] Updated weights on worker 0-0, policy_version 781013 (0.00090) [2022-07-10 15:25:36,956][26022] Updated weights on worker 0-0, policy_version 781023 (0.00865) [2022-07-10 15:25:37,399][25689] Fps is (10 sec: 5715.5, 60 sec: 5558.7, 300 sec: 5565.0). Total num frames: 799771648. Throughput: 0: 5826.7. Samples: 799779894. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:37,399][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 15:25:38,300][26022] Updated weights on worker 0-0, policy_version 781033 (0.00080) [2022-07-10 15:25:40,457][26022] Updated weights on worker 0-0, policy_version 781043 (0.00094) [2022-07-10 15:25:42,253][26022] Updated weights on worker 0-0, policy_version 781053 (0.00093) [2022-07-10 15:25:42,407][25689] Fps is (10 sec: 5518.8, 60 sec: 5544.4, 300 sec: 5559.6). Total num frames: 799798272. Throughput: 0: 4989.3. Samples: 799796568. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:42,408][25689] Avg episode reward: [(0, '-1.544')] [2022-07-10 15:25:44,187][26022] Updated weights on worker 0-0, policy_version 781063 (0.00087) [2022-07-10 15:25:46,004][26022] Updated weights on worker 0-0, policy_version 781073 (0.00097) [2022-07-10 15:25:47,524][25689] Fps is (10 sec: 5461.3, 60 sec: 5543.1, 300 sec: 5557.5). Total num frames: 799826944. Throughput: 0: 5811.8. Samples: 799829910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:47,524][25689] Avg episode reward: [(0, '-1.974')] [2022-07-10 15:25:47,860][26022] Updated weights on worker 0-0, policy_version 781083 (0.00086) [2022-07-10 15:25:49,681][26022] Updated weights on worker 0-0, policy_version 781093 (0.00093) [2022-07-10 15:25:51,383][26022] Updated weights on worker 0-0, policy_version 781103 (0.00083) [2022-07-10 15:25:52,571][25689] Fps is (10 sec: 5541.5, 60 sec: 5522.0, 300 sec: 5560.4). Total num frames: 799854592. Throughput: 0: 5805.1. Samples: 799863284. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:52,571][25689] Avg episode reward: [(0, '-2.088')] [2022-07-10 15:25:53,311][26022] Updated weights on worker 0-0, policy_version 781113 (0.00085) [2022-07-10 15:25:55,134][26022] Updated weights on worker 0-0, policy_version 781123 (0.00087) [2022-07-10 15:25:56,982][26022] Updated weights on worker 0-0, policy_version 781133 (0.00082) [2022-07-10 15:25:57,583][25689] Fps is (10 sec: 5598.6, 60 sec: 5540.1, 300 sec: 5557.0). Total num frames: 799883264. Throughput: 0: 4945.7. Samples: 799879722. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:25:57,584][25689] Avg episode reward: [(0, '-1.144')] [2022-07-10 15:25:58,962][26022] Updated weights on worker 0-0, policy_version 781143 (0.00096) [2022-07-10 15:26:00,569][26022] Updated weights on worker 0-0, policy_version 781153 (0.00086) [2022-07-10 15:26:02,614][25689] Fps is (10 sec: 5403.6, 60 sec: 5538.1, 300 sec: 5561.8). Total num frames: 799908864. Throughput: 0: 5774.0. Samples: 799913244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:02,619][25689] Avg episode reward: [(0, '-1.292')] [2022-07-10 15:26:03,041][26022] Updated weights on worker 0-0, policy_version 781163 (0.00083) [2022-07-10 15:26:04,713][26022] Updated weights on worker 0-0, policy_version 781173 (0.00084) [2022-07-10 15:26:06,587][26022] Updated weights on worker 0-0, policy_version 781183 (0.00086) [2022-07-10 15:26:07,745][25689] Fps is (10 sec: 5341.0, 60 sec: 5514.5, 300 sec: 5559.7). Total num frames: 799937536. Throughput: 0: 5683.7. Samples: 799944840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:07,745][25689] Avg episode reward: [(0, '-1.220')] [2022-07-10 15:26:08,364][26022] Updated weights on worker 0-0, policy_version 781193 (0.00094) [2022-07-10 15:26:10,190][26022] Updated weights on worker 0-0, policy_version 781203 (0.00085) [2022-07-10 15:26:12,174][26022] Updated weights on worker 0-0, policy_version 781213 (0.00087) [2022-07-10 15:26:12,774][25689] Fps is (10 sec: 5543.6, 60 sec: 5534.2, 300 sec: 5556.2). Total num frames: 799965184. Throughput: 0: 4869.3. Samples: 799961660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:12,775][25689] Avg episode reward: [(0, '0.054')] [2022-07-10 15:26:13,879][26022] Updated weights on worker 0-0, policy_version 781223 (0.00092) [2022-07-10 15:26:15,760][26022] Updated weights on worker 0-0, policy_version 781233 (0.00084) [2022-07-10 15:26:17,505][26022] Updated weights on worker 0-0, policy_version 781243 (0.00088) [2022-07-10 15:26:17,837][25689] Fps is (10 sec: 5682.2, 60 sec: 5547.4, 300 sec: 5562.3). Total num frames: 799994880. Throughput: 0: 5717.2. Samples: 799995514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:17,838][25689] Avg episode reward: [(0, '-0.565')] [2022-07-10 15:26:19,385][26022] Updated weights on worker 0-0, policy_version 781253 (0.00088) [2022-07-10 15:26:21,230][26022] Updated weights on worker 0-0, policy_version 781263 (0.00088) [2022-07-10 15:26:22,886][25689] Fps is (10 sec: 5772.2, 60 sec: 5545.2, 300 sec: 5563.8). Total num frames: 800023552. Throughput: 0: 5721.6. Samples: 800029230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:22,886][25689] Avg episode reward: [(0, '-0.025')] [2022-07-10 15:26:22,906][26022] Updated weights on worker 0-0, policy_version 781273 (0.00237) [2022-07-10 15:26:24,843][26022] Updated weights on worker 0-0, policy_version 781283 (0.00080) [2022-07-10 15:26:26,824][26022] Updated weights on worker 0-0, policy_version 781293 (0.00089) [2022-07-10 15:26:27,925][25689] Fps is (10 sec: 5582.7, 60 sec: 5563.1, 300 sec: 5563.2). Total num frames: 800051200. Throughput: 0: 4995.9. Samples: 800045658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:27,926][25689] Avg episode reward: [(0, '-0.947')] [2022-07-10 15:26:28,626][26022] Updated weights on worker 0-0, policy_version 781303 (0.00085) [2022-07-10 15:26:30,560][26022] Updated weights on worker 0-0, policy_version 781313 (0.00090) [2022-07-10 15:26:32,228][26022] Updated weights on worker 0-0, policy_version 781323 (0.00092) [2022-07-10 15:26:32,954][25689] Fps is (10 sec: 5492.0, 60 sec: 5545.5, 300 sec: 5556.3). Total num frames: 800078848. Throughput: 0: 5813.7. Samples: 800078982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:32,955][25689] Avg episode reward: [(0, '-1.074')] [2022-07-10 15:26:34,244][26022] Updated weights on worker 0-0, policy_version 781333 (0.00091) [2022-07-10 15:26:35,906][26022] Updated weights on worker 0-0, policy_version 781343 (0.00123) [2022-07-10 15:26:37,835][26022] Updated weights on worker 0-0, policy_version 781353 (0.00088) [2022-07-10 15:26:37,966][25689] Fps is (10 sec: 5507.1, 60 sec: 5528.5, 300 sec: 5560.2). Total num frames: 800106496. Throughput: 0: 5821.6. Samples: 800112698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:37,966][25689] Avg episode reward: [(0, '-1.644')] [2022-07-10 15:26:39,556][26022] Updated weights on worker 0-0, policy_version 781363 (0.00085) [2022-07-10 15:26:41,557][26022] Updated weights on worker 0-0, policy_version 781373 (0.00089) [2022-07-10 15:26:41,650][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:26:41,663][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000781374_800126976.pth [2022-07-10 15:26:41,664][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000779419_798125056.pth [2022-07-10 15:26:43,047][25689] Fps is (10 sec: 5580.4, 60 sec: 5555.7, 300 sec: 5561.0). Total num frames: 800135168. Throughput: 0: 4986.7. Samples: 800129766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:43,047][25689] Avg episode reward: [(0, '-2.394')] [2022-07-10 15:26:43,098][26022] Updated weights on worker 0-0, policy_version 781383 (0.00090) [2022-07-10 15:26:45,137][26022] Updated weights on worker 0-0, policy_version 781393 (0.00086) [2022-07-10 15:26:46,689][26022] Updated weights on worker 0-0, policy_version 781403 (0.00087) [2022-07-10 15:26:48,112][25689] Fps is (10 sec: 5651.7, 60 sec: 5560.4, 300 sec: 5560.5). Total num frames: 800163840. Throughput: 0: 5824.2. Samples: 800163232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:48,113][25689] Avg episode reward: [(0, '-1.880')] [2022-07-10 15:26:48,727][26022] Updated weights on worker 0-0, policy_version 781413 (0.00084) [2022-07-10 15:26:50,635][26022] Updated weights on worker 0-0, policy_version 781423 (0.00083) [2022-07-10 15:26:52,278][26022] Updated weights on worker 0-0, policy_version 781433 (0.00079) [2022-07-10 15:26:53,126][25689] Fps is (10 sec: 5587.6, 60 sec: 5563.4, 300 sec: 5560.9). Total num frames: 800191488. Throughput: 0: 5836.5. Samples: 800196716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:53,127][25689] Avg episode reward: [(0, '-2.091')] [2022-07-10 15:26:54,216][26022] Updated weights on worker 0-0, policy_version 781443 (0.00087) [2022-07-10 15:26:56,069][26022] Updated weights on worker 0-0, policy_version 781453 (0.00095) [2022-07-10 15:26:57,888][26022] Updated weights on worker 0-0, policy_version 781463 (0.00094) [2022-07-10 15:26:58,161][25689] Fps is (10 sec: 5604.8, 60 sec: 5561.4, 300 sec: 5560.4). Total num frames: 800220160. Throughput: 0: 4990.6. Samples: 800213482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:26:58,161][25689] Avg episode reward: [(0, '-0.741')] [2022-07-10 15:26:59,899][26022] Updated weights on worker 0-0, policy_version 781473 (0.00056) [2022-07-10 15:27:01,548][26022] Updated weights on worker 0-0, policy_version 781483 (0.00097) [2022-07-10 15:27:03,192][25689] Fps is (10 sec: 5290.2, 60 sec: 5544.5, 300 sec: 5550.6). Total num frames: 800244736. Throughput: 0: 5823.5. Samples: 800247080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:03,192][25689] Avg episode reward: [(0, '0.052')] [2022-07-10 15:27:03,959][26022] Updated weights on worker 0-0, policy_version 781493 (0.00089) [2022-07-10 15:27:05,634][26022] Updated weights on worker 0-0, policy_version 781503 (0.00095) [2022-07-10 15:27:07,452][26022] Updated weights on worker 0-0, policy_version 781513 (0.00085) [2022-07-10 15:27:08,280][25689] Fps is (10 sec: 5262.0, 60 sec: 5548.4, 300 sec: 5556.0). Total num frames: 800273408. Throughput: 0: 5720.0. Samples: 800278592. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:08,280][25689] Avg episode reward: [(0, '-1.315')] [2022-07-10 15:27:09,383][26022] Updated weights on worker 0-0, policy_version 781523 (0.00089) [2022-07-10 15:27:10,833][26022] Updated weights on worker 0-0, policy_version 781533 (0.00089) [2022-07-10 15:27:12,975][26022] Updated weights on worker 0-0, policy_version 781543 (0.00087) [2022-07-10 15:27:13,287][25689] Fps is (10 sec: 5578.8, 60 sec: 5550.4, 300 sec: 5554.0). Total num frames: 800301056. Throughput: 0: 4906.8. Samples: 800295642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:13,288][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 15:27:14,579][26022] Updated weights on worker 0-0, policy_version 781553 (0.00097) [2022-07-10 15:27:16,619][26022] Updated weights on worker 0-0, policy_version 781563 (0.00085) [2022-07-10 15:27:18,232][26022] Updated weights on worker 0-0, policy_version 781573 (0.00091) [2022-07-10 15:27:18,305][25689] Fps is (10 sec: 5720.1, 60 sec: 5554.5, 300 sec: 5554.1). Total num frames: 800330752. Throughput: 0: 5758.3. Samples: 800329478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:18,305][25689] Avg episode reward: [(0, '-0.551')] [2022-07-10 15:27:20,133][26022] Updated weights on worker 0-0, policy_version 781583 (0.00091) [2022-07-10 15:27:22,025][26022] Updated weights on worker 0-0, policy_version 781593 (0.00081) [2022-07-10 15:27:23,324][25689] Fps is (10 sec: 5815.4, 60 sec: 5557.3, 300 sec: 5562.5). Total num frames: 800359424. Throughput: 0: 5777.7. Samples: 800363398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:23,326][25689] Avg episode reward: [(0, '-1.633')] [2022-07-10 15:27:23,592][26022] Updated weights on worker 0-0, policy_version 781603 (0.00086) [2022-07-10 15:27:25,692][26022] Updated weights on worker 0-0, policy_version 781613 (0.00097) [2022-07-10 15:27:27,547][26022] Updated weights on worker 0-0, policy_version 781623 (0.00087) [2022-07-10 15:27:28,387][25689] Fps is (10 sec: 5484.4, 60 sec: 5538.1, 300 sec: 5561.6). Total num frames: 800386048. Throughput: 0: 5037.2. Samples: 800379876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:28,389][25689] Avg episode reward: [(0, '-1.840')] [2022-07-10 15:27:29,234][26022] Updated weights on worker 0-0, policy_version 781633 (0.00094) [2022-07-10 15:27:31,308][26022] Updated weights on worker 0-0, policy_version 781643 (0.00097) [2022-07-10 15:27:32,950][26022] Updated weights on worker 0-0, policy_version 781653 (0.00091) [2022-07-10 15:27:33,411][25689] Fps is (10 sec: 5482.1, 60 sec: 5555.6, 300 sec: 5554.5). Total num frames: 800414720. Throughput: 0: 5857.6. Samples: 800413516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:33,411][25689] Avg episode reward: [(0, '-2.450')] [2022-07-10 15:27:34,759][26022] Updated weights on worker 0-0, policy_version 781663 (0.00085) [2022-07-10 15:27:36,642][26022] Updated weights on worker 0-0, policy_version 781673 (0.00083) [2022-07-10 15:27:38,428][25689] Fps is (10 sec: 5609.4, 60 sec: 5555.1, 300 sec: 5555.0). Total num frames: 800442368. Throughput: 0: 5866.8. Samples: 800447534. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:38,428][25689] Avg episode reward: [(0, '-0.920')] [2022-07-10 15:27:38,436][26022] Updated weights on worker 0-0, policy_version 781683 (0.00090) [2022-07-10 15:27:40,543][26022] Updated weights on worker 0-0, policy_version 781693 (0.00085) [2022-07-10 15:27:41,932][26022] Updated weights on worker 0-0, policy_version 781703 (0.00088) [2022-07-10 15:27:43,447][25689] Fps is (10 sec: 5611.8, 60 sec: 5560.8, 300 sec: 5555.6). Total num frames: 800471040. Throughput: 0: 5009.7. Samples: 800464208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:43,447][25689] Avg episode reward: [(0, '-2.097')] [2022-07-10 15:27:44,116][26022] Updated weights on worker 0-0, policy_version 781713 (0.00080) [2022-07-10 15:27:45,711][26022] Updated weights on worker 0-0, policy_version 781723 (0.00086) [2022-07-10 15:27:47,692][26022] Updated weights on worker 0-0, policy_version 781733 (0.00085) [2022-07-10 15:27:48,493][25689] Fps is (10 sec: 5595.6, 60 sec: 5545.6, 300 sec: 5558.2). Total num frames: 800498688. Throughput: 0: 5872.6. Samples: 800497946. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:48,493][25689] Avg episode reward: [(0, '-1.412')] [2022-07-10 15:27:49,243][26022] Updated weights on worker 0-0, policy_version 781743 (0.00087) [2022-07-10 15:27:51,204][26022] Updated weights on worker 0-0, policy_version 781753 (0.00086) [2022-07-10 15:27:52,940][26022] Updated weights on worker 0-0, policy_version 781763 (0.00090) [2022-07-10 15:27:53,510][25689] Fps is (10 sec: 5494.6, 60 sec: 5545.3, 300 sec: 5554.6). Total num frames: 800526336. Throughput: 0: 5875.7. Samples: 800531616. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:53,511][25689] Avg episode reward: [(0, '-1.353')] [2022-07-10 15:27:54,915][26022] Updated weights on worker 0-0, policy_version 781773 (0.00093) [2022-07-10 15:27:56,868][26022] Updated weights on worker 0-0, policy_version 781783 (0.00610) [2022-07-10 15:27:58,518][25689] Fps is (10 sec: 5515.8, 60 sec: 5530.8, 300 sec: 5551.3). Total num frames: 800553984. Throughput: 0: 5016.0. Samples: 800548306. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:27:58,518][25689] Avg episode reward: [(0, '-2.279')] [2022-07-10 15:27:58,688][26022] Updated weights on worker 0-0, policy_version 781793 (0.00090) [2022-07-10 15:28:00,608][26022] Updated weights on worker 0-0, policy_version 781803 (0.00091) [2022-07-10 15:28:02,763][26022] Updated weights on worker 0-0, policy_version 781813 (0.00088) [2022-07-10 15:28:03,551][25689] Fps is (10 sec: 5405.4, 60 sec: 5564.6, 300 sec: 5558.7). Total num frames: 800580608. Throughput: 0: 5813.4. Samples: 800581080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:28:03,551][25689] Avg episode reward: [(0, '-2.151')] [2022-07-10 15:28:04,471][26022] Updated weights on worker 0-0, policy_version 781823 (0.00085) [2022-07-10 15:28:06,317][26022] Updated weights on worker 0-0, policy_version 781833 (0.00087) [2022-07-10 15:28:08,275][26022] Updated weights on worker 0-0, policy_version 781843 (0.00089) [2022-07-10 15:28:08,664][25689] Fps is (10 sec: 5349.0, 60 sec: 5545.3, 300 sec: 5549.9). Total num frames: 800608256. Throughput: 0: 5683.9. Samples: 800612596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 15:28:08,666][25689] Avg episode reward: [(0, '-1.382')] [2022-07-10 15:28:10,025][26022] Updated weights on worker 0-0, policy_version 781853 (0.00086) [2022-07-10 15:28:11,722][26022] Updated weights on worker 0-0, policy_version 781863 (0.00080) [2022-07-10 15:28:13,686][25689] Fps is (10 sec: 5556.6, 60 sec: 5560.9, 300 sec: 5553.5). Total num frames: 800636928. Throughput: 0: 5701.4. Samples: 800646648. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:13,687][25689] Avg episode reward: [(0, '-0.816')] [2022-07-10 15:28:13,703][26022] Updated weights on worker 0-0, policy_version 781873 (0.00090) [2022-07-10 15:28:15,417][26022] Updated weights on worker 0-0, policy_version 781883 (0.00090) [2022-07-10 15:28:17,283][26022] Updated weights on worker 0-0, policy_version 781893 (0.00091) [2022-07-10 15:28:18,694][25689] Fps is (10 sec: 5717.0, 60 sec: 5544.8, 300 sec: 5554.1). Total num frames: 800665600. Throughput: 0: 5708.8. Samples: 800663490. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:18,695][25689] Avg episode reward: [(0, '-0.943')] [2022-07-10 15:28:19,131][26022] Updated weights on worker 0-0, policy_version 781903 (0.00083) [2022-07-10 15:28:20,867][26022] Updated weights on worker 0-0, policy_version 781913 (0.00085) [2022-07-10 15:28:22,741][26022] Updated weights on worker 0-0, policy_version 781923 (0.00085) [2022-07-10 15:28:23,709][25689] Fps is (10 sec: 5823.7, 60 sec: 5562.2, 300 sec: 5558.2). Total num frames: 800695296. Throughput: 0: 5774.5. Samples: 800697482. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:23,711][25689] Avg episode reward: [(0, '-0.031')] [2022-07-10 15:28:24,483][26022] Updated weights on worker 0-0, policy_version 781933 (0.00098) [2022-07-10 15:28:26,432][26022] Updated weights on worker 0-0, policy_version 781943 (0.00093) [2022-07-10 15:28:28,384][26022] Updated weights on worker 0-0, policy_version 781953 (0.00087) [2022-07-10 15:28:28,750][25689] Fps is (10 sec: 5600.8, 60 sec: 5564.2, 300 sec: 5557.6). Total num frames: 800721920. Throughput: 0: 5894.5. Samples: 800730994. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:28,752][25689] Avg episode reward: [(0, '0.388')] [2022-07-10 15:28:30,027][26022] Updated weights on worker 0-0, policy_version 781963 (0.00087) [2022-07-10 15:28:31,964][26022] Updated weights on worker 0-0, policy_version 781973 (0.00084) [2022-07-10 15:28:33,475][26022] Updated weights on worker 0-0, policy_version 781983 (0.00090) [2022-07-10 15:28:33,765][25689] Fps is (10 sec: 5600.3, 60 sec: 5581.9, 300 sec: 5560.8). Total num frames: 800751616. Throughput: 0: 5043.0. Samples: 800747906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:33,766][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 15:28:35,650][26022] Updated weights on worker 0-0, policy_version 781993 (0.00094) [2022-07-10 15:28:37,109][26022] Updated weights on worker 0-0, policy_version 782003 (0.00087) [2022-07-10 15:28:38,771][25689] Fps is (10 sec: 5518.3, 60 sec: 5549.1, 300 sec: 5554.2). Total num frames: 800777216. Throughput: 0: 5880.1. Samples: 800781540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:38,773][25689] Avg episode reward: [(0, '-0.179')] [2022-07-10 15:28:39,319][26022] Updated weights on worker 0-0, policy_version 782013 (0.00088) [2022-07-10 15:28:40,927][26022] Updated weights on worker 0-0, policy_version 782023 (0.00083) [2022-07-10 15:28:41,769][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:28:41,778][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000782026_800794624.pth [2022-07-10 15:28:41,780][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000780072_798793728.pth [2022-07-10 15:28:42,891][26022] Updated weights on worker 0-0, policy_version 782033 (0.00096) [2022-07-10 15:28:43,788][25689] Fps is (10 sec: 5517.3, 60 sec: 5566.2, 300 sec: 5556.1). Total num frames: 800806912. Throughput: 0: 5865.0. Samples: 800815246. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:43,790][25689] Avg episode reward: [(0, '0.147')] [2022-07-10 15:28:44,772][26022] Updated weights on worker 0-0, policy_version 782043 (0.00091) [2022-07-10 15:28:46,558][26022] Updated weights on worker 0-0, policy_version 782053 (0.00098) [2022-07-10 15:28:48,353][26022] Updated weights on worker 0-0, policy_version 782063 (0.00086) [2022-07-10 15:28:48,831][25689] Fps is (10 sec: 5700.2, 60 sec: 5566.5, 300 sec: 5556.6). Total num frames: 800834560. Throughput: 0: 5019.1. Samples: 800831780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:48,831][25689] Avg episode reward: [(0, '-0.668')] [2022-07-10 15:28:50,170][26022] Updated weights on worker 0-0, policy_version 782073 (0.00107) [2022-07-10 15:28:52,084][26022] Updated weights on worker 0-0, policy_version 782083 (0.00090) [2022-07-10 15:28:53,854][25689] Fps is (10 sec: 5493.0, 60 sec: 5565.9, 300 sec: 5554.1). Total num frames: 800862208. Throughput: 0: 5827.0. Samples: 800864964. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:53,855][25689] Avg episode reward: [(0, '-0.868')] [2022-07-10 15:28:54,047][26022] Updated weights on worker 0-0, policy_version 782093 (0.00084) [2022-07-10 15:28:55,848][26022] Updated weights on worker 0-0, policy_version 782103 (0.00083) [2022-07-10 15:28:57,593][26022] Updated weights on worker 0-0, policy_version 782113 (0.00086) [2022-07-10 15:28:58,880][25689] Fps is (10 sec: 5604.5, 60 sec: 5581.3, 300 sec: 5558.2). Total num frames: 800890880. Throughput: 0: 5826.9. Samples: 800898714. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:28:58,880][25689] Avg episode reward: [(0, '-0.934')] [2022-07-10 15:28:59,412][26022] Updated weights on worker 0-0, policy_version 782123 (0.00079) [2022-07-10 15:29:01,361][26022] Updated weights on worker 0-0, policy_version 782133 (0.00087) [2022-07-10 15:29:03,468][26022] Updated weights on worker 0-0, policy_version 782143 (0.00083) [2022-07-10 15:29:03,904][25689] Fps is (10 sec: 5502.3, 60 sec: 5582.1, 300 sec: 5552.0). Total num frames: 800917504. Throughput: 0: 4951.7. Samples: 800914856. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:03,905][25689] Avg episode reward: [(0, '-2.031')] [2022-07-10 15:29:05,184][26022] Updated weights on worker 0-0, policy_version 782153 (0.00083) [2022-07-10 15:29:07,045][26022] Updated weights on worker 0-0, policy_version 782163 (0.00086) [2022-07-10 15:29:08,610][26022] Updated weights on worker 0-0, policy_version 782173 (0.00095) [2022-07-10 15:29:08,986][25689] Fps is (10 sec: 5370.0, 60 sec: 5584.9, 300 sec: 5552.2). Total num frames: 800945152. Throughput: 0: 5755.3. Samples: 800947782. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:08,987][25689] Avg episode reward: [(0, '-2.908')] [2022-07-10 15:29:10,735][26022] Updated weights on worker 0-0, policy_version 782183 (0.00090) [2022-07-10 15:29:12,335][26022] Updated weights on worker 0-0, policy_version 782193 (0.00088) [2022-07-10 15:29:14,051][25689] Fps is (10 sec: 5550.2, 60 sec: 5581.0, 300 sec: 5556.3). Total num frames: 800973824. Throughput: 0: 5778.5. Samples: 800981672. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:14,052][25689] Avg episode reward: [(0, '-2.459')] [2022-07-10 15:29:14,254][26022] Updated weights on worker 0-0, policy_version 782203 (0.00091) [2022-07-10 15:29:16,064][26022] Updated weights on worker 0-0, policy_version 782213 (0.00088) [2022-07-10 15:29:17,706][26022] Updated weights on worker 0-0, policy_version 782223 (0.00083) [2022-07-10 15:29:19,105][25689] Fps is (10 sec: 5666.9, 60 sec: 5576.7, 300 sec: 5551.9). Total num frames: 801002496. Throughput: 0: 4952.4. Samples: 800998880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:19,106][25689] Avg episode reward: [(0, '-1.835')] [2022-07-10 15:29:19,864][26022] Updated weights on worker 0-0, policy_version 782233 (0.00092) [2022-07-10 15:29:21,248][26022] Updated weights on worker 0-0, policy_version 782243 (0.00090) [2022-07-10 15:29:23,411][26022] Updated weights on worker 0-0, policy_version 782253 (0.00054) [2022-07-10 15:29:24,170][25689] Fps is (10 sec: 5869.7, 60 sec: 5589.1, 300 sec: 5563.1). Total num frames: 801033216. Throughput: 0: 5803.8. Samples: 801032474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:24,170][25689] Avg episode reward: [(0, '-3.133')] [2022-07-10 15:29:25,293][26022] Updated weights on worker 0-0, policy_version 782263 (0.00090) [2022-07-10 15:29:27,018][26022] Updated weights on worker 0-0, policy_version 782273 (0.00085) [2022-07-10 15:29:29,094][26022] Updated weights on worker 0-0, policy_version 782283 (0.00089) [2022-07-10 15:29:29,292][25689] Fps is (10 sec: 5428.2, 60 sec: 5547.7, 300 sec: 5550.6). Total num frames: 801057792. Throughput: 0: 5808.0. Samples: 801065718. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:29,293][25689] Avg episode reward: [(0, '-2.247')] [2022-07-10 15:29:30,731][26022] Updated weights on worker 0-0, policy_version 782293 (0.00090) [2022-07-10 15:29:32,778][26022] Updated weights on worker 0-0, policy_version 782303 (0.00086) [2022-07-10 15:29:34,315][25689] Fps is (10 sec: 5349.4, 60 sec: 5547.0, 300 sec: 5555.0). Total num frames: 801087488. Throughput: 0: 4978.8. Samples: 801082562. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:34,317][25689] Avg episode reward: [(0, '-1.787')] [2022-07-10 15:29:34,418][26022] Updated weights on worker 0-0, policy_version 782313 (0.00092) [2022-07-10 15:29:36,269][26022] Updated weights on worker 0-0, policy_version 782323 (0.00088) [2022-07-10 15:29:38,238][26022] Updated weights on worker 0-0, policy_version 782333 (0.00636) [2022-07-10 15:29:39,321][25689] Fps is (10 sec: 5717.9, 60 sec: 5580.8, 300 sec: 5555.6). Total num frames: 801115136. Throughput: 0: 5796.1. Samples: 801116052. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:39,322][25689] Avg episode reward: [(0, '-1.477')] [2022-07-10 15:29:39,825][26022] Updated weights on worker 0-0, policy_version 782343 (0.00086) [2022-07-10 15:29:41,830][26022] Updated weights on worker 0-0, policy_version 782353 (0.00088) [2022-07-10 15:29:43,412][26022] Updated weights on worker 0-0, policy_version 782363 (0.00631) [2022-07-10 15:29:44,348][25689] Fps is (10 sec: 5511.4, 60 sec: 5546.1, 300 sec: 5553.5). Total num frames: 801142784. Throughput: 0: 5816.1. Samples: 801149834. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:44,349][25689] Avg episode reward: [(0, '-1.630')] [2022-07-10 15:29:45,349][26022] Updated weights on worker 0-0, policy_version 782373 (0.00087) [2022-07-10 15:29:47,213][26022] Updated weights on worker 0-0, policy_version 782383 (0.00086) [2022-07-10 15:29:49,011][26022] Updated weights on worker 0-0, policy_version 782393 (0.00084) [2022-07-10 15:29:49,415][25689] Fps is (10 sec: 5680.9, 60 sec: 5577.7, 300 sec: 5555.7). Total num frames: 801172480. Throughput: 0: 5012.9. Samples: 801166594. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:49,416][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 15:29:50,831][26022] Updated weights on worker 0-0, policy_version 782403 (0.00097) [2022-07-10 15:29:52,869][26022] Updated weights on worker 0-0, policy_version 782413 (0.00087) [2022-07-10 15:29:54,451][25689] Fps is (10 sec: 5574.9, 60 sec: 5559.7, 300 sec: 5552.1). Total num frames: 801199104. Throughput: 0: 5843.6. Samples: 801200226. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:54,451][25689] Avg episode reward: [(0, '0.292')] [2022-07-10 15:29:54,627][26022] Updated weights on worker 0-0, policy_version 782423 (0.00087) [2022-07-10 15:29:56,671][26022] Updated weights on worker 0-0, policy_version 782433 (0.00089) [2022-07-10 15:29:58,067][26022] Updated weights on worker 0-0, policy_version 782443 (0.00102) [2022-07-10 15:29:59,480][25689] Fps is (10 sec: 5392.3, 60 sec: 5542.4, 300 sec: 5558.6). Total num frames: 801226752. Throughput: 0: 5845.2. Samples: 801233886. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:29:59,481][25689] Avg episode reward: [(0, '0.405')] [2022-07-10 15:30:00,291][26022] Updated weights on worker 0-0, policy_version 782453 (0.00083) [2022-07-10 15:30:02,108][26022] Updated weights on worker 0-0, policy_version 782463 (0.00086) [2022-07-10 15:30:04,156][26022] Updated weights on worker 0-0, policy_version 782473 (0.00095) [2022-07-10 15:30:04,502][25689] Fps is (10 sec: 5399.5, 60 sec: 5542.6, 300 sec: 5548.9). Total num frames: 801253376. Throughput: 0: 4908.2. Samples: 801248752. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:04,502][25689] Avg episode reward: [(0, '0.552')] [2022-07-10 15:30:05,862][26022] Updated weights on worker 0-0, policy_version 782483 (0.00088) [2022-07-10 15:30:07,895][26022] Updated weights on worker 0-0, policy_version 782493 (0.00093) [2022-07-10 15:30:09,547][25689] Fps is (10 sec: 5493.0, 60 sec: 5563.0, 300 sec: 5556.1). Total num frames: 801282048. Throughput: 0: 5740.1. Samples: 801282150. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:09,547][25689] Avg episode reward: [(0, '-0.043')] [2022-07-10 15:30:09,627][26022] Updated weights on worker 0-0, policy_version 782503 (0.00090) [2022-07-10 15:30:11,611][26022] Updated weights on worker 0-0, policy_version 782513 (0.00083) [2022-07-10 15:30:13,242][26022] Updated weights on worker 0-0, policy_version 782523 (0.00089) [2022-07-10 15:30:14,552][25689] Fps is (10 sec: 5706.0, 60 sec: 5568.5, 300 sec: 5556.4). Total num frames: 801310720. Throughput: 0: 5734.0. Samples: 801315486. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:14,552][25689] Avg episode reward: [(0, '-1.413')] [2022-07-10 15:30:15,259][26022] Updated weights on worker 0-0, policy_version 782533 (0.00091) [2022-07-10 15:30:16,970][26022] Updated weights on worker 0-0, policy_version 782543 (0.00081) [2022-07-10 15:30:18,754][26022] Updated weights on worker 0-0, policy_version 782553 (0.00085) [2022-07-10 15:30:19,562][25689] Fps is (10 sec: 5623.3, 60 sec: 5555.6, 300 sec: 5553.3). Total num frames: 801338368. Throughput: 0: 4908.7. Samples: 801332462. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:19,563][25689] Avg episode reward: [(0, '-2.394')] [2022-07-10 15:30:20,542][26022] Updated weights on worker 0-0, policy_version 782563 (0.00090) [2022-07-10 15:30:22,132][26022] Updated weights on worker 0-0, policy_version 782573 (0.00083) [2022-07-10 15:30:24,205][26022] Updated weights on worker 0-0, policy_version 782583 (0.00084) [2022-07-10 15:30:24,627][25689] Fps is (10 sec: 5590.1, 60 sec: 5521.7, 300 sec: 5559.9). Total num frames: 801367040. Throughput: 0: 5848.4. Samples: 801366450. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:24,627][25689] Avg episode reward: [(0, '-2.331')] [2022-07-10 15:30:26,127][26022] Updated weights on worker 0-0, policy_version 782593 (0.00085) [2022-07-10 15:30:27,893][26022] Updated weights on worker 0-0, policy_version 782603 (0.00086) [2022-07-10 15:30:29,734][25689] Fps is (10 sec: 5537.0, 60 sec: 5573.9, 300 sec: 5554.9). Total num frames: 801394688. Throughput: 0: 5832.0. Samples: 801399880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:29,734][25689] Avg episode reward: [(0, '-2.423')] [2022-07-10 15:30:29,900][26022] Updated weights on worker 0-0, policy_version 782613 (0.00090) [2022-07-10 15:30:31,610][26022] Updated weights on worker 0-0, policy_version 782623 (0.00096) [2022-07-10 15:30:33,345][26022] Updated weights on worker 0-0, policy_version 782633 (0.00087) [2022-07-10 15:30:34,781][25689] Fps is (10 sec: 5546.6, 60 sec: 5554.8, 300 sec: 5554.2). Total num frames: 801423360. Throughput: 0: 4994.4. Samples: 801416518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:34,782][25689] Avg episode reward: [(0, '-2.283')] [2022-07-10 15:30:35,441][26022] Updated weights on worker 0-0, policy_version 782643 (0.00113) [2022-07-10 15:30:37,018][26022] Updated weights on worker 0-0, policy_version 782653 (0.00086) [2022-07-10 15:30:38,989][26022] Updated weights on worker 0-0, policy_version 782663 (0.00091) [2022-07-10 15:30:39,859][25689] Fps is (10 sec: 5663.5, 60 sec: 5565.1, 300 sec: 5559.8). Total num frames: 801452032. Throughput: 0: 5789.5. Samples: 801449970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:39,859][25689] Avg episode reward: [(0, '-2.535')] [2022-07-10 15:30:40,801][26022] Updated weights on worker 0-0, policy_version 782673 (0.00083) [2022-07-10 15:30:41,849][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:30:41,868][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000782678_801462272.pth [2022-07-10 15:30:41,869][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000780725_799462400.pth [2022-07-10 15:30:42,544][26022] Updated weights on worker 0-0, policy_version 782683 (0.00096) [2022-07-10 15:30:44,517][26022] Updated weights on worker 0-0, policy_version 782693 (0.00093) [2022-07-10 15:30:44,884][25689] Fps is (10 sec: 5574.4, 60 sec: 5565.2, 300 sec: 5558.0). Total num frames: 801479680. Throughput: 0: 5778.2. Samples: 801483500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:44,884][25689] Avg episode reward: [(0, '-1.252')] [2022-07-10 15:30:46,112][26022] Updated weights on worker 0-0, policy_version 782703 (0.00088) [2022-07-10 15:30:48,203][26022] Updated weights on worker 0-0, policy_version 782713 (0.00840) [2022-07-10 15:30:49,964][25689] Fps is (10 sec: 5472.1, 60 sec: 5530.3, 300 sec: 5557.4). Total num frames: 801507328. Throughput: 0: 5792.5. Samples: 801517064. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:49,964][25689] Avg episode reward: [(0, '-0.810')] [2022-07-10 15:30:50,041][26022] Updated weights on worker 0-0, policy_version 782723 (0.00086) [2022-07-10 15:30:51,719][26022] Updated weights on worker 0-0, policy_version 782733 (0.00086) [2022-07-10 15:30:53,629][26022] Updated weights on worker 0-0, policy_version 782743 (0.00083) [2022-07-10 15:30:54,976][25689] Fps is (10 sec: 5682.1, 60 sec: 5583.1, 300 sec: 5560.9). Total num frames: 801537024. Throughput: 0: 5823.7. Samples: 801534130. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:30:54,979][25689] Avg episode reward: [(0, '-0.956')] [2022-07-10 15:30:55,334][26022] Updated weights on worker 0-0, policy_version 782753 (0.00087) [2022-07-10 15:30:57,224][26022] Updated weights on worker 0-0, policy_version 782763 (0.00087) [2022-07-10 15:30:59,008][26022] Updated weights on worker 0-0, policy_version 782773 (0.00091) [2022-07-10 15:31:00,006][25689] Fps is (10 sec: 5710.3, 60 sec: 5583.1, 300 sec: 5567.8). Total num frames: 801564672. Throughput: 0: 5858.0. Samples: 801567992. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:00,008][25689] Avg episode reward: [(0, '-1.956')] [2022-07-10 15:31:00,792][26022] Updated weights on worker 0-0, policy_version 782783 (0.00093) [2022-07-10 15:31:03,096][26022] Updated weights on worker 0-0, policy_version 782793 (0.00090) [2022-07-10 15:31:05,011][25689] Fps is (10 sec: 5204.4, 60 sec: 5550.8, 300 sec: 5556.3). Total num frames: 801589248. Throughput: 0: 5764.2. Samples: 801599514. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:05,011][25689] Avg episode reward: [(0, '-2.407')] [2022-07-10 15:31:05,039][26022] Updated weights on worker 0-0, policy_version 782803 (0.00089) [2022-07-10 15:31:06,750][26022] Updated weights on worker 0-0, policy_version 782813 (0.00089) [2022-07-10 15:31:08,363][26022] Updated weights on worker 0-0, policy_version 782823 (0.00087) [2022-07-10 15:31:10,087][25689] Fps is (10 sec: 5383.7, 60 sec: 5564.8, 300 sec: 5562.4). Total num frames: 801618944. Throughput: 0: 4934.4. Samples: 801616358. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:10,087][25689] Avg episode reward: [(0, '-1.902')] [2022-07-10 15:31:10,556][26022] Updated weights on worker 0-0, policy_version 782833 (0.00091) [2022-07-10 15:31:12,244][26022] Updated weights on worker 0-0, policy_version 782843 (0.00099) [2022-07-10 15:31:14,018][26022] Updated weights on worker 0-0, policy_version 782853 (0.00091) [2022-07-10 15:31:15,092][25689] Fps is (10 sec: 5789.4, 60 sec: 5564.8, 300 sec: 5560.0). Total num frames: 801647616. Throughput: 0: 5765.5. Samples: 801650112. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:15,094][25689] Avg episode reward: [(0, '-2.305')] [2022-07-10 15:31:15,870][26022] Updated weights on worker 0-0, policy_version 782863 (0.00103) [2022-07-10 15:31:17,638][26022] Updated weights on worker 0-0, policy_version 782873 (0.00101) [2022-07-10 15:31:19,620][26022] Updated weights on worker 0-0, policy_version 782883 (0.00091) [2022-07-10 15:31:20,108][25689] Fps is (10 sec: 5620.0, 60 sec: 5564.3, 300 sec: 5557.2). Total num frames: 801675264. Throughput: 0: 5768.7. Samples: 801683956. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:20,110][25689] Avg episode reward: [(0, '-1.456')] [2022-07-10 15:31:21,234][26022] Updated weights on worker 0-0, policy_version 782893 (0.00085) [2022-07-10 15:31:23,115][26022] Updated weights on worker 0-0, policy_version 782903 (0.00086) [2022-07-10 15:31:24,914][26022] Updated weights on worker 0-0, policy_version 782913 (0.00094) [2022-07-10 15:31:25,111][25689] Fps is (10 sec: 5519.6, 60 sec: 5553.1, 300 sec: 5557.8). Total num frames: 801702912. Throughput: 0: 5038.7. Samples: 801700792. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:25,111][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 15:31:26,755][26022] Updated weights on worker 0-0, policy_version 782923 (0.00087) [2022-07-10 15:31:28,804][26022] Updated weights on worker 0-0, policy_version 782933 (0.00091) [2022-07-10 15:31:30,219][25689] Fps is (10 sec: 5570.5, 60 sec: 5569.9, 300 sec: 5559.8). Total num frames: 801731584. Throughput: 0: 5850.3. Samples: 801734134. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:30,219][25689] Avg episode reward: [(0, '-1.071')] [2022-07-10 15:31:30,499][26022] Updated weights on worker 0-0, policy_version 782943 (0.00083) [2022-07-10 15:31:32,254][26022] Updated weights on worker 0-0, policy_version 782953 (0.00093) [2022-07-10 15:31:34,498][26022] Updated weights on worker 0-0, policy_version 782963 (0.00088) [2022-07-10 15:31:35,250][25689] Fps is (10 sec: 5655.5, 60 sec: 5571.4, 300 sec: 5562.9). Total num frames: 801760256. Throughput: 0: 5829.7. Samples: 801767624. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:35,251][25689] Avg episode reward: [(0, '-3.511')] [2022-07-10 15:31:35,999][26022] Updated weights on worker 0-0, policy_version 782973 (0.00092) [2022-07-10 15:31:38,071][26022] Updated weights on worker 0-0, policy_version 782983 (0.00094) [2022-07-10 15:31:39,668][26022] Updated weights on worker 0-0, policy_version 782993 (0.00091) [2022-07-10 15:31:40,271][25689] Fps is (10 sec: 5602.5, 60 sec: 5559.7, 300 sec: 5560.6). Total num frames: 801787904. Throughput: 0: 4987.6. Samples: 801784520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 15:31:40,272][25689] Avg episode reward: [(0, '-3.131')] [2022-07-10 15:31:41,791][26022] Updated weights on worker 0-0, policy_version 783003 (0.00095) [2022-07-10 15:31:43,272][26022] Updated weights on worker 0-0, policy_version 783013 (0.00100) [2022-07-10 15:31:45,295][25689] Fps is (10 sec: 5402.9, 60 sec: 5542.8, 300 sec: 5554.5). Total num frames: 801814528. Throughput: 0: 5808.1. Samples: 801818024. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:31:45,296][25689] Avg episode reward: [(0, '-4.067')] [2022-07-10 15:31:45,416][26022] Updated weights on worker 0-0, policy_version 783023 (0.00096) [2022-07-10 15:31:46,748][26022] Updated weights on worker 0-0, policy_version 783033 (0.00088) [2022-07-10 15:31:48,962][26022] Updated weights on worker 0-0, policy_version 783043 (0.00091) [2022-07-10 15:31:50,391][25689] Fps is (10 sec: 5666.3, 60 sec: 5592.1, 300 sec: 5563.3). Total num frames: 801845248. Throughput: 0: 5838.4. Samples: 801851912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:31:50,392][25689] Avg episode reward: [(0, '-4.582')] [2022-07-10 15:31:50,552][26022] Updated weights on worker 0-0, policy_version 783053 (0.00089) [2022-07-10 15:31:52,440][26022] Updated weights on worker 0-0, policy_version 783063 (0.00083) [2022-07-10 15:31:54,256][26022] Updated weights on worker 0-0, policy_version 783073 (0.00091) [2022-07-10 15:31:55,420][25689] Fps is (10 sec: 5663.6, 60 sec: 5539.8, 300 sec: 5556.5). Total num frames: 801871872. Throughput: 0: 5026.2. Samples: 801869002. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:31:55,420][25689] Avg episode reward: [(0, '-4.196')] [2022-07-10 15:31:55,959][26022] Updated weights on worker 0-0, policy_version 783083 (0.00086) [2022-07-10 15:31:57,804][26022] Updated weights on worker 0-0, policy_version 783093 (0.00093) [2022-07-10 15:31:59,641][26022] Updated weights on worker 0-0, policy_version 783103 (0.00084) [2022-07-10 15:32:00,472][25689] Fps is (10 sec: 5485.2, 60 sec: 5554.7, 300 sec: 5569.8). Total num frames: 801900544. Throughput: 0: 5850.1. Samples: 801902698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:00,475][25689] Avg episode reward: [(0, '-3.417')] [2022-07-10 15:32:01,457][26022] Updated weights on worker 0-0, policy_version 783113 (0.00089) [2022-07-10 15:32:03,836][26022] Updated weights on worker 0-0, policy_version 783123 (0.00092) [2022-07-10 15:32:05,478][25689] Fps is (10 sec: 5497.8, 60 sec: 5588.5, 300 sec: 5564.5). Total num frames: 801927168. Throughput: 0: 5753.9. Samples: 801934154. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:05,478][25689] Avg episode reward: [(0, '-1.871')] [2022-07-10 15:32:05,598][26022] Updated weights on worker 0-0, policy_version 783133 (0.00097) [2022-07-10 15:32:07,615][26022] Updated weights on worker 0-0, policy_version 783143 (0.00081) [2022-07-10 15:32:09,342][26022] Updated weights on worker 0-0, policy_version 783153 (0.00089) [2022-07-10 15:32:10,547][25689] Fps is (10 sec: 5387.2, 60 sec: 5555.3, 300 sec: 5563.4). Total num frames: 801954816. Throughput: 0: 4917.9. Samples: 801951034. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:10,547][25689] Avg episode reward: [(0, '-2.093')] [2022-07-10 15:32:11,175][26022] Updated weights on worker 0-0, policy_version 783163 (0.00086) [2022-07-10 15:32:13,205][26022] Updated weights on worker 0-0, policy_version 783173 (0.00096) [2022-07-10 15:32:14,723][26022] Updated weights on worker 0-0, policy_version 783183 (0.00087) [2022-07-10 15:32:15,565][25689] Fps is (10 sec: 5583.6, 60 sec: 5554.2, 300 sec: 5559.9). Total num frames: 801983488. Throughput: 0: 5728.5. Samples: 801984400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:15,567][25689] Avg episode reward: [(0, '-2.131')] [2022-07-10 15:32:16,838][26022] Updated weights on worker 0-0, policy_version 783193 (0.00083) [2022-07-10 15:32:18,658][26022] Updated weights on worker 0-0, policy_version 783203 (0.00092) [2022-07-10 15:32:20,305][26022] Updated weights on worker 0-0, policy_version 783213 (0.00092) [2022-07-10 15:32:20,593][25689] Fps is (10 sec: 5708.1, 60 sec: 5570.0, 300 sec: 5559.7). Total num frames: 802012160. Throughput: 0: 5729.6. Samples: 802017978. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:20,593][25689] Avg episode reward: [(0, '-1.359')] [2022-07-10 15:32:22,450][26022] Updated weights on worker 0-0, policy_version 783223 (0.00100) [2022-07-10 15:32:23,783][26022] Updated weights on worker 0-0, policy_version 783233 (0.00085) [2022-07-10 15:32:25,655][25689] Fps is (10 sec: 5378.8, 60 sec: 5530.7, 300 sec: 5556.3). Total num frames: 802037760. Throughput: 0: 4988.4. Samples: 802034804. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:25,655][25689] Avg episode reward: [(0, '-0.067')] [2022-07-10 15:32:25,951][26022] Updated weights on worker 0-0, policy_version 783243 (0.00086) [2022-07-10 15:32:27,659][26022] Updated weights on worker 0-0, policy_version 783253 (0.00101) [2022-07-10 15:32:29,639][26022] Updated weights on worker 0-0, policy_version 783263 (0.00095) [2022-07-10 15:32:30,771][25689] Fps is (10 sec: 5533.2, 60 sec: 5563.7, 300 sec: 5561.5). Total num frames: 802068480. Throughput: 0: 5798.3. Samples: 802068302. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:30,772][25689] Avg episode reward: [(0, '0.095')] [2022-07-10 15:32:31,406][26022] Updated weights on worker 0-0, policy_version 783273 (0.00084) [2022-07-10 15:32:33,008][26022] Updated weights on worker 0-0, policy_version 783283 (0.00100) [2022-07-10 15:32:35,112][26022] Updated weights on worker 0-0, policy_version 783293 (0.00086) [2022-07-10 15:32:35,777][25689] Fps is (10 sec: 5564.2, 60 sec: 5515.3, 300 sec: 5554.8). Total num frames: 802094080. Throughput: 0: 5818.0. Samples: 802101994. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:35,777][25689] Avg episode reward: [(0, '0.257')] [2022-07-10 15:32:36,711][26022] Updated weights on worker 0-0, policy_version 783303 (0.00081) [2022-07-10 15:32:38,843][26022] Updated weights on worker 0-0, policy_version 783313 (0.00086) [2022-07-10 15:32:40,339][26022] Updated weights on worker 0-0, policy_version 783323 (0.00088) [2022-07-10 15:32:40,783][25689] Fps is (10 sec: 5625.5, 60 sec: 5567.5, 300 sec: 5561.9). Total num frames: 802124800. Throughput: 0: 5004.6. Samples: 802119022. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:40,783][25689] Avg episode reward: [(0, '-0.436')] [2022-07-10 15:32:42,393][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:32:42,407][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000783332_802131968.pth [2022-07-10 15:32:42,407][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000781374_800126976.pth [2022-07-10 15:32:42,514][26022] Updated weights on worker 0-0, policy_version 783333 (0.00091) [2022-07-10 15:32:43,884][26022] Updated weights on worker 0-0, policy_version 783343 (0.00086) [2022-07-10 15:32:45,803][25689] Fps is (10 sec: 5719.4, 60 sec: 5567.8, 300 sec: 5559.0). Total num frames: 802151424. Throughput: 0: 5848.2. Samples: 802152634. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:45,803][25689] Avg episode reward: [(0, '0.084')] [2022-07-10 15:32:46,066][26022] Updated weights on worker 0-0, policy_version 783353 (0.00086) [2022-07-10 15:32:47,674][26022] Updated weights on worker 0-0, policy_version 783363 (0.00083) [2022-07-10 15:32:49,616][26022] Updated weights on worker 0-0, policy_version 783373 (0.00088) [2022-07-10 15:32:50,917][25689] Fps is (10 sec: 5456.3, 60 sec: 5532.3, 300 sec: 5560.6). Total num frames: 802180096. Throughput: 0: 5862.5. Samples: 802186408. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:50,918][25689] Avg episode reward: [(0, '-0.843')] [2022-07-10 15:32:51,509][26022] Updated weights on worker 0-0, policy_version 783383 (0.00083) [2022-07-10 15:32:53,328][26022] Updated weights on worker 0-0, policy_version 783393 (0.00087) [2022-07-10 15:32:55,136][26022] Updated weights on worker 0-0, policy_version 783403 (0.00086) [2022-07-10 15:32:55,923][25689] Fps is (10 sec: 5767.3, 60 sec: 5585.1, 300 sec: 5567.5). Total num frames: 802209792. Throughput: 0: 5029.7. Samples: 802203326. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:32:55,924][25689] Avg episode reward: [(0, '-1.355')] [2022-07-10 15:32:56,878][26022] Updated weights on worker 0-0, policy_version 783413 (0.00090) [2022-07-10 15:32:58,783][26022] Updated weights on worker 0-0, policy_version 783423 (0.00084) [2022-07-10 15:33:00,657][26022] Updated weights on worker 0-0, policy_version 783433 (0.00088) [2022-07-10 15:33:00,943][25689] Fps is (10 sec: 5617.7, 60 sec: 5554.3, 300 sec: 5567.8). Total num frames: 802236416. Throughput: 0: 5855.2. Samples: 802237062. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:00,944][25689] Avg episode reward: [(0, '-2.829')] [2022-07-10 15:33:02,774][26022] Updated weights on worker 0-0, policy_version 783443 (0.00084) [2022-07-10 15:33:04,460][26022] Updated weights on worker 0-0, policy_version 783453 (0.00103) [2022-07-10 15:33:05,949][25689] Fps is (10 sec: 5311.1, 60 sec: 5554.2, 300 sec: 5566.3). Total num frames: 802263040. Throughput: 0: 5745.8. Samples: 802268390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:05,950][25689] Avg episode reward: [(0, '-3.632')] [2022-07-10 15:33:06,514][26022] Updated weights on worker 0-0, policy_version 783463 (0.00083) [2022-07-10 15:33:08,142][26022] Updated weights on worker 0-0, policy_version 783473 (0.00093) [2022-07-10 15:33:10,155][26022] Updated weights on worker 0-0, policy_version 783483 (0.00072) [2022-07-10 15:33:11,075][25689] Fps is (10 sec: 5457.2, 60 sec: 5565.9, 300 sec: 5564.4). Total num frames: 802291712. Throughput: 0: 4899.1. Samples: 802285164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:11,076][25689] Avg episode reward: [(0, '-2.448')] [2022-07-10 15:33:11,858][26022] Updated weights on worker 0-0, policy_version 783493 (0.00081) [2022-07-10 15:33:13,821][26022] Updated weights on worker 0-0, policy_version 783503 (0.00088) [2022-07-10 15:33:15,687][26022] Updated weights on worker 0-0, policy_version 783513 (0.00087) [2022-07-10 15:33:16,105][25689] Fps is (10 sec: 5545.5, 60 sec: 5547.9, 300 sec: 5560.5). Total num frames: 802319360. Throughput: 0: 5732.4. Samples: 802319016. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:16,107][25689] Avg episode reward: [(0, '-1.883')] [2022-07-10 15:33:17,381][26022] Updated weights on worker 0-0, policy_version 783523 (0.00083) [2022-07-10 15:33:19,420][26022] Updated weights on worker 0-0, policy_version 783533 (0.00093) [2022-07-10 15:33:20,971][26022] Updated weights on worker 0-0, policy_version 783543 (0.00088) [2022-07-10 15:33:21,116][25689] Fps is (10 sec: 5609.4, 60 sec: 5549.5, 300 sec: 5557.2). Total num frames: 802348032. Throughput: 0: 5736.6. Samples: 802352786. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:21,116][25689] Avg episode reward: [(0, '-1.721')] [2022-07-10 15:33:22,978][26022] Updated weights on worker 0-0, policy_version 783553 (0.00093) [2022-07-10 15:33:24,577][26022] Updated weights on worker 0-0, policy_version 783563 (0.00086) [2022-07-10 15:33:26,129][25689] Fps is (10 sec: 5618.7, 60 sec: 5587.8, 300 sec: 5561.1). Total num frames: 802375680. Throughput: 0: 5018.4. Samples: 802369660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:26,131][25689] Avg episode reward: [(0, '-0.802')] [2022-07-10 15:33:26,720][26022] Updated weights on worker 0-0, policy_version 783573 (0.00698) [2022-07-10 15:33:28,314][26022] Updated weights on worker 0-0, policy_version 783583 (0.00086) [2022-07-10 15:33:30,475][26022] Updated weights on worker 0-0, policy_version 783593 (0.00086) [2022-07-10 15:33:31,214][25689] Fps is (10 sec: 5577.0, 60 sec: 5556.8, 300 sec: 5556.4). Total num frames: 802404352. Throughput: 0: 5869.6. Samples: 802403372. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:31,215][25689] Avg episode reward: [(0, '-0.159')] [2022-07-10 15:33:31,922][26022] Updated weights on worker 0-0, policy_version 783603 (0.00072) [2022-07-10 15:33:33,967][26022] Updated weights on worker 0-0, policy_version 783613 (0.00094) [2022-07-10 15:33:35,690][26022] Updated weights on worker 0-0, policy_version 783623 (0.00096) [2022-07-10 15:33:36,249][25689] Fps is (10 sec: 5666.1, 60 sec: 5604.9, 300 sec: 5566.1). Total num frames: 802433024. Throughput: 0: 5838.0. Samples: 802436618. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:36,251][25689] Avg episode reward: [(0, '0.793')] [2022-07-10 15:33:37,766][26022] Updated weights on worker 0-0, policy_version 783633 (0.00086) [2022-07-10 15:33:39,351][26022] Updated weights on worker 0-0, policy_version 783643 (0.00094) [2022-07-10 15:33:41,223][26022] Updated weights on worker 0-0, policy_version 783653 (0.00082) [2022-07-10 15:33:41,253][25689] Fps is (10 sec: 5610.3, 60 sec: 5554.4, 300 sec: 5559.5). Total num frames: 802460672. Throughput: 0: 5825.5. Samples: 802470096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:41,253][25689] Avg episode reward: [(0, '0.470')] [2022-07-10 15:33:42,993][26022] Updated weights on worker 0-0, policy_version 783663 (0.00084) [2022-07-10 15:33:44,814][26022] Updated weights on worker 0-0, policy_version 783673 (0.00093) [2022-07-10 15:33:46,273][25689] Fps is (10 sec: 5618.5, 60 sec: 5588.2, 300 sec: 5563.4). Total num frames: 802489344. Throughput: 0: 5831.9. Samples: 802487142. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:46,274][25689] Avg episode reward: [(0, '0.103')] [2022-07-10 15:33:46,801][26022] Updated weights on worker 0-0, policy_version 783683 (0.00082) [2022-07-10 15:33:48,563][26022] Updated weights on worker 0-0, policy_version 783693 (0.00082) [2022-07-10 15:33:50,431][26022] Updated weights on worker 0-0, policy_version 783703 (0.00084) [2022-07-10 15:33:51,336][25689] Fps is (10 sec: 5585.7, 60 sec: 5576.0, 300 sec: 5562.6). Total num frames: 802516992. Throughput: 0: 5839.8. Samples: 802520878. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:51,336][25689] Avg episode reward: [(0, '-0.101')] [2022-07-10 15:33:52,102][26022] Updated weights on worker 0-0, policy_version 783713 (0.00088) [2022-07-10 15:33:54,079][26022] Updated weights on worker 0-0, policy_version 783723 (0.00094) [2022-07-10 15:33:55,824][26022] Updated weights on worker 0-0, policy_version 783733 (0.00088) [2022-07-10 15:33:56,339][25689] Fps is (10 sec: 5493.4, 60 sec: 5542.4, 300 sec: 5559.6). Total num frames: 802544640. Throughput: 0: 5855.7. Samples: 802554258. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:33:56,340][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 15:33:57,795][26022] Updated weights on worker 0-0, policy_version 783743 (0.00094) [2022-07-10 15:33:59,354][26022] Updated weights on worker 0-0, policy_version 783753 (0.00085) [2022-07-10 15:34:01,349][25689] Fps is (10 sec: 5522.2, 60 sec: 5560.2, 300 sec: 5563.3). Total num frames: 802572288. Throughput: 0: 5023.1. Samples: 802571042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:01,350][25689] Avg episode reward: [(0, '0.111')] [2022-07-10 15:34:01,619][26022] Updated weights on worker 0-0, policy_version 783763 (0.00087) [2022-07-10 15:34:03,355][26022] Updated weights on worker 0-0, policy_version 783773 (0.00089) [2022-07-10 15:34:05,294][26022] Updated weights on worker 0-0, policy_version 783783 (0.00084) [2022-07-10 15:34:06,351][25689] Fps is (10 sec: 5420.9, 60 sec: 5560.6, 300 sec: 5561.4). Total num frames: 802598912. Throughput: 0: 5759.6. Samples: 802602780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:06,351][25689] Avg episode reward: [(0, '0.125')] [2022-07-10 15:34:07,198][26022] Updated weights on worker 0-0, policy_version 783793 (0.00090) [2022-07-10 15:34:08,953][26022] Updated weights on worker 0-0, policy_version 783803 (0.00084) [2022-07-10 15:34:11,041][26022] Updated weights on worker 0-0, policy_version 783813 (0.00088) [2022-07-10 15:34:11,486][25689] Fps is (10 sec: 5353.8, 60 sec: 5542.8, 300 sec: 5556.6). Total num frames: 802626560. Throughput: 0: 5724.0. Samples: 802636220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:11,487][25689] Avg episode reward: [(0, '0.967')] [2022-07-10 15:34:12,628][26022] Updated weights on worker 0-0, policy_version 783823 (0.00091) [2022-07-10 15:34:14,493][26022] Updated weights on worker 0-0, policy_version 783833 (0.00086) [2022-07-10 15:34:16,306][26022] Updated weights on worker 0-0, policy_version 783843 (0.00086) [2022-07-10 15:34:16,519][25689] Fps is (10 sec: 5639.4, 60 sec: 5576.5, 300 sec: 5560.5). Total num frames: 802656256. Throughput: 0: 4901.4. Samples: 802653168. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:16,520][25689] Avg episode reward: [(0, '0.852')] [2022-07-10 15:34:18,150][26022] Updated weights on worker 0-0, policy_version 783853 (0.00088) [2022-07-10 15:34:19,894][26022] Updated weights on worker 0-0, policy_version 783863 (0.00094) [2022-07-10 15:34:21,523][25689] Fps is (10 sec: 5713.2, 60 sec: 5560.1, 300 sec: 5551.3). Total num frames: 802683904. Throughput: 0: 5769.1. Samples: 802687428. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:21,524][25689] Avg episode reward: [(0, '0.924')] [2022-07-10 15:34:21,859][26022] Updated weights on worker 0-0, policy_version 783873 (0.00088) [2022-07-10 15:34:23,414][26022] Updated weights on worker 0-0, policy_version 783883 (0.00085) [2022-07-10 15:34:25,363][26022] Updated weights on worker 0-0, policy_version 783893 (0.00094) [2022-07-10 15:34:26,617][25689] Fps is (10 sec: 5678.9, 60 sec: 5586.5, 300 sec: 5569.0). Total num frames: 802713600. Throughput: 0: 5834.3. Samples: 802721020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:26,617][25689] Avg episode reward: [(0, '0.231')] [2022-07-10 15:34:27,243][26022] Updated weights on worker 0-0, policy_version 783903 (0.00089) [2022-07-10 15:34:29,065][26022] Updated weights on worker 0-0, policy_version 783913 (0.00089) [2022-07-10 15:34:30,857][26022] Updated weights on worker 0-0, policy_version 783923 (0.00086) [2022-07-10 15:34:31,673][25689] Fps is (10 sec: 5548.7, 60 sec: 5555.3, 300 sec: 5558.1). Total num frames: 802740224. Throughput: 0: 5024.4. Samples: 802737648. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:31,674][25689] Avg episode reward: [(0, '-0.612')] [2022-07-10 15:34:32,719][26022] Updated weights on worker 0-0, policy_version 783933 (0.00090) [2022-07-10 15:34:34,579][26022] Updated weights on worker 0-0, policy_version 783943 (0.00094) [2022-07-10 15:34:36,372][26022] Updated weights on worker 0-0, policy_version 783953 (0.00094) [2022-07-10 15:34:36,702][25689] Fps is (10 sec: 5483.0, 60 sec: 5556.0, 300 sec: 5561.1). Total num frames: 802768896. Throughput: 0: 5862.8. Samples: 802771496. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:36,704][25689] Avg episode reward: [(0, '-0.434')] [2022-07-10 15:34:38,102][26022] Updated weights on worker 0-0, policy_version 783963 (0.00088) [2022-07-10 15:34:40,185][26022] Updated weights on worker 0-0, policy_version 783973 (0.00089) [2022-07-10 15:34:41,717][25689] Fps is (10 sec: 5607.5, 60 sec: 5554.9, 300 sec: 5561.3). Total num frames: 802796544. Throughput: 0: 5809.5. Samples: 802804744. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:41,718][25689] Avg episode reward: [(0, '-0.096')] [2022-07-10 15:34:41,940][26022] Updated weights on worker 0-0, policy_version 783983 (0.00086) [2022-07-10 15:34:42,477][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:34:42,492][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000783986_802801664.pth [2022-07-10 15:34:42,493][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000782026_800794624.pth [2022-07-10 15:34:43,678][26022] Updated weights on worker 0-0, policy_version 783993 (0.00089) [2022-07-10 15:34:45,687][26022] Updated weights on worker 0-0, policy_version 784003 (0.00089) [2022-07-10 15:34:46,755][25689] Fps is (10 sec: 5703.8, 60 sec: 5570.2, 300 sec: 5561.8). Total num frames: 802826240. Throughput: 0: 4990.9. Samples: 802821530. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:46,756][25689] Avg episode reward: [(0, '-0.083')] [2022-07-10 15:34:47,314][26022] Updated weights on worker 0-0, policy_version 784013 (0.00089) [2022-07-10 15:34:49,306][26022] Updated weights on worker 0-0, policy_version 784023 (0.00082) [2022-07-10 15:34:51,303][26022] Updated weights on worker 0-0, policy_version 784033 (0.00086) [2022-07-10 15:34:51,840][25689] Fps is (10 sec: 5563.5, 60 sec: 5551.2, 300 sec: 5560.9). Total num frames: 802852864. Throughput: 0: 5830.3. Samples: 802855226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:51,841][25689] Avg episode reward: [(0, '-0.587')] [2022-07-10 15:34:52,845][26022] Updated weights on worker 0-0, policy_version 784043 (0.00090) [2022-07-10 15:34:54,935][26022] Updated weights on worker 0-0, policy_version 784053 (0.00096) [2022-07-10 15:34:56,477][26022] Updated weights on worker 0-0, policy_version 784063 (0.00085) [2022-07-10 15:34:56,844][25689] Fps is (10 sec: 5582.4, 60 sec: 5585.0, 300 sec: 5568.3). Total num frames: 802882560. Throughput: 0: 5808.2. Samples: 802888486. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:34:56,846][25689] Avg episode reward: [(0, '-0.115')] [2022-07-10 15:34:58,544][26022] Updated weights on worker 0-0, policy_version 784073 (0.00089) [2022-07-10 15:35:00,401][26022] Updated weights on worker 0-0, policy_version 784083 (0.00090) [2022-07-10 15:35:01,861][25689] Fps is (10 sec: 5416.1, 60 sec: 5533.6, 300 sec: 5561.5). Total num frames: 802907136. Throughput: 0: 4982.3. Samples: 802905104. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:35:01,861][25689] Avg episode reward: [(0, '0.654')] [2022-07-10 15:35:02,555][26022] Updated weights on worker 0-0, policy_version 784093 (0.00087) [2022-07-10 15:35:04,451][26022] Updated weights on worker 0-0, policy_version 784103 (0.00085) [2022-07-10 15:35:06,339][26022] Updated weights on worker 0-0, policy_version 784113 (0.00086) [2022-07-10 15:35:06,871][25689] Fps is (10 sec: 5106.5, 60 sec: 5532.9, 300 sec: 5555.3). Total num frames: 802933760. Throughput: 0: 5716.7. Samples: 802936522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:35:06,871][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 15:35:08,085][26022] Updated weights on worker 0-0, policy_version 784123 (0.00091) [2022-07-10 15:35:10,032][26022] Updated weights on worker 0-0, policy_version 784133 (0.00087) [2022-07-10 15:35:11,615][26022] Updated weights on worker 0-0, policy_version 784143 (0.00086) [2022-07-10 15:35:11,996][25689] Fps is (10 sec: 5556.6, 60 sec: 5567.6, 300 sec: 5556.5). Total num frames: 802963456. Throughput: 0: 5697.6. Samples: 802970068. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 15:35:11,997][25689] Avg episode reward: [(0, '-1.786')] [2022-07-10 15:35:13,581][26022] Updated weights on worker 0-0, policy_version 784153 (0.00095) [2022-07-10 15:35:15,573][26022] Updated weights on worker 0-0, policy_version 784163 (0.00087) [2022-07-10 15:35:17,012][25689] Fps is (10 sec: 5654.7, 60 sec: 5535.4, 300 sec: 5556.4). Total num frames: 802991104. Throughput: 0: 4878.2. Samples: 802986866. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:17,012][25689] Avg episode reward: [(0, '-2.685')] [2022-07-10 15:35:17,165][26022] Updated weights on worker 0-0, policy_version 784173 (0.00086) [2022-07-10 15:35:19,142][26022] Updated weights on worker 0-0, policy_version 784183 (0.00092) [2022-07-10 15:35:20,799][26022] Updated weights on worker 0-0, policy_version 784193 (0.00088) [2022-07-10 15:35:22,090][25689] Fps is (10 sec: 5478.1, 60 sec: 5528.6, 300 sec: 5552.7). Total num frames: 803018752. Throughput: 0: 5691.4. Samples: 803020240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:22,091][25689] Avg episode reward: [(0, '-2.144')] [2022-07-10 15:35:22,715][26022] Updated weights on worker 0-0, policy_version 784203 (0.00092) [2022-07-10 15:35:24,697][26022] Updated weights on worker 0-0, policy_version 784213 (0.00086) [2022-07-10 15:35:26,326][26022] Updated weights on worker 0-0, policy_version 784223 (0.00087) [2022-07-10 15:35:27,165][25689] Fps is (10 sec: 5546.9, 60 sec: 5513.4, 300 sec: 5556.7). Total num frames: 803047424. Throughput: 0: 5785.0. Samples: 803053924. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:27,165][25689] Avg episode reward: [(0, '-2.187')] [2022-07-10 15:35:28,445][26022] Updated weights on worker 0-0, policy_version 784233 (0.00184) [2022-07-10 15:35:30,083][26022] Updated weights on worker 0-0, policy_version 784243 (0.00439) [2022-07-10 15:35:31,974][26022] Updated weights on worker 0-0, policy_version 784253 (0.00086) [2022-07-10 15:35:32,296][25689] Fps is (10 sec: 5618.5, 60 sec: 5540.3, 300 sec: 5555.1). Total num frames: 803076096. Throughput: 0: 4954.6. Samples: 803070648. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:32,297][25689] Avg episode reward: [(0, '-2.430')] [2022-07-10 15:35:33,699][26022] Updated weights on worker 0-0, policy_version 784263 (0.00090) [2022-07-10 15:35:35,532][26022] Updated weights on worker 0-0, policy_version 784273 (0.00092) [2022-07-10 15:35:37,321][25689] Fps is (10 sec: 5646.2, 60 sec: 5540.7, 300 sec: 5556.1). Total num frames: 803104768. Throughput: 0: 5803.8. Samples: 803104738. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:37,322][25689] Avg episode reward: [(0, '-1.085')] [2022-07-10 15:35:37,588][26022] Updated weights on worker 0-0, policy_version 784283 (0.00088) [2022-07-10 15:35:39,236][26022] Updated weights on worker 0-0, policy_version 784293 (0.00089) [2022-07-10 15:35:41,281][26022] Updated weights on worker 0-0, policy_version 784303 (0.00083) [2022-07-10 15:35:42,352][25689] Fps is (10 sec: 5804.5, 60 sec: 5573.0, 300 sec: 5562.9). Total num frames: 803134464. Throughput: 0: 5824.1. Samples: 803138248. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:42,353][25689] Avg episode reward: [(0, '-0.313')] [2022-07-10 15:35:42,824][26022] Updated weights on worker 0-0, policy_version 784313 (0.00086) [2022-07-10 15:35:44,790][26022] Updated weights on worker 0-0, policy_version 784323 (0.00093) [2022-07-10 15:35:46,629][26022] Updated weights on worker 0-0, policy_version 784333 (0.01436) [2022-07-10 15:35:47,370][25689] Fps is (10 sec: 5604.7, 60 sec: 5524.2, 300 sec: 5560.6). Total num frames: 803161088. Throughput: 0: 5009.5. Samples: 803155140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:47,370][25689] Avg episode reward: [(0, '0.590')] [2022-07-10 15:35:48,379][26022] Updated weights on worker 0-0, policy_version 784343 (0.00089) [2022-07-10 15:35:50,291][26022] Updated weights on worker 0-0, policy_version 784353 (0.00093) [2022-07-10 15:35:52,003][26022] Updated weights on worker 0-0, policy_version 784363 (0.00086) [2022-07-10 15:35:52,433][25689] Fps is (10 sec: 5485.2, 60 sec: 5560.0, 300 sec: 5556.2). Total num frames: 803189760. Throughput: 0: 5854.1. Samples: 803188530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:52,434][25689] Avg episode reward: [(0, '0.071')] [2022-07-10 15:35:53,917][26022] Updated weights on worker 0-0, policy_version 784373 (0.00090) [2022-07-10 15:35:55,623][26022] Updated weights on worker 0-0, policy_version 784383 (0.00093) [2022-07-10 15:35:57,499][25689] Fps is (10 sec: 5459.3, 60 sec: 5503.7, 300 sec: 5552.1). Total num frames: 803216384. Throughput: 0: 5810.8. Samples: 803221986. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:35:57,499][25689] Avg episode reward: [(0, '-0.365')] [2022-07-10 15:35:57,692][26022] Updated weights on worker 0-0, policy_version 784393 (0.00087) [2022-07-10 15:35:59,268][26022] Updated weights on worker 0-0, policy_version 784403 (0.00089) [2022-07-10 15:36:01,216][26022] Updated weights on worker 0-0, policy_version 784413 (0.00092) [2022-07-10 15:36:02,511][25689] Fps is (10 sec: 5487.1, 60 sec: 5571.6, 300 sec: 5565.7). Total num frames: 803245056. Throughput: 0: 4991.2. Samples: 803238862. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:02,511][25689] Avg episode reward: [(0, '-1.409')] [2022-07-10 15:36:03,588][26022] Updated weights on worker 0-0, policy_version 784423 (0.00946) [2022-07-10 15:36:05,228][26022] Updated weights on worker 0-0, policy_version 784433 (0.00090) [2022-07-10 15:36:07,071][26022] Updated weights on worker 0-0, policy_version 784443 (0.00089) [2022-07-10 15:36:07,523][25689] Fps is (10 sec: 5516.2, 60 sec: 5571.4, 300 sec: 5556.6). Total num frames: 803271680. Throughput: 0: 5726.9. Samples: 803270554. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:07,523][25689] Avg episode reward: [(0, '-1.696')] [2022-07-10 15:36:09,008][26022] Updated weights on worker 0-0, policy_version 784453 (0.00092) [2022-07-10 15:36:10,790][26022] Updated weights on worker 0-0, policy_version 784463 (0.00101) [2022-07-10 15:36:12,656][25689] Fps is (10 sec: 5349.4, 60 sec: 5537.0, 300 sec: 5550.8). Total num frames: 803299328. Throughput: 0: 5707.1. Samples: 803303944. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:12,657][25689] Avg episode reward: [(0, '-1.907')] [2022-07-10 15:36:12,704][26022] Updated weights on worker 0-0, policy_version 784473 (0.00084) [2022-07-10 15:36:14,344][26022] Updated weights on worker 0-0, policy_version 784483 (0.00088) [2022-07-10 15:36:16,316][26022] Updated weights on worker 0-0, policy_version 784493 (0.00094) [2022-07-10 15:36:17,729][25689] Fps is (10 sec: 5719.1, 60 sec: 5582.3, 300 sec: 5560.0). Total num frames: 803330048. Throughput: 0: 5715.2. Samples: 803337604. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:17,729][25689] Avg episode reward: [(0, '-2.269')] [2022-07-10 15:36:17,846][26022] Updated weights on worker 0-0, policy_version 784503 (0.00087) [2022-07-10 15:36:20,103][26022] Updated weights on worker 0-0, policy_version 784513 (0.00091) [2022-07-10 15:36:21,706][26022] Updated weights on worker 0-0, policy_version 784523 (0.00094) [2022-07-10 15:36:22,754][25689] Fps is (10 sec: 5577.2, 60 sec: 5553.5, 300 sec: 5552.7). Total num frames: 803355648. Throughput: 0: 5700.7. Samples: 803354266. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:22,755][25689] Avg episode reward: [(0, '-1.293')] [2022-07-10 15:36:23,525][26022] Updated weights on worker 0-0, policy_version 784533 (0.00090) [2022-07-10 15:36:25,448][26022] Updated weights on worker 0-0, policy_version 784543 (0.00091) [2022-07-10 15:36:27,226][26022] Updated weights on worker 0-0, policy_version 784553 (0.00087) [2022-07-10 15:36:27,841][25689] Fps is (10 sec: 5468.0, 60 sec: 5569.2, 300 sec: 5556.6). Total num frames: 803385344. Throughput: 0: 5772.8. Samples: 803387848. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:27,842][25689] Avg episode reward: [(0, '-0.987')] [2022-07-10 15:36:29,110][26022] Updated weights on worker 0-0, policy_version 784563 (0.00095) [2022-07-10 15:36:31,058][26022] Updated weights on worker 0-0, policy_version 784573 (0.00088) [2022-07-10 15:36:32,631][26022] Updated weights on worker 0-0, policy_version 784583 (0.00085) [2022-07-10 15:36:32,915][25689] Fps is (10 sec: 5744.7, 60 sec: 5574.6, 300 sec: 5555.8). Total num frames: 803414016. Throughput: 0: 5806.0. Samples: 803421566. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:32,915][25689] Avg episode reward: [(0, '-1.704')] [2022-07-10 15:36:34,770][26022] Updated weights on worker 0-0, policy_version 784593 (0.00093) [2022-07-10 15:36:36,344][26022] Updated weights on worker 0-0, policy_version 784603 (0.00087) [2022-07-10 15:36:37,998][25689] Fps is (10 sec: 5544.9, 60 sec: 5552.3, 300 sec: 5554.6). Total num frames: 803441664. Throughput: 0: 4984.3. Samples: 803438634. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:37,999][25689] Avg episode reward: [(0, '-1.489')] [2022-07-10 15:36:38,267][26022] Updated weights on worker 0-0, policy_version 784613 (0.00085) [2022-07-10 15:36:40,017][26022] Updated weights on worker 0-0, policy_version 784623 (0.00091) [2022-07-10 15:36:41,915][26022] Updated weights on worker 0-0, policy_version 784633 (0.00086) [2022-07-10 15:36:42,609][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:36:42,623][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000784637_803468288.pth [2022-07-10 15:36:42,624][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000782678_801462272.pth [2022-07-10 15:36:43,073][25689] Fps is (10 sec: 5544.3, 60 sec: 5531.4, 300 sec: 5560.5). Total num frames: 803470336. Throughput: 0: 5795.9. Samples: 803472030. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:43,073][25689] Avg episode reward: [(0, '-1.618')] [2022-07-10 15:36:43,654][26022] Updated weights on worker 0-0, policy_version 784643 (0.00093) [2022-07-10 15:36:45,655][26022] Updated weights on worker 0-0, policy_version 784653 (0.00096) [2022-07-10 15:36:47,229][26022] Updated weights on worker 0-0, policy_version 784663 (0.00085) [2022-07-10 15:36:48,169][25689] Fps is (10 sec: 5638.0, 60 sec: 5557.9, 300 sec: 5553.7). Total num frames: 803499008. Throughput: 0: 5799.3. Samples: 803505736. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:48,170][25689] Avg episode reward: [(0, '-1.407')] [2022-07-10 15:36:49,319][26022] Updated weights on worker 0-0, policy_version 784673 (0.00092) [2022-07-10 15:36:50,875][26022] Updated weights on worker 0-0, policy_version 784683 (0.00083) [2022-07-10 15:36:52,890][26022] Updated weights on worker 0-0, policy_version 784693 (0.00091) [2022-07-10 15:36:53,205][25689] Fps is (10 sec: 5558.8, 60 sec: 5543.7, 300 sec: 5557.0). Total num frames: 803526656. Throughput: 0: 4983.6. Samples: 803522682. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:53,205][25689] Avg episode reward: [(0, '-1.971')] [2022-07-10 15:36:54,526][26022] Updated weights on worker 0-0, policy_version 784703 (0.00088) [2022-07-10 15:36:56,475][26022] Updated weights on worker 0-0, policy_version 784713 (0.00085) [2022-07-10 15:36:58,207][25689] Fps is (10 sec: 5509.0, 60 sec: 5566.3, 300 sec: 5554.5). Total num frames: 803554304. Throughput: 0: 5833.6. Samples: 803556522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:36:58,207][25689] Avg episode reward: [(0, '0.108')] [2022-07-10 15:36:58,406][26022] Updated weights on worker 0-0, policy_version 784723 (0.00093) [2022-07-10 15:36:59,936][26022] Updated weights on worker 0-0, policy_version 784733 (0.00619) [2022-07-10 15:37:02,388][26022] Updated weights on worker 0-0, policy_version 784743 (0.00092) [2022-07-10 15:37:03,247][25689] Fps is (10 sec: 5506.2, 60 sec: 5546.9, 300 sec: 5557.3). Total num frames: 803581952. Throughput: 0: 5762.0. Samples: 803588274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:03,247][25689] Avg episode reward: [(0, '-0.306')] [2022-07-10 15:37:03,997][26022] Updated weights on worker 0-0, policy_version 784753 (0.00087) [2022-07-10 15:37:05,960][26022] Updated weights on worker 0-0, policy_version 784763 (0.00080) [2022-07-10 15:37:07,802][26022] Updated weights on worker 0-0, policy_version 784773 (0.00092) [2022-07-10 15:37:08,280][25689] Fps is (10 sec: 5489.6, 60 sec: 5561.9, 300 sec: 5557.9). Total num frames: 803609600. Throughput: 0: 4946.4. Samples: 803605210. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:08,280][25689] Avg episode reward: [(0, '-1.545')] [2022-07-10 15:37:09,724][26022] Updated weights on worker 0-0, policy_version 784783 (0.00087) [2022-07-10 15:37:11,383][26022] Updated weights on worker 0-0, policy_version 784793 (0.00091) [2022-07-10 15:37:13,320][25689] Fps is (10 sec: 5489.8, 60 sec: 5570.4, 300 sec: 5554.1). Total num frames: 803637248. Throughput: 0: 5758.9. Samples: 803638522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:13,320][25689] Avg episode reward: [(0, '-1.100')] [2022-07-10 15:37:13,415][26022] Updated weights on worker 0-0, policy_version 784803 (0.00087) [2022-07-10 15:37:15,139][26022] Updated weights on worker 0-0, policy_version 784813 (0.00081) [2022-07-10 15:37:17,208][26022] Updated weights on worker 0-0, policy_version 784823 (0.00095) [2022-07-10 15:37:18,327][25689] Fps is (10 sec: 5605.3, 60 sec: 5542.6, 300 sec: 5554.5). Total num frames: 803665920. Throughput: 0: 5747.6. Samples: 803672168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:18,328][25689] Avg episode reward: [(0, '-1.145')] [2022-07-10 15:37:18,698][26022] Updated weights on worker 0-0, policy_version 784833 (0.00085) [2022-07-10 15:37:20,705][26022] Updated weights on worker 0-0, policy_version 784843 (0.00086) [2022-07-10 15:37:22,454][26022] Updated weights on worker 0-0, policy_version 784853 (0.00087) [2022-07-10 15:37:23,340][25689] Fps is (10 sec: 5722.6, 60 sec: 5594.4, 300 sec: 5565.7). Total num frames: 803694592. Throughput: 0: 5024.1. Samples: 803689224. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:23,341][25689] Avg episode reward: [(0, '-0.510')] [2022-07-10 15:37:24,095][26022] Updated weights on worker 0-0, policy_version 784863 (0.00082) [2022-07-10 15:37:26,047][26022] Updated weights on worker 0-0, policy_version 784873 (0.00098) [2022-07-10 15:37:27,753][26022] Updated weights on worker 0-0, policy_version 784883 (0.00089) [2022-07-10 15:37:28,354][25689] Fps is (10 sec: 5616.8, 60 sec: 5567.3, 300 sec: 5557.3). Total num frames: 803722240. Throughput: 0: 5870.0. Samples: 803723048. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:28,355][25689] Avg episode reward: [(0, '-0.963')] [2022-07-10 15:37:29,638][26022] Updated weights on worker 0-0, policy_version 784893 (0.00081) [2022-07-10 15:37:31,816][26022] Updated weights on worker 0-0, policy_version 784903 (0.00091) [2022-07-10 15:37:33,195][26022] Updated weights on worker 0-0, policy_version 784913 (0.00087) [2022-07-10 15:37:33,479][25689] Fps is (10 sec: 5656.0, 60 sec: 5579.5, 300 sec: 5568.8). Total num frames: 803751936. Throughput: 0: 5866.8. Samples: 803756792. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:33,479][25689] Avg episode reward: [(0, '0.596')] [2022-07-10 15:37:35,227][26022] Updated weights on worker 0-0, policy_version 784923 (0.00056) [2022-07-10 15:37:36,942][26022] Updated weights on worker 0-0, policy_version 784933 (0.01022) [2022-07-10 15:37:38,489][25689] Fps is (10 sec: 5456.3, 60 sec: 5552.5, 300 sec: 5551.6). Total num frames: 803777536. Throughput: 0: 5032.9. Samples: 803773638. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:38,489][25689] Avg episode reward: [(0, '0.182')] [2022-07-10 15:37:38,886][26022] Updated weights on worker 0-0, policy_version 784943 (0.00085) [2022-07-10 15:37:40,640][26022] Updated weights on worker 0-0, policy_version 784953 (0.00082) [2022-07-10 15:37:42,427][26022] Updated weights on worker 0-0, policy_version 784963 (0.00106) [2022-07-10 15:37:43,536][25689] Fps is (10 sec: 5498.1, 60 sec: 5571.9, 300 sec: 5561.4). Total num frames: 803807232. Throughput: 0: 5841.4. Samples: 803807196. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:43,537][25689] Avg episode reward: [(0, '0.491')] [2022-07-10 15:37:44,439][26022] Updated weights on worker 0-0, policy_version 784973 (0.00098) [2022-07-10 15:37:46,106][26022] Updated weights on worker 0-0, policy_version 784983 (0.00093) [2022-07-10 15:37:47,940][26022] Updated weights on worker 0-0, policy_version 784993 (0.00078) [2022-07-10 15:37:48,578][25689] Fps is (10 sec: 5785.3, 60 sec: 5577.0, 300 sec: 5562.7). Total num frames: 803835904. Throughput: 0: 5841.7. Samples: 803841186. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:48,578][25689] Avg episode reward: [(0, '0.688')] [2022-07-10 15:37:49,767][26022] Updated weights on worker 0-0, policy_version 785003 (0.00086) [2022-07-10 15:37:51,576][26022] Updated weights on worker 0-0, policy_version 785013 (0.00084) [2022-07-10 15:37:53,353][26022] Updated weights on worker 0-0, policy_version 785023 (0.00086) [2022-07-10 15:37:53,658][25689] Fps is (10 sec: 5766.8, 60 sec: 5606.7, 300 sec: 5561.4). Total num frames: 803865600. Throughput: 0: 5028.9. Samples: 803858268. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:53,658][25689] Avg episode reward: [(0, '0.705')] [2022-07-10 15:37:55,322][26022] Updated weights on worker 0-0, policy_version 785033 (0.00093) [2022-07-10 15:37:56,819][26022] Updated weights on worker 0-0, policy_version 785043 (0.00082) [2022-07-10 15:37:58,659][25689] Fps is (10 sec: 5586.6, 60 sec: 5589.9, 300 sec: 5561.7). Total num frames: 803892224. Throughput: 0: 5867.9. Samples: 803891994. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:37:58,659][25689] Avg episode reward: [(0, '-0.577')] [2022-07-10 15:37:58,998][26022] Updated weights on worker 0-0, policy_version 785053 (0.00089) [2022-07-10 15:38:00,577][26022] Updated weights on worker 0-0, policy_version 785063 (0.00080) [2022-07-10 15:38:02,958][26022] Updated weights on worker 0-0, policy_version 785073 (0.00092) [2022-07-10 15:38:03,693][25689] Fps is (10 sec: 5305.9, 60 sec: 5573.5, 300 sec: 5561.2). Total num frames: 803918848. Throughput: 0: 5776.6. Samples: 803923634. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:03,694][25689] Avg episode reward: [(0, '-0.943')] [2022-07-10 15:38:04,775][26022] Updated weights on worker 0-0, policy_version 785083 (0.00089) [2022-07-10 15:38:06,405][26022] Updated weights on worker 0-0, policy_version 785093 (0.00086) [2022-07-10 15:38:08,326][26022] Updated weights on worker 0-0, policy_version 785103 (0.00097) [2022-07-10 15:38:08,707][25689] Fps is (10 sec: 5605.2, 60 sec: 5609.1, 300 sec: 5566.7). Total num frames: 803948544. Throughput: 0: 4939.1. Samples: 803940606. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:08,708][25689] Avg episode reward: [(0, '-0.789')] [2022-07-10 15:38:10,335][26022] Updated weights on worker 0-0, policy_version 785113 (0.00088) [2022-07-10 15:38:11,763][26022] Updated weights on worker 0-0, policy_version 785123 (0.00098) [2022-07-10 15:38:13,851][25689] Fps is (10 sec: 5443.8, 60 sec: 5565.7, 300 sec: 5557.7). Total num frames: 803974144. Throughput: 0: 5744.1. Samples: 803974260. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:13,851][25689] Avg episode reward: [(0, '-0.519')] [2022-07-10 15:38:13,929][26022] Updated weights on worker 0-0, policy_version 785133 (0.00087) [2022-07-10 15:38:15,464][26022] Updated weights on worker 0-0, policy_version 785143 (0.00087) [2022-07-10 15:38:17,558][26022] Updated weights on worker 0-0, policy_version 785153 (0.00094) [2022-07-10 15:38:18,875][25689] Fps is (10 sec: 5438.1, 60 sec: 5581.1, 300 sec: 5560.9). Total num frames: 804003840. Throughput: 0: 5735.0. Samples: 804007934. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:18,876][25689] Avg episode reward: [(0, '-0.722')] [2022-07-10 15:38:19,484][26022] Updated weights on worker 0-0, policy_version 785163 (0.00084) [2022-07-10 15:38:21,139][26022] Updated weights on worker 0-0, policy_version 785173 (0.00096) [2022-07-10 15:38:23,034][26022] Updated weights on worker 0-0, policy_version 785183 (0.00087) [2022-07-10 15:38:23,886][25689] Fps is (10 sec: 5816.1, 60 sec: 5581.2, 300 sec: 5564.4). Total num frames: 804032512. Throughput: 0: 5828.6. Samples: 804041332. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:23,888][25689] Avg episode reward: [(0, '-1.405')] [2022-07-10 15:38:24,888][26022] Updated weights on worker 0-0, policy_version 785193 (0.00092) [2022-07-10 15:38:26,669][26022] Updated weights on worker 0-0, policy_version 785203 (0.00086) [2022-07-10 15:38:28,729][26022] Updated weights on worker 0-0, policy_version 785213 (0.00094) [2022-07-10 15:38:28,896][25689] Fps is (10 sec: 5517.9, 60 sec: 5564.7, 300 sec: 5558.9). Total num frames: 804059136. Throughput: 0: 5815.2. Samples: 804058012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:28,897][25689] Avg episode reward: [(0, '-0.731')] [2022-07-10 15:38:30,265][26022] Updated weights on worker 0-0, policy_version 785223 (0.00083) [2022-07-10 15:38:32,224][26022] Updated weights on worker 0-0, policy_version 785233 (0.00084) [2022-07-10 15:38:34,029][25689] Fps is (10 sec: 5451.5, 60 sec: 5547.0, 300 sec: 5557.1). Total num frames: 804087808. Throughput: 0: 5793.9. Samples: 804091174. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:34,030][25689] Avg episode reward: [(0, '-0.885')] [2022-07-10 15:38:34,155][26022] Updated weights on worker 0-0, policy_version 785243 (0.00093) [2022-07-10 15:38:35,793][26022] Updated weights on worker 0-0, policy_version 785253 (0.00084) [2022-07-10 15:38:37,955][26022] Updated weights on worker 0-0, policy_version 785263 (0.00085) [2022-07-10 15:38:39,056][25689] Fps is (10 sec: 5644.2, 60 sec: 5596.1, 300 sec: 5560.1). Total num frames: 804116480. Throughput: 0: 5810.2. Samples: 804125190. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:39,057][25689] Avg episode reward: [(0, '-0.934')] [2022-07-10 15:38:39,415][26022] Updated weights on worker 0-0, policy_version 785273 (0.00085) [2022-07-10 15:38:41,367][26022] Updated weights on worker 0-0, policy_version 785283 (0.00094) [2022-07-10 15:38:42,781][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:38:42,789][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000785290_804136960.pth [2022-07-10 15:38:42,790][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000783332_802131968.pth [2022-07-10 15:38:43,313][26022] Updated weights on worker 0-0, policy_version 785293 (0.00086) [2022-07-10 15:38:44,078][25689] Fps is (10 sec: 5706.8, 60 sec: 5581.6, 300 sec: 5560.0). Total num frames: 804145152. Throughput: 0: 4966.6. Samples: 804141616. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 15:38:44,080][25689] Avg episode reward: [(0, '-1.149')] [2022-07-10 15:38:45,006][26022] Updated weights on worker 0-0, policy_version 785303 (0.00087) [2022-07-10 15:38:46,926][26022] Updated weights on worker 0-0, policy_version 785313 (0.00088) [2022-07-10 15:38:48,840][26022] Updated weights on worker 0-0, policy_version 785323 (0.00090) [2022-07-10 15:38:49,086][25689] Fps is (10 sec: 5615.4, 60 sec: 5567.8, 300 sec: 5561.1). Total num frames: 804172800. Throughput: 0: 5807.3. Samples: 804175258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:38:49,086][25689] Avg episode reward: [(0, '-1.356')] [2022-07-10 15:38:50,503][26022] Updated weights on worker 0-0, policy_version 785333 (0.00056) [2022-07-10 15:38:52,531][26022] Updated weights on worker 0-0, policy_version 785343 (0.00613) [2022-07-10 15:38:54,143][25689] Fps is (10 sec: 5493.9, 60 sec: 5536.0, 300 sec: 5560.1). Total num frames: 804200448. Throughput: 0: 5836.6. Samples: 804208568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:38:54,144][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 15:38:54,166][26022] Updated weights on worker 0-0, policy_version 785353 (0.00090) [2022-07-10 15:38:56,261][26022] Updated weights on worker 0-0, policy_version 785363 (0.00094) [2022-07-10 15:38:58,047][26022] Updated weights on worker 0-0, policy_version 785373 (0.00336) [2022-07-10 15:38:59,183][25689] Fps is (10 sec: 5476.3, 60 sec: 5549.4, 300 sec: 5559.5). Total num frames: 804228096. Throughput: 0: 4963.5. Samples: 804225090. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:38:59,184][25689] Avg episode reward: [(0, '0.116')] [2022-07-10 15:38:59,769][26022] Updated weights on worker 0-0, policy_version 785383 (0.00109) [2022-07-10 15:39:02,016][26022] Updated weights on worker 0-0, policy_version 785393 (0.00091) [2022-07-10 15:39:03,967][26022] Updated weights on worker 0-0, policy_version 785403 (0.00093) [2022-07-10 15:39:04,277][25689] Fps is (10 sec: 5254.2, 60 sec: 5527.0, 300 sec: 5554.3). Total num frames: 804253696. Throughput: 0: 5682.7. Samples: 804256404. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:04,278][25689] Avg episode reward: [(0, '0.344')] [2022-07-10 15:39:05,728][26022] Updated weights on worker 0-0, policy_version 785413 (0.00087) [2022-07-10 15:39:07,687][26022] Updated weights on worker 0-0, policy_version 785423 (0.00088) [2022-07-10 15:39:09,244][26022] Updated weights on worker 0-0, policy_version 785433 (0.00097) [2022-07-10 15:39:09,331][25689] Fps is (10 sec: 5448.7, 60 sec: 5523.3, 300 sec: 5562.7). Total num frames: 804283392. Throughput: 0: 5670.9. Samples: 804290070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:09,332][25689] Avg episode reward: [(0, '0.537')] [2022-07-10 15:39:11,276][26022] Updated weights on worker 0-0, policy_version 785443 (0.00088) [2022-07-10 15:39:13,120][26022] Updated weights on worker 0-0, policy_version 785453 (0.00093) [2022-07-10 15:39:14,407][25689] Fps is (10 sec: 5661.2, 60 sec: 5563.4, 300 sec: 5555.1). Total num frames: 804311040. Throughput: 0: 4839.4. Samples: 804306632. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:14,407][25689] Avg episode reward: [(0, '0.750')] [2022-07-10 15:39:14,808][26022] Updated weights on worker 0-0, policy_version 785463 (0.00089) [2022-07-10 15:39:16,783][26022] Updated weights on worker 0-0, policy_version 785473 (0.00086) [2022-07-10 15:39:18,232][26022] Updated weights on worker 0-0, policy_version 785483 (0.00083) [2022-07-10 15:39:19,411][25689] Fps is (10 sec: 5384.5, 60 sec: 5514.5, 300 sec: 5551.6). Total num frames: 804337664. Throughput: 0: 5695.2. Samples: 804340290. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:19,411][25689] Avg episode reward: [(0, '0.906')] [2022-07-10 15:39:20,493][26022] Updated weights on worker 0-0, policy_version 785493 (0.00083) [2022-07-10 15:39:22,013][26022] Updated weights on worker 0-0, policy_version 785503 (0.00085) [2022-07-10 15:39:23,995][26022] Updated weights on worker 0-0, policy_version 785513 (0.00085) [2022-07-10 15:39:24,413][25689] Fps is (10 sec: 5525.9, 60 sec: 5515.3, 300 sec: 5549.9). Total num frames: 804366336. Throughput: 0: 5841.1. Samples: 804374020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:24,414][25689] Avg episode reward: [(0, '-0.149')] [2022-07-10 15:39:25,868][26022] Updated weights on worker 0-0, policy_version 785523 (0.00090) [2022-07-10 15:39:27,874][26022] Updated weights on worker 0-0, policy_version 785533 (0.00085) [2022-07-10 15:39:29,438][25689] Fps is (10 sec: 5718.8, 60 sec: 5547.8, 300 sec: 5557.4). Total num frames: 804395008. Throughput: 0: 5000.8. Samples: 804390618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:29,438][25689] Avg episode reward: [(0, '-0.455')] [2022-07-10 15:39:29,636][26022] Updated weights on worker 0-0, policy_version 785543 (0.00093) [2022-07-10 15:39:31,522][26022] Updated weights on worker 0-0, policy_version 785553 (0.00087) [2022-07-10 15:39:33,266][26022] Updated weights on worker 0-0, policy_version 785563 (0.00088) [2022-07-10 15:39:34,525][25689] Fps is (10 sec: 5671.3, 60 sec: 5552.1, 300 sec: 5556.3). Total num frames: 804423680. Throughput: 0: 5827.9. Samples: 804423878. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:34,526][25689] Avg episode reward: [(0, '-0.841')] [2022-07-10 15:39:34,965][26022] Updated weights on worker 0-0, policy_version 785573 (0.00222) [2022-07-10 15:39:36,984][26022] Updated weights on worker 0-0, policy_version 785583 (0.00087) [2022-07-10 15:39:38,747][26022] Updated weights on worker 0-0, policy_version 785593 (0.00085) [2022-07-10 15:39:39,626][25689] Fps is (10 sec: 5527.9, 60 sec: 5528.3, 300 sec: 5554.7). Total num frames: 804451328. Throughput: 0: 5797.8. Samples: 804457496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:39,627][25689] Avg episode reward: [(0, '-1.752')] [2022-07-10 15:39:40,630][26022] Updated weights on worker 0-0, policy_version 785603 (0.00093) [2022-07-10 15:39:42,439][26022] Updated weights on worker 0-0, policy_version 785613 (0.00087) [2022-07-10 15:39:44,288][26022] Updated weights on worker 0-0, policy_version 785623 (0.00085) [2022-07-10 15:39:44,677][25689] Fps is (10 sec: 5446.4, 60 sec: 5508.7, 300 sec: 5547.6). Total num frames: 804478976. Throughput: 0: 4939.4. Samples: 804474112. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:44,678][25689] Avg episode reward: [(0, '-2.078')] [2022-07-10 15:39:46,106][26022] Updated weights on worker 0-0, policy_version 785633 (0.00056) [2022-07-10 15:39:47,905][26022] Updated weights on worker 0-0, policy_version 785643 (0.00092) [2022-07-10 15:39:49,691][25689] Fps is (10 sec: 5595.8, 60 sec: 5525.1, 300 sec: 5555.8). Total num frames: 804507648. Throughput: 0: 5791.1. Samples: 804507904. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:49,691][25689] Avg episode reward: [(0, '-2.907')] [2022-07-10 15:39:49,857][26022] Updated weights on worker 0-0, policy_version 785653 (0.00086) [2022-07-10 15:39:51,589][26022] Updated weights on worker 0-0, policy_version 785663 (0.00083) [2022-07-10 15:39:53,412][26022] Updated weights on worker 0-0, policy_version 785673 (0.00090) [2022-07-10 15:39:54,760][25689] Fps is (10 sec: 5687.3, 60 sec: 5540.9, 300 sec: 5551.1). Total num frames: 804536320. Throughput: 0: 5824.7. Samples: 804541744. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:54,762][25689] Avg episode reward: [(0, '-3.358')] [2022-07-10 15:39:55,183][26022] Updated weights on worker 0-0, policy_version 785683 (0.00089) [2022-07-10 15:39:56,929][26022] Updated weights on worker 0-0, policy_version 785693 (0.00094) [2022-07-10 15:39:58,736][26022] Updated weights on worker 0-0, policy_version 785703 (0.00084) [2022-07-10 15:39:59,781][25689] Fps is (10 sec: 5581.4, 60 sec: 5542.6, 300 sec: 5561.3). Total num frames: 804563968. Throughput: 0: 5006.4. Samples: 804558402. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:39:59,783][25689] Avg episode reward: [(0, '-3.010')] [2022-07-10 15:40:00,718][26022] Updated weights on worker 0-0, policy_version 785713 (0.00091) [2022-07-10 15:40:02,973][26022] Updated weights on worker 0-0, policy_version 785723 (0.00692) [2022-07-10 15:40:04,797][25689] Fps is (10 sec: 5305.1, 60 sec: 5549.8, 300 sec: 5557.8). Total num frames: 804589568. Throughput: 0: 5750.4. Samples: 804589810. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:04,798][25689] Avg episode reward: [(0, '-2.535')] [2022-07-10 15:40:05,056][26022] Updated weights on worker 0-0, policy_version 785733 (0.00088) [2022-07-10 15:40:06,473][26022] Updated weights on worker 0-0, policy_version 785743 (0.00093) [2022-07-10 15:40:08,522][26022] Updated weights on worker 0-0, policy_version 785753 (0.00082) [2022-07-10 15:40:09,819][25689] Fps is (10 sec: 5611.0, 60 sec: 5569.7, 300 sec: 5563.1). Total num frames: 804620288. Throughput: 0: 5735.3. Samples: 804623344. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:09,819][25689] Avg episode reward: [(0, '-1.605')] [2022-07-10 15:40:09,956][26022] Updated weights on worker 0-0, policy_version 785763 (0.00087) [2022-07-10 15:40:12,374][26022] Updated weights on worker 0-0, policy_version 785773 (0.00091) [2022-07-10 15:40:14,246][26022] Updated weights on worker 0-0, policy_version 785783 (0.00087) [2022-07-10 15:40:14,944][25689] Fps is (10 sec: 5550.5, 60 sec: 5531.2, 300 sec: 5554.2). Total num frames: 804645888. Throughput: 0: 4867.7. Samples: 804639994. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:14,945][25689] Avg episode reward: [(0, '-1.539')] [2022-07-10 15:40:15,838][26022] Updated weights on worker 0-0, policy_version 785793 (0.00085) [2022-07-10 15:40:17,746][26022] Updated weights on worker 0-0, policy_version 785803 (0.00087) [2022-07-10 15:40:19,639][26022] Updated weights on worker 0-0, policy_version 785813 (0.00093) [2022-07-10 15:40:19,948][25689] Fps is (10 sec: 5358.2, 60 sec: 5565.1, 300 sec: 5559.1). Total num frames: 804674560. Throughput: 0: 5697.4. Samples: 804673298. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:19,948][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 15:40:21,303][26022] Updated weights on worker 0-0, policy_version 785823 (0.00088) [2022-07-10 15:40:23,419][26022] Updated weights on worker 0-0, policy_version 785833 (0.00083) [2022-07-10 15:40:24,739][26022] Updated weights on worker 0-0, policy_version 785843 (0.00090) [2022-07-10 15:40:24,997][25689] Fps is (10 sec: 5806.2, 60 sec: 5577.7, 300 sec: 5563.0). Total num frames: 804704256. Throughput: 0: 5802.5. Samples: 804707018. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:24,997][25689] Avg episode reward: [(0, '0.467')] [2022-07-10 15:40:27,118][26022] Updated weights on worker 0-0, policy_version 785853 (0.00088) [2022-07-10 15:40:28,411][26022] Updated weights on worker 0-0, policy_version 785863 (0.00092) [2022-07-10 15:40:30,001][25689] Fps is (10 sec: 5500.4, 60 sec: 5528.9, 300 sec: 5555.0). Total num frames: 804729856. Throughput: 0: 4978.4. Samples: 804723818. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:30,001][25689] Avg episode reward: [(0, '0.153')] [2022-07-10 15:40:30,641][26022] Updated weights on worker 0-0, policy_version 785873 (0.00093) [2022-07-10 15:40:32,275][26022] Updated weights on worker 0-0, policy_version 785883 (0.00053) [2022-07-10 15:40:34,460][26022] Updated weights on worker 0-0, policy_version 785893 (0.00086) [2022-07-10 15:40:35,098][25689] Fps is (10 sec: 5474.1, 60 sec: 5544.8, 300 sec: 5557.1). Total num frames: 804759552. Throughput: 0: 5812.5. Samples: 804757140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:35,099][25689] Avg episode reward: [(0, '-0.678')] [2022-07-10 15:40:36,071][26022] Updated weights on worker 0-0, policy_version 785903 (0.00108) [2022-07-10 15:40:37,903][26022] Updated weights on worker 0-0, policy_version 785913 (0.00350) [2022-07-10 15:40:39,707][26022] Updated weights on worker 0-0, policy_version 785923 (0.00083) [2022-07-10 15:40:40,101][25689] Fps is (10 sec: 5677.2, 60 sec: 5553.9, 300 sec: 5550.8). Total num frames: 804787200. Throughput: 0: 5837.8. Samples: 804790952. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:40,102][25689] Avg episode reward: [(0, '-1.370')] [2022-07-10 15:40:41,639][26022] Updated weights on worker 0-0, policy_version 785933 (0.00084) [2022-07-10 15:40:43,005][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:40:43,024][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000785941_804803584.pth [2022-07-10 15:40:43,024][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000783986_802801664.pth [2022-07-10 15:40:43,479][26022] Updated weights on worker 0-0, policy_version 785943 (0.00083) [2022-07-10 15:40:45,114][25689] Fps is (10 sec: 5520.9, 60 sec: 5557.4, 300 sec: 5554.3). Total num frames: 804814848. Throughput: 0: 5004.5. Samples: 804807694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:45,115][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 15:40:45,272][26022] Updated weights on worker 0-0, policy_version 785953 (0.00102) [2022-07-10 15:40:47,195][26022] Updated weights on worker 0-0, policy_version 785963 (0.00089) [2022-07-10 15:40:49,147][26022] Updated weights on worker 0-0, policy_version 785973 (0.00086) [2022-07-10 15:40:50,136][25689] Fps is (10 sec: 5408.4, 60 sec: 5522.7, 300 sec: 5548.2). Total num frames: 804841472. Throughput: 0: 5810.6. Samples: 804840818. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:50,137][25689] Avg episode reward: [(0, '-1.891')] [2022-07-10 15:40:50,713][26022] Updated weights on worker 0-0, policy_version 785983 (0.00089) [2022-07-10 15:40:52,826][26022] Updated weights on worker 0-0, policy_version 785993 (0.00098) [2022-07-10 15:40:54,402][26022] Updated weights on worker 0-0, policy_version 786003 (0.00095) [2022-07-10 15:40:55,192][25689] Fps is (10 sec: 5486.7, 60 sec: 5523.9, 300 sec: 5555.2). Total num frames: 804870144. Throughput: 0: 5813.0. Samples: 804873946. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:40:55,193][25689] Avg episode reward: [(0, '-1.467')] [2022-07-10 15:40:56,689][26022] Updated weights on worker 0-0, policy_version 786013 (0.00093) [2022-07-10 15:40:58,211][26022] Updated weights on worker 0-0, policy_version 786023 (0.00099) [2022-07-10 15:41:00,242][25689] Fps is (10 sec: 5572.9, 60 sec: 5521.3, 300 sec: 5551.1). Total num frames: 804897792. Throughput: 0: 5782.6. Samples: 804907418. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:00,243][25689] Avg episode reward: [(0, '-1.717')] [2022-07-10 15:41:00,245][26022] Updated weights on worker 0-0, policy_version 786033 (0.00084) [2022-07-10 15:41:01,914][26022] Updated weights on worker 0-0, policy_version 786043 (0.00085) [2022-07-10 15:41:04,211][26022] Updated weights on worker 0-0, policy_version 786053 (0.00097) [2022-07-10 15:41:05,288][25689] Fps is (10 sec: 5375.8, 60 sec: 5535.5, 300 sec: 5550.5). Total num frames: 804924416. Throughput: 0: 5659.6. Samples: 804921870. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:05,288][25689] Avg episode reward: [(0, '-1.100')] [2022-07-10 15:41:05,840][26022] Updated weights on worker 0-0, policy_version 786063 (0.00083) [2022-07-10 15:41:07,958][26022] Updated weights on worker 0-0, policy_version 786073 (0.00086) [2022-07-10 15:41:09,700][26022] Updated weights on worker 0-0, policy_version 786083 (0.00050) [2022-07-10 15:41:10,364][25689] Fps is (10 sec: 5362.0, 60 sec: 5479.8, 300 sec: 5551.5). Total num frames: 804952064. Throughput: 0: 5658.9. Samples: 804955284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:10,366][25689] Avg episode reward: [(0, '-0.919')] [2022-07-10 15:41:11,477][26022] Updated weights on worker 0-0, policy_version 786093 (0.00084) [2022-07-10 15:41:13,211][26022] Updated weights on worker 0-0, policy_version 786103 (0.00081) [2022-07-10 15:41:15,509][25689] Fps is (10 sec: 5410.0, 60 sec: 5511.8, 300 sec: 5539.9). Total num frames: 804979712. Throughput: 0: 5662.0. Samples: 804988980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:15,510][25689] Avg episode reward: [(0, '-0.591')] [2022-07-10 15:41:15,512][26022] Updated weights on worker 0-0, policy_version 786113 (0.00090) [2022-07-10 15:41:16,886][26022] Updated weights on worker 0-0, policy_version 786123 (0.00089) [2022-07-10 15:41:19,208][26022] Updated weights on worker 0-0, policy_version 786133 (0.00091) [2022-07-10 15:41:20,472][26022] Updated weights on worker 0-0, policy_version 786143 (0.00086) [2022-07-10 15:41:20,546][25689] Fps is (10 sec: 5732.4, 60 sec: 5542.6, 300 sec: 5556.9). Total num frames: 805010432. Throughput: 0: 4821.9. Samples: 805005320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:20,547][25689] Avg episode reward: [(0, '-0.845')] [2022-07-10 15:41:22,811][26022] Updated weights on worker 0-0, policy_version 786153 (0.00083) [2022-07-10 15:41:24,223][26022] Updated weights on worker 0-0, policy_version 786163 (0.00081) [2022-07-10 15:41:25,560][25689] Fps is (10 sec: 5705.5, 60 sec: 5495.1, 300 sec: 5547.9). Total num frames: 805037056. Throughput: 0: 5771.3. Samples: 805038864. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:25,560][25689] Avg episode reward: [(0, '-1.244')] [2022-07-10 15:41:26,487][26022] Updated weights on worker 0-0, policy_version 786173 (0.00089) [2022-07-10 15:41:28,160][26022] Updated weights on worker 0-0, policy_version 786183 (0.00089) [2022-07-10 15:41:30,046][26022] Updated weights on worker 0-0, policy_version 786193 (0.00076) [2022-07-10 15:41:30,590][25689] Fps is (10 sec: 5301.4, 60 sec: 5509.6, 300 sec: 5541.8). Total num frames: 805063680. Throughput: 0: 5771.3. Samples: 805072016. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:30,592][25689] Avg episode reward: [(0, '-1.354')] [2022-07-10 15:41:31,757][26022] Updated weights on worker 0-0, policy_version 786203 (0.00085) [2022-07-10 15:41:33,825][26022] Updated weights on worker 0-0, policy_version 786213 (0.00091) [2022-07-10 15:41:35,456][26022] Updated weights on worker 0-0, policy_version 786223 (0.00092) [2022-07-10 15:41:35,648][25689] Fps is (10 sec: 5582.5, 60 sec: 5513.2, 300 sec: 5549.2). Total num frames: 805093376. Throughput: 0: 4956.7. Samples: 805088804. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:35,649][25689] Avg episode reward: [(0, '-1.682')] [2022-07-10 15:41:37,582][26022] Updated weights on worker 0-0, policy_version 786233 (0.00085) [2022-07-10 15:41:39,003][26022] Updated weights on worker 0-0, policy_version 786243 (0.00091) [2022-07-10 15:41:40,654][25689] Fps is (10 sec: 5596.1, 60 sec: 5496.0, 300 sec: 5543.6). Total num frames: 805120000. Throughput: 0: 5809.9. Samples: 805122146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:40,655][25689] Avg episode reward: [(0, '-1.819')] [2022-07-10 15:41:41,135][26022] Updated weights on worker 0-0, policy_version 786253 (0.00093) [2022-07-10 15:41:42,697][26022] Updated weights on worker 0-0, policy_version 786263 (0.00111) [2022-07-10 15:41:44,680][26022] Updated weights on worker 0-0, policy_version 786273 (0.00280) [2022-07-10 15:41:45,718][25689] Fps is (10 sec: 5592.7, 60 sec: 5525.1, 300 sec: 5547.6). Total num frames: 805149696. Throughput: 0: 5797.3. Samples: 805155732. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:45,719][25689] Avg episode reward: [(0, '-1.319')] [2022-07-10 15:41:46,555][26022] Updated weights on worker 0-0, policy_version 786283 (0.00095) [2022-07-10 15:41:48,372][26022] Updated weights on worker 0-0, policy_version 786293 (0.00090) [2022-07-10 15:41:50,248][26022] Updated weights on worker 0-0, policy_version 786303 (0.00089) [2022-07-10 15:41:50,806][25689] Fps is (10 sec: 5547.9, 60 sec: 5519.2, 300 sec: 5543.2). Total num frames: 805176320. Throughput: 0: 4956.1. Samples: 805172206. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:50,807][25689] Avg episode reward: [(0, '-0.181')] [2022-07-10 15:41:52,204][26022] Updated weights on worker 0-0, policy_version 786313 (0.00091) [2022-07-10 15:41:53,884][26022] Updated weights on worker 0-0, policy_version 786323 (0.00104) [2022-07-10 15:41:55,852][26022] Updated weights on worker 0-0, policy_version 786333 (0.00087) [2022-07-10 15:41:55,880][25689] Fps is (10 sec: 5542.7, 60 sec: 5534.5, 300 sec: 5548.7). Total num frames: 805206016. Throughput: 0: 5790.1. Samples: 805205946. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:41:55,880][25689] Avg episode reward: [(0, '-0.129')] [2022-07-10 15:41:57,450][26022] Updated weights on worker 0-0, policy_version 786343 (0.00090) [2022-07-10 15:41:59,388][26022] Updated weights on worker 0-0, policy_version 786353 (0.00089) [2022-07-10 15:42:00,919][25689] Fps is (10 sec: 5771.4, 60 sec: 5552.3, 300 sec: 5552.2). Total num frames: 805234688. Throughput: 0: 5795.7. Samples: 805239596. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:42:00,920][25689] Avg episode reward: [(0, '0.749')] [2022-07-10 15:42:01,182][26022] Updated weights on worker 0-0, policy_version 786363 (0.00084) [2022-07-10 15:42:03,369][26022] Updated weights on worker 0-0, policy_version 786373 (0.00091) [2022-07-10 15:42:05,123][26022] Updated weights on worker 0-0, policy_version 786383 (0.00091) [2022-07-10 15:42:05,974][25689] Fps is (10 sec: 5275.0, 60 sec: 5517.7, 300 sec: 5541.5). Total num frames: 805259264. Throughput: 0: 4871.3. Samples: 805254402. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:42:05,975][25689] Avg episode reward: [(0, '0.564')] [2022-07-10 15:42:07,137][26022] Updated weights on worker 0-0, policy_version 786393 (0.00087) [2022-07-10 15:42:08,848][26022] Updated weights on worker 0-0, policy_version 786403 (0.00617) [2022-07-10 15:42:10,840][26022] Updated weights on worker 0-0, policy_version 786413 (0.00084) [2022-07-10 15:42:10,981][25689] Fps is (10 sec: 5292.1, 60 sec: 5540.9, 300 sec: 5545.5). Total num frames: 805287936. Throughput: 0: 5736.0. Samples: 805287930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:42:10,983][25689] Avg episode reward: [(0, '0.584')] [2022-07-10 15:42:12,392][26022] Updated weights on worker 0-0, policy_version 786423 (0.00093) [2022-07-10 15:42:14,534][26022] Updated weights on worker 0-0, policy_version 786433 (0.00088) [2022-07-10 15:42:16,041][25689] Fps is (10 sec: 5696.4, 60 sec: 5565.6, 300 sec: 5544.6). Total num frames: 805316608. Throughput: 0: 5740.0. Samples: 805321670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 15:42:16,041][25689] Avg episode reward: [(0, '0.576')] [2022-07-10 15:42:16,066][26022] Updated weights on worker 0-0, policy_version 786443 (0.00095) [2022-07-10 15:42:18,284][26022] Updated weights on worker 0-0, policy_version 786453 (0.00087) [2022-07-10 15:42:19,810][26022] Updated weights on worker 0-0, policy_version 786463 (0.00088) [2022-07-10 15:42:21,091][25689] Fps is (10 sec: 5469.7, 60 sec: 5496.8, 300 sec: 5537.0). Total num frames: 805343232. Throughput: 0: 4889.3. Samples: 805338220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:21,091][25689] Avg episode reward: [(0, '0.403')] [2022-07-10 15:42:21,782][26022] Updated weights on worker 0-0, policy_version 786473 (0.00087) [2022-07-10 15:42:23,468][26022] Updated weights on worker 0-0, policy_version 786483 (0.00077) [2022-07-10 15:42:25,450][26022] Updated weights on worker 0-0, policy_version 786493 (0.00083) [2022-07-10 15:42:26,092][25689] Fps is (10 sec: 5501.4, 60 sec: 5531.7, 300 sec: 5540.7). Total num frames: 805371904. Throughput: 0: 5852.5. Samples: 805372142. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:26,092][25689] Avg episode reward: [(0, '0.392')] [2022-07-10 15:42:26,996][26022] Updated weights on worker 0-0, policy_version 786503 (0.00093) [2022-07-10 15:42:29,131][26022] Updated weights on worker 0-0, policy_version 786513 (0.00091) [2022-07-10 15:42:30,704][26022] Updated weights on worker 0-0, policy_version 786523 (0.00096) [2022-07-10 15:42:31,121][25689] Fps is (10 sec: 5717.0, 60 sec: 5565.7, 300 sec: 5539.0). Total num frames: 805400576. Throughput: 0: 5851.2. Samples: 805405772. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:31,123][25689] Avg episode reward: [(0, '0.387')] [2022-07-10 15:42:32,828][26022] Updated weights on worker 0-0, policy_version 786533 (0.00093) [2022-07-10 15:42:34,516][26022] Updated weights on worker 0-0, policy_version 786543 (0.00088) [2022-07-10 15:42:36,242][25689] Fps is (10 sec: 5649.9, 60 sec: 5543.1, 300 sec: 5547.3). Total num frames: 805429248. Throughput: 0: 4992.4. Samples: 805422520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:36,242][25689] Avg episode reward: [(0, '0.653')] [2022-07-10 15:42:36,438][26022] Updated weights on worker 0-0, policy_version 786553 (0.00080) [2022-07-10 15:42:38,194][26022] Updated weights on worker 0-0, policy_version 786563 (0.00085) [2022-07-10 15:42:39,985][26022] Updated weights on worker 0-0, policy_version 786573 (0.00086) [2022-07-10 15:42:41,313][25689] Fps is (10 sec: 5626.1, 60 sec: 5570.8, 300 sec: 5543.4). Total num frames: 805457920. Throughput: 0: 5828.0. Samples: 805456078. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:41,314][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 15:42:41,728][26022] Updated weights on worker 0-0, policy_version 786583 (0.00092) [2022-07-10 15:42:43,148][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:42:43,162][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000786590_805468160.pth [2022-07-10 15:42:43,162][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000784637_803468288.pth [2022-07-10 15:42:43,867][26022] Updated weights on worker 0-0, policy_version 786593 (0.00091) [2022-07-10 15:42:45,367][26022] Updated weights on worker 0-0, policy_version 786603 (0.00096) [2022-07-10 15:42:46,359][25689] Fps is (10 sec: 5566.6, 60 sec: 5538.7, 300 sec: 5539.8). Total num frames: 805485568. Throughput: 0: 5806.0. Samples: 805489812. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:46,359][25689] Avg episode reward: [(0, '-1.076')] [2022-07-10 15:42:47,372][26022] Updated weights on worker 0-0, policy_version 786613 (0.00094) [2022-07-10 15:42:49,087][26022] Updated weights on worker 0-0, policy_version 786623 (0.00088) [2022-07-10 15:42:51,059][26022] Updated weights on worker 0-0, policy_version 786633 (0.00082) [2022-07-10 15:42:51,366][25689] Fps is (10 sec: 5500.6, 60 sec: 5563.0, 300 sec: 5534.3). Total num frames: 805513216. Throughput: 0: 5818.5. Samples: 805523570. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:51,367][25689] Avg episode reward: [(0, '-0.755')] [2022-07-10 15:42:52,930][26022] Updated weights on worker 0-0, policy_version 786643 (0.00088) [2022-07-10 15:42:54,566][26022] Updated weights on worker 0-0, policy_version 786653 (0.00091) [2022-07-10 15:42:56,423][25689] Fps is (10 sec: 5596.1, 60 sec: 5547.6, 300 sec: 5540.2). Total num frames: 805541888. Throughput: 0: 5835.2. Samples: 805540284. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:42:56,424][25689] Avg episode reward: [(0, '-1.074')] [2022-07-10 15:42:56,499][26022] Updated weights on worker 0-0, policy_version 786663 (0.00087) [2022-07-10 15:42:58,302][26022] Updated weights on worker 0-0, policy_version 786673 (0.00089) [2022-07-10 15:43:00,138][26022] Updated weights on worker 0-0, policy_version 786683 (0.00084) [2022-07-10 15:43:01,433][25689] Fps is (10 sec: 5696.1, 60 sec: 5550.3, 300 sec: 5547.5). Total num frames: 805570560. Throughput: 0: 5855.6. Samples: 805573894. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:01,434][25689] Avg episode reward: [(0, '-1.856')] [2022-07-10 15:43:02,096][26022] Updated weights on worker 0-0, policy_version 786693 (0.00084) [2022-07-10 15:43:04,191][26022] Updated weights on worker 0-0, policy_version 786703 (0.00097) [2022-07-10 15:43:05,991][26022] Updated weights on worker 0-0, policy_version 786713 (0.00086) [2022-07-10 15:43:06,459][25689] Fps is (10 sec: 5407.9, 60 sec: 5569.9, 300 sec: 5533.5). Total num frames: 805596160. Throughput: 0: 5762.1. Samples: 805605630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:06,459][25689] Avg episode reward: [(0, '-1.894')] [2022-07-10 15:43:07,766][26022] Updated weights on worker 0-0, policy_version 786723 (0.00084) [2022-07-10 15:43:09,687][26022] Updated weights on worker 0-0, policy_version 786733 (0.00092) [2022-07-10 15:43:11,467][26022] Updated weights on worker 0-0, policy_version 786743 (0.00080) [2022-07-10 15:43:11,471][25689] Fps is (10 sec: 5407.1, 60 sec: 5569.5, 300 sec: 5546.3). Total num frames: 805624832. Throughput: 0: 4917.8. Samples: 805622440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:11,471][25689] Avg episode reward: [(0, '-0.849')] [2022-07-10 15:43:13,414][26022] Updated weights on worker 0-0, policy_version 786753 (0.00090) [2022-07-10 15:43:15,004][26022] Updated weights on worker 0-0, policy_version 786763 (0.00093) [2022-07-10 15:43:16,525][25689] Fps is (10 sec: 5595.1, 60 sec: 5553.0, 300 sec: 5538.9). Total num frames: 805652480. Throughput: 0: 5759.2. Samples: 805656054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:16,525][25689] Avg episode reward: [(0, '-0.827')] [2022-07-10 15:43:17,033][26022] Updated weights on worker 0-0, policy_version 786773 (0.00101) [2022-07-10 15:43:18,748][26022] Updated weights on worker 0-0, policy_version 786783 (0.00090) [2022-07-10 15:43:20,636][26022] Updated weights on worker 0-0, policy_version 786793 (0.00092) [2022-07-10 15:43:21,552][25689] Fps is (10 sec: 5383.6, 60 sec: 5555.2, 300 sec: 5531.7). Total num frames: 805679104. Throughput: 0: 5766.2. Samples: 805689900. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:21,552][25689] Avg episode reward: [(0, '-3.182')] [2022-07-10 15:43:22,361][26022] Updated weights on worker 0-0, policy_version 786803 (0.00087) [2022-07-10 15:43:24,235][26022] Updated weights on worker 0-0, policy_version 786813 (0.00091) [2022-07-10 15:43:25,920][26022] Updated weights on worker 0-0, policy_version 786823 (0.00088) [2022-07-10 15:43:26,557][25689] Fps is (10 sec: 5716.0, 60 sec: 5588.7, 300 sec: 5545.5). Total num frames: 805709824. Throughput: 0: 5041.9. Samples: 805706962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:26,558][25689] Avg episode reward: [(0, '-3.049')] [2022-07-10 15:43:27,949][26022] Updated weights on worker 0-0, policy_version 786833 (0.00088) [2022-07-10 15:43:29,661][26022] Updated weights on worker 0-0, policy_version 786843 (0.00083) [2022-07-10 15:43:31,564][25689] Fps is (10 sec: 5727.5, 60 sec: 5556.9, 300 sec: 5541.0). Total num frames: 805736448. Throughput: 0: 5871.7. Samples: 805740420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:31,564][25689] Avg episode reward: [(0, '-2.350')] [2022-07-10 15:43:31,683][26022] Updated weights on worker 0-0, policy_version 786853 (0.00089) [2022-07-10 15:43:33,351][26022] Updated weights on worker 0-0, policy_version 786863 (0.00088) [2022-07-10 15:43:35,499][26022] Updated weights on worker 0-0, policy_version 786873 (0.00093) [2022-07-10 15:43:36,612][25689] Fps is (10 sec: 5499.6, 60 sec: 5563.6, 300 sec: 5540.6). Total num frames: 805765120. Throughput: 0: 5874.4. Samples: 805774050. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:36,612][25689] Avg episode reward: [(0, '-2.201')] [2022-07-10 15:43:36,876][26022] Updated weights on worker 0-0, policy_version 786883 (0.00094) [2022-07-10 15:43:38,940][26022] Updated weights on worker 0-0, policy_version 786893 (0.00092) [2022-07-10 15:43:40,511][26022] Updated weights on worker 0-0, policy_version 786903 (0.00088) [2022-07-10 15:43:41,700][25689] Fps is (10 sec: 5657.1, 60 sec: 5562.0, 300 sec: 5539.4). Total num frames: 805793792. Throughput: 0: 5021.1. Samples: 805791068. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:41,701][25689] Avg episode reward: [(0, '-1.857')] [2022-07-10 15:43:42,653][26022] Updated weights on worker 0-0, policy_version 786913 (0.00085) [2022-07-10 15:43:44,223][26022] Updated weights on worker 0-0, policy_version 786923 (0.00081) [2022-07-10 15:43:46,191][26022] Updated weights on worker 0-0, policy_version 786933 (0.00083) [2022-07-10 15:43:46,720][25689] Fps is (10 sec: 5774.1, 60 sec: 5598.3, 300 sec: 5546.0). Total num frames: 805823488. Throughput: 0: 5859.9. Samples: 805825116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:46,721][25689] Avg episode reward: [(0, '-2.542')] [2022-07-10 15:43:48,074][26022] Updated weights on worker 0-0, policy_version 786943 (0.00086) [2022-07-10 15:43:49,570][26022] Updated weights on worker 0-0, policy_version 786953 (0.00084) [2022-07-10 15:43:51,722][25689] Fps is (10 sec: 5517.5, 60 sec: 5564.8, 300 sec: 5540.2). Total num frames: 805849088. Throughput: 0: 5850.5. Samples: 805858358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:51,724][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 15:43:51,844][26022] Updated weights on worker 0-0, policy_version 786963 (0.00091) [2022-07-10 15:43:53,584][26022] Updated weights on worker 0-0, policy_version 786973 (0.00087) [2022-07-10 15:43:55,299][26022] Updated weights on worker 0-0, policy_version 786983 (0.00091) [2022-07-10 15:43:56,762][25689] Fps is (10 sec: 5404.5, 60 sec: 5566.4, 300 sec: 5543.6). Total num frames: 805877760. Throughput: 0: 5025.8. Samples: 805875322. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:43:56,763][25689] Avg episode reward: [(0, '-0.458')] [2022-07-10 15:43:57,259][26022] Updated weights on worker 0-0, policy_version 786993 (0.00086) [2022-07-10 15:43:58,852][26022] Updated weights on worker 0-0, policy_version 787003 (0.00086) [2022-07-10 15:44:00,871][26022] Updated weights on worker 0-0, policy_version 787013 (0.00085) [2022-07-10 15:44:01,771][25689] Fps is (10 sec: 5706.9, 60 sec: 5566.6, 300 sec: 5555.5). Total num frames: 805906432. Throughput: 0: 5873.3. Samples: 805908946. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:01,771][25689] Avg episode reward: [(0, '-1.069')] [2022-07-10 15:44:03,186][26022] Updated weights on worker 0-0, policy_version 787023 (0.00083) [2022-07-10 15:44:04,799][26022] Updated weights on worker 0-0, policy_version 787033 (0.00087) [2022-07-10 15:44:06,850][25689] Fps is (10 sec: 5379.7, 60 sec: 5561.6, 300 sec: 5541.3). Total num frames: 805932032. Throughput: 0: 5724.8. Samples: 805940358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:06,851][25689] Avg episode reward: [(0, '-1.875')] [2022-07-10 15:44:06,855][26022] Updated weights on worker 0-0, policy_version 787043 (0.00095) [2022-07-10 15:44:08,343][26022] Updated weights on worker 0-0, policy_version 787053 (0.00097) [2022-07-10 15:44:10,429][26022] Updated weights on worker 0-0, policy_version 787063 (0.00096) [2022-07-10 15:44:11,863][25689] Fps is (10 sec: 5377.4, 60 sec: 5561.5, 300 sec: 5545.9). Total num frames: 805960704. Throughput: 0: 4894.9. Samples: 805956946. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:11,864][25689] Avg episode reward: [(0, '-1.578')] [2022-07-10 15:44:12,270][26022] Updated weights on worker 0-0, policy_version 787073 (0.00051) [2022-07-10 15:44:13,996][26022] Updated weights on worker 0-0, policy_version 787083 (0.00088) [2022-07-10 15:44:16,102][26022] Updated weights on worker 0-0, policy_version 787093 (0.00095) [2022-07-10 15:44:17,012][25689] Fps is (10 sec: 5542.6, 60 sec: 5552.8, 300 sec: 5546.6). Total num frames: 805988352. Throughput: 0: 5679.0. Samples: 805990318. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:17,012][25689] Avg episode reward: [(0, '-1.184')] [2022-07-10 15:44:17,702][26022] Updated weights on worker 0-0, policy_version 787103 (0.00090) [2022-07-10 15:44:19,721][26022] Updated weights on worker 0-0, policy_version 787113 (0.00509) [2022-07-10 15:44:21,361][26022] Updated weights on worker 0-0, policy_version 787123 (0.00097) [2022-07-10 15:44:22,051][25689] Fps is (10 sec: 5427.5, 60 sec: 5568.6, 300 sec: 5542.5). Total num frames: 806016000. Throughput: 0: 5653.7. Samples: 806023608. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:22,052][25689] Avg episode reward: [(0, '-1.226')] [2022-07-10 15:44:23,539][26022] Updated weights on worker 0-0, policy_version 787133 (0.00086) [2022-07-10 15:44:25,076][26022] Updated weights on worker 0-0, policy_version 787143 (0.00092) [2022-07-10 15:44:27,083][25689] Fps is (10 sec: 5592.0, 60 sec: 5532.3, 300 sec: 5542.4). Total num frames: 806044672. Throughput: 0: 4942.5. Samples: 806040358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:27,084][25689] Avg episode reward: [(0, '-1.425')] [2022-07-10 15:44:27,087][26022] Updated weights on worker 0-0, policy_version 787153 (0.00087) [2022-07-10 15:44:28,788][26022] Updated weights on worker 0-0, policy_version 787163 (0.00099) [2022-07-10 15:44:30,771][26022] Updated weights on worker 0-0, policy_version 787173 (0.00096) [2022-07-10 15:44:32,106][25689] Fps is (10 sec: 5601.5, 60 sec: 5547.7, 300 sec: 5540.1). Total num frames: 806072320. Throughput: 0: 5761.0. Samples: 806073564. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:32,106][25689] Avg episode reward: [(0, '-1.182')] [2022-07-10 15:44:32,642][26022] Updated weights on worker 0-0, policy_version 787183 (0.00092) [2022-07-10 15:44:34,438][26022] Updated weights on worker 0-0, policy_version 787193 (0.00083) [2022-07-10 15:44:36,104][26022] Updated weights on worker 0-0, policy_version 787203 (0.00094) [2022-07-10 15:44:37,203][25689] Fps is (10 sec: 5565.3, 60 sec: 5543.2, 300 sec: 5543.7). Total num frames: 806100992. Throughput: 0: 5789.5. Samples: 806107216. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:37,203][25689] Avg episode reward: [(0, '-0.225')] [2022-07-10 15:44:38,213][26022] Updated weights on worker 0-0, policy_version 787213 (0.00086) [2022-07-10 15:44:39,800][26022] Updated weights on worker 0-0, policy_version 787223 (0.00090) [2022-07-10 15:44:41,897][26022] Updated weights on worker 0-0, policy_version 787233 (0.00081) [2022-07-10 15:44:42,306][25689] Fps is (10 sec: 5621.7, 60 sec: 5541.9, 300 sec: 5546.1). Total num frames: 806129664. Throughput: 0: 4964.6. Samples: 806124166. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:42,307][25689] Avg episode reward: [(0, '0.264')] [2022-07-10 15:44:43,212][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:44:43,223][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000787242_806135808.pth [2022-07-10 15:44:43,224][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000785290_804136960.pth [2022-07-10 15:44:43,539][26022] Updated weights on worker 0-0, policy_version 787243 (0.00086) [2022-07-10 15:44:45,435][26022] Updated weights on worker 0-0, policy_version 787253 (0.00089) [2022-07-10 15:44:47,234][26022] Updated weights on worker 0-0, policy_version 787263 (0.00093) [2022-07-10 15:44:47,320][25689] Fps is (10 sec: 5566.9, 60 sec: 5508.7, 300 sec: 5542.7). Total num frames: 806157312. Throughput: 0: 5803.4. Samples: 806157800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:47,320][25689] Avg episode reward: [(0, '0.290')] [2022-07-10 15:44:48,894][26022] Updated weights on worker 0-0, policy_version 787273 (0.00081) [2022-07-10 15:44:51,144][26022] Updated weights on worker 0-0, policy_version 787283 (0.00095) [2022-07-10 15:44:52,324][25689] Fps is (10 sec: 5519.9, 60 sec: 5542.3, 300 sec: 5540.5). Total num frames: 806184960. Throughput: 0: 5819.2. Samples: 806191216. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:52,324][25689] Avg episode reward: [(0, '0.143')] [2022-07-10 15:44:52,571][26022] Updated weights on worker 0-0, policy_version 787293 (0.00085) [2022-07-10 15:44:54,583][26022] Updated weights on worker 0-0, policy_version 787303 (0.00087) [2022-07-10 15:44:56,269][26022] Updated weights on worker 0-0, policy_version 787313 (0.00077) [2022-07-10 15:44:57,416][25689] Fps is (10 sec: 5578.3, 60 sec: 5537.5, 300 sec: 5542.6). Total num frames: 806213632. Throughput: 0: 4985.9. Samples: 806207996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:44:57,417][25689] Avg episode reward: [(0, '0.385')] [2022-07-10 15:44:58,176][26022] Updated weights on worker 0-0, policy_version 787323 (0.00092) [2022-07-10 15:45:00,008][26022] Updated weights on worker 0-0, policy_version 787333 (0.00085) [2022-07-10 15:45:02,104][26022] Updated weights on worker 0-0, policy_version 787343 (0.00090) [2022-07-10 15:45:02,442][25689] Fps is (10 sec: 5566.3, 60 sec: 5519.1, 300 sec: 5549.3). Total num frames: 806241280. Throughput: 0: 5845.9. Samples: 806241878. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:02,444][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 15:45:03,996][26022] Updated weights on worker 0-0, policy_version 787353 (0.00086) [2022-07-10 15:45:05,899][26022] Updated weights on worker 0-0, policy_version 787363 (0.00086) [2022-07-10 15:45:07,461][25689] Fps is (10 sec: 5504.7, 60 sec: 5558.3, 300 sec: 5539.0). Total num frames: 806268928. Throughput: 0: 5747.7. Samples: 806273568. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:07,463][25689] Avg episode reward: [(0, '-0.009')] [2022-07-10 15:45:07,486][26022] Updated weights on worker 0-0, policy_version 787373 (0.00094) [2022-07-10 15:45:09,439][26022] Updated weights on worker 0-0, policy_version 787383 (0.00089) [2022-07-10 15:45:11,157][26022] Updated weights on worker 0-0, policy_version 787393 (0.00092) [2022-07-10 15:45:12,471][25689] Fps is (10 sec: 5513.8, 60 sec: 5541.8, 300 sec: 5548.0). Total num frames: 806296576. Throughput: 0: 4931.4. Samples: 806290568. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:12,471][25689] Avg episode reward: [(0, '-0.215')] [2022-07-10 15:45:13,219][26022] Updated weights on worker 0-0, policy_version 787403 (0.00087) [2022-07-10 15:45:14,900][26022] Updated weights on worker 0-0, policy_version 787413 (0.00098) [2022-07-10 15:45:16,801][26022] Updated weights on worker 0-0, policy_version 787423 (0.00086) [2022-07-10 15:45:17,563][25689] Fps is (10 sec: 5575.0, 60 sec: 5563.8, 300 sec: 5546.4). Total num frames: 806325248. Throughput: 0: 5771.4. Samples: 806324274. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:17,564][25689] Avg episode reward: [(0, '-0.097')] [2022-07-10 15:45:18,582][26022] Updated weights on worker 0-0, policy_version 787433 (0.00088) [2022-07-10 15:45:20,503][26022] Updated weights on worker 0-0, policy_version 787443 (0.00085) [2022-07-10 15:45:22,285][26022] Updated weights on worker 0-0, policy_version 787453 (0.00091) [2022-07-10 15:45:22,606][25689] Fps is (10 sec: 5657.5, 60 sec: 5580.4, 300 sec: 5543.0). Total num frames: 806353920. Throughput: 0: 5755.4. Samples: 806357932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:22,607][25689] Avg episode reward: [(0, '0.311')] [2022-07-10 15:45:24,055][26022] Updated weights on worker 0-0, policy_version 787463 (0.00091) [2022-07-10 15:45:25,922][26022] Updated weights on worker 0-0, policy_version 787473 (0.00087) [2022-07-10 15:45:27,655][25689] Fps is (10 sec: 5581.0, 60 sec: 5561.9, 300 sec: 5549.1). Total num frames: 806381568. Throughput: 0: 5017.1. Samples: 806374882. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:27,655][25689] Avg episode reward: [(0, '0.926')] [2022-07-10 15:45:27,664][26022] Updated weights on worker 0-0, policy_version 787483 (0.00082) [2022-07-10 15:45:29,652][26022] Updated weights on worker 0-0, policy_version 787493 (0.00088) [2022-07-10 15:45:31,310][26022] Updated weights on worker 0-0, policy_version 787503 (0.00086) [2022-07-10 15:45:32,707][25689] Fps is (10 sec: 5576.1, 60 sec: 5576.2, 300 sec: 5546.5). Total num frames: 806410240. Throughput: 0: 5828.9. Samples: 806408522. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:32,707][25689] Avg episode reward: [(0, '1.717')] [2022-07-10 15:45:33,438][26022] Updated weights on worker 0-0, policy_version 787513 (0.00103) [2022-07-10 15:45:35,116][26022] Updated weights on worker 0-0, policy_version 787523 (0.00089) [2022-07-10 15:45:36,905][26022] Updated weights on worker 0-0, policy_version 787533 (0.00090) [2022-07-10 15:45:37,800][25689] Fps is (10 sec: 5652.4, 60 sec: 5576.5, 300 sec: 5548.3). Total num frames: 806438912. Throughput: 0: 5825.4. Samples: 806442160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:37,800][25689] Avg episode reward: [(0, '1.930')] [2022-07-10 15:45:38,804][26022] Updated weights on worker 0-0, policy_version 787543 (0.00084) [2022-07-10 15:45:40,616][26022] Updated weights on worker 0-0, policy_version 787553 (0.00091) [2022-07-10 15:45:42,554][26022] Updated weights on worker 0-0, policy_version 787563 (0.00084) [2022-07-10 15:45:42,811][25689] Fps is (10 sec: 5574.1, 60 sec: 5568.1, 300 sec: 5548.3). Total num frames: 806466560. Throughput: 0: 5831.6. Samples: 806475754. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:42,811][25689] Avg episode reward: [(0, '1.677')] [2022-07-10 15:45:44,117][26022] Updated weights on worker 0-0, policy_version 787573 (0.00085) [2022-07-10 15:45:46,005][26022] Updated weights on worker 0-0, policy_version 787583 (0.00089) [2022-07-10 15:45:47,812][25689] Fps is (10 sec: 5522.8, 60 sec: 5569.2, 300 sec: 5552.1). Total num frames: 806494208. Throughput: 0: 5837.5. Samples: 806492552. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-10 15:45:47,813][25689] Avg episode reward: [(0, '0.632')] [2022-07-10 15:45:47,869][26022] Updated weights on worker 0-0, policy_version 787593 (0.00093) [2022-07-10 15:45:49,545][26022] Updated weights on worker 0-0, policy_version 787603 (0.00087) [2022-07-10 15:45:51,428][26022] Updated weights on worker 0-0, policy_version 787613 (0.00092) [2022-07-10 15:45:52,848][25689] Fps is (10 sec: 5611.1, 60 sec: 5583.2, 300 sec: 5552.5). Total num frames: 806522880. Throughput: 0: 5832.2. Samples: 806525990. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:45:52,848][25689] Avg episode reward: [(0, '0.523')] [2022-07-10 15:45:53,407][26022] Updated weights on worker 0-0, policy_version 787623 (0.00096) [2022-07-10 15:45:55,147][26022] Updated weights on worker 0-0, policy_version 787633 (0.00086) [2022-07-10 15:45:57,084][26022] Updated weights on worker 0-0, policy_version 787643 (0.00086) [2022-07-10 15:45:57,904][25689] Fps is (10 sec: 5682.4, 60 sec: 5586.6, 300 sec: 5555.8). Total num frames: 806551552. Throughput: 0: 5846.1. Samples: 806559690. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:45:57,905][25689] Avg episode reward: [(0, '-0.007')] [2022-07-10 15:45:58,760][26022] Updated weights on worker 0-0, policy_version 787653 (0.00092) [2022-07-10 15:46:00,715][26022] Updated weights on worker 0-0, policy_version 787663 (0.00091) [2022-07-10 15:46:02,928][25689] Fps is (10 sec: 5384.1, 60 sec: 5552.9, 300 sec: 5552.8). Total num frames: 806577152. Throughput: 0: 5007.1. Samples: 806576484. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:02,928][25689] Avg episode reward: [(0, '-1.479')] [2022-07-10 15:46:02,934][26022] Updated weights on worker 0-0, policy_version 787673 (0.00090) [2022-07-10 15:46:04,764][26022] Updated weights on worker 0-0, policy_version 787683 (0.00095) [2022-07-10 15:46:06,589][26022] Updated weights on worker 0-0, policy_version 787693 (0.00086) [2022-07-10 15:46:08,012][25689] Fps is (10 sec: 5267.7, 60 sec: 5546.9, 300 sec: 5552.6). Total num frames: 806604800. Throughput: 0: 5711.6. Samples: 806607926. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:08,013][25689] Avg episode reward: [(0, '-1.502')] [2022-07-10 15:46:08,612][26022] Updated weights on worker 0-0, policy_version 787703 (0.00355) [2022-07-10 15:46:10,190][26022] Updated weights on worker 0-0, policy_version 787713 (0.00095) [2022-07-10 15:46:12,055][26022] Updated weights on worker 0-0, policy_version 787723 (0.00085) [2022-07-10 15:46:13,022][25689] Fps is (10 sec: 5478.1, 60 sec: 5546.9, 300 sec: 5555.2). Total num frames: 806632448. Throughput: 0: 5733.6. Samples: 806641660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:13,023][25689] Avg episode reward: [(0, '-2.423')] [2022-07-10 15:46:13,625][26022] Updated weights on worker 0-0, policy_version 787733 (0.00096) [2022-07-10 15:46:15,879][26022] Updated weights on worker 0-0, policy_version 787743 (0.00086) [2022-07-10 15:46:17,495][26022] Updated weights on worker 0-0, policy_version 787753 (0.00084) [2022-07-10 15:46:18,089][25689] Fps is (10 sec: 5589.3, 60 sec: 5549.3, 300 sec: 5547.7). Total num frames: 806661120. Throughput: 0: 4888.4. Samples: 806658362. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:18,090][25689] Avg episode reward: [(0, '-2.598')] [2022-07-10 15:46:19,343][26022] Updated weights on worker 0-0, policy_version 787763 (0.00088) [2022-07-10 15:46:21,203][26022] Updated weights on worker 0-0, policy_version 787773 (0.00089) [2022-07-10 15:46:23,171][25689] Fps is (10 sec: 5549.4, 60 sec: 5528.8, 300 sec: 5549.9). Total num frames: 806688768. Throughput: 0: 5705.4. Samples: 806691978. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:23,171][25689] Avg episode reward: [(0, '-3.214')] [2022-07-10 15:46:23,290][26022] Updated weights on worker 0-0, policy_version 787783 (0.00089) [2022-07-10 15:46:24,768][26022] Updated weights on worker 0-0, policy_version 787793 (0.00088) [2022-07-10 15:46:26,790][26022] Updated weights on worker 0-0, policy_version 787803 (0.00087) [2022-07-10 15:46:28,200][25689] Fps is (10 sec: 5671.2, 60 sec: 5564.3, 300 sec: 5560.2). Total num frames: 806718464. Throughput: 0: 5839.8. Samples: 806725820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:28,201][25689] Avg episode reward: [(0, '-3.089')] [2022-07-10 15:46:28,447][26022] Updated weights on worker 0-0, policy_version 787813 (0.00088) [2022-07-10 15:46:30,487][26022] Updated weights on worker 0-0, policy_version 787823 (0.00091) [2022-07-10 15:46:32,259][26022] Updated weights on worker 0-0, policy_version 787833 (0.00088) [2022-07-10 15:46:33,214][25689] Fps is (10 sec: 5811.8, 60 sec: 5567.9, 300 sec: 5557.6). Total num frames: 806747136. Throughput: 0: 5003.1. Samples: 806742682. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:33,214][25689] Avg episode reward: [(0, '-2.696')] [2022-07-10 15:46:34,006][26022] Updated weights on worker 0-0, policy_version 787843 (0.00088) [2022-07-10 15:46:35,853][26022] Updated weights on worker 0-0, policy_version 787853 (0.00084) [2022-07-10 15:46:37,515][26022] Updated weights on worker 0-0, policy_version 787863 (0.00087) [2022-07-10 15:46:38,276][25689] Fps is (10 sec: 5589.8, 60 sec: 5553.8, 300 sec: 5560.0). Total num frames: 806774784. Throughput: 0: 5856.4. Samples: 806776586. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:38,276][25689] Avg episode reward: [(0, '-1.895')] [2022-07-10 15:46:39,339][26022] Updated weights on worker 0-0, policy_version 787873 (0.00084) [2022-07-10 15:46:41,315][26022] Updated weights on worker 0-0, policy_version 787883 (0.00089) [2022-07-10 15:46:43,003][26022] Updated weights on worker 0-0, policy_version 787893 (0.00092) [2022-07-10 15:46:43,281][25689] Fps is (10 sec: 5594.3, 60 sec: 5571.2, 300 sec: 5557.7). Total num frames: 806803456. Throughput: 0: 5893.5. Samples: 806810498. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:43,282][25689] Avg episode reward: [(0, '-1.779')] [2022-07-10 15:46:43,313][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:46:43,326][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000787895_806804480.pth [2022-07-10 15:46:43,326][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000785941_804803584.pth [2022-07-10 15:46:45,010][26022] Updated weights on worker 0-0, policy_version 787903 (0.00095) [2022-07-10 15:46:46,687][26022] Updated weights on worker 0-0, policy_version 787913 (0.00092) [2022-07-10 15:46:48,301][25689] Fps is (10 sec: 5617.7, 60 sec: 5569.5, 300 sec: 5562.4). Total num frames: 806831104. Throughput: 0: 5053.3. Samples: 806827396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:48,302][25689] Avg episode reward: [(0, '-1.105')] [2022-07-10 15:46:48,647][26022] Updated weights on worker 0-0, policy_version 787923 (0.00091) [2022-07-10 15:46:50,195][26022] Updated weights on worker 0-0, policy_version 787933 (0.00086) [2022-07-10 15:46:52,303][26022] Updated weights on worker 0-0, policy_version 787943 (0.00088) [2022-07-10 15:46:53,313][25689] Fps is (10 sec: 5512.1, 60 sec: 5554.8, 300 sec: 5556.7). Total num frames: 806858752. Throughput: 0: 5883.1. Samples: 806860928. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:53,313][25689] Avg episode reward: [(0, '-1.236')] [2022-07-10 15:46:53,764][26022] Updated weights on worker 0-0, policy_version 787953 (0.00082) [2022-07-10 15:46:55,989][26022] Updated weights on worker 0-0, policy_version 787963 (0.00095) [2022-07-10 15:46:57,675][26022] Updated weights on worker 0-0, policy_version 787973 (0.00493) [2022-07-10 15:46:58,419][25689] Fps is (10 sec: 5566.6, 60 sec: 5550.2, 300 sec: 5555.4). Total num frames: 806887424. Throughput: 0: 5865.4. Samples: 806894732. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:46:58,420][25689] Avg episode reward: [(0, '-0.653')] [2022-07-10 15:46:59,531][26022] Updated weights on worker 0-0, policy_version 787983 (0.00092) [2022-07-10 15:47:01,369][26022] Updated weights on worker 0-0, policy_version 787993 (0.00088) [2022-07-10 15:47:03,450][25689] Fps is (10 sec: 5454.5, 60 sec: 5566.4, 300 sec: 5562.7). Total num frames: 806914048. Throughput: 0: 5003.7. Samples: 806911420. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:03,451][25689] Avg episode reward: [(0, '-0.556')] [2022-07-10 15:47:03,507][26022] Updated weights on worker 0-0, policy_version 788003 (0.00065) [2022-07-10 15:47:05,207][26022] Updated weights on worker 0-0, policy_version 788013 (0.00082) [2022-07-10 15:47:07,263][26022] Updated weights on worker 0-0, policy_version 788023 (0.00099) [2022-07-10 15:47:08,476][25689] Fps is (10 sec: 5396.1, 60 sec: 5571.8, 300 sec: 5558.9). Total num frames: 806941696. Throughput: 0: 5730.7. Samples: 806943012. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:08,477][25689] Avg episode reward: [(0, '-0.241')] [2022-07-10 15:47:08,909][26022] Updated weights on worker 0-0, policy_version 788033 (0.00086) [2022-07-10 15:47:10,936][26022] Updated weights on worker 0-0, policy_version 788043 (0.00094) [2022-07-10 15:47:12,651][26022] Updated weights on worker 0-0, policy_version 788053 (0.00092) [2022-07-10 15:47:13,492][25689] Fps is (10 sec: 5506.8, 60 sec: 5571.3, 300 sec: 5556.3). Total num frames: 806969344. Throughput: 0: 5750.0. Samples: 806976958. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:13,492][25689] Avg episode reward: [(0, '-0.423')] [2022-07-10 15:47:14,348][26022] Updated weights on worker 0-0, policy_version 788063 (0.00085) [2022-07-10 15:47:16,346][26022] Updated weights on worker 0-0, policy_version 788073 (0.00084) [2022-07-10 15:47:17,965][26022] Updated weights on worker 0-0, policy_version 788083 (0.00091) [2022-07-10 15:47:18,546][25689] Fps is (10 sec: 5694.8, 60 sec: 5589.4, 300 sec: 5566.6). Total num frames: 806999040. Throughput: 0: 4928.3. Samples: 806993924. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:18,547][25689] Avg episode reward: [(0, '-0.227')] [2022-07-10 15:47:19,847][26022] Updated weights on worker 0-0, policy_version 788093 (0.00083) [2022-07-10 15:47:21,585][26022] Updated weights on worker 0-0, policy_version 788103 (0.00097) [2022-07-10 15:47:23,556][25689] Fps is (10 sec: 5799.4, 60 sec: 5613.0, 300 sec: 5566.4). Total num frames: 807027712. Throughput: 0: 5786.1. Samples: 807027754. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:23,557][25689] Avg episode reward: [(0, '-0.688')] [2022-07-10 15:47:23,559][26022] Updated weights on worker 0-0, policy_version 788113 (0.00083) [2022-07-10 15:47:25,187][26022] Updated weights on worker 0-0, policy_version 788123 (0.00086) [2022-07-10 15:47:27,334][26022] Updated weights on worker 0-0, policy_version 788133 (0.00080) [2022-07-10 15:47:28,595][25689] Fps is (10 sec: 5604.8, 60 sec: 5578.3, 300 sec: 5562.8). Total num frames: 807055360. Throughput: 0: 5891.6. Samples: 807061540. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:28,596][25689] Avg episode reward: [(0, '-0.612')] [2022-07-10 15:47:28,973][26022] Updated weights on worker 0-0, policy_version 788143 (0.00086) [2022-07-10 15:47:30,958][26022] Updated weights on worker 0-0, policy_version 788153 (0.00089) [2022-07-10 15:47:32,688][26022] Updated weights on worker 0-0, policy_version 788163 (0.00084) [2022-07-10 15:47:33,635][25689] Fps is (10 sec: 5385.0, 60 sec: 5541.9, 300 sec: 5557.4). Total num frames: 807081984. Throughput: 0: 5029.3. Samples: 807078262. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:33,636][25689] Avg episode reward: [(0, '-0.428')] [2022-07-10 15:47:34,486][26022] Updated weights on worker 0-0, policy_version 788173 (0.00091) [2022-07-10 15:47:36,517][26022] Updated weights on worker 0-0, policy_version 788183 (0.00097) [2022-07-10 15:47:38,228][26022] Updated weights on worker 0-0, policy_version 788193 (0.00090) [2022-07-10 15:47:38,691][25689] Fps is (10 sec: 5578.0, 60 sec: 5576.3, 300 sec: 5561.1). Total num frames: 807111680. Throughput: 0: 5842.0. Samples: 807111614. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:38,692][25689] Avg episode reward: [(0, '-1.155')] [2022-07-10 15:47:40,003][26022] Updated weights on worker 0-0, policy_version 788203 (0.00093) [2022-07-10 15:47:42,084][26022] Updated weights on worker 0-0, policy_version 788213 (0.00087) [2022-07-10 15:47:43,558][26022] Updated weights on worker 0-0, policy_version 788223 (0.00089) [2022-07-10 15:47:43,761][25689] Fps is (10 sec: 5763.9, 60 sec: 5570.4, 300 sec: 5564.1). Total num frames: 807140352. Throughput: 0: 5809.9. Samples: 807145142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:43,762][25689] Avg episode reward: [(0, '-1.492')] [2022-07-10 15:47:45,604][26022] Updated weights on worker 0-0, policy_version 788233 (0.00097) [2022-07-10 15:47:47,351][26022] Updated weights on worker 0-0, policy_version 788243 (0.00108) [2022-07-10 15:47:48,852][25689] Fps is (10 sec: 5542.9, 60 sec: 5563.9, 300 sec: 5562.6). Total num frames: 807168000. Throughput: 0: 4960.3. Samples: 807162024. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:48,853][25689] Avg episode reward: [(0, '-3.017')] [2022-07-10 15:47:49,191][26022] Updated weights on worker 0-0, policy_version 788253 (0.00088) [2022-07-10 15:47:50,977][26022] Updated weights on worker 0-0, policy_version 788263 (0.00092) [2022-07-10 15:47:52,850][26022] Updated weights on worker 0-0, policy_version 788273 (0.00082) [2022-07-10 15:47:53,866][25689] Fps is (10 sec: 5675.0, 60 sec: 5597.5, 300 sec: 5566.8). Total num frames: 807197696. Throughput: 0: 5813.0. Samples: 807195866. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:53,866][25689] Avg episode reward: [(0, '-3.884')] [2022-07-10 15:47:54,812][26022] Updated weights on worker 0-0, policy_version 788283 (0.00104) [2022-07-10 15:47:56,703][26022] Updated weights on worker 0-0, policy_version 788293 (0.00095) [2022-07-10 15:47:58,323][26022] Updated weights on worker 0-0, policy_version 788303 (0.00075) [2022-07-10 15:47:58,918][25689] Fps is (10 sec: 5594.9, 60 sec: 5568.6, 300 sec: 5559.1). Total num frames: 807224320. Throughput: 0: 5805.5. Samples: 807229042. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:47:58,919][25689] Avg episode reward: [(0, '-3.734')] [2022-07-10 15:48:00,249][26022] Updated weights on worker 0-0, policy_version 788313 (0.00089) [2022-07-10 15:48:02,445][26022] Updated weights on worker 0-0, policy_version 788323 (0.00583) [2022-07-10 15:48:03,951][25689] Fps is (10 sec: 5279.8, 60 sec: 5568.5, 300 sec: 5562.4). Total num frames: 807250944. Throughput: 0: 4995.9. Samples: 807246010. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:03,952][25689] Avg episode reward: [(0, '-3.714')] [2022-07-10 15:48:04,227][26022] Updated weights on worker 0-0, policy_version 788333 (0.00084) [2022-07-10 15:48:06,244][26022] Updated weights on worker 0-0, policy_version 788343 (0.00087) [2022-07-10 15:48:07,682][26022] Updated weights on worker 0-0, policy_version 788353 (0.00085) [2022-07-10 15:48:08,985][25689] Fps is (10 sec: 5391.5, 60 sec: 5567.8, 300 sec: 5558.6). Total num frames: 807278592. Throughput: 0: 5741.3. Samples: 807277612. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:08,986][25689] Avg episode reward: [(0, '-3.802')] [2022-07-10 15:48:09,849][26022] Updated weights on worker 0-0, policy_version 788363 (0.00090) [2022-07-10 15:48:11,603][26022] Updated weights on worker 0-0, policy_version 788373 (0.00089) [2022-07-10 15:48:13,487][26022] Updated weights on worker 0-0, policy_version 788383 (0.00093) [2022-07-10 15:48:14,008][25689] Fps is (10 sec: 5600.0, 60 sec: 5583.9, 300 sec: 5562.6). Total num frames: 807307264. Throughput: 0: 5729.7. Samples: 807311276. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:14,009][25689] Avg episode reward: [(0, '-2.119')] [2022-07-10 15:48:15,255][26022] Updated weights on worker 0-0, policy_version 788393 (0.00080) [2022-07-10 15:48:17,220][26022] Updated weights on worker 0-0, policy_version 788403 (0.00088) [2022-07-10 15:48:18,903][26022] Updated weights on worker 0-0, policy_version 788413 (0.00085) [2022-07-10 15:48:19,059][25689] Fps is (10 sec: 5692.2, 60 sec: 5567.4, 300 sec: 5569.1). Total num frames: 807335936. Throughput: 0: 5753.1. Samples: 807344912. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:19,059][25689] Avg episode reward: [(0, '-2.492')] [2022-07-10 15:48:20,856][26022] Updated weights on worker 0-0, policy_version 788423 (0.00093) [2022-07-10 15:48:22,532][26022] Updated weights on worker 0-0, policy_version 788433 (0.00097) [2022-07-10 15:48:24,071][25689] Fps is (10 sec: 5596.8, 60 sec: 5550.2, 300 sec: 5558.6). Total num frames: 807363584. Throughput: 0: 5744.7. Samples: 807361594. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:24,072][25689] Avg episode reward: [(0, '-1.432')] [2022-07-10 15:48:24,480][26022] Updated weights on worker 0-0, policy_version 788443 (0.00108) [2022-07-10 15:48:26,369][26022] Updated weights on worker 0-0, policy_version 788453 (0.00087) [2022-07-10 15:48:28,207][26022] Updated weights on worker 0-0, policy_version 788463 (0.00093) [2022-07-10 15:48:29,078][25689] Fps is (10 sec: 5416.8, 60 sec: 5536.2, 300 sec: 5558.6). Total num frames: 807390208. Throughput: 0: 5830.5. Samples: 807394766. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:29,078][25689] Avg episode reward: [(0, '-0.772')] [2022-07-10 15:48:30,128][26022] Updated weights on worker 0-0, policy_version 788473 (0.00089) [2022-07-10 15:48:31,848][26022] Updated weights on worker 0-0, policy_version 788483 (0.00088) [2022-07-10 15:48:33,748][26022] Updated weights on worker 0-0, policy_version 788493 (0.00089) [2022-07-10 15:48:34,088][25689] Fps is (10 sec: 5418.3, 60 sec: 5555.9, 300 sec: 5555.9). Total num frames: 807417856. Throughput: 0: 5800.8. Samples: 807427752. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:34,088][25689] Avg episode reward: [(0, '-1.107')] [2022-07-10 15:48:35,822][26022] Updated weights on worker 0-0, policy_version 788503 (0.00085) [2022-07-10 15:48:37,380][26022] Updated weights on worker 0-0, policy_version 788513 (0.00086) [2022-07-10 15:48:39,132][25689] Fps is (10 sec: 5499.7, 60 sec: 5523.2, 300 sec: 5553.2). Total num frames: 807445504. Throughput: 0: 4960.9. Samples: 807444494. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:39,133][25689] Avg episode reward: [(0, '-0.348')] [2022-07-10 15:48:39,358][26022] Updated weights on worker 0-0, policy_version 788523 (0.00089) [2022-07-10 15:48:41,060][26022] Updated weights on worker 0-0, policy_version 788533 (0.00087) [2022-07-10 15:48:43,109][26022] Updated weights on worker 0-0, policy_version 788543 (0.00084) [2022-07-10 15:48:43,359][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:48:43,369][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000788544_807469056.pth [2022-07-10 15:48:43,371][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000786590_805468160.pth [2022-07-10 15:48:44,143][25689] Fps is (10 sec: 5499.4, 60 sec: 5511.6, 300 sec: 5546.5). Total num frames: 807473152. Throughput: 0: 5794.9. Samples: 807477906. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:44,143][25689] Avg episode reward: [(0, '-0.685')] [2022-07-10 15:48:44,699][26022] Updated weights on worker 0-0, policy_version 788553 (0.00082) [2022-07-10 15:48:46,748][26022] Updated weights on worker 0-0, policy_version 788563 (0.00084) [2022-07-10 15:48:48,377][26022] Updated weights on worker 0-0, policy_version 788573 (0.00093) [2022-07-10 15:48:49,164][25689] Fps is (10 sec: 5716.2, 60 sec: 5551.9, 300 sec: 5559.9). Total num frames: 807502848. Throughput: 0: 5828.7. Samples: 807511842. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:49,164][25689] Avg episode reward: [(0, '-0.725')] [2022-07-10 15:48:50,102][26022] Updated weights on worker 0-0, policy_version 788583 (0.00079) [2022-07-10 15:48:51,923][26022] Updated weights on worker 0-0, policy_version 788593 (0.00082) [2022-07-10 15:48:54,191][25689] Fps is (10 sec: 5503.2, 60 sec: 5482.9, 300 sec: 5549.9). Total num frames: 807528448. Throughput: 0: 5005.3. Samples: 807528374. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:54,191][25689] Avg episode reward: [(0, '-1.633')] [2022-07-10 15:48:54,201][26022] Updated weights on worker 0-0, policy_version 788603 (0.00093) [2022-07-10 15:48:55,680][26022] Updated weights on worker 0-0, policy_version 788613 (0.00086) [2022-07-10 15:48:57,873][26022] Updated weights on worker 0-0, policy_version 788623 (0.00087) [2022-07-10 15:48:59,331][25689] Fps is (10 sec: 5539.5, 60 sec: 5542.7, 300 sec: 5554.3). Total num frames: 807559168. Throughput: 0: 5816.8. Samples: 807561986. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:48:59,332][25689] Avg episode reward: [(0, '-1.869')] [2022-07-10 15:48:59,402][26022] Updated weights on worker 0-0, policy_version 788633 (0.00091) [2022-07-10 15:49:01,650][26022] Updated weights on worker 0-0, policy_version 788643 (0.00083) [2022-07-10 15:49:03,334][26022] Updated weights on worker 0-0, policy_version 788653 (0.00095) [2022-07-10 15:49:04,425][25689] Fps is (10 sec: 5502.8, 60 sec: 5520.1, 300 sec: 5554.0). Total num frames: 807584768. Throughput: 0: 5711.6. Samples: 807593754. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:49:04,428][25689] Avg episode reward: [(0, '-2.158')] [2022-07-10 15:49:05,355][26022] Updated weights on worker 0-0, policy_version 788663 (0.00086) [2022-07-10 15:49:07,047][26022] Updated weights on worker 0-0, policy_version 788673 (0.00085) [2022-07-10 15:49:09,138][26022] Updated weights on worker 0-0, policy_version 788683 (0.00086) [2022-07-10 15:49:09,440][25689] Fps is (10 sec: 5368.7, 60 sec: 5538.8, 300 sec: 5554.0). Total num frames: 807613440. Throughput: 0: 4874.3. Samples: 807610674. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:49:09,441][25689] Avg episode reward: [(0, '-2.809')] [2022-07-10 15:49:10,524][26022] Updated weights on worker 0-0, policy_version 788693 (0.00085) [2022-07-10 15:49:12,671][26022] Updated weights on worker 0-0, policy_version 788703 (0.00090) [2022-07-10 15:49:14,337][26022] Updated weights on worker 0-0, policy_version 788713 (0.00090) [2022-07-10 15:49:14,532][25689] Fps is (10 sec: 5775.4, 60 sec: 5549.5, 300 sec: 5561.9). Total num frames: 807643136. Throughput: 0: 5699.9. Samples: 807644316. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:49:14,537][25689] Avg episode reward: [(0, '-1.901')] [2022-07-10 15:49:16,264][26022] Updated weights on worker 0-0, policy_version 788723 (0.00088) [2022-07-10 15:49:18,219][26022] Updated weights on worker 0-0, policy_version 788733 (0.00089) [2022-07-10 15:49:19,638][25689] Fps is (10 sec: 5723.4, 60 sec: 5544.3, 300 sec: 5564.1). Total num frames: 807671808. Throughput: 0: 5707.6. Samples: 807677890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-10 15:49:19,640][25689] Avg episode reward: [(0, '-1.881')] [2022-07-10 15:49:19,927][26022] Updated weights on worker 0-0, policy_version 788743 (0.00086) [2022-07-10 15:49:21,776][26022] Updated weights on worker 0-0, policy_version 788753 (0.00090) [2022-07-10 15:49:23,566][26022] Updated weights on worker 0-0, policy_version 788763 (0.00096) [2022-07-10 15:49:24,681][25689] Fps is (10 sec: 5448.5, 60 sec: 5524.7, 300 sec: 5557.0). Total num frames: 807698432. Throughput: 0: 4992.0. Samples: 807694868. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:24,683][25689] Avg episode reward: [(0, '-2.109')] [2022-07-10 15:49:25,490][26022] Updated weights on worker 0-0, policy_version 788773 (0.00092) [2022-07-10 15:49:27,399][26022] Updated weights on worker 0-0, policy_version 788783 (0.00103) [2022-07-10 15:49:28,890][26022] Updated weights on worker 0-0, policy_version 788793 (0.00095) [2022-07-10 15:49:29,702][25689] Fps is (10 sec: 5494.6, 60 sec: 5557.1, 300 sec: 5560.5). Total num frames: 807727104. Throughput: 0: 5793.2. Samples: 807728056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:29,702][25689] Avg episode reward: [(0, '-1.565')] [2022-07-10 15:49:31,099][26022] Updated weights on worker 0-0, policy_version 788803 (0.00087) [2022-07-10 15:49:32,701][26022] Updated weights on worker 0-0, policy_version 788813 (0.00078) [2022-07-10 15:49:34,636][26022] Updated weights on worker 0-0, policy_version 788823 (0.00085) [2022-07-10 15:49:34,719][25689] Fps is (10 sec: 5610.7, 60 sec: 5556.5, 300 sec: 5558.6). Total num frames: 807754752. Throughput: 0: 5820.5. Samples: 807761814. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:34,720][25689] Avg episode reward: [(0, '-1.535')] [2022-07-10 15:49:36,404][26022] Updated weights on worker 0-0, policy_version 788833 (0.00084) [2022-07-10 15:49:38,210][26022] Updated weights on worker 0-0, policy_version 788843 (0.00096) [2022-07-10 15:49:39,768][25689] Fps is (10 sec: 5594.7, 60 sec: 5572.9, 300 sec: 5559.6). Total num frames: 807783424. Throughput: 0: 4998.3. Samples: 807778508. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:39,770][25689] Avg episode reward: [(0, '-1.017')] [2022-07-10 15:49:40,010][26022] Updated weights on worker 0-0, policy_version 788853 (0.00096) [2022-07-10 15:49:41,959][26022] Updated weights on worker 0-0, policy_version 788863 (0.00088) [2022-07-10 15:49:43,802][26022] Updated weights on worker 0-0, policy_version 788873 (0.00089) [2022-07-10 15:49:44,807][25689] Fps is (10 sec: 5481.4, 60 sec: 5553.5, 300 sec: 5555.7). Total num frames: 807810048. Throughput: 0: 5810.8. Samples: 807811816. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:44,809][25689] Avg episode reward: [(0, '-1.234')] [2022-07-10 15:49:45,635][26022] Updated weights on worker 0-0, policy_version 788883 (0.00091) [2022-07-10 15:49:47,527][26022] Updated weights on worker 0-0, policy_version 788893 (0.00087) [2022-07-10 15:49:49,329][26022] Updated weights on worker 0-0, policy_version 788903 (0.00054) [2022-07-10 15:49:49,815][25689] Fps is (10 sec: 5504.0, 60 sec: 5537.8, 300 sec: 5559.0). Total num frames: 807838720. Throughput: 0: 5830.7. Samples: 807845330. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:49,815][25689] Avg episode reward: [(0, '-0.878')] [2022-07-10 15:49:51,291][26022] Updated weights on worker 0-0, policy_version 788913 (0.00369) [2022-07-10 15:49:53,144][26022] Updated weights on worker 0-0, policy_version 788923 (0.00085) [2022-07-10 15:49:54,829][25689] Fps is (10 sec: 5619.6, 60 sec: 5572.7, 300 sec: 5557.0). Total num frames: 807866368. Throughput: 0: 4985.6. Samples: 807862074. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:54,829][25689] Avg episode reward: [(0, '-0.721')] [2022-07-10 15:49:54,958][26022] Updated weights on worker 0-0, policy_version 788933 (0.00092) [2022-07-10 15:49:56,498][26022] Updated weights on worker 0-0, policy_version 788943 (0.00086) [2022-07-10 15:49:58,675][26022] Updated weights on worker 0-0, policy_version 788953 (0.00088) [2022-07-10 15:49:59,943][25689] Fps is (10 sec: 5560.7, 60 sec: 5541.3, 300 sec: 5558.8). Total num frames: 807895040. Throughput: 0: 5789.6. Samples: 807895310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:49:59,945][25689] Avg episode reward: [(0, '-0.504')] [2022-07-10 15:50:00,439][26022] Updated weights on worker 0-0, policy_version 788963 (0.00087) [2022-07-10 15:50:02,603][26022] Updated weights on worker 0-0, policy_version 788973 (0.00087) [2022-07-10 15:50:04,367][26022] Updated weights on worker 0-0, policy_version 788983 (0.00090) [2022-07-10 15:50:04,976][25689] Fps is (10 sec: 5348.4, 60 sec: 5547.0, 300 sec: 5551.7). Total num frames: 807920640. Throughput: 0: 5705.7. Samples: 807926898. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:04,977][25689] Avg episode reward: [(0, '-0.441')] [2022-07-10 15:50:06,245][26022] Updated weights on worker 0-0, policy_version 788993 (0.00091) [2022-07-10 15:50:08,050][26022] Updated weights on worker 0-0, policy_version 789003 (0.00090) [2022-07-10 15:50:09,997][25689] Fps is (10 sec: 5296.2, 60 sec: 5529.5, 300 sec: 5551.5). Total num frames: 807948288. Throughput: 0: 4874.9. Samples: 807943716. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:09,997][25689] Avg episode reward: [(0, '-0.793')] [2022-07-10 15:50:10,076][26022] Updated weights on worker 0-0, policy_version 789013 (0.01391) [2022-07-10 15:50:11,602][26022] Updated weights on worker 0-0, policy_version 789023 (0.00088) [2022-07-10 15:50:13,623][26022] Updated weights on worker 0-0, policy_version 789033 (0.00094) [2022-07-10 15:50:15,000][25689] Fps is (10 sec: 5720.8, 60 sec: 5537.6, 300 sec: 5556.6). Total num frames: 807977984. Throughput: 0: 5712.2. Samples: 807977296. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:15,001][25689] Avg episode reward: [(0, '-0.616')] [2022-07-10 15:50:15,325][26022] Updated weights on worker 0-0, policy_version 789043 (0.00097) [2022-07-10 15:50:17,356][26022] Updated weights on worker 0-0, policy_version 789053 (0.00090) [2022-07-10 15:50:19,169][26022] Updated weights on worker 0-0, policy_version 789063 (0.00084) [2022-07-10 15:50:20,073][25689] Fps is (10 sec: 5793.0, 60 sec: 5540.7, 300 sec: 5556.0). Total num frames: 808006656. Throughput: 0: 5738.0. Samples: 808010814. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:20,074][25689] Avg episode reward: [(0, '0.009')] [2022-07-10 15:50:20,892][26022] Updated weights on worker 0-0, policy_version 789073 (0.00090) [2022-07-10 15:50:22,646][26022] Updated weights on worker 0-0, policy_version 789083 (0.00096) [2022-07-10 15:50:24,568][26022] Updated weights on worker 0-0, policy_version 789093 (0.00091) [2022-07-10 15:50:25,092][25689] Fps is (10 sec: 5479.1, 60 sec: 5542.8, 300 sec: 5553.1). Total num frames: 808033280. Throughput: 0: 5010.6. Samples: 808027690. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:25,093][25689] Avg episode reward: [(0, '-0.184')] [2022-07-10 15:50:26,209][26022] Updated weights on worker 0-0, policy_version 789103 (0.00086) [2022-07-10 15:50:28,440][26022] Updated weights on worker 0-0, policy_version 789113 (0.00099) [2022-07-10 15:50:29,861][26022] Updated weights on worker 0-0, policy_version 789123 (0.00087) [2022-07-10 15:50:30,149][25689] Fps is (10 sec: 5487.6, 60 sec: 5539.5, 300 sec: 5553.0). Total num frames: 808061952. Throughput: 0: 5827.9. Samples: 808061160. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:30,150][25689] Avg episode reward: [(0, '-0.168')] [2022-07-10 15:50:32,115][26022] Updated weights on worker 0-0, policy_version 789133 (0.00080) [2022-07-10 15:50:33,578][26022] Updated weights on worker 0-0, policy_version 789143 (0.00086) [2022-07-10 15:50:35,167][25689] Fps is (10 sec: 5488.6, 60 sec: 5522.5, 300 sec: 5547.6). Total num frames: 808088576. Throughput: 0: 5838.2. Samples: 808095034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:35,167][25689] Avg episode reward: [(0, '0.051')] [2022-07-10 15:50:35,568][26022] Updated weights on worker 0-0, policy_version 789153 (0.00086) [2022-07-10 15:50:37,439][26022] Updated weights on worker 0-0, policy_version 789163 (0.00087) [2022-07-10 15:50:39,159][26022] Updated weights on worker 0-0, policy_version 789173 (0.00085) [2022-07-10 15:50:40,254][25689] Fps is (10 sec: 5573.3, 60 sec: 5536.0, 300 sec: 5553.0). Total num frames: 808118272. Throughput: 0: 5003.1. Samples: 808111786. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:40,255][25689] Avg episode reward: [(0, '-0.503')] [2022-07-10 15:50:41,016][26022] Updated weights on worker 0-0, policy_version 789183 (0.00093) [2022-07-10 15:50:42,933][26022] Updated weights on worker 0-0, policy_version 789193 (0.00097) [2022-07-10 15:50:43,687][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:50:43,698][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000789197_808137728.pth [2022-07-10 15:50:43,698][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000787242_806135808.pth [2022-07-10 15:50:44,666][26022] Updated weights on worker 0-0, policy_version 789203 (0.00085) [2022-07-10 15:50:45,283][25689] Fps is (10 sec: 5769.5, 60 sec: 5570.7, 300 sec: 5556.0). Total num frames: 808146944. Throughput: 0: 5845.1. Samples: 808145710. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:45,284][25689] Avg episode reward: [(0, '-1.624')] [2022-07-10 15:50:46,571][26022] Updated weights on worker 0-0, policy_version 789213 (0.00091) [2022-07-10 15:50:48,213][26022] Updated weights on worker 0-0, policy_version 789223 (0.00096) [2022-07-10 15:50:50,212][26022] Updated weights on worker 0-0, policy_version 789233 (0.00085) [2022-07-10 15:50:50,302][25689] Fps is (10 sec: 5605.1, 60 sec: 5552.8, 300 sec: 5552.8). Total num frames: 808174592. Throughput: 0: 5870.8. Samples: 808179476. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:50,303][25689] Avg episode reward: [(0, '-1.711')] [2022-07-10 15:50:51,940][26022] Updated weights on worker 0-0, policy_version 789243 (0.00084) [2022-07-10 15:50:53,762][26022] Updated weights on worker 0-0, policy_version 789253 (0.00090) [2022-07-10 15:50:55,320][25689] Fps is (10 sec: 5611.3, 60 sec: 5569.3, 300 sec: 5553.5). Total num frames: 808203264. Throughput: 0: 5018.8. Samples: 808196180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:50:55,321][25689] Avg episode reward: [(0, '-2.261')] [2022-07-10 15:50:55,577][26022] Updated weights on worker 0-0, policy_version 789263 (0.00093) [2022-07-10 15:50:57,411][26022] Updated weights on worker 0-0, policy_version 789273 (0.00088) [2022-07-10 15:50:59,170][26022] Updated weights on worker 0-0, policy_version 789283 (0.01013) [2022-07-10 15:51:00,386][25689] Fps is (10 sec: 5585.4, 60 sec: 5556.9, 300 sec: 5559.6). Total num frames: 808230912. Throughput: 0: 5859.9. Samples: 808229754. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:00,386][25689] Avg episode reward: [(0, '-2.699')] [2022-07-10 15:51:01,288][26022] Updated weights on worker 0-0, policy_version 789293 (0.00092) [2022-07-10 15:51:03,363][26022] Updated weights on worker 0-0, policy_version 789303 (0.00095) [2022-07-10 15:51:05,033][26022] Updated weights on worker 0-0, policy_version 789313 (0.00078) [2022-07-10 15:51:05,457][25689] Fps is (10 sec: 5353.8, 60 sec: 5570.3, 300 sec: 5556.4). Total num frames: 808257536. Throughput: 0: 5722.3. Samples: 808261152. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:05,458][25689] Avg episode reward: [(0, '-2.433')] [2022-07-10 15:51:07,122][26022] Updated weights on worker 0-0, policy_version 789323 (0.00084) [2022-07-10 15:51:08,716][26022] Updated weights on worker 0-0, policy_version 789333 (0.00095) [2022-07-10 15:51:10,472][25689] Fps is (10 sec: 5381.1, 60 sec: 5570.9, 300 sec: 5556.4). Total num frames: 808285184. Throughput: 0: 5696.5. Samples: 808294370. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:10,475][25689] Avg episode reward: [(0, '-1.913')] [2022-07-10 15:51:10,935][26022] Updated weights on worker 0-0, policy_version 789343 (0.00900) [2022-07-10 15:51:12,659][26022] Updated weights on worker 0-0, policy_version 789353 (0.00083) [2022-07-10 15:51:14,436][26022] Updated weights on worker 0-0, policy_version 789363 (0.00090) [2022-07-10 15:51:15,503][25689] Fps is (10 sec: 5504.7, 60 sec: 5534.4, 300 sec: 5553.6). Total num frames: 808312832. Throughput: 0: 5686.8. Samples: 808310952. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:15,503][25689] Avg episode reward: [(0, '-2.097')] [2022-07-10 15:51:16,193][26022] Updated weights on worker 0-0, policy_version 789373 (0.00081) [2022-07-10 15:51:18,165][26022] Updated weights on worker 0-0, policy_version 789383 (0.00093) [2022-07-10 15:51:19,910][26022] Updated weights on worker 0-0, policy_version 789393 (0.00099) [2022-07-10 15:51:20,564][25689] Fps is (10 sec: 5681.8, 60 sec: 5552.4, 300 sec: 5560.8). Total num frames: 808342528. Throughput: 0: 5682.7. Samples: 808344422. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:20,565][25689] Avg episode reward: [(0, '-2.413')] [2022-07-10 15:51:21,847][26022] Updated weights on worker 0-0, policy_version 789403 (0.00097) [2022-07-10 15:51:23,575][26022] Updated weights on worker 0-0, policy_version 789413 (0.00088) [2022-07-10 15:51:25,428][26022] Updated weights on worker 0-0, policy_version 789423 (0.00087) [2022-07-10 15:51:25,603][25689] Fps is (10 sec: 5677.5, 60 sec: 5567.5, 300 sec: 5553.8). Total num frames: 808370176. Throughput: 0: 5799.9. Samples: 808377994. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:25,603][25689] Avg episode reward: [(0, '-1.464')] [2022-07-10 15:51:27,333][26022] Updated weights on worker 0-0, policy_version 789433 (0.00274) [2022-07-10 15:51:29,128][26022] Updated weights on worker 0-0, policy_version 789443 (0.00090) [2022-07-10 15:51:30,655][25689] Fps is (10 sec: 5378.5, 60 sec: 5534.2, 300 sec: 5546.2). Total num frames: 808396800. Throughput: 0: 4980.3. Samples: 808394888. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:30,655][25689] Avg episode reward: [(0, '-0.886')] [2022-07-10 15:51:31,019][26022] Updated weights on worker 0-0, policy_version 789453 (0.00863) [2022-07-10 15:51:32,876][26022] Updated weights on worker 0-0, policy_version 789463 (0.00091) [2022-07-10 15:51:34,648][26022] Updated weights on worker 0-0, policy_version 789473 (0.00845) [2022-07-10 15:51:35,662][25689] Fps is (10 sec: 5598.8, 60 sec: 5585.9, 300 sec: 5554.1). Total num frames: 808426496. Throughput: 0: 5828.8. Samples: 808428458. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:35,663][25689] Avg episode reward: [(0, '-0.716')] [2022-07-10 15:51:36,488][26022] Updated weights on worker 0-0, policy_version 789483 (0.00089) [2022-07-10 15:51:38,167][26022] Updated weights on worker 0-0, policy_version 789493 (0.00092) [2022-07-10 15:51:40,274][26022] Updated weights on worker 0-0, policy_version 789503 (0.00061) [2022-07-10 15:51:40,782][25689] Fps is (10 sec: 5763.1, 60 sec: 5566.0, 300 sec: 5551.9). Total num frames: 808455168. Throughput: 0: 5828.6. Samples: 808462266. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:40,783][25689] Avg episode reward: [(0, '-0.300')] [2022-07-10 15:51:41,806][26022] Updated weights on worker 0-0, policy_version 789513 (0.00089) [2022-07-10 15:51:43,761][26022] Updated weights on worker 0-0, policy_version 789523 (0.00083) [2022-07-10 15:51:45,432][26022] Updated weights on worker 0-0, policy_version 789533 (0.00089) [2022-07-10 15:51:45,803][25689] Fps is (10 sec: 5452.9, 60 sec: 5532.9, 300 sec: 5548.5). Total num frames: 808481792. Throughput: 0: 5009.2. Samples: 808479180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:45,804][25689] Avg episode reward: [(0, '-0.244')] [2022-07-10 15:51:47,375][26022] Updated weights on worker 0-0, policy_version 789543 (0.00082) [2022-07-10 15:51:49,328][26022] Updated weights on worker 0-0, policy_version 789553 (0.00094) [2022-07-10 15:51:50,810][25689] Fps is (10 sec: 5514.3, 60 sec: 5550.9, 300 sec: 5552.0). Total num frames: 808510464. Throughput: 0: 5861.6. Samples: 808513032. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:50,811][25689] Avg episode reward: [(0, '0.037')] [2022-07-10 15:51:50,953][26022] Updated weights on worker 0-0, policy_version 789563 (0.00082) [2022-07-10 15:51:52,730][26022] Updated weights on worker 0-0, policy_version 789573 (0.00086) [2022-07-10 15:51:54,704][26022] Updated weights on worker 0-0, policy_version 789583 (0.00091) [2022-07-10 15:51:55,870][25689] Fps is (10 sec: 5695.9, 60 sec: 5547.0, 300 sec: 5552.9). Total num frames: 808539136. Throughput: 0: 5854.8. Samples: 808546774. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:51:55,871][25689] Avg episode reward: [(0, '0.069')] [2022-07-10 15:51:56,375][26022] Updated weights on worker 0-0, policy_version 789593 (0.00085) [2022-07-10 15:51:58,561][26022] Updated weights on worker 0-0, policy_version 789603 (0.00086) [2022-07-10 15:52:00,111][26022] Updated weights on worker 0-0, policy_version 789613 (0.00090) [2022-07-10 15:52:00,914][25689] Fps is (10 sec: 5574.3, 60 sec: 5549.1, 300 sec: 5556.1). Total num frames: 808566784. Throughput: 0: 5024.3. Samples: 808563412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:00,914][25689] Avg episode reward: [(0, '-0.130')] [2022-07-10 15:52:02,627][26022] Updated weights on worker 0-0, policy_version 789623 (0.00081) [2022-07-10 15:52:04,071][26022] Updated weights on worker 0-0, policy_version 789633 (0.00084) [2022-07-10 15:52:05,915][25689] Fps is (10 sec: 5301.3, 60 sec: 5538.6, 300 sec: 5549.7). Total num frames: 808592384. Throughput: 0: 5756.7. Samples: 808594958. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:05,915][25689] Avg episode reward: [(0, '-0.170')] [2022-07-10 15:52:06,263][26022] Updated weights on worker 0-0, policy_version 789643 (0.00088) [2022-07-10 15:52:07,694][26022] Updated weights on worker 0-0, policy_version 789653 (0.00090) [2022-07-10 15:52:09,992][26022] Updated weights on worker 0-0, policy_version 789663 (0.00092) [2022-07-10 15:52:10,956][25689] Fps is (10 sec: 5506.1, 60 sec: 5569.9, 300 sec: 5556.1). Total num frames: 808622080. Throughput: 0: 5727.9. Samples: 808628428. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:10,957][25689] Avg episode reward: [(0, '-0.474')] [2022-07-10 15:52:11,490][26022] Updated weights on worker 0-0, policy_version 789673 (0.00090) [2022-07-10 15:52:13,645][26022] Updated weights on worker 0-0, policy_version 789683 (0.00085) [2022-07-10 15:52:15,134][26022] Updated weights on worker 0-0, policy_version 789693 (0.00085) [2022-07-10 15:52:15,978][25689] Fps is (10 sec: 5596.4, 60 sec: 5553.8, 300 sec: 5546.3). Total num frames: 808648704. Throughput: 0: 4890.1. Samples: 808645104. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:15,979][25689] Avg episode reward: [(0, '-0.126')] [2022-07-10 15:52:17,357][26022] Updated weights on worker 0-0, policy_version 789703 (0.00084) [2022-07-10 15:52:18,806][26022] Updated weights on worker 0-0, policy_version 789713 (0.00104) [2022-07-10 15:52:21,019][26022] Updated weights on worker 0-0, policy_version 789723 (0.00084) [2022-07-10 15:52:21,100][25689] Fps is (10 sec: 5451.2, 60 sec: 5531.4, 300 sec: 5544.3). Total num frames: 808677376. Throughput: 0: 5699.5. Samples: 808678464. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:21,101][25689] Avg episode reward: [(0, '-1.120')] [2022-07-10 15:52:22,492][26022] Updated weights on worker 0-0, policy_version 789733 (0.00088) [2022-07-10 15:52:24,539][26022] Updated weights on worker 0-0, policy_version 789743 (0.00088) [2022-07-10 15:52:26,135][25689] Fps is (10 sec: 5646.1, 60 sec: 5548.7, 300 sec: 5547.8). Total num frames: 808706048. Throughput: 0: 5785.0. Samples: 808711930. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:26,136][25689] Avg episode reward: [(0, '-1.080')] [2022-07-10 15:52:26,361][26022] Updated weights on worker 0-0, policy_version 789753 (0.00086) [2022-07-10 15:52:28,184][26022] Updated weights on worker 0-0, policy_version 789763 (0.00083) [2022-07-10 15:52:30,167][26022] Updated weights on worker 0-0, policy_version 789773 (0.00091) [2022-07-10 15:52:31,164][25689] Fps is (10 sec: 5392.5, 60 sec: 5533.8, 300 sec: 5544.5). Total num frames: 808731648. Throughput: 0: 4954.1. Samples: 808728540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:31,165][25689] Avg episode reward: [(0, '-1.015')] [2022-07-10 15:52:31,898][26022] Updated weights on worker 0-0, policy_version 789783 (0.00437) [2022-07-10 15:52:33,825][26022] Updated weights on worker 0-0, policy_version 789793 (0.00093) [2022-07-10 15:52:35,565][26022] Updated weights on worker 0-0, policy_version 789803 (0.00085) [2022-07-10 15:52:36,195][25689] Fps is (10 sec: 5598.4, 60 sec: 5548.6, 300 sec: 5548.5). Total num frames: 808762368. Throughput: 0: 5783.1. Samples: 808762018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:36,197][25689] Avg episode reward: [(0, '-0.964')] [2022-07-10 15:52:37,447][26022] Updated weights on worker 0-0, policy_version 789813 (0.00085) [2022-07-10 15:52:39,412][26022] Updated weights on worker 0-0, policy_version 789823 (0.00088) [2022-07-10 15:52:41,103][26022] Updated weights on worker 0-0, policy_version 789833 (0.00094) [2022-07-10 15:52:41,235][25689] Fps is (10 sec: 5796.0, 60 sec: 5539.0, 300 sec: 5545.6). Total num frames: 808790016. Throughput: 0: 5811.3. Samples: 808795472. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:41,235][25689] Avg episode reward: [(0, '-0.608')] [2022-07-10 15:52:43,075][26022] Updated weights on worker 0-0, policy_version 789843 (0.00084) [2022-07-10 15:52:43,752][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:52:43,766][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000789848_808804352.pth [2022-07-10 15:52:43,767][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000787895_806804480.pth [2022-07-10 15:52:44,755][26022] Updated weights on worker 0-0, policy_version 789853 (0.00086) [2022-07-10 15:52:46,245][25689] Fps is (10 sec: 5298.0, 60 sec: 5523.0, 300 sec: 5540.2). Total num frames: 808815616. Throughput: 0: 4980.0. Samples: 808812082. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:46,246][25689] Avg episode reward: [(0, '0.006')] [2022-07-10 15:52:46,608][26022] Updated weights on worker 0-0, policy_version 789863 (0.00090) [2022-07-10 15:52:48,526][26022] Updated weights on worker 0-0, policy_version 789873 (0.00094) [2022-07-10 15:52:50,340][26022] Updated weights on worker 0-0, policy_version 789883 (0.00100) [2022-07-10 15:52:51,248][25689] Fps is (10 sec: 5522.1, 60 sec: 5540.3, 300 sec: 5540.4). Total num frames: 808845312. Throughput: 0: 5831.7. Samples: 808845664. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-10 15:52:51,249][25689] Avg episode reward: [(0, '0.293')] [2022-07-10 15:52:52,194][26022] Updated weights on worker 0-0, policy_version 789893 (0.00052) [2022-07-10 15:52:53,988][26022] Updated weights on worker 0-0, policy_version 789903 (0.00089) [2022-07-10 15:52:55,729][26022] Updated weights on worker 0-0, policy_version 789913 (0.00087) [2022-07-10 15:52:56,251][25689] Fps is (10 sec: 5628.9, 60 sec: 5511.7, 300 sec: 5541.3). Total num frames: 808871936. Throughput: 0: 5840.6. Samples: 808879158. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:52:56,252][25689] Avg episode reward: [(0, '-0.781')] [2022-07-10 15:52:57,789][26022] Updated weights on worker 0-0, policy_version 789923 (0.00086) [2022-07-10 15:52:59,340][26022] Updated weights on worker 0-0, policy_version 789933 (0.00087) [2022-07-10 15:53:01,317][25689] Fps is (10 sec: 5491.9, 60 sec: 5526.6, 300 sec: 5547.6). Total num frames: 808900608. Throughput: 0: 4993.0. Samples: 808895742. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:01,317][25689] Avg episode reward: [(0, '-0.774')] [2022-07-10 15:53:01,512][26022] Updated weights on worker 0-0, policy_version 789943 (0.00122) [2022-07-10 15:53:03,996][26022] Updated weights on worker 0-0, policy_version 789953 (0.00091) [2022-07-10 15:53:05,371][26022] Updated weights on worker 0-0, policy_version 789963 (0.00090) [2022-07-10 15:53:06,323][25689] Fps is (10 sec: 5388.4, 60 sec: 5526.1, 300 sec: 5541.2). Total num frames: 808926208. Throughput: 0: 5715.9. Samples: 808926842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:06,325][25689] Avg episode reward: [(0, '-0.469')] [2022-07-10 15:53:07,376][26022] Updated weights on worker 0-0, policy_version 789973 (0.00085) [2022-07-10 15:53:08,955][26022] Updated weights on worker 0-0, policy_version 789983 (0.00076) [2022-07-10 15:53:10,895][26022] Updated weights on worker 0-0, policy_version 789993 (0.00088) [2022-07-10 15:53:11,359][25689] Fps is (10 sec: 5404.6, 60 sec: 5509.7, 300 sec: 5541.0). Total num frames: 808954880. Throughput: 0: 5714.9. Samples: 808960594. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:11,359][25689] Avg episode reward: [(0, '-0.611')] [2022-07-10 15:53:12,877][26022] Updated weights on worker 0-0, policy_version 790003 (0.00089) [2022-07-10 15:53:14,507][26022] Updated weights on worker 0-0, policy_version 790013 (0.00091) [2022-07-10 15:53:16,363][25689] Fps is (10 sec: 5507.5, 60 sec: 5511.3, 300 sec: 5535.0). Total num frames: 808981504. Throughput: 0: 4881.4. Samples: 808977334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:16,365][25689] Avg episode reward: [(0, '-0.557')] [2022-07-10 15:53:16,642][26022] Updated weights on worker 0-0, policy_version 790023 (0.00088) [2022-07-10 15:53:18,371][26022] Updated weights on worker 0-0, policy_version 790033 (0.00089) [2022-07-10 15:53:20,206][26022] Updated weights on worker 0-0, policy_version 790043 (0.00090) [2022-07-10 15:53:21,462][25689] Fps is (10 sec: 5675.7, 60 sec: 5547.3, 300 sec: 5543.7). Total num frames: 809012224. Throughput: 0: 5703.2. Samples: 809010634. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:21,463][25689] Avg episode reward: [(0, '-1.105')] [2022-07-10 15:53:21,927][26022] Updated weights on worker 0-0, policy_version 790053 (0.00086) [2022-07-10 15:53:23,888][26022] Updated weights on worker 0-0, policy_version 790063 (0.00097) [2022-07-10 15:53:25,622][26022] Updated weights on worker 0-0, policy_version 790073 (0.00092) [2022-07-10 15:53:26,523][25689] Fps is (10 sec: 5745.1, 60 sec: 5528.0, 300 sec: 5546.1). Total num frames: 809039872. Throughput: 0: 5820.5. Samples: 809044416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:26,525][25689] Avg episode reward: [(0, '-0.395')] [2022-07-10 15:53:27,598][26022] Updated weights on worker 0-0, policy_version 790083 (0.00085) [2022-07-10 15:53:29,061][26022] Updated weights on worker 0-0, policy_version 790093 (0.00085) [2022-07-10 15:53:31,491][26022] Updated weights on worker 0-0, policy_version 790103 (0.00082) [2022-07-10 15:53:31,571][25689] Fps is (10 sec: 5267.5, 60 sec: 5526.3, 300 sec: 5538.5). Total num frames: 809065472. Throughput: 0: 4955.7. Samples: 809060760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:31,572][25689] Avg episode reward: [(0, '-0.923')] [2022-07-10 15:53:32,947][26022] Updated weights on worker 0-0, policy_version 790113 (0.00090) [2022-07-10 15:53:35,060][26022] Updated weights on worker 0-0, policy_version 790123 (0.00120) [2022-07-10 15:53:36,666][25689] Fps is (10 sec: 5451.3, 60 sec: 5503.4, 300 sec: 5544.5). Total num frames: 809095168. Throughput: 0: 5729.9. Samples: 809093670. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:36,668][25689] Avg episode reward: [(0, '-2.102')] [2022-07-10 15:53:36,819][26022] Updated weights on worker 0-0, policy_version 790133 (0.00082) [2022-07-10 15:53:38,706][26022] Updated weights on worker 0-0, policy_version 790143 (0.00085) [2022-07-10 15:53:40,638][26022] Updated weights on worker 0-0, policy_version 790153 (0.00090) [2022-07-10 15:53:41,761][25689] Fps is (10 sec: 5627.4, 60 sec: 5498.4, 300 sec: 5542.9). Total num frames: 809122816. Throughput: 0: 5739.5. Samples: 809127140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:41,763][25689] Avg episode reward: [(0, '-2.101')] [2022-07-10 15:53:42,304][26022] Updated weights on worker 0-0, policy_version 790163 (0.00084) [2022-07-10 15:53:44,165][26022] Updated weights on worker 0-0, policy_version 790173 (0.00085) [2022-07-10 15:53:46,214][26022] Updated weights on worker 0-0, policy_version 790183 (0.00428) [2022-07-10 15:53:46,806][25689] Fps is (10 sec: 5554.3, 60 sec: 5546.0, 300 sec: 5539.0). Total num frames: 809151488. Throughput: 0: 5733.7. Samples: 809160716. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:46,808][25689] Avg episode reward: [(0, '-0.805')] [2022-07-10 15:53:47,824][26022] Updated weights on worker 0-0, policy_version 790193 (0.00086) [2022-07-10 15:53:49,711][26022] Updated weights on worker 0-0, policy_version 790203 (0.00100) [2022-07-10 15:53:51,455][26022] Updated weights on worker 0-0, policy_version 790213 (0.00084) [2022-07-10 15:53:51,893][25689] Fps is (10 sec: 5659.6, 60 sec: 5521.5, 300 sec: 5548.2). Total num frames: 809180160. Throughput: 0: 5739.6. Samples: 809177402. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:51,895][25689] Avg episode reward: [(0, '-0.999')] [2022-07-10 15:53:53,545][26022] Updated weights on worker 0-0, policy_version 790223 (0.00082) [2022-07-10 15:53:55,082][26022] Updated weights on worker 0-0, policy_version 790233 (0.00087) [2022-07-10 15:53:56,922][25689] Fps is (10 sec: 5466.4, 60 sec: 5519.1, 300 sec: 5536.5). Total num frames: 809206784. Throughput: 0: 5777.4. Samples: 809210694. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:53:56,923][25689] Avg episode reward: [(0, '-1.263')] [2022-07-10 15:53:57,240][26022] Updated weights on worker 0-0, policy_version 790243 (0.00091) [2022-07-10 15:53:58,807][26022] Updated weights on worker 0-0, policy_version 790253 (0.00088) [2022-07-10 15:54:00,713][26022] Updated weights on worker 0-0, policy_version 790263 (0.00084) [2022-07-10 15:54:01,971][25689] Fps is (10 sec: 5283.5, 60 sec: 5486.8, 300 sec: 5540.8). Total num frames: 809233408. Throughput: 0: 5719.5. Samples: 809242732. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:01,972][25689] Avg episode reward: [(0, '-0.803')] [2022-07-10 15:54:03,020][26022] Updated weights on worker 0-0, policy_version 790273 (0.00513) [2022-07-10 15:54:04,822][26022] Updated weights on worker 0-0, policy_version 790283 (0.00088) [2022-07-10 15:54:06,540][26022] Updated weights on worker 0-0, policy_version 790293 (0.00202) [2022-07-10 15:54:06,974][25689] Fps is (10 sec: 5602.8, 60 sec: 5554.7, 300 sec: 5544.5). Total num frames: 809263104. Throughput: 0: 4861.5. Samples: 809258762. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:06,975][25689] Avg episode reward: [(0, '0.843')] [2022-07-10 15:54:08,607][26022] Updated weights on worker 0-0, policy_version 790303 (0.00089) [2022-07-10 15:54:10,118][26022] Updated weights on worker 0-0, policy_version 790313 (0.00094) [2022-07-10 15:54:11,989][25689] Fps is (10 sec: 5519.8, 60 sec: 5505.9, 300 sec: 5532.1). Total num frames: 809288704. Throughput: 0: 5713.1. Samples: 809292212. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:11,989][25689] Avg episode reward: [(0, '0.220')] [2022-07-10 15:54:12,184][26022] Updated weights on worker 0-0, policy_version 790323 (0.00089) [2022-07-10 15:54:13,958][26022] Updated weights on worker 0-0, policy_version 790333 (0.00092) [2022-07-10 15:54:15,806][26022] Updated weights on worker 0-0, policy_version 790343 (0.00090) [2022-07-10 15:54:17,002][25689] Fps is (10 sec: 5412.0, 60 sec: 5538.9, 300 sec: 5533.9). Total num frames: 809317376. Throughput: 0: 5719.9. Samples: 809325550. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:17,003][25689] Avg episode reward: [(0, '-0.069')] [2022-07-10 15:54:17,726][26022] Updated weights on worker 0-0, policy_version 790353 (0.00086) [2022-07-10 15:54:19,527][26022] Updated weights on worker 0-0, policy_version 790363 (0.00084) [2022-07-10 15:54:21,404][26022] Updated weights on worker 0-0, policy_version 790373 (0.00090) [2022-07-10 15:54:22,110][25689] Fps is (10 sec: 5665.7, 60 sec: 5504.3, 300 sec: 5539.5). Total num frames: 809346048. Throughput: 0: 4944.3. Samples: 809342306. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:22,111][25689] Avg episode reward: [(0, '-0.176')] [2022-07-10 15:54:23,488][26022] Updated weights on worker 0-0, policy_version 790383 (0.00097) [2022-07-10 15:54:24,729][26022] Updated weights on worker 0-0, policy_version 790393 (0.00085) [2022-07-10 15:54:27,012][26022] Updated weights on worker 0-0, policy_version 790403 (0.00092) [2022-07-10 15:54:27,175][25689] Fps is (10 sec: 5435.8, 60 sec: 5487.1, 300 sec: 5531.8). Total num frames: 809372672. Throughput: 0: 5814.1. Samples: 809376210. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:27,175][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 15:54:28,467][26022] Updated weights on worker 0-0, policy_version 790413 (0.00089) [2022-07-10 15:54:30,608][26022] Updated weights on worker 0-0, policy_version 790423 (0.00093) [2022-07-10 15:54:32,199][25689] Fps is (10 sec: 5481.1, 60 sec: 5539.9, 300 sec: 5535.1). Total num frames: 809401344. Throughput: 0: 5818.3. Samples: 809409798. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:32,199][25689] Avg episode reward: [(0, '0.008')] [2022-07-10 15:54:32,462][26022] Updated weights on worker 0-0, policy_version 790433 (0.00782) [2022-07-10 15:54:34,166][26022] Updated weights on worker 0-0, policy_version 790443 (0.00084) [2022-07-10 15:54:35,827][26022] Updated weights on worker 0-0, policy_version 790453 (0.00089) [2022-07-10 15:54:37,213][25689] Fps is (10 sec: 5712.7, 60 sec: 5530.4, 300 sec: 5535.8). Total num frames: 809430016. Throughput: 0: 5002.9. Samples: 809426662. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:37,215][25689] Avg episode reward: [(0, '-0.440')] [2022-07-10 15:54:37,904][26022] Updated weights on worker 0-0, policy_version 790463 (0.00082) [2022-07-10 15:54:39,730][26022] Updated weights on worker 0-0, policy_version 790473 (0.00085) [2022-07-10 15:54:41,652][26022] Updated weights on worker 0-0, policy_version 790483 (0.00091) [2022-07-10 15:54:42,310][25689] Fps is (10 sec: 5772.6, 60 sec: 5564.0, 300 sec: 5545.0). Total num frames: 809459712. Throughput: 0: 5838.3. Samples: 809460238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:42,310][25689] Avg episode reward: [(0, '0.203')] [2022-07-10 15:54:43,234][26022] Updated weights on worker 0-0, policy_version 790493 (0.00092) [2022-07-10 15:54:43,798][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:54:43,818][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000790496_809467904.pth [2022-07-10 15:54:43,819][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000788544_807469056.pth [2022-07-10 15:54:45,298][26022] Updated weights on worker 0-0, policy_version 790503 (0.00084) [2022-07-10 15:54:46,890][26022] Updated weights on worker 0-0, policy_version 790513 (0.00101) [2022-07-10 15:54:47,365][25689] Fps is (10 sec: 5547.5, 60 sec: 5529.3, 300 sec: 5537.3). Total num frames: 809486336. Throughput: 0: 5826.1. Samples: 809493840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:47,366][25689] Avg episode reward: [(0, '-0.508')] [2022-07-10 15:54:48,929][26022] Updated weights on worker 0-0, policy_version 790523 (0.00086) [2022-07-10 15:54:50,611][26022] Updated weights on worker 0-0, policy_version 790533 (0.00083) [2022-07-10 15:54:52,429][25689] Fps is (10 sec: 5363.3, 60 sec: 5514.5, 300 sec: 5536.3). Total num frames: 809513984. Throughput: 0: 4990.8. Samples: 809510764. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:52,429][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 15:54:52,694][26022] Updated weights on worker 0-0, policy_version 790543 (0.00085) [2022-07-10 15:54:54,281][26022] Updated weights on worker 0-0, policy_version 790553 (0.00097) [2022-07-10 15:54:56,510][26022] Updated weights on worker 0-0, policy_version 790563 (0.00088) [2022-07-10 15:54:57,464][25689] Fps is (10 sec: 5475.5, 60 sec: 5530.9, 300 sec: 5534.4). Total num frames: 809541632. Throughput: 0: 5794.9. Samples: 809544014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:54:57,464][25689] Avg episode reward: [(0, '-0.247')] [2022-07-10 15:54:57,856][26022] Updated weights on worker 0-0, policy_version 790573 (0.00092) [2022-07-10 15:54:59,982][26022] Updated weights on worker 0-0, policy_version 790583 (0.00085) [2022-07-10 15:55:01,892][26022] Updated weights on worker 0-0, policy_version 790593 (0.00100) [2022-07-10 15:55:02,570][25689] Fps is (10 sec: 5553.7, 60 sec: 5559.5, 300 sec: 5543.3). Total num frames: 809570304. Throughput: 0: 5689.6. Samples: 809575510. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:02,571][25689] Avg episode reward: [(0, '-0.017')] [2022-07-10 15:55:04,127][26022] Updated weights on worker 0-0, policy_version 790603 (0.00088) [2022-07-10 15:55:05,711][26022] Updated weights on worker 0-0, policy_version 790613 (0.00090) [2022-07-10 15:55:07,629][25689] Fps is (10 sec: 5439.6, 60 sec: 5503.6, 300 sec: 5539.2). Total num frames: 809596928. Throughput: 0: 4848.5. Samples: 809592090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:07,630][25689] Avg episode reward: [(0, '-0.042')] [2022-07-10 15:55:07,731][26022] Updated weights on worker 0-0, policy_version 790623 (0.00090) [2022-07-10 15:55:09,367][26022] Updated weights on worker 0-0, policy_version 790633 (0.00087) [2022-07-10 15:55:11,396][26022] Updated weights on worker 0-0, policy_version 790643 (0.00090) [2022-07-10 15:55:12,634][25689] Fps is (10 sec: 5494.6, 60 sec: 5555.3, 300 sec: 5535.7). Total num frames: 809625600. Throughput: 0: 5697.3. Samples: 809625876. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:12,634][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 15:55:13,051][26022] Updated weights on worker 0-0, policy_version 790653 (0.00086) [2022-07-10 15:55:14,827][26022] Updated weights on worker 0-0, policy_version 790663 (0.00085) [2022-07-10 15:55:16,756][26022] Updated weights on worker 0-0, policy_version 790673 (0.00081) [2022-07-10 15:55:17,656][25689] Fps is (10 sec: 5821.5, 60 sec: 5571.3, 300 sec: 5540.1). Total num frames: 809655296. Throughput: 0: 5733.9. Samples: 809659792. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:17,657][25689] Avg episode reward: [(0, '0.309')] [2022-07-10 15:55:18,423][26022] Updated weights on worker 0-0, policy_version 790683 (0.00200) [2022-07-10 15:55:20,390][26022] Updated weights on worker 0-0, policy_version 790693 (0.00102) [2022-07-10 15:55:22,394][26022] Updated weights on worker 0-0, policy_version 790703 (0.00082) [2022-07-10 15:55:22,803][25689] Fps is (10 sec: 5538.3, 60 sec: 5534.0, 300 sec: 5537.7). Total num frames: 809681920. Throughput: 0: 4993.5. Samples: 809676542. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:22,803][25689] Avg episode reward: [(0, '0.503')] [2022-07-10 15:55:24,000][26022] Updated weights on worker 0-0, policy_version 790713 (0.00084) [2022-07-10 15:55:25,958][26022] Updated weights on worker 0-0, policy_version 790723 (0.00053) [2022-07-10 15:55:27,601][26022] Updated weights on worker 0-0, policy_version 790733 (0.00088) [2022-07-10 15:55:27,827][25689] Fps is (10 sec: 5436.6, 60 sec: 5571.5, 300 sec: 5538.3). Total num frames: 809710592. Throughput: 0: 5850.0. Samples: 809710246. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:27,827][25689] Avg episode reward: [(0, '0.174')] [2022-07-10 15:55:29,614][26022] Updated weights on worker 0-0, policy_version 790743 (0.00088) [2022-07-10 15:55:31,216][26022] Updated weights on worker 0-0, policy_version 790753 (0.00093) [2022-07-10 15:55:32,859][25689] Fps is (10 sec: 5600.6, 60 sec: 5553.8, 300 sec: 5541.5). Total num frames: 809738240. Throughput: 0: 5818.6. Samples: 809743558. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:32,859][25689] Avg episode reward: [(0, '0.026')] [2022-07-10 15:55:33,319][26022] Updated weights on worker 0-0, policy_version 790763 (0.00097) [2022-07-10 15:55:35,012][26022] Updated weights on worker 0-0, policy_version 790773 (0.00086) [2022-07-10 15:55:36,939][26022] Updated weights on worker 0-0, policy_version 790783 (0.00630) [2022-07-10 15:55:37,873][25689] Fps is (10 sec: 5707.8, 60 sec: 5570.7, 300 sec: 5542.9). Total num frames: 809767936. Throughput: 0: 4964.6. Samples: 809760164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:37,874][25689] Avg episode reward: [(0, '0.254')] [2022-07-10 15:55:38,783][26022] Updated weights on worker 0-0, policy_version 790793 (0.00083) [2022-07-10 15:55:40,509][26022] Updated weights on worker 0-0, policy_version 790803 (0.00091) [2022-07-10 15:55:42,379][26022] Updated weights on worker 0-0, policy_version 790813 (0.00082) [2022-07-10 15:55:42,967][25689] Fps is (10 sec: 5673.1, 60 sec: 5537.2, 300 sec: 5538.2). Total num frames: 809795584. Throughput: 0: 5832.7. Samples: 809794152. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:42,967][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 15:55:44,089][26022] Updated weights on worker 0-0, policy_version 790823 (0.00615) [2022-07-10 15:55:45,910][26022] Updated weights on worker 0-0, policy_version 790833 (0.00096) [2022-07-10 15:55:47,815][26022] Updated weights on worker 0-0, policy_version 790843 (0.00085) [2022-07-10 15:55:48,000][25689] Fps is (10 sec: 5460.2, 60 sec: 5556.1, 300 sec: 5538.0). Total num frames: 809823232. Throughput: 0: 5854.3. Samples: 809828348. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:48,001][25689] Avg episode reward: [(0, '-0.023')] [2022-07-10 15:55:49,535][26022] Updated weights on worker 0-0, policy_version 790853 (0.00090) [2022-07-10 15:55:51,413][26022] Updated weights on worker 0-0, policy_version 790863 (0.00090) [2022-07-10 15:55:53,003][25689] Fps is (10 sec: 5713.7, 60 sec: 5595.5, 300 sec: 5541.7). Total num frames: 809852928. Throughput: 0: 5046.8. Samples: 809845222. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:53,003][25689] Avg episode reward: [(0, '-1.359')] [2022-07-10 15:55:53,200][26022] Updated weights on worker 0-0, policy_version 790873 (0.00092) [2022-07-10 15:55:55,079][26022] Updated weights on worker 0-0, policy_version 790883 (0.00100) [2022-07-10 15:55:56,887][26022] Updated weights on worker 0-0, policy_version 790893 (0.00425) [2022-07-10 15:55:58,039][25689] Fps is (10 sec: 5712.2, 60 sec: 5595.4, 300 sec: 5542.2). Total num frames: 809880576. Throughput: 0: 5884.1. Samples: 809878822. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:55:58,040][25689] Avg episode reward: [(0, '-1.044')] [2022-07-10 15:55:58,728][26022] Updated weights on worker 0-0, policy_version 790903 (0.00086) [2022-07-10 15:56:00,626][26022] Updated weights on worker 0-0, policy_version 790913 (0.00089) [2022-07-10 15:56:02,826][26022] Updated weights on worker 0-0, policy_version 790923 (0.00086) [2022-07-10 15:56:03,111][25689] Fps is (10 sec: 5369.2, 60 sec: 5564.8, 300 sec: 5542.2). Total num frames: 809907200. Throughput: 0: 5853.4. Samples: 809912064. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:56:03,111][25689] Avg episode reward: [(0, '-1.139')] [2022-07-10 15:56:04,775][26022] Updated weights on worker 0-0, policy_version 790933 (0.00481) [2022-07-10 15:56:06,423][26022] Updated weights on worker 0-0, policy_version 790943 (0.00098) [2022-07-10 15:56:08,126][25689] Fps is (10 sec: 5380.5, 60 sec: 5585.8, 300 sec: 5542.2). Total num frames: 809934848. Throughput: 0: 5736.1. Samples: 809943790. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:56:08,126][25689] Avg episode reward: [(0, '-0.259')] [2022-07-10 15:56:08,505][26022] Updated weights on worker 0-0, policy_version 790953 (0.00086) [2022-07-10 15:56:10,119][26022] Updated weights on worker 0-0, policy_version 790963 (0.00080) [2022-07-10 15:56:12,127][26022] Updated weights on worker 0-0, policy_version 790973 (0.00085) [2022-07-10 15:56:13,171][25689] Fps is (10 sec: 5598.5, 60 sec: 5582.0, 300 sec: 5545.4). Total num frames: 809963520. Throughput: 0: 5719.2. Samples: 809960566. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:56:13,172][25689] Avg episode reward: [(0, '-0.584')] [2022-07-10 15:56:13,817][26022] Updated weights on worker 0-0, policy_version 790983 (0.00092) [2022-07-10 15:56:15,626][26022] Updated weights on worker 0-0, policy_version 790993 (0.00065) [2022-07-10 15:56:17,526][26022] Updated weights on worker 0-0, policy_version 791003 (0.00079) [2022-07-10 15:56:18,177][25689] Fps is (10 sec: 5501.7, 60 sec: 5532.7, 300 sec: 5536.1). Total num frames: 809990144. Throughput: 0: 5741.8. Samples: 809994448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:56:18,177][25689] Avg episode reward: [(0, '-0.946')] [2022-07-10 15:56:19,256][26022] Updated weights on worker 0-0, policy_version 791013 (0.00083) [2022-07-10 15:56:21,101][26022] Updated weights on worker 0-0, policy_version 791023 (0.00084) [2022-07-10 15:56:23,098][26022] Updated weights on worker 0-0, policy_version 791033 (0.00097) [2022-07-10 15:56:23,263][25689] Fps is (10 sec: 5377.4, 60 sec: 5555.2, 300 sec: 5535.2). Total num frames: 810017792. Throughput: 0: 5752.9. Samples: 810027998. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 15:56:23,264][25689] Avg episode reward: [(0, '-1.364')] [2022-07-10 15:56:24,655][26022] Updated weights on worker 0-0, policy_version 791043 (0.00088) [2022-07-10 15:56:26,890][26022] Updated weights on worker 0-0, policy_version 791053 (0.00100) [2022-07-10 15:56:28,266][25689] Fps is (10 sec: 5683.4, 60 sec: 5574.1, 300 sec: 5546.4). Total num frames: 810047488. Throughput: 0: 4998.1. Samples: 810044454. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:28,267][25689] Avg episode reward: [(0, '-1.135')] [2022-07-10 15:56:28,343][26022] Updated weights on worker 0-0, policy_version 791063 (0.00088) [2022-07-10 15:56:30,415][26022] Updated weights on worker 0-0, policy_version 791073 (0.00083) [2022-07-10 15:56:32,165][26022] Updated weights on worker 0-0, policy_version 791083 (0.00091) [2022-07-10 15:56:33,274][25689] Fps is (10 sec: 5625.7, 60 sec: 5559.3, 300 sec: 5536.1). Total num frames: 810074112. Throughput: 0: 5824.3. Samples: 810077656. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:33,275][25689] Avg episode reward: [(0, '-1.673')] [2022-07-10 15:56:34,059][26022] Updated weights on worker 0-0, policy_version 791093 (0.00093) [2022-07-10 15:56:36,171][26022] Updated weights on worker 0-0, policy_version 791103 (0.00535) [2022-07-10 15:56:37,704][26022] Updated weights on worker 0-0, policy_version 791113 (0.00084) [2022-07-10 15:56:38,289][25689] Fps is (10 sec: 5415.0, 60 sec: 5525.5, 300 sec: 5534.6). Total num frames: 810101760. Throughput: 0: 5790.6. Samples: 810110910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:38,289][25689] Avg episode reward: [(0, '-1.104')] [2022-07-10 15:56:39,660][26022] Updated weights on worker 0-0, policy_version 791123 (0.00093) [2022-07-10 15:56:41,560][26022] Updated weights on worker 0-0, policy_version 791133 (0.00081) [2022-07-10 15:56:43,267][26022] Updated weights on worker 0-0, policy_version 791143 (0.00094) [2022-07-10 15:56:43,382][25689] Fps is (10 sec: 5572.0, 60 sec: 5542.4, 300 sec: 5540.2). Total num frames: 810130432. Throughput: 0: 4952.4. Samples: 810127636. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:43,383][25689] Avg episode reward: [(0, '-1.472')] [2022-07-10 15:56:43,936][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:56:43,950][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000791147_810134528.pth [2022-07-10 15:56:43,950][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000789197_808137728.pth [2022-07-10 15:56:45,313][26022] Updated weights on worker 0-0, policy_version 791153 (0.00088) [2022-07-10 15:56:46,907][26022] Updated weights on worker 0-0, policy_version 791163 (0.00089) [2022-07-10 15:56:48,403][25689] Fps is (10 sec: 5669.8, 60 sec: 5560.6, 300 sec: 5539.9). Total num frames: 810159104. Throughput: 0: 5800.5. Samples: 810161256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:48,403][25689] Avg episode reward: [(0, '-0.331')] [2022-07-10 15:56:48,834][26022] Updated weights on worker 0-0, policy_version 791173 (0.00088) [2022-07-10 15:56:50,644][26022] Updated weights on worker 0-0, policy_version 791183 (0.00089) [2022-07-10 15:56:52,559][26022] Updated weights on worker 0-0, policy_version 791193 (0.00089) [2022-07-10 15:56:53,424][25689] Fps is (10 sec: 5506.6, 60 sec: 5508.0, 300 sec: 5533.8). Total num frames: 810185728. Throughput: 0: 5814.2. Samples: 810194810. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:53,424][25689] Avg episode reward: [(0, '-0.067')] [2022-07-10 15:56:54,389][26022] Updated weights on worker 0-0, policy_version 791203 (0.00086) [2022-07-10 15:56:56,037][26022] Updated weights on worker 0-0, policy_version 791213 (0.00095) [2022-07-10 15:56:57,958][26022] Updated weights on worker 0-0, policy_version 791223 (0.00087) [2022-07-10 15:56:58,441][25689] Fps is (10 sec: 5610.5, 60 sec: 5543.7, 300 sec: 5541.1). Total num frames: 810215424. Throughput: 0: 4992.8. Samples: 810211526. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:56:58,442][25689] Avg episode reward: [(0, '-0.494')] [2022-07-10 15:56:59,719][26022] Updated weights on worker 0-0, policy_version 791233 (0.00094) [2022-07-10 15:57:01,699][26022] Updated weights on worker 0-0, policy_version 791243 (0.00118) [2022-07-10 15:57:03,496][25689] Fps is (10 sec: 5489.9, 60 sec: 5528.2, 300 sec: 5540.1). Total num frames: 810241024. Throughput: 0: 5772.1. Samples: 810243736. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:03,497][25689] Avg episode reward: [(0, '-0.108')] [2022-07-10 15:57:03,940][26022] Updated weights on worker 0-0, policy_version 791253 (0.00100) [2022-07-10 15:57:05,925][26022] Updated weights on worker 0-0, policy_version 791263 (0.00085) [2022-07-10 15:57:07,667][26022] Updated weights on worker 0-0, policy_version 791273 (0.00082) [2022-07-10 15:57:08,584][25689] Fps is (10 sec: 5149.1, 60 sec: 5504.7, 300 sec: 5529.0). Total num frames: 810267648. Throughput: 0: 5687.8. Samples: 810276040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:08,584][25689] Avg episode reward: [(0, '-0.543')] [2022-07-10 15:57:09,454][26022] Updated weights on worker 0-0, policy_version 791283 (0.00080) [2022-07-10 15:57:11,411][26022] Updated weights on worker 0-0, policy_version 791293 (0.00088) [2022-07-10 15:57:13,139][26022] Updated weights on worker 0-0, policy_version 791303 (0.00086) [2022-07-10 15:57:13,589][25689] Fps is (10 sec: 5377.2, 60 sec: 5491.3, 300 sec: 5532.7). Total num frames: 810295296. Throughput: 0: 4857.4. Samples: 810292762. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:13,591][25689] Avg episode reward: [(0, '-0.706')] [2022-07-10 15:57:15,002][26022] Updated weights on worker 0-0, policy_version 791313 (0.00093) [2022-07-10 15:57:17,003][26022] Updated weights on worker 0-0, policy_version 791323 (0.00090) [2022-07-10 15:57:18,426][26022] Updated weights on worker 0-0, policy_version 791333 (0.00087) [2022-07-10 15:57:18,646][25689] Fps is (10 sec: 5800.5, 60 sec: 5554.4, 300 sec: 5540.8). Total num frames: 810326016. Throughput: 0: 5693.4. Samples: 810326562. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:18,648][25689] Avg episode reward: [(0, '-0.968')] [2022-07-10 15:57:20,613][26022] Updated weights on worker 0-0, policy_version 791343 (0.00082) [2022-07-10 15:57:22,003][26022] Updated weights on worker 0-0, policy_version 791353 (0.00081) [2022-07-10 15:57:23,748][25689] Fps is (10 sec: 5745.7, 60 sec: 5553.0, 300 sec: 5536.1). Total num frames: 810353664. Throughput: 0: 5769.1. Samples: 810360570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:23,748][25689] Avg episode reward: [(0, '-0.994')] [2022-07-10 15:57:24,011][26022] Updated weights on worker 0-0, policy_version 791363 (0.00085) [2022-07-10 15:57:25,882][26022] Updated weights on worker 0-0, policy_version 791373 (0.00093) [2022-07-10 15:57:27,672][26022] Updated weights on worker 0-0, policy_version 791383 (0.00096) [2022-07-10 15:57:28,825][25689] Fps is (10 sec: 5532.9, 60 sec: 5529.3, 300 sec: 5545.5). Total num frames: 810382336. Throughput: 0: 5003.2. Samples: 810377312. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:28,827][25689] Avg episode reward: [(0, '-1.828')] [2022-07-10 15:57:29,645][26022] Updated weights on worker 0-0, policy_version 791393 (0.00086) [2022-07-10 15:57:31,259][26022] Updated weights on worker 0-0, policy_version 791403 (0.00410) [2022-07-10 15:57:33,307][26022] Updated weights on worker 0-0, policy_version 791413 (0.00094) [2022-07-10 15:57:33,867][25689] Fps is (10 sec: 5464.4, 60 sec: 5526.2, 300 sec: 5531.6). Total num frames: 810408960. Throughput: 0: 5826.5. Samples: 810410912. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:33,868][25689] Avg episode reward: [(0, '-1.596')] [2022-07-10 15:57:35,014][26022] Updated weights on worker 0-0, policy_version 791423 (0.00095) [2022-07-10 15:57:37,001][26022] Updated weights on worker 0-0, policy_version 791433 (0.00087) [2022-07-10 15:57:38,787][26022] Updated weights on worker 0-0, policy_version 791443 (0.00091) [2022-07-10 15:57:38,876][25689] Fps is (10 sec: 5501.7, 60 sec: 5543.6, 300 sec: 5535.6). Total num frames: 810437632. Throughput: 0: 5824.1. Samples: 810444382. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:38,877][25689] Avg episode reward: [(0, '-1.539')] [2022-07-10 15:57:40,712][26022] Updated weights on worker 0-0, policy_version 791453 (0.00084) [2022-07-10 15:57:42,258][26022] Updated weights on worker 0-0, policy_version 791463 (0.00085) [2022-07-10 15:57:43,950][25689] Fps is (10 sec: 5586.0, 60 sec: 5528.5, 300 sec: 5541.3). Total num frames: 810465280. Throughput: 0: 4972.8. Samples: 810461030. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:43,950][25689] Avg episode reward: [(0, '-1.438')] [2022-07-10 15:57:44,391][26022] Updated weights on worker 0-0, policy_version 791473 (0.00090) [2022-07-10 15:57:46,091][26022] Updated weights on worker 0-0, policy_version 791483 (0.01272) [2022-07-10 15:57:47,983][26022] Updated weights on worker 0-0, policy_version 791493 (0.00103) [2022-07-10 15:57:49,012][25689] Fps is (10 sec: 5556.3, 60 sec: 5524.7, 300 sec: 5536.7). Total num frames: 810493952. Throughput: 0: 5784.6. Samples: 810494086. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:49,013][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 15:57:49,802][26022] Updated weights on worker 0-0, policy_version 791503 (0.00106) [2022-07-10 15:57:51,668][26022] Updated weights on worker 0-0, policy_version 791513 (0.00086) [2022-07-10 15:57:53,610][26022] Updated weights on worker 0-0, policy_version 791523 (0.00094) [2022-07-10 15:57:54,055][25689] Fps is (10 sec: 5573.2, 60 sec: 5539.6, 300 sec: 5539.4). Total num frames: 810521600. Throughput: 0: 5770.9. Samples: 810527414. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:54,056][25689] Avg episode reward: [(0, '0.188')] [2022-07-10 15:57:55,258][26022] Updated weights on worker 0-0, policy_version 791533 (0.00108) [2022-07-10 15:57:57,218][26022] Updated weights on worker 0-0, policy_version 791543 (0.00088) [2022-07-10 15:57:59,088][25689] Fps is (10 sec: 5589.5, 60 sec: 5521.3, 300 sec: 5540.0). Total num frames: 810550272. Throughput: 0: 4925.1. Samples: 810543928. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:57:59,089][25689] Avg episode reward: [(0, '-0.555')] [2022-07-10 15:57:59,090][26022] Updated weights on worker 0-0, policy_version 791553 (0.00097) [2022-07-10 15:58:01,100][26022] Updated weights on worker 0-0, policy_version 791563 (0.00082) [2022-07-10 15:58:03,151][26022] Updated weights on worker 0-0, policy_version 791573 (0.00087) [2022-07-10 15:58:04,132][25689] Fps is (10 sec: 5284.1, 60 sec: 5505.4, 300 sec: 5535.9). Total num frames: 810574848. Throughput: 0: 5633.6. Samples: 810574730. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:04,134][25689] Avg episode reward: [(0, '-0.738')] [2022-07-10 15:58:05,013][26022] Updated weights on worker 0-0, policy_version 791583 (0.00500) [2022-07-10 15:58:07,020][26022] Updated weights on worker 0-0, policy_version 791593 (0.00091) [2022-07-10 15:58:08,974][26022] Updated weights on worker 0-0, policy_version 791603 (0.00085) [2022-07-10 15:58:09,173][25689] Fps is (10 sec: 5178.7, 60 sec: 5526.5, 300 sec: 5532.4). Total num frames: 810602496. Throughput: 0: 5626.2. Samples: 810607512. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:09,173][25689] Avg episode reward: [(0, '-1.317')] [2022-07-10 15:58:10,539][26022] Updated weights on worker 0-0, policy_version 791613 (0.00095) [2022-07-10 15:58:12,627][26022] Updated weights on worker 0-0, policy_version 791623 (0.00087) [2022-07-10 15:58:14,183][25689] Fps is (10 sec: 5603.6, 60 sec: 5543.0, 300 sec: 5539.1). Total num frames: 810631168. Throughput: 0: 4792.8. Samples: 810623882. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:14,183][25689] Avg episode reward: [(0, '-1.630')] [2022-07-10 15:58:14,238][26022] Updated weights on worker 0-0, policy_version 791633 (0.00092) [2022-07-10 15:58:16,477][26022] Updated weights on worker 0-0, policy_version 791643 (0.00082) [2022-07-10 15:58:18,103][26022] Updated weights on worker 0-0, policy_version 791653 (0.00096) [2022-07-10 15:58:19,217][25689] Fps is (10 sec: 5505.1, 60 sec: 5477.5, 300 sec: 5526.6). Total num frames: 810657792. Throughput: 0: 5636.7. Samples: 810657388. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:19,223][25689] Avg episode reward: [(0, '-1.898')] [2022-07-10 15:58:19,945][26022] Updated weights on worker 0-0, policy_version 791663 (0.00088) [2022-07-10 15:58:21,741][26022] Updated weights on worker 0-0, policy_version 791673 (0.00086) [2022-07-10 15:58:23,654][26022] Updated weights on worker 0-0, policy_version 791683 (0.00087) [2022-07-10 15:58:24,333][25689] Fps is (10 sec: 5447.7, 60 sec: 5493.1, 300 sec: 5529.0). Total num frames: 810686464. Throughput: 0: 5764.9. Samples: 810691188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:24,334][25689] Avg episode reward: [(0, '-1.199')] [2022-07-10 15:58:25,402][26022] Updated weights on worker 0-0, policy_version 791693 (0.00089) [2022-07-10 15:58:27,278][26022] Updated weights on worker 0-0, policy_version 791703 (0.00092) [2022-07-10 15:58:29,093][26022] Updated weights on worker 0-0, policy_version 791713 (0.00089) [2022-07-10 15:58:29,342][25689] Fps is (10 sec: 5764.7, 60 sec: 5516.2, 300 sec: 5543.5). Total num frames: 810716160. Throughput: 0: 4972.6. Samples: 810707808. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:29,355][25689] Avg episode reward: [(0, '-0.559')] [2022-07-10 15:58:31,091][26022] Updated weights on worker 0-0, policy_version 791723 (0.00089) [2022-07-10 15:58:32,699][26022] Updated weights on worker 0-0, policy_version 791733 (0.00087) [2022-07-10 15:58:34,403][25689] Fps is (10 sec: 5592.9, 60 sec: 5514.5, 300 sec: 5533.8). Total num frames: 810742784. Throughput: 0: 5805.5. Samples: 810741270. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:34,404][25689] Avg episode reward: [(0, '-0.491')] [2022-07-10 15:58:34,698][26022] Updated weights on worker 0-0, policy_version 791743 (0.00087) [2022-07-10 15:58:36,494][26022] Updated weights on worker 0-0, policy_version 791753 (0.00089) [2022-07-10 15:58:38,539][26022] Updated weights on worker 0-0, policy_version 791763 (0.01008) [2022-07-10 15:58:39,421][25689] Fps is (10 sec: 5384.5, 60 sec: 5496.7, 300 sec: 5535.2). Total num frames: 810770432. Throughput: 0: 5801.9. Samples: 810774612. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:39,422][25689] Avg episode reward: [(0, '0.347')] [2022-07-10 15:58:40,108][26022] Updated weights on worker 0-0, policy_version 791773 (0.00095) [2022-07-10 15:58:42,099][26022] Updated weights on worker 0-0, policy_version 791783 (0.00081) [2022-07-10 15:58:43,966][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 15:58:43,982][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000791793_810796032.pth [2022-07-10 15:58:43,983][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000789848_808804352.pth [2022-07-10 15:58:43,992][26022] Updated weights on worker 0-0, policy_version 791793 (0.00087) [2022-07-10 15:58:44,473][25689] Fps is (10 sec: 5491.0, 60 sec: 5498.7, 300 sec: 5531.7). Total num frames: 810798080. Throughput: 0: 5805.7. Samples: 810808116. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:44,474][25689] Avg episode reward: [(0, '0.282')] [2022-07-10 15:58:45,669][26022] Updated weights on worker 0-0, policy_version 791803 (0.00082) [2022-07-10 15:58:47,694][26022] Updated weights on worker 0-0, policy_version 791813 (0.00085) [2022-07-10 15:58:49,381][26022] Updated weights on worker 0-0, policy_version 791823 (0.00096) [2022-07-10 15:58:49,478][25689] Fps is (10 sec: 5702.3, 60 sec: 5520.9, 300 sec: 5536.6). Total num frames: 810827776. Throughput: 0: 5808.9. Samples: 810824774. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:49,478][25689] Avg episode reward: [(0, '0.294')] [2022-07-10 15:58:51,468][26022] Updated weights on worker 0-0, policy_version 791833 (0.00092) [2022-07-10 15:58:53,036][26022] Updated weights on worker 0-0, policy_version 791843 (0.00083) [2022-07-10 15:58:54,491][25689] Fps is (10 sec: 5622.2, 60 sec: 5506.7, 300 sec: 5536.9). Total num frames: 810854400. Throughput: 0: 5809.0. Samples: 810857960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:54,495][25689] Avg episode reward: [(0, '0.238')] [2022-07-10 15:58:55,035][26022] Updated weights on worker 0-0, policy_version 791853 (0.00099) [2022-07-10 15:58:56,716][26022] Updated weights on worker 0-0, policy_version 791863 (0.00086) [2022-07-10 15:58:58,783][26022] Updated weights on worker 0-0, policy_version 791873 (0.00089) [2022-07-10 15:58:59,509][25689] Fps is (10 sec: 5410.2, 60 sec: 5491.1, 300 sec: 5540.9). Total num frames: 810882048. Throughput: 0: 5810.9. Samples: 810891340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:58:59,511][25689] Avg episode reward: [(0, '0.726')] [2022-07-10 15:59:00,431][26022] Updated weights on worker 0-0, policy_version 791883 (0.00080) [2022-07-10 15:59:02,746][26022] Updated weights on worker 0-0, policy_version 791893 (0.00088) [2022-07-10 15:59:04,326][26022] Updated weights on worker 0-0, policy_version 791903 (0.00088) [2022-07-10 15:59:04,574][25689] Fps is (10 sec: 5484.0, 60 sec: 5540.0, 300 sec: 5532.9). Total num frames: 810909696. Throughput: 0: 4873.2. Samples: 810906070. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:04,575][25689] Avg episode reward: [(0, '0.848')] [2022-07-10 15:59:06,421][26022] Updated weights on worker 0-0, policy_version 791913 (0.00079) [2022-07-10 15:59:08,051][26022] Updated weights on worker 0-0, policy_version 791923 (0.00107) [2022-07-10 15:59:09,583][25689] Fps is (10 sec: 5387.7, 60 sec: 5526.0, 300 sec: 5536.5). Total num frames: 810936320. Throughput: 0: 5698.3. Samples: 810939336. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:09,583][25689] Avg episode reward: [(0, '0.454')] [2022-07-10 15:59:10,313][26022] Updated weights on worker 0-0, policy_version 791933 (0.00090) [2022-07-10 15:59:11,701][26022] Updated weights on worker 0-0, policy_version 791943 (0.00088) [2022-07-10 15:59:13,930][26022] Updated weights on worker 0-0, policy_version 791953 (0.00089) [2022-07-10 15:59:14,627][25689] Fps is (10 sec: 5500.6, 60 sec: 5522.9, 300 sec: 5535.9). Total num frames: 810964992. Throughput: 0: 5708.2. Samples: 810972900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:14,627][25689] Avg episode reward: [(0, '0.756')] [2022-07-10 15:59:15,454][26022] Updated weights on worker 0-0, policy_version 791963 (0.00086) [2022-07-10 15:59:17,356][26022] Updated weights on worker 0-0, policy_version 791973 (0.00092) [2022-07-10 15:59:19,307][26022] Updated weights on worker 0-0, policy_version 791983 (0.00090) [2022-07-10 15:59:19,652][25689] Fps is (10 sec: 5491.6, 60 sec: 5523.7, 300 sec: 5530.5). Total num frames: 810991616. Throughput: 0: 4876.7. Samples: 810989570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:19,652][25689] Avg episode reward: [(0, '-0.607')] [2022-07-10 15:59:20,927][26022] Updated weights on worker 0-0, policy_version 791993 (0.00096) [2022-07-10 15:59:23,088][26022] Updated weights on worker 0-0, policy_version 792003 (0.00090) [2022-07-10 15:59:24,767][25689] Fps is (10 sec: 5453.2, 60 sec: 5523.8, 300 sec: 5536.5). Total num frames: 811020288. Throughput: 0: 5793.6. Samples: 811023058. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:24,767][25689] Avg episode reward: [(0, '-0.475')] [2022-07-10 15:59:24,769][26022] Updated weights on worker 0-0, policy_version 792013 (0.00088) [2022-07-10 15:59:26,673][26022] Updated weights on worker 0-0, policy_version 792023 (0.00084) [2022-07-10 15:59:28,594][26022] Updated weights on worker 0-0, policy_version 792033 (0.00089) [2022-07-10 15:59:29,783][25689] Fps is (10 sec: 5559.1, 60 sec: 5489.3, 300 sec: 5533.2). Total num frames: 811047936. Throughput: 0: 5793.3. Samples: 811056364. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:29,784][25689] Avg episode reward: [(0, '-1.082')] [2022-07-10 15:59:30,210][26022] Updated weights on worker 0-0, policy_version 792043 (0.00088) [2022-07-10 15:59:32,249][26022] Updated weights on worker 0-0, policy_version 792053 (0.00091) [2022-07-10 15:59:34,000][26022] Updated weights on worker 0-0, policy_version 792063 (0.00085) [2022-07-10 15:59:34,792][25689] Fps is (10 sec: 5515.7, 60 sec: 5511.0, 300 sec: 5529.8). Total num frames: 811075584. Throughput: 0: 4956.5. Samples: 811072848. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:34,792][25689] Avg episode reward: [(0, '-3.278')] [2022-07-10 15:59:35,815][26022] Updated weights on worker 0-0, policy_version 792073 (0.00089) [2022-07-10 15:59:37,837][26022] Updated weights on worker 0-0, policy_version 792083 (0.00094) [2022-07-10 15:59:39,630][26022] Updated weights on worker 0-0, policy_version 792093 (0.00098) [2022-07-10 15:59:39,801][25689] Fps is (10 sec: 5622.1, 60 sec: 5528.8, 300 sec: 5528.0). Total num frames: 811104256. Throughput: 0: 5775.8. Samples: 811105946. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:39,801][25689] Avg episode reward: [(0, '-2.586')] [2022-07-10 15:59:41,613][26022] Updated weights on worker 0-0, policy_version 792103 (0.00086) [2022-07-10 15:59:43,569][26022] Updated weights on worker 0-0, policy_version 792113 (0.00092) [2022-07-10 15:59:44,922][25689] Fps is (10 sec: 5458.5, 60 sec: 5505.5, 300 sec: 5526.8). Total num frames: 811130880. Throughput: 0: 5727.0. Samples: 811138488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:44,924][25689] Avg episode reward: [(0, '-2.679')] [2022-07-10 15:59:45,326][26022] Updated weights on worker 0-0, policy_version 792123 (0.00092) [2022-07-10 15:59:47,422][26022] Updated weights on worker 0-0, policy_version 792133 (0.00090) [2022-07-10 15:59:49,121][26022] Updated weights on worker 0-0, policy_version 792143 (0.00095) [2022-07-10 15:59:49,934][25689] Fps is (10 sec: 5355.6, 60 sec: 5470.9, 300 sec: 5527.8). Total num frames: 811158528. Throughput: 0: 4892.8. Samples: 811154960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:49,935][25689] Avg episode reward: [(0, '-1.915')] [2022-07-10 15:59:50,923][26022] Updated weights on worker 0-0, policy_version 792153 (0.00084) [2022-07-10 15:59:52,672][26022] Updated weights on worker 0-0, policy_version 792163 (0.00092) [2022-07-10 15:59:54,574][26022] Updated weights on worker 0-0, policy_version 792173 (0.00087) [2022-07-10 15:59:54,981][25689] Fps is (10 sec: 5497.3, 60 sec: 5484.8, 300 sec: 5527.6). Total num frames: 811186176. Throughput: 0: 5714.8. Samples: 811188224. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 15:59:54,983][25689] Avg episode reward: [(0, '-1.215')] [2022-07-10 15:59:56,440][26022] Updated weights on worker 0-0, policy_version 792183 (0.00085) [2022-07-10 15:59:58,328][26022] Updated weights on worker 0-0, policy_version 792193 (0.00091) [2022-07-10 15:59:59,984][26022] Updated weights on worker 0-0, policy_version 792203 (0.00087) [2022-07-10 16:00:00,038][25689] Fps is (10 sec: 5675.6, 60 sec: 5515.1, 300 sec: 5531.9). Total num frames: 811215872. Throughput: 0: 5727.5. Samples: 811221856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:00,039][25689] Avg episode reward: [(0, '-0.315')] [2022-07-10 16:00:02,315][26022] Updated weights on worker 0-0, policy_version 792213 (0.00089) [2022-07-10 16:00:04,089][26022] Updated weights on worker 0-0, policy_version 792223 (0.00098) [2022-07-10 16:00:05,152][25689] Fps is (10 sec: 5335.7, 60 sec: 5459.9, 300 sec: 5524.0). Total num frames: 811240448. Throughput: 0: 4829.4. Samples: 811236188. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:05,153][25689] Avg episode reward: [(0, '-0.134')] [2022-07-10 16:00:05,884][26022] Updated weights on worker 0-0, policy_version 792233 (0.00087) [2022-07-10 16:00:08,087][26022] Updated weights on worker 0-0, policy_version 792243 (0.00093) [2022-07-10 16:00:09,471][26022] Updated weights on worker 0-0, policy_version 792253 (0.00093) [2022-07-10 16:00:10,155][25689] Fps is (10 sec: 5162.2, 60 sec: 5477.4, 300 sec: 5520.6). Total num frames: 811268096. Throughput: 0: 5673.6. Samples: 811269682. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:10,156][25689] Avg episode reward: [(0, '-0.261')] [2022-07-10 16:00:11,687][26022] Updated weights on worker 0-0, policy_version 792263 (0.00101) [2022-07-10 16:00:13,417][26022] Updated weights on worker 0-0, policy_version 792273 (0.00093) [2022-07-10 16:00:15,201][25689] Fps is (10 sec: 5604.5, 60 sec: 5477.1, 300 sec: 5516.7). Total num frames: 811296768. Throughput: 0: 5664.7. Samples: 811302766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:15,203][25689] Avg episode reward: [(0, '-1.046')] [2022-07-10 16:00:15,251][26022] Updated weights on worker 0-0, policy_version 792283 (0.00092) [2022-07-10 16:00:17,033][26022] Updated weights on worker 0-0, policy_version 792293 (0.00091) [2022-07-10 16:00:18,935][26022] Updated weights on worker 0-0, policy_version 792303 (0.00091) [2022-07-10 16:00:20,208][25689] Fps is (10 sec: 5704.1, 60 sec: 5512.7, 300 sec: 5526.2). Total num frames: 811325440. Throughput: 0: 4837.2. Samples: 811319420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:20,209][25689] Avg episode reward: [(0, '-0.610')] [2022-07-10 16:00:20,982][26022] Updated weights on worker 0-0, policy_version 792313 (0.00098) [2022-07-10 16:00:22,719][26022] Updated weights on worker 0-0, policy_version 792323 (0.00087) [2022-07-10 16:00:24,430][26022] Updated weights on worker 0-0, policy_version 792333 (0.00823) [2022-07-10 16:00:25,286][25689] Fps is (10 sec: 5483.3, 60 sec: 5482.2, 300 sec: 5518.3). Total num frames: 811352064. Throughput: 0: 5782.4. Samples: 811352608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:25,286][25689] Avg episode reward: [(0, '-1.356')] [2022-07-10 16:00:26,551][26022] Updated weights on worker 0-0, policy_version 792343 (0.00094) [2022-07-10 16:00:28,079][26022] Updated weights on worker 0-0, policy_version 792353 (0.00096) [2022-07-10 16:00:30,215][26022] Updated weights on worker 0-0, policy_version 792363 (0.00101) [2022-07-10 16:00:30,303][25689] Fps is (10 sec: 5376.0, 60 sec: 5482.1, 300 sec: 5518.6). Total num frames: 811379712. Throughput: 0: 5766.9. Samples: 811385876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:30,304][25689] Avg episode reward: [(0, '-0.961')] [2022-07-10 16:00:31,788][26022] Updated weights on worker 0-0, policy_version 792373 (0.00086) [2022-07-10 16:00:33,802][26022] Updated weights on worker 0-0, policy_version 792383 (0.00093) [2022-07-10 16:00:35,330][25689] Fps is (10 sec: 5709.4, 60 sec: 5514.3, 300 sec: 5518.4). Total num frames: 811409408. Throughput: 0: 4961.6. Samples: 811402634. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:35,330][25689] Avg episode reward: [(0, '-1.085')] [2022-07-10 16:00:35,627][26022] Updated weights on worker 0-0, policy_version 792393 (0.00102) [2022-07-10 16:00:37,549][26022] Updated weights on worker 0-0, policy_version 792403 (0.00086) [2022-07-10 16:00:39,527][26022] Updated weights on worker 0-0, policy_version 792413 (0.00910) [2022-07-10 16:00:40,362][25689] Fps is (10 sec: 5497.3, 60 sec: 5461.5, 300 sec: 5512.6). Total num frames: 811435008. Throughput: 0: 5771.5. Samples: 811435740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:40,362][25689] Avg episode reward: [(0, '-2.630')] [2022-07-10 16:00:41,257][26022] Updated weights on worker 0-0, policy_version 792423 (0.00081) [2022-07-10 16:00:43,268][26022] Updated weights on worker 0-0, policy_version 792433 (0.00091) [2022-07-10 16:00:44,022][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:00:44,039][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000792438_811456512.pth [2022-07-10 16:00:44,040][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000790496_809467904.pth [2022-07-10 16:00:45,042][26022] Updated weights on worker 0-0, policy_version 792443 (0.00085) [2022-07-10 16:00:45,422][25689] Fps is (10 sec: 5377.4, 60 sec: 5500.8, 300 sec: 5515.6). Total num frames: 811463680. Throughput: 0: 5772.6. Samples: 811468848. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:45,423][25689] Avg episode reward: [(0, '-2.401')] [2022-07-10 16:00:46,776][26022] Updated weights on worker 0-0, policy_version 792453 (0.00087) [2022-07-10 16:00:48,725][26022] Updated weights on worker 0-0, policy_version 792463 (0.00057) [2022-07-10 16:00:50,457][25689] Fps is (10 sec: 5579.0, 60 sec: 5498.8, 300 sec: 5508.1). Total num frames: 811491328. Throughput: 0: 4940.6. Samples: 811485448. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:50,457][25689] Avg episode reward: [(0, '-2.059')] [2022-07-10 16:00:50,494][26022] Updated weights on worker 0-0, policy_version 792473 (0.00083) [2022-07-10 16:00:52,438][26022] Updated weights on worker 0-0, policy_version 792483 (0.00083) [2022-07-10 16:00:54,216][26022] Updated weights on worker 0-0, policy_version 792493 (0.00089) [2022-07-10 16:00:55,486][25689] Fps is (10 sec: 5494.8, 60 sec: 5500.4, 300 sec: 5508.2). Total num frames: 811518976. Throughput: 0: 5757.3. Samples: 811518678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:00:55,486][25689] Avg episode reward: [(0, '-1.916')] [2022-07-10 16:00:56,055][26022] Updated weights on worker 0-0, policy_version 792503 (0.00596) [2022-07-10 16:00:57,888][26022] Updated weights on worker 0-0, policy_version 792513 (0.00085) [2022-07-10 16:00:59,847][26022] Updated weights on worker 0-0, policy_version 792523 (0.00084) [2022-07-10 16:01:00,508][25689] Fps is (10 sec: 5399.4, 60 sec: 5452.7, 300 sec: 5509.1). Total num frames: 811545600. Throughput: 0: 5772.1. Samples: 811552028. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:00,509][25689] Avg episode reward: [(0, '-2.542')] [2022-07-10 16:01:01,626][26022] Updated weights on worker 0-0, policy_version 792533 (0.00097) [2022-07-10 16:01:04,025][26022] Updated weights on worker 0-0, policy_version 792543 (0.00093) [2022-07-10 16:01:05,570][25689] Fps is (10 sec: 5280.4, 60 sec: 5491.4, 300 sec: 5504.8). Total num frames: 811572224. Throughput: 0: 4832.2. Samples: 811566204. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:05,570][25689] Avg episode reward: [(0, '-2.731')] [2022-07-10 16:01:05,755][26022] Updated weights on worker 0-0, policy_version 792553 (0.00092) [2022-07-10 16:01:07,655][26022] Updated weights on worker 0-0, policy_version 792563 (0.00084) [2022-07-10 16:01:09,563][26022] Updated weights on worker 0-0, policy_version 792573 (0.00090) [2022-07-10 16:01:10,607][25689] Fps is (10 sec: 5475.7, 60 sec: 5505.2, 300 sec: 5505.0). Total num frames: 811600896. Throughput: 0: 5653.7. Samples: 811599370. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:10,608][25689] Avg episode reward: [(0, '-0.316')] [2022-07-10 16:01:11,425][26022] Updated weights on worker 0-0, policy_version 792583 (0.00095) [2022-07-10 16:01:13,170][26022] Updated weights on worker 0-0, policy_version 792593 (0.00089) [2022-07-10 16:01:15,146][26022] Updated weights on worker 0-0, policy_version 792603 (0.00087) [2022-07-10 16:01:15,609][25689] Fps is (10 sec: 5508.0, 60 sec: 5475.4, 300 sec: 5505.0). Total num frames: 811627520. Throughput: 0: 5649.9. Samples: 811632372. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:15,609][25689] Avg episode reward: [(0, '-0.516')] [2022-07-10 16:01:16,863][26022] Updated weights on worker 0-0, policy_version 792613 (0.00082) [2022-07-10 16:01:18,897][26022] Updated weights on worker 0-0, policy_version 792623 (0.00557) [2022-07-10 16:01:20,617][25689] Fps is (10 sec: 5421.6, 60 sec: 5458.2, 300 sec: 5506.5). Total num frames: 811655168. Throughput: 0: 4818.9. Samples: 811648930. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:20,618][25689] Avg episode reward: [(0, '-0.566')] [2022-07-10 16:01:20,759][26022] Updated weights on worker 0-0, policy_version 792633 (0.00099) [2022-07-10 16:01:22,450][26022] Updated weights on worker 0-0, policy_version 792643 (0.00084) [2022-07-10 16:01:24,319][26022] Updated weights on worker 0-0, policy_version 792653 (0.00086) [2022-07-10 16:01:25,683][25689] Fps is (10 sec: 5590.6, 60 sec: 5493.2, 300 sec: 5501.9). Total num frames: 811683840. Throughput: 0: 5779.4. Samples: 811682446. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:25,684][25689] Avg episode reward: [(0, '-0.747')] [2022-07-10 16:01:26,210][26022] Updated weights on worker 0-0, policy_version 792663 (0.00099) [2022-07-10 16:01:27,997][26022] Updated weights on worker 0-0, policy_version 792673 (0.00081) [2022-07-10 16:01:29,905][26022] Updated weights on worker 0-0, policy_version 792683 (0.00090) [2022-07-10 16:01:30,785][25689] Fps is (10 sec: 5639.7, 60 sec: 5502.5, 300 sec: 5507.0). Total num frames: 811712512. Throughput: 0: 5768.4. Samples: 811715766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:30,785][25689] Avg episode reward: [(0, '-0.186')] [2022-07-10 16:01:31,735][26022] Updated weights on worker 0-0, policy_version 792693 (0.00083) [2022-07-10 16:01:33,516][26022] Updated weights on worker 0-0, policy_version 792703 (0.00086) [2022-07-10 16:01:35,539][26022] Updated weights on worker 0-0, policy_version 792713 (0.00095) [2022-07-10 16:01:35,815][25689] Fps is (10 sec: 5457.6, 60 sec: 5451.4, 300 sec: 5503.3). Total num frames: 811739136. Throughput: 0: 5774.3. Samples: 811749048. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:35,815][25689] Avg episode reward: [(0, '-0.649')] [2022-07-10 16:01:37,117][26022] Updated weights on worker 0-0, policy_version 792723 (0.00089) [2022-07-10 16:01:39,080][26022] Updated weights on worker 0-0, policy_version 792733 (0.01144) [2022-07-10 16:01:40,863][25689] Fps is (10 sec: 5487.0, 60 sec: 5500.8, 300 sec: 5504.1). Total num frames: 811767808. Throughput: 0: 5764.1. Samples: 811765626. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:40,864][25689] Avg episode reward: [(0, '-0.460')] [2022-07-10 16:01:40,987][26022] Updated weights on worker 0-0, policy_version 792743 (0.00084) [2022-07-10 16:01:42,778][26022] Updated weights on worker 0-0, policy_version 792753 (0.00085) [2022-07-10 16:01:44,916][26022] Updated weights on worker 0-0, policy_version 792763 (0.00082) [2022-07-10 16:01:45,946][25689] Fps is (10 sec: 5558.9, 60 sec: 5481.7, 300 sec: 5499.5). Total num frames: 811795456. Throughput: 0: 5733.5. Samples: 811798626. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:45,947][25689] Avg episode reward: [(0, '-0.900')] [2022-07-10 16:01:46,318][26022] Updated weights on worker 0-0, policy_version 792773 (0.00054) [2022-07-10 16:01:48,568][26022] Updated weights on worker 0-0, policy_version 792783 (0.00120) [2022-07-10 16:01:50,231][26022] Updated weights on worker 0-0, policy_version 792793 (0.00085) [2022-07-10 16:01:50,960][25689] Fps is (10 sec: 5374.6, 60 sec: 5466.7, 300 sec: 5499.7). Total num frames: 811822080. Throughput: 0: 5764.3. Samples: 811832064. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:50,961][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 16:01:52,029][26022] Updated weights on worker 0-0, policy_version 792803 (0.00092) [2022-07-10 16:01:54,032][26022] Updated weights on worker 0-0, policy_version 792813 (0.00085) [2022-07-10 16:01:55,603][26022] Updated weights on worker 0-0, policy_version 792823 (0.00088) [2022-07-10 16:01:55,987][25689] Fps is (10 sec: 5609.3, 60 sec: 5500.7, 300 sec: 5499.5). Total num frames: 811851776. Throughput: 0: 4946.5. Samples: 811848828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:01:55,987][25689] Avg episode reward: [(0, '-0.152')] [2022-07-10 16:01:57,569][26022] Updated weights on worker 0-0, policy_version 792833 (0.00094) [2022-07-10 16:01:59,383][26022] Updated weights on worker 0-0, policy_version 792843 (0.00399) [2022-07-10 16:02:00,995][25689] Fps is (10 sec: 5816.9, 60 sec: 5535.9, 300 sec: 5510.7). Total num frames: 811880448. Throughput: 0: 5818.7. Samples: 811882768. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:00,996][25689] Avg episode reward: [(0, '-1.203')] [2022-07-10 16:02:01,131][26022] Updated weights on worker 0-0, policy_version 792853 (0.00094) [2022-07-10 16:02:03,447][26022] Updated weights on worker 0-0, policy_version 792863 (0.00086) [2022-07-10 16:02:05,079][26022] Updated weights on worker 0-0, policy_version 792873 (0.00093) [2022-07-10 16:02:06,065][25689] Fps is (10 sec: 5384.9, 60 sec: 5518.1, 300 sec: 5507.6). Total num frames: 811906048. Throughput: 0: 5748.7. Samples: 811914284. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:06,066][25689] Avg episode reward: [(0, '-0.927')] [2022-07-10 16:02:07,123][26022] Updated weights on worker 0-0, policy_version 792883 (0.00096) [2022-07-10 16:02:08,964][26022] Updated weights on worker 0-0, policy_version 792893 (0.00086) [2022-07-10 16:02:10,718][26022] Updated weights on worker 0-0, policy_version 792903 (0.00087) [2022-07-10 16:02:11,115][25689] Fps is (10 sec: 5362.5, 60 sec: 5517.0, 300 sec: 5510.2). Total num frames: 811934720. Throughput: 0: 4913.6. Samples: 811931098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:11,116][25689] Avg episode reward: [(0, '-1.896')] [2022-07-10 16:02:12,452][26022] Updated weights on worker 0-0, policy_version 792913 (0.00092) [2022-07-10 16:02:14,430][26022] Updated weights on worker 0-0, policy_version 792923 (0.00094) [2022-07-10 16:02:16,133][25689] Fps is (10 sec: 5695.8, 60 sec: 5549.4, 300 sec: 5504.0). Total num frames: 811963392. Throughput: 0: 5752.5. Samples: 811964720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:16,136][26022] Updated weights on worker 0-0, policy_version 792933 (0.00086) [2022-07-10 16:02:16,136][25689] Avg episode reward: [(0, '-1.424')] [2022-07-10 16:02:18,072][26022] Updated weights on worker 0-0, policy_version 792943 (0.00093) [2022-07-10 16:02:19,834][26022] Updated weights on worker 0-0, policy_version 792953 (0.00087) [2022-07-10 16:02:21,166][25689] Fps is (10 sec: 5705.2, 60 sec: 5564.0, 300 sec: 5508.7). Total num frames: 811992064. Throughput: 0: 5720.5. Samples: 811998162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:21,167][25689] Avg episode reward: [(0, '-1.551')] [2022-07-10 16:02:21,720][26022] Updated weights on worker 0-0, policy_version 792963 (0.00085) [2022-07-10 16:02:23,522][26022] Updated weights on worker 0-0, policy_version 792973 (0.00088) [2022-07-10 16:02:25,632][26022] Updated weights on worker 0-0, policy_version 792983 (0.00095) [2022-07-10 16:02:26,238][25689] Fps is (10 sec: 5472.6, 60 sec: 5529.7, 300 sec: 5502.0). Total num frames: 812018688. Throughput: 0: 4986.4. Samples: 812014870. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:26,238][25689] Avg episode reward: [(0, '-2.384')] [2022-07-10 16:02:27,142][26022] Updated weights on worker 0-0, policy_version 792993 (0.00088) [2022-07-10 16:02:29,260][26022] Updated weights on worker 0-0, policy_version 793003 (0.00095) [2022-07-10 16:02:30,991][26022] Updated weights on worker 0-0, policy_version 793013 (0.00081) [2022-07-10 16:02:31,298][25689] Fps is (10 sec: 5457.9, 60 sec: 5533.5, 300 sec: 5508.5). Total num frames: 812047360. Throughput: 0: 5800.3. Samples: 812048164. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:31,298][25689] Avg episode reward: [(0, '-1.830')] [2022-07-10 16:02:32,785][26022] Updated weights on worker 0-0, policy_version 793023 (0.00092) [2022-07-10 16:02:34,695][26022] Updated weights on worker 0-0, policy_version 793033 (0.00107) [2022-07-10 16:02:36,319][25689] Fps is (10 sec: 5586.3, 60 sec: 5551.2, 300 sec: 5504.8). Total num frames: 812075008. Throughput: 0: 5791.4. Samples: 812081626. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:36,320][25689] Avg episode reward: [(0, '-2.654')] [2022-07-10 16:02:36,374][26022] Updated weights on worker 0-0, policy_version 793043 (0.00088) [2022-07-10 16:02:38,332][26022] Updated weights on worker 0-0, policy_version 793053 (0.00089) [2022-07-10 16:02:40,203][26022] Updated weights on worker 0-0, policy_version 793063 (0.00081) [2022-07-10 16:02:41,411][25689] Fps is (10 sec: 5467.5, 60 sec: 5530.2, 300 sec: 5504.5). Total num frames: 812102656. Throughput: 0: 4945.6. Samples: 812098286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:41,412][25689] Avg episode reward: [(0, '-2.587')] [2022-07-10 16:02:41,927][26022] Updated weights on worker 0-0, policy_version 793073 (0.00090) [2022-07-10 16:02:43,796][26022] Updated weights on worker 0-0, policy_version 793083 (0.00094) [2022-07-10 16:02:44,126][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:02:44,138][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000793084_812118016.pth [2022-07-10 16:02:44,139][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000791147_810134528.pth [2022-07-10 16:02:45,727][26022] Updated weights on worker 0-0, policy_version 793093 (0.00087) [2022-07-10 16:02:46,491][25689] Fps is (10 sec: 5537.2, 60 sec: 5547.5, 300 sec: 5504.2). Total num frames: 812131328. Throughput: 0: 5760.1. Samples: 812131530. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:46,491][25689] Avg episode reward: [(0, '-2.557')] [2022-07-10 16:02:47,558][26022] Updated weights on worker 0-0, policy_version 793103 (0.00081) [2022-07-10 16:02:49,349][26022] Updated weights on worker 0-0, policy_version 793113 (0.00091) [2022-07-10 16:02:51,124][26022] Updated weights on worker 0-0, policy_version 793123 (0.00090) [2022-07-10 16:02:51,513][25689] Fps is (10 sec: 5677.0, 60 sec: 5580.6, 300 sec: 5508.0). Total num frames: 812160000. Throughput: 0: 5797.1. Samples: 812165352. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:51,513][25689] Avg episode reward: [(0, '-3.659')] [2022-07-10 16:02:52,964][26022] Updated weights on worker 0-0, policy_version 793133 (0.00083) [2022-07-10 16:02:54,959][26022] Updated weights on worker 0-0, policy_version 793143 (0.00088) [2022-07-10 16:02:56,514][25689] Fps is (10 sec: 5618.9, 60 sec: 5549.1, 300 sec: 5505.2). Total num frames: 812187648. Throughput: 0: 4981.9. Samples: 812182234. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:02:56,515][25689] Avg episode reward: [(0, '-2.618')] [2022-07-10 16:02:56,559][26022] Updated weights on worker 0-0, policy_version 793153 (0.00090) [2022-07-10 16:02:58,495][26022] Updated weights on worker 0-0, policy_version 793163 (0.00090) [2022-07-10 16:03:00,112][26022] Updated weights on worker 0-0, policy_version 793173 (0.00087) [2022-07-10 16:03:01,559][25689] Fps is (10 sec: 5504.2, 60 sec: 5528.8, 300 sec: 5515.5). Total num frames: 812215296. Throughput: 0: 5844.3. Samples: 812216034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:01,559][25689] Avg episode reward: [(0, '-2.335')] [2022-07-10 16:03:02,471][26022] Updated weights on worker 0-0, policy_version 793183 (0.00085) [2022-07-10 16:03:04,468][26022] Updated weights on worker 0-0, policy_version 793193 (0.00094) [2022-07-10 16:03:06,282][26022] Updated weights on worker 0-0, policy_version 793203 (0.00108) [2022-07-10 16:03:06,692][25689] Fps is (10 sec: 5432.7, 60 sec: 5556.9, 300 sec: 5513.7). Total num frames: 812242944. Throughput: 0: 5718.9. Samples: 812247062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:06,693][25689] Avg episode reward: [(0, '-1.240')] [2022-07-10 16:03:08,215][26022] Updated weights on worker 0-0, policy_version 793213 (0.00095) [2022-07-10 16:03:09,799][26022] Updated weights on worker 0-0, policy_version 793223 (0.00084) [2022-07-10 16:03:11,711][25689] Fps is (10 sec: 5345.8, 60 sec: 5525.9, 300 sec: 5506.7). Total num frames: 812269568. Throughput: 0: 4873.7. Samples: 812263794. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:11,712][25689] Avg episode reward: [(0, '-2.275')] [2022-07-10 16:03:11,727][26022] Updated weights on worker 0-0, policy_version 793233 (0.00094) [2022-07-10 16:03:13,525][26022] Updated weights on worker 0-0, policy_version 793243 (0.00097) [2022-07-10 16:03:15,376][26022] Updated weights on worker 0-0, policy_version 793253 (0.00093) [2022-07-10 16:03:16,755][25689] Fps is (10 sec: 5393.4, 60 sec: 5506.7, 300 sec: 5510.0). Total num frames: 812297216. Throughput: 0: 5681.0. Samples: 812297224. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:16,756][25689] Avg episode reward: [(0, '-2.781')] [2022-07-10 16:03:17,370][26022] Updated weights on worker 0-0, policy_version 793263 (0.00092) [2022-07-10 16:03:18,993][26022] Updated weights on worker 0-0, policy_version 793273 (0.00063) [2022-07-10 16:03:20,908][26022] Updated weights on worker 0-0, policy_version 793283 (0.00089) [2022-07-10 16:03:21,808][25689] Fps is (10 sec: 5679.4, 60 sec: 5521.8, 300 sec: 5514.6). Total num frames: 812326912. Throughput: 0: 5657.4. Samples: 812330590. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:21,808][25689] Avg episode reward: [(0, '-1.372')] [2022-07-10 16:03:22,888][26022] Updated weights on worker 0-0, policy_version 793293 (0.00091) [2022-07-10 16:03:24,479][26022] Updated weights on worker 0-0, policy_version 793303 (0.00093) [2022-07-10 16:03:26,437][26022] Updated weights on worker 0-0, policy_version 793313 (0.00088) [2022-07-10 16:03:26,880][25689] Fps is (10 sec: 5663.6, 60 sec: 5538.6, 300 sec: 5506.5). Total num frames: 812354560. Throughput: 0: 5795.9. Samples: 812364068. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:26,880][25689] Avg episode reward: [(0, '-1.371')] [2022-07-10 16:03:28,424][26022] Updated weights on worker 0-0, policy_version 793323 (0.00085) [2022-07-10 16:03:30,241][26022] Updated weights on worker 0-0, policy_version 793333 (0.00086) [2022-07-10 16:03:31,911][25689] Fps is (10 sec: 5371.8, 60 sec: 5507.5, 300 sec: 5507.1). Total num frames: 812381184. Throughput: 0: 5789.7. Samples: 812380744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:31,911][25689] Avg episode reward: [(0, '-0.656')] [2022-07-10 16:03:32,125][26022] Updated weights on worker 0-0, policy_version 793343 (0.00075) [2022-07-10 16:03:33,906][26022] Updated weights on worker 0-0, policy_version 793353 (0.00085) [2022-07-10 16:03:35,664][26022] Updated weights on worker 0-0, policy_version 793363 (0.00091) [2022-07-10 16:03:36,931][25689] Fps is (10 sec: 5603.6, 60 sec: 5541.4, 300 sec: 5513.9). Total num frames: 812410880. Throughput: 0: 5807.0. Samples: 812414384. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:36,932][25689] Avg episode reward: [(0, '-1.225')] [2022-07-10 16:03:37,516][26022] Updated weights on worker 0-0, policy_version 793373 (0.00088) [2022-07-10 16:03:39,219][26022] Updated weights on worker 0-0, policy_version 793383 (0.00087) [2022-07-10 16:03:41,234][26022] Updated weights on worker 0-0, policy_version 793393 (0.00087) [2022-07-10 16:03:41,940][25689] Fps is (10 sec: 5819.8, 60 sec: 5565.9, 300 sec: 5518.2). Total num frames: 812439552. Throughput: 0: 5835.0. Samples: 812448062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:41,941][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 16:03:43,161][26022] Updated weights on worker 0-0, policy_version 793404 (0.00089) [2022-07-10 16:03:44,911][26022] Updated weights on worker 0-0, policy_version 793414 (0.00080) [2022-07-10 16:03:46,643][26022] Updated weights on worker 0-0, policy_version 793424 (0.00084) [2022-07-10 16:03:47,006][25689] Fps is (10 sec: 5589.7, 60 sec: 5550.2, 300 sec: 5510.1). Total num frames: 812467200. Throughput: 0: 5006.5. Samples: 812464830. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:47,007][25689] Avg episode reward: [(0, '-0.342')] [2022-07-10 16:03:48,472][26022] Updated weights on worker 0-0, policy_version 793434 (0.00087) [2022-07-10 16:03:50,293][26022] Updated weights on worker 0-0, policy_version 793444 (0.00085) [2022-07-10 16:03:52,013][25689] Fps is (10 sec: 5489.5, 60 sec: 5534.6, 300 sec: 5513.7). Total num frames: 812494848. Throughput: 0: 5871.4. Samples: 812498774. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:52,014][25689] Avg episode reward: [(0, '-1.725')] [2022-07-10 16:03:52,382][26022] Updated weights on worker 0-0, policy_version 793454 (0.00085) [2022-07-10 16:03:53,904][26022] Updated weights on worker 0-0, policy_version 793464 (0.00089) [2022-07-10 16:03:55,722][26022] Updated weights on worker 0-0, policy_version 793474 (0.00084) [2022-07-10 16:03:57,055][25689] Fps is (10 sec: 5604.9, 60 sec: 5547.9, 300 sec: 5516.7). Total num frames: 812523520. Throughput: 0: 5878.3. Samples: 812532680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:03:57,057][25689] Avg episode reward: [(0, '-1.698')] [2022-07-10 16:03:57,660][26022] Updated weights on worker 0-0, policy_version 793484 (0.00091) [2022-07-10 16:03:59,316][26022] Updated weights on worker 0-0, policy_version 793494 (0.00084) [2022-07-10 16:04:01,753][26022] Updated weights on worker 0-0, policy_version 793504 (0.00100) [2022-07-10 16:04:02,087][25689] Fps is (10 sec: 5387.7, 60 sec: 5515.2, 300 sec: 5510.4). Total num frames: 812549120. Throughput: 0: 5038.1. Samples: 812549562. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:02,088][25689] Avg episode reward: [(0, '-2.244')] [2022-07-10 16:04:03,311][26022] Updated weights on worker 0-0, policy_version 793514 (0.00090) [2022-07-10 16:04:05,503][26022] Updated weights on worker 0-0, policy_version 793524 (0.00092) [2022-07-10 16:04:07,207][25689] Fps is (10 sec: 5446.5, 60 sec: 5550.2, 300 sec: 5518.7). Total num frames: 812578816. Throughput: 0: 5754.7. Samples: 812581082. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:07,208][25689] Avg episode reward: [(0, '-2.561')] [2022-07-10 16:04:07,209][26022] Updated weights on worker 0-0, policy_version 793534 (0.00088) [2022-07-10 16:04:09,038][26022] Updated weights on worker 0-0, policy_version 793544 (0.00088) [2022-07-10 16:04:10,982][26022] Updated weights on worker 0-0, policy_version 793554 (0.00098) [2022-07-10 16:04:12,220][25689] Fps is (10 sec: 5557.7, 60 sec: 5550.7, 300 sec: 5512.4). Total num frames: 812605440. Throughput: 0: 5720.1. Samples: 812614360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:12,221][25689] Avg episode reward: [(0, '-4.716')] [2022-07-10 16:04:12,731][26022] Updated weights on worker 0-0, policy_version 793564 (0.00088) [2022-07-10 16:04:14,578][26022] Updated weights on worker 0-0, policy_version 793574 (0.00094) [2022-07-10 16:04:16,508][26022] Updated weights on worker 0-0, policy_version 793584 (0.00725) [2022-07-10 16:04:17,232][25689] Fps is (10 sec: 5413.8, 60 sec: 5553.7, 300 sec: 5516.1). Total num frames: 812633088. Throughput: 0: 4864.4. Samples: 812630830. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:17,233][25689] Avg episode reward: [(0, '-3.725')] [2022-07-10 16:04:18,364][26022] Updated weights on worker 0-0, policy_version 793594 (0.00083) [2022-07-10 16:04:20,091][26022] Updated weights on worker 0-0, policy_version 793604 (0.00088) [2022-07-10 16:04:22,080][26022] Updated weights on worker 0-0, policy_version 793614 (0.00091) [2022-07-10 16:04:22,248][25689] Fps is (10 sec: 5616.8, 60 sec: 5540.2, 300 sec: 5517.9). Total num frames: 812661760. Throughput: 0: 5706.3. Samples: 812664606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:22,248][25689] Avg episode reward: [(0, '-4.889')] [2022-07-10 16:04:23,702][26022] Updated weights on worker 0-0, policy_version 793624 (0.00093) [2022-07-10 16:04:25,700][26022] Updated weights on worker 0-0, policy_version 793634 (0.00094) [2022-07-10 16:04:27,381][25689] Fps is (10 sec: 5650.4, 60 sec: 5551.5, 300 sec: 5519.2). Total num frames: 812690432. Throughput: 0: 5796.7. Samples: 812698022. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:27,381][25689] Avg episode reward: [(0, '-6.781')] [2022-07-10 16:04:27,466][26022] Updated weights on worker 0-0, policy_version 793644 (0.00084) [2022-07-10 16:04:29,325][26022] Updated weights on worker 0-0, policy_version 793654 (0.00083) [2022-07-10 16:04:31,126][26022] Updated weights on worker 0-0, policy_version 793664 (0.00089) [2022-07-10 16:04:32,383][25689] Fps is (10 sec: 5657.5, 60 sec: 5588.0, 300 sec: 5522.7). Total num frames: 812719104. Throughput: 0: 4981.6. Samples: 812714804. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:32,384][25689] Avg episode reward: [(0, '-6.141')] [2022-07-10 16:04:32,869][26022] Updated weights on worker 0-0, policy_version 793674 (0.00086) [2022-07-10 16:04:34,708][26022] Updated weights on worker 0-0, policy_version 793684 (0.00090) [2022-07-10 16:04:36,786][26022] Updated weights on worker 0-0, policy_version 793694 (0.00083) [2022-07-10 16:04:37,407][25689] Fps is (10 sec: 5413.2, 60 sec: 5519.9, 300 sec: 5512.1). Total num frames: 812744704. Throughput: 0: 5832.4. Samples: 812748496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:37,407][25689] Avg episode reward: [(0, '-6.300')] [2022-07-10 16:04:38,238][26022] Updated weights on worker 0-0, policy_version 793704 (0.00094) [2022-07-10 16:04:40,487][26022] Updated weights on worker 0-0, policy_version 793714 (0.00089) [2022-07-10 16:04:42,019][26022] Updated weights on worker 0-0, policy_version 793724 (0.00090) [2022-07-10 16:04:42,422][25689] Fps is (10 sec: 5508.3, 60 sec: 5536.3, 300 sec: 5524.4). Total num frames: 812774400. Throughput: 0: 5820.5. Samples: 812782032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:42,423][25689] Avg episode reward: [(0, '-4.963')] [2022-07-10 16:04:43,927][26022] Updated weights on worker 0-0, policy_version 793734 (0.00084) [2022-07-10 16:04:44,376][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:04:44,394][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000793736_812785664.pth [2022-07-10 16:04:44,394][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000791793_810796032.pth [2022-07-10 16:04:45,874][26022] Updated weights on worker 0-0, policy_version 793744 (0.00082) [2022-07-10 16:04:47,550][25689] Fps is (10 sec: 5855.4, 60 sec: 5564.5, 300 sec: 5529.2). Total num frames: 812804096. Throughput: 0: 5000.7. Samples: 812798880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:47,550][25689] Avg episode reward: [(0, '-4.114')] [2022-07-10 16:04:47,555][26022] Updated weights on worker 0-0, policy_version 793754 (0.00096) [2022-07-10 16:04:49,456][26022] Updated weights on worker 0-0, policy_version 793764 (0.00086) [2022-07-10 16:04:51,277][26022] Updated weights on worker 0-0, policy_version 793774 (0.00087) [2022-07-10 16:04:52,646][25689] Fps is (10 sec: 5508.6, 60 sec: 5539.4, 300 sec: 5524.8). Total num frames: 812830720. Throughput: 0: 5805.7. Samples: 812832444. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:52,647][25689] Avg episode reward: [(0, '-2.495')] [2022-07-10 16:04:53,140][26022] Updated weights on worker 0-0, policy_version 793784 (0.00092) [2022-07-10 16:04:54,929][26022] Updated weights on worker 0-0, policy_version 793794 (0.00086) [2022-07-10 16:04:56,693][26022] Updated weights on worker 0-0, policy_version 793804 (0.00087) [2022-07-10 16:04:57,711][25689] Fps is (10 sec: 5542.8, 60 sec: 5554.2, 300 sec: 5524.7). Total num frames: 812860416. Throughput: 0: 5795.4. Samples: 812866166. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:04:57,711][25689] Avg episode reward: [(0, '-1.807')] [2022-07-10 16:04:58,588][26022] Updated weights on worker 0-0, policy_version 793814 (0.00087) [2022-07-10 16:05:00,299][26022] Updated weights on worker 0-0, policy_version 793824 (0.00090) [2022-07-10 16:05:02,560][26022] Updated weights on worker 0-0, policy_version 793834 (0.00090) [2022-07-10 16:05:02,801][25689] Fps is (10 sec: 5445.1, 60 sec: 5548.8, 300 sec: 5528.6). Total num frames: 812886016. Throughput: 0: 4961.5. Samples: 812883144. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:02,802][25689] Avg episode reward: [(0, '-0.112')] [2022-07-10 16:05:04,470][26022] Updated weights on worker 0-0, policy_version 793844 (0.00083) [2022-07-10 16:05:06,476][26022] Updated weights on worker 0-0, policy_version 793854 (0.00085) [2022-07-10 16:05:07,901][25689] Fps is (10 sec: 5326.1, 60 sec: 5533.9, 300 sec: 5530.2). Total num frames: 812914688. Throughput: 0: 5675.2. Samples: 812914376. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:07,901][25689] Avg episode reward: [(0, '-0.499')] [2022-07-10 16:05:07,998][26022] Updated weights on worker 0-0, policy_version 793864 (0.00087) [2022-07-10 16:05:09,986][26022] Updated weights on worker 0-0, policy_version 793874 (0.00084) [2022-07-10 16:05:11,908][26022] Updated weights on worker 0-0, policy_version 793884 (0.00094) [2022-07-10 16:05:12,915][25689] Fps is (10 sec: 5670.2, 60 sec: 5567.6, 300 sec: 5530.8). Total num frames: 812943360. Throughput: 0: 5689.9. Samples: 812947770. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:12,916][25689] Avg episode reward: [(0, '-0.752')] [2022-07-10 16:05:13,663][26022] Updated weights on worker 0-0, policy_version 793894 (0.00095) [2022-07-10 16:05:15,726][26022] Updated weights on worker 0-0, policy_version 793904 (0.00086) [2022-07-10 16:05:17,239][26022] Updated weights on worker 0-0, policy_version 793914 (0.00086) [2022-07-10 16:05:17,918][25689] Fps is (10 sec: 5520.1, 60 sec: 5551.5, 300 sec: 5524.0). Total num frames: 812969984. Throughput: 0: 4861.5. Samples: 812964404. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:17,918][25689] Avg episode reward: [(0, '-2.856')] [2022-07-10 16:05:19,287][26022] Updated weights on worker 0-0, policy_version 793924 (0.01247) [2022-07-10 16:05:21,175][26022] Updated weights on worker 0-0, policy_version 793934 (0.00099) [2022-07-10 16:05:22,930][25689] Fps is (10 sec: 5418.9, 60 sec: 5534.9, 300 sec: 5528.6). Total num frames: 812997632. Throughput: 0: 5689.3. Samples: 812997662. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:22,932][25689] Avg episode reward: [(0, '-2.650')] [2022-07-10 16:05:23,022][26022] Updated weights on worker 0-0, policy_version 793944 (0.00094) [2022-07-10 16:05:24,703][26022] Updated weights on worker 0-0, policy_version 793954 (0.00091) [2022-07-10 16:05:26,540][26022] Updated weights on worker 0-0, policy_version 793964 (0.00092) [2022-07-10 16:05:28,031][25689] Fps is (10 sec: 5568.8, 60 sec: 5537.8, 300 sec: 5530.5). Total num frames: 813026304. Throughput: 0: 5797.8. Samples: 813031090. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:28,032][25689] Avg episode reward: [(0, '-2.986')] [2022-07-10 16:05:28,452][26022] Updated weights on worker 0-0, policy_version 793974 (0.00090) [2022-07-10 16:05:30,309][26022] Updated weights on worker 0-0, policy_version 793984 (0.00087) [2022-07-10 16:05:32,135][26022] Updated weights on worker 0-0, policy_version 793994 (0.00096) [2022-07-10 16:05:33,036][25689] Fps is (10 sec: 5674.1, 60 sec: 5537.6, 300 sec: 5527.5). Total num frames: 813054976. Throughput: 0: 4964.9. Samples: 813047672. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:33,037][25689] Avg episode reward: [(0, '-3.503')] [2022-07-10 16:05:34,030][26022] Updated weights on worker 0-0, policy_version 794004 (0.00091) [2022-07-10 16:05:35,897][26022] Updated weights on worker 0-0, policy_version 794014 (0.00092) [2022-07-10 16:05:37,527][26022] Updated weights on worker 0-0, policy_version 794024 (0.00088) [2022-07-10 16:05:38,075][25689] Fps is (10 sec: 5505.8, 60 sec: 5553.1, 300 sec: 5530.8). Total num frames: 813081600. Throughput: 0: 5772.0. Samples: 813080748. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:38,075][25689] Avg episode reward: [(0, '-3.655')] [2022-07-10 16:05:39,759][26022] Updated weights on worker 0-0, policy_version 794034 (0.00093) [2022-07-10 16:05:41,244][26022] Updated weights on worker 0-0, policy_version 794044 (0.00084) [2022-07-10 16:05:43,084][25689] Fps is (10 sec: 5503.3, 60 sec: 5536.8, 300 sec: 5531.8). Total num frames: 813110272. Throughput: 0: 5796.5. Samples: 813114484. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:43,085][25689] Avg episode reward: [(0, '-3.125')] [2022-07-10 16:05:43,375][26022] Updated weights on worker 0-0, policy_version 794054 (0.00095) [2022-07-10 16:05:45,163][26022] Updated weights on worker 0-0, policy_version 794064 (0.00083) [2022-07-10 16:05:47,035][26022] Updated weights on worker 0-0, policy_version 794074 (0.00051) [2022-07-10 16:05:48,130][25689] Fps is (10 sec: 5600.8, 60 sec: 5510.4, 300 sec: 5531.5). Total num frames: 813137920. Throughput: 0: 5793.4. Samples: 813147532. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:48,131][25689] Avg episode reward: [(0, '-1.614')] [2022-07-10 16:05:48,924][26022] Updated weights on worker 0-0, policy_version 794084 (0.00092) [2022-07-10 16:05:50,647][26022] Updated weights on worker 0-0, policy_version 794094 (0.00085) [2022-07-10 16:05:52,625][26022] Updated weights on worker 0-0, policy_version 794104 (0.00086) [2022-07-10 16:05:53,135][25689] Fps is (10 sec: 5603.3, 60 sec: 5552.6, 300 sec: 5535.4). Total num frames: 813166592. Throughput: 0: 5791.5. Samples: 813164074. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:53,136][25689] Avg episode reward: [(0, '-2.982')] [2022-07-10 16:05:54,335][26022] Updated weights on worker 0-0, policy_version 794114 (0.00085) [2022-07-10 16:05:56,177][26022] Updated weights on worker 0-0, policy_version 794124 (0.00088) [2022-07-10 16:05:57,967][26022] Updated weights on worker 0-0, policy_version 794134 (0.00092) [2022-07-10 16:05:58,206][25689] Fps is (10 sec: 5487.8, 60 sec: 5501.2, 300 sec: 5534.5). Total num frames: 813193216. Throughput: 0: 5804.0. Samples: 813197592. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:05:58,207][25689] Avg episode reward: [(0, '-2.589')] [2022-07-10 16:05:59,862][26022] Updated weights on worker 0-0, policy_version 794144 (0.00091) [2022-07-10 16:06:01,779][26022] Updated weights on worker 0-0, policy_version 794154 (0.00085) [2022-07-10 16:06:03,256][25689] Fps is (10 sec: 5261.3, 60 sec: 5521.9, 300 sec: 5534.7). Total num frames: 813219840. Throughput: 0: 5696.2. Samples: 813229388. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:03,256][25689] Avg episode reward: [(0, '-3.029')] [2022-07-10 16:06:03,944][26022] Updated weights on worker 0-0, policy_version 794164 (0.00088) [2022-07-10 16:06:05,639][26022] Updated weights on worker 0-0, policy_version 794174 (0.00089) [2022-07-10 16:06:07,615][26022] Updated weights on worker 0-0, policy_version 794184 (0.00097) [2022-07-10 16:06:08,307][25689] Fps is (10 sec: 5576.0, 60 sec: 5543.3, 300 sec: 5537.9). Total num frames: 813249536. Throughput: 0: 4885.3. Samples: 813246100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:08,308][25689] Avg episode reward: [(0, '-3.153')] [2022-07-10 16:06:09,381][26022] Updated weights on worker 0-0, policy_version 794194 (0.00078) [2022-07-10 16:06:11,252][26022] Updated weights on worker 0-0, policy_version 794204 (0.00576) [2022-07-10 16:06:13,152][26022] Updated weights on worker 0-0, policy_version 794214 (0.00088) [2022-07-10 16:06:13,327][25689] Fps is (10 sec: 5490.7, 60 sec: 5491.9, 300 sec: 5534.2). Total num frames: 813275136. Throughput: 0: 5701.6. Samples: 813279198. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:13,327][25689] Avg episode reward: [(0, '-3.082')] [2022-07-10 16:06:14,984][26022] Updated weights on worker 0-0, policy_version 794224 (0.00093) [2022-07-10 16:06:16,840][26022] Updated weights on worker 0-0, policy_version 794234 (0.00085) [2022-07-10 16:06:18,328][25689] Fps is (10 sec: 5415.7, 60 sec: 5526.0, 300 sec: 5537.7). Total num frames: 813303808. Throughput: 0: 5738.0. Samples: 813313050. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:18,329][25689] Avg episode reward: [(0, '-1.835')] [2022-07-10 16:06:18,618][26022] Updated weights on worker 0-0, policy_version 794244 (0.00090) [2022-07-10 16:06:20,379][26022] Updated weights on worker 0-0, policy_version 794254 (0.00084) [2022-07-10 16:06:22,226][26022] Updated weights on worker 0-0, policy_version 794264 (0.00081) [2022-07-10 16:06:23,336][25689] Fps is (10 sec: 5831.3, 60 sec: 5560.2, 300 sec: 5542.2). Total num frames: 813333504. Throughput: 0: 5007.5. Samples: 813329940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:23,338][25689] Avg episode reward: [(0, '-0.416')] [2022-07-10 16:06:23,953][26022] Updated weights on worker 0-0, policy_version 794274 (0.00088) [2022-07-10 16:06:25,785][26022] Updated weights on worker 0-0, policy_version 794284 (0.00092) [2022-07-10 16:06:27,617][26022] Updated weights on worker 0-0, policy_version 794294 (0.00087) [2022-07-10 16:06:28,458][25689] Fps is (10 sec: 5560.0, 60 sec: 5524.5, 300 sec: 5535.0). Total num frames: 813360128. Throughput: 0: 5843.8. Samples: 813363858. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:28,458][25689] Avg episode reward: [(0, '-1.681')] [2022-07-10 16:06:29,512][26022] Updated weights on worker 0-0, policy_version 794304 (0.00086) [2022-07-10 16:06:31,435][26022] Updated weights on worker 0-0, policy_version 794314 (0.00086) [2022-07-10 16:06:33,143][26022] Updated weights on worker 0-0, policy_version 794324 (0.00090) [2022-07-10 16:06:33,477][25689] Fps is (10 sec: 5452.9, 60 sec: 5523.2, 300 sec: 5542.1). Total num frames: 813388800. Throughput: 0: 5865.9. Samples: 813397396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:33,478][25689] Avg episode reward: [(0, '-1.167')] [2022-07-10 16:06:34,939][26022] Updated weights on worker 0-0, policy_version 794334 (0.00081) [2022-07-10 16:06:36,694][26022] Updated weights on worker 0-0, policy_version 794344 (0.00096) [2022-07-10 16:06:38,503][25689] Fps is (10 sec: 5708.6, 60 sec: 5558.2, 300 sec: 5542.5). Total num frames: 813417472. Throughput: 0: 5014.7. Samples: 813414220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:38,503][25689] Avg episode reward: [(0, '-1.940')] [2022-07-10 16:06:38,545][26022] Updated weights on worker 0-0, policy_version 794354 (0.00094) [2022-07-10 16:06:40,739][26022] Updated weights on worker 0-0, policy_version 794364 (0.00055) [2022-07-10 16:06:42,423][26022] Updated weights on worker 0-0, policy_version 794374 (0.00495) [2022-07-10 16:06:43,545][25689] Fps is (10 sec: 5594.1, 60 sec: 5538.3, 300 sec: 5543.3). Total num frames: 813445120. Throughput: 0: 5820.1. Samples: 813447556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:43,545][25689] Avg episode reward: [(0, '-1.569')] [2022-07-10 16:06:44,220][26022] Updated weights on worker 0-0, policy_version 794384 (0.00821) [2022-07-10 16:06:44,503][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:06:44,517][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000794386_813451264.pth [2022-07-10 16:06:44,518][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000792438_811456512.pth [2022-07-10 16:06:46,180][26022] Updated weights on worker 0-0, policy_version 794394 (0.00089) [2022-07-10 16:06:47,943][26022] Updated weights on worker 0-0, policy_version 794404 (0.00089) [2022-07-10 16:06:48,615][25689] Fps is (10 sec: 5468.4, 60 sec: 5536.1, 300 sec: 5545.7). Total num frames: 813472768. Throughput: 0: 5772.5. Samples: 813480214. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:48,615][25689] Avg episode reward: [(0, '-2.334')] [2022-07-10 16:06:50,088][26022] Updated weights on worker 0-0, policy_version 794414 (0.00086) [2022-07-10 16:06:51,612][26022] Updated weights on worker 0-0, policy_version 794424 (0.00084) [2022-07-10 16:06:53,578][26022] Updated weights on worker 0-0, policy_version 794434 (0.00088) [2022-07-10 16:06:53,623][25689] Fps is (10 sec: 5486.3, 60 sec: 5518.8, 300 sec: 5539.1). Total num frames: 813500416. Throughput: 0: 4931.2. Samples: 813496744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:53,624][25689] Avg episode reward: [(0, '-1.884')] [2022-07-10 16:06:55,421][26022] Updated weights on worker 0-0, policy_version 794444 (0.00082) [2022-07-10 16:06:57,032][26022] Updated weights on worker 0-0, policy_version 794454 (0.00084) [2022-07-10 16:06:58,638][25689] Fps is (10 sec: 5516.9, 60 sec: 5541.0, 300 sec: 5535.5). Total num frames: 813528064. Throughput: 0: 5762.2. Samples: 813530240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-10 16:06:58,638][25689] Avg episode reward: [(0, '-1.458')] [2022-07-10 16:06:59,068][26022] Updated weights on worker 0-0, policy_version 794464 (0.00082) [2022-07-10 16:07:01,065][26022] Updated weights on worker 0-0, policy_version 794474 (0.00089) [2022-07-10 16:07:03,045][26022] Updated weights on worker 0-0, policy_version 794484 (0.00089) [2022-07-10 16:07:03,650][25689] Fps is (10 sec: 5514.9, 60 sec: 5561.3, 300 sec: 5543.5). Total num frames: 813555712. Throughput: 0: 5693.6. Samples: 813562028. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:03,651][25689] Avg episode reward: [(0, '-1.443')] [2022-07-10 16:07:05,158][26022] Updated weights on worker 0-0, policy_version 794494 (0.00086) [2022-07-10 16:07:06,640][26022] Updated weights on worker 0-0, policy_version 794504 (0.00089) [2022-07-10 16:07:08,713][25689] Fps is (10 sec: 5386.7, 60 sec: 5509.4, 300 sec: 5536.4). Total num frames: 813582336. Throughput: 0: 4902.2. Samples: 813578740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:08,713][25689] Avg episode reward: [(0, '-1.118')] [2022-07-10 16:07:08,716][26022] Updated weights on worker 0-0, policy_version 794514 (0.00081) [2022-07-10 16:07:10,468][26022] Updated weights on worker 0-0, policy_version 794524 (0.00091) [2022-07-10 16:07:12,541][26022] Updated weights on worker 0-0, policy_version 794534 (0.00093) [2022-07-10 16:07:13,791][25689] Fps is (10 sec: 5351.8, 60 sec: 5537.9, 300 sec: 5531.8). Total num frames: 813609984. Throughput: 0: 5711.8. Samples: 813611936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:13,791][25689] Avg episode reward: [(0, '-1.281')] [2022-07-10 16:07:14,187][26022] Updated weights on worker 0-0, policy_version 794544 (0.00066) [2022-07-10 16:07:16,154][26022] Updated weights on worker 0-0, policy_version 794554 (0.00090) [2022-07-10 16:07:17,905][26022] Updated weights on worker 0-0, policy_version 794564 (0.00086) [2022-07-10 16:07:18,812][25689] Fps is (10 sec: 5576.8, 60 sec: 5536.2, 300 sec: 5532.1). Total num frames: 813638656. Throughput: 0: 5696.3. Samples: 813645158. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:18,814][25689] Avg episode reward: [(0, '-1.446')] [2022-07-10 16:07:19,961][26022] Updated weights on worker 0-0, policy_version 794574 (0.00093) [2022-07-10 16:07:21,430][26022] Updated weights on worker 0-0, policy_version 794584 (0.00054) [2022-07-10 16:07:23,487][26022] Updated weights on worker 0-0, policy_version 794594 (0.00083) [2022-07-10 16:07:23,855][25689] Fps is (10 sec: 5596.2, 60 sec: 5499.1, 300 sec: 5536.0). Total num frames: 813666304. Throughput: 0: 4933.2. Samples: 813661702. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:23,855][25689] Avg episode reward: [(0, '-0.037')] [2022-07-10 16:07:25,430][26022] Updated weights on worker 0-0, policy_version 794604 (0.00100) [2022-07-10 16:07:27,049][26022] Updated weights on worker 0-0, policy_version 794614 (0.00073) [2022-07-10 16:07:28,941][25689] Fps is (10 sec: 5559.9, 60 sec: 5536.2, 300 sec: 5535.5). Total num frames: 813694976. Throughput: 0: 5766.8. Samples: 813695394. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:28,942][25689] Avg episode reward: [(0, '0.372')] [2022-07-10 16:07:28,959][26022] Updated weights on worker 0-0, policy_version 794624 (0.00094) [2022-07-10 16:07:30,695][26022] Updated weights on worker 0-0, policy_version 794634 (0.00377) [2022-07-10 16:07:32,581][26022] Updated weights on worker 0-0, policy_version 794644 (0.00083) [2022-07-10 16:07:33,971][25689] Fps is (10 sec: 5567.6, 60 sec: 5518.3, 300 sec: 5535.4). Total num frames: 813722624. Throughput: 0: 5803.1. Samples: 813729042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:33,972][25689] Avg episode reward: [(0, '0.919')] [2022-07-10 16:07:34,423][26022] Updated weights on worker 0-0, policy_version 794654 (0.00330) [2022-07-10 16:07:36,266][26022] Updated weights on worker 0-0, policy_version 794664 (0.00091) [2022-07-10 16:07:38,126][26022] Updated weights on worker 0-0, policy_version 794674 (0.00090) [2022-07-10 16:07:39,011][25689] Fps is (10 sec: 5592.9, 60 sec: 5517.0, 300 sec: 5539.8). Total num frames: 813751296. Throughput: 0: 4995.4. Samples: 813746064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:39,012][25689] Avg episode reward: [(0, '0.310')] [2022-07-10 16:07:39,747][26022] Updated weights on worker 0-0, policy_version 794684 (0.00088) [2022-07-10 16:07:41,700][26022] Updated weights on worker 0-0, policy_version 794694 (0.00086) [2022-07-10 16:07:43,655][26022] Updated weights on worker 0-0, policy_version 794704 (0.00092) [2022-07-10 16:07:44,042][25689] Fps is (10 sec: 5592.2, 60 sec: 5518.0, 300 sec: 5537.3). Total num frames: 813778944. Throughput: 0: 5827.7. Samples: 813779346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:44,042][25689] Avg episode reward: [(0, '0.218')] [2022-07-10 16:07:45,392][26022] Updated weights on worker 0-0, policy_version 794714 (0.00090) [2022-07-10 16:07:47,308][26022] Updated weights on worker 0-0, policy_version 794724 (0.00090) [2022-07-10 16:07:49,045][26022] Updated weights on worker 0-0, policy_version 794734 (0.00085) [2022-07-10 16:07:49,114][25689] Fps is (10 sec: 5574.6, 60 sec: 5534.7, 300 sec: 5536.3). Total num frames: 813807616. Throughput: 0: 5811.1. Samples: 813812620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:49,115][25689] Avg episode reward: [(0, '0.662')] [2022-07-10 16:07:50,846][26022] Updated weights on worker 0-0, policy_version 794744 (0.00088) [2022-07-10 16:07:52,901][26022] Updated weights on worker 0-0, policy_version 794754 (0.00091) [2022-07-10 16:07:54,162][25689] Fps is (10 sec: 5666.2, 60 sec: 5548.1, 300 sec: 5538.9). Total num frames: 813836288. Throughput: 0: 4961.9. Samples: 813829230. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:54,163][25689] Avg episode reward: [(0, '0.552')] [2022-07-10 16:07:54,490][26022] Updated weights on worker 0-0, policy_version 794764 (0.00086) [2022-07-10 16:07:56,485][26022] Updated weights on worker 0-0, policy_version 794774 (0.00088) [2022-07-10 16:07:58,404][26022] Updated weights on worker 0-0, policy_version 794784 (0.00084) [2022-07-10 16:07:59,189][25689] Fps is (10 sec: 5387.3, 60 sec: 5513.1, 300 sec: 5532.4). Total num frames: 813861888. Throughput: 0: 5785.3. Samples: 813862794. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:07:59,189][25689] Avg episode reward: [(0, '0.496')] [2022-07-10 16:08:00,110][26022] Updated weights on worker 0-0, policy_version 794794 (0.00088) [2022-07-10 16:08:02,531][26022] Updated weights on worker 0-0, policy_version 794804 (0.00086) [2022-07-10 16:08:04,070][26022] Updated weights on worker 0-0, policy_version 794814 (0.00087) [2022-07-10 16:08:04,269][25689] Fps is (10 sec: 5268.3, 60 sec: 5506.9, 300 sec: 5533.3). Total num frames: 813889536. Throughput: 0: 5690.8. Samples: 813894458. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:04,270][25689] Avg episode reward: [(0, '0.550')] [2022-07-10 16:08:06,110][26022] Updated weights on worker 0-0, policy_version 794824 (0.00092) [2022-07-10 16:08:08,018][26022] Updated weights on worker 0-0, policy_version 794834 (0.00071) [2022-07-10 16:08:09,338][25689] Fps is (10 sec: 5448.3, 60 sec: 5523.3, 300 sec: 5535.8). Total num frames: 813917184. Throughput: 0: 4866.8. Samples: 813911042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:09,338][25689] Avg episode reward: [(0, '0.977')] [2022-07-10 16:08:09,791][26022] Updated weights on worker 0-0, policy_version 794844 (0.00103) [2022-07-10 16:08:11,567][26022] Updated weights on worker 0-0, policy_version 794854 (0.00093) [2022-07-10 16:08:13,469][26022] Updated weights on worker 0-0, policy_version 794864 (0.00083) [2022-07-10 16:08:14,347][25689] Fps is (10 sec: 5588.8, 60 sec: 5546.5, 300 sec: 5539.9). Total num frames: 813945856. Throughput: 0: 5703.2. Samples: 813944346. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:14,347][25689] Avg episode reward: [(0, '0.184')] [2022-07-10 16:08:15,344][26022] Updated weights on worker 0-0, policy_version 794874 (0.00357) [2022-07-10 16:08:17,290][26022] Updated weights on worker 0-0, policy_version 794884 (0.00095) [2022-07-10 16:08:19,153][26022] Updated weights on worker 0-0, policy_version 794894 (0.00091) [2022-07-10 16:08:19,358][25689] Fps is (10 sec: 5518.3, 60 sec: 5513.5, 300 sec: 5530.4). Total num frames: 813972480. Throughput: 0: 5691.2. Samples: 813977584. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:19,359][25689] Avg episode reward: [(0, '-0.212')] [2022-07-10 16:08:20,874][26022] Updated weights on worker 0-0, policy_version 794904 (0.00650) [2022-07-10 16:08:22,865][26022] Updated weights on worker 0-0, policy_version 794914 (0.00091) [2022-07-10 16:08:24,386][25689] Fps is (10 sec: 5508.0, 60 sec: 5531.9, 300 sec: 5534.7). Total num frames: 814001152. Throughput: 0: 4954.5. Samples: 813994124. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:24,386][25689] Avg episode reward: [(0, '-0.578')] [2022-07-10 16:08:24,575][26022] Updated weights on worker 0-0, policy_version 794924 (0.00087) [2022-07-10 16:08:26,635][26022] Updated weights on worker 0-0, policy_version 794934 (0.00091) [2022-07-10 16:08:28,354][26022] Updated weights on worker 0-0, policy_version 794944 (0.00086) [2022-07-10 16:08:29,512][25689] Fps is (10 sec: 5446.0, 60 sec: 5494.5, 300 sec: 5532.9). Total num frames: 814027776. Throughput: 0: 5747.7. Samples: 814026996. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:29,512][25689] Avg episode reward: [(0, '-0.826')] [2022-07-10 16:08:30,125][26022] Updated weights on worker 0-0, policy_version 794954 (0.00091) [2022-07-10 16:08:32,042][26022] Updated weights on worker 0-0, policy_version 794964 (0.00095) [2022-07-10 16:08:33,925][26022] Updated weights on worker 0-0, policy_version 794974 (0.00086) [2022-07-10 16:08:34,526][25689] Fps is (10 sec: 5453.4, 60 sec: 5512.7, 300 sec: 5529.6). Total num frames: 814056448. Throughput: 0: 5746.9. Samples: 814060312. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:34,526][25689] Avg episode reward: [(0, '-1.470')] [2022-07-10 16:08:35,908][26022] Updated weights on worker 0-0, policy_version 794984 (0.00089) [2022-07-10 16:08:37,599][26022] Updated weights on worker 0-0, policy_version 794994 (0.00096) [2022-07-10 16:08:39,542][25689] Fps is (10 sec: 5614.7, 60 sec: 5498.0, 300 sec: 5526.0). Total num frames: 814084096. Throughput: 0: 5736.5. Samples: 814093372. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:39,544][26022] Updated weights on worker 0-0, policy_version 795004 (0.00088) [2022-07-10 16:08:39,547][25689] Avg episode reward: [(0, '-2.245')] [2022-07-10 16:08:41,351][26022] Updated weights on worker 0-0, policy_version 795014 (0.00087) [2022-07-10 16:08:43,216][26022] Updated weights on worker 0-0, policy_version 795024 (0.00090) [2022-07-10 16:08:44,572][25689] Fps is (10 sec: 5504.1, 60 sec: 5498.1, 300 sec: 5526.7). Total num frames: 814111744. Throughput: 0: 5744.8. Samples: 814110088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:44,572][25689] Avg episode reward: [(0, '-2.189')] [2022-07-10 16:08:44,650][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:08:44,660][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000795032_814112768.pth [2022-07-10 16:08:44,660][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000793084_812118016.pth [2022-07-10 16:08:45,107][26022] Updated weights on worker 0-0, policy_version 795034 (0.00087) [2022-07-10 16:08:46,896][26022] Updated weights on worker 0-0, policy_version 795044 (0.00089) [2022-07-10 16:08:48,835][26022] Updated weights on worker 0-0, policy_version 795054 (0.00089) [2022-07-10 16:08:49,674][25689] Fps is (10 sec: 5457.9, 60 sec: 5478.5, 300 sec: 5524.9). Total num frames: 814139392. Throughput: 0: 5763.7. Samples: 814143206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:49,675][25689] Avg episode reward: [(0, '-2.543')] [2022-07-10 16:08:50,529][26022] Updated weights on worker 0-0, policy_version 795064 (0.00090) [2022-07-10 16:08:52,487][26022] Updated weights on worker 0-0, policy_version 795074 (0.00082) [2022-07-10 16:08:54,330][26022] Updated weights on worker 0-0, policy_version 795084 (0.00091) [2022-07-10 16:08:54,719][25689] Fps is (10 sec: 5449.7, 60 sec: 5461.9, 300 sec: 5521.4). Total num frames: 814167040. Throughput: 0: 5742.2. Samples: 814176264. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:54,719][25689] Avg episode reward: [(0, '-1.517')] [2022-07-10 16:08:56,058][26022] Updated weights on worker 0-0, policy_version 795094 (0.00101) [2022-07-10 16:08:58,060][26022] Updated weights on worker 0-0, policy_version 795104 (0.00089) [2022-07-10 16:08:59,730][25689] Fps is (10 sec: 5600.7, 60 sec: 5514.0, 300 sec: 5532.1). Total num frames: 814195712. Throughput: 0: 4937.7. Samples: 814193050. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:08:59,730][25689] Avg episode reward: [(0, '-1.986')] [2022-07-10 16:08:59,916][26022] Updated weights on worker 0-0, policy_version 795114 (0.00090) [2022-07-10 16:09:01,969][26022] Updated weights on worker 0-0, policy_version 795124 (0.00098) [2022-07-10 16:09:03,967][26022] Updated weights on worker 0-0, policy_version 795134 (0.00090) [2022-07-10 16:09:04,745][25689] Fps is (10 sec: 5412.6, 60 sec: 5486.1, 300 sec: 5520.3). Total num frames: 814221312. Throughput: 0: 5686.5. Samples: 814224804. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:04,746][25689] Avg episode reward: [(0, '-0.863')] [2022-07-10 16:09:05,603][26022] Updated weights on worker 0-0, policy_version 795144 (0.00094) [2022-07-10 16:09:07,772][26022] Updated weights on worker 0-0, policy_version 795154 (0.00090) [2022-07-10 16:09:09,265][26022] Updated weights on worker 0-0, policy_version 795164 (0.00091) [2022-07-10 16:09:09,788][25689] Fps is (10 sec: 5395.5, 60 sec: 5505.3, 300 sec: 5526.6). Total num frames: 814249984. Throughput: 0: 5713.0. Samples: 814258120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:09,789][25689] Avg episode reward: [(0, '-0.615')] [2022-07-10 16:09:11,460][26022] Updated weights on worker 0-0, policy_version 795174 (0.00090) [2022-07-10 16:09:12,981][26022] Updated weights on worker 0-0, policy_version 795184 (0.00092) [2022-07-10 16:09:14,794][25689] Fps is (10 sec: 5502.6, 60 sec: 5471.7, 300 sec: 5523.3). Total num frames: 814276608. Throughput: 0: 4920.5. Samples: 814275048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:14,795][25689] Avg episode reward: [(0, '-0.364')] [2022-07-10 16:09:14,983][26022] Updated weights on worker 0-0, policy_version 795194 (0.00087) [2022-07-10 16:09:16,645][26022] Updated weights on worker 0-0, policy_version 795204 (0.00080) [2022-07-10 16:09:18,709][26022] Updated weights on worker 0-0, policy_version 795214 (0.00087) [2022-07-10 16:09:19,830][25689] Fps is (10 sec: 5506.5, 60 sec: 5503.4, 300 sec: 5522.9). Total num frames: 814305280. Throughput: 0: 5753.5. Samples: 814308700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:19,831][25689] Avg episode reward: [(0, '-1.026')] [2022-07-10 16:09:20,257][26022] Updated weights on worker 0-0, policy_version 795224 (0.00087) [2022-07-10 16:09:22,253][26022] Updated weights on worker 0-0, policy_version 795234 (0.00084) [2022-07-10 16:09:23,992][26022] Updated weights on worker 0-0, policy_version 795244 (0.00088) [2022-07-10 16:09:24,833][25689] Fps is (10 sec: 5712.2, 60 sec: 5505.6, 300 sec: 5525.3). Total num frames: 814333952. Throughput: 0: 5834.3. Samples: 814342002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:24,834][25689] Avg episode reward: [(0, '-1.342')] [2022-07-10 16:09:25,986][26022] Updated weights on worker 0-0, policy_version 795254 (0.00095) [2022-07-10 16:09:27,581][26022] Updated weights on worker 0-0, policy_version 795264 (0.00087) [2022-07-10 16:09:29,812][26022] Updated weights on worker 0-0, policy_version 795274 (0.00089) [2022-07-10 16:09:29,873][25689] Fps is (10 sec: 5608.0, 60 sec: 5530.4, 300 sec: 5521.2). Total num frames: 814361600. Throughput: 0: 4993.9. Samples: 814358422. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:29,874][25689] Avg episode reward: [(0, '-3.319')] [2022-07-10 16:09:31,358][26022] Updated weights on worker 0-0, policy_version 795284 (0.00091) [2022-07-10 16:09:33,342][26022] Updated weights on worker 0-0, policy_version 795294 (0.00974) [2022-07-10 16:09:34,894][25689] Fps is (10 sec: 5496.4, 60 sec: 5512.8, 300 sec: 5528.1). Total num frames: 814389248. Throughput: 0: 5818.5. Samples: 814391996. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:34,894][25689] Avg episode reward: [(0, '-3.243')] [2022-07-10 16:09:35,052][26022] Updated weights on worker 0-0, policy_version 795304 (0.00095) [2022-07-10 16:09:37,018][26022] Updated weights on worker 0-0, policy_version 795314 (0.00086) [2022-07-10 16:09:38,805][26022] Updated weights on worker 0-0, policy_version 795324 (0.00080) [2022-07-10 16:09:39,896][25689] Fps is (10 sec: 5517.0, 60 sec: 5514.2, 300 sec: 5521.5). Total num frames: 814416896. Throughput: 0: 5820.6. Samples: 814425494. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:39,897][25689] Avg episode reward: [(0, '-3.942')] [2022-07-10 16:09:40,728][26022] Updated weights on worker 0-0, policy_version 795334 (0.00090) [2022-07-10 16:09:42,359][26022] Updated weights on worker 0-0, policy_version 795344 (0.00087) [2022-07-10 16:09:44,321][26022] Updated weights on worker 0-0, policy_version 795354 (0.00085) [2022-07-10 16:09:44,911][25689] Fps is (10 sec: 5622.5, 60 sec: 5532.5, 300 sec: 5520.1). Total num frames: 814445568. Throughput: 0: 4990.2. Samples: 814442192. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:44,911][25689] Avg episode reward: [(0, '-3.233')] [2022-07-10 16:09:46,061][26022] Updated weights on worker 0-0, policy_version 795364 (0.00090) [2022-07-10 16:09:47,954][26022] Updated weights on worker 0-0, policy_version 795374 (0.00091) [2022-07-10 16:09:49,651][26022] Updated weights on worker 0-0, policy_version 795384 (0.00095) [2022-07-10 16:09:50,003][25689] Fps is (10 sec: 5775.1, 60 sec: 5567.3, 300 sec: 5530.5). Total num frames: 814475264. Throughput: 0: 5843.5. Samples: 814476050. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:50,005][25689] Avg episode reward: [(0, '-3.329')] [2022-07-10 16:09:51,641][26022] Updated weights on worker 0-0, policy_version 795394 (0.00085) [2022-07-10 16:09:53,238][26022] Updated weights on worker 0-0, policy_version 795404 (0.00089) [2022-07-10 16:09:55,006][25689] Fps is (10 sec: 5578.9, 60 sec: 5554.2, 300 sec: 5521.4). Total num frames: 814501888. Throughput: 0: 5847.4. Samples: 814509600. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:09:55,007][25689] Avg episode reward: [(0, '-2.488')] [2022-07-10 16:09:55,448][26022] Updated weights on worker 0-0, policy_version 795414 (0.00081) [2022-07-10 16:09:57,176][26022] Updated weights on worker 0-0, policy_version 795424 (0.00095) [2022-07-10 16:09:58,902][26022] Updated weights on worker 0-0, policy_version 795434 (0.00087) [2022-07-10 16:10:00,008][25689] Fps is (10 sec: 5526.7, 60 sec: 5555.0, 300 sec: 5533.3). Total num frames: 814530560. Throughput: 0: 5019.4. Samples: 814526444. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:00,009][25689] Avg episode reward: [(0, '-0.834')] [2022-07-10 16:10:01,009][26022] Updated weights on worker 0-0, policy_version 795444 (0.00854) [2022-07-10 16:10:02,569][26022] Updated weights on worker 0-0, policy_version 795454 (0.00089) [2022-07-10 16:10:04,877][26022] Updated weights on worker 0-0, policy_version 795464 (0.00089) [2022-07-10 16:10:05,067][25689] Fps is (10 sec: 5292.7, 60 sec: 5534.1, 300 sec: 5520.3). Total num frames: 814555136. Throughput: 0: 5743.5. Samples: 814557958. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:05,067][25689] Avg episode reward: [(0, '-1.111')] [2022-07-10 16:10:06,643][26022] Updated weights on worker 0-0, policy_version 795474 (0.00085) [2022-07-10 16:10:08,523][26022] Updated weights on worker 0-0, policy_version 795484 (0.00087) [2022-07-10 16:10:10,186][25689] Fps is (10 sec: 5332.6, 60 sec: 5544.1, 300 sec: 5521.8). Total num frames: 814584832. Throughput: 0: 5728.6. Samples: 814591668. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:10,187][25689] Avg episode reward: [(0, '-0.613')] [2022-07-10 16:10:10,315][26022] Updated weights on worker 0-0, policy_version 795494 (0.00088) [2022-07-10 16:10:12,049][26022] Updated weights on worker 0-0, policy_version 795504 (0.00094) [2022-07-10 16:10:14,023][26022] Updated weights on worker 0-0, policy_version 795514 (0.00099) [2022-07-10 16:10:15,241][25689] Fps is (10 sec: 5635.8, 60 sec: 5556.5, 300 sec: 5524.3). Total num frames: 814612480. Throughput: 0: 4887.0. Samples: 814608492. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:15,244][25689] Avg episode reward: [(0, '-1.560')] [2022-07-10 16:10:15,923][26022] Updated weights on worker 0-0, policy_version 795524 (0.00091) [2022-07-10 16:10:17,564][26022] Updated weights on worker 0-0, policy_version 795534 (0.00097) [2022-07-10 16:10:19,690][26022] Updated weights on worker 0-0, policy_version 795544 (0.00088) [2022-07-10 16:10:20,261][25689] Fps is (10 sec: 5589.7, 60 sec: 5558.0, 300 sec: 5527.6). Total num frames: 814641152. Throughput: 0: 5686.1. Samples: 814641606. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:20,262][25689] Avg episode reward: [(0, '-2.389')] [2022-07-10 16:10:21,240][26022] Updated weights on worker 0-0, policy_version 795554 (0.00090) [2022-07-10 16:10:23,387][26022] Updated weights on worker 0-0, policy_version 795564 (0.00091) [2022-07-10 16:10:24,910][26022] Updated weights on worker 0-0, policy_version 795574 (0.00087) [2022-07-10 16:10:25,271][25689] Fps is (10 sec: 5717.3, 60 sec: 5557.3, 300 sec: 5529.3). Total num frames: 814669824. Throughput: 0: 5788.1. Samples: 814674906. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:25,272][25689] Avg episode reward: [(0, '-2.475')] [2022-07-10 16:10:27,047][26022] Updated weights on worker 0-0, policy_version 795584 (0.00048) [2022-07-10 16:10:28,597][26022] Updated weights on worker 0-0, policy_version 795594 (0.00097) [2022-07-10 16:10:30,357][25689] Fps is (10 sec: 5477.3, 60 sec: 5536.2, 300 sec: 5520.9). Total num frames: 814696448. Throughput: 0: 4953.8. Samples: 814691592. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 16:10:30,357][25689] Avg episode reward: [(0, '-2.279')] [2022-07-10 16:10:30,618][26022] Updated weights on worker 0-0, policy_version 795604 (0.00090) [2022-07-10 16:10:32,347][26022] Updated weights on worker 0-0, policy_version 795614 (0.00085) [2022-07-10 16:10:34,190][26022] Updated weights on worker 0-0, policy_version 795624 (0.00085) [2022-07-10 16:10:35,415][25689] Fps is (10 sec: 5552.1, 60 sec: 5566.5, 300 sec: 5530.8). Total num frames: 814726144. Throughput: 0: 5797.6. Samples: 814725452. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:10:35,416][25689] Avg episode reward: [(0, '-1.685')] [2022-07-10 16:10:35,996][26022] Updated weights on worker 0-0, policy_version 795634 (0.00091) [2022-07-10 16:10:38,005][26022] Updated weights on worker 0-0, policy_version 795644 (0.00089) [2022-07-10 16:10:39,608][26022] Updated weights on worker 0-0, policy_version 795654 (0.00088) [2022-07-10 16:10:40,464][25689] Fps is (10 sec: 5673.6, 60 sec: 5562.3, 300 sec: 5526.7). Total num frames: 814753792. Throughput: 0: 5801.9. Samples: 814758820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:10:40,464][25689] Avg episode reward: [(0, '-0.830')] [2022-07-10 16:10:41,639][26022] Updated weights on worker 0-0, policy_version 795664 (0.00091) [2022-07-10 16:10:43,208][26022] Updated weights on worker 0-0, policy_version 795674 (0.00082) [2022-07-10 16:10:44,686][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:10:44,699][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000795681_814777344.pth [2022-07-10 16:10:44,699][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000793736_812785664.pth [2022-07-10 16:10:45,219][26022] Updated weights on worker 0-0, policy_version 795684 (0.00087) [2022-07-10 16:10:45,506][25689] Fps is (10 sec: 5479.7, 60 sec: 5542.8, 300 sec: 5526.7). Total num frames: 814781440. Throughput: 0: 4985.1. Samples: 814775780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:10:45,507][25689] Avg episode reward: [(0, '0.229')] [2022-07-10 16:10:46,748][26022] Updated weights on worker 0-0, policy_version 795694 (0.00099) [2022-07-10 16:10:48,978][26022] Updated weights on worker 0-0, policy_version 795704 (0.00087) [2022-07-10 16:10:50,421][26022] Updated weights on worker 0-0, policy_version 795714 (0.00085) [2022-07-10 16:10:50,551][25689] Fps is (10 sec: 5786.5, 60 sec: 5564.1, 300 sec: 5532.9). Total num frames: 814812160. Throughput: 0: 5843.9. Samples: 814809604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:10:50,551][25689] Avg episode reward: [(0, '0.415')] [2022-07-10 16:10:52,814][26022] Updated weights on worker 0-0, policy_version 795724 (0.00621) [2022-07-10 16:10:53,947][26022] Updated weights on worker 0-0, policy_version 795734 (0.00096) [2022-07-10 16:10:55,599][25689] Fps is (10 sec: 5580.5, 60 sec: 5543.1, 300 sec: 5529.9). Total num frames: 814837760. Throughput: 0: 5824.0. Samples: 814843000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:10:55,599][25689] Avg episode reward: [(0, '0.214')] [2022-07-10 16:10:56,348][26022] Updated weights on worker 0-0, policy_version 795744 (0.00091) [2022-07-10 16:10:57,759][26022] Updated weights on worker 0-0, policy_version 795754 (0.00088) [2022-07-10 16:10:59,804][26022] Updated weights on worker 0-0, policy_version 795764 (0.00095) [2022-07-10 16:11:00,602][25689] Fps is (10 sec: 5399.7, 60 sec: 5543.0, 300 sec: 5537.6). Total num frames: 814866432. Throughput: 0: 5020.0. Samples: 814859912. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:00,602][25689] Avg episode reward: [(0, '0.210')] [2022-07-10 16:11:01,523][26022] Updated weights on worker 0-0, policy_version 795774 (0.00084) [2022-07-10 16:11:03,714][26022] Updated weights on worker 0-0, policy_version 795784 (0.00087) [2022-07-10 16:11:05,639][25689] Fps is (10 sec: 5405.6, 60 sec: 5561.9, 300 sec: 5524.1). Total num frames: 814892032. Throughput: 0: 5746.6. Samples: 814891472. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:05,639][25689] Avg episode reward: [(0, '0.388')] [2022-07-10 16:11:05,722][26022] Updated weights on worker 0-0, policy_version 795794 (0.00090) [2022-07-10 16:11:07,671][26022] Updated weights on worker 0-0, policy_version 795804 (0.00088) [2022-07-10 16:11:09,240][26022] Updated weights on worker 0-0, policy_version 795814 (0.00095) [2022-07-10 16:11:10,759][25689] Fps is (10 sec: 5342.9, 60 sec: 5544.8, 300 sec: 5532.6). Total num frames: 814920704. Throughput: 0: 5702.3. Samples: 814924840. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:10,760][25689] Avg episode reward: [(0, '0.301')] [2022-07-10 16:11:11,550][26022] Updated weights on worker 0-0, policy_version 795824 (0.00093) [2022-07-10 16:11:12,763][26022] Updated weights on worker 0-0, policy_version 795834 (0.00088) [2022-07-10 16:11:14,981][26022] Updated weights on worker 0-0, policy_version 795844 (0.00091) [2022-07-10 16:11:15,791][25689] Fps is (10 sec: 5547.6, 60 sec: 5547.1, 300 sec: 5528.6). Total num frames: 814948352. Throughput: 0: 5718.2. Samples: 814958464. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:15,791][25689] Avg episode reward: [(0, '-0.120')] [2022-07-10 16:11:16,504][26022] Updated weights on worker 0-0, policy_version 795854 (0.00084) [2022-07-10 16:11:18,742][26022] Updated weights on worker 0-0, policy_version 795864 (0.00090) [2022-07-10 16:11:20,428][26022] Updated weights on worker 0-0, policy_version 795874 (0.00091) [2022-07-10 16:11:20,819][25689] Fps is (10 sec: 5598.8, 60 sec: 5546.3, 300 sec: 5524.8). Total num frames: 814977024. Throughput: 0: 5664.6. Samples: 814974434. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:20,823][25689] Avg episode reward: [(0, '-0.850')] [2022-07-10 16:11:22,197][26022] Updated weights on worker 0-0, policy_version 795884 (0.00059) [2022-07-10 16:11:23,921][26022] Updated weights on worker 0-0, policy_version 795894 (0.00085) [2022-07-10 16:11:25,839][25689] Fps is (10 sec: 5503.2, 60 sec: 5511.6, 300 sec: 5526.6). Total num frames: 815003648. Throughput: 0: 5749.0. Samples: 815007602. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:25,839][25689] Avg episode reward: [(0, '-0.152')] [2022-07-10 16:11:26,287][26022] Updated weights on worker 0-0, policy_version 795904 (0.00087) [2022-07-10 16:11:27,888][26022] Updated weights on worker 0-0, policy_version 795914 (0.00097) [2022-07-10 16:11:29,839][26022] Updated weights on worker 0-0, policy_version 795924 (0.00082) [2022-07-10 16:11:30,949][25689] Fps is (10 sec: 5357.5, 60 sec: 5526.3, 300 sec: 5521.5). Total num frames: 815031296. Throughput: 0: 5759.1. Samples: 815041112. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:30,949][25689] Avg episode reward: [(0, '-0.984')] [2022-07-10 16:11:31,613][26022] Updated weights on worker 0-0, policy_version 795934 (0.00087) [2022-07-10 16:11:33,328][26022] Updated weights on worker 0-0, policy_version 795944 (0.00085) [2022-07-10 16:11:35,203][26022] Updated weights on worker 0-0, policy_version 795954 (0.00079) [2022-07-10 16:11:35,954][25689] Fps is (10 sec: 5871.6, 60 sec: 5565.0, 300 sec: 5532.2). Total num frames: 815063040. Throughput: 0: 4941.8. Samples: 815058106. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:35,954][25689] Avg episode reward: [(0, '-1.726')] [2022-07-10 16:11:37,211][26022] Updated weights on worker 0-0, policy_version 795964 (0.00092) [2022-07-10 16:11:38,732][26022] Updated weights on worker 0-0, policy_version 795974 (0.00087) [2022-07-10 16:11:40,972][25689] Fps is (10 sec: 5618.7, 60 sec: 5517.0, 300 sec: 5522.3). Total num frames: 815087616. Throughput: 0: 5824.4. Samples: 815091816. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:40,973][25689] Avg episode reward: [(0, '-2.098')] [2022-07-10 16:11:40,979][26022] Updated weights on worker 0-0, policy_version 795984 (0.00097) [2022-07-10 16:11:42,569][26022] Updated weights on worker 0-0, policy_version 795994 (0.00089) [2022-07-10 16:11:44,443][26022] Updated weights on worker 0-0, policy_version 796004 (0.00084) [2022-07-10 16:11:45,982][25689] Fps is (10 sec: 5309.9, 60 sec: 5536.9, 300 sec: 5526.9). Total num frames: 815116288. Throughput: 0: 5850.1. Samples: 815125442. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:45,983][25689] Avg episode reward: [(0, '-2.239')] [2022-07-10 16:11:46,212][26022] Updated weights on worker 0-0, policy_version 796014 (0.00096) [2022-07-10 16:11:48,042][26022] Updated weights on worker 0-0, policy_version 796024 (0.01010) [2022-07-10 16:11:49,975][26022] Updated weights on worker 0-0, policy_version 796034 (0.00091) [2022-07-10 16:11:51,053][25689] Fps is (10 sec: 5688.4, 60 sec: 5500.6, 300 sec: 5529.2). Total num frames: 815144960. Throughput: 0: 5026.8. Samples: 815142174. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:51,054][25689] Avg episode reward: [(0, '-1.579')] [2022-07-10 16:11:51,612][26022] Updated weights on worker 0-0, policy_version 796044 (0.00101) [2022-07-10 16:11:53,693][26022] Updated weights on worker 0-0, policy_version 796054 (0.00080) [2022-07-10 16:11:55,532][26022] Updated weights on worker 0-0, policy_version 796064 (0.00092) [2022-07-10 16:11:56,081][25689] Fps is (10 sec: 5475.4, 60 sec: 5519.4, 300 sec: 5525.5). Total num frames: 815171584. Throughput: 0: 5827.4. Samples: 815175394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:11:56,082][25689] Avg episode reward: [(0, '-2.055')] [2022-07-10 16:11:57,319][26022] Updated weights on worker 0-0, policy_version 796074 (0.00090) [2022-07-10 16:11:59,276][26022] Updated weights on worker 0-0, policy_version 796084 (0.00083) [2022-07-10 16:12:00,960][26022] Updated weights on worker 0-0, policy_version 796094 (0.00090) [2022-07-10 16:12:01,161][25689] Fps is (10 sec: 5572.3, 60 sec: 5529.3, 300 sec: 5531.1). Total num frames: 815201280. Throughput: 0: 5777.7. Samples: 815208456. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:01,161][25689] Avg episode reward: [(0, '-1.403')] [2022-07-10 16:12:03,204][26022] Updated weights on worker 0-0, policy_version 796104 (0.00090) [2022-07-10 16:12:05,174][26022] Updated weights on worker 0-0, policy_version 796114 (0.00089) [2022-07-10 16:12:06,260][25689] Fps is (10 sec: 5331.7, 60 sec: 5506.7, 300 sec: 5523.6). Total num frames: 815225856. Throughput: 0: 4810.9. Samples: 815222998. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:06,261][25689] Avg episode reward: [(0, '-0.955')] [2022-07-10 16:12:06,912][26022] Updated weights on worker 0-0, policy_version 796124 (0.00540) [2022-07-10 16:12:08,894][26022] Updated weights on worker 0-0, policy_version 796134 (0.00090) [2022-07-10 16:12:10,579][26022] Updated weights on worker 0-0, policy_version 796144 (0.00097) [2022-07-10 16:12:11,318][25689] Fps is (10 sec: 5242.3, 60 sec: 5512.4, 300 sec: 5527.4). Total num frames: 815254528. Throughput: 0: 5638.6. Samples: 815256438. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:11,319][25689] Avg episode reward: [(0, '-2.364')] [2022-07-10 16:12:12,546][26022] Updated weights on worker 0-0, policy_version 796154 (0.00060) [2022-07-10 16:12:14,336][26022] Updated weights on worker 0-0, policy_version 796164 (0.00085) [2022-07-10 16:12:16,276][26022] Updated weights on worker 0-0, policy_version 796174 (0.00092) [2022-07-10 16:12:16,351][25689] Fps is (10 sec: 5682.6, 60 sec: 5529.1, 300 sec: 5527.1). Total num frames: 815283200. Throughput: 0: 5653.5. Samples: 815289992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:16,352][25689] Avg episode reward: [(0, '-2.859')] [2022-07-10 16:12:17,870][26022] Updated weights on worker 0-0, policy_version 796184 (0.00099) [2022-07-10 16:12:19,969][26022] Updated weights on worker 0-0, policy_version 796194 (0.00088) [2022-07-10 16:12:21,360][25689] Fps is (10 sec: 5710.2, 60 sec: 5530.9, 300 sec: 5531.2). Total num frames: 815311872. Throughput: 0: 4864.4. Samples: 815306716. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:21,361][25689] Avg episode reward: [(0, '-3.108')] [2022-07-10 16:12:21,690][26022] Updated weights on worker 0-0, policy_version 796204 (0.00097) [2022-07-10 16:12:23,451][26022] Updated weights on worker 0-0, policy_version 796214 (0.00089) [2022-07-10 16:12:25,516][26022] Updated weights on worker 0-0, policy_version 796224 (0.00092) [2022-07-10 16:12:26,416][25689] Fps is (10 sec: 5392.3, 60 sec: 5510.7, 300 sec: 5521.5). Total num frames: 815337472. Throughput: 0: 5817.7. Samples: 815340260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:26,417][25689] Avg episode reward: [(0, '-4.148')] [2022-07-10 16:12:27,191][26022] Updated weights on worker 0-0, policy_version 796234 (0.00087) [2022-07-10 16:12:29,289][26022] Updated weights on worker 0-0, policy_version 796244 (0.00089) [2022-07-10 16:12:30,909][26022] Updated weights on worker 0-0, policy_version 796254 (0.00092) [2022-07-10 16:12:31,459][25689] Fps is (10 sec: 5475.8, 60 sec: 5550.7, 300 sec: 5528.1). Total num frames: 815367168. Throughput: 0: 5802.4. Samples: 815373302. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:31,459][25689] Avg episode reward: [(0, '-4.274')] [2022-07-10 16:12:32,911][26022] Updated weights on worker 0-0, policy_version 796264 (0.00092) [2022-07-10 16:12:34,507][26022] Updated weights on worker 0-0, policy_version 796274 (0.00094) [2022-07-10 16:12:36,476][25689] Fps is (10 sec: 5598.3, 60 sec: 5465.0, 300 sec: 5521.6). Total num frames: 815393792. Throughput: 0: 4968.4. Samples: 815389980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:36,477][25689] Avg episode reward: [(0, '-2.625')] [2022-07-10 16:12:36,485][26022] Updated weights on worker 0-0, policy_version 796284 (0.00091) [2022-07-10 16:12:38,561][26022] Updated weights on worker 0-0, policy_version 796294 (0.00085) [2022-07-10 16:12:40,019][26022] Updated weights on worker 0-0, policy_version 796304 (0.00091) [2022-07-10 16:12:41,515][25689] Fps is (10 sec: 5397.0, 60 sec: 5513.9, 300 sec: 5521.5). Total num frames: 815421440. Throughput: 0: 5786.3. Samples: 815423334. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:41,515][25689] Avg episode reward: [(0, '-2.965')] [2022-07-10 16:12:41,953][26022] Updated weights on worker 0-0, policy_version 796314 (0.00084) [2022-07-10 16:12:44,122][26022] Updated weights on worker 0-0, policy_version 796324 (0.00083) [2022-07-10 16:12:44,956][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:12:44,972][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000796329_815440896.pth [2022-07-10 16:12:44,972][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000794386_813451264.pth [2022-07-10 16:12:45,574][26022] Updated weights on worker 0-0, policy_version 796334 (0.00093) [2022-07-10 16:12:46,532][25689] Fps is (10 sec: 5702.8, 60 sec: 5530.1, 300 sec: 5526.0). Total num frames: 815451136. Throughput: 0: 5788.2. Samples: 815456692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:46,532][25689] Avg episode reward: [(0, '-2.239')] [2022-07-10 16:12:47,839][26022] Updated weights on worker 0-0, policy_version 796344 (0.00084) [2022-07-10 16:12:49,054][26022] Updated weights on worker 0-0, policy_version 796354 (0.00088) [2022-07-10 16:12:51,317][26022] Updated weights on worker 0-0, policy_version 796364 (0.00090) [2022-07-10 16:12:51,588][25689] Fps is (10 sec: 5591.1, 60 sec: 5497.7, 300 sec: 5518.9). Total num frames: 815477760. Throughput: 0: 4977.4. Samples: 815473492. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:51,588][25689] Avg episode reward: [(0, '-0.829')] [2022-07-10 16:12:53,113][26022] Updated weights on worker 0-0, policy_version 796374 (0.00091) [2022-07-10 16:12:54,763][26022] Updated weights on worker 0-0, policy_version 796384 (0.00089) [2022-07-10 16:12:56,597][25689] Fps is (10 sec: 5391.9, 60 sec: 5516.3, 300 sec: 5526.1). Total num frames: 815505408. Throughput: 0: 5812.0. Samples: 815506922. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:12:56,598][25689] Avg episode reward: [(0, '-1.562')] [2022-07-10 16:12:56,945][26022] Updated weights on worker 0-0, policy_version 796394 (0.00204) [2022-07-10 16:12:58,555][26022] Updated weights on worker 0-0, policy_version 796404 (0.00092) [2022-07-10 16:13:00,460][26022] Updated weights on worker 0-0, policy_version 796414 (0.00085) [2022-07-10 16:13:01,603][25689] Fps is (10 sec: 5725.9, 60 sec: 5523.0, 300 sec: 5534.4). Total num frames: 815535104. Throughput: 0: 5825.3. Samples: 815540352. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:01,603][25689] Avg episode reward: [(0, '-1.332')] [2022-07-10 16:13:02,548][26022] Updated weights on worker 0-0, policy_version 796424 (0.00095) [2022-07-10 16:13:04,372][26022] Updated weights on worker 0-0, policy_version 796434 (0.00091) [2022-07-10 16:13:06,212][26022] Updated weights on worker 0-0, policy_version 796444 (0.00091) [2022-07-10 16:13:06,621][25689] Fps is (10 sec: 5414.3, 60 sec: 5530.5, 300 sec: 5525.0). Total num frames: 815559680. Throughput: 0: 4906.0. Samples: 815555250. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:06,621][25689] Avg episode reward: [(0, '-1.436')] [2022-07-10 16:13:07,942][26022] Updated weights on worker 0-0, policy_version 796454 (0.00085) [2022-07-10 16:13:10,008][26022] Updated weights on worker 0-0, policy_version 796464 (0.00091) [2022-07-10 16:13:11,671][26022] Updated weights on worker 0-0, policy_version 796474 (0.00089) [2022-07-10 16:13:11,759][25689] Fps is (10 sec: 5343.8, 60 sec: 5540.1, 300 sec: 5526.1). Total num frames: 815589376. Throughput: 0: 5707.8. Samples: 815588624. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:11,759][25689] Avg episode reward: [(0, '-0.555')] [2022-07-10 16:13:13,659][26022] Updated weights on worker 0-0, policy_version 796484 (0.00087) [2022-07-10 16:13:15,371][26022] Updated weights on worker 0-0, policy_version 796494 (0.00084) [2022-07-10 16:13:16,770][25689] Fps is (10 sec: 5650.0, 60 sec: 5525.2, 300 sec: 5529.5). Total num frames: 815617024. Throughput: 0: 5726.1. Samples: 815622436. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:16,772][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 16:13:17,191][26022] Updated weights on worker 0-0, policy_version 796504 (0.00080) [2022-07-10 16:13:19,094][26022] Updated weights on worker 0-0, policy_version 796514 (0.00090) [2022-07-10 16:13:20,737][26022] Updated weights on worker 0-0, policy_version 796524 (0.00086) [2022-07-10 16:13:21,812][25689] Fps is (10 sec: 5602.4, 60 sec: 5522.2, 300 sec: 5529.3). Total num frames: 815645696. Throughput: 0: 4912.3. Samples: 815639624. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:21,812][25689] Avg episode reward: [(0, '-1.307')] [2022-07-10 16:13:22,787][26022] Updated weights on worker 0-0, policy_version 796534 (0.00093) [2022-07-10 16:13:24,507][26022] Updated weights on worker 0-0, policy_version 796544 (0.00107) [2022-07-10 16:13:26,511][26022] Updated weights on worker 0-0, policy_version 796554 (0.00052) [2022-07-10 16:13:26,824][25689] Fps is (10 sec: 5602.0, 60 sec: 5560.1, 300 sec: 5534.8). Total num frames: 815673344. Throughput: 0: 5838.5. Samples: 815673204. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:26,825][25689] Avg episode reward: [(0, '-1.278')] [2022-07-10 16:13:28,227][26022] Updated weights on worker 0-0, policy_version 796564 (0.00099) [2022-07-10 16:13:30,186][26022] Updated weights on worker 0-0, policy_version 796574 (0.00092) [2022-07-10 16:13:31,932][25689] Fps is (10 sec: 5463.8, 60 sec: 5520.2, 300 sec: 5529.6). Total num frames: 815700992. Throughput: 0: 5801.1. Samples: 815705652. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:31,933][25689] Avg episode reward: [(0, '-1.036')] [2022-07-10 16:13:32,100][26022] Updated weights on worker 0-0, policy_version 796584 (0.00088) [2022-07-10 16:13:33,982][26022] Updated weights on worker 0-0, policy_version 796594 (0.00091) [2022-07-10 16:13:35,663][26022] Updated weights on worker 0-0, policy_version 796604 (0.00089) [2022-07-10 16:13:36,935][25689] Fps is (10 sec: 5468.6, 60 sec: 5538.4, 300 sec: 5529.9). Total num frames: 815728640. Throughput: 0: 4959.8. Samples: 815722452. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:36,937][25689] Avg episode reward: [(0, '-1.616')] [2022-07-10 16:13:37,562][26022] Updated weights on worker 0-0, policy_version 796614 (0.00095) [2022-07-10 16:13:39,434][26022] Updated weights on worker 0-0, policy_version 796624 (0.00087) [2022-07-10 16:13:41,317][26022] Updated weights on worker 0-0, policy_version 796634 (0.00092) [2022-07-10 16:13:41,938][25689] Fps is (10 sec: 5628.6, 60 sec: 5558.6, 300 sec: 5533.8). Total num frames: 815757312. Throughput: 0: 5792.1. Samples: 815756200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:41,939][25689] Avg episode reward: [(0, '-1.344')] [2022-07-10 16:13:42,999][26022] Updated weights on worker 0-0, policy_version 796644 (0.00086) [2022-07-10 16:13:44,964][26022] Updated weights on worker 0-0, policy_version 796654 (0.00088) [2022-07-10 16:13:46,657][26022] Updated weights on worker 0-0, policy_version 796664 (0.00084) [2022-07-10 16:13:46,984][25689] Fps is (10 sec: 5604.9, 60 sec: 5522.1, 300 sec: 5534.9). Total num frames: 815784960. Throughput: 0: 5786.1. Samples: 815789852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:46,984][25689] Avg episode reward: [(0, '-1.068')] [2022-07-10 16:13:48,571][26022] Updated weights on worker 0-0, policy_version 796674 (0.00086) [2022-07-10 16:13:50,272][26022] Updated weights on worker 0-0, policy_version 796684 (0.00085) [2022-07-10 16:13:52,086][25689] Fps is (10 sec: 5448.8, 60 sec: 5534.8, 300 sec: 5533.8). Total num frames: 815812608. Throughput: 0: 5830.9. Samples: 815823170. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:52,087][25689] Avg episode reward: [(0, '-0.552')] [2022-07-10 16:13:52,259][26022] Updated weights on worker 0-0, policy_version 796694 (0.00113) [2022-07-10 16:13:54,224][26022] Updated weights on worker 0-0, policy_version 796704 (0.00059) [2022-07-10 16:13:55,939][26022] Updated weights on worker 0-0, policy_version 796714 (0.00085) [2022-07-10 16:13:57,111][25689] Fps is (10 sec: 5561.0, 60 sec: 5550.3, 300 sec: 5533.5). Total num frames: 815841280. Throughput: 0: 5818.9. Samples: 815839854. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:13:57,112][25689] Avg episode reward: [(0, '-0.357')] [2022-07-10 16:13:57,744][26022] Updated weights on worker 0-0, policy_version 796724 (0.00097) [2022-07-10 16:13:59,559][26022] Updated weights on worker 0-0, policy_version 796734 (0.00097) [2022-07-10 16:14:01,937][26022] Updated weights on worker 0-0, policy_version 796744 (0.00092) [2022-07-10 16:14:02,131][25689] Fps is (10 sec: 5301.4, 60 sec: 5464.5, 300 sec: 5530.0). Total num frames: 815865856. Throughput: 0: 5776.3. Samples: 815872838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:14:02,131][25689] Avg episode reward: [(0, '-0.631')] [2022-07-10 16:14:03,647][26022] Updated weights on worker 0-0, policy_version 796754 (0.00087) [2022-07-10 16:14:05,545][26022] Updated weights on worker 0-0, policy_version 796764 (0.00088) [2022-07-10 16:14:07,141][25689] Fps is (10 sec: 5308.9, 60 sec: 5532.8, 300 sec: 5530.6). Total num frames: 815894528. Throughput: 0: 5664.9. Samples: 815904044. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 16:14:07,142][25689] Avg episode reward: [(0, '-0.017')] [2022-07-10 16:14:07,432][26022] Updated weights on worker 0-0, policy_version 796774 (0.00099) [2022-07-10 16:14:09,490][26022] Updated weights on worker 0-0, policy_version 796784 (0.00086) [2022-07-10 16:14:10,955][26022] Updated weights on worker 0-0, policy_version 796794 (0.00083) [2022-07-10 16:14:12,279][25689] Fps is (10 sec: 5549.9, 60 sec: 5499.0, 300 sec: 5531.6). Total num frames: 815922176. Throughput: 0: 4830.9. Samples: 815920716. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:12,279][25689] Avg episode reward: [(0, '0.241')] [2022-07-10 16:14:13,065][26022] Updated weights on worker 0-0, policy_version 796804 (0.00087) [2022-07-10 16:14:14,719][26022] Updated weights on worker 0-0, policy_version 796814 (0.00080) [2022-07-10 16:14:16,690][26022] Updated weights on worker 0-0, policy_version 796824 (0.00084) [2022-07-10 16:14:17,295][25689] Fps is (10 sec: 5647.7, 60 sec: 5532.4, 300 sec: 5535.4). Total num frames: 815951872. Throughput: 0: 5671.6. Samples: 815954326. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:17,295][25689] Avg episode reward: [(0, '-1.203')] [2022-07-10 16:14:18,470][26022] Updated weights on worker 0-0, policy_version 796834 (0.00088) [2022-07-10 16:14:20,161][26022] Updated weights on worker 0-0, policy_version 796844 (0.00086) [2022-07-10 16:14:22,310][25689] Fps is (10 sec: 5512.1, 60 sec: 5484.0, 300 sec: 5524.9). Total num frames: 815977472. Throughput: 0: 5684.9. Samples: 815987558. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:22,311][25689] Avg episode reward: [(0, '-1.839')] [2022-07-10 16:14:22,401][26022] Updated weights on worker 0-0, policy_version 796854 (0.00087) [2022-07-10 16:14:23,960][26022] Updated weights on worker 0-0, policy_version 796864 (0.00089) [2022-07-10 16:14:25,922][26022] Updated weights on worker 0-0, policy_version 796874 (0.00090) [2022-07-10 16:14:27,329][25689] Fps is (10 sec: 5408.9, 60 sec: 5500.4, 300 sec: 5528.7). Total num frames: 816006144. Throughput: 0: 4965.9. Samples: 816004292. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:27,329][25689] Avg episode reward: [(0, '-1.506')] [2022-07-10 16:14:27,704][26022] Updated weights on worker 0-0, policy_version 796884 (0.00093) [2022-07-10 16:14:29,685][26022] Updated weights on worker 0-0, policy_version 796894 (0.00090) [2022-07-10 16:14:31,395][26022] Updated weights on worker 0-0, policy_version 796904 (0.00096) [2022-07-10 16:14:32,419][25689] Fps is (10 sec: 5672.7, 60 sec: 5518.9, 300 sec: 5530.9). Total num frames: 816034816. Throughput: 0: 5788.3. Samples: 816037296. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:32,420][25689] Avg episode reward: [(0, '-2.751')] [2022-07-10 16:14:33,387][26022] Updated weights on worker 0-0, policy_version 796914 (0.00251) [2022-07-10 16:14:35,186][26022] Updated weights on worker 0-0, policy_version 796924 (0.00090) [2022-07-10 16:14:36,990][26022] Updated weights on worker 0-0, policy_version 796934 (0.00088) [2022-07-10 16:14:37,481][25689] Fps is (10 sec: 5547.4, 60 sec: 5513.6, 300 sec: 5529.7). Total num frames: 816062464. Throughput: 0: 5775.4. Samples: 816070910. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:37,482][25689] Avg episode reward: [(0, '-2.838')] [2022-07-10 16:14:38,807][26022] Updated weights on worker 0-0, policy_version 796944 (0.00094) [2022-07-10 16:14:40,591][26022] Updated weights on worker 0-0, policy_version 796954 (0.00119) [2022-07-10 16:14:42,420][26022] Updated weights on worker 0-0, policy_version 796964 (0.00090) [2022-07-10 16:14:42,516][25689] Fps is (10 sec: 5578.0, 60 sec: 5510.7, 300 sec: 5529.4). Total num frames: 816091136. Throughput: 0: 4958.9. Samples: 816087754. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:42,516][25689] Avg episode reward: [(0, '-3.802')] [2022-07-10 16:14:44,479][26022] Updated weights on worker 0-0, policy_version 796974 (0.00082) [2022-07-10 16:14:45,079][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:14:45,095][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000796978_816105472.pth [2022-07-10 16:14:45,095][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000795032_814112768.pth [2022-07-10 16:14:46,042][26022] Updated weights on worker 0-0, policy_version 796984 (0.00084) [2022-07-10 16:14:47,558][25689] Fps is (10 sec: 5588.9, 60 sec: 5511.0, 300 sec: 5523.4). Total num frames: 816118784. Throughput: 0: 5780.6. Samples: 816121230. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:47,559][25689] Avg episode reward: [(0, '-2.895')] [2022-07-10 16:14:48,100][26022] Updated weights on worker 0-0, policy_version 796994 (0.00090) [2022-07-10 16:14:49,609][26022] Updated weights on worker 0-0, policy_version 797004 (0.00086) [2022-07-10 16:14:51,624][26022] Updated weights on worker 0-0, policy_version 797014 (0.00081) [2022-07-10 16:14:52,693][25689] Fps is (10 sec: 5533.7, 60 sec: 5524.9, 300 sec: 5527.8). Total num frames: 816147456. Throughput: 0: 5805.8. Samples: 816155004. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:52,694][25689] Avg episode reward: [(0, '-2.177')] [2022-07-10 16:14:53,513][26022] Updated weights on worker 0-0, policy_version 797024 (0.00089) [2022-07-10 16:14:55,268][26022] Updated weights on worker 0-0, policy_version 797034 (0.00092) [2022-07-10 16:14:57,053][26022] Updated weights on worker 0-0, policy_version 797044 (0.00090) [2022-07-10 16:14:57,762][25689] Fps is (10 sec: 5519.6, 60 sec: 5504.1, 300 sec: 5523.2). Total num frames: 816175104. Throughput: 0: 4967.0. Samples: 816171644. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:14:57,763][25689] Avg episode reward: [(0, '-3.472')] [2022-07-10 16:14:59,155][26022] Updated weights on worker 0-0, policy_version 797054 (0.00093) [2022-07-10 16:15:00,939][26022] Updated weights on worker 0-0, policy_version 797064 (0.00065) [2022-07-10 16:15:02,813][25689] Fps is (10 sec: 5363.1, 60 sec: 5534.9, 300 sec: 5530.2). Total num frames: 816201728. Throughput: 0: 5751.5. Samples: 816204492. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:02,813][25689] Avg episode reward: [(0, '-2.199')] [2022-07-10 16:15:03,279][26022] Updated weights on worker 0-0, policy_version 797074 (0.00093) [2022-07-10 16:15:05,171][26022] Updated weights on worker 0-0, policy_version 797084 (0.00091) [2022-07-10 16:15:06,797][26022] Updated weights on worker 0-0, policy_version 797094 (0.00084) [2022-07-10 16:15:07,841][25689] Fps is (10 sec: 5384.5, 60 sec: 5516.5, 300 sec: 5525.0). Total num frames: 816229376. Throughput: 0: 5648.7. Samples: 816235802. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:07,842][25689] Avg episode reward: [(0, '-1.387')] [2022-07-10 16:15:08,781][26022] Updated weights on worker 0-0, policy_version 797104 (0.00099) [2022-07-10 16:15:10,506][26022] Updated weights on worker 0-0, policy_version 797114 (0.00091) [2022-07-10 16:15:12,459][26022] Updated weights on worker 0-0, policy_version 797124 (0.00096) [2022-07-10 16:15:12,946][25689] Fps is (10 sec: 5557.9, 60 sec: 5536.3, 300 sec: 5527.5). Total num frames: 816258048. Throughput: 0: 4813.8. Samples: 816252496. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:12,948][25689] Avg episode reward: [(0, '-0.672')] [2022-07-10 16:15:14,291][26022] Updated weights on worker 0-0, policy_version 797134 (0.00093) [2022-07-10 16:15:15,851][26022] Updated weights on worker 0-0, policy_version 797144 (0.00099) [2022-07-10 16:15:18,018][25689] Fps is (10 sec: 5433.6, 60 sec: 5480.6, 300 sec: 5519.7). Total num frames: 816284672. Throughput: 0: 5649.4. Samples: 816286078. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:18,018][25689] Avg episode reward: [(0, '-0.064')] [2022-07-10 16:15:18,074][26022] Updated weights on worker 0-0, policy_version 797154 (0.00085) [2022-07-10 16:15:19,604][26022] Updated weights on worker 0-0, policy_version 797164 (0.00082) [2022-07-10 16:15:21,643][26022] Updated weights on worker 0-0, policy_version 797174 (0.00084) [2022-07-10 16:15:23,032][25689] Fps is (10 sec: 5685.7, 60 sec: 5565.1, 300 sec: 5526.5). Total num frames: 816315392. Throughput: 0: 5706.3. Samples: 816319868. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:23,032][25689] Avg episode reward: [(0, '0.309')] [2022-07-10 16:15:23,283][26022] Updated weights on worker 0-0, policy_version 797184 (0.00087) [2022-07-10 16:15:25,251][26022] Updated weights on worker 0-0, policy_version 797194 (0.00082) [2022-07-10 16:15:26,950][26022] Updated weights on worker 0-0, policy_version 797204 (0.00096) [2022-07-10 16:15:28,039][25689] Fps is (10 sec: 5824.6, 60 sec: 5549.2, 300 sec: 5531.4). Total num frames: 816343040. Throughput: 0: 5005.3. Samples: 816336898. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:28,040][25689] Avg episode reward: [(0, '0.657')] [2022-07-10 16:15:28,828][26022] Updated weights on worker 0-0, policy_version 797214 (0.00083) [2022-07-10 16:15:30,436][26022] Updated weights on worker 0-0, policy_version 797224 (0.00092) [2022-07-10 16:15:32,632][26022] Updated weights on worker 0-0, policy_version 797234 (0.00088) [2022-07-10 16:15:33,126][25689] Fps is (10 sec: 5478.1, 60 sec: 5532.7, 300 sec: 5524.0). Total num frames: 816370688. Throughput: 0: 5823.3. Samples: 816370008. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:33,126][25689] Avg episode reward: [(0, '0.418')] [2022-07-10 16:15:34,248][26022] Updated weights on worker 0-0, policy_version 797244 (0.00092) [2022-07-10 16:15:36,330][26022] Updated weights on worker 0-0, policy_version 797254 (0.00084) [2022-07-10 16:15:38,098][26022] Updated weights on worker 0-0, policy_version 797264 (0.00090) [2022-07-10 16:15:38,219][25689] Fps is (10 sec: 5431.9, 60 sec: 5529.9, 300 sec: 5523.1). Total num frames: 816398336. Throughput: 0: 5822.1. Samples: 816403690. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:38,219][25689] Avg episode reward: [(0, '0.643')] [2022-07-10 16:15:39,877][26022] Updated weights on worker 0-0, policy_version 797274 (0.00730) [2022-07-10 16:15:41,639][26022] Updated weights on worker 0-0, policy_version 797284 (0.00098) [2022-07-10 16:15:43,230][25689] Fps is (10 sec: 5574.0, 60 sec: 5532.0, 300 sec: 5527.2). Total num frames: 816427008. Throughput: 0: 4973.7. Samples: 816420328. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:43,230][25689] Avg episode reward: [(0, '-1.669')] [2022-07-10 16:15:43,512][26022] Updated weights on worker 0-0, policy_version 797294 (0.00085) [2022-07-10 16:15:45,326][26022] Updated weights on worker 0-0, policy_version 797304 (0.00098) [2022-07-10 16:15:47,081][26022] Updated weights on worker 0-0, policy_version 797314 (0.00092) [2022-07-10 16:15:48,304][25689] Fps is (10 sec: 5685.9, 60 sec: 5546.0, 300 sec: 5519.7). Total num frames: 816455680. Throughput: 0: 5792.6. Samples: 816454286. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:48,305][25689] Avg episode reward: [(0, '-1.642')] [2022-07-10 16:15:48,988][26022] Updated weights on worker 0-0, policy_version 797324 (0.00085) [2022-07-10 16:15:50,747][26022] Updated weights on worker 0-0, policy_version 797334 (0.00088) [2022-07-10 16:15:52,658][26022] Updated weights on worker 0-0, policy_version 797344 (0.00087) [2022-07-10 16:15:53,386][25689] Fps is (10 sec: 5646.5, 60 sec: 5550.9, 300 sec: 5529.4). Total num frames: 816484352. Throughput: 0: 5826.4. Samples: 816488050. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:53,386][25689] Avg episode reward: [(0, '-2.245')] [2022-07-10 16:15:54,455][26022] Updated weights on worker 0-0, policy_version 797354 (0.00090) [2022-07-10 16:15:56,298][26022] Updated weights on worker 0-0, policy_version 797364 (0.00089) [2022-07-10 16:15:58,000][26022] Updated weights on worker 0-0, policy_version 797374 (0.00085) [2022-07-10 16:15:58,400][25689] Fps is (10 sec: 5679.8, 60 sec: 5572.7, 300 sec: 5529.2). Total num frames: 816513024. Throughput: 0: 5856.0. Samples: 816521872. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:15:58,401][25689] Avg episode reward: [(0, '-2.581')] [2022-07-10 16:15:59,870][26022] Updated weights on worker 0-0, policy_version 797384 (0.00086) [2022-07-10 16:16:01,788][26022] Updated weights on worker 0-0, policy_version 797394 (0.00091) [2022-07-10 16:16:03,474][25689] Fps is (10 sec: 5379.5, 60 sec: 5553.7, 300 sec: 5528.5). Total num frames: 816538624. Throughput: 0: 5849.0. Samples: 816538738. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:03,475][25689] Avg episode reward: [(0, '-2.179')] [2022-07-10 16:16:03,982][26022] Updated weights on worker 0-0, policy_version 797404 (0.00087) [2022-07-10 16:16:05,481][26022] Updated weights on worker 0-0, policy_version 797414 (0.00087) [2022-07-10 16:16:07,615][26022] Updated weights on worker 0-0, policy_version 797424 (0.00091) [2022-07-10 16:16:08,501][25689] Fps is (10 sec: 5373.2, 60 sec: 5570.8, 300 sec: 5530.3). Total num frames: 816567296. Throughput: 0: 5761.1. Samples: 816570642. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:08,501][25689] Avg episode reward: [(0, '-1.290')] [2022-07-10 16:16:09,106][26022] Updated weights on worker 0-0, policy_version 797434 (0.00086) [2022-07-10 16:16:11,260][26022] Updated weights on worker 0-0, policy_version 797444 (0.00087) [2022-07-10 16:16:12,928][26022] Updated weights on worker 0-0, policy_version 797454 (0.00086) [2022-07-10 16:16:13,561][25689] Fps is (10 sec: 5583.5, 60 sec: 5558.0, 300 sec: 5529.7). Total num frames: 816594944. Throughput: 0: 5746.5. Samples: 816603988. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:13,562][25689] Avg episode reward: [(0, '-1.502')] [2022-07-10 16:16:14,692][26022] Updated weights on worker 0-0, policy_version 797464 (0.00090) [2022-07-10 16:16:16,931][26022] Updated weights on worker 0-0, policy_version 797474 (0.00081) [2022-07-10 16:16:18,479][26022] Updated weights on worker 0-0, policy_version 797484 (0.00099) [2022-07-10 16:16:18,571][25689] Fps is (10 sec: 5694.3, 60 sec: 5614.4, 300 sec: 5533.5). Total num frames: 816624640. Throughput: 0: 4894.8. Samples: 816620604. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:18,572][25689] Avg episode reward: [(0, '-1.422')] [2022-07-10 16:16:20,364][26022] Updated weights on worker 0-0, policy_version 797494 (0.00090) [2022-07-10 16:16:22,246][26022] Updated weights on worker 0-0, policy_version 797504 (0.00088) [2022-07-10 16:16:23,633][25689] Fps is (10 sec: 5693.2, 60 sec: 5559.2, 300 sec: 5536.2). Total num frames: 816652288. Throughput: 0: 5716.8. Samples: 816653984. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:23,635][25689] Avg episode reward: [(0, '-1.803')] [2022-07-10 16:16:23,838][26022] Updated weights on worker 0-0, policy_version 797514 (0.00081) [2022-07-10 16:16:26,027][26022] Updated weights on worker 0-0, policy_version 797524 (0.00094) [2022-07-10 16:16:27,573][26022] Updated weights on worker 0-0, policy_version 797534 (0.00083) [2022-07-10 16:16:28,654][25689] Fps is (10 sec: 5382.5, 60 sec: 5541.0, 300 sec: 5534.4). Total num frames: 816678912. Throughput: 0: 5805.2. Samples: 816687638. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:28,656][25689] Avg episode reward: [(0, '-1.215')] [2022-07-10 16:16:29,748][26022] Updated weights on worker 0-0, policy_version 797544 (0.00092) [2022-07-10 16:16:31,504][26022] Updated weights on worker 0-0, policy_version 797554 (0.00091) [2022-07-10 16:16:33,391][26022] Updated weights on worker 0-0, policy_version 797564 (0.00099) [2022-07-10 16:16:33,693][25689] Fps is (10 sec: 5496.8, 60 sec: 5562.3, 300 sec: 5523.4). Total num frames: 816707584. Throughput: 0: 4953.0. Samples: 816703702. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:33,695][25689] Avg episode reward: [(0, '-2.221')] [2022-07-10 16:16:35,179][26022] Updated weights on worker 0-0, policy_version 797574 (0.00085) [2022-07-10 16:16:37,153][26022] Updated weights on worker 0-0, policy_version 797584 (0.00088) [2022-07-10 16:16:38,705][25689] Fps is (10 sec: 5603.7, 60 sec: 5569.8, 300 sec: 5533.9). Total num frames: 816735232. Throughput: 0: 5792.2. Samples: 816737222. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:38,706][25689] Avg episode reward: [(0, '-1.434')] [2022-07-10 16:16:38,818][26022] Updated weights on worker 0-0, policy_version 797594 (0.00081) [2022-07-10 16:16:40,807][26022] Updated weights on worker 0-0, policy_version 797604 (0.00094) [2022-07-10 16:16:42,384][26022] Updated weights on worker 0-0, policy_version 797614 (0.00087) [2022-07-10 16:16:43,721][25689] Fps is (10 sec: 5514.0, 60 sec: 5552.4, 300 sec: 5530.3). Total num frames: 816762880. Throughput: 0: 5812.1. Samples: 816770738. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:43,722][25689] Avg episode reward: [(0, '-1.040')] [2022-07-10 16:16:44,571][26022] Updated weights on worker 0-0, policy_version 797624 (0.00091) [2022-07-10 16:16:45,171][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:16:45,198][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000797629_816772096.pth [2022-07-10 16:16:45,199][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000795681_814777344.pth [2022-07-10 16:16:46,079][26022] Updated weights on worker 0-0, policy_version 797634 (0.00087) [2022-07-10 16:16:48,108][26022] Updated weights on worker 0-0, policy_version 797644 (0.00090) [2022-07-10 16:16:48,731][25689] Fps is (10 sec: 5719.5, 60 sec: 5575.3, 300 sec: 5534.9). Total num frames: 816792576. Throughput: 0: 4983.9. Samples: 816787696. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:48,731][25689] Avg episode reward: [(0, '-0.957')] [2022-07-10 16:16:49,797][26022] Updated weights on worker 0-0, policy_version 797654 (0.01017) [2022-07-10 16:16:51,599][26022] Updated weights on worker 0-0, policy_version 797664 (0.00101) [2022-07-10 16:16:53,684][26022] Updated weights on worker 0-0, policy_version 797674 (0.00085) [2022-07-10 16:16:53,794][25689] Fps is (10 sec: 5591.5, 60 sec: 5543.1, 300 sec: 5534.2). Total num frames: 816819200. Throughput: 0: 5849.9. Samples: 816821290. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:53,794][25689] Avg episode reward: [(0, '-0.615')] [2022-07-10 16:16:55,355][26022] Updated weights on worker 0-0, policy_version 797684 (0.00085) [2022-07-10 16:16:57,219][26022] Updated weights on worker 0-0, policy_version 797694 (0.00096) [2022-07-10 16:16:58,800][25689] Fps is (10 sec: 5491.5, 60 sec: 5543.9, 300 sec: 5532.2). Total num frames: 816847872. Throughput: 0: 5851.8. Samples: 816854816. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:16:58,801][25689] Avg episode reward: [(0, '-0.782')] [2022-07-10 16:16:58,975][26022] Updated weights on worker 0-0, policy_version 797704 (0.00086) [2022-07-10 16:17:01,024][26022] Updated weights on worker 0-0, policy_version 797714 (0.00092) [2022-07-10 16:17:03,203][26022] Updated weights on worker 0-0, policy_version 797724 (0.00118) [2022-07-10 16:17:03,807][25689] Fps is (10 sec: 5317.7, 60 sec: 5533.0, 300 sec: 5533.9). Total num frames: 816872448. Throughput: 0: 5007.3. Samples: 816871314. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:03,808][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 16:17:04,965][26022] Updated weights on worker 0-0, policy_version 797734 (0.00088) [2022-07-10 16:17:06,991][26022] Updated weights on worker 0-0, policy_version 797744 (0.00093) [2022-07-10 16:17:08,574][26022] Updated weights on worker 0-0, policy_version 797754 (0.00083) [2022-07-10 16:17:08,813][25689] Fps is (10 sec: 5215.7, 60 sec: 5518.0, 300 sec: 5531.4). Total num frames: 816900096. Throughput: 0: 5715.6. Samples: 816902478. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:08,813][25689] Avg episode reward: [(0, '-0.181')] [2022-07-10 16:17:10,714][26022] Updated weights on worker 0-0, policy_version 797764 (0.00084) [2022-07-10 16:17:12,341][26022] Updated weights on worker 0-0, policy_version 797774 (0.00086) [2022-07-10 16:17:13,900][25689] Fps is (10 sec: 5478.9, 60 sec: 5515.5, 300 sec: 5527.0). Total num frames: 816927744. Throughput: 0: 5694.7. Samples: 816935788. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:13,900][25689] Avg episode reward: [(0, '-1.812')] [2022-07-10 16:17:14,312][26022] Updated weights on worker 0-0, policy_version 797784 (0.00090) [2022-07-10 16:17:16,139][26022] Updated weights on worker 0-0, policy_version 797794 (0.00087) [2022-07-10 16:17:17,863][26022] Updated weights on worker 0-0, policy_version 797804 (0.00087) [2022-07-10 16:17:18,936][25689] Fps is (10 sec: 5563.3, 60 sec: 5496.1, 300 sec: 5526.5). Total num frames: 816956416. Throughput: 0: 4846.3. Samples: 816952402. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:18,937][25689] Avg episode reward: [(0, '-1.806')] [2022-07-10 16:17:19,812][26022] Updated weights on worker 0-0, policy_version 797814 (0.00089) [2022-07-10 16:17:21,536][26022] Updated weights on worker 0-0, policy_version 797824 (0.00090) [2022-07-10 16:17:23,623][26022] Updated weights on worker 0-0, policy_version 797834 (0.00084) [2022-07-10 16:17:23,958][25689] Fps is (10 sec: 5599.4, 60 sec: 5499.9, 300 sec: 5534.0). Total num frames: 816984064. Throughput: 0: 5694.3. Samples: 816986058. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:23,960][25689] Avg episode reward: [(0, '-1.233')] [2022-07-10 16:17:25,373][26022] Updated weights on worker 0-0, policy_version 797844 (0.00051) [2022-07-10 16:17:27,102][26022] Updated weights on worker 0-0, policy_version 797854 (0.00091) [2022-07-10 16:17:28,996][25689] Fps is (10 sec: 5497.0, 60 sec: 5515.3, 300 sec: 5527.2). Total num frames: 817011712. Throughput: 0: 5798.2. Samples: 817019502. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:28,998][25689] Avg episode reward: [(0, '-1.758')] [2022-07-10 16:17:29,089][26022] Updated weights on worker 0-0, policy_version 797864 (0.00095) [2022-07-10 16:17:30,670][26022] Updated weights on worker 0-0, policy_version 797874 (0.00087) [2022-07-10 16:17:32,875][26022] Updated weights on worker 0-0, policy_version 797884 (0.00091) [2022-07-10 16:17:34,085][25689] Fps is (10 sec: 5561.2, 60 sec: 5510.6, 300 sec: 5532.7). Total num frames: 817040384. Throughput: 0: 4963.5. Samples: 817035980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:34,086][25689] Avg episode reward: [(0, '-1.733')] [2022-07-10 16:17:34,508][26022] Updated weights on worker 0-0, policy_version 797894 (0.00086) [2022-07-10 16:17:36,472][26022] Updated weights on worker 0-0, policy_version 797904 (0.00091) [2022-07-10 16:17:38,107][26022] Updated weights on worker 0-0, policy_version 797914 (0.00087) [2022-07-10 16:17:39,093][25689] Fps is (10 sec: 5679.3, 60 sec: 5528.0, 300 sec: 5536.8). Total num frames: 817069056. Throughput: 0: 5806.5. Samples: 817069440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 16:17:39,093][25689] Avg episode reward: [(0, '-1.815')] [2022-07-10 16:17:39,945][26022] Updated weights on worker 0-0, policy_version 797924 (0.00085) [2022-07-10 16:17:41,876][26022] Updated weights on worker 0-0, policy_version 797934 (0.00095) [2022-07-10 16:17:43,706][26022] Updated weights on worker 0-0, policy_version 797944 (0.00090) [2022-07-10 16:17:44,151][25689] Fps is (10 sec: 5595.5, 60 sec: 5524.2, 300 sec: 5529.1). Total num frames: 817096704. Throughput: 0: 5786.5. Samples: 817102902. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:17:44,152][25689] Avg episode reward: [(0, '-0.750')] [2022-07-10 16:17:45,586][26022] Updated weights on worker 0-0, policy_version 797954 (0.00079) [2022-07-10 16:17:47,222][26022] Updated weights on worker 0-0, policy_version 797964 (0.00394) [2022-07-10 16:17:49,067][26022] Updated weights on worker 0-0, policy_version 797974 (0.00081) [2022-07-10 16:17:49,156][25689] Fps is (10 sec: 5596.5, 60 sec: 5507.6, 300 sec: 5536.9). Total num frames: 817125376. Throughput: 0: 4978.2. Samples: 817119864. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:17:49,156][25689] Avg episode reward: [(0, '-1.709')] [2022-07-10 16:17:51,077][26022] Updated weights on worker 0-0, policy_version 797984 (0.00092) [2022-07-10 16:17:52,742][26022] Updated weights on worker 0-0, policy_version 797994 (0.00087) [2022-07-10 16:17:54,200][25689] Fps is (10 sec: 5706.3, 60 sec: 5543.3, 300 sec: 5539.7). Total num frames: 817154048. Throughput: 0: 5859.2. Samples: 817153836. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:17:54,200][25689] Avg episode reward: [(0, '-1.832')] [2022-07-10 16:17:54,510][26022] Updated weights on worker 0-0, policy_version 798004 (0.01172) [2022-07-10 16:17:56,377][26022] Updated weights on worker 0-0, policy_version 798014 (0.00086) [2022-07-10 16:17:58,207][26022] Updated weights on worker 0-0, policy_version 798024 (0.00082) [2022-07-10 16:17:59,240][25689] Fps is (10 sec: 5584.9, 60 sec: 5523.2, 300 sec: 5532.2). Total num frames: 817181696. Throughput: 0: 5868.9. Samples: 817187684. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:17:59,241][25689] Avg episode reward: [(0, '-1.529')] [2022-07-10 16:17:59,951][26022] Updated weights on worker 0-0, policy_version 798034 (0.00095) [2022-07-10 16:18:02,165][26022] Updated weights on worker 0-0, policy_version 798044 (0.00100) [2022-07-10 16:18:04,107][26022] Updated weights on worker 0-0, policy_version 798054 (0.00085) [2022-07-10 16:18:04,247][25689] Fps is (10 sec: 5299.9, 60 sec: 5540.2, 300 sec: 5535.9). Total num frames: 817207296. Throughput: 0: 5029.9. Samples: 817203984. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:04,247][25689] Avg episode reward: [(0, '-0.746')] [2022-07-10 16:18:05,867][26022] Updated weights on worker 0-0, policy_version 798064 (0.00094) [2022-07-10 16:18:07,853][26022] Updated weights on worker 0-0, policy_version 798074 (0.00087) [2022-07-10 16:18:09,319][25689] Fps is (10 sec: 5385.0, 60 sec: 5551.1, 300 sec: 5533.6). Total num frames: 817235968. Throughput: 0: 5747.3. Samples: 817235746. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:09,319][25689] Avg episode reward: [(0, '-0.548')] [2022-07-10 16:18:09,737][26022] Updated weights on worker 0-0, policy_version 798084 (0.00080) [2022-07-10 16:18:11,559][26022] Updated weights on worker 0-0, policy_version 798094 (0.00082) [2022-07-10 16:18:13,239][26022] Updated weights on worker 0-0, policy_version 798104 (0.00535) [2022-07-10 16:18:14,394][25689] Fps is (10 sec: 5550.2, 60 sec: 5552.1, 300 sec: 5532.5). Total num frames: 817263616. Throughput: 0: 5722.9. Samples: 817269406. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:14,395][25689] Avg episode reward: [(0, '-0.979')] [2022-07-10 16:18:15,184][26022] Updated weights on worker 0-0, policy_version 798114 (0.00088) [2022-07-10 16:18:17,145][26022] Updated weights on worker 0-0, policy_version 798124 (0.00095) [2022-07-10 16:18:18,780][26022] Updated weights on worker 0-0, policy_version 798134 (0.00102) [2022-07-10 16:18:19,411][25689] Fps is (10 sec: 5682.0, 60 sec: 5570.9, 300 sec: 5536.4). Total num frames: 817293312. Throughput: 0: 4881.8. Samples: 817286152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:19,411][25689] Avg episode reward: [(0, '-0.642')] [2022-07-10 16:18:20,589][26022] Updated weights on worker 0-0, policy_version 798144 (0.00074) [2022-07-10 16:18:22,419][26022] Updated weights on worker 0-0, policy_version 798154 (0.00089) [2022-07-10 16:18:24,368][26022] Updated weights on worker 0-0, policy_version 798164 (0.00096) [2022-07-10 16:18:24,416][25689] Fps is (10 sec: 5620.1, 60 sec: 5555.5, 300 sec: 5533.1). Total num frames: 817319936. Throughput: 0: 5730.8. Samples: 817319566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:24,416][25689] Avg episode reward: [(0, '-0.861')] [2022-07-10 16:18:26,068][26022] Updated weights on worker 0-0, policy_version 798174 (0.00094) [2022-07-10 16:18:28,078][26022] Updated weights on worker 0-0, policy_version 798184 (0.00087) [2022-07-10 16:18:29,455][25689] Fps is (10 sec: 5505.4, 60 sec: 5572.3, 300 sec: 5537.8). Total num frames: 817348608. Throughput: 0: 5830.6. Samples: 817353152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:29,457][25689] Avg episode reward: [(0, '-1.546')] [2022-07-10 16:18:29,816][26022] Updated weights on worker 0-0, policy_version 798194 (0.00091) [2022-07-10 16:18:31,560][26022] Updated weights on worker 0-0, policy_version 798204 (0.00996) [2022-07-10 16:18:33,567][26022] Updated weights on worker 0-0, policy_version 798214 (0.00086) [2022-07-10 16:18:34,530][25689] Fps is (10 sec: 5669.7, 60 sec: 5573.7, 300 sec: 5539.9). Total num frames: 817377280. Throughput: 0: 4978.8. Samples: 817369656. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:34,530][25689] Avg episode reward: [(0, '-1.978')] [2022-07-10 16:18:35,388][26022] Updated weights on worker 0-0, policy_version 798224 (0.00086) [2022-07-10 16:18:37,075][26022] Updated weights on worker 0-0, policy_version 798234 (0.00084) [2022-07-10 16:18:39,226][26022] Updated weights on worker 0-0, policy_version 798244 (0.00085) [2022-07-10 16:18:39,571][25689] Fps is (10 sec: 5364.9, 60 sec: 5519.7, 300 sec: 5528.9). Total num frames: 817402880. Throughput: 0: 5808.8. Samples: 817403258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:39,573][25689] Avg episode reward: [(0, '-2.735')] [2022-07-10 16:18:40,702][26022] Updated weights on worker 0-0, policy_version 798254 (0.00092) [2022-07-10 16:18:42,957][26022] Updated weights on worker 0-0, policy_version 798264 (0.00093) [2022-07-10 16:18:44,207][26022] Updated weights on worker 0-0, policy_version 798274 (0.00083) [2022-07-10 16:18:44,666][25689] Fps is (10 sec: 5556.4, 60 sec: 5567.1, 300 sec: 5538.3). Total num frames: 817433600. Throughput: 0: 5793.4. Samples: 817436884. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:44,667][25689] Avg episode reward: [(0, '-2.423')] [2022-07-10 16:18:45,327][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:18:45,336][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000798278_817436672.pth [2022-07-10 16:18:45,337][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000796329_815440896.pth [2022-07-10 16:18:46,417][26022] Updated weights on worker 0-0, policy_version 798284 (0.00091) [2022-07-10 16:18:48,260][26022] Updated weights on worker 0-0, policy_version 798294 (0.00083) [2022-07-10 16:18:49,703][25689] Fps is (10 sec: 5659.9, 60 sec: 5530.4, 300 sec: 5536.0). Total num frames: 817460224. Throughput: 0: 5804.2. Samples: 817470674. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:49,704][25689] Avg episode reward: [(0, '-2.690')] [2022-07-10 16:18:50,045][26022] Updated weights on worker 0-0, policy_version 798304 (0.00085) [2022-07-10 16:18:51,871][26022] Updated weights on worker 0-0, policy_version 798314 (0.00091) [2022-07-10 16:18:53,597][26022] Updated weights on worker 0-0, policy_version 798324 (0.00081) [2022-07-10 16:18:54,767][25689] Fps is (10 sec: 5575.8, 60 sec: 5545.5, 300 sec: 5538.8). Total num frames: 817489920. Throughput: 0: 5799.7. Samples: 817487024. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:54,768][25689] Avg episode reward: [(0, '-2.478')] [2022-07-10 16:18:55,645][26022] Updated weights on worker 0-0, policy_version 798334 (0.00087) [2022-07-10 16:18:57,429][26022] Updated weights on worker 0-0, policy_version 798344 (0.00098) [2022-07-10 16:18:59,169][26022] Updated weights on worker 0-0, policy_version 798354 (0.00106) [2022-07-10 16:18:59,792][25689] Fps is (10 sec: 5683.7, 60 sec: 5546.9, 300 sec: 5549.0). Total num frames: 817517568. Throughput: 0: 5809.0. Samples: 817520722. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:18:59,794][25689] Avg episode reward: [(0, '-3.283')] [2022-07-10 16:19:00,972][26022] Updated weights on worker 0-0, policy_version 798364 (0.00102) [2022-07-10 16:19:03,182][26022] Updated weights on worker 0-0, policy_version 798374 (0.00090) [2022-07-10 16:19:04,827][25689] Fps is (10 sec: 5293.0, 60 sec: 5544.3, 300 sec: 5538.2). Total num frames: 817543168. Throughput: 0: 5718.2. Samples: 817552168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:04,827][25689] Avg episode reward: [(0, '-3.215')] [2022-07-10 16:19:05,245][26022] Updated weights on worker 0-0, policy_version 798384 (0.00089) [2022-07-10 16:19:06,760][26022] Updated weights on worker 0-0, policy_version 798394 (0.00085) [2022-07-10 16:19:08,741][26022] Updated weights on worker 0-0, policy_version 798404 (0.00081) [2022-07-10 16:19:09,857][25689] Fps is (10 sec: 5392.3, 60 sec: 5548.1, 300 sec: 5543.6). Total num frames: 817571840. Throughput: 0: 4884.4. Samples: 817569114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:09,857][25689] Avg episode reward: [(0, '-2.015')] [2022-07-10 16:19:10,560][26022] Updated weights on worker 0-0, policy_version 798414 (0.00094) [2022-07-10 16:19:12,298][26022] Updated weights on worker 0-0, policy_version 798424 (0.00090) [2022-07-10 16:19:14,262][26022] Updated weights on worker 0-0, policy_version 798434 (0.00088) [2022-07-10 16:19:14,954][25689] Fps is (10 sec: 5662.2, 60 sec: 5563.0, 300 sec: 5538.7). Total num frames: 817600512. Throughput: 0: 5750.7. Samples: 817603116. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:14,955][25689] Avg episode reward: [(0, '-2.733')] [2022-07-10 16:19:15,924][26022] Updated weights on worker 0-0, policy_version 798444 (0.00085) [2022-07-10 16:19:17,777][26022] Updated weights on worker 0-0, policy_version 798454 (0.00093) [2022-07-10 16:19:19,392][26022] Updated weights on worker 0-0, policy_version 798464 (0.00091) [2022-07-10 16:19:20,037][25689] Fps is (10 sec: 5632.9, 60 sec: 5540.1, 300 sec: 5547.7). Total num frames: 817629184. Throughput: 0: 5743.8. Samples: 817637006. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:20,037][25689] Avg episode reward: [(0, '-3.639')] [2022-07-10 16:19:21,566][26022] Updated weights on worker 0-0, policy_version 798474 (0.00083) [2022-07-10 16:19:23,244][26022] Updated weights on worker 0-0, policy_version 798484 (0.00110) [2022-07-10 16:19:25,125][25689] Fps is (10 sec: 5537.4, 60 sec: 5549.3, 300 sec: 5543.0). Total num frames: 817656832. Throughput: 0: 5006.6. Samples: 817653794. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:25,126][25689] Avg episode reward: [(0, '-3.114')] [2022-07-10 16:19:25,144][26022] Updated weights on worker 0-0, policy_version 798494 (0.00082) [2022-07-10 16:19:26,759][26022] Updated weights on worker 0-0, policy_version 798504 (0.00086) [2022-07-10 16:19:28,522][26022] Updated weights on worker 0-0, policy_version 798514 (0.00083) [2022-07-10 16:19:30,169][25689] Fps is (10 sec: 5659.4, 60 sec: 5565.7, 300 sec: 5547.3). Total num frames: 817686528. Throughput: 0: 5847.3. Samples: 817687888. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:30,170][25689] Avg episode reward: [(0, '-1.250')] [2022-07-10 16:19:30,603][26022] Updated weights on worker 0-0, policy_version 798524 (0.00100) [2022-07-10 16:19:32,208][26022] Updated weights on worker 0-0, policy_version 798534 (0.00088) [2022-07-10 16:19:34,255][26022] Updated weights on worker 0-0, policy_version 798544 (0.00087) [2022-07-10 16:19:35,230][25689] Fps is (10 sec: 5674.8, 60 sec: 5550.1, 300 sec: 5547.3). Total num frames: 817714176. Throughput: 0: 5832.7. Samples: 817721380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:35,231][25689] Avg episode reward: [(0, '-1.395')] [2022-07-10 16:19:35,998][26022] Updated weights on worker 0-0, policy_version 798554 (0.00085) [2022-07-10 16:19:37,765][26022] Updated weights on worker 0-0, policy_version 798564 (0.00087) [2022-07-10 16:19:39,668][26022] Updated weights on worker 0-0, policy_version 798574 (0.00091) [2022-07-10 16:19:40,277][25689] Fps is (10 sec: 5572.2, 60 sec: 5600.3, 300 sec: 5547.1). Total num frames: 817742848. Throughput: 0: 5012.0. Samples: 817738448. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:40,277][25689] Avg episode reward: [(0, '-1.019')] [2022-07-10 16:19:41,179][26022] Updated weights on worker 0-0, policy_version 798584 (0.00081) [2022-07-10 16:19:43,319][26022] Updated weights on worker 0-0, policy_version 798594 (0.00088) [2022-07-10 16:19:45,115][26022] Updated weights on worker 0-0, policy_version 798604 (0.00092) [2022-07-10 16:19:45,319][25689] Fps is (10 sec: 5683.9, 60 sec: 5571.4, 300 sec: 5550.6). Total num frames: 817771520. Throughput: 0: 5860.7. Samples: 817772144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:45,319][25689] Avg episode reward: [(0, '-0.514')] [2022-07-10 16:19:46,958][26022] Updated weights on worker 0-0, policy_version 798614 (0.00085) [2022-07-10 16:19:48,928][26022] Updated weights on worker 0-0, policy_version 798624 (0.00086) [2022-07-10 16:19:50,348][25689] Fps is (10 sec: 5694.1, 60 sec: 5605.9, 300 sec: 5552.5). Total num frames: 817800192. Throughput: 0: 5836.5. Samples: 817805658. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:50,348][25689] Avg episode reward: [(0, '0.098')] [2022-07-10 16:19:50,651][26022] Updated weights on worker 0-0, policy_version 798634 (0.00092) [2022-07-10 16:19:52,563][26022] Updated weights on worker 0-0, policy_version 798644 (0.00109) [2022-07-10 16:19:54,202][26022] Updated weights on worker 0-0, policy_version 798654 (0.00081) [2022-07-10 16:19:55,453][25689] Fps is (10 sec: 5557.7, 60 sec: 5568.3, 300 sec: 5551.9). Total num frames: 817827840. Throughput: 0: 4994.1. Samples: 817822374. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:19:55,453][25689] Avg episode reward: [(0, '-1.072')] [2022-07-10 16:19:56,182][26022] Updated weights on worker 0-0, policy_version 798664 (0.00087) [2022-07-10 16:19:57,948][26022] Updated weights on worker 0-0, policy_version 798674 (0.00088) [2022-07-10 16:19:59,729][26022] Updated weights on worker 0-0, policy_version 798684 (0.00089) [2022-07-10 16:20:00,524][25689] Fps is (10 sec: 5534.6, 60 sec: 5581.0, 300 sec: 5558.4). Total num frames: 817856512. Throughput: 0: 5803.9. Samples: 817855958. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:00,524][25689] Avg episode reward: [(0, '-1.699')] [2022-07-10 16:20:01,795][26022] Updated weights on worker 0-0, policy_version 798694 (0.00097) [2022-07-10 16:20:03,872][26022] Updated weights on worker 0-0, policy_version 798704 (0.00088) [2022-07-10 16:20:05,527][25689] Fps is (10 sec: 5285.6, 60 sec: 5567.0, 300 sec: 5548.5). Total num frames: 817881088. Throughput: 0: 5680.7. Samples: 817886940. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:05,527][25689] Avg episode reward: [(0, '-1.635')] [2022-07-10 16:20:05,774][26022] Updated weights on worker 0-0, policy_version 798714 (0.00086) [2022-07-10 16:20:07,635][26022] Updated weights on worker 0-0, policy_version 798724 (0.00086) [2022-07-10 16:20:09,365][26022] Updated weights on worker 0-0, policy_version 798734 (0.00090) [2022-07-10 16:20:10,586][25689] Fps is (10 sec: 5291.7, 60 sec: 5564.4, 300 sec: 5549.4). Total num frames: 817909760. Throughput: 0: 4852.9. Samples: 817903874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:10,587][25689] Avg episode reward: [(0, '-2.282')] [2022-07-10 16:20:11,166][26022] Updated weights on worker 0-0, policy_version 798744 (0.00090) [2022-07-10 16:20:13,070][26022] Updated weights on worker 0-0, policy_version 798754 (0.00087) [2022-07-10 16:20:14,967][26022] Updated weights on worker 0-0, policy_version 798764 (0.00086) [2022-07-10 16:20:15,667][25689] Fps is (10 sec: 5655.2, 60 sec: 5565.9, 300 sec: 5556.1). Total num frames: 817938432. Throughput: 0: 5695.6. Samples: 817937506. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:15,669][25689] Avg episode reward: [(0, '-3.392')] [2022-07-10 16:20:16,615][26022] Updated weights on worker 0-0, policy_version 798774 (0.00093) [2022-07-10 16:20:18,576][26022] Updated weights on worker 0-0, policy_version 798784 (0.00090) [2022-07-10 16:20:20,351][26022] Updated weights on worker 0-0, policy_version 798794 (0.00083) [2022-07-10 16:20:20,671][25689] Fps is (10 sec: 5584.8, 60 sec: 5556.2, 300 sec: 5545.9). Total num frames: 817966080. Throughput: 0: 5734.1. Samples: 817971484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:20,671][25689] Avg episode reward: [(0, '-3.545')] [2022-07-10 16:20:22,108][26022] Updated weights on worker 0-0, policy_version 798804 (0.00091) [2022-07-10 16:20:23,909][26022] Updated weights on worker 0-0, policy_version 798814 (0.00088) [2022-07-10 16:20:25,683][25689] Fps is (10 sec: 5520.9, 60 sec: 5563.2, 300 sec: 5545.8). Total num frames: 817993728. Throughput: 0: 5026.2. Samples: 817988248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:25,683][25689] Avg episode reward: [(0, '-2.481')] [2022-07-10 16:20:25,929][26022] Updated weights on worker 0-0, policy_version 798824 (0.00094) [2022-07-10 16:20:27,638][26022] Updated weights on worker 0-0, policy_version 798834 (0.00087) [2022-07-10 16:20:29,510][26022] Updated weights on worker 0-0, policy_version 798844 (0.00094) [2022-07-10 16:20:30,691][25689] Fps is (10 sec: 5518.6, 60 sec: 5532.7, 300 sec: 5547.3). Total num frames: 818021376. Throughput: 0: 5857.3. Samples: 818021632. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:30,691][25689] Avg episode reward: [(0, '-1.864')] [2022-07-10 16:20:31,337][26022] Updated weights on worker 0-0, policy_version 798854 (0.00088) [2022-07-10 16:20:33,340][26022] Updated weights on worker 0-0, policy_version 798864 (0.00094) [2022-07-10 16:20:35,111][26022] Updated weights on worker 0-0, policy_version 798874 (0.00089) [2022-07-10 16:20:35,803][25689] Fps is (10 sec: 5565.1, 60 sec: 5544.9, 300 sec: 5550.4). Total num frames: 818050048. Throughput: 0: 5815.0. Samples: 818054596. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:35,803][25689] Avg episode reward: [(0, '-1.587')] [2022-07-10 16:20:37,075][26022] Updated weights on worker 0-0, policy_version 798884 (0.00087) [2022-07-10 16:20:38,792][26022] Updated weights on worker 0-0, policy_version 798894 (0.00096) [2022-07-10 16:20:40,674][26022] Updated weights on worker 0-0, policy_version 798904 (0.00095) [2022-07-10 16:20:40,821][25689] Fps is (10 sec: 5559.5, 60 sec: 5530.6, 300 sec: 5546.8). Total num frames: 818077696. Throughput: 0: 5770.1. Samples: 818087754. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:40,822][25689] Avg episode reward: [(0, '-0.925')] [2022-07-10 16:20:42,624][26022] Updated weights on worker 0-0, policy_version 798914 (0.00073) [2022-07-10 16:20:44,359][26022] Updated weights on worker 0-0, policy_version 798924 (0.00088) [2022-07-10 16:20:45,466][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:20:45,484][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000798930_818104320.pth [2022-07-10 16:20:45,485][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000796978_816105472.pth [2022-07-10 16:20:45,905][25689] Fps is (10 sec: 5575.2, 60 sec: 5526.8, 300 sec: 5546.7). Total num frames: 818106368. Throughput: 0: 5751.0. Samples: 818104544. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:45,905][25689] Avg episode reward: [(0, '-0.711')] [2022-07-10 16:20:46,328][26022] Updated weights on worker 0-0, policy_version 798934 (0.00092) [2022-07-10 16:20:47,858][26022] Updated weights on worker 0-0, policy_version 798944 (0.00091) [2022-07-10 16:20:49,928][26022] Updated weights on worker 0-0, policy_version 798954 (0.00853) [2022-07-10 16:20:50,952][25689] Fps is (10 sec: 5660.1, 60 sec: 5525.1, 300 sec: 5547.3). Total num frames: 818135040. Throughput: 0: 5761.9. Samples: 818138378. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:50,953][25689] Avg episode reward: [(0, '-1.337')] [2022-07-10 16:20:51,692][26022] Updated weights on worker 0-0, policy_version 798964 (0.00085) [2022-07-10 16:20:53,485][26022] Updated weights on worker 0-0, policy_version 798974 (0.00079) [2022-07-10 16:20:55,461][26022] Updated weights on worker 0-0, policy_version 798984 (0.00462) [2022-07-10 16:20:56,046][25689] Fps is (10 sec: 5654.6, 60 sec: 5543.1, 300 sec: 5545.8). Total num frames: 818163712. Throughput: 0: 5801.2. Samples: 818172030. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:20:56,046][25689] Avg episode reward: [(0, '-2.253')] [2022-07-10 16:20:57,096][26022] Updated weights on worker 0-0, policy_version 798994 (0.00090) [2022-07-10 16:20:58,980][26022] Updated weights on worker 0-0, policy_version 799004 (0.00095) [2022-07-10 16:21:00,867][26022] Updated weights on worker 0-0, policy_version 799014 (0.00097) [2022-07-10 16:21:01,091][25689] Fps is (10 sec: 5555.1, 60 sec: 5528.5, 300 sec: 5553.2). Total num frames: 818191360. Throughput: 0: 4988.0. Samples: 818188860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:21:01,091][25689] Avg episode reward: [(0, '-1.854')] [2022-07-10 16:21:02,721][26022] Updated weights on worker 0-0, policy_version 799024 (0.00087) [2022-07-10 16:21:04,973][26022] Updated weights on worker 0-0, policy_version 799034 (0.00086) [2022-07-10 16:21:06,103][25689] Fps is (10 sec: 5396.4, 60 sec: 5561.5, 300 sec: 5546.6). Total num frames: 818217984. Throughput: 0: 5740.4. Samples: 818220490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:21:06,103][25689] Avg episode reward: [(0, '-1.905')] [2022-07-10 16:21:06,564][26022] Updated weights on worker 0-0, policy_version 799044 (0.00087) [2022-07-10 16:21:08,557][26022] Updated weights on worker 0-0, policy_version 799054 (0.00087) [2022-07-10 16:21:10,446][26022] Updated weights on worker 0-0, policy_version 799064 (0.00082) [2022-07-10 16:21:11,197][25689] Fps is (10 sec: 5370.3, 60 sec: 5541.5, 300 sec: 5546.0). Total num frames: 818245632. Throughput: 0: 5697.9. Samples: 818253728. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 16:21:11,207][25689] Avg episode reward: [(0, '-1.466')] [2022-07-10 16:21:12,109][26022] Updated weights on worker 0-0, policy_version 799074 (0.00081) [2022-07-10 16:21:14,079][26022] Updated weights on worker 0-0, policy_version 799084 (0.00084) [2022-07-10 16:21:15,739][26022] Updated weights on worker 0-0, policy_version 799094 (0.00093) [2022-07-10 16:21:16,335][25689] Fps is (10 sec: 5504.3, 60 sec: 5536.2, 300 sec: 5540.2). Total num frames: 818274304. Throughput: 0: 4855.3. Samples: 818270540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:16,341][25689] Avg episode reward: [(0, '-0.894')] [2022-07-10 16:21:17,756][26022] Updated weights on worker 0-0, policy_version 799104 (0.00083) [2022-07-10 16:21:19,382][26022] Updated weights on worker 0-0, policy_version 799114 (0.00083) [2022-07-10 16:21:21,377][26022] Updated weights on worker 0-0, policy_version 799124 (0.00084) [2022-07-10 16:21:21,379][25689] Fps is (10 sec: 5531.0, 60 sec: 5532.5, 300 sec: 5540.5). Total num frames: 818301952. Throughput: 0: 5676.9. Samples: 818304036. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:21,380][25689] Avg episode reward: [(0, '-1.163')] [2022-07-10 16:21:22,965][26022] Updated weights on worker 0-0, policy_version 799134 (0.00091) [2022-07-10 16:21:25,167][26022] Updated weights on worker 0-0, policy_version 799144 (0.00085) [2022-07-10 16:21:26,397][25689] Fps is (10 sec: 5698.8, 60 sec: 5565.7, 300 sec: 5550.9). Total num frames: 818331648. Throughput: 0: 5765.3. Samples: 818337494. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:26,398][25689] Avg episode reward: [(0, '-1.151')] [2022-07-10 16:21:26,763][26022] Updated weights on worker 0-0, policy_version 799154 (0.00088) [2022-07-10 16:21:28,827][26022] Updated weights on worker 0-0, policy_version 799164 (0.00052) [2022-07-10 16:21:30,679][26022] Updated weights on worker 0-0, policy_version 799174 (0.00097) [2022-07-10 16:21:31,426][25689] Fps is (10 sec: 5605.8, 60 sec: 5547.0, 300 sec: 5544.2). Total num frames: 818358272. Throughput: 0: 4962.3. Samples: 818354112. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:31,426][25689] Avg episode reward: [(0, '-1.573')] [2022-07-10 16:21:32,621][26022] Updated weights on worker 0-0, policy_version 799184 (0.00087) [2022-07-10 16:21:34,227][26022] Updated weights on worker 0-0, policy_version 799194 (0.00090) [2022-07-10 16:21:36,239][26022] Updated weights on worker 0-0, policy_version 799204 (0.00091) [2022-07-10 16:21:36,480][25689] Fps is (10 sec: 5484.0, 60 sec: 5552.2, 300 sec: 5546.8). Total num frames: 818386944. Throughput: 0: 5809.1. Samples: 818387568. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:36,481][25689] Avg episode reward: [(0, '-1.570')] [2022-07-10 16:21:37,822][26022] Updated weights on worker 0-0, policy_version 799214 (0.00087) [2022-07-10 16:21:39,842][26022] Updated weights on worker 0-0, policy_version 799224 (0.00093) [2022-07-10 16:21:41,529][25689] Fps is (10 sec: 5675.7, 60 sec: 5566.3, 300 sec: 5549.7). Total num frames: 818415616. Throughput: 0: 5806.9. Samples: 818421046. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:41,530][25689] Avg episode reward: [(0, '-3.072')] [2022-07-10 16:21:41,532][26022] Updated weights on worker 0-0, policy_version 799234 (0.00091) [2022-07-10 16:21:43,552][26022] Updated weights on worker 0-0, policy_version 799244 (0.00097) [2022-07-10 16:21:45,448][26022] Updated weights on worker 0-0, policy_version 799254 (0.00086) [2022-07-10 16:21:46,599][25689] Fps is (10 sec: 5363.7, 60 sec: 5517.0, 300 sec: 5534.8). Total num frames: 818441216. Throughput: 0: 4952.4. Samples: 818437540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:46,599][25689] Avg episode reward: [(0, '-3.486')] [2022-07-10 16:21:47,230][26022] Updated weights on worker 0-0, policy_version 799264 (0.00085) [2022-07-10 16:21:49,357][26022] Updated weights on worker 0-0, policy_version 799274 (0.00089) [2022-07-10 16:21:50,820][26022] Updated weights on worker 0-0, policy_version 799284 (0.00078) [2022-07-10 16:21:51,619][25689] Fps is (10 sec: 5480.7, 60 sec: 5536.4, 300 sec: 5545.9). Total num frames: 818470912. Throughput: 0: 5788.9. Samples: 818471006. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:51,619][25689] Avg episode reward: [(0, '-2.022')] [2022-07-10 16:21:52,951][26022] Updated weights on worker 0-0, policy_version 799294 (0.00086) [2022-07-10 16:21:54,458][26022] Updated weights on worker 0-0, policy_version 799304 (0.00050) [2022-07-10 16:21:56,428][26022] Updated weights on worker 0-0, policy_version 799314 (0.00079) [2022-07-10 16:21:56,684][25689] Fps is (10 sec: 5685.9, 60 sec: 5522.1, 300 sec: 5541.4). Total num frames: 818498560. Throughput: 0: 5800.3. Samples: 818504754. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:21:56,685][25689] Avg episode reward: [(0, '-1.800')] [2022-07-10 16:21:58,245][26022] Updated weights on worker 0-0, policy_version 799324 (0.00092) [2022-07-10 16:22:00,036][26022] Updated weights on worker 0-0, policy_version 799334 (0.00102) [2022-07-10 16:22:01,686][25689] Fps is (10 sec: 5492.8, 60 sec: 5526.0, 300 sec: 5551.8). Total num frames: 818526208. Throughput: 0: 4980.3. Samples: 818521430. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:01,686][25689] Avg episode reward: [(0, '-1.912')] [2022-07-10 16:22:02,366][26022] Updated weights on worker 0-0, policy_version 799344 (0.00082) [2022-07-10 16:22:04,127][26022] Updated weights on worker 0-0, policy_version 799354 (0.00094) [2022-07-10 16:22:05,980][26022] Updated weights on worker 0-0, policy_version 799364 (0.00093) [2022-07-10 16:22:06,695][25689] Fps is (10 sec: 5523.7, 60 sec: 5543.2, 300 sec: 5551.7). Total num frames: 818553856. Throughput: 0: 5739.2. Samples: 818552874. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:06,695][25689] Avg episode reward: [(0, '-0.899')] [2022-07-10 16:22:07,761][26022] Updated weights on worker 0-0, policy_version 799374 (0.00082) [2022-07-10 16:22:09,574][26022] Updated weights on worker 0-0, policy_version 799384 (0.00085) [2022-07-10 16:22:11,465][26022] Updated weights on worker 0-0, policy_version 799394 (0.00088) [2022-07-10 16:22:11,703][25689] Fps is (10 sec: 5315.7, 60 sec: 5517.2, 300 sec: 5546.3). Total num frames: 818579456. Throughput: 0: 5730.6. Samples: 818586100. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:11,703][25689] Avg episode reward: [(0, '-1.919')] [2022-07-10 16:22:13,343][26022] Updated weights on worker 0-0, policy_version 799404 (0.00082) [2022-07-10 16:22:15,194][26022] Updated weights on worker 0-0, policy_version 799414 (0.00088) [2022-07-10 16:22:16,778][25689] Fps is (10 sec: 5382.1, 60 sec: 5522.9, 300 sec: 5545.6). Total num frames: 818608128. Throughput: 0: 4879.4. Samples: 818602802. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:16,779][25689] Avg episode reward: [(0, '-3.521')] [2022-07-10 16:22:17,055][26022] Updated weights on worker 0-0, policy_version 799424 (0.00093) [2022-07-10 16:22:18,772][26022] Updated weights on worker 0-0, policy_version 799434 (0.00087) [2022-07-10 16:22:20,683][26022] Updated weights on worker 0-0, policy_version 799444 (0.00090) [2022-07-10 16:22:21,829][25689] Fps is (10 sec: 5663.0, 60 sec: 5539.3, 300 sec: 5548.5). Total num frames: 818636800. Throughput: 0: 5702.4. Samples: 818636294. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:21,831][25689] Avg episode reward: [(0, '-2.897')] [2022-07-10 16:22:22,456][26022] Updated weights on worker 0-0, policy_version 799454 (0.00091) [2022-07-10 16:22:24,394][26022] Updated weights on worker 0-0, policy_version 799464 (0.00092) [2022-07-10 16:22:26,293][26022] Updated weights on worker 0-0, policy_version 799474 (0.00092) [2022-07-10 16:22:26,852][25689] Fps is (10 sec: 5692.8, 60 sec: 5521.9, 300 sec: 5552.2). Total num frames: 818665472. Throughput: 0: 5797.3. Samples: 818669730. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:26,852][25689] Avg episode reward: [(0, '-3.093')] [2022-07-10 16:22:28,128][26022] Updated weights on worker 0-0, policy_version 799484 (0.00358) [2022-07-10 16:22:29,739][26022] Updated weights on worker 0-0, policy_version 799494 (0.00404) [2022-07-10 16:22:31,728][26022] Updated weights on worker 0-0, policy_version 799504 (0.00096) [2022-07-10 16:22:31,929][25689] Fps is (10 sec: 5576.3, 60 sec: 5534.4, 300 sec: 5549.0). Total num frames: 818693120. Throughput: 0: 4972.1. Samples: 818686666. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:31,929][25689] Avg episode reward: [(0, '-3.100')] [2022-07-10 16:22:33,558][26022] Updated weights on worker 0-0, policy_version 799514 (0.00089) [2022-07-10 16:22:35,319][26022] Updated weights on worker 0-0, policy_version 799524 (0.00083) [2022-07-10 16:22:37,070][25689] Fps is (10 sec: 5511.8, 60 sec: 5526.5, 300 sec: 5546.5). Total num frames: 818721792. Throughput: 0: 5785.7. Samples: 818720200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:37,071][25689] Avg episode reward: [(0, '-2.656')] [2022-07-10 16:22:37,234][26022] Updated weights on worker 0-0, policy_version 799534 (0.00095) [2022-07-10 16:22:38,968][26022] Updated weights on worker 0-0, policy_version 799544 (0.00103) [2022-07-10 16:22:41,003][26022] Updated weights on worker 0-0, policy_version 799554 (0.00087) [2022-07-10 16:22:42,106][25689] Fps is (10 sec: 5533.8, 60 sec: 5510.8, 300 sec: 5546.9). Total num frames: 818749440. Throughput: 0: 5783.6. Samples: 818753570. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:42,107][25689] Avg episode reward: [(0, '-0.817')] [2022-07-10 16:22:42,690][26022] Updated weights on worker 0-0, policy_version 799564 (0.00086) [2022-07-10 16:22:44,503][26022] Updated weights on worker 0-0, policy_version 799574 (0.00087) [2022-07-10 16:22:45,509][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:22:45,526][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000799579_818768896.pth [2022-07-10 16:22:45,527][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000797629_816772096.pth [2022-07-10 16:22:46,291][26022] Updated weights on worker 0-0, policy_version 799584 (0.00101) [2022-07-10 16:22:47,187][25689] Fps is (10 sec: 5566.4, 60 sec: 5560.4, 300 sec: 5545.5). Total num frames: 818778112. Throughput: 0: 5796.9. Samples: 818787614. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:47,188][25689] Avg episode reward: [(0, '-0.821')] [2022-07-10 16:22:48,142][26022] Updated weights on worker 0-0, policy_version 799594 (0.00089) [2022-07-10 16:22:49,945][26022] Updated weights on worker 0-0, policy_version 799604 (0.00081) [2022-07-10 16:22:51,855][26022] Updated weights on worker 0-0, policy_version 799614 (0.00085) [2022-07-10 16:22:52,199][25689] Fps is (10 sec: 5681.5, 60 sec: 5544.2, 300 sec: 5546.1). Total num frames: 818806784. Throughput: 0: 5807.0. Samples: 818804376. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:52,200][25689] Avg episode reward: [(0, '-0.984')] [2022-07-10 16:22:53,725][26022] Updated weights on worker 0-0, policy_version 799624 (0.00076) [2022-07-10 16:22:55,483][26022] Updated weights on worker 0-0, policy_version 799634 (0.00093) [2022-07-10 16:22:57,287][25689] Fps is (10 sec: 5576.5, 60 sec: 5542.2, 300 sec: 5545.2). Total num frames: 818834432. Throughput: 0: 5817.4. Samples: 818837812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:22:57,288][25689] Avg episode reward: [(0, '-2.477')] [2022-07-10 16:22:57,291][26022] Updated weights on worker 0-0, policy_version 799644 (0.00080) [2022-07-10 16:22:59,292][26022] Updated weights on worker 0-0, policy_version 799654 (0.00085) [2022-07-10 16:23:01,048][26022] Updated weights on worker 0-0, policy_version 799664 (0.00096) [2022-07-10 16:23:02,312][25689] Fps is (10 sec: 5366.8, 60 sec: 5523.2, 300 sec: 5548.3). Total num frames: 818861056. Throughput: 0: 5719.1. Samples: 818869128. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:02,312][25689] Avg episode reward: [(0, '-2.319')] [2022-07-10 16:23:03,340][26022] Updated weights on worker 0-0, policy_version 799674 (0.00090) [2022-07-10 16:23:05,038][26022] Updated weights on worker 0-0, policy_version 799684 (0.00088) [2022-07-10 16:23:06,807][26022] Updated weights on worker 0-0, policy_version 799694 (0.00093) [2022-07-10 16:23:07,321][25689] Fps is (10 sec: 5306.7, 60 sec: 5506.2, 300 sec: 5542.6). Total num frames: 818887680. Throughput: 0: 4873.5. Samples: 818885736. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:07,322][25689] Avg episode reward: [(0, '-2.838')] [2022-07-10 16:23:08,579][26022] Updated weights on worker 0-0, policy_version 799704 (0.00087) [2022-07-10 16:23:10,691][26022] Updated weights on worker 0-0, policy_version 799714 (0.00096) [2022-07-10 16:23:12,343][25689] Fps is (10 sec: 5410.4, 60 sec: 5538.8, 300 sec: 5543.6). Total num frames: 818915328. Throughput: 0: 5703.2. Samples: 818919260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:12,343][25689] Avg episode reward: [(0, '-2.831')] [2022-07-10 16:23:12,515][26022] Updated weights on worker 0-0, policy_version 799724 (0.00089) [2022-07-10 16:23:14,215][26022] Updated weights on worker 0-0, policy_version 799734 (0.00091) [2022-07-10 16:23:16,117][26022] Updated weights on worker 0-0, policy_version 799744 (0.00087) [2022-07-10 16:23:17,437][25689] Fps is (10 sec: 5668.5, 60 sec: 5553.9, 300 sec: 5542.2). Total num frames: 818945024. Throughput: 0: 5720.0. Samples: 818953072. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:17,438][25689] Avg episode reward: [(0, '-1.909')] [2022-07-10 16:23:17,780][26022] Updated weights on worker 0-0, policy_version 799754 (0.00118) [2022-07-10 16:23:19,608][26022] Updated weights on worker 0-0, policy_version 799764 (0.00092) [2022-07-10 16:23:21,766][26022] Updated weights on worker 0-0, policy_version 799774 (0.00050) [2022-07-10 16:23:22,463][25689] Fps is (10 sec: 5666.1, 60 sec: 5539.3, 300 sec: 5545.2). Total num frames: 818972672. Throughput: 0: 5001.4. Samples: 818969914. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:22,464][25689] Avg episode reward: [(0, '-1.262')] [2022-07-10 16:23:23,381][26022] Updated weights on worker 0-0, policy_version 799784 (0.00089) [2022-07-10 16:23:25,325][26022] Updated weights on worker 0-0, policy_version 799794 (0.00089) [2022-07-10 16:23:27,094][26022] Updated weights on worker 0-0, policy_version 799804 (0.00092) [2022-07-10 16:23:27,464][25689] Fps is (10 sec: 5514.8, 60 sec: 5524.4, 300 sec: 5542.5). Total num frames: 819000320. Throughput: 0: 5840.7. Samples: 819003386. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:27,465][25689] Avg episode reward: [(0, '0.067')] [2022-07-10 16:23:28,950][26022] Updated weights on worker 0-0, policy_version 799814 (0.00086) [2022-07-10 16:23:30,891][26022] Updated weights on worker 0-0, policy_version 799824 (0.00088) [2022-07-10 16:23:32,491][25689] Fps is (10 sec: 5616.6, 60 sec: 5545.9, 300 sec: 5543.4). Total num frames: 819028992. Throughput: 0: 5825.2. Samples: 819036626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:32,491][25689] Avg episode reward: [(0, '0.736')] [2022-07-10 16:23:32,639][26022] Updated weights on worker 0-0, policy_version 799834 (0.00071) [2022-07-10 16:23:34,565][26022] Updated weights on worker 0-0, policy_version 799844 (0.00091) [2022-07-10 16:23:36,386][26022] Updated weights on worker 0-0, policy_version 799854 (0.00093) [2022-07-10 16:23:37,551][25689] Fps is (10 sec: 5583.3, 60 sec: 5536.3, 300 sec: 5549.9). Total num frames: 819056640. Throughput: 0: 4982.4. Samples: 819053288. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:37,552][25689] Avg episode reward: [(0, '0.877')] [2022-07-10 16:23:38,147][26022] Updated weights on worker 0-0, policy_version 799864 (0.00086) [2022-07-10 16:23:40,038][26022] Updated weights on worker 0-0, policy_version 799874 (0.00089) [2022-07-10 16:23:41,912][26022] Updated weights on worker 0-0, policy_version 799884 (0.00079) [2022-07-10 16:23:42,591][25689] Fps is (10 sec: 5474.8, 60 sec: 5536.1, 300 sec: 5540.6). Total num frames: 819084288. Throughput: 0: 5791.7. Samples: 819086486. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:42,592][25689] Avg episode reward: [(0, '1.232')] [2022-07-10 16:23:43,751][26022] Updated weights on worker 0-0, policy_version 799894 (0.00086) [2022-07-10 16:23:45,557][26022] Updated weights on worker 0-0, policy_version 799904 (0.00087) [2022-07-10 16:23:47,342][26022] Updated weights on worker 0-0, policy_version 799914 (0.00091) [2022-07-10 16:23:47,630][25689] Fps is (10 sec: 5588.2, 60 sec: 5540.0, 300 sec: 5547.5). Total num frames: 819112960. Throughput: 0: 5801.1. Samples: 819120368. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:47,631][25689] Avg episode reward: [(0, '0.910')] [2022-07-10 16:23:49,500][26022] Updated weights on worker 0-0, policy_version 799924 (0.00079) [2022-07-10 16:23:50,937][26022] Updated weights on worker 0-0, policy_version 799934 (0.00071) [2022-07-10 16:23:52,632][25689] Fps is (10 sec: 5506.9, 60 sec: 5506.9, 300 sec: 5538.3). Total num frames: 819139584. Throughput: 0: 4987.3. Samples: 819137076. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:52,632][25689] Avg episode reward: [(0, '-0.416')] [2022-07-10 16:23:53,040][26022] Updated weights on worker 0-0, policy_version 799944 (0.00099) [2022-07-10 16:23:54,738][26022] Updated weights on worker 0-0, policy_version 799954 (0.00092) [2022-07-10 16:23:56,673][26022] Updated weights on worker 0-0, policy_version 799964 (0.00085) [2022-07-10 16:23:57,671][25689] Fps is (10 sec: 5609.1, 60 sec: 5545.3, 300 sec: 5544.9). Total num frames: 819169280. Throughput: 0: 5797.1. Samples: 819169922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:23:57,672][25689] Avg episode reward: [(0, '-0.389')] [2022-07-10 16:23:58,638][26022] Updated weights on worker 0-0, policy_version 799974 (0.00091) [2022-07-10 16:24:00,266][26022] Updated weights on worker 0-0, policy_version 799984 (0.00113) [2022-07-10 16:24:02,519][26022] Updated weights on worker 0-0, policy_version 799994 (0.00083) [2022-07-10 16:24:02,686][25689] Fps is (10 sec: 5398.0, 60 sec: 5512.3, 300 sec: 5541.8). Total num frames: 819193856. Throughput: 0: 5719.4. Samples: 819201418. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:02,688][25689] Avg episode reward: [(0, '-0.839')] [2022-07-10 16:24:04,382][26022] Updated weights on worker 0-0, policy_version 800004 (0.00091) [2022-07-10 16:24:06,160][26022] Updated weights on worker 0-0, policy_version 800014 (0.00089) [2022-07-10 16:24:07,693][25689] Fps is (10 sec: 5312.9, 60 sec: 5546.4, 300 sec: 5542.3). Total num frames: 819222528. Throughput: 0: 4869.8. Samples: 819218072. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:07,694][25689] Avg episode reward: [(0, '-2.508')] [2022-07-10 16:24:08,034][26022] Updated weights on worker 0-0, policy_version 800024 (0.00086) [2022-07-10 16:24:10,141][26022] Updated weights on worker 0-0, policy_version 800034 (0.00083) [2022-07-10 16:24:11,762][26022] Updated weights on worker 0-0, policy_version 800044 (0.00091) [2022-07-10 16:24:12,712][25689] Fps is (10 sec: 5515.3, 60 sec: 5529.7, 300 sec: 5536.9). Total num frames: 819249152. Throughput: 0: 5699.2. Samples: 819251516. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:12,714][25689] Avg episode reward: [(0, '-2.497')] [2022-07-10 16:24:13,702][26022] Updated weights on worker 0-0, policy_version 800054 (0.00100) [2022-07-10 16:24:15,220][26022] Updated weights on worker 0-0, policy_version 800064 (0.00096) [2022-07-10 16:24:17,367][26022] Updated weights on worker 0-0, policy_version 800074 (0.00089) [2022-07-10 16:24:17,791][25689] Fps is (10 sec: 5374.7, 60 sec: 5497.2, 300 sec: 5533.5). Total num frames: 819276800. Throughput: 0: 5710.3. Samples: 819284814. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:17,791][25689] Avg episode reward: [(0, '-1.961')] [2022-07-10 16:24:19,259][26022] Updated weights on worker 0-0, policy_version 800084 (0.00091) [2022-07-10 16:24:21,039][26022] Updated weights on worker 0-0, policy_version 800094 (0.00500) [2022-07-10 16:24:22,870][25689] Fps is (10 sec: 5544.3, 60 sec: 5509.3, 300 sec: 5537.1). Total num frames: 819305472. Throughput: 0: 4959.7. Samples: 819301526. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:22,871][25689] Avg episode reward: [(0, '-2.321')] [2022-07-10 16:24:22,917][26022] Updated weights on worker 0-0, policy_version 800104 (0.00090) [2022-07-10 16:24:24,709][26022] Updated weights on worker 0-0, policy_version 800114 (0.00085) [2022-07-10 16:24:26,607][26022] Updated weights on worker 0-0, policy_version 800124 (0.00080) [2022-07-10 16:24:27,908][25689] Fps is (10 sec: 5667.8, 60 sec: 5522.9, 300 sec: 5533.8). Total num frames: 819334144. Throughput: 0: 5756.3. Samples: 819334436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:27,909][25689] Avg episode reward: [(0, '-1.823')] [2022-07-10 16:24:28,456][26022] Updated weights on worker 0-0, policy_version 800134 (0.00093) [2022-07-10 16:24:30,178][26022] Updated weights on worker 0-0, policy_version 800144 (0.00091) [2022-07-10 16:24:32,174][26022] Updated weights on worker 0-0, policy_version 800154 (0.00087) [2022-07-10 16:24:32,926][25689] Fps is (10 sec: 5600.5, 60 sec: 5506.7, 300 sec: 5534.6). Total num frames: 819361792. Throughput: 0: 5759.3. Samples: 819367938. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:32,927][25689] Avg episode reward: [(0, '-1.150')] [2022-07-10 16:24:34,010][26022] Updated weights on worker 0-0, policy_version 800164 (0.00084) [2022-07-10 16:24:35,823][26022] Updated weights on worker 0-0, policy_version 800174 (0.00081) [2022-07-10 16:24:37,422][26022] Updated weights on worker 0-0, policy_version 800184 (0.00075) [2022-07-10 16:24:37,984][25689] Fps is (10 sec: 5589.5, 60 sec: 5523.9, 300 sec: 5534.3). Total num frames: 819390464. Throughput: 0: 4941.8. Samples: 819384606. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:37,985][25689] Avg episode reward: [(0, '-0.748')] [2022-07-10 16:24:39,523][26022] Updated weights on worker 0-0, policy_version 800194 (0.00080) [2022-07-10 16:24:41,235][26022] Updated weights on worker 0-0, policy_version 800204 (0.00087) [2022-07-10 16:24:43,030][25689] Fps is (10 sec: 5574.1, 60 sec: 5523.3, 300 sec: 5530.8). Total num frames: 819418112. Throughput: 0: 5780.6. Samples: 819418064. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 16:24:43,031][25689] Avg episode reward: [(0, '-1.559')] [2022-07-10 16:24:43,133][26022] Updated weights on worker 0-0, policy_version 800214 (0.00085) [2022-07-10 16:24:45,024][26022] Updated weights on worker 0-0, policy_version 800224 (0.00086) [2022-07-10 16:24:45,760][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:24:45,776][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000800227_819432448.pth [2022-07-10 16:24:45,776][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000798278_817436672.pth [2022-07-10 16:24:46,884][26022] Updated weights on worker 0-0, policy_version 800234 (0.00094) [2022-07-10 16:24:48,082][25689] Fps is (10 sec: 5475.9, 60 sec: 5505.1, 300 sec: 5527.0). Total num frames: 819445760. Throughput: 0: 5795.4. Samples: 819451352. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:24:48,083][25689] Avg episode reward: [(0, '-1.803')] [2022-07-10 16:24:48,699][26022] Updated weights on worker 0-0, policy_version 800244 (0.00086) [2022-07-10 16:24:50,618][26022] Updated weights on worker 0-0, policy_version 800254 (0.00087) [2022-07-10 16:24:52,344][26022] Updated weights on worker 0-0, policy_version 800264 (0.00088) [2022-07-10 16:24:53,128][25689] Fps is (10 sec: 5475.9, 60 sec: 5518.1, 300 sec: 5528.1). Total num frames: 819473408. Throughput: 0: 4958.2. Samples: 819468104. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:24:53,129][25689] Avg episode reward: [(0, '-1.028')] [2022-07-10 16:24:54,318][26022] Updated weights on worker 0-0, policy_version 800274 (0.00082) [2022-07-10 16:24:55,759][26022] Updated weights on worker 0-0, policy_version 800284 (0.00091) [2022-07-10 16:24:57,967][26022] Updated weights on worker 0-0, policy_version 800294 (0.00087) [2022-07-10 16:24:58,203][25689] Fps is (10 sec: 5565.0, 60 sec: 5497.9, 300 sec: 5528.0). Total num frames: 819502080. Throughput: 0: 5777.4. Samples: 819501416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:24:58,205][25689] Avg episode reward: [(0, '-0.594')] [2022-07-10 16:24:59,655][26022] Updated weights on worker 0-0, policy_version 800304 (0.00092) [2022-07-10 16:25:01,901][26022] Updated weights on worker 0-0, policy_version 800314 (0.00096) [2022-07-10 16:25:03,225][25689] Fps is (10 sec: 5476.6, 60 sec: 5531.1, 300 sec: 5534.5). Total num frames: 819528704. Throughput: 0: 5685.2. Samples: 819532876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:03,226][25689] Avg episode reward: [(0, '-0.383')] [2022-07-10 16:25:03,702][26022] Updated weights on worker 0-0, policy_version 800324 (0.00087) [2022-07-10 16:25:05,511][26022] Updated weights on worker 0-0, policy_version 800334 (0.00080) [2022-07-10 16:25:07,427][26022] Updated weights on worker 0-0, policy_version 800344 (0.00091) [2022-07-10 16:25:08,282][25689] Fps is (10 sec: 5384.4, 60 sec: 5509.6, 300 sec: 5531.1). Total num frames: 819556352. Throughput: 0: 4859.4. Samples: 819549512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:08,284][25689] Avg episode reward: [(0, '-0.010')] [2022-07-10 16:25:09,248][26022] Updated weights on worker 0-0, policy_version 800354 (0.00089) [2022-07-10 16:25:11,043][26022] Updated weights on worker 0-0, policy_version 800364 (0.00093) [2022-07-10 16:25:13,103][26022] Updated weights on worker 0-0, policy_version 800374 (0.00083) [2022-07-10 16:25:13,308][25689] Fps is (10 sec: 5585.4, 60 sec: 5542.8, 300 sec: 5532.1). Total num frames: 819585024. Throughput: 0: 5714.4. Samples: 819583422. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:13,309][25689] Avg episode reward: [(0, '0.592')] [2022-07-10 16:25:14,921][26022] Updated weights on worker 0-0, policy_version 800384 (0.00084) [2022-07-10 16:25:16,502][26022] Updated weights on worker 0-0, policy_version 800394 (0.00106) [2022-07-10 16:25:18,409][25689] Fps is (10 sec: 5561.6, 60 sec: 5540.8, 300 sec: 5530.3). Total num frames: 819612672. Throughput: 0: 5711.1. Samples: 819616816. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:18,411][25689] Avg episode reward: [(0, '0.877')] [2022-07-10 16:25:18,516][26022] Updated weights on worker 0-0, policy_version 800404 (0.00087) [2022-07-10 16:25:20,267][26022] Updated weights on worker 0-0, policy_version 800414 (0.00097) [2022-07-10 16:25:22,241][26022] Updated weights on worker 0-0, policy_version 800424 (0.00090) [2022-07-10 16:25:23,462][25689] Fps is (10 sec: 5547.0, 60 sec: 5543.2, 300 sec: 5533.0). Total num frames: 819641344. Throughput: 0: 5804.6. Samples: 819650342. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:23,462][25689] Avg episode reward: [(0, '1.074')] [2022-07-10 16:25:23,915][26022] Updated weights on worker 0-0, policy_version 800434 (0.00083) [2022-07-10 16:25:25,671][26022] Updated weights on worker 0-0, policy_version 800444 (0.00096) [2022-07-10 16:25:27,786][26022] Updated weights on worker 0-0, policy_version 800454 (0.00088) [2022-07-10 16:25:28,477][25689] Fps is (10 sec: 5593.7, 60 sec: 5528.4, 300 sec: 5532.9). Total num frames: 819668992. Throughput: 0: 5816.5. Samples: 819666976. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:28,478][25689] Avg episode reward: [(0, '0.952')] [2022-07-10 16:25:29,636][26022] Updated weights on worker 0-0, policy_version 800464 (0.00089) [2022-07-10 16:25:31,439][26022] Updated weights on worker 0-0, policy_version 800474 (0.00082) [2022-07-10 16:25:33,300][26022] Updated weights on worker 0-0, policy_version 800484 (0.00120) [2022-07-10 16:25:33,484][25689] Fps is (10 sec: 5517.2, 60 sec: 5529.4, 300 sec: 5531.4). Total num frames: 819696640. Throughput: 0: 5790.0. Samples: 819700238. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:33,485][25689] Avg episode reward: [(0, '0.538')] [2022-07-10 16:25:34,927][26022] Updated weights on worker 0-0, policy_version 800494 (0.00083) [2022-07-10 16:25:36,935][26022] Updated weights on worker 0-0, policy_version 800504 (0.00080) [2022-07-10 16:25:38,558][25689] Fps is (10 sec: 5587.0, 60 sec: 5527.9, 300 sec: 5533.8). Total num frames: 819725312. Throughput: 0: 5802.7. Samples: 819733734. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:38,558][25689] Avg episode reward: [(0, '0.330')] [2022-07-10 16:25:38,674][26022] Updated weights on worker 0-0, policy_version 800514 (0.00086) [2022-07-10 16:25:40,469][26022] Updated weights on worker 0-0, policy_version 800524 (0.00091) [2022-07-10 16:25:42,494][26022] Updated weights on worker 0-0, policy_version 800534 (0.00104) [2022-07-10 16:25:43,611][25689] Fps is (10 sec: 5561.6, 60 sec: 5527.3, 300 sec: 5530.9). Total num frames: 819752960. Throughput: 0: 4965.5. Samples: 819750392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:43,611][25689] Avg episode reward: [(0, '-0.928')] [2022-07-10 16:25:44,107][26022] Updated weights on worker 0-0, policy_version 800544 (0.00078) [2022-07-10 16:25:46,257][26022] Updated weights on worker 0-0, policy_version 800554 (0.00084) [2022-07-10 16:25:47,965][26022] Updated weights on worker 0-0, policy_version 800564 (0.00086) [2022-07-10 16:25:48,626][25689] Fps is (10 sec: 5492.3, 60 sec: 5530.7, 300 sec: 5528.1). Total num frames: 819780608. Throughput: 0: 5788.3. Samples: 819783602. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:48,626][25689] Avg episode reward: [(0, '-0.753')] [2022-07-10 16:25:49,664][26022] Updated weights on worker 0-0, policy_version 800574 (0.00092) [2022-07-10 16:25:51,909][26022] Updated weights on worker 0-0, policy_version 800584 (0.00088) [2022-07-10 16:25:53,286][26022] Updated weights on worker 0-0, policy_version 800594 (0.00091) [2022-07-10 16:25:53,671][25689] Fps is (10 sec: 5699.9, 60 sec: 5564.6, 300 sec: 5532.4). Total num frames: 819810304. Throughput: 0: 5793.6. Samples: 819817194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:53,672][25689] Avg episode reward: [(0, '-0.948')] [2022-07-10 16:25:55,572][26022] Updated weights on worker 0-0, policy_version 800604 (0.00086) [2022-07-10 16:25:56,996][26022] Updated weights on worker 0-0, policy_version 800614 (0.00081) [2022-07-10 16:25:58,751][25689] Fps is (10 sec: 5562.5, 60 sec: 5530.3, 300 sec: 5528.3). Total num frames: 819836928. Throughput: 0: 4944.0. Samples: 819833568. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:25:58,752][25689] Avg episode reward: [(0, '-1.148')] [2022-07-10 16:25:59,170][26022] Updated weights on worker 0-0, policy_version 800624 (0.00089) [2022-07-10 16:26:00,763][26022] Updated weights on worker 0-0, policy_version 800634 (0.00088) [2022-07-10 16:26:03,183][26022] Updated weights on worker 0-0, policy_version 800644 (0.00085) [2022-07-10 16:26:03,759][25689] Fps is (10 sec: 5278.2, 60 sec: 5531.5, 300 sec: 5528.4). Total num frames: 819863552. Throughput: 0: 5716.4. Samples: 819865570. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:03,761][25689] Avg episode reward: [(0, '-0.434')] [2022-07-10 16:26:04,862][26022] Updated weights on worker 0-0, policy_version 800654 (0.00089) [2022-07-10 16:26:06,857][26022] Updated weights on worker 0-0, policy_version 800664 (0.00085) [2022-07-10 16:26:08,595][26022] Updated weights on worker 0-0, policy_version 800674 (0.00102) [2022-07-10 16:26:08,775][25689] Fps is (10 sec: 5516.1, 60 sec: 5552.2, 300 sec: 5533.3). Total num frames: 819892224. Throughput: 0: 5710.1. Samples: 819898658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:08,776][25689] Avg episode reward: [(0, '0.222')] [2022-07-10 16:26:10,574][26022] Updated weights on worker 0-0, policy_version 800684 (0.00085) [2022-07-10 16:26:12,113][26022] Updated weights on worker 0-0, policy_version 800694 (0.00086) [2022-07-10 16:26:13,824][25689] Fps is (10 sec: 5494.1, 60 sec: 5516.3, 300 sec: 5528.1). Total num frames: 819918848. Throughput: 0: 4874.2. Samples: 819915424. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:13,825][25689] Avg episode reward: [(0, '0.792')] [2022-07-10 16:26:14,076][26022] Updated weights on worker 0-0, policy_version 800704 (0.00086) [2022-07-10 16:26:15,886][26022] Updated weights on worker 0-0, policy_version 800714 (0.00090) [2022-07-10 16:26:17,651][26022] Updated weights on worker 0-0, policy_version 800724 (0.00104) [2022-07-10 16:26:18,911][25689] Fps is (10 sec: 5455.4, 60 sec: 5534.5, 300 sec: 5530.7). Total num frames: 819947520. Throughput: 0: 5711.2. Samples: 819948708. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:18,912][25689] Avg episode reward: [(0, '0.669')] [2022-07-10 16:26:19,511][26022] Updated weights on worker 0-0, policy_version 800734 (0.00079) [2022-07-10 16:26:21,465][26022] Updated weights on worker 0-0, policy_version 800744 (0.00088) [2022-07-10 16:26:23,170][26022] Updated weights on worker 0-0, policy_version 800754 (0.00083) [2022-07-10 16:26:23,947][25689] Fps is (10 sec: 5664.5, 60 sec: 5536.0, 300 sec: 5526.9). Total num frames: 819976192. Throughput: 0: 5792.7. Samples: 819982514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:23,948][25689] Avg episode reward: [(0, '0.964')] [2022-07-10 16:26:25,042][26022] Updated weights on worker 0-0, policy_version 800764 (0.00081) [2022-07-10 16:26:26,935][26022] Updated weights on worker 0-0, policy_version 800774 (0.00104) [2022-07-10 16:26:28,660][26022] Updated weights on worker 0-0, policy_version 800784 (0.00081) [2022-07-10 16:26:28,955][25689] Fps is (10 sec: 5607.7, 60 sec: 5536.7, 300 sec: 5530.8). Total num frames: 820003840. Throughput: 0: 4995.9. Samples: 819999472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:28,956][25689] Avg episode reward: [(0, '0.855')] [2022-07-10 16:26:30,626][26022] Updated weights on worker 0-0, policy_version 800794 (0.00834) [2022-07-10 16:26:32,303][26022] Updated weights on worker 0-0, policy_version 800804 (0.00088) [2022-07-10 16:26:33,964][25689] Fps is (10 sec: 5520.3, 60 sec: 5536.5, 300 sec: 5528.2). Total num frames: 820031488. Throughput: 0: 5834.5. Samples: 820032932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:33,965][25689] Avg episode reward: [(0, '0.660')] [2022-07-10 16:26:34,191][26022] Updated weights on worker 0-0, policy_version 800814 (0.00085) [2022-07-10 16:26:36,171][26022] Updated weights on worker 0-0, policy_version 800824 (0.00084) [2022-07-10 16:26:37,811][26022] Updated weights on worker 0-0, policy_version 800834 (0.00084) [2022-07-10 16:26:39,080][25689] Fps is (10 sec: 5461.2, 60 sec: 5515.7, 300 sec: 5523.5). Total num frames: 820059136. Throughput: 0: 5837.4. Samples: 820066440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:39,081][25689] Avg episode reward: [(0, '0.798')] [2022-07-10 16:26:39,803][26022] Updated weights on worker 0-0, policy_version 800844 (0.00095) [2022-07-10 16:26:41,267][26022] Updated weights on worker 0-0, policy_version 800854 (0.00086) [2022-07-10 16:26:43,387][26022] Updated weights on worker 0-0, policy_version 800864 (0.00084) [2022-07-10 16:26:44,102][25689] Fps is (10 sec: 5656.7, 60 sec: 5552.4, 300 sec: 5538.2). Total num frames: 820088832. Throughput: 0: 5004.6. Samples: 820083378. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:44,102][25689] Avg episode reward: [(0, '0.677')] [2022-07-10 16:26:45,091][26022] Updated weights on worker 0-0, policy_version 800874 (0.00086) [2022-07-10 16:26:46,059][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:26:46,074][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000800877_820098048.pth [2022-07-10 16:26:46,075][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000798930_818104320.pth [2022-07-10 16:26:46,960][26022] Updated weights on worker 0-0, policy_version 800884 (0.00618) [2022-07-10 16:26:48,994][26022] Updated weights on worker 0-0, policy_version 800894 (0.00082) [2022-07-10 16:26:49,118][25689] Fps is (10 sec: 5610.6, 60 sec: 5535.4, 300 sec: 5527.9). Total num frames: 820115456. Throughput: 0: 5826.9. Samples: 820116962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:49,119][25689] Avg episode reward: [(0, '1.107')] [2022-07-10 16:26:50,693][26022] Updated weights on worker 0-0, policy_version 800904 (0.00087) [2022-07-10 16:26:52,482][26022] Updated weights on worker 0-0, policy_version 800914 (0.00090) [2022-07-10 16:26:54,158][25689] Fps is (10 sec: 5600.5, 60 sec: 5535.9, 300 sec: 5535.3). Total num frames: 820145152. Throughput: 0: 5838.4. Samples: 820150830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:54,158][25689] Avg episode reward: [(0, '0.661')] [2022-07-10 16:26:54,211][26022] Updated weights on worker 0-0, policy_version 800924 (0.00100) [2022-07-10 16:26:56,070][26022] Updated weights on worker 0-0, policy_version 800934 (0.00092) [2022-07-10 16:26:58,023][26022] Updated weights on worker 0-0, policy_version 800944 (0.00078) [2022-07-10 16:26:59,291][25689] Fps is (10 sec: 5737.8, 60 sec: 5564.9, 300 sec: 5536.2). Total num frames: 820173824. Throughput: 0: 4999.6. Samples: 820167486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:26:59,291][25689] Avg episode reward: [(0, '-0.122')] [2022-07-10 16:26:59,761][26022] Updated weights on worker 0-0, policy_version 800954 (0.00093) [2022-07-10 16:27:01,776][26022] Updated weights on worker 0-0, policy_version 800964 (0.00103) [2022-07-10 16:27:03,731][26022] Updated weights on worker 0-0, policy_version 800974 (0.00093) [2022-07-10 16:27:04,309][25689] Fps is (10 sec: 5447.0, 60 sec: 5563.9, 300 sec: 5532.6). Total num frames: 820200448. Throughput: 0: 5737.6. Samples: 820199322. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:04,311][25689] Avg episode reward: [(0, '-0.257')] [2022-07-10 16:27:05,705][26022] Updated weights on worker 0-0, policy_version 800984 (0.00092) [2022-07-10 16:27:07,413][26022] Updated weights on worker 0-0, policy_version 800994 (0.00092) [2022-07-10 16:27:09,330][25689] Fps is (10 sec: 5304.2, 60 sec: 5529.7, 300 sec: 5535.8). Total num frames: 820227072. Throughput: 0: 5738.4. Samples: 820232942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:09,330][25689] Avg episode reward: [(0, '-0.629')] [2022-07-10 16:27:09,409][26022] Updated weights on worker 0-0, policy_version 801004 (0.00087) [2022-07-10 16:27:11,011][26022] Updated weights on worker 0-0, policy_version 801014 (0.00094) [2022-07-10 16:27:12,957][26022] Updated weights on worker 0-0, policy_version 801024 (0.00088) [2022-07-10 16:27:14,332][25689] Fps is (10 sec: 5516.9, 60 sec: 5567.8, 300 sec: 5537.2). Total num frames: 820255744. Throughput: 0: 4903.7. Samples: 820249760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:14,333][25689] Avg episode reward: [(0, '-0.442')] [2022-07-10 16:27:14,889][26022] Updated weights on worker 0-0, policy_version 801034 (0.00088) [2022-07-10 16:27:16,642][26022] Updated weights on worker 0-0, policy_version 801044 (0.00087) [2022-07-10 16:27:18,417][26022] Updated weights on worker 0-0, policy_version 801054 (0.00091) [2022-07-10 16:27:19,464][25689] Fps is (10 sec: 5658.5, 60 sec: 5563.7, 300 sec: 5535.7). Total num frames: 820284416. Throughput: 0: 5734.0. Samples: 820283158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:19,464][25689] Avg episode reward: [(0, '-0.376')] [2022-07-10 16:27:20,362][26022] Updated weights on worker 0-0, policy_version 801064 (0.00092) [2022-07-10 16:27:22,136][26022] Updated weights on worker 0-0, policy_version 801074 (0.00092) [2022-07-10 16:27:24,206][26022] Updated weights on worker 0-0, policy_version 801084 (0.00086) [2022-07-10 16:27:24,471][25689] Fps is (10 sec: 5453.9, 60 sec: 5532.5, 300 sec: 5529.1). Total num frames: 820311040. Throughput: 0: 5810.9. Samples: 820316480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:24,472][25689] Avg episode reward: [(0, '-0.440')] [2022-07-10 16:27:25,827][26022] Updated weights on worker 0-0, policy_version 801094 (0.00092) [2022-07-10 16:27:27,854][26022] Updated weights on worker 0-0, policy_version 801104 (0.00088) [2022-07-10 16:27:29,564][25689] Fps is (10 sec: 5474.8, 60 sec: 5541.6, 300 sec: 5532.2). Total num frames: 820339712. Throughput: 0: 4954.9. Samples: 820333200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:29,564][25689] Avg episode reward: [(0, '-0.048')] [2022-07-10 16:27:29,592][26022] Updated weights on worker 0-0, policy_version 801114 (0.00091) [2022-07-10 16:27:31,564][26022] Updated weights on worker 0-0, policy_version 801124 (0.00087) [2022-07-10 16:27:33,295][26022] Updated weights on worker 0-0, policy_version 801134 (0.00095) [2022-07-10 16:27:34,603][25689] Fps is (10 sec: 5558.7, 60 sec: 5538.9, 300 sec: 5530.7). Total num frames: 820367360. Throughput: 0: 5753.6. Samples: 820366388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:34,603][25689] Avg episode reward: [(0, '-0.195')] [2022-07-10 16:27:35,259][26022] Updated weights on worker 0-0, policy_version 801144 (0.00091) [2022-07-10 16:27:37,057][26022] Updated weights on worker 0-0, policy_version 801154 (0.00091) [2022-07-10 16:27:38,821][26022] Updated weights on worker 0-0, policy_version 801164 (0.00393) [2022-07-10 16:27:39,729][25689] Fps is (10 sec: 5540.7, 60 sec: 5554.9, 300 sec: 5532.5). Total num frames: 820396032. Throughput: 0: 5745.1. Samples: 820399582. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:39,729][25689] Avg episode reward: [(0, '-0.001')] [2022-07-10 16:27:40,797][26022] Updated weights on worker 0-0, policy_version 801174 (0.00091) [2022-07-10 16:27:42,367][26022] Updated weights on worker 0-0, policy_version 801184 (0.00086) [2022-07-10 16:27:44,399][26022] Updated weights on worker 0-0, policy_version 801194 (0.00083) [2022-07-10 16:27:44,778][25689] Fps is (10 sec: 5635.7, 60 sec: 5535.4, 300 sec: 5533.1). Total num frames: 820424704. Throughput: 0: 5741.6. Samples: 820433074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:44,779][25689] Avg episode reward: [(0, '-0.440')] [2022-07-10 16:27:46,174][26022] Updated weights on worker 0-0, policy_version 801204 (0.00088) [2022-07-10 16:27:48,036][26022] Updated weights on worker 0-0, policy_version 801214 (0.00087) [2022-07-10 16:27:49,885][25689] Fps is (10 sec: 5444.6, 60 sec: 5527.3, 300 sec: 5524.4). Total num frames: 820451328. Throughput: 0: 5732.8. Samples: 820449696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:49,885][25689] Avg episode reward: [(0, '-1.212')] [2022-07-10 16:27:50,124][26022] Updated weights on worker 0-0, policy_version 801224 (0.00089) [2022-07-10 16:27:51,536][26022] Updated weights on worker 0-0, policy_version 801234 (0.00085) [2022-07-10 16:27:53,704][26022] Updated weights on worker 0-0, policy_version 801244 (0.00090) [2022-07-10 16:27:54,924][25689] Fps is (10 sec: 5752.9, 60 sec: 5561.0, 300 sec: 5539.1). Total num frames: 820483072. Throughput: 0: 5762.1. Samples: 820483480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:27:54,924][25689] Avg episode reward: [(0, '-1.409')] [2022-07-10 16:27:55,055][26022] Updated weights on worker 0-0, policy_version 801254 (0.00096) [2022-07-10 16:27:57,228][26022] Updated weights on worker 0-0, policy_version 801264 (0.00084) [2022-07-10 16:27:59,204][26022] Updated weights on worker 0-0, policy_version 801274 (0.00094) [2022-07-10 16:28:00,008][25689] Fps is (10 sec: 5664.7, 60 sec: 5514.9, 300 sec: 5534.6). Total num frames: 820508672. Throughput: 0: 5799.2. Samples: 820517186. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:28:00,009][25689] Avg episode reward: [(0, '-1.615')] [2022-07-10 16:28:00,849][26022] Updated weights on worker 0-0, policy_version 801284 (0.00095) [2022-07-10 16:28:03,019][26022] Updated weights on worker 0-0, policy_version 801294 (0.00082) [2022-07-10 16:28:04,899][26022] Updated weights on worker 0-0, policy_version 801304 (0.00084) [2022-07-10 16:28:05,037][25689] Fps is (10 sec: 5163.8, 60 sec: 5513.9, 300 sec: 5534.2). Total num frames: 820535296. Throughput: 0: 4880.2. Samples: 820531944. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:28:05,038][25689] Avg episode reward: [(0, '-1.878')] [2022-07-10 16:28:06,538][26022] Updated weights on worker 0-0, policy_version 801314 (0.00092) [2022-07-10 16:28:08,895][26022] Updated weights on worker 0-0, policy_version 801324 (0.00097) [2022-07-10 16:28:10,055][25689] Fps is (10 sec: 5605.6, 60 sec: 5564.7, 300 sec: 5541.1). Total num frames: 820564992. Throughput: 0: 5744.1. Samples: 820565556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:28:10,055][25689] Avg episode reward: [(0, '-1.779')] [2022-07-10 16:28:10,121][26022] Updated weights on worker 0-0, policy_version 801334 (0.00082) [2022-07-10 16:28:12,310][26022] Updated weights on worker 0-0, policy_version 801344 (0.00084) [2022-07-10 16:28:13,736][26022] Updated weights on worker 0-0, policy_version 801354 (0.00097) [2022-07-10 16:28:15,067][25689] Fps is (10 sec: 5717.7, 60 sec: 5547.0, 300 sec: 5535.8). Total num frames: 820592640. Throughput: 0: 5755.9. Samples: 820599420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:28:15,067][25689] Avg episode reward: [(0, '-2.176')] [2022-07-10 16:28:15,921][26022] Updated weights on worker 0-0, policy_version 801364 (0.00084) [2022-07-10 16:28:17,583][26022] Updated weights on worker 0-0, policy_version 801374 (0.00088) [2022-07-10 16:28:19,864][26022] Updated weights on worker 0-0, policy_version 801384 (0.00097) [2022-07-10 16:28:20,183][25689] Fps is (10 sec: 5257.5, 60 sec: 5497.8, 300 sec: 5527.2). Total num frames: 820618240. Throughput: 0: 4885.9. Samples: 820615760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 16:28:20,184][25689] Avg episode reward: [(0, '-2.012')] [2022-07-10 16:28:21,047][26022] Updated weights on worker 0-0, policy_version 801394 (0.00086) [2022-07-10 16:28:23,511][26022] Updated weights on worker 0-0, policy_version 801404 (0.00096) [2022-07-10 16:28:24,959][26022] Updated weights on worker 0-0, policy_version 801414 (0.00084) [2022-07-10 16:28:25,214][25689] Fps is (10 sec: 5651.0, 60 sec: 5580.0, 300 sec: 5540.4). Total num frames: 820649984. Throughput: 0: 5812.4. Samples: 820649220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:25,215][25689] Avg episode reward: [(0, '-1.598')] [2022-07-10 16:28:27,026][26022] Updated weights on worker 0-0, policy_version 801424 (0.00086) [2022-07-10 16:28:28,875][26022] Updated weights on worker 0-0, policy_version 801434 (0.00086) [2022-07-10 16:28:30,269][25689] Fps is (10 sec: 5685.7, 60 sec: 5532.9, 300 sec: 5529.6). Total num frames: 820675584. Throughput: 0: 5777.9. Samples: 820682350. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:30,270][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 16:28:30,661][26022] Updated weights on worker 0-0, policy_version 801444 (0.01051) [2022-07-10 16:28:32,501][26022] Updated weights on worker 0-0, policy_version 801454 (0.00088) [2022-07-10 16:28:34,461][26022] Updated weights on worker 0-0, policy_version 801464 (0.00092) [2022-07-10 16:28:35,277][25689] Fps is (10 sec: 5393.3, 60 sec: 5552.5, 300 sec: 5534.0). Total num frames: 820704256. Throughput: 0: 4929.7. Samples: 820699052. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:35,278][25689] Avg episode reward: [(0, '-0.535')] [2022-07-10 16:28:36,054][26022] Updated weights on worker 0-0, policy_version 801474 (0.00082) [2022-07-10 16:28:38,083][26022] Updated weights on worker 0-0, policy_version 801484 (0.00084) [2022-07-10 16:28:39,828][26022] Updated weights on worker 0-0, policy_version 801494 (0.00054) [2022-07-10 16:28:40,383][25689] Fps is (10 sec: 5568.1, 60 sec: 5537.4, 300 sec: 5532.8). Total num frames: 820731904. Throughput: 0: 5784.5. Samples: 820732610. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:40,384][25689] Avg episode reward: [(0, '-0.588')] [2022-07-10 16:28:41,680][26022] Updated weights on worker 0-0, policy_version 801504 (0.00091) [2022-07-10 16:28:43,630][26022] Updated weights on worker 0-0, policy_version 801514 (0.00099) [2022-07-10 16:28:45,373][26022] Updated weights on worker 0-0, policy_version 801524 (0.00613) [2022-07-10 16:28:45,466][25689] Fps is (10 sec: 5527.9, 60 sec: 5534.5, 300 sec: 5532.0). Total num frames: 820760576. Throughput: 0: 5773.1. Samples: 820766132. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:45,466][25689] Avg episode reward: [(0, '0.611')] [2022-07-10 16:28:46,290][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:28:46,298][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000801529_820765696.pth [2022-07-10 16:28:46,299][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000799579_818768896.pth [2022-07-10 16:28:47,218][26022] Updated weights on worker 0-0, policy_version 801534 (0.00087) [2022-07-10 16:28:49,119][26022] Updated weights on worker 0-0, policy_version 801544 (0.00090) [2022-07-10 16:28:50,512][25689] Fps is (10 sec: 5560.6, 60 sec: 5556.9, 300 sec: 5534.6). Total num frames: 820788224. Throughput: 0: 4952.0. Samples: 820782594. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:50,513][25689] Avg episode reward: [(0, '0.599')] [2022-07-10 16:28:50,924][26022] Updated weights on worker 0-0, policy_version 801554 (0.00099) [2022-07-10 16:28:52,933][26022] Updated weights on worker 0-0, policy_version 801564 (0.00084) [2022-07-10 16:28:54,611][26022] Updated weights on worker 0-0, policy_version 801574 (0.00087) [2022-07-10 16:28:55,522][25689] Fps is (10 sec: 5498.5, 60 sec: 5491.9, 300 sec: 5528.2). Total num frames: 820815872. Throughput: 0: 5778.4. Samples: 820816036. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:28:55,523][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 16:28:56,459][26022] Updated weights on worker 0-0, policy_version 801584 (0.00090) [2022-07-10 16:28:58,359][26022] Updated weights on worker 0-0, policy_version 801594 (0.00092) [2022-07-10 16:28:59,999][26022] Updated weights on worker 0-0, policy_version 801604 (0.00083) [2022-07-10 16:29:00,587][25689] Fps is (10 sec: 5691.8, 60 sec: 5561.3, 300 sec: 5544.5). Total num frames: 820845568. Throughput: 0: 5797.5. Samples: 820849738. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:00,587][25689] Avg episode reward: [(0, '0.713')] [2022-07-10 16:29:02,435][26022] Updated weights on worker 0-0, policy_version 801614 (0.00080) [2022-07-10 16:29:03,923][26022] Updated weights on worker 0-0, policy_version 801624 (0.00088) [2022-07-10 16:29:05,595][25689] Fps is (10 sec: 5489.3, 60 sec: 5546.3, 300 sec: 5534.2). Total num frames: 820871168. Throughput: 0: 4887.8. Samples: 820864522. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:05,596][25689] Avg episode reward: [(0, '-0.010')] [2022-07-10 16:29:05,925][26022] Updated weights on worker 0-0, policy_version 801634 (0.00086) [2022-07-10 16:29:07,863][26022] Updated weights on worker 0-0, policy_version 801644 (0.00085) [2022-07-10 16:29:09,586][26022] Updated weights on worker 0-0, policy_version 801654 (0.00086) [2022-07-10 16:29:10,607][25689] Fps is (10 sec: 5314.2, 60 sec: 5513.0, 300 sec: 5537.7). Total num frames: 820898816. Throughput: 0: 5751.8. Samples: 820898176. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:10,607][25689] Avg episode reward: [(0, '-1.052')] [2022-07-10 16:29:11,266][26022] Updated weights on worker 0-0, policy_version 801664 (0.00085) [2022-07-10 16:29:13,120][26022] Updated weights on worker 0-0, policy_version 801674 (0.00081) [2022-07-10 16:29:15,061][26022] Updated weights on worker 0-0, policy_version 801684 (0.00090) [2022-07-10 16:29:15,618][25689] Fps is (10 sec: 5619.2, 60 sec: 5530.0, 300 sec: 5542.4). Total num frames: 820927488. Throughput: 0: 5770.8. Samples: 820932008. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:15,619][25689] Avg episode reward: [(0, '-1.037')] [2022-07-10 16:29:17,007][26022] Updated weights on worker 0-0, policy_version 801694 (0.00086) [2022-07-10 16:29:18,840][26022] Updated weights on worker 0-0, policy_version 801704 (0.00107) [2022-07-10 16:29:20,692][25689] Fps is (10 sec: 5584.5, 60 sec: 5567.7, 300 sec: 5539.1). Total num frames: 820955136. Throughput: 0: 4911.4. Samples: 820948482. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:20,692][25689] Avg episode reward: [(0, '-1.215')] [2022-07-10 16:29:20,700][26022] Updated weights on worker 0-0, policy_version 801714 (0.00095) [2022-07-10 16:29:22,269][26022] Updated weights on worker 0-0, policy_version 801724 (0.00089) [2022-07-10 16:29:24,479][26022] Updated weights on worker 0-0, policy_version 801734 (0.00083) [2022-07-10 16:29:25,783][25689] Fps is (10 sec: 5641.6, 60 sec: 5528.4, 300 sec: 5541.6). Total num frames: 820984832. Throughput: 0: 5814.3. Samples: 820981898. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:25,783][25689] Avg episode reward: [(0, '-1.312')] [2022-07-10 16:29:25,804][26022] Updated weights on worker 0-0, policy_version 801744 (0.00093) [2022-07-10 16:29:28,176][26022] Updated weights on worker 0-0, policy_version 801754 (0.00089) [2022-07-10 16:29:29,561][26022] Updated weights on worker 0-0, policy_version 801764 (0.00095) [2022-07-10 16:29:30,818][25689] Fps is (10 sec: 5561.5, 60 sec: 5547.0, 300 sec: 5537.8). Total num frames: 821011456. Throughput: 0: 5798.0. Samples: 821015364. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:30,827][25689] Avg episode reward: [(0, '-1.798')] [2022-07-10 16:29:31,721][26022] Updated weights on worker 0-0, policy_version 801774 (0.00087) [2022-07-10 16:29:33,525][26022] Updated weights on worker 0-0, policy_version 801784 (0.00091) [2022-07-10 16:29:35,296][26022] Updated weights on worker 0-0, policy_version 801794 (0.00084) [2022-07-10 16:29:35,884][25689] Fps is (10 sec: 5474.3, 60 sec: 5541.8, 300 sec: 5537.7). Total num frames: 821040128. Throughput: 0: 5777.2. Samples: 821049086. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:35,884][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 16:29:37,168][26022] Updated weights on worker 0-0, policy_version 801804 (0.00094) [2022-07-10 16:29:38,837][26022] Updated weights on worker 0-0, policy_version 801814 (0.00085) [2022-07-10 16:29:40,889][26022] Updated weights on worker 0-0, policy_version 801824 (0.00091) [2022-07-10 16:29:40,977][25689] Fps is (10 sec: 5543.9, 60 sec: 5543.0, 300 sec: 5536.8). Total num frames: 821067776. Throughput: 0: 5782.9. Samples: 821065792. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:40,978][25689] Avg episode reward: [(0, '-1.286')] [2022-07-10 16:29:42,695][26022] Updated weights on worker 0-0, policy_version 801834 (0.00087) [2022-07-10 16:29:44,465][26022] Updated weights on worker 0-0, policy_version 801844 (0.00085) [2022-07-10 16:29:46,021][25689] Fps is (10 sec: 5656.6, 60 sec: 5563.4, 300 sec: 5543.8). Total num frames: 821097472. Throughput: 0: 5797.1. Samples: 821099224. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:46,022][25689] Avg episode reward: [(0, '-1.769')] [2022-07-10 16:29:46,449][26022] Updated weights on worker 0-0, policy_version 801854 (0.00084) [2022-07-10 16:29:48,395][26022] Updated weights on worker 0-0, policy_version 801864 (0.00093) [2022-07-10 16:29:50,123][26022] Updated weights on worker 0-0, policy_version 801874 (0.00051) [2022-07-10 16:29:51,047][25689] Fps is (10 sec: 5593.2, 60 sec: 5548.4, 300 sec: 5540.7). Total num frames: 821124096. Throughput: 0: 5787.0. Samples: 821132426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:51,047][25689] Avg episode reward: [(0, '-1.713')] [2022-07-10 16:29:51,972][26022] Updated weights on worker 0-0, policy_version 801884 (0.00091) [2022-07-10 16:29:53,822][26022] Updated weights on worker 0-0, policy_version 801894 (0.00086) [2022-07-10 16:29:55,707][26022] Updated weights on worker 0-0, policy_version 801904 (0.00097) [2022-07-10 16:29:56,071][25689] Fps is (10 sec: 5400.6, 60 sec: 5547.1, 300 sec: 5538.2). Total num frames: 821151744. Throughput: 0: 4959.7. Samples: 821149206. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:29:56,071][25689] Avg episode reward: [(0, '-0.987')] [2022-07-10 16:29:57,431][26022] Updated weights on worker 0-0, policy_version 801914 (0.00092) [2022-07-10 16:29:59,387][26022] Updated weights on worker 0-0, policy_version 801924 (0.00087) [2022-07-10 16:30:01,163][25689] Fps is (10 sec: 5567.0, 60 sec: 5527.7, 300 sec: 5543.8). Total num frames: 821180416. Throughput: 0: 5775.5. Samples: 821182376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:01,164][25689] Avg episode reward: [(0, '-2.019')] [2022-07-10 16:30:01,171][26022] Updated weights on worker 0-0, policy_version 801934 (0.00097) [2022-07-10 16:30:03,272][26022] Updated weights on worker 0-0, policy_version 801944 (0.00096) [2022-07-10 16:30:05,258][26022] Updated weights on worker 0-0, policy_version 801954 (0.00088) [2022-07-10 16:30:06,263][25689] Fps is (10 sec: 5425.5, 60 sec: 5536.3, 300 sec: 5539.6). Total num frames: 821207040. Throughput: 0: 5652.5. Samples: 821213638. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:06,263][25689] Avg episode reward: [(0, '-2.009')] [2022-07-10 16:30:07,165][26022] Updated weights on worker 0-0, policy_version 801964 (0.00088) [2022-07-10 16:30:08,871][26022] Updated weights on worker 0-0, policy_version 801974 (0.00085) [2022-07-10 16:30:10,912][26022] Updated weights on worker 0-0, policy_version 801984 (0.00096) [2022-07-10 16:30:11,327][25689] Fps is (10 sec: 5340.1, 60 sec: 5531.5, 300 sec: 5535.4). Total num frames: 821234688. Throughput: 0: 4826.4. Samples: 821230306. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:11,327][25689] Avg episode reward: [(0, '-1.368')] [2022-07-10 16:30:12,556][26022] Updated weights on worker 0-0, policy_version 801994 (0.00090) [2022-07-10 16:30:14,609][26022] Updated weights on worker 0-0, policy_version 802004 (0.00119) [2022-07-10 16:30:16,152][26022] Updated weights on worker 0-0, policy_version 802014 (0.00089) [2022-07-10 16:30:16,349][25689] Fps is (10 sec: 5482.4, 60 sec: 5513.7, 300 sec: 5536.9). Total num frames: 821262336. Throughput: 0: 5645.1. Samples: 821263676. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:16,349][25689] Avg episode reward: [(0, '-1.601')] [2022-07-10 16:30:18,289][26022] Updated weights on worker 0-0, policy_version 802024 (0.00082) [2022-07-10 16:30:19,947][26022] Updated weights on worker 0-0, policy_version 802034 (0.00090) [2022-07-10 16:30:21,416][25689] Fps is (10 sec: 5480.4, 60 sec: 5514.2, 300 sec: 5533.2). Total num frames: 821289984. Throughput: 0: 5647.9. Samples: 821296762. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:21,427][25689] Avg episode reward: [(0, '-1.930')] [2022-07-10 16:30:21,998][26022] Updated weights on worker 0-0, policy_version 802044 (0.00086) [2022-07-10 16:30:23,696][26022] Updated weights on worker 0-0, policy_version 802054 (0.00084) [2022-07-10 16:30:25,606][26022] Updated weights on worker 0-0, policy_version 802064 (0.00080) [2022-07-10 16:30:26,452][25689] Fps is (10 sec: 5574.4, 60 sec: 5502.4, 300 sec: 5536.3). Total num frames: 821318656. Throughput: 0: 4943.4. Samples: 821313442. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:26,452][25689] Avg episode reward: [(0, '-1.950')] [2022-07-10 16:30:27,246][26022] Updated weights on worker 0-0, policy_version 802074 (0.00085) [2022-07-10 16:30:29,257][26022] Updated weights on worker 0-0, policy_version 802084 (0.00087) [2022-07-10 16:30:31,126][26022] Updated weights on worker 0-0, policy_version 802094 (0.00086) [2022-07-10 16:30:31,495][25689] Fps is (10 sec: 5486.6, 60 sec: 5501.7, 300 sec: 5532.1). Total num frames: 821345280. Throughput: 0: 5775.3. Samples: 821346784. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:31,495][25689] Avg episode reward: [(0, '-0.766')] [2022-07-10 16:30:32,963][26022] Updated weights on worker 0-0, policy_version 802104 (0.00085) [2022-07-10 16:30:34,834][26022] Updated weights on worker 0-0, policy_version 802114 (0.00081) [2022-07-10 16:30:36,506][25689] Fps is (10 sec: 5500.0, 60 sec: 5506.7, 300 sec: 5533.3). Total num frames: 821373952. Throughput: 0: 5769.1. Samples: 821379964. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:36,506][25689] Avg episode reward: [(0, '-1.070')] [2022-07-10 16:30:36,626][26022] Updated weights on worker 0-0, policy_version 802124 (0.00114) [2022-07-10 16:30:38,382][26022] Updated weights on worker 0-0, policy_version 802134 (0.00091) [2022-07-10 16:30:40,312][26022] Updated weights on worker 0-0, policy_version 802144 (0.00096) [2022-07-10 16:30:41,559][25689] Fps is (10 sec: 5596.0, 60 sec: 5510.3, 300 sec: 5533.3). Total num frames: 821401600. Throughput: 0: 4949.5. Samples: 821396458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:41,560][25689] Avg episode reward: [(0, '0.088')] [2022-07-10 16:30:42,128][26022] Updated weights on worker 0-0, policy_version 802154 (0.00093) [2022-07-10 16:30:44,180][26022] Updated weights on worker 0-0, policy_version 802164 (0.00088) [2022-07-10 16:30:45,815][26022] Updated weights on worker 0-0, policy_version 802174 (0.00087) [2022-07-10 16:30:46,358][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:30:46,370][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000802177_821429248.pth [2022-07-10 16:30:46,374][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000800227_819432448.pth [2022-07-10 16:30:46,641][25689] Fps is (10 sec: 5557.0, 60 sec: 5490.0, 300 sec: 5535.5). Total num frames: 821430272. Throughput: 0: 5772.4. Samples: 821429980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:46,641][25689] Avg episode reward: [(0, '0.009')] [2022-07-10 16:30:47,704][26022] Updated weights on worker 0-0, policy_version 802184 (0.00088) [2022-07-10 16:30:49,660][26022] Updated weights on worker 0-0, policy_version 802194 (0.00094) [2022-07-10 16:30:51,467][26022] Updated weights on worker 0-0, policy_version 802204 (0.00085) [2022-07-10 16:30:51,726][25689] Fps is (10 sec: 5539.3, 60 sec: 5501.4, 300 sec: 5527.9). Total num frames: 821457920. Throughput: 0: 5752.3. Samples: 821463162. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:51,727][25689] Avg episode reward: [(0, '-0.008')] [2022-07-10 16:30:53,359][26022] Updated weights on worker 0-0, policy_version 802214 (0.00089) [2022-07-10 16:30:55,230][26022] Updated weights on worker 0-0, policy_version 802224 (0.00088) [2022-07-10 16:30:56,740][25689] Fps is (10 sec: 5475.3, 60 sec: 5502.4, 300 sec: 5532.6). Total num frames: 821485568. Throughput: 0: 4934.6. Samples: 821479812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:30:56,740][25689] Avg episode reward: [(0, '0.008')] [2022-07-10 16:30:56,900][26022] Updated weights on worker 0-0, policy_version 802234 (0.00090) [2022-07-10 16:30:58,990][26022] Updated weights on worker 0-0, policy_version 802244 (0.00091) [2022-07-10 16:31:00,538][26022] Updated weights on worker 0-0, policy_version 802254 (0.00087) [2022-07-10 16:31:01,858][25689] Fps is (10 sec: 5356.9, 60 sec: 5466.4, 300 sec: 5530.5). Total num frames: 821512192. Throughput: 0: 5760.2. Samples: 821513382. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:01,862][25689] Avg episode reward: [(0, '-0.186')] [2022-07-10 16:31:02,963][26022] Updated weights on worker 0-0, policy_version 802264 (0.00089) [2022-07-10 16:31:04,515][26022] Updated weights on worker 0-0, policy_version 802274 (0.00087) [2022-07-10 16:31:06,628][26022] Updated weights on worker 0-0, policy_version 802284 (0.00094) [2022-07-10 16:31:06,942][25689] Fps is (10 sec: 5319.6, 60 sec: 5484.5, 300 sec: 5525.8). Total num frames: 821539840. Throughput: 0: 5656.9. Samples: 821544824. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:06,943][25689] Avg episode reward: [(0, '-0.139')] [2022-07-10 16:31:08,388][26022] Updated weights on worker 0-0, policy_version 802294 (0.00109) [2022-07-10 16:31:10,333][26022] Updated weights on worker 0-0, policy_version 802304 (0.00085) [2022-07-10 16:31:11,981][25689] Fps is (10 sec: 5563.6, 60 sec: 5503.7, 300 sec: 5532.9). Total num frames: 821568512. Throughput: 0: 4845.1. Samples: 821561296. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:11,981][25689] Avg episode reward: [(0, '-1.084')] [2022-07-10 16:31:12,094][26022] Updated weights on worker 0-0, policy_version 802314 (0.00055) [2022-07-10 16:31:14,051][26022] Updated weights on worker 0-0, policy_version 802324 (0.00091) [2022-07-10 16:31:15,736][26022] Updated weights on worker 0-0, policy_version 802334 (0.00102) [2022-07-10 16:31:17,046][25689] Fps is (10 sec: 5574.1, 60 sec: 5499.8, 300 sec: 5529.8). Total num frames: 821596160. Throughput: 0: 5676.0. Samples: 821595070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:17,047][25689] Avg episode reward: [(0, '-0.797')] [2022-07-10 16:31:17,678][26022] Updated weights on worker 0-0, policy_version 802344 (0.00086) [2022-07-10 16:31:19,340][26022] Updated weights on worker 0-0, policy_version 802354 (0.00083) [2022-07-10 16:31:21,340][26022] Updated weights on worker 0-0, policy_version 802364 (0.00091) [2022-07-10 16:31:22,101][25689] Fps is (10 sec: 5463.8, 60 sec: 5501.0, 300 sec: 5526.1). Total num frames: 821623808. Throughput: 0: 5687.8. Samples: 821628522. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:22,102][25689] Avg episode reward: [(0, '-1.123')] [2022-07-10 16:31:22,992][26022] Updated weights on worker 0-0, policy_version 802374 (0.00086) [2022-07-10 16:31:25,155][26022] Updated weights on worker 0-0, policy_version 802384 (0.00090) [2022-07-10 16:31:26,575][26022] Updated weights on worker 0-0, policy_version 802394 (0.00082) [2022-07-10 16:31:27,126][25689] Fps is (10 sec: 5689.1, 60 sec: 5518.8, 300 sec: 5532.6). Total num frames: 821653504. Throughput: 0: 5791.6. Samples: 821661718. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:27,126][25689] Avg episode reward: [(0, '-1.017')] [2022-07-10 16:31:28,910][26022] Updated weights on worker 0-0, policy_version 802404 (0.00081) [2022-07-10 16:31:30,390][26022] Updated weights on worker 0-0, policy_version 802414 (0.00090) [2022-07-10 16:31:32,205][25689] Fps is (10 sec: 5574.1, 60 sec: 5515.5, 300 sec: 5527.9). Total num frames: 821680128. Throughput: 0: 5795.7. Samples: 821678510. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:32,207][25689] Avg episode reward: [(0, '-1.340')] [2022-07-10 16:31:32,375][26022] Updated weights on worker 0-0, policy_version 802424 (0.00082) [2022-07-10 16:31:33,986][26022] Updated weights on worker 0-0, policy_version 802434 (0.00082) [2022-07-10 16:31:36,289][26022] Updated weights on worker 0-0, policy_version 802444 (0.00080) [2022-07-10 16:31:37,211][25689] Fps is (10 sec: 5482.8, 60 sec: 5515.9, 300 sec: 5533.4). Total num frames: 821708800. Throughput: 0: 5804.4. Samples: 821712116. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:37,213][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 16:31:37,781][26022] Updated weights on worker 0-0, policy_version 802454 (0.00085) [2022-07-10 16:31:39,941][26022] Updated weights on worker 0-0, policy_version 802464 (0.00085) [2022-07-10 16:31:41,397][26022] Updated weights on worker 0-0, policy_version 802474 (0.00090) [2022-07-10 16:31:42,266][25689] Fps is (10 sec: 5801.1, 60 sec: 5549.5, 300 sec: 5532.7). Total num frames: 821738496. Throughput: 0: 5798.4. Samples: 821745450. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:42,267][25689] Avg episode reward: [(0, '-1.281')] [2022-07-10 16:31:43,654][26022] Updated weights on worker 0-0, policy_version 802484 (0.00096) [2022-07-10 16:31:44,954][26022] Updated weights on worker 0-0, policy_version 802494 (0.00090) [2022-07-10 16:31:47,122][26022] Updated weights on worker 0-0, policy_version 802504 (0.00089) [2022-07-10 16:31:47,287][25689] Fps is (10 sec: 5488.1, 60 sec: 5504.5, 300 sec: 5529.2). Total num frames: 821764096. Throughput: 0: 4991.9. Samples: 821762360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:47,288][25689] Avg episode reward: [(0, '-2.320')] [2022-07-10 16:31:48,503][26022] Updated weights on worker 0-0, policy_version 802514 (0.00092) [2022-07-10 16:31:50,923][26022] Updated weights on worker 0-0, policy_version 802524 (0.00082) [2022-07-10 16:31:52,291][25689] Fps is (10 sec: 5515.9, 60 sec: 5545.7, 300 sec: 5529.9). Total num frames: 821793792. Throughput: 0: 5851.9. Samples: 821796052. Policy #0 lag: (min: 0.0, avg: 9.7, max: 18.0) [2022-07-10 16:31:52,293][25689] Avg episode reward: [(0, '-2.058')] [2022-07-10 16:31:52,370][26022] Updated weights on worker 0-0, policy_version 802534 (0.00096) [2022-07-10 16:31:54,317][26022] Updated weights on worker 0-0, policy_version 802544 (0.00085) [2022-07-10 16:31:56,186][26022] Updated weights on worker 0-0, policy_version 802554 (0.00094) [2022-07-10 16:31:57,307][25689] Fps is (10 sec: 5723.0, 60 sec: 5545.5, 300 sec: 5528.6). Total num frames: 821821440. Throughput: 0: 5840.9. Samples: 821829492. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:31:57,307][25689] Avg episode reward: [(0, '-2.653')] [2022-07-10 16:31:57,969][26022] Updated weights on worker 0-0, policy_version 802564 (0.00094) [2022-07-10 16:31:59,877][26022] Updated weights on worker 0-0, policy_version 802574 (0.00090) [2022-07-10 16:32:01,827][26022] Updated weights on worker 0-0, policy_version 802584 (0.00096) [2022-07-10 16:32:02,380][25689] Fps is (10 sec: 5379.5, 60 sec: 5549.6, 300 sec: 5527.6). Total num frames: 821848064. Throughput: 0: 5011.2. Samples: 821846242. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:02,382][25689] Avg episode reward: [(0, '-2.295')] [2022-07-10 16:32:03,912][26022] Updated weights on worker 0-0, policy_version 802594 (0.00087) [2022-07-10 16:32:05,652][26022] Updated weights on worker 0-0, policy_version 802604 (0.00090) [2022-07-10 16:32:07,396][25689] Fps is (10 sec: 5277.5, 60 sec: 5538.9, 300 sec: 5527.7). Total num frames: 821874688. Throughput: 0: 5739.8. Samples: 821877784. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:07,397][25689] Avg episode reward: [(0, '-1.539')] [2022-07-10 16:32:07,555][26022] Updated weights on worker 0-0, policy_version 802614 (0.00085) [2022-07-10 16:32:09,418][26022] Updated weights on worker 0-0, policy_version 802624 (0.00092) [2022-07-10 16:32:11,161][26022] Updated weights on worker 0-0, policy_version 802634 (0.00050) [2022-07-10 16:32:12,402][25689] Fps is (10 sec: 5517.2, 60 sec: 5541.8, 300 sec: 5527.6). Total num frames: 821903360. Throughput: 0: 5712.3. Samples: 821910932. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:12,403][25689] Avg episode reward: [(0, '-1.442')] [2022-07-10 16:32:13,191][26022] Updated weights on worker 0-0, policy_version 802644 (0.00087) [2022-07-10 16:32:14,948][26022] Updated weights on worker 0-0, policy_version 802654 (0.00093) [2022-07-10 16:32:16,739][26022] Updated weights on worker 0-0, policy_version 802664 (0.01641) [2022-07-10 16:32:17,432][25689] Fps is (10 sec: 5612.2, 60 sec: 5545.2, 300 sec: 5526.1). Total num frames: 821931008. Throughput: 0: 4875.7. Samples: 821927616. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:17,432][25689] Avg episode reward: [(0, '-1.308')] [2022-07-10 16:32:18,575][26022] Updated weights on worker 0-0, policy_version 802674 (0.00094) [2022-07-10 16:32:20,467][26022] Updated weights on worker 0-0, policy_version 802684 (0.00086) [2022-07-10 16:32:22,450][26022] Updated weights on worker 0-0, policy_version 802694 (0.00096) [2022-07-10 16:32:22,525][25689] Fps is (10 sec: 5563.6, 60 sec: 5558.6, 300 sec: 5531.3). Total num frames: 821959680. Throughput: 0: 5707.5. Samples: 821961222. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:22,526][25689] Avg episode reward: [(0, '-0.499')] [2022-07-10 16:32:24,148][26022] Updated weights on worker 0-0, policy_version 802704 (0.00090) [2022-07-10 16:32:26,017][26022] Updated weights on worker 0-0, policy_version 802714 (0.00091) [2022-07-10 16:32:27,542][25689] Fps is (10 sec: 5672.0, 60 sec: 5542.4, 300 sec: 5532.8). Total num frames: 821988352. Throughput: 0: 5803.2. Samples: 821994692. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:27,547][25689] Avg episode reward: [(0, '-0.355')] [2022-07-10 16:32:27,820][26022] Updated weights on worker 0-0, policy_version 802724 (0.00089) [2022-07-10 16:32:29,591][26022] Updated weights on worker 0-0, policy_version 802734 (0.00096) [2022-07-10 16:32:31,609][26022] Updated weights on worker 0-0, policy_version 802744 (0.00088) [2022-07-10 16:32:32,556][25689] Fps is (10 sec: 5512.9, 60 sec: 5548.4, 300 sec: 5529.8). Total num frames: 822014976. Throughput: 0: 4973.0. Samples: 822011154. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:32,556][25689] Avg episode reward: [(0, '-0.848')] [2022-07-10 16:32:33,245][26022] Updated weights on worker 0-0, policy_version 802754 (0.00084) [2022-07-10 16:32:35,424][26022] Updated weights on worker 0-0, policy_version 802764 (0.00092) [2022-07-10 16:32:37,001][26022] Updated weights on worker 0-0, policy_version 802774 (0.00087) [2022-07-10 16:32:37,567][25689] Fps is (10 sec: 5413.6, 60 sec: 5531.0, 300 sec: 5528.5). Total num frames: 822042624. Throughput: 0: 5813.4. Samples: 822044670. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:37,568][25689] Avg episode reward: [(0, '-0.222')] [2022-07-10 16:32:38,987][26022] Updated weights on worker 0-0, policy_version 802784 (0.00096) [2022-07-10 16:32:40,587][26022] Updated weights on worker 0-0, policy_version 802794 (0.00085) [2022-07-10 16:32:42,659][26022] Updated weights on worker 0-0, policy_version 802804 (0.00088) [2022-07-10 16:32:42,667][25689] Fps is (10 sec: 5468.9, 60 sec: 5493.0, 300 sec: 5524.1). Total num frames: 822070272. Throughput: 0: 5801.9. Samples: 822078080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:42,667][25689] Avg episode reward: [(0, '0.522')] [2022-07-10 16:32:44,350][26022] Updated weights on worker 0-0, policy_version 802814 (0.00097) [2022-07-10 16:32:46,397][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:32:46,417][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000802824_822091776.pth [2022-07-10 16:32:46,418][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000800877_820098048.pth [2022-07-10 16:32:46,420][26022] Updated weights on worker 0-0, policy_version 802824 (0.00089) [2022-07-10 16:32:47,671][25689] Fps is (10 sec: 5574.0, 60 sec: 5545.3, 300 sec: 5532.9). Total num frames: 822098944. Throughput: 0: 4980.5. Samples: 822094944. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:47,673][25689] Avg episode reward: [(0, '-0.298')] [2022-07-10 16:32:48,039][26022] Updated weights on worker 0-0, policy_version 802834 (0.00083) [2022-07-10 16:32:49,865][26022] Updated weights on worker 0-0, policy_version 802844 (0.00088) [2022-07-10 16:32:51,692][26022] Updated weights on worker 0-0, policy_version 802854 (0.00088) [2022-07-10 16:32:52,698][25689] Fps is (10 sec: 5716.4, 60 sec: 5526.3, 300 sec: 5522.8). Total num frames: 822127616. Throughput: 0: 5824.4. Samples: 822128472. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:52,699][25689] Avg episode reward: [(0, '-0.239')] [2022-07-10 16:32:53,632][26022] Updated weights on worker 0-0, policy_version 802864 (0.00091) [2022-07-10 16:32:55,372][26022] Updated weights on worker 0-0, policy_version 802874 (0.00084) [2022-07-10 16:32:57,324][26022] Updated weights on worker 0-0, policy_version 802884 (0.00087) [2022-07-10 16:32:57,716][25689] Fps is (10 sec: 5504.7, 60 sec: 5509.1, 300 sec: 5527.5). Total num frames: 822154240. Throughput: 0: 5804.9. Samples: 822161634. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:32:57,716][25689] Avg episode reward: [(0, '0.139')] [2022-07-10 16:32:59,145][26022] Updated weights on worker 0-0, policy_version 802894 (0.00082) [2022-07-10 16:33:01,014][26022] Updated weights on worker 0-0, policy_version 802904 (0.00084) [2022-07-10 16:33:02,799][25689] Fps is (10 sec: 5271.7, 60 sec: 5508.2, 300 sec: 5526.5). Total num frames: 822180864. Throughput: 0: 4985.3. Samples: 822178444. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:02,799][25689] Avg episode reward: [(0, '-0.830')] [2022-07-10 16:33:03,388][26022] Updated weights on worker 0-0, policy_version 802914 (0.00093) [2022-07-10 16:33:05,079][26022] Updated weights on worker 0-0, policy_version 802924 (0.00088) [2022-07-10 16:33:06,819][26022] Updated weights on worker 0-0, policy_version 802934 (0.00095) [2022-07-10 16:33:07,859][25689] Fps is (10 sec: 5451.6, 60 sec: 5538.1, 300 sec: 5522.2). Total num frames: 822209536. Throughput: 0: 5700.1. Samples: 822210020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:07,860][25689] Avg episode reward: [(0, '-3.050')] [2022-07-10 16:33:08,617][26022] Updated weights on worker 0-0, policy_version 802944 (0.00087) [2022-07-10 16:33:10,530][26022] Updated weights on worker 0-0, policy_version 802954 (0.00089) [2022-07-10 16:33:12,420][26022] Updated weights on worker 0-0, policy_version 802964 (0.00086) [2022-07-10 16:33:12,875][25689] Fps is (10 sec: 5589.3, 60 sec: 5520.2, 300 sec: 5522.2). Total num frames: 822237184. Throughput: 0: 5713.9. Samples: 822243762. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:12,876][25689] Avg episode reward: [(0, '-3.146')] [2022-07-10 16:33:14,091][26022] Updated weights on worker 0-0, policy_version 802974 (0.00087) [2022-07-10 16:33:16,007][26022] Updated weights on worker 0-0, policy_version 802984 (0.00095) [2022-07-10 16:33:17,880][25689] Fps is (10 sec: 5620.4, 60 sec: 5539.4, 300 sec: 5534.5). Total num frames: 822265856. Throughput: 0: 4901.2. Samples: 822260460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:17,880][25689] Avg episode reward: [(0, '-3.593')] [2022-07-10 16:33:17,883][26022] Updated weights on worker 0-0, policy_version 802994 (0.00087) [2022-07-10 16:33:19,825][26022] Updated weights on worker 0-0, policy_version 803004 (0.00078) [2022-07-10 16:33:21,453][26022] Updated weights on worker 0-0, policy_version 803014 (0.00092) [2022-07-10 16:33:22,927][25689] Fps is (10 sec: 5603.1, 60 sec: 5526.8, 300 sec: 5520.5). Total num frames: 822293504. Throughput: 0: 5750.4. Samples: 822294188. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:22,927][25689] Avg episode reward: [(0, '-3.832')] [2022-07-10 16:33:23,337][26022] Updated weights on worker 0-0, policy_version 803024 (0.00095) [2022-07-10 16:33:25,244][26022] Updated weights on worker 0-0, policy_version 803034 (0.00092) [2022-07-10 16:33:27,177][26022] Updated weights on worker 0-0, policy_version 803044 (0.00085) [2022-07-10 16:33:27,945][25689] Fps is (10 sec: 5595.2, 60 sec: 5526.5, 300 sec: 5531.5). Total num frames: 822322176. Throughput: 0: 5859.8. Samples: 822327722. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:27,946][25689] Avg episode reward: [(0, '-3.212')] [2022-07-10 16:33:28,753][26022] Updated weights on worker 0-0, policy_version 803054 (0.00086) [2022-07-10 16:33:30,690][26022] Updated weights on worker 0-0, policy_version 803064 (0.00087) [2022-07-10 16:33:32,339][26022] Updated weights on worker 0-0, policy_version 803074 (0.00068) [2022-07-10 16:33:32,959][25689] Fps is (10 sec: 5613.6, 60 sec: 5543.5, 300 sec: 5527.9). Total num frames: 822349824. Throughput: 0: 5019.3. Samples: 822344572. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:32,960][25689] Avg episode reward: [(0, '-2.629')] [2022-07-10 16:33:34,267][26022] Updated weights on worker 0-0, policy_version 803084 (0.00095) [2022-07-10 16:33:36,286][26022] Updated weights on worker 0-0, policy_version 803094 (0.00085) [2022-07-10 16:33:37,967][25689] Fps is (10 sec: 5518.0, 60 sec: 5543.9, 300 sec: 5529.8). Total num frames: 822377472. Throughput: 0: 5869.9. Samples: 822378368. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:37,967][25689] Avg episode reward: [(0, '-0.419')] [2022-07-10 16:33:38,011][26022] Updated weights on worker 0-0, policy_version 803104 (0.00084) [2022-07-10 16:33:39,837][26022] Updated weights on worker 0-0, policy_version 803114 (0.00089) [2022-07-10 16:33:41,774][26022] Updated weights on worker 0-0, policy_version 803124 (0.00090) [2022-07-10 16:33:43,105][25689] Fps is (10 sec: 5652.2, 60 sec: 5574.2, 300 sec: 5532.2). Total num frames: 822407168. Throughput: 0: 5819.5. Samples: 822411614. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:43,105][25689] Avg episode reward: [(0, '0.862')] [2022-07-10 16:33:43,427][26022] Updated weights on worker 0-0, policy_version 803134 (0.00086) [2022-07-10 16:33:45,520][26022] Updated weights on worker 0-0, policy_version 803144 (0.00085) [2022-07-10 16:33:46,916][26022] Updated weights on worker 0-0, policy_version 803154 (0.00087) [2022-07-10 16:33:48,107][25689] Fps is (10 sec: 5655.1, 60 sec: 5557.5, 300 sec: 5533.0). Total num frames: 822434816. Throughput: 0: 5002.0. Samples: 822428570. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:48,107][25689] Avg episode reward: [(0, '1.156')] [2022-07-10 16:33:49,131][26022] Updated weights on worker 0-0, policy_version 803164 (0.00093) [2022-07-10 16:33:50,793][26022] Updated weights on worker 0-0, policy_version 803174 (0.00102) [2022-07-10 16:33:52,724][26022] Updated weights on worker 0-0, policy_version 803184 (0.00090) [2022-07-10 16:33:53,141][25689] Fps is (10 sec: 5407.8, 60 sec: 5523.0, 300 sec: 5529.1). Total num frames: 822461440. Throughput: 0: 5808.0. Samples: 822461784. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:53,141][25689] Avg episode reward: [(0, '1.211')] [2022-07-10 16:33:54,536][26022] Updated weights on worker 0-0, policy_version 803194 (0.00089) [2022-07-10 16:33:56,502][26022] Updated weights on worker 0-0, policy_version 803204 (0.00088) [2022-07-10 16:33:58,154][25689] Fps is (10 sec: 5503.5, 60 sec: 5557.3, 300 sec: 5526.6). Total num frames: 822490112. Throughput: 0: 5768.0. Samples: 822494810. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:33:58,155][25689] Avg episode reward: [(0, '1.576')] [2022-07-10 16:33:58,238][26022] Updated weights on worker 0-0, policy_version 803214 (0.00080) [2022-07-10 16:34:00,345][26022] Updated weights on worker 0-0, policy_version 803224 (0.00082) [2022-07-10 16:34:01,729][26022] Updated weights on worker 0-0, policy_version 803234 (0.00085) [2022-07-10 16:34:03,251][25689] Fps is (10 sec: 5469.5, 60 sec: 5556.0, 300 sec: 5528.4). Total num frames: 822516736. Throughput: 0: 4962.9. Samples: 822511596. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:03,252][25689] Avg episode reward: [(0, '1.609')] [2022-07-10 16:34:04,427][26022] Updated weights on worker 0-0, policy_version 803244 (0.00094) [2022-07-10 16:34:05,998][26022] Updated weights on worker 0-0, policy_version 803254 (0.00085) [2022-07-10 16:34:07,963][26022] Updated weights on worker 0-0, policy_version 803264 (0.00097) [2022-07-10 16:34:08,303][25689] Fps is (10 sec: 5347.7, 60 sec: 5539.8, 300 sec: 5527.7). Total num frames: 822544384. Throughput: 0: 5661.9. Samples: 822542918. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:08,303][25689] Avg episode reward: [(0, '1.245')] [2022-07-10 16:34:09,800][26022] Updated weights on worker 0-0, policy_version 803274 (0.00088) [2022-07-10 16:34:11,555][26022] Updated weights on worker 0-0, policy_version 803284 (0.00085) [2022-07-10 16:34:13,335][25689] Fps is (10 sec: 5381.7, 60 sec: 5521.4, 300 sec: 5520.4). Total num frames: 822571008. Throughput: 0: 5661.2. Samples: 822576110. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:13,336][25689] Avg episode reward: [(0, '1.047')] [2022-07-10 16:34:13,474][26022] Updated weights on worker 0-0, policy_version 803294 (0.00090) [2022-07-10 16:34:15,215][26022] Updated weights on worker 0-0, policy_version 803304 (0.00086) [2022-07-10 16:34:17,125][26022] Updated weights on worker 0-0, policy_version 803314 (0.00085) [2022-07-10 16:34:18,338][25689] Fps is (10 sec: 5510.0, 60 sec: 5521.5, 300 sec: 5525.2). Total num frames: 822599680. Throughput: 0: 5682.7. Samples: 822609510. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:18,339][25689] Avg episode reward: [(0, '0.923')] [2022-07-10 16:34:19,016][26022] Updated weights on worker 0-0, policy_version 803324 (0.00089) [2022-07-10 16:34:20,695][26022] Updated weights on worker 0-0, policy_version 803334 (0.00083) [2022-07-10 16:34:22,792][26022] Updated weights on worker 0-0, policy_version 803344 (0.00087) [2022-07-10 16:34:23,386][25689] Fps is (10 sec: 5705.3, 60 sec: 5538.4, 300 sec: 5522.5). Total num frames: 822628352. Throughput: 0: 5694.2. Samples: 822626252. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:23,387][25689] Avg episode reward: [(0, '0.632')] [2022-07-10 16:34:24,488][26022] Updated weights on worker 0-0, policy_version 803354 (0.00091) [2022-07-10 16:34:26,205][26022] Updated weights on worker 0-0, policy_version 803364 (0.00091) [2022-07-10 16:34:28,095][26022] Updated weights on worker 0-0, policy_version 803374 (0.00092) [2022-07-10 16:34:28,392][25689] Fps is (10 sec: 5500.0, 60 sec: 5505.7, 300 sec: 5523.1). Total num frames: 822654976. Throughput: 0: 5805.8. Samples: 822659554. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:28,393][25689] Avg episode reward: [(0, '0.650')] [2022-07-10 16:34:30,086][26022] Updated weights on worker 0-0, policy_version 803384 (0.00085) [2022-07-10 16:34:31,893][26022] Updated weights on worker 0-0, policy_version 803394 (0.00082) [2022-07-10 16:34:33,430][25689] Fps is (10 sec: 5403.8, 60 sec: 5503.5, 300 sec: 5520.1). Total num frames: 822682624. Throughput: 0: 5820.5. Samples: 822693070. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:33,430][25689] Avg episode reward: [(0, '0.548')] [2022-07-10 16:34:33,745][26022] Updated weights on worker 0-0, policy_version 803404 (0.00096) [2022-07-10 16:34:35,548][26022] Updated weights on worker 0-0, policy_version 803414 (0.00089) [2022-07-10 16:34:37,394][26022] Updated weights on worker 0-0, policy_version 803424 (0.00086) [2022-07-10 16:34:38,463][25689] Fps is (10 sec: 5694.3, 60 sec: 5535.0, 300 sec: 5528.2). Total num frames: 822712320. Throughput: 0: 4979.5. Samples: 822709718. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:38,463][25689] Avg episode reward: [(0, '0.194')] [2022-07-10 16:34:39,250][26022] Updated weights on worker 0-0, policy_version 803434 (0.00096) [2022-07-10 16:34:41,074][26022] Updated weights on worker 0-0, policy_version 803444 (0.00089) [2022-07-10 16:34:42,810][26022] Updated weights on worker 0-0, policy_version 803454 (0.00086) [2022-07-10 16:34:43,514][25689] Fps is (10 sec: 5686.3, 60 sec: 5509.1, 300 sec: 5521.1). Total num frames: 822739968. Throughput: 0: 5811.4. Samples: 822743224. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:43,515][25689] Avg episode reward: [(0, '-0.266')] [2022-07-10 16:34:44,707][26022] Updated weights on worker 0-0, policy_version 803464 (0.00094) [2022-07-10 16:34:46,533][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:34:46,546][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000803474_822757376.pth [2022-07-10 16:34:46,551][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000801529_820765696.pth [2022-07-10 16:34:46,560][26022] Updated weights on worker 0-0, policy_version 803474 (0.00356) [2022-07-10 16:34:48,451][26022] Updated weights on worker 0-0, policy_version 803484 (0.00104) [2022-07-10 16:34:48,587][25689] Fps is (10 sec: 5461.8, 60 sec: 5502.7, 300 sec: 5523.7). Total num frames: 822767616. Throughput: 0: 5803.3. Samples: 822776748. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:48,587][25689] Avg episode reward: [(0, '-0.261')] [2022-07-10 16:34:50,131][26022] Updated weights on worker 0-0, policy_version 803494 (0.00094) [2022-07-10 16:34:52,263][26022] Updated weights on worker 0-0, policy_version 803504 (0.00090) [2022-07-10 16:34:53,678][25689] Fps is (10 sec: 5541.3, 60 sec: 5531.3, 300 sec: 5525.9). Total num frames: 822796288. Throughput: 0: 4952.0. Samples: 822793340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:53,678][25689] Avg episode reward: [(0, '-0.828')] [2022-07-10 16:34:53,955][26022] Updated weights on worker 0-0, policy_version 803514 (0.00083) [2022-07-10 16:34:55,754][26022] Updated weights on worker 0-0, policy_version 803524 (0.00086) [2022-07-10 16:34:57,681][26022] Updated weights on worker 0-0, policy_version 803534 (0.00096) [2022-07-10 16:34:58,696][25689] Fps is (10 sec: 5571.1, 60 sec: 5513.9, 300 sec: 5523.8). Total num frames: 822823936. Throughput: 0: 5773.1. Samples: 822826528. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:34:58,696][25689] Avg episode reward: [(0, '-1.662')] [2022-07-10 16:34:59,356][26022] Updated weights on worker 0-0, policy_version 803544 (0.00090) [2022-07-10 16:35:01,463][26022] Updated weights on worker 0-0, policy_version 803554 (0.00085) [2022-07-10 16:35:03,496][26022] Updated weights on worker 0-0, policy_version 803564 (0.00092) [2022-07-10 16:35:03,831][25689] Fps is (10 sec: 5244.6, 60 sec: 5493.5, 300 sec: 5519.7). Total num frames: 822849536. Throughput: 0: 5648.7. Samples: 822857986. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:35:03,831][25689] Avg episode reward: [(0, '-0.651')] [2022-07-10 16:35:05,404][26022] Updated weights on worker 0-0, policy_version 803574 (0.00093) [2022-07-10 16:35:07,295][26022] Updated weights on worker 0-0, policy_version 803584 (0.00592) [2022-07-10 16:35:08,849][25689] Fps is (10 sec: 5244.5, 60 sec: 5496.6, 300 sec: 5520.6). Total num frames: 822877184. Throughput: 0: 4835.1. Samples: 822874718. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:35:08,850][25689] Avg episode reward: [(0, '-0.512')] [2022-07-10 16:35:09,111][26022] Updated weights on worker 0-0, policy_version 803594 (0.00090) [2022-07-10 16:35:10,924][26022] Updated weights on worker 0-0, policy_version 803604 (0.00094) [2022-07-10 16:35:12,808][26022] Updated weights on worker 0-0, policy_version 803614 (0.00096) [2022-07-10 16:35:13,931][25689] Fps is (10 sec: 5778.9, 60 sec: 5559.7, 300 sec: 5529.8). Total num frames: 822907904. Throughput: 0: 5667.5. Samples: 822908126. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:35:13,933][25689] Avg episode reward: [(0, '-1.645')] [2022-07-10 16:35:14,579][26022] Updated weights on worker 0-0, policy_version 803624 (0.00085) [2022-07-10 16:35:16,413][26022] Updated weights on worker 0-0, policy_version 803634 (0.00088) [2022-07-10 16:35:18,430][26022] Updated weights on worker 0-0, policy_version 803644 (0.00093) [2022-07-10 16:35:19,002][25689] Fps is (10 sec: 5648.0, 60 sec: 5519.7, 300 sec: 5526.3). Total num frames: 822934528. Throughput: 0: 5657.8. Samples: 822941416. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:35:19,003][25689] Avg episode reward: [(0, '-2.455')] [2022-07-10 16:35:20,205][26022] Updated weights on worker 0-0, policy_version 803654 (0.00106) [2022-07-10 16:35:21,918][26022] Updated weights on worker 0-0, policy_version 803664 (0.00104) [2022-07-10 16:35:23,980][26022] Updated weights on worker 0-0, policy_version 803674 (0.00992) [2022-07-10 16:35:24,087][25689] Fps is (10 sec: 5444.9, 60 sec: 5516.4, 300 sec: 5525.4). Total num frames: 822963200. Throughput: 0: 4933.4. Samples: 822957916. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:35:24,087][25689] Avg episode reward: [(0, '-1.992')] [2022-07-10 16:35:25,762][26022] Updated weights on worker 0-0, policy_version 803684 (0.00088) [2022-07-10 16:35:27,726][26022] Updated weights on worker 0-0, policy_version 803694 (0.00089) [2022-07-10 16:35:29,106][25689] Fps is (10 sec: 5574.0, 60 sec: 5532.0, 300 sec: 5529.3). Total num frames: 822990848. Throughput: 0: 5733.3. Samples: 822990858. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2022-07-10 16:35:29,107][25689] Avg episode reward: [(0, '-2.082')] [2022-07-10 16:35:29,407][26022] Updated weights on worker 0-0, policy_version 803704 (0.00083) [2022-07-10 16:35:31,417][26022] Updated weights on worker 0-0, policy_version 803714 (0.00086) [2022-07-10 16:35:33,167][26022] Updated weights on worker 0-0, policy_version 803724 (0.00092) [2022-07-10 16:35:34,119][25689] Fps is (10 sec: 5511.7, 60 sec: 5534.3, 300 sec: 5525.8). Total num frames: 823018496. Throughput: 0: 5755.1. Samples: 823024310. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:35:34,120][25689] Avg episode reward: [(0, '-2.349')] [2022-07-10 16:35:34,996][26022] Updated weights on worker 0-0, policy_version 803734 (0.00087) [2022-07-10 16:35:36,822][26022] Updated weights on worker 0-0, policy_version 803744 (0.00087) [2022-07-10 16:35:38,658][26022] Updated weights on worker 0-0, policy_version 803754 (0.00088) [2022-07-10 16:35:39,159][25689] Fps is (10 sec: 5500.8, 60 sec: 5499.9, 300 sec: 5526.0). Total num frames: 823046144. Throughput: 0: 4955.6. Samples: 823041302. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:35:39,159][25689] Avg episode reward: [(0, '-1.485')] [2022-07-10 16:35:40,560][26022] Updated weights on worker 0-0, policy_version 803764 (0.00078) [2022-07-10 16:35:42,501][26022] Updated weights on worker 0-0, policy_version 803774 (0.00084) [2022-07-10 16:35:43,953][26022] Updated weights on worker 0-0, policy_version 803784 (0.00086) [2022-07-10 16:35:44,214][25689] Fps is (10 sec: 5680.4, 60 sec: 5533.3, 300 sec: 5530.0). Total num frames: 823075840. Throughput: 0: 5817.8. Samples: 823075014. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:35:44,215][25689] Avg episode reward: [(0, '-0.625')] [2022-07-10 16:35:46,135][26022] Updated weights on worker 0-0, policy_version 803794 (0.00081) [2022-07-10 16:35:47,560][26022] Updated weights on worker 0-0, policy_version 803804 (0.00086) [2022-07-10 16:35:49,267][25689] Fps is (10 sec: 5673.0, 60 sec: 5535.1, 300 sec: 5530.6). Total num frames: 823103488. Throughput: 0: 5866.9. Samples: 823109138. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:35:49,267][25689] Avg episode reward: [(0, '0.718')] [2022-07-10 16:35:49,534][26022] Updated weights on worker 0-0, policy_version 803814 (0.00092) [2022-07-10 16:35:51,291][26022] Updated weights on worker 0-0, policy_version 803824 (0.00083) [2022-07-10 16:35:53,203][26022] Updated weights on worker 0-0, policy_version 803834 (0.00082) [2022-07-10 16:35:54,284][25689] Fps is (10 sec: 5694.6, 60 sec: 5558.8, 300 sec: 5537.4). Total num frames: 823133184. Throughput: 0: 5045.0. Samples: 823126042. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:35:54,285][25689] Avg episode reward: [(0, '0.531')] [2022-07-10 16:35:55,080][26022] Updated weights on worker 0-0, policy_version 803844 (0.00089) [2022-07-10 16:35:56,835][26022] Updated weights on worker 0-0, policy_version 803854 (0.00084) [2022-07-10 16:35:58,631][26022] Updated weights on worker 0-0, policy_version 803864 (0.00090) [2022-07-10 16:35:59,305][25689] Fps is (10 sec: 5610.7, 60 sec: 5541.6, 300 sec: 5539.2). Total num frames: 823159808. Throughput: 0: 5870.7. Samples: 823159574. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:35:59,305][25689] Avg episode reward: [(0, '-0.095')] [2022-07-10 16:36:00,416][26022] Updated weights on worker 0-0, policy_version 803874 (0.00085) [2022-07-10 16:36:02,599][26022] Updated weights on worker 0-0, policy_version 803884 (0.00067) [2022-07-10 16:36:04,347][25689] Fps is (10 sec: 5189.8, 60 sec: 5550.1, 300 sec: 5533.1). Total num frames: 823185408. Throughput: 0: 5774.5. Samples: 823191270. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:04,350][25689] Avg episode reward: [(0, '-0.075')] [2022-07-10 16:36:04,638][26022] Updated weights on worker 0-0, policy_version 803894 (0.00085) [2022-07-10 16:36:06,344][26022] Updated weights on worker 0-0, policy_version 803904 (0.00086) [2022-07-10 16:36:08,196][26022] Updated weights on worker 0-0, policy_version 803914 (0.00092) [2022-07-10 16:36:09,397][25689] Fps is (10 sec: 5377.5, 60 sec: 5564.1, 300 sec: 5532.9). Total num frames: 823214080. Throughput: 0: 4907.1. Samples: 823207920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:09,398][25689] Avg episode reward: [(0, '-0.066')] [2022-07-10 16:36:10,155][26022] Updated weights on worker 0-0, policy_version 803924 (0.00102) [2022-07-10 16:36:11,956][26022] Updated weights on worker 0-0, policy_version 803934 (0.00092) [2022-07-10 16:36:13,756][26022] Updated weights on worker 0-0, policy_version 803944 (0.00097) [2022-07-10 16:36:14,429][25689] Fps is (10 sec: 5586.1, 60 sec: 5517.9, 300 sec: 5533.5). Total num frames: 823241728. Throughput: 0: 5723.4. Samples: 823241340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:14,431][25689] Avg episode reward: [(0, '0.013')] [2022-07-10 16:36:15,663][26022] Updated weights on worker 0-0, policy_version 803954 (0.00087) [2022-07-10 16:36:17,481][26022] Updated weights on worker 0-0, policy_version 803964 (0.00087) [2022-07-10 16:36:19,303][26022] Updated weights on worker 0-0, policy_version 803974 (0.00090) [2022-07-10 16:36:19,435][25689] Fps is (10 sec: 5610.7, 60 sec: 5557.8, 300 sec: 5537.9). Total num frames: 823270400. Throughput: 0: 5733.0. Samples: 823274982. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:19,436][25689] Avg episode reward: [(0, '0.060')] [2022-07-10 16:36:21,088][26022] Updated weights on worker 0-0, policy_version 803984 (0.00092) [2022-07-10 16:36:22,867][26022] Updated weights on worker 0-0, policy_version 803994 (0.00107) [2022-07-10 16:36:24,552][25689] Fps is (10 sec: 5664.6, 60 sec: 5554.8, 300 sec: 5532.7). Total num frames: 823299072. Throughput: 0: 4974.9. Samples: 823291790. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:24,553][25689] Avg episode reward: [(0, '-0.254')] [2022-07-10 16:36:24,718][26022] Updated weights on worker 0-0, policy_version 804004 (0.00090) [2022-07-10 16:36:26,738][26022] Updated weights on worker 0-0, policy_version 804014 (0.00089) [2022-07-10 16:36:28,450][26022] Updated weights on worker 0-0, policy_version 804024 (0.00090) [2022-07-10 16:36:29,572][25689] Fps is (10 sec: 5454.9, 60 sec: 5537.8, 300 sec: 5533.8). Total num frames: 823325696. Throughput: 0: 5797.1. Samples: 823324876. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:29,573][25689] Avg episode reward: [(0, '0.702')] [2022-07-10 16:36:30,302][26022] Updated weights on worker 0-0, policy_version 804034 (0.00091) [2022-07-10 16:36:32,216][26022] Updated weights on worker 0-0, policy_version 804044 (0.00091) [2022-07-10 16:36:34,095][26022] Updated weights on worker 0-0, policy_version 804054 (0.00086) [2022-07-10 16:36:34,575][25689] Fps is (10 sec: 5517.2, 60 sec: 5555.7, 300 sec: 5533.9). Total num frames: 823354368. Throughput: 0: 5797.9. Samples: 823358142. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:34,575][25689] Avg episode reward: [(0, '0.630')] [2022-07-10 16:36:35,802][26022] Updated weights on worker 0-0, policy_version 804064 (0.00087) [2022-07-10 16:36:37,655][26022] Updated weights on worker 0-0, policy_version 804074 (0.00089) [2022-07-10 16:36:39,269][26022] Updated weights on worker 0-0, policy_version 804084 (0.00084) [2022-07-10 16:36:39,588][25689] Fps is (10 sec: 5725.5, 60 sec: 5575.1, 300 sec: 5531.2). Total num frames: 823383040. Throughput: 0: 4974.8. Samples: 823375236. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:39,588][25689] Avg episode reward: [(0, '-0.975')] [2022-07-10 16:36:41,251][26022] Updated weights on worker 0-0, policy_version 804094 (0.00086) [2022-07-10 16:36:43,097][26022] Updated weights on worker 0-0, policy_version 804104 (0.00054) [2022-07-10 16:36:44,694][25689] Fps is (10 sec: 5565.6, 60 sec: 5536.5, 300 sec: 5536.5). Total num frames: 823410688. Throughput: 0: 5817.8. Samples: 823408972. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:44,695][25689] Avg episode reward: [(0, '-1.144')] [2022-07-10 16:36:44,814][26022] Updated weights on worker 0-0, policy_version 804114 (0.00087) [2022-07-10 16:36:46,571][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:36:46,580][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000804122_823420928.pth [2022-07-10 16:36:46,580][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000802177_821429248.pth [2022-07-10 16:36:46,704][26022] Updated weights on worker 0-0, policy_version 804124 (0.00091) [2022-07-10 16:36:48,567][26022] Updated weights on worker 0-0, policy_version 804134 (0.00086) [2022-07-10 16:36:49,765][25689] Fps is (10 sec: 5433.2, 60 sec: 5534.8, 300 sec: 5528.4). Total num frames: 823438336. Throughput: 0: 5830.1. Samples: 823442602. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:49,766][25689] Avg episode reward: [(0, '-2.650')] [2022-07-10 16:36:50,387][26022] Updated weights on worker 0-0, policy_version 804144 (0.00219) [2022-07-10 16:36:52,354][26022] Updated weights on worker 0-0, policy_version 804154 (0.00085) [2022-07-10 16:36:53,999][26022] Updated weights on worker 0-0, policy_version 804164 (0.00087) [2022-07-10 16:36:54,812][25689] Fps is (10 sec: 5667.7, 60 sec: 5532.1, 300 sec: 5534.7). Total num frames: 823468032. Throughput: 0: 5842.0. Samples: 823476366. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:54,812][25689] Avg episode reward: [(0, '-2.508')] [2022-07-10 16:36:55,979][26022] Updated weights on worker 0-0, policy_version 804174 (0.00098) [2022-07-10 16:36:57,843][26022] Updated weights on worker 0-0, policy_version 804184 (0.00081) [2022-07-10 16:36:59,548][26022] Updated weights on worker 0-0, policy_version 804194 (0.00079) [2022-07-10 16:36:59,901][25689] Fps is (10 sec: 5657.4, 60 sec: 5542.8, 300 sec: 5537.8). Total num frames: 823495680. Throughput: 0: 5792.8. Samples: 823492908. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:36:59,902][25689] Avg episode reward: [(0, '-2.622')] [2022-07-10 16:37:01,423][26022] Updated weights on worker 0-0, policy_version 804204 (0.00090) [2022-07-10 16:37:03,577][26022] Updated weights on worker 0-0, policy_version 804214 (0.00085) [2022-07-10 16:37:04,971][25689] Fps is (10 sec: 5241.1, 60 sec: 5540.2, 300 sec: 5533.4). Total num frames: 823521280. Throughput: 0: 5702.7. Samples: 823524608. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:04,972][25689] Avg episode reward: [(0, '-1.515')] [2022-07-10 16:37:05,579][26022] Updated weights on worker 0-0, policy_version 804224 (0.00091) [2022-07-10 16:37:07,311][26022] Updated weights on worker 0-0, policy_version 804234 (0.00087) [2022-07-10 16:37:09,195][26022] Updated weights on worker 0-0, policy_version 804244 (0.00091) [2022-07-10 16:37:10,000][25689] Fps is (10 sec: 5475.2, 60 sec: 5559.0, 300 sec: 5536.4). Total num frames: 823550976. Throughput: 0: 5720.1. Samples: 823558350. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:10,001][25689] Avg episode reward: [(0, '-0.986')] [2022-07-10 16:37:11,104][26022] Updated weights on worker 0-0, policy_version 804254 (0.00093) [2022-07-10 16:37:12,544][26022] Updated weights on worker 0-0, policy_version 804264 (0.00092) [2022-07-10 16:37:14,896][26022] Updated weights on worker 0-0, policy_version 804274 (0.00093) [2022-07-10 16:37:15,006][25689] Fps is (10 sec: 5612.5, 60 sec: 5544.5, 300 sec: 5533.4). Total num frames: 823577600. Throughput: 0: 4887.2. Samples: 823575060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:15,007][25689] Avg episode reward: [(0, '-0.711')] [2022-07-10 16:37:16,355][26022] Updated weights on worker 0-0, policy_version 804284 (0.00080) [2022-07-10 16:37:18,317][26022] Updated weights on worker 0-0, policy_version 804294 (0.00086) [2022-07-10 16:37:20,006][26022] Updated weights on worker 0-0, policy_version 804304 (0.00086) [2022-07-10 16:37:20,105][25689] Fps is (10 sec: 5573.7, 60 sec: 5552.9, 300 sec: 5536.7). Total num frames: 823607296. Throughput: 0: 5735.8. Samples: 823608794. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:20,105][25689] Avg episode reward: [(0, '0.682')] [2022-07-10 16:37:21,933][26022] Updated weights on worker 0-0, policy_version 804314 (0.00087) [2022-07-10 16:37:23,767][26022] Updated weights on worker 0-0, policy_version 804324 (0.01287) [2022-07-10 16:37:25,163][25689] Fps is (10 sec: 5646.1, 60 sec: 5541.5, 300 sec: 5532.5). Total num frames: 823634944. Throughput: 0: 5833.7. Samples: 823642398. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:25,164][25689] Avg episode reward: [(0, '-0.300')] [2022-07-10 16:37:25,638][26022] Updated weights on worker 0-0, policy_version 804334 (0.00092) [2022-07-10 16:37:27,395][26022] Updated weights on worker 0-0, policy_version 804344 (0.00085) [2022-07-10 16:37:29,340][26022] Updated weights on worker 0-0, policy_version 804354 (0.00089) [2022-07-10 16:37:30,227][25689] Fps is (10 sec: 5462.9, 60 sec: 5554.3, 300 sec: 5535.0). Total num frames: 823662592. Throughput: 0: 4965.3. Samples: 823658782. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:30,228][25689] Avg episode reward: [(0, '-0.292')] [2022-07-10 16:37:31,118][26022] Updated weights on worker 0-0, policy_version 804364 (0.00086) [2022-07-10 16:37:32,943][26022] Updated weights on worker 0-0, policy_version 804374 (0.00089) [2022-07-10 16:37:35,044][26022] Updated weights on worker 0-0, policy_version 804384 (0.00092) [2022-07-10 16:37:35,308][25689] Fps is (10 sec: 5551.5, 60 sec: 5547.2, 300 sec: 5537.2). Total num frames: 823691264. Throughput: 0: 5776.3. Samples: 823692328. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:35,308][25689] Avg episode reward: [(0, '-1.473')] [2022-07-10 16:37:36,606][26022] Updated weights on worker 0-0, policy_version 804394 (0.00054) [2022-07-10 16:37:38,478][26022] Updated weights on worker 0-0, policy_version 804404 (0.00088) [2022-07-10 16:37:40,261][26022] Updated weights on worker 0-0, policy_version 804414 (0.00087) [2022-07-10 16:37:40,395][25689] Fps is (10 sec: 5639.9, 60 sec: 5540.4, 300 sec: 5540.8). Total num frames: 823719936. Throughput: 0: 5766.0. Samples: 823725784. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:40,395][25689] Avg episode reward: [(0, '-1.291')] [2022-07-10 16:37:42,037][26022] Updated weights on worker 0-0, policy_version 804424 (0.00083) [2022-07-10 16:37:44,076][26022] Updated weights on worker 0-0, policy_version 804434 (0.00088) [2022-07-10 16:37:45,439][25689] Fps is (10 sec: 5558.7, 60 sec: 5546.0, 300 sec: 5536.6). Total num frames: 823747584. Throughput: 0: 4937.2. Samples: 823742510. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:45,440][25689] Avg episode reward: [(0, '-1.413')] [2022-07-10 16:37:45,665][26022] Updated weights on worker 0-0, policy_version 804444 (0.00085) [2022-07-10 16:37:47,858][26022] Updated weights on worker 0-0, policy_version 804454 (0.00098) [2022-07-10 16:37:49,607][26022] Updated weights on worker 0-0, policy_version 804464 (0.00085) [2022-07-10 16:37:50,460][25689] Fps is (10 sec: 5493.6, 60 sec: 5550.6, 300 sec: 5533.3). Total num frames: 823775232. Throughput: 0: 5783.1. Samples: 823775792. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:50,462][25689] Avg episode reward: [(0, '-1.702')] [2022-07-10 16:37:51,509][26022] Updated weights on worker 0-0, policy_version 804474 (0.00093) [2022-07-10 16:37:53,307][26022] Updated weights on worker 0-0, policy_version 804484 (0.00080) [2022-07-10 16:37:55,140][26022] Updated weights on worker 0-0, policy_version 804494 (0.00093) [2022-07-10 16:37:55,485][25689] Fps is (10 sec: 5504.5, 60 sec: 5518.9, 300 sec: 5536.6). Total num frames: 823802880. Throughput: 0: 5797.8. Samples: 823809314. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:37:55,486][25689] Avg episode reward: [(0, '-0.664')] [2022-07-10 16:37:56,860][26022] Updated weights on worker 0-0, policy_version 804504 (0.00086) [2022-07-10 16:37:59,069][26022] Updated weights on worker 0-0, policy_version 804514 (0.00095) [2022-07-10 16:38:00,531][25689] Fps is (10 sec: 5592.5, 60 sec: 5539.7, 300 sec: 5544.2). Total num frames: 823831552. Throughput: 0: 4981.1. Samples: 823826084. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:00,533][25689] Avg episode reward: [(0, '-1.641')] [2022-07-10 16:38:00,559][26022] Updated weights on worker 0-0, policy_version 804524 (0.00078) [2022-07-10 16:38:03,007][26022] Updated weights on worker 0-0, policy_version 804534 (0.00088) [2022-07-10 16:38:04,429][26022] Updated weights on worker 0-0, policy_version 804544 (0.00085) [2022-07-10 16:38:05,600][25689] Fps is (10 sec: 5466.8, 60 sec: 5556.7, 300 sec: 5537.2). Total num frames: 823858176. Throughput: 0: 5693.6. Samples: 823857296. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:05,601][25689] Avg episode reward: [(0, '-0.678')] [2022-07-10 16:38:06,591][26022] Updated weights on worker 0-0, policy_version 804554 (0.00092) [2022-07-10 16:38:08,202][26022] Updated weights on worker 0-0, policy_version 804564 (0.00084) [2022-07-10 16:38:10,527][26022] Updated weights on worker 0-0, policy_version 804574 (0.00093) [2022-07-10 16:38:10,627][25689] Fps is (10 sec: 5172.9, 60 sec: 5489.4, 300 sec: 5530.1). Total num frames: 823883776. Throughput: 0: 5700.6. Samples: 823890752. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:10,627][25689] Avg episode reward: [(0, '-0.973')] [2022-07-10 16:38:11,920][26022] Updated weights on worker 0-0, policy_version 804584 (0.00086) [2022-07-10 16:38:13,981][26022] Updated weights on worker 0-0, policy_version 804594 (0.00082) [2022-07-10 16:38:15,596][26022] Updated weights on worker 0-0, policy_version 804604 (0.00090) [2022-07-10 16:38:15,694][25689] Fps is (10 sec: 5579.7, 60 sec: 5551.3, 300 sec: 5535.8). Total num frames: 823914496. Throughput: 0: 4861.7. Samples: 823907562. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:15,694][25689] Avg episode reward: [(0, '-0.590')] [2022-07-10 16:38:17,535][26022] Updated weights on worker 0-0, policy_version 804614 (0.00087) [2022-07-10 16:38:19,439][26022] Updated weights on worker 0-0, policy_version 804624 (0.00094) [2022-07-10 16:38:20,719][25689] Fps is (10 sec: 5783.5, 60 sec: 5524.3, 300 sec: 5536.2). Total num frames: 823942144. Throughput: 0: 5709.5. Samples: 823941346. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:20,719][25689] Avg episode reward: [(0, '-0.064')] [2022-07-10 16:38:20,986][26022] Updated weights on worker 0-0, policy_version 804634 (0.00089) [2022-07-10 16:38:23,142][26022] Updated weights on worker 0-0, policy_version 804644 (0.00091) [2022-07-10 16:38:24,763][26022] Updated weights on worker 0-0, policy_version 804654 (0.00105) [2022-07-10 16:38:25,816][25689] Fps is (10 sec: 5462.6, 60 sec: 5520.6, 300 sec: 5531.3). Total num frames: 823969792. Throughput: 0: 5815.7. Samples: 823974868. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:25,817][25689] Avg episode reward: [(0, '0.322')] [2022-07-10 16:38:26,636][26022] Updated weights on worker 0-0, policy_version 804664 (0.00086) [2022-07-10 16:38:28,900][26022] Updated weights on worker 0-0, policy_version 804674 (0.00084) [2022-07-10 16:38:30,423][26022] Updated weights on worker 0-0, policy_version 804684 (0.00093) [2022-07-10 16:38:30,855][25689] Fps is (10 sec: 5556.4, 60 sec: 5539.9, 300 sec: 5534.3). Total num frames: 823998464. Throughput: 0: 5796.2. Samples: 824007998. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:30,855][25689] Avg episode reward: [(0, '0.896')] [2022-07-10 16:38:32,403][26022] Updated weights on worker 0-0, policy_version 804694 (0.00082) [2022-07-10 16:38:34,213][26022] Updated weights on worker 0-0, policy_version 804704 (0.00091) [2022-07-10 16:38:35,857][26022] Updated weights on worker 0-0, policy_version 804714 (0.00084) [2022-07-10 16:38:35,933][25689] Fps is (10 sec: 5769.5, 60 sec: 5557.0, 300 sec: 5539.8). Total num frames: 824028160. Throughput: 0: 5800.8. Samples: 824024966. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:35,934][25689] Avg episode reward: [(0, '0.934')] [2022-07-10 16:38:37,901][26022] Updated weights on worker 0-0, policy_version 804724 (0.00092) [2022-07-10 16:38:39,332][26022] Updated weights on worker 0-0, policy_version 804734 (0.00093) [2022-07-10 16:38:40,943][25689] Fps is (10 sec: 5481.2, 60 sec: 5513.4, 300 sec: 5528.5). Total num frames: 824053760. Throughput: 0: 5805.1. Samples: 824058750. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:40,943][25689] Avg episode reward: [(0, '1.012')] [2022-07-10 16:38:41,519][26022] Updated weights on worker 0-0, policy_version 804744 (0.00085) [2022-07-10 16:38:43,153][26022] Updated weights on worker 0-0, policy_version 804754 (0.00088) [2022-07-10 16:38:45,010][26022] Updated weights on worker 0-0, policy_version 804764 (0.00089) [2022-07-10 16:38:46,030][25689] Fps is (10 sec: 5476.4, 60 sec: 5543.3, 300 sec: 5533.8). Total num frames: 824083456. Throughput: 0: 5794.8. Samples: 824092002. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:46,032][25689] Avg episode reward: [(0, '1.088')] [2022-07-10 16:38:46,581][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:38:46,590][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000804772_824086528.pth [2022-07-10 16:38:46,591][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000802824_822091776.pth [2022-07-10 16:38:47,264][26022] Updated weights on worker 0-0, policy_version 804774 (0.00087) [2022-07-10 16:38:48,503][26022] Updated weights on worker 0-0, policy_version 804784 (0.00086) [2022-07-10 16:38:50,610][26022] Updated weights on worker 0-0, policy_version 804794 (0.00085) [2022-07-10 16:38:51,071][25689] Fps is (10 sec: 5661.8, 60 sec: 5541.4, 300 sec: 5537.1). Total num frames: 824111104. Throughput: 0: 4987.9. Samples: 824108836. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:51,073][25689] Avg episode reward: [(0, '1.178')] [2022-07-10 16:38:52,620][26022] Updated weights on worker 0-0, policy_version 804804 (0.00087) [2022-07-10 16:38:54,250][26022] Updated weights on worker 0-0, policy_version 804814 (0.00090) [2022-07-10 16:38:56,090][25689] Fps is (10 sec: 5598.6, 60 sec: 5558.9, 300 sec: 5537.0). Total num frames: 824139776. Throughput: 0: 5818.8. Samples: 824142256. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:38:56,092][25689] Avg episode reward: [(0, '1.443')] [2022-07-10 16:38:56,099][26022] Updated weights on worker 0-0, policy_version 804824 (0.00096) [2022-07-10 16:38:58,061][26022] Updated weights on worker 0-0, policy_version 804834 (0.00091) [2022-07-10 16:38:59,815][26022] Updated weights on worker 0-0, policy_version 804844 (0.00098) [2022-07-10 16:39:01,140][25689] Fps is (10 sec: 5593.7, 60 sec: 5541.6, 300 sec: 5541.3). Total num frames: 824167424. Throughput: 0: 5780.0. Samples: 824175486. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 16:39:01,141][25689] Avg episode reward: [(0, '0.927')] [2022-07-10 16:39:02,011][26022] Updated weights on worker 0-0, policy_version 804854 (0.00101) [2022-07-10 16:39:03,802][26022] Updated weights on worker 0-0, policy_version 804864 (0.00094) [2022-07-10 16:39:05,943][26022] Updated weights on worker 0-0, policy_version 804874 (0.00090) [2022-07-10 16:39:06,227][25689] Fps is (10 sec: 5354.0, 60 sec: 5540.0, 300 sec: 5537.2). Total num frames: 824194048. Throughput: 0: 4855.8. Samples: 824190072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:06,227][25689] Avg episode reward: [(0, '-0.059')] [2022-07-10 16:39:07,599][26022] Updated weights on worker 0-0, policy_version 804884 (0.00098) [2022-07-10 16:39:09,326][26022] Updated weights on worker 0-0, policy_version 804894 (0.00083) [2022-07-10 16:39:11,232][25689] Fps is (10 sec: 5276.2, 60 sec: 5558.9, 300 sec: 5537.7). Total num frames: 824220672. Throughput: 0: 5703.3. Samples: 824223818. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:11,232][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 16:39:11,312][26022] Updated weights on worker 0-0, policy_version 804904 (0.00084) [2022-07-10 16:39:12,780][26022] Updated weights on worker 0-0, policy_version 804914 (0.00089) [2022-07-10 16:39:14,891][26022] Updated weights on worker 0-0, policy_version 804924 (0.00097) [2022-07-10 16:39:16,244][25689] Fps is (10 sec: 5520.2, 60 sec: 5530.1, 300 sec: 5537.5). Total num frames: 824249344. Throughput: 0: 5703.0. Samples: 824257192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:16,244][25689] Avg episode reward: [(0, '-0.714')] [2022-07-10 16:39:16,887][26022] Updated weights on worker 0-0, policy_version 804934 (0.00087) [2022-07-10 16:39:18,571][26022] Updated weights on worker 0-0, policy_version 804944 (0.00091) [2022-07-10 16:39:20,439][26022] Updated weights on worker 0-0, policy_version 804954 (0.00082) [2022-07-10 16:39:21,252][25689] Fps is (10 sec: 5722.7, 60 sec: 5548.5, 300 sec: 5538.3). Total num frames: 824278016. Throughput: 0: 4896.1. Samples: 824273960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:21,253][25689] Avg episode reward: [(0, '-0.901')] [2022-07-10 16:39:22,379][26022] Updated weights on worker 0-0, policy_version 804964 (0.00057) [2022-07-10 16:39:23,927][26022] Updated weights on worker 0-0, policy_version 804974 (0.00080) [2022-07-10 16:39:26,210][26022] Updated weights on worker 0-0, policy_version 804984 (0.00094) [2022-07-10 16:39:26,310][25689] Fps is (10 sec: 5493.1, 60 sec: 5535.2, 300 sec: 5537.3). Total num frames: 824304640. Throughput: 0: 5836.0. Samples: 824307280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:26,311][25689] Avg episode reward: [(0, '-0.777')] [2022-07-10 16:39:27,623][26022] Updated weights on worker 0-0, policy_version 804994 (0.00086) [2022-07-10 16:39:29,820][26022] Updated weights on worker 0-0, policy_version 805004 (0.00097) [2022-07-10 16:39:31,313][25689] Fps is (10 sec: 5496.0, 60 sec: 5538.5, 300 sec: 5541.4). Total num frames: 824333312. Throughput: 0: 5791.5. Samples: 824340122. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:31,315][25689] Avg episode reward: [(0, '-0.596')] [2022-07-10 16:39:31,351][26022] Updated weights on worker 0-0, policy_version 805014 (0.00088) [2022-07-10 16:39:33,412][26022] Updated weights on worker 0-0, policy_version 805024 (0.00093) [2022-07-10 16:39:35,305][26022] Updated weights on worker 0-0, policy_version 805034 (0.00090) [2022-07-10 16:39:36,330][25689] Fps is (10 sec: 5621.1, 60 sec: 5510.2, 300 sec: 5534.8). Total num frames: 824360960. Throughput: 0: 4970.8. Samples: 824357034. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:36,331][25689] Avg episode reward: [(0, '0.711')] [2022-07-10 16:39:37,067][26022] Updated weights on worker 0-0, policy_version 805044 (0.00088) [2022-07-10 16:39:38,905][26022] Updated weights on worker 0-0, policy_version 805054 (0.00084) [2022-07-10 16:39:40,809][26022] Updated weights on worker 0-0, policy_version 805064 (0.00091) [2022-07-10 16:39:41,337][25689] Fps is (10 sec: 5414.2, 60 sec: 5527.4, 300 sec: 5532.2). Total num frames: 824387584. Throughput: 0: 5814.1. Samples: 824390738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:41,339][25689] Avg episode reward: [(0, '-0.362')] [2022-07-10 16:39:42,424][26022] Updated weights on worker 0-0, policy_version 805074 (0.00623) [2022-07-10 16:39:44,466][26022] Updated weights on worker 0-0, policy_version 805084 (0.00090) [2022-07-10 16:39:46,187][26022] Updated weights on worker 0-0, policy_version 805094 (0.00098) [2022-07-10 16:39:46,382][25689] Fps is (10 sec: 5602.5, 60 sec: 5531.3, 300 sec: 5539.6). Total num frames: 824417280. Throughput: 0: 5848.6. Samples: 824424674. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:46,383][25689] Avg episode reward: [(0, '-0.217')] [2022-07-10 16:39:48,159][26022] Updated weights on worker 0-0, policy_version 805104 (0.00083) [2022-07-10 16:39:49,746][26022] Updated weights on worker 0-0, policy_version 805114 (0.00093) [2022-07-10 16:39:51,392][25689] Fps is (10 sec: 5805.1, 60 sec: 5551.1, 300 sec: 5541.1). Total num frames: 824445952. Throughput: 0: 5040.5. Samples: 824441328. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:51,393][25689] Avg episode reward: [(0, '-0.264')] [2022-07-10 16:39:51,757][26022] Updated weights on worker 0-0, policy_version 805124 (0.00089) [2022-07-10 16:39:53,552][26022] Updated weights on worker 0-0, policy_version 805134 (0.00092) [2022-07-10 16:39:55,233][26022] Updated weights on worker 0-0, policy_version 805144 (0.00084) [2022-07-10 16:39:56,397][25689] Fps is (10 sec: 5521.3, 60 sec: 5518.4, 300 sec: 5537.9). Total num frames: 824472576. Throughput: 0: 5870.3. Samples: 824474838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:39:56,398][25689] Avg episode reward: [(0, '-0.051')] [2022-07-10 16:39:57,181][26022] Updated weights on worker 0-0, policy_version 805154 (0.00086) [2022-07-10 16:39:59,148][26022] Updated weights on worker 0-0, policy_version 805164 (0.00088) [2022-07-10 16:40:00,851][26022] Updated weights on worker 0-0, policy_version 805174 (0.00095) [2022-07-10 16:40:01,421][25689] Fps is (10 sec: 5513.5, 60 sec: 5537.7, 300 sec: 5550.3). Total num frames: 824501248. Throughput: 0: 5846.9. Samples: 824508166. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:01,422][25689] Avg episode reward: [(0, '-0.632')] [2022-07-10 16:40:03,171][26022] Updated weights on worker 0-0, policy_version 805184 (0.00435) [2022-07-10 16:40:04,975][26022] Updated weights on worker 0-0, policy_version 805194 (0.00082) [2022-07-10 16:40:06,483][25689] Fps is (10 sec: 5381.3, 60 sec: 5523.1, 300 sec: 5542.6). Total num frames: 824526848. Throughput: 0: 4880.5. Samples: 824522774. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:06,485][25689] Avg episode reward: [(0, '-0.654')] [2022-07-10 16:40:06,650][26022] Updated weights on worker 0-0, policy_version 805204 (0.00090) [2022-07-10 16:40:08,734][26022] Updated weights on worker 0-0, policy_version 805214 (0.00094) [2022-07-10 16:40:10,202][26022] Updated weights on worker 0-0, policy_version 805224 (0.00086) [2022-07-10 16:40:11,499][25689] Fps is (10 sec: 5284.0, 60 sec: 5539.1, 300 sec: 5533.5). Total num frames: 824554496. Throughput: 0: 5744.7. Samples: 824556834. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:11,500][25689] Avg episode reward: [(0, '0.399')] [2022-07-10 16:40:12,340][26022] Updated weights on worker 0-0, policy_version 805234 (0.00088) [2022-07-10 16:40:14,048][26022] Updated weights on worker 0-0, policy_version 805244 (0.00090) [2022-07-10 16:40:15,827][26022] Updated weights on worker 0-0, policy_version 805254 (0.00089) [2022-07-10 16:40:16,516][25689] Fps is (10 sec: 5715.2, 60 sec: 5555.6, 300 sec: 5544.8). Total num frames: 824584192. Throughput: 0: 5760.0. Samples: 824590724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:16,517][25689] Avg episode reward: [(0, '0.567')] [2022-07-10 16:40:17,737][26022] Updated weights on worker 0-0, policy_version 805264 (0.00089) [2022-07-10 16:40:19,266][26022] Updated weights on worker 0-0, policy_version 805274 (0.00083) [2022-07-10 16:40:21,539][25689] Fps is (10 sec: 5609.2, 60 sec: 5520.3, 300 sec: 5539.1). Total num frames: 824610816. Throughput: 0: 4934.2. Samples: 824607432. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:21,539][26022] Updated weights on worker 0-0, policy_version 805284 (0.00084) [2022-07-10 16:40:21,540][25689] Avg episode reward: [(0, '0.261')] [2022-07-10 16:40:23,136][26022] Updated weights on worker 0-0, policy_version 805294 (0.00091) [2022-07-10 16:40:25,002][26022] Updated weights on worker 0-0, policy_version 805304 (0.00082) [2022-07-10 16:40:26,587][25689] Fps is (10 sec: 5490.6, 60 sec: 5555.1, 300 sec: 5542.0). Total num frames: 824639488. Throughput: 0: 5881.0. Samples: 824641010. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:26,588][25689] Avg episode reward: [(0, '-0.162')] [2022-07-10 16:40:26,856][26022] Updated weights on worker 0-0, policy_version 805314 (0.00113) [2022-07-10 16:40:28,512][26022] Updated weights on worker 0-0, policy_version 805324 (0.00086) [2022-07-10 16:40:30,469][26022] Updated weights on worker 0-0, policy_version 805334 (0.00103) [2022-07-10 16:40:31,623][25689] Fps is (10 sec: 5686.9, 60 sec: 5552.2, 300 sec: 5545.0). Total num frames: 824668160. Throughput: 0: 5837.7. Samples: 824674312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:31,623][25689] Avg episode reward: [(0, '-0.121')] [2022-07-10 16:40:32,650][26022] Updated weights on worker 0-0, policy_version 805344 (0.00065) [2022-07-10 16:40:34,005][26022] Updated weights on worker 0-0, policy_version 805354 (0.00087) [2022-07-10 16:40:36,333][26022] Updated weights on worker 0-0, policy_version 805364 (0.00087) [2022-07-10 16:40:36,693][25689] Fps is (10 sec: 5471.9, 60 sec: 5530.2, 300 sec: 5541.0). Total num frames: 824694784. Throughput: 0: 4961.9. Samples: 824690838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:36,693][25689] Avg episode reward: [(0, '-0.180')] [2022-07-10 16:40:37,689][26022] Updated weights on worker 0-0, policy_version 805374 (0.00087) [2022-07-10 16:40:39,962][26022] Updated weights on worker 0-0, policy_version 805384 (0.00085) [2022-07-10 16:40:41,286][26022] Updated weights on worker 0-0, policy_version 805394 (0.00091) [2022-07-10 16:40:41,720][25689] Fps is (10 sec: 5679.4, 60 sec: 5596.4, 300 sec: 5545.0). Total num frames: 824725504. Throughput: 0: 5799.6. Samples: 824724470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:41,720][25689] Avg episode reward: [(0, '-0.918')] [2022-07-10 16:40:43,410][26022] Updated weights on worker 0-0, policy_version 805404 (0.00087) [2022-07-10 16:40:45,025][26022] Updated weights on worker 0-0, policy_version 805414 (0.00091) [2022-07-10 16:40:46,646][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:40:46,669][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000805422_824752128.pth [2022-07-10 16:40:46,670][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000803474_822757376.pth [2022-07-10 16:40:46,815][25689] Fps is (10 sec: 5665.4, 60 sec: 5540.9, 300 sec: 5540.8). Total num frames: 824752128. Throughput: 0: 5800.3. Samples: 824758334. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:46,815][25689] Avg episode reward: [(0, '-0.831')] [2022-07-10 16:40:47,130][26022] Updated weights on worker 0-0, policy_version 805424 (0.00091) [2022-07-10 16:40:48,664][26022] Updated weights on worker 0-0, policy_version 805434 (0.00080) [2022-07-10 16:40:50,710][26022] Updated weights on worker 0-0, policy_version 805444 (0.00098) [2022-07-10 16:40:51,832][25689] Fps is (10 sec: 5569.5, 60 sec: 5557.1, 300 sec: 5540.8). Total num frames: 824781824. Throughput: 0: 4984.6. Samples: 824775046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:51,832][25689] Avg episode reward: [(0, '-0.664')] [2022-07-10 16:40:52,615][26022] Updated weights on worker 0-0, policy_version 805454 (0.00096) [2022-07-10 16:40:54,417][26022] Updated weights on worker 0-0, policy_version 805464 (0.00117) [2022-07-10 16:40:56,211][26022] Updated weights on worker 0-0, policy_version 805474 (0.00085) [2022-07-10 16:40:56,842][25689] Fps is (10 sec: 5616.6, 60 sec: 5556.7, 300 sec: 5541.0). Total num frames: 824808448. Throughput: 0: 5834.5. Samples: 824808398. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:40:56,843][25689] Avg episode reward: [(0, '-0.546')] [2022-07-10 16:40:58,066][26022] Updated weights on worker 0-0, policy_version 805484 (0.00088) [2022-07-10 16:40:59,903][26022] Updated weights on worker 0-0, policy_version 805494 (0.00089) [2022-07-10 16:41:01,847][25689] Fps is (10 sec: 5316.8, 60 sec: 5524.6, 300 sec: 5545.1). Total num frames: 824835072. Throughput: 0: 5846.4. Samples: 824842142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:01,847][25689] Avg episode reward: [(0, '0.352')] [2022-07-10 16:41:01,991][26022] Updated weights on worker 0-0, policy_version 805504 (0.00811) [2022-07-10 16:41:03,825][26022] Updated weights on worker 0-0, policy_version 805514 (0.00091) [2022-07-10 16:41:05,837][26022] Updated weights on worker 0-0, policy_version 805524 (0.00087) [2022-07-10 16:41:06,930][25689] Fps is (10 sec: 5481.7, 60 sec: 5573.4, 300 sec: 5544.5). Total num frames: 824863744. Throughput: 0: 4888.8. Samples: 824856672. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:06,930][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 16:41:07,364][26022] Updated weights on worker 0-0, policy_version 805534 (0.00091) [2022-07-10 16:41:09,600][26022] Updated weights on worker 0-0, policy_version 805544 (0.00087) [2022-07-10 16:41:11,179][26022] Updated weights on worker 0-0, policy_version 805554 (0.00088) [2022-07-10 16:41:11,964][25689] Fps is (10 sec: 5364.4, 60 sec: 5537.8, 300 sec: 5537.6). Total num frames: 824889344. Throughput: 0: 5708.8. Samples: 824889976. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:11,964][25689] Avg episode reward: [(0, '0.974')] [2022-07-10 16:41:12,975][26022] Updated weights on worker 0-0, policy_version 805564 (0.00081) [2022-07-10 16:41:14,882][26022] Updated weights on worker 0-0, policy_version 805574 (0.00090) [2022-07-10 16:41:16,967][25689] Fps is (10 sec: 5304.9, 60 sec: 5505.3, 300 sec: 5534.2). Total num frames: 824916992. Throughput: 0: 5707.9. Samples: 824923268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:16,968][25689] Avg episode reward: [(0, '-1.232')] [2022-07-10 16:41:16,987][26022] Updated weights on worker 0-0, policy_version 805584 (0.00091) [2022-07-10 16:41:18,491][26022] Updated weights on worker 0-0, policy_version 805594 (0.00084) [2022-07-10 16:41:20,595][26022] Updated weights on worker 0-0, policy_version 805604 (0.00083) [2022-07-10 16:41:22,012][25689] Fps is (10 sec: 5706.9, 60 sec: 5554.1, 300 sec: 5539.0). Total num frames: 824946688. Throughput: 0: 4854.1. Samples: 824940026. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:22,014][25689] Avg episode reward: [(0, '-1.570')] [2022-07-10 16:41:22,213][26022] Updated weights on worker 0-0, policy_version 805614 (0.00087) [2022-07-10 16:41:24,316][26022] Updated weights on worker 0-0, policy_version 805624 (0.00087) [2022-07-10 16:41:25,943][26022] Updated weights on worker 0-0, policy_version 805634 (0.00084) [2022-07-10 16:41:27,143][25689] Fps is (10 sec: 5635.4, 60 sec: 5529.6, 300 sec: 5540.3). Total num frames: 824974336. Throughput: 0: 5783.0. Samples: 824973564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:27,143][25689] Avg episode reward: [(0, '-1.265')] [2022-07-10 16:41:28,028][26022] Updated weights on worker 0-0, policy_version 805644 (0.00091) [2022-07-10 16:41:29,617][26022] Updated weights on worker 0-0, policy_version 805654 (0.00093) [2022-07-10 16:41:31,774][26022] Updated weights on worker 0-0, policy_version 805664 (0.00089) [2022-07-10 16:41:32,156][25689] Fps is (10 sec: 5551.9, 60 sec: 5531.6, 300 sec: 5540.1). Total num frames: 825003008. Throughput: 0: 5782.7. Samples: 825006742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:32,158][25689] Avg episode reward: [(0, '-2.238')] [2022-07-10 16:41:33,291][26022] Updated weights on worker 0-0, policy_version 805674 (0.00079) [2022-07-10 16:41:35,440][26022] Updated weights on worker 0-0, policy_version 805684 (0.00087) [2022-07-10 16:41:37,161][25689] Fps is (10 sec: 5519.5, 60 sec: 5537.6, 300 sec: 5533.4). Total num frames: 825029632. Throughput: 0: 4952.2. Samples: 825023274. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:37,162][25689] Avg episode reward: [(0, '-2.931')] [2022-07-10 16:41:37,240][26022] Updated weights on worker 0-0, policy_version 805694 (0.00086) [2022-07-10 16:41:38,923][26022] Updated weights on worker 0-0, policy_version 805704 (0.00092) [2022-07-10 16:41:40,931][26022] Updated weights on worker 0-0, policy_version 805714 (0.00086) [2022-07-10 16:41:42,165][25689] Fps is (10 sec: 5524.9, 60 sec: 5505.8, 300 sec: 5538.7). Total num frames: 825058304. Throughput: 0: 5783.8. Samples: 825056586. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:42,165][25689] Avg episode reward: [(0, '-2.661')] [2022-07-10 16:41:42,606][26022] Updated weights on worker 0-0, policy_version 805724 (0.00618) [2022-07-10 16:41:44,531][26022] Updated weights on worker 0-0, policy_version 805734 (0.00084) [2022-07-10 16:41:46,319][26022] Updated weights on worker 0-0, policy_version 805744 (0.00085) [2022-07-10 16:41:47,237][25689] Fps is (10 sec: 5589.5, 60 sec: 5524.8, 300 sec: 5538.7). Total num frames: 825085952. Throughput: 0: 5795.9. Samples: 825090030. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:47,238][25689] Avg episode reward: [(0, '-0.352')] [2022-07-10 16:41:48,219][26022] Updated weights on worker 0-0, policy_version 805754 (0.00087) [2022-07-10 16:41:50,028][26022] Updated weights on worker 0-0, policy_version 805764 (0.00093) [2022-07-10 16:41:51,974][26022] Updated weights on worker 0-0, policy_version 805774 (0.00087) [2022-07-10 16:41:52,254][25689] Fps is (10 sec: 5480.9, 60 sec: 5491.0, 300 sec: 5532.4). Total num frames: 825113600. Throughput: 0: 5813.2. Samples: 825123572. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:52,254][25689] Avg episode reward: [(0, '-0.374')] [2022-07-10 16:41:53,682][26022] Updated weights on worker 0-0, policy_version 805784 (0.00087) [2022-07-10 16:41:55,608][26022] Updated weights on worker 0-0, policy_version 805794 (0.00090) [2022-07-10 16:41:57,288][25689] Fps is (10 sec: 5501.4, 60 sec: 5505.7, 300 sec: 5533.4). Total num frames: 825141248. Throughput: 0: 5809.9. Samples: 825140212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:41:57,289][25689] Avg episode reward: [(0, '-1.104')] [2022-07-10 16:41:57,384][26022] Updated weights on worker 0-0, policy_version 805804 (0.00084) [2022-07-10 16:41:59,255][26022] Updated weights on worker 0-0, policy_version 805814 (0.00083) [2022-07-10 16:42:01,239][26022] Updated weights on worker 0-0, policy_version 805824 (0.00081) [2022-07-10 16:42:02,299][25689] Fps is (10 sec: 5402.8, 60 sec: 5505.2, 300 sec: 5538.0). Total num frames: 825167872. Throughput: 0: 5803.4. Samples: 825173432. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:02,299][25689] Avg episode reward: [(0, '-0.917')] [2022-07-10 16:42:03,444][26022] Updated weights on worker 0-0, policy_version 805834 (0.00091) [2022-07-10 16:42:05,308][26022] Updated weights on worker 0-0, policy_version 805844 (0.00100) [2022-07-10 16:42:07,066][26022] Updated weights on worker 0-0, policy_version 805854 (0.00052) [2022-07-10 16:42:07,442][25689] Fps is (10 sec: 5446.2, 60 sec: 5499.7, 300 sec: 5532.4). Total num frames: 825196544. Throughput: 0: 5682.1. Samples: 825204834. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:07,442][25689] Avg episode reward: [(0, '-0.998')] [2022-07-10 16:42:08,816][26022] Updated weights on worker 0-0, policy_version 805864 (0.00083) [2022-07-10 16:42:10,648][26022] Updated weights on worker 0-0, policy_version 805874 (0.00091) [2022-07-10 16:42:12,456][25689] Fps is (10 sec: 5544.9, 60 sec: 5535.4, 300 sec: 5535.7). Total num frames: 825224192. Throughput: 0: 4853.3. Samples: 825221618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:12,456][25689] Avg episode reward: [(0, '-0.810')] [2022-07-10 16:42:12,563][26022] Updated weights on worker 0-0, policy_version 805884 (0.00085) [2022-07-10 16:42:14,311][26022] Updated weights on worker 0-0, policy_version 805894 (0.00097) [2022-07-10 16:42:16,239][26022] Updated weights on worker 0-0, policy_version 805904 (0.00089) [2022-07-10 16:42:17,491][25689] Fps is (10 sec: 5604.1, 60 sec: 5549.4, 300 sec: 5533.4). Total num frames: 825252864. Throughput: 0: 5681.6. Samples: 825254998. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:17,492][25689] Avg episode reward: [(0, '-0.767')] [2022-07-10 16:42:17,954][26022] Updated weights on worker 0-0, policy_version 805914 (0.00394) [2022-07-10 16:42:19,811][26022] Updated weights on worker 0-0, policy_version 805924 (0.00086) [2022-07-10 16:42:21,751][26022] Updated weights on worker 0-0, policy_version 805934 (0.00092) [2022-07-10 16:42:22,567][25689] Fps is (10 sec: 5468.6, 60 sec: 5495.8, 300 sec: 5529.6). Total num frames: 825279488. Throughput: 0: 5678.1. Samples: 825288520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:22,568][25689] Avg episode reward: [(0, '-0.048')] [2022-07-10 16:42:23,522][26022] Updated weights on worker 0-0, policy_version 805944 (0.00087) [2022-07-10 16:42:25,388][26022] Updated weights on worker 0-0, policy_version 805954 (0.00094) [2022-07-10 16:42:27,178][26022] Updated weights on worker 0-0, policy_version 805964 (0.00086) [2022-07-10 16:42:27,666][25689] Fps is (10 sec: 5434.6, 60 sec: 5515.6, 300 sec: 5532.5). Total num frames: 825308160. Throughput: 0: 4964.4. Samples: 825305236. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:27,667][25689] Avg episode reward: [(0, '-0.254')] [2022-07-10 16:42:29,027][26022] Updated weights on worker 0-0, policy_version 805974 (0.00088) [2022-07-10 16:42:30,833][26022] Updated weights on worker 0-0, policy_version 805984 (0.00093) [2022-07-10 16:42:32,602][26022] Updated weights on worker 0-0, policy_version 805994 (0.00083) [2022-07-10 16:42:32,682][25689] Fps is (10 sec: 5770.7, 60 sec: 5532.3, 300 sec: 5537.1). Total num frames: 825337856. Throughput: 0: 5807.0. Samples: 825339072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:32,682][25689] Avg episode reward: [(0, '0.205')] [2022-07-10 16:42:34,554][26022] Updated weights on worker 0-0, policy_version 806004 (0.00092) [2022-07-10 16:42:36,458][26022] Updated weights on worker 0-0, policy_version 806014 (0.00562) [2022-07-10 16:42:37,726][25689] Fps is (10 sec: 5598.4, 60 sec: 5528.7, 300 sec: 5531.0). Total num frames: 825364480. Throughput: 0: 5810.1. Samples: 825372564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-10 16:42:37,728][25689] Avg episode reward: [(0, '0.200')] [2022-07-10 16:42:38,154][26022] Updated weights on worker 0-0, policy_version 806024 (0.00083) [2022-07-10 16:42:40,179][26022] Updated weights on worker 0-0, policy_version 806034 (0.00090) [2022-07-10 16:42:41,687][26022] Updated weights on worker 0-0, policy_version 806044 (0.00099) [2022-07-10 16:42:42,733][25689] Fps is (10 sec: 5501.7, 60 sec: 5528.5, 300 sec: 5535.2). Total num frames: 825393152. Throughput: 0: 4998.8. Samples: 825389322. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:42:42,733][25689] Avg episode reward: [(0, '-0.401')] [2022-07-10 16:42:43,859][26022] Updated weights on worker 0-0, policy_version 806054 (0.00089) [2022-07-10 16:42:45,492][26022] Updated weights on worker 0-0, policy_version 806064 (0.01391) [2022-07-10 16:42:46,781][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:42:46,792][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000806070_825415680.pth [2022-07-10 16:42:46,793][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000804122_823420928.pth [2022-07-10 16:42:47,513][26022] Updated weights on worker 0-0, policy_version 806074 (0.00087) [2022-07-10 16:42:47,796][25689] Fps is (10 sec: 5592.7, 60 sec: 5529.3, 300 sec: 5534.4). Total num frames: 825420800. Throughput: 0: 5839.1. Samples: 825422778. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:42:47,798][25689] Avg episode reward: [(0, '-0.502')] [2022-07-10 16:42:49,069][26022] Updated weights on worker 0-0, policy_version 806084 (0.00087) [2022-07-10 16:42:51,153][26022] Updated weights on worker 0-0, policy_version 806094 (0.00090) [2022-07-10 16:42:52,887][25689] Fps is (10 sec: 5546.7, 60 sec: 5539.4, 300 sec: 5536.6). Total num frames: 825449472. Throughput: 0: 5798.5. Samples: 825456228. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:42:52,888][25689] Avg episode reward: [(0, '-0.628')] [2022-07-10 16:42:53,017][26022] Updated weights on worker 0-0, policy_version 806104 (0.00059) [2022-07-10 16:42:54,766][26022] Updated weights on worker 0-0, policy_version 806114 (0.00086) [2022-07-10 16:42:56,643][26022] Updated weights on worker 0-0, policy_version 806124 (0.00093) [2022-07-10 16:42:57,902][25689] Fps is (10 sec: 5573.2, 60 sec: 5541.2, 300 sec: 5533.7). Total num frames: 825477120. Throughput: 0: 4977.5. Samples: 825472988. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:42:57,904][25689] Avg episode reward: [(0, '-1.092')] [2022-07-10 16:42:58,453][26022] Updated weights on worker 0-0, policy_version 806134 (0.00084) [2022-07-10 16:43:00,372][26022] Updated weights on worker 0-0, policy_version 806144 (0.00086) [2022-07-10 16:43:02,463][26022] Updated weights on worker 0-0, policy_version 806154 (0.00082) [2022-07-10 16:43:02,922][25689] Fps is (10 sec: 5408.0, 60 sec: 5540.3, 300 sec: 5534.6). Total num frames: 825503744. Throughput: 0: 5792.0. Samples: 825506260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:02,923][25689] Avg episode reward: [(0, '-1.320')] [2022-07-10 16:43:04,326][26022] Updated weights on worker 0-0, policy_version 806164 (0.00086) [2022-07-10 16:43:06,451][26022] Updated weights on worker 0-0, policy_version 806174 (0.00087) [2022-07-10 16:43:07,978][25689] Fps is (10 sec: 5386.0, 60 sec: 5531.3, 300 sec: 5541.0). Total num frames: 825531392. Throughput: 0: 5654.5. Samples: 825536898. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:07,979][25689] Avg episode reward: [(0, '-1.285')] [2022-07-10 16:43:08,073][26022] Updated weights on worker 0-0, policy_version 806184 (0.00088) [2022-07-10 16:43:10,026][26022] Updated weights on worker 0-0, policy_version 806194 (0.00085) [2022-07-10 16:43:11,881][26022] Updated weights on worker 0-0, policy_version 806204 (0.00082) [2022-07-10 16:43:12,997][25689] Fps is (10 sec: 5488.3, 60 sec: 5530.9, 300 sec: 5531.5). Total num frames: 825559040. Throughput: 0: 4836.1. Samples: 825553486. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:12,998][25689] Avg episode reward: [(0, '-0.726')] [2022-07-10 16:43:13,717][26022] Updated weights on worker 0-0, policy_version 806214 (0.00096) [2022-07-10 16:43:15,584][26022] Updated weights on worker 0-0, policy_version 806224 (0.00085) [2022-07-10 16:43:17,417][26022] Updated weights on worker 0-0, policy_version 806234 (0.00094) [2022-07-10 16:43:18,095][25689] Fps is (10 sec: 5364.7, 60 sec: 5491.4, 300 sec: 5526.7). Total num frames: 825585664. Throughput: 0: 5645.6. Samples: 825586990. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:18,095][25689] Avg episode reward: [(0, '-1.585')] [2022-07-10 16:43:19,123][26022] Updated weights on worker 0-0, policy_version 806244 (0.00082) [2022-07-10 16:43:21,083][26022] Updated weights on worker 0-0, policy_version 806254 (0.00089) [2022-07-10 16:43:22,803][26022] Updated weights on worker 0-0, policy_version 806264 (0.00085) [2022-07-10 16:43:23,166][25689] Fps is (10 sec: 5538.6, 60 sec: 5542.6, 300 sec: 5534.1). Total num frames: 825615360. Throughput: 0: 5647.7. Samples: 825620590. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:23,167][25689] Avg episode reward: [(0, '-2.033')] [2022-07-10 16:43:24,827][26022] Updated weights on worker 0-0, policy_version 806274 (0.00594) [2022-07-10 16:43:26,538][26022] Updated weights on worker 0-0, policy_version 806284 (0.00081) [2022-07-10 16:43:28,267][25689] Fps is (10 sec: 5637.6, 60 sec: 5525.5, 300 sec: 5529.5). Total num frames: 825643008. Throughput: 0: 4938.2. Samples: 825637084. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:28,267][25689] Avg episode reward: [(0, '-0.918')] [2022-07-10 16:43:28,635][26022] Updated weights on worker 0-0, policy_version 806294 (0.00088) [2022-07-10 16:43:30,212][26022] Updated weights on worker 0-0, policy_version 806304 (0.00086) [2022-07-10 16:43:32,410][26022] Updated weights on worker 0-0, policy_version 806314 (0.00223) [2022-07-10 16:43:33,293][25689] Fps is (10 sec: 5561.1, 60 sec: 5507.6, 300 sec: 5527.0). Total num frames: 825671680. Throughput: 0: 5752.0. Samples: 825670230. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:33,294][25689] Avg episode reward: [(0, '-0.244')] [2022-07-10 16:43:34,041][26022] Updated weights on worker 0-0, policy_version 806324 (0.00083) [2022-07-10 16:43:35,829][26022] Updated weights on worker 0-0, policy_version 806334 (0.00088) [2022-07-10 16:43:37,937][26022] Updated weights on worker 0-0, policy_version 806344 (0.00087) [2022-07-10 16:43:38,349][25689] Fps is (10 sec: 5484.4, 60 sec: 5506.6, 300 sec: 5529.6). Total num frames: 825698304. Throughput: 0: 5742.2. Samples: 825703294. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:38,350][25689] Avg episode reward: [(0, '-0.235')] [2022-07-10 16:43:39,588][26022] Updated weights on worker 0-0, policy_version 806354 (0.00087) [2022-07-10 16:43:41,701][26022] Updated weights on worker 0-0, policy_version 806364 (0.00089) [2022-07-10 16:43:43,194][26022] Updated weights on worker 0-0, policy_version 806374 (0.00094) [2022-07-10 16:43:43,387][25689] Fps is (10 sec: 5580.1, 60 sec: 5520.6, 300 sec: 5530.6). Total num frames: 825728000. Throughput: 0: 4918.7. Samples: 825720050. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:43,387][25689] Avg episode reward: [(0, '-0.016')] [2022-07-10 16:43:45,284][26022] Updated weights on worker 0-0, policy_version 806384 (0.00090) [2022-07-10 16:43:47,190][26022] Updated weights on worker 0-0, policy_version 806394 (0.00102) [2022-07-10 16:43:48,484][25689] Fps is (10 sec: 5557.2, 60 sec: 5500.7, 300 sec: 5526.1). Total num frames: 825754624. Throughput: 0: 5750.1. Samples: 825753334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:48,485][25689] Avg episode reward: [(0, '0.347')] [2022-07-10 16:43:48,900][26022] Updated weights on worker 0-0, policy_version 806404 (0.00091) [2022-07-10 16:43:50,874][26022] Updated weights on worker 0-0, policy_version 806414 (0.00088) [2022-07-10 16:43:52,563][26022] Updated weights on worker 0-0, policy_version 806424 (0.00086) [2022-07-10 16:43:53,509][25689] Fps is (10 sec: 5361.6, 60 sec: 5489.7, 300 sec: 5522.5). Total num frames: 825782272. Throughput: 0: 5745.1. Samples: 825786370. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:53,509][25689] Avg episode reward: [(0, '0.619')] [2022-07-10 16:43:54,604][26022] Updated weights on worker 0-0, policy_version 806434 (0.00087) [2022-07-10 16:43:56,326][26022] Updated weights on worker 0-0, policy_version 806444 (0.00352) [2022-07-10 16:43:58,118][26022] Updated weights on worker 0-0, policy_version 806454 (0.00086) [2022-07-10 16:43:58,539][25689] Fps is (10 sec: 5601.2, 60 sec: 5505.3, 300 sec: 5526.3). Total num frames: 825810944. Throughput: 0: 5777.6. Samples: 825819942. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:43:58,540][25689] Avg episode reward: [(0, '0.427')] [2022-07-10 16:44:00,036][26022] Updated weights on worker 0-0, policy_version 806464 (0.00089) [2022-07-10 16:44:01,945][26022] Updated weights on worker 0-0, policy_version 806474 (0.00092) [2022-07-10 16:44:03,557][25689] Fps is (10 sec: 5401.1, 60 sec: 5488.6, 300 sec: 5524.1). Total num frames: 825836544. Throughput: 0: 5784.0. Samples: 825836716. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:03,558][25689] Avg episode reward: [(0, '0.065')] [2022-07-10 16:44:03,952][26022] Updated weights on worker 0-0, policy_version 806484 (0.00089) [2022-07-10 16:44:05,973][26022] Updated weights on worker 0-0, policy_version 806494 (0.00089) [2022-07-10 16:44:07,565][26022] Updated weights on worker 0-0, policy_version 806504 (0.00088) [2022-07-10 16:44:08,623][25689] Fps is (10 sec: 5381.9, 60 sec: 5504.6, 300 sec: 5529.9). Total num frames: 825865216. Throughput: 0: 5686.7. Samples: 825867860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:08,624][25689] Avg episode reward: [(0, '-0.888')] [2022-07-10 16:44:09,651][26022] Updated weights on worker 0-0, policy_version 806514 (0.00086) [2022-07-10 16:44:11,364][26022] Updated weights on worker 0-0, policy_version 806524 (0.00092) [2022-07-10 16:44:13,247][26022] Updated weights on worker 0-0, policy_version 806534 (0.01018) [2022-07-10 16:44:13,627][25689] Fps is (10 sec: 5592.9, 60 sec: 5505.9, 300 sec: 5526.6). Total num frames: 825892864. Throughput: 0: 5714.6. Samples: 825901338. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:13,628][25689] Avg episode reward: [(0, '-0.406')] [2022-07-10 16:44:15,167][26022] Updated weights on worker 0-0, policy_version 806544 (0.00084) [2022-07-10 16:44:17,010][26022] Updated weights on worker 0-0, policy_version 806554 (0.00086) [2022-07-10 16:44:18,632][25689] Fps is (10 sec: 5626.9, 60 sec: 5548.2, 300 sec: 5526.7). Total num frames: 825921536. Throughput: 0: 4878.3. Samples: 825917962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:18,632][25689] Avg episode reward: [(0, '-1.227')] [2022-07-10 16:44:18,637][26022] Updated weights on worker 0-0, policy_version 806564 (0.00085) [2022-07-10 16:44:20,899][26022] Updated weights on worker 0-0, policy_version 806574 (0.00094) [2022-07-10 16:44:22,217][26022] Updated weights on worker 0-0, policy_version 806584 (0.00101) [2022-07-10 16:44:23,635][25689] Fps is (10 sec: 5525.4, 60 sec: 5503.7, 300 sec: 5527.7). Total num frames: 825948160. Throughput: 0: 5707.6. Samples: 825951312. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:23,635][25689] Avg episode reward: [(0, '-2.004')] [2022-07-10 16:44:24,581][26022] Updated weights on worker 0-0, policy_version 806594 (0.00085) [2022-07-10 16:44:26,120][26022] Updated weights on worker 0-0, policy_version 806604 (0.00088) [2022-07-10 16:44:28,000][26022] Updated weights on worker 0-0, policy_version 806614 (0.00095) [2022-07-10 16:44:28,673][25689] Fps is (10 sec: 5404.8, 60 sec: 5509.3, 300 sec: 5523.6). Total num frames: 825975808. Throughput: 0: 5824.4. Samples: 825984642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:28,674][25689] Avg episode reward: [(0, '-2.134')] [2022-07-10 16:44:29,976][26022] Updated weights on worker 0-0, policy_version 806624 (0.00091) [2022-07-10 16:44:31,660][26022] Updated weights on worker 0-0, policy_version 806634 (0.00089) [2022-07-10 16:44:33,583][26022] Updated weights on worker 0-0, policy_version 806644 (0.00091) [2022-07-10 16:44:33,675][25689] Fps is (10 sec: 5507.3, 60 sec: 5494.6, 300 sec: 5523.9). Total num frames: 826003456. Throughput: 0: 4986.7. Samples: 826001312. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:33,676][25689] Avg episode reward: [(0, '-2.308')] [2022-07-10 16:44:35,546][26022] Updated weights on worker 0-0, policy_version 806654 (0.00088) [2022-07-10 16:44:37,279][26022] Updated weights on worker 0-0, policy_version 806664 (0.00086) [2022-07-10 16:44:38,682][25689] Fps is (10 sec: 5524.6, 60 sec: 5516.0, 300 sec: 5527.3). Total num frames: 826031104. Throughput: 0: 5811.0. Samples: 826034476. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:38,683][25689] Avg episode reward: [(0, '-1.764')] [2022-07-10 16:44:39,304][26022] Updated weights on worker 0-0, policy_version 806674 (0.00088) [2022-07-10 16:44:40,908][26022] Updated weights on worker 0-0, policy_version 806684 (0.00086) [2022-07-10 16:44:42,861][26022] Updated weights on worker 0-0, policy_version 806694 (0.00098) [2022-07-10 16:44:43,702][25689] Fps is (10 sec: 5617.2, 60 sec: 5500.7, 300 sec: 5524.4). Total num frames: 826059776. Throughput: 0: 5814.7. Samples: 826067994. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:43,702][25689] Avg episode reward: [(0, '-1.573')] [2022-07-10 16:44:44,652][26022] Updated weights on worker 0-0, policy_version 806704 (0.00092) [2022-07-10 16:44:46,540][26022] Updated weights on worker 0-0, policy_version 806714 (0.00091) [2022-07-10 16:44:47,031][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:44:47,042][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000806717_826078208.pth [2022-07-10 16:44:47,044][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000804772_824086528.pth [2022-07-10 16:44:48,467][26022] Updated weights on worker 0-0, policy_version 806724 (0.00080) [2022-07-10 16:44:48,764][25689] Fps is (10 sec: 5586.4, 60 sec: 5520.9, 300 sec: 5519.9). Total num frames: 826087424. Throughput: 0: 4989.9. Samples: 826084892. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:48,765][25689] Avg episode reward: [(0, '-2.090')] [2022-07-10 16:44:50,062][26022] Updated weights on worker 0-0, policy_version 806734 (0.00089) [2022-07-10 16:44:52,052][26022] Updated weights on worker 0-0, policy_version 806744 (0.00087) [2022-07-10 16:44:53,650][26022] Updated weights on worker 0-0, policy_version 806754 (0.00099) [2022-07-10 16:44:53,795][25689] Fps is (10 sec: 5580.0, 60 sec: 5537.4, 300 sec: 5526.3). Total num frames: 826116096. Throughput: 0: 5830.6. Samples: 826118620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:53,795][25689] Avg episode reward: [(0, '-1.731')] [2022-07-10 16:44:55,725][26022] Updated weights on worker 0-0, policy_version 806764 (0.00086) [2022-07-10 16:44:57,504][26022] Updated weights on worker 0-0, policy_version 806774 (0.00090) [2022-07-10 16:44:58,808][25689] Fps is (10 sec: 5607.3, 60 sec: 5521.9, 300 sec: 5523.1). Total num frames: 826143744. Throughput: 0: 5852.4. Samples: 826152260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:44:58,809][25689] Avg episode reward: [(0, '-1.211')] [2022-07-10 16:44:59,278][26022] Updated weights on worker 0-0, policy_version 806784 (0.00092) [2022-07-10 16:45:01,094][26022] Updated weights on worker 0-0, policy_version 806794 (0.00086) [2022-07-10 16:45:03,361][26022] Updated weights on worker 0-0, policy_version 806804 (0.00077) [2022-07-10 16:45:03,845][25689] Fps is (10 sec: 5502.1, 60 sec: 5554.2, 300 sec: 5530.4). Total num frames: 826171392. Throughput: 0: 5000.9. Samples: 826168728. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:03,845][25689] Avg episode reward: [(0, '-0.545')] [2022-07-10 16:45:05,102][26022] Updated weights on worker 0-0, policy_version 806814 (0.00095) [2022-07-10 16:45:06,876][26022] Updated weights on worker 0-0, policy_version 806824 (0.00097) [2022-07-10 16:45:08,952][25689] Fps is (10 sec: 5249.5, 60 sec: 5499.5, 300 sec: 5521.9). Total num frames: 826196992. Throughput: 0: 5707.1. Samples: 826200104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:08,952][25689] Avg episode reward: [(0, '-0.306')] [2022-07-10 16:45:08,987][26022] Updated weights on worker 0-0, policy_version 806834 (0.00434) [2022-07-10 16:45:10,581][26022] Updated weights on worker 0-0, policy_version 806844 (0.00079) [2022-07-10 16:45:12,593][26022] Updated weights on worker 0-0, policy_version 806854 (0.00083) [2022-07-10 16:45:13,960][25689] Fps is (10 sec: 5365.6, 60 sec: 5516.1, 300 sec: 5518.6). Total num frames: 826225664. Throughput: 0: 5706.2. Samples: 826233684. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:13,960][25689] Avg episode reward: [(0, '0.020')] [2022-07-10 16:45:14,426][26022] Updated weights on worker 0-0, policy_version 806864 (0.00087) [2022-07-10 16:45:16,086][26022] Updated weights on worker 0-0, policy_version 806874 (0.00086) [2022-07-10 16:45:17,958][26022] Updated weights on worker 0-0, policy_version 806884 (0.00087) [2022-07-10 16:45:19,023][25689] Fps is (10 sec: 5693.5, 60 sec: 5510.7, 300 sec: 5524.7). Total num frames: 826254336. Throughput: 0: 4853.1. Samples: 826250366. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:19,024][25689] Avg episode reward: [(0, '-0.051')] [2022-07-10 16:45:19,811][26022] Updated weights on worker 0-0, policy_version 806894 (0.00484) [2022-07-10 16:45:21,710][26022] Updated weights on worker 0-0, policy_version 806904 (0.00083) [2022-07-10 16:45:23,527][26022] Updated weights on worker 0-0, policy_version 806914 (0.00088) [2022-07-10 16:45:24,050][25689] Fps is (10 sec: 5682.8, 60 sec: 5542.4, 300 sec: 5525.1). Total num frames: 826283008. Throughput: 0: 5714.2. Samples: 826284186. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:24,051][25689] Avg episode reward: [(0, '-0.244')] [2022-07-10 16:45:25,214][26022] Updated weights on worker 0-0, policy_version 806924 (0.00086) [2022-07-10 16:45:27,234][26022] Updated weights on worker 0-0, policy_version 806934 (0.00090) [2022-07-10 16:45:28,852][26022] Updated weights on worker 0-0, policy_version 806944 (0.00094) [2022-07-10 16:45:29,128][25689] Fps is (10 sec: 5675.0, 60 sec: 5555.8, 300 sec: 5524.3). Total num frames: 826311680. Throughput: 0: 5840.0. Samples: 826317934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:29,129][25689] Avg episode reward: [(0, '-0.370')] [2022-07-10 16:45:30,711][26022] Updated weights on worker 0-0, policy_version 806954 (0.00086) [2022-07-10 16:45:32,634][26022] Updated weights on worker 0-0, policy_version 806964 (0.00089) [2022-07-10 16:45:34,150][25689] Fps is (10 sec: 5576.2, 60 sec: 5553.9, 300 sec: 5528.7). Total num frames: 826339328. Throughput: 0: 5000.5. Samples: 826334648. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:34,151][25689] Avg episode reward: [(0, '-1.307')] [2022-07-10 16:45:34,634][26022] Updated weights on worker 0-0, policy_version 806974 (0.00083) [2022-07-10 16:45:36,355][26022] Updated weights on worker 0-0, policy_version 806984 (0.00085) [2022-07-10 16:45:38,346][26022] Updated weights on worker 0-0, policy_version 806994 (0.00088) [2022-07-10 16:45:39,174][25689] Fps is (10 sec: 5402.2, 60 sec: 5535.4, 300 sec: 5514.9). Total num frames: 826365952. Throughput: 0: 5824.6. Samples: 826367738. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:39,175][25689] Avg episode reward: [(0, '-1.668')] [2022-07-10 16:45:40,056][26022] Updated weights on worker 0-0, policy_version 807004 (0.00086) [2022-07-10 16:45:42,052][26022] Updated weights on worker 0-0, policy_version 807014 (0.00092) [2022-07-10 16:45:43,558][26022] Updated weights on worker 0-0, policy_version 807024 (0.00096) [2022-07-10 16:45:44,203][25689] Fps is (10 sec: 5500.8, 60 sec: 5534.6, 300 sec: 5523.1). Total num frames: 826394624. Throughput: 0: 5813.6. Samples: 826401344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:44,203][25689] Avg episode reward: [(0, '-0.829')] [2022-07-10 16:45:45,593][26022] Updated weights on worker 0-0, policy_version 807034 (0.00091) [2022-07-10 16:45:47,390][26022] Updated weights on worker 0-0, policy_version 807044 (0.00090) [2022-07-10 16:45:49,175][26022] Updated weights on worker 0-0, policy_version 807054 (0.00084) [2022-07-10 16:45:49,279][25689] Fps is (10 sec: 5675.2, 60 sec: 5550.3, 300 sec: 5518.5). Total num frames: 826423296. Throughput: 0: 4977.5. Samples: 826418234. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:49,279][25689] Avg episode reward: [(0, '-1.147')] [2022-07-10 16:45:50,939][26022] Updated weights on worker 0-0, policy_version 807064 (0.00093) [2022-07-10 16:45:52,975][26022] Updated weights on worker 0-0, policy_version 807074 (0.00087) [2022-07-10 16:45:54,329][25689] Fps is (10 sec: 5763.8, 60 sec: 5565.3, 300 sec: 5528.1). Total num frames: 826452992. Throughput: 0: 5828.2. Samples: 826452256. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:54,330][25689] Avg episode reward: [(0, '-0.786')] [2022-07-10 16:45:54,500][26022] Updated weights on worker 0-0, policy_version 807084 (0.00098) [2022-07-10 16:45:56,647][26022] Updated weights on worker 0-0, policy_version 807094 (0.00093) [2022-07-10 16:45:58,292][26022] Updated weights on worker 0-0, policy_version 807104 (0.00094) [2022-07-10 16:45:59,346][25689] Fps is (10 sec: 5594.4, 60 sec: 5548.1, 300 sec: 5527.9). Total num frames: 826479616. Throughput: 0: 5845.7. Samples: 826485656. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:45:59,346][25689] Avg episode reward: [(0, '-1.132')] [2022-07-10 16:46:00,161][26022] Updated weights on worker 0-0, policy_version 807114 (0.00091) [2022-07-10 16:46:02,244][26022] Updated weights on worker 0-0, policy_version 807124 (0.00050) [2022-07-10 16:46:04,107][26022] Updated weights on worker 0-0, policy_version 807134 (0.00087) [2022-07-10 16:46:04,359][25689] Fps is (10 sec: 5411.3, 60 sec: 5550.3, 300 sec: 5525.8). Total num frames: 826507264. Throughput: 0: 4984.4. Samples: 826501810. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:46:04,359][25689] Avg episode reward: [(0, '-0.376')] [2022-07-10 16:46:05,888][26022] Updated weights on worker 0-0, policy_version 807144 (0.00085) [2022-07-10 16:46:07,710][26022] Updated weights on worker 0-0, policy_version 807154 (0.00098) [2022-07-10 16:46:09,417][25689] Fps is (10 sec: 5287.2, 60 sec: 5554.8, 300 sec: 5525.3). Total num frames: 826532864. Throughput: 0: 5758.8. Samples: 826534206. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 16:46:09,418][25689] Avg episode reward: [(0, '0.184')] [2022-07-10 16:46:09,747][26022] Updated weights on worker 0-0, policy_version 807164 (0.00087) [2022-07-10 16:46:11,325][26022] Updated weights on worker 0-0, policy_version 807174 (0.00083) [2022-07-10 16:46:13,332][26022] Updated weights on worker 0-0, policy_version 807184 (0.00088) [2022-07-10 16:46:14,420][25689] Fps is (10 sec: 5394.0, 60 sec: 5555.2, 300 sec: 5528.7). Total num frames: 826561536. Throughput: 0: 5746.7. Samples: 826567712. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:14,422][25689] Avg episode reward: [(0, '-0.744')] [2022-07-10 16:46:14,884][26022] Updated weights on worker 0-0, policy_version 807194 (0.00085) [2022-07-10 16:46:17,155][26022] Updated weights on worker 0-0, policy_version 807204 (0.00105) [2022-07-10 16:46:18,774][26022] Updated weights on worker 0-0, policy_version 807214 (0.00089) [2022-07-10 16:46:19,428][25689] Fps is (10 sec: 5728.1, 60 sec: 5560.4, 300 sec: 5526.0). Total num frames: 826590208. Throughput: 0: 4916.2. Samples: 826584384. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:19,429][25689] Avg episode reward: [(0, '-0.933')] [2022-07-10 16:46:20,755][26022] Updated weights on worker 0-0, policy_version 807224 (0.00107) [2022-07-10 16:46:22,364][26022] Updated weights on worker 0-0, policy_version 807234 (0.00082) [2022-07-10 16:46:24,438][25689] Fps is (10 sec: 5520.0, 60 sec: 5528.1, 300 sec: 5524.8). Total num frames: 826616832. Throughput: 0: 5781.3. Samples: 826617892. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:24,439][25689] Avg episode reward: [(0, '-1.313')] [2022-07-10 16:46:24,510][26022] Updated weights on worker 0-0, policy_version 807244 (0.00088) [2022-07-10 16:46:26,049][26022] Updated weights on worker 0-0, policy_version 807254 (0.00105) [2022-07-10 16:46:28,191][26022] Updated weights on worker 0-0, policy_version 807264 (0.00085) [2022-07-10 16:46:29,532][25689] Fps is (10 sec: 5574.0, 60 sec: 5543.5, 300 sec: 5526.7). Total num frames: 826646528. Throughput: 0: 5831.9. Samples: 826651514. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:29,532][25689] Avg episode reward: [(0, '-1.354')] [2022-07-10 16:46:29,878][26022] Updated weights on worker 0-0, policy_version 807274 (0.00086) [2022-07-10 16:46:31,655][26022] Updated weights on worker 0-0, policy_version 807284 (0.00081) [2022-07-10 16:46:33,498][26022] Updated weights on worker 0-0, policy_version 807294 (0.00088) [2022-07-10 16:46:34,551][25689] Fps is (10 sec: 5771.5, 60 sec: 5560.8, 300 sec: 5533.4). Total num frames: 826675200. Throughput: 0: 4993.5. Samples: 826668234. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:34,551][25689] Avg episode reward: [(0, '-1.682')] [2022-07-10 16:46:35,398][26022] Updated weights on worker 0-0, policy_version 807304 (0.00079) [2022-07-10 16:46:37,264][26022] Updated weights on worker 0-0, policy_version 807314 (0.00086) [2022-07-10 16:46:39,202][26022] Updated weights on worker 0-0, policy_version 807324 (0.00089) [2022-07-10 16:46:39,651][25689] Fps is (10 sec: 5565.6, 60 sec: 5570.6, 300 sec: 5528.1). Total num frames: 826702848. Throughput: 0: 5812.3. Samples: 826701928. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:39,653][25689] Avg episode reward: [(0, '-1.450')] [2022-07-10 16:46:40,885][26022] Updated weights on worker 0-0, policy_version 807334 (0.00094) [2022-07-10 16:46:42,652][26022] Updated weights on worker 0-0, policy_version 807344 (0.00093) [2022-07-10 16:46:44,572][26022] Updated weights on worker 0-0, policy_version 807354 (0.00092) [2022-07-10 16:46:44,664][25689] Fps is (10 sec: 5568.7, 60 sec: 5572.1, 300 sec: 5532.7). Total num frames: 826731520. Throughput: 0: 5804.2. Samples: 826735292. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:44,665][25689] Avg episode reward: [(0, '-1.336')] [2022-07-10 16:46:46,204][26022] Updated weights on worker 0-0, policy_version 807364 (0.00085) [2022-07-10 16:46:47,229][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:46:47,251][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000807368_826744832.pth [2022-07-10 16:46:47,252][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000805422_824752128.pth [2022-07-10 16:46:48,267][26022] Updated weights on worker 0-0, policy_version 807374 (0.00092) [2022-07-10 16:46:49,730][25689] Fps is (10 sec: 5588.0, 60 sec: 5556.1, 300 sec: 5531.8). Total num frames: 826759168. Throughput: 0: 4984.3. Samples: 826752188. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:49,731][25689] Avg episode reward: [(0, '-0.682')] [2022-07-10 16:46:50,007][26022] Updated weights on worker 0-0, policy_version 807384 (0.00090) [2022-07-10 16:46:51,872][26022] Updated weights on worker 0-0, policy_version 807394 (0.00090) [2022-07-10 16:46:53,920][26022] Updated weights on worker 0-0, policy_version 807404 (0.00092) [2022-07-10 16:46:54,761][25689] Fps is (10 sec: 5476.5, 60 sec: 5524.0, 300 sec: 5531.8). Total num frames: 826786816. Throughput: 0: 5801.2. Samples: 826785480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:54,762][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 16:46:55,459][26022] Updated weights on worker 0-0, policy_version 807414 (0.00089) [2022-07-10 16:46:57,439][26022] Updated weights on worker 0-0, policy_version 807424 (0.00086) [2022-07-10 16:46:59,095][26022] Updated weights on worker 0-0, policy_version 807434 (0.00088) [2022-07-10 16:46:59,854][25689] Fps is (10 sec: 5563.2, 60 sec: 5550.9, 300 sec: 5537.2). Total num frames: 826815488. Throughput: 0: 5806.4. Samples: 826819232. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:46:59,854][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 16:47:01,006][26022] Updated weights on worker 0-0, policy_version 807444 (0.00088) [2022-07-10 16:47:03,325][26022] Updated weights on worker 0-0, policy_version 807454 (0.00084) [2022-07-10 16:47:04,905][25689] Fps is (10 sec: 5451.0, 60 sec: 5530.4, 300 sec: 5532.0). Total num frames: 826842112. Throughput: 0: 5713.4. Samples: 826850938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:04,906][25689] Avg episode reward: [(0, '-0.469')] [2022-07-10 16:47:04,978][26022] Updated weights on worker 0-0, policy_version 807464 (0.00086) [2022-07-10 16:47:07,045][26022] Updated weights on worker 0-0, policy_version 807474 (0.00086) [2022-07-10 16:47:08,690][26022] Updated weights on worker 0-0, policy_version 807484 (0.00083) [2022-07-10 16:47:09,952][25689] Fps is (10 sec: 5273.2, 60 sec: 5548.4, 300 sec: 5528.0). Total num frames: 826868736. Throughput: 0: 5712.4. Samples: 826867702. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:09,952][25689] Avg episode reward: [(0, '-0.377')] [2022-07-10 16:47:10,607][26022] Updated weights on worker 0-0, policy_version 807494 (0.00088) [2022-07-10 16:47:12,300][26022] Updated weights on worker 0-0, policy_version 807504 (0.00089) [2022-07-10 16:47:13,948][26022] Updated weights on worker 0-0, policy_version 807514 (0.00082) [2022-07-10 16:47:14,966][25689] Fps is (10 sec: 5598.3, 60 sec: 5564.4, 300 sec: 5531.8). Total num frames: 826898432. Throughput: 0: 5733.4. Samples: 826901320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:14,966][25689] Avg episode reward: [(0, '0.248')] [2022-07-10 16:47:16,115][26022] Updated weights on worker 0-0, policy_version 807524 (0.00088) [2022-07-10 16:47:17,870][26022] Updated weights on worker 0-0, policy_version 807534 (0.00090) [2022-07-10 16:47:19,632][26022] Updated weights on worker 0-0, policy_version 807544 (0.00091) [2022-07-10 16:47:19,985][25689] Fps is (10 sec: 5715.5, 60 sec: 5546.4, 300 sec: 5536.3). Total num frames: 826926080. Throughput: 0: 5731.4. Samples: 826934610. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:19,991][25689] Avg episode reward: [(0, '-0.049')] [2022-07-10 16:47:21,491][26022] Updated weights on worker 0-0, policy_version 807554 (0.00077) [2022-07-10 16:47:23,126][26022] Updated weights on worker 0-0, policy_version 807564 (0.00092) [2022-07-10 16:47:25,007][25689] Fps is (10 sec: 5507.0, 60 sec: 5562.2, 300 sec: 5534.3). Total num frames: 826953728. Throughput: 0: 5015.4. Samples: 826951754. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:25,019][25689] Avg episode reward: [(0, '-0.105')] [2022-07-10 16:47:25,179][26022] Updated weights on worker 0-0, policy_version 807574 (0.00081) [2022-07-10 16:47:26,905][26022] Updated weights on worker 0-0, policy_version 807584 (0.00085) [2022-07-10 16:47:28,979][26022] Updated weights on worker 0-0, policy_version 807594 (0.00093) [2022-07-10 16:47:30,068][25689] Fps is (10 sec: 5585.6, 60 sec: 5548.3, 300 sec: 5530.0). Total num frames: 826982400. Throughput: 0: 5842.0. Samples: 826985220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:30,070][25689] Avg episode reward: [(0, '-0.199')] [2022-07-10 16:47:30,815][26022] Updated weights on worker 0-0, policy_version 807604 (0.00090) [2022-07-10 16:47:32,390][26022] Updated weights on worker 0-0, policy_version 807614 (0.00097) [2022-07-10 16:47:34,397][26022] Updated weights on worker 0-0, policy_version 807624 (0.00089) [2022-07-10 16:47:35,080][25689] Fps is (10 sec: 5591.3, 60 sec: 5532.0, 300 sec: 5534.0). Total num frames: 827010048. Throughput: 0: 5839.5. Samples: 827018776. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:35,080][25689] Avg episode reward: [(0, '0.152')] [2022-07-10 16:47:36,225][26022] Updated weights on worker 0-0, policy_version 807634 (0.00090) [2022-07-10 16:47:38,175][26022] Updated weights on worker 0-0, policy_version 807644 (0.00085) [2022-07-10 16:47:39,947][26022] Updated weights on worker 0-0, policy_version 807654 (0.00088) [2022-07-10 16:47:40,093][25689] Fps is (10 sec: 5617.7, 60 sec: 5556.9, 300 sec: 5533.9). Total num frames: 827038720. Throughput: 0: 5019.2. Samples: 827035538. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:40,094][25689] Avg episode reward: [(0, '-0.904')] [2022-07-10 16:47:41,826][26022] Updated weights on worker 0-0, policy_version 807664 (0.00053) [2022-07-10 16:47:43,572][26022] Updated weights on worker 0-0, policy_version 807674 (0.00087) [2022-07-10 16:47:45,103][25689] Fps is (10 sec: 5516.8, 60 sec: 5523.4, 300 sec: 5531.5). Total num frames: 827065344. Throughput: 0: 5829.3. Samples: 827068900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:45,103][25689] Avg episode reward: [(0, '-1.164')] [2022-07-10 16:47:45,496][26022] Updated weights on worker 0-0, policy_version 807684 (0.00087) [2022-07-10 16:47:47,231][26022] Updated weights on worker 0-0, policy_version 807694 (0.00085) [2022-07-10 16:47:49,128][26022] Updated weights on worker 0-0, policy_version 807704 (0.00095) [2022-07-10 16:47:50,163][25689] Fps is (10 sec: 5593.0, 60 sec: 5557.8, 300 sec: 5535.5). Total num frames: 827095040. Throughput: 0: 5834.1. Samples: 827102458. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:50,165][25689] Avg episode reward: [(0, '-0.436')] [2022-07-10 16:47:50,991][26022] Updated weights on worker 0-0, policy_version 807714 (0.00102) [2022-07-10 16:47:52,653][26022] Updated weights on worker 0-0, policy_version 807724 (0.00083) [2022-07-10 16:47:54,634][26022] Updated weights on worker 0-0, policy_version 807734 (0.00083) [2022-07-10 16:47:55,189][25689] Fps is (10 sec: 5786.7, 60 sec: 5575.1, 300 sec: 5538.7). Total num frames: 827123712. Throughput: 0: 5004.3. Samples: 827119412. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:47:55,191][25689] Avg episode reward: [(0, '-0.940')] [2022-07-10 16:47:56,318][26022] Updated weights on worker 0-0, policy_version 807744 (0.00084) [2022-07-10 16:47:58,329][26022] Updated weights on worker 0-0, policy_version 807754 (0.00090) [2022-07-10 16:48:00,099][26022] Updated weights on worker 0-0, policy_version 807764 (0.00089) [2022-07-10 16:48:00,207][25689] Fps is (10 sec: 5505.3, 60 sec: 5548.1, 300 sec: 5538.8). Total num frames: 827150336. Throughput: 0: 5850.6. Samples: 827153216. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:00,209][25689] Avg episode reward: [(0, '-0.803')] [2022-07-10 16:48:01,918][26022] Updated weights on worker 0-0, policy_version 807774 (0.00083) [2022-07-10 16:48:04,116][26022] Updated weights on worker 0-0, policy_version 807784 (0.00082) [2022-07-10 16:48:05,216][25689] Fps is (10 sec: 5310.4, 60 sec: 5552.0, 300 sec: 5536.2). Total num frames: 827176960. Throughput: 0: 5758.6. Samples: 827184726. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:05,217][25689] Avg episode reward: [(0, '-2.241')] [2022-07-10 16:48:05,999][26022] Updated weights on worker 0-0, policy_version 807794 (0.00086) [2022-07-10 16:48:07,915][26022] Updated weights on worker 0-0, policy_version 807804 (0.00090) [2022-07-10 16:48:09,516][26022] Updated weights on worker 0-0, policy_version 807814 (0.00054) [2022-07-10 16:48:10,349][25689] Fps is (10 sec: 5351.1, 60 sec: 5561.0, 300 sec: 5534.1). Total num frames: 827204608. Throughput: 0: 4892.4. Samples: 827201218. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:10,350][25689] Avg episode reward: [(0, '-2.386')] [2022-07-10 16:48:11,592][26022] Updated weights on worker 0-0, policy_version 807824 (0.00050) [2022-07-10 16:48:13,336][26022] Updated weights on worker 0-0, policy_version 807834 (0.00088) [2022-07-10 16:48:15,214][26022] Updated weights on worker 0-0, policy_version 807844 (0.00088) [2022-07-10 16:48:15,425][25689] Fps is (10 sec: 5617.3, 60 sec: 5555.3, 300 sec: 5544.8). Total num frames: 827234304. Throughput: 0: 5685.0. Samples: 827234452. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:15,425][25689] Avg episode reward: [(0, '-2.536')] [2022-07-10 16:48:17,224][26022] Updated weights on worker 0-0, policy_version 807854 (0.00096) [2022-07-10 16:48:18,648][26022] Updated weights on worker 0-0, policy_version 807864 (0.00091) [2022-07-10 16:48:20,460][25689] Fps is (10 sec: 5570.5, 60 sec: 5537.0, 300 sec: 5535.2). Total num frames: 827260928. Throughput: 0: 5669.0. Samples: 827268028. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:20,460][25689] Avg episode reward: [(0, '-2.734')] [2022-07-10 16:48:20,780][26022] Updated weights on worker 0-0, policy_version 807874 (0.00095) [2022-07-10 16:48:22,544][26022] Updated weights on worker 0-0, policy_version 807884 (0.00090) [2022-07-10 16:48:24,379][26022] Updated weights on worker 0-0, policy_version 807894 (0.00085) [2022-07-10 16:48:25,536][25689] Fps is (10 sec: 5570.4, 60 sec: 5565.9, 300 sec: 5542.5). Total num frames: 827290624. Throughput: 0: 4926.2. Samples: 827284822. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:25,536][25689] Avg episode reward: [(0, '-2.267')] [2022-07-10 16:48:26,263][26022] Updated weights on worker 0-0, policy_version 807904 (0.00094) [2022-07-10 16:48:28,035][26022] Updated weights on worker 0-0, policy_version 807914 (0.00089) [2022-07-10 16:48:29,904][26022] Updated weights on worker 0-0, policy_version 807924 (0.00090) [2022-07-10 16:48:30,635][25689] Fps is (10 sec: 5434.4, 60 sec: 5511.6, 300 sec: 5530.8). Total num frames: 827316224. Throughput: 0: 5759.3. Samples: 827318048. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:30,635][25689] Avg episode reward: [(0, '-1.047')] [2022-07-10 16:48:31,699][26022] Updated weights on worker 0-0, policy_version 807934 (0.00084) [2022-07-10 16:48:33,635][26022] Updated weights on worker 0-0, policy_version 807944 (0.00098) [2022-07-10 16:48:35,338][26022] Updated weights on worker 0-0, policy_version 807954 (0.00088) [2022-07-10 16:48:35,693][25689] Fps is (10 sec: 5544.5, 60 sec: 5558.1, 300 sec: 5544.5). Total num frames: 827346944. Throughput: 0: 5781.3. Samples: 827351630. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:35,694][25689] Avg episode reward: [(0, '-0.278')] [2022-07-10 16:48:37,408][26022] Updated weights on worker 0-0, policy_version 807964 (0.00090) [2022-07-10 16:48:39,028][26022] Updated weights on worker 0-0, policy_version 807974 (0.00082) [2022-07-10 16:48:40,741][25689] Fps is (10 sec: 5775.4, 60 sec: 5538.1, 300 sec: 5537.5). Total num frames: 827374592. Throughput: 0: 4951.1. Samples: 827368444. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:40,742][25689] Avg episode reward: [(0, '-0.700')] [2022-07-10 16:48:41,158][26022] Updated weights on worker 0-0, policy_version 807984 (0.00088) [2022-07-10 16:48:42,752][26022] Updated weights on worker 0-0, policy_version 807994 (0.00087) [2022-07-10 16:48:44,711][26022] Updated weights on worker 0-0, policy_version 808004 (0.00096) [2022-07-10 16:48:45,835][25689] Fps is (10 sec: 5553.3, 60 sec: 5564.1, 300 sec: 5544.4). Total num frames: 827403264. Throughput: 0: 5761.2. Samples: 827401772. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:45,836][25689] Avg episode reward: [(0, '0.325')] [2022-07-10 16:48:46,272][26022] Updated weights on worker 0-0, policy_version 808014 (0.00087) [2022-07-10 16:48:47,442][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:48:47,452][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000808019_827411456.pth [2022-07-10 16:48:47,452][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000806070_825415680.pth [2022-07-10 16:48:48,306][26022] Updated weights on worker 0-0, policy_version 808024 (0.00089) [2022-07-10 16:48:50,116][26022] Updated weights on worker 0-0, policy_version 808034 (0.00094) [2022-07-10 16:48:50,926][25689] Fps is (10 sec: 5429.1, 60 sec: 5510.7, 300 sec: 5539.7). Total num frames: 827429888. Throughput: 0: 5788.2. Samples: 827435500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:50,935][25689] Avg episode reward: [(0, '-0.201')] [2022-07-10 16:48:52,047][26022] Updated weights on worker 0-0, policy_version 808044 (0.00090) [2022-07-10 16:48:53,953][26022] Updated weights on worker 0-0, policy_version 808054 (0.00088) [2022-07-10 16:48:55,560][26022] Updated weights on worker 0-0, policy_version 808064 (0.00089) [2022-07-10 16:48:56,003][25689] Fps is (10 sec: 5438.5, 60 sec: 5506.2, 300 sec: 5538.9). Total num frames: 827458560. Throughput: 0: 4934.6. Samples: 827451848. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:48:56,003][25689] Avg episode reward: [(0, '-0.444')] [2022-07-10 16:48:57,537][26022] Updated weights on worker 0-0, policy_version 808074 (0.00085) [2022-07-10 16:48:59,316][26022] Updated weights on worker 0-0, policy_version 808084 (0.00092) [2022-07-10 16:49:01,078][25689] Fps is (10 sec: 5648.5, 60 sec: 5534.6, 300 sec: 5548.1). Total num frames: 827487232. Throughput: 0: 5745.8. Samples: 827485298. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:01,079][25689] Avg episode reward: [(0, '-1.108')] [2022-07-10 16:49:01,211][26022] Updated weights on worker 0-0, policy_version 808094 (0.00085) [2022-07-10 16:49:03,522][26022] Updated weights on worker 0-0, policy_version 808104 (0.00089) [2022-07-10 16:49:05,289][26022] Updated weights on worker 0-0, policy_version 808114 (0.00088) [2022-07-10 16:49:06,084][25689] Fps is (10 sec: 5485.0, 60 sec: 5534.9, 300 sec: 5542.4). Total num frames: 827513856. Throughput: 0: 5696.1. Samples: 827517112. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:06,086][25689] Avg episode reward: [(0, '-1.221')] [2022-07-10 16:49:07,078][26022] Updated weights on worker 0-0, policy_version 808124 (0.00087) [2022-07-10 16:49:08,918][26022] Updated weights on worker 0-0, policy_version 808134 (0.00094) [2022-07-10 16:49:10,658][26022] Updated weights on worker 0-0, policy_version 808144 (0.00087) [2022-07-10 16:49:11,216][25689] Fps is (10 sec: 5353.6, 60 sec: 5535.0, 300 sec: 5540.0). Total num frames: 827541504. Throughput: 0: 5683.6. Samples: 827550818. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:11,224][25689] Avg episode reward: [(0, '-1.425')] [2022-07-10 16:49:12,535][26022] Updated weights on worker 0-0, policy_version 808154 (0.00103) [2022-07-10 16:49:14,503][26022] Updated weights on worker 0-0, policy_version 808164 (0.00084) [2022-07-10 16:49:16,187][26022] Updated weights on worker 0-0, policy_version 808174 (0.00088) [2022-07-10 16:49:16,282][25689] Fps is (10 sec: 5522.4, 60 sec: 5519.0, 300 sec: 5538.8). Total num frames: 827570176. Throughput: 0: 5702.3. Samples: 827567488. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:16,283][25689] Avg episode reward: [(0, '-0.981')] [2022-07-10 16:49:18,203][26022] Updated weights on worker 0-0, policy_version 808184 (0.00081) [2022-07-10 16:49:19,839][26022] Updated weights on worker 0-0, policy_version 808194 (0.00092) [2022-07-10 16:49:21,293][25689] Fps is (10 sec: 5589.1, 60 sec: 5538.0, 300 sec: 5542.1). Total num frames: 827597824. Throughput: 0: 5722.2. Samples: 827600968. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:21,294][25689] Avg episode reward: [(0, '-0.652')] [2022-07-10 16:49:21,778][26022] Updated weights on worker 0-0, policy_version 808204 (0.00085) [2022-07-10 16:49:23,462][26022] Updated weights on worker 0-0, policy_version 808214 (0.00086) [2022-07-10 16:49:25,533][26022] Updated weights on worker 0-0, policy_version 808224 (0.00087) [2022-07-10 16:49:26,317][25689] Fps is (10 sec: 5612.7, 60 sec: 5525.9, 300 sec: 5545.9). Total num frames: 827626496. Throughput: 0: 5816.7. Samples: 827634800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:26,317][25689] Avg episode reward: [(0, '-0.698')] [2022-07-10 16:49:27,024][26022] Updated weights on worker 0-0, policy_version 808234 (0.00094) [2022-07-10 16:49:29,078][26022] Updated weights on worker 0-0, policy_version 808244 (0.00088) [2022-07-10 16:49:30,807][26022] Updated weights on worker 0-0, policy_version 808254 (0.00084) [2022-07-10 16:49:31,426][25689] Fps is (10 sec: 5659.1, 60 sec: 5575.6, 300 sec: 5547.3). Total num frames: 827655168. Throughput: 0: 4987.8. Samples: 827651622. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:31,426][25689] Avg episode reward: [(0, '-0.157')] [2022-07-10 16:49:32,628][26022] Updated weights on worker 0-0, policy_version 808264 (0.00050) [2022-07-10 16:49:34,383][26022] Updated weights on worker 0-0, policy_version 808274 (0.00086) [2022-07-10 16:49:36,334][26022] Updated weights on worker 0-0, policy_version 808284 (0.00097) [2022-07-10 16:49:36,427][25689] Fps is (10 sec: 5570.4, 60 sec: 5530.2, 300 sec: 5547.4). Total num frames: 827682816. Throughput: 0: 5845.3. Samples: 827685242. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:36,428][25689] Avg episode reward: [(0, '-0.183')] [2022-07-10 16:49:38,228][26022] Updated weights on worker 0-0, policy_version 808294 (0.00090) [2022-07-10 16:49:40,079][26022] Updated weights on worker 0-0, policy_version 808304 (0.00085) [2022-07-10 16:49:41,433][25689] Fps is (10 sec: 5628.0, 60 sec: 5550.9, 300 sec: 5547.7). Total num frames: 827711488. Throughput: 0: 5871.7. Samples: 827719226. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:41,433][25689] Avg episode reward: [(0, '-0.913')] [2022-07-10 16:49:41,873][26022] Updated weights on worker 0-0, policy_version 808314 (0.00090) [2022-07-10 16:49:43,631][26022] Updated weights on worker 0-0, policy_version 808324 (0.00103) [2022-07-10 16:49:45,595][26022] Updated weights on worker 0-0, policy_version 808334 (0.00092) [2022-07-10 16:49:46,455][25689] Fps is (10 sec: 5616.2, 60 sec: 5540.6, 300 sec: 5548.4). Total num frames: 827739136. Throughput: 0: 5023.2. Samples: 827735960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-10 16:49:46,456][25689] Avg episode reward: [(0, '-1.798')] [2022-07-10 16:49:47,247][26022] Updated weights on worker 0-0, policy_version 808344 (0.00089) [2022-07-10 16:49:49,335][26022] Updated weights on worker 0-0, policy_version 808354 (0.00087) [2022-07-10 16:49:50,967][26022] Updated weights on worker 0-0, policy_version 808364 (0.00106) [2022-07-10 16:49:51,527][25689] Fps is (10 sec: 5478.3, 60 sec: 5559.3, 300 sec: 5544.2). Total num frames: 827766784. Throughput: 0: 5848.9. Samples: 827769190. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:49:51,527][25689] Avg episode reward: [(0, '-2.554')] [2022-07-10 16:49:52,959][26022] Updated weights on worker 0-0, policy_version 808374 (0.00097) [2022-07-10 16:49:54,815][26022] Updated weights on worker 0-0, policy_version 808384 (0.00087) [2022-07-10 16:49:56,534][25689] Fps is (10 sec: 5486.4, 60 sec: 5548.7, 300 sec: 5544.3). Total num frames: 827794432. Throughput: 0: 5840.3. Samples: 827802672. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:49:56,536][25689] Avg episode reward: [(0, '-2.087')] [2022-07-10 16:49:56,597][26022] Updated weights on worker 0-0, policy_version 808394 (0.00093) [2022-07-10 16:49:58,396][26022] Updated weights on worker 0-0, policy_version 808404 (0.00092) [2022-07-10 16:50:00,219][26022] Updated weights on worker 0-0, policy_version 808414 (0.00088) [2022-07-10 16:50:01,542][25689] Fps is (10 sec: 5520.9, 60 sec: 5538.0, 300 sec: 5544.8). Total num frames: 827822080. Throughput: 0: 4986.6. Samples: 827819504. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:01,545][25689] Avg episode reward: [(0, '-3.085')] [2022-07-10 16:50:02,299][26022] Updated weights on worker 0-0, policy_version 808424 (0.00111) [2022-07-10 16:50:04,318][26022] Updated weights on worker 0-0, policy_version 808434 (0.00084) [2022-07-10 16:50:06,003][26022] Updated weights on worker 0-0, policy_version 808444 (0.00086) [2022-07-10 16:50:06,598][25689] Fps is (10 sec: 5291.2, 60 sec: 5516.5, 300 sec: 5545.8). Total num frames: 827847680. Throughput: 0: 5714.9. Samples: 827851072. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:06,600][25689] Avg episode reward: [(0, '-3.058')] [2022-07-10 16:50:07,880][26022] Updated weights on worker 0-0, policy_version 808454 (0.00097) [2022-07-10 16:50:10,005][26022] Updated weights on worker 0-0, policy_version 808464 (0.00093) [2022-07-10 16:50:11,492][26022] Updated weights on worker 0-0, policy_version 808474 (0.00094) [2022-07-10 16:50:11,691][25689] Fps is (10 sec: 5549.3, 60 sec: 5570.8, 300 sec: 5551.1). Total num frames: 827878400. Throughput: 0: 5698.7. Samples: 827884102. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:11,694][25689] Avg episode reward: [(0, '-3.318')] [2022-07-10 16:50:13,499][26022] Updated weights on worker 0-0, policy_version 808484 (0.00081) [2022-07-10 16:50:15,142][26022] Updated weights on worker 0-0, policy_version 808494 (0.00089) [2022-07-10 16:50:16,717][25689] Fps is (10 sec: 5667.0, 60 sec: 5540.7, 300 sec: 5544.9). Total num frames: 827905024. Throughput: 0: 4865.1. Samples: 827900862. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:16,718][25689] Avg episode reward: [(0, '-3.042')] [2022-07-10 16:50:17,258][26022] Updated weights on worker 0-0, policy_version 808504 (0.00092) [2022-07-10 16:50:19,162][26022] Updated weights on worker 0-0, policy_version 808514 (0.00093) [2022-07-10 16:50:20,703][26022] Updated weights on worker 0-0, policy_version 808524 (0.00781) [2022-07-10 16:50:21,778][25689] Fps is (10 sec: 5380.3, 60 sec: 5536.0, 300 sec: 5540.8). Total num frames: 827932672. Throughput: 0: 5666.7. Samples: 827934176. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:21,779][25689] Avg episode reward: [(0, '-2.349')] [2022-07-10 16:50:22,828][26022] Updated weights on worker 0-0, policy_version 808534 (0.00092) [2022-07-10 16:50:24,352][26022] Updated weights on worker 0-0, policy_version 808544 (0.00089) [2022-07-10 16:50:26,396][26022] Updated weights on worker 0-0, policy_version 808554 (0.00089) [2022-07-10 16:50:26,824][25689] Fps is (10 sec: 5673.3, 60 sec: 5550.9, 300 sec: 5544.9). Total num frames: 827962368. Throughput: 0: 5767.0. Samples: 827967718. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:26,825][25689] Avg episode reward: [(0, '-2.240')] [2022-07-10 16:50:28,108][26022] Updated weights on worker 0-0, policy_version 808564 (0.00087) [2022-07-10 16:50:29,949][26022] Updated weights on worker 0-0, policy_version 808574 (0.00084) [2022-07-10 16:50:31,982][25689] Fps is (10 sec: 5519.8, 60 sec: 5512.7, 300 sec: 5538.9). Total num frames: 827988992. Throughput: 0: 4943.9. Samples: 827984416. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:31,982][25689] Avg episode reward: [(0, '-1.291')] [2022-07-10 16:50:32,015][26022] Updated weights on worker 0-0, policy_version 808584 (0.00090) [2022-07-10 16:50:33,755][26022] Updated weights on worker 0-0, policy_version 808594 (0.00091) [2022-07-10 16:50:35,595][26022] Updated weights on worker 0-0, policy_version 808604 (0.00092) [2022-07-10 16:50:36,999][25689] Fps is (10 sec: 5434.7, 60 sec: 5528.1, 300 sec: 5545.9). Total num frames: 828017664. Throughput: 0: 5775.8. Samples: 828018010. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:37,000][25689] Avg episode reward: [(0, '0.691')] [2022-07-10 16:50:37,491][26022] Updated weights on worker 0-0, policy_version 808614 (0.00086) [2022-07-10 16:50:39,164][26022] Updated weights on worker 0-0, policy_version 808624 (0.00088) [2022-07-10 16:50:41,338][26022] Updated weights on worker 0-0, policy_version 808634 (0.00093) [2022-07-10 16:50:42,013][25689] Fps is (10 sec: 5716.5, 60 sec: 5527.4, 300 sec: 5546.2). Total num frames: 828046336. Throughput: 0: 5789.8. Samples: 828051330. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:42,014][25689] Avg episode reward: [(0, '0.481')] [2022-07-10 16:50:42,911][26022] Updated weights on worker 0-0, policy_version 808644 (0.00084) [2022-07-10 16:50:44,762][26022] Updated weights on worker 0-0, policy_version 808654 (0.00089) [2022-07-10 16:50:46,673][26022] Updated weights on worker 0-0, policy_version 808664 (0.00086) [2022-07-10 16:50:47,015][25689] Fps is (10 sec: 5521.1, 60 sec: 5512.4, 300 sec: 5540.7). Total num frames: 828072960. Throughput: 0: 4973.0. Samples: 828068124. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:47,015][25689] Avg episode reward: [(0, '0.571')] [2022-07-10 16:50:47,665][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:50:47,676][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000808668_828076032.pth [2022-07-10 16:50:47,677][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000806717_826078208.pth [2022-07-10 16:50:48,559][26022] Updated weights on worker 0-0, policy_version 808674 (0.00082) [2022-07-10 16:50:50,524][26022] Updated weights on worker 0-0, policy_version 808684 (0.00084) [2022-07-10 16:50:52,067][25689] Fps is (10 sec: 5499.7, 60 sec: 5530.9, 300 sec: 5537.2). Total num frames: 828101632. Throughput: 0: 5811.3. Samples: 828101140. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:52,068][25689] Avg episode reward: [(0, '-0.917')] [2022-07-10 16:50:52,193][26022] Updated weights on worker 0-0, policy_version 808694 (0.00085) [2022-07-10 16:50:54,111][26022] Updated weights on worker 0-0, policy_version 808704 (0.00086) [2022-07-10 16:50:55,887][26022] Updated weights on worker 0-0, policy_version 808714 (0.00081) [2022-07-10 16:50:57,111][25689] Fps is (10 sec: 5476.5, 60 sec: 5510.7, 300 sec: 5536.7). Total num frames: 828128256. Throughput: 0: 5794.7. Samples: 828134556. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:50:57,112][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 16:50:57,844][26022] Updated weights on worker 0-0, policy_version 808724 (0.00092) [2022-07-10 16:50:59,559][26022] Updated weights on worker 0-0, policy_version 808734 (0.00082) [2022-07-10 16:51:01,805][26022] Updated weights on worker 0-0, policy_version 808744 (0.00090) [2022-07-10 16:51:02,136][25689] Fps is (10 sec: 5288.5, 60 sec: 5492.3, 300 sec: 5533.0). Total num frames: 828154880. Throughput: 0: 4976.0. Samples: 828151462. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:02,137][25689] Avg episode reward: [(0, '0.050')] [2022-07-10 16:51:03,714][26022] Updated weights on worker 0-0, policy_version 808754 (0.00094) [2022-07-10 16:51:05,343][26022] Updated weights on worker 0-0, policy_version 808764 (0.00083) [2022-07-10 16:51:07,145][25689] Fps is (10 sec: 5511.0, 60 sec: 5547.3, 300 sec: 5544.3). Total num frames: 828183552. Throughput: 0: 5712.9. Samples: 828183130. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:07,146][25689] Avg episode reward: [(0, '0.074')] [2022-07-10 16:51:07,412][26022] Updated weights on worker 0-0, policy_version 808774 (0.00088) [2022-07-10 16:51:09,015][26022] Updated weights on worker 0-0, policy_version 808784 (0.00085) [2022-07-10 16:51:11,061][26022] Updated weights on worker 0-0, policy_version 808794 (0.00084) [2022-07-10 16:51:12,286][25689] Fps is (10 sec: 5548.8, 60 sec: 5492.2, 300 sec: 5538.3). Total num frames: 828211200. Throughput: 0: 5711.3. Samples: 828216614. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:12,287][25689] Avg episode reward: [(0, '-1.486')] [2022-07-10 16:51:12,824][26022] Updated weights on worker 0-0, policy_version 808804 (0.00089) [2022-07-10 16:51:14,760][26022] Updated weights on worker 0-0, policy_version 808814 (0.00083) [2022-07-10 16:51:16,535][26022] Updated weights on worker 0-0, policy_version 808824 (0.00099) [2022-07-10 16:51:17,325][25689] Fps is (10 sec: 5532.4, 60 sec: 5524.8, 300 sec: 5537.7). Total num frames: 828239872. Throughput: 0: 4888.2. Samples: 828233366. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:17,327][25689] Avg episode reward: [(0, '-0.714')] [2022-07-10 16:51:18,315][26022] Updated weights on worker 0-0, policy_version 808834 (0.00083) [2022-07-10 16:51:20,278][26022] Updated weights on worker 0-0, policy_version 808844 (0.00085) [2022-07-10 16:51:21,897][26022] Updated weights on worker 0-0, policy_version 808854 (0.00092) [2022-07-10 16:51:22,343][25689] Fps is (10 sec: 5600.2, 60 sec: 5528.8, 300 sec: 5541.0). Total num frames: 828267520. Throughput: 0: 5716.0. Samples: 828266964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:22,343][25689] Avg episode reward: [(0, '-0.759')] [2022-07-10 16:51:23,859][26022] Updated weights on worker 0-0, policy_version 808864 (0.00097) [2022-07-10 16:51:25,663][26022] Updated weights on worker 0-0, policy_version 808874 (0.00089) [2022-07-10 16:51:27,357][25689] Fps is (10 sec: 5614.3, 60 sec: 5514.8, 300 sec: 5539.0). Total num frames: 828296192. Throughput: 0: 5804.8. Samples: 828300454. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:27,357][25689] Avg episode reward: [(0, '-0.975')] [2022-07-10 16:51:27,577][26022] Updated weights on worker 0-0, policy_version 808884 (0.00090) [2022-07-10 16:51:29,328][26022] Updated weights on worker 0-0, policy_version 808894 (0.00087) [2022-07-10 16:51:31,436][26022] Updated weights on worker 0-0, policy_version 808904 (0.00080) [2022-07-10 16:51:32,412][25689] Fps is (10 sec: 5695.1, 60 sec: 5558.0, 300 sec: 5538.4). Total num frames: 828324864. Throughput: 0: 5826.2. Samples: 828333870. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:32,412][25689] Avg episode reward: [(0, '-1.618')] [2022-07-10 16:51:33,019][26022] Updated weights on worker 0-0, policy_version 808914 (0.00086) [2022-07-10 16:51:35,114][26022] Updated weights on worker 0-0, policy_version 808924 (0.00094) [2022-07-10 16:51:36,619][26022] Updated weights on worker 0-0, policy_version 808934 (0.00089) [2022-07-10 16:51:37,417][25689] Fps is (10 sec: 5598.3, 60 sec: 5542.2, 300 sec: 5540.1). Total num frames: 828352512. Throughput: 0: 5832.2. Samples: 828350546. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:37,418][25689] Avg episode reward: [(0, '-2.178')] [2022-07-10 16:51:38,769][26022] Updated weights on worker 0-0, policy_version 808944 (0.00092) [2022-07-10 16:51:40,278][26022] Updated weights on worker 0-0, policy_version 808954 (0.00091) [2022-07-10 16:51:42,423][26022] Updated weights on worker 0-0, policy_version 808964 (0.00087) [2022-07-10 16:51:42,517][25689] Fps is (10 sec: 5370.9, 60 sec: 5500.5, 300 sec: 5531.7). Total num frames: 828379136. Throughput: 0: 5784.2. Samples: 828383654. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:42,517][25689] Avg episode reward: [(0, '-0.932')] [2022-07-10 16:51:43,963][26022] Updated weights on worker 0-0, policy_version 808974 (0.00092) [2022-07-10 16:51:46,100][26022] Updated weights on worker 0-0, policy_version 808984 (0.00084) [2022-07-10 16:51:47,603][25689] Fps is (10 sec: 5529.2, 60 sec: 5543.5, 300 sec: 5538.1). Total num frames: 828408832. Throughput: 0: 5757.1. Samples: 828417012. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:47,605][25689] Avg episode reward: [(0, '-0.370')] [2022-07-10 16:51:47,736][26022] Updated weights on worker 0-0, policy_version 808994 (0.00088) [2022-07-10 16:51:49,770][26022] Updated weights on worker 0-0, policy_version 809004 (0.00091) [2022-07-10 16:51:51,507][26022] Updated weights on worker 0-0, policy_version 809014 (0.00083) [2022-07-10 16:51:52,695][25689] Fps is (10 sec: 5533.1, 60 sec: 5506.1, 300 sec: 5533.6). Total num frames: 828435456. Throughput: 0: 4916.9. Samples: 828433602. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:52,696][25689] Avg episode reward: [(0, '-0.441')] [2022-07-10 16:51:53,332][26022] Updated weights on worker 0-0, policy_version 809024 (0.00084) [2022-07-10 16:51:55,318][26022] Updated weights on worker 0-0, policy_version 809034 (0.00092) [2022-07-10 16:51:57,000][26022] Updated weights on worker 0-0, policy_version 809044 (0.00090) [2022-07-10 16:51:57,703][25689] Fps is (10 sec: 5576.5, 60 sec: 5560.2, 300 sec: 5538.6). Total num frames: 828465152. Throughput: 0: 5738.4. Samples: 828466952. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:51:57,703][25689] Avg episode reward: [(0, '-0.186')] [2022-07-10 16:51:58,900][26022] Updated weights on worker 0-0, policy_version 809054 (0.01381) [2022-07-10 16:52:00,612][26022] Updated weights on worker 0-0, policy_version 809064 (0.00084) [2022-07-10 16:52:02,738][25689] Fps is (10 sec: 5404.4, 60 sec: 5525.4, 300 sec: 5532.0). Total num frames: 828489728. Throughput: 0: 5725.5. Samples: 828499428. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:02,738][25689] Avg episode reward: [(0, '-0.263')] [2022-07-10 16:52:02,958][26022] Updated weights on worker 0-0, policy_version 809074 (0.00091) [2022-07-10 16:52:04,727][26022] Updated weights on worker 0-0, policy_version 809084 (0.00088) [2022-07-10 16:52:06,611][26022] Updated weights on worker 0-0, policy_version 809094 (0.00088) [2022-07-10 16:52:07,830][25689] Fps is (10 sec: 5257.8, 60 sec: 5517.8, 300 sec: 5538.0). Total num frames: 828518400. Throughput: 0: 4870.3. Samples: 828515524. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:07,831][25689] Avg episode reward: [(0, '-0.110')] [2022-07-10 16:52:08,314][26022] Updated weights on worker 0-0, policy_version 809104 (0.00086) [2022-07-10 16:52:10,347][26022] Updated weights on worker 0-0, policy_version 809114 (0.00083) [2022-07-10 16:52:11,967][26022] Updated weights on worker 0-0, policy_version 809124 (0.00091) [2022-07-10 16:52:12,951][25689] Fps is (10 sec: 5714.8, 60 sec: 5553.4, 300 sec: 5536.1). Total num frames: 828548096. Throughput: 0: 5697.0. Samples: 828548998. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:12,952][25689] Avg episode reward: [(0, '-0.966')] [2022-07-10 16:52:14,005][26022] Updated weights on worker 0-0, policy_version 809134 (0.00095) [2022-07-10 16:52:15,694][26022] Updated weights on worker 0-0, policy_version 809144 (0.00082) [2022-07-10 16:52:17,728][26022] Updated weights on worker 0-0, policy_version 809154 (0.00089) [2022-07-10 16:52:17,959][25689] Fps is (10 sec: 5560.1, 60 sec: 5522.5, 300 sec: 5532.8). Total num frames: 828574720. Throughput: 0: 5707.0. Samples: 828582558. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:17,960][25689] Avg episode reward: [(0, '-2.037')] [2022-07-10 16:52:19,404][26022] Updated weights on worker 0-0, policy_version 809164 (0.00102) [2022-07-10 16:52:21,227][26022] Updated weights on worker 0-0, policy_version 809174 (0.00092) [2022-07-10 16:52:22,968][25689] Fps is (10 sec: 5520.2, 60 sec: 5540.1, 300 sec: 5536.5). Total num frames: 828603392. Throughput: 0: 4929.7. Samples: 828599160. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:22,969][25689] Avg episode reward: [(0, '-2.323')] [2022-07-10 16:52:23,024][26022] Updated weights on worker 0-0, policy_version 809184 (0.00088) [2022-07-10 16:52:24,991][26022] Updated weights on worker 0-0, policy_version 809194 (0.00093) [2022-07-10 16:52:26,921][26022] Updated weights on worker 0-0, policy_version 809204 (0.00086) [2022-07-10 16:52:28,029][25689] Fps is (10 sec: 5593.0, 60 sec: 5519.0, 300 sec: 5533.1). Total num frames: 828631040. Throughput: 0: 5785.6. Samples: 828632388. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:28,030][25689] Avg episode reward: [(0, '-2.338')] [2022-07-10 16:52:28,699][26022] Updated weights on worker 0-0, policy_version 809214 (0.00085) [2022-07-10 16:52:30,615][26022] Updated weights on worker 0-0, policy_version 809224 (0.00087) [2022-07-10 16:52:32,203][26022] Updated weights on worker 0-0, policy_version 809234 (0.00090) [2022-07-10 16:52:33,160][25689] Fps is (10 sec: 5525.9, 60 sec: 5512.0, 300 sec: 5534.3). Total num frames: 828659712. Throughput: 0: 5778.3. Samples: 828665772. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:33,161][25689] Avg episode reward: [(0, '-2.097')] [2022-07-10 16:52:34,430][26022] Updated weights on worker 0-0, policy_version 809244 (0.00057) [2022-07-10 16:52:35,718][26022] Updated weights on worker 0-0, policy_version 809254 (0.00086) [2022-07-10 16:52:37,988][26022] Updated weights on worker 0-0, policy_version 809264 (0.00087) [2022-07-10 16:52:38,223][25689] Fps is (10 sec: 5725.9, 60 sec: 5540.6, 300 sec: 5536.8). Total num frames: 828689408. Throughput: 0: 4930.2. Samples: 828682460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:38,223][25689] Avg episode reward: [(0, '-1.027')] [2022-07-10 16:52:39,608][26022] Updated weights on worker 0-0, policy_version 809274 (0.00097) [2022-07-10 16:52:41,466][26022] Updated weights on worker 0-0, policy_version 809284 (0.00080) [2022-07-10 16:52:43,229][25689] Fps is (10 sec: 5593.4, 60 sec: 5549.0, 300 sec: 5536.9). Total num frames: 828716032. Throughput: 0: 5779.3. Samples: 828716254. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:43,230][25689] Avg episode reward: [(0, '0.142')] [2022-07-10 16:52:43,511][26022] Updated weights on worker 0-0, policy_version 809294 (0.00094) [2022-07-10 16:52:45,123][26022] Updated weights on worker 0-0, policy_version 809304 (0.00087) [2022-07-10 16:52:46,983][26022] Updated weights on worker 0-0, policy_version 809314 (0.00089) [2022-07-10 16:52:47,740][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:52:47,757][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000809318_828741632.pth [2022-07-10 16:52:47,757][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000807368_826744832.pth [2022-07-10 16:52:48,233][25689] Fps is (10 sec: 5626.1, 60 sec: 5556.6, 300 sec: 5537.9). Total num frames: 828745728. Throughput: 0: 5811.2. Samples: 828749798. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:48,235][25689] Avg episode reward: [(0, '0.214')] [2022-07-10 16:52:48,824][26022] Updated weights on worker 0-0, policy_version 809324 (0.00093) [2022-07-10 16:52:50,747][26022] Updated weights on worker 0-0, policy_version 809334 (0.00085) [2022-07-10 16:52:52,722][26022] Updated weights on worker 0-0, policy_version 809344 (0.00085) [2022-07-10 16:52:53,326][25689] Fps is (10 sec: 5476.5, 60 sec: 5539.6, 300 sec: 5526.4). Total num frames: 828771328. Throughput: 0: 4992.1. Samples: 828766442. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:53,328][25689] Avg episode reward: [(0, '0.205')] [2022-07-10 16:52:54,122][26022] Updated weights on worker 0-0, policy_version 809354 (0.00088) [2022-07-10 16:52:56,347][26022] Updated weights on worker 0-0, policy_version 809364 (0.00096) [2022-07-10 16:52:57,959][26022] Updated weights on worker 0-0, policy_version 809374 (0.00084) [2022-07-10 16:52:58,403][25689] Fps is (10 sec: 5336.5, 60 sec: 5516.4, 300 sec: 5532.1). Total num frames: 828800000. Throughput: 0: 5832.0. Samples: 828800152. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:52:58,405][25689] Avg episode reward: [(0, '0.794')] [2022-07-10 16:52:59,859][26022] Updated weights on worker 0-0, policy_version 809384 (0.00086) [2022-07-10 16:53:01,625][26022] Updated weights on worker 0-0, policy_version 809394 (0.00088) [2022-07-10 16:53:03,427][25689] Fps is (10 sec: 5474.6, 60 sec: 5551.2, 300 sec: 5531.9). Total num frames: 828826624. Throughput: 0: 5705.2. Samples: 828831484. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:53:03,427][25689] Avg episode reward: [(0, '0.538')] [2022-07-10 16:53:04,012][26022] Updated weights on worker 0-0, policy_version 809404 (0.00092) [2022-07-10 16:53:05,751][26022] Updated weights on worker 0-0, policy_version 809414 (0.00086) [2022-07-10 16:53:07,818][26022] Updated weights on worker 0-0, policy_version 809424 (0.00088) [2022-07-10 16:53:08,429][25689] Fps is (10 sec: 5515.5, 60 sec: 5559.5, 300 sec: 5537.7). Total num frames: 828855296. Throughput: 0: 4870.8. Samples: 828848166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:53:08,429][25689] Avg episode reward: [(0, '0.663')] [2022-07-10 16:53:09,370][26022] Updated weights on worker 0-0, policy_version 809434 (0.00085) [2022-07-10 16:53:11,196][26022] Updated weights on worker 0-0, policy_version 809444 (0.00091) [2022-07-10 16:53:12,912][26022] Updated weights on worker 0-0, policy_version 809454 (0.00091) [2022-07-10 16:53:13,490][25689] Fps is (10 sec: 5494.8, 60 sec: 5514.2, 300 sec: 5527.7). Total num frames: 828881920. Throughput: 0: 5728.8. Samples: 828881956. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:53:13,490][25689] Avg episode reward: [(0, '0.492')] [2022-07-10 16:53:14,963][26022] Updated weights on worker 0-0, policy_version 809464 (0.00125) [2022-07-10 16:53:16,812][26022] Updated weights on worker 0-0, policy_version 809474 (0.00083) [2022-07-10 16:53:18,506][25689] Fps is (10 sec: 5487.2, 60 sec: 5547.3, 300 sec: 5534.9). Total num frames: 828910592. Throughput: 0: 5727.1. Samples: 828915282. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 16:53:18,510][25689] Avg episode reward: [(0, '0.491')] [2022-07-10 16:53:18,568][26022] Updated weights on worker 0-0, policy_version 809484 (0.00097) [2022-07-10 16:53:20,388][26022] Updated weights on worker 0-0, policy_version 809494 (0.00090) [2022-07-10 16:53:22,439][26022] Updated weights on worker 0-0, policy_version 809504 (0.00111) [2022-07-10 16:53:23,527][25689] Fps is (10 sec: 5611.4, 60 sec: 5529.3, 300 sec: 5529.1). Total num frames: 828938240. Throughput: 0: 4995.7. Samples: 828931898. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:23,527][25689] Avg episode reward: [(0, '0.330')] [2022-07-10 16:53:24,247][26022] Updated weights on worker 0-0, policy_version 809514 (0.00104) [2022-07-10 16:53:26,128][26022] Updated weights on worker 0-0, policy_version 809524 (0.00091) [2022-07-10 16:53:27,981][26022] Updated weights on worker 0-0, policy_version 809534 (0.00091) [2022-07-10 16:53:28,546][25689] Fps is (10 sec: 5405.8, 60 sec: 5516.3, 300 sec: 5534.0). Total num frames: 828964864. Throughput: 0: 5800.4. Samples: 828964852. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:28,546][25689] Avg episode reward: [(0, '0.233')] [2022-07-10 16:53:30,035][26022] Updated weights on worker 0-0, policy_version 809544 (0.00100) [2022-07-10 16:53:31,723][26022] Updated weights on worker 0-0, policy_version 809554 (0.00093) [2022-07-10 16:53:33,573][26022] Updated weights on worker 0-0, policy_version 809564 (0.00087) [2022-07-10 16:53:33,599][25689] Fps is (10 sec: 5490.1, 60 sec: 5523.4, 300 sec: 5527.2). Total num frames: 828993536. Throughput: 0: 5768.2. Samples: 828997946. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:33,599][25689] Avg episode reward: [(0, '-0.193')] [2022-07-10 16:53:35,347][26022] Updated weights on worker 0-0, policy_version 809574 (0.00093) [2022-07-10 16:53:37,117][26022] Updated weights on worker 0-0, policy_version 809584 (0.00090) [2022-07-10 16:53:38,619][25689] Fps is (10 sec: 5489.4, 60 sec: 5476.4, 300 sec: 5524.3). Total num frames: 829020160. Throughput: 0: 4931.9. Samples: 829014478. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:38,619][25689] Avg episode reward: [(0, '-0.102')] [2022-07-10 16:53:39,106][26022] Updated weights on worker 0-0, policy_version 809594 (0.00087) [2022-07-10 16:53:40,902][26022] Updated weights on worker 0-0, policy_version 809604 (0.00087) [2022-07-10 16:53:42,613][26022] Updated weights on worker 0-0, policy_version 809614 (0.00086) [2022-07-10 16:53:43,628][25689] Fps is (10 sec: 5717.7, 60 sec: 5544.0, 300 sec: 5532.8). Total num frames: 829050880. Throughput: 0: 5777.4. Samples: 829048030. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:43,628][25689] Avg episode reward: [(0, '0.345')] [2022-07-10 16:53:44,678][26022] Updated weights on worker 0-0, policy_version 809624 (0.00090) [2022-07-10 16:53:46,345][26022] Updated weights on worker 0-0, policy_version 809634 (0.00963) [2022-07-10 16:53:48,451][26022] Updated weights on worker 0-0, policy_version 809644 (0.00088) [2022-07-10 16:53:48,647][25689] Fps is (10 sec: 5616.3, 60 sec: 5474.8, 300 sec: 5530.7). Total num frames: 829076480. Throughput: 0: 5801.8. Samples: 829081476. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:48,648][25689] Avg episode reward: [(0, '0.873')] [2022-07-10 16:53:49,927][26022] Updated weights on worker 0-0, policy_version 809654 (0.00089) [2022-07-10 16:53:52,050][26022] Updated weights on worker 0-0, policy_version 809664 (0.00086) [2022-07-10 16:53:53,713][25689] Fps is (10 sec: 5483.1, 60 sec: 5545.1, 300 sec: 5534.3). Total num frames: 829106176. Throughput: 0: 5799.7. Samples: 829114600. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:53,713][25689] Avg episode reward: [(0, '1.084')] [2022-07-10 16:53:53,722][26022] Updated weights on worker 0-0, policy_version 809674 (0.00080) [2022-07-10 16:53:55,734][26022] Updated weights on worker 0-0, policy_version 809684 (0.00082) [2022-07-10 16:53:57,300][26022] Updated weights on worker 0-0, policy_version 809694 (0.00086) [2022-07-10 16:53:58,795][25689] Fps is (10 sec: 5549.7, 60 sec: 5510.7, 300 sec: 5527.3). Total num frames: 829132800. Throughput: 0: 5802.8. Samples: 829131556. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:53:58,796][25689] Avg episode reward: [(0, '1.616')] [2022-07-10 16:53:59,463][26022] Updated weights on worker 0-0, policy_version 809704 (0.00084) [2022-07-10 16:54:00,980][26022] Updated weights on worker 0-0, policy_version 809714 (0.00093) [2022-07-10 16:54:03,439][26022] Updated weights on worker 0-0, policy_version 809724 (0.00074) [2022-07-10 16:54:03,844][25689] Fps is (10 sec: 5357.0, 60 sec: 5525.3, 300 sec: 5529.9). Total num frames: 829160448. Throughput: 0: 5680.1. Samples: 829162858. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:03,844][25689] Avg episode reward: [(0, '1.480')] [2022-07-10 16:54:05,079][26022] Updated weights on worker 0-0, policy_version 809734 (0.00090) [2022-07-10 16:54:06,995][26022] Updated weights on worker 0-0, policy_version 809744 (0.00090) [2022-07-10 16:54:08,867][25689] Fps is (10 sec: 5388.5, 60 sec: 5489.5, 300 sec: 5528.5). Total num frames: 829187072. Throughput: 0: 5681.3. Samples: 829196352. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:08,868][25689] Avg episode reward: [(0, '1.042')] [2022-07-10 16:54:09,072][26022] Updated weights on worker 0-0, policy_version 809754 (0.00088) [2022-07-10 16:54:10,680][26022] Updated weights on worker 0-0, policy_version 809764 (0.00089) [2022-07-10 16:54:12,662][26022] Updated weights on worker 0-0, policy_version 809774 (0.00091) [2022-07-10 16:54:13,927][25689] Fps is (10 sec: 5585.3, 60 sec: 5540.4, 300 sec: 5532.1). Total num frames: 829216768. Throughput: 0: 4874.2. Samples: 829213130. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:13,928][25689] Avg episode reward: [(0, '0.814')] [2022-07-10 16:54:14,175][26022] Updated weights on worker 0-0, policy_version 809784 (0.00131) [2022-07-10 16:54:16,152][26022] Updated weights on worker 0-0, policy_version 809794 (0.00089) [2022-07-10 16:54:17,969][26022] Updated weights on worker 0-0, policy_version 809804 (0.00092) [2022-07-10 16:54:18,932][25689] Fps is (10 sec: 5697.6, 60 sec: 5524.5, 300 sec: 5532.2). Total num frames: 829244416. Throughput: 0: 5731.3. Samples: 829246964. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:18,933][25689] Avg episode reward: [(0, '-0.361')] [2022-07-10 16:54:19,715][26022] Updated weights on worker 0-0, policy_version 809814 (0.00096) [2022-07-10 16:54:21,588][26022] Updated weights on worker 0-0, policy_version 809824 (0.00086) [2022-07-10 16:54:23,499][26022] Updated weights on worker 0-0, policy_version 809834 (0.00085) [2022-07-10 16:54:23,959][25689] Fps is (10 sec: 5613.9, 60 sec: 5540.8, 300 sec: 5532.1). Total num frames: 829273088. Throughput: 0: 5842.8. Samples: 829280392. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:23,961][25689] Avg episode reward: [(0, '-0.424')] [2022-07-10 16:54:25,395][26022] Updated weights on worker 0-0, policy_version 809844 (0.00096) [2022-07-10 16:54:27,157][26022] Updated weights on worker 0-0, policy_version 809854 (0.00086) [2022-07-10 16:54:28,970][25689] Fps is (10 sec: 5508.5, 60 sec: 5541.6, 300 sec: 5527.1). Total num frames: 829299712. Throughput: 0: 5006.7. Samples: 829297002. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:28,970][25689] Avg episode reward: [(0, '-0.512')] [2022-07-10 16:54:29,055][26022] Updated weights on worker 0-0, policy_version 809864 (0.00086) [2022-07-10 16:54:30,978][26022] Updated weights on worker 0-0, policy_version 809874 (0.00095) [2022-07-10 16:54:32,839][26022] Updated weights on worker 0-0, policy_version 809884 (0.00087) [2022-07-10 16:54:34,042][25689] Fps is (10 sec: 5484.4, 60 sec: 5539.9, 300 sec: 5529.2). Total num frames: 829328384. Throughput: 0: 5800.0. Samples: 829329796. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:34,042][25689] Avg episode reward: [(0, '-0.098')] [2022-07-10 16:54:34,684][26022] Updated weights on worker 0-0, policy_version 809894 (0.00085) [2022-07-10 16:54:36,547][26022] Updated weights on worker 0-0, policy_version 809904 (0.00090) [2022-07-10 16:54:38,276][26022] Updated weights on worker 0-0, policy_version 809914 (0.01049) [2022-07-10 16:54:39,062][25689] Fps is (10 sec: 5478.9, 60 sec: 5539.9, 300 sec: 5522.0). Total num frames: 829355008. Throughput: 0: 5779.4. Samples: 829363308. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:39,063][25689] Avg episode reward: [(0, '-0.158')] [2022-07-10 16:54:40,326][26022] Updated weights on worker 0-0, policy_version 809924 (0.00087) [2022-07-10 16:54:42,032][26022] Updated weights on worker 0-0, policy_version 809934 (0.00090) [2022-07-10 16:54:44,035][26022] Updated weights on worker 0-0, policy_version 809944 (0.00088) [2022-07-10 16:54:44,128][25689] Fps is (10 sec: 5381.1, 60 sec: 5483.9, 300 sec: 5521.2). Total num frames: 829382656. Throughput: 0: 4942.8. Samples: 829380078. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:44,128][25689] Avg episode reward: [(0, '-1.498')] [2022-07-10 16:54:45,654][26022] Updated weights on worker 0-0, policy_version 809954 (0.00093) [2022-07-10 16:54:47,752][26022] Updated weights on worker 0-0, policy_version 809964 (0.00085) [2022-07-10 16:54:47,882][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:54:47,896][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000809965_829404160.pth [2022-07-10 16:54:47,897][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000808019_827411456.pth [2022-07-10 16:54:49,184][25689] Fps is (10 sec: 5665.2, 60 sec: 5548.2, 300 sec: 5528.4). Total num frames: 829412352. Throughput: 0: 5766.7. Samples: 829413574. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:49,185][25689] Avg episode reward: [(0, '-0.498')] [2022-07-10 16:54:49,349][26022] Updated weights on worker 0-0, policy_version 809974 (0.00087) [2022-07-10 16:54:51,407][26022] Updated weights on worker 0-0, policy_version 809984 (0.00089) [2022-07-10 16:54:52,991][26022] Updated weights on worker 0-0, policy_version 809994 (0.00093) [2022-07-10 16:54:54,298][25689] Fps is (10 sec: 5537.9, 60 sec: 5493.1, 300 sec: 5523.0). Total num frames: 829438976. Throughput: 0: 5776.0. Samples: 829446792. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:54,298][25689] Avg episode reward: [(0, '-0.733')] [2022-07-10 16:54:54,837][26022] Updated weights on worker 0-0, policy_version 810004 (0.00090) [2022-07-10 16:54:56,916][26022] Updated weights on worker 0-0, policy_version 810014 (0.00094) [2022-07-10 16:54:58,676][26022] Updated weights on worker 0-0, policy_version 810024 (0.00091) [2022-07-10 16:54:59,390][25689] Fps is (10 sec: 5418.2, 60 sec: 5526.0, 300 sec: 5524.9). Total num frames: 829467648. Throughput: 0: 4925.6. Samples: 829463434. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:54:59,390][25689] Avg episode reward: [(0, '-0.250')] [2022-07-10 16:55:00,502][26022] Updated weights on worker 0-0, policy_version 810034 (0.00085) [2022-07-10 16:55:02,711][26022] Updated weights on worker 0-0, policy_version 810044 (0.00439) [2022-07-10 16:55:04,401][25689] Fps is (10 sec: 5473.1, 60 sec: 5512.5, 300 sec: 5529.1). Total num frames: 829494272. Throughput: 0: 5657.7. Samples: 829494778. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:04,401][25689] Avg episode reward: [(0, '-1.071')] [2022-07-10 16:55:04,442][26022] Updated weights on worker 0-0, policy_version 810054 (0.00092) [2022-07-10 16:55:06,335][26022] Updated weights on worker 0-0, policy_version 810064 (0.00088) [2022-07-10 16:55:08,184][26022] Updated weights on worker 0-0, policy_version 810074 (0.00085) [2022-07-10 16:55:09,425][25689] Fps is (10 sec: 5408.0, 60 sec: 5529.3, 300 sec: 5520.1). Total num frames: 829521920. Throughput: 0: 5688.4. Samples: 829528714. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:09,426][25689] Avg episode reward: [(0, '-0.130')] [2022-07-10 16:55:10,095][26022] Updated weights on worker 0-0, policy_version 810084 (0.00092) [2022-07-10 16:55:11,719][26022] Updated weights on worker 0-0, policy_version 810094 (0.00087) [2022-07-10 16:55:13,588][26022] Updated weights on worker 0-0, policy_version 810104 (0.00091) [2022-07-10 16:55:14,479][25689] Fps is (10 sec: 5588.2, 60 sec: 5513.0, 300 sec: 5526.4). Total num frames: 829550592. Throughput: 0: 4890.3. Samples: 829545486. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:14,481][25689] Avg episode reward: [(0, '-0.565')] [2022-07-10 16:55:15,388][26022] Updated weights on worker 0-0, policy_version 810114 (0.00090) [2022-07-10 16:55:17,416][26022] Updated weights on worker 0-0, policy_version 810124 (0.00090) [2022-07-10 16:55:19,256][26022] Updated weights on worker 0-0, policy_version 810134 (0.00096) [2022-07-10 16:55:19,569][25689] Fps is (10 sec: 5552.5, 60 sec: 5505.3, 300 sec: 5525.9). Total num frames: 829578240. Throughput: 0: 5721.1. Samples: 829578878. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:19,571][25689] Avg episode reward: [(0, '-1.332')] [2022-07-10 16:55:21,007][26022] Updated weights on worker 0-0, policy_version 810144 (0.00092) [2022-07-10 16:55:22,841][26022] Updated weights on worker 0-0, policy_version 810154 (0.00086) [2022-07-10 16:55:24,590][25689] Fps is (10 sec: 5671.8, 60 sec: 5522.8, 300 sec: 5526.4). Total num frames: 829607936. Throughput: 0: 5818.1. Samples: 829612238. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:24,591][25689] Avg episode reward: [(0, '-0.883')] [2022-07-10 16:55:24,595][26022] Updated weights on worker 0-0, policy_version 810164 (0.00362) [2022-07-10 16:55:26,616][26022] Updated weights on worker 0-0, policy_version 810174 (0.00086) [2022-07-10 16:55:28,374][26022] Updated weights on worker 0-0, policy_version 810184 (0.00087) [2022-07-10 16:55:29,592][25689] Fps is (10 sec: 5516.5, 60 sec: 5506.6, 300 sec: 5525.8). Total num frames: 829633536. Throughput: 0: 4969.7. Samples: 829628938. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:29,593][25689] Avg episode reward: [(0, '-1.259')] [2022-07-10 16:55:30,371][26022] Updated weights on worker 0-0, policy_version 810194 (0.00093) [2022-07-10 16:55:31,980][26022] Updated weights on worker 0-0, policy_version 810204 (0.00085) [2022-07-10 16:55:33,998][26022] Updated weights on worker 0-0, policy_version 810214 (0.00084) [2022-07-10 16:55:34,645][25689] Fps is (10 sec: 5397.4, 60 sec: 5508.4, 300 sec: 5525.2). Total num frames: 829662208. Throughput: 0: 5783.4. Samples: 829662112. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:34,645][25689] Avg episode reward: [(0, '-1.319')] [2022-07-10 16:55:35,702][26022] Updated weights on worker 0-0, policy_version 810224 (0.00094) [2022-07-10 16:55:37,802][26022] Updated weights on worker 0-0, policy_version 810234 (0.00092) [2022-07-10 16:55:39,427][26022] Updated weights on worker 0-0, policy_version 810244 (0.00099) [2022-07-10 16:55:39,669][25689] Fps is (10 sec: 5690.8, 60 sec: 5541.8, 300 sec: 5525.0). Total num frames: 829690880. Throughput: 0: 5800.3. Samples: 829695466. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:39,669][25689] Avg episode reward: [(0, '-1.266')] [2022-07-10 16:55:41,534][26022] Updated weights on worker 0-0, policy_version 810254 (0.00089) [2022-07-10 16:55:43,066][26022] Updated weights on worker 0-0, policy_version 810264 (0.00091) [2022-07-10 16:55:44,680][25689] Fps is (10 sec: 5510.1, 60 sec: 5529.9, 300 sec: 5524.8). Total num frames: 829717504. Throughput: 0: 4973.7. Samples: 829712162. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:44,681][25689] Avg episode reward: [(0, '-0.274')] [2022-07-10 16:55:45,137][26022] Updated weights on worker 0-0, policy_version 810274 (0.00086) [2022-07-10 16:55:46,794][26022] Updated weights on worker 0-0, policy_version 810284 (0.00095) [2022-07-10 16:55:48,790][26022] Updated weights on worker 0-0, policy_version 810294 (0.00091) [2022-07-10 16:55:49,690][25689] Fps is (10 sec: 5517.9, 60 sec: 5517.2, 300 sec: 5525.6). Total num frames: 829746176. Throughput: 0: 5802.3. Samples: 829745550. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:49,691][25689] Avg episode reward: [(0, '-0.276')] [2022-07-10 16:55:50,701][26022] Updated weights on worker 0-0, policy_version 810304 (0.00084) [2022-07-10 16:55:52,483][26022] Updated weights on worker 0-0, policy_version 810314 (0.00090) [2022-07-10 16:55:54,176][26022] Updated weights on worker 0-0, policy_version 810324 (0.00081) [2022-07-10 16:55:54,801][25689] Fps is (10 sec: 5565.0, 60 sec: 5534.4, 300 sec: 5527.8). Total num frames: 829773824. Throughput: 0: 5787.5. Samples: 829778764. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:54,801][25689] Avg episode reward: [(0, '-0.491')] [2022-07-10 16:55:56,231][26022] Updated weights on worker 0-0, policy_version 810334 (0.00082) [2022-07-10 16:55:57,926][26022] Updated weights on worker 0-0, policy_version 810344 (0.00088) [2022-07-10 16:55:59,857][25689] Fps is (10 sec: 5539.2, 60 sec: 5537.7, 300 sec: 5534.1). Total num frames: 829802496. Throughput: 0: 4963.1. Samples: 829795664. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:55:59,858][25689] Avg episode reward: [(0, '-0.281')] [2022-07-10 16:55:59,863][26022] Updated weights on worker 0-0, policy_version 810354 (0.00094) [2022-07-10 16:56:01,831][26022] Updated weights on worker 0-0, policy_version 810364 (0.00112) [2022-07-10 16:56:03,833][26022] Updated weights on worker 0-0, policy_version 810374 (0.00080) [2022-07-10 16:56:04,943][25689] Fps is (10 sec: 5452.2, 60 sec: 5530.9, 300 sec: 5525.8). Total num frames: 829829120. Throughput: 0: 5662.3. Samples: 829826894. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:04,943][25689] Avg episode reward: [(0, '0.428')] [2022-07-10 16:56:05,947][26022] Updated weights on worker 0-0, policy_version 810384 (0.00086) [2022-07-10 16:56:07,527][26022] Updated weights on worker 0-0, policy_version 810394 (0.00088) [2022-07-10 16:56:09,504][26022] Updated weights on worker 0-0, policy_version 810404 (0.00091) [2022-07-10 16:56:09,955][25689] Fps is (10 sec: 5374.8, 60 sec: 5532.0, 300 sec: 5528.2). Total num frames: 829856768. Throughput: 0: 5661.6. Samples: 829860282. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:09,955][25689] Avg episode reward: [(0, '-0.709')] [2022-07-10 16:56:11,366][26022] Updated weights on worker 0-0, policy_version 810414 (0.00090) [2022-07-10 16:56:12,904][26022] Updated weights on worker 0-0, policy_version 810424 (0.00086) [2022-07-10 16:56:15,059][25689] Fps is (10 sec: 5364.9, 60 sec: 5493.6, 300 sec: 5520.1). Total num frames: 829883392. Throughput: 0: 5687.2. Samples: 829893976. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:15,059][25689] Avg episode reward: [(0, '-0.809')] [2022-07-10 16:56:15,107][26022] Updated weights on worker 0-0, policy_version 810434 (0.00090) [2022-07-10 16:56:16,349][26022] Updated weights on worker 0-0, policy_version 810444 (0.00084) [2022-07-10 16:56:18,674][26022] Updated weights on worker 0-0, policy_version 810454 (0.00621) [2022-07-10 16:56:20,066][25689] Fps is (10 sec: 5671.6, 60 sec: 5551.9, 300 sec: 5530.6). Total num frames: 829914112. Throughput: 0: 5699.5. Samples: 829910840. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:20,066][25689] Avg episode reward: [(0, '-1.871')] [2022-07-10 16:56:20,417][26022] Updated weights on worker 0-0, policy_version 810464 (0.00086) [2022-07-10 16:56:22,097][26022] Updated weights on worker 0-0, policy_version 810474 (0.00093) [2022-07-10 16:56:24,163][26022] Updated weights on worker 0-0, policy_version 810484 (0.00086) [2022-07-10 16:56:25,087][25689] Fps is (10 sec: 5718.3, 60 sec: 5501.1, 300 sec: 5523.6). Total num frames: 829940736. Throughput: 0: 5835.7. Samples: 829944450. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:25,088][25689] Avg episode reward: [(0, '-2.605')] [2022-07-10 16:56:25,669][26022] Updated weights on worker 0-0, policy_version 810494 (0.00088) [2022-07-10 16:56:27,666][26022] Updated weights on worker 0-0, policy_version 810504 (0.00087) [2022-07-10 16:56:29,803][26022] Updated weights on worker 0-0, policy_version 810514 (0.00105) [2022-07-10 16:56:30,107][25689] Fps is (10 sec: 5404.9, 60 sec: 5533.3, 300 sec: 5520.8). Total num frames: 829968384. Throughput: 0: 5826.5. Samples: 829977698. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:30,107][25689] Avg episode reward: [(0, '-3.118')] [2022-07-10 16:56:31,307][26022] Updated weights on worker 0-0, policy_version 810524 (0.00088) [2022-07-10 16:56:33,354][26022] Updated weights on worker 0-0, policy_version 810534 (0.00087) [2022-07-10 16:56:35,015][26022] Updated weights on worker 0-0, policy_version 810544 (0.00091) [2022-07-10 16:56:35,173][25689] Fps is (10 sec: 5583.9, 60 sec: 5532.1, 300 sec: 5523.1). Total num frames: 829997056. Throughput: 0: 4990.6. Samples: 829994358. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:35,175][25689] Avg episode reward: [(0, '-2.901')] [2022-07-10 16:56:36,879][26022] Updated weights on worker 0-0, policy_version 810554 (0.00090) [2022-07-10 16:56:38,807][26022] Updated weights on worker 0-0, policy_version 810564 (0.00092) [2022-07-10 16:56:40,199][25689] Fps is (10 sec: 5580.6, 60 sec: 5515.0, 300 sec: 5527.9). Total num frames: 830024704. Throughput: 0: 5806.9. Samples: 830027754. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:40,200][25689] Avg episode reward: [(0, '-2.444')] [2022-07-10 16:56:40,624][26022] Updated weights on worker 0-0, policy_version 810574 (0.00088) [2022-07-10 16:56:42,654][26022] Updated weights on worker 0-0, policy_version 810584 (0.00085) [2022-07-10 16:56:44,376][26022] Updated weights on worker 0-0, policy_version 810594 (0.00088) [2022-07-10 16:56:45,218][25689] Fps is (10 sec: 5606.7, 60 sec: 5548.1, 300 sec: 5525.7). Total num frames: 830053376. Throughput: 0: 5792.7. Samples: 830061064. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:45,220][25689] Avg episode reward: [(0, '-2.578')] [2022-07-10 16:56:46,418][26022] Updated weights on worker 0-0, policy_version 810604 (0.00089) [2022-07-10 16:56:47,941][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:56:47,949][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000810613_830067712.pth [2022-07-10 16:56:47,951][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000808668_828076032.pth [2022-07-10 16:56:48,077][26022] Updated weights on worker 0-0, policy_version 810614 (0.00085) [2022-07-10 16:56:49,754][26022] Updated weights on worker 0-0, policy_version 810624 (0.00090) [2022-07-10 16:56:50,238][25689] Fps is (10 sec: 5508.0, 60 sec: 5513.3, 300 sec: 5527.1). Total num frames: 830080000. Throughput: 0: 4971.7. Samples: 830077786. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:50,240][25689] Avg episode reward: [(0, '-0.748')] [2022-07-10 16:56:51,762][26022] Updated weights on worker 0-0, policy_version 810634 (0.00091) [2022-07-10 16:56:53,549][26022] Updated weights on worker 0-0, policy_version 810644 (0.00096) [2022-07-10 16:56:55,343][25689] Fps is (10 sec: 5461.6, 60 sec: 5530.8, 300 sec: 5521.8). Total num frames: 830108672. Throughput: 0: 5796.5. Samples: 830111272. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-10 16:56:55,343][25689] Avg episode reward: [(0, '-0.574')] [2022-07-10 16:56:55,444][26022] Updated weights on worker 0-0, policy_version 810654 (0.00101) [2022-07-10 16:56:57,144][26022] Updated weights on worker 0-0, policy_version 810664 (0.00091) [2022-07-10 16:56:59,110][26022] Updated weights on worker 0-0, policy_version 810674 (0.00626) [2022-07-10 16:57:00,392][25689] Fps is (10 sec: 5647.8, 60 sec: 5531.6, 300 sec: 5535.3). Total num frames: 830137344. Throughput: 0: 5793.7. Samples: 830144744. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:00,392][25689] Avg episode reward: [(0, '0.095')] [2022-07-10 16:57:00,723][26022] Updated weights on worker 0-0, policy_version 810684 (0.00084) [2022-07-10 16:57:03,195][26022] Updated weights on worker 0-0, policy_version 810694 (0.00089) [2022-07-10 16:57:04,817][26022] Updated weights on worker 0-0, policy_version 810704 (0.00086) [2022-07-10 16:57:05,413][25689] Fps is (10 sec: 5389.1, 60 sec: 5520.4, 300 sec: 5526.3). Total num frames: 830162944. Throughput: 0: 4875.7. Samples: 830159530. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:05,414][25689] Avg episode reward: [(0, '0.214')] [2022-07-10 16:57:06,727][26022] Updated weights on worker 0-0, policy_version 810714 (0.00090) [2022-07-10 16:57:08,636][26022] Updated weights on worker 0-0, policy_version 810724 (0.00084) [2022-07-10 16:57:10,427][25689] Fps is (10 sec: 5306.1, 60 sec: 5520.3, 300 sec: 5521.4). Total num frames: 830190592. Throughput: 0: 5716.3. Samples: 830193190. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:10,427][25689] Avg episode reward: [(0, '-0.017')] [2022-07-10 16:57:10,432][26022] Updated weights on worker 0-0, policy_version 810734 (0.00092) [2022-07-10 16:57:12,398][26022] Updated weights on worker 0-0, policy_version 810744 (0.00087) [2022-07-10 16:57:14,075][26022] Updated weights on worker 0-0, policy_version 810754 (0.00086) [2022-07-10 16:57:15,528][25689] Fps is (10 sec: 5567.9, 60 sec: 5554.4, 300 sec: 5526.6). Total num frames: 830219264. Throughput: 0: 5725.4. Samples: 830226844. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:15,529][25689] Avg episode reward: [(0, '-0.826')] [2022-07-10 16:57:16,063][26022] Updated weights on worker 0-0, policy_version 810764 (0.00088) [2022-07-10 16:57:17,719][26022] Updated weights on worker 0-0, policy_version 810774 (0.00087) [2022-07-10 16:57:19,658][26022] Updated weights on worker 0-0, policy_version 810784 (0.00085) [2022-07-10 16:57:20,591][25689] Fps is (10 sec: 5641.9, 60 sec: 5515.5, 300 sec: 5525.6). Total num frames: 830247936. Throughput: 0: 4889.5. Samples: 830243508. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:20,591][25689] Avg episode reward: [(0, '-0.717')] [2022-07-10 16:57:21,499][26022] Updated weights on worker 0-0, policy_version 810794 (0.00075) [2022-07-10 16:57:23,083][26022] Updated weights on worker 0-0, policy_version 810804 (0.00092) [2022-07-10 16:57:25,082][26022] Updated weights on worker 0-0, policy_version 810814 (0.00085) [2022-07-10 16:57:25,601][25689] Fps is (10 sec: 5693.4, 60 sec: 5550.3, 300 sec: 5530.0). Total num frames: 830276608. Throughput: 0: 5827.0. Samples: 830277162. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:25,601][25689] Avg episode reward: [(0, '-1.108')] [2022-07-10 16:57:27,037][26022] Updated weights on worker 0-0, policy_version 810824 (0.00094) [2022-07-10 16:57:28,663][26022] Updated weights on worker 0-0, policy_version 810834 (0.00089) [2022-07-10 16:57:30,606][25689] Fps is (10 sec: 5521.1, 60 sec: 5534.8, 300 sec: 5525.4). Total num frames: 830303232. Throughput: 0: 5814.8. Samples: 830310528. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:30,607][25689] Avg episode reward: [(0, '-1.719')] [2022-07-10 16:57:30,805][26022] Updated weights on worker 0-0, policy_version 810844 (0.00094) [2022-07-10 16:57:32,508][26022] Updated weights on worker 0-0, policy_version 810854 (0.00092) [2022-07-10 16:57:34,463][26022] Updated weights on worker 0-0, policy_version 810864 (0.00082) [2022-07-10 16:57:35,716][25689] Fps is (10 sec: 5466.7, 60 sec: 5530.8, 300 sec: 5521.1). Total num frames: 830331904. Throughput: 0: 4960.5. Samples: 830326982. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:35,716][25689] Avg episode reward: [(0, '-1.538')] [2022-07-10 16:57:36,320][26022] Updated weights on worker 0-0, policy_version 810874 (0.00090) [2022-07-10 16:57:38,095][26022] Updated weights on worker 0-0, policy_version 810884 (0.00087) [2022-07-10 16:57:39,971][26022] Updated weights on worker 0-0, policy_version 810894 (0.00087) [2022-07-10 16:57:40,741][25689] Fps is (10 sec: 5557.1, 60 sec: 5530.9, 300 sec: 5524.2). Total num frames: 830359552. Throughput: 0: 5784.9. Samples: 830360074. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:40,741][25689] Avg episode reward: [(0, '-2.198')] [2022-07-10 16:57:41,641][26022] Updated weights on worker 0-0, policy_version 810904 (0.00086) [2022-07-10 16:57:43,682][26022] Updated weights on worker 0-0, policy_version 810914 (0.00087) [2022-07-10 16:57:45,438][26022] Updated weights on worker 0-0, policy_version 810924 (0.00094) [2022-07-10 16:57:45,777][25689] Fps is (10 sec: 5495.9, 60 sec: 5512.4, 300 sec: 5516.7). Total num frames: 830387200. Throughput: 0: 5759.5. Samples: 830393368. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:45,777][25689] Avg episode reward: [(0, '-1.699')] [2022-07-10 16:57:47,317][26022] Updated weights on worker 0-0, policy_version 810934 (0.00091) [2022-07-10 16:57:49,082][26022] Updated weights on worker 0-0, policy_version 810944 (0.00085) [2022-07-10 16:57:50,823][25689] Fps is (10 sec: 5484.7, 60 sec: 5527.0, 300 sec: 5524.5). Total num frames: 830414848. Throughput: 0: 4920.2. Samples: 830409998. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:50,823][25689] Avg episode reward: [(0, '-1.714')] [2022-07-10 16:57:51,024][26022] Updated weights on worker 0-0, policy_version 810954 (0.00085) [2022-07-10 16:57:52,792][26022] Updated weights on worker 0-0, policy_version 810964 (0.00087) [2022-07-10 16:57:54,707][26022] Updated weights on worker 0-0, policy_version 810974 (0.00093) [2022-07-10 16:57:55,922][25689] Fps is (10 sec: 5450.6, 60 sec: 5510.5, 300 sec: 5520.6). Total num frames: 830442496. Throughput: 0: 5746.4. Samples: 830443094. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:57:55,922][25689] Avg episode reward: [(0, '-0.671')] [2022-07-10 16:57:56,628][26022] Updated weights on worker 0-0, policy_version 810984 (0.00087) [2022-07-10 16:57:58,385][26022] Updated weights on worker 0-0, policy_version 810994 (0.00080) [2022-07-10 16:58:00,255][26022] Updated weights on worker 0-0, policy_version 811004 (0.00091) [2022-07-10 16:58:00,940][25689] Fps is (10 sec: 5667.8, 60 sec: 5530.2, 300 sec: 5531.1). Total num frames: 830472192. Throughput: 0: 5771.8. Samples: 830476660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:00,941][25689] Avg episode reward: [(0, '-0.879')] [2022-07-10 16:58:02,496][26022] Updated weights on worker 0-0, policy_version 811014 (0.00093) [2022-07-10 16:58:04,347][26022] Updated weights on worker 0-0, policy_version 811024 (0.00430) [2022-07-10 16:58:05,963][25689] Fps is (10 sec: 5405.0, 60 sec: 5513.2, 300 sec: 5516.9). Total num frames: 830496768. Throughput: 0: 4851.9. Samples: 830491308. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:05,963][25689] Avg episode reward: [(0, '-2.335')] [2022-07-10 16:58:06,299][26022] Updated weights on worker 0-0, policy_version 811034 (0.00086) [2022-07-10 16:58:08,018][26022] Updated weights on worker 0-0, policy_version 811044 (0.00086) [2022-07-10 16:58:09,809][26022] Updated weights on worker 0-0, policy_version 811054 (0.00082) [2022-07-10 16:58:10,976][25689] Fps is (10 sec: 5203.8, 60 sec: 5513.3, 300 sec: 5521.3). Total num frames: 830524416. Throughput: 0: 5696.0. Samples: 830524790. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:10,977][25689] Avg episode reward: [(0, '-1.576')] [2022-07-10 16:58:11,635][26022] Updated weights on worker 0-0, policy_version 811064 (0.00097) [2022-07-10 16:58:13,527][26022] Updated weights on worker 0-0, policy_version 811074 (0.00079) [2022-07-10 16:58:15,388][26022] Updated weights on worker 0-0, policy_version 811084 (0.00092) [2022-07-10 16:58:16,028][25689] Fps is (10 sec: 5595.5, 60 sec: 5517.8, 300 sec: 5520.6). Total num frames: 830553088. Throughput: 0: 5718.8. Samples: 830558078. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:16,029][25689] Avg episode reward: [(0, '-1.020')] [2022-07-10 16:58:17,287][26022] Updated weights on worker 0-0, policy_version 811094 (0.00085) [2022-07-10 16:58:19,004][26022] Updated weights on worker 0-0, policy_version 811104 (0.00088) [2022-07-10 16:58:20,854][26022] Updated weights on worker 0-0, policy_version 811114 (0.00087) [2022-07-10 16:58:21,031][25689] Fps is (10 sec: 5703.0, 60 sec: 5523.2, 300 sec: 5524.4). Total num frames: 830581760. Throughput: 0: 4881.6. Samples: 830574736. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:21,032][25689] Avg episode reward: [(0, '-0.900')] [2022-07-10 16:58:22,721][26022] Updated weights on worker 0-0, policy_version 811124 (0.00081) [2022-07-10 16:58:24,697][26022] Updated weights on worker 0-0, policy_version 811134 (0.00097) [2022-07-10 16:58:26,043][25689] Fps is (10 sec: 5521.6, 60 sec: 5489.2, 300 sec: 5524.5). Total num frames: 830608384. Throughput: 0: 5804.5. Samples: 830607860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:26,043][25689] Avg episode reward: [(0, '-0.966')] [2022-07-10 16:58:26,577][26022] Updated weights on worker 0-0, policy_version 811144 (0.00389) [2022-07-10 16:58:28,455][26022] Updated weights on worker 0-0, policy_version 811154 (0.00087) [2022-07-10 16:58:30,262][26022] Updated weights on worker 0-0, policy_version 811164 (0.00088) [2022-07-10 16:58:31,050][25689] Fps is (10 sec: 5314.4, 60 sec: 5488.9, 300 sec: 5518.4). Total num frames: 830635008. Throughput: 0: 5782.1. Samples: 830640864. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:31,051][25689] Avg episode reward: [(0, '-0.813')] [2022-07-10 16:58:31,983][26022] Updated weights on worker 0-0, policy_version 811174 (0.00091) [2022-07-10 16:58:34,028][26022] Updated weights on worker 0-0, policy_version 811184 (0.00082) [2022-07-10 16:58:35,777][26022] Updated weights on worker 0-0, policy_version 811194 (0.00085) [2022-07-10 16:58:36,095][25689] Fps is (10 sec: 5602.9, 60 sec: 5511.9, 300 sec: 5528.3). Total num frames: 830664704. Throughput: 0: 4941.5. Samples: 830657236. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:36,095][25689] Avg episode reward: [(0, '1.257')] [2022-07-10 16:58:37,746][26022] Updated weights on worker 0-0, policy_version 811204 (0.00085) [2022-07-10 16:58:39,366][26022] Updated weights on worker 0-0, policy_version 811214 (0.00084) [2022-07-10 16:58:41,107][25689] Fps is (10 sec: 5498.3, 60 sec: 5479.0, 300 sec: 5511.0). Total num frames: 830690304. Throughput: 0: 5774.4. Samples: 830690666. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:41,108][25689] Avg episode reward: [(0, '0.363')] [2022-07-10 16:58:41,351][26022] Updated weights on worker 0-0, policy_version 811224 (0.00094) [2022-07-10 16:58:43,143][26022] Updated weights on worker 0-0, policy_version 811234 (0.00086) [2022-07-10 16:58:45,007][26022] Updated weights on worker 0-0, policy_version 811244 (0.00087) [2022-07-10 16:58:46,120][25689] Fps is (10 sec: 5515.7, 60 sec: 5515.2, 300 sec: 5524.9). Total num frames: 830720000. Throughput: 0: 5794.9. Samples: 830724204. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:46,120][25689] Avg episode reward: [(0, '0.263')] [2022-07-10 16:58:46,938][26022] Updated weights on worker 0-0, policy_version 811254 (0.00093) [2022-07-10 16:58:47,984][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 16:58:47,994][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000811260_830730240.pth [2022-07-10 16:58:47,994][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000809318_828741632.pth [2022-07-10 16:58:48,460][26022] Updated weights on worker 0-0, policy_version 811264 (0.00089) [2022-07-10 16:58:50,580][26022] Updated weights on worker 0-0, policy_version 811274 (0.00100) [2022-07-10 16:58:51,131][25689] Fps is (10 sec: 5619.0, 60 sec: 5501.4, 300 sec: 5515.6). Total num frames: 830746624. Throughput: 0: 4996.5. Samples: 830741194. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:51,131][25689] Avg episode reward: [(0, '0.339')] [2022-07-10 16:58:52,253][26022] Updated weights on worker 0-0, policy_version 811284 (0.00084) [2022-07-10 16:58:54,301][26022] Updated weights on worker 0-0, policy_version 811294 (0.00087) [2022-07-10 16:58:56,032][26022] Updated weights on worker 0-0, policy_version 811304 (0.00094) [2022-07-10 16:58:56,192][25689] Fps is (10 sec: 5489.7, 60 sec: 5521.8, 300 sec: 5522.9). Total num frames: 830775296. Throughput: 0: 5834.3. Samples: 830774492. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:58:56,193][25689] Avg episode reward: [(0, '0.544')] [2022-07-10 16:58:57,947][26022] Updated weights on worker 0-0, policy_version 811314 (0.00090) [2022-07-10 16:58:59,622][26022] Updated weights on worker 0-0, policy_version 811324 (0.00087) [2022-07-10 16:59:01,196][25689] Fps is (10 sec: 5595.4, 60 sec: 5489.1, 300 sec: 5523.7). Total num frames: 830802944. Throughput: 0: 5847.0. Samples: 830808124. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:01,197][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 16:59:01,822][26022] Updated weights on worker 0-0, policy_version 811334 (0.00095) [2022-07-10 16:59:03,537][26022] Updated weights on worker 0-0, policy_version 811344 (0.00094) [2022-07-10 16:59:05,626][26022] Updated weights on worker 0-0, policy_version 811354 (0.00087) [2022-07-10 16:59:06,197][25689] Fps is (10 sec: 5424.5, 60 sec: 5525.1, 300 sec: 5524.1). Total num frames: 830829568. Throughput: 0: 4911.1. Samples: 830822806. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:06,198][25689] Avg episode reward: [(0, '-0.754')] [2022-07-10 16:59:07,317][26022] Updated weights on worker 0-0, policy_version 811364 (0.00089) [2022-07-10 16:59:09,004][26022] Updated weights on worker 0-0, policy_version 811374 (0.00082) [2022-07-10 16:59:10,913][26022] Updated weights on worker 0-0, policy_version 811384 (0.00084) [2022-07-10 16:59:11,214][25689] Fps is (10 sec: 5519.7, 60 sec: 5541.7, 300 sec: 5521.5). Total num frames: 830858240. Throughput: 0: 5750.3. Samples: 830856680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:11,215][25689] Avg episode reward: [(0, '-0.797')] [2022-07-10 16:59:12,748][26022] Updated weights on worker 0-0, policy_version 811394 (0.00093) [2022-07-10 16:59:14,652][26022] Updated weights on worker 0-0, policy_version 811404 (0.00088) [2022-07-10 16:59:16,261][25689] Fps is (10 sec: 5596.4, 60 sec: 5525.2, 300 sec: 5520.7). Total num frames: 830885888. Throughput: 0: 5768.1. Samples: 830890250. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:16,261][25689] Avg episode reward: [(0, '-2.218')] [2022-07-10 16:59:16,699][26022] Updated weights on worker 0-0, policy_version 811414 (0.00089) [2022-07-10 16:59:18,148][26022] Updated weights on worker 0-0, policy_version 811424 (0.00093) [2022-07-10 16:59:20,175][26022] Updated weights on worker 0-0, policy_version 811434 (0.00085) [2022-07-10 16:59:21,266][25689] Fps is (10 sec: 5602.6, 60 sec: 5525.0, 300 sec: 5521.1). Total num frames: 830914560. Throughput: 0: 4928.9. Samples: 830907048. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:21,268][25689] Avg episode reward: [(0, '-2.263')] [2022-07-10 16:59:22,021][26022] Updated weights on worker 0-0, policy_version 811444 (0.00091) [2022-07-10 16:59:23,883][26022] Updated weights on worker 0-0, policy_version 811454 (0.00086) [2022-07-10 16:59:25,638][26022] Updated weights on worker 0-0, policy_version 811464 (0.00106) [2022-07-10 16:59:26,274][25689] Fps is (10 sec: 5624.5, 60 sec: 5542.3, 300 sec: 5524.6). Total num frames: 830942208. Throughput: 0: 5881.9. Samples: 830940898. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:26,275][25689] Avg episode reward: [(0, '-3.178')] [2022-07-10 16:59:27,568][26022] Updated weights on worker 0-0, policy_version 811474 (0.00090) [2022-07-10 16:59:29,251][26022] Updated weights on worker 0-0, policy_version 811484 (0.00090) [2022-07-10 16:59:31,273][26022] Updated weights on worker 0-0, policy_version 811494 (0.00087) [2022-07-10 16:59:31,287][25689] Fps is (10 sec: 5518.2, 60 sec: 5558.9, 300 sec: 5522.3). Total num frames: 830969856. Throughput: 0: 5862.9. Samples: 830974366. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:31,288][25689] Avg episode reward: [(0, '-3.858')] [2022-07-10 16:59:32,901][26022] Updated weights on worker 0-0, policy_version 811504 (0.00092) [2022-07-10 16:59:35,036][26022] Updated weights on worker 0-0, policy_version 811514 (0.00086) [2022-07-10 16:59:36,381][25689] Fps is (10 sec: 5673.9, 60 sec: 5554.3, 300 sec: 5531.3). Total num frames: 830999552. Throughput: 0: 5840.6. Samples: 831007762. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:36,381][25689] Avg episode reward: [(0, '-2.788')] [2022-07-10 16:59:36,552][26022] Updated weights on worker 0-0, policy_version 811524 (0.00090) [2022-07-10 16:59:38,587][26022] Updated weights on worker 0-0, policy_version 811534 (0.00090) [2022-07-10 16:59:40,192][26022] Updated weights on worker 0-0, policy_version 811544 (0.00094) [2022-07-10 16:59:41,415][25689] Fps is (10 sec: 5358.8, 60 sec: 5535.4, 300 sec: 5521.5). Total num frames: 831024128. Throughput: 0: 5829.5. Samples: 831024502. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:41,415][25689] Avg episode reward: [(0, '-2.795')] [2022-07-10 16:59:42,123][26022] Updated weights on worker 0-0, policy_version 811554 (0.00086) [2022-07-10 16:59:44,170][26022] Updated weights on worker 0-0, policy_version 811564 (0.00090) [2022-07-10 16:59:45,870][26022] Updated weights on worker 0-0, policy_version 811574 (0.00093) [2022-07-10 16:59:46,443][25689] Fps is (10 sec: 5597.3, 60 sec: 5567.9, 300 sec: 5528.9). Total num frames: 831055872. Throughput: 0: 5797.2. Samples: 831057820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:46,443][25689] Avg episode reward: [(0, '-2.038')] [2022-07-10 16:59:48,032][26022] Updated weights on worker 0-0, policy_version 811584 (0.00093) [2022-07-10 16:59:49,486][26022] Updated weights on worker 0-0, policy_version 811594 (0.00086) [2022-07-10 16:59:51,486][25689] Fps is (10 sec: 5693.6, 60 sec: 5547.9, 300 sec: 5526.8). Total num frames: 831081472. Throughput: 0: 5786.4. Samples: 831091248. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:51,487][25689] Avg episode reward: [(0, '-2.157')] [2022-07-10 16:59:51,508][26022] Updated weights on worker 0-0, policy_version 811604 (0.00086) [2022-07-10 16:59:53,186][26022] Updated weights on worker 0-0, policy_version 811614 (0.00090) [2022-07-10 16:59:55,135][26022] Updated weights on worker 0-0, policy_version 811624 (0.00052) [2022-07-10 16:59:56,605][25689] Fps is (10 sec: 5441.5, 60 sec: 5559.7, 300 sec: 5529.7). Total num frames: 831111168. Throughput: 0: 4956.9. Samples: 831108010. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 16:59:56,605][25689] Avg episode reward: [(0, '-0.479')] [2022-07-10 16:59:56,911][26022] Updated weights on worker 0-0, policy_version 811634 (0.00092) [2022-07-10 16:59:58,944][26022] Updated weights on worker 0-0, policy_version 811644 (0.00089) [2022-07-10 17:00:00,767][26022] Updated weights on worker 0-0, policy_version 811654 (0.00089) [2022-07-10 17:00:01,607][25689] Fps is (10 sec: 5666.2, 60 sec: 5559.8, 300 sec: 5533.3). Total num frames: 831138816. Throughput: 0: 5776.3. Samples: 831141136. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 17:00:01,607][25689] Avg episode reward: [(0, '-0.490')] [2022-07-10 17:00:02,908][26022] Updated weights on worker 0-0, policy_version 811664 (0.00093) [2022-07-10 17:00:04,716][26022] Updated weights on worker 0-0, policy_version 811674 (0.00085) [2022-07-10 17:00:06,570][26022] Updated weights on worker 0-0, policy_version 811684 (0.00084) [2022-07-10 17:00:06,673][25689] Fps is (10 sec: 5288.9, 60 sec: 5536.9, 300 sec: 5525.7). Total num frames: 831164416. Throughput: 0: 5683.8. Samples: 831172802. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 17:00:06,673][25689] Avg episode reward: [(0, '-1.600')] [2022-07-10 17:00:08,406][26022] Updated weights on worker 0-0, policy_version 811694 (0.00096) [2022-07-10 17:00:10,340][26022] Updated weights on worker 0-0, policy_version 811704 (0.00086) [2022-07-10 17:00:11,716][25689] Fps is (10 sec: 5368.6, 60 sec: 5534.5, 300 sec: 5525.9). Total num frames: 831193088. Throughput: 0: 4855.9. Samples: 831189480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 17:00:11,716][25689] Avg episode reward: [(0, '-0.977')] [2022-07-10 17:00:12,144][26022] Updated weights on worker 0-0, policy_version 811714 (0.00087) [2022-07-10 17:00:14,039][26022] Updated weights on worker 0-0, policy_version 811724 (0.00090) [2022-07-10 17:00:15,913][26022] Updated weights on worker 0-0, policy_version 811734 (0.00089) [2022-07-10 17:00:16,787][25689] Fps is (10 sec: 5669.7, 60 sec: 5549.2, 300 sec: 5529.7). Total num frames: 831221760. Throughput: 0: 5702.4. Samples: 831223096. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 17:00:16,787][25689] Avg episode reward: [(0, '-0.628')] [2022-07-10 17:00:17,552][26022] Updated weights on worker 0-0, policy_version 811744 (0.00989) [2022-07-10 17:00:19,349][26022] Updated weights on worker 0-0, policy_version 811754 (0.00082) [2022-07-10 17:00:21,171][26022] Updated weights on worker 0-0, policy_version 811764 (0.00082) [2022-07-10 17:00:21,803][25689] Fps is (10 sec: 5583.5, 60 sec: 5531.4, 300 sec: 5522.9). Total num frames: 831249408. Throughput: 0: 5719.0. Samples: 831256638. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 17:00:21,807][25689] Avg episode reward: [(0, '-0.570')] [2022-07-10 17:00:23,117][26022] Updated weights on worker 0-0, policy_version 811774 (0.00103) [2022-07-10 17:00:24,895][26022] Updated weights on worker 0-0, policy_version 811784 (0.00087) [2022-07-10 17:00:26,833][25689] Fps is (10 sec: 5503.9, 60 sec: 5529.3, 300 sec: 5529.2). Total num frames: 831277056. Throughput: 0: 4989.7. Samples: 831273398. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2022-07-10 17:00:26,834][25689] Avg episode reward: [(0, '-0.158')] [2022-07-10 17:00:26,835][26022] Updated weights on worker 0-0, policy_version 811794 (0.00089) [2022-07-10 17:00:28,640][26022] Updated weights on worker 0-0, policy_version 811804 (0.00087) [2022-07-10 17:00:30,436][26022] Updated weights on worker 0-0, policy_version 811814 (0.00083) [2022-07-10 17:00:31,868][25689] Fps is (10 sec: 5493.9, 60 sec: 5527.3, 300 sec: 5526.1). Total num frames: 831304704. Throughput: 0: 5808.5. Samples: 831306532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:00:31,877][25689] Avg episode reward: [(0, '-0.922')] [2022-07-10 17:00:32,302][26022] Updated weights on worker 0-0, policy_version 811824 (0.00094) [2022-07-10 17:00:34,149][26022] Updated weights on worker 0-0, policy_version 811834 (0.00087) [2022-07-10 17:00:35,968][26022] Updated weights on worker 0-0, policy_version 811844 (0.00089) [2022-07-10 17:00:36,986][25689] Fps is (10 sec: 5547.1, 60 sec: 5508.1, 300 sec: 5524.4). Total num frames: 831333376. Throughput: 0: 5773.9. Samples: 831339728. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:00:36,989][25689] Avg episode reward: [(0, '0.099')] [2022-07-10 17:00:37,927][26022] Updated weights on worker 0-0, policy_version 811854 (0.00069) [2022-07-10 17:00:39,565][26022] Updated weights on worker 0-0, policy_version 811864 (0.00085) [2022-07-10 17:00:41,715][26022] Updated weights on worker 0-0, policy_version 811874 (0.00093) [2022-07-10 17:00:42,020][25689] Fps is (10 sec: 5446.4, 60 sec: 5541.9, 300 sec: 5524.0). Total num frames: 831360000. Throughput: 0: 4936.3. Samples: 831356438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:00:42,021][25689] Avg episode reward: [(0, '0.254')] [2022-07-10 17:00:43,312][26022] Updated weights on worker 0-0, policy_version 811884 (0.00085) [2022-07-10 17:00:45,327][26022] Updated weights on worker 0-0, policy_version 811894 (0.00115) [2022-07-10 17:00:47,051][25689] Fps is (10 sec: 5493.9, 60 sec: 5491.0, 300 sec: 5523.6). Total num frames: 831388672. Throughput: 0: 5765.3. Samples: 831389960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:00:47,052][25689] Avg episode reward: [(0, '-0.340')] [2022-07-10 17:00:47,058][26022] Updated weights on worker 0-0, policy_version 811904 (0.00095) [2022-07-10 17:00:48,247][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:00:48,257][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000811909_831394816.pth [2022-07-10 17:00:48,258][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000809965_829404160.pth [2022-07-10 17:00:48,926][26022] Updated weights on worker 0-0, policy_version 811914 (0.00087) [2022-07-10 17:00:51,011][26022] Updated weights on worker 0-0, policy_version 811924 (0.00093) [2022-07-10 17:00:52,071][25689] Fps is (10 sec: 5603.8, 60 sec: 5527.0, 300 sec: 5525.3). Total num frames: 831416320. Throughput: 0: 5775.7. Samples: 831423220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:00:52,071][25689] Avg episode reward: [(0, '-0.499')] [2022-07-10 17:00:52,577][26022] Updated weights on worker 0-0, policy_version 811934 (0.00092) [2022-07-10 17:00:54,639][26022] Updated weights on worker 0-0, policy_version 811944 (0.00095) [2022-07-10 17:00:56,252][26022] Updated weights on worker 0-0, policy_version 811954 (0.00090) [2022-07-10 17:00:57,201][25689] Fps is (10 sec: 5448.1, 60 sec: 5492.1, 300 sec: 5520.5). Total num frames: 831443968. Throughput: 0: 4942.4. Samples: 831439636. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:00:57,202][25689] Avg episode reward: [(0, '-0.265')] [2022-07-10 17:00:58,203][26022] Updated weights on worker 0-0, policy_version 811964 (0.00079) [2022-07-10 17:01:00,006][26022] Updated weights on worker 0-0, policy_version 811974 (0.00095) [2022-07-10 17:01:02,217][25689] Fps is (10 sec: 5349.0, 60 sec: 5473.9, 300 sec: 5521.7). Total num frames: 831470592. Throughput: 0: 5772.3. Samples: 831473018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:02,217][25689] Avg episode reward: [(0, '-0.403')] [2022-07-10 17:01:02,413][26022] Updated weights on worker 0-0, policy_version 811984 (0.00087) [2022-07-10 17:01:04,180][26022] Updated weights on worker 0-0, policy_version 811994 (0.00619) [2022-07-10 17:01:06,090][26022] Updated weights on worker 0-0, policy_version 812004 (0.00095) [2022-07-10 17:01:07,307][25689] Fps is (10 sec: 5370.3, 60 sec: 5505.5, 300 sec: 5520.3). Total num frames: 831498240. Throughput: 0: 5640.9. Samples: 831504220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:07,307][25689] Avg episode reward: [(0, '-1.069')] [2022-07-10 17:01:07,668][26022] Updated weights on worker 0-0, policy_version 812014 (0.00116) [2022-07-10 17:01:09,759][26022] Updated weights on worker 0-0, policy_version 812024 (0.00088) [2022-07-10 17:01:11,497][26022] Updated weights on worker 0-0, policy_version 812034 (0.00087) [2022-07-10 17:01:12,379][25689] Fps is (10 sec: 5441.6, 60 sec: 5486.1, 300 sec: 5524.3). Total num frames: 831525888. Throughput: 0: 4808.5. Samples: 831520870. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:12,379][25689] Avg episode reward: [(0, '-0.746')] [2022-07-10 17:01:13,415][26022] Updated weights on worker 0-0, policy_version 812044 (0.00088) [2022-07-10 17:01:15,295][26022] Updated weights on worker 0-0, policy_version 812054 (0.00088) [2022-07-10 17:01:16,885][26022] Updated weights on worker 0-0, policy_version 812064 (0.00089) [2022-07-10 17:01:17,469][25689] Fps is (10 sec: 5643.1, 60 sec: 5501.2, 300 sec: 5519.3). Total num frames: 831555584. Throughput: 0: 5658.5. Samples: 831554320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:17,469][25689] Avg episode reward: [(0, '-0.890')] [2022-07-10 17:01:18,983][26022] Updated weights on worker 0-0, policy_version 812074 (0.00089) [2022-07-10 17:01:20,752][26022] Updated weights on worker 0-0, policy_version 812084 (0.00084) [2022-07-10 17:01:22,478][25689] Fps is (10 sec: 5678.1, 60 sec: 5501.8, 300 sec: 5523.0). Total num frames: 831583232. Throughput: 0: 5659.8. Samples: 831587690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:22,479][25689] Avg episode reward: [(0, '-1.991')] [2022-07-10 17:01:22,620][26022] Updated weights on worker 0-0, policy_version 812094 (0.00084) [2022-07-10 17:01:24,299][26022] Updated weights on worker 0-0, policy_version 812104 (0.00086) [2022-07-10 17:01:26,478][26022] Updated weights on worker 0-0, policy_version 812114 (0.00091) [2022-07-10 17:01:27,493][25689] Fps is (10 sec: 5720.3, 60 sec: 5537.0, 300 sec: 5530.0). Total num frames: 831612928. Throughput: 0: 4971.6. Samples: 831604578. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:27,494][25689] Avg episode reward: [(0, '-1.769')] [2022-07-10 17:01:28,031][26022] Updated weights on worker 0-0, policy_version 812124 (0.00081) [2022-07-10 17:01:30,200][26022] Updated weights on worker 0-0, policy_version 812134 (0.00089) [2022-07-10 17:01:31,827][26022] Updated weights on worker 0-0, policy_version 812144 (0.00088) [2022-07-10 17:01:32,535][25689] Fps is (10 sec: 5498.3, 60 sec: 5502.6, 300 sec: 5520.1). Total num frames: 831638528. Throughput: 0: 5790.9. Samples: 831637592. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:32,536][25689] Avg episode reward: [(0, '-1.083')] [2022-07-10 17:01:33,796][26022] Updated weights on worker 0-0, policy_version 812154 (0.00092) [2022-07-10 17:01:35,730][26022] Updated weights on worker 0-0, policy_version 812164 (0.00083) [2022-07-10 17:01:37,331][26022] Updated weights on worker 0-0, policy_version 812174 (0.00087) [2022-07-10 17:01:37,645][25689] Fps is (10 sec: 5346.3, 60 sec: 5503.3, 300 sec: 5522.0). Total num frames: 831667200. Throughput: 0: 5774.7. Samples: 831670830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:37,647][25689] Avg episode reward: [(0, '-0.883')] [2022-07-10 17:01:39,247][26022] Updated weights on worker 0-0, policy_version 812184 (0.00083) [2022-07-10 17:01:41,159][26022] Updated weights on worker 0-0, policy_version 812194 (0.00097) [2022-07-10 17:01:42,655][25689] Fps is (10 sec: 5666.4, 60 sec: 5539.3, 300 sec: 5522.2). Total num frames: 831695872. Throughput: 0: 5786.8. Samples: 831704450. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:42,656][25689] Avg episode reward: [(0, '-0.529')] [2022-07-10 17:01:42,890][26022] Updated weights on worker 0-0, policy_version 812204 (0.00101) [2022-07-10 17:01:44,887][26022] Updated weights on worker 0-0, policy_version 812214 (0.00096) [2022-07-10 17:01:46,462][26022] Updated weights on worker 0-0, policy_version 812224 (0.00091) [2022-07-10 17:01:47,685][25689] Fps is (10 sec: 5507.6, 60 sec: 5505.6, 300 sec: 5522.0). Total num frames: 831722496. Throughput: 0: 5778.5. Samples: 831721254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:47,686][25689] Avg episode reward: [(0, '-1.793')] [2022-07-10 17:01:48,293][26022] Updated weights on worker 0-0, policy_version 812234 (0.00080) [2022-07-10 17:01:50,197][26022] Updated weights on worker 0-0, policy_version 812244 (0.00090) [2022-07-10 17:01:52,004][26022] Updated weights on worker 0-0, policy_version 812254 (0.00093) [2022-07-10 17:01:52,706][25689] Fps is (10 sec: 5501.4, 60 sec: 5522.3, 300 sec: 5523.5). Total num frames: 831751168. Throughput: 0: 5813.0. Samples: 831754848. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:52,706][25689] Avg episode reward: [(0, '-1.416')] [2022-07-10 17:01:53,935][26022] Updated weights on worker 0-0, policy_version 812264 (0.00086) [2022-07-10 17:01:55,555][26022] Updated weights on worker 0-0, policy_version 812274 (0.00093) [2022-07-10 17:01:57,530][26022] Updated weights on worker 0-0, policy_version 812284 (0.00099) [2022-07-10 17:01:57,800][25689] Fps is (10 sec: 5770.2, 60 sec: 5559.4, 300 sec: 5526.1). Total num frames: 831780864. Throughput: 0: 5820.8. Samples: 831788150. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:01:57,801][25689] Avg episode reward: [(0, '-2.149')] [2022-07-10 17:01:59,478][26022] Updated weights on worker 0-0, policy_version 812294 (0.00094) [2022-07-10 17:02:01,147][26022] Updated weights on worker 0-0, policy_version 812304 (0.00089) [2022-07-10 17:02:02,814][25689] Fps is (10 sec: 5267.8, 60 sec: 5508.9, 300 sec: 5519.4). Total num frames: 831804416. Throughput: 0: 4977.6. Samples: 831804796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:02,815][25689] Avg episode reward: [(0, '-2.105')] [2022-07-10 17:02:03,574][26022] Updated weights on worker 0-0, policy_version 812314 (0.00087) [2022-07-10 17:02:05,248][26022] Updated weights on worker 0-0, policy_version 812324 (0.00093) [2022-07-10 17:02:07,277][26022] Updated weights on worker 0-0, policy_version 812334 (0.00087) [2022-07-10 17:02:07,879][25689] Fps is (10 sec: 5181.7, 60 sec: 5528.1, 300 sec: 5521.9). Total num frames: 831833088. Throughput: 0: 5664.9. Samples: 831835650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:07,879][25689] Avg episode reward: [(0, '-3.031')] [2022-07-10 17:02:08,904][26022] Updated weights on worker 0-0, policy_version 812344 (0.00081) [2022-07-10 17:02:10,924][26022] Updated weights on worker 0-0, policy_version 812354 (0.00089) [2022-07-10 17:02:12,558][26022] Updated weights on worker 0-0, policy_version 812364 (0.00084) [2022-07-10 17:02:12,945][25689] Fps is (10 sec: 5660.1, 60 sec: 5545.5, 300 sec: 5522.5). Total num frames: 831861760. Throughput: 0: 5652.4. Samples: 831869250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:12,946][25689] Avg episode reward: [(0, '-3.027')] [2022-07-10 17:02:14,527][26022] Updated weights on worker 0-0, policy_version 812374 (0.00088) [2022-07-10 17:02:16,439][26022] Updated weights on worker 0-0, policy_version 812384 (0.00096) [2022-07-10 17:02:18,041][25689] Fps is (10 sec: 5542.2, 60 sec: 5511.2, 300 sec: 5518.5). Total num frames: 831889408. Throughput: 0: 4839.8. Samples: 831886110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:18,041][25689] Avg episode reward: [(0, '-2.059')] [2022-07-10 17:02:18,349][26022] Updated weights on worker 0-0, policy_version 812394 (0.00107) [2022-07-10 17:02:19,970][26022] Updated weights on worker 0-0, policy_version 812404 (0.00093) [2022-07-10 17:02:21,940][26022] Updated weights on worker 0-0, policy_version 812414 (0.00089) [2022-07-10 17:02:23,065][25689] Fps is (10 sec: 5565.4, 60 sec: 5526.7, 300 sec: 5518.2). Total num frames: 831918080. Throughput: 0: 5675.3. Samples: 831919726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:23,066][25689] Avg episode reward: [(0, '-2.091')] [2022-07-10 17:02:23,627][26022] Updated weights on worker 0-0, policy_version 812424 (0.00089) [2022-07-10 17:02:25,575][26022] Updated weights on worker 0-0, policy_version 812434 (0.00090) [2022-07-10 17:02:27,379][26022] Updated weights on worker 0-0, policy_version 812444 (0.00085) [2022-07-10 17:02:28,147][25689] Fps is (10 sec: 5572.7, 60 sec: 5486.9, 300 sec: 5520.2). Total num frames: 831945728. Throughput: 0: 5776.5. Samples: 831952730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:28,147][25689] Avg episode reward: [(0, '-1.903')] [2022-07-10 17:02:29,268][26022] Updated weights on worker 0-0, policy_version 812454 (0.00067) [2022-07-10 17:02:31,184][26022] Updated weights on worker 0-0, policy_version 812464 (0.00088) [2022-07-10 17:02:33,017][26022] Updated weights on worker 0-0, policy_version 812474 (0.00085) [2022-07-10 17:02:33,223][25689] Fps is (10 sec: 5443.5, 60 sec: 5517.5, 300 sec: 5517.4). Total num frames: 831973376. Throughput: 0: 4933.1. Samples: 831969282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:33,224][25689] Avg episode reward: [(0, '-3.029')] [2022-07-10 17:02:34,873][26022] Updated weights on worker 0-0, policy_version 812484 (0.00088) [2022-07-10 17:02:36,736][26022] Updated weights on worker 0-0, policy_version 812494 (0.00080) [2022-07-10 17:02:38,265][25689] Fps is (10 sec: 5566.1, 60 sec: 5523.7, 300 sec: 5520.5). Total num frames: 832002048. Throughput: 0: 5756.2. Samples: 832002528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:38,266][25689] Avg episode reward: [(0, '-2.370')] [2022-07-10 17:02:38,705][26022] Updated weights on worker 0-0, policy_version 812504 (0.00086) [2022-07-10 17:02:40,314][26022] Updated weights on worker 0-0, policy_version 812514 (0.00084) [2022-07-10 17:02:42,297][26022] Updated weights on worker 0-0, policy_version 812524 (0.00086) [2022-07-10 17:02:43,295][25689] Fps is (10 sec: 5592.0, 60 sec: 5505.0, 300 sec: 5520.7). Total num frames: 832029696. Throughput: 0: 5748.9. Samples: 832036026. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:43,295][25689] Avg episode reward: [(0, '-2.595')] [2022-07-10 17:02:43,912][26022] Updated weights on worker 0-0, policy_version 812534 (0.00082) [2022-07-10 17:02:46,127][26022] Updated weights on worker 0-0, policy_version 812544 (0.00087) [2022-07-10 17:02:47,643][26022] Updated weights on worker 0-0, policy_version 812554 (0.00083) [2022-07-10 17:02:48,323][25689] Fps is (10 sec: 5599.8, 60 sec: 5538.9, 300 sec: 5524.4). Total num frames: 832058368. Throughput: 0: 4964.8. Samples: 832052900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:48,323][25689] Avg episode reward: [(0, '-2.534')] [2022-07-10 17:02:48,589][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:02:48,603][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000812558_832059392.pth [2022-07-10 17:02:48,603][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000810613_830067712.pth [2022-07-10 17:02:49,630][26022] Updated weights on worker 0-0, policy_version 812564 (0.00090) [2022-07-10 17:02:51,219][26022] Updated weights on worker 0-0, policy_version 812574 (0.00085) [2022-07-10 17:02:53,330][26022] Updated weights on worker 0-0, policy_version 812584 (0.00088) [2022-07-10 17:02:53,346][25689] Fps is (10 sec: 5603.2, 60 sec: 5521.9, 300 sec: 5525.8). Total num frames: 832086016. Throughput: 0: 5831.5. Samples: 832086630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:53,347][25689] Avg episode reward: [(0, '-1.664')] [2022-07-10 17:02:55,027][26022] Updated weights on worker 0-0, policy_version 812594 (0.00081) [2022-07-10 17:02:56,878][26022] Updated weights on worker 0-0, policy_version 812604 (0.00094) [2022-07-10 17:02:58,436][25689] Fps is (10 sec: 5670.3, 60 sec: 5522.3, 300 sec: 5524.5). Total num frames: 832115712. Throughput: 0: 5827.1. Samples: 832120064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:02:58,437][25689] Avg episode reward: [(0, '-1.436')] [2022-07-10 17:02:58,738][26022] Updated weights on worker 0-0, policy_version 812614 (0.00091) [2022-07-10 17:03:00,584][26022] Updated weights on worker 0-0, policy_version 812624 (0.00084) [2022-07-10 17:03:02,706][26022] Updated weights on worker 0-0, policy_version 812634 (0.00092) [2022-07-10 17:03:03,491][25689] Fps is (10 sec: 5450.5, 60 sec: 5552.3, 300 sec: 5527.4). Total num frames: 832141312. Throughput: 0: 4997.8. Samples: 832136964. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:03,492][25689] Avg episode reward: [(0, '-0.401')] [2022-07-10 17:03:04,701][26022] Updated weights on worker 0-0, policy_version 812644 (0.00723) [2022-07-10 17:03:06,437][26022] Updated weights on worker 0-0, policy_version 812654 (0.00091) [2022-07-10 17:03:08,197][26022] Updated weights on worker 0-0, policy_version 812664 (0.00092) [2022-07-10 17:03:08,548][25689] Fps is (10 sec: 5265.7, 60 sec: 5536.1, 300 sec: 5526.5). Total num frames: 832168960. Throughput: 0: 5687.0. Samples: 832167920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:08,549][25689] Avg episode reward: [(0, '-0.172')] [2022-07-10 17:03:09,999][26022] Updated weights on worker 0-0, policy_version 812674 (0.00092) [2022-07-10 17:03:12,132][26022] Updated weights on worker 0-0, policy_version 812684 (0.00092) [2022-07-10 17:03:13,575][25689] Fps is (10 sec: 5585.2, 60 sec: 5539.7, 300 sec: 5527.0). Total num frames: 832197632. Throughput: 0: 5676.3. Samples: 832201454. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:13,575][25689] Avg episode reward: [(0, '0.461')] [2022-07-10 17:03:13,703][26022] Updated weights on worker 0-0, policy_version 812694 (0.00526) [2022-07-10 17:03:15,756][26022] Updated weights on worker 0-0, policy_version 812704 (0.00091) [2022-07-10 17:03:17,348][26022] Updated weights on worker 0-0, policy_version 812714 (0.00088) [2022-07-10 17:03:18,652][25689] Fps is (10 sec: 5473.0, 60 sec: 5524.6, 300 sec: 5518.8). Total num frames: 832224256. Throughput: 0: 4858.0. Samples: 832218272. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:18,652][25689] Avg episode reward: [(0, '0.445')] [2022-07-10 17:03:19,300][26022] Updated weights on worker 0-0, policy_version 812724 (0.00085) [2022-07-10 17:03:21,227][26022] Updated weights on worker 0-0, policy_version 812734 (0.00087) [2022-07-10 17:03:22,795][26022] Updated weights on worker 0-0, policy_version 812744 (0.00089) [2022-07-10 17:03:23,666][25689] Fps is (10 sec: 5581.0, 60 sec: 5542.3, 300 sec: 5529.0). Total num frames: 832253952. Throughput: 0: 5718.0. Samples: 832252324. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:23,667][25689] Avg episode reward: [(0, '0.524')] [2022-07-10 17:03:24,972][26022] Updated weights on worker 0-0, policy_version 812754 (0.00094) [2022-07-10 17:03:26,604][26022] Updated weights on worker 0-0, policy_version 812764 (0.00095) [2022-07-10 17:03:28,529][26022] Updated weights on worker 0-0, policy_version 812774 (0.00091) [2022-07-10 17:03:28,676][25689] Fps is (10 sec: 5618.2, 60 sec: 5532.0, 300 sec: 5529.0). Total num frames: 832280576. Throughput: 0: 5859.0. Samples: 832285850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:28,677][25689] Avg episode reward: [(0, '-0.320')] [2022-07-10 17:03:30,191][26022] Updated weights on worker 0-0, policy_version 812784 (0.00472) [2022-07-10 17:03:32,214][26022] Updated weights on worker 0-0, policy_version 812794 (0.00089) [2022-07-10 17:03:33,699][25689] Fps is (10 sec: 5511.5, 60 sec: 5553.8, 300 sec: 5525.9). Total num frames: 832309248. Throughput: 0: 5029.5. Samples: 832302668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:33,700][25689] Avg episode reward: [(0, '-0.242')] [2022-07-10 17:03:33,968][26022] Updated weights on worker 0-0, policy_version 812804 (0.00091) [2022-07-10 17:03:35,975][26022] Updated weights on worker 0-0, policy_version 812814 (0.00088) [2022-07-10 17:03:37,808][26022] Updated weights on worker 0-0, policy_version 812824 (0.00086) [2022-07-10 17:03:38,825][25689] Fps is (10 sec: 5650.5, 60 sec: 5546.2, 300 sec: 5534.2). Total num frames: 832337920. Throughput: 0: 5820.5. Samples: 832335690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:38,825][25689] Avg episode reward: [(0, '-0.540')] [2022-07-10 17:03:39,481][26022] Updated weights on worker 0-0, policy_version 812834 (0.00085) [2022-07-10 17:03:41,383][26022] Updated weights on worker 0-0, policy_version 812844 (0.00083) [2022-07-10 17:03:43,245][26022] Updated weights on worker 0-0, policy_version 812854 (0.00087) [2022-07-10 17:03:43,867][25689] Fps is (10 sec: 5639.7, 60 sec: 5561.9, 300 sec: 5530.2). Total num frames: 832366592. Throughput: 0: 5798.1. Samples: 832369450. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:43,868][25689] Avg episode reward: [(0, '-0.792')] [2022-07-10 17:03:45,119][26022] Updated weights on worker 0-0, policy_version 812864 (0.01287) [2022-07-10 17:03:46,851][26022] Updated weights on worker 0-0, policy_version 812874 (0.00091) [2022-07-10 17:03:48,835][26022] Updated weights on worker 0-0, policy_version 812884 (0.00090) [2022-07-10 17:03:48,934][25689] Fps is (10 sec: 5469.7, 60 sec: 5524.5, 300 sec: 5529.1). Total num frames: 832393216. Throughput: 0: 5782.9. Samples: 832403000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:48,935][25689] Avg episode reward: [(0, '-1.561')] [2022-07-10 17:03:50,609][26022] Updated weights on worker 0-0, policy_version 812894 (0.00093) [2022-07-10 17:03:52,731][26022] Updated weights on worker 0-0, policy_version 812904 (0.00085) [2022-07-10 17:03:53,978][25689] Fps is (10 sec: 5469.2, 60 sec: 5539.6, 300 sec: 5529.5). Total num frames: 832421888. Throughput: 0: 5754.3. Samples: 832419356. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:53,978][25689] Avg episode reward: [(0, '-0.634')] [2022-07-10 17:03:54,172][26022] Updated weights on worker 0-0, policy_version 812914 (0.00088) [2022-07-10 17:03:56,211][26022] Updated weights on worker 0-0, policy_version 812924 (0.00088) [2022-07-10 17:03:57,851][26022] Updated weights on worker 0-0, policy_version 812934 (0.00089) [2022-07-10 17:03:59,035][25689] Fps is (10 sec: 5575.9, 60 sec: 5508.7, 300 sec: 5528.5). Total num frames: 832449536. Throughput: 0: 5791.7. Samples: 832452742. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:03:59,035][25689] Avg episode reward: [(0, '-1.338')] [2022-07-10 17:03:59,924][26022] Updated weights on worker 0-0, policy_version 812944 (0.00087) [2022-07-10 17:04:01,679][26022] Updated weights on worker 0-0, policy_version 812954 (0.00082) [2022-07-10 17:04:03,915][26022] Updated weights on worker 0-0, policy_version 812964 (0.00084) [2022-07-10 17:04:04,047][25689] Fps is (10 sec: 5389.9, 60 sec: 5529.6, 300 sec: 5528.3). Total num frames: 832476160. Throughput: 0: 5679.5. Samples: 832484060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 17:04:04,047][25689] Avg episode reward: [(0, '-1.729')] [2022-07-10 17:04:05,708][26022] Updated weights on worker 0-0, policy_version 812974 (0.00090) [2022-07-10 17:04:07,543][26022] Updated weights on worker 0-0, policy_version 812984 (0.00078) [2022-07-10 17:04:09,060][25689] Fps is (10 sec: 5311.5, 60 sec: 5516.7, 300 sec: 5521.5). Total num frames: 832502784. Throughput: 0: 4854.9. Samples: 832500708. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:09,060][25689] Avg episode reward: [(0, '-2.972')] [2022-07-10 17:04:09,507][26022] Updated weights on worker 0-0, policy_version 812994 (0.00087) [2022-07-10 17:04:11,238][26022] Updated weights on worker 0-0, policy_version 813004 (0.00084) [2022-07-10 17:04:13,228][26022] Updated weights on worker 0-0, policy_version 813014 (0.00095) [2022-07-10 17:04:14,063][25689] Fps is (10 sec: 5520.8, 60 sec: 5518.9, 300 sec: 5525.7). Total num frames: 832531456. Throughput: 0: 5722.7. Samples: 832534298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:14,063][25689] Avg episode reward: [(0, '-3.064')] [2022-07-10 17:04:14,952][26022] Updated weights on worker 0-0, policy_version 813024 (0.00081) [2022-07-10 17:04:16,769][26022] Updated weights on worker 0-0, policy_version 813034 (0.00087) [2022-07-10 17:04:18,671][26022] Updated weights on worker 0-0, policy_version 813044 (0.00087) [2022-07-10 17:04:19,105][25689] Fps is (10 sec: 5606.4, 60 sec: 5538.9, 300 sec: 5521.6). Total num frames: 832559104. Throughput: 0: 5734.3. Samples: 832567834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:19,106][25689] Avg episode reward: [(0, '-2.492')] [2022-07-10 17:04:20,358][26022] Updated weights on worker 0-0, policy_version 813054 (0.00087) [2022-07-10 17:04:22,595][26022] Updated weights on worker 0-0, policy_version 813064 (0.00090) [2022-07-10 17:04:23,944][26022] Updated weights on worker 0-0, policy_version 813074 (0.00090) [2022-07-10 17:04:24,135][25689] Fps is (10 sec: 5591.6, 60 sec: 5520.6, 300 sec: 5524.6). Total num frames: 832587776. Throughput: 0: 5004.9. Samples: 832584600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:24,135][25689] Avg episode reward: [(0, '-3.166')] [2022-07-10 17:04:26,151][26022] Updated weights on worker 0-0, policy_version 813084 (0.00094) [2022-07-10 17:04:27,707][26022] Updated weights on worker 0-0, policy_version 813094 (0.00089) [2022-07-10 17:04:29,158][25689] Fps is (10 sec: 5500.4, 60 sec: 5519.4, 300 sec: 5521.0). Total num frames: 832614400. Throughput: 0: 5845.6. Samples: 832618196. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:29,159][25689] Avg episode reward: [(0, '-2.620')] [2022-07-10 17:04:29,764][26022] Updated weights on worker 0-0, policy_version 813104 (0.00096) [2022-07-10 17:04:31,403][26022] Updated weights on worker 0-0, policy_version 813114 (0.00090) [2022-07-10 17:04:33,380][26022] Updated weights on worker 0-0, policy_version 813124 (0.00089) [2022-07-10 17:04:34,177][25689] Fps is (10 sec: 5608.4, 60 sec: 5536.7, 300 sec: 5522.4). Total num frames: 832644096. Throughput: 0: 5843.6. Samples: 832651838. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:34,178][25689] Avg episode reward: [(0, '-1.812')] [2022-07-10 17:04:35,171][26022] Updated weights on worker 0-0, policy_version 813134 (0.00084) [2022-07-10 17:04:36,933][26022] Updated weights on worker 0-0, policy_version 813144 (0.00081) [2022-07-10 17:04:38,703][26022] Updated weights on worker 0-0, policy_version 813154 (0.00101) [2022-07-10 17:04:39,282][25689] Fps is (10 sec: 5765.3, 60 sec: 5538.5, 300 sec: 5534.8). Total num frames: 832672768. Throughput: 0: 4997.6. Samples: 832668670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:39,283][25689] Avg episode reward: [(0, '-0.754')] [2022-07-10 17:04:40,728][26022] Updated weights on worker 0-0, policy_version 813164 (0.00090) [2022-07-10 17:04:42,295][26022] Updated weights on worker 0-0, policy_version 813174 (0.00094) [2022-07-10 17:04:44,294][25689] Fps is (10 sec: 5566.9, 60 sec: 5524.4, 300 sec: 5521.4). Total num frames: 832700416. Throughput: 0: 5847.0. Samples: 832702470. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:44,295][25689] Avg episode reward: [(0, '-0.623')] [2022-07-10 17:04:44,307][26022] Updated weights on worker 0-0, policy_version 813184 (0.00092) [2022-07-10 17:04:46,046][26022] Updated weights on worker 0-0, policy_version 813194 (0.00086) [2022-07-10 17:04:48,050][26022] Updated weights on worker 0-0, policy_version 813204 (0.00087) [2022-07-10 17:04:48,646][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:04:48,665][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000813209_832726016.pth [2022-07-10 17:04:48,665][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000811260_830730240.pth [2022-07-10 17:04:49,351][25689] Fps is (10 sec: 5593.6, 60 sec: 5559.2, 300 sec: 5531.4). Total num frames: 832729088. Throughput: 0: 5831.5. Samples: 832735950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:49,352][25689] Avg episode reward: [(0, '-0.527')] [2022-07-10 17:04:49,908][26022] Updated weights on worker 0-0, policy_version 813214 (0.00098) [2022-07-10 17:04:51,607][26022] Updated weights on worker 0-0, policy_version 813224 (0.00091) [2022-07-10 17:04:53,338][26022] Updated weights on worker 0-0, policy_version 813234 (0.00089) [2022-07-10 17:04:54,361][25689] Fps is (10 sec: 5696.1, 60 sec: 5562.3, 300 sec: 5530.0). Total num frames: 832757760. Throughput: 0: 4999.3. Samples: 832752744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:54,362][25689] Avg episode reward: [(0, '-0.877')] [2022-07-10 17:04:55,303][26022] Updated weights on worker 0-0, policy_version 813244 (0.00084) [2022-07-10 17:04:56,995][26022] Updated weights on worker 0-0, policy_version 813254 (0.00086) [2022-07-10 17:04:59,008][26022] Updated weights on worker 0-0, policy_version 813264 (0.00100) [2022-07-10 17:04:59,474][25689] Fps is (10 sec: 5563.4, 60 sec: 5557.2, 300 sec: 5527.9). Total num frames: 832785408. Throughput: 0: 5842.0. Samples: 832786632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:04:59,475][25689] Avg episode reward: [(0, '-0.234')] [2022-07-10 17:05:00,684][26022] Updated weights on worker 0-0, policy_version 813274 (0.00091) [2022-07-10 17:05:02,853][26022] Updated weights on worker 0-0, policy_version 813284 (0.00089) [2022-07-10 17:05:04,475][25689] Fps is (10 sec: 5366.0, 60 sec: 5558.2, 300 sec: 5532.6). Total num frames: 832812032. Throughput: 0: 5727.9. Samples: 832818066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:04,476][25689] Avg episode reward: [(0, '-0.842')] [2022-07-10 17:05:04,637][26022] Updated weights on worker 0-0, policy_version 813294 (0.00084) [2022-07-10 17:05:06,570][26022] Updated weights on worker 0-0, policy_version 813304 (0.00105) [2022-07-10 17:05:08,488][26022] Updated weights on worker 0-0, policy_version 813314 (0.00092) [2022-07-10 17:05:09,506][25689] Fps is (10 sec: 5308.0, 60 sec: 5556.6, 300 sec: 5525.9). Total num frames: 832838656. Throughput: 0: 4891.7. Samples: 832834542. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:09,508][25689] Avg episode reward: [(0, '-0.385')] [2022-07-10 17:05:10,321][26022] Updated weights on worker 0-0, policy_version 813324 (0.00086) [2022-07-10 17:05:12,095][26022] Updated weights on worker 0-0, policy_version 813334 (0.00088) [2022-07-10 17:05:14,194][26022] Updated weights on worker 0-0, policy_version 813344 (0.00092) [2022-07-10 17:05:14,509][25689] Fps is (10 sec: 5408.8, 60 sec: 5539.5, 300 sec: 5523.7). Total num frames: 832866304. Throughput: 0: 5724.4. Samples: 832868080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:14,510][25689] Avg episode reward: [(0, '-0.913')] [2022-07-10 17:05:15,651][26022] Updated weights on worker 0-0, policy_version 813354 (0.00085) [2022-07-10 17:05:17,850][26022] Updated weights on worker 0-0, policy_version 813364 (0.00088) [2022-07-10 17:05:19,368][26022] Updated weights on worker 0-0, policy_version 813374 (0.00095) [2022-07-10 17:05:19,550][25689] Fps is (10 sec: 5709.3, 60 sec: 5573.6, 300 sec: 5530.2). Total num frames: 832896000. Throughput: 0: 5721.1. Samples: 832901486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:19,550][25689] Avg episode reward: [(0, '-0.290')] [2022-07-10 17:05:21,543][26022] Updated weights on worker 0-0, policy_version 813384 (0.00086) [2022-07-10 17:05:23,107][26022] Updated weights on worker 0-0, policy_version 813394 (0.00085) [2022-07-10 17:05:24,645][25689] Fps is (10 sec: 5556.6, 60 sec: 5533.7, 300 sec: 5525.5). Total num frames: 832922624. Throughput: 0: 4969.0. Samples: 832918290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:24,646][25689] Avg episode reward: [(0, '-0.731')] [2022-07-10 17:05:25,266][26022] Updated weights on worker 0-0, policy_version 813404 (0.00086) [2022-07-10 17:05:26,955][26022] Updated weights on worker 0-0, policy_version 813414 (0.00091) [2022-07-10 17:05:28,823][26022] Updated weights on worker 0-0, policy_version 813424 (0.00090) [2022-07-10 17:05:29,667][25689] Fps is (10 sec: 5465.7, 60 sec: 5567.8, 300 sec: 5529.2). Total num frames: 832951296. Throughput: 0: 5812.9. Samples: 832951734. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:29,667][25689] Avg episode reward: [(0, '-1.172')] [2022-07-10 17:05:30,560][26022] Updated weights on worker 0-0, policy_version 813434 (0.00089) [2022-07-10 17:05:32,398][26022] Updated weights on worker 0-0, policy_version 813444 (0.00084) [2022-07-10 17:05:34,200][26022] Updated weights on worker 0-0, policy_version 813454 (0.00094) [2022-07-10 17:05:34,746][25689] Fps is (10 sec: 5576.0, 60 sec: 5528.4, 300 sec: 5526.5). Total num frames: 832978944. Throughput: 0: 5792.2. Samples: 832985292. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:34,746][25689] Avg episode reward: [(0, '-0.477')] [2022-07-10 17:05:36,263][26022] Updated weights on worker 0-0, policy_version 813464 (0.00084) [2022-07-10 17:05:37,899][26022] Updated weights on worker 0-0, policy_version 813474 (0.00086) [2022-07-10 17:05:39,792][25689] Fps is (10 sec: 5461.0, 60 sec: 5516.9, 300 sec: 5529.7). Total num frames: 833006592. Throughput: 0: 4952.8. Samples: 833001742. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:39,793][25689] Avg episode reward: [(0, '-1.349')] [2022-07-10 17:05:39,931][26022] Updated weights on worker 0-0, policy_version 813484 (0.00084) [2022-07-10 17:05:41,447][26022] Updated weights on worker 0-0, policy_version 813494 (0.00089) [2022-07-10 17:05:43,733][26022] Updated weights on worker 0-0, policy_version 813504 (0.00090) [2022-07-10 17:05:44,815][25689] Fps is (10 sec: 5695.0, 60 sec: 5549.7, 300 sec: 5533.3). Total num frames: 833036288. Throughput: 0: 5791.4. Samples: 833035100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:44,815][25689] Avg episode reward: [(0, '-1.241')] [2022-07-10 17:05:45,178][26022] Updated weights on worker 0-0, policy_version 813514 (0.00089) [2022-07-10 17:05:47,218][26022] Updated weights on worker 0-0, policy_version 813524 (0.00090) [2022-07-10 17:05:48,932][26022] Updated weights on worker 0-0, policy_version 813534 (0.00558) [2022-07-10 17:05:49,819][25689] Fps is (10 sec: 5515.1, 60 sec: 5503.8, 300 sec: 5526.7). Total num frames: 833061888. Throughput: 0: 5794.3. Samples: 833068500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:49,819][25689] Avg episode reward: [(0, '-0.983')] [2022-07-10 17:05:50,849][26022] Updated weights on worker 0-0, policy_version 813544 (0.00086) [2022-07-10 17:05:52,815][26022] Updated weights on worker 0-0, policy_version 813554 (0.00082) [2022-07-10 17:05:54,410][26022] Updated weights on worker 0-0, policy_version 813564 (0.00079) [2022-07-10 17:05:54,839][25689] Fps is (10 sec: 5413.9, 60 sec: 5502.8, 300 sec: 5532.2). Total num frames: 833090560. Throughput: 0: 4963.6. Samples: 833085030. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:54,840][25689] Avg episode reward: [(0, '-2.040')] [2022-07-10 17:05:56,542][26022] Updated weights on worker 0-0, policy_version 813574 (0.00088) [2022-07-10 17:05:58,162][26022] Updated weights on worker 0-0, policy_version 813584 (0.00091) [2022-07-10 17:05:59,962][25689] Fps is (10 sec: 5552.6, 60 sec: 5502.0, 300 sec: 5533.7). Total num frames: 833118208. Throughput: 0: 5799.6. Samples: 833118716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:05:59,962][25689] Avg episode reward: [(0, '-2.662')] [2022-07-10 17:06:00,175][26022] Updated weights on worker 0-0, policy_version 813594 (0.00083) [2022-07-10 17:06:02,215][26022] Updated weights on worker 0-0, policy_version 813604 (0.00088) [2022-07-10 17:06:04,182][26022] Updated weights on worker 0-0, policy_version 813614 (0.00092) [2022-07-10 17:06:05,050][25689] Fps is (10 sec: 5415.3, 60 sec: 5510.9, 300 sec: 5533.7). Total num frames: 833145856. Throughput: 0: 5672.9. Samples: 833149896. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:05,051][25689] Avg episode reward: [(0, '-2.779')] [2022-07-10 17:06:06,239][26022] Updated weights on worker 0-0, policy_version 813624 (0.00088) [2022-07-10 17:06:07,966][26022] Updated weights on worker 0-0, policy_version 813634 (0.00086) [2022-07-10 17:06:09,927][26022] Updated weights on worker 0-0, policy_version 813644 (0.00102) [2022-07-10 17:06:10,116][25689] Fps is (10 sec: 5244.1, 60 sec: 5490.9, 300 sec: 5526.9). Total num frames: 833171456. Throughput: 0: 5631.8. Samples: 833182808. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:10,116][25689] Avg episode reward: [(0, '-1.888')] [2022-07-10 17:06:11,423][26022] Updated weights on worker 0-0, policy_version 813654 (0.00067) [2022-07-10 17:06:13,638][26022] Updated weights on worker 0-0, policy_version 813664 (0.00094) [2022-07-10 17:06:15,155][25689] Fps is (10 sec: 5371.1, 60 sec: 5504.5, 300 sec: 5524.4). Total num frames: 833200128. Throughput: 0: 5632.7. Samples: 833199462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:15,157][25689] Avg episode reward: [(0, '-1.694')] [2022-07-10 17:06:15,479][26022] Updated weights on worker 0-0, policy_version 813674 (0.00084) [2022-07-10 17:06:17,103][26022] Updated weights on worker 0-0, policy_version 813684 (0.00094) [2022-07-10 17:06:19,015][26022] Updated weights on worker 0-0, policy_version 813694 (0.00083) [2022-07-10 17:06:20,262][25689] Fps is (10 sec: 5752.3, 60 sec: 5498.4, 300 sec: 5529.5). Total num frames: 833229824. Throughput: 0: 5622.6. Samples: 833232860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:20,264][25689] Avg episode reward: [(0, '-0.814')] [2022-07-10 17:06:20,695][26022] Updated weights on worker 0-0, policy_version 813704 (0.00076) [2022-07-10 17:06:22,659][26022] Updated weights on worker 0-0, policy_version 813714 (0.00089) [2022-07-10 17:06:24,407][26022] Updated weights on worker 0-0, policy_version 813724 (0.00083) [2022-07-10 17:06:25,291][25689] Fps is (10 sec: 5657.4, 60 sec: 5521.4, 300 sec: 5522.4). Total num frames: 833257472. Throughput: 0: 5760.9. Samples: 833266500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:25,293][25689] Avg episode reward: [(0, '0.179')] [2022-07-10 17:06:26,232][26022] Updated weights on worker 0-0, policy_version 813734 (0.00083) [2022-07-10 17:06:28,162][26022] Updated weights on worker 0-0, policy_version 813744 (0.00089) [2022-07-10 17:06:30,044][26022] Updated weights on worker 0-0, policy_version 813754 (0.00084) [2022-07-10 17:06:30,333][25689] Fps is (10 sec: 5490.7, 60 sec: 5502.6, 300 sec: 5529.2). Total num frames: 833285120. Throughput: 0: 4956.4. Samples: 833283014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:30,334][25689] Avg episode reward: [(0, '0.311')] [2022-07-10 17:06:32,010][26022] Updated weights on worker 0-0, policy_version 813764 (0.00087) [2022-07-10 17:06:33,723][26022] Updated weights on worker 0-0, policy_version 813774 (0.00081) [2022-07-10 17:06:35,398][25689] Fps is (10 sec: 5572.6, 60 sec: 5520.8, 300 sec: 5530.1). Total num frames: 833313792. Throughput: 0: 5783.3. Samples: 833316532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:35,398][25689] Avg episode reward: [(0, '0.236')] [2022-07-10 17:06:35,487][26022] Updated weights on worker 0-0, policy_version 813784 (0.00088) [2022-07-10 17:06:37,508][26022] Updated weights on worker 0-0, policy_version 813794 (0.00090) [2022-07-10 17:06:39,317][26022] Updated weights on worker 0-0, policy_version 813804 (0.00086) [2022-07-10 17:06:40,463][25689] Fps is (10 sec: 5560.0, 60 sec: 5519.1, 300 sec: 5525.6). Total num frames: 833341440. Throughput: 0: 5779.6. Samples: 833349610. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:40,463][25689] Avg episode reward: [(0, '-0.119')] [2022-07-10 17:06:41,159][26022] Updated weights on worker 0-0, policy_version 813814 (0.00085) [2022-07-10 17:06:42,858][26022] Updated weights on worker 0-0, policy_version 813824 (0.00083) [2022-07-10 17:06:44,789][26022] Updated weights on worker 0-0, policy_version 813834 (0.00096) [2022-07-10 17:06:45,486][25689] Fps is (10 sec: 5582.4, 60 sec: 5502.1, 300 sec: 5532.6). Total num frames: 833370112. Throughput: 0: 4943.4. Samples: 833366334. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:45,487][25689] Avg episode reward: [(0, '0.007')] [2022-07-10 17:06:46,674][26022] Updated weights on worker 0-0, policy_version 813844 (0.00089) [2022-07-10 17:06:48,376][26022] Updated weights on worker 0-0, policy_version 813854 (0.00092) [2022-07-10 17:06:48,825][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:06:48,838][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000813856_833388544.pth [2022-07-10 17:06:48,839][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000811909_831394816.pth [2022-07-10 17:06:50,316][26022] Updated weights on worker 0-0, policy_version 813864 (0.00090) [2022-07-10 17:06:50,497][25689] Fps is (10 sec: 5612.8, 60 sec: 5535.3, 300 sec: 5529.4). Total num frames: 833397760. Throughput: 0: 5797.3. Samples: 833399908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:50,499][25689] Avg episode reward: [(0, '-1.182')] [2022-07-10 17:06:52,092][26022] Updated weights on worker 0-0, policy_version 813874 (0.00087) [2022-07-10 17:06:54,075][26022] Updated weights on worker 0-0, policy_version 813884 (0.00086) [2022-07-10 17:06:55,501][25689] Fps is (10 sec: 5623.6, 60 sec: 5536.8, 300 sec: 5527.6). Total num frames: 833426432. Throughput: 0: 5820.7. Samples: 833433550. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:06:55,502][25689] Avg episode reward: [(0, '-1.468')] [2022-07-10 17:06:55,764][26022] Updated weights on worker 0-0, policy_version 813894 (0.00090) [2022-07-10 17:06:57,715][26022] Updated weights on worker 0-0, policy_version 813904 (0.00084) [2022-07-10 17:06:59,343][26022] Updated weights on worker 0-0, policy_version 813914 (0.00084) [2022-07-10 17:07:00,576][25689] Fps is (10 sec: 5486.0, 60 sec: 5524.2, 300 sec: 5536.8). Total num frames: 833453056. Throughput: 0: 5007.5. Samples: 833450330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:00,577][25689] Avg episode reward: [(0, '-0.963')] [2022-07-10 17:07:01,437][26022] Updated weights on worker 0-0, policy_version 813924 (0.00094) [2022-07-10 17:07:03,407][26022] Updated weights on worker 0-0, policy_version 813934 (0.00091) [2022-07-10 17:07:05,286][26022] Updated weights on worker 0-0, policy_version 813944 (0.00088) [2022-07-10 17:07:05,595][25689] Fps is (10 sec: 5377.1, 60 sec: 5530.6, 300 sec: 5534.2). Total num frames: 833480704. Throughput: 0: 5749.7. Samples: 833481950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:05,595][25689] Avg episode reward: [(0, '-1.212')] [2022-07-10 17:07:07,234][26022] Updated weights on worker 0-0, policy_version 813954 (0.00087) [2022-07-10 17:07:09,156][26022] Updated weights on worker 0-0, policy_version 813964 (0.00082) [2022-07-10 17:07:10,625][25689] Fps is (10 sec: 5502.8, 60 sec: 5567.7, 300 sec: 5531.4). Total num frames: 833508352. Throughput: 0: 5723.3. Samples: 833515108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:10,625][25689] Avg episode reward: [(0, '-0.795')] [2022-07-10 17:07:10,758][26022] Updated weights on worker 0-0, policy_version 813974 (0.00087) [2022-07-10 17:07:12,806][26022] Updated weights on worker 0-0, policy_version 813984 (0.00091) [2022-07-10 17:07:14,468][26022] Updated weights on worker 0-0, policy_version 813994 (0.00095) [2022-07-10 17:07:15,636][25689] Fps is (10 sec: 5405.0, 60 sec: 5536.4, 300 sec: 5529.6). Total num frames: 833534976. Throughput: 0: 4890.9. Samples: 833532026. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:15,637][25689] Avg episode reward: [(0, '-0.712')] [2022-07-10 17:07:16,410][26022] Updated weights on worker 0-0, policy_version 814004 (0.00090) [2022-07-10 17:07:18,147][26022] Updated weights on worker 0-0, policy_version 814014 (0.00088) [2022-07-10 17:07:20,111][26022] Updated weights on worker 0-0, policy_version 814024 (0.00092) [2022-07-10 17:07:20,715][25689] Fps is (10 sec: 5581.7, 60 sec: 5539.0, 300 sec: 5532.0). Total num frames: 833564672. Throughput: 0: 5714.4. Samples: 833565412. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:20,716][25689] Avg episode reward: [(0, '-0.822')] [2022-07-10 17:07:22,012][26022] Updated weights on worker 0-0, policy_version 814034 (0.00094) [2022-07-10 17:07:23,684][26022] Updated weights on worker 0-0, policy_version 814044 (0.00086) [2022-07-10 17:07:25,700][26022] Updated weights on worker 0-0, policy_version 814054 (0.00093) [2022-07-10 17:07:25,730][25689] Fps is (10 sec: 5579.5, 60 sec: 5523.3, 300 sec: 5529.8). Total num frames: 833591296. Throughput: 0: 5807.2. Samples: 833598880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:25,730][25689] Avg episode reward: [(0, '-1.274')] [2022-07-10 17:07:27,451][26022] Updated weights on worker 0-0, policy_version 814064 (0.00095) [2022-07-10 17:07:29,414][26022] Updated weights on worker 0-0, policy_version 814074 (0.00086) [2022-07-10 17:07:30,768][25689] Fps is (10 sec: 5500.3, 60 sec: 5540.6, 300 sec: 5533.9). Total num frames: 833619968. Throughput: 0: 4976.0. Samples: 833615342. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:30,769][25689] Avg episode reward: [(0, '-1.228')] [2022-07-10 17:07:30,969][26022] Updated weights on worker 0-0, policy_version 814084 (0.00086) [2022-07-10 17:07:33,039][26022] Updated weights on worker 0-0, policy_version 814094 (0.00090) [2022-07-10 17:07:34,666][26022] Updated weights on worker 0-0, policy_version 814104 (0.00090) [2022-07-10 17:07:35,816][25689] Fps is (10 sec: 5584.1, 60 sec: 5525.2, 300 sec: 5530.4). Total num frames: 833647616. Throughput: 0: 5799.3. Samples: 833649056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:35,816][25689] Avg episode reward: [(0, '-1.553')] [2022-07-10 17:07:36,722][26022] Updated weights on worker 0-0, policy_version 814114 (0.00096) [2022-07-10 17:07:38,520][26022] Updated weights on worker 0-0, policy_version 814124 (0.00093) [2022-07-10 17:07:40,609][26022] Updated weights on worker 0-0, policy_version 814134 (0.00084) [2022-07-10 17:07:40,879][25689] Fps is (10 sec: 5570.4, 60 sec: 5542.3, 300 sec: 5533.2). Total num frames: 833676288. Throughput: 0: 5781.9. Samples: 833681998. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:07:40,879][25689] Avg episode reward: [(0, '-3.476')] [2022-07-10 17:07:42,334][26022] Updated weights on worker 0-0, policy_version 814144 (0.00091) [2022-07-10 17:07:44,045][26022] Updated weights on worker 0-0, policy_version 814154 (0.00089) [2022-07-10 17:07:45,910][25689] Fps is (10 sec: 5579.5, 60 sec: 5524.8, 300 sec: 5529.7). Total num frames: 833703936. Throughput: 0: 4948.9. Samples: 833698748. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:07:45,910][25689] Avg episode reward: [(0, '-3.061')] [2022-07-10 17:07:45,912][26022] Updated weights on worker 0-0, policy_version 814164 (0.00080) [2022-07-10 17:07:47,793][26022] Updated weights on worker 0-0, policy_version 814174 (0.00086) [2022-07-10 17:07:49,578][26022] Updated weights on worker 0-0, policy_version 814184 (0.00091) [2022-07-10 17:07:50,936][25689] Fps is (10 sec: 5599.8, 60 sec: 5540.2, 300 sec: 5533.1). Total num frames: 833732608. Throughput: 0: 5802.3. Samples: 833732364. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:07:50,937][25689] Avg episode reward: [(0, '-2.464')] [2022-07-10 17:07:51,322][26022] Updated weights on worker 0-0, policy_version 814194 (0.00087) [2022-07-10 17:07:53,244][26022] Updated weights on worker 0-0, policy_version 814204 (0.00085) [2022-07-10 17:07:54,980][26022] Updated weights on worker 0-0, policy_version 814214 (0.00098) [2022-07-10 17:07:56,009][25689] Fps is (10 sec: 5576.5, 60 sec: 5517.0, 300 sec: 5526.5). Total num frames: 833760256. Throughput: 0: 5815.6. Samples: 833766494. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:07:56,011][25689] Avg episode reward: [(0, '-2.215')] [2022-07-10 17:07:56,759][26022] Updated weights on worker 0-0, policy_version 814224 (0.00088) [2022-07-10 17:07:58,484][26022] Updated weights on worker 0-0, policy_version 814234 (0.00087) [2022-07-10 17:08:00,331][26022] Updated weights on worker 0-0, policy_version 814244 (0.00081) [2022-07-10 17:08:01,080][25689] Fps is (10 sec: 5552.0, 60 sec: 5551.2, 300 sec: 5536.5). Total num frames: 833788928. Throughput: 0: 5022.2. Samples: 833783454. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:01,081][25689] Avg episode reward: [(0, '-1.820')] [2022-07-10 17:08:02,579][26022] Updated weights on worker 0-0, policy_version 814254 (0.00086) [2022-07-10 17:08:04,409][26022] Updated weights on worker 0-0, policy_version 814264 (0.00097) [2022-07-10 17:08:06,140][25689] Fps is (10 sec: 5559.1, 60 sec: 5547.4, 300 sec: 5536.5). Total num frames: 833816576. Throughput: 0: 5764.5. Samples: 833815366. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:06,141][25689] Avg episode reward: [(0, '0.001')] [2022-07-10 17:08:06,156][26022] Updated weights on worker 0-0, policy_version 814274 (0.00073) [2022-07-10 17:08:08,023][26022] Updated weights on worker 0-0, policy_version 814284 (0.00095) [2022-07-10 17:08:09,913][26022] Updated weights on worker 0-0, policy_version 814294 (0.00092) [2022-07-10 17:08:11,147][25689] Fps is (10 sec: 5493.3, 60 sec: 5549.6, 300 sec: 5533.4). Total num frames: 833844224. Throughput: 0: 5760.7. Samples: 833848788. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:11,147][25689] Avg episode reward: [(0, '-0.619')] [2022-07-10 17:08:11,668][26022] Updated weights on worker 0-0, policy_version 814304 (0.00090) [2022-07-10 17:08:13,473][26022] Updated weights on worker 0-0, policy_version 814314 (0.00099) [2022-07-10 17:08:15,411][26022] Updated weights on worker 0-0, policy_version 814324 (0.00093) [2022-07-10 17:08:16,226][25689] Fps is (10 sec: 5482.4, 60 sec: 5560.2, 300 sec: 5536.8). Total num frames: 833871872. Throughput: 0: 4906.6. Samples: 833865692. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:16,231][25689] Avg episode reward: [(0, '-1.065')] [2022-07-10 17:08:17,188][26022] Updated weights on worker 0-0, policy_version 814334 (0.00093) [2022-07-10 17:08:18,995][26022] Updated weights on worker 0-0, policy_version 814344 (0.00087) [2022-07-10 17:08:20,871][26022] Updated weights on worker 0-0, policy_version 814354 (0.00102) [2022-07-10 17:08:21,286][25689] Fps is (10 sec: 5453.7, 60 sec: 5528.2, 300 sec: 5529.1). Total num frames: 833899520. Throughput: 0: 5735.0. Samples: 833899332. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:21,286][25689] Avg episode reward: [(0, '-0.872')] [2022-07-10 17:08:22,833][26022] Updated weights on worker 0-0, policy_version 814364 (0.00086) [2022-07-10 17:08:24,646][26022] Updated weights on worker 0-0, policy_version 814374 (0.00086) [2022-07-10 17:08:26,357][25689] Fps is (10 sec: 5559.2, 60 sec: 5556.8, 300 sec: 5534.8). Total num frames: 833928192. Throughput: 0: 5812.5. Samples: 833932878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:26,358][25689] Avg episode reward: [(0, '-0.630')] [2022-07-10 17:08:26,431][26022] Updated weights on worker 0-0, policy_version 814384 (0.00086) [2022-07-10 17:08:28,155][26022] Updated weights on worker 0-0, policy_version 814394 (0.00095) [2022-07-10 17:08:30,159][26022] Updated weights on worker 0-0, policy_version 814404 (0.00082) [2022-07-10 17:08:31,381][25689] Fps is (10 sec: 5680.0, 60 sec: 5558.1, 300 sec: 5534.8). Total num frames: 833956864. Throughput: 0: 5809.0. Samples: 833966332. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:31,383][25689] Avg episode reward: [(0, '-0.467')] [2022-07-10 17:08:31,924][26022] Updated weights on worker 0-0, policy_version 814414 (0.00094) [2022-07-10 17:08:33,805][26022] Updated weights on worker 0-0, policy_version 814424 (0.00634) [2022-07-10 17:08:35,377][26022] Updated weights on worker 0-0, policy_version 814434 (0.00084) [2022-07-10 17:08:36,440][25689] Fps is (10 sec: 5585.7, 60 sec: 5557.1, 300 sec: 5532.6). Total num frames: 833984512. Throughput: 0: 5805.1. Samples: 833983034. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:36,442][25689] Avg episode reward: [(0, '-0.175')] [2022-07-10 17:08:37,446][26022] Updated weights on worker 0-0, policy_version 814444 (0.00089) [2022-07-10 17:08:39,287][26022] Updated weights on worker 0-0, policy_version 814454 (0.00089) [2022-07-10 17:08:41,042][26022] Updated weights on worker 0-0, policy_version 814464 (0.00083) [2022-07-10 17:08:41,525][25689] Fps is (10 sec: 5451.2, 60 sec: 5538.2, 300 sec: 5528.4). Total num frames: 834012160. Throughput: 0: 5774.0. Samples: 834016194. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:41,531][25689] Avg episode reward: [(0, '-0.528')] [2022-07-10 17:08:43,006][26022] Updated weights on worker 0-0, policy_version 814474 (0.00106) [2022-07-10 17:08:44,765][26022] Updated weights on worker 0-0, policy_version 814484 (0.00085) [2022-07-10 17:08:46,534][25689] Fps is (10 sec: 5478.0, 60 sec: 5540.2, 300 sec: 5532.9). Total num frames: 834039808. Throughput: 0: 5797.6. Samples: 834049858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:46,535][25689] Avg episode reward: [(0, '-0.305')] [2022-07-10 17:08:46,695][26022] Updated weights on worker 0-0, policy_version 814494 (0.00088) [2022-07-10 17:08:48,471][26022] Updated weights on worker 0-0, policy_version 814504 (0.00092) [2022-07-10 17:08:48,961][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:08:48,978][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000814506_834054144.pth [2022-07-10 17:08:48,978][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000812558_832059392.pth [2022-07-10 17:08:50,346][26022] Updated weights on worker 0-0, policy_version 814514 (0.00089) [2022-07-10 17:08:51,544][25689] Fps is (10 sec: 5621.4, 60 sec: 5541.7, 300 sec: 5533.5). Total num frames: 834068480. Throughput: 0: 4966.4. Samples: 834066468. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:51,545][25689] Avg episode reward: [(0, '-0.644')] [2022-07-10 17:08:52,381][26022] Updated weights on worker 0-0, policy_version 814524 (0.00089) [2022-07-10 17:08:54,149][26022] Updated weights on worker 0-0, policy_version 814534 (0.00086) [2022-07-10 17:08:55,969][26022] Updated weights on worker 0-0, policy_version 814544 (0.00091) [2022-07-10 17:08:56,548][25689] Fps is (10 sec: 5726.9, 60 sec: 5565.0, 300 sec: 5538.0). Total num frames: 834097152. Throughput: 0: 5790.4. Samples: 834099464. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:08:56,548][25689] Avg episode reward: [(0, '-0.655')] [2022-07-10 17:08:57,821][26022] Updated weights on worker 0-0, policy_version 814554 (0.00086) [2022-07-10 17:08:59,650][26022] Updated weights on worker 0-0, policy_version 814564 (0.00385) [2022-07-10 17:09:01,572][26022] Updated weights on worker 0-0, policy_version 814574 (0.00088) [2022-07-10 17:09:01,636][25689] Fps is (10 sec: 5479.3, 60 sec: 5529.6, 300 sec: 5536.5). Total num frames: 834123776. Throughput: 0: 5788.9. Samples: 834132612. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:01,636][25689] Avg episode reward: [(0, '-0.820')] [2022-07-10 17:09:03,510][26022] Updated weights on worker 0-0, policy_version 814584 (0.00099) [2022-07-10 17:09:05,585][26022] Updated weights on worker 0-0, policy_version 814594 (0.00089) [2022-07-10 17:09:06,679][25689] Fps is (10 sec: 5256.0, 60 sec: 5514.3, 300 sec: 5536.0). Total num frames: 834150400. Throughput: 0: 4836.6. Samples: 834147286. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:06,679][25689] Avg episode reward: [(0, '-0.671')] [2022-07-10 17:09:07,525][26022] Updated weights on worker 0-0, policy_version 814604 (0.00087) [2022-07-10 17:09:09,258][26022] Updated weights on worker 0-0, policy_version 814614 (0.00083) [2022-07-10 17:09:11,061][26022] Updated weights on worker 0-0, policy_version 814624 (0.00093) [2022-07-10 17:09:11,715][25689] Fps is (10 sec: 5283.1, 60 sec: 5494.6, 300 sec: 5528.5). Total num frames: 834177024. Throughput: 0: 5649.8. Samples: 834180428. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:11,715][25689] Avg episode reward: [(0, '0.954')] [2022-07-10 17:09:13,004][26022] Updated weights on worker 0-0, policy_version 814634 (0.00987) [2022-07-10 17:09:14,743][26022] Updated weights on worker 0-0, policy_version 814644 (0.00085) [2022-07-10 17:09:16,660][26022] Updated weights on worker 0-0, policy_version 814654 (0.00085) [2022-07-10 17:09:16,733][25689] Fps is (10 sec: 5499.7, 60 sec: 5517.2, 300 sec: 5532.4). Total num frames: 834205696. Throughput: 0: 5669.1. Samples: 834213896. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:16,733][25689] Avg episode reward: [(0, '1.477')] [2022-07-10 17:09:18,536][26022] Updated weights on worker 0-0, policy_version 814664 (0.00092) [2022-07-10 17:09:20,288][26022] Updated weights on worker 0-0, policy_version 814674 (0.00087) [2022-07-10 17:09:21,843][25689] Fps is (10 sec: 5661.8, 60 sec: 5529.5, 300 sec: 5530.9). Total num frames: 834234368. Throughput: 0: 4850.0. Samples: 834230616. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:21,843][25689] Avg episode reward: [(0, '0.591')] [2022-07-10 17:09:22,326][26022] Updated weights on worker 0-0, policy_version 814684 (0.00084) [2022-07-10 17:09:23,866][26022] Updated weights on worker 0-0, policy_version 814694 (0.00085) [2022-07-10 17:09:25,903][26022] Updated weights on worker 0-0, policy_version 814704 (0.00088) [2022-07-10 17:09:26,936][25689] Fps is (10 sec: 5620.2, 60 sec: 5527.5, 300 sec: 5536.5). Total num frames: 834263040. Throughput: 0: 5779.8. Samples: 834264370. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:26,937][25689] Avg episode reward: [(0, '-1.104')] [2022-07-10 17:09:27,619][26022] Updated weights on worker 0-0, policy_version 814714 (0.00092) [2022-07-10 17:09:29,594][26022] Updated weights on worker 0-0, policy_version 814724 (0.00083) [2022-07-10 17:09:31,501][26022] Updated weights on worker 0-0, policy_version 814734 (0.00087) [2022-07-10 17:09:31,985][25689] Fps is (10 sec: 5553.3, 60 sec: 5508.4, 300 sec: 5529.0). Total num frames: 834290688. Throughput: 0: 5786.9. Samples: 834297728. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:31,987][25689] Avg episode reward: [(0, '-1.135')] [2022-07-10 17:09:33,059][26022] Updated weights on worker 0-0, policy_version 814744 (0.00057) [2022-07-10 17:09:35,112][26022] Updated weights on worker 0-0, policy_version 814754 (0.00090) [2022-07-10 17:09:36,675][26022] Updated weights on worker 0-0, policy_version 814764 (0.00092) [2022-07-10 17:09:36,990][25689] Fps is (10 sec: 5601.6, 60 sec: 5530.1, 300 sec: 5530.9). Total num frames: 834319360. Throughput: 0: 4965.7. Samples: 834314490. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:36,991][25689] Avg episode reward: [(0, '-2.439')] [2022-07-10 17:09:38,686][26022] Updated weights on worker 0-0, policy_version 814774 (0.00089) [2022-07-10 17:09:40,749][26022] Updated weights on worker 0-0, policy_version 814784 (0.00955) [2022-07-10 17:09:42,036][25689] Fps is (10 sec: 5603.2, 60 sec: 5533.7, 300 sec: 5530.3). Total num frames: 834347008. Throughput: 0: 5790.6. Samples: 834347546. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:42,037][25689] Avg episode reward: [(0, '-2.610')] [2022-07-10 17:09:42,301][26022] Updated weights on worker 0-0, policy_version 814794 (0.00089) [2022-07-10 17:09:44,325][26022] Updated weights on worker 0-0, policy_version 814804 (0.00086) [2022-07-10 17:09:46,167][26022] Updated weights on worker 0-0, policy_version 814814 (0.00091) [2022-07-10 17:09:47,073][25689] Fps is (10 sec: 5382.6, 60 sec: 5514.2, 300 sec: 5523.8). Total num frames: 834373632. Throughput: 0: 5798.6. Samples: 834381138. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:47,074][25689] Avg episode reward: [(0, '-2.863')] [2022-07-10 17:09:47,763][26022] Updated weights on worker 0-0, policy_version 814824 (0.00088) [2022-07-10 17:09:49,943][26022] Updated weights on worker 0-0, policy_version 814834 (0.00097) [2022-07-10 17:09:51,424][26022] Updated weights on worker 0-0, policy_version 814844 (0.00091) [2022-07-10 17:09:52,088][25689] Fps is (10 sec: 5603.1, 60 sec: 5530.7, 300 sec: 5527.1). Total num frames: 834403328. Throughput: 0: 4987.6. Samples: 834397994. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:52,088][25689] Avg episode reward: [(0, '-1.694')] [2022-07-10 17:09:53,566][26022] Updated weights on worker 0-0, policy_version 814854 (0.00094) [2022-07-10 17:09:55,072][26022] Updated weights on worker 0-0, policy_version 814864 (0.00094) [2022-07-10 17:09:56,923][26022] Updated weights on worker 0-0, policy_version 814874 (0.00087) [2022-07-10 17:09:57,109][25689] Fps is (10 sec: 5714.0, 60 sec: 5512.2, 300 sec: 5528.8). Total num frames: 834430976. Throughput: 0: 5821.4. Samples: 834431608. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:09:57,110][25689] Avg episode reward: [(0, '-0.534')] [2022-07-10 17:09:58,903][26022] Updated weights on worker 0-0, policy_version 814884 (0.00089) [2022-07-10 17:10:00,805][26022] Updated weights on worker 0-0, policy_version 814894 (0.00085) [2022-07-10 17:10:02,209][25689] Fps is (10 sec: 5261.0, 60 sec: 5494.2, 300 sec: 5523.5). Total num frames: 834456576. Throughput: 0: 5789.7. Samples: 834464340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:02,209][25689] Avg episode reward: [(0, '0.095')] [2022-07-10 17:10:02,746][26022] Updated weights on worker 0-0, policy_version 814904 (0.00090) [2022-07-10 17:10:04,869][26022] Updated weights on worker 0-0, policy_version 814914 (0.00097) [2022-07-10 17:10:06,443][26022] Updated weights on worker 0-0, policy_version 814924 (0.00084) [2022-07-10 17:10:07,258][25689] Fps is (10 sec: 5347.4, 60 sec: 5527.4, 300 sec: 5530.1). Total num frames: 834485248. Throughput: 0: 4881.8. Samples: 834479676. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:07,259][25689] Avg episode reward: [(0, '0.265')] [2022-07-10 17:10:08,469][26022] Updated weights on worker 0-0, policy_version 814934 (0.00084) [2022-07-10 17:10:10,272][26022] Updated weights on worker 0-0, policy_version 814944 (0.00091) [2022-07-10 17:10:12,272][25689] Fps is (10 sec: 5494.9, 60 sec: 5529.4, 300 sec: 5526.4). Total num frames: 834511872. Throughput: 0: 5695.0. Samples: 834512944. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:12,273][25689] Avg episode reward: [(0, '-0.075')] [2022-07-10 17:10:12,361][26022] Updated weights on worker 0-0, policy_version 814954 (0.00084) [2022-07-10 17:10:14,100][26022] Updated weights on worker 0-0, policy_version 814964 (0.00076) [2022-07-10 17:10:16,029][26022] Updated weights on worker 0-0, policy_version 814974 (0.00096) [2022-07-10 17:10:17,301][25689] Fps is (10 sec: 5506.2, 60 sec: 5528.5, 300 sec: 5523.2). Total num frames: 834540544. Throughput: 0: 5681.1. Samples: 834546318. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:17,302][25689] Avg episode reward: [(0, '0.129')] [2022-07-10 17:10:17,771][26022] Updated weights on worker 0-0, policy_version 814984 (0.00084) [2022-07-10 17:10:19,327][26022] Updated weights on worker 0-0, policy_version 814994 (0.00088) [2022-07-10 17:10:21,440][26022] Updated weights on worker 0-0, policy_version 815004 (0.00085) [2022-07-10 17:10:22,384][25689] Fps is (10 sec: 5873.6, 60 sec: 5564.8, 300 sec: 5537.2). Total num frames: 834571264. Throughput: 0: 4907.9. Samples: 834563356. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:22,384][25689] Avg episode reward: [(0, '0.053')] [2022-07-10 17:10:23,015][26022] Updated weights on worker 0-0, policy_version 815014 (0.00091) [2022-07-10 17:10:25,094][26022] Updated weights on worker 0-0, policy_version 815024 (0.00091) [2022-07-10 17:10:27,091][26022] Updated weights on worker 0-0, policy_version 815034 (0.00086) [2022-07-10 17:10:27,437][25689] Fps is (10 sec: 5455.2, 60 sec: 5500.7, 300 sec: 5522.9). Total num frames: 834595840. Throughput: 0: 5809.1. Samples: 834596898. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:27,438][25689] Avg episode reward: [(0, '-0.291')] [2022-07-10 17:10:28,578][26022] Updated weights on worker 0-0, policy_version 815044 (0.00092) [2022-07-10 17:10:30,751][26022] Updated weights on worker 0-0, policy_version 815054 (0.00150) [2022-07-10 17:10:32,262][26022] Updated weights on worker 0-0, policy_version 815064 (0.00092) [2022-07-10 17:10:32,539][25689] Fps is (10 sec: 5445.2, 60 sec: 5546.6, 300 sec: 5532.8). Total num frames: 834626560. Throughput: 0: 5786.5. Samples: 834630218. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:32,539][25689] Avg episode reward: [(0, '-0.668')] [2022-07-10 17:10:34,252][26022] Updated weights on worker 0-0, policy_version 815074 (0.00104) [2022-07-10 17:10:36,163][26022] Updated weights on worker 0-0, policy_version 815084 (0.00088) [2022-07-10 17:10:37,554][25689] Fps is (10 sec: 5769.3, 60 sec: 5528.9, 300 sec: 5533.3). Total num frames: 834654208. Throughput: 0: 4973.8. Samples: 834647054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:37,555][25689] Avg episode reward: [(0, '-0.721')] [2022-07-10 17:10:37,865][26022] Updated weights on worker 0-0, policy_version 815094 (0.00071) [2022-07-10 17:10:39,593][26022] Updated weights on worker 0-0, policy_version 815104 (0.00087) [2022-07-10 17:10:41,696][26022] Updated weights on worker 0-0, policy_version 815114 (0.00102) [2022-07-10 17:10:42,643][25689] Fps is (10 sec: 5472.6, 60 sec: 5524.9, 300 sec: 5525.2). Total num frames: 834681856. Throughput: 0: 5791.9. Samples: 834680696. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:42,644][25689] Avg episode reward: [(0, '-1.043')] [2022-07-10 17:10:43,127][26022] Updated weights on worker 0-0, policy_version 815124 (0.00086) [2022-07-10 17:10:45,357][26022] Updated weights on worker 0-0, policy_version 815134 (0.00105) [2022-07-10 17:10:46,785][26022] Updated weights on worker 0-0, policy_version 815144 (0.00086) [2022-07-10 17:10:47,711][25689] Fps is (10 sec: 5545.1, 60 sec: 5555.9, 300 sec: 5534.4). Total num frames: 834710528. Throughput: 0: 5797.8. Samples: 834714440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:47,711][25689] Avg episode reward: [(0, '-1.689')] [2022-07-10 17:10:48,806][26022] Updated weights on worker 0-0, policy_version 815154 (0.00104) [2022-07-10 17:10:49,010][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:10:49,024][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000815155_834718720.pth [2022-07-10 17:10:49,025][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000813209_832726016.pth [2022-07-10 17:10:50,791][26022] Updated weights on worker 0-0, policy_version 815164 (0.00085) [2022-07-10 17:10:52,526][26022] Updated weights on worker 0-0, policy_version 815174 (0.00083) [2022-07-10 17:10:52,758][25689] Fps is (10 sec: 5669.1, 60 sec: 5536.0, 300 sec: 5533.9). Total num frames: 834739200. Throughput: 0: 5849.5. Samples: 834748492. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:52,759][25689] Avg episode reward: [(0, '-1.483')] [2022-07-10 17:10:54,213][26022] Updated weights on worker 0-0, policy_version 815184 (0.00080) [2022-07-10 17:10:56,190][26022] Updated weights on worker 0-0, policy_version 815194 (0.00094) [2022-07-10 17:10:57,697][26022] Updated weights on worker 0-0, policy_version 815204 (0.00093) [2022-07-10 17:10:57,792][25689] Fps is (10 sec: 5789.8, 60 sec: 5568.6, 300 sec: 5542.4). Total num frames: 834768896. Throughput: 0: 5854.6. Samples: 834765538. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:10:57,793][25689] Avg episode reward: [(0, '-0.328')] [2022-07-10 17:10:59,767][26022] Updated weights on worker 0-0, policy_version 815214 (0.00093) [2022-07-10 17:11:01,627][26022] Updated weights on worker 0-0, policy_version 815224 (0.00085) [2022-07-10 17:11:02,865][25689] Fps is (10 sec: 5268.6, 60 sec: 5537.3, 300 sec: 5528.9). Total num frames: 834792448. Throughput: 0: 5774.4. Samples: 834797466. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:11:02,865][25689] Avg episode reward: [(0, '-0.361')] [2022-07-10 17:11:03,761][26022] Updated weights on worker 0-0, policy_version 815234 (0.00093) [2022-07-10 17:11:05,505][26022] Updated weights on worker 0-0, policy_version 815244 (0.00082) [2022-07-10 17:11:07,470][26022] Updated weights on worker 0-0, policy_version 815254 (0.00094) [2022-07-10 17:11:07,896][25689] Fps is (10 sec: 5270.1, 60 sec: 5555.9, 300 sec: 5543.3). Total num frames: 834822144. Throughput: 0: 5766.9. Samples: 834830846. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:11:07,896][25689] Avg episode reward: [(0, '-0.672')] [2022-07-10 17:11:09,191][26022] Updated weights on worker 0-0, policy_version 815264 (0.00079) [2022-07-10 17:11:11,112][26022] Updated weights on worker 0-0, policy_version 815274 (0.00089) [2022-07-10 17:11:12,899][26022] Updated weights on worker 0-0, policy_version 815284 (0.00087) [2022-07-10 17:11:12,963][25689] Fps is (10 sec: 5780.3, 60 sec: 5584.8, 300 sec: 5542.8). Total num frames: 834850816. Throughput: 0: 4912.1. Samples: 834847738. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-10 17:11:12,963][25689] Avg episode reward: [(0, '-1.334')] [2022-07-10 17:11:15,035][26022] Updated weights on worker 0-0, policy_version 815294 (0.00091) [2022-07-10 17:11:16,441][26022] Updated weights on worker 0-0, policy_version 815304 (0.00088) [2022-07-10 17:11:17,967][25689] Fps is (10 sec: 5490.8, 60 sec: 5553.3, 300 sec: 5534.4). Total num frames: 834877440. Throughput: 0: 5738.2. Samples: 834881304. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:17,967][25689] Avg episode reward: [(0, '-1.360')] [2022-07-10 17:11:18,508][26022] Updated weights on worker 0-0, policy_version 815314 (0.00086) [2022-07-10 17:11:20,192][26022] Updated weights on worker 0-0, policy_version 815324 (0.00080) [2022-07-10 17:11:22,071][26022] Updated weights on worker 0-0, policy_version 815334 (0.00084) [2022-07-10 17:11:23,080][25689] Fps is (10 sec: 5668.2, 60 sec: 5550.6, 300 sec: 5543.2). Total num frames: 834908160. Throughput: 0: 5808.3. Samples: 834914878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:23,080][25689] Avg episode reward: [(0, '-2.004')] [2022-07-10 17:11:23,895][26022] Updated weights on worker 0-0, policy_version 815344 (0.00113) [2022-07-10 17:11:25,772][26022] Updated weights on worker 0-0, policy_version 815354 (0.00090) [2022-07-10 17:11:27,598][26022] Updated weights on worker 0-0, policy_version 815364 (0.00082) [2022-07-10 17:11:28,115][25689] Fps is (10 sec: 5751.4, 60 sec: 5602.8, 300 sec: 5543.3). Total num frames: 834935808. Throughput: 0: 5001.1. Samples: 834931964. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:28,117][25689] Avg episode reward: [(0, '-2.496')] [2022-07-10 17:11:29,449][26022] Updated weights on worker 0-0, policy_version 815374 (0.00092) [2022-07-10 17:11:31,186][26022] Updated weights on worker 0-0, policy_version 815384 (0.00087) [2022-07-10 17:11:33,069][26022] Updated weights on worker 0-0, policy_version 815394 (0.00087) [2022-07-10 17:11:33,131][25689] Fps is (10 sec: 5501.5, 60 sec: 5560.1, 300 sec: 5540.7). Total num frames: 834963456. Throughput: 0: 5831.2. Samples: 834965342. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:33,131][25689] Avg episode reward: [(0, '-2.226')] [2022-07-10 17:11:34,939][26022] Updated weights on worker 0-0, policy_version 815404 (0.00082) [2022-07-10 17:11:36,818][26022] Updated weights on worker 0-0, policy_version 815414 (0.00083) [2022-07-10 17:11:38,158][25689] Fps is (10 sec: 5608.1, 60 sec: 5575.9, 300 sec: 5544.9). Total num frames: 834992128. Throughput: 0: 5846.6. Samples: 834999354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:38,158][25689] Avg episode reward: [(0, '-2.582')] [2022-07-10 17:11:38,443][26022] Updated weights on worker 0-0, policy_version 815424 (0.00082) [2022-07-10 17:11:40,437][26022] Updated weights on worker 0-0, policy_version 815434 (0.00081) [2022-07-10 17:11:42,114][26022] Updated weights on worker 0-0, policy_version 815444 (0.00089) [2022-07-10 17:11:43,291][25689] Fps is (10 sec: 5543.2, 60 sec: 5571.8, 300 sec: 5539.4). Total num frames: 835019776. Throughput: 0: 5008.6. Samples: 835016108. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:43,296][25689] Avg episode reward: [(0, '-2.585')] [2022-07-10 17:11:44,234][26022] Updated weights on worker 0-0, policy_version 815454 (0.00086) [2022-07-10 17:11:45,710][26022] Updated weights on worker 0-0, policy_version 815464 (0.00099) [2022-07-10 17:11:47,769][26022] Updated weights on worker 0-0, policy_version 815474 (0.00094) [2022-07-10 17:11:48,303][25689] Fps is (10 sec: 5652.4, 60 sec: 5593.9, 300 sec: 5546.3). Total num frames: 835049472. Throughput: 0: 5846.0. Samples: 835049982. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:48,309][25689] Avg episode reward: [(0, '-1.594')] [2022-07-10 17:11:49,401][26022] Updated weights on worker 0-0, policy_version 815484 (0.00088) [2022-07-10 17:11:51,316][26022] Updated weights on worker 0-0, policy_version 815494 (0.00083) [2022-07-10 17:11:53,133][26022] Updated weights on worker 0-0, policy_version 815504 (0.00088) [2022-07-10 17:11:53,378][25689] Fps is (10 sec: 5583.6, 60 sec: 5557.6, 300 sec: 5538.1). Total num frames: 835076096. Throughput: 0: 5834.4. Samples: 835083470. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:53,379][25689] Avg episode reward: [(0, '-0.229')] [2022-07-10 17:11:54,935][26022] Updated weights on worker 0-0, policy_version 815514 (0.00085) [2022-07-10 17:11:56,843][26022] Updated weights on worker 0-0, policy_version 815524 (0.00088) [2022-07-10 17:11:58,382][25689] Fps is (10 sec: 5486.2, 60 sec: 5543.4, 300 sec: 5546.3). Total num frames: 835104768. Throughput: 0: 4985.0. Samples: 835100172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:11:58,387][25689] Avg episode reward: [(0, '-0.412')] [2022-07-10 17:11:58,715][26022] Updated weights on worker 0-0, policy_version 815534 (0.00089) [2022-07-10 17:12:00,379][26022] Updated weights on worker 0-0, policy_version 815544 (0.00091) [2022-07-10 17:12:02,759][26022] Updated weights on worker 0-0, policy_version 815554 (0.00079) [2022-07-10 17:12:03,437][25689] Fps is (10 sec: 5700.5, 60 sec: 5629.5, 300 sec: 5549.0). Total num frames: 835133440. Throughput: 0: 5860.5. Samples: 835134174. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:03,437][25689] Avg episode reward: [(0, '-0.584')] [2022-07-10 17:12:04,307][26022] Updated weights on worker 0-0, policy_version 815564 (0.00078) [2022-07-10 17:12:06,158][26022] Updated weights on worker 0-0, policy_version 815574 (0.00080) [2022-07-10 17:12:08,142][26022] Updated weights on worker 0-0, policy_version 815584 (0.00092) [2022-07-10 17:12:08,511][25689] Fps is (10 sec: 5358.2, 60 sec: 5558.0, 300 sec: 5541.3). Total num frames: 835159040. Throughput: 0: 5746.3. Samples: 835166104. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:08,511][25689] Avg episode reward: [(0, '-0.697')] [2022-07-10 17:12:09,670][26022] Updated weights on worker 0-0, policy_version 815594 (0.00089) [2022-07-10 17:12:11,738][26022] Updated weights on worker 0-0, policy_version 815604 (0.00087) [2022-07-10 17:12:13,347][26022] Updated weights on worker 0-0, policy_version 815614 (0.00080) [2022-07-10 17:12:13,542][25689] Fps is (10 sec: 5472.1, 60 sec: 5578.1, 300 sec: 5551.3). Total num frames: 835188736. Throughput: 0: 4937.8. Samples: 835183038. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:13,543][25689] Avg episode reward: [(0, '-1.417')] [2022-07-10 17:12:15,330][26022] Updated weights on worker 0-0, policy_version 815624 (0.00090) [2022-07-10 17:12:17,097][26022] Updated weights on worker 0-0, policy_version 815634 (0.00090) [2022-07-10 17:12:18,571][25689] Fps is (10 sec: 5700.3, 60 sec: 5592.8, 300 sec: 5545.3). Total num frames: 835216384. Throughput: 0: 5771.0. Samples: 835216680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:18,571][25689] Avg episode reward: [(0, '-1.355')] [2022-07-10 17:12:19,063][26022] Updated weights on worker 0-0, policy_version 815644 (0.00412) [2022-07-10 17:12:20,565][26022] Updated weights on worker 0-0, policy_version 815654 (0.00091) [2022-07-10 17:12:22,733][26022] Updated weights on worker 0-0, policy_version 815664 (0.00087) [2022-07-10 17:12:23,632][25689] Fps is (10 sec: 5784.5, 60 sec: 5597.5, 300 sec: 5558.2). Total num frames: 835247104. Throughput: 0: 5761.2. Samples: 835250524. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:23,633][25689] Avg episode reward: [(0, '-2.071')] [2022-07-10 17:12:24,503][26022] Updated weights on worker 0-0, policy_version 815674 (0.00089) [2022-07-10 17:12:26,227][26022] Updated weights on worker 0-0, policy_version 815684 (0.00084) [2022-07-10 17:12:27,965][26022] Updated weights on worker 0-0, policy_version 815694 (0.00089) [2022-07-10 17:12:28,690][25689] Fps is (10 sec: 5667.0, 60 sec: 5578.6, 300 sec: 5551.0). Total num frames: 835273728. Throughput: 0: 5043.6. Samples: 835267878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:28,690][25689] Avg episode reward: [(0, '-2.078')] [2022-07-10 17:12:29,905][26022] Updated weights on worker 0-0, policy_version 815704 (0.00090) [2022-07-10 17:12:31,683][26022] Updated weights on worker 0-0, policy_version 815714 (0.00093) [2022-07-10 17:12:33,618][26022] Updated weights on worker 0-0, policy_version 815724 (0.00083) [2022-07-10 17:12:33,717][25689] Fps is (10 sec: 5381.6, 60 sec: 5577.5, 300 sec: 5551.4). Total num frames: 835301376. Throughput: 0: 5877.8. Samples: 835301624. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:33,718][25689] Avg episode reward: [(0, '-2.504')] [2022-07-10 17:12:35,256][26022] Updated weights on worker 0-0, policy_version 815734 (0.00085) [2022-07-10 17:12:37,076][26022] Updated weights on worker 0-0, policy_version 815744 (0.00089) [2022-07-10 17:12:38,756][25689] Fps is (10 sec: 5696.5, 60 sec: 5593.3, 300 sec: 5555.3). Total num frames: 835331072. Throughput: 0: 5884.2. Samples: 835335454. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:38,756][25689] Avg episode reward: [(0, '-2.193')] [2022-07-10 17:12:38,894][26022] Updated weights on worker 0-0, policy_version 815754 (0.00087) [2022-07-10 17:12:40,738][26022] Updated weights on worker 0-0, policy_version 815764 (0.00082) [2022-07-10 17:12:42,527][26022] Updated weights on worker 0-0, policy_version 815774 (0.00081) [2022-07-10 17:12:43,830][25689] Fps is (10 sec: 5670.6, 60 sec: 5598.8, 300 sec: 5554.5). Total num frames: 835358720. Throughput: 0: 5034.0. Samples: 835352196. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:43,830][25689] Avg episode reward: [(0, '-1.998')] [2022-07-10 17:12:44,261][26022] Updated weights on worker 0-0, policy_version 815784 (0.00093) [2022-07-10 17:12:46,423][26022] Updated weights on worker 0-0, policy_version 815794 (0.00086) [2022-07-10 17:12:47,881][26022] Updated weights on worker 0-0, policy_version 815804 (0.00056) [2022-07-10 17:12:48,845][25689] Fps is (10 sec: 5480.9, 60 sec: 5564.7, 300 sec: 5551.2). Total num frames: 835386368. Throughput: 0: 5854.5. Samples: 835385876. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:48,845][25689] Avg episode reward: [(0, '-4.399')] [2022-07-10 17:12:49,149][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:12:49,163][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000815809_835388416.pth [2022-07-10 17:12:49,163][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000813856_833388544.pth [2022-07-10 17:12:49,938][26022] Updated weights on worker 0-0, policy_version 815814 (0.00091) [2022-07-10 17:12:51,608][26022] Updated weights on worker 0-0, policy_version 815824 (0.00084) [2022-07-10 17:12:53,620][26022] Updated weights on worker 0-0, policy_version 815834 (0.00087) [2022-07-10 17:12:53,894][25689] Fps is (10 sec: 5799.3, 60 sec: 5634.7, 300 sec: 5562.0). Total num frames: 835417088. Throughput: 0: 5860.5. Samples: 835419872. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:53,896][25689] Avg episode reward: [(0, '-4.206')] [2022-07-10 17:12:55,514][26022] Updated weights on worker 0-0, policy_version 815844 (0.00080) [2022-07-10 17:12:56,976][26022] Updated weights on worker 0-0, policy_version 815854 (0.00091) [2022-07-10 17:12:58,932][25689] Fps is (10 sec: 5684.7, 60 sec: 5597.7, 300 sec: 5555.7). Total num frames: 835443712. Throughput: 0: 5855.4. Samples: 835453594. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:12:58,933][25689] Avg episode reward: [(0, '-3.804')] [2022-07-10 17:12:59,043][26022] Updated weights on worker 0-0, policy_version 815864 (0.00091) [2022-07-10 17:13:00,713][26022] Updated weights on worker 0-0, policy_version 815874 (0.00093) [2022-07-10 17:13:03,077][26022] Updated weights on worker 0-0, policy_version 815884 (0.00085) [2022-07-10 17:13:04,061][25689] Fps is (10 sec: 5237.6, 60 sec: 5557.2, 300 sec: 5551.0). Total num frames: 835470336. Throughput: 0: 5804.8. Samples: 835469634. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:04,061][25689] Avg episode reward: [(0, '-2.320')] [2022-07-10 17:13:05,108][26022] Updated weights on worker 0-0, policy_version 815894 (0.00090) [2022-07-10 17:13:06,662][26022] Updated weights on worker 0-0, policy_version 815904 (0.00090) [2022-07-10 17:13:08,531][26022] Updated weights on worker 0-0, policy_version 815914 (0.00095) [2022-07-10 17:13:09,139][25689] Fps is (10 sec: 5417.3, 60 sec: 5607.4, 300 sec: 5553.1). Total num frames: 835499008. Throughput: 0: 5722.8. Samples: 835502018. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:09,140][25689] Avg episode reward: [(0, '-1.825')] [2022-07-10 17:13:10,417][26022] Updated weights on worker 0-0, policy_version 815924 (0.00087) [2022-07-10 17:13:12,087][26022] Updated weights on worker 0-0, policy_version 815934 (0.00089) [2022-07-10 17:13:14,218][25689] Fps is (10 sec: 5545.0, 60 sec: 5569.3, 300 sec: 5553.1). Total num frames: 835526656. Throughput: 0: 5693.4. Samples: 835535582. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:14,218][25689] Avg episode reward: [(0, '1.073')] [2022-07-10 17:13:14,221][26022] Updated weights on worker 0-0, policy_version 815944 (0.00091) [2022-07-10 17:13:15,943][26022] Updated weights on worker 0-0, policy_version 815954 (0.00094) [2022-07-10 17:13:17,739][26022] Updated weights on worker 0-0, policy_version 815964 (0.00087) [2022-07-10 17:13:19,273][25689] Fps is (10 sec: 5557.7, 60 sec: 5583.7, 300 sec: 5556.6). Total num frames: 835555328. Throughput: 0: 4845.1. Samples: 835552148. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:19,275][25689] Avg episode reward: [(0, '0.939')] [2022-07-10 17:13:19,643][26022] Updated weights on worker 0-0, policy_version 815974 (0.00109) [2022-07-10 17:13:21,290][26022] Updated weights on worker 0-0, policy_version 815984 (0.00093) [2022-07-10 17:13:23,371][26022] Updated weights on worker 0-0, policy_version 815994 (0.00090) [2022-07-10 17:13:24,373][25689] Fps is (10 sec: 5747.9, 60 sec: 5563.4, 300 sec: 5559.6). Total num frames: 835585024. Throughput: 0: 5712.8. Samples: 835585670. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:24,373][25689] Avg episode reward: [(0, '0.615')] [2022-07-10 17:13:24,999][26022] Updated weights on worker 0-0, policy_version 816004 (0.00086) [2022-07-10 17:13:26,911][26022] Updated weights on worker 0-0, policy_version 816014 (0.00087) [2022-07-10 17:13:28,754][26022] Updated weights on worker 0-0, policy_version 816024 (0.00101) [2022-07-10 17:13:29,417][25689] Fps is (10 sec: 5551.9, 60 sec: 5564.5, 300 sec: 5552.3). Total num frames: 835611648. Throughput: 0: 5772.0. Samples: 835619062. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:29,418][25689] Avg episode reward: [(0, '0.519')] [2022-07-10 17:13:30,395][26022] Updated weights on worker 0-0, policy_version 816034 (0.00093) [2022-07-10 17:13:32,573][26022] Updated weights on worker 0-0, policy_version 816044 (0.00087) [2022-07-10 17:13:34,435][25689] Fps is (10 sec: 5291.9, 60 sec: 5548.6, 300 sec: 5549.6). Total num frames: 835638272. Throughput: 0: 4949.4. Samples: 835635644. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:34,435][25689] Avg episode reward: [(0, '0.069')] [2022-07-10 17:13:34,499][26022] Updated weights on worker 0-0, policy_version 816054 (0.00087) [2022-07-10 17:13:36,100][26022] Updated weights on worker 0-0, policy_version 816064 (0.00079) [2022-07-10 17:13:38,051][26022] Updated weights on worker 0-0, policy_version 816074 (0.00090) [2022-07-10 17:13:39,443][25689] Fps is (10 sec: 5515.6, 60 sec: 5534.5, 300 sec: 5554.5). Total num frames: 835666944. Throughput: 0: 5813.0. Samples: 835669394. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:39,443][25689] Avg episode reward: [(0, '0.032')] [2022-07-10 17:13:39,843][26022] Updated weights on worker 0-0, policy_version 816084 (0.00082) [2022-07-10 17:13:41,560][26022] Updated weights on worker 0-0, policy_version 816094 (0.00090) [2022-07-10 17:13:43,530][26022] Updated weights on worker 0-0, policy_version 816104 (0.00083) [2022-07-10 17:13:44,475][25689] Fps is (10 sec: 5813.1, 60 sec: 5572.0, 300 sec: 5560.9). Total num frames: 835696640. Throughput: 0: 5843.6. Samples: 835703144. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:44,476][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 17:13:44,991][26022] Updated weights on worker 0-0, policy_version 816114 (0.00088) [2022-07-10 17:13:47,034][26022] Updated weights on worker 0-0, policy_version 816124 (0.00087) [2022-07-10 17:13:48,769][26022] Updated weights on worker 0-0, policy_version 816134 (0.00091) [2022-07-10 17:13:49,502][25689] Fps is (10 sec: 5599.1, 60 sec: 5554.1, 300 sec: 5553.8). Total num frames: 835723264. Throughput: 0: 5027.4. Samples: 835720034. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:49,502][25689] Avg episode reward: [(0, '-1.182')] [2022-07-10 17:13:50,625][26022] Updated weights on worker 0-0, policy_version 816144 (0.00094) [2022-07-10 17:13:52,751][26022] Updated weights on worker 0-0, policy_version 816154 (0.00088) [2022-07-10 17:13:54,295][26022] Updated weights on worker 0-0, policy_version 816164 (0.00084) [2022-07-10 17:13:54,513][25689] Fps is (10 sec: 5610.7, 60 sec: 5540.7, 300 sec: 5557.0). Total num frames: 835752960. Throughput: 0: 5860.3. Samples: 835753310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:54,516][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 17:13:56,476][26022] Updated weights on worker 0-0, policy_version 816174 (0.00088) [2022-07-10 17:13:57,889][26022] Updated weights on worker 0-0, policy_version 816184 (0.00057) [2022-07-10 17:13:59,518][25689] Fps is (10 sec: 5520.7, 60 sec: 5526.9, 300 sec: 5555.2). Total num frames: 835778560. Throughput: 0: 5846.3. Samples: 835786758. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:13:59,518][25689] Avg episode reward: [(0, '-0.780')] [2022-07-10 17:14:00,039][26022] Updated weights on worker 0-0, policy_version 816194 (0.00092) [2022-07-10 17:14:02,166][26022] Updated weights on worker 0-0, policy_version 816204 (0.00092) [2022-07-10 17:14:03,964][26022] Updated weights on worker 0-0, policy_version 816214 (0.00087) [2022-07-10 17:14:04,626][25689] Fps is (10 sec: 5265.3, 60 sec: 5545.6, 300 sec: 5557.4). Total num frames: 835806208. Throughput: 0: 4900.8. Samples: 835801900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:04,627][25689] Avg episode reward: [(0, '-1.010')] [2022-07-10 17:14:05,867][26022] Updated weights on worker 0-0, policy_version 816224 (0.00085) [2022-07-10 17:14:07,747][26022] Updated weights on worker 0-0, policy_version 816234 (0.00086) [2022-07-10 17:14:09,587][26022] Updated weights on worker 0-0, policy_version 816244 (0.00090) [2022-07-10 17:14:09,675][25689] Fps is (10 sec: 5444.0, 60 sec: 5531.4, 300 sec: 5560.6). Total num frames: 835833856. Throughput: 0: 5681.2. Samples: 835834642. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:09,675][25689] Avg episode reward: [(0, '-1.245')] [2022-07-10 17:14:11,537][26022] Updated weights on worker 0-0, policy_version 816254 (0.00091) [2022-07-10 17:14:13,210][26022] Updated weights on worker 0-0, policy_version 816264 (0.00087) [2022-07-10 17:14:14,695][25689] Fps is (10 sec: 5492.0, 60 sec: 5536.8, 300 sec: 5557.1). Total num frames: 835861504. Throughput: 0: 5685.7. Samples: 835868056. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:14,695][25689] Avg episode reward: [(0, '-0.784')] [2022-07-10 17:14:15,213][26022] Updated weights on worker 0-0, policy_version 816274 (0.00087) [2022-07-10 17:14:17,007][26022] Updated weights on worker 0-0, policy_version 816284 (0.00090) [2022-07-10 17:14:18,722][26022] Updated weights on worker 0-0, policy_version 816294 (0.00092) [2022-07-10 17:14:19,709][25689] Fps is (10 sec: 5510.6, 60 sec: 5523.6, 300 sec: 5555.5). Total num frames: 835889152. Throughput: 0: 4847.8. Samples: 835884642. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:19,710][25689] Avg episode reward: [(0, '-0.761')] [2022-07-10 17:14:20,663][26022] Updated weights on worker 0-0, policy_version 816304 (0.00084) [2022-07-10 17:14:22,583][26022] Updated weights on worker 0-0, policy_version 816314 (0.00097) [2022-07-10 17:14:24,248][26022] Updated weights on worker 0-0, policy_version 816324 (0.00089) [2022-07-10 17:14:24,831][25689] Fps is (10 sec: 5657.4, 60 sec: 5521.5, 300 sec: 5558.4). Total num frames: 835918848. Throughput: 0: 5771.6. Samples: 835918514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:24,834][25689] Avg episode reward: [(0, '-1.091')] [2022-07-10 17:14:26,214][26022] Updated weights on worker 0-0, policy_version 816334 (0.00086) [2022-07-10 17:14:28,036][26022] Updated weights on worker 0-0, policy_version 816344 (0.00098) [2022-07-10 17:14:29,723][26022] Updated weights on worker 0-0, policy_version 816354 (0.00087) [2022-07-10 17:14:29,912][25689] Fps is (10 sec: 5620.3, 60 sec: 5535.2, 300 sec: 5557.8). Total num frames: 835946496. Throughput: 0: 5798.8. Samples: 835951994. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:29,913][25689] Avg episode reward: [(0, '-1.000')] [2022-07-10 17:14:31,637][26022] Updated weights on worker 0-0, policy_version 816364 (0.00092) [2022-07-10 17:14:33,422][26022] Updated weights on worker 0-0, policy_version 816374 (0.00087) [2022-07-10 17:14:34,931][25689] Fps is (10 sec: 5373.6, 60 sec: 5535.0, 300 sec: 5550.6). Total num frames: 835973120. Throughput: 0: 4975.3. Samples: 835968736. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:34,931][25689] Avg episode reward: [(0, '-0.599')] [2022-07-10 17:14:35,226][26022] Updated weights on worker 0-0, policy_version 816384 (0.00091) [2022-07-10 17:14:37,000][26022] Updated weights on worker 0-0, policy_version 816394 (0.00087) [2022-07-10 17:14:38,966][26022] Updated weights on worker 0-0, policy_version 816404 (0.00086) [2022-07-10 17:14:39,944][25689] Fps is (10 sec: 5716.2, 60 sec: 5568.4, 300 sec: 5561.6). Total num frames: 836003840. Throughput: 0: 5832.8. Samples: 836002668. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:39,944][25689] Avg episode reward: [(0, '-0.299')] [2022-07-10 17:14:40,775][26022] Updated weights on worker 0-0, policy_version 816414 (0.00091) [2022-07-10 17:14:42,567][26022] Updated weights on worker 0-0, policy_version 816424 (0.00095) [2022-07-10 17:14:44,479][26022] Updated weights on worker 0-0, policy_version 816434 (0.00085) [2022-07-10 17:14:45,033][25689] Fps is (10 sec: 5676.1, 60 sec: 5512.5, 300 sec: 5560.6). Total num frames: 836030464. Throughput: 0: 5828.5. Samples: 836036264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:45,034][25689] Avg episode reward: [(0, '-0.338')] [2022-07-10 17:14:46,190][26022] Updated weights on worker 0-0, policy_version 816444 (0.00090) [2022-07-10 17:14:48,132][26022] Updated weights on worker 0-0, policy_version 816454 (0.00092) [2022-07-10 17:14:49,247][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:14:49,262][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000816460_836055040.pth [2022-07-10 17:14:49,263][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000814506_834054144.pth [2022-07-10 17:14:49,851][26022] Updated weights on worker 0-0, policy_version 816464 (0.00086) [2022-07-10 17:14:50,120][25689] Fps is (10 sec: 5534.7, 60 sec: 5557.7, 300 sec: 5559.2). Total num frames: 836060160. Throughput: 0: 4989.4. Samples: 836052818. Policy #0 lag: (min: 0.0, avg: 9.6, max: 18.0) [2022-07-10 17:14:50,120][25689] Avg episode reward: [(0, '0.110')] [2022-07-10 17:14:51,821][26022] Updated weights on worker 0-0, policy_version 816474 (0.00091) [2022-07-10 17:14:53,412][26022] Updated weights on worker 0-0, policy_version 816484 (0.00086) [2022-07-10 17:14:55,122][25689] Fps is (10 sec: 5684.2, 60 sec: 5524.8, 300 sec: 5559.6). Total num frames: 836087808. Throughput: 0: 5840.9. Samples: 836086670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:14:55,122][25689] Avg episode reward: [(0, '1.555')] [2022-07-10 17:14:55,444][26022] Updated weights on worker 0-0, policy_version 816494 (0.00086) [2022-07-10 17:14:57,329][26022] Updated weights on worker 0-0, policy_version 816504 (0.00089) [2022-07-10 17:14:59,011][26022] Updated weights on worker 0-0, policy_version 816514 (0.00086) [2022-07-10 17:15:00,135][25689] Fps is (10 sec: 5521.4, 60 sec: 5557.8, 300 sec: 5568.1). Total num frames: 836115456. Throughput: 0: 5825.6. Samples: 836120290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:00,136][25689] Avg episode reward: [(0, '1.505')] [2022-07-10 17:15:00,975][26022] Updated weights on worker 0-0, policy_version 816524 (0.00087) [2022-07-10 17:15:02,833][26022] Updated weights on worker 0-0, policy_version 816534 (0.00087) [2022-07-10 17:15:04,866][26022] Updated weights on worker 0-0, policy_version 816544 (0.00081) [2022-07-10 17:15:05,181][25689] Fps is (10 sec: 5395.4, 60 sec: 5546.6, 300 sec: 5561.3). Total num frames: 836142080. Throughput: 0: 4906.4. Samples: 836135114. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:05,181][25689] Avg episode reward: [(0, '1.424')] [2022-07-10 17:15:06,724][26022] Updated weights on worker 0-0, policy_version 816554 (0.00089) [2022-07-10 17:15:08,512][26022] Updated weights on worker 0-0, policy_version 816564 (0.00091) [2022-07-10 17:15:10,254][25689] Fps is (10 sec: 5464.3, 60 sec: 5561.2, 300 sec: 5567.0). Total num frames: 836170752. Throughput: 0: 5780.3. Samples: 836169200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:10,255][25689] Avg episode reward: [(0, '1.257')] [2022-07-10 17:15:10,267][26022] Updated weights on worker 0-0, policy_version 816574 (0.00055) [2022-07-10 17:15:12,136][26022] Updated weights on worker 0-0, policy_version 816584 (0.00088) [2022-07-10 17:15:13,925][26022] Updated weights on worker 0-0, policy_version 816594 (0.00087) [2022-07-10 17:15:15,289][25689] Fps is (10 sec: 5672.7, 60 sec: 5576.7, 300 sec: 5566.9). Total num frames: 836199424. Throughput: 0: 5759.7. Samples: 836202830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:15,291][25689] Avg episode reward: [(0, '1.724')] [2022-07-10 17:15:15,834][26022] Updated weights on worker 0-0, policy_version 816604 (0.00088) [2022-07-10 17:15:17,652][26022] Updated weights on worker 0-0, policy_version 816614 (0.00085) [2022-07-10 17:15:19,478][26022] Updated weights on worker 0-0, policy_version 816624 (0.00103) [2022-07-10 17:15:20,293][25689] Fps is (10 sec: 5610.1, 60 sec: 5577.7, 300 sec: 5558.1). Total num frames: 836227072. Throughput: 0: 5764.3. Samples: 836236490. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:20,294][25689] Avg episode reward: [(0, '1.138')] [2022-07-10 17:15:21,380][26022] Updated weights on worker 0-0, policy_version 816634 (0.00615) [2022-07-10 17:15:23,210][26022] Updated weights on worker 0-0, policy_version 816644 (0.00084) [2022-07-10 17:15:24,891][26022] Updated weights on worker 0-0, policy_version 816654 (0.00091) [2022-07-10 17:15:25,357][25689] Fps is (10 sec: 5593.9, 60 sec: 5566.1, 300 sec: 5571.6). Total num frames: 836255744. Throughput: 0: 5870.4. Samples: 836253560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:25,359][25689] Avg episode reward: [(0, '1.065')] [2022-07-10 17:15:26,712][26022] Updated weights on worker 0-0, policy_version 816664 (0.00092) [2022-07-10 17:15:28,560][26022] Updated weights on worker 0-0, policy_version 816674 (0.00055) [2022-07-10 17:15:30,247][26022] Updated weights on worker 0-0, policy_version 816684 (0.00095) [2022-07-10 17:15:30,410][25689] Fps is (10 sec: 5668.0, 60 sec: 5585.6, 300 sec: 5565.7). Total num frames: 836284416. Throughput: 0: 5848.5. Samples: 836287084. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:30,411][25689] Avg episode reward: [(0, '0.912')] [2022-07-10 17:15:32,334][26022] Updated weights on worker 0-0, policy_version 816694 (0.00085) [2022-07-10 17:15:34,198][26022] Updated weights on worker 0-0, policy_version 816704 (0.00088) [2022-07-10 17:15:35,434][25689] Fps is (10 sec: 5589.3, 60 sec: 5602.1, 300 sec: 5565.5). Total num frames: 836312064. Throughput: 0: 5857.3. Samples: 836320824. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:35,436][25689] Avg episode reward: [(0, '0.912')] [2022-07-10 17:15:35,767][26022] Updated weights on worker 0-0, policy_version 816714 (0.00094) [2022-07-10 17:15:37,821][26022] Updated weights on worker 0-0, policy_version 816724 (0.00585) [2022-07-10 17:15:39,548][26022] Updated weights on worker 0-0, policy_version 816734 (0.00083) [2022-07-10 17:15:40,499][25689] Fps is (10 sec: 5480.9, 60 sec: 5546.5, 300 sec: 5566.0). Total num frames: 836339712. Throughput: 0: 4990.9. Samples: 836337336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:40,500][25689] Avg episode reward: [(0, '0.681')] [2022-07-10 17:15:41,396][26022] Updated weights on worker 0-0, policy_version 816744 (0.00082) [2022-07-10 17:15:43,367][26022] Updated weights on worker 0-0, policy_version 816754 (0.00089) [2022-07-10 17:15:45,004][26022] Updated weights on worker 0-0, policy_version 816764 (0.00088) [2022-07-10 17:15:45,612][25689] Fps is (10 sec: 5533.7, 60 sec: 5578.2, 300 sec: 5565.1). Total num frames: 836368384. Throughput: 0: 5799.1. Samples: 836371016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:45,612][25689] Avg episode reward: [(0, '0.636')] [2022-07-10 17:15:46,938][26022] Updated weights on worker 0-0, policy_version 816774 (0.00085) [2022-07-10 17:15:49,011][26022] Updated weights on worker 0-0, policy_version 816784 (0.00094) [2022-07-10 17:15:50,562][26022] Updated weights on worker 0-0, policy_version 816794 (0.00085) [2022-07-10 17:15:50,665][25689] Fps is (10 sec: 5641.1, 60 sec: 5564.4, 300 sec: 5565.0). Total num frames: 836397056. Throughput: 0: 5799.8. Samples: 836404556. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:50,665][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 17:15:52,541][26022] Updated weights on worker 0-0, policy_version 816804 (0.00084) [2022-07-10 17:15:54,209][26022] Updated weights on worker 0-0, policy_version 816814 (0.00081) [2022-07-10 17:15:55,715][25689] Fps is (10 sec: 5574.6, 60 sec: 5560.0, 300 sec: 5557.8). Total num frames: 836424704. Throughput: 0: 4953.4. Samples: 836421282. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:15:55,715][25689] Avg episode reward: [(0, '0.414')] [2022-07-10 17:15:56,112][26022] Updated weights on worker 0-0, policy_version 816824 (0.00092) [2022-07-10 17:15:58,108][26022] Updated weights on worker 0-0, policy_version 816834 (0.00093) [2022-07-10 17:15:59,555][26022] Updated weights on worker 0-0, policy_version 816844 (0.00376) [2022-07-10 17:16:00,742][25689] Fps is (10 sec: 5487.0, 60 sec: 5558.6, 300 sec: 5572.4). Total num frames: 836452352. Throughput: 0: 5805.7. Samples: 836454862. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:00,743][25689] Avg episode reward: [(0, '-0.096')] [2022-07-10 17:16:02,118][26022] Updated weights on worker 0-0, policy_version 816854 (0.00093) [2022-07-10 17:16:03,678][26022] Updated weights on worker 0-0, policy_version 816864 (0.00087) [2022-07-10 17:16:05,528][26022] Updated weights on worker 0-0, policy_version 816874 (0.00088) [2022-07-10 17:16:05,858][25689] Fps is (10 sec: 5350.3, 60 sec: 5552.2, 300 sec: 5560.5). Total num frames: 836478976. Throughput: 0: 5705.4. Samples: 836486532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:05,859][25689] Avg episode reward: [(0, '-0.512')] [2022-07-10 17:16:07,489][26022] Updated weights on worker 0-0, policy_version 816884 (0.00085) [2022-07-10 17:16:09,331][26022] Updated weights on worker 0-0, policy_version 816894 (0.00086) [2022-07-10 17:16:10,908][25689] Fps is (10 sec: 5540.3, 60 sec: 5571.3, 300 sec: 5564.3). Total num frames: 836508672. Throughput: 0: 4878.2. Samples: 836503306. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:10,909][25689] Avg episode reward: [(0, '-1.344')] [2022-07-10 17:16:10,980][26022] Updated weights on worker 0-0, policy_version 816904 (0.00093) [2022-07-10 17:16:13,039][26022] Updated weights on worker 0-0, policy_version 816914 (0.00097) [2022-07-10 17:16:14,584][26022] Updated weights on worker 0-0, policy_version 816924 (0.00090) [2022-07-10 17:16:15,931][25689] Fps is (10 sec: 5591.5, 60 sec: 5538.7, 300 sec: 5563.9). Total num frames: 836535296. Throughput: 0: 5725.7. Samples: 836537034. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:15,931][25689] Avg episode reward: [(0, '-1.408')] [2022-07-10 17:16:16,660][26022] Updated weights on worker 0-0, policy_version 816934 (0.00086) [2022-07-10 17:16:18,472][26022] Updated weights on worker 0-0, policy_version 816944 (0.00105) [2022-07-10 17:16:20,182][26022] Updated weights on worker 0-0, policy_version 816954 (0.00743) [2022-07-10 17:16:20,948][25689] Fps is (10 sec: 5609.7, 60 sec: 5571.2, 300 sec: 5562.3). Total num frames: 836564992. Throughput: 0: 5730.7. Samples: 836570652. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:20,948][25689] Avg episode reward: [(0, '-1.096')] [2022-07-10 17:16:22,204][26022] Updated weights on worker 0-0, policy_version 816964 (0.00091) [2022-07-10 17:16:23,819][26022] Updated weights on worker 0-0, policy_version 816974 (0.00082) [2022-07-10 17:16:25,689][26022] Updated weights on worker 0-0, policy_version 816984 (0.00088) [2022-07-10 17:16:26,069][25689] Fps is (10 sec: 5656.5, 60 sec: 5549.2, 300 sec: 5560.7). Total num frames: 836592640. Throughput: 0: 4997.2. Samples: 836587526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:26,069][25689] Avg episode reward: [(0, '-2.370')] [2022-07-10 17:16:27,460][26022] Updated weights on worker 0-0, policy_version 816994 (0.00086) [2022-07-10 17:16:29,371][26022] Updated weights on worker 0-0, policy_version 817004 (0.00085) [2022-07-10 17:16:31,087][25689] Fps is (10 sec: 5453.8, 60 sec: 5535.5, 300 sec: 5560.7). Total num frames: 836620288. Throughput: 0: 5851.8. Samples: 836621388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:31,087][25689] Avg episode reward: [(0, '-2.877')] [2022-07-10 17:16:31,196][26022] Updated weights on worker 0-0, policy_version 817014 (0.00091) [2022-07-10 17:16:32,887][26022] Updated weights on worker 0-0, policy_version 817024 (0.00089) [2022-07-10 17:16:34,723][26022] Updated weights on worker 0-0, policy_version 817034 (0.00084) [2022-07-10 17:16:36,095][25689] Fps is (10 sec: 5719.4, 60 sec: 5570.6, 300 sec: 5564.5). Total num frames: 836649984. Throughput: 0: 5883.5. Samples: 836655668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:36,095][25689] Avg episode reward: [(0, '-3.061')] [2022-07-10 17:16:36,642][26022] Updated weights on worker 0-0, policy_version 817044 (0.00091) [2022-07-10 17:16:38,372][26022] Updated weights on worker 0-0, policy_version 817054 (0.00096) [2022-07-10 17:16:40,242][26022] Updated weights on worker 0-0, policy_version 817064 (0.01200) [2022-07-10 17:16:41,100][25689] Fps is (10 sec: 5829.0, 60 sec: 5593.1, 300 sec: 5570.3). Total num frames: 836678656. Throughput: 0: 5049.3. Samples: 836672408. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:41,101][25689] Avg episode reward: [(0, '-2.704')] [2022-07-10 17:16:42,006][26022] Updated weights on worker 0-0, policy_version 817074 (0.00088) [2022-07-10 17:16:43,819][26022] Updated weights on worker 0-0, policy_version 817084 (0.00081) [2022-07-10 17:16:45,859][26022] Updated weights on worker 0-0, policy_version 817094 (0.00081) [2022-07-10 17:16:46,203][25689] Fps is (10 sec: 5470.4, 60 sec: 5560.1, 300 sec: 5558.3). Total num frames: 836705280. Throughput: 0: 5889.0. Samples: 836706098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:46,204][25689] Avg episode reward: [(0, '-3.337')] [2022-07-10 17:16:47,390][26022] Updated weights on worker 0-0, policy_version 817104 (0.00089) [2022-07-10 17:16:49,350][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:16:49,366][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000817113_836723712.pth [2022-07-10 17:16:49,366][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000815155_834718720.pth [2022-07-10 17:16:49,588][26022] Updated weights on worker 0-0, policy_version 817114 (0.00065) [2022-07-10 17:16:51,104][26022] Updated weights on worker 0-0, policy_version 817124 (0.00086) [2022-07-10 17:16:51,229][25689] Fps is (10 sec: 5560.5, 60 sec: 5579.6, 300 sec: 5569.5). Total num frames: 836734976. Throughput: 0: 5881.3. Samples: 836739848. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:51,229][25689] Avg episode reward: [(0, '-3.153')] [2022-07-10 17:16:53,040][26022] Updated weights on worker 0-0, policy_version 817134 (0.00084) [2022-07-10 17:16:54,823][26022] Updated weights on worker 0-0, policy_version 817144 (0.00080) [2022-07-10 17:16:56,264][25689] Fps is (10 sec: 5699.5, 60 sec: 5580.9, 300 sec: 5565.5). Total num frames: 836762624. Throughput: 0: 5001.8. Samples: 836756552. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:16:56,266][25689] Avg episode reward: [(0, '-2.424')] [2022-07-10 17:16:56,599][26022] Updated weights on worker 0-0, policy_version 817154 (0.00089) [2022-07-10 17:16:58,543][26022] Updated weights on worker 0-0, policy_version 817164 (0.00084) [2022-07-10 17:17:00,255][26022] Updated weights on worker 0-0, policy_version 817174 (0.00086) [2022-07-10 17:17:01,294][25689] Fps is (10 sec: 5697.0, 60 sec: 5614.5, 300 sec: 5569.4). Total num frames: 836792320. Throughput: 0: 5845.8. Samples: 836790458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:01,295][25689] Avg episode reward: [(0, '-1.414')] [2022-07-10 17:17:02,372][26022] Updated weights on worker 0-0, policy_version 817184 (0.00091) [2022-07-10 17:17:04,214][26022] Updated weights on worker 0-0, policy_version 817194 (0.00085) [2022-07-10 17:17:06,276][26022] Updated weights on worker 0-0, policy_version 817204 (0.00085) [2022-07-10 17:17:06,359][25689] Fps is (10 sec: 5376.2, 60 sec: 5585.4, 300 sec: 5566.1). Total num frames: 836816896. Throughput: 0: 5761.2. Samples: 836822222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:06,360][25689] Avg episode reward: [(0, '-1.499')] [2022-07-10 17:17:07,838][26022] Updated weights on worker 0-0, policy_version 817214 (0.00091) [2022-07-10 17:17:09,960][26022] Updated weights on worker 0-0, policy_version 817224 (0.00096) [2022-07-10 17:17:11,374][25689] Fps is (10 sec: 5384.1, 60 sec: 5588.6, 300 sec: 5566.4). Total num frames: 836846592. Throughput: 0: 4917.6. Samples: 836838916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:11,375][25689] Avg episode reward: [(0, '-0.691')] [2022-07-10 17:17:11,522][26022] Updated weights on worker 0-0, policy_version 817234 (0.00089) [2022-07-10 17:17:13,432][26022] Updated weights on worker 0-0, policy_version 817244 (0.00079) [2022-07-10 17:17:15,326][26022] Updated weights on worker 0-0, policy_version 817254 (0.00084) [2022-07-10 17:17:16,379][25689] Fps is (10 sec: 5620.8, 60 sec: 5590.2, 300 sec: 5563.4). Total num frames: 836873216. Throughput: 0: 5775.5. Samples: 836872726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:16,380][25689] Avg episode reward: [(0, '-0.924')] [2022-07-10 17:17:17,001][26022] Updated weights on worker 0-0, policy_version 817264 (0.00095) [2022-07-10 17:17:19,050][26022] Updated weights on worker 0-0, policy_version 817274 (0.00098) [2022-07-10 17:17:20,737][26022] Updated weights on worker 0-0, policy_version 817284 (0.00086) [2022-07-10 17:17:21,384][25689] Fps is (10 sec: 5524.4, 60 sec: 5574.4, 300 sec: 5557.6). Total num frames: 836901888. Throughput: 0: 5766.0. Samples: 836906294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:21,384][25689] Avg episode reward: [(0, '-0.979')] [2022-07-10 17:17:22,705][26022] Updated weights on worker 0-0, policy_version 817294 (0.00090) [2022-07-10 17:17:24,466][26022] Updated weights on worker 0-0, policy_version 817304 (0.00085) [2022-07-10 17:17:26,355][26022] Updated weights on worker 0-0, policy_version 817314 (0.00085) [2022-07-10 17:17:26,460][25689] Fps is (10 sec: 5586.9, 60 sec: 5578.5, 300 sec: 5560.7). Total num frames: 836929536. Throughput: 0: 5851.3. Samples: 836939836. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:26,460][25689] Avg episode reward: [(0, '-0.432')] [2022-07-10 17:17:28,137][26022] Updated weights on worker 0-0, policy_version 817324 (0.00088) [2022-07-10 17:17:30,081][26022] Updated weights on worker 0-0, policy_version 817334 (0.00093) [2022-07-10 17:17:31,523][25689] Fps is (10 sec: 5554.8, 60 sec: 5591.4, 300 sec: 5563.5). Total num frames: 836958208. Throughput: 0: 5838.4. Samples: 836956550. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:31,523][25689] Avg episode reward: [(0, '0.365')] [2022-07-10 17:17:31,757][26022] Updated weights on worker 0-0, policy_version 817344 (0.00085) [2022-07-10 17:17:33,779][26022] Updated weights on worker 0-0, policy_version 817354 (0.00078) [2022-07-10 17:17:35,422][26022] Updated weights on worker 0-0, policy_version 817364 (0.00088) [2022-07-10 17:17:36,562][25689] Fps is (10 sec: 5575.0, 60 sec: 5554.6, 300 sec: 5556.6). Total num frames: 836985856. Throughput: 0: 5817.0. Samples: 836990130. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:36,563][25689] Avg episode reward: [(0, '0.430')] [2022-07-10 17:17:37,340][26022] Updated weights on worker 0-0, policy_version 817374 (0.00090) [2022-07-10 17:17:39,245][26022] Updated weights on worker 0-0, policy_version 817384 (0.00846) [2022-07-10 17:17:40,953][26022] Updated weights on worker 0-0, policy_version 817394 (0.00084) [2022-07-10 17:17:41,652][25689] Fps is (10 sec: 5661.4, 60 sec: 5563.8, 300 sec: 5563.2). Total num frames: 837015552. Throughput: 0: 5797.9. Samples: 837023806. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:41,655][25689] Avg episode reward: [(0, '0.236')] [2022-07-10 17:17:42,877][26022] Updated weights on worker 0-0, policy_version 817404 (0.00089) [2022-07-10 17:17:44,620][26022] Updated weights on worker 0-0, policy_version 817414 (0.00087) [2022-07-10 17:17:46,395][26022] Updated weights on worker 0-0, policy_version 817424 (0.00086) [2022-07-10 17:17:46,772][25689] Fps is (10 sec: 5716.5, 60 sec: 5596.0, 300 sec: 5564.6). Total num frames: 837044224. Throughput: 0: 4964.5. Samples: 837040678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:46,773][25689] Avg episode reward: [(0, '-1.009')] [2022-07-10 17:17:48,281][26022] Updated weights on worker 0-0, policy_version 817434 (0.00087) [2022-07-10 17:17:49,914][26022] Updated weights on worker 0-0, policy_version 817444 (0.00087) [2022-07-10 17:17:51,785][25689] Fps is (10 sec: 5557.7, 60 sec: 5563.3, 300 sec: 5555.0). Total num frames: 837071872. Throughput: 0: 5818.3. Samples: 837074444. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:51,786][25689] Avg episode reward: [(0, '-2.764')] [2022-07-10 17:17:51,941][26022] Updated weights on worker 0-0, policy_version 817454 (0.00087) [2022-07-10 17:17:53,749][26022] Updated weights on worker 0-0, policy_version 817464 (0.00090) [2022-07-10 17:17:55,531][26022] Updated weights on worker 0-0, policy_version 817474 (0.00081) [2022-07-10 17:17:56,793][25689] Fps is (10 sec: 5518.2, 60 sec: 5565.9, 300 sec: 5559.0). Total num frames: 837099520. Throughput: 0: 5829.8. Samples: 837108074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:17:56,794][25689] Avg episode reward: [(0, '-4.404')] [2022-07-10 17:17:57,446][26022] Updated weights on worker 0-0, policy_version 817484 (0.00681) [2022-07-10 17:17:59,207][26022] Updated weights on worker 0-0, policy_version 817494 (0.00085) [2022-07-10 17:18:01,148][26022] Updated weights on worker 0-0, policy_version 817504 (0.00086) [2022-07-10 17:18:01,799][25689] Fps is (10 sec: 5624.5, 60 sec: 5551.2, 300 sec: 5568.2). Total num frames: 837128192. Throughput: 0: 5016.1. Samples: 837124866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:18:01,799][25689] Avg episode reward: [(0, '-4.014')] [2022-07-10 17:18:03,243][26022] Updated weights on worker 0-0, policy_version 817514 (0.00088) [2022-07-10 17:18:05,092][26022] Updated weights on worker 0-0, policy_version 817524 (0.00084) [2022-07-10 17:18:06,923][25689] Fps is (10 sec: 5357.7, 60 sec: 5562.7, 300 sec: 5557.0). Total num frames: 837153792. Throughput: 0: 5737.7. Samples: 837156294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:18:06,923][25689] Avg episode reward: [(0, '-4.063')] [2022-07-10 17:18:06,967][26022] Updated weights on worker 0-0, policy_version 817534 (0.00094) [2022-07-10 17:18:08,736][26022] Updated weights on worker 0-0, policy_version 817544 (0.00081) [2022-07-10 17:18:10,789][26022] Updated weights on worker 0-0, policy_version 817554 (0.00089) [2022-07-10 17:18:11,998][25689] Fps is (10 sec: 5321.3, 60 sec: 5540.3, 300 sec: 5560.5). Total num frames: 837182464. Throughput: 0: 5713.7. Samples: 837189930. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:18:11,998][25689] Avg episode reward: [(0, '-3.838')] [2022-07-10 17:18:12,308][26022] Updated weights on worker 0-0, policy_version 817564 (0.00085) [2022-07-10 17:18:14,219][26022] Updated weights on worker 0-0, policy_version 817574 (0.00091) [2022-07-10 17:18:16,025][26022] Updated weights on worker 0-0, policy_version 817584 (0.00084) [2022-07-10 17:18:17,046][25689] Fps is (10 sec: 5664.9, 60 sec: 5570.1, 300 sec: 5560.6). Total num frames: 837211136. Throughput: 0: 4878.6. Samples: 837206878. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:18:17,050][25689] Avg episode reward: [(0, '-3.818')] [2022-07-10 17:18:17,994][26022] Updated weights on worker 0-0, policy_version 817594 (0.00083) [2022-07-10 17:18:19,575][26022] Updated weights on worker 0-0, policy_version 817604 (0.00088) [2022-07-10 17:18:21,616][26022] Updated weights on worker 0-0, policy_version 817614 (0.00093) [2022-07-10 17:18:22,071][25689] Fps is (10 sec: 5692.7, 60 sec: 5568.2, 300 sec: 5558.6). Total num frames: 837239808. Throughput: 0: 5707.7. Samples: 837240574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:18:22,072][25689] Avg episode reward: [(0, '-0.121')] [2022-07-10 17:18:23,378][26022] Updated weights on worker 0-0, policy_version 817624 (0.00423) [2022-07-10 17:18:25,105][26022] Updated weights on worker 0-0, policy_version 817634 (0.00089) [2022-07-10 17:18:27,038][26022] Updated weights on worker 0-0, policy_version 817644 (0.00093) [2022-07-10 17:18:27,147][25689] Fps is (10 sec: 5575.7, 60 sec: 5568.3, 300 sec: 5561.5). Total num frames: 837267456. Throughput: 0: 5822.2. Samples: 837274040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 17:18:27,147][25689] Avg episode reward: [(0, '0.399')] [2022-07-10 17:18:29,036][26022] Updated weights on worker 0-0, policy_version 817654 (0.00107) [2022-07-10 17:18:30,802][26022] Updated weights on worker 0-0, policy_version 817664 (0.00091) [2022-07-10 17:18:32,151][25689] Fps is (10 sec: 5384.1, 60 sec: 5539.9, 300 sec: 5561.7). Total num frames: 837294080. Throughput: 0: 4998.5. Samples: 837290666. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:18:32,151][25689] Avg episode reward: [(0, '0.322')] [2022-07-10 17:18:32,623][26022] Updated weights on worker 0-0, policy_version 817674 (0.00087) [2022-07-10 17:18:34,376][26022] Updated weights on worker 0-0, policy_version 817684 (0.00096) [2022-07-10 17:18:36,374][26022] Updated weights on worker 0-0, policy_version 817694 (0.00091) [2022-07-10 17:18:37,162][25689] Fps is (10 sec: 5418.7, 60 sec: 5542.4, 300 sec: 5558.2). Total num frames: 837321728. Throughput: 0: 5806.5. Samples: 837323684. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:18:37,163][25689] Avg episode reward: [(0, '0.172')] [2022-07-10 17:18:38,158][26022] Updated weights on worker 0-0, policy_version 817704 (0.00092) [2022-07-10 17:18:40,083][26022] Updated weights on worker 0-0, policy_version 817714 (0.00082) [2022-07-10 17:18:41,743][26022] Updated weights on worker 0-0, policy_version 817724 (0.00088) [2022-07-10 17:18:42,186][25689] Fps is (10 sec: 5714.6, 60 sec: 5548.5, 300 sec: 5558.4). Total num frames: 837351424. Throughput: 0: 5800.6. Samples: 837357252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:18:42,186][25689] Avg episode reward: [(0, '0.279')] [2022-07-10 17:18:43,870][26022] Updated weights on worker 0-0, policy_version 817734 (0.00084) [2022-07-10 17:18:45,271][26022] Updated weights on worker 0-0, policy_version 817744 (0.00094) [2022-07-10 17:18:47,234][25689] Fps is (10 sec: 5693.3, 60 sec: 5538.2, 300 sec: 5561.4). Total num frames: 837379072. Throughput: 0: 4981.4. Samples: 837374106. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:18:47,235][25689] Avg episode reward: [(0, '0.169')] [2022-07-10 17:18:47,506][26022] Updated weights on worker 0-0, policy_version 817754 (0.00082) [2022-07-10 17:18:49,051][26022] Updated weights on worker 0-0, policy_version 817764 (0.00095) [2022-07-10 17:18:49,552][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:18:49,569][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000817766_837392384.pth [2022-07-10 17:18:49,570][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000815809_835388416.pth [2022-07-10 17:18:50,971][26022] Updated weights on worker 0-0, policy_version 817774 (0.00102) [2022-07-10 17:18:52,239][25689] Fps is (10 sec: 5500.5, 60 sec: 5539.0, 300 sec: 5554.7). Total num frames: 837406720. Throughput: 0: 5843.7. Samples: 837408052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:18:52,239][25689] Avg episode reward: [(0, '-1.153')] [2022-07-10 17:18:52,799][26022] Updated weights on worker 0-0, policy_version 817784 (0.00088) [2022-07-10 17:18:54,871][26022] Updated weights on worker 0-0, policy_version 817794 (0.00084) [2022-07-10 17:18:56,346][26022] Updated weights on worker 0-0, policy_version 817804 (0.00086) [2022-07-10 17:18:57,247][25689] Fps is (10 sec: 5624.8, 60 sec: 5555.8, 300 sec: 5564.9). Total num frames: 837435392. Throughput: 0: 5852.0. Samples: 837441222. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:18:57,248][25689] Avg episode reward: [(0, '-0.496')] [2022-07-10 17:18:58,544][26022] Updated weights on worker 0-0, policy_version 817814 (0.00092) [2022-07-10 17:18:59,984][26022] Updated weights on worker 0-0, policy_version 817824 (0.00091) [2022-07-10 17:19:02,271][25689] Fps is (10 sec: 5307.7, 60 sec: 5486.4, 300 sec: 5556.2). Total num frames: 837459968. Throughput: 0: 5014.3. Samples: 837457964. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:02,271][25689] Avg episode reward: [(0, '-0.320')] [2022-07-10 17:19:02,498][26022] Updated weights on worker 0-0, policy_version 817834 (0.00088) [2022-07-10 17:19:04,079][26022] Updated weights on worker 0-0, policy_version 817844 (0.00093) [2022-07-10 17:19:06,079][26022] Updated weights on worker 0-0, policy_version 817854 (0.00096) [2022-07-10 17:19:07,376][25689] Fps is (10 sec: 5358.2, 60 sec: 5555.9, 300 sec: 5562.0). Total num frames: 837489664. Throughput: 0: 5719.4. Samples: 837489304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:07,376][25689] Avg episode reward: [(0, '-1.430')] [2022-07-10 17:19:07,949][26022] Updated weights on worker 0-0, policy_version 817864 (0.00090) [2022-07-10 17:19:09,899][26022] Updated weights on worker 0-0, policy_version 817874 (0.00090) [2022-07-10 17:19:11,639][26022] Updated weights on worker 0-0, policy_version 817884 (0.00083) [2022-07-10 17:19:12,383][25689] Fps is (10 sec: 5771.7, 60 sec: 5562.1, 300 sec: 5565.7). Total num frames: 837518336. Throughput: 0: 5686.9. Samples: 837522614. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:12,385][25689] Avg episode reward: [(0, '-1.418')] [2022-07-10 17:19:13,575][26022] Updated weights on worker 0-0, policy_version 817894 (0.00084) [2022-07-10 17:19:15,168][26022] Updated weights on worker 0-0, policy_version 817904 (0.00104) [2022-07-10 17:19:17,170][26022] Updated weights on worker 0-0, policy_version 817914 (0.00094) [2022-07-10 17:19:17,390][25689] Fps is (10 sec: 5521.9, 60 sec: 5532.0, 300 sec: 5562.4). Total num frames: 837544960. Throughput: 0: 4877.6. Samples: 837539470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:17,390][25689] Avg episode reward: [(0, '-0.407')] [2022-07-10 17:19:19,046][26022] Updated weights on worker 0-0, policy_version 817924 (0.00096) [2022-07-10 17:19:20,862][26022] Updated weights on worker 0-0, policy_version 817934 (0.00086) [2022-07-10 17:19:22,393][25689] Fps is (10 sec: 5319.7, 60 sec: 5500.1, 300 sec: 5554.3). Total num frames: 837571584. Throughput: 0: 5707.6. Samples: 837572814. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:22,393][25689] Avg episode reward: [(0, '-1.400')] [2022-07-10 17:19:22,722][26022] Updated weights on worker 0-0, policy_version 817944 (0.00084) [2022-07-10 17:19:24,437][26022] Updated weights on worker 0-0, policy_version 817954 (0.00087) [2022-07-10 17:19:26,402][26022] Updated weights on worker 0-0, policy_version 817964 (0.00088) [2022-07-10 17:19:27,452][25689] Fps is (10 sec: 5597.2, 60 sec: 5535.6, 300 sec: 5561.6). Total num frames: 837601280. Throughput: 0: 5817.5. Samples: 837606096. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:27,452][25689] Avg episode reward: [(0, '-1.624')] [2022-07-10 17:19:28,466][26022] Updated weights on worker 0-0, policy_version 817974 (0.00107) [2022-07-10 17:19:30,096][26022] Updated weights on worker 0-0, policy_version 817984 (0.00092) [2022-07-10 17:19:31,984][26022] Updated weights on worker 0-0, policy_version 817994 (0.00086) [2022-07-10 17:19:32,454][25689] Fps is (10 sec: 5597.7, 60 sec: 5535.7, 300 sec: 5561.9). Total num frames: 837627904. Throughput: 0: 4985.4. Samples: 837622674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:32,455][25689] Avg episode reward: [(0, '-1.684')] [2022-07-10 17:19:33,665][26022] Updated weights on worker 0-0, policy_version 818004 (0.00088) [2022-07-10 17:19:35,655][26022] Updated weights on worker 0-0, policy_version 818014 (0.00098) [2022-07-10 17:19:37,475][25689] Fps is (10 sec: 5414.9, 60 sec: 5534.9, 300 sec: 5551.4). Total num frames: 837655552. Throughput: 0: 5798.9. Samples: 837655940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:37,475][25689] Avg episode reward: [(0, '-0.608')] [2022-07-10 17:19:37,518][26022] Updated weights on worker 0-0, policy_version 818024 (0.00080) [2022-07-10 17:19:39,443][26022] Updated weights on worker 0-0, policy_version 818034 (0.00088) [2022-07-10 17:19:41,215][26022] Updated weights on worker 0-0, policy_version 818044 (0.00090) [2022-07-10 17:19:42,495][25689] Fps is (10 sec: 5609.3, 60 sec: 5518.2, 300 sec: 5559.6). Total num frames: 837684224. Throughput: 0: 5793.6. Samples: 837689276. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:42,495][25689] Avg episode reward: [(0, '-0.434')] [2022-07-10 17:19:43,129][26022] Updated weights on worker 0-0, policy_version 818054 (0.00090) [2022-07-10 17:19:44,811][26022] Updated weights on worker 0-0, policy_version 818064 (0.00058) [2022-07-10 17:19:46,726][26022] Updated weights on worker 0-0, policy_version 818074 (0.00087) [2022-07-10 17:19:47,549][25689] Fps is (10 sec: 5590.3, 60 sec: 5517.7, 300 sec: 5553.3). Total num frames: 837711872. Throughput: 0: 4978.6. Samples: 837706152. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:47,550][25689] Avg episode reward: [(0, '-0.562')] [2022-07-10 17:19:48,589][26022] Updated weights on worker 0-0, policy_version 818084 (0.00085) [2022-07-10 17:19:50,400][26022] Updated weights on worker 0-0, policy_version 818094 (0.00091) [2022-07-10 17:19:52,207][26022] Updated weights on worker 0-0, policy_version 818104 (0.00085) [2022-07-10 17:19:52,563][25689] Fps is (10 sec: 5594.0, 60 sec: 5533.8, 300 sec: 5556.5). Total num frames: 837740544. Throughput: 0: 5816.9. Samples: 837739642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:52,563][25689] Avg episode reward: [(0, '0.228')] [2022-07-10 17:19:54,184][26022] Updated weights on worker 0-0, policy_version 818114 (0.00086) [2022-07-10 17:19:55,845][26022] Updated weights on worker 0-0, policy_version 818124 (0.00087) [2022-07-10 17:19:57,577][25689] Fps is (10 sec: 5514.3, 60 sec: 5499.4, 300 sec: 5553.1). Total num frames: 837767168. Throughput: 0: 5816.9. Samples: 837772874. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:19:57,577][25689] Avg episode reward: [(0, '0.231')] [2022-07-10 17:19:57,916][26022] Updated weights on worker 0-0, policy_version 818134 (0.00090) [2022-07-10 17:19:59,604][26022] Updated weights on worker 0-0, policy_version 818144 (0.00092) [2022-07-10 17:20:01,884][26022] Updated weights on worker 0-0, policy_version 818154 (0.00093) [2022-07-10 17:20:02,584][25689] Fps is (10 sec: 5313.3, 60 sec: 5534.8, 300 sec: 5553.8). Total num frames: 837793792. Throughput: 0: 4987.4. Samples: 837789470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:02,585][25689] Avg episode reward: [(0, '-0.096')] [2022-07-10 17:20:03,655][26022] Updated weights on worker 0-0, policy_version 818164 (0.00096) [2022-07-10 17:20:05,364][26022] Updated weights on worker 0-0, policy_version 818174 (0.00095) [2022-07-10 17:20:07,339][26022] Updated weights on worker 0-0, policy_version 818184 (0.00094) [2022-07-10 17:20:07,658][25689] Fps is (10 sec: 5485.2, 60 sec: 5520.7, 300 sec: 5553.8). Total num frames: 837822464. Throughput: 0: 5709.3. Samples: 837820958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:07,658][25689] Avg episode reward: [(0, '-1.068')] [2022-07-10 17:20:09,217][26022] Updated weights on worker 0-0, policy_version 818194 (0.00095) [2022-07-10 17:20:10,915][26022] Updated weights on worker 0-0, policy_version 818204 (0.00096) [2022-07-10 17:20:12,687][25689] Fps is (10 sec: 5473.6, 60 sec: 5484.8, 300 sec: 5547.0). Total num frames: 837849088. Throughput: 0: 5707.2. Samples: 837854494. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:12,687][25689] Avg episode reward: [(0, '-1.260')] [2022-07-10 17:20:13,071][26022] Updated weights on worker 0-0, policy_version 818214 (0.00086) [2022-07-10 17:20:14,539][26022] Updated weights on worker 0-0, policy_version 818224 (0.00084) [2022-07-10 17:20:16,538][26022] Updated weights on worker 0-0, policy_version 818234 (0.00318) [2022-07-10 17:20:17,708][25689] Fps is (10 sec: 5603.8, 60 sec: 5534.4, 300 sec: 5553.6). Total num frames: 837878784. Throughput: 0: 4891.6. Samples: 837871348. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:17,710][25689] Avg episode reward: [(0, '-1.806')] [2022-07-10 17:20:18,392][26022] Updated weights on worker 0-0, policy_version 818244 (0.00089) [2022-07-10 17:20:19,914][26022] Updated weights on worker 0-0, policy_version 818254 (0.00087) [2022-07-10 17:20:22,037][26022] Updated weights on worker 0-0, policy_version 818264 (0.00088) [2022-07-10 17:20:22,716][25689] Fps is (10 sec: 5717.5, 60 sec: 5550.9, 300 sec: 5551.2). Total num frames: 837906432. Throughput: 0: 5742.5. Samples: 837905078. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:22,718][25689] Avg episode reward: [(0, '-1.742')] [2022-07-10 17:20:23,751][26022] Updated weights on worker 0-0, policy_version 818274 (0.00089) [2022-07-10 17:20:25,629][26022] Updated weights on worker 0-0, policy_version 818284 (0.00091) [2022-07-10 17:20:27,622][26022] Updated weights on worker 0-0, policy_version 818294 (0.00090) [2022-07-10 17:20:27,823][25689] Fps is (10 sec: 5568.2, 60 sec: 5529.5, 300 sec: 5550.2). Total num frames: 837935104. Throughput: 0: 5817.0. Samples: 837938258. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:27,825][25689] Avg episode reward: [(0, '-1.338')] [2022-07-10 17:20:29,278][26022] Updated weights on worker 0-0, policy_version 818304 (0.00090) [2022-07-10 17:20:31,212][26022] Updated weights on worker 0-0, policy_version 818314 (0.00091) [2022-07-10 17:20:32,838][25689] Fps is (10 sec: 5564.5, 60 sec: 5545.4, 300 sec: 5550.3). Total num frames: 837962752. Throughput: 0: 5812.9. Samples: 837971630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:32,839][25689] Avg episode reward: [(0, '-1.019')] [2022-07-10 17:20:32,984][26022] Updated weights on worker 0-0, policy_version 818324 (0.00096) [2022-07-10 17:20:34,755][26022] Updated weights on worker 0-0, policy_version 818334 (0.00083) [2022-07-10 17:20:36,796][26022] Updated weights on worker 0-0, policy_version 818344 (0.00094) [2022-07-10 17:20:37,891][25689] Fps is (10 sec: 5492.3, 60 sec: 5542.4, 300 sec: 5550.6). Total num frames: 837990400. Throughput: 0: 5791.8. Samples: 837988242. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:37,891][25689] Avg episode reward: [(0, '-0.191')] [2022-07-10 17:20:38,483][26022] Updated weights on worker 0-0, policy_version 818354 (0.00084) [2022-07-10 17:20:40,566][26022] Updated weights on worker 0-0, policy_version 818364 (0.00427) [2022-07-10 17:20:42,143][26022] Updated weights on worker 0-0, policy_version 818374 (0.00090) [2022-07-10 17:20:42,921][25689] Fps is (10 sec: 5484.0, 60 sec: 5524.5, 300 sec: 5548.7). Total num frames: 838018048. Throughput: 0: 5762.0. Samples: 838021496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:42,922][25689] Avg episode reward: [(0, '0.452')] [2022-07-10 17:20:44,221][26022] Updated weights on worker 0-0, policy_version 818384 (0.00089) [2022-07-10 17:20:45,888][26022] Updated weights on worker 0-0, policy_version 818394 (0.00092) [2022-07-10 17:20:47,800][26022] Updated weights on worker 0-0, policy_version 818404 (0.00091) [2022-07-10 17:20:47,971][25689] Fps is (10 sec: 5688.8, 60 sec: 5558.8, 300 sec: 5552.2). Total num frames: 838047744. Throughput: 0: 5785.6. Samples: 838054826. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:47,971][25689] Avg episode reward: [(0, '0.820')] [2022-07-10 17:20:49,751][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:20:49,760][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000818414_838055936.pth [2022-07-10 17:20:49,761][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000816460_836055040.pth [2022-07-10 17:20:49,774][26022] Updated weights on worker 0-0, policy_version 818414 (0.00090) [2022-07-10 17:20:51,474][26022] Updated weights on worker 0-0, policy_version 818424 (0.00978) [2022-07-10 17:20:52,984][25689] Fps is (10 sec: 5596.3, 60 sec: 5524.9, 300 sec: 5549.4). Total num frames: 838074368. Throughput: 0: 4969.7. Samples: 838071754. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:52,985][25689] Avg episode reward: [(0, '0.740')] [2022-07-10 17:20:53,391][26022] Updated weights on worker 0-0, policy_version 818434 (0.00095) [2022-07-10 17:20:55,262][26022] Updated weights on worker 0-0, policy_version 818444 (0.00087) [2022-07-10 17:20:57,146][26022] Updated weights on worker 0-0, policy_version 818454 (0.00098) [2022-07-10 17:20:58,021][25689] Fps is (10 sec: 5298.2, 60 sec: 5522.8, 300 sec: 5545.8). Total num frames: 838100992. Throughput: 0: 5774.2. Samples: 838104480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:20:58,022][25689] Avg episode reward: [(0, '-0.374')] [2022-07-10 17:20:58,961][26022] Updated weights on worker 0-0, policy_version 818464 (0.00079) [2022-07-10 17:21:00,906][26022] Updated weights on worker 0-0, policy_version 818474 (0.00087) [2022-07-10 17:21:02,999][26022] Updated weights on worker 0-0, policy_version 818484 (0.00086) [2022-07-10 17:21:03,094][25689] Fps is (10 sec: 5267.2, 60 sec: 5516.9, 300 sec: 5546.6). Total num frames: 838127616. Throughput: 0: 5663.0. Samples: 838135736. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:03,094][25689] Avg episode reward: [(0, '-0.045')] [2022-07-10 17:21:04,918][26022] Updated weights on worker 0-0, policy_version 818494 (0.00082) [2022-07-10 17:21:06,772][26022] Updated weights on worker 0-0, policy_version 818504 (0.00088) [2022-07-10 17:21:08,228][25689] Fps is (10 sec: 5317.4, 60 sec: 5494.5, 300 sec: 5538.1). Total num frames: 838155264. Throughput: 0: 4814.2. Samples: 838152346. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:08,228][25689] Avg episode reward: [(0, '-0.498')] [2022-07-10 17:21:08,606][26022] Updated weights on worker 0-0, policy_version 818514 (0.00092) [2022-07-10 17:21:10,579][26022] Updated weights on worker 0-0, policy_version 818524 (0.00087) [2022-07-10 17:21:12,147][26022] Updated weights on worker 0-0, policy_version 818534 (0.00094) [2022-07-10 17:21:13,236][25689] Fps is (10 sec: 5653.9, 60 sec: 5547.1, 300 sec: 5548.7). Total num frames: 838184960. Throughput: 0: 5633.8. Samples: 838185848. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:13,237][25689] Avg episode reward: [(0, '-0.185')] [2022-07-10 17:21:14,133][26022] Updated weights on worker 0-0, policy_version 818544 (0.00085) [2022-07-10 17:21:16,031][26022] Updated weights on worker 0-0, policy_version 818554 (0.00080) [2022-07-10 17:21:17,673][26022] Updated weights on worker 0-0, policy_version 818564 (0.00086) [2022-07-10 17:21:18,265][25689] Fps is (10 sec: 5712.9, 60 sec: 5512.5, 300 sec: 5541.6). Total num frames: 838212608. Throughput: 0: 5688.8. Samples: 838219646. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:18,266][25689] Avg episode reward: [(0, '0.155')] [2022-07-10 17:21:19,640][26022] Updated weights on worker 0-0, policy_version 818574 (0.00085) [2022-07-10 17:21:21,437][26022] Updated weights on worker 0-0, policy_version 818584 (0.00084) [2022-07-10 17:21:23,207][26022] Updated weights on worker 0-0, policy_version 818594 (0.00088) [2022-07-10 17:21:23,299][25689] Fps is (10 sec: 5494.8, 60 sec: 5510.2, 300 sec: 5543.2). Total num frames: 838240256. Throughput: 0: 4991.7. Samples: 838236596. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:23,300][25689] Avg episode reward: [(0, '0.329')] [2022-07-10 17:21:25,124][26022] Updated weights on worker 0-0, policy_version 818604 (0.00087) [2022-07-10 17:21:26,712][26022] Updated weights on worker 0-0, policy_version 818614 (0.00105) [2022-07-10 17:21:28,431][25689] Fps is (10 sec: 5540.3, 60 sec: 5507.9, 300 sec: 5544.6). Total num frames: 838268928. Throughput: 0: 5829.5. Samples: 838270120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:28,431][25689] Avg episode reward: [(0, '-0.312')] [2022-07-10 17:21:28,887][26022] Updated weights on worker 0-0, policy_version 818624 (0.00089) [2022-07-10 17:21:30,625][26022] Updated weights on worker 0-0, policy_version 818634 (0.00086) [2022-07-10 17:21:32,450][26022] Updated weights on worker 0-0, policy_version 818644 (0.00091) [2022-07-10 17:21:33,502][25689] Fps is (10 sec: 5520.1, 60 sec: 5502.8, 300 sec: 5536.5). Total num frames: 838296576. Throughput: 0: 5799.3. Samples: 838303376. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:33,503][25689] Avg episode reward: [(0, '-0.333')] [2022-07-10 17:21:34,286][26022] Updated weights on worker 0-0, policy_version 818654 (0.00086) [2022-07-10 17:21:35,971][26022] Updated weights on worker 0-0, policy_version 818664 (0.00093) [2022-07-10 17:21:38,063][26022] Updated weights on worker 0-0, policy_version 818674 (0.00105) [2022-07-10 17:21:38,508][25689] Fps is (10 sec: 5588.6, 60 sec: 5523.9, 300 sec: 5536.5). Total num frames: 838325248. Throughput: 0: 4977.3. Samples: 838320402. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:38,509][25689] Avg episode reward: [(0, '0.505')] [2022-07-10 17:21:39,738][26022] Updated weights on worker 0-0, policy_version 818684 (0.00092) [2022-07-10 17:21:41,611][26022] Updated weights on worker 0-0, policy_version 818694 (0.00085) [2022-07-10 17:21:43,438][26022] Updated weights on worker 0-0, policy_version 818704 (0.00080) [2022-07-10 17:21:43,519][25689] Fps is (10 sec: 5622.7, 60 sec: 5525.7, 300 sec: 5541.6). Total num frames: 838352896. Throughput: 0: 5787.0. Samples: 838353604. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:43,519][25689] Avg episode reward: [(0, '0.536')] [2022-07-10 17:21:45,131][26022] Updated weights on worker 0-0, policy_version 818714 (0.00093) [2022-07-10 17:21:47,182][26022] Updated weights on worker 0-0, policy_version 818724 (0.00097) [2022-07-10 17:21:48,657][25689] Fps is (10 sec: 5549.5, 60 sec: 5500.8, 300 sec: 5536.1). Total num frames: 838381568. Throughput: 0: 5776.9. Samples: 838386964. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:48,659][25689] Avg episode reward: [(0, '0.020')] [2022-07-10 17:21:49,136][26022] Updated weights on worker 0-0, policy_version 818734 (0.00091) [2022-07-10 17:21:50,746][26022] Updated weights on worker 0-0, policy_version 818744 (0.00093) [2022-07-10 17:21:52,785][26022] Updated weights on worker 0-0, policy_version 818754 (0.00088) [2022-07-10 17:21:53,754][25689] Fps is (10 sec: 5502.4, 60 sec: 5510.1, 300 sec: 5535.0). Total num frames: 838409216. Throughput: 0: 4955.9. Samples: 838403744. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:53,755][25689] Avg episode reward: [(0, '0.191')] [2022-07-10 17:21:54,242][26022] Updated weights on worker 0-0, policy_version 818764 (0.00083) [2022-07-10 17:21:56,374][26022] Updated weights on worker 0-0, policy_version 818774 (0.00087) [2022-07-10 17:21:58,252][26022] Updated weights on worker 0-0, policy_version 818784 (0.00089) [2022-07-10 17:21:58,773][25689] Fps is (10 sec: 5466.2, 60 sec: 5528.6, 300 sec: 5528.3). Total num frames: 838436864. Throughput: 0: 5755.6. Samples: 838437038. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:21:58,774][25689] Avg episode reward: [(0, '-0.574')] [2022-07-10 17:21:59,938][26022] Updated weights on worker 0-0, policy_version 818794 (0.00082) [2022-07-10 17:22:02,074][26022] Updated weights on worker 0-0, policy_version 818804 (0.00099) [2022-07-10 17:22:03,784][25689] Fps is (10 sec: 5513.2, 60 sec: 5551.1, 300 sec: 5539.6). Total num frames: 838464512. Throughput: 0: 5661.4. Samples: 838468334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 17:22:03,786][25689] Avg episode reward: [(0, '-1.620')] [2022-07-10 17:22:04,028][26022] Updated weights on worker 0-0, policy_version 818814 (0.00095) [2022-07-10 17:22:05,997][26022] Updated weights on worker 0-0, policy_version 818824 (0.00092) [2022-07-10 17:22:07,782][26022] Updated weights on worker 0-0, policy_version 818834 (0.00095) [2022-07-10 17:22:08,864][25689] Fps is (10 sec: 5378.2, 60 sec: 5539.1, 300 sec: 5528.1). Total num frames: 838491136. Throughput: 0: 4858.5. Samples: 838485140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:08,865][25689] Avg episode reward: [(0, '-2.003')] [2022-07-10 17:22:09,621][26022] Updated weights on worker 0-0, policy_version 818844 (0.00087) [2022-07-10 17:22:11,401][26022] Updated weights on worker 0-0, policy_version 818854 (0.00088) [2022-07-10 17:22:13,246][26022] Updated weights on worker 0-0, policy_version 818864 (0.00090) [2022-07-10 17:22:13,927][25689] Fps is (10 sec: 5350.6, 60 sec: 5500.4, 300 sec: 5530.4). Total num frames: 838518784. Throughput: 0: 5691.7. Samples: 838518562. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:13,927][25689] Avg episode reward: [(0, '-2.150')] [2022-07-10 17:22:15,173][26022] Updated weights on worker 0-0, policy_version 818874 (0.00089) [2022-07-10 17:22:17,295][26022] Updated weights on worker 0-0, policy_version 818884 (0.00091) [2022-07-10 17:22:18,687][26022] Updated weights on worker 0-0, policy_version 818894 (0.00086) [2022-07-10 17:22:18,939][25689] Fps is (10 sec: 5793.5, 60 sec: 5552.6, 300 sec: 5537.2). Total num frames: 838549504. Throughput: 0: 5706.9. Samples: 838552122. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:18,940][25689] Avg episode reward: [(0, '-1.491')] [2022-07-10 17:22:20,956][26022] Updated weights on worker 0-0, policy_version 818904 (0.00093) [2022-07-10 17:22:22,408][26022] Updated weights on worker 0-0, policy_version 818914 (0.00090) [2022-07-10 17:22:23,955][25689] Fps is (10 sec: 5514.3, 60 sec: 5503.6, 300 sec: 5528.0). Total num frames: 838574080. Throughput: 0: 4980.2. Samples: 838568790. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:23,955][25689] Avg episode reward: [(0, '-1.416')] [2022-07-10 17:22:24,418][26022] Updated weights on worker 0-0, policy_version 818924 (0.00086) [2022-07-10 17:22:26,072][26022] Updated weights on worker 0-0, policy_version 818934 (0.00079) [2022-07-10 17:22:28,226][26022] Updated weights on worker 0-0, policy_version 818944 (0.00091) [2022-07-10 17:22:29,026][25689] Fps is (10 sec: 5481.9, 60 sec: 5542.9, 300 sec: 5534.7). Total num frames: 838604800. Throughput: 0: 5799.4. Samples: 838602066. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:29,026][25689] Avg episode reward: [(0, '-0.766')] [2022-07-10 17:22:30,029][26022] Updated weights on worker 0-0, policy_version 818954 (0.00090) [2022-07-10 17:22:31,825][26022] Updated weights on worker 0-0, policy_version 818964 (0.00085) [2022-07-10 17:22:33,686][26022] Updated weights on worker 0-0, policy_version 818974 (0.00084) [2022-07-10 17:22:34,059][25689] Fps is (10 sec: 5675.4, 60 sec: 5529.5, 300 sec: 5531.4). Total num frames: 838631424. Throughput: 0: 5796.5. Samples: 838635256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:34,059][25689] Avg episode reward: [(0, '-0.681')] [2022-07-10 17:22:35,386][26022] Updated weights on worker 0-0, policy_version 818984 (0.00084) [2022-07-10 17:22:37,329][26022] Updated weights on worker 0-0, policy_version 818994 (0.00090) [2022-07-10 17:22:38,988][26022] Updated weights on worker 0-0, policy_version 819004 (0.00094) [2022-07-10 17:22:39,089][25689] Fps is (10 sec: 5596.5, 60 sec: 5544.2, 300 sec: 5532.5). Total num frames: 838661120. Throughput: 0: 5795.9. Samples: 838668912. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:39,090][25689] Avg episode reward: [(0, '-0.258')] [2022-07-10 17:22:41,085][26022] Updated weights on worker 0-0, policy_version 819014 (0.00092) [2022-07-10 17:22:42,614][26022] Updated weights on worker 0-0, policy_version 819024 (0.00087) [2022-07-10 17:22:44,103][25689] Fps is (10 sec: 5505.3, 60 sec: 5510.1, 300 sec: 5524.1). Total num frames: 838686720. Throughput: 0: 5803.5. Samples: 838685720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:44,103][25689] Avg episode reward: [(0, '-0.965')] [2022-07-10 17:22:44,789][26022] Updated weights on worker 0-0, policy_version 819034 (0.00085) [2022-07-10 17:22:46,338][26022] Updated weights on worker 0-0, policy_version 819044 (0.00089) [2022-07-10 17:22:48,409][26022] Updated weights on worker 0-0, policy_version 819054 (0.00093) [2022-07-10 17:22:49,174][25689] Fps is (10 sec: 5483.0, 60 sec: 5533.1, 300 sec: 5529.9). Total num frames: 838716416. Throughput: 0: 5810.7. Samples: 838719142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:49,175][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 17:22:49,939][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:22:49,953][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000819064_838721536.pth [2022-07-10 17:22:49,954][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000817113_836723712.pth [2022-07-10 17:22:49,956][26022] Updated weights on worker 0-0, policy_version 819064 (0.00091) [2022-07-10 17:22:52,126][26022] Updated weights on worker 0-0, policy_version 819074 (0.00086) [2022-07-10 17:22:53,576][26022] Updated weights on worker 0-0, policy_version 819084 (0.00085) [2022-07-10 17:22:54,228][25689] Fps is (10 sec: 5663.6, 60 sec: 5537.1, 300 sec: 5529.1). Total num frames: 838744064. Throughput: 0: 5819.5. Samples: 838752630. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:54,228][25689] Avg episode reward: [(0, '-1.419')] [2022-07-10 17:22:55,719][26022] Updated weights on worker 0-0, policy_version 819094 (0.00080) [2022-07-10 17:22:57,388][26022] Updated weights on worker 0-0, policy_version 819104 (0.00093) [2022-07-10 17:22:59,234][25689] Fps is (10 sec: 5293.3, 60 sec: 5504.4, 300 sec: 5518.8). Total num frames: 838769664. Throughput: 0: 4986.4. Samples: 838769360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:22:59,234][25689] Avg episode reward: [(0, '-1.231')] [2022-07-10 17:22:59,593][26022] Updated weights on worker 0-0, policy_version 819114 (0.00079) [2022-07-10 17:23:00,951][26022] Updated weights on worker 0-0, policy_version 819124 (0.00101) [2022-07-10 17:23:03,517][26022] Updated weights on worker 0-0, policy_version 819134 (0.00085) [2022-07-10 17:23:04,256][25689] Fps is (10 sec: 5309.8, 60 sec: 5503.3, 300 sec: 5527.5). Total num frames: 838797312. Throughput: 0: 5704.8. Samples: 838800690. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:04,257][25689] Avg episode reward: [(0, '-2.016')] [2022-07-10 17:23:05,190][26022] Updated weights on worker 0-0, policy_version 819144 (0.00090) [2022-07-10 17:23:07,232][26022] Updated weights on worker 0-0, policy_version 819154 (0.00088) [2022-07-10 17:23:09,054][26022] Updated weights on worker 0-0, policy_version 819164 (0.00087) [2022-07-10 17:23:09,371][25689] Fps is (10 sec: 5454.6, 60 sec: 5517.1, 300 sec: 5523.3). Total num frames: 838824960. Throughput: 0: 5673.2. Samples: 838833724. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:09,372][25689] Avg episode reward: [(0, '-1.775')] [2022-07-10 17:23:10,722][26022] Updated weights on worker 0-0, policy_version 819174 (0.00088) [2022-07-10 17:23:12,750][26022] Updated weights on worker 0-0, policy_version 819184 (0.00087) [2022-07-10 17:23:14,375][25689] Fps is (10 sec: 5565.8, 60 sec: 5539.4, 300 sec: 5524.2). Total num frames: 838853632. Throughput: 0: 4859.1. Samples: 838850528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:14,377][25689] Avg episode reward: [(0, '-1.988')] [2022-07-10 17:23:14,467][26022] Updated weights on worker 0-0, policy_version 819194 (0.00089) [2022-07-10 17:23:16,237][26022] Updated weights on worker 0-0, policy_version 819204 (0.00088) [2022-07-10 17:23:18,068][26022] Updated weights on worker 0-0, policy_version 819214 (0.00098) [2022-07-10 17:23:19,380][25689] Fps is (10 sec: 5627.0, 60 sec: 5489.2, 300 sec: 5521.1). Total num frames: 838881280. Throughput: 0: 5703.2. Samples: 838884258. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:19,381][25689] Avg episode reward: [(0, '-2.105')] [2022-07-10 17:23:20,004][26022] Updated weights on worker 0-0, policy_version 819224 (0.00092) [2022-07-10 17:23:21,765][26022] Updated weights on worker 0-0, policy_version 819234 (0.00083) [2022-07-10 17:23:23,690][26022] Updated weights on worker 0-0, policy_version 819244 (0.00087) [2022-07-10 17:23:24,384][25689] Fps is (10 sec: 5627.2, 60 sec: 5558.1, 300 sec: 5525.9). Total num frames: 838909952. Throughput: 0: 5831.2. Samples: 838918058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:24,384][25689] Avg episode reward: [(0, '-3.114')] [2022-07-10 17:23:25,475][26022] Updated weights on worker 0-0, policy_version 819254 (0.00096) [2022-07-10 17:23:27,166][26022] Updated weights on worker 0-0, policy_version 819264 (0.00078) [2022-07-10 17:23:29,303][26022] Updated weights on worker 0-0, policy_version 819274 (0.00085) [2022-07-10 17:23:29,497][25689] Fps is (10 sec: 5465.5, 60 sec: 5486.5, 300 sec: 5523.8). Total num frames: 838936576. Throughput: 0: 5016.2. Samples: 838934682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:29,498][25689] Avg episode reward: [(0, '-2.726')] [2022-07-10 17:23:30,942][26022] Updated weights on worker 0-0, policy_version 819284 (0.00091) [2022-07-10 17:23:32,882][26022] Updated weights on worker 0-0, policy_version 819294 (0.00087) [2022-07-10 17:23:34,515][25689] Fps is (10 sec: 5457.9, 60 sec: 5521.7, 300 sec: 5527.2). Total num frames: 838965248. Throughput: 0: 5825.2. Samples: 838967846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:34,515][25689] Avg episode reward: [(0, '-1.747')] [2022-07-10 17:23:34,729][26022] Updated weights on worker 0-0, policy_version 819304 (0.00088) [2022-07-10 17:23:36,535][26022] Updated weights on worker 0-0, policy_version 819314 (0.00089) [2022-07-10 17:23:38,436][26022] Updated weights on worker 0-0, policy_version 819324 (0.00089) [2022-07-10 17:23:39,559][25689] Fps is (10 sec: 5699.6, 60 sec: 5503.6, 300 sec: 5523.4). Total num frames: 838993920. Throughput: 0: 5783.5. Samples: 839000960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:39,559][25689] Avg episode reward: [(0, '-2.087')] [2022-07-10 17:23:40,103][26022] Updated weights on worker 0-0, policy_version 819334 (0.00087) [2022-07-10 17:23:41,911][26022] Updated weights on worker 0-0, policy_version 819344 (0.00087) [2022-07-10 17:23:44,165][26022] Updated weights on worker 0-0, policy_version 819354 (0.00083) [2022-07-10 17:23:44,604][25689] Fps is (10 sec: 5480.6, 60 sec: 5517.6, 300 sec: 5520.0). Total num frames: 839020544. Throughput: 0: 4934.0. Samples: 839017828. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:44,607][25689] Avg episode reward: [(0, '-1.639')] [2022-07-10 17:23:45,537][26022] Updated weights on worker 0-0, policy_version 819364 (0.00081) [2022-07-10 17:23:47,882][26022] Updated weights on worker 0-0, policy_version 819374 (0.00086) [2022-07-10 17:23:49,175][26022] Updated weights on worker 0-0, policy_version 819384 (0.00087) [2022-07-10 17:23:49,719][25689] Fps is (10 sec: 5643.8, 60 sec: 5530.6, 300 sec: 5528.2). Total num frames: 839051264. Throughput: 0: 5766.1. Samples: 839051282. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:49,720][25689] Avg episode reward: [(0, '0.509')] [2022-07-10 17:23:51,441][26022] Updated weights on worker 0-0, policy_version 819394 (0.00085) [2022-07-10 17:23:53,138][26022] Updated weights on worker 0-0, policy_version 819404 (0.00087) [2022-07-10 17:23:54,734][25689] Fps is (10 sec: 5661.2, 60 sec: 5517.2, 300 sec: 5521.2). Total num frames: 839077888. Throughput: 0: 5770.1. Samples: 839084510. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:54,734][25689] Avg episode reward: [(0, '-0.029')] [2022-07-10 17:23:54,913][26022] Updated weights on worker 0-0, policy_version 819414 (0.00082) [2022-07-10 17:23:56,812][26022] Updated weights on worker 0-0, policy_version 819424 (0.00092) [2022-07-10 17:23:58,558][26022] Updated weights on worker 0-0, policy_version 819434 (0.00087) [2022-07-10 17:23:59,782][25689] Fps is (10 sec: 5393.6, 60 sec: 5547.2, 300 sec: 5531.1). Total num frames: 839105536. Throughput: 0: 4961.3. Samples: 839101292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:23:59,782][25689] Avg episode reward: [(0, '0.034')] [2022-07-10 17:24:00,569][26022] Updated weights on worker 0-0, policy_version 819444 (0.00083) [2022-07-10 17:24:02,550][26022] Updated weights on worker 0-0, policy_version 819454 (0.00087) [2022-07-10 17:24:04,606][26022] Updated weights on worker 0-0, policy_version 819464 (0.00090) [2022-07-10 17:24:04,839][25689] Fps is (10 sec: 5269.4, 60 sec: 5510.2, 300 sec: 5518.2). Total num frames: 839131136. Throughput: 0: 5675.3. Samples: 839132664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:04,839][25689] Avg episode reward: [(0, '-1.416')] [2022-07-10 17:24:06,255][26022] Updated weights on worker 0-0, policy_version 819474 (0.00085) [2022-07-10 17:24:08,335][26022] Updated weights on worker 0-0, policy_version 819484 (0.00102) [2022-07-10 17:24:09,970][25689] Fps is (10 sec: 5427.2, 60 sec: 5542.5, 300 sec: 5519.4). Total num frames: 839160832. Throughput: 0: 5669.4. Samples: 839166094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:09,972][25689] Avg episode reward: [(0, '-1.774')] [2022-07-10 17:24:10,054][26022] Updated weights on worker 0-0, policy_version 819494 (0.00082) [2022-07-10 17:24:11,879][26022] Updated weights on worker 0-0, policy_version 819504 (0.00082) [2022-07-10 17:24:13,774][26022] Updated weights on worker 0-0, policy_version 819514 (0.00081) [2022-07-10 17:24:15,020][25689] Fps is (10 sec: 5732.6, 60 sec: 5538.3, 300 sec: 5525.5). Total num frames: 839189504. Throughput: 0: 4855.9. Samples: 839183020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:15,023][25689] Avg episode reward: [(0, '-1.860')] [2022-07-10 17:24:15,363][26022] Updated weights on worker 0-0, policy_version 819524 (0.00086) [2022-07-10 17:24:17,452][26022] Updated weights on worker 0-0, policy_version 819534 (0.00087) [2022-07-10 17:24:19,120][26022] Updated weights on worker 0-0, policy_version 819544 (0.00093) [2022-07-10 17:24:20,036][25689] Fps is (10 sec: 5595.5, 60 sec: 5537.4, 300 sec: 5528.7). Total num frames: 839217152. Throughput: 0: 5703.2. Samples: 839216804. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:20,037][25689] Avg episode reward: [(0, '-3.376')] [2022-07-10 17:24:21,078][26022] Updated weights on worker 0-0, policy_version 819554 (0.00094) [2022-07-10 17:24:22,869][26022] Updated weights on worker 0-0, policy_version 819564 (0.00098) [2022-07-10 17:24:24,633][26022] Updated weights on worker 0-0, policy_version 819574 (0.00086) [2022-07-10 17:24:25,086][25689] Fps is (10 sec: 5493.7, 60 sec: 5516.2, 300 sec: 5522.0). Total num frames: 839244800. Throughput: 0: 5818.6. Samples: 839250474. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:25,086][25689] Avg episode reward: [(0, '-3.015')] [2022-07-10 17:24:26,490][26022] Updated weights on worker 0-0, policy_version 819584 (0.00100) [2022-07-10 17:24:28,476][26022] Updated weights on worker 0-0, policy_version 819594 (0.00086) [2022-07-10 17:24:30,167][25689] Fps is (10 sec: 5659.7, 60 sec: 5569.8, 300 sec: 5530.8). Total num frames: 839274496. Throughput: 0: 4992.7. Samples: 839266932. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:30,173][25689] Avg episode reward: [(0, '-2.876')] [2022-07-10 17:24:30,175][26022] Updated weights on worker 0-0, policy_version 819604 (0.00086) [2022-07-10 17:24:32,230][26022] Updated weights on worker 0-0, policy_version 819614 (0.00100) [2022-07-10 17:24:33,873][26022] Updated weights on worker 0-0, policy_version 819624 (0.00089) [2022-07-10 17:24:35,250][25689] Fps is (10 sec: 5641.8, 60 sec: 5547.0, 300 sec: 5529.6). Total num frames: 839302144. Throughput: 0: 5794.6. Samples: 839300242. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:35,252][25689] Avg episode reward: [(0, '-3.665')] [2022-07-10 17:24:35,854][26022] Updated weights on worker 0-0, policy_version 819634 (0.00088) [2022-07-10 17:24:37,457][26022] Updated weights on worker 0-0, policy_version 819644 (0.00094) [2022-07-10 17:24:39,531][26022] Updated weights on worker 0-0, policy_version 819654 (0.00083) [2022-07-10 17:24:40,312][25689] Fps is (10 sec: 5551.8, 60 sec: 5545.3, 300 sec: 5528.9). Total num frames: 839330816. Throughput: 0: 5782.2. Samples: 839334046. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:40,313][25689] Avg episode reward: [(0, '-2.931')] [2022-07-10 17:24:41,287][26022] Updated weights on worker 0-0, policy_version 819664 (0.00101) [2022-07-10 17:24:42,907][26022] Updated weights on worker 0-0, policy_version 819674 (0.00088) [2022-07-10 17:24:45,001][26022] Updated weights on worker 0-0, policy_version 819684 (0.00087) [2022-07-10 17:24:45,335][25689] Fps is (10 sec: 5584.5, 60 sec: 5564.2, 300 sec: 5529.5). Total num frames: 839358464. Throughput: 0: 5800.4. Samples: 839367928. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:45,336][25689] Avg episode reward: [(0, '-2.837')] [2022-07-10 17:24:46,524][26022] Updated weights on worker 0-0, policy_version 819694 (0.00088) [2022-07-10 17:24:48,613][26022] Updated weights on worker 0-0, policy_version 819704 (0.00091) [2022-07-10 17:24:50,433][25689] Fps is (10 sec: 5463.2, 60 sec: 5515.1, 300 sec: 5524.4). Total num frames: 839386112. Throughput: 0: 5805.1. Samples: 839384578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:50,434][25689] Avg episode reward: [(0, '-2.006')] [2022-07-10 17:24:50,479][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:24:50,492][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000819714_839387136.pth [2022-07-10 17:24:50,492][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000817766_837392384.pth [2022-07-10 17:24:50,495][26022] Updated weights on worker 0-0, policy_version 819714 (0.00090) [2022-07-10 17:24:52,162][26022] Updated weights on worker 0-0, policy_version 819724 (0.00091) [2022-07-10 17:24:54,080][26022] Updated weights on worker 0-0, policy_version 819734 (0.00093) [2022-07-10 17:24:55,457][25689] Fps is (10 sec: 5665.5, 60 sec: 5565.0, 300 sec: 5534.6). Total num frames: 839415808. Throughput: 0: 5832.4. Samples: 839418096. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:24:55,457][25689] Avg episode reward: [(0, '-1.608')] [2022-07-10 17:24:55,767][26022] Updated weights on worker 0-0, policy_version 819744 (0.00087) [2022-07-10 17:24:57,747][26022] Updated weights on worker 0-0, policy_version 819754 (0.00104) [2022-07-10 17:24:59,499][26022] Updated weights on worker 0-0, policy_version 819764 (0.00090) [2022-07-10 17:25:00,480][25689] Fps is (10 sec: 5605.8, 60 sec: 5550.3, 300 sec: 5534.3). Total num frames: 839442432. Throughput: 0: 5827.9. Samples: 839451584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:00,481][25689] Avg episode reward: [(0, '-0.807')] [2022-07-10 17:25:01,586][26022] Updated weights on worker 0-0, policy_version 819774 (0.00094) [2022-07-10 17:25:03,573][26022] Updated weights on worker 0-0, policy_version 819784 (0.00081) [2022-07-10 17:25:05,494][25689] Fps is (10 sec: 5203.1, 60 sec: 5554.3, 300 sec: 5525.1). Total num frames: 839468032. Throughput: 0: 4879.6. Samples: 839466294. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:05,494][25689] Avg episode reward: [(0, '-0.718')] [2022-07-10 17:25:05,515][26022] Updated weights on worker 0-0, policy_version 819794 (0.00089) [2022-07-10 17:25:07,247][26022] Updated weights on worker 0-0, policy_version 819804 (0.00092) [2022-07-10 17:25:09,240][26022] Updated weights on worker 0-0, policy_version 819814 (0.00091) [2022-07-10 17:25:10,640][25689] Fps is (10 sec: 5442.8, 60 sec: 5553.0, 300 sec: 5533.2). Total num frames: 839497728. Throughput: 0: 5696.1. Samples: 839499674. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:10,640][25689] Avg episode reward: [(0, '-0.693')] [2022-07-10 17:25:10,918][26022] Updated weights on worker 0-0, policy_version 819824 (0.00087) [2022-07-10 17:25:12,675][26022] Updated weights on worker 0-0, policy_version 819834 (0.00085) [2022-07-10 17:25:14,635][26022] Updated weights on worker 0-0, policy_version 819844 (0.00084) [2022-07-10 17:25:15,690][25689] Fps is (10 sec: 5724.5, 60 sec: 5553.0, 300 sec: 5529.3). Total num frames: 839526400. Throughput: 0: 5700.0. Samples: 839533426. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:15,690][25689] Avg episode reward: [(0, '-0.952')] [2022-07-10 17:25:16,389][26022] Updated weights on worker 0-0, policy_version 819854 (0.00091) [2022-07-10 17:25:18,113][26022] Updated weights on worker 0-0, policy_version 819864 (0.00084) [2022-07-10 17:25:20,064][26022] Updated weights on worker 0-0, policy_version 819874 (0.00092) [2022-07-10 17:25:20,705][25689] Fps is (10 sec: 5595.4, 60 sec: 5553.0, 300 sec: 5529.1). Total num frames: 839554048. Throughput: 0: 4879.3. Samples: 839550268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:20,708][25689] Avg episode reward: [(0, '-1.466')] [2022-07-10 17:25:21,899][26022] Updated weights on worker 0-0, policy_version 819884 (0.00082) [2022-07-10 17:25:23,694][26022] Updated weights on worker 0-0, policy_version 819894 (0.00082) [2022-07-10 17:25:25,598][26022] Updated weights on worker 0-0, policy_version 819904 (0.00084) [2022-07-10 17:25:25,715][25689] Fps is (10 sec: 5515.7, 60 sec: 5556.7, 300 sec: 5527.5). Total num frames: 839581696. Throughput: 0: 5812.8. Samples: 839583838. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:25,717][25689] Avg episode reward: [(0, '-2.192')] [2022-07-10 17:25:27,422][26022] Updated weights on worker 0-0, policy_version 819914 (0.00087) [2022-07-10 17:25:29,193][26022] Updated weights on worker 0-0, policy_version 819924 (0.00113) [2022-07-10 17:25:30,800][25689] Fps is (10 sec: 5477.6, 60 sec: 5522.6, 300 sec: 5526.2). Total num frames: 839609344. Throughput: 0: 5844.7. Samples: 839617506. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:30,802][25689] Avg episode reward: [(0, '-2.268')] [2022-07-10 17:25:31,164][26022] Updated weights on worker 0-0, policy_version 819934 (0.00086) [2022-07-10 17:25:32,855][26022] Updated weights on worker 0-0, policy_version 819944 (0.00099) [2022-07-10 17:25:34,805][26022] Updated weights on worker 0-0, policy_version 819954 (0.00083) [2022-07-10 17:25:35,810][25689] Fps is (10 sec: 5680.5, 60 sec: 5563.0, 300 sec: 5533.8). Total num frames: 839639040. Throughput: 0: 5007.7. Samples: 839634182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:25:35,812][25689] Avg episode reward: [(0, '-1.634')] [2022-07-10 17:25:36,445][26022] Updated weights on worker 0-0, policy_version 819964 (0.00081) [2022-07-10 17:25:38,524][26022] Updated weights on worker 0-0, policy_version 819974 (0.00083) [2022-07-10 17:25:40,335][26022] Updated weights on worker 0-0, policy_version 819984 (0.00086) [2022-07-10 17:25:40,827][25689] Fps is (10 sec: 5616.9, 60 sec: 5533.3, 300 sec: 5530.7). Total num frames: 839665664. Throughput: 0: 5835.2. Samples: 839667684. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:25:40,829][25689] Avg episode reward: [(0, '-1.574')] [2022-07-10 17:25:42,051][26022] Updated weights on worker 0-0, policy_version 819994 (0.00086) [2022-07-10 17:25:43,931][26022] Updated weights on worker 0-0, policy_version 820004 (0.00090) [2022-07-10 17:25:45,651][26022] Updated weights on worker 0-0, policy_version 820014 (0.00090) [2022-07-10 17:25:45,837][25689] Fps is (10 sec: 5514.4, 60 sec: 5551.4, 300 sec: 5528.0). Total num frames: 839694336. Throughput: 0: 5842.4. Samples: 839701402. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:25:45,838][25689] Avg episode reward: [(0, '-2.409')] [2022-07-10 17:25:47,639][26022] Updated weights on worker 0-0, policy_version 820024 (0.00091) [2022-07-10 17:25:49,530][26022] Updated weights on worker 0-0, policy_version 820034 (0.00088) [2022-07-10 17:25:50,908][25689] Fps is (10 sec: 5586.7, 60 sec: 5553.9, 300 sec: 5530.3). Total num frames: 839721984. Throughput: 0: 5012.5. Samples: 839718298. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:25:50,909][25689] Avg episode reward: [(0, '-2.226')] [2022-07-10 17:25:51,227][26022] Updated weights on worker 0-0, policy_version 820044 (0.00093) [2022-07-10 17:25:53,088][26022] Updated weights on worker 0-0, policy_version 820054 (0.00090) [2022-07-10 17:25:54,828][26022] Updated weights on worker 0-0, policy_version 820064 (0.00082) [2022-07-10 17:25:55,941][25689] Fps is (10 sec: 5574.0, 60 sec: 5536.1, 300 sec: 5537.3). Total num frames: 839750656. Throughput: 0: 5845.8. Samples: 839751868. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:25:55,942][25689] Avg episode reward: [(0, '-2.359')] [2022-07-10 17:25:56,810][26022] Updated weights on worker 0-0, policy_version 820074 (0.00083) [2022-07-10 17:25:58,426][26022] Updated weights on worker 0-0, policy_version 820084 (0.00093) [2022-07-10 17:26:00,336][26022] Updated weights on worker 0-0, policy_version 820094 (0.00086) [2022-07-10 17:26:00,953][25689] Fps is (10 sec: 5708.7, 60 sec: 5571.0, 300 sec: 5545.3). Total num frames: 839779328. Throughput: 0: 5866.5. Samples: 839785754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:00,954][25689] Avg episode reward: [(0, '-2.548')] [2022-07-10 17:26:02,535][26022] Updated weights on worker 0-0, policy_version 820104 (0.00095) [2022-07-10 17:26:04,537][26022] Updated weights on worker 0-0, policy_version 820114 (0.00083) [2022-07-10 17:26:05,989][25689] Fps is (10 sec: 5401.7, 60 sec: 5569.0, 300 sec: 5540.2). Total num frames: 839804928. Throughput: 0: 4913.9. Samples: 839800422. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:05,989][25689] Avg episode reward: [(0, '-2.263')] [2022-07-10 17:26:06,273][26022] Updated weights on worker 0-0, policy_version 820124 (0.00086) [2022-07-10 17:26:08,120][26022] Updated weights on worker 0-0, policy_version 820134 (0.00092) [2022-07-10 17:26:09,865][26022] Updated weights on worker 0-0, policy_version 820144 (0.00086) [2022-07-10 17:26:11,065][25689] Fps is (10 sec: 5468.6, 60 sec: 5575.4, 300 sec: 5539.0). Total num frames: 839834624. Throughput: 0: 5741.4. Samples: 839834024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:11,065][25689] Avg episode reward: [(0, '-1.828')] [2022-07-10 17:26:11,780][26022] Updated weights on worker 0-0, policy_version 820154 (0.00089) [2022-07-10 17:26:13,512][26022] Updated weights on worker 0-0, policy_version 820164 (0.00086) [2022-07-10 17:26:15,355][26022] Updated weights on worker 0-0, policy_version 820174 (0.00091) [2022-07-10 17:26:16,096][25689] Fps is (10 sec: 5572.1, 60 sec: 5543.3, 300 sec: 5535.5). Total num frames: 839861248. Throughput: 0: 5758.3. Samples: 839867924. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:16,097][25689] Avg episode reward: [(0, '-1.173')] [2022-07-10 17:26:17,106][26022] Updated weights on worker 0-0, policy_version 820184 (0.00093) [2022-07-10 17:26:19,043][26022] Updated weights on worker 0-0, policy_version 820194 (0.00093) [2022-07-10 17:26:20,710][26022] Updated weights on worker 0-0, policy_version 820204 (0.00087) [2022-07-10 17:26:21,117][25689] Fps is (10 sec: 5500.8, 60 sec: 5559.7, 300 sec: 5539.2). Total num frames: 839889920. Throughput: 0: 4899.7. Samples: 839884550. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:21,117][25689] Avg episode reward: [(0, '-0.940')] [2022-07-10 17:26:22,743][26022] Updated weights on worker 0-0, policy_version 820214 (0.00086) [2022-07-10 17:26:24,371][26022] Updated weights on worker 0-0, policy_version 820224 (0.00092) [2022-07-10 17:26:26,150][25689] Fps is (10 sec: 5601.8, 60 sec: 5557.6, 300 sec: 5537.6). Total num frames: 839917568. Throughput: 0: 5853.4. Samples: 839918434. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:26,151][25689] Avg episode reward: [(0, '-0.634')] [2022-07-10 17:26:26,359][26022] Updated weights on worker 0-0, policy_version 820234 (0.00084) [2022-07-10 17:26:28,095][26022] Updated weights on worker 0-0, policy_version 820244 (0.00085) [2022-07-10 17:26:29,883][26022] Updated weights on worker 0-0, policy_version 820254 (0.00085) [2022-07-10 17:26:31,248][25689] Fps is (10 sec: 5559.0, 60 sec: 5573.3, 300 sec: 5540.5). Total num frames: 839946240. Throughput: 0: 5844.0. Samples: 839951978. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:31,250][25689] Avg episode reward: [(0, '-0.467')] [2022-07-10 17:26:31,839][26022] Updated weights on worker 0-0, policy_version 820264 (0.00085) [2022-07-10 17:26:33,694][26022] Updated weights on worker 0-0, policy_version 820274 (0.00088) [2022-07-10 17:26:35,503][26022] Updated weights on worker 0-0, policy_version 820284 (0.00089) [2022-07-10 17:26:36,349][25689] Fps is (10 sec: 5722.7, 60 sec: 5564.9, 300 sec: 5542.2). Total num frames: 839975936. Throughput: 0: 4971.6. Samples: 839968614. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:36,350][25689] Avg episode reward: [(0, '-1.854')] [2022-07-10 17:26:37,402][26022] Updated weights on worker 0-0, policy_version 820294 (0.00085) [2022-07-10 17:26:39,157][26022] Updated weights on worker 0-0, policy_version 820304 (0.00105) [2022-07-10 17:26:41,140][26022] Updated weights on worker 0-0, policy_version 820314 (0.00090) [2022-07-10 17:26:41,376][25689] Fps is (10 sec: 5561.1, 60 sec: 5564.0, 300 sec: 5538.4). Total num frames: 840002560. Throughput: 0: 5792.6. Samples: 840001902. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:41,376][25689] Avg episode reward: [(0, '-1.951')] [2022-07-10 17:26:42,847][26022] Updated weights on worker 0-0, policy_version 820324 (0.00089) [2022-07-10 17:26:44,763][26022] Updated weights on worker 0-0, policy_version 820334 (0.00090) [2022-07-10 17:26:46,424][25689] Fps is (10 sec: 5590.1, 60 sec: 5577.5, 300 sec: 5543.6). Total num frames: 840032256. Throughput: 0: 5764.8. Samples: 840035312. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:46,425][25689] Avg episode reward: [(0, '-1.429')] [2022-07-10 17:26:46,428][26022] Updated weights on worker 0-0, policy_version 820344 (0.00092) [2022-07-10 17:26:48,430][26022] Updated weights on worker 0-0, policy_version 820354 (0.00089) [2022-07-10 17:26:50,228][26022] Updated weights on worker 0-0, policy_version 820364 (0.00086) [2022-07-10 17:26:50,532][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:26:50,547][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000820366_840054784.pth [2022-07-10 17:26:50,547][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000818414_838055936.pth [2022-07-10 17:26:51,565][25689] Fps is (10 sec: 5527.2, 60 sec: 5554.1, 300 sec: 5539.3). Total num frames: 840058880. Throughput: 0: 5748.8. Samples: 840068778. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:51,566][25689] Avg episode reward: [(0, '-0.626')] [2022-07-10 17:26:52,119][26022] Updated weights on worker 0-0, policy_version 820374 (0.00089) [2022-07-10 17:26:53,833][26022] Updated weights on worker 0-0, policy_version 820384 (0.00093) [2022-07-10 17:26:55,771][26022] Updated weights on worker 0-0, policy_version 820394 (0.00090) [2022-07-10 17:26:56,575][25689] Fps is (10 sec: 5447.6, 60 sec: 5556.3, 300 sec: 5542.9). Total num frames: 840087552. Throughput: 0: 5788.5. Samples: 840085690. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:26:56,576][25689] Avg episode reward: [(0, '-0.403')] [2022-07-10 17:26:57,333][26022] Updated weights on worker 0-0, policy_version 820404 (0.00093) [2022-07-10 17:26:59,475][26022] Updated weights on worker 0-0, policy_version 820414 (0.00084) [2022-07-10 17:27:01,249][26022] Updated weights on worker 0-0, policy_version 820424 (0.00086) [2022-07-10 17:27:01,629][25689] Fps is (10 sec: 5698.0, 60 sec: 5552.4, 300 sec: 5545.5). Total num frames: 840116224. Throughput: 0: 5802.0. Samples: 840119412. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:01,630][25689] Avg episode reward: [(0, '-0.042')] [2022-07-10 17:27:03,446][26022] Updated weights on worker 0-0, policy_version 820434 (0.00090) [2022-07-10 17:27:05,340][26022] Updated weights on worker 0-0, policy_version 820444 (0.00093) [2022-07-10 17:27:06,685][25689] Fps is (10 sec: 5469.5, 60 sec: 5567.4, 300 sec: 5546.0). Total num frames: 840142848. Throughput: 0: 5705.4. Samples: 840150906. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:06,685][25689] Avg episode reward: [(0, '0.957')] [2022-07-10 17:27:07,089][26022] Updated weights on worker 0-0, policy_version 820454 (0.00096) [2022-07-10 17:27:08,840][26022] Updated weights on worker 0-0, policy_version 820464 (0.00090) [2022-07-10 17:27:11,000][26022] Updated weights on worker 0-0, policy_version 820474 (0.00089) [2022-07-10 17:27:11,741][25689] Fps is (10 sec: 5367.1, 60 sec: 5535.4, 300 sec: 5546.1). Total num frames: 840170496. Throughput: 0: 4901.7. Samples: 840167674. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:11,742][25689] Avg episode reward: [(0, '0.996')] [2022-07-10 17:27:12,478][26022] Updated weights on worker 0-0, policy_version 820484 (0.00096) [2022-07-10 17:27:14,552][26022] Updated weights on worker 0-0, policy_version 820494 (0.00085) [2022-07-10 17:27:15,997][26022] Updated weights on worker 0-0, policy_version 820504 (0.00080) [2022-07-10 17:27:16,783][25689] Fps is (10 sec: 5475.8, 60 sec: 5551.4, 300 sec: 5535.2). Total num frames: 840198144. Throughput: 0: 5722.3. Samples: 840201326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:16,783][25689] Avg episode reward: [(0, '0.897')] [2022-07-10 17:27:18,115][26022] Updated weights on worker 0-0, policy_version 820514 (0.00092) [2022-07-10 17:27:19,967][26022] Updated weights on worker 0-0, policy_version 820524 (0.00091) [2022-07-10 17:27:21,642][26022] Updated weights on worker 0-0, policy_version 820534 (0.00090) [2022-07-10 17:27:21,876][25689] Fps is (10 sec: 5658.0, 60 sec: 5561.6, 300 sec: 5551.0). Total num frames: 840227840. Throughput: 0: 5702.1. Samples: 840234862. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:21,877][25689] Avg episode reward: [(0, '0.665')] [2022-07-10 17:27:23,540][26022] Updated weights on worker 0-0, policy_version 820544 (0.00094) [2022-07-10 17:27:25,495][26022] Updated weights on worker 0-0, policy_version 820554 (0.00082) [2022-07-10 17:27:26,922][25689] Fps is (10 sec: 5655.8, 60 sec: 5560.5, 300 sec: 5541.2). Total num frames: 840255488. Throughput: 0: 4982.0. Samples: 840251728. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:26,922][25689] Avg episode reward: [(0, '0.288')] [2022-07-10 17:27:27,249][26022] Updated weights on worker 0-0, policy_version 820564 (0.00086) [2022-07-10 17:27:29,152][26022] Updated weights on worker 0-0, policy_version 820574 (0.00094) [2022-07-10 17:27:30,795][26022] Updated weights on worker 0-0, policy_version 820584 (0.00080) [2022-07-10 17:27:31,971][25689] Fps is (10 sec: 5478.0, 60 sec: 5548.1, 300 sec: 5544.3). Total num frames: 840283136. Throughput: 0: 5811.6. Samples: 840285238. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:31,972][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 17:27:32,685][26022] Updated weights on worker 0-0, policy_version 820594 (0.00098) [2022-07-10 17:27:34,638][26022] Updated weights on worker 0-0, policy_version 820604 (0.00092) [2022-07-10 17:27:36,517][26022] Updated weights on worker 0-0, policy_version 820614 (0.00093) [2022-07-10 17:27:36,990][25689] Fps is (10 sec: 5492.0, 60 sec: 5521.8, 300 sec: 5537.6). Total num frames: 840310784. Throughput: 0: 5796.7. Samples: 840318464. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:36,992][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 17:27:38,383][26022] Updated weights on worker 0-0, policy_version 820624 (0.00096) [2022-07-10 17:27:40,246][26022] Updated weights on worker 0-0, policy_version 820634 (0.00089) [2022-07-10 17:27:41,822][26022] Updated weights on worker 0-0, policy_version 820644 (0.00087) [2022-07-10 17:27:42,011][25689] Fps is (10 sec: 5711.4, 60 sec: 5573.0, 300 sec: 5551.2). Total num frames: 840340480. Throughput: 0: 4975.9. Samples: 840335048. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:42,011][25689] Avg episode reward: [(0, '-1.137')] [2022-07-10 17:27:43,999][26022] Updated weights on worker 0-0, policy_version 820654 (0.00082) [2022-07-10 17:27:45,381][26022] Updated weights on worker 0-0, policy_version 820664 (0.00091) [2022-07-10 17:27:47,033][25689] Fps is (10 sec: 5506.2, 60 sec: 5507.9, 300 sec: 5538.4). Total num frames: 840366080. Throughput: 0: 5820.2. Samples: 840368778. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:47,034][25689] Avg episode reward: [(0, '-0.742')] [2022-07-10 17:27:47,741][26022] Updated weights on worker 0-0, policy_version 820674 (0.00093) [2022-07-10 17:27:49,091][26022] Updated weights on worker 0-0, policy_version 820684 (0.00098) [2022-07-10 17:27:51,325][26022] Updated weights on worker 0-0, policy_version 820694 (0.00050) [2022-07-10 17:27:52,119][25689] Fps is (10 sec: 5470.6, 60 sec: 5563.6, 300 sec: 5544.7). Total num frames: 840395776. Throughput: 0: 5830.1. Samples: 840402704. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:52,119][25689] Avg episode reward: [(0, '-0.720')] [2022-07-10 17:27:52,811][26022] Updated weights on worker 0-0, policy_version 820704 (0.00105) [2022-07-10 17:27:54,833][26022] Updated weights on worker 0-0, policy_version 820714 (0.00088) [2022-07-10 17:27:56,456][26022] Updated weights on worker 0-0, policy_version 820724 (0.00083) [2022-07-10 17:27:57,161][25689] Fps is (10 sec: 5763.0, 60 sec: 5560.6, 300 sec: 5554.3). Total num frames: 840424448. Throughput: 0: 5022.7. Samples: 840419774. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:27:57,161][25689] Avg episode reward: [(0, '-1.826')] [2022-07-10 17:27:58,359][26022] Updated weights on worker 0-0, policy_version 820734 (0.00089) [2022-07-10 17:28:00,166][26022] Updated weights on worker 0-0, policy_version 820744 (0.00085) [2022-07-10 17:28:02,187][25689] Fps is (10 sec: 5492.2, 60 sec: 5529.4, 300 sec: 5550.8). Total num frames: 840451072. Throughput: 0: 5866.3. Samples: 840453406. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:02,187][25689] Avg episode reward: [(0, '-1.405')] [2022-07-10 17:28:02,588][26022] Updated weights on worker 0-0, policy_version 820754 (0.00092) [2022-07-10 17:28:04,213][26022] Updated weights on worker 0-0, policy_version 820764 (0.00088) [2022-07-10 17:28:06,319][26022] Updated weights on worker 0-0, policy_version 820774 (0.00096) [2022-07-10 17:28:07,211][25689] Fps is (10 sec: 5400.0, 60 sec: 5549.2, 300 sec: 5552.5). Total num frames: 840478720. Throughput: 0: 5743.1. Samples: 840484664. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:07,212][25689] Avg episode reward: [(0, '-1.243')] [2022-07-10 17:28:07,699][26022] Updated weights on worker 0-0, policy_version 820784 (0.00090) [2022-07-10 17:28:09,819][26022] Updated weights on worker 0-0, policy_version 820794 (0.00079) [2022-07-10 17:28:11,467][26022] Updated weights on worker 0-0, policy_version 820804 (0.00101) [2022-07-10 17:28:12,291][25689] Fps is (10 sec: 5573.9, 60 sec: 5564.0, 300 sec: 5551.1). Total num frames: 840507392. Throughput: 0: 4896.3. Samples: 840501474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:12,291][25689] Avg episode reward: [(0, '-0.103')] [2022-07-10 17:28:13,522][26022] Updated weights on worker 0-0, policy_version 820814 (0.00087) [2022-07-10 17:28:15,189][26022] Updated weights on worker 0-0, policy_version 820824 (0.00089) [2022-07-10 17:28:17,044][26022] Updated weights on worker 0-0, policy_version 820834 (0.00090) [2022-07-10 17:28:17,351][25689] Fps is (10 sec: 5453.3, 60 sec: 5545.3, 300 sec: 5546.6). Total num frames: 840534016. Throughput: 0: 5717.1. Samples: 840535204. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:17,353][25689] Avg episode reward: [(0, '-1.743')] [2022-07-10 17:28:18,863][26022] Updated weights on worker 0-0, policy_version 820844 (0.00086) [2022-07-10 17:28:20,874][26022] Updated weights on worker 0-0, policy_version 820854 (0.00086) [2022-07-10 17:28:22,379][25689] Fps is (10 sec: 5582.9, 60 sec: 5551.4, 300 sec: 5549.6). Total num frames: 840563712. Throughput: 0: 5728.7. Samples: 840569080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:22,381][25689] Avg episode reward: [(0, '-1.257')] [2022-07-10 17:28:22,417][26022] Updated weights on worker 0-0, policy_version 820864 (0.00090) [2022-07-10 17:28:24,623][26022] Updated weights on worker 0-0, policy_version 820874 (0.00099) [2022-07-10 17:28:25,990][26022] Updated weights on worker 0-0, policy_version 820884 (0.00103) [2022-07-10 17:28:27,390][25689] Fps is (10 sec: 5610.0, 60 sec: 5537.6, 300 sec: 5551.5). Total num frames: 840590336. Throughput: 0: 5023.3. Samples: 840586030. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:27,391][25689] Avg episode reward: [(0, '-1.285')] [2022-07-10 17:28:28,291][26022] Updated weights on worker 0-0, policy_version 820894 (0.00090) [2022-07-10 17:28:29,789][26022] Updated weights on worker 0-0, policy_version 820904 (0.00083) [2022-07-10 17:28:31,865][26022] Updated weights on worker 0-0, policy_version 820914 (0.00085) [2022-07-10 17:28:32,457][25689] Fps is (10 sec: 5486.9, 60 sec: 5552.9, 300 sec: 5550.6). Total num frames: 840619008. Throughput: 0: 5839.7. Samples: 840619236. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:32,457][25689] Avg episode reward: [(0, '-1.739')] [2022-07-10 17:28:33,492][26022] Updated weights on worker 0-0, policy_version 820924 (0.00086) [2022-07-10 17:28:35,434][26022] Updated weights on worker 0-0, policy_version 820934 (0.00089) [2022-07-10 17:28:37,147][26022] Updated weights on worker 0-0, policy_version 820944 (0.00088) [2022-07-10 17:28:37,501][25689] Fps is (10 sec: 5671.3, 60 sec: 5567.5, 300 sec: 5550.6). Total num frames: 840647680. Throughput: 0: 5837.3. Samples: 840652828. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:37,502][25689] Avg episode reward: [(0, '-1.266')] [2022-07-10 17:28:39,084][26022] Updated weights on worker 0-0, policy_version 820954 (0.00092) [2022-07-10 17:28:40,793][26022] Updated weights on worker 0-0, policy_version 820964 (0.00090) [2022-07-10 17:28:42,508][25689] Fps is (10 sec: 5603.4, 60 sec: 5535.0, 300 sec: 5554.8). Total num frames: 840675328. Throughput: 0: 4996.9. Samples: 840669662. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:42,508][25689] Avg episode reward: [(0, '-1.226')] [2022-07-10 17:28:42,831][26022] Updated weights on worker 0-0, policy_version 820974 (0.00096) [2022-07-10 17:28:44,518][26022] Updated weights on worker 0-0, policy_version 820984 (0.00088) [2022-07-10 17:28:46,288][26022] Updated weights on worker 0-0, policy_version 820994 (0.00091) [2022-07-10 17:28:47,517][25689] Fps is (10 sec: 5520.9, 60 sec: 5570.0, 300 sec: 5546.4). Total num frames: 840702976. Throughput: 0: 5822.3. Samples: 840703216. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:47,518][25689] Avg episode reward: [(0, '-0.370')] [2022-07-10 17:28:48,087][26022] Updated weights on worker 0-0, policy_version 821004 (0.00089) [2022-07-10 17:28:50,039][26022] Updated weights on worker 0-0, policy_version 821014 (0.00092) [2022-07-10 17:28:50,628][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:28:50,639][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000821016_840720384.pth [2022-07-10 17:28:50,641][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000819064_838721536.pth [2022-07-10 17:28:51,848][26022] Updated weights on worker 0-0, policy_version 821024 (0.00082) [2022-07-10 17:28:52,579][25689] Fps is (10 sec: 5592.3, 60 sec: 5555.3, 300 sec: 5552.4). Total num frames: 840731648. Throughput: 0: 5828.1. Samples: 840736508. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:52,579][25689] Avg episode reward: [(0, '-0.303')] [2022-07-10 17:28:53,927][26022] Updated weights on worker 0-0, policy_version 821034 (0.00094) [2022-07-10 17:28:55,379][26022] Updated weights on worker 0-0, policy_version 821044 (0.00095) [2022-07-10 17:28:57,431][26022] Updated weights on worker 0-0, policy_version 821054 (0.00081) [2022-07-10 17:28:57,616][25689] Fps is (10 sec: 5576.9, 60 sec: 5538.8, 300 sec: 5552.6). Total num frames: 840759296. Throughput: 0: 5003.6. Samples: 840753472. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:28:57,616][25689] Avg episode reward: [(0, '-1.246')] [2022-07-10 17:28:59,176][26022] Updated weights on worker 0-0, policy_version 821064 (0.00094) [2022-07-10 17:29:01,118][26022] Updated weights on worker 0-0, policy_version 821074 (0.00086) [2022-07-10 17:29:02,645][25689] Fps is (10 sec: 5493.2, 60 sec: 5555.4, 300 sec: 5560.0). Total num frames: 840786944. Throughput: 0: 5831.3. Samples: 840787088. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:29:02,645][25689] Avg episode reward: [(0, '-1.222')] [2022-07-10 17:29:03,178][26022] Updated weights on worker 0-0, policy_version 821084 (0.00087) [2022-07-10 17:29:05,174][26022] Updated weights on worker 0-0, policy_version 821094 (0.00086) [2022-07-10 17:29:06,779][26022] Updated weights on worker 0-0, policy_version 821104 (0.00087) [2022-07-10 17:29:07,659][25689] Fps is (10 sec: 5302.0, 60 sec: 5522.5, 300 sec: 5548.5). Total num frames: 840812544. Throughput: 0: 5721.7. Samples: 840818460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:29:07,659][25689] Avg episode reward: [(0, '-1.106')] [2022-07-10 17:29:08,882][26022] Updated weights on worker 0-0, policy_version 821114 (0.00090) [2022-07-10 17:29:10,666][26022] Updated weights on worker 0-0, policy_version 821124 (0.00090) [2022-07-10 17:29:12,555][26022] Updated weights on worker 0-0, policy_version 821134 (0.00085) [2022-07-10 17:29:12,722][25689] Fps is (10 sec: 5385.8, 60 sec: 5524.1, 300 sec: 5548.2). Total num frames: 840841216. Throughput: 0: 4900.5. Samples: 840835220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-10 17:29:12,724][25689] Avg episode reward: [(0, '-0.136')] [2022-07-10 17:29:14,312][26022] Updated weights on worker 0-0, policy_version 821144 (0.00091) [2022-07-10 17:29:16,274][26022] Updated weights on worker 0-0, policy_version 821154 (0.00088) [2022-07-10 17:29:17,761][25689] Fps is (10 sec: 5676.1, 60 sec: 5559.8, 300 sec: 5551.2). Total num frames: 840869888. Throughput: 0: 5706.1. Samples: 840868424. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:17,763][25689] Avg episode reward: [(0, '-0.441')] [2022-07-10 17:29:18,219][26022] Updated weights on worker 0-0, policy_version 821164 (0.00086) [2022-07-10 17:29:20,011][26022] Updated weights on worker 0-0, policy_version 821174 (0.00088) [2022-07-10 17:29:21,619][26022] Updated weights on worker 0-0, policy_version 821184 (0.00088) [2022-07-10 17:29:22,847][25689] Fps is (10 sec: 5562.2, 60 sec: 5520.6, 300 sec: 5550.5). Total num frames: 840897536. Throughput: 0: 5676.6. Samples: 840901768. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:22,849][25689] Avg episode reward: [(0, '-1.163')] [2022-07-10 17:29:23,614][26022] Updated weights on worker 0-0, policy_version 821194 (0.00086) [2022-07-10 17:29:25,371][26022] Updated weights on worker 0-0, policy_version 821204 (0.00093) [2022-07-10 17:29:27,460][26022] Updated weights on worker 0-0, policy_version 821214 (0.00092) [2022-07-10 17:29:27,883][25689] Fps is (10 sec: 5564.5, 60 sec: 5552.3, 300 sec: 5548.0). Total num frames: 840926208. Throughput: 0: 5762.1. Samples: 840934992. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:27,883][25689] Avg episode reward: [(0, '-1.027')] [2022-07-10 17:29:29,123][26022] Updated weights on worker 0-0, policy_version 821224 (0.00097) [2022-07-10 17:29:30,942][26022] Updated weights on worker 0-0, policy_version 821234 (0.00086) [2022-07-10 17:29:32,679][26022] Updated weights on worker 0-0, policy_version 821244 (0.00089) [2022-07-10 17:29:32,976][25689] Fps is (10 sec: 5661.4, 60 sec: 5549.8, 300 sec: 5551.2). Total num frames: 840954880. Throughput: 0: 5743.9. Samples: 840951558. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:32,977][25689] Avg episode reward: [(0, '-0.916')] [2022-07-10 17:29:34,956][26022] Updated weights on worker 0-0, policy_version 821254 (0.00091) [2022-07-10 17:29:36,396][26022] Updated weights on worker 0-0, policy_version 821264 (0.00087) [2022-07-10 17:29:38,002][25689] Fps is (10 sec: 5464.6, 60 sec: 5517.7, 300 sec: 5545.0). Total num frames: 840981504. Throughput: 0: 5739.7. Samples: 840984596. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:38,002][25689] Avg episode reward: [(0, '-0.587')] [2022-07-10 17:29:38,799][26022] Updated weights on worker 0-0, policy_version 821274 (0.00074) [2022-07-10 17:29:40,268][26022] Updated weights on worker 0-0, policy_version 821284 (0.00086) [2022-07-10 17:29:42,340][26022] Updated weights on worker 0-0, policy_version 821294 (0.00091) [2022-07-10 17:29:43,072][25689] Fps is (10 sec: 5476.9, 60 sec: 5528.8, 300 sec: 5547.5). Total num frames: 841010176. Throughput: 0: 5764.9. Samples: 841018364. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:43,073][25689] Avg episode reward: [(0, '-0.680')] [2022-07-10 17:29:43,941][26022] Updated weights on worker 0-0, policy_version 821304 (0.00106) [2022-07-10 17:29:45,710][26022] Updated weights on worker 0-0, policy_version 821314 (0.00499) [2022-07-10 17:29:47,494][26022] Updated weights on worker 0-0, policy_version 821324 (0.00082) [2022-07-10 17:29:48,089][25689] Fps is (10 sec: 5685.0, 60 sec: 5545.0, 300 sec: 5552.5). Total num frames: 841038848. Throughput: 0: 4962.3. Samples: 841035260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:48,089][25689] Avg episode reward: [(0, '-0.459')] [2022-07-10 17:29:49,466][26022] Updated weights on worker 0-0, policy_version 821334 (0.00092) [2022-07-10 17:29:51,176][26022] Updated weights on worker 0-0, policy_version 821344 (0.00092) [2022-07-10 17:29:53,071][26022] Updated weights on worker 0-0, policy_version 821354 (0.00096) [2022-07-10 17:29:53,131][25689] Fps is (10 sec: 5599.0, 60 sec: 5529.9, 300 sec: 5545.3). Total num frames: 841066496. Throughput: 0: 5820.5. Samples: 841068870. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:53,132][25689] Avg episode reward: [(0, '0.579')] [2022-07-10 17:29:54,799][26022] Updated weights on worker 0-0, policy_version 821364 (0.00090) [2022-07-10 17:29:56,686][26022] Updated weights on worker 0-0, policy_version 821374 (0.00084) [2022-07-10 17:29:58,176][25689] Fps is (10 sec: 5481.6, 60 sec: 5529.2, 300 sec: 5548.3). Total num frames: 841094144. Throughput: 0: 5840.3. Samples: 841102422. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:29:58,177][25689] Avg episode reward: [(0, '0.429')] [2022-07-10 17:29:58,589][26022] Updated weights on worker 0-0, policy_version 821384 (0.00086) [2022-07-10 17:30:00,407][26022] Updated weights on worker 0-0, policy_version 821394 (0.00087) [2022-07-10 17:30:02,562][26022] Updated weights on worker 0-0, policy_version 821404 (0.00091) [2022-07-10 17:30:03,218][25689] Fps is (10 sec: 5279.2, 60 sec: 5494.2, 300 sec: 5547.8). Total num frames: 841119744. Throughput: 0: 5006.4. Samples: 841119220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:03,218][25689] Avg episode reward: [(0, '-0.359')] [2022-07-10 17:30:04,388][26022] Updated weights on worker 0-0, policy_version 821414 (0.00091) [2022-07-10 17:30:06,383][26022] Updated weights on worker 0-0, policy_version 821424 (0.00094) [2022-07-10 17:30:08,199][26022] Updated weights on worker 0-0, policy_version 821434 (0.00094) [2022-07-10 17:30:08,238][25689] Fps is (10 sec: 5394.0, 60 sec: 5544.4, 300 sec: 5546.7). Total num frames: 841148416. Throughput: 0: 5713.0. Samples: 841150372. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:08,238][25689] Avg episode reward: [(0, '-0.505')] [2022-07-10 17:30:10,029][26022] Updated weights on worker 0-0, policy_version 821444 (0.00086) [2022-07-10 17:30:11,923][26022] Updated weights on worker 0-0, policy_version 821454 (0.00088) [2022-07-10 17:30:13,322][25689] Fps is (10 sec: 5674.9, 60 sec: 5542.4, 300 sec: 5546.1). Total num frames: 841177088. Throughput: 0: 5691.3. Samples: 841183784. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:13,324][25689] Avg episode reward: [(0, '-0.577')] [2022-07-10 17:30:14,004][26022] Updated weights on worker 0-0, policy_version 821464 (0.00087) [2022-07-10 17:30:15,539][26022] Updated weights on worker 0-0, policy_version 821474 (0.00087) [2022-07-10 17:30:17,564][26022] Updated weights on worker 0-0, policy_version 821484 (0.00085) [2022-07-10 17:30:18,363][25689] Fps is (10 sec: 5461.4, 60 sec: 5508.5, 300 sec: 5542.2). Total num frames: 841203712. Throughput: 0: 4862.6. Samples: 841200580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:18,363][25689] Avg episode reward: [(0, '-0.777')] [2022-07-10 17:30:18,992][26022] Updated weights on worker 0-0, policy_version 821494 (0.00085) [2022-07-10 17:30:21,239][26022] Updated weights on worker 0-0, policy_version 821504 (0.00087) [2022-07-10 17:30:23,007][26022] Updated weights on worker 0-0, policy_version 821514 (0.00092) [2022-07-10 17:30:23,383][25689] Fps is (10 sec: 5598.3, 60 sec: 5548.4, 300 sec: 5548.8). Total num frames: 841233408. Throughput: 0: 5697.3. Samples: 841234106. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:23,383][25689] Avg episode reward: [(0, '-1.094')] [2022-07-10 17:30:24,741][26022] Updated weights on worker 0-0, policy_version 821524 (0.00086) [2022-07-10 17:30:26,642][26022] Updated weights on worker 0-0, policy_version 821534 (0.00081) [2022-07-10 17:30:28,253][26022] Updated weights on worker 0-0, policy_version 821544 (0.00088) [2022-07-10 17:30:28,396][25689] Fps is (10 sec: 5817.7, 60 sec: 5550.4, 300 sec: 5553.6). Total num frames: 841262080. Throughput: 0: 5829.3. Samples: 841267878. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:28,397][25689] Avg episode reward: [(0, '-0.368')] [2022-07-10 17:30:30,340][26022] Updated weights on worker 0-0, policy_version 821554 (0.00090) [2022-07-10 17:30:31,932][26022] Updated weights on worker 0-0, policy_version 821564 (0.00088) [2022-07-10 17:30:33,461][25689] Fps is (10 sec: 5385.2, 60 sec: 5502.2, 300 sec: 5538.8). Total num frames: 841287680. Throughput: 0: 4998.2. Samples: 841284438. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:33,461][25689] Avg episode reward: [(0, '-0.341')] [2022-07-10 17:30:33,972][26022] Updated weights on worker 0-0, policy_version 821574 (0.00090) [2022-07-10 17:30:35,544][26022] Updated weights on worker 0-0, policy_version 821584 (0.00102) [2022-07-10 17:30:37,760][26022] Updated weights on worker 0-0, policy_version 821594 (0.00094) [2022-07-10 17:30:38,483][25689] Fps is (10 sec: 5379.9, 60 sec: 5536.4, 300 sec: 5545.6). Total num frames: 841316352. Throughput: 0: 5834.2. Samples: 841317968. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:38,484][25689] Avg episode reward: [(0, '-0.244')] [2022-07-10 17:30:39,252][26022] Updated weights on worker 0-0, policy_version 821604 (0.00091) [2022-07-10 17:30:41,535][26022] Updated weights on worker 0-0, policy_version 821614 (0.00085) [2022-07-10 17:30:42,954][26022] Updated weights on worker 0-0, policy_version 821624 (0.00088) [2022-07-10 17:30:43,490][25689] Fps is (10 sec: 5717.6, 60 sec: 5542.2, 300 sec: 5545.7). Total num frames: 841345024. Throughput: 0: 5829.5. Samples: 841351322. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:43,491][25689] Avg episode reward: [(0, '-0.724')] [2022-07-10 17:30:45,135][26022] Updated weights on worker 0-0, policy_version 821634 (0.00096) [2022-07-10 17:30:46,414][26022] Updated weights on worker 0-0, policy_version 821644 (0.00086) [2022-07-10 17:30:48,493][25689] Fps is (10 sec: 5524.3, 60 sec: 5509.5, 300 sec: 5543.5). Total num frames: 841371648. Throughput: 0: 5003.2. Samples: 841368430. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:48,494][25689] Avg episode reward: [(0, '-0.811')] [2022-07-10 17:30:48,690][26022] Updated weights on worker 0-0, policy_version 821654 (0.00096) [2022-07-10 17:30:50,271][26022] Updated weights on worker 0-0, policy_version 821664 (0.00082) [2022-07-10 17:30:50,656][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:30:50,673][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000821665_841384960.pth [2022-07-10 17:30:50,673][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000819714_839387136.pth [2022-07-10 17:30:52,271][26022] Updated weights on worker 0-0, policy_version 821674 (0.00089) [2022-07-10 17:30:53,555][25689] Fps is (10 sec: 5595.8, 60 sec: 5541.7, 300 sec: 5546.4). Total num frames: 841401344. Throughput: 0: 5854.3. Samples: 841402074. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:53,555][25689] Avg episode reward: [(0, '-0.575')] [2022-07-10 17:30:54,072][26022] Updated weights on worker 0-0, policy_version 821684 (0.00086) [2022-07-10 17:30:55,734][26022] Updated weights on worker 0-0, policy_version 821694 (0.00091) [2022-07-10 17:30:57,661][26022] Updated weights on worker 0-0, policy_version 821704 (0.00093) [2022-07-10 17:30:58,580][25689] Fps is (10 sec: 5684.7, 60 sec: 5543.4, 300 sec: 5542.7). Total num frames: 841428992. Throughput: 0: 5865.1. Samples: 841435840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:30:58,581][25689] Avg episode reward: [(0, '-0.251')] [2022-07-10 17:30:59,542][26022] Updated weights on worker 0-0, policy_version 821714 (0.00089) [2022-07-10 17:31:01,520][26022] Updated weights on worker 0-0, policy_version 821724 (0.00098) [2022-07-10 17:31:03,591][25689] Fps is (10 sec: 5305.7, 60 sec: 5546.3, 300 sec: 5543.2). Total num frames: 841454592. Throughput: 0: 5043.5. Samples: 841452700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:03,591][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 17:31:03,700][26022] Updated weights on worker 0-0, policy_version 821734 (0.00089) [2022-07-10 17:31:05,460][26022] Updated weights on worker 0-0, policy_version 821744 (0.00091) [2022-07-10 17:31:07,322][26022] Updated weights on worker 0-0, policy_version 821754 (0.00095) [2022-07-10 17:31:08,614][25689] Fps is (10 sec: 5307.0, 60 sec: 5529.1, 300 sec: 5537.3). Total num frames: 841482240. Throughput: 0: 5759.5. Samples: 841484316. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:08,622][25689] Avg episode reward: [(0, '-0.290')] [2022-07-10 17:31:09,112][26022] Updated weights on worker 0-0, policy_version 821764 (0.00091) [2022-07-10 17:31:11,177][26022] Updated weights on worker 0-0, policy_version 821774 (0.00086) [2022-07-10 17:31:12,808][26022] Updated weights on worker 0-0, policy_version 821784 (0.00087) [2022-07-10 17:31:13,714][25689] Fps is (10 sec: 5664.3, 60 sec: 5544.6, 300 sec: 5546.3). Total num frames: 841511936. Throughput: 0: 5705.9. Samples: 841517104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:13,715][25689] Avg episode reward: [(0, '0.435')] [2022-07-10 17:31:14,836][26022] Updated weights on worker 0-0, policy_version 821794 (0.00087) [2022-07-10 17:31:16,175][26022] Updated weights on worker 0-0, policy_version 821804 (0.00095) [2022-07-10 17:31:18,426][26022] Updated weights on worker 0-0, policy_version 821814 (0.00088) [2022-07-10 17:31:18,718][25689] Fps is (10 sec: 5573.8, 60 sec: 5547.9, 300 sec: 5539.8). Total num frames: 841538560. Throughput: 0: 4881.3. Samples: 841534140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:18,719][25689] Avg episode reward: [(0, '0.723')] [2022-07-10 17:31:20,176][26022] Updated weights on worker 0-0, policy_version 821824 (0.00084) [2022-07-10 17:31:22,074][26022] Updated weights on worker 0-0, policy_version 821834 (0.00088) [2022-07-10 17:31:23,723][25689] Fps is (10 sec: 5524.8, 60 sec: 5532.3, 300 sec: 5543.7). Total num frames: 841567232. Throughput: 0: 5712.3. Samples: 841567702. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:23,724][25689] Avg episode reward: [(0, '-0.017')] [2022-07-10 17:31:23,869][26022] Updated weights on worker 0-0, policy_version 821844 (0.00083) [2022-07-10 17:31:25,527][26022] Updated weights on worker 0-0, policy_version 821854 (0.00096) [2022-07-10 17:31:27,572][26022] Updated weights on worker 0-0, policy_version 821864 (0.00086) [2022-07-10 17:31:28,730][25689] Fps is (10 sec: 5727.9, 60 sec: 5532.9, 300 sec: 5545.4). Total num frames: 841595904. Throughput: 0: 5821.0. Samples: 841601410. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:28,730][25689] Avg episode reward: [(0, '0.834')] [2022-07-10 17:31:29,319][26022] Updated weights on worker 0-0, policy_version 821874 (0.00085) [2022-07-10 17:31:31,137][26022] Updated weights on worker 0-0, policy_version 821884 (0.00084) [2022-07-10 17:31:33,014][26022] Updated weights on worker 0-0, policy_version 821894 (0.00084) [2022-07-10 17:31:33,833][25689] Fps is (10 sec: 5672.3, 60 sec: 5580.3, 300 sec: 5542.0). Total num frames: 841624576. Throughput: 0: 5018.1. Samples: 841618060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:33,833][25689] Avg episode reward: [(0, '0.789')] [2022-07-10 17:31:34,888][26022] Updated weights on worker 0-0, policy_version 821904 (0.00086) [2022-07-10 17:31:36,527][26022] Updated weights on worker 0-0, policy_version 821914 (0.00087) [2022-07-10 17:31:38,611][26022] Updated weights on worker 0-0, policy_version 821924 (0.00091) [2022-07-10 17:31:38,862][25689] Fps is (10 sec: 5457.2, 60 sec: 5545.8, 300 sec: 5541.9). Total num frames: 841651200. Throughput: 0: 5847.8. Samples: 841651938. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:38,863][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 17:31:40,260][26022] Updated weights on worker 0-0, policy_version 821934 (0.00090) [2022-07-10 17:31:42,348][26022] Updated weights on worker 0-0, policy_version 821944 (0.00082) [2022-07-10 17:31:43,827][26022] Updated weights on worker 0-0, policy_version 821954 (0.00087) [2022-07-10 17:31:43,925][25689] Fps is (10 sec: 5580.4, 60 sec: 5557.6, 300 sec: 5541.6). Total num frames: 841680896. Throughput: 0: 5826.8. Samples: 841685414. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:43,926][25689] Avg episode reward: [(0, '-1.246')] [2022-07-10 17:31:45,854][26022] Updated weights on worker 0-0, policy_version 821964 (0.00085) [2022-07-10 17:31:47,700][26022] Updated weights on worker 0-0, policy_version 821974 (0.00088) [2022-07-10 17:31:49,008][25689] Fps is (10 sec: 5551.4, 60 sec: 5550.3, 300 sec: 5542.7). Total num frames: 841707520. Throughput: 0: 4969.5. Samples: 841702180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:49,008][25689] Avg episode reward: [(0, '-2.171')] [2022-07-10 17:31:49,500][26022] Updated weights on worker 0-0, policy_version 821984 (0.01437) [2022-07-10 17:31:51,284][26022] Updated weights on worker 0-0, policy_version 821994 (0.00087) [2022-07-10 17:31:53,123][26022] Updated weights on worker 0-0, policy_version 822004 (0.00084) [2022-07-10 17:31:54,051][25689] Fps is (10 sec: 5461.0, 60 sec: 5535.0, 300 sec: 5542.1). Total num frames: 841736192. Throughput: 0: 5817.7. Samples: 841735682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:54,051][25689] Avg episode reward: [(0, '-2.180')] [2022-07-10 17:31:55,158][26022] Updated weights on worker 0-0, policy_version 822014 (0.00091) [2022-07-10 17:31:56,785][26022] Updated weights on worker 0-0, policy_version 822024 (0.00087) [2022-07-10 17:31:58,762][26022] Updated weights on worker 0-0, policy_version 822034 (0.00082) [2022-07-10 17:31:59,067][25689] Fps is (10 sec: 5700.4, 60 sec: 5552.8, 300 sec: 5542.8). Total num frames: 841764864. Throughput: 0: 5806.7. Samples: 841769262. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:31:59,068][25689] Avg episode reward: [(0, '-2.133')] [2022-07-10 17:32:00,333][26022] Updated weights on worker 0-0, policy_version 822044 (0.00083) [2022-07-10 17:32:02,590][26022] Updated weights on worker 0-0, policy_version 822054 (0.00089) [2022-07-10 17:32:04,097][25689] Fps is (10 sec: 5402.2, 60 sec: 5551.0, 300 sec: 5539.8). Total num frames: 841790464. Throughput: 0: 4993.9. Samples: 841786150. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:04,097][25689] Avg episode reward: [(0, '-1.371')] [2022-07-10 17:32:04,612][26022] Updated weights on worker 0-0, policy_version 822064 (0.00087) [2022-07-10 17:32:06,179][26022] Updated weights on worker 0-0, policy_version 822074 (0.00088) [2022-07-10 17:32:08,043][26022] Updated weights on worker 0-0, policy_version 822084 (0.00091) [2022-07-10 17:32:09,102][25689] Fps is (10 sec: 5306.7, 60 sec: 5552.7, 300 sec: 5540.8). Total num frames: 841818112. Throughput: 0: 5749.0. Samples: 841817700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:09,102][25689] Avg episode reward: [(0, '-0.175')] [2022-07-10 17:32:09,907][26022] Updated weights on worker 0-0, policy_version 822094 (0.00101) [2022-07-10 17:32:12,022][26022] Updated weights on worker 0-0, policy_version 822104 (0.00096) [2022-07-10 17:32:13,592][26022] Updated weights on worker 0-0, policy_version 822114 (0.00083) [2022-07-10 17:32:14,171][25689] Fps is (10 sec: 5692.3, 60 sec: 5555.6, 300 sec: 5547.2). Total num frames: 841847808. Throughput: 0: 5732.1. Samples: 841851014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:14,171][25689] Avg episode reward: [(0, '0.253')] [2022-07-10 17:32:15,639][26022] Updated weights on worker 0-0, policy_version 822124 (0.00090) [2022-07-10 17:32:17,155][26022] Updated weights on worker 0-0, policy_version 822134 (0.00082) [2022-07-10 17:32:19,179][25689] Fps is (10 sec: 5588.5, 60 sec: 5555.2, 300 sec: 5538.4). Total num frames: 841874432. Throughput: 0: 5738.2. Samples: 841884668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:19,180][25689] Avg episode reward: [(0, '1.106')] [2022-07-10 17:32:19,195][26022] Updated weights on worker 0-0, policy_version 822144 (0.00089) [2022-07-10 17:32:21,163][26022] Updated weights on worker 0-0, policy_version 822154 (0.00091) [2022-07-10 17:32:22,715][26022] Updated weights on worker 0-0, policy_version 822164 (0.00085) [2022-07-10 17:32:24,184][25689] Fps is (10 sec: 5522.6, 60 sec: 5555.2, 300 sec: 5542.6). Total num frames: 841903104. Throughput: 0: 5746.5. Samples: 841901578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:24,184][25689] Avg episode reward: [(0, '1.310')] [2022-07-10 17:32:24,783][26022] Updated weights on worker 0-0, policy_version 822174 (0.00082) [2022-07-10 17:32:26,285][26022] Updated weights on worker 0-0, policy_version 822184 (0.00088) [2022-07-10 17:32:28,405][26022] Updated weights on worker 0-0, policy_version 822194 (0.00085) [2022-07-10 17:32:29,197][25689] Fps is (10 sec: 5622.3, 60 sec: 5537.7, 300 sec: 5543.3). Total num frames: 841930752. Throughput: 0: 5847.3. Samples: 841935202. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:29,197][25689] Avg episode reward: [(0, '1.186')] [2022-07-10 17:32:29,972][26022] Updated weights on worker 0-0, policy_version 822204 (0.00096) [2022-07-10 17:32:31,999][26022] Updated weights on worker 0-0, policy_version 822214 (0.00087) [2022-07-10 17:32:33,938][26022] Updated weights on worker 0-0, policy_version 822224 (0.00087) [2022-07-10 17:32:34,252][25689] Fps is (10 sec: 5593.9, 60 sec: 5542.1, 300 sec: 5546.1). Total num frames: 841959424. Throughput: 0: 5863.1. Samples: 841968750. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:34,252][25689] Avg episode reward: [(0, '0.984')] [2022-07-10 17:32:35,772][26022] Updated weights on worker 0-0, policy_version 822234 (0.00087) [2022-07-10 17:32:37,418][26022] Updated weights on worker 0-0, policy_version 822244 (0.00091) [2022-07-10 17:32:39,263][25689] Fps is (10 sec: 5595.0, 60 sec: 5560.8, 300 sec: 5539.4). Total num frames: 841987072. Throughput: 0: 5018.2. Samples: 841985450. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:39,263][25689] Avg episode reward: [(0, '1.116')] [2022-07-10 17:32:39,347][26022] Updated weights on worker 0-0, policy_version 822254 (0.00101) [2022-07-10 17:32:41,211][26022] Updated weights on worker 0-0, policy_version 822264 (0.00086) [2022-07-10 17:32:43,372][26022] Updated weights on worker 0-0, policy_version 822274 (0.00087) [2022-07-10 17:32:44,264][25689] Fps is (10 sec: 5625.2, 60 sec: 5549.5, 300 sec: 5550.1). Total num frames: 842015744. Throughput: 0: 5821.1. Samples: 842018468. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:44,264][25689] Avg episode reward: [(0, '0.969')] [2022-07-10 17:32:44,821][26022] Updated weights on worker 0-0, policy_version 822284 (0.00092) [2022-07-10 17:32:46,839][26022] Updated weights on worker 0-0, policy_version 822294 (0.00095) [2022-07-10 17:32:48,493][26022] Updated weights on worker 0-0, policy_version 822304 (0.00086) [2022-07-10 17:32:49,300][25689] Fps is (10 sec: 5509.1, 60 sec: 5553.7, 300 sec: 5540.7). Total num frames: 842042368. Throughput: 0: 5816.4. Samples: 842052132. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 17:32:49,300][25689] Avg episode reward: [(0, '1.082')] [2022-07-10 17:32:50,355][26022] Updated weights on worker 0-0, policy_version 822314 (0.00090) [2022-07-10 17:32:50,690][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:32:50,701][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000822316_842051584.pth [2022-07-10 17:32:50,702][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000820366_840054784.pth [2022-07-10 17:32:52,190][26022] Updated weights on worker 0-0, policy_version 822324 (0.00093) [2022-07-10 17:32:53,993][26022] Updated weights on worker 0-0, policy_version 822334 (0.00125) [2022-07-10 17:32:54,351][25689] Fps is (10 sec: 5583.4, 60 sec: 5570.0, 300 sec: 5544.0). Total num frames: 842072064. Throughput: 0: 4989.8. Samples: 842069040. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:32:54,353][25689] Avg episode reward: [(0, '0.771')] [2022-07-10 17:32:55,813][26022] Updated weights on worker 0-0, policy_version 822344 (0.00090) [2022-07-10 17:32:57,531][26022] Updated weights on worker 0-0, policy_version 822354 (0.00093) [2022-07-10 17:32:59,378][25689] Fps is (10 sec: 5689.9, 60 sec: 5552.0, 300 sec: 5547.4). Total num frames: 842099712. Throughput: 0: 5846.5. Samples: 842103054. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:32:59,380][25689] Avg episode reward: [(0, '0.152')] [2022-07-10 17:32:59,487][26022] Updated weights on worker 0-0, policy_version 822364 (0.00089) [2022-07-10 17:33:01,224][26022] Updated weights on worker 0-0, policy_version 822374 (0.00092) [2022-07-10 17:33:03,507][26022] Updated weights on worker 0-0, policy_version 822384 (0.00086) [2022-07-10 17:33:04,393][25689] Fps is (10 sec: 5302.3, 60 sec: 5553.4, 300 sec: 5540.7). Total num frames: 842125312. Throughput: 0: 5766.8. Samples: 842134550. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:04,395][25689] Avg episode reward: [(0, '-1.881')] [2022-07-10 17:33:05,397][26022] Updated weights on worker 0-0, policy_version 822394 (0.00078) [2022-07-10 17:33:07,325][26022] Updated weights on worker 0-0, policy_version 822404 (0.00089) [2022-07-10 17:33:09,039][26022] Updated weights on worker 0-0, policy_version 822414 (0.00088) [2022-07-10 17:33:09,411][25689] Fps is (10 sec: 5409.6, 60 sec: 5569.2, 300 sec: 5541.8). Total num frames: 842153984. Throughput: 0: 4941.2. Samples: 842151504. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:09,411][25689] Avg episode reward: [(0, '-1.869')] [2022-07-10 17:33:10,948][26022] Updated weights on worker 0-0, policy_version 822424 (0.00090) [2022-07-10 17:33:12,683][26022] Updated weights on worker 0-0, policy_version 822434 (0.00094) [2022-07-10 17:33:14,446][26022] Updated weights on worker 0-0, policy_version 822444 (0.00092) [2022-07-10 17:33:14,559][25689] Fps is (10 sec: 5641.0, 60 sec: 5545.0, 300 sec: 5547.1). Total num frames: 842182656. Throughput: 0: 5728.4. Samples: 842184798. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:14,559][25689] Avg episode reward: [(0, '-2.023')] [2022-07-10 17:33:16,377][26022] Updated weights on worker 0-0, policy_version 822454 (0.00087) [2022-07-10 17:33:18,238][26022] Updated weights on worker 0-0, policy_version 822465 (0.00085) [2022-07-10 17:33:19,562][25689] Fps is (10 sec: 5548.2, 60 sec: 5562.4, 300 sec: 5540.7). Total num frames: 842210304. Throughput: 0: 5714.3. Samples: 842218390. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:19,563][25689] Avg episode reward: [(0, '-1.947')] [2022-07-10 17:33:20,336][26022] Updated weights on worker 0-0, policy_version 822475 (0.00092) [2022-07-10 17:33:22,156][26022] Updated weights on worker 0-0, policy_version 822485 (0.00110) [2022-07-10 17:33:24,029][26022] Updated weights on worker 0-0, policy_version 822495 (0.01056) [2022-07-10 17:33:24,565][25689] Fps is (10 sec: 5628.7, 60 sec: 5562.5, 300 sec: 5547.7). Total num frames: 842238976. Throughput: 0: 4981.0. Samples: 842235024. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:24,565][25689] Avg episode reward: [(0, '-0.861')] [2022-07-10 17:33:25,851][26022] Updated weights on worker 0-0, policy_version 822505 (0.00090) [2022-07-10 17:33:27,771][26022] Updated weights on worker 0-0, policy_version 822515 (0.00092) [2022-07-10 17:33:29,498][26022] Updated weights on worker 0-0, policy_version 822525 (0.00091) [2022-07-10 17:33:29,631][25689] Fps is (10 sec: 5491.8, 60 sec: 5540.7, 300 sec: 5540.8). Total num frames: 842265600. Throughput: 0: 5784.4. Samples: 842268464. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:29,631][25689] Avg episode reward: [(0, '0.708')] [2022-07-10 17:33:31,403][26022] Updated weights on worker 0-0, policy_version 822535 (0.00093) [2022-07-10 17:33:33,270][26022] Updated weights on worker 0-0, policy_version 822545 (0.00090) [2022-07-10 17:33:34,749][25689] Fps is (10 sec: 5529.9, 60 sec: 5551.8, 300 sec: 5542.9). Total num frames: 842295296. Throughput: 0: 5808.8. Samples: 842302080. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:34,750][25689] Avg episode reward: [(0, '1.367')] [2022-07-10 17:33:34,754][26022] Updated weights on worker 0-0, policy_version 822555 (0.00089) [2022-07-10 17:33:37,015][26022] Updated weights on worker 0-0, policy_version 822565 (0.00094) [2022-07-10 17:33:38,453][26022] Updated weights on worker 0-0, policy_version 822575 (0.00085) [2022-07-10 17:33:39,766][25689] Fps is (10 sec: 5455.5, 60 sec: 5517.4, 300 sec: 5535.8). Total num frames: 842320896. Throughput: 0: 4971.5. Samples: 842318838. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:39,768][25689] Avg episode reward: [(0, '1.285')] [2022-07-10 17:33:40,407][26022] Updated weights on worker 0-0, policy_version 822585 (0.00095) [2022-07-10 17:33:42,381][26022] Updated weights on worker 0-0, policy_version 822595 (0.00608) [2022-07-10 17:33:43,939][26022] Updated weights on worker 0-0, policy_version 822605 (0.00077) [2022-07-10 17:33:44,789][25689] Fps is (10 sec: 5609.8, 60 sec: 5549.3, 300 sec: 5545.9). Total num frames: 842351616. Throughput: 0: 5794.7. Samples: 842352218. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:44,789][25689] Avg episode reward: [(0, '0.990')] [2022-07-10 17:33:46,186][26022] Updated weights on worker 0-0, policy_version 822615 (0.00092) [2022-07-10 17:33:47,539][26022] Updated weights on worker 0-0, policy_version 822625 (0.00088) [2022-07-10 17:33:49,811][26022] Updated weights on worker 0-0, policy_version 822635 (0.00088) [2022-07-10 17:33:49,821][25689] Fps is (10 sec: 5601.4, 60 sec: 5532.8, 300 sec: 5536.1). Total num frames: 842377216. Throughput: 0: 5815.3. Samples: 842385878. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:49,821][25689] Avg episode reward: [(0, '0.442')] [2022-07-10 17:33:51,288][26022] Updated weights on worker 0-0, policy_version 822645 (0.00084) [2022-07-10 17:33:53,306][26022] Updated weights on worker 0-0, policy_version 822655 (0.00091) [2022-07-10 17:33:54,930][25689] Fps is (10 sec: 5452.8, 60 sec: 5527.5, 300 sec: 5541.7). Total num frames: 842406912. Throughput: 0: 4963.4. Samples: 842402244. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:54,931][25689] Avg episode reward: [(0, '0.381')] [2022-07-10 17:33:55,248][26022] Updated weights on worker 0-0, policy_version 822665 (0.00079) [2022-07-10 17:33:57,216][26022] Updated weights on worker 0-0, policy_version 822675 (0.00090) [2022-07-10 17:33:58,979][26022] Updated weights on worker 0-0, policy_version 822685 (0.00086) [2022-07-10 17:33:59,976][25689] Fps is (10 sec: 5646.8, 60 sec: 5525.7, 300 sec: 5541.3). Total num frames: 842434560. Throughput: 0: 5774.2. Samples: 842435532. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:33:59,977][25689] Avg episode reward: [(0, '0.207')] [2022-07-10 17:34:00,871][26022] Updated weights on worker 0-0, policy_version 822695 (0.00086) [2022-07-10 17:34:02,679][26022] Updated weights on worker 0-0, policy_version 822705 (0.00082) [2022-07-10 17:34:04,972][26022] Updated weights on worker 0-0, policy_version 822715 (0.00088) [2022-07-10 17:34:05,068][25689] Fps is (10 sec: 5252.0, 60 sec: 5518.7, 300 sec: 5539.9). Total num frames: 842460160. Throughput: 0: 5673.3. Samples: 842467270. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:05,069][25689] Avg episode reward: [(0, '0.448')] [2022-07-10 17:34:06,347][26022] Updated weights on worker 0-0, policy_version 822725 (0.00087) [2022-07-10 17:34:08,545][26022] Updated weights on worker 0-0, policy_version 822735 (0.00095) [2022-07-10 17:34:10,103][25689] Fps is (10 sec: 5561.7, 60 sec: 5550.9, 300 sec: 5547.3). Total num frames: 842490880. Throughput: 0: 4848.9. Samples: 842484222. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:10,104][25689] Avg episode reward: [(0, '0.517')] [2022-07-10 17:34:10,107][26022] Updated weights on worker 0-0, policy_version 822745 (0.00518) [2022-07-10 17:34:12,171][26022] Updated weights on worker 0-0, policy_version 822755 (0.00080) [2022-07-10 17:34:13,864][26022] Updated weights on worker 0-0, policy_version 822765 (0.00081) [2022-07-10 17:34:15,203][25689] Fps is (10 sec: 5557.3, 60 sec: 5504.7, 300 sec: 5535.9). Total num frames: 842516480. Throughput: 0: 5678.2. Samples: 842517358. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:15,204][25689] Avg episode reward: [(0, '0.782')] [2022-07-10 17:34:15,846][26022] Updated weights on worker 0-0, policy_version 822775 (0.00054) [2022-07-10 17:34:17,523][26022] Updated weights on worker 0-0, policy_version 822785 (0.00089) [2022-07-10 17:34:19,444][26022] Updated weights on worker 0-0, policy_version 822795 (0.00093) [2022-07-10 17:34:20,217][25689] Fps is (10 sec: 5365.9, 60 sec: 5520.5, 300 sec: 5540.6). Total num frames: 842545152. Throughput: 0: 5701.3. Samples: 842550930. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:20,217][25689] Avg episode reward: [(0, '1.036')] [2022-07-10 17:34:21,282][26022] Updated weights on worker 0-0, policy_version 822805 (0.00087) [2022-07-10 17:34:23,143][26022] Updated weights on worker 0-0, policy_version 822815 (0.00088) [2022-07-10 17:34:24,860][26022] Updated weights on worker 0-0, policy_version 822825 (0.00084) [2022-07-10 17:34:25,234][25689] Fps is (10 sec: 5716.7, 60 sec: 5519.3, 300 sec: 5541.0). Total num frames: 842573824. Throughput: 0: 4975.8. Samples: 842567610. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:25,234][25689] Avg episode reward: [(0, '0.602')] [2022-07-10 17:34:27,029][26022] Updated weights on worker 0-0, policy_version 822835 (0.00089) [2022-07-10 17:34:28,798][26022] Updated weights on worker 0-0, policy_version 822845 (0.00089) [2022-07-10 17:34:30,246][25689] Fps is (10 sec: 5615.5, 60 sec: 5541.0, 300 sec: 5539.0). Total num frames: 842601472. Throughput: 0: 5799.0. Samples: 842601036. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:30,247][25689] Avg episode reward: [(0, '0.132')] [2022-07-10 17:34:30,649][26022] Updated weights on worker 0-0, policy_version 822855 (0.00057) [2022-07-10 17:34:32,347][26022] Updated weights on worker 0-0, policy_version 822865 (0.00085) [2022-07-10 17:34:34,323][26022] Updated weights on worker 0-0, policy_version 822875 (0.00095) [2022-07-10 17:34:35,371][25689] Fps is (10 sec: 5555.8, 60 sec: 5523.6, 300 sec: 5544.1). Total num frames: 842630144. Throughput: 0: 5812.3. Samples: 842634582. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:35,371][25689] Avg episode reward: [(0, '-0.265')] [2022-07-10 17:34:36,144][26022] Updated weights on worker 0-0, policy_version 822885 (0.00090) [2022-07-10 17:34:38,089][26022] Updated weights on worker 0-0, policy_version 822895 (0.00093) [2022-07-10 17:34:39,876][26022] Updated weights on worker 0-0, policy_version 822905 (0.00092) [2022-07-10 17:34:40,425][25689] Fps is (10 sec: 5633.6, 60 sec: 5570.8, 300 sec: 5544.4). Total num frames: 842658816. Throughput: 0: 4962.9. Samples: 842651222. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:40,426][25689] Avg episode reward: [(0, '-1.529')] [2022-07-10 17:34:41,717][26022] Updated weights on worker 0-0, policy_version 822915 (0.00087) [2022-07-10 17:34:43,433][26022] Updated weights on worker 0-0, policy_version 822925 (0.00088) [2022-07-10 17:34:45,451][25689] Fps is (10 sec: 5384.1, 60 sec: 5486.1, 300 sec: 5533.9). Total num frames: 842684416. Throughput: 0: 5764.9. Samples: 842684160. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:45,451][25689] Avg episode reward: [(0, '-1.121')] [2022-07-10 17:34:45,588][26022] Updated weights on worker 0-0, policy_version 822935 (0.00092) [2022-07-10 17:34:47,192][26022] Updated weights on worker 0-0, policy_version 822945 (0.00090) [2022-07-10 17:34:49,396][26022] Updated weights on worker 0-0, policy_version 822955 (0.00096) [2022-07-10 17:34:50,454][25689] Fps is (10 sec: 5411.7, 60 sec: 5539.4, 300 sec: 5538.1). Total num frames: 842713088. Throughput: 0: 5745.4. Samples: 842717136. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:50,454][25689] Avg episode reward: [(0, '-0.410')] [2022-07-10 17:34:50,890][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:34:50,899][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000822965_842716160.pth [2022-07-10 17:34:50,899][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000821016_840720384.pth [2022-07-10 17:34:50,907][26022] Updated weights on worker 0-0, policy_version 822965 (0.00087) [2022-07-10 17:34:52,968][26022] Updated weights on worker 0-0, policy_version 822975 (0.00091) [2022-07-10 17:34:54,662][26022] Updated weights on worker 0-0, policy_version 822985 (0.00093) [2022-07-10 17:34:55,562][25689] Fps is (10 sec: 5570.1, 60 sec: 5505.7, 300 sec: 5536.9). Total num frames: 842740736. Throughput: 0: 5727.0. Samples: 842750218. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:34:55,563][25689] Avg episode reward: [(0, '-0.674')] [2022-07-10 17:34:56,676][26022] Updated weights on worker 0-0, policy_version 822995 (0.00082) [2022-07-10 17:34:58,206][26022] Updated weights on worker 0-0, policy_version 823005 (0.00081) [2022-07-10 17:35:00,334][26022] Updated weights on worker 0-0, policy_version 823015 (0.00095) [2022-07-10 17:35:00,596][25689] Fps is (10 sec: 5452.3, 60 sec: 5506.9, 300 sec: 5543.9). Total num frames: 842768384. Throughput: 0: 5732.7. Samples: 842766854. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:00,596][25689] Avg episode reward: [(0, '0.097')] [2022-07-10 17:35:02,414][26022] Updated weights on worker 0-0, policy_version 823025 (0.00085) [2022-07-10 17:35:04,426][26022] Updated weights on worker 0-0, policy_version 823035 (0.00085) [2022-07-10 17:35:05,617][25689] Fps is (10 sec: 5397.7, 60 sec: 5530.2, 300 sec: 5537.0). Total num frames: 842795008. Throughput: 0: 5644.5. Samples: 842797986. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:05,619][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 17:35:06,018][26022] Updated weights on worker 0-0, policy_version 823045 (0.00090) [2022-07-10 17:35:08,090][26022] Updated weights on worker 0-0, policy_version 823055 (0.00101) [2022-07-10 17:35:09,745][26022] Updated weights on worker 0-0, policy_version 823065 (0.00093) [2022-07-10 17:35:10,693][25689] Fps is (10 sec: 5172.1, 60 sec: 5441.9, 300 sec: 5526.8). Total num frames: 842820608. Throughput: 0: 5632.8. Samples: 842831138. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:10,694][25689] Avg episode reward: [(0, '0.748')] [2022-07-10 17:35:11,765][26022] Updated weights on worker 0-0, policy_version 823075 (0.00088) [2022-07-10 17:35:13,710][26022] Updated weights on worker 0-0, policy_version 823085 (0.00089) [2022-07-10 17:35:15,449][26022] Updated weights on worker 0-0, policy_version 823095 (0.00089) [2022-07-10 17:35:15,761][25689] Fps is (10 sec: 5451.1, 60 sec: 5512.5, 300 sec: 5536.7). Total num frames: 842850304. Throughput: 0: 4803.4. Samples: 842847240. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:15,763][25689] Avg episode reward: [(0, '0.596')] [2022-07-10 17:35:17,645][26022] Updated weights on worker 0-0, policy_version 823105 (0.00091) [2022-07-10 17:35:19,219][26022] Updated weights on worker 0-0, policy_version 823115 (0.00089) [2022-07-10 17:35:20,779][25689] Fps is (10 sec: 5584.0, 60 sec: 5478.3, 300 sec: 5526.4). Total num frames: 842876928. Throughput: 0: 5619.6. Samples: 842880276. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:20,779][25689] Avg episode reward: [(0, '0.816')] [2022-07-10 17:35:21,362][26022] Updated weights on worker 0-0, policy_version 823125 (0.00086) [2022-07-10 17:35:22,890][26022] Updated weights on worker 0-0, policy_version 823135 (0.00097) [2022-07-10 17:35:24,887][26022] Updated weights on worker 0-0, policy_version 823145 (0.00087) [2022-07-10 17:35:25,787][25689] Fps is (10 sec: 5515.3, 60 sec: 5479.1, 300 sec: 5526.5). Total num frames: 842905600. Throughput: 0: 5753.3. Samples: 842914030. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:25,788][25689] Avg episode reward: [(0, '-0.055')] [2022-07-10 17:35:26,487][26022] Updated weights on worker 0-0, policy_version 823155 (0.00093) [2022-07-10 17:35:28,326][26022] Updated weights on worker 0-0, policy_version 823165 (0.00084) [2022-07-10 17:35:30,575][26022] Updated weights on worker 0-0, policy_version 823175 (0.00085) [2022-07-10 17:35:30,813][25689] Fps is (10 sec: 5511.0, 60 sec: 5461.0, 300 sec: 5530.6). Total num frames: 842932224. Throughput: 0: 4942.9. Samples: 842930588. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:30,815][25689] Avg episode reward: [(0, '-0.175')] [2022-07-10 17:35:31,917][26022] Updated weights on worker 0-0, policy_version 823185 (0.00088) [2022-07-10 17:35:34,204][26022] Updated weights on worker 0-0, policy_version 823195 (0.00093) [2022-07-10 17:35:35,527][26022] Updated weights on worker 0-0, policy_version 823205 (0.00084) [2022-07-10 17:35:35,897][25689] Fps is (10 sec: 5671.9, 60 sec: 5498.5, 300 sec: 5536.4). Total num frames: 842962944. Throughput: 0: 5808.4. Samples: 842964198. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:35,898][25689] Avg episode reward: [(0, '-0.355')] [2022-07-10 17:35:37,587][26022] Updated weights on worker 0-0, policy_version 823215 (0.00086) [2022-07-10 17:35:39,567][26022] Updated weights on worker 0-0, policy_version 823225 (0.00086) [2022-07-10 17:35:40,899][25689] Fps is (10 sec: 5786.8, 60 sec: 5486.3, 300 sec: 5533.0). Total num frames: 842990592. Throughput: 0: 5832.7. Samples: 842997632. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:40,900][25689] Avg episode reward: [(0, '-0.054')] [2022-07-10 17:35:41,374][26022] Updated weights on worker 0-0, policy_version 823235 (0.00086) [2022-07-10 17:35:43,140][26022] Updated weights on worker 0-0, policy_version 823245 (0.00081) [2022-07-10 17:35:45,238][26022] Updated weights on worker 0-0, policy_version 823255 (0.00085) [2022-07-10 17:35:45,911][25689] Fps is (10 sec: 5521.9, 60 sec: 5521.4, 300 sec: 5536.3). Total num frames: 843018240. Throughput: 0: 4971.7. Samples: 843014080. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:45,911][25689] Avg episode reward: [(0, '-0.386')] [2022-07-10 17:35:46,799][26022] Updated weights on worker 0-0, policy_version 823265 (0.00087) [2022-07-10 17:35:48,929][26022] Updated weights on worker 0-0, policy_version 823275 (0.00088) [2022-07-10 17:35:50,454][26022] Updated weights on worker 0-0, policy_version 823285 (0.00087) [2022-07-10 17:35:50,966][25689] Fps is (10 sec: 5594.4, 60 sec: 5516.6, 300 sec: 5533.0). Total num frames: 843046912. Throughput: 0: 5815.3. Samples: 843047788. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:50,968][25689] Avg episode reward: [(0, '-1.806')] [2022-07-10 17:35:52,542][26022] Updated weights on worker 0-0, policy_version 823295 (0.00095) [2022-07-10 17:35:54,223][26022] Updated weights on worker 0-0, policy_version 823305 (0.00088) [2022-07-10 17:35:56,039][25689] Fps is (10 sec: 5459.7, 60 sec: 5503.0, 300 sec: 5528.7). Total num frames: 843073536. Throughput: 0: 5796.8. Samples: 843080956. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:35:56,039][25689] Avg episode reward: [(0, '-1.073')] [2022-07-10 17:35:56,262][26022] Updated weights on worker 0-0, policy_version 823315 (0.00089) [2022-07-10 17:35:57,970][26022] Updated weights on worker 0-0, policy_version 823325 (0.00098) [2022-07-10 17:35:59,794][26022] Updated weights on worker 0-0, policy_version 823335 (0.00082) [2022-07-10 17:36:01,066][25689] Fps is (10 sec: 5475.0, 60 sec: 5520.4, 300 sec: 5538.7). Total num frames: 843102208. Throughput: 0: 4955.7. Samples: 843097574. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:36:01,067][25689] Avg episode reward: [(0, '-0.865')] [2022-07-10 17:36:02,125][26022] Updated weights on worker 0-0, policy_version 823345 (0.00084) [2022-07-10 17:36:03,727][26022] Updated weights on worker 0-0, policy_version 823355 (0.00094) [2022-07-10 17:36:05,839][26022] Updated weights on worker 0-0, policy_version 823365 (0.00085) [2022-07-10 17:36:06,094][25689] Fps is (10 sec: 5193.7, 60 sec: 5469.0, 300 sec: 5524.8). Total num frames: 843125760. Throughput: 0: 5703.5. Samples: 843129194. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:36:06,094][25689] Avg episode reward: [(0, '-0.694')] [2022-07-10 17:36:07,339][26022] Updated weights on worker 0-0, policy_version 823375 (0.00096) [2022-07-10 17:36:09,639][26022] Updated weights on worker 0-0, policy_version 823385 (0.00087) [2022-07-10 17:36:11,072][26022] Updated weights on worker 0-0, policy_version 823395 (0.00100) [2022-07-10 17:36:11,123][25689] Fps is (10 sec: 5396.6, 60 sec: 5558.0, 300 sec: 5529.6). Total num frames: 843156480. Throughput: 0: 5679.3. Samples: 843162262. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:36:11,123][25689] Avg episode reward: [(0, '-0.906')] [2022-07-10 17:36:12,963][26022] Updated weights on worker 0-0, policy_version 823405 (0.00091) [2022-07-10 17:36:14,991][26022] Updated weights on worker 0-0, policy_version 823415 (0.00086) [2022-07-10 17:36:16,202][25689] Fps is (10 sec: 5774.3, 60 sec: 5523.1, 300 sec: 5531.6). Total num frames: 843184128. Throughput: 0: 4858.5. Samples: 843178920. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:36:16,203][25689] Avg episode reward: [(0, '-0.433')] [2022-07-10 17:36:16,599][26022] Updated weights on worker 0-0, policy_version 823425 (0.00088) [2022-07-10 17:36:18,696][26022] Updated weights on worker 0-0, policy_version 823435 (0.00376) [2022-07-10 17:36:20,300][26022] Updated weights on worker 0-0, policy_version 823445 (0.00090) [2022-07-10 17:36:21,250][25689] Fps is (10 sec: 5358.8, 60 sec: 5520.3, 300 sec: 5523.9). Total num frames: 843210752. Throughput: 0: 5683.8. Samples: 843212298. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:36:21,251][25689] Avg episode reward: [(0, '0.451')] [2022-07-10 17:36:22,406][26022] Updated weights on worker 0-0, policy_version 823455 (0.00084) [2022-07-10 17:36:23,930][26022] Updated weights on worker 0-0, policy_version 823465 (0.00093) [2022-07-10 17:36:26,158][26022] Updated weights on worker 0-0, policy_version 823475 (0.00092) [2022-07-10 17:36:26,256][25689] Fps is (10 sec: 5398.3, 60 sec: 5503.6, 300 sec: 5520.5). Total num frames: 843238400. Throughput: 0: 5781.9. Samples: 843245768. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-10 17:36:26,256][25689] Avg episode reward: [(0, '0.508')] [2022-07-10 17:36:27,632][26022] Updated weights on worker 0-0, policy_version 823485 (0.00089) [2022-07-10 17:36:29,787][26022] Updated weights on worker 0-0, policy_version 823495 (0.00087) [2022-07-10 17:36:31,289][25689] Fps is (10 sec: 5712.3, 60 sec: 5553.8, 300 sec: 5525.2). Total num frames: 843268096. Throughput: 0: 4962.3. Samples: 843262334. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:36:31,290][25689] Avg episode reward: [(0, '-0.480')] [2022-07-10 17:36:31,310][26022] Updated weights on worker 0-0, policy_version 823505 (0.00085) [2022-07-10 17:36:33,526][26022] Updated weights on worker 0-0, policy_version 823515 (0.00089) [2022-07-10 17:36:35,130][26022] Updated weights on worker 0-0, policy_version 823525 (0.00097) [2022-07-10 17:36:36,384][25689] Fps is (10 sec: 5459.7, 60 sec: 5468.1, 300 sec: 5520.6). Total num frames: 843293696. Throughput: 0: 5771.6. Samples: 843295400. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:36:36,384][25689] Avg episode reward: [(0, '-0.507')] [2022-07-10 17:36:37,071][26022] Updated weights on worker 0-0, policy_version 823535 (0.00086) [2022-07-10 17:36:38,771][26022] Updated weights on worker 0-0, policy_version 823545 (0.00059) [2022-07-10 17:36:41,044][26022] Updated weights on worker 0-0, policy_version 823555 (0.00089) [2022-07-10 17:36:41,386][25689] Fps is (10 sec: 5476.4, 60 sec: 5502.0, 300 sec: 5521.7). Total num frames: 843323392. Throughput: 0: 5774.4. Samples: 843328570. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:36:41,387][25689] Avg episode reward: [(0, '-0.329')] [2022-07-10 17:36:42,751][26022] Updated weights on worker 0-0, policy_version 823565 (0.00093) [2022-07-10 17:36:44,614][26022] Updated weights on worker 0-0, policy_version 823575 (0.00093) [2022-07-10 17:36:46,331][26022] Updated weights on worker 0-0, policy_version 823585 (0.00088) [2022-07-10 17:36:46,405][25689] Fps is (10 sec: 5722.4, 60 sec: 5501.4, 300 sec: 5526.3). Total num frames: 843351040. Throughput: 0: 4936.8. Samples: 843345240. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:36:46,405][25689] Avg episode reward: [(0, '-0.593')] [2022-07-10 17:36:48,140][26022] Updated weights on worker 0-0, policy_version 823595 (0.00091) [2022-07-10 17:36:50,069][26022] Updated weights on worker 0-0, policy_version 823605 (0.00085) [2022-07-10 17:36:50,967][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:36:50,983][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000823609_843375616.pth [2022-07-10 17:36:50,983][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000821665_841384960.pth [2022-07-10 17:36:51,428][25689] Fps is (10 sec: 5608.7, 60 sec: 5504.3, 300 sec: 5526.7). Total num frames: 843379712. Throughput: 0: 5777.4. Samples: 843378684. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:36:51,428][25689] Avg episode reward: [(0, '-0.190')] [2022-07-10 17:36:51,851][26022] Updated weights on worker 0-0, policy_version 823615 (0.00091) [2022-07-10 17:36:53,716][26022] Updated weights on worker 0-0, policy_version 823625 (0.00082) [2022-07-10 17:36:55,551][26022] Updated weights on worker 0-0, policy_version 823635 (0.00092) [2022-07-10 17:36:56,504][25689] Fps is (10 sec: 5576.6, 60 sec: 5520.9, 300 sec: 5522.2). Total num frames: 843407360. Throughput: 0: 5816.1. Samples: 843412422. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:36:56,506][25689] Avg episode reward: [(0, '-0.134')] [2022-07-10 17:36:57,263][26022] Updated weights on worker 0-0, policy_version 823645 (0.00080) [2022-07-10 17:36:59,346][26022] Updated weights on worker 0-0, policy_version 823655 (0.00091) [2022-07-10 17:37:00,939][26022] Updated weights on worker 0-0, policy_version 823665 (0.00080) [2022-07-10 17:37:01,529][25689] Fps is (10 sec: 5575.6, 60 sec: 5521.1, 300 sec: 5532.6). Total num frames: 843436032. Throughput: 0: 5004.0. Samples: 843429364. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:01,531][25689] Avg episode reward: [(0, '0.435')] [2022-07-10 17:37:03,342][26022] Updated weights on worker 0-0, policy_version 823675 (0.00091) [2022-07-10 17:37:04,983][26022] Updated weights on worker 0-0, policy_version 823685 (0.00092) [2022-07-10 17:37:06,607][25689] Fps is (10 sec: 5270.8, 60 sec: 5533.5, 300 sec: 5520.9). Total num frames: 843460608. Throughput: 0: 5717.7. Samples: 843460748. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:06,607][25689] Avg episode reward: [(0, '0.813')] [2022-07-10 17:37:06,892][26022] Updated weights on worker 0-0, policy_version 823695 (0.00086) [2022-07-10 17:37:08,952][26022] Updated weights on worker 0-0, policy_version 823705 (0.00090) [2022-07-10 17:37:10,466][26022] Updated weights on worker 0-0, policy_version 823715 (0.00080) [2022-07-10 17:37:11,639][25689] Fps is (10 sec: 5267.0, 60 sec: 5499.4, 300 sec: 5518.2). Total num frames: 843489280. Throughput: 0: 5722.1. Samples: 843494332. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:11,639][25689] Avg episode reward: [(0, '-0.452')] [2022-07-10 17:37:12,649][26022] Updated weights on worker 0-0, policy_version 823725 (0.00092) [2022-07-10 17:37:14,264][26022] Updated weights on worker 0-0, policy_version 823735 (0.00094) [2022-07-10 17:37:16,259][26022] Updated weights on worker 0-0, policy_version 823745 (0.00083) [2022-07-10 17:37:16,701][25689] Fps is (10 sec: 5579.5, 60 sec: 5501.0, 300 sec: 5520.6). Total num frames: 843516928. Throughput: 0: 4865.5. Samples: 843510688. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:16,701][25689] Avg episode reward: [(0, '-0.714')] [2022-07-10 17:37:17,970][26022] Updated weights on worker 0-0, policy_version 823755 (0.00093) [2022-07-10 17:37:19,931][26022] Updated weights on worker 0-0, policy_version 823765 (0.00084) [2022-07-10 17:37:21,661][26022] Updated weights on worker 0-0, policy_version 823775 (0.00092) [2022-07-10 17:37:21,707][25689] Fps is (10 sec: 5593.7, 60 sec: 5538.6, 300 sec: 5520.6). Total num frames: 843545600. Throughput: 0: 5682.9. Samples: 843544034. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:21,708][25689] Avg episode reward: [(0, '-0.042')] [2022-07-10 17:37:23,448][26022] Updated weights on worker 0-0, policy_version 823785 (0.00094) [2022-07-10 17:37:25,511][26022] Updated weights on worker 0-0, policy_version 823795 (0.00087) [2022-07-10 17:37:26,721][25689] Fps is (10 sec: 5620.4, 60 sec: 5537.8, 300 sec: 5520.5). Total num frames: 843573248. Throughput: 0: 5804.0. Samples: 843577494. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:26,722][25689] Avg episode reward: [(0, '-0.288')] [2022-07-10 17:37:27,248][26022] Updated weights on worker 0-0, policy_version 823805 (0.00088) [2022-07-10 17:37:29,098][26022] Updated weights on worker 0-0, policy_version 823815 (0.00084) [2022-07-10 17:37:30,971][26022] Updated weights on worker 0-0, policy_version 823825 (0.00087) [2022-07-10 17:37:31,743][25689] Fps is (10 sec: 5305.9, 60 sec: 5471.2, 300 sec: 5510.8). Total num frames: 843598848. Throughput: 0: 4956.9. Samples: 843593986. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:31,743][25689] Avg episode reward: [(0, '-0.369')] [2022-07-10 17:37:32,721][26022] Updated weights on worker 0-0, policy_version 823835 (0.00088) [2022-07-10 17:37:34,829][26022] Updated weights on worker 0-0, policy_version 823845 (0.00085) [2022-07-10 17:37:36,299][26022] Updated weights on worker 0-0, policy_version 823855 (0.00085) [2022-07-10 17:37:36,803][25689] Fps is (10 sec: 5586.4, 60 sec: 5559.0, 300 sec: 5520.2). Total num frames: 843629568. Throughput: 0: 5823.0. Samples: 843627744. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:36,804][25689] Avg episode reward: [(0, '-0.424')] [2022-07-10 17:37:38,346][26022] Updated weights on worker 0-0, policy_version 823865 (0.00092) [2022-07-10 17:37:40,097][26022] Updated weights on worker 0-0, policy_version 823875 (0.00090) [2022-07-10 17:37:41,817][25689] Fps is (10 sec: 5793.7, 60 sec: 5524.1, 300 sec: 5516.6). Total num frames: 843657216. Throughput: 0: 5839.8. Samples: 843661472. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:41,818][25689] Avg episode reward: [(0, '-0.807')] [2022-07-10 17:37:41,919][26022] Updated weights on worker 0-0, policy_version 823885 (0.00087) [2022-07-10 17:37:43,789][26022] Updated weights on worker 0-0, policy_version 823895 (0.00081) [2022-07-10 17:37:45,352][26022] Updated weights on worker 0-0, policy_version 823905 (0.00093) [2022-07-10 17:37:46,826][25689] Fps is (10 sec: 5517.2, 60 sec: 5525.0, 300 sec: 5520.5). Total num frames: 843684864. Throughput: 0: 5862.2. Samples: 843695350. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:46,826][25689] Avg episode reward: [(0, '-1.282')] [2022-07-10 17:37:47,420][26022] Updated weights on worker 0-0, policy_version 823915 (0.00082) [2022-07-10 17:37:49,100][26022] Updated weights on worker 0-0, policy_version 823925 (0.00086) [2022-07-10 17:37:51,063][26022] Updated weights on worker 0-0, policy_version 823935 (0.00086) [2022-07-10 17:37:51,839][25689] Fps is (10 sec: 5619.6, 60 sec: 5525.8, 300 sec: 5517.8). Total num frames: 843713536. Throughput: 0: 5880.9. Samples: 843712172. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:51,850][25689] Avg episode reward: [(0, '-1.252')] [2022-07-10 17:37:52,880][26022] Updated weights on worker 0-0, policy_version 823945 (0.00089) [2022-07-10 17:37:54,720][26022] Updated weights on worker 0-0, policy_version 823955 (0.00091) [2022-07-10 17:37:56,379][26022] Updated weights on worker 0-0, policy_version 823965 (0.00083) [2022-07-10 17:37:56,908][25689] Fps is (10 sec: 5687.7, 60 sec: 5543.5, 300 sec: 5520.4). Total num frames: 843742208. Throughput: 0: 5874.0. Samples: 843745840. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:37:56,908][25689] Avg episode reward: [(0, '-1.319')] [2022-07-10 17:37:58,355][26022] Updated weights on worker 0-0, policy_version 823975 (0.00087) [2022-07-10 17:37:59,935][26022] Updated weights on worker 0-0, policy_version 823985 (0.00086) [2022-07-10 17:38:01,910][25689] Fps is (10 sec: 5490.6, 60 sec: 5511.6, 300 sec: 5524.1). Total num frames: 843768832. Throughput: 0: 5833.1. Samples: 843778678. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:01,914][25689] Avg episode reward: [(0, '-0.767')] [2022-07-10 17:38:02,537][26022] Updated weights on worker 0-0, policy_version 823995 (0.00086) [2022-07-10 17:38:04,256][26022] Updated weights on worker 0-0, policy_version 824005 (0.00089) [2022-07-10 17:38:05,973][26022] Updated weights on worker 0-0, policy_version 824015 (0.00073) [2022-07-10 17:38:06,921][25689] Fps is (10 sec: 5317.8, 60 sec: 5551.7, 300 sec: 5517.4). Total num frames: 843795456. Throughput: 0: 4920.4. Samples: 843794226. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:06,921][25689] Avg episode reward: [(0, '-1.309')] [2022-07-10 17:38:07,748][26022] Updated weights on worker 0-0, policy_version 824025 (0.00092) [2022-07-10 17:38:09,971][26022] Updated weights on worker 0-0, policy_version 824035 (0.00101) [2022-07-10 17:38:11,343][26022] Updated weights on worker 0-0, policy_version 824045 (0.00083) [2022-07-10 17:38:11,951][25689] Fps is (10 sec: 5609.1, 60 sec: 5568.9, 300 sec: 5523.0). Total num frames: 843825152. Throughput: 0: 5744.7. Samples: 843827708. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:11,953][25689] Avg episode reward: [(0, '0.155')] [2022-07-10 17:38:13,571][26022] Updated weights on worker 0-0, policy_version 824055 (0.00090) [2022-07-10 17:38:14,943][26022] Updated weights on worker 0-0, policy_version 824065 (0.00093) [2022-07-10 17:38:17,033][25689] Fps is (10 sec: 5468.4, 60 sec: 5533.1, 300 sec: 5514.6). Total num frames: 843850752. Throughput: 0: 5720.3. Samples: 843860962. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:17,033][25689] Avg episode reward: [(0, '0.718')] [2022-07-10 17:38:17,216][26022] Updated weights on worker 0-0, policy_version 824075 (0.00087) [2022-07-10 17:38:18,703][26022] Updated weights on worker 0-0, policy_version 824085 (0.00088) [2022-07-10 17:38:20,817][26022] Updated weights on worker 0-0, policy_version 824095 (0.00085) [2022-07-10 17:38:22,041][25689] Fps is (10 sec: 5581.8, 60 sec: 5566.9, 300 sec: 5521.4). Total num frames: 843881472. Throughput: 0: 4925.4. Samples: 843877830. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:22,042][25689] Avg episode reward: [(0, '0.670')] [2022-07-10 17:38:22,663][26022] Updated weights on worker 0-0, policy_version 824105 (0.00098) [2022-07-10 17:38:24,259][26022] Updated weights on worker 0-0, policy_version 824115 (0.00093) [2022-07-10 17:38:26,325][26022] Updated weights on worker 0-0, policy_version 824125 (0.00093) [2022-07-10 17:38:27,063][25689] Fps is (10 sec: 5717.4, 60 sec: 5549.2, 300 sec: 5522.3). Total num frames: 843908096. Throughput: 0: 5809.2. Samples: 843911232. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:27,063][25689] Avg episode reward: [(0, '0.711')] [2022-07-10 17:38:28,087][26022] Updated weights on worker 0-0, policy_version 824135 (0.00084) [2022-07-10 17:38:30,060][26022] Updated weights on worker 0-0, policy_version 824145 (0.00093) [2022-07-10 17:38:31,698][26022] Updated weights on worker 0-0, policy_version 824155 (0.00090) [2022-07-10 17:38:32,102][25689] Fps is (10 sec: 5292.4, 60 sec: 5564.5, 300 sec: 5513.4). Total num frames: 843934720. Throughput: 0: 5799.2. Samples: 843944570. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:32,103][25689] Avg episode reward: [(0, '0.745')] [2022-07-10 17:38:33,808][26022] Updated weights on worker 0-0, policy_version 824165 (0.00963) [2022-07-10 17:38:35,324][26022] Updated weights on worker 0-0, policy_version 824175 (0.00051) [2022-07-10 17:38:37,221][25689] Fps is (10 sec: 5644.9, 60 sec: 5559.1, 300 sec: 5528.7). Total num frames: 843965440. Throughput: 0: 4977.4. Samples: 843961446. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:37,222][25689] Avg episode reward: [(0, '0.915')] [2022-07-10 17:38:37,223][26022] Updated weights on worker 0-0, policy_version 824185 (0.00091) [2022-07-10 17:38:39,102][26022] Updated weights on worker 0-0, policy_version 824195 (0.00085) [2022-07-10 17:38:40,976][26022] Updated weights on worker 0-0, policy_version 824205 (0.00092) [2022-07-10 17:38:42,315][25689] Fps is (10 sec: 5715.1, 60 sec: 5551.8, 300 sec: 5517.1). Total num frames: 843993088. Throughput: 0: 5780.9. Samples: 843995034. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:42,316][25689] Avg episode reward: [(0, '1.112')] [2022-07-10 17:38:42,959][26022] Updated weights on worker 0-0, policy_version 824215 (0.00089) [2022-07-10 17:38:44,608][26022] Updated weights on worker 0-0, policy_version 824225 (0.00082) [2022-07-10 17:38:46,492][26022] Updated weights on worker 0-0, policy_version 824235 (0.00095) [2022-07-10 17:38:47,338][25689] Fps is (10 sec: 5465.6, 60 sec: 5550.4, 300 sec: 5524.1). Total num frames: 844020736. Throughput: 0: 5779.1. Samples: 844028408. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:47,339][25689] Avg episode reward: [(0, '-0.588')] [2022-07-10 17:38:48,482][26022] Updated weights on worker 0-0, policy_version 824245 (0.00096) [2022-07-10 17:38:50,265][26022] Updated weights on worker 0-0, policy_version 824255 (0.00087) [2022-07-10 17:38:51,215][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:38:51,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000824260_844042240.pth [2022-07-10 17:38:51,239][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000822316_842051584.pth [2022-07-10 17:38:52,152][26022] Updated weights on worker 0-0, policy_version 824265 (0.00096) [2022-07-10 17:38:52,344][25689] Fps is (10 sec: 5514.0, 60 sec: 5534.3, 300 sec: 5519.1). Total num frames: 844048384. Throughput: 0: 4961.6. Samples: 844044998. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:52,344][25689] Avg episode reward: [(0, '-0.615')] [2022-07-10 17:38:53,947][26022] Updated weights on worker 0-0, policy_version 824275 (0.00088) [2022-07-10 17:38:55,820][26022] Updated weights on worker 0-0, policy_version 824285 (0.00089) [2022-07-10 17:38:57,435][25689] Fps is (10 sec: 5476.7, 60 sec: 5515.3, 300 sec: 5518.3). Total num frames: 844076032. Throughput: 0: 5768.9. Samples: 844078056. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:38:57,435][25689] Avg episode reward: [(0, '-0.558')] [2022-07-10 17:38:57,631][26022] Updated weights on worker 0-0, policy_version 824295 (0.00096) [2022-07-10 17:38:59,552][26022] Updated weights on worker 0-0, policy_version 824305 (0.00107) [2022-07-10 17:39:01,345][26022] Updated weights on worker 0-0, policy_version 824315 (0.00092) [2022-07-10 17:39:02,449][25689] Fps is (10 sec: 5472.1, 60 sec: 5531.2, 300 sec: 5526.7). Total num frames: 844103680. Throughput: 0: 5701.4. Samples: 844109822. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:02,449][25689] Avg episode reward: [(0, '-1.131')] [2022-07-10 17:39:03,551][26022] Updated weights on worker 0-0, policy_version 824325 (0.01018) [2022-07-10 17:39:05,452][26022] Updated weights on worker 0-0, policy_version 824335 (0.00090) [2022-07-10 17:39:07,231][26022] Updated weights on worker 0-0, policy_version 824345 (0.00084) [2022-07-10 17:39:07,507][25689] Fps is (10 sec: 5388.2, 60 sec: 5526.8, 300 sec: 5512.4). Total num frames: 844130304. Throughput: 0: 4859.2. Samples: 844126414. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:07,508][25689] Avg episode reward: [(0, '-2.205')] [2022-07-10 17:39:09,003][26022] Updated weights on worker 0-0, policy_version 824355 (0.00083) [2022-07-10 17:39:10,796][26022] Updated weights on worker 0-0, policy_version 824365 (0.00088) [2022-07-10 17:39:12,522][25689] Fps is (10 sec: 5387.6, 60 sec: 5494.4, 300 sec: 5520.9). Total num frames: 844157952. Throughput: 0: 5701.6. Samples: 844160048. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:12,523][25689] Avg episode reward: [(0, '-3.835')] [2022-07-10 17:39:12,717][26022] Updated weights on worker 0-0, policy_version 824375 (0.00334) [2022-07-10 17:39:14,492][26022] Updated weights on worker 0-0, policy_version 824385 (0.00088) [2022-07-10 17:39:16,521][26022] Updated weights on worker 0-0, policy_version 824395 (0.00087) [2022-07-10 17:39:17,595][25689] Fps is (10 sec: 5684.6, 60 sec: 5562.8, 300 sec: 5523.3). Total num frames: 844187648. Throughput: 0: 5712.7. Samples: 844193224. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:17,596][25689] Avg episode reward: [(0, '-2.474')] [2022-07-10 17:39:18,244][26022] Updated weights on worker 0-0, policy_version 824405 (0.00092) [2022-07-10 17:39:20,092][26022] Updated weights on worker 0-0, policy_version 824415 (0.00089) [2022-07-10 17:39:21,925][26022] Updated weights on worker 0-0, policy_version 824425 (0.00095) [2022-07-10 17:39:22,603][25689] Fps is (10 sec: 5485.0, 60 sec: 5478.2, 300 sec: 5513.1). Total num frames: 844213248. Throughput: 0: 4961.8. Samples: 844209826. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:22,604][25689] Avg episode reward: [(0, '-2.493')] [2022-07-10 17:39:23,552][26022] Updated weights on worker 0-0, policy_version 824435 (0.00083) [2022-07-10 17:39:25,687][26022] Updated weights on worker 0-0, policy_version 824445 (0.00089) [2022-07-10 17:39:27,395][26022] Updated weights on worker 0-0, policy_version 824455 (0.00089) [2022-07-10 17:39:27,614][25689] Fps is (10 sec: 5519.2, 60 sec: 5530.0, 300 sec: 5520.0). Total num frames: 844242944. Throughput: 0: 5820.7. Samples: 844243448. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:27,616][25689] Avg episode reward: [(0, '-2.532')] [2022-07-10 17:39:29,352][26022] Updated weights on worker 0-0, policy_version 824465 (0.00086) [2022-07-10 17:39:31,168][26022] Updated weights on worker 0-0, policy_version 824475 (0.00088) [2022-07-10 17:39:32,650][25689] Fps is (10 sec: 5707.8, 60 sec: 5547.2, 300 sec: 5518.2). Total num frames: 844270592. Throughput: 0: 5798.6. Samples: 844276762. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:32,651][25689] Avg episode reward: [(0, '-2.855')] [2022-07-10 17:39:32,999][26022] Updated weights on worker 0-0, policy_version 824485 (0.00080) [2022-07-10 17:39:35,002][26022] Updated weights on worker 0-0, policy_version 824495 (0.00085) [2022-07-10 17:39:36,647][26022] Updated weights on worker 0-0, policy_version 824505 (0.00087) [2022-07-10 17:39:37,799][25689] Fps is (10 sec: 5428.9, 60 sec: 5493.7, 300 sec: 5513.0). Total num frames: 844298240. Throughput: 0: 4966.0. Samples: 844293558. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:37,800][25689] Avg episode reward: [(0, '-1.986')] [2022-07-10 17:39:38,573][26022] Updated weights on worker 0-0, policy_version 824515 (0.00086) [2022-07-10 17:39:40,442][26022] Updated weights on worker 0-0, policy_version 824525 (0.00088) [2022-07-10 17:39:42,076][26022] Updated weights on worker 0-0, policy_version 824535 (0.00093) [2022-07-10 17:39:42,850][25689] Fps is (10 sec: 5521.4, 60 sec: 5514.6, 300 sec: 5522.9). Total num frames: 844326912. Throughput: 0: 5784.2. Samples: 844326938. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:42,851][25689] Avg episode reward: [(0, '-0.626')] [2022-07-10 17:39:44,158][26022] Updated weights on worker 0-0, policy_version 824545 (0.00101) [2022-07-10 17:39:45,674][26022] Updated weights on worker 0-0, policy_version 824555 (0.00088) [2022-07-10 17:39:47,742][26022] Updated weights on worker 0-0, policy_version 824565 (0.00084) [2022-07-10 17:39:47,863][25689] Fps is (10 sec: 5596.3, 60 sec: 5515.5, 300 sec: 5519.2). Total num frames: 844354560. Throughput: 0: 5770.0. Samples: 844360286. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:47,864][25689] Avg episode reward: [(0, '-1.129')] [2022-07-10 17:39:49,658][26022] Updated weights on worker 0-0, policy_version 824575 (0.00086) [2022-07-10 17:39:51,412][26022] Updated weights on worker 0-0, policy_version 824585 (0.00085) [2022-07-10 17:39:52,875][25689] Fps is (10 sec: 5515.8, 60 sec: 5514.9, 300 sec: 5521.0). Total num frames: 844382208. Throughput: 0: 4942.5. Samples: 844376726. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:52,876][25689] Avg episode reward: [(0, '-1.234')] [2022-07-10 17:39:53,277][26022] Updated weights on worker 0-0, policy_version 824595 (0.00090) [2022-07-10 17:39:55,211][26022] Updated weights on worker 0-0, policy_version 824605 (0.00089) [2022-07-10 17:39:57,051][26022] Updated weights on worker 0-0, policy_version 824615 (0.00073) [2022-07-10 17:39:57,991][25689] Fps is (10 sec: 5560.7, 60 sec: 5529.5, 300 sec: 5522.9). Total num frames: 844410880. Throughput: 0: 5755.0. Samples: 844409764. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:39:57,992][25689] Avg episode reward: [(0, '-2.506')] [2022-07-10 17:39:59,053][26022] Updated weights on worker 0-0, policy_version 824625 (0.00085) [2022-07-10 17:40:00,757][26022] Updated weights on worker 0-0, policy_version 824635 (0.00082) [2022-07-10 17:40:02,994][25689] Fps is (10 sec: 5262.4, 60 sec: 5479.8, 300 sec: 5516.4). Total num frames: 844435456. Throughput: 0: 5657.1. Samples: 844440892. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 17:40:02,996][25689] Avg episode reward: [(0, '-1.271')] [2022-07-10 17:40:03,029][26022] Updated weights on worker 0-0, policy_version 824645 (0.00086) [2022-07-10 17:40:04,722][26022] Updated weights on worker 0-0, policy_version 824655 (0.00092) [2022-07-10 17:40:06,695][26022] Updated weights on worker 0-0, policy_version 824665 (0.00089) [2022-07-10 17:40:08,040][25689] Fps is (10 sec: 5299.2, 60 sec: 5514.8, 300 sec: 5527.3). Total num frames: 844464128. Throughput: 0: 4819.7. Samples: 844457530. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:08,041][25689] Avg episode reward: [(0, '-0.795')] [2022-07-10 17:40:08,326][26022] Updated weights on worker 0-0, policy_version 824675 (0.00093) [2022-07-10 17:40:10,545][26022] Updated weights on worker 0-0, policy_version 824685 (0.00086) [2022-07-10 17:40:11,888][26022] Updated weights on worker 0-0, policy_version 824695 (0.00086) [2022-07-10 17:40:13,047][25689] Fps is (10 sec: 5500.4, 60 sec: 5498.6, 300 sec: 5518.1). Total num frames: 844490752. Throughput: 0: 5677.9. Samples: 844491258. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:13,049][25689] Avg episode reward: [(0, '-0.912')] [2022-07-10 17:40:14,031][26022] Updated weights on worker 0-0, policy_version 824705 (0.00090) [2022-07-10 17:40:15,629][26022] Updated weights on worker 0-0, policy_version 824715 (0.00092) [2022-07-10 17:40:17,550][26022] Updated weights on worker 0-0, policy_version 824725 (0.00088) [2022-07-10 17:40:18,161][25689] Fps is (10 sec: 5665.6, 60 sec: 5511.7, 300 sec: 5530.1). Total num frames: 844521472. Throughput: 0: 5708.2. Samples: 844524896. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:18,162][25689] Avg episode reward: [(0, '-0.330')] [2022-07-10 17:40:19,703][26022] Updated weights on worker 0-0, policy_version 824735 (0.00092) [2022-07-10 17:40:21,202][26022] Updated weights on worker 0-0, policy_version 824745 (0.00082) [2022-07-10 17:40:23,174][25689] Fps is (10 sec: 5662.6, 60 sec: 5528.3, 300 sec: 5523.1). Total num frames: 844548096. Throughput: 0: 4987.8. Samples: 844541546. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:23,176][25689] Avg episode reward: [(0, '-0.203')] [2022-07-10 17:40:23,225][26022] Updated weights on worker 0-0, policy_version 824755 (0.00094) [2022-07-10 17:40:25,022][26022] Updated weights on worker 0-0, policy_version 824765 (0.00079) [2022-07-10 17:40:26,775][26022] Updated weights on worker 0-0, policy_version 824775 (0.00084) [2022-07-10 17:40:28,193][25689] Fps is (10 sec: 5512.2, 60 sec: 5510.6, 300 sec: 5530.1). Total num frames: 844576768. Throughput: 0: 5839.9. Samples: 844575222. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:28,195][25689] Avg episode reward: [(0, '0.510')] [2022-07-10 17:40:28,856][26022] Updated weights on worker 0-0, policy_version 824785 (0.00092) [2022-07-10 17:40:30,262][26022] Updated weights on worker 0-0, policy_version 824795 (0.00095) [2022-07-10 17:40:32,498][26022] Updated weights on worker 0-0, policy_version 824805 (0.00053) [2022-07-10 17:40:33,227][25689] Fps is (10 sec: 5703.8, 60 sec: 5527.6, 300 sec: 5524.1). Total num frames: 844605440. Throughput: 0: 5823.8. Samples: 844608784. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:33,229][25689] Avg episode reward: [(0, '0.745')] [2022-07-10 17:40:34,223][26022] Updated weights on worker 0-0, policy_version 824815 (0.00083) [2022-07-10 17:40:35,984][26022] Updated weights on worker 0-0, policy_version 824825 (0.00087) [2022-07-10 17:40:37,810][26022] Updated weights on worker 0-0, policy_version 824835 (0.00092) [2022-07-10 17:40:38,283][25689] Fps is (10 sec: 5581.4, 60 sec: 5536.1, 300 sec: 5523.1). Total num frames: 844633088. Throughput: 0: 5843.5. Samples: 844642480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:38,284][25689] Avg episode reward: [(0, '-0.023')] [2022-07-10 17:40:39,719][26022] Updated weights on worker 0-0, policy_version 824845 (0.00086) [2022-07-10 17:40:41,656][26022] Updated weights on worker 0-0, policy_version 824855 (0.00093) [2022-07-10 17:40:43,305][25689] Fps is (10 sec: 5486.8, 60 sec: 5521.8, 300 sec: 5523.0). Total num frames: 844660736. Throughput: 0: 5841.5. Samples: 844659146. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:43,306][25689] Avg episode reward: [(0, '-0.075')] [2022-07-10 17:40:43,405][26022] Updated weights on worker 0-0, policy_version 824865 (0.00087) [2022-07-10 17:40:45,193][26022] Updated weights on worker 0-0, policy_version 824875 (0.00090) [2022-07-10 17:40:47,052][26022] Updated weights on worker 0-0, policy_version 824885 (0.00106) [2022-07-10 17:40:48,320][25689] Fps is (10 sec: 5713.6, 60 sec: 5555.6, 300 sec: 5527.2). Total num frames: 844690432. Throughput: 0: 5849.1. Samples: 844692948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:48,320][25689] Avg episode reward: [(0, '-0.186')] [2022-07-10 17:40:48,847][26022] Updated weights on worker 0-0, policy_version 824895 (0.00084) [2022-07-10 17:40:50,713][26022] Updated weights on worker 0-0, policy_version 824905 (0.00093) [2022-07-10 17:40:51,345][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:40:51,362][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000824909_844706816.pth [2022-07-10 17:40:51,366][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000822965_842716160.pth [2022-07-10 17:40:52,615][26022] Updated weights on worker 0-0, policy_version 824915 (0.00090) [2022-07-10 17:40:53,351][25689] Fps is (10 sec: 5504.7, 60 sec: 5520.0, 300 sec: 5524.5). Total num frames: 844716032. Throughput: 0: 5848.6. Samples: 844726476. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:53,351][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 17:40:54,411][26022] Updated weights on worker 0-0, policy_version 824925 (0.00097) [2022-07-10 17:40:56,157][26022] Updated weights on worker 0-0, policy_version 824935 (0.00095) [2022-07-10 17:40:58,274][26022] Updated weights on worker 0-0, policy_version 824945 (0.00087) [2022-07-10 17:40:58,442][25689] Fps is (10 sec: 5260.5, 60 sec: 5505.3, 300 sec: 5519.9). Total num frames: 844743680. Throughput: 0: 4986.9. Samples: 844743010. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:40:58,442][25689] Avg episode reward: [(0, '-1.174')] [2022-07-10 17:40:59,947][26022] Updated weights on worker 0-0, policy_version 824955 (0.00090) [2022-07-10 17:41:01,900][26022] Updated weights on worker 0-0, policy_version 824965 (0.00082) [2022-07-10 17:41:03,446][25689] Fps is (10 sec: 5375.9, 60 sec: 5539.0, 300 sec: 5530.6). Total num frames: 844770304. Throughput: 0: 5753.8. Samples: 844775032. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:03,447][25689] Avg episode reward: [(0, '-2.337')] [2022-07-10 17:41:03,998][26022] Updated weights on worker 0-0, policy_version 824975 (0.00088) [2022-07-10 17:41:05,915][26022] Updated weights on worker 0-0, policy_version 824985 (0.00085) [2022-07-10 17:41:07,641][26022] Updated weights on worker 0-0, policy_version 824995 (0.00265) [2022-07-10 17:41:08,451][25689] Fps is (10 sec: 5524.4, 60 sec: 5542.8, 300 sec: 5524.2). Total num frames: 844798976. Throughput: 0: 5689.8. Samples: 844807492. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:08,452][25689] Avg episode reward: [(0, '-2.500')] [2022-07-10 17:41:09,545][26022] Updated weights on worker 0-0, policy_version 825005 (0.00092) [2022-07-10 17:41:11,168][26022] Updated weights on worker 0-0, policy_version 825015 (0.00097) [2022-07-10 17:41:13,420][26022] Updated weights on worker 0-0, policy_version 825025 (0.00089) [2022-07-10 17:41:13,477][25689] Fps is (10 sec: 5512.5, 60 sec: 5541.1, 300 sec: 5521.7). Total num frames: 844825600. Throughput: 0: 4850.3. Samples: 844824096. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:13,478][25689] Avg episode reward: [(0, '-3.150')] [2022-07-10 17:41:15,076][26022] Updated weights on worker 0-0, policy_version 825035 (0.00085) [2022-07-10 17:41:17,160][26022] Updated weights on worker 0-0, policy_version 825045 (0.00086) [2022-07-10 17:41:18,618][25689] Fps is (10 sec: 5539.7, 60 sec: 5521.7, 300 sec: 5530.3). Total num frames: 844855296. Throughput: 0: 5664.2. Samples: 844857290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:18,619][25689] Avg episode reward: [(0, '-2.775')] [2022-07-10 17:41:18,734][26022] Updated weights on worker 0-0, policy_version 825055 (0.00091) [2022-07-10 17:41:20,651][26022] Updated weights on worker 0-0, policy_version 825065 (0.00098) [2022-07-10 17:41:22,424][26022] Updated weights on worker 0-0, policy_version 825075 (0.00095) [2022-07-10 17:41:23,681][25689] Fps is (10 sec: 5619.6, 60 sec: 5534.0, 300 sec: 5529.3). Total num frames: 844882944. Throughput: 0: 5701.2. Samples: 844890398. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:23,682][25689] Avg episode reward: [(0, '-3.007')] [2022-07-10 17:41:24,511][26022] Updated weights on worker 0-0, policy_version 825085 (0.00087) [2022-07-10 17:41:26,127][26022] Updated weights on worker 0-0, policy_version 825095 (0.00086) [2022-07-10 17:41:28,203][26022] Updated weights on worker 0-0, policy_version 825105 (0.00085) [2022-07-10 17:41:28,693][25689] Fps is (10 sec: 5488.5, 60 sec: 5517.7, 300 sec: 5522.8). Total num frames: 844910592. Throughput: 0: 4916.9. Samples: 844907018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:28,693][25689] Avg episode reward: [(0, '-2.360')] [2022-07-10 17:41:29,916][26022] Updated weights on worker 0-0, policy_version 825115 (0.00088) [2022-07-10 17:41:31,847][26022] Updated weights on worker 0-0, policy_version 825125 (0.00081) [2022-07-10 17:41:33,559][26022] Updated weights on worker 0-0, policy_version 825135 (0.00086) [2022-07-10 17:41:33,718][25689] Fps is (10 sec: 5611.2, 60 sec: 5518.6, 300 sec: 5534.4). Total num frames: 844939264. Throughput: 0: 5753.3. Samples: 844940550. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:33,719][25689] Avg episode reward: [(0, '-2.070')] [2022-07-10 17:41:35,515][26022] Updated weights on worker 0-0, policy_version 825145 (0.00097) [2022-07-10 17:41:37,346][26022] Updated weights on worker 0-0, policy_version 825155 (0.00085) [2022-07-10 17:41:38,856][25689] Fps is (10 sec: 5441.0, 60 sec: 5494.3, 300 sec: 5521.6). Total num frames: 844965888. Throughput: 0: 5778.3. Samples: 844974230. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:38,856][25689] Avg episode reward: [(0, '-1.615')] [2022-07-10 17:41:39,168][26022] Updated weights on worker 0-0, policy_version 825165 (0.00090) [2022-07-10 17:41:41,004][26022] Updated weights on worker 0-0, policy_version 825175 (0.00107) [2022-07-10 17:41:42,567][26022] Updated weights on worker 0-0, policy_version 825185 (0.00083) [2022-07-10 17:41:43,907][25689] Fps is (10 sec: 5427.2, 60 sec: 5508.5, 300 sec: 5524.4). Total num frames: 844994560. Throughput: 0: 4965.7. Samples: 844990832. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:43,908][25689] Avg episode reward: [(0, '-1.480')] [2022-07-10 17:41:44,765][26022] Updated weights on worker 0-0, policy_version 825195 (0.00090) [2022-07-10 17:41:46,492][26022] Updated weights on worker 0-0, policy_version 825205 (0.00092) [2022-07-10 17:41:48,280][26022] Updated weights on worker 0-0, policy_version 825215 (0.00087) [2022-07-10 17:41:48,909][25689] Fps is (10 sec: 5703.8, 60 sec: 5492.7, 300 sec: 5524.8). Total num frames: 845023232. Throughput: 0: 5794.8. Samples: 845024168. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:48,910][25689] Avg episode reward: [(0, '-0.950')] [2022-07-10 17:41:50,051][26022] Updated weights on worker 0-0, policy_version 825225 (0.00087) [2022-07-10 17:41:51,799][26022] Updated weights on worker 0-0, policy_version 825235 (0.00089) [2022-07-10 17:41:53,810][26022] Updated weights on worker 0-0, policy_version 825245 (0.00090) [2022-07-10 17:41:53,915][25689] Fps is (10 sec: 5729.7, 60 sec: 5545.7, 300 sec: 5529.5). Total num frames: 845051904. Throughput: 0: 5816.0. Samples: 845058014. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:53,916][25689] Avg episode reward: [(0, '-1.843')] [2022-07-10 17:41:55,598][26022] Updated weights on worker 0-0, policy_version 825255 (0.00094) [2022-07-10 17:41:57,420][26022] Updated weights on worker 0-0, policy_version 825265 (0.00081) [2022-07-10 17:41:59,026][25689] Fps is (10 sec: 5567.2, 60 sec: 5543.9, 300 sec: 5524.5). Total num frames: 845079552. Throughput: 0: 4991.5. Samples: 845074908. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:41:59,026][25689] Avg episode reward: [(0, '-0.645')] [2022-07-10 17:41:59,351][26022] Updated weights on worker 0-0, policy_version 825275 (0.00084) [2022-07-10 17:42:00,871][26022] Updated weights on worker 0-0, policy_version 825285 (0.00083) [2022-07-10 17:42:03,233][26022] Updated weights on worker 0-0, policy_version 825295 (0.00083) [2022-07-10 17:42:04,031][25689] Fps is (10 sec: 5365.0, 60 sec: 5543.8, 300 sec: 5532.7). Total num frames: 845106176. Throughput: 0: 5761.6. Samples: 845106780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:04,032][25689] Avg episode reward: [(0, '-1.884')] [2022-07-10 17:42:05,100][26022] Updated weights on worker 0-0, policy_version 825305 (0.00793) [2022-07-10 17:42:06,848][26022] Updated weights on worker 0-0, policy_version 825315 (0.00088) [2022-07-10 17:42:08,795][26022] Updated weights on worker 0-0, policy_version 825325 (0.00087) [2022-07-10 17:42:09,049][25689] Fps is (10 sec: 5516.7, 60 sec: 5542.6, 300 sec: 5533.0). Total num frames: 845134848. Throughput: 0: 5774.0. Samples: 845140456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:09,051][25689] Avg episode reward: [(0, '-2.152')] [2022-07-10 17:42:10,500][26022] Updated weights on worker 0-0, policy_version 825335 (0.00085) [2022-07-10 17:42:12,359][26022] Updated weights on worker 0-0, policy_version 825345 (0.00084) [2022-07-10 17:42:14,059][25689] Fps is (10 sec: 5616.2, 60 sec: 5561.0, 300 sec: 5534.0). Total num frames: 845162496. Throughput: 0: 4929.9. Samples: 845157322. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:14,062][25689] Avg episode reward: [(0, '-1.848')] [2022-07-10 17:42:14,350][26022] Updated weights on worker 0-0, policy_version 825355 (0.00084) [2022-07-10 17:42:15,914][26022] Updated weights on worker 0-0, policy_version 825365 (0.00081) [2022-07-10 17:42:17,916][26022] Updated weights on worker 0-0, policy_version 825375 (0.00092) [2022-07-10 17:42:19,099][25689] Fps is (10 sec: 5604.2, 60 sec: 5553.4, 300 sec: 5533.3). Total num frames: 845191168. Throughput: 0: 5789.6. Samples: 845191122. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:19,099][25689] Avg episode reward: [(0, '-3.666')] [2022-07-10 17:42:19,707][26022] Updated weights on worker 0-0, policy_version 825385 (0.00085) [2022-07-10 17:42:21,420][26022] Updated weights on worker 0-0, policy_version 825395 (0.00087) [2022-07-10 17:42:23,269][26022] Updated weights on worker 0-0, policy_version 825405 (0.00088) [2022-07-10 17:42:24,121][25689] Fps is (10 sec: 5597.8, 60 sec: 5557.2, 300 sec: 5533.2). Total num frames: 845218816. Throughput: 0: 5865.5. Samples: 845224614. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:24,121][25689] Avg episode reward: [(0, '-3.466')] [2022-07-10 17:42:25,352][26022] Updated weights on worker 0-0, policy_version 825415 (0.00091) [2022-07-10 17:42:26,976][26022] Updated weights on worker 0-0, policy_version 825425 (0.00079) [2022-07-10 17:42:29,100][26022] Updated weights on worker 0-0, policy_version 825435 (0.00088) [2022-07-10 17:42:29,144][25689] Fps is (10 sec: 5403.0, 60 sec: 5539.2, 300 sec: 5536.6). Total num frames: 845245440. Throughput: 0: 5018.2. Samples: 845241294. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:29,144][25689] Avg episode reward: [(0, '-3.172')] [2022-07-10 17:42:30,679][26022] Updated weights on worker 0-0, policy_version 825445 (0.00090) [2022-07-10 17:42:32,690][26022] Updated weights on worker 0-0, policy_version 825455 (0.00093) [2022-07-10 17:42:34,159][25689] Fps is (10 sec: 5610.6, 60 sec: 5557.1, 300 sec: 5534.0). Total num frames: 845275136. Throughput: 0: 5848.8. Samples: 845274878. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:34,159][25689] Avg episode reward: [(0, '-2.860')] [2022-07-10 17:42:34,359][26022] Updated weights on worker 0-0, policy_version 825465 (0.00083) [2022-07-10 17:42:36,203][26022] Updated weights on worker 0-0, policy_version 825475 (0.00093) [2022-07-10 17:42:38,055][26022] Updated weights on worker 0-0, policy_version 825485 (0.00085) [2022-07-10 17:42:39,206][25689] Fps is (10 sec: 5597.2, 60 sec: 5565.4, 300 sec: 5530.0). Total num frames: 845301760. Throughput: 0: 5831.5. Samples: 845308376. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:39,206][25689] Avg episode reward: [(0, '-2.000')] [2022-07-10 17:42:39,894][26022] Updated weights on worker 0-0, policy_version 825495 (0.00092) [2022-07-10 17:42:41,683][26022] Updated weights on worker 0-0, policy_version 825505 (0.00083) [2022-07-10 17:42:43,512][26022] Updated weights on worker 0-0, policy_version 825515 (0.00088) [2022-07-10 17:42:44,220][25689] Fps is (10 sec: 5597.5, 60 sec: 5585.8, 300 sec: 5536.7). Total num frames: 845331456. Throughput: 0: 5008.5. Samples: 845325284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:44,221][25689] Avg episode reward: [(0, '-2.193')] [2022-07-10 17:42:45,387][26022] Updated weights on worker 0-0, policy_version 825525 (0.00086) [2022-07-10 17:42:47,224][26022] Updated weights on worker 0-0, policy_version 825535 (0.00090) [2022-07-10 17:42:49,092][26022] Updated weights on worker 0-0, policy_version 825545 (0.00090) [2022-07-10 17:42:49,224][25689] Fps is (10 sec: 5723.8, 60 sec: 5568.6, 300 sec: 5533.5). Total num frames: 845359104. Throughput: 0: 5847.9. Samples: 845358724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:49,225][25689] Avg episode reward: [(0, '0.457')] [2022-07-10 17:42:50,861][26022] Updated weights on worker 0-0, policy_version 825555 (0.00086) [2022-07-10 17:42:51,453][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:42:51,466][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000825558_845371392.pth [2022-07-10 17:42:51,466][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000823609_843375616.pth [2022-07-10 17:42:52,768][26022] Updated weights on worker 0-0, policy_version 825565 (0.00091) [2022-07-10 17:42:54,245][25689] Fps is (10 sec: 5516.0, 60 sec: 5550.3, 300 sec: 5530.9). Total num frames: 845386752. Throughput: 0: 5856.0. Samples: 845392504. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:54,245][25689] Avg episode reward: [(0, '-0.286')] [2022-07-10 17:42:54,637][26022] Updated weights on worker 0-0, policy_version 825575 (0.00082) [2022-07-10 17:42:56,427][26022] Updated weights on worker 0-0, policy_version 825585 (0.00083) [2022-07-10 17:42:58,376][26022] Updated weights on worker 0-0, policy_version 825595 (0.00086) [2022-07-10 17:42:59,286][25689] Fps is (10 sec: 5495.4, 60 sec: 5556.7, 300 sec: 5533.6). Total num frames: 845414400. Throughput: 0: 5016.1. Samples: 845409102. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:42:59,287][25689] Avg episode reward: [(0, '-0.302')] [2022-07-10 17:42:59,892][26022] Updated weights on worker 0-0, policy_version 825605 (0.00092) [2022-07-10 17:43:02,393][26022] Updated weights on worker 0-0, policy_version 825615 (0.00093) [2022-07-10 17:43:03,978][26022] Updated weights on worker 0-0, policy_version 825625 (0.00085) [2022-07-10 17:43:04,313][25689] Fps is (10 sec: 5288.6, 60 sec: 5537.7, 300 sec: 5529.9). Total num frames: 845440000. Throughput: 0: 5733.2. Samples: 845440482. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:04,314][25689] Avg episode reward: [(0, '-0.201')] [2022-07-10 17:43:05,863][26022] Updated weights on worker 0-0, policy_version 825635 (0.00084) [2022-07-10 17:43:07,725][26022] Updated weights on worker 0-0, policy_version 825645 (0.00086) [2022-07-10 17:43:09,335][25689] Fps is (10 sec: 5299.2, 60 sec: 5520.4, 300 sec: 5523.2). Total num frames: 845467648. Throughput: 0: 5726.7. Samples: 845473890. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:09,336][25689] Avg episode reward: [(0, '-0.094')] [2022-07-10 17:43:09,542][26022] Updated weights on worker 0-0, policy_version 825655 (0.00085) [2022-07-10 17:43:11,447][26022] Updated weights on worker 0-0, policy_version 825665 (0.00068) [2022-07-10 17:43:13,405][26022] Updated weights on worker 0-0, policy_version 825675 (0.00084) [2022-07-10 17:43:14,343][25689] Fps is (10 sec: 5615.5, 60 sec: 5537.6, 300 sec: 5534.9). Total num frames: 845496320. Throughput: 0: 4886.1. Samples: 845490704. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:14,343][25689] Avg episode reward: [(0, '-0.510')] [2022-07-10 17:43:15,084][26022] Updated weights on worker 0-0, policy_version 825685 (0.00090) [2022-07-10 17:43:17,096][26022] Updated weights on worker 0-0, policy_version 825695 (0.00086) [2022-07-10 17:43:18,831][26022] Updated weights on worker 0-0, policy_version 825705 (0.00085) [2022-07-10 17:43:19,415][25689] Fps is (10 sec: 5688.6, 60 sec: 5534.5, 300 sec: 5526.8). Total num frames: 845524992. Throughput: 0: 5728.0. Samples: 845524400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:19,417][25689] Avg episode reward: [(0, '-0.136')] [2022-07-10 17:43:20,758][26022] Updated weights on worker 0-0, policy_version 825715 (0.00086) [2022-07-10 17:43:22,555][26022] Updated weights on worker 0-0, policy_version 825725 (0.00094) [2022-07-10 17:43:24,184][26022] Updated weights on worker 0-0, policy_version 825735 (0.00088) [2022-07-10 17:43:24,435][25689] Fps is (10 sec: 5580.6, 60 sec: 5534.7, 300 sec: 5530.3). Total num frames: 845552640. Throughput: 0: 5847.0. Samples: 845558132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:24,436][25689] Avg episode reward: [(0, '-1.185')] [2022-07-10 17:43:26,281][26022] Updated weights on worker 0-0, policy_version 825745 (0.00081) [2022-07-10 17:43:27,903][26022] Updated weights on worker 0-0, policy_version 825755 (0.00099) [2022-07-10 17:43:29,443][25689] Fps is (10 sec: 5514.5, 60 sec: 5553.1, 300 sec: 5534.3). Total num frames: 845580288. Throughput: 0: 5005.9. Samples: 845574548. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:29,444][25689] Avg episode reward: [(0, '-0.958')] [2022-07-10 17:43:29,893][26022] Updated weights on worker 0-0, policy_version 825765 (0.00084) [2022-07-10 17:43:31,578][26022] Updated weights on worker 0-0, policy_version 825775 (0.00090) [2022-07-10 17:43:33,738][26022] Updated weights on worker 0-0, policy_version 825785 (0.00095) [2022-07-10 17:43:34,445][25689] Fps is (10 sec: 5626.0, 60 sec: 5537.3, 300 sec: 5529.6). Total num frames: 845608960. Throughput: 0: 5843.3. Samples: 845608170. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:34,446][25689] Avg episode reward: [(0, '-1.298')] [2022-07-10 17:43:35,220][26022] Updated weights on worker 0-0, policy_version 825795 (0.00089) [2022-07-10 17:43:37,422][26022] Updated weights on worker 0-0, policy_version 825805 (0.00083) [2022-07-10 17:43:38,959][26022] Updated weights on worker 0-0, policy_version 825815 (0.00087) [2022-07-10 17:43:39,514][25689] Fps is (10 sec: 5694.1, 60 sec: 5569.3, 300 sec: 5533.5). Total num frames: 845637632. Throughput: 0: 5845.6. Samples: 845641886. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 17:43:39,514][25689] Avg episode reward: [(0, '-1.553')] [2022-07-10 17:43:40,746][26022] Updated weights on worker 0-0, policy_version 825825 (0.00096) [2022-07-10 17:43:42,602][26022] Updated weights on worker 0-0, policy_version 825835 (0.00085) [2022-07-10 17:43:44,362][26022] Updated weights on worker 0-0, policy_version 825845 (0.00087) [2022-07-10 17:43:44,546][25689] Fps is (10 sec: 5677.4, 60 sec: 5550.7, 300 sec: 5536.8). Total num frames: 845666304. Throughput: 0: 5001.3. Samples: 845658714. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:43:44,546][25689] Avg episode reward: [(0, '-1.923')] [2022-07-10 17:43:46,479][26022] Updated weights on worker 0-0, policy_version 825855 (0.00085) [2022-07-10 17:43:48,063][26022] Updated weights on worker 0-0, policy_version 825865 (0.00083) [2022-07-10 17:43:49,613][25689] Fps is (10 sec: 5475.2, 60 sec: 5527.9, 300 sec: 5532.2). Total num frames: 845692928. Throughput: 0: 5822.6. Samples: 845691990. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:43:49,613][25689] Avg episode reward: [(0, '-1.627')] [2022-07-10 17:43:50,116][26022] Updated weights on worker 0-0, policy_version 825875 (0.00087) [2022-07-10 17:43:51,905][26022] Updated weights on worker 0-0, policy_version 825885 (0.00086) [2022-07-10 17:43:53,695][26022] Updated weights on worker 0-0, policy_version 825895 (0.00092) [2022-07-10 17:43:54,644][25689] Fps is (10 sec: 5374.7, 60 sec: 5527.0, 300 sec: 5533.3). Total num frames: 845720576. Throughput: 0: 5797.9. Samples: 845725276. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:43:54,644][25689] Avg episode reward: [(0, '-2.460')] [2022-07-10 17:43:55,578][26022] Updated weights on worker 0-0, policy_version 825905 (0.00088) [2022-07-10 17:43:57,510][26022] Updated weights on worker 0-0, policy_version 825915 (0.00095) [2022-07-10 17:43:59,339][26022] Updated weights on worker 0-0, policy_version 825925 (0.00088) [2022-07-10 17:43:59,719][25689] Fps is (10 sec: 5572.8, 60 sec: 5540.8, 300 sec: 5535.6). Total num frames: 845749248. Throughput: 0: 5776.3. Samples: 845758600. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:43:59,720][25689] Avg episode reward: [(0, '-2.302')] [2022-07-10 17:44:01,238][26022] Updated weights on worker 0-0, policy_version 825935 (0.00087) [2022-07-10 17:44:03,245][26022] Updated weights on worker 0-0, policy_version 825945 (0.00084) [2022-07-10 17:44:04,741][25689] Fps is (10 sec: 5273.4, 60 sec: 5524.4, 300 sec: 5529.4). Total num frames: 845773824. Throughput: 0: 5667.7. Samples: 845773174. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:04,742][25689] Avg episode reward: [(0, '-2.731')] [2022-07-10 17:44:05,275][26022] Updated weights on worker 0-0, policy_version 825955 (0.00078) [2022-07-10 17:44:07,042][26022] Updated weights on worker 0-0, policy_version 825965 (0.00087) [2022-07-10 17:44:08,724][26022] Updated weights on worker 0-0, policy_version 825975 (0.00090) [2022-07-10 17:44:09,760][25689] Fps is (10 sec: 5404.9, 60 sec: 5558.4, 300 sec: 5536.2). Total num frames: 845803520. Throughput: 0: 5688.0. Samples: 845806588. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:09,762][25689] Avg episode reward: [(0, '-2.452')] [2022-07-10 17:44:10,942][26022] Updated weights on worker 0-0, policy_version 825985 (0.00098) [2022-07-10 17:44:12,475][26022] Updated weights on worker 0-0, policy_version 825995 (0.00099) [2022-07-10 17:44:14,558][26022] Updated weights on worker 0-0, policy_version 826005 (0.00085) [2022-07-10 17:44:14,769][25689] Fps is (10 sec: 5513.9, 60 sec: 5507.5, 300 sec: 5523.6). Total num frames: 845829120. Throughput: 0: 5696.5. Samples: 845839924. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:14,770][25689] Avg episode reward: [(0, '-1.868')] [2022-07-10 17:44:16,148][26022] Updated weights on worker 0-0, policy_version 826015 (0.00087) [2022-07-10 17:44:18,184][26022] Updated weights on worker 0-0, policy_version 826025 (0.00085) [2022-07-10 17:44:19,866][25689] Fps is (10 sec: 5573.3, 60 sec: 5539.2, 300 sec: 5539.2). Total num frames: 845859840. Throughput: 0: 4870.9. Samples: 845856732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:19,868][25689] Avg episode reward: [(0, '-1.555')] [2022-07-10 17:44:19,870][26022] Updated weights on worker 0-0, policy_version 826035 (0.00089) [2022-07-10 17:44:21,932][26022] Updated weights on worker 0-0, policy_version 826045 (0.00088) [2022-07-10 17:44:23,660][26022] Updated weights on worker 0-0, policy_version 826055 (0.00096) [2022-07-10 17:44:24,881][25689] Fps is (10 sec: 5671.0, 60 sec: 5522.6, 300 sec: 5528.8). Total num frames: 845886464. Throughput: 0: 5812.6. Samples: 845890242. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:24,882][25689] Avg episode reward: [(0, '-1.422')] [2022-07-10 17:44:25,472][26022] Updated weights on worker 0-0, policy_version 826065 (0.00099) [2022-07-10 17:44:27,426][26022] Updated weights on worker 0-0, policy_version 826075 (0.00091) [2022-07-10 17:44:29,119][26022] Updated weights on worker 0-0, policy_version 826085 (0.00093) [2022-07-10 17:44:29,946][25689] Fps is (10 sec: 5485.3, 60 sec: 5534.3, 300 sec: 5531.7). Total num frames: 845915136. Throughput: 0: 5794.5. Samples: 845923556. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:29,947][25689] Avg episode reward: [(0, '-1.404')] [2022-07-10 17:44:31,092][26022] Updated weights on worker 0-0, policy_version 826095 (0.00093) [2022-07-10 17:44:32,782][26022] Updated weights on worker 0-0, policy_version 826105 (0.00094) [2022-07-10 17:44:34,708][26022] Updated weights on worker 0-0, policy_version 826115 (0.00091) [2022-07-10 17:44:34,960][25689] Fps is (10 sec: 5588.0, 60 sec: 5516.4, 300 sec: 5534.2). Total num frames: 845942784. Throughput: 0: 4960.1. Samples: 845940072. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:34,967][25689] Avg episode reward: [(0, '-0.644')] [2022-07-10 17:44:36,615][26022] Updated weights on worker 0-0, policy_version 826125 (0.00083) [2022-07-10 17:44:38,353][26022] Updated weights on worker 0-0, policy_version 826135 (0.00092) [2022-07-10 17:44:40,066][25689] Fps is (10 sec: 5464.2, 60 sec: 5496.1, 300 sec: 5529.7). Total num frames: 845970432. Throughput: 0: 5793.5. Samples: 845973762. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:40,066][25689] Avg episode reward: [(0, '-0.667')] [2022-07-10 17:44:40,172][26022] Updated weights on worker 0-0, policy_version 826145 (0.00089) [2022-07-10 17:44:42,126][26022] Updated weights on worker 0-0, policy_version 826155 (0.00083) [2022-07-10 17:44:44,018][26022] Updated weights on worker 0-0, policy_version 826165 (0.00087) [2022-07-10 17:44:45,079][25689] Fps is (10 sec: 5464.8, 60 sec: 5480.9, 300 sec: 5529.7). Total num frames: 845998080. Throughput: 0: 5786.5. Samples: 846007114. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:45,079][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 17:44:45,758][26022] Updated weights on worker 0-0, policy_version 826175 (0.00087) [2022-07-10 17:44:47,577][26022] Updated weights on worker 0-0, policy_version 826185 (0.00088) [2022-07-10 17:44:49,243][26022] Updated weights on worker 0-0, policy_version 826195 (0.00094) [2022-07-10 17:44:50,082][25689] Fps is (10 sec: 5622.8, 60 sec: 5520.5, 300 sec: 5533.3). Total num frames: 846026752. Throughput: 0: 4983.4. Samples: 846023902. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:50,083][25689] Avg episode reward: [(0, '-0.181')] [2022-07-10 17:44:51,299][26022] Updated weights on worker 0-0, policy_version 826205 (0.00080) [2022-07-10 17:44:51,475][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:44:51,486][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000826207_846035968.pth [2022-07-10 17:44:51,487][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000824260_844042240.pth [2022-07-10 17:44:53,144][26022] Updated weights on worker 0-0, policy_version 826215 (0.00089) [2022-07-10 17:44:54,939][26022] Updated weights on worker 0-0, policy_version 826225 (0.00085) [2022-07-10 17:44:55,099][25689] Fps is (10 sec: 5722.6, 60 sec: 5538.7, 300 sec: 5535.2). Total num frames: 846055424. Throughput: 0: 5838.9. Samples: 846057662. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:44:55,100][25689] Avg episode reward: [(0, '-0.670')] [2022-07-10 17:44:56,899][26022] Updated weights on worker 0-0, policy_version 826235 (0.00083) [2022-07-10 17:44:58,459][26022] Updated weights on worker 0-0, policy_version 826245 (0.00089) [2022-07-10 17:45:00,196][25689] Fps is (10 sec: 5467.8, 60 sec: 5503.0, 300 sec: 5540.3). Total num frames: 846082048. Throughput: 0: 5808.6. Samples: 846090686. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:00,196][25689] Avg episode reward: [(0, '-0.459')] [2022-07-10 17:45:00,512][26022] Updated weights on worker 0-0, policy_version 826255 (0.00089) [2022-07-10 17:45:02,797][26022] Updated weights on worker 0-0, policy_version 826265 (0.00090) [2022-07-10 17:45:04,591][26022] Updated weights on worker 0-0, policy_version 826275 (0.00070) [2022-07-10 17:45:05,212][25689] Fps is (10 sec: 5366.7, 60 sec: 5554.3, 300 sec: 5537.4). Total num frames: 846109696. Throughput: 0: 4864.2. Samples: 846105046. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:05,212][25689] Avg episode reward: [(0, '-0.451')] [2022-07-10 17:45:06,656][26022] Updated weights on worker 0-0, policy_version 826285 (0.00078) [2022-07-10 17:45:08,271][26022] Updated weights on worker 0-0, policy_version 826295 (0.00087) [2022-07-10 17:45:10,237][25689] Fps is (10 sec: 5404.6, 60 sec: 5502.9, 300 sec: 5537.1). Total num frames: 846136320. Throughput: 0: 5666.2. Samples: 846138104. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:10,238][25689] Avg episode reward: [(0, '-1.573')] [2022-07-10 17:45:10,239][26022] Updated weights on worker 0-0, policy_version 826305 (0.00088) [2022-07-10 17:45:11,915][26022] Updated weights on worker 0-0, policy_version 826315 (0.00094) [2022-07-10 17:45:13,770][26022] Updated weights on worker 0-0, policy_version 826325 (0.00115) [2022-07-10 17:45:15,241][25689] Fps is (10 sec: 5513.7, 60 sec: 5554.2, 300 sec: 5532.3). Total num frames: 846164992. Throughput: 0: 5664.6. Samples: 846171756. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:15,241][25689] Avg episode reward: [(0, '-1.781')] [2022-07-10 17:45:15,564][26022] Updated weights on worker 0-0, policy_version 826335 (0.00088) [2022-07-10 17:45:17,512][26022] Updated weights on worker 0-0, policy_version 826345 (0.00086) [2022-07-10 17:45:19,303][26022] Updated weights on worker 0-0, policy_version 826355 (0.00084) [2022-07-10 17:45:20,327][25689] Fps is (10 sec: 5582.1, 60 sec: 5504.4, 300 sec: 5534.3). Total num frames: 846192640. Throughput: 0: 4855.1. Samples: 846188424. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:20,327][25689] Avg episode reward: [(0, '-2.617')] [2022-07-10 17:45:21,041][26022] Updated weights on worker 0-0, policy_version 826365 (0.00515) [2022-07-10 17:45:23,136][26022] Updated weights on worker 0-0, policy_version 826375 (0.00089) [2022-07-10 17:45:24,814][26022] Updated weights on worker 0-0, policy_version 826385 (0.00090) [2022-07-10 17:45:25,335][25689] Fps is (10 sec: 5376.4, 60 sec: 5505.0, 300 sec: 5527.6). Total num frames: 846219264. Throughput: 0: 5804.1. Samples: 846221844. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:25,336][25689] Avg episode reward: [(0, '-1.284')] [2022-07-10 17:45:26,824][26022] Updated weights on worker 0-0, policy_version 826395 (0.00092) [2022-07-10 17:45:28,591][26022] Updated weights on worker 0-0, policy_version 826405 (0.00087) [2022-07-10 17:45:30,358][25689] Fps is (10 sec: 5512.3, 60 sec: 5508.9, 300 sec: 5527.9). Total num frames: 846247936. Throughput: 0: 5808.7. Samples: 846254980. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:30,359][25689] Avg episode reward: [(0, '-1.471')] [2022-07-10 17:45:30,498][26022] Updated weights on worker 0-0, policy_version 826415 (0.00087) [2022-07-10 17:45:32,239][26022] Updated weights on worker 0-0, policy_version 826425 (0.00085) [2022-07-10 17:45:34,069][26022] Updated weights on worker 0-0, policy_version 826435 (0.00091) [2022-07-10 17:45:35,382][25689] Fps is (10 sec: 5707.6, 60 sec: 5524.9, 300 sec: 5531.9). Total num frames: 846276608. Throughput: 0: 4973.5. Samples: 846271930. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:35,383][25689] Avg episode reward: [(0, '-0.423')] [2022-07-10 17:45:35,912][26022] Updated weights on worker 0-0, policy_version 826445 (0.00086) [2022-07-10 17:45:37,707][26022] Updated weights on worker 0-0, policy_version 826455 (0.00360) [2022-07-10 17:45:39,579][26022] Updated weights on worker 0-0, policy_version 826465 (0.00089) [2022-07-10 17:45:40,479][25689] Fps is (10 sec: 5564.9, 60 sec: 5525.7, 300 sec: 5530.5). Total num frames: 846304256. Throughput: 0: 5806.8. Samples: 846305442. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:40,479][25689] Avg episode reward: [(0, '-0.738')] [2022-07-10 17:45:41,380][26022] Updated weights on worker 0-0, policy_version 826475 (0.00087) [2022-07-10 17:45:43,303][26022] Updated weights on worker 0-0, policy_version 826485 (0.00102) [2022-07-10 17:45:44,992][26022] Updated weights on worker 0-0, policy_version 826495 (0.00083) [2022-07-10 17:45:45,511][25689] Fps is (10 sec: 5560.5, 60 sec: 5540.9, 300 sec: 5526.7). Total num frames: 846332928. Throughput: 0: 5822.0. Samples: 846339306. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:45,511][25689] Avg episode reward: [(0, '-0.035')] [2022-07-10 17:45:46,795][26022] Updated weights on worker 0-0, policy_version 826505 (0.00086) [2022-07-10 17:45:48,648][26022] Updated weights on worker 0-0, policy_version 826515 (0.00089) [2022-07-10 17:45:50,514][25689] Fps is (10 sec: 5612.1, 60 sec: 5524.0, 300 sec: 5534.1). Total num frames: 846360576. Throughput: 0: 5020.1. Samples: 846356166. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:50,515][25689] Avg episode reward: [(0, '-0.290')] [2022-07-10 17:45:50,755][26022] Updated weights on worker 0-0, policy_version 826525 (0.00083) [2022-07-10 17:45:52,319][26022] Updated weights on worker 0-0, policy_version 826535 (0.00090) [2022-07-10 17:45:54,288][26022] Updated weights on worker 0-0, policy_version 826545 (0.00088) [2022-07-10 17:45:55,528][25689] Fps is (10 sec: 5724.3, 60 sec: 5541.2, 300 sec: 5542.5). Total num frames: 846390272. Throughput: 0: 5840.6. Samples: 846389596. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:45:55,529][25689] Avg episode reward: [(0, '-0.461')] [2022-07-10 17:45:56,065][26022] Updated weights on worker 0-0, policy_version 826555 (0.00082) [2022-07-10 17:45:58,015][26022] Updated weights on worker 0-0, policy_version 826565 (0.00356) [2022-07-10 17:45:59,669][26022] Updated weights on worker 0-0, policy_version 826575 (0.00092) [2022-07-10 17:46:00,641][25689] Fps is (10 sec: 5460.2, 60 sec: 5522.7, 300 sec: 5537.0). Total num frames: 846415872. Throughput: 0: 5823.7. Samples: 846422864. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:00,642][25689] Avg episode reward: [(0, '-0.169')] [2022-07-10 17:46:01,469][26022] Updated weights on worker 0-0, policy_version 826585 (0.00088) [2022-07-10 17:46:03,698][26022] Updated weights on worker 0-0, policy_version 826595 (0.00084) [2022-07-10 17:46:05,669][25689] Fps is (10 sec: 5149.8, 60 sec: 5504.7, 300 sec: 5529.7). Total num frames: 846442496. Throughput: 0: 4885.5. Samples: 846437790. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:05,670][25689] Avg episode reward: [(0, '0.207')] [2022-07-10 17:46:05,686][26022] Updated weights on worker 0-0, policy_version 826605 (0.00084) [2022-07-10 17:46:07,378][26022] Updated weights on worker 0-0, policy_version 826615 (0.00085) [2022-07-10 17:46:09,135][26022] Updated weights on worker 0-0, policy_version 826625 (0.00085) [2022-07-10 17:46:10,683][25689] Fps is (10 sec: 5506.8, 60 sec: 5539.7, 300 sec: 5536.8). Total num frames: 846471168. Throughput: 0: 5713.2. Samples: 846471394. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:10,683][25689] Avg episode reward: [(0, '-0.801')] [2022-07-10 17:46:11,064][26022] Updated weights on worker 0-0, policy_version 826635 (0.00082) [2022-07-10 17:46:12,764][26022] Updated weights on worker 0-0, policy_version 826645 (0.00093) [2022-07-10 17:46:14,705][26022] Updated weights on worker 0-0, policy_version 826655 (0.00426) [2022-07-10 17:46:15,723][25689] Fps is (10 sec: 5805.7, 60 sec: 5553.3, 300 sec: 5538.7). Total num frames: 846500864. Throughput: 0: 5717.4. Samples: 846505056. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:15,723][25689] Avg episode reward: [(0, '-0.341')] [2022-07-10 17:46:16,494][26022] Updated weights on worker 0-0, policy_version 826665 (0.00089) [2022-07-10 17:46:18,277][26022] Updated weights on worker 0-0, policy_version 826675 (0.00089) [2022-07-10 17:46:20,456][26022] Updated weights on worker 0-0, policy_version 826685 (0.00090) [2022-07-10 17:46:20,855][25689] Fps is (10 sec: 5637.6, 60 sec: 5549.0, 300 sec: 5537.4). Total num frames: 846528512. Throughput: 0: 4893.9. Samples: 846521786. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:20,855][25689] Avg episode reward: [(0, '-0.238')] [2022-07-10 17:46:21,985][26022] Updated weights on worker 0-0, policy_version 826695 (0.00085) [2022-07-10 17:46:23,836][26022] Updated weights on worker 0-0, policy_version 826705 (0.00087) [2022-07-10 17:46:25,643][26022] Updated weights on worker 0-0, policy_version 826715 (0.00093) [2022-07-10 17:46:25,913][25689] Fps is (10 sec: 5526.9, 60 sec: 5578.3, 300 sec: 5540.0). Total num frames: 846557184. Throughput: 0: 5808.4. Samples: 846555374. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:25,913][25689] Avg episode reward: [(0, '-1.748')] [2022-07-10 17:46:27,369][26022] Updated weights on worker 0-0, policy_version 826725 (0.00097) [2022-07-10 17:46:29,404][26022] Updated weights on worker 0-0, policy_version 826735 (0.00068) [2022-07-10 17:46:30,935][25689] Fps is (10 sec: 5586.9, 60 sec: 5561.4, 300 sec: 5536.6). Total num frames: 846584832. Throughput: 0: 5787.7. Samples: 846588610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:30,936][25689] Avg episode reward: [(0, '-1.868')] [2022-07-10 17:46:31,329][26022] Updated weights on worker 0-0, policy_version 826745 (0.00090) [2022-07-10 17:46:33,099][26022] Updated weights on worker 0-0, policy_version 826755 (0.00089) [2022-07-10 17:46:35,132][26022] Updated weights on worker 0-0, policy_version 826765 (0.00088) [2022-07-10 17:46:36,005][25689] Fps is (10 sec: 5479.4, 60 sec: 5540.4, 300 sec: 5541.3). Total num frames: 846612480. Throughput: 0: 5780.5. Samples: 846622296. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:36,005][25689] Avg episode reward: [(0, '-2.299')] [2022-07-10 17:46:36,874][26022] Updated weights on worker 0-0, policy_version 826775 (0.00097) [2022-07-10 17:46:38,656][26022] Updated weights on worker 0-0, policy_version 826785 (0.00085) [2022-07-10 17:46:40,547][26022] Updated weights on worker 0-0, policy_version 826795 (0.00087) [2022-07-10 17:46:41,076][25689] Fps is (10 sec: 5553.6, 60 sec: 5559.6, 300 sec: 5540.9). Total num frames: 846641152. Throughput: 0: 5803.4. Samples: 846639142. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:41,077][25689] Avg episode reward: [(0, '-0.858')] [2022-07-10 17:46:42,045][26022] Updated weights on worker 0-0, policy_version 826805 (0.00090) [2022-07-10 17:46:44,114][26022] Updated weights on worker 0-0, policy_version 826815 (0.00080) [2022-07-10 17:46:45,796][26022] Updated weights on worker 0-0, policy_version 826825 (0.00090) [2022-07-10 17:46:46,175][25689] Fps is (10 sec: 5537.6, 60 sec: 5536.6, 300 sec: 5535.7). Total num frames: 846668800. Throughput: 0: 5801.5. Samples: 846672924. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:46,176][25689] Avg episode reward: [(0, '-2.143')] [2022-07-10 17:46:47,627][26022] Updated weights on worker 0-0, policy_version 826835 (0.00085) [2022-07-10 17:46:49,582][26022] Updated weights on worker 0-0, policy_version 826845 (0.00083) [2022-07-10 17:46:51,230][25689] Fps is (10 sec: 5647.4, 60 sec: 5565.6, 300 sec: 5538.2). Total num frames: 846698496. Throughput: 0: 5818.6. Samples: 846706700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:51,232][25689] Avg episode reward: [(0, '-1.649')] [2022-07-10 17:46:51,277][26022] Updated weights on worker 0-0, policy_version 826855 (0.00085) [2022-07-10 17:46:51,488][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:46:51,498][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000826856_846700544.pth [2022-07-10 17:46:51,498][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000824909_844706816.pth [2022-07-10 17:46:53,247][26022] Updated weights on worker 0-0, policy_version 826865 (0.00088) [2022-07-10 17:46:55,095][26022] Updated weights on worker 0-0, policy_version 826875 (0.00094) [2022-07-10 17:46:56,237][25689] Fps is (10 sec: 5699.0, 60 sec: 5532.5, 300 sec: 5540.1). Total num frames: 846726144. Throughput: 0: 4997.4. Samples: 846723406. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:46:56,238][25689] Avg episode reward: [(0, '-1.148')] [2022-07-10 17:46:56,899][26022] Updated weights on worker 0-0, policy_version 826885 (0.00089) [2022-07-10 17:46:58,892][26022] Updated weights on worker 0-0, policy_version 826895 (0.00093) [2022-07-10 17:47:00,689][26022] Updated weights on worker 0-0, policy_version 826905 (0.00087) [2022-07-10 17:47:01,276][25689] Fps is (10 sec: 5504.6, 60 sec: 5573.1, 300 sec: 5543.0). Total num frames: 846753792. Throughput: 0: 5807.8. Samples: 846756456. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:47:01,276][25689] Avg episode reward: [(0, '-2.069')] [2022-07-10 17:47:02,812][26022] Updated weights on worker 0-0, policy_version 826915 (0.00087) [2022-07-10 17:47:04,679][26022] Updated weights on worker 0-0, policy_version 826925 (0.00080) [2022-07-10 17:47:06,291][25689] Fps is (10 sec: 5194.5, 60 sec: 5540.5, 300 sec: 5529.2). Total num frames: 846778368. Throughput: 0: 5711.8. Samples: 846787822. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:47:06,291][25689] Avg episode reward: [(0, '-1.843')] [2022-07-10 17:47:06,582][26022] Updated weights on worker 0-0, policy_version 826935 (0.00087) [2022-07-10 17:47:08,491][26022] Updated weights on worker 0-0, policy_version 826945 (0.00087) [2022-07-10 17:47:10,348][26022] Updated weights on worker 0-0, policy_version 826955 (0.00085) [2022-07-10 17:47:11,302][25689] Fps is (10 sec: 5310.8, 60 sec: 5540.7, 300 sec: 5532.7). Total num frames: 846807040. Throughput: 0: 4877.9. Samples: 846804608. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:47:11,303][25689] Avg episode reward: [(0, '-2.334')] [2022-07-10 17:47:12,254][26022] Updated weights on worker 0-0, policy_version 826965 (0.00097) [2022-07-10 17:47:13,968][26022] Updated weights on worker 0-0, policy_version 826975 (0.00091) [2022-07-10 17:47:15,821][26022] Updated weights on worker 0-0, policy_version 826985 (0.00087) [2022-07-10 17:47:16,314][25689] Fps is (10 sec: 5721.3, 60 sec: 5526.4, 300 sec: 5533.2). Total num frames: 846835712. Throughput: 0: 5713.3. Samples: 846838110. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2022-07-10 17:47:16,315][25689] Avg episode reward: [(0, '-1.214')] [2022-07-10 17:47:17,591][26022] Updated weights on worker 0-0, policy_version 826995 (0.00091) [2022-07-10 17:47:19,615][26022] Updated weights on worker 0-0, policy_version 827005 (0.00084) [2022-07-10 17:47:21,195][26022] Updated weights on worker 0-0, policy_version 827015 (0.00057) [2022-07-10 17:47:21,431][25689] Fps is (10 sec: 5560.4, 60 sec: 5527.7, 300 sec: 5531.4). Total num frames: 846863360. Throughput: 0: 5692.9. Samples: 846871196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:21,431][25689] Avg episode reward: [(0, '-1.222')] [2022-07-10 17:47:23,074][26022] Updated weights on worker 0-0, policy_version 827025 (0.00084) [2022-07-10 17:47:25,043][26022] Updated weights on worker 0-0, policy_version 827035 (0.00106) [2022-07-10 17:47:26,526][25689] Fps is (10 sec: 5515.0, 60 sec: 5524.4, 300 sec: 5537.0). Total num frames: 846892032. Throughput: 0: 4942.3. Samples: 846887830. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:26,526][25689] Avg episode reward: [(0, '-0.575')] [2022-07-10 17:47:26,790][26022] Updated weights on worker 0-0, policy_version 827045 (0.00088) [2022-07-10 17:47:28,895][26022] Updated weights on worker 0-0, policy_version 827055 (0.00091) [2022-07-10 17:47:30,602][26022] Updated weights on worker 0-0, policy_version 827065 (0.00088) [2022-07-10 17:47:31,561][25689] Fps is (10 sec: 5458.5, 60 sec: 5506.3, 300 sec: 5526.3). Total num frames: 846918656. Throughput: 0: 5726.6. Samples: 846920622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:31,561][25689] Avg episode reward: [(0, '0.354')] [2022-07-10 17:47:32,474][26022] Updated weights on worker 0-0, policy_version 827075 (0.00085) [2022-07-10 17:47:34,335][26022] Updated weights on worker 0-0, policy_version 827085 (0.00093) [2022-07-10 17:47:36,024][26022] Updated weights on worker 0-0, policy_version 827095 (0.00085) [2022-07-10 17:47:36,568][25689] Fps is (10 sec: 5608.3, 60 sec: 5545.8, 300 sec: 5537.3). Total num frames: 846948352. Throughput: 0: 5740.3. Samples: 846954376. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:36,569][25689] Avg episode reward: [(0, '-0.144')] [2022-07-10 17:47:38,041][26022] Updated weights on worker 0-0, policy_version 827105 (0.00083) [2022-07-10 17:47:39,888][26022] Updated weights on worker 0-0, policy_version 827115 (0.00081) [2022-07-10 17:47:41,531][26022] Updated weights on worker 0-0, policy_version 827125 (0.00084) [2022-07-10 17:47:41,681][25689] Fps is (10 sec: 5666.7, 60 sec: 5525.2, 300 sec: 5528.6). Total num frames: 846976000. Throughput: 0: 4933.1. Samples: 846971090. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:41,681][25689] Avg episode reward: [(0, '0.887')] [2022-07-10 17:47:43,584][26022] Updated weights on worker 0-0, policy_version 827135 (0.00089) [2022-07-10 17:47:45,191][26022] Updated weights on worker 0-0, policy_version 827145 (0.00084) [2022-07-10 17:47:46,739][25689] Fps is (10 sec: 5335.7, 60 sec: 5511.9, 300 sec: 5524.2). Total num frames: 847002624. Throughput: 0: 5783.2. Samples: 847004730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:46,740][25689] Avg episode reward: [(0, '0.741')] [2022-07-10 17:47:47,276][26022] Updated weights on worker 0-0, policy_version 827155 (0.00088) [2022-07-10 17:47:48,858][26022] Updated weights on worker 0-0, policy_version 827165 (0.00091) [2022-07-10 17:47:50,816][26022] Updated weights on worker 0-0, policy_version 827175 (0.00619) [2022-07-10 17:47:51,751][25689] Fps is (10 sec: 5592.5, 60 sec: 5515.9, 300 sec: 5531.2). Total num frames: 847032320. Throughput: 0: 5820.5. Samples: 847038138. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:51,752][25689] Avg episode reward: [(0, '0.119')] [2022-07-10 17:47:52,676][26022] Updated weights on worker 0-0, policy_version 827185 (0.00092) [2022-07-10 17:47:54,458][26022] Updated weights on worker 0-0, policy_version 827195 (0.00086) [2022-07-10 17:47:56,315][26022] Updated weights on worker 0-0, policy_version 827205 (0.00088) [2022-07-10 17:47:56,794][25689] Fps is (10 sec: 5703.5, 60 sec: 5512.6, 300 sec: 5531.2). Total num frames: 847059968. Throughput: 0: 4961.8. Samples: 847054730. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:47:56,794][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 17:47:58,336][26022] Updated weights on worker 0-0, policy_version 827215 (0.00083) [2022-07-10 17:47:59,879][26022] Updated weights on worker 0-0, policy_version 827225 (0.00078) [2022-07-10 17:48:01,862][25689] Fps is (10 sec: 5367.8, 60 sec: 5493.0, 300 sec: 5533.9). Total num frames: 847086592. Throughput: 0: 5801.6. Samples: 847088174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:01,864][25689] Avg episode reward: [(0, '-1.165')] [2022-07-10 17:48:02,262][26022] Updated weights on worker 0-0, policy_version 827235 (0.00095) [2022-07-10 17:48:04,045][26022] Updated weights on worker 0-0, policy_version 827245 (0.00095) [2022-07-10 17:48:05,847][26022] Updated weights on worker 0-0, policy_version 827255 (0.00090) [2022-07-10 17:48:06,873][25689] Fps is (10 sec: 5384.5, 60 sec: 5544.1, 300 sec: 5534.1). Total num frames: 847114240. Throughput: 0: 5710.0. Samples: 847119692. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:06,873][25689] Avg episode reward: [(0, '-0.707')] [2022-07-10 17:48:07,752][26022] Updated weights on worker 0-0, policy_version 827265 (0.00083) [2022-07-10 17:48:09,643][26022] Updated weights on worker 0-0, policy_version 827275 (0.00099) [2022-07-10 17:48:11,441][26022] Updated weights on worker 0-0, policy_version 827285 (0.00084) [2022-07-10 17:48:11,904][25689] Fps is (10 sec: 5608.6, 60 sec: 5542.3, 300 sec: 5533.6). Total num frames: 847142912. Throughput: 0: 4864.3. Samples: 847136166. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:11,904][25689] Avg episode reward: [(0, '-1.130')] [2022-07-10 17:48:13,336][26022] Updated weights on worker 0-0, policy_version 827295 (0.00085) [2022-07-10 17:48:15,127][26022] Updated weights on worker 0-0, policy_version 827305 (0.00079) [2022-07-10 17:48:16,922][25689] Fps is (10 sec: 5400.6, 60 sec: 5491.0, 300 sec: 5524.3). Total num frames: 847168512. Throughput: 0: 5698.1. Samples: 847169424. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:16,923][25689] Avg episode reward: [(0, '-1.378')] [2022-07-10 17:48:17,068][26022] Updated weights on worker 0-0, policy_version 827315 (0.00092) [2022-07-10 17:48:18,920][26022] Updated weights on worker 0-0, policy_version 827325 (0.00094) [2022-07-10 17:48:20,661][26022] Updated weights on worker 0-0, policy_version 827335 (0.00093) [2022-07-10 17:48:22,005][25689] Fps is (10 sec: 5372.6, 60 sec: 5511.0, 300 sec: 5526.6). Total num frames: 847197184. Throughput: 0: 5707.1. Samples: 847203134. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:22,006][25689] Avg episode reward: [(0, '-0.254')] [2022-07-10 17:48:22,535][26022] Updated weights on worker 0-0, policy_version 827345 (0.00088) [2022-07-10 17:48:24,289][26022] Updated weights on worker 0-0, policy_version 827355 (0.00085) [2022-07-10 17:48:26,092][26022] Updated weights on worker 0-0, policy_version 827365 (0.00096) [2022-07-10 17:48:27,024][25689] Fps is (10 sec: 5676.8, 60 sec: 5517.9, 300 sec: 5529.8). Total num frames: 847225856. Throughput: 0: 4980.0. Samples: 847220044. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:27,024][25689] Avg episode reward: [(0, '0.501')] [2022-07-10 17:48:27,995][26022] Updated weights on worker 0-0, policy_version 827375 (0.00090) [2022-07-10 17:48:29,775][26022] Updated weights on worker 0-0, policy_version 827385 (0.00090) [2022-07-10 17:48:31,676][26022] Updated weights on worker 0-0, policy_version 827395 (0.00083) [2022-07-10 17:48:32,044][25689] Fps is (10 sec: 5712.3, 60 sec: 5553.2, 300 sec: 5529.5). Total num frames: 847254528. Throughput: 0: 5806.6. Samples: 847253112. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:32,044][25689] Avg episode reward: [(0, '0.769')] [2022-07-10 17:48:33,603][26022] Updated weights on worker 0-0, policy_version 827405 (0.00080) [2022-07-10 17:48:35,208][26022] Updated weights on worker 0-0, policy_version 827415 (0.00084) [2022-07-10 17:48:37,075][25689] Fps is (10 sec: 5603.4, 60 sec: 5517.1, 300 sec: 5526.7). Total num frames: 847282176. Throughput: 0: 5837.5. Samples: 847287064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:37,075][25689] Avg episode reward: [(0, '0.982')] [2022-07-10 17:48:37,156][26022] Updated weights on worker 0-0, policy_version 827425 (0.00085) [2022-07-10 17:48:38,981][26022] Updated weights on worker 0-0, policy_version 827435 (0.00086) [2022-07-10 17:48:40,802][26022] Updated weights on worker 0-0, policy_version 827445 (0.00054) [2022-07-10 17:48:42,119][25689] Fps is (10 sec: 5590.2, 60 sec: 5540.4, 300 sec: 5526.5). Total num frames: 847310848. Throughput: 0: 5005.3. Samples: 847303808. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:42,119][25689] Avg episode reward: [(0, '1.046')] [2022-07-10 17:48:42,679][26022] Updated weights on worker 0-0, policy_version 827455 (0.00095) [2022-07-10 17:48:44,334][26022] Updated weights on worker 0-0, policy_version 827465 (0.00096) [2022-07-10 17:48:46,416][26022] Updated weights on worker 0-0, policy_version 827475 (0.00089) [2022-07-10 17:48:47,181][25689] Fps is (10 sec: 5775.5, 60 sec: 5590.9, 300 sec: 5536.9). Total num frames: 847340544. Throughput: 0: 5833.0. Samples: 847337620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:47,181][25689] Avg episode reward: [(0, '-0.742')] [2022-07-10 17:48:48,170][26022] Updated weights on worker 0-0, policy_version 827485 (0.00084) [2022-07-10 17:48:49,942][26022] Updated weights on worker 0-0, policy_version 827495 (0.00086) [2022-07-10 17:48:51,557][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:48:51,582][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000827503_847363072.pth [2022-07-10 17:48:51,583][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000825558_845371392.pth [2022-07-10 17:48:51,887][26022] Updated weights on worker 0-0, policy_version 827505 (0.00086) [2022-07-10 17:48:52,208][25689] Fps is (10 sec: 5480.5, 60 sec: 5521.7, 300 sec: 5530.1). Total num frames: 847366144. Throughput: 0: 5842.8. Samples: 847370928. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:52,209][25689] Avg episode reward: [(0, '-0.825')] [2022-07-10 17:48:53,618][26022] Updated weights on worker 0-0, policy_version 827515 (0.00090) [2022-07-10 17:48:55,464][26022] Updated weights on worker 0-0, policy_version 827525 (0.00085) [2022-07-10 17:48:57,229][25689] Fps is (10 sec: 5401.4, 60 sec: 5540.6, 300 sec: 5531.1). Total num frames: 847394816. Throughput: 0: 5825.9. Samples: 847404478. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:48:57,229][25689] Avg episode reward: [(0, '-2.087')] [2022-07-10 17:48:57,395][26022] Updated weights on worker 0-0, policy_version 827535 (0.00090) [2022-07-10 17:48:59,028][26022] Updated weights on worker 0-0, policy_version 827545 (0.00091) [2022-07-10 17:49:01,101][26022] Updated weights on worker 0-0, policy_version 827555 (0.00088) [2022-07-10 17:49:02,283][25689] Fps is (10 sec: 5387.2, 60 sec: 5525.0, 300 sec: 5534.0). Total num frames: 847420416. Throughput: 0: 5802.0. Samples: 847420800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:02,283][25689] Avg episode reward: [(0, '-1.994')] [2022-07-10 17:49:03,118][26022] Updated weights on worker 0-0, policy_version 827565 (0.00086) [2022-07-10 17:49:05,091][26022] Updated weights on worker 0-0, policy_version 827575 (0.00094) [2022-07-10 17:49:06,931][26022] Updated weights on worker 0-0, policy_version 827585 (0.00085) [2022-07-10 17:49:07,287][25689] Fps is (10 sec: 5293.9, 60 sec: 5525.6, 300 sec: 5527.4). Total num frames: 847448064. Throughput: 0: 5713.6. Samples: 847452498. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:07,288][25689] Avg episode reward: [(0, '-2.116')] [2022-07-10 17:49:08,713][26022] Updated weights on worker 0-0, policy_version 827595 (0.00086) [2022-07-10 17:49:10,917][26022] Updated weights on worker 0-0, policy_version 827605 (0.00097) [2022-07-10 17:49:12,339][25689] Fps is (10 sec: 5600.5, 60 sec: 5523.7, 300 sec: 5536.9). Total num frames: 847476736. Throughput: 0: 5713.1. Samples: 847485936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:12,341][25689] Avg episode reward: [(0, '-1.755')] [2022-07-10 17:49:12,346][26022] Updated weights on worker 0-0, policy_version 827615 (0.00081) [2022-07-10 17:49:14,222][26022] Updated weights on worker 0-0, policy_version 827625 (0.00090) [2022-07-10 17:49:16,197][26022] Updated weights on worker 0-0, policy_version 827635 (0.00089) [2022-07-10 17:49:17,343][25689] Fps is (10 sec: 5600.8, 60 sec: 5558.9, 300 sec: 5528.3). Total num frames: 847504384. Throughput: 0: 4885.3. Samples: 847502742. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:17,344][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 17:49:17,879][26022] Updated weights on worker 0-0, policy_version 827645 (0.00089) [2022-07-10 17:49:19,776][26022] Updated weights on worker 0-0, policy_version 827655 (0.00094) [2022-07-10 17:49:21,616][26022] Updated weights on worker 0-0, policy_version 827665 (0.00091) [2022-07-10 17:49:22,464][25689] Fps is (10 sec: 5461.3, 60 sec: 5538.5, 300 sec: 5529.8). Total num frames: 847532032. Throughput: 0: 5719.8. Samples: 847536232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:22,466][25689] Avg episode reward: [(0, '-0.696')] [2022-07-10 17:49:23,409][26022] Updated weights on worker 0-0, policy_version 827675 (0.00091) [2022-07-10 17:49:25,519][26022] Updated weights on worker 0-0, policy_version 827685 (0.00090) [2022-07-10 17:49:27,063][26022] Updated weights on worker 0-0, policy_version 827695 (0.00085) [2022-07-10 17:49:27,486][25689] Fps is (10 sec: 5653.4, 60 sec: 5555.1, 300 sec: 5534.0). Total num frames: 847561728. Throughput: 0: 5789.1. Samples: 847569432. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:27,488][25689] Avg episode reward: [(0, '0.264')] [2022-07-10 17:49:29,220][26022] Updated weights on worker 0-0, policy_version 827705 (0.00093) [2022-07-10 17:49:30,730][26022] Updated weights on worker 0-0, policy_version 827715 (0.00086) [2022-07-10 17:49:32,503][25689] Fps is (10 sec: 5508.1, 60 sec: 5504.6, 300 sec: 5527.1). Total num frames: 847587328. Throughput: 0: 4970.3. Samples: 847586158. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:32,505][25689] Avg episode reward: [(0, '0.149')] [2022-07-10 17:49:32,813][26022] Updated weights on worker 0-0, policy_version 827725 (0.00098) [2022-07-10 17:49:34,462][26022] Updated weights on worker 0-0, policy_version 827735 (0.00088) [2022-07-10 17:49:36,372][26022] Updated weights on worker 0-0, policy_version 827745 (0.00090) [2022-07-10 17:49:37,533][25689] Fps is (10 sec: 5503.5, 60 sec: 5538.5, 300 sec: 5535.4). Total num frames: 847617024. Throughput: 0: 5792.4. Samples: 847619694. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:37,534][25689] Avg episode reward: [(0, '-0.055')] [2022-07-10 17:49:38,114][26022] Updated weights on worker 0-0, policy_version 827755 (0.00090) [2022-07-10 17:49:40,166][26022] Updated weights on worker 0-0, policy_version 827765 (0.00092) [2022-07-10 17:49:41,679][26022] Updated weights on worker 0-0, policy_version 827775 (0.00086) [2022-07-10 17:49:42,663][25689] Fps is (10 sec: 5644.1, 60 sec: 5513.7, 300 sec: 5533.2). Total num frames: 847644672. Throughput: 0: 5792.5. Samples: 847653234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:42,663][25689] Avg episode reward: [(0, '-1.174')] [2022-07-10 17:49:43,856][26022] Updated weights on worker 0-0, policy_version 827785 (0.00088) [2022-07-10 17:49:45,388][26022] Updated weights on worker 0-0, policy_version 827795 (0.00089) [2022-07-10 17:49:47,433][26022] Updated weights on worker 0-0, policy_version 827805 (0.00082) [2022-07-10 17:49:47,762][25689] Fps is (10 sec: 5606.0, 60 sec: 5510.3, 300 sec: 5534.9). Total num frames: 847674368. Throughput: 0: 4961.9. Samples: 847670042. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:47,763][25689] Avg episode reward: [(0, '-0.489')] [2022-07-10 17:49:49,261][26022] Updated weights on worker 0-0, policy_version 827815 (0.00085) [2022-07-10 17:49:50,923][26022] Updated weights on worker 0-0, policy_version 827825 (0.00096) [2022-07-10 17:49:52,732][26022] Updated weights on worker 0-0, policy_version 827835 (0.00083) [2022-07-10 17:49:52,776][25689] Fps is (10 sec: 5771.5, 60 sec: 5562.3, 300 sec: 5534.9). Total num frames: 847703040. Throughput: 0: 5803.9. Samples: 847703822. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:52,776][25689] Avg episode reward: [(0, '-0.541')] [2022-07-10 17:49:54,659][26022] Updated weights on worker 0-0, policy_version 827845 (0.00089) [2022-07-10 17:49:56,404][26022] Updated weights on worker 0-0, policy_version 827855 (0.00090) [2022-07-10 17:49:57,785][25689] Fps is (10 sec: 5414.7, 60 sec: 5512.6, 300 sec: 5533.1). Total num frames: 847728640. Throughput: 0: 5809.3. Samples: 847737344. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:49:57,786][25689] Avg episode reward: [(0, '-1.594')] [2022-07-10 17:49:58,323][26022] Updated weights on worker 0-0, policy_version 827865 (0.00092) [2022-07-10 17:49:59,993][26022] Updated weights on worker 0-0, policy_version 827875 (0.00092) [2022-07-10 17:50:02,521][26022] Updated weights on worker 0-0, policy_version 827885 (0.00079) [2022-07-10 17:50:02,839][25689] Fps is (10 sec: 5291.4, 60 sec: 5546.4, 300 sec: 5532.4). Total num frames: 847756288. Throughput: 0: 4997.0. Samples: 847754054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:02,839][25689] Avg episode reward: [(0, '-1.519')] [2022-07-10 17:50:04,491][26022] Updated weights on worker 0-0, policy_version 827895 (0.00087) [2022-07-10 17:50:06,118][26022] Updated weights on worker 0-0, policy_version 827905 (0.00090) [2022-07-10 17:50:07,846][25689] Fps is (10 sec: 5394.3, 60 sec: 5529.3, 300 sec: 5532.7). Total num frames: 847782912. Throughput: 0: 5716.3. Samples: 847784848. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:07,847][25689] Avg episode reward: [(0, '-0.758')] [2022-07-10 17:50:08,223][26022] Updated weights on worker 0-0, policy_version 827915 (0.00085) [2022-07-10 17:50:09,685][26022] Updated weights on worker 0-0, policy_version 827925 (0.00084) [2022-07-10 17:50:11,715][26022] Updated weights on worker 0-0, policy_version 827935 (0.00089) [2022-07-10 17:50:12,878][25689] Fps is (10 sec: 5406.2, 60 sec: 5514.2, 300 sec: 5528.8). Total num frames: 847810560. Throughput: 0: 5693.6. Samples: 847818274. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:12,878][25689] Avg episode reward: [(0, '0.067')] [2022-07-10 17:50:13,649][26022] Updated weights on worker 0-0, policy_version 827945 (0.00085) [2022-07-10 17:50:15,280][26022] Updated weights on worker 0-0, policy_version 827955 (0.00096) [2022-07-10 17:50:17,376][26022] Updated weights on worker 0-0, policy_version 827965 (0.00088) [2022-07-10 17:50:17,902][25689] Fps is (10 sec: 5600.5, 60 sec: 5529.2, 300 sec: 5533.4). Total num frames: 847839232. Throughput: 0: 4847.3. Samples: 847834858. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:17,904][25689] Avg episode reward: [(0, '0.002')] [2022-07-10 17:50:19,081][26022] Updated weights on worker 0-0, policy_version 827975 (0.00094) [2022-07-10 17:50:20,995][26022] Updated weights on worker 0-0, policy_version 827985 (0.00086) [2022-07-10 17:50:22,833][26022] Updated weights on worker 0-0, policy_version 827995 (0.00105) [2022-07-10 17:50:23,020][25689] Fps is (10 sec: 5553.0, 60 sec: 5529.5, 300 sec: 5534.8). Total num frames: 847866880. Throughput: 0: 5643.0. Samples: 847867936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:23,022][25689] Avg episode reward: [(0, '0.015')] [2022-07-10 17:50:24,503][26022] Updated weights on worker 0-0, policy_version 828005 (0.00091) [2022-07-10 17:50:26,644][26022] Updated weights on worker 0-0, policy_version 828015 (0.00088) [2022-07-10 17:50:28,042][25689] Fps is (10 sec: 5655.3, 60 sec: 5529.5, 300 sec: 5538.2). Total num frames: 847896576. Throughput: 0: 5760.3. Samples: 847901184. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:28,044][25689] Avg episode reward: [(0, '-1.289')] [2022-07-10 17:50:28,214][26022] Updated weights on worker 0-0, policy_version 828025 (0.00098) [2022-07-10 17:50:30,285][26022] Updated weights on worker 0-0, policy_version 828035 (0.00086) [2022-07-10 17:50:32,173][26022] Updated weights on worker 0-0, policy_version 828045 (0.00086) [2022-07-10 17:50:33,077][25689] Fps is (10 sec: 5599.9, 60 sec: 5544.8, 300 sec: 5531.1). Total num frames: 847923200. Throughput: 0: 4933.9. Samples: 847917936. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:33,078][25689] Avg episode reward: [(0, '-2.019')] [2022-07-10 17:50:33,866][26022] Updated weights on worker 0-0, policy_version 828055 (0.00098) [2022-07-10 17:50:35,878][26022] Updated weights on worker 0-0, policy_version 828065 (0.00087) [2022-07-10 17:50:37,577][26022] Updated weights on worker 0-0, policy_version 828075 (0.00080) [2022-07-10 17:50:38,101][25689] Fps is (10 sec: 5293.5, 60 sec: 5494.6, 300 sec: 5529.0). Total num frames: 847949824. Throughput: 0: 5757.2. Samples: 847951148. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:38,103][25689] Avg episode reward: [(0, '-1.661')] [2022-07-10 17:50:39,473][26022] Updated weights on worker 0-0, policy_version 828085 (0.00087) [2022-07-10 17:50:41,387][26022] Updated weights on worker 0-0, policy_version 828095 (0.00090) [2022-07-10 17:50:43,067][26022] Updated weights on worker 0-0, policy_version 828105 (0.00093) [2022-07-10 17:50:43,142][25689] Fps is (10 sec: 5595.6, 60 sec: 5536.5, 300 sec: 5532.3). Total num frames: 847979520. Throughput: 0: 5805.1. Samples: 847984748. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:43,144][25689] Avg episode reward: [(0, '-2.121')] [2022-07-10 17:50:45,122][26022] Updated weights on worker 0-0, policy_version 828115 (0.00083) [2022-07-10 17:50:46,802][26022] Updated weights on worker 0-0, policy_version 828125 (0.00089) [2022-07-10 17:50:48,161][25689] Fps is (10 sec: 5598.4, 60 sec: 5493.1, 300 sec: 5528.6). Total num frames: 848006144. Throughput: 0: 4990.5. Samples: 848001588. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:48,162][25689] Avg episode reward: [(0, '-2.618')] [2022-07-10 17:50:48,550][26022] Updated weights on worker 0-0, policy_version 828135 (0.00098) [2022-07-10 17:50:50,558][26022] Updated weights on worker 0-0, policy_version 828145 (0.00087) [2022-07-10 17:50:51,635][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:50:51,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000828152_848027648.pth [2022-07-10 17:50:51,651][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000826207_846035968.pth [2022-07-10 17:50:52,252][26022] Updated weights on worker 0-0, policy_version 828155 (0.00095) [2022-07-10 17:50:53,173][25689] Fps is (10 sec: 5512.8, 60 sec: 5493.3, 300 sec: 5525.2). Total num frames: 848034816. Throughput: 0: 5837.0. Samples: 848035234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2022-07-10 17:50:53,173][25689] Avg episode reward: [(0, '-1.877')] [2022-07-10 17:50:54,169][26022] Updated weights on worker 0-0, policy_version 828165 (0.00087) [2022-07-10 17:50:55,884][26022] Updated weights on worker 0-0, policy_version 828175 (0.00083) [2022-07-10 17:50:57,659][26022] Updated weights on worker 0-0, policy_version 828185 (0.00085) [2022-07-10 17:50:58,204][25689] Fps is (10 sec: 5812.1, 60 sec: 5559.1, 300 sec: 5540.5). Total num frames: 848064512. Throughput: 0: 5862.0. Samples: 848068990. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:50:58,204][25689] Avg episode reward: [(0, '-2.306')] [2022-07-10 17:50:59,714][26022] Updated weights on worker 0-0, policy_version 828195 (0.00091) [2022-07-10 17:51:01,523][26022] Updated weights on worker 0-0, policy_version 828205 (0.00088) [2022-07-10 17:51:03,281][25689] Fps is (10 sec: 5368.9, 60 sec: 5506.1, 300 sec: 5532.7). Total num frames: 848089088. Throughput: 0: 5015.2. Samples: 848085750. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:03,282][25689] Avg episode reward: [(0, '-1.114')] [2022-07-10 17:51:03,557][26022] Updated weights on worker 0-0, policy_version 828215 (0.00089) [2022-07-10 17:51:05,516][26022] Updated weights on worker 0-0, policy_version 828225 (0.00449) [2022-07-10 17:51:07,422][26022] Updated weights on worker 0-0, policy_version 828235 (0.00084) [2022-07-10 17:51:08,396][25689] Fps is (10 sec: 5224.4, 60 sec: 5530.1, 300 sec: 5530.8). Total num frames: 848117760. Throughput: 0: 5700.9. Samples: 848116944. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:08,397][25689] Avg episode reward: [(0, '-1.705')] [2022-07-10 17:51:09,368][26022] Updated weights on worker 0-0, policy_version 828245 (0.00085) [2022-07-10 17:51:11,226][26022] Updated weights on worker 0-0, policy_version 828255 (0.00092) [2022-07-10 17:51:12,939][26022] Updated weights on worker 0-0, policy_version 828265 (0.00093) [2022-07-10 17:51:13,435][25689] Fps is (10 sec: 5546.9, 60 sec: 5529.5, 300 sec: 5523.9). Total num frames: 848145408. Throughput: 0: 5668.9. Samples: 848150098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:13,435][25689] Avg episode reward: [(0, '-1.435')] [2022-07-10 17:51:14,797][26022] Updated weights on worker 0-0, policy_version 828275 (0.00082) [2022-07-10 17:51:16,725][26022] Updated weights on worker 0-0, policy_version 828285 (0.00092) [2022-07-10 17:51:18,370][26022] Updated weights on worker 0-0, policy_version 828295 (0.00088) [2022-07-10 17:51:18,453][25689] Fps is (10 sec: 5599.9, 60 sec: 5530.0, 300 sec: 5529.5). Total num frames: 848174080. Throughput: 0: 4842.2. Samples: 848167042. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:18,454][25689] Avg episode reward: [(0, '-1.539')] [2022-07-10 17:51:20,549][26022] Updated weights on worker 0-0, policy_version 828305 (0.00096) [2022-07-10 17:51:22,055][26022] Updated weights on worker 0-0, policy_version 828315 (0.00094) [2022-07-10 17:51:23,519][25689] Fps is (10 sec: 5483.3, 60 sec: 5517.8, 300 sec: 5522.5). Total num frames: 848200704. Throughput: 0: 5667.3. Samples: 848200444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:23,520][25689] Avg episode reward: [(0, '-0.963')] [2022-07-10 17:51:24,119][26022] Updated weights on worker 0-0, policy_version 828325 (0.00091) [2022-07-10 17:51:25,695][26022] Updated weights on worker 0-0, policy_version 828335 (0.00090) [2022-07-10 17:51:27,679][26022] Updated weights on worker 0-0, policy_version 828345 (0.00627) [2022-07-10 17:51:28,549][25689] Fps is (10 sec: 5578.7, 60 sec: 5517.2, 300 sec: 5529.2). Total num frames: 848230400. Throughput: 0: 5806.3. Samples: 848233958. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:28,549][25689] Avg episode reward: [(0, '-1.748')] [2022-07-10 17:51:29,634][26022] Updated weights on worker 0-0, policy_version 828355 (0.00090) [2022-07-10 17:51:31,411][26022] Updated weights on worker 0-0, policy_version 828365 (0.01308) [2022-07-10 17:51:33,315][26022] Updated weights on worker 0-0, policy_version 828375 (0.00085) [2022-07-10 17:51:33,561][25689] Fps is (10 sec: 5710.8, 60 sec: 5536.2, 300 sec: 5530.3). Total num frames: 848258048. Throughput: 0: 4995.7. Samples: 848250640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:33,561][25689] Avg episode reward: [(0, '-1.667')] [2022-07-10 17:51:35,214][26022] Updated weights on worker 0-0, policy_version 828385 (0.00099) [2022-07-10 17:51:37,011][26022] Updated weights on worker 0-0, policy_version 828395 (0.00086) [2022-07-10 17:51:38,578][25689] Fps is (10 sec: 5411.3, 60 sec: 5536.8, 300 sec: 5524.4). Total num frames: 848284672. Throughput: 0: 5806.5. Samples: 848283896. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:38,579][25689] Avg episode reward: [(0, '-2.393')] [2022-07-10 17:51:38,824][26022] Updated weights on worker 0-0, policy_version 828405 (0.00102) [2022-07-10 17:51:40,603][26022] Updated weights on worker 0-0, policy_version 828415 (0.00086) [2022-07-10 17:51:42,579][26022] Updated weights on worker 0-0, policy_version 828425 (0.00087) [2022-07-10 17:51:43,636][25689] Fps is (10 sec: 5488.1, 60 sec: 5518.3, 300 sec: 5528.6). Total num frames: 848313344. Throughput: 0: 5784.7. Samples: 848316814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:43,638][25689] Avg episode reward: [(0, '-2.065')] [2022-07-10 17:51:44,404][26022] Updated weights on worker 0-0, policy_version 828435 (0.00091) [2022-07-10 17:51:46,284][26022] Updated weights on worker 0-0, policy_version 828445 (0.00087) [2022-07-10 17:51:48,012][26022] Updated weights on worker 0-0, policy_version 828455 (0.00087) [2022-07-10 17:51:48,685][25689] Fps is (10 sec: 5471.0, 60 sec: 5515.6, 300 sec: 5518.4). Total num frames: 848339968. Throughput: 0: 5770.5. Samples: 848350154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:48,695][25689] Avg episode reward: [(0, '-1.952')] [2022-07-10 17:51:49,901][26022] Updated weights on worker 0-0, policy_version 828465 (0.00088) [2022-07-10 17:51:51,973][26022] Updated weights on worker 0-0, policy_version 828475 (0.00090) [2022-07-10 17:51:53,459][26022] Updated weights on worker 0-0, policy_version 828485 (0.00091) [2022-07-10 17:51:53,706][25689] Fps is (10 sec: 5593.2, 60 sec: 5531.7, 300 sec: 5525.0). Total num frames: 848369664. Throughput: 0: 5775.5. Samples: 848366988. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:53,707][25689] Avg episode reward: [(0, '-0.830')] [2022-07-10 17:51:55,522][26022] Updated weights on worker 0-0, policy_version 828495 (0.00085) [2022-07-10 17:51:57,250][26022] Updated weights on worker 0-0, policy_version 828505 (0.00094) [2022-07-10 17:51:58,747][25689] Fps is (10 sec: 5699.4, 60 sec: 5496.9, 300 sec: 5525.0). Total num frames: 848397312. Throughput: 0: 5790.6. Samples: 848400684. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:51:58,748][25689] Avg episode reward: [(0, '-0.274')] [2022-07-10 17:51:59,004][26022] Updated weights on worker 0-0, policy_version 828515 (0.00092) [2022-07-10 17:52:01,012][26022] Updated weights on worker 0-0, policy_version 828525 (0.00083) [2022-07-10 17:52:03,001][26022] Updated weights on worker 0-0, policy_version 828535 (0.01097) [2022-07-10 17:52:03,847][25689] Fps is (10 sec: 5351.8, 60 sec: 5528.7, 300 sec: 5530.3). Total num frames: 848423936. Throughput: 0: 5704.2. Samples: 848432098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:03,847][25689] Avg episode reward: [(0, '-0.381')] [2022-07-10 17:52:05,062][26022] Updated weights on worker 0-0, policy_version 828545 (0.00087) [2022-07-10 17:52:06,619][26022] Updated weights on worker 0-0, policy_version 828555 (0.00090) [2022-07-10 17:52:08,577][26022] Updated weights on worker 0-0, policy_version 828565 (0.00089) [2022-07-10 17:52:08,866][25689] Fps is (10 sec: 5363.4, 60 sec: 5520.5, 300 sec: 5526.7). Total num frames: 848451584. Throughput: 0: 4881.2. Samples: 848448658. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:08,867][25689] Avg episode reward: [(0, '-1.706')] [2022-07-10 17:52:10,488][26022] Updated weights on worker 0-0, policy_version 828575 (0.00092) [2022-07-10 17:52:12,353][26022] Updated weights on worker 0-0, policy_version 828585 (0.00082) [2022-07-10 17:52:13,910][25689] Fps is (10 sec: 5494.6, 60 sec: 5520.0, 300 sec: 5522.6). Total num frames: 848479232. Throughput: 0: 5692.0. Samples: 848481992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:13,911][25689] Avg episode reward: [(0, '-1.643')] [2022-07-10 17:52:14,112][26022] Updated weights on worker 0-0, policy_version 828595 (0.00090) [2022-07-10 17:52:16,107][26022] Updated weights on worker 0-0, policy_version 828605 (0.00086) [2022-07-10 17:52:17,701][26022] Updated weights on worker 0-0, policy_version 828615 (0.00054) [2022-07-10 17:52:18,945][25689] Fps is (10 sec: 5485.8, 60 sec: 5501.5, 300 sec: 5524.2). Total num frames: 848506880. Throughput: 0: 5703.2. Samples: 848515882. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:18,946][25689] Avg episode reward: [(0, '-1.355')] [2022-07-10 17:52:19,686][26022] Updated weights on worker 0-0, policy_version 828625 (0.00091) [2022-07-10 17:52:21,359][26022] Updated weights on worker 0-0, policy_version 828635 (0.00092) [2022-07-10 17:52:23,303][26022] Updated weights on worker 0-0, policy_version 828645 (0.00087) [2022-07-10 17:52:24,050][25689] Fps is (10 sec: 5655.5, 60 sec: 5548.8, 300 sec: 5527.4). Total num frames: 848536576. Throughput: 0: 4973.5. Samples: 848532578. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:24,050][25689] Avg episode reward: [(0, '-1.548')] [2022-07-10 17:52:25,232][26022] Updated weights on worker 0-0, policy_version 828655 (0.00091) [2022-07-10 17:52:27,092][26022] Updated weights on worker 0-0, policy_version 828665 (0.00091) [2022-07-10 17:52:29,021][26022] Updated weights on worker 0-0, policy_version 828675 (0.00090) [2022-07-10 17:52:29,120][25689] Fps is (10 sec: 5535.3, 60 sec: 5494.4, 300 sec: 5526.8). Total num frames: 848563200. Throughput: 0: 5787.7. Samples: 848565884. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:29,121][25689] Avg episode reward: [(0, '-1.395')] [2022-07-10 17:52:30,598][26022] Updated weights on worker 0-0, policy_version 828685 (0.00087) [2022-07-10 17:52:32,527][26022] Updated weights on worker 0-0, policy_version 828695 (0.00086) [2022-07-10 17:52:34,158][25689] Fps is (10 sec: 5571.7, 60 sec: 5525.8, 300 sec: 5526.2). Total num frames: 848592896. Throughput: 0: 5794.9. Samples: 848599326. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:34,159][25689] Avg episode reward: [(0, '0.043')] [2022-07-10 17:52:34,275][26022] Updated weights on worker 0-0, policy_version 828705 (0.00092) [2022-07-10 17:52:36,415][26022] Updated weights on worker 0-0, policy_version 828715 (0.00096) [2022-07-10 17:52:37,993][26022] Updated weights on worker 0-0, policy_version 828725 (0.00091) [2022-07-10 17:52:39,177][25689] Fps is (10 sec: 5600.4, 60 sec: 5525.7, 300 sec: 5524.5). Total num frames: 848619520. Throughput: 0: 4946.9. Samples: 848615964. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:39,177][25689] Avg episode reward: [(0, '0.789')] [2022-07-10 17:52:40,050][26022] Updated weights on worker 0-0, policy_version 828735 (0.00094) [2022-07-10 17:52:41,695][26022] Updated weights on worker 0-0, policy_version 828745 (0.00086) [2022-07-10 17:52:43,859][26022] Updated weights on worker 0-0, policy_version 828755 (0.00094) [2022-07-10 17:52:44,262][25689] Fps is (10 sec: 5472.8, 60 sec: 5523.2, 300 sec: 5530.9). Total num frames: 848648192. Throughput: 0: 5762.5. Samples: 848649050. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:44,262][25689] Avg episode reward: [(0, '-0.653')] [2022-07-10 17:52:45,380][26022] Updated weights on worker 0-0, policy_version 828765 (0.00091) [2022-07-10 17:52:47,584][26022] Updated weights on worker 0-0, policy_version 828775 (0.00090) [2022-07-10 17:52:48,954][26022] Updated weights on worker 0-0, policy_version 828785 (0.00091) [2022-07-10 17:52:49,275][25689] Fps is (10 sec: 5678.7, 60 sec: 5560.3, 300 sec: 5527.4). Total num frames: 848676864. Throughput: 0: 5794.0. Samples: 848682660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:49,275][25689] Avg episode reward: [(0, '-1.425')] [2022-07-10 17:52:51,067][26022] Updated weights on worker 0-0, policy_version 828795 (0.00086) [2022-07-10 17:52:51,770][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:52:51,790][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000828801_848692224.pth [2022-07-10 17:52:51,791][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000826856_846700544.pth [2022-07-10 17:52:52,768][26022] Updated weights on worker 0-0, policy_version 828805 (0.00085) [2022-07-10 17:52:54,287][25689] Fps is (10 sec: 5515.6, 60 sec: 5510.3, 300 sec: 5524.6). Total num frames: 848703488. Throughput: 0: 4980.0. Samples: 848699570. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:54,288][25689] Avg episode reward: [(0, '-1.851')] [2022-07-10 17:52:54,528][26022] Updated weights on worker 0-0, policy_version 828815 (0.00087) [2022-07-10 17:52:56,456][26022] Updated weights on worker 0-0, policy_version 828825 (0.00087) [2022-07-10 17:52:58,312][26022] Updated weights on worker 0-0, policy_version 828835 (0.00089) [2022-07-10 17:52:59,304][25689] Fps is (10 sec: 5513.3, 60 sec: 5529.4, 300 sec: 5532.4). Total num frames: 848732160. Throughput: 0: 5833.8. Samples: 848733384. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:52:59,305][25689] Avg episode reward: [(0, '-2.380')] [2022-07-10 17:53:00,189][26022] Updated weights on worker 0-0, policy_version 828845 (0.00091) [2022-07-10 17:53:02,420][26022] Updated weights on worker 0-0, policy_version 828855 (0.00086) [2022-07-10 17:53:04,157][26022] Updated weights on worker 0-0, policy_version 828865 (0.00094) [2022-07-10 17:53:04,395][25689] Fps is (10 sec: 5470.9, 60 sec: 5530.3, 300 sec: 5527.5). Total num frames: 848758784. Throughput: 0: 5739.3. Samples: 848764598. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:04,395][25689] Avg episode reward: [(0, '-2.543')] [2022-07-10 17:53:06,016][26022] Updated weights on worker 0-0, policy_version 828875 (0.00084) [2022-07-10 17:53:08,071][26022] Updated weights on worker 0-0, policy_version 828885 (0.00085) [2022-07-10 17:53:09,399][25689] Fps is (10 sec: 5376.4, 60 sec: 5531.7, 300 sec: 5524.5). Total num frames: 848786432. Throughput: 0: 4895.2. Samples: 848781172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:09,399][25689] Avg episode reward: [(0, '-2.035')] [2022-07-10 17:53:09,607][26022] Updated weights on worker 0-0, policy_version 828895 (0.00088) [2022-07-10 17:53:11,680][26022] Updated weights on worker 0-0, policy_version 828905 (0.00091) [2022-07-10 17:53:13,476][26022] Updated weights on worker 0-0, policy_version 828915 (0.00087) [2022-07-10 17:53:14,401][25689] Fps is (10 sec: 5423.8, 60 sec: 5518.6, 300 sec: 5528.3). Total num frames: 848813056. Throughput: 0: 5699.0. Samples: 848814196. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:14,401][25689] Avg episode reward: [(0, '-1.099')] [2022-07-10 17:53:15,340][26022] Updated weights on worker 0-0, policy_version 828925 (0.00084) [2022-07-10 17:53:17,087][26022] Updated weights on worker 0-0, policy_version 828935 (0.00094) [2022-07-10 17:53:18,967][26022] Updated weights on worker 0-0, policy_version 828945 (0.00084) [2022-07-10 17:53:19,424][25689] Fps is (10 sec: 5515.5, 60 sec: 5536.6, 300 sec: 5529.4). Total num frames: 848841728. Throughput: 0: 5683.1. Samples: 848847728. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:19,426][25689] Avg episode reward: [(0, '-1.253')] [2022-07-10 17:53:20,744][26022] Updated weights on worker 0-0, policy_version 828955 (0.00085) [2022-07-10 17:53:22,680][26022] Updated weights on worker 0-0, policy_version 828965 (0.00084) [2022-07-10 17:53:24,471][25689] Fps is (10 sec: 5491.3, 60 sec: 5491.1, 300 sec: 5522.0). Total num frames: 848868352. Throughput: 0: 4966.8. Samples: 848864310. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:24,471][25689] Avg episode reward: [(0, '-1.185')] [2022-07-10 17:53:24,660][26022] Updated weights on worker 0-0, policy_version 828975 (0.00088) [2022-07-10 17:53:26,334][26022] Updated weights on worker 0-0, policy_version 828985 (0.00091) [2022-07-10 17:53:28,195][26022] Updated weights on worker 0-0, policy_version 828995 (0.00088) [2022-07-10 17:53:29,474][25689] Fps is (10 sec: 5604.2, 60 sec: 5548.1, 300 sec: 5525.7). Total num frames: 848898048. Throughput: 0: 5800.4. Samples: 848897616. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:29,474][25689] Avg episode reward: [(0, '-0.998')] [2022-07-10 17:53:30,063][26022] Updated weights on worker 0-0, policy_version 829005 (0.00084) [2022-07-10 17:53:31,868][26022] Updated weights on worker 0-0, policy_version 829015 (0.00092) [2022-07-10 17:53:33,751][26022] Updated weights on worker 0-0, policy_version 829025 (0.00088) [2022-07-10 17:53:34,512][25689] Fps is (10 sec: 5710.7, 60 sec: 5514.1, 300 sec: 5525.6). Total num frames: 848925696. Throughput: 0: 5815.5. Samples: 848931152. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:34,513][25689] Avg episode reward: [(0, '-0.909')] [2022-07-10 17:53:35,548][26022] Updated weights on worker 0-0, policy_version 829035 (0.00090) [2022-07-10 17:53:37,370][26022] Updated weights on worker 0-0, policy_version 829045 (0.00094) [2022-07-10 17:53:39,414][26022] Updated weights on worker 0-0, policy_version 829055 (0.00090) [2022-07-10 17:53:39,532][25689] Fps is (10 sec: 5498.0, 60 sec: 5531.0, 300 sec: 5522.6). Total num frames: 848953344. Throughput: 0: 4993.0. Samples: 848948124. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:39,532][25689] Avg episode reward: [(0, '-1.176')] [2022-07-10 17:53:41,206][26022] Updated weights on worker 0-0, policy_version 829065 (0.00093) [2022-07-10 17:53:43,177][26022] Updated weights on worker 0-0, policy_version 829075 (0.00088) [2022-07-10 17:53:44,664][25689] Fps is (10 sec: 5547.6, 60 sec: 5526.7, 300 sec: 5517.9). Total num frames: 848982016. Throughput: 0: 5782.9. Samples: 848981088. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:44,665][25689] Avg episode reward: [(0, '-0.970')] [2022-07-10 17:53:44,810][26022] Updated weights on worker 0-0, policy_version 829085 (0.00086) [2022-07-10 17:53:46,747][26022] Updated weights on worker 0-0, policy_version 829095 (0.00094) [2022-07-10 17:53:48,456][26022] Updated weights on worker 0-0, policy_version 829105 (0.00088) [2022-07-10 17:53:49,701][25689] Fps is (10 sec: 5538.4, 60 sec: 5507.6, 300 sec: 5524.6). Total num frames: 849009664. Throughput: 0: 5791.6. Samples: 849014760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:49,701][25689] Avg episode reward: [(0, '-1.939')] [2022-07-10 17:53:50,436][26022] Updated weights on worker 0-0, policy_version 829115 (0.00089) [2022-07-10 17:53:52,102][26022] Updated weights on worker 0-0, policy_version 829125 (0.00086) [2022-07-10 17:53:54,080][26022] Updated weights on worker 0-0, policy_version 829135 (0.00090) [2022-07-10 17:53:54,764][25689] Fps is (10 sec: 5576.4, 60 sec: 5536.8, 300 sec: 5523.8). Total num frames: 849038336. Throughput: 0: 4940.4. Samples: 849031204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:54,765][25689] Avg episode reward: [(0, '-1.915')] [2022-07-10 17:53:55,983][26022] Updated weights on worker 0-0, policy_version 829145 (0.00091) [2022-07-10 17:53:57,528][26022] Updated weights on worker 0-0, policy_version 829155 (0.00057) [2022-07-10 17:53:59,563][26022] Updated weights on worker 0-0, policy_version 829165 (0.00101) [2022-07-10 17:53:59,783][25689] Fps is (10 sec: 5586.1, 60 sec: 5519.7, 300 sec: 5531.3). Total num frames: 849065984. Throughput: 0: 5762.5. Samples: 849064822. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:53:59,783][25689] Avg episode reward: [(0, '-2.444')] [2022-07-10 17:54:01,459][26022] Updated weights on worker 0-0, policy_version 829175 (0.00078) [2022-07-10 17:54:03,400][26022] Updated weights on worker 0-0, policy_version 829185 (0.00084) [2022-07-10 17:54:04,895][25689] Fps is (10 sec: 5457.9, 60 sec: 5534.6, 300 sec: 5529.3). Total num frames: 849093632. Throughput: 0: 5694.3. Samples: 849096290. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:54:04,897][25689] Avg episode reward: [(0, '-2.436')] [2022-07-10 17:54:05,480][26022] Updated weights on worker 0-0, policy_version 829195 (0.00093) [2022-07-10 17:54:07,192][26022] Updated weights on worker 0-0, policy_version 829205 (0.00090) [2022-07-10 17:54:09,195][26022] Updated weights on worker 0-0, policy_version 829215 (0.00084) [2022-07-10 17:54:09,955][25689] Fps is (10 sec: 5436.1, 60 sec: 5529.5, 300 sec: 5525.7). Total num frames: 849121280. Throughput: 0: 5672.4. Samples: 849129650. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:54:09,955][25689] Avg episode reward: [(0, '-1.893')] [2022-07-10 17:54:11,033][26022] Updated weights on worker 0-0, policy_version 829225 (0.00087) [2022-07-10 17:54:12,580][26022] Updated weights on worker 0-0, policy_version 829235 (0.00089) [2022-07-10 17:54:14,727][26022] Updated weights on worker 0-0, policy_version 829245 (0.00089) [2022-07-10 17:54:15,018][25689] Fps is (10 sec: 5361.3, 60 sec: 5523.9, 300 sec: 5521.2). Total num frames: 849147904. Throughput: 0: 5693.3. Samples: 849146518. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:54:15,019][25689] Avg episode reward: [(0, '-2.540')] [2022-07-10 17:54:16,463][26022] Updated weights on worker 0-0, policy_version 829255 (0.00089) [2022-07-10 17:54:18,443][26022] Updated weights on worker 0-0, policy_version 829265 (0.00086) [2022-07-10 17:54:20,030][25689] Fps is (10 sec: 5488.1, 60 sec: 5525.0, 300 sec: 5526.6). Total num frames: 849176576. Throughput: 0: 5691.4. Samples: 849180060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:54:20,031][25689] Avg episode reward: [(0, '-2.011')] [2022-07-10 17:54:20,113][26022] Updated weights on worker 0-0, policy_version 829275 (0.01120) [2022-07-10 17:54:21,827][26022] Updated weights on worker 0-0, policy_version 829285 (0.00090) [2022-07-10 17:54:23,715][26022] Updated weights on worker 0-0, policy_version 829295 (0.00086) [2022-07-10 17:54:25,098][25689] Fps is (10 sec: 5790.7, 60 sec: 5573.7, 300 sec: 5525.8). Total num frames: 849206272. Throughput: 0: 5822.7. Samples: 849213924. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:54:25,099][25689] Avg episode reward: [(0, '-2.357')] [2022-07-10 17:54:25,525][26022] Updated weights on worker 0-0, policy_version 829305 (0.00086) [2022-07-10 17:54:27,275][26022] Updated weights on worker 0-0, policy_version 829315 (0.00096) [2022-07-10 17:54:29,237][26022] Updated weights on worker 0-0, policy_version 829325 (0.00097) [2022-07-10 17:54:30,149][25689] Fps is (10 sec: 5464.6, 60 sec: 5501.8, 300 sec: 5525.2). Total num frames: 849231872. Throughput: 0: 5007.7. Samples: 849230778. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-10 17:54:30,151][25689] Avg episode reward: [(0, '-2.617')] [2022-07-10 17:54:30,932][26022] Updated weights on worker 0-0, policy_version 829335 (0.00091) [2022-07-10 17:54:32,831][26022] Updated weights on worker 0-0, policy_version 829345 (0.00082) [2022-07-10 17:54:34,715][26022] Updated weights on worker 0-0, policy_version 829355 (0.00086) [2022-07-10 17:54:35,221][25689] Fps is (10 sec: 5462.6, 60 sec: 5532.5, 300 sec: 5524.4). Total num frames: 849261568. Throughput: 0: 5829.4. Samples: 849264284. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:54:35,221][25689] Avg episode reward: [(0, '-2.868')] [2022-07-10 17:54:36,332][26022] Updated weights on worker 0-0, policy_version 829365 (0.00086) [2022-07-10 17:54:38,480][26022] Updated weights on worker 0-0, policy_version 829375 (0.00097) [2022-07-10 17:54:40,269][25689] Fps is (10 sec: 5666.8, 60 sec: 5529.9, 300 sec: 5525.9). Total num frames: 849289216. Throughput: 0: 5815.0. Samples: 849297746. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:54:40,269][25689] Avg episode reward: [(0, '-3.042')] [2022-07-10 17:54:40,456][26022] Updated weights on worker 0-0, policy_version 829385 (0.00085) [2022-07-10 17:54:42,148][26022] Updated weights on worker 0-0, policy_version 829395 (0.00084) [2022-07-10 17:54:44,051][26022] Updated weights on worker 0-0, policy_version 829405 (0.00094) [2022-07-10 17:54:45,396][25689] Fps is (10 sec: 5635.6, 60 sec: 5547.3, 300 sec: 5525.4). Total num frames: 849318912. Throughput: 0: 4946.4. Samples: 849314326. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:54:45,396][25689] Avg episode reward: [(0, '-3.344')] [2022-07-10 17:54:45,678][26022] Updated weights on worker 0-0, policy_version 829415 (0.00089) [2022-07-10 17:54:47,723][26022] Updated weights on worker 0-0, policy_version 829425 (0.00091) [2022-07-10 17:54:49,418][26022] Updated weights on worker 0-0, policy_version 829435 (0.00084) [2022-07-10 17:54:50,406][25689] Fps is (10 sec: 5656.7, 60 sec: 5549.7, 300 sec: 5522.0). Total num frames: 849346560. Throughput: 0: 5801.7. Samples: 849348300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:54:50,406][25689] Avg episode reward: [(0, '-1.819')] [2022-07-10 17:54:51,253][26022] Updated weights on worker 0-0, policy_version 829445 (0.00087) [2022-07-10 17:54:51,963][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:54:51,973][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000829450_849356800.pth [2022-07-10 17:54:51,974][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000827503_847363072.pth [2022-07-10 17:54:53,162][26022] Updated weights on worker 0-0, policy_version 829455 (0.00088) [2022-07-10 17:54:54,868][26022] Updated weights on worker 0-0, policy_version 829465 (0.00093) [2022-07-10 17:54:55,424][25689] Fps is (10 sec: 5616.1, 60 sec: 5553.8, 300 sec: 5532.2). Total num frames: 849375232. Throughput: 0: 5824.2. Samples: 849381954. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:54:55,425][25689] Avg episode reward: [(0, '-1.232')] [2022-07-10 17:54:56,649][26022] Updated weights on worker 0-0, policy_version 829475 (0.00079) [2022-07-10 17:54:58,727][26022] Updated weights on worker 0-0, policy_version 829485 (0.00078) [2022-07-10 17:55:00,298][26022] Updated weights on worker 0-0, policy_version 829495 (0.00088) [2022-07-10 17:55:00,444][25689] Fps is (10 sec: 5610.7, 60 sec: 5553.7, 300 sec: 5532.8). Total num frames: 849402880. Throughput: 0: 5006.6. Samples: 849398756. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:00,444][25689] Avg episode reward: [(0, '-1.094')] [2022-07-10 17:55:02,780][26022] Updated weights on worker 0-0, policy_version 829505 (0.00088) [2022-07-10 17:55:04,238][26022] Updated weights on worker 0-0, policy_version 829515 (0.00083) [2022-07-10 17:55:05,531][25689] Fps is (10 sec: 5370.1, 60 sec: 5539.2, 300 sec: 5531.3). Total num frames: 849429504. Throughput: 0: 5758.1. Samples: 849430264. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:05,531][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 17:55:06,412][26022] Updated weights on worker 0-0, policy_version 829525 (0.00084) [2022-07-10 17:55:07,835][26022] Updated weights on worker 0-0, policy_version 829535 (0.00097) [2022-07-10 17:55:10,170][26022] Updated weights on worker 0-0, policy_version 829545 (0.00086) [2022-07-10 17:55:10,583][25689] Fps is (10 sec: 5352.8, 60 sec: 5539.9, 300 sec: 5531.0). Total num frames: 849457152. Throughput: 0: 5700.4. Samples: 849463316. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:10,583][25689] Avg episode reward: [(0, '-0.584')] [2022-07-10 17:55:11,733][26022] Updated weights on worker 0-0, policy_version 829555 (0.00083) [2022-07-10 17:55:13,786][26022] Updated weights on worker 0-0, policy_version 829565 (0.00085) [2022-07-10 17:55:15,498][26022] Updated weights on worker 0-0, policy_version 829575 (0.00093) [2022-07-10 17:55:15,623][25689] Fps is (10 sec: 5478.9, 60 sec: 5558.9, 300 sec: 5527.2). Total num frames: 849484800. Throughput: 0: 4854.0. Samples: 849480000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:15,624][25689] Avg episode reward: [(0, '0.088')] [2022-07-10 17:55:17,451][26022] Updated weights on worker 0-0, policy_version 829585 (0.00086) [2022-07-10 17:55:19,088][26022] Updated weights on worker 0-0, policy_version 829595 (0.00090) [2022-07-10 17:55:20,629][25689] Fps is (10 sec: 5504.2, 60 sec: 5542.6, 300 sec: 5529.3). Total num frames: 849512448. Throughput: 0: 5685.9. Samples: 849513526. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:20,630][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 17:55:21,171][26022] Updated weights on worker 0-0, policy_version 829605 (0.00094) [2022-07-10 17:55:22,760][26022] Updated weights on worker 0-0, policy_version 829615 (0.00079) [2022-07-10 17:55:24,916][26022] Updated weights on worker 0-0, policy_version 829625 (0.00093) [2022-07-10 17:55:25,730][25689] Fps is (10 sec: 5673.7, 60 sec: 5539.5, 300 sec: 5527.8). Total num frames: 849542144. Throughput: 0: 5773.4. Samples: 849546884. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:25,731][25689] Avg episode reward: [(0, '-2.256')] [2022-07-10 17:55:26,393][26022] Updated weights on worker 0-0, policy_version 829635 (0.00093) [2022-07-10 17:55:28,601][26022] Updated weights on worker 0-0, policy_version 829645 (0.00091) [2022-07-10 17:55:30,140][26022] Updated weights on worker 0-0, policy_version 829655 (0.00093) [2022-07-10 17:55:30,733][25689] Fps is (10 sec: 5574.0, 60 sec: 5560.8, 300 sec: 5528.4). Total num frames: 849568768. Throughput: 0: 4956.8. Samples: 849563196. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:30,733][25689] Avg episode reward: [(0, '-1.562')] [2022-07-10 17:55:32,224][26022] Updated weights on worker 0-0, policy_version 829665 (0.00096) [2022-07-10 17:55:33,951][26022] Updated weights on worker 0-0, policy_version 829675 (0.00088) [2022-07-10 17:55:35,779][25689] Fps is (10 sec: 5401.0, 60 sec: 5529.4, 300 sec: 5531.5). Total num frames: 849596416. Throughput: 0: 5784.4. Samples: 849596586. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:35,779][25689] Avg episode reward: [(0, '-2.210')] [2022-07-10 17:55:35,904][26022] Updated weights on worker 0-0, policy_version 829685 (0.00086) [2022-07-10 17:55:37,773][26022] Updated weights on worker 0-0, policy_version 829695 (0.00089) [2022-07-10 17:55:39,643][26022] Updated weights on worker 0-0, policy_version 829705 (0.00091) [2022-07-10 17:55:40,786][25689] Fps is (10 sec: 5602.5, 60 sec: 5550.0, 300 sec: 5528.7). Total num frames: 849625088. Throughput: 0: 5770.3. Samples: 849629836. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:40,786][25689] Avg episode reward: [(0, '-2.214')] [2022-07-10 17:55:41,381][26022] Updated weights on worker 0-0, policy_version 829715 (0.00089) [2022-07-10 17:55:43,270][26022] Updated weights on worker 0-0, policy_version 829725 (0.00100) [2022-07-10 17:55:45,180][26022] Updated weights on worker 0-0, policy_version 829735 (0.00050) [2022-07-10 17:55:45,896][25689] Fps is (10 sec: 5465.6, 60 sec: 5500.9, 300 sec: 5527.0). Total num frames: 849651712. Throughput: 0: 4941.5. Samples: 849646532. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:45,896][25689] Avg episode reward: [(0, '-1.979')] [2022-07-10 17:55:46,968][26022] Updated weights on worker 0-0, policy_version 829745 (0.00092) [2022-07-10 17:55:48,875][26022] Updated weights on worker 0-0, policy_version 829755 (0.00089) [2022-07-10 17:55:50,564][26022] Updated weights on worker 0-0, policy_version 829765 (0.00104) [2022-07-10 17:55:50,911][25689] Fps is (10 sec: 5461.4, 60 sec: 5517.3, 300 sec: 5526.9). Total num frames: 849680384. Throughput: 0: 5787.7. Samples: 849679978. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:50,913][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 17:55:52,522][26022] Updated weights on worker 0-0, policy_version 829775 (0.00086) [2022-07-10 17:55:54,386][26022] Updated weights on worker 0-0, policy_version 829785 (0.00095) [2022-07-10 17:55:55,916][25689] Fps is (10 sec: 5620.3, 60 sec: 5501.5, 300 sec: 5520.5). Total num frames: 849708032. Throughput: 0: 5789.5. Samples: 849713174. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:55:55,918][25689] Avg episode reward: [(0, '0.031')] [2022-07-10 17:55:56,193][26022] Updated weights on worker 0-0, policy_version 829795 (0.00087) [2022-07-10 17:55:58,063][26022] Updated weights on worker 0-0, policy_version 829805 (0.00086) [2022-07-10 17:56:00,023][26022] Updated weights on worker 0-0, policy_version 829815 (0.00079) [2022-07-10 17:56:00,972][25689] Fps is (10 sec: 5597.9, 60 sec: 5515.2, 300 sec: 5534.7). Total num frames: 849736704. Throughput: 0: 4960.3. Samples: 849729964. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:00,972][25689] Avg episode reward: [(0, '0.733')] [2022-07-10 17:56:01,694][26022] Updated weights on worker 0-0, policy_version 829825 (0.00084) [2022-07-10 17:56:04,006][26022] Updated weights on worker 0-0, policy_version 829835 (0.00079) [2022-07-10 17:56:05,657][26022] Updated weights on worker 0-0, policy_version 829845 (0.00090) [2022-07-10 17:56:06,113][25689] Fps is (10 sec: 5322.5, 60 sec: 5493.3, 300 sec: 5523.8). Total num frames: 849762304. Throughput: 0: 5689.8. Samples: 849761566. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:06,114][25689] Avg episode reward: [(0, '0.761')] [2022-07-10 17:56:07,631][26022] Updated weights on worker 0-0, policy_version 829855 (0.00086) [2022-07-10 17:56:09,620][26022] Updated weights on worker 0-0, policy_version 829865 (0.00093) [2022-07-10 17:56:11,123][25689] Fps is (10 sec: 5346.4, 60 sec: 5514.1, 300 sec: 5527.8). Total num frames: 849790976. Throughput: 0: 5674.0. Samples: 849794662. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:11,123][25689] Avg episode reward: [(0, '-0.526')] [2022-07-10 17:56:11,206][26022] Updated weights on worker 0-0, policy_version 829875 (0.00092) [2022-07-10 17:56:13,224][26022] Updated weights on worker 0-0, policy_version 829885 (0.00089) [2022-07-10 17:56:14,970][26022] Updated weights on worker 0-0, policy_version 829895 (0.00090) [2022-07-10 17:56:16,135][25689] Fps is (10 sec: 5517.7, 60 sec: 5499.8, 300 sec: 5521.1). Total num frames: 849817600. Throughput: 0: 5677.3. Samples: 849827960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:16,135][25689] Avg episode reward: [(0, '-0.685')] [2022-07-10 17:56:16,821][26022] Updated weights on worker 0-0, policy_version 829905 (0.00093) [2022-07-10 17:56:18,525][26022] Updated weights on worker 0-0, policy_version 829915 (0.00084) [2022-07-10 17:56:20,485][26022] Updated weights on worker 0-0, policy_version 829925 (0.00083) [2022-07-10 17:56:21,172][25689] Fps is (10 sec: 5604.1, 60 sec: 5530.7, 300 sec: 5531.9). Total num frames: 849847296. Throughput: 0: 5687.1. Samples: 849844848. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:21,173][25689] Avg episode reward: [(0, '-0.520')] [2022-07-10 17:56:22,239][26022] Updated weights on worker 0-0, policy_version 829935 (0.00081) [2022-07-10 17:56:24,160][26022] Updated weights on worker 0-0, policy_version 829945 (0.00090) [2022-07-10 17:56:25,923][26022] Updated weights on worker 0-0, policy_version 829955 (0.00079) [2022-07-10 17:56:26,264][25689] Fps is (10 sec: 5762.6, 60 sec: 5514.7, 300 sec: 5527.3). Total num frames: 849875968. Throughput: 0: 5799.7. Samples: 849878432. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:26,264][25689] Avg episode reward: [(0, '-0.622')] [2022-07-10 17:56:27,881][26022] Updated weights on worker 0-0, policy_version 829965 (0.00092) [2022-07-10 17:56:29,444][26022] Updated weights on worker 0-0, policy_version 829975 (0.00093) [2022-07-10 17:56:31,274][25689] Fps is (10 sec: 5372.8, 60 sec: 5497.1, 300 sec: 5520.5). Total num frames: 849901568. Throughput: 0: 5819.1. Samples: 849911922. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:31,274][25689] Avg episode reward: [(0, '-0.853')] [2022-07-10 17:56:31,640][26022] Updated weights on worker 0-0, policy_version 829985 (0.00091) [2022-07-10 17:56:33,363][26022] Updated weights on worker 0-0, policy_version 829995 (0.00084) [2022-07-10 17:56:35,255][26022] Updated weights on worker 0-0, policy_version 830005 (0.00096) [2022-07-10 17:56:36,293][25689] Fps is (10 sec: 5411.2, 60 sec: 5516.4, 300 sec: 5527.3). Total num frames: 849930240. Throughput: 0: 5005.9. Samples: 849928872. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:36,294][25689] Avg episode reward: [(0, '-0.107')] [2022-07-10 17:56:36,833][26022] Updated weights on worker 0-0, policy_version 830015 (0.00093) [2022-07-10 17:56:39,099][26022] Updated weights on worker 0-0, policy_version 830025 (0.00081) [2022-07-10 17:56:40,715][26022] Updated weights on worker 0-0, policy_version 830035 (0.00078) [2022-07-10 17:56:41,326][25689] Fps is (10 sec: 5704.6, 60 sec: 5514.1, 300 sec: 5527.8). Total num frames: 849958912. Throughput: 0: 5832.8. Samples: 849962398. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:41,326][25689] Avg episode reward: [(0, '0.368')] [2022-07-10 17:56:42,655][26022] Updated weights on worker 0-0, policy_version 830045 (0.00359) [2022-07-10 17:56:44,399][26022] Updated weights on worker 0-0, policy_version 830055 (0.00094) [2022-07-10 17:56:46,395][25689] Fps is (10 sec: 5474.1, 60 sec: 5517.8, 300 sec: 5527.4). Total num frames: 849985536. Throughput: 0: 5825.1. Samples: 849995698. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:46,395][25689] Avg episode reward: [(0, '0.524')] [2022-07-10 17:56:46,452][26022] Updated weights on worker 0-0, policy_version 830065 (0.00087) [2022-07-10 17:56:48,065][26022] Updated weights on worker 0-0, policy_version 830075 (0.00090) [2022-07-10 17:56:50,047][26022] Updated weights on worker 0-0, policy_version 830085 (0.00086) [2022-07-10 17:56:51,403][25689] Fps is (10 sec: 5690.5, 60 sec: 5552.3, 300 sec: 5531.1). Total num frames: 850016256. Throughput: 0: 4991.3. Samples: 850012394. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:51,404][25689] Avg episode reward: [(0, '0.186')] [2022-07-10 17:56:51,615][26022] Updated weights on worker 0-0, policy_version 830095 (0.00089) [2022-07-10 17:56:52,132][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:56:52,144][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000830098_850020352.pth [2022-07-10 17:56:52,145][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000828152_848027648.pth [2022-07-10 17:56:53,723][26022] Updated weights on worker 0-0, policy_version 830105 (0.00082) [2022-07-10 17:56:55,383][26022] Updated weights on worker 0-0, policy_version 830115 (0.00084) [2022-07-10 17:56:56,413][25689] Fps is (10 sec: 5723.7, 60 sec: 5535.0, 300 sec: 5528.2). Total num frames: 850042880. Throughput: 0: 5821.7. Samples: 850046004. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:56:56,414][25689] Avg episode reward: [(0, '0.679')] [2022-07-10 17:56:57,377][26022] Updated weights on worker 0-0, policy_version 830125 (0.00084) [2022-07-10 17:56:58,972][26022] Updated weights on worker 0-0, policy_version 830135 (0.00090) [2022-07-10 17:57:01,083][26022] Updated weights on worker 0-0, policy_version 830145 (0.00088) [2022-07-10 17:57:01,423][25689] Fps is (10 sec: 5518.5, 60 sec: 5539.1, 300 sec: 5536.8). Total num frames: 850071552. Throughput: 0: 5832.6. Samples: 850079616. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:01,424][25689] Avg episode reward: [(0, '-0.011')] [2022-07-10 17:57:02,951][26022] Updated weights on worker 0-0, policy_version 830155 (0.00088) [2022-07-10 17:57:05,041][26022] Updated weights on worker 0-0, policy_version 830165 (0.00092) [2022-07-10 17:57:06,480][25689] Fps is (10 sec: 5391.1, 60 sec: 5546.9, 300 sec: 5529.2). Total num frames: 850097152. Throughput: 0: 4901.5. Samples: 850094148. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:06,482][25689] Avg episode reward: [(0, '0.360')] [2022-07-10 17:57:06,880][26022] Updated weights on worker 0-0, policy_version 830175 (0.00088) [2022-07-10 17:57:08,597][26022] Updated weights on worker 0-0, policy_version 830185 (0.00082) [2022-07-10 17:57:10,617][26022] Updated weights on worker 0-0, policy_version 830195 (0.00093) [2022-07-10 17:57:11,493][25689] Fps is (10 sec: 5186.3, 60 sec: 5512.7, 300 sec: 5526.4). Total num frames: 850123776. Throughput: 0: 5734.6. Samples: 850127600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:11,493][25689] Avg episode reward: [(0, '0.291')] [2022-07-10 17:57:12,531][26022] Updated weights on worker 0-0, policy_version 830205 (0.00089) [2022-07-10 17:57:14,299][26022] Updated weights on worker 0-0, policy_version 830215 (0.00086) [2022-07-10 17:57:16,130][26022] Updated weights on worker 0-0, policy_version 830225 (0.00095) [2022-07-10 17:57:16,503][25689] Fps is (10 sec: 5517.1, 60 sec: 5546.8, 300 sec: 5530.3). Total num frames: 850152448. Throughput: 0: 5711.1. Samples: 850160738. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:16,503][25689] Avg episode reward: [(0, '0.208')] [2022-07-10 17:57:18,065][26022] Updated weights on worker 0-0, policy_version 830235 (0.00086) [2022-07-10 17:57:19,784][26022] Updated weights on worker 0-0, policy_version 830245 (0.00087) [2022-07-10 17:57:21,539][25689] Fps is (10 sec: 5504.2, 60 sec: 5496.0, 300 sec: 5521.2). Total num frames: 850179072. Throughput: 0: 4864.0. Samples: 850177458. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:21,539][25689] Avg episode reward: [(0, '0.219')] [2022-07-10 17:57:21,637][26022] Updated weights on worker 0-0, policy_version 830255 (0.00094) [2022-07-10 17:57:23,503][26022] Updated weights on worker 0-0, policy_version 830265 (0.00090) [2022-07-10 17:57:25,176][26022] Updated weights on worker 0-0, policy_version 830275 (0.00092) [2022-07-10 17:57:26,600][25689] Fps is (10 sec: 5578.0, 60 sec: 5515.8, 300 sec: 5531.7). Total num frames: 850208768. Throughput: 0: 5796.7. Samples: 850210774. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:26,600][25689] Avg episode reward: [(0, '0.145')] [2022-07-10 17:57:27,168][26022] Updated weights on worker 0-0, policy_version 830285 (0.00096) [2022-07-10 17:57:29,014][26022] Updated weights on worker 0-0, policy_version 830295 (0.00107) [2022-07-10 17:57:30,713][26022] Updated weights on worker 0-0, policy_version 830305 (0.00089) [2022-07-10 17:57:31,695][25689] Fps is (10 sec: 5646.4, 60 sec: 5541.9, 300 sec: 5523.8). Total num frames: 850236416. Throughput: 0: 5761.3. Samples: 850243990. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:31,695][25689] Avg episode reward: [(0, '0.672')] [2022-07-10 17:57:32,811][26022] Updated weights on worker 0-0, policy_version 830315 (0.00097) [2022-07-10 17:57:34,548][26022] Updated weights on worker 0-0, policy_version 830325 (0.00091) [2022-07-10 17:57:36,498][26022] Updated weights on worker 0-0, policy_version 830335 (0.00057) [2022-07-10 17:57:36,731][25689] Fps is (10 sec: 5458.2, 60 sec: 5523.5, 300 sec: 5526.9). Total num frames: 850264064. Throughput: 0: 4922.6. Samples: 850260310. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:36,731][25689] Avg episode reward: [(0, '0.217')] [2022-07-10 17:57:38,296][26022] Updated weights on worker 0-0, policy_version 830345 (0.00094) [2022-07-10 17:57:40,103][26022] Updated weights on worker 0-0, policy_version 830355 (0.00095) [2022-07-10 17:57:41,795][25689] Fps is (10 sec: 5474.6, 60 sec: 5503.6, 300 sec: 5523.8). Total num frames: 850291712. Throughput: 0: 5739.1. Samples: 850293712. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:41,796][25689] Avg episode reward: [(0, '-0.693')] [2022-07-10 17:57:42,108][26022] Updated weights on worker 0-0, policy_version 830365 (0.00083) [2022-07-10 17:57:43,683][26022] Updated weights on worker 0-0, policy_version 830375 (0.00410) [2022-07-10 17:57:45,658][26022] Updated weights on worker 0-0, policy_version 830385 (0.00094) [2022-07-10 17:57:46,861][25689] Fps is (10 sec: 5660.6, 60 sec: 5554.7, 300 sec: 5526.3). Total num frames: 850321408. Throughput: 0: 5750.0. Samples: 850327276. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:46,862][25689] Avg episode reward: [(0, '-0.546')] [2022-07-10 17:57:47,340][26022] Updated weights on worker 0-0, policy_version 830395 (0.00085) [2022-07-10 17:57:49,402][26022] Updated weights on worker 0-0, policy_version 830405 (0.00082) [2022-07-10 17:57:51,103][26022] Updated weights on worker 0-0, policy_version 830415 (0.00090) [2022-07-10 17:57:51,898][25689] Fps is (10 sec: 5574.6, 60 sec: 5484.4, 300 sec: 5525.8). Total num frames: 850348032. Throughput: 0: 4965.2. Samples: 850344304. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:51,899][25689] Avg episode reward: [(0, '-1.015')] [2022-07-10 17:57:52,965][26022] Updated weights on worker 0-0, policy_version 830425 (0.00087) [2022-07-10 17:57:54,758][26022] Updated weights on worker 0-0, policy_version 830435 (0.00083) [2022-07-10 17:57:56,561][26022] Updated weights on worker 0-0, policy_version 830445 (0.00091) [2022-07-10 17:57:56,923][25689] Fps is (10 sec: 5597.3, 60 sec: 5533.8, 300 sec: 5529.1). Total num frames: 850377728. Throughput: 0: 5835.1. Samples: 850378134. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:57:56,923][25689] Avg episode reward: [(0, '-0.855')] [2022-07-10 17:57:58,336][26022] Updated weights on worker 0-0, policy_version 830455 (0.00087) [2022-07-10 17:58:00,120][26022] Updated weights on worker 0-0, policy_version 830465 (0.00096) [2022-07-10 17:58:01,990][25689] Fps is (10 sec: 5479.2, 60 sec: 5477.8, 300 sec: 5526.1). Total num frames: 850403328. Throughput: 0: 5831.1. Samples: 850411470. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:58:01,990][25689] Avg episode reward: [(0, '-0.740')] [2022-07-10 17:58:02,475][26022] Updated weights on worker 0-0, policy_version 830475 (0.00083) [2022-07-10 17:58:04,185][26022] Updated weights on worker 0-0, policy_version 830485 (0.00085) [2022-07-10 17:58:06,175][26022] Updated weights on worker 0-0, policy_version 830495 (0.00081) [2022-07-10 17:58:07,132][25689] Fps is (10 sec: 5316.1, 60 sec: 5520.9, 300 sec: 5527.0). Total num frames: 850432000. Throughput: 0: 4873.9. Samples: 850426074. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:58:07,132][25689] Avg episode reward: [(0, '-0.222')] [2022-07-10 17:58:07,800][26022] Updated weights on worker 0-0, policy_version 830505 (0.00089) [2022-07-10 17:58:09,899][26022] Updated weights on worker 0-0, policy_version 830515 (0.00087) [2022-07-10 17:58:11,532][26022] Updated weights on worker 0-0, policy_version 830525 (0.00093) [2022-07-10 17:58:12,190][25689] Fps is (10 sec: 5420.9, 60 sec: 5516.6, 300 sec: 5525.9). Total num frames: 850458624. Throughput: 0: 5670.4. Samples: 850459370. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2022-07-10 17:58:12,191][25689] Avg episode reward: [(0, '0.570')] [2022-07-10 17:58:13,625][26022] Updated weights on worker 0-0, policy_version 830535 (0.00086) [2022-07-10 17:58:15,527][26022] Updated weights on worker 0-0, policy_version 830545 (0.00076) [2022-07-10 17:58:17,269][25689] Fps is (10 sec: 5454.6, 60 sec: 5510.4, 300 sec: 5524.9). Total num frames: 850487296. Throughput: 0: 5624.9. Samples: 850492580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:17,270][25689] Avg episode reward: [(0, '0.058')] [2022-07-10 17:58:17,311][26022] Updated weights on worker 0-0, policy_version 830555 (0.00093) [2022-07-10 17:58:18,947][26022] Updated weights on worker 0-0, policy_version 830565 (0.00091) [2022-07-10 17:58:21,080][26022] Updated weights on worker 0-0, policy_version 830575 (0.00090) [2022-07-10 17:58:22,313][25689] Fps is (10 sec: 5766.3, 60 sec: 5560.3, 300 sec: 5535.3). Total num frames: 850516992. Throughput: 0: 4808.0. Samples: 850509182. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:22,313][25689] Avg episode reward: [(0, '0.120')] [2022-07-10 17:58:22,637][26022] Updated weights on worker 0-0, policy_version 830585 (0.00620) [2022-07-10 17:58:24,759][26022] Updated weights on worker 0-0, policy_version 830595 (0.00091) [2022-07-10 17:58:26,485][26022] Updated weights on worker 0-0, policy_version 830605 (0.00085) [2022-07-10 17:58:27,359][25689] Fps is (10 sec: 5480.3, 60 sec: 5494.2, 300 sec: 5520.7). Total num frames: 850542592. Throughput: 0: 5761.8. Samples: 850542620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:27,360][25689] Avg episode reward: [(0, '0.261')] [2022-07-10 17:58:28,497][26022] Updated weights on worker 0-0, policy_version 830615 (0.00081) [2022-07-10 17:58:30,046][26022] Updated weights on worker 0-0, policy_version 830625 (0.00093) [2022-07-10 17:58:32,162][26022] Updated weights on worker 0-0, policy_version 830635 (0.00092) [2022-07-10 17:58:32,402][25689] Fps is (10 sec: 5379.1, 60 sec: 5515.8, 300 sec: 5524.1). Total num frames: 850571264. Throughput: 0: 5767.8. Samples: 850575948. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:32,403][25689] Avg episode reward: [(0, '0.251')] [2022-07-10 17:58:33,711][26022] Updated weights on worker 0-0, policy_version 830645 (0.00089) [2022-07-10 17:58:35,777][26022] Updated weights on worker 0-0, policy_version 830655 (0.00075) [2022-07-10 17:58:37,406][25689] Fps is (10 sec: 5708.0, 60 sec: 5535.6, 300 sec: 5527.8). Total num frames: 850599936. Throughput: 0: 5813.4. Samples: 850609640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:37,406][25689] Avg episode reward: [(0, '0.708')] [2022-07-10 17:58:37,432][26022] Updated weights on worker 0-0, policy_version 830665 (0.00100) [2022-07-10 17:58:39,392][26022] Updated weights on worker 0-0, policy_version 830675 (0.00093) [2022-07-10 17:58:41,027][26022] Updated weights on worker 0-0, policy_version 830685 (0.00085) [2022-07-10 17:58:42,424][25689] Fps is (10 sec: 5619.6, 60 sec: 5539.8, 300 sec: 5526.5). Total num frames: 850627584. Throughput: 0: 5831.6. Samples: 850626464. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:42,425][25689] Avg episode reward: [(0, '0.197')] [2022-07-10 17:58:43,137][26022] Updated weights on worker 0-0, policy_version 830695 (0.00082) [2022-07-10 17:58:44,863][26022] Updated weights on worker 0-0, policy_version 830705 (0.00090) [2022-07-10 17:58:46,973][26022] Updated weights on worker 0-0, policy_version 830715 (0.00087) [2022-07-10 17:58:47,481][25689] Fps is (10 sec: 5488.1, 60 sec: 5506.8, 300 sec: 5526.1). Total num frames: 850655232. Throughput: 0: 5797.3. Samples: 850659274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:47,482][25689] Avg episode reward: [(0, '-0.085')] [2022-07-10 17:58:48,570][26022] Updated weights on worker 0-0, policy_version 830725 (0.00099) [2022-07-10 17:58:50,576][26022] Updated weights on worker 0-0, policy_version 830735 (0.00089) [2022-07-10 17:58:52,251][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 17:58:52,265][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000830745_850682880.pth [2022-07-10 17:58:52,265][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000828801_848692224.pth [2022-07-10 17:58:52,269][26022] Updated weights on worker 0-0, policy_version 830745 (0.00087) [2022-07-10 17:58:52,495][25689] Fps is (10 sec: 5592.3, 60 sec: 5542.7, 300 sec: 5527.0). Total num frames: 850683904. Throughput: 0: 5800.5. Samples: 850692498. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:52,496][25689] Avg episode reward: [(0, '-0.168')] [2022-07-10 17:58:54,460][26022] Updated weights on worker 0-0, policy_version 830755 (0.00090) [2022-07-10 17:58:55,944][26022] Updated weights on worker 0-0, policy_version 830765 (0.00092) [2022-07-10 17:58:57,507][25689] Fps is (10 sec: 5515.6, 60 sec: 5493.2, 300 sec: 5523.7). Total num frames: 850710528. Throughput: 0: 4961.9. Samples: 850709380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:58:57,507][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 17:58:57,956][26022] Updated weights on worker 0-0, policy_version 830775 (0.00090) [2022-07-10 17:58:59,658][26022] Updated weights on worker 0-0, policy_version 830785 (0.00093) [2022-07-10 17:59:02,106][26022] Updated weights on worker 0-0, policy_version 830795 (0.00095) [2022-07-10 17:59:02,516][25689] Fps is (10 sec: 5313.9, 60 sec: 5515.4, 300 sec: 5522.2). Total num frames: 850737152. Throughput: 0: 5754.8. Samples: 850742086. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:02,516][25689] Avg episode reward: [(0, '-0.275')] [2022-07-10 17:59:03,636][26022] Updated weights on worker 0-0, policy_version 830805 (0.00091) [2022-07-10 17:59:05,732][26022] Updated weights on worker 0-0, policy_version 830815 (0.00090) [2022-07-10 17:59:07,117][26022] Updated weights on worker 0-0, policy_version 830825 (0.00085) [2022-07-10 17:59:07,619][25689] Fps is (10 sec: 5468.4, 60 sec: 5519.0, 300 sec: 5524.8). Total num frames: 850765824. Throughput: 0: 5717.6. Samples: 850774412. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:07,619][25689] Avg episode reward: [(0, '-0.460')] [2022-07-10 17:59:09,402][26022] Updated weights on worker 0-0, policy_version 830835 (0.00084) [2022-07-10 17:59:10,798][26022] Updated weights on worker 0-0, policy_version 830845 (0.00086) [2022-07-10 17:59:12,674][25689] Fps is (10 sec: 5443.2, 60 sec: 5519.2, 300 sec: 5525.0). Total num frames: 850792448. Throughput: 0: 4897.7. Samples: 850791330. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:12,683][25689] Avg episode reward: [(0, '-0.232')] [2022-07-10 17:59:12,994][26022] Updated weights on worker 0-0, policy_version 830855 (0.00094) [2022-07-10 17:59:14,682][26022] Updated weights on worker 0-0, policy_version 830865 (0.00082) [2022-07-10 17:59:16,520][26022] Updated weights on worker 0-0, policy_version 830875 (0.00091) [2022-07-10 17:59:17,684][25689] Fps is (10 sec: 5697.1, 60 sec: 5559.4, 300 sec: 5531.9). Total num frames: 850823168. Throughput: 0: 5723.7. Samples: 850824870. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:17,685][25689] Avg episode reward: [(0, '0.431')] [2022-07-10 17:59:18,370][26022] Updated weights on worker 0-0, policy_version 830885 (0.00090) [2022-07-10 17:59:20,254][26022] Updated weights on worker 0-0, policy_version 830895 (0.00082) [2022-07-10 17:59:22,108][26022] Updated weights on worker 0-0, policy_version 830905 (0.00092) [2022-07-10 17:59:22,694][25689] Fps is (10 sec: 5722.9, 60 sec: 5511.6, 300 sec: 5522.6). Total num frames: 850849792. Throughput: 0: 5765.6. Samples: 850858428. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:22,695][25689] Avg episode reward: [(0, '0.107')] [2022-07-10 17:59:23,938][26022] Updated weights on worker 0-0, policy_version 830915 (0.00095) [2022-07-10 17:59:25,756][26022] Updated weights on worker 0-0, policy_version 830925 (0.00093) [2022-07-10 17:59:27,731][26022] Updated weights on worker 0-0, policy_version 830935 (0.00085) [2022-07-10 17:59:27,768][25689] Fps is (10 sec: 5382.3, 60 sec: 5543.1, 300 sec: 5529.1). Total num frames: 850877440. Throughput: 0: 5008.9. Samples: 850875336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:27,768][25689] Avg episode reward: [(0, '-0.181')] [2022-07-10 17:59:29,339][26022] Updated weights on worker 0-0, policy_version 830945 (0.00088) [2022-07-10 17:59:31,234][26022] Updated weights on worker 0-0, policy_version 830955 (0.00080) [2022-07-10 17:59:32,807][25689] Fps is (10 sec: 5467.9, 60 sec: 5526.4, 300 sec: 5522.8). Total num frames: 850905088. Throughput: 0: 5829.4. Samples: 850908692. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:32,809][25689] Avg episode reward: [(0, '-0.374')] [2022-07-10 17:59:33,183][26022] Updated weights on worker 0-0, policy_version 830965 (0.00085) [2022-07-10 17:59:34,998][26022] Updated weights on worker 0-0, policy_version 830975 (0.00085) [2022-07-10 17:59:36,675][26022] Updated weights on worker 0-0, policy_version 830985 (0.00088) [2022-07-10 17:59:37,855][25689] Fps is (10 sec: 5583.3, 60 sec: 5522.4, 300 sec: 5526.3). Total num frames: 850933760. Throughput: 0: 5823.8. Samples: 850942340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:37,856][25689] Avg episode reward: [(0, '-1.247')] [2022-07-10 17:59:38,700][26022] Updated weights on worker 0-0, policy_version 830995 (0.00086) [2022-07-10 17:59:40,356][26022] Updated weights on worker 0-0, policy_version 831005 (0.00096) [2022-07-10 17:59:42,332][26022] Updated weights on worker 0-0, policy_version 831015 (0.00093) [2022-07-10 17:59:42,879][25689] Fps is (10 sec: 5592.0, 60 sec: 5521.9, 300 sec: 5521.3). Total num frames: 850961408. Throughput: 0: 4984.5. Samples: 850959036. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:42,880][25689] Avg episode reward: [(0, '-0.586')] [2022-07-10 17:59:44,188][26022] Updated weights on worker 0-0, policy_version 831025 (0.00097) [2022-07-10 17:59:46,232][26022] Updated weights on worker 0-0, policy_version 831035 (0.00085) [2022-07-10 17:59:47,967][25689] Fps is (10 sec: 5468.1, 60 sec: 5519.0, 300 sec: 5519.9). Total num frames: 850989056. Throughput: 0: 5754.6. Samples: 850991576. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:47,968][25689] Avg episode reward: [(0, '-0.243')] [2022-07-10 17:59:48,028][26022] Updated weights on worker 0-0, policy_version 831045 (0.00081) [2022-07-10 17:59:49,759][26022] Updated weights on worker 0-0, policy_version 831055 (0.00094) [2022-07-10 17:59:51,552][26022] Updated weights on worker 0-0, policy_version 831065 (0.00086) [2022-07-10 17:59:53,034][25689] Fps is (10 sec: 5646.9, 60 sec: 5531.2, 300 sec: 5522.4). Total num frames: 851018752. Throughput: 0: 5758.4. Samples: 851025164. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:53,034][25689] Avg episode reward: [(0, '-0.435')] [2022-07-10 17:59:53,610][26022] Updated weights on worker 0-0, policy_version 831075 (0.00089) [2022-07-10 17:59:55,422][26022] Updated weights on worker 0-0, policy_version 831085 (0.00085) [2022-07-10 17:59:57,068][26022] Updated weights on worker 0-0, policy_version 831095 (0.00085) [2022-07-10 17:59:58,047][25689] Fps is (10 sec: 5587.6, 60 sec: 5531.0, 300 sec: 5519.1). Total num frames: 851045376. Throughput: 0: 4921.2. Samples: 851041708. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 17:59:58,047][25689] Avg episode reward: [(0, '-1.322')] [2022-07-10 17:59:59,052][26022] Updated weights on worker 0-0, policy_version 831105 (0.00085) [2022-07-10 18:00:00,882][26022] Updated weights on worker 0-0, policy_version 831115 (0.00083) [2022-07-10 18:00:03,054][25689] Fps is (10 sec: 5211.7, 60 sec: 5514.2, 300 sec: 5517.1). Total num frames: 851070976. Throughput: 0: 5746.6. Samples: 851074976. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:03,059][25689] Avg episode reward: [(0, '-2.108')] [2022-07-10 18:00:03,149][26022] Updated weights on worker 0-0, policy_version 831125 (0.00086) [2022-07-10 18:00:04,816][26022] Updated weights on worker 0-0, policy_version 831135 (0.00085) [2022-07-10 18:00:06,831][26022] Updated weights on worker 0-0, policy_version 831145 (0.00098) [2022-07-10 18:00:08,182][25689] Fps is (10 sec: 5354.6, 60 sec: 5512.0, 300 sec: 5519.2). Total num frames: 851099648. Throughput: 0: 5670.8. Samples: 851106212. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:08,183][25689] Avg episode reward: [(0, '-1.903')] [2022-07-10 18:00:08,606][26022] Updated weights on worker 0-0, policy_version 831155 (0.00088) [2022-07-10 18:00:10,631][26022] Updated weights on worker 0-0, policy_version 831165 (0.00094) [2022-07-10 18:00:12,286][26022] Updated weights on worker 0-0, policy_version 831175 (0.00088) [2022-07-10 18:00:13,205][25689] Fps is (10 sec: 5548.6, 60 sec: 5531.9, 300 sec: 5519.5). Total num frames: 851127296. Throughput: 0: 4835.8. Samples: 851122706. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:13,205][25689] Avg episode reward: [(0, '-2.308')] [2022-07-10 18:00:14,527][26022] Updated weights on worker 0-0, policy_version 831185 (0.00090) [2022-07-10 18:00:16,137][26022] Updated weights on worker 0-0, policy_version 831195 (0.00090) [2022-07-10 18:00:18,111][26022] Updated weights on worker 0-0, policy_version 831205 (0.00088) [2022-07-10 18:00:18,272][25689] Fps is (10 sec: 5480.3, 60 sec: 5475.9, 300 sec: 5518.3). Total num frames: 851154944. Throughput: 0: 5642.0. Samples: 851155820. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:18,273][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 18:00:19,515][26022] Updated weights on worker 0-0, policy_version 831215 (0.00086) [2022-07-10 18:00:21,610][26022] Updated weights on worker 0-0, policy_version 831225 (0.00089) [2022-07-10 18:00:23,290][25689] Fps is (10 sec: 5482.8, 60 sec: 5492.2, 300 sec: 5513.0). Total num frames: 851182592. Throughput: 0: 5652.9. Samples: 851189366. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:23,290][25689] Avg episode reward: [(0, '-2.244')] [2022-07-10 18:00:23,658][26022] Updated weights on worker 0-0, policy_version 831235 (0.00090) [2022-07-10 18:00:25,306][26022] Updated weights on worker 0-0, policy_version 831245 (0.00094) [2022-07-10 18:00:27,223][26022] Updated weights on worker 0-0, policy_version 831255 (0.00085) [2022-07-10 18:00:28,339][25689] Fps is (10 sec: 5493.1, 60 sec: 5494.4, 300 sec: 5515.6). Total num frames: 851210240. Throughput: 0: 4952.7. Samples: 851206042. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:28,339][25689] Avg episode reward: [(0, '-1.821')] [2022-07-10 18:00:29,073][26022] Updated weights on worker 0-0, policy_version 831265 (0.00606) [2022-07-10 18:00:30,791][26022] Updated weights on worker 0-0, policy_version 831275 (0.00097) [2022-07-10 18:00:32,823][26022] Updated weights on worker 0-0, policy_version 831285 (0.00090) [2022-07-10 18:00:33,367][25689] Fps is (10 sec: 5589.0, 60 sec: 5512.3, 300 sec: 5519.4). Total num frames: 851238912. Throughput: 0: 5778.8. Samples: 851239220. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:33,367][25689] Avg episode reward: [(0, '-1.057')] [2022-07-10 18:00:34,497][26022] Updated weights on worker 0-0, policy_version 831295 (0.00081) [2022-07-10 18:00:36,451][26022] Updated weights on worker 0-0, policy_version 831305 (0.00085) [2022-07-10 18:00:38,260][26022] Updated weights on worker 0-0, policy_version 831315 (0.00090) [2022-07-10 18:00:38,389][25689] Fps is (10 sec: 5603.6, 60 sec: 5497.7, 300 sec: 5515.6). Total num frames: 851266560. Throughput: 0: 5793.6. Samples: 851272372. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:38,390][25689] Avg episode reward: [(0, '-0.375')] [2022-07-10 18:00:40,244][26022] Updated weights on worker 0-0, policy_version 831325 (0.00094) [2022-07-10 18:00:42,089][26022] Updated weights on worker 0-0, policy_version 831335 (0.00098) [2022-07-10 18:00:43,411][25689] Fps is (10 sec: 5505.2, 60 sec: 5497.9, 300 sec: 5520.7). Total num frames: 851294208. Throughput: 0: 4947.0. Samples: 851288906. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:43,412][25689] Avg episode reward: [(0, '-1.523')] [2022-07-10 18:00:43,950][26022] Updated weights on worker 0-0, policy_version 831345 (0.00094) [2022-07-10 18:00:45,621][26022] Updated weights on worker 0-0, policy_version 831355 (0.00090) [2022-07-10 18:00:47,745][26022] Updated weights on worker 0-0, policy_version 831365 (0.00088) [2022-07-10 18:00:48,495][25689] Fps is (10 sec: 5573.2, 60 sec: 5515.2, 300 sec: 5519.4). Total num frames: 851322880. Throughput: 0: 5764.3. Samples: 851322228. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:48,495][25689] Avg episode reward: [(0, '-1.122')] [2022-07-10 18:00:49,383][26022] Updated weights on worker 0-0, policy_version 831375 (0.00090) [2022-07-10 18:00:51,232][26022] Updated weights on worker 0-0, policy_version 831385 (0.00086) [2022-07-10 18:00:52,269][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:00:52,285][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000831391_851344384.pth [2022-07-10 18:00:52,285][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000829450_849356800.pth [2022-07-10 18:00:53,141][26022] Updated weights on worker 0-0, policy_version 831395 (0.00089) [2022-07-10 18:00:53,543][25689] Fps is (10 sec: 5558.7, 60 sec: 5483.1, 300 sec: 5518.7). Total num frames: 851350528. Throughput: 0: 5764.3. Samples: 851355522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:53,543][25689] Avg episode reward: [(0, '-1.386')] [2022-07-10 18:00:54,823][26022] Updated weights on worker 0-0, policy_version 831405 (0.00088) [2022-07-10 18:00:56,743][26022] Updated weights on worker 0-0, policy_version 831415 (0.00085) [2022-07-10 18:00:58,373][26022] Updated weights on worker 0-0, policy_version 831425 (0.00083) [2022-07-10 18:00:58,592][25689] Fps is (10 sec: 5577.9, 60 sec: 5513.6, 300 sec: 5518.8). Total num frames: 851379200. Throughput: 0: 4947.1. Samples: 851372316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:00:58,592][25689] Avg episode reward: [(0, '-1.007')] [2022-07-10 18:01:00,581][26022] Updated weights on worker 0-0, policy_version 831435 (0.00085) [2022-07-10 18:01:02,678][26022] Updated weights on worker 0-0, policy_version 831445 (0.00089) [2022-07-10 18:01:03,596][25689] Fps is (10 sec: 5398.4, 60 sec: 5513.9, 300 sec: 5521.3). Total num frames: 851404800. Throughput: 0: 5774.1. Samples: 851405458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:03,597][25689] Avg episode reward: [(0, '-0.558')] [2022-07-10 18:01:04,768][26022] Updated weights on worker 0-0, policy_version 831455 (0.00092) [2022-07-10 18:01:06,306][26022] Updated weights on worker 0-0, policy_version 831465 (0.00086) [2022-07-10 18:01:08,446][26022] Updated weights on worker 0-0, policy_version 831475 (0.00099) [2022-07-10 18:01:08,728][25689] Fps is (10 sec: 5152.4, 60 sec: 5479.8, 300 sec: 5512.2). Total num frames: 851431424. Throughput: 0: 5688.1. Samples: 851437314. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:08,728][25689] Avg episode reward: [(0, '-1.574')] [2022-07-10 18:01:09,971][26022] Updated weights on worker 0-0, policy_version 831485 (0.00084) [2022-07-10 18:01:11,996][26022] Updated weights on worker 0-0, policy_version 831495 (0.00102) [2022-07-10 18:01:13,644][26022] Updated weights on worker 0-0, policy_version 831505 (0.00085) [2022-07-10 18:01:13,736][25689] Fps is (10 sec: 5554.3, 60 sec: 5514.9, 300 sec: 5522.6). Total num frames: 851461120. Throughput: 0: 4880.2. Samples: 851454068. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:13,737][25689] Avg episode reward: [(0, '-1.766')] [2022-07-10 18:01:15,657][26022] Updated weights on worker 0-0, policy_version 831515 (0.00095) [2022-07-10 18:01:17,447][26022] Updated weights on worker 0-0, policy_version 831525 (0.00097) [2022-07-10 18:01:18,754][25689] Fps is (10 sec: 5719.5, 60 sec: 5519.4, 300 sec: 5516.0). Total num frames: 851488768. Throughput: 0: 5714.6. Samples: 851487534. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:18,754][25689] Avg episode reward: [(0, '-1.772')] [2022-07-10 18:01:19,311][26022] Updated weights on worker 0-0, policy_version 831535 (0.00095) [2022-07-10 18:01:21,019][26022] Updated weights on worker 0-0, policy_version 831545 (0.00094) [2022-07-10 18:01:22,851][26022] Updated weights on worker 0-0, policy_version 831555 (0.00081) [2022-07-10 18:01:23,781][25689] Fps is (10 sec: 5504.7, 60 sec: 5518.5, 300 sec: 5513.8). Total num frames: 851516416. Throughput: 0: 5728.5. Samples: 851521088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:23,782][25689] Avg episode reward: [(0, '-0.932')] [2022-07-10 18:01:24,798][26022] Updated weights on worker 0-0, policy_version 831565 (0.00092) [2022-07-10 18:01:26,523][26022] Updated weights on worker 0-0, policy_version 831575 (0.00091) [2022-07-10 18:01:28,440][26022] Updated weights on worker 0-0, policy_version 831585 (0.00094) [2022-07-10 18:01:28,911][25689] Fps is (10 sec: 5343.1, 60 sec: 5494.2, 300 sec: 5515.0). Total num frames: 851543040. Throughput: 0: 5801.2. Samples: 851554402. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:28,912][25689] Avg episode reward: [(0, '-1.420')] [2022-07-10 18:01:30,178][26022] Updated weights on worker 0-0, policy_version 831595 (0.00093) [2022-07-10 18:01:32,228][26022] Updated weights on worker 0-0, policy_version 831605 (0.00095) [2022-07-10 18:01:33,723][26022] Updated weights on worker 0-0, policy_version 831615 (0.00085) [2022-07-10 18:01:33,931][25689] Fps is (10 sec: 5649.6, 60 sec: 5528.8, 300 sec: 5521.9). Total num frames: 851573760. Throughput: 0: 5790.1. Samples: 851571002. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:33,932][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 18:01:35,913][26022] Updated weights on worker 0-0, policy_version 831625 (0.00096) [2022-07-10 18:01:37,450][26022] Updated weights on worker 0-0, policy_version 831635 (0.00084) [2022-07-10 18:01:38,980][25689] Fps is (10 sec: 5695.0, 60 sec: 5509.5, 300 sec: 5514.7). Total num frames: 851600384. Throughput: 0: 5799.6. Samples: 851604840. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:38,981][25689] Avg episode reward: [(0, '0.321')] [2022-07-10 18:01:39,292][26022] Updated weights on worker 0-0, policy_version 831645 (0.00096) [2022-07-10 18:01:41,044][26022] Updated weights on worker 0-0, policy_version 831655 (0.00083) [2022-07-10 18:01:43,084][26022] Updated weights on worker 0-0, policy_version 831665 (0.00097) [2022-07-10 18:01:43,986][25689] Fps is (10 sec: 5601.2, 60 sec: 5544.7, 300 sec: 5526.2). Total num frames: 851630080. Throughput: 0: 5812.9. Samples: 851638538. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:43,987][25689] Avg episode reward: [(0, '1.071')] [2022-07-10 18:01:44,744][26022] Updated weights on worker 0-0, policy_version 831675 (0.00090) [2022-07-10 18:01:46,702][26022] Updated weights on worker 0-0, policy_version 831685 (0.00086) [2022-07-10 18:01:48,555][26022] Updated weights on worker 0-0, policy_version 831695 (0.00088) [2022-07-10 18:01:49,072][25689] Fps is (10 sec: 5580.7, 60 sec: 5510.7, 300 sec: 5511.0). Total num frames: 851656704. Throughput: 0: 4999.5. Samples: 851655198. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-10 18:01:49,073][25689] Avg episode reward: [(0, '0.999')] [2022-07-10 18:01:50,492][26022] Updated weights on worker 0-0, policy_version 831705 (0.00088) [2022-07-10 18:01:52,215][26022] Updated weights on worker 0-0, policy_version 831715 (0.00095) [2022-07-10 18:01:53,977][26022] Updated weights on worker 0-0, policy_version 831725 (0.00053) [2022-07-10 18:01:54,076][25689] Fps is (10 sec: 5581.8, 60 sec: 5548.5, 300 sec: 5521.4). Total num frames: 851686400. Throughput: 0: 5841.5. Samples: 851688678. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:01:54,077][25689] Avg episode reward: [(0, '0.723')] [2022-07-10 18:01:55,893][26022] Updated weights on worker 0-0, policy_version 831735 (0.00089) [2022-07-10 18:01:57,659][26022] Updated weights on worker 0-0, policy_version 831745 (0.00087) [2022-07-10 18:01:59,166][25689] Fps is (10 sec: 5579.5, 60 sec: 5511.0, 300 sec: 5513.0). Total num frames: 851713024. Throughput: 0: 5832.7. Samples: 851722580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:01:59,168][25689] Avg episode reward: [(0, '1.293')] [2022-07-10 18:01:59,619][26022] Updated weights on worker 0-0, policy_version 831755 (0.00085) [2022-07-10 18:02:01,202][26022] Updated weights on worker 0-0, policy_version 831765 (0.00087) [2022-07-10 18:02:03,571][26022] Updated weights on worker 0-0, policy_version 831775 (0.00094) [2022-07-10 18:02:04,200][25689] Fps is (10 sec: 5260.0, 60 sec: 5525.2, 300 sec: 5516.9). Total num frames: 851739648. Throughput: 0: 4917.7. Samples: 851737940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:04,201][25689] Avg episode reward: [(0, '1.336')] [2022-07-10 18:02:05,287][26022] Updated weights on worker 0-0, policy_version 831785 (0.00094) [2022-07-10 18:02:07,333][26022] Updated weights on worker 0-0, policy_version 831795 (0.00087) [2022-07-10 18:02:08,973][26022] Updated weights on worker 0-0, policy_version 831805 (0.00088) [2022-07-10 18:02:09,283][25689] Fps is (10 sec: 5567.2, 60 sec: 5580.4, 300 sec: 5525.9). Total num frames: 851769344. Throughput: 0: 5733.1. Samples: 851771066. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:09,284][25689] Avg episode reward: [(0, '1.158')] [2022-07-10 18:02:10,845][26022] Updated weights on worker 0-0, policy_version 831815 (0.00084) [2022-07-10 18:02:12,690][26022] Updated weights on worker 0-0, policy_version 831825 (0.00084) [2022-07-10 18:02:14,299][25689] Fps is (10 sec: 5678.1, 60 sec: 5545.9, 300 sec: 5522.4). Total num frames: 851796992. Throughput: 0: 5754.3. Samples: 851805044. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:14,299][25689] Avg episode reward: [(0, '0.229')] [2022-07-10 18:02:14,651][26022] Updated weights on worker 0-0, policy_version 831835 (0.00094) [2022-07-10 18:02:16,092][26022] Updated weights on worker 0-0, policy_version 831845 (0.00092) [2022-07-10 18:02:18,046][26022] Updated weights on worker 0-0, policy_version 831855 (0.00083) [2022-07-10 18:02:19,309][25689] Fps is (10 sec: 5617.5, 60 sec: 5563.5, 300 sec: 5529.7). Total num frames: 851825664. Throughput: 0: 4934.4. Samples: 851821970. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:19,310][25689] Avg episode reward: [(0, '0.392')] [2022-07-10 18:02:19,988][26022] Updated weights on worker 0-0, policy_version 831865 (0.00053) [2022-07-10 18:02:21,791][26022] Updated weights on worker 0-0, policy_version 831875 (0.00085) [2022-07-10 18:02:23,607][26022] Updated weights on worker 0-0, policy_version 831885 (0.00091) [2022-07-10 18:02:24,323][25689] Fps is (10 sec: 5516.0, 60 sec: 5547.7, 300 sec: 5520.3). Total num frames: 851852288. Throughput: 0: 5832.9. Samples: 851855320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:24,324][25689] Avg episode reward: [(0, '0.814')] [2022-07-10 18:02:25,291][26022] Updated weights on worker 0-0, policy_version 831895 (0.00084) [2022-07-10 18:02:27,492][26022] Updated weights on worker 0-0, policy_version 831905 (0.00093) [2022-07-10 18:02:29,328][26022] Updated weights on worker 0-0, policy_version 831915 (0.00092) [2022-07-10 18:02:29,448][25689] Fps is (10 sec: 5453.3, 60 sec: 5582.0, 300 sec: 5523.2). Total num frames: 851880960. Throughput: 0: 5828.2. Samples: 851888596. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:29,449][25689] Avg episode reward: [(0, '0.092')] [2022-07-10 18:02:31,076][26022] Updated weights on worker 0-0, policy_version 831925 (0.00084) [2022-07-10 18:02:32,931][26022] Updated weights on worker 0-0, policy_version 831935 (0.00096) [2022-07-10 18:02:34,459][25689] Fps is (10 sec: 5556.6, 60 sec: 5532.1, 300 sec: 5523.6). Total num frames: 851908608. Throughput: 0: 4972.1. Samples: 851905284. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:34,460][25689] Avg episode reward: [(0, '-0.247')] [2022-07-10 18:02:34,917][26022] Updated weights on worker 0-0, policy_version 831945 (0.00086) [2022-07-10 18:02:36,385][26022] Updated weights on worker 0-0, policy_version 831955 (0.00092) [2022-07-10 18:02:38,550][26022] Updated weights on worker 0-0, policy_version 831965 (0.00088) [2022-07-10 18:02:39,478][25689] Fps is (10 sec: 5819.4, 60 sec: 5602.6, 300 sec: 5534.8). Total num frames: 851939328. Throughput: 0: 5794.7. Samples: 851938848. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:39,479][25689] Avg episode reward: [(0, '0.259')] [2022-07-10 18:02:40,204][26022] Updated weights on worker 0-0, policy_version 831975 (0.00092) [2022-07-10 18:02:42,111][26022] Updated weights on worker 0-0, policy_version 831985 (0.00086) [2022-07-10 18:02:43,999][26022] Updated weights on worker 0-0, policy_version 831995 (0.00086) [2022-07-10 18:02:44,507][25689] Fps is (10 sec: 5605.3, 60 sec: 5532.8, 300 sec: 5521.7). Total num frames: 851964928. Throughput: 0: 5799.2. Samples: 851972368. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:44,507][25689] Avg episode reward: [(0, '-0.146')] [2022-07-10 18:02:45,648][26022] Updated weights on worker 0-0, policy_version 832005 (0.00084) [2022-07-10 18:02:47,739][26022] Updated weights on worker 0-0, policy_version 832015 (0.00089) [2022-07-10 18:02:49,553][25689] Fps is (10 sec: 5386.9, 60 sec: 5570.3, 300 sec: 5528.4). Total num frames: 851993600. Throughput: 0: 4993.5. Samples: 851988988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:49,554][25689] Avg episode reward: [(0, '-0.607')] [2022-07-10 18:02:49,559][26022] Updated weights on worker 0-0, policy_version 832025 (0.00085) [2022-07-10 18:02:51,374][26022] Updated weights on worker 0-0, policy_version 832035 (0.00089) [2022-07-10 18:02:52,486][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:02:52,498][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000832041_852009984.pth [2022-07-10 18:02:52,498][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000830098_850020352.pth [2022-07-10 18:02:53,212][26022] Updated weights on worker 0-0, policy_version 832045 (0.00861) [2022-07-10 18:02:54,613][25689] Fps is (10 sec: 5775.3, 60 sec: 5565.1, 300 sec: 5527.8). Total num frames: 852023296. Throughput: 0: 5816.8. Samples: 852022516. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:54,614][25689] Avg episode reward: [(0, '-0.919')] [2022-07-10 18:02:55,083][26022] Updated weights on worker 0-0, policy_version 832055 (0.00084) [2022-07-10 18:02:56,782][26022] Updated weights on worker 0-0, policy_version 832065 (0.00084) [2022-07-10 18:02:58,430][26022] Updated weights on worker 0-0, policy_version 832075 (0.00087) [2022-07-10 18:02:59,630][25689] Fps is (10 sec: 5690.7, 60 sec: 5588.8, 300 sec: 5535.6). Total num frames: 852050944. Throughput: 0: 5832.2. Samples: 852056376. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:02:59,631][25689] Avg episode reward: [(0, '-0.429')] [2022-07-10 18:03:00,341][26022] Updated weights on worker 0-0, policy_version 832085 (0.00085) [2022-07-10 18:03:02,695][26022] Updated weights on worker 0-0, policy_version 832095 (0.00088) [2022-07-10 18:03:04,520][26022] Updated weights on worker 0-0, policy_version 832105 (0.00089) [2022-07-10 18:03:04,641][25689] Fps is (10 sec: 5208.1, 60 sec: 5557.0, 300 sec: 5524.3). Total num frames: 852075520. Throughput: 0: 4896.8. Samples: 852070960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:04,641][25689] Avg episode reward: [(0, '-0.618')] [2022-07-10 18:03:06,136][26022] Updated weights on worker 0-0, policy_version 832115 (0.00097) [2022-07-10 18:03:08,302][26022] Updated weights on worker 0-0, policy_version 832125 (0.00087) [2022-07-10 18:03:09,725][25689] Fps is (10 sec: 5376.2, 60 sec: 5556.9, 300 sec: 5534.1). Total num frames: 852105216. Throughput: 0: 5720.9. Samples: 852104388. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:09,725][25689] Avg episode reward: [(0, '-3.081')] [2022-07-10 18:03:09,911][26022] Updated weights on worker 0-0, policy_version 832135 (0.00083) [2022-07-10 18:03:11,886][26022] Updated weights on worker 0-0, policy_version 832145 (0.00090) [2022-07-10 18:03:13,668][26022] Updated weights on worker 0-0, policy_version 832155 (0.00089) [2022-07-10 18:03:14,747][25689] Fps is (10 sec: 5572.6, 60 sec: 5539.4, 300 sec: 5528.3). Total num frames: 852131840. Throughput: 0: 5737.6. Samples: 852138036. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:14,749][25689] Avg episode reward: [(0, '-2.020')] [2022-07-10 18:03:15,486][26022] Updated weights on worker 0-0, policy_version 832165 (0.00086) [2022-07-10 18:03:17,290][26022] Updated weights on worker 0-0, policy_version 832175 (0.00051) [2022-07-10 18:03:19,060][26022] Updated weights on worker 0-0, policy_version 832185 (0.00098) [2022-07-10 18:03:19,809][25689] Fps is (10 sec: 5483.5, 60 sec: 5534.7, 300 sec: 5524.5). Total num frames: 852160512. Throughput: 0: 4879.4. Samples: 852154836. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:19,809][25689] Avg episode reward: [(0, '-3.582')] [2022-07-10 18:03:20,785][26022] Updated weights on worker 0-0, policy_version 832195 (0.00092) [2022-07-10 18:03:22,978][26022] Updated weights on worker 0-0, policy_version 832205 (0.00092) [2022-07-10 18:03:24,455][26022] Updated weights on worker 0-0, policy_version 832215 (0.00089) [2022-07-10 18:03:24,814][25689] Fps is (10 sec: 5696.4, 60 sec: 5569.4, 300 sec: 5535.6). Total num frames: 852189184. Throughput: 0: 5825.2. Samples: 852188474. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:24,814][25689] Avg episode reward: [(0, '-2.871')] [2022-07-10 18:03:26,709][26022] Updated weights on worker 0-0, policy_version 832225 (0.00087) [2022-07-10 18:03:28,171][26022] Updated weights on worker 0-0, policy_version 832235 (0.00084) [2022-07-10 18:03:29,907][25689] Fps is (10 sec: 5374.5, 60 sec: 5521.5, 300 sec: 5524.4). Total num frames: 852214784. Throughput: 0: 5821.4. Samples: 852221876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:29,907][25689] Avg episode reward: [(0, '-2.626')] [2022-07-10 18:03:30,202][26022] Updated weights on worker 0-0, policy_version 832245 (0.00091) [2022-07-10 18:03:32,083][26022] Updated weights on worker 0-0, policy_version 832255 (0.00094) [2022-07-10 18:03:33,910][26022] Updated weights on worker 0-0, policy_version 832265 (0.00085) [2022-07-10 18:03:34,931][25689] Fps is (10 sec: 5567.0, 60 sec: 5571.1, 300 sec: 5530.9). Total num frames: 852245504. Throughput: 0: 4988.9. Samples: 852238728. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:34,931][25689] Avg episode reward: [(0, '-2.241')] [2022-07-10 18:03:35,556][26022] Updated weights on worker 0-0, policy_version 832275 (0.00228) [2022-07-10 18:03:37,550][26022] Updated weights on worker 0-0, policy_version 832285 (0.00087) [2022-07-10 18:03:39,233][26022] Updated weights on worker 0-0, policy_version 832295 (0.00086) [2022-07-10 18:03:39,934][25689] Fps is (10 sec: 5821.0, 60 sec: 5521.8, 300 sec: 5531.1). Total num frames: 852273152. Throughput: 0: 5842.3. Samples: 852272414. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:39,935][25689] Avg episode reward: [(0, '-1.463')] [2022-07-10 18:03:41,275][26022] Updated weights on worker 0-0, policy_version 832305 (0.00084) [2022-07-10 18:03:42,971][26022] Updated weights on worker 0-0, policy_version 832315 (0.00080) [2022-07-10 18:03:44,965][25689] Fps is (10 sec: 5408.6, 60 sec: 5538.5, 300 sec: 5528.2). Total num frames: 852299776. Throughput: 0: 5836.5. Samples: 852306086. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:44,966][25689] Avg episode reward: [(0, '-1.333')] [2022-07-10 18:03:45,003][26022] Updated weights on worker 0-0, policy_version 832325 (0.00087) [2022-07-10 18:03:46,483][26022] Updated weights on worker 0-0, policy_version 832335 (0.00086) [2022-07-10 18:03:48,672][26022] Updated weights on worker 0-0, policy_version 832345 (0.00083) [2022-07-10 18:03:50,021][25689] Fps is (10 sec: 5684.9, 60 sec: 5571.5, 300 sec: 5534.3). Total num frames: 852330496. Throughput: 0: 5025.8. Samples: 852322966. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:50,022][25689] Avg episode reward: [(0, '-1.256')] [2022-07-10 18:03:50,194][26022] Updated weights on worker 0-0, policy_version 832355 (0.00089) [2022-07-10 18:03:52,392][26022] Updated weights on worker 0-0, policy_version 832365 (0.00091) [2022-07-10 18:03:54,001][26022] Updated weights on worker 0-0, policy_version 832375 (0.00092) [2022-07-10 18:03:55,032][25689] Fps is (10 sec: 5493.1, 60 sec: 5491.3, 300 sec: 5527.4). Total num frames: 852355072. Throughput: 0: 5824.9. Samples: 852355814. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:03:55,032][25689] Avg episode reward: [(0, '-2.181')] [2022-07-10 18:03:55,968][26022] Updated weights on worker 0-0, policy_version 832385 (0.00097) [2022-07-10 18:03:57,955][26022] Updated weights on worker 0-0, policy_version 832395 (0.00087) [2022-07-10 18:03:59,601][26022] Updated weights on worker 0-0, policy_version 832405 (0.00099) [2022-07-10 18:04:00,071][25689] Fps is (10 sec: 5298.4, 60 sec: 5506.2, 300 sec: 5533.7). Total num frames: 852383744. Throughput: 0: 5798.3. Samples: 852389174. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:00,071][25689] Avg episode reward: [(0, '-2.881')] [2022-07-10 18:04:01,538][26022] Updated weights on worker 0-0, policy_version 832415 (0.00086) [2022-07-10 18:04:03,644][26022] Updated weights on worker 0-0, policy_version 832425 (0.00092) [2022-07-10 18:04:05,135][25689] Fps is (10 sec: 5472.7, 60 sec: 5535.2, 300 sec: 5527.6). Total num frames: 852410368. Throughput: 0: 4932.8. Samples: 852405582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:05,136][25689] Avg episode reward: [(0, '-1.935')] [2022-07-10 18:04:05,511][26022] Updated weights on worker 0-0, policy_version 832435 (0.00092) [2022-07-10 18:04:07,555][26022] Updated weights on worker 0-0, policy_version 832445 (0.00086) [2022-07-10 18:04:09,131][26022] Updated weights on worker 0-0, policy_version 832455 (0.00085) [2022-07-10 18:04:10,226][25689] Fps is (10 sec: 5545.9, 60 sec: 5534.6, 300 sec: 5537.2). Total num frames: 852440064. Throughput: 0: 5657.8. Samples: 852437282. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:10,227][25689] Avg episode reward: [(0, '-1.887')] [2022-07-10 18:04:10,955][26022] Updated weights on worker 0-0, policy_version 832465 (0.00084) [2022-07-10 18:04:12,999][26022] Updated weights on worker 0-0, policy_version 832475 (0.00086) [2022-07-10 18:04:14,818][26022] Updated weights on worker 0-0, policy_version 832485 (0.00080) [2022-07-10 18:04:15,239][25689] Fps is (10 sec: 5675.7, 60 sec: 5552.4, 300 sec: 5526.9). Total num frames: 852467712. Throughput: 0: 5709.5. Samples: 852471188. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:15,239][25689] Avg episode reward: [(0, '-2.344')] [2022-07-10 18:04:16,566][26022] Updated weights on worker 0-0, policy_version 832495 (0.00088) [2022-07-10 18:04:18,365][26022] Updated weights on worker 0-0, policy_version 832505 (0.00086) [2022-07-10 18:04:20,188][26022] Updated weights on worker 0-0, policy_version 832515 (0.00096) [2022-07-10 18:04:20,271][25689] Fps is (10 sec: 5606.8, 60 sec: 5555.1, 300 sec: 5533.3). Total num frames: 852496384. Throughput: 0: 5724.6. Samples: 852504812. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:20,273][25689] Avg episode reward: [(0, '-1.642')] [2022-07-10 18:04:22,075][26022] Updated weights on worker 0-0, policy_version 832525 (0.00088) [2022-07-10 18:04:23,723][26022] Updated weights on worker 0-0, policy_version 832535 (0.00090) [2022-07-10 18:04:25,298][25689] Fps is (10 sec: 5497.2, 60 sec: 5519.2, 300 sec: 5530.8). Total num frames: 852523008. Throughput: 0: 5748.5. Samples: 852521486. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:25,299][25689] Avg episode reward: [(0, '-0.994')] [2022-07-10 18:04:25,663][26022] Updated weights on worker 0-0, policy_version 832545 (0.00093) [2022-07-10 18:04:27,525][26022] Updated weights on worker 0-0, policy_version 832555 (0.00087) [2022-07-10 18:04:29,412][26022] Updated weights on worker 0-0, policy_version 832565 (0.00087) [2022-07-10 18:04:30,350][25689] Fps is (10 sec: 5587.9, 60 sec: 5590.7, 300 sec: 5537.4). Total num frames: 852552704. Throughput: 0: 5855.5. Samples: 852555118. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:30,351][25689] Avg episode reward: [(0, '-0.148')] [2022-07-10 18:04:31,032][26022] Updated weights on worker 0-0, policy_version 832575 (0.00086) [2022-07-10 18:04:33,179][26022] Updated weights on worker 0-0, policy_version 832585 (0.00093) [2022-07-10 18:04:34,747][26022] Updated weights on worker 0-0, policy_version 832595 (0.00078) [2022-07-10 18:04:35,370][25689] Fps is (10 sec: 5693.6, 60 sec: 5540.3, 300 sec: 5534.5). Total num frames: 852580352. Throughput: 0: 5836.3. Samples: 852588676. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:35,370][25689] Avg episode reward: [(0, '-0.619')] [2022-07-10 18:04:36,813][26022] Updated weights on worker 0-0, policy_version 832605 (0.00093) [2022-07-10 18:04:38,233][26022] Updated weights on worker 0-0, policy_version 832615 (0.00085) [2022-07-10 18:04:40,379][25689] Fps is (10 sec: 5411.7, 60 sec: 5522.8, 300 sec: 5531.3). Total num frames: 852606976. Throughput: 0: 5001.7. Samples: 852605384. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:40,381][25689] Avg episode reward: [(0, '-0.306')] [2022-07-10 18:04:40,467][26022] Updated weights on worker 0-0, policy_version 832625 (0.00088) [2022-07-10 18:04:42,108][26022] Updated weights on worker 0-0, policy_version 832635 (0.00090) [2022-07-10 18:04:43,992][26022] Updated weights on worker 0-0, policy_version 832645 (0.00089) [2022-07-10 18:04:45,427][25689] Fps is (10 sec: 5599.8, 60 sec: 5572.0, 300 sec: 5538.9). Total num frames: 852636672. Throughput: 0: 5834.4. Samples: 852638928. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:45,428][25689] Avg episode reward: [(0, '-0.011')] [2022-07-10 18:04:45,988][26022] Updated weights on worker 0-0, policy_version 832655 (0.00094) [2022-07-10 18:04:47,605][26022] Updated weights on worker 0-0, policy_version 832665 (0.00085) [2022-07-10 18:04:49,596][26022] Updated weights on worker 0-0, policy_version 832675 (0.00093) [2022-07-10 18:04:50,464][25689] Fps is (10 sec: 5584.2, 60 sec: 5506.0, 300 sec: 5529.2). Total num frames: 852663296. Throughput: 0: 5827.2. Samples: 852672328. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:50,465][25689] Avg episode reward: [(0, '-0.587')] [2022-07-10 18:04:51,429][26022] Updated weights on worker 0-0, policy_version 832685 (0.00084) [2022-07-10 18:04:52,588][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:04:52,610][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000832691_852675584.pth [2022-07-10 18:04:52,611][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000830745_850682880.pth [2022-07-10 18:04:52,611][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000832691_852675584.pth.milestone [2022-07-10 18:04:53,199][26022] Updated weights on worker 0-0, policy_version 832695 (0.00089) [2022-07-10 18:04:55,236][26022] Updated weights on worker 0-0, policy_version 832705 (0.00091) [2022-07-10 18:04:55,478][25689] Fps is (10 sec: 5399.7, 60 sec: 5556.5, 300 sec: 5532.6). Total num frames: 852690944. Throughput: 0: 4985.2. Samples: 852688922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:04:55,479][25689] Avg episode reward: [(0, '-0.686')] [2022-07-10 18:04:56,903][26022] Updated weights on worker 0-0, policy_version 832715 (0.00094) [2022-07-10 18:04:58,796][26022] Updated weights on worker 0-0, policy_version 832725 (0.00089) [2022-07-10 18:05:00,498][25689] Fps is (10 sec: 5714.9, 60 sec: 5575.2, 300 sec: 5546.1). Total num frames: 852720640. Throughput: 0: 5837.7. Samples: 852722836. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:05:00,499][25689] Avg episode reward: [(0, '-1.318')] [2022-07-10 18:05:00,502][26022] Updated weights on worker 0-0, policy_version 832735 (0.00100) [2022-07-10 18:05:02,761][26022] Updated weights on worker 0-0, policy_version 832745 (0.00091) [2022-07-10 18:05:04,466][26022] Updated weights on worker 0-0, policy_version 832755 (0.00370) [2022-07-10 18:05:05,528][25689] Fps is (10 sec: 5400.4, 60 sec: 5544.5, 300 sec: 5534.2). Total num frames: 852745216. Throughput: 0: 5754.0. Samples: 852754586. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:05:05,528][25689] Avg episode reward: [(0, '-0.628')] [2022-07-10 18:05:06,360][26022] Updated weights on worker 0-0, policy_version 832765 (0.00091) [2022-07-10 18:05:08,156][26022] Updated weights on worker 0-0, policy_version 832775 (0.00089) [2022-07-10 18:05:09,975][26022] Updated weights on worker 0-0, policy_version 832785 (0.00085) [2022-07-10 18:05:10,593][25689] Fps is (10 sec: 5376.2, 60 sec: 5546.9, 300 sec: 5540.3). Total num frames: 852774912. Throughput: 0: 4924.1. Samples: 852771446. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:05:10,594][25689] Avg episode reward: [(0, '-1.225')] [2022-07-10 18:05:11,859][26022] Updated weights on worker 0-0, policy_version 832795 (0.00085) [2022-07-10 18:05:13,718][26022] Updated weights on worker 0-0, policy_version 832805 (0.00092) [2022-07-10 18:05:15,462][26022] Updated weights on worker 0-0, policy_version 832815 (0.00096) [2022-07-10 18:05:15,598][25689] Fps is (10 sec: 5796.2, 60 sec: 5564.6, 300 sec: 5544.9). Total num frames: 852803584. Throughput: 0: 5772.6. Samples: 852805064. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:05:15,598][25689] Avg episode reward: [(0, '-0.106')] [2022-07-10 18:05:17,457][26022] Updated weights on worker 0-0, policy_version 832825 (0.00092) [2022-07-10 18:05:19,369][26022] Updated weights on worker 0-0, policy_version 832835 (0.00088) [2022-07-10 18:05:20,605][25689] Fps is (10 sec: 5420.9, 60 sec: 5516.0, 300 sec: 5538.2). Total num frames: 852829184. Throughput: 0: 5747.5. Samples: 852838398. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:05:20,605][25689] Avg episode reward: [(0, '-0.012')] [2022-07-10 18:05:21,102][26022] Updated weights on worker 0-0, policy_version 832845 (0.00054) [2022-07-10 18:05:23,027][26022] Updated weights on worker 0-0, policy_version 832855 (0.00086) [2022-07-10 18:05:24,630][26022] Updated weights on worker 0-0, policy_version 832865 (0.00084) [2022-07-10 18:05:25,617][25689] Fps is (10 sec: 5518.9, 60 sec: 5568.2, 300 sec: 5545.8). Total num frames: 852858880. Throughput: 0: 5006.6. Samples: 852855166. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:05:25,618][25689] Avg episode reward: [(0, '1.233')] [2022-07-10 18:05:26,651][26022] Updated weights on worker 0-0, policy_version 832875 (0.00086) [2022-07-10 18:05:28,436][26022] Updated weights on worker 0-0, policy_version 832885 (0.00096) [2022-07-10 18:05:30,231][26022] Updated weights on worker 0-0, policy_version 832895 (0.00099) [2022-07-10 18:05:30,756][25689] Fps is (10 sec: 5649.0, 60 sec: 5526.3, 300 sec: 5540.3). Total num frames: 852886528. Throughput: 0: 5802.7. Samples: 852888444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:05:30,757][25689] Avg episode reward: [(0, '1.239')] [2022-07-10 18:05:32,074][26022] Updated weights on worker 0-0, policy_version 832905 (0.00088) [2022-07-10 18:05:33,919][26022] Updated weights on worker 0-0, policy_version 832915 (0.00090) [2022-07-10 18:05:35,797][25689] Fps is (10 sec: 5431.8, 60 sec: 5524.4, 300 sec: 5539.9). Total num frames: 852914176. Throughput: 0: 5786.4. Samples: 852921946. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:05:35,799][25689] Avg episode reward: [(0, '0.478')] [2022-07-10 18:05:35,821][26022] Updated weights on worker 0-0, policy_version 832925 (0.00087) [2022-07-10 18:05:37,559][26022] Updated weights on worker 0-0, policy_version 832935 (0.00095) [2022-07-10 18:05:39,567][26022] Updated weights on worker 0-0, policy_version 832945 (0.00081) [2022-07-10 18:05:40,820][25689] Fps is (10 sec: 5596.3, 60 sec: 5557.0, 300 sec: 5543.3). Total num frames: 852942848. Throughput: 0: 4951.5. Samples: 852938492. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:05:40,822][25689] Avg episode reward: [(0, '1.362')] [2022-07-10 18:05:41,350][26022] Updated weights on worker 0-0, policy_version 832955 (0.00096) [2022-07-10 18:05:43,165][26022] Updated weights on worker 0-0, policy_version 832965 (0.00089) [2022-07-10 18:05:44,992][26022] Updated weights on worker 0-0, policy_version 832975 (0.00093) [2022-07-10 18:05:45,849][25689] Fps is (10 sec: 5501.4, 60 sec: 5508.0, 300 sec: 5537.5). Total num frames: 852969472. Throughput: 0: 5763.1. Samples: 852971762. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:05:45,849][25689] Avg episode reward: [(0, '0.581')] [2022-07-10 18:05:46,846][26022] Updated weights on worker 0-0, policy_version 832985 (0.00101) [2022-07-10 18:05:48,686][26022] Updated weights on worker 0-0, policy_version 832995 (0.00082) [2022-07-10 18:05:50,546][26022] Updated weights on worker 0-0, policy_version 833005 (0.00087) [2022-07-10 18:05:50,937][25689] Fps is (10 sec: 5667.8, 60 sec: 5571.0, 300 sec: 5547.1). Total num frames: 853000192. Throughput: 0: 5785.8. Samples: 853005208. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:05:50,939][25689] Avg episode reward: [(0, '-0.454')] [2022-07-10 18:05:52,550][26022] Updated weights on worker 0-0, policy_version 833015 (0.00095) [2022-07-10 18:05:54,105][26022] Updated weights on worker 0-0, policy_version 833025 (0.00083) [2022-07-10 18:05:55,987][25689] Fps is (10 sec: 5555.0, 60 sec: 5533.9, 300 sec: 5536.7). Total num frames: 853025792. Throughput: 0: 4945.3. Samples: 853021792. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:05:55,988][25689] Avg episode reward: [(0, '-0.997')] [2022-07-10 18:05:56,281][26022] Updated weights on worker 0-0, policy_version 833035 (0.00105) [2022-07-10 18:05:57,880][26022] Updated weights on worker 0-0, policy_version 833045 (0.00085) [2022-07-10 18:05:59,800][26022] Updated weights on worker 0-0, policy_version 833055 (0.00100) [2022-07-10 18:06:00,991][25689] Fps is (10 sec: 5500.4, 60 sec: 5535.4, 300 sec: 5550.5). Total num frames: 853055488. Throughput: 0: 5795.8. Samples: 853055398. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:00,991][25689] Avg episode reward: [(0, '-1.345')] [2022-07-10 18:06:01,626][26022] Updated weights on worker 0-0, policy_version 833065 (0.00082) [2022-07-10 18:06:03,805][26022] Updated weights on worker 0-0, policy_version 833075 (0.00086) [2022-07-10 18:06:05,712][26022] Updated weights on worker 0-0, policy_version 833085 (0.00092) [2022-07-10 18:06:06,012][25689] Fps is (10 sec: 5516.0, 60 sec: 5553.1, 300 sec: 5549.1). Total num frames: 853081088. Throughput: 0: 5714.3. Samples: 853086980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:06,012][25689] Avg episode reward: [(0, '-1.724')] [2022-07-10 18:06:07,442][26022] Updated weights on worker 0-0, policy_version 833095 (0.00092) [2022-07-10 18:06:09,351][26022] Updated weights on worker 0-0, policy_version 833105 (0.00100) [2022-07-10 18:06:11,071][26022] Updated weights on worker 0-0, policy_version 833115 (0.00087) [2022-07-10 18:06:11,170][25689] Fps is (10 sec: 5331.8, 60 sec: 5527.7, 300 sec: 5542.9). Total num frames: 853109760. Throughput: 0: 4860.5. Samples: 853103546. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:11,170][25689] Avg episode reward: [(0, '-1.950')] [2022-07-10 18:06:12,989][26022] Updated weights on worker 0-0, policy_version 833125 (0.00076) [2022-07-10 18:06:14,831][26022] Updated weights on worker 0-0, policy_version 833135 (0.00086) [2022-07-10 18:06:16,189][25689] Fps is (10 sec: 5634.6, 60 sec: 5526.4, 300 sec: 5546.3). Total num frames: 853138432. Throughput: 0: 5710.8. Samples: 853137160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:16,189][25689] Avg episode reward: [(0, '-0.750')] [2022-07-10 18:06:16,660][26022] Updated weights on worker 0-0, policy_version 833145 (0.00087) [2022-07-10 18:06:18,351][26022] Updated weights on worker 0-0, policy_version 833155 (0.00083) [2022-07-10 18:06:20,346][26022] Updated weights on worker 0-0, policy_version 833165 (0.00090) [2022-07-10 18:06:21,229][25689] Fps is (10 sec: 5598.7, 60 sec: 5557.1, 300 sec: 5546.1). Total num frames: 853166080. Throughput: 0: 5697.8. Samples: 853170712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:21,229][25689] Avg episode reward: [(0, '-0.764')] [2022-07-10 18:06:22,161][26022] Updated weights on worker 0-0, policy_version 833175 (0.00093) [2022-07-10 18:06:23,982][26022] Updated weights on worker 0-0, policy_version 833185 (0.00091) [2022-07-10 18:06:26,057][26022] Updated weights on worker 0-0, policy_version 833195 (0.00091) [2022-07-10 18:06:26,262][25689] Fps is (10 sec: 5387.6, 60 sec: 5504.5, 300 sec: 5547.9). Total num frames: 853192704. Throughput: 0: 4945.9. Samples: 853187138. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:26,263][25689] Avg episode reward: [(0, '-0.672')] [2022-07-10 18:06:27,654][26022] Updated weights on worker 0-0, policy_version 833205 (0.00087) [2022-07-10 18:06:29,728][26022] Updated weights on worker 0-0, policy_version 833215 (0.00094) [2022-07-10 18:06:31,317][26022] Updated weights on worker 0-0, policy_version 833225 (0.00087) [2022-07-10 18:06:31,323][25689] Fps is (10 sec: 5579.6, 60 sec: 5545.5, 300 sec: 5543.7). Total num frames: 853222400. Throughput: 0: 5804.1. Samples: 853220518. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:31,323][25689] Avg episode reward: [(0, '0.634')] [2022-07-10 18:06:33,208][26022] Updated weights on worker 0-0, policy_version 833235 (0.00085) [2022-07-10 18:06:35,021][26022] Updated weights on worker 0-0, policy_version 833245 (0.00085) [2022-07-10 18:06:36,333][25689] Fps is (10 sec: 5592.8, 60 sec: 5531.5, 300 sec: 5544.4). Total num frames: 853249024. Throughput: 0: 5812.5. Samples: 853254246. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:36,333][25689] Avg episode reward: [(0, '0.865')] [2022-07-10 18:06:36,738][26022] Updated weights on worker 0-0, policy_version 833255 (0.00086) [2022-07-10 18:06:38,950][26022] Updated weights on worker 0-0, policy_version 833266 (0.00088) [2022-07-10 18:06:40,855][26022] Updated weights on worker 0-0, policy_version 833276 (0.00093) [2022-07-10 18:06:41,358][25689] Fps is (10 sec: 5510.4, 60 sec: 5531.2, 300 sec: 5540.6). Total num frames: 853277696. Throughput: 0: 4972.8. Samples: 853270808. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:41,358][25689] Avg episode reward: [(0, '0.827')] [2022-07-10 18:06:42,432][26022] Updated weights on worker 0-0, policy_version 833286 (0.00082) [2022-07-10 18:06:44,483][26022] Updated weights on worker 0-0, policy_version 833296 (0.00095) [2022-07-10 18:06:46,259][26022] Updated weights on worker 0-0, policy_version 833306 (0.00084) [2022-07-10 18:06:46,383][25689] Fps is (10 sec: 5603.9, 60 sec: 5548.5, 300 sec: 5545.2). Total num frames: 853305344. Throughput: 0: 5825.6. Samples: 853304350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:46,383][25689] Avg episode reward: [(0, '0.547')] [2022-07-10 18:06:48,094][26022] Updated weights on worker 0-0, policy_version 833316 (0.00085) [2022-07-10 18:06:50,223][26022] Updated weights on worker 0-0, policy_version 833326 (0.00077) [2022-07-10 18:06:51,451][25689] Fps is (10 sec: 5579.8, 60 sec: 5516.5, 300 sec: 5540.5). Total num frames: 853334016. Throughput: 0: 5816.7. Samples: 853337598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:51,452][25689] Avg episode reward: [(0, '1.025')] [2022-07-10 18:06:51,779][26022] Updated weights on worker 0-0, policy_version 833336 (0.00084) [2022-07-10 18:06:52,679][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:06:52,693][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000833341_853341184.pth [2022-07-10 18:06:52,694][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000831391_851344384.pth [2022-07-10 18:06:53,878][26022] Updated weights on worker 0-0, policy_version 833346 (0.00095) [2022-07-10 18:06:55,557][26022] Updated weights on worker 0-0, policy_version 833356 (0.00083) [2022-07-10 18:06:56,460][25689] Fps is (10 sec: 5588.8, 60 sec: 5554.1, 300 sec: 5545.5). Total num frames: 853361664. Throughput: 0: 5788.9. Samples: 853370762. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:06:56,460][25689] Avg episode reward: [(0, '0.707')] [2022-07-10 18:06:57,396][26022] Updated weights on worker 0-0, policy_version 833366 (0.00090) [2022-07-10 18:06:59,473][26022] Updated weights on worker 0-0, policy_version 833376 (0.00085) [2022-07-10 18:07:00,935][26022] Updated weights on worker 0-0, policy_version 833386 (0.00081) [2022-07-10 18:07:01,507][25689] Fps is (10 sec: 5397.2, 60 sec: 5499.3, 300 sec: 5545.2). Total num frames: 853388288. Throughput: 0: 5790.0. Samples: 853387472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:01,511][25689] Avg episode reward: [(0, '0.866')] [2022-07-10 18:07:03,372][26022] Updated weights on worker 0-0, policy_version 833396 (0.00090) [2022-07-10 18:07:05,001][26022] Updated weights on worker 0-0, policy_version 833406 (0.00085) [2022-07-10 18:07:06,518][25689] Fps is (10 sec: 5293.9, 60 sec: 5517.2, 300 sec: 5536.2). Total num frames: 853414912. Throughput: 0: 5693.4. Samples: 853418990. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:06,519][25689] Avg episode reward: [(0, '0.937')] [2022-07-10 18:07:06,829][26022] Updated weights on worker 0-0, policy_version 833416 (0.00080) [2022-07-10 18:07:08,884][26022] Updated weights on worker 0-0, policy_version 833426 (0.00088) [2022-07-10 18:07:10,622][26022] Updated weights on worker 0-0, policy_version 833436 (0.00091) [2022-07-10 18:07:11,575][25689] Fps is (10 sec: 5492.3, 60 sec: 5526.4, 300 sec: 5538.9). Total num frames: 853443584. Throughput: 0: 5703.9. Samples: 853452380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:11,575][25689] Avg episode reward: [(0, '1.093')] [2022-07-10 18:07:12,457][26022] Updated weights on worker 0-0, policy_version 833446 (0.00092) [2022-07-10 18:07:14,226][26022] Updated weights on worker 0-0, policy_version 833456 (0.00086) [2022-07-10 18:07:15,982][26022] Updated weights on worker 0-0, policy_version 833466 (0.00080) [2022-07-10 18:07:16,597][25689] Fps is (10 sec: 5588.3, 60 sec: 5509.2, 300 sec: 5535.3). Total num frames: 853471232. Throughput: 0: 4890.2. Samples: 853469234. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:16,597][25689] Avg episode reward: [(0, '1.108')] [2022-07-10 18:07:18,286][26022] Updated weights on worker 0-0, policy_version 833476 (0.00095) [2022-07-10 18:07:19,491][26022] Updated weights on worker 0-0, policy_version 833486 (0.00088) [2022-07-10 18:07:21,625][25689] Fps is (10 sec: 5399.8, 60 sec: 5493.3, 300 sec: 5535.0). Total num frames: 853497856. Throughput: 0: 5724.5. Samples: 853502642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:21,626][25689] Avg episode reward: [(0, '1.201')] [2022-07-10 18:07:21,751][26022] Updated weights on worker 0-0, policy_version 833496 (0.00088) [2022-07-10 18:07:23,367][26022] Updated weights on worker 0-0, policy_version 833506 (0.00097) [2022-07-10 18:07:25,405][26022] Updated weights on worker 0-0, policy_version 833516 (0.00091) [2022-07-10 18:07:26,631][25689] Fps is (10 sec: 5612.6, 60 sec: 5546.7, 300 sec: 5540.7). Total num frames: 853527552. Throughput: 0: 5814.3. Samples: 853535932. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:26,632][25689] Avg episode reward: [(0, '0.737')] [2022-07-10 18:07:27,334][26022] Updated weights on worker 0-0, policy_version 833526 (0.00081) [2022-07-10 18:07:28,975][26022] Updated weights on worker 0-0, policy_version 833536 (0.00089) [2022-07-10 18:07:30,880][26022] Updated weights on worker 0-0, policy_version 833546 (0.00090) [2022-07-10 18:07:31,675][25689] Fps is (10 sec: 5604.4, 60 sec: 5497.4, 300 sec: 5536.6). Total num frames: 853554176. Throughput: 0: 4990.2. Samples: 853552684. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:31,675][25689] Avg episode reward: [(0, '-1.662')] [2022-07-10 18:07:32,647][26022] Updated weights on worker 0-0, policy_version 833556 (0.00085) [2022-07-10 18:07:34,415][26022] Updated weights on worker 0-0, policy_version 833566 (0.00089) [2022-07-10 18:07:36,544][26022] Updated weights on worker 0-0, policy_version 833576 (0.00092) [2022-07-10 18:07:36,691][25689] Fps is (10 sec: 5496.9, 60 sec: 5530.7, 300 sec: 5529.8). Total num frames: 853582848. Throughput: 0: 5828.0. Samples: 853586342. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:36,691][25689] Avg episode reward: [(0, '-1.868')] [2022-07-10 18:07:38,288][26022] Updated weights on worker 0-0, policy_version 833586 (0.00086) [2022-07-10 18:07:39,897][26022] Updated weights on worker 0-0, policy_version 833596 (0.00091) [2022-07-10 18:07:41,697][25689] Fps is (10 sec: 5619.2, 60 sec: 5515.4, 300 sec: 5537.1). Total num frames: 853610496. Throughput: 0: 5838.3. Samples: 853619828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:41,698][25689] Avg episode reward: [(0, '-1.820')] [2022-07-10 18:07:41,880][26022] Updated weights on worker 0-0, policy_version 833606 (0.00088) [2022-07-10 18:07:43,774][26022] Updated weights on worker 0-0, policy_version 833616 (0.00401) [2022-07-10 18:07:45,553][26022] Updated weights on worker 0-0, policy_version 833626 (0.00084) [2022-07-10 18:07:46,711][25689] Fps is (10 sec: 5620.6, 60 sec: 5533.5, 300 sec: 5537.7). Total num frames: 853639168. Throughput: 0: 5007.0. Samples: 853636472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:46,711][25689] Avg episode reward: [(0, '-2.442')] [2022-07-10 18:07:47,526][26022] Updated weights on worker 0-0, policy_version 833636 (0.00096) [2022-07-10 18:07:49,137][26022] Updated weights on worker 0-0, policy_version 833646 (0.00089) [2022-07-10 18:07:51,223][26022] Updated weights on worker 0-0, policy_version 833656 (0.00088) [2022-07-10 18:07:51,826][25689] Fps is (10 sec: 5661.9, 60 sec: 5529.2, 300 sec: 5533.2). Total num frames: 853667840. Throughput: 0: 5826.6. Samples: 853670096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:51,826][25689] Avg episode reward: [(0, '-2.652')] [2022-07-10 18:07:52,978][26022] Updated weights on worker 0-0, policy_version 833666 (0.00083) [2022-07-10 18:07:54,691][26022] Updated weights on worker 0-0, policy_version 833676 (0.00096) [2022-07-10 18:07:56,743][26022] Updated weights on worker 0-0, policy_version 833686 (0.00087) [2022-07-10 18:07:56,883][25689] Fps is (10 sec: 5436.1, 60 sec: 5507.8, 300 sec: 5529.0). Total num frames: 853694464. Throughput: 0: 5819.1. Samples: 853703842. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:07:56,886][25689] Avg episode reward: [(0, '-1.632')] [2022-07-10 18:07:58,272][26022] Updated weights on worker 0-0, policy_version 833696 (0.00086) [2022-07-10 18:08:00,355][26022] Updated weights on worker 0-0, policy_version 833706 (0.00088) [2022-07-10 18:08:01,895][25689] Fps is (10 sec: 5491.7, 60 sec: 5545.0, 300 sec: 5542.8). Total num frames: 853723136. Throughput: 0: 4988.3. Samples: 853720578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:01,895][25689] Avg episode reward: [(0, '-0.352')] [2022-07-10 18:08:02,366][26022] Updated weights on worker 0-0, policy_version 833716 (0.00095) [2022-07-10 18:08:04,335][26022] Updated weights on worker 0-0, policy_version 833726 (0.00088) [2022-07-10 18:08:06,100][26022] Updated weights on worker 0-0, policy_version 833736 (0.00089) [2022-07-10 18:08:06,906][25689] Fps is (10 sec: 5414.8, 60 sec: 5528.0, 300 sec: 5530.4). Total num frames: 853748736. Throughput: 0: 5714.1. Samples: 853751868. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:06,906][25689] Avg episode reward: [(0, '-0.626')] [2022-07-10 18:08:08,132][26022] Updated weights on worker 0-0, policy_version 833746 (0.00093) [2022-07-10 18:08:09,795][26022] Updated weights on worker 0-0, policy_version 833756 (0.00102) [2022-07-10 18:08:11,833][26022] Updated weights on worker 0-0, policy_version 833766 (0.00095) [2022-07-10 18:08:11,977][25689] Fps is (10 sec: 5484.2, 60 sec: 5543.6, 300 sec: 5539.8). Total num frames: 853778432. Throughput: 0: 5703.6. Samples: 853785034. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:11,978][25689] Avg episode reward: [(0, '-0.665')] [2022-07-10 18:08:13,559][26022] Updated weights on worker 0-0, policy_version 833776 (0.00085) [2022-07-10 18:08:15,439][26022] Updated weights on worker 0-0, policy_version 833786 (0.00092) [2022-07-10 18:08:16,991][25689] Fps is (10 sec: 5584.6, 60 sec: 5527.4, 300 sec: 5533.8). Total num frames: 853805056. Throughput: 0: 4859.0. Samples: 853801546. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:16,991][25689] Avg episode reward: [(0, '-0.751')] [2022-07-10 18:08:17,387][26022] Updated weights on worker 0-0, policy_version 833796 (0.00083) [2022-07-10 18:08:19,158][26022] Updated weights on worker 0-0, policy_version 833806 (0.00095) [2022-07-10 18:08:20,851][26022] Updated weights on worker 0-0, policy_version 833816 (0.00088) [2022-07-10 18:08:22,019][25689] Fps is (10 sec: 5404.6, 60 sec: 5544.4, 300 sec: 5529.9). Total num frames: 853832704. Throughput: 0: 5692.7. Samples: 853835142. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:22,020][25689] Avg episode reward: [(0, '-0.651')] [2022-07-10 18:08:23,015][26022] Updated weights on worker 0-0, policy_version 833826 (0.00438) [2022-07-10 18:08:24,683][26022] Updated weights on worker 0-0, policy_version 833836 (0.00084) [2022-07-10 18:08:26,776][26022] Updated weights on worker 0-0, policy_version 833846 (0.00086) [2022-07-10 18:08:27,043][25689] Fps is (10 sec: 5603.0, 60 sec: 5525.8, 300 sec: 5541.5). Total num frames: 853861376. Throughput: 0: 5771.9. Samples: 853868096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:27,043][25689] Avg episode reward: [(0, '0.104')] [2022-07-10 18:08:28,492][26022] Updated weights on worker 0-0, policy_version 833856 (0.00089) [2022-07-10 18:08:30,308][26022] Updated weights on worker 0-0, policy_version 833866 (0.00097) [2022-07-10 18:08:32,095][25689] Fps is (10 sec: 5488.1, 60 sec: 5525.0, 300 sec: 5527.2). Total num frames: 853888000. Throughput: 0: 4950.2. Samples: 853884620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:32,095][25689] Avg episode reward: [(0, '-0.141')] [2022-07-10 18:08:32,252][26022] Updated weights on worker 0-0, policy_version 833876 (0.00091) [2022-07-10 18:08:33,962][26022] Updated weights on worker 0-0, policy_version 833886 (0.00084) [2022-07-10 18:08:35,905][26022] Updated weights on worker 0-0, policy_version 833896 (0.00096) [2022-07-10 18:08:37,130][25689] Fps is (10 sec: 5380.1, 60 sec: 5506.3, 300 sec: 5526.6). Total num frames: 853915648. Throughput: 0: 5762.7. Samples: 853917604. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:37,131][25689] Avg episode reward: [(0, '-0.163')] [2022-07-10 18:08:37,706][26022] Updated weights on worker 0-0, policy_version 833906 (0.00085) [2022-07-10 18:08:39,483][26022] Updated weights on worker 0-0, policy_version 833916 (0.00084) [2022-07-10 18:08:41,290][26022] Updated weights on worker 0-0, policy_version 833926 (0.00093) [2022-07-10 18:08:42,150][25689] Fps is (10 sec: 5499.1, 60 sec: 5505.1, 300 sec: 5530.3). Total num frames: 853943296. Throughput: 0: 5773.4. Samples: 853951366. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:42,151][25689] Avg episode reward: [(0, '0.590')] [2022-07-10 18:08:43,041][26022] Updated weights on worker 0-0, policy_version 833936 (0.00081) [2022-07-10 18:08:45,051][26022] Updated weights on worker 0-0, policy_version 833946 (0.00097) [2022-07-10 18:08:46,731][26022] Updated weights on worker 0-0, policy_version 833956 (0.00087) [2022-07-10 18:08:47,191][25689] Fps is (10 sec: 5598.0, 60 sec: 5502.6, 300 sec: 5523.7). Total num frames: 853971968. Throughput: 0: 4962.4. Samples: 853968080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:47,192][25689] Avg episode reward: [(0, '0.560')] [2022-07-10 18:08:48,726][26022] Updated weights on worker 0-0, policy_version 833966 (0.00095) [2022-07-10 18:08:50,688][26022] Updated weights on worker 0-0, policy_version 833976 (0.00123) [2022-07-10 18:08:52,259][25689] Fps is (10 sec: 5672.6, 60 sec: 5506.9, 300 sec: 5536.4). Total num frames: 854000640. Throughput: 0: 5800.4. Samples: 854001582. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:52,260][25689] Avg episode reward: [(0, '-1.156')] [2022-07-10 18:08:52,527][26022] Updated weights on worker 0-0, policy_version 833986 (0.00098) [2022-07-10 18:08:52,701][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:08:52,713][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000833987_854002688.pth [2022-07-10 18:08:52,714][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000832041_852009984.pth [2022-07-10 18:08:54,443][26022] Updated weights on worker 0-0, policy_version 833996 (0.00093) [2022-07-10 18:08:56,336][26022] Updated weights on worker 0-0, policy_version 834006 (0.00083) [2022-07-10 18:08:57,271][25689] Fps is (10 sec: 5587.5, 60 sec: 5528.0, 300 sec: 5533.5). Total num frames: 854028288. Throughput: 0: 5803.9. Samples: 854034496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:08:57,271][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 18:08:58,055][26022] Updated weights on worker 0-0, policy_version 834016 (0.00082) [2022-07-10 18:08:59,820][26022] Updated weights on worker 0-0, policy_version 834026 (0.00085) [2022-07-10 18:09:01,673][26022] Updated weights on worker 0-0, policy_version 834036 (0.00086) [2022-07-10 18:09:02,363][25689] Fps is (10 sec: 5270.1, 60 sec: 5469.8, 300 sec: 5529.5). Total num frames: 854053888. Throughput: 0: 4946.1. Samples: 854051340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 18:09:02,364][25689] Avg episode reward: [(0, '-1.453')] [2022-07-10 18:09:03,787][26022] Updated weights on worker 0-0, policy_version 834046 (0.00093) [2022-07-10 18:09:05,871][26022] Updated weights on worker 0-0, policy_version 834056 (0.00090) [2022-07-10 18:09:07,382][25689] Fps is (10 sec: 5266.3, 60 sec: 5503.0, 300 sec: 5523.9). Total num frames: 854081536. Throughput: 0: 5669.7. Samples: 854082554. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:07,382][25689] Avg episode reward: [(0, '-0.616')] [2022-07-10 18:09:07,664][26022] Updated weights on worker 0-0, policy_version 834066 (0.00088) [2022-07-10 18:09:09,435][26022] Updated weights on worker 0-0, policy_version 834076 (0.00099) [2022-07-10 18:09:11,387][26022] Updated weights on worker 0-0, policy_version 834086 (0.00090) [2022-07-10 18:09:12,486][25689] Fps is (10 sec: 5664.7, 60 sec: 5500.0, 300 sec: 5529.1). Total num frames: 854111232. Throughput: 0: 5656.4. Samples: 854115992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:12,487][25689] Avg episode reward: [(0, '-0.322')] [2022-07-10 18:09:13,316][26022] Updated weights on worker 0-0, policy_version 834096 (0.00092) [2022-07-10 18:09:14,927][26022] Updated weights on worker 0-0, policy_version 834106 (0.00089) [2022-07-10 18:09:16,978][26022] Updated weights on worker 0-0, policy_version 834116 (0.00094) [2022-07-10 18:09:17,507][25689] Fps is (10 sec: 5460.9, 60 sec: 5482.4, 300 sec: 5519.0). Total num frames: 854136832. Throughput: 0: 4857.1. Samples: 854132788. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:17,508][25689] Avg episode reward: [(0, '0.938')] [2022-07-10 18:09:18,493][26022] Updated weights on worker 0-0, policy_version 834126 (0.00080) [2022-07-10 18:09:20,648][26022] Updated weights on worker 0-0, policy_version 834136 (0.00092) [2022-07-10 18:09:22,254][26022] Updated weights on worker 0-0, policy_version 834146 (0.00084) [2022-07-10 18:09:22,514][25689] Fps is (10 sec: 5514.2, 60 sec: 5518.2, 300 sec: 5529.7). Total num frames: 854166528. Throughput: 0: 5703.9. Samples: 854166280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:22,515][25689] Avg episode reward: [(0, '0.817')] [2022-07-10 18:09:24,082][26022] Updated weights on worker 0-0, policy_version 834156 (0.00091) [2022-07-10 18:09:26,203][26022] Updated weights on worker 0-0, policy_version 834166 (0.00086) [2022-07-10 18:09:27,523][25689] Fps is (10 sec: 5725.7, 60 sec: 5502.6, 300 sec: 5523.6). Total num frames: 854194176. Throughput: 0: 5788.8. Samples: 854199146. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:27,523][25689] Avg episode reward: [(0, '-0.978')] [2022-07-10 18:09:27,956][26022] Updated weights on worker 0-0, policy_version 834176 (0.00090) [2022-07-10 18:09:29,810][26022] Updated weights on worker 0-0, policy_version 834186 (0.00096) [2022-07-10 18:09:31,843][26022] Updated weights on worker 0-0, policy_version 834196 (0.00086) [2022-07-10 18:09:32,587][25689] Fps is (10 sec: 5387.9, 60 sec: 5501.5, 300 sec: 5519.4). Total num frames: 854220800. Throughput: 0: 4955.5. Samples: 854215602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:32,589][25689] Avg episode reward: [(0, '-0.924')] [2022-07-10 18:09:33,437][26022] Updated weights on worker 0-0, policy_version 834206 (0.00080) [2022-07-10 18:09:35,613][26022] Updated weights on worker 0-0, policy_version 834216 (0.00086) [2022-07-10 18:09:37,202][26022] Updated weights on worker 0-0, policy_version 834226 (0.00106) [2022-07-10 18:09:37,599][25689] Fps is (10 sec: 5386.4, 60 sec: 5503.7, 300 sec: 5522.8). Total num frames: 854248448. Throughput: 0: 5768.1. Samples: 854248674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:37,600][25689] Avg episode reward: [(0, '-0.815')] [2022-07-10 18:09:39,308][26022] Updated weights on worker 0-0, policy_version 834236 (0.00086) [2022-07-10 18:09:40,946][26022] Updated weights on worker 0-0, policy_version 834246 (0.00081) [2022-07-10 18:09:42,603][25689] Fps is (10 sec: 5418.7, 60 sec: 5488.2, 300 sec: 5513.2). Total num frames: 854275072. Throughput: 0: 5761.4. Samples: 854282018. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:42,604][25689] Avg episode reward: [(0, '-1.179')] [2022-07-10 18:09:42,931][26022] Updated weights on worker 0-0, policy_version 834256 (0.00091) [2022-07-10 18:09:44,491][26022] Updated weights on worker 0-0, policy_version 834266 (0.00095) [2022-07-10 18:09:46,549][26022] Updated weights on worker 0-0, policy_version 834276 (0.00090) [2022-07-10 18:09:47,607][25689] Fps is (10 sec: 5627.1, 60 sec: 5508.5, 300 sec: 5524.2). Total num frames: 854304768. Throughput: 0: 4956.0. Samples: 854298686. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:47,608][25689] Avg episode reward: [(0, '-0.869')] [2022-07-10 18:09:48,261][26022] Updated weights on worker 0-0, policy_version 834286 (0.00087) [2022-07-10 18:09:50,314][26022] Updated weights on worker 0-0, policy_version 834296 (0.00084) [2022-07-10 18:09:52,003][26022] Updated weights on worker 0-0, policy_version 834306 (0.00091) [2022-07-10 18:09:52,679][25689] Fps is (10 sec: 5691.0, 60 sec: 5491.2, 300 sec: 5523.1). Total num frames: 854332416. Throughput: 0: 5800.6. Samples: 854332148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:52,679][25689] Avg episode reward: [(0, '0.018')] [2022-07-10 18:09:53,865][26022] Updated weights on worker 0-0, policy_version 834316 (0.00092) [2022-07-10 18:09:55,770][26022] Updated weights on worker 0-0, policy_version 834326 (0.00098) [2022-07-10 18:09:57,552][26022] Updated weights on worker 0-0, policy_version 834336 (0.00091) [2022-07-10 18:09:57,680][25689] Fps is (10 sec: 5489.5, 60 sec: 5492.1, 300 sec: 5516.6). Total num frames: 854360064. Throughput: 0: 5808.5. Samples: 854365320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:09:57,681][25689] Avg episode reward: [(0, '1.204')] [2022-07-10 18:09:59,483][26022] Updated weights on worker 0-0, policy_version 834346 (0.00083) [2022-07-10 18:10:01,263][26022] Updated weights on worker 0-0, policy_version 834356 (0.00095) [2022-07-10 18:10:02,686][25689] Fps is (10 sec: 5423.2, 60 sec: 5517.0, 300 sec: 5523.9). Total num frames: 854386688. Throughput: 0: 4988.5. Samples: 854382206. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:02,687][25689] Avg episode reward: [(0, '0.407')] [2022-07-10 18:10:03,442][26022] Updated weights on worker 0-0, policy_version 834366 (0.00095) [2022-07-10 18:10:05,555][26022] Updated weights on worker 0-0, policy_version 834376 (0.00094) [2022-07-10 18:10:07,189][26022] Updated weights on worker 0-0, policy_version 834386 (0.00090) [2022-07-10 18:10:07,698][25689] Fps is (10 sec: 5417.7, 60 sec: 5517.6, 300 sec: 5518.0). Total num frames: 854414336. Throughput: 0: 5713.5. Samples: 854413474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:07,698][25689] Avg episode reward: [(0, '0.032')] [2022-07-10 18:10:09,112][26022] Updated weights on worker 0-0, policy_version 834396 (0.00090) [2022-07-10 18:10:10,711][26022] Updated weights on worker 0-0, policy_version 834406 (0.00094) [2022-07-10 18:10:12,795][25689] Fps is (10 sec: 5368.7, 60 sec: 5467.3, 300 sec: 5509.4). Total num frames: 854440960. Throughput: 0: 5697.0. Samples: 854446750. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:12,796][25689] Avg episode reward: [(0, '-0.384')] [2022-07-10 18:10:12,840][26022] Updated weights on worker 0-0, policy_version 834416 (0.00086) [2022-07-10 18:10:14,626][26022] Updated weights on worker 0-0, policy_version 834426 (0.00052) [2022-07-10 18:10:16,311][26022] Updated weights on worker 0-0, policy_version 834436 (0.00086) [2022-07-10 18:10:17,800][25689] Fps is (10 sec: 5473.9, 60 sec: 5519.8, 300 sec: 5519.8). Total num frames: 854469632. Throughput: 0: 4891.0. Samples: 854463724. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:17,800][25689] Avg episode reward: [(0, '-0.304')] [2022-07-10 18:10:18,254][26022] Updated weights on worker 0-0, policy_version 834446 (0.00086) [2022-07-10 18:10:20,127][26022] Updated weights on worker 0-0, policy_version 834456 (0.00093) [2022-07-10 18:10:21,735][26022] Updated weights on worker 0-0, policy_version 834466 (0.00085) [2022-07-10 18:10:22,803][25689] Fps is (10 sec: 5730.2, 60 sec: 5503.1, 300 sec: 5516.5). Total num frames: 854498304. Throughput: 0: 5717.7. Samples: 854497226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:22,803][25689] Avg episode reward: [(0, '-0.172')] [2022-07-10 18:10:23,833][26022] Updated weights on worker 0-0, policy_version 834476 (0.00087) [2022-07-10 18:10:25,420][26022] Updated weights on worker 0-0, policy_version 834486 (0.00090) [2022-07-10 18:10:27,656][26022] Updated weights on worker 0-0, policy_version 834496 (0.00089) [2022-07-10 18:10:27,811][25689] Fps is (10 sec: 5625.7, 60 sec: 5503.2, 300 sec: 5518.9). Total num frames: 854525952. Throughput: 0: 5800.6. Samples: 854530142. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:27,811][25689] Avg episode reward: [(0, '-0.519')] [2022-07-10 18:10:29,346][26022] Updated weights on worker 0-0, policy_version 834506 (0.00094) [2022-07-10 18:10:31,112][26022] Updated weights on worker 0-0, policy_version 834516 (0.01337) [2022-07-10 18:10:32,871][25689] Fps is (10 sec: 5390.4, 60 sec: 5503.6, 300 sec: 5515.1). Total num frames: 854552576. Throughput: 0: 4980.0. Samples: 854546726. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:32,879][25689] Avg episode reward: [(0, '0.312')] [2022-07-10 18:10:33,187][26022] Updated weights on worker 0-0, policy_version 834526 (0.00087) [2022-07-10 18:10:34,878][26022] Updated weights on worker 0-0, policy_version 834536 (0.00260) [2022-07-10 18:10:36,813][26022] Updated weights on worker 0-0, policy_version 834546 (0.00082) [2022-07-10 18:10:37,884][25689] Fps is (10 sec: 5489.5, 60 sec: 5520.4, 300 sec: 5515.3). Total num frames: 854581248. Throughput: 0: 5794.1. Samples: 854580094. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:37,884][25689] Avg episode reward: [(0, '0.621')] [2022-07-10 18:10:38,642][26022] Updated weights on worker 0-0, policy_version 834556 (0.00094) [2022-07-10 18:10:40,401][26022] Updated weights on worker 0-0, policy_version 834566 (0.00090) [2022-07-10 18:10:42,203][26022] Updated weights on worker 0-0, policy_version 834576 (0.00096) [2022-07-10 18:10:42,908][25689] Fps is (10 sec: 5713.2, 60 sec: 5552.5, 300 sec: 5522.3). Total num frames: 854609920. Throughput: 0: 5788.3. Samples: 854613602. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:42,909][25689] Avg episode reward: [(0, '1.002')] [2022-07-10 18:10:44,006][26022] Updated weights on worker 0-0, policy_version 834586 (0.00086) [2022-07-10 18:10:45,840][26022] Updated weights on worker 0-0, policy_version 834596 (0.00093) [2022-07-10 18:10:47,783][26022] Updated weights on worker 0-0, policy_version 834606 (0.00083) [2022-07-10 18:10:47,918][25689] Fps is (10 sec: 5510.9, 60 sec: 5501.1, 300 sec: 5510.0). Total num frames: 854636544. Throughput: 0: 4989.9. Samples: 854630474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:47,918][25689] Avg episode reward: [(0, '1.095')] [2022-07-10 18:10:49,721][26022] Updated weights on worker 0-0, policy_version 834616 (0.00091) [2022-07-10 18:10:51,498][26022] Updated weights on worker 0-0, policy_version 834626 (0.00437) [2022-07-10 18:10:52,815][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:10:52,832][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000834634_854665216.pth [2022-07-10 18:10:52,832][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000832691_852675584.pth [2022-07-10 18:10:53,029][25689] Fps is (10 sec: 5463.8, 60 sec: 5514.5, 300 sec: 5519.2). Total num frames: 854665216. Throughput: 0: 5804.8. Samples: 854663736. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:53,029][25689] Avg episode reward: [(0, '1.183')] [2022-07-10 18:10:53,392][26022] Updated weights on worker 0-0, policy_version 834636 (0.00087) [2022-07-10 18:10:55,168][26022] Updated weights on worker 0-0, policy_version 834646 (0.00096) [2022-07-10 18:10:57,200][26022] Updated weights on worker 0-0, policy_version 834656 (0.00086) [2022-07-10 18:10:58,030][25689] Fps is (10 sec: 5569.4, 60 sec: 5514.5, 300 sec: 5512.3). Total num frames: 854692864. Throughput: 0: 5792.3. Samples: 854696788. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:10:58,031][25689] Avg episode reward: [(0, '1.023')] [2022-07-10 18:10:58,882][26022] Updated weights on worker 0-0, policy_version 834666 (0.00091) [2022-07-10 18:11:00,826][26022] Updated weights on worker 0-0, policy_version 834676 (0.00084) [2022-07-10 18:11:02,969][26022] Updated weights on worker 0-0, policy_version 834686 (0.00080) [2022-07-10 18:11:03,074][25689] Fps is (10 sec: 5300.6, 60 sec: 5494.1, 300 sec: 5511.9). Total num frames: 854718464. Throughput: 0: 5665.0. Samples: 854727842. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:03,075][25689] Avg episode reward: [(0, '1.198')] [2022-07-10 18:11:04,779][26022] Updated weights on worker 0-0, policy_version 834696 (0.00089) [2022-07-10 18:11:06,571][26022] Updated weights on worker 0-0, policy_version 834706 (0.00109) [2022-07-10 18:11:08,088][25689] Fps is (10 sec: 5294.0, 60 sec: 5493.8, 300 sec: 5511.1). Total num frames: 854746112. Throughput: 0: 5653.9. Samples: 854744516. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:08,090][25689] Avg episode reward: [(0, '0.627')] [2022-07-10 18:11:08,556][26022] Updated weights on worker 0-0, policy_version 834716 (0.00087) [2022-07-10 18:11:10,378][26022] Updated weights on worker 0-0, policy_version 834726 (0.00097) [2022-07-10 18:11:12,195][26022] Updated weights on worker 0-0, policy_version 834736 (0.00086) [2022-07-10 18:11:13,202][25689] Fps is (10 sec: 5561.2, 60 sec: 5526.3, 300 sec: 5509.4). Total num frames: 854774784. Throughput: 0: 5672.0. Samples: 854778158. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:13,202][25689] Avg episode reward: [(0, '-0.098')] [2022-07-10 18:11:13,754][26022] Updated weights on worker 0-0, policy_version 834746 (0.00089) [2022-07-10 18:11:15,878][26022] Updated weights on worker 0-0, policy_version 834756 (0.00088) [2022-07-10 18:11:17,818][26022] Updated weights on worker 0-0, policy_version 834766 (0.00083) [2022-07-10 18:11:18,287][25689] Fps is (10 sec: 5622.5, 60 sec: 5518.9, 300 sec: 5512.0). Total num frames: 854803456. Throughput: 0: 5667.9. Samples: 854811604. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:18,288][25689] Avg episode reward: [(0, '-0.433')] [2022-07-10 18:11:19,420][26022] Updated weights on worker 0-0, policy_version 834776 (0.00096) [2022-07-10 18:11:21,427][26022] Updated weights on worker 0-0, policy_version 834786 (0.00091) [2022-07-10 18:11:23,034][26022] Updated weights on worker 0-0, policy_version 834796 (0.00086) [2022-07-10 18:11:23,319][25689] Fps is (10 sec: 5769.0, 60 sec: 5533.2, 300 sec: 5522.3). Total num frames: 854833152. Throughput: 0: 4971.9. Samples: 854828498. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:23,320][25689] Avg episode reward: [(0, '-0.137')] [2022-07-10 18:11:25,092][26022] Updated weights on worker 0-0, policy_version 834806 (0.00087) [2022-07-10 18:11:26,742][26022] Updated weights on worker 0-0, policy_version 834816 (0.00088) [2022-07-10 18:11:28,331][25689] Fps is (10 sec: 5607.7, 60 sec: 5515.9, 300 sec: 5512.9). Total num frames: 854859776. Throughput: 0: 5796.2. Samples: 854861844. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:28,331][25689] Avg episode reward: [(0, '-0.021')] [2022-07-10 18:11:28,578][26022] Updated weights on worker 0-0, policy_version 834826 (0.00096) [2022-07-10 18:11:30,485][26022] Updated weights on worker 0-0, policy_version 834836 (0.00102) [2022-07-10 18:11:32,640][26022] Updated weights on worker 0-0, policy_version 834846 (0.00095) [2022-07-10 18:11:33,388][25689] Fps is (10 sec: 5390.0, 60 sec: 5533.1, 300 sec: 5515.5). Total num frames: 854887424. Throughput: 0: 5784.0. Samples: 854894916. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:33,389][25689] Avg episode reward: [(0, '0.100')] [2022-07-10 18:11:34,222][26022] Updated weights on worker 0-0, policy_version 834856 (0.00099) [2022-07-10 18:11:36,243][26022] Updated weights on worker 0-0, policy_version 834866 (0.00089) [2022-07-10 18:11:37,866][26022] Updated weights on worker 0-0, policy_version 834876 (0.00097) [2022-07-10 18:11:38,393][25689] Fps is (10 sec: 5495.3, 60 sec: 5516.9, 300 sec: 5512.4). Total num frames: 854915072. Throughput: 0: 4973.7. Samples: 854911604. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:38,394][25689] Avg episode reward: [(0, '0.958')] [2022-07-10 18:11:39,866][26022] Updated weights on worker 0-0, policy_version 834886 (0.00092) [2022-07-10 18:11:41,649][26022] Updated weights on worker 0-0, policy_version 834896 (0.00091) [2022-07-10 18:11:43,396][26022] Updated weights on worker 0-0, policy_version 834906 (0.00087) [2022-07-10 18:11:43,494][25689] Fps is (10 sec: 5573.0, 60 sec: 5509.9, 300 sec: 5514.4). Total num frames: 854943744. Throughput: 0: 5785.3. Samples: 854945216. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:43,494][25689] Avg episode reward: [(0, '0.951')] [2022-07-10 18:11:45,191][26022] Updated weights on worker 0-0, policy_version 834916 (0.00083) [2022-07-10 18:11:47,172][26022] Updated weights on worker 0-0, policy_version 834926 (0.01021) [2022-07-10 18:11:48,522][25689] Fps is (10 sec: 5560.6, 60 sec: 5525.2, 300 sec: 5511.7). Total num frames: 854971392. Throughput: 0: 5792.1. Samples: 854978792. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:48,523][25689] Avg episode reward: [(0, '0.741')] [2022-07-10 18:11:48,838][26022] Updated weights on worker 0-0, policy_version 834936 (0.00084) [2022-07-10 18:11:50,806][26022] Updated weights on worker 0-0, policy_version 834946 (0.00084) [2022-07-10 18:11:52,463][26022] Updated weights on worker 0-0, policy_version 834956 (0.00098) [2022-07-10 18:11:53,605][25689] Fps is (10 sec: 5570.0, 60 sec: 5527.7, 300 sec: 5513.8). Total num frames: 855000064. Throughput: 0: 4977.2. Samples: 854995540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:53,606][25689] Avg episode reward: [(0, '-0.645')] [2022-07-10 18:11:54,571][26022] Updated weights on worker 0-0, policy_version 834966 (0.00094) [2022-07-10 18:11:56,195][26022] Updated weights on worker 0-0, policy_version 834976 (0.00066) [2022-07-10 18:11:58,115][26022] Updated weights on worker 0-0, policy_version 834986 (0.00626) [2022-07-10 18:11:58,611][25689] Fps is (10 sec: 5683.4, 60 sec: 5544.2, 300 sec: 5521.5). Total num frames: 855028736. Throughput: 0: 5813.7. Samples: 855029146. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:11:58,612][25689] Avg episode reward: [(0, '-1.516')] [2022-07-10 18:12:00,048][26022] Updated weights on worker 0-0, policy_version 834996 (0.00097) [2022-07-10 18:12:02,136][26022] Updated weights on worker 0-0, policy_version 835006 (0.00090) [2022-07-10 18:12:03,666][25689] Fps is (10 sec: 5292.6, 60 sec: 5526.3, 300 sec: 5513.8). Total num frames: 855053312. Throughput: 0: 5700.1. Samples: 855060200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:03,667][25689] Avg episode reward: [(0, '-1.051')] [2022-07-10 18:12:04,088][26022] Updated weights on worker 0-0, policy_version 835016 (0.00088) [2022-07-10 18:12:06,171][26022] Updated weights on worker 0-0, policy_version 835026 (0.00107) [2022-07-10 18:12:07,615][26022] Updated weights on worker 0-0, policy_version 835036 (0.00080) [2022-07-10 18:12:08,676][25689] Fps is (10 sec: 5188.6, 60 sec: 5526.6, 300 sec: 5511.2). Total num frames: 855080960. Throughput: 0: 4864.4. Samples: 855076834. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:08,677][25689] Avg episode reward: [(0, '-1.913')] [2022-07-10 18:12:09,978][26022] Updated weights on worker 0-0, policy_version 835046 (0.00094) [2022-07-10 18:12:11,375][26022] Updated weights on worker 0-0, policy_version 835056 (0.00092) [2022-07-10 18:12:13,383][26022] Updated weights on worker 0-0, policy_version 835066 (0.00092) [2022-07-10 18:12:13,771][25689] Fps is (10 sec: 5573.3, 60 sec: 5528.3, 300 sec: 5513.3). Total num frames: 855109632. Throughput: 0: 5674.2. Samples: 855109966. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:13,772][25689] Avg episode reward: [(0, '-2.194')] [2022-07-10 18:12:15,084][26022] Updated weights on worker 0-0, policy_version 835076 (0.00372) [2022-07-10 18:12:16,902][26022] Updated weights on worker 0-0, policy_version 835086 (0.00090) [2022-07-10 18:12:18,782][25689] Fps is (10 sec: 5674.5, 60 sec: 5535.2, 300 sec: 5520.5). Total num frames: 855138304. Throughput: 0: 5671.4. Samples: 855143542. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:18,782][25689] Avg episode reward: [(0, '-1.372')] [2022-07-10 18:12:18,783][26022] Updated weights on worker 0-0, policy_version 835096 (0.00084) [2022-07-10 18:12:20,706][26022] Updated weights on worker 0-0, policy_version 835106 (0.00086) [2022-07-10 18:12:22,351][26022] Updated weights on worker 0-0, policy_version 835116 (0.00087) [2022-07-10 18:12:23,832][25689] Fps is (10 sec: 5598.1, 60 sec: 5499.6, 300 sec: 5512.8). Total num frames: 855165952. Throughput: 0: 4974.0. Samples: 855160506. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:23,833][25689] Avg episode reward: [(0, '-0.846')] [2022-07-10 18:12:24,271][26022] Updated weights on worker 0-0, policy_version 835126 (0.00098) [2022-07-10 18:12:26,120][26022] Updated weights on worker 0-0, policy_version 835136 (0.00086) [2022-07-10 18:12:27,948][26022] Updated weights on worker 0-0, policy_version 835146 (0.00085) [2022-07-10 18:12:28,862][25689] Fps is (10 sec: 5485.8, 60 sec: 5514.9, 300 sec: 5516.5). Total num frames: 855193600. Throughput: 0: 5808.9. Samples: 855194088. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:28,862][25689] Avg episode reward: [(0, '-0.746')] [2022-07-10 18:12:29,805][26022] Updated weights on worker 0-0, policy_version 835156 (0.00090) [2022-07-10 18:12:31,678][26022] Updated weights on worker 0-0, policy_version 835166 (0.00087) [2022-07-10 18:12:33,369][26022] Updated weights on worker 0-0, policy_version 835176 (0.00092) [2022-07-10 18:12:33,958][25689] Fps is (10 sec: 5662.9, 60 sec: 5545.2, 300 sec: 5518.4). Total num frames: 855223296. Throughput: 0: 5820.8. Samples: 855227470. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:33,959][25689] Avg episode reward: [(0, '0.202')] [2022-07-10 18:12:35,315][26022] Updated weights on worker 0-0, policy_version 835186 (0.00084) [2022-07-10 18:12:37,036][26022] Updated weights on worker 0-0, policy_version 835196 (0.00091) [2022-07-10 18:12:38,987][25689] Fps is (10 sec: 5663.2, 60 sec: 5543.0, 300 sec: 5518.0). Total num frames: 855250944. Throughput: 0: 4990.9. Samples: 855244386. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:38,988][25689] Avg episode reward: [(0, '0.280')] [2022-07-10 18:12:38,990][26022] Updated weights on worker 0-0, policy_version 835206 (0.00090) [2022-07-10 18:12:40,733][26022] Updated weights on worker 0-0, policy_version 835216 (0.00087) [2022-07-10 18:12:42,724][26022] Updated weights on worker 0-0, policy_version 835226 (0.00094) [2022-07-10 18:12:43,993][25689] Fps is (10 sec: 5408.2, 60 sec: 5517.8, 300 sec: 5511.3). Total num frames: 855277568. Throughput: 0: 5813.3. Samples: 855277710. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 18:12:43,994][25689] Avg episode reward: [(0, '0.238')] [2022-07-10 18:12:44,503][26022] Updated weights on worker 0-0, policy_version 835236 (0.00088) [2022-07-10 18:12:46,351][26022] Updated weights on worker 0-0, policy_version 835246 (0.00087) [2022-07-10 18:12:48,040][26022] Updated weights on worker 0-0, policy_version 835256 (0.00077) [2022-07-10 18:12:48,995][25689] Fps is (10 sec: 5525.5, 60 sec: 5537.1, 300 sec: 5513.4). Total num frames: 855306240. Throughput: 0: 5811.1. Samples: 855311084. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:12:48,995][25689] Avg episode reward: [(0, '0.099')] [2022-07-10 18:12:50,237][26022] Updated weights on worker 0-0, policy_version 835266 (0.00090) [2022-07-10 18:12:51,845][26022] Updated weights on worker 0-0, policy_version 835276 (0.00089) [2022-07-10 18:12:52,904][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:12:52,924][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000835282_855328768.pth [2022-07-10 18:12:52,925][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000833341_853341184.pth [2022-07-10 18:12:53,784][26022] Updated weights on worker 0-0, policy_version 835286 (0.00090) [2022-07-10 18:12:54,081][25689] Fps is (10 sec: 5583.3, 60 sec: 5520.0, 300 sec: 5516.3). Total num frames: 855333888. Throughput: 0: 4975.2. Samples: 855327582. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:12:54,081][25689] Avg episode reward: [(0, '-0.021')] [2022-07-10 18:12:55,657][26022] Updated weights on worker 0-0, policy_version 835296 (0.00090) [2022-07-10 18:12:57,476][26022] Updated weights on worker 0-0, policy_version 835306 (0.00088) [2022-07-10 18:12:59,103][25689] Fps is (10 sec: 5470.6, 60 sec: 5501.6, 300 sec: 5512.6). Total num frames: 855361536. Throughput: 0: 5785.6. Samples: 855360766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:12:59,103][25689] Avg episode reward: [(0, '0.041')] [2022-07-10 18:12:59,254][26022] Updated weights on worker 0-0, policy_version 835316 (0.00085) [2022-07-10 18:13:01,539][26022] Updated weights on worker 0-0, policy_version 835326 (0.00501) [2022-07-10 18:13:03,400][26022] Updated weights on worker 0-0, policy_version 835336 (0.00088) [2022-07-10 18:13:04,123][25689] Fps is (10 sec: 5302.3, 60 sec: 5521.7, 300 sec: 5512.5). Total num frames: 855387136. Throughput: 0: 5667.6. Samples: 855391798. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:04,124][25689] Avg episode reward: [(0, '0.063')] [2022-07-10 18:13:05,244][26022] Updated weights on worker 0-0, policy_version 835346 (0.00087) [2022-07-10 18:13:07,117][26022] Updated weights on worker 0-0, policy_version 835356 (0.00093) [2022-07-10 18:13:09,011][26022] Updated weights on worker 0-0, policy_version 835366 (0.00083) [2022-07-10 18:13:09,129][25689] Fps is (10 sec: 5310.6, 60 sec: 5522.0, 300 sec: 5506.8). Total num frames: 855414784. Throughput: 0: 4845.6. Samples: 855408648. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:09,130][25689] Avg episode reward: [(0, '-0.725')] [2022-07-10 18:13:11,078][26022] Updated weights on worker 0-0, policy_version 835376 (0.00093) [2022-07-10 18:13:12,535][26022] Updated weights on worker 0-0, policy_version 835386 (0.00093) [2022-07-10 18:13:14,209][25689] Fps is (10 sec: 5482.4, 60 sec: 5506.5, 300 sec: 5509.0). Total num frames: 855442432. Throughput: 0: 5675.6. Samples: 855441824. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:14,210][25689] Avg episode reward: [(0, '-1.042')] [2022-07-10 18:13:14,539][26022] Updated weights on worker 0-0, policy_version 835396 (0.00090) [2022-07-10 18:13:16,144][26022] Updated weights on worker 0-0, policy_version 835406 (0.00084) [2022-07-10 18:13:18,185][26022] Updated weights on worker 0-0, policy_version 835416 (0.00076) [2022-07-10 18:13:19,265][25689] Fps is (10 sec: 5758.7, 60 sec: 5536.2, 300 sec: 5518.8). Total num frames: 855473152. Throughput: 0: 5707.9. Samples: 855475850. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:19,265][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 18:13:19,907][26022] Updated weights on worker 0-0, policy_version 835426 (0.00084) [2022-07-10 18:13:21,679][26022] Updated weights on worker 0-0, policy_version 835436 (0.00090) [2022-07-10 18:13:23,603][26022] Updated weights on worker 0-0, policy_version 835446 (0.00083) [2022-07-10 18:13:24,273][25689] Fps is (10 sec: 5596.3, 60 sec: 5506.2, 300 sec: 5508.8). Total num frames: 855498752. Throughput: 0: 5847.1. Samples: 855509616. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:24,274][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 18:13:25,363][26022] Updated weights on worker 0-0, policy_version 835456 (0.00092) [2022-07-10 18:13:27,085][26022] Updated weights on worker 0-0, policy_version 835466 (0.00083) [2022-07-10 18:13:28,974][26022] Updated weights on worker 0-0, policy_version 835476 (0.00086) [2022-07-10 18:13:29,315][25689] Fps is (10 sec: 5502.3, 60 sec: 5539.0, 300 sec: 5519.3). Total num frames: 855528448. Throughput: 0: 5833.2. Samples: 855526392. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:29,315][25689] Avg episode reward: [(0, '-1.462')] [2022-07-10 18:13:31,107][26022] Updated weights on worker 0-0, policy_version 835486 (0.00086) [2022-07-10 18:13:32,749][26022] Updated weights on worker 0-0, policy_version 835496 (0.00087) [2022-07-10 18:13:34,421][25689] Fps is (10 sec: 5650.6, 60 sec: 5504.2, 300 sec: 5518.0). Total num frames: 855556096. Throughput: 0: 5827.8. Samples: 855559616. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:34,422][25689] Avg episode reward: [(0, '-1.325')] [2022-07-10 18:13:34,735][26022] Updated weights on worker 0-0, policy_version 835506 (0.00092) [2022-07-10 18:13:36,446][26022] Updated weights on worker 0-0, policy_version 835516 (0.00080) [2022-07-10 18:13:38,411][26022] Updated weights on worker 0-0, policy_version 835526 (0.00087) [2022-07-10 18:13:39,438][25689] Fps is (10 sec: 5563.1, 60 sec: 5522.2, 300 sec: 5521.5). Total num frames: 855584768. Throughput: 0: 5829.7. Samples: 855593454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:39,439][25689] Avg episode reward: [(0, '-1.970')] [2022-07-10 18:13:40,151][26022] Updated weights on worker 0-0, policy_version 835536 (0.00096) [2022-07-10 18:13:41,984][26022] Updated weights on worker 0-0, policy_version 835546 (0.00083) [2022-07-10 18:13:43,878][26022] Updated weights on worker 0-0, policy_version 835556 (0.00093) [2022-07-10 18:13:44,466][25689] Fps is (10 sec: 5708.6, 60 sec: 5554.1, 300 sec: 5521.7). Total num frames: 855613440. Throughput: 0: 4980.5. Samples: 855610190. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:44,467][25689] Avg episode reward: [(0, '-1.168')] [2022-07-10 18:13:45,678][26022] Updated weights on worker 0-0, policy_version 835566 (0.00087) [2022-07-10 18:13:47,408][26022] Updated weights on worker 0-0, policy_version 835576 (0.00092) [2022-07-10 18:13:49,323][26022] Updated weights on worker 0-0, policy_version 835586 (0.00083) [2022-07-10 18:13:49,470][25689] Fps is (10 sec: 5614.1, 60 sec: 5536.9, 300 sec: 5519.5). Total num frames: 855641088. Throughput: 0: 5833.2. Samples: 855643964. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:49,471][25689] Avg episode reward: [(0, '-1.530')] [2022-07-10 18:13:51,134][26022] Updated weights on worker 0-0, policy_version 835596 (0.00094) [2022-07-10 18:13:53,081][26022] Updated weights on worker 0-0, policy_version 835606 (0.00088) [2022-07-10 18:13:54,539][25689] Fps is (10 sec: 5388.0, 60 sec: 5521.5, 300 sec: 5515.0). Total num frames: 855667712. Throughput: 0: 5830.5. Samples: 855676914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:54,540][25689] Avg episode reward: [(0, '-0.773')] [2022-07-10 18:13:54,916][26022] Updated weights on worker 0-0, policy_version 835616 (0.00097) [2022-07-10 18:13:56,668][26022] Updated weights on worker 0-0, policy_version 835626 (0.00094) [2022-07-10 18:13:58,731][26022] Updated weights on worker 0-0, policy_version 835636 (0.00089) [2022-07-10 18:13:59,551][25689] Fps is (10 sec: 5688.4, 60 sec: 5573.3, 300 sec: 5533.7). Total num frames: 855698432. Throughput: 0: 4979.6. Samples: 855693606. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:13:59,552][25689] Avg episode reward: [(0, '-0.746')] [2022-07-10 18:14:00,215][26022] Updated weights on worker 0-0, policy_version 835646 (0.00085) [2022-07-10 18:14:02,366][26022] Updated weights on worker 0-0, policy_version 835656 (0.00101) [2022-07-10 18:14:04,573][25689] Fps is (10 sec: 5408.7, 60 sec: 5539.2, 300 sec: 5519.8). Total num frames: 855721984. Throughput: 0: 5713.3. Samples: 855725068. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:04,574][25689] Avg episode reward: [(0, '-1.317')] [2022-07-10 18:14:04,576][26022] Updated weights on worker 0-0, policy_version 835666 (0.00092) [2022-07-10 18:14:06,144][26022] Updated weights on worker 0-0, policy_version 835676 (0.00085) [2022-07-10 18:14:08,201][26022] Updated weights on worker 0-0, policy_version 835686 (0.00089) [2022-07-10 18:14:09,596][25689] Fps is (10 sec: 5198.9, 60 sec: 5554.6, 300 sec: 5517.9). Total num frames: 855750656. Throughput: 0: 5692.1. Samples: 855758524. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:09,597][25689] Avg episode reward: [(0, '-2.143')] [2022-07-10 18:14:09,848][26022] Updated weights on worker 0-0, policy_version 835696 (0.00094) [2022-07-10 18:14:11,998][26022] Updated weights on worker 0-0, policy_version 835706 (0.00078) [2022-07-10 18:14:13,432][26022] Updated weights on worker 0-0, policy_version 835716 (0.00088) [2022-07-10 18:14:14,665][25689] Fps is (10 sec: 5682.3, 60 sec: 5572.6, 300 sec: 5527.4). Total num frames: 855779328. Throughput: 0: 4877.6. Samples: 855775080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:14,666][25689] Avg episode reward: [(0, '-2.000')] [2022-07-10 18:14:15,594][26022] Updated weights on worker 0-0, policy_version 835726 (0.00138) [2022-07-10 18:14:17,269][26022] Updated weights on worker 0-0, policy_version 835736 (0.00093) [2022-07-10 18:14:19,211][26022] Updated weights on worker 0-0, policy_version 835746 (0.00084) [2022-07-10 18:14:19,683][25689] Fps is (10 sec: 5583.8, 60 sec: 5525.3, 300 sec: 5520.3). Total num frames: 855806976. Throughput: 0: 5714.1. Samples: 855808640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:19,683][25689] Avg episode reward: [(0, '-2.365')] [2022-07-10 18:14:21,026][26022] Updated weights on worker 0-0, policy_version 835756 (0.00086) [2022-07-10 18:14:22,805][26022] Updated weights on worker 0-0, policy_version 835766 (0.00098) [2022-07-10 18:14:24,546][26022] Updated weights on worker 0-0, policy_version 835776 (0.00086) [2022-07-10 18:14:24,693][25689] Fps is (10 sec: 5514.4, 60 sec: 5559.0, 300 sec: 5520.2). Total num frames: 855834624. Throughput: 0: 5822.4. Samples: 855842210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:24,693][25689] Avg episode reward: [(0, '-2.467')] [2022-07-10 18:14:26,593][26022] Updated weights on worker 0-0, policy_version 835786 (0.00051) [2022-07-10 18:14:28,219][26022] Updated weights on worker 0-0, policy_version 835796 (0.00055) [2022-07-10 18:14:29,700][25689] Fps is (10 sec: 5418.0, 60 sec: 5511.3, 300 sec: 5521.3). Total num frames: 855861248. Throughput: 0: 4981.5. Samples: 855858668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:29,700][25689] Avg episode reward: [(0, '-2.381')] [2022-07-10 18:14:30,247][26022] Updated weights on worker 0-0, policy_version 835806 (0.00083) [2022-07-10 18:14:32,111][26022] Updated weights on worker 0-0, policy_version 835816 (0.00083) [2022-07-10 18:14:33,895][26022] Updated weights on worker 0-0, policy_version 835826 (0.00095) [2022-07-10 18:14:34,786][25689] Fps is (10 sec: 5478.3, 60 sec: 5530.1, 300 sec: 5523.3). Total num frames: 855889920. Throughput: 0: 5798.3. Samples: 855891750. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:34,787][25689] Avg episode reward: [(0, '-0.892')] [2022-07-10 18:14:35,963][26022] Updated weights on worker 0-0, policy_version 835836 (0.00053) [2022-07-10 18:14:37,548][26022] Updated weights on worker 0-0, policy_version 835846 (0.00085) [2022-07-10 18:14:39,555][26022] Updated weights on worker 0-0, policy_version 835856 (0.00088) [2022-07-10 18:14:39,827][25689] Fps is (10 sec: 5662.5, 60 sec: 5528.0, 300 sec: 5529.6). Total num frames: 855918592. Throughput: 0: 5791.3. Samples: 855925300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:39,827][25689] Avg episode reward: [(0, '0.257')] [2022-07-10 18:14:41,194][26022] Updated weights on worker 0-0, policy_version 835866 (0.00087) [2022-07-10 18:14:43,288][26022] Updated weights on worker 0-0, policy_version 835876 (0.00092) [2022-07-10 18:14:44,843][25689] Fps is (10 sec: 5600.2, 60 sec: 5512.1, 300 sec: 5522.5). Total num frames: 855946240. Throughput: 0: 4944.8. Samples: 855941850. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:44,844][25689] Avg episode reward: [(0, '0.157')] [2022-07-10 18:14:44,970][26022] Updated weights on worker 0-0, policy_version 835886 (0.00092) [2022-07-10 18:14:47,071][26022] Updated weights on worker 0-0, policy_version 835896 (0.00083) [2022-07-10 18:14:48,451][26022] Updated weights on worker 0-0, policy_version 835906 (0.00093) [2022-07-10 18:14:49,876][25689] Fps is (10 sec: 5400.5, 60 sec: 5492.4, 300 sec: 5519.7). Total num frames: 855972864. Throughput: 0: 5777.4. Samples: 855975236. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:49,877][25689] Avg episode reward: [(0, '0.008')] [2022-07-10 18:14:50,705][26022] Updated weights on worker 0-0, policy_version 835916 (0.00088) [2022-07-10 18:14:52,414][26022] Updated weights on worker 0-0, policy_version 835926 (0.00088) [2022-07-10 18:14:53,172][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:14:53,185][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000835929_855991296.pth [2022-07-10 18:14:53,185][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000833987_854002688.pth [2022-07-10 18:14:54,239][26022] Updated weights on worker 0-0, policy_version 835936 (0.00088) [2022-07-10 18:14:54,981][25689] Fps is (10 sec: 5555.2, 60 sec: 5540.0, 300 sec: 5524.7). Total num frames: 856002560. Throughput: 0: 5792.3. Samples: 856008726. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:14:54,982][25689] Avg episode reward: [(0, '-0.342')] [2022-07-10 18:14:56,177][26022] Updated weights on worker 0-0, policy_version 835946 (0.00076) [2022-07-10 18:14:57,885][26022] Updated weights on worker 0-0, policy_version 835956 (0.00095) [2022-07-10 18:15:00,001][25689] Fps is (10 sec: 5461.5, 60 sec: 5454.6, 300 sec: 5521.0). Total num frames: 856028160. Throughput: 0: 4949.7. Samples: 856025156. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:00,001][25689] Avg episode reward: [(0, '-0.541')] [2022-07-10 18:15:00,059][26022] Updated weights on worker 0-0, policy_version 835966 (0.00088) [2022-07-10 18:15:01,504][26022] Updated weights on worker 0-0, policy_version 835976 (0.00087) [2022-07-10 18:15:04,027][26022] Updated weights on worker 0-0, policy_version 835986 (0.00089) [2022-07-10 18:15:05,056][25689] Fps is (10 sec: 5285.4, 60 sec: 5519.4, 300 sec: 5520.2). Total num frames: 856055808. Throughput: 0: 5660.0. Samples: 856056254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:05,056][25689] Avg episode reward: [(0, '-2.226')] [2022-07-10 18:15:05,701][26022] Updated weights on worker 0-0, policy_version 835996 (0.00085) [2022-07-10 18:15:07,804][26022] Updated weights on worker 0-0, policy_version 836006 (0.00081) [2022-07-10 18:15:09,521][26022] Updated weights on worker 0-0, policy_version 836016 (0.00091) [2022-07-10 18:15:10,082][25689] Fps is (10 sec: 5485.3, 60 sec: 5502.2, 300 sec: 5525.0). Total num frames: 856083456. Throughput: 0: 5652.5. Samples: 856089446. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:10,082][25689] Avg episode reward: [(0, '-2.590')] [2022-07-10 18:15:11,325][26022] Updated weights on worker 0-0, policy_version 836026 (0.00093) [2022-07-10 18:15:13,217][26022] Updated weights on worker 0-0, policy_version 836036 (0.00095) [2022-07-10 18:15:15,044][26022] Updated weights on worker 0-0, policy_version 836046 (0.00090) [2022-07-10 18:15:15,132][25689] Fps is (10 sec: 5488.0, 60 sec: 5486.9, 300 sec: 5520.7). Total num frames: 856111104. Throughput: 0: 4832.0. Samples: 856106092. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:15,132][25689] Avg episode reward: [(0, '-2.588')] [2022-07-10 18:15:16,923][26022] Updated weights on worker 0-0, policy_version 836056 (0.00083) [2022-07-10 18:15:18,674][26022] Updated weights on worker 0-0, policy_version 836066 (0.00086) [2022-07-10 18:15:20,140][25689] Fps is (10 sec: 5497.5, 60 sec: 5487.8, 300 sec: 5517.1). Total num frames: 856138752. Throughput: 0: 5667.0. Samples: 856139284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:20,140][25689] Avg episode reward: [(0, '-1.681')] [2022-07-10 18:15:20,660][26022] Updated weights on worker 0-0, policy_version 836076 (0.00090) [2022-07-10 18:15:22,523][26022] Updated weights on worker 0-0, policy_version 836086 (0.00085) [2022-07-10 18:15:24,298][26022] Updated weights on worker 0-0, policy_version 836096 (0.00095) [2022-07-10 18:15:25,218][25689] Fps is (10 sec: 5584.0, 60 sec: 5498.6, 300 sec: 5519.3). Total num frames: 856167424. Throughput: 0: 5770.4. Samples: 856172596. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:25,221][25689] Avg episode reward: [(0, '-1.145')] [2022-07-10 18:15:26,324][26022] Updated weights on worker 0-0, policy_version 836106 (0.00089) [2022-07-10 18:15:27,963][26022] Updated weights on worker 0-0, policy_version 836116 (0.00095) [2022-07-10 18:15:29,934][26022] Updated weights on worker 0-0, policy_version 836126 (0.00569) [2022-07-10 18:15:30,237][25689] Fps is (10 sec: 5577.7, 60 sec: 5514.3, 300 sec: 5523.5). Total num frames: 856195072. Throughput: 0: 4948.3. Samples: 856189182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:30,239][25689] Avg episode reward: [(0, '-0.369')] [2022-07-10 18:15:31,866][26022] Updated weights on worker 0-0, policy_version 836136 (0.00090) [2022-07-10 18:15:33,688][26022] Updated weights on worker 0-0, policy_version 836146 (0.00090) [2022-07-10 18:15:35,346][25689] Fps is (10 sec: 5257.1, 60 sec: 5461.6, 300 sec: 5511.4). Total num frames: 856220672. Throughput: 0: 5739.0. Samples: 856222104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:35,347][25689] Avg episode reward: [(0, '0.413')] [2022-07-10 18:15:35,635][26022] Updated weights on worker 0-0, policy_version 836156 (0.00087) [2022-07-10 18:15:37,432][26022] Updated weights on worker 0-0, policy_version 836166 (0.00091) [2022-07-10 18:15:39,119][26022] Updated weights on worker 0-0, policy_version 836176 (0.00088) [2022-07-10 18:15:40,359][25689] Fps is (10 sec: 5463.2, 60 sec: 5481.0, 300 sec: 5515.0). Total num frames: 856250368. Throughput: 0: 5746.7. Samples: 856255476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:40,359][25689] Avg episode reward: [(0, '0.794')] [2022-07-10 18:15:41,139][26022] Updated weights on worker 0-0, policy_version 836186 (0.00088) [2022-07-10 18:15:42,669][26022] Updated weights on worker 0-0, policy_version 836196 (0.00091) [2022-07-10 18:15:44,759][26022] Updated weights on worker 0-0, policy_version 836206 (0.00092) [2022-07-10 18:15:45,381][25689] Fps is (10 sec: 5714.6, 60 sec: 5480.5, 300 sec: 5518.2). Total num frames: 856278016. Throughput: 0: 4939.8. Samples: 856272200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:45,383][25689] Avg episode reward: [(0, '0.276')] [2022-07-10 18:15:46,552][26022] Updated weights on worker 0-0, policy_version 836216 (0.00090) [2022-07-10 18:15:48,159][26022] Updated weights on worker 0-0, policy_version 836226 (0.00085) [2022-07-10 18:15:50,210][26022] Updated weights on worker 0-0, policy_version 836236 (0.00091) [2022-07-10 18:15:50,401][25689] Fps is (10 sec: 5506.2, 60 sec: 5498.5, 300 sec: 5516.5). Total num frames: 856305664. Throughput: 0: 5785.4. Samples: 856305840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:50,402][25689] Avg episode reward: [(0, '0.366')] [2022-07-10 18:15:52,050][26022] Updated weights on worker 0-0, policy_version 836246 (0.00082) [2022-07-10 18:15:53,805][26022] Updated weights on worker 0-0, policy_version 836256 (0.00088) [2022-07-10 18:15:55,484][25689] Fps is (10 sec: 5676.0, 60 sec: 5500.6, 300 sec: 5521.9). Total num frames: 856335360. Throughput: 0: 5833.0. Samples: 856339566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:15:55,484][25689] Avg episode reward: [(0, '-0.381')] [2022-07-10 18:15:55,803][26022] Updated weights on worker 0-0, policy_version 836266 (0.00094) [2022-07-10 18:15:57,321][26022] Updated weights on worker 0-0, policy_version 836276 (0.00091) [2022-07-10 18:15:59,524][26022] Updated weights on worker 0-0, policy_version 836286 (0.00090) [2022-07-10 18:16:00,533][25689] Fps is (10 sec: 5760.7, 60 sec: 5548.6, 300 sec: 5532.1). Total num frames: 856364032. Throughput: 0: 4990.1. Samples: 856356150. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:16:00,534][25689] Avg episode reward: [(0, '-1.390')] [2022-07-10 18:16:01,106][26022] Updated weights on worker 0-0, policy_version 836296 (0.00084) [2022-07-10 18:16:03,318][26022] Updated weights on worker 0-0, policy_version 836306 (0.00094) [2022-07-10 18:16:05,548][25689] Fps is (10 sec: 5189.1, 60 sec: 5484.6, 300 sec: 5518.3). Total num frames: 856387584. Throughput: 0: 5706.3. Samples: 856387280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:16:05,549][25689] Avg episode reward: [(0, '-1.697')] [2022-07-10 18:16:05,561][26022] Updated weights on worker 0-0, policy_version 836316 (0.00098) [2022-07-10 18:16:07,096][26022] Updated weights on worker 0-0, policy_version 836326 (0.00097) [2022-07-10 18:16:09,170][26022] Updated weights on worker 0-0, policy_version 836336 (0.00097) [2022-07-10 18:16:10,629][25689] Fps is (10 sec: 5172.7, 60 sec: 5496.5, 300 sec: 5518.9). Total num frames: 856416256. Throughput: 0: 5640.5. Samples: 856419936. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:16:10,630][25689] Avg episode reward: [(0, '-2.032')] [2022-07-10 18:16:11,024][26022] Updated weights on worker 0-0, policy_version 836346 (0.00087) [2022-07-10 18:16:12,706][26022] Updated weights on worker 0-0, policy_version 836356 (0.00087) [2022-07-10 18:16:14,758][26022] Updated weights on worker 0-0, policy_version 836366 (0.00082) [2022-07-10 18:16:15,702][25689] Fps is (10 sec: 5647.3, 60 sec: 5511.3, 300 sec: 5519.1). Total num frames: 856444928. Throughput: 0: 5638.3. Samples: 856453564. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:16:15,703][25689] Avg episode reward: [(0, '-1.978')] [2022-07-10 18:16:16,350][26022] Updated weights on worker 0-0, policy_version 836376 (0.00100) [2022-07-10 18:16:18,315][26022] Updated weights on worker 0-0, policy_version 836386 (0.00088) [2022-07-10 18:16:20,041][26022] Updated weights on worker 0-0, policy_version 836396 (0.00089) [2022-07-10 18:16:20,718][25689] Fps is (10 sec: 5481.2, 60 sec: 5493.8, 300 sec: 5509.1). Total num frames: 856471552. Throughput: 0: 5657.9. Samples: 856470352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:16:20,718][25689] Avg episode reward: [(0, '-1.565')] [2022-07-10 18:16:22,031][26022] Updated weights on worker 0-0, policy_version 836406 (0.00096) [2022-07-10 18:16:23,711][26022] Updated weights on worker 0-0, policy_version 836416 (0.00087) [2022-07-10 18:16:25,710][26022] Updated weights on worker 0-0, policy_version 836426 (0.00091) [2022-07-10 18:16:25,734][25689] Fps is (10 sec: 5512.0, 60 sec: 5499.3, 300 sec: 5515.9). Total num frames: 856500224. Throughput: 0: 5773.9. Samples: 856503832. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:25,735][25689] Avg episode reward: [(0, '-1.304')] [2022-07-10 18:16:27,419][26022] Updated weights on worker 0-0, policy_version 836436 (0.00074) [2022-07-10 18:16:29,455][26022] Updated weights on worker 0-0, policy_version 836446 (0.00089) [2022-07-10 18:16:30,759][25689] Fps is (10 sec: 5710.9, 60 sec: 5515.8, 300 sec: 5519.9). Total num frames: 856528896. Throughput: 0: 5805.1. Samples: 856536790. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:30,759][25689] Avg episode reward: [(0, '-0.758')] [2022-07-10 18:16:31,099][26022] Updated weights on worker 0-0, policy_version 836456 (0.00079) [2022-07-10 18:16:33,200][26022] Updated weights on worker 0-0, policy_version 836466 (0.00087) [2022-07-10 18:16:35,100][26022] Updated weights on worker 0-0, policy_version 836476 (0.00090) [2022-07-10 18:16:35,874][25689] Fps is (10 sec: 5554.4, 60 sec: 5549.1, 300 sec: 5517.9). Total num frames: 856556544. Throughput: 0: 4954.0. Samples: 856553494. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:35,874][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 18:16:36,737][26022] Updated weights on worker 0-0, policy_version 836486 (0.00091) [2022-07-10 18:16:38,667][26022] Updated weights on worker 0-0, policy_version 836496 (0.00094) [2022-07-10 18:16:40,336][26022] Updated weights on worker 0-0, policy_version 836506 (0.00090) [2022-07-10 18:16:40,883][25689] Fps is (10 sec: 5461.6, 60 sec: 5515.5, 300 sec: 5516.2). Total num frames: 856584192. Throughput: 0: 5787.2. Samples: 856587056. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:40,884][25689] Avg episode reward: [(0, '0.564')] [2022-07-10 18:16:42,286][26022] Updated weights on worker 0-0, policy_version 836516 (0.00087) [2022-07-10 18:16:44,175][26022] Updated weights on worker 0-0, policy_version 836526 (0.00093) [2022-07-10 18:16:45,815][26022] Updated weights on worker 0-0, policy_version 836536 (0.00092) [2022-07-10 18:16:45,924][25689] Fps is (10 sec: 5603.8, 60 sec: 5530.7, 300 sec: 5519.4). Total num frames: 856612864. Throughput: 0: 5781.7. Samples: 856620564. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:45,925][25689] Avg episode reward: [(0, '0.305')] [2022-07-10 18:16:47,874][26022] Updated weights on worker 0-0, policy_version 836546 (0.00093) [2022-07-10 18:16:49,434][26022] Updated weights on worker 0-0, policy_version 836556 (0.00087) [2022-07-10 18:16:50,936][25689] Fps is (10 sec: 5500.7, 60 sec: 5514.6, 300 sec: 5513.8). Total num frames: 856639488. Throughput: 0: 4989.5. Samples: 856637464. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:50,937][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 18:16:51,553][26022] Updated weights on worker 0-0, policy_version 836566 (0.00080) [2022-07-10 18:16:53,194][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:16:53,206][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000836576_856653824.pth [2022-07-10 18:16:53,206][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000834634_854665216.pth [2022-07-10 18:16:53,210][26022] Updated weights on worker 0-0, policy_version 836576 (0.00086) [2022-07-10 18:16:55,118][26022] Updated weights on worker 0-0, policy_version 836586 (0.00088) [2022-07-10 18:16:56,030][25689] Fps is (10 sec: 5573.3, 60 sec: 5513.6, 300 sec: 5515.6). Total num frames: 856669184. Throughput: 0: 5831.5. Samples: 856671032. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:16:56,030][25689] Avg episode reward: [(0, '-0.449')] [2022-07-10 18:16:56,915][26022] Updated weights on worker 0-0, policy_version 836596 (0.00089) [2022-07-10 18:16:58,943][26022] Updated weights on worker 0-0, policy_version 836606 (0.00078) [2022-07-10 18:17:00,575][26022] Updated weights on worker 0-0, policy_version 836616 (0.00088) [2022-07-10 18:17:01,067][25689] Fps is (10 sec: 5761.2, 60 sec: 5514.7, 300 sec: 5529.7). Total num frames: 856697856. Throughput: 0: 5831.4. Samples: 856704756. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:01,068][25689] Avg episode reward: [(0, '-0.364')] [2022-07-10 18:17:02,847][26022] Updated weights on worker 0-0, policy_version 836626 (0.00088) [2022-07-10 18:17:04,444][26022] Updated weights on worker 0-0, policy_version 836636 (0.00086) [2022-07-10 18:17:06,076][25689] Fps is (10 sec: 5300.1, 60 sec: 5532.1, 300 sec: 5519.4). Total num frames: 856722432. Throughput: 0: 4912.9. Samples: 856719570. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:06,077][25689] Avg episode reward: [(0, '-1.434')] [2022-07-10 18:17:06,665][26022] Updated weights on worker 0-0, policy_version 836646 (0.00084) [2022-07-10 18:17:08,175][26022] Updated weights on worker 0-0, policy_version 836656 (0.00083) [2022-07-10 18:17:10,080][26022] Updated weights on worker 0-0, policy_version 836666 (0.00087) [2022-07-10 18:17:11,107][25689] Fps is (10 sec: 5405.9, 60 sec: 5553.7, 300 sec: 5524.1). Total num frames: 856752128. Throughput: 0: 5730.4. Samples: 856753050. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:11,107][25689] Avg episode reward: [(0, '-1.479')] [2022-07-10 18:17:11,891][26022] Updated weights on worker 0-0, policy_version 836676 (0.00098) [2022-07-10 18:17:13,913][26022] Updated weights on worker 0-0, policy_version 836686 (0.00085) [2022-07-10 18:17:15,675][26022] Updated weights on worker 0-0, policy_version 836696 (0.00087) [2022-07-10 18:17:16,192][25689] Fps is (10 sec: 5669.0, 60 sec: 5535.7, 300 sec: 5519.2). Total num frames: 856779776. Throughput: 0: 5723.3. Samples: 856786426. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:16,192][25689] Avg episode reward: [(0, '-0.757')] [2022-07-10 18:17:17,555][26022] Updated weights on worker 0-0, policy_version 836706 (0.00085) [2022-07-10 18:17:19,282][26022] Updated weights on worker 0-0, policy_version 836716 (0.00085) [2022-07-10 18:17:21,211][25689] Fps is (10 sec: 5371.4, 60 sec: 5535.3, 300 sec: 5516.4). Total num frames: 856806400. Throughput: 0: 4876.5. Samples: 856802984. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:21,212][25689] Avg episode reward: [(0, '-0.419')] [2022-07-10 18:17:21,310][26022] Updated weights on worker 0-0, policy_version 836726 (0.00090) [2022-07-10 18:17:22,944][26022] Updated weights on worker 0-0, policy_version 836736 (0.00114) [2022-07-10 18:17:24,935][26022] Updated weights on worker 0-0, policy_version 836746 (0.00086) [2022-07-10 18:17:26,215][25689] Fps is (10 sec: 5516.9, 60 sec: 5536.5, 300 sec: 5520.3). Total num frames: 856835072. Throughput: 0: 5803.5. Samples: 856836444. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:26,215][25689] Avg episode reward: [(0, '-0.701')] [2022-07-10 18:17:26,775][26022] Updated weights on worker 0-0, policy_version 836756 (0.00097) [2022-07-10 18:17:28,518][26022] Updated weights on worker 0-0, policy_version 836766 (0.00089) [2022-07-10 18:17:30,376][26022] Updated weights on worker 0-0, policy_version 836776 (0.00094) [2022-07-10 18:17:31,232][25689] Fps is (10 sec: 5517.8, 60 sec: 5503.3, 300 sec: 5511.4). Total num frames: 856861696. Throughput: 0: 5802.7. Samples: 856869832. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:31,232][25689] Avg episode reward: [(0, '0.088')] [2022-07-10 18:17:32,152][26022] Updated weights on worker 0-0, policy_version 836786 (0.00085) [2022-07-10 18:17:34,164][26022] Updated weights on worker 0-0, policy_version 836796 (0.00083) [2022-07-10 18:17:35,832][26022] Updated weights on worker 0-0, policy_version 836806 (0.00092) [2022-07-10 18:17:36,333][25689] Fps is (10 sec: 5566.2, 60 sec: 5538.4, 300 sec: 5517.0). Total num frames: 856891392. Throughput: 0: 4971.3. Samples: 856886554. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:36,333][25689] Avg episode reward: [(0, '0.274')] [2022-07-10 18:17:37,639][26022] Updated weights on worker 0-0, policy_version 836816 (0.00084) [2022-07-10 18:17:39,564][26022] Updated weights on worker 0-0, policy_version 836826 (0.00637) [2022-07-10 18:17:41,311][26022] Updated weights on worker 0-0, policy_version 836836 (0.00093) [2022-07-10 18:17:41,352][25689] Fps is (10 sec: 5767.8, 60 sec: 5554.5, 300 sec: 5523.6). Total num frames: 856920064. Throughput: 0: 5809.1. Samples: 856919986. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:41,352][25689] Avg episode reward: [(0, '0.232')] [2022-07-10 18:17:43,289][26022] Updated weights on worker 0-0, policy_version 836846 (0.00091) [2022-07-10 18:17:44,959][26022] Updated weights on worker 0-0, policy_version 836856 (0.00092) [2022-07-10 18:17:46,355][25689] Fps is (10 sec: 5415.4, 60 sec: 5507.2, 300 sec: 5513.3). Total num frames: 856945664. Throughput: 0: 5801.4. Samples: 856953284. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:46,355][25689] Avg episode reward: [(0, '0.324')] [2022-07-10 18:17:46,952][26022] Updated weights on worker 0-0, policy_version 836866 (0.00089) [2022-07-10 18:17:48,843][26022] Updated weights on worker 0-0, policy_version 836876 (0.00083) [2022-07-10 18:17:50,587][26022] Updated weights on worker 0-0, policy_version 836886 (0.00087) [2022-07-10 18:17:51,379][25689] Fps is (10 sec: 5412.5, 60 sec: 5539.9, 300 sec: 5517.9). Total num frames: 856974336. Throughput: 0: 4962.4. Samples: 856969808. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:51,379][25689] Avg episode reward: [(0, '0.770')] [2022-07-10 18:17:52,550][26022] Updated weights on worker 0-0, policy_version 836896 (0.00088) [2022-07-10 18:17:54,275][26022] Updated weights on worker 0-0, policy_version 836906 (0.00087) [2022-07-10 18:17:56,046][26022] Updated weights on worker 0-0, policy_version 836916 (0.00092) [2022-07-10 18:17:56,448][25689] Fps is (10 sec: 5681.1, 60 sec: 5525.2, 300 sec: 5520.4). Total num frames: 857003008. Throughput: 0: 5811.9. Samples: 857003464. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:17:56,449][25689] Avg episode reward: [(0, '0.799')] [2022-07-10 18:17:58,044][26022] Updated weights on worker 0-0, policy_version 836926 (0.00089) [2022-07-10 18:17:59,875][26022] Updated weights on worker 0-0, policy_version 836936 (0.00092) [2022-07-10 18:18:01,511][25689] Fps is (10 sec: 5558.7, 60 sec: 5506.0, 300 sec: 5526.5). Total num frames: 857030656. Throughput: 0: 5798.6. Samples: 857036880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:01,511][25689] Avg episode reward: [(0, '0.371')] [2022-07-10 18:18:02,228][26022] Updated weights on worker 0-0, policy_version 836946 (0.00090) [2022-07-10 18:18:04,039][26022] Updated weights on worker 0-0, policy_version 836956 (0.00087) [2022-07-10 18:18:05,747][26022] Updated weights on worker 0-0, policy_version 836966 (0.00093) [2022-07-10 18:18:06,517][25689] Fps is (10 sec: 5288.7, 60 sec: 5523.2, 300 sec: 5519.6). Total num frames: 857056256. Throughput: 0: 4866.6. Samples: 857051406. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:06,517][25689] Avg episode reward: [(0, '-0.224')] [2022-07-10 18:18:07,833][26022] Updated weights on worker 0-0, policy_version 836976 (0.00084) [2022-07-10 18:18:09,522][26022] Updated weights on worker 0-0, policy_version 836986 (0.00089) [2022-07-10 18:18:11,389][26022] Updated weights on worker 0-0, policy_version 836996 (0.00096) [2022-07-10 18:18:11,547][25689] Fps is (10 sec: 5407.7, 60 sec: 5506.3, 300 sec: 5524.0). Total num frames: 857084928. Throughput: 0: 5687.7. Samples: 857084516. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:11,547][25689] Avg episode reward: [(0, '-0.344')] [2022-07-10 18:18:13,037][26022] Updated weights on worker 0-0, policy_version 837006 (0.00085) [2022-07-10 18:18:14,997][26022] Updated weights on worker 0-0, policy_version 837016 (0.00082) [2022-07-10 18:18:16,606][25689] Fps is (10 sec: 5582.3, 60 sec: 5508.6, 300 sec: 5513.6). Total num frames: 857112576. Throughput: 0: 5690.5. Samples: 857118170. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:16,606][25689] Avg episode reward: [(0, '-0.761')] [2022-07-10 18:18:16,790][26022] Updated weights on worker 0-0, policy_version 837026 (0.00093) [2022-07-10 18:18:18,742][26022] Updated weights on worker 0-0, policy_version 837036 (0.00101) [2022-07-10 18:18:20,539][26022] Updated weights on worker 0-0, policy_version 837046 (0.00083) [2022-07-10 18:18:21,707][25689] Fps is (10 sec: 5442.2, 60 sec: 5518.1, 300 sec: 5518.8). Total num frames: 857140224. Throughput: 0: 4838.7. Samples: 857134606. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:21,708][25689] Avg episode reward: [(0, '-1.231')] [2022-07-10 18:18:22,431][26022] Updated weights on worker 0-0, policy_version 837056 (0.00104) [2022-07-10 18:18:24,247][26022] Updated weights on worker 0-0, policy_version 837066 (0.00087) [2022-07-10 18:18:26,011][26022] Updated weights on worker 0-0, policy_version 837076 (0.00093) [2022-07-10 18:18:26,773][25689] Fps is (10 sec: 5740.7, 60 sec: 5546.2, 300 sec: 5521.8). Total num frames: 857170944. Throughput: 0: 5772.1. Samples: 857168328. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:26,775][25689] Avg episode reward: [(0, '-1.065')] [2022-07-10 18:18:28,019][26022] Updated weights on worker 0-0, policy_version 837086 (0.00086) [2022-07-10 18:18:29,743][26022] Updated weights on worker 0-0, policy_version 837096 (0.00087) [2022-07-10 18:18:31,806][25689] Fps is (10 sec: 5576.8, 60 sec: 5527.9, 300 sec: 5516.2). Total num frames: 857196544. Throughput: 0: 5779.0. Samples: 857201596. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:31,807][25689] Avg episode reward: [(0, '-0.083')] [2022-07-10 18:18:31,812][26022] Updated weights on worker 0-0, policy_version 837106 (0.00100) [2022-07-10 18:18:33,343][26022] Updated weights on worker 0-0, policy_version 837116 (0.00093) [2022-07-10 18:18:35,304][26022] Updated weights on worker 0-0, policy_version 837126 (0.00086) [2022-07-10 18:18:36,860][25689] Fps is (10 sec: 5380.3, 60 sec: 5515.2, 300 sec: 5515.6). Total num frames: 857225216. Throughput: 0: 4938.0. Samples: 857218188. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:36,861][25689] Avg episode reward: [(0, '-0.005')] [2022-07-10 18:18:37,053][26022] Updated weights on worker 0-0, policy_version 837136 (0.00092) [2022-07-10 18:18:39,139][26022] Updated weights on worker 0-0, policy_version 837146 (0.00085) [2022-07-10 18:18:40,866][26022] Updated weights on worker 0-0, policy_version 837156 (0.00090) [2022-07-10 18:18:41,902][25689] Fps is (10 sec: 5578.7, 60 sec: 5496.3, 300 sec: 5511.9). Total num frames: 857252864. Throughput: 0: 5790.6. Samples: 857251544. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:41,902][25689] Avg episode reward: [(0, '0.038')] [2022-07-10 18:18:42,668][26022] Updated weights on worker 0-0, policy_version 837166 (0.00091) [2022-07-10 18:18:44,440][26022] Updated weights on worker 0-0, policy_version 837176 (0.00097) [2022-07-10 18:18:46,402][26022] Updated weights on worker 0-0, policy_version 837186 (0.00101) [2022-07-10 18:18:46,994][25689] Fps is (10 sec: 5557.8, 60 sec: 5538.8, 300 sec: 5513.7). Total num frames: 857281536. Throughput: 0: 5760.6. Samples: 857284812. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:46,994][25689] Avg episode reward: [(0, '0.518')] [2022-07-10 18:18:48,199][26022] Updated weights on worker 0-0, policy_version 837196 (0.00100) [2022-07-10 18:18:50,027][26022] Updated weights on worker 0-0, policy_version 837206 (0.00092) [2022-07-10 18:18:51,892][26022] Updated weights on worker 0-0, policy_version 837216 (0.00086) [2022-07-10 18:18:52,080][25689] Fps is (10 sec: 5633.9, 60 sec: 5533.2, 300 sec: 5520.2). Total num frames: 857310208. Throughput: 0: 5750.4. Samples: 857318178. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:52,081][25689] Avg episode reward: [(0, '0.223')] [2022-07-10 18:18:53,487][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:18:53,497][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000837224_857317376.pth [2022-07-10 18:18:53,498][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000835282_855328768.pth [2022-07-10 18:18:53,815][26022] Updated weights on worker 0-0, policy_version 837226 (0.00088) [2022-07-10 18:18:55,583][26022] Updated weights on worker 0-0, policy_version 837236 (0.00086) [2022-07-10 18:18:57,174][25689] Fps is (10 sec: 5532.5, 60 sec: 5514.1, 300 sec: 5508.4). Total num frames: 857337856. Throughput: 0: 5749.7. Samples: 857334984. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:18:57,176][25689] Avg episode reward: [(0, '-0.133')] [2022-07-10 18:18:57,436][26022] Updated weights on worker 0-0, policy_version 837246 (0.00093) [2022-07-10 18:18:59,285][26022] Updated weights on worker 0-0, policy_version 837256 (0.00091) [2022-07-10 18:19:01,321][26022] Updated weights on worker 0-0, policy_version 837266 (0.00081) [2022-07-10 18:19:02,212][25689] Fps is (10 sec: 5255.3, 60 sec: 5482.5, 300 sec: 5515.0). Total num frames: 857363456. Throughput: 0: 5748.9. Samples: 857368306. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:02,213][25689] Avg episode reward: [(0, '-0.004')] [2022-07-10 18:19:03,382][26022] Updated weights on worker 0-0, policy_version 837276 (0.00084) [2022-07-10 18:19:05,334][26022] Updated weights on worker 0-0, policy_version 837286 (0.00086) [2022-07-10 18:19:07,067][26022] Updated weights on worker 0-0, policy_version 837296 (0.00093) [2022-07-10 18:19:07,235][25689] Fps is (10 sec: 5394.4, 60 sec: 5531.7, 300 sec: 5515.0). Total num frames: 857392128. Throughput: 0: 5665.0. Samples: 857399476. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:07,236][25689] Avg episode reward: [(0, '0.264')] [2022-07-10 18:19:09,065][26022] Updated weights on worker 0-0, policy_version 837306 (0.00092) [2022-07-10 18:19:10,758][26022] Updated weights on worker 0-0, policy_version 837316 (0.00092) [2022-07-10 18:19:12,239][25689] Fps is (10 sec: 5515.1, 60 sec: 5500.3, 300 sec: 5509.3). Total num frames: 857418752. Throughput: 0: 4848.7. Samples: 857415920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:12,241][25689] Avg episode reward: [(0, '-0.810')] [2022-07-10 18:19:12,812][26022] Updated weights on worker 0-0, policy_version 837326 (0.00084) [2022-07-10 18:19:14,485][26022] Updated weights on worker 0-0, policy_version 837336 (0.00088) [2022-07-10 18:19:16,375][26022] Updated weights on worker 0-0, policy_version 837346 (0.00091) [2022-07-10 18:19:17,281][25689] Fps is (10 sec: 5402.1, 60 sec: 5501.8, 300 sec: 5508.8). Total num frames: 857446400. Throughput: 0: 5682.8. Samples: 857449250. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:17,283][25689] Avg episode reward: [(0, '-0.755')] [2022-07-10 18:19:18,095][26022] Updated weights on worker 0-0, policy_version 837356 (0.00092) [2022-07-10 18:19:20,107][26022] Updated weights on worker 0-0, policy_version 837366 (0.00087) [2022-07-10 18:19:21,831][26022] Updated weights on worker 0-0, policy_version 837376 (0.00086) [2022-07-10 18:19:22,313][25689] Fps is (10 sec: 5590.7, 60 sec: 5525.1, 300 sec: 5511.9). Total num frames: 857475072. Throughput: 0: 5694.1. Samples: 857482758. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:22,314][25689] Avg episode reward: [(0, '-0.279')] [2022-07-10 18:19:23,923][26022] Updated weights on worker 0-0, policy_version 837386 (0.00084) [2022-07-10 18:19:25,415][26022] Updated weights on worker 0-0, policy_version 837396 (0.00087) [2022-07-10 18:19:27,327][25689] Fps is (10 sec: 5606.2, 60 sec: 5479.0, 300 sec: 5515.2). Total num frames: 857502720. Throughput: 0: 4983.9. Samples: 857499616. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:27,328][25689] Avg episode reward: [(0, '-0.559')] [2022-07-10 18:19:27,491][26022] Updated weights on worker 0-0, policy_version 837406 (0.00085) [2022-07-10 18:19:29,286][26022] Updated weights on worker 0-0, policy_version 837416 (0.00088) [2022-07-10 18:19:31,104][26022] Updated weights on worker 0-0, policy_version 837426 (0.00086) [2022-07-10 18:19:32,331][25689] Fps is (10 sec: 5519.5, 60 sec: 5515.5, 300 sec: 5513.3). Total num frames: 857530368. Throughput: 0: 5819.0. Samples: 857532836. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:32,331][25689] Avg episode reward: [(0, '-0.582')] [2022-07-10 18:19:32,974][26022] Updated weights on worker 0-0, policy_version 837436 (0.00084) [2022-07-10 18:19:34,765][26022] Updated weights on worker 0-0, policy_version 837446 (0.00089) [2022-07-10 18:19:36,692][26022] Updated weights on worker 0-0, policy_version 837456 (0.00087) [2022-07-10 18:19:37,446][25689] Fps is (10 sec: 5566.1, 60 sec: 5510.0, 300 sec: 5511.9). Total num frames: 857559040. Throughput: 0: 5796.8. Samples: 857566138. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:37,446][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 18:19:38,675][26022] Updated weights on worker 0-0, policy_version 837466 (0.00081) [2022-07-10 18:19:40,295][26022] Updated weights on worker 0-0, policy_version 837476 (0.00094) [2022-07-10 18:19:42,245][26022] Updated weights on worker 0-0, policy_version 837486 (0.00091) [2022-07-10 18:19:42,489][25689] Fps is (10 sec: 5544.1, 60 sec: 5509.8, 300 sec: 5511.4). Total num frames: 857586688. Throughput: 0: 4947.6. Samples: 857582584. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:42,491][25689] Avg episode reward: [(0, '0.356')] [2022-07-10 18:19:44,035][26022] Updated weights on worker 0-0, policy_version 837496 (0.00084) [2022-07-10 18:19:45,981][26022] Updated weights on worker 0-0, policy_version 837506 (0.00088) [2022-07-10 18:19:47,508][25689] Fps is (10 sec: 5495.4, 60 sec: 5499.6, 300 sec: 5515.1). Total num frames: 857614336. Throughput: 0: 5758.8. Samples: 857615832. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:47,510][25689] Avg episode reward: [(0, '0.089')] [2022-07-10 18:19:48,044][26022] Updated weights on worker 0-0, policy_version 837516 (0.00086) [2022-07-10 18:19:49,473][26022] Updated weights on worker 0-0, policy_version 837526 (0.00081) [2022-07-10 18:19:51,643][26022] Updated weights on worker 0-0, policy_version 837536 (0.00091) [2022-07-10 18:19:52,515][25689] Fps is (10 sec: 5617.9, 60 sec: 5506.8, 300 sec: 5513.5). Total num frames: 857643008. Throughput: 0: 5766.5. Samples: 857649224. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:52,515][25689] Avg episode reward: [(0, '0.331')] [2022-07-10 18:19:53,498][26022] Updated weights on worker 0-0, policy_version 837546 (0.00093) [2022-07-10 18:19:55,158][26022] Updated weights on worker 0-0, policy_version 837556 (0.00053) [2022-07-10 18:19:57,373][26022] Updated weights on worker 0-0, policy_version 837566 (0.00095) [2022-07-10 18:19:57,646][25689] Fps is (10 sec: 5353.0, 60 sec: 5469.4, 300 sec: 5511.4). Total num frames: 857668608. Throughput: 0: 4937.6. Samples: 857665882. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2022-07-10 18:19:57,647][25689] Avg episode reward: [(0, '-0.985')] [2022-07-10 18:19:58,635][26022] Updated weights on worker 0-0, policy_version 837576 (0.00089) [2022-07-10 18:20:00,945][26022] Updated weights on worker 0-0, policy_version 837586 (0.00086) [2022-07-10 18:20:02,671][25689] Fps is (10 sec: 5343.7, 60 sec: 5521.6, 300 sec: 5515.4). Total num frames: 857697280. Throughput: 0: 5782.7. Samples: 857699286. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:02,671][25689] Avg episode reward: [(0, '-1.009')] [2022-07-10 18:20:02,733][26022] Updated weights on worker 0-0, policy_version 837596 (0.00094) [2022-07-10 18:20:04,927][26022] Updated weights on worker 0-0, policy_version 837606 (0.00101) [2022-07-10 18:20:06,554][26022] Updated weights on worker 0-0, policy_version 837616 (0.00088) [2022-07-10 18:20:07,714][25689] Fps is (10 sec: 5492.3, 60 sec: 5485.7, 300 sec: 5511.6). Total num frames: 857723904. Throughput: 0: 5675.8. Samples: 857730520. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:07,715][25689] Avg episode reward: [(0, '-1.544')] [2022-07-10 18:20:08,531][26022] Updated weights on worker 0-0, policy_version 837626 (0.00087) [2022-07-10 18:20:10,307][26022] Updated weights on worker 0-0, policy_version 837636 (0.00085) [2022-07-10 18:20:12,119][26022] Updated weights on worker 0-0, policy_version 837646 (0.00087) [2022-07-10 18:20:12,783][25689] Fps is (10 sec: 5366.9, 60 sec: 5496.8, 300 sec: 5511.3). Total num frames: 857751552. Throughput: 0: 4821.0. Samples: 857746940. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:12,784][25689] Avg episode reward: [(0, '-1.546')] [2022-07-10 18:20:13,997][26022] Updated weights on worker 0-0, policy_version 837656 (0.00087) [2022-07-10 18:20:15,973][26022] Updated weights on worker 0-0, policy_version 837666 (0.00092) [2022-07-10 18:20:17,699][26022] Updated weights on worker 0-0, policy_version 837676 (0.00083) [2022-07-10 18:20:17,824][25689] Fps is (10 sec: 5672.3, 60 sec: 5530.8, 300 sec: 5517.6). Total num frames: 857781248. Throughput: 0: 5665.7. Samples: 857780202. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:17,824][25689] Avg episode reward: [(0, '-2.295')] [2022-07-10 18:20:19,778][26022] Updated weights on worker 0-0, policy_version 837686 (0.00091) [2022-07-10 18:20:21,240][26022] Updated weights on worker 0-0, policy_version 837696 (0.00089) [2022-07-10 18:20:22,906][25689] Fps is (10 sec: 5563.7, 60 sec: 5492.4, 300 sec: 5510.6). Total num frames: 857807872. Throughput: 0: 5643.4. Samples: 857813480. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:22,906][25689] Avg episode reward: [(0, '-2.206')] [2022-07-10 18:20:23,579][26022] Updated weights on worker 0-0, policy_version 837706 (0.00093) [2022-07-10 18:20:25,123][26022] Updated weights on worker 0-0, policy_version 837716 (0.00084) [2022-07-10 18:20:27,174][26022] Updated weights on worker 0-0, policy_version 837726 (0.00084) [2022-07-10 18:20:27,907][25689] Fps is (10 sec: 5280.7, 60 sec: 5476.7, 300 sec: 5507.5). Total num frames: 857834496. Throughput: 0: 4930.9. Samples: 857830090. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:27,908][25689] Avg episode reward: [(0, '-0.982')] [2022-07-10 18:20:28,905][26022] Updated weights on worker 0-0, policy_version 837736 (0.00088) [2022-07-10 18:20:30,782][26022] Updated weights on worker 0-0, policy_version 837746 (0.00088) [2022-07-10 18:20:32,640][26022] Updated weights on worker 0-0, policy_version 837756 (0.00087) [2022-07-10 18:20:32,992][25689] Fps is (10 sec: 5584.1, 60 sec: 5503.1, 300 sec: 5521.7). Total num frames: 857864192. Throughput: 0: 5752.3. Samples: 857863186. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:32,992][25689] Avg episode reward: [(0, '-1.561')] [2022-07-10 18:20:34,484][26022] Updated weights on worker 0-0, policy_version 837766 (0.00085) [2022-07-10 18:20:36,306][26022] Updated weights on worker 0-0, policy_version 837776 (0.00087) [2022-07-10 18:20:38,033][25689] Fps is (10 sec: 5663.2, 60 sec: 5492.9, 300 sec: 5514.3). Total num frames: 857891840. Throughput: 0: 5752.4. Samples: 857896454. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:38,034][25689] Avg episode reward: [(0, '-0.842')] [2022-07-10 18:20:38,113][26022] Updated weights on worker 0-0, policy_version 837786 (0.00092) [2022-07-10 18:20:39,867][26022] Updated weights on worker 0-0, policy_version 837796 (0.00092) [2022-07-10 18:20:41,984][26022] Updated weights on worker 0-0, policy_version 837806 (0.00096) [2022-07-10 18:20:43,085][25689] Fps is (10 sec: 5478.3, 60 sec: 5492.1, 300 sec: 5513.8). Total num frames: 857919488. Throughput: 0: 4936.3. Samples: 857913092. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:43,088][25689] Avg episode reward: [(0, '-0.644')] [2022-07-10 18:20:43,655][26022] Updated weights on worker 0-0, policy_version 837816 (0.00087) [2022-07-10 18:20:45,665][26022] Updated weights on worker 0-0, policy_version 837826 (0.00086) [2022-07-10 18:20:47,456][26022] Updated weights on worker 0-0, policy_version 837836 (0.00089) [2022-07-10 18:20:48,095][25689] Fps is (10 sec: 5495.8, 60 sec: 5493.0, 300 sec: 5514.0). Total num frames: 857947136. Throughput: 0: 5748.7. Samples: 857946140. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:48,095][25689] Avg episode reward: [(0, '-0.727')] [2022-07-10 18:20:49,376][26022] Updated weights on worker 0-0, policy_version 837846 (0.00088) [2022-07-10 18:20:51,112][26022] Updated weights on worker 0-0, policy_version 837856 (0.00091) [2022-07-10 18:20:52,908][26022] Updated weights on worker 0-0, policy_version 837866 (0.00087) [2022-07-10 18:20:53,105][25689] Fps is (10 sec: 5518.9, 60 sec: 5475.7, 300 sec: 5508.4). Total num frames: 857974784. Throughput: 0: 5774.9. Samples: 857979338. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:53,105][25689] Avg episode reward: [(0, '-1.690')] [2022-07-10 18:20:53,581][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:20:53,590][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000837870_857978880.pth [2022-07-10 18:20:53,591][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000835929_855991296.pth [2022-07-10 18:20:54,829][26022] Updated weights on worker 0-0, policy_version 837876 (0.00083) [2022-07-10 18:20:56,618][26022] Updated weights on worker 0-0, policy_version 837886 (0.00089) [2022-07-10 18:20:58,160][25689] Fps is (10 sec: 5493.7, 60 sec: 5516.5, 300 sec: 5504.9). Total num frames: 858002432. Throughput: 0: 5784.9. Samples: 858012886. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:20:58,162][25689] Avg episode reward: [(0, '-2.807')] [2022-07-10 18:20:58,546][26022] Updated weights on worker 0-0, policy_version 837896 (0.00094) [2022-07-10 18:21:00,373][26022] Updated weights on worker 0-0, policy_version 837906 (0.00092) [2022-07-10 18:21:02,610][26022] Updated weights on worker 0-0, policy_version 837916 (0.00081) [2022-07-10 18:21:03,169][25689] Fps is (10 sec: 5392.8, 60 sec: 5484.1, 300 sec: 5515.3). Total num frames: 858029056. Throughput: 0: 5783.1. Samples: 858029236. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:03,169][25689] Avg episode reward: [(0, '-2.052')] [2022-07-10 18:21:04,473][26022] Updated weights on worker 0-0, policy_version 837926 (0.00085) [2022-07-10 18:21:06,307][26022] Updated weights on worker 0-0, policy_version 837936 (0.00086) [2022-07-10 18:21:08,181][25689] Fps is (10 sec: 5415.6, 60 sec: 5503.8, 300 sec: 5513.1). Total num frames: 858056704. Throughput: 0: 5699.6. Samples: 858060626. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:08,183][25689] Avg episode reward: [(0, '-1.668')] [2022-07-10 18:21:08,184][26022] Updated weights on worker 0-0, policy_version 837946 (0.00183) [2022-07-10 18:21:09,885][26022] Updated weights on worker 0-0, policy_version 837956 (0.00083) [2022-07-10 18:21:11,954][26022] Updated weights on worker 0-0, policy_version 837966 (0.00585) [2022-07-10 18:21:13,186][25689] Fps is (10 sec: 5520.2, 60 sec: 5509.7, 300 sec: 5511.0). Total num frames: 858084352. Throughput: 0: 5682.2. Samples: 858093442. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:13,186][25689] Avg episode reward: [(0, '-1.617')] [2022-07-10 18:21:13,735][26022] Updated weights on worker 0-0, policy_version 837976 (0.00085) [2022-07-10 18:21:15,651][26022] Updated weights on worker 0-0, policy_version 837986 (0.00099) [2022-07-10 18:21:17,412][26022] Updated weights on worker 0-0, policy_version 837996 (0.00086) [2022-07-10 18:21:18,280][25689] Fps is (10 sec: 5475.5, 60 sec: 5470.9, 300 sec: 5513.0). Total num frames: 858112000. Throughput: 0: 4831.5. Samples: 858110098. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:18,281][25689] Avg episode reward: [(0, '-1.789')] [2022-07-10 18:21:19,261][26022] Updated weights on worker 0-0, policy_version 838006 (0.00089) [2022-07-10 18:21:21,089][26022] Updated weights on worker 0-0, policy_version 838016 (0.00094) [2022-07-10 18:21:22,934][26022] Updated weights on worker 0-0, policy_version 838026 (0.01140) [2022-07-10 18:21:23,301][25689] Fps is (10 sec: 5365.5, 60 sec: 5476.5, 300 sec: 5506.0). Total num frames: 858138624. Throughput: 0: 5667.2. Samples: 858143328. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:23,301][25689] Avg episode reward: [(0, '-1.986')] [2022-07-10 18:21:24,857][26022] Updated weights on worker 0-0, policy_version 838036 (0.00083) [2022-07-10 18:21:26,660][26022] Updated weights on worker 0-0, policy_version 838046 (0.00086) [2022-07-10 18:21:28,319][25689] Fps is (10 sec: 5508.4, 60 sec: 5508.9, 300 sec: 5506.1). Total num frames: 858167296. Throughput: 0: 5770.3. Samples: 858176824. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:28,319][25689] Avg episode reward: [(0, '-0.452')] [2022-07-10 18:21:28,393][26022] Updated weights on worker 0-0, policy_version 838056 (0.00091) [2022-07-10 18:21:30,359][26022] Updated weights on worker 0-0, policy_version 838066 (0.00061) [2022-07-10 18:21:32,152][26022] Updated weights on worker 0-0, policy_version 838076 (0.00092) [2022-07-10 18:21:33,382][25689] Fps is (10 sec: 5688.2, 60 sec: 5493.9, 300 sec: 5510.5). Total num frames: 858195968. Throughput: 0: 4945.1. Samples: 858193314. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:33,383][25689] Avg episode reward: [(0, '-0.887')] [2022-07-10 18:21:33,975][26022] Updated weights on worker 0-0, policy_version 838086 (0.00081) [2022-07-10 18:21:36,029][26022] Updated weights on worker 0-0, policy_version 838096 (0.00085) [2022-07-10 18:21:37,640][26022] Updated weights on worker 0-0, policy_version 838106 (0.00085) [2022-07-10 18:21:38,423][25689] Fps is (10 sec: 5573.9, 60 sec: 5493.9, 300 sec: 5509.9). Total num frames: 858223616. Throughput: 0: 5796.7. Samples: 858226860. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:38,423][25689] Avg episode reward: [(0, '-3.385')] [2022-07-10 18:21:39,600][26022] Updated weights on worker 0-0, policy_version 838116 (0.00087) [2022-07-10 18:21:41,383][26022] Updated weights on worker 0-0, policy_version 838126 (0.00080) [2022-07-10 18:21:43,409][26022] Updated weights on worker 0-0, policy_version 838136 (0.00110) [2022-07-10 18:21:43,444][25689] Fps is (10 sec: 5495.3, 60 sec: 5496.7, 300 sec: 5506.9). Total num frames: 858251264. Throughput: 0: 5803.4. Samples: 858260230. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:43,445][25689] Avg episode reward: [(0, '-2.348')] [2022-07-10 18:21:45,302][26022] Updated weights on worker 0-0, policy_version 838146 (0.00094) [2022-07-10 18:21:46,968][26022] Updated weights on worker 0-0, policy_version 838156 (0.00090) [2022-07-10 18:21:48,468][25689] Fps is (10 sec: 5504.7, 60 sec: 5495.4, 300 sec: 5510.1). Total num frames: 858278912. Throughput: 0: 4966.6. Samples: 858276898. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:48,469][25689] Avg episode reward: [(0, '-2.690')] [2022-07-10 18:21:48,840][26022] Updated weights on worker 0-0, policy_version 838166 (0.00091) [2022-07-10 18:21:50,554][26022] Updated weights on worker 0-0, policy_version 838176 (0.00096) [2022-07-10 18:21:52,693][26022] Updated weights on worker 0-0, policy_version 838186 (0.00049) [2022-07-10 18:21:53,471][25689] Fps is (10 sec: 5617.1, 60 sec: 5513.0, 300 sec: 5508.3). Total num frames: 858307584. Throughput: 0: 5830.2. Samples: 858310436. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:53,471][25689] Avg episode reward: [(0, '-3.476')] [2022-07-10 18:21:54,174][26022] Updated weights on worker 0-0, policy_version 838196 (0.00091) [2022-07-10 18:21:56,228][26022] Updated weights on worker 0-0, policy_version 838206 (0.00090) [2022-07-10 18:21:58,071][26022] Updated weights on worker 0-0, policy_version 838216 (0.00082) [2022-07-10 18:21:58,566][25689] Fps is (10 sec: 5476.1, 60 sec: 5492.4, 300 sec: 5500.4). Total num frames: 858334208. Throughput: 0: 5812.3. Samples: 858343936. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:21:58,566][25689] Avg episode reward: [(0, '-3.253')] [2022-07-10 18:21:59,924][26022] Updated weights on worker 0-0, policy_version 838226 (0.00055) [2022-07-10 18:22:01,739][26022] Updated weights on worker 0-0, policy_version 838236 (0.00094) [2022-07-10 18:22:03,585][25689] Fps is (10 sec: 5264.9, 60 sec: 5491.5, 300 sec: 5507.1). Total num frames: 858360832. Throughput: 0: 4985.0. Samples: 858360630. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:03,585][25689] Avg episode reward: [(0, '-3.053')] [2022-07-10 18:22:03,970][26022] Updated weights on worker 0-0, policy_version 838246 (0.00092) [2022-07-10 18:22:05,732][26022] Updated weights on worker 0-0, policy_version 838256 (0.00083) [2022-07-10 18:22:07,881][26022] Updated weights on worker 0-0, policy_version 838266 (0.00092) [2022-07-10 18:22:08,611][25689] Fps is (10 sec: 5504.9, 60 sec: 5507.3, 300 sec: 5503.7). Total num frames: 858389504. Throughput: 0: 5693.4. Samples: 858391578. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:08,612][25689] Avg episode reward: [(0, '-1.339')] [2022-07-10 18:22:09,455][26022] Updated weights on worker 0-0, policy_version 838276 (0.00094) [2022-07-10 18:22:11,491][26022] Updated weights on worker 0-0, policy_version 838286 (0.00096) [2022-07-10 18:22:13,123][26022] Updated weights on worker 0-0, policy_version 838296 (0.00098) [2022-07-10 18:22:13,635][25689] Fps is (10 sec: 5502.1, 60 sec: 5488.5, 300 sec: 5501.4). Total num frames: 858416128. Throughput: 0: 5677.9. Samples: 858424924. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:13,636][25689] Avg episode reward: [(0, '-1.453')] [2022-07-10 18:22:15,133][26022] Updated weights on worker 0-0, policy_version 838306 (0.00084) [2022-07-10 18:22:16,917][26022] Updated weights on worker 0-0, policy_version 838316 (0.00088) [2022-07-10 18:22:18,678][25689] Fps is (10 sec: 5391.2, 60 sec: 5493.2, 300 sec: 5504.4). Total num frames: 858443776. Throughput: 0: 4850.8. Samples: 858441494. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:18,678][25689] Avg episode reward: [(0, '-0.847')] [2022-07-10 18:22:18,925][26022] Updated weights on worker 0-0, policy_version 838326 (0.00088) [2022-07-10 18:22:20,579][26022] Updated weights on worker 0-0, policy_version 838336 (0.00092) [2022-07-10 18:22:22,671][26022] Updated weights on worker 0-0, policy_version 838346 (0.00088) [2022-07-10 18:22:23,687][25689] Fps is (10 sec: 5704.6, 60 sec: 5545.1, 300 sec: 5507.7). Total num frames: 858473472. Throughput: 0: 5692.1. Samples: 858475054. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:23,688][25689] Avg episode reward: [(0, '-0.437')] [2022-07-10 18:22:24,071][26022] Updated weights on worker 0-0, policy_version 838356 (0.00088) [2022-07-10 18:22:26,303][26022] Updated weights on worker 0-0, policy_version 838366 (0.00088) [2022-07-10 18:22:28,018][26022] Updated weights on worker 0-0, policy_version 838376 (0.00087) [2022-07-10 18:22:28,692][25689] Fps is (10 sec: 5726.0, 60 sec: 5529.3, 300 sec: 5511.4). Total num frames: 858501120. Throughput: 0: 5817.2. Samples: 858508396. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:28,693][25689] Avg episode reward: [(0, '-0.236')] [2022-07-10 18:22:29,915][26022] Updated weights on worker 0-0, policy_version 838386 (0.00086) [2022-07-10 18:22:31,724][26022] Updated weights on worker 0-0, policy_version 838396 (0.00055) [2022-07-10 18:22:33,350][26022] Updated weights on worker 0-0, policy_version 838406 (0.00045) [2022-07-10 18:22:33,727][25689] Fps is (10 sec: 5507.3, 60 sec: 5514.9, 300 sec: 5505.7). Total num frames: 858528768. Throughput: 0: 4985.9. Samples: 858525104. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:33,728][25689] Avg episode reward: [(0, '0.328')] [2022-07-10 18:22:35,409][26022] Updated weights on worker 0-0, policy_version 838416 (0.00398) [2022-07-10 18:22:37,032][26022] Updated weights on worker 0-0, policy_version 838426 (0.00081) [2022-07-10 18:22:38,815][25689] Fps is (10 sec: 5563.6, 60 sec: 5527.6, 300 sec: 5504.5). Total num frames: 858557440. Throughput: 0: 5832.2. Samples: 858558940. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:38,819][25689] Avg episode reward: [(0, '0.283')] [2022-07-10 18:22:38,914][26022] Updated weights on worker 0-0, policy_version 838436 (0.00086) [2022-07-10 18:22:40,959][26022] Updated weights on worker 0-0, policy_version 838446 (0.00086) [2022-07-10 18:22:42,596][26022] Updated weights on worker 0-0, policy_version 838456 (0.00084) [2022-07-10 18:22:43,897][25689] Fps is (10 sec: 5538.1, 60 sec: 5522.1, 300 sec: 5509.9). Total num frames: 858585088. Throughput: 0: 5821.1. Samples: 858592698. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:43,897][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 18:22:44,653][26022] Updated weights on worker 0-0, policy_version 838466 (0.00082) [2022-07-10 18:22:46,280][26022] Updated weights on worker 0-0, policy_version 838476 (0.00094) [2022-07-10 18:22:48,158][26022] Updated weights on worker 0-0, policy_version 838486 (0.00091) [2022-07-10 18:22:48,924][25689] Fps is (10 sec: 5571.3, 60 sec: 5538.7, 300 sec: 5509.8). Total num frames: 858613760. Throughput: 0: 4982.7. Samples: 858609206. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:48,925][25689] Avg episode reward: [(0, '1.076')] [2022-07-10 18:22:49,988][26022] Updated weights on worker 0-0, policy_version 838496 (0.00088) [2022-07-10 18:22:51,731][26022] Updated weights on worker 0-0, policy_version 838506 (0.00091) [2022-07-10 18:22:53,659][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:22:53,669][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000838516_858640384.pth [2022-07-10 18:22:53,669][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000836576_856653824.pth [2022-07-10 18:22:53,671][26022] Updated weights on worker 0-0, policy_version 838516 (0.00080) [2022-07-10 18:22:53,930][25689] Fps is (10 sec: 5511.3, 60 sec: 5504.5, 300 sec: 5504.1). Total num frames: 858640384. Throughput: 0: 5833.3. Samples: 858642952. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:53,931][25689] Avg episode reward: [(0, '0.304')] [2022-07-10 18:22:55,301][26022] Updated weights on worker 0-0, policy_version 838526 (0.00086) [2022-07-10 18:22:57,199][26022] Updated weights on worker 0-0, policy_version 838536 (0.00088) [2022-07-10 18:22:58,993][25689] Fps is (10 sec: 5593.5, 60 sec: 5558.3, 300 sec: 5511.0). Total num frames: 858670080. Throughput: 0: 5826.9. Samples: 858676514. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:22:58,993][25689] Avg episode reward: [(0, '0.180')] [2022-07-10 18:22:59,223][26022] Updated weights on worker 0-0, policy_version 838546 (0.00090) [2022-07-10 18:23:00,850][26022] Updated weights on worker 0-0, policy_version 838556 (0.00098) [2022-07-10 18:23:03,300][26022] Updated weights on worker 0-0, policy_version 838566 (0.00089) [2022-07-10 18:23:04,048][25689] Fps is (10 sec: 5566.4, 60 sec: 5555.0, 300 sec: 5513.5). Total num frames: 858696704. Throughput: 0: 4995.1. Samples: 858693354. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:04,048][25689] Avg episode reward: [(0, '-0.150')] [2022-07-10 18:23:04,927][26022] Updated weights on worker 0-0, policy_version 838576 (0.00087) [2022-07-10 18:23:06,858][26022] Updated weights on worker 0-0, policy_version 838586 (0.00087) [2022-07-10 18:23:08,678][26022] Updated weights on worker 0-0, policy_version 838596 (0.00093) [2022-07-10 18:23:09,061][25689] Fps is (10 sec: 5288.6, 60 sec: 5522.3, 300 sec: 5507.0). Total num frames: 858723328. Throughput: 0: 5733.4. Samples: 858724660. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:09,062][25689] Avg episode reward: [(0, '-0.164')] [2022-07-10 18:23:10,369][26022] Updated weights on worker 0-0, policy_version 838606 (0.00095) [2022-07-10 18:23:12,487][26022] Updated weights on worker 0-0, policy_version 838616 (0.00094) [2022-07-10 18:23:14,063][26022] Updated weights on worker 0-0, policy_version 838626 (0.00083) [2022-07-10 18:23:14,070][25689] Fps is (10 sec: 5619.8, 60 sec: 5574.5, 300 sec: 5514.8). Total num frames: 858753024. Throughput: 0: 5724.4. Samples: 858758238. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:14,070][25689] Avg episode reward: [(0, '-0.235')] [2022-07-10 18:23:16,156][26022] Updated weights on worker 0-0, policy_version 838636 (0.01029) [2022-07-10 18:23:18,007][26022] Updated weights on worker 0-0, policy_version 838646 (0.00398) [2022-07-10 18:23:19,157][25689] Fps is (10 sec: 5578.4, 60 sec: 5553.4, 300 sec: 5511.6). Total num frames: 858779648. Throughput: 0: 4873.1. Samples: 858774778. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:19,158][25689] Avg episode reward: [(0, '-0.544')] [2022-07-10 18:23:19,859][26022] Updated weights on worker 0-0, policy_version 838656 (0.00086) [2022-07-10 18:23:21,702][26022] Updated weights on worker 0-0, policy_version 838666 (0.00091) [2022-07-10 18:23:23,377][26022] Updated weights on worker 0-0, policy_version 838676 (0.00086) [2022-07-10 18:23:24,162][25689] Fps is (10 sec: 5478.9, 60 sec: 5536.9, 300 sec: 5505.8). Total num frames: 858808320. Throughput: 0: 5712.1. Samples: 858808248. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:24,163][25689] Avg episode reward: [(0, '-0.367')] [2022-07-10 18:23:25,188][26022] Updated weights on worker 0-0, policy_version 838686 (0.00090) [2022-07-10 18:23:26,994][26022] Updated weights on worker 0-0, policy_version 838696 (0.00085) [2022-07-10 18:23:28,840][26022] Updated weights on worker 0-0, policy_version 838706 (0.00111) [2022-07-10 18:23:29,175][25689] Fps is (10 sec: 5622.0, 60 sec: 5536.2, 300 sec: 5513.1). Total num frames: 858835968. Throughput: 0: 5826.6. Samples: 858841856. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:29,177][25689] Avg episode reward: [(0, '-0.101')] [2022-07-10 18:23:31,013][26022] Updated weights on worker 0-0, policy_version 838716 (0.00090) [2022-07-10 18:23:32,455][26022] Updated weights on worker 0-0, policy_version 838726 (0.00063) [2022-07-10 18:23:34,213][25689] Fps is (10 sec: 5399.9, 60 sec: 5519.1, 300 sec: 5506.5). Total num frames: 858862592. Throughput: 0: 4973.8. Samples: 858858426. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:34,213][25689] Avg episode reward: [(0, '0.131')] [2022-07-10 18:23:34,584][26022] Updated weights on worker 0-0, policy_version 838736 (0.00085) [2022-07-10 18:23:36,487][26022] Updated weights on worker 0-0, policy_version 838746 (0.00079) [2022-07-10 18:23:38,355][26022] Updated weights on worker 0-0, policy_version 838756 (0.00097) [2022-07-10 18:23:39,332][25689] Fps is (10 sec: 5544.9, 60 sec: 5533.1, 300 sec: 5512.0). Total num frames: 858892288. Throughput: 0: 5788.6. Samples: 858891562. Policy #0 lag: (min: 0.0, avg: 6.9, max: 19.0) [2022-07-10 18:23:39,333][25689] Avg episode reward: [(0, '-0.259')] [2022-07-10 18:23:39,990][26022] Updated weights on worker 0-0, policy_version 838766 (0.00094) [2022-07-10 18:23:41,925][26022] Updated weights on worker 0-0, policy_version 838776 (0.00091) [2022-07-10 18:23:43,669][26022] Updated weights on worker 0-0, policy_version 838786 (0.00086) [2022-07-10 18:23:44,348][25689] Fps is (10 sec: 5758.9, 60 sec: 5556.1, 300 sec: 5513.4). Total num frames: 858920960. Throughput: 0: 5772.5. Samples: 858924770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:23:44,348][25689] Avg episode reward: [(0, '-0.762')] [2022-07-10 18:23:45,968][26022] Updated weights on worker 0-0, policy_version 838796 (0.00091) [2022-07-10 18:23:47,391][26022] Updated weights on worker 0-0, policy_version 838806 (0.00084) [2022-07-10 18:23:49,357][25689] Fps is (10 sec: 5311.6, 60 sec: 5490.0, 300 sec: 5501.0). Total num frames: 858945536. Throughput: 0: 4922.2. Samples: 858941196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:23:49,358][25689] Avg episode reward: [(0, '-0.158')] [2022-07-10 18:23:49,593][26022] Updated weights on worker 0-0, policy_version 838816 (0.00085) [2022-07-10 18:23:50,987][26022] Updated weights on worker 0-0, policy_version 838826 (0.00104) [2022-07-10 18:23:53,298][26022] Updated weights on worker 0-0, policy_version 838836 (0.00088) [2022-07-10 18:23:54,378][25689] Fps is (10 sec: 5512.9, 60 sec: 5556.4, 300 sec: 5512.7). Total num frames: 858976256. Throughput: 0: 5757.4. Samples: 858974528. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:23:54,380][25689] Avg episode reward: [(0, '-1.167')] [2022-07-10 18:23:54,841][26022] Updated weights on worker 0-0, policy_version 838846 (0.00087) [2022-07-10 18:23:56,891][26022] Updated weights on worker 0-0, policy_version 838856 (0.00091) [2022-07-10 18:23:58,456][26022] Updated weights on worker 0-0, policy_version 838866 (0.00093) [2022-07-10 18:23:59,483][25689] Fps is (10 sec: 5764.3, 60 sec: 5518.6, 300 sec: 5518.4). Total num frames: 859003904. Throughput: 0: 5774.7. Samples: 859007926. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:23:59,484][25689] Avg episode reward: [(0, '-1.051')] [2022-07-10 18:24:00,714][26022] Updated weights on worker 0-0, policy_version 838876 (0.00093) [2022-07-10 18:24:02,459][26022] Updated weights on worker 0-0, policy_version 838886 (0.00087) [2022-07-10 18:24:04,453][26022] Updated weights on worker 0-0, policy_version 838896 (0.00090) [2022-07-10 18:24:04,549][25689] Fps is (10 sec: 5235.5, 60 sec: 5500.7, 300 sec: 5507.2). Total num frames: 859029504. Throughput: 0: 4941.7. Samples: 859024596. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:04,551][25689] Avg episode reward: [(0, '-0.756')] [2022-07-10 18:24:06,283][26022] Updated weights on worker 0-0, policy_version 838906 (0.00087) [2022-07-10 18:24:08,084][26022] Updated weights on worker 0-0, policy_version 838916 (0.00091) [2022-07-10 18:24:09,558][25689] Fps is (10 sec: 5285.1, 60 sec: 5518.0, 300 sec: 5510.6). Total num frames: 859057152. Throughput: 0: 5696.8. Samples: 859056276. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:09,559][25689] Avg episode reward: [(0, '-1.200')] [2022-07-10 18:24:09,903][26022] Updated weights on worker 0-0, policy_version 838926 (0.00089) [2022-07-10 18:24:11,657][26022] Updated weights on worker 0-0, policy_version 838936 (0.00090) [2022-07-10 18:24:13,595][26022] Updated weights on worker 0-0, policy_version 838946 (0.00091) [2022-07-10 18:24:14,568][25689] Fps is (10 sec: 5723.6, 60 sec: 5517.9, 300 sec: 5518.1). Total num frames: 859086848. Throughput: 0: 5727.2. Samples: 859090158. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:14,568][25689] Avg episode reward: [(0, '-0.642')] [2022-07-10 18:24:15,497][26022] Updated weights on worker 0-0, policy_version 838956 (0.00092) [2022-07-10 18:24:17,275][26022] Updated weights on worker 0-0, policy_version 838966 (0.00085) [2022-07-10 18:24:19,219][26022] Updated weights on worker 0-0, policy_version 838976 (0.00821) [2022-07-10 18:24:19,615][25689] Fps is (10 sec: 5498.3, 60 sec: 5504.7, 300 sec: 5507.4). Total num frames: 859112448. Throughput: 0: 5726.3. Samples: 859123208. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:19,616][25689] Avg episode reward: [(0, '-0.649')] [2022-07-10 18:24:20,909][26022] Updated weights on worker 0-0, policy_version 838986 (0.00087) [2022-07-10 18:24:22,911][26022] Updated weights on worker 0-0, policy_version 838996 (0.00087) [2022-07-10 18:24:24,708][25689] Fps is (10 sec: 5352.5, 60 sec: 5496.7, 300 sec: 5509.4). Total num frames: 859141120. Throughput: 0: 5715.4. Samples: 859139810. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:24,708][25689] Avg episode reward: [(0, '-0.810')] [2022-07-10 18:24:24,713][26022] Updated weights on worker 0-0, policy_version 839006 (0.00090) [2022-07-10 18:24:26,626][26022] Updated weights on worker 0-0, policy_version 839016 (0.00090) [2022-07-10 18:24:28,515][26022] Updated weights on worker 0-0, policy_version 839026 (0.00087) [2022-07-10 18:24:29,713][25689] Fps is (10 sec: 5679.0, 60 sec: 5514.3, 300 sec: 5512.9). Total num frames: 859169792. Throughput: 0: 5793.8. Samples: 859173048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:29,713][25689] Avg episode reward: [(0, '-1.041')] [2022-07-10 18:24:30,283][26022] Updated weights on worker 0-0, policy_version 839036 (0.00086) [2022-07-10 18:24:32,203][26022] Updated weights on worker 0-0, policy_version 839046 (0.00093) [2022-07-10 18:24:33,839][26022] Updated weights on worker 0-0, policy_version 839056 (0.00085) [2022-07-10 18:24:34,739][25689] Fps is (10 sec: 5512.0, 60 sec: 5515.3, 300 sec: 5507.6). Total num frames: 859196416. Throughput: 0: 5757.7. Samples: 859206300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:34,740][25689] Avg episode reward: [(0, '-1.261')] [2022-07-10 18:24:35,883][26022] Updated weights on worker 0-0, policy_version 839066 (0.00085) [2022-07-10 18:24:37,575][26022] Updated weights on worker 0-0, policy_version 839076 (0.00089) [2022-07-10 18:24:39,512][26022] Updated weights on worker 0-0, policy_version 839086 (0.00092) [2022-07-10 18:24:39,864][25689] Fps is (10 sec: 5547.8, 60 sec: 5514.8, 300 sec: 5513.0). Total num frames: 859226112. Throughput: 0: 4923.3. Samples: 859222902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:39,865][25689] Avg episode reward: [(0, '-1.688')] [2022-07-10 18:24:41,367][26022] Updated weights on worker 0-0, policy_version 839096 (0.00088) [2022-07-10 18:24:43,182][26022] Updated weights on worker 0-0, policy_version 839106 (0.00091) [2022-07-10 18:24:44,932][25689] Fps is (10 sec: 5525.4, 60 sec: 5476.3, 300 sec: 5508.6). Total num frames: 859252736. Throughput: 0: 5758.9. Samples: 859256282. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:44,933][25689] Avg episode reward: [(0, '-1.735')] [2022-07-10 18:24:45,221][26022] Updated weights on worker 0-0, policy_version 839116 (0.00087) [2022-07-10 18:24:46,840][26022] Updated weights on worker 0-0, policy_version 839126 (0.00085) [2022-07-10 18:24:48,843][26022] Updated weights on worker 0-0, policy_version 839136 (0.00090) [2022-07-10 18:24:49,949][25689] Fps is (10 sec: 5483.3, 60 sec: 5543.2, 300 sec: 5508.4). Total num frames: 859281408. Throughput: 0: 5753.9. Samples: 859289486. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:49,949][25689] Avg episode reward: [(0, '-0.930')] [2022-07-10 18:24:50,571][26022] Updated weights on worker 0-0, policy_version 839146 (0.00080) [2022-07-10 18:24:52,320][26022] Updated weights on worker 0-0, policy_version 839156 (0.00100) [2022-07-10 18:24:53,826][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:24:53,839][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000839164_859303936.pth [2022-07-10 18:24:53,840][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000837224_857317376.pth [2022-07-10 18:24:54,417][26022] Updated weights on worker 0-0, policy_version 839166 (0.00091) [2022-07-10 18:24:54,956][25689] Fps is (10 sec: 5618.7, 60 sec: 5493.8, 300 sec: 5517.6). Total num frames: 859309056. Throughput: 0: 4953.9. Samples: 859306452. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:24:54,957][25689] Avg episode reward: [(0, '-1.159')] [2022-07-10 18:24:56,116][26022] Updated weights on worker 0-0, policy_version 839176 (0.00089) [2022-07-10 18:24:57,900][26022] Updated weights on worker 0-0, policy_version 839186 (0.00087) [2022-07-10 18:24:59,758][26022] Updated weights on worker 0-0, policy_version 839196 (0.00084) [2022-07-10 18:25:00,086][25689] Fps is (10 sec: 5455.1, 60 sec: 5491.5, 300 sec: 5512.2). Total num frames: 859336704. Throughput: 0: 5791.5. Samples: 859340012. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:00,086][25689] Avg episode reward: [(0, '-1.384')] [2022-07-10 18:25:01,488][26022] Updated weights on worker 0-0, policy_version 839206 (0.00089) [2022-07-10 18:25:04,016][26022] Updated weights on worker 0-0, policy_version 839216 (0.00088) [2022-07-10 18:25:05,146][25689] Fps is (10 sec: 5426.8, 60 sec: 5525.9, 300 sec: 5515.4). Total num frames: 859364352. Throughput: 0: 5692.0. Samples: 859371336. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:05,146][25689] Avg episode reward: [(0, '-1.348')] [2022-07-10 18:25:05,515][26022] Updated weights on worker 0-0, policy_version 839226 (0.00085) [2022-07-10 18:25:07,508][26022] Updated weights on worker 0-0, policy_version 839236 (0.00086) [2022-07-10 18:25:09,179][26022] Updated weights on worker 0-0, policy_version 839246 (0.00086) [2022-07-10 18:25:10,156][25689] Fps is (10 sec: 5490.8, 60 sec: 5525.7, 300 sec: 5516.5). Total num frames: 859392000. Throughput: 0: 4890.3. Samples: 859388306. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:10,157][25689] Avg episode reward: [(0, '-1.331')] [2022-07-10 18:25:11,165][26022] Updated weights on worker 0-0, policy_version 839256 (0.00089) [2022-07-10 18:25:12,922][26022] Updated weights on worker 0-0, policy_version 839266 (0.00079) [2022-07-10 18:25:14,879][26022] Updated weights on worker 0-0, policy_version 839276 (0.00080) [2022-07-10 18:25:15,171][25689] Fps is (10 sec: 5515.7, 60 sec: 5491.5, 300 sec: 5510.1). Total num frames: 859419648. Throughput: 0: 5707.5. Samples: 859421828. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:15,171][25689] Avg episode reward: [(0, '-0.631')] [2022-07-10 18:25:16,760][26022] Updated weights on worker 0-0, policy_version 839286 (0.00089) [2022-07-10 18:25:18,667][26022] Updated weights on worker 0-0, policy_version 839296 (0.00090) [2022-07-10 18:25:20,288][25689] Fps is (10 sec: 5457.6, 60 sec: 5518.9, 300 sec: 5512.9). Total num frames: 859447296. Throughput: 0: 5677.0. Samples: 859454702. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:20,289][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 18:25:20,498][26022] Updated weights on worker 0-0, policy_version 839306 (0.00079) [2022-07-10 18:25:22,150][26022] Updated weights on worker 0-0, policy_version 839316 (0.00092) [2022-07-10 18:25:24,292][26022] Updated weights on worker 0-0, policy_version 839326 (0.00093) [2022-07-10 18:25:25,349][25689] Fps is (10 sec: 5634.2, 60 sec: 5538.7, 300 sec: 5522.1). Total num frames: 859476992. Throughput: 0: 4965.9. Samples: 859471662. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:25,350][25689] Avg episode reward: [(0, '-0.605')] [2022-07-10 18:25:25,860][26022] Updated weights on worker 0-0, policy_version 839336 (0.00088) [2022-07-10 18:25:27,783][26022] Updated weights on worker 0-0, policy_version 839346 (0.00092) [2022-07-10 18:25:29,459][26022] Updated weights on worker 0-0, policy_version 839356 (0.00094) [2022-07-10 18:25:30,375][25689] Fps is (10 sec: 5583.6, 60 sec: 5503.0, 300 sec: 5512.8). Total num frames: 859503616. Throughput: 0: 5781.3. Samples: 859505196. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:30,375][25689] Avg episode reward: [(0, '-0.390')] [2022-07-10 18:25:31,491][26022] Updated weights on worker 0-0, policy_version 839366 (0.00088) [2022-07-10 18:25:33,301][26022] Updated weights on worker 0-0, policy_version 839376 (0.00090) [2022-07-10 18:25:35,340][26022] Updated weights on worker 0-0, policy_version 839386 (0.00095) [2022-07-10 18:25:35,383][25689] Fps is (10 sec: 5510.6, 60 sec: 5538.5, 300 sec: 5516.9). Total num frames: 859532288. Throughput: 0: 5796.1. Samples: 859538980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:35,384][25689] Avg episode reward: [(0, '-1.351')] [2022-07-10 18:25:36,785][26022] Updated weights on worker 0-0, policy_version 839396 (0.00094) [2022-07-10 18:25:38,836][26022] Updated weights on worker 0-0, policy_version 839406 (0.00093) [2022-07-10 18:25:40,426][25689] Fps is (10 sec: 5705.3, 60 sec: 5529.1, 300 sec: 5520.5). Total num frames: 859560960. Throughput: 0: 5011.1. Samples: 859555614. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:40,428][25689] Avg episode reward: [(0, '-1.148')] [2022-07-10 18:25:40,733][26022] Updated weights on worker 0-0, policy_version 839416 (0.00092) [2022-07-10 18:25:42,573][26022] Updated weights on worker 0-0, policy_version 839426 (0.00092) [2022-07-10 18:25:44,392][26022] Updated weights on worker 0-0, policy_version 839436 (0.00094) [2022-07-10 18:25:45,437][25689] Fps is (10 sec: 5499.9, 60 sec: 5534.2, 300 sec: 5517.0). Total num frames: 859587584. Throughput: 0: 5843.6. Samples: 859589050. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:45,439][25689] Avg episode reward: [(0, '-0.970')] [2022-07-10 18:25:46,175][26022] Updated weights on worker 0-0, policy_version 839446 (0.00093) [2022-07-10 18:25:48,116][26022] Updated weights on worker 0-0, policy_version 839456 (0.00080) [2022-07-10 18:25:49,947][26022] Updated weights on worker 0-0, policy_version 839466 (0.00084) [2022-07-10 18:25:50,443][25689] Fps is (10 sec: 5417.9, 60 sec: 5518.3, 300 sec: 5517.1). Total num frames: 859615232. Throughput: 0: 5818.4. Samples: 859621960. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:50,445][25689] Avg episode reward: [(0, '-2.125')] [2022-07-10 18:25:51,800][26022] Updated weights on worker 0-0, policy_version 839476 (0.00102) [2022-07-10 18:25:53,611][26022] Updated weights on worker 0-0, policy_version 839486 (0.00087) [2022-07-10 18:25:55,279][26022] Updated weights on worker 0-0, policy_version 839496 (0.00092) [2022-07-10 18:25:55,473][25689] Fps is (10 sec: 5612.1, 60 sec: 5533.2, 300 sec: 5521.0). Total num frames: 859643904. Throughput: 0: 4958.5. Samples: 859638594. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:25:55,473][25689] Avg episode reward: [(0, '-2.131')] [2022-07-10 18:25:57,492][26022] Updated weights on worker 0-0, policy_version 839506 (0.00086) [2022-07-10 18:25:59,145][26022] Updated weights on worker 0-0, policy_version 839516 (0.00094) [2022-07-10 18:26:00,508][25689] Fps is (10 sec: 5493.9, 60 sec: 5524.9, 300 sec: 5520.5). Total num frames: 859670528. Throughput: 0: 5796.8. Samples: 859672026. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:00,510][25689] Avg episode reward: [(0, '-2.705')] [2022-07-10 18:26:01,099][26022] Updated weights on worker 0-0, policy_version 839526 (0.00085) [2022-07-10 18:26:03,187][26022] Updated weights on worker 0-0, policy_version 839536 (0.00087) [2022-07-10 18:26:05,082][26022] Updated weights on worker 0-0, policy_version 839546 (0.00097) [2022-07-10 18:26:05,538][25689] Fps is (10 sec: 5188.7, 60 sec: 5493.7, 300 sec: 5513.3). Total num frames: 859696128. Throughput: 0: 5669.4. Samples: 859703008. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:05,538][25689] Avg episode reward: [(0, '-2.019')] [2022-07-10 18:26:07,097][26022] Updated weights on worker 0-0, policy_version 839556 (0.00086) [2022-07-10 18:26:08,946][26022] Updated weights on worker 0-0, policy_version 839566 (0.00091) [2022-07-10 18:26:10,427][26022] Updated weights on worker 0-0, policy_version 839576 (0.00085) [2022-07-10 18:26:10,555][25689] Fps is (10 sec: 5503.8, 60 sec: 5527.0, 300 sec: 5520.0). Total num frames: 859725824. Throughput: 0: 4868.5. Samples: 859719874. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:10,557][25689] Avg episode reward: [(0, '-2.463')] [2022-07-10 18:26:12,571][26022] Updated weights on worker 0-0, policy_version 839586 (0.00086) [2022-07-10 18:26:13,977][26022] Updated weights on worker 0-0, policy_version 839596 (0.00088) [2022-07-10 18:26:15,585][25689] Fps is (10 sec: 5605.4, 60 sec: 5508.6, 300 sec: 5517.7). Total num frames: 859752448. Throughput: 0: 5718.2. Samples: 859753600. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:15,586][25689] Avg episode reward: [(0, '-2.548')] [2022-07-10 18:26:16,191][26022] Updated weights on worker 0-0, policy_version 839606 (0.00080) [2022-07-10 18:26:17,886][26022] Updated weights on worker 0-0, policy_version 839616 (0.00098) [2022-07-10 18:26:19,788][26022] Updated weights on worker 0-0, policy_version 839626 (0.00089) [2022-07-10 18:26:20,694][25689] Fps is (10 sec: 5554.7, 60 sec: 5543.3, 300 sec: 5526.4). Total num frames: 859782144. Throughput: 0: 5678.0. Samples: 859786642. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:20,694][25689] Avg episode reward: [(0, '-2.493')] [2022-07-10 18:26:21,875][26022] Updated weights on worker 0-0, policy_version 839636 (0.00084) [2022-07-10 18:26:23,391][26022] Updated weights on worker 0-0, policy_version 839646 (0.00087) [2022-07-10 18:26:25,462][26022] Updated weights on worker 0-0, policy_version 839656 (0.00083) [2022-07-10 18:26:25,697][25689] Fps is (10 sec: 5671.1, 60 sec: 5514.7, 300 sec: 5523.3). Total num frames: 859809792. Throughput: 0: 4980.1. Samples: 859803402. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:25,697][25689] Avg episode reward: [(0, '-2.200')] [2022-07-10 18:26:27,153][26022] Updated weights on worker 0-0, policy_version 839666 (0.00091) [2022-07-10 18:26:29,058][26022] Updated weights on worker 0-0, policy_version 839676 (0.00376) [2022-07-10 18:26:30,713][25689] Fps is (10 sec: 5518.8, 60 sec: 5532.5, 300 sec: 5520.7). Total num frames: 859837440. Throughput: 0: 5810.3. Samples: 859837002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:30,714][25689] Avg episode reward: [(0, '-1.657')] [2022-07-10 18:26:30,904][26022] Updated weights on worker 0-0, policy_version 839686 (0.00085) [2022-07-10 18:26:32,612][26022] Updated weights on worker 0-0, policy_version 839696 (0.00086) [2022-07-10 18:26:34,528][26022] Updated weights on worker 0-0, policy_version 839706 (0.00094) [2022-07-10 18:26:35,739][25689] Fps is (10 sec: 5608.5, 60 sec: 5531.0, 300 sec: 5524.4). Total num frames: 859866112. Throughput: 0: 5804.8. Samples: 859870588. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:35,739][25689] Avg episode reward: [(0, '-1.582')] [2022-07-10 18:26:36,404][26022] Updated weights on worker 0-0, policy_version 839716 (0.00091) [2022-07-10 18:26:38,161][26022] Updated weights on worker 0-0, policy_version 839726 (0.00088) [2022-07-10 18:26:39,846][26022] Updated weights on worker 0-0, policy_version 839736 (0.00095) [2022-07-10 18:26:40,859][25689] Fps is (10 sec: 5450.5, 60 sec: 5490.0, 300 sec: 5519.1). Total num frames: 859892736. Throughput: 0: 4997.4. Samples: 859887414. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:40,859][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 18:26:41,714][26022] Updated weights on worker 0-0, policy_version 839746 (0.00084) [2022-07-10 18:26:43,692][26022] Updated weights on worker 0-0, policy_version 839756 (0.00091) [2022-07-10 18:26:45,379][26022] Updated weights on worker 0-0, policy_version 839766 (0.00090) [2022-07-10 18:26:45,875][25689] Fps is (10 sec: 5556.3, 60 sec: 5540.4, 300 sec: 5526.2). Total num frames: 859922432. Throughput: 0: 5848.4. Samples: 859921412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:45,875][25689] Avg episode reward: [(0, '0.309')] [2022-07-10 18:26:47,427][26022] Updated weights on worker 0-0, policy_version 839776 (0.00088) [2022-07-10 18:26:49,112][26022] Updated weights on worker 0-0, policy_version 839786 (0.00091) [2022-07-10 18:26:50,883][25689] Fps is (10 sec: 5720.3, 60 sec: 5540.1, 300 sec: 5522.6). Total num frames: 859950080. Throughput: 0: 5836.5. Samples: 859954724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:50,884][25689] Avg episode reward: [(0, '-0.221')] [2022-07-10 18:26:50,986][26022] Updated weights on worker 0-0, policy_version 839796 (0.00092) [2022-07-10 18:26:52,798][26022] Updated weights on worker 0-0, policy_version 839806 (0.00089) [2022-07-10 18:26:53,958][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:26:53,972][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000839811_859966464.pth [2022-07-10 18:26:53,973][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000837870_857978880.pth [2022-07-10 18:26:54,592][26022] Updated weights on worker 0-0, policy_version 839816 (0.00088) [2022-07-10 18:26:55,969][25689] Fps is (10 sec: 5478.2, 60 sec: 5518.1, 300 sec: 5526.2). Total num frames: 859977728. Throughput: 0: 5805.4. Samples: 859988032. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:26:55,969][25689] Avg episode reward: [(0, '-0.728')] [2022-07-10 18:26:56,616][26022] Updated weights on worker 0-0, policy_version 839826 (0.00084) [2022-07-10 18:26:58,247][26022] Updated weights on worker 0-0, policy_version 839836 (0.00085) [2022-07-10 18:27:00,160][26022] Updated weights on worker 0-0, policy_version 839846 (0.00090) [2022-07-10 18:27:01,070][25689] Fps is (10 sec: 5629.4, 60 sec: 5562.8, 300 sec: 5535.0). Total num frames: 860007424. Throughput: 0: 5805.3. Samples: 860004746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:27:01,070][25689] Avg episode reward: [(0, '-0.607')] [2022-07-10 18:27:02,260][26022] Updated weights on worker 0-0, policy_version 839856 (0.00094) [2022-07-10 18:27:04,258][26022] Updated weights on worker 0-0, policy_version 839866 (0.00087) [2022-07-10 18:27:06,144][25689] Fps is (10 sec: 5333.8, 60 sec: 5541.9, 300 sec: 5520.4). Total num frames: 860032000. Throughput: 0: 5659.3. Samples: 860036120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:27:06,146][25689] Avg episode reward: [(0, '-0.955')] [2022-07-10 18:27:06,149][26022] Updated weights on worker 0-0, policy_version 839876 (0.00081) [2022-07-10 18:27:07,868][26022] Updated weights on worker 0-0, policy_version 839886 (0.00087) [2022-07-10 18:27:09,856][26022] Updated weights on worker 0-0, policy_version 839896 (0.00092) [2022-07-10 18:27:11,197][25689] Fps is (10 sec: 5258.0, 60 sec: 5521.7, 300 sec: 5526.7). Total num frames: 860060672. Throughput: 0: 5657.2. Samples: 860069642. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:27:11,197][25689] Avg episode reward: [(0, '-2.223')] [2022-07-10 18:27:11,704][26022] Updated weights on worker 0-0, policy_version 839906 (0.00054) [2022-07-10 18:27:13,541][26022] Updated weights on worker 0-0, policy_version 839916 (0.00091) [2022-07-10 18:27:15,241][26022] Updated weights on worker 0-0, policy_version 839926 (0.00093) [2022-07-10 18:27:16,280][25689] Fps is (10 sec: 5657.6, 60 sec: 5550.7, 300 sec: 5529.4). Total num frames: 860089344. Throughput: 0: 4843.7. Samples: 860086412. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 18:27:16,281][25689] Avg episode reward: [(0, '-1.802')] [2022-07-10 18:27:17,223][26022] Updated weights on worker 0-0, policy_version 839936 (0.00094) [2022-07-10 18:27:18,797][26022] Updated weights on worker 0-0, policy_version 839946 (0.00101) [2022-07-10 18:27:20,898][26022] Updated weights on worker 0-0, policy_version 839956 (0.00084) [2022-07-10 18:27:21,433][25689] Fps is (10 sec: 5602.3, 60 sec: 5529.8, 300 sec: 5523.3). Total num frames: 860118016. Throughput: 0: 5648.0. Samples: 860119756. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:21,433][25689] Avg episode reward: [(0, '-2.260')] [2022-07-10 18:27:22,696][26022] Updated weights on worker 0-0, policy_version 839966 (0.00087) [2022-07-10 18:27:24,522][26022] Updated weights on worker 0-0, policy_version 839976 (0.00088) [2022-07-10 18:27:26,428][26022] Updated weights on worker 0-0, policy_version 839986 (0.00096) [2022-07-10 18:27:26,524][25689] Fps is (10 sec: 5497.5, 60 sec: 5521.7, 300 sec: 5521.7). Total num frames: 860145664. Throughput: 0: 5748.1. Samples: 860153268. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:26,525][25689] Avg episode reward: [(0, '-2.193')] [2022-07-10 18:27:28,238][26022] Updated weights on worker 0-0, policy_version 839996 (0.00096) [2022-07-10 18:27:29,886][26022] Updated weights on worker 0-0, policy_version 840006 (0.00080) [2022-07-10 18:27:31,586][25689] Fps is (10 sec: 5546.7, 60 sec: 5534.4, 300 sec: 5524.7). Total num frames: 860174336. Throughput: 0: 4930.3. Samples: 860170168. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:31,587][25689] Avg episode reward: [(0, '-2.255')] [2022-07-10 18:27:31,977][26022] Updated weights on worker 0-0, policy_version 840016 (0.00084) [2022-07-10 18:27:33,479][26022] Updated weights on worker 0-0, policy_version 840026 (0.00093) [2022-07-10 18:27:35,594][26022] Updated weights on worker 0-0, policy_version 840036 (0.00092) [2022-07-10 18:27:36,637][25689] Fps is (10 sec: 5771.5, 60 sec: 5548.9, 300 sec: 5528.8). Total num frames: 860204032. Throughput: 0: 5764.9. Samples: 860203772. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:36,638][25689] Avg episode reward: [(0, '-1.409')] [2022-07-10 18:27:37,134][26022] Updated weights on worker 0-0, policy_version 840046 (0.00098) [2022-07-10 18:27:39,189][26022] Updated weights on worker 0-0, policy_version 840056 (0.00087) [2022-07-10 18:27:40,999][26022] Updated weights on worker 0-0, policy_version 840066 (0.00092) [2022-07-10 18:27:41,719][25689] Fps is (10 sec: 5558.3, 60 sec: 5552.4, 300 sec: 5525.3). Total num frames: 860230656. Throughput: 0: 5798.8. Samples: 860237392. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:41,720][25689] Avg episode reward: [(0, '-1.834')] [2022-07-10 18:27:42,875][26022] Updated weights on worker 0-0, policy_version 840076 (0.00084) [2022-07-10 18:27:44,655][26022] Updated weights on worker 0-0, policy_version 840086 (0.00088) [2022-07-10 18:27:46,274][26022] Updated weights on worker 0-0, policy_version 840096 (0.00085) [2022-07-10 18:27:46,729][25689] Fps is (10 sec: 5682.3, 60 sec: 5569.8, 300 sec: 5532.5). Total num frames: 860261376. Throughput: 0: 5001.6. Samples: 860254324. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:46,729][25689] Avg episode reward: [(0, '-1.622')] [2022-07-10 18:27:48,271][26022] Updated weights on worker 0-0, policy_version 840106 (0.00095) [2022-07-10 18:27:50,033][26022] Updated weights on worker 0-0, policy_version 840116 (0.00087) [2022-07-10 18:27:51,732][25689] Fps is (10 sec: 5727.2, 60 sec: 5553.5, 300 sec: 5532.6). Total num frames: 860288000. Throughput: 0: 5844.5. Samples: 860287908. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:51,733][25689] Avg episode reward: [(0, '-1.005')] [2022-07-10 18:27:51,948][26022] Updated weights on worker 0-0, policy_version 840126 (0.00088) [2022-07-10 18:27:53,692][26022] Updated weights on worker 0-0, policy_version 840136 (0.00085) [2022-07-10 18:27:55,519][26022] Updated weights on worker 0-0, policy_version 840146 (0.00087) [2022-07-10 18:27:56,743][25689] Fps is (10 sec: 5419.8, 60 sec: 5560.3, 300 sec: 5526.7). Total num frames: 860315648. Throughput: 0: 5866.2. Samples: 860321714. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:27:56,744][25689] Avg episode reward: [(0, '-0.706')] [2022-07-10 18:27:57,474][26022] Updated weights on worker 0-0, policy_version 840156 (0.00093) [2022-07-10 18:27:59,392][26022] Updated weights on worker 0-0, policy_version 840166 (0.00091) [2022-07-10 18:28:01,205][26022] Updated weights on worker 0-0, policy_version 840176 (0.00086) [2022-07-10 18:28:01,879][25689] Fps is (10 sec: 5651.2, 60 sec: 5557.1, 300 sec: 5535.5). Total num frames: 860345344. Throughput: 0: 5011.4. Samples: 860338420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:01,879][25689] Avg episode reward: [(0, '-0.989')] [2022-07-10 18:28:03,230][26022] Updated weights on worker 0-0, policy_version 840186 (0.00086) [2022-07-10 18:28:05,118][26022] Updated weights on worker 0-0, policy_version 840196 (0.00090) [2022-07-10 18:28:06,884][25689] Fps is (10 sec: 5351.4, 60 sec: 5563.3, 300 sec: 5528.8). Total num frames: 860369920. Throughput: 0: 5744.3. Samples: 860370102. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:06,885][25689] Avg episode reward: [(0, '-0.852')] [2022-07-10 18:28:07,207][26022] Updated weights on worker 0-0, policy_version 840206 (0.00081) [2022-07-10 18:28:08,807][26022] Updated weights on worker 0-0, policy_version 840216 (0.00091) [2022-07-10 18:28:10,867][26022] Updated weights on worker 0-0, policy_version 840226 (0.00092) [2022-07-10 18:28:11,929][25689] Fps is (10 sec: 5297.9, 60 sec: 5564.1, 300 sec: 5524.6). Total num frames: 860398592. Throughput: 0: 5719.4. Samples: 860403428. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:11,931][25689] Avg episode reward: [(0, '-0.309')] [2022-07-10 18:28:12,433][26022] Updated weights on worker 0-0, policy_version 840236 (0.00086) [2022-07-10 18:28:14,336][26022] Updated weights on worker 0-0, policy_version 840246 (0.00090) [2022-07-10 18:28:16,116][26022] Updated weights on worker 0-0, policy_version 840256 (0.00091) [2022-07-10 18:28:16,941][25689] Fps is (10 sec: 5599.9, 60 sec: 5553.7, 300 sec: 5529.5). Total num frames: 860426240. Throughput: 0: 4876.5. Samples: 860420214. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:16,944][25689] Avg episode reward: [(0, '-0.672')] [2022-07-10 18:28:18,035][26022] Updated weights on worker 0-0, policy_version 840266 (0.00087) [2022-07-10 18:28:19,915][26022] Updated weights on worker 0-0, policy_version 840276 (0.00084) [2022-07-10 18:28:21,535][26022] Updated weights on worker 0-0, policy_version 840286 (0.00097) [2022-07-10 18:28:21,997][25689] Fps is (10 sec: 5492.5, 60 sec: 5545.7, 300 sec: 5525.1). Total num frames: 860453888. Throughput: 0: 5724.4. Samples: 860453586. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:21,999][25689] Avg episode reward: [(0, '-0.880')] [2022-07-10 18:28:23,653][26022] Updated weights on worker 0-0, policy_version 840296 (0.00086) [2022-07-10 18:28:25,176][26022] Updated weights on worker 0-0, policy_version 840306 (0.00088) [2022-07-10 18:28:27,004][25689] Fps is (10 sec: 5495.3, 60 sec: 5553.5, 300 sec: 5525.2). Total num frames: 860481536. Throughput: 0: 5796.1. Samples: 860486718. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:27,006][25689] Avg episode reward: [(0, '-2.323')] [2022-07-10 18:28:27,245][26022] Updated weights on worker 0-0, policy_version 840316 (0.00088) [2022-07-10 18:28:28,949][26022] Updated weights on worker 0-0, policy_version 840326 (0.00086) [2022-07-10 18:28:31,040][26022] Updated weights on worker 0-0, policy_version 840336 (0.00083) [2022-07-10 18:28:32,016][25689] Fps is (10 sec: 5519.4, 60 sec: 5541.2, 300 sec: 5529.1). Total num frames: 860509184. Throughput: 0: 4973.5. Samples: 860503326. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:32,019][25689] Avg episode reward: [(0, '-2.325')] [2022-07-10 18:28:32,742][26022] Updated weights on worker 0-0, policy_version 840346 (0.00081) [2022-07-10 18:28:34,672][26022] Updated weights on worker 0-0, policy_version 840356 (0.00087) [2022-07-10 18:28:36,281][26022] Updated weights on worker 0-0, policy_version 840366 (0.00089) [2022-07-10 18:28:37,041][25689] Fps is (10 sec: 5713.3, 60 sec: 5543.5, 300 sec: 5530.9). Total num frames: 860538880. Throughput: 0: 5809.6. Samples: 860536984. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:37,043][25689] Avg episode reward: [(0, '-2.075')] [2022-07-10 18:28:38,351][26022] Updated weights on worker 0-0, policy_version 840376 (0.00097) [2022-07-10 18:28:39,943][26022] Updated weights on worker 0-0, policy_version 840386 (0.00083) [2022-07-10 18:28:42,047][26022] Updated weights on worker 0-0, policy_version 840396 (0.00086) [2022-07-10 18:28:42,091][25689] Fps is (10 sec: 5589.8, 60 sec: 5546.4, 300 sec: 5523.4). Total num frames: 860565504. Throughput: 0: 5825.2. Samples: 860570638. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:42,091][25689] Avg episode reward: [(0, '-2.167')] [2022-07-10 18:28:43,754][26022] Updated weights on worker 0-0, policy_version 840406 (0.00086) [2022-07-10 18:28:45,647][26022] Updated weights on worker 0-0, policy_version 840416 (0.00094) [2022-07-10 18:28:47,100][25689] Fps is (10 sec: 5497.0, 60 sec: 5512.6, 300 sec: 5537.1). Total num frames: 860594176. Throughput: 0: 5012.9. Samples: 860587458. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:47,101][25689] Avg episode reward: [(0, '-3.244')] [2022-07-10 18:28:47,372][26022] Updated weights on worker 0-0, policy_version 840426 (0.00092) [2022-07-10 18:28:49,429][26022] Updated weights on worker 0-0, policy_version 840436 (0.00086) [2022-07-10 18:28:51,095][26022] Updated weights on worker 0-0, policy_version 840446 (0.00097) [2022-07-10 18:28:52,114][25689] Fps is (10 sec: 5516.9, 60 sec: 5511.5, 300 sec: 5523.5). Total num frames: 860620800. Throughput: 0: 5848.0. Samples: 860620862. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:52,115][25689] Avg episode reward: [(0, '-2.664')] [2022-07-10 18:28:53,239][26022] Updated weights on worker 0-0, policy_version 840456 (0.00093) [2022-07-10 18:28:54,084][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:28:54,101][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000840462_860633088.pth [2022-07-10 18:28:54,102][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000838516_858640384.pth [2022-07-10 18:28:54,679][26022] Updated weights on worker 0-0, policy_version 840466 (0.00088) [2022-07-10 18:28:56,787][26022] Updated weights on worker 0-0, policy_version 840476 (0.00090) [2022-07-10 18:28:57,121][25689] Fps is (10 sec: 5416.1, 60 sec: 5511.9, 300 sec: 5525.3). Total num frames: 860648448. Throughput: 0: 5817.1. Samples: 860653790. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:28:57,121][25689] Avg episode reward: [(0, '-1.335')] [2022-07-10 18:28:58,772][26022] Updated weights on worker 0-0, policy_version 840486 (0.00089) [2022-07-10 18:29:00,355][26022] Updated weights on worker 0-0, policy_version 840496 (0.00088) [2022-07-10 18:29:02,198][25689] Fps is (10 sec: 5382.0, 60 sec: 5466.4, 300 sec: 5528.6). Total num frames: 860675072. Throughput: 0: 4956.3. Samples: 860670294. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:02,203][25689] Avg episode reward: [(0, '-1.654')] [2022-07-10 18:29:02,780][26022] Updated weights on worker 0-0, policy_version 840506 (0.00089) [2022-07-10 18:29:04,608][26022] Updated weights on worker 0-0, policy_version 840516 (0.00085) [2022-07-10 18:29:06,304][26022] Updated weights on worker 0-0, policy_version 840526 (0.00092) [2022-07-10 18:29:07,212][25689] Fps is (10 sec: 5377.9, 60 sec: 5516.5, 300 sec: 5528.5). Total num frames: 860702720. Throughput: 0: 5666.6. Samples: 860701426. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:07,213][25689] Avg episode reward: [(0, '-1.966')] [2022-07-10 18:29:08,516][26022] Updated weights on worker 0-0, policy_version 840536 (0.00085) [2022-07-10 18:29:09,803][26022] Updated weights on worker 0-0, policy_version 840546 (0.00085) [2022-07-10 18:29:12,033][26022] Updated weights on worker 0-0, policy_version 840556 (0.00091) [2022-07-10 18:29:12,219][25689] Fps is (10 sec: 5620.5, 60 sec: 5520.1, 300 sec: 5525.1). Total num frames: 860731392. Throughput: 0: 5680.9. Samples: 860735074. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:12,219][25689] Avg episode reward: [(0, '-1.949')] [2022-07-10 18:29:13,632][26022] Updated weights on worker 0-0, policy_version 840566 (0.00077) [2022-07-10 18:29:15,515][26022] Updated weights on worker 0-0, policy_version 840576 (0.00083) [2022-07-10 18:29:17,237][25689] Fps is (10 sec: 5618.2, 60 sec: 5519.5, 300 sec: 5532.5). Total num frames: 860759040. Throughput: 0: 5723.1. Samples: 860768916. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:17,237][25689] Avg episode reward: [(0, '-1.295')] [2022-07-10 18:29:17,428][26022] Updated weights on worker 0-0, policy_version 840586 (0.00092) [2022-07-10 18:29:19,038][26022] Updated weights on worker 0-0, policy_version 840596 (0.00089) [2022-07-10 18:29:20,960][26022] Updated weights on worker 0-0, policy_version 840606 (0.00080) [2022-07-10 18:29:22,269][25689] Fps is (10 sec: 5501.7, 60 sec: 5521.6, 300 sec: 5530.2). Total num frames: 860786688. Throughput: 0: 5751.4. Samples: 860785732. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:22,270][25689] Avg episode reward: [(0, '-2.145')] [2022-07-10 18:29:22,766][26022] Updated weights on worker 0-0, policy_version 840616 (0.00086) [2022-07-10 18:29:24,553][26022] Updated weights on worker 0-0, policy_version 840626 (0.00076) [2022-07-10 18:29:26,540][26022] Updated weights on worker 0-0, policy_version 840636 (0.00092) [2022-07-10 18:29:27,282][25689] Fps is (10 sec: 5606.8, 60 sec: 5538.1, 300 sec: 5530.0). Total num frames: 860815360. Throughput: 0: 5865.8. Samples: 860819150. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:27,282][25689] Avg episode reward: [(0, '-1.772')] [2022-07-10 18:29:28,431][26022] Updated weights on worker 0-0, policy_version 840646 (0.00088) [2022-07-10 18:29:30,138][26022] Updated weights on worker 0-0, policy_version 840656 (0.00087) [2022-07-10 18:29:32,028][26022] Updated weights on worker 0-0, policy_version 840666 (0.00090) [2022-07-10 18:29:32,287][25689] Fps is (10 sec: 5622.2, 60 sec: 5538.7, 300 sec: 5533.9). Total num frames: 860843008. Throughput: 0: 5862.9. Samples: 860852732. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:32,288][25689] Avg episode reward: [(0, '-0.600')] [2022-07-10 18:29:33,802][26022] Updated weights on worker 0-0, policy_version 840676 (0.00084) [2022-07-10 18:29:35,609][26022] Updated weights on worker 0-0, policy_version 840686 (0.00083) [2022-07-10 18:29:37,299][25689] Fps is (10 sec: 5520.0, 60 sec: 5505.9, 300 sec: 5529.1). Total num frames: 860870656. Throughput: 0: 5014.3. Samples: 860869516. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:37,301][25689] Avg episode reward: [(0, '-0.502')] [2022-07-10 18:29:37,672][26022] Updated weights on worker 0-0, policy_version 840696 (0.00083) [2022-07-10 18:29:39,168][26022] Updated weights on worker 0-0, policy_version 840706 (0.00089) [2022-07-10 18:29:41,133][26022] Updated weights on worker 0-0, policy_version 840716 (0.00390) [2022-07-10 18:29:42,338][25689] Fps is (10 sec: 5603.5, 60 sec: 5541.0, 300 sec: 5536.5). Total num frames: 860899328. Throughput: 0: 5858.2. Samples: 860903296. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:42,338][25689] Avg episode reward: [(0, '-0.654')] [2022-07-10 18:29:42,957][26022] Updated weights on worker 0-0, policy_version 840726 (0.00092) [2022-07-10 18:29:44,600][26022] Updated weights on worker 0-0, policy_version 840736 (0.00086) [2022-07-10 18:29:46,716][26022] Updated weights on worker 0-0, policy_version 840746 (0.00083) [2022-07-10 18:29:47,356][25689] Fps is (10 sec: 5600.1, 60 sec: 5523.1, 300 sec: 5533.0). Total num frames: 860926976. Throughput: 0: 5882.6. Samples: 860937240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:47,358][25689] Avg episode reward: [(0, '-0.118')] [2022-07-10 18:29:48,183][26022] Updated weights on worker 0-0, policy_version 840756 (0.00099) [2022-07-10 18:29:50,331][26022] Updated weights on worker 0-0, policy_version 840766 (0.00103) [2022-07-10 18:29:52,131][26022] Updated weights on worker 0-0, policy_version 840776 (0.00891) [2022-07-10 18:29:52,395][25689] Fps is (10 sec: 5599.9, 60 sec: 5554.8, 300 sec: 5535.9). Total num frames: 860955648. Throughput: 0: 5037.9. Samples: 860954036. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:52,397][25689] Avg episode reward: [(0, '0.606')] [2022-07-10 18:29:53,831][26022] Updated weights on worker 0-0, policy_version 840786 (0.00049) [2022-07-10 18:29:55,860][26022] Updated weights on worker 0-0, policy_version 840796 (0.00079) [2022-07-10 18:29:57,427][25689] Fps is (10 sec: 5694.1, 60 sec: 5569.4, 300 sec: 5541.2). Total num frames: 860984320. Throughput: 0: 5865.7. Samples: 860987580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:29:57,427][25689] Avg episode reward: [(0, '0.421')] [2022-07-10 18:29:57,645][26022] Updated weights on worker 0-0, policy_version 840806 (0.00090) [2022-07-10 18:29:59,523][26022] Updated weights on worker 0-0, policy_version 840816 (0.00092) [2022-07-10 18:30:01,211][26022] Updated weights on worker 0-0, policy_version 840826 (0.00084) [2022-07-10 18:30:02,498][25689] Fps is (10 sec: 5473.2, 60 sec: 5570.0, 300 sec: 5537.5). Total num frames: 861010944. Throughput: 0: 5725.6. Samples: 861018726. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:02,499][25689] Avg episode reward: [(0, '0.464')] [2022-07-10 18:30:03,414][26022] Updated weights on worker 0-0, policy_version 840836 (0.00086) [2022-07-10 18:30:05,279][26022] Updated weights on worker 0-0, policy_version 840846 (0.00093) [2022-07-10 18:30:07,228][26022] Updated weights on worker 0-0, policy_version 840856 (0.00091) [2022-07-10 18:30:07,566][25689] Fps is (10 sec: 5251.6, 60 sec: 5548.0, 300 sec: 5533.0). Total num frames: 861037568. Throughput: 0: 4865.7. Samples: 861035578. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:07,567][25689] Avg episode reward: [(0, '0.143')] [2022-07-10 18:30:08,965][26022] Updated weights on worker 0-0, policy_version 840866 (0.00087) [2022-07-10 18:30:10,903][26022] Updated weights on worker 0-0, policy_version 840876 (0.00096) [2022-07-10 18:30:12,605][25689] Fps is (10 sec: 5471.0, 60 sec: 5545.1, 300 sec: 5536.0). Total num frames: 861066240. Throughput: 0: 5692.1. Samples: 861069074. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:12,605][25689] Avg episode reward: [(0, '0.426')] [2022-07-10 18:30:12,638][26022] Updated weights on worker 0-0, policy_version 840886 (0.00091) [2022-07-10 18:30:14,570][26022] Updated weights on worker 0-0, policy_version 840896 (0.00088) [2022-07-10 18:30:16,189][26022] Updated weights on worker 0-0, policy_version 840906 (0.00084) [2022-07-10 18:30:17,614][25689] Fps is (10 sec: 5605.3, 60 sec: 5545.9, 300 sec: 5538.0). Total num frames: 861093888. Throughput: 0: 5693.6. Samples: 861102516. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:17,615][25689] Avg episode reward: [(0, '0.578')] [2022-07-10 18:30:18,184][26022] Updated weights on worker 0-0, policy_version 840916 (0.00089) [2022-07-10 18:30:19,885][26022] Updated weights on worker 0-0, policy_version 840926 (0.00089) [2022-07-10 18:30:22,010][26022] Updated weights on worker 0-0, policy_version 840936 (0.00091) [2022-07-10 18:30:22,699][25689] Fps is (10 sec: 5579.7, 60 sec: 5558.1, 300 sec: 5534.1). Total num frames: 861122560. Throughput: 0: 4983.8. Samples: 861119402. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:22,699][25689] Avg episode reward: [(0, '0.667')] [2022-07-10 18:30:23,690][26022] Updated weights on worker 0-0, policy_version 840946 (0.00087) [2022-07-10 18:30:25,593][26022] Updated weights on worker 0-0, policy_version 840956 (0.00095) [2022-07-10 18:30:27,436][26022] Updated weights on worker 0-0, policy_version 840966 (0.00088) [2022-07-10 18:30:27,725][25689] Fps is (10 sec: 5569.9, 60 sec: 5539.8, 300 sec: 5537.5). Total num frames: 861150208. Throughput: 0: 5797.7. Samples: 861152454. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:27,726][25689] Avg episode reward: [(0, '0.779')] [2022-07-10 18:30:29,234][26022] Updated weights on worker 0-0, policy_version 840976 (0.00087) [2022-07-10 18:30:31,206][26022] Updated weights on worker 0-0, policy_version 840986 (0.00086) [2022-07-10 18:30:32,739][25689] Fps is (10 sec: 5507.5, 60 sec: 5539.1, 300 sec: 5534.0). Total num frames: 861177856. Throughput: 0: 5787.0. Samples: 861185588. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:32,739][25689] Avg episode reward: [(0, '1.036')] [2022-07-10 18:30:33,011][26022] Updated weights on worker 0-0, policy_version 840996 (0.00086) [2022-07-10 18:30:34,831][26022] Updated weights on worker 0-0, policy_version 841006 (0.00084) [2022-07-10 18:30:36,599][26022] Updated weights on worker 0-0, policy_version 841016 (0.00094) [2022-07-10 18:30:37,743][25689] Fps is (10 sec: 5519.9, 60 sec: 5539.8, 300 sec: 5531.3). Total num frames: 861205504. Throughput: 0: 4964.0. Samples: 861202436. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:37,745][25689] Avg episode reward: [(0, '1.035')] [2022-07-10 18:30:38,453][26022] Updated weights on worker 0-0, policy_version 841026 (0.00085) [2022-07-10 18:30:40,337][26022] Updated weights on worker 0-0, policy_version 841036 (0.00055) [2022-07-10 18:30:42,247][26022] Updated weights on worker 0-0, policy_version 841046 (0.00087) [2022-07-10 18:30:42,793][25689] Fps is (10 sec: 5499.8, 60 sec: 5521.8, 300 sec: 5534.0). Total num frames: 861233152. Throughput: 0: 5791.9. Samples: 861235786. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:42,794][25689] Avg episode reward: [(0, '1.071')] [2022-07-10 18:30:44,122][26022] Updated weights on worker 0-0, policy_version 841056 (0.00085) [2022-07-10 18:30:45,933][26022] Updated weights on worker 0-0, policy_version 841066 (0.00085) [2022-07-10 18:30:47,758][26022] Updated weights on worker 0-0, policy_version 841076 (0.00089) [2022-07-10 18:30:47,824][25689] Fps is (10 sec: 5586.6, 60 sec: 5537.6, 300 sec: 5537.0). Total num frames: 861261824. Throughput: 0: 5821.0. Samples: 861269450. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:47,826][25689] Avg episode reward: [(0, '0.076')] [2022-07-10 18:30:49,487][26022] Updated weights on worker 0-0, policy_version 841086 (0.00095) [2022-07-10 18:30:51,543][26022] Updated weights on worker 0-0, policy_version 841096 (0.00067) [2022-07-10 18:30:52,851][25689] Fps is (10 sec: 5701.2, 60 sec: 5538.7, 300 sec: 5537.0). Total num frames: 861290496. Throughput: 0: 5005.9. Samples: 861286268. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2022-07-10 18:30:52,853][25689] Avg episode reward: [(0, '-0.506')] [2022-07-10 18:30:53,068][26022] Updated weights on worker 0-0, policy_version 841106 (0.00096) [2022-07-10 18:30:54,111][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:30:54,127][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000841111_861297664.pth [2022-07-10 18:30:54,127][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000839164_859303936.pth [2022-07-10 18:30:55,085][26022] Updated weights on worker 0-0, policy_version 841116 (0.00083) [2022-07-10 18:30:56,850][26022] Updated weights on worker 0-0, policy_version 841126 (0.00099) [2022-07-10 18:30:57,859][25689] Fps is (10 sec: 5510.2, 60 sec: 5507.0, 300 sec: 5537.5). Total num frames: 861317120. Throughput: 0: 5809.8. Samples: 861319308. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:30:57,861][25689] Avg episode reward: [(0, '-0.344')] [2022-07-10 18:30:58,850][26022] Updated weights on worker 0-0, policy_version 841136 (0.00102) [2022-07-10 18:31:00,610][26022] Updated weights on worker 0-0, policy_version 841146 (0.00090) [2022-07-10 18:31:03,000][25689] Fps is (10 sec: 5145.8, 60 sec: 5483.7, 300 sec: 5535.5). Total num frames: 861342720. Throughput: 0: 5675.7. Samples: 861350474. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:03,000][25689] Avg episode reward: [(0, '-0.357')] [2022-07-10 18:31:03,079][26022] Updated weights on worker 0-0, policy_version 841156 (0.00092) [2022-07-10 18:31:04,758][26022] Updated weights on worker 0-0, policy_version 841166 (0.01200) [2022-07-10 18:31:06,716][26022] Updated weights on worker 0-0, policy_version 841176 (0.00082) [2022-07-10 18:31:08,001][25689] Fps is (10 sec: 5452.4, 60 sec: 5540.7, 300 sec: 5535.8). Total num frames: 861372416. Throughput: 0: 4843.1. Samples: 861367168. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:08,001][25689] Avg episode reward: [(0, '0.243')] [2022-07-10 18:31:08,251][26022] Updated weights on worker 0-0, policy_version 841186 (0.00100) [2022-07-10 18:31:10,621][26022] Updated weights on worker 0-0, policy_version 841196 (0.00083) [2022-07-10 18:31:11,783][26022] Updated weights on worker 0-0, policy_version 841206 (0.00081) [2022-07-10 18:31:13,007][25689] Fps is (10 sec: 5628.0, 60 sec: 5509.8, 300 sec: 5536.2). Total num frames: 861399040. Throughput: 0: 5670.0. Samples: 861400552. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:13,008][25689] Avg episode reward: [(0, '0.171')] [2022-07-10 18:31:14,074][26022] Updated weights on worker 0-0, policy_version 841216 (0.00439) [2022-07-10 18:31:15,600][26022] Updated weights on worker 0-0, policy_version 841226 (0.00086) [2022-07-10 18:31:17,715][26022] Updated weights on worker 0-0, policy_version 841236 (0.00089) [2022-07-10 18:31:18,098][25689] Fps is (10 sec: 5476.7, 60 sec: 5519.2, 300 sec: 5533.1). Total num frames: 861427712. Throughput: 0: 5678.5. Samples: 861434230. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:18,098][25689] Avg episode reward: [(0, '0.821')] [2022-07-10 18:31:19,347][26022] Updated weights on worker 0-0, policy_version 841246 (0.00084) [2022-07-10 18:31:21,271][26022] Updated weights on worker 0-0, policy_version 841256 (0.00085) [2022-07-10 18:31:23,026][26022] Updated weights on worker 0-0, policy_version 841266 (0.00085) [2022-07-10 18:31:23,207][25689] Fps is (10 sec: 5722.3, 60 sec: 5533.9, 300 sec: 5538.0). Total num frames: 861457408. Throughput: 0: 4977.7. Samples: 861451060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:23,208][25689] Avg episode reward: [(0, '0.920')] [2022-07-10 18:31:25,017][26022] Updated weights on worker 0-0, policy_version 841276 (0.00092) [2022-07-10 18:31:26,731][26022] Updated weights on worker 0-0, policy_version 841286 (0.00060) [2022-07-10 18:31:28,212][25689] Fps is (10 sec: 5365.8, 60 sec: 5485.1, 300 sec: 5527.9). Total num frames: 861481984. Throughput: 0: 5800.6. Samples: 861484404. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:28,213][25689] Avg episode reward: [(0, '0.597')] [2022-07-10 18:31:28,654][26022] Updated weights on worker 0-0, policy_version 841296 (0.00088) [2022-07-10 18:31:30,470][26022] Updated weights on worker 0-0, policy_version 841306 (0.00099) [2022-07-10 18:31:32,301][26022] Updated weights on worker 0-0, policy_version 841316 (0.00089) [2022-07-10 18:31:33,216][25689] Fps is (10 sec: 5422.6, 60 sec: 5519.9, 300 sec: 5531.7). Total num frames: 861511680. Throughput: 0: 5783.2. Samples: 861517422. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:33,216][25689] Avg episode reward: [(0, '0.611')] [2022-07-10 18:31:34,490][26022] Updated weights on worker 0-0, policy_version 841326 (0.00099) [2022-07-10 18:31:35,940][26022] Updated weights on worker 0-0, policy_version 841336 (0.00090) [2022-07-10 18:31:37,883][26022] Updated weights on worker 0-0, policy_version 841346 (0.00092) [2022-07-10 18:31:38,247][25689] Fps is (10 sec: 5612.4, 60 sec: 5500.5, 300 sec: 5533.4). Total num frames: 861538304. Throughput: 0: 4966.9. Samples: 861534310. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:38,247][25689] Avg episode reward: [(0, '0.357')] [2022-07-10 18:31:39,857][26022] Updated weights on worker 0-0, policy_version 841356 (0.00091) [2022-07-10 18:31:41,522][26022] Updated weights on worker 0-0, policy_version 841366 (0.00089) [2022-07-10 18:31:43,367][25689] Fps is (10 sec: 5447.2, 60 sec: 5511.0, 300 sec: 5528.0). Total num frames: 861566976. Throughput: 0: 5756.5. Samples: 861567110. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:43,367][25689] Avg episode reward: [(0, '0.365')] [2022-07-10 18:31:43,630][26022] Updated weights on worker 0-0, policy_version 841376 (0.00090) [2022-07-10 18:31:45,391][26022] Updated weights on worker 0-0, policy_version 841386 (0.00051) [2022-07-10 18:31:47,160][26022] Updated weights on worker 0-0, policy_version 841396 (0.00085) [2022-07-10 18:31:48,395][25689] Fps is (10 sec: 5751.9, 60 sec: 5528.3, 300 sec: 5534.6). Total num frames: 861596672. Throughput: 0: 5751.2. Samples: 861600478. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:48,395][25689] Avg episode reward: [(0, '0.653')] [2022-07-10 18:31:49,183][26022] Updated weights on worker 0-0, policy_version 841406 (0.00090) [2022-07-10 18:31:50,818][26022] Updated weights on worker 0-0, policy_version 841416 (0.00091) [2022-07-10 18:31:52,654][26022] Updated weights on worker 0-0, policy_version 841426 (0.00086) [2022-07-10 18:31:53,481][25689] Fps is (10 sec: 5669.9, 60 sec: 5506.0, 300 sec: 5534.5). Total num frames: 861624320. Throughput: 0: 4936.1. Samples: 861617452. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:53,481][25689] Avg episode reward: [(0, '0.709')] [2022-07-10 18:31:54,546][26022] Updated weights on worker 0-0, policy_version 841436 (0.00084) [2022-07-10 18:31:56,320][26022] Updated weights on worker 0-0, policy_version 841446 (0.00090) [2022-07-10 18:31:58,395][26022] Updated weights on worker 0-0, policy_version 841456 (0.00091) [2022-07-10 18:31:58,492][25689] Fps is (10 sec: 5374.8, 60 sec: 5505.7, 300 sec: 5525.9). Total num frames: 861650944. Throughput: 0: 5763.3. Samples: 861650988. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:31:58,492][25689] Avg episode reward: [(0, '0.077')] [2022-07-10 18:31:59,983][26022] Updated weights on worker 0-0, policy_version 841466 (0.00087) [2022-07-10 18:32:01,950][26022] Updated weights on worker 0-0, policy_version 841476 (0.00092) [2022-07-10 18:32:03,611][25689] Fps is (10 sec: 5357.3, 60 sec: 5541.4, 300 sec: 5535.4). Total num frames: 861678592. Throughput: 0: 5711.5. Samples: 861682736. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:03,612][25689] Avg episode reward: [(0, '0.019')] [2022-07-10 18:32:03,948][26022] Updated weights on worker 0-0, policy_version 841486 (0.00084) [2022-07-10 18:32:05,915][26022] Updated weights on worker 0-0, policy_version 841496 (0.00083) [2022-07-10 18:32:07,693][26022] Updated weights on worker 0-0, policy_version 841506 (0.00091) [2022-07-10 18:32:08,626][25689] Fps is (10 sec: 5355.5, 60 sec: 5489.5, 300 sec: 5529.2). Total num frames: 861705216. Throughput: 0: 5718.0. Samples: 861716162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:08,627][25689] Avg episode reward: [(0, '0.471')] [2022-07-10 18:32:09,439][26022] Updated weights on worker 0-0, policy_version 841516 (0.00077) [2022-07-10 18:32:11,512][26022] Updated weights on worker 0-0, policy_version 841526 (0.00095) [2022-07-10 18:32:13,115][26022] Updated weights on worker 0-0, policy_version 841536 (0.00095) [2022-07-10 18:32:13,644][25689] Fps is (10 sec: 5613.6, 60 sec: 5539.1, 300 sec: 5533.8). Total num frames: 861734912. Throughput: 0: 5723.9. Samples: 861732864. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:13,644][25689] Avg episode reward: [(0, '-0.177')] [2022-07-10 18:32:14,948][26022] Updated weights on worker 0-0, policy_version 841546 (0.00093) [2022-07-10 18:32:16,899][26022] Updated weights on worker 0-0, policy_version 841556 (0.00086) [2022-07-10 18:32:18,508][26022] Updated weights on worker 0-0, policy_version 841566 (0.00094) [2022-07-10 18:32:18,671][25689] Fps is (10 sec: 5810.7, 60 sec: 5544.9, 300 sec: 5536.2). Total num frames: 861763584. Throughput: 0: 5732.9. Samples: 861766672. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:18,671][25689] Avg episode reward: [(0, '0.145')] [2022-07-10 18:32:20,607][26022] Updated weights on worker 0-0, policy_version 841576 (0.00091) [2022-07-10 18:32:22,470][26022] Updated weights on worker 0-0, policy_version 841586 (0.00089) [2022-07-10 18:32:23,715][25689] Fps is (10 sec: 5592.4, 60 sec: 5517.1, 300 sec: 5537.1). Total num frames: 861791232. Throughput: 0: 5847.6. Samples: 861800292. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:23,715][25689] Avg episode reward: [(0, '-0.005')] [2022-07-10 18:32:24,163][26022] Updated weights on worker 0-0, policy_version 841596 (0.00087) [2022-07-10 18:32:26,231][26022] Updated weights on worker 0-0, policy_version 841606 (0.00097) [2022-07-10 18:32:27,959][26022] Updated weights on worker 0-0, policy_version 841616 (0.00095) [2022-07-10 18:32:28,745][25689] Fps is (10 sec: 5387.4, 60 sec: 5548.6, 300 sec: 5530.8). Total num frames: 861817856. Throughput: 0: 5007.2. Samples: 861816902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:28,746][25689] Avg episode reward: [(0, '-0.356')] [2022-07-10 18:32:29,783][26022] Updated weights on worker 0-0, policy_version 841626 (0.00085) [2022-07-10 18:32:31,751][26022] Updated weights on worker 0-0, policy_version 841636 (0.00087) [2022-07-10 18:32:33,366][26022] Updated weights on worker 0-0, policy_version 841646 (0.00086) [2022-07-10 18:32:33,749][25689] Fps is (10 sec: 5510.7, 60 sec: 5531.6, 300 sec: 5528.2). Total num frames: 861846528. Throughput: 0: 5827.6. Samples: 861850028. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:33,749][25689] Avg episode reward: [(0, '-0.287')] [2022-07-10 18:32:35,398][26022] Updated weights on worker 0-0, policy_version 841656 (0.00096) [2022-07-10 18:32:37,171][26022] Updated weights on worker 0-0, policy_version 841666 (0.00086) [2022-07-10 18:32:38,790][25689] Fps is (10 sec: 5606.5, 60 sec: 5547.6, 300 sec: 5532.4). Total num frames: 861874176. Throughput: 0: 5812.1. Samples: 861883608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:38,791][25689] Avg episode reward: [(0, '-0.347')] [2022-07-10 18:32:38,975][26022] Updated weights on worker 0-0, policy_version 841676 (0.00099) [2022-07-10 18:32:40,898][26022] Updated weights on worker 0-0, policy_version 841686 (0.00082) [2022-07-10 18:32:42,696][26022] Updated weights on worker 0-0, policy_version 841696 (0.00084) [2022-07-10 18:32:43,890][25689] Fps is (10 sec: 5452.6, 60 sec: 5532.6, 300 sec: 5520.4). Total num frames: 861901824. Throughput: 0: 4957.8. Samples: 861900318. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:43,891][25689] Avg episode reward: [(0, '-0.864')] [2022-07-10 18:32:44,374][26022] Updated weights on worker 0-0, policy_version 841706 (0.00088) [2022-07-10 18:32:46,588][26022] Updated weights on worker 0-0, policy_version 841716 (0.00107) [2022-07-10 18:32:48,039][26022] Updated weights on worker 0-0, policy_version 841726 (0.00110) [2022-07-10 18:32:48,899][25689] Fps is (10 sec: 5672.5, 60 sec: 5534.2, 300 sec: 5530.6). Total num frames: 861931520. Throughput: 0: 5802.3. Samples: 861933844. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:48,900][25689] Avg episode reward: [(0, '-1.034')] [2022-07-10 18:32:50,121][26022] Updated weights on worker 0-0, policy_version 841736 (0.00080) [2022-07-10 18:32:51,794][26022] Updated weights on worker 0-0, policy_version 841746 (0.00082) [2022-07-10 18:32:53,640][26022] Updated weights on worker 0-0, policy_version 841756 (0.00083) [2022-07-10 18:32:53,920][25689] Fps is (10 sec: 5819.2, 60 sec: 5557.1, 300 sec: 5533.9). Total num frames: 861960192. Throughput: 0: 5849.3. Samples: 861968016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:53,921][25689] Avg episode reward: [(0, '-0.610')] [2022-07-10 18:32:54,242][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:32:54,257][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000841760_861962240.pth [2022-07-10 18:32:54,258][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000839811_859966464.pth [2022-07-10 18:32:55,512][26022] Updated weights on worker 0-0, policy_version 841766 (0.00097) [2022-07-10 18:32:57,489][26022] Updated weights on worker 0-0, policy_version 841776 (0.00090) [2022-07-10 18:32:58,937][25689] Fps is (10 sec: 5508.6, 60 sec: 5556.6, 300 sec: 5525.8). Total num frames: 861986816. Throughput: 0: 4999.1. Samples: 861984326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:32:58,938][25689] Avg episode reward: [(0, '-0.708')] [2022-07-10 18:32:59,219][26022] Updated weights on worker 0-0, policy_version 841786 (0.00084) [2022-07-10 18:33:01,404][26022] Updated weights on worker 0-0, policy_version 841796 (0.00087) [2022-07-10 18:33:03,125][26022] Updated weights on worker 0-0, policy_version 841806 (0.00090) [2022-07-10 18:33:04,003][25689] Fps is (10 sec: 5281.1, 60 sec: 5544.6, 300 sec: 5531.5). Total num frames: 862013440. Throughput: 0: 5719.8. Samples: 862015360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:04,004][25689] Avg episode reward: [(0, '-0.803')] [2022-07-10 18:33:05,336][26022] Updated weights on worker 0-0, policy_version 841816 (0.00090) [2022-07-10 18:33:06,575][26022] Updated weights on worker 0-0, policy_version 841826 (0.00082) [2022-07-10 18:33:08,923][26022] Updated weights on worker 0-0, policy_version 841836 (0.00088) [2022-07-10 18:33:09,007][25689] Fps is (10 sec: 5389.9, 60 sec: 5562.6, 300 sec: 5528.9). Total num frames: 862041088. Throughput: 0: 5732.1. Samples: 862049102. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:09,007][25689] Avg episode reward: [(0, '-0.848')] [2022-07-10 18:33:10,524][26022] Updated weights on worker 0-0, policy_version 841846 (0.00094) [2022-07-10 18:33:12,428][26022] Updated weights on worker 0-0, policy_version 841856 (0.00087) [2022-07-10 18:33:14,052][25689] Fps is (10 sec: 5604.3, 60 sec: 5543.0, 300 sec: 5531.7). Total num frames: 862069760. Throughput: 0: 4860.2. Samples: 862065862. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:14,053][25689] Avg episode reward: [(0, '0.359')] [2022-07-10 18:33:14,290][26022] Updated weights on worker 0-0, policy_version 841866 (0.00092) [2022-07-10 18:33:16,261][26022] Updated weights on worker 0-0, policy_version 841876 (0.00087) [2022-07-10 18:33:17,979][26022] Updated weights on worker 0-0, policy_version 841886 (0.00090) [2022-07-10 18:33:19,086][25689] Fps is (10 sec: 5485.9, 60 sec: 5508.5, 300 sec: 5528.6). Total num frames: 862096384. Throughput: 0: 5713.6. Samples: 862099450. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:19,087][25689] Avg episode reward: [(0, '0.412')] [2022-07-10 18:33:19,899][26022] Updated weights on worker 0-0, policy_version 841896 (0.00093) [2022-07-10 18:33:21,514][26022] Updated weights on worker 0-0, policy_version 841906 (0.00092) [2022-07-10 18:33:23,427][26022] Updated weights on worker 0-0, policy_version 841916 (0.00084) [2022-07-10 18:33:24,160][25689] Fps is (10 sec: 5470.9, 60 sec: 5522.7, 300 sec: 5530.8). Total num frames: 862125056. Throughput: 0: 5838.3. Samples: 862133042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:24,160][25689] Avg episode reward: [(0, '0.514')] [2022-07-10 18:33:25,246][26022] Updated weights on worker 0-0, policy_version 841926 (0.00085) [2022-07-10 18:33:27,250][26022] Updated weights on worker 0-0, policy_version 841936 (0.00097) [2022-07-10 18:33:29,076][26022] Updated weights on worker 0-0, policy_version 841946 (0.00095) [2022-07-10 18:33:29,167][25689] Fps is (10 sec: 5587.2, 60 sec: 5541.8, 300 sec: 5530.9). Total num frames: 862152704. Throughput: 0: 4985.0. Samples: 862149598. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:29,167][25689] Avg episode reward: [(0, '0.812')] [2022-07-10 18:33:30,911][26022] Updated weights on worker 0-0, policy_version 841956 (0.00095) [2022-07-10 18:33:32,855][26022] Updated weights on worker 0-0, policy_version 841966 (0.00096) [2022-07-10 18:33:34,203][25689] Fps is (10 sec: 5505.6, 60 sec: 5521.9, 300 sec: 5523.8). Total num frames: 862180352. Throughput: 0: 5786.3. Samples: 862182462. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:34,205][25689] Avg episode reward: [(0, '0.956')] [2022-07-10 18:33:34,688][26022] Updated weights on worker 0-0, policy_version 841976 (0.00087) [2022-07-10 18:33:36,382][26022] Updated weights on worker 0-0, policy_version 841986 (0.00083) [2022-07-10 18:33:38,462][26022] Updated weights on worker 0-0, policy_version 841996 (0.00092) [2022-07-10 18:33:39,236][25689] Fps is (10 sec: 5593.2, 60 sec: 5539.7, 300 sec: 5531.0). Total num frames: 862209024. Throughput: 0: 5784.3. Samples: 862216000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:39,236][25689] Avg episode reward: [(0, '0.945')] [2022-07-10 18:33:40,241][26022] Updated weights on worker 0-0, policy_version 842006 (0.00087) [2022-07-10 18:33:42,020][26022] Updated weights on worker 0-0, policy_version 842016 (0.00079) [2022-07-10 18:33:43,795][26022] Updated weights on worker 0-0, policy_version 842026 (0.00091) [2022-07-10 18:33:44,287][25689] Fps is (10 sec: 5585.5, 60 sec: 5544.1, 300 sec: 5526.8). Total num frames: 862236672. Throughput: 0: 4942.2. Samples: 862232514. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:44,287][25689] Avg episode reward: [(0, '0.185')] [2022-07-10 18:33:45,827][26022] Updated weights on worker 0-0, policy_version 842036 (0.00092) [2022-07-10 18:33:47,586][26022] Updated weights on worker 0-0, policy_version 842046 (0.00590) [2022-07-10 18:33:49,310][25689] Fps is (10 sec: 5488.7, 60 sec: 5508.9, 300 sec: 5530.1). Total num frames: 862264320. Throughput: 0: 5765.7. Samples: 862265742. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:49,311][25689] Avg episode reward: [(0, '0.217')] [2022-07-10 18:33:49,772][26022] Updated weights on worker 0-0, policy_version 842056 (0.00089) [2022-07-10 18:33:51,206][26022] Updated weights on worker 0-0, policy_version 842066 (0.00080) [2022-07-10 18:33:53,374][26022] Updated weights on worker 0-0, policy_version 842076 (0.00092) [2022-07-10 18:33:54,361][25689] Fps is (10 sec: 5590.6, 60 sec: 5506.2, 300 sec: 5532.7). Total num frames: 862292992. Throughput: 0: 5764.6. Samples: 862298660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:54,363][25689] Avg episode reward: [(0, '0.674')] [2022-07-10 18:33:54,763][26022] Updated weights on worker 0-0, policy_version 842086 (0.00082) [2022-07-10 18:33:57,018][26022] Updated weights on worker 0-0, policy_version 842096 (0.00086) [2022-07-10 18:33:58,773][26022] Updated weights on worker 0-0, policy_version 842106 (0.00092) [2022-07-10 18:33:59,371][25689] Fps is (10 sec: 5496.3, 60 sec: 5506.9, 300 sec: 5534.0). Total num frames: 862319616. Throughput: 0: 4939.0. Samples: 862315448. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:33:59,371][25689] Avg episode reward: [(0, '0.383')] [2022-07-10 18:34:00,720][26022] Updated weights on worker 0-0, policy_version 842116 (0.00087) [2022-07-10 18:34:02,760][26022] Updated weights on worker 0-0, policy_version 842126 (0.00087) [2022-07-10 18:34:04,502][25689] Fps is (10 sec: 5149.6, 60 sec: 5484.0, 300 sec: 5524.9). Total num frames: 862345216. Throughput: 0: 5628.7. Samples: 862346300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:04,503][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 18:34:04,600][26022] Updated weights on worker 0-0, policy_version 842136 (0.00091) [2022-07-10 18:34:06,501][26022] Updated weights on worker 0-0, policy_version 842146 (0.00087) [2022-07-10 18:34:08,424][26022] Updated weights on worker 0-0, policy_version 842156 (0.00084) [2022-07-10 18:34:09,522][25689] Fps is (10 sec: 5346.4, 60 sec: 5499.4, 300 sec: 5524.6). Total num frames: 862373888. Throughput: 0: 5649.6. Samples: 862379930. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:09,523][25689] Avg episode reward: [(0, '0.713')] [2022-07-10 18:34:10,232][26022] Updated weights on worker 0-0, policy_version 842166 (0.00088) [2022-07-10 18:34:11,956][26022] Updated weights on worker 0-0, policy_version 842176 (0.00089) [2022-07-10 18:34:13,792][26022] Updated weights on worker 0-0, policy_version 842186 (0.00087) [2022-07-10 18:34:14,524][25689] Fps is (10 sec: 5619.9, 60 sec: 5486.5, 300 sec: 5524.9). Total num frames: 862401536. Throughput: 0: 4867.2. Samples: 862396798. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:14,525][25689] Avg episode reward: [(0, '0.966')] [2022-07-10 18:34:15,808][26022] Updated weights on worker 0-0, policy_version 842196 (0.00089) [2022-07-10 18:34:17,521][26022] Updated weights on worker 0-0, policy_version 842206 (0.00086) [2022-07-10 18:34:19,515][26022] Updated weights on worker 0-0, policy_version 842216 (0.00091) [2022-07-10 18:34:19,593][25689] Fps is (10 sec: 5592.3, 60 sec: 5517.2, 300 sec: 5527.7). Total num frames: 862430208. Throughput: 0: 5673.6. Samples: 862430180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:19,595][25689] Avg episode reward: [(0, '1.159')] [2022-07-10 18:34:21,127][26022] Updated weights on worker 0-0, policy_version 842226 (0.00090) [2022-07-10 18:34:23,054][26022] Updated weights on worker 0-0, policy_version 842236 (0.00081) [2022-07-10 18:34:24,660][25689] Fps is (10 sec: 5556.1, 60 sec: 5500.8, 300 sec: 5523.2). Total num frames: 862457856. Throughput: 0: 5840.5. Samples: 862464034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:24,662][25689] Avg episode reward: [(0, '1.306')] [2022-07-10 18:34:24,903][26022] Updated weights on worker 0-0, policy_version 842246 (0.00089) [2022-07-10 18:34:26,728][26022] Updated weights on worker 0-0, policy_version 842256 (0.00091) [2022-07-10 18:34:28,518][26022] Updated weights on worker 0-0, policy_version 842266 (0.00091) [2022-07-10 18:34:29,693][25689] Fps is (10 sec: 5677.4, 60 sec: 5532.2, 300 sec: 5529.6). Total num frames: 862487552. Throughput: 0: 4992.4. Samples: 862480634. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:29,695][25689] Avg episode reward: [(0, '1.378')] [2022-07-10 18:34:30,387][26022] Updated weights on worker 0-0, policy_version 842276 (0.00080) [2022-07-10 18:34:32,028][26022] Updated weights on worker 0-0, policy_version 842286 (0.00092) [2022-07-10 18:34:34,264][26022] Updated weights on worker 0-0, policy_version 842296 (0.00055) [2022-07-10 18:34:34,734][25689] Fps is (10 sec: 5692.6, 60 sec: 5531.9, 300 sec: 5529.1). Total num frames: 862515200. Throughput: 0: 5813.4. Samples: 862514286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 18:34:34,737][25689] Avg episode reward: [(0, '1.378')] [2022-07-10 18:34:35,665][26022] Updated weights on worker 0-0, policy_version 842306 (0.00088) [2022-07-10 18:34:37,807][26022] Updated weights on worker 0-0, policy_version 842316 (0.00085) [2022-07-10 18:34:39,391][26022] Updated weights on worker 0-0, policy_version 842326 (0.00086) [2022-07-10 18:34:39,784][25689] Fps is (10 sec: 5581.4, 60 sec: 5530.3, 300 sec: 5528.9). Total num frames: 862543872. Throughput: 0: 5841.3. Samples: 862548122. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:34:39,785][25689] Avg episode reward: [(0, '1.003')] [2022-07-10 18:34:41,359][26022] Updated weights on worker 0-0, policy_version 842336 (0.00084) [2022-07-10 18:34:43,004][26022] Updated weights on worker 0-0, policy_version 842346 (0.00099) [2022-07-10 18:34:44,861][25689] Fps is (10 sec: 5561.2, 60 sec: 5527.9, 300 sec: 5527.8). Total num frames: 862571520. Throughput: 0: 5001.1. Samples: 862565060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:34:44,861][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 18:34:45,042][26022] Updated weights on worker 0-0, policy_version 842356 (0.00083) [2022-07-10 18:34:46,657][26022] Updated weights on worker 0-0, policy_version 842366 (0.00085) [2022-07-10 18:34:48,794][26022] Updated weights on worker 0-0, policy_version 842376 (0.00085) [2022-07-10 18:34:49,891][25689] Fps is (10 sec: 5572.6, 60 sec: 5544.2, 300 sec: 5528.0). Total num frames: 862600192. Throughput: 0: 5845.0. Samples: 862598688. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:34:49,892][25689] Avg episode reward: [(0, '-0.484')] [2022-07-10 18:34:50,342][26022] Updated weights on worker 0-0, policy_version 842386 (0.00415) [2022-07-10 18:34:52,192][26022] Updated weights on worker 0-0, policy_version 842396 (0.00087) [2022-07-10 18:34:54,110][26022] Updated weights on worker 0-0, policy_version 842406 (0.00093) [2022-07-10 18:34:54,736][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:34:54,750][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000842409_862626816.pth [2022-07-10 18:34:54,751][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000840462_860633088.pth [2022-07-10 18:34:54,899][25689] Fps is (10 sec: 5712.6, 60 sec: 5548.1, 300 sec: 5528.4). Total num frames: 862628864. Throughput: 0: 5855.2. Samples: 862632360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:34:54,900][25689] Avg episode reward: [(0, '-1.596')] [2022-07-10 18:34:56,075][26022] Updated weights on worker 0-0, policy_version 842416 (0.00083) [2022-07-10 18:34:57,724][26022] Updated weights on worker 0-0, policy_version 842426 (0.00086) [2022-07-10 18:34:59,780][26022] Updated weights on worker 0-0, policy_version 842436 (0.00087) [2022-07-10 18:34:59,916][25689] Fps is (10 sec: 5515.4, 60 sec: 5547.4, 300 sec: 5529.4). Total num frames: 862655488. Throughput: 0: 5014.1. Samples: 862649068. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:34:59,917][25689] Avg episode reward: [(0, '-1.521')] [2022-07-10 18:35:01,358][26022] Updated weights on worker 0-0, policy_version 842446 (0.00085) [2022-07-10 18:35:03,657][26022] Updated weights on worker 0-0, policy_version 842456 (0.00087) [2022-07-10 18:35:04,964][25689] Fps is (10 sec: 5290.7, 60 sec: 5572.1, 300 sec: 5529.8). Total num frames: 862682112. Throughput: 0: 5741.0. Samples: 862680470. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:04,965][25689] Avg episode reward: [(0, '-1.477')] [2022-07-10 18:35:05,472][26022] Updated weights on worker 0-0, policy_version 842466 (0.00097) [2022-07-10 18:35:07,263][26022] Updated weights on worker 0-0, policy_version 842476 (0.00086) [2022-07-10 18:35:09,159][26022] Updated weights on worker 0-0, policy_version 842486 (0.00083) [2022-07-10 18:35:09,975][25689] Fps is (10 sec: 5497.5, 60 sec: 5572.9, 300 sec: 5530.3). Total num frames: 862710784. Throughput: 0: 5758.5. Samples: 862714342. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:09,975][25689] Avg episode reward: [(0, '-1.146')] [2022-07-10 18:35:10,921][26022] Updated weights on worker 0-0, policy_version 842496 (0.00087) [2022-07-10 18:35:12,793][26022] Updated weights on worker 0-0, policy_version 842506 (0.00095) [2022-07-10 18:35:14,537][26022] Updated weights on worker 0-0, policy_version 842516 (0.00086) [2022-07-10 18:35:14,987][25689] Fps is (10 sec: 5619.1, 60 sec: 5571.9, 300 sec: 5530.2). Total num frames: 862738432. Throughput: 0: 5750.9. Samples: 862747882. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:14,987][25689] Avg episode reward: [(0, '-0.098')] [2022-07-10 18:35:16,460][26022] Updated weights on worker 0-0, policy_version 842526 (0.00090) [2022-07-10 18:35:18,250][26022] Updated weights on worker 0-0, policy_version 842536 (0.00087) [2022-07-10 18:35:19,995][25689] Fps is (10 sec: 5416.3, 60 sec: 5543.6, 300 sec: 5524.8). Total num frames: 862765056. Throughput: 0: 5756.7. Samples: 862764654. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:19,995][25689] Avg episode reward: [(0, '-0.189')] [2022-07-10 18:35:20,207][26022] Updated weights on worker 0-0, policy_version 842546 (0.00089) [2022-07-10 18:35:21,877][26022] Updated weights on worker 0-0, policy_version 842556 (0.00083) [2022-07-10 18:35:23,754][26022] Updated weights on worker 0-0, policy_version 842566 (0.00091) [2022-07-10 18:35:25,061][25689] Fps is (10 sec: 5590.2, 60 sec: 5577.6, 300 sec: 5530.9). Total num frames: 862794752. Throughput: 0: 5868.6. Samples: 862798416. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:25,062][25689] Avg episode reward: [(0, '0.906')] [2022-07-10 18:35:25,451][26022] Updated weights on worker 0-0, policy_version 842576 (0.00090) [2022-07-10 18:35:27,302][26022] Updated weights on worker 0-0, policy_version 842586 (0.00086) [2022-07-10 18:35:29,462][26022] Updated weights on worker 0-0, policy_version 842596 (0.00089) [2022-07-10 18:35:30,071][25689] Fps is (10 sec: 5691.3, 60 sec: 5545.9, 300 sec: 5531.0). Total num frames: 862822400. Throughput: 0: 5852.0. Samples: 862831944. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:30,071][25689] Avg episode reward: [(0, '0.561')] [2022-07-10 18:35:30,848][26022] Updated weights on worker 0-0, policy_version 842606 (0.00088) [2022-07-10 18:35:33,120][26022] Updated weights on worker 0-0, policy_version 842616 (0.00092) [2022-07-10 18:35:34,615][26022] Updated weights on worker 0-0, policy_version 842626 (0.00084) [2022-07-10 18:35:35,091][25689] Fps is (10 sec: 5615.5, 60 sec: 5564.7, 300 sec: 5534.2). Total num frames: 862851072. Throughput: 0: 5012.7. Samples: 862848658. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:35,091][25689] Avg episode reward: [(0, '0.385')] [2022-07-10 18:35:36,675][26022] Updated weights on worker 0-0, policy_version 842636 (0.00090) [2022-07-10 18:35:38,251][26022] Updated weights on worker 0-0, policy_version 842646 (0.00096) [2022-07-10 18:35:40,109][25689] Fps is (10 sec: 5610.3, 60 sec: 5550.7, 300 sec: 5534.7). Total num frames: 862878720. Throughput: 0: 5866.7. Samples: 862882658. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:40,111][25689] Avg episode reward: [(0, '0.310')] [2022-07-10 18:35:40,182][26022] Updated weights on worker 0-0, policy_version 842656 (0.00090) [2022-07-10 18:35:41,969][26022] Updated weights on worker 0-0, policy_version 842666 (0.00088) [2022-07-10 18:35:43,907][26022] Updated weights on worker 0-0, policy_version 842676 (0.00090) [2022-07-10 18:35:45,229][25689] Fps is (10 sec: 5555.3, 60 sec: 5563.7, 300 sec: 5533.1). Total num frames: 862907392. Throughput: 0: 5854.7. Samples: 862916490. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:45,230][25689] Avg episode reward: [(0, '-0.771')] [2022-07-10 18:35:45,520][26022] Updated weights on worker 0-0, policy_version 842686 (0.00090) [2022-07-10 18:35:47,507][26022] Updated weights on worker 0-0, policy_version 842696 (0.00089) [2022-07-10 18:35:49,275][26022] Updated weights on worker 0-0, policy_version 842706 (0.00092) [2022-07-10 18:35:50,305][25689] Fps is (10 sec: 5523.8, 60 sec: 5542.5, 300 sec: 5528.7). Total num frames: 862935040. Throughput: 0: 5008.1. Samples: 862933280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:50,306][25689] Avg episode reward: [(0, '-0.379')] [2022-07-10 18:35:51,197][26022] Updated weights on worker 0-0, policy_version 842716 (0.00085) [2022-07-10 18:35:53,063][26022] Updated weights on worker 0-0, policy_version 842726 (0.01022) [2022-07-10 18:35:54,858][26022] Updated weights on worker 0-0, policy_version 842736 (0.00089) [2022-07-10 18:35:55,318][25689] Fps is (10 sec: 5683.7, 60 sec: 5559.0, 300 sec: 5539.0). Total num frames: 862964736. Throughput: 0: 5845.3. Samples: 862966892. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:35:55,319][25689] Avg episode reward: [(0, '-0.510')] [2022-07-10 18:35:56,651][26022] Updated weights on worker 0-0, policy_version 842746 (0.00087) [2022-07-10 18:35:58,595][26022] Updated weights on worker 0-0, policy_version 842756 (0.00083) [2022-07-10 18:36:00,335][25689] Fps is (10 sec: 5615.5, 60 sec: 5559.1, 300 sec: 5544.7). Total num frames: 862991360. Throughput: 0: 5823.5. Samples: 863000440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:00,335][25689] Avg episode reward: [(0, '-0.136')] [2022-07-10 18:36:00,460][26022] Updated weights on worker 0-0, policy_version 842766 (0.00096) [2022-07-10 18:36:02,547][26022] Updated weights on worker 0-0, policy_version 842776 (0.00095) [2022-07-10 18:36:04,363][26022] Updated weights on worker 0-0, policy_version 842786 (0.00081) [2022-07-10 18:36:05,419][25689] Fps is (10 sec: 5271.9, 60 sec: 5555.7, 300 sec: 5532.9). Total num frames: 863017984. Throughput: 0: 4880.4. Samples: 863015024. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:05,421][25689] Avg episode reward: [(0, '-0.342')] [2022-07-10 18:36:06,183][26022] Updated weights on worker 0-0, policy_version 842796 (0.00084) [2022-07-10 18:36:08,018][26022] Updated weights on worker 0-0, policy_version 842806 (0.00089) [2022-07-10 18:36:09,930][26022] Updated weights on worker 0-0, policy_version 842816 (0.00089) [2022-07-10 18:36:10,459][25689] Fps is (10 sec: 5461.8, 60 sec: 5553.1, 300 sec: 5539.1). Total num frames: 863046656. Throughput: 0: 5738.2. Samples: 863048926. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:10,459][25689] Avg episode reward: [(0, '0.389')] [2022-07-10 18:36:11,617][26022] Updated weights on worker 0-0, policy_version 842826 (0.00090) [2022-07-10 18:36:13,594][26022] Updated weights on worker 0-0, policy_version 842836 (0.00091) [2022-07-10 18:36:15,204][26022] Updated weights on worker 0-0, policy_version 842846 (0.00092) [2022-07-10 18:36:15,486][25689] Fps is (10 sec: 5696.2, 60 sec: 5568.6, 300 sec: 5540.3). Total num frames: 863075328. Throughput: 0: 5730.3. Samples: 863082458. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:15,486][25689] Avg episode reward: [(0, '0.577')] [2022-07-10 18:36:17,163][26022] Updated weights on worker 0-0, policy_version 842856 (0.00085) [2022-07-10 18:36:18,872][26022] Updated weights on worker 0-0, policy_version 842866 (0.00084) [2022-07-10 18:36:20,494][25689] Fps is (10 sec: 5612.3, 60 sec: 5585.5, 300 sec: 5535.3). Total num frames: 863102976. Throughput: 0: 4906.4. Samples: 863099350. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:20,495][25689] Avg episode reward: [(0, '0.945')] [2022-07-10 18:36:20,706][26022] Updated weights on worker 0-0, policy_version 842876 (0.00092) [2022-07-10 18:36:22,651][26022] Updated weights on worker 0-0, policy_version 842886 (0.00092) [2022-07-10 18:36:24,415][26022] Updated weights on worker 0-0, policy_version 842896 (0.00083) [2022-07-10 18:36:25,549][25689] Fps is (10 sec: 5494.9, 60 sec: 5552.8, 300 sec: 5544.7). Total num frames: 863130624. Throughput: 0: 5875.9. Samples: 863133308. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:25,551][25689] Avg episode reward: [(0, '1.046')] [2022-07-10 18:36:26,359][26022] Updated weights on worker 0-0, policy_version 842906 (0.00088) [2022-07-10 18:36:28,107][26022] Updated weights on worker 0-0, policy_version 842916 (0.00090) [2022-07-10 18:36:30,063][26022] Updated weights on worker 0-0, policy_version 842926 (0.00086) [2022-07-10 18:36:30,575][25689] Fps is (10 sec: 5688.5, 60 sec: 5585.1, 300 sec: 5544.3). Total num frames: 863160320. Throughput: 0: 5869.0. Samples: 863166986. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:30,575][25689] Avg episode reward: [(0, '1.203')] [2022-07-10 18:36:31,750][26022] Updated weights on worker 0-0, policy_version 842936 (0.00088) [2022-07-10 18:36:33,618][26022] Updated weights on worker 0-0, policy_version 842946 (0.00444) [2022-07-10 18:36:35,515][26022] Updated weights on worker 0-0, policy_version 842956 (0.00084) [2022-07-10 18:36:35,599][25689] Fps is (10 sec: 5604.0, 60 sec: 5550.9, 300 sec: 5544.4). Total num frames: 863186944. Throughput: 0: 5024.4. Samples: 863183514. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:35,600][25689] Avg episode reward: [(0, '1.285')] [2022-07-10 18:36:37,178][26022] Updated weights on worker 0-0, policy_version 842966 (0.00086) [2022-07-10 18:36:39,379][26022] Updated weights on worker 0-0, policy_version 842976 (0.00091) [2022-07-10 18:36:40,603][25689] Fps is (10 sec: 5513.8, 60 sec: 5569.1, 300 sec: 5546.6). Total num frames: 863215616. Throughput: 0: 5847.0. Samples: 863216930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:40,604][25689] Avg episode reward: [(0, '0.730')] [2022-07-10 18:36:40,843][26022] Updated weights on worker 0-0, policy_version 842986 (0.00085) [2022-07-10 18:36:42,865][26022] Updated weights on worker 0-0, policy_version 842996 (0.00104) [2022-07-10 18:36:44,691][26022] Updated weights on worker 0-0, policy_version 843006 (0.00084) [2022-07-10 18:36:45,693][25689] Fps is (10 sec: 5579.3, 60 sec: 5554.9, 300 sec: 5538.5). Total num frames: 863243264. Throughput: 0: 5799.5. Samples: 863250136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:45,694][25689] Avg episode reward: [(0, '0.975')] [2022-07-10 18:36:46,557][26022] Updated weights on worker 0-0, policy_version 843016 (0.00090) [2022-07-10 18:36:48,375][26022] Updated weights on worker 0-0, policy_version 843026 (0.00084) [2022-07-10 18:36:50,360][26022] Updated weights on worker 0-0, policy_version 843036 (0.00091) [2022-07-10 18:36:50,773][25689] Fps is (10 sec: 5336.7, 60 sec: 5537.6, 300 sec: 5535.2). Total num frames: 863269888. Throughput: 0: 4944.1. Samples: 863266848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:50,774][25689] Avg episode reward: [(0, '0.708')] [2022-07-10 18:36:51,912][26022] Updated weights on worker 0-0, policy_version 843046 (0.00094) [2022-07-10 18:36:54,192][26022] Updated weights on worker 0-0, policy_version 843056 (0.00092) [2022-07-10 18:36:54,817][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:36:54,828][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000843062_863295488.pth [2022-07-10 18:36:54,829][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000841111_861297664.pth [2022-07-10 18:36:55,602][26022] Updated weights on worker 0-0, policy_version 843066 (0.00096) [2022-07-10 18:36:55,860][25689] Fps is (10 sec: 5640.5, 60 sec: 5547.8, 300 sec: 5547.6). Total num frames: 863300608. Throughput: 0: 5763.3. Samples: 863300282. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:36:55,860][25689] Avg episode reward: [(0, '0.609')] [2022-07-10 18:36:57,704][26022] Updated weights on worker 0-0, policy_version 843076 (0.00088) [2022-07-10 18:36:59,385][26022] Updated weights on worker 0-0, policy_version 843086 (0.00083) [2022-07-10 18:37:00,887][25689] Fps is (10 sec: 5669.8, 60 sec: 5546.8, 300 sec: 5545.8). Total num frames: 863327232. Throughput: 0: 5780.4. Samples: 863334176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:00,887][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 18:37:01,279][26022] Updated weights on worker 0-0, policy_version 843096 (0.00092) [2022-07-10 18:37:03,524][26022] Updated weights on worker 0-0, policy_version 843106 (0.00081) [2022-07-10 18:37:05,283][26022] Updated weights on worker 0-0, policy_version 843116 (0.00093) [2022-07-10 18:37:05,973][25689] Fps is (10 sec: 5265.0, 60 sec: 5546.6, 300 sec: 5544.5). Total num frames: 863353856. Throughput: 0: 4849.8. Samples: 863348490. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:05,974][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 18:37:07,457][26022] Updated weights on worker 0-0, policy_version 843126 (0.00084) [2022-07-10 18:37:08,837][26022] Updated weights on worker 0-0, policy_version 843136 (0.00927) [2022-07-10 18:37:10,863][26022] Updated weights on worker 0-0, policy_version 843146 (0.00095) [2022-07-10 18:37:11,003][25689] Fps is (10 sec: 5364.6, 60 sec: 5530.6, 300 sec: 5537.4). Total num frames: 863381504. Throughput: 0: 5691.5. Samples: 863381990. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:11,005][25689] Avg episode reward: [(0, '0.745')] [2022-07-10 18:37:12,652][26022] Updated weights on worker 0-0, policy_version 843156 (0.00055) [2022-07-10 18:37:14,552][26022] Updated weights on worker 0-0, policy_version 843166 (0.00091) [2022-07-10 18:37:16,018][25689] Fps is (10 sec: 5504.8, 60 sec: 5514.8, 300 sec: 5534.2). Total num frames: 863409152. Throughput: 0: 5711.9. Samples: 863415426. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:16,020][25689] Avg episode reward: [(0, '0.436')] [2022-07-10 18:37:16,377][26022] Updated weights on worker 0-0, policy_version 843176 (0.00084) [2022-07-10 18:37:18,156][26022] Updated weights on worker 0-0, policy_version 843186 (0.00086) [2022-07-10 18:37:19,919][26022] Updated weights on worker 0-0, policy_version 843196 (0.00085) [2022-07-10 18:37:21,036][25689] Fps is (10 sec: 5715.6, 60 sec: 5547.7, 300 sec: 5541.5). Total num frames: 863438848. Throughput: 0: 4870.2. Samples: 863432308. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:21,037][25689] Avg episode reward: [(0, '0.051')] [2022-07-10 18:37:21,975][26022] Updated weights on worker 0-0, policy_version 843206 (0.00085) [2022-07-10 18:37:23,575][26022] Updated weights on worker 0-0, policy_version 843216 (0.00089) [2022-07-10 18:37:25,470][26022] Updated weights on worker 0-0, policy_version 843226 (0.00085) [2022-07-10 18:37:26,103][25689] Fps is (10 sec: 5584.5, 60 sec: 5529.7, 300 sec: 5540.8). Total num frames: 863465472. Throughput: 0: 5834.5. Samples: 863465940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:26,104][25689] Avg episode reward: [(0, '-0.168')] [2022-07-10 18:37:27,155][26022] Updated weights on worker 0-0, policy_version 843236 (0.00091) [2022-07-10 18:37:29,259][26022] Updated weights on worker 0-0, policy_version 843246 (0.00090) [2022-07-10 18:37:30,999][26022] Updated weights on worker 0-0, policy_version 843256 (0.00095) [2022-07-10 18:37:31,130][25689] Fps is (10 sec: 5478.2, 60 sec: 5512.7, 300 sec: 5540.4). Total num frames: 863494144. Throughput: 0: 5834.0. Samples: 863499410. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:31,132][25689] Avg episode reward: [(0, '-0.897')] [2022-07-10 18:37:32,806][26022] Updated weights on worker 0-0, policy_version 843266 (0.00087) [2022-07-10 18:37:34,747][26022] Updated weights on worker 0-0, policy_version 843276 (0.00082) [2022-07-10 18:37:36,141][25689] Fps is (10 sec: 5814.7, 60 sec: 5564.6, 300 sec: 5547.9). Total num frames: 863523840. Throughput: 0: 5005.8. Samples: 863516158. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:36,142][25689] Avg episode reward: [(0, '-0.380')] [2022-07-10 18:37:36,289][26022] Updated weights on worker 0-0, policy_version 843286 (0.00090) [2022-07-10 18:37:38,364][26022] Updated weights on worker 0-0, policy_version 843296 (0.00092) [2022-07-10 18:37:40,179][26022] Updated weights on worker 0-0, policy_version 843306 (0.00064) [2022-07-10 18:37:41,155][25689] Fps is (10 sec: 5618.3, 60 sec: 5530.0, 300 sec: 5546.0). Total num frames: 863550464. Throughput: 0: 5834.2. Samples: 863549682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:41,155][25689] Avg episode reward: [(0, '-0.178')] [2022-07-10 18:37:42,119][26022] Updated weights on worker 0-0, policy_version 843316 (0.00089) [2022-07-10 18:37:43,738][26022] Updated weights on worker 0-0, policy_version 843326 (0.00086) [2022-07-10 18:37:45,871][26022] Updated weights on worker 0-0, policy_version 843336 (0.00086) [2022-07-10 18:37:46,213][25689] Fps is (10 sec: 5388.4, 60 sec: 5532.8, 300 sec: 5538.2). Total num frames: 863578112. Throughput: 0: 5834.4. Samples: 863583270. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:46,214][25689] Avg episode reward: [(0, '0.297')] [2022-07-10 18:37:47,441][26022] Updated weights on worker 0-0, policy_version 843346 (0.00079) [2022-07-10 18:37:49,440][26022] Updated weights on worker 0-0, policy_version 843356 (0.00088) [2022-07-10 18:37:50,988][26022] Updated weights on worker 0-0, policy_version 843366 (0.00081) [2022-07-10 18:37:51,234][25689] Fps is (10 sec: 5587.9, 60 sec: 5572.2, 300 sec: 5538.2). Total num frames: 863606784. Throughput: 0: 5001.7. Samples: 863599960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:51,234][25689] Avg episode reward: [(0, '0.167')] [2022-07-10 18:37:53,106][26022] Updated weights on worker 0-0, policy_version 843376 (0.00086) [2022-07-10 18:37:54,878][26022] Updated weights on worker 0-0, policy_version 843386 (0.00093) [2022-07-10 18:37:56,238][25689] Fps is (10 sec: 5618.4, 60 sec: 5528.9, 300 sec: 5541.9). Total num frames: 863634432. Throughput: 0: 5858.5. Samples: 863633892. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:37:56,238][25689] Avg episode reward: [(0, '-0.358')] [2022-07-10 18:37:56,529][26022] Updated weights on worker 0-0, policy_version 843396 (0.00425) [2022-07-10 18:37:58,545][26022] Updated weights on worker 0-0, policy_version 843406 (0.00085) [2022-07-10 18:38:00,150][26022] Updated weights on worker 0-0, policy_version 843416 (0.00086) [2022-07-10 18:38:01,260][25689] Fps is (10 sec: 5617.4, 60 sec: 5563.3, 300 sec: 5549.6). Total num frames: 863663104. Throughput: 0: 5875.4. Samples: 863667806. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:38:01,260][25689] Avg episode reward: [(0, '0.067')] [2022-07-10 18:38:02,348][26022] Updated weights on worker 0-0, policy_version 843426 (0.00119) [2022-07-10 18:38:04,206][26022] Updated weights on worker 0-0, policy_version 843436 (0.00089) [2022-07-10 18:38:06,291][26022] Updated weights on worker 0-0, policy_version 843446 (0.00092) [2022-07-10 18:38:06,334][25689] Fps is (10 sec: 5375.3, 60 sec: 5547.4, 300 sec: 5541.4). Total num frames: 863688704. Throughput: 0: 5777.7. Samples: 863699522. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:38:06,335][25689] Avg episode reward: [(0, '-0.604')] [2022-07-10 18:38:07,923][26022] Updated weights on worker 0-0, policy_version 843456 (0.00089) [2022-07-10 18:38:09,880][26022] Updated weights on worker 0-0, policy_version 843466 (0.00085) [2022-07-10 18:38:11,349][25689] Fps is (10 sec: 5480.7, 60 sec: 5582.8, 300 sec: 5545.4). Total num frames: 863718400. Throughput: 0: 5779.0. Samples: 863716206. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-10 18:38:11,349][25689] Avg episode reward: [(0, '-0.605')] [2022-07-10 18:38:11,620][26022] Updated weights on worker 0-0, policy_version 843476 (0.00094) [2022-07-10 18:38:13,512][26022] Updated weights on worker 0-0, policy_version 843486 (0.00087) [2022-07-10 18:38:15,240][26022] Updated weights on worker 0-0, policy_version 843496 (0.00083) [2022-07-10 18:38:16,426][25689] Fps is (10 sec: 5682.2, 60 sec: 5577.0, 300 sec: 5548.1). Total num frames: 863746048. Throughput: 0: 5731.9. Samples: 863749610. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:16,428][25689] Avg episode reward: [(0, '-0.922')] [2022-07-10 18:38:17,249][26022] Updated weights on worker 0-0, policy_version 843506 (0.00090) [2022-07-10 18:38:18,922][26022] Updated weights on worker 0-0, policy_version 843516 (0.00083) [2022-07-10 18:38:20,974][26022] Updated weights on worker 0-0, policy_version 843526 (0.00091) [2022-07-10 18:38:21,446][25689] Fps is (10 sec: 5374.7, 60 sec: 5526.0, 300 sec: 5542.2). Total num frames: 863772672. Throughput: 0: 5716.7. Samples: 863783208. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:21,447][25689] Avg episode reward: [(0, '-1.062')] [2022-07-10 18:38:22,412][26022] Updated weights on worker 0-0, policy_version 843536 (0.00085) [2022-07-10 18:38:24,613][26022] Updated weights on worker 0-0, policy_version 843546 (0.00092) [2022-07-10 18:38:26,170][26022] Updated weights on worker 0-0, policy_version 843556 (0.00091) [2022-07-10 18:38:26,582][25689] Fps is (10 sec: 5545.3, 60 sec: 5570.5, 300 sec: 5546.7). Total num frames: 863802368. Throughput: 0: 4962.3. Samples: 863800000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:26,583][25689] Avg episode reward: [(0, '-0.282')] [2022-07-10 18:38:28,048][26022] Updated weights on worker 0-0, policy_version 843566 (0.00079) [2022-07-10 18:38:30,163][26022] Updated weights on worker 0-0, policy_version 843576 (0.00088) [2022-07-10 18:38:31,645][25689] Fps is (10 sec: 5723.1, 60 sec: 5567.2, 300 sec: 5549.6). Total num frames: 863831040. Throughput: 0: 5764.5. Samples: 863833204. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:31,646][25689] Avg episode reward: [(0, '0.277')] [2022-07-10 18:38:31,692][26022] Updated weights on worker 0-0, policy_version 843586 (0.00085) [2022-07-10 18:38:33,790][26022] Updated weights on worker 0-0, policy_version 843596 (0.00100) [2022-07-10 18:38:35,663][26022] Updated weights on worker 0-0, policy_version 843606 (0.00090) [2022-07-10 18:38:36,681][25689] Fps is (10 sec: 5475.7, 60 sec: 5514.2, 300 sec: 5542.7). Total num frames: 863857664. Throughput: 0: 5789.6. Samples: 863866878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:36,683][25689] Avg episode reward: [(0, '0.284')] [2022-07-10 18:38:37,247][26022] Updated weights on worker 0-0, policy_version 843616 (0.00089) [2022-07-10 18:38:39,211][26022] Updated weights on worker 0-0, policy_version 843626 (0.00088) [2022-07-10 18:38:40,852][26022] Updated weights on worker 0-0, policy_version 843636 (0.00108) [2022-07-10 18:38:41,687][25689] Fps is (10 sec: 5608.5, 60 sec: 5565.6, 300 sec: 5550.4). Total num frames: 863887360. Throughput: 0: 4960.0. Samples: 863883600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:41,687][25689] Avg episode reward: [(0, '0.083')] [2022-07-10 18:38:42,783][26022] Updated weights on worker 0-0, policy_version 843646 (0.00088) [2022-07-10 18:38:44,494][26022] Updated weights on worker 0-0, policy_version 843656 (0.00096) [2022-07-10 18:38:46,415][26022] Updated weights on worker 0-0, policy_version 843666 (0.00087) [2022-07-10 18:38:46,815][25689] Fps is (10 sec: 5658.6, 60 sec: 5559.2, 300 sec: 5548.4). Total num frames: 863915008. Throughput: 0: 5789.5. Samples: 863917136. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:46,815][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 18:38:48,326][26022] Updated weights on worker 0-0, policy_version 843676 (0.00085) [2022-07-10 18:38:50,094][26022] Updated weights on worker 0-0, policy_version 843686 (0.00095) [2022-07-10 18:38:51,842][25689] Fps is (10 sec: 5546.0, 60 sec: 5558.6, 300 sec: 5548.9). Total num frames: 863943680. Throughput: 0: 5806.6. Samples: 863950480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:51,842][25689] Avg episode reward: [(0, '0.062')] [2022-07-10 18:38:51,953][26022] Updated weights on worker 0-0, policy_version 843696 (0.00088) [2022-07-10 18:38:53,885][26022] Updated weights on worker 0-0, policy_version 843706 (0.00086) [2022-07-10 18:38:54,884][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:38:54,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000843711_863960064.pth [2022-07-10 18:38:54,898][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000841760_861962240.pth [2022-07-10 18:38:55,719][26022] Updated weights on worker 0-0, policy_version 843716 (0.00087) [2022-07-10 18:38:56,924][25689] Fps is (10 sec: 5469.8, 60 sec: 5534.5, 300 sec: 5547.5). Total num frames: 863970304. Throughput: 0: 4951.0. Samples: 863967102. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:38:56,925][25689] Avg episode reward: [(0, '-0.388')] [2022-07-10 18:38:57,715][26022] Updated weights on worker 0-0, policy_version 843726 (0.00089) [2022-07-10 18:38:59,308][26022] Updated weights on worker 0-0, policy_version 843736 (0.00086) [2022-07-10 18:39:01,482][26022] Updated weights on worker 0-0, policy_version 843746 (0.00078) [2022-07-10 18:39:01,975][25689] Fps is (10 sec: 5255.1, 60 sec: 5498.2, 300 sec: 5552.5). Total num frames: 863996928. Throughput: 0: 5758.7. Samples: 864000432. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:01,975][25689] Avg episode reward: [(0, '-0.352')] [2022-07-10 18:39:03,317][26022] Updated weights on worker 0-0, policy_version 843756 (0.00093) [2022-07-10 18:39:05,582][26022] Updated weights on worker 0-0, policy_version 843766 (0.00091) [2022-07-10 18:39:07,057][25689] Fps is (10 sec: 5457.3, 60 sec: 5548.1, 300 sec: 5551.3). Total num frames: 864025600. Throughput: 0: 5638.5. Samples: 864031270. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:07,057][25689] Avg episode reward: [(0, '-0.794')] [2022-07-10 18:39:07,177][26022] Updated weights on worker 0-0, policy_version 843776 (0.00087) [2022-07-10 18:39:09,211][26022] Updated weights on worker 0-0, policy_version 843786 (0.00096) [2022-07-10 18:39:10,819][26022] Updated weights on worker 0-0, policy_version 843796 (0.00086) [2022-07-10 18:39:12,068][25689] Fps is (10 sec: 5579.7, 60 sec: 5514.7, 300 sec: 5551.1). Total num frames: 864053248. Throughput: 0: 4810.9. Samples: 864047788. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:12,069][25689] Avg episode reward: [(0, '-0.684')] [2022-07-10 18:39:12,919][26022] Updated weights on worker 0-0, policy_version 843806 (0.00079) [2022-07-10 18:39:14,638][26022] Updated weights on worker 0-0, policy_version 843816 (0.00085) [2022-07-10 18:39:16,736][26022] Updated weights on worker 0-0, policy_version 843826 (0.00088) [2022-07-10 18:39:17,117][25689] Fps is (10 sec: 5394.6, 60 sec: 5500.4, 300 sec: 5544.6). Total num frames: 864079872. Throughput: 0: 5628.6. Samples: 864080760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:17,118][25689] Avg episode reward: [(0, '-0.811')] [2022-07-10 18:39:18,395][26022] Updated weights on worker 0-0, policy_version 843836 (0.00088) [2022-07-10 18:39:20,563][26022] Updated weights on worker 0-0, policy_version 843846 (0.00096) [2022-07-10 18:39:21,998][26022] Updated weights on worker 0-0, policy_version 843856 (0.00086) [2022-07-10 18:39:22,130][25689] Fps is (10 sec: 5495.8, 60 sec: 5534.8, 300 sec: 5549.1). Total num frames: 864108544. Throughput: 0: 5647.2. Samples: 864114254. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:22,130][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 18:39:24,098][26022] Updated weights on worker 0-0, policy_version 843866 (0.00087) [2022-07-10 18:39:25,804][26022] Updated weights on worker 0-0, policy_version 843876 (0.00089) [2022-07-10 18:39:27,188][25689] Fps is (10 sec: 5694.2, 60 sec: 5525.1, 300 sec: 5545.2). Total num frames: 864137216. Throughput: 0: 4953.6. Samples: 864130992. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:27,188][25689] Avg episode reward: [(0, '-0.778')] [2022-07-10 18:39:27,666][26022] Updated weights on worker 0-0, policy_version 843886 (0.00104) [2022-07-10 18:39:29,551][26022] Updated weights on worker 0-0, policy_version 843896 (0.00086) [2022-07-10 18:39:31,401][26022] Updated weights on worker 0-0, policy_version 843906 (0.00093) [2022-07-10 18:39:32,219][25689] Fps is (10 sec: 5480.7, 60 sec: 5494.1, 300 sec: 5541.9). Total num frames: 864163840. Throughput: 0: 5787.2. Samples: 864164406. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:32,220][25689] Avg episode reward: [(0, '-0.291')] [2022-07-10 18:39:33,069][26022] Updated weights on worker 0-0, policy_version 843916 (0.00086) [2022-07-10 18:39:35,165][26022] Updated weights on worker 0-0, policy_version 843926 (0.00089) [2022-07-10 18:39:36,781][26022] Updated weights on worker 0-0, policy_version 843936 (0.00095) [2022-07-10 18:39:37,304][25689] Fps is (10 sec: 5465.9, 60 sec: 5523.4, 300 sec: 5541.3). Total num frames: 864192512. Throughput: 0: 5780.8. Samples: 864197458. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:37,305][25689] Avg episode reward: [(0, '-0.071')] [2022-07-10 18:39:38,841][26022] Updated weights on worker 0-0, policy_version 843946 (0.00083) [2022-07-10 18:39:40,416][26022] Updated weights on worker 0-0, policy_version 843956 (0.00088) [2022-07-10 18:39:42,334][25689] Fps is (10 sec: 5568.3, 60 sec: 5487.5, 300 sec: 5542.2). Total num frames: 864220160. Throughput: 0: 4954.5. Samples: 864214354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:42,335][25689] Avg episode reward: [(0, '-0.170')] [2022-07-10 18:39:42,497][26022] Updated weights on worker 0-0, policy_version 843966 (0.00091) [2022-07-10 18:39:44,255][26022] Updated weights on worker 0-0, policy_version 843976 (0.00087) [2022-07-10 18:39:46,115][26022] Updated weights on worker 0-0, policy_version 843986 (0.00086) [2022-07-10 18:39:47,438][25689] Fps is (10 sec: 5557.5, 60 sec: 5506.5, 300 sec: 5540.8). Total num frames: 864248832. Throughput: 0: 5772.4. Samples: 864247886. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:47,439][25689] Avg episode reward: [(0, '-0.393')] [2022-07-10 18:39:47,718][26022] Updated weights on worker 0-0, policy_version 843996 (0.00088) [2022-07-10 18:39:49,783][26022] Updated weights on worker 0-0, policy_version 844006 (0.00088) [2022-07-10 18:39:51,231][26022] Updated weights on worker 0-0, policy_version 844016 (0.00086) [2022-07-10 18:39:52,500][25689] Fps is (10 sec: 5640.7, 60 sec: 5503.4, 300 sec: 5539.8). Total num frames: 864277504. Throughput: 0: 5786.7. Samples: 864281762. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:52,500][25689] Avg episode reward: [(0, '0.192')] [2022-07-10 18:39:53,583][26022] Updated weights on worker 0-0, policy_version 844026 (0.00082) [2022-07-10 18:39:55,029][26022] Updated weights on worker 0-0, policy_version 844036 (0.00085) [2022-07-10 18:39:57,098][26022] Updated weights on worker 0-0, policy_version 844046 (0.00086) [2022-07-10 18:39:57,536][25689] Fps is (10 sec: 5679.1, 60 sec: 5541.4, 300 sec: 5546.3). Total num frames: 864306176. Throughput: 0: 5007.8. Samples: 864298774. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:39:57,537][25689] Avg episode reward: [(0, '-0.690')] [2022-07-10 18:39:58,683][26022] Updated weights on worker 0-0, policy_version 844056 (0.00093) [2022-07-10 18:40:00,723][26022] Updated weights on worker 0-0, policy_version 844066 (0.00088) [2022-07-10 18:40:02,594][25689] Fps is (10 sec: 5478.2, 60 sec: 5540.7, 300 sec: 5546.1). Total num frames: 864332800. Throughput: 0: 5839.2. Samples: 864332656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:02,594][25689] Avg episode reward: [(0, '-1.150')] [2022-07-10 18:40:02,798][26022] Updated weights on worker 0-0, policy_version 844076 (0.00090) [2022-07-10 18:40:04,711][26022] Updated weights on worker 0-0, policy_version 844086 (0.00092) [2022-07-10 18:40:06,415][26022] Updated weights on worker 0-0, policy_version 844096 (0.00092) [2022-07-10 18:40:07,645][25689] Fps is (10 sec: 5267.0, 60 sec: 5509.7, 300 sec: 5538.5). Total num frames: 864359424. Throughput: 0: 5739.6. Samples: 864363866. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:07,646][25689] Avg episode reward: [(0, '-1.027')] [2022-07-10 18:40:08,418][26022] Updated weights on worker 0-0, policy_version 844106 (0.00097) [2022-07-10 18:40:09,945][26022] Updated weights on worker 0-0, policy_version 844116 (0.00087) [2022-07-10 18:40:12,097][26022] Updated weights on worker 0-0, policy_version 844126 (0.00092) [2022-07-10 18:40:12,676][25689] Fps is (10 sec: 5484.3, 60 sec: 5524.8, 300 sec: 5541.6). Total num frames: 864388096. Throughput: 0: 5741.3. Samples: 864397600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:12,677][25689] Avg episode reward: [(0, '-0.739')] [2022-07-10 18:40:13,522][26022] Updated weights on worker 0-0, policy_version 844136 (0.00088) [2022-07-10 18:40:15,734][26022] Updated weights on worker 0-0, policy_version 844146 (0.00091) [2022-07-10 18:40:17,459][26022] Updated weights on worker 0-0, policy_version 844156 (0.00085) [2022-07-10 18:40:17,687][25689] Fps is (10 sec: 5609.0, 60 sec: 5545.2, 300 sec: 5545.0). Total num frames: 864415744. Throughput: 0: 5728.0. Samples: 864414198. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:17,687][25689] Avg episode reward: [(0, '0.044')] [2022-07-10 18:40:19,277][26022] Updated weights on worker 0-0, policy_version 844166 (0.00086) [2022-07-10 18:40:21,267][26022] Updated weights on worker 0-0, policy_version 844176 (0.00088) [2022-07-10 18:40:22,696][25689] Fps is (10 sec: 5518.8, 60 sec: 5528.7, 300 sec: 5539.1). Total num frames: 864443392. Throughput: 0: 5734.7. Samples: 864447936. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:22,696][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 18:40:23,079][26022] Updated weights on worker 0-0, policy_version 844186 (0.00095) [2022-07-10 18:40:24,618][26022] Updated weights on worker 0-0, policy_version 844196 (0.00086) [2022-07-10 18:40:26,908][26022] Updated weights on worker 0-0, policy_version 844206 (0.00091) [2022-07-10 18:40:27,762][25689] Fps is (10 sec: 5691.7, 60 sec: 5544.8, 300 sec: 5545.0). Total num frames: 864473088. Throughput: 0: 5832.6. Samples: 864481194. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:27,762][25689] Avg episode reward: [(0, '0.533')] [2022-07-10 18:40:28,442][26022] Updated weights on worker 0-0, policy_version 844216 (0.00081) [2022-07-10 18:40:30,551][26022] Updated weights on worker 0-0, policy_version 844226 (0.00090) [2022-07-10 18:40:32,159][26022] Updated weights on worker 0-0, policy_version 844236 (0.00087) [2022-07-10 18:40:32,765][25689] Fps is (10 sec: 5593.0, 60 sec: 5547.4, 300 sec: 5538.4). Total num frames: 864499712. Throughput: 0: 5005.9. Samples: 864498166. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:32,766][25689] Avg episode reward: [(0, '0.341')] [2022-07-10 18:40:34,004][26022] Updated weights on worker 0-0, policy_version 844246 (0.00083) [2022-07-10 18:40:35,867][26022] Updated weights on worker 0-0, policy_version 844256 (0.00090) [2022-07-10 18:40:37,687][26022] Updated weights on worker 0-0, policy_version 844266 (0.00083) [2022-07-10 18:40:37,776][25689] Fps is (10 sec: 5521.7, 60 sec: 5554.2, 300 sec: 5542.0). Total num frames: 864528384. Throughput: 0: 5839.7. Samples: 864531514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:37,776][25689] Avg episode reward: [(0, '-0.607')] [2022-07-10 18:40:39,641][26022] Updated weights on worker 0-0, policy_version 844276 (0.00085) [2022-07-10 18:40:41,423][26022] Updated weights on worker 0-0, policy_version 844286 (0.00060) [2022-07-10 18:40:42,786][25689] Fps is (10 sec: 5722.7, 60 sec: 5572.9, 300 sec: 5544.0). Total num frames: 864557056. Throughput: 0: 5836.4. Samples: 864565190. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:42,786][25689] Avg episode reward: [(0, '-0.590')] [2022-07-10 18:40:43,115][26022] Updated weights on worker 0-0, policy_version 844296 (0.00086) [2022-07-10 18:40:45,116][26022] Updated weights on worker 0-0, policy_version 844306 (0.00089) [2022-07-10 18:40:47,003][26022] Updated weights on worker 0-0, policy_version 844316 (0.00088) [2022-07-10 18:40:47,833][25689] Fps is (10 sec: 5498.1, 60 sec: 5544.3, 300 sec: 5541.1). Total num frames: 864583680. Throughput: 0: 5015.3. Samples: 864581860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:47,834][25689] Avg episode reward: [(0, '-0.964')] [2022-07-10 18:40:48,575][26022] Updated weights on worker 0-0, policy_version 844326 (0.00082) [2022-07-10 18:40:50,479][26022] Updated weights on worker 0-0, policy_version 844336 (0.00088) [2022-07-10 18:40:52,275][26022] Updated weights on worker 0-0, policy_version 844346 (0.00084) [2022-07-10 18:40:52,853][25689] Fps is (10 sec: 5492.8, 60 sec: 5548.1, 300 sec: 5537.6). Total num frames: 864612352. Throughput: 0: 5852.7. Samples: 864615732. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:52,853][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 18:40:54,151][26022] Updated weights on worker 0-0, policy_version 844356 (0.00091) [2022-07-10 18:40:55,176][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:40:55,185][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000844361_864625664.pth [2022-07-10 18:40:55,186][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000842409_862626816.pth [2022-07-10 18:40:56,007][26022] Updated weights on worker 0-0, policy_version 844366 (0.00087) [2022-07-10 18:40:57,527][26022] Updated weights on worker 0-0, policy_version 844376 (0.00074) [2022-07-10 18:40:57,866][25689] Fps is (10 sec: 5817.5, 60 sec: 5567.2, 300 sec: 5547.9). Total num frames: 864642048. Throughput: 0: 5872.5. Samples: 864649496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:40:57,867][25689] Avg episode reward: [(0, '0.374')] [2022-07-10 18:40:59,706][26022] Updated weights on worker 0-0, policy_version 844386 (0.00090) [2022-07-10 18:41:01,715][26022] Updated weights on worker 0-0, policy_version 844396 (0.00092) [2022-07-10 18:41:02,887][25689] Fps is (10 sec: 5204.6, 60 sec: 5502.7, 300 sec: 5535.4). Total num frames: 864664576. Throughput: 0: 5024.2. Samples: 864666182. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:02,887][25689] Avg episode reward: [(0, '1.336')] [2022-07-10 18:41:03,754][26022] Updated weights on worker 0-0, policy_version 844406 (0.00087) [2022-07-10 18:41:05,571][26022] Updated weights on worker 0-0, policy_version 844416 (0.00086) [2022-07-10 18:41:07,294][26022] Updated weights on worker 0-0, policy_version 844426 (0.00085) [2022-07-10 18:41:07,993][25689] Fps is (10 sec: 5359.1, 60 sec: 5582.6, 300 sec: 5544.4). Total num frames: 864696320. Throughput: 0: 5745.3. Samples: 864697686. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:07,994][25689] Avg episode reward: [(0, '1.382')] [2022-07-10 18:41:09,307][26022] Updated weights on worker 0-0, policy_version 844436 (0.00086) [2022-07-10 18:41:11,023][26022] Updated weights on worker 0-0, policy_version 844446 (0.00092) [2022-07-10 18:41:12,829][26022] Updated weights on worker 0-0, policy_version 844456 (0.00089) [2022-07-10 18:41:13,020][25689] Fps is (10 sec: 5760.0, 60 sec: 5549.0, 300 sec: 5537.6). Total num frames: 864722944. Throughput: 0: 5716.7. Samples: 864731024. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:13,021][25689] Avg episode reward: [(0, '1.384')] [2022-07-10 18:41:14,759][26022] Updated weights on worker 0-0, policy_version 844466 (0.00089) [2022-07-10 18:41:16,413][26022] Updated weights on worker 0-0, policy_version 844476 (0.00089) [2022-07-10 18:41:18,038][25689] Fps is (10 sec: 5403.1, 60 sec: 5548.3, 300 sec: 5537.4). Total num frames: 864750592. Throughput: 0: 4880.4. Samples: 864747942. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:18,039][25689] Avg episode reward: [(0, '1.347')] [2022-07-10 18:41:18,400][26022] Updated weights on worker 0-0, policy_version 844486 (0.00086) [2022-07-10 18:41:20,475][26022] Updated weights on worker 0-0, policy_version 844496 (0.00099) [2022-07-10 18:41:22,018][26022] Updated weights on worker 0-0, policy_version 844506 (0.00087) [2022-07-10 18:41:23,115][25689] Fps is (10 sec: 5579.4, 60 sec: 5559.0, 300 sec: 5540.4). Total num frames: 864779264. Throughput: 0: 5703.0. Samples: 864781540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:23,116][25689] Avg episode reward: [(0, '-1.174')] [2022-07-10 18:41:24,001][26022] Updated weights on worker 0-0, policy_version 844516 (0.00086) [2022-07-10 18:41:25,761][26022] Updated weights on worker 0-0, policy_version 844526 (0.00092) [2022-07-10 18:41:27,523][26022] Updated weights on worker 0-0, policy_version 844536 (0.00096) [2022-07-10 18:41:28,246][25689] Fps is (10 sec: 5617.5, 60 sec: 5536.1, 300 sec: 5535.0). Total num frames: 864807936. Throughput: 0: 5797.7. Samples: 864815106. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:28,247][25689] Avg episode reward: [(0, '-1.365')] [2022-07-10 18:41:29,449][26022] Updated weights on worker 0-0, policy_version 844546 (0.00084) [2022-07-10 18:41:31,244][26022] Updated weights on worker 0-0, policy_version 844556 (0.00102) [2022-07-10 18:41:33,092][26022] Updated weights on worker 0-0, policy_version 844566 (0.00082) [2022-07-10 18:41:33,250][25689] Fps is (10 sec: 5657.8, 60 sec: 5570.0, 300 sec: 5542.3). Total num frames: 864836608. Throughput: 0: 4990.3. Samples: 864831976. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:33,250][25689] Avg episode reward: [(0, '-2.516')] [2022-07-10 18:41:34,851][26022] Updated weights on worker 0-0, policy_version 844576 (0.00088) [2022-07-10 18:41:36,644][26022] Updated weights on worker 0-0, policy_version 844586 (0.00095) [2022-07-10 18:41:38,266][25689] Fps is (10 sec: 5620.8, 60 sec: 5552.5, 300 sec: 5538.6). Total num frames: 864864256. Throughput: 0: 5809.3. Samples: 864865452. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:38,267][25689] Avg episode reward: [(0, '-2.820')] [2022-07-10 18:41:38,659][26022] Updated weights on worker 0-0, policy_version 844596 (0.00091) [2022-07-10 18:41:40,356][26022] Updated weights on worker 0-0, policy_version 844606 (0.00097) [2022-07-10 18:41:42,364][26022] Updated weights on worker 0-0, policy_version 844616 (0.00082) [2022-07-10 18:41:43,275][25689] Fps is (10 sec: 5516.0, 60 sec: 5535.7, 300 sec: 5540.1). Total num frames: 864891904. Throughput: 0: 5820.8. Samples: 864898888. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:43,275][25689] Avg episode reward: [(0, '-3.136')] [2022-07-10 18:41:44,188][26022] Updated weights on worker 0-0, policy_version 844626 (0.00095) [2022-07-10 18:41:45,946][26022] Updated weights on worker 0-0, policy_version 844636 (0.00085) [2022-07-10 18:41:47,711][26022] Updated weights on worker 0-0, policy_version 844646 (0.00108) [2022-07-10 18:41:48,340][25689] Fps is (10 sec: 5692.5, 60 sec: 5584.9, 300 sec: 5550.7). Total num frames: 864921600. Throughput: 0: 5002.6. Samples: 864915626. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:48,340][25689] Avg episode reward: [(0, '-0.782')] [2022-07-10 18:41:49,577][26022] Updated weights on worker 0-0, policy_version 844656 (0.00090) [2022-07-10 18:41:51,268][26022] Updated weights on worker 0-0, policy_version 844666 (0.00085) [2022-07-10 18:41:53,283][26022] Updated weights on worker 0-0, policy_version 844676 (0.00087) [2022-07-10 18:41:53,403][25689] Fps is (10 sec: 5560.8, 60 sec: 5547.0, 300 sec: 5537.4). Total num frames: 864948224. Throughput: 0: 5828.1. Samples: 864949428. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-10 18:41:53,403][25689] Avg episode reward: [(0, '-0.723')] [2022-07-10 18:41:55,024][26022] Updated weights on worker 0-0, policy_version 844686 (0.00092) [2022-07-10 18:41:57,018][26022] Updated weights on worker 0-0, policy_version 844696 (0.00083) [2022-07-10 18:41:58,430][25689] Fps is (10 sec: 5480.3, 60 sec: 5528.9, 300 sec: 5544.3). Total num frames: 864976896. Throughput: 0: 5828.1. Samples: 864982968. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:41:58,430][25689] Avg episode reward: [(0, '-0.491')] [2022-07-10 18:41:58,814][26022] Updated weights on worker 0-0, policy_version 844706 (0.00079) [2022-07-10 18:42:00,623][26022] Updated weights on worker 0-0, policy_version 844716 (0.00084) [2022-07-10 18:42:02,644][26022] Updated weights on worker 0-0, policy_version 844726 (0.00087) [2022-07-10 18:42:03,483][25689] Fps is (10 sec: 5485.7, 60 sec: 5593.5, 300 sec: 5544.9). Total num frames: 865003520. Throughput: 0: 4988.0. Samples: 864999686. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:03,483][25689] Avg episode reward: [(0, '-1.127')] [2022-07-10 18:42:04,639][26022] Updated weights on worker 0-0, policy_version 844736 (0.00083) [2022-07-10 18:42:06,275][26022] Updated weights on worker 0-0, policy_version 844746 (0.00089) [2022-07-10 18:42:08,308][26022] Updated weights on worker 0-0, policy_version 844756 (0.00091) [2022-07-10 18:42:08,530][25689] Fps is (10 sec: 5373.2, 60 sec: 5531.3, 300 sec: 5544.6). Total num frames: 865031168. Throughput: 0: 5728.5. Samples: 865031286. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:08,531][25689] Avg episode reward: [(0, '-0.411')] [2022-07-10 18:42:10,020][26022] Updated weights on worker 0-0, policy_version 844766 (0.00088) [2022-07-10 18:42:11,997][26022] Updated weights on worker 0-0, policy_version 844776 (0.00089) [2022-07-10 18:42:13,564][25689] Fps is (10 sec: 5586.8, 60 sec: 5564.5, 300 sec: 5547.7). Total num frames: 865059840. Throughput: 0: 5719.3. Samples: 865064734. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:13,564][25689] Avg episode reward: [(0, '-0.036')] [2022-07-10 18:42:13,701][26022] Updated weights on worker 0-0, policy_version 844786 (0.00089) [2022-07-10 18:42:15,599][26022] Updated weights on worker 0-0, policy_version 844796 (0.00086) [2022-07-10 18:42:17,507][26022] Updated weights on worker 0-0, policy_version 844806 (0.00082) [2022-07-10 18:42:18,567][25689] Fps is (10 sec: 5509.2, 60 sec: 5548.9, 300 sec: 5537.6). Total num frames: 865086464. Throughput: 0: 4889.7. Samples: 865081434. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:18,568][25689] Avg episode reward: [(0, '0.064')] [2022-07-10 18:42:19,471][26022] Updated weights on worker 0-0, policy_version 844816 (0.00088) [2022-07-10 18:42:21,246][26022] Updated weights on worker 0-0, policy_version 844826 (0.00084) [2022-07-10 18:42:23,111][26022] Updated weights on worker 0-0, policy_version 844836 (0.00090) [2022-07-10 18:42:23,619][25689] Fps is (10 sec: 5397.3, 60 sec: 5534.3, 300 sec: 5541.4). Total num frames: 865114112. Throughput: 0: 5706.8. Samples: 865114600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:23,621][25689] Avg episode reward: [(0, '0.003')] [2022-07-10 18:42:24,936][26022] Updated weights on worker 0-0, policy_version 844846 (0.00085) [2022-07-10 18:42:26,742][26022] Updated weights on worker 0-0, policy_version 844856 (0.00088) [2022-07-10 18:42:28,635][26022] Updated weights on worker 0-0, policy_version 844866 (0.00084) [2022-07-10 18:42:28,712][25689] Fps is (10 sec: 5551.6, 60 sec: 5537.8, 300 sec: 5540.1). Total num frames: 865142784. Throughput: 0: 5790.4. Samples: 865148146. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:28,713][25689] Avg episode reward: [(0, '0.704')] [2022-07-10 18:42:30,315][26022] Updated weights on worker 0-0, policy_version 844876 (0.00088) [2022-07-10 18:42:32,300][26022] Updated weights on worker 0-0, policy_version 844886 (0.00094) [2022-07-10 18:42:33,743][25689] Fps is (10 sec: 5664.4, 60 sec: 5535.3, 300 sec: 5536.3). Total num frames: 865171456. Throughput: 0: 4975.2. Samples: 865165128. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:33,743][25689] Avg episode reward: [(0, '0.823')] [2022-07-10 18:42:34,023][26022] Updated weights on worker 0-0, policy_version 844896 (0.00084) [2022-07-10 18:42:35,976][26022] Updated weights on worker 0-0, policy_version 844906 (0.00098) [2022-07-10 18:42:37,458][26022] Updated weights on worker 0-0, policy_version 844916 (0.00084) [2022-07-10 18:42:38,761][25689] Fps is (10 sec: 5604.3, 60 sec: 5535.1, 300 sec: 5539.7). Total num frames: 865199104. Throughput: 0: 5814.9. Samples: 865198862. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:38,762][25689] Avg episode reward: [(0, '0.824')] [2022-07-10 18:42:39,796][26022] Updated weights on worker 0-0, policy_version 844926 (0.00090) [2022-07-10 18:42:41,196][26022] Updated weights on worker 0-0, policy_version 844936 (0.00088) [2022-07-10 18:42:43,387][26022] Updated weights on worker 0-0, policy_version 844946 (0.00083) [2022-07-10 18:42:43,765][25689] Fps is (10 sec: 5721.3, 60 sec: 5569.4, 300 sec: 5547.6). Total num frames: 865228800. Throughput: 0: 5831.8. Samples: 865232090. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:43,766][25689] Avg episode reward: [(0, '0.795')] [2022-07-10 18:42:45,059][26022] Updated weights on worker 0-0, policy_version 844956 (0.00084) [2022-07-10 18:42:46,854][26022] Updated weights on worker 0-0, policy_version 844966 (0.00086) [2022-07-10 18:42:48,827][25689] Fps is (10 sec: 5493.7, 60 sec: 5502.0, 300 sec: 5536.5). Total num frames: 865254400. Throughput: 0: 5001.4. Samples: 865248748. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:48,829][25689] Avg episode reward: [(0, '0.758')] [2022-07-10 18:42:49,037][26022] Updated weights on worker 0-0, policy_version 844976 (0.00084) [2022-07-10 18:42:50,562][26022] Updated weights on worker 0-0, policy_version 844986 (0.00090) [2022-07-10 18:42:52,390][26022] Updated weights on worker 0-0, policy_version 844996 (0.00106) [2022-07-10 18:42:53,885][25689] Fps is (10 sec: 5160.9, 60 sec: 5502.5, 300 sec: 5532.0). Total num frames: 865281024. Throughput: 0: 5810.5. Samples: 865282164. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:53,885][25689] Avg episode reward: [(0, '0.602')] [2022-07-10 18:42:54,324][26022] Updated weights on worker 0-0, policy_version 845006 (0.00085) [2022-07-10 18:42:55,307][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:42:55,322][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000845012_865292288.pth [2022-07-10 18:42:55,322][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000843062_863295488.pth [2022-07-10 18:42:56,191][26022] Updated weights on worker 0-0, policy_version 845016 (0.00096) [2022-07-10 18:42:58,019][26022] Updated weights on worker 0-0, policy_version 845026 (0.00094) [2022-07-10 18:42:58,916][25689] Fps is (10 sec: 5683.8, 60 sec: 5536.0, 300 sec: 5538.7). Total num frames: 865311744. Throughput: 0: 5776.0. Samples: 865315274. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:42:58,917][25689] Avg episode reward: [(0, '0.596')] [2022-07-10 18:42:59,973][26022] Updated weights on worker 0-0, policy_version 845036 (0.00100) [2022-07-10 18:43:01,571][26022] Updated weights on worker 0-0, policy_version 845046 (0.00063) [2022-07-10 18:43:03,923][25689] Fps is (10 sec: 5508.4, 60 sec: 5506.3, 300 sec: 5536.5). Total num frames: 865336320. Throughput: 0: 4962.8. Samples: 865332128. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:03,924][25689] Avg episode reward: [(0, '0.525')] [2022-07-10 18:43:04,127][26022] Updated weights on worker 0-0, policy_version 845056 (0.00082) [2022-07-10 18:43:05,564][26022] Updated weights on worker 0-0, policy_version 845066 (0.00085) [2022-07-10 18:43:07,562][26022] Updated weights on worker 0-0, policy_version 845076 (0.00091) [2022-07-10 18:43:08,997][25689] Fps is (10 sec: 5281.7, 60 sec: 5520.7, 300 sec: 5532.0). Total num frames: 865364992. Throughput: 0: 5693.3. Samples: 865363586. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:08,998][25689] Avg episode reward: [(0, '0.276')] [2022-07-10 18:43:09,384][26022] Updated weights on worker 0-0, policy_version 845086 (0.00088) [2022-07-10 18:43:11,227][26022] Updated weights on worker 0-0, policy_version 845096 (0.00087) [2022-07-10 18:43:13,151][26022] Updated weights on worker 0-0, policy_version 845106 (0.00098) [2022-07-10 18:43:14,000][25689] Fps is (10 sec: 5792.1, 60 sec: 5540.5, 300 sec: 5540.3). Total num frames: 865394688. Throughput: 0: 5703.0. Samples: 865396884. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:14,002][25689] Avg episode reward: [(0, '0.091')] [2022-07-10 18:43:15,005][26022] Updated weights on worker 0-0, policy_version 845116 (0.00089) [2022-07-10 18:43:16,614][26022] Updated weights on worker 0-0, policy_version 845126 (0.00088) [2022-07-10 18:43:18,842][26022] Updated weights on worker 0-0, policy_version 845136 (0.00091) [2022-07-10 18:43:19,038][25689] Fps is (10 sec: 5507.1, 60 sec: 5520.4, 300 sec: 5536.5). Total num frames: 865420288. Throughput: 0: 5729.5. Samples: 865430566. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:19,040][25689] Avg episode reward: [(0, '-0.145')] [2022-07-10 18:43:20,242][26022] Updated weights on worker 0-0, policy_version 845146 (0.00079) [2022-07-10 18:43:22,438][26022] Updated weights on worker 0-0, policy_version 845156 (0.00087) [2022-07-10 18:43:24,078][25689] Fps is (10 sec: 5486.9, 60 sec: 5555.4, 300 sec: 5538.3). Total num frames: 865449984. Throughput: 0: 5716.7. Samples: 865447348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:24,079][25689] Avg episode reward: [(0, '-0.264')] [2022-07-10 18:43:24,080][26022] Updated weights on worker 0-0, policy_version 845166 (0.00098) [2022-07-10 18:43:25,970][26022] Updated weights on worker 0-0, policy_version 845176 (0.00444) [2022-07-10 18:43:27,802][26022] Updated weights on worker 0-0, policy_version 845186 (0.00080) [2022-07-10 18:43:29,139][25689] Fps is (10 sec: 5778.7, 60 sec: 5558.3, 300 sec: 5538.3). Total num frames: 865478656. Throughput: 0: 5818.0. Samples: 865480770. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:29,139][25689] Avg episode reward: [(0, '-0.626')] [2022-07-10 18:43:29,515][26022] Updated weights on worker 0-0, policy_version 845196 (0.00085) [2022-07-10 18:43:31,399][26022] Updated weights on worker 0-0, policy_version 845206 (0.00088) [2022-07-10 18:43:33,447][26022] Updated weights on worker 0-0, policy_version 845216 (0.00084) [2022-07-10 18:43:34,152][25689] Fps is (10 sec: 5387.6, 60 sec: 5509.1, 300 sec: 5535.3). Total num frames: 865504256. Throughput: 0: 5829.7. Samples: 865514360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:34,152][25689] Avg episode reward: [(0, '-1.066')] [2022-07-10 18:43:34,950][26022] Updated weights on worker 0-0, policy_version 845226 (0.00112) [2022-07-10 18:43:36,987][26022] Updated weights on worker 0-0, policy_version 845236 (0.00081) [2022-07-10 18:43:38,636][26022] Updated weights on worker 0-0, policy_version 845246 (0.00090) [2022-07-10 18:43:39,233][25689] Fps is (10 sec: 5477.9, 60 sec: 5537.2, 300 sec: 5533.9). Total num frames: 865533952. Throughput: 0: 4983.4. Samples: 865531206. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:39,234][25689] Avg episode reward: [(0, '-0.855')] [2022-07-10 18:43:40,461][26022] Updated weights on worker 0-0, policy_version 845256 (0.00086) [2022-07-10 18:43:42,406][26022] Updated weights on worker 0-0, policy_version 845266 (0.00109) [2022-07-10 18:43:44,193][26022] Updated weights on worker 0-0, policy_version 845276 (0.00080) [2022-07-10 18:43:44,291][25689] Fps is (10 sec: 5756.6, 60 sec: 5515.4, 300 sec: 5538.6). Total num frames: 865562624. Throughput: 0: 5800.6. Samples: 865564594. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:44,291][25689] Avg episode reward: [(0, '-0.832')] [2022-07-10 18:43:46,270][26022] Updated weights on worker 0-0, policy_version 845286 (0.00090) [2022-07-10 18:43:48,100][26022] Updated weights on worker 0-0, policy_version 845296 (0.00084) [2022-07-10 18:43:49,397][25689] Fps is (10 sec: 5541.3, 60 sec: 5545.2, 300 sec: 5533.7). Total num frames: 865590272. Throughput: 0: 5786.7. Samples: 865597998. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:49,397][25689] Avg episode reward: [(0, '-0.357')] [2022-07-10 18:43:49,713][26022] Updated weights on worker 0-0, policy_version 845306 (0.00069) [2022-07-10 18:43:51,839][26022] Updated weights on worker 0-0, policy_version 845316 (0.00087) [2022-07-10 18:43:53,620][26022] Updated weights on worker 0-0, policy_version 845326 (0.00086) [2022-07-10 18:43:54,468][25689] Fps is (10 sec: 5533.7, 60 sec: 5577.7, 300 sec: 5540.8). Total num frames: 865618944. Throughput: 0: 4929.6. Samples: 865614514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:54,469][25689] Avg episode reward: [(0, '0.011')] [2022-07-10 18:43:55,374][26022] Updated weights on worker 0-0, policy_version 845336 (0.00094) [2022-07-10 18:43:57,121][26022] Updated weights on worker 0-0, policy_version 845346 (0.00084) [2022-07-10 18:43:58,983][26022] Updated weights on worker 0-0, policy_version 845356 (0.00103) [2022-07-10 18:43:59,492][25689] Fps is (10 sec: 5477.6, 60 sec: 5510.8, 300 sec: 5541.3). Total num frames: 865645568. Throughput: 0: 5768.1. Samples: 865648060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:43:59,492][25689] Avg episode reward: [(0, '0.297')] [2022-07-10 18:44:00,967][26022] Updated weights on worker 0-0, policy_version 845366 (0.00094) [2022-07-10 18:44:03,108][26022] Updated weights on worker 0-0, policy_version 845376 (0.00091) [2022-07-10 18:44:04,503][25689] Fps is (10 sec: 5306.5, 60 sec: 5544.3, 300 sec: 5535.8). Total num frames: 865672192. Throughput: 0: 5677.4. Samples: 865679346. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:04,503][25689] Avg episode reward: [(0, '0.385')] [2022-07-10 18:44:04,892][26022] Updated weights on worker 0-0, policy_version 845386 (0.00089) [2022-07-10 18:44:06,809][26022] Updated weights on worker 0-0, policy_version 845396 (0.00086) [2022-07-10 18:44:08,423][26022] Updated weights on worker 0-0, policy_version 845406 (0.00087) [2022-07-10 18:44:09,576][25689] Fps is (10 sec: 5381.6, 60 sec: 5527.4, 300 sec: 5534.6). Total num frames: 865699840. Throughput: 0: 4860.1. Samples: 865696072. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:09,577][25689] Avg episode reward: [(0, '0.622')] [2022-07-10 18:44:10,515][26022] Updated weights on worker 0-0, policy_version 845416 (0.00098) [2022-07-10 18:44:12,218][26022] Updated weights on worker 0-0, policy_version 845426 (0.00094) [2022-07-10 18:44:14,158][26022] Updated weights on worker 0-0, policy_version 845436 (0.00087) [2022-07-10 18:44:14,615][25689] Fps is (10 sec: 5671.0, 60 sec: 5524.2, 300 sec: 5545.1). Total num frames: 865729536. Throughput: 0: 5709.5. Samples: 865729538. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:14,615][25689] Avg episode reward: [(0, '0.641')] [2022-07-10 18:44:16,106][26022] Updated weights on worker 0-0, policy_version 845446 (0.00089) [2022-07-10 18:44:17,692][26022] Updated weights on worker 0-0, policy_version 845456 (0.00093) [2022-07-10 18:44:19,627][25689] Fps is (10 sec: 5603.9, 60 sec: 5543.5, 300 sec: 5538.3). Total num frames: 865756160. Throughput: 0: 5719.6. Samples: 865763224. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:19,627][25689] Avg episode reward: [(0, '0.647')] [2022-07-10 18:44:19,936][26022] Updated weights on worker 0-0, policy_version 845466 (0.00085) [2022-07-10 18:44:21,525][26022] Updated weights on worker 0-0, policy_version 845476 (0.00090) [2022-07-10 18:44:23,395][26022] Updated weights on worker 0-0, policy_version 845486 (0.00085) [2022-07-10 18:44:24,658][25689] Fps is (10 sec: 5404.0, 60 sec: 5510.5, 300 sec: 5535.3). Total num frames: 865783808. Throughput: 0: 4992.8. Samples: 865779974. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:24,658][25689] Avg episode reward: [(0, '0.385')] [2022-07-10 18:44:25,174][26022] Updated weights on worker 0-0, policy_version 845496 (0.00081) [2022-07-10 18:44:26,989][26022] Updated weights on worker 0-0, policy_version 845506 (0.00083) [2022-07-10 18:44:28,815][26022] Updated weights on worker 0-0, policy_version 845516 (0.00061) [2022-07-10 18:44:29,774][25689] Fps is (10 sec: 5751.9, 60 sec: 5539.2, 300 sec: 5547.5). Total num frames: 865814528. Throughput: 0: 5810.0. Samples: 865813422. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:29,775][25689] Avg episode reward: [(0, '0.881')] [2022-07-10 18:44:30,822][26022] Updated weights on worker 0-0, policy_version 845526 (0.00087) [2022-07-10 18:44:32,413][26022] Updated weights on worker 0-0, policy_version 845536 (0.00082) [2022-07-10 18:44:34,329][26022] Updated weights on worker 0-0, policy_version 845546 (0.00079) [2022-07-10 18:44:34,845][25689] Fps is (10 sec: 5729.3, 60 sec: 5567.6, 300 sec: 5544.3). Total num frames: 865842176. Throughput: 0: 5803.5. Samples: 865846946. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:34,846][25689] Avg episode reward: [(0, '0.572')] [2022-07-10 18:44:36,174][26022] Updated weights on worker 0-0, policy_version 845556 (0.00082) [2022-07-10 18:44:38,070][26022] Updated weights on worker 0-0, policy_version 845566 (0.00094) [2022-07-10 18:44:39,887][25689] Fps is (10 sec: 5366.8, 60 sec: 5520.7, 300 sec: 5540.7). Total num frames: 865868800. Throughput: 0: 4963.6. Samples: 865863786. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:39,887][25689] Avg episode reward: [(0, '0.043')] [2022-07-10 18:44:39,948][26022] Updated weights on worker 0-0, policy_version 845576 (0.00088) [2022-07-10 18:44:41,434][26022] Updated weights on worker 0-0, policy_version 845586 (0.00087) [2022-07-10 18:44:43,617][26022] Updated weights on worker 0-0, policy_version 845596 (0.00081) [2022-07-10 18:44:44,922][25689] Fps is (10 sec: 5589.0, 60 sec: 5539.6, 300 sec: 5545.4). Total num frames: 865898496. Throughput: 0: 5784.0. Samples: 865897184. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:44,923][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 18:44:45,331][26022] Updated weights on worker 0-0, policy_version 845606 (0.00096) [2022-07-10 18:44:47,245][26022] Updated weights on worker 0-0, policy_version 845616 (0.00082) [2022-07-10 18:44:49,046][26022] Updated weights on worker 0-0, policy_version 845626 (0.00085) [2022-07-10 18:44:50,023][25689] Fps is (10 sec: 5758.4, 60 sec: 5557.0, 300 sec: 5544.7). Total num frames: 865927168. Throughput: 0: 5804.0. Samples: 865930946. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:50,023][25689] Avg episode reward: [(0, '-0.542')] [2022-07-10 18:44:50,903][26022] Updated weights on worker 0-0, policy_version 845636 (0.00096) [2022-07-10 18:44:52,662][26022] Updated weights on worker 0-0, policy_version 845646 (0.00082) [2022-07-10 18:44:54,410][26022] Updated weights on worker 0-0, policy_version 845656 (0.00084) [2022-07-10 18:44:55,037][25689] Fps is (10 sec: 5568.0, 60 sec: 5545.3, 300 sec: 5541.6). Total num frames: 865954816. Throughput: 0: 5003.9. Samples: 865947984. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:44:55,038][25689] Avg episode reward: [(0, '-0.525')] [2022-07-10 18:44:55,349][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:44:55,364][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000845661_865956864.pth [2022-07-10 18:44:55,364][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000843711_863960064.pth [2022-07-10 18:44:56,288][26022] Updated weights on worker 0-0, policy_version 845666 (0.00084) [2022-07-10 18:44:58,166][26022] Updated weights on worker 0-0, policy_version 845676 (0.00092) [2022-07-10 18:45:00,046][25689] Fps is (10 sec: 5516.9, 60 sec: 5563.6, 300 sec: 5546.0). Total num frames: 865982464. Throughput: 0: 5853.3. Samples: 865981784. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:00,048][25689] Avg episode reward: [(0, '-0.700')] [2022-07-10 18:45:00,051][26022] Updated weights on worker 0-0, policy_version 845686 (0.00100) [2022-07-10 18:45:02,096][26022] Updated weights on worker 0-0, policy_version 845696 (0.00091) [2022-07-10 18:45:03,961][26022] Updated weights on worker 0-0, policy_version 845706 (0.00094) [2022-07-10 18:45:05,059][25689] Fps is (10 sec: 5313.2, 60 sec: 5546.4, 300 sec: 5543.3). Total num frames: 866008064. Throughput: 0: 5775.0. Samples: 866013474. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:05,060][25689] Avg episode reward: [(0, '-0.409')] [2022-07-10 18:45:05,733][26022] Updated weights on worker 0-0, policy_version 845716 (0.00085) [2022-07-10 18:45:07,622][26022] Updated weights on worker 0-0, policy_version 845726 (0.00098) [2022-07-10 18:45:09,614][26022] Updated weights on worker 0-0, policy_version 845736 (0.00086) [2022-07-10 18:45:10,164][25689] Fps is (10 sec: 5465.1, 60 sec: 5577.4, 300 sec: 5545.3). Total num frames: 866037760. Throughput: 0: 4928.1. Samples: 866030204. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:10,164][25689] Avg episode reward: [(0, '-0.291')] [2022-07-10 18:45:11,046][26022] Updated weights on worker 0-0, policy_version 845746 (0.00095) [2022-07-10 18:45:13,166][26022] Updated weights on worker 0-0, policy_version 845756 (0.00089) [2022-07-10 18:45:14,820][26022] Updated weights on worker 0-0, policy_version 845766 (0.00084) [2022-07-10 18:45:15,240][25689] Fps is (10 sec: 5733.3, 60 sec: 5557.0, 300 sec: 5547.5). Total num frames: 866066432. Throughput: 0: 5735.6. Samples: 866063856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:15,240][25689] Avg episode reward: [(0, '-0.475')] [2022-07-10 18:45:16,883][26022] Updated weights on worker 0-0, policy_version 845776 (0.00084) [2022-07-10 18:45:18,378][26022] Updated weights on worker 0-0, policy_version 845786 (0.00085) [2022-07-10 18:45:20,297][25689] Fps is (10 sec: 5456.9, 60 sec: 5552.9, 300 sec: 5543.2). Total num frames: 866093056. Throughput: 0: 5720.6. Samples: 866097634. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:20,300][25689] Avg episode reward: [(0, '-1.277')] [2022-07-10 18:45:20,490][26022] Updated weights on worker 0-0, policy_version 845796 (0.00084) [2022-07-10 18:45:22,096][26022] Updated weights on worker 0-0, policy_version 845806 (0.00094) [2022-07-10 18:45:24,110][26022] Updated weights on worker 0-0, policy_version 845816 (0.00088) [2022-07-10 18:45:25,332][25689] Fps is (10 sec: 5580.7, 60 sec: 5586.3, 300 sec: 5543.8). Total num frames: 866122752. Throughput: 0: 5810.0. Samples: 866131256. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:25,332][25689] Avg episode reward: [(0, '-0.914')] [2022-07-10 18:45:25,881][26022] Updated weights on worker 0-0, policy_version 845826 (0.00088) [2022-07-10 18:45:27,691][26022] Updated weights on worker 0-0, policy_version 845836 (0.00088) [2022-07-10 18:45:29,640][26022] Updated weights on worker 0-0, policy_version 845846 (0.00092) [2022-07-10 18:45:30,437][25689] Fps is (10 sec: 5655.4, 60 sec: 5536.7, 300 sec: 5545.3). Total num frames: 866150400. Throughput: 0: 5805.1. Samples: 866147890. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2022-07-10 18:45:30,437][25689] Avg episode reward: [(0, '-0.964')] [2022-07-10 18:45:31,390][26022] Updated weights on worker 0-0, policy_version 845856 (0.00097) [2022-07-10 18:45:33,223][26022] Updated weights on worker 0-0, policy_version 845866 (0.00083) [2022-07-10 18:45:35,021][26022] Updated weights on worker 0-0, policy_version 845876 (0.00091) [2022-07-10 18:45:35,481][25689] Fps is (10 sec: 5548.9, 60 sec: 5556.0, 300 sec: 5544.7). Total num frames: 866179072. Throughput: 0: 5818.4. Samples: 866181630. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:45:35,482][25689] Avg episode reward: [(0, '-1.313')] [2022-07-10 18:45:36,921][26022] Updated weights on worker 0-0, policy_version 845886 (0.00088) [2022-07-10 18:45:38,721][26022] Updated weights on worker 0-0, policy_version 845896 (0.00083) [2022-07-10 18:45:40,529][25689] Fps is (10 sec: 5682.3, 60 sec: 5589.3, 300 sec: 5544.0). Total num frames: 866207744. Throughput: 0: 5790.3. Samples: 866214780. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:45:40,529][25689] Avg episode reward: [(0, '0.252')] [2022-07-10 18:45:40,535][26022] Updated weights on worker 0-0, policy_version 845906 (0.00089) [2022-07-10 18:45:42,299][26022] Updated weights on worker 0-0, policy_version 845916 (0.00094) [2022-07-10 18:45:44,379][26022] Updated weights on worker 0-0, policy_version 845926 (0.00089) [2022-07-10 18:45:45,601][25689] Fps is (10 sec: 5565.3, 60 sec: 5552.1, 300 sec: 5547.0). Total num frames: 866235392. Throughput: 0: 4934.9. Samples: 866231284. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:45:45,603][25689] Avg episode reward: [(0, '0.205')] [2022-07-10 18:45:46,185][26022] Updated weights on worker 0-0, policy_version 845936 (0.00092) [2022-07-10 18:45:48,002][26022] Updated weights on worker 0-0, policy_version 845946 (0.00087) [2022-07-10 18:45:49,715][26022] Updated weights on worker 0-0, policy_version 845956 (0.00083) [2022-07-10 18:45:50,716][25689] Fps is (10 sec: 5427.7, 60 sec: 5533.9, 300 sec: 5541.8). Total num frames: 866263040. Throughput: 0: 5772.7. Samples: 866264958. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:45:50,718][25689] Avg episode reward: [(0, '-0.175')] [2022-07-10 18:45:51,658][26022] Updated weights on worker 0-0, policy_version 845966 (0.00085) [2022-07-10 18:45:53,468][26022] Updated weights on worker 0-0, policy_version 845976 (0.00092) [2022-07-10 18:45:55,472][26022] Updated weights on worker 0-0, policy_version 845986 (0.00092) [2022-07-10 18:45:55,778][25689] Fps is (10 sec: 5433.7, 60 sec: 5529.6, 300 sec: 5534.0). Total num frames: 866290688. Throughput: 0: 5752.8. Samples: 866298390. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:45:55,778][25689] Avg episode reward: [(0, '-0.227')] [2022-07-10 18:45:57,233][26022] Updated weights on worker 0-0, policy_version 845996 (0.00096) [2022-07-10 18:45:59,092][26022] Updated weights on worker 0-0, policy_version 846006 (0.00095) [2022-07-10 18:46:00,834][25689] Fps is (10 sec: 5566.5, 60 sec: 5542.2, 300 sec: 5554.0). Total num frames: 866319360. Throughput: 0: 4941.6. Samples: 866315120. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:00,834][25689] Avg episode reward: [(0, '-0.824')] [2022-07-10 18:46:00,911][26022] Updated weights on worker 0-0, policy_version 846016 (0.00094) [2022-07-10 18:46:03,118][26022] Updated weights on worker 0-0, policy_version 846026 (0.00089) [2022-07-10 18:46:05,071][26022] Updated weights on worker 0-0, policy_version 846036 (0.00106) [2022-07-10 18:46:05,836][25689] Fps is (10 sec: 5497.6, 60 sec: 5560.0, 300 sec: 5538.7). Total num frames: 866345984. Throughput: 0: 5692.4. Samples: 866346468. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:05,838][25689] Avg episode reward: [(0, '-0.880')] [2022-07-10 18:46:06,807][26022] Updated weights on worker 0-0, policy_version 846046 (0.00093) [2022-07-10 18:46:08,774][26022] Updated weights on worker 0-0, policy_version 846056 (0.00618) [2022-07-10 18:46:10,492][26022] Updated weights on worker 0-0, policy_version 846066 (0.00090) [2022-07-10 18:46:10,969][25689] Fps is (10 sec: 5354.8, 60 sec: 5523.8, 300 sec: 5540.2). Total num frames: 866373632. Throughput: 0: 5673.7. Samples: 866379866. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:10,970][25689] Avg episode reward: [(0, '-0.919')] [2022-07-10 18:46:12,335][26022] Updated weights on worker 0-0, policy_version 846076 (0.00099) [2022-07-10 18:46:14,215][26022] Updated weights on worker 0-0, policy_version 846086 (0.00085) [2022-07-10 18:46:15,942][26022] Updated weights on worker 0-0, policy_version 846096 (0.00091) [2022-07-10 18:46:16,042][25689] Fps is (10 sec: 5518.3, 60 sec: 5524.0, 300 sec: 5542.6). Total num frames: 866402304. Throughput: 0: 4842.9. Samples: 866396528. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:16,046][25689] Avg episode reward: [(0, '-0.638')] [2022-07-10 18:46:17,863][26022] Updated weights on worker 0-0, policy_version 846106 (0.00080) [2022-07-10 18:46:19,746][26022] Updated weights on worker 0-0, policy_version 846116 (0.00085) [2022-07-10 18:46:21,075][25689] Fps is (10 sec: 5573.1, 60 sec: 5543.1, 300 sec: 5540.0). Total num frames: 866429952. Throughput: 0: 5670.6. Samples: 866429898. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:21,076][25689] Avg episode reward: [(0, '-0.624')] [2022-07-10 18:46:21,539][26022] Updated weights on worker 0-0, policy_version 846126 (0.00091) [2022-07-10 18:46:23,386][26022] Updated weights on worker 0-0, policy_version 846136 (0.00093) [2022-07-10 18:46:25,129][26022] Updated weights on worker 0-0, policy_version 846146 (0.00102) [2022-07-10 18:46:26,105][25689] Fps is (10 sec: 5494.7, 60 sec: 5509.7, 300 sec: 5538.4). Total num frames: 866457600. Throughput: 0: 5777.1. Samples: 866463568. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:26,106][25689] Avg episode reward: [(0, '-0.959')] [2022-07-10 18:46:26,947][26022] Updated weights on worker 0-0, policy_version 846156 (0.00086) [2022-07-10 18:46:28,866][26022] Updated weights on worker 0-0, policy_version 846166 (0.00089) [2022-07-10 18:46:30,666][26022] Updated weights on worker 0-0, policy_version 846176 (0.00078) [2022-07-10 18:46:31,193][25689] Fps is (10 sec: 5566.1, 60 sec: 5528.2, 300 sec: 5536.8). Total num frames: 866486272. Throughput: 0: 4963.1. Samples: 866480240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:31,194][25689] Avg episode reward: [(0, '-0.455')] [2022-07-10 18:46:32,528][26022] Updated weights on worker 0-0, policy_version 846186 (0.00090) [2022-07-10 18:46:34,421][26022] Updated weights on worker 0-0, policy_version 846196 (0.00087) [2022-07-10 18:46:36,083][26022] Updated weights on worker 0-0, policy_version 846206 (0.00087) [2022-07-10 18:46:36,280][25689] Fps is (10 sec: 5635.9, 60 sec: 5524.3, 300 sec: 5539.0). Total num frames: 866514944. Throughput: 0: 5807.6. Samples: 866514064. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:36,281][25689] Avg episode reward: [(0, '-0.571')] [2022-07-10 18:46:37,901][26022] Updated weights on worker 0-0, policy_version 846216 (0.00366) [2022-07-10 18:46:39,766][26022] Updated weights on worker 0-0, policy_version 846226 (0.00088) [2022-07-10 18:46:41,330][25689] Fps is (10 sec: 5656.8, 60 sec: 5524.0, 300 sec: 5541.6). Total num frames: 866543616. Throughput: 0: 5812.1. Samples: 866547626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:41,332][25689] Avg episode reward: [(0, '-0.529')] [2022-07-10 18:46:41,509][26022] Updated weights on worker 0-0, policy_version 846236 (0.00089) [2022-07-10 18:46:43,588][26022] Updated weights on worker 0-0, policy_version 846246 (0.00088) [2022-07-10 18:46:45,276][26022] Updated weights on worker 0-0, policy_version 846256 (0.00090) [2022-07-10 18:46:46,351][25689] Fps is (10 sec: 5592.7, 60 sec: 5528.8, 300 sec: 5535.6). Total num frames: 866571264. Throughput: 0: 4981.9. Samples: 866564428. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:46,352][25689] Avg episode reward: [(0, '-0.929')] [2022-07-10 18:46:47,290][26022] Updated weights on worker 0-0, policy_version 846266 (0.00091) [2022-07-10 18:46:49,012][26022] Updated weights on worker 0-0, policy_version 846276 (0.00090) [2022-07-10 18:46:51,005][26022] Updated weights on worker 0-0, policy_version 846286 (0.00082) [2022-07-10 18:46:51,410][25689] Fps is (10 sec: 5587.7, 60 sec: 5550.8, 300 sec: 5542.5). Total num frames: 866599936. Throughput: 0: 5821.1. Samples: 866597922. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:51,410][25689] Avg episode reward: [(0, '-0.345')] [2022-07-10 18:46:52,674][26022] Updated weights on worker 0-0, policy_version 846296 (0.00090) [2022-07-10 18:46:54,575][26022] Updated weights on worker 0-0, policy_version 846306 (0.00089) [2022-07-10 18:46:55,484][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:46:55,493][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000846310_866621440.pth [2022-07-10 18:46:55,494][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000844361_864625664.pth [2022-07-10 18:46:56,216][26022] Updated weights on worker 0-0, policy_version 846316 (0.00084) [2022-07-10 18:46:56,422][25689] Fps is (10 sec: 5693.9, 60 sec: 5572.2, 300 sec: 5542.8). Total num frames: 866628608. Throughput: 0: 5834.8. Samples: 866631586. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:46:56,425][25689] Avg episode reward: [(0, '-1.021')] [2022-07-10 18:46:58,269][26022] Updated weights on worker 0-0, policy_version 846326 (0.00110) [2022-07-10 18:46:59,929][26022] Updated weights on worker 0-0, policy_version 846336 (0.00090) [2022-07-10 18:47:01,436][25689] Fps is (10 sec: 5412.6, 60 sec: 5525.3, 300 sec: 5540.1). Total num frames: 866654208. Throughput: 0: 5836.4. Samples: 866664974. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:01,438][25689] Avg episode reward: [(0, '-0.998')] [2022-07-10 18:47:01,847][26022] Updated weights on worker 0-0, policy_version 846346 (0.00086) [2022-07-10 18:47:04,162][26022] Updated weights on worker 0-0, policy_version 846356 (0.00096) [2022-07-10 18:47:05,951][26022] Updated weights on worker 0-0, policy_version 846366 (0.00094) [2022-07-10 18:47:06,458][25689] Fps is (10 sec: 5203.3, 60 sec: 5523.5, 300 sec: 5537.1). Total num frames: 866680832. Throughput: 0: 5729.4. Samples: 866679634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:06,465][25689] Avg episode reward: [(0, '-0.925')] [2022-07-10 18:47:07,731][26022] Updated weights on worker 0-0, policy_version 846376 (0.00086) [2022-07-10 18:47:09,652][26022] Updated weights on worker 0-0, policy_version 846386 (0.00622) [2022-07-10 18:47:11,394][26022] Updated weights on worker 0-0, policy_version 846396 (0.00995) [2022-07-10 18:47:11,531][25689] Fps is (10 sec: 5579.4, 60 sec: 5562.9, 300 sec: 5539.8). Total num frames: 866710528. Throughput: 0: 5710.7. Samples: 866712828. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:11,531][25689] Avg episode reward: [(0, '-1.598')] [2022-07-10 18:47:13,507][26022] Updated weights on worker 0-0, policy_version 846406 (0.00072) [2022-07-10 18:47:15,012][26022] Updated weights on worker 0-0, policy_version 846416 (0.00083) [2022-07-10 18:47:16,535][25689] Fps is (10 sec: 5487.8, 60 sec: 5518.4, 300 sec: 5536.4). Total num frames: 866736128. Throughput: 0: 5700.4. Samples: 866746238. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:16,536][25689] Avg episode reward: [(0, '-0.745')] [2022-07-10 18:47:16,942][26022] Updated weights on worker 0-0, policy_version 846426 (0.00092) [2022-07-10 18:47:18,951][26022] Updated weights on worker 0-0, policy_version 846436 (0.00089) [2022-07-10 18:47:20,449][26022] Updated weights on worker 0-0, policy_version 846446 (0.00093) [2022-07-10 18:47:21,557][25689] Fps is (10 sec: 5412.9, 60 sec: 5536.3, 300 sec: 5540.4). Total num frames: 866764800. Throughput: 0: 4868.2. Samples: 866762926. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:21,558][25689] Avg episode reward: [(0, '-0.900')] [2022-07-10 18:47:22,638][26022] Updated weights on worker 0-0, policy_version 846456 (0.00080) [2022-07-10 18:47:24,168][26022] Updated weights on worker 0-0, policy_version 846466 (0.00088) [2022-07-10 18:47:26,303][26022] Updated weights on worker 0-0, policy_version 846476 (0.00851) [2022-07-10 18:47:26,576][25689] Fps is (10 sec: 5711.0, 60 sec: 5554.3, 300 sec: 5541.8). Total num frames: 866793472. Throughput: 0: 5812.6. Samples: 866796568. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:26,576][25689] Avg episode reward: [(0, '-0.202')] [2022-07-10 18:47:27,960][26022] Updated weights on worker 0-0, policy_version 846486 (0.00089) [2022-07-10 18:47:29,893][26022] Updated weights on worker 0-0, policy_version 846496 (0.00089) [2022-07-10 18:47:31,665][25689] Fps is (10 sec: 5572.0, 60 sec: 5537.3, 300 sec: 5537.2). Total num frames: 866821120. Throughput: 0: 5807.0. Samples: 866829746. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:31,665][25689] Avg episode reward: [(0, '-0.443')] [2022-07-10 18:47:31,746][26022] Updated weights on worker 0-0, policy_version 846506 (0.00092) [2022-07-10 18:47:33,523][26022] Updated weights on worker 0-0, policy_version 846516 (0.00096) [2022-07-10 18:47:35,191][26022] Updated weights on worker 0-0, policy_version 846526 (0.00094) [2022-07-10 18:47:36,684][25689] Fps is (10 sec: 5470.5, 60 sec: 5526.6, 300 sec: 5537.2). Total num frames: 866848768. Throughput: 0: 4987.7. Samples: 866846736. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:36,684][25689] Avg episode reward: [(0, '0.162')] [2022-07-10 18:47:37,338][26022] Updated weights on worker 0-0, policy_version 846536 (0.00089) [2022-07-10 18:47:38,903][26022] Updated weights on worker 0-0, policy_version 846546 (0.00092) [2022-07-10 18:47:40,903][26022] Updated weights on worker 0-0, policy_version 846556 (0.00088) [2022-07-10 18:47:41,698][25689] Fps is (10 sec: 5613.2, 60 sec: 5529.8, 300 sec: 5533.6). Total num frames: 866877440. Throughput: 0: 5811.7. Samples: 866879980. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:41,699][25689] Avg episode reward: [(0, '0.383')] [2022-07-10 18:47:42,613][26022] Updated weights on worker 0-0, policy_version 846566 (0.00089) [2022-07-10 18:47:44,544][26022] Updated weights on worker 0-0, policy_version 846576 (0.00095) [2022-07-10 18:47:46,556][26022] Updated weights on worker 0-0, policy_version 846586 (0.00089) [2022-07-10 18:47:46,732][25689] Fps is (10 sec: 5605.2, 60 sec: 5528.6, 300 sec: 5541.0). Total num frames: 866905088. Throughput: 0: 5806.2. Samples: 866913598. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:46,732][25689] Avg episode reward: [(0, '0.326')] [2022-07-10 18:47:48,243][26022] Updated weights on worker 0-0, policy_version 846596 (0.00088) [2022-07-10 18:47:50,199][26022] Updated weights on worker 0-0, policy_version 846606 (0.00095) [2022-07-10 18:47:51,789][25689] Fps is (10 sec: 5479.6, 60 sec: 5511.8, 300 sec: 5544.4). Total num frames: 866932736. Throughput: 0: 4999.3. Samples: 866930356. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:51,790][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 18:47:52,138][26022] Updated weights on worker 0-0, policy_version 846616 (0.00091) [2022-07-10 18:47:53,449][26022] Updated weights on worker 0-0, policy_version 846626 (0.00092) [2022-07-10 18:47:55,576][26022] Updated weights on worker 0-0, policy_version 846636 (0.00085) [2022-07-10 18:47:56,798][25689] Fps is (10 sec: 5696.7, 60 sec: 5529.1, 300 sec: 5541.4). Total num frames: 866962432. Throughput: 0: 5831.8. Samples: 866964036. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:47:56,798][25689] Avg episode reward: [(0, '0.255')] [2022-07-10 18:47:57,239][26022] Updated weights on worker 0-0, policy_version 846646 (0.00088) [2022-07-10 18:47:59,163][26022] Updated weights on worker 0-0, policy_version 846656 (0.00090) [2022-07-10 18:48:01,303][26022] Updated weights on worker 0-0, policy_version 846666 (0.00082) [2022-07-10 18:48:01,828][25689] Fps is (10 sec: 5406.3, 60 sec: 5510.7, 300 sec: 5541.0). Total num frames: 866987008. Throughput: 0: 5834.3. Samples: 866997424. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:01,829][25689] Avg episode reward: [(0, '0.770')] [2022-07-10 18:48:03,169][26022] Updated weights on worker 0-0, policy_version 846676 (0.00090) [2022-07-10 18:48:05,180][26022] Updated weights on worker 0-0, policy_version 846686 (0.00092) [2022-07-10 18:48:06,850][25689] Fps is (10 sec: 5195.0, 60 sec: 5527.6, 300 sec: 5538.5). Total num frames: 867014656. Throughput: 0: 4923.0. Samples: 867012642. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:06,851][25689] Avg episode reward: [(0, '-0.569')] [2022-07-10 18:48:06,991][26022] Updated weights on worker 0-0, policy_version 846696 (0.00080) [2022-07-10 18:48:08,682][26022] Updated weights on worker 0-0, policy_version 846706 (0.00082) [2022-07-10 18:48:10,727][26022] Updated weights on worker 0-0, policy_version 846716 (0.00086) [2022-07-10 18:48:11,903][25689] Fps is (10 sec: 5691.6, 60 sec: 5529.4, 300 sec: 5537.6). Total num frames: 867044352. Throughput: 0: 5763.4. Samples: 867046280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:11,905][25689] Avg episode reward: [(0, '-2.922')] [2022-07-10 18:48:12,273][26022] Updated weights on worker 0-0, policy_version 846726 (0.00083) [2022-07-10 18:48:14,210][26022] Updated weights on worker 0-0, policy_version 846736 (0.00088) [2022-07-10 18:48:16,199][26022] Updated weights on worker 0-0, policy_version 846746 (0.00088) [2022-07-10 18:48:16,909][25689] Fps is (10 sec: 5599.2, 60 sec: 5546.2, 300 sec: 5541.6). Total num frames: 867070976. Throughput: 0: 5751.4. Samples: 867079704. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:16,910][25689] Avg episode reward: [(0, '-3.261')] [2022-07-10 18:48:17,932][26022] Updated weights on worker 0-0, policy_version 846756 (0.00090) [2022-07-10 18:48:19,842][26022] Updated weights on worker 0-0, policy_version 846766 (0.00094) [2022-07-10 18:48:21,672][26022] Updated weights on worker 0-0, policy_version 846776 (0.00087) [2022-07-10 18:48:21,928][25689] Fps is (10 sec: 5516.1, 60 sec: 5546.5, 300 sec: 5538.6). Total num frames: 867099648. Throughput: 0: 4929.0. Samples: 867096496. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:21,931][25689] Avg episode reward: [(0, '-3.686')] [2022-07-10 18:48:23,390][26022] Updated weights on worker 0-0, policy_version 846786 (0.00095) [2022-07-10 18:48:25,354][26022] Updated weights on worker 0-0, policy_version 846796 (0.00094) [2022-07-10 18:48:26,949][25689] Fps is (10 sec: 5609.4, 60 sec: 5529.3, 300 sec: 5535.9). Total num frames: 867127296. Throughput: 0: 5847.7. Samples: 867130174. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:26,950][25689] Avg episode reward: [(0, '-4.435')] [2022-07-10 18:48:27,190][26022] Updated weights on worker 0-0, policy_version 846806 (0.00099) [2022-07-10 18:48:29,002][26022] Updated weights on worker 0-0, policy_version 846816 (0.00090) [2022-07-10 18:48:30,960][26022] Updated weights on worker 0-0, policy_version 846826 (0.00092) [2022-07-10 18:48:32,017][25689] Fps is (10 sec: 5582.3, 60 sec: 5548.2, 300 sec: 5545.2). Total num frames: 867155968. Throughput: 0: 5814.1. Samples: 867163222. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:32,018][25689] Avg episode reward: [(0, '-3.533')] [2022-07-10 18:48:32,815][26022] Updated weights on worker 0-0, policy_version 846836 (0.00090) [2022-07-10 18:48:34,401][26022] Updated weights on worker 0-0, policy_version 846846 (0.00089) [2022-07-10 18:48:36,560][26022] Updated weights on worker 0-0, policy_version 846856 (0.00088) [2022-07-10 18:48:37,032][25689] Fps is (10 sec: 5484.0, 60 sec: 5531.6, 300 sec: 5536.1). Total num frames: 867182592. Throughput: 0: 4986.8. Samples: 867180056. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:37,034][25689] Avg episode reward: [(0, '-2.341')] [2022-07-10 18:48:38,042][26022] Updated weights on worker 0-0, policy_version 846866 (0.00087) [2022-07-10 18:48:40,105][26022] Updated weights on worker 0-0, policy_version 846876 (0.00086) [2022-07-10 18:48:41,894][26022] Updated weights on worker 0-0, policy_version 846886 (0.00088) [2022-07-10 18:48:42,057][25689] Fps is (10 sec: 5507.7, 60 sec: 5530.7, 300 sec: 5536.7). Total num frames: 867211264. Throughput: 0: 5833.6. Samples: 867213918. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:42,057][25689] Avg episode reward: [(0, '-0.660')] [2022-07-10 18:48:43,650][26022] Updated weights on worker 0-0, policy_version 846896 (0.00093) [2022-07-10 18:48:45,525][26022] Updated weights on worker 0-0, policy_version 846906 (0.00088) [2022-07-10 18:48:47,091][25689] Fps is (10 sec: 5599.2, 60 sec: 5530.6, 300 sec: 5538.0). Total num frames: 867238912. Throughput: 0: 5825.2. Samples: 867247504. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:47,091][25689] Avg episode reward: [(0, '0.070')] [2022-07-10 18:48:47,444][26022] Updated weights on worker 0-0, policy_version 846916 (0.00097) [2022-07-10 18:48:49,036][26022] Updated weights on worker 0-0, policy_version 846926 (0.00087) [2022-07-10 18:48:51,187][26022] Updated weights on worker 0-0, policy_version 846936 (0.00433) [2022-07-10 18:48:52,142][25689] Fps is (10 sec: 5584.4, 60 sec: 5548.2, 300 sec: 5538.4). Total num frames: 867267584. Throughput: 0: 5018.2. Samples: 867264214. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:52,142][25689] Avg episode reward: [(0, '0.122')] [2022-07-10 18:48:53,134][26022] Updated weights on worker 0-0, policy_version 846946 (0.00083) [2022-07-10 18:48:54,835][26022] Updated weights on worker 0-0, policy_version 846956 (0.00095) [2022-07-10 18:48:55,517][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:48:55,529][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000846960_867287040.pth [2022-07-10 18:48:55,530][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000845012_865292288.pth [2022-07-10 18:48:56,827][26022] Updated weights on worker 0-0, policy_version 846966 (0.00862) [2022-07-10 18:48:57,149][25689] Fps is (10 sec: 5599.7, 60 sec: 5514.4, 300 sec: 5542.2). Total num frames: 867295232. Throughput: 0: 5834.8. Samples: 867297430. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:48:57,149][25689] Avg episode reward: [(0, '-0.113')] [2022-07-10 18:48:58,355][26022] Updated weights on worker 0-0, policy_version 846976 (0.00090) [2022-07-10 18:49:00,324][26022] Updated weights on worker 0-0, policy_version 846986 (0.00089) [2022-07-10 18:49:02,156][25689] Fps is (10 sec: 5317.4, 60 sec: 5533.5, 300 sec: 5538.8). Total num frames: 867320832. Throughput: 0: 5733.3. Samples: 867329152. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:49:02,157][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 18:49:02,702][26022] Updated weights on worker 0-0, policy_version 846996 (0.00094) [2022-07-10 18:49:04,264][26022] Updated weights on worker 0-0, policy_version 847006 (0.00090) [2022-07-10 18:49:06,340][26022] Updated weights on worker 0-0, policy_version 847016 (0.00099) [2022-07-10 18:49:07,183][25689] Fps is (10 sec: 5306.7, 60 sec: 5533.1, 300 sec: 5539.7). Total num frames: 867348480. Throughput: 0: 4876.8. Samples: 867345488. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:49:07,183][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 18:49:07,989][26022] Updated weights on worker 0-0, policy_version 847026 (0.00085) [2022-07-10 18:49:10,030][26022] Updated weights on worker 0-0, policy_version 847036 (0.00089) [2022-07-10 18:49:11,883][26022] Updated weights on worker 0-0, policy_version 847046 (0.00090) [2022-07-10 18:49:12,319][25689] Fps is (10 sec: 5541.8, 60 sec: 5508.5, 300 sec: 5534.4). Total num frames: 867377152. Throughput: 0: 5678.1. Samples: 867378780. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-10 18:49:12,319][25689] Avg episode reward: [(0, '-0.255')] [2022-07-10 18:49:13,495][26022] Updated weights on worker 0-0, policy_version 847056 (0.00084) [2022-07-10 18:49:15,590][26022] Updated weights on worker 0-0, policy_version 847066 (0.00096) [2022-07-10 18:49:17,352][25689] Fps is (10 sec: 5538.1, 60 sec: 5522.9, 300 sec: 5537.4). Total num frames: 867404800. Throughput: 0: 5673.8. Samples: 867412062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:17,353][25689] Avg episode reward: [(0, '-0.747')] [2022-07-10 18:49:17,521][26022] Updated weights on worker 0-0, policy_version 847076 (0.00094) [2022-07-10 18:49:19,102][26022] Updated weights on worker 0-0, policy_version 847086 (0.00090) [2022-07-10 18:49:21,107][26022] Updated weights on worker 0-0, policy_version 847096 (0.00089) [2022-07-10 18:49:22,386][25689] Fps is (10 sec: 5594.5, 60 sec: 5521.6, 300 sec: 5540.8). Total num frames: 867433472. Throughput: 0: 4923.3. Samples: 867428750. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:22,386][25689] Avg episode reward: [(0, '-0.355')] [2022-07-10 18:49:22,659][26022] Updated weights on worker 0-0, policy_version 847106 (0.00090) [2022-07-10 18:49:24,678][26022] Updated weights on worker 0-0, policy_version 847116 (0.00091) [2022-07-10 18:49:26,499][26022] Updated weights on worker 0-0, policy_version 847126 (0.00084) [2022-07-10 18:49:27,419][25689] Fps is (10 sec: 5594.5, 60 sec: 5520.5, 300 sec: 5532.1). Total num frames: 867461120. Throughput: 0: 5780.0. Samples: 867462456. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:27,420][25689] Avg episode reward: [(0, '0.381')] [2022-07-10 18:49:28,291][26022] Updated weights on worker 0-0, policy_version 847136 (0.00086) [2022-07-10 18:49:30,150][26022] Updated weights on worker 0-0, policy_version 847146 (0.00083) [2022-07-10 18:49:32,033][26022] Updated weights on worker 0-0, policy_version 847156 (0.00084) [2022-07-10 18:49:32,518][25689] Fps is (10 sec: 5558.5, 60 sec: 5517.7, 300 sec: 5535.0). Total num frames: 867489792. Throughput: 0: 5800.8. Samples: 867495952. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:32,519][25689] Avg episode reward: [(0, '0.295')] [2022-07-10 18:49:33,827][26022] Updated weights on worker 0-0, policy_version 847166 (0.00079) [2022-07-10 18:49:35,696][26022] Updated weights on worker 0-0, policy_version 847176 (0.00091) [2022-07-10 18:49:37,521][25689] Fps is (10 sec: 5575.4, 60 sec: 5535.7, 300 sec: 5539.2). Total num frames: 867517440. Throughput: 0: 4990.8. Samples: 867512720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:37,521][25689] Avg episode reward: [(0, '0.042')] [2022-07-10 18:49:37,568][26022] Updated weights on worker 0-0, policy_version 847186 (0.00086) [2022-07-10 18:49:39,176][26022] Updated weights on worker 0-0, policy_version 847196 (0.00085) [2022-07-10 18:49:41,463][26022] Updated weights on worker 0-0, policy_version 847206 (0.00090) [2022-07-10 18:49:42,595][25689] Fps is (10 sec: 5690.8, 60 sec: 5548.1, 300 sec: 5538.4). Total num frames: 867547136. Throughput: 0: 5794.1. Samples: 867545842. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:42,595][25689] Avg episode reward: [(0, '-0.226')] [2022-07-10 18:49:42,785][26022] Updated weights on worker 0-0, policy_version 847216 (0.00091) [2022-07-10 18:49:45,045][26022] Updated weights on worker 0-0, policy_version 847226 (0.00087) [2022-07-10 18:49:46,691][26022] Updated weights on worker 0-0, policy_version 847236 (0.00094) [2022-07-10 18:49:47,638][25689] Fps is (10 sec: 5465.8, 60 sec: 5513.5, 300 sec: 5529.2). Total num frames: 867572736. Throughput: 0: 5779.5. Samples: 867579308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:47,638][25689] Avg episode reward: [(0, '0.086')] [2022-07-10 18:49:48,415][26022] Updated weights on worker 0-0, policy_version 847246 (0.00086) [2022-07-10 18:49:50,561][26022] Updated weights on worker 0-0, policy_version 847256 (0.00086) [2022-07-10 18:49:52,138][26022] Updated weights on worker 0-0, policy_version 847266 (0.00088) [2022-07-10 18:49:52,711][25689] Fps is (10 sec: 5466.3, 60 sec: 5528.4, 300 sec: 5535.0). Total num frames: 867602432. Throughput: 0: 4958.0. Samples: 867596066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:52,711][25689] Avg episode reward: [(0, '0.228')] [2022-07-10 18:49:54,161][26022] Updated weights on worker 0-0, policy_version 847276 (0.00087) [2022-07-10 18:49:55,831][26022] Updated weights on worker 0-0, policy_version 847286 (0.00117) [2022-07-10 18:49:57,716][25689] Fps is (10 sec: 5689.5, 60 sec: 5528.5, 300 sec: 5535.0). Total num frames: 867630080. Throughput: 0: 5801.7. Samples: 867629890. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:49:57,717][25689] Avg episode reward: [(0, '0.209')] [2022-07-10 18:49:57,803][26022] Updated weights on worker 0-0, policy_version 847296 (0.00095) [2022-07-10 18:49:59,641][26022] Updated weights on worker 0-0, policy_version 847306 (0.00095) [2022-07-10 18:50:01,376][26022] Updated weights on worker 0-0, policy_version 847316 (0.00090) [2022-07-10 18:50:02,763][25689] Fps is (10 sec: 5297.0, 60 sec: 5524.9, 300 sec: 5534.4). Total num frames: 867655680. Throughput: 0: 5813.6. Samples: 867663092. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:02,763][25689] Avg episode reward: [(0, '-0.328')] [2022-07-10 18:50:03,650][26022] Updated weights on worker 0-0, policy_version 847326 (0.00093) [2022-07-10 18:50:05,660][26022] Updated weights on worker 0-0, policy_version 847336 (0.00089) [2022-07-10 18:50:07,384][26022] Updated weights on worker 0-0, policy_version 847346 (0.00080) [2022-07-10 18:50:07,772][25689] Fps is (10 sec: 5397.3, 60 sec: 5543.4, 300 sec: 5532.8). Total num frames: 867684352. Throughput: 0: 4889.7. Samples: 867677758. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:07,772][25689] Avg episode reward: [(0, '0.239')] [2022-07-10 18:50:09,228][26022] Updated weights on worker 0-0, policy_version 847356 (0.00089) [2022-07-10 18:50:10,962][26022] Updated weights on worker 0-0, policy_version 847366 (0.00089) [2022-07-10 18:50:12,706][26022] Updated weights on worker 0-0, policy_version 847376 (0.00082) [2022-07-10 18:50:12,886][25689] Fps is (10 sec: 5664.4, 60 sec: 5545.4, 300 sec: 5532.0). Total num frames: 867713024. Throughput: 0: 5717.8. Samples: 867711428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:12,887][25689] Avg episode reward: [(0, '-0.094')] [2022-07-10 18:50:14,711][26022] Updated weights on worker 0-0, policy_version 847386 (0.00083) [2022-07-10 18:50:16,487][26022] Updated weights on worker 0-0, policy_version 847396 (0.00086) [2022-07-10 18:50:17,909][25689] Fps is (10 sec: 5656.6, 60 sec: 5563.3, 300 sec: 5539.6). Total num frames: 867741696. Throughput: 0: 5705.7. Samples: 867745106. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:17,910][25689] Avg episode reward: [(0, '-0.081')] [2022-07-10 18:50:18,439][26022] Updated weights on worker 0-0, policy_version 847406 (0.00085) [2022-07-10 18:50:20,256][26022] Updated weights on worker 0-0, policy_version 847416 (0.00105) [2022-07-10 18:50:22,002][26022] Updated weights on worker 0-0, policy_version 847426 (0.00086) [2022-07-10 18:50:22,919][25689] Fps is (10 sec: 5614.0, 60 sec: 5548.6, 300 sec: 5533.1). Total num frames: 867769344. Throughput: 0: 4893.1. Samples: 867761716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:22,919][25689] Avg episode reward: [(0, '-0.160')] [2022-07-10 18:50:23,919][26022] Updated weights on worker 0-0, policy_version 847436 (0.00084) [2022-07-10 18:50:25,781][26022] Updated weights on worker 0-0, policy_version 847446 (0.00085) [2022-07-10 18:50:27,479][26022] Updated weights on worker 0-0, policy_version 847456 (0.00093) [2022-07-10 18:50:27,951][25689] Fps is (10 sec: 5506.7, 60 sec: 5548.7, 300 sec: 5534.5). Total num frames: 867796992. Throughput: 0: 5817.1. Samples: 867795142. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:27,952][25689] Avg episode reward: [(0, '0.014')] [2022-07-10 18:50:29,463][26022] Updated weights on worker 0-0, policy_version 847466 (0.00096) [2022-07-10 18:50:31,320][26022] Updated weights on worker 0-0, policy_version 847476 (0.00098) [2022-07-10 18:50:33,060][25689] Fps is (10 sec: 5553.5, 60 sec: 5547.8, 300 sec: 5533.3). Total num frames: 867825664. Throughput: 0: 5803.7. Samples: 867828508. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:33,060][25689] Avg episode reward: [(0, '0.206')] [2022-07-10 18:50:33,061][26022] Updated weights on worker 0-0, policy_version 847486 (0.00096) [2022-07-10 18:50:34,912][26022] Updated weights on worker 0-0, policy_version 847496 (0.00084) [2022-07-10 18:50:36,543][26022] Updated weights on worker 0-0, policy_version 847506 (0.00082) [2022-07-10 18:50:38,071][25689] Fps is (10 sec: 5565.4, 60 sec: 5547.0, 300 sec: 5530.5). Total num frames: 867853312. Throughput: 0: 5814.4. Samples: 867862332. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:38,071][25689] Avg episode reward: [(0, '-0.461')] [2022-07-10 18:50:38,572][26022] Updated weights on worker 0-0, policy_version 847516 (0.00110) [2022-07-10 18:50:40,174][26022] Updated weights on worker 0-0, policy_version 847526 (0.00085) [2022-07-10 18:50:42,290][26022] Updated weights on worker 0-0, policy_version 847536 (0.00077) [2022-07-10 18:50:43,094][25689] Fps is (10 sec: 5613.1, 60 sec: 5534.8, 300 sec: 5534.9). Total num frames: 867881984. Throughput: 0: 5815.0. Samples: 867879034. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:43,094][25689] Avg episode reward: [(0, '-1.092')] [2022-07-10 18:50:43,990][26022] Updated weights on worker 0-0, policy_version 847546 (0.00084) [2022-07-10 18:50:45,872][26022] Updated weights on worker 0-0, policy_version 847556 (0.00084) [2022-07-10 18:50:47,781][26022] Updated weights on worker 0-0, policy_version 847566 (0.00085) [2022-07-10 18:50:48,109][25689] Fps is (10 sec: 5508.6, 60 sec: 5554.2, 300 sec: 5533.3). Total num frames: 867908608. Throughput: 0: 5832.6. Samples: 867912714. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:48,109][25689] Avg episode reward: [(0, '-0.589')] [2022-07-10 18:50:49,579][26022] Updated weights on worker 0-0, policy_version 847576 (0.00092) [2022-07-10 18:50:51,426][26022] Updated weights on worker 0-0, policy_version 847586 (0.00087) [2022-07-10 18:50:53,192][25689] Fps is (10 sec: 5577.2, 60 sec: 5553.3, 300 sec: 5539.8). Total num frames: 867938304. Throughput: 0: 5846.1. Samples: 867946202. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:53,192][25689] Avg episode reward: [(0, '-1.692')] [2022-07-10 18:50:53,193][26022] Updated weights on worker 0-0, policy_version 847596 (0.00086) [2022-07-10 18:50:55,198][26022] Updated weights on worker 0-0, policy_version 847606 (0.00092) [2022-07-10 18:50:55,849][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:50:55,866][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000847610_867952640.pth [2022-07-10 18:50:55,867][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000845661_865956864.pth [2022-07-10 18:50:56,878][26022] Updated weights on worker 0-0, policy_version 847616 (0.00100) [2022-07-10 18:50:58,241][25689] Fps is (10 sec: 5558.3, 60 sec: 5532.4, 300 sec: 5533.0). Total num frames: 867964928. Throughput: 0: 4992.8. Samples: 867963040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:50:58,242][25689] Avg episode reward: [(0, '-1.502')] [2022-07-10 18:50:58,712][26022] Updated weights on worker 0-0, policy_version 847626 (0.00089) [2022-07-10 18:51:00,568][26022] Updated weights on worker 0-0, policy_version 847636 (0.00092) [2022-07-10 18:51:02,628][26022] Updated weights on worker 0-0, policy_version 847646 (0.00091) [2022-07-10 18:51:03,250][25689] Fps is (10 sec: 5293.9, 60 sec: 5552.8, 300 sec: 5532.9). Total num frames: 867991552. Throughput: 0: 5800.9. Samples: 867995960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:03,252][25689] Avg episode reward: [(0, '-1.590')] [2022-07-10 18:51:04,727][26022] Updated weights on worker 0-0, policy_version 847656 (0.00097) [2022-07-10 18:51:06,400][26022] Updated weights on worker 0-0, policy_version 847666 (0.00091) [2022-07-10 18:51:08,255][25689] Fps is (10 sec: 5420.0, 60 sec: 5536.3, 300 sec: 5535.3). Total num frames: 868019200. Throughput: 0: 5705.2. Samples: 868027650. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:08,255][25689] Avg episode reward: [(0, '-1.547')] [2022-07-10 18:51:08,459][26022] Updated weights on worker 0-0, policy_version 847676 (0.00089) [2022-07-10 18:51:10,212][26022] Updated weights on worker 0-0, policy_version 847686 (0.00087) [2022-07-10 18:51:11,989][26022] Updated weights on worker 0-0, policy_version 847696 (0.00088) [2022-07-10 18:51:13,319][25689] Fps is (10 sec: 5593.3, 60 sec: 5540.8, 300 sec: 5535.5). Total num frames: 868047872. Throughput: 0: 4876.4. Samples: 868044352. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:13,320][25689] Avg episode reward: [(0, '-1.516')] [2022-07-10 18:51:13,897][26022] Updated weights on worker 0-0, policy_version 847706 (0.00089) [2022-07-10 18:51:15,803][26022] Updated weights on worker 0-0, policy_version 847716 (0.00087) [2022-07-10 18:51:17,556][26022] Updated weights on worker 0-0, policy_version 847726 (0.00090) [2022-07-10 18:51:18,325][25689] Fps is (10 sec: 5592.5, 60 sec: 5525.4, 300 sec: 5536.0). Total num frames: 868075520. Throughput: 0: 5716.5. Samples: 868077850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:18,327][25689] Avg episode reward: [(0, '-1.275')] [2022-07-10 18:51:19,273][26022] Updated weights on worker 0-0, policy_version 847736 (0.00083) [2022-07-10 18:51:21,368][26022] Updated weights on worker 0-0, policy_version 847746 (0.00092) [2022-07-10 18:51:23,145][26022] Updated weights on worker 0-0, policy_version 847756 (0.00091) [2022-07-10 18:51:23,330][25689] Fps is (10 sec: 5523.6, 60 sec: 5525.8, 300 sec: 5536.4). Total num frames: 868103168. Throughput: 0: 5724.7. Samples: 868110912. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:23,331][25689] Avg episode reward: [(0, '-0.764')] [2022-07-10 18:51:24,997][26022] Updated weights on worker 0-0, policy_version 847766 (0.00086) [2022-07-10 18:51:26,912][26022] Updated weights on worker 0-0, policy_version 847776 (0.00085) [2022-07-10 18:51:28,371][25689] Fps is (10 sec: 5402.6, 60 sec: 5508.1, 300 sec: 5530.4). Total num frames: 868129792. Throughput: 0: 4955.2. Samples: 868127330. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:28,371][25689] Avg episode reward: [(0, '-1.266')] [2022-07-10 18:51:28,726][26022] Updated weights on worker 0-0, policy_version 847786 (0.00092) [2022-07-10 18:51:30,519][26022] Updated weights on worker 0-0, policy_version 847796 (0.00088) [2022-07-10 18:51:32,443][26022] Updated weights on worker 0-0, policy_version 847806 (0.00086) [2022-07-10 18:51:33,449][25689] Fps is (10 sec: 5464.6, 60 sec: 5510.9, 300 sec: 5530.6). Total num frames: 868158464. Throughput: 0: 5778.5. Samples: 868160672. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:33,451][25689] Avg episode reward: [(0, '-0.458')] [2022-07-10 18:51:34,204][26022] Updated weights on worker 0-0, policy_version 847816 (0.00107) [2022-07-10 18:51:35,973][26022] Updated weights on worker 0-0, policy_version 847826 (0.00088) [2022-07-10 18:51:37,704][26022] Updated weights on worker 0-0, policy_version 847836 (0.00087) [2022-07-10 18:51:38,483][25689] Fps is (10 sec: 5771.9, 60 sec: 5542.7, 300 sec: 5534.3). Total num frames: 868188160. Throughput: 0: 5784.0. Samples: 868194444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:38,484][25689] Avg episode reward: [(0, '0.177')] [2022-07-10 18:51:39,791][26022] Updated weights on worker 0-0, policy_version 847846 (0.00093) [2022-07-10 18:51:41,392][26022] Updated weights on worker 0-0, policy_version 847856 (0.00090) [2022-07-10 18:51:43,505][25689] Fps is (10 sec: 5498.8, 60 sec: 5491.9, 300 sec: 5527.4). Total num frames: 868213760. Throughput: 0: 4969.4. Samples: 868211170. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:43,506][25689] Avg episode reward: [(0, '0.090')] [2022-07-10 18:51:43,564][26022] Updated weights on worker 0-0, policy_version 847866 (0.00091) [2022-07-10 18:51:44,949][26022] Updated weights on worker 0-0, policy_version 847876 (0.00089) [2022-07-10 18:51:47,049][26022] Updated weights on worker 0-0, policy_version 847886 (0.00085) [2022-07-10 18:51:48,531][25689] Fps is (10 sec: 5605.4, 60 sec: 5558.8, 300 sec: 5534.9). Total num frames: 868244480. Throughput: 0: 5829.9. Samples: 868244860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:48,532][25689] Avg episode reward: [(0, '0.193')] [2022-07-10 18:51:48,741][26022] Updated weights on worker 0-0, policy_version 847896 (0.00096) [2022-07-10 18:51:50,639][26022] Updated weights on worker 0-0, policy_version 847906 (0.00086) [2022-07-10 18:51:52,598][26022] Updated weights on worker 0-0, policy_version 847916 (0.00094) [2022-07-10 18:51:53,596][25689] Fps is (10 sec: 5682.7, 60 sec: 5509.5, 300 sec: 5527.0). Total num frames: 868271104. Throughput: 0: 5839.0. Samples: 868278310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:53,597][25689] Avg episode reward: [(0, '-0.097')] [2022-07-10 18:51:54,380][26022] Updated weights on worker 0-0, policy_version 847926 (0.00078) [2022-07-10 18:51:56,137][26022] Updated weights on worker 0-0, policy_version 847936 (0.00090) [2022-07-10 18:51:58,034][26022] Updated weights on worker 0-0, policy_version 847946 (0.00089) [2022-07-10 18:51:58,615][25689] Fps is (10 sec: 5382.0, 60 sec: 5529.3, 300 sec: 5533.9). Total num frames: 868298752. Throughput: 0: 4994.3. Samples: 868294986. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:51:58,615][25689] Avg episode reward: [(0, '-0.168')] [2022-07-10 18:51:59,851][26022] Updated weights on worker 0-0, policy_version 847956 (0.00093) [2022-07-10 18:52:01,861][26022] Updated weights on worker 0-0, policy_version 847966 (0.00100) [2022-07-10 18:52:03,621][25689] Fps is (10 sec: 5618.2, 60 sec: 5563.5, 300 sec: 5541.1). Total num frames: 868327424. Throughput: 0: 5761.3. Samples: 868327062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:03,621][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 18:52:03,633][26022] Updated weights on worker 0-0, policy_version 847976 (0.00093) [2022-07-10 18:52:05,943][26022] Updated weights on worker 0-0, policy_version 847986 (0.00086) [2022-07-10 18:52:07,409][26022] Updated weights on worker 0-0, policy_version 847996 (0.00090) [2022-07-10 18:52:08,627][25689] Fps is (10 sec: 5420.7, 60 sec: 5529.4, 300 sec: 5528.5). Total num frames: 868353024. Throughput: 0: 5734.7. Samples: 868360104. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:08,627][25689] Avg episode reward: [(0, '-0.678')] [2022-07-10 18:52:09,671][26022] Updated weights on worker 0-0, policy_version 848006 (0.00093) [2022-07-10 18:52:11,320][26022] Updated weights on worker 0-0, policy_version 848016 (0.00096) [2022-07-10 18:52:13,315][26022] Updated weights on worker 0-0, policy_version 848026 (0.00077) [2022-07-10 18:52:13,771][25689] Fps is (10 sec: 5347.0, 60 sec: 5522.2, 300 sec: 5536.2). Total num frames: 868381696. Throughput: 0: 4861.6. Samples: 868376390. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:13,771][25689] Avg episode reward: [(0, '-0.963')] [2022-07-10 18:52:15,006][26022] Updated weights on worker 0-0, policy_version 848036 (0.00087) [2022-07-10 18:52:16,947][26022] Updated weights on worker 0-0, policy_version 848046 (0.00086) [2022-07-10 18:52:18,740][26022] Updated weights on worker 0-0, policy_version 848056 (0.00091) [2022-07-10 18:52:18,775][25689] Fps is (10 sec: 5549.6, 60 sec: 5522.3, 300 sec: 5533.1). Total num frames: 868409344. Throughput: 0: 5696.2. Samples: 868409824. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:18,776][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 18:52:20,499][26022] Updated weights on worker 0-0, policy_version 848066 (0.00092) [2022-07-10 18:52:22,423][26022] Updated weights on worker 0-0, policy_version 848076 (0.00094) [2022-07-10 18:52:23,799][25689] Fps is (10 sec: 5514.0, 60 sec: 5520.6, 300 sec: 5529.6). Total num frames: 868436992. Throughput: 0: 5746.5. Samples: 868443016. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:23,800][25689] Avg episode reward: [(0, '-0.795')] [2022-07-10 18:52:24,186][26022] Updated weights on worker 0-0, policy_version 848086 (0.00085) [2022-07-10 18:52:26,034][26022] Updated weights on worker 0-0, policy_version 848096 (0.00091) [2022-07-10 18:52:27,887][26022] Updated weights on worker 0-0, policy_version 848106 (0.00095) [2022-07-10 18:52:28,801][25689] Fps is (10 sec: 5515.4, 60 sec: 5541.1, 300 sec: 5531.2). Total num frames: 868464640. Throughput: 0: 4933.9. Samples: 868459642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:28,801][25689] Avg episode reward: [(0, '-0.320')] [2022-07-10 18:52:29,539][26022] Updated weights on worker 0-0, policy_version 848116 (0.00088) [2022-07-10 18:52:31,852][26022] Updated weights on worker 0-0, policy_version 848126 (0.00089) [2022-07-10 18:52:33,435][26022] Updated weights on worker 0-0, policy_version 848136 (0.00081) [2022-07-10 18:52:33,839][25689] Fps is (10 sec: 5507.7, 60 sec: 5527.8, 300 sec: 5530.9). Total num frames: 868492288. Throughput: 0: 5815.4. Samples: 868493094. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:33,841][25689] Avg episode reward: [(0, '0.071')] [2022-07-10 18:52:35,340][26022] Updated weights on worker 0-0, policy_version 848146 (0.00087) [2022-07-10 18:52:37,171][26022] Updated weights on worker 0-0, policy_version 848156 (0.00098) [2022-07-10 18:52:38,906][25689] Fps is (10 sec: 5573.3, 60 sec: 5507.8, 300 sec: 5529.9). Total num frames: 868520960. Throughput: 0: 5800.9. Samples: 868526602. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:38,907][25689] Avg episode reward: [(0, '-0.034')] [2022-07-10 18:52:38,953][26022] Updated weights on worker 0-0, policy_version 848166 (0.00095) [2022-07-10 18:52:41,013][26022] Updated weights on worker 0-0, policy_version 848176 (0.00105) [2022-07-10 18:52:42,802][26022] Updated weights on worker 0-0, policy_version 848186 (0.00093) [2022-07-10 18:52:43,967][25689] Fps is (10 sec: 5560.7, 60 sec: 5538.2, 300 sec: 5529.4). Total num frames: 868548608. Throughput: 0: 4967.1. Samples: 868543192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:43,967][25689] Avg episode reward: [(0, '-0.279')] [2022-07-10 18:52:44,609][26022] Updated weights on worker 0-0, policy_version 848196 (0.00095) [2022-07-10 18:52:46,326][26022] Updated weights on worker 0-0, policy_version 848206 (0.00080) [2022-07-10 18:52:48,223][26022] Updated weights on worker 0-0, policy_version 848216 (0.00089) [2022-07-10 18:52:49,033][25689] Fps is (10 sec: 5662.6, 60 sec: 5517.6, 300 sec: 5536.1). Total num frames: 868578304. Throughput: 0: 5792.7. Samples: 868576840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-10 18:52:49,033][25689] Avg episode reward: [(0, '-0.043')] [2022-07-10 18:52:50,095][26022] Updated weights on worker 0-0, policy_version 848226 (0.00087) [2022-07-10 18:52:51,938][26022] Updated weights on worker 0-0, policy_version 848236 (0.00085) [2022-07-10 18:52:53,696][26022] Updated weights on worker 0-0, policy_version 848246 (0.00083) [2022-07-10 18:52:54,148][25689] Fps is (10 sec: 5532.0, 60 sec: 5513.0, 300 sec: 5523.8). Total num frames: 868604928. Throughput: 0: 5753.7. Samples: 868609946. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:52:54,148][25689] Avg episode reward: [(0, '-0.726')] [2022-07-10 18:52:55,568][26022] Updated weights on worker 0-0, policy_version 848256 (0.00090) [2022-07-10 18:52:56,108][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:52:56,117][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000848259_868617216.pth [2022-07-10 18:52:56,118][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000846310_866621440.pth [2022-07-10 18:52:57,676][26022] Updated weights on worker 0-0, policy_version 848266 (0.00091) [2022-07-10 18:52:59,199][25689] Fps is (10 sec: 5439.4, 60 sec: 5527.0, 300 sec: 5537.2). Total num frames: 868633600. Throughput: 0: 5766.0. Samples: 868643610. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:52:59,199][25689] Avg episode reward: [(0, '-0.462')] [2022-07-10 18:52:59,292][26022] Updated weights on worker 0-0, policy_version 848276 (0.00094) [2022-07-10 18:53:01,031][26022] Updated weights on worker 0-0, policy_version 848286 (0.00095) [2022-07-10 18:53:03,407][26022] Updated weights on worker 0-0, policy_version 848296 (0.00085) [2022-07-10 18:53:04,236][25689] Fps is (10 sec: 5379.8, 60 sec: 5473.5, 300 sec: 5530.0). Total num frames: 868659200. Throughput: 0: 5676.4. Samples: 868658246. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:04,236][25689] Avg episode reward: [(0, '-0.530')] [2022-07-10 18:53:05,111][26022] Updated weights on worker 0-0, policy_version 848306 (0.00079) [2022-07-10 18:53:07,051][26022] Updated weights on worker 0-0, policy_version 848316 (0.00080) [2022-07-10 18:53:08,778][26022] Updated weights on worker 0-0, policy_version 848326 (0.00085) [2022-07-10 18:53:09,249][25689] Fps is (10 sec: 5399.9, 60 sec: 5523.5, 300 sec: 5527.3). Total num frames: 868687872. Throughput: 0: 5685.9. Samples: 868691788. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:09,253][25689] Avg episode reward: [(0, '-0.569')] [2022-07-10 18:53:10,719][26022] Updated weights on worker 0-0, policy_version 848336 (0.00098) [2022-07-10 18:53:12,501][26022] Updated weights on worker 0-0, policy_version 848346 (0.00084) [2022-07-10 18:53:14,322][25689] Fps is (10 sec: 5583.7, 60 sec: 5513.0, 300 sec: 5529.5). Total num frames: 868715520. Throughput: 0: 5720.4. Samples: 868725354. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:14,324][25689] Avg episode reward: [(0, '-0.302')] [2022-07-10 18:53:14,429][26022] Updated weights on worker 0-0, policy_version 848356 (0.00083) [2022-07-10 18:53:16,218][26022] Updated weights on worker 0-0, policy_version 848366 (0.00088) [2022-07-10 18:53:18,071][26022] Updated weights on worker 0-0, policy_version 848376 (0.00084) [2022-07-10 18:53:19,354][25689] Fps is (10 sec: 5573.4, 60 sec: 5527.4, 300 sec: 5529.2). Total num frames: 868744192. Throughput: 0: 4882.3. Samples: 868742016. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:19,355][25689] Avg episode reward: [(0, '-0.518')] [2022-07-10 18:53:19,893][26022] Updated weights on worker 0-0, policy_version 848386 (0.00619) [2022-07-10 18:53:21,704][26022] Updated weights on worker 0-0, policy_version 848396 (0.00085) [2022-07-10 18:53:23,540][26022] Updated weights on worker 0-0, policy_version 848406 (0.00086) [2022-07-10 18:53:24,384][25689] Fps is (10 sec: 5699.1, 60 sec: 5543.8, 300 sec: 5532.5). Total num frames: 868772864. Throughput: 0: 5823.4. Samples: 868775578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:24,386][25689] Avg episode reward: [(0, '0.514')] [2022-07-10 18:53:25,449][26022] Updated weights on worker 0-0, policy_version 848416 (0.00086) [2022-07-10 18:53:27,256][26022] Updated weights on worker 0-0, policy_version 848426 (0.00084) [2022-07-10 18:53:29,039][26022] Updated weights on worker 0-0, policy_version 848436 (0.00086) [2022-07-10 18:53:29,397][25689] Fps is (10 sec: 5506.0, 60 sec: 5525.9, 300 sec: 5526.7). Total num frames: 868799488. Throughput: 0: 5819.9. Samples: 868809046. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:29,398][25689] Avg episode reward: [(0, '0.091')] [2022-07-10 18:53:30,948][26022] Updated weights on worker 0-0, policy_version 848446 (0.00981) [2022-07-10 18:53:32,736][26022] Updated weights on worker 0-0, policy_version 848456 (0.00090) [2022-07-10 18:53:34,461][25689] Fps is (10 sec: 5487.7, 60 sec: 5540.4, 300 sec: 5532.7). Total num frames: 868828160. Throughput: 0: 4975.9. Samples: 868825560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:34,461][25689] Avg episode reward: [(0, '0.106')] [2022-07-10 18:53:34,661][26022] Updated weights on worker 0-0, policy_version 848466 (0.00092) [2022-07-10 18:53:36,426][26022] Updated weights on worker 0-0, policy_version 848476 (0.00089) [2022-07-10 18:53:38,288][26022] Updated weights on worker 0-0, policy_version 848486 (0.00099) [2022-07-10 18:53:39,551][25689] Fps is (10 sec: 5546.8, 60 sec: 5521.5, 300 sec: 5528.0). Total num frames: 868855808. Throughput: 0: 5802.0. Samples: 868859196. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:39,551][25689] Avg episode reward: [(0, '0.331')] [2022-07-10 18:53:40,166][26022] Updated weights on worker 0-0, policy_version 848496 (0.00089) [2022-07-10 18:53:41,966][26022] Updated weights on worker 0-0, policy_version 848506 (0.00084) [2022-07-10 18:53:43,770][26022] Updated weights on worker 0-0, policy_version 848516 (0.00088) [2022-07-10 18:53:44,567][25689] Fps is (10 sec: 5572.8, 60 sec: 5542.5, 300 sec: 5531.8). Total num frames: 868884480. Throughput: 0: 5789.3. Samples: 868892420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:44,569][25689] Avg episode reward: [(0, '0.274')] [2022-07-10 18:53:45,672][26022] Updated weights on worker 0-0, policy_version 848526 (0.00089) [2022-07-10 18:53:47,368][26022] Updated weights on worker 0-0, policy_version 848536 (0.00092) [2022-07-10 18:53:49,442][26022] Updated weights on worker 0-0, policy_version 848546 (0.00080) [2022-07-10 18:53:49,575][25689] Fps is (10 sec: 5618.3, 60 sec: 5513.9, 300 sec: 5529.1). Total num frames: 868912128. Throughput: 0: 4968.0. Samples: 868909290. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:49,577][25689] Avg episode reward: [(0, '0.200')] [2022-07-10 18:53:51,032][26022] Updated weights on worker 0-0, policy_version 848556 (0.00100) [2022-07-10 18:53:53,277][26022] Updated weights on worker 0-0, policy_version 848566 (0.00091) [2022-07-10 18:53:54,633][25689] Fps is (10 sec: 5493.0, 60 sec: 5536.0, 300 sec: 5528.2). Total num frames: 868939776. Throughput: 0: 5784.8. Samples: 868942256. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:54,634][25689] Avg episode reward: [(0, '0.221')] [2022-07-10 18:53:54,975][26022] Updated weights on worker 0-0, policy_version 848576 (0.00090) [2022-07-10 18:53:56,790][26022] Updated weights on worker 0-0, policy_version 848586 (0.00098) [2022-07-10 18:53:58,643][26022] Updated weights on worker 0-0, policy_version 848596 (0.00083) [2022-07-10 18:53:59,653][25689] Fps is (10 sec: 5588.6, 60 sec: 5538.9, 300 sec: 5538.3). Total num frames: 868968448. Throughput: 0: 5790.3. Samples: 868975592. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:53:59,653][25689] Avg episode reward: [(0, '0.458')] [2022-07-10 18:54:00,299][26022] Updated weights on worker 0-0, policy_version 848606 (0.00087) [2022-07-10 18:54:02,714][26022] Updated weights on worker 0-0, policy_version 848616 (0.00085) [2022-07-10 18:54:04,600][26022] Updated weights on worker 0-0, policy_version 848626 (0.00098) [2022-07-10 18:54:04,686][25689] Fps is (10 sec: 5296.8, 60 sec: 5522.3, 300 sec: 5527.8). Total num frames: 868993024. Throughput: 0: 4867.5. Samples: 868990352. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:04,687][25689] Avg episode reward: [(0, '0.648')] [2022-07-10 18:54:06,237][26022] Updated weights on worker 0-0, policy_version 848636 (0.00084) [2022-07-10 18:54:08,144][26022] Updated weights on worker 0-0, policy_version 848646 (0.00088) [2022-07-10 18:54:09,724][25689] Fps is (10 sec: 5388.8, 60 sec: 5537.0, 300 sec: 5533.1). Total num frames: 869022720. Throughput: 0: 5675.5. Samples: 869023644. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:09,724][25689] Avg episode reward: [(0, '0.482')] [2022-07-10 18:54:09,919][26022] Updated weights on worker 0-0, policy_version 848656 (0.00086) [2022-07-10 18:54:11,795][26022] Updated weights on worker 0-0, policy_version 848666 (0.00089) [2022-07-10 18:54:13,787][26022] Updated weights on worker 0-0, policy_version 848676 (0.00080) [2022-07-10 18:54:14,826][25689] Fps is (10 sec: 5756.6, 60 sec: 5551.3, 300 sec: 5535.3). Total num frames: 869051392. Throughput: 0: 5704.5. Samples: 869057442. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:14,826][25689] Avg episode reward: [(0, '0.665')] [2022-07-10 18:54:15,130][26022] Updated weights on worker 0-0, policy_version 848686 (0.00089) [2022-07-10 18:54:17,232][26022] Updated weights on worker 0-0, policy_version 848696 (0.00097) [2022-07-10 18:54:18,960][26022] Updated weights on worker 0-0, policy_version 848706 (0.00092) [2022-07-10 18:54:19,886][25689] Fps is (10 sec: 5441.3, 60 sec: 5514.8, 300 sec: 5527.9). Total num frames: 869078016. Throughput: 0: 4884.7. Samples: 869074426. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:19,887][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 18:54:20,824][26022] Updated weights on worker 0-0, policy_version 848716 (0.00088) [2022-07-10 18:54:22,976][26022] Updated weights on worker 0-0, policy_version 848726 (0.00087) [2022-07-10 18:54:24,550][26022] Updated weights on worker 0-0, policy_version 848736 (0.00094) [2022-07-10 18:54:24,907][25689] Fps is (10 sec: 5485.1, 60 sec: 5515.7, 300 sec: 5531.6). Total num frames: 869106688. Throughput: 0: 5825.2. Samples: 869108140. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:24,907][25689] Avg episode reward: [(0, '0.465')] [2022-07-10 18:54:26,222][26022] Updated weights on worker 0-0, policy_version 848746 (0.00084) [2022-07-10 18:54:28,315][26022] Updated weights on worker 0-0, policy_version 848756 (0.00091) [2022-07-10 18:54:29,921][25689] Fps is (10 sec: 5816.6, 60 sec: 5566.3, 300 sec: 5536.6). Total num frames: 869136384. Throughput: 0: 5839.2. Samples: 869141578. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:29,922][25689] Avg episode reward: [(0, '0.378')] [2022-07-10 18:54:29,987][26022] Updated weights on worker 0-0, policy_version 848767 (0.00081) [2022-07-10 18:54:32,207][26022] Updated weights on worker 0-0, policy_version 848777 (0.00091) [2022-07-10 18:54:34,136][26022] Updated weights on worker 0-0, policy_version 848787 (0.00087) [2022-07-10 18:54:35,005][25689] Fps is (10 sec: 5476.1, 60 sec: 5513.8, 300 sec: 5528.2). Total num frames: 869161984. Throughput: 0: 4988.1. Samples: 869158096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:35,005][25689] Avg episode reward: [(0, '0.392')] [2022-07-10 18:54:35,652][26022] Updated weights on worker 0-0, policy_version 848797 (0.00088) [2022-07-10 18:54:37,676][26022] Updated weights on worker 0-0, policy_version 848807 (0.00093) [2022-07-10 18:54:39,323][26022] Updated weights on worker 0-0, policy_version 848817 (0.00100) [2022-07-10 18:54:40,009][25689] Fps is (10 sec: 5379.7, 60 sec: 5538.5, 300 sec: 5526.0). Total num frames: 869190656. Throughput: 0: 5832.6. Samples: 869191796. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:40,011][25689] Avg episode reward: [(0, '0.953')] [2022-07-10 18:54:41,332][26022] Updated weights on worker 0-0, policy_version 848827 (0.00093) [2022-07-10 18:54:43,087][26022] Updated weights on worker 0-0, policy_version 848837 (0.00084) [2022-07-10 18:54:44,927][26022] Updated weights on worker 0-0, policy_version 848847 (0.00643) [2022-07-10 18:54:45,062][25689] Fps is (10 sec: 5701.4, 60 sec: 5535.1, 300 sec: 5536.2). Total num frames: 869219328. Throughput: 0: 5840.2. Samples: 869225852. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:45,063][25689] Avg episode reward: [(0, '-0.098')] [2022-07-10 18:54:46,566][26022] Updated weights on worker 0-0, policy_version 848857 (0.00091) [2022-07-10 18:54:48,805][26022] Updated weights on worker 0-0, policy_version 848867 (0.00086) [2022-07-10 18:54:50,077][25689] Fps is (10 sec: 5695.8, 60 sec: 5551.4, 300 sec: 5533.8). Total num frames: 869248000. Throughput: 0: 5015.5. Samples: 869242670. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:50,077][25689] Avg episode reward: [(0, '-0.314')] [2022-07-10 18:54:50,265][26022] Updated weights on worker 0-0, policy_version 848877 (0.00087) [2022-07-10 18:54:52,371][26022] Updated weights on worker 0-0, policy_version 848887 (0.00087) [2022-07-10 18:54:53,875][26022] Updated weights on worker 0-0, policy_version 848897 (0.00083) [2022-07-10 18:54:55,207][25689] Fps is (10 sec: 5450.5, 60 sec: 5527.9, 300 sec: 5528.0). Total num frames: 869274624. Throughput: 0: 5853.3. Samples: 869276350. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:54:55,209][25689] Avg episode reward: [(0, '-0.865')] [2022-07-10 18:54:55,893][26022] Updated weights on worker 0-0, policy_version 848907 (0.00084) [2022-07-10 18:54:56,191][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:54:56,200][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000848908_869281792.pth [2022-07-10 18:54:56,200][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000846960_867287040.pth [2022-07-10 18:54:57,839][26022] Updated weights on worker 0-0, policy_version 848917 (0.00085) [2022-07-10 18:54:59,494][26022] Updated weights on worker 0-0, policy_version 848927 (0.00089) [2022-07-10 18:55:00,231][25689] Fps is (10 sec: 5546.6, 60 sec: 5544.4, 300 sec: 5542.2). Total num frames: 869304320. Throughput: 0: 5829.0. Samples: 869309668. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:00,233][25689] Avg episode reward: [(0, '-0.868')] [2022-07-10 18:55:01,583][26022] Updated weights on worker 0-0, policy_version 848937 (0.00083) [2022-07-10 18:55:03,701][26022] Updated weights on worker 0-0, policy_version 848947 (0.00092) [2022-07-10 18:55:05,292][25689] Fps is (10 sec: 5483.2, 60 sec: 5558.8, 300 sec: 5530.9). Total num frames: 869329920. Throughput: 0: 4899.7. Samples: 869324972. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:05,294][25689] Avg episode reward: [(0, '-0.429')] [2022-07-10 18:55:05,472][26022] Updated weights on worker 0-0, policy_version 848957 (0.00079) [2022-07-10 18:55:07,708][26022] Updated weights on worker 0-0, policy_version 848967 (0.00079) [2022-07-10 18:55:08,954][26022] Updated weights on worker 0-0, policy_version 848977 (0.00080) [2022-07-10 18:55:10,306][25689] Fps is (10 sec: 5183.6, 60 sec: 5510.3, 300 sec: 5525.9). Total num frames: 869356544. Throughput: 0: 5684.7. Samples: 869357664. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:10,307][25689] Avg episode reward: [(0, '-1.185')] [2022-07-10 18:55:11,413][26022] Updated weights on worker 0-0, policy_version 848987 (0.00050) [2022-07-10 18:55:12,788][26022] Updated weights on worker 0-0, policy_version 848997 (0.00091) [2022-07-10 18:55:14,842][26022] Updated weights on worker 0-0, policy_version 849007 (0.00093) [2022-07-10 18:55:15,423][25689] Fps is (10 sec: 5559.4, 60 sec: 5525.8, 300 sec: 5527.6). Total num frames: 869386240. Throughput: 0: 5672.2. Samples: 869391014. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:15,423][25689] Avg episode reward: [(0, '-1.884')] [2022-07-10 18:55:16,738][26022] Updated weights on worker 0-0, policy_version 849017 (0.00085) [2022-07-10 18:55:18,375][26022] Updated weights on worker 0-0, policy_version 849027 (0.00086) [2022-07-10 18:55:20,299][26022] Updated weights on worker 0-0, policy_version 849037 (0.00086) [2022-07-10 18:55:20,451][25689] Fps is (10 sec: 5753.2, 60 sec: 5562.6, 300 sec: 5530.7). Total num frames: 869414912. Throughput: 0: 4856.0. Samples: 869407858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:20,452][25689] Avg episode reward: [(0, '-1.755')] [2022-07-10 18:55:22,433][26022] Updated weights on worker 0-0, policy_version 849047 (0.00086) [2022-07-10 18:55:23,820][26022] Updated weights on worker 0-0, policy_version 849057 (0.00087) [2022-07-10 18:55:25,475][25689] Fps is (10 sec: 5500.9, 60 sec: 5528.5, 300 sec: 5527.4). Total num frames: 869441536. Throughput: 0: 5761.7. Samples: 869441258. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:25,476][25689] Avg episode reward: [(0, '-1.630')] [2022-07-10 18:55:26,195][26022] Updated weights on worker 0-0, policy_version 849067 (0.00094) [2022-07-10 18:55:27,608][26022] Updated weights on worker 0-0, policy_version 849077 (0.00089) [2022-07-10 18:55:29,615][26022] Updated weights on worker 0-0, policy_version 849087 (0.00095) [2022-07-10 18:55:30,491][25689] Fps is (10 sec: 5405.7, 60 sec: 5494.5, 300 sec: 5525.7). Total num frames: 869469184. Throughput: 0: 5793.5. Samples: 869474606. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:30,491][25689] Avg episode reward: [(0, '-1.838')] [2022-07-10 18:55:31,492][26022] Updated weights on worker 0-0, policy_version 849097 (0.00087) [2022-07-10 18:55:33,192][26022] Updated weights on worker 0-0, policy_version 849107 (0.00092) [2022-07-10 18:55:35,142][26022] Updated weights on worker 0-0, policy_version 849117 (0.00088) [2022-07-10 18:55:35,535][25689] Fps is (10 sec: 5598.3, 60 sec: 5548.8, 300 sec: 5528.5). Total num frames: 869497856. Throughput: 0: 4979.6. Samples: 869491166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:35,536][25689] Avg episode reward: [(0, '-3.085')] [2022-07-10 18:55:36,904][26022] Updated weights on worker 0-0, policy_version 849127 (0.00195) [2022-07-10 18:55:38,737][26022] Updated weights on worker 0-0, policy_version 849137 (0.00083) [2022-07-10 18:55:40,561][25689] Fps is (10 sec: 5592.5, 60 sec: 5529.9, 300 sec: 5525.0). Total num frames: 869525504. Throughput: 0: 5805.9. Samples: 869524616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:40,562][25689] Avg episode reward: [(0, '-2.843')] [2022-07-10 18:55:40,699][26022] Updated weights on worker 0-0, policy_version 849147 (0.00091) [2022-07-10 18:55:42,528][26022] Updated weights on worker 0-0, policy_version 849157 (0.00088) [2022-07-10 18:55:44,174][26022] Updated weights on worker 0-0, policy_version 849167 (0.00085) [2022-07-10 18:55:45,567][25689] Fps is (10 sec: 5511.9, 60 sec: 5517.3, 300 sec: 5528.7). Total num frames: 869553152. Throughput: 0: 5811.4. Samples: 869558022. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:45,568][25689] Avg episode reward: [(0, '-1.589')] [2022-07-10 18:55:46,289][26022] Updated weights on worker 0-0, policy_version 849177 (0.00091) [2022-07-10 18:55:47,864][26022] Updated weights on worker 0-0, policy_version 849187 (0.00088) [2022-07-10 18:55:49,842][26022] Updated weights on worker 0-0, policy_version 849197 (0.00093) [2022-07-10 18:55:50,593][25689] Fps is (10 sec: 5511.9, 60 sec: 5499.3, 300 sec: 5522.8). Total num frames: 869580800. Throughput: 0: 4980.5. Samples: 869574728. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:50,594][25689] Avg episode reward: [(0, '-1.523')] [2022-07-10 18:55:51,826][26022] Updated weights on worker 0-0, policy_version 849207 (0.00049) [2022-07-10 18:55:53,529][26022] Updated weights on worker 0-0, policy_version 849217 (0.00090) [2022-07-10 18:55:55,559][26022] Updated weights on worker 0-0, policy_version 849227 (0.00099) [2022-07-10 18:55:55,706][25689] Fps is (10 sec: 5554.8, 60 sec: 5534.8, 300 sec: 5528.5). Total num frames: 869609472. Throughput: 0: 5791.1. Samples: 869607978. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:55:55,707][25689] Avg episode reward: [(0, '-1.934')] [2022-07-10 18:55:57,144][26022] Updated weights on worker 0-0, policy_version 849237 (0.00092) [2022-07-10 18:55:59,112][26022] Updated weights on worker 0-0, policy_version 849247 (0.00092) [2022-07-10 18:56:00,756][25689] Fps is (10 sec: 5642.7, 60 sec: 5515.5, 300 sec: 5534.7). Total num frames: 869638144. Throughput: 0: 5755.4. Samples: 869640844. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:00,757][25689] Avg episode reward: [(0, '-1.582')] [2022-07-10 18:56:01,188][26022] Updated weights on worker 0-0, policy_version 849257 (0.00088) [2022-07-10 18:56:03,226][26022] Updated weights on worker 0-0, policy_version 849267 (0.00086) [2022-07-10 18:56:05,184][26022] Updated weights on worker 0-0, policy_version 849277 (0.00085) [2022-07-10 18:56:05,778][25689] Fps is (10 sec: 5286.7, 60 sec: 5502.1, 300 sec: 5524.0). Total num frames: 869662720. Throughput: 0: 5657.5. Samples: 869672366. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:05,779][25689] Avg episode reward: [(0, '0.364')] [2022-07-10 18:56:06,956][26022] Updated weights on worker 0-0, policy_version 849287 (0.00090) [2022-07-10 18:56:08,612][26022] Updated weights on worker 0-0, policy_version 849297 (0.00102) [2022-07-10 18:56:10,704][26022] Updated weights on worker 0-0, policy_version 849307 (0.00090) [2022-07-10 18:56:10,789][25689] Fps is (10 sec: 5205.4, 60 sec: 5519.3, 300 sec: 5521.6). Total num frames: 869690368. Throughput: 0: 5656.6. Samples: 869688964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:10,789][25689] Avg episode reward: [(0, '0.560')] [2022-07-10 18:56:12,205][26022] Updated weights on worker 0-0, policy_version 849317 (0.00096) [2022-07-10 18:56:14,381][26022] Updated weights on worker 0-0, policy_version 849327 (0.00094) [2022-07-10 18:56:15,844][25689] Fps is (10 sec: 5697.1, 60 sec: 5525.0, 300 sec: 5527.5). Total num frames: 869720064. Throughput: 0: 5679.0. Samples: 869722340. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:15,845][25689] Avg episode reward: [(0, '0.754')] [2022-07-10 18:56:16,001][26022] Updated weights on worker 0-0, policy_version 849337 (0.00093) [2022-07-10 18:56:17,968][26022] Updated weights on worker 0-0, policy_version 849347 (0.00087) [2022-07-10 18:56:19,979][26022] Updated weights on worker 0-0, policy_version 849357 (0.00089) [2022-07-10 18:56:20,873][25689] Fps is (10 sec: 5584.6, 60 sec: 5490.9, 300 sec: 5523.6). Total num frames: 869746688. Throughput: 0: 5706.9. Samples: 869755654. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:20,874][25689] Avg episode reward: [(0, '0.610')] [2022-07-10 18:56:21,499][26022] Updated weights on worker 0-0, policy_version 849367 (0.00082) [2022-07-10 18:56:23,591][26022] Updated weights on worker 0-0, policy_version 849377 (0.00097) [2022-07-10 18:56:25,274][26022] Updated weights on worker 0-0, policy_version 849387 (0.00088) [2022-07-10 18:56:25,883][25689] Fps is (10 sec: 5304.1, 60 sec: 5492.3, 300 sec: 5524.2). Total num frames: 869773312. Throughput: 0: 4977.2. Samples: 869772430. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:25,884][25689] Avg episode reward: [(0, '0.870')] [2022-07-10 18:56:27,048][26022] Updated weights on worker 0-0, policy_version 849397 (0.00083) [2022-07-10 18:56:29,322][26022] Updated weights on worker 0-0, policy_version 849407 (0.00093) [2022-07-10 18:56:30,724][26022] Updated weights on worker 0-0, policy_version 849417 (0.00093) [2022-07-10 18:56:30,885][25689] Fps is (10 sec: 5625.3, 60 sec: 5527.4, 300 sec: 5529.1). Total num frames: 869803008. Throughput: 0: 5817.2. Samples: 869805872. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 18:56:30,887][25689] Avg episode reward: [(0, '0.955')] [2022-07-10 18:56:32,898][26022] Updated weights on worker 0-0, policy_version 849427 (0.00084) [2022-07-10 18:56:34,494][26022] Updated weights on worker 0-0, policy_version 849437 (0.00082) [2022-07-10 18:56:36,001][25689] Fps is (10 sec: 5667.4, 60 sec: 5503.9, 300 sec: 5520.7). Total num frames: 869830656. Throughput: 0: 5799.3. Samples: 869839238. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:56:36,001][25689] Avg episode reward: [(0, '0.524')] [2022-07-10 18:56:36,447][26022] Updated weights on worker 0-0, policy_version 849447 (0.00089) [2022-07-10 18:56:38,295][26022] Updated weights on worker 0-0, policy_version 849457 (0.00095) [2022-07-10 18:56:40,236][26022] Updated weights on worker 0-0, policy_version 849467 (0.00171) [2022-07-10 18:56:41,085][25689] Fps is (10 sec: 5521.8, 60 sec: 5515.6, 300 sec: 5529.8). Total num frames: 869859328. Throughput: 0: 4963.8. Samples: 869855980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:56:41,086][25689] Avg episode reward: [(0, '0.110')] [2022-07-10 18:56:41,865][26022] Updated weights on worker 0-0, policy_version 849477 (0.00086) [2022-07-10 18:56:43,997][26022] Updated weights on worker 0-0, policy_version 849487 (0.01118) [2022-07-10 18:56:45,470][26022] Updated weights on worker 0-0, policy_version 849497 (0.00086) [2022-07-10 18:56:46,113][25689] Fps is (10 sec: 5569.5, 60 sec: 5513.6, 300 sec: 5519.5). Total num frames: 869886976. Throughput: 0: 5791.1. Samples: 869889586. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:56:46,114][25689] Avg episode reward: [(0, '0.280')] [2022-07-10 18:56:47,682][26022] Updated weights on worker 0-0, policy_version 849507 (0.00092) [2022-07-10 18:56:49,140][26022] Updated weights on worker 0-0, policy_version 849517 (0.00085) [2022-07-10 18:56:51,027][26022] Updated weights on worker 0-0, policy_version 849527 (0.00087) [2022-07-10 18:56:51,116][25689] Fps is (10 sec: 5614.9, 60 sec: 5532.7, 300 sec: 5527.5). Total num frames: 869915648. Throughput: 0: 5793.5. Samples: 869923076. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:56:51,116][25689] Avg episode reward: [(0, '0.186')] [2022-07-10 18:56:52,856][26022] Updated weights on worker 0-0, policy_version 849537 (0.00092) [2022-07-10 18:56:54,859][26022] Updated weights on worker 0-0, policy_version 849547 (0.00087) [2022-07-10 18:56:56,184][25689] Fps is (10 sec: 5694.3, 60 sec: 5536.8, 300 sec: 5530.0). Total num frames: 869944320. Throughput: 0: 4979.3. Samples: 869939734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:56:56,184][25689] Avg episode reward: [(0, '-0.454')] [2022-07-10 18:56:56,281][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:56:56,298][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000849556_869945344.pth [2022-07-10 18:56:56,299][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000847610_867952640.pth [2022-07-10 18:56:56,555][26022] Updated weights on worker 0-0, policy_version 849557 (0.00090) [2022-07-10 18:56:58,539][26022] Updated weights on worker 0-0, policy_version 849567 (0.00094) [2022-07-10 18:57:00,174][26022] Updated weights on worker 0-0, policy_version 849577 (0.00087) [2022-07-10 18:57:01,252][25689] Fps is (10 sec: 5556.2, 60 sec: 5518.1, 300 sec: 5525.4). Total num frames: 869971968. Throughput: 0: 5817.1. Samples: 869973292. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:01,252][25689] Avg episode reward: [(0, '-0.116')] [2022-07-10 18:57:02,594][26022] Updated weights on worker 0-0, policy_version 849587 (0.00091) [2022-07-10 18:57:04,286][26022] Updated weights on worker 0-0, policy_version 849597 (0.00093) [2022-07-10 18:57:06,282][25689] Fps is (10 sec: 5171.3, 60 sec: 5517.4, 300 sec: 5521.5). Total num frames: 869996544. Throughput: 0: 5699.7. Samples: 870004544. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:06,283][25689] Avg episode reward: [(0, '0.314')] [2022-07-10 18:57:06,292][26022] Updated weights on worker 0-0, policy_version 849607 (0.00107) [2022-07-10 18:57:08,061][26022] Updated weights on worker 0-0, policy_version 849617 (0.00090) [2022-07-10 18:57:09,980][26022] Updated weights on worker 0-0, policy_version 849627 (0.00088) [2022-07-10 18:57:11,292][25689] Fps is (10 sec: 5303.8, 60 sec: 5534.4, 300 sec: 5524.0). Total num frames: 870025216. Throughput: 0: 4859.0. Samples: 870021112. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:11,292][25689] Avg episode reward: [(0, '-0.076')] [2022-07-10 18:57:11,630][26022] Updated weights on worker 0-0, policy_version 849637 (0.00105) [2022-07-10 18:57:13,628][26022] Updated weights on worker 0-0, policy_version 849647 (0.00088) [2022-07-10 18:57:15,236][26022] Updated weights on worker 0-0, policy_version 849657 (0.00088) [2022-07-10 18:57:16,334][25689] Fps is (10 sec: 5602.9, 60 sec: 5501.7, 300 sec: 5523.3). Total num frames: 870052864. Throughput: 0: 5694.5. Samples: 870054482. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:16,336][25689] Avg episode reward: [(0, '0.100')] [2022-07-10 18:57:17,382][26022] Updated weights on worker 0-0, policy_version 849667 (0.00085) [2022-07-10 18:57:19,047][26022] Updated weights on worker 0-0, policy_version 849677 (0.00086) [2022-07-10 18:57:21,139][26022] Updated weights on worker 0-0, policy_version 849687 (0.00086) [2022-07-10 18:57:21,339][25689] Fps is (10 sec: 5605.2, 60 sec: 5537.8, 300 sec: 5527.1). Total num frames: 870081536. Throughput: 0: 5703.7. Samples: 870087864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:21,341][25689] Avg episode reward: [(0, '0.530')] [2022-07-10 18:57:22,669][26022] Updated weights on worker 0-0, policy_version 849697 (0.00086) [2022-07-10 18:57:24,848][26022] Updated weights on worker 0-0, policy_version 849707 (0.00095) [2022-07-10 18:57:26,343][26022] Updated weights on worker 0-0, policy_version 849717 (0.00088) [2022-07-10 18:57:26,344][25689] Fps is (10 sec: 5729.0, 60 sec: 5572.1, 300 sec: 5530.5). Total num frames: 870110208. Throughput: 0: 4993.8. Samples: 870104726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:26,345][25689] Avg episode reward: [(0, '1.137')] [2022-07-10 18:57:28,472][26022] Updated weights on worker 0-0, policy_version 849727 (0.00082) [2022-07-10 18:57:30,214][26022] Updated weights on worker 0-0, policy_version 849737 (0.00084) [2022-07-10 18:57:31,367][25689] Fps is (10 sec: 5412.4, 60 sec: 5502.5, 300 sec: 5523.9). Total num frames: 870135808. Throughput: 0: 5812.3. Samples: 870137794. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:31,367][25689] Avg episode reward: [(0, '0.615')] [2022-07-10 18:57:32,143][26022] Updated weights on worker 0-0, policy_version 849747 (0.00097) [2022-07-10 18:57:33,783][26022] Updated weights on worker 0-0, policy_version 849757 (0.00092) [2022-07-10 18:57:35,632][26022] Updated weights on worker 0-0, policy_version 849767 (0.00094) [2022-07-10 18:57:36,423][25689] Fps is (10 sec: 5384.6, 60 sec: 5524.9, 300 sec: 5524.1). Total num frames: 870164480. Throughput: 0: 5800.4. Samples: 870171004. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:36,423][25689] Avg episode reward: [(0, '0.507')] [2022-07-10 18:57:37,664][26022] Updated weights on worker 0-0, policy_version 849777 (0.00080) [2022-07-10 18:57:39,367][26022] Updated weights on worker 0-0, policy_version 849787 (0.00088) [2022-07-10 18:57:41,289][26022] Updated weights on worker 0-0, policy_version 849797 (0.00089) [2022-07-10 18:57:41,435][25689] Fps is (10 sec: 5695.3, 60 sec: 5531.4, 300 sec: 5528.5). Total num frames: 870193152. Throughput: 0: 4975.4. Samples: 870187850. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:41,436][25689] Avg episode reward: [(0, '0.210')] [2022-07-10 18:57:42,983][26022] Updated weights on worker 0-0, policy_version 849807 (0.00617) [2022-07-10 18:57:44,938][26022] Updated weights on worker 0-0, policy_version 849817 (0.00092) [2022-07-10 18:57:46,464][25689] Fps is (10 sec: 5506.8, 60 sec: 5514.4, 300 sec: 5518.8). Total num frames: 870219776. Throughput: 0: 5797.6. Samples: 870221376. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:46,465][25689] Avg episode reward: [(0, '0.088')] [2022-07-10 18:57:46,820][26022] Updated weights on worker 0-0, policy_version 849827 (0.00087) [2022-07-10 18:57:48,616][26022] Updated weights on worker 0-0, policy_version 849837 (0.00089) [2022-07-10 18:57:50,508][26022] Updated weights on worker 0-0, policy_version 849847 (0.00088) [2022-07-10 18:57:51,479][25689] Fps is (10 sec: 5505.7, 60 sec: 5513.3, 300 sec: 5527.6). Total num frames: 870248448. Throughput: 0: 5806.5. Samples: 870254574. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:51,479][25689] Avg episode reward: [(0, '-1.961')] [2022-07-10 18:57:52,286][26022] Updated weights on worker 0-0, policy_version 849857 (0.00082) [2022-07-10 18:57:54,072][26022] Updated weights on worker 0-0, policy_version 849867 (0.00082) [2022-07-10 18:57:56,014][26022] Updated weights on worker 0-0, policy_version 849877 (0.00089) [2022-07-10 18:57:56,614][25689] Fps is (10 sec: 5649.6, 60 sec: 5507.1, 300 sec: 5526.0). Total num frames: 870277120. Throughput: 0: 4960.2. Samples: 870271158. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:57:56,615][25689] Avg episode reward: [(0, '-1.745')] [2022-07-10 18:57:57,863][26022] Updated weights on worker 0-0, policy_version 849887 (0.00088) [2022-07-10 18:57:59,825][26022] Updated weights on worker 0-0, policy_version 849897 (0.00085) [2022-07-10 18:58:01,638][25689] Fps is (10 sec: 5442.8, 60 sec: 5494.2, 300 sec: 5529.7). Total num frames: 870303744. Throughput: 0: 5767.3. Samples: 870304366. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:01,643][25689] Avg episode reward: [(0, '-1.814')] [2022-07-10 18:58:02,109][26022] Updated weights on worker 0-0, policy_version 849907 (0.00088) [2022-07-10 18:58:03,665][26022] Updated weights on worker 0-0, policy_version 849917 (0.00088) [2022-07-10 18:58:05,713][26022] Updated weights on worker 0-0, policy_version 849927 (0.00083) [2022-07-10 18:58:06,647][25689] Fps is (10 sec: 5511.8, 60 sec: 5564.1, 300 sec: 5529.8). Total num frames: 870332416. Throughput: 0: 5673.8. Samples: 870335886. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:06,647][25689] Avg episode reward: [(0, '-1.951')] [2022-07-10 18:58:07,652][26022] Updated weights on worker 0-0, policy_version 849937 (0.00097) [2022-07-10 18:58:09,351][26022] Updated weights on worker 0-0, policy_version 849947 (0.00095) [2022-07-10 18:58:11,389][26022] Updated weights on worker 0-0, policy_version 849957 (0.00096) [2022-07-10 18:58:11,675][25689] Fps is (10 sec: 5407.0, 60 sec: 5511.4, 300 sec: 5523.7). Total num frames: 870358016. Throughput: 0: 4854.9. Samples: 870352628. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:11,676][25689] Avg episode reward: [(0, '-2.243')] [2022-07-10 18:58:12,714][26022] Updated weights on worker 0-0, policy_version 849967 (0.00084) [2022-07-10 18:58:15,007][26022] Updated weights on worker 0-0, policy_version 849977 (0.00096) [2022-07-10 18:58:16,434][26022] Updated weights on worker 0-0, policy_version 849987 (0.00098) [2022-07-10 18:58:16,732][25689] Fps is (10 sec: 5482.5, 60 sec: 5544.0, 300 sec: 5526.7). Total num frames: 870387712. Throughput: 0: 5719.1. Samples: 870386216. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:16,733][25689] Avg episode reward: [(0, '-3.577')] [2022-07-10 18:58:18,580][26022] Updated weights on worker 0-0, policy_version 849997 (0.00086) [2022-07-10 18:58:20,232][26022] Updated weights on worker 0-0, policy_version 850007 (0.00093) [2022-07-10 18:58:21,739][25689] Fps is (10 sec: 5698.1, 60 sec: 5526.9, 300 sec: 5523.7). Total num frames: 870415360. Throughput: 0: 5749.1. Samples: 870419928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:21,740][25689] Avg episode reward: [(0, '-1.722')] [2022-07-10 18:58:22,302][26022] Updated weights on worker 0-0, policy_version 850017 (0.00094) [2022-07-10 18:58:23,880][26022] Updated weights on worker 0-0, policy_version 850027 (0.00080) [2022-07-10 18:58:26,071][26022] Updated weights on worker 0-0, policy_version 850037 (0.00623) [2022-07-10 18:58:26,771][25689] Fps is (10 sec: 5406.4, 60 sec: 5490.5, 300 sec: 5523.3). Total num frames: 870441984. Throughput: 0: 4998.1. Samples: 870436470. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:26,771][25689] Avg episode reward: [(0, '-1.740')] [2022-07-10 18:58:27,691][26022] Updated weights on worker 0-0, policy_version 850047 (0.00089) [2022-07-10 18:58:29,673][26022] Updated weights on worker 0-0, policy_version 850057 (0.00089) [2022-07-10 18:58:31,302][26022] Updated weights on worker 0-0, policy_version 850067 (0.00080) [2022-07-10 18:58:31,783][25689] Fps is (10 sec: 5505.8, 60 sec: 5542.4, 300 sec: 5524.3). Total num frames: 870470656. Throughput: 0: 5828.8. Samples: 870469828. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:31,783][25689] Avg episode reward: [(0, '-1.342')] [2022-07-10 18:58:33,332][26022] Updated weights on worker 0-0, policy_version 850077 (0.00094) [2022-07-10 18:58:35,040][26022] Updated weights on worker 0-0, policy_version 850087 (0.00090) [2022-07-10 18:58:36,900][25689] Fps is (10 sec: 5459.4, 60 sec: 5502.9, 300 sec: 5520.4). Total num frames: 870497280. Throughput: 0: 5795.8. Samples: 870503100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:36,900][25689] Avg episode reward: [(0, '-2.068')] [2022-07-10 18:58:37,047][26022] Updated weights on worker 0-0, policy_version 850097 (0.00093) [2022-07-10 18:58:38,871][26022] Updated weights on worker 0-0, policy_version 850107 (0.00086) [2022-07-10 18:58:40,654][26022] Updated weights on worker 0-0, policy_version 850117 (0.00090) [2022-07-10 18:58:41,957][25689] Fps is (10 sec: 5535.5, 60 sec: 5515.8, 300 sec: 5523.0). Total num frames: 870526976. Throughput: 0: 5759.6. Samples: 870536374. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:41,958][25689] Avg episode reward: [(0, '-0.413')] [2022-07-10 18:58:42,462][26022] Updated weights on worker 0-0, policy_version 850127 (0.00094) [2022-07-10 18:58:44,327][26022] Updated weights on worker 0-0, policy_version 850137 (0.00087) [2022-07-10 18:58:46,304][26022] Updated weights on worker 0-0, policy_version 850147 (0.00094) [2022-07-10 18:58:46,965][25689] Fps is (10 sec: 5799.0, 60 sec: 5551.5, 300 sec: 5526.5). Total num frames: 870555648. Throughput: 0: 5782.7. Samples: 870553246. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:46,967][25689] Avg episode reward: [(0, '0.374')] [2022-07-10 18:58:47,878][26022] Updated weights on worker 0-0, policy_version 850157 (0.00088) [2022-07-10 18:58:49,994][26022] Updated weights on worker 0-0, policy_version 850167 (0.00093) [2022-07-10 18:58:51,573][26022] Updated weights on worker 0-0, policy_version 850177 (0.00086) [2022-07-10 18:58:51,973][25689] Fps is (10 sec: 5520.6, 60 sec: 5518.2, 300 sec: 5524.0). Total num frames: 870582272. Throughput: 0: 5775.6. Samples: 870586442. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:51,975][25689] Avg episode reward: [(0, '0.106')] [2022-07-10 18:58:53,607][26022] Updated weights on worker 0-0, policy_version 850187 (0.00083) [2022-07-10 18:58:55,382][26022] Updated weights on worker 0-0, policy_version 850197 (0.00081) [2022-07-10 18:58:56,470][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 18:58:56,482][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000850202_870606848.pth [2022-07-10 18:58:56,483][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000848259_868617216.pth [2022-07-10 18:58:57,040][25689] Fps is (10 sec: 5488.6, 60 sec: 5524.6, 300 sec: 5523.1). Total num frames: 870610944. Throughput: 0: 5791.8. Samples: 870619746. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:58:57,040][25689] Avg episode reward: [(0, '0.182')] [2022-07-10 18:58:57,182][26022] Updated weights on worker 0-0, policy_version 850207 (0.00089) [2022-07-10 18:58:59,324][26022] Updated weights on worker 0-0, policy_version 850217 (0.00084) [2022-07-10 18:59:01,033][26022] Updated weights on worker 0-0, policy_version 850227 (0.00086) [2022-07-10 18:59:02,057][25689] Fps is (10 sec: 5585.2, 60 sec: 5542.1, 300 sec: 5533.7). Total num frames: 870638592. Throughput: 0: 4965.5. Samples: 870636182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:02,058][25689] Avg episode reward: [(0, '-1.005')] [2022-07-10 18:59:03,194][26022] Updated weights on worker 0-0, policy_version 850237 (0.00085) [2022-07-10 18:59:04,995][26022] Updated weights on worker 0-0, policy_version 850247 (0.00094) [2022-07-10 18:59:06,866][26022] Updated weights on worker 0-0, policy_version 850257 (0.00091) [2022-07-10 18:59:07,074][25689] Fps is (10 sec: 5306.3, 60 sec: 5490.4, 300 sec: 5520.3). Total num frames: 870664192. Throughput: 0: 5685.8. Samples: 870667586. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:07,075][25689] Avg episode reward: [(0, '-0.401')] [2022-07-10 18:59:08,689][26022] Updated weights on worker 0-0, policy_version 850267 (0.00086) [2022-07-10 18:59:10,539][26022] Updated weights on worker 0-0, policy_version 850277 (0.00083) [2022-07-10 18:59:12,097][25689] Fps is (10 sec: 5303.9, 60 sec: 5525.0, 300 sec: 5518.4). Total num frames: 870691840. Throughput: 0: 5695.7. Samples: 870701058. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:12,097][25689] Avg episode reward: [(0, '-2.140')] [2022-07-10 18:59:12,224][26022] Updated weights on worker 0-0, policy_version 850287 (0.00086) [2022-07-10 18:59:14,254][26022] Updated weights on worker 0-0, policy_version 850297 (0.00083) [2022-07-10 18:59:16,004][26022] Updated weights on worker 0-0, policy_version 850307 (0.00089) [2022-07-10 18:59:17,218][25689] Fps is (10 sec: 5552.4, 60 sec: 5502.2, 300 sec: 5524.1). Total num frames: 870720512. Throughput: 0: 4860.8. Samples: 870717828. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:17,218][25689] Avg episode reward: [(0, '-2.542')] [2022-07-10 18:59:18,058][26022] Updated weights on worker 0-0, policy_version 850317 (0.00089) [2022-07-10 18:59:19,835][26022] Updated weights on worker 0-0, policy_version 850327 (0.00085) [2022-07-10 18:59:21,779][26022] Updated weights on worker 0-0, policy_version 850337 (0.00084) [2022-07-10 18:59:22,229][25689] Fps is (10 sec: 5558.4, 60 sec: 5501.8, 300 sec: 5520.9). Total num frames: 870748160. Throughput: 0: 5696.3. Samples: 870751086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:22,231][25689] Avg episode reward: [(0, '-3.110')] [2022-07-10 18:59:23,482][26022] Updated weights on worker 0-0, policy_version 850347 (0.00089) [2022-07-10 18:59:25,201][26022] Updated weights on worker 0-0, policy_version 850357 (0.00088) [2022-07-10 18:59:27,190][26022] Updated weights on worker 0-0, policy_version 850367 (0.00085) [2022-07-10 18:59:27,292][25689] Fps is (10 sec: 5488.9, 60 sec: 5515.8, 300 sec: 5513.1). Total num frames: 870775808. Throughput: 0: 5782.4. Samples: 870784492. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:27,292][25689] Avg episode reward: [(0, '-2.151')] [2022-07-10 18:59:29,018][26022] Updated weights on worker 0-0, policy_version 850377 (0.00094) [2022-07-10 18:59:30,943][26022] Updated weights on worker 0-0, policy_version 850387 (0.00091) [2022-07-10 18:59:32,294][25689] Fps is (10 sec: 5494.0, 60 sec: 5499.8, 300 sec: 5521.5). Total num frames: 870803456. Throughput: 0: 4956.0. Samples: 870801156. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:32,294][25689] Avg episode reward: [(0, '-1.718')] [2022-07-10 18:59:32,591][26022] Updated weights on worker 0-0, policy_version 850397 (0.00088) [2022-07-10 18:59:34,575][26022] Updated weights on worker 0-0, policy_version 850407 (0.00084) [2022-07-10 18:59:36,348][26022] Updated weights on worker 0-0, policy_version 850417 (0.00086) [2022-07-10 18:59:37,383][25689] Fps is (10 sec: 5479.9, 60 sec: 5519.3, 300 sec: 5516.5). Total num frames: 870831104. Throughput: 0: 5777.1. Samples: 870834322. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:37,383][25689] Avg episode reward: [(0, '-2.468')] [2022-07-10 18:59:38,237][26022] Updated weights on worker 0-0, policy_version 850427 (0.00086) [2022-07-10 18:59:40,070][26022] Updated weights on worker 0-0, policy_version 850437 (0.00089) [2022-07-10 18:59:41,945][26022] Updated weights on worker 0-0, policy_version 850447 (0.00086) [2022-07-10 18:59:42,430][25689] Fps is (10 sec: 5556.3, 60 sec: 5503.3, 300 sec: 5516.6). Total num frames: 870859776. Throughput: 0: 5774.2. Samples: 870867730. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:42,430][25689] Avg episode reward: [(0, '-0.212')] [2022-07-10 18:59:43,737][26022] Updated weights on worker 0-0, policy_version 850457 (0.00085) [2022-07-10 18:59:45,660][26022] Updated weights on worker 0-0, policy_version 850467 (0.00089) [2022-07-10 18:59:47,303][26022] Updated weights on worker 0-0, policy_version 850477 (0.00094) [2022-07-10 18:59:47,479][25689] Fps is (10 sec: 5781.2, 60 sec: 5516.5, 300 sec: 5519.4). Total num frames: 870889472. Throughput: 0: 4947.9. Samples: 870884374. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:47,479][25689] Avg episode reward: [(0, '-0.448')] [2022-07-10 18:59:49,384][26022] Updated weights on worker 0-0, policy_version 850487 (0.00084) [2022-07-10 18:59:50,971][26022] Updated weights on worker 0-0, policy_version 850497 (0.00090) [2022-07-10 18:59:52,490][25689] Fps is (10 sec: 5394.9, 60 sec: 5482.4, 300 sec: 5514.7). Total num frames: 870914048. Throughput: 0: 5774.1. Samples: 870917770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:52,490][25689] Avg episode reward: [(0, '0.383')] [2022-07-10 18:59:53,018][26022] Updated weights on worker 0-0, policy_version 850507 (0.00089) [2022-07-10 18:59:54,779][26022] Updated weights on worker 0-0, policy_version 850517 (0.00095) [2022-07-10 18:59:56,615][26022] Updated weights on worker 0-0, policy_version 850527 (0.00094) [2022-07-10 18:59:57,539][25689] Fps is (10 sec: 5394.7, 60 sec: 5500.9, 300 sec: 5514.2). Total num frames: 870943744. Throughput: 0: 5800.0. Samples: 870951230. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 18:59:57,541][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 18:59:58,555][26022] Updated weights on worker 0-0, policy_version 850537 (0.00085) [2022-07-10 19:00:00,417][26022] Updated weights on worker 0-0, policy_version 850547 (0.00083) [2022-07-10 19:00:02,558][25689] Fps is (10 sec: 5492.2, 60 sec: 5466.9, 300 sec: 5515.0). Total num frames: 870969344. Throughput: 0: 4974.2. Samples: 870967850. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 19:00:02,560][25689] Avg episode reward: [(0, '1.195')] [2022-07-10 19:00:02,573][26022] Updated weights on worker 0-0, policy_version 850557 (0.00084) [2022-07-10 19:00:04,518][26022] Updated weights on worker 0-0, policy_version 850567 (0.00092) [2022-07-10 19:00:06,158][26022] Updated weights on worker 0-0, policy_version 850577 (0.00089) [2022-07-10 19:00:07,564][25689] Fps is (10 sec: 5311.7, 60 sec: 5501.8, 300 sec: 5518.6). Total num frames: 870996992. Throughput: 0: 5722.4. Samples: 870999308. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 19:00:07,566][25689] Avg episode reward: [(0, '1.282')] [2022-07-10 19:00:08,168][26022] Updated weights on worker 0-0, policy_version 850587 (0.00088) [2022-07-10 19:00:09,973][26022] Updated weights on worker 0-0, policy_version 850597 (0.00084) [2022-07-10 19:00:11,815][26022] Updated weights on worker 0-0, policy_version 850607 (0.00088) [2022-07-10 19:00:12,591][25689] Fps is (10 sec: 5613.5, 60 sec: 5518.3, 300 sec: 5516.8). Total num frames: 871025664. Throughput: 0: 5738.0. Samples: 871033110. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-10 19:00:12,591][25689] Avg episode reward: [(0, '0.880')] [2022-07-10 19:00:13,543][26022] Updated weights on worker 0-0, policy_version 850617 (0.00081) [2022-07-10 19:00:15,447][26022] Updated weights on worker 0-0, policy_version 850627 (0.00083) [2022-07-10 19:00:17,347][26022] Updated weights on worker 0-0, policy_version 850637 (0.00087) [2022-07-10 19:00:17,708][25689] Fps is (10 sec: 5552.0, 60 sec: 5501.7, 300 sec: 5511.7). Total num frames: 871053312. Throughput: 0: 4897.0. Samples: 871049998. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:17,709][25689] Avg episode reward: [(0, '1.173')] [2022-07-10 19:00:18,929][26022] Updated weights on worker 0-0, policy_version 850647 (0.00086) [2022-07-10 19:00:21,121][26022] Updated weights on worker 0-0, policy_version 850657 (0.00084) [2022-07-10 19:00:22,542][26022] Updated weights on worker 0-0, policy_version 850667 (0.00091) [2022-07-10 19:00:22,724][25689] Fps is (10 sec: 5659.1, 60 sec: 5535.1, 300 sec: 5522.2). Total num frames: 871083008. Throughput: 0: 5750.0. Samples: 871083804. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:22,724][25689] Avg episode reward: [(0, '-0.023')] [2022-07-10 19:00:24,731][26022] Updated weights on worker 0-0, policy_version 850677 (0.00086) [2022-07-10 19:00:26,401][26022] Updated weights on worker 0-0, policy_version 850687 (0.00092) [2022-07-10 19:00:27,746][25689] Fps is (10 sec: 5610.7, 60 sec: 5522.0, 300 sec: 5518.7). Total num frames: 871109632. Throughput: 0: 5836.8. Samples: 871117106. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:27,747][25689] Avg episode reward: [(0, '0.014')] [2022-07-10 19:00:28,362][26022] Updated weights on worker 0-0, policy_version 850697 (0.00092) [2022-07-10 19:00:30,058][26022] Updated weights on worker 0-0, policy_version 850707 (0.00093) [2022-07-10 19:00:32,062][26022] Updated weights on worker 0-0, policy_version 850717 (0.00086) [2022-07-10 19:00:32,756][25689] Fps is (10 sec: 5409.7, 60 sec: 5521.2, 300 sec: 5515.8). Total num frames: 871137280. Throughput: 0: 4990.5. Samples: 871133746. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:32,757][25689] Avg episode reward: [(0, '-0.902')] [2022-07-10 19:00:33,842][26022] Updated weights on worker 0-0, policy_version 850727 (0.00089) [2022-07-10 19:00:35,792][26022] Updated weights on worker 0-0, policy_version 850737 (0.00092) [2022-07-10 19:00:37,529][26022] Updated weights on worker 0-0, policy_version 850747 (0.00083) [2022-07-10 19:00:37,834][25689] Fps is (10 sec: 5684.5, 60 sec: 5556.1, 300 sec: 5521.8). Total num frames: 871166976. Throughput: 0: 5817.9. Samples: 871167088. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:37,834][25689] Avg episode reward: [(0, '-2.445')] [2022-07-10 19:00:39,335][26022] Updated weights on worker 0-0, policy_version 850757 (0.00054) [2022-07-10 19:00:41,117][26022] Updated weights on worker 0-0, policy_version 850767 (0.00092) [2022-07-10 19:00:42,842][25689] Fps is (10 sec: 5685.4, 60 sec: 5542.7, 300 sec: 5521.7). Total num frames: 871194624. Throughput: 0: 5807.4. Samples: 871200640. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:42,844][25689] Avg episode reward: [(0, '-2.500')] [2022-07-10 19:00:42,992][26022] Updated weights on worker 0-0, policy_version 850777 (0.00092) [2022-07-10 19:00:44,910][26022] Updated weights on worker 0-0, policy_version 850787 (0.00097) [2022-07-10 19:00:46,786][26022] Updated weights on worker 0-0, policy_version 850797 (0.00086) [2022-07-10 19:00:47,866][25689] Fps is (10 sec: 5614.1, 60 sec: 5528.1, 300 sec: 5525.2). Total num frames: 871223296. Throughput: 0: 4990.5. Samples: 871217514. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:47,867][25689] Avg episode reward: [(0, '-3.267')] [2022-07-10 19:00:48,491][26022] Updated weights on worker 0-0, policy_version 850807 (0.00088) [2022-07-10 19:00:50,578][26022] Updated weights on worker 0-0, policy_version 850817 (0.00086) [2022-07-10 19:00:52,287][26022] Updated weights on worker 0-0, policy_version 850827 (0.00085) [2022-07-10 19:00:52,898][25689] Fps is (10 sec: 5600.7, 60 sec: 5577.0, 300 sec: 5523.3). Total num frames: 871250944. Throughput: 0: 5817.6. Samples: 871250924. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:52,899][25689] Avg episode reward: [(0, '-2.544')] [2022-07-10 19:00:54,059][26022] Updated weights on worker 0-0, policy_version 850837 (0.00092) [2022-07-10 19:00:55,940][26022] Updated weights on worker 0-0, policy_version 850847 (0.00091) [2022-07-10 19:00:56,528][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:00:56,550][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000850851_871271424.pth [2022-07-10 19:00:56,550][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000848908_869281792.pth [2022-07-10 19:00:57,787][26022] Updated weights on worker 0-0, policy_version 850857 (0.00086) [2022-07-10 19:00:57,948][25689] Fps is (10 sec: 5382.9, 60 sec: 5526.1, 300 sec: 5516.4). Total num frames: 871277568. Throughput: 0: 5844.2. Samples: 871284640. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:00:57,948][25689] Avg episode reward: [(0, '-2.539')] [2022-07-10 19:00:59,674][26022] Updated weights on worker 0-0, policy_version 850867 (0.00098) [2022-07-10 19:01:01,527][26022] Updated weights on worker 0-0, policy_version 850877 (0.00100) [2022-07-10 19:01:02,953][25689] Fps is (10 sec: 5295.5, 60 sec: 5544.2, 300 sec: 5523.6). Total num frames: 871304192. Throughput: 0: 5008.0. Samples: 871301356. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:02,954][25689] Avg episode reward: [(0, '-1.585')] [2022-07-10 19:01:03,603][26022] Updated weights on worker 0-0, policy_version 850887 (0.00086) [2022-07-10 19:01:05,828][26022] Updated weights on worker 0-0, policy_version 850897 (0.00088) [2022-07-10 19:01:07,247][26022] Updated weights on worker 0-0, policy_version 850907 (0.00086) [2022-07-10 19:01:07,974][25689] Fps is (10 sec: 5310.6, 60 sec: 5525.9, 300 sec: 5519.9). Total num frames: 871330816. Throughput: 0: 5705.1. Samples: 871332238. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:07,975][25689] Avg episode reward: [(0, '-1.757')] [2022-07-10 19:01:09,148][26022] Updated weights on worker 0-0, policy_version 850917 (0.00077) [2022-07-10 19:01:11,058][26022] Updated weights on worker 0-0, policy_version 850927 (0.00091) [2022-07-10 19:01:12,883][26022] Updated weights on worker 0-0, policy_version 850937 (0.00089) [2022-07-10 19:01:12,979][25689] Fps is (10 sec: 5617.6, 60 sec: 5544.9, 300 sec: 5520.9). Total num frames: 871360512. Throughput: 0: 5739.2. Samples: 871366174. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:12,980][25689] Avg episode reward: [(0, '-0.824')] [2022-07-10 19:01:14,516][26022] Updated weights on worker 0-0, policy_version 850947 (0.00085) [2022-07-10 19:01:16,540][26022] Updated weights on worker 0-0, policy_version 850957 (0.00089) [2022-07-10 19:01:18,040][25689] Fps is (10 sec: 5798.8, 60 sec: 5567.1, 300 sec: 5527.2). Total num frames: 871389184. Throughput: 0: 4901.5. Samples: 871383122. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:18,041][25689] Avg episode reward: [(0, '-1.348')] [2022-07-10 19:01:18,139][26022] Updated weights on worker 0-0, policy_version 850967 (0.00093) [2022-07-10 19:01:20,187][26022] Updated weights on worker 0-0, policy_version 850977 (0.00093) [2022-07-10 19:01:21,865][26022] Updated weights on worker 0-0, policy_version 850987 (0.00083) [2022-07-10 19:01:23,063][25689] Fps is (10 sec: 5483.4, 60 sec: 5515.4, 300 sec: 5526.9). Total num frames: 871415808. Throughput: 0: 5735.1. Samples: 871416688. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:23,064][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 19:01:23,806][26022] Updated weights on worker 0-0, policy_version 850997 (0.00090) [2022-07-10 19:01:25,497][26022] Updated weights on worker 0-0, policy_version 851007 (0.00085) [2022-07-10 19:01:27,434][26022] Updated weights on worker 0-0, policy_version 851017 (0.00083) [2022-07-10 19:01:28,072][25689] Fps is (10 sec: 5512.0, 60 sec: 5550.6, 300 sec: 5523.4). Total num frames: 871444480. Throughput: 0: 5877.9. Samples: 871450368. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:28,072][25689] Avg episode reward: [(0, '-1.172')] [2022-07-10 19:01:29,262][26022] Updated weights on worker 0-0, policy_version 851027 (0.00104) [2022-07-10 19:01:31,031][26022] Updated weights on worker 0-0, policy_version 851037 (0.00085) [2022-07-10 19:01:32,911][26022] Updated weights on worker 0-0, policy_version 851047 (0.00094) [2022-07-10 19:01:33,096][25689] Fps is (10 sec: 5613.6, 60 sec: 5549.3, 300 sec: 5525.1). Total num frames: 871472128. Throughput: 0: 5021.8. Samples: 871467198. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:33,097][25689] Avg episode reward: [(0, '-0.169')] [2022-07-10 19:01:34,903][26022] Updated weights on worker 0-0, policy_version 851057 (0.00085) [2022-07-10 19:01:36,707][26022] Updated weights on worker 0-0, policy_version 851067 (0.00085) [2022-07-10 19:01:38,223][25689] Fps is (10 sec: 5447.5, 60 sec: 5510.9, 300 sec: 5520.8). Total num frames: 871499776. Throughput: 0: 5814.6. Samples: 871500476. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:38,223][25689] Avg episode reward: [(0, '-0.664')] [2022-07-10 19:01:38,519][26022] Updated weights on worker 0-0, policy_version 851077 (0.00087) [2022-07-10 19:01:40,307][26022] Updated weights on worker 0-0, policy_version 851087 (0.00085) [2022-07-10 19:01:42,066][26022] Updated weights on worker 0-0, policy_version 851097 (0.00095) [2022-07-10 19:01:43,235][25689] Fps is (10 sec: 5555.0, 60 sec: 5527.6, 300 sec: 5524.6). Total num frames: 871528448. Throughput: 0: 5808.8. Samples: 871533860. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:43,235][25689] Avg episode reward: [(0, '-0.935')] [2022-07-10 19:01:43,982][26022] Updated weights on worker 0-0, policy_version 851107 (0.00096) [2022-07-10 19:01:45,883][26022] Updated weights on worker 0-0, policy_version 851117 (0.00085) [2022-07-10 19:01:47,949][26022] Updated weights on worker 0-0, policy_version 851127 (0.00098) [2022-07-10 19:01:48,239][25689] Fps is (10 sec: 5622.9, 60 sec: 5512.4, 300 sec: 5521.1). Total num frames: 871556096. Throughput: 0: 4962.5. Samples: 871550446. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:48,239][25689] Avg episode reward: [(0, '-0.786')] [2022-07-10 19:01:49,559][26022] Updated weights on worker 0-0, policy_version 851137 (0.00086) [2022-07-10 19:01:51,351][26022] Updated weights on worker 0-0, policy_version 851147 (0.00090) [2022-07-10 19:01:53,238][26022] Updated weights on worker 0-0, policy_version 851157 (0.00084) [2022-07-10 19:01:53,315][25689] Fps is (10 sec: 5587.0, 60 sec: 5525.3, 300 sec: 5520.9). Total num frames: 871584768. Throughput: 0: 5760.5. Samples: 871583670. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:53,316][25689] Avg episode reward: [(0, '-2.158')] [2022-07-10 19:01:55,101][26022] Updated weights on worker 0-0, policy_version 851167 (0.00093) [2022-07-10 19:01:56,855][26022] Updated weights on worker 0-0, policy_version 851177 (0.00097) [2022-07-10 19:01:58,422][25689] Fps is (10 sec: 5530.6, 60 sec: 5537.0, 300 sec: 5520.2). Total num frames: 871612416. Throughput: 0: 5778.1. Samples: 871617192. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:01:58,423][25689] Avg episode reward: [(0, '-2.305')] [2022-07-10 19:01:58,965][26022] Updated weights on worker 0-0, policy_version 851187 (0.00093) [2022-07-10 19:02:00,389][26022] Updated weights on worker 0-0, policy_version 851197 (0.00091) [2022-07-10 19:02:03,063][26022] Updated weights on worker 0-0, policy_version 851207 (0.00817) [2022-07-10 19:02:03,445][25689] Fps is (10 sec: 5256.6, 60 sec: 5518.5, 300 sec: 5523.8). Total num frames: 871638016. Throughput: 0: 4951.9. Samples: 871633940. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:03,446][25689] Avg episode reward: [(0, '-2.444')] [2022-07-10 19:02:04,719][26022] Updated weights on worker 0-0, policy_version 851217 (0.00089) [2022-07-10 19:02:06,625][26022] Updated weights on worker 0-0, policy_version 851227 (0.00088) [2022-07-10 19:02:08,215][26022] Updated weights on worker 0-0, policy_version 851237 (0.00083) [2022-07-10 19:02:08,490][25689] Fps is (10 sec: 5492.4, 60 sec: 5567.1, 300 sec: 5526.6). Total num frames: 871667712. Throughput: 0: 5655.6. Samples: 871664978. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:08,492][25689] Avg episode reward: [(0, '-1.686')] [2022-07-10 19:02:10,604][26022] Updated weights on worker 0-0, policy_version 851247 (0.00091) [2022-07-10 19:02:11,949][26022] Updated weights on worker 0-0, policy_version 851257 (0.00087) [2022-07-10 19:02:13,502][25689] Fps is (10 sec: 5498.4, 60 sec: 5498.8, 300 sec: 5520.3). Total num frames: 871693312. Throughput: 0: 5687.3. Samples: 871698476. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:13,503][25689] Avg episode reward: [(0, '-1.543')] [2022-07-10 19:02:14,181][26022] Updated weights on worker 0-0, policy_version 851267 (0.00085) [2022-07-10 19:02:15,595][26022] Updated weights on worker 0-0, policy_version 851277 (0.00086) [2022-07-10 19:02:17,898][26022] Updated weights on worker 0-0, policy_version 851287 (0.00085) [2022-07-10 19:02:18,537][25689] Fps is (10 sec: 5503.7, 60 sec: 5518.0, 300 sec: 5523.2). Total num frames: 871723008. Throughput: 0: 5711.1. Samples: 871732068. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:18,537][25689] Avg episode reward: [(0, '-1.315')] [2022-07-10 19:02:19,161][26022] Updated weights on worker 0-0, policy_version 851297 (0.00087) [2022-07-10 19:02:21,361][26022] Updated weights on worker 0-0, policy_version 851307 (0.00088) [2022-07-10 19:02:23,063][26022] Updated weights on worker 0-0, policy_version 851317 (0.00086) [2022-07-10 19:02:23,559][25689] Fps is (10 sec: 5803.8, 60 sec: 5552.0, 300 sec: 5522.8). Total num frames: 871751680. Throughput: 0: 5707.8. Samples: 871748744. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:23,559][25689] Avg episode reward: [(0, '-0.386')] [2022-07-10 19:02:25,094][26022] Updated weights on worker 0-0, policy_version 851327 (0.00097) [2022-07-10 19:02:26,730][26022] Updated weights on worker 0-0, policy_version 851337 (0.00095) [2022-07-10 19:02:28,634][25689] Fps is (10 sec: 5375.0, 60 sec: 5495.2, 300 sec: 5521.9). Total num frames: 871777280. Throughput: 0: 5822.9. Samples: 871782274. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:28,635][25689] Avg episode reward: [(0, '-0.459')] [2022-07-10 19:02:28,737][26022] Updated weights on worker 0-0, policy_version 851347 (0.00352) [2022-07-10 19:02:30,339][26022] Updated weights on worker 0-0, policy_version 851357 (0.00091) [2022-07-10 19:02:32,415][26022] Updated weights on worker 0-0, policy_version 851367 (0.00086) [2022-07-10 19:02:33,655][25689] Fps is (10 sec: 5375.5, 60 sec: 5512.4, 300 sec: 5522.5). Total num frames: 871805952. Throughput: 0: 5829.6. Samples: 871815960. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:33,655][25689] Avg episode reward: [(0, '-1.619')] [2022-07-10 19:02:34,092][26022] Updated weights on worker 0-0, policy_version 851377 (0.00086) [2022-07-10 19:02:36,111][26022] Updated weights on worker 0-0, policy_version 851387 (0.00099) [2022-07-10 19:02:38,007][26022] Updated weights on worker 0-0, policy_version 851397 (0.00091) [2022-07-10 19:02:38,697][25689] Fps is (10 sec: 5698.1, 60 sec: 5537.0, 300 sec: 5522.0). Total num frames: 871834624. Throughput: 0: 4980.0. Samples: 871832468. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:38,698][25689] Avg episode reward: [(0, '-1.457')] [2022-07-10 19:02:39,744][26022] Updated weights on worker 0-0, policy_version 851407 (0.00092) [2022-07-10 19:02:41,563][26022] Updated weights on worker 0-0, policy_version 851417 (0.00089) [2022-07-10 19:02:43,359][26022] Updated weights on worker 0-0, policy_version 851427 (0.00085) [2022-07-10 19:02:43,711][25689] Fps is (10 sec: 5600.6, 60 sec: 5519.9, 300 sec: 5525.7). Total num frames: 871862272. Throughput: 0: 5796.8. Samples: 871865562. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:43,711][25689] Avg episode reward: [(0, '-1.497')] [2022-07-10 19:02:45,375][26022] Updated weights on worker 0-0, policy_version 851437 (0.00093) [2022-07-10 19:02:47,154][26022] Updated weights on worker 0-0, policy_version 851447 (0.00099) [2022-07-10 19:02:48,761][25689] Fps is (10 sec: 5494.4, 60 sec: 5515.7, 300 sec: 5521.6). Total num frames: 871889920. Throughput: 0: 5781.4. Samples: 871898640. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:48,763][25689] Avg episode reward: [(0, '-1.015')] [2022-07-10 19:02:48,924][26022] Updated weights on worker 0-0, policy_version 851457 (0.00092) [2022-07-10 19:02:51,051][26022] Updated weights on worker 0-0, policy_version 851467 (0.00082) [2022-07-10 19:02:52,644][26022] Updated weights on worker 0-0, policy_version 851477 (0.00089) [2022-07-10 19:02:53,775][25689] Fps is (10 sec: 5494.5, 60 sec: 5504.5, 300 sec: 5520.4). Total num frames: 871917568. Throughput: 0: 4937.8. Samples: 871915308. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:53,775][25689] Avg episode reward: [(0, '-0.644')] [2022-07-10 19:02:54,724][26022] Updated weights on worker 0-0, policy_version 851487 (0.00087) [2022-07-10 19:02:56,319][26022] Updated weights on worker 0-0, policy_version 851497 (0.00084) [2022-07-10 19:02:56,741][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:02:56,749][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000851499_871934976.pth [2022-07-10 19:02:56,749][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000849556_869945344.pth [2022-07-10 19:02:58,349][26022] Updated weights on worker 0-0, policy_version 851507 (0.00081) [2022-07-10 19:02:58,850][25689] Fps is (10 sec: 5481.2, 60 sec: 5507.4, 300 sec: 5522.9). Total num frames: 871945216. Throughput: 0: 5758.8. Samples: 871948522. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:02:58,852][25689] Avg episode reward: [(0, '-0.797')] [2022-07-10 19:02:59,890][26022] Updated weights on worker 0-0, policy_version 851517 (0.00093) [2022-07-10 19:03:02,540][26022] Updated weights on worker 0-0, policy_version 851527 (0.00086) [2022-07-10 19:03:03,864][25689] Fps is (10 sec: 5378.9, 60 sec: 5525.1, 300 sec: 5515.9). Total num frames: 871971840. Throughput: 0: 5667.7. Samples: 871979786. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:03,867][25689] Avg episode reward: [(0, '-0.627')] [2022-07-10 19:03:04,101][26022] Updated weights on worker 0-0, policy_version 851537 (0.00088) [2022-07-10 19:03:06,228][26022] Updated weights on worker 0-0, policy_version 851547 (0.00096) [2022-07-10 19:03:07,866][26022] Updated weights on worker 0-0, policy_version 851557 (0.00096) [2022-07-10 19:03:08,966][25689] Fps is (10 sec: 5263.5, 60 sec: 5469.1, 300 sec: 5518.0). Total num frames: 871998464. Throughput: 0: 4833.9. Samples: 871996308. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:08,968][25689] Avg episode reward: [(0, '-0.462')] [2022-07-10 19:03:09,926][26022] Updated weights on worker 0-0, policy_version 851567 (0.00093) [2022-07-10 19:03:11,543][26022] Updated weights on worker 0-0, policy_version 851577 (0.00094) [2022-07-10 19:03:13,552][26022] Updated weights on worker 0-0, policy_version 851587 (0.00097) [2022-07-10 19:03:14,061][25689] Fps is (10 sec: 5422.9, 60 sec: 5512.3, 300 sec: 5513.8). Total num frames: 872027136. Throughput: 0: 5639.1. Samples: 872029706. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:14,062][25689] Avg episode reward: [(0, '-0.446')] [2022-07-10 19:03:15,189][26022] Updated weights on worker 0-0, policy_version 851597 (0.00092) [2022-07-10 19:03:17,289][26022] Updated weights on worker 0-0, policy_version 851607 (0.00089) [2022-07-10 19:03:18,807][26022] Updated weights on worker 0-0, policy_version 851617 (0.00093) [2022-07-10 19:03:19,116][25689] Fps is (10 sec: 5751.0, 60 sec: 5510.6, 300 sec: 5519.8). Total num frames: 872056832. Throughput: 0: 5655.6. Samples: 872063138. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:19,118][25689] Avg episode reward: [(0, '-0.451')] [2022-07-10 19:03:21,069][26022] Updated weights on worker 0-0, policy_version 851627 (0.00086) [2022-07-10 19:03:22,627][26022] Updated weights on worker 0-0, policy_version 851637 (0.00095) [2022-07-10 19:03:24,131][25689] Fps is (10 sec: 5491.0, 60 sec: 5460.4, 300 sec: 5516.7). Total num frames: 872082432. Throughput: 0: 4934.0. Samples: 872079784. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:24,133][25689] Avg episode reward: [(0, '-0.318')] [2022-07-10 19:03:24,774][26022] Updated weights on worker 0-0, policy_version 851647 (0.00085) [2022-07-10 19:03:26,254][26022] Updated weights on worker 0-0, policy_version 851657 (0.00083) [2022-07-10 19:03:28,406][26022] Updated weights on worker 0-0, policy_version 851667 (0.00091) [2022-07-10 19:03:29,162][25689] Fps is (10 sec: 5504.0, 60 sec: 5532.1, 300 sec: 5519.8). Total num frames: 872112128. Throughput: 0: 5784.6. Samples: 872113132. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:29,163][25689] Avg episode reward: [(0, '-0.185')] [2022-07-10 19:03:29,836][26022] Updated weights on worker 0-0, policy_version 851677 (0.00089) [2022-07-10 19:03:32,102][26022] Updated weights on worker 0-0, policy_version 851687 (0.00089) [2022-07-10 19:03:33,535][26022] Updated weights on worker 0-0, policy_version 851697 (0.00082) [2022-07-10 19:03:34,239][25689] Fps is (10 sec: 5673.4, 60 sec: 5510.1, 300 sec: 5524.0). Total num frames: 872139776. Throughput: 0: 5797.3. Samples: 872146682. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:34,240][25689] Avg episode reward: [(0, '-0.228')] [2022-07-10 19:03:35,637][26022] Updated weights on worker 0-0, policy_version 851707 (0.00095) [2022-07-10 19:03:37,310][26022] Updated weights on worker 0-0, policy_version 851717 (0.00089) [2022-07-10 19:03:39,290][25689] Fps is (10 sec: 5560.8, 60 sec: 5509.3, 300 sec: 5520.6). Total num frames: 872168448. Throughput: 0: 4968.6. Samples: 872163378. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:39,292][25689] Avg episode reward: [(0, '0.621')] [2022-07-10 19:03:39,298][26022] Updated weights on worker 0-0, policy_version 851727 (0.00087) [2022-07-10 19:03:40,939][26022] Updated weights on worker 0-0, policy_version 851737 (0.00091) [2022-07-10 19:03:42,860][26022] Updated weights on worker 0-0, policy_version 851747 (0.00086) [2022-07-10 19:03:44,320][25689] Fps is (10 sec: 5688.2, 60 sec: 5524.7, 300 sec: 5520.2). Total num frames: 872197120. Throughput: 0: 5805.8. Samples: 872196996. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:44,321][25689] Avg episode reward: [(0, '0.493')] [2022-07-10 19:03:44,644][26022] Updated weights on worker 0-0, policy_version 851757 (0.00090) [2022-07-10 19:03:46,618][26022] Updated weights on worker 0-0, policy_version 851767 (0.00086) [2022-07-10 19:03:48,098][26022] Updated weights on worker 0-0, policy_version 851777 (0.00086) [2022-07-10 19:03:49,399][25689] Fps is (10 sec: 5571.4, 60 sec: 5522.1, 300 sec: 5522.4). Total num frames: 872224768. Throughput: 0: 5811.0. Samples: 872230728. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:49,401][25689] Avg episode reward: [(0, '0.029')] [2022-07-10 19:03:50,189][26022] Updated weights on worker 0-0, policy_version 851787 (0.00091) [2022-07-10 19:03:52,064][26022] Updated weights on worker 0-0, policy_version 851797 (0.00079) [2022-07-10 19:03:53,894][26022] Updated weights on worker 0-0, policy_version 851807 (0.00080) [2022-07-10 19:03:54,421][25689] Fps is (10 sec: 5575.7, 60 sec: 5538.2, 300 sec: 5523.2). Total num frames: 872253440. Throughput: 0: 4985.5. Samples: 872247300. Policy #0 lag: (min: 0.0, avg: 7.7, max: 17.0) [2022-07-10 19:03:54,422][25689] Avg episode reward: [(0, '0.281')] [2022-07-10 19:03:55,798][26022] Updated weights on worker 0-0, policy_version 851817 (0.00085) [2022-07-10 19:03:57,626][26022] Updated weights on worker 0-0, policy_version 851827 (0.00081) [2022-07-10 19:03:59,230][26022] Updated weights on worker 0-0, policy_version 851837 (0.00090) [2022-07-10 19:03:59,480][25689] Fps is (10 sec: 5789.7, 60 sec: 5573.4, 300 sec: 5529.3). Total num frames: 872283136. Throughput: 0: 5837.4. Samples: 872281236. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:03:59,481][25689] Avg episode reward: [(0, '0.246')] [2022-07-10 19:04:01,306][26022] Updated weights on worker 0-0, policy_version 851847 (0.00087) [2022-07-10 19:04:03,079][26022] Updated weights on worker 0-0, policy_version 851857 (0.00086) [2022-07-10 19:04:04,567][25689] Fps is (10 sec: 5349.3, 60 sec: 5533.1, 300 sec: 5524.6). Total num frames: 872307712. Throughput: 0: 5741.0. Samples: 872313232. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:04,567][25689] Avg episode reward: [(0, '0.060')] [2022-07-10 19:04:05,362][26022] Updated weights on worker 0-0, policy_version 851867 (0.00094) [2022-07-10 19:04:06,856][26022] Updated weights on worker 0-0, policy_version 851877 (0.00095) [2022-07-10 19:04:08,902][26022] Updated weights on worker 0-0, policy_version 851887 (0.00613) [2022-07-10 19:04:09,584][25689] Fps is (10 sec: 5269.9, 60 sec: 5574.6, 300 sec: 5528.1). Total num frames: 872336384. Throughput: 0: 4901.8. Samples: 872329674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:09,585][25689] Avg episode reward: [(0, '-0.133')] [2022-07-10 19:04:10,521][26022] Updated weights on worker 0-0, policy_version 851897 (0.00091) [2022-07-10 19:04:12,461][26022] Updated weights on worker 0-0, policy_version 851907 (0.00092) [2022-07-10 19:04:14,202][26022] Updated weights on worker 0-0, policy_version 851917 (0.00090) [2022-07-10 19:04:14,599][25689] Fps is (10 sec: 5614.0, 60 sec: 5565.1, 300 sec: 5526.6). Total num frames: 872364032. Throughput: 0: 5767.8. Samples: 872363682. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:14,599][25689] Avg episode reward: [(0, '-0.642')] [2022-07-10 19:04:16,307][26022] Updated weights on worker 0-0, policy_version 851927 (0.00090) [2022-07-10 19:04:17,830][26022] Updated weights on worker 0-0, policy_version 851937 (0.00102) [2022-07-10 19:04:19,671][25689] Fps is (10 sec: 5482.4, 60 sec: 5529.6, 300 sec: 5525.5). Total num frames: 872391680. Throughput: 0: 5752.0. Samples: 872397372. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:19,671][25689] Avg episode reward: [(0, '-0.740')] [2022-07-10 19:04:19,819][26022] Updated weights on worker 0-0, policy_version 851947 (0.00092) [2022-07-10 19:04:21,323][26022] Updated weights on worker 0-0, policy_version 851957 (0.00091) [2022-07-10 19:04:23,153][26022] Updated weights on worker 0-0, policy_version 851967 (0.00086) [2022-07-10 19:04:24,706][25689] Fps is (10 sec: 5775.1, 60 sec: 5612.4, 300 sec: 5536.4). Total num frames: 872422400. Throughput: 0: 5019.0. Samples: 872414310. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:24,706][25689] Avg episode reward: [(0, '-1.401')] [2022-07-10 19:04:25,311][26022] Updated weights on worker 0-0, policy_version 851977 (0.00084) [2022-07-10 19:04:27,024][26022] Updated weights on worker 0-0, policy_version 851987 (0.00109) [2022-07-10 19:04:28,995][26022] Updated weights on worker 0-0, policy_version 851997 (0.00916) [2022-07-10 19:04:29,727][25689] Fps is (10 sec: 5702.5, 60 sec: 5562.6, 300 sec: 5532.6). Total num frames: 872449024. Throughput: 0: 5868.0. Samples: 872447868. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:29,727][25689] Avg episode reward: [(0, '-1.499')] [2022-07-10 19:04:30,689][26022] Updated weights on worker 0-0, policy_version 852007 (0.00086) [2022-07-10 19:04:32,473][26022] Updated weights on worker 0-0, policy_version 852017 (0.00089) [2022-07-10 19:04:34,516][26022] Updated weights on worker 0-0, policy_version 852027 (0.00079) [2022-07-10 19:04:34,743][25689] Fps is (10 sec: 5407.3, 60 sec: 5568.2, 300 sec: 5533.9). Total num frames: 872476672. Throughput: 0: 5839.7. Samples: 872481316. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:34,743][25689] Avg episode reward: [(0, '-1.682')] [2022-07-10 19:04:36,094][26022] Updated weights on worker 0-0, policy_version 852037 (0.00087) [2022-07-10 19:04:38,237][26022] Updated weights on worker 0-0, policy_version 852047 (0.00088) [2022-07-10 19:04:39,908][25689] Fps is (10 sec: 5531.8, 60 sec: 5557.7, 300 sec: 5531.7). Total num frames: 872505344. Throughput: 0: 5806.0. Samples: 872514868. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:39,912][25689] Avg episode reward: [(0, '-0.788')] [2022-07-10 19:04:39,943][26022] Updated weights on worker 0-0, policy_version 852057 (0.00084) [2022-07-10 19:04:41,817][26022] Updated weights on worker 0-0, policy_version 852067 (0.00087) [2022-07-10 19:04:43,429][26022] Updated weights on worker 0-0, policy_version 852077 (0.00084) [2022-07-10 19:04:44,957][25689] Fps is (10 sec: 5513.9, 60 sec: 5539.1, 300 sec: 5524.8). Total num frames: 872532992. Throughput: 0: 5796.0. Samples: 872531684. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:44,957][25689] Avg episode reward: [(0, '-0.901')] [2022-07-10 19:04:45,398][26022] Updated weights on worker 0-0, policy_version 852087 (0.00080) [2022-07-10 19:04:47,170][26022] Updated weights on worker 0-0, policy_version 852097 (0.00099) [2022-07-10 19:04:49,095][26022] Updated weights on worker 0-0, policy_version 852107 (0.00092) [2022-07-10 19:04:49,978][25689] Fps is (10 sec: 5694.2, 60 sec: 5578.1, 300 sec: 5541.9). Total num frames: 872562688. Throughput: 0: 5799.8. Samples: 872565324. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:49,979][25689] Avg episode reward: [(0, '-0.773')] [2022-07-10 19:04:50,820][26022] Updated weights on worker 0-0, policy_version 852117 (0.00087) [2022-07-10 19:04:52,704][26022] Updated weights on worker 0-0, policy_version 852127 (0.00084) [2022-07-10 19:04:54,381][26022] Updated weights on worker 0-0, policy_version 852137 (0.00092) [2022-07-10 19:04:55,009][25689] Fps is (10 sec: 5602.5, 60 sec: 5543.5, 300 sec: 5531.9). Total num frames: 872589312. Throughput: 0: 5808.4. Samples: 872599034. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:04:55,010][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 19:04:56,510][26022] Updated weights on worker 0-0, policy_version 852147 (0.00090) [2022-07-10 19:04:56,902][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:04:56,914][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000852150_872601600.pth [2022-07-10 19:04:56,914][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000850202_870606848.pth [2022-07-10 19:04:58,166][26022] Updated weights on worker 0-0, policy_version 852157 (0.00088) [2022-07-10 19:05:00,119][25689] Fps is (10 sec: 5453.1, 60 sec: 5522.0, 300 sec: 5540.5). Total num frames: 872617984. Throughput: 0: 5001.8. Samples: 872615960. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:00,119][25689] Avg episode reward: [(0, '-0.662')] [2022-07-10 19:05:00,133][26022] Updated weights on worker 0-0, policy_version 852167 (0.00093) [2022-07-10 19:05:02,120][26022] Updated weights on worker 0-0, policy_version 852177 (0.00100) [2022-07-10 19:05:04,049][26022] Updated weights on worker 0-0, policy_version 852187 (0.00087) [2022-07-10 19:05:05,135][25689] Fps is (10 sec: 5561.9, 60 sec: 5579.1, 300 sec: 5540.3). Total num frames: 872645632. Throughput: 0: 5746.3. Samples: 872647636. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:05,136][25689] Avg episode reward: [(0, '-0.410')] [2022-07-10 19:05:05,751][26022] Updated weights on worker 0-0, policy_version 852197 (0.00087) [2022-07-10 19:05:07,595][26022] Updated weights on worker 0-0, policy_version 852207 (0.00088) [2022-07-10 19:05:09,583][26022] Updated weights on worker 0-0, policy_version 852217 (0.00087) [2022-07-10 19:05:10,167][25689] Fps is (10 sec: 5401.2, 60 sec: 5544.1, 300 sec: 5533.3). Total num frames: 872672256. Throughput: 0: 5725.2. Samples: 872680906. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:10,167][25689] Avg episode reward: [(0, '-2.047')] [2022-07-10 19:05:11,292][26022] Updated weights on worker 0-0, policy_version 852227 (0.00091) [2022-07-10 19:05:13,215][26022] Updated weights on worker 0-0, policy_version 852237 (0.00088) [2022-07-10 19:05:15,003][26022] Updated weights on worker 0-0, policy_version 852247 (0.00082) [2022-07-10 19:05:15,249][25689] Fps is (10 sec: 5568.9, 60 sec: 5571.7, 300 sec: 5540.9). Total num frames: 872701952. Throughput: 0: 4875.2. Samples: 872697704. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:15,250][25689] Avg episode reward: [(0, '-2.317')] [2022-07-10 19:05:16,799][26022] Updated weights on worker 0-0, policy_version 852257 (0.00092) [2022-07-10 19:05:18,704][26022] Updated weights on worker 0-0, policy_version 852267 (0.00085) [2022-07-10 19:05:20,300][25689] Fps is (10 sec: 5760.0, 60 sec: 5590.4, 300 sec: 5536.8). Total num frames: 872730624. Throughput: 0: 5722.4. Samples: 872731446. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:20,301][25689] Avg episode reward: [(0, '-2.910')] [2022-07-10 19:05:20,381][26022] Updated weights on worker 0-0, policy_version 852277 (0.00085) [2022-07-10 19:05:22,326][26022] Updated weights on worker 0-0, policy_version 852287 (0.00053) [2022-07-10 19:05:24,043][26022] Updated weights on worker 0-0, policy_version 852297 (0.00087) [2022-07-10 19:05:25,367][25689] Fps is (10 sec: 5566.1, 60 sec: 5536.9, 300 sec: 5539.4). Total num frames: 872758272. Throughput: 0: 5822.1. Samples: 872765426. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:25,367][25689] Avg episode reward: [(0, '-1.761')] [2022-07-10 19:05:26,216][26022] Updated weights on worker 0-0, policy_version 852307 (0.00099) [2022-07-10 19:05:27,837][26022] Updated weights on worker 0-0, policy_version 852317 (0.00085) [2022-07-10 19:05:29,695][26022] Updated weights on worker 0-0, policy_version 852327 (0.00092) [2022-07-10 19:05:30,451][25689] Fps is (10 sec: 5548.2, 60 sec: 5564.8, 300 sec: 5541.5). Total num frames: 872786944. Throughput: 0: 4989.0. Samples: 872782104. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:30,452][25689] Avg episode reward: [(0, '-1.802')] [2022-07-10 19:05:31,419][26022] Updated weights on worker 0-0, policy_version 852337 (0.00091) [2022-07-10 19:05:33,423][26022] Updated weights on worker 0-0, policy_version 852347 (0.00087) [2022-07-10 19:05:35,096][26022] Updated weights on worker 0-0, policy_version 852357 (0.00093) [2022-07-10 19:05:35,457][25689] Fps is (10 sec: 5581.9, 60 sec: 5565.8, 300 sec: 5535.9). Total num frames: 872814592. Throughput: 0: 5842.3. Samples: 872815766. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:35,457][25689] Avg episode reward: [(0, '-1.257')] [2022-07-10 19:05:36,930][26022] Updated weights on worker 0-0, policy_version 852367 (0.00086) [2022-07-10 19:05:38,786][26022] Updated weights on worker 0-0, policy_version 852377 (0.00088) [2022-07-10 19:05:40,548][25689] Fps is (10 sec: 5578.2, 60 sec: 5572.6, 300 sec: 5537.8). Total num frames: 872843264. Throughput: 0: 5824.6. Samples: 872849380. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:40,548][25689] Avg episode reward: [(0, '-0.581')] [2022-07-10 19:05:40,644][26022] Updated weights on worker 0-0, policy_version 852387 (0.00091) [2022-07-10 19:05:42,455][26022] Updated weights on worker 0-0, policy_version 852397 (0.00085) [2022-07-10 19:05:44,316][26022] Updated weights on worker 0-0, policy_version 852407 (0.00078) [2022-07-10 19:05:45,591][25689] Fps is (10 sec: 5658.6, 60 sec: 5590.0, 300 sec: 5537.5). Total num frames: 872871936. Throughput: 0: 4980.5. Samples: 872866150. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:45,591][25689] Avg episode reward: [(0, '0.064')] [2022-07-10 19:05:46,016][26022] Updated weights on worker 0-0, policy_version 852417 (0.00098) [2022-07-10 19:05:48,027][26022] Updated weights on worker 0-0, policy_version 852427 (0.00086) [2022-07-10 19:05:49,747][26022] Updated weights on worker 0-0, policy_version 852437 (0.00082) [2022-07-10 19:05:50,594][25689] Fps is (10 sec: 5504.0, 60 sec: 5541.0, 300 sec: 5534.6). Total num frames: 872898560. Throughput: 0: 5853.0. Samples: 872900000. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:50,595][25689] Avg episode reward: [(0, '0.737')] [2022-07-10 19:05:51,611][26022] Updated weights on worker 0-0, policy_version 852447 (0.00093) [2022-07-10 19:05:53,423][26022] Updated weights on worker 0-0, policy_version 852457 (0.00090) [2022-07-10 19:05:55,211][26022] Updated weights on worker 0-0, policy_version 852467 (0.00089) [2022-07-10 19:05:55,637][25689] Fps is (10 sec: 5605.9, 60 sec: 5590.6, 300 sec: 5545.0). Total num frames: 872928256. Throughput: 0: 5841.8. Samples: 872933656. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:05:55,638][25689] Avg episode reward: [(0, '0.768')] [2022-07-10 19:05:57,235][26022] Updated weights on worker 0-0, policy_version 852477 (0.00094) [2022-07-10 19:05:58,759][26022] Updated weights on worker 0-0, policy_version 852487 (0.00084) [2022-07-10 19:06:00,732][25689] Fps is (10 sec: 5656.7, 60 sec: 5575.1, 300 sec: 5546.8). Total num frames: 872955904. Throughput: 0: 5013.7. Samples: 872950576. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:00,732][25689] Avg episode reward: [(0, '0.841')] [2022-07-10 19:06:00,919][26022] Updated weights on worker 0-0, policy_version 852497 (0.00093) [2022-07-10 19:06:02,679][26022] Updated weights on worker 0-0, policy_version 852507 (0.00081) [2022-07-10 19:06:04,783][26022] Updated weights on worker 0-0, policy_version 852517 (0.00085) [2022-07-10 19:06:05,759][25689] Fps is (10 sec: 5362.0, 60 sec: 5557.2, 300 sec: 5546.7). Total num frames: 872982528. Throughput: 0: 5739.8. Samples: 872981910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:05,759][25689] Avg episode reward: [(0, '1.268')] [2022-07-10 19:06:06,765][26022] Updated weights on worker 0-0, policy_version 852527 (0.00087) [2022-07-10 19:06:08,335][26022] Updated weights on worker 0-0, policy_version 852537 (0.00089) [2022-07-10 19:06:10,505][26022] Updated weights on worker 0-0, policy_version 852547 (0.00087) [2022-07-10 19:06:10,790][25689] Fps is (10 sec: 5293.7, 60 sec: 5557.2, 300 sec: 5535.9). Total num frames: 873009152. Throughput: 0: 5697.7. Samples: 873015072. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:10,791][25689] Avg episode reward: [(0, '0.517')] [2022-07-10 19:06:12,003][26022] Updated weights on worker 0-0, policy_version 852557 (0.00091) [2022-07-10 19:06:14,228][26022] Updated weights on worker 0-0, policy_version 852567 (0.00087) [2022-07-10 19:06:15,770][26022] Updated weights on worker 0-0, policy_version 852577 (0.00090) [2022-07-10 19:06:15,839][25689] Fps is (10 sec: 5587.3, 60 sec: 5560.3, 300 sec: 5539.5). Total num frames: 873038848. Throughput: 0: 4873.2. Samples: 873032102. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:15,840][25689] Avg episode reward: [(0, '0.532')] [2022-07-10 19:06:17,663][26022] Updated weights on worker 0-0, policy_version 852587 (0.00092) [2022-07-10 19:06:19,354][26022] Updated weights on worker 0-0, policy_version 852597 (0.00091) [2022-07-10 19:06:20,954][25689] Fps is (10 sec: 5641.9, 60 sec: 5537.5, 300 sec: 5541.2). Total num frames: 873066496. Throughput: 0: 5694.8. Samples: 873065740. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:20,955][25689] Avg episode reward: [(0, '0.027')] [2022-07-10 19:06:21,408][26022] Updated weights on worker 0-0, policy_version 852607 (0.00087) [2022-07-10 19:06:23,015][26022] Updated weights on worker 0-0, policy_version 852617 (0.00087) [2022-07-10 19:06:25,133][26022] Updated weights on worker 0-0, policy_version 852627 (0.00087) [2022-07-10 19:06:25,963][25689] Fps is (10 sec: 5562.6, 60 sec: 5559.7, 300 sec: 5541.2). Total num frames: 873095168. Throughput: 0: 5838.9. Samples: 873099884. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:25,964][25689] Avg episode reward: [(0, '-0.034')] [2022-07-10 19:06:26,544][26022] Updated weights on worker 0-0, policy_version 852637 (0.00090) [2022-07-10 19:06:28,884][26022] Updated weights on worker 0-0, policy_version 852647 (0.00091) [2022-07-10 19:06:30,150][26022] Updated weights on worker 0-0, policy_version 852657 (0.00095) [2022-07-10 19:06:30,975][25689] Fps is (10 sec: 5722.3, 60 sec: 5566.3, 300 sec: 5544.9). Total num frames: 873123840. Throughput: 0: 5029.0. Samples: 873116584. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:30,976][25689] Avg episode reward: [(0, '0.060')] [2022-07-10 19:06:32,484][26022] Updated weights on worker 0-0, policy_version 852667 (0.00088) [2022-07-10 19:06:33,983][26022] Updated weights on worker 0-0, policy_version 852677 (0.00090) [2022-07-10 19:06:36,005][25689] Fps is (10 sec: 5506.8, 60 sec: 5547.2, 300 sec: 5543.3). Total num frames: 873150464. Throughput: 0: 5837.0. Samples: 873149812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:36,005][25689] Avg episode reward: [(0, '0.353')] [2022-07-10 19:06:36,107][26022] Updated weights on worker 0-0, policy_version 852687 (0.00086) [2022-07-10 19:06:37,928][26022] Updated weights on worker 0-0, policy_version 852697 (0.00091) [2022-07-10 19:06:39,775][26022] Updated weights on worker 0-0, policy_version 852707 (0.00095) [2022-07-10 19:06:41,056][25689] Fps is (10 sec: 5485.7, 60 sec: 5550.9, 300 sec: 5542.6). Total num frames: 873179136. Throughput: 0: 5835.5. Samples: 873183040. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:41,056][25689] Avg episode reward: [(0, '0.096')] [2022-07-10 19:06:41,529][26022] Updated weights on worker 0-0, policy_version 852717 (0.00081) [2022-07-10 19:06:43,230][26022] Updated weights on worker 0-0, policy_version 852727 (0.00088) [2022-07-10 19:06:45,125][26022] Updated weights on worker 0-0, policy_version 852737 (0.00087) [2022-07-10 19:06:46,086][25689] Fps is (10 sec: 5586.6, 60 sec: 5535.1, 300 sec: 5542.1). Total num frames: 873206784. Throughput: 0: 5819.5. Samples: 873216988. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:46,087][25689] Avg episode reward: [(0, '0.806')] [2022-07-10 19:06:46,939][26022] Updated weights on worker 0-0, policy_version 852747 (0.00081) [2022-07-10 19:06:48,790][26022] Updated weights on worker 0-0, policy_version 852757 (0.00085) [2022-07-10 19:06:50,747][26022] Updated weights on worker 0-0, policy_version 852767 (0.00085) [2022-07-10 19:06:51,106][25689] Fps is (10 sec: 5603.6, 60 sec: 5567.4, 300 sec: 5543.1). Total num frames: 873235456. Throughput: 0: 5824.6. Samples: 873233838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:51,107][25689] Avg episode reward: [(0, '0.658')] [2022-07-10 19:06:52,423][26022] Updated weights on worker 0-0, policy_version 852777 (0.00090) [2022-07-10 19:06:54,267][26022] Updated weights on worker 0-0, policy_version 852787 (0.00085) [2022-07-10 19:06:56,053][26022] Updated weights on worker 0-0, policy_version 852797 (0.00097) [2022-07-10 19:06:56,140][25689] Fps is (10 sec: 5703.8, 60 sec: 5551.4, 300 sec: 5547.9). Total num frames: 873264128. Throughput: 0: 5846.0. Samples: 873267520. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:06:56,140][25689] Avg episode reward: [(0, '0.366')] [2022-07-10 19:06:56,922][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:06:56,930][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000852801_873268224.pth [2022-07-10 19:06:56,947][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000850851_871271424.pth [2022-07-10 19:06:58,187][26022] Updated weights on worker 0-0, policy_version 852807 (0.00848) [2022-07-10 19:06:59,767][26022] Updated weights on worker 0-0, policy_version 852817 (0.00087) [2022-07-10 19:07:01,181][25689] Fps is (10 sec: 5590.4, 60 sec: 5556.3, 300 sec: 5554.5). Total num frames: 873291776. Throughput: 0: 5857.4. Samples: 873300920. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:01,181][25689] Avg episode reward: [(0, '0.128')] [2022-07-10 19:07:02,025][26022] Updated weights on worker 0-0, policy_version 852827 (0.00093) [2022-07-10 19:07:03,851][26022] Updated weights on worker 0-0, policy_version 852837 (0.00087) [2022-07-10 19:07:05,727][26022] Updated weights on worker 0-0, policy_version 852847 (0.00086) [2022-07-10 19:07:06,200][25689] Fps is (10 sec: 5292.9, 60 sec: 5540.1, 300 sec: 5541.2). Total num frames: 873317376. Throughput: 0: 4909.1. Samples: 873315728. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:06,200][25689] Avg episode reward: [(0, '0.442')] [2022-07-10 19:07:07,493][26022] Updated weights on worker 0-0, policy_version 852857 (0.00086) [2022-07-10 19:07:09,543][26022] Updated weights on worker 0-0, policy_version 852867 (0.00096) [2022-07-10 19:07:11,177][26022] Updated weights on worker 0-0, policy_version 852877 (0.00087) [2022-07-10 19:07:11,234][25689] Fps is (10 sec: 5398.5, 60 sec: 5573.7, 300 sec: 5551.1). Total num frames: 873346048. Throughput: 0: 5703.9. Samples: 873348644. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:11,234][25689] Avg episode reward: [(0, '0.557')] [2022-07-10 19:07:13,250][26022] Updated weights on worker 0-0, policy_version 852887 (0.00087) [2022-07-10 19:07:14,850][26022] Updated weights on worker 0-0, policy_version 852897 (0.00086) [2022-07-10 19:07:16,271][25689] Fps is (10 sec: 5592.4, 60 sec: 5540.9, 300 sec: 5544.2). Total num frames: 873373696. Throughput: 0: 5709.4. Samples: 873382456. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:16,271][25689] Avg episode reward: [(0, '0.149')] [2022-07-10 19:07:16,946][26022] Updated weights on worker 0-0, policy_version 852907 (0.00083) [2022-07-10 19:07:18,522][26022] Updated weights on worker 0-0, policy_version 852917 (0.00086) [2022-07-10 19:07:20,543][26022] Updated weights on worker 0-0, policy_version 852927 (0.00086) [2022-07-10 19:07:21,362][25689] Fps is (10 sec: 5459.8, 60 sec: 5543.2, 300 sec: 5539.4). Total num frames: 873401344. Throughput: 0: 4867.2. Samples: 873399146. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:21,362][25689] Avg episode reward: [(0, '0.047')] [2022-07-10 19:07:22,031][26022] Updated weights on worker 0-0, policy_version 852937 (0.00100) [2022-07-10 19:07:24,216][26022] Updated weights on worker 0-0, policy_version 852947 (0.00085) [2022-07-10 19:07:25,855][26022] Updated weights on worker 0-0, policy_version 852957 (0.00093) [2022-07-10 19:07:26,363][25689] Fps is (10 sec: 5580.7, 60 sec: 5543.9, 300 sec: 5551.1). Total num frames: 873430016. Throughput: 0: 5801.4. Samples: 873432700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:26,363][25689] Avg episode reward: [(0, '-0.043')] [2022-07-10 19:07:27,778][26022] Updated weights on worker 0-0, policy_version 852967 (0.00097) [2022-07-10 19:07:29,676][26022] Updated weights on worker 0-0, policy_version 852977 (0.00098) [2022-07-10 19:07:31,393][25689] Fps is (10 sec: 5512.1, 60 sec: 5508.3, 300 sec: 5544.1). Total num frames: 873456640. Throughput: 0: 5813.3. Samples: 873465838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-10 19:07:31,396][25689] Avg episode reward: [(0, '-0.462')] [2022-07-10 19:07:31,535][26022] Updated weights on worker 0-0, policy_version 852987 (0.00097) [2022-07-10 19:07:33,454][26022] Updated weights on worker 0-0, policy_version 852997 (0.00095) [2022-07-10 19:07:35,229][26022] Updated weights on worker 0-0, policy_version 853007 (0.00091) [2022-07-10 19:07:36,411][25689] Fps is (10 sec: 5401.3, 60 sec: 5526.4, 300 sec: 5541.1). Total num frames: 873484288. Throughput: 0: 4971.0. Samples: 873482572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:07:36,412][25689] Avg episode reward: [(0, '-0.215')] [2022-07-10 19:07:36,884][26022] Updated weights on worker 0-0, policy_version 853017 (0.00093) [2022-07-10 19:07:39,160][26022] Updated weights on worker 0-0, policy_version 853027 (0.00087) [2022-07-10 19:07:40,608][26022] Updated weights on worker 0-0, policy_version 853037 (0.00079) [2022-07-10 19:07:41,471][25689] Fps is (10 sec: 5690.3, 60 sec: 5542.4, 300 sec: 5547.1). Total num frames: 873513984. Throughput: 0: 5802.6. Samples: 873515832. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:07:41,471][25689] Avg episode reward: [(0, '0.215')] [2022-07-10 19:07:42,744][26022] Updated weights on worker 0-0, policy_version 853047 (0.00094) [2022-07-10 19:07:44,343][26022] Updated weights on worker 0-0, policy_version 853057 (0.00093) [2022-07-10 19:07:46,382][26022] Updated weights on worker 0-0, policy_version 853067 (0.00088) [2022-07-10 19:07:46,505][25689] Fps is (10 sec: 5579.6, 60 sec: 5525.2, 300 sec: 5544.0). Total num frames: 873540608. Throughput: 0: 5789.7. Samples: 873549316. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:07:46,505][25689] Avg episode reward: [(0, '0.026')] [2022-07-10 19:07:47,981][26022] Updated weights on worker 0-0, policy_version 853077 (0.00087) [2022-07-10 19:07:50,143][26022] Updated weights on worker 0-0, policy_version 853087 (0.00094) [2022-07-10 19:07:51,508][25689] Fps is (10 sec: 5509.1, 60 sec: 5526.7, 300 sec: 5547.6). Total num frames: 873569280. Throughput: 0: 4969.7. Samples: 873565802. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:07:51,509][25689] Avg episode reward: [(0, '-0.748')] [2022-07-10 19:07:51,683][26022] Updated weights on worker 0-0, policy_version 853097 (0.00088) [2022-07-10 19:07:53,902][26022] Updated weights on worker 0-0, policy_version 853107 (0.00084) [2022-07-10 19:07:55,486][26022] Updated weights on worker 0-0, policy_version 853117 (0.00089) [2022-07-10 19:07:56,529][25689] Fps is (10 sec: 5515.9, 60 sec: 5493.9, 300 sec: 5545.2). Total num frames: 873595904. Throughput: 0: 5791.1. Samples: 873599082. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:07:56,530][25689] Avg episode reward: [(0, '-0.426')] [2022-07-10 19:07:57,416][26022] Updated weights on worker 0-0, policy_version 853127 (0.00092) [2022-07-10 19:07:59,107][26022] Updated weights on worker 0-0, policy_version 853137 (0.00091) [2022-07-10 19:08:01,243][26022] Updated weights on worker 0-0, policy_version 853147 (0.00079) [2022-07-10 19:08:01,619][25689] Fps is (10 sec: 5570.3, 60 sec: 5523.4, 300 sec: 5554.1). Total num frames: 873625600. Throughput: 0: 5793.6. Samples: 873632560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:01,619][25689] Avg episode reward: [(0, '-0.391')] [2022-07-10 19:08:03,139][26022] Updated weights on worker 0-0, policy_version 853157 (0.00088) [2022-07-10 19:08:05,229][26022] Updated weights on worker 0-0, policy_version 853167 (0.00085) [2022-07-10 19:08:06,685][25689] Fps is (10 sec: 5545.5, 60 sec: 5536.0, 300 sec: 5554.8). Total num frames: 873652224. Throughput: 0: 4863.6. Samples: 873647468. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:06,686][25689] Avg episode reward: [(0, '-0.189')] [2022-07-10 19:08:06,699][26022] Updated weights on worker 0-0, policy_version 853177 (0.00086) [2022-07-10 19:08:08,780][26022] Updated weights on worker 0-0, policy_version 853187 (0.00087) [2022-07-10 19:08:10,363][26022] Updated weights on worker 0-0, policy_version 853197 (0.00097) [2022-07-10 19:08:11,708][25689] Fps is (10 sec: 5379.1, 60 sec: 5520.1, 300 sec: 5552.7). Total num frames: 873679872. Throughput: 0: 5702.7. Samples: 873680998. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:11,709][25689] Avg episode reward: [(0, '-0.139')] [2022-07-10 19:08:12,491][26022] Updated weights on worker 0-0, policy_version 853207 (0.00087) [2022-07-10 19:08:14,125][26022] Updated weights on worker 0-0, policy_version 853217 (0.00080) [2022-07-10 19:08:16,157][26022] Updated weights on worker 0-0, policy_version 853227 (0.00086) [2022-07-10 19:08:16,719][25689] Fps is (10 sec: 5511.3, 60 sec: 5522.5, 300 sec: 5546.6). Total num frames: 873707520. Throughput: 0: 5734.0. Samples: 873714848. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:16,719][25689] Avg episode reward: [(0, '0.788')] [2022-07-10 19:08:17,792][26022] Updated weights on worker 0-0, policy_version 853237 (0.00089) [2022-07-10 19:08:19,596][26022] Updated weights on worker 0-0, policy_version 853247 (0.00078) [2022-07-10 19:08:21,508][26022] Updated weights on worker 0-0, policy_version 853257 (0.00086) [2022-07-10 19:08:21,819][25689] Fps is (10 sec: 5671.6, 60 sec: 5555.5, 300 sec: 5558.8). Total num frames: 873737216. Throughput: 0: 4903.4. Samples: 873731610. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:21,819][25689] Avg episode reward: [(0, '0.926')] [2022-07-10 19:08:23,409][26022] Updated weights on worker 0-0, policy_version 853267 (0.00089) [2022-07-10 19:08:25,120][26022] Updated weights on worker 0-0, policy_version 853277 (0.00087) [2022-07-10 19:08:26,847][25689] Fps is (10 sec: 5560.9, 60 sec: 5519.2, 300 sec: 5548.5). Total num frames: 873763840. Throughput: 0: 5842.7. Samples: 873765264. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:26,847][25689] Avg episode reward: [(0, '0.879')] [2022-07-10 19:08:27,129][26022] Updated weights on worker 0-0, policy_version 853287 (0.00093) [2022-07-10 19:08:28,758][26022] Updated weights on worker 0-0, policy_version 853297 (0.00091) [2022-07-10 19:08:31,008][26022] Updated weights on worker 0-0, policy_version 853307 (0.00083) [2022-07-10 19:08:31,864][25689] Fps is (10 sec: 5505.1, 60 sec: 5554.3, 300 sec: 5553.1). Total num frames: 873792512. Throughput: 0: 5822.7. Samples: 873798356. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:31,864][25689] Avg episode reward: [(0, '0.817')] [2022-07-10 19:08:32,357][26022] Updated weights on worker 0-0, policy_version 853317 (0.00083) [2022-07-10 19:08:34,503][26022] Updated weights on worker 0-0, policy_version 853327 (0.00861) [2022-07-10 19:08:36,154][26022] Updated weights on worker 0-0, policy_version 853337 (0.00093) [2022-07-10 19:08:36,892][25689] Fps is (10 sec: 5606.8, 60 sec: 5553.4, 300 sec: 5550.1). Total num frames: 873820160. Throughput: 0: 4982.3. Samples: 873815356. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:36,896][25689] Avg episode reward: [(0, '0.590')] [2022-07-10 19:08:38,152][26022] Updated weights on worker 0-0, policy_version 853347 (0.00081) [2022-07-10 19:08:39,816][26022] Updated weights on worker 0-0, policy_version 853357 (0.00086) [2022-07-10 19:08:41,791][26022] Updated weights on worker 0-0, policy_version 853367 (0.00089) [2022-07-10 19:08:42,000][25689] Fps is (10 sec: 5455.1, 60 sec: 5515.1, 300 sec: 5545.2). Total num frames: 873847808. Throughput: 0: 5800.5. Samples: 873848670. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:42,001][25689] Avg episode reward: [(0, '0.433')] [2022-07-10 19:08:43,620][26022] Updated weights on worker 0-0, policy_version 853377 (0.00092) [2022-07-10 19:08:45,562][26022] Updated weights on worker 0-0, policy_version 853387 (0.00088) [2022-07-10 19:08:47,023][25689] Fps is (10 sec: 5559.0, 60 sec: 5549.9, 300 sec: 5549.7). Total num frames: 873876480. Throughput: 0: 5792.1. Samples: 873882128. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:47,025][25689] Avg episode reward: [(0, '-0.458')] [2022-07-10 19:08:47,254][26022] Updated weights on worker 0-0, policy_version 853397 (0.00086) [2022-07-10 19:08:49,036][26022] Updated weights on worker 0-0, policy_version 853407 (0.00088) [2022-07-10 19:08:50,980][26022] Updated weights on worker 0-0, policy_version 853417 (0.00093) [2022-07-10 19:08:52,052][25689] Fps is (10 sec: 5603.2, 60 sec: 5530.7, 300 sec: 5546.1). Total num frames: 873904128. Throughput: 0: 4980.0. Samples: 873898892. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:52,058][25689] Avg episode reward: [(0, '-0.712')] [2022-07-10 19:08:52,800][26022] Updated weights on worker 0-0, policy_version 853427 (0.00088) [2022-07-10 19:08:54,654][26022] Updated weights on worker 0-0, policy_version 853437 (0.00084) [2022-07-10 19:08:56,365][26022] Updated weights on worker 0-0, policy_version 853447 (0.00088) [2022-07-10 19:08:56,999][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:08:57,011][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000853449_873931776.pth [2022-07-10 19:08:57,011][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000851499_871934976.pth [2022-07-10 19:08:57,105][25689] Fps is (10 sec: 5484.6, 60 sec: 5544.7, 300 sec: 5539.3). Total num frames: 873931776. Throughput: 0: 5783.0. Samples: 873932250. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:08:57,107][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 19:08:58,238][26022] Updated weights on worker 0-0, policy_version 853457 (0.00084) [2022-07-10 19:09:00,386][26022] Updated weights on worker 0-0, policy_version 853467 (0.00088) [2022-07-10 19:09:02,151][25689] Fps is (10 sec: 5475.4, 60 sec: 5514.8, 300 sec: 5550.4). Total num frames: 873959424. Throughput: 0: 5799.8. Samples: 873965538. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:02,151][25689] Avg episode reward: [(0, '-0.999')] [2022-07-10 19:09:02,356][26022] Updated weights on worker 0-0, policy_version 853477 (0.00087) [2022-07-10 19:09:04,258][26022] Updated weights on worker 0-0, policy_version 853487 (0.00086) [2022-07-10 19:09:05,947][26022] Updated weights on worker 0-0, policy_version 853497 (0.00093) [2022-07-10 19:09:07,184][25689] Fps is (10 sec: 5486.4, 60 sec: 5534.8, 300 sec: 5546.7). Total num frames: 873987072. Throughput: 0: 4874.2. Samples: 873980396. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:07,184][25689] Avg episode reward: [(0, '-1.117')] [2022-07-10 19:09:07,899][26022] Updated weights on worker 0-0, policy_version 853507 (0.00093) [2022-07-10 19:09:09,631][26022] Updated weights on worker 0-0, policy_version 853517 (0.00089) [2022-07-10 19:09:11,487][26022] Updated weights on worker 0-0, policy_version 853527 (0.00098) [2022-07-10 19:09:12,199][25689] Fps is (10 sec: 5401.4, 60 sec: 5518.6, 300 sec: 5543.2). Total num frames: 874013696. Throughput: 0: 5725.4. Samples: 874014240. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:12,199][25689] Avg episode reward: [(0, '-1.124')] [2022-07-10 19:09:13,260][26022] Updated weights on worker 0-0, policy_version 853537 (0.00085) [2022-07-10 19:09:15,311][26022] Updated weights on worker 0-0, policy_version 853547 (0.00225) [2022-07-10 19:09:17,002][26022] Updated weights on worker 0-0, policy_version 853557 (0.00088) [2022-07-10 19:09:17,216][25689] Fps is (10 sec: 5614.0, 60 sec: 5551.9, 300 sec: 5551.1). Total num frames: 874043392. Throughput: 0: 5735.4. Samples: 874047594. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:17,216][25689] Avg episode reward: [(0, '-0.857')] [2022-07-10 19:09:19,103][26022] Updated weights on worker 0-0, policy_version 853567 (0.00091) [2022-07-10 19:09:20,620][26022] Updated weights on worker 0-0, policy_version 853577 (0.00090) [2022-07-10 19:09:22,323][25689] Fps is (10 sec: 5664.0, 60 sec: 5517.4, 300 sec: 5539.5). Total num frames: 874071040. Throughput: 0: 4889.2. Samples: 874064162. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:22,324][25689] Avg episode reward: [(0, '-0.902')] [2022-07-10 19:09:22,703][26022] Updated weights on worker 0-0, policy_version 853587 (0.00088) [2022-07-10 19:09:24,454][26022] Updated weights on worker 0-0, policy_version 853597 (0.00085) [2022-07-10 19:09:26,439][26022] Updated weights on worker 0-0, policy_version 853607 (0.00089) [2022-07-10 19:09:27,353][25689] Fps is (10 sec: 5454.7, 60 sec: 5534.1, 300 sec: 5542.7). Total num frames: 874098688. Throughput: 0: 5811.5. Samples: 874097610. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:27,354][25689] Avg episode reward: [(0, '-0.207')] [2022-07-10 19:09:28,070][26022] Updated weights on worker 0-0, policy_version 853617 (0.00087) [2022-07-10 19:09:29,997][26022] Updated weights on worker 0-0, policy_version 853627 (0.00094) [2022-07-10 19:09:31,679][26022] Updated weights on worker 0-0, policy_version 853637 (0.00083) [2022-07-10 19:09:32,359][25689] Fps is (10 sec: 5509.7, 60 sec: 5518.2, 300 sec: 5542.9). Total num frames: 874126336. Throughput: 0: 5793.3. Samples: 874131034. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:32,359][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 19:09:33,592][26022] Updated weights on worker 0-0, policy_version 853647 (0.00091) [2022-07-10 19:09:35,568][26022] Updated weights on worker 0-0, policy_version 853657 (0.00080) [2022-07-10 19:09:37,266][26022] Updated weights on worker 0-0, policy_version 853667 (0.00092) [2022-07-10 19:09:37,385][25689] Fps is (10 sec: 5614.4, 60 sec: 5535.3, 300 sec: 5545.5). Total num frames: 874155008. Throughput: 0: 4972.9. Samples: 874147890. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:37,385][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 19:09:39,237][26022] Updated weights on worker 0-0, policy_version 853677 (0.00090) [2022-07-10 19:09:41,003][26022] Updated weights on worker 0-0, policy_version 853687 (0.00084) [2022-07-10 19:09:42,472][25689] Fps is (10 sec: 5467.7, 60 sec: 5520.3, 300 sec: 5541.4). Total num frames: 874181632. Throughput: 0: 5812.7. Samples: 874181284. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:42,475][25689] Avg episode reward: [(0, '-0.498')] [2022-07-10 19:09:42,882][26022] Updated weights on worker 0-0, policy_version 853697 (0.00087) [2022-07-10 19:09:44,804][26022] Updated weights on worker 0-0, policy_version 853707 (0.00091) [2022-07-10 19:09:46,680][26022] Updated weights on worker 0-0, policy_version 853718 (0.00098) [2022-07-10 19:09:47,575][25689] Fps is (10 sec: 5526.9, 60 sec: 5529.9, 300 sec: 5539.9). Total num frames: 874211328. Throughput: 0: 5773.4. Samples: 874214358. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:47,575][25689] Avg episode reward: [(0, '-0.270')] [2022-07-10 19:09:48,712][26022] Updated weights on worker 0-0, policy_version 853728 (0.00096) [2022-07-10 19:09:50,431][26022] Updated weights on worker 0-0, policy_version 853738 (0.00092) [2022-07-10 19:09:52,399][26022] Updated weights on worker 0-0, policy_version 853748 (0.00093) [2022-07-10 19:09:52,624][25689] Fps is (10 sec: 5547.7, 60 sec: 5511.2, 300 sec: 5539.5). Total num frames: 874237952. Throughput: 0: 5758.5. Samples: 874247730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:52,624][25689] Avg episode reward: [(0, '-0.466')] [2022-07-10 19:09:54,183][26022] Updated weights on worker 0-0, policy_version 853758 (0.00096) [2022-07-10 19:09:56,027][26022] Updated weights on worker 0-0, policy_version 853768 (0.00093) [2022-07-10 19:09:57,630][25689] Fps is (10 sec: 5499.2, 60 sec: 5532.4, 300 sec: 5541.4). Total num frames: 874266624. Throughput: 0: 5748.2. Samples: 874264266. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:09:57,632][25689] Avg episode reward: [(0, '-0.573')] [2022-07-10 19:09:57,983][26022] Updated weights on worker 0-0, policy_version 853778 (0.00088) [2022-07-10 19:09:59,832][26022] Updated weights on worker 0-0, policy_version 853788 (0.00086) [2022-07-10 19:10:01,975][26022] Updated weights on worker 0-0, policy_version 853798 (0.00087) [2022-07-10 19:10:02,725][25689] Fps is (10 sec: 5474.2, 60 sec: 5511.0, 300 sec: 5536.5). Total num frames: 874293248. Throughput: 0: 5722.8. Samples: 874297188. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:02,726][25689] Avg episode reward: [(0, '-0.620')] [2022-07-10 19:10:03,845][26022] Updated weights on worker 0-0, policy_version 853808 (0.00094) [2022-07-10 19:10:05,812][26022] Updated weights on worker 0-0, policy_version 853818 (0.00090) [2022-07-10 19:10:07,589][26022] Updated weights on worker 0-0, policy_version 853828 (0.00102) [2022-07-10 19:10:07,789][25689] Fps is (10 sec: 5342.2, 60 sec: 5508.2, 300 sec: 5539.4). Total num frames: 874320896. Throughput: 0: 5654.8. Samples: 874328666. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:07,791][25689] Avg episode reward: [(0, '-0.199')] [2022-07-10 19:10:09,524][26022] Updated weights on worker 0-0, policy_version 853838 (0.00089) [2022-07-10 19:10:11,054][26022] Updated weights on worker 0-0, policy_version 853848 (0.00089) [2022-07-10 19:10:12,832][25689] Fps is (10 sec: 5369.9, 60 sec: 5505.6, 300 sec: 5529.8). Total num frames: 874347520. Throughput: 0: 4835.5. Samples: 874345446. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:12,834][25689] Avg episode reward: [(0, '-0.641')] [2022-07-10 19:10:13,135][26022] Updated weights on worker 0-0, policy_version 853858 (0.00090) [2022-07-10 19:10:14,788][26022] Updated weights on worker 0-0, policy_version 853868 (0.00091) [2022-07-10 19:10:16,752][26022] Updated weights on worker 0-0, policy_version 853878 (0.00086) [2022-07-10 19:10:17,863][25689] Fps is (10 sec: 5590.9, 60 sec: 5504.4, 300 sec: 5533.6). Total num frames: 874377216. Throughput: 0: 5661.9. Samples: 874378820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:17,863][25689] Avg episode reward: [(0, '0.496')] [2022-07-10 19:10:18,504][26022] Updated weights on worker 0-0, policy_version 853888 (0.00089) [2022-07-10 19:10:20,347][26022] Updated weights on worker 0-0, policy_version 853898 (0.00085) [2022-07-10 19:10:22,259][26022] Updated weights on worker 0-0, policy_version 853908 (0.00088) [2022-07-10 19:10:22,943][25689] Fps is (10 sec: 5671.5, 60 sec: 5506.8, 300 sec: 5533.4). Total num frames: 874404864. Throughput: 0: 5701.0. Samples: 874412450. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:22,943][25689] Avg episode reward: [(0, '0.265')] [2022-07-10 19:10:24,102][26022] Updated weights on worker 0-0, policy_version 853918 (0.00089) [2022-07-10 19:10:25,751][26022] Updated weights on worker 0-0, policy_version 853928 (0.00091) [2022-07-10 19:10:27,904][26022] Updated weights on worker 0-0, policy_version 853938 (0.00090) [2022-07-10 19:10:27,997][25689] Fps is (10 sec: 5557.3, 60 sec: 5521.5, 300 sec: 5533.9). Total num frames: 874433536. Throughput: 0: 4977.0. Samples: 874429240. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:27,998][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 19:10:29,620][26022] Updated weights on worker 0-0, policy_version 853948 (0.00085) [2022-07-10 19:10:31,545][26022] Updated weights on worker 0-0, policy_version 853958 (0.00567) [2022-07-10 19:10:33,031][25689] Fps is (10 sec: 5582.6, 60 sec: 5518.9, 300 sec: 5533.4). Total num frames: 874461184. Throughput: 0: 5792.4. Samples: 874462450. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:33,032][25689] Avg episode reward: [(0, '0.353')] [2022-07-10 19:10:33,271][26022] Updated weights on worker 0-0, policy_version 853968 (0.00095) [2022-07-10 19:10:35,146][26022] Updated weights on worker 0-0, policy_version 853978 (0.00082) [2022-07-10 19:10:37,176][26022] Updated weights on worker 0-0, policy_version 853988 (0.00088) [2022-07-10 19:10:38,057][25689] Fps is (10 sec: 5496.5, 60 sec: 5502.0, 300 sec: 5531.1). Total num frames: 874488832. Throughput: 0: 5804.8. Samples: 874496046. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:38,058][25689] Avg episode reward: [(0, '0.527')] [2022-07-10 19:10:38,748][26022] Updated weights on worker 0-0, policy_version 853998 (0.00089) [2022-07-10 19:10:40,711][26022] Updated weights on worker 0-0, policy_version 854008 (0.00093) [2022-07-10 19:10:42,458][26022] Updated weights on worker 0-0, policy_version 854018 (0.00085) [2022-07-10 19:10:43,137][25689] Fps is (10 sec: 5674.6, 60 sec: 5553.4, 300 sec: 5533.9). Total num frames: 874518528. Throughput: 0: 4976.2. Samples: 874512938. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:43,137][25689] Avg episode reward: [(0, '0.770')] [2022-07-10 19:10:44,377][26022] Updated weights on worker 0-0, policy_version 854028 (0.00085) [2022-07-10 19:10:46,197][26022] Updated weights on worker 0-0, policy_version 854038 (0.00086) [2022-07-10 19:10:48,005][26022] Updated weights on worker 0-0, policy_version 854048 (0.00095) [2022-07-10 19:10:48,181][25689] Fps is (10 sec: 5563.3, 60 sec: 5508.1, 300 sec: 5533.2). Total num frames: 874545152. Throughput: 0: 5794.1. Samples: 874546186. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:48,181][25689] Avg episode reward: [(0, '0.433')] [2022-07-10 19:10:49,871][26022] Updated weights on worker 0-0, policy_version 854058 (0.00087) [2022-07-10 19:10:51,672][26022] Updated weights on worker 0-0, policy_version 854068 (0.00087) [2022-07-10 19:10:53,214][25689] Fps is (10 sec: 5385.6, 60 sec: 5526.4, 300 sec: 5526.5). Total num frames: 874572800. Throughput: 0: 5804.9. Samples: 874579608. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:53,215][25689] Avg episode reward: [(0, '-0.091')] [2022-07-10 19:10:53,570][26022] Updated weights on worker 0-0, policy_version 854078 (0.00093) [2022-07-10 19:10:55,301][26022] Updated weights on worker 0-0, policy_version 854088 (0.00085) [2022-07-10 19:10:57,038][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:10:57,049][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000854097_874595328.pth [2022-07-10 19:10:57,050][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000852150_872601600.pth [2022-07-10 19:10:57,139][26022] Updated weights on worker 0-0, policy_version 854098 (0.00086) [2022-07-10 19:10:58,261][25689] Fps is (10 sec: 5688.6, 60 sec: 5539.6, 300 sec: 5534.2). Total num frames: 874602496. Throughput: 0: 4974.1. Samples: 874596542. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:10:58,262][25689] Avg episode reward: [(0, '0.094')] [2022-07-10 19:10:59,082][26022] Updated weights on worker 0-0, policy_version 854108 (0.00084) [2022-07-10 19:11:00,815][26022] Updated weights on worker 0-0, policy_version 854118 (0.00087) [2022-07-10 19:11:03,228][26022] Updated weights on worker 0-0, policy_version 854128 (0.00088) [2022-07-10 19:11:03,364][25689] Fps is (10 sec: 5448.2, 60 sec: 5522.1, 300 sec: 5529.4). Total num frames: 874628096. Throughput: 0: 5782.1. Samples: 874629892. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:11:03,364][25689] Avg episode reward: [(0, '0.080')] [2022-07-10 19:11:04,911][26022] Updated weights on worker 0-0, policy_version 854138 (0.00931) [2022-07-10 19:11:06,819][26022] Updated weights on worker 0-0, policy_version 854148 (0.00082) [2022-07-10 19:11:08,375][25689] Fps is (10 sec: 5366.3, 60 sec: 5543.8, 300 sec: 5536.6). Total num frames: 874656768. Throughput: 0: 5693.2. Samples: 874661154. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:11:08,375][25689] Avg episode reward: [(0, '-0.360')] [2022-07-10 19:11:08,525][26022] Updated weights on worker 0-0, policy_version 854158 (0.00086) [2022-07-10 19:11:10,364][26022] Updated weights on worker 0-0, policy_version 854168 (0.00080) [2022-07-10 19:11:12,295][26022] Updated weights on worker 0-0, policy_version 854178 (0.00084) [2022-07-10 19:11:13,397][25689] Fps is (10 sec: 5510.9, 60 sec: 5545.6, 300 sec: 5526.8). Total num frames: 874683392. Throughput: 0: 4874.7. Samples: 874677996. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2022-07-10 19:11:13,398][25689] Avg episode reward: [(0, '-0.616')] [2022-07-10 19:11:14,099][26022] Updated weights on worker 0-0, policy_version 854188 (0.00084) [2022-07-10 19:11:16,086][26022] Updated weights on worker 0-0, policy_version 854198 (0.00102) [2022-07-10 19:11:17,889][26022] Updated weights on worker 0-0, policy_version 854208 (0.00098) [2022-07-10 19:11:18,421][25689] Fps is (10 sec: 5504.4, 60 sec: 5529.4, 300 sec: 5531.9). Total num frames: 874712064. Throughput: 0: 5690.9. Samples: 874711266. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:18,421][25689] Avg episode reward: [(0, '-0.107')] [2022-07-10 19:11:19,684][26022] Updated weights on worker 0-0, policy_version 854218 (0.00090) [2022-07-10 19:11:21,710][26022] Updated weights on worker 0-0, policy_version 854228 (0.00092) [2022-07-10 19:11:23,147][26022] Updated weights on worker 0-0, policy_version 854238 (0.00097) [2022-07-10 19:11:23,540][25689] Fps is (10 sec: 5653.9, 60 sec: 5542.7, 300 sec: 5529.9). Total num frames: 874740736. Throughput: 0: 5688.1. Samples: 874744656. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:23,540][25689] Avg episode reward: [(0, '-0.200')] [2022-07-10 19:11:25,463][26022] Updated weights on worker 0-0, policy_version 854248 (0.00099) [2022-07-10 19:11:26,828][26022] Updated weights on worker 0-0, policy_version 854258 (0.00087) [2022-07-10 19:11:28,563][25689] Fps is (10 sec: 5351.0, 60 sec: 5494.8, 300 sec: 5519.4). Total num frames: 874766336. Throughput: 0: 4973.0. Samples: 874761550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:28,563][25689] Avg episode reward: [(0, '-0.083')] [2022-07-10 19:11:28,899][26022] Updated weights on worker 0-0, policy_version 854268 (0.00090) [2022-07-10 19:11:30,759][26022] Updated weights on worker 0-0, policy_version 854278 (0.00082) [2022-07-10 19:11:32,398][26022] Updated weights on worker 0-0, policy_version 854288 (0.00082) [2022-07-10 19:11:33,598][25689] Fps is (10 sec: 5599.2, 60 sec: 5545.5, 300 sec: 5533.0). Total num frames: 874797056. Throughput: 0: 5788.6. Samples: 874794930. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:33,599][25689] Avg episode reward: [(0, '-0.338')] [2022-07-10 19:11:34,660][26022] Updated weights on worker 0-0, policy_version 854298 (0.00083) [2022-07-10 19:11:36,077][26022] Updated weights on worker 0-0, policy_version 854308 (0.00095) [2022-07-10 19:11:38,032][26022] Updated weights on worker 0-0, policy_version 854318 (0.00088) [2022-07-10 19:11:38,676][25689] Fps is (10 sec: 5771.8, 60 sec: 5540.8, 300 sec: 5529.1). Total num frames: 874824704. Throughput: 0: 5784.1. Samples: 874828424. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:38,676][25689] Avg episode reward: [(0, '-0.437')] [2022-07-10 19:11:40,025][26022] Updated weights on worker 0-0, policy_version 854328 (0.00086) [2022-07-10 19:11:41,441][26022] Updated weights on worker 0-0, policy_version 854338 (0.00094) [2022-07-10 19:11:43,773][25689] Fps is (10 sec: 5333.9, 60 sec: 5488.4, 300 sec: 5524.4). Total num frames: 874851328. Throughput: 0: 5783.6. Samples: 874861678. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:43,774][25689] Avg episode reward: [(0, '0.000')] [2022-07-10 19:11:43,850][26022] Updated weights on worker 0-0, policy_version 854348 (0.00088) [2022-07-10 19:11:45,203][26022] Updated weights on worker 0-0, policy_version 854358 (0.00084) [2022-07-10 19:11:47,264][26022] Updated weights on worker 0-0, policy_version 854368 (0.00083) [2022-07-10 19:11:48,778][25689] Fps is (10 sec: 5574.9, 60 sec: 5542.7, 300 sec: 5528.1). Total num frames: 874881024. Throughput: 0: 5778.4. Samples: 874878362. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:48,779][25689] Avg episode reward: [(0, '0.241')] [2022-07-10 19:11:49,301][26022] Updated weights on worker 0-0, policy_version 854378 (0.00093) [2022-07-10 19:11:51,049][26022] Updated weights on worker 0-0, policy_version 854388 (0.00092) [2022-07-10 19:11:52,816][26022] Updated weights on worker 0-0, policy_version 854398 (0.00087) [2022-07-10 19:11:53,807][25689] Fps is (10 sec: 5816.9, 60 sec: 5559.9, 300 sec: 5528.2). Total num frames: 874909696. Throughput: 0: 5795.2. Samples: 874912046. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:53,808][25689] Avg episode reward: [(0, '-0.465')] [2022-07-10 19:11:54,686][26022] Updated weights on worker 0-0, policy_version 854408 (0.00098) [2022-07-10 19:11:56,184][26022] Updated weights on worker 0-0, policy_version 854418 (0.00090) [2022-07-10 19:11:58,506][26022] Updated weights on worker 0-0, policy_version 854428 (0.00086) [2022-07-10 19:11:58,814][25689] Fps is (10 sec: 5509.6, 60 sec: 5512.9, 300 sec: 5525.4). Total num frames: 874936320. Throughput: 0: 5826.4. Samples: 874945760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:11:58,815][25689] Avg episode reward: [(0, '0.235')] [2022-07-10 19:11:59,762][26022] Updated weights on worker 0-0, policy_version 854438 (0.00091) [2022-07-10 19:12:02,480][26022] Updated weights on worker 0-0, policy_version 854448 (0.00083) [2022-07-10 19:12:03,865][25689] Fps is (10 sec: 5498.0, 60 sec: 5568.4, 300 sec: 5535.1). Total num frames: 874964992. Throughput: 0: 4988.6. Samples: 874961908. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:03,867][25689] Avg episode reward: [(0, '0.252')] [2022-07-10 19:12:03,874][26022] Updated weights on worker 0-0, policy_version 854458 (0.00085) [2022-07-10 19:12:06,119][26022] Updated weights on worker 0-0, policy_version 854468 (0.00089) [2022-07-10 19:12:07,707][26022] Updated weights on worker 0-0, policy_version 854478 (0.00083) [2022-07-10 19:12:08,959][25689] Fps is (10 sec: 5349.8, 60 sec: 5510.1, 300 sec: 5523.7). Total num frames: 874990592. Throughput: 0: 5722.1. Samples: 874993840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:08,960][25689] Avg episode reward: [(0, '-0.235')] [2022-07-10 19:12:09,914][26022] Updated weights on worker 0-0, policy_version 854488 (0.00087) [2022-07-10 19:12:11,412][26022] Updated weights on worker 0-0, policy_version 854498 (0.00086) [2022-07-10 19:12:13,539][26022] Updated weights on worker 0-0, policy_version 854508 (0.00085) [2022-07-10 19:12:13,997][25689] Fps is (10 sec: 5255.1, 60 sec: 5525.5, 300 sec: 5523.7). Total num frames: 875018240. Throughput: 0: 5687.7. Samples: 875026880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:13,998][25689] Avg episode reward: [(0, '-0.224')] [2022-07-10 19:12:15,066][26022] Updated weights on worker 0-0, policy_version 854518 (0.00093) [2022-07-10 19:12:17,179][26022] Updated weights on worker 0-0, policy_version 854528 (0.00084) [2022-07-10 19:12:19,027][25689] Fps is (10 sec: 5492.4, 60 sec: 5508.1, 300 sec: 5524.8). Total num frames: 875045888. Throughput: 0: 4836.1. Samples: 875043508. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:19,027][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 19:12:19,062][26022] Updated weights on worker 0-0, policy_version 854538 (0.00092) [2022-07-10 19:12:20,769][26022] Updated weights on worker 0-0, policy_version 854548 (0.00089) [2022-07-10 19:12:22,799][26022] Updated weights on worker 0-0, policy_version 854558 (0.00086) [2022-07-10 19:12:24,110][25689] Fps is (10 sec: 5569.6, 60 sec: 5511.4, 300 sec: 5523.3). Total num frames: 875074560. Throughput: 0: 5687.1. Samples: 875077042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:24,111][25689] Avg episode reward: [(0, '-0.057')] [2022-07-10 19:12:24,335][26022] Updated weights on worker 0-0, policy_version 854568 (0.00092) [2022-07-10 19:12:26,438][26022] Updated weights on worker 0-0, policy_version 854578 (0.00090) [2022-07-10 19:12:28,269][26022] Updated weights on worker 0-0, policy_version 854588 (0.00079) [2022-07-10 19:12:29,146][25689] Fps is (10 sec: 5565.9, 60 sec: 5544.0, 300 sec: 5526.6). Total num frames: 875102208. Throughput: 0: 5776.4. Samples: 875110446. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:29,146][25689] Avg episode reward: [(0, '0.179')] [2022-07-10 19:12:29,947][26022] Updated weights on worker 0-0, policy_version 854598 (0.00093) [2022-07-10 19:12:31,996][26022] Updated weights on worker 0-0, policy_version 854608 (0.00085) [2022-07-10 19:12:33,532][26022] Updated weights on worker 0-0, policy_version 854618 (0.00090) [2022-07-10 19:12:34,175][25689] Fps is (10 sec: 5595.2, 60 sec: 5510.7, 300 sec: 5529.8). Total num frames: 875130880. Throughput: 0: 4975.9. Samples: 875127282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:34,176][25689] Avg episode reward: [(0, '0.790')] [2022-07-10 19:12:35,620][26022] Updated weights on worker 0-0, policy_version 854628 (0.00088) [2022-07-10 19:12:37,324][26022] Updated weights on worker 0-0, policy_version 854638 (0.00090) [2022-07-10 19:12:39,177][26022] Updated weights on worker 0-0, policy_version 854648 (0.00085) [2022-07-10 19:12:39,274][25689] Fps is (10 sec: 5661.8, 60 sec: 5525.7, 300 sec: 5525.7). Total num frames: 875159552. Throughput: 0: 5796.8. Samples: 875160876. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:39,275][25689] Avg episode reward: [(0, '0.901')] [2022-07-10 19:12:41,036][26022] Updated weights on worker 0-0, policy_version 854658 (0.00083) [2022-07-10 19:12:42,900][26022] Updated weights on worker 0-0, policy_version 854668 (0.00093) [2022-07-10 19:12:44,345][25689] Fps is (10 sec: 5638.8, 60 sec: 5562.0, 300 sec: 5531.9). Total num frames: 875188224. Throughput: 0: 5800.3. Samples: 875194412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:44,345][25689] Avg episode reward: [(0, '0.889')] [2022-07-10 19:12:44,699][26022] Updated weights on worker 0-0, policy_version 854678 (0.00094) [2022-07-10 19:12:46,663][26022] Updated weights on worker 0-0, policy_version 854688 (0.00079) [2022-07-10 19:12:48,158][26022] Updated weights on worker 0-0, policy_version 854698 (0.00084) [2022-07-10 19:12:49,366][25689] Fps is (10 sec: 5580.5, 60 sec: 5526.6, 300 sec: 5528.1). Total num frames: 875215872. Throughput: 0: 4990.9. Samples: 875211364. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:49,367][25689] Avg episode reward: [(0, '0.785')] [2022-07-10 19:12:50,596][26022] Updated weights on worker 0-0, policy_version 854708 (0.00093) [2022-07-10 19:12:51,795][26022] Updated weights on worker 0-0, policy_version 854718 (0.00085) [2022-07-10 19:12:53,978][26022] Updated weights on worker 0-0, policy_version 854728 (0.00089) [2022-07-10 19:12:54,373][25689] Fps is (10 sec: 5616.2, 60 sec: 5528.7, 300 sec: 5535.3). Total num frames: 875244544. Throughput: 0: 5821.0. Samples: 875244854. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:54,374][25689] Avg episode reward: [(0, '0.938')] [2022-07-10 19:12:55,587][26022] Updated weights on worker 0-0, policy_version 854738 (0.00088) [2022-07-10 19:12:57,052][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:12:57,063][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000854746_875259904.pth [2022-07-10 19:12:57,065][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000852801_873268224.pth [2022-07-10 19:12:57,519][26022] Updated weights on worker 0-0, policy_version 854748 (0.00090) [2022-07-10 19:12:59,381][25689] Fps is (10 sec: 5521.1, 60 sec: 5528.6, 300 sec: 5526.4). Total num frames: 875271168. Throughput: 0: 5837.7. Samples: 875278258. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:12:59,382][25689] Avg episode reward: [(0, '0.034')] [2022-07-10 19:12:59,463][26022] Updated weights on worker 0-0, policy_version 854758 (0.00094) [2022-07-10 19:13:01,319][26022] Updated weights on worker 0-0, policy_version 854768 (0.00091) [2022-07-10 19:13:03,453][26022] Updated weights on worker 0-0, policy_version 854778 (0.00093) [2022-07-10 19:13:04,467][25689] Fps is (10 sec: 5275.1, 60 sec: 5491.5, 300 sec: 5526.1). Total num frames: 875297792. Throughput: 0: 4912.7. Samples: 875293268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:04,468][25689] Avg episode reward: [(0, '0.173')] [2022-07-10 19:13:05,360][26022] Updated weights on worker 0-0, policy_version 854788 (0.00090) [2022-07-10 19:13:06,999][26022] Updated weights on worker 0-0, policy_version 854798 (0.00343) [2022-07-10 19:13:09,011][26022] Updated weights on worker 0-0, policy_version 854808 (0.00090) [2022-07-10 19:13:09,532][25689] Fps is (10 sec: 5346.8, 60 sec: 5528.0, 300 sec: 5525.3). Total num frames: 875325440. Throughput: 0: 5692.9. Samples: 875326166. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:09,532][25689] Avg episode reward: [(0, '-0.277')] [2022-07-10 19:13:10,636][26022] Updated weights on worker 0-0, policy_version 854818 (0.00093) [2022-07-10 19:13:12,700][26022] Updated weights on worker 0-0, policy_version 854828 (0.00092) [2022-07-10 19:13:14,401][26022] Updated weights on worker 0-0, policy_version 854838 (0.00084) [2022-07-10 19:13:14,547][25689] Fps is (10 sec: 5587.5, 60 sec: 5547.1, 300 sec: 5528.7). Total num frames: 875354112. Throughput: 0: 5705.5. Samples: 875359956. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:14,547][25689] Avg episode reward: [(0, '-1.690')] [2022-07-10 19:13:16,299][26022] Updated weights on worker 0-0, policy_version 854848 (0.00095) [2022-07-10 19:13:18,084][26022] Updated weights on worker 0-0, policy_version 854858 (0.00091) [2022-07-10 19:13:19,569][25689] Fps is (10 sec: 5611.2, 60 sec: 5547.7, 300 sec: 5523.2). Total num frames: 875381760. Throughput: 0: 4891.9. Samples: 875377014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:19,569][25689] Avg episode reward: [(0, '-1.358')] [2022-07-10 19:13:20,048][26022] Updated weights on worker 0-0, policy_version 854868 (0.00094) [2022-07-10 19:13:21,615][26022] Updated weights on worker 0-0, policy_version 854878 (0.00084) [2022-07-10 19:13:23,578][26022] Updated weights on worker 0-0, policy_version 854888 (0.00085) [2022-07-10 19:13:24,618][25689] Fps is (10 sec: 5694.1, 60 sec: 5567.8, 300 sec: 5533.2). Total num frames: 875411456. Throughput: 0: 5820.3. Samples: 875410548. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:24,618][25689] Avg episode reward: [(0, '-1.385')] [2022-07-10 19:13:25,330][26022] Updated weights on worker 0-0, policy_version 854898 (0.00094) [2022-07-10 19:13:27,362][26022] Updated weights on worker 0-0, policy_version 854908 (0.00084) [2022-07-10 19:13:29,003][26022] Updated weights on worker 0-0, policy_version 854918 (0.00108) [2022-07-10 19:13:29,622][25689] Fps is (10 sec: 5704.4, 60 sec: 5570.7, 300 sec: 5530.0). Total num frames: 875439104. Throughput: 0: 5881.9. Samples: 875444332. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:29,624][25689] Avg episode reward: [(0, '-0.837')] [2022-07-10 19:13:30,863][26022] Updated weights on worker 0-0, policy_version 854928 (0.00097) [2022-07-10 19:13:32,712][26022] Updated weights on worker 0-0, policy_version 854938 (0.00090) [2022-07-10 19:13:34,539][26022] Updated weights on worker 0-0, policy_version 854948 (0.00092) [2022-07-10 19:13:34,627][25689] Fps is (10 sec: 5524.3, 60 sec: 5556.0, 300 sec: 5530.4). Total num frames: 875466752. Throughput: 0: 5039.8. Samples: 875461156. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:34,628][25689] Avg episode reward: [(0, '-2.080')] [2022-07-10 19:13:36,310][26022] Updated weights on worker 0-0, policy_version 854958 (0.00094) [2022-07-10 19:13:38,222][26022] Updated weights on worker 0-0, policy_version 854968 (0.00084) [2022-07-10 19:13:39,657][25689] Fps is (10 sec: 5612.1, 60 sec: 5562.3, 300 sec: 5535.3). Total num frames: 875495424. Throughput: 0: 5863.3. Samples: 875494796. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:39,659][25689] Avg episode reward: [(0, '-1.994')] [2022-07-10 19:13:39,969][26022] Updated weights on worker 0-0, policy_version 854978 (0.00087) [2022-07-10 19:13:41,974][26022] Updated weights on worker 0-0, policy_version 854988 (0.00090) [2022-07-10 19:13:43,800][26022] Updated weights on worker 0-0, policy_version 854998 (0.00096) [2022-07-10 19:13:44,712][25689] Fps is (10 sec: 5483.1, 60 sec: 5529.9, 300 sec: 5527.8). Total num frames: 875522048. Throughput: 0: 5847.5. Samples: 875528050. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:44,714][25689] Avg episode reward: [(0, '-0.931')] [2022-07-10 19:13:45,487][26022] Updated weights on worker 0-0, policy_version 855008 (0.00089) [2022-07-10 19:13:47,550][26022] Updated weights on worker 0-0, policy_version 855018 (0.00099) [2022-07-10 19:13:49,172][26022] Updated weights on worker 0-0, policy_version 855028 (0.00087) [2022-07-10 19:13:49,739][25689] Fps is (10 sec: 5484.8, 60 sec: 5546.4, 300 sec: 5531.3). Total num frames: 875550720. Throughput: 0: 4996.2. Samples: 875544842. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:49,740][25689] Avg episode reward: [(0, '-0.835')] [2022-07-10 19:13:51,423][26022] Updated weights on worker 0-0, policy_version 855038 (0.00082) [2022-07-10 19:13:53,066][26022] Updated weights on worker 0-0, policy_version 855048 (0.00092) [2022-07-10 19:13:54,744][26022] Updated weights on worker 0-0, policy_version 855058 (0.00092) [2022-07-10 19:13:54,762][25689] Fps is (10 sec: 5705.9, 60 sec: 5544.9, 300 sec: 5535.3). Total num frames: 875579392. Throughput: 0: 5799.2. Samples: 875577922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:54,762][25689] Avg episode reward: [(0, '-0.655')] [2022-07-10 19:13:56,922][26022] Updated weights on worker 0-0, policy_version 855068 (0.00094) [2022-07-10 19:13:58,312][26022] Updated weights on worker 0-0, policy_version 855078 (0.00083) [2022-07-10 19:13:59,764][25689] Fps is (10 sec: 5413.7, 60 sec: 5528.5, 300 sec: 5529.2). Total num frames: 875604992. Throughput: 0: 5790.7. Samples: 875611226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:13:59,765][25689] Avg episode reward: [(0, '-0.171')] [2022-07-10 19:14:00,462][26022] Updated weights on worker 0-0, policy_version 855088 (0.00092) [2022-07-10 19:14:02,553][26022] Updated weights on worker 0-0, policy_version 855098 (0.00092) [2022-07-10 19:14:04,511][26022] Updated weights on worker 0-0, policy_version 855108 (0.00091) [2022-07-10 19:14:04,848][25689] Fps is (10 sec: 5279.5, 60 sec: 5545.6, 300 sec: 5528.3). Total num frames: 875632640. Throughput: 0: 4855.8. Samples: 875625826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:04,848][25689] Avg episode reward: [(0, '0.595')] [2022-07-10 19:14:06,259][26022] Updated weights on worker 0-0, policy_version 855118 (0.00082) [2022-07-10 19:14:08,067][26022] Updated weights on worker 0-0, policy_version 855128 (0.00089) [2022-07-10 19:14:09,856][25689] Fps is (10 sec: 5479.0, 60 sec: 5550.8, 300 sec: 5531.8). Total num frames: 875660288. Throughput: 0: 5699.2. Samples: 875659494. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:09,856][25689] Avg episode reward: [(0, '-0.025')] [2022-07-10 19:14:09,946][26022] Updated weights on worker 0-0, policy_version 855138 (0.00090) [2022-07-10 19:14:11,727][26022] Updated weights on worker 0-0, policy_version 855148 (0.00097) [2022-07-10 19:14:13,531][26022] Updated weights on worker 0-0, policy_version 855158 (0.00091) [2022-07-10 19:14:14,874][25689] Fps is (10 sec: 5515.1, 60 sec: 5533.6, 300 sec: 5524.9). Total num frames: 875687936. Throughput: 0: 5716.3. Samples: 875692888. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:14,875][25689] Avg episode reward: [(0, '-0.051')] [2022-07-10 19:14:15,592][26022] Updated weights on worker 0-0, policy_version 855168 (0.00091) [2022-07-10 19:14:17,187][26022] Updated weights on worker 0-0, policy_version 855178 (0.00087) [2022-07-10 19:14:19,190][26022] Updated weights on worker 0-0, policy_version 855188 (0.00084) [2022-07-10 19:14:19,887][25689] Fps is (10 sec: 5512.6, 60 sec: 5534.4, 300 sec: 5526.7). Total num frames: 875715584. Throughput: 0: 4889.4. Samples: 875709616. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:19,888][25689] Avg episode reward: [(0, '-0.474')] [2022-07-10 19:14:20,980][26022] Updated weights on worker 0-0, policy_version 855198 (0.00089) [2022-07-10 19:14:22,806][26022] Updated weights on worker 0-0, policy_version 855208 (0.00092) [2022-07-10 19:14:24,707][26022] Updated weights on worker 0-0, policy_version 855218 (0.00547) [2022-07-10 19:14:24,934][25689] Fps is (10 sec: 5700.0, 60 sec: 5534.5, 300 sec: 5533.3). Total num frames: 875745280. Throughput: 0: 5829.7. Samples: 875742924. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:24,935][25689] Avg episode reward: [(0, '-1.335')] [2022-07-10 19:14:26,467][26022] Updated weights on worker 0-0, policy_version 855228 (0.00094) [2022-07-10 19:14:28,443][26022] Updated weights on worker 0-0, policy_version 855238 (0.00897) [2022-07-10 19:14:30,019][25689] Fps is (10 sec: 5558.7, 60 sec: 5510.2, 300 sec: 5528.3). Total num frames: 875771904. Throughput: 0: 5799.2. Samples: 875776422. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:30,019][25689] Avg episode reward: [(0, '-1.461')] [2022-07-10 19:14:30,199][26022] Updated weights on worker 0-0, policy_version 855248 (0.00088) [2022-07-10 19:14:31,976][26022] Updated weights on worker 0-0, policy_version 855258 (0.00086) [2022-07-10 19:14:33,924][26022] Updated weights on worker 0-0, policy_version 855268 (0.00094) [2022-07-10 19:14:35,043][25689] Fps is (10 sec: 5470.0, 60 sec: 5525.4, 300 sec: 5528.4). Total num frames: 875800576. Throughput: 0: 5807.9. Samples: 875810028. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:35,044][25689] Avg episode reward: [(0, '-0.554')] [2022-07-10 19:14:35,602][26022] Updated weights on worker 0-0, policy_version 855278 (0.00086) [2022-07-10 19:14:37,678][26022] Updated weights on worker 0-0, policy_version 855288 (0.00086) [2022-07-10 19:14:39,308][26022] Updated weights on worker 0-0, policy_version 855298 (0.00086) [2022-07-10 19:14:40,083][25689] Fps is (10 sec: 5697.8, 60 sec: 5524.5, 300 sec: 5536.1). Total num frames: 875829248. Throughput: 0: 5804.3. Samples: 875826840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:40,083][25689] Avg episode reward: [(0, '-0.717')] [2022-07-10 19:14:41,324][26022] Updated weights on worker 0-0, policy_version 855308 (0.00082) [2022-07-10 19:14:42,882][26022] Updated weights on worker 0-0, policy_version 855318 (0.00050) [2022-07-10 19:14:44,934][26022] Updated weights on worker 0-0, policy_version 855328 (0.00087) [2022-07-10 19:14:45,148][25689] Fps is (10 sec: 5573.7, 60 sec: 5540.5, 300 sec: 5530.0). Total num frames: 875856896. Throughput: 0: 5810.2. Samples: 875860368. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:45,150][25689] Avg episode reward: [(0, '-0.932')] [2022-07-10 19:14:46,793][26022] Updated weights on worker 0-0, policy_version 855338 (0.00084) [2022-07-10 19:14:48,551][26022] Updated weights on worker 0-0, policy_version 855349 (0.00082) [2022-07-10 19:14:50,152][25689] Fps is (10 sec: 5491.8, 60 sec: 5525.7, 300 sec: 5534.2). Total num frames: 875884544. Throughput: 0: 5836.8. Samples: 875893934. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:50,152][25689] Avg episode reward: [(0, '-0.944')] [2022-07-10 19:14:50,670][26022] Updated weights on worker 0-0, policy_version 855359 (0.00088) [2022-07-10 19:14:52,465][26022] Updated weights on worker 0-0, policy_version 855369 (0.00084) [2022-07-10 19:14:54,266][26022] Updated weights on worker 0-0, policy_version 855379 (0.00085) [2022-07-10 19:14:55,223][25689] Fps is (10 sec: 5590.2, 60 sec: 5521.3, 300 sec: 5533.0). Total num frames: 875913216. Throughput: 0: 4989.3. Samples: 875910710. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-10 19:14:55,223][25689] Avg episode reward: [(0, '-0.519')] [2022-07-10 19:14:56,149][26022] Updated weights on worker 0-0, policy_version 855389 (0.00086) [2022-07-10 19:14:57,109][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:14:57,119][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000855394_875923456.pth [2022-07-10 19:14:57,120][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000853449_873931776.pth [2022-07-10 19:14:57,977][26022] Updated weights on worker 0-0, policy_version 855399 (0.00126) [2022-07-10 19:14:59,742][26022] Updated weights on worker 0-0, policy_version 855409 (0.00086) [2022-07-10 19:15:00,225][25689] Fps is (10 sec: 5591.1, 60 sec: 5555.2, 300 sec: 5538.2). Total num frames: 875940864. Throughput: 0: 5829.5. Samples: 875944258. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:00,226][25689] Avg episode reward: [(0, '-0.336')] [2022-07-10 19:15:01,744][26022] Updated weights on worker 0-0, policy_version 855419 (0.00096) [2022-07-10 19:15:03,961][26022] Updated weights on worker 0-0, policy_version 855429 (0.00089) [2022-07-10 19:15:05,321][25689] Fps is (10 sec: 5273.1, 60 sec: 5520.2, 300 sec: 5530.7). Total num frames: 875966464. Throughput: 0: 5715.8. Samples: 875975672. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:05,323][25689] Avg episode reward: [(0, '-0.580')] [2022-07-10 19:15:05,614][26022] Updated weights on worker 0-0, policy_version 855439 (0.00083) [2022-07-10 19:15:07,439][26022] Updated weights on worker 0-0, policy_version 855449 (0.00093) [2022-07-10 19:15:09,252][26022] Updated weights on worker 0-0, policy_version 855459 (0.00094) [2022-07-10 19:15:10,414][25689] Fps is (10 sec: 5326.6, 60 sec: 5529.4, 300 sec: 5536.7). Total num frames: 875995136. Throughput: 0: 4863.8. Samples: 875992484. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:10,416][25689] Avg episode reward: [(0, '-1.425')] [2022-07-10 19:15:11,235][26022] Updated weights on worker 0-0, policy_version 855469 (0.00092) [2022-07-10 19:15:12,928][26022] Updated weights on worker 0-0, policy_version 855479 (0.00088) [2022-07-10 19:15:14,977][26022] Updated weights on worker 0-0, policy_version 855489 (0.00085) [2022-07-10 19:15:15,458][25689] Fps is (10 sec: 5656.7, 60 sec: 5543.9, 300 sec: 5533.0). Total num frames: 876023808. Throughput: 0: 5702.8. Samples: 876026110. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:15,460][25689] Avg episode reward: [(0, '-0.857')] [2022-07-10 19:15:16,576][26022] Updated weights on worker 0-0, policy_version 855499 (0.00096) [2022-07-10 19:15:18,631][26022] Updated weights on worker 0-0, policy_version 855509 (0.00093) [2022-07-10 19:15:20,384][26022] Updated weights on worker 0-0, policy_version 855519 (0.00086) [2022-07-10 19:15:20,486][25689] Fps is (10 sec: 5591.9, 60 sec: 5542.6, 300 sec: 5534.0). Total num frames: 876051456. Throughput: 0: 5682.1. Samples: 876059382. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:20,488][25689] Avg episode reward: [(0, '-0.361')] [2022-07-10 19:15:22,200][26022] Updated weights on worker 0-0, policy_version 855529 (0.00100) [2022-07-10 19:15:23,834][26022] Updated weights on worker 0-0, policy_version 855539 (0.00075) [2022-07-10 19:15:25,556][25689] Fps is (10 sec: 5476.2, 60 sec: 5506.7, 300 sec: 5530.2). Total num frames: 876079104. Throughput: 0: 4970.5. Samples: 876076250. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:25,556][25689] Avg episode reward: [(0, '-0.010')] [2022-07-10 19:15:25,965][26022] Updated weights on worker 0-0, policy_version 855549 (0.00087) [2022-07-10 19:15:27,495][26022] Updated weights on worker 0-0, policy_version 855559 (0.00086) [2022-07-10 19:15:29,623][26022] Updated weights on worker 0-0, policy_version 855569 (0.00092) [2022-07-10 19:15:30,591][25689] Fps is (10 sec: 5674.7, 60 sec: 5561.9, 300 sec: 5537.1). Total num frames: 876108800. Throughput: 0: 5824.2. Samples: 876109998. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:30,591][25689] Avg episode reward: [(0, '-1.622')] [2022-07-10 19:15:31,093][26022] Updated weights on worker 0-0, policy_version 855579 (0.00085) [2022-07-10 19:15:33,148][26022] Updated weights on worker 0-0, policy_version 855589 (0.00091) [2022-07-10 19:15:34,836][26022] Updated weights on worker 0-0, policy_version 855599 (0.00093) [2022-07-10 19:15:35,656][25689] Fps is (10 sec: 5677.7, 60 sec: 5541.3, 300 sec: 5536.4). Total num frames: 876136448. Throughput: 0: 5825.8. Samples: 876143776. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:35,656][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 19:15:36,743][26022] Updated weights on worker 0-0, policy_version 855609 (0.00087) [2022-07-10 19:15:38,693][26022] Updated weights on worker 0-0, policy_version 855619 (0.00090) [2022-07-10 19:15:40,380][26022] Updated weights on worker 0-0, policy_version 855629 (0.00087) [2022-07-10 19:15:40,663][25689] Fps is (10 sec: 5693.1, 60 sec: 5561.1, 300 sec: 5537.7). Total num frames: 876166144. Throughput: 0: 5023.1. Samples: 876160738. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:40,664][25689] Avg episode reward: [(0, '-0.636')] [2022-07-10 19:15:42,054][26022] Updated weights on worker 0-0, policy_version 855639 (0.00079) [2022-07-10 19:15:44,040][26022] Updated weights on worker 0-0, policy_version 855649 (0.00081) [2022-07-10 19:15:45,579][26022] Updated weights on worker 0-0, policy_version 855659 (0.00094) [2022-07-10 19:15:45,760][25689] Fps is (10 sec: 5776.7, 60 sec: 5575.2, 300 sec: 5543.6). Total num frames: 876194816. Throughput: 0: 5878.0. Samples: 876195008. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:45,760][25689] Avg episode reward: [(0, '-0.896')] [2022-07-10 19:15:47,690][26022] Updated weights on worker 0-0, policy_version 855669 (0.00088) [2022-07-10 19:15:49,303][26022] Updated weights on worker 0-0, policy_version 855679 (0.00095) [2022-07-10 19:15:50,775][25689] Fps is (10 sec: 5468.7, 60 sec: 5557.2, 300 sec: 5540.5). Total num frames: 876221440. Throughput: 0: 5867.1. Samples: 876228418. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:50,775][25689] Avg episode reward: [(0, '-1.698')] [2022-07-10 19:15:51,395][26022] Updated weights on worker 0-0, policy_version 855689 (0.00090) [2022-07-10 19:15:53,237][26022] Updated weights on worker 0-0, policy_version 855699 (0.00112) [2022-07-10 19:15:54,974][26022] Updated weights on worker 0-0, policy_version 855709 (0.00083) [2022-07-10 19:15:55,777][25689] Fps is (10 sec: 5519.7, 60 sec: 5563.5, 300 sec: 5537.9). Total num frames: 876250112. Throughput: 0: 5038.6. Samples: 876245162. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:15:55,778][25689] Avg episode reward: [(0, '-1.750')] [2022-07-10 19:15:56,864][26022] Updated weights on worker 0-0, policy_version 855719 (0.00079) [2022-07-10 19:15:58,472][26022] Updated weights on worker 0-0, policy_version 855729 (0.00084) [2022-07-10 19:16:00,492][26022] Updated weights on worker 0-0, policy_version 855739 (0.00087) [2022-07-10 19:16:00,783][25689] Fps is (10 sec: 5729.5, 60 sec: 5580.1, 300 sec: 5550.0). Total num frames: 876278784. Throughput: 0: 5874.0. Samples: 876278922. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:00,784][25689] Avg episode reward: [(0, '-0.674')] [2022-07-10 19:16:02,708][26022] Updated weights on worker 0-0, policy_version 855749 (0.00090) [2022-07-10 19:16:04,403][26022] Updated weights on worker 0-0, policy_version 855759 (0.00097) [2022-07-10 19:16:05,831][25689] Fps is (10 sec: 5398.5, 60 sec: 5584.5, 300 sec: 5539.0). Total num frames: 876304384. Throughput: 0: 5752.3. Samples: 876310462. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:05,831][25689] Avg episode reward: [(0, '0.053')] [2022-07-10 19:16:06,254][26022] Updated weights on worker 0-0, policy_version 855769 (0.00055) [2022-07-10 19:16:08,139][26022] Updated weights on worker 0-0, policy_version 855779 (0.00086) [2022-07-10 19:16:09,939][26022] Updated weights on worker 0-0, policy_version 855789 (0.00085) [2022-07-10 19:16:10,892][25689] Fps is (10 sec: 5368.8, 60 sec: 5587.5, 300 sec: 5545.2). Total num frames: 876333056. Throughput: 0: 4913.2. Samples: 876327258. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:10,893][25689] Avg episode reward: [(0, '0.346')] [2022-07-10 19:16:11,774][26022] Updated weights on worker 0-0, policy_version 855799 (0.00092) [2022-07-10 19:16:13,747][26022] Updated weights on worker 0-0, policy_version 855809 (0.00085) [2022-07-10 19:16:15,579][26022] Updated weights on worker 0-0, policy_version 855819 (0.00091) [2022-07-10 19:16:15,945][25689] Fps is (10 sec: 5568.1, 60 sec: 5569.7, 300 sec: 5541.2). Total num frames: 876360704. Throughput: 0: 5724.2. Samples: 876360608. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:15,946][25689] Avg episode reward: [(0, '1.427')] [2022-07-10 19:16:17,445][26022] Updated weights on worker 0-0, policy_version 855829 (0.00091) [2022-07-10 19:16:19,311][26022] Updated weights on worker 0-0, policy_version 855839 (0.00087) [2022-07-10 19:16:20,986][25689] Fps is (10 sec: 5477.9, 60 sec: 5568.5, 300 sec: 5539.2). Total num frames: 876388352. Throughput: 0: 5702.2. Samples: 876394124. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:20,987][25689] Avg episode reward: [(0, '1.209')] [2022-07-10 19:16:21,163][26022] Updated weights on worker 0-0, policy_version 855849 (0.00103) [2022-07-10 19:16:23,116][26022] Updated weights on worker 0-0, policy_version 855859 (0.00087) [2022-07-10 19:16:24,643][26022] Updated weights on worker 0-0, policy_version 855869 (0.00090) [2022-07-10 19:16:26,047][25689] Fps is (10 sec: 5473.8, 60 sec: 5569.3, 300 sec: 5545.4). Total num frames: 876416000. Throughput: 0: 4959.5. Samples: 876410726. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:26,048][25689] Avg episode reward: [(0, '0.838')] [2022-07-10 19:16:26,652][26022] Updated weights on worker 0-0, policy_version 855879 (0.00092) [2022-07-10 19:16:28,397][26022] Updated weights on worker 0-0, policy_version 855889 (0.00090) [2022-07-10 19:16:30,310][26022] Updated weights on worker 0-0, policy_version 855899 (0.00094) [2022-07-10 19:16:31,084][25689] Fps is (10 sec: 5577.5, 60 sec: 5552.2, 300 sec: 5538.5). Total num frames: 876444672. Throughput: 0: 5795.0. Samples: 876444272. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:31,085][25689] Avg episode reward: [(0, '0.844')] [2022-07-10 19:16:31,961][26022] Updated weights on worker 0-0, policy_version 855909 (0.00080) [2022-07-10 19:16:33,929][26022] Updated weights on worker 0-0, policy_version 855919 (0.00083) [2022-07-10 19:16:35,648][26022] Updated weights on worker 0-0, policy_version 855929 (0.00053) [2022-07-10 19:16:36,088][25689] Fps is (10 sec: 5609.4, 60 sec: 5557.8, 300 sec: 5539.8). Total num frames: 876472320. Throughput: 0: 5815.7. Samples: 876477750. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:36,088][25689] Avg episode reward: [(0, '0.471')] [2022-07-10 19:16:37,515][26022] Updated weights on worker 0-0, policy_version 855939 (0.00082) [2022-07-10 19:16:39,350][26022] Updated weights on worker 0-0, policy_version 855949 (0.00087) [2022-07-10 19:16:41,090][25689] Fps is (10 sec: 5628.9, 60 sec: 5541.4, 300 sec: 5548.5). Total num frames: 876500992. Throughput: 0: 4987.1. Samples: 876494382. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:41,090][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 19:16:41,155][26022] Updated weights on worker 0-0, policy_version 855959 (0.00093) [2022-07-10 19:16:43,145][26022] Updated weights on worker 0-0, policy_version 855969 (0.00093) [2022-07-10 19:16:44,839][26022] Updated weights on worker 0-0, policy_version 855979 (0.00087) [2022-07-10 19:16:46,167][25689] Fps is (10 sec: 5587.9, 60 sec: 5526.3, 300 sec: 5540.3). Total num frames: 876528640. Throughput: 0: 5822.7. Samples: 876527874. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:46,167][25689] Avg episode reward: [(0, '-0.312')] [2022-07-10 19:16:46,715][26022] Updated weights on worker 0-0, policy_version 855989 (0.00093) [2022-07-10 19:16:48,567][26022] Updated weights on worker 0-0, policy_version 855999 (0.00098) [2022-07-10 19:16:50,474][26022] Updated weights on worker 0-0, policy_version 856009 (0.00089) [2022-07-10 19:16:51,217][25689] Fps is (10 sec: 5560.9, 60 sec: 5556.9, 300 sec: 5539.9). Total num frames: 876557312. Throughput: 0: 5823.7. Samples: 876561522. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:51,218][25689] Avg episode reward: [(0, '-1.364')] [2022-07-10 19:16:52,407][26022] Updated weights on worker 0-0, policy_version 856019 (0.00084) [2022-07-10 19:16:54,019][26022] Updated weights on worker 0-0, policy_version 856029 (0.00091) [2022-07-10 19:16:55,778][26022] Updated weights on worker 0-0, policy_version 856039 (0.00089) [2022-07-10 19:16:56,223][25689] Fps is (10 sec: 5600.5, 60 sec: 5539.7, 300 sec: 5543.4). Total num frames: 876584960. Throughput: 0: 4986.5. Samples: 876578154. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:16:56,224][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 19:16:57,161][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:16:57,177][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000856045_876590080.pth [2022-07-10 19:16:57,179][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000854097_874595328.pth [2022-07-10 19:16:57,744][26022] Updated weights on worker 0-0, policy_version 856049 (0.00080) [2022-07-10 19:16:59,509][26022] Updated weights on worker 0-0, policy_version 856059 (0.00101) [2022-07-10 19:17:01,264][25689] Fps is (10 sec: 5605.7, 60 sec: 5536.4, 300 sec: 5543.5). Total num frames: 876613632. Throughput: 0: 5820.7. Samples: 876611814. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:01,265][25689] Avg episode reward: [(0, '-0.841')] [2022-07-10 19:17:01,787][26022] Updated weights on worker 0-0, policy_version 856069 (0.00114) [2022-07-10 19:17:03,672][26022] Updated weights on worker 0-0, policy_version 856079 (0.00081) [2022-07-10 19:17:05,547][26022] Updated weights on worker 0-0, policy_version 856089 (0.00089) [2022-07-10 19:17:06,349][25689] Fps is (10 sec: 5461.0, 60 sec: 5549.9, 300 sec: 5547.1). Total num frames: 876640256. Throughput: 0: 5711.3. Samples: 876643140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:06,349][25689] Avg episode reward: [(0, '-1.575')] [2022-07-10 19:17:07,390][26022] Updated weights on worker 0-0, policy_version 856099 (0.00066) [2022-07-10 19:17:09,074][26022] Updated weights on worker 0-0, policy_version 856109 (0.00087) [2022-07-10 19:17:10,959][26022] Updated weights on worker 0-0, policy_version 856119 (0.00094) [2022-07-10 19:17:11,383][25689] Fps is (10 sec: 5262.2, 60 sec: 5518.5, 300 sec: 5543.8). Total num frames: 876666880. Throughput: 0: 5719.9. Samples: 876676868. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:11,384][25689] Avg episode reward: [(0, '-1.400')] [2022-07-10 19:17:12,823][26022] Updated weights on worker 0-0, policy_version 856129 (0.00081) [2022-07-10 19:17:14,576][26022] Updated weights on worker 0-0, policy_version 856139 (0.00090) [2022-07-10 19:17:16,403][25689] Fps is (10 sec: 5499.8, 60 sec: 5538.6, 300 sec: 5547.4). Total num frames: 876695552. Throughput: 0: 5733.2. Samples: 876693848. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:16,404][25689] Avg episode reward: [(0, '-1.348')] [2022-07-10 19:17:16,556][26022] Updated weights on worker 0-0, policy_version 856149 (0.00085) [2022-07-10 19:17:18,123][26022] Updated weights on worker 0-0, policy_version 856159 (0.00097) [2022-07-10 19:17:20,215][26022] Updated weights on worker 0-0, policy_version 856169 (0.00095) [2022-07-10 19:17:21,438][25689] Fps is (10 sec: 5703.4, 60 sec: 5556.1, 300 sec: 5548.3). Total num frames: 876724224. Throughput: 0: 5733.4. Samples: 876727476. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:21,439][25689] Avg episode reward: [(0, '-0.964')] [2022-07-10 19:17:21,980][26022] Updated weights on worker 0-0, policy_version 856179 (0.00092) [2022-07-10 19:17:23,888][26022] Updated weights on worker 0-0, policy_version 856189 (0.00091) [2022-07-10 19:17:25,720][26022] Updated weights on worker 0-0, policy_version 856199 (0.00123) [2022-07-10 19:17:26,530][25689] Fps is (10 sec: 5460.1, 60 sec: 5536.3, 300 sec: 5543.8). Total num frames: 876750848. Throughput: 0: 5820.0. Samples: 876760596. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:26,531][25689] Avg episode reward: [(0, '-0.337')] [2022-07-10 19:17:27,520][26022] Updated weights on worker 0-0, policy_version 856209 (0.00087) [2022-07-10 19:17:29,372][26022] Updated weights on worker 0-0, policy_version 856219 (0.00085) [2022-07-10 19:17:31,277][26022] Updated weights on worker 0-0, policy_version 856229 (0.00088) [2022-07-10 19:17:31,559][25689] Fps is (10 sec: 5564.4, 60 sec: 5553.9, 300 sec: 5547.3). Total num frames: 876780544. Throughput: 0: 4976.8. Samples: 876777278. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:31,560][25689] Avg episode reward: [(0, '0.534')] [2022-07-10 19:17:32,993][26022] Updated weights on worker 0-0, policy_version 856239 (0.00089) [2022-07-10 19:17:35,024][26022] Updated weights on worker 0-0, policy_version 856249 (0.00097) [2022-07-10 19:17:36,572][25689] Fps is (10 sec: 5608.5, 60 sec: 5536.1, 300 sec: 5542.0). Total num frames: 876807168. Throughput: 0: 5791.3. Samples: 876810654. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:36,573][25689] Avg episode reward: [(0, '0.475')] [2022-07-10 19:17:36,708][26022] Updated weights on worker 0-0, policy_version 856259 (0.00096) [2022-07-10 19:17:38,564][26022] Updated weights on worker 0-0, policy_version 856269 (0.00087) [2022-07-10 19:17:40,622][26022] Updated weights on worker 0-0, policy_version 856279 (0.00087) [2022-07-10 19:17:41,633][25689] Fps is (10 sec: 5489.0, 60 sec: 5530.7, 300 sec: 5542.1). Total num frames: 876835840. Throughput: 0: 5763.4. Samples: 876843870. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:41,634][25689] Avg episode reward: [(0, '0.642')] [2022-07-10 19:17:42,249][26022] Updated weights on worker 0-0, policy_version 856289 (0.00083) [2022-07-10 19:17:44,178][26022] Updated weights on worker 0-0, policy_version 856299 (0.00091) [2022-07-10 19:17:45,800][26022] Updated weights on worker 0-0, policy_version 856309 (0.00083) [2022-07-10 19:17:46,729][25689] Fps is (10 sec: 5645.8, 60 sec: 5545.9, 300 sec: 5544.2). Total num frames: 876864512. Throughput: 0: 4958.4. Samples: 876860750. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:46,729][25689] Avg episode reward: [(0, '0.363')] [2022-07-10 19:17:47,808][26022] Updated weights on worker 0-0, policy_version 856319 (0.00085) [2022-07-10 19:17:49,707][26022] Updated weights on worker 0-0, policy_version 856329 (0.00088) [2022-07-10 19:17:51,314][26022] Updated weights on worker 0-0, policy_version 856339 (0.00092) [2022-07-10 19:17:51,775][25689] Fps is (10 sec: 5654.4, 60 sec: 5546.4, 300 sec: 5543.5). Total num frames: 876893184. Throughput: 0: 5786.2. Samples: 876894250. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:51,775][25689] Avg episode reward: [(0, '0.446')] [2022-07-10 19:17:53,355][26022] Updated weights on worker 0-0, policy_version 856349 (0.00094) [2022-07-10 19:17:55,003][26022] Updated weights on worker 0-0, policy_version 856359 (0.00086) [2022-07-10 19:17:56,777][25689] Fps is (10 sec: 5401.2, 60 sec: 5512.8, 300 sec: 5540.2). Total num frames: 876918784. Throughput: 0: 5777.8. Samples: 876927394. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:17:56,778][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 19:17:57,184][26022] Updated weights on worker 0-0, policy_version 856369 (0.00095) [2022-07-10 19:17:58,566][26022] Updated weights on worker 0-0, policy_version 856379 (0.00090) [2022-07-10 19:18:00,732][26022] Updated weights on worker 0-0, policy_version 856389 (0.00078) [2022-07-10 19:18:01,808][25689] Fps is (10 sec: 5511.3, 60 sec: 5530.7, 300 sec: 5551.5). Total num frames: 876948480. Throughput: 0: 4971.4. Samples: 876944168. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:01,808][25689] Avg episode reward: [(0, '0.132')] [2022-07-10 19:18:02,858][26022] Updated weights on worker 0-0, policy_version 856399 (0.00086) [2022-07-10 19:18:04,778][26022] Updated weights on worker 0-0, policy_version 856409 (0.00090) [2022-07-10 19:18:06,640][26022] Updated weights on worker 0-0, policy_version 856419 (0.00077) [2022-07-10 19:18:06,846][25689] Fps is (10 sec: 5491.7, 60 sec: 5518.0, 300 sec: 5545.1). Total num frames: 876974080. Throughput: 0: 5706.2. Samples: 876975542. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:06,846][25689] Avg episode reward: [(0, '-0.546')] [2022-07-10 19:18:08,457][26022] Updated weights on worker 0-0, policy_version 856429 (0.00084) [2022-07-10 19:18:10,272][26022] Updated weights on worker 0-0, policy_version 856439 (0.00091) [2022-07-10 19:18:11,863][25689] Fps is (10 sec: 5194.0, 60 sec: 5519.6, 300 sec: 5538.2). Total num frames: 877000704. Throughput: 0: 5711.7. Samples: 877008986. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:11,863][25689] Avg episode reward: [(0, '-0.606')] [2022-07-10 19:18:12,167][26022] Updated weights on worker 0-0, policy_version 856449 (0.00082) [2022-07-10 19:18:13,867][26022] Updated weights on worker 0-0, policy_version 856459 (0.00080) [2022-07-10 19:18:15,882][26022] Updated weights on worker 0-0, policy_version 856469 (0.00086) [2022-07-10 19:18:16,878][25689] Fps is (10 sec: 5613.9, 60 sec: 5536.9, 300 sec: 5545.2). Total num frames: 877030400. Throughput: 0: 4907.4. Samples: 877026038. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:16,879][25689] Avg episode reward: [(0, '-0.257')] [2022-07-10 19:18:17,522][26022] Updated weights on worker 0-0, policy_version 856479 (0.00084) [2022-07-10 19:18:19,606][26022] Updated weights on worker 0-0, policy_version 856489 (0.00080) [2022-07-10 19:18:21,106][26022] Updated weights on worker 0-0, policy_version 856499 (0.00082) [2022-07-10 19:18:21,887][25689] Fps is (10 sec: 5720.7, 60 sec: 5522.4, 300 sec: 5539.1). Total num frames: 877058048. Throughput: 0: 5752.8. Samples: 877059676. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:21,887][25689] Avg episode reward: [(0, '-0.850')] [2022-07-10 19:18:23,195][26022] Updated weights on worker 0-0, policy_version 856509 (0.00092) [2022-07-10 19:18:24,790][26022] Updated weights on worker 0-0, policy_version 856519 (0.00090) [2022-07-10 19:18:26,832][26022] Updated weights on worker 0-0, policy_version 856529 (0.01022) [2022-07-10 19:18:26,967][25689] Fps is (10 sec: 5480.9, 60 sec: 5540.5, 300 sec: 5537.6). Total num frames: 877085696. Throughput: 0: 5853.7. Samples: 877093324. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:26,969][25689] Avg episode reward: [(0, '-0.728')] [2022-07-10 19:18:28,461][26022] Updated weights on worker 0-0, policy_version 856539 (0.00092) [2022-07-10 19:18:30,409][26022] Updated weights on worker 0-0, policy_version 856549 (0.00087) [2022-07-10 19:18:31,987][25689] Fps is (10 sec: 5575.9, 60 sec: 5524.3, 300 sec: 5540.8). Total num frames: 877114368. Throughput: 0: 5012.9. Samples: 877109868. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:31,989][25689] Avg episode reward: [(0, '-0.793')] [2022-07-10 19:18:32,350][26022] Updated weights on worker 0-0, policy_version 856559 (0.00089) [2022-07-10 19:18:34,077][26022] Updated weights on worker 0-0, policy_version 856569 (0.00092) [2022-07-10 19:18:35,907][26022] Updated weights on worker 0-0, policy_version 856579 (0.00085) [2022-07-10 19:18:37,019][25689] Fps is (10 sec: 5704.4, 60 sec: 5556.4, 300 sec: 5540.8). Total num frames: 877143040. Throughput: 0: 5834.4. Samples: 877143550. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-10 19:18:37,020][25689] Avg episode reward: [(0, '0.162')] [2022-07-10 19:18:37,685][26022] Updated weights on worker 0-0, policy_version 856589 (0.00088) [2022-07-10 19:18:39,463][26022] Updated weights on worker 0-0, policy_version 856599 (0.00081) [2022-07-10 19:18:41,423][26022] Updated weights on worker 0-0, policy_version 856609 (0.00086) [2022-07-10 19:18:42,047][25689] Fps is (10 sec: 5700.5, 60 sec: 5559.5, 300 sec: 5548.2). Total num frames: 877171712. Throughput: 0: 5830.9. Samples: 877177228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:18:42,048][25689] Avg episode reward: [(0, '-0.052')] [2022-07-10 19:18:43,067][26022] Updated weights on worker 0-0, policy_version 856619 (0.00727) [2022-07-10 19:18:45,112][26022] Updated weights on worker 0-0, policy_version 856629 (0.00087) [2022-07-10 19:18:46,770][26022] Updated weights on worker 0-0, policy_version 856639 (0.00087) [2022-07-10 19:18:47,108][25689] Fps is (10 sec: 5582.4, 60 sec: 5545.7, 300 sec: 5544.1). Total num frames: 877199360. Throughput: 0: 4994.4. Samples: 877193918. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:18:47,109][25689] Avg episode reward: [(0, '-0.525')] [2022-07-10 19:18:48,687][26022] Updated weights on worker 0-0, policy_version 856649 (0.00095) [2022-07-10 19:18:50,426][26022] Updated weights on worker 0-0, policy_version 856659 (0.00088) [2022-07-10 19:18:52,127][25689] Fps is (10 sec: 5384.1, 60 sec: 5514.3, 300 sec: 5537.3). Total num frames: 877225984. Throughput: 0: 5842.5. Samples: 877227532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:18:52,128][25689] Avg episode reward: [(0, '-0.730')] [2022-07-10 19:18:52,547][26022] Updated weights on worker 0-0, policy_version 856669 (0.00096) [2022-07-10 19:18:54,144][26022] Updated weights on worker 0-0, policy_version 856679 (0.00091) [2022-07-10 19:18:56,246][26022] Updated weights on worker 0-0, policy_version 856689 (0.00087) [2022-07-10 19:18:57,165][25689] Fps is (10 sec: 5498.6, 60 sec: 5561.9, 300 sec: 5546.9). Total num frames: 877254656. Throughput: 0: 5811.7. Samples: 877260628. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:18:57,166][25689] Avg episode reward: [(0, '-0.604')] [2022-07-10 19:18:57,308][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:18:57,328][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000856695_877255680.pth [2022-07-10 19:18:57,341][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000854746_875259904.pth [2022-07-10 19:18:57,819][26022] Updated weights on worker 0-0, policy_version 856699 (0.00077) [2022-07-10 19:18:59,764][26022] Updated weights on worker 0-0, policy_version 856709 (0.00091) [2022-07-10 19:19:01,651][26022] Updated weights on worker 0-0, policy_version 856719 (0.00083) [2022-07-10 19:19:02,199][25689] Fps is (10 sec: 5489.9, 60 sec: 5510.8, 300 sec: 5544.4). Total num frames: 877281280. Throughput: 0: 4974.1. Samples: 877277468. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:02,200][25689] Avg episode reward: [(0, '-1.016')] [2022-07-10 19:19:03,866][26022] Updated weights on worker 0-0, policy_version 856729 (0.00087) [2022-07-10 19:19:05,599][26022] Updated weights on worker 0-0, policy_version 856739 (0.00091) [2022-07-10 19:19:07,312][25689] Fps is (10 sec: 5348.4, 60 sec: 5537.8, 300 sec: 5542.5). Total num frames: 877308928. Throughput: 0: 5697.9. Samples: 877309036. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:07,312][25689] Avg episode reward: [(0, '-1.269')] [2022-07-10 19:19:07,521][26022] Updated weights on worker 0-0, policy_version 856749 (0.00092) [2022-07-10 19:19:09,131][26022] Updated weights on worker 0-0, policy_version 856759 (0.00083) [2022-07-10 19:19:11,398][26022] Updated weights on worker 0-0, policy_version 856769 (0.00079) [2022-07-10 19:19:12,323][25689] Fps is (10 sec: 5563.2, 60 sec: 5572.2, 300 sec: 5546.0). Total num frames: 877337600. Throughput: 0: 5702.9. Samples: 877342708. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:12,324][25689] Avg episode reward: [(0, '-1.370')] [2022-07-10 19:19:12,725][26022] Updated weights on worker 0-0, policy_version 856779 (0.00095) [2022-07-10 19:19:14,980][26022] Updated weights on worker 0-0, policy_version 856789 (0.00093) [2022-07-10 19:19:16,367][26022] Updated weights on worker 0-0, policy_version 856799 (0.00089) [2022-07-10 19:19:17,341][25689] Fps is (10 sec: 5615.8, 60 sec: 5538.1, 300 sec: 5545.9). Total num frames: 877365248. Throughput: 0: 4893.1. Samples: 877359352. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:17,343][25689] Avg episode reward: [(0, '-0.475')] [2022-07-10 19:19:18,562][26022] Updated weights on worker 0-0, policy_version 856809 (0.00093) [2022-07-10 19:19:20,246][26022] Updated weights on worker 0-0, policy_version 856819 (0.00087) [2022-07-10 19:19:22,158][26022] Updated weights on worker 0-0, policy_version 856829 (0.00080) [2022-07-10 19:19:22,370][25689] Fps is (10 sec: 5605.4, 60 sec: 5553.1, 300 sec: 5542.8). Total num frames: 877393920. Throughput: 0: 5731.1. Samples: 877393070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:22,372][25689] Avg episode reward: [(0, '-0.973')] [2022-07-10 19:19:24,055][26022] Updated weights on worker 0-0, policy_version 856839 (0.00091) [2022-07-10 19:19:25,816][26022] Updated weights on worker 0-0, policy_version 856849 (0.00096) [2022-07-10 19:19:27,461][25689] Fps is (10 sec: 5565.3, 60 sec: 5552.2, 300 sec: 5546.2). Total num frames: 877421568. Throughput: 0: 5817.7. Samples: 877426256. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:27,463][25689] Avg episode reward: [(0, '-0.503')] [2022-07-10 19:19:27,725][26022] Updated weights on worker 0-0, policy_version 856859 (0.00090) [2022-07-10 19:19:29,476][26022] Updated weights on worker 0-0, policy_version 856869 (0.00087) [2022-07-10 19:19:31,185][26022] Updated weights on worker 0-0, policy_version 856879 (0.00088) [2022-07-10 19:19:32,471][25689] Fps is (10 sec: 5575.6, 60 sec: 5553.1, 300 sec: 5546.4). Total num frames: 877450240. Throughput: 0: 4981.5. Samples: 877443078. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:32,472][25689] Avg episode reward: [(0, '-0.619')] [2022-07-10 19:19:33,164][26022] Updated weights on worker 0-0, policy_version 856889 (0.00086) [2022-07-10 19:19:34,991][26022] Updated weights on worker 0-0, policy_version 856899 (0.00093) [2022-07-10 19:19:36,911][26022] Updated weights on worker 0-0, policy_version 856909 (0.00099) [2022-07-10 19:19:37,473][25689] Fps is (10 sec: 5625.0, 60 sec: 5538.9, 300 sec: 5543.7). Total num frames: 877477888. Throughput: 0: 5830.1. Samples: 877476726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:37,475][25689] Avg episode reward: [(0, '-0.002')] [2022-07-10 19:19:38,739][26022] Updated weights on worker 0-0, policy_version 856919 (0.00084) [2022-07-10 19:19:40,411][26022] Updated weights on worker 0-0, policy_version 856929 (0.00092) [2022-07-10 19:19:42,467][26022] Updated weights on worker 0-0, policy_version 856939 (0.00091) [2022-07-10 19:19:42,535][25689] Fps is (10 sec: 5494.7, 60 sec: 5518.8, 300 sec: 5543.8). Total num frames: 877505536. Throughput: 0: 5808.1. Samples: 877510190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:42,536][25689] Avg episode reward: [(0, '0.131')] [2022-07-10 19:19:44,022][26022] Updated weights on worker 0-0, policy_version 856949 (0.00095) [2022-07-10 19:19:46,029][26022] Updated weights on worker 0-0, policy_version 856959 (0.00085) [2022-07-10 19:19:47,658][25689] Fps is (10 sec: 5630.2, 60 sec: 5547.0, 300 sec: 5548.4). Total num frames: 877535232. Throughput: 0: 5823.7. Samples: 877543882. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:47,660][25689] Avg episode reward: [(0, '-0.277')] [2022-07-10 19:19:47,698][26022] Updated weights on worker 0-0, policy_version 856969 (0.00087) [2022-07-10 19:19:49,596][26022] Updated weights on worker 0-0, policy_version 856979 (0.00085) [2022-07-10 19:19:51,410][26022] Updated weights on worker 0-0, policy_version 856989 (0.00091) [2022-07-10 19:19:52,669][25689] Fps is (10 sec: 5557.4, 60 sec: 5547.7, 300 sec: 5542.6). Total num frames: 877561856. Throughput: 0: 5820.0. Samples: 877560630. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:52,670][25689] Avg episode reward: [(0, '0.246')] [2022-07-10 19:19:53,017][26022] Updated weights on worker 0-0, policy_version 856999 (0.00091) [2022-07-10 19:19:55,116][26022] Updated weights on worker 0-0, policy_version 857009 (0.00090) [2022-07-10 19:19:57,147][26022] Updated weights on worker 0-0, policy_version 857019 (0.00089) [2022-07-10 19:19:57,707][25689] Fps is (10 sec: 5503.0, 60 sec: 5547.7, 300 sec: 5545.4). Total num frames: 877590528. Throughput: 0: 5799.5. Samples: 877594070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:19:57,709][25689] Avg episode reward: [(0, '-0.191')] [2022-07-10 19:19:58,843][26022] Updated weights on worker 0-0, policy_version 857029 (0.00091) [2022-07-10 19:20:00,676][26022] Updated weights on worker 0-0, policy_version 857039 (0.00095) [2022-07-10 19:20:02,727][25689] Fps is (10 sec: 5395.8, 60 sec: 5532.1, 300 sec: 5546.8). Total num frames: 877616128. Throughput: 0: 5709.9. Samples: 877625486. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:02,728][25689] Avg episode reward: [(0, '-1.228')] [2022-07-10 19:20:03,148][26022] Updated weights on worker 0-0, policy_version 857049 (0.00098) [2022-07-10 19:20:04,708][26022] Updated weights on worker 0-0, policy_version 857059 (0.00090) [2022-07-10 19:20:06,571][26022] Updated weights on worker 0-0, policy_version 857069 (0.00085) [2022-07-10 19:20:07,812][25689] Fps is (10 sec: 5370.9, 60 sec: 5551.6, 300 sec: 5547.0). Total num frames: 877644800. Throughput: 0: 4889.7. Samples: 877642426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:07,812][25689] Avg episode reward: [(0, '-1.953')] [2022-07-10 19:20:08,458][26022] Updated weights on worker 0-0, policy_version 857079 (0.00115) [2022-07-10 19:20:10,283][26022] Updated weights on worker 0-0, policy_version 857089 (0.00087) [2022-07-10 19:20:12,253][26022] Updated weights on worker 0-0, policy_version 857099 (0.00095) [2022-07-10 19:20:12,868][25689] Fps is (10 sec: 5654.7, 60 sec: 5547.4, 300 sec: 5546.8). Total num frames: 877673472. Throughput: 0: 5701.7. Samples: 877675800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:12,870][25689] Avg episode reward: [(0, '-2.138')] [2022-07-10 19:20:13,828][26022] Updated weights on worker 0-0, policy_version 857109 (0.00086) [2022-07-10 19:20:15,683][26022] Updated weights on worker 0-0, policy_version 857119 (0.00109) [2022-07-10 19:20:17,612][26022] Updated weights on worker 0-0, policy_version 857129 (0.00087) [2022-07-10 19:20:17,906][25689] Fps is (10 sec: 5579.6, 60 sec: 5545.7, 300 sec: 5546.6). Total num frames: 877701120. Throughput: 0: 5713.4. Samples: 877709474. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:17,906][25689] Avg episode reward: [(0, '-1.840')] [2022-07-10 19:20:19,414][26022] Updated weights on worker 0-0, policy_version 857139 (0.00087) [2022-07-10 19:20:21,180][26022] Updated weights on worker 0-0, policy_version 857149 (0.00087) [2022-07-10 19:20:22,908][25689] Fps is (10 sec: 5507.4, 60 sec: 5531.2, 300 sec: 5547.8). Total num frames: 877728768. Throughput: 0: 4994.7. Samples: 877726284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:22,910][25689] Avg episode reward: [(0, '-1.825')] [2022-07-10 19:20:23,083][26022] Updated weights on worker 0-0, policy_version 857159 (0.00096) [2022-07-10 19:20:24,850][26022] Updated weights on worker 0-0, policy_version 857169 (0.00099) [2022-07-10 19:20:26,823][26022] Updated weights on worker 0-0, policy_version 857179 (0.00060) [2022-07-10 19:20:27,979][25689] Fps is (10 sec: 5590.8, 60 sec: 5549.9, 300 sec: 5543.7). Total num frames: 877757440. Throughput: 0: 5815.8. Samples: 877759716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:27,980][25689] Avg episode reward: [(0, '-2.591')] [2022-07-10 19:20:28,583][26022] Updated weights on worker 0-0, policy_version 857189 (0.00092) [2022-07-10 19:20:30,413][26022] Updated weights on worker 0-0, policy_version 857199 (0.00083) [2022-07-10 19:20:32,379][26022] Updated weights on worker 0-0, policy_version 857209 (0.00084) [2022-07-10 19:20:32,981][25689] Fps is (10 sec: 5693.4, 60 sec: 5550.8, 300 sec: 5548.4). Total num frames: 877786112. Throughput: 0: 5836.0. Samples: 877793176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:32,981][25689] Avg episode reward: [(0, '-0.655')] [2022-07-10 19:20:34,219][26022] Updated weights on worker 0-0, policy_version 857219 (0.00087) [2022-07-10 19:20:36,026][26022] Updated weights on worker 0-0, policy_version 857229 (0.00093) [2022-07-10 19:20:37,965][26022] Updated weights on worker 0-0, policy_version 857239 (0.00086) [2022-07-10 19:20:38,015][25689] Fps is (10 sec: 5509.7, 60 sec: 5530.8, 300 sec: 5537.5). Total num frames: 877812736. Throughput: 0: 4994.9. Samples: 877809920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:38,016][25689] Avg episode reward: [(0, '-0.234')] [2022-07-10 19:20:39,632][26022] Updated weights on worker 0-0, policy_version 857249 (0.00088) [2022-07-10 19:20:41,571][26022] Updated weights on worker 0-0, policy_version 857259 (0.00083) [2022-07-10 19:20:43,029][25689] Fps is (10 sec: 5503.0, 60 sec: 5552.2, 300 sec: 5539.1). Total num frames: 877841408. Throughput: 0: 5824.6. Samples: 877843478. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:43,029][25689] Avg episode reward: [(0, '-1.038')] [2022-07-10 19:20:43,212][26022] Updated weights on worker 0-0, policy_version 857269 (0.00086) [2022-07-10 19:20:45,134][26022] Updated weights on worker 0-0, policy_version 857279 (0.00089) [2022-07-10 19:20:46,975][26022] Updated weights on worker 0-0, policy_version 857289 (0.00087) [2022-07-10 19:20:48,142][25689] Fps is (10 sec: 5662.9, 60 sec: 5536.2, 300 sec: 5544.1). Total num frames: 877870080. Throughput: 0: 5819.3. Samples: 877877046. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:48,142][25689] Avg episode reward: [(0, '-0.877')] [2022-07-10 19:20:48,762][26022] Updated weights on worker 0-0, policy_version 857299 (0.00480) [2022-07-10 19:20:50,645][26022] Updated weights on worker 0-0, policy_version 857309 (0.00092) [2022-07-10 19:20:52,574][26022] Updated weights on worker 0-0, policy_version 857319 (0.00088) [2022-07-10 19:20:53,185][25689] Fps is (10 sec: 5545.5, 60 sec: 5550.1, 300 sec: 5539.9). Total num frames: 877897728. Throughput: 0: 4965.1. Samples: 877893492. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:53,185][25689] Avg episode reward: [(0, '0.040')] [2022-07-10 19:20:54,311][26022] Updated weights on worker 0-0, policy_version 857329 (0.00096) [2022-07-10 19:20:56,325][26022] Updated weights on worker 0-0, policy_version 857339 (0.00084) [2022-07-10 19:20:57,466][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:20:57,475][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000857345_877921280.pth [2022-07-10 19:20:57,475][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000855394_875923456.pth [2022-07-10 19:20:57,880][26022] Updated weights on worker 0-0, policy_version 857349 (0.00104) [2022-07-10 19:20:58,257][25689] Fps is (10 sec: 5567.9, 60 sec: 5547.0, 300 sec: 5538.7). Total num frames: 877926400. Throughput: 0: 5786.3. Samples: 877927042. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:20:58,257][25689] Avg episode reward: [(0, '0.101')] [2022-07-10 19:21:00,026][26022] Updated weights on worker 0-0, policy_version 857359 (0.00091) [2022-07-10 19:21:02,033][26022] Updated weights on worker 0-0, policy_version 857369 (0.00082) [2022-07-10 19:21:03,345][25689] Fps is (10 sec: 5341.6, 60 sec: 5540.8, 300 sec: 5537.9). Total num frames: 877952000. Throughput: 0: 5649.7. Samples: 877958258. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:03,346][25689] Avg episode reward: [(0, '-0.042')] [2022-07-10 19:21:03,898][26022] Updated weights on worker 0-0, policy_version 857379 (0.00102) [2022-07-10 19:21:05,746][26022] Updated weights on worker 0-0, policy_version 857389 (0.00096) [2022-07-10 19:21:07,649][26022] Updated weights on worker 0-0, policy_version 857399 (0.00089) [2022-07-10 19:21:08,427][25689] Fps is (10 sec: 5336.5, 60 sec: 5541.1, 300 sec: 5537.6). Total num frames: 877980672. Throughput: 0: 4824.7. Samples: 877974916. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:08,427][25689] Avg episode reward: [(0, '-0.278')] [2022-07-10 19:21:09,473][26022] Updated weights on worker 0-0, policy_version 857409 (0.00084) [2022-07-10 19:21:11,238][26022] Updated weights on worker 0-0, policy_version 857419 (0.00087) [2022-07-10 19:21:13,080][26022] Updated weights on worker 0-0, policy_version 857429 (0.00082) [2022-07-10 19:21:13,431][25689] Fps is (10 sec: 5685.7, 60 sec: 5545.9, 300 sec: 5541.9). Total num frames: 878009344. Throughput: 0: 5670.8. Samples: 878008302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:13,431][25689] Avg episode reward: [(0, '0.636')] [2022-07-10 19:21:15,019][26022] Updated weights on worker 0-0, policy_version 857439 (0.00085) [2022-07-10 19:21:16,903][26022] Updated weights on worker 0-0, policy_version 857449 (0.00084) [2022-07-10 19:21:18,440][25689] Fps is (10 sec: 5522.2, 60 sec: 5531.5, 300 sec: 5539.1). Total num frames: 878035968. Throughput: 0: 5691.3. Samples: 878041912. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:18,441][25689] Avg episode reward: [(0, '0.605')] [2022-07-10 19:21:18,674][26022] Updated weights on worker 0-0, policy_version 857459 (0.00084) [2022-07-10 19:21:20,530][26022] Updated weights on worker 0-0, policy_version 857469 (0.00090) [2022-07-10 19:21:22,221][26022] Updated weights on worker 0-0, policy_version 857479 (0.00091) [2022-07-10 19:21:23,442][25689] Fps is (10 sec: 5421.0, 60 sec: 5531.6, 300 sec: 5540.2). Total num frames: 878063616. Throughput: 0: 5000.9. Samples: 878058762. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:23,443][25689] Avg episode reward: [(0, '-0.851')] [2022-07-10 19:21:24,146][26022] Updated weights on worker 0-0, policy_version 857489 (0.00052) [2022-07-10 19:21:25,851][26022] Updated weights on worker 0-0, policy_version 857499 (0.00093) [2022-07-10 19:21:27,838][26022] Updated weights on worker 0-0, policy_version 857509 (0.00091) [2022-07-10 19:21:28,507][25689] Fps is (10 sec: 5594.7, 60 sec: 5532.2, 300 sec: 5539.6). Total num frames: 878092288. Throughput: 0: 5848.6. Samples: 878092356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:28,507][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 19:21:29,827][26022] Updated weights on worker 0-0, policy_version 857519 (0.00091) [2022-07-10 19:21:31,444][26022] Updated weights on worker 0-0, policy_version 857529 (0.00203) [2022-07-10 19:21:33,394][26022] Updated weights on worker 0-0, policy_version 857539 (0.00086) [2022-07-10 19:21:33,535][25689] Fps is (10 sec: 5580.4, 60 sec: 5512.8, 300 sec: 5539.2). Total num frames: 878119936. Throughput: 0: 5830.9. Samples: 878125526. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:33,535][25689] Avg episode reward: [(0, '-0.780')] [2022-07-10 19:21:35,084][26022] Updated weights on worker 0-0, policy_version 857549 (0.01053) [2022-07-10 19:21:36,926][26022] Updated weights on worker 0-0, policy_version 857559 (0.00081) [2022-07-10 19:21:38,573][25689] Fps is (10 sec: 5595.1, 60 sec: 5546.3, 300 sec: 5538.5). Total num frames: 878148608. Throughput: 0: 4987.0. Samples: 878142314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:38,573][25689] Avg episode reward: [(0, '-0.515')] [2022-07-10 19:21:38,732][26022] Updated weights on worker 0-0, policy_version 857569 (0.00089) [2022-07-10 19:21:40,745][26022] Updated weights on worker 0-0, policy_version 857579 (0.00091) [2022-07-10 19:21:42,462][26022] Updated weights on worker 0-0, policy_version 857589 (0.00096) [2022-07-10 19:21:43,583][25689] Fps is (10 sec: 5604.8, 60 sec: 5529.7, 300 sec: 5539.8). Total num frames: 878176256. Throughput: 0: 5832.2. Samples: 878176228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:43,584][25689] Avg episode reward: [(0, '-0.676')] [2022-07-10 19:21:44,383][26022] Updated weights on worker 0-0, policy_version 857599 (0.00088) [2022-07-10 19:21:46,074][26022] Updated weights on worker 0-0, policy_version 857609 (0.00092) [2022-07-10 19:21:47,976][26022] Updated weights on worker 0-0, policy_version 857619 (0.00092) [2022-07-10 19:21:48,654][25689] Fps is (10 sec: 5586.8, 60 sec: 5533.6, 300 sec: 5539.4). Total num frames: 878204928. Throughput: 0: 5837.1. Samples: 878209956. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:48,654][25689] Avg episode reward: [(0, '-0.335')] [2022-07-10 19:21:49,679][26022] Updated weights on worker 0-0, policy_version 857629 (0.00087) [2022-07-10 19:21:51,627][26022] Updated weights on worker 0-0, policy_version 857639 (0.00086) [2022-07-10 19:21:53,245][26022] Updated weights on worker 0-0, policy_version 857649 (0.00081) [2022-07-10 19:21:53,682][25689] Fps is (10 sec: 5779.9, 60 sec: 5568.8, 300 sec: 5545.9). Total num frames: 878234624. Throughput: 0: 5024.6. Samples: 878226756. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:53,682][25689] Avg episode reward: [(0, '-0.856')] [2022-07-10 19:21:55,364][26022] Updated weights on worker 0-0, policy_version 857659 (0.00083) [2022-07-10 19:21:57,010][26022] Updated weights on worker 0-0, policy_version 857669 (0.00092) [2022-07-10 19:21:58,692][25689] Fps is (10 sec: 5610.3, 60 sec: 5540.6, 300 sec: 5539.5). Total num frames: 878261248. Throughput: 0: 5891.2. Samples: 878260842. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:21:58,693][25689] Avg episode reward: [(0, '-0.988')] [2022-07-10 19:21:59,056][26022] Updated weights on worker 0-0, policy_version 857679 (0.00091) [2022-07-10 19:22:00,720][26022] Updated weights on worker 0-0, policy_version 857689 (0.00086) [2022-07-10 19:22:03,083][26022] Updated weights on worker 0-0, policy_version 857699 (0.00088) [2022-07-10 19:22:03,695][25689] Fps is (10 sec: 5215.6, 60 sec: 5548.5, 300 sec: 5537.6). Total num frames: 878286848. Throughput: 0: 5760.2. Samples: 878292074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:22:03,695][25689] Avg episode reward: [(0, '-1.044')] [2022-07-10 19:22:04,694][26022] Updated weights on worker 0-0, policy_version 857709 (0.00089) [2022-07-10 19:22:06,643][26022] Updated weights on worker 0-0, policy_version 857719 (0.00091) [2022-07-10 19:22:08,380][26022] Updated weights on worker 0-0, policy_version 857729 (0.00094) [2022-07-10 19:22:08,797][25689] Fps is (10 sec: 5371.1, 60 sec: 5546.6, 300 sec: 5543.3). Total num frames: 878315520. Throughput: 0: 5727.7. Samples: 878325330. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:22:08,797][25689] Avg episode reward: [(0, '-0.814')] [2022-07-10 19:22:10,366][26022] Updated weights on worker 0-0, policy_version 857739 (0.00088) [2022-07-10 19:22:12,123][26022] Updated weights on worker 0-0, policy_version 857749 (0.00083) [2022-07-10 19:22:13,837][25689] Fps is (10 sec: 5653.7, 60 sec: 5543.2, 300 sec: 5542.9). Total num frames: 878344192. Throughput: 0: 5731.8. Samples: 878342284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:22:13,839][25689] Avg episode reward: [(0, '-1.059')] [2022-07-10 19:22:13,871][26022] Updated weights on worker 0-0, policy_version 857759 (0.00093) [2022-07-10 19:22:15,723][26022] Updated weights on worker 0-0, policy_version 857769 (0.00086) [2022-07-10 19:22:17,687][26022] Updated weights on worker 0-0, policy_version 857779 (0.00087) [2022-07-10 19:22:18,856][25689] Fps is (10 sec: 5802.6, 60 sec: 5593.3, 300 sec: 5546.6). Total num frames: 878373888. Throughput: 0: 5722.1. Samples: 878376218. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-10 19:22:18,857][25689] Avg episode reward: [(0, '-0.199')] [2022-07-10 19:22:19,316][26022] Updated weights on worker 0-0, policy_version 857789 (0.00089) [2022-07-10 19:22:21,282][26022] Updated weights on worker 0-0, policy_version 857799 (0.00088) [2022-07-10 19:22:23,138][26022] Updated weights on worker 0-0, policy_version 857809 (0.00087) [2022-07-10 19:22:23,871][25689] Fps is (10 sec: 5715.0, 60 sec: 5592.0, 300 sec: 5551.5). Total num frames: 878401536. Throughput: 0: 5836.0. Samples: 878409824. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:23,873][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 19:22:24,822][26022] Updated weights on worker 0-0, policy_version 857819 (0.00096) [2022-07-10 19:22:26,786][26022] Updated weights on worker 0-0, policy_version 857829 (0.00086) [2022-07-10 19:22:28,649][26022] Updated weights on worker 0-0, policy_version 857839 (0.00086) [2022-07-10 19:22:28,919][25689] Fps is (10 sec: 5393.2, 60 sec: 5559.7, 300 sec: 5540.8). Total num frames: 878428160. Throughput: 0: 5029.5. Samples: 878426534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:28,919][25689] Avg episode reward: [(0, '0.063')] [2022-07-10 19:22:30,460][26022] Updated weights on worker 0-0, policy_version 857849 (0.00090) [2022-07-10 19:22:32,414][26022] Updated weights on worker 0-0, policy_version 857859 (0.01185) [2022-07-10 19:22:33,956][25689] Fps is (10 sec: 5482.8, 60 sec: 5575.7, 300 sec: 5547.2). Total num frames: 878456832. Throughput: 0: 5847.6. Samples: 878459934. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:33,957][25689] Avg episode reward: [(0, '-0.604')] [2022-07-10 19:22:34,066][26022] Updated weights on worker 0-0, policy_version 857869 (0.00088) [2022-07-10 19:22:35,830][26022] Updated weights on worker 0-0, policy_version 857879 (0.00119) [2022-07-10 19:22:37,763][26022] Updated weights on worker 0-0, policy_version 857889 (0.00091) [2022-07-10 19:22:39,028][25689] Fps is (10 sec: 5773.6, 60 sec: 5589.6, 300 sec: 5550.5). Total num frames: 878486528. Throughput: 0: 5830.2. Samples: 878493828. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:39,029][25689] Avg episode reward: [(0, '-0.444')] [2022-07-10 19:22:39,433][26022] Updated weights on worker 0-0, policy_version 857899 (0.00091) [2022-07-10 19:22:41,559][26022] Updated weights on worker 0-0, policy_version 857909 (0.00086) [2022-07-10 19:22:43,322][26022] Updated weights on worker 0-0, policy_version 857919 (0.00086) [2022-07-10 19:22:44,071][25689] Fps is (10 sec: 5669.3, 60 sec: 5586.5, 300 sec: 5548.0). Total num frames: 878514176. Throughput: 0: 4993.2. Samples: 878510688. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:44,072][25689] Avg episode reward: [(0, '-0.253')] [2022-07-10 19:22:44,909][26022] Updated weights on worker 0-0, policy_version 857929 (0.00083) [2022-07-10 19:22:46,998][26022] Updated weights on worker 0-0, policy_version 857939 (0.00090) [2022-07-10 19:22:48,500][26022] Updated weights on worker 0-0, policy_version 857949 (0.00093) [2022-07-10 19:22:49,142][25689] Fps is (10 sec: 5467.6, 60 sec: 5569.6, 300 sec: 5544.1). Total num frames: 878541824. Throughput: 0: 5817.2. Samples: 878544176. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:49,142][25689] Avg episode reward: [(0, '-1.302')] [2022-07-10 19:22:50,569][26022] Updated weights on worker 0-0, policy_version 857959 (0.00091) [2022-07-10 19:22:52,292][26022] Updated weights on worker 0-0, policy_version 857969 (0.00089) [2022-07-10 19:22:54,158][25689] Fps is (10 sec: 5482.2, 60 sec: 5536.9, 300 sec: 5550.8). Total num frames: 878569472. Throughput: 0: 5831.6. Samples: 878577740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:54,158][25689] Avg episode reward: [(0, '-2.039')] [2022-07-10 19:22:54,174][26022] Updated weights on worker 0-0, policy_version 857979 (0.00085) [2022-07-10 19:22:56,076][26022] Updated weights on worker 0-0, policy_version 857989 (0.00092) [2022-07-10 19:22:57,823][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:22:57,835][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000857998_878589952.pth [2022-07-10 19:22:57,836][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000856045_876590080.pth [2022-07-10 19:22:58,006][26022] Updated weights on worker 0-0, policy_version 857999 (0.00092) [2022-07-10 19:22:59,238][25689] Fps is (10 sec: 5679.5, 60 sec: 5581.2, 300 sec: 5549.8). Total num frames: 878599168. Throughput: 0: 4977.2. Samples: 878594418. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:22:59,239][25689] Avg episode reward: [(0, '-2.742')] [2022-07-10 19:22:59,706][26022] Updated weights on worker 0-0, policy_version 858009 (0.00104) [2022-07-10 19:23:01,758][26022] Updated weights on worker 0-0, policy_version 858019 (0.00087) [2022-07-10 19:23:03,672][26022] Updated weights on worker 0-0, policy_version 858029 (0.00085) [2022-07-10 19:23:04,269][25689] Fps is (10 sec: 5266.1, 60 sec: 5544.8, 300 sec: 5543.1). Total num frames: 878622720. Throughput: 0: 5691.2. Samples: 878625638. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:04,270][25689] Avg episode reward: [(0, '-2.208')] [2022-07-10 19:23:05,751][26022] Updated weights on worker 0-0, policy_version 858039 (0.00098) [2022-07-10 19:23:07,411][26022] Updated weights on worker 0-0, policy_version 858049 (0.00086) [2022-07-10 19:23:09,401][25689] Fps is (10 sec: 5138.6, 60 sec: 5542.0, 300 sec: 5547.8). Total num frames: 878651392. Throughput: 0: 5678.1. Samples: 878659212. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:09,402][25689] Avg episode reward: [(0, '-0.846')] [2022-07-10 19:23:09,521][26022] Updated weights on worker 0-0, policy_version 858059 (0.00091) [2022-07-10 19:23:11,381][26022] Updated weights on worker 0-0, policy_version 858069 (0.00084) [2022-07-10 19:23:13,046][26022] Updated weights on worker 0-0, policy_version 858079 (0.00875) [2022-07-10 19:23:14,437][25689] Fps is (10 sec: 5740.4, 60 sec: 5559.3, 300 sec: 5547.4). Total num frames: 878681088. Throughput: 0: 4844.4. Samples: 878675982. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:14,438][25689] Avg episode reward: [(0, '-1.103')] [2022-07-10 19:23:14,733][26022] Updated weights on worker 0-0, policy_version 858089 (0.00086) [2022-07-10 19:23:16,559][26022] Updated weights on worker 0-0, policy_version 858099 (0.00108) [2022-07-10 19:23:18,337][26022] Updated weights on worker 0-0, policy_version 858109 (0.00089) [2022-07-10 19:23:19,456][25689] Fps is (10 sec: 5601.6, 60 sec: 5508.6, 300 sec: 5543.8). Total num frames: 878707712. Throughput: 0: 5690.9. Samples: 878709474. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:19,456][25689] Avg episode reward: [(0, '-1.795')] [2022-07-10 19:23:20,437][26022] Updated weights on worker 0-0, policy_version 858119 (0.00091) [2022-07-10 19:23:22,135][26022] Updated weights on worker 0-0, policy_version 858129 (0.00098) [2022-07-10 19:23:24,123][26022] Updated weights on worker 0-0, policy_version 858139 (0.00083) [2022-07-10 19:23:24,469][25689] Fps is (10 sec: 5409.9, 60 sec: 5508.8, 300 sec: 5545.0). Total num frames: 878735360. Throughput: 0: 5811.7. Samples: 878743034. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:24,470][25689] Avg episode reward: [(0, '-0.741')] [2022-07-10 19:23:25,709][26022] Updated weights on worker 0-0, policy_version 858149 (0.00091) [2022-07-10 19:23:27,948][26022] Updated weights on worker 0-0, policy_version 858159 (0.00103) [2022-07-10 19:23:29,595][25689] Fps is (10 sec: 5554.7, 60 sec: 5535.5, 300 sec: 5543.1). Total num frames: 878764032. Throughput: 0: 4965.8. Samples: 878759488. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:29,596][25689] Avg episode reward: [(0, '-0.140')] [2022-07-10 19:23:29,639][26022] Updated weights on worker 0-0, policy_version 858169 (0.00089) [2022-07-10 19:23:31,474][26022] Updated weights on worker 0-0, policy_version 858179 (0.00081) [2022-07-10 19:23:33,320][26022] Updated weights on worker 0-0, policy_version 858189 (0.00085) [2022-07-10 19:23:34,611][25689] Fps is (10 sec: 5654.1, 60 sec: 5537.4, 300 sec: 5543.4). Total num frames: 878792704. Throughput: 0: 5789.7. Samples: 878792784. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:34,612][25689] Avg episode reward: [(0, '-0.598')] [2022-07-10 19:23:35,063][26022] Updated weights on worker 0-0, policy_version 858199 (0.00088) [2022-07-10 19:23:37,179][26022] Updated weights on worker 0-0, policy_version 858209 (0.00089) [2022-07-10 19:23:38,733][26022] Updated weights on worker 0-0, policy_version 858219 (0.00086) [2022-07-10 19:23:39,666][25689] Fps is (10 sec: 5490.5, 60 sec: 5488.3, 300 sec: 5536.0). Total num frames: 878819328. Throughput: 0: 5771.5. Samples: 878826118. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:39,667][25689] Avg episode reward: [(0, '-0.830')] [2022-07-10 19:23:40,701][26022] Updated weights on worker 0-0, policy_version 858229 (0.00087) [2022-07-10 19:23:42,499][26022] Updated weights on worker 0-0, policy_version 858239 (0.00061) [2022-07-10 19:23:44,226][26022] Updated weights on worker 0-0, policy_version 858249 (0.00085) [2022-07-10 19:23:44,722][25689] Fps is (10 sec: 5671.8, 60 sec: 5537.8, 300 sec: 5546.4). Total num frames: 878850048. Throughput: 0: 4933.5. Samples: 878842950. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:44,722][25689] Avg episode reward: [(0, '-0.175')] [2022-07-10 19:23:46,151][26022] Updated weights on worker 0-0, policy_version 858259 (0.00524) [2022-07-10 19:23:47,990][26022] Updated weights on worker 0-0, policy_version 858269 (0.00092) [2022-07-10 19:23:49,847][25689] Fps is (10 sec: 5632.6, 60 sec: 5516.0, 300 sec: 5544.4). Total num frames: 878876672. Throughput: 0: 5786.3. Samples: 878876672. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:49,847][25689] Avg episode reward: [(0, '-0.198')] [2022-07-10 19:23:49,974][26022] Updated weights on worker 0-0, policy_version 858279 (0.00088) [2022-07-10 19:23:51,736][26022] Updated weights on worker 0-0, policy_version 858289 (0.00091) [2022-07-10 19:23:53,464][26022] Updated weights on worker 0-0, policy_version 858299 (0.00093) [2022-07-10 19:23:54,864][25689] Fps is (10 sec: 5451.9, 60 sec: 5532.7, 300 sec: 5544.8). Total num frames: 878905344. Throughput: 0: 5778.9. Samples: 878909824. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:54,865][25689] Avg episode reward: [(0, '-0.246')] [2022-07-10 19:23:55,357][26022] Updated weights on worker 0-0, policy_version 858309 (0.00092) [2022-07-10 19:23:57,311][26022] Updated weights on worker 0-0, policy_version 858319 (0.00094) [2022-07-10 19:23:58,976][26022] Updated weights on worker 0-0, policy_version 858329 (0.00091) [2022-07-10 19:23:59,943][25689] Fps is (10 sec: 5578.3, 60 sec: 5499.1, 300 sec: 5547.4). Total num frames: 878932992. Throughput: 0: 4950.3. Samples: 878926496. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:23:59,945][25689] Avg episode reward: [(0, '0.383')] [2022-07-10 19:24:00,876][26022] Updated weights on worker 0-0, policy_version 858339 (0.00090) [2022-07-10 19:24:02,976][26022] Updated weights on worker 0-0, policy_version 858349 (0.00090) [2022-07-10 19:24:04,947][25689] Fps is (10 sec: 5179.4, 60 sec: 5518.4, 300 sec: 5539.1). Total num frames: 878957568. Throughput: 0: 5675.2. Samples: 878957734. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:04,948][25689] Avg episode reward: [(0, '0.640')] [2022-07-10 19:24:05,131][26022] Updated weights on worker 0-0, policy_version 858359 (0.00090) [2022-07-10 19:24:06,719][26022] Updated weights on worker 0-0, policy_version 858369 (0.00101) [2022-07-10 19:24:08,868][26022] Updated weights on worker 0-0, policy_version 858379 (0.00098) [2022-07-10 19:24:10,032][25689] Fps is (10 sec: 5582.6, 60 sec: 5573.5, 300 sec: 5548.1). Total num frames: 878989312. Throughput: 0: 5685.5. Samples: 878991434. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:10,032][25689] Avg episode reward: [(0, '0.013')] [2022-07-10 19:24:10,180][26022] Updated weights on worker 0-0, policy_version 858389 (0.00091) [2022-07-10 19:24:12,203][26022] Updated weights on worker 0-0, policy_version 858399 (0.00089) [2022-07-10 19:24:14,171][26022] Updated weights on worker 0-0, policy_version 858409 (0.00086) [2022-07-10 19:24:15,054][25689] Fps is (10 sec: 5775.0, 60 sec: 5524.0, 300 sec: 5544.6). Total num frames: 879015936. Throughput: 0: 5724.2. Samples: 879025396. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:15,055][25689] Avg episode reward: [(0, '-1.125')] [2022-07-10 19:24:16,023][26022] Updated weights on worker 0-0, policy_version 858419 (0.00089) [2022-07-10 19:24:17,573][26022] Updated weights on worker 0-0, policy_version 858429 (0.00085) [2022-07-10 19:24:19,867][26022] Updated weights on worker 0-0, policy_version 858439 (0.00901) [2022-07-10 19:24:20,064][25689] Fps is (10 sec: 5409.9, 60 sec: 5541.7, 300 sec: 5541.5). Total num frames: 879043584. Throughput: 0: 5755.3. Samples: 879042296. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:20,064][25689] Avg episode reward: [(0, '-0.815')] [2022-07-10 19:24:21,189][26022] Updated weights on worker 0-0, policy_version 858449 (0.00082) [2022-07-10 19:24:23,257][26022] Updated weights on worker 0-0, policy_version 858459 (0.00087) [2022-07-10 19:24:25,068][25689] Fps is (10 sec: 5521.8, 60 sec: 5542.5, 300 sec: 5543.1). Total num frames: 879071232. Throughput: 0: 5880.1. Samples: 879076048. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:25,069][25689] Avg episode reward: [(0, '-1.120')] [2022-07-10 19:24:25,120][26022] Updated weights on worker 0-0, policy_version 858469 (0.00092) [2022-07-10 19:24:26,856][26022] Updated weights on worker 0-0, policy_version 858479 (0.00084) [2022-07-10 19:24:28,824][26022] Updated weights on worker 0-0, policy_version 858489 (0.00090) [2022-07-10 19:24:30,155][25689] Fps is (10 sec: 5682.7, 60 sec: 5563.0, 300 sec: 5545.1). Total num frames: 879100928. Throughput: 0: 5870.0. Samples: 879109556. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:30,155][25689] Avg episode reward: [(0, '-1.198')] [2022-07-10 19:24:30,692][26022] Updated weights on worker 0-0, policy_version 858499 (0.00090) [2022-07-10 19:24:32,325][26022] Updated weights on worker 0-0, policy_version 858509 (0.00087) [2022-07-10 19:24:34,603][26022] Updated weights on worker 0-0, policy_version 858519 (0.00090) [2022-07-10 19:24:35,191][25689] Fps is (10 sec: 5665.1, 60 sec: 5544.3, 300 sec: 5544.5). Total num frames: 879128576. Throughput: 0: 5000.9. Samples: 879126096. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:35,191][25689] Avg episode reward: [(0, '-1.089')] [2022-07-10 19:24:35,819][26022] Updated weights on worker 0-0, policy_version 858529 (0.00095) [2022-07-10 19:24:38,153][26022] Updated weights on worker 0-0, policy_version 858539 (0.00079) [2022-07-10 19:24:39,662][26022] Updated weights on worker 0-0, policy_version 858549 (0.00090) [2022-07-10 19:24:40,203][25689] Fps is (10 sec: 5503.3, 60 sec: 5565.2, 300 sec: 5545.4). Total num frames: 879156224. Throughput: 0: 5841.2. Samples: 879159930. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:40,203][25689] Avg episode reward: [(0, '-0.392')] [2022-07-10 19:24:41,683][26022] Updated weights on worker 0-0, policy_version 858559 (0.00081) [2022-07-10 19:24:43,574][26022] Updated weights on worker 0-0, policy_version 858569 (0.00081) [2022-07-10 19:24:45,199][26022] Updated weights on worker 0-0, policy_version 858579 (0.00085) [2022-07-10 19:24:45,207][25689] Fps is (10 sec: 5520.7, 60 sec: 5519.1, 300 sec: 5540.7). Total num frames: 879183872. Throughput: 0: 5838.9. Samples: 879193634. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:45,207][25689] Avg episode reward: [(0, '-0.953')] [2022-07-10 19:24:47,169][26022] Updated weights on worker 0-0, policy_version 858589 (0.00118) [2022-07-10 19:24:48,899][26022] Updated weights on worker 0-0, policy_version 858599 (0.00092) [2022-07-10 19:24:50,265][25689] Fps is (10 sec: 5597.2, 60 sec: 5559.2, 300 sec: 5546.7). Total num frames: 879212544. Throughput: 0: 5024.5. Samples: 879210598. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:50,265][25689] Avg episode reward: [(0, '-1.066')] [2022-07-10 19:24:50,779][26022] Updated weights on worker 0-0, policy_version 858609 (0.00084) [2022-07-10 19:24:52,495][26022] Updated weights on worker 0-0, policy_version 858619 (0.00089) [2022-07-10 19:24:54,263][26022] Updated weights on worker 0-0, policy_version 858629 (0.00088) [2022-07-10 19:24:55,282][25689] Fps is (10 sec: 5691.4, 60 sec: 5559.1, 300 sec: 5547.1). Total num frames: 879241216. Throughput: 0: 5880.4. Samples: 879244242. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:24:55,283][25689] Avg episode reward: [(0, '-1.398')] [2022-07-10 19:24:56,235][26022] Updated weights on worker 0-0, policy_version 858639 (0.00088) [2022-07-10 19:24:58,028][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:24:58,039][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000858649_879256576.pth [2022-07-10 19:24:58,043][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000856695_877255680.pth [2022-07-10 19:24:58,046][26022] Updated weights on worker 0-0, policy_version 858649 (0.00092) [2022-07-10 19:25:00,147][26022] Updated weights on worker 0-0, policy_version 858659 (0.00087) [2022-07-10 19:25:00,312][25689] Fps is (10 sec: 5605.3, 60 sec: 5563.6, 300 sec: 5553.8). Total num frames: 879268864. Throughput: 0: 5859.8. Samples: 879277768. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:00,313][25689] Avg episode reward: [(0, '-0.694')] [2022-07-10 19:25:02,197][26022] Updated weights on worker 0-0, policy_version 858669 (0.00108) [2022-07-10 19:25:03,993][26022] Updated weights on worker 0-0, policy_version 858679 (0.00050) [2022-07-10 19:25:05,330][25689] Fps is (10 sec: 5299.4, 60 sec: 5579.3, 300 sec: 5544.7). Total num frames: 879294464. Throughput: 0: 4896.5. Samples: 879292168. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:05,331][25689] Avg episode reward: [(0, '-1.507')] [2022-07-10 19:25:05,796][26022] Updated weights on worker 0-0, policy_version 858689 (0.00089) [2022-07-10 19:25:07,707][26022] Updated weights on worker 0-0, policy_version 858699 (0.00098) [2022-07-10 19:25:09,508][26022] Updated weights on worker 0-0, policy_version 858709 (0.00097) [2022-07-10 19:25:10,375][25689] Fps is (10 sec: 5291.8, 60 sec: 5515.1, 300 sec: 5541.5). Total num frames: 879322112. Throughput: 0: 5723.2. Samples: 879325692. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:10,377][25689] Avg episode reward: [(0, '-1.599')] [2022-07-10 19:25:11,203][26022] Updated weights on worker 0-0, policy_version 858719 (0.00086) [2022-07-10 19:25:13,178][26022] Updated weights on worker 0-0, policy_version 858729 (0.00052) [2022-07-10 19:25:15,065][26022] Updated weights on worker 0-0, policy_version 858739 (0.01265) [2022-07-10 19:25:15,403][25689] Fps is (10 sec: 5591.4, 60 sec: 5548.6, 300 sec: 5545.1). Total num frames: 879350784. Throughput: 0: 5704.3. Samples: 879359016. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:15,403][25689] Avg episode reward: [(0, '-1.311')] [2022-07-10 19:25:16,867][26022] Updated weights on worker 0-0, policy_version 858749 (0.00087) [2022-07-10 19:25:18,739][26022] Updated weights on worker 0-0, policy_version 858759 (0.00090) [2022-07-10 19:25:20,420][25689] Fps is (10 sec: 5606.7, 60 sec: 5547.9, 300 sec: 5544.9). Total num frames: 879378432. Throughput: 0: 4877.1. Samples: 879375834. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:20,420][25689] Avg episode reward: [(0, '-0.928')] [2022-07-10 19:25:20,518][26022] Updated weights on worker 0-0, policy_version 858769 (0.00077) [2022-07-10 19:25:22,467][26022] Updated weights on worker 0-0, policy_version 858779 (0.00090) [2022-07-10 19:25:24,311][26022] Updated weights on worker 0-0, policy_version 858789 (0.00105) [2022-07-10 19:25:25,423][25689] Fps is (10 sec: 5518.4, 60 sec: 5548.0, 300 sec: 5542.7). Total num frames: 879406080. Throughput: 0: 5821.3. Samples: 879409134. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:25,424][25689] Avg episode reward: [(0, '-1.132')] [2022-07-10 19:25:26,160][26022] Updated weights on worker 0-0, policy_version 858799 (0.00085) [2022-07-10 19:25:27,917][26022] Updated weights on worker 0-0, policy_version 858809 (0.00089) [2022-07-10 19:25:29,632][26022] Updated weights on worker 0-0, policy_version 858819 (0.00090) [2022-07-10 19:25:30,480][25689] Fps is (10 sec: 5598.1, 60 sec: 5533.7, 300 sec: 5541.7). Total num frames: 879434752. Throughput: 0: 5820.5. Samples: 879442716. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:30,481][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 19:25:31,619][26022] Updated weights on worker 0-0, policy_version 858829 (0.00086) [2022-07-10 19:25:33,295][26022] Updated weights on worker 0-0, policy_version 858839 (0.00080) [2022-07-10 19:25:35,322][26022] Updated weights on worker 0-0, policy_version 858849 (0.00088) [2022-07-10 19:25:35,484][25689] Fps is (10 sec: 5597.9, 60 sec: 5536.7, 300 sec: 5545.7). Total num frames: 879462400. Throughput: 0: 4990.6. Samples: 879459234. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:35,484][25689] Avg episode reward: [(0, '-1.781')] [2022-07-10 19:25:37,151][26022] Updated weights on worker 0-0, policy_version 858859 (0.00086) [2022-07-10 19:25:38,886][26022] Updated weights on worker 0-0, policy_version 858869 (0.00084) [2022-07-10 19:25:40,497][25689] Fps is (10 sec: 5520.3, 60 sec: 5536.5, 300 sec: 5542.2). Total num frames: 879490048. Throughput: 0: 5839.3. Samples: 879493072. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:40,498][25689] Avg episode reward: [(0, '-1.228')] [2022-07-10 19:25:40,829][26022] Updated weights on worker 0-0, policy_version 858879 (0.00090) [2022-07-10 19:25:42,565][26022] Updated weights on worker 0-0, policy_version 858889 (0.00083) [2022-07-10 19:25:44,476][26022] Updated weights on worker 0-0, policy_version 858899 (0.00099) [2022-07-10 19:25:45,505][25689] Fps is (10 sec: 5620.0, 60 sec: 5553.2, 300 sec: 5544.2). Total num frames: 879518720. Throughput: 0: 5846.7. Samples: 879526550. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:45,506][25689] Avg episode reward: [(0, '-2.167')] [2022-07-10 19:25:46,159][26022] Updated weights on worker 0-0, policy_version 858909 (0.00094) [2022-07-10 19:25:48,184][26022] Updated weights on worker 0-0, policy_version 858919 (0.00091) [2022-07-10 19:25:49,926][26022] Updated weights on worker 0-0, policy_version 858929 (0.00089) [2022-07-10 19:25:50,597][25689] Fps is (10 sec: 5677.5, 60 sec: 5550.0, 300 sec: 5546.7). Total num frames: 879547392. Throughput: 0: 4993.4. Samples: 879543168. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:50,598][25689] Avg episode reward: [(0, '-2.225')] [2022-07-10 19:25:51,687][26022] Updated weights on worker 0-0, policy_version 858939 (0.00087) [2022-07-10 19:25:53,514][26022] Updated weights on worker 0-0, policy_version 858949 (0.00096) [2022-07-10 19:25:55,392][26022] Updated weights on worker 0-0, policy_version 858959 (0.00090) [2022-07-10 19:25:55,603][25689] Fps is (10 sec: 5577.7, 60 sec: 5534.2, 300 sec: 5544.5). Total num frames: 879575040. Throughput: 0: 5836.1. Samples: 879576648. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 19:25:55,603][25689] Avg episode reward: [(0, '-2.293')] [2022-07-10 19:25:57,175][26022] Updated weights on worker 0-0, policy_version 858969 (0.00084) [2022-07-10 19:25:59,172][26022] Updated weights on worker 0-0, policy_version 858979 (0.00080) [2022-07-10 19:26:00,608][25689] Fps is (10 sec: 5523.5, 60 sec: 5536.4, 300 sec: 5552.9). Total num frames: 879602688. Throughput: 0: 5831.7. Samples: 879610354. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:00,609][25689] Avg episode reward: [(0, '-0.481')] [2022-07-10 19:26:00,908][26022] Updated weights on worker 0-0, policy_version 858989 (0.00082) [2022-07-10 19:26:03,312][26022] Updated weights on worker 0-0, policy_version 858999 (0.00094) [2022-07-10 19:26:05,044][26022] Updated weights on worker 0-0, policy_version 859009 (0.00089) [2022-07-10 19:26:05,614][25689] Fps is (10 sec: 5216.4, 60 sec: 5520.5, 300 sec: 5540.6). Total num frames: 879627264. Throughput: 0: 4885.0. Samples: 879624782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:05,615][25689] Avg episode reward: [(0, '-0.419')] [2022-07-10 19:26:06,919][26022] Updated weights on worker 0-0, policy_version 859019 (0.00080) [2022-07-10 19:26:08,787][26022] Updated weights on worker 0-0, policy_version 859029 (0.00087) [2022-07-10 19:26:10,599][26022] Updated weights on worker 0-0, policy_version 859039 (0.00095) [2022-07-10 19:26:10,689][25689] Fps is (10 sec: 5282.3, 60 sec: 5534.8, 300 sec: 5539.3). Total num frames: 879655936. Throughput: 0: 5724.3. Samples: 879658178. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:10,689][25689] Avg episode reward: [(0, '-0.358')] [2022-07-10 19:26:12,423][26022] Updated weights on worker 0-0, policy_version 859049 (0.00085) [2022-07-10 19:26:14,182][26022] Updated weights on worker 0-0, policy_version 859059 (0.00091) [2022-07-10 19:26:15,771][25689] Fps is (10 sec: 5646.0, 60 sec: 5529.8, 300 sec: 5544.8). Total num frames: 879684608. Throughput: 0: 5712.4. Samples: 879691856. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:15,771][25689] Avg episode reward: [(0, '-0.731')] [2022-07-10 19:26:15,987][26022] Updated weights on worker 0-0, policy_version 859069 (0.00083) [2022-07-10 19:26:17,869][26022] Updated weights on worker 0-0, policy_version 859079 (0.00088) [2022-07-10 19:26:19,652][26022] Updated weights on worker 0-0, policy_version 859089 (0.00086) [2022-07-10 19:26:20,791][25689] Fps is (10 sec: 5575.1, 60 sec: 5529.5, 300 sec: 5544.5). Total num frames: 879712256. Throughput: 0: 4866.7. Samples: 879708576. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:20,791][25689] Avg episode reward: [(0, '-0.700')] [2022-07-10 19:26:21,610][26022] Updated weights on worker 0-0, policy_version 859099 (0.00088) [2022-07-10 19:26:23,256][26022] Updated weights on worker 0-0, policy_version 859109 (0.00089) [2022-07-10 19:26:25,147][26022] Updated weights on worker 0-0, policy_version 859119 (0.00084) [2022-07-10 19:26:25,808][25689] Fps is (10 sec: 5712.9, 60 sec: 5562.2, 300 sec: 5548.8). Total num frames: 879741952. Throughput: 0: 5830.7. Samples: 879742528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:25,809][25689] Avg episode reward: [(0, '-0.703')] [2022-07-10 19:26:27,103][26022] Updated weights on worker 0-0, policy_version 859129 (0.00471) [2022-07-10 19:26:28,715][26022] Updated weights on worker 0-0, policy_version 859139 (0.00083) [2022-07-10 19:26:30,731][26022] Updated weights on worker 0-0, policy_version 859149 (0.00087) [2022-07-10 19:26:30,857][25689] Fps is (10 sec: 5798.5, 60 sec: 5563.0, 300 sec: 5551.9). Total num frames: 879770624. Throughput: 0: 5846.1. Samples: 879776082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:30,857][25689] Avg episode reward: [(0, '-0.712')] [2022-07-10 19:26:32,516][26022] Updated weights on worker 0-0, policy_version 859159 (0.00090) [2022-07-10 19:26:34,303][26022] Updated weights on worker 0-0, policy_version 859169 (0.00087) [2022-07-10 19:26:35,868][25689] Fps is (10 sec: 5293.1, 60 sec: 5511.4, 300 sec: 5538.6). Total num frames: 879795200. Throughput: 0: 5022.5. Samples: 879792796. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:35,869][25689] Avg episode reward: [(0, '-1.284')] [2022-07-10 19:26:36,281][26022] Updated weights on worker 0-0, policy_version 859179 (0.00094) [2022-07-10 19:26:37,823][26022] Updated weights on worker 0-0, policy_version 859189 (0.00088) [2022-07-10 19:26:39,778][26022] Updated weights on worker 0-0, policy_version 859199 (0.00094) [2022-07-10 19:26:40,871][25689] Fps is (10 sec: 5419.6, 60 sec: 5546.3, 300 sec: 5545.6). Total num frames: 879824896. Throughput: 0: 5852.3. Samples: 879826090. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:40,871][25689] Avg episode reward: [(0, '-0.925')] [2022-07-10 19:26:41,667][26022] Updated weights on worker 0-0, policy_version 859209 (0.00093) [2022-07-10 19:26:43,450][26022] Updated weights on worker 0-0, policy_version 859219 (0.00087) [2022-07-10 19:26:45,335][26022] Updated weights on worker 0-0, policy_version 859229 (0.00085) [2022-07-10 19:26:45,918][25689] Fps is (10 sec: 5807.8, 60 sec: 5542.7, 300 sec: 5546.0). Total num frames: 879853568. Throughput: 0: 5832.5. Samples: 879859818. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:45,918][25689] Avg episode reward: [(0, '-1.039')] [2022-07-10 19:26:47,102][26022] Updated weights on worker 0-0, policy_version 859239 (0.00085) [2022-07-10 19:26:48,853][26022] Updated weights on worker 0-0, policy_version 859249 (0.00087) [2022-07-10 19:26:50,819][26022] Updated weights on worker 0-0, policy_version 859259 (0.00094) [2022-07-10 19:26:50,973][25689] Fps is (10 sec: 5574.9, 60 sec: 5529.1, 300 sec: 5538.6). Total num frames: 879881216. Throughput: 0: 4998.6. Samples: 879876636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:50,974][25689] Avg episode reward: [(0, '-1.398')] [2022-07-10 19:26:52,554][26022] Updated weights on worker 0-0, policy_version 859269 (0.00088) [2022-07-10 19:26:54,483][26022] Updated weights on worker 0-0, policy_version 859279 (0.00080) [2022-07-10 19:26:55,979][25689] Fps is (10 sec: 5597.6, 60 sec: 5546.0, 300 sec: 5545.6). Total num frames: 879909888. Throughput: 0: 5843.2. Samples: 879910310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:26:55,980][25689] Avg episode reward: [(0, '-1.996')] [2022-07-10 19:26:56,335][26022] Updated weights on worker 0-0, policy_version 859289 (0.00092) [2022-07-10 19:26:58,051][26022] Updated weights on worker 0-0, policy_version 859299 (0.00059) [2022-07-10 19:26:58,320][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:26:58,334][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000859300_879923200.pth [2022-07-10 19:26:58,334][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000857345_877921280.pth [2022-07-10 19:27:00,051][26022] Updated weights on worker 0-0, policy_version 859309 (0.00103) [2022-07-10 19:27:00,982][25689] Fps is (10 sec: 5729.6, 60 sec: 5563.3, 300 sec: 5556.0). Total num frames: 879938560. Throughput: 0: 5848.6. Samples: 879943710. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:00,982][25689] Avg episode reward: [(0, '-2.210')] [2022-07-10 19:27:01,781][26022] Updated weights on worker 0-0, policy_version 859319 (0.00093) [2022-07-10 19:27:03,965][26022] Updated weights on worker 0-0, policy_version 859329 (0.00095) [2022-07-10 19:27:05,911][26022] Updated weights on worker 0-0, policy_version 859339 (0.00087) [2022-07-10 19:27:06,003][25689] Fps is (10 sec: 5312.5, 60 sec: 5561.8, 300 sec: 5543.7). Total num frames: 879963136. Throughput: 0: 4915.6. Samples: 879958546. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:06,003][25689] Avg episode reward: [(0, '-1.998')] [2022-07-10 19:27:07,663][26022] Updated weights on worker 0-0, policy_version 859349 (0.00086) [2022-07-10 19:27:09,515][26022] Updated weights on worker 0-0, policy_version 859359 (0.00092) [2022-07-10 19:27:11,069][25689] Fps is (10 sec: 5278.8, 60 sec: 5562.6, 300 sec: 5543.2). Total num frames: 879991808. Throughput: 0: 5744.5. Samples: 879992074. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:11,069][25689] Avg episode reward: [(0, '-3.258')] [2022-07-10 19:27:11,318][26022] Updated weights on worker 0-0, policy_version 859369 (0.00086) [2022-07-10 19:27:12,949][26022] Updated weights on worker 0-0, policy_version 859379 (0.00084) [2022-07-10 19:27:15,144][26022] Updated weights on worker 0-0, policy_version 859389 (0.00087) [2022-07-10 19:27:16,111][25689] Fps is (10 sec: 5673.2, 60 sec: 5566.3, 300 sec: 5539.3). Total num frames: 880020480. Throughput: 0: 5753.6. Samples: 880026136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:16,112][25689] Avg episode reward: [(0, '-2.536')] [2022-07-10 19:27:16,606][26022] Updated weights on worker 0-0, policy_version 859399 (0.00629) [2022-07-10 19:27:18,583][26022] Updated weights on worker 0-0, policy_version 859409 (0.00090) [2022-07-10 19:27:20,498][26022] Updated weights on worker 0-0, policy_version 859419 (0.00090) [2022-07-10 19:27:21,127][25689] Fps is (10 sec: 5497.5, 60 sec: 5549.7, 300 sec: 5535.9). Total num frames: 880047104. Throughput: 0: 4923.6. Samples: 880042898. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:21,129][25689] Avg episode reward: [(0, '-3.774')] [2022-07-10 19:27:22,294][26022] Updated weights on worker 0-0, policy_version 859429 (0.00084) [2022-07-10 19:27:24,314][26022] Updated weights on worker 0-0, policy_version 859439 (0.00896) [2022-07-10 19:27:25,873][26022] Updated weights on worker 0-0, policy_version 859449 (0.00092) [2022-07-10 19:27:26,130][25689] Fps is (10 sec: 5723.4, 60 sec: 5568.0, 300 sec: 5550.5). Total num frames: 880077824. Throughput: 0: 5861.3. Samples: 880076518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:26,130][25689] Avg episode reward: [(0, '-3.381')] [2022-07-10 19:27:27,853][26022] Updated weights on worker 0-0, policy_version 859459 (0.00087) [2022-07-10 19:27:29,564][26022] Updated weights on worker 0-0, policy_version 859469 (0.00086) [2022-07-10 19:27:31,221][25689] Fps is (10 sec: 5680.8, 60 sec: 5530.1, 300 sec: 5542.6). Total num frames: 880104448. Throughput: 0: 5850.5. Samples: 880109978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:31,222][25689] Avg episode reward: [(0, '-2.862')] [2022-07-10 19:27:31,317][26022] Updated weights on worker 0-0, policy_version 859479 (0.00096) [2022-07-10 19:27:33,431][26022] Updated weights on worker 0-0, policy_version 859489 (0.00095) [2022-07-10 19:27:34,944][26022] Updated weights on worker 0-0, policy_version 859499 (0.00089) [2022-07-10 19:27:36,269][25689] Fps is (10 sec: 5352.8, 60 sec: 5577.7, 300 sec: 5536.1). Total num frames: 880132096. Throughput: 0: 4988.6. Samples: 880126698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:36,269][25689] Avg episode reward: [(0, '-3.949')] [2022-07-10 19:27:37,047][26022] Updated weights on worker 0-0, policy_version 859509 (0.00077) [2022-07-10 19:27:38,831][26022] Updated weights on worker 0-0, policy_version 859519 (0.00095) [2022-07-10 19:27:40,574][26022] Updated weights on worker 0-0, policy_version 859529 (0.00080) [2022-07-10 19:27:41,292][25689] Fps is (10 sec: 5694.0, 60 sec: 5575.8, 300 sec: 5543.4). Total num frames: 880161792. Throughput: 0: 5813.9. Samples: 880160136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:41,293][25689] Avg episode reward: [(0, '-2.710')] [2022-07-10 19:27:42,529][26022] Updated weights on worker 0-0, policy_version 859539 (0.00085) [2022-07-10 19:27:44,296][26022] Updated weights on worker 0-0, policy_version 859549 (0.00082) [2022-07-10 19:27:45,979][26022] Updated weights on worker 0-0, policy_version 859559 (0.00086) [2022-07-10 19:27:46,325][25689] Fps is (10 sec: 5702.7, 60 sec: 5560.2, 300 sec: 5544.1). Total num frames: 880189440. Throughput: 0: 5819.5. Samples: 880194042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:46,325][25689] Avg episode reward: [(0, '-1.064')] [2022-07-10 19:27:48,291][26022] Updated weights on worker 0-0, policy_version 859569 (0.00084) [2022-07-10 19:27:49,814][26022] Updated weights on worker 0-0, policy_version 859580 (0.00099) [2022-07-10 19:27:51,430][25689] Fps is (10 sec: 5454.8, 60 sec: 5555.6, 300 sec: 5542.5). Total num frames: 880217088. Throughput: 0: 4997.9. Samples: 880210976. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:51,430][25689] Avg episode reward: [(0, '-0.275')] [2022-07-10 19:27:51,906][26022] Updated weights on worker 0-0, policy_version 859590 (0.00078) [2022-07-10 19:27:53,459][26022] Updated weights on worker 0-0, policy_version 859600 (0.00084) [2022-07-10 19:27:55,458][26022] Updated weights on worker 0-0, policy_version 859610 (0.00086) [2022-07-10 19:27:56,465][25689] Fps is (10 sec: 5655.0, 60 sec: 5569.9, 300 sec: 5543.3). Total num frames: 880246784. Throughput: 0: 5836.9. Samples: 880244580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:27:56,465][25689] Avg episode reward: [(0, '-0.123')] [2022-07-10 19:27:57,367][26022] Updated weights on worker 0-0, policy_version 859620 (0.00085) [2022-07-10 19:27:58,975][26022] Updated weights on worker 0-0, policy_version 859630 (0.00097) [2022-07-10 19:28:00,999][26022] Updated weights on worker 0-0, policy_version 859640 (0.00092) [2022-07-10 19:28:01,532][25689] Fps is (10 sec: 5676.3, 60 sec: 5547.0, 300 sec: 5556.4). Total num frames: 880274432. Throughput: 0: 5845.2. Samples: 880278440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:01,533][25689] Avg episode reward: [(0, '-1.356')] [2022-07-10 19:28:02,983][26022] Updated weights on worker 0-0, policy_version 859650 (0.00085) [2022-07-10 19:28:05,047][26022] Updated weights on worker 0-0, policy_version 859660 (0.00094) [2022-07-10 19:28:06,555][25689] Fps is (10 sec: 5378.9, 60 sec: 5580.7, 300 sec: 5551.6). Total num frames: 880301056. Throughput: 0: 5710.5. Samples: 880309566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:06,555][25689] Avg episode reward: [(0, '0.021')] [2022-07-10 19:28:06,879][26022] Updated weights on worker 0-0, policy_version 859670 (0.00089) [2022-07-10 19:28:08,631][26022] Updated weights on worker 0-0, policy_version 859680 (0.00086) [2022-07-10 19:28:10,531][26022] Updated weights on worker 0-0, policy_version 859690 (0.00085) [2022-07-10 19:28:11,690][25689] Fps is (10 sec: 5342.8, 60 sec: 5557.4, 300 sec: 5542.8). Total num frames: 880328704. Throughput: 0: 5690.6. Samples: 880326270. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:11,691][25689] Avg episode reward: [(0, '-1.592')] [2022-07-10 19:28:12,293][26022] Updated weights on worker 0-0, policy_version 859700 (0.00091) [2022-07-10 19:28:14,286][26022] Updated weights on worker 0-0, policy_version 859710 (0.00086) [2022-07-10 19:28:15,904][26022] Updated weights on worker 0-0, policy_version 859720 (0.00092) [2022-07-10 19:28:16,692][25689] Fps is (10 sec: 5555.8, 60 sec: 5561.1, 300 sec: 5550.0). Total num frames: 880357376. Throughput: 0: 5699.6. Samples: 880359866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:16,692][25689] Avg episode reward: [(0, '-1.949')] [2022-07-10 19:28:17,770][26022] Updated weights on worker 0-0, policy_version 859730 (0.00092) [2022-07-10 19:28:19,652][26022] Updated weights on worker 0-0, policy_version 859740 (0.00089) [2022-07-10 19:28:21,587][26022] Updated weights on worker 0-0, policy_version 859750 (0.00095) [2022-07-10 19:28:21,715][25689] Fps is (10 sec: 5618.0, 60 sec: 5577.4, 300 sec: 5549.8). Total num frames: 880385024. Throughput: 0: 5704.1. Samples: 880393566. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:21,715][25689] Avg episode reward: [(0, '-1.804')] [2022-07-10 19:28:23,373][26022] Updated weights on worker 0-0, policy_version 859760 (0.00819) [2022-07-10 19:28:25,329][26022] Updated weights on worker 0-0, policy_version 859770 (0.00848) [2022-07-10 19:28:26,724][25689] Fps is (10 sec: 5511.8, 60 sec: 5526.1, 300 sec: 5548.6). Total num frames: 880412672. Throughput: 0: 4990.9. Samples: 880410230. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:26,725][25689] Avg episode reward: [(0, '-1.342')] [2022-07-10 19:28:26,988][26022] Updated weights on worker 0-0, policy_version 859780 (0.00086) [2022-07-10 19:28:28,996][26022] Updated weights on worker 0-0, policy_version 859790 (0.00092) [2022-07-10 19:28:30,709][26022] Updated weights on worker 0-0, policy_version 859800 (0.00092) [2022-07-10 19:28:31,817][25689] Fps is (10 sec: 5473.8, 60 sec: 5542.9, 300 sec: 5543.7). Total num frames: 880440320. Throughput: 0: 5818.4. Samples: 880443376. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:31,819][25689] Avg episode reward: [(0, '-1.068')] [2022-07-10 19:28:32,684][26022] Updated weights on worker 0-0, policy_version 859810 (0.00090) [2022-07-10 19:28:34,338][26022] Updated weights on worker 0-0, policy_version 859820 (0.00089) [2022-07-10 19:28:36,332][26022] Updated weights on worker 0-0, policy_version 859830 (0.00089) [2022-07-10 19:28:36,848][25689] Fps is (10 sec: 5461.9, 60 sec: 5544.4, 300 sec: 5547.6). Total num frames: 880467968. Throughput: 0: 5782.4. Samples: 880476416. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:36,850][25689] Avg episode reward: [(0, '-1.395')] [2022-07-10 19:28:38,183][26022] Updated weights on worker 0-0, policy_version 859840 (0.00105) [2022-07-10 19:28:40,115][26022] Updated weights on worker 0-0, policy_version 859850 (0.00083) [2022-07-10 19:28:41,815][26022] Updated weights on worker 0-0, policy_version 859860 (0.00094) [2022-07-10 19:28:41,867][25689] Fps is (10 sec: 5604.1, 60 sec: 5527.9, 300 sec: 5541.4). Total num frames: 880496640. Throughput: 0: 4950.9. Samples: 880493336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:41,867][25689] Avg episode reward: [(0, '-0.855')] [2022-07-10 19:28:43,672][26022] Updated weights on worker 0-0, policy_version 859870 (0.00101) [2022-07-10 19:28:45,530][26022] Updated weights on worker 0-0, policy_version 859880 (0.00056) [2022-07-10 19:28:46,879][25689] Fps is (10 sec: 5614.7, 60 sec: 5529.7, 300 sec: 5546.9). Total num frames: 880524288. Throughput: 0: 5798.0. Samples: 880527086. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:46,880][25689] Avg episode reward: [(0, '-0.489')] [2022-07-10 19:28:47,242][26022] Updated weights on worker 0-0, policy_version 859890 (0.00090) [2022-07-10 19:28:49,166][26022] Updated weights on worker 0-0, policy_version 859900 (0.00090) [2022-07-10 19:28:50,860][26022] Updated weights on worker 0-0, policy_version 859910 (0.00101) [2022-07-10 19:28:51,944][25689] Fps is (10 sec: 5588.8, 60 sec: 5550.3, 300 sec: 5546.0). Total num frames: 880552960. Throughput: 0: 5820.0. Samples: 880560516. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:51,946][25689] Avg episode reward: [(0, '-0.535')] [2022-07-10 19:28:52,782][26022] Updated weights on worker 0-0, policy_version 859920 (0.00092) [2022-07-10 19:28:54,643][26022] Updated weights on worker 0-0, policy_version 859930 (0.00086) [2022-07-10 19:28:56,457][26022] Updated weights on worker 0-0, policy_version 859940 (0.00087) [2022-07-10 19:28:56,951][25689] Fps is (10 sec: 5794.7, 60 sec: 5552.8, 300 sec: 5554.3). Total num frames: 880582656. Throughput: 0: 5021.2. Samples: 880577360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:28:56,953][25689] Avg episode reward: [(0, '-1.494')] [2022-07-10 19:28:58,487][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:28:58,496][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000859950_880588800.pth [2022-07-10 19:28:58,497][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000857998_878589952.pth [2022-07-10 19:28:58,502][26022] Updated weights on worker 0-0, policy_version 859950 (0.00085) [2022-07-10 19:29:00,014][26022] Updated weights on worker 0-0, policy_version 859960 (0.00068) [2022-07-10 19:29:01,963][25689] Fps is (10 sec: 5417.0, 60 sec: 5507.1, 300 sec: 5554.1). Total num frames: 880607232. Throughput: 0: 5862.9. Samples: 880611158. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:01,963][25689] Avg episode reward: [(0, '-2.503')] [2022-07-10 19:29:02,506][26022] Updated weights on worker 0-0, policy_version 859970 (0.00633) [2022-07-10 19:29:03,992][26022] Updated weights on worker 0-0, policy_version 859980 (0.00106) [2022-07-10 19:29:06,009][26022] Updated weights on worker 0-0, policy_version 859990 (0.00089) [2022-07-10 19:29:06,967][25689] Fps is (10 sec: 5214.4, 60 sec: 5525.8, 300 sec: 5541.8). Total num frames: 880634880. Throughput: 0: 5762.8. Samples: 880642848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:06,967][25689] Avg episode reward: [(0, '-1.896')] [2022-07-10 19:29:07,748][26022] Updated weights on worker 0-0, policy_version 860000 (0.00090) [2022-07-10 19:29:09,682][26022] Updated weights on worker 0-0, policy_version 860010 (0.00085) [2022-07-10 19:29:11,469][26022] Updated weights on worker 0-0, policy_version 860020 (0.00088) [2022-07-10 19:29:12,025][25689] Fps is (10 sec: 5699.2, 60 sec: 5566.8, 300 sec: 5551.5). Total num frames: 880664576. Throughput: 0: 4942.2. Samples: 880659758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:12,025][25689] Avg episode reward: [(0, '-0.862')] [2022-07-10 19:29:13,172][26022] Updated weights on worker 0-0, policy_version 860030 (0.00091) [2022-07-10 19:29:14,963][26022] Updated weights on worker 0-0, policy_version 860040 (0.00083) [2022-07-10 19:29:16,875][26022] Updated weights on worker 0-0, policy_version 860050 (0.00084) [2022-07-10 19:29:17,049][25689] Fps is (10 sec: 5687.6, 60 sec: 5547.8, 300 sec: 5551.3). Total num frames: 880692224. Throughput: 0: 5802.1. Samples: 880693968. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:17,049][25689] Avg episode reward: [(0, '-1.279')] [2022-07-10 19:29:18,415][26022] Updated weights on worker 0-0, policy_version 860060 (0.00086) [2022-07-10 19:29:20,550][26022] Updated weights on worker 0-0, policy_version 860070 (0.00088) [2022-07-10 19:29:22,015][26022] Updated weights on worker 0-0, policy_version 860080 (0.00080) [2022-07-10 19:29:22,052][25689] Fps is (10 sec: 5719.0, 60 sec: 5583.6, 300 sec: 5558.2). Total num frames: 880721920. Throughput: 0: 5817.7. Samples: 880728028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:22,052][25689] Avg episode reward: [(0, '-1.573')] [2022-07-10 19:29:24,040][26022] Updated weights on worker 0-0, policy_version 860090 (0.00088) [2022-07-10 19:29:25,608][26022] Updated weights on worker 0-0, policy_version 860100 (0.00089) [2022-07-10 19:29:27,083][25689] Fps is (10 sec: 5612.9, 60 sec: 5564.6, 300 sec: 5548.9). Total num frames: 880748544. Throughput: 0: 5093.5. Samples: 880745310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:27,083][25689] Avg episode reward: [(0, '-1.878')] [2022-07-10 19:29:27,546][26022] Updated weights on worker 0-0, policy_version 860110 (0.00087) [2022-07-10 19:29:29,380][26022] Updated weights on worker 0-0, policy_version 860120 (0.00084) [2022-07-10 19:29:31,191][26022] Updated weights on worker 0-0, policy_version 860130 (0.00083) [2022-07-10 19:29:32,139][25689] Fps is (10 sec: 5583.2, 60 sec: 5601.9, 300 sec: 5555.4). Total num frames: 880778240. Throughput: 0: 5930.8. Samples: 880779052. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:32,139][25689] Avg episode reward: [(0, '0.023')] [2022-07-10 19:29:33,019][26022] Updated weights on worker 0-0, policy_version 860140 (0.00080) [2022-07-10 19:29:34,935][26022] Updated weights on worker 0-0, policy_version 860150 (0.00114) [2022-07-10 19:29:36,585][26022] Updated weights on worker 0-0, policy_version 860160 (0.00096) [2022-07-10 19:29:37,165][25689] Fps is (10 sec: 5789.1, 60 sec: 5619.3, 300 sec: 5558.6). Total num frames: 880806912. Throughput: 0: 5916.8. Samples: 880812994. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-10 19:29:37,166][25689] Avg episode reward: [(0, '-0.007')] [2022-07-10 19:29:38,655][26022] Updated weights on worker 0-0, policy_version 860170 (0.00082) [2022-07-10 19:29:40,395][26022] Updated weights on worker 0-0, policy_version 860180 (0.00084) [2022-07-10 19:29:42,179][25689] Fps is (10 sec: 5507.7, 60 sec: 5585.9, 300 sec: 5554.9). Total num frames: 880833536. Throughput: 0: 5072.7. Samples: 880830128. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:29:42,179][25689] Avg episode reward: [(0, '-0.299')] [2022-07-10 19:29:42,199][26022] Updated weights on worker 0-0, policy_version 860190 (0.00087) [2022-07-10 19:29:43,947][26022] Updated weights on worker 0-0, policy_version 860200 (0.00087) [2022-07-10 19:29:45,786][26022] Updated weights on worker 0-0, policy_version 860210 (0.00093) [2022-07-10 19:29:47,198][25689] Fps is (10 sec: 5511.8, 60 sec: 5602.2, 300 sec: 5555.7). Total num frames: 880862208. Throughput: 0: 5899.4. Samples: 880863974. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:29:47,198][25689] Avg episode reward: [(0, '0.989')] [2022-07-10 19:29:47,503][26022] Updated weights on worker 0-0, policy_version 860220 (0.00087) [2022-07-10 19:29:49,511][26022] Updated weights on worker 0-0, policy_version 860230 (0.00085) [2022-07-10 19:29:51,109][26022] Updated weights on worker 0-0, policy_version 860240 (0.00085) [2022-07-10 19:29:52,313][25689] Fps is (10 sec: 5658.7, 60 sec: 5597.6, 300 sec: 5553.9). Total num frames: 880890880. Throughput: 0: 5874.4. Samples: 880897560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:29:52,313][25689] Avg episode reward: [(0, '1.055')] [2022-07-10 19:29:53,352][26022] Updated weights on worker 0-0, policy_version 860250 (0.00098) [2022-07-10 19:29:54,983][26022] Updated weights on worker 0-0, policy_version 860260 (0.00087) [2022-07-10 19:29:56,867][26022] Updated weights on worker 0-0, policy_version 860270 (0.00084) [2022-07-10 19:29:57,331][25689] Fps is (10 sec: 5658.9, 60 sec: 5579.6, 300 sec: 5557.5). Total num frames: 880919552. Throughput: 0: 5028.7. Samples: 880914400. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:29:57,331][25689] Avg episode reward: [(0, '-0.280')] [2022-07-10 19:29:58,444][26022] Updated weights on worker 0-0, policy_version 860280 (0.00091) [2022-07-10 19:30:00,424][26022] Updated weights on worker 0-0, policy_version 860290 (0.00085) [2022-07-10 19:30:02,334][25689] Fps is (10 sec: 5517.8, 60 sec: 5614.3, 300 sec: 5561.2). Total num frames: 880946176. Throughput: 0: 5872.1. Samples: 880948482. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:02,335][25689] Avg episode reward: [(0, '-0.631')] [2022-07-10 19:30:02,384][26022] Updated weights on worker 0-0, policy_version 860300 (0.00091) [2022-07-10 19:30:04,491][26022] Updated weights on worker 0-0, policy_version 860310 (0.00106) [2022-07-10 19:30:06,151][26022] Updated weights on worker 0-0, policy_version 860320 (0.00088) [2022-07-10 19:30:07,338][25689] Fps is (10 sec: 5321.0, 60 sec: 5597.3, 300 sec: 5558.6). Total num frames: 880972800. Throughput: 0: 5757.7. Samples: 880979938. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:07,342][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 19:30:08,158][26022] Updated weights on worker 0-0, policy_version 860330 (0.00086) [2022-07-10 19:30:09,780][26022] Updated weights on worker 0-0, policy_version 860340 (0.00094) [2022-07-10 19:30:11,872][26022] Updated weights on worker 0-0, policy_version 860350 (0.00060) [2022-07-10 19:30:12,486][25689] Fps is (10 sec: 5446.9, 60 sec: 5572.1, 300 sec: 5556.3). Total num frames: 881001472. Throughput: 0: 4896.7. Samples: 880996342. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:12,487][25689] Avg episode reward: [(0, '-1.578')] [2022-07-10 19:30:13,550][26022] Updated weights on worker 0-0, policy_version 860360 (0.00620) [2022-07-10 19:30:15,513][26022] Updated weights on worker 0-0, policy_version 860370 (0.00089) [2022-07-10 19:30:17,257][26022] Updated weights on worker 0-0, policy_version 860380 (0.00091) [2022-07-10 19:30:17,499][25689] Fps is (10 sec: 5643.3, 60 sec: 5590.0, 300 sec: 5559.8). Total num frames: 881030144. Throughput: 0: 5704.5. Samples: 881029452. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:17,501][25689] Avg episode reward: [(0, '-2.155')] [2022-07-10 19:30:19,282][26022] Updated weights on worker 0-0, policy_version 860390 (0.00053) [2022-07-10 19:30:20,965][26022] Updated weights on worker 0-0, policy_version 860400 (0.00059) [2022-07-10 19:30:22,506][25689] Fps is (10 sec: 5518.4, 60 sec: 5538.8, 300 sec: 5556.3). Total num frames: 881056768. Throughput: 0: 5682.8. Samples: 881063116. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:22,508][25689] Avg episode reward: [(0, '-1.252')] [2022-07-10 19:30:22,816][26022] Updated weights on worker 0-0, policy_version 860410 (0.00100) [2022-07-10 19:30:24,686][26022] Updated weights on worker 0-0, policy_version 860420 (0.00089) [2022-07-10 19:30:26,501][26022] Updated weights on worker 0-0, policy_version 860430 (0.00090) [2022-07-10 19:30:27,514][25689] Fps is (10 sec: 5623.7, 60 sec: 5591.8, 300 sec: 5560.7). Total num frames: 881086464. Throughput: 0: 4953.9. Samples: 881079888. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:27,515][25689] Avg episode reward: [(0, '-2.898')] [2022-07-10 19:30:28,380][26022] Updated weights on worker 0-0, policy_version 860440 (0.00086) [2022-07-10 19:30:30,139][26022] Updated weights on worker 0-0, policy_version 860450 (0.00085) [2022-07-10 19:30:31,978][26022] Updated weights on worker 0-0, policy_version 860460 (0.00088) [2022-07-10 19:30:32,563][25689] Fps is (10 sec: 5600.2, 60 sec: 5541.6, 300 sec: 5556.4). Total num frames: 881113088. Throughput: 0: 5820.4. Samples: 881113198. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:32,563][25689] Avg episode reward: [(0, '-3.099')] [2022-07-10 19:30:33,792][26022] Updated weights on worker 0-0, policy_version 860470 (0.00091) [2022-07-10 19:30:35,502][26022] Updated weights on worker 0-0, policy_version 860480 (0.00081) [2022-07-10 19:30:37,451][26022] Updated weights on worker 0-0, policy_version 860490 (0.00082) [2022-07-10 19:30:37,640][25689] Fps is (10 sec: 5460.9, 60 sec: 5537.0, 300 sec: 5558.6). Total num frames: 881141760. Throughput: 0: 5824.9. Samples: 881146770. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:37,641][25689] Avg episode reward: [(0, '-3.367')] [2022-07-10 19:30:39,605][26022] Updated weights on worker 0-0, policy_version 860500 (0.00089) [2022-07-10 19:30:41,120][26022] Updated weights on worker 0-0, policy_version 860510 (0.00088) [2022-07-10 19:30:42,657][25689] Fps is (10 sec: 5579.4, 60 sec: 5553.6, 300 sec: 5555.0). Total num frames: 881169408. Throughput: 0: 4991.0. Samples: 881163690. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:42,658][25689] Avg episode reward: [(0, '-2.757')] [2022-07-10 19:30:43,085][26022] Updated weights on worker 0-0, policy_version 860520 (0.00083) [2022-07-10 19:30:44,611][26022] Updated weights on worker 0-0, policy_version 860530 (0.00082) [2022-07-10 19:30:46,687][26022] Updated weights on worker 0-0, policy_version 860540 (0.00088) [2022-07-10 19:30:47,686][25689] Fps is (10 sec: 5606.5, 60 sec: 5552.7, 300 sec: 5556.2). Total num frames: 881198080. Throughput: 0: 5824.4. Samples: 881197374. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:47,686][25689] Avg episode reward: [(0, '-2.173')] [2022-07-10 19:30:48,420][26022] Updated weights on worker 0-0, policy_version 860550 (0.00090) [2022-07-10 19:30:50,331][26022] Updated weights on worker 0-0, policy_version 860560 (0.00082) [2022-07-10 19:30:52,109][26022] Updated weights on worker 0-0, policy_version 860570 (0.00094) [2022-07-10 19:30:52,821][25689] Fps is (10 sec: 5642.2, 60 sec: 5550.9, 300 sec: 5557.2). Total num frames: 881226752. Throughput: 0: 5805.9. Samples: 881230812. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:52,827][25689] Avg episode reward: [(0, '-2.067')] [2022-07-10 19:30:53,920][26022] Updated weights on worker 0-0, policy_version 860580 (0.00085) [2022-07-10 19:30:55,742][26022] Updated weights on worker 0-0, policy_version 860590 (0.00092) [2022-07-10 19:30:57,684][26022] Updated weights on worker 0-0, policy_version 860600 (0.00085) [2022-07-10 19:30:57,865][25689] Fps is (10 sec: 5633.3, 60 sec: 5548.5, 300 sec: 5560.0). Total num frames: 881255424. Throughput: 0: 5821.6. Samples: 881264512. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:30:57,866][25689] Avg episode reward: [(0, '-1.202')] [2022-07-10 19:30:58,620][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:30:58,631][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000860605_881259520.pth [2022-07-10 19:30:58,632][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000858649_879256576.pth [2022-07-10 19:30:59,270][26022] Updated weights on worker 0-0, policy_version 860610 (0.00052) [2022-07-10 19:31:01,245][26022] Updated weights on worker 0-0, policy_version 860620 (0.00092) [2022-07-10 19:31:02,878][25689] Fps is (10 sec: 5396.4, 60 sec: 5530.7, 300 sec: 5563.3). Total num frames: 881281024. Throughput: 0: 5826.6. Samples: 881281506. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:02,880][25689] Avg episode reward: [(0, '-0.395')] [2022-07-10 19:31:03,498][26022] Updated weights on worker 0-0, policy_version 860630 (0.00050) [2022-07-10 19:31:05,168][26022] Updated weights on worker 0-0, policy_version 860640 (0.00087) [2022-07-10 19:31:07,141][26022] Updated weights on worker 0-0, policy_version 860650 (0.00092) [2022-07-10 19:31:07,883][25689] Fps is (10 sec: 5417.7, 60 sec: 5564.4, 300 sec: 5564.6). Total num frames: 881309696. Throughput: 0: 5742.0. Samples: 881313346. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:07,884][25689] Avg episode reward: [(0, '-0.636')] [2022-07-10 19:31:08,762][26022] Updated weights on worker 0-0, policy_version 860660 (0.00086) [2022-07-10 19:31:10,607][26022] Updated weights on worker 0-0, policy_version 860670 (0.00089) [2022-07-10 19:31:12,598][26022] Updated weights on worker 0-0, policy_version 860680 (0.00086) [2022-07-10 19:31:12,934][25689] Fps is (10 sec: 5702.5, 60 sec: 5573.3, 300 sec: 5565.2). Total num frames: 881338368. Throughput: 0: 5770.9. Samples: 881346882. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:12,935][25689] Avg episode reward: [(0, '-0.760')] [2022-07-10 19:31:14,321][26022] Updated weights on worker 0-0, policy_version 860690 (0.00088) [2022-07-10 19:31:16,212][26022] Updated weights on worker 0-0, policy_version 860700 (0.00090) [2022-07-10 19:31:17,961][25689] Fps is (10 sec: 5588.2, 60 sec: 5555.1, 300 sec: 5565.0). Total num frames: 881366016. Throughput: 0: 4952.6. Samples: 881364040. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:17,963][25689] Avg episode reward: [(0, '-1.547')] [2022-07-10 19:31:18,032][26022] Updated weights on worker 0-0, policy_version 860710 (0.00090) [2022-07-10 19:31:19,745][26022] Updated weights on worker 0-0, policy_version 860720 (0.00088) [2022-07-10 19:31:21,659][26022] Updated weights on worker 0-0, policy_version 860730 (0.00088) [2022-07-10 19:31:22,990][25689] Fps is (10 sec: 5600.6, 60 sec: 5587.0, 300 sec: 5561.4). Total num frames: 881394688. Throughput: 0: 5791.9. Samples: 881397992. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:22,991][25689] Avg episode reward: [(0, '-0.657')] [2022-07-10 19:31:23,303][26022] Updated weights on worker 0-0, policy_version 860740 (0.00078) [2022-07-10 19:31:25,219][26022] Updated weights on worker 0-0, policy_version 860750 (0.00078) [2022-07-10 19:31:27,318][26022] Updated weights on worker 0-0, policy_version 860760 (0.00087) [2022-07-10 19:31:27,994][25689] Fps is (10 sec: 5613.5, 60 sec: 5553.5, 300 sec: 5558.8). Total num frames: 881422336. Throughput: 0: 5880.0. Samples: 881431600. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:27,996][25689] Avg episode reward: [(0, '-1.684')] [2022-07-10 19:31:28,840][26022] Updated weights on worker 0-0, policy_version 860770 (0.00088) [2022-07-10 19:31:30,947][26022] Updated weights on worker 0-0, policy_version 860780 (0.00089) [2022-07-10 19:31:32,591][26022] Updated weights on worker 0-0, policy_version 860790 (0.00091) [2022-07-10 19:31:33,122][25689] Fps is (10 sec: 5558.2, 60 sec: 5580.0, 300 sec: 5570.3). Total num frames: 881451008. Throughput: 0: 5036.1. Samples: 881448554. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:33,124][25689] Avg episode reward: [(0, '-1.541')] [2022-07-10 19:31:34,575][26022] Updated weights on worker 0-0, policy_version 860800 (0.00083) [2022-07-10 19:31:36,328][26022] Updated weights on worker 0-0, policy_version 860810 (0.00087) [2022-07-10 19:31:38,097][26022] Updated weights on worker 0-0, policy_version 860820 (0.00081) [2022-07-10 19:31:38,196][25689] Fps is (10 sec: 5620.9, 60 sec: 5580.3, 300 sec: 5565.6). Total num frames: 881479680. Throughput: 0: 5827.7. Samples: 881481962. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:38,196][25689] Avg episode reward: [(0, '-2.021')] [2022-07-10 19:31:39,870][26022] Updated weights on worker 0-0, policy_version 860830 (0.00101) [2022-07-10 19:31:42,011][26022] Updated weights on worker 0-0, policy_version 860840 (0.00096) [2022-07-10 19:31:43,210][25689] Fps is (10 sec: 5684.3, 60 sec: 5597.5, 300 sec: 5566.2). Total num frames: 881508352. Throughput: 0: 5803.3. Samples: 881515340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:43,211][25689] Avg episode reward: [(0, '-2.310')] [2022-07-10 19:31:43,668][26022] Updated weights on worker 0-0, policy_version 860850 (0.00399) [2022-07-10 19:31:45,495][26022] Updated weights on worker 0-0, policy_version 860860 (0.00083) [2022-07-10 19:31:47,120][26022] Updated weights on worker 0-0, policy_version 860870 (0.00085) [2022-07-10 19:31:48,219][25689] Fps is (10 sec: 5618.8, 60 sec: 5582.3, 300 sec: 5567.1). Total num frames: 881536000. Throughput: 0: 4979.9. Samples: 881532320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:48,220][25689] Avg episode reward: [(0, '-1.920')] [2022-07-10 19:31:49,344][26022] Updated weights on worker 0-0, policy_version 860880 (0.00098) [2022-07-10 19:31:50,801][26022] Updated weights on worker 0-0, policy_version 860890 (0.00087) [2022-07-10 19:31:52,847][26022] Updated weights on worker 0-0, policy_version 860900 (0.00087) [2022-07-10 19:31:53,276][25689] Fps is (10 sec: 5595.2, 60 sec: 5589.6, 300 sec: 5566.1). Total num frames: 881564672. Throughput: 0: 5830.5. Samples: 881566060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:53,277][25689] Avg episode reward: [(0, '-1.061')] [2022-07-10 19:31:54,621][26022] Updated weights on worker 0-0, policy_version 860910 (0.00051) [2022-07-10 19:31:56,506][26022] Updated weights on worker 0-0, policy_version 860920 (0.00090) [2022-07-10 19:31:58,263][26022] Updated weights on worker 0-0, policy_version 860930 (0.00087) [2022-07-10 19:31:58,284][25689] Fps is (10 sec: 5595.6, 60 sec: 5576.0, 300 sec: 5562.5). Total num frames: 881592320. Throughput: 0: 5868.4. Samples: 881599850. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:31:58,285][25689] Avg episode reward: [(0, '-0.535')] [2022-07-10 19:32:00,267][26022] Updated weights on worker 0-0, policy_version 860940 (0.00088) [2022-07-10 19:32:02,239][26022] Updated weights on worker 0-0, policy_version 860950 (0.00091) [2022-07-10 19:32:03,304][25689] Fps is (10 sec: 5412.2, 60 sec: 5592.3, 300 sec: 5569.5). Total num frames: 881618944. Throughput: 0: 5043.6. Samples: 881616684. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:03,304][25689] Avg episode reward: [(0, '-0.458')] [2022-07-10 19:32:04,316][26022] Updated weights on worker 0-0, policy_version 860960 (0.00088) [2022-07-10 19:32:05,731][26022] Updated weights on worker 0-0, policy_version 860970 (0.00087) [2022-07-10 19:32:07,803][26022] Updated weights on worker 0-0, policy_version 860980 (0.01075) [2022-07-10 19:32:08,307][25689] Fps is (10 sec: 5312.8, 60 sec: 5558.6, 300 sec: 5563.8). Total num frames: 881645568. Throughput: 0: 5766.7. Samples: 881648160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:08,308][25689] Avg episode reward: [(0, '-1.582')] [2022-07-10 19:32:09,404][26022] Updated weights on worker 0-0, policy_version 860990 (0.00094) [2022-07-10 19:32:11,584][26022] Updated weights on worker 0-0, policy_version 861000 (0.00085) [2022-07-10 19:32:13,433][25689] Fps is (10 sec: 5256.9, 60 sec: 5517.8, 300 sec: 5555.3). Total num frames: 881672192. Throughput: 0: 5698.8. Samples: 881680930. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:13,433][25689] Avg episode reward: [(0, '-0.839')] [2022-07-10 19:32:13,690][26022] Updated weights on worker 0-0, policy_version 861010 (0.00086) [2022-07-10 19:32:15,097][26022] Updated weights on worker 0-0, policy_version 861020 (0.00096) [2022-07-10 19:32:16,996][26022] Updated weights on worker 0-0, policy_version 861030 (0.00093) [2022-07-10 19:32:18,459][25689] Fps is (10 sec: 5648.6, 60 sec: 5568.7, 300 sec: 5568.9). Total num frames: 881702912. Throughput: 0: 4854.3. Samples: 881697782. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:18,459][25689] Avg episode reward: [(0, '-1.560')] [2022-07-10 19:32:19,267][26022] Updated weights on worker 0-0, policy_version 861041 (0.00094) [2022-07-10 19:32:20,832][26022] Updated weights on worker 0-0, policy_version 861051 (0.00092) [2022-07-10 19:32:22,899][26022] Updated weights on worker 0-0, policy_version 861061 (0.00080) [2022-07-10 19:32:23,496][25689] Fps is (10 sec: 5800.0, 60 sec: 5551.0, 300 sec: 5557.9). Total num frames: 881730560. Throughput: 0: 5682.8. Samples: 881731434. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:23,497][25689] Avg episode reward: [(0, '-1.453')] [2022-07-10 19:32:24,606][26022] Updated weights on worker 0-0, policy_version 861071 (0.00082) [2022-07-10 19:32:26,459][26022] Updated weights on worker 0-0, policy_version 861081 (0.00092) [2022-07-10 19:32:28,501][25689] Fps is (10 sec: 5404.3, 60 sec: 5534.0, 300 sec: 5559.5). Total num frames: 881757184. Throughput: 0: 5765.6. Samples: 881764592. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:28,502][25689] Avg episode reward: [(0, '-1.340')] [2022-07-10 19:32:28,508][26022] Updated weights on worker 0-0, policy_version 861091 (0.00092) [2022-07-10 19:32:29,963][26022] Updated weights on worker 0-0, policy_version 861101 (0.00093) [2022-07-10 19:32:32,271][26022] Updated weights on worker 0-0, policy_version 861111 (0.00088) [2022-07-10 19:32:33,626][25689] Fps is (10 sec: 5560.2, 60 sec: 5551.3, 300 sec: 5565.0). Total num frames: 881786880. Throughput: 0: 4950.5. Samples: 881780894. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:33,627][25689] Avg episode reward: [(0, '-1.349')] [2022-07-10 19:32:33,746][26022] Updated weights on worker 0-0, policy_version 861121 (0.00083) [2022-07-10 19:32:35,696][26022] Updated weights on worker 0-0, policy_version 861131 (0.00081) [2022-07-10 19:32:37,572][26022] Updated weights on worker 0-0, policy_version 861141 (0.00094) [2022-07-10 19:32:38,656][25689] Fps is (10 sec: 5747.9, 60 sec: 5555.3, 300 sec: 5561.4). Total num frames: 881815552. Throughput: 0: 5786.4. Samples: 881814650. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:38,656][25689] Avg episode reward: [(0, '-1.757')] [2022-07-10 19:32:39,478][26022] Updated weights on worker 0-0, policy_version 861151 (0.00091) [2022-07-10 19:32:41,190][26022] Updated weights on worker 0-0, policy_version 861161 (0.00096) [2022-07-10 19:32:43,292][26022] Updated weights on worker 0-0, policy_version 861171 (0.00085) [2022-07-10 19:32:43,681][25689] Fps is (10 sec: 5295.3, 60 sec: 5486.5, 300 sec: 5551.2). Total num frames: 881840128. Throughput: 0: 5772.2. Samples: 881847944. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:43,682][25689] Avg episode reward: [(0, '-2.707')] [2022-07-10 19:32:44,783][26022] Updated weights on worker 0-0, policy_version 861181 (0.00087) [2022-07-10 19:32:46,858][26022] Updated weights on worker 0-0, policy_version 861191 (0.00085) [2022-07-10 19:32:48,639][26022] Updated weights on worker 0-0, policy_version 861201 (0.00094) [2022-07-10 19:32:48,745][25689] Fps is (10 sec: 5379.2, 60 sec: 5515.4, 300 sec: 5558.9). Total num frames: 881869824. Throughput: 0: 4932.1. Samples: 881864436. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:48,745][25689] Avg episode reward: [(0, '-3.019')] [2022-07-10 19:32:50,573][26022] Updated weights on worker 0-0, policy_version 861211 (0.00089) [2022-07-10 19:32:52,259][26022] Updated weights on worker 0-0, policy_version 861221 (0.00089) [2022-07-10 19:32:53,805][25689] Fps is (10 sec: 5664.5, 60 sec: 5498.2, 300 sec: 5551.5). Total num frames: 881897472. Throughput: 0: 5788.3. Samples: 881897698. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:53,805][25689] Avg episode reward: [(0, '-3.418')] [2022-07-10 19:32:54,367][26022] Updated weights on worker 0-0, policy_version 861231 (0.00090) [2022-07-10 19:32:55,823][26022] Updated weights on worker 0-0, policy_version 861241 (0.00102) [2022-07-10 19:32:57,971][26022] Updated weights on worker 0-0, policy_version 861251 (0.01301) [2022-07-10 19:32:58,752][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:32:58,764][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000861256_881926144.pth [2022-07-10 19:32:58,765][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000859300_879923200.pth [2022-07-10 19:32:58,810][25689] Fps is (10 sec: 5697.5, 60 sec: 5532.4, 300 sec: 5559.6). Total num frames: 881927168. Throughput: 0: 5773.8. Samples: 881931014. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:32:58,810][25689] Avg episode reward: [(0, '-3.357')] [2022-07-10 19:32:59,618][26022] Updated weights on worker 0-0, policy_version 861261 (0.00086) [2022-07-10 19:33:01,712][26022] Updated weights on worker 0-0, policy_version 861271 (0.00092) [2022-07-10 19:33:03,867][25689] Fps is (10 sec: 5292.0, 60 sec: 5478.2, 300 sec: 5548.6). Total num frames: 881950720. Throughput: 0: 4937.3. Samples: 881947606. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:33:03,868][25689] Avg episode reward: [(0, '-1.549')] [2022-07-10 19:33:03,911][26022] Updated weights on worker 0-0, policy_version 861281 (0.00099) [2022-07-10 19:33:05,687][26022] Updated weights on worker 0-0, policy_version 861291 (0.00082) [2022-07-10 19:33:07,563][26022] Updated weights on worker 0-0, policy_version 861301 (0.00092) [2022-07-10 19:33:08,870][25689] Fps is (10 sec: 5191.3, 60 sec: 5512.0, 300 sec: 5554.5). Total num frames: 881979392. Throughput: 0: 5674.1. Samples: 881978626. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:33:08,870][25689] Avg episode reward: [(0, '-0.794')] [2022-07-10 19:33:09,572][26022] Updated weights on worker 0-0, policy_version 861311 (0.00090) [2022-07-10 19:33:11,186][26022] Updated weights on worker 0-0, policy_version 861321 (0.00094) [2022-07-10 19:33:13,231][26022] Updated weights on worker 0-0, policy_version 861331 (0.00098) [2022-07-10 19:33:13,996][25689] Fps is (10 sec: 5560.2, 60 sec: 5528.9, 300 sec: 5548.7). Total num frames: 882007040. Throughput: 0: 5648.7. Samples: 882011752. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:33:13,996][25689] Avg episode reward: [(0, '-0.361')] [2022-07-10 19:33:15,052][26022] Updated weights on worker 0-0, policy_version 861341 (0.00091) [2022-07-10 19:33:16,875][26022] Updated weights on worker 0-0, policy_version 861351 (0.00087) [2022-07-10 19:33:18,581][26022] Updated weights on worker 0-0, policy_version 861361 (0.00088) [2022-07-10 19:33:19,001][25689] Fps is (10 sec: 5458.1, 60 sec: 5480.1, 300 sec: 5549.1). Total num frames: 882034688. Throughput: 0: 4833.6. Samples: 882028608. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-10 19:33:19,001][25689] Avg episode reward: [(0, '1.193')] [2022-07-10 19:33:20,349][26022] Updated weights on worker 0-0, policy_version 861371 (0.00080) [2022-07-10 19:33:22,137][26022] Updated weights on worker 0-0, policy_version 861381 (0.00087) [2022-07-10 19:33:24,054][25689] Fps is (10 sec: 5701.3, 60 sec: 5512.5, 300 sec: 5555.1). Total num frames: 882064384. Throughput: 0: 5694.3. Samples: 882062558. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:24,055][25689] Avg episode reward: [(0, '0.609')] [2022-07-10 19:33:24,055][26022] Updated weights on worker 0-0, policy_version 861391 (0.00084) [2022-07-10 19:33:25,972][26022] Updated weights on worker 0-0, policy_version 861401 (0.00092) [2022-07-10 19:33:27,701][26022] Updated weights on worker 0-0, policy_version 861411 (0.00082) [2022-07-10 19:33:29,151][25689] Fps is (10 sec: 5649.4, 60 sec: 5521.0, 300 sec: 5555.1). Total num frames: 882092032. Throughput: 0: 5793.1. Samples: 882096118. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:29,153][25689] Avg episode reward: [(0, '0.352')] [2022-07-10 19:33:29,570][26022] Updated weights on worker 0-0, policy_version 861421 (0.00099) [2022-07-10 19:33:31,536][26022] Updated weights on worker 0-0, policy_version 861431 (0.00083) [2022-07-10 19:33:33,267][26022] Updated weights on worker 0-0, policy_version 861441 (0.00086) [2022-07-10 19:33:34,239][25689] Fps is (10 sec: 5429.3, 60 sec: 5490.6, 300 sec: 5554.0). Total num frames: 882119680. Throughput: 0: 5810.6. Samples: 882129374. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:34,241][25689] Avg episode reward: [(0, '-0.197')] [2022-07-10 19:33:35,127][26022] Updated weights on worker 0-0, policy_version 861451 (0.00080) [2022-07-10 19:33:37,035][26022] Updated weights on worker 0-0, policy_version 861461 (0.00084) [2022-07-10 19:33:38,902][26022] Updated weights on worker 0-0, policy_version 861471 (0.00085) [2022-07-10 19:33:39,245][25689] Fps is (10 sec: 5680.9, 60 sec: 5509.6, 300 sec: 5557.7). Total num frames: 882149376. Throughput: 0: 5801.6. Samples: 882146060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:39,247][25689] Avg episode reward: [(0, '-0.699')] [2022-07-10 19:33:40,759][26022] Updated weights on worker 0-0, policy_version 861481 (0.00084) [2022-07-10 19:33:42,510][26022] Updated weights on worker 0-0, policy_version 861491 (0.00097) [2022-07-10 19:33:44,254][25689] Fps is (10 sec: 5623.5, 60 sec: 5545.0, 300 sec: 5554.3). Total num frames: 882176000. Throughput: 0: 5768.0. Samples: 882179070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:44,256][25689] Avg episode reward: [(0, '-1.015')] [2022-07-10 19:33:44,451][26022] Updated weights on worker 0-0, policy_version 861501 (0.00093) [2022-07-10 19:33:46,301][26022] Updated weights on worker 0-0, policy_version 861511 (0.00097) [2022-07-10 19:33:48,216][26022] Updated weights on worker 0-0, policy_version 861521 (0.00085) [2022-07-10 19:33:49,283][25689] Fps is (10 sec: 5406.9, 60 sec: 5514.3, 300 sec: 5551.5). Total num frames: 882203648. Throughput: 0: 5775.4. Samples: 882212386. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:49,284][25689] Avg episode reward: [(0, '-1.407')] [2022-07-10 19:33:49,923][26022] Updated weights on worker 0-0, policy_version 861531 (0.00087) [2022-07-10 19:33:51,919][26022] Updated weights on worker 0-0, policy_version 861541 (0.00081) [2022-07-10 19:33:53,631][26022] Updated weights on worker 0-0, policy_version 861551 (0.00086) [2022-07-10 19:33:54,410][25689] Fps is (10 sec: 5343.7, 60 sec: 5491.3, 300 sec: 5539.0). Total num frames: 882230272. Throughput: 0: 4930.3. Samples: 882228824. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:54,411][25689] Avg episode reward: [(0, '-1.309')] [2022-07-10 19:33:55,662][26022] Updated weights on worker 0-0, policy_version 861561 (0.00095) [2022-07-10 19:33:57,518][26022] Updated weights on worker 0-0, policy_version 861571 (0.00086) [2022-07-10 19:33:59,210][26022] Updated weights on worker 0-0, policy_version 861581 (0.00088) [2022-07-10 19:33:59,436][25689] Fps is (10 sec: 5546.8, 60 sec: 5489.3, 300 sec: 5555.9). Total num frames: 882259968. Throughput: 0: 5752.7. Samples: 882262214. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:33:59,437][25689] Avg episode reward: [(0, '-0.448')] [2022-07-10 19:34:01,138][26022] Updated weights on worker 0-0, policy_version 861591 (0.00087) [2022-07-10 19:34:03,272][26022] Updated weights on worker 0-0, policy_version 861601 (0.00089) [2022-07-10 19:34:04,447][25689] Fps is (10 sec: 5407.4, 60 sec: 5510.5, 300 sec: 5545.5). Total num frames: 882284544. Throughput: 0: 5667.2. Samples: 882293506. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:04,448][25689] Avg episode reward: [(0, '-0.563')] [2022-07-10 19:34:05,249][26022] Updated weights on worker 0-0, policy_version 861611 (0.00083) [2022-07-10 19:34:07,095][26022] Updated weights on worker 0-0, policy_version 861621 (0.00099) [2022-07-10 19:34:08,715][26022] Updated weights on worker 0-0, policy_version 861631 (0.00088) [2022-07-10 19:34:09,463][25689] Fps is (10 sec: 5310.8, 60 sec: 5509.3, 300 sec: 5542.8). Total num frames: 882313216. Throughput: 0: 4855.7. Samples: 882310374. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:09,463][25689] Avg episode reward: [(0, '-0.952')] [2022-07-10 19:34:10,787][26022] Updated weights on worker 0-0, policy_version 861641 (0.00085) [2022-07-10 19:34:12,367][26022] Updated weights on worker 0-0, policy_version 861651 (0.00095) [2022-07-10 19:34:14,462][26022] Updated weights on worker 0-0, policy_version 861661 (0.00090) [2022-07-10 19:34:14,556][25689] Fps is (10 sec: 5570.9, 60 sec: 5512.2, 300 sec: 5541.5). Total num frames: 882340864. Throughput: 0: 5699.1. Samples: 882343640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:14,557][25689] Avg episode reward: [(0, '-1.643')] [2022-07-10 19:34:16,167][26022] Updated weights on worker 0-0, policy_version 861671 (0.00085) [2022-07-10 19:34:18,127][26022] Updated weights on worker 0-0, policy_version 861681 (0.00083) [2022-07-10 19:34:19,572][25689] Fps is (10 sec: 5570.9, 60 sec: 5528.1, 300 sec: 5537.8). Total num frames: 882369536. Throughput: 0: 5689.3. Samples: 882376774. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:19,573][25689] Avg episode reward: [(0, '-2.809')] [2022-07-10 19:34:20,071][26022] Updated weights on worker 0-0, policy_version 861691 (0.00083) [2022-07-10 19:34:21,630][26022] Updated weights on worker 0-0, policy_version 861701 (0.00092) [2022-07-10 19:34:23,818][26022] Updated weights on worker 0-0, policy_version 861711 (0.00105) [2022-07-10 19:34:24,578][25689] Fps is (10 sec: 5517.6, 60 sec: 5481.7, 300 sec: 5538.3). Total num frames: 882396160. Throughput: 0: 4958.2. Samples: 882393322. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:24,579][25689] Avg episode reward: [(0, '-2.647')] [2022-07-10 19:34:25,309][26022] Updated weights on worker 0-0, policy_version 861721 (0.00096) [2022-07-10 19:34:27,448][26022] Updated weights on worker 0-0, policy_version 861731 (0.00088) [2022-07-10 19:34:29,065][26022] Updated weights on worker 0-0, policy_version 861741 (0.00084) [2022-07-10 19:34:29,603][25689] Fps is (10 sec: 5410.6, 60 sec: 5488.3, 300 sec: 5532.0). Total num frames: 882423808. Throughput: 0: 5770.3. Samples: 882426590. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:29,604][25689] Avg episode reward: [(0, '-2.858')] [2022-07-10 19:34:31,016][26022] Updated weights on worker 0-0, policy_version 861751 (0.00097) [2022-07-10 19:34:32,869][26022] Updated weights on worker 0-0, policy_version 861761 (0.00087) [2022-07-10 19:34:34,732][25689] Fps is (10 sec: 5546.4, 60 sec: 5501.4, 300 sec: 5530.1). Total num frames: 882452480. Throughput: 0: 5777.0. Samples: 882460196. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:34,733][25689] Avg episode reward: [(0, '-0.860')] [2022-07-10 19:34:34,808][26022] Updated weights on worker 0-0, policy_version 861771 (0.00108) [2022-07-10 19:34:36,491][26022] Updated weights on worker 0-0, policy_version 861781 (0.00092) [2022-07-10 19:34:38,401][26022] Updated weights on worker 0-0, policy_version 861791 (0.00095) [2022-07-10 19:34:39,810][25689] Fps is (10 sec: 5718.5, 60 sec: 5495.0, 300 sec: 5539.2). Total num frames: 882482176. Throughput: 0: 4954.7. Samples: 882477044. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:39,811][25689] Avg episode reward: [(0, '-0.953')] [2022-07-10 19:34:40,095][26022] Updated weights on worker 0-0, policy_version 861801 (0.00439) [2022-07-10 19:34:41,923][26022] Updated weights on worker 0-0, policy_version 861811 (0.00100) [2022-07-10 19:34:43,763][26022] Updated weights on worker 0-0, policy_version 861821 (0.00091) [2022-07-10 19:34:44,870][25689] Fps is (10 sec: 5656.7, 60 sec: 5507.2, 300 sec: 5535.0). Total num frames: 882509824. Throughput: 0: 5788.0. Samples: 882510770. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:44,870][25689] Avg episode reward: [(0, '-1.589')] [2022-07-10 19:34:45,786][26022] Updated weights on worker 0-0, policy_version 861831 (0.00080) [2022-07-10 19:34:47,463][26022] Updated weights on worker 0-0, policy_version 861841 (0.00093) [2022-07-10 19:34:49,106][26022] Updated weights on worker 0-0, policy_version 861851 (0.00086) [2022-07-10 19:34:49,895][25689] Fps is (10 sec: 5584.5, 60 sec: 5524.4, 300 sec: 5536.7). Total num frames: 882538496. Throughput: 0: 5817.2. Samples: 882544632. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:49,896][25689] Avg episode reward: [(0, '-0.702')] [2022-07-10 19:34:51,174][26022] Updated weights on worker 0-0, policy_version 861861 (0.00087) [2022-07-10 19:34:52,920][26022] Updated weights on worker 0-0, policy_version 861871 (0.00089) [2022-07-10 19:34:54,739][26022] Updated weights on worker 0-0, policy_version 861881 (0.00088) [2022-07-10 19:34:54,946][25689] Fps is (10 sec: 5690.8, 60 sec: 5565.2, 300 sec: 5536.1). Total num frames: 882567168. Throughput: 0: 5001.0. Samples: 882561286. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:54,947][25689] Avg episode reward: [(0, '-0.086')] [2022-07-10 19:34:56,730][26022] Updated weights on worker 0-0, policy_version 861891 (0.00090) [2022-07-10 19:34:58,219][26022] Updated weights on worker 0-0, policy_version 861901 (0.00084) [2022-07-10 19:34:59,111][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:34:59,125][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000861905_882590720.pth [2022-07-10 19:34:59,125][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000859950_880588800.pth [2022-07-10 19:34:59,950][25689] Fps is (10 sec: 5499.4, 60 sec: 5516.5, 300 sec: 5536.0). Total num frames: 882593792. Throughput: 0: 5868.3. Samples: 882595232. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:34:59,950][25689] Avg episode reward: [(0, '0.183')] [2022-07-10 19:35:00,484][26022] Updated weights on worker 0-0, policy_version 861911 (0.00091) [2022-07-10 19:35:02,097][26022] Updated weights on worker 0-0, policy_version 861921 (0.00079) [2022-07-10 19:35:04,392][26022] Updated weights on worker 0-0, policy_version 861931 (0.00089) [2022-07-10 19:35:04,998][25689] Fps is (10 sec: 5297.6, 60 sec: 5546.9, 300 sec: 5535.2). Total num frames: 882620416. Throughput: 0: 5761.4. Samples: 882626736. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:04,998][25689] Avg episode reward: [(0, '-0.851')] [2022-07-10 19:35:05,987][26022] Updated weights on worker 0-0, policy_version 861941 (0.00085) [2022-07-10 19:35:07,972][26022] Updated weights on worker 0-0, policy_version 861951 (0.00088) [2022-07-10 19:35:09,740][26022] Updated weights on worker 0-0, policy_version 861961 (0.00086) [2022-07-10 19:35:10,004][25689] Fps is (10 sec: 5499.9, 60 sec: 5547.8, 300 sec: 5537.9). Total num frames: 882649088. Throughput: 0: 4920.2. Samples: 882643572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:10,004][25689] Avg episode reward: [(0, '-0.078')] [2022-07-10 19:35:11,485][26022] Updated weights on worker 0-0, policy_version 861971 (0.00087) [2022-07-10 19:35:13,255][26022] Updated weights on worker 0-0, policy_version 861981 (0.00082) [2022-07-10 19:35:15,132][25689] Fps is (10 sec: 5557.2, 60 sec: 5544.6, 300 sec: 5532.3). Total num frames: 882676736. Throughput: 0: 5737.3. Samples: 882677098. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:15,133][25689] Avg episode reward: [(0, '-0.039')] [2022-07-10 19:35:15,512][26022] Updated weights on worker 0-0, policy_version 861991 (0.00857) [2022-07-10 19:35:17,004][26022] Updated weights on worker 0-0, policy_version 862001 (0.00085) [2022-07-10 19:35:18,978][26022] Updated weights on worker 0-0, policy_version 862011 (0.00084) [2022-07-10 19:35:20,187][25689] Fps is (10 sec: 5631.5, 60 sec: 5558.0, 300 sec: 5541.7). Total num frames: 882706432. Throughput: 0: 5706.4. Samples: 882710710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:20,187][25689] Avg episode reward: [(0, '0.128')] [2022-07-10 19:35:20,678][26022] Updated weights on worker 0-0, policy_version 862021 (0.00086) [2022-07-10 19:35:22,432][26022] Updated weights on worker 0-0, policy_version 862031 (0.00091) [2022-07-10 19:35:24,331][26022] Updated weights on worker 0-0, policy_version 862041 (0.00080) [2022-07-10 19:35:25,245][25689] Fps is (10 sec: 5670.3, 60 sec: 5570.1, 300 sec: 5533.9). Total num frames: 882734080. Throughput: 0: 4979.2. Samples: 882727552. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:25,247][25689] Avg episode reward: [(0, '0.128')] [2022-07-10 19:35:26,219][26022] Updated weights on worker 0-0, policy_version 862051 (0.00092) [2022-07-10 19:35:28,167][26022] Updated weights on worker 0-0, policy_version 862061 (0.00091) [2022-07-10 19:35:30,110][26022] Updated weights on worker 0-0, policy_version 862071 (0.00089) [2022-07-10 19:35:30,277][25689] Fps is (10 sec: 5480.2, 60 sec: 5569.5, 300 sec: 5537.7). Total num frames: 882761728. Throughput: 0: 5786.7. Samples: 882760884. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:30,277][25689] Avg episode reward: [(0, '-0.161')] [2022-07-10 19:35:31,723][26022] Updated weights on worker 0-0, policy_version 862081 (0.00092) [2022-07-10 19:35:33,901][26022] Updated weights on worker 0-0, policy_version 862091 (0.00085) [2022-07-10 19:35:35,399][25689] Fps is (10 sec: 5546.9, 60 sec: 5570.1, 300 sec: 5536.8). Total num frames: 882790400. Throughput: 0: 5780.1. Samples: 882794240. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:35,401][25689] Avg episode reward: [(0, '-0.485')] [2022-07-10 19:35:35,464][26022] Updated weights on worker 0-0, policy_version 862101 (0.00086) [2022-07-10 19:35:37,391][26022] Updated weights on worker 0-0, policy_version 862111 (0.00086) [2022-07-10 19:35:39,102][26022] Updated weights on worker 0-0, policy_version 862121 (0.00085) [2022-07-10 19:35:40,474][25689] Fps is (10 sec: 5523.2, 60 sec: 5536.6, 300 sec: 5535.8). Total num frames: 882818048. Throughput: 0: 5767.5. Samples: 882827716. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:40,474][25689] Avg episode reward: [(0, '-0.650')] [2022-07-10 19:35:41,060][26022] Updated weights on worker 0-0, policy_version 862131 (0.00608) [2022-07-10 19:35:42,847][26022] Updated weights on worker 0-0, policy_version 862141 (0.00084) [2022-07-10 19:35:44,871][26022] Updated weights on worker 0-0, policy_version 862151 (0.00098) [2022-07-10 19:35:45,477][25689] Fps is (10 sec: 5588.4, 60 sec: 5558.7, 300 sec: 5536.2). Total num frames: 882846720. Throughput: 0: 5784.1. Samples: 882844572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:45,479][25689] Avg episode reward: [(0, '-0.715')] [2022-07-10 19:35:46,466][26022] Updated weights on worker 0-0, policy_version 862161 (0.00086) [2022-07-10 19:35:48,681][26022] Updated weights on worker 0-0, policy_version 862171 (0.00086) [2022-07-10 19:35:49,916][26022] Updated weights on worker 0-0, policy_version 862181 (0.00092) [2022-07-10 19:35:50,509][25689] Fps is (10 sec: 5714.0, 60 sec: 5558.0, 300 sec: 5538.1). Total num frames: 882875392. Throughput: 0: 5790.5. Samples: 882878042. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:50,510][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 19:35:52,328][26022] Updated weights on worker 0-0, policy_version 862191 (0.00092) [2022-07-10 19:35:53,934][26022] Updated weights on worker 0-0, policy_version 862201 (0.00087) [2022-07-10 19:35:55,556][25689] Fps is (10 sec: 5486.3, 60 sec: 5524.7, 300 sec: 5531.2). Total num frames: 882902016. Throughput: 0: 5804.0. Samples: 882911230. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:35:55,556][25689] Avg episode reward: [(0, '-1.238')] [2022-07-10 19:35:55,833][26022] Updated weights on worker 0-0, policy_version 862211 (0.00108) [2022-07-10 19:35:57,652][26022] Updated weights on worker 0-0, policy_version 862221 (0.00090) [2022-07-10 19:35:59,317][26022] Updated weights on worker 0-0, policy_version 862231 (0.00091) [2022-07-10 19:36:00,607][25689] Fps is (10 sec: 5476.4, 60 sec: 5554.1, 300 sec: 5540.8). Total num frames: 882930688. Throughput: 0: 4979.0. Samples: 882927958. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:00,607][25689] Avg episode reward: [(0, '-1.811')] [2022-07-10 19:36:01,461][26022] Updated weights on worker 0-0, policy_version 862241 (0.00093) [2022-07-10 19:36:03,744][26022] Updated weights on worker 0-0, policy_version 862251 (0.00088) [2022-07-10 19:36:05,366][26022] Updated weights on worker 0-0, policy_version 862261 (0.00091) [2022-07-10 19:36:05,620][25689] Fps is (10 sec: 5494.4, 60 sec: 5557.3, 300 sec: 5533.8). Total num frames: 882957312. Throughput: 0: 5695.6. Samples: 882959298. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:05,622][25689] Avg episode reward: [(0, '-1.110')] [2022-07-10 19:36:07,257][26022] Updated weights on worker 0-0, policy_version 862271 (0.00090) [2022-07-10 19:36:09,042][26022] Updated weights on worker 0-0, policy_version 862281 (0.00088) [2022-07-10 19:36:10,629][25689] Fps is (10 sec: 5312.9, 60 sec: 5523.3, 300 sec: 5527.7). Total num frames: 882983936. Throughput: 0: 5698.0. Samples: 882992682. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:10,630][25689] Avg episode reward: [(0, '-0.954')] [2022-07-10 19:36:11,034][26022] Updated weights on worker 0-0, policy_version 862291 (0.00079) [2022-07-10 19:36:12,635][26022] Updated weights on worker 0-0, policy_version 862301 (0.00083) [2022-07-10 19:36:14,652][26022] Updated weights on worker 0-0, policy_version 862311 (0.00083) [2022-07-10 19:36:15,712][25689] Fps is (10 sec: 5479.2, 60 sec: 5544.3, 300 sec: 5530.1). Total num frames: 883012608. Throughput: 0: 4878.4. Samples: 883009558. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:15,713][25689] Avg episode reward: [(0, '-1.730')] [2022-07-10 19:36:16,374][26022] Updated weights on worker 0-0, policy_version 862321 (0.00085) [2022-07-10 19:36:18,307][26022] Updated weights on worker 0-0, policy_version 862331 (0.00092) [2022-07-10 19:36:20,072][26022] Updated weights on worker 0-0, policy_version 862341 (0.00091) [2022-07-10 19:36:20,717][25689] Fps is (10 sec: 5583.0, 60 sec: 5515.0, 300 sec: 5527.1). Total num frames: 883040256. Throughput: 0: 5719.3. Samples: 883042974. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:20,720][25689] Avg episode reward: [(0, '-1.535')] [2022-07-10 19:36:21,815][26022] Updated weights on worker 0-0, policy_version 862351 (0.00085) [2022-07-10 19:36:23,678][26022] Updated weights on worker 0-0, policy_version 862361 (0.00090) [2022-07-10 19:36:25,620][26022] Updated weights on worker 0-0, policy_version 862371 (0.00086) [2022-07-10 19:36:25,733][25689] Fps is (10 sec: 5517.7, 60 sec: 5518.9, 300 sec: 5526.9). Total num frames: 883067904. Throughput: 0: 5836.8. Samples: 883076696. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:25,734][25689] Avg episode reward: [(0, '-1.622')] [2022-07-10 19:36:27,284][26022] Updated weights on worker 0-0, policy_version 862381 (0.00080) [2022-07-10 19:36:29,310][26022] Updated weights on worker 0-0, policy_version 862391 (0.00092) [2022-07-10 19:36:30,755][25689] Fps is (10 sec: 5610.6, 60 sec: 5536.7, 300 sec: 5528.9). Total num frames: 883096576. Throughput: 0: 5000.2. Samples: 883093316. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:30,757][25689] Avg episode reward: [(0, '-0.955')] [2022-07-10 19:36:30,977][26022] Updated weights on worker 0-0, policy_version 862401 (0.00094) [2022-07-10 19:36:33,047][26022] Updated weights on worker 0-0, policy_version 862411 (0.00085) [2022-07-10 19:36:34,874][26022] Updated weights on worker 0-0, policy_version 862421 (0.00096) [2022-07-10 19:36:35,859][25689] Fps is (10 sec: 5561.9, 60 sec: 5521.4, 300 sec: 5524.8). Total num frames: 883124224. Throughput: 0: 5792.8. Samples: 883126268. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:35,860][25689] Avg episode reward: [(0, '-0.468')] [2022-07-10 19:36:36,707][26022] Updated weights on worker 0-0, policy_version 862431 (0.00085) [2022-07-10 19:36:38,700][26022] Updated weights on worker 0-0, policy_version 862441 (0.00097) [2022-07-10 19:36:40,510][26022] Updated weights on worker 0-0, policy_version 862451 (0.00094) [2022-07-10 19:36:40,885][25689] Fps is (10 sec: 5559.8, 60 sec: 5542.8, 300 sec: 5524.6). Total num frames: 883152896. Throughput: 0: 5771.9. Samples: 883159382. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:40,885][25689] Avg episode reward: [(0, '-0.984')] [2022-07-10 19:36:42,325][26022] Updated weights on worker 0-0, policy_version 862461 (0.00086) [2022-07-10 19:36:44,073][26022] Updated weights on worker 0-0, policy_version 862471 (0.00091) [2022-07-10 19:36:45,894][25689] Fps is (10 sec: 5612.3, 60 sec: 5525.3, 300 sec: 5524.6). Total num frames: 883180544. Throughput: 0: 4939.2. Samples: 883176276. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:45,895][25689] Avg episode reward: [(0, '-0.206')] [2022-07-10 19:36:45,901][26022] Updated weights on worker 0-0, policy_version 862481 (0.00089) [2022-07-10 19:36:47,775][26022] Updated weights on worker 0-0, policy_version 862491 (0.00086) [2022-07-10 19:36:49,608][26022] Updated weights on worker 0-0, policy_version 862501 (0.00091) [2022-07-10 19:36:50,911][25689] Fps is (10 sec: 5413.1, 60 sec: 5492.9, 300 sec: 5518.5). Total num frames: 883207168. Throughput: 0: 5760.5. Samples: 883209424. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:50,911][25689] Avg episode reward: [(0, '-1.127')] [2022-07-10 19:36:51,420][26022] Updated weights on worker 0-0, policy_version 862511 (0.00094) [2022-07-10 19:36:53,471][26022] Updated weights on worker 0-0, policy_version 862521 (0.00096) [2022-07-10 19:36:55,003][26022] Updated weights on worker 0-0, policy_version 862531 (0.00092) [2022-07-10 19:36:55,987][25689] Fps is (10 sec: 5478.9, 60 sec: 5524.0, 300 sec: 5520.7). Total num frames: 883235840. Throughput: 0: 5808.4. Samples: 883243178. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:36:55,988][25689] Avg episode reward: [(0, '-0.903')] [2022-07-10 19:36:56,991][26022] Updated weights on worker 0-0, policy_version 862541 (0.00088) [2022-07-10 19:36:58,831][26022] Updated weights on worker 0-0, policy_version 862551 (0.00090) [2022-07-10 19:36:59,368][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:36:59,381][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000862553_883254272.pth [2022-07-10 19:36:59,382][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000860605_881259520.pth [2022-07-10 19:37:00,579][26022] Updated weights on worker 0-0, policy_version 862561 (0.00085) [2022-07-10 19:37:01,057][25689] Fps is (10 sec: 5651.7, 60 sec: 5522.3, 300 sec: 5526.6). Total num frames: 883264512. Throughput: 0: 4974.2. Samples: 883259724. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-10 19:37:01,058][25689] Avg episode reward: [(0, '-1.530')] [2022-07-10 19:37:02,900][26022] Updated weights on worker 0-0, policy_version 862571 (0.00092) [2022-07-10 19:37:04,619][26022] Updated weights on worker 0-0, policy_version 862581 (0.00091) [2022-07-10 19:37:06,098][25689] Fps is (10 sec: 5367.6, 60 sec: 5502.8, 300 sec: 5522.5). Total num frames: 883290112. Throughput: 0: 5695.6. Samples: 883291348. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:06,101][25689] Avg episode reward: [(0, '-1.434')] [2022-07-10 19:37:06,631][26022] Updated weights on worker 0-0, policy_version 862591 (0.00092) [2022-07-10 19:37:08,347][26022] Updated weights on worker 0-0, policy_version 862601 (0.00091) [2022-07-10 19:37:09,980][26022] Updated weights on worker 0-0, policy_version 862611 (0.00085) [2022-07-10 19:37:11,144][25689] Fps is (10 sec: 5278.9, 60 sec: 5516.4, 300 sec: 5527.4). Total num frames: 883317760. Throughput: 0: 5703.5. Samples: 883324824. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:11,145][25689] Avg episode reward: [(0, '-1.274')] [2022-07-10 19:37:12,187][26022] Updated weights on worker 0-0, policy_version 862621 (0.00094) [2022-07-10 19:37:13,829][26022] Updated weights on worker 0-0, policy_version 862631 (0.00086) [2022-07-10 19:37:15,779][26022] Updated weights on worker 0-0, policy_version 862641 (0.00082) [2022-07-10 19:37:16,283][25689] Fps is (10 sec: 5630.4, 60 sec: 5528.2, 300 sec: 5521.9). Total num frames: 883347456. Throughput: 0: 4848.5. Samples: 883341582. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:16,283][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 19:37:17,577][26022] Updated weights on worker 0-0, policy_version 862651 (0.00087) [2022-07-10 19:37:19,244][26022] Updated weights on worker 0-0, policy_version 862661 (0.00084) [2022-07-10 19:37:21,124][26022] Updated weights on worker 0-0, policy_version 862671 (0.00090) [2022-07-10 19:37:21,356][25689] Fps is (10 sec: 5715.6, 60 sec: 5538.8, 300 sec: 5524.6). Total num frames: 883376128. Throughput: 0: 5674.1. Samples: 883374902. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:21,358][25689] Avg episode reward: [(0, '-1.858')] [2022-07-10 19:37:23,171][26022] Updated weights on worker 0-0, policy_version 862681 (0.00098) [2022-07-10 19:37:24,784][26022] Updated weights on worker 0-0, policy_version 862691 (0.00086) [2022-07-10 19:37:26,438][25689] Fps is (10 sec: 5444.9, 60 sec: 5516.0, 300 sec: 5523.2). Total num frames: 883402752. Throughput: 0: 5768.2. Samples: 883408676. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:26,440][25689] Avg episode reward: [(0, '-1.464')] [2022-07-10 19:37:26,756][26022] Updated weights on worker 0-0, policy_version 862701 (0.00095) [2022-07-10 19:37:28,358][26022] Updated weights on worker 0-0, policy_version 862711 (0.00089) [2022-07-10 19:37:30,426][26022] Updated weights on worker 0-0, policy_version 862721 (0.00091) [2022-07-10 19:37:31,473][25689] Fps is (10 sec: 5465.5, 60 sec: 5514.8, 300 sec: 5521.4). Total num frames: 883431424. Throughput: 0: 4935.2. Samples: 883425154. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:31,475][25689] Avg episode reward: [(0, '-1.809')] [2022-07-10 19:37:32,340][26022] Updated weights on worker 0-0, policy_version 862731 (0.00091) [2022-07-10 19:37:34,047][26022] Updated weights on worker 0-0, policy_version 862741 (0.00085) [2022-07-10 19:37:35,978][26022] Updated weights on worker 0-0, policy_version 862751 (0.00085) [2022-07-10 19:37:36,589][25689] Fps is (10 sec: 5750.4, 60 sec: 5547.5, 300 sec: 5523.3). Total num frames: 883461120. Throughput: 0: 5777.5. Samples: 883458900. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:36,589][25689] Avg episode reward: [(0, '-1.640')] [2022-07-10 19:37:37,818][26022] Updated weights on worker 0-0, policy_version 862761 (0.00053) [2022-07-10 19:37:39,710][26022] Updated weights on worker 0-0, policy_version 862771 (0.00083) [2022-07-10 19:37:41,621][25689] Fps is (10 sec: 5449.3, 60 sec: 5496.3, 300 sec: 5526.6). Total num frames: 883486720. Throughput: 0: 5805.3. Samples: 883492544. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:41,621][25689] Avg episode reward: [(0, '-1.574')] [2022-07-10 19:37:41,673][26022] Updated weights on worker 0-0, policy_version 862781 (0.00084) [2022-07-10 19:37:43,202][26022] Updated weights on worker 0-0, policy_version 862791 (0.00086) [2022-07-10 19:37:45,186][26022] Updated weights on worker 0-0, policy_version 862801 (0.00087) [2022-07-10 19:37:46,648][25689] Fps is (10 sec: 5599.0, 60 sec: 5545.3, 300 sec: 5530.7). Total num frames: 883517440. Throughput: 0: 5817.9. Samples: 883526252. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:46,648][25689] Avg episode reward: [(0, '-1.007')] [2022-07-10 19:37:46,856][26022] Updated weights on worker 0-0, policy_version 862811 (0.00106) [2022-07-10 19:37:48,864][26022] Updated weights on worker 0-0, policy_version 862821 (0.00089) [2022-07-10 19:37:50,800][26022] Updated weights on worker 0-0, policy_version 862831 (0.00090) [2022-07-10 19:37:51,659][25689] Fps is (10 sec: 5712.7, 60 sec: 5545.8, 300 sec: 5528.2). Total num frames: 883544064. Throughput: 0: 5838.6. Samples: 883543008. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:51,664][25689] Avg episode reward: [(0, '-0.710')] [2022-07-10 19:37:52,276][26022] Updated weights on worker 0-0, policy_version 862841 (0.00094) [2022-07-10 19:37:54,307][26022] Updated weights on worker 0-0, policy_version 862851 (0.00415) [2022-07-10 19:37:56,165][26022] Updated weights on worker 0-0, policy_version 862861 (0.00091) [2022-07-10 19:37:56,711][25689] Fps is (10 sec: 5393.2, 60 sec: 5531.2, 300 sec: 5520.4). Total num frames: 883571712. Throughput: 0: 5836.2. Samples: 883576336. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:37:56,711][25689] Avg episode reward: [(0, '-2.528')] [2022-07-10 19:37:57,993][26022] Updated weights on worker 0-0, policy_version 862871 (0.00096) [2022-07-10 19:37:59,923][26022] Updated weights on worker 0-0, policy_version 862881 (0.00061) [2022-07-10 19:38:01,496][26022] Updated weights on worker 0-0, policy_version 862891 (0.00094) [2022-07-10 19:38:01,732][25689] Fps is (10 sec: 5692.8, 60 sec: 5552.5, 300 sec: 5541.7). Total num frames: 883601408. Throughput: 0: 5830.3. Samples: 883609798. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:01,733][25689] Avg episode reward: [(0, '-2.924')] [2022-07-10 19:38:03,811][26022] Updated weights on worker 0-0, policy_version 862901 (0.00092) [2022-07-10 19:38:05,642][26022] Updated weights on worker 0-0, policy_version 862911 (0.00090) [2022-07-10 19:38:06,746][25689] Fps is (10 sec: 5306.2, 60 sec: 5521.2, 300 sec: 5524.3). Total num frames: 883624960. Throughput: 0: 4889.2. Samples: 883624516. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:06,746][25689] Avg episode reward: [(0, '-2.446')] [2022-07-10 19:38:07,440][26022] Updated weights on worker 0-0, policy_version 862921 (0.00091) [2022-07-10 19:38:09,406][26022] Updated weights on worker 0-0, policy_version 862931 (0.00096) [2022-07-10 19:38:11,055][26022] Updated weights on worker 0-0, policy_version 862941 (0.00084) [2022-07-10 19:38:11,767][25689] Fps is (10 sec: 5204.3, 60 sec: 5540.4, 300 sec: 5529.7). Total num frames: 883653632. Throughput: 0: 5719.5. Samples: 883658014. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:11,767][25689] Avg episode reward: [(0, '-2.936')] [2022-07-10 19:38:13,087][26022] Updated weights on worker 0-0, policy_version 862951 (0.00089) [2022-07-10 19:38:14,792][26022] Updated weights on worker 0-0, policy_version 862961 (0.00090) [2022-07-10 19:38:16,679][26022] Updated weights on worker 0-0, policy_version 862971 (0.00094) [2022-07-10 19:38:16,872][25689] Fps is (10 sec: 5764.2, 60 sec: 5543.5, 300 sec: 5534.7). Total num frames: 883683328. Throughput: 0: 5709.5. Samples: 883691444. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:16,872][25689] Avg episode reward: [(0, '-2.922')] [2022-07-10 19:38:18,740][26022] Updated weights on worker 0-0, policy_version 862981 (0.00089) [2022-07-10 19:38:20,475][26022] Updated weights on worker 0-0, policy_version 862991 (0.00088) [2022-07-10 19:38:21,883][25689] Fps is (10 sec: 5466.1, 60 sec: 5498.4, 300 sec: 5521.8). Total num frames: 883708928. Throughput: 0: 4865.1. Samples: 883707834. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:21,883][25689] Avg episode reward: [(0, '-0.883')] [2022-07-10 19:38:22,278][26022] Updated weights on worker 0-0, policy_version 863001 (0.00093) [2022-07-10 19:38:23,992][26022] Updated weights on worker 0-0, policy_version 863011 (0.00089) [2022-07-10 19:38:25,848][26022] Updated weights on worker 0-0, policy_version 863021 (0.00090) [2022-07-10 19:38:26,922][25689] Fps is (10 sec: 5501.9, 60 sec: 5553.1, 300 sec: 5529.7). Total num frames: 883738624. Throughput: 0: 5792.3. Samples: 883741382. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:26,922][25689] Avg episode reward: [(0, '-1.116')] [2022-07-10 19:38:27,943][26022] Updated weights on worker 0-0, policy_version 863031 (0.00087) [2022-07-10 19:38:29,623][26022] Updated weights on worker 0-0, policy_version 863041 (0.00096) [2022-07-10 19:38:31,623][26022] Updated weights on worker 0-0, policy_version 863051 (0.00090) [2022-07-10 19:38:32,014][25689] Fps is (10 sec: 5660.5, 60 sec: 5531.0, 300 sec: 5529.7). Total num frames: 883766272. Throughput: 0: 5749.0. Samples: 883774414. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:32,014][25689] Avg episode reward: [(0, '-0.391')] [2022-07-10 19:38:33,402][26022] Updated weights on worker 0-0, policy_version 863061 (0.00082) [2022-07-10 19:38:35,279][26022] Updated weights on worker 0-0, policy_version 863071 (0.00094) [2022-07-10 19:38:37,060][25689] Fps is (10 sec: 5555.1, 60 sec: 5520.3, 300 sec: 5525.5). Total num frames: 883794944. Throughput: 0: 4932.2. Samples: 883791018. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:37,062][25689] Avg episode reward: [(0, '-0.668')] [2022-07-10 19:38:37,067][26022] Updated weights on worker 0-0, policy_version 863081 (0.00517) [2022-07-10 19:38:39,072][26022] Updated weights on worker 0-0, policy_version 863091 (0.00089) [2022-07-10 19:38:40,764][26022] Updated weights on worker 0-0, policy_version 863101 (0.00080) [2022-07-10 19:38:42,088][25689] Fps is (10 sec: 5590.5, 60 sec: 5554.6, 300 sec: 5528.6). Total num frames: 883822592. Throughput: 0: 5787.6. Samples: 883824772. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:42,089][25689] Avg episode reward: [(0, '-0.608')] [2022-07-10 19:38:42,693][26022] Updated weights on worker 0-0, policy_version 863111 (0.00089) [2022-07-10 19:38:44,319][26022] Updated weights on worker 0-0, policy_version 863121 (0.00399) [2022-07-10 19:38:46,427][26022] Updated weights on worker 0-0, policy_version 863131 (0.00107) [2022-07-10 19:38:47,098][25689] Fps is (10 sec: 5508.7, 60 sec: 5505.3, 300 sec: 5528.9). Total num frames: 883850240. Throughput: 0: 5790.9. Samples: 883858220. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:47,099][25689] Avg episode reward: [(0, '-0.747')] [2022-07-10 19:38:48,154][26022] Updated weights on worker 0-0, policy_version 863141 (0.00401) [2022-07-10 19:38:50,055][26022] Updated weights on worker 0-0, policy_version 863151 (0.00090) [2022-07-10 19:38:51,807][26022] Updated weights on worker 0-0, policy_version 863161 (0.00114) [2022-07-10 19:38:52,109][25689] Fps is (10 sec: 5517.7, 60 sec: 5522.3, 300 sec: 5534.5). Total num frames: 883877888. Throughput: 0: 4992.1. Samples: 883874734. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:52,110][25689] Avg episode reward: [(0, '-1.487')] [2022-07-10 19:38:53,640][26022] Updated weights on worker 0-0, policy_version 863171 (0.00090) [2022-07-10 19:38:55,773][26022] Updated weights on worker 0-0, policy_version 863181 (0.00089) [2022-07-10 19:38:57,155][25689] Fps is (10 sec: 5600.1, 60 sec: 5539.8, 300 sec: 5530.7). Total num frames: 883906560. Throughput: 0: 5806.7. Samples: 883907704. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:38:57,157][25689] Avg episode reward: [(0, '-2.217')] [2022-07-10 19:38:57,336][26022] Updated weights on worker 0-0, policy_version 863191 (0.00089) [2022-07-10 19:38:59,334][26022] Updated weights on worker 0-0, policy_version 863201 (0.00084) [2022-07-10 19:38:59,390][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:38:59,404][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000863202_883918848.pth [2022-07-10 19:38:59,405][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000861256_881926144.pth [2022-07-10 19:39:01,155][26022] Updated weights on worker 0-0, policy_version 863211 (0.00088) [2022-07-10 19:39:02,257][25689] Fps is (10 sec: 5247.2, 60 sec: 5447.8, 300 sec: 5529.0). Total num frames: 883931136. Throughput: 0: 5740.8. Samples: 883940560. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:02,258][25689] Avg episode reward: [(0, '-1.745')] [2022-07-10 19:39:03,247][26022] Updated weights on worker 0-0, policy_version 863221 (0.00086) [2022-07-10 19:39:05,081][26022] Updated weights on worker 0-0, policy_version 863231 (0.00084) [2022-07-10 19:39:06,866][26022] Updated weights on worker 0-0, policy_version 863241 (0.00087) [2022-07-10 19:39:07,288][25689] Fps is (10 sec: 5255.1, 60 sec: 5530.8, 300 sec: 5528.7). Total num frames: 883959808. Throughput: 0: 4852.1. Samples: 883956182. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:07,289][25689] Avg episode reward: [(0, '-1.620')] [2022-07-10 19:39:08,686][26022] Updated weights on worker 0-0, policy_version 863251 (0.00086) [2022-07-10 19:39:10,744][26022] Updated weights on worker 0-0, policy_version 863261 (0.00086) [2022-07-10 19:39:12,299][25689] Fps is (10 sec: 5812.5, 60 sec: 5548.6, 300 sec: 5537.1). Total num frames: 883989504. Throughput: 0: 5703.4. Samples: 883989884. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:12,300][25689] Avg episode reward: [(0, '-1.664')] [2022-07-10 19:39:12,301][26022] Updated weights on worker 0-0, policy_version 863271 (0.00102) [2022-07-10 19:39:14,302][26022] Updated weights on worker 0-0, policy_version 863281 (0.00087) [2022-07-10 19:39:15,958][26022] Updated weights on worker 0-0, policy_version 863291 (0.00084) [2022-07-10 19:39:17,363][25689] Fps is (10 sec: 5691.8, 60 sec: 5518.6, 300 sec: 5532.8). Total num frames: 884017152. Throughput: 0: 5732.9. Samples: 884023552. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:17,363][25689] Avg episode reward: [(0, '-0.435')] [2022-07-10 19:39:17,936][26022] Updated weights on worker 0-0, policy_version 863301 (0.00093) [2022-07-10 19:39:19,851][26022] Updated weights on worker 0-0, policy_version 863311 (0.00090) [2022-07-10 19:39:21,549][26022] Updated weights on worker 0-0, policy_version 863321 (0.00085) [2022-07-10 19:39:22,368][25689] Fps is (10 sec: 5491.8, 60 sec: 5553.0, 300 sec: 5536.3). Total num frames: 884044800. Throughput: 0: 4946.2. Samples: 884040032. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:22,369][25689] Avg episode reward: [(0, '-0.459')] [2022-07-10 19:39:23,483][26022] Updated weights on worker 0-0, policy_version 863331 (0.00094) [2022-07-10 19:39:25,145][26022] Updated weights on worker 0-0, policy_version 863341 (0.00080) [2022-07-10 19:39:27,252][26022] Updated weights on worker 0-0, policy_version 863351 (0.00086) [2022-07-10 19:39:27,378][25689] Fps is (10 sec: 5521.0, 60 sec: 5521.7, 300 sec: 5536.5). Total num frames: 884072448. Throughput: 0: 5851.5. Samples: 884073742. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:27,379][25689] Avg episode reward: [(0, '-0.269')] [2022-07-10 19:39:29,056][26022] Updated weights on worker 0-0, policy_version 863361 (0.00092) [2022-07-10 19:39:30,871][26022] Updated weights on worker 0-0, policy_version 863371 (0.00081) [2022-07-10 19:39:32,428][25689] Fps is (10 sec: 5598.2, 60 sec: 5542.5, 300 sec: 5538.0). Total num frames: 884101120. Throughput: 0: 5809.8. Samples: 884106830. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:32,429][25689] Avg episode reward: [(0, '-0.359')] [2022-07-10 19:39:32,608][26022] Updated weights on worker 0-0, policy_version 863381 (0.00115) [2022-07-10 19:39:34,699][26022] Updated weights on worker 0-0, policy_version 863391 (0.00088) [2022-07-10 19:39:36,337][26022] Updated weights on worker 0-0, policy_version 863401 (0.00082) [2022-07-10 19:39:37,491][25689] Fps is (10 sec: 5569.4, 60 sec: 5524.1, 300 sec: 5531.4). Total num frames: 884128768. Throughput: 0: 4971.4. Samples: 884123616. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:37,491][25689] Avg episode reward: [(0, '-0.389')] [2022-07-10 19:39:38,199][26022] Updated weights on worker 0-0, policy_version 863411 (0.00086) [2022-07-10 19:39:39,916][26022] Updated weights on worker 0-0, policy_version 863421 (0.00095) [2022-07-10 19:39:41,991][26022] Updated weights on worker 0-0, policy_version 863431 (0.00097) [2022-07-10 19:39:42,517][25689] Fps is (10 sec: 5481.0, 60 sec: 5524.2, 300 sec: 5532.0). Total num frames: 884156416. Throughput: 0: 5829.9. Samples: 884157498. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:42,517][25689] Avg episode reward: [(0, '-0.703')] [2022-07-10 19:39:43,647][26022] Updated weights on worker 0-0, policy_version 863441 (0.00087) [2022-07-10 19:39:45,575][26022] Updated weights on worker 0-0, policy_version 863451 (0.00087) [2022-07-10 19:39:47,255][26022] Updated weights on worker 0-0, policy_version 863461 (0.00097) [2022-07-10 19:39:47,543][25689] Fps is (10 sec: 5602.9, 60 sec: 5539.8, 300 sec: 5532.0). Total num frames: 884185088. Throughput: 0: 5823.6. Samples: 884191172. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:47,543][25689] Avg episode reward: [(0, '-1.649')] [2022-07-10 19:39:49,309][26022] Updated weights on worker 0-0, policy_version 863471 (0.00095) [2022-07-10 19:39:51,083][26022] Updated weights on worker 0-0, policy_version 863481 (0.00083) [2022-07-10 19:39:52,561][25689] Fps is (10 sec: 5607.1, 60 sec: 5539.1, 300 sec: 5529.2). Total num frames: 884212736. Throughput: 0: 5013.7. Samples: 884207772. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:52,562][25689] Avg episode reward: [(0, '-2.811')] [2022-07-10 19:39:52,838][26022] Updated weights on worker 0-0, policy_version 863491 (0.00098) [2022-07-10 19:39:54,696][26022] Updated weights on worker 0-0, policy_version 863501 (0.00091) [2022-07-10 19:39:56,544][26022] Updated weights on worker 0-0, policy_version 863511 (0.00083) [2022-07-10 19:39:57,628][25689] Fps is (10 sec: 5584.4, 60 sec: 5537.2, 300 sec: 5534.9). Total num frames: 884241408. Throughput: 0: 5837.1. Samples: 884241160. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:39:57,630][25689] Avg episode reward: [(0, '-2.350')] [2022-07-10 19:39:58,479][26022] Updated weights on worker 0-0, policy_version 863521 (0.00081) [2022-07-10 19:40:00,347][26022] Updated weights on worker 0-0, policy_version 863531 (0.00082) [2022-07-10 19:40:02,488][26022] Updated weights on worker 0-0, policy_version 863541 (0.00089) [2022-07-10 19:40:02,642][25689] Fps is (10 sec: 5485.5, 60 sec: 5579.2, 300 sec: 5535.5). Total num frames: 884268032. Throughput: 0: 5738.2. Samples: 884272978. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:02,644][25689] Avg episode reward: [(0, '-2.396')] [2022-07-10 19:40:04,165][26022] Updated weights on worker 0-0, policy_version 863551 (0.00085) [2022-07-10 19:40:05,943][26022] Updated weights on worker 0-0, policy_version 863561 (0.00081) [2022-07-10 19:40:07,712][25689] Fps is (10 sec: 5280.4, 60 sec: 5541.6, 300 sec: 5527.5). Total num frames: 884294656. Throughput: 0: 4877.3. Samples: 884289544. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:07,713][25689] Avg episode reward: [(0, '-1.735')] [2022-07-10 19:40:08,001][26022] Updated weights on worker 0-0, policy_version 863571 (0.00089) [2022-07-10 19:40:09,716][26022] Updated weights on worker 0-0, policy_version 863581 (0.00085) [2022-07-10 19:40:11,664][26022] Updated weights on worker 0-0, policy_version 863591 (0.00094) [2022-07-10 19:40:12,753][25689] Fps is (10 sec: 5569.9, 60 sec: 5538.9, 300 sec: 5536.0). Total num frames: 884324352. Throughput: 0: 5726.1. Samples: 884323392. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:12,755][25689] Avg episode reward: [(0, '-1.614')] [2022-07-10 19:40:13,423][26022] Updated weights on worker 0-0, policy_version 863601 (0.00084) [2022-07-10 19:40:15,205][26022] Updated weights on worker 0-0, policy_version 863611 (0.00093) [2022-07-10 19:40:17,017][26022] Updated weights on worker 0-0, policy_version 863621 (0.00090) [2022-07-10 19:40:17,815][25689] Fps is (10 sec: 5676.1, 60 sec: 5539.1, 300 sec: 5528.9). Total num frames: 884352000. Throughput: 0: 5724.2. Samples: 884356714. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:17,815][25689] Avg episode reward: [(0, '-1.387')] [2022-07-10 19:40:18,875][26022] Updated weights on worker 0-0, policy_version 863631 (0.00092) [2022-07-10 19:40:20,739][26022] Updated weights on worker 0-0, policy_version 863641 (0.00092) [2022-07-10 19:40:22,732][26022] Updated weights on worker 0-0, policy_version 863651 (0.00089) [2022-07-10 19:40:22,892][25689] Fps is (10 sec: 5352.8, 60 sec: 5515.5, 300 sec: 5525.2). Total num frames: 884378624. Throughput: 0: 4951.9. Samples: 884373256. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:22,893][25689] Avg episode reward: [(0, '-1.150')] [2022-07-10 19:40:24,346][26022] Updated weights on worker 0-0, policy_version 863661 (0.00089) [2022-07-10 19:40:26,155][26022] Updated weights on worker 0-0, policy_version 863671 (0.00093) [2022-07-10 19:40:27,898][26022] Updated weights on worker 0-0, policy_version 863681 (0.00095) [2022-07-10 19:40:27,913][25689] Fps is (10 sec: 5678.5, 60 sec: 5565.3, 300 sec: 5535.7). Total num frames: 884409344. Throughput: 0: 5811.9. Samples: 884406952. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:27,914][25689] Avg episode reward: [(0, '-1.175')] [2022-07-10 19:40:29,851][26022] Updated weights on worker 0-0, policy_version 863691 (0.00095) [2022-07-10 19:40:31,623][26022] Updated weights on worker 0-0, policy_version 863701 (0.00087) [2022-07-10 19:40:32,933][25689] Fps is (10 sec: 5710.9, 60 sec: 5534.2, 300 sec: 5530.7). Total num frames: 884435968. Throughput: 0: 5819.3. Samples: 884440826. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:32,934][25689] Avg episode reward: [(0, '-1.877')] [2022-07-10 19:40:33,556][26022] Updated weights on worker 0-0, policy_version 863711 (0.00088) [2022-07-10 19:40:35,538][26022] Updated weights on worker 0-0, policy_version 863721 (0.00085) [2022-07-10 19:40:37,107][26022] Updated weights on worker 0-0, policy_version 863731 (0.00091) [2022-07-10 19:40:37,993][25689] Fps is (10 sec: 5485.6, 60 sec: 5551.4, 300 sec: 5534.4). Total num frames: 884464640. Throughput: 0: 4996.1. Samples: 884457528. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:37,994][25689] Avg episode reward: [(0, '-2.175')] [2022-07-10 19:40:38,885][26022] Updated weights on worker 0-0, policy_version 863741 (0.00094) [2022-07-10 19:40:40,759][26022] Updated weights on worker 0-0, policy_version 863751 (0.00082) [2022-07-10 19:40:42,683][26022] Updated weights on worker 0-0, policy_version 863761 (0.00085) [2022-07-10 19:40:43,045][25689] Fps is (10 sec: 5671.2, 60 sec: 5566.0, 300 sec: 5533.5). Total num frames: 884493312. Throughput: 0: 5861.7. Samples: 884491384. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-10 19:40:43,045][25689] Avg episode reward: [(0, '-1.434')] [2022-07-10 19:40:44,585][26022] Updated weights on worker 0-0, policy_version 863771 (0.00086) [2022-07-10 19:40:46,463][26022] Updated weights on worker 0-0, policy_version 863781 (0.00088) [2022-07-10 19:40:48,063][25689] Fps is (10 sec: 5491.3, 60 sec: 5532.8, 300 sec: 5526.9). Total num frames: 884519936. Throughput: 0: 5857.3. Samples: 884524976. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:40:48,063][25689] Avg episode reward: [(0, '-0.676')] [2022-07-10 19:40:48,269][26022] Updated weights on worker 0-0, policy_version 863791 (0.00092) [2022-07-10 19:40:49,853][26022] Updated weights on worker 0-0, policy_version 863801 (0.00085) [2022-07-10 19:40:52,066][26022] Updated weights on worker 0-0, policy_version 863811 (0.00091) [2022-07-10 19:40:53,081][25689] Fps is (10 sec: 5509.8, 60 sec: 5549.8, 300 sec: 5534.3). Total num frames: 884548608. Throughput: 0: 5811.4. Samples: 884557910. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:40:53,081][25689] Avg episode reward: [(0, '-0.428')] [2022-07-10 19:40:53,579][26022] Updated weights on worker 0-0, policy_version 863821 (0.00084) [2022-07-10 19:40:55,755][26022] Updated weights on worker 0-0, policy_version 863831 (0.00084) [2022-07-10 19:40:57,406][26022] Updated weights on worker 0-0, policy_version 863841 (0.00087) [2022-07-10 19:40:58,195][25689] Fps is (10 sec: 5558.6, 60 sec: 5528.6, 300 sec: 5529.7). Total num frames: 884576256. Throughput: 0: 5804.2. Samples: 884574782. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:40:58,196][25689] Avg episode reward: [(0, '-0.391')] [2022-07-10 19:40:59,270][26022] Updated weights on worker 0-0, policy_version 863851 (0.00092) [2022-07-10 19:40:59,701][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:40:59,721][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000863852_884584448.pth [2022-07-10 19:40:59,722][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000861905_882590720.pth [2022-07-10 19:41:01,133][26022] Updated weights on worker 0-0, policy_version 863861 (0.00082) [2022-07-10 19:41:03,233][25689] Fps is (10 sec: 5345.7, 60 sec: 5526.3, 300 sec: 5529.2). Total num frames: 884602880. Throughput: 0: 5684.6. Samples: 884606148. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:03,234][25689] Avg episode reward: [(0, '0.049')] [2022-07-10 19:41:03,367][26022] Updated weights on worker 0-0, policy_version 863871 (0.00084) [2022-07-10 19:41:05,219][26022] Updated weights on worker 0-0, policy_version 863881 (0.00114) [2022-07-10 19:41:07,139][26022] Updated weights on worker 0-0, policy_version 863891 (0.00054) [2022-07-10 19:41:08,263][25689] Fps is (10 sec: 5390.7, 60 sec: 5547.0, 300 sec: 5532.3). Total num frames: 884630528. Throughput: 0: 5674.0. Samples: 884639590. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:08,263][25689] Avg episode reward: [(0, '-0.201')] [2022-07-10 19:41:08,797][26022] Updated weights on worker 0-0, policy_version 863901 (0.00087) [2022-07-10 19:41:10,609][26022] Updated weights on worker 0-0, policy_version 863911 (0.00087) [2022-07-10 19:41:12,391][26022] Updated weights on worker 0-0, policy_version 863921 (0.00094) [2022-07-10 19:41:13,267][25689] Fps is (10 sec: 5510.8, 60 sec: 5516.5, 300 sec: 5530.3). Total num frames: 884658176. Throughput: 0: 4876.4. Samples: 884656348. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:13,268][25689] Avg episode reward: [(0, '-1.049')] [2022-07-10 19:41:14,358][26022] Updated weights on worker 0-0, policy_version 863931 (0.00081) [2022-07-10 19:41:16,121][26022] Updated weights on worker 0-0, policy_version 863941 (0.00089) [2022-07-10 19:41:17,936][26022] Updated weights on worker 0-0, policy_version 863951 (0.00086) [2022-07-10 19:41:18,317][25689] Fps is (10 sec: 5703.4, 60 sec: 5551.4, 300 sec: 5536.3). Total num frames: 884687872. Throughput: 0: 5737.5. Samples: 884690232. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:18,318][25689] Avg episode reward: [(0, '-1.233')] [2022-07-10 19:41:19,799][26022] Updated weights on worker 0-0, policy_version 863961 (0.00089) [2022-07-10 19:41:21,622][26022] Updated weights on worker 0-0, policy_version 863971 (0.00090) [2022-07-10 19:41:23,331][25689] Fps is (10 sec: 5596.6, 60 sec: 5557.3, 300 sec: 5533.0). Total num frames: 884714496. Throughput: 0: 5858.1. Samples: 884723880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:23,331][25689] Avg episode reward: [(0, '-1.239')] [2022-07-10 19:41:23,463][26022] Updated weights on worker 0-0, policy_version 863981 (0.00094) [2022-07-10 19:41:25,348][26022] Updated weights on worker 0-0, policy_version 863991 (0.00086) [2022-07-10 19:41:27,135][26022] Updated weights on worker 0-0, policy_version 864001 (0.00085) [2022-07-10 19:41:28,361][25689] Fps is (10 sec: 5505.2, 60 sec: 5522.5, 300 sec: 5532.8). Total num frames: 884743168. Throughput: 0: 5014.5. Samples: 884740374. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:28,362][25689] Avg episode reward: [(0, '-1.081')] [2022-07-10 19:41:29,031][26022] Updated weights on worker 0-0, policy_version 864011 (0.00099) [2022-07-10 19:41:31,063][26022] Updated weights on worker 0-0, policy_version 864021 (0.00091) [2022-07-10 19:41:32,632][26022] Updated weights on worker 0-0, policy_version 864031 (0.00086) [2022-07-10 19:41:33,366][25689] Fps is (10 sec: 5612.1, 60 sec: 5540.8, 300 sec: 5534.6). Total num frames: 884770816. Throughput: 0: 5840.3. Samples: 884773730. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:33,367][25689] Avg episode reward: [(0, '-0.527')] [2022-07-10 19:41:34,645][26022] Updated weights on worker 0-0, policy_version 864041 (0.00095) [2022-07-10 19:41:36,188][26022] Updated weights on worker 0-0, policy_version 864051 (0.00085) [2022-07-10 19:41:38,312][26022] Updated weights on worker 0-0, policy_version 864061 (0.00086) [2022-07-10 19:41:38,417][25689] Fps is (10 sec: 5499.2, 60 sec: 5524.8, 300 sec: 5530.7). Total num frames: 884798464. Throughput: 0: 5820.0. Samples: 884807212. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:38,417][25689] Avg episode reward: [(0, '-0.038')] [2022-07-10 19:41:40,196][26022] Updated weights on worker 0-0, policy_version 864071 (0.00085) [2022-07-10 19:41:41,823][26022] Updated weights on worker 0-0, policy_version 864081 (0.00086) [2022-07-10 19:41:43,455][25689] Fps is (10 sec: 5582.3, 60 sec: 5525.9, 300 sec: 5533.6). Total num frames: 884827136. Throughput: 0: 4976.1. Samples: 884824022. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:43,456][25689] Avg episode reward: [(0, '0.408')] [2022-07-10 19:41:43,797][26022] Updated weights on worker 0-0, policy_version 864091 (0.00092) [2022-07-10 19:41:45,440][26022] Updated weights on worker 0-0, policy_version 864101 (0.00084) [2022-07-10 19:41:47,615][26022] Updated weights on worker 0-0, policy_version 864111 (0.00087) [2022-07-10 19:41:48,479][25689] Fps is (10 sec: 5597.2, 60 sec: 5542.4, 300 sec: 5536.9). Total num frames: 884854784. Throughput: 0: 5827.9. Samples: 884857616. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:48,479][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 19:41:49,195][26022] Updated weights on worker 0-0, policy_version 864121 (0.00090) [2022-07-10 19:41:51,064][26022] Updated weights on worker 0-0, policy_version 864131 (0.00083) [2022-07-10 19:41:52,982][26022] Updated weights on worker 0-0, policy_version 864141 (0.00094) [2022-07-10 19:41:53,502][25689] Fps is (10 sec: 5605.6, 60 sec: 5541.9, 300 sec: 5537.9). Total num frames: 884883456. Throughput: 0: 5841.6. Samples: 884891358. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:53,503][25689] Avg episode reward: [(0, '-0.773')] [2022-07-10 19:41:54,840][26022] Updated weights on worker 0-0, policy_version 864151 (0.00087) [2022-07-10 19:41:56,663][26022] Updated weights on worker 0-0, policy_version 864161 (0.00093) [2022-07-10 19:41:58,579][25689] Fps is (10 sec: 5575.9, 60 sec: 5545.3, 300 sec: 5534.4). Total num frames: 884911104. Throughput: 0: 4994.1. Samples: 884907906. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:41:58,581][26022] Updated weights on worker 0-0, policy_version 864171 (0.00075) [2022-07-10 19:41:58,580][25689] Avg episode reward: [(0, '-0.935')] [2022-07-10 19:42:00,409][26022] Updated weights on worker 0-0, policy_version 864181 (0.00081) [2022-07-10 19:42:02,727][26022] Updated weights on worker 0-0, policy_version 864191 (0.00089) [2022-07-10 19:42:03,650][25689] Fps is (10 sec: 5348.1, 60 sec: 5542.3, 300 sec: 5537.2). Total num frames: 884937728. Throughput: 0: 5782.3. Samples: 884940794. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:03,651][25689] Avg episode reward: [(0, '-0.892')] [2022-07-10 19:42:04,402][26022] Updated weights on worker 0-0, policy_version 864201 (0.00083) [2022-07-10 19:42:06,039][26022] Updated weights on worker 0-0, policy_version 864211 (0.00090) [2022-07-10 19:42:08,013][26022] Updated weights on worker 0-0, policy_version 864221 (0.00088) [2022-07-10 19:42:08,695][25689] Fps is (10 sec: 5264.0, 60 sec: 5524.0, 300 sec: 5533.8). Total num frames: 884964352. Throughput: 0: 5696.7. Samples: 884972780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:08,695][25689] Avg episode reward: [(0, '-1.137')] [2022-07-10 19:42:09,880][26022] Updated weights on worker 0-0, policy_version 864231 (0.00095) [2022-07-10 19:42:11,758][26022] Updated weights on worker 0-0, policy_version 864241 (0.00090) [2022-07-10 19:42:13,705][25689] Fps is (10 sec: 5397.7, 60 sec: 5523.5, 300 sec: 5529.3). Total num frames: 884992000. Throughput: 0: 4858.5. Samples: 884989512. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:13,705][25689] Avg episode reward: [(0, '-1.513')] [2022-07-10 19:42:13,752][26022] Updated weights on worker 0-0, policy_version 864251 (0.00087) [2022-07-10 19:42:15,311][26022] Updated weights on worker 0-0, policy_version 864261 (0.00082) [2022-07-10 19:42:17,479][26022] Updated weights on worker 0-0, policy_version 864271 (0.00093) [2022-07-10 19:42:18,782][25689] Fps is (10 sec: 5685.0, 60 sec: 5521.0, 300 sec: 5532.7). Total num frames: 885021696. Throughput: 0: 5695.8. Samples: 885022974. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:18,782][25689] Avg episode reward: [(0, '-1.023')] [2022-07-10 19:42:19,199][26022] Updated weights on worker 0-0, policy_version 864281 (0.00086) [2022-07-10 19:42:21,020][26022] Updated weights on worker 0-0, policy_version 864291 (0.00081) [2022-07-10 19:42:22,754][26022] Updated weights on worker 0-0, policy_version 864301 (0.00088) [2022-07-10 19:42:23,841][25689] Fps is (10 sec: 5758.1, 60 sec: 5550.6, 300 sec: 5540.0). Total num frames: 885050368. Throughput: 0: 5733.7. Samples: 885056566. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:23,842][25689] Avg episode reward: [(0, '-0.960')] [2022-07-10 19:42:24,647][26022] Updated weights on worker 0-0, policy_version 864311 (0.00086) [2022-07-10 19:42:26,425][26022] Updated weights on worker 0-0, policy_version 864321 (0.00554) [2022-07-10 19:42:28,303][26022] Updated weights on worker 0-0, policy_version 864331 (0.00089) [2022-07-10 19:42:28,879][25689] Fps is (10 sec: 5577.6, 60 sec: 5533.0, 300 sec: 5536.5). Total num frames: 885078016. Throughput: 0: 4970.2. Samples: 885073100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:28,880][25689] Avg episode reward: [(0, '-0.977')] [2022-07-10 19:42:29,901][26022] Updated weights on worker 0-0, policy_version 864341 (0.00085) [2022-07-10 19:42:32,194][26022] Updated weights on worker 0-0, policy_version 864351 (0.00088) [2022-07-10 19:42:33,750][26022] Updated weights on worker 0-0, policy_version 864361 (0.00092) [2022-07-10 19:42:33,979][25689] Fps is (10 sec: 5555.9, 60 sec: 5541.3, 300 sec: 5533.4). Total num frames: 885106688. Throughput: 0: 5770.1. Samples: 885106494. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:33,980][25689] Avg episode reward: [(0, '-1.388')] [2022-07-10 19:42:35,663][26022] Updated weights on worker 0-0, policy_version 864371 (0.00094) [2022-07-10 19:42:37,496][26022] Updated weights on worker 0-0, policy_version 864381 (0.00094) [2022-07-10 19:42:39,079][25689] Fps is (10 sec: 5521.9, 60 sec: 5536.8, 300 sec: 5539.0). Total num frames: 885134336. Throughput: 0: 5758.0. Samples: 885139844. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:39,079][25689] Avg episode reward: [(0, '-0.594')] [2022-07-10 19:42:39,428][26022] Updated weights on worker 0-0, policy_version 864391 (0.00086) [2022-07-10 19:42:41,196][26022] Updated weights on worker 0-0, policy_version 864401 (0.00091) [2022-07-10 19:42:43,081][26022] Updated weights on worker 0-0, policy_version 864411 (0.00080) [2022-07-10 19:42:44,145][25689] Fps is (10 sec: 5539.9, 60 sec: 5534.3, 300 sec: 5531.4). Total num frames: 885163008. Throughput: 0: 4933.7. Samples: 885156736. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:44,145][25689] Avg episode reward: [(0, '-0.077')] [2022-07-10 19:42:44,788][26022] Updated weights on worker 0-0, policy_version 864421 (0.00091) [2022-07-10 19:42:46,902][26022] Updated weights on worker 0-0, policy_version 864431 (0.00092) [2022-07-10 19:42:48,259][26022] Updated weights on worker 0-0, policy_version 864441 (0.00093) [2022-07-10 19:42:49,243][25689] Fps is (10 sec: 5641.8, 60 sec: 5544.3, 300 sec: 5536.6). Total num frames: 885191680. Throughput: 0: 5758.5. Samples: 885190364. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:49,244][25689] Avg episode reward: [(0, '0.512')] [2022-07-10 19:42:50,495][26022] Updated weights on worker 0-0, policy_version 864451 (0.00093) [2022-07-10 19:42:52,147][26022] Updated weights on worker 0-0, policy_version 864461 (0.00086) [2022-07-10 19:42:54,079][26022] Updated weights on worker 0-0, policy_version 864471 (0.00090) [2022-07-10 19:42:54,249][25689] Fps is (10 sec: 5675.2, 60 sec: 5545.9, 300 sec: 5540.9). Total num frames: 885220352. Throughput: 0: 5804.7. Samples: 885224158. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:54,250][25689] Avg episode reward: [(0, '0.683')] [2022-07-10 19:42:55,768][26022] Updated weights on worker 0-0, policy_version 864481 (0.00085) [2022-07-10 19:42:57,614][26022] Updated weights on worker 0-0, policy_version 864491 (0.00088) [2022-07-10 19:42:59,326][25689] Fps is (10 sec: 5484.1, 60 sec: 5529.1, 300 sec: 5529.6). Total num frames: 885246976. Throughput: 0: 4977.0. Samples: 885240614. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:42:59,327][25689] Avg episode reward: [(0, '-0.539')] [2022-07-10 19:42:59,546][26022] Updated weights on worker 0-0, policy_version 864501 (0.00089) [2022-07-10 19:42:59,727][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:42:59,744][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000864502_885250048.pth [2022-07-10 19:42:59,744][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000862553_883254272.pth [2022-07-10 19:43:01,608][26022] Updated weights on worker 0-0, policy_version 864511 (0.00086) [2022-07-10 19:43:03,410][26022] Updated weights on worker 0-0, policy_version 864521 (0.00104) [2022-07-10 19:43:04,343][25689] Fps is (10 sec: 5275.2, 60 sec: 5534.0, 300 sec: 5539.8). Total num frames: 885273600. Throughput: 0: 5722.6. Samples: 885272322. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:04,344][25689] Avg episode reward: [(0, '-1.341')] [2022-07-10 19:43:05,521][26022] Updated weights on worker 0-0, policy_version 864531 (0.00079) [2022-07-10 19:43:06,986][26022] Updated weights on worker 0-0, policy_version 864541 (0.00084) [2022-07-10 19:43:09,054][26022] Updated weights on worker 0-0, policy_version 864551 (0.00090) [2022-07-10 19:43:09,363][25689] Fps is (10 sec: 5611.2, 60 sec: 5586.9, 300 sec: 5543.3). Total num frames: 885303296. Throughput: 0: 5740.3. Samples: 885305858. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:09,363][25689] Avg episode reward: [(0, '-1.785')] [2022-07-10 19:43:11,191][26022] Updated weights on worker 0-0, policy_version 864561 (0.00091) [2022-07-10 19:43:12,484][26022] Updated weights on worker 0-0, policy_version 864571 (0.00083) [2022-07-10 19:43:14,381][25689] Fps is (10 sec: 5508.7, 60 sec: 5552.4, 300 sec: 5531.1). Total num frames: 885328896. Throughput: 0: 5739.2. Samples: 885339698. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:14,381][25689] Avg episode reward: [(0, '-2.533')] [2022-07-10 19:43:14,731][26022] Updated weights on worker 0-0, policy_version 864581 (0.00091) [2022-07-10 19:43:16,267][26022] Updated weights on worker 0-0, policy_version 864591 (0.00085) [2022-07-10 19:43:18,152][26022] Updated weights on worker 0-0, policy_version 864601 (0.00086) [2022-07-10 19:43:19,427][25689] Fps is (10 sec: 5494.5, 60 sec: 5555.2, 300 sec: 5544.3). Total num frames: 885358592. Throughput: 0: 5774.2. Samples: 885356682. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:19,427][25689] Avg episode reward: [(0, '-2.900')] [2022-07-10 19:43:20,082][26022] Updated weights on worker 0-0, policy_version 864611 (0.00086) [2022-07-10 19:43:21,577][26022] Updated weights on worker 0-0, policy_version 864621 (0.00086) [2022-07-10 19:43:23,584][26022] Updated weights on worker 0-0, policy_version 864631 (0.00089) [2022-07-10 19:43:24,436][25689] Fps is (10 sec: 5804.6, 60 sec: 5559.8, 300 sec: 5541.4). Total num frames: 885387264. Throughput: 0: 5884.2. Samples: 885390556. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:24,437][25689] Avg episode reward: [(0, '-2.649')] [2022-07-10 19:43:25,465][26022] Updated weights on worker 0-0, policy_version 864641 (0.00089) [2022-07-10 19:43:27,315][26022] Updated weights on worker 0-0, policy_version 864651 (0.00097) [2022-07-10 19:43:29,230][26022] Updated weights on worker 0-0, policy_version 864661 (0.00102) [2022-07-10 19:43:29,478][25689] Fps is (10 sec: 5501.1, 60 sec: 5542.5, 300 sec: 5538.9). Total num frames: 885413888. Throughput: 0: 5841.1. Samples: 885423356. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:29,479][25689] Avg episode reward: [(0, '-1.790')] [2022-07-10 19:43:31,063][26022] Updated weights on worker 0-0, policy_version 864671 (0.00087) [2022-07-10 19:43:32,943][26022] Updated weights on worker 0-0, policy_version 864681 (0.00087) [2022-07-10 19:43:34,487][25689] Fps is (10 sec: 5399.9, 60 sec: 5533.9, 300 sec: 5536.1). Total num frames: 885441536. Throughput: 0: 4986.4. Samples: 885439954. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:34,487][25689] Avg episode reward: [(0, '-0.553')] [2022-07-10 19:43:34,773][26022] Updated weights on worker 0-0, policy_version 864691 (0.00087) [2022-07-10 19:43:36,516][26022] Updated weights on worker 0-0, policy_version 864701 (0.00094) [2022-07-10 19:43:38,388][26022] Updated weights on worker 0-0, policy_version 864711 (0.00093) [2022-07-10 19:43:39,571][25689] Fps is (10 sec: 5580.1, 60 sec: 5552.3, 300 sec: 5538.5). Total num frames: 885470208. Throughput: 0: 5797.2. Samples: 885473466. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:39,572][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 19:43:40,331][26022] Updated weights on worker 0-0, policy_version 864721 (0.00088) [2022-07-10 19:43:42,037][26022] Updated weights on worker 0-0, policy_version 864731 (0.00079) [2022-07-10 19:43:44,036][26022] Updated weights on worker 0-0, policy_version 864741 (0.00088) [2022-07-10 19:43:44,606][25689] Fps is (10 sec: 5565.6, 60 sec: 5538.2, 300 sec: 5538.1). Total num frames: 885497856. Throughput: 0: 5778.0. Samples: 885507096. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:44,607][25689] Avg episode reward: [(0, '0.567')] [2022-07-10 19:43:45,635][26022] Updated weights on worker 0-0, policy_version 864751 (0.00092) [2022-07-10 19:43:47,621][26022] Updated weights on worker 0-0, policy_version 864761 (0.00094) [2022-07-10 19:43:49,313][26022] Updated weights on worker 0-0, policy_version 864771 (0.00093) [2022-07-10 19:43:49,610][25689] Fps is (10 sec: 5610.4, 60 sec: 5546.9, 300 sec: 5541.6). Total num frames: 885526528. Throughput: 0: 4992.4. Samples: 885523860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:49,612][25689] Avg episode reward: [(0, '0.494')] [2022-07-10 19:43:51,215][26022] Updated weights on worker 0-0, policy_version 864781 (0.00112) [2022-07-10 19:43:53,148][26022] Updated weights on worker 0-0, policy_version 864791 (0.00093) [2022-07-10 19:43:54,619][25689] Fps is (10 sec: 5522.7, 60 sec: 5512.7, 300 sec: 5535.4). Total num frames: 885553152. Throughput: 0: 5805.0. Samples: 885556818. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:54,620][25689] Avg episode reward: [(0, '0.047')] [2022-07-10 19:43:55,212][26022] Updated weights on worker 0-0, policy_version 864801 (0.00086) [2022-07-10 19:43:56,751][26022] Updated weights on worker 0-0, policy_version 864811 (0.00092) [2022-07-10 19:43:58,876][26022] Updated weights on worker 0-0, policy_version 864821 (0.00092) [2022-07-10 19:43:59,682][25689] Fps is (10 sec: 5489.9, 60 sec: 5547.8, 300 sec: 5549.9). Total num frames: 885581824. Throughput: 0: 5786.4. Samples: 885589834. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:43:59,683][25689] Avg episode reward: [(0, '0.110')] [2022-07-10 19:44:00,616][26022] Updated weights on worker 0-0, policy_version 864831 (0.00090) [2022-07-10 19:44:02,910][26022] Updated weights on worker 0-0, policy_version 864841 (0.00098) [2022-07-10 19:44:04,708][25689] Fps is (10 sec: 5277.3, 60 sec: 5513.1, 300 sec: 5536.2). Total num frames: 885606400. Throughput: 0: 4838.5. Samples: 885604362. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:44:04,709][25689] Avg episode reward: [(0, '0.037')] [2022-07-10 19:44:04,751][26022] Updated weights on worker 0-0, policy_version 864851 (0.00087) [2022-07-10 19:44:06,542][26022] Updated weights on worker 0-0, policy_version 864861 (0.00098) [2022-07-10 19:44:08,608][26022] Updated weights on worker 0-0, policy_version 864871 (0.00090) [2022-07-10 19:44:09,720][25689] Fps is (10 sec: 5202.8, 60 sec: 5479.9, 300 sec: 5529.4). Total num frames: 885634048. Throughput: 0: 5647.1. Samples: 885637422. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:44:09,721][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 19:44:10,142][26022] Updated weights on worker 0-0, policy_version 864881 (0.00089) [2022-07-10 19:44:12,185][26022] Updated weights on worker 0-0, policy_version 864891 (0.00090) [2022-07-10 19:44:14,131][26022] Updated weights on worker 0-0, policy_version 864901 (0.00084) [2022-07-10 19:44:14,729][25689] Fps is (10 sec: 5518.1, 60 sec: 5514.6, 300 sec: 5530.4). Total num frames: 885661696. Throughput: 0: 5654.5. Samples: 885670534. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:44:14,730][25689] Avg episode reward: [(0, '0.144')] [2022-07-10 19:44:15,845][26022] Updated weights on worker 0-0, policy_version 864911 (0.00091) [2022-07-10 19:44:17,803][26022] Updated weights on worker 0-0, policy_version 864921 (0.00093) [2022-07-10 19:44:19,471][26022] Updated weights on worker 0-0, policy_version 864931 (0.00082) [2022-07-10 19:44:19,774][25689] Fps is (10 sec: 5601.7, 60 sec: 5497.8, 300 sec: 5533.1). Total num frames: 885690368. Throughput: 0: 4835.6. Samples: 885686990. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:44:19,774][25689] Avg episode reward: [(0, '-1.277')] [2022-07-10 19:44:21,417][26022] Updated weights on worker 0-0, policy_version 864941 (0.00085) [2022-07-10 19:44:23,316][26022] Updated weights on worker 0-0, policy_version 864951 (0.00085) [2022-07-10 19:44:24,782][25689] Fps is (10 sec: 5602.8, 60 sec: 5481.0, 300 sec: 5533.1). Total num frames: 885718016. Throughput: 0: 5782.3. Samples: 885720430. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 19:44:24,782][25689] Avg episode reward: [(0, '-0.771')] [2022-07-10 19:44:25,158][26022] Updated weights on worker 0-0, policy_version 864961 (0.00090) [2022-07-10 19:44:27,092][26022] Updated weights on worker 0-0, policy_version 864971 (0.00089) [2022-07-10 19:44:28,663][26022] Updated weights on worker 0-0, policy_version 864981 (0.00087) [2022-07-10 19:44:29,798][25689] Fps is (10 sec: 5516.1, 60 sec: 5500.3, 300 sec: 5530.3). Total num frames: 885745664. Throughput: 0: 5797.8. Samples: 885753834. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:44:29,799][25689] Avg episode reward: [(0, '-0.685')] [2022-07-10 19:44:30,907][26022] Updated weights on worker 0-0, policy_version 864991 (0.00088) [2022-07-10 19:44:32,402][26022] Updated weights on worker 0-0, policy_version 865001 (0.00084) [2022-07-10 19:44:34,455][26022] Updated weights on worker 0-0, policy_version 865011 (0.00089) [2022-07-10 19:44:34,809][25689] Fps is (10 sec: 5616.8, 60 sec: 5517.1, 300 sec: 5534.7). Total num frames: 885774336. Throughput: 0: 4981.2. Samples: 885770554. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:44:34,809][25689] Avg episode reward: [(0, '-1.391')] [2022-07-10 19:44:36,120][26022] Updated weights on worker 0-0, policy_version 865021 (0.00086) [2022-07-10 19:44:37,954][26022] Updated weights on worker 0-0, policy_version 865031 (0.00087) [2022-07-10 19:44:39,853][25689] Fps is (10 sec: 5499.8, 60 sec: 5486.8, 300 sec: 5530.9). Total num frames: 885800960. Throughput: 0: 5829.8. Samples: 885804044. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:44:39,853][25689] Avg episode reward: [(0, '-1.617')] [2022-07-10 19:44:39,923][26022] Updated weights on worker 0-0, policy_version 865041 (0.00091) [2022-07-10 19:44:41,650][26022] Updated weights on worker 0-0, policy_version 865051 (0.00092) [2022-07-10 19:44:43,433][26022] Updated weights on worker 0-0, policy_version 865061 (0.00092) [2022-07-10 19:44:44,930][25689] Fps is (10 sec: 5463.3, 60 sec: 5499.9, 300 sec: 5530.0). Total num frames: 885829632. Throughput: 0: 5821.0. Samples: 885837714. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:44:44,931][25689] Avg episode reward: [(0, '-0.702')] [2022-07-10 19:44:45,351][26022] Updated weights on worker 0-0, policy_version 865071 (0.00103) [2022-07-10 19:44:47,138][26022] Updated weights on worker 0-0, policy_version 865081 (0.00083) [2022-07-10 19:44:49,310][26022] Updated weights on worker 0-0, policy_version 865091 (0.00087) [2022-07-10 19:44:49,932][25689] Fps is (10 sec: 5587.7, 60 sec: 5483.1, 300 sec: 5530.3). Total num frames: 885857280. Throughput: 0: 4990.8. Samples: 885854316. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:44:49,932][25689] Avg episode reward: [(0, '-0.403')] [2022-07-10 19:44:50,670][26022] Updated weights on worker 0-0, policy_version 865101 (0.00097) [2022-07-10 19:44:52,989][26022] Updated weights on worker 0-0, policy_version 865111 (0.00091) [2022-07-10 19:44:54,277][26022] Updated weights on worker 0-0, policy_version 865121 (0.01116) [2022-07-10 19:44:54,936][25689] Fps is (10 sec: 5731.0, 60 sec: 5534.5, 300 sec: 5534.9). Total num frames: 885886976. Throughput: 0: 5830.5. Samples: 885887906. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:44:54,937][25689] Avg episode reward: [(0, '-1.444')] [2022-07-10 19:44:56,550][26022] Updated weights on worker 0-0, policy_version 865131 (0.00099) [2022-07-10 19:44:58,186][26022] Updated weights on worker 0-0, policy_version 865141 (0.00086) [2022-07-10 19:44:59,788][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:44:59,801][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000865149_885912576.pth [2022-07-10 19:44:59,802][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000863202_883918848.pth [2022-07-10 19:45:00,024][25689] Fps is (10 sec: 5580.4, 60 sec: 5498.3, 300 sec: 5533.5). Total num frames: 885913600. Throughput: 0: 5808.3. Samples: 885921208. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:00,025][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 19:45:00,265][26022] Updated weights on worker 0-0, policy_version 865151 (0.00088) [2022-07-10 19:45:02,060][26022] Updated weights on worker 0-0, policy_version 865161 (0.00176) [2022-07-10 19:45:04,225][26022] Updated weights on worker 0-0, policy_version 865171 (0.00078) [2022-07-10 19:45:05,043][25689] Fps is (10 sec: 5268.2, 60 sec: 5532.9, 300 sec: 5534.5). Total num frames: 885940224. Throughput: 0: 4882.8. Samples: 885935924. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:05,044][25689] Avg episode reward: [(0, '-0.700')] [2022-07-10 19:45:05,911][26022] Updated weights on worker 0-0, policy_version 865181 (0.00097) [2022-07-10 19:45:07,869][26022] Updated weights on worker 0-0, policy_version 865191 (0.00087) [2022-07-10 19:45:09,598][26022] Updated weights on worker 0-0, policy_version 865201 (0.00078) [2022-07-10 19:45:10,049][25689] Fps is (10 sec: 5413.6, 60 sec: 5533.3, 300 sec: 5528.2). Total num frames: 885967872. Throughput: 0: 5717.0. Samples: 885969328. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:10,050][25689] Avg episode reward: [(0, '-0.439')] [2022-07-10 19:45:11,595][26022] Updated weights on worker 0-0, policy_version 865211 (0.00094) [2022-07-10 19:45:13,299][26022] Updated weights on worker 0-0, policy_version 865221 (0.00090) [2022-07-10 19:45:15,101][25689] Fps is (10 sec: 5497.9, 60 sec: 5529.5, 300 sec: 5528.4). Total num frames: 885995520. Throughput: 0: 5689.5. Samples: 886002636. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:15,102][25689] Avg episode reward: [(0, '-0.382')] [2022-07-10 19:45:15,350][26022] Updated weights on worker 0-0, policy_version 865231 (0.00090) [2022-07-10 19:45:16,886][26022] Updated weights on worker 0-0, policy_version 865241 (0.00087) [2022-07-10 19:45:19,123][26022] Updated weights on worker 0-0, policy_version 865251 (0.00086) [2022-07-10 19:45:20,209][25689] Fps is (10 sec: 5543.3, 60 sec: 5523.7, 300 sec: 5534.7). Total num frames: 886024192. Throughput: 0: 4870.3. Samples: 886019516. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:20,210][25689] Avg episode reward: [(0, '-0.196')] [2022-07-10 19:45:20,489][26022] Updated weights on worker 0-0, policy_version 865261 (0.00087) [2022-07-10 19:45:22,574][26022] Updated weights on worker 0-0, policy_version 865271 (0.00083) [2022-07-10 19:45:24,363][26022] Updated weights on worker 0-0, policy_version 865281 (0.00088) [2022-07-10 19:45:25,228][25689] Fps is (10 sec: 5561.6, 60 sec: 5522.7, 300 sec: 5524.4). Total num frames: 886051840. Throughput: 0: 5806.1. Samples: 886053116. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:25,228][25689] Avg episode reward: [(0, '0.408')] [2022-07-10 19:45:26,222][26022] Updated weights on worker 0-0, policy_version 865291 (0.00088) [2022-07-10 19:45:28,147][26022] Updated weights on worker 0-0, policy_version 865301 (0.00089) [2022-07-10 19:45:29,826][26022] Updated weights on worker 0-0, policy_version 865311 (0.00061) [2022-07-10 19:45:30,243][25689] Fps is (10 sec: 5511.2, 60 sec: 5522.8, 300 sec: 5528.0). Total num frames: 886079488. Throughput: 0: 5799.8. Samples: 886086446. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:30,245][25689] Avg episode reward: [(0, '0.660')] [2022-07-10 19:45:31,767][26022] Updated weights on worker 0-0, policy_version 865321 (0.00084) [2022-07-10 19:45:33,632][26022] Updated weights on worker 0-0, policy_version 865331 (0.00087) [2022-07-10 19:45:35,257][25689] Fps is (10 sec: 5615.6, 60 sec: 5522.5, 300 sec: 5528.8). Total num frames: 886108160. Throughput: 0: 4993.1. Samples: 886103276. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:35,257][25689] Avg episode reward: [(0, '0.595')] [2022-07-10 19:45:35,327][26022] Updated weights on worker 0-0, policy_version 865341 (0.00088) [2022-07-10 19:45:37,203][26022] Updated weights on worker 0-0, policy_version 865351 (0.00089) [2022-07-10 19:45:39,227][26022] Updated weights on worker 0-0, policy_version 865361 (0.00088) [2022-07-10 19:45:40,314][25689] Fps is (10 sec: 5693.6, 60 sec: 5555.1, 300 sec: 5528.7). Total num frames: 886136832. Throughput: 0: 5839.4. Samples: 886136918. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:40,315][25689] Avg episode reward: [(0, '0.369')] [2022-07-10 19:45:41,008][26022] Updated weights on worker 0-0, policy_version 865371 (0.00092) [2022-07-10 19:45:42,599][26022] Updated weights on worker 0-0, policy_version 865381 (0.00080) [2022-07-10 19:45:44,677][26022] Updated weights on worker 0-0, policy_version 865391 (0.00091) [2022-07-10 19:45:45,336][25689] Fps is (10 sec: 5486.3, 60 sec: 5526.4, 300 sec: 5528.7). Total num frames: 886163456. Throughput: 0: 5836.5. Samples: 886170478. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:45,336][25689] Avg episode reward: [(0, '0.360')] [2022-07-10 19:45:46,214][26022] Updated weights on worker 0-0, policy_version 865401 (0.00612) [2022-07-10 19:45:48,305][26022] Updated weights on worker 0-0, policy_version 865411 (0.00092) [2022-07-10 19:45:49,964][26022] Updated weights on worker 0-0, policy_version 865421 (0.00088) [2022-07-10 19:45:50,337][25689] Fps is (10 sec: 5517.2, 60 sec: 5543.4, 300 sec: 5529.0). Total num frames: 886192128. Throughput: 0: 5023.8. Samples: 886187396. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:50,338][25689] Avg episode reward: [(0, '0.261')] [2022-07-10 19:45:51,839][26022] Updated weights on worker 0-0, policy_version 865431 (0.00085) [2022-07-10 19:45:53,792][26022] Updated weights on worker 0-0, policy_version 865441 (0.00088) [2022-07-10 19:45:55,369][25689] Fps is (10 sec: 5613.3, 60 sec: 5506.9, 300 sec: 5530.5). Total num frames: 886219776. Throughput: 0: 5825.6. Samples: 886220442. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:45:55,370][25689] Avg episode reward: [(0, '0.563')] [2022-07-10 19:45:55,756][26022] Updated weights on worker 0-0, policy_version 865451 (0.00088) [2022-07-10 19:45:57,442][26022] Updated weights on worker 0-0, policy_version 865461 (0.00096) [2022-07-10 19:45:59,492][26022] Updated weights on worker 0-0, policy_version 865471 (0.00091) [2022-07-10 19:46:00,412][25689] Fps is (10 sec: 5590.0, 60 sec: 5544.9, 300 sec: 5537.3). Total num frames: 886248448. Throughput: 0: 5815.7. Samples: 886253800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:00,413][25689] Avg episode reward: [(0, '-0.033')] [2022-07-10 19:46:01,121][26022] Updated weights on worker 0-0, policy_version 865481 (0.00088) [2022-07-10 19:46:03,408][26022] Updated weights on worker 0-0, policy_version 865491 (0.00083) [2022-07-10 19:46:05,335][26022] Updated weights on worker 0-0, policy_version 865501 (0.00086) [2022-07-10 19:46:05,482][25689] Fps is (10 sec: 5265.1, 60 sec: 5506.4, 300 sec: 5526.2). Total num frames: 886273024. Throughput: 0: 4859.6. Samples: 886268382. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:05,483][25689] Avg episode reward: [(0, '-0.079')] [2022-07-10 19:46:06,992][26022] Updated weights on worker 0-0, policy_version 865511 (0.00103) [2022-07-10 19:46:09,064][26022] Updated weights on worker 0-0, policy_version 865521 (0.00087) [2022-07-10 19:46:10,508][25689] Fps is (10 sec: 5274.3, 60 sec: 5521.5, 300 sec: 5529.3). Total num frames: 886301696. Throughput: 0: 5671.7. Samples: 886301798. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:10,508][25689] Avg episode reward: [(0, '-0.042')] [2022-07-10 19:46:10,758][26022] Updated weights on worker 0-0, policy_version 865531 (0.00092) [2022-07-10 19:46:12,603][26022] Updated weights on worker 0-0, policy_version 865541 (0.00089) [2022-07-10 19:46:14,572][26022] Updated weights on worker 0-0, policy_version 865551 (0.00089) [2022-07-10 19:46:15,527][25689] Fps is (10 sec: 5607.1, 60 sec: 5524.5, 300 sec: 5522.9). Total num frames: 886329344. Throughput: 0: 5694.4. Samples: 886335228. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:15,529][25689] Avg episode reward: [(0, '-0.367')] [2022-07-10 19:46:16,204][26022] Updated weights on worker 0-0, policy_version 865561 (0.00080) [2022-07-10 19:46:18,058][26022] Updated weights on worker 0-0, policy_version 865571 (0.00082) [2022-07-10 19:46:19,961][26022] Updated weights on worker 0-0, policy_version 865581 (0.00091) [2022-07-10 19:46:20,592][25689] Fps is (10 sec: 5686.6, 60 sec: 5545.4, 300 sec: 5532.3). Total num frames: 886359040. Throughput: 0: 4870.4. Samples: 886352084. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:20,592][25689] Avg episode reward: [(0, '0.067')] [2022-07-10 19:46:21,734][26022] Updated weights on worker 0-0, policy_version 865591 (0.00089) [2022-07-10 19:46:23,517][26022] Updated weights on worker 0-0, policy_version 865601 (0.00116) [2022-07-10 19:46:25,275][26022] Updated weights on worker 0-0, policy_version 865611 (0.00078) [2022-07-10 19:46:25,631][25689] Fps is (10 sec: 5675.4, 60 sec: 5543.6, 300 sec: 5528.7). Total num frames: 886386688. Throughput: 0: 5830.3. Samples: 886385852. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:25,632][25689] Avg episode reward: [(0, '-0.501')] [2022-07-10 19:46:27,095][26022] Updated weights on worker 0-0, policy_version 865621 (0.00085) [2022-07-10 19:46:29,198][26022] Updated weights on worker 0-0, policy_version 865631 (0.00083) [2022-07-10 19:46:30,653][25689] Fps is (10 sec: 5495.9, 60 sec: 5542.9, 300 sec: 5528.4). Total num frames: 886414336. Throughput: 0: 5841.0. Samples: 886419468. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:30,655][25689] Avg episode reward: [(0, '-0.266')] [2022-07-10 19:46:30,978][26022] Updated weights on worker 0-0, policy_version 865641 (0.00089) [2022-07-10 19:46:32,872][26022] Updated weights on worker 0-0, policy_version 865651 (0.00110) [2022-07-10 19:46:34,676][26022] Updated weights on worker 0-0, policy_version 865661 (0.00089) [2022-07-10 19:46:35,667][25689] Fps is (10 sec: 5509.7, 60 sec: 5526.0, 300 sec: 5529.1). Total num frames: 886441984. Throughput: 0: 5007.9. Samples: 886436088. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:35,667][25689] Avg episode reward: [(0, '0.364')] [2022-07-10 19:46:36,280][26022] Updated weights on worker 0-0, policy_version 865671 (0.00093) [2022-07-10 19:46:38,368][26022] Updated weights on worker 0-0, policy_version 865681 (0.00091) [2022-07-10 19:46:39,972][26022] Updated weights on worker 0-0, policy_version 865691 (0.00085) [2022-07-10 19:46:40,715][25689] Fps is (10 sec: 5495.8, 60 sec: 5509.9, 300 sec: 5525.5). Total num frames: 886469632. Throughput: 0: 5844.7. Samples: 886469696. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:40,715][25689] Avg episode reward: [(0, '-0.251')] [2022-07-10 19:46:41,877][26022] Updated weights on worker 0-0, policy_version 865701 (0.00085) [2022-07-10 19:46:43,871][26022] Updated weights on worker 0-0, policy_version 865711 (0.00093) [2022-07-10 19:46:45,451][26022] Updated weights on worker 0-0, policy_version 865721 (0.00088) [2022-07-10 19:46:45,718][25689] Fps is (10 sec: 5705.0, 60 sec: 5562.4, 300 sec: 5532.7). Total num frames: 886499328. Throughput: 0: 5854.9. Samples: 886503464. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:45,719][25689] Avg episode reward: [(0, '-1.110')] [2022-07-10 19:46:47,454][26022] Updated weights on worker 0-0, policy_version 865731 (0.00085) [2022-07-10 19:46:49,303][26022] Updated weights on worker 0-0, policy_version 865741 (0.00086) [2022-07-10 19:46:50,805][25689] Fps is (10 sec: 5784.5, 60 sec: 5554.5, 300 sec: 5531.5). Total num frames: 886528000. Throughput: 0: 5004.6. Samples: 886520320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:50,806][25689] Avg episode reward: [(0, '-0.556')] [2022-07-10 19:46:50,891][26022] Updated weights on worker 0-0, policy_version 865751 (0.00060) [2022-07-10 19:46:53,034][26022] Updated weights on worker 0-0, policy_version 865761 (0.00105) [2022-07-10 19:46:54,762][26022] Updated weights on worker 0-0, policy_version 865771 (0.00084) [2022-07-10 19:46:55,885][25689] Fps is (10 sec: 5438.8, 60 sec: 5533.2, 300 sec: 5528.0). Total num frames: 886554624. Throughput: 0: 5837.0. Samples: 886554104. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:46:55,886][25689] Avg episode reward: [(0, '-0.773')] [2022-07-10 19:46:56,452][26022] Updated weights on worker 0-0, policy_version 865781 (0.00090) [2022-07-10 19:46:58,393][26022] Updated weights on worker 0-0, policy_version 865791 (0.00096) [2022-07-10 19:46:59,827][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:46:59,840][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000865799_886578176.pth [2022-07-10 19:46:59,840][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000863852_884584448.pth [2022-07-10 19:47:00,156][26022] Updated weights on worker 0-0, policy_version 865801 (0.00090) [2022-07-10 19:47:00,948][25689] Fps is (10 sec: 5552.7, 60 sec: 5548.3, 300 sec: 5538.5). Total num frames: 886584320. Throughput: 0: 5829.2. Samples: 886587640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:00,949][25689] Avg episode reward: [(0, '-0.805')] [2022-07-10 19:47:02,227][26022] Updated weights on worker 0-0, policy_version 865811 (0.00090) [2022-07-10 19:47:04,095][26022] Updated weights on worker 0-0, policy_version 865821 (0.00082) [2022-07-10 19:47:06,007][25689] Fps is (10 sec: 5463.3, 60 sec: 5566.3, 300 sec: 5534.8). Total num frames: 886609920. Throughput: 0: 5706.0. Samples: 886619230. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:06,007][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 19:47:06,121][26022] Updated weights on worker 0-0, policy_version 865831 (0.00084) [2022-07-10 19:47:07,701][26022] Updated weights on worker 0-0, policy_version 865841 (0.00086) [2022-07-10 19:47:09,637][26022] Updated weights on worker 0-0, policy_version 865851 (0.00084) [2022-07-10 19:47:11,047][25689] Fps is (10 sec: 5374.2, 60 sec: 5565.0, 300 sec: 5537.7). Total num frames: 886638592. Throughput: 0: 5715.7. Samples: 886636014. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:11,047][25689] Avg episode reward: [(0, '-1.545')] [2022-07-10 19:47:11,578][26022] Updated weights on worker 0-0, policy_version 865861 (0.00092) [2022-07-10 19:47:13,343][26022] Updated weights on worker 0-0, policy_version 865871 (0.00089) [2022-07-10 19:47:15,300][26022] Updated weights on worker 0-0, policy_version 865881 (0.00093) [2022-07-10 19:47:16,052][25689] Fps is (10 sec: 5606.6, 60 sec: 5566.2, 300 sec: 5532.1). Total num frames: 886666240. Throughput: 0: 5723.5. Samples: 886669528. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:16,053][25689] Avg episode reward: [(0, '-0.009')] [2022-07-10 19:47:17,138][26022] Updated weights on worker 0-0, policy_version 865891 (0.00088) [2022-07-10 19:47:19,031][26022] Updated weights on worker 0-0, policy_version 865901 (0.00095) [2022-07-10 19:47:20,673][26022] Updated weights on worker 0-0, policy_version 865911 (0.00078) [2022-07-10 19:47:21,125][25689] Fps is (10 sec: 5588.4, 60 sec: 5548.6, 300 sec: 5531.9). Total num frames: 886694912. Throughput: 0: 5718.5. Samples: 886703020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:21,125][25689] Avg episode reward: [(0, '-1.740')] [2022-07-10 19:47:22,574][26022] Updated weights on worker 0-0, policy_version 865921 (0.00089) [2022-07-10 19:47:24,531][26022] Updated weights on worker 0-0, policy_version 865931 (0.00088) [2022-07-10 19:47:26,194][25689] Fps is (10 sec: 5654.3, 60 sec: 5562.7, 300 sec: 5534.8). Total num frames: 886723584. Throughput: 0: 4989.5. Samples: 886719952. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:26,195][25689] Avg episode reward: [(0, '-2.073')] [2022-07-10 19:47:26,195][26022] Updated weights on worker 0-0, policy_version 865941 (0.00093) [2022-07-10 19:47:28,179][26022] Updated weights on worker 0-0, policy_version 865951 (0.00088) [2022-07-10 19:47:29,872][26022] Updated weights on worker 0-0, policy_version 865961 (0.00095) [2022-07-10 19:47:31,208][25689] Fps is (10 sec: 5483.9, 60 sec: 5546.6, 300 sec: 5529.5). Total num frames: 886750208. Throughput: 0: 5809.7. Samples: 886753146. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:31,209][25689] Avg episode reward: [(0, '-1.854')] [2022-07-10 19:47:31,842][26022] Updated weights on worker 0-0, policy_version 865971 (0.00094) [2022-07-10 19:47:33,335][26022] Updated weights on worker 0-0, policy_version 865981 (0.00098) [2022-07-10 19:47:35,678][26022] Updated weights on worker 0-0, policy_version 865991 (0.00089) [2022-07-10 19:47:36,226][25689] Fps is (10 sec: 5613.9, 60 sec: 5580.0, 300 sec: 5537.9). Total num frames: 886779904. Throughput: 0: 5831.4. Samples: 886787170. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:36,228][25689] Avg episode reward: [(0, '-1.522')] [2022-07-10 19:47:37,029][26022] Updated weights on worker 0-0, policy_version 866001 (0.00093) [2022-07-10 19:47:39,060][26022] Updated weights on worker 0-0, policy_version 866011 (0.00085) [2022-07-10 19:47:40,832][26022] Updated weights on worker 0-0, policy_version 866021 (0.00085) [2022-07-10 19:47:41,283][25689] Fps is (10 sec: 5590.3, 60 sec: 5562.3, 300 sec: 5531.2). Total num frames: 886806528. Throughput: 0: 5009.2. Samples: 886803996. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:41,283][25689] Avg episode reward: [(0, '-1.527')] [2022-07-10 19:47:42,831][26022] Updated weights on worker 0-0, policy_version 866031 (0.00086) [2022-07-10 19:47:44,701][26022] Updated weights on worker 0-0, policy_version 866041 (0.00085) [2022-07-10 19:47:46,293][25689] Fps is (10 sec: 5492.7, 60 sec: 5544.7, 300 sec: 5532.8). Total num frames: 886835200. Throughput: 0: 5833.4. Samples: 886837200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:46,294][25689] Avg episode reward: [(0, '-1.188')] [2022-07-10 19:47:46,396][26022] Updated weights on worker 0-0, policy_version 866051 (0.00086) [2022-07-10 19:47:48,255][26022] Updated weights on worker 0-0, policy_version 866061 (0.00103) [2022-07-10 19:47:50,151][26022] Updated weights on worker 0-0, policy_version 866071 (0.00088) [2022-07-10 19:47:51,316][25689] Fps is (10 sec: 5613.3, 60 sec: 5533.7, 300 sec: 5529.0). Total num frames: 886862848. Throughput: 0: 5848.3. Samples: 886870744. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:51,317][25689] Avg episode reward: [(0, '-0.253')] [2022-07-10 19:47:51,920][26022] Updated weights on worker 0-0, policy_version 866081 (0.00089) [2022-07-10 19:47:53,794][26022] Updated weights on worker 0-0, policy_version 866091 (0.00090) [2022-07-10 19:47:55,710][26022] Updated weights on worker 0-0, policy_version 866101 (0.00087) [2022-07-10 19:47:56,324][25689] Fps is (10 sec: 5512.9, 60 sec: 5557.3, 300 sec: 5533.8). Total num frames: 886890496. Throughput: 0: 4985.7. Samples: 886887370. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:47:56,324][25689] Avg episode reward: [(0, '0.315')] [2022-07-10 19:47:57,539][26022] Updated weights on worker 0-0, policy_version 866111 (0.00090) [2022-07-10 19:47:59,415][26022] Updated weights on worker 0-0, policy_version 866121 (0.00095) [2022-07-10 19:48:01,200][26022] Updated weights on worker 0-0, policy_version 866131 (0.00098) [2022-07-10 19:48:01,379][25689] Fps is (10 sec: 5495.2, 60 sec: 5524.1, 300 sec: 5536.5). Total num frames: 886918144. Throughput: 0: 5779.6. Samples: 886920142. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:48:01,379][25689] Avg episode reward: [(0, '0.131')] [2022-07-10 19:48:03,513][26022] Updated weights on worker 0-0, policy_version 866141 (0.00088) [2022-07-10 19:48:05,402][26022] Updated weights on worker 0-0, policy_version 866151 (0.00082) [2022-07-10 19:48:06,412][25689] Fps is (10 sec: 5278.1, 60 sec: 5526.4, 300 sec: 5522.5). Total num frames: 886943744. Throughput: 0: 5673.4. Samples: 886951342. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:48:06,412][25689] Avg episode reward: [(0, '0.173')] [2022-07-10 19:48:07,371][26022] Updated weights on worker 0-0, policy_version 866161 (0.00088) [2022-07-10 19:48:09,030][26022] Updated weights on worker 0-0, policy_version 866171 (0.00096) [2022-07-10 19:48:10,876][26022] Updated weights on worker 0-0, policy_version 866181 (0.00085) [2022-07-10 19:48:11,435][25689] Fps is (10 sec: 5397.0, 60 sec: 5528.0, 300 sec: 5532.7). Total num frames: 886972416. Throughput: 0: 4827.1. Samples: 886967858. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 19:48:11,435][25689] Avg episode reward: [(0, '0.876')] [2022-07-10 19:48:12,689][26022] Updated weights on worker 0-0, policy_version 866191 (0.00086) [2022-07-10 19:48:14,567][26022] Updated weights on worker 0-0, policy_version 866201 (0.00354) [2022-07-10 19:48:16,447][25689] Fps is (10 sec: 5510.6, 60 sec: 5510.4, 300 sec: 5523.0). Total num frames: 886999040. Throughput: 0: 5670.3. Samples: 887001474. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:16,447][25689] Avg episode reward: [(0, '0.943')] [2022-07-10 19:48:16,522][26022] Updated weights on worker 0-0, policy_version 866211 (0.00093) [2022-07-10 19:48:18,175][26022] Updated weights on worker 0-0, policy_version 866221 (0.00086) [2022-07-10 19:48:20,114][26022] Updated weights on worker 0-0, policy_version 866231 (0.00085) [2022-07-10 19:48:21,555][25689] Fps is (10 sec: 5565.2, 60 sec: 5524.1, 300 sec: 5524.6). Total num frames: 887028736. Throughput: 0: 5702.3. Samples: 887035192. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:21,555][25689] Avg episode reward: [(0, '0.796')] [2022-07-10 19:48:21,853][26022] Updated weights on worker 0-0, policy_version 866241 (0.00085) [2022-07-10 19:48:23,731][26022] Updated weights on worker 0-0, policy_version 866251 (0.00087) [2022-07-10 19:48:25,672][26022] Updated weights on worker 0-0, policy_version 866261 (0.00090) [2022-07-10 19:48:26,579][25689] Fps is (10 sec: 5659.5, 60 sec: 5511.3, 300 sec: 5528.4). Total num frames: 887056384. Throughput: 0: 4996.0. Samples: 887052096. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:26,579][25689] Avg episode reward: [(0, '0.639')] [2022-07-10 19:48:27,121][26022] Updated weights on worker 0-0, policy_version 866271 (0.00087) [2022-07-10 19:48:29,175][26022] Updated weights on worker 0-0, policy_version 866281 (0.00090) [2022-07-10 19:48:31,035][26022] Updated weights on worker 0-0, policy_version 866291 (0.00093) [2022-07-10 19:48:31,591][25689] Fps is (10 sec: 5407.7, 60 sec: 5511.5, 300 sec: 5524.9). Total num frames: 887083008. Throughput: 0: 5833.3. Samples: 887085434. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:31,591][25689] Avg episode reward: [(0, '0.551')] [2022-07-10 19:48:32,886][26022] Updated weights on worker 0-0, policy_version 866301 (0.00094) [2022-07-10 19:48:34,755][26022] Updated weights on worker 0-0, policy_version 866311 (0.00087) [2022-07-10 19:48:36,533][26022] Updated weights on worker 0-0, policy_version 866321 (0.00085) [2022-07-10 19:48:36,626][25689] Fps is (10 sec: 5605.5, 60 sec: 5509.9, 300 sec: 5529.3). Total num frames: 887112704. Throughput: 0: 5813.9. Samples: 887118796. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:36,627][25689] Avg episode reward: [(0, '0.910')] [2022-07-10 19:48:38,380][26022] Updated weights on worker 0-0, policy_version 866331 (0.00095) [2022-07-10 19:48:40,341][26022] Updated weights on worker 0-0, policy_version 866341 (0.00085) [2022-07-10 19:48:41,708][25689] Fps is (10 sec: 5769.1, 60 sec: 5541.5, 300 sec: 5531.8). Total num frames: 887141376. Throughput: 0: 4973.6. Samples: 887135426. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:41,709][25689] Avg episode reward: [(0, '0.514')] [2022-07-10 19:48:41,977][26022] Updated weights on worker 0-0, policy_version 866351 (0.00094) [2022-07-10 19:48:43,901][26022] Updated weights on worker 0-0, policy_version 866361 (0.00095) [2022-07-10 19:48:45,889][26022] Updated weights on worker 0-0, policy_version 866371 (0.00093) [2022-07-10 19:48:46,750][25689] Fps is (10 sec: 5461.9, 60 sec: 5504.8, 300 sec: 5524.2). Total num frames: 887168000. Throughput: 0: 5799.9. Samples: 887169088. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:46,751][25689] Avg episode reward: [(0, '0.367')] [2022-07-10 19:48:47,680][26022] Updated weights on worker 0-0, policy_version 866381 (0.00086) [2022-07-10 19:48:49,595][26022] Updated weights on worker 0-0, policy_version 866391 (0.00092) [2022-07-10 19:48:51,309][26022] Updated weights on worker 0-0, policy_version 866401 (0.00091) [2022-07-10 19:48:51,761][25689] Fps is (10 sec: 5398.4, 60 sec: 5505.8, 300 sec: 5527.6). Total num frames: 887195648. Throughput: 0: 5790.4. Samples: 887202230. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:51,762][25689] Avg episode reward: [(0, '0.406')] [2022-07-10 19:48:53,317][26022] Updated weights on worker 0-0, policy_version 866411 (0.00089) [2022-07-10 19:48:55,223][26022] Updated weights on worker 0-0, policy_version 866421 (0.00090) [2022-07-10 19:48:56,791][25689] Fps is (10 sec: 5609.2, 60 sec: 5520.7, 300 sec: 5528.3). Total num frames: 887224320. Throughput: 0: 4956.2. Samples: 887218736. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:48:56,791][25689] Avg episode reward: [(0, '0.832')] [2022-07-10 19:48:56,801][26022] Updated weights on worker 0-0, policy_version 866431 (0.00712) [2022-07-10 19:48:58,869][26022] Updated weights on worker 0-0, policy_version 866441 (0.00104) [2022-07-10 19:48:59,979][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:48:59,990][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000866448_887242752.pth [2022-07-10 19:48:59,990][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000864502_885250048.pth [2022-07-10 19:49:00,678][26022] Updated weights on worker 0-0, policy_version 866451 (0.00091) [2022-07-10 19:49:01,866][25689] Fps is (10 sec: 5370.8, 60 sec: 5485.0, 300 sec: 5530.8). Total num frames: 887249920. Throughput: 0: 5781.0. Samples: 887251960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:01,867][25689] Avg episode reward: [(0, '0.688')] [2022-07-10 19:49:02,842][26022] Updated weights on worker 0-0, policy_version 866461 (0.00094) [2022-07-10 19:49:04,798][26022] Updated weights on worker 0-0, policy_version 866471 (0.00087) [2022-07-10 19:49:06,589][26022] Updated weights on worker 0-0, policy_version 866481 (0.00084) [2022-07-10 19:49:06,896][25689] Fps is (10 sec: 5269.1, 60 sec: 5519.2, 300 sec: 5530.4). Total num frames: 887277568. Throughput: 0: 5663.7. Samples: 887283190. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:06,897][25689] Avg episode reward: [(0, '0.528')] [2022-07-10 19:49:08,440][26022] Updated weights on worker 0-0, policy_version 866491 (0.00093) [2022-07-10 19:49:10,387][26022] Updated weights on worker 0-0, policy_version 866501 (0.00083) [2022-07-10 19:49:11,901][25689] Fps is (10 sec: 5510.6, 60 sec: 5503.9, 300 sec: 5530.5). Total num frames: 887305216. Throughput: 0: 4842.0. Samples: 887299746. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:11,901][25689] Avg episode reward: [(0, '0.688')] [2022-07-10 19:49:12,128][26022] Updated weights on worker 0-0, policy_version 866511 (0.00085) [2022-07-10 19:49:14,042][26022] Updated weights on worker 0-0, policy_version 866521 (0.00081) [2022-07-10 19:49:15,939][26022] Updated weights on worker 0-0, policy_version 866531 (0.00085) [2022-07-10 19:49:16,909][25689] Fps is (10 sec: 5420.2, 60 sec: 5504.2, 300 sec: 5524.3). Total num frames: 887331840. Throughput: 0: 5678.5. Samples: 887332978. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:16,910][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 19:49:17,728][26022] Updated weights on worker 0-0, policy_version 866541 (0.00091) [2022-07-10 19:49:19,546][26022] Updated weights on worker 0-0, policy_version 866551 (0.00097) [2022-07-10 19:49:21,215][26022] Updated weights on worker 0-0, policy_version 866561 (0.00086) [2022-07-10 19:49:21,954][25689] Fps is (10 sec: 5602.3, 60 sec: 5510.0, 300 sec: 5530.5). Total num frames: 887361536. Throughput: 0: 5697.1. Samples: 887366402. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:21,954][25689] Avg episode reward: [(0, '0.402')] [2022-07-10 19:49:23,109][26022] Updated weights on worker 0-0, policy_version 866571 (0.00078) [2022-07-10 19:49:25,028][26022] Updated weights on worker 0-0, policy_version 866581 (0.00094) [2022-07-10 19:49:26,759][26022] Updated weights on worker 0-0, policy_version 866591 (0.00086) [2022-07-10 19:49:26,962][25689] Fps is (10 sec: 5806.4, 60 sec: 5528.5, 300 sec: 5534.1). Total num frames: 887390208. Throughput: 0: 4997.2. Samples: 887383460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:26,962][25689] Avg episode reward: [(0, '0.181')] [2022-07-10 19:49:28,692][26022] Updated weights on worker 0-0, policy_version 866601 (0.00087) [2022-07-10 19:49:30,407][26022] Updated weights on worker 0-0, policy_version 866611 (0.00085) [2022-07-10 19:49:31,978][25689] Fps is (10 sec: 5618.7, 60 sec: 5545.1, 300 sec: 5530.6). Total num frames: 887417856. Throughput: 0: 5840.0. Samples: 887416994. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:31,978][25689] Avg episode reward: [(0, '0.500')] [2022-07-10 19:49:32,344][26022] Updated weights on worker 0-0, policy_version 866621 (0.00093) [2022-07-10 19:49:34,211][26022] Updated weights on worker 0-0, policy_version 866631 (0.00094) [2022-07-10 19:49:35,995][26022] Updated weights on worker 0-0, policy_version 866641 (0.00087) [2022-07-10 19:49:36,991][25689] Fps is (10 sec: 5513.6, 60 sec: 5513.2, 300 sec: 5534.6). Total num frames: 887445504. Throughput: 0: 5854.0. Samples: 887450534. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:36,991][25689] Avg episode reward: [(0, '-0.070')] [2022-07-10 19:49:37,918][26022] Updated weights on worker 0-0, policy_version 866651 (0.00085) [2022-07-10 19:49:39,669][26022] Updated weights on worker 0-0, policy_version 866661 (0.00088) [2022-07-10 19:49:41,535][26022] Updated weights on worker 0-0, policy_version 866671 (0.00090) [2022-07-10 19:49:42,105][25689] Fps is (10 sec: 5561.1, 60 sec: 5510.2, 300 sec: 5533.9). Total num frames: 887474176. Throughput: 0: 5016.4. Samples: 887467488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:42,106][25689] Avg episode reward: [(0, '-1.071')] [2022-07-10 19:49:43,276][26022] Updated weights on worker 0-0, policy_version 866681 (0.00091) [2022-07-10 19:49:45,060][26022] Updated weights on worker 0-0, policy_version 866691 (0.00088) [2022-07-10 19:49:47,112][25689] Fps is (10 sec: 5463.4, 60 sec: 5513.4, 300 sec: 5530.4). Total num frames: 887500800. Throughput: 0: 5845.9. Samples: 887501256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:47,112][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 19:49:47,166][26022] Updated weights on worker 0-0, policy_version 866701 (0.00091) [2022-07-10 19:49:48,787][26022] Updated weights on worker 0-0, policy_version 866711 (0.00585) [2022-07-10 19:49:50,711][26022] Updated weights on worker 0-0, policy_version 866721 (0.00083) [2022-07-10 19:49:52,206][25689] Fps is (10 sec: 5575.6, 60 sec: 5539.7, 300 sec: 5528.7). Total num frames: 887530496. Throughput: 0: 5821.6. Samples: 887534756. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:52,207][25689] Avg episode reward: [(0, '-2.753')] [2022-07-10 19:49:52,383][26022] Updated weights on worker 0-0, policy_version 866731 (0.00088) [2022-07-10 19:49:54,286][26022] Updated weights on worker 0-0, policy_version 866741 (0.00089) [2022-07-10 19:49:56,215][26022] Updated weights on worker 0-0, policy_version 866751 (0.00088) [2022-07-10 19:49:57,213][25689] Fps is (10 sec: 5676.8, 60 sec: 5524.8, 300 sec: 5533.7). Total num frames: 887558144. Throughput: 0: 5813.2. Samples: 887568092. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:49:57,214][25689] Avg episode reward: [(0, '-2.960')] [2022-07-10 19:49:58,021][26022] Updated weights on worker 0-0, policy_version 866761 (0.00089) [2022-07-10 19:49:59,831][26022] Updated weights on worker 0-0, policy_version 866771 (0.00089) [2022-07-10 19:50:02,047][26022] Updated weights on worker 0-0, policy_version 866781 (0.00117) [2022-07-10 19:50:02,290][25689] Fps is (10 sec: 5381.8, 60 sec: 5541.6, 300 sec: 5532.6). Total num frames: 887584768. Throughput: 0: 5819.3. Samples: 887584952. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:02,291][25689] Avg episode reward: [(0, '-2.321')] [2022-07-10 19:50:03,875][26022] Updated weights on worker 0-0, policy_version 866791 (0.00083) [2022-07-10 19:50:05,684][26022] Updated weights on worker 0-0, policy_version 866801 (0.00092) [2022-07-10 19:50:07,311][25689] Fps is (10 sec: 5374.9, 60 sec: 5542.5, 300 sec: 5532.3). Total num frames: 887612416. Throughput: 0: 5708.7. Samples: 887616564. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:07,311][25689] Avg episode reward: [(0, '-1.709')] [2022-07-10 19:50:07,629][26022] Updated weights on worker 0-0, policy_version 866811 (0.00904) [2022-07-10 19:50:09,339][26022] Updated weights on worker 0-0, policy_version 866821 (0.00090) [2022-07-10 19:50:11,262][26022] Updated weights on worker 0-0, policy_version 866831 (0.00085) [2022-07-10 19:50:12,324][25689] Fps is (10 sec: 5613.1, 60 sec: 5558.7, 300 sec: 5536.5). Total num frames: 887641088. Throughput: 0: 5745.8. Samples: 887650350. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:12,325][25689] Avg episode reward: [(0, '-1.923')] [2022-07-10 19:50:12,971][26022] Updated weights on worker 0-0, policy_version 866841 (0.00087) [2022-07-10 19:50:14,935][26022] Updated weights on worker 0-0, policy_version 866851 (0.00092) [2022-07-10 19:50:16,723][26022] Updated weights on worker 0-0, policy_version 866861 (0.00092) [2022-07-10 19:50:17,336][25689] Fps is (10 sec: 5719.7, 60 sec: 5592.2, 300 sec: 5538.3). Total num frames: 887669760. Throughput: 0: 4923.2. Samples: 887667162. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:17,338][25689] Avg episode reward: [(0, '-1.722')] [2022-07-10 19:50:18,407][26022] Updated weights on worker 0-0, policy_version 866871 (0.00108) [2022-07-10 19:50:20,223][26022] Updated weights on worker 0-0, policy_version 866881 (0.00084) [2022-07-10 19:50:22,096][26022] Updated weights on worker 0-0, policy_version 866891 (0.00086) [2022-07-10 19:50:22,367][25689] Fps is (10 sec: 5607.9, 60 sec: 5559.6, 300 sec: 5538.0). Total num frames: 887697408. Throughput: 0: 5787.9. Samples: 887701154. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:22,368][25689] Avg episode reward: [(0, '-0.516')] [2022-07-10 19:50:23,914][26022] Updated weights on worker 0-0, policy_version 866901 (0.00088) [2022-07-10 19:50:25,731][26022] Updated weights on worker 0-0, policy_version 866911 (0.00093) [2022-07-10 19:50:27,407][25689] Fps is (10 sec: 5592.3, 60 sec: 5556.6, 300 sec: 5541.0). Total num frames: 887726080. Throughput: 0: 5887.1. Samples: 887734874. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:27,408][25689] Avg episode reward: [(0, '-1.624')] [2022-07-10 19:50:27,586][26022] Updated weights on worker 0-0, policy_version 866921 (0.00084) [2022-07-10 19:50:29,331][26022] Updated weights on worker 0-0, policy_version 866931 (0.00088) [2022-07-10 19:50:31,363][26022] Updated weights on worker 0-0, policy_version 866941 (0.00086) [2022-07-10 19:50:32,409][25689] Fps is (10 sec: 5608.3, 60 sec: 5557.9, 300 sec: 5537.8). Total num frames: 887753728. Throughput: 0: 5054.1. Samples: 887751862. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:32,410][25689] Avg episode reward: [(0, '-1.726')] [2022-07-10 19:50:32,828][26022] Updated weights on worker 0-0, policy_version 866951 (0.00086) [2022-07-10 19:50:34,961][26022] Updated weights on worker 0-0, policy_version 866961 (0.00086) [2022-07-10 19:50:36,486][26022] Updated weights on worker 0-0, policy_version 866971 (0.00092) [2022-07-10 19:50:37,436][25689] Fps is (10 sec: 5616.1, 60 sec: 5573.6, 300 sec: 5538.4). Total num frames: 887782400. Throughput: 0: 5872.9. Samples: 887785204. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:37,436][25689] Avg episode reward: [(0, '-1.353')] [2022-07-10 19:50:38,715][26022] Updated weights on worker 0-0, policy_version 866981 (0.00090) [2022-07-10 19:50:40,277][26022] Updated weights on worker 0-0, policy_version 866991 (0.00095) [2022-07-10 19:50:42,403][26022] Updated weights on worker 0-0, policy_version 867001 (0.00094) [2022-07-10 19:50:42,486][25689] Fps is (10 sec: 5589.1, 60 sec: 5562.5, 300 sec: 5541.3). Total num frames: 887810048. Throughput: 0: 5837.7. Samples: 887818602. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:42,486][25689] Avg episode reward: [(0, '-1.839')] [2022-07-10 19:50:44,154][26022] Updated weights on worker 0-0, policy_version 867011 (0.00085) [2022-07-10 19:50:45,948][26022] Updated weights on worker 0-0, policy_version 867021 (0.00081) [2022-07-10 19:50:47,492][25689] Fps is (10 sec: 5498.5, 60 sec: 5579.6, 300 sec: 5537.7). Total num frames: 887837696. Throughput: 0: 5007.1. Samples: 887835440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:47,495][25689] Avg episode reward: [(0, '-2.223')] [2022-07-10 19:50:47,821][26022] Updated weights on worker 0-0, policy_version 867031 (0.00088) [2022-07-10 19:50:49,490][26022] Updated weights on worker 0-0, policy_version 867041 (0.00091) [2022-07-10 19:50:51,459][26022] Updated weights on worker 0-0, policy_version 867051 (0.00086) [2022-07-10 19:50:52,499][25689] Fps is (10 sec: 5624.5, 60 sec: 5570.6, 300 sec: 5541.6). Total num frames: 887866368. Throughput: 0: 5847.5. Samples: 887869338. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:52,500][25689] Avg episode reward: [(0, '-0.932')] [2022-07-10 19:50:53,161][26022] Updated weights on worker 0-0, policy_version 867061 (0.00084) [2022-07-10 19:50:54,892][26022] Updated weights on worker 0-0, policy_version 867071 (0.00096) [2022-07-10 19:50:57,011][26022] Updated weights on worker 0-0, policy_version 867081 (0.00095) [2022-07-10 19:50:57,511][25689] Fps is (10 sec: 5519.0, 60 sec: 5553.2, 300 sec: 5535.3). Total num frames: 887892992. Throughput: 0: 5855.6. Samples: 887902758. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:50:57,512][25689] Avg episode reward: [(0, '-1.026')] [2022-07-10 19:50:58,710][26022] Updated weights on worker 0-0, policy_version 867091 (0.00089) [2022-07-10 19:51:00,028][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:51:00,040][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000867098_887908352.pth [2022-07-10 19:51:00,040][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000865149_885912576.pth [2022-07-10 19:51:00,433][26022] Updated weights on worker 0-0, policy_version 867101 (0.01113) [2022-07-10 19:51:02,568][25689] Fps is (10 sec: 5288.7, 60 sec: 5555.1, 300 sec: 5542.5). Total num frames: 887919616. Throughput: 0: 5021.9. Samples: 887919450. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:02,568][25689] Avg episode reward: [(0, '-1.500')] [2022-07-10 19:51:02,693][26022] Updated weights on worker 0-0, policy_version 867111 (0.00085) [2022-07-10 19:51:04,590][26022] Updated weights on worker 0-0, policy_version 867121 (0.00088) [2022-07-10 19:51:06,575][26022] Updated weights on worker 0-0, policy_version 867131 (0.00091) [2022-07-10 19:51:07,572][25689] Fps is (10 sec: 5597.8, 60 sec: 5590.6, 300 sec: 5546.3). Total num frames: 887949312. Throughput: 0: 5744.5. Samples: 887950790. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:07,573][25689] Avg episode reward: [(0, '-1.292')] [2022-07-10 19:51:08,371][26022] Updated weights on worker 0-0, policy_version 867141 (0.00099) [2022-07-10 19:51:10,222][26022] Updated weights on worker 0-0, policy_version 867151 (0.00088) [2022-07-10 19:51:12,041][26022] Updated weights on worker 0-0, policy_version 867161 (0.00095) [2022-07-10 19:51:12,578][25689] Fps is (10 sec: 5524.0, 60 sec: 5540.3, 300 sec: 5539.7). Total num frames: 887974912. Throughput: 0: 5715.5. Samples: 887984096. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:12,578][25689] Avg episode reward: [(0, '-2.999')] [2022-07-10 19:51:13,738][26022] Updated weights on worker 0-0, policy_version 867171 (0.00058) [2022-07-10 19:51:15,832][26022] Updated weights on worker 0-0, policy_version 867181 (0.00081) [2022-07-10 19:51:17,518][26022] Updated weights on worker 0-0, policy_version 867191 (0.00087) [2022-07-10 19:51:17,595][25689] Fps is (10 sec: 5414.8, 60 sec: 5539.8, 300 sec: 5537.1). Total num frames: 888003584. Throughput: 0: 4878.7. Samples: 888000742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:17,596][25689] Avg episode reward: [(0, '-3.099')] [2022-07-10 19:51:19,401][26022] Updated weights on worker 0-0, policy_version 867201 (0.00095) [2022-07-10 19:51:21,196][26022] Updated weights on worker 0-0, policy_version 867211 (0.00092) [2022-07-10 19:51:22,763][25689] Fps is (10 sec: 5630.3, 60 sec: 5544.2, 300 sec: 5538.2). Total num frames: 888032256. Throughput: 0: 5675.7. Samples: 888034072. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:22,763][25689] Avg episode reward: [(0, '-2.622')] [2022-07-10 19:51:23,077][26022] Updated weights on worker 0-0, policy_version 867221 (0.00089) [2022-07-10 19:51:24,671][26022] Updated weights on worker 0-0, policy_version 867231 (0.00092) [2022-07-10 19:51:26,751][26022] Updated weights on worker 0-0, policy_version 867241 (0.00096) [2022-07-10 19:51:27,771][25689] Fps is (10 sec: 5534.5, 60 sec: 5530.2, 300 sec: 5538.4). Total num frames: 888059904. Throughput: 0: 5801.2. Samples: 888067968. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:27,772][25689] Avg episode reward: [(0, '-2.177')] [2022-07-10 19:51:28,448][26022] Updated weights on worker 0-0, policy_version 867251 (0.00098) [2022-07-10 19:51:30,483][26022] Updated weights on worker 0-0, policy_version 867261 (0.00087) [2022-07-10 19:51:32,075][26022] Updated weights on worker 0-0, policy_version 867271 (0.00094) [2022-07-10 19:51:32,785][25689] Fps is (10 sec: 5619.7, 60 sec: 5546.1, 300 sec: 5541.9). Total num frames: 888088576. Throughput: 0: 4977.1. Samples: 888084668. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:32,785][25689] Avg episode reward: [(0, '-1.926')] [2022-07-10 19:51:33,966][26022] Updated weights on worker 0-0, policy_version 867281 (0.00091) [2022-07-10 19:51:35,737][26022] Updated weights on worker 0-0, policy_version 867291 (0.00095) [2022-07-10 19:51:37,807][25689] Fps is (10 sec: 5612.3, 60 sec: 5529.5, 300 sec: 5542.4). Total num frames: 888116224. Throughput: 0: 5803.1. Samples: 888118030. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:37,807][25689] Avg episode reward: [(0, '-0.787')] [2022-07-10 19:51:37,808][26022] Updated weights on worker 0-0, policy_version 867301 (0.00111) [2022-07-10 19:51:39,496][26022] Updated weights on worker 0-0, policy_version 867311 (0.00085) [2022-07-10 19:51:41,426][26022] Updated weights on worker 0-0, policy_version 867321 (0.00091) [2022-07-10 19:51:42,911][25689] Fps is (10 sec: 5663.1, 60 sec: 5558.5, 300 sec: 5540.5). Total num frames: 888145920. Throughput: 0: 5837.7. Samples: 888151690. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:42,913][25689] Avg episode reward: [(0, '0.005')] [2022-07-10 19:51:43,137][26022] Updated weights on worker 0-0, policy_version 867331 (0.00091) [2022-07-10 19:51:45,303][26022] Updated weights on worker 0-0, policy_version 867341 (0.00087) [2022-07-10 19:51:46,879][26022] Updated weights on worker 0-0, policy_version 867351 (0.00083) [2022-07-10 19:51:47,929][25689] Fps is (10 sec: 5664.9, 60 sec: 5557.4, 300 sec: 5538.3). Total num frames: 888173568. Throughput: 0: 4987.1. Samples: 888168498. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:47,930][25689] Avg episode reward: [(0, '0.005')] [2022-07-10 19:51:48,863][26022] Updated weights on worker 0-0, policy_version 867361 (0.00089) [2022-07-10 19:51:50,463][26022] Updated weights on worker 0-0, policy_version 867371 (0.00093) [2022-07-10 19:51:52,451][26022] Updated weights on worker 0-0, policy_version 867381 (0.00094) [2022-07-10 19:51:52,999][25689] Fps is (10 sec: 5379.7, 60 sec: 5517.8, 300 sec: 5538.5). Total num frames: 888200192. Throughput: 0: 5808.7. Samples: 888202086. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:52,999][25689] Avg episode reward: [(0, '0.051')] [2022-07-10 19:51:54,285][26022] Updated weights on worker 0-0, policy_version 867391 (0.00086) [2022-07-10 19:51:56,115][26022] Updated weights on worker 0-0, policy_version 867401 (0.00089) [2022-07-10 19:51:57,924][26022] Updated weights on worker 0-0, policy_version 867411 (0.00094) [2022-07-10 19:51:58,023][25689] Fps is (10 sec: 5478.2, 60 sec: 5550.5, 300 sec: 5535.8). Total num frames: 888228864. Throughput: 0: 5801.3. Samples: 888235312. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:51:58,024][25689] Avg episode reward: [(0, '-0.315')] [2022-07-10 19:51:59,818][26022] Updated weights on worker 0-0, policy_version 867421 (0.00091) [2022-07-10 19:52:01,627][26022] Updated weights on worker 0-0, policy_version 867431 (0.00092) [2022-07-10 19:52:03,083][25689] Fps is (10 sec: 5381.8, 60 sec: 5533.2, 300 sec: 5535.8). Total num frames: 888254464. Throughput: 0: 4974.6. Samples: 888252042. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:03,084][25689] Avg episode reward: [(0, '-0.192')] [2022-07-10 19:52:03,857][26022] Updated weights on worker 0-0, policy_version 867441 (0.00088) [2022-07-10 19:52:05,713][26022] Updated weights on worker 0-0, policy_version 867451 (0.00080) [2022-07-10 19:52:07,696][26022] Updated weights on worker 0-0, policy_version 867461 (0.00087) [2022-07-10 19:52:08,100][25689] Fps is (10 sec: 5284.1, 60 sec: 5498.3, 300 sec: 5532.7). Total num frames: 888282112. Throughput: 0: 5707.0. Samples: 888283612. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:08,101][25689] Avg episode reward: [(0, '-0.677')] [2022-07-10 19:52:09,421][26022] Updated weights on worker 0-0, policy_version 867471 (0.00089) [2022-07-10 19:52:11,203][26022] Updated weights on worker 0-0, policy_version 867481 (0.00094) [2022-07-10 19:52:13,119][25689] Fps is (10 sec: 5509.6, 60 sec: 5530.9, 300 sec: 5532.5). Total num frames: 888309760. Throughput: 0: 5704.4. Samples: 888316862. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:13,120][25689] Avg episode reward: [(0, '-0.262')] [2022-07-10 19:52:13,140][26022] Updated weights on worker 0-0, policy_version 867491 (0.00087) [2022-07-10 19:52:14,936][26022] Updated weights on worker 0-0, policy_version 867501 (0.00098) [2022-07-10 19:52:16,727][26022] Updated weights on worker 0-0, policy_version 867511 (0.00086) [2022-07-10 19:52:18,148][25689] Fps is (10 sec: 5605.1, 60 sec: 5529.9, 300 sec: 5533.3). Total num frames: 888338432. Throughput: 0: 4886.1. Samples: 888333644. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:18,149][25689] Avg episode reward: [(0, '-0.370')] [2022-07-10 19:52:18,710][26022] Updated weights on worker 0-0, policy_version 867521 (0.00092) [2022-07-10 19:52:20,362][26022] Updated weights on worker 0-0, policy_version 867531 (0.00071) [2022-07-10 19:52:22,457][26022] Updated weights on worker 0-0, policy_version 867541 (0.00088) [2022-07-10 19:52:23,232][25689] Fps is (10 sec: 5670.8, 60 sec: 5537.5, 300 sec: 5533.0). Total num frames: 888367104. Throughput: 0: 5719.0. Samples: 888367270. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:23,232][25689] Avg episode reward: [(0, '-0.226')] [2022-07-10 19:52:24,055][26022] Updated weights on worker 0-0, policy_version 867551 (0.00096) [2022-07-10 19:52:26,009][26022] Updated weights on worker 0-0, policy_version 867561 (0.00087) [2022-07-10 19:52:27,769][26022] Updated weights on worker 0-0, policy_version 867571 (0.00091) [2022-07-10 19:52:28,253][25689] Fps is (10 sec: 5573.3, 60 sec: 5536.3, 300 sec: 5536.3). Total num frames: 888394752. Throughput: 0: 5800.5. Samples: 888400510. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:28,254][25689] Avg episode reward: [(0, '-0.344')] [2022-07-10 19:52:29,675][26022] Updated weights on worker 0-0, policy_version 867581 (0.00898) [2022-07-10 19:52:31,543][26022] Updated weights on worker 0-0, policy_version 867591 (0.00083) [2022-07-10 19:52:33,255][25689] Fps is (10 sec: 5619.1, 60 sec: 5537.4, 300 sec: 5533.2). Total num frames: 888423424. Throughput: 0: 4983.5. Samples: 888417206. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:33,256][25689] Avg episode reward: [(0, '-0.423')] [2022-07-10 19:52:33,256][26022] Updated weights on worker 0-0, policy_version 867601 (0.00087) [2022-07-10 19:52:35,186][26022] Updated weights on worker 0-0, policy_version 867611 (0.00085) [2022-07-10 19:52:36,907][26022] Updated weights on worker 0-0, policy_version 867621 (0.00093) [2022-07-10 19:52:38,358][25689] Fps is (10 sec: 5472.4, 60 sec: 5513.1, 300 sec: 5532.3). Total num frames: 888450048. Throughput: 0: 5784.5. Samples: 888450548. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:38,358][25689] Avg episode reward: [(0, '-0.600')] [2022-07-10 19:52:39,042][26022] Updated weights on worker 0-0, policy_version 867631 (0.00091) [2022-07-10 19:52:40,532][26022] Updated weights on worker 0-0, policy_version 867641 (0.00087) [2022-07-10 19:52:42,703][26022] Updated weights on worker 0-0, policy_version 867651 (0.00056) [2022-07-10 19:52:43,486][25689] Fps is (10 sec: 5404.6, 60 sec: 5494.0, 300 sec: 5530.1). Total num frames: 888478720. Throughput: 0: 5764.6. Samples: 888484028. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:43,488][25689] Avg episode reward: [(0, '-0.386')] [2022-07-10 19:52:44,388][26022] Updated weights on worker 0-0, policy_version 867661 (0.00093) [2022-07-10 19:52:46,215][26022] Updated weights on worker 0-0, policy_version 867671 (0.00098) [2022-07-10 19:52:48,055][26022] Updated weights on worker 0-0, policy_version 867681 (0.00093) [2022-07-10 19:52:48,539][25689] Fps is (10 sec: 5632.5, 60 sec: 5507.8, 300 sec: 5533.0). Total num frames: 888507392. Throughput: 0: 4933.3. Samples: 888500584. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:48,539][25689] Avg episode reward: [(0, '-0.031')] [2022-07-10 19:52:49,892][26022] Updated weights on worker 0-0, policy_version 867691 (0.00090) [2022-07-10 19:52:51,782][26022] Updated weights on worker 0-0, policy_version 867701 (0.00089) [2022-07-10 19:52:53,621][25689] Fps is (10 sec: 5556.8, 60 sec: 5523.5, 300 sec: 5531.6). Total num frames: 888535040. Throughput: 0: 5742.6. Samples: 888534164. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:53,622][25689] Avg episode reward: [(0, '-1.193')] [2022-07-10 19:52:53,840][26022] Updated weights on worker 0-0, policy_version 867711 (0.00085) [2022-07-10 19:52:55,463][26022] Updated weights on worker 0-0, policy_version 867721 (0.00091) [2022-07-10 19:52:57,541][26022] Updated weights on worker 0-0, policy_version 867731 (0.00084) [2022-07-10 19:52:58,633][25689] Fps is (10 sec: 5579.3, 60 sec: 5524.6, 300 sec: 5535.9). Total num frames: 888563712. Throughput: 0: 5773.3. Samples: 888567604. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:52:58,634][25689] Avg episode reward: [(0, '-1.929')] [2022-07-10 19:52:59,005][26022] Updated weights on worker 0-0, policy_version 867741 (0.00086) [2022-07-10 19:53:00,127][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:53:00,138][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000867746_888571904.pth [2022-07-10 19:53:00,138][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000865799_886578176.pth [2022-07-10 19:53:01,122][26022] Updated weights on worker 0-0, policy_version 867751 (0.00088) [2022-07-10 19:53:03,170][26022] Updated weights on worker 0-0, policy_version 867761 (0.00088) [2022-07-10 19:53:03,750][25689] Fps is (10 sec: 5358.4, 60 sec: 5519.5, 300 sec: 5534.3). Total num frames: 888589312. Throughput: 0: 5669.6. Samples: 888598916. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:03,750][25689] Avg episode reward: [(0, '-1.137')] [2022-07-10 19:53:05,048][26022] Updated weights on worker 0-0, policy_version 867771 (0.00090) [2022-07-10 19:53:06,879][26022] Updated weights on worker 0-0, policy_version 867781 (0.00088) [2022-07-10 19:53:08,786][25689] Fps is (10 sec: 5345.6, 60 sec: 5534.6, 300 sec: 5534.1). Total num frames: 888617984. Throughput: 0: 5690.9. Samples: 888615808. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:08,786][25689] Avg episode reward: [(0, '-1.490')] [2022-07-10 19:53:08,792][26022] Updated weights on worker 0-0, policy_version 867791 (0.00084) [2022-07-10 19:53:10,403][26022] Updated weights on worker 0-0, policy_version 867801 (0.00091) [2022-07-10 19:53:12,455][26022] Updated weights on worker 0-0, policy_version 867811 (0.00087) [2022-07-10 19:53:13,812][25689] Fps is (10 sec: 5596.9, 60 sec: 5534.0, 300 sec: 5537.3). Total num frames: 888645632. Throughput: 0: 5686.6. Samples: 888648982. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:13,813][25689] Avg episode reward: [(0, '-1.593')] [2022-07-10 19:53:14,147][26022] Updated weights on worker 0-0, policy_version 867821 (0.00093) [2022-07-10 19:53:16,305][26022] Updated weights on worker 0-0, policy_version 867831 (0.00089) [2022-07-10 19:53:17,823][26022] Updated weights on worker 0-0, policy_version 867841 (0.00089) [2022-07-10 19:53:18,896][25689] Fps is (10 sec: 5570.5, 60 sec: 5528.9, 300 sec: 5534.3). Total num frames: 888674304. Throughput: 0: 5696.6. Samples: 888683034. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:18,897][25689] Avg episode reward: [(0, '-1.537')] [2022-07-10 19:53:19,662][26022] Updated weights on worker 0-0, policy_version 867851 (0.00090) [2022-07-10 19:53:21,551][26022] Updated weights on worker 0-0, policy_version 867861 (0.00054) [2022-07-10 19:53:23,571][26022] Updated weights on worker 0-0, policy_version 867871 (0.00085) [2022-07-10 19:53:23,992][25689] Fps is (10 sec: 5532.5, 60 sec: 5510.9, 300 sec: 5532.9). Total num frames: 888701952. Throughput: 0: 4980.7. Samples: 888699738. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:23,993][25689] Avg episode reward: [(0, '-0.407')] [2022-07-10 19:53:25,128][26022] Updated weights on worker 0-0, policy_version 867881 (0.00087) [2022-07-10 19:53:27,206][26022] Updated weights on worker 0-0, policy_version 867891 (0.00094) [2022-07-10 19:53:28,778][26022] Updated weights on worker 0-0, policy_version 867901 (0.00090) [2022-07-10 19:53:29,025][25689] Fps is (10 sec: 5661.3, 60 sec: 5543.6, 300 sec: 5542.9). Total num frames: 888731648. Throughput: 0: 5796.1. Samples: 888733118. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:29,026][25689] Avg episode reward: [(0, '-0.319')] [2022-07-10 19:53:30,696][26022] Updated weights on worker 0-0, policy_version 867911 (0.00097) [2022-07-10 19:53:32,502][26022] Updated weights on worker 0-0, policy_version 867921 (0.00539) [2022-07-10 19:53:34,047][25689] Fps is (10 sec: 5703.2, 60 sec: 5524.9, 300 sec: 5536.2). Total num frames: 888759296. Throughput: 0: 5821.4. Samples: 888766776. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:34,048][25689] Avg episode reward: [(0, '-0.303')] [2022-07-10 19:53:34,309][26022] Updated weights on worker 0-0, policy_version 867931 (0.00084) [2022-07-10 19:53:36,386][26022] Updated weights on worker 0-0, policy_version 867941 (0.00088) [2022-07-10 19:53:38,003][26022] Updated weights on worker 0-0, policy_version 867951 (0.00093) [2022-07-10 19:53:39,054][25689] Fps is (10 sec: 5513.8, 60 sec: 5550.5, 300 sec: 5534.2). Total num frames: 888786944. Throughput: 0: 4994.5. Samples: 888783712. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:39,055][25689] Avg episode reward: [(0, '-0.313')] [2022-07-10 19:53:40,016][26022] Updated weights on worker 0-0, policy_version 867961 (0.00084) [2022-07-10 19:53:41,758][26022] Updated weights on worker 0-0, policy_version 867971 (0.00089) [2022-07-10 19:53:43,693][26022] Updated weights on worker 0-0, policy_version 867981 (0.00085) [2022-07-10 19:53:44,140][25689] Fps is (10 sec: 5580.1, 60 sec: 5554.4, 300 sec: 5540.3). Total num frames: 888815616. Throughput: 0: 5816.1. Samples: 888816920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:44,141][25689] Avg episode reward: [(0, '-0.229')] [2022-07-10 19:53:45,469][26022] Updated weights on worker 0-0, policy_version 867991 (0.00080) [2022-07-10 19:53:47,315][26022] Updated weights on worker 0-0, policy_version 868001 (0.00086) [2022-07-10 19:53:49,071][26022] Updated weights on worker 0-0, policy_version 868011 (0.00082) [2022-07-10 19:53:49,186][25689] Fps is (10 sec: 5558.9, 60 sec: 5538.2, 300 sec: 5539.6). Total num frames: 888843264. Throughput: 0: 5826.3. Samples: 888850578. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:49,186][25689] Avg episode reward: [(0, '-0.184')] [2022-07-10 19:53:50,993][26022] Updated weights on worker 0-0, policy_version 868021 (0.00094) [2022-07-10 19:53:52,735][26022] Updated weights on worker 0-0, policy_version 868031 (0.00087) [2022-07-10 19:53:54,203][25689] Fps is (10 sec: 5495.1, 60 sec: 5544.1, 300 sec: 5536.4). Total num frames: 888870912. Throughput: 0: 4974.9. Samples: 888867048. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:54,204][25689] Avg episode reward: [(0, '-0.149')] [2022-07-10 19:53:54,638][26022] Updated weights on worker 0-0, policy_version 868041 (0.00094) [2022-07-10 19:53:56,704][26022] Updated weights on worker 0-0, policy_version 868051 (0.00079) [2022-07-10 19:53:58,215][26022] Updated weights on worker 0-0, policy_version 868061 (0.00093) [2022-07-10 19:53:59,224][25689] Fps is (10 sec: 5610.7, 60 sec: 5543.3, 300 sec: 5547.7). Total num frames: 888899584. Throughput: 0: 5801.5. Samples: 888900726. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:53:59,225][25689] Avg episode reward: [(0, '-0.045')] [2022-07-10 19:54:00,253][26022] Updated weights on worker 0-0, policy_version 868071 (0.00089) [2022-07-10 19:54:02,426][26022] Updated weights on worker 0-0, policy_version 868081 (0.00120) [2022-07-10 19:54:04,301][25689] Fps is (10 sec: 5374.2, 60 sec: 5546.9, 300 sec: 5540.0). Total num frames: 888925184. Throughput: 0: 5705.2. Samples: 888931944. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:04,302][25689] Avg episode reward: [(0, '0.169')] [2022-07-10 19:54:04,304][26022] Updated weights on worker 0-0, policy_version 868091 (0.00102) [2022-07-10 19:54:06,080][26022] Updated weights on worker 0-0, policy_version 868101 (0.00090) [2022-07-10 19:54:07,735][26022] Updated weights on worker 0-0, policy_version 868111 (0.00085) [2022-07-10 19:54:09,330][25689] Fps is (10 sec: 5269.0, 60 sec: 5530.7, 300 sec: 5539.5). Total num frames: 888952832. Throughput: 0: 4870.6. Samples: 888948690. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:09,330][25689] Avg episode reward: [(0, '0.190')] [2022-07-10 19:54:09,693][26022] Updated weights on worker 0-0, policy_version 868121 (0.00089) [2022-07-10 19:54:11,616][26022] Updated weights on worker 0-0, policy_version 868131 (0.00085) [2022-07-10 19:54:13,266][26022] Updated weights on worker 0-0, policy_version 868141 (0.00085) [2022-07-10 19:54:14,381][25689] Fps is (10 sec: 5485.9, 60 sec: 5528.4, 300 sec: 5542.2). Total num frames: 888980480. Throughput: 0: 5699.2. Samples: 888982048. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:14,382][25689] Avg episode reward: [(0, '0.170')] [2022-07-10 19:54:15,540][26022] Updated weights on worker 0-0, policy_version 868151 (0.00085) [2022-07-10 19:54:16,832][26022] Updated weights on worker 0-0, policy_version 868161 (0.00091) [2022-07-10 19:54:19,191][26022] Updated weights on worker 0-0, policy_version 868171 (0.00115) [2022-07-10 19:54:19,387][25689] Fps is (10 sec: 5599.6, 60 sec: 5535.5, 300 sec: 5539.4). Total num frames: 889009152. Throughput: 0: 5685.0. Samples: 889015358. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:19,388][25689] Avg episode reward: [(0, '-0.420')] [2022-07-10 19:54:20,615][26022] Updated weights on worker 0-0, policy_version 868181 (0.00116) [2022-07-10 19:54:22,816][26022] Updated weights on worker 0-0, policy_version 868191 (0.00095) [2022-07-10 19:54:24,463][25689] Fps is (10 sec: 5586.0, 60 sec: 5537.3, 300 sec: 5534.7). Total num frames: 889036800. Throughput: 0: 4952.8. Samples: 889031802. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:24,464][25689] Avg episode reward: [(0, '-0.768')] [2022-07-10 19:54:24,532][26022] Updated weights on worker 0-0, policy_version 868201 (0.00085) [2022-07-10 19:54:26,503][26022] Updated weights on worker 0-0, policy_version 868211 (0.00087) [2022-07-10 19:54:28,171][26022] Updated weights on worker 0-0, policy_version 868221 (0.00096) [2022-07-10 19:54:29,478][25689] Fps is (10 sec: 5480.1, 60 sec: 5505.2, 300 sec: 5534.8). Total num frames: 889064448. Throughput: 0: 5786.8. Samples: 889065284. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:29,478][25689] Avg episode reward: [(0, '-1.309')] [2022-07-10 19:54:30,109][26022] Updated weights on worker 0-0, policy_version 868231 (0.00084) [2022-07-10 19:54:31,705][26022] Updated weights on worker 0-0, policy_version 868241 (0.00079) [2022-07-10 19:54:33,992][26022] Updated weights on worker 0-0, policy_version 868251 (0.00088) [2022-07-10 19:54:34,492][25689] Fps is (10 sec: 5514.0, 60 sec: 5505.9, 300 sec: 5534.7). Total num frames: 889092096. Throughput: 0: 5816.7. Samples: 889099026. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:34,492][25689] Avg episode reward: [(0, '-0.982')] [2022-07-10 19:54:35,554][26022] Updated weights on worker 0-0, policy_version 868261 (0.00081) [2022-07-10 19:54:37,448][26022] Updated weights on worker 0-0, policy_version 868271 (0.00092) [2022-07-10 19:54:39,111][26022] Updated weights on worker 0-0, policy_version 868281 (0.00090) [2022-07-10 19:54:39,500][25689] Fps is (10 sec: 5619.7, 60 sec: 5522.7, 300 sec: 5536.7). Total num frames: 889120768. Throughput: 0: 4994.3. Samples: 889115804. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:39,500][25689] Avg episode reward: [(0, '-2.618')] [2022-07-10 19:54:41,077][26022] Updated weights on worker 0-0, policy_version 868291 (0.00090) [2022-07-10 19:54:42,802][26022] Updated weights on worker 0-0, policy_version 868301 (0.00093) [2022-07-10 19:54:44,633][25689] Fps is (10 sec: 5654.3, 60 sec: 5518.4, 300 sec: 5541.2). Total num frames: 889149440. Throughput: 0: 5820.0. Samples: 889149190. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:44,634][25689] Avg episode reward: [(0, '-2.379')] [2022-07-10 19:54:44,870][26022] Updated weights on worker 0-0, policy_version 868311 (0.00084) [2022-07-10 19:54:46,367][26022] Updated weights on worker 0-0, policy_version 868321 (0.00093) [2022-07-10 19:54:48,603][26022] Updated weights on worker 0-0, policy_version 868331 (0.00083) [2022-07-10 19:54:49,679][25689] Fps is (10 sec: 5633.3, 60 sec: 5535.3, 300 sec: 5538.7). Total num frames: 889178112. Throughput: 0: 5816.3. Samples: 889182782. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:49,680][25689] Avg episode reward: [(0, '-3.223')] [2022-07-10 19:54:50,221][26022] Updated weights on worker 0-0, policy_version 868341 (0.00089) [2022-07-10 19:54:52,056][26022] Updated weights on worker 0-0, policy_version 868351 (0.00087) [2022-07-10 19:54:53,623][26022] Updated weights on worker 0-0, policy_version 868361 (0.00085) [2022-07-10 19:54:54,684][25689] Fps is (10 sec: 5501.7, 60 sec: 5519.5, 300 sec: 5535.3). Total num frames: 889204736. Throughput: 0: 4986.9. Samples: 889199722. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:54,685][25689] Avg episode reward: [(0, '-2.962')] [2022-07-10 19:54:55,877][26022] Updated weights on worker 0-0, policy_version 868371 (0.00089) [2022-07-10 19:54:57,495][26022] Updated weights on worker 0-0, policy_version 868381 (0.00090) [2022-07-10 19:54:59,403][26022] Updated weights on worker 0-0, policy_version 868391 (0.00097) [2022-07-10 19:54:59,703][25689] Fps is (10 sec: 5516.5, 60 sec: 5519.7, 300 sec: 5543.3). Total num frames: 889233408. Throughput: 0: 5825.4. Samples: 889233496. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:54:59,703][25689] Avg episode reward: [(0, '-3.507')] [2022-07-10 19:55:00,245][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:55:00,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000868396_889237504.pth [2022-07-10 19:55:00,256][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000866448_887242752.pth [2022-07-10 19:55:01,123][26022] Updated weights on worker 0-0, policy_version 868401 (0.00093) [2022-07-10 19:55:03,473][26022] Updated weights on worker 0-0, policy_version 868411 (0.00090) [2022-07-10 19:55:04,767][25689] Fps is (10 sec: 5585.7, 60 sec: 5554.8, 300 sec: 5542.5). Total num frames: 889261056. Throughput: 0: 5748.3. Samples: 889264924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:04,767][25689] Avg episode reward: [(0, '-3.519')] [2022-07-10 19:55:05,022][26022] Updated weights on worker 0-0, policy_version 868421 (0.00086) [2022-07-10 19:55:07,129][26022] Updated weights on worker 0-0, policy_version 868431 (0.00085) [2022-07-10 19:55:08,558][26022] Updated weights on worker 0-0, policy_version 868441 (0.00091) [2022-07-10 19:55:09,789][25689] Fps is (10 sec: 5380.6, 60 sec: 5538.4, 300 sec: 5535.4). Total num frames: 889287680. Throughput: 0: 4924.1. Samples: 889281806. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:09,791][25689] Avg episode reward: [(0, '-1.847')] [2022-07-10 19:55:10,736][26022] Updated weights on worker 0-0, policy_version 868451 (0.00094) [2022-07-10 19:55:12,494][26022] Updated weights on worker 0-0, policy_version 868461 (0.00088) [2022-07-10 19:55:14,326][26022] Updated weights on worker 0-0, policy_version 868471 (0.00094) [2022-07-10 19:55:14,827][25689] Fps is (10 sec: 5496.7, 60 sec: 5556.6, 300 sec: 5534.9). Total num frames: 889316352. Throughput: 0: 5745.6. Samples: 889315454. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:14,827][25689] Avg episode reward: [(0, '-1.069')] [2022-07-10 19:55:16,108][26022] Updated weights on worker 0-0, policy_version 868481 (0.00082) [2022-07-10 19:55:18,129][26022] Updated weights on worker 0-0, policy_version 868491 (0.00085) [2022-07-10 19:55:19,634][26022] Updated weights on worker 0-0, policy_version 868501 (0.00093) [2022-07-10 19:55:19,832][25689] Fps is (10 sec: 5812.3, 60 sec: 5573.7, 300 sec: 5542.3). Total num frames: 889346048. Throughput: 0: 5749.7. Samples: 889349230. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:19,832][25689] Avg episode reward: [(0, '-1.163')] [2022-07-10 19:55:21,915][26022] Updated weights on worker 0-0, policy_version 868511 (0.00092) [2022-07-10 19:55:23,261][26022] Updated weights on worker 0-0, policy_version 868521 (0.00086) [2022-07-10 19:55:24,888][25689] Fps is (10 sec: 5597.6, 60 sec: 5558.5, 300 sec: 5535.1). Total num frames: 889372672. Throughput: 0: 5024.6. Samples: 889366026. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:24,889][25689] Avg episode reward: [(0, '-0.898')] [2022-07-10 19:55:25,479][26022] Updated weights on worker 0-0, policy_version 868531 (0.00089) [2022-07-10 19:55:26,995][26022] Updated weights on worker 0-0, policy_version 868541 (0.00083) [2022-07-10 19:55:28,973][26022] Updated weights on worker 0-0, policy_version 868551 (0.00093) [2022-07-10 19:55:29,911][25689] Fps is (10 sec: 5384.5, 60 sec: 5557.7, 300 sec: 5534.7). Total num frames: 889400320. Throughput: 0: 5856.6. Samples: 889399650. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:29,911][25689] Avg episode reward: [(0, '0.108')] [2022-07-10 19:55:30,780][26022] Updated weights on worker 0-0, policy_version 868561 (0.00091) [2022-07-10 19:55:32,737][26022] Updated weights on worker 0-0, policy_version 868571 (0.00086) [2022-07-10 19:55:34,619][26022] Updated weights on worker 0-0, policy_version 868582 (0.00088) [2022-07-10 19:55:34,938][25689] Fps is (10 sec: 5604.2, 60 sec: 5573.5, 300 sec: 5534.7). Total num frames: 889428992. Throughput: 0: 5855.1. Samples: 889433206. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-10 19:55:34,941][25689] Avg episode reward: [(0, '-0.965')] [2022-07-10 19:55:36,682][26022] Updated weights on worker 0-0, policy_version 868592 (0.00094) [2022-07-10 19:55:38,206][26022] Updated weights on worker 0-0, policy_version 868602 (0.00085) [2022-07-10 19:55:39,963][25689] Fps is (10 sec: 5603.0, 60 sec: 5555.0, 300 sec: 5535.2). Total num frames: 889456640. Throughput: 0: 5830.8. Samples: 889466610. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:55:39,963][25689] Avg episode reward: [(0, '-1.011')] [2022-07-10 19:55:40,432][26022] Updated weights on worker 0-0, policy_version 868612 (0.00087) [2022-07-10 19:55:41,987][26022] Updated weights on worker 0-0, policy_version 868622 (0.00097) [2022-07-10 19:55:44,159][26022] Updated weights on worker 0-0, policy_version 868632 (0.00089) [2022-07-10 19:55:45,021][25689] Fps is (10 sec: 5585.6, 60 sec: 5561.9, 300 sec: 5537.7). Total num frames: 889485312. Throughput: 0: 5804.5. Samples: 889482886. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:55:45,021][25689] Avg episode reward: [(0, '-1.029')] [2022-07-10 19:55:45,673][26022] Updated weights on worker 0-0, policy_version 868642 (0.00093) [2022-07-10 19:55:47,868][26022] Updated weights on worker 0-0, policy_version 868652 (0.00086) [2022-07-10 19:55:49,459][26022] Updated weights on worker 0-0, policy_version 868662 (0.00089) [2022-07-10 19:55:50,078][25689] Fps is (10 sec: 5669.1, 60 sec: 5560.9, 300 sec: 5536.7). Total num frames: 889513984. Throughput: 0: 5773.3. Samples: 889516078. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:55:50,078][25689] Avg episode reward: [(0, '-2.011')] [2022-07-10 19:55:51,443][26022] Updated weights on worker 0-0, policy_version 868672 (0.00088) [2022-07-10 19:55:53,112][26022] Updated weights on worker 0-0, policy_version 868682 (0.00081) [2022-07-10 19:55:55,088][25689] Fps is (10 sec: 5289.3, 60 sec: 5526.5, 300 sec: 5529.9). Total num frames: 889538560. Throughput: 0: 5769.9. Samples: 889549470. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:55:55,088][25689] Avg episode reward: [(0, '-2.430')] [2022-07-10 19:55:55,187][26022] Updated weights on worker 0-0, policy_version 868692 (0.00082) [2022-07-10 19:55:56,909][26022] Updated weights on worker 0-0, policy_version 868702 (0.00094) [2022-07-10 19:55:58,761][26022] Updated weights on worker 0-0, policy_version 868712 (0.00086) [2022-07-10 19:56:00,119][25689] Fps is (10 sec: 5404.6, 60 sec: 5542.3, 300 sec: 5540.7). Total num frames: 889568256. Throughput: 0: 4937.9. Samples: 889566142. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:00,120][25689] Avg episode reward: [(0, '-3.022')] [2022-07-10 19:56:00,538][26022] Updated weights on worker 0-0, policy_version 868722 (0.00080) [2022-07-10 19:56:02,831][26022] Updated weights on worker 0-0, policy_version 868732 (0.00097) [2022-07-10 19:56:04,407][26022] Updated weights on worker 0-0, policy_version 868742 (0.00093) [2022-07-10 19:56:05,195][25689] Fps is (10 sec: 5572.4, 60 sec: 5524.3, 300 sec: 5529.0). Total num frames: 889594880. Throughput: 0: 5696.3. Samples: 889597804. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:05,195][25689] Avg episode reward: [(0, '-3.063')] [2022-07-10 19:56:06,535][26022] Updated weights on worker 0-0, policy_version 868752 (0.00082) [2022-07-10 19:56:08,171][26022] Updated weights on worker 0-0, policy_version 868762 (0.00079) [2022-07-10 19:56:10,017][26022] Updated weights on worker 0-0, policy_version 868772 (0.00086) [2022-07-10 19:56:10,243][25689] Fps is (10 sec: 5462.2, 60 sec: 5555.9, 300 sec: 5538.6). Total num frames: 889623552. Throughput: 0: 5722.7. Samples: 889631478. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:10,243][25689] Avg episode reward: [(0, '-3.373')] [2022-07-10 19:56:11,871][26022] Updated weights on worker 0-0, policy_version 868782 (0.00089) [2022-07-10 19:56:13,783][26022] Updated weights on worker 0-0, policy_version 868792 (0.00087) [2022-07-10 19:56:15,255][25689] Fps is (10 sec: 5598.5, 60 sec: 5541.3, 300 sec: 5535.2). Total num frames: 889651200. Throughput: 0: 4897.3. Samples: 889648234. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:15,260][25689] Avg episode reward: [(0, '-2.356')] [2022-07-10 19:56:15,439][26022] Updated weights on worker 0-0, policy_version 868802 (0.00087) [2022-07-10 19:56:17,371][26022] Updated weights on worker 0-0, policy_version 868812 (0.00098) [2022-07-10 19:56:19,100][26022] Updated weights on worker 0-0, policy_version 868822 (0.00095) [2022-07-10 19:56:20,300][25689] Fps is (10 sec: 5498.2, 60 sec: 5503.7, 300 sec: 5534.1). Total num frames: 889678848. Throughput: 0: 5751.7. Samples: 889682214. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:20,300][25689] Avg episode reward: [(0, '-2.411')] [2022-07-10 19:56:21,025][26022] Updated weights on worker 0-0, policy_version 868832 (0.00086) [2022-07-10 19:56:22,745][26022] Updated weights on worker 0-0, policy_version 868842 (0.00089) [2022-07-10 19:56:24,618][26022] Updated weights on worker 0-0, policy_version 868852 (0.00774) [2022-07-10 19:56:25,381][25689] Fps is (10 sec: 5561.6, 60 sec: 5535.3, 300 sec: 5536.2). Total num frames: 889707520. Throughput: 0: 5841.2. Samples: 889715716. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:25,383][25689] Avg episode reward: [(0, '-1.586')] [2022-07-10 19:56:26,416][26022] Updated weights on worker 0-0, policy_version 868862 (0.00095) [2022-07-10 19:56:28,462][26022] Updated weights on worker 0-0, policy_version 868872 (0.00090) [2022-07-10 19:56:30,019][26022] Updated weights on worker 0-0, policy_version 868882 (0.00108) [2022-07-10 19:56:30,386][25689] Fps is (10 sec: 5786.9, 60 sec: 5570.8, 300 sec: 5539.8). Total num frames: 889737216. Throughput: 0: 5017.7. Samples: 889732552. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:30,388][25689] Avg episode reward: [(0, '-0.874')] [2022-07-10 19:56:32,204][26022] Updated weights on worker 0-0, policy_version 868892 (0.00074) [2022-07-10 19:56:33,623][26022] Updated weights on worker 0-0, policy_version 868902 (0.00082) [2022-07-10 19:56:35,405][25689] Fps is (10 sec: 5618.5, 60 sec: 5537.6, 300 sec: 5536.4). Total num frames: 889763840. Throughput: 0: 5851.0. Samples: 889766134. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:35,406][25689] Avg episode reward: [(0, '-0.110')] [2022-07-10 19:56:35,719][26022] Updated weights on worker 0-0, policy_version 868912 (0.00092) [2022-07-10 19:56:37,501][26022] Updated weights on worker 0-0, policy_version 868922 (0.00086) [2022-07-10 19:56:39,225][26022] Updated weights on worker 0-0, policy_version 868932 (0.00091) [2022-07-10 19:56:40,414][25689] Fps is (10 sec: 5514.1, 60 sec: 5556.0, 300 sec: 5534.7). Total num frames: 889792512. Throughput: 0: 5842.8. Samples: 889799738. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:40,415][25689] Avg episode reward: [(0, '-0.164')] [2022-07-10 19:56:41,226][26022] Updated weights on worker 0-0, policy_version 868942 (0.00088) [2022-07-10 19:56:42,951][26022] Updated weights on worker 0-0, policy_version 868952 (0.00093) [2022-07-10 19:56:44,795][26022] Updated weights on worker 0-0, policy_version 868962 (0.00087) [2022-07-10 19:56:45,471][25689] Fps is (10 sec: 5696.8, 60 sec: 5556.2, 300 sec: 5537.4). Total num frames: 889821184. Throughput: 0: 5008.4. Samples: 889816336. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:45,472][25689] Avg episode reward: [(0, '-0.193')] [2022-07-10 19:56:46,587][26022] Updated weights on worker 0-0, policy_version 868972 (0.00092) [2022-07-10 19:56:48,326][26022] Updated weights on worker 0-0, policy_version 868982 (0.00091) [2022-07-10 19:56:50,498][25689] Fps is (10 sec: 5483.8, 60 sec: 5525.0, 300 sec: 5538.2). Total num frames: 889847808. Throughput: 0: 5839.8. Samples: 889850000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:50,499][25689] Avg episode reward: [(0, '0.304')] [2022-07-10 19:56:50,506][26022] Updated weights on worker 0-0, policy_version 868992 (0.00086) [2022-07-10 19:56:52,017][26022] Updated weights on worker 0-0, policy_version 869002 (0.00093) [2022-07-10 19:56:54,007][26022] Updated weights on worker 0-0, policy_version 869012 (0.00088) [2022-07-10 19:56:55,542][25689] Fps is (10 sec: 5490.8, 60 sec: 5589.7, 300 sec: 5537.9). Total num frames: 889876480. Throughput: 0: 5846.7. Samples: 889883868. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:56:55,542][25689] Avg episode reward: [(0, '0.005')] [2022-07-10 19:56:55,899][26022] Updated weights on worker 0-0, policy_version 869022 (0.00083) [2022-07-10 19:56:57,576][26022] Updated weights on worker 0-0, policy_version 869032 (0.00086) [2022-07-10 19:56:59,567][26022] Updated weights on worker 0-0, policy_version 869042 (0.00085) [2022-07-10 19:57:00,541][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:57:00,548][25689] Fps is (10 sec: 5603.8, 60 sec: 5558.2, 300 sec: 5545.7). Total num frames: 889904128. Throughput: 0: 5016.1. Samples: 889900732. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:00,549][25689] Avg episode reward: [(0, '-0.020')] [2022-07-10 19:57:00,572][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000869048_889905152.pth [2022-07-10 19:57:00,572][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000867098_887908352.pth [2022-07-10 19:57:01,320][26022] Updated weights on worker 0-0, policy_version 869052 (0.00085) [2022-07-10 19:57:03,510][26022] Updated weights on worker 0-0, policy_version 869062 (0.00089) [2022-07-10 19:57:05,483][26022] Updated weights on worker 0-0, policy_version 869072 (0.00713) [2022-07-10 19:57:05,638][25689] Fps is (10 sec: 5274.3, 60 sec: 5539.9, 300 sec: 5537.5). Total num frames: 889929728. Throughput: 0: 5729.3. Samples: 889931876. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:05,638][25689] Avg episode reward: [(0, '-0.013')] [2022-07-10 19:57:07,092][26022] Updated weights on worker 0-0, policy_version 869082 (0.00091) [2022-07-10 19:57:09,198][26022] Updated weights on worker 0-0, policy_version 869092 (0.00086) [2022-07-10 19:57:10,666][25689] Fps is (10 sec: 5363.8, 60 sec: 5541.7, 300 sec: 5540.8). Total num frames: 889958400. Throughput: 0: 5719.3. Samples: 889965350. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:10,667][25689] Avg episode reward: [(0, '0.005')] [2022-07-10 19:57:10,927][26022] Updated weights on worker 0-0, policy_version 869102 (0.00090) [2022-07-10 19:57:12,847][26022] Updated weights on worker 0-0, policy_version 869112 (0.00082) [2022-07-10 19:57:14,532][26022] Updated weights on worker 0-0, policy_version 869122 (0.00080) [2022-07-10 19:57:15,745][25689] Fps is (10 sec: 5572.3, 60 sec: 5535.6, 300 sec: 5536.4). Total num frames: 889986048. Throughput: 0: 4856.3. Samples: 889981980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:15,745][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 19:57:16,417][26022] Updated weights on worker 0-0, policy_version 869132 (0.00094) [2022-07-10 19:57:18,311][26022] Updated weights on worker 0-0, policy_version 869142 (0.00085) [2022-07-10 19:57:20,197][26022] Updated weights on worker 0-0, policy_version 869152 (0.00085) [2022-07-10 19:57:20,792][25689] Fps is (10 sec: 5663.1, 60 sec: 5569.2, 300 sec: 5540.5). Total num frames: 890015744. Throughput: 0: 5659.6. Samples: 890015306. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:20,793][25689] Avg episode reward: [(0, '-0.084')] [2022-07-10 19:57:22,030][26022] Updated weights on worker 0-0, policy_version 869162 (0.00091) [2022-07-10 19:57:23,807][26022] Updated weights on worker 0-0, policy_version 869172 (0.00089) [2022-07-10 19:57:25,533][26022] Updated weights on worker 0-0, policy_version 869182 (0.00092) [2022-07-10 19:57:25,875][25689] Fps is (10 sec: 5559.8, 60 sec: 5535.3, 300 sec: 5535.9). Total num frames: 890042368. Throughput: 0: 5766.0. Samples: 890048562. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:25,875][25689] Avg episode reward: [(0, '-0.003')] [2022-07-10 19:57:27,566][26022] Updated weights on worker 0-0, policy_version 869192 (0.00088) [2022-07-10 19:57:29,431][26022] Updated weights on worker 0-0, policy_version 869202 (0.00092) [2022-07-10 19:57:30,897][25689] Fps is (10 sec: 5472.1, 60 sec: 5516.7, 300 sec: 5535.6). Total num frames: 890071040. Throughput: 0: 4930.0. Samples: 890065088. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:30,898][25689] Avg episode reward: [(0, '-1.633')] [2022-07-10 19:57:31,152][26022] Updated weights on worker 0-0, policy_version 869212 (0.00085) [2022-07-10 19:57:33,176][26022] Updated weights on worker 0-0, policy_version 869222 (0.00091) [2022-07-10 19:57:35,018][26022] Updated weights on worker 0-0, policy_version 869232 (0.00085) [2022-07-10 19:57:35,927][25689] Fps is (10 sec: 5704.7, 60 sec: 5549.6, 300 sec: 5543.8). Total num frames: 890099712. Throughput: 0: 5780.5. Samples: 890098642. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:35,929][25689] Avg episode reward: [(0, '-1.854')] [2022-07-10 19:57:36,749][26022] Updated weights on worker 0-0, policy_version 869242 (0.00092) [2022-07-10 19:57:38,583][26022] Updated weights on worker 0-0, policy_version 869252 (0.00090) [2022-07-10 19:57:40,304][26022] Updated weights on worker 0-0, policy_version 869262 (0.00085) [2022-07-10 19:57:40,954][25689] Fps is (10 sec: 5396.7, 60 sec: 5497.2, 300 sec: 5535.4). Total num frames: 890125312. Throughput: 0: 5807.3. Samples: 890132390. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:40,955][25689] Avg episode reward: [(0, '-1.919')] [2022-07-10 19:57:42,149][26022] Updated weights on worker 0-0, policy_version 869272 (0.00089) [2022-07-10 19:57:44,214][26022] Updated weights on worker 0-0, policy_version 869282 (0.00087) [2022-07-10 19:57:45,693][26022] Updated weights on worker 0-0, policy_version 869292 (0.00092) [2022-07-10 19:57:46,069][25689] Fps is (10 sec: 5553.1, 60 sec: 5525.8, 300 sec: 5541.1). Total num frames: 890156032. Throughput: 0: 4974.4. Samples: 890149016. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:46,069][25689] Avg episode reward: [(0, '-1.901')] [2022-07-10 19:57:47,758][26022] Updated weights on worker 0-0, policy_version 869302 (0.00085) [2022-07-10 19:57:49,449][26022] Updated weights on worker 0-0, policy_version 869312 (0.00090) [2022-07-10 19:57:51,085][25689] Fps is (10 sec: 5660.4, 60 sec: 5526.8, 300 sec: 5538.9). Total num frames: 890182656. Throughput: 0: 5835.2. Samples: 890182884. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:51,086][25689] Avg episode reward: [(0, '-1.523')] [2022-07-10 19:57:51,368][26022] Updated weights on worker 0-0, policy_version 869322 (0.00090) [2022-07-10 19:57:53,348][26022] Updated weights on worker 0-0, policy_version 869332 (0.00086) [2022-07-10 19:57:54,922][26022] Updated weights on worker 0-0, policy_version 869342 (0.00090) [2022-07-10 19:57:56,099][25689] Fps is (10 sec: 5512.9, 60 sec: 5529.5, 300 sec: 5538.8). Total num frames: 890211328. Throughput: 0: 5838.4. Samples: 890216416. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:57:56,099][25689] Avg episode reward: [(0, '-1.653')] [2022-07-10 19:57:56,881][26022] Updated weights on worker 0-0, policy_version 869352 (0.00087) [2022-07-10 19:57:58,631][26022] Updated weights on worker 0-0, policy_version 869362 (0.00088) [2022-07-10 19:58:00,646][26022] Updated weights on worker 0-0, policy_version 869372 (0.00090) [2022-07-10 19:58:01,119][25689] Fps is (10 sec: 5714.8, 60 sec: 5545.1, 300 sec: 5551.0). Total num frames: 890240000. Throughput: 0: 5004.1. Samples: 890233298. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:01,120][25689] Avg episode reward: [(0, '-1.038')] [2022-07-10 19:58:02,723][26022] Updated weights on worker 0-0, policy_version 869382 (0.00087) [2022-07-10 19:58:04,458][26022] Updated weights on worker 0-0, policy_version 869392 (0.00088) [2022-07-10 19:58:06,163][25689] Fps is (10 sec: 5494.5, 60 sec: 5566.2, 300 sec: 5543.9). Total num frames: 890266624. Throughput: 0: 5760.3. Samples: 890264764. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:06,163][25689] Avg episode reward: [(0, '-1.166')] [2022-07-10 19:58:06,348][26022] Updated weights on worker 0-0, policy_version 869402 (0.00088) [2022-07-10 19:58:08,101][26022] Updated weights on worker 0-0, policy_version 869412 (0.00875) [2022-07-10 19:58:10,207][26022] Updated weights on worker 0-0, policy_version 869422 (0.00084) [2022-07-10 19:58:11,191][25689] Fps is (10 sec: 5388.5, 60 sec: 5549.4, 300 sec: 5543.9). Total num frames: 890294272. Throughput: 0: 5745.8. Samples: 890298410. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:11,191][25689] Avg episode reward: [(0, '-1.465')] [2022-07-10 19:58:11,921][26022] Updated weights on worker 0-0, policy_version 869432 (0.00091) [2022-07-10 19:58:13,922][26022] Updated weights on worker 0-0, policy_version 869442 (0.00088) [2022-07-10 19:58:15,534][26022] Updated weights on worker 0-0, policy_version 869452 (0.00100) [2022-07-10 19:58:16,192][25689] Fps is (10 sec: 5615.8, 60 sec: 5573.4, 300 sec: 5545.5). Total num frames: 890322944. Throughput: 0: 4913.7. Samples: 890315146. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:16,192][25689] Avg episode reward: [(0, '-1.240')] [2022-07-10 19:58:17,511][26022] Updated weights on worker 0-0, policy_version 869462 (0.00097) [2022-07-10 19:58:19,053][26022] Updated weights on worker 0-0, policy_version 869472 (0.00092) [2022-07-10 19:58:21,033][26022] Updated weights on worker 0-0, policy_version 869482 (0.00084) [2022-07-10 19:58:21,201][25689] Fps is (10 sec: 5524.1, 60 sec: 5526.1, 300 sec: 5543.6). Total num frames: 890349568. Throughput: 0: 5755.2. Samples: 890348874. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:21,201][25689] Avg episode reward: [(0, '-1.582')] [2022-07-10 19:58:22,695][26022] Updated weights on worker 0-0, policy_version 869492 (0.00084) [2022-07-10 19:58:24,907][26022] Updated weights on worker 0-0, policy_version 869502 (0.00085) [2022-07-10 19:58:26,319][25689] Fps is (10 sec: 5561.3, 60 sec: 5573.7, 300 sec: 5542.1). Total num frames: 890379264. Throughput: 0: 5859.7. Samples: 890382872. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:26,319][25689] Avg episode reward: [(0, '-1.369')] [2022-07-10 19:58:26,367][26022] Updated weights on worker 0-0, policy_version 869512 (0.00087) [2022-07-10 19:58:28,453][26022] Updated weights on worker 0-0, policy_version 869522 (0.00085) [2022-07-10 19:58:29,913][26022] Updated weights on worker 0-0, policy_version 869532 (0.00087) [2022-07-10 19:58:31,366][25689] Fps is (10 sec: 5641.0, 60 sec: 5554.5, 300 sec: 5541.6). Total num frames: 890406912. Throughput: 0: 5838.0. Samples: 890416194. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:31,368][25689] Avg episode reward: [(0, '-0.463')] [2022-07-10 19:58:32,255][26022] Updated weights on worker 0-0, policy_version 869542 (0.00086) [2022-07-10 19:58:33,723][26022] Updated weights on worker 0-0, policy_version 869552 (0.00083) [2022-07-10 19:58:35,897][26022] Updated weights on worker 0-0, policy_version 869562 (0.00091) [2022-07-10 19:58:36,389][25689] Fps is (10 sec: 5491.2, 60 sec: 5538.1, 300 sec: 5541.3). Total num frames: 890434560. Throughput: 0: 5823.5. Samples: 890432764. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:36,389][25689] Avg episode reward: [(0, '-0.252')] [2022-07-10 19:58:37,386][26022] Updated weights on worker 0-0, policy_version 869572 (0.00087) [2022-07-10 19:58:39,603][26022] Updated weights on worker 0-0, policy_version 869582 (0.00096) [2022-07-10 19:58:41,274][26022] Updated weights on worker 0-0, policy_version 869592 (0.00084) [2022-07-10 19:58:41,418][25689] Fps is (10 sec: 5500.9, 60 sec: 5571.8, 300 sec: 5538.9). Total num frames: 890462208. Throughput: 0: 5817.7. Samples: 890466494. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:41,427][25689] Avg episode reward: [(0, '0.001')] [2022-07-10 19:58:43,045][26022] Updated weights on worker 0-0, policy_version 869602 (0.00084) [2022-07-10 19:58:44,856][26022] Updated weights on worker 0-0, policy_version 869612 (0.00090) [2022-07-10 19:58:46,477][25689] Fps is (10 sec: 5684.0, 60 sec: 5560.0, 300 sec: 5545.5). Total num frames: 890491904. Throughput: 0: 5807.1. Samples: 890499936. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:46,478][25689] Avg episode reward: [(0, '-0.310')] [2022-07-10 19:58:46,871][26022] Updated weights on worker 0-0, policy_version 869622 (0.00086) [2022-07-10 19:58:48,596][26022] Updated weights on worker 0-0, policy_version 869632 (0.00088) [2022-07-10 19:58:50,557][26022] Updated weights on worker 0-0, policy_version 869642 (0.00084) [2022-07-10 19:58:51,490][25689] Fps is (10 sec: 5693.6, 60 sec: 5577.2, 300 sec: 5545.6). Total num frames: 890519552. Throughput: 0: 5002.8. Samples: 890516870. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:51,492][25689] Avg episode reward: [(0, '-0.275')] [2022-07-10 19:58:52,050][26022] Updated weights on worker 0-0, policy_version 869652 (0.00097) [2022-07-10 19:58:54,186][26022] Updated weights on worker 0-0, policy_version 869662 (0.00085) [2022-07-10 19:58:55,816][26022] Updated weights on worker 0-0, policy_version 869672 (0.00092) [2022-07-10 19:58:56,530][25689] Fps is (10 sec: 5500.6, 60 sec: 5557.9, 300 sec: 5541.8). Total num frames: 890547200. Throughput: 0: 5831.0. Samples: 890550208. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:58:56,531][25689] Avg episode reward: [(0, '-0.728')] [2022-07-10 19:58:57,737][26022] Updated weights on worker 0-0, policy_version 869682 (0.00085) [2022-07-10 19:58:59,573][26022] Updated weights on worker 0-0, policy_version 869692 (0.00087) [2022-07-10 19:59:00,630][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 19:59:00,642][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000869698_890570752.pth [2022-07-10 19:59:00,643][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000867746_888571904.pth [2022-07-10 19:59:01,230][26022] Updated weights on worker 0-0, policy_version 869702 (0.00085) [2022-07-10 19:59:01,578][25689] Fps is (10 sec: 5583.2, 60 sec: 5555.4, 300 sec: 5552.7). Total num frames: 890575872. Throughput: 0: 5836.4. Samples: 890584150. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:59:01,578][25689] Avg episode reward: [(0, '-0.618')] [2022-07-10 19:59:03,550][26022] Updated weights on worker 0-0, policy_version 869712 (0.00091) [2022-07-10 19:59:05,361][26022] Updated weights on worker 0-0, policy_version 869722 (0.00080) [2022-07-10 19:59:06,668][25689] Fps is (10 sec: 5454.3, 60 sec: 5551.1, 300 sec: 5548.1). Total num frames: 890602496. Throughput: 0: 4899.7. Samples: 890598866. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:59:06,669][25689] Avg episode reward: [(0, '-0.508')] [2022-07-10 19:59:07,095][26022] Updated weights on worker 0-0, policy_version 869732 (0.00094) [2022-07-10 19:59:09,223][26022] Updated weights on worker 0-0, policy_version 869742 (0.00093) [2022-07-10 19:59:10,791][26022] Updated weights on worker 0-0, policy_version 869752 (0.00086) [2022-07-10 19:59:11,689][25689] Fps is (10 sec: 5468.5, 60 sec: 5568.7, 300 sec: 5552.1). Total num frames: 890631168. Throughput: 0: 5723.7. Samples: 890632484. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:59:11,691][25689] Avg episode reward: [(0, '-0.278')] [2022-07-10 19:59:12,888][26022] Updated weights on worker 0-0, policy_version 869762 (0.00092) [2022-07-10 19:59:14,612][26022] Updated weights on worker 0-0, policy_version 869772 (0.00115) [2022-07-10 19:59:16,516][26022] Updated weights on worker 0-0, policy_version 869782 (0.00091) [2022-07-10 19:59:16,704][25689] Fps is (10 sec: 5509.7, 60 sec: 5533.5, 300 sec: 5545.1). Total num frames: 890657792. Throughput: 0: 5726.4. Samples: 890665734. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2022-07-10 19:59:16,706][25689] Avg episode reward: [(0, '0.588')] [2022-07-10 19:59:18,292][26022] Updated weights on worker 0-0, policy_version 869792 (0.00084) [2022-07-10 19:59:20,100][26022] Updated weights on worker 0-0, policy_version 869802 (0.00087) [2022-07-10 19:59:21,718][25689] Fps is (10 sec: 5411.8, 60 sec: 5550.0, 300 sec: 5546.2). Total num frames: 890685440. Throughput: 0: 4881.6. Samples: 890682466. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:21,720][25689] Avg episode reward: [(0, '-0.058')] [2022-07-10 19:59:21,938][26022] Updated weights on worker 0-0, policy_version 869812 (0.00083) [2022-07-10 19:59:23,910][26022] Updated weights on worker 0-0, policy_version 869822 (0.00099) [2022-07-10 19:59:25,615][26022] Updated weights on worker 0-0, policy_version 869832 (0.00085) [2022-07-10 19:59:26,827][25689] Fps is (10 sec: 5563.8, 60 sec: 5533.9, 300 sec: 5547.9). Total num frames: 890714112. Throughput: 0: 5800.6. Samples: 890715798. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:26,827][25689] Avg episode reward: [(0, '0.513')] [2022-07-10 19:59:27,593][26022] Updated weights on worker 0-0, policy_version 869842 (0.00086) [2022-07-10 19:59:29,114][26022] Updated weights on worker 0-0, policy_version 869852 (0.00634) [2022-07-10 19:59:31,412][26022] Updated weights on worker 0-0, policy_version 869862 (0.00089) [2022-07-10 19:59:31,921][25689] Fps is (10 sec: 5620.3, 60 sec: 5546.6, 300 sec: 5549.9). Total num frames: 890742784. Throughput: 0: 5773.1. Samples: 890749282. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:31,921][25689] Avg episode reward: [(0, '0.688')] [2022-07-10 19:59:32,837][26022] Updated weights on worker 0-0, policy_version 869872 (0.00093) [2022-07-10 19:59:34,926][26022] Updated weights on worker 0-0, policy_version 869882 (0.00091) [2022-07-10 19:59:36,518][26022] Updated weights on worker 0-0, policy_version 869892 (0.00093) [2022-07-10 19:59:36,976][25689] Fps is (10 sec: 5549.1, 60 sec: 5543.6, 300 sec: 5545.5). Total num frames: 890770432. Throughput: 0: 4932.4. Samples: 890765724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:36,977][25689] Avg episode reward: [(0, '0.429')] [2022-07-10 19:59:38,787][26022] Updated weights on worker 0-0, policy_version 869902 (0.00089) [2022-07-10 19:59:40,380][26022] Updated weights on worker 0-0, policy_version 869912 (0.00091) [2022-07-10 19:59:41,991][25689] Fps is (10 sec: 5389.4, 60 sec: 5528.0, 300 sec: 5540.9). Total num frames: 890797056. Throughput: 0: 5759.7. Samples: 890799234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:41,991][25689] Avg episode reward: [(0, '0.135')] [2022-07-10 19:59:42,365][26022] Updated weights on worker 0-0, policy_version 869922 (0.00089) [2022-07-10 19:59:43,968][26022] Updated weights on worker 0-0, policy_version 869932 (0.00090) [2022-07-10 19:59:46,096][26022] Updated weights on worker 0-0, policy_version 869942 (0.00093) [2022-07-10 19:59:47,102][25689] Fps is (10 sec: 5461.1, 60 sec: 5506.4, 300 sec: 5539.6). Total num frames: 890825728. Throughput: 0: 5750.6. Samples: 890832390. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:47,102][25689] Avg episode reward: [(0, '0.422')] [2022-07-10 19:59:47,776][26022] Updated weights on worker 0-0, policy_version 869952 (0.00085) [2022-07-10 19:59:49,782][26022] Updated weights on worker 0-0, policy_version 869962 (0.00085) [2022-07-10 19:59:51,458][26022] Updated weights on worker 0-0, policy_version 869972 (0.00094) [2022-07-10 19:59:52,108][25689] Fps is (10 sec: 5668.1, 60 sec: 5523.9, 300 sec: 5546.5). Total num frames: 890854400. Throughput: 0: 4947.0. Samples: 890849148. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:52,108][25689] Avg episode reward: [(0, '1.148')] [2022-07-10 19:59:53,385][26022] Updated weights on worker 0-0, policy_version 869982 (0.00089) [2022-07-10 19:59:55,107][26022] Updated weights on worker 0-0, policy_version 869992 (0.00096) [2022-07-10 19:59:57,072][26022] Updated weights on worker 0-0, policy_version 870002 (0.00090) [2022-07-10 19:59:57,130][25689] Fps is (10 sec: 5616.3, 60 sec: 5525.6, 300 sec: 5543.0). Total num frames: 890882048. Throughput: 0: 5801.8. Samples: 890882650. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 19:59:57,130][25689] Avg episode reward: [(0, '1.058')] [2022-07-10 19:59:59,091][26022] Updated weights on worker 0-0, policy_version 870012 (0.00091) [2022-07-10 20:00:00,677][26022] Updated weights on worker 0-0, policy_version 870022 (0.00090) [2022-07-10 20:00:02,138][25689] Fps is (10 sec: 5308.9, 60 sec: 5478.4, 300 sec: 5537.2). Total num frames: 890907648. Throughput: 0: 5758.8. Samples: 890915256. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:02,139][25689] Avg episode reward: [(0, '0.285')] [2022-07-10 20:00:02,962][26022] Updated weights on worker 0-0, policy_version 870032 (0.00093) [2022-07-10 20:00:04,863][26022] Updated weights on worker 0-0, policy_version 870042 (0.00094) [2022-07-10 20:00:06,804][26022] Updated weights on worker 0-0, policy_version 870052 (0.00086) [2022-07-10 20:00:07,200][25689] Fps is (10 sec: 5287.6, 60 sec: 5497.9, 300 sec: 5539.9). Total num frames: 890935296. Throughput: 0: 4875.1. Samples: 890930372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:07,201][25689] Avg episode reward: [(0, '0.209')] [2022-07-10 20:00:08,523][26022] Updated weights on worker 0-0, policy_version 870062 (0.00087) [2022-07-10 20:00:10,522][26022] Updated weights on worker 0-0, policy_version 870072 (0.00083) [2022-07-10 20:00:12,181][26022] Updated weights on worker 0-0, policy_version 870082 (0.00120) [2022-07-10 20:00:12,230][25689] Fps is (10 sec: 5580.7, 60 sec: 5497.1, 300 sec: 5540.0). Total num frames: 890963968. Throughput: 0: 5699.7. Samples: 890963836. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:12,231][25689] Avg episode reward: [(0, '-0.852')] [2022-07-10 20:00:14,170][26022] Updated weights on worker 0-0, policy_version 870092 (0.00091) [2022-07-10 20:00:16,030][26022] Updated weights on worker 0-0, policy_version 870102 (0.00620) [2022-07-10 20:00:17,257][25689] Fps is (10 sec: 5600.4, 60 sec: 5513.0, 300 sec: 5532.7). Total num frames: 890991616. Throughput: 0: 5674.9. Samples: 890996868. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:17,257][25689] Avg episode reward: [(0, '-0.826')] [2022-07-10 20:00:17,722][26022] Updated weights on worker 0-0, policy_version 870112 (0.00090) [2022-07-10 20:00:19,683][26022] Updated weights on worker 0-0, policy_version 870122 (0.00084) [2022-07-10 20:00:21,411][26022] Updated weights on worker 0-0, policy_version 870132 (0.00080) [2022-07-10 20:00:22,267][25689] Fps is (10 sec: 5611.3, 60 sec: 5530.2, 300 sec: 5540.5). Total num frames: 891020288. Throughput: 0: 4886.7. Samples: 891013620. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:22,268][25689] Avg episode reward: [(0, '-2.001')] [2022-07-10 20:00:23,295][26022] Updated weights on worker 0-0, policy_version 870142 (0.00089) [2022-07-10 20:00:25,066][26022] Updated weights on worker 0-0, policy_version 870152 (0.00087) [2022-07-10 20:00:27,060][26022] Updated weights on worker 0-0, policy_version 870162 (0.00086) [2022-07-10 20:00:27,362][25689] Fps is (10 sec: 5370.3, 60 sec: 5480.7, 300 sec: 5532.2). Total num frames: 891045888. Throughput: 0: 5786.8. Samples: 891047048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:27,367][25689] Avg episode reward: [(0, '-1.194')] [2022-07-10 20:00:28,912][26022] Updated weights on worker 0-0, policy_version 870172 (0.00093) [2022-07-10 20:00:30,801][26022] Updated weights on worker 0-0, policy_version 870182 (0.00082) [2022-07-10 20:00:32,403][25689] Fps is (10 sec: 5556.5, 60 sec: 5519.4, 300 sec: 5538.9). Total num frames: 891076608. Throughput: 0: 5796.0. Samples: 891080758. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:32,403][25689] Avg episode reward: [(0, '-1.269')] [2022-07-10 20:00:32,404][26022] Updated weights on worker 0-0, policy_version 870192 (0.00096) [2022-07-10 20:00:34,357][26022] Updated weights on worker 0-0, policy_version 870202 (0.00086) [2022-07-10 20:00:36,090][26022] Updated weights on worker 0-0, policy_version 870212 (0.00096) [2022-07-10 20:00:37,435][25689] Fps is (10 sec: 5794.5, 60 sec: 5521.5, 300 sec: 5538.7). Total num frames: 891104256. Throughput: 0: 4984.7. Samples: 891097454. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:37,437][25689] Avg episode reward: [(0, '-1.432')] [2022-07-10 20:00:38,098][26022] Updated weights on worker 0-0, policy_version 870222 (0.00084) [2022-07-10 20:00:39,731][26022] Updated weights on worker 0-0, policy_version 870232 (0.00089) [2022-07-10 20:00:41,761][26022] Updated weights on worker 0-0, policy_version 870242 (0.00084) [2022-07-10 20:00:42,493][25689] Fps is (10 sec: 5378.8, 60 sec: 5517.6, 300 sec: 5531.8). Total num frames: 891130880. Throughput: 0: 5792.6. Samples: 891130780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:42,493][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 20:00:43,629][26022] Updated weights on worker 0-0, policy_version 870252 (0.00057) [2022-07-10 20:00:45,366][26022] Updated weights on worker 0-0, policy_version 870262 (0.00091) [2022-07-10 20:00:47,069][26022] Updated weights on worker 0-0, policy_version 870272 (0.00086) [2022-07-10 20:00:47,550][25689] Fps is (10 sec: 5568.4, 60 sec: 5539.5, 300 sec: 5535.3). Total num frames: 891160576. Throughput: 0: 5826.2. Samples: 891164662. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:47,550][25689] Avg episode reward: [(0, '0.325')] [2022-07-10 20:00:49,079][26022] Updated weights on worker 0-0, policy_version 870282 (0.00083) [2022-07-10 20:00:50,868][26022] Updated weights on worker 0-0, policy_version 870292 (0.00091) [2022-07-10 20:00:52,609][25689] Fps is (10 sec: 5668.3, 60 sec: 5517.6, 300 sec: 5544.7). Total num frames: 891188224. Throughput: 0: 5810.0. Samples: 891198158. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:52,610][25689] Avg episode reward: [(0, '0.238')] [2022-07-10 20:00:52,882][26022] Updated weights on worker 0-0, policy_version 870302 (0.00084) [2022-07-10 20:00:54,404][26022] Updated weights on worker 0-0, policy_version 870312 (0.00085) [2022-07-10 20:00:56,304][26022] Updated weights on worker 0-0, policy_version 870322 (0.00090) [2022-07-10 20:00:57,638][25689] Fps is (10 sec: 5582.9, 60 sec: 5534.0, 300 sec: 5541.3). Total num frames: 891216896. Throughput: 0: 5807.5. Samples: 891214778. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:00:57,638][25689] Avg episode reward: [(0, '-0.012')] [2022-07-10 20:00:58,379][26022] Updated weights on worker 0-0, policy_version 870332 (0.00088) [2022-07-10 20:01:00,037][26022] Updated weights on worker 0-0, policy_version 870342 (0.00085) [2022-07-10 20:01:00,678][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:01:00,688][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000870346_891234304.pth [2022-07-10 20:01:00,689][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000868396_889237504.pth [2022-07-10 20:01:02,080][26022] Updated weights on worker 0-0, policy_version 870352 (0.00086) [2022-07-10 20:01:02,678][25689] Fps is (10 sec: 5390.3, 60 sec: 5531.1, 300 sec: 5538.5). Total num frames: 891242496. Throughput: 0: 5810.4. Samples: 891248064. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:02,678][25689] Avg episode reward: [(0, '-0.730')] [2022-07-10 20:01:03,915][26022] Updated weights on worker 0-0, policy_version 870362 (0.00089) [2022-07-10 20:01:06,009][26022] Updated weights on worker 0-0, policy_version 870372 (0.00092) [2022-07-10 20:01:07,747][25689] Fps is (10 sec: 5368.6, 60 sec: 5547.3, 300 sec: 5538.1). Total num frames: 891271168. Throughput: 0: 5670.7. Samples: 891279194. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:07,747][25689] Avg episode reward: [(0, '-2.567')] [2022-07-10 20:01:07,752][26022] Updated weights on worker 0-0, policy_version 870382 (0.00085) [2022-07-10 20:01:09,722][26022] Updated weights on worker 0-0, policy_version 870392 (0.00085) [2022-07-10 20:01:11,448][26022] Updated weights on worker 0-0, policy_version 870402 (0.00087) [2022-07-10 20:01:12,767][25689] Fps is (10 sec: 5480.7, 60 sec: 5514.4, 300 sec: 5534.5). Total num frames: 891297792. Throughput: 0: 4858.5. Samples: 891296094. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:12,767][25689] Avg episode reward: [(0, '-2.791')] [2022-07-10 20:01:13,319][26022] Updated weights on worker 0-0, policy_version 870412 (0.00088) [2022-07-10 20:01:15,002][26022] Updated weights on worker 0-0, policy_version 870422 (0.00091) [2022-07-10 20:01:17,082][26022] Updated weights on worker 0-0, policy_version 870432 (0.00085) [2022-07-10 20:01:17,768][25689] Fps is (10 sec: 5517.6, 60 sec: 5533.6, 300 sec: 5538.8). Total num frames: 891326464. Throughput: 0: 5703.2. Samples: 891329588. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:17,769][25689] Avg episode reward: [(0, '-2.863')] [2022-07-10 20:01:18,888][26022] Updated weights on worker 0-0, policy_version 870442 (0.00091) [2022-07-10 20:01:20,836][26022] Updated weights on worker 0-0, policy_version 870452 (0.00093) [2022-07-10 20:01:22,515][26022] Updated weights on worker 0-0, policy_version 870462 (0.00087) [2022-07-10 20:01:22,821][25689] Fps is (10 sec: 5601.5, 60 sec: 5512.8, 300 sec: 5535.9). Total num frames: 891354112. Throughput: 0: 5699.5. Samples: 891362874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:22,822][25689] Avg episode reward: [(0, '-2.511')] [2022-07-10 20:01:24,529][26022] Updated weights on worker 0-0, policy_version 870472 (0.00095) [2022-07-10 20:01:26,180][26022] Updated weights on worker 0-0, policy_version 870482 (0.00091) [2022-07-10 20:01:27,964][25689] Fps is (10 sec: 5423.6, 60 sec: 5542.3, 300 sec: 5526.4). Total num frames: 891381760. Throughput: 0: 4964.4. Samples: 891379560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:27,966][25689] Avg episode reward: [(0, '-1.929')] [2022-07-10 20:01:28,169][26022] Updated weights on worker 0-0, policy_version 870492 (0.00088) [2022-07-10 20:01:29,698][26022] Updated weights on worker 0-0, policy_version 870502 (0.00088) [2022-07-10 20:01:31,912][26022] Updated weights on worker 0-0, policy_version 870512 (0.00087) [2022-07-10 20:01:33,026][25689] Fps is (10 sec: 5619.5, 60 sec: 5523.4, 300 sec: 5536.0). Total num frames: 891411456. Throughput: 0: 5773.1. Samples: 891413050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:33,026][25689] Avg episode reward: [(0, '-2.279')] [2022-07-10 20:01:33,529][26022] Updated weights on worker 0-0, policy_version 870522 (0.00094) [2022-07-10 20:01:35,455][26022] Updated weights on worker 0-0, policy_version 870532 (0.00090) [2022-07-10 20:01:37,391][26022] Updated weights on worker 0-0, policy_version 870542 (0.00091) [2022-07-10 20:01:38,096][25689] Fps is (10 sec: 5558.7, 60 sec: 5503.2, 300 sec: 5528.0). Total num frames: 891438080. Throughput: 0: 5752.6. Samples: 891446522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:38,096][25689] Avg episode reward: [(0, '-0.813')] [2022-07-10 20:01:38,929][26022] Updated weights on worker 0-0, policy_version 870552 (0.00087) [2022-07-10 20:01:41,007][26022] Updated weights on worker 0-0, policy_version 870562 (0.00055) [2022-07-10 20:01:42,620][26022] Updated weights on worker 0-0, policy_version 870572 (0.00086) [2022-07-10 20:01:43,102][25689] Fps is (10 sec: 5487.9, 60 sec: 5541.6, 300 sec: 5528.9). Total num frames: 891466752. Throughput: 0: 4961.6. Samples: 891463498. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:43,102][25689] Avg episode reward: [(0, '-1.100')] [2022-07-10 20:01:44,680][26022] Updated weights on worker 0-0, policy_version 870582 (0.00089) [2022-07-10 20:01:46,542][26022] Updated weights on worker 0-0, policy_version 870592 (0.00089) [2022-07-10 20:01:48,153][25689] Fps is (10 sec: 5803.6, 60 sec: 5542.1, 300 sec: 5538.8). Total num frames: 891496448. Throughput: 0: 5829.0. Samples: 891497242. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:48,153][25689] Avg episode reward: [(0, '-1.033')] [2022-07-10 20:01:48,158][26022] Updated weights on worker 0-0, policy_version 870602 (0.00089) [2022-07-10 20:01:50,167][26022] Updated weights on worker 0-0, policy_version 870612 (0.00088) [2022-07-10 20:01:52,036][26022] Updated weights on worker 0-0, policy_version 870622 (0.00091) [2022-07-10 20:01:53,244][25689] Fps is (10 sec: 5552.8, 60 sec: 5522.3, 300 sec: 5531.0). Total num frames: 891523072. Throughput: 0: 5823.6. Samples: 891530796. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:53,245][25689] Avg episode reward: [(0, '-1.130')] [2022-07-10 20:01:53,746][26022] Updated weights on worker 0-0, policy_version 870632 (0.00085) [2022-07-10 20:01:55,671][26022] Updated weights on worker 0-0, policy_version 870642 (0.00088) [2022-07-10 20:01:57,295][26022] Updated weights on worker 0-0, policy_version 870652 (0.00090) [2022-07-10 20:01:58,258][25689] Fps is (10 sec: 5573.2, 60 sec: 5540.5, 300 sec: 5537.7). Total num frames: 891552768. Throughput: 0: 5014.4. Samples: 891547626. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:01:58,259][25689] Avg episode reward: [(0, '-1.299')] [2022-07-10 20:01:59,161][26022] Updated weights on worker 0-0, policy_version 870662 (0.00089) [2022-07-10 20:02:00,821][26022] Updated weights on worker 0-0, policy_version 870672 (0.00088) [2022-07-10 20:02:03,188][26022] Updated weights on worker 0-0, policy_version 870682 (0.00084) [2022-07-10 20:02:03,327][25689] Fps is (10 sec: 5586.0, 60 sec: 5554.8, 300 sec: 5541.6). Total num frames: 891579392. Throughput: 0: 5846.7. Samples: 891581748. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:03,329][25689] Avg episode reward: [(0, '-1.149')] [2022-07-10 20:02:05,063][26022] Updated weights on worker 0-0, policy_version 870692 (0.00085) [2022-07-10 20:02:06,543][26022] Updated weights on worker 0-0, policy_version 870702 (0.00387) [2022-07-10 20:02:08,382][25689] Fps is (10 sec: 5361.0, 60 sec: 5539.2, 300 sec: 5537.6). Total num frames: 891607040. Throughput: 0: 5752.9. Samples: 891613618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:08,382][25689] Avg episode reward: [(0, '-0.797')] [2022-07-10 20:02:08,665][26022] Updated weights on worker 0-0, policy_version 870712 (0.00089) [2022-07-10 20:02:10,316][26022] Updated weights on worker 0-0, policy_version 870722 (0.00081) [2022-07-10 20:02:12,367][26022] Updated weights on worker 0-0, policy_version 870732 (0.00086) [2022-07-10 20:02:13,414][25689] Fps is (10 sec: 5583.5, 60 sec: 5571.9, 300 sec: 5541.9). Total num frames: 891635712. Throughput: 0: 4945.3. Samples: 891630540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:13,414][25689] Avg episode reward: [(0, '-0.415')] [2022-07-10 20:02:14,045][26022] Updated weights on worker 0-0, policy_version 870742 (0.00089) [2022-07-10 20:02:15,854][26022] Updated weights on worker 0-0, policy_version 870752 (0.00093) [2022-07-10 20:02:17,686][26022] Updated weights on worker 0-0, policy_version 870762 (0.00091) [2022-07-10 20:02:18,422][25689] Fps is (10 sec: 5609.5, 60 sec: 5554.4, 300 sec: 5535.8). Total num frames: 891663360. Throughput: 0: 5787.0. Samples: 891664312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:18,422][25689] Avg episode reward: [(0, '-0.558')] [2022-07-10 20:02:19,715][26022] Updated weights on worker 0-0, policy_version 870772 (0.00085) [2022-07-10 20:02:21,400][26022] Updated weights on worker 0-0, policy_version 870782 (0.00089) [2022-07-10 20:02:23,200][26022] Updated weights on worker 0-0, policy_version 870792 (0.00086) [2022-07-10 20:02:23,427][25689] Fps is (10 sec: 5624.4, 60 sec: 5575.7, 300 sec: 5544.1). Total num frames: 891692032. Throughput: 0: 5764.3. Samples: 891697614. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:23,428][25689] Avg episode reward: [(0, '-0.310')] [2022-07-10 20:02:25,316][26022] Updated weights on worker 0-0, policy_version 870802 (0.00084) [2022-07-10 20:02:26,924][26022] Updated weights on worker 0-0, policy_version 870812 (0.00085) [2022-07-10 20:02:28,510][25689] Fps is (10 sec: 5582.8, 60 sec: 5581.2, 300 sec: 5539.6). Total num frames: 891719680. Throughput: 0: 5007.1. Samples: 891714402. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:28,510][25689] Avg episode reward: [(0, '-0.947')] [2022-07-10 20:02:28,785][26022] Updated weights on worker 0-0, policy_version 870822 (0.00091) [2022-07-10 20:02:30,584][26022] Updated weights on worker 0-0, policy_version 870832 (0.00094) [2022-07-10 20:02:32,488][26022] Updated weights on worker 0-0, policy_version 870842 (0.00083) [2022-07-10 20:02:33,520][25689] Fps is (10 sec: 5580.3, 60 sec: 5569.1, 300 sec: 5539.9). Total num frames: 891748352. Throughput: 0: 5831.1. Samples: 891747780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:33,520][25689] Avg episode reward: [(0, '-1.079')] [2022-07-10 20:02:34,320][26022] Updated weights on worker 0-0, policy_version 870852 (0.00094) [2022-07-10 20:02:36,095][26022] Updated weights on worker 0-0, policy_version 870862 (0.00089) [2022-07-10 20:02:38,103][26022] Updated weights on worker 0-0, policy_version 870872 (0.00090) [2022-07-10 20:02:38,532][25689] Fps is (10 sec: 5517.4, 60 sec: 5574.4, 300 sec: 5543.6). Total num frames: 891774976. Throughput: 0: 5806.7. Samples: 891781086. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:38,533][25689] Avg episode reward: [(0, '-0.509')] [2022-07-10 20:02:39,790][26022] Updated weights on worker 0-0, policy_version 870882 (0.00097) [2022-07-10 20:02:41,837][26022] Updated weights on worker 0-0, policy_version 870892 (0.00093) [2022-07-10 20:02:43,560][25689] Fps is (10 sec: 5405.4, 60 sec: 5555.4, 300 sec: 5534.9). Total num frames: 891802624. Throughput: 0: 4974.4. Samples: 891797762. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:43,562][25689] Avg episode reward: [(0, '-0.716')] [2022-07-10 20:02:43,593][26022] Updated weights on worker 0-0, policy_version 870902 (0.00356) [2022-07-10 20:02:45,355][26022] Updated weights on worker 0-0, policy_version 870912 (0.00083) [2022-07-10 20:02:47,394][26022] Updated weights on worker 0-0, policy_version 870922 (0.00092) [2022-07-10 20:02:48,618][25689] Fps is (10 sec: 5583.8, 60 sec: 5537.8, 300 sec: 5541.0). Total num frames: 891831296. Throughput: 0: 5796.4. Samples: 891830958. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:48,619][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 20:02:49,265][26022] Updated weights on worker 0-0, policy_version 870932 (0.00094) [2022-07-10 20:02:50,933][26022] Updated weights on worker 0-0, policy_version 870942 (0.00081) [2022-07-10 20:02:52,927][26022] Updated weights on worker 0-0, policy_version 870952 (0.00086) [2022-07-10 20:02:53,693][25689] Fps is (10 sec: 5558.3, 60 sec: 5556.3, 300 sec: 5536.5). Total num frames: 891858944. Throughput: 0: 5775.9. Samples: 891864296. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:53,693][25689] Avg episode reward: [(0, '-0.666')] [2022-07-10 20:02:54,599][26022] Updated weights on worker 0-0, policy_version 870962 (0.00082) [2022-07-10 20:02:56,569][26022] Updated weights on worker 0-0, policy_version 870972 (0.00086) [2022-07-10 20:02:58,306][26022] Updated weights on worker 0-0, policy_version 870982 (0.00091) [2022-07-10 20:02:58,720][25689] Fps is (10 sec: 5473.7, 60 sec: 5521.2, 300 sec: 5532.9). Total num frames: 891886592. Throughput: 0: 4950.0. Samples: 891881018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-10 20:02:58,721][25689] Avg episode reward: [(0, '0.485')] [2022-07-10 20:03:00,250][26022] Updated weights on worker 0-0, policy_version 870992 (0.00086) [2022-07-10 20:03:01,018][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:03:01,040][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000870996_891899904.pth [2022-07-10 20:03:01,040][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000869048_889905152.pth [2022-07-10 20:03:02,395][26022] Updated weights on worker 0-0, policy_version 871002 (0.00104) [2022-07-10 20:03:03,768][25689] Fps is (10 sec: 5386.7, 60 sec: 5523.1, 300 sec: 5532.8). Total num frames: 891913216. Throughput: 0: 5751.0. Samples: 891913976. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:03,768][25689] Avg episode reward: [(0, '-0.187')] [2022-07-10 20:03:04,365][26022] Updated weights on worker 0-0, policy_version 871012 (0.00087) [2022-07-10 20:03:06,010][26022] Updated weights on worker 0-0, policy_version 871022 (0.00512) [2022-07-10 20:03:07,894][26022] Updated weights on worker 0-0, policy_version 871032 (0.00092) [2022-07-10 20:03:08,855][25689] Fps is (10 sec: 5456.0, 60 sec: 5537.1, 300 sec: 5535.1). Total num frames: 891941888. Throughput: 0: 5680.6. Samples: 891945916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:08,856][25689] Avg episode reward: [(0, '-0.027')] [2022-07-10 20:03:09,574][26022] Updated weights on worker 0-0, policy_version 871042 (0.00088) [2022-07-10 20:03:11,546][26022] Updated weights on worker 0-0, policy_version 871052 (0.00094) [2022-07-10 20:03:13,419][26022] Updated weights on worker 0-0, policy_version 871062 (0.00077) [2022-07-10 20:03:13,874][25689] Fps is (10 sec: 5674.1, 60 sec: 5538.3, 300 sec: 5534.8). Total num frames: 891970560. Throughput: 0: 5696.8. Samples: 891979266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:13,875][25689] Avg episode reward: [(0, '0.658')] [2022-07-10 20:03:15,328][26022] Updated weights on worker 0-0, policy_version 871072 (0.00089) [2022-07-10 20:03:17,049][26022] Updated weights on worker 0-0, policy_version 871082 (0.00092) [2022-07-10 20:03:18,883][25689] Fps is (10 sec: 5514.3, 60 sec: 5521.3, 300 sec: 5534.8). Total num frames: 891997184. Throughput: 0: 5708.9. Samples: 891996124. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:18,884][25689] Avg episode reward: [(0, '0.144')] [2022-07-10 20:03:18,940][26022] Updated weights on worker 0-0, policy_version 871092 (0.00088) [2022-07-10 20:03:20,624][26022] Updated weights on worker 0-0, policy_version 871102 (0.00093) [2022-07-10 20:03:22,682][26022] Updated weights on worker 0-0, policy_version 871112 (0.00091) [2022-07-10 20:03:23,907][25689] Fps is (10 sec: 5613.5, 60 sec: 5536.5, 300 sec: 5536.6). Total num frames: 892026880. Throughput: 0: 5736.9. Samples: 892029512. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:23,908][25689] Avg episode reward: [(0, '0.112')] [2022-07-10 20:03:24,375][26022] Updated weights on worker 0-0, policy_version 871122 (0.00090) [2022-07-10 20:03:26,231][26022] Updated weights on worker 0-0, policy_version 871132 (0.00086) [2022-07-10 20:03:28,121][26022] Updated weights on worker 0-0, policy_version 871142 (0.00086) [2022-07-10 20:03:28,967][25689] Fps is (10 sec: 5585.3, 60 sec: 5521.7, 300 sec: 5532.9). Total num frames: 892053504. Throughput: 0: 5817.9. Samples: 892062922. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:28,967][25689] Avg episode reward: [(0, '-1.207')] [2022-07-10 20:03:29,966][26022] Updated weights on worker 0-0, policy_version 871152 (0.00086) [2022-07-10 20:03:31,682][26022] Updated weights on worker 0-0, policy_version 871162 (0.00092) [2022-07-10 20:03:33,591][26022] Updated weights on worker 0-0, policy_version 871172 (0.00086) [2022-07-10 20:03:33,983][25689] Fps is (10 sec: 5488.1, 60 sec: 5521.1, 300 sec: 5536.5). Total num frames: 892082176. Throughput: 0: 4985.5. Samples: 892079518. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:33,983][25689] Avg episode reward: [(0, '-0.510')] [2022-07-10 20:03:35,450][26022] Updated weights on worker 0-0, policy_version 871182 (0.00091) [2022-07-10 20:03:37,310][26022] Updated weights on worker 0-0, policy_version 871192 (0.00090) [2022-07-10 20:03:38,989][25689] Fps is (10 sec: 5619.4, 60 sec: 5538.6, 300 sec: 5536.9). Total num frames: 892109824. Throughput: 0: 5820.6. Samples: 892113152. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:38,990][25689] Avg episode reward: [(0, '-1.782')] [2022-07-10 20:03:39,028][26022] Updated weights on worker 0-0, policy_version 871202 (0.00094) [2022-07-10 20:03:40,967][26022] Updated weights on worker 0-0, policy_version 871212 (0.00448) [2022-07-10 20:03:42,893][26022] Updated weights on worker 0-0, policy_version 871222 (0.00091) [2022-07-10 20:03:43,992][25689] Fps is (10 sec: 5524.4, 60 sec: 5540.9, 300 sec: 5531.0). Total num frames: 892137472. Throughput: 0: 5824.3. Samples: 892146492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:43,992][25689] Avg episode reward: [(0, '-1.975')] [2022-07-10 20:03:44,687][26022] Updated weights on worker 0-0, policy_version 871232 (0.00093) [2022-07-10 20:03:46,515][26022] Updated weights on worker 0-0, policy_version 871242 (0.00087) [2022-07-10 20:03:48,342][26022] Updated weights on worker 0-0, policy_version 871252 (0.00091) [2022-07-10 20:03:49,096][25689] Fps is (10 sec: 5471.3, 60 sec: 5519.8, 300 sec: 5529.4). Total num frames: 892165120. Throughput: 0: 4980.2. Samples: 892163168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:49,096][25689] Avg episode reward: [(0, '-1.733')] [2022-07-10 20:03:50,243][26022] Updated weights on worker 0-0, policy_version 871262 (0.00083) [2022-07-10 20:03:52,012][26022] Updated weights on worker 0-0, policy_version 871272 (0.00085) [2022-07-10 20:03:53,931][26022] Updated weights on worker 0-0, policy_version 871282 (0.00088) [2022-07-10 20:03:54,194][25689] Fps is (10 sec: 5520.5, 60 sec: 5534.6, 300 sec: 5531.7). Total num frames: 892193792. Throughput: 0: 5776.6. Samples: 892196270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:54,194][25689] Avg episode reward: [(0, '-0.621')] [2022-07-10 20:03:55,646][26022] Updated weights on worker 0-0, policy_version 871292 (0.00085) [2022-07-10 20:03:57,711][26022] Updated weights on worker 0-0, policy_version 871302 (0.00435) [2022-07-10 20:03:59,212][25689] Fps is (10 sec: 5668.7, 60 sec: 5552.4, 300 sec: 5532.3). Total num frames: 892222464. Throughput: 0: 5770.0. Samples: 892229836. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:03:59,212][25689] Avg episode reward: [(0, '-0.835')] [2022-07-10 20:03:59,316][26022] Updated weights on worker 0-0, policy_version 871312 (0.00086) [2022-07-10 20:04:01,416][26022] Updated weights on worker 0-0, policy_version 871322 (0.00088) [2022-07-10 20:04:03,570][26022] Updated weights on worker 0-0, policy_version 871332 (0.00175) [2022-07-10 20:04:04,226][25689] Fps is (10 sec: 5307.6, 60 sec: 5521.5, 300 sec: 5526.8). Total num frames: 892247040. Throughput: 0: 4878.3. Samples: 892245202. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:04,227][25689] Avg episode reward: [(0, '-1.086')] [2022-07-10 20:04:05,452][26022] Updated weights on worker 0-0, policy_version 871342 (0.00091) [2022-07-10 20:04:07,138][26022] Updated weights on worker 0-0, policy_version 871352 (0.00092) [2022-07-10 20:04:09,096][26022] Updated weights on worker 0-0, policy_version 871362 (0.00088) [2022-07-10 20:04:09,290][25689] Fps is (10 sec: 5181.6, 60 sec: 5506.7, 300 sec: 5522.6). Total num frames: 892274688. Throughput: 0: 5681.4. Samples: 892277902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:09,291][25689] Avg episode reward: [(0, '-0.977')] [2022-07-10 20:04:10,881][26022] Updated weights on worker 0-0, policy_version 871372 (0.00091) [2022-07-10 20:04:12,930][26022] Updated weights on worker 0-0, policy_version 871382 (0.00094) [2022-07-10 20:04:14,324][25689] Fps is (10 sec: 5679.1, 60 sec: 5522.3, 300 sec: 5532.5). Total num frames: 892304384. Throughput: 0: 5712.0. Samples: 892311252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:14,324][25689] Avg episode reward: [(0, '-0.777')] [2022-07-10 20:04:14,453][26022] Updated weights on worker 0-0, policy_version 871392 (0.00092) [2022-07-10 20:04:16,462][26022] Updated weights on worker 0-0, policy_version 871402 (0.00094) [2022-07-10 20:04:18,226][26022] Updated weights on worker 0-0, policy_version 871412 (0.00083) [2022-07-10 20:04:19,353][25689] Fps is (10 sec: 5596.9, 60 sec: 5520.5, 300 sec: 5528.8). Total num frames: 892331008. Throughput: 0: 4880.7. Samples: 892328140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:19,354][25689] Avg episode reward: [(0, '-0.768')] [2022-07-10 20:04:20,250][26022] Updated weights on worker 0-0, policy_version 871422 (0.00049) [2022-07-10 20:04:21,911][26022] Updated weights on worker 0-0, policy_version 871432 (0.00090) [2022-07-10 20:04:23,898][26022] Updated weights on worker 0-0, policy_version 871442 (0.00092) [2022-07-10 20:04:24,427][25689] Fps is (10 sec: 5473.3, 60 sec: 5499.0, 300 sec: 5529.5). Total num frames: 892359680. Throughput: 0: 5765.3. Samples: 892361662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:24,428][25689] Avg episode reward: [(0, '-1.178')] [2022-07-10 20:04:25,684][26022] Updated weights on worker 0-0, policy_version 871452 (0.00083) [2022-07-10 20:04:27,628][26022] Updated weights on worker 0-0, policy_version 871462 (0.00092) [2022-07-10 20:04:29,256][26022] Updated weights on worker 0-0, policy_version 871472 (0.00085) [2022-07-10 20:04:29,573][25689] Fps is (10 sec: 5610.9, 60 sec: 5525.0, 300 sec: 5528.5). Total num frames: 892388352. Throughput: 0: 5771.0. Samples: 892394954. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:29,574][25689] Avg episode reward: [(0, '-0.811')] [2022-07-10 20:04:31,175][26022] Updated weights on worker 0-0, policy_version 871482 (0.00089) [2022-07-10 20:04:33,020][26022] Updated weights on worker 0-0, policy_version 871492 (0.00090) [2022-07-10 20:04:34,611][25689] Fps is (10 sec: 5530.1, 60 sec: 5506.1, 300 sec: 5528.8). Total num frames: 892416000. Throughput: 0: 4956.3. Samples: 892411802. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:34,611][25689] Avg episode reward: [(0, '-0.679')] [2022-07-10 20:04:34,954][26022] Updated weights on worker 0-0, policy_version 871502 (0.00081) [2022-07-10 20:04:36,754][26022] Updated weights on worker 0-0, policy_version 871512 (0.00092) [2022-07-10 20:04:38,479][26022] Updated weights on worker 0-0, policy_version 871522 (0.00123) [2022-07-10 20:04:39,617][25689] Fps is (10 sec: 5505.6, 60 sec: 5506.1, 300 sec: 5532.4). Total num frames: 892443648. Throughput: 0: 5774.8. Samples: 892445158. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:39,617][25689] Avg episode reward: [(0, '0.175')] [2022-07-10 20:04:40,472][26022] Updated weights on worker 0-0, policy_version 871532 (0.00090) [2022-07-10 20:04:42,095][26022] Updated weights on worker 0-0, policy_version 871542 (0.00098) [2022-07-10 20:04:44,088][26022] Updated weights on worker 0-0, policy_version 871552 (0.00082) [2022-07-10 20:04:44,632][25689] Fps is (10 sec: 5620.3, 60 sec: 5521.9, 300 sec: 5534.2). Total num frames: 892472320. Throughput: 0: 5786.9. Samples: 892478586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:44,632][25689] Avg episode reward: [(0, '0.249')] [2022-07-10 20:04:46,010][26022] Updated weights on worker 0-0, policy_version 871562 (0.00097) [2022-07-10 20:04:47,659][26022] Updated weights on worker 0-0, policy_version 871572 (0.00093) [2022-07-10 20:04:49,751][25689] Fps is (10 sec: 5456.4, 60 sec: 5503.6, 300 sec: 5525.2). Total num frames: 892498944. Throughput: 0: 4967.6. Samples: 892495190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:49,751][25689] Avg episode reward: [(0, '0.462')] [2022-07-10 20:04:49,792][26022] Updated weights on worker 0-0, policy_version 871582 (0.00093) [2022-07-10 20:04:51,320][26022] Updated weights on worker 0-0, policy_version 871592 (0.00091) [2022-07-10 20:04:53,273][26022] Updated weights on worker 0-0, policy_version 871602 (0.00085) [2022-07-10 20:04:54,753][25689] Fps is (10 sec: 5463.3, 60 sec: 5512.4, 300 sec: 5529.0). Total num frames: 892527616. Throughput: 0: 5788.2. Samples: 892528388. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:54,756][25689] Avg episode reward: [(0, '0.853')] [2022-07-10 20:04:55,324][26022] Updated weights on worker 0-0, policy_version 871612 (0.00089) [2022-07-10 20:04:56,938][26022] Updated weights on worker 0-0, policy_version 871622 (0.00089) [2022-07-10 20:04:58,914][26022] Updated weights on worker 0-0, policy_version 871632 (0.00096) [2022-07-10 20:04:59,796][25689] Fps is (10 sec: 5607.0, 60 sec: 5493.2, 300 sec: 5535.3). Total num frames: 892555264. Throughput: 0: 5788.7. Samples: 892561966. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:04:59,796][25689] Avg episode reward: [(0, '0.078')] [2022-07-10 20:05:00,606][26022] Updated weights on worker 0-0, policy_version 871642 (0.00204) [2022-07-10 20:05:01,134][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:05:01,146][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000871645_892564480.pth [2022-07-10 20:05:01,147][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000869698_890570752.pth [2022-07-10 20:05:02,770][26022] Updated weights on worker 0-0, policy_version 871652 (0.00082) [2022-07-10 20:05:04,787][26022] Updated weights on worker 0-0, policy_version 871662 (0.00090) [2022-07-10 20:05:04,851][25689] Fps is (10 sec: 5374.7, 60 sec: 5523.3, 300 sec: 5532.0). Total num frames: 892581888. Throughput: 0: 4840.8. Samples: 892576462. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:04,851][25689] Avg episode reward: [(0, '-0.501')] [2022-07-10 20:05:06,644][26022] Updated weights on worker 0-0, policy_version 871672 (0.00086) [2022-07-10 20:05:08,358][26022] Updated weights on worker 0-0, policy_version 871682 (0.00094) [2022-07-10 20:05:09,916][25689] Fps is (10 sec: 5261.6, 60 sec: 5506.4, 300 sec: 5524.4). Total num frames: 892608512. Throughput: 0: 5694.7. Samples: 892610022. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:09,916][25689] Avg episode reward: [(0, '-0.493')] [2022-07-10 20:05:10,315][26022] Updated weights on worker 0-0, policy_version 871692 (0.00087) [2022-07-10 20:05:12,104][26022] Updated weights on worker 0-0, policy_version 871702 (0.00094) [2022-07-10 20:05:13,989][26022] Updated weights on worker 0-0, policy_version 871712 (0.00083) [2022-07-10 20:05:14,963][25689] Fps is (10 sec: 5670.6, 60 sec: 5521.9, 300 sec: 5534.4). Total num frames: 892639232. Throughput: 0: 5697.3. Samples: 892643532. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:14,964][25689] Avg episode reward: [(0, '-0.526')] [2022-07-10 20:05:15,790][26022] Updated weights on worker 0-0, policy_version 871722 (0.00094) [2022-07-10 20:05:17,675][26022] Updated weights on worker 0-0, policy_version 871732 (0.00086) [2022-07-10 20:05:19,536][26022] Updated weights on worker 0-0, policy_version 871742 (0.00073) [2022-07-10 20:05:19,974][25689] Fps is (10 sec: 5802.8, 60 sec: 5540.5, 300 sec: 5530.9). Total num frames: 892666880. Throughput: 0: 4873.1. Samples: 892660300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:19,975][25689] Avg episode reward: [(0, '-0.637')] [2022-07-10 20:05:21,331][26022] Updated weights on worker 0-0, policy_version 871752 (0.00085) [2022-07-10 20:05:22,960][26022] Updated weights on worker 0-0, policy_version 871762 (0.00091) [2022-07-10 20:05:24,992][25689] Fps is (10 sec: 5309.5, 60 sec: 5494.9, 300 sec: 5532.4). Total num frames: 892692480. Throughput: 0: 5799.6. Samples: 892693274. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:24,993][25689] Avg episode reward: [(0, '-1.751')] [2022-07-10 20:05:25,341][26022] Updated weights on worker 0-0, policy_version 871772 (0.00089) [2022-07-10 20:05:26,571][26022] Updated weights on worker 0-0, policy_version 871782 (0.00086) [2022-07-10 20:05:28,821][26022] Updated weights on worker 0-0, policy_version 871792 (0.00086) [2022-07-10 20:05:30,032][25689] Fps is (10 sec: 5599.7, 60 sec: 5538.5, 300 sec: 5532.4). Total num frames: 892723200. Throughput: 0: 5804.3. Samples: 892726782. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:30,032][25689] Avg episode reward: [(0, '-1.682')] [2022-07-10 20:05:30,280][26022] Updated weights on worker 0-0, policy_version 871802 (0.00088) [2022-07-10 20:05:32,508][26022] Updated weights on worker 0-0, policy_version 871812 (0.00086) [2022-07-10 20:05:34,214][26022] Updated weights on worker 0-0, policy_version 871822 (0.00093) [2022-07-10 20:05:35,039][25689] Fps is (10 sec: 5605.5, 60 sec: 5507.3, 300 sec: 5525.9). Total num frames: 892748800. Throughput: 0: 4979.3. Samples: 892743496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:35,040][25689] Avg episode reward: [(0, '-2.003')] [2022-07-10 20:05:36,042][26022] Updated weights on worker 0-0, policy_version 871832 (0.00082) [2022-07-10 20:05:38,083][26022] Updated weights on worker 0-0, policy_version 871842 (0.00087) [2022-07-10 20:05:39,620][26022] Updated weights on worker 0-0, policy_version 871852 (0.00080) [2022-07-10 20:05:40,116][25689] Fps is (10 sec: 5483.5, 60 sec: 5534.8, 300 sec: 5535.9). Total num frames: 892778496. Throughput: 0: 5806.3. Samples: 892777248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:40,117][25689] Avg episode reward: [(0, '-2.016')] [2022-07-10 20:05:41,471][26022] Updated weights on worker 0-0, policy_version 871862 (0.00087) [2022-07-10 20:05:43,405][26022] Updated weights on worker 0-0, policy_version 871872 (0.00088) [2022-07-10 20:05:44,970][26022] Updated weights on worker 0-0, policy_version 871882 (0.00092) [2022-07-10 20:05:45,124][25689] Fps is (10 sec: 5787.8, 60 sec: 5535.4, 300 sec: 5533.4). Total num frames: 892807168. Throughput: 0: 5847.4. Samples: 892810994. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:45,125][25689] Avg episode reward: [(0, '-2.077')] [2022-07-10 20:05:47,062][26022] Updated weights on worker 0-0, policy_version 871892 (0.00088) [2022-07-10 20:05:48,849][26022] Updated weights on worker 0-0, policy_version 871902 (0.00089) [2022-07-10 20:05:50,208][25689] Fps is (10 sec: 5479.4, 60 sec: 5538.7, 300 sec: 5529.5). Total num frames: 892833792. Throughput: 0: 5834.9. Samples: 892844504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:50,208][25689] Avg episode reward: [(0, '-1.749')] [2022-07-10 20:05:50,705][26022] Updated weights on worker 0-0, policy_version 871912 (0.00088) [2022-07-10 20:05:52,799][26022] Updated weights on worker 0-0, policy_version 871922 (0.00086) [2022-07-10 20:05:54,317][26022] Updated weights on worker 0-0, policy_version 871932 (0.00092) [2022-07-10 20:05:55,246][25689] Fps is (10 sec: 5463.2, 60 sec: 5535.4, 300 sec: 5529.3). Total num frames: 892862464. Throughput: 0: 5829.3. Samples: 892861284. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:05:55,246][25689] Avg episode reward: [(0, '-0.479')] [2022-07-10 20:05:56,435][26022] Updated weights on worker 0-0, policy_version 871942 (0.00086) [2022-07-10 20:05:58,016][26022] Updated weights on worker 0-0, policy_version 871952 (0.00099) [2022-07-10 20:05:59,887][26022] Updated weights on worker 0-0, policy_version 871962 (0.00088) [2022-07-10 20:06:00,250][25689] Fps is (10 sec: 5812.0, 60 sec: 5572.7, 300 sec: 5543.7). Total num frames: 892892160. Throughput: 0: 5834.6. Samples: 892894724. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:00,251][25689] Avg episode reward: [(0, '-0.348')] [2022-07-10 20:06:02,145][26022] Updated weights on worker 0-0, policy_version 871972 (0.00110) [2022-07-10 20:06:03,922][26022] Updated weights on worker 0-0, policy_version 871982 (0.00058) [2022-07-10 20:06:05,330][25689] Fps is (10 sec: 5381.6, 60 sec: 5536.5, 300 sec: 5529.8). Total num frames: 892916736. Throughput: 0: 5708.8. Samples: 892926348. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:05,331][25689] Avg episode reward: [(0, '0.798')] [2022-07-10 20:06:05,854][26022] Updated weights on worker 0-0, policy_version 871992 (0.00091) [2022-07-10 20:06:07,591][26022] Updated weights on worker 0-0, policy_version 872002 (0.00084) [2022-07-10 20:06:09,455][26022] Updated weights on worker 0-0, policy_version 872012 (0.00089) [2022-07-10 20:06:10,385][25689] Fps is (10 sec: 5254.0, 60 sec: 5571.4, 300 sec: 5536.0). Total num frames: 892945408. Throughput: 0: 4889.2. Samples: 892943156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:10,385][25689] Avg episode reward: [(0, '0.366')] [2022-07-10 20:06:11,211][26022] Updated weights on worker 0-0, policy_version 872022 (0.00089) [2022-07-10 20:06:13,043][26022] Updated weights on worker 0-0, policy_version 872032 (0.00094) [2022-07-10 20:06:15,132][26022] Updated weights on worker 0-0, policy_version 872042 (0.00084) [2022-07-10 20:06:15,391][25689] Fps is (10 sec: 5598.3, 60 sec: 5524.4, 300 sec: 5532.5). Total num frames: 892973056. Throughput: 0: 5714.7. Samples: 892976406. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:15,391][25689] Avg episode reward: [(0, '0.471')] [2022-07-10 20:06:16,734][26022] Updated weights on worker 0-0, policy_version 872052 (0.00083) [2022-07-10 20:06:18,688][26022] Updated weights on worker 0-0, policy_version 872062 (0.00091) [2022-07-10 20:06:20,397][25689] Fps is (10 sec: 5522.7, 60 sec: 5524.8, 300 sec: 5533.3). Total num frames: 893000704. Throughput: 0: 5710.9. Samples: 893009782. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:20,398][25689] Avg episode reward: [(0, '0.863')] [2022-07-10 20:06:20,611][26022] Updated weights on worker 0-0, policy_version 872072 (0.00090) [2022-07-10 20:06:22,390][26022] Updated weights on worker 0-0, policy_version 872082 (0.00095) [2022-07-10 20:06:24,378][26022] Updated weights on worker 0-0, policy_version 872092 (0.00087) [2022-07-10 20:06:25,403][25689] Fps is (10 sec: 5420.5, 60 sec: 5542.8, 300 sec: 5532.5). Total num frames: 893027328. Throughput: 0: 4981.2. Samples: 893026334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:25,404][25689] Avg episode reward: [(0, '0.309')] [2022-07-10 20:06:25,946][26022] Updated weights on worker 0-0, policy_version 872102 (0.00088) [2022-07-10 20:06:28,016][26022] Updated weights on worker 0-0, policy_version 872112 (0.00092) [2022-07-10 20:06:29,705][26022] Updated weights on worker 0-0, policy_version 872122 (0.00086) [2022-07-10 20:06:30,531][25689] Fps is (10 sec: 5456.5, 60 sec: 5500.9, 300 sec: 5527.8). Total num frames: 893056000. Throughput: 0: 5766.4. Samples: 893059328. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:30,532][25689] Avg episode reward: [(0, '0.074')] [2022-07-10 20:06:31,685][26022] Updated weights on worker 0-0, policy_version 872132 (0.00550) [2022-07-10 20:06:33,327][26022] Updated weights on worker 0-0, policy_version 872142 (0.00053) [2022-07-10 20:06:35,288][26022] Updated weights on worker 0-0, policy_version 872152 (0.00089) [2022-07-10 20:06:35,558][25689] Fps is (10 sec: 5647.0, 60 sec: 5549.9, 300 sec: 5535.5). Total num frames: 893084672. Throughput: 0: 5782.6. Samples: 893093024. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:35,558][25689] Avg episode reward: [(0, '-0.062')] [2022-07-10 20:06:37,000][26022] Updated weights on worker 0-0, policy_version 872162 (0.00091) [2022-07-10 20:06:39,044][26022] Updated weights on worker 0-0, policy_version 872172 (0.00092) [2022-07-10 20:06:40,588][25689] Fps is (10 sec: 5702.3, 60 sec: 5537.3, 300 sec: 5535.0). Total num frames: 893113344. Throughput: 0: 4955.4. Samples: 893109832. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:06:40,589][25689] Avg episode reward: [(0, '-0.317')] [2022-07-10 20:06:40,693][26022] Updated weights on worker 0-0, policy_version 872182 (0.00091) [2022-07-10 20:06:42,662][26022] Updated weights on worker 0-0, policy_version 872192 (0.00087) [2022-07-10 20:06:44,508][26022] Updated weights on worker 0-0, policy_version 872202 (0.00054) [2022-07-10 20:06:45,634][25689] Fps is (10 sec: 5589.3, 60 sec: 5516.8, 300 sec: 5528.2). Total num frames: 893140992. Throughput: 0: 5773.2. Samples: 893143134. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:06:45,643][25689] Avg episode reward: [(0, '-0.198')] [2022-07-10 20:06:46,415][26022] Updated weights on worker 0-0, policy_version 872212 (0.00101) [2022-07-10 20:06:48,166][26022] Updated weights on worker 0-0, policy_version 872222 (0.00087) [2022-07-10 20:06:50,045][26022] Updated weights on worker 0-0, policy_version 872232 (0.00097) [2022-07-10 20:06:50,726][25689] Fps is (10 sec: 5454.2, 60 sec: 5533.0, 300 sec: 5531.7). Total num frames: 893168640. Throughput: 0: 5806.2. Samples: 893176582. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:06:50,726][25689] Avg episode reward: [(0, '-0.836')] [2022-07-10 20:06:51,753][26022] Updated weights on worker 0-0, policy_version 872242 (0.00086) [2022-07-10 20:06:53,823][26022] Updated weights on worker 0-0, policy_version 872252 (0.00077) [2022-07-10 20:06:55,283][26022] Updated weights on worker 0-0, policy_version 872262 (0.00081) [2022-07-10 20:06:55,746][25689] Fps is (10 sec: 5671.1, 60 sec: 5551.6, 300 sec: 5531.6). Total num frames: 893198336. Throughput: 0: 4974.4. Samples: 893193448. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:06:55,746][25689] Avg episode reward: [(0, '-0.418')] [2022-07-10 20:06:57,353][26022] Updated weights on worker 0-0, policy_version 872272 (0.00096) [2022-07-10 20:06:58,965][26022] Updated weights on worker 0-0, policy_version 872282 (0.00088) [2022-07-10 20:07:00,785][25689] Fps is (10 sec: 5700.8, 60 sec: 5514.6, 300 sec: 5535.5). Total num frames: 893225984. Throughput: 0: 5800.5. Samples: 893226986. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:00,785][25689] Avg episode reward: [(0, '-0.684')] [2022-07-10 20:07:00,883][26022] Updated weights on worker 0-0, policy_version 872292 (0.00086) [2022-07-10 20:07:01,335][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:07:01,348][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000872294_893229056.pth [2022-07-10 20:07:01,348][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000870346_891234304.pth [2022-07-10 20:07:03,320][26022] Updated weights on worker 0-0, policy_version 872302 (0.00089) [2022-07-10 20:07:04,891][26022] Updated weights on worker 0-0, policy_version 872312 (0.00091) [2022-07-10 20:07:05,840][25689] Fps is (10 sec: 5173.8, 60 sec: 5516.9, 300 sec: 5525.2). Total num frames: 893250560. Throughput: 0: 5716.8. Samples: 893258644. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:05,840][25689] Avg episode reward: [(0, '-0.679')] [2022-07-10 20:07:06,708][26022] Updated weights on worker 0-0, policy_version 872322 (0.00081) [2022-07-10 20:07:08,672][26022] Updated weights on worker 0-0, policy_version 872332 (0.00086) [2022-07-10 20:07:10,240][26022] Updated weights on worker 0-0, policy_version 872342 (0.00083) [2022-07-10 20:07:10,955][25689] Fps is (10 sec: 5437.2, 60 sec: 5545.2, 300 sec: 5530.6). Total num frames: 893281280. Throughput: 0: 4898.4. Samples: 893275672. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:10,955][25689] Avg episode reward: [(0, '-0.373')] [2022-07-10 20:07:12,307][26022] Updated weights on worker 0-0, policy_version 872352 (0.00117) [2022-07-10 20:07:13,782][26022] Updated weights on worker 0-0, policy_version 872362 (0.00047) [2022-07-10 20:07:15,987][25689] Fps is (10 sec: 5651.1, 60 sec: 5525.9, 300 sec: 5526.7). Total num frames: 893307904. Throughput: 0: 5716.2. Samples: 893309152. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:15,988][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 20:07:16,021][26022] Updated weights on worker 0-0, policy_version 872372 (0.00091) [2022-07-10 20:07:17,726][26022] Updated weights on worker 0-0, policy_version 872382 (0.00089) [2022-07-10 20:07:19,735][26022] Updated weights on worker 0-0, policy_version 872392 (0.00084) [2022-07-10 20:07:21,029][25689] Fps is (10 sec: 5590.5, 60 sec: 5556.4, 300 sec: 5529.4). Total num frames: 893337600. Throughput: 0: 5728.3. Samples: 893342950. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:21,030][25689] Avg episode reward: [(0, '-0.359')] [2022-07-10 20:07:21,323][26022] Updated weights on worker 0-0, policy_version 872402 (0.00082) [2022-07-10 20:07:23,464][26022] Updated weights on worker 0-0, policy_version 872412 (0.00052) [2022-07-10 20:07:24,758][26022] Updated weights on worker 0-0, policy_version 872422 (0.00089) [2022-07-10 20:07:26,109][25689] Fps is (10 sec: 5564.1, 60 sec: 5549.6, 300 sec: 5526.0). Total num frames: 893364224. Throughput: 0: 4995.0. Samples: 893359890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:26,110][25689] Avg episode reward: [(0, '0.015')] [2022-07-10 20:07:27,012][26022] Updated weights on worker 0-0, policy_version 872432 (0.00093) [2022-07-10 20:07:28,728][26022] Updated weights on worker 0-0, policy_version 872442 (0.00090) [2022-07-10 20:07:30,623][26022] Updated weights on worker 0-0, policy_version 872452 (0.00082) [2022-07-10 20:07:31,171][25689] Fps is (10 sec: 5452.3, 60 sec: 5555.7, 300 sec: 5525.1). Total num frames: 893392896. Throughput: 0: 5802.9. Samples: 893392984. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:31,171][25689] Avg episode reward: [(0, '0.224')] [2022-07-10 20:07:32,404][26022] Updated weights on worker 0-0, policy_version 872462 (0.00090) [2022-07-10 20:07:34,328][26022] Updated weights on worker 0-0, policy_version 872472 (0.00092) [2022-07-10 20:07:35,935][26022] Updated weights on worker 0-0, policy_version 872482 (0.00084) [2022-07-10 20:07:36,175][25689] Fps is (10 sec: 5697.1, 60 sec: 5557.8, 300 sec: 5532.1). Total num frames: 893421568. Throughput: 0: 5822.7. Samples: 893426698. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:36,180][25689] Avg episode reward: [(0, '0.403')] [2022-07-10 20:07:38,167][26022] Updated weights on worker 0-0, policy_version 872492 (0.00089) [2022-07-10 20:07:39,638][26022] Updated weights on worker 0-0, policy_version 872502 (0.00069) [2022-07-10 20:07:41,232][25689] Fps is (10 sec: 5598.0, 60 sec: 5538.4, 300 sec: 5531.6). Total num frames: 893449216. Throughput: 0: 4975.6. Samples: 893443470. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:41,232][25689] Avg episode reward: [(0, '0.335')] [2022-07-10 20:07:41,867][26022] Updated weights on worker 0-0, policy_version 872512 (0.00083) [2022-07-10 20:07:43,499][26022] Updated weights on worker 0-0, policy_version 872522 (0.00087) [2022-07-10 20:07:45,451][26022] Updated weights on worker 0-0, policy_version 872532 (0.00491) [2022-07-10 20:07:46,259][25689] Fps is (10 sec: 5483.4, 60 sec: 5540.2, 300 sec: 5528.7). Total num frames: 893476864. Throughput: 0: 5785.8. Samples: 893476472. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:46,259][25689] Avg episode reward: [(0, '-0.250')] [2022-07-10 20:07:47,227][26022] Updated weights on worker 0-0, policy_version 872542 (0.00088) [2022-07-10 20:07:49,031][26022] Updated weights on worker 0-0, policy_version 872552 (0.00383) [2022-07-10 20:07:50,767][26022] Updated weights on worker 0-0, policy_version 872562 (0.00094) [2022-07-10 20:07:51,323][25689] Fps is (10 sec: 5479.5, 60 sec: 5542.7, 300 sec: 5528.9). Total num frames: 893504512. Throughput: 0: 5820.5. Samples: 893510280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:51,325][25689] Avg episode reward: [(0, '-0.492')] [2022-07-10 20:07:52,888][26022] Updated weights on worker 0-0, policy_version 872572 (0.00088) [2022-07-10 20:07:54,697][26022] Updated weights on worker 0-0, policy_version 872582 (0.00086) [2022-07-10 20:07:56,353][25689] Fps is (10 sec: 5579.3, 60 sec: 5524.9, 300 sec: 5532.3). Total num frames: 893533184. Throughput: 0: 4962.5. Samples: 893526836. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:07:56,354][25689] Avg episode reward: [(0, '-0.388')] [2022-07-10 20:07:56,366][26022] Updated weights on worker 0-0, policy_version 872592 (0.00092) [2022-07-10 20:07:58,235][26022] Updated weights on worker 0-0, policy_version 872602 (0.00079) [2022-07-10 20:08:00,254][26022] Updated weights on worker 0-0, policy_version 872612 (0.00091) [2022-07-10 20:08:01,409][25689] Fps is (10 sec: 5584.1, 60 sec: 5523.4, 300 sec: 5535.6). Total num frames: 893560832. Throughput: 0: 5780.0. Samples: 893560094. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:01,411][25689] Avg episode reward: [(0, '-0.321')] [2022-07-10 20:08:02,286][26022] Updated weights on worker 0-0, policy_version 872622 (0.00090) [2022-07-10 20:08:04,262][26022] Updated weights on worker 0-0, policy_version 872632 (0.00219) [2022-07-10 20:08:05,950][26022] Updated weights on worker 0-0, policy_version 872642 (0.00081) [2022-07-10 20:08:06,431][25689] Fps is (10 sec: 5385.6, 60 sec: 5560.2, 300 sec: 5529.9). Total num frames: 893587456. Throughput: 0: 5712.9. Samples: 893591710. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:06,431][25689] Avg episode reward: [(0, '-1.744')] [2022-07-10 20:08:07,784][26022] Updated weights on worker 0-0, policy_version 872652 (0.00083) [2022-07-10 20:08:09,702][26022] Updated weights on worker 0-0, policy_version 872662 (0.00084) [2022-07-10 20:08:11,228][26022] Updated weights on worker 0-0, policy_version 872672 (0.00086) [2022-07-10 20:08:11,469][25689] Fps is (10 sec: 5496.3, 60 sec: 5533.3, 300 sec: 5529.5). Total num frames: 893616128. Throughput: 0: 4878.3. Samples: 893608560. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:11,470][25689] Avg episode reward: [(0, '-2.094')] [2022-07-10 20:08:13,423][26022] Updated weights on worker 0-0, policy_version 872682 (0.00086) [2022-07-10 20:08:15,040][26022] Updated weights on worker 0-0, policy_version 872692 (0.00084) [2022-07-10 20:08:16,473][25689] Fps is (10 sec: 5506.5, 60 sec: 5536.0, 300 sec: 5529.7). Total num frames: 893642752. Throughput: 0: 5754.1. Samples: 893642604. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:16,473][25689] Avg episode reward: [(0, '-1.321')] [2022-07-10 20:08:16,863][26022] Updated weights on worker 0-0, policy_version 872702 (0.00086) [2022-07-10 20:08:18,893][26022] Updated weights on worker 0-0, policy_version 872712 (0.00092) [2022-07-10 20:08:20,553][26022] Updated weights on worker 0-0, policy_version 872722 (0.00092) [2022-07-10 20:08:21,474][25689] Fps is (10 sec: 5527.1, 60 sec: 5522.8, 300 sec: 5526.6). Total num frames: 893671424. Throughput: 0: 5788.2. Samples: 893676234. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:21,475][25689] Avg episode reward: [(0, '-2.172')] [2022-07-10 20:08:22,427][26022] Updated weights on worker 0-0, policy_version 872732 (0.00084) [2022-07-10 20:08:24,095][26022] Updated weights on worker 0-0, policy_version 872742 (0.00085) [2022-07-10 20:08:26,165][26022] Updated weights on worker 0-0, policy_version 872752 (0.00094) [2022-07-10 20:08:26,479][25689] Fps is (10 sec: 5730.9, 60 sec: 5563.6, 300 sec: 5534.5). Total num frames: 893700096. Throughput: 0: 5060.4. Samples: 893693160. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:26,479][25689] Avg episode reward: [(0, '-2.342')] [2022-07-10 20:08:27,895][26022] Updated weights on worker 0-0, policy_version 872762 (0.00084) [2022-07-10 20:08:29,724][26022] Updated weights on worker 0-0, policy_version 872772 (0.00089) [2022-07-10 20:08:31,419][26022] Updated weights on worker 0-0, policy_version 872782 (0.00091) [2022-07-10 20:08:31,527][25689] Fps is (10 sec: 5806.2, 60 sec: 5581.8, 300 sec: 5537.4). Total num frames: 893729792. Throughput: 0: 5880.7. Samples: 893726510. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:31,527][25689] Avg episode reward: [(0, '-1.800')] [2022-07-10 20:08:33,473][26022] Updated weights on worker 0-0, policy_version 872792 (0.00087) [2022-07-10 20:08:34,979][26022] Updated weights on worker 0-0, policy_version 872802 (0.00086) [2022-07-10 20:08:36,538][25689] Fps is (10 sec: 5395.1, 60 sec: 5513.3, 300 sec: 5527.0). Total num frames: 893754368. Throughput: 0: 5857.2. Samples: 893760132. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:36,539][25689] Avg episode reward: [(0, '-1.080')] [2022-07-10 20:08:37,155][26022] Updated weights on worker 0-0, policy_version 872812 (0.00083) [2022-07-10 20:08:38,786][26022] Updated weights on worker 0-0, policy_version 872822 (0.00091) [2022-07-10 20:08:40,638][26022] Updated weights on worker 0-0, policy_version 872832 (0.00085) [2022-07-10 20:08:41,541][25689] Fps is (10 sec: 5419.4, 60 sec: 5552.2, 300 sec: 5533.8). Total num frames: 893784064. Throughput: 0: 5859.1. Samples: 893793810. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:41,543][25689] Avg episode reward: [(0, '-0.443')] [2022-07-10 20:08:42,628][26022] Updated weights on worker 0-0, policy_version 872842 (0.00085) [2022-07-10 20:08:44,215][26022] Updated weights on worker 0-0, policy_version 872852 (0.00088) [2022-07-10 20:08:46,206][26022] Updated weights on worker 0-0, policy_version 872862 (0.00089) [2022-07-10 20:08:46,552][25689] Fps is (10 sec: 5828.5, 60 sec: 5570.6, 300 sec: 5539.0). Total num frames: 893812736. Throughput: 0: 5838.5. Samples: 893810362. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:46,553][25689] Avg episode reward: [(0, '0.349')] [2022-07-10 20:08:48,080][26022] Updated weights on worker 0-0, policy_version 872872 (0.00092) [2022-07-10 20:08:49,914][26022] Updated weights on worker 0-0, policy_version 872882 (0.00092) [2022-07-10 20:08:51,625][25689] Fps is (10 sec: 5483.4, 60 sec: 5552.8, 300 sec: 5532.6). Total num frames: 893839360. Throughput: 0: 5843.3. Samples: 893843954. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:51,626][25689] Avg episode reward: [(0, '0.258')] [2022-07-10 20:08:51,786][26022] Updated weights on worker 0-0, policy_version 872892 (0.00085) [2022-07-10 20:08:53,628][26022] Updated weights on worker 0-0, policy_version 872902 (0.00096) [2022-07-10 20:08:55,335][26022] Updated weights on worker 0-0, policy_version 872912 (0.00092) [2022-07-10 20:08:56,633][25689] Fps is (10 sec: 5485.4, 60 sec: 5554.9, 300 sec: 5532.8). Total num frames: 893868032. Throughput: 0: 5835.3. Samples: 893877392. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:08:56,639][25689] Avg episode reward: [(0, '0.558')] [2022-07-10 20:08:57,418][26022] Updated weights on worker 0-0, policy_version 872922 (0.00084) [2022-07-10 20:08:59,174][26022] Updated weights on worker 0-0, policy_version 872932 (0.00086) [2022-07-10 20:09:00,904][26022] Updated weights on worker 0-0, policy_version 872942 (0.00099) [2022-07-10 20:09:01,422][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:09:01,434][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000872945_893895680.pth [2022-07-10 20:09:01,434][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000870996_891899904.pth [2022-07-10 20:09:01,719][25689] Fps is (10 sec: 5579.8, 60 sec: 5552.1, 300 sec: 5541.8). Total num frames: 893895680. Throughput: 0: 4975.1. Samples: 893894198. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:01,720][25689] Avg episode reward: [(0, '0.427')] [2022-07-10 20:09:03,326][26022] Updated weights on worker 0-0, policy_version 872952 (0.00093) [2022-07-10 20:09:05,000][26022] Updated weights on worker 0-0, policy_version 872962 (0.00087) [2022-07-10 20:09:06,735][25689] Fps is (10 sec: 5372.7, 60 sec: 5552.7, 300 sec: 5539.2). Total num frames: 893922304. Throughput: 0: 5722.5. Samples: 893925854. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:06,735][25689] Avg episode reward: [(0, '0.464')] [2022-07-10 20:09:06,891][26022] Updated weights on worker 0-0, policy_version 872972 (0.00100) [2022-07-10 20:09:08,531][26022] Updated weights on worker 0-0, policy_version 872982 (0.00086) [2022-07-10 20:09:10,435][26022] Updated weights on worker 0-0, policy_version 872992 (0.00089) [2022-07-10 20:09:11,778][25689] Fps is (10 sec: 5497.0, 60 sec: 5552.2, 300 sec: 5535.6). Total num frames: 893950976. Throughput: 0: 5739.3. Samples: 893959618. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:11,779][25689] Avg episode reward: [(0, '0.717')] [2022-07-10 20:09:12,412][26022] Updated weights on worker 0-0, policy_version 873002 (0.00099) [2022-07-10 20:09:14,085][26022] Updated weights on worker 0-0, policy_version 873012 (0.00087) [2022-07-10 20:09:16,024][26022] Updated weights on worker 0-0, policy_version 873022 (0.00094) [2022-07-10 20:09:16,789][25689] Fps is (10 sec: 5601.4, 60 sec: 5568.5, 300 sec: 5539.4). Total num frames: 893978624. Throughput: 0: 4906.7. Samples: 893976292. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:16,790][25689] Avg episode reward: [(0, '0.327')] [2022-07-10 20:09:17,773][26022] Updated weights on worker 0-0, policy_version 873032 (0.00086) [2022-07-10 20:09:19,642][26022] Updated weights on worker 0-0, policy_version 873042 (0.00054) [2022-07-10 20:09:21,419][26022] Updated weights on worker 0-0, policy_version 873052 (0.00088) [2022-07-10 20:09:21,819][25689] Fps is (10 sec: 5507.4, 60 sec: 5548.9, 300 sec: 5536.8). Total num frames: 894006272. Throughput: 0: 5753.2. Samples: 894009836. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:21,819][25689] Avg episode reward: [(0, '0.185')] [2022-07-10 20:09:23,312][26022] Updated weights on worker 0-0, policy_version 873062 (0.00090) [2022-07-10 20:09:25,178][26022] Updated weights on worker 0-0, policy_version 873072 (0.00085) [2022-07-10 20:09:26,858][25689] Fps is (10 sec: 5593.7, 60 sec: 5545.8, 300 sec: 5538.8). Total num frames: 894034944. Throughput: 0: 5843.3. Samples: 894043440. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:26,858][25689] Avg episode reward: [(0, '-0.094')] [2022-07-10 20:09:26,995][26022] Updated weights on worker 0-0, policy_version 873082 (0.00092) [2022-07-10 20:09:28,843][26022] Updated weights on worker 0-0, policy_version 873092 (0.00081) [2022-07-10 20:09:30,761][26022] Updated weights on worker 0-0, policy_version 873102 (0.00095) [2022-07-10 20:09:31,957][25689] Fps is (10 sec: 5555.3, 60 sec: 5507.2, 300 sec: 5537.6). Total num frames: 894062592. Throughput: 0: 4976.3. Samples: 894060032. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:31,957][25689] Avg episode reward: [(0, '-0.535')] [2022-07-10 20:09:32,604][26022] Updated weights on worker 0-0, policy_version 873112 (0.00093) [2022-07-10 20:09:34,362][26022] Updated weights on worker 0-0, policy_version 873122 (0.00467) [2022-07-10 20:09:36,223][26022] Updated weights on worker 0-0, policy_version 873132 (0.00085) [2022-07-10 20:09:36,966][25689] Fps is (10 sec: 5571.8, 60 sec: 5575.2, 300 sec: 5541.0). Total num frames: 894091264. Throughput: 0: 5836.6. Samples: 894094054. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:36,966][25689] Avg episode reward: [(0, '-0.707')] [2022-07-10 20:09:37,929][26022] Updated weights on worker 0-0, policy_version 873142 (0.00090) [2022-07-10 20:09:39,950][26022] Updated weights on worker 0-0, policy_version 873152 (0.00096) [2022-07-10 20:09:41,668][26022] Updated weights on worker 0-0, policy_version 873162 (0.01293) [2022-07-10 20:09:41,971][25689] Fps is (10 sec: 5726.3, 60 sec: 5558.0, 300 sec: 5541.2). Total num frames: 894119936. Throughput: 0: 5844.9. Samples: 894127624. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:41,972][25689] Avg episode reward: [(0, '-0.359')] [2022-07-10 20:09:43,504][26022] Updated weights on worker 0-0, policy_version 873172 (0.00093) [2022-07-10 20:09:45,246][26022] Updated weights on worker 0-0, policy_version 873182 (0.00084) [2022-07-10 20:09:46,974][25689] Fps is (10 sec: 5627.4, 60 sec: 5541.9, 300 sec: 5546.8). Total num frames: 894147584. Throughput: 0: 5010.7. Samples: 894144236. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:46,974][25689] Avg episode reward: [(0, '-0.026')] [2022-07-10 20:09:47,204][26022] Updated weights on worker 0-0, policy_version 873192 (0.00088) [2022-07-10 20:09:48,986][26022] Updated weights on worker 0-0, policy_version 873202 (0.00087) [2022-07-10 20:09:50,804][26022] Updated weights on worker 0-0, policy_version 873212 (0.00085) [2022-07-10 20:09:52,038][25689] Fps is (10 sec: 5492.7, 60 sec: 5559.6, 300 sec: 5542.2). Total num frames: 894175232. Throughput: 0: 5858.3. Samples: 894177674. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:52,039][25689] Avg episode reward: [(0, '0.045')] [2022-07-10 20:09:52,650][26022] Updated weights on worker 0-0, policy_version 873222 (0.00102) [2022-07-10 20:09:54,599][26022] Updated weights on worker 0-0, policy_version 873232 (0.00086) [2022-07-10 20:09:56,550][26022] Updated weights on worker 0-0, policy_version 873242 (0.00097) [2022-07-10 20:09:57,043][25689] Fps is (10 sec: 5491.8, 60 sec: 5543.0, 300 sec: 5542.9). Total num frames: 894202880. Throughput: 0: 5824.2. Samples: 894210986. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:09:57,043][25689] Avg episode reward: [(0, '-0.153')] [2022-07-10 20:09:58,167][26022] Updated weights on worker 0-0, policy_version 873252 (0.00082) [2022-07-10 20:10:00,040][26022] Updated weights on worker 0-0, policy_version 873262 (0.00093) [2022-07-10 20:10:02,069][25689] Fps is (10 sec: 5410.4, 60 sec: 5531.4, 300 sec: 5543.5). Total num frames: 894229504. Throughput: 0: 4969.6. Samples: 894227504. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:10:02,070][25689] Avg episode reward: [(0, '-0.047')] [2022-07-10 20:10:02,124][26022] Updated weights on worker 0-0, policy_version 873272 (0.00092) [2022-07-10 20:10:04,255][26022] Updated weights on worker 0-0, policy_version 873282 (0.00083) [2022-07-10 20:10:05,763][26022] Updated weights on worker 0-0, policy_version 873292 (0.00093) [2022-07-10 20:10:07,071][25689] Fps is (10 sec: 5412.1, 60 sec: 5549.7, 300 sec: 5548.1). Total num frames: 894257152. Throughput: 0: 5710.8. Samples: 894259004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:10:07,073][25689] Avg episode reward: [(0, '-0.181')] [2022-07-10 20:10:07,847][26022] Updated weights on worker 0-0, policy_version 873302 (0.00396) [2022-07-10 20:10:09,774][26022] Updated weights on worker 0-0, policy_version 873312 (0.00089) [2022-07-10 20:10:11,295][26022] Updated weights on worker 0-0, policy_version 873322 (0.00085) [2022-07-10 20:10:12,140][25689] Fps is (10 sec: 5490.8, 60 sec: 5530.4, 300 sec: 5537.3). Total num frames: 894284800. Throughput: 0: 5714.1. Samples: 894292538. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:10:12,141][25689] Avg episode reward: [(0, '-0.035')] [2022-07-10 20:10:13,247][26022] Updated weights on worker 0-0, policy_version 873332 (0.00092) [2022-07-10 20:10:14,929][26022] Updated weights on worker 0-0, policy_version 873342 (0.00090) [2022-07-10 20:10:17,008][26022] Updated weights on worker 0-0, policy_version 873352 (0.00091) [2022-07-10 20:10:17,203][25689] Fps is (10 sec: 5457.6, 60 sec: 5525.7, 300 sec: 5536.4). Total num frames: 894312448. Throughput: 0: 4892.3. Samples: 894309614. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:10:17,204][25689] Avg episode reward: [(0, '-0.130')] [2022-07-10 20:10:18,734][26022] Updated weights on worker 0-0, policy_version 873362 (0.00091) [2022-07-10 20:10:20,481][26022] Updated weights on worker 0-0, policy_version 873372 (0.00077) [2022-07-10 20:10:22,208][25689] Fps is (10 sec: 5695.6, 60 sec: 5561.8, 300 sec: 5550.4). Total num frames: 894342144. Throughput: 0: 5754.8. Samples: 894343400. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:10:22,209][25689] Avg episode reward: [(0, '0.377')] [2022-07-10 20:10:22,410][26022] Updated weights on worker 0-0, policy_version 873382 (0.00091) [2022-07-10 20:10:24,324][26022] Updated weights on worker 0-0, policy_version 873392 (0.00096) [2022-07-10 20:10:26,054][26022] Updated weights on worker 0-0, policy_version 873402 (0.00096) [2022-07-10 20:10:27,221][25689] Fps is (10 sec: 5723.7, 60 sec: 5547.2, 300 sec: 5540.5). Total num frames: 894369792. Throughput: 0: 5842.7. Samples: 894376738. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-10 20:10:27,223][25689] Avg episode reward: [(0, '0.829')] [2022-07-10 20:10:27,947][26022] Updated weights on worker 0-0, policy_version 873412 (0.00089) [2022-07-10 20:10:29,705][26022] Updated weights on worker 0-0, policy_version 873422 (0.00092) [2022-07-10 20:10:31,806][26022] Updated weights on worker 0-0, policy_version 873432 (0.00089) [2022-07-10 20:10:32,259][25689] Fps is (10 sec: 5501.5, 60 sec: 5552.8, 300 sec: 5546.9). Total num frames: 894397440. Throughput: 0: 4994.3. Samples: 894393020. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:10:32,260][25689] Avg episode reward: [(0, '0.531')] [2022-07-10 20:10:33,495][26022] Updated weights on worker 0-0, policy_version 873442 (0.00089) [2022-07-10 20:10:35,353][26022] Updated weights on worker 0-0, policy_version 873452 (0.00085) [2022-07-10 20:10:37,263][25689] Fps is (10 sec: 5404.8, 60 sec: 5519.3, 300 sec: 5537.9). Total num frames: 894424064. Throughput: 0: 5828.0. Samples: 894426524. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:10:37,263][25689] Avg episode reward: [(0, '0.492')] [2022-07-10 20:10:37,343][26022] Updated weights on worker 0-0, policy_version 873462 (0.00090) [2022-07-10 20:10:38,959][26022] Updated weights on worker 0-0, policy_version 873472 (0.00089) [2022-07-10 20:10:40,967][26022] Updated weights on worker 0-0, policy_version 873482 (0.00084) [2022-07-10 20:10:42,275][25689] Fps is (10 sec: 5521.1, 60 sec: 5518.7, 300 sec: 5537.8). Total num frames: 894452736. Throughput: 0: 5816.2. Samples: 894460110. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:10:42,275][25689] Avg episode reward: [(0, '0.257')] [2022-07-10 20:10:42,583][26022] Updated weights on worker 0-0, policy_version 873492 (0.00089) [2022-07-10 20:10:44,628][26022] Updated weights on worker 0-0, policy_version 873502 (0.00092) [2022-07-10 20:10:46,352][26022] Updated weights on worker 0-0, policy_version 873512 (0.00088) [2022-07-10 20:10:47,283][25689] Fps is (10 sec: 5722.7, 60 sec: 5535.2, 300 sec: 5546.1). Total num frames: 894481408. Throughput: 0: 4988.7. Samples: 894476822. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:10:47,284][25689] Avg episode reward: [(0, '-0.157')] [2022-07-10 20:10:48,533][26022] Updated weights on worker 0-0, policy_version 873522 (0.00087) [2022-07-10 20:10:49,818][26022] Updated weights on worker 0-0, policy_version 873532 (0.00087) [2022-07-10 20:10:52,070][26022] Updated weights on worker 0-0, policy_version 873542 (0.00090) [2022-07-10 20:10:52,328][25689] Fps is (10 sec: 5500.1, 60 sec: 5520.0, 300 sec: 5539.1). Total num frames: 894508032. Throughput: 0: 5826.0. Samples: 894509942. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:10:52,329][25689] Avg episode reward: [(0, '-0.907')] [2022-07-10 20:10:53,753][26022] Updated weights on worker 0-0, policy_version 873552 (0.00082) [2022-07-10 20:10:55,541][26022] Updated weights on worker 0-0, policy_version 873562 (0.00086) [2022-07-10 20:10:57,344][25689] Fps is (10 sec: 5496.0, 60 sec: 5535.9, 300 sec: 5535.4). Total num frames: 894536704. Throughput: 0: 5820.0. Samples: 894543400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:10:57,345][25689] Avg episode reward: [(0, '-1.300')] [2022-07-10 20:10:57,626][26022] Updated weights on worker 0-0, policy_version 873572 (0.00096) [2022-07-10 20:10:59,269][26022] Updated weights on worker 0-0, policy_version 873582 (0.00093) [2022-07-10 20:11:01,275][26022] Updated weights on worker 0-0, policy_version 873592 (0.00093) [2022-07-10 20:11:01,680][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:11:01,695][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000873595_894561280.pth [2022-07-10 20:11:01,695][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000871645_892564480.pth [2022-07-10 20:11:02,359][25689] Fps is (10 sec: 5512.8, 60 sec: 5537.0, 300 sec: 5543.6). Total num frames: 894563328. Throughput: 0: 4973.6. Samples: 894560000. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:02,361][25689] Avg episode reward: [(0, '-0.703')] [2022-07-10 20:11:03,513][26022] Updated weights on worker 0-0, policy_version 873602 (0.00083) [2022-07-10 20:11:05,245][26022] Updated weights on worker 0-0, policy_version 873612 (0.00089) [2022-07-10 20:11:07,136][26022] Updated weights on worker 0-0, policy_version 873622 (0.00089) [2022-07-10 20:11:07,376][25689] Fps is (10 sec: 5307.9, 60 sec: 5518.6, 300 sec: 5537.3). Total num frames: 894589952. Throughput: 0: 5685.9. Samples: 894591068. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:07,377][25689] Avg episode reward: [(0, '-0.628')] [2022-07-10 20:11:09,000][26022] Updated weights on worker 0-0, policy_version 873632 (0.00101) [2022-07-10 20:11:10,643][26022] Updated weights on worker 0-0, policy_version 873642 (0.00090) [2022-07-10 20:11:12,422][25689] Fps is (10 sec: 5393.2, 60 sec: 5520.8, 300 sec: 5536.6). Total num frames: 894617600. Throughput: 0: 5712.6. Samples: 894624728. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:12,422][25689] Avg episode reward: [(0, '-1.225')] [2022-07-10 20:11:12,912][26022] Updated weights on worker 0-0, policy_version 873652 (0.00091) [2022-07-10 20:11:14,315][26022] Updated weights on worker 0-0, policy_version 873662 (0.00088) [2022-07-10 20:11:16,310][26022] Updated weights on worker 0-0, policy_version 873672 (0.00088) [2022-07-10 20:11:17,437][25689] Fps is (10 sec: 5496.4, 60 sec: 5525.1, 300 sec: 5536.5). Total num frames: 894645248. Throughput: 0: 4882.2. Samples: 894641494. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:17,442][25689] Avg episode reward: [(0, '-0.989')] [2022-07-10 20:11:18,216][26022] Updated weights on worker 0-0, policy_version 873682 (0.00080) [2022-07-10 20:11:19,763][26022] Updated weights on worker 0-0, policy_version 873692 (0.00079) [2022-07-10 20:11:21,882][26022] Updated weights on worker 0-0, policy_version 873702 (0.00096) [2022-07-10 20:11:22,461][25689] Fps is (10 sec: 5711.8, 60 sec: 5523.4, 300 sec: 5546.4). Total num frames: 894674944. Throughput: 0: 5717.0. Samples: 894674926. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:22,463][25689] Avg episode reward: [(0, '-0.431')] [2022-07-10 20:11:23,624][26022] Updated weights on worker 0-0, policy_version 873712 (0.00092) [2022-07-10 20:11:25,490][26022] Updated weights on worker 0-0, policy_version 873722 (0.00085) [2022-07-10 20:11:27,385][26022] Updated weights on worker 0-0, policy_version 873732 (0.00087) [2022-07-10 20:11:27,470][25689] Fps is (10 sec: 5715.5, 60 sec: 5523.8, 300 sec: 5545.2). Total num frames: 894702592. Throughput: 0: 5841.5. Samples: 894708444. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:27,470][25689] Avg episode reward: [(0, '-0.326')] [2022-07-10 20:11:29,112][26022] Updated weights on worker 0-0, policy_version 873742 (0.00091) [2022-07-10 20:11:30,897][26022] Updated weights on worker 0-0, policy_version 873752 (0.00085) [2022-07-10 20:11:32,515][25689] Fps is (10 sec: 5398.4, 60 sec: 5506.2, 300 sec: 5538.0). Total num frames: 894729216. Throughput: 0: 5005.4. Samples: 894725302. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:32,515][25689] Avg episode reward: [(0, '-0.603')] [2022-07-10 20:11:32,932][26022] Updated weights on worker 0-0, policy_version 873762 (0.00088) [2022-07-10 20:11:34,508][26022] Updated weights on worker 0-0, policy_version 873772 (0.00089) [2022-07-10 20:11:36,580][26022] Updated weights on worker 0-0, policy_version 873782 (0.00056) [2022-07-10 20:11:37,527][25689] Fps is (10 sec: 5599.7, 60 sec: 5556.3, 300 sec: 5541.8). Total num frames: 894758912. Throughput: 0: 5836.2. Samples: 894758748. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:37,528][25689] Avg episode reward: [(0, '-0.759')] [2022-07-10 20:11:38,255][26022] Updated weights on worker 0-0, policy_version 873792 (0.00090) [2022-07-10 20:11:40,175][26022] Updated weights on worker 0-0, policy_version 873802 (0.00092) [2022-07-10 20:11:41,985][26022] Updated weights on worker 0-0, policy_version 873812 (0.00081) [2022-07-10 20:11:42,538][25689] Fps is (10 sec: 5721.1, 60 sec: 5539.4, 300 sec: 5542.4). Total num frames: 894786560. Throughput: 0: 5849.9. Samples: 894792372. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:42,540][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 20:11:43,789][26022] Updated weights on worker 0-0, policy_version 873822 (0.00088) [2022-07-10 20:11:45,666][26022] Updated weights on worker 0-0, policy_version 873832 (0.00085) [2022-07-10 20:11:47,430][26022] Updated weights on worker 0-0, policy_version 873842 (0.00084) [2022-07-10 20:11:47,541][25689] Fps is (10 sec: 5522.0, 60 sec: 5522.9, 300 sec: 5544.1). Total num frames: 894814208. Throughput: 0: 5022.3. Samples: 894809248. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:47,542][25689] Avg episode reward: [(0, '0.384')] [2022-07-10 20:11:49,404][26022] Updated weights on worker 0-0, policy_version 873852 (0.00086) [2022-07-10 20:11:51,308][26022] Updated weights on worker 0-0, policy_version 873862 (0.00090) [2022-07-10 20:11:52,668][25689] Fps is (10 sec: 5559.5, 60 sec: 5549.3, 300 sec: 5538.6). Total num frames: 894842880. Throughput: 0: 5836.3. Samples: 894842924. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:52,669][25689] Avg episode reward: [(0, '0.349')] [2022-07-10 20:11:52,968][26022] Updated weights on worker 0-0, policy_version 873872 (0.00094) [2022-07-10 20:11:54,707][26022] Updated weights on worker 0-0, policy_version 873882 (0.00087) [2022-07-10 20:11:56,576][26022] Updated weights on worker 0-0, policy_version 873892 (0.00062) [2022-07-10 20:11:57,692][25689] Fps is (10 sec: 5648.9, 60 sec: 5548.6, 300 sec: 5542.4). Total num frames: 894871552. Throughput: 0: 5836.0. Samples: 894876432. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:11:57,693][25689] Avg episode reward: [(0, '0.829')] [2022-07-10 20:11:58,806][26022] Updated weights on worker 0-0, policy_version 873902 (0.00082) [2022-07-10 20:12:00,276][26022] Updated weights on worker 0-0, policy_version 873912 (0.00084) [2022-07-10 20:12:02,565][26022] Updated weights on worker 0-0, policy_version 873922 (0.00099) [2022-07-10 20:12:02,734][25689] Fps is (10 sec: 5290.2, 60 sec: 5512.2, 300 sec: 5542.6). Total num frames: 894896128. Throughput: 0: 5001.2. Samples: 894893376. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:02,734][25689] Avg episode reward: [(0, '1.007')] [2022-07-10 20:12:04,435][26022] Updated weights on worker 0-0, policy_version 873932 (0.00088) [2022-07-10 20:12:06,155][26022] Updated weights on worker 0-0, policy_version 873942 (0.00083) [2022-07-10 20:12:07,784][25689] Fps is (10 sec: 5175.1, 60 sec: 5526.2, 300 sec: 5533.5). Total num frames: 894923776. Throughput: 0: 5692.4. Samples: 894924478. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:07,784][25689] Avg episode reward: [(0, '1.183')] [2022-07-10 20:12:08,336][26022] Updated weights on worker 0-0, policy_version 873952 (0.00095) [2022-07-10 20:12:09,699][26022] Updated weights on worker 0-0, policy_version 873962 (0.00081) [2022-07-10 20:12:11,806][26022] Updated weights on worker 0-0, policy_version 873972 (0.00093) [2022-07-10 20:12:12,912][25689] Fps is (10 sec: 5734.6, 60 sec: 5569.4, 300 sec: 5545.5). Total num frames: 894954496. Throughput: 0: 5695.0. Samples: 894958212. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:12,912][25689] Avg episode reward: [(0, '1.137')] [2022-07-10 20:12:13,363][26022] Updated weights on worker 0-0, policy_version 873982 (0.00095) [2022-07-10 20:12:15,291][26022] Updated weights on worker 0-0, policy_version 873992 (0.00088) [2022-07-10 20:12:17,101][26022] Updated weights on worker 0-0, policy_version 874002 (0.00086) [2022-07-10 20:12:17,982][25689] Fps is (10 sec: 5622.8, 60 sec: 5547.4, 300 sec: 5534.6). Total num frames: 894981120. Throughput: 0: 5690.3. Samples: 894991888. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:17,983][25689] Avg episode reward: [(0, '1.378')] [2022-07-10 20:12:18,939][26022] Updated weights on worker 0-0, policy_version 874012 (0.00084) [2022-07-10 20:12:20,835][26022] Updated weights on worker 0-0, policy_version 874022 (0.00084) [2022-07-10 20:12:22,809][26022] Updated weights on worker 0-0, policy_version 874032 (0.00081) [2022-07-10 20:12:23,004][25689] Fps is (10 sec: 5479.1, 60 sec: 5530.7, 300 sec: 5542.6). Total num frames: 895009792. Throughput: 0: 5699.2. Samples: 895008902. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:23,005][25689] Avg episode reward: [(0, '0.485')] [2022-07-10 20:12:24,319][26022] Updated weights on worker 0-0, policy_version 874042 (0.00087) [2022-07-10 20:12:26,327][26022] Updated weights on worker 0-0, policy_version 874052 (0.00090) [2022-07-10 20:12:28,047][25689] Fps is (10 sec: 5697.7, 60 sec: 5544.5, 300 sec: 5542.9). Total num frames: 895038464. Throughput: 0: 5825.2. Samples: 895042514. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:28,047][25689] Avg episode reward: [(0, '0.576')] [2022-07-10 20:12:28,055][26022] Updated weights on worker 0-0, policy_version 874062 (0.00081) [2022-07-10 20:12:30,093][26022] Updated weights on worker 0-0, policy_version 874072 (0.00092) [2022-07-10 20:12:31,856][26022] Updated weights on worker 0-0, policy_version 874082 (0.00095) [2022-07-10 20:12:33,096][25689] Fps is (10 sec: 5580.5, 60 sec: 5561.0, 300 sec: 5538.6). Total num frames: 895066112. Throughput: 0: 5810.9. Samples: 895075502. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:33,101][25689] Avg episode reward: [(0, '0.219')] [2022-07-10 20:12:33,681][26022] Updated weights on worker 0-0, policy_version 874092 (0.00080) [2022-07-10 20:12:35,545][26022] Updated weights on worker 0-0, policy_version 874102 (0.00092) [2022-07-10 20:12:37,409][26022] Updated weights on worker 0-0, policy_version 874112 (0.00080) [2022-07-10 20:12:38,163][25689] Fps is (10 sec: 5567.4, 60 sec: 5539.2, 300 sec: 5541.9). Total num frames: 895094784. Throughput: 0: 4976.1. Samples: 895092308. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:38,165][25689] Avg episode reward: [(0, '0.086')] [2022-07-10 20:12:39,260][26022] Updated weights on worker 0-0, policy_version 874122 (0.00095) [2022-07-10 20:12:40,994][26022] Updated weights on worker 0-0, policy_version 874132 (0.00088) [2022-07-10 20:12:43,104][26022] Updated weights on worker 0-0, policy_version 874142 (0.00084) [2022-07-10 20:12:43,226][25689] Fps is (10 sec: 5559.9, 60 sec: 5534.4, 300 sec: 5541.2). Total num frames: 895122432. Throughput: 0: 5773.0. Samples: 895125644. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:43,227][25689] Avg episode reward: [(0, '-0.397')] [2022-07-10 20:12:44,751][26022] Updated weights on worker 0-0, policy_version 874152 (0.00096) [2022-07-10 20:12:46,768][26022] Updated weights on worker 0-0, policy_version 874162 (0.00086) [2022-07-10 20:12:48,274][25689] Fps is (10 sec: 5569.9, 60 sec: 5547.1, 300 sec: 5545.0). Total num frames: 895151104. Throughput: 0: 5765.6. Samples: 895159138. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:48,275][25689] Avg episode reward: [(0, '-0.576')] [2022-07-10 20:12:48,311][26022] Updated weights on worker 0-0, policy_version 874172 (0.00086) [2022-07-10 20:12:50,325][26022] Updated weights on worker 0-0, policy_version 874182 (0.00608) [2022-07-10 20:12:52,106][26022] Updated weights on worker 0-0, policy_version 874192 (0.00106) [2022-07-10 20:12:53,411][25689] Fps is (10 sec: 5630.5, 60 sec: 5546.3, 300 sec: 5543.0). Total num frames: 895179776. Throughput: 0: 4949.1. Samples: 895176040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:53,411][25689] Avg episode reward: [(0, '-0.530')] [2022-07-10 20:12:53,948][26022] Updated weights on worker 0-0, policy_version 874202 (0.00086) [2022-07-10 20:12:56,024][26022] Updated weights on worker 0-0, policy_version 874212 (0.00076) [2022-07-10 20:12:57,583][26022] Updated weights on worker 0-0, policy_version 874222 (0.00078) [2022-07-10 20:12:58,431][25689] Fps is (10 sec: 5544.9, 60 sec: 5529.8, 300 sec: 5543.7). Total num frames: 895207424. Throughput: 0: 5787.8. Samples: 895209618. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:12:58,432][25689] Avg episode reward: [(0, '-0.354')] [2022-07-10 20:12:59,592][26022] Updated weights on worker 0-0, policy_version 874232 (0.00086) [2022-07-10 20:13:01,235][26022] Updated weights on worker 0-0, policy_version 874242 (0.00085) [2022-07-10 20:13:02,104][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:13:02,112][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000874245_895226880.pth [2022-07-10 20:13:02,115][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000872294_893229056.pth [2022-07-10 20:13:03,448][25689] Fps is (10 sec: 5305.1, 60 sec: 5548.9, 300 sec: 5540.3). Total num frames: 895233024. Throughput: 0: 5708.3. Samples: 895241076. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:03,448][25689] Avg episode reward: [(0, '-0.384')] [2022-07-10 20:13:03,613][26022] Updated weights on worker 0-0, policy_version 874252 (0.00094) [2022-07-10 20:13:05,503][26022] Updated weights on worker 0-0, policy_version 874262 (0.00095) [2022-07-10 20:13:07,260][26022] Updated weights on worker 0-0, policy_version 874272 (0.00091) [2022-07-10 20:13:08,471][25689] Fps is (10 sec: 5303.5, 60 sec: 5551.3, 300 sec: 5537.1). Total num frames: 895260672. Throughput: 0: 4861.9. Samples: 895257334. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:08,473][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 20:13:09,183][26022] Updated weights on worker 0-0, policy_version 874282 (0.00089) [2022-07-10 20:13:10,876][26022] Updated weights on worker 0-0, policy_version 874292 (0.00092) [2022-07-10 20:13:12,730][26022] Updated weights on worker 0-0, policy_version 874302 (0.00097) [2022-07-10 20:13:13,583][25689] Fps is (10 sec: 5658.0, 60 sec: 5536.0, 300 sec: 5545.5). Total num frames: 895290368. Throughput: 0: 5692.1. Samples: 895290862. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:13,583][25689] Avg episode reward: [(0, '-0.103')] [2022-07-10 20:13:14,812][26022] Updated weights on worker 0-0, policy_version 874312 (0.00086) [2022-07-10 20:13:16,431][26022] Updated weights on worker 0-0, policy_version 874322 (0.00100) [2022-07-10 20:13:18,350][26022] Updated weights on worker 0-0, policy_version 874332 (0.00082) [2022-07-10 20:13:18,588][25689] Fps is (10 sec: 5566.9, 60 sec: 5541.9, 300 sec: 5538.5). Total num frames: 895316992. Throughput: 0: 5699.8. Samples: 895324510. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:18,589][25689] Avg episode reward: [(0, '-0.748')] [2022-07-10 20:13:20,122][26022] Updated weights on worker 0-0, policy_version 874342 (0.00099) [2022-07-10 20:13:22,062][26022] Updated weights on worker 0-0, policy_version 874352 (0.00089) [2022-07-10 20:13:23,640][25689] Fps is (10 sec: 5600.0, 60 sec: 5556.1, 300 sec: 5541.1). Total num frames: 895346688. Throughput: 0: 4953.7. Samples: 895341102. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:23,641][25689] Avg episode reward: [(0, '-0.182')] [2022-07-10 20:13:23,647][26022] Updated weights on worker 0-0, policy_version 874362 (0.00087) [2022-07-10 20:13:25,788][26022] Updated weights on worker 0-0, policy_version 874372 (0.00093) [2022-07-10 20:13:27,450][26022] Updated weights on worker 0-0, policy_version 874382 (0.00092) [2022-07-10 20:13:28,693][25689] Fps is (10 sec: 5573.8, 60 sec: 5521.4, 300 sec: 5530.6). Total num frames: 895373312. Throughput: 0: 5787.4. Samples: 895374364. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:28,693][25689] Avg episode reward: [(0, '-0.421')] [2022-07-10 20:13:29,595][26022] Updated weights on worker 0-0, policy_version 874392 (0.00083) [2022-07-10 20:13:31,092][26022] Updated weights on worker 0-0, policy_version 874402 (0.00092) [2022-07-10 20:13:33,133][26022] Updated weights on worker 0-0, policy_version 874412 (0.00084) [2022-07-10 20:13:33,779][25689] Fps is (10 sec: 5352.9, 60 sec: 5518.0, 300 sec: 5539.6). Total num frames: 895400960. Throughput: 0: 5781.1. Samples: 895407616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:33,779][25689] Avg episode reward: [(0, '-0.045')] [2022-07-10 20:13:35,063][26022] Updated weights on worker 0-0, policy_version 874422 (0.00085) [2022-07-10 20:13:36,845][26022] Updated weights on worker 0-0, policy_version 874432 (0.00088) [2022-07-10 20:13:38,624][26022] Updated weights on worker 0-0, policy_version 874442 (0.00089) [2022-07-10 20:13:38,786][25689] Fps is (10 sec: 5579.9, 60 sec: 5523.4, 300 sec: 5536.1). Total num frames: 895429632. Throughput: 0: 4935.3. Samples: 895424192. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:38,787][25689] Avg episode reward: [(0, '0.020')] [2022-07-10 20:13:40,610][26022] Updated weights on worker 0-0, policy_version 874452 (0.00098) [2022-07-10 20:13:41,958][26022] Updated weights on worker 0-0, policy_version 874462 (0.00087) [2022-07-10 20:13:43,848][25689] Fps is (10 sec: 5593.4, 60 sec: 5523.6, 300 sec: 5531.7). Total num frames: 895457280. Throughput: 0: 5777.7. Samples: 895457856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:43,848][25689] Avg episode reward: [(0, '-0.874')] [2022-07-10 20:13:44,203][26022] Updated weights on worker 0-0, policy_version 874472 (0.00079) [2022-07-10 20:13:45,693][26022] Updated weights on worker 0-0, policy_version 874482 (0.00089) [2022-07-10 20:13:47,797][26022] Updated weights on worker 0-0, policy_version 874492 (0.00092) [2022-07-10 20:13:48,888][25689] Fps is (10 sec: 5473.9, 60 sec: 5507.4, 300 sec: 5535.7). Total num frames: 895484928. Throughput: 0: 5796.3. Samples: 895491420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:48,888][25689] Avg episode reward: [(0, '-0.774')] [2022-07-10 20:13:49,484][26022] Updated weights on worker 0-0, policy_version 874502 (0.00086) [2022-07-10 20:13:51,439][26022] Updated weights on worker 0-0, policy_version 874512 (0.00089) [2022-07-10 20:13:53,377][26022] Updated weights on worker 0-0, policy_version 874522 (0.00087) [2022-07-10 20:13:54,003][25689] Fps is (10 sec: 5647.0, 60 sec: 5526.3, 300 sec: 5537.2). Total num frames: 895514624. Throughput: 0: 4953.0. Samples: 895507784. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:54,003][25689] Avg episode reward: [(0, '-2.081')] [2022-07-10 20:13:55,020][26022] Updated weights on worker 0-0, policy_version 874532 (0.00094) [2022-07-10 20:13:56,965][26022] Updated weights on worker 0-0, policy_version 874542 (0.00093) [2022-07-10 20:13:58,814][26022] Updated weights on worker 0-0, policy_version 874552 (0.00084) [2022-07-10 20:13:59,005][25689] Fps is (10 sec: 5567.0, 60 sec: 5511.1, 300 sec: 5535.3). Total num frames: 895541248. Throughput: 0: 5801.4. Samples: 895541488. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:13:59,005][25689] Avg episode reward: [(0, '-2.163')] [2022-07-10 20:14:00,532][26022] Updated weights on worker 0-0, policy_version 874562 (0.00091) [2022-07-10 20:14:03,059][26022] Updated weights on worker 0-0, policy_version 874572 (0.00374) [2022-07-10 20:14:04,020][25689] Fps is (10 sec: 5315.4, 60 sec: 5528.1, 300 sec: 5535.3). Total num frames: 895567872. Throughput: 0: 5715.0. Samples: 895573142. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:14:04,021][25689] Avg episode reward: [(0, '-2.176')] [2022-07-10 20:14:04,395][26022] Updated weights on worker 0-0, policy_version 874582 (0.00092) [2022-07-10 20:14:06,743][26022] Updated weights on worker 0-0, policy_version 874592 (0.00088) [2022-07-10 20:14:08,171][26022] Updated weights on worker 0-0, policy_version 874602 (0.00082) [2022-07-10 20:14:09,099][25689] Fps is (10 sec: 5477.7, 60 sec: 5539.9, 300 sec: 5534.7). Total num frames: 895596544. Throughput: 0: 4860.7. Samples: 895589662. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 20:14:09,101][25689] Avg episode reward: [(0, '-1.411')] [2022-07-10 20:14:10,385][26022] Updated weights on worker 0-0, policy_version 874612 (0.00089) [2022-07-10 20:14:12,040][26022] Updated weights on worker 0-0, policy_version 874622 (0.00094) [2022-07-10 20:14:14,090][26022] Updated weights on worker 0-0, policy_version 874632 (0.00104) [2022-07-10 20:14:14,186][25689] Fps is (10 sec: 5439.4, 60 sec: 5491.5, 300 sec: 5529.8). Total num frames: 895623168. Throughput: 0: 5724.0. Samples: 895623314. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:14,187][25689] Avg episode reward: [(0, '-1.838')] [2022-07-10 20:14:15,421][26022] Updated weights on worker 0-0, policy_version 874642 (0.00086) [2022-07-10 20:14:17,647][26022] Updated weights on worker 0-0, policy_version 874652 (0.00085) [2022-07-10 20:14:19,036][26022] Updated weights on worker 0-0, policy_version 874662 (0.00087) [2022-07-10 20:14:19,213][25689] Fps is (10 sec: 5670.0, 60 sec: 5557.1, 300 sec: 5540.2). Total num frames: 895653888. Throughput: 0: 5727.0. Samples: 895657220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:19,213][25689] Avg episode reward: [(0, '-1.389')] [2022-07-10 20:14:21,222][26022] Updated weights on worker 0-0, policy_version 874672 (0.00092) [2022-07-10 20:14:22,839][26022] Updated weights on worker 0-0, policy_version 874682 (0.00085) [2022-07-10 20:14:24,240][25689] Fps is (10 sec: 5703.7, 60 sec: 5508.6, 300 sec: 5533.5). Total num frames: 895680512. Throughput: 0: 5817.7. Samples: 895690774. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:24,241][25689] Avg episode reward: [(0, '-0.746')] [2022-07-10 20:14:24,747][26022] Updated weights on worker 0-0, policy_version 874692 (0.00093) [2022-07-10 20:14:26,743][26022] Updated weights on worker 0-0, policy_version 874702 (0.00089) [2022-07-10 20:14:28,411][26022] Updated weights on worker 0-0, policy_version 874712 (0.00087) [2022-07-10 20:14:29,266][25689] Fps is (10 sec: 5398.7, 60 sec: 5528.0, 300 sec: 5534.9). Total num frames: 895708160. Throughput: 0: 5835.9. Samples: 895707350. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:29,266][25689] Avg episode reward: [(0, '-0.985')] [2022-07-10 20:14:30,330][26022] Updated weights on worker 0-0, policy_version 874722 (0.00090) [2022-07-10 20:14:32,150][26022] Updated weights on worker 0-0, policy_version 874732 (0.00091) [2022-07-10 20:14:33,844][26022] Updated weights on worker 0-0, policy_version 874742 (0.00096) [2022-07-10 20:14:34,302][25689] Fps is (10 sec: 5699.3, 60 sec: 5566.4, 300 sec: 5537.8). Total num frames: 895737856. Throughput: 0: 5849.6. Samples: 895740982. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:34,302][25689] Avg episode reward: [(0, '-0.943')] [2022-07-10 20:14:35,867][26022] Updated weights on worker 0-0, policy_version 874752 (0.00086) [2022-07-10 20:14:37,634][26022] Updated weights on worker 0-0, policy_version 874762 (0.00088) [2022-07-10 20:14:39,332][25689] Fps is (10 sec: 5696.9, 60 sec: 5547.4, 300 sec: 5533.9). Total num frames: 895765504. Throughput: 0: 5816.6. Samples: 895774242. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:39,332][25689] Avg episode reward: [(0, '-1.494')] [2022-07-10 20:14:39,472][26022] Updated weights on worker 0-0, policy_version 874772 (0.00085) [2022-07-10 20:14:41,426][26022] Updated weights on worker 0-0, policy_version 874782 (0.00088) [2022-07-10 20:14:43,226][26022] Updated weights on worker 0-0, policy_version 874792 (0.00088) [2022-07-10 20:14:44,355][25689] Fps is (10 sec: 5500.4, 60 sec: 5551.0, 300 sec: 5533.5). Total num frames: 895793152. Throughput: 0: 4984.6. Samples: 895791038. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:44,356][25689] Avg episode reward: [(0, '-1.374')] [2022-07-10 20:14:44,935][26022] Updated weights on worker 0-0, policy_version 874802 (0.00095) [2022-07-10 20:14:46,827][26022] Updated weights on worker 0-0, policy_version 874812 (0.00089) [2022-07-10 20:14:48,563][26022] Updated weights on worker 0-0, policy_version 874822 (0.00098) [2022-07-10 20:14:49,363][25689] Fps is (10 sec: 5410.4, 60 sec: 5537.0, 300 sec: 5531.1). Total num frames: 895819776. Throughput: 0: 5841.7. Samples: 895824750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:49,363][25689] Avg episode reward: [(0, '-0.737')] [2022-07-10 20:14:50,522][26022] Updated weights on worker 0-0, policy_version 874832 (0.00087) [2022-07-10 20:14:52,432][26022] Updated weights on worker 0-0, policy_version 874842 (0.00090) [2022-07-10 20:14:53,989][26022] Updated weights on worker 0-0, policy_version 874852 (0.00092) [2022-07-10 20:14:54,471][25689] Fps is (10 sec: 5567.3, 60 sec: 5537.6, 300 sec: 5536.1). Total num frames: 895849472. Throughput: 0: 5822.2. Samples: 895858412. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:54,471][25689] Avg episode reward: [(0, '-0.407')] [2022-07-10 20:14:56,145][26022] Updated weights on worker 0-0, policy_version 874862 (0.00091) [2022-07-10 20:14:57,755][26022] Updated weights on worker 0-0, policy_version 874872 (0.00607) [2022-07-10 20:14:59,479][25689] Fps is (10 sec: 5668.3, 60 sec: 5553.9, 300 sec: 5539.9). Total num frames: 895877120. Throughput: 0: 5005.7. Samples: 895875096. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:14:59,480][25689] Avg episode reward: [(0, '0.332')] [2022-07-10 20:14:59,641][26022] Updated weights on worker 0-0, policy_version 874882 (0.00087) [2022-07-10 20:15:02,076][26022] Updated weights on worker 0-0, policy_version 874892 (0.00097) [2022-07-10 20:15:02,190][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:15:02,199][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000874893_895890432.pth [2022-07-10 20:15:02,200][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000872945_893895680.pth [2022-07-10 20:15:03,546][26022] Updated weights on worker 0-0, policy_version 874902 (0.00086) [2022-07-10 20:15:04,527][25689] Fps is (10 sec: 5295.1, 60 sec: 5534.1, 300 sec: 5532.1). Total num frames: 895902720. Throughput: 0: 5721.3. Samples: 895906450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:04,527][25689] Avg episode reward: [(0, '0.388')] [2022-07-10 20:15:05,875][26022] Updated weights on worker 0-0, policy_version 874912 (0.00080) [2022-07-10 20:15:07,474][26022] Updated weights on worker 0-0, policy_version 874922 (0.00081) [2022-07-10 20:15:09,305][26022] Updated weights on worker 0-0, policy_version 874932 (0.00087) [2022-07-10 20:15:09,557][25689] Fps is (10 sec: 5385.4, 60 sec: 5538.6, 300 sec: 5536.3). Total num frames: 895931392. Throughput: 0: 5673.5. Samples: 895939322. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:09,558][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 20:15:11,196][26022] Updated weights on worker 0-0, policy_version 874942 (0.00087) [2022-07-10 20:15:12,936][26022] Updated weights on worker 0-0, policy_version 874952 (0.00092) [2022-07-10 20:15:14,622][25689] Fps is (10 sec: 5578.8, 60 sec: 5557.5, 300 sec: 5536.3). Total num frames: 895959040. Throughput: 0: 4862.6. Samples: 895956400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:14,623][25689] Avg episode reward: [(0, '0.838')] [2022-07-10 20:15:14,874][26022] Updated weights on worker 0-0, policy_version 874962 (0.00084) [2022-07-10 20:15:16,651][26022] Updated weights on worker 0-0, policy_version 874972 (0.00085) [2022-07-10 20:15:18,429][26022] Updated weights on worker 0-0, policy_version 874982 (0.00089) [2022-07-10 20:15:19,651][25689] Fps is (10 sec: 5579.7, 60 sec: 5523.4, 300 sec: 5532.4). Total num frames: 895987712. Throughput: 0: 5708.1. Samples: 895990236. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:19,652][25689] Avg episode reward: [(0, '0.802')] [2022-07-10 20:15:20,306][26022] Updated weights on worker 0-0, policy_version 874992 (0.00086) [2022-07-10 20:15:22,063][26022] Updated weights on worker 0-0, policy_version 875002 (0.00861) [2022-07-10 20:15:24,011][26022] Updated weights on worker 0-0, policy_version 875012 (0.00460) [2022-07-10 20:15:24,666][25689] Fps is (10 sec: 5709.5, 60 sec: 5558.4, 300 sec: 5535.8). Total num frames: 896016384. Throughput: 0: 5838.6. Samples: 896024034. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:24,668][25689] Avg episode reward: [(0, '0.126')] [2022-07-10 20:15:25,688][26022] Updated weights on worker 0-0, policy_version 875022 (0.00090) [2022-07-10 20:15:27,635][26022] Updated weights on worker 0-0, policy_version 875032 (0.00092) [2022-07-10 20:15:29,323][26022] Updated weights on worker 0-0, policy_version 875042 (0.00091) [2022-07-10 20:15:29,695][25689] Fps is (10 sec: 5607.3, 60 sec: 5558.1, 300 sec: 5536.0). Total num frames: 896044032. Throughput: 0: 5036.4. Samples: 896040744. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:29,697][25689] Avg episode reward: [(0, '-0.014')] [2022-07-10 20:15:31,329][26022] Updated weights on worker 0-0, policy_version 875052 (0.00088) [2022-07-10 20:15:33,059][26022] Updated weights on worker 0-0, policy_version 875062 (0.00101) [2022-07-10 20:15:34,747][25689] Fps is (10 sec: 5383.7, 60 sec: 5505.8, 300 sec: 5535.1). Total num frames: 896070656. Throughput: 0: 5840.0. Samples: 896073926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:34,747][25689] Avg episode reward: [(0, '0.142')] [2022-07-10 20:15:35,059][26022] Updated weights on worker 0-0, policy_version 875072 (0.00086) [2022-07-10 20:15:36,916][26022] Updated weights on worker 0-0, policy_version 875082 (0.00089) [2022-07-10 20:15:38,710][26022] Updated weights on worker 0-0, policy_version 875092 (0.00088) [2022-07-10 20:15:39,778][25689] Fps is (10 sec: 5585.3, 60 sec: 5539.6, 300 sec: 5538.1). Total num frames: 896100352. Throughput: 0: 5809.1. Samples: 896107160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:39,779][25689] Avg episode reward: [(0, '0.327')] [2022-07-10 20:15:40,542][26022] Updated weights on worker 0-0, policy_version 875102 (0.00087) [2022-07-10 20:15:42,395][26022] Updated weights on worker 0-0, policy_version 875112 (0.00091) [2022-07-10 20:15:44,057][26022] Updated weights on worker 0-0, policy_version 875122 (0.00096) [2022-07-10 20:15:44,851][25689] Fps is (10 sec: 5776.7, 60 sec: 5552.0, 300 sec: 5537.0). Total num frames: 896129024. Throughput: 0: 4949.7. Samples: 896123942. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:44,851][25689] Avg episode reward: [(0, '0.097')] [2022-07-10 20:15:46,315][26022] Updated weights on worker 0-0, policy_version 875132 (0.00087) [2022-07-10 20:15:47,648][26022] Updated weights on worker 0-0, policy_version 875142 (0.00087) [2022-07-10 20:15:49,847][26022] Updated weights on worker 0-0, policy_version 875152 (0.00081) [2022-07-10 20:15:49,873][25689] Fps is (10 sec: 5477.6, 60 sec: 5550.6, 300 sec: 5537.4). Total num frames: 896155648. Throughput: 0: 5790.0. Samples: 896157578. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:49,874][25689] Avg episode reward: [(0, '0.917')] [2022-07-10 20:15:51,380][26022] Updated weights on worker 0-0, policy_version 875162 (0.00085) [2022-07-10 20:15:53,536][26022] Updated weights on worker 0-0, policy_version 875172 (0.00093) [2022-07-10 20:15:54,983][25689] Fps is (10 sec: 5457.1, 60 sec: 5533.5, 300 sec: 5535.6). Total num frames: 896184320. Throughput: 0: 5780.9. Samples: 896190912. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:54,984][25689] Avg episode reward: [(0, '0.944')] [2022-07-10 20:15:55,228][26022] Updated weights on worker 0-0, policy_version 875182 (0.00089) [2022-07-10 20:15:57,071][26022] Updated weights on worker 0-0, policy_version 875192 (0.00094) [2022-07-10 20:15:58,912][26022] Updated weights on worker 0-0, policy_version 875202 (0.00085) [2022-07-10 20:15:59,995][25689] Fps is (10 sec: 5564.2, 60 sec: 5533.2, 300 sec: 5539.1). Total num frames: 896211968. Throughput: 0: 4968.2. Samples: 896207600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:15:59,996][25689] Avg episode reward: [(0, '1.026')] [2022-07-10 20:16:00,651][26022] Updated weights on worker 0-0, policy_version 875212 (0.00094) [2022-07-10 20:16:02,890][26022] Updated weights on worker 0-0, policy_version 875222 (0.00101) [2022-07-10 20:16:04,795][26022] Updated weights on worker 0-0, policy_version 875232 (0.00086) [2022-07-10 20:16:05,014][25689] Fps is (10 sec: 5308.4, 60 sec: 5535.8, 300 sec: 5535.6). Total num frames: 896237568. Throughput: 0: 5721.6. Samples: 896239310. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:05,015][25689] Avg episode reward: [(0, '0.926')] [2022-07-10 20:16:06,631][26022] Updated weights on worker 0-0, policy_version 875242 (0.00101) [2022-07-10 20:16:08,648][26022] Updated weights on worker 0-0, policy_version 875252 (0.00083) [2022-07-10 20:16:10,019][25689] Fps is (10 sec: 5312.2, 60 sec: 5521.2, 300 sec: 5536.4). Total num frames: 896265216. Throughput: 0: 5698.2. Samples: 896272372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:10,023][25689] Avg episode reward: [(0, '0.903')] [2022-07-10 20:16:10,306][26022] Updated weights on worker 0-0, policy_version 875262 (0.00081) [2022-07-10 20:16:12,319][26022] Updated weights on worker 0-0, policy_version 875272 (0.00085) [2022-07-10 20:16:13,801][26022] Updated weights on worker 0-0, policy_version 875282 (0.00095) [2022-07-10 20:16:15,153][25689] Fps is (10 sec: 5555.2, 60 sec: 5531.9, 300 sec: 5537.6). Total num frames: 896293888. Throughput: 0: 4871.5. Samples: 896289166. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:15,153][25689] Avg episode reward: [(0, '0.563')] [2022-07-10 20:16:16,034][26022] Updated weights on worker 0-0, policy_version 875292 (0.00092) [2022-07-10 20:16:17,634][26022] Updated weights on worker 0-0, policy_version 875302 (0.00091) [2022-07-10 20:16:19,607][26022] Updated weights on worker 0-0, policy_version 875312 (0.00084) [2022-07-10 20:16:20,183][25689] Fps is (10 sec: 5641.9, 60 sec: 5531.7, 300 sec: 5534.1). Total num frames: 896322560. Throughput: 0: 5695.1. Samples: 896322570. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:20,185][25689] Avg episode reward: [(0, '-0.486')] [2022-07-10 20:16:21,430][26022] Updated weights on worker 0-0, policy_version 875322 (0.00098) [2022-07-10 20:16:23,246][26022] Updated weights on worker 0-0, policy_version 875332 (0.00086) [2022-07-10 20:16:25,152][26022] Updated weights on worker 0-0, policy_version 875342 (0.00090) [2022-07-10 20:16:25,238][25689] Fps is (10 sec: 5584.1, 60 sec: 5511.2, 300 sec: 5533.2). Total num frames: 896350208. Throughput: 0: 5781.3. Samples: 896356230. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:25,240][25689] Avg episode reward: [(0, '-1.564')] [2022-07-10 20:16:26,905][26022] Updated weights on worker 0-0, policy_version 875352 (0.00089) [2022-07-10 20:16:28,719][26022] Updated weights on worker 0-0, policy_version 875362 (0.00089) [2022-07-10 20:16:30,270][25689] Fps is (10 sec: 5481.9, 60 sec: 5510.9, 300 sec: 5536.9). Total num frames: 896377856. Throughput: 0: 4965.7. Samples: 896372932. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:30,272][25689] Avg episode reward: [(0, '-1.710')] [2022-07-10 20:16:30,658][26022] Updated weights on worker 0-0, policy_version 875372 (0.00089) [2022-07-10 20:16:32,494][26022] Updated weights on worker 0-0, policy_version 875382 (0.00092) [2022-07-10 20:16:34,360][26022] Updated weights on worker 0-0, policy_version 875392 (0.00091) [2022-07-10 20:16:35,321][25689] Fps is (10 sec: 5687.7, 60 sec: 5561.7, 300 sec: 5536.2). Total num frames: 896407552. Throughput: 0: 5809.0. Samples: 896406320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:35,321][25689] Avg episode reward: [(0, '-1.262')] [2022-07-10 20:16:36,179][26022] Updated weights on worker 0-0, policy_version 875402 (0.00086) [2022-07-10 20:16:37,925][26022] Updated weights on worker 0-0, policy_version 875412 (0.00094) [2022-07-10 20:16:39,774][26022] Updated weights on worker 0-0, policy_version 875422 (0.00081) [2022-07-10 20:16:40,327][25689] Fps is (10 sec: 5702.0, 60 sec: 5530.3, 300 sec: 5536.3). Total num frames: 896435200. Throughput: 0: 5823.9. Samples: 896439886. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:40,327][25689] Avg episode reward: [(0, '-3.046')] [2022-07-10 20:16:41,675][26022] Updated weights on worker 0-0, policy_version 875432 (0.00094) [2022-07-10 20:16:43,447][26022] Updated weights on worker 0-0, policy_version 875442 (0.00082) [2022-07-10 20:16:45,332][25689] Fps is (10 sec: 5421.0, 60 sec: 5502.5, 300 sec: 5532.8). Total num frames: 896461824. Throughput: 0: 5002.8. Samples: 896456754. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:45,332][25689] Avg episode reward: [(0, '-2.707')] [2022-07-10 20:16:45,347][26022] Updated weights on worker 0-0, policy_version 875452 (0.00094) [2022-07-10 20:16:46,819][26022] Updated weights on worker 0-0, policy_version 875462 (0.00093) [2022-07-10 20:16:48,939][26022] Updated weights on worker 0-0, policy_version 875472 (0.00082) [2022-07-10 20:16:50,351][25689] Fps is (10 sec: 5618.4, 60 sec: 5553.7, 300 sec: 5538.2). Total num frames: 896491520. Throughput: 0: 5850.1. Samples: 896490408. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:50,351][25689] Avg episode reward: [(0, '-2.580')] [2022-07-10 20:16:50,615][26022] Updated weights on worker 0-0, policy_version 875482 (0.00083) [2022-07-10 20:16:52,670][26022] Updated weights on worker 0-0, policy_version 875492 (0.00091) [2022-07-10 20:16:54,403][26022] Updated weights on worker 0-0, policy_version 875502 (0.00086) [2022-07-10 20:16:55,392][25689] Fps is (10 sec: 5598.1, 60 sec: 5526.1, 300 sec: 5531.0). Total num frames: 896518144. Throughput: 0: 5870.5. Samples: 896524152. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:16:55,393][25689] Avg episode reward: [(0, '-1.512')] [2022-07-10 20:16:56,091][26022] Updated weights on worker 0-0, policy_version 875512 (0.00092) [2022-07-10 20:16:58,114][26022] Updated weights on worker 0-0, policy_version 875522 (0.00084) [2022-07-10 20:17:00,006][26022] Updated weights on worker 0-0, policy_version 875532 (0.00099) [2022-07-10 20:17:00,424][25689] Fps is (10 sec: 5489.3, 60 sec: 5541.2, 300 sec: 5545.0). Total num frames: 896546816. Throughput: 0: 5021.7. Samples: 896540810. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:00,425][25689] Avg episode reward: [(0, '-1.471')] [2022-07-10 20:17:01,831][26022] Updated weights on worker 0-0, policy_version 875542 (0.00084) [2022-07-10 20:17:02,387][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:17:02,401][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000875544_896557056.pth [2022-07-10 20:17:02,402][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000873595_894561280.pth [2022-07-10 20:17:03,937][26022] Updated weights on worker 0-0, policy_version 875552 (0.00091) [2022-07-10 20:17:05,431][25689] Fps is (10 sec: 5508.2, 60 sec: 5559.2, 300 sec: 5542.3). Total num frames: 896573440. Throughput: 0: 5748.4. Samples: 896572294. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:05,432][25689] Avg episode reward: [(0, '-1.652')] [2022-07-10 20:17:05,719][26022] Updated weights on worker 0-0, policy_version 875562 (0.00083) [2022-07-10 20:17:07,614][26022] Updated weights on worker 0-0, policy_version 875572 (0.00092) [2022-07-10 20:17:09,482][26022] Updated weights on worker 0-0, policy_version 875582 (0.00086) [2022-07-10 20:17:10,451][25689] Fps is (10 sec: 5412.9, 60 sec: 5557.9, 300 sec: 5534.0). Total num frames: 896601088. Throughput: 0: 5729.1. Samples: 896605562. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:10,451][25689] Avg episode reward: [(0, '-0.066')] [2022-07-10 20:17:11,235][26022] Updated weights on worker 0-0, policy_version 875592 (0.00087) [2022-07-10 20:17:13,273][26022] Updated weights on worker 0-0, policy_version 875602 (0.00090) [2022-07-10 20:17:15,043][26022] Updated weights on worker 0-0, policy_version 875612 (0.00084) [2022-07-10 20:17:15,483][25689] Fps is (10 sec: 5501.0, 60 sec: 5550.2, 300 sec: 5538.2). Total num frames: 896628736. Throughput: 0: 4888.5. Samples: 896622366. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:15,484][25689] Avg episode reward: [(0, '-0.222')] [2022-07-10 20:17:16,935][26022] Updated weights on worker 0-0, policy_version 875622 (0.00079) [2022-07-10 20:17:18,638][26022] Updated weights on worker 0-0, policy_version 875632 (0.00086) [2022-07-10 20:17:20,421][26022] Updated weights on worker 0-0, policy_version 875642 (0.00088) [2022-07-10 20:17:20,495][25689] Fps is (10 sec: 5607.0, 60 sec: 5551.9, 300 sec: 5538.4). Total num frames: 896657408. Throughput: 0: 5738.7. Samples: 896655992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:20,496][25689] Avg episode reward: [(0, '0.497')] [2022-07-10 20:17:22,179][26022] Updated weights on worker 0-0, policy_version 875652 (0.00056) [2022-07-10 20:17:24,336][26022] Updated weights on worker 0-0, policy_version 875662 (0.00094) [2022-07-10 20:17:25,519][25689] Fps is (10 sec: 5714.2, 60 sec: 5571.8, 300 sec: 5538.7). Total num frames: 896686080. Throughput: 0: 5851.8. Samples: 896689842. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:25,519][25689] Avg episode reward: [(0, '0.426')] [2022-07-10 20:17:25,904][26022] Updated weights on worker 0-0, policy_version 875672 (0.00093) [2022-07-10 20:17:27,863][26022] Updated weights on worker 0-0, policy_version 875682 (0.00089) [2022-07-10 20:17:29,612][26022] Updated weights on worker 0-0, policy_version 875692 (0.00086) [2022-07-10 20:17:30,533][25689] Fps is (10 sec: 5508.9, 60 sec: 5556.4, 300 sec: 5535.9). Total num frames: 896712704. Throughput: 0: 5041.2. Samples: 896706800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:30,533][25689] Avg episode reward: [(0, '0.541')] [2022-07-10 20:17:31,322][26022] Updated weights on worker 0-0, policy_version 875702 (0.00086) [2022-07-10 20:17:33,211][26022] Updated weights on worker 0-0, policy_version 875712 (0.00087) [2022-07-10 20:17:35,167][26022] Updated weights on worker 0-0, policy_version 875722 (0.00085) [2022-07-10 20:17:35,595][25689] Fps is (10 sec: 5589.1, 60 sec: 5555.4, 300 sec: 5539.5). Total num frames: 896742400. Throughput: 0: 5873.0. Samples: 896740484. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:35,596][25689] Avg episode reward: [(0, '0.443')] [2022-07-10 20:17:36,873][26022] Updated weights on worker 0-0, policy_version 875732 (0.00088) [2022-07-10 20:17:38,723][26022] Updated weights on worker 0-0, policy_version 875742 (0.00553) [2022-07-10 20:17:40,601][25689] Fps is (10 sec: 5593.6, 60 sec: 5538.4, 300 sec: 5537.1). Total num frames: 896769024. Throughput: 0: 5862.1. Samples: 896773856. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:40,602][25689] Avg episode reward: [(0, '0.111')] [2022-07-10 20:17:40,649][26022] Updated weights on worker 0-0, policy_version 875752 (0.00085) [2022-07-10 20:17:42,354][26022] Updated weights on worker 0-0, policy_version 875762 (0.00094) [2022-07-10 20:17:44,391][26022] Updated weights on worker 0-0, policy_version 875772 (0.00859) [2022-07-10 20:17:45,615][25689] Fps is (10 sec: 5621.2, 60 sec: 5588.6, 300 sec: 5541.2). Total num frames: 896798720. Throughput: 0: 5010.4. Samples: 896790530. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:45,616][25689] Avg episode reward: [(0, '0.243')] [2022-07-10 20:17:45,990][26022] Updated weights on worker 0-0, policy_version 875782 (0.00089) [2022-07-10 20:17:47,919][26022] Updated weights on worker 0-0, policy_version 875792 (0.00084) [2022-07-10 20:17:49,835][26022] Updated weights on worker 0-0, policy_version 875802 (0.00086) [2022-07-10 20:17:50,620][25689] Fps is (10 sec: 5724.1, 60 sec: 5555.9, 300 sec: 5540.2). Total num frames: 896826368. Throughput: 0: 5857.1. Samples: 896824450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-10 20:17:50,620][25689] Avg episode reward: [(0, '-0.236')] [2022-07-10 20:17:51,552][26022] Updated weights on worker 0-0, policy_version 875812 (0.00089) [2022-07-10 20:17:53,411][26022] Updated weights on worker 0-0, policy_version 875822 (0.00085) [2022-07-10 20:17:55,187][26022] Updated weights on worker 0-0, policy_version 875832 (0.00088) [2022-07-10 20:17:55,801][25689] Fps is (10 sec: 5428.3, 60 sec: 5560.0, 300 sec: 5537.2). Total num frames: 896854016. Throughput: 0: 5831.2. Samples: 896858308. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:17:55,802][25689] Avg episode reward: [(0, '-1.406')] [2022-07-10 20:17:57,007][26022] Updated weights on worker 0-0, policy_version 875842 (0.00085) [2022-07-10 20:17:59,043][26022] Updated weights on worker 0-0, policy_version 875852 (0.00081) [2022-07-10 20:18:00,723][26022] Updated weights on worker 0-0, policy_version 875862 (0.00090) [2022-07-10 20:18:00,806][25689] Fps is (10 sec: 5529.2, 60 sec: 5562.5, 300 sec: 5547.7). Total num frames: 896882688. Throughput: 0: 5836.2. Samples: 896891770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:00,806][25689] Avg episode reward: [(0, '-1.914')] [2022-07-10 20:18:02,880][26022] Updated weights on worker 0-0, policy_version 875872 (0.00088) [2022-07-10 20:18:04,839][26022] Updated weights on worker 0-0, policy_version 875882 (0.00087) [2022-07-10 20:18:05,831][25689] Fps is (10 sec: 5411.4, 60 sec: 5543.9, 300 sec: 5540.8). Total num frames: 896908288. Throughput: 0: 5723.1. Samples: 896906228. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:05,831][25689] Avg episode reward: [(0, '-2.179')] [2022-07-10 20:18:06,726][26022] Updated weights on worker 0-0, policy_version 875892 (0.00091) [2022-07-10 20:18:08,568][26022] Updated weights on worker 0-0, policy_version 875902 (0.00089) [2022-07-10 20:18:10,415][26022] Updated weights on worker 0-0, policy_version 875912 (0.00094) [2022-07-10 20:18:10,875][25689] Fps is (10 sec: 5288.1, 60 sec: 5541.6, 300 sec: 5535.2). Total num frames: 896935936. Throughput: 0: 5670.9. Samples: 896939316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:10,876][25689] Avg episode reward: [(0, '-1.746')] [2022-07-10 20:18:12,342][26022] Updated weights on worker 0-0, policy_version 875922 (0.00088) [2022-07-10 20:18:14,181][26022] Updated weights on worker 0-0, policy_version 875932 (0.00087) [2022-07-10 20:18:15,984][25689] Fps is (10 sec: 5446.2, 60 sec: 5534.6, 300 sec: 5536.7). Total num frames: 896963584. Throughput: 0: 5666.1. Samples: 896972664. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:15,984][25689] Avg episode reward: [(0, '-1.997')] [2022-07-10 20:18:15,996][26022] Updated weights on worker 0-0, policy_version 875942 (0.00085) [2022-07-10 20:18:17,715][26022] Updated weights on worker 0-0, policy_version 875952 (0.00406) [2022-07-10 20:18:19,679][26022] Updated weights on worker 0-0, policy_version 875962 (0.00082) [2022-07-10 20:18:21,022][25689] Fps is (10 sec: 5651.1, 60 sec: 5549.1, 300 sec: 5537.0). Total num frames: 896993280. Throughput: 0: 4835.5. Samples: 896989528. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:21,023][25689] Avg episode reward: [(0, '-1.549')] [2022-07-10 20:18:21,175][26022] Updated weights on worker 0-0, policy_version 875972 (0.00092) [2022-07-10 20:18:23,366][26022] Updated weights on worker 0-0, policy_version 875982 (0.00092) [2022-07-10 20:18:24,820][26022] Updated weights on worker 0-0, policy_version 875992 (0.00088) [2022-07-10 20:18:26,028][25689] Fps is (10 sec: 5505.5, 60 sec: 5500.0, 300 sec: 5534.4). Total num frames: 897018880. Throughput: 0: 5786.9. Samples: 897023106. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:26,028][25689] Avg episode reward: [(0, '-0.525')] [2022-07-10 20:18:26,975][26022] Updated weights on worker 0-0, policy_version 876002 (0.00091) [2022-07-10 20:18:28,825][26022] Updated weights on worker 0-0, policy_version 876012 (0.00091) [2022-07-10 20:18:30,529][26022] Updated weights on worker 0-0, policy_version 876022 (0.00087) [2022-07-10 20:18:31,059][25689] Fps is (10 sec: 5509.7, 60 sec: 5549.3, 300 sec: 5542.3). Total num frames: 897048576. Throughput: 0: 5782.4. Samples: 897056026. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:31,059][25689] Avg episode reward: [(0, '-0.087')] [2022-07-10 20:18:32,530][26022] Updated weights on worker 0-0, policy_version 876032 (0.00086) [2022-07-10 20:18:34,256][26022] Updated weights on worker 0-0, policy_version 876042 (0.00093) [2022-07-10 20:18:36,185][25689] Fps is (10 sec: 5645.7, 60 sec: 5509.6, 300 sec: 5536.7). Total num frames: 897076224. Throughput: 0: 4950.1. Samples: 897072660. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:36,185][25689] Avg episode reward: [(0, '0.414')] [2022-07-10 20:18:36,196][26022] Updated weights on worker 0-0, policy_version 876052 (0.00053) [2022-07-10 20:18:38,061][26022] Updated weights on worker 0-0, policy_version 876062 (0.00091) [2022-07-10 20:18:39,995][26022] Updated weights on worker 0-0, policy_version 876072 (0.00108) [2022-07-10 20:18:41,281][25689] Fps is (10 sec: 5409.2, 60 sec: 5518.3, 300 sec: 5536.0). Total num frames: 897103872. Throughput: 0: 5738.9. Samples: 897105790. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:41,282][25689] Avg episode reward: [(0, '0.410')] [2022-07-10 20:18:41,759][26022] Updated weights on worker 0-0, policy_version 876082 (0.00084) [2022-07-10 20:18:43,623][26022] Updated weights on worker 0-0, policy_version 876092 (0.00087) [2022-07-10 20:18:45,315][26022] Updated weights on worker 0-0, policy_version 876102 (0.00089) [2022-07-10 20:18:46,294][25689] Fps is (10 sec: 5469.7, 60 sec: 5484.6, 300 sec: 5536.5). Total num frames: 897131520. Throughput: 0: 5733.3. Samples: 897139300. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:46,295][25689] Avg episode reward: [(0, '0.465')] [2022-07-10 20:18:47,390][26022] Updated weights on worker 0-0, policy_version 876112 (0.00873) [2022-07-10 20:18:49,206][26022] Updated weights on worker 0-0, policy_version 876122 (0.00092) [2022-07-10 20:18:51,069][26022] Updated weights on worker 0-0, policy_version 876132 (0.00094) [2022-07-10 20:18:51,301][25689] Fps is (10 sec: 5518.8, 60 sec: 5484.4, 300 sec: 5531.6). Total num frames: 897159168. Throughput: 0: 4935.3. Samples: 897155928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:51,301][25689] Avg episode reward: [(0, '0.553')] [2022-07-10 20:18:53,016][26022] Updated weights on worker 0-0, policy_version 876142 (0.00084) [2022-07-10 20:18:54,996][26022] Updated weights on worker 0-0, policy_version 876152 (0.00087) [2022-07-10 20:18:56,359][25689] Fps is (10 sec: 5595.7, 60 sec: 5512.5, 300 sec: 5537.5). Total num frames: 897187840. Throughput: 0: 5743.7. Samples: 897188536. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:18:56,360][25689] Avg episode reward: [(0, '0.616')] [2022-07-10 20:18:56,659][26022] Updated weights on worker 0-0, policy_version 876162 (0.00089) [2022-07-10 20:18:58,736][26022] Updated weights on worker 0-0, policy_version 876172 (0.00082) [2022-07-10 20:19:00,247][26022] Updated weights on worker 0-0, policy_version 876182 (0.00083) [2022-07-10 20:19:01,407][25689] Fps is (10 sec: 5471.7, 60 sec: 5474.7, 300 sec: 5536.9). Total num frames: 897214464. Throughput: 0: 5772.7. Samples: 897221968. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:01,415][25689] Avg episode reward: [(0, '0.564')] [2022-07-10 20:19:02,510][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:19:02,522][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000876191_897219584.pth [2022-07-10 20:19:02,522][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000874245_895226880.pth [2022-07-10 20:19:02,646][26022] Updated weights on worker 0-0, policy_version 876192 (0.00084) [2022-07-10 20:19:04,437][26022] Updated weights on worker 0-0, policy_version 876202 (0.00086) [2022-07-10 20:19:06,278][26022] Updated weights on worker 0-0, policy_version 876212 (0.00089) [2022-07-10 20:19:06,417][25689] Fps is (10 sec: 5395.7, 60 sec: 5509.8, 300 sec: 5534.7). Total num frames: 897242112. Throughput: 0: 4836.3. Samples: 897236622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:06,424][25689] Avg episode reward: [(0, '0.409')] [2022-07-10 20:19:08,166][26022] Updated weights on worker 0-0, policy_version 876222 (0.00086) [2022-07-10 20:19:09,969][26022] Updated weights on worker 0-0, policy_version 876232 (0.00087) [2022-07-10 20:19:11,447][25689] Fps is (10 sec: 5405.3, 60 sec: 5494.3, 300 sec: 5535.8). Total num frames: 897268736. Throughput: 0: 5661.6. Samples: 897269988. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:11,449][25689] Avg episode reward: [(0, '-1.278')] [2022-07-10 20:19:11,808][26022] Updated weights on worker 0-0, policy_version 876242 (0.00084) [2022-07-10 20:19:13,727][26022] Updated weights on worker 0-0, policy_version 876252 (0.00085) [2022-07-10 20:19:15,313][26022] Updated weights on worker 0-0, policy_version 876262 (0.00087) [2022-07-10 20:19:16,556][25689] Fps is (10 sec: 5555.2, 60 sec: 5528.1, 300 sec: 5530.8). Total num frames: 897298432. Throughput: 0: 5686.6. Samples: 897303386. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:16,556][25689] Avg episode reward: [(0, '-1.095')] [2022-07-10 20:19:17,452][26022] Updated weights on worker 0-0, policy_version 876272 (0.00085) [2022-07-10 20:19:19,014][26022] Updated weights on worker 0-0, policy_version 876282 (0.00054) [2022-07-10 20:19:20,974][26022] Updated weights on worker 0-0, policy_version 876292 (0.00089) [2022-07-10 20:19:21,586][25689] Fps is (10 sec: 5655.6, 60 sec: 5495.0, 300 sec: 5534.2). Total num frames: 897326080. Throughput: 0: 4867.5. Samples: 897320192. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:21,587][25689] Avg episode reward: [(0, '-0.743')] [2022-07-10 20:19:22,639][26022] Updated weights on worker 0-0, policy_version 876302 (0.00091) [2022-07-10 20:19:24,550][26022] Updated weights on worker 0-0, policy_version 876312 (0.00094) [2022-07-10 20:19:26,405][26022] Updated weights on worker 0-0, policy_version 876322 (0.00086) [2022-07-10 20:19:26,595][25689] Fps is (10 sec: 5507.8, 60 sec: 5528.5, 300 sec: 5534.5). Total num frames: 897353728. Throughput: 0: 5808.8. Samples: 897353832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:26,597][25689] Avg episode reward: [(0, '-0.621')] [2022-07-10 20:19:28,319][26022] Updated weights on worker 0-0, policy_version 876332 (0.00112) [2022-07-10 20:19:30,155][26022] Updated weights on worker 0-0, policy_version 876342 (0.00087) [2022-07-10 20:19:31,603][25689] Fps is (10 sec: 5520.2, 60 sec: 5496.7, 300 sec: 5528.1). Total num frames: 897381376. Throughput: 0: 5802.1. Samples: 897386936. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:31,605][25689] Avg episode reward: [(0, '-0.652')] [2022-07-10 20:19:32,015][26022] Updated weights on worker 0-0, policy_version 876352 (0.00085) [2022-07-10 20:19:33,936][26022] Updated weights on worker 0-0, policy_version 876362 (0.00099) [2022-07-10 20:19:35,533][26022] Updated weights on worker 0-0, policy_version 876372 (0.00089) [2022-07-10 20:19:36,693][25689] Fps is (10 sec: 5577.4, 60 sec: 5517.0, 300 sec: 5530.5). Total num frames: 897410048. Throughput: 0: 4990.2. Samples: 897403876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:36,694][25689] Avg episode reward: [(0, '-0.488')] [2022-07-10 20:19:37,590][26022] Updated weights on worker 0-0, policy_version 876382 (0.00095) [2022-07-10 20:19:39,275][26022] Updated weights on worker 0-0, policy_version 876392 (0.00081) [2022-07-10 20:19:41,198][26022] Updated weights on worker 0-0, policy_version 876402 (0.00614) [2022-07-10 20:19:41,697][25689] Fps is (10 sec: 5681.3, 60 sec: 5542.4, 300 sec: 5534.3). Total num frames: 897438720. Throughput: 0: 5831.3. Samples: 897437462. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:41,698][25689] Avg episode reward: [(0, '1.173')] [2022-07-10 20:19:43,009][26022] Updated weights on worker 0-0, policy_version 876412 (0.00086) [2022-07-10 20:19:44,767][26022] Updated weights on worker 0-0, policy_version 876422 (0.00083) [2022-07-10 20:19:46,679][26022] Updated weights on worker 0-0, policy_version 876432 (0.00080) [2022-07-10 20:19:46,791][25689] Fps is (10 sec: 5577.5, 60 sec: 5535.0, 300 sec: 5536.1). Total num frames: 897466368. Throughput: 0: 5799.5. Samples: 897470956. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:46,791][25689] Avg episode reward: [(0, '1.275')] [2022-07-10 20:19:48,407][26022] Updated weights on worker 0-0, policy_version 876442 (0.00095) [2022-07-10 20:19:50,307][26022] Updated weights on worker 0-0, policy_version 876452 (0.00082) [2022-07-10 20:19:51,848][25689] Fps is (10 sec: 5447.5, 60 sec: 5530.4, 300 sec: 5530.2). Total num frames: 897494016. Throughput: 0: 4979.2. Samples: 897487736. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:51,849][25689] Avg episode reward: [(0, '0.988')] [2022-07-10 20:19:52,147][26022] Updated weights on worker 0-0, policy_version 876462 (0.00069) [2022-07-10 20:19:53,948][26022] Updated weights on worker 0-0, policy_version 876472 (0.00088) [2022-07-10 20:19:55,905][26022] Updated weights on worker 0-0, policy_version 876482 (0.00086) [2022-07-10 20:19:56,933][25689] Fps is (10 sec: 5553.2, 60 sec: 5527.9, 300 sec: 5532.2). Total num frames: 897522688. Throughput: 0: 5794.3. Samples: 897521150. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:19:56,933][25689] Avg episode reward: [(0, '0.571')] [2022-07-10 20:19:57,580][26022] Updated weights on worker 0-0, policy_version 876492 (0.00088) [2022-07-10 20:19:59,609][26022] Updated weights on worker 0-0, policy_version 876502 (0.00098) [2022-07-10 20:20:01,429][26022] Updated weights on worker 0-0, policy_version 876512 (0.00088) [2022-07-10 20:20:01,979][25689] Fps is (10 sec: 5660.2, 60 sec: 5561.9, 300 sec: 5542.5). Total num frames: 897551360. Throughput: 0: 5786.6. Samples: 897554824. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:01,979][25689] Avg episode reward: [(0, '0.549')] [2022-07-10 20:20:03,532][26022] Updated weights on worker 0-0, policy_version 876522 (0.00093) [2022-07-10 20:20:05,460][26022] Updated weights on worker 0-0, policy_version 876532 (0.00087) [2022-07-10 20:20:06,996][25689] Fps is (10 sec: 5291.3, 60 sec: 5510.6, 300 sec: 5529.0). Total num frames: 897575936. Throughput: 0: 5709.8. Samples: 897586324. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:06,997][25689] Avg episode reward: [(0, '0.712')] [2022-07-10 20:20:07,353][26022] Updated weights on worker 0-0, policy_version 876542 (0.00088) [2022-07-10 20:20:09,135][26022] Updated weights on worker 0-0, policy_version 876552 (0.00088) [2022-07-10 20:20:11,048][26022] Updated weights on worker 0-0, policy_version 876562 (0.00084) [2022-07-10 20:20:12,041][25689] Fps is (10 sec: 5393.7, 60 sec: 5559.9, 300 sec: 5536.3). Total num frames: 897605632. Throughput: 0: 5710.9. Samples: 897603058. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:12,042][25689] Avg episode reward: [(0, '0.220')] [2022-07-10 20:20:12,825][26022] Updated weights on worker 0-0, policy_version 876572 (0.00087) [2022-07-10 20:20:14,561][26022] Updated weights on worker 0-0, policy_version 876582 (0.00090) [2022-07-10 20:20:16,567][26022] Updated weights on worker 0-0, policy_version 876592 (0.00092) [2022-07-10 20:20:17,179][25689] Fps is (10 sec: 5631.1, 60 sec: 5523.4, 300 sec: 5530.8). Total num frames: 897633280. Throughput: 0: 5694.2. Samples: 897636438. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:17,180][25689] Avg episode reward: [(0, '-0.643')] [2022-07-10 20:20:18,210][26022] Updated weights on worker 0-0, policy_version 876602 (0.00095) [2022-07-10 20:20:20,104][26022] Updated weights on worker 0-0, policy_version 876612 (0.00087) [2022-07-10 20:20:21,995][26022] Updated weights on worker 0-0, policy_version 876622 (0.00092) [2022-07-10 20:20:22,245][25689] Fps is (10 sec: 5519.4, 60 sec: 5537.1, 300 sec: 5529.9). Total num frames: 897661952. Throughput: 0: 5699.5. Samples: 897670330. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:22,245][25689] Avg episode reward: [(0, '-0.987')] [2022-07-10 20:20:23,699][26022] Updated weights on worker 0-0, policy_version 876632 (0.00411) [2022-07-10 20:20:25,696][26022] Updated weights on worker 0-0, policy_version 876642 (0.00096) [2022-07-10 20:20:27,297][25689] Fps is (10 sec: 5768.9, 60 sec: 5566.9, 300 sec: 5536.3). Total num frames: 897691648. Throughput: 0: 4963.6. Samples: 897687090. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:27,299][25689] Avg episode reward: [(0, '-0.680')] [2022-07-10 20:20:27,307][26022] Updated weights on worker 0-0, policy_version 876652 (0.00098) [2022-07-10 20:20:29,230][26022] Updated weights on worker 0-0, policy_version 876662 (0.00085) [2022-07-10 20:20:30,946][26022] Updated weights on worker 0-0, policy_version 876672 (0.00095) [2022-07-10 20:20:32,316][25689] Fps is (10 sec: 5592.2, 60 sec: 5549.0, 300 sec: 5536.9). Total num frames: 897718272. Throughput: 0: 5800.9. Samples: 897720672. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:32,317][25689] Avg episode reward: [(0, '-1.455')] [2022-07-10 20:20:32,836][26022] Updated weights on worker 0-0, policy_version 876682 (0.00092) [2022-07-10 20:20:34,758][26022] Updated weights on worker 0-0, policy_version 876692 (0.00091) [2022-07-10 20:20:36,483][26022] Updated weights on worker 0-0, policy_version 876702 (0.00083) [2022-07-10 20:20:37,403][25689] Fps is (10 sec: 5471.6, 60 sec: 5549.3, 300 sec: 5532.4). Total num frames: 897746944. Throughput: 0: 5832.6. Samples: 897754394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:37,404][25689] Avg episode reward: [(0, '-1.553')] [2022-07-10 20:20:38,336][26022] Updated weights on worker 0-0, policy_version 876712 (0.00100) [2022-07-10 20:20:40,240][26022] Updated weights on worker 0-0, policy_version 876722 (0.00085) [2022-07-10 20:20:42,035][26022] Updated weights on worker 0-0, policy_version 876732 (0.00080) [2022-07-10 20:20:42,412][25689] Fps is (10 sec: 5679.9, 60 sec: 5548.8, 300 sec: 5533.6). Total num frames: 897775616. Throughput: 0: 5002.9. Samples: 897771224. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:42,412][25689] Avg episode reward: [(0, '-1.171')] [2022-07-10 20:20:43,968][26022] Updated weights on worker 0-0, policy_version 876742 (0.00081) [2022-07-10 20:20:45,621][26022] Updated weights on worker 0-0, policy_version 876752 (0.00086) [2022-07-10 20:20:47,432][25689] Fps is (10 sec: 5513.5, 60 sec: 5538.7, 300 sec: 5533.7). Total num frames: 897802240. Throughput: 0: 5834.0. Samples: 897804558. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:47,434][25689] Avg episode reward: [(0, '-0.919')] [2022-07-10 20:20:47,795][26022] Updated weights on worker 0-0, policy_version 876762 (0.00109) [2022-07-10 20:20:49,367][26022] Updated weights on worker 0-0, policy_version 876772 (0.00091) [2022-07-10 20:20:51,288][26022] Updated weights on worker 0-0, policy_version 876782 (0.00092) [2022-07-10 20:20:52,460][25689] Fps is (10 sec: 5605.2, 60 sec: 5575.1, 300 sec: 5538.6). Total num frames: 897831936. Throughput: 0: 5825.6. Samples: 897838022. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:52,460][25689] Avg episode reward: [(0, '-0.817')] [2022-07-10 20:20:52,906][26022] Updated weights on worker 0-0, policy_version 876792 (0.00093) [2022-07-10 20:20:55,003][26022] Updated weights on worker 0-0, policy_version 876802 (0.00092) [2022-07-10 20:20:56,802][26022] Updated weights on worker 0-0, policy_version 876812 (0.00084) [2022-07-10 20:20:57,522][25689] Fps is (10 sec: 5683.3, 60 sec: 5560.3, 300 sec: 5537.7). Total num frames: 897859584. Throughput: 0: 4986.0. Samples: 897854708. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:20:57,523][25689] Avg episode reward: [(0, '-1.224')] [2022-07-10 20:20:58,568][26022] Updated weights on worker 0-0, policy_version 876822 (0.00090) [2022-07-10 20:21:00,372][26022] Updated weights on worker 0-0, policy_version 876832 (0.00102) [2022-07-10 20:21:02,539][25689] Fps is (10 sec: 5283.0, 60 sec: 5512.3, 300 sec: 5537.8). Total num frames: 897885184. Throughput: 0: 5812.0. Samples: 897888202. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:02,539][25689] Avg episode reward: [(0, '-0.699')] [2022-07-10 20:21:02,605][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:21:02,631][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000876842_897886208.pth [2022-07-10 20:21:02,632][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000874893_895890432.pth [2022-07-10 20:21:02,636][26022] Updated weights on worker 0-0, policy_version 876842 (0.00078) [2022-07-10 20:21:04,494][26022] Updated weights on worker 0-0, policy_version 876852 (0.00083) [2022-07-10 20:21:06,352][26022] Updated weights on worker 0-0, policy_version 876862 (0.00092) [2022-07-10 20:21:07,563][25689] Fps is (10 sec: 5303.1, 60 sec: 5562.3, 300 sec: 5537.4). Total num frames: 897912832. Throughput: 0: 5714.7. Samples: 897919600. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:07,564][25689] Avg episode reward: [(0, '0.072')] [2022-07-10 20:21:08,161][26022] Updated weights on worker 0-0, policy_version 876872 (0.00102) [2022-07-10 20:21:10,084][26022] Updated weights on worker 0-0, policy_version 876882 (0.00094) [2022-07-10 20:21:11,871][26022] Updated weights on worker 0-0, policy_version 876892 (0.00087) [2022-07-10 20:21:12,580][25689] Fps is (10 sec: 5506.9, 60 sec: 5531.1, 300 sec: 5536.1). Total num frames: 897940480. Throughput: 0: 4885.3. Samples: 897936316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:12,581][25689] Avg episode reward: [(0, '0.126')] [2022-07-10 20:21:13,884][26022] Updated weights on worker 0-0, policy_version 876902 (0.00090) [2022-07-10 20:21:15,643][26022] Updated weights on worker 0-0, policy_version 876912 (0.00087) [2022-07-10 20:21:17,596][26022] Updated weights on worker 0-0, policy_version 876922 (0.00092) [2022-07-10 20:21:17,615][25689] Fps is (10 sec: 5501.4, 60 sec: 5540.6, 300 sec: 5532.6). Total num frames: 897968128. Throughput: 0: 5703.5. Samples: 897969304. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:17,615][25689] Avg episode reward: [(0, '-0.510')] [2022-07-10 20:21:19,249][26022] Updated weights on worker 0-0, policy_version 876932 (0.00083) [2022-07-10 20:21:21,054][26022] Updated weights on worker 0-0, policy_version 876942 (0.00085) [2022-07-10 20:21:22,646][25689] Fps is (10 sec: 5594.8, 60 sec: 5543.7, 300 sec: 5536.5). Total num frames: 897996800. Throughput: 0: 5709.8. Samples: 898003012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:22,647][25689] Avg episode reward: [(0, '-0.591')] [2022-07-10 20:21:22,897][26022] Updated weights on worker 0-0, policy_version 876952 (0.00091) [2022-07-10 20:21:24,799][26022] Updated weights on worker 0-0, policy_version 876962 (0.00082) [2022-07-10 20:21:26,706][26022] Updated weights on worker 0-0, policy_version 876972 (0.00096) [2022-07-10 20:21:27,667][25689] Fps is (10 sec: 5602.9, 60 sec: 5512.7, 300 sec: 5536.7). Total num frames: 898024448. Throughput: 0: 4994.8. Samples: 898020012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:27,668][25689] Avg episode reward: [(0, '-0.320')] [2022-07-10 20:21:28,417][26022] Updated weights on worker 0-0, policy_version 876982 (0.00092) [2022-07-10 20:21:30,256][26022] Updated weights on worker 0-0, policy_version 876992 (0.00092) [2022-07-10 20:21:32,320][26022] Updated weights on worker 0-0, policy_version 877002 (0.00092) [2022-07-10 20:21:32,678][25689] Fps is (10 sec: 5512.0, 60 sec: 5530.3, 300 sec: 5530.5). Total num frames: 898052096. Throughput: 0: 5800.1. Samples: 898052888. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:32,679][25689] Avg episode reward: [(0, '-0.296')] [2022-07-10 20:21:34,039][26022] Updated weights on worker 0-0, policy_version 877012 (0.00090) [2022-07-10 20:21:35,735][26022] Updated weights on worker 0-0, policy_version 877022 (0.00089) [2022-07-10 20:21:37,488][26022] Updated weights on worker 0-0, policy_version 877032 (0.00095) [2022-07-10 20:21:37,787][25689] Fps is (10 sec: 5666.1, 60 sec: 5545.3, 300 sec: 5535.5). Total num frames: 898081792. Throughput: 0: 5807.8. Samples: 898086462. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 20:21:37,788][25689] Avg episode reward: [(0, '-1.060')] [2022-07-10 20:21:39,508][26022] Updated weights on worker 0-0, policy_version 877042 (0.00086) [2022-07-10 20:21:41,423][26022] Updated weights on worker 0-0, policy_version 877052 (0.00088) [2022-07-10 20:21:42,818][25689] Fps is (10 sec: 5554.2, 60 sec: 5509.3, 300 sec: 5535.0). Total num frames: 898108416. Throughput: 0: 4960.7. Samples: 898103080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:21:42,819][25689] Avg episode reward: [(0, '-1.025')] [2022-07-10 20:21:43,262][26022] Updated weights on worker 0-0, policy_version 877062 (0.00091) [2022-07-10 20:21:45,119][26022] Updated weights on worker 0-0, policy_version 877072 (0.00084) [2022-07-10 20:21:46,940][26022] Updated weights on worker 0-0, policy_version 877082 (0.00086) [2022-07-10 20:21:47,837][25689] Fps is (10 sec: 5400.1, 60 sec: 5526.4, 300 sec: 5528.1). Total num frames: 898136064. Throughput: 0: 5768.4. Samples: 898136364. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:21:47,839][25689] Avg episode reward: [(0, '-1.022')] [2022-07-10 20:21:48,706][26022] Updated weights on worker 0-0, policy_version 877092 (0.00083) [2022-07-10 20:21:50,573][26022] Updated weights on worker 0-0, policy_version 877102 (0.00093) [2022-07-10 20:21:52,378][26022] Updated weights on worker 0-0, policy_version 877112 (0.00085) [2022-07-10 20:21:52,892][25689] Fps is (10 sec: 5591.1, 60 sec: 5507.0, 300 sec: 5534.8). Total num frames: 898164736. Throughput: 0: 5796.5. Samples: 898170054. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:21:52,892][25689] Avg episode reward: [(0, '-1.096')] [2022-07-10 20:21:54,279][26022] Updated weights on worker 0-0, policy_version 877122 (0.00091) [2022-07-10 20:21:55,995][26022] Updated weights on worker 0-0, policy_version 877132 (0.00091) [2022-07-10 20:21:57,927][26022] Updated weights on worker 0-0, policy_version 877142 (0.00094) [2022-07-10 20:21:58,025][25689] Fps is (10 sec: 5629.0, 60 sec: 5517.5, 300 sec: 5532.9). Total num frames: 898193408. Throughput: 0: 4958.6. Samples: 898186812. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:21:58,025][25689] Avg episode reward: [(0, '-1.176')] [2022-07-10 20:21:59,819][26022] Updated weights on worker 0-0, policy_version 877152 (0.00084) [2022-07-10 20:22:01,384][26022] Updated weights on worker 0-0, policy_version 877162 (0.00098) [2022-07-10 20:22:03,072][25689] Fps is (10 sec: 5431.5, 60 sec: 5531.6, 300 sec: 5532.1). Total num frames: 898220032. Throughput: 0: 5801.2. Samples: 898220576. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:03,073][25689] Avg episode reward: [(0, '-1.201')] [2022-07-10 20:22:03,896][26022] Updated weights on worker 0-0, policy_version 877172 (0.00095) [2022-07-10 20:22:05,516][26022] Updated weights on worker 0-0, policy_version 877182 (0.00082) [2022-07-10 20:22:07,415][26022] Updated weights on worker 0-0, policy_version 877192 (0.00094) [2022-07-10 20:22:08,124][25689] Fps is (10 sec: 5475.4, 60 sec: 5546.1, 300 sec: 5535.0). Total num frames: 898248704. Throughput: 0: 5712.7. Samples: 898252254. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:08,124][25689] Avg episode reward: [(0, '-2.457')] [2022-07-10 20:22:09,275][26022] Updated weights on worker 0-0, policy_version 877202 (0.00083) [2022-07-10 20:22:10,930][26022] Updated weights on worker 0-0, policy_version 877212 (0.00086) [2022-07-10 20:22:13,031][26022] Updated weights on worker 0-0, policy_version 877222 (0.00089) [2022-07-10 20:22:13,167][25689] Fps is (10 sec: 5680.6, 60 sec: 5560.5, 300 sec: 5538.2). Total num frames: 898277376. Throughput: 0: 5726.9. Samples: 898286168. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:13,167][25689] Avg episode reward: [(0, '-2.011')] [2022-07-10 20:22:14,622][26022] Updated weights on worker 0-0, policy_version 877232 (0.00094) [2022-07-10 20:22:16,588][26022] Updated weights on worker 0-0, policy_version 877242 (0.00086) [2022-07-10 20:22:18,211][25689] Fps is (10 sec: 5583.0, 60 sec: 5559.6, 300 sec: 5534.2). Total num frames: 898305024. Throughput: 0: 5742.8. Samples: 898302740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:18,212][25689] Avg episode reward: [(0, '-1.723')] [2022-07-10 20:22:18,374][26022] Updated weights on worker 0-0, policy_version 877252 (0.00092) [2022-07-10 20:22:20,321][26022] Updated weights on worker 0-0, policy_version 877262 (0.00089) [2022-07-10 20:22:22,106][26022] Updated weights on worker 0-0, policy_version 877272 (0.00087) [2022-07-10 20:22:23,213][25689] Fps is (10 sec: 5504.1, 60 sec: 5545.5, 300 sec: 5531.1). Total num frames: 898332672. Throughput: 0: 5763.5. Samples: 898336660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:23,214][25689] Avg episode reward: [(0, '-1.967')] [2022-07-10 20:22:23,719][26022] Updated weights on worker 0-0, policy_version 877282 (0.00088) [2022-07-10 20:22:25,680][26022] Updated weights on worker 0-0, policy_version 877292 (0.00097) [2022-07-10 20:22:27,399][26022] Updated weights on worker 0-0, policy_version 877302 (0.00082) [2022-07-10 20:22:28,217][25689] Fps is (10 sec: 5628.6, 60 sec: 5563.9, 300 sec: 5538.2). Total num frames: 898361344. Throughput: 0: 5874.2. Samples: 898370288. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:28,218][25689] Avg episode reward: [(0, '-2.559')] [2022-07-10 20:22:29,271][26022] Updated weights on worker 0-0, policy_version 877312 (0.00083) [2022-07-10 20:22:31,011][26022] Updated weights on worker 0-0, policy_version 877322 (0.00082) [2022-07-10 20:22:32,985][26022] Updated weights on worker 0-0, policy_version 877332 (0.00085) [2022-07-10 20:22:33,258][25689] Fps is (10 sec: 5606.6, 60 sec: 5561.2, 300 sec: 5531.7). Total num frames: 898388992. Throughput: 0: 5017.9. Samples: 898386986. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:33,259][25689] Avg episode reward: [(0, '-1.842')] [2022-07-10 20:22:34,703][26022] Updated weights on worker 0-0, policy_version 877342 (0.00082) [2022-07-10 20:22:36,659][26022] Updated weights on worker 0-0, policy_version 877352 (0.00084) [2022-07-10 20:22:38,369][25689] Fps is (10 sec: 5547.8, 60 sec: 5544.1, 300 sec: 5536.7). Total num frames: 898417664. Throughput: 0: 5853.5. Samples: 898420732. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:38,370][25689] Avg episode reward: [(0, '-2.189')] [2022-07-10 20:22:38,476][26022] Updated weights on worker 0-0, policy_version 877362 (0.00091) [2022-07-10 20:22:39,970][26022] Updated weights on worker 0-0, policy_version 877372 (0.00091) [2022-07-10 20:22:42,008][26022] Updated weights on worker 0-0, policy_version 877382 (0.00086) [2022-07-10 20:22:43,397][25689] Fps is (10 sec: 5757.2, 60 sec: 5595.1, 300 sec: 5536.4). Total num frames: 898447360. Throughput: 0: 5835.0. Samples: 898454432. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:43,398][25689] Avg episode reward: [(0, '-1.924')] [2022-07-10 20:22:43,970][26022] Updated weights on worker 0-0, policy_version 877392 (0.00099) [2022-07-10 20:22:45,697][26022] Updated weights on worker 0-0, policy_version 877402 (0.00088) [2022-07-10 20:22:47,596][26022] Updated weights on worker 0-0, policy_version 877412 (0.00083) [2022-07-10 20:22:48,421][25689] Fps is (10 sec: 5500.9, 60 sec: 5560.8, 300 sec: 5529.2). Total num frames: 898472960. Throughput: 0: 4985.1. Samples: 898471006. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:48,422][25689] Avg episode reward: [(0, '-0.984')] [2022-07-10 20:22:49,455][26022] Updated weights on worker 0-0, policy_version 877422 (0.00085) [2022-07-10 20:22:51,288][26022] Updated weights on worker 0-0, policy_version 877432 (0.00089) [2022-07-10 20:22:53,211][26022] Updated weights on worker 0-0, policy_version 877442 (0.00087) [2022-07-10 20:22:53,454][25689] Fps is (10 sec: 5396.4, 60 sec: 5562.8, 300 sec: 5535.4). Total num frames: 898501632. Throughput: 0: 5823.2. Samples: 898504588. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:53,454][25689] Avg episode reward: [(0, '-1.087')] [2022-07-10 20:22:54,859][26022] Updated weights on worker 0-0, policy_version 877452 (0.00096) [2022-07-10 20:22:56,774][26022] Updated weights on worker 0-0, policy_version 877462 (0.00094) [2022-07-10 20:22:58,503][25689] Fps is (10 sec: 5687.7, 60 sec: 5570.5, 300 sec: 5534.5). Total num frames: 898530304. Throughput: 0: 5826.3. Samples: 898538040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:22:58,504][25689] Avg episode reward: [(0, '-1.195')] [2022-07-10 20:22:58,541][26022] Updated weights on worker 0-0, policy_version 877472 (0.00091) [2022-07-10 20:23:00,508][26022] Updated weights on worker 0-0, policy_version 877482 (0.00086) [2022-07-10 20:23:02,687][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:23:02,700][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000877492_898551808.pth [2022-07-10 20:23:02,701][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000875544_896557056.pth [2022-07-10 20:23:02,703][26022] Updated weights on worker 0-0, policy_version 877492 (0.00087) [2022-07-10 20:23:03,506][25689] Fps is (10 sec: 5297.2, 60 sec: 5540.7, 300 sec: 5531.5). Total num frames: 898554880. Throughput: 0: 5007.2. Samples: 898555124. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:03,507][25689] Avg episode reward: [(0, '0.075')] [2022-07-10 20:23:04,449][26022] Updated weights on worker 0-0, policy_version 877502 (0.00088) [2022-07-10 20:23:06,261][26022] Updated weights on worker 0-0, policy_version 877512 (0.00080) [2022-07-10 20:23:08,172][26022] Updated weights on worker 0-0, policy_version 877522 (0.00089) [2022-07-10 20:23:08,550][25689] Fps is (10 sec: 5300.3, 60 sec: 5541.5, 300 sec: 5535.0). Total num frames: 898583552. Throughput: 0: 5754.4. Samples: 898586832. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:08,550][25689] Avg episode reward: [(0, '0.239')] [2022-07-10 20:23:09,780][26022] Updated weights on worker 0-0, policy_version 877532 (0.00088) [2022-07-10 20:23:11,828][26022] Updated weights on worker 0-0, policy_version 877542 (0.00095) [2022-07-10 20:23:13,512][26022] Updated weights on worker 0-0, policy_version 877552 (0.00113) [2022-07-10 20:23:13,556][25689] Fps is (10 sec: 5807.8, 60 sec: 5561.8, 300 sec: 5543.7). Total num frames: 898613248. Throughput: 0: 5766.4. Samples: 898620504. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:13,556][25689] Avg episode reward: [(0, '0.136')] [2022-07-10 20:23:15,485][26022] Updated weights on worker 0-0, policy_version 877562 (0.00083) [2022-07-10 20:23:17,253][26022] Updated weights on worker 0-0, policy_version 877572 (0.00092) [2022-07-10 20:23:18,621][25689] Fps is (10 sec: 5693.5, 60 sec: 5559.9, 300 sec: 5536.4). Total num frames: 898640896. Throughput: 0: 4925.2. Samples: 898637126. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:18,622][25689] Avg episode reward: [(0, '0.148')] [2022-07-10 20:23:19,023][26022] Updated weights on worker 0-0, policy_version 877582 (0.00091) [2022-07-10 20:23:21,050][26022] Updated weights on worker 0-0, policy_version 877592 (0.00096) [2022-07-10 20:23:22,635][26022] Updated weights on worker 0-0, policy_version 877602 (0.00084) [2022-07-10 20:23:23,643][25689] Fps is (10 sec: 5482.1, 60 sec: 5558.1, 300 sec: 5542.9). Total num frames: 898668544. Throughput: 0: 5757.6. Samples: 898671062. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:23,643][25689] Avg episode reward: [(0, '0.033')] [2022-07-10 20:23:24,410][26022] Updated weights on worker 0-0, policy_version 877612 (0.00093) [2022-07-10 20:23:26,480][26022] Updated weights on worker 0-0, policy_version 877622 (0.00087) [2022-07-10 20:23:28,200][26022] Updated weights on worker 0-0, policy_version 877632 (0.00092) [2022-07-10 20:23:28,667][25689] Fps is (10 sec: 5606.5, 60 sec: 5556.2, 300 sec: 5539.6). Total num frames: 898697216. Throughput: 0: 5856.6. Samples: 898704650. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:28,667][25689] Avg episode reward: [(0, '-0.620')] [2022-07-10 20:23:30,034][26022] Updated weights on worker 0-0, policy_version 877642 (0.00085) [2022-07-10 20:23:31,811][26022] Updated weights on worker 0-0, policy_version 877652 (0.00088) [2022-07-10 20:23:33,556][26022] Updated weights on worker 0-0, policy_version 877662 (0.00086) [2022-07-10 20:23:33,703][25689] Fps is (10 sec: 5700.2, 60 sec: 5573.7, 300 sec: 5544.8). Total num frames: 898725888. Throughput: 0: 5002.4. Samples: 898721286. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:33,703][25689] Avg episode reward: [(0, '-0.618')] [2022-07-10 20:23:35,519][26022] Updated weights on worker 0-0, policy_version 877672 (0.00086) [2022-07-10 20:23:37,515][26022] Updated weights on worker 0-0, policy_version 877682 (0.00083) [2022-07-10 20:23:38,766][25689] Fps is (10 sec: 5475.0, 60 sec: 5544.1, 300 sec: 5541.9). Total num frames: 898752512. Throughput: 0: 5838.3. Samples: 898754740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:38,767][25689] Avg episode reward: [(0, '-1.675')] [2022-07-10 20:23:39,219][26022] Updated weights on worker 0-0, policy_version 877692 (0.00093) [2022-07-10 20:23:41,198][26022] Updated weights on worker 0-0, policy_version 877702 (0.00054) [2022-07-10 20:23:42,960][26022] Updated weights on worker 0-0, policy_version 877712 (0.00093) [2022-07-10 20:23:43,783][25689] Fps is (10 sec: 5384.1, 60 sec: 5511.2, 300 sec: 5541.9). Total num frames: 898780160. Throughput: 0: 5811.6. Samples: 898788108. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:43,783][25689] Avg episode reward: [(0, '-1.553')] [2022-07-10 20:23:44,947][26022] Updated weights on worker 0-0, policy_version 877722 (0.00086) [2022-07-10 20:23:46,734][26022] Updated weights on worker 0-0, policy_version 877732 (0.00082) [2022-07-10 20:23:48,461][26022] Updated weights on worker 0-0, policy_version 877742 (0.00065) [2022-07-10 20:23:48,810][25689] Fps is (10 sec: 5607.8, 60 sec: 5561.9, 300 sec: 5544.9). Total num frames: 898808832. Throughput: 0: 4957.4. Samples: 898804506. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:48,810][25689] Avg episode reward: [(0, '-2.039')] [2022-07-10 20:23:50,359][26022] Updated weights on worker 0-0, policy_version 877752 (0.00092) [2022-07-10 20:23:52,326][26022] Updated weights on worker 0-0, policy_version 877762 (0.00092) [2022-07-10 20:23:53,816][25689] Fps is (10 sec: 5511.4, 60 sec: 5530.4, 300 sec: 5539.0). Total num frames: 898835456. Throughput: 0: 5781.1. Samples: 898837562. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:53,816][25689] Avg episode reward: [(0, '-1.904')] [2022-07-10 20:23:54,131][26022] Updated weights on worker 0-0, policy_version 877772 (0.00079) [2022-07-10 20:23:56,108][26022] Updated weights on worker 0-0, policy_version 877782 (0.00088) [2022-07-10 20:23:57,864][26022] Updated weights on worker 0-0, policy_version 877792 (0.00091) [2022-07-10 20:23:58,899][25689] Fps is (10 sec: 5379.1, 60 sec: 5510.3, 300 sec: 5541.8). Total num frames: 898863104. Throughput: 0: 5757.4. Samples: 898870652. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:23:58,900][25689] Avg episode reward: [(0, '-1.994')] [2022-07-10 20:23:59,860][26022] Updated weights on worker 0-0, policy_version 877802 (0.00053) [2022-07-10 20:24:01,875][26022] Updated weights on worker 0-0, policy_version 877812 (0.00101) [2022-07-10 20:24:03,963][25689] Fps is (10 sec: 5348.2, 60 sec: 5538.6, 300 sec: 5537.4). Total num frames: 898889728. Throughput: 0: 4904.6. Samples: 898887086. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:03,964][25689] Avg episode reward: [(0, '-2.800')] [2022-07-10 20:24:03,967][26022] Updated weights on worker 0-0, policy_version 877822 (0.00090) [2022-07-10 20:24:05,648][26022] Updated weights on worker 0-0, policy_version 877832 (0.00091) [2022-07-10 20:24:07,553][26022] Updated weights on worker 0-0, policy_version 877842 (0.00086) [2022-07-10 20:24:09,045][25689] Fps is (10 sec: 5349.4, 60 sec: 5518.2, 300 sec: 5539.8). Total num frames: 898917376. Throughput: 0: 5654.1. Samples: 898918916. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:09,045][25689] Avg episode reward: [(0, '-3.553')] [2022-07-10 20:24:09,433][26022] Updated weights on worker 0-0, policy_version 877852 (0.00085) [2022-07-10 20:24:11,205][26022] Updated weights on worker 0-0, policy_version 877862 (0.00095) [2022-07-10 20:24:12,998][26022] Updated weights on worker 0-0, policy_version 877872 (0.00085) [2022-07-10 20:24:14,063][25689] Fps is (10 sec: 5677.5, 60 sec: 5517.1, 300 sec: 5541.5). Total num frames: 898947072. Throughput: 0: 5665.2. Samples: 898952270. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:14,064][25689] Avg episode reward: [(0, '-4.427')] [2022-07-10 20:24:14,966][26022] Updated weights on worker 0-0, policy_version 877882 (0.00094) [2022-07-10 20:24:16,683][26022] Updated weights on worker 0-0, policy_version 877892 (0.00097) [2022-07-10 20:24:18,742][26022] Updated weights on worker 0-0, policy_version 877902 (0.00088) [2022-07-10 20:24:19,133][25689] Fps is (10 sec: 5582.9, 60 sec: 5499.8, 300 sec: 5537.3). Total num frames: 898973696. Throughput: 0: 4850.8. Samples: 898968800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:19,133][25689] Avg episode reward: [(0, '-4.324')] [2022-07-10 20:24:20,369][26022] Updated weights on worker 0-0, policy_version 877912 (0.00083) [2022-07-10 20:24:22,183][26022] Updated weights on worker 0-0, policy_version 877922 (0.00081) [2022-07-10 20:24:23,982][26022] Updated weights on worker 0-0, policy_version 877932 (0.00092) [2022-07-10 20:24:24,143][25689] Fps is (10 sec: 5485.8, 60 sec: 5517.7, 300 sec: 5540.8). Total num frames: 899002368. Throughput: 0: 5714.9. Samples: 899002412. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:24,144][25689] Avg episode reward: [(0, '-4.994')] [2022-07-10 20:24:25,878][26022] Updated weights on worker 0-0, policy_version 877942 (0.00090) [2022-07-10 20:24:27,757][26022] Updated weights on worker 0-0, policy_version 877952 (0.00088) [2022-07-10 20:24:29,153][25689] Fps is (10 sec: 5620.3, 60 sec: 5502.0, 300 sec: 5540.7). Total num frames: 899030016. Throughput: 0: 5801.2. Samples: 899035572. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:29,154][25689] Avg episode reward: [(0, '-4.749')] [2022-07-10 20:24:29,645][26022] Updated weights on worker 0-0, policy_version 877962 (0.00092) [2022-07-10 20:24:31,427][26022] Updated weights on worker 0-0, policy_version 877972 (0.00084) [2022-07-10 20:24:33,380][26022] Updated weights on worker 0-0, policy_version 877982 (0.00092) [2022-07-10 20:24:34,167][25689] Fps is (10 sec: 5516.6, 60 sec: 5487.2, 300 sec: 5538.7). Total num frames: 899057664. Throughput: 0: 4966.0. Samples: 899052102. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:34,168][25689] Avg episode reward: [(0, '-2.127')] [2022-07-10 20:24:35,248][26022] Updated weights on worker 0-0, policy_version 877992 (0.00095) [2022-07-10 20:24:36,935][26022] Updated weights on worker 0-0, policy_version 878002 (0.00093) [2022-07-10 20:24:38,904][26022] Updated weights on worker 0-0, policy_version 878012 (0.00089) [2022-07-10 20:24:39,217][25689] Fps is (10 sec: 5596.6, 60 sec: 5522.3, 300 sec: 5537.8). Total num frames: 899086336. Throughput: 0: 5811.7. Samples: 899085522. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:39,217][25689] Avg episode reward: [(0, '-2.310')] [2022-07-10 20:24:40,784][26022] Updated weights on worker 0-0, policy_version 878022 (0.00261) [2022-07-10 20:24:42,373][26022] Updated weights on worker 0-0, policy_version 878032 (0.00091) [2022-07-10 20:24:44,230][25689] Fps is (10 sec: 5596.8, 60 sec: 5522.6, 300 sec: 5539.3). Total num frames: 899113984. Throughput: 0: 5815.1. Samples: 899119216. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:44,230][25689] Avg episode reward: [(0, '-0.946')] [2022-07-10 20:24:44,382][26022] Updated weights on worker 0-0, policy_version 878042 (0.00089) [2022-07-10 20:24:46,080][26022] Updated weights on worker 0-0, policy_version 878052 (0.00093) [2022-07-10 20:24:48,068][26022] Updated weights on worker 0-0, policy_version 878062 (0.00077) [2022-07-10 20:24:49,281][25689] Fps is (10 sec: 5596.2, 60 sec: 5520.4, 300 sec: 5542.9). Total num frames: 899142656. Throughput: 0: 4995.7. Samples: 899136122. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:49,281][25689] Avg episode reward: [(0, '0.027')] [2022-07-10 20:24:49,907][26022] Updated weights on worker 0-0, policy_version 878072 (0.00087) [2022-07-10 20:24:51,528][26022] Updated weights on worker 0-0, policy_version 878082 (0.00096) [2022-07-10 20:24:53,436][26022] Updated weights on worker 0-0, policy_version 878092 (0.00086) [2022-07-10 20:24:54,292][25689] Fps is (10 sec: 5495.2, 60 sec: 5519.9, 300 sec: 5537.4). Total num frames: 899169280. Throughput: 0: 5849.4. Samples: 899169824. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:54,293][25689] Avg episode reward: [(0, '1.255')] [2022-07-10 20:24:55,126][26022] Updated weights on worker 0-0, policy_version 878102 (0.00081) [2022-07-10 20:24:57,086][26022] Updated weights on worker 0-0, policy_version 878112 (0.00085) [2022-07-10 20:24:58,853][26022] Updated weights on worker 0-0, policy_version 878122 (0.00081) [2022-07-10 20:24:59,410][25689] Fps is (10 sec: 5661.1, 60 sec: 5567.5, 300 sec: 5542.9). Total num frames: 899200000. Throughput: 0: 5860.0. Samples: 899203856. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:24:59,411][25689] Avg episode reward: [(0, '1.524')] [2022-07-10 20:25:00,631][26022] Updated weights on worker 0-0, policy_version 878132 (0.00614) [2022-07-10 20:25:02,881][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:25:02,895][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000878142_899217408.pth [2022-07-10 20:25:02,896][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000876191_897219584.pth [2022-07-10 20:25:02,907][26022] Updated weights on worker 0-0, policy_version 878142 (0.00057) [2022-07-10 20:25:04,457][25689] Fps is (10 sec: 5641.5, 60 sec: 5569.1, 300 sec: 5549.3). Total num frames: 899226624. Throughput: 0: 5032.0. Samples: 899221000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:25:04,458][25689] Avg episode reward: [(0, '1.867')] [2022-07-10 20:25:04,517][26022] Updated weights on worker 0-0, policy_version 878152 (0.00086) [2022-07-10 20:25:06,466][26022] Updated weights on worker 0-0, policy_version 878162 (0.00079) [2022-07-10 20:25:08,354][26022] Updated weights on worker 0-0, policy_version 878172 (0.00097) [2022-07-10 20:25:09,545][25689] Fps is (10 sec: 5354.7, 60 sec: 5568.5, 300 sec: 5541.6). Total num frames: 899254272. Throughput: 0: 5766.0. Samples: 899252968. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:25:09,546][25689] Avg episode reward: [(0, '0.820')] [2022-07-10 20:25:10,105][26022] Updated weights on worker 0-0, policy_version 878182 (0.00095) [2022-07-10 20:25:11,933][26022] Updated weights on worker 0-0, policy_version 878192 (0.00089) [2022-07-10 20:25:13,564][26022] Updated weights on worker 0-0, policy_version 878202 (0.00094) [2022-07-10 20:25:14,563][25689] Fps is (10 sec: 5573.1, 60 sec: 5551.7, 300 sec: 5547.3). Total num frames: 899282944. Throughput: 0: 5773.0. Samples: 899286846. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:25:14,563][25689] Avg episode reward: [(0, '0.852')] [2022-07-10 20:25:15,747][26022] Updated weights on worker 0-0, policy_version 878212 (0.00088) [2022-07-10 20:25:17,429][26022] Updated weights on worker 0-0, policy_version 878222 (0.00085) [2022-07-10 20:25:19,385][26022] Updated weights on worker 0-0, policy_version 878232 (0.00093) [2022-07-10 20:25:19,629][25689] Fps is (10 sec: 5686.9, 60 sec: 5585.8, 300 sec: 5547.3). Total num frames: 899311616. Throughput: 0: 5739.9. Samples: 899319910. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2022-07-10 20:25:19,629][25689] Avg episode reward: [(0, '0.981')] [2022-07-10 20:25:21,027][26022] Updated weights on worker 0-0, policy_version 878242 (0.00082) [2022-07-10 20:25:23,052][26022] Updated weights on worker 0-0, policy_version 878252 (0.00081) [2022-07-10 20:25:24,658][25689] Fps is (10 sec: 5477.3, 60 sec: 5550.2, 300 sec: 5537.4). Total num frames: 899338240. Throughput: 0: 5735.4. Samples: 899336862. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:24,660][25689] Avg episode reward: [(0, '0.587')] [2022-07-10 20:25:24,879][26022] Updated weights on worker 0-0, policy_version 878262 (0.00079) [2022-07-10 20:25:26,583][26022] Updated weights on worker 0-0, policy_version 878272 (0.00086) [2022-07-10 20:25:28,532][26022] Updated weights on worker 0-0, policy_version 878282 (0.00089) [2022-07-10 20:25:29,707][25689] Fps is (10 sec: 5486.9, 60 sec: 5563.6, 300 sec: 5543.7). Total num frames: 899366912. Throughput: 0: 5815.8. Samples: 899370222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:29,709][25689] Avg episode reward: [(0, '-0.093')] [2022-07-10 20:25:30,506][26022] Updated weights on worker 0-0, policy_version 878292 (0.00087) [2022-07-10 20:25:32,200][26022] Updated weights on worker 0-0, policy_version 878302 (0.00084) [2022-07-10 20:25:34,155][26022] Updated weights on worker 0-0, policy_version 878312 (0.00086) [2022-07-10 20:25:34,711][25689] Fps is (10 sec: 5602.4, 60 sec: 5564.5, 300 sec: 5541.8). Total num frames: 899394560. Throughput: 0: 5798.0. Samples: 899403664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:34,711][25689] Avg episode reward: [(0, '-0.464')] [2022-07-10 20:25:35,815][26022] Updated weights on worker 0-0, policy_version 878322 (0.00087) [2022-07-10 20:25:37,835][26022] Updated weights on worker 0-0, policy_version 878332 (0.00091) [2022-07-10 20:25:39,470][26022] Updated weights on worker 0-0, policy_version 878342 (0.00089) [2022-07-10 20:25:39,787][25689] Fps is (10 sec: 5688.9, 60 sec: 5579.0, 300 sec: 5544.0). Total num frames: 899424256. Throughput: 0: 4978.5. Samples: 899420266. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:39,787][25689] Avg episode reward: [(0, '-0.288')] [2022-07-10 20:25:41,282][26022] Updated weights on worker 0-0, policy_version 878352 (0.00086) [2022-07-10 20:25:43,217][26022] Updated weights on worker 0-0, policy_version 878362 (0.00087) [2022-07-10 20:25:44,831][25689] Fps is (10 sec: 5565.0, 60 sec: 5559.2, 300 sec: 5543.5). Total num frames: 899450880. Throughput: 0: 5806.7. Samples: 899454002. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:44,832][25689] Avg episode reward: [(0, '-0.912')] [2022-07-10 20:25:45,170][26022] Updated weights on worker 0-0, policy_version 878372 (0.00086) [2022-07-10 20:25:46,698][26022] Updated weights on worker 0-0, policy_version 878382 (0.00084) [2022-07-10 20:25:48,821][26022] Updated weights on worker 0-0, policy_version 878392 (0.00091) [2022-07-10 20:25:49,838][25689] Fps is (10 sec: 5603.2, 60 sec: 5580.2, 300 sec: 5543.9). Total num frames: 899480576. Throughput: 0: 5814.7. Samples: 899487280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:49,838][25689] Avg episode reward: [(0, '-1.187')] [2022-07-10 20:25:50,501][26022] Updated weights on worker 0-0, policy_version 878402 (0.00090) [2022-07-10 20:25:52,443][26022] Updated weights on worker 0-0, policy_version 878412 (0.00092) [2022-07-10 20:25:54,221][26022] Updated weights on worker 0-0, policy_version 878422 (0.00096) [2022-07-10 20:25:54,885][25689] Fps is (10 sec: 5499.8, 60 sec: 5560.0, 300 sec: 5537.3). Total num frames: 899506176. Throughput: 0: 4980.5. Samples: 899504144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:54,886][25689] Avg episode reward: [(0, '-0.264')] [2022-07-10 20:25:56,216][26022] Updated weights on worker 0-0, policy_version 878433 (0.00096) [2022-07-10 20:25:58,296][26022] Updated weights on worker 0-0, policy_version 878443 (0.00095) [2022-07-10 20:25:59,827][26022] Updated weights on worker 0-0, policy_version 878453 (0.00085) [2022-07-10 20:25:59,939][25689] Fps is (10 sec: 5474.3, 60 sec: 5549.0, 300 sec: 5550.4). Total num frames: 899535872. Throughput: 0: 5807.8. Samples: 899537306. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:25:59,939][25689] Avg episode reward: [(0, '-0.516')] [2022-07-10 20:26:01,826][26022] Updated weights on worker 0-0, policy_version 878463 (0.00078) [2022-07-10 20:26:04,234][26022] Updated weights on worker 0-0, policy_version 878473 (0.00093) [2022-07-10 20:26:04,972][25689] Fps is (10 sec: 5380.6, 60 sec: 5516.4, 300 sec: 5539.9). Total num frames: 899560448. Throughput: 0: 5706.9. Samples: 899568942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:04,974][25689] Avg episode reward: [(0, '0.479')] [2022-07-10 20:26:05,774][26022] Updated weights on worker 0-0, policy_version 878483 (0.00055) [2022-07-10 20:26:07,912][26022] Updated weights on worker 0-0, policy_version 878493 (0.00086) [2022-07-10 20:26:09,489][26022] Updated weights on worker 0-0, policy_version 878503 (0.00095) [2022-07-10 20:26:10,011][25689] Fps is (10 sec: 5286.9, 60 sec: 5537.9, 300 sec: 5542.9). Total num frames: 899589120. Throughput: 0: 4877.2. Samples: 899585666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:10,011][25689] Avg episode reward: [(0, '0.696')] [2022-07-10 20:26:11,411][26022] Updated weights on worker 0-0, policy_version 878513 (0.00094) [2022-07-10 20:26:13,023][26022] Updated weights on worker 0-0, policy_version 878523 (0.00087) [2022-07-10 20:26:14,748][26022] Updated weights on worker 0-0, policy_version 878533 (0.00091) [2022-07-10 20:26:15,019][25689] Fps is (10 sec: 5707.7, 60 sec: 5538.7, 300 sec: 5546.9). Total num frames: 899617792. Throughput: 0: 5731.7. Samples: 899619542. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:15,019][25689] Avg episode reward: [(0, '0.377')] [2022-07-10 20:26:16,917][26022] Updated weights on worker 0-0, policy_version 878543 (0.00052) [2022-07-10 20:26:18,794][26022] Updated weights on worker 0-0, policy_version 878553 (0.00093) [2022-07-10 20:26:20,131][25689] Fps is (10 sec: 5666.2, 60 sec: 5534.5, 300 sec: 5545.4). Total num frames: 899646464. Throughput: 0: 5715.7. Samples: 899652718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:20,131][25689] Avg episode reward: [(0, '0.322')] [2022-07-10 20:26:20,478][26022] Updated weights on worker 0-0, policy_version 878563 (0.00083) [2022-07-10 20:26:22,510][26022] Updated weights on worker 0-0, policy_version 878573 (0.00091) [2022-07-10 20:26:23,993][26022] Updated weights on worker 0-0, policy_version 878583 (0.00105) [2022-07-10 20:26:25,149][25689] Fps is (10 sec: 5458.6, 60 sec: 5535.5, 300 sec: 5542.0). Total num frames: 899673088. Throughput: 0: 4994.5. Samples: 899669714. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:25,151][25689] Avg episode reward: [(0, '0.305')] [2022-07-10 20:26:26,118][26022] Updated weights on worker 0-0, policy_version 878593 (0.00089) [2022-07-10 20:26:27,695][26022] Updated weights on worker 0-0, policy_version 878603 (0.00088) [2022-07-10 20:26:29,621][26022] Updated weights on worker 0-0, policy_version 878613 (0.00227) [2022-07-10 20:26:30,154][25689] Fps is (10 sec: 5517.1, 60 sec: 5539.5, 300 sec: 5545.5). Total num frames: 899701760. Throughput: 0: 5842.9. Samples: 899703360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:30,154][25689] Avg episode reward: [(0, '0.308')] [2022-07-10 20:26:31,524][26022] Updated weights on worker 0-0, policy_version 878623 (0.00084) [2022-07-10 20:26:33,348][26022] Updated weights on worker 0-0, policy_version 878633 (0.00078) [2022-07-10 20:26:35,108][26022] Updated weights on worker 0-0, policy_version 878643 (0.00089) [2022-07-10 20:26:35,188][25689] Fps is (10 sec: 5712.0, 60 sec: 5553.7, 300 sec: 5543.5). Total num frames: 899730432. Throughput: 0: 5813.9. Samples: 899736804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:35,188][25689] Avg episode reward: [(0, '0.076')] [2022-07-10 20:26:37,095][26022] Updated weights on worker 0-0, policy_version 878653 (0.00095) [2022-07-10 20:26:38,900][26022] Updated weights on worker 0-0, policy_version 878663 (0.00091) [2022-07-10 20:26:40,303][25689] Fps is (10 sec: 5448.3, 60 sec: 5499.4, 300 sec: 5541.9). Total num frames: 899757056. Throughput: 0: 4990.1. Samples: 899753378. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:40,306][25689] Avg episode reward: [(0, '-0.335')] [2022-07-10 20:26:40,631][26022] Updated weights on worker 0-0, policy_version 878673 (0.00089) [2022-07-10 20:26:42,631][26022] Updated weights on worker 0-0, policy_version 878683 (0.00096) [2022-07-10 20:26:44,339][26022] Updated weights on worker 0-0, policy_version 878693 (0.00088) [2022-07-10 20:26:45,323][25689] Fps is (10 sec: 5456.0, 60 sec: 5535.5, 300 sec: 5545.4). Total num frames: 899785728. Throughput: 0: 5813.4. Samples: 899786994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:45,323][25689] Avg episode reward: [(0, '-0.437')] [2022-07-10 20:26:46,419][26022] Updated weights on worker 0-0, policy_version 878703 (0.00097) [2022-07-10 20:26:47,988][26022] Updated weights on worker 0-0, policy_version 878713 (0.00088) [2022-07-10 20:26:49,989][26022] Updated weights on worker 0-0, policy_version 878723 (0.00102) [2022-07-10 20:26:50,367][25689] Fps is (10 sec: 5595.9, 60 sec: 5498.2, 300 sec: 5542.1). Total num frames: 899813376. Throughput: 0: 5774.7. Samples: 899820088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:50,368][25689] Avg episode reward: [(0, '-0.547')] [2022-07-10 20:26:51,723][26022] Updated weights on worker 0-0, policy_version 878733 (0.00077) [2022-07-10 20:26:53,623][26022] Updated weights on worker 0-0, policy_version 878743 (0.00087) [2022-07-10 20:26:55,426][25689] Fps is (10 sec: 5574.5, 60 sec: 5547.9, 300 sec: 5543.5). Total num frames: 899842048. Throughput: 0: 4946.4. Samples: 899836910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:26:55,427][25689] Avg episode reward: [(0, '-0.866')] [2022-07-10 20:26:55,539][26022] Updated weights on worker 0-0, policy_version 878753 (0.00088) [2022-07-10 20:26:57,136][26022] Updated weights on worker 0-0, policy_version 878763 (0.00090) [2022-07-10 20:26:59,305][26022] Updated weights on worker 0-0, policy_version 878773 (0.00087) [2022-07-10 20:27:00,511][25689] Fps is (10 sec: 5653.4, 60 sec: 5528.1, 300 sec: 5549.7). Total num frames: 899870720. Throughput: 0: 5786.5. Samples: 899870310. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:00,511][25689] Avg episode reward: [(0, '-0.887')] [2022-07-10 20:27:00,786][26022] Updated weights on worker 0-0, policy_version 878783 (0.00095) [2022-07-10 20:27:03,067][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:27:03,090][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000878792_899883008.pth [2022-07-10 20:27:03,090][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000876842_897886208.pth [2022-07-10 20:27:03,119][26022] Updated weights on worker 0-0, policy_version 878793 (0.00087) [2022-07-10 20:27:05,235][26022] Updated weights on worker 0-0, policy_version 878803 (0.00089) [2022-07-10 20:27:05,527][25689] Fps is (10 sec: 5373.0, 60 sec: 5546.6, 300 sec: 5540.0). Total num frames: 899896320. Throughput: 0: 5677.1. Samples: 899901694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:05,527][25689] Avg episode reward: [(0, '-0.939')] [2022-07-10 20:27:06,798][26022] Updated weights on worker 0-0, policy_version 878813 (0.00089) [2022-07-10 20:27:08,747][26022] Updated weights on worker 0-0, policy_version 878823 (0.00094) [2022-07-10 20:27:10,546][25689] Fps is (10 sec: 5408.1, 60 sec: 5548.4, 300 sec: 5540.5). Total num frames: 899924992. Throughput: 0: 4870.7. Samples: 899918372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:10,546][25689] Avg episode reward: [(0, '-1.392')] [2022-07-10 20:27:10,560][26022] Updated weights on worker 0-0, policy_version 878833 (0.00092) [2022-07-10 20:27:12,307][26022] Updated weights on worker 0-0, policy_version 878843 (0.00087) [2022-07-10 20:27:14,330][26022] Updated weights on worker 0-0, policy_version 878853 (0.00094) [2022-07-10 20:27:15,575][25689] Fps is (10 sec: 5605.2, 60 sec: 5529.6, 300 sec: 5540.7). Total num frames: 899952640. Throughput: 0: 5718.9. Samples: 899952138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:15,575][25689] Avg episode reward: [(0, '-0.555')] [2022-07-10 20:27:16,010][26022] Updated weights on worker 0-0, policy_version 878863 (0.00092) [2022-07-10 20:27:17,983][26022] Updated weights on worker 0-0, policy_version 878873 (0.00090) [2022-07-10 20:27:19,754][26022] Updated weights on worker 0-0, policy_version 878883 (0.00089) [2022-07-10 20:27:20,694][25689] Fps is (10 sec: 5449.0, 60 sec: 5512.0, 300 sec: 5538.6). Total num frames: 899980288. Throughput: 0: 5707.6. Samples: 899985506. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:20,694][25689] Avg episode reward: [(0, '0.215')] [2022-07-10 20:27:21,639][26022] Updated weights on worker 0-0, policy_version 878893 (0.00084) [2022-07-10 20:27:23,635][26022] Updated weights on worker 0-0, policy_version 878903 (0.00063) [2022-07-10 20:27:25,273][26022] Updated weights on worker 0-0, policy_version 878913 (0.00094) [2022-07-10 20:27:25,724][25689] Fps is (10 sec: 5549.2, 60 sec: 5544.7, 300 sec: 5538.1). Total num frames: 900008960. Throughput: 0: 4962.0. Samples: 900001912. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:25,724][25689] Avg episode reward: [(0, '0.085')] [2022-07-10 20:27:27,119][26022] Updated weights on worker 0-0, policy_version 878923 (0.00096) [2022-07-10 20:27:28,946][26022] Updated weights on worker 0-0, policy_version 878933 (0.00082) [2022-07-10 20:27:30,781][25689] Fps is (10 sec: 5684.8, 60 sec: 5539.9, 300 sec: 5541.2). Total num frames: 900037632. Throughput: 0: 5794.7. Samples: 900035628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:30,782][25689] Avg episode reward: [(0, '0.419')] [2022-07-10 20:27:30,784][26022] Updated weights on worker 0-0, policy_version 878943 (0.00090) [2022-07-10 20:27:32,741][26022] Updated weights on worker 0-0, policy_version 878953 (0.00441) [2022-07-10 20:27:34,524][26022] Updated weights on worker 0-0, policy_version 878963 (0.00090) [2022-07-10 20:27:35,791][25689] Fps is (10 sec: 5594.3, 60 sec: 5525.2, 300 sec: 5539.7). Total num frames: 900065280. Throughput: 0: 5792.0. Samples: 900069232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:35,792][25689] Avg episode reward: [(0, '0.703')] [2022-07-10 20:27:36,287][26022] Updated weights on worker 0-0, policy_version 878973 (0.00089) [2022-07-10 20:27:38,229][26022] Updated weights on worker 0-0, policy_version 878983 (0.00092) [2022-07-10 20:27:40,067][26022] Updated weights on worker 0-0, policy_version 878993 (0.00097) [2022-07-10 20:27:40,839][25689] Fps is (10 sec: 5497.5, 60 sec: 5548.2, 300 sec: 5532.4). Total num frames: 900092928. Throughput: 0: 5811.9. Samples: 900102590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:40,840][25689] Avg episode reward: [(0, '1.345')] [2022-07-10 20:27:41,842][26022] Updated weights on worker 0-0, policy_version 879003 (0.00092) [2022-07-10 20:27:43,699][26022] Updated weights on worker 0-0, policy_version 879013 (0.00083) [2022-07-10 20:27:45,450][26022] Updated weights on worker 0-0, policy_version 879023 (0.00087) [2022-07-10 20:27:45,918][25689] Fps is (10 sec: 5561.6, 60 sec: 5542.9, 300 sec: 5541.7). Total num frames: 900121600. Throughput: 0: 5819.9. Samples: 900119438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:45,918][25689] Avg episode reward: [(0, '0.490')] [2022-07-10 20:27:47,374][26022] Updated weights on worker 0-0, policy_version 879033 (0.00086) [2022-07-10 20:27:49,068][26022] Updated weights on worker 0-0, policy_version 879043 (0.00082) [2022-07-10 20:27:50,960][25689] Fps is (10 sec: 5565.1, 60 sec: 5543.1, 300 sec: 5538.1). Total num frames: 900149248. Throughput: 0: 5818.6. Samples: 900153040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:50,960][25689] Avg episode reward: [(0, '0.251')] [2022-07-10 20:27:51,095][26022] Updated weights on worker 0-0, policy_version 879053 (0.00094) [2022-07-10 20:27:52,943][26022] Updated weights on worker 0-0, policy_version 879063 (0.00091) [2022-07-10 20:27:54,672][26022] Updated weights on worker 0-0, policy_version 879073 (0.00079) [2022-07-10 20:27:56,018][25689] Fps is (10 sec: 5576.0, 60 sec: 5543.2, 300 sec: 5537.9). Total num frames: 900177920. Throughput: 0: 5796.0. Samples: 900186468. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:27:56,020][25689] Avg episode reward: [(0, '-0.104')] [2022-07-10 20:27:56,628][26022] Updated weights on worker 0-0, policy_version 879083 (0.00097) [2022-07-10 20:27:58,265][26022] Updated weights on worker 0-0, policy_version 879093 (0.00097) [2022-07-10 20:28:00,179][26022] Updated weights on worker 0-0, policy_version 879103 (0.00092) [2022-07-10 20:28:01,087][25689] Fps is (10 sec: 5662.5, 60 sec: 5544.6, 300 sec: 5550.5). Total num frames: 900206592. Throughput: 0: 4969.0. Samples: 900203198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:01,089][25689] Avg episode reward: [(0, '-2.027')] [2022-07-10 20:28:02,512][26022] Updated weights on worker 0-0, policy_version 879113 (0.00086) [2022-07-10 20:28:04,146][26022] Updated weights on worker 0-0, policy_version 879123 (0.00093) [2022-07-10 20:28:06,093][25689] Fps is (10 sec: 5285.3, 60 sec: 5528.6, 300 sec: 5537.4). Total num frames: 900231168. Throughput: 0: 5707.6. Samples: 900234590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:06,093][25689] Avg episode reward: [(0, '-2.623')] [2022-07-10 20:28:06,219][26022] Updated weights on worker 0-0, policy_version 879133 (0.00091) [2022-07-10 20:28:07,680][26022] Updated weights on worker 0-0, policy_version 879143 (0.00077) [2022-07-10 20:28:09,810][26022] Updated weights on worker 0-0, policy_version 879153 (0.00097) [2022-07-10 20:28:11,168][25689] Fps is (10 sec: 5281.8, 60 sec: 5523.5, 300 sec: 5532.7). Total num frames: 900259840. Throughput: 0: 5694.7. Samples: 900268122. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:11,169][25689] Avg episode reward: [(0, '-2.211')] [2022-07-10 20:28:11,528][26022] Updated weights on worker 0-0, policy_version 879163 (0.00082) [2022-07-10 20:28:13,328][26022] Updated weights on worker 0-0, policy_version 879173 (0.00088) [2022-07-10 20:28:15,303][26022] Updated weights on worker 0-0, policy_version 879183 (0.00086) [2022-07-10 20:28:16,231][25689] Fps is (10 sec: 5555.0, 60 sec: 5520.4, 300 sec: 5532.7). Total num frames: 900287488. Throughput: 0: 4867.5. Samples: 900284856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:16,232][25689] Avg episode reward: [(0, '-2.482')] [2022-07-10 20:28:17,136][26022] Updated weights on worker 0-0, policy_version 879193 (0.00093) [2022-07-10 20:28:18,996][26022] Updated weights on worker 0-0, policy_version 879203 (0.00091) [2022-07-10 20:28:20,704][26022] Updated weights on worker 0-0, policy_version 879213 (0.00094) [2022-07-10 20:28:21,369][25689] Fps is (10 sec: 5621.7, 60 sec: 5552.5, 300 sec: 5537.4). Total num frames: 900317184. Throughput: 0: 5682.9. Samples: 900318458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:21,369][25689] Avg episode reward: [(0, '-2.550')] [2022-07-10 20:28:22,637][26022] Updated weights on worker 0-0, policy_version 879223 (0.00083) [2022-07-10 20:28:24,421][26022] Updated weights on worker 0-0, policy_version 879233 (0.00087) [2022-07-10 20:28:26,329][26022] Updated weights on worker 0-0, policy_version 879243 (0.00091) [2022-07-10 20:28:26,403][25689] Fps is (10 sec: 5637.8, 60 sec: 5535.2, 300 sec: 5533.8). Total num frames: 900344832. Throughput: 0: 5778.6. Samples: 900351952. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:26,403][25689] Avg episode reward: [(0, '-1.656')] [2022-07-10 20:28:28,221][26022] Updated weights on worker 0-0, policy_version 879253 (0.00087) [2022-07-10 20:28:29,816][26022] Updated weights on worker 0-0, policy_version 879263 (0.00089) [2022-07-10 20:28:31,447][25689] Fps is (10 sec: 5486.4, 60 sec: 5519.5, 300 sec: 5530.2). Total num frames: 900372480. Throughput: 0: 4963.2. Samples: 900368772. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:31,448][25689] Avg episode reward: [(0, '-2.454')] [2022-07-10 20:28:31,835][26022] Updated weights on worker 0-0, policy_version 879273 (0.00096) [2022-07-10 20:28:33,355][26022] Updated weights on worker 0-0, policy_version 879283 (0.00083) [2022-07-10 20:28:35,598][26022] Updated weights on worker 0-0, policy_version 879293 (0.00092) [2022-07-10 20:28:36,470][25689] Fps is (10 sec: 5594.1, 60 sec: 5535.2, 300 sec: 5537.9). Total num frames: 900401152. Throughput: 0: 5804.6. Samples: 900402336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:36,471][25689] Avg episode reward: [(0, '-2.091')] [2022-07-10 20:28:37,233][26022] Updated weights on worker 0-0, policy_version 879303 (0.00085) [2022-07-10 20:28:39,233][26022] Updated weights on worker 0-0, policy_version 879313 (0.00090) [2022-07-10 20:28:40,971][26022] Updated weights on worker 0-0, policy_version 879323 (0.00087) [2022-07-10 20:28:41,548][25689] Fps is (10 sec: 5677.0, 60 sec: 5549.4, 300 sec: 5540.1). Total num frames: 900429824. Throughput: 0: 5780.5. Samples: 900435108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:41,549][25689] Avg episode reward: [(0, '-1.638')] [2022-07-10 20:28:42,924][26022] Updated weights on worker 0-0, policy_version 879333 (0.00090) [2022-07-10 20:28:44,622][26022] Updated weights on worker 0-0, policy_version 879343 (0.00086) [2022-07-10 20:28:46,554][25689] Fps is (10 sec: 5585.1, 60 sec: 5539.1, 300 sec: 5537.1). Total num frames: 900457472. Throughput: 0: 4962.7. Samples: 900451958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:46,555][25689] Avg episode reward: [(0, '-1.974')] [2022-07-10 20:28:46,556][26022] Updated weights on worker 0-0, policy_version 879353 (0.00088) [2022-07-10 20:28:48,336][26022] Updated weights on worker 0-0, policy_version 879363 (0.00096) [2022-07-10 20:28:50,222][26022] Updated weights on worker 0-0, policy_version 879373 (0.00094) [2022-07-10 20:28:51,575][25689] Fps is (10 sec: 5515.0, 60 sec: 5541.1, 300 sec: 5540.3). Total num frames: 900485120. Throughput: 0: 5816.7. Samples: 900485848. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:51,575][25689] Avg episode reward: [(0, '-2.065')] [2022-07-10 20:28:52,141][26022] Updated weights on worker 0-0, policy_version 879383 (0.00090) [2022-07-10 20:28:54,054][26022] Updated weights on worker 0-0, policy_version 879393 (0.00084) [2022-07-10 20:28:55,677][26022] Updated weights on worker 0-0, policy_version 879403 (0.00082) [2022-07-10 20:28:56,587][25689] Fps is (10 sec: 5613.5, 60 sec: 5545.3, 300 sec: 5545.0). Total num frames: 900513792. Throughput: 0: 5798.4. Samples: 900518982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:28:56,588][25689] Avg episode reward: [(0, '-1.008')] [2022-07-10 20:28:57,706][26022] Updated weights on worker 0-0, policy_version 879413 (0.00091) [2022-07-10 20:28:59,375][26022] Updated weights on worker 0-0, policy_version 879423 (0.00085) [2022-07-10 20:29:01,439][26022] Updated weights on worker 0-0, policy_version 879433 (0.00101) [2022-07-10 20:29:01,646][25689] Fps is (10 sec: 5388.9, 60 sec: 5495.4, 300 sec: 5541.7). Total num frames: 900539392. Throughput: 0: 5010.4. Samples: 900535804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:29:01,646][25689] Avg episode reward: [(0, '0.404')] [2022-07-10 20:29:03,285][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:29:03,297][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000879442_900548608.pth [2022-07-10 20:29:03,298][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000877492_898551808.pth [2022-07-10 20:29:03,347][26022] Updated weights on worker 0-0, policy_version 879443 (0.00093) [2022-07-10 20:29:05,190][26022] Updated weights on worker 0-0, policy_version 879453 (0.00092) [2022-07-10 20:29:06,654][25689] Fps is (10 sec: 5289.4, 60 sec: 5546.0, 300 sec: 5543.0). Total num frames: 900567040. Throughput: 0: 5759.9. Samples: 900567732. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 20:29:06,655][25689] Avg episode reward: [(0, '0.390')] [2022-07-10 20:29:07,084][26022] Updated weights on worker 0-0, policy_version 879463 (0.00087) [2022-07-10 20:29:08,808][26022] Updated weights on worker 0-0, policy_version 879473 (0.00087) [2022-07-10 20:29:10,627][26022] Updated weights on worker 0-0, policy_version 879483 (0.00092) [2022-07-10 20:29:11,658][25689] Fps is (10 sec: 5727.3, 60 sec: 5569.4, 300 sec: 5543.3). Total num frames: 900596736. Throughput: 0: 5764.3. Samples: 900601616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:11,659][25689] Avg episode reward: [(0, '-0.591')] [2022-07-10 20:29:12,364][26022] Updated weights on worker 0-0, policy_version 879493 (0.00091) [2022-07-10 20:29:14,435][26022] Updated weights on worker 0-0, policy_version 879503 (0.00090) [2022-07-10 20:29:16,106][26022] Updated weights on worker 0-0, policy_version 879513 (0.00090) [2022-07-10 20:29:16,671][25689] Fps is (10 sec: 5622.3, 60 sec: 5557.1, 300 sec: 5544.4). Total num frames: 900623360. Throughput: 0: 4944.3. Samples: 900618284. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:16,672][25689] Avg episode reward: [(0, '-1.005')] [2022-07-10 20:29:17,937][26022] Updated weights on worker 0-0, policy_version 879523 (0.00088) [2022-07-10 20:29:19,753][26022] Updated weights on worker 0-0, policy_version 879533 (0.00092) [2022-07-10 20:29:21,543][26022] Updated weights on worker 0-0, policy_version 879543 (0.00082) [2022-07-10 20:29:21,723][25689] Fps is (10 sec: 5494.0, 60 sec: 5548.0, 300 sec: 5543.6). Total num frames: 900652032. Throughput: 0: 5780.5. Samples: 900651860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:21,725][25689] Avg episode reward: [(0, '-1.949')] [2022-07-10 20:29:23,588][26022] Updated weights on worker 0-0, policy_version 879553 (0.00089) [2022-07-10 20:29:25,433][26022] Updated weights on worker 0-0, policy_version 879563 (0.00086) [2022-07-10 20:29:26,728][25689] Fps is (10 sec: 5498.6, 60 sec: 5533.7, 300 sec: 5540.3). Total num frames: 900678656. Throughput: 0: 5839.4. Samples: 900684950. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:26,730][25689] Avg episode reward: [(0, '-2.082')] [2022-07-10 20:29:27,195][26022] Updated weights on worker 0-0, policy_version 879573 (0.00094) [2022-07-10 20:29:29,129][26022] Updated weights on worker 0-0, policy_version 879583 (0.00092) [2022-07-10 20:29:30,953][26022] Updated weights on worker 0-0, policy_version 879593 (0.00097) [2022-07-10 20:29:31,733][25689] Fps is (10 sec: 5523.9, 60 sec: 5554.3, 300 sec: 5543.8). Total num frames: 900707328. Throughput: 0: 4973.1. Samples: 900701452. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:31,735][25689] Avg episode reward: [(0, '-3.211')] [2022-07-10 20:29:32,767][26022] Updated weights on worker 0-0, policy_version 879603 (0.00086) [2022-07-10 20:29:34,743][26022] Updated weights on worker 0-0, policy_version 879613 (0.00084) [2022-07-10 20:29:36,408][26022] Updated weights on worker 0-0, policy_version 879623 (0.00053) [2022-07-10 20:29:36,773][25689] Fps is (10 sec: 5708.4, 60 sec: 5552.8, 300 sec: 5544.0). Total num frames: 900736000. Throughput: 0: 5803.7. Samples: 900734950. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:36,774][25689] Avg episode reward: [(0, '-4.432')] [2022-07-10 20:29:38,371][26022] Updated weights on worker 0-0, policy_version 879633 (0.00100) [2022-07-10 20:29:39,970][26022] Updated weights on worker 0-0, policy_version 879643 (0.00080) [2022-07-10 20:29:41,813][25689] Fps is (10 sec: 5587.3, 60 sec: 5539.3, 300 sec: 5543.5). Total num frames: 900763648. Throughput: 0: 5812.4. Samples: 900768632. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:41,814][25689] Avg episode reward: [(0, '-3.307')] [2022-07-10 20:29:41,870][26022] Updated weights on worker 0-0, policy_version 879653 (0.00085) [2022-07-10 20:29:43,800][26022] Updated weights on worker 0-0, policy_version 879663 (0.00087) [2022-07-10 20:29:45,666][26022] Updated weights on worker 0-0, policy_version 879673 (0.00095) [2022-07-10 20:29:46,826][25689] Fps is (10 sec: 5500.5, 60 sec: 5538.6, 300 sec: 5540.8). Total num frames: 900791296. Throughput: 0: 4987.8. Samples: 900785198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:46,827][25689] Avg episode reward: [(0, '-2.583')] [2022-07-10 20:29:47,534][26022] Updated weights on worker 0-0, policy_version 879683 (0.00089) [2022-07-10 20:29:49,273][26022] Updated weights on worker 0-0, policy_version 879693 (0.00093) [2022-07-10 20:29:51,247][26022] Updated weights on worker 0-0, policy_version 879703 (0.00098) [2022-07-10 20:29:51,843][25689] Fps is (10 sec: 5513.3, 60 sec: 5539.0, 300 sec: 5544.1). Total num frames: 900818944. Throughput: 0: 5829.3. Samples: 900818676. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:51,845][25689] Avg episode reward: [(0, '-2.590')] [2022-07-10 20:29:53,079][26022] Updated weights on worker 0-0, policy_version 879713 (0.00087) [2022-07-10 20:29:54,807][26022] Updated weights on worker 0-0, policy_version 879723 (0.00102) [2022-07-10 20:29:56,679][26022] Updated weights on worker 0-0, policy_version 879733 (0.00082) [2022-07-10 20:29:56,865][25689] Fps is (10 sec: 5508.1, 60 sec: 5521.1, 300 sec: 5535.6). Total num frames: 900846592. Throughput: 0: 5828.7. Samples: 900852060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:29:56,866][25689] Avg episode reward: [(0, '-1.493')] [2022-07-10 20:29:58,666][26022] Updated weights on worker 0-0, policy_version 879743 (0.00085) [2022-07-10 20:30:00,371][26022] Updated weights on worker 0-0, policy_version 879753 (0.00085) [2022-07-10 20:30:01,918][25689] Fps is (10 sec: 5488.4, 60 sec: 5555.6, 300 sec: 5538.9). Total num frames: 900874240. Throughput: 0: 4973.6. Samples: 900868624. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:01,918][25689] Avg episode reward: [(0, '-1.225')] [2022-07-10 20:30:02,603][26022] Updated weights on worker 0-0, policy_version 879763 (0.00093) [2022-07-10 20:30:04,491][26022] Updated weights on worker 0-0, policy_version 879773 (0.00101) [2022-07-10 20:30:06,262][26022] Updated weights on worker 0-0, policy_version 879783 (0.00082) [2022-07-10 20:30:06,928][25689] Fps is (10 sec: 5393.7, 60 sec: 5538.5, 300 sec: 5537.0). Total num frames: 900900864. Throughput: 0: 5714.9. Samples: 900900074. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:06,928][25689] Avg episode reward: [(0, '0.344')] [2022-07-10 20:30:08,152][26022] Updated weights on worker 0-0, policy_version 879793 (0.00085) [2022-07-10 20:30:10,064][26022] Updated weights on worker 0-0, policy_version 879803 (0.00084) [2022-07-10 20:30:11,807][26022] Updated weights on worker 0-0, policy_version 879813 (0.00095) [2022-07-10 20:30:11,947][25689] Fps is (10 sec: 5411.7, 60 sec: 5503.2, 300 sec: 5533.5). Total num frames: 900928512. Throughput: 0: 5707.4. Samples: 900933416. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:11,952][25689] Avg episode reward: [(0, '1.177')] [2022-07-10 20:30:13,772][26022] Updated weights on worker 0-0, policy_version 879823 (0.00087) [2022-07-10 20:30:15,368][26022] Updated weights on worker 0-0, policy_version 879833 (0.00095) [2022-07-10 20:30:16,969][25689] Fps is (10 sec: 5608.9, 60 sec: 5536.3, 300 sec: 5534.3). Total num frames: 900957184. Throughput: 0: 4882.5. Samples: 900950214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:16,969][25689] Avg episode reward: [(0, '1.122')] [2022-07-10 20:30:17,287][26022] Updated weights on worker 0-0, policy_version 879843 (0.00094) [2022-07-10 20:30:19,272][26022] Updated weights on worker 0-0, policy_version 879853 (0.00088) [2022-07-10 20:30:21,047][26022] Updated weights on worker 0-0, policy_version 879863 (0.00086) [2022-07-10 20:30:22,041][25689] Fps is (10 sec: 5579.4, 60 sec: 5517.5, 300 sec: 5536.9). Total num frames: 900984832. Throughput: 0: 5729.8. Samples: 900983924. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:22,041][25689] Avg episode reward: [(0, '1.185')] [2022-07-10 20:30:22,874][26022] Updated weights on worker 0-0, policy_version 879873 (0.00051) [2022-07-10 20:30:24,587][26022] Updated weights on worker 0-0, policy_version 879883 (0.00090) [2022-07-10 20:30:26,618][26022] Updated weights on worker 0-0, policy_version 879893 (0.00087) [2022-07-10 20:30:27,082][25689] Fps is (10 sec: 5569.1, 60 sec: 5548.1, 300 sec: 5537.1). Total num frames: 901013504. Throughput: 0: 5808.9. Samples: 901017146. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:27,083][25689] Avg episode reward: [(0, '1.177')] [2022-07-10 20:30:28,435][26022] Updated weights on worker 0-0, policy_version 879903 (0.00533) [2022-07-10 20:30:30,272][26022] Updated weights on worker 0-0, policy_version 879913 (0.00101) [2022-07-10 20:30:32,087][25689] Fps is (10 sec: 5504.3, 60 sec: 5514.2, 300 sec: 5533.6). Total num frames: 901040128. Throughput: 0: 4986.8. Samples: 901033850. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:32,088][25689] Avg episode reward: [(0, '-0.667')] [2022-07-10 20:30:32,124][26022] Updated weights on worker 0-0, policy_version 879923 (0.00097) [2022-07-10 20:30:33,879][26022] Updated weights on worker 0-0, policy_version 879933 (0.00086) [2022-07-10 20:30:35,967][26022] Updated weights on worker 0-0, policy_version 879943 (0.00085) [2022-07-10 20:30:37,128][25689] Fps is (10 sec: 5402.0, 60 sec: 5497.1, 300 sec: 5527.4). Total num frames: 901067776. Throughput: 0: 5792.7. Samples: 901066992. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:37,134][25689] Avg episode reward: [(0, '-1.901')] [2022-07-10 20:30:37,526][26022] Updated weights on worker 0-0, policy_version 879953 (0.00090) [2022-07-10 20:30:39,669][26022] Updated weights on worker 0-0, policy_version 879963 (0.00090) [2022-07-10 20:30:41,278][26022] Updated weights on worker 0-0, policy_version 879973 (0.00097) [2022-07-10 20:30:42,265][25689] Fps is (10 sec: 5634.1, 60 sec: 5522.2, 300 sec: 5536.0). Total num frames: 901097472. Throughput: 0: 5763.6. Samples: 901100488. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:42,265][25689] Avg episode reward: [(0, '-2.457')] [2022-07-10 20:30:43,305][26022] Updated weights on worker 0-0, policy_version 879983 (0.00093) [2022-07-10 20:30:44,823][26022] Updated weights on worker 0-0, policy_version 879993 (0.00082) [2022-07-10 20:30:46,952][26022] Updated weights on worker 0-0, policy_version 880003 (0.00090) [2022-07-10 20:30:47,329][25689] Fps is (10 sec: 5621.3, 60 sec: 5517.5, 300 sec: 5528.0). Total num frames: 901125120. Throughput: 0: 5778.5. Samples: 901134148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:47,331][25689] Avg episode reward: [(0, '-2.260')] [2022-07-10 20:30:48,540][26022] Updated weights on worker 0-0, policy_version 880013 (0.00084) [2022-07-10 20:30:50,561][26022] Updated weights on worker 0-0, policy_version 880023 (0.00092) [2022-07-10 20:30:52,338][25689] Fps is (10 sec: 5590.9, 60 sec: 5535.1, 300 sec: 5539.1). Total num frames: 901153792. Throughput: 0: 5777.9. Samples: 901150862. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:52,339][25689] Avg episode reward: [(0, '-2.577')] [2022-07-10 20:30:52,339][26022] Updated weights on worker 0-0, policy_version 880033 (0.00079) [2022-07-10 20:30:54,071][26022] Updated weights on worker 0-0, policy_version 880043 (0.00100) [2022-07-10 20:30:56,016][26022] Updated weights on worker 0-0, policy_version 880053 (0.00088) [2022-07-10 20:30:57,396][25689] Fps is (10 sec: 5595.0, 60 sec: 5531.9, 300 sec: 5532.1). Total num frames: 901181440. Throughput: 0: 5785.9. Samples: 901184256. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:30:57,396][25689] Avg episode reward: [(0, '-2.381')] [2022-07-10 20:30:57,999][26022] Updated weights on worker 0-0, policy_version 880063 (0.00088) [2022-07-10 20:30:59,835][26022] Updated weights on worker 0-0, policy_version 880073 (0.00081) [2022-07-10 20:31:01,667][26022] Updated weights on worker 0-0, policy_version 880083 (0.00097) [2022-07-10 20:31:02,493][25689] Fps is (10 sec: 5243.6, 60 sec: 5494.0, 300 sec: 5534.4). Total num frames: 901207040. Throughput: 0: 5711.2. Samples: 901216016. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:02,494][25689] Avg episode reward: [(0, '-1.131')] [2022-07-10 20:31:03,383][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:31:03,394][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000880091_901213184.pth [2022-07-10 20:31:03,395][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000878142_899217408.pth [2022-07-10 20:31:03,795][26022] Updated weights on worker 0-0, policy_version 880093 (0.00085) [2022-07-10 20:31:05,569][26022] Updated weights on worker 0-0, policy_version 880103 (0.00092) [2022-07-10 20:31:07,460][26022] Updated weights on worker 0-0, policy_version 880113 (0.00086) [2022-07-10 20:31:07,508][25689] Fps is (10 sec: 5366.7, 60 sec: 5527.3, 300 sec: 5534.8). Total num frames: 901235712. Throughput: 0: 4870.9. Samples: 901232438. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:07,509][25689] Avg episode reward: [(0, '0.771')] [2022-07-10 20:31:09,309][26022] Updated weights on worker 0-0, policy_version 880123 (0.00084) [2022-07-10 20:31:11,087][26022] Updated weights on worker 0-0, policy_version 880133 (0.00086) [2022-07-10 20:31:12,537][25689] Fps is (10 sec: 5607.5, 60 sec: 5526.4, 300 sec: 5531.0). Total num frames: 901263360. Throughput: 0: 5693.3. Samples: 901265858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:12,537][25689] Avg episode reward: [(0, '-0.044')] [2022-07-10 20:31:12,986][26022] Updated weights on worker 0-0, policy_version 880143 (0.00093) [2022-07-10 20:31:14,688][26022] Updated weights on worker 0-0, policy_version 880153 (0.00085) [2022-07-10 20:31:16,683][26022] Updated weights on worker 0-0, policy_version 880163 (0.00089) [2022-07-10 20:31:17,554][25689] Fps is (10 sec: 5606.3, 60 sec: 5526.9, 300 sec: 5532.8). Total num frames: 901292032. Throughput: 0: 5718.0. Samples: 901299522. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:17,555][25689] Avg episode reward: [(0, '-0.118')] [2022-07-10 20:31:18,402][26022] Updated weights on worker 0-0, policy_version 880173 (0.00087) [2022-07-10 20:31:20,292][26022] Updated weights on worker 0-0, policy_version 880183 (0.00088) [2022-07-10 20:31:22,004][26022] Updated weights on worker 0-0, policy_version 880193 (0.00079) [2022-07-10 20:31:22,698][25689] Fps is (10 sec: 5643.5, 60 sec: 5537.2, 300 sec: 5537.3). Total num frames: 901320704. Throughput: 0: 4963.6. Samples: 901316304. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:22,698][25689] Avg episode reward: [(0, '0.125')] [2022-07-10 20:31:24,092][26022] Updated weights on worker 0-0, policy_version 880203 (0.00088) [2022-07-10 20:31:25,583][26022] Updated weights on worker 0-0, policy_version 880213 (0.00084) [2022-07-10 20:31:27,717][25689] Fps is (10 sec: 5440.9, 60 sec: 5505.4, 300 sec: 5530.1). Total num frames: 901347328. Throughput: 0: 5808.2. Samples: 901349816. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:27,718][25689] Avg episode reward: [(0, '-0.023')] [2022-07-10 20:31:27,782][26022] Updated weights on worker 0-0, policy_version 880223 (0.00087) [2022-07-10 20:31:29,372][26022] Updated weights on worker 0-0, policy_version 880233 (0.00092) [2022-07-10 20:31:31,439][26022] Updated weights on worker 0-0, policy_version 880243 (0.00082) [2022-07-10 20:31:32,731][25689] Fps is (10 sec: 5613.7, 60 sec: 5555.3, 300 sec: 5534.0). Total num frames: 901377024. Throughput: 0: 5805.6. Samples: 901383094. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:32,732][25689] Avg episode reward: [(0, '0.094')] [2022-07-10 20:31:33,134][26022] Updated weights on worker 0-0, policy_version 880253 (0.00092) [2022-07-10 20:31:34,920][26022] Updated weights on worker 0-0, policy_version 880263 (0.00085) [2022-07-10 20:31:36,810][26022] Updated weights on worker 0-0, policy_version 880273 (0.00087) [2022-07-10 20:31:37,737][25689] Fps is (10 sec: 5825.4, 60 sec: 5575.4, 300 sec: 5542.9). Total num frames: 901405696. Throughput: 0: 4973.5. Samples: 901399900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:37,737][25689] Avg episode reward: [(0, '0.619')] [2022-07-10 20:31:38,795][26022] Updated weights on worker 0-0, policy_version 880283 (0.00089) [2022-07-10 20:31:40,426][26022] Updated weights on worker 0-0, policy_version 880293 (0.00090) [2022-07-10 20:31:42,485][26022] Updated weights on worker 0-0, policy_version 880303 (0.00089) [2022-07-10 20:31:42,855][25689] Fps is (10 sec: 5461.9, 60 sec: 5526.5, 300 sec: 5534.2). Total num frames: 901432320. Throughput: 0: 5808.8. Samples: 901433390. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:42,855][25689] Avg episode reward: [(0, '0.817')] [2022-07-10 20:31:44,050][26022] Updated weights on worker 0-0, policy_version 880313 (0.00083) [2022-07-10 20:31:46,066][26022] Updated weights on worker 0-0, policy_version 880323 (0.00089) [2022-07-10 20:31:47,839][26022] Updated weights on worker 0-0, policy_version 880333 (0.00083) [2022-07-10 20:31:47,928][25689] Fps is (10 sec: 5425.8, 60 sec: 5542.5, 300 sec: 5537.1). Total num frames: 901460992. Throughput: 0: 5792.0. Samples: 901466878. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:47,929][25689] Avg episode reward: [(0, '0.888')] [2022-07-10 20:31:49,698][26022] Updated weights on worker 0-0, policy_version 880343 (0.00093) [2022-07-10 20:31:51,538][26022] Updated weights on worker 0-0, policy_version 880353 (0.00094) [2022-07-10 20:31:52,987][25689] Fps is (10 sec: 5558.3, 60 sec: 5521.1, 300 sec: 5533.6). Total num frames: 901488640. Throughput: 0: 4970.4. Samples: 901483778. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:52,988][25689] Avg episode reward: [(0, '1.075')] [2022-07-10 20:31:53,419][26022] Updated weights on worker 0-0, policy_version 880363 (0.00089) [2022-07-10 20:31:55,064][26022] Updated weights on worker 0-0, policy_version 880373 (0.00084) [2022-07-10 20:31:57,075][26022] Updated weights on worker 0-0, policy_version 880383 (0.00094) [2022-07-10 20:31:57,995][25689] Fps is (10 sec: 5594.7, 60 sec: 5542.5, 300 sec: 5535.1). Total num frames: 901517312. Throughput: 0: 5792.0. Samples: 901517236. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:31:57,997][25689] Avg episode reward: [(0, '1.209')] [2022-07-10 20:31:58,785][26022] Updated weights on worker 0-0, policy_version 880393 (0.01661) [2022-07-10 20:32:00,833][26022] Updated weights on worker 0-0, policy_version 880403 (0.00087) [2022-07-10 20:32:02,970][26022] Updated weights on worker 0-0, policy_version 880413 (0.00940) [2022-07-10 20:32:03,061][25689] Fps is (10 sec: 5387.5, 60 sec: 5545.4, 300 sec: 5534.1). Total num frames: 901542912. Throughput: 0: 5681.0. Samples: 901548182. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:03,061][25689] Avg episode reward: [(0, '1.160')] [2022-07-10 20:32:04,912][26022] Updated weights on worker 0-0, policy_version 880423 (0.00090) [2022-07-10 20:32:06,695][26022] Updated weights on worker 0-0, policy_version 880433 (0.00089) [2022-07-10 20:32:08,085][25689] Fps is (10 sec: 5378.7, 60 sec: 5544.6, 300 sec: 5534.0). Total num frames: 901571584. Throughput: 0: 4863.2. Samples: 901564906. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:08,086][25689] Avg episode reward: [(0, '0.721')] [2022-07-10 20:32:08,449][26022] Updated weights on worker 0-0, policy_version 880443 (0.00089) [2022-07-10 20:32:10,356][26022] Updated weights on worker 0-0, policy_version 880453 (0.00089) [2022-07-10 20:32:12,299][26022] Updated weights on worker 0-0, policy_version 880463 (0.00089) [2022-07-10 20:32:13,098][25689] Fps is (10 sec: 5509.1, 60 sec: 5529.1, 300 sec: 5530.9). Total num frames: 901598208. Throughput: 0: 5704.1. Samples: 901598492. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:13,099][25689] Avg episode reward: [(0, '0.300')] [2022-07-10 20:32:13,959][26022] Updated weights on worker 0-0, policy_version 880473 (0.00093) [2022-07-10 20:32:15,912][26022] Updated weights on worker 0-0, policy_version 880483 (0.00085) [2022-07-10 20:32:17,767][26022] Updated weights on worker 0-0, policy_version 880493 (0.00090) [2022-07-10 20:32:18,100][25689] Fps is (10 sec: 5316.9, 60 sec: 5496.7, 300 sec: 5529.6). Total num frames: 901624832. Throughput: 0: 5698.4. Samples: 901631802. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:18,101][25689] Avg episode reward: [(0, '0.191')] [2022-07-10 20:32:19,550][26022] Updated weights on worker 0-0, policy_version 880503 (0.00084) [2022-07-10 20:32:21,490][26022] Updated weights on worker 0-0, policy_version 880513 (0.00089) [2022-07-10 20:32:23,072][26022] Updated weights on worker 0-0, policy_version 880523 (0.00087) [2022-07-10 20:32:23,168][25689] Fps is (10 sec: 5694.6, 60 sec: 5537.4, 300 sec: 5535.8). Total num frames: 901655552. Throughput: 0: 4990.0. Samples: 901648516. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:23,168][25689] Avg episode reward: [(0, '-0.161')] [2022-07-10 20:32:25,243][26022] Updated weights on worker 0-0, policy_version 880533 (0.00091) [2022-07-10 20:32:26,780][26022] Updated weights on worker 0-0, policy_version 880543 (0.00092) [2022-07-10 20:32:28,189][25689] Fps is (10 sec: 5683.9, 60 sec: 5537.3, 300 sec: 5529.6). Total num frames: 901682176. Throughput: 0: 5838.8. Samples: 901682288. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:28,189][25689] Avg episode reward: [(0, '0.129')] [2022-07-10 20:32:28,787][26022] Updated weights on worker 0-0, policy_version 880553 (0.00100) [2022-07-10 20:32:30,435][26022] Updated weights on worker 0-0, policy_version 880563 (0.00090) [2022-07-10 20:32:32,579][26022] Updated weights on worker 0-0, policy_version 880573 (0.00094) [2022-07-10 20:32:33,230][25689] Fps is (10 sec: 5393.7, 60 sec: 5500.9, 300 sec: 5529.0). Total num frames: 901709824. Throughput: 0: 5819.2. Samples: 901715644. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:33,231][25689] Avg episode reward: [(0, '-0.200')] [2022-07-10 20:32:34,185][26022] Updated weights on worker 0-0, policy_version 880583 (0.00087) [2022-07-10 20:32:36,198][26022] Updated weights on worker 0-0, policy_version 880593 (0.00091) [2022-07-10 20:32:37,773][26022] Updated weights on worker 0-0, policy_version 880603 (0.00088) [2022-07-10 20:32:38,234][25689] Fps is (10 sec: 5708.5, 60 sec: 5518.0, 300 sec: 5536.7). Total num frames: 901739520. Throughput: 0: 5000.9. Samples: 901732492. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:38,235][25689] Avg episode reward: [(0, '0.078')] [2022-07-10 20:32:39,875][26022] Updated weights on worker 0-0, policy_version 880613 (0.00086) [2022-07-10 20:32:41,491][26022] Updated weights on worker 0-0, policy_version 880623 (0.00093) [2022-07-10 20:32:43,376][25689] Fps is (10 sec: 5550.8, 60 sec: 5515.8, 300 sec: 5528.6). Total num frames: 901766144. Throughput: 0: 5813.5. Samples: 901765998. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:43,378][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 20:32:43,482][26022] Updated weights on worker 0-0, policy_version 880633 (0.00082) [2022-07-10 20:32:45,311][26022] Updated weights on worker 0-0, policy_version 880643 (0.00873) [2022-07-10 20:32:47,022][26022] Updated weights on worker 0-0, policy_version 880653 (0.00497) [2022-07-10 20:32:48,382][25689] Fps is (10 sec: 5448.9, 60 sec: 5521.9, 300 sec: 5532.8). Total num frames: 901794816. Throughput: 0: 5805.8. Samples: 901799528. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 20:32:48,383][25689] Avg episode reward: [(0, '0.302')] [2022-07-10 20:32:48,932][26022] Updated weights on worker 0-0, policy_version 880663 (0.00090) [2022-07-10 20:32:50,601][26022] Updated weights on worker 0-0, policy_version 880673 (0.00087) [2022-07-10 20:32:52,610][26022] Updated weights on worker 0-0, policy_version 880683 (0.00090) [2022-07-10 20:32:53,465][25689] Fps is (10 sec: 5785.5, 60 sec: 5553.6, 300 sec: 5535.8). Total num frames: 901824512. Throughput: 0: 5800.6. Samples: 901833020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:32:53,466][25689] Avg episode reward: [(0, '-0.903')] [2022-07-10 20:32:54,304][26022] Updated weights on worker 0-0, policy_version 880693 (0.00088) [2022-07-10 20:32:56,256][26022] Updated weights on worker 0-0, policy_version 880703 (0.00091) [2022-07-10 20:32:58,158][26022] Updated weights on worker 0-0, policy_version 880713 (0.00088) [2022-07-10 20:32:58,514][25689] Fps is (10 sec: 5558.6, 60 sec: 5516.0, 300 sec: 5529.2). Total num frames: 901851136. Throughput: 0: 5786.4. Samples: 901849840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:32:58,516][25689] Avg episode reward: [(0, '-2.209')] [2022-07-10 20:32:59,930][26022] Updated weights on worker 0-0, policy_version 880723 (0.00093) [2022-07-10 20:33:02,146][26022] Updated weights on worker 0-0, policy_version 880733 (0.00087) [2022-07-10 20:33:03,460][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:33:03,471][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000880740_901877760.pth [2022-07-10 20:33:03,472][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000878792_899883008.pth [2022-07-10 20:33:03,558][25689] Fps is (10 sec: 5377.2, 60 sec: 5551.9, 300 sec: 5538.9). Total num frames: 901878784. Throughput: 0: 5696.4. Samples: 901880960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:03,559][25689] Avg episode reward: [(0, '-1.937')] [2022-07-10 20:33:04,063][26022] Updated weights on worker 0-0, policy_version 880743 (0.00084) [2022-07-10 20:33:05,913][26022] Updated weights on worker 0-0, policy_version 880753 (0.00083) [2022-07-10 20:33:07,870][26022] Updated weights on worker 0-0, policy_version 880763 (0.00098) [2022-07-10 20:33:08,589][25689] Fps is (10 sec: 5488.7, 60 sec: 5534.3, 300 sec: 5536.2). Total num frames: 901906432. Throughput: 0: 5687.1. Samples: 901914444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:08,589][25689] Avg episode reward: [(0, '-1.978')] [2022-07-10 20:33:09,518][26022] Updated weights on worker 0-0, policy_version 880773 (0.00083) [2022-07-10 20:33:11,391][26022] Updated weights on worker 0-0, policy_version 880783 (0.00088) [2022-07-10 20:33:13,413][26022] Updated weights on worker 0-0, policy_version 880793 (0.00080) [2022-07-10 20:33:13,623][25689] Fps is (10 sec: 5493.9, 60 sec: 5549.3, 300 sec: 5536.8). Total num frames: 901934080. Throughput: 0: 4868.4. Samples: 901931156. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:13,623][25689] Avg episode reward: [(0, '-2.012')] [2022-07-10 20:33:15,204][26022] Updated weights on worker 0-0, policy_version 880803 (0.00094) [2022-07-10 20:33:16,983][26022] Updated weights on worker 0-0, policy_version 880813 (0.00095) [2022-07-10 20:33:18,672][25689] Fps is (10 sec: 5484.0, 60 sec: 5561.9, 300 sec: 5531.5). Total num frames: 901961728. Throughput: 0: 5694.7. Samples: 901964632. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:18,674][25689] Avg episode reward: [(0, '-1.876')] [2022-07-10 20:33:18,827][26022] Updated weights on worker 0-0, policy_version 880823 (0.00086) [2022-07-10 20:33:20,569][26022] Updated weights on worker 0-0, policy_version 880833 (0.00089) [2022-07-10 20:33:22,591][26022] Updated weights on worker 0-0, policy_version 880843 (0.00087) [2022-07-10 20:33:23,717][25689] Fps is (10 sec: 5579.8, 60 sec: 5530.2, 300 sec: 5534.8). Total num frames: 901990400. Throughput: 0: 5793.2. Samples: 901997742. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:23,717][25689] Avg episode reward: [(0, '-2.176')] [2022-07-10 20:33:24,365][26022] Updated weights on worker 0-0, policy_version 880853 (0.00085) [2022-07-10 20:33:26,224][26022] Updated weights on worker 0-0, policy_version 880863 (0.00093) [2022-07-10 20:33:28,058][26022] Updated weights on worker 0-0, policy_version 880873 (0.00086) [2022-07-10 20:33:28,724][25689] Fps is (10 sec: 5500.8, 60 sec: 5531.4, 300 sec: 5532.0). Total num frames: 902017024. Throughput: 0: 4955.0. Samples: 902014216. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:28,725][25689] Avg episode reward: [(0, '-1.788')] [2022-07-10 20:33:30,045][26022] Updated weights on worker 0-0, policy_version 880883 (0.00085) [2022-07-10 20:33:31,892][26022] Updated weights on worker 0-0, policy_version 880893 (0.00085) [2022-07-10 20:33:33,680][26022] Updated weights on worker 0-0, policy_version 880903 (0.00099) [2022-07-10 20:33:33,767][25689] Fps is (10 sec: 5400.2, 60 sec: 5531.3, 300 sec: 5528.2). Total num frames: 902044672. Throughput: 0: 5773.5. Samples: 902047456. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:33,767][25689] Avg episode reward: [(0, '-2.635')] [2022-07-10 20:33:35,467][26022] Updated weights on worker 0-0, policy_version 880913 (0.00095) [2022-07-10 20:33:37,501][26022] Updated weights on worker 0-0, policy_version 880923 (0.00085) [2022-07-10 20:33:38,789][25689] Fps is (10 sec: 5595.7, 60 sec: 5512.7, 300 sec: 5529.3). Total num frames: 902073344. Throughput: 0: 5791.3. Samples: 902081138. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:38,790][25689] Avg episode reward: [(0, '-2.536')] [2022-07-10 20:33:39,171][26022] Updated weights on worker 0-0, policy_version 880933 (0.00085) [2022-07-10 20:33:41,090][26022] Updated weights on worker 0-0, policy_version 880943 (0.00097) [2022-07-10 20:33:42,731][26022] Updated weights on worker 0-0, policy_version 880953 (0.00084) [2022-07-10 20:33:43,824][25689] Fps is (10 sec: 5600.2, 60 sec: 5539.5, 300 sec: 5528.7). Total num frames: 902100992. Throughput: 0: 4980.3. Samples: 902097880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:43,824][25689] Avg episode reward: [(0, '-4.079')] [2022-07-10 20:33:44,554][26022] Updated weights on worker 0-0, policy_version 880963 (0.00092) [2022-07-10 20:33:46,583][26022] Updated weights on worker 0-0, policy_version 880973 (0.00088) [2022-07-10 20:33:48,190][26022] Updated weights on worker 0-0, policy_version 880983 (0.00085) [2022-07-10 20:33:48,830][25689] Fps is (10 sec: 5711.0, 60 sec: 5556.4, 300 sec: 5535.9). Total num frames: 902130688. Throughput: 0: 5844.1. Samples: 902131718. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:48,831][25689] Avg episode reward: [(0, '-3.638')] [2022-07-10 20:33:50,186][26022] Updated weights on worker 0-0, policy_version 880993 (0.00092) [2022-07-10 20:33:51,824][26022] Updated weights on worker 0-0, policy_version 881003 (0.00085) [2022-07-10 20:33:53,845][25689] Fps is (10 sec: 5518.0, 60 sec: 5494.8, 300 sec: 5525.5). Total num frames: 902156288. Throughput: 0: 5856.7. Samples: 902165046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:53,845][25689] Avg episode reward: [(0, '-1.455')] [2022-07-10 20:33:53,924][26022] Updated weights on worker 0-0, policy_version 881013 (0.00086) [2022-07-10 20:33:55,579][26022] Updated weights on worker 0-0, policy_version 881023 (0.00083) [2022-07-10 20:33:57,459][26022] Updated weights on worker 0-0, policy_version 881033 (0.00084) [2022-07-10 20:33:58,854][25689] Fps is (10 sec: 5414.4, 60 sec: 5532.4, 300 sec: 5536.8). Total num frames: 902184960. Throughput: 0: 5023.8. Samples: 902181938. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:33:58,855][25689] Avg episode reward: [(0, '-1.399')] [2022-07-10 20:33:59,397][26022] Updated weights on worker 0-0, policy_version 881043 (0.00071) [2022-07-10 20:34:01,032][26022] Updated weights on worker 0-0, policy_version 881053 (0.00086) [2022-07-10 20:34:03,553][26022] Updated weights on worker 0-0, policy_version 881063 (0.00081) [2022-07-10 20:34:03,893][25689] Fps is (10 sec: 5400.9, 60 sec: 5498.9, 300 sec: 5529.3). Total num frames: 902210560. Throughput: 0: 5744.4. Samples: 902213168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:03,894][25689] Avg episode reward: [(0, '-2.067')] [2022-07-10 20:34:05,150][26022] Updated weights on worker 0-0, policy_version 881073 (0.00085) [2022-07-10 20:34:07,162][26022] Updated weights on worker 0-0, policy_version 881083 (0.00082) [2022-07-10 20:34:08,721][26022] Updated weights on worker 0-0, policy_version 881093 (0.00085) [2022-07-10 20:34:08,922][25689] Fps is (10 sec: 5492.3, 60 sec: 5533.0, 300 sec: 5528.8). Total num frames: 902240256. Throughput: 0: 5730.3. Samples: 902246848. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:08,923][25689] Avg episode reward: [(0, '-1.939')] [2022-07-10 20:34:10,716][26022] Updated weights on worker 0-0, policy_version 881103 (0.00088) [2022-07-10 20:34:12,567][26022] Updated weights on worker 0-0, policy_version 881113 (0.00096) [2022-07-10 20:34:13,950][25689] Fps is (10 sec: 5600.0, 60 sec: 5516.6, 300 sec: 5528.5). Total num frames: 902266880. Throughput: 0: 4900.4. Samples: 902263572. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:13,951][25689] Avg episode reward: [(0, '-0.290')] [2022-07-10 20:34:14,646][26022] Updated weights on worker 0-0, policy_version 881123 (0.00096) [2022-07-10 20:34:16,242][26022] Updated weights on worker 0-0, policy_version 881133 (0.00090) [2022-07-10 20:34:18,180][26022] Updated weights on worker 0-0, policy_version 881143 (0.00091) [2022-07-10 20:34:18,971][25689] Fps is (10 sec: 5400.9, 60 sec: 5519.2, 300 sec: 5525.7). Total num frames: 902294528. Throughput: 0: 5704.2. Samples: 902296688. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:18,972][25689] Avg episode reward: [(0, '-0.458')] [2022-07-10 20:34:19,764][26022] Updated weights on worker 0-0, policy_version 881153 (0.00093) [2022-07-10 20:34:22,063][26022] Updated weights on worker 0-0, policy_version 881163 (0.00098) [2022-07-10 20:34:23,615][26022] Updated weights on worker 0-0, policy_version 881173 (0.00088) [2022-07-10 20:34:24,009][25689] Fps is (10 sec: 5599.3, 60 sec: 5519.8, 300 sec: 5532.0). Total num frames: 902323200. Throughput: 0: 5789.9. Samples: 902329634. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:24,009][25689] Avg episode reward: [(0, '-0.589')] [2022-07-10 20:34:25,570][26022] Updated weights on worker 0-0, policy_version 881183 (0.00062) [2022-07-10 20:34:27,317][26022] Updated weights on worker 0-0, policy_version 881193 (0.00091) [2022-07-10 20:34:29,026][25689] Fps is (10 sec: 5600.9, 60 sec: 5535.9, 300 sec: 5528.3). Total num frames: 902350848. Throughput: 0: 4945.5. Samples: 902346274. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:29,027][25689] Avg episode reward: [(0, '-0.465')] [2022-07-10 20:34:29,492][26022] Updated weights on worker 0-0, policy_version 881203 (0.00091) [2022-07-10 20:34:31,186][26022] Updated weights on worker 0-0, policy_version 881213 (0.00089) [2022-07-10 20:34:33,175][26022] Updated weights on worker 0-0, policy_version 881223 (0.00094) [2022-07-10 20:34:34,031][25689] Fps is (10 sec: 5517.2, 60 sec: 5539.3, 300 sec: 5525.5). Total num frames: 902378496. Throughput: 0: 5775.3. Samples: 902379546. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:34,032][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 20:34:34,700][26022] Updated weights on worker 0-0, policy_version 881233 (0.00085) [2022-07-10 20:34:36,968][26022] Updated weights on worker 0-0, policy_version 881243 (0.00093) [2022-07-10 20:34:38,507][26022] Updated weights on worker 0-0, policy_version 881253 (0.00093) [2022-07-10 20:34:39,050][25689] Fps is (10 sec: 5312.3, 60 sec: 5488.7, 300 sec: 5519.0). Total num frames: 902404096. Throughput: 0: 5785.0. Samples: 902412846. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:39,050][25689] Avg episode reward: [(0, '0.135')] [2022-07-10 20:34:40,417][26022] Updated weights on worker 0-0, policy_version 881263 (0.00052) [2022-07-10 20:34:42,214][26022] Updated weights on worker 0-0, policy_version 881273 (0.00094) [2022-07-10 20:34:44,014][26022] Updated weights on worker 0-0, policy_version 881283 (0.00083) [2022-07-10 20:34:44,143][25689] Fps is (10 sec: 5468.7, 60 sec: 5517.3, 300 sec: 5524.4). Total num frames: 902433792. Throughput: 0: 4964.6. Samples: 902429590. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:44,143][25689] Avg episode reward: [(0, '0.304')] [2022-07-10 20:34:46,002][26022] Updated weights on worker 0-0, policy_version 881293 (0.00090) [2022-07-10 20:34:47,800][26022] Updated weights on worker 0-0, policy_version 881303 (0.00092) [2022-07-10 20:34:49,148][25689] Fps is (10 sec: 5678.4, 60 sec: 5483.5, 300 sec: 5524.6). Total num frames: 902461440. Throughput: 0: 5797.8. Samples: 902462938. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:49,149][25689] Avg episode reward: [(0, '0.475')] [2022-07-10 20:34:49,531][26022] Updated weights on worker 0-0, policy_version 881313 (0.00090) [2022-07-10 20:34:51,650][26022] Updated weights on worker 0-0, policy_version 881323 (0.00094) [2022-07-10 20:34:53,011][26022] Updated weights on worker 0-0, policy_version 881333 (0.00096) [2022-07-10 20:34:54,213][25689] Fps is (10 sec: 5491.1, 60 sec: 5512.8, 300 sec: 5523.8). Total num frames: 902489088. Throughput: 0: 5799.2. Samples: 902496582. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:54,213][25689] Avg episode reward: [(0, '0.512')] [2022-07-10 20:34:55,182][26022] Updated weights on worker 0-0, policy_version 881343 (0.00777) [2022-07-10 20:34:56,903][26022] Updated weights on worker 0-0, policy_version 881353 (0.00094) [2022-07-10 20:34:58,785][26022] Updated weights on worker 0-0, policy_version 881363 (0.00086) [2022-07-10 20:34:59,267][25689] Fps is (10 sec: 5566.2, 60 sec: 5508.8, 300 sec: 5527.2). Total num frames: 902517760. Throughput: 0: 4963.3. Samples: 902513188. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:34:59,267][25689] Avg episode reward: [(0, '1.732')] [2022-07-10 20:35:00,599][26022] Updated weights on worker 0-0, policy_version 881373 (0.00084) [2022-07-10 20:35:02,876][26022] Updated weights on worker 0-0, policy_version 881383 (0.00088) [2022-07-10 20:35:03,527][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:35:03,543][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000881386_902539264.pth [2022-07-10 20:35:03,544][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000879442_900548608.pth [2022-07-10 20:35:04,331][25689] Fps is (10 sec: 5465.2, 60 sec: 5523.5, 300 sec: 5526.2). Total num frames: 902544384. Throughput: 0: 5693.0. Samples: 902544520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:04,331][25689] Avg episode reward: [(0, '1.821')] [2022-07-10 20:35:04,785][26022] Updated weights on worker 0-0, policy_version 881393 (0.00079) [2022-07-10 20:35:06,618][26022] Updated weights on worker 0-0, policy_version 881403 (0.00842) [2022-07-10 20:35:08,380][26022] Updated weights on worker 0-0, policy_version 881413 (0.00096) [2022-07-10 20:35:09,362][25689] Fps is (10 sec: 5375.9, 60 sec: 5489.3, 300 sec: 5526.0). Total num frames: 902572032. Throughput: 0: 5679.1. Samples: 902577734. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:09,363][25689] Avg episode reward: [(0, '1.918')] [2022-07-10 20:35:10,312][26022] Updated weights on worker 0-0, policy_version 881423 (0.00091) [2022-07-10 20:35:12,096][26022] Updated weights on worker 0-0, policy_version 881433 (0.00100) [2022-07-10 20:35:13,951][26022] Updated weights on worker 0-0, policy_version 881443 (0.00094) [2022-07-10 20:35:14,449][25689] Fps is (10 sec: 5465.1, 60 sec: 5501.0, 300 sec: 5521.4). Total num frames: 902599680. Throughput: 0: 4845.2. Samples: 902594630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:14,449][25689] Avg episode reward: [(0, '1.697')] [2022-07-10 20:35:15,595][26022] Updated weights on worker 0-0, policy_version 881453 (0.00081) [2022-07-10 20:35:17,512][26022] Updated weights on worker 0-0, policy_version 881463 (0.00093) [2022-07-10 20:35:19,270][26022] Updated weights on worker 0-0, policy_version 881473 (0.00103) [2022-07-10 20:35:19,525][25689] Fps is (10 sec: 5642.4, 60 sec: 5529.7, 300 sec: 5528.2). Total num frames: 902629376. Throughput: 0: 5679.6. Samples: 902628248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:19,526][25689] Avg episode reward: [(0, '1.259')] [2022-07-10 20:35:21,293][26022] Updated weights on worker 0-0, policy_version 881483 (0.00080) [2022-07-10 20:35:22,861][26022] Updated weights on worker 0-0, policy_version 881493 (0.00092) [2022-07-10 20:35:24,631][25689] Fps is (10 sec: 5531.3, 60 sec: 5489.8, 300 sec: 5520.1). Total num frames: 902656000. Throughput: 0: 5783.4. Samples: 902661924. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:24,631][25689] Avg episode reward: [(0, '0.845')] [2022-07-10 20:35:24,872][26022] Updated weights on worker 0-0, policy_version 881503 (0.00086) [2022-07-10 20:35:26,594][26022] Updated weights on worker 0-0, policy_version 881513 (0.00088) [2022-07-10 20:35:28,550][26022] Updated weights on worker 0-0, policy_version 881523 (0.00091) [2022-07-10 20:35:29,735][25689] Fps is (10 sec: 5416.1, 60 sec: 5498.8, 300 sec: 5525.1). Total num frames: 902684672. Throughput: 0: 5780.5. Samples: 902695498. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:29,735][25689] Avg episode reward: [(0, '0.665')] [2022-07-10 20:35:30,457][26022] Updated weights on worker 0-0, policy_version 881533 (0.00091) [2022-07-10 20:35:32,157][26022] Updated weights on worker 0-0, policy_version 881543 (0.00092) [2022-07-10 20:35:34,091][26022] Updated weights on worker 0-0, policy_version 881553 (0.00084) [2022-07-10 20:35:34,795][25689] Fps is (10 sec: 5742.7, 60 sec: 5527.5, 300 sec: 5531.7). Total num frames: 902714368. Throughput: 0: 5766.4. Samples: 902711952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:34,795][25689] Avg episode reward: [(0, '0.637')] [2022-07-10 20:35:36,202][26022] Updated weights on worker 0-0, policy_version 881563 (0.00090) [2022-07-10 20:35:37,808][26022] Updated weights on worker 0-0, policy_version 881573 (0.00086) [2022-07-10 20:35:39,798][25689] Fps is (10 sec: 5494.9, 60 sec: 5528.9, 300 sec: 5520.4). Total num frames: 902739968. Throughput: 0: 5757.2. Samples: 902744962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:39,798][25689] Avg episode reward: [(0, '-0.523')] [2022-07-10 20:35:39,985][26022] Updated weights on worker 0-0, policy_version 881583 (0.00085) [2022-07-10 20:35:41,327][26022] Updated weights on worker 0-0, policy_version 881593 (0.00086) [2022-07-10 20:35:43,523][26022] Updated weights on worker 0-0, policy_version 881603 (0.00090) [2022-07-10 20:35:44,856][25689] Fps is (10 sec: 5597.4, 60 sec: 5548.9, 300 sec: 5530.8). Total num frames: 902770688. Throughput: 0: 5749.5. Samples: 902778212. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:44,858][25689] Avg episode reward: [(0, '-1.377')] [2022-07-10 20:35:45,143][26022] Updated weights on worker 0-0, policy_version 881613 (0.00087) [2022-07-10 20:35:47,274][26022] Updated weights on worker 0-0, policy_version 881623 (0.00119) [2022-07-10 20:35:48,856][26022] Updated weights on worker 0-0, policy_version 881633 (0.00088) [2022-07-10 20:35:49,871][25689] Fps is (10 sec: 5692.5, 60 sec: 5531.2, 300 sec: 5523.8). Total num frames: 902797312. Throughput: 0: 4937.0. Samples: 902794912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:49,872][25689] Avg episode reward: [(0, '-1.124')] [2022-07-10 20:35:50,847][26022] Updated weights on worker 0-0, policy_version 881643 (0.00081) [2022-07-10 20:35:52,520][26022] Updated weights on worker 0-0, policy_version 881653 (0.00095) [2022-07-10 20:35:54,563][26022] Updated weights on worker 0-0, policy_version 881663 (0.00085) [2022-07-10 20:35:54,883][25689] Fps is (10 sec: 5413.0, 60 sec: 5536.1, 300 sec: 5524.7). Total num frames: 902824960. Throughput: 0: 5789.8. Samples: 902828258. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:54,883][25689] Avg episode reward: [(0, '-0.897')] [2022-07-10 20:35:56,211][26022] Updated weights on worker 0-0, policy_version 881673 (0.00084) [2022-07-10 20:35:58,259][26022] Updated weights on worker 0-0, policy_version 881683 (0.00093) [2022-07-10 20:35:59,906][25689] Fps is (10 sec: 5408.4, 60 sec: 5505.1, 300 sec: 5529.5). Total num frames: 902851584. Throughput: 0: 5800.1. Samples: 902861592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:35:59,907][25689] Avg episode reward: [(0, '-0.969')] [2022-07-10 20:36:00,023][26022] Updated weights on worker 0-0, policy_version 881693 (0.00083) [2022-07-10 20:36:01,905][26022] Updated weights on worker 0-0, policy_version 881703 (0.00089) [2022-07-10 20:36:04,222][26022] Updated weights on worker 0-0, policy_version 881713 (0.00084) [2022-07-10 20:36:05,021][25689] Fps is (10 sec: 5352.9, 60 sec: 5517.3, 300 sec: 5524.2). Total num frames: 902879232. Throughput: 0: 4847.3. Samples: 902875958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:05,022][25689] Avg episode reward: [(0, '-1.139')] [2022-07-10 20:36:05,808][26022] Updated weights on worker 0-0, policy_version 881723 (0.00089) [2022-07-10 20:36:07,770][26022] Updated weights on worker 0-0, policy_version 881733 (0.00096) [2022-07-10 20:36:09,645][26022] Updated weights on worker 0-0, policy_version 881743 (0.00090) [2022-07-10 20:36:10,074][25689] Fps is (10 sec: 5337.3, 60 sec: 5498.5, 300 sec: 5520.3). Total num frames: 902905856. Throughput: 0: 5668.8. Samples: 902909440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:10,075][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 20:36:11,611][26022] Updated weights on worker 0-0, policy_version 881753 (0.00097) [2022-07-10 20:36:13,363][26022] Updated weights on worker 0-0, policy_version 881763 (0.00085) [2022-07-10 20:36:15,168][25689] Fps is (10 sec: 5449.9, 60 sec: 5514.7, 300 sec: 5518.9). Total num frames: 902934528. Throughput: 0: 5635.1. Samples: 902942566. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:15,168][25689] Avg episode reward: [(0, '0.384')] [2022-07-10 20:36:15,230][26022] Updated weights on worker 0-0, policy_version 881773 (0.00092) [2022-07-10 20:36:16,974][26022] Updated weights on worker 0-0, policy_version 881783 (0.00083) [2022-07-10 20:36:19,075][26022] Updated weights on worker 0-0, policy_version 881793 (0.00084) [2022-07-10 20:36:20,225][25689] Fps is (10 sec: 5649.5, 60 sec: 5499.6, 300 sec: 5520.5). Total num frames: 902963200. Throughput: 0: 4803.4. Samples: 902959190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:20,225][25689] Avg episode reward: [(0, '0.082')] [2022-07-10 20:36:20,583][26022] Updated weights on worker 0-0, policy_version 881803 (0.00087) [2022-07-10 20:36:22,685][26022] Updated weights on worker 0-0, policy_version 881813 (0.00091) [2022-07-10 20:36:24,353][26022] Updated weights on worker 0-0, policy_version 881823 (0.00085) [2022-07-10 20:36:25,305][25689] Fps is (10 sec: 5555.7, 60 sec: 5518.8, 300 sec: 5522.8). Total num frames: 902990848. Throughput: 0: 5754.3. Samples: 902992676. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:25,306][25689] Avg episode reward: [(0, '0.215')] [2022-07-10 20:36:26,367][26022] Updated weights on worker 0-0, policy_version 881833 (0.00093) [2022-07-10 20:36:28,025][26022] Updated weights on worker 0-0, policy_version 881843 (0.00090) [2022-07-10 20:36:29,967][26022] Updated weights on worker 0-0, policy_version 881853 (0.00094) [2022-07-10 20:36:30,322][25689] Fps is (10 sec: 5577.7, 60 sec: 5526.7, 300 sec: 5519.3). Total num frames: 903019520. Throughput: 0: 5755.8. Samples: 903025980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:30,323][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 20:36:31,812][26022] Updated weights on worker 0-0, policy_version 881863 (0.00084) [2022-07-10 20:36:33,576][26022] Updated weights on worker 0-0, policy_version 881873 (0.00088) [2022-07-10 20:36:35,351][25689] Fps is (10 sec: 5504.4, 60 sec: 5478.8, 300 sec: 5512.0). Total num frames: 903046144. Throughput: 0: 4949.1. Samples: 903042454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 20:36:35,351][25689] Avg episode reward: [(0, '-1.343')] [2022-07-10 20:36:35,664][26022] Updated weights on worker 0-0, policy_version 881883 (0.00085) [2022-07-10 20:36:37,418][26022] Updated weights on worker 0-0, policy_version 881893 (0.00086) [2022-07-10 20:36:39,366][26022] Updated weights on worker 0-0, policy_version 881903 (0.00087) [2022-07-10 20:36:40,368][25689] Fps is (10 sec: 5504.0, 60 sec: 5528.2, 300 sec: 5520.7). Total num frames: 903074816. Throughput: 0: 5780.4. Samples: 903075628. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:36:40,369][25689] Avg episode reward: [(0, '-1.019')] [2022-07-10 20:36:41,136][26022] Updated weights on worker 0-0, policy_version 881913 (0.00090) [2022-07-10 20:36:42,992][26022] Updated weights on worker 0-0, policy_version 881923 (0.00088) [2022-07-10 20:36:44,701][26022] Updated weights on worker 0-0, policy_version 881933 (0.00093) [2022-07-10 20:36:45,472][25689] Fps is (10 sec: 5463.3, 60 sec: 5456.5, 300 sec: 5513.3). Total num frames: 903101440. Throughput: 0: 5763.7. Samples: 903108912. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:36:45,473][25689] Avg episode reward: [(0, '-1.025')] [2022-07-10 20:36:46,654][26022] Updated weights on worker 0-0, policy_version 881943 (0.00083) [2022-07-10 20:36:48,564][26022] Updated weights on worker 0-0, policy_version 881953 (0.00089) [2022-07-10 20:36:50,273][26022] Updated weights on worker 0-0, policy_version 881963 (0.00085) [2022-07-10 20:36:50,551][25689] Fps is (10 sec: 5530.8, 60 sec: 5501.4, 300 sec: 5519.8). Total num frames: 903131136. Throughput: 0: 4924.4. Samples: 903125598. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:36:50,552][25689] Avg episode reward: [(0, '-0.611')] [2022-07-10 20:36:52,238][26022] Updated weights on worker 0-0, policy_version 881973 (0.00092) [2022-07-10 20:36:54,204][26022] Updated weights on worker 0-0, policy_version 881983 (0.00084) [2022-07-10 20:36:55,614][25689] Fps is (10 sec: 5755.2, 60 sec: 5513.6, 300 sec: 5518.8). Total num frames: 903159808. Throughput: 0: 5748.5. Samples: 903158936. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:36:55,614][25689] Avg episode reward: [(0, '0.061')] [2022-07-10 20:36:55,737][26022] Updated weights on worker 0-0, policy_version 881993 (0.00093) [2022-07-10 20:36:57,815][26022] Updated weights on worker 0-0, policy_version 882003 (0.00088) [2022-07-10 20:36:59,395][26022] Updated weights on worker 0-0, policy_version 882013 (0.00080) [2022-07-10 20:37:00,661][25689] Fps is (10 sec: 5368.3, 60 sec: 5494.6, 300 sec: 5519.1). Total num frames: 903185408. Throughput: 0: 5746.1. Samples: 903192232. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:00,662][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 20:37:01,694][26022] Updated weights on worker 0-0, policy_version 882023 (0.00089) [2022-07-10 20:37:03,641][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:37:03,656][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000882033_903201792.pth [2022-07-10 20:37:03,656][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000880091_901213184.pth [2022-07-10 20:37:03,665][26022] Updated weights on worker 0-0, policy_version 882033 (0.00081) [2022-07-10 20:37:05,548][26022] Updated weights on worker 0-0, policy_version 882043 (0.00051) [2022-07-10 20:37:05,727][25689] Fps is (10 sec: 5265.3, 60 sec: 5499.1, 300 sec: 5514.9). Total num frames: 903213056. Throughput: 0: 4807.6. Samples: 903206290. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:05,727][25689] Avg episode reward: [(0, '0.754')] [2022-07-10 20:37:07,639][26022] Updated weights on worker 0-0, policy_version 882053 (0.00079) [2022-07-10 20:37:09,401][26022] Updated weights on worker 0-0, policy_version 882063 (0.00085) [2022-07-10 20:37:10,741][25689] Fps is (10 sec: 5384.0, 60 sec: 5502.6, 300 sec: 5514.9). Total num frames: 903239680. Throughput: 0: 5642.0. Samples: 903239510. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:10,742][25689] Avg episode reward: [(0, '0.418')] [2022-07-10 20:37:11,135][26022] Updated weights on worker 0-0, policy_version 882073 (0.00083) [2022-07-10 20:37:13,008][26022] Updated weights on worker 0-0, policy_version 882083 (0.00094) [2022-07-10 20:37:14,945][26022] Updated weights on worker 0-0, policy_version 882093 (0.00095) [2022-07-10 20:37:15,775][25689] Fps is (10 sec: 5401.4, 60 sec: 5491.1, 300 sec: 5517.8). Total num frames: 903267328. Throughput: 0: 5653.9. Samples: 903272924. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:15,775][25689] Avg episode reward: [(0, '-0.038')] [2022-07-10 20:37:16,550][26022] Updated weights on worker 0-0, policy_version 882103 (0.00094) [2022-07-10 20:37:18,679][26022] Updated weights on worker 0-0, policy_version 882113 (0.00093) [2022-07-10 20:37:20,365][26022] Updated weights on worker 0-0, policy_version 882123 (0.00088) [2022-07-10 20:37:20,795][25689] Fps is (10 sec: 5602.1, 60 sec: 5494.5, 300 sec: 5511.8). Total num frames: 903296000. Throughput: 0: 4839.1. Samples: 903289660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:20,795][25689] Avg episode reward: [(0, '0.132')] [2022-07-10 20:37:22,234][26022] Updated weights on worker 0-0, policy_version 882133 (0.00067) [2022-07-10 20:37:24,059][26022] Updated weights on worker 0-0, policy_version 882143 (0.00085) [2022-07-10 20:37:25,878][25689] Fps is (10 sec: 5675.8, 60 sec: 5511.1, 300 sec: 5517.5). Total num frames: 903324672. Throughput: 0: 5795.4. Samples: 903323074. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:25,879][25689] Avg episode reward: [(0, '-0.190')] [2022-07-10 20:37:25,887][26022] Updated weights on worker 0-0, policy_version 882153 (0.00086) [2022-07-10 20:37:27,664][26022] Updated weights on worker 0-0, policy_version 882163 (0.00089) [2022-07-10 20:37:29,549][26022] Updated weights on worker 0-0, policy_version 882173 (0.00089) [2022-07-10 20:37:30,946][25689] Fps is (10 sec: 5447.4, 60 sec: 5472.7, 300 sec: 5513.6). Total num frames: 903351296. Throughput: 0: 5789.5. Samples: 903356482. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:30,946][25689] Avg episode reward: [(0, '-0.334')] [2022-07-10 20:37:31,380][26022] Updated weights on worker 0-0, policy_version 882183 (0.00079) [2022-07-10 20:37:33,441][26022] Updated weights on worker 0-0, policy_version 882193 (0.00087) [2022-07-10 20:37:34,966][26022] Updated weights on worker 0-0, policy_version 882203 (0.00100) [2022-07-10 20:37:35,950][25689] Fps is (10 sec: 5489.9, 60 sec: 5508.7, 300 sec: 5510.1). Total num frames: 903379968. Throughput: 0: 4968.0. Samples: 903373156. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:35,951][25689] Avg episode reward: [(0, '-1.399')] [2022-07-10 20:37:36,903][26022] Updated weights on worker 0-0, policy_version 882213 (0.00089) [2022-07-10 20:37:38,687][26022] Updated weights on worker 0-0, policy_version 882223 (0.00088) [2022-07-10 20:37:40,520][26022] Updated weights on worker 0-0, policy_version 882233 (0.00094) [2022-07-10 20:37:40,963][25689] Fps is (10 sec: 5622.3, 60 sec: 5492.3, 300 sec: 5516.0). Total num frames: 903407616. Throughput: 0: 5810.7. Samples: 903406850. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:40,964][25689] Avg episode reward: [(0, '-1.403')] [2022-07-10 20:37:42,556][26022] Updated weights on worker 0-0, policy_version 882243 (0.00082) [2022-07-10 20:37:44,331][26022] Updated weights on worker 0-0, policy_version 882253 (0.00086) [2022-07-10 20:37:45,936][26022] Updated weights on worker 0-0, policy_version 882263 (0.00083) [2022-07-10 20:37:46,035][25689] Fps is (10 sec: 5686.4, 60 sec: 5545.9, 300 sec: 5518.2). Total num frames: 903437312. Throughput: 0: 5810.8. Samples: 903440200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:46,036][25689] Avg episode reward: [(0, '-2.357')] [2022-07-10 20:37:48,143][26022] Updated weights on worker 0-0, policy_version 882273 (0.00091) [2022-07-10 20:37:49,712][26022] Updated weights on worker 0-0, policy_version 882283 (0.00093) [2022-07-10 20:37:51,051][25689] Fps is (10 sec: 5582.6, 60 sec: 5500.8, 300 sec: 5509.1). Total num frames: 903463936. Throughput: 0: 5833.1. Samples: 903473760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:51,052][25689] Avg episode reward: [(0, '-2.433')] [2022-07-10 20:37:51,754][26022] Updated weights on worker 0-0, policy_version 882293 (0.00097) [2022-07-10 20:37:53,501][26022] Updated weights on worker 0-0, policy_version 882303 (0.00081) [2022-07-10 20:37:55,279][26022] Updated weights on worker 0-0, policy_version 882313 (0.00091) [2022-07-10 20:37:56,065][25689] Fps is (10 sec: 5614.9, 60 sec: 5522.2, 300 sec: 5520.1). Total num frames: 903493632. Throughput: 0: 5841.4. Samples: 903490654. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:37:56,066][25689] Avg episode reward: [(0, '-2.840')] [2022-07-10 20:37:57,154][26022] Updated weights on worker 0-0, policy_version 882323 (0.00084) [2022-07-10 20:37:58,973][26022] Updated weights on worker 0-0, policy_version 882333 (0.00087) [2022-07-10 20:38:00,869][26022] Updated weights on worker 0-0, policy_version 882343 (0.00090) [2022-07-10 20:38:01,124][25689] Fps is (10 sec: 5693.2, 60 sec: 5555.0, 300 sec: 5519.8). Total num frames: 903521280. Throughput: 0: 5814.2. Samples: 903524070. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:01,124][25689] Avg episode reward: [(0, '-1.735')] [2022-07-10 20:38:03,090][26022] Updated weights on worker 0-0, policy_version 882353 (0.00108) [2022-07-10 20:38:04,965][26022] Updated weights on worker 0-0, policy_version 882363 (0.00087) [2022-07-10 20:38:06,197][25689] Fps is (10 sec: 5154.3, 60 sec: 5503.5, 300 sec: 5508.7). Total num frames: 903545856. Throughput: 0: 5700.1. Samples: 903555128. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:06,198][25689] Avg episode reward: [(0, '-1.657')] [2022-07-10 20:38:06,808][26022] Updated weights on worker 0-0, policy_version 882373 (0.00084) [2022-07-10 20:38:08,677][26022] Updated weights on worker 0-0, policy_version 882383 (0.00092) [2022-07-10 20:38:10,599][26022] Updated weights on worker 0-0, policy_version 882393 (0.00087) [2022-07-10 20:38:11,200][25689] Fps is (10 sec: 5284.4, 60 sec: 5538.4, 300 sec: 5512.7). Total num frames: 903574528. Throughput: 0: 4862.3. Samples: 903571730. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:11,202][25689] Avg episode reward: [(0, '-1.058')] [2022-07-10 20:38:12,453][26022] Updated weights on worker 0-0, policy_version 882403 (0.00094) [2022-07-10 20:38:13,965][26022] Updated weights on worker 0-0, policy_version 882413 (0.00089) [2022-07-10 20:38:16,204][26022] Updated weights on worker 0-0, policy_version 882423 (0.00091) [2022-07-10 20:38:16,283][25689] Fps is (10 sec: 5482.6, 60 sec: 5517.0, 300 sec: 5508.6). Total num frames: 903601152. Throughput: 0: 5657.8. Samples: 903605042. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:16,283][25689] Avg episode reward: [(0, '-0.399')] [2022-07-10 20:38:17,890][26022] Updated weights on worker 0-0, policy_version 882433 (0.00100) [2022-07-10 20:38:19,921][26022] Updated weights on worker 0-0, policy_version 882443 (0.00097) [2022-07-10 20:38:21,363][25689] Fps is (10 sec: 5541.8, 60 sec: 5528.4, 300 sec: 5511.4). Total num frames: 903630848. Throughput: 0: 5663.7. Samples: 903638698. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:21,365][25689] Avg episode reward: [(0, '-1.247')] [2022-07-10 20:38:21,427][26022] Updated weights on worker 0-0, policy_version 882453 (0.00981) [2022-07-10 20:38:23,426][26022] Updated weights on worker 0-0, policy_version 882463 (0.00091) [2022-07-10 20:38:25,058][26022] Updated weights on worker 0-0, policy_version 882473 (0.00096) [2022-07-10 20:38:26,419][25689] Fps is (10 sec: 5657.5, 60 sec: 5514.0, 300 sec: 5514.0). Total num frames: 903658496. Throughput: 0: 4960.2. Samples: 903655430. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:26,419][25689] Avg episode reward: [(0, '-0.792')] [2022-07-10 20:38:27,244][26022] Updated weights on worker 0-0, policy_version 882483 (0.00093) [2022-07-10 20:38:28,815][26022] Updated weights on worker 0-0, policy_version 882493 (0.00081) [2022-07-10 20:38:30,900][26022] Updated weights on worker 0-0, policy_version 882503 (0.00086) [2022-07-10 20:38:31,439][25689] Fps is (10 sec: 5488.0, 60 sec: 5535.3, 300 sec: 5514.4). Total num frames: 903686144. Throughput: 0: 5773.4. Samples: 903688576. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:31,439][25689] Avg episode reward: [(0, '-0.936')] [2022-07-10 20:38:32,749][26022] Updated weights on worker 0-0, policy_version 882513 (0.00280) [2022-07-10 20:38:34,460][26022] Updated weights on worker 0-0, policy_version 882523 (0.00090) [2022-07-10 20:38:36,361][26022] Updated weights on worker 0-0, policy_version 882533 (0.00090) [2022-07-10 20:38:36,459][25689] Fps is (10 sec: 5507.5, 60 sec: 5517.0, 300 sec: 5511.0). Total num frames: 903713792. Throughput: 0: 5781.7. Samples: 903721694. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:36,459][25689] Avg episode reward: [(0, '-2.124')] [2022-07-10 20:38:38,108][26022] Updated weights on worker 0-0, policy_version 882543 (0.00088) [2022-07-10 20:38:40,228][26022] Updated weights on worker 0-0, policy_version 882553 (0.00091) [2022-07-10 20:38:41,471][25689] Fps is (10 sec: 5613.9, 60 sec: 5533.9, 300 sec: 5514.8). Total num frames: 903742464. Throughput: 0: 4948.9. Samples: 903738212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:41,471][25689] Avg episode reward: [(0, '-1.851')] [2022-07-10 20:38:41,965][26022] Updated weights on worker 0-0, policy_version 882563 (0.00084) [2022-07-10 20:38:43,771][26022] Updated weights on worker 0-0, policy_version 882573 (0.00158) [2022-07-10 20:38:45,562][26022] Updated weights on worker 0-0, policy_version 882583 (0.00094) [2022-07-10 20:38:46,510][25689] Fps is (10 sec: 5603.3, 60 sec: 5503.1, 300 sec: 5507.3). Total num frames: 903770112. Throughput: 0: 5792.5. Samples: 903771810. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:46,511][25689] Avg episode reward: [(0, '-0.072')] [2022-07-10 20:38:47,487][26022] Updated weights on worker 0-0, policy_version 882593 (0.00095) [2022-07-10 20:38:49,260][26022] Updated weights on worker 0-0, policy_version 882603 (0.00088) [2022-07-10 20:38:51,412][26022] Updated weights on worker 0-0, policy_version 882613 (0.00089) [2022-07-10 20:38:51,518][25689] Fps is (10 sec: 5401.6, 60 sec: 5503.8, 300 sec: 5510.9). Total num frames: 903796736. Throughput: 0: 5799.4. Samples: 903805026. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:51,519][25689] Avg episode reward: [(0, '-0.050')] [2022-07-10 20:38:52,793][26022] Updated weights on worker 0-0, policy_version 882623 (0.00095) [2022-07-10 20:38:54,933][26022] Updated weights on worker 0-0, policy_version 882633 (0.00087) [2022-07-10 20:38:56,537][25689] Fps is (10 sec: 5514.8, 60 sec: 5486.4, 300 sec: 5510.7). Total num frames: 903825408. Throughput: 0: 4980.3. Samples: 903821690. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:38:56,538][25689] Avg episode reward: [(0, '-0.022')] [2022-07-10 20:38:56,727][26022] Updated weights on worker 0-0, policy_version 882643 (0.00087) [2022-07-10 20:38:58,472][26022] Updated weights on worker 0-0, policy_version 882653 (0.01031) [2022-07-10 20:39:00,542][26022] Updated weights on worker 0-0, policy_version 882663 (0.00093) [2022-07-10 20:39:01,543][25689] Fps is (10 sec: 5414.2, 60 sec: 5457.4, 300 sec: 5511.3). Total num frames: 903851008. Throughput: 0: 5820.5. Samples: 903855038. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:01,551][25689] Avg episode reward: [(0, '-0.156')] [2022-07-10 20:39:02,499][26022] Updated weights on worker 0-0, policy_version 882673 (0.00088) [2022-07-10 20:39:03,787][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:39:03,802][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000882679_903863296.pth [2022-07-10 20:39:03,802][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000880740_901877760.pth [2022-07-10 20:39:04,422][26022] Updated weights on worker 0-0, policy_version 882683 (0.00095) [2022-07-10 20:39:06,386][26022] Updated weights on worker 0-0, policy_version 882693 (0.00082) [2022-07-10 20:39:06,640][25689] Fps is (10 sec: 5270.6, 60 sec: 5506.0, 300 sec: 5503.2). Total num frames: 903878656. Throughput: 0: 5667.9. Samples: 903885902. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:06,642][25689] Avg episode reward: [(0, '1.276')] [2022-07-10 20:39:08,052][26022] Updated weights on worker 0-0, policy_version 882703 (0.00089) [2022-07-10 20:39:10,213][26022] Updated weights on worker 0-0, policy_version 882713 (0.00096) [2022-07-10 20:39:11,673][25689] Fps is (10 sec: 5559.7, 60 sec: 5503.3, 300 sec: 5510.0). Total num frames: 903907328. Throughput: 0: 4843.6. Samples: 903902646. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:11,673][25689] Avg episode reward: [(0, '1.385')] [2022-07-10 20:39:11,961][26022] Updated weights on worker 0-0, policy_version 882723 (0.00098) [2022-07-10 20:39:13,661][26022] Updated weights on worker 0-0, policy_version 882733 (0.00087) [2022-07-10 20:39:15,415][26022] Updated weights on worker 0-0, policy_version 882743 (0.00089) [2022-07-10 20:39:16,678][25689] Fps is (10 sec: 5712.4, 60 sec: 5544.3, 300 sec: 5513.7). Total num frames: 903936000. Throughput: 0: 5686.6. Samples: 903936226. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:16,679][25689] Avg episode reward: [(0, '1.479')] [2022-07-10 20:39:17,218][26022] Updated weights on worker 0-0, policy_version 882753 (0.00086) [2022-07-10 20:39:19,304][26022] Updated weights on worker 0-0, policy_version 882763 (0.00095) [2022-07-10 20:39:21,078][26022] Updated weights on worker 0-0, policy_version 882773 (0.00092) [2022-07-10 20:39:21,695][25689] Fps is (10 sec: 5415.2, 60 sec: 5482.2, 300 sec: 5503.8). Total num frames: 903961600. Throughput: 0: 5689.0. Samples: 903969684. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:21,696][25689] Avg episode reward: [(0, '0.449')] [2022-07-10 20:39:22,840][26022] Updated weights on worker 0-0, policy_version 882783 (0.00090) [2022-07-10 20:39:24,921][26022] Updated weights on worker 0-0, policy_version 882793 (0.00082) [2022-07-10 20:39:26,666][26022] Updated weights on worker 0-0, policy_version 882803 (0.00087) [2022-07-10 20:39:26,795][25689] Fps is (10 sec: 5364.7, 60 sec: 5495.2, 300 sec: 5505.7). Total num frames: 903990272. Throughput: 0: 4977.1. Samples: 903986216. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:26,795][25689] Avg episode reward: [(0, '0.540')] [2022-07-10 20:39:28,522][26022] Updated weights on worker 0-0, policy_version 882813 (0.00086) [2022-07-10 20:39:30,193][26022] Updated weights on worker 0-0, policy_version 882823 (0.00091) [2022-07-10 20:39:31,811][25689] Fps is (10 sec: 5770.0, 60 sec: 5529.5, 300 sec: 5512.4). Total num frames: 904019968. Throughput: 0: 5809.3. Samples: 904019632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:31,811][25689] Avg episode reward: [(0, '0.196')] [2022-07-10 20:39:31,905][26022] Updated weights on worker 0-0, policy_version 882833 (0.00054) [2022-07-10 20:39:34,287][26022] Updated weights on worker 0-0, policy_version 882843 (0.00082) [2022-07-10 20:39:35,630][26022] Updated weights on worker 0-0, policy_version 882853 (0.00096) [2022-07-10 20:39:36,827][25689] Fps is (10 sec: 5613.9, 60 sec: 5512.8, 300 sec: 5515.9). Total num frames: 904046592. Throughput: 0: 5815.3. Samples: 904053398. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:36,829][25689] Avg episode reward: [(0, '0.154')] [2022-07-10 20:39:37,566][26022] Updated weights on worker 0-0, policy_version 882863 (0.00093) [2022-07-10 20:39:39,215][26022] Updated weights on worker 0-0, policy_version 882873 (0.00096) [2022-07-10 20:39:41,231][26022] Updated weights on worker 0-0, policy_version 882883 (0.00090) [2022-07-10 20:39:41,869][25689] Fps is (10 sec: 5395.7, 60 sec: 5493.2, 300 sec: 5509.9). Total num frames: 904074240. Throughput: 0: 4984.3. Samples: 904070238. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:41,869][25689] Avg episode reward: [(0, '-0.527')] [2022-07-10 20:39:43,158][26022] Updated weights on worker 0-0, policy_version 882893 (0.00087) [2022-07-10 20:39:45,063][26022] Updated weights on worker 0-0, policy_version 882903 (0.00086) [2022-07-10 20:39:46,811][26022] Updated weights on worker 0-0, policy_version 882913 (0.00086) [2022-07-10 20:39:46,944][25689] Fps is (10 sec: 5769.3, 60 sec: 5540.7, 300 sec: 5519.0). Total num frames: 904104960. Throughput: 0: 5831.4. Samples: 904103714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:46,945][25689] Avg episode reward: [(0, '-1.937')] [2022-07-10 20:39:48,738][26022] Updated weights on worker 0-0, policy_version 882923 (0.00089) [2022-07-10 20:39:50,300][26022] Updated weights on worker 0-0, policy_version 882933 (0.00090) [2022-07-10 20:39:52,003][25689] Fps is (10 sec: 5557.6, 60 sec: 5519.2, 300 sec: 5512.2). Total num frames: 904130560. Throughput: 0: 5830.3. Samples: 904137360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:52,003][25689] Avg episode reward: [(0, '-1.221')] [2022-07-10 20:39:52,335][26022] Updated weights on worker 0-0, policy_version 882943 (0.00093) [2022-07-10 20:39:54,075][26022] Updated weights on worker 0-0, policy_version 882953 (0.00093) [2022-07-10 20:39:55,890][26022] Updated weights on worker 0-0, policy_version 882963 (0.00096) [2022-07-10 20:39:57,069][25689] Fps is (10 sec: 5461.3, 60 sec: 5531.7, 300 sec: 5515.4). Total num frames: 904160256. Throughput: 0: 4982.9. Samples: 904154262. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:39:57,070][25689] Avg episode reward: [(0, '-2.160')] [2022-07-10 20:39:57,998][26022] Updated weights on worker 0-0, policy_version 882973 (0.00084) [2022-07-10 20:39:59,541][26022] Updated weights on worker 0-0, policy_version 882983 (0.00081) [2022-07-10 20:40:01,576][26022] Updated weights on worker 0-0, policy_version 882993 (0.00092) [2022-07-10 20:40:02,158][25689] Fps is (10 sec: 5546.0, 60 sec: 5541.0, 300 sec: 5514.9). Total num frames: 904186880. Throughput: 0: 5772.1. Samples: 904187348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:40:02,158][25689] Avg episode reward: [(0, '-1.742')] [2022-07-10 20:40:03,622][26022] Updated weights on worker 0-0, policy_version 883003 (0.00083) [2022-07-10 20:40:05,474][26022] Updated weights on worker 0-0, policy_version 883013 (0.00098) [2022-07-10 20:40:07,264][25689] Fps is (10 sec: 5223.2, 60 sec: 5523.4, 300 sec: 5510.1). Total num frames: 904213504. Throughput: 0: 5654.3. Samples: 904218606. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:40:07,264][25689] Avg episode reward: [(0, '-1.849')] [2022-07-10 20:40:07,519][26022] Updated weights on worker 0-0, policy_version 883023 (0.00090) [2022-07-10 20:40:09,349][26022] Updated weights on worker 0-0, policy_version 883033 (0.00088) [2022-07-10 20:40:11,156][26022] Updated weights on worker 0-0, policy_version 883043 (0.00140) [2022-07-10 20:40:12,266][25689] Fps is (10 sec: 5369.1, 60 sec: 5509.2, 300 sec: 5511.7). Total num frames: 904241152. Throughput: 0: 4836.0. Samples: 904235352. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:40:12,267][25689] Avg episode reward: [(0, '-1.645')] [2022-07-10 20:40:13,104][26022] Updated weights on worker 0-0, policy_version 883053 (0.00091) [2022-07-10 20:40:14,808][26022] Updated weights on worker 0-0, policy_version 883063 (0.00089) [2022-07-10 20:40:16,696][26022] Updated weights on worker 0-0, policy_version 883073 (0.00108) [2022-07-10 20:40:17,323][25689] Fps is (10 sec: 5700.8, 60 sec: 5521.5, 300 sec: 5512.0). Total num frames: 904270848. Throughput: 0: 5648.1. Samples: 904268656. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 20:40:17,323][25689] Avg episode reward: [(0, '-1.347')] [2022-07-10 20:40:18,641][26022] Updated weights on worker 0-0, policy_version 883083 (0.00084) [2022-07-10 20:40:20,303][26022] Updated weights on worker 0-0, policy_version 883093 (0.00096) [2022-07-10 20:40:22,241][26022] Updated weights on worker 0-0, policy_version 883103 (0.00086) [2022-07-10 20:40:22,327][25689] Fps is (10 sec: 5598.0, 60 sec: 5539.5, 300 sec: 5513.9). Total num frames: 904297472. Throughput: 0: 5695.1. Samples: 904302212. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:22,327][25689] Avg episode reward: [(0, '-1.130')] [2022-07-10 20:40:23,930][26022] Updated weights on worker 0-0, policy_version 883113 (0.00094) [2022-07-10 20:40:25,818][26022] Updated weights on worker 0-0, policy_version 883123 (0.00095) [2022-07-10 20:40:27,416][25689] Fps is (10 sec: 5478.5, 60 sec: 5540.5, 300 sec: 5514.2). Total num frames: 904326144. Throughput: 0: 5805.7. Samples: 904335604. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:27,416][25689] Avg episode reward: [(0, '-0.815')] [2022-07-10 20:40:27,667][26022] Updated weights on worker 0-0, policy_version 883133 (0.00086) [2022-07-10 20:40:29,431][26022] Updated weights on worker 0-0, policy_version 883143 (0.00087) [2022-07-10 20:40:31,378][26022] Updated weights on worker 0-0, policy_version 883153 (0.00084) [2022-07-10 20:40:32,431][25689] Fps is (10 sec: 5675.3, 60 sec: 5523.7, 300 sec: 5511.6). Total num frames: 904354816. Throughput: 0: 5792.9. Samples: 904352164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:32,431][25689] Avg episode reward: [(0, '-0.651')] [2022-07-10 20:40:33,319][26022] Updated weights on worker 0-0, policy_version 883163 (0.00089) [2022-07-10 20:40:34,920][26022] Updated weights on worker 0-0, policy_version 883173 (0.00098) [2022-07-10 20:40:37,016][26022] Updated weights on worker 0-0, policy_version 883183 (0.00083) [2022-07-10 20:40:37,451][25689] Fps is (10 sec: 5510.4, 60 sec: 5523.4, 300 sec: 5514.7). Total num frames: 904381440. Throughput: 0: 5832.7. Samples: 904386056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:37,451][25689] Avg episode reward: [(0, '-1.755')] [2022-07-10 20:40:38,374][26022] Updated weights on worker 0-0, policy_version 883193 (0.00101) [2022-07-10 20:40:40,774][26022] Updated weights on worker 0-0, policy_version 883203 (0.00091) [2022-07-10 20:40:42,207][26022] Updated weights on worker 0-0, policy_version 883213 (0.00087) [2022-07-10 20:40:42,479][25689] Fps is (10 sec: 5503.3, 60 sec: 5541.6, 300 sec: 5508.4). Total num frames: 904410112. Throughput: 0: 5798.5. Samples: 904419062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:42,479][25689] Avg episode reward: [(0, '-1.189')] [2022-07-10 20:40:44,178][26022] Updated weights on worker 0-0, policy_version 883223 (0.00086) [2022-07-10 20:40:46,145][26022] Updated weights on worker 0-0, policy_version 883233 (0.00093) [2022-07-10 20:40:47,539][25689] Fps is (10 sec: 5684.4, 60 sec: 5509.1, 300 sec: 5514.5). Total num frames: 904438784. Throughput: 0: 4976.7. Samples: 904435748. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:47,539][25689] Avg episode reward: [(0, '-1.060')] [2022-07-10 20:40:47,967][26022] Updated weights on worker 0-0, policy_version 883243 (0.00057) [2022-07-10 20:40:49,894][26022] Updated weights on worker 0-0, policy_version 883253 (0.00092) [2022-07-10 20:40:51,602][26022] Updated weights on worker 0-0, policy_version 883263 (0.00092) [2022-07-10 20:40:52,582][25689] Fps is (10 sec: 5371.9, 60 sec: 5510.6, 300 sec: 5507.0). Total num frames: 904464384. Throughput: 0: 5792.5. Samples: 904468886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:52,582][25689] Avg episode reward: [(0, '-1.232')] [2022-07-10 20:40:53,350][26022] Updated weights on worker 0-0, policy_version 883273 (0.00095) [2022-07-10 20:40:55,361][26022] Updated weights on worker 0-0, policy_version 883283 (0.00092) [2022-07-10 20:40:57,201][26022] Updated weights on worker 0-0, policy_version 883293 (0.00083) [2022-07-10 20:40:57,585][25689] Fps is (10 sec: 5504.4, 60 sec: 5516.4, 300 sec: 5517.7). Total num frames: 904494080. Throughput: 0: 5765.5. Samples: 904502138. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:40:57,585][25689] Avg episode reward: [(0, '-1.537')] [2022-07-10 20:40:59,127][26022] Updated weights on worker 0-0, policy_version 883303 (0.00086) [2022-07-10 20:41:00,999][26022] Updated weights on worker 0-0, policy_version 883313 (0.00083) [2022-07-10 20:41:02,606][25689] Fps is (10 sec: 5515.9, 60 sec: 5505.5, 300 sec: 5512.5). Total num frames: 904519680. Throughput: 0: 4943.3. Samples: 904518558. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:02,607][25689] Avg episode reward: [(0, '-1.449')] [2022-07-10 20:41:03,077][26022] Updated weights on worker 0-0, policy_version 883323 (0.00093) [2022-07-10 20:41:03,974][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:41:03,983][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000883326_904525824.pth [2022-07-10 20:41:03,984][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000881386_902539264.pth [2022-07-10 20:41:05,194][26022] Updated weights on worker 0-0, policy_version 883333 (0.00091) [2022-07-10 20:41:06,710][26022] Updated weights on worker 0-0, policy_version 883343 (0.00083) [2022-07-10 20:41:07,659][25689] Fps is (10 sec: 5183.9, 60 sec: 5510.4, 300 sec: 5512.5). Total num frames: 904546304. Throughput: 0: 5674.8. Samples: 904549926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:07,659][25689] Avg episode reward: [(0, '-0.312')] [2022-07-10 20:41:08,806][26022] Updated weights on worker 0-0, policy_version 883353 (0.00102) [2022-07-10 20:41:10,515][26022] Updated weights on worker 0-0, policy_version 883363 (0.00077) [2022-07-10 20:41:12,459][26022] Updated weights on worker 0-0, policy_version 883373 (0.00085) [2022-07-10 20:41:12,718][25689] Fps is (10 sec: 5468.4, 60 sec: 5522.1, 300 sec: 5513.2). Total num frames: 904574976. Throughput: 0: 5666.6. Samples: 904582992. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:12,719][25689] Avg episode reward: [(0, '-0.216')] [2022-07-10 20:41:14,347][26022] Updated weights on worker 0-0, policy_version 883383 (0.00085) [2022-07-10 20:41:16,264][26022] Updated weights on worker 0-0, policy_version 883393 (0.00098) [2022-07-10 20:41:17,761][25689] Fps is (10 sec: 5575.1, 60 sec: 5489.5, 300 sec: 5510.0). Total num frames: 904602624. Throughput: 0: 4817.6. Samples: 904599340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:17,764][25689] Avg episode reward: [(0, '0.291')] [2022-07-10 20:41:18,066][26022] Updated weights on worker 0-0, policy_version 883403 (0.00093) [2022-07-10 20:41:20,104][26022] Updated weights on worker 0-0, policy_version 883413 (0.00095) [2022-07-10 20:41:21,654][26022] Updated weights on worker 0-0, policy_version 883423 (0.00088) [2022-07-10 20:41:22,803][25689] Fps is (10 sec: 5483.2, 60 sec: 5503.0, 300 sec: 5510.7). Total num frames: 904630272. Throughput: 0: 5642.3. Samples: 904632512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:22,805][25689] Avg episode reward: [(0, '-0.924')] [2022-07-10 20:41:23,725][26022] Updated weights on worker 0-0, policy_version 883433 (0.00085) [2022-07-10 20:41:25,329][26022] Updated weights on worker 0-0, policy_version 883443 (0.00088) [2022-07-10 20:41:27,387][26022] Updated weights on worker 0-0, policy_version 883453 (0.00087) [2022-07-10 20:41:27,867][25689] Fps is (10 sec: 5572.9, 60 sec: 5505.3, 300 sec: 5509.8). Total num frames: 904658944. Throughput: 0: 5741.4. Samples: 904665948. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:27,867][25689] Avg episode reward: [(0, '-0.552')] [2022-07-10 20:41:29,174][26022] Updated weights on worker 0-0, policy_version 883463 (0.00097) [2022-07-10 20:41:30,955][26022] Updated weights on worker 0-0, policy_version 883473 (0.00086) [2022-07-10 20:41:32,870][25689] Fps is (10 sec: 5492.6, 60 sec: 5472.5, 300 sec: 5510.3). Total num frames: 904685568. Throughput: 0: 4949.3. Samples: 904682730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:32,871][25689] Avg episode reward: [(0, '-0.630')] [2022-07-10 20:41:32,954][26022] Updated weights on worker 0-0, policy_version 883483 (0.00088) [2022-07-10 20:41:34,552][26022] Updated weights on worker 0-0, policy_version 883493 (0.00430) [2022-07-10 20:41:36,613][26022] Updated weights on worker 0-0, policy_version 883503 (0.00095) [2022-07-10 20:41:37,885][25689] Fps is (10 sec: 5622.0, 60 sec: 5523.8, 300 sec: 5513.8). Total num frames: 904715264. Throughput: 0: 5808.6. Samples: 904716232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:37,887][25689] Avg episode reward: [(0, '-0.449')] [2022-07-10 20:41:38,396][26022] Updated weights on worker 0-0, policy_version 883513 (0.00095) [2022-07-10 20:41:40,243][26022] Updated weights on worker 0-0, policy_version 883523 (0.00085) [2022-07-10 20:41:41,998][26022] Updated weights on worker 0-0, policy_version 883533 (0.00081) [2022-07-10 20:41:42,897][25689] Fps is (10 sec: 5617.0, 60 sec: 5491.3, 300 sec: 5515.5). Total num frames: 904741888. Throughput: 0: 5814.3. Samples: 904749344. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:42,898][25689] Avg episode reward: [(0, '-0.498')] [2022-07-10 20:41:43,891][26022] Updated weights on worker 0-0, policy_version 883543 (0.00085) [2022-07-10 20:41:45,792][26022] Updated weights on worker 0-0, policy_version 883553 (0.00090) [2022-07-10 20:41:47,514][26022] Updated weights on worker 0-0, policy_version 883563 (0.00090) [2022-07-10 20:41:47,962][25689] Fps is (10 sec: 5487.0, 60 sec: 5490.8, 300 sec: 5512.3). Total num frames: 904770560. Throughput: 0: 4982.5. Samples: 904766074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:47,964][25689] Avg episode reward: [(0, '-0.629')] [2022-07-10 20:41:49,411][26022] Updated weights on worker 0-0, policy_version 883573 (0.00094) [2022-07-10 20:41:51,259][26022] Updated weights on worker 0-0, policy_version 883583 (0.00083) [2022-07-10 20:41:52,986][25689] Fps is (10 sec: 5582.0, 60 sec: 5526.4, 300 sec: 5509.6). Total num frames: 904798208. Throughput: 0: 5816.2. Samples: 904799730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:52,988][25689] Avg episode reward: [(0, '0.732')] [2022-07-10 20:41:53,040][26022] Updated weights on worker 0-0, policy_version 883593 (0.00093) [2022-07-10 20:41:54,889][26022] Updated weights on worker 0-0, policy_version 883603 (0.00095) [2022-07-10 20:41:56,735][26022] Updated weights on worker 0-0, policy_version 883613 (0.00077) [2022-07-10 20:41:58,025][25689] Fps is (10 sec: 5495.4, 60 sec: 5489.3, 300 sec: 5516.6). Total num frames: 904825856. Throughput: 0: 5798.0. Samples: 904833004. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:41:58,031][25689] Avg episode reward: [(0, '0.373')] [2022-07-10 20:41:58,578][26022] Updated weights on worker 0-0, policy_version 883623 (0.00080) [2022-07-10 20:42:00,407][26022] Updated weights on worker 0-0, policy_version 883633 (0.00094) [2022-07-10 20:42:02,789][26022] Updated weights on worker 0-0, policy_version 883643 (0.00088) [2022-07-10 20:42:03,124][25689] Fps is (10 sec: 5252.7, 60 sec: 5482.3, 300 sec: 5509.1). Total num frames: 904851456. Throughput: 0: 4964.7. Samples: 904849764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:03,126][25689] Avg episode reward: [(0, '0.160')] [2022-07-10 20:42:04,447][26022] Updated weights on worker 0-0, policy_version 883653 (0.00087) [2022-07-10 20:42:06,446][26022] Updated weights on worker 0-0, policy_version 883663 (0.00088) [2022-07-10 20:42:08,160][26022] Updated weights on worker 0-0, policy_version 883673 (0.00092) [2022-07-10 20:42:08,256][25689] Fps is (10 sec: 5404.7, 60 sec: 5525.8, 300 sec: 5517.3). Total num frames: 904881152. Throughput: 0: 5677.6. Samples: 904881292. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:08,256][25689] Avg episode reward: [(0, '0.530')] [2022-07-10 20:42:09,980][26022] Updated weights on worker 0-0, policy_version 883683 (0.00096) [2022-07-10 20:42:11,908][26022] Updated weights on worker 0-0, policy_version 883693 (0.00071) [2022-07-10 20:42:13,276][25689] Fps is (10 sec: 5749.1, 60 sec: 5529.4, 300 sec: 5521.0). Total num frames: 904909824. Throughput: 0: 5668.4. Samples: 904914740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:13,277][25689] Avg episode reward: [(0, '0.526')] [2022-07-10 20:42:13,672][26022] Updated weights on worker 0-0, policy_version 883703 (0.00085) [2022-07-10 20:42:15,578][26022] Updated weights on worker 0-0, policy_version 883713 (0.00095) [2022-07-10 20:42:17,174][26022] Updated weights on worker 0-0, policy_version 883723 (0.00111) [2022-07-10 20:42:18,328][25689] Fps is (10 sec: 5591.4, 60 sec: 5528.5, 300 sec: 5516.9). Total num frames: 904937472. Throughput: 0: 4870.5. Samples: 904931892. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:18,329][25689] Avg episode reward: [(0, '0.623')] [2022-07-10 20:42:19,172][26022] Updated weights on worker 0-0, policy_version 883733 (0.00089) [2022-07-10 20:42:21,039][26022] Updated weights on worker 0-0, policy_version 883743 (0.00085) [2022-07-10 20:42:22,747][26022] Updated weights on worker 0-0, policy_version 883753 (0.00091) [2022-07-10 20:42:23,381][25689] Fps is (10 sec: 5573.7, 60 sec: 5544.5, 300 sec: 5517.5). Total num frames: 904966144. Throughput: 0: 5712.5. Samples: 904965482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:23,381][25689] Avg episode reward: [(0, '0.664')] [2022-07-10 20:42:24,627][26022] Updated weights on worker 0-0, policy_version 883763 (0.00087) [2022-07-10 20:42:26,615][26022] Updated weights on worker 0-0, policy_version 883773 (0.00092) [2022-07-10 20:42:28,327][26022] Updated weights on worker 0-0, policy_version 883783 (0.00088) [2022-07-10 20:42:28,454][25689] Fps is (10 sec: 5663.3, 60 sec: 5543.7, 300 sec: 5524.3). Total num frames: 904994816. Throughput: 0: 5815.7. Samples: 904998758. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:28,454][25689] Avg episode reward: [(0, '0.090')] [2022-07-10 20:42:30,267][26022] Updated weights on worker 0-0, policy_version 883793 (0.00095) [2022-07-10 20:42:32,016][26022] Updated weights on worker 0-0, policy_version 883803 (0.00091) [2022-07-10 20:42:33,487][25689] Fps is (10 sec: 5370.1, 60 sec: 5524.0, 300 sec: 5513.4). Total num frames: 905020416. Throughput: 0: 4980.6. Samples: 905015402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:33,488][25689] Avg episode reward: [(0, '0.399')] [2022-07-10 20:42:33,895][26022] Updated weights on worker 0-0, policy_version 883813 (0.00093) [2022-07-10 20:42:35,731][26022] Updated weights on worker 0-0, policy_version 883823 (0.00087) [2022-07-10 20:42:37,707][26022] Updated weights on worker 0-0, policy_version 883833 (0.00088) [2022-07-10 20:42:38,489][25689] Fps is (10 sec: 5510.3, 60 sec: 5525.2, 300 sec: 5520.5). Total num frames: 905050112. Throughput: 0: 5796.5. Samples: 905048752. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:38,489][25689] Avg episode reward: [(0, '0.565')] [2022-07-10 20:42:39,447][26022] Updated weights on worker 0-0, policy_version 883843 (0.00085) [2022-07-10 20:42:41,250][26022] Updated weights on worker 0-0, policy_version 883853 (0.00094) [2022-07-10 20:42:43,138][26022] Updated weights on worker 0-0, policy_version 883863 (0.00088) [2022-07-10 20:42:43,495][25689] Fps is (10 sec: 5730.2, 60 sec: 5542.7, 300 sec: 5514.9). Total num frames: 905077760. Throughput: 0: 5796.0. Samples: 905082060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:43,495][25689] Avg episode reward: [(0, '0.034')] [2022-07-10 20:42:45,075][26022] Updated weights on worker 0-0, policy_version 883873 (0.00089) [2022-07-10 20:42:46,754][26022] Updated weights on worker 0-0, policy_version 883883 (0.00088) [2022-07-10 20:42:48,566][25689] Fps is (10 sec: 5385.3, 60 sec: 5508.3, 300 sec: 5513.8). Total num frames: 905104384. Throughput: 0: 4969.4. Samples: 905098706. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:48,568][25689] Avg episode reward: [(0, '0.009')] [2022-07-10 20:42:48,947][26022] Updated weights on worker 0-0, policy_version 883893 (0.00095) [2022-07-10 20:42:50,287][26022] Updated weights on worker 0-0, policy_version 883903 (0.00100) [2022-07-10 20:42:52,458][26022] Updated weights on worker 0-0, policy_version 883913 (0.00093) [2022-07-10 20:42:53,595][25689] Fps is (10 sec: 5576.2, 60 sec: 5541.7, 300 sec: 5513.6). Total num frames: 905134080. Throughput: 0: 5807.4. Samples: 905132174. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:53,595][25689] Avg episode reward: [(0, '0.553')] [2022-07-10 20:42:54,082][26022] Updated weights on worker 0-0, policy_version 883923 (0.00082) [2022-07-10 20:42:55,939][26022] Updated weights on worker 0-0, policy_version 883933 (0.00094) [2022-07-10 20:42:57,855][26022] Updated weights on worker 0-0, policy_version 883943 (0.00095) [2022-07-10 20:42:58,611][25689] Fps is (10 sec: 5607.1, 60 sec: 5526.8, 300 sec: 5510.9). Total num frames: 905160704. Throughput: 0: 5813.5. Samples: 905165732. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:42:58,613][25689] Avg episode reward: [(0, '-0.009')] [2022-07-10 20:42:59,445][26022] Updated weights on worker 0-0, policy_version 883953 (0.00063) [2022-07-10 20:43:01,688][26022] Updated weights on worker 0-0, policy_version 883963 (0.00085) [2022-07-10 20:43:03,629][25689] Fps is (10 sec: 5306.4, 60 sec: 5551.1, 300 sec: 5518.8). Total num frames: 905187328. Throughput: 0: 5703.1. Samples: 905196890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:03,631][25689] Avg episode reward: [(0, '-0.174')] [2022-07-10 20:43:03,928][26022] Updated weights on worker 0-0, policy_version 883973 (0.00115) [2022-07-10 20:43:04,166][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:43:04,180][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000883974_905189376.pth [2022-07-10 20:43:04,180][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000882033_903201792.pth [2022-07-10 20:43:05,479][26022] Updated weights on worker 0-0, policy_version 883983 (0.00080) [2022-07-10 20:43:07,601][26022] Updated weights on worker 0-0, policy_version 883993 (0.01236) [2022-07-10 20:43:08,697][25689] Fps is (10 sec: 5482.1, 60 sec: 5540.0, 300 sec: 5517.6). Total num frames: 905216000. Throughput: 0: 5697.1. Samples: 905213394. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:08,698][25689] Avg episode reward: [(0, '-1.247')] [2022-07-10 20:43:09,252][26022] Updated weights on worker 0-0, policy_version 884003 (0.00092) [2022-07-10 20:43:11,250][26022] Updated weights on worker 0-0, policy_version 884013 (0.01011) [2022-07-10 20:43:12,975][26022] Updated weights on worker 0-0, policy_version 884023 (0.00086) [2022-07-10 20:43:13,778][25689] Fps is (10 sec: 5448.3, 60 sec: 5500.6, 300 sec: 5517.6). Total num frames: 905242624. Throughput: 0: 5663.2. Samples: 905246480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:13,779][25689] Avg episode reward: [(0, '-2.328')] [2022-07-10 20:43:14,790][26022] Updated weights on worker 0-0, policy_version 884033 (0.00088) [2022-07-10 20:43:16,747][26022] Updated weights on worker 0-0, policy_version 884043 (0.00085) [2022-07-10 20:43:18,631][26022] Updated weights on worker 0-0, policy_version 884053 (0.00087) [2022-07-10 20:43:18,821][25689] Fps is (10 sec: 5361.1, 60 sec: 5501.5, 300 sec: 5511.5). Total num frames: 905270272. Throughput: 0: 5639.0. Samples: 905279696. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:18,822][25689] Avg episode reward: [(0, '-1.338')] [2022-07-10 20:43:20,331][26022] Updated weights on worker 0-0, policy_version 884063 (0.00090) [2022-07-10 20:43:22,309][26022] Updated weights on worker 0-0, policy_version 884073 (0.00090) [2022-07-10 20:43:23,856][25689] Fps is (10 sec: 5690.3, 60 sec: 5520.0, 300 sec: 5518.7). Total num frames: 905299968. Throughput: 0: 4913.0. Samples: 905296264. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:23,857][25689] Avg episode reward: [(0, '-0.769')] [2022-07-10 20:43:23,989][26022] Updated weights on worker 0-0, policy_version 884083 (0.00092) [2022-07-10 20:43:26,213][26022] Updated weights on worker 0-0, policy_version 884093 (0.00082) [2022-07-10 20:43:27,935][26022] Updated weights on worker 0-0, policy_version 884103 (0.00096) [2022-07-10 20:43:28,938][25689] Fps is (10 sec: 5566.8, 60 sec: 5485.3, 300 sec: 5514.1). Total num frames: 905326592. Throughput: 0: 5724.1. Samples: 905329252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:28,938][25689] Avg episode reward: [(0, '-0.964')] [2022-07-10 20:43:29,822][26022] Updated weights on worker 0-0, policy_version 884113 (0.00091) [2022-07-10 20:43:31,512][26022] Updated weights on worker 0-0, policy_version 884123 (0.00087) [2022-07-10 20:43:33,609][26022] Updated weights on worker 0-0, policy_version 884133 (0.00085) [2022-07-10 20:43:34,009][25689] Fps is (10 sec: 5345.6, 60 sec: 5515.8, 300 sec: 5513.2). Total num frames: 905354240. Throughput: 0: 5747.7. Samples: 905362756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:34,009][25689] Avg episode reward: [(0, '-0.163')] [2022-07-10 20:43:35,297][26022] Updated weights on worker 0-0, policy_version 884143 (0.00086) [2022-07-10 20:43:37,330][26022] Updated weights on worker 0-0, policy_version 884153 (0.00083) [2022-07-10 20:43:38,815][26022] Updated weights on worker 0-0, policy_version 884163 (0.00093) [2022-07-10 20:43:39,016][25689] Fps is (10 sec: 5689.9, 60 sec: 5515.3, 300 sec: 5516.7). Total num frames: 905383936. Throughput: 0: 4952.6. Samples: 905379716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:39,017][25689] Avg episode reward: [(0, '1.226')] [2022-07-10 20:43:40,828][26022] Updated weights on worker 0-0, policy_version 884173 (0.00051) [2022-07-10 20:43:42,419][26022] Updated weights on worker 0-0, policy_version 884183 (0.00105) [2022-07-10 20:43:44,055][25689] Fps is (10 sec: 5402.1, 60 sec: 5461.5, 300 sec: 5506.4). Total num frames: 905408512. Throughput: 0: 5779.0. Samples: 905412994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:44,056][25689] Avg episode reward: [(0, '1.228')] [2022-07-10 20:43:44,532][26022] Updated weights on worker 0-0, policy_version 884193 (0.00094) [2022-07-10 20:43:46,363][26022] Updated weights on worker 0-0, policy_version 884203 (0.00089) [2022-07-10 20:43:48,259][26022] Updated weights on worker 0-0, policy_version 884213 (0.00091) [2022-07-10 20:43:49,157][25689] Fps is (10 sec: 5453.0, 60 sec: 5526.4, 300 sec: 5518.4). Total num frames: 905439232. Throughput: 0: 5779.0. Samples: 905446096. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:49,157][25689] Avg episode reward: [(0, '-0.280')] [2022-07-10 20:43:50,052][26022] Updated weights on worker 0-0, policy_version 884223 (0.00093) [2022-07-10 20:43:51,877][26022] Updated weights on worker 0-0, policy_version 884233 (0.00079) [2022-07-10 20:43:53,631][26022] Updated weights on worker 0-0, policy_version 884243 (0.00087) [2022-07-10 20:43:54,160][25689] Fps is (10 sec: 5877.4, 60 sec: 5511.8, 300 sec: 5518.7). Total num frames: 905467904. Throughput: 0: 4961.0. Samples: 905462728. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:54,161][25689] Avg episode reward: [(0, '-0.000')] [2022-07-10 20:43:55,647][26022] Updated weights on worker 0-0, policy_version 884253 (0.00090) [2022-07-10 20:43:57,275][26022] Updated weights on worker 0-0, policy_version 884263 (0.00080) [2022-07-10 20:43:59,198][25689] Fps is (10 sec: 5404.8, 60 sec: 5492.9, 300 sec: 5518.1). Total num frames: 905493504. Throughput: 0: 5776.2. Samples: 905496290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:43:59,198][25689] Avg episode reward: [(0, '0.002')] [2022-07-10 20:43:59,409][26022] Updated weights on worker 0-0, policy_version 884273 (0.00092) [2022-07-10 20:44:01,067][26022] Updated weights on worker 0-0, policy_version 884283 (0.01240) [2022-07-10 20:44:03,518][26022] Updated weights on worker 0-0, policy_version 884293 (0.00092) [2022-07-10 20:44:04,206][25689] Fps is (10 sec: 5198.7, 60 sec: 5493.9, 300 sec: 5516.3). Total num frames: 905520128. Throughput: 0: 5670.4. Samples: 905527256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 20:44:04,206][25689] Avg episode reward: [(0, '-2.223')] [2022-07-10 20:44:05,093][26022] Updated weights on worker 0-0, policy_version 884303 (0.00052) [2022-07-10 20:44:07,103][26022] Updated weights on worker 0-0, policy_version 884313 (0.00088) [2022-07-10 20:44:09,062][26022] Updated weights on worker 0-0, policy_version 884323 (0.00093) [2022-07-10 20:44:09,271][25689] Fps is (10 sec: 5388.0, 60 sec: 5477.3, 300 sec: 5512.3). Total num frames: 905547776. Throughput: 0: 4865.3. Samples: 905543958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:09,271][25689] Avg episode reward: [(0, '-2.266')] [2022-07-10 20:44:10,638][26022] Updated weights on worker 0-0, policy_version 884333 (0.00091) [2022-07-10 20:44:12,791][26022] Updated weights on worker 0-0, policy_version 884343 (0.00096) [2022-07-10 20:44:14,273][25689] Fps is (10 sec: 5594.2, 60 sec: 5518.2, 300 sec: 5512.3). Total num frames: 905576448. Throughput: 0: 5691.2. Samples: 905577196. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:14,274][25689] Avg episode reward: [(0, '-1.498')] [2022-07-10 20:44:14,396][26022] Updated weights on worker 0-0, policy_version 884353 (0.00090) [2022-07-10 20:44:16,303][26022] Updated weights on worker 0-0, policy_version 884363 (0.00091) [2022-07-10 20:44:18,191][26022] Updated weights on worker 0-0, policy_version 884373 (0.00090) [2022-07-10 20:44:19,277][25689] Fps is (10 sec: 5628.6, 60 sec: 5521.7, 300 sec: 5519.5). Total num frames: 905604096. Throughput: 0: 5700.8. Samples: 905610756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:19,277][25689] Avg episode reward: [(0, '-0.760')] [2022-07-10 20:44:20,031][26022] Updated weights on worker 0-0, policy_version 884383 (0.00094) [2022-07-10 20:44:21,837][26022] Updated weights on worker 0-0, policy_version 884393 (0.00084) [2022-07-10 20:44:23,703][26022] Updated weights on worker 0-0, policy_version 884403 (0.00109) [2022-07-10 20:44:24,307][25689] Fps is (10 sec: 5511.1, 60 sec: 5488.4, 300 sec: 5517.3). Total num frames: 905631744. Throughput: 0: 4983.1. Samples: 905627422. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:24,309][25689] Avg episode reward: [(0, '-1.201')] [2022-07-10 20:44:25,501][26022] Updated weights on worker 0-0, policy_version 884413 (0.00436) [2022-07-10 20:44:27,451][26022] Updated weights on worker 0-0, policy_version 884423 (0.00087) [2022-07-10 20:44:29,020][26022] Updated weights on worker 0-0, policy_version 884433 (0.00088) [2022-07-10 20:44:29,387][25689] Fps is (10 sec: 5570.5, 60 sec: 5522.4, 300 sec: 5512.7). Total num frames: 905660416. Throughput: 0: 5797.5. Samples: 905660584. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:29,388][25689] Avg episode reward: [(0, '-1.068')] [2022-07-10 20:44:31,293][26022] Updated weights on worker 0-0, policy_version 884443 (0.00356) [2022-07-10 20:44:32,891][26022] Updated weights on worker 0-0, policy_version 884453 (0.00090) [2022-07-10 20:44:34,414][25689] Fps is (10 sec: 5470.8, 60 sec: 5509.4, 300 sec: 5512.5). Total num frames: 905687040. Throughput: 0: 5801.6. Samples: 905694046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:34,415][25689] Avg episode reward: [(0, '0.938')] [2022-07-10 20:44:34,803][26022] Updated weights on worker 0-0, policy_version 884463 (0.00089) [2022-07-10 20:44:36,436][26022] Updated weights on worker 0-0, policy_version 884473 (0.00086) [2022-07-10 20:44:38,451][26022] Updated weights on worker 0-0, policy_version 884483 (0.00096) [2022-07-10 20:44:39,436][25689] Fps is (10 sec: 5604.6, 60 sec: 5508.1, 300 sec: 5519.7). Total num frames: 905716736. Throughput: 0: 4965.3. Samples: 905710854. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:39,437][25689] Avg episode reward: [(0, '0.868')] [2022-07-10 20:44:40,268][26022] Updated weights on worker 0-0, policy_version 884493 (0.00079) [2022-07-10 20:44:42,027][26022] Updated weights on worker 0-0, policy_version 884503 (0.00090) [2022-07-10 20:44:43,874][26022] Updated weights on worker 0-0, policy_version 884513 (0.00089) [2022-07-10 20:44:44,487][25689] Fps is (10 sec: 5591.6, 60 sec: 5540.9, 300 sec: 5506.4). Total num frames: 905743360. Throughput: 0: 5789.6. Samples: 905744254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:44,489][25689] Avg episode reward: [(0, '0.326')] [2022-07-10 20:44:45,864][26022] Updated weights on worker 0-0, policy_version 884523 (0.00091) [2022-07-10 20:44:47,627][26022] Updated weights on worker 0-0, policy_version 884533 (0.00085) [2022-07-10 20:44:49,432][26022] Updated weights on worker 0-0, policy_version 884543 (0.00087) [2022-07-10 20:44:49,589][25689] Fps is (10 sec: 5446.1, 60 sec: 5506.9, 300 sec: 5515.9). Total num frames: 905772032. Throughput: 0: 5794.4. Samples: 905777644. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:49,590][25689] Avg episode reward: [(0, '0.666')] [2022-07-10 20:44:51,186][26022] Updated weights on worker 0-0, policy_version 884553 (0.00086) [2022-07-10 20:44:53,131][26022] Updated weights on worker 0-0, policy_version 884563 (0.00085) [2022-07-10 20:44:54,621][25689] Fps is (10 sec: 5658.5, 60 sec: 5504.4, 300 sec: 5513.2). Total num frames: 905800704. Throughput: 0: 4978.3. Samples: 905794640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:54,621][25689] Avg episode reward: [(0, '0.545')] [2022-07-10 20:44:54,823][26022] Updated weights on worker 0-0, policy_version 884573 (0.00088) [2022-07-10 20:44:56,671][26022] Updated weights on worker 0-0, policy_version 884583 (0.00091) [2022-07-10 20:44:58,633][26022] Updated weights on worker 0-0, policy_version 884593 (0.00093) [2022-07-10 20:44:59,675][25689] Fps is (10 sec: 5584.5, 60 sec: 5536.8, 300 sec: 5517.2). Total num frames: 905828352. Throughput: 0: 5796.5. Samples: 905828166. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:44:59,675][25689] Avg episode reward: [(0, '0.601')] [2022-07-10 20:45:00,517][26022] Updated weights on worker 0-0, policy_version 884603 (0.00084) [2022-07-10 20:45:02,658][26022] Updated weights on worker 0-0, policy_version 884613 (0.00089) [2022-07-10 20:45:04,227][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:45:04,234][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000884621_905851904.pth [2022-07-10 20:45:04,235][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000882679_903863296.pth [2022-07-10 20:45:04,584][26022] Updated weights on worker 0-0, policy_version 884623 (0.00087) [2022-07-10 20:45:04,694][25689] Fps is (10 sec: 5286.3, 60 sec: 5518.9, 300 sec: 5515.4). Total num frames: 905853952. Throughput: 0: 5699.6. Samples: 905859426. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:04,694][25689] Avg episode reward: [(0, '0.579')] [2022-07-10 20:45:06,383][26022] Updated weights on worker 0-0, policy_version 884633 (0.00087) [2022-07-10 20:45:08,264][26022] Updated weights on worker 0-0, policy_version 884643 (0.00078) [2022-07-10 20:45:09,796][25689] Fps is (10 sec: 5463.3, 60 sec: 5549.3, 300 sec: 5520.4). Total num frames: 905883648. Throughput: 0: 4879.5. Samples: 905876240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:09,796][25689] Avg episode reward: [(0, '0.191')] [2022-07-10 20:45:09,976][26022] Updated weights on worker 0-0, policy_version 884653 (0.00083) [2022-07-10 20:45:11,960][26022] Updated weights on worker 0-0, policy_version 884663 (0.00095) [2022-07-10 20:45:13,577][26022] Updated weights on worker 0-0, policy_version 884673 (0.00089) [2022-07-10 20:45:14,800][25689] Fps is (10 sec: 5674.3, 60 sec: 5532.3, 300 sec: 5514.5). Total num frames: 905911296. Throughput: 0: 5716.2. Samples: 905909986. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:14,800][25689] Avg episode reward: [(0, '0.564')] [2022-07-10 20:45:15,645][26022] Updated weights on worker 0-0, policy_version 884683 (0.00092) [2022-07-10 20:45:17,215][26022] Updated weights on worker 0-0, policy_version 884693 (0.00086) [2022-07-10 20:45:19,237][26022] Updated weights on worker 0-0, policy_version 884703 (0.00089) [2022-07-10 20:45:19,819][25689] Fps is (10 sec: 5517.0, 60 sec: 5530.9, 300 sec: 5517.7). Total num frames: 905938944. Throughput: 0: 5718.5. Samples: 905943360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:19,819][25689] Avg episode reward: [(0, '0.722')] [2022-07-10 20:45:21,020][26022] Updated weights on worker 0-0, policy_version 884713 (0.00091) [2022-07-10 20:45:23,077][26022] Updated weights on worker 0-0, policy_version 884723 (0.00099) [2022-07-10 20:45:24,795][26022] Updated weights on worker 0-0, policy_version 884733 (0.00093) [2022-07-10 20:45:24,886][25689] Fps is (10 sec: 5482.1, 60 sec: 5527.4, 300 sec: 5514.7). Total num frames: 905966592. Throughput: 0: 4982.6. Samples: 905960038. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:24,899][25689] Avg episode reward: [(0, '0.507')] [2022-07-10 20:45:26,480][26022] Updated weights on worker 0-0, policy_version 884743 (0.00093) [2022-07-10 20:45:28,467][26022] Updated weights on worker 0-0, policy_version 884753 (0.00100) [2022-07-10 20:45:30,020][25689] Fps is (10 sec: 5520.9, 60 sec: 5522.6, 300 sec: 5512.5). Total num frames: 905995264. Throughput: 0: 5787.6. Samples: 905993288. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:30,020][25689] Avg episode reward: [(0, '0.534')] [2022-07-10 20:45:30,512][26022] Updated weights on worker 0-0, policy_version 884763 (0.00091) [2022-07-10 20:45:32,106][26022] Updated weights on worker 0-0, policy_version 884773 (0.00086) [2022-07-10 20:45:34,036][26022] Updated weights on worker 0-0, policy_version 884783 (0.00086) [2022-07-10 20:45:35,071][25689] Fps is (10 sec: 5630.3, 60 sec: 5554.1, 300 sec: 5518.8). Total num frames: 906023936. Throughput: 0: 5761.7. Samples: 906026784. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:35,073][25689] Avg episode reward: [(0, '0.239')] [2022-07-10 20:45:35,966][26022] Updated weights on worker 0-0, policy_version 884793 (0.00094) [2022-07-10 20:45:37,801][26022] Updated weights on worker 0-0, policy_version 884803 (0.00082) [2022-07-10 20:45:39,571][26022] Updated weights on worker 0-0, policy_version 884813 (0.00088) [2022-07-10 20:45:40,093][25689] Fps is (10 sec: 5590.9, 60 sec: 5520.3, 300 sec: 5515.4). Total num frames: 906051584. Throughput: 0: 5753.7. Samples: 906060014. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:40,095][25689] Avg episode reward: [(0, '0.332')] [2022-07-10 20:45:41,212][26022] Updated weights on worker 0-0, policy_version 884823 (0.00080) [2022-07-10 20:45:43,169][26022] Updated weights on worker 0-0, policy_version 884833 (0.00095) [2022-07-10 20:45:45,001][26022] Updated weights on worker 0-0, policy_version 884843 (0.00090) [2022-07-10 20:45:45,108][25689] Fps is (10 sec: 5509.1, 60 sec: 5540.5, 300 sec: 5512.8). Total num frames: 906079232. Throughput: 0: 5776.1. Samples: 906076842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:45,110][25689] Avg episode reward: [(0, '-0.726')] [2022-07-10 20:45:46,831][26022] Updated weights on worker 0-0, policy_version 884853 (0.00088) [2022-07-10 20:45:48,835][26022] Updated weights on worker 0-0, policy_version 884863 (0.00093) [2022-07-10 20:45:50,198][25689] Fps is (10 sec: 5573.6, 60 sec: 5541.7, 300 sec: 5522.3). Total num frames: 906107904. Throughput: 0: 5776.8. Samples: 906109854. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:50,199][25689] Avg episode reward: [(0, '-0.531')] [2022-07-10 20:45:50,647][26022] Updated weights on worker 0-0, policy_version 884873 (0.00093) [2022-07-10 20:45:52,466][26022] Updated weights on worker 0-0, policy_version 884883 (0.00091) [2022-07-10 20:45:54,422][26022] Updated weights on worker 0-0, policy_version 884893 (0.00088) [2022-07-10 20:45:55,266][25689] Fps is (10 sec: 5544.5, 60 sec: 5521.4, 300 sec: 5514.2). Total num frames: 906135552. Throughput: 0: 5773.2. Samples: 906143374. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:45:55,267][25689] Avg episode reward: [(0, '-0.592')] [2022-07-10 20:45:56,296][26022] Updated weights on worker 0-0, policy_version 884903 (0.00090) [2022-07-10 20:45:57,920][26022] Updated weights on worker 0-0, policy_version 884913 (0.00086) [2022-07-10 20:45:59,941][26022] Updated weights on worker 0-0, policy_version 884923 (0.00086) [2022-07-10 20:46:00,273][25689] Fps is (10 sec: 5488.6, 60 sec: 5525.7, 300 sec: 5521.3). Total num frames: 906163200. Throughput: 0: 4952.7. Samples: 906159958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:00,273][25689] Avg episode reward: [(0, '-0.895')] [2022-07-10 20:46:01,459][26022] Updated weights on worker 0-0, policy_version 884933 (0.00092) [2022-07-10 20:46:03,886][26022] Updated weights on worker 0-0, policy_version 884943 (0.00089) [2022-07-10 20:46:05,280][25689] Fps is (10 sec: 5522.0, 60 sec: 5560.6, 300 sec: 5525.6). Total num frames: 906190848. Throughput: 0: 5682.7. Samples: 906191470. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:05,280][25689] Avg episode reward: [(0, '-0.548')] [2022-07-10 20:46:05,486][26022] Updated weights on worker 0-0, policy_version 884953 (0.00093) [2022-07-10 20:46:07,517][26022] Updated weights on worker 0-0, policy_version 884963 (0.00086) [2022-07-10 20:46:09,433][26022] Updated weights on worker 0-0, policy_version 884973 (0.00089) [2022-07-10 20:46:10,394][25689] Fps is (10 sec: 5362.1, 60 sec: 5508.8, 300 sec: 5517.7). Total num frames: 906217472. Throughput: 0: 5688.2. Samples: 906224734. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:10,396][25689] Avg episode reward: [(0, '-1.467')] [2022-07-10 20:46:11,169][26022] Updated weights on worker 0-0, policy_version 884983 (0.00088) [2022-07-10 20:46:13,050][26022] Updated weights on worker 0-0, policy_version 884993 (0.00081) [2022-07-10 20:46:14,896][26022] Updated weights on worker 0-0, policy_version 885003 (0.00091) [2022-07-10 20:46:15,404][25689] Fps is (10 sec: 5360.6, 60 sec: 5508.2, 300 sec: 5518.3). Total num frames: 906245120. Throughput: 0: 4875.6. Samples: 906241558. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:15,405][25689] Avg episode reward: [(0, '-0.949')] [2022-07-10 20:46:16,756][26022] Updated weights on worker 0-0, policy_version 885013 (0.00087) [2022-07-10 20:46:18,727][26022] Updated weights on worker 0-0, policy_version 885023 (0.00095) [2022-07-10 20:46:20,412][25689] Fps is (10 sec: 5519.7, 60 sec: 5509.2, 300 sec: 5519.0). Total num frames: 906272768. Throughput: 0: 5699.4. Samples: 906274740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:20,414][25689] Avg episode reward: [(0, '-0.600')] [2022-07-10 20:46:20,471][26022] Updated weights on worker 0-0, policy_version 885033 (0.00111) [2022-07-10 20:46:22,278][26022] Updated weights on worker 0-0, policy_version 885043 (0.00096) [2022-07-10 20:46:24,292][26022] Updated weights on worker 0-0, policy_version 885053 (0.00088) [2022-07-10 20:46:25,469][25689] Fps is (10 sec: 5493.8, 60 sec: 5510.2, 300 sec: 5515.7). Total num frames: 906300416. Throughput: 0: 5792.2. Samples: 906308410. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:25,471][25689] Avg episode reward: [(0, '-0.211')] [2022-07-10 20:46:25,849][26022] Updated weights on worker 0-0, policy_version 885063 (0.00092) [2022-07-10 20:46:27,866][26022] Updated weights on worker 0-0, policy_version 885073 (0.00087) [2022-07-10 20:46:29,784][26022] Updated weights on worker 0-0, policy_version 885083 (0.00092) [2022-07-10 20:46:30,575][25689] Fps is (10 sec: 5642.6, 60 sec: 5529.7, 300 sec: 5524.1). Total num frames: 906330112. Throughput: 0: 4962.7. Samples: 906324884. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:30,575][25689] Avg episode reward: [(0, '-0.261')] [2022-07-10 20:46:31,464][26022] Updated weights on worker 0-0, policy_version 885093 (0.00092) [2022-07-10 20:46:33,459][26022] Updated weights on worker 0-0, policy_version 885103 (0.00095) [2022-07-10 20:46:35,015][26022] Updated weights on worker 0-0, policy_version 885113 (0.00087) [2022-07-10 20:46:35,588][25689] Fps is (10 sec: 5667.3, 60 sec: 5516.3, 300 sec: 5517.2). Total num frames: 906357760. Throughput: 0: 5793.5. Samples: 906358492. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:35,588][25689] Avg episode reward: [(0, '0.321')] [2022-07-10 20:46:37,245][26022] Updated weights on worker 0-0, policy_version 885123 (0.00090) [2022-07-10 20:46:38,647][26022] Updated weights on worker 0-0, policy_version 885133 (0.00086) [2022-07-10 20:46:40,599][25689] Fps is (10 sec: 5516.4, 60 sec: 5517.3, 300 sec: 5520.7). Total num frames: 906385408. Throughput: 0: 5822.4. Samples: 906392274. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:40,599][25689] Avg episode reward: [(0, '1.122')] [2022-07-10 20:46:40,682][26022] Updated weights on worker 0-0, policy_version 885143 (0.00083) [2022-07-10 20:46:42,436][26022] Updated weights on worker 0-0, policy_version 885153 (0.00093) [2022-07-10 20:46:44,323][26022] Updated weights on worker 0-0, policy_version 885163 (0.00093) [2022-07-10 20:46:45,611][25689] Fps is (10 sec: 5516.8, 60 sec: 5517.5, 300 sec: 5518.2). Total num frames: 906413056. Throughput: 0: 4996.9. Samples: 906409056. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:45,613][25689] Avg episode reward: [(0, '1.021')] [2022-07-10 20:46:46,083][26022] Updated weights on worker 0-0, policy_version 885173 (0.00095) [2022-07-10 20:46:48,104][26022] Updated weights on worker 0-0, policy_version 885183 (0.00088) [2022-07-10 20:46:49,858][26022] Updated weights on worker 0-0, policy_version 885193 (0.00085) [2022-07-10 20:46:50,670][25689] Fps is (10 sec: 5490.7, 60 sec: 5503.4, 300 sec: 5517.6). Total num frames: 906440704. Throughput: 0: 5833.5. Samples: 906442108. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:50,670][25689] Avg episode reward: [(0, '1.010')] [2022-07-10 20:46:51,765][26022] Updated weights on worker 0-0, policy_version 885203 (0.00073) [2022-07-10 20:46:53,546][26022] Updated weights on worker 0-0, policy_version 885213 (0.00089) [2022-07-10 20:46:55,512][26022] Updated weights on worker 0-0, policy_version 885223 (0.00083) [2022-07-10 20:46:55,690][25689] Fps is (10 sec: 5485.9, 60 sec: 5507.7, 300 sec: 5517.9). Total num frames: 906468352. Throughput: 0: 5837.0. Samples: 906475832. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:46:55,692][25689] Avg episode reward: [(0, '0.536')] [2022-07-10 20:46:57,136][26022] Updated weights on worker 0-0, policy_version 885233 (0.00093) [2022-07-10 20:46:59,212][26022] Updated weights on worker 0-0, policy_version 885243 (0.00088) [2022-07-10 20:47:00,743][25689] Fps is (10 sec: 5692.5, 60 sec: 5537.4, 300 sec: 5532.5). Total num frames: 906498048. Throughput: 0: 4977.3. Samples: 906492540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:00,745][25689] Avg episode reward: [(0, '0.597')] [2022-07-10 20:47:00,770][26022] Updated weights on worker 0-0, policy_version 885253 (0.00093) [2022-07-10 20:47:03,185][26022] Updated weights on worker 0-0, policy_version 885263 (0.00087) [2022-07-10 20:47:04,304][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:47:04,320][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000885270_906516480.pth [2022-07-10 20:47:04,320][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000883326_904525824.pth [2022-07-10 20:47:04,828][26022] Updated weights on worker 0-0, policy_version 885273 (0.00085) [2022-07-10 20:47:05,755][25689] Fps is (10 sec: 5392.6, 60 sec: 5486.2, 300 sec: 5517.6). Total num frames: 906522624. Throughput: 0: 5692.3. Samples: 906523722. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:05,756][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 20:47:06,849][26022] Updated weights on worker 0-0, policy_version 885283 (0.00086) [2022-07-10 20:47:08,591][26022] Updated weights on worker 0-0, policy_version 885293 (0.00084) [2022-07-10 20:47:10,509][26022] Updated weights on worker 0-0, policy_version 885303 (0.00085) [2022-07-10 20:47:10,810][25689] Fps is (10 sec: 5289.5, 60 sec: 5525.4, 300 sec: 5516.9). Total num frames: 906551296. Throughput: 0: 5723.5. Samples: 906557382. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:10,810][25689] Avg episode reward: [(0, '0.715')] [2022-07-10 20:47:12,118][26022] Updated weights on worker 0-0, policy_version 885313 (0.00092) [2022-07-10 20:47:14,255][26022] Updated weights on worker 0-0, policy_version 885323 (0.00094) [2022-07-10 20:47:15,815][25689] Fps is (10 sec: 5700.1, 60 sec: 5542.8, 300 sec: 5521.2). Total num frames: 906579968. Throughput: 0: 4889.1. Samples: 906574222. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:15,815][25689] Avg episode reward: [(0, '0.850')] [2022-07-10 20:47:15,891][26022] Updated weights on worker 0-0, policy_version 885333 (0.00092) [2022-07-10 20:47:17,875][26022] Updated weights on worker 0-0, policy_version 885343 (0.00088) [2022-07-10 20:47:19,608][26022] Updated weights on worker 0-0, policy_version 885353 (0.00086) [2022-07-10 20:47:20,839][25689] Fps is (10 sec: 5513.7, 60 sec: 5524.4, 300 sec: 5514.9). Total num frames: 906606592. Throughput: 0: 5730.4. Samples: 906607696. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:20,839][25689] Avg episode reward: [(0, '1.227')] [2022-07-10 20:47:21,555][26022] Updated weights on worker 0-0, policy_version 885363 (0.00084) [2022-07-10 20:47:23,485][26022] Updated weights on worker 0-0, policy_version 885373 (0.00088) [2022-07-10 20:47:25,106][26022] Updated weights on worker 0-0, policy_version 885383 (0.00082) [2022-07-10 20:47:25,883][25689] Fps is (10 sec: 5492.3, 60 sec: 5542.6, 300 sec: 5515.4). Total num frames: 906635264. Throughput: 0: 5839.5. Samples: 906641260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:25,883][25689] Avg episode reward: [(0, '1.695')] [2022-07-10 20:47:26,983][26022] Updated weights on worker 0-0, policy_version 885393 (0.00090) [2022-07-10 20:47:29,056][26022] Updated weights on worker 0-0, policy_version 885403 (0.00090) [2022-07-10 20:47:30,645][26022] Updated weights on worker 0-0, policy_version 885413 (0.00094) [2022-07-10 20:47:30,949][25689] Fps is (10 sec: 5773.3, 60 sec: 5546.2, 300 sec: 5528.6). Total num frames: 906664960. Throughput: 0: 4984.7. Samples: 906657768. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:30,949][25689] Avg episode reward: [(0, '0.842')] [2022-07-10 20:47:32,632][26022] Updated weights on worker 0-0, policy_version 885423 (0.00090) [2022-07-10 20:47:34,319][26022] Updated weights on worker 0-0, policy_version 885433 (0.00089) [2022-07-10 20:47:35,950][25689] Fps is (10 sec: 5492.5, 60 sec: 5513.3, 300 sec: 5514.8). Total num frames: 906690560. Throughput: 0: 5815.4. Samples: 906691318. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:35,951][25689] Avg episode reward: [(0, '0.574')] [2022-07-10 20:47:36,259][26022] Updated weights on worker 0-0, policy_version 885443 (0.00085) [2022-07-10 20:47:37,911][26022] Updated weights on worker 0-0, policy_version 885453 (0.00088) [2022-07-10 20:47:39,883][26022] Updated weights on worker 0-0, policy_version 885463 (0.00075) [2022-07-10 20:47:40,975][25689] Fps is (10 sec: 5515.3, 60 sec: 5546.0, 300 sec: 5521.4). Total num frames: 906720256. Throughput: 0: 5832.7. Samples: 906725142. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:40,975][25689] Avg episode reward: [(0, '0.501')] [2022-07-10 20:47:41,609][26022] Updated weights on worker 0-0, policy_version 885473 (0.00090) [2022-07-10 20:47:43,687][26022] Updated weights on worker 0-0, policy_version 885483 (0.00084) [2022-07-10 20:47:45,282][26022] Updated weights on worker 0-0, policy_version 885493 (0.00084) [2022-07-10 20:47:45,983][25689] Fps is (10 sec: 5818.0, 60 sec: 5563.4, 300 sec: 5529.4). Total num frames: 906748928. Throughput: 0: 5004.8. Samples: 906741856. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 20:47:45,983][25689] Avg episode reward: [(0, '-0.471')] [2022-07-10 20:47:47,391][26022] Updated weights on worker 0-0, policy_version 885503 (0.00090) [2022-07-10 20:47:49,062][26022] Updated weights on worker 0-0, policy_version 885513 (0.00087) [2022-07-10 20:47:51,016][26022] Updated weights on worker 0-0, policy_version 885523 (0.00092) [2022-07-10 20:47:51,055][25689] Fps is (10 sec: 5485.7, 60 sec: 5545.2, 300 sec: 5518.3). Total num frames: 906775552. Throughput: 0: 5830.7. Samples: 906775000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:47:51,055][25689] Avg episode reward: [(0, '-1.735')] [2022-07-10 20:47:52,858][26022] Updated weights on worker 0-0, policy_version 885533 (0.00084) [2022-07-10 20:47:54,657][26022] Updated weights on worker 0-0, policy_version 885543 (0.00088) [2022-07-10 20:47:56,058][25689] Fps is (10 sec: 5488.3, 60 sec: 5563.8, 300 sec: 5525.4). Total num frames: 906804224. Throughput: 0: 5835.2. Samples: 906808650. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:47:56,059][25689] Avg episode reward: [(0, '-1.872')] [2022-07-10 20:47:56,380][26022] Updated weights on worker 0-0, policy_version 885553 (0.00086) [2022-07-10 20:47:58,338][26022] Updated weights on worker 0-0, policy_version 885563 (0.00091) [2022-07-10 20:47:59,983][26022] Updated weights on worker 0-0, policy_version 885573 (0.00084) [2022-07-10 20:48:01,071][25689] Fps is (10 sec: 5623.0, 60 sec: 5533.5, 300 sec: 5529.0). Total num frames: 906831872. Throughput: 0: 4994.4. Samples: 906825510. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:01,071][25689] Avg episode reward: [(0, '-0.957')] [2022-07-10 20:48:02,342][26022] Updated weights on worker 0-0, policy_version 885583 (0.00084) [2022-07-10 20:48:04,423][26022] Updated weights on worker 0-0, policy_version 885593 (0.00087) [2022-07-10 20:48:05,875][26022] Updated weights on worker 0-0, policy_version 885603 (0.00087) [2022-07-10 20:48:06,080][25689] Fps is (10 sec: 5415.5, 60 sec: 5567.7, 300 sec: 5523.2). Total num frames: 906858496. Throughput: 0: 5721.1. Samples: 906856834. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:06,080][25689] Avg episode reward: [(0, '-1.072')] [2022-07-10 20:48:07,862][26022] Updated weights on worker 0-0, policy_version 885613 (0.00089) [2022-07-10 20:48:09,579][26022] Updated weights on worker 0-0, policy_version 885623 (0.00088) [2022-07-10 20:48:11,138][25689] Fps is (10 sec: 5492.5, 60 sec: 5567.4, 300 sec: 5530.5). Total num frames: 906887168. Throughput: 0: 5738.4. Samples: 906890248. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:11,139][25689] Avg episode reward: [(0, '-1.274')] [2022-07-10 20:48:11,477][26022] Updated weights on worker 0-0, policy_version 885633 (0.00092) [2022-07-10 20:48:13,643][26022] Updated weights on worker 0-0, policy_version 885643 (0.00096) [2022-07-10 20:48:15,041][26022] Updated weights on worker 0-0, policy_version 885653 (0.00086) [2022-07-10 20:48:16,160][25689] Fps is (10 sec: 5384.1, 60 sec: 5515.0, 300 sec: 5524.0). Total num frames: 906912768. Throughput: 0: 4885.3. Samples: 906906852. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:16,160][25689] Avg episode reward: [(0, '-0.244')] [2022-07-10 20:48:17,059][26022] Updated weights on worker 0-0, policy_version 885663 (0.00090) [2022-07-10 20:48:19,156][26022] Updated weights on worker 0-0, policy_version 885673 (0.00098) [2022-07-10 20:48:20,602][26022] Updated weights on worker 0-0, policy_version 885683 (0.00104) [2022-07-10 20:48:21,169][25689] Fps is (10 sec: 5512.8, 60 sec: 5567.3, 300 sec: 5524.5). Total num frames: 906942464. Throughput: 0: 5707.9. Samples: 906940228. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:21,169][25689] Avg episode reward: [(0, '0.256')] [2022-07-10 20:48:22,721][26022] Updated weights on worker 0-0, policy_version 885693 (0.00090) [2022-07-10 20:48:24,323][26022] Updated weights on worker 0-0, policy_version 885703 (0.00089) [2022-07-10 20:48:26,179][25689] Fps is (10 sec: 5621.1, 60 sec: 5536.4, 300 sec: 5525.8). Total num frames: 906969088. Throughput: 0: 5817.5. Samples: 906973762. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:26,180][25689] Avg episode reward: [(0, '0.255')] [2022-07-10 20:48:26,258][26022] Updated weights on worker 0-0, policy_version 885713 (0.00087) [2022-07-10 20:48:28,232][26022] Updated weights on worker 0-0, policy_version 885723 (0.00088) [2022-07-10 20:48:29,698][26022] Updated weights on worker 0-0, policy_version 885733 (0.00094) [2022-07-10 20:48:31,221][25689] Fps is (10 sec: 5398.9, 60 sec: 5504.6, 300 sec: 5526.3). Total num frames: 906996736. Throughput: 0: 5820.9. Samples: 907007148. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:31,222][25689] Avg episode reward: [(0, '0.279')] [2022-07-10 20:48:31,753][26022] Updated weights on worker 0-0, policy_version 885743 (0.00090) [2022-07-10 20:48:33,755][26022] Updated weights on worker 0-0, policy_version 885753 (0.00099) [2022-07-10 20:48:35,303][26022] Updated weights on worker 0-0, policy_version 885763 (0.00093) [2022-07-10 20:48:36,271][25689] Fps is (10 sec: 5682.4, 60 sec: 5568.1, 300 sec: 5525.6). Total num frames: 907026432. Throughput: 0: 5823.4. Samples: 907023966. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:36,271][25689] Avg episode reward: [(0, '0.416')] [2022-07-10 20:48:37,448][26022] Updated weights on worker 0-0, policy_version 885773 (0.00086) [2022-07-10 20:48:38,923][26022] Updated weights on worker 0-0, policy_version 885783 (0.00078) [2022-07-10 20:48:41,091][26022] Updated weights on worker 0-0, policy_version 885793 (0.00088) [2022-07-10 20:48:41,299][25689] Fps is (10 sec: 5588.3, 60 sec: 5516.8, 300 sec: 5532.6). Total num frames: 907053056. Throughput: 0: 5822.7. Samples: 907057444. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:41,300][25689] Avg episode reward: [(0, '0.568')] [2022-07-10 20:48:42,781][26022] Updated weights on worker 0-0, policy_version 885803 (0.00086) [2022-07-10 20:48:44,546][26022] Updated weights on worker 0-0, policy_version 885813 (0.00087) [2022-07-10 20:48:46,308][25689] Fps is (10 sec: 5509.1, 60 sec: 5516.8, 300 sec: 5527.5). Total num frames: 907081728. Throughput: 0: 5817.5. Samples: 907090862. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:46,308][25689] Avg episode reward: [(0, '1.154')] [2022-07-10 20:48:46,419][26022] Updated weights on worker 0-0, policy_version 885823 (0.00098) [2022-07-10 20:48:48,359][26022] Updated weights on worker 0-0, policy_version 885833 (0.00091) [2022-07-10 20:48:50,143][26022] Updated weights on worker 0-0, policy_version 885843 (0.00085) [2022-07-10 20:48:51,430][25689] Fps is (10 sec: 5559.5, 60 sec: 5529.2, 300 sec: 5521.8). Total num frames: 907109376. Throughput: 0: 4964.6. Samples: 907107476. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:51,430][25689] Avg episode reward: [(0, '1.126')] [2022-07-10 20:48:52,136][26022] Updated weights on worker 0-0, policy_version 885853 (0.00089) [2022-07-10 20:48:54,011][26022] Updated weights on worker 0-0, policy_version 885863 (0.00088) [2022-07-10 20:48:55,758][26022] Updated weights on worker 0-0, policy_version 885873 (0.00074) [2022-07-10 20:48:56,447][25689] Fps is (10 sec: 5453.7, 60 sec: 5510.9, 300 sec: 5529.1). Total num frames: 907137024. Throughput: 0: 5770.2. Samples: 907140388. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:48:56,448][25689] Avg episode reward: [(0, '1.121')] [2022-07-10 20:48:57,553][26022] Updated weights on worker 0-0, policy_version 885883 (0.00093) [2022-07-10 20:48:59,419][26022] Updated weights on worker 0-0, policy_version 885893 (0.00088) [2022-07-10 20:49:01,227][26022] Updated weights on worker 0-0, policy_version 885903 (0.00084) [2022-07-10 20:49:01,534][25689] Fps is (10 sec: 5574.2, 60 sec: 5521.1, 300 sec: 5534.5). Total num frames: 907165696. Throughput: 0: 5758.5. Samples: 907173962. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:01,534][25689] Avg episode reward: [(0, '1.142')] [2022-07-10 20:49:03,660][26022] Updated weights on worker 0-0, policy_version 885913 (0.00088) [2022-07-10 20:49:04,335][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:49:04,343][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000885918_907180032.pth [2022-07-10 20:49:04,344][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000883974_905189376.pth [2022-07-10 20:49:05,357][26022] Updated weights on worker 0-0, policy_version 885923 (0.00091) [2022-07-10 20:49:06,556][25689] Fps is (10 sec: 5267.3, 60 sec: 5486.0, 300 sec: 5525.0). Total num frames: 907190272. Throughput: 0: 4827.9. Samples: 907188620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:06,557][25689] Avg episode reward: [(0, '1.295')] [2022-07-10 20:49:07,220][26022] Updated weights on worker 0-0, policy_version 885933 (0.00086) [2022-07-10 20:49:09,088][26022] Updated weights on worker 0-0, policy_version 885943 (0.00090) [2022-07-10 20:49:10,860][26022] Updated weights on worker 0-0, policy_version 885953 (0.00100) [2022-07-10 20:49:11,689][25689] Fps is (10 sec: 5444.9, 60 sec: 5513.1, 300 sec: 5529.4). Total num frames: 907220992. Throughput: 0: 5654.2. Samples: 907222026. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:11,690][25689] Avg episode reward: [(0, '0.441')] [2022-07-10 20:49:12,939][26022] Updated weights on worker 0-0, policy_version 885963 (0.00085) [2022-07-10 20:49:14,319][26022] Updated weights on worker 0-0, policy_version 885973 (0.00088) [2022-07-10 20:49:16,463][26022] Updated weights on worker 0-0, policy_version 885983 (0.00091) [2022-07-10 20:49:16,784][25689] Fps is (10 sec: 5707.2, 60 sec: 5540.3, 300 sec: 5527.8). Total num frames: 907248640. Throughput: 0: 5678.5. Samples: 907255868. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:16,784][25689] Avg episode reward: [(0, '0.237')] [2022-07-10 20:49:18,074][26022] Updated weights on worker 0-0, policy_version 885993 (0.00088) [2022-07-10 20:49:20,099][26022] Updated weights on worker 0-0, policy_version 886003 (0.00089) [2022-07-10 20:49:21,805][25689] Fps is (10 sec: 5466.5, 60 sec: 5505.4, 300 sec: 5527.9). Total num frames: 907276288. Throughput: 0: 4856.7. Samples: 907272412. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:21,805][25689] Avg episode reward: [(0, '-0.404')] [2022-07-10 20:49:21,857][26022] Updated weights on worker 0-0, policy_version 886013 (0.00087) [2022-07-10 20:49:23,544][26022] Updated weights on worker 0-0, policy_version 886023 (0.00088) [2022-07-10 20:49:25,715][26022] Updated weights on worker 0-0, policy_version 886033 (0.00091) [2022-07-10 20:49:26,887][25689] Fps is (10 sec: 5574.4, 60 sec: 5532.6, 300 sec: 5527.9). Total num frames: 907304960. Throughput: 0: 5762.3. Samples: 907305770. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:26,887][25689] Avg episode reward: [(0, '-1.014')] [2022-07-10 20:49:27,393][26022] Updated weights on worker 0-0, policy_version 886043 (0.00088) [2022-07-10 20:49:29,206][26022] Updated weights on worker 0-0, policy_version 886053 (0.00090) [2022-07-10 20:49:31,350][26022] Updated weights on worker 0-0, policy_version 886063 (0.00088) [2022-07-10 20:49:31,954][25689] Fps is (10 sec: 5448.1, 60 sec: 5513.4, 300 sec: 5527.1). Total num frames: 907331584. Throughput: 0: 5776.7. Samples: 907339092. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:31,956][25689] Avg episode reward: [(0, '-1.105')] [2022-07-10 20:49:32,752][26022] Updated weights on worker 0-0, policy_version 886073 (0.00086) [2022-07-10 20:49:34,960][26022] Updated weights on worker 0-0, policy_version 886083 (0.00081) [2022-07-10 20:49:36,455][26022] Updated weights on worker 0-0, policy_version 886093 (0.00092) [2022-07-10 20:49:36,961][25689] Fps is (10 sec: 5590.7, 60 sec: 5517.3, 300 sec: 5527.4). Total num frames: 907361280. Throughput: 0: 4966.2. Samples: 907356072. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:36,963][25689] Avg episode reward: [(0, '-1.085')] [2022-07-10 20:49:38,543][26022] Updated weights on worker 0-0, policy_version 886103 (0.01014) [2022-07-10 20:49:40,246][26022] Updated weights on worker 0-0, policy_version 886113 (0.00109) [2022-07-10 20:49:42,006][25689] Fps is (10 sec: 5704.8, 60 sec: 5532.7, 300 sec: 5531.0). Total num frames: 907388928. Throughput: 0: 5786.2. Samples: 907389302. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:42,006][25689] Avg episode reward: [(0, '-0.461')] [2022-07-10 20:49:42,149][26022] Updated weights on worker 0-0, policy_version 886123 (0.00087) [2022-07-10 20:49:43,947][26022] Updated weights on worker 0-0, policy_version 886133 (0.00092) [2022-07-10 20:49:45,906][26022] Updated weights on worker 0-0, policy_version 886143 (0.00093) [2022-07-10 20:49:47,085][25689] Fps is (10 sec: 5461.7, 60 sec: 5509.4, 300 sec: 5528.0). Total num frames: 907416576. Throughput: 0: 5792.2. Samples: 907422762. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:47,086][25689] Avg episode reward: [(0, '-0.362')] [2022-07-10 20:49:47,694][26022] Updated weights on worker 0-0, policy_version 886153 (0.00079) [2022-07-10 20:49:49,422][26022] Updated weights on worker 0-0, policy_version 886163 (0.00092) [2022-07-10 20:49:51,332][26022] Updated weights on worker 0-0, policy_version 886173 (0.00090) [2022-07-10 20:49:52,129][25689] Fps is (10 sec: 5563.5, 60 sec: 5533.4, 300 sec: 5527.7). Total num frames: 907445248. Throughput: 0: 4970.5. Samples: 907439372. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:52,130][25689] Avg episode reward: [(0, '-0.597')] [2022-07-10 20:49:53,155][26022] Updated weights on worker 0-0, policy_version 886183 (0.00087) [2022-07-10 20:49:55,031][26022] Updated weights on worker 0-0, policy_version 886193 (0.00083) [2022-07-10 20:49:56,804][26022] Updated weights on worker 0-0, policy_version 886203 (0.00094) [2022-07-10 20:49:57,152][25689] Fps is (10 sec: 5594.5, 60 sec: 5532.9, 300 sec: 5528.3). Total num frames: 907472896. Throughput: 0: 5779.3. Samples: 907472764. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:49:57,153][25689] Avg episode reward: [(0, '-0.544')] [2022-07-10 20:49:58,756][26022] Updated weights on worker 0-0, policy_version 886213 (0.00089) [2022-07-10 20:50:00,555][26022] Updated weights on worker 0-0, policy_version 886223 (0.00085) [2022-07-10 20:50:02,191][25689] Fps is (10 sec: 5495.7, 60 sec: 5520.3, 300 sec: 5534.8). Total num frames: 907500544. Throughput: 0: 5795.2. Samples: 907506278. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:02,191][25689] Avg episode reward: [(0, '-1.314')] [2022-07-10 20:50:02,620][26022] Updated weights on worker 0-0, policy_version 886233 (0.00087) [2022-07-10 20:50:04,588][26022] Updated weights on worker 0-0, policy_version 886243 (0.00097) [2022-07-10 20:50:06,411][26022] Updated weights on worker 0-0, policy_version 886253 (0.00094) [2022-07-10 20:50:07,219][25689] Fps is (10 sec: 5391.3, 60 sec: 5553.6, 300 sec: 5525.9). Total num frames: 907527168. Throughput: 0: 4879.2. Samples: 907520998. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:07,222][25689] Avg episode reward: [(0, '-2.087')] [2022-07-10 20:50:08,483][26022] Updated weights on worker 0-0, policy_version 886263 (0.00092) [2022-07-10 20:50:10,091][26022] Updated weights on worker 0-0, policy_version 886273 (0.00094) [2022-07-10 20:50:12,010][26022] Updated weights on worker 0-0, policy_version 886283 (0.00093) [2022-07-10 20:50:12,311][25689] Fps is (10 sec: 5362.8, 60 sec: 5506.7, 300 sec: 5524.2). Total num frames: 907554816. Throughput: 0: 5690.8. Samples: 907554222. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:12,311][25689] Avg episode reward: [(0, '-1.879')] [2022-07-10 20:50:13,781][26022] Updated weights on worker 0-0, policy_version 886293 (0.00082) [2022-07-10 20:50:15,548][26022] Updated weights on worker 0-0, policy_version 886303 (0.00094) [2022-07-10 20:50:17,380][25689] Fps is (10 sec: 5441.7, 60 sec: 5509.0, 300 sec: 5523.3). Total num frames: 907582464. Throughput: 0: 5682.9. Samples: 907587718. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:17,381][25689] Avg episode reward: [(0, '-2.641')] [2022-07-10 20:50:17,623][26022] Updated weights on worker 0-0, policy_version 886313 (0.00090) [2022-07-10 20:50:19,319][26022] Updated weights on worker 0-0, policy_version 886323 (0.00093) [2022-07-10 20:50:21,243][26022] Updated weights on worker 0-0, policy_version 886333 (0.00085) [2022-07-10 20:50:22,396][25689] Fps is (10 sec: 5584.2, 60 sec: 5526.3, 300 sec: 5527.7). Total num frames: 907611136. Throughput: 0: 4857.4. Samples: 907604424. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:22,397][25689] Avg episode reward: [(0, '-1.890')] [2022-07-10 20:50:23,278][26022] Updated weights on worker 0-0, policy_version 886343 (0.00091) [2022-07-10 20:50:24,991][26022] Updated weights on worker 0-0, policy_version 886353 (0.00093) [2022-07-10 20:50:26,891][26022] Updated weights on worker 0-0, policy_version 886363 (0.00098) [2022-07-10 20:50:27,431][25689] Fps is (10 sec: 5501.7, 60 sec: 5496.8, 300 sec: 5522.6). Total num frames: 907637760. Throughput: 0: 5781.8. Samples: 907637860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:27,431][25689] Avg episode reward: [(0, '-3.303')] [2022-07-10 20:50:28,583][26022] Updated weights on worker 0-0, policy_version 886373 (0.00092) [2022-07-10 20:50:30,602][26022] Updated weights on worker 0-0, policy_version 886383 (0.00093) [2022-07-10 20:50:32,268][26022] Updated weights on worker 0-0, policy_version 886393 (0.00085) [2022-07-10 20:50:32,524][25689] Fps is (10 sec: 5561.3, 60 sec: 5545.2, 300 sec: 5525.3). Total num frames: 907667456. Throughput: 0: 5779.7. Samples: 907671044. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:32,524][25689] Avg episode reward: [(0, '-2.348')] [2022-07-10 20:50:34,315][26022] Updated weights on worker 0-0, policy_version 886403 (0.00097) [2022-07-10 20:50:36,076][26022] Updated weights on worker 0-0, policy_version 886413 (0.00092) [2022-07-10 20:50:37,527][25689] Fps is (10 sec: 5680.0, 60 sec: 5511.8, 300 sec: 5525.7). Total num frames: 907695104. Throughput: 0: 5787.5. Samples: 907704316. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:37,527][25689] Avg episode reward: [(0, '-2.074')] [2022-07-10 20:50:37,739][26022] Updated weights on worker 0-0, policy_version 886423 (0.00093) [2022-07-10 20:50:39,711][26022] Updated weights on worker 0-0, policy_version 886433 (0.00092) [2022-07-10 20:50:41,489][26022] Updated weights on worker 0-0, policy_version 886443 (0.00093) [2022-07-10 20:50:42,542][25689] Fps is (10 sec: 5519.4, 60 sec: 5514.5, 300 sec: 5525.7). Total num frames: 907722752. Throughput: 0: 5785.0. Samples: 907720966. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:42,543][25689] Avg episode reward: [(0, '-2.865')] [2022-07-10 20:50:43,482][26022] Updated weights on worker 0-0, policy_version 886453 (0.00090) [2022-07-10 20:50:45,295][26022] Updated weights on worker 0-0, policy_version 886463 (0.00097) [2022-07-10 20:50:47,159][26022] Updated weights on worker 0-0, policy_version 886473 (0.00094) [2022-07-10 20:50:47,567][25689] Fps is (10 sec: 5507.6, 60 sec: 5519.4, 300 sec: 5523.4). Total num frames: 907750400. Throughput: 0: 5777.9. Samples: 907754202. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:47,567][25689] Avg episode reward: [(0, '-1.991')] [2022-07-10 20:50:49,057][26022] Updated weights on worker 0-0, policy_version 886483 (0.00093) [2022-07-10 20:50:50,652][26022] Updated weights on worker 0-0, policy_version 886493 (0.00088) [2022-07-10 20:50:52,671][25689] Fps is (10 sec: 5358.2, 60 sec: 5480.1, 300 sec: 5519.3). Total num frames: 907777024. Throughput: 0: 5773.7. Samples: 907787368. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:52,671][25689] Avg episode reward: [(0, '-1.473')] [2022-07-10 20:50:52,875][26022] Updated weights on worker 0-0, policy_version 886503 (0.00084) [2022-07-10 20:50:54,457][26022] Updated weights on worker 0-0, policy_version 886513 (0.00084) [2022-07-10 20:50:56,342][26022] Updated weights on worker 0-0, policy_version 886523 (0.00087) [2022-07-10 20:50:57,738][25689] Fps is (10 sec: 5637.6, 60 sec: 5526.8, 300 sec: 5528.5). Total num frames: 907807744. Throughput: 0: 4952.6. Samples: 907804416. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:50:57,739][25689] Avg episode reward: [(0, '-1.881')] [2022-07-10 20:50:58,118][26022] Updated weights on worker 0-0, policy_version 886533 (0.00088) [2022-07-10 20:50:59,919][26022] Updated weights on worker 0-0, policy_version 886543 (0.00085) [2022-07-10 20:51:02,080][26022] Updated weights on worker 0-0, policy_version 886553 (0.00085) [2022-07-10 20:51:02,748][25689] Fps is (10 sec: 5487.1, 60 sec: 5478.7, 300 sec: 5518.1). Total num frames: 907832320. Throughput: 0: 5787.9. Samples: 907837918. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:02,749][25689] Avg episode reward: [(0, '-1.985')] [2022-07-10 20:51:03,971][26022] Updated weights on worker 0-0, policy_version 886563 (0.00079) [2022-07-10 20:51:04,430][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:51:04,436][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000886565_907842560.pth [2022-07-10 20:51:04,437][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000884621_905851904.pth [2022-07-10 20:51:05,987][26022] Updated weights on worker 0-0, policy_version 886573 (0.00092) [2022-07-10 20:51:07,753][25689] Fps is (10 sec: 5214.6, 60 sec: 5497.7, 300 sec: 5523.6). Total num frames: 907859968. Throughput: 0: 5689.0. Samples: 907869044. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:07,754][25689] Avg episode reward: [(0, '-2.793')] [2022-07-10 20:51:07,829][26022] Updated weights on worker 0-0, policy_version 886583 (0.00087) [2022-07-10 20:51:09,575][26022] Updated weights on worker 0-0, policy_version 886593 (0.00094) [2022-07-10 20:51:11,582][26022] Updated weights on worker 0-0, policy_version 886603 (0.00085) [2022-07-10 20:51:12,873][25689] Fps is (10 sec: 5562.8, 60 sec: 5512.1, 300 sec: 5525.0). Total num frames: 907888640. Throughput: 0: 4861.6. Samples: 907885582. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:12,873][25689] Avg episode reward: [(0, '-2.293')] [2022-07-10 20:51:13,378][26022] Updated weights on worker 0-0, policy_version 886613 (0.00098) [2022-07-10 20:51:15,284][26022] Updated weights on worker 0-0, policy_version 886623 (0.00092) [2022-07-10 20:51:17,050][26022] Updated weights on worker 0-0, policy_version 886633 (0.00098) [2022-07-10 20:51:17,906][25689] Fps is (10 sec: 5547.4, 60 sec: 5515.4, 300 sec: 5524.6). Total num frames: 907916288. Throughput: 0: 5679.6. Samples: 907918960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:17,906][25689] Avg episode reward: [(0, '-2.368')] [2022-07-10 20:51:18,867][26022] Updated weights on worker 0-0, policy_version 886643 (0.00081) [2022-07-10 20:51:20,703][26022] Updated weights on worker 0-0, policy_version 886653 (0.00086) [2022-07-10 20:51:22,610][26022] Updated weights on worker 0-0, policy_version 886663 (0.00088) [2022-07-10 20:51:22,959][25689] Fps is (10 sec: 5482.1, 60 sec: 5495.1, 300 sec: 5524.6). Total num frames: 907943936. Throughput: 0: 5637.6. Samples: 907951860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:22,960][25689] Avg episode reward: [(0, '-1.466')] [2022-07-10 20:51:24,707][26022] Updated weights on worker 0-0, policy_version 886673 (0.00087) [2022-07-10 20:51:26,368][26022] Updated weights on worker 0-0, policy_version 886683 (0.00093) [2022-07-10 20:51:27,998][25689] Fps is (10 sec: 5479.0, 60 sec: 5511.6, 300 sec: 5519.0). Total num frames: 907971584. Throughput: 0: 4914.3. Samples: 907968534. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:27,999][25689] Avg episode reward: [(0, '-0.518')] [2022-07-10 20:51:28,229][26022] Updated weights on worker 0-0, policy_version 886693 (0.00087) [2022-07-10 20:51:29,979][26022] Updated weights on worker 0-0, policy_version 886703 (0.00088) [2022-07-10 20:51:31,854][26022] Updated weights on worker 0-0, policy_version 886713 (0.00089) [2022-07-10 20:51:33,077][25689] Fps is (10 sec: 5566.4, 60 sec: 5496.0, 300 sec: 5521.2). Total num frames: 908000256. Throughput: 0: 5741.3. Samples: 908001582. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 20:51:33,078][25689] Avg episode reward: [(0, '-0.778')] [2022-07-10 20:51:33,801][26022] Updated weights on worker 0-0, policy_version 886723 (0.00093) [2022-07-10 20:51:35,526][26022] Updated weights on worker 0-0, policy_version 886733 (0.00088) [2022-07-10 20:51:37,492][26022] Updated weights on worker 0-0, policy_version 886743 (0.00086) [2022-07-10 20:51:38,112][25689] Fps is (10 sec: 5568.6, 60 sec: 5493.1, 300 sec: 5520.7). Total num frames: 908027904. Throughput: 0: 5750.1. Samples: 908035148. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:51:38,113][25689] Avg episode reward: [(0, '0.179')] [2022-07-10 20:51:39,071][26022] Updated weights on worker 0-0, policy_version 886753 (0.00086) [2022-07-10 20:51:41,149][26022] Updated weights on worker 0-0, policy_version 886763 (0.00089) [2022-07-10 20:51:42,640][26022] Updated weights on worker 0-0, policy_version 886773 (0.00085) [2022-07-10 20:51:43,181][25689] Fps is (10 sec: 5574.0, 60 sec: 5505.1, 300 sec: 5523.1). Total num frames: 908056576. Throughput: 0: 4946.8. Samples: 908051894. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:51:43,182][25689] Avg episode reward: [(0, '0.137')] [2022-07-10 20:51:44,991][26022] Updated weights on worker 0-0, policy_version 886783 (0.00084) [2022-07-10 20:51:46,563][26022] Updated weights on worker 0-0, policy_version 886793 (0.00098) [2022-07-10 20:51:48,197][25689] Fps is (10 sec: 5482.8, 60 sec: 5489.0, 300 sec: 5520.5). Total num frames: 908083200. Throughput: 0: 5774.2. Samples: 908085168. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:51:48,198][25689] Avg episode reward: [(0, '0.141')] [2022-07-10 20:51:48,508][26022] Updated weights on worker 0-0, policy_version 886803 (0.00094) [2022-07-10 20:51:50,267][26022] Updated weights on worker 0-0, policy_version 886813 (0.00080) [2022-07-10 20:51:52,039][26022] Updated weights on worker 0-0, policy_version 886823 (0.00088) [2022-07-10 20:51:53,264][25689] Fps is (10 sec: 5586.0, 60 sec: 5543.1, 300 sec: 5526.5). Total num frames: 908112896. Throughput: 0: 5808.7. Samples: 908118838. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:51:53,264][25689] Avg episode reward: [(0, '-0.461')] [2022-07-10 20:51:53,777][26022] Updated weights on worker 0-0, policy_version 886833 (0.00090) [2022-07-10 20:51:55,699][26022] Updated weights on worker 0-0, policy_version 886843 (0.00089) [2022-07-10 20:51:57,677][26022] Updated weights on worker 0-0, policy_version 886853 (0.00085) [2022-07-10 20:51:58,288][25689] Fps is (10 sec: 5682.9, 60 sec: 5496.3, 300 sec: 5520.2). Total num frames: 908140544. Throughput: 0: 4980.6. Samples: 908135636. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:51:58,290][25689] Avg episode reward: [(0, '-0.659')] [2022-07-10 20:51:59,309][26022] Updated weights on worker 0-0, policy_version 886863 (0.00094) [2022-07-10 20:52:01,365][26022] Updated weights on worker 0-0, policy_version 886873 (0.00090) [2022-07-10 20:52:03,298][25689] Fps is (10 sec: 5408.4, 60 sec: 5530.1, 300 sec: 5527.1). Total num frames: 908167168. Throughput: 0: 5824.2. Samples: 908169060. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:03,300][25689] Avg episode reward: [(0, '-0.268')] [2022-07-10 20:52:03,395][26022] Updated weights on worker 0-0, policy_version 886883 (0.00083) [2022-07-10 20:52:05,191][26022] Updated weights on worker 0-0, policy_version 886893 (0.00089) [2022-07-10 20:52:07,256][26022] Updated weights on worker 0-0, policy_version 886903 (0.00095) [2022-07-10 20:52:08,304][25689] Fps is (10 sec: 5418.7, 60 sec: 5530.1, 300 sec: 5524.6). Total num frames: 908194816. Throughput: 0: 5740.6. Samples: 908200590. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:08,305][25689] Avg episode reward: [(0, '-1.079')] [2022-07-10 20:52:08,866][26022] Updated weights on worker 0-0, policy_version 886913 (0.00081) [2022-07-10 20:52:10,914][26022] Updated weights on worker 0-0, policy_version 886923 (0.00088) [2022-07-10 20:52:12,729][26022] Updated weights on worker 0-0, policy_version 886933 (0.00086) [2022-07-10 20:52:13,431][25689] Fps is (10 sec: 5457.3, 60 sec: 5512.5, 300 sec: 5518.8). Total num frames: 908222464. Throughput: 0: 4886.9. Samples: 908217394. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:13,431][25689] Avg episode reward: [(0, '-0.913')] [2022-07-10 20:52:14,362][26022] Updated weights on worker 0-0, policy_version 886943 (0.00082) [2022-07-10 20:52:16,382][26022] Updated weights on worker 0-0, policy_version 886953 (0.00091) [2022-07-10 20:52:18,125][26022] Updated weights on worker 0-0, policy_version 886963 (0.00090) [2022-07-10 20:52:18,443][25689] Fps is (10 sec: 5554.8, 60 sec: 5531.3, 300 sec: 5525.9). Total num frames: 908251136. Throughput: 0: 5739.3. Samples: 908251310. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:18,443][25689] Avg episode reward: [(0, '-0.859')] [2022-07-10 20:52:19,943][26022] Updated weights on worker 0-0, policy_version 886973 (0.00076) [2022-07-10 20:52:21,885][26022] Updated weights on worker 0-0, policy_version 886983 (0.00090) [2022-07-10 20:52:23,483][25689] Fps is (10 sec: 5704.7, 60 sec: 5549.4, 300 sec: 5526.0). Total num frames: 908279808. Throughput: 0: 5723.5. Samples: 908284586. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:23,483][25689] Avg episode reward: [(0, '-0.178')] [2022-07-10 20:52:23,611][26022] Updated weights on worker 0-0, policy_version 886993 (0.00091) [2022-07-10 20:52:25,728][26022] Updated weights on worker 0-0, policy_version 887003 (0.00097) [2022-07-10 20:52:27,439][26022] Updated weights on worker 0-0, policy_version 887013 (0.00086) [2022-07-10 20:52:28,505][25689] Fps is (10 sec: 5495.3, 60 sec: 5534.0, 300 sec: 5516.5). Total num frames: 908306432. Throughput: 0: 4972.8. Samples: 908301048. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:28,510][25689] Avg episode reward: [(0, '0.205')] [2022-07-10 20:52:29,234][26022] Updated weights on worker 0-0, policy_version 887023 (0.00089) [2022-07-10 20:52:31,197][26022] Updated weights on worker 0-0, policy_version 887033 (0.00086) [2022-07-10 20:52:32,670][26022] Updated weights on worker 0-0, policy_version 887043 (0.00094) [2022-07-10 20:52:33,553][25689] Fps is (10 sec: 5389.6, 60 sec: 5520.0, 300 sec: 5522.5). Total num frames: 908334080. Throughput: 0: 5817.9. Samples: 908334460. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:33,553][25689] Avg episode reward: [(0, '0.648')] [2022-07-10 20:52:34,689][26022] Updated weights on worker 0-0, policy_version 887053 (0.00090) [2022-07-10 20:52:36,648][26022] Updated weights on worker 0-0, policy_version 887063 (0.00095) [2022-07-10 20:52:38,241][26022] Updated weights on worker 0-0, policy_version 887073 (0.00091) [2022-07-10 20:52:38,558][25689] Fps is (10 sec: 5704.4, 60 sec: 5556.6, 300 sec: 5522.9). Total num frames: 908363776. Throughput: 0: 5799.8. Samples: 908367972. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:38,558][25689] Avg episode reward: [(0, '1.004')] [2022-07-10 20:52:40,459][26022] Updated weights on worker 0-0, policy_version 887083 (0.00093) [2022-07-10 20:52:41,996][26022] Updated weights on worker 0-0, policy_version 887093 (0.00085) [2022-07-10 20:52:43,563][25689] Fps is (10 sec: 5421.8, 60 sec: 5494.7, 300 sec: 5509.2). Total num frames: 908388352. Throughput: 0: 4986.2. Samples: 908384708. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:43,563][25689] Avg episode reward: [(0, '1.046')] [2022-07-10 20:52:44,011][26022] Updated weights on worker 0-0, policy_version 887103 (0.00053) [2022-07-10 20:52:45,655][26022] Updated weights on worker 0-0, policy_version 887113 (0.00081) [2022-07-10 20:52:47,651][26022] Updated weights on worker 0-0, policy_version 887123 (0.00054) [2022-07-10 20:52:48,572][25689] Fps is (10 sec: 5521.5, 60 sec: 5563.1, 300 sec: 5524.1). Total num frames: 908419072. Throughput: 0: 5836.6. Samples: 908418172. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:48,573][25689] Avg episode reward: [(0, '0.921')] [2022-07-10 20:52:49,458][26022] Updated weights on worker 0-0, policy_version 887133 (0.00081) [2022-07-10 20:52:51,406][26022] Updated weights on worker 0-0, policy_version 887143 (0.00088) [2022-07-10 20:52:53,000][26022] Updated weights on worker 0-0, policy_version 887153 (0.00088) [2022-07-10 20:52:53,623][25689] Fps is (10 sec: 5903.8, 60 sec: 5547.6, 300 sec: 5523.2). Total num frames: 908447744. Throughput: 0: 5845.3. Samples: 908451776. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:53,623][25689] Avg episode reward: [(0, '0.777')] [2022-07-10 20:52:55,198][26022] Updated weights on worker 0-0, policy_version 887163 (0.00087) [2022-07-10 20:52:56,649][26022] Updated weights on worker 0-0, policy_version 887173 (0.00089) [2022-07-10 20:52:58,659][25689] Fps is (10 sec: 5279.1, 60 sec: 5495.6, 300 sec: 5512.5). Total num frames: 908472320. Throughput: 0: 4990.6. Samples: 908468290. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:52:58,659][25689] Avg episode reward: [(0, '0.250')] [2022-07-10 20:52:58,919][26022] Updated weights on worker 0-0, policy_version 887183 (0.00083) [2022-07-10 20:53:00,363][26022] Updated weights on worker 0-0, policy_version 887193 (0.00091) [2022-07-10 20:53:02,712][26022] Updated weights on worker 0-0, policy_version 887203 (0.00078) [2022-07-10 20:53:03,663][25689] Fps is (10 sec: 5405.4, 60 sec: 5547.1, 300 sec: 5522.9). Total num frames: 908502016. Throughput: 0: 5778.0. Samples: 908500848. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:03,664][25689] Avg episode reward: [(0, '0.322')] [2022-07-10 20:53:04,569][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:53:04,579][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000887213_908506112.pth [2022-07-10 20:53:04,579][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000885270_906516480.pth [2022-07-10 20:53:04,580][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000887213_908506112.pth.milestone [2022-07-10 20:53:04,583][26022] Updated weights on worker 0-0, policy_version 887213 (0.00094) [2022-07-10 20:53:06,296][26022] Updated weights on worker 0-0, policy_version 887223 (0.00099) [2022-07-10 20:53:08,154][26022] Updated weights on worker 0-0, policy_version 887233 (0.00090) [2022-07-10 20:53:08,667][25689] Fps is (10 sec: 5627.7, 60 sec: 5530.3, 300 sec: 5517.0). Total num frames: 908528640. Throughput: 0: 5755.4. Samples: 908533822. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:08,667][25689] Avg episode reward: [(0, '0.491')] [2022-07-10 20:53:09,888][26022] Updated weights on worker 0-0, policy_version 887243 (0.00089) [2022-07-10 20:53:11,875][26022] Updated weights on worker 0-0, policy_version 887253 (0.00091) [2022-07-10 20:53:13,411][26022] Updated weights on worker 0-0, policy_version 887263 (0.00089) [2022-07-10 20:53:13,767][25689] Fps is (10 sec: 5472.9, 60 sec: 5549.7, 300 sec: 5525.9). Total num frames: 908557312. Throughput: 0: 4903.4. Samples: 908550550. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:13,768][25689] Avg episode reward: [(0, '0.035')] [2022-07-10 20:53:15,490][26022] Updated weights on worker 0-0, policy_version 887273 (0.00083) [2022-07-10 20:53:17,291][26022] Updated weights on worker 0-0, policy_version 887283 (0.00083) [2022-07-10 20:53:18,780][25689] Fps is (10 sec: 5670.1, 60 sec: 5549.6, 300 sec: 5522.4). Total num frames: 908585984. Throughput: 0: 5769.0. Samples: 908584368. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:18,782][25689] Avg episode reward: [(0, '-0.076')] [2022-07-10 20:53:19,007][26022] Updated weights on worker 0-0, policy_version 887293 (0.00103) [2022-07-10 20:53:21,036][26022] Updated weights on worker 0-0, policy_version 887303 (0.00112) [2022-07-10 20:53:22,791][26022] Updated weights on worker 0-0, policy_version 887313 (0.00088) [2022-07-10 20:53:23,793][25689] Fps is (10 sec: 5617.4, 60 sec: 5535.2, 300 sec: 5525.8). Total num frames: 908613632. Throughput: 0: 5812.9. Samples: 908617860. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:23,794][25689] Avg episode reward: [(0, '0.319')] [2022-07-10 20:53:24,701][26022] Updated weights on worker 0-0, policy_version 887323 (0.00086) [2022-07-10 20:53:26,526][26022] Updated weights on worker 0-0, policy_version 887333 (0.00083) [2022-07-10 20:53:28,343][26022] Updated weights on worker 0-0, policy_version 887343 (0.00094) [2022-07-10 20:53:28,816][25689] Fps is (10 sec: 5612.0, 60 sec: 5569.1, 300 sec: 5529.6). Total num frames: 908642304. Throughput: 0: 5807.0. Samples: 908650828. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:28,816][25689] Avg episode reward: [(0, '0.245')] [2022-07-10 20:53:30,255][26022] Updated weights on worker 0-0, policy_version 887353 (0.00096) [2022-07-10 20:53:31,972][26022] Updated weights on worker 0-0, policy_version 887363 (0.00081) [2022-07-10 20:53:33,898][25689] Fps is (10 sec: 5471.7, 60 sec: 5548.8, 300 sec: 5518.6). Total num frames: 908668928. Throughput: 0: 5802.8. Samples: 908667370. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:33,899][25689] Avg episode reward: [(0, '0.135')] [2022-07-10 20:53:33,975][26022] Updated weights on worker 0-0, policy_version 887373 (0.00089) [2022-07-10 20:53:35,693][26022] Updated weights on worker 0-0, policy_version 887383 (0.00248) [2022-07-10 20:53:37,551][26022] Updated weights on worker 0-0, policy_version 887393 (0.00095) [2022-07-10 20:53:38,973][25689] Fps is (10 sec: 5443.9, 60 sec: 5525.5, 300 sec: 5524.7). Total num frames: 908697600. Throughput: 0: 5768.1. Samples: 908700844. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:38,974][25689] Avg episode reward: [(0, '0.301')] [2022-07-10 20:53:39,538][26022] Updated weights on worker 0-0, policy_version 887403 (0.00082) [2022-07-10 20:53:41,287][26022] Updated weights on worker 0-0, policy_version 887413 (0.00090) [2022-07-10 20:53:43,236][26022] Updated weights on worker 0-0, policy_version 887423 (0.00087) [2022-07-10 20:53:43,985][25689] Fps is (10 sec: 5685.3, 60 sec: 5592.6, 300 sec: 5524.6). Total num frames: 908726272. Throughput: 0: 5766.7. Samples: 908734302. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:43,985][25689] Avg episode reward: [(0, '0.558')] [2022-07-10 20:53:44,979][26022] Updated weights on worker 0-0, policy_version 887433 (0.00083) [2022-07-10 20:53:46,834][26022] Updated weights on worker 0-0, policy_version 887443 (0.00087) [2022-07-10 20:53:48,587][26022] Updated weights on worker 0-0, policy_version 887453 (0.00100) [2022-07-10 20:53:49,018][25689] Fps is (10 sec: 5607.1, 60 sec: 5539.7, 300 sec: 5526.2). Total num frames: 908753920. Throughput: 0: 4961.1. Samples: 908751052. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:49,018][25689] Avg episode reward: [(0, '-0.393')] [2022-07-10 20:53:50,584][26022] Updated weights on worker 0-0, policy_version 887463 (0.00090) [2022-07-10 20:53:52,372][26022] Updated weights on worker 0-0, policy_version 887473 (0.00090) [2022-07-10 20:53:54,154][25689] Fps is (10 sec: 5437.7, 60 sec: 5514.9, 300 sec: 5524.0). Total num frames: 908781568. Throughput: 0: 5767.2. Samples: 908784186. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:54,154][25689] Avg episode reward: [(0, '0.141')] [2022-07-10 20:53:54,359][26022] Updated weights on worker 0-0, policy_version 887483 (0.00094) [2022-07-10 20:53:56,170][26022] Updated weights on worker 0-0, policy_version 887493 (0.00093) [2022-07-10 20:53:57,964][26022] Updated weights on worker 0-0, policy_version 887503 (0.00082) [2022-07-10 20:53:59,186][25689] Fps is (10 sec: 5438.2, 60 sec: 5566.1, 300 sec: 5521.6). Total num frames: 908809216. Throughput: 0: 5759.2. Samples: 908817252. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:53:59,186][25689] Avg episode reward: [(0, '-0.013')] [2022-07-10 20:53:59,729][26022] Updated weights on worker 0-0, policy_version 887513 (0.00098) [2022-07-10 20:54:02,076][26022] Updated weights on worker 0-0, policy_version 887523 (0.00086) [2022-07-10 20:54:03,899][26022] Updated weights on worker 0-0, policy_version 887533 (0.00094) [2022-07-10 20:54:04,239][25689] Fps is (10 sec: 5280.0, 60 sec: 5494.0, 300 sec: 5524.5). Total num frames: 908834816. Throughput: 0: 4823.6. Samples: 908831998. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:04,239][25689] Avg episode reward: [(0, '-2.222')] [2022-07-10 20:54:05,618][26022] Updated weights on worker 0-0, policy_version 887543 (0.00086) [2022-07-10 20:54:07,511][26022] Updated weights on worker 0-0, policy_version 887553 (0.00097) [2022-07-10 20:54:09,254][25689] Fps is (10 sec: 5390.3, 60 sec: 5526.7, 300 sec: 5519.8). Total num frames: 908863488. Throughput: 0: 5665.4. Samples: 908865700. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:09,255][25689] Avg episode reward: [(0, '-1.240')] [2022-07-10 20:54:09,375][26022] Updated weights on worker 0-0, policy_version 887563 (0.00092) [2022-07-10 20:54:11,167][26022] Updated weights on worker 0-0, policy_version 887573 (0.00085) [2022-07-10 20:54:12,981][26022] Updated weights on worker 0-0, policy_version 887583 (0.00109) [2022-07-10 20:54:14,298][25689] Fps is (10 sec: 5598.6, 60 sec: 5514.9, 300 sec: 5520.7). Total num frames: 908891136. Throughput: 0: 5727.4. Samples: 908899562. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:14,299][25689] Avg episode reward: [(0, '-1.163')] [2022-07-10 20:54:14,751][26022] Updated weights on worker 0-0, policy_version 887593 (0.00085) [2022-07-10 20:54:16,485][26022] Updated weights on worker 0-0, policy_version 887603 (0.00089) [2022-07-10 20:54:18,578][26022] Updated weights on worker 0-0, policy_version 887613 (0.00090) [2022-07-10 20:54:19,341][25689] Fps is (10 sec: 5583.7, 60 sec: 5512.2, 300 sec: 5523.8). Total num frames: 908919808. Throughput: 0: 4927.9. Samples: 908916574. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:19,342][25689] Avg episode reward: [(0, '-1.049')] [2022-07-10 20:54:20,173][26022] Updated weights on worker 0-0, policy_version 887623 (0.00089) [2022-07-10 20:54:22,105][26022] Updated weights on worker 0-0, policy_version 887633 (0.00094) [2022-07-10 20:54:23,707][26022] Updated weights on worker 0-0, policy_version 887643 (0.00103) [2022-07-10 20:54:24,377][25689] Fps is (10 sec: 5689.6, 60 sec: 5527.0, 300 sec: 5524.6). Total num frames: 908948480. Throughput: 0: 5869.7. Samples: 908950204. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:24,379][25689] Avg episode reward: [(0, '-1.323')] [2022-07-10 20:54:25,659][26022] Updated weights on worker 0-0, policy_version 887653 (0.00085) [2022-07-10 20:54:27,486][26022] Updated weights on worker 0-0, policy_version 887663 (0.00095) [2022-07-10 20:54:29,381][26022] Updated weights on worker 0-0, policy_version 887673 (0.00091) [2022-07-10 20:54:29,391][25689] Fps is (10 sec: 5604.0, 60 sec: 5510.9, 300 sec: 5529.1). Total num frames: 908976128. Throughput: 0: 5857.2. Samples: 908983644. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:29,391][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 20:54:31,243][26022] Updated weights on worker 0-0, policy_version 887683 (0.00097) [2022-07-10 20:54:33,121][26022] Updated weights on worker 0-0, policy_version 887693 (0.00087) [2022-07-10 20:54:34,477][25689] Fps is (10 sec: 5474.8, 60 sec: 5527.5, 300 sec: 5520.7). Total num frames: 909003776. Throughput: 0: 4974.4. Samples: 908999940. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:34,479][25689] Avg episode reward: [(0, '-1.494')] [2022-07-10 20:54:34,931][26022] Updated weights on worker 0-0, policy_version 887703 (0.00080) [2022-07-10 20:54:36,687][26022] Updated weights on worker 0-0, policy_version 887713 (0.00091) [2022-07-10 20:54:38,511][26022] Updated weights on worker 0-0, policy_version 887723 (0.00086) [2022-07-10 20:54:39,509][25689] Fps is (10 sec: 5566.1, 60 sec: 5531.4, 300 sec: 5524.4). Total num frames: 909032448. Throughput: 0: 5813.2. Samples: 909033818. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:39,510][25689] Avg episode reward: [(0, '0.267')] [2022-07-10 20:54:40,424][26022] Updated weights on worker 0-0, policy_version 887733 (0.00092) [2022-07-10 20:54:42,285][26022] Updated weights on worker 0-0, policy_version 887743 (0.00082) [2022-07-10 20:54:43,985][26022] Updated weights on worker 0-0, policy_version 887753 (0.00088) [2022-07-10 20:54:44,532][25689] Fps is (10 sec: 5703.2, 60 sec: 5530.4, 300 sec: 5528.9). Total num frames: 909061120. Throughput: 0: 5820.5. Samples: 909067516. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:44,532][25689] Avg episode reward: [(0, '-0.437')] [2022-07-10 20:54:45,912][26022] Updated weights on worker 0-0, policy_version 887763 (0.00088) [2022-07-10 20:54:47,638][26022] Updated weights on worker 0-0, policy_version 887773 (0.00086) [2022-07-10 20:54:49,551][25689] Fps is (10 sec: 5608.7, 60 sec: 5531.7, 300 sec: 5525.9). Total num frames: 909088768. Throughput: 0: 5007.1. Samples: 909084588. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:49,551][25689] Avg episode reward: [(0, '-1.009')] [2022-07-10 20:54:49,641][26022] Updated weights on worker 0-0, policy_version 887783 (0.00087) [2022-07-10 20:54:51,281][26022] Updated weights on worker 0-0, policy_version 887793 (0.00089) [2022-07-10 20:54:53,289][26022] Updated weights on worker 0-0, policy_version 887803 (0.00086) [2022-07-10 20:54:54,622][25689] Fps is (10 sec: 5683.2, 60 sec: 5571.5, 300 sec: 5531.9). Total num frames: 909118464. Throughput: 0: 5872.8. Samples: 909118246. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:54,625][25689] Avg episode reward: [(0, '-1.536')] [2022-07-10 20:54:54,796][26022] Updated weights on worker 0-0, policy_version 887813 (0.00590) [2022-07-10 20:54:56,981][26022] Updated weights on worker 0-0, policy_version 887823 (0.00085) [2022-07-10 20:54:58,737][26022] Updated weights on worker 0-0, policy_version 887833 (0.00086) [2022-07-10 20:54:59,635][25689] Fps is (10 sec: 5686.6, 60 sec: 5573.2, 300 sec: 5532.4). Total num frames: 909146112. Throughput: 0: 5867.0. Samples: 909151894. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:54:59,635][25689] Avg episode reward: [(0, '-1.377')] [2022-07-10 20:55:00,624][26022] Updated weights on worker 0-0, policy_version 887843 (0.00093) [2022-07-10 20:55:02,616][26022] Updated weights on worker 0-0, policy_version 887853 (0.00094) [2022-07-10 20:55:04,632][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:55:04,643][25689] Fps is (10 sec: 5211.6, 60 sec: 5560.4, 300 sec: 5525.8). Total num frames: 909170688. Throughput: 0: 5015.1. Samples: 909168374. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:55:04,643][25689] Avg episode reward: [(0, '-1.649')] [2022-07-10 20:55:04,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000887863_909171712.pth [2022-07-10 20:55:04,646][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000885918_907180032.pth [2022-07-10 20:55:04,650][26022] Updated weights on worker 0-0, policy_version 887863 (0.00096) [2022-07-10 20:55:06,383][26022] Updated weights on worker 0-0, policy_version 887873 (0.00088) [2022-07-10 20:55:08,290][26022] Updated weights on worker 0-0, policy_version 887883 (0.00090) [2022-07-10 20:55:09,648][25689] Fps is (10 sec: 5419.9, 60 sec: 5578.3, 300 sec: 5534.3). Total num frames: 909200384. Throughput: 0: 5738.5. Samples: 909199918. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:55:09,649][25689] Avg episode reward: [(0, '-1.651')] [2022-07-10 20:55:09,936][26022] Updated weights on worker 0-0, policy_version 887893 (0.00090) [2022-07-10 20:55:12,178][26022] Updated weights on worker 0-0, policy_version 887903 (0.00258) [2022-07-10 20:55:13,602][26022] Updated weights on worker 0-0, policy_version 887913 (0.00082) [2022-07-10 20:55:14,710][25689] Fps is (10 sec: 5594.5, 60 sec: 5559.7, 300 sec: 5531.0). Total num frames: 909227008. Throughput: 0: 5732.5. Samples: 909233400. Policy #0 lag: (min: 0.0, avg: 6.4, max: 19.0) [2022-07-10 20:55:14,710][25689] Avg episode reward: [(0, '-1.962')] [2022-07-10 20:55:15,607][26022] Updated weights on worker 0-0, policy_version 887923 (0.00081) [2022-07-10 20:55:17,485][26022] Updated weights on worker 0-0, policy_version 887933 (0.00087) [2022-07-10 20:55:19,299][26022] Updated weights on worker 0-0, policy_version 887943 (0.00088) [2022-07-10 20:55:19,720][25689] Fps is (10 sec: 5490.3, 60 sec: 5562.7, 300 sec: 5531.2). Total num frames: 909255680. Throughput: 0: 4888.7. Samples: 909250084. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:19,720][25689] Avg episode reward: [(0, '-0.997')] [2022-07-10 20:55:20,988][26022] Updated weights on worker 0-0, policy_version 887953 (0.00090) [2022-07-10 20:55:23,017][26022] Updated weights on worker 0-0, policy_version 887963 (0.00083) [2022-07-10 20:55:24,724][25689] Fps is (10 sec: 5624.2, 60 sec: 5548.8, 300 sec: 5535.2). Total num frames: 909283328. Throughput: 0: 5730.6. Samples: 909283450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:24,724][25689] Avg episode reward: [(0, '-0.441')] [2022-07-10 20:55:24,778][26022] Updated weights on worker 0-0, policy_version 887973 (0.00084) [2022-07-10 20:55:26,792][26022] Updated weights on worker 0-0, policy_version 887983 (0.00091) [2022-07-10 20:55:28,505][26022] Updated weights on worker 0-0, policy_version 887993 (0.00095) [2022-07-10 20:55:29,728][25689] Fps is (10 sec: 5524.9, 60 sec: 5549.6, 300 sec: 5529.9). Total num frames: 909310976. Throughput: 0: 5804.3. Samples: 909316468. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:29,729][25689] Avg episode reward: [(0, '0.012')] [2022-07-10 20:55:30,546][26022] Updated weights on worker 0-0, policy_version 888003 (0.00090) [2022-07-10 20:55:32,139][26022] Updated weights on worker 0-0, policy_version 888013 (0.00093) [2022-07-10 20:55:34,154][26022] Updated weights on worker 0-0, policy_version 888023 (0.00085) [2022-07-10 20:55:34,783][25689] Fps is (10 sec: 5497.1, 60 sec: 5552.5, 300 sec: 5529.0). Total num frames: 909338624. Throughput: 0: 4971.4. Samples: 909333190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:34,783][25689] Avg episode reward: [(0, '0.022')] [2022-07-10 20:55:35,775][26022] Updated weights on worker 0-0, policy_version 888033 (0.00086) [2022-07-10 20:55:37,816][26022] Updated weights on worker 0-0, policy_version 888043 (0.00096) [2022-07-10 20:55:39,596][26022] Updated weights on worker 0-0, policy_version 888053 (0.00092) [2022-07-10 20:55:39,827][25689] Fps is (10 sec: 5678.1, 60 sec: 5568.3, 300 sec: 5535.3). Total num frames: 909368320. Throughput: 0: 5798.3. Samples: 909366674. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:39,828][25689] Avg episode reward: [(0, '0.459')] [2022-07-10 20:55:41,461][26022] Updated weights on worker 0-0, policy_version 888063 (0.00082) [2022-07-10 20:55:43,175][26022] Updated weights on worker 0-0, policy_version 888073 (0.00087) [2022-07-10 20:55:44,833][25689] Fps is (10 sec: 5603.6, 60 sec: 5535.9, 300 sec: 5532.2). Total num frames: 909394944. Throughput: 0: 5832.3. Samples: 909400736. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:44,834][25689] Avg episode reward: [(0, '0.998')] [2022-07-10 20:55:44,954][26022] Updated weights on worker 0-0, policy_version 888083 (0.00091) [2022-07-10 20:55:46,888][26022] Updated weights on worker 0-0, policy_version 888093 (0.00097) [2022-07-10 20:55:48,606][26022] Updated weights on worker 0-0, policy_version 888103 (0.00084) [2022-07-10 20:55:49,853][25689] Fps is (10 sec: 5413.3, 60 sec: 5535.8, 300 sec: 5537.2). Total num frames: 909422592. Throughput: 0: 5023.1. Samples: 909417558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:49,854][25689] Avg episode reward: [(0, '1.080')] [2022-07-10 20:55:50,459][26022] Updated weights on worker 0-0, policy_version 888113 (0.00080) [2022-07-10 20:55:52,439][26022] Updated weights on worker 0-0, policy_version 888123 (0.00097) [2022-07-10 20:55:54,119][26022] Updated weights on worker 0-0, policy_version 888133 (0.00086) [2022-07-10 20:55:54,990][25689] Fps is (10 sec: 5746.7, 60 sec: 5546.8, 300 sec: 5535.9). Total num frames: 909453312. Throughput: 0: 5849.5. Samples: 909451394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:55:54,995][25689] Avg episode reward: [(0, '1.304')] [2022-07-10 20:55:55,956][26022] Updated weights on worker 0-0, policy_version 888143 (0.00085) [2022-07-10 20:55:57,743][26022] Updated weights on worker 0-0, policy_version 888153 (0.00087) [2022-07-10 20:55:59,933][26022] Updated weights on worker 0-0, policy_version 888163 (0.00087) [2022-07-10 20:56:00,026][25689] Fps is (10 sec: 5536.0, 60 sec: 5510.7, 300 sec: 5538.9). Total num frames: 909478912. Throughput: 0: 5841.5. Samples: 909484666. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:00,027][25689] Avg episode reward: [(0, '0.788')] [2022-07-10 20:56:01,470][26022] Updated weights on worker 0-0, policy_version 888173 (0.00086) [2022-07-10 20:56:03,695][26022] Updated weights on worker 0-0, policy_version 888183 (0.00086) [2022-07-10 20:56:05,085][25689] Fps is (10 sec: 5274.9, 60 sec: 5556.9, 300 sec: 5537.9). Total num frames: 909506560. Throughput: 0: 5701.6. Samples: 909516202. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:05,085][25689] Avg episode reward: [(0, '-0.981')] [2022-07-10 20:56:05,447][26022] Updated weights on worker 0-0, policy_version 888193 (0.00087) [2022-07-10 20:56:07,238][26022] Updated weights on worker 0-0, policy_version 888203 (0.00090) [2022-07-10 20:56:09,323][26022] Updated weights on worker 0-0, policy_version 888213 (0.00094) [2022-07-10 20:56:10,146][25689] Fps is (10 sec: 5464.2, 60 sec: 5517.9, 300 sec: 5535.5). Total num frames: 909534208. Throughput: 0: 5693.7. Samples: 909533100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:10,147][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 20:56:11,019][26022] Updated weights on worker 0-0, policy_version 888223 (0.00094) [2022-07-10 20:56:13,009][26022] Updated weights on worker 0-0, policy_version 888233 (0.00096) [2022-07-10 20:56:14,735][26022] Updated weights on worker 0-0, policy_version 888243 (0.00093) [2022-07-10 20:56:15,236][25689] Fps is (10 sec: 5548.2, 60 sec: 5549.2, 300 sec: 5537.9). Total num frames: 909562880. Throughput: 0: 5685.3. Samples: 909566496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:15,236][25689] Avg episode reward: [(0, '-1.224')] [2022-07-10 20:56:16,614][26022] Updated weights on worker 0-0, policy_version 888253 (0.00090) [2022-07-10 20:56:18,788][26022] Updated weights on worker 0-0, policy_version 888263 (0.00088) [2022-07-10 20:56:20,190][26022] Updated weights on worker 0-0, policy_version 888273 (0.00090) [2022-07-10 20:56:20,241][25689] Fps is (10 sec: 5781.8, 60 sec: 5566.5, 300 sec: 5545.7). Total num frames: 909592576. Throughput: 0: 5705.2. Samples: 909599996. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:20,242][25689] Avg episode reward: [(0, '-2.849')] [2022-07-10 20:56:22,242][26022] Updated weights on worker 0-0, policy_version 888283 (0.00087) [2022-07-10 20:56:23,974][26022] Updated weights on worker 0-0, policy_version 888293 (0.00082) [2022-07-10 20:56:25,303][25689] Fps is (10 sec: 5492.5, 60 sec: 5527.4, 300 sec: 5538.4). Total num frames: 909618176. Throughput: 0: 4970.8. Samples: 909616704. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:25,304][25689] Avg episode reward: [(0, '-2.416')] [2022-07-10 20:56:25,714][26022] Updated weights on worker 0-0, policy_version 888303 (0.00085) [2022-07-10 20:56:27,872][26022] Updated weights on worker 0-0, policy_version 888313 (0.00084) [2022-07-10 20:56:29,540][26022] Updated weights on worker 0-0, policy_version 888323 (0.00087) [2022-07-10 20:56:30,398][25689] Fps is (10 sec: 5343.2, 60 sec: 5536.0, 300 sec: 5538.1). Total num frames: 909646848. Throughput: 0: 5758.3. Samples: 909649718. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:30,400][25689] Avg episode reward: [(0, '-1.801')] [2022-07-10 20:56:31,509][26022] Updated weights on worker 0-0, policy_version 888333 (0.00101) [2022-07-10 20:56:33,128][26022] Updated weights on worker 0-0, policy_version 888343 (0.00082) [2022-07-10 20:56:35,091][26022] Updated weights on worker 0-0, policy_version 888353 (0.00114) [2022-07-10 20:56:35,471][25689] Fps is (10 sec: 5539.1, 60 sec: 5534.4, 300 sec: 5537.4). Total num frames: 909674496. Throughput: 0: 5759.9. Samples: 909683050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:35,471][25689] Avg episode reward: [(0, '-1.680')] [2022-07-10 20:56:37,001][26022] Updated weights on worker 0-0, policy_version 888363 (0.00091) [2022-07-10 20:56:38,770][26022] Updated weights on worker 0-0, policy_version 888373 (0.00091) [2022-07-10 20:56:40,471][26022] Updated weights on worker 0-0, policy_version 888383 (0.00085) [2022-07-10 20:56:40,569][25689] Fps is (10 sec: 5638.0, 60 sec: 5529.5, 300 sec: 5540.3). Total num frames: 909704192. Throughput: 0: 4912.2. Samples: 909699852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:40,570][25689] Avg episode reward: [(0, '-1.765')] [2022-07-10 20:56:42,486][26022] Updated weights on worker 0-0, policy_version 888393 (0.00092) [2022-07-10 20:56:44,004][26022] Updated weights on worker 0-0, policy_version 888403 (0.00089) [2022-07-10 20:56:45,608][25689] Fps is (10 sec: 5656.8, 60 sec: 5543.3, 300 sec: 5543.3). Total num frames: 909731840. Throughput: 0: 5748.2. Samples: 909733422. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:45,609][25689] Avg episode reward: [(0, '-2.653')] [2022-07-10 20:56:46,198][26022] Updated weights on worker 0-0, policy_version 888413 (0.00094) [2022-07-10 20:56:47,763][26022] Updated weights on worker 0-0, policy_version 888423 (0.00093) [2022-07-10 20:56:49,851][26022] Updated weights on worker 0-0, policy_version 888433 (0.00093) [2022-07-10 20:56:50,628][25689] Fps is (10 sec: 5395.6, 60 sec: 5526.5, 300 sec: 5533.8). Total num frames: 909758464. Throughput: 0: 5792.7. Samples: 909766902. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:50,630][25689] Avg episode reward: [(0, '-0.883')] [2022-07-10 20:56:51,691][26022] Updated weights on worker 0-0, policy_version 888443 (0.00090) [2022-07-10 20:56:53,733][26022] Updated weights on worker 0-0, policy_version 888453 (0.00091) [2022-07-10 20:56:55,255][26022] Updated weights on worker 0-0, policy_version 888463 (0.00080) [2022-07-10 20:56:55,767][25689] Fps is (10 sec: 5644.8, 60 sec: 5526.3, 300 sec: 5542.0). Total num frames: 909789184. Throughput: 0: 4947.6. Samples: 909783466. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:56:55,767][25689] Avg episode reward: [(0, '-2.231')] [2022-07-10 20:56:57,316][26022] Updated weights on worker 0-0, policy_version 888473 (0.00092) [2022-07-10 20:56:58,835][26022] Updated weights on worker 0-0, policy_version 888483 (0.00089) [2022-07-10 20:57:00,780][25689] Fps is (10 sec: 5547.7, 60 sec: 5528.4, 300 sec: 5538.5). Total num frames: 909814784. Throughput: 0: 5787.2. Samples: 909816816. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:00,781][25689] Avg episode reward: [(0, '-3.574')] [2022-07-10 20:57:01,098][26022] Updated weights on worker 0-0, policy_version 888493 (0.00078) [2022-07-10 20:57:02,953][26022] Updated weights on worker 0-0, policy_version 888503 (0.00090) [2022-07-10 20:57:04,873][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:57:04,885][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000888512_909836288.pth [2022-07-10 20:57:04,887][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000886565_907842560.pth [2022-07-10 20:57:05,175][26022] Updated weights on worker 0-0, policy_version 888513 (0.00096) [2022-07-10 20:57:05,840][25689] Fps is (10 sec: 5286.4, 60 sec: 5528.3, 300 sec: 5537.5). Total num frames: 909842432. Throughput: 0: 5675.0. Samples: 909848234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:05,848][25689] Avg episode reward: [(0, '-2.750')] [2022-07-10 20:57:06,759][26022] Updated weights on worker 0-0, policy_version 888523 (0.00080) [2022-07-10 20:57:08,710][26022] Updated weights on worker 0-0, policy_version 888533 (0.00079) [2022-07-10 20:57:10,388][26022] Updated weights on worker 0-0, policy_version 888543 (0.00085) [2022-07-10 20:57:10,876][25689] Fps is (10 sec: 5578.8, 60 sec: 5547.5, 300 sec: 5542.7). Total num frames: 909871104. Throughput: 0: 4855.1. Samples: 909865206. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:10,877][25689] Avg episode reward: [(0, '-2.370')] [2022-07-10 20:57:12,453][26022] Updated weights on worker 0-0, policy_version 888553 (0.00088) [2022-07-10 20:57:13,970][26022] Updated weights on worker 0-0, policy_version 888563 (0.00089) [2022-07-10 20:57:16,024][25689] Fps is (10 sec: 5328.9, 60 sec: 5491.6, 300 sec: 5529.8). Total num frames: 909896704. Throughput: 0: 5686.8. Samples: 909898664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:16,025][25689] Avg episode reward: [(0, '-1.492')] [2022-07-10 20:57:16,154][26022] Updated weights on worker 0-0, policy_version 888573 (0.00293) [2022-07-10 20:57:17,610][26022] Updated weights on worker 0-0, policy_version 888583 (0.00090) [2022-07-10 20:57:19,795][26022] Updated weights on worker 0-0, policy_version 888593 (0.00087) [2022-07-10 20:57:21,051][25689] Fps is (10 sec: 5534.8, 60 sec: 5506.5, 300 sec: 5536.9). Total num frames: 909927424. Throughput: 0: 5704.3. Samples: 909932448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:21,052][25689] Avg episode reward: [(0, '-1.310')] [2022-07-10 20:57:21,231][26022] Updated weights on worker 0-0, policy_version 888603 (0.00089) [2022-07-10 20:57:23,366][26022] Updated weights on worker 0-0, policy_version 888613 (0.00086) [2022-07-10 20:57:25,026][26022] Updated weights on worker 0-0, policy_version 888623 (0.00089) [2022-07-10 20:57:26,053][25689] Fps is (10 sec: 5819.9, 60 sec: 5545.6, 300 sec: 5540.8). Total num frames: 909955072. Throughput: 0: 5001.4. Samples: 909949330. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:26,054][25689] Avg episode reward: [(0, '-0.046')] [2022-07-10 20:57:27,135][26022] Updated weights on worker 0-0, policy_version 888633 (0.00092) [2022-07-10 20:57:28,510][26022] Updated weights on worker 0-0, policy_version 888643 (0.00099) [2022-07-10 20:57:30,713][26022] Updated weights on worker 0-0, policy_version 888653 (0.00091) [2022-07-10 20:57:31,056][25689] Fps is (10 sec: 5527.0, 60 sec: 5537.2, 300 sec: 5541.6). Total num frames: 909982720. Throughput: 0: 5825.2. Samples: 909982760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:31,056][25689] Avg episode reward: [(0, '0.932')] [2022-07-10 20:57:32,440][26022] Updated weights on worker 0-0, policy_version 888663 (0.00087) [2022-07-10 20:57:34,351][26022] Updated weights on worker 0-0, policy_version 888673 (0.00084) [2022-07-10 20:57:36,117][25689] Fps is (10 sec: 5494.3, 60 sec: 5538.2, 300 sec: 5533.7). Total num frames: 910010368. Throughput: 0: 5865.8. Samples: 910016528. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:36,118][25689] Avg episode reward: [(0, '0.728')] [2022-07-10 20:57:36,205][26022] Updated weights on worker 0-0, policy_version 888683 (0.00092) [2022-07-10 20:57:37,895][26022] Updated weights on worker 0-0, policy_version 888693 (0.00087) [2022-07-10 20:57:39,763][26022] Updated weights on worker 0-0, policy_version 888703 (0.00086) [2022-07-10 20:57:41,125][25689] Fps is (10 sec: 5593.4, 60 sec: 5529.6, 300 sec: 5547.4). Total num frames: 910039040. Throughput: 0: 5037.8. Samples: 910033576. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:41,125][25689] Avg episode reward: [(0, '0.743')] [2022-07-10 20:57:41,507][26022] Updated weights on worker 0-0, policy_version 888713 (0.00100) [2022-07-10 20:57:43,436][26022] Updated weights on worker 0-0, policy_version 888723 (0.00088) [2022-07-10 20:57:45,231][26022] Updated weights on worker 0-0, policy_version 888733 (0.00089) [2022-07-10 20:57:46,141][25689] Fps is (10 sec: 5823.2, 60 sec: 5565.5, 300 sec: 5543.8). Total num frames: 910068736. Throughput: 0: 5863.9. Samples: 910067122. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:46,141][25689] Avg episode reward: [(0, '0.836')] [2022-07-10 20:57:47,051][26022] Updated weights on worker 0-0, policy_version 888743 (0.00098) [2022-07-10 20:57:48,893][26022] Updated weights on worker 0-0, policy_version 888753 (0.00084) [2022-07-10 20:57:50,853][26022] Updated weights on worker 0-0, policy_version 888763 (0.00090) [2022-07-10 20:57:51,151][25689] Fps is (10 sec: 5617.5, 60 sec: 5566.4, 300 sec: 5537.7). Total num frames: 910095360. Throughput: 0: 5866.7. Samples: 910100650. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:51,151][25689] Avg episode reward: [(0, '0.941')] [2022-07-10 20:57:52,601][26022] Updated weights on worker 0-0, policy_version 888773 (0.00085) [2022-07-10 20:57:54,446][26022] Updated weights on worker 0-0, policy_version 888783 (0.00091) [2022-07-10 20:57:56,019][26022] Updated weights on worker 0-0, policy_version 888793 (0.00098) [2022-07-10 20:57:56,232][25689] Fps is (10 sec: 5479.7, 60 sec: 5537.9, 300 sec: 5550.6). Total num frames: 910124032. Throughput: 0: 5026.2. Samples: 910117628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:57:56,232][25689] Avg episode reward: [(0, '0.992')] [2022-07-10 20:57:58,150][26022] Updated weights on worker 0-0, policy_version 888803 (0.00090) [2022-07-10 20:57:59,812][26022] Updated weights on worker 0-0, policy_version 888813 (0.00093) [2022-07-10 20:58:01,243][25689] Fps is (10 sec: 5479.0, 60 sec: 5555.0, 300 sec: 5540.2). Total num frames: 910150656. Throughput: 0: 5833.6. Samples: 910150938. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:01,244][25689] Avg episode reward: [(0, '1.190')] [2022-07-10 20:58:02,160][26022] Updated weights on worker 0-0, policy_version 888823 (0.00089) [2022-07-10 20:58:03,844][26022] Updated weights on worker 0-0, policy_version 888833 (0.00090) [2022-07-10 20:58:05,751][26022] Updated weights on worker 0-0, policy_version 888843 (0.00089) [2022-07-10 20:58:06,246][25689] Fps is (10 sec: 5317.4, 60 sec: 5543.3, 300 sec: 5540.2). Total num frames: 910177280. Throughput: 0: 5741.0. Samples: 910182546. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:06,247][25689] Avg episode reward: [(0, '1.047')] [2022-07-10 20:58:07,488][26022] Updated weights on worker 0-0, policy_version 888853 (0.00087) [2022-07-10 20:58:09,571][26022] Updated weights on worker 0-0, policy_version 888863 (0.00089) [2022-07-10 20:58:11,251][25689] Fps is (10 sec: 5422.9, 60 sec: 5529.1, 300 sec: 5538.5). Total num frames: 910204928. Throughput: 0: 4903.1. Samples: 910199204. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:11,252][25689] Avg episode reward: [(0, '1.263')] [2022-07-10 20:58:11,336][26022] Updated weights on worker 0-0, policy_version 888873 (0.00095) [2022-07-10 20:58:13,094][26022] Updated weights on worker 0-0, policy_version 888883 (0.00090) [2022-07-10 20:58:14,985][26022] Updated weights on worker 0-0, policy_version 888893 (0.00087) [2022-07-10 20:58:16,396][25689] Fps is (10 sec: 5548.5, 60 sec: 5580.3, 300 sec: 5536.0). Total num frames: 910233600. Throughput: 0: 5702.6. Samples: 910232618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:16,397][25689] Avg episode reward: [(0, '1.137')] [2022-07-10 20:58:16,690][26022] Updated weights on worker 0-0, policy_version 888903 (0.00096) [2022-07-10 20:58:18,544][26022] Updated weights on worker 0-0, policy_version 888913 (0.00096) [2022-07-10 20:58:20,316][26022] Updated weights on worker 0-0, policy_version 888923 (0.00082) [2022-07-10 20:58:21,407][25689] Fps is (10 sec: 5747.4, 60 sec: 5564.9, 300 sec: 5543.0). Total num frames: 910263296. Throughput: 0: 5717.9. Samples: 910266230. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:21,408][25689] Avg episode reward: [(0, '1.273')] [2022-07-10 20:58:22,611][26022] Updated weights on worker 0-0, policy_version 888933 (0.00091) [2022-07-10 20:58:23,906][26022] Updated weights on worker 0-0, policy_version 888943 (0.00086) [2022-07-10 20:58:26,110][26022] Updated weights on worker 0-0, policy_version 888953 (0.00106) [2022-07-10 20:58:26,423][25689] Fps is (10 sec: 5515.1, 60 sec: 5529.7, 300 sec: 5532.8). Total num frames: 910288896. Throughput: 0: 4982.6. Samples: 910283078. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:26,424][25689] Avg episode reward: [(0, '1.236')] [2022-07-10 20:58:27,615][26022] Updated weights on worker 0-0, policy_version 888963 (0.00091) [2022-07-10 20:58:29,953][26022] Updated weights on worker 0-0, policy_version 888973 (0.00090) [2022-07-10 20:58:31,433][25689] Fps is (10 sec: 5412.7, 60 sec: 5545.9, 300 sec: 5541.0). Total num frames: 910317568. Throughput: 0: 5803.7. Samples: 910316336. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:31,434][25689] Avg episode reward: [(0, '1.258')] [2022-07-10 20:58:31,556][26022] Updated weights on worker 0-0, policy_version 888983 (0.00087) [2022-07-10 20:58:33,629][26022] Updated weights on worker 0-0, policy_version 888993 (0.00087) [2022-07-10 20:58:35,215][26022] Updated weights on worker 0-0, policy_version 889003 (0.00085) [2022-07-10 20:58:36,560][25689] Fps is (10 sec: 5555.6, 60 sec: 5539.9, 300 sec: 5536.6). Total num frames: 910345216. Throughput: 0: 5810.0. Samples: 910349770. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:36,561][25689] Avg episode reward: [(0, '1.074')] [2022-07-10 20:58:37,083][26022] Updated weights on worker 0-0, policy_version 889013 (0.00086) [2022-07-10 20:58:38,857][26022] Updated weights on worker 0-0, policy_version 889023 (0.00626) [2022-07-10 20:58:40,838][26022] Updated weights on worker 0-0, policy_version 889033 (0.00093) [2022-07-10 20:58:41,582][25689] Fps is (10 sec: 5549.2, 60 sec: 5538.6, 300 sec: 5536.4). Total num frames: 910373888. Throughput: 0: 5786.7. Samples: 910382982. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:41,583][25689] Avg episode reward: [(0, '0.828')] [2022-07-10 20:58:42,538][26022] Updated weights on worker 0-0, policy_version 889043 (0.00094) [2022-07-10 20:58:44,391][26022] Updated weights on worker 0-0, policy_version 889053 (0.00082) [2022-07-10 20:58:46,229][26022] Updated weights on worker 0-0, policy_version 889063 (0.00091) [2022-07-10 20:58:46,600][25689] Fps is (10 sec: 5609.8, 60 sec: 5504.6, 300 sec: 5536.7). Total num frames: 910401536. Throughput: 0: 5784.4. Samples: 910399790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:46,601][25689] Avg episode reward: [(0, '0.665')] [2022-07-10 20:58:48,062][26022] Updated weights on worker 0-0, policy_version 889073 (0.00094) [2022-07-10 20:58:49,926][26022] Updated weights on worker 0-0, policy_version 889083 (0.00081) [2022-07-10 20:58:51,623][25689] Fps is (10 sec: 5609.4, 60 sec: 5537.2, 300 sec: 5542.2). Total num frames: 910430208. Throughput: 0: 5786.9. Samples: 910433170. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:51,623][25689] Avg episode reward: [(0, '0.783')] [2022-07-10 20:58:51,763][26022] Updated weights on worker 0-0, policy_version 889093 (0.00088) [2022-07-10 20:58:53,821][26022] Updated weights on worker 0-0, policy_version 889103 (0.00083) [2022-07-10 20:58:55,545][26022] Updated weights on worker 0-0, policy_version 889113 (0.00091) [2022-07-10 20:58:56,770][25689] Fps is (10 sec: 5537.7, 60 sec: 5514.3, 300 sec: 5540.1). Total num frames: 910457856. Throughput: 0: 5766.1. Samples: 910466302. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:58:56,774][25689] Avg episode reward: [(0, '0.592')] [2022-07-10 20:58:57,534][26022] Updated weights on worker 0-0, policy_version 889123 (0.01140) [2022-07-10 20:58:59,158][26022] Updated weights on worker 0-0, policy_version 889133 (0.00086) [2022-07-10 20:59:01,249][26022] Updated weights on worker 0-0, policy_version 889143 (0.00091) [2022-07-10 20:59:01,870][25689] Fps is (10 sec: 5496.0, 60 sec: 5540.0, 300 sec: 5549.5). Total num frames: 910486528. Throughput: 0: 4923.9. Samples: 910482876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 20:59:01,871][25689] Avg episode reward: [(0, '0.586')] [2022-07-10 20:59:03,271][26022] Updated weights on worker 0-0, policy_version 889153 (0.00095) [2022-07-10 20:59:04,918][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 20:59:04,930][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000889161_910500864.pth [2022-07-10 20:59:04,931][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000887213_908506112.pth [2022-07-10 20:59:05,183][26022] Updated weights on worker 0-0, policy_version 889163 (0.00087) [2022-07-10 20:59:06,869][26022] Updated weights on worker 0-0, policy_version 889173 (0.00084) [2022-07-10 20:59:06,964][25689] Fps is (10 sec: 5424.4, 60 sec: 5531.7, 300 sec: 5541.2). Total num frames: 910513152. Throughput: 0: 5608.1. Samples: 910513994. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:06,965][25689] Avg episode reward: [(0, '0.715')] [2022-07-10 20:59:08,803][26022] Updated weights on worker 0-0, policy_version 889183 (0.00088) [2022-07-10 20:59:10,692][26022] Updated weights on worker 0-0, policy_version 889193 (0.00092) [2022-07-10 20:59:12,007][25689] Fps is (10 sec: 5354.0, 60 sec: 5528.2, 300 sec: 5541.2). Total num frames: 910540800. Throughput: 0: 5608.7. Samples: 910547498. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:12,009][25689] Avg episode reward: [(0, '0.532')] [2022-07-10 20:59:12,541][26022] Updated weights on worker 0-0, policy_version 889203 (0.00083) [2022-07-10 20:59:14,307][26022] Updated weights on worker 0-0, policy_version 889213 (0.00083) [2022-07-10 20:59:16,030][26022] Updated weights on worker 0-0, policy_version 889223 (0.00085) [2022-07-10 20:59:17,070][25689] Fps is (10 sec: 5471.7, 60 sec: 5518.8, 300 sec: 5537.4). Total num frames: 910568448. Throughput: 0: 4827.2. Samples: 910564294. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:17,070][25689] Avg episode reward: [(0, '-0.186')] [2022-07-10 20:59:17,972][26022] Updated weights on worker 0-0, policy_version 889233 (0.00091) [2022-07-10 20:59:19,927][26022] Updated weights on worker 0-0, policy_version 889243 (0.00052) [2022-07-10 20:59:21,868][26022] Updated weights on worker 0-0, policy_version 889253 (0.00088) [2022-07-10 20:59:22,164][25689] Fps is (10 sec: 5444.2, 60 sec: 5477.5, 300 sec: 5532.9). Total num frames: 910596096. Throughput: 0: 5667.9. Samples: 910597898. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:22,164][25689] Avg episode reward: [(0, '-0.179')] [2022-07-10 20:59:23,435][26022] Updated weights on worker 0-0, policy_version 889263 (0.00084) [2022-07-10 20:59:25,457][26022] Updated weights on worker 0-0, policy_version 889273 (0.00080) [2022-07-10 20:59:27,139][26022] Updated weights on worker 0-0, policy_version 889283 (0.00084) [2022-07-10 20:59:27,207][25689] Fps is (10 sec: 5657.1, 60 sec: 5542.5, 300 sec: 5539.2). Total num frames: 910625792. Throughput: 0: 5802.7. Samples: 910631456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:27,207][25689] Avg episode reward: [(0, '-0.176')] [2022-07-10 20:59:29,003][26022] Updated weights on worker 0-0, policy_version 889293 (0.00087) [2022-07-10 20:59:30,806][26022] Updated weights on worker 0-0, policy_version 889303 (0.00091) [2022-07-10 20:59:32,221][25689] Fps is (10 sec: 5599.8, 60 sec: 5508.5, 300 sec: 5537.1). Total num frames: 910652416. Throughput: 0: 4980.4. Samples: 910648174. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:32,223][25689] Avg episode reward: [(0, '-0.425')] [2022-07-10 20:59:32,688][26022] Updated weights on worker 0-0, policy_version 889313 (0.00088) [2022-07-10 20:59:34,626][26022] Updated weights on worker 0-0, policy_version 889323 (0.00085) [2022-07-10 20:59:36,344][26022] Updated weights on worker 0-0, policy_version 889333 (0.00087) [2022-07-10 20:59:37,287][25689] Fps is (10 sec: 5586.9, 60 sec: 5547.7, 300 sec: 5539.9). Total num frames: 910682112. Throughput: 0: 5803.4. Samples: 910681624. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:37,299][25689] Avg episode reward: [(0, '-0.419')] [2022-07-10 20:59:38,215][26022] Updated weights on worker 0-0, policy_version 889343 (0.00083) [2022-07-10 20:59:40,039][26022] Updated weights on worker 0-0, policy_version 889353 (0.00091) [2022-07-10 20:59:42,000][26022] Updated weights on worker 0-0, policy_version 889363 (0.00104) [2022-07-10 20:59:42,364][25689] Fps is (10 sec: 5552.9, 60 sec: 5509.1, 300 sec: 5532.0). Total num frames: 910708736. Throughput: 0: 5797.8. Samples: 910715014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:42,364][25689] Avg episode reward: [(0, '-0.131')] [2022-07-10 20:59:43,700][26022] Updated weights on worker 0-0, policy_version 889373 (0.00093) [2022-07-10 20:59:45,643][26022] Updated weights on worker 0-0, policy_version 889383 (0.00085) [2022-07-10 20:59:47,392][25689] Fps is (10 sec: 5472.2, 60 sec: 5524.9, 300 sec: 5535.3). Total num frames: 910737408. Throughput: 0: 4968.1. Samples: 910731740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:47,393][25689] Avg episode reward: [(0, '0.530')] [2022-07-10 20:59:47,399][26022] Updated weights on worker 0-0, policy_version 889393 (0.00104) [2022-07-10 20:59:49,298][26022] Updated weights on worker 0-0, policy_version 889403 (0.00083) [2022-07-10 20:59:51,069][26022] Updated weights on worker 0-0, policy_version 889413 (0.00095) [2022-07-10 20:59:52,428][25689] Fps is (10 sec: 5596.0, 60 sec: 5506.9, 300 sec: 5529.1). Total num frames: 910765056. Throughput: 0: 5797.0. Samples: 910765312. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:52,429][25689] Avg episode reward: [(0, '0.862')] [2022-07-10 20:59:52,965][26022] Updated weights on worker 0-0, policy_version 889423 (0.00089) [2022-07-10 20:59:54,804][26022] Updated weights on worker 0-0, policy_version 889433 (0.00087) [2022-07-10 20:59:56,387][26022] Updated weights on worker 0-0, policy_version 889443 (0.00083) [2022-07-10 20:59:57,551][25689] Fps is (10 sec: 5644.9, 60 sec: 5542.8, 300 sec: 5533.9). Total num frames: 910794752. Throughput: 0: 5798.7. Samples: 910799126. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 20:59:57,551][25689] Avg episode reward: [(0, '0.776')] [2022-07-10 20:59:58,616][26022] Updated weights on worker 0-0, policy_version 889453 (0.00087) [2022-07-10 21:00:00,132][26022] Updated weights on worker 0-0, policy_version 889463 (0.00090) [2022-07-10 21:00:02,574][25689] Fps is (10 sec: 5349.4, 60 sec: 5482.4, 300 sec: 5533.6). Total num frames: 910819328. Throughput: 0: 4998.1. Samples: 910816024. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:02,574][25689] Avg episode reward: [(0, '0.670')] [2022-07-10 21:00:02,639][26022] Updated weights on worker 0-0, policy_version 889473 (0.00094) [2022-07-10 21:00:04,266][26022] Updated weights on worker 0-0, policy_version 889483 (0.00088) [2022-07-10 21:00:06,175][26022] Updated weights on worker 0-0, policy_version 889493 (0.00098) [2022-07-10 21:00:07,604][25689] Fps is (10 sec: 5296.8, 60 sec: 5521.9, 300 sec: 5529.7). Total num frames: 910848000. Throughput: 0: 5705.2. Samples: 910847050. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:07,604][25689] Avg episode reward: [(0, '0.099')] [2022-07-10 21:00:08,057][26022] Updated weights on worker 0-0, policy_version 889503 (0.00088) [2022-07-10 21:00:09,787][26022] Updated weights on worker 0-0, policy_version 889513 (0.00085) [2022-07-10 21:00:11,575][26022] Updated weights on worker 0-0, policy_version 889523 (0.00099) [2022-07-10 21:00:12,627][25689] Fps is (10 sec: 5704.2, 60 sec: 5540.6, 300 sec: 5537.3). Total num frames: 910876672. Throughput: 0: 5706.3. Samples: 910880570. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:12,627][25689] Avg episode reward: [(0, '0.373')] [2022-07-10 21:00:13,739][26022] Updated weights on worker 0-0, policy_version 889533 (0.00093) [2022-07-10 21:00:15,249][26022] Updated weights on worker 0-0, policy_version 889543 (0.00081) [2022-07-10 21:00:17,275][26022] Updated weights on worker 0-0, policy_version 889553 (0.00088) [2022-07-10 21:00:17,664][25689] Fps is (10 sec: 5699.8, 60 sec: 5559.9, 300 sec: 5536.8). Total num frames: 910905344. Throughput: 0: 5726.8. Samples: 910914312. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:17,665][25689] Avg episode reward: [(0, '0.354')] [2022-07-10 21:00:19,007][26022] Updated weights on worker 0-0, policy_version 889563 (0.00083) [2022-07-10 21:00:20,776][26022] Updated weights on worker 0-0, policy_version 889573 (0.00088) [2022-07-10 21:00:22,687][25689] Fps is (10 sec: 5496.6, 60 sec: 5549.5, 300 sec: 5533.0). Total num frames: 910931968. Throughput: 0: 5731.4. Samples: 910931300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:22,693][25689] Avg episode reward: [(0, '-0.986')] [2022-07-10 21:00:22,706][26022] Updated weights on worker 0-0, policy_version 889583 (0.00394) [2022-07-10 21:00:24,395][26022] Updated weights on worker 0-0, policy_version 889593 (0.00082) [2022-07-10 21:00:26,468][26022] Updated weights on worker 0-0, policy_version 889603 (0.00086) [2022-07-10 21:00:27,712][25689] Fps is (10 sec: 5503.3, 60 sec: 5534.2, 300 sec: 5536.1). Total num frames: 910960640. Throughput: 0: 5845.1. Samples: 910964586. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:27,713][25689] Avg episode reward: [(0, '-0.988')] [2022-07-10 21:00:28,190][26022] Updated weights on worker 0-0, policy_version 889613 (0.00086) [2022-07-10 21:00:29,898][26022] Updated weights on worker 0-0, policy_version 889623 (0.00438) [2022-07-10 21:00:32,067][26022] Updated weights on worker 0-0, policy_version 889633 (0.00094) [2022-07-10 21:00:32,715][25689] Fps is (10 sec: 5616.1, 60 sec: 5552.2, 300 sec: 5537.0). Total num frames: 910988288. Throughput: 0: 5820.2. Samples: 910997488. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:32,715][25689] Avg episode reward: [(0, '-1.336')] [2022-07-10 21:00:33,758][26022] Updated weights on worker 0-0, policy_version 889643 (0.00090) [2022-07-10 21:00:35,535][26022] Updated weights on worker 0-0, policy_version 889653 (0.00085) [2022-07-10 21:00:37,476][26022] Updated weights on worker 0-0, policy_version 889663 (0.00088) [2022-07-10 21:00:37,786][25689] Fps is (10 sec: 5489.2, 60 sec: 5517.9, 300 sec: 5529.7). Total num frames: 911015936. Throughput: 0: 4965.6. Samples: 911014226. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:37,788][25689] Avg episode reward: [(0, '-1.134')] [2022-07-10 21:00:39,138][26022] Updated weights on worker 0-0, policy_version 889673 (0.00085) [2022-07-10 21:00:41,391][26022] Updated weights on worker 0-0, policy_version 889683 (0.00083) [2022-07-10 21:00:42,806][25689] Fps is (10 sec: 5581.3, 60 sec: 5556.9, 300 sec: 5536.3). Total num frames: 911044608. Throughput: 0: 5782.4. Samples: 911047636. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:42,806][25689] Avg episode reward: [(0, '-1.015')] [2022-07-10 21:00:42,964][26022] Updated weights on worker 0-0, policy_version 889693 (0.00090) [2022-07-10 21:00:44,788][26022] Updated weights on worker 0-0, policy_version 889703 (0.00091) [2022-07-10 21:00:46,632][26022] Updated weights on worker 0-0, policy_version 889713 (0.00086) [2022-07-10 21:00:47,817][25689] Fps is (10 sec: 5614.6, 60 sec: 5541.6, 300 sec: 5536.5). Total num frames: 911072256. Throughput: 0: 5801.6. Samples: 911081224. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:47,817][25689] Avg episode reward: [(0, '0.081')] [2022-07-10 21:00:48,319][26022] Updated weights on worker 0-0, policy_version 889723 (0.00114) [2022-07-10 21:00:50,348][26022] Updated weights on worker 0-0, policy_version 889733 (0.00088) [2022-07-10 21:00:52,135][26022] Updated weights on worker 0-0, policy_version 889743 (0.00091) [2022-07-10 21:00:52,833][25689] Fps is (10 sec: 5616.7, 60 sec: 5560.4, 300 sec: 5531.8). Total num frames: 911100928. Throughput: 0: 5005.7. Samples: 911098192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:52,834][25689] Avg episode reward: [(0, '0.332')] [2022-07-10 21:00:53,957][26022] Updated weights on worker 0-0, policy_version 889753 (0.00084) [2022-07-10 21:00:55,812][26022] Updated weights on worker 0-0, policy_version 889763 (0.00085) [2022-07-10 21:00:57,505][26022] Updated weights on worker 0-0, policy_version 889773 (0.00088) [2022-07-10 21:00:57,990][25689] Fps is (10 sec: 5535.8, 60 sec: 5523.3, 300 sec: 5536.5). Total num frames: 911128576. Throughput: 0: 5824.0. Samples: 911131898. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:00:57,991][25689] Avg episode reward: [(0, '0.901')] [2022-07-10 21:00:59,548][26022] Updated weights on worker 0-0, policy_version 889783 (0.00091) [2022-07-10 21:01:01,216][26022] Updated weights on worker 0-0, policy_version 889793 (0.00095) [2022-07-10 21:01:03,078][25689] Fps is (10 sec: 5197.7, 60 sec: 5534.3, 300 sec: 5529.0). Total num frames: 911154176. Throughput: 0: 5700.8. Samples: 911163202. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:03,078][25689] Avg episode reward: [(0, '1.347')] [2022-07-10 21:01:03,650][26022] Updated weights on worker 0-0, policy_version 889803 (0.00094) [2022-07-10 21:01:04,959][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:01:04,971][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000889811_911166464.pth [2022-07-10 21:01:04,971][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000887863_909171712.pth [2022-07-10 21:01:05,436][26022] Updated weights on worker 0-0, policy_version 889813 (0.00086) [2022-07-10 21:01:07,332][26022] Updated weights on worker 0-0, policy_version 889823 (0.00096) [2022-07-10 21:01:08,121][25689] Fps is (10 sec: 5458.2, 60 sec: 5550.0, 300 sec: 5536.2). Total num frames: 911183872. Throughput: 0: 4851.4. Samples: 911179730. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:08,123][25689] Avg episode reward: [(0, '1.063')] [2022-07-10 21:01:09,205][26022] Updated weights on worker 0-0, policy_version 889833 (0.00093) [2022-07-10 21:01:10,991][26022] Updated weights on worker 0-0, policy_version 889843 (0.00089) [2022-07-10 21:01:12,773][26022] Updated weights on worker 0-0, policy_version 889853 (0.00089) [2022-07-10 21:01:13,141][25689] Fps is (10 sec: 5596.4, 60 sec: 5516.4, 300 sec: 5530.7). Total num frames: 911210496. Throughput: 0: 5647.5. Samples: 911212882. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:13,142][25689] Avg episode reward: [(0, '1.112')] [2022-07-10 21:01:14,780][26022] Updated weights on worker 0-0, policy_version 889863 (0.00081) [2022-07-10 21:01:16,387][26022] Updated weights on worker 0-0, policy_version 889873 (0.00085) [2022-07-10 21:01:18,218][25689] Fps is (10 sec: 5375.0, 60 sec: 5495.9, 300 sec: 5522.5). Total num frames: 911238144. Throughput: 0: 5659.9. Samples: 911246386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:18,219][25689] Avg episode reward: [(0, '1.418')] [2022-07-10 21:01:18,548][26022] Updated weights on worker 0-0, policy_version 889883 (0.00097) [2022-07-10 21:01:19,812][26022] Updated weights on worker 0-0, policy_version 889893 (0.00091) [2022-07-10 21:01:22,144][26022] Updated weights on worker 0-0, policy_version 889903 (0.00087) [2022-07-10 21:01:23,235][25689] Fps is (10 sec: 5782.7, 60 sec: 5564.1, 300 sec: 5540.5). Total num frames: 911268864. Throughput: 0: 4952.3. Samples: 911263026. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:23,235][25689] Avg episode reward: [(0, '0.917')] [2022-07-10 21:01:23,632][26022] Updated weights on worker 0-0, policy_version 889913 (0.00094) [2022-07-10 21:01:25,716][26022] Updated weights on worker 0-0, policy_version 889923 (0.00090) [2022-07-10 21:01:27,739][26022] Updated weights on worker 0-0, policy_version 889933 (0.00086) [2022-07-10 21:01:28,273][25689] Fps is (10 sec: 5397.7, 60 sec: 5478.4, 300 sec: 5524.4). Total num frames: 911292416. Throughput: 0: 5791.1. Samples: 911296430. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:28,273][25689] Avg episode reward: [(0, '0.953')] [2022-07-10 21:01:29,324][26022] Updated weights on worker 0-0, policy_version 889943 (0.00089) [2022-07-10 21:01:31,475][26022] Updated weights on worker 0-0, policy_version 889953 (0.00096) [2022-07-10 21:01:32,952][26022] Updated weights on worker 0-0, policy_version 889963 (0.00092) [2022-07-10 21:01:33,295][25689] Fps is (10 sec: 5394.8, 60 sec: 5527.4, 300 sec: 5535.6). Total num frames: 911323136. Throughput: 0: 5793.6. Samples: 911329644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:33,295][25689] Avg episode reward: [(0, '0.871')] [2022-07-10 21:01:35,087][26022] Updated weights on worker 0-0, policy_version 889973 (0.00088) [2022-07-10 21:01:36,840][26022] Updated weights on worker 0-0, policy_version 889983 (0.00093) [2022-07-10 21:01:38,371][25689] Fps is (10 sec: 5780.2, 60 sec: 5526.9, 300 sec: 5529.2). Total num frames: 911350784. Throughput: 0: 4944.6. Samples: 911346032. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:38,373][25689] Avg episode reward: [(0, '1.125')] [2022-07-10 21:01:38,697][26022] Updated weights on worker 0-0, policy_version 889993 (0.00086) [2022-07-10 21:01:40,635][26022] Updated weights on worker 0-0, policy_version 890003 (0.00089) [2022-07-10 21:01:42,322][26022] Updated weights on worker 0-0, policy_version 890013 (0.00081) [2022-07-10 21:01:43,445][25689] Fps is (10 sec: 5346.9, 60 sec: 5488.2, 300 sec: 5525.1). Total num frames: 911377408. Throughput: 0: 5724.8. Samples: 911378726. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:43,446][25689] Avg episode reward: [(0, '0.808')] [2022-07-10 21:01:44,289][26022] Updated weights on worker 0-0, policy_version 890023 (0.00086) [2022-07-10 21:01:46,227][26022] Updated weights on worker 0-0, policy_version 890033 (0.00087) [2022-07-10 21:01:47,834][26022] Updated weights on worker 0-0, policy_version 890043 (0.00087) [2022-07-10 21:01:48,475][25689] Fps is (10 sec: 5574.1, 60 sec: 5520.2, 300 sec: 5535.2). Total num frames: 911407104. Throughput: 0: 5755.7. Samples: 911412706. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:48,477][25689] Avg episode reward: [(0, '0.521')] [2022-07-10 21:01:49,811][26022] Updated weights on worker 0-0, policy_version 890053 (0.00100) [2022-07-10 21:01:51,569][26022] Updated weights on worker 0-0, policy_version 890063 (0.00089) [2022-07-10 21:01:53,362][26022] Updated weights on worker 0-0, policy_version 890073 (0.00087) [2022-07-10 21:01:53,539][25689] Fps is (10 sec: 5681.1, 60 sec: 5499.0, 300 sec: 5526.3). Total num frames: 911434752. Throughput: 0: 4948.2. Samples: 911429816. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:53,539][25689] Avg episode reward: [(0, '0.420')] [2022-07-10 21:01:55,284][26022] Updated weights on worker 0-0, policy_version 890083 (0.00090) [2022-07-10 21:01:56,991][26022] Updated weights on worker 0-0, policy_version 890093 (0.00087) [2022-07-10 21:01:58,598][25689] Fps is (10 sec: 5462.2, 60 sec: 5507.9, 300 sec: 5532.3). Total num frames: 911462400. Throughput: 0: 5801.8. Samples: 911463386. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:01:58,599][25689] Avg episode reward: [(0, '-0.796')] [2022-07-10 21:01:58,829][26022] Updated weights on worker 0-0, policy_version 890103 (0.00084) [2022-07-10 21:02:00,644][26022] Updated weights on worker 0-0, policy_version 890113 (0.00088) [2022-07-10 21:02:02,906][26022] Updated weights on worker 0-0, policy_version 890123 (0.00098) [2022-07-10 21:02:03,616][25689] Fps is (10 sec: 5486.9, 60 sec: 5548.0, 300 sec: 5533.1). Total num frames: 911490048. Throughput: 0: 5767.3. Samples: 911495062. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:03,617][25689] Avg episode reward: [(0, '-0.557')] [2022-07-10 21:02:04,623][26022] Updated weights on worker 0-0, policy_version 890133 (0.00087) [2022-07-10 21:02:06,602][26022] Updated weights on worker 0-0, policy_version 890143 (0.00088) [2022-07-10 21:02:08,458][26022] Updated weights on worker 0-0, policy_version 890153 (0.00086) [2022-07-10 21:02:08,637][25689] Fps is (10 sec: 5406.3, 60 sec: 5499.4, 300 sec: 5526.5). Total num frames: 911516672. Throughput: 0: 4900.4. Samples: 911511508. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:08,637][25689] Avg episode reward: [(0, '-0.280')] [2022-07-10 21:02:10,308][26022] Updated weights on worker 0-0, policy_version 890163 (0.00082) [2022-07-10 21:02:12,137][26022] Updated weights on worker 0-0, policy_version 890173 (0.00084) [2022-07-10 21:02:13,647][25689] Fps is (10 sec: 5410.9, 60 sec: 5517.2, 300 sec: 5535.9). Total num frames: 911544320. Throughput: 0: 5720.0. Samples: 911544834. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:13,648][25689] Avg episode reward: [(0, '-0.659')] [2022-07-10 21:02:13,963][26022] Updated weights on worker 0-0, policy_version 890183 (0.00100) [2022-07-10 21:02:16,249][26022] Updated weights on worker 0-0, policy_version 890193 (0.00085) [2022-07-10 21:02:17,510][26022] Updated weights on worker 0-0, policy_version 890203 (0.00094) [2022-07-10 21:02:18,779][25689] Fps is (10 sec: 5553.1, 60 sec: 5529.2, 300 sec: 5527.1). Total num frames: 911572992. Throughput: 0: 5685.6. Samples: 911578126. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:18,780][25689] Avg episode reward: [(0, '-0.555')] [2022-07-10 21:02:19,695][26022] Updated weights on worker 0-0, policy_version 890213 (0.00086) [2022-07-10 21:02:21,187][26022] Updated weights on worker 0-0, policy_version 890223 (0.00087) [2022-07-10 21:02:23,352][26022] Updated weights on worker 0-0, policy_version 890233 (0.00095) [2022-07-10 21:02:23,796][25689] Fps is (10 sec: 5650.0, 60 sec: 5495.2, 300 sec: 5530.2). Total num frames: 911601664. Throughput: 0: 4947.9. Samples: 911594908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:23,797][25689] Avg episode reward: [(0, '-0.300')] [2022-07-10 21:02:24,984][26022] Updated weights on worker 0-0, policy_version 890243 (0.00087) [2022-07-10 21:02:26,916][26022] Updated weights on worker 0-0, policy_version 890253 (0.00094) [2022-07-10 21:02:28,676][26022] Updated weights on worker 0-0, policy_version 890263 (0.00106) [2022-07-10 21:02:28,837][25689] Fps is (10 sec: 5701.4, 60 sec: 5579.6, 300 sec: 5533.0). Total num frames: 911630336. Throughput: 0: 5795.9. Samples: 911628584. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:28,837][25689] Avg episode reward: [(0, '0.539')] [2022-07-10 21:02:30,401][26022] Updated weights on worker 0-0, policy_version 890273 (0.00084) [2022-07-10 21:02:32,381][26022] Updated weights on worker 0-0, policy_version 890283 (0.00091) [2022-07-10 21:02:33,849][25689] Fps is (10 sec: 5602.6, 60 sec: 5529.8, 300 sec: 5533.9). Total num frames: 911657984. Throughput: 0: 5803.0. Samples: 911662066. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:33,849][25689] Avg episode reward: [(0, '-0.095')] [2022-07-10 21:02:34,043][26022] Updated weights on worker 0-0, policy_version 890293 (0.00099) [2022-07-10 21:02:36,103][26022] Updated weights on worker 0-0, policy_version 890303 (0.00093) [2022-07-10 21:02:37,739][26022] Updated weights on worker 0-0, policy_version 890313 (0.00083) [2022-07-10 21:02:38,931][25689] Fps is (10 sec: 5478.0, 60 sec: 5529.2, 300 sec: 5529.1). Total num frames: 911685632. Throughput: 0: 4998.8. Samples: 911678862. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:38,931][25689] Avg episode reward: [(0, '-0.487')] [2022-07-10 21:02:39,647][26022] Updated weights on worker 0-0, policy_version 890323 (0.00085) [2022-07-10 21:02:41,507][26022] Updated weights on worker 0-0, policy_version 890333 (0.00087) [2022-07-10 21:02:43,275][26022] Updated weights on worker 0-0, policy_version 890343 (0.00092) [2022-07-10 21:02:43,966][25689] Fps is (10 sec: 5566.8, 60 sec: 5566.6, 300 sec: 5525.3). Total num frames: 911714304. Throughput: 0: 5843.0. Samples: 911712758. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:43,966][25689] Avg episode reward: [(0, '-0.594')] [2022-07-10 21:02:44,944][26022] Updated weights on worker 0-0, policy_version 890353 (0.00112) [2022-07-10 21:02:47,096][26022] Updated weights on worker 0-0, policy_version 890363 (0.00090) [2022-07-10 21:02:48,652][26022] Updated weights on worker 0-0, policy_version 890373 (0.00094) [2022-07-10 21:02:48,975][25689] Fps is (10 sec: 5810.9, 60 sec: 5568.5, 300 sec: 5535.6). Total num frames: 911744000. Throughput: 0: 5852.2. Samples: 911746440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2022-07-10 21:02:48,976][25689] Avg episode reward: [(0, '-1.023')] [2022-07-10 21:02:50,820][26022] Updated weights on worker 0-0, policy_version 890383 (0.00083) [2022-07-10 21:02:52,495][26022] Updated weights on worker 0-0, policy_version 890393 (0.00106) [2022-07-10 21:02:53,978][25689] Fps is (10 sec: 5522.7, 60 sec: 5540.2, 300 sec: 5526.7). Total num frames: 911769600. Throughput: 0: 5865.5. Samples: 911780136. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:02:53,979][25689] Avg episode reward: [(0, '-0.952')] [2022-07-10 21:02:54,431][26022] Updated weights on worker 0-0, policy_version 890403 (0.00084) [2022-07-10 21:02:56,247][26022] Updated weights on worker 0-0, policy_version 890413 (0.00089) [2022-07-10 21:02:57,931][26022] Updated weights on worker 0-0, policy_version 890423 (0.00087) [2022-07-10 21:02:59,094][25689] Fps is (10 sec: 5465.0, 60 sec: 5568.9, 300 sec: 5535.1). Total num frames: 911799296. Throughput: 0: 5851.3. Samples: 911796840. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:02:59,094][25689] Avg episode reward: [(0, '-0.854')] [2022-07-10 21:02:59,836][26022] Updated weights on worker 0-0, policy_version 890433 (0.00086) [2022-07-10 21:03:02,018][26022] Updated weights on worker 0-0, policy_version 890443 (0.00079) [2022-07-10 21:03:03,907][26022] Updated weights on worker 0-0, policy_version 890453 (0.00067) [2022-07-10 21:03:04,096][25689] Fps is (10 sec: 5566.3, 60 sec: 5553.5, 300 sec: 5535.1). Total num frames: 911825920. Throughput: 0: 5749.3. Samples: 911828494. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:04,097][25689] Avg episode reward: [(0, '-0.534')] [2022-07-10 21:03:05,024][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:03:05,043][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000890459_911830016.pth [2022-07-10 21:03:05,044][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000888512_909836288.pth [2022-07-10 21:03:05,698][26022] Updated weights on worker 0-0, policy_version 890463 (0.00056) [2022-07-10 21:03:07,510][26022] Updated weights on worker 0-0, policy_version 890473 (0.00103) [2022-07-10 21:03:09,154][25689] Fps is (10 sec: 5292.9, 60 sec: 5550.0, 300 sec: 5530.7). Total num frames: 911852544. Throughput: 0: 5708.7. Samples: 911861634. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:09,154][25689] Avg episode reward: [(0, '-0.675')] [2022-07-10 21:03:09,416][26022] Updated weights on worker 0-0, policy_version 890483 (0.00107) [2022-07-10 21:03:11,180][26022] Updated weights on worker 0-0, policy_version 890493 (0.00089) [2022-07-10 21:03:13,063][26022] Updated weights on worker 0-0, policy_version 890503 (0.00091) [2022-07-10 21:03:14,163][25689] Fps is (10 sec: 5391.0, 60 sec: 5550.1, 300 sec: 5529.8). Total num frames: 911880192. Throughput: 0: 4859.1. Samples: 911878218. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:14,164][25689] Avg episode reward: [(0, '-0.659')] [2022-07-10 21:03:14,936][26022] Updated weights on worker 0-0, policy_version 890513 (0.00084) [2022-07-10 21:03:16,908][26022] Updated weights on worker 0-0, policy_version 890523 (0.00087) [2022-07-10 21:03:18,632][26022] Updated weights on worker 0-0, policy_version 890533 (0.00083) [2022-07-10 21:03:19,267][25689] Fps is (10 sec: 5467.5, 60 sec: 5535.7, 300 sec: 5521.2). Total num frames: 911907840. Throughput: 0: 5681.7. Samples: 911911462. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:19,268][25689] Avg episode reward: [(0, '-0.342')] [2022-07-10 21:03:20,561][26022] Updated weights on worker 0-0, policy_version 890543 (0.00086) [2022-07-10 21:03:22,462][26022] Updated weights on worker 0-0, policy_version 890553 (0.00094) [2022-07-10 21:03:24,135][26022] Updated weights on worker 0-0, policy_version 890563 (0.00094) [2022-07-10 21:03:24,283][25689] Fps is (10 sec: 5666.6, 60 sec: 5552.8, 300 sec: 5534.9). Total num frames: 911937536. Throughput: 0: 5765.9. Samples: 911944890. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:24,285][25689] Avg episode reward: [(0, '-0.708')] [2022-07-10 21:03:26,235][26022] Updated weights on worker 0-0, policy_version 890573 (0.00083) [2022-07-10 21:03:27,775][26022] Updated weights on worker 0-0, policy_version 890583 (0.00100) [2022-07-10 21:03:29,348][25689] Fps is (10 sec: 5586.7, 60 sec: 5516.7, 300 sec: 5527.1). Total num frames: 911964160. Throughput: 0: 4946.1. Samples: 911961518. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:29,349][25689] Avg episode reward: [(0, '-0.941')] [2022-07-10 21:03:29,816][26022] Updated weights on worker 0-0, policy_version 890593 (0.00105) [2022-07-10 21:03:31,595][26022] Updated weights on worker 0-0, policy_version 890603 (0.00090) [2022-07-10 21:03:33,238][26022] Updated weights on worker 0-0, policy_version 890613 (0.00093) [2022-07-10 21:03:34,351][25689] Fps is (10 sec: 5390.5, 60 sec: 5517.5, 300 sec: 5529.4). Total num frames: 911991808. Throughput: 0: 5788.5. Samples: 911995074. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:34,351][25689] Avg episode reward: [(0, '-0.671')] [2022-07-10 21:03:35,268][26022] Updated weights on worker 0-0, policy_version 890623 (0.00093) [2022-07-10 21:03:37,023][26022] Updated weights on worker 0-0, policy_version 890633 (0.00092) [2022-07-10 21:03:38,808][26022] Updated weights on worker 0-0, policy_version 890643 (0.00088) [2022-07-10 21:03:39,383][25689] Fps is (10 sec: 5714.4, 60 sec: 5556.0, 300 sec: 5532.6). Total num frames: 912021504. Throughput: 0: 5816.2. Samples: 912028460. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:39,384][25689] Avg episode reward: [(0, '-0.832')] [2022-07-10 21:03:40,899][26022] Updated weights on worker 0-0, policy_version 890653 (0.00088) [2022-07-10 21:03:42,512][26022] Updated weights on worker 0-0, policy_version 890663 (0.00093) [2022-07-10 21:03:44,432][25689] Fps is (10 sec: 5587.0, 60 sec: 5520.9, 300 sec: 5528.6). Total num frames: 912048128. Throughput: 0: 4982.7. Samples: 912045284. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:44,432][25689] Avg episode reward: [(0, '-0.783')] [2022-07-10 21:03:44,469][26022] Updated weights on worker 0-0, policy_version 890673 (0.00089) [2022-07-10 21:03:46,095][26022] Updated weights on worker 0-0, policy_version 890683 (0.00088) [2022-07-10 21:03:48,030][26022] Updated weights on worker 0-0, policy_version 890693 (0.00080) [2022-07-10 21:03:49,453][25689] Fps is (10 sec: 5389.5, 60 sec: 5485.9, 300 sec: 5525.2). Total num frames: 912075776. Throughput: 0: 5835.6. Samples: 912078842. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:49,454][25689] Avg episode reward: [(0, '-1.034')] [2022-07-10 21:03:49,859][26022] Updated weights on worker 0-0, policy_version 890703 (0.00084) [2022-07-10 21:03:51,773][26022] Updated weights on worker 0-0, policy_version 890713 (0.00089) [2022-07-10 21:03:53,560][26022] Updated weights on worker 0-0, policy_version 890723 (0.00088) [2022-07-10 21:03:54,460][25689] Fps is (10 sec: 5616.1, 60 sec: 5536.4, 300 sec: 5531.2). Total num frames: 912104448. Throughput: 0: 5808.7. Samples: 912111880. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:54,460][25689] Avg episode reward: [(0, '-0.831')] [2022-07-10 21:03:55,404][26022] Updated weights on worker 0-0, policy_version 890733 (0.00082) [2022-07-10 21:03:57,361][26022] Updated weights on worker 0-0, policy_version 890743 (0.00083) [2022-07-10 21:03:59,312][26022] Updated weights on worker 0-0, policy_version 890753 (0.00092) [2022-07-10 21:03:59,577][25689] Fps is (10 sec: 5563.0, 60 sec: 5502.3, 300 sec: 5527.5). Total num frames: 912132096. Throughput: 0: 4960.0. Samples: 912128622. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:03:59,578][25689] Avg episode reward: [(0, '-1.367')] [2022-07-10 21:04:00,987][26022] Updated weights on worker 0-0, policy_version 890763 (0.00083) [2022-07-10 21:04:03,421][26022] Updated weights on worker 0-0, policy_version 890773 (0.00096) [2022-07-10 21:04:04,654][25689] Fps is (10 sec: 5324.0, 60 sec: 5495.6, 300 sec: 5527.8). Total num frames: 912158720. Throughput: 0: 5662.4. Samples: 912159790. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:04,654][25689] Avg episode reward: [(0, '-0.632')] [2022-07-10 21:04:05,002][26022] Updated weights on worker 0-0, policy_version 890783 (0.00086) [2022-07-10 21:04:06,992][26022] Updated weights on worker 0-0, policy_version 890793 (0.00086) [2022-07-10 21:04:08,744][26022] Updated weights on worker 0-0, policy_version 890803 (0.00085) [2022-07-10 21:04:09,667][25689] Fps is (10 sec: 5277.6, 60 sec: 5499.7, 300 sec: 5524.9). Total num frames: 912185344. Throughput: 0: 5644.2. Samples: 912192930. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:09,667][25689] Avg episode reward: [(0, '-0.345')] [2022-07-10 21:04:10,684][26022] Updated weights on worker 0-0, policy_version 890813 (0.00095) [2022-07-10 21:04:12,519][26022] Updated weights on worker 0-0, policy_version 890823 (0.00094) [2022-07-10 21:04:14,419][26022] Updated weights on worker 0-0, policy_version 890833 (0.00085) [2022-07-10 21:04:14,687][25689] Fps is (10 sec: 5613.4, 60 sec: 5532.5, 300 sec: 5532.6). Total num frames: 912215040. Throughput: 0: 4832.4. Samples: 912209626. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:14,687][25689] Avg episode reward: [(0, '-0.261')] [2022-07-10 21:04:16,281][26022] Updated weights on worker 0-0, policy_version 890843 (0.00760) [2022-07-10 21:04:18,046][26022] Updated weights on worker 0-0, policy_version 890853 (0.01296) [2022-07-10 21:04:19,754][25689] Fps is (10 sec: 5583.2, 60 sec: 5519.0, 300 sec: 5529.6). Total num frames: 912241664. Throughput: 0: 5675.7. Samples: 912243138. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:19,755][25689] Avg episode reward: [(0, '-0.771')] [2022-07-10 21:04:19,982][26022] Updated weights on worker 0-0, policy_version 890863 (0.00086) [2022-07-10 21:04:21,691][26022] Updated weights on worker 0-0, policy_version 890873 (0.00094) [2022-07-10 21:04:23,609][26022] Updated weights on worker 0-0, policy_version 890883 (0.00086) [2022-07-10 21:04:24,772][25689] Fps is (10 sec: 5584.3, 60 sec: 5518.7, 300 sec: 5530.1). Total num frames: 912271360. Throughput: 0: 5820.7. Samples: 912276894. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:24,773][25689] Avg episode reward: [(0, '-0.579')] [2022-07-10 21:04:25,458][26022] Updated weights on worker 0-0, policy_version 890893 (0.00086) [2022-07-10 21:04:27,210][26022] Updated weights on worker 0-0, policy_version 890903 (0.00084) [2022-07-10 21:04:29,114][26022] Updated weights on worker 0-0, policy_version 890913 (0.00091) [2022-07-10 21:04:29,803][25689] Fps is (10 sec: 5502.8, 60 sec: 5505.0, 300 sec: 5526.4). Total num frames: 912296960. Throughput: 0: 4996.3. Samples: 912293536. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:29,803][25689] Avg episode reward: [(0, '-0.912')] [2022-07-10 21:04:30,886][26022] Updated weights on worker 0-0, policy_version 890923 (0.00086) [2022-07-10 21:04:32,955][26022] Updated weights on worker 0-0, policy_version 890933 (0.00087) [2022-07-10 21:04:34,559][26022] Updated weights on worker 0-0, policy_version 890943 (0.00096) [2022-07-10 21:04:34,841][25689] Fps is (10 sec: 5492.1, 60 sec: 5535.6, 300 sec: 5526.9). Total num frames: 912326656. Throughput: 0: 5807.0. Samples: 912326658. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:34,841][25689] Avg episode reward: [(0, '-0.641')] [2022-07-10 21:04:36,417][26022] Updated weights on worker 0-0, policy_version 890953 (0.00089) [2022-07-10 21:04:38,321][26022] Updated weights on worker 0-0, policy_version 890963 (0.00095) [2022-07-10 21:04:39,915][25689] Fps is (10 sec: 5670.9, 60 sec: 5498.0, 300 sec: 5530.4). Total num frames: 912354304. Throughput: 0: 5793.9. Samples: 912359946. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:39,918][25689] Avg episode reward: [(0, '-1.065')] [2022-07-10 21:04:40,391][26022] Updated weights on worker 0-0, policy_version 890973 (0.00096) [2022-07-10 21:04:41,955][26022] Updated weights on worker 0-0, policy_version 890983 (0.00086) [2022-07-10 21:04:44,011][26022] Updated weights on worker 0-0, policy_version 890993 (0.00084) [2022-07-10 21:04:44,947][25689] Fps is (10 sec: 5471.5, 60 sec: 5516.4, 300 sec: 5526.9). Total num frames: 912381952. Throughput: 0: 4945.8. Samples: 912376672. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:44,948][25689] Avg episode reward: [(0, '-0.658')] [2022-07-10 21:04:45,608][26022] Updated weights on worker 0-0, policy_version 891003 (0.00089) [2022-07-10 21:04:47,803][26022] Updated weights on worker 0-0, policy_version 891013 (0.00089) [2022-07-10 21:04:49,400][26022] Updated weights on worker 0-0, policy_version 891023 (0.00091) [2022-07-10 21:04:49,962][25689] Fps is (10 sec: 5605.6, 60 sec: 5533.9, 300 sec: 5530.7). Total num frames: 912410624. Throughput: 0: 5779.2. Samples: 912410040. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:49,962][25689] Avg episode reward: [(0, '-1.026')] [2022-07-10 21:04:51,182][26022] Updated weights on worker 0-0, policy_version 891033 (0.00080) [2022-07-10 21:04:53,050][26022] Updated weights on worker 0-0, policy_version 891043 (0.00092) [2022-07-10 21:04:54,767][26022] Updated weights on worker 0-0, policy_version 891053 (0.00088) [2022-07-10 21:04:54,965][25689] Fps is (10 sec: 5622.0, 60 sec: 5517.3, 300 sec: 5526.0). Total num frames: 912438272. Throughput: 0: 5838.5. Samples: 912444152. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:04:54,965][25689] Avg episode reward: [(0, '-0.167')] [2022-07-10 21:04:56,685][26022] Updated weights on worker 0-0, policy_version 891063 (0.00086) [2022-07-10 21:04:58,466][26022] Updated weights on worker 0-0, policy_version 891073 (0.00333) [2022-07-10 21:05:00,013][25689] Fps is (10 sec: 5603.5, 60 sec: 5540.6, 300 sec: 5539.3). Total num frames: 912466944. Throughput: 0: 5030.1. Samples: 912461042. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:00,013][25689] Avg episode reward: [(0, '-0.400')] [2022-07-10 21:05:00,367][26022] Updated weights on worker 0-0, policy_version 891083 (0.00085) [2022-07-10 21:05:02,518][26022] Updated weights on worker 0-0, policy_version 891093 (0.00095) [2022-07-10 21:05:04,422][26022] Updated weights on worker 0-0, policy_version 891103 (0.00086) [2022-07-10 21:05:05,020][25689] Fps is (10 sec: 5499.2, 60 sec: 5546.9, 300 sec: 5532.9). Total num frames: 912493568. Throughput: 0: 5775.8. Samples: 912492610. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:05,020][25689] Avg episode reward: [(0, '0.226')] [2022-07-10 21:05:05,107][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:05:05,118][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000891108_912494592.pth [2022-07-10 21:05:05,119][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000889161_910500864.pth [2022-07-10 21:05:06,229][26022] Updated weights on worker 0-0, policy_version 891113 (0.00097) [2022-07-10 21:05:08,088][26022] Updated weights on worker 0-0, policy_version 891123 (0.00093) [2022-07-10 21:05:09,844][26022] Updated weights on worker 0-0, policy_version 891133 (0.00085) [2022-07-10 21:05:10,022][25689] Fps is (10 sec: 5422.4, 60 sec: 5565.0, 300 sec: 5529.8). Total num frames: 912521216. Throughput: 0: 5783.9. Samples: 912526064. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:10,022][25689] Avg episode reward: [(0, '-0.332')] [2022-07-10 21:05:11,707][26022] Updated weights on worker 0-0, policy_version 891143 (0.00049) [2022-07-10 21:05:13,584][26022] Updated weights on worker 0-0, policy_version 891153 (0.00093) [2022-07-10 21:05:15,023][25689] Fps is (10 sec: 5528.0, 60 sec: 5532.8, 300 sec: 5527.1). Total num frames: 912548864. Throughput: 0: 4919.2. Samples: 912542822. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:15,023][25689] Avg episode reward: [(0, '-0.739')] [2022-07-10 21:05:15,491][26022] Updated weights on worker 0-0, policy_version 891163 (0.00087) [2022-07-10 21:05:17,292][26022] Updated weights on worker 0-0, policy_version 891173 (0.00086) [2022-07-10 21:05:19,078][26022] Updated weights on worker 0-0, policy_version 891183 (0.00091) [2022-07-10 21:05:20,079][25689] Fps is (10 sec: 5498.2, 60 sec: 5550.8, 300 sec: 5529.9). Total num frames: 912576512. Throughput: 0: 5731.6. Samples: 912576052. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:20,080][25689] Avg episode reward: [(0, '-0.085')] [2022-07-10 21:05:20,969][26022] Updated weights on worker 0-0, policy_version 891193 (0.00090) [2022-07-10 21:05:22,729][26022] Updated weights on worker 0-0, policy_version 891203 (0.00089) [2022-07-10 21:05:24,628][26022] Updated weights on worker 0-0, policy_version 891213 (0.00092) [2022-07-10 21:05:25,082][25689] Fps is (10 sec: 5395.2, 60 sec: 5501.2, 300 sec: 5523.4). Total num frames: 912603136. Throughput: 0: 5823.1. Samples: 912609434. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:25,084][25689] Avg episode reward: [(0, '-0.739')] [2022-07-10 21:05:26,375][26022] Updated weights on worker 0-0, policy_version 891223 (0.00084) [2022-07-10 21:05:28,471][26022] Updated weights on worker 0-0, policy_version 891233 (0.00084) [2022-07-10 21:05:30,102][25689] Fps is (10 sec: 5516.8, 60 sec: 5553.1, 300 sec: 5526.5). Total num frames: 912631808. Throughput: 0: 4985.6. Samples: 912626176. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:30,104][25689] Avg episode reward: [(0, '-0.625')] [2022-07-10 21:05:30,152][26022] Updated weights on worker 0-0, policy_version 891243 (0.00085) [2022-07-10 21:05:32,201][26022] Updated weights on worker 0-0, policy_version 891253 (0.00086) [2022-07-10 21:05:33,504][26022] Updated weights on worker 0-0, policy_version 891263 (0.00085) [2022-07-10 21:05:35,136][25689] Fps is (10 sec: 5601.8, 60 sec: 5519.5, 300 sec: 5527.2). Total num frames: 912659456. Throughput: 0: 5807.8. Samples: 912659636. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:35,138][25689] Avg episode reward: [(0, '-0.654')] [2022-07-10 21:05:35,813][26022] Updated weights on worker 0-0, policy_version 891273 (0.00092) [2022-07-10 21:05:37,509][26022] Updated weights on worker 0-0, policy_version 891283 (0.00089) [2022-07-10 21:05:39,415][26022] Updated weights on worker 0-0, policy_version 891293 (0.00083) [2022-07-10 21:05:40,249][25689] Fps is (10 sec: 5752.4, 60 sec: 5566.9, 300 sec: 5532.4). Total num frames: 912690176. Throughput: 0: 5804.8. Samples: 912693134. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:40,251][25689] Avg episode reward: [(0, '-0.986')] [2022-07-10 21:05:41,376][26022] Updated weights on worker 0-0, policy_version 891303 (0.00085) [2022-07-10 21:05:42,939][26022] Updated weights on worker 0-0, policy_version 891313 (0.00091) [2022-07-10 21:05:44,831][26022] Updated weights on worker 0-0, policy_version 891323 (0.00098) [2022-07-10 21:05:45,281][25689] Fps is (10 sec: 5551.3, 60 sec: 5532.9, 300 sec: 5525.1). Total num frames: 912715776. Throughput: 0: 4979.7. Samples: 912710020. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:45,282][25689] Avg episode reward: [(0, '-0.604')] [2022-07-10 21:05:46,537][26022] Updated weights on worker 0-0, policy_version 891333 (0.00084) [2022-07-10 21:05:48,594][26022] Updated weights on worker 0-0, policy_version 891343 (0.00079) [2022-07-10 21:05:50,240][26022] Updated weights on worker 0-0, policy_version 891353 (0.00085) [2022-07-10 21:05:50,375][25689] Fps is (10 sec: 5460.5, 60 sec: 5542.6, 300 sec: 5527.1). Total num frames: 912745472. Throughput: 0: 5793.5. Samples: 912743630. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:50,376][25689] Avg episode reward: [(0, '-0.001')] [2022-07-10 21:05:52,250][26022] Updated weights on worker 0-0, policy_version 891363 (0.00098) [2022-07-10 21:05:53,960][26022] Updated weights on worker 0-0, policy_version 891373 (0.00092) [2022-07-10 21:05:55,440][25689] Fps is (10 sec: 5745.8, 60 sec: 5553.9, 300 sec: 5532.3). Total num frames: 912774144. Throughput: 0: 5789.9. Samples: 912777192. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:05:55,440][25689] Avg episode reward: [(0, '-0.008')] [2022-07-10 21:05:55,903][26022] Updated weights on worker 0-0, policy_version 891383 (0.00096) [2022-07-10 21:05:57,554][26022] Updated weights on worker 0-0, policy_version 891393 (0.00092) [2022-07-10 21:05:59,573][26022] Updated weights on worker 0-0, policy_version 891403 (0.00094) [2022-07-10 21:06:00,512][25689] Fps is (10 sec: 5555.8, 60 sec: 5534.7, 300 sec: 5539.4). Total num frames: 912801792. Throughput: 0: 5788.6. Samples: 912810432. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:00,513][25689] Avg episode reward: [(0, '-0.195')] [2022-07-10 21:06:01,526][26022] Updated weights on worker 0-0, policy_version 891413 (0.00090) [2022-07-10 21:06:03,611][26022] Updated weights on worker 0-0, policy_version 891423 (0.00091) [2022-07-10 21:06:05,376][26022] Updated weights on worker 0-0, policy_version 891433 (0.00088) [2022-07-10 21:06:05,563][25689] Fps is (10 sec: 5260.1, 60 sec: 5513.8, 300 sec: 5525.5). Total num frames: 912827392. Throughput: 0: 5670.7. Samples: 912825030. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:05,563][25689] Avg episode reward: [(0, '-0.301')] [2022-07-10 21:06:07,477][26022] Updated weights on worker 0-0, policy_version 891443 (0.00053) [2022-07-10 21:06:09,067][26022] Updated weights on worker 0-0, policy_version 891453 (0.00093) [2022-07-10 21:06:10,580][25689] Fps is (10 sec: 5289.2, 60 sec: 5512.5, 300 sec: 5529.0). Total num frames: 912855040. Throughput: 0: 5681.7. Samples: 912858426. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:10,580][25689] Avg episode reward: [(0, '0.426')] [2022-07-10 21:06:11,058][26022] Updated weights on worker 0-0, policy_version 891463 (0.00615) [2022-07-10 21:06:12,790][26022] Updated weights on worker 0-0, policy_version 891473 (0.00087) [2022-07-10 21:06:14,700][26022] Updated weights on worker 0-0, policy_version 891483 (0.00080) [2022-07-10 21:06:15,603][25689] Fps is (10 sec: 5507.6, 60 sec: 5510.5, 300 sec: 5530.0). Total num frames: 912882688. Throughput: 0: 5703.1. Samples: 912892184. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:15,603][25689] Avg episode reward: [(0, '-0.100')] [2022-07-10 21:06:16,646][26022] Updated weights on worker 0-0, policy_version 891493 (0.00090) [2022-07-10 21:06:18,246][26022] Updated weights on worker 0-0, policy_version 891503 (0.00085) [2022-07-10 21:06:20,193][26022] Updated weights on worker 0-0, policy_version 891513 (0.00087) [2022-07-10 21:06:20,702][25689] Fps is (10 sec: 5665.0, 60 sec: 5540.3, 300 sec: 5525.0). Total num frames: 912912384. Throughput: 0: 4878.3. Samples: 912908924. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:20,703][25689] Avg episode reward: [(0, '0.035')] [2022-07-10 21:06:22,245][26022] Updated weights on worker 0-0, policy_version 891523 (0.00079) [2022-07-10 21:06:23,894][26022] Updated weights on worker 0-0, policy_version 891533 (0.00085) [2022-07-10 21:06:25,754][25689] Fps is (10 sec: 5548.2, 60 sec: 5535.9, 300 sec: 5535.1). Total num frames: 912939008. Throughput: 0: 5798.7. Samples: 912942112. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:25,754][25689] Avg episode reward: [(0, '-0.356')] [2022-07-10 21:06:25,867][26022] Updated weights on worker 0-0, policy_version 891543 (0.00099) [2022-07-10 21:06:27,395][26022] Updated weights on worker 0-0, policy_version 891553 (0.00087) [2022-07-10 21:06:29,663][26022] Updated weights on worker 0-0, policy_version 891563 (0.00089) [2022-07-10 21:06:30,765][25689] Fps is (10 sec: 5495.3, 60 sec: 5536.7, 300 sec: 5528.4). Total num frames: 912967680. Throughput: 0: 5790.5. Samples: 912975306. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:30,765][25689] Avg episode reward: [(0, '-0.901')] [2022-07-10 21:06:31,247][26022] Updated weights on worker 0-0, policy_version 891573 (0.00087) [2022-07-10 21:06:33,262][26022] Updated weights on worker 0-0, policy_version 891583 (0.00085) [2022-07-10 21:06:34,831][26022] Updated weights on worker 0-0, policy_version 891593 (0.00088) [2022-07-10 21:06:35,802][25689] Fps is (10 sec: 5605.2, 60 sec: 5536.5, 300 sec: 5529.2). Total num frames: 912995328. Throughput: 0: 4947.9. Samples: 912992126. Policy #0 lag: (min: 0.0, avg: 6.9, max: 18.0) [2022-07-10 21:06:35,802][25689] Avg episode reward: [(0, '-1.673')] [2022-07-10 21:06:36,934][26022] Updated weights on worker 0-0, policy_version 891603 (0.00088) [2022-07-10 21:06:38,492][26022] Updated weights on worker 0-0, policy_version 891613 (0.00082) [2022-07-10 21:06:40,479][26022] Updated weights on worker 0-0, policy_version 891623 (0.00084) [2022-07-10 21:06:40,918][25689] Fps is (10 sec: 5547.0, 60 sec: 5502.4, 300 sec: 5535.3). Total num frames: 913024000. Throughput: 0: 5767.3. Samples: 913025514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:06:40,918][25689] Avg episode reward: [(0, '-1.130')] [2022-07-10 21:06:42,444][26022] Updated weights on worker 0-0, policy_version 891633 (0.00093) [2022-07-10 21:06:44,128][26022] Updated weights on worker 0-0, policy_version 891643 (0.00085) [2022-07-10 21:06:45,924][25689] Fps is (10 sec: 5462.5, 60 sec: 5521.6, 300 sec: 5525.4). Total num frames: 913050624. Throughput: 0: 5772.6. Samples: 913058550. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:06:45,925][25689] Avg episode reward: [(0, '-1.267')] [2022-07-10 21:06:46,166][26022] Updated weights on worker 0-0, policy_version 891653 (0.00089) [2022-07-10 21:06:47,743][26022] Updated weights on worker 0-0, policy_version 891663 (0.00081) [2022-07-10 21:06:49,698][26022] Updated weights on worker 0-0, policy_version 891673 (0.00084) [2022-07-10 21:06:50,984][25689] Fps is (10 sec: 5493.0, 60 sec: 5507.8, 300 sec: 5528.9). Total num frames: 913079296. Throughput: 0: 4935.7. Samples: 913075106. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:06:50,985][25689] Avg episode reward: [(0, '-1.414')] [2022-07-10 21:06:51,657][26022] Updated weights on worker 0-0, policy_version 891683 (0.00084) [2022-07-10 21:06:53,297][26022] Updated weights on worker 0-0, policy_version 891693 (0.00092) [2022-07-10 21:06:55,412][26022] Updated weights on worker 0-0, policy_version 891703 (0.00087) [2022-07-10 21:06:56,011][25689] Fps is (10 sec: 5685.1, 60 sec: 5511.2, 300 sec: 5532.9). Total num frames: 913107968. Throughput: 0: 5752.1. Samples: 913108374. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:06:56,012][25689] Avg episode reward: [(0, '-1.508')] [2022-07-10 21:06:57,283][26022] Updated weights on worker 0-0, policy_version 891713 (0.00089) [2022-07-10 21:06:59,041][26022] Updated weights on worker 0-0, policy_version 891723 (0.00091) [2022-07-10 21:07:00,990][26022] Updated weights on worker 0-0, policy_version 891733 (0.00099) [2022-07-10 21:07:01,091][25689] Fps is (10 sec: 5471.2, 60 sec: 5493.7, 300 sec: 5528.3). Total num frames: 913134592. Throughput: 0: 5752.8. Samples: 913141566. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:01,092][25689] Avg episode reward: [(0, '-0.383')] [2022-07-10 21:07:03,083][26022] Updated weights on worker 0-0, policy_version 891743 (0.00068) [2022-07-10 21:07:05,001][26022] Updated weights on worker 0-0, policy_version 891753 (0.00090) [2022-07-10 21:07:05,363][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:07:05,377][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000891755_913157120.pth [2022-07-10 21:07:05,378][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000889811_911166464.pth [2022-07-10 21:07:06,188][25689] Fps is (10 sec: 5232.2, 60 sec: 5506.3, 300 sec: 5526.9). Total num frames: 913161216. Throughput: 0: 4810.0. Samples: 913156018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:06,189][25689] Avg episode reward: [(0, '-0.448')] [2022-07-10 21:07:06,956][26022] Updated weights on worker 0-0, policy_version 891763 (0.00082) [2022-07-10 21:07:08,516][26022] Updated weights on worker 0-0, policy_version 891773 (0.00083) [2022-07-10 21:07:10,706][26022] Updated weights on worker 0-0, policy_version 891783 (0.00101) [2022-07-10 21:07:11,241][25689] Fps is (10 sec: 5347.3, 60 sec: 5503.1, 300 sec: 5526.1). Total num frames: 913188864. Throughput: 0: 5644.0. Samples: 913189432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:11,241][25689] Avg episode reward: [(0, '-0.592')] [2022-07-10 21:07:12,272][26022] Updated weights on worker 0-0, policy_version 891793 (0.00091) [2022-07-10 21:07:14,463][26022] Updated weights on worker 0-0, policy_version 891803 (0.00081) [2022-07-10 21:07:15,839][26022] Updated weights on worker 0-0, policy_version 891813 (0.00083) [2022-07-10 21:07:16,271][25689] Fps is (10 sec: 5484.3, 60 sec: 5502.5, 300 sec: 5524.6). Total num frames: 913216512. Throughput: 0: 5654.1. Samples: 913222924. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:16,271][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 21:07:17,920][26022] Updated weights on worker 0-0, policy_version 891823 (0.00094) [2022-07-10 21:07:19,862][26022] Updated weights on worker 0-0, policy_version 891833 (0.00091) [2022-07-10 21:07:21,341][25689] Fps is (10 sec: 5474.9, 60 sec: 5471.4, 300 sec: 5520.2). Total num frames: 913244160. Throughput: 0: 4831.4. Samples: 913239398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:21,341][25689] Avg episode reward: [(0, '0.370')] [2022-07-10 21:07:21,665][26022] Updated weights on worker 0-0, policy_version 891843 (0.00414) [2022-07-10 21:07:23,487][26022] Updated weights on worker 0-0, policy_version 891853 (0.00085) [2022-07-10 21:07:25,388][26022] Updated weights on worker 0-0, policy_version 891863 (0.00093) [2022-07-10 21:07:26,410][25689] Fps is (10 sec: 5655.5, 60 sec: 5520.4, 300 sec: 5523.1). Total num frames: 913273856. Throughput: 0: 5776.5. Samples: 913272830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:26,411][25689] Avg episode reward: [(0, '-0.853')] [2022-07-10 21:07:27,225][26022] Updated weights on worker 0-0, policy_version 891873 (0.00096) [2022-07-10 21:07:29,159][26022] Updated weights on worker 0-0, policy_version 891883 (0.00086) [2022-07-10 21:07:30,716][26022] Updated weights on worker 0-0, policy_version 891893 (0.00088) [2022-07-10 21:07:31,492][25689] Fps is (10 sec: 5548.0, 60 sec: 5480.2, 300 sec: 5518.3). Total num frames: 913300480. Throughput: 0: 5740.5. Samples: 913305684. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:31,492][25689] Avg episode reward: [(0, '-0.520')] [2022-07-10 21:07:32,841][26022] Updated weights on worker 0-0, policy_version 891903 (0.00088) [2022-07-10 21:07:34,643][26022] Updated weights on worker 0-0, policy_version 891913 (0.00085) [2022-07-10 21:07:36,444][26022] Updated weights on worker 0-0, policy_version 891923 (0.00090) [2022-07-10 21:07:36,540][25689] Fps is (10 sec: 5458.9, 60 sec: 5496.1, 300 sec: 5522.4). Total num frames: 913329152. Throughput: 0: 4905.0. Samples: 913322344. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:36,540][25689] Avg episode reward: [(0, '-1.052')] [2022-07-10 21:07:38,214][26022] Updated weights on worker 0-0, policy_version 891933 (0.00088) [2022-07-10 21:07:40,086][26022] Updated weights on worker 0-0, policy_version 891943 (0.00083) [2022-07-10 21:07:41,655][25689] Fps is (10 sec: 5642.3, 60 sec: 5496.2, 300 sec: 5520.9). Total num frames: 913357824. Throughput: 0: 5740.6. Samples: 913356016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:41,656][25689] Avg episode reward: [(0, '-1.437')] [2022-07-10 21:07:41,899][26022] Updated weights on worker 0-0, policy_version 891953 (0.00088) [2022-07-10 21:07:43,934][26022] Updated weights on worker 0-0, policy_version 891963 (0.00496) [2022-07-10 21:07:45,533][26022] Updated weights on worker 0-0, policy_version 891973 (0.00085) [2022-07-10 21:07:46,687][25689] Fps is (10 sec: 5449.4, 60 sec: 5493.9, 300 sec: 5510.2). Total num frames: 913384448. Throughput: 0: 5744.1. Samples: 913389302. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:46,687][25689] Avg episode reward: [(0, '-1.662')] [2022-07-10 21:07:47,563][26022] Updated weights on worker 0-0, policy_version 891983 (0.00085) [2022-07-10 21:07:49,288][26022] Updated weights on worker 0-0, policy_version 891993 (0.00094) [2022-07-10 21:07:51,215][26022] Updated weights on worker 0-0, policy_version 892003 (0.00085) [2022-07-10 21:07:51,693][25689] Fps is (10 sec: 5508.6, 60 sec: 5498.8, 300 sec: 5520.4). Total num frames: 913413120. Throughput: 0: 5789.8. Samples: 913422646. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:51,694][25689] Avg episode reward: [(0, '-1.034')] [2022-07-10 21:07:53,163][26022] Updated weights on worker 0-0, policy_version 892013 (0.00089) [2022-07-10 21:07:54,908][26022] Updated weights on worker 0-0, policy_version 892023 (0.00091) [2022-07-10 21:07:56,719][25689] Fps is (10 sec: 5614.2, 60 sec: 5482.0, 300 sec: 5515.2). Total num frames: 913440768. Throughput: 0: 5800.3. Samples: 913439388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:07:56,720][25689] Avg episode reward: [(0, '-0.297')] [2022-07-10 21:07:56,932][26022] Updated weights on worker 0-0, policy_version 892033 (0.00083) [2022-07-10 21:07:58,600][26022] Updated weights on worker 0-0, policy_version 892043 (0.00102) [2022-07-10 21:08:00,628][26022] Updated weights on worker 0-0, policy_version 892053 (0.00120) [2022-07-10 21:08:01,804][25689] Fps is (10 sec: 5570.1, 60 sec: 5515.2, 300 sec: 5520.6). Total num frames: 913469440. Throughput: 0: 5793.7. Samples: 913472756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:01,805][25689] Avg episode reward: [(0, '-0.103')] [2022-07-10 21:08:02,545][26022] Updated weights on worker 0-0, policy_version 892063 (0.00079) [2022-07-10 21:08:04,493][26022] Updated weights on worker 0-0, policy_version 892073 (0.00092) [2022-07-10 21:08:06,312][26022] Updated weights on worker 0-0, policy_version 892083 (0.00093) [2022-07-10 21:08:06,881][25689] Fps is (10 sec: 5340.5, 60 sec: 5500.2, 300 sec: 5516.8). Total num frames: 913495040. Throughput: 0: 5691.9. Samples: 913504244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:06,882][25689] Avg episode reward: [(0, '-0.238')] [2022-07-10 21:08:08,115][26022] Updated weights on worker 0-0, policy_version 892093 (0.00085) [2022-07-10 21:08:10,188][26022] Updated weights on worker 0-0, policy_version 892103 (0.00085) [2022-07-10 21:08:11,704][26022] Updated weights on worker 0-0, policy_version 892113 (0.00092) [2022-07-10 21:08:11,921][25689] Fps is (10 sec: 5466.0, 60 sec: 5535.1, 300 sec: 5523.1). Total num frames: 913524736. Throughput: 0: 4858.5. Samples: 913520922. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:11,921][25689] Avg episode reward: [(0, '-1.126')] [2022-07-10 21:08:13,907][26022] Updated weights on worker 0-0, policy_version 892123 (0.00096) [2022-07-10 21:08:15,432][26022] Updated weights on worker 0-0, policy_version 892133 (0.00091) [2022-07-10 21:08:17,019][25689] Fps is (10 sec: 5454.2, 60 sec: 5495.2, 300 sec: 5516.3). Total num frames: 913550336. Throughput: 0: 5638.6. Samples: 913553856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:17,020][25689] Avg episode reward: [(0, '-0.776')] [2022-07-10 21:08:17,532][26022] Updated weights on worker 0-0, policy_version 892143 (0.00091) [2022-07-10 21:08:19,325][26022] Updated weights on worker 0-0, policy_version 892153 (0.00092) [2022-07-10 21:08:21,330][26022] Updated weights on worker 0-0, policy_version 892163 (0.00088) [2022-07-10 21:08:22,077][25689] Fps is (10 sec: 5444.4, 60 sec: 5530.0, 300 sec: 5515.5). Total num frames: 913580032. Throughput: 0: 5649.0. Samples: 913587278. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:22,079][25689] Avg episode reward: [(0, '-0.406')] [2022-07-10 21:08:22,992][26022] Updated weights on worker 0-0, policy_version 892173 (0.00081) [2022-07-10 21:08:24,842][26022] Updated weights on worker 0-0, policy_version 892183 (0.00093) [2022-07-10 21:08:26,459][26022] Updated weights on worker 0-0, policy_version 892193 (0.00088) [2022-07-10 21:08:27,080][25689] Fps is (10 sec: 5801.6, 60 sec: 5519.2, 300 sec: 5523.6). Total num frames: 913608704. Throughput: 0: 4958.5. Samples: 913604400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:27,080][25689] Avg episode reward: [(0, '-1.018')] [2022-07-10 21:08:28,560][26022] Updated weights on worker 0-0, policy_version 892203 (0.00089) [2022-07-10 21:08:30,109][26022] Updated weights on worker 0-0, policy_version 892213 (0.00088) [2022-07-10 21:08:32,083][25689] Fps is (10 sec: 5424.3, 60 sec: 5509.5, 300 sec: 5516.7). Total num frames: 913634304. Throughput: 0: 5797.3. Samples: 913637806. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:32,083][25689] Avg episode reward: [(0, '-0.362')] [2022-07-10 21:08:32,268][26022] Updated weights on worker 0-0, policy_version 892223 (0.00106) [2022-07-10 21:08:33,888][26022] Updated weights on worker 0-0, policy_version 892233 (0.00091) [2022-07-10 21:08:35,781][26022] Updated weights on worker 0-0, policy_version 892243 (0.00096) [2022-07-10 21:08:37,097][25689] Fps is (10 sec: 5520.0, 60 sec: 5529.4, 300 sec: 5517.0). Total num frames: 913664000. Throughput: 0: 5834.3. Samples: 913670998. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:37,098][25689] Avg episode reward: [(0, '-1.338')] [2022-07-10 21:08:37,788][26022] Updated weights on worker 0-0, policy_version 892253 (0.00088) [2022-07-10 21:08:39,409][26022] Updated weights on worker 0-0, policy_version 892263 (0.00090) [2022-07-10 21:08:41,551][26022] Updated weights on worker 0-0, policy_version 892273 (0.00093) [2022-07-10 21:08:42,165][25689] Fps is (10 sec: 5687.6, 60 sec: 5516.9, 300 sec: 5520.1). Total num frames: 913691648. Throughput: 0: 4999.4. Samples: 913687704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:42,166][25689] Avg episode reward: [(0, '-0.711')] [2022-07-10 21:08:43,176][26022] Updated weights on worker 0-0, policy_version 892283 (0.00090) [2022-07-10 21:08:45,162][26022] Updated weights on worker 0-0, policy_version 892293 (0.00094) [2022-07-10 21:08:47,115][26022] Updated weights on worker 0-0, policy_version 892303 (0.00097) [2022-07-10 21:08:47,172][25689] Fps is (10 sec: 5387.1, 60 sec: 5519.1, 300 sec: 5516.9). Total num frames: 913718272. Throughput: 0: 5793.3. Samples: 913720798. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:47,172][25689] Avg episode reward: [(0, '-0.903')] [2022-07-10 21:08:48,668][26022] Updated weights on worker 0-0, policy_version 892313 (0.00087) [2022-07-10 21:08:50,795][26022] Updated weights on worker 0-0, policy_version 892323 (0.00091) [2022-07-10 21:08:52,181][25689] Fps is (10 sec: 5623.3, 60 sec: 5535.8, 300 sec: 5520.3). Total num frames: 913747968. Throughput: 0: 5778.3. Samples: 913753938. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:52,181][25689] Avg episode reward: [(0, '-0.167')] [2022-07-10 21:08:52,449][26022] Updated weights on worker 0-0, policy_version 892333 (0.00095) [2022-07-10 21:08:54,537][26022] Updated weights on worker 0-0, policy_version 892343 (0.00090) [2022-07-10 21:08:56,119][26022] Updated weights on worker 0-0, policy_version 892353 (0.00494) [2022-07-10 21:08:57,212][25689] Fps is (10 sec: 5609.5, 60 sec: 5518.4, 300 sec: 5518.5). Total num frames: 913774592. Throughput: 0: 4955.9. Samples: 913770684. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:08:57,213][25689] Avg episode reward: [(0, '-0.075')] [2022-07-10 21:08:58,092][26022] Updated weights on worker 0-0, policy_version 892363 (0.00085) [2022-07-10 21:08:59,723][26022] Updated weights on worker 0-0, policy_version 892373 (0.00085) [2022-07-10 21:09:01,841][26022] Updated weights on worker 0-0, policy_version 892383 (0.00098) [2022-07-10 21:09:02,252][25689] Fps is (10 sec: 5287.3, 60 sec: 5488.7, 300 sec: 5519.2). Total num frames: 913801216. Throughput: 0: 5796.5. Samples: 913804136. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:02,252][25689] Avg episode reward: [(0, '-0.875')] [2022-07-10 21:09:03,976][26022] Updated weights on worker 0-0, policy_version 892393 (0.00089) [2022-07-10 21:09:05,414][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:09:05,427][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000892401_913818624.pth [2022-07-10 21:09:05,427][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000890459_911830016.pth [2022-07-10 21:09:05,887][26022] Updated weights on worker 0-0, policy_version 892403 (0.00085) [2022-07-10 21:09:07,268][25689] Fps is (10 sec: 5396.8, 60 sec: 5528.1, 300 sec: 5522.6). Total num frames: 913828864. Throughput: 0: 5708.3. Samples: 913835514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:07,269][25689] Avg episode reward: [(0, '-1.056')] [2022-07-10 21:09:07,655][26022] Updated weights on worker 0-0, policy_version 892413 (0.00082) [2022-07-10 21:09:09,348][26022] Updated weights on worker 0-0, policy_version 892423 (0.00108) [2022-07-10 21:09:11,287][26022] Updated weights on worker 0-0, policy_version 892433 (0.00163) [2022-07-10 21:09:12,282][25689] Fps is (10 sec: 5512.9, 60 sec: 5496.5, 300 sec: 5515.8). Total num frames: 913856512. Throughput: 0: 4891.1. Samples: 913852256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:12,286][25689] Avg episode reward: [(0, '-1.520')] [2022-07-10 21:09:13,176][26022] Updated weights on worker 0-0, policy_version 892443 (0.00093) [2022-07-10 21:09:15,016][26022] Updated weights on worker 0-0, policy_version 892453 (0.00092) [2022-07-10 21:09:16,766][26022] Updated weights on worker 0-0, policy_version 892463 (0.00865) [2022-07-10 21:09:17,303][25689] Fps is (10 sec: 5510.7, 60 sec: 5537.6, 300 sec: 5520.1). Total num frames: 913884160. Throughput: 0: 5718.1. Samples: 913885564. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:17,303][25689] Avg episode reward: [(0, '-1.553')] [2022-07-10 21:09:18,624][26022] Updated weights on worker 0-0, policy_version 892473 (0.00092) [2022-07-10 21:09:20,611][26022] Updated weights on worker 0-0, policy_version 892483 (0.00087) [2022-07-10 21:09:22,423][25689] Fps is (10 sec: 5452.8, 60 sec: 5498.0, 300 sec: 5511.3). Total num frames: 913911808. Throughput: 0: 5697.2. Samples: 913919056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:22,423][25689] Avg episode reward: [(0, '-2.451')] [2022-07-10 21:09:22,490][26022] Updated weights on worker 0-0, policy_version 892493 (0.00093) [2022-07-10 21:09:24,215][26022] Updated weights on worker 0-0, policy_version 892503 (0.00090) [2022-07-10 21:09:26,318][26022] Updated weights on worker 0-0, policy_version 892513 (0.00078) [2022-07-10 21:09:27,447][25689] Fps is (10 sec: 5552.0, 60 sec: 5496.0, 300 sec: 5521.8). Total num frames: 913940480. Throughput: 0: 4970.8. Samples: 913935818. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:27,447][25689] Avg episode reward: [(0, '-2.872')] [2022-07-10 21:09:27,918][26022] Updated weights on worker 0-0, policy_version 892523 (0.00092) [2022-07-10 21:09:29,932][26022] Updated weights on worker 0-0, policy_version 892533 (0.00089) [2022-07-10 21:09:31,528][26022] Updated weights on worker 0-0, policy_version 892543 (0.00094) [2022-07-10 21:09:32,517][25689] Fps is (10 sec: 5579.6, 60 sec: 5523.8, 300 sec: 5514.3). Total num frames: 913968128. Throughput: 0: 5779.7. Samples: 913969208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:32,517][25689] Avg episode reward: [(0, '-1.632')] [2022-07-10 21:09:33,539][26022] Updated weights on worker 0-0, policy_version 892553 (0.00097) [2022-07-10 21:09:35,321][26022] Updated weights on worker 0-0, policy_version 892563 (0.00091) [2022-07-10 21:09:37,216][26022] Updated weights on worker 0-0, policy_version 892573 (0.00085) [2022-07-10 21:09:37,522][25689] Fps is (10 sec: 5488.1, 60 sec: 5490.7, 300 sec: 5515.6). Total num frames: 913995776. Throughput: 0: 5764.0. Samples: 914002110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:37,524][25689] Avg episode reward: [(0, '-0.830')] [2022-07-10 21:09:39,082][26022] Updated weights on worker 0-0, policy_version 892583 (0.00086) [2022-07-10 21:09:40,746][26022] Updated weights on worker 0-0, policy_version 892593 (0.00091) [2022-07-10 21:09:42,580][25689] Fps is (10 sec: 5596.3, 60 sec: 5508.5, 300 sec: 5518.5). Total num frames: 914024448. Throughput: 0: 4958.5. Samples: 914019008. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:42,581][25689] Avg episode reward: [(0, '-0.576')] [2022-07-10 21:09:42,710][26022] Updated weights on worker 0-0, policy_version 892603 (0.00095) [2022-07-10 21:09:44,764][26022] Updated weights on worker 0-0, policy_version 892613 (0.00085) [2022-07-10 21:09:46,378][26022] Updated weights on worker 0-0, policy_version 892623 (0.00087) [2022-07-10 21:09:47,596][25689] Fps is (10 sec: 5590.6, 60 sec: 5524.6, 300 sec: 5515.1). Total num frames: 914052096. Throughput: 0: 5786.3. Samples: 914052410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:47,597][25689] Avg episode reward: [(0, '-0.886')] [2022-07-10 21:09:48,369][26022] Updated weights on worker 0-0, policy_version 892633 (0.00088) [2022-07-10 21:09:49,835][26022] Updated weights on worker 0-0, policy_version 892643 (0.00072) [2022-07-10 21:09:52,127][26022] Updated weights on worker 0-0, policy_version 892653 (0.00094) [2022-07-10 21:09:52,611][25689] Fps is (10 sec: 5512.9, 60 sec: 5490.2, 300 sec: 5514.8). Total num frames: 914079744. Throughput: 0: 5807.6. Samples: 914085906. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:52,613][25689] Avg episode reward: [(0, '-0.211')] [2022-07-10 21:09:53,602][26022] Updated weights on worker 0-0, policy_version 892663 (0.00087) [2022-07-10 21:09:55,648][26022] Updated weights on worker 0-0, policy_version 892673 (0.00088) [2022-07-10 21:09:57,465][26022] Updated weights on worker 0-0, policy_version 892683 (0.00095) [2022-07-10 21:09:57,621][25689] Fps is (10 sec: 5618.0, 60 sec: 5526.0, 300 sec: 5515.5). Total num frames: 914108416. Throughput: 0: 5004.0. Samples: 914102686. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:09:57,623][25689] Avg episode reward: [(0, '-0.104')] [2022-07-10 21:09:59,319][26022] Updated weights on worker 0-0, policy_version 892693 (0.00088) [2022-07-10 21:10:01,187][26022] Updated weights on worker 0-0, policy_version 892703 (0.00088) [2022-07-10 21:10:02,723][25689] Fps is (10 sec: 5468.1, 60 sec: 5520.3, 300 sec: 5513.8). Total num frames: 914135040. Throughput: 0: 5806.7. Samples: 914135972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:10:02,724][25689] Avg episode reward: [(0, '0.039')] [2022-07-10 21:10:03,415][26022] Updated weights on worker 0-0, policy_version 892713 (0.00083) [2022-07-10 21:10:04,979][26022] Updated weights on worker 0-0, policy_version 892723 (0.00087) [2022-07-10 21:10:07,171][26022] Updated weights on worker 0-0, policy_version 892733 (0.00085) [2022-07-10 21:10:07,775][25689] Fps is (10 sec: 5244.4, 60 sec: 5500.3, 300 sec: 5509.4). Total num frames: 914161664. Throughput: 0: 5700.3. Samples: 914167432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:10:07,775][25689] Avg episode reward: [(0, '-0.123')] [2022-07-10 21:10:08,799][26022] Updated weights on worker 0-0, policy_version 892743 (0.00099) [2022-07-10 21:10:10,797][26022] Updated weights on worker 0-0, policy_version 892753 (0.00086) [2022-07-10 21:10:12,516][26022] Updated weights on worker 0-0, policy_version 892763 (0.00096) [2022-07-10 21:10:12,786][25689] Fps is (10 sec: 5495.0, 60 sec: 5517.4, 300 sec: 5512.7). Total num frames: 914190336. Throughput: 0: 4874.6. Samples: 914184254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:10:12,787][25689] Avg episode reward: [(0, '-0.277')] [2022-07-10 21:10:14,330][26022] Updated weights on worker 0-0, policy_version 892773 (0.00054) [2022-07-10 21:10:16,303][26022] Updated weights on worker 0-0, policy_version 892783 (0.00302) [2022-07-10 21:10:17,822][25689] Fps is (10 sec: 5503.4, 60 sec: 5499.1, 300 sec: 5509.6). Total num frames: 914216960. Throughput: 0: 5690.3. Samples: 914217634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 21:10:17,823][25689] Avg episode reward: [(0, '-0.061')] [2022-07-10 21:10:18,157][26022] Updated weights on worker 0-0, policy_version 892793 (0.00085) [2022-07-10 21:10:19,724][26022] Updated weights on worker 0-0, policy_version 892803 (0.00083) [2022-07-10 21:10:21,913][26022] Updated weights on worker 0-0, policy_version 892813 (0.00085) [2022-07-10 21:10:22,878][25689] Fps is (10 sec: 5682.2, 60 sec: 5555.7, 300 sec: 5522.4). Total num frames: 914247680. Throughput: 0: 5711.5. Samples: 914251086. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:22,878][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 21:10:23,350][26022] Updated weights on worker 0-0, policy_version 892823 (0.00088) [2022-07-10 21:10:25,487][26022] Updated weights on worker 0-0, policy_version 892833 (0.00089) [2022-07-10 21:10:27,169][26022] Updated weights on worker 0-0, policy_version 892843 (0.00084) [2022-07-10 21:10:27,898][25689] Fps is (10 sec: 5691.0, 60 sec: 5522.2, 300 sec: 5515.5). Total num frames: 914274304. Throughput: 0: 4992.7. Samples: 914267904. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:27,899][25689] Avg episode reward: [(0, '-0.048')] [2022-07-10 21:10:29,132][26022] Updated weights on worker 0-0, policy_version 892853 (0.00080) [2022-07-10 21:10:30,923][26022] Updated weights on worker 0-0, policy_version 892863 (0.00086) [2022-07-10 21:10:32,737][26022] Updated weights on worker 0-0, policy_version 892873 (0.00088) [2022-07-10 21:10:32,915][25689] Fps is (10 sec: 5407.3, 60 sec: 5527.1, 300 sec: 5515.8). Total num frames: 914301952. Throughput: 0: 5815.8. Samples: 914301318. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:32,915][25689] Avg episode reward: [(0, '-0.168')] [2022-07-10 21:10:34,455][26022] Updated weights on worker 0-0, policy_version 892883 (0.00093) [2022-07-10 21:10:36,399][26022] Updated weights on worker 0-0, policy_version 892893 (0.00089) [2022-07-10 21:10:37,952][25689] Fps is (10 sec: 5704.1, 60 sec: 5558.1, 300 sec: 5513.8). Total num frames: 914331648. Throughput: 0: 5823.6. Samples: 914334860. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:37,954][25689] Avg episode reward: [(0, '0.225')] [2022-07-10 21:10:38,158][26022] Updated weights on worker 0-0, policy_version 892903 (0.00085) [2022-07-10 21:10:40,210][26022] Updated weights on worker 0-0, policy_version 892913 (0.00094) [2022-07-10 21:10:41,903][26022] Updated weights on worker 0-0, policy_version 892923 (0.00087) [2022-07-10 21:10:43,003][25689] Fps is (10 sec: 5684.4, 60 sec: 5541.8, 300 sec: 5520.3). Total num frames: 914359296. Throughput: 0: 5837.6. Samples: 914368566. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:43,003][25689] Avg episode reward: [(0, '0.191')] [2022-07-10 21:10:43,786][26022] Updated weights on worker 0-0, policy_version 892933 (0.00086) [2022-07-10 21:10:45,451][26022] Updated weights on worker 0-0, policy_version 892943 (0.00090) [2022-07-10 21:10:47,266][26022] Updated weights on worker 0-0, policy_version 892953 (0.00094) [2022-07-10 21:10:48,016][25689] Fps is (10 sec: 5494.3, 60 sec: 5542.0, 300 sec: 5514.9). Total num frames: 914386944. Throughput: 0: 5846.8. Samples: 914385526. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:48,016][25689] Avg episode reward: [(0, '0.546')] [2022-07-10 21:10:49,222][26022] Updated weights on worker 0-0, policy_version 892963 (0.00089) [2022-07-10 21:10:50,907][26022] Updated weights on worker 0-0, policy_version 892973 (0.00084) [2022-07-10 21:10:52,918][26022] Updated weights on worker 0-0, policy_version 892983 (0.00086) [2022-07-10 21:10:53,027][25689] Fps is (10 sec: 5516.4, 60 sec: 5542.4, 300 sec: 5512.5). Total num frames: 914414592. Throughput: 0: 5838.4. Samples: 914418740. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:53,027][25689] Avg episode reward: [(0, '0.227')] [2022-07-10 21:10:54,531][26022] Updated weights on worker 0-0, policy_version 892993 (0.00182) [2022-07-10 21:10:56,697][26022] Updated weights on worker 0-0, policy_version 893003 (0.00091) [2022-07-10 21:10:58,029][25689] Fps is (10 sec: 5624.8, 60 sec: 5543.2, 300 sec: 5517.2). Total num frames: 914443264. Throughput: 0: 5862.8. Samples: 914452568. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:10:58,029][25689] Avg episode reward: [(0, '0.587')] [2022-07-10 21:10:58,395][26022] Updated weights on worker 0-0, policy_version 893013 (0.00090) [2022-07-10 21:11:00,232][26022] Updated weights on worker 0-0, policy_version 893023 (0.00086) [2022-07-10 21:11:02,456][26022] Updated weights on worker 0-0, policy_version 893033 (0.00092) [2022-07-10 21:11:03,095][25689] Fps is (10 sec: 5491.8, 60 sec: 5546.4, 300 sec: 5520.4). Total num frames: 914469888. Throughput: 0: 5013.3. Samples: 914469298. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:03,096][25689] Avg episode reward: [(0, '0.756')] [2022-07-10 21:11:04,139][26022] Updated weights on worker 0-0, policy_version 893043 (0.00084) [2022-07-10 21:11:05,664][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:11:05,681][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000893050_914483200.pth [2022-07-10 21:11:05,681][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000891108_912494592.pth [2022-07-10 21:11:06,192][26022] Updated weights on worker 0-0, policy_version 893053 (0.00084) [2022-07-10 21:11:08,029][26022] Updated weights on worker 0-0, policy_version 893063 (0.00096) [2022-07-10 21:11:08,146][25689] Fps is (10 sec: 5364.4, 60 sec: 5563.4, 300 sec: 5519.8). Total num frames: 914497536. Throughput: 0: 5720.6. Samples: 914500682. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:08,146][25689] Avg episode reward: [(0, '0.440')] [2022-07-10 21:11:09,861][26022] Updated weights on worker 0-0, policy_version 893073 (0.00099) [2022-07-10 21:11:11,779][26022] Updated weights on worker 0-0, policy_version 893083 (0.00085) [2022-07-10 21:11:13,182][25689] Fps is (10 sec: 5481.9, 60 sec: 5544.2, 300 sec: 5519.5). Total num frames: 914525184. Throughput: 0: 5710.9. Samples: 914533848. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:13,183][25689] Avg episode reward: [(0, '0.613')] [2022-07-10 21:11:13,516][26022] Updated weights on worker 0-0, policy_version 893093 (0.00088) [2022-07-10 21:11:15,346][26022] Updated weights on worker 0-0, policy_version 893103 (0.00092) [2022-07-10 21:11:17,215][26022] Updated weights on worker 0-0, policy_version 893113 (0.00095) [2022-07-10 21:11:18,201][25689] Fps is (10 sec: 5499.1, 60 sec: 5562.7, 300 sec: 5514.1). Total num frames: 914552832. Throughput: 0: 4859.3. Samples: 914550592. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:18,202][25689] Avg episode reward: [(0, '0.722')] [2022-07-10 21:11:19,098][26022] Updated weights on worker 0-0, policy_version 893123 (0.00087) [2022-07-10 21:11:21,153][26022] Updated weights on worker 0-0, policy_version 893133 (0.00085) [2022-07-10 21:11:22,664][26022] Updated weights on worker 0-0, policy_version 893143 (0.00085) [2022-07-10 21:11:23,326][25689] Fps is (10 sec: 5552.3, 60 sec: 5522.5, 300 sec: 5519.7). Total num frames: 914581504. Throughput: 0: 5658.3. Samples: 914583768. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:23,326][25689] Avg episode reward: [(0, '0.907')] [2022-07-10 21:11:24,750][26022] Updated weights on worker 0-0, policy_version 893153 (0.00094) [2022-07-10 21:11:26,359][26022] Updated weights on worker 0-0, policy_version 893163 (0.00096) [2022-07-10 21:11:28,304][26022] Updated weights on worker 0-0, policy_version 893173 (0.00085) [2022-07-10 21:11:28,401][25689] Fps is (10 sec: 5521.7, 60 sec: 5534.4, 300 sec: 5515.0). Total num frames: 914609152. Throughput: 0: 5746.4. Samples: 914617076. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:28,402][25689] Avg episode reward: [(0, '-0.458')] [2022-07-10 21:11:30,074][26022] Updated weights on worker 0-0, policy_version 893183 (0.00084) [2022-07-10 21:11:31,981][26022] Updated weights on worker 0-0, policy_version 893193 (0.00084) [2022-07-10 21:11:33,419][25689] Fps is (10 sec: 5579.9, 60 sec: 5551.2, 300 sec: 5518.8). Total num frames: 914637824. Throughput: 0: 4947.8. Samples: 914633976. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:33,420][25689] Avg episode reward: [(0, '-0.506')] [2022-07-10 21:11:33,699][26022] Updated weights on worker 0-0, policy_version 893203 (0.00093) [2022-07-10 21:11:35,720][26022] Updated weights on worker 0-0, policy_version 893213 (0.00086) [2022-07-10 21:11:37,356][26022] Updated weights on worker 0-0, policy_version 893223 (0.00086) [2022-07-10 21:11:38,443][25689] Fps is (10 sec: 5608.4, 60 sec: 5518.5, 300 sec: 5517.1). Total num frames: 914665472. Throughput: 0: 5782.8. Samples: 914667646. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:38,444][25689] Avg episode reward: [(0, '-0.779')] [2022-07-10 21:11:39,358][26022] Updated weights on worker 0-0, policy_version 893233 (0.00512) [2022-07-10 21:11:41,226][26022] Updated weights on worker 0-0, policy_version 893243 (0.00084) [2022-07-10 21:11:43,009][26022] Updated weights on worker 0-0, policy_version 893253 (0.00087) [2022-07-10 21:11:43,492][25689] Fps is (10 sec: 5489.5, 60 sec: 5518.7, 300 sec: 5519.7). Total num frames: 914693120. Throughput: 0: 5823.5. Samples: 914701208. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:43,493][25689] Avg episode reward: [(0, '-0.842')] [2022-07-10 21:11:44,855][26022] Updated weights on worker 0-0, policy_version 893263 (0.00096) [2022-07-10 21:11:46,700][26022] Updated weights on worker 0-0, policy_version 893273 (0.00094) [2022-07-10 21:11:48,563][25689] Fps is (10 sec: 5464.4, 60 sec: 5513.5, 300 sec: 5516.1). Total num frames: 914720768. Throughput: 0: 5013.6. Samples: 914718158. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:48,563][25689] Avg episode reward: [(0, '-1.568')] [2022-07-10 21:11:48,599][26022] Updated weights on worker 0-0, policy_version 893283 (0.00085) [2022-07-10 21:11:50,578][26022] Updated weights on worker 0-0, policy_version 893293 (0.00083) [2022-07-10 21:11:52,143][26022] Updated weights on worker 0-0, policy_version 893303 (0.00082) [2022-07-10 21:11:53,576][25689] Fps is (10 sec: 5585.1, 60 sec: 5530.1, 300 sec: 5516.3). Total num frames: 914749440. Throughput: 0: 5821.8. Samples: 914751326. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:53,577][25689] Avg episode reward: [(0, '-1.432')] [2022-07-10 21:11:54,158][26022] Updated weights on worker 0-0, policy_version 893313 (0.00090) [2022-07-10 21:11:55,747][26022] Updated weights on worker 0-0, policy_version 893323 (0.00085) [2022-07-10 21:11:57,930][26022] Updated weights on worker 0-0, policy_version 893333 (0.00100) [2022-07-10 21:11:58,651][25689] Fps is (10 sec: 5785.7, 60 sec: 5540.4, 300 sec: 5526.8). Total num frames: 914779136. Throughput: 0: 5781.7. Samples: 914784480. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:11:58,652][25689] Avg episode reward: [(0, '0.152')] [2022-07-10 21:11:59,728][26022] Updated weights on worker 0-0, policy_version 893343 (0.00088) [2022-07-10 21:12:01,748][26022] Updated weights on worker 0-0, policy_version 893353 (0.00107) [2022-07-10 21:12:03,779][25689] Fps is (10 sec: 5118.7, 60 sec: 5467.3, 300 sec: 5512.4). Total num frames: 914801664. Throughput: 0: 4911.7. Samples: 914800850. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:03,781][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 21:12:04,011][26022] Updated weights on worker 0-0, policy_version 893363 (0.00087) [2022-07-10 21:12:05,202][26022] Updated weights on worker 0-0, policy_version 893373 (0.00090) [2022-07-10 21:12:07,583][26022] Updated weights on worker 0-0, policy_version 893383 (0.00084) [2022-07-10 21:12:08,791][25689] Fps is (10 sec: 5251.4, 60 sec: 5521.5, 300 sec: 5523.5). Total num frames: 914832384. Throughput: 0: 5649.5. Samples: 914832436. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:08,792][25689] Avg episode reward: [(0, '0.332')] [2022-07-10 21:12:09,024][26022] Updated weights on worker 0-0, policy_version 893393 (0.00093) [2022-07-10 21:12:11,033][26022] Updated weights on worker 0-0, policy_version 893403 (0.00085) [2022-07-10 21:12:13,087][26022] Updated weights on worker 0-0, policy_version 893413 (0.00083) [2022-07-10 21:12:13,799][25689] Fps is (10 sec: 5722.9, 60 sec: 5507.1, 300 sec: 5520.5). Total num frames: 914859008. Throughput: 0: 5678.4. Samples: 914866158. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:13,801][25689] Avg episode reward: [(0, '0.341')] [2022-07-10 21:12:14,561][26022] Updated weights on worker 0-0, policy_version 893423 (0.00092) [2022-07-10 21:12:16,643][26022] Updated weights on worker 0-0, policy_version 893433 (0.00094) [2022-07-10 21:12:18,459][26022] Updated weights on worker 0-0, policy_version 893443 (0.00091) [2022-07-10 21:12:18,826][25689] Fps is (10 sec: 5306.2, 60 sec: 5489.6, 300 sec: 5517.8). Total num frames: 914885632. Throughput: 0: 4880.9. Samples: 914882950. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:18,827][25689] Avg episode reward: [(0, '0.710')] [2022-07-10 21:12:20,279][26022] Updated weights on worker 0-0, policy_version 893453 (0.00088) [2022-07-10 21:12:22,325][26022] Updated weights on worker 0-0, policy_version 893463 (0.00083) [2022-07-10 21:12:23,907][25689] Fps is (10 sec: 5572.1, 60 sec: 5510.4, 300 sec: 5517.6). Total num frames: 914915328. Throughput: 0: 5718.1. Samples: 914915942. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:23,907][25689] Avg episode reward: [(0, '0.510')] [2022-07-10 21:12:23,927][26022] Updated weights on worker 0-0, policy_version 893473 (0.00083) [2022-07-10 21:12:25,739][26022] Updated weights on worker 0-0, policy_version 893483 (0.00096) [2022-07-10 21:12:27,640][26022] Updated weights on worker 0-0, policy_version 893493 (0.00087) [2022-07-10 21:12:28,919][25689] Fps is (10 sec: 5580.2, 60 sec: 5499.2, 300 sec: 5518.9). Total num frames: 914941952. Throughput: 0: 5805.4. Samples: 914949286. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:28,919][25689] Avg episode reward: [(0, '-0.353')] [2022-07-10 21:12:29,464][26022] Updated weights on worker 0-0, policy_version 893503 (0.00092) [2022-07-10 21:12:31,366][26022] Updated weights on worker 0-0, policy_version 893513 (0.00089) [2022-07-10 21:12:33,220][26022] Updated weights on worker 0-0, policy_version 893523 (0.00090) [2022-07-10 21:12:33,942][25689] Fps is (10 sec: 5510.3, 60 sec: 5498.8, 300 sec: 5519.4). Total num frames: 914970624. Throughput: 0: 4956.7. Samples: 914965998. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:33,942][25689] Avg episode reward: [(0, '-0.416')] [2022-07-10 21:12:35,051][26022] Updated weights on worker 0-0, policy_version 893533 (0.00097) [2022-07-10 21:12:36,923][26022] Updated weights on worker 0-0, policy_version 893543 (0.00088) [2022-07-10 21:12:38,817][26022] Updated weights on worker 0-0, policy_version 893553 (0.00090) [2022-07-10 21:12:38,951][25689] Fps is (10 sec: 5716.0, 60 sec: 5517.1, 300 sec: 5521.3). Total num frames: 914999296. Throughput: 0: 5784.1. Samples: 914999356. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:38,952][25689] Avg episode reward: [(0, '-0.383')] [2022-07-10 21:12:40,632][26022] Updated weights on worker 0-0, policy_version 893563 (0.00089) [2022-07-10 21:12:42,291][26022] Updated weights on worker 0-0, policy_version 893573 (0.00090) [2022-07-10 21:12:44,086][25689] Fps is (10 sec: 5652.8, 60 sec: 5526.1, 300 sec: 5526.3). Total num frames: 915027968. Throughput: 0: 5787.9. Samples: 915032738. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:44,087][25689] Avg episode reward: [(0, '-0.457')] [2022-07-10 21:12:44,378][26022] Updated weights on worker 0-0, policy_version 893583 (0.00088) [2022-07-10 21:12:46,136][26022] Updated weights on worker 0-0, policy_version 893593 (0.00089) [2022-07-10 21:12:48,078][26022] Updated weights on worker 0-0, policy_version 893603 (0.00092) [2022-07-10 21:12:49,105][25689] Fps is (10 sec: 5546.5, 60 sec: 5530.8, 300 sec: 5522.6). Total num frames: 915055616. Throughput: 0: 4959.5. Samples: 915049400. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:49,105][25689] Avg episode reward: [(0, '-0.583')] [2022-07-10 21:12:49,807][26022] Updated weights on worker 0-0, policy_version 893613 (0.00086) [2022-07-10 21:12:51,700][26022] Updated weights on worker 0-0, policy_version 893623 (0.00089) [2022-07-10 21:12:53,310][26022] Updated weights on worker 0-0, policy_version 893633 (0.00089) [2022-07-10 21:12:54,110][25689] Fps is (10 sec: 5516.3, 60 sec: 5514.7, 300 sec: 5523.0). Total num frames: 915083264. Throughput: 0: 5794.4. Samples: 915082862. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:54,111][25689] Avg episode reward: [(0, '0.165')] [2022-07-10 21:12:55,277][26022] Updated weights on worker 0-0, policy_version 893643 (0.00085) [2022-07-10 21:12:57,140][26022] Updated weights on worker 0-0, policy_version 893653 (0.00086) [2022-07-10 21:12:58,882][26022] Updated weights on worker 0-0, policy_version 893663 (0.00095) [2022-07-10 21:12:59,204][25689] Fps is (10 sec: 5576.8, 60 sec: 5496.0, 300 sec: 5522.9). Total num frames: 915111936. Throughput: 0: 5761.7. Samples: 915116048. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:12:59,204][25689] Avg episode reward: [(0, '0.522')] [2022-07-10 21:13:00,966][26022] Updated weights on worker 0-0, policy_version 893673 (0.00087) [2022-07-10 21:13:03,068][26022] Updated weights on worker 0-0, policy_version 893683 (0.00089) [2022-07-10 21:13:04,302][25689] Fps is (10 sec: 5324.8, 60 sec: 5549.4, 300 sec: 5522.5). Total num frames: 915137536. Throughput: 0: 4947.3. Samples: 915132754. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:04,303][25689] Avg episode reward: [(0, '0.553')] [2022-07-10 21:13:05,065][26022] Updated weights on worker 0-0, policy_version 893693 (0.00089) [2022-07-10 21:13:05,707][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:13:05,716][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000893698_915146752.pth [2022-07-10 21:13:05,717][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000891755_913157120.pth [2022-07-10 21:13:06,697][26022] Updated weights on worker 0-0, policy_version 893703 (0.00092) [2022-07-10 21:13:08,512][26022] Updated weights on worker 0-0, policy_version 893713 (0.00105) [2022-07-10 21:13:09,322][25689] Fps is (10 sec: 5262.7, 60 sec: 5498.0, 300 sec: 5515.9). Total num frames: 915165184. Throughput: 0: 5680.0. Samples: 915164232. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:09,323][25689] Avg episode reward: [(0, '0.462')] [2022-07-10 21:13:10,464][26022] Updated weights on worker 0-0, policy_version 893723 (0.00090) [2022-07-10 21:13:12,316][26022] Updated weights on worker 0-0, policy_version 893733 (0.00090) [2022-07-10 21:13:14,103][26022] Updated weights on worker 0-0, policy_version 893743 (0.00085) [2022-07-10 21:13:14,417][25689] Fps is (10 sec: 5568.1, 60 sec: 5523.9, 300 sec: 5526.3). Total num frames: 915193856. Throughput: 0: 5652.7. Samples: 915197652. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:14,419][25689] Avg episode reward: [(0, '-0.368')] [2022-07-10 21:13:15,981][26022] Updated weights on worker 0-0, policy_version 893753 (0.00084) [2022-07-10 21:13:17,819][26022] Updated weights on worker 0-0, policy_version 893763 (0.00091) [2022-07-10 21:13:19,475][25689] Fps is (10 sec: 5446.1, 60 sec: 5521.0, 300 sec: 5516.0). Total num frames: 915220480. Throughput: 0: 5664.2. Samples: 915230870. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:19,476][25689] Avg episode reward: [(0, '-0.062')] [2022-07-10 21:13:19,648][26022] Updated weights on worker 0-0, policy_version 893773 (0.00086) [2022-07-10 21:13:21,616][26022] Updated weights on worker 0-0, policy_version 893783 (0.00086) [2022-07-10 21:13:23,289][26022] Updated weights on worker 0-0, policy_version 893793 (0.00102) [2022-07-10 21:13:24,573][25689] Fps is (10 sec: 5545.5, 60 sec: 5519.5, 300 sec: 5517.7). Total num frames: 915250176. Throughput: 0: 5666.0. Samples: 915247608. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:24,574][25689] Avg episode reward: [(0, '0.175')] [2022-07-10 21:13:25,388][26022] Updated weights on worker 0-0, policy_version 893803 (0.00090) [2022-07-10 21:13:27,023][26022] Updated weights on worker 0-0, policy_version 893813 (0.00088) [2022-07-10 21:13:29,194][26022] Updated weights on worker 0-0, policy_version 893823 (0.00087) [2022-07-10 21:13:29,646][25689] Fps is (10 sec: 5638.3, 60 sec: 5530.8, 300 sec: 5523.3). Total num frames: 915277824. Throughput: 0: 5734.1. Samples: 915280770. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:29,646][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 21:13:30,715][26022] Updated weights on worker 0-0, policy_version 893833 (0.00090) [2022-07-10 21:13:32,565][26022] Updated weights on worker 0-0, policy_version 893843 (0.00086) [2022-07-10 21:13:34,400][26022] Updated weights on worker 0-0, policy_version 893853 (0.00083) [2022-07-10 21:13:34,690][25689] Fps is (10 sec: 5567.2, 60 sec: 5528.9, 300 sec: 5519.3). Total num frames: 915306496. Throughput: 0: 5754.7. Samples: 915314314. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:34,690][25689] Avg episode reward: [(0, '0.397')] [2022-07-10 21:13:36,452][26022] Updated weights on worker 0-0, policy_version 893863 (0.00087) [2022-07-10 21:13:38,055][26022] Updated weights on worker 0-0, policy_version 893873 (0.00088) [2022-07-10 21:13:39,696][25689] Fps is (10 sec: 5604.0, 60 sec: 5512.4, 300 sec: 5520.4). Total num frames: 915334144. Throughput: 0: 4958.8. Samples: 915331138. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:39,696][25689] Avg episode reward: [(0, '-0.216')] [2022-07-10 21:13:40,136][26022] Updated weights on worker 0-0, policy_version 893883 (0.00084) [2022-07-10 21:13:41,771][26022] Updated weights on worker 0-0, policy_version 893893 (0.00081) [2022-07-10 21:13:43,716][26022] Updated weights on worker 0-0, policy_version 893903 (0.00087) [2022-07-10 21:13:44,797][25689] Fps is (10 sec: 5572.1, 60 sec: 5515.4, 300 sec: 5525.6). Total num frames: 915362816. Throughput: 0: 5774.7. Samples: 915364396. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:44,798][25689] Avg episode reward: [(0, '-0.087')] [2022-07-10 21:13:45,601][26022] Updated weights on worker 0-0, policy_version 893913 (0.00092) [2022-07-10 21:13:47,286][26022] Updated weights on worker 0-0, policy_version 893923 (0.00088) [2022-07-10 21:13:49,223][26022] Updated weights on worker 0-0, policy_version 893933 (0.00084) [2022-07-10 21:13:49,816][25689] Fps is (10 sec: 5665.9, 60 sec: 5532.3, 300 sec: 5521.9). Total num frames: 915391488. Throughput: 0: 5819.0. Samples: 915398144. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:49,817][25689] Avg episode reward: [(0, '-0.402')] [2022-07-10 21:13:51,312][26022] Updated weights on worker 0-0, policy_version 893943 (0.00090) [2022-07-10 21:13:52,818][26022] Updated weights on worker 0-0, policy_version 893953 (0.00093) [2022-07-10 21:13:54,838][25689] Fps is (10 sec: 5405.1, 60 sec: 5497.1, 300 sec: 5518.7). Total num frames: 915417088. Throughput: 0: 4965.9. Samples: 915414368. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:54,838][25689] Avg episode reward: [(0, '-0.498')] [2022-07-10 21:13:55,048][26022] Updated weights on worker 0-0, policy_version 893963 (0.00091) [2022-07-10 21:13:56,460][26022] Updated weights on worker 0-0, policy_version 893973 (0.00089) [2022-07-10 21:13:58,541][26022] Updated weights on worker 0-0, policy_version 893983 (0.00092) [2022-07-10 21:13:59,859][25689] Fps is (10 sec: 5404.3, 60 sec: 5503.7, 300 sec: 5525.9). Total num frames: 915445760. Throughput: 0: 5773.0. Samples: 915447540. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:13:59,859][25689] Avg episode reward: [(0, '-0.775')] [2022-07-10 21:14:00,431][26022] Updated weights on worker 0-0, policy_version 893993 (0.00086) [2022-07-10 21:14:02,426][26022] Updated weights on worker 0-0, policy_version 894003 (0.00084) [2022-07-10 21:14:04,532][26022] Updated weights on worker 0-0, policy_version 894013 (0.00091) [2022-07-10 21:14:04,934][25689] Fps is (10 sec: 5274.0, 60 sec: 5488.9, 300 sec: 5514.5). Total num frames: 915470336. Throughput: 0: 5688.0. Samples: 915478936. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:14:04,935][25689] Avg episode reward: [(0, '-0.533')] [2022-07-10 21:14:06,200][26022] Updated weights on worker 0-0, policy_version 894023 (0.00084) [2022-07-10 21:14:08,206][26022] Updated weights on worker 0-0, policy_version 894033 (0.00089) [2022-07-10 21:14:09,980][25689] Fps is (10 sec: 5362.1, 60 sec: 5520.2, 300 sec: 5520.8). Total num frames: 915500032. Throughput: 0: 4830.7. Samples: 915495548. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:09,981][25689] Avg episode reward: [(0, '0.518')] [2022-07-10 21:14:09,981][26022] Updated weights on worker 0-0, policy_version 894043 (0.00089) [2022-07-10 21:14:11,816][26022] Updated weights on worker 0-0, policy_version 894053 (0.00052) [2022-07-10 21:14:13,713][26022] Updated weights on worker 0-0, policy_version 894063 (0.00088) [2022-07-10 21:14:15,018][25689] Fps is (10 sec: 5686.8, 60 sec: 5508.6, 300 sec: 5520.4). Total num frames: 915527680. Throughput: 0: 5690.8. Samples: 915529208. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:15,019][25689] Avg episode reward: [(0, '0.734')] [2022-07-10 21:14:15,358][26022] Updated weights on worker 0-0, policy_version 894073 (0.00085) [2022-07-10 21:14:17,239][26022] Updated weights on worker 0-0, policy_version 894083 (0.00086) [2022-07-10 21:14:19,060][26022] Updated weights on worker 0-0, policy_version 894093 (0.00087) [2022-07-10 21:14:20,026][25689] Fps is (10 sec: 5606.5, 60 sec: 5547.0, 300 sec: 5526.0). Total num frames: 915556352. Throughput: 0: 5721.4. Samples: 915562924. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:20,026][25689] Avg episode reward: [(0, '-0.028')] [2022-07-10 21:14:21,030][26022] Updated weights on worker 0-0, policy_version 894103 (0.00087) [2022-07-10 21:14:22,744][26022] Updated weights on worker 0-0, policy_version 894113 (0.00095) [2022-07-10 21:14:24,657][26022] Updated weights on worker 0-0, policy_version 894123 (0.00085) [2022-07-10 21:14:25,146][25689] Fps is (10 sec: 5560.8, 60 sec: 5511.1, 300 sec: 5520.7). Total num frames: 915584000. Throughput: 0: 4982.8. Samples: 915579648. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:25,147][25689] Avg episode reward: [(0, '-0.062')] [2022-07-10 21:14:26,323][26022] Updated weights on worker 0-0, policy_version 894133 (0.00090) [2022-07-10 21:14:28,216][26022] Updated weights on worker 0-0, policy_version 894143 (0.01118) [2022-07-10 21:14:30,115][26022] Updated weights on worker 0-0, policy_version 894153 (0.00092) [2022-07-10 21:14:30,172][25689] Fps is (10 sec: 5551.0, 60 sec: 5532.3, 300 sec: 5525.0). Total num frames: 915612672. Throughput: 0: 5831.3. Samples: 915613290. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:30,172][25689] Avg episode reward: [(0, '-0.735')] [2022-07-10 21:14:32,044][26022] Updated weights on worker 0-0, policy_version 894163 (0.00088) [2022-07-10 21:14:33,710][26022] Updated weights on worker 0-0, policy_version 894173 (0.00085) [2022-07-10 21:14:35,179][25689] Fps is (10 sec: 5613.5, 60 sec: 5518.7, 300 sec: 5525.0). Total num frames: 915640320. Throughput: 0: 5843.6. Samples: 915647020. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:35,180][25689] Avg episode reward: [(0, '-1.783')] [2022-07-10 21:14:35,417][26022] Updated weights on worker 0-0, policy_version 894183 (0.00083) [2022-07-10 21:14:37,427][26022] Updated weights on worker 0-0, policy_version 894193 (0.00087) [2022-07-10 21:14:39,090][26022] Updated weights on worker 0-0, policy_version 894203 (0.00092) [2022-07-10 21:14:40,231][25689] Fps is (10 sec: 5497.0, 60 sec: 5514.5, 300 sec: 5521.6). Total num frames: 915667968. Throughput: 0: 4994.8. Samples: 915663846. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:40,233][25689] Avg episode reward: [(0, '-1.548')] [2022-07-10 21:14:41,226][26022] Updated weights on worker 0-0, policy_version 894213 (0.00087) [2022-07-10 21:14:42,859][26022] Updated weights on worker 0-0, policy_version 894223 (0.00084) [2022-07-10 21:14:44,770][26022] Updated weights on worker 0-0, policy_version 894233 (0.00090) [2022-07-10 21:14:45,310][25689] Fps is (10 sec: 5559.2, 60 sec: 5516.6, 300 sec: 5523.9). Total num frames: 915696640. Throughput: 0: 5824.5. Samples: 915697092. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:45,312][25689] Avg episode reward: [(0, '-0.584')] [2022-07-10 21:14:46,732][26022] Updated weights on worker 0-0, policy_version 894243 (0.00084) [2022-07-10 21:14:48,538][26022] Updated weights on worker 0-0, policy_version 894253 (0.00078) [2022-07-10 21:14:50,166][26022] Updated weights on worker 0-0, policy_version 894263 (0.00092) [2022-07-10 21:14:50,313][25689] Fps is (10 sec: 5789.4, 60 sec: 5535.0, 300 sec: 5531.0). Total num frames: 915726336. Throughput: 0: 5843.5. Samples: 915730986. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:50,313][25689] Avg episode reward: [(0, '-0.940')] [2022-07-10 21:14:52,243][26022] Updated weights on worker 0-0, policy_version 894273 (0.00090) [2022-07-10 21:14:53,963][26022] Updated weights on worker 0-0, policy_version 894283 (0.00089) [2022-07-10 21:14:55,338][25689] Fps is (10 sec: 5616.4, 60 sec: 5551.6, 300 sec: 5523.9). Total num frames: 915752960. Throughput: 0: 4994.8. Samples: 915747710. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:14:55,338][25689] Avg episode reward: [(0, '-1.231')] [2022-07-10 21:14:55,858][26022] Updated weights on worker 0-0, policy_version 894293 (0.00090) [2022-07-10 21:14:57,361][26022] Updated weights on worker 0-0, policy_version 894303 (0.00082) [2022-07-10 21:14:59,579][26022] Updated weights on worker 0-0, policy_version 894313 (0.00416) [2022-07-10 21:15:00,347][25689] Fps is (10 sec: 5510.5, 60 sec: 5552.6, 300 sec: 5532.5). Total num frames: 915781632. Throughput: 0: 5828.1. Samples: 915781086. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:00,348][25689] Avg episode reward: [(0, '0.331')] [2022-07-10 21:15:01,090][26022] Updated weights on worker 0-0, policy_version 894323 (0.00098) [2022-07-10 21:15:03,610][26022] Updated weights on worker 0-0, policy_version 894333 (0.00088) [2022-07-10 21:15:05,079][26022] Updated weights on worker 0-0, policy_version 894343 (0.00084) [2022-07-10 21:15:05,379][25689] Fps is (10 sec: 5507.1, 60 sec: 5590.6, 300 sec: 5532.8). Total num frames: 915808256. Throughput: 0: 5748.7. Samples: 915812460. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:05,379][25689] Avg episode reward: [(0, '0.683')] [2022-07-10 21:15:05,910][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:15:05,924][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000894346_915810304.pth [2022-07-10 21:15:05,925][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000892401_913818624.pth [2022-07-10 21:15:07,176][26022] Updated weights on worker 0-0, policy_version 894353 (0.00082) [2022-07-10 21:15:08,935][26022] Updated weights on worker 0-0, policy_version 894363 (0.00080) [2022-07-10 21:15:10,387][25689] Fps is (10 sec: 5304.0, 60 sec: 5543.2, 300 sec: 5526.0). Total num frames: 915834880. Throughput: 0: 4895.8. Samples: 915829264. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:10,387][25689] Avg episode reward: [(0, '0.513')] [2022-07-10 21:15:10,862][26022] Updated weights on worker 0-0, policy_version 894373 (0.00090) [2022-07-10 21:15:12,728][26022] Updated weights on worker 0-0, policy_version 894383 (0.00052) [2022-07-10 21:15:14,682][26022] Updated weights on worker 0-0, policy_version 894393 (0.00085) [2022-07-10 21:15:15,396][25689] Fps is (10 sec: 5315.6, 60 sec: 5528.9, 300 sec: 5526.5). Total num frames: 915861504. Throughput: 0: 5729.2. Samples: 915862626. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:15,399][25689] Avg episode reward: [(0, '0.778')] [2022-07-10 21:15:16,307][26022] Updated weights on worker 0-0, policy_version 894403 (0.00091) [2022-07-10 21:15:18,417][26022] Updated weights on worker 0-0, policy_version 894413 (0.00087) [2022-07-10 21:15:20,101][26022] Updated weights on worker 0-0, policy_version 894423 (0.00086) [2022-07-10 21:15:20,476][25689] Fps is (10 sec: 5683.8, 60 sec: 5556.2, 300 sec: 5526.1). Total num frames: 915892224. Throughput: 0: 5694.2. Samples: 915895700. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:20,476][25689] Avg episode reward: [(0, '0.966')] [2022-07-10 21:15:22,112][26022] Updated weights on worker 0-0, policy_version 894433 (0.00086) [2022-07-10 21:15:23,799][26022] Updated weights on worker 0-0, policy_version 894443 (0.00100) [2022-07-10 21:15:25,519][25689] Fps is (10 sec: 5664.8, 60 sec: 5546.4, 300 sec: 5525.6). Total num frames: 915918848. Throughput: 0: 4948.4. Samples: 915912120. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:25,519][25689] Avg episode reward: [(0, '1.165')] [2022-07-10 21:15:25,628][26022] Updated weights on worker 0-0, policy_version 894453 (0.00086) [2022-07-10 21:15:27,572][26022] Updated weights on worker 0-0, policy_version 894463 (0.00101) [2022-07-10 21:15:29,407][26022] Updated weights on worker 0-0, policy_version 894473 (0.00090) [2022-07-10 21:15:30,532][25689] Fps is (10 sec: 5396.7, 60 sec: 5530.5, 300 sec: 5525.7). Total num frames: 915946496. Throughput: 0: 5763.2. Samples: 915945366. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:30,533][25689] Avg episode reward: [(0, '0.758')] [2022-07-10 21:15:31,203][26022] Updated weights on worker 0-0, policy_version 894483 (0.00085) [2022-07-10 21:15:32,995][26022] Updated weights on worker 0-0, policy_version 894493 (0.00090) [2022-07-10 21:15:34,935][26022] Updated weights on worker 0-0, policy_version 894503 (0.00093) [2022-07-10 21:15:35,596][25689] Fps is (10 sec: 5588.5, 60 sec: 5542.2, 300 sec: 5521.7). Total num frames: 915975168. Throughput: 0: 5744.8. Samples: 915978674. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:35,597][25689] Avg episode reward: [(0, '0.803')] [2022-07-10 21:15:36,791][26022] Updated weights on worker 0-0, policy_version 894513 (0.00089) [2022-07-10 21:15:38,525][26022] Updated weights on worker 0-0, policy_version 894523 (0.00093) [2022-07-10 21:15:40,436][26022] Updated weights on worker 0-0, policy_version 894533 (0.00088) [2022-07-10 21:15:40,606][25689] Fps is (10 sec: 5590.6, 60 sec: 5546.1, 300 sec: 5522.5). Total num frames: 916002816. Throughput: 0: 4945.4. Samples: 915995254. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:40,607][25689] Avg episode reward: [(0, '0.818')] [2022-07-10 21:15:42,180][26022] Updated weights on worker 0-0, policy_version 894543 (0.00087) [2022-07-10 21:15:44,185][26022] Updated weights on worker 0-0, policy_version 894553 (0.00087) [2022-07-10 21:15:45,710][25689] Fps is (10 sec: 5467.7, 60 sec: 5526.9, 300 sec: 5520.8). Total num frames: 916030464. Throughput: 0: 5774.2. Samples: 916028706. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:45,710][25689] Avg episode reward: [(0, '-0.426')] [2022-07-10 21:15:45,921][26022] Updated weights on worker 0-0, policy_version 894563 (0.00085) [2022-07-10 21:15:47,896][26022] Updated weights on worker 0-0, policy_version 894573 (0.00089) [2022-07-10 21:15:49,614][26022] Updated weights on worker 0-0, policy_version 894583 (0.00090) [2022-07-10 21:15:50,760][25689] Fps is (10 sec: 5446.2, 60 sec: 5488.7, 300 sec: 5520.1). Total num frames: 916058112. Throughput: 0: 5774.3. Samples: 916062164. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:50,760][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 21:15:51,484][26022] Updated weights on worker 0-0, policy_version 894593 (0.00095) [2022-07-10 21:15:53,537][26022] Updated weights on worker 0-0, policy_version 894603 (0.00087) [2022-07-10 21:15:55,315][26022] Updated weights on worker 0-0, policy_version 894613 (0.00085) [2022-07-10 21:15:55,782][25689] Fps is (10 sec: 5591.4, 60 sec: 5522.8, 300 sec: 5519.7). Total num frames: 916086784. Throughput: 0: 5787.0. Samples: 916095488. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:15:55,783][25689] Avg episode reward: [(0, '0.256')] [2022-07-10 21:15:57,158][26022] Updated weights on worker 0-0, policy_version 894623 (0.00083) [2022-07-10 21:15:58,845][26022] Updated weights on worker 0-0, policy_version 894633 (0.00627) [2022-07-10 21:16:00,707][26022] Updated weights on worker 0-0, policy_version 894643 (0.00082) [2022-07-10 21:16:00,807][25689] Fps is (10 sec: 5605.6, 60 sec: 5504.5, 300 sec: 5524.0). Total num frames: 916114432. Throughput: 0: 5790.3. Samples: 916112220. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:00,808][25689] Avg episode reward: [(0, '0.502')] [2022-07-10 21:16:03,226][26022] Updated weights on worker 0-0, policy_version 894653 (0.00092) [2022-07-10 21:16:04,680][26022] Updated weights on worker 0-0, policy_version 894663 (0.00094) [2022-07-10 21:16:05,939][25689] Fps is (10 sec: 5343.6, 60 sec: 5495.4, 300 sec: 5519.0). Total num frames: 916141056. Throughput: 0: 5684.1. Samples: 916143690. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:05,939][25689] Avg episode reward: [(0, '0.023')] [2022-07-10 21:16:06,852][26022] Updated weights on worker 0-0, policy_version 894673 (0.00088) [2022-07-10 21:16:08,459][26022] Updated weights on worker 0-0, policy_version 894683 (0.00075) [2022-07-10 21:16:10,286][26022] Updated weights on worker 0-0, policy_version 894693 (0.00084) [2022-07-10 21:16:11,024][25689] Fps is (10 sec: 5311.5, 60 sec: 5505.2, 300 sec: 5518.1). Total num frames: 916168704. Throughput: 0: 5670.3. Samples: 916177072. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:11,025][25689] Avg episode reward: [(0, '-1.562')] [2022-07-10 21:16:12,144][26022] Updated weights on worker 0-0, policy_version 894703 (0.00091) [2022-07-10 21:16:13,874][26022] Updated weights on worker 0-0, policy_version 894713 (0.00093) [2022-07-10 21:16:15,811][26022] Updated weights on worker 0-0, policy_version 894723 (0.00089) [2022-07-10 21:16:16,040][25689] Fps is (10 sec: 5575.6, 60 sec: 5538.4, 300 sec: 5521.6). Total num frames: 916197376. Throughput: 0: 4846.4. Samples: 916193664. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:16,040][25689] Avg episode reward: [(0, '-0.708')] [2022-07-10 21:16:17,917][26022] Updated weights on worker 0-0, policy_version 894733 (0.00088) [2022-07-10 21:16:19,479][26022] Updated weights on worker 0-0, policy_version 894743 (0.00087) [2022-07-10 21:16:21,067][25689] Fps is (10 sec: 5608.3, 60 sec: 5492.6, 300 sec: 5520.0). Total num frames: 916225024. Throughput: 0: 5669.3. Samples: 916227080. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:21,067][25689] Avg episode reward: [(0, '-1.273')] [2022-07-10 21:16:21,429][26022] Updated weights on worker 0-0, policy_version 894753 (0.00085) [2022-07-10 21:16:23,253][26022] Updated weights on worker 0-0, policy_version 894763 (0.00090) [2022-07-10 21:16:25,084][26022] Updated weights on worker 0-0, policy_version 894773 (0.00089) [2022-07-10 21:16:26,185][25689] Fps is (10 sec: 5349.4, 60 sec: 5485.7, 300 sec: 5515.7). Total num frames: 916251648. Throughput: 0: 5758.4. Samples: 916260278. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:26,186][25689] Avg episode reward: [(0, '-2.778')] [2022-07-10 21:16:26,997][26022] Updated weights on worker 0-0, policy_version 894783 (0.00112) [2022-07-10 21:16:28,892][26022] Updated weights on worker 0-0, policy_version 894793 (0.00094) [2022-07-10 21:16:30,496][26022] Updated weights on worker 0-0, policy_version 894803 (0.00094) [2022-07-10 21:16:31,246][25689] Fps is (10 sec: 5532.9, 60 sec: 5515.2, 300 sec: 5518.4). Total num frames: 916281344. Throughput: 0: 4936.7. Samples: 916276900. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:31,246][25689] Avg episode reward: [(0, '-3.853')] [2022-07-10 21:16:32,795][26022] Updated weights on worker 0-0, policy_version 894813 (0.00104) [2022-07-10 21:16:34,297][26022] Updated weights on worker 0-0, policy_version 894823 (0.00063) [2022-07-10 21:16:36,131][26022] Updated weights on worker 0-0, policy_version 894833 (0.00095) [2022-07-10 21:16:36,291][25689] Fps is (10 sec: 5674.6, 60 sec: 5500.1, 300 sec: 5518.0). Total num frames: 916308992. Throughput: 0: 5762.6. Samples: 916310362. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:36,291][25689] Avg episode reward: [(0, '-2.786')] [2022-07-10 21:16:38,019][26022] Updated weights on worker 0-0, policy_version 894843 (0.00078) [2022-07-10 21:16:39,962][26022] Updated weights on worker 0-0, policy_version 894853 (0.00086) [2022-07-10 21:16:41,302][25689] Fps is (10 sec: 5396.9, 60 sec: 5483.1, 300 sec: 5515.3). Total num frames: 916335616. Throughput: 0: 5762.9. Samples: 916343694. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:41,304][25689] Avg episode reward: [(0, '-1.891')] [2022-07-10 21:16:41,744][26022] Updated weights on worker 0-0, policy_version 894863 (0.00086) [2022-07-10 21:16:43,593][26022] Updated weights on worker 0-0, policy_version 894873 (0.00101) [2022-07-10 21:16:45,362][26022] Updated weights on worker 0-0, policy_version 894883 (0.00086) [2022-07-10 21:16:46,403][25689] Fps is (10 sec: 5670.9, 60 sec: 5533.9, 300 sec: 5525.0). Total num frames: 916366336. Throughput: 0: 4947.7. Samples: 916360308. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:46,404][25689] Avg episode reward: [(0, '-1.968')] [2022-07-10 21:16:47,294][26022] Updated weights on worker 0-0, policy_version 894893 (0.00086) [2022-07-10 21:16:49,056][26022] Updated weights on worker 0-0, policy_version 894903 (0.00091) [2022-07-10 21:16:50,879][26022] Updated weights on worker 0-0, policy_version 894913 (0.00087) [2022-07-10 21:16:51,435][25689] Fps is (10 sec: 5659.1, 60 sec: 5518.7, 300 sec: 5517.8). Total num frames: 916392960. Throughput: 0: 5792.1. Samples: 916393838. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:51,436][25689] Avg episode reward: [(0, '-1.429')] [2022-07-10 21:16:52,658][26022] Updated weights on worker 0-0, policy_version 894923 (0.00090) [2022-07-10 21:16:54,531][26022] Updated weights on worker 0-0, policy_version 894933 (0.00084) [2022-07-10 21:16:56,281][26022] Updated weights on worker 0-0, policy_version 894943 (0.00084) [2022-07-10 21:16:56,467][25689] Fps is (10 sec: 5596.2, 60 sec: 5534.7, 300 sec: 5518.6). Total num frames: 916422656. Throughput: 0: 5812.2. Samples: 916427628. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:16:56,467][25689] Avg episode reward: [(0, '-0.521')] [2022-07-10 21:16:58,271][26022] Updated weights on worker 0-0, policy_version 894953 (0.00087) [2022-07-10 21:16:59,830][26022] Updated weights on worker 0-0, policy_version 894963 (0.00113) [2022-07-10 21:17:01,474][25689] Fps is (10 sec: 5508.0, 60 sec: 5502.5, 300 sec: 5531.1). Total num frames: 916448256. Throughput: 0: 4991.5. Samples: 916444384. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:01,479][25689] Avg episode reward: [(0, '0.267')] [2022-07-10 21:17:02,245][26022] Updated weights on worker 0-0, policy_version 894973 (0.00090) [2022-07-10 21:17:04,167][26022] Updated weights on worker 0-0, policy_version 894983 (0.00085) [2022-07-10 21:17:05,889][26022] Updated weights on worker 0-0, policy_version 894993 (0.00094) [2022-07-10 21:17:05,995][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:17:06,011][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000894994_916473856.pth [2022-07-10 21:17:06,012][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000893050_914483200.pth [2022-07-10 21:17:06,559][25689] Fps is (10 sec: 5275.8, 60 sec: 5523.6, 300 sec: 5519.4). Total num frames: 916475904. Throughput: 0: 5724.2. Samples: 916475690. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:06,565][25689] Avg episode reward: [(0, '0.548')] [2022-07-10 21:17:07,912][26022] Updated weights on worker 0-0, policy_version 895003 (0.00113) [2022-07-10 21:17:09,639][26022] Updated weights on worker 0-0, policy_version 895013 (0.00085) [2022-07-10 21:17:11,586][25689] Fps is (10 sec: 5367.1, 60 sec: 5512.1, 300 sec: 5519.1). Total num frames: 916502528. Throughput: 0: 5710.3. Samples: 916508908. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:11,588][25689] Avg episode reward: [(0, '-0.263')] [2022-07-10 21:17:11,705][26022] Updated weights on worker 0-0, policy_version 895023 (0.00437) [2022-07-10 21:17:13,309][26022] Updated weights on worker 0-0, policy_version 895033 (0.00068) [2022-07-10 21:17:15,124][26022] Updated weights on worker 0-0, policy_version 895043 (0.00084) [2022-07-10 21:17:16,599][25689] Fps is (10 sec: 5609.6, 60 sec: 5529.2, 300 sec: 5529.7). Total num frames: 916532224. Throughput: 0: 4869.6. Samples: 916525666. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:16,600][25689] Avg episode reward: [(0, '-0.621')] [2022-07-10 21:17:17,091][26022] Updated weights on worker 0-0, policy_version 895053 (0.00122) [2022-07-10 21:17:18,743][26022] Updated weights on worker 0-0, policy_version 895063 (0.00090) [2022-07-10 21:17:20,721][26022] Updated weights on worker 0-0, policy_version 895073 (0.00083) [2022-07-10 21:17:21,617][25689] Fps is (10 sec: 5716.7, 60 sec: 5530.1, 300 sec: 5524.0). Total num frames: 916559872. Throughput: 0: 5726.2. Samples: 916559726. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:21,618][25689] Avg episode reward: [(0, '-0.545')] [2022-07-10 21:17:22,134][26022] Updated weights on worker 0-0, policy_version 895083 (0.00093) [2022-07-10 21:17:24,492][26022] Updated weights on worker 0-0, policy_version 895093 (0.00091) [2022-07-10 21:17:25,947][26022] Updated weights on worker 0-0, policy_version 895103 (0.00100) [2022-07-10 21:17:26,700][25689] Fps is (10 sec: 5576.1, 60 sec: 5567.2, 300 sec: 5529.5). Total num frames: 916588544. Throughput: 0: 5823.1. Samples: 916592968. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:26,700][25689] Avg episode reward: [(0, '-0.893')] [2022-07-10 21:17:28,054][26022] Updated weights on worker 0-0, policy_version 895113 (0.00095) [2022-07-10 21:17:29,771][26022] Updated weights on worker 0-0, policy_version 895123 (0.00084) [2022-07-10 21:17:31,709][25689] Fps is (10 sec: 5479.1, 60 sec: 5521.1, 300 sec: 5522.9). Total num frames: 916615168. Throughput: 0: 5010.4. Samples: 916609732. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:31,710][25689] Avg episode reward: [(0, '-0.164')] [2022-07-10 21:17:31,796][26022] Updated weights on worker 0-0, policy_version 895133 (0.00086) [2022-07-10 21:17:33,507][26022] Updated weights on worker 0-0, policy_version 895143 (0.00091) [2022-07-10 21:17:35,279][26022] Updated weights on worker 0-0, policy_version 895153 (0.00084) [2022-07-10 21:17:36,727][25689] Fps is (10 sec: 5514.5, 60 sec: 5540.5, 300 sec: 5522.8). Total num frames: 916643840. Throughput: 0: 5851.9. Samples: 916643450. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:36,727][25689] Avg episode reward: [(0, '-1.054')] [2022-07-10 21:17:37,231][26022] Updated weights on worker 0-0, policy_version 895163 (0.00082) [2022-07-10 21:17:38,928][26022] Updated weights on worker 0-0, policy_version 895173 (0.00088) [2022-07-10 21:17:40,868][26022] Updated weights on worker 0-0, policy_version 895183 (0.00093) [2022-07-10 21:17:41,731][25689] Fps is (10 sec: 5722.0, 60 sec: 5575.1, 300 sec: 5525.2). Total num frames: 916672512. Throughput: 0: 5837.3. Samples: 916677136. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:41,731][25689] Avg episode reward: [(0, '-0.707')] [2022-07-10 21:17:42,542][26022] Updated weights on worker 0-0, policy_version 895193 (0.00092) [2022-07-10 21:17:44,411][26022] Updated weights on worker 0-0, policy_version 895203 (0.00099) [2022-07-10 21:17:46,247][26022] Updated weights on worker 0-0, policy_version 895213 (0.00086) [2022-07-10 21:17:46,818][25689] Fps is (10 sec: 5581.1, 60 sec: 5525.5, 300 sec: 5523.9). Total num frames: 916700160. Throughput: 0: 5022.0. Samples: 916694002. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:46,819][25689] Avg episode reward: [(0, '-1.776')] [2022-07-10 21:17:48,087][26022] Updated weights on worker 0-0, policy_version 895223 (0.00079) [2022-07-10 21:17:49,945][26022] Updated weights on worker 0-0, policy_version 895233 (0.00088) [2022-07-10 21:17:51,626][26022] Updated weights on worker 0-0, policy_version 895243 (0.00084) [2022-07-10 21:17:51,842][25689] Fps is (10 sec: 5570.2, 60 sec: 5560.2, 300 sec: 5527.0). Total num frames: 916728832. Throughput: 0: 5854.9. Samples: 916727606. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 21:17:51,842][25689] Avg episode reward: [(0, '-1.805')] [2022-07-10 21:17:53,508][26022] Updated weights on worker 0-0, policy_version 895253 (0.00086) [2022-07-10 21:17:55,509][26022] Updated weights on worker 0-0, policy_version 895263 (0.00083) [2022-07-10 21:17:56,887][25689] Fps is (10 sec: 5694.9, 60 sec: 5541.9, 300 sec: 5527.9). Total num frames: 916757504. Throughput: 0: 5826.1. Samples: 916760906. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:17:56,888][25689] Avg episode reward: [(0, '-3.400')] [2022-07-10 21:17:57,172][26022] Updated weights on worker 0-0, policy_version 895273 (0.00086) [2022-07-10 21:17:59,260][26022] Updated weights on worker 0-0, policy_version 895283 (0.00082) [2022-07-10 21:18:00,795][26022] Updated weights on worker 0-0, policy_version 895293 (0.00083) [2022-07-10 21:18:01,901][25689] Fps is (10 sec: 5497.2, 60 sec: 5558.3, 300 sec: 5532.9). Total num frames: 916784128. Throughput: 0: 4988.9. Samples: 916777762. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:01,902][25689] Avg episode reward: [(0, '-3.355')] [2022-07-10 21:18:03,260][26022] Updated weights on worker 0-0, policy_version 895303 (0.00093) [2022-07-10 21:18:05,026][26022] Updated weights on worker 0-0, policy_version 895313 (0.00086) [2022-07-10 21:18:06,906][26022] Updated weights on worker 0-0, policy_version 895323 (0.00089) [2022-07-10 21:18:06,989][25689] Fps is (10 sec: 5271.3, 60 sec: 5541.1, 300 sec: 5528.2). Total num frames: 916810752. Throughput: 0: 5698.5. Samples: 916808946. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:06,989][25689] Avg episode reward: [(0, '-2.899')] [2022-07-10 21:18:08,835][26022] Updated weights on worker 0-0, policy_version 895333 (0.00083) [2022-07-10 21:18:10,354][26022] Updated weights on worker 0-0, policy_version 895343 (0.00088) [2022-07-10 21:18:11,999][25689] Fps is (10 sec: 5374.5, 60 sec: 5559.6, 300 sec: 5526.4). Total num frames: 916838400. Throughput: 0: 5715.0. Samples: 916842804. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:11,999][25689] Avg episode reward: [(0, '-2.492')] [2022-07-10 21:18:12,482][26022] Updated weights on worker 0-0, policy_version 895353 (0.00091) [2022-07-10 21:18:14,063][26022] Updated weights on worker 0-0, policy_version 895363 (0.00108) [2022-07-10 21:18:16,107][26022] Updated weights on worker 0-0, policy_version 895373 (0.00096) [2022-07-10 21:18:17,020][25689] Fps is (10 sec: 5716.9, 60 sec: 5558.9, 300 sec: 5537.4). Total num frames: 916868096. Throughput: 0: 5731.5. Samples: 916876294. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:17,020][25689] Avg episode reward: [(0, '-1.811')] [2022-07-10 21:18:18,019][26022] Updated weights on worker 0-0, policy_version 895383 (0.00085) [2022-07-10 21:18:19,678][26022] Updated weights on worker 0-0, policy_version 895393 (0.00083) [2022-07-10 21:18:21,682][26022] Updated weights on worker 0-0, policy_version 895403 (0.00090) [2022-07-10 21:18:22,047][25689] Fps is (10 sec: 5605.3, 60 sec: 5541.1, 300 sec: 5528.4). Total num frames: 916894720. Throughput: 0: 5723.5. Samples: 916893066. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:22,047][25689] Avg episode reward: [(0, '-2.940')] [2022-07-10 21:18:23,275][26022] Updated weights on worker 0-0, policy_version 895413 (0.00089) [2022-07-10 21:18:25,379][26022] Updated weights on worker 0-0, policy_version 895423 (0.00095) [2022-07-10 21:18:27,163][25689] Fps is (10 sec: 5350.6, 60 sec: 5521.1, 300 sec: 5527.6). Total num frames: 916922368. Throughput: 0: 5809.0. Samples: 916926134. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:27,163][25689] Avg episode reward: [(0, '-2.240')] [2022-07-10 21:18:27,475][26022] Updated weights on worker 0-0, policy_version 895433 (0.00093) [2022-07-10 21:18:28,914][26022] Updated weights on worker 0-0, policy_version 895443 (0.00089) [2022-07-10 21:18:30,914][26022] Updated weights on worker 0-0, policy_version 895453 (0.00083) [2022-07-10 21:18:32,184][25689] Fps is (10 sec: 5656.9, 60 sec: 5570.9, 300 sec: 5531.4). Total num frames: 916952064. Throughput: 0: 5780.8. Samples: 916959488. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:32,185][25689] Avg episode reward: [(0, '-2.027')] [2022-07-10 21:18:32,667][26022] Updated weights on worker 0-0, policy_version 895463 (0.00090) [2022-07-10 21:18:34,428][26022] Updated weights on worker 0-0, policy_version 895473 (0.00087) [2022-07-10 21:18:36,420][26022] Updated weights on worker 0-0, policy_version 895483 (0.00095) [2022-07-10 21:18:37,222][25689] Fps is (10 sec: 5598.7, 60 sec: 5535.1, 300 sec: 5527.4). Total num frames: 916978688. Throughput: 0: 4952.2. Samples: 916976342. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:37,223][25689] Avg episode reward: [(0, '-2.265')] [2022-07-10 21:18:38,164][26022] Updated weights on worker 0-0, policy_version 895493 (0.00092) [2022-07-10 21:18:40,127][26022] Updated weights on worker 0-0, policy_version 895503 (0.00078) [2022-07-10 21:18:41,791][26022] Updated weights on worker 0-0, policy_version 895513 (0.00090) [2022-07-10 21:18:42,282][25689] Fps is (10 sec: 5374.1, 60 sec: 5513.1, 300 sec: 5524.7). Total num frames: 917006336. Throughput: 0: 5766.8. Samples: 917009762. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:42,283][25689] Avg episode reward: [(0, '-2.933')] [2022-07-10 21:18:43,555][26022] Updated weights on worker 0-0, policy_version 895523 (0.00087) [2022-07-10 21:18:45,608][26022] Updated weights on worker 0-0, policy_version 895533 (0.00086) [2022-07-10 21:18:47,319][26022] Updated weights on worker 0-0, policy_version 895543 (0.00089) [2022-07-10 21:18:47,397][25689] Fps is (10 sec: 5635.8, 60 sec: 5544.4, 300 sec: 5526.4). Total num frames: 917036032. Throughput: 0: 5793.7. Samples: 917043366. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:47,402][25689] Avg episode reward: [(0, '-2.203')] [2022-07-10 21:18:49,255][26022] Updated weights on worker 0-0, policy_version 895553 (0.00091) [2022-07-10 21:18:51,149][26022] Updated weights on worker 0-0, policy_version 895563 (0.00094) [2022-07-10 21:18:52,404][25689] Fps is (10 sec: 5665.6, 60 sec: 5529.0, 300 sec: 5533.5). Total num frames: 917063680. Throughput: 0: 4985.0. Samples: 917060286. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:52,404][25689] Avg episode reward: [(0, '-1.640')] [2022-07-10 21:18:52,798][26022] Updated weights on worker 0-0, policy_version 895573 (0.00087) [2022-07-10 21:18:54,793][26022] Updated weights on worker 0-0, policy_version 895583 (0.00092) [2022-07-10 21:18:56,567][26022] Updated weights on worker 0-0, policy_version 895593 (0.00092) [2022-07-10 21:18:57,409][25689] Fps is (10 sec: 5420.5, 60 sec: 5498.8, 300 sec: 5526.9). Total num frames: 917090304. Throughput: 0: 5804.8. Samples: 917093526. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:18:57,410][25689] Avg episode reward: [(0, '-0.501')] [2022-07-10 21:18:58,349][26022] Updated weights on worker 0-0, policy_version 895603 (0.00080) [2022-07-10 21:19:00,360][26022] Updated weights on worker 0-0, policy_version 895613 (0.00250) [2022-07-10 21:19:02,271][26022] Updated weights on worker 0-0, policy_version 895623 (0.00089) [2022-07-10 21:19:02,447][25689] Fps is (10 sec: 5506.0, 60 sec: 5530.5, 300 sec: 5541.4). Total num frames: 917118976. Throughput: 0: 5795.4. Samples: 917126622. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:02,447][25689] Avg episode reward: [(0, '-0.663')] [2022-07-10 21:19:04,282][26022] Updated weights on worker 0-0, policy_version 895633 (0.00092) [2022-07-10 21:19:06,006][26022] Updated weights on worker 0-0, policy_version 895643 (0.00087) [2022-07-10 21:19:06,200][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:19:06,224][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000895644_917139456.pth [2022-07-10 21:19:06,231][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000893698_915146752.pth [2022-07-10 21:19:07,523][25689] Fps is (10 sec: 5568.5, 60 sec: 5548.4, 300 sec: 5534.0). Total num frames: 917146624. Throughput: 0: 4910.5. Samples: 917142196. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:07,524][25689] Avg episode reward: [(0, '-0.675')] [2022-07-10 21:19:07,856][26022] Updated weights on worker 0-0, policy_version 895653 (0.00091) [2022-07-10 21:19:09,833][26022] Updated weights on worker 0-0, policy_version 895663 (0.00144) [2022-07-10 21:19:11,425][26022] Updated weights on worker 0-0, policy_version 895673 (0.00085) [2022-07-10 21:19:12,577][25689] Fps is (10 sec: 5458.7, 60 sec: 5544.5, 300 sec: 5533.7). Total num frames: 917174272. Throughput: 0: 5739.3. Samples: 917176062. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:12,577][25689] Avg episode reward: [(0, '-0.209')] [2022-07-10 21:19:13,306][26022] Updated weights on worker 0-0, policy_version 895683 (0.00090) [2022-07-10 21:19:15,074][26022] Updated weights on worker 0-0, policy_version 895693 (0.00075) [2022-07-10 21:19:16,872][26022] Updated weights on worker 0-0, policy_version 895703 (0.00498) [2022-07-10 21:19:17,587][25689] Fps is (10 sec: 5698.4, 60 sec: 5545.4, 300 sec: 5537.1). Total num frames: 917203968. Throughput: 0: 5780.8. Samples: 917210166. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:17,587][25689] Avg episode reward: [(0, '0.198')] [2022-07-10 21:19:18,747][26022] Updated weights on worker 0-0, policy_version 895713 (0.00086) [2022-07-10 21:19:20,640][26022] Updated weights on worker 0-0, policy_version 895723 (0.00090) [2022-07-10 21:19:22,580][26022] Updated weights on worker 0-0, policy_version 895733 (0.00091) [2022-07-10 21:19:22,670][25689] Fps is (10 sec: 5579.6, 60 sec: 5540.3, 300 sec: 5534.3). Total num frames: 917230592. Throughput: 0: 4956.5. Samples: 917226860. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:22,671][25689] Avg episode reward: [(0, '-0.303')] [2022-07-10 21:19:24,260][26022] Updated weights on worker 0-0, policy_version 895743 (0.00083) [2022-07-10 21:19:26,151][26022] Updated weights on worker 0-0, policy_version 895753 (0.00089) [2022-07-10 21:19:27,808][25689] Fps is (10 sec: 5409.8, 60 sec: 5555.2, 300 sec: 5532.2). Total num frames: 917259264. Throughput: 0: 5814.4. Samples: 917260138. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:27,809][25689] Avg episode reward: [(0, '-0.088')] [2022-07-10 21:19:27,964][26022] Updated weights on worker 0-0, policy_version 895763 (0.00084) [2022-07-10 21:19:29,945][26022] Updated weights on worker 0-0, policy_version 895773 (0.00090) [2022-07-10 21:19:31,654][26022] Updated weights on worker 0-0, policy_version 895783 (0.00085) [2022-07-10 21:19:32,822][25689] Fps is (10 sec: 5547.7, 60 sec: 5522.0, 300 sec: 5532.1). Total num frames: 917286912. Throughput: 0: 5788.7. Samples: 917293258. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:32,824][25689] Avg episode reward: [(0, '-1.606')] [2022-07-10 21:19:33,721][26022] Updated weights on worker 0-0, policy_version 895793 (0.00088) [2022-07-10 21:19:35,299][26022] Updated weights on worker 0-0, policy_version 895803 (0.00083) [2022-07-10 21:19:37,235][26022] Updated weights on worker 0-0, policy_version 895813 (0.00090) [2022-07-10 21:19:37,913][25689] Fps is (10 sec: 5674.6, 60 sec: 5567.8, 300 sec: 5538.2). Total num frames: 917316608. Throughput: 0: 4912.5. Samples: 917310022. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:37,914][25689] Avg episode reward: [(0, '-1.232')] [2022-07-10 21:19:39,009][26022] Updated weights on worker 0-0, policy_version 895823 (0.00159) [2022-07-10 21:19:40,777][26022] Updated weights on worker 0-0, policy_version 895833 (0.00083) [2022-07-10 21:19:42,770][26022] Updated weights on worker 0-0, policy_version 895843 (0.00089) [2022-07-10 21:19:42,967][25689] Fps is (10 sec: 5551.7, 60 sec: 5551.6, 300 sec: 5531.8). Total num frames: 917343232. Throughput: 0: 5755.8. Samples: 917343684. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:42,968][25689] Avg episode reward: [(0, '-0.958')] [2022-07-10 21:19:44,530][26022] Updated weights on worker 0-0, policy_version 895853 (0.00091) [2022-07-10 21:19:46,536][26022] Updated weights on worker 0-0, policy_version 895863 (0.00092) [2022-07-10 21:19:48,030][25689] Fps is (10 sec: 5566.9, 60 sec: 5556.3, 300 sec: 5530.7). Total num frames: 917372928. Throughput: 0: 5789.8. Samples: 917377220. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:48,031][25689] Avg episode reward: [(0, '-0.957')] [2022-07-10 21:19:48,238][26022] Updated weights on worker 0-0, policy_version 895873 (0.00090) [2022-07-10 21:19:50,104][26022] Updated weights on worker 0-0, policy_version 895883 (0.00086) [2022-07-10 21:19:52,023][26022] Updated weights on worker 0-0, policy_version 895893 (0.00084) [2022-07-10 21:19:53,055][25689] Fps is (10 sec: 5785.9, 60 sec: 5571.5, 300 sec: 5537.6). Total num frames: 917401600. Throughput: 0: 4985.7. Samples: 917394132. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:53,055][25689] Avg episode reward: [(0, '-0.600')] [2022-07-10 21:19:53,750][26022] Updated weights on worker 0-0, policy_version 895903 (0.00084) [2022-07-10 21:19:55,432][26022] Updated weights on worker 0-0, policy_version 895913 (0.00085) [2022-07-10 21:19:57,349][26022] Updated weights on worker 0-0, policy_version 895923 (0.00086) [2022-07-10 21:19:58,129][25689] Fps is (10 sec: 5475.7, 60 sec: 5565.3, 300 sec: 5529.5). Total num frames: 917428224. Throughput: 0: 5827.9. Samples: 917427834. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:19:58,129][25689] Avg episode reward: [(0, '-0.739')] [2022-07-10 21:19:59,178][26022] Updated weights on worker 0-0, policy_version 895933 (0.00085) [2022-07-10 21:20:01,036][26022] Updated weights on worker 0-0, policy_version 895943 (0.00098) [2022-07-10 21:20:03,175][25689] Fps is (10 sec: 5261.4, 60 sec: 5530.7, 300 sec: 5529.2). Total num frames: 917454848. Throughput: 0: 5736.9. Samples: 917459618. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:03,176][25689] Avg episode reward: [(0, '-0.043')] [2022-07-10 21:20:03,236][26022] Updated weights on worker 0-0, policy_version 895953 (0.00094) [2022-07-10 21:20:04,831][26022] Updated weights on worker 0-0, policy_version 895963 (0.00090) [2022-07-10 21:20:06,875][26022] Updated weights on worker 0-0, policy_version 895973 (0.00085) [2022-07-10 21:20:08,231][25689] Fps is (10 sec: 5473.7, 60 sec: 5549.5, 300 sec: 5535.2). Total num frames: 917483520. Throughput: 0: 5736.5. Samples: 917493100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:08,231][25689] Avg episode reward: [(0, '0.349')] [2022-07-10 21:20:08,666][26022] Updated weights on worker 0-0, policy_version 895983 (0.00090) [2022-07-10 21:20:10,671][26022] Updated weights on worker 0-0, policy_version 895993 (0.00082) [2022-07-10 21:20:12,248][26022] Updated weights on worker 0-0, policy_version 896003 (0.00085) [2022-07-10 21:20:13,237][25689] Fps is (10 sec: 5597.3, 60 sec: 5553.8, 300 sec: 5538.7). Total num frames: 917511168. Throughput: 0: 5745.3. Samples: 917510086. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:13,238][25689] Avg episode reward: [(0, '0.701')] [2022-07-10 21:20:14,176][26022] Updated weights on worker 0-0, policy_version 896013 (0.00274) [2022-07-10 21:20:15,962][26022] Updated weights on worker 0-0, policy_version 896023 (0.00086) [2022-07-10 21:20:17,790][26022] Updated weights on worker 0-0, policy_version 896033 (0.00091) [2022-07-10 21:20:18,271][25689] Fps is (10 sec: 5609.6, 60 sec: 5534.7, 300 sec: 5532.7). Total num frames: 917539840. Throughput: 0: 5765.3. Samples: 917543960. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:18,271][25689] Avg episode reward: [(0, '0.140')] [2022-07-10 21:20:19,477][26022] Updated weights on worker 0-0, policy_version 896043 (0.00086) [2022-07-10 21:20:21,567][26022] Updated weights on worker 0-0, policy_version 896053 (0.00085) [2022-07-10 21:20:23,283][25689] Fps is (10 sec: 5606.5, 60 sec: 5558.2, 300 sec: 5536.7). Total num frames: 917567488. Throughput: 0: 5870.6. Samples: 917577660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:23,284][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 21:20:23,290][26022] Updated weights on worker 0-0, policy_version 896063 (0.00087) [2022-07-10 21:20:25,139][26022] Updated weights on worker 0-0, policy_version 896073 (0.00086) [2022-07-10 21:20:26,795][26022] Updated weights on worker 0-0, policy_version 896083 (0.00095) [2022-07-10 21:20:28,393][25689] Fps is (10 sec: 5462.9, 60 sec: 5543.8, 300 sec: 5534.9). Total num frames: 917595136. Throughput: 0: 5014.6. Samples: 917594204. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:28,393][25689] Avg episode reward: [(0, '0.143')] [2022-07-10 21:20:28,868][26022] Updated weights on worker 0-0, policy_version 896093 (0.00090) [2022-07-10 21:20:30,573][26022] Updated weights on worker 0-0, policy_version 896103 (0.00088) [2022-07-10 21:20:32,463][26022] Updated weights on worker 0-0, policy_version 896113 (0.00079) [2022-07-10 21:20:33,407][25689] Fps is (10 sec: 5663.9, 60 sec: 5577.6, 300 sec: 5539.3). Total num frames: 917624832. Throughput: 0: 5831.1. Samples: 917627698. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:33,409][25689] Avg episode reward: [(0, '0.305')] [2022-07-10 21:20:34,227][26022] Updated weights on worker 0-0, policy_version 896123 (0.00085) [2022-07-10 21:20:36,018][26022] Updated weights on worker 0-0, policy_version 896133 (0.00050) [2022-07-10 21:20:37,831][26022] Updated weights on worker 0-0, policy_version 896143 (0.00091) [2022-07-10 21:20:38,446][25689] Fps is (10 sec: 5704.4, 60 sec: 5548.7, 300 sec: 5538.8). Total num frames: 917652480. Throughput: 0: 5826.2. Samples: 917661502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:38,446][25689] Avg episode reward: [(0, '0.221')] [2022-07-10 21:20:39,707][26022] Updated weights on worker 0-0, policy_version 896153 (0.00081) [2022-07-10 21:20:41,696][26022] Updated weights on worker 0-0, policy_version 896163 (0.00091) [2022-07-10 21:20:43,393][26022] Updated weights on worker 0-0, policy_version 896173 (0.00082) [2022-07-10 21:20:43,479][25689] Fps is (10 sec: 5592.0, 60 sec: 5584.4, 300 sec: 5543.5). Total num frames: 917681152. Throughput: 0: 4986.1. Samples: 917678360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:43,479][25689] Avg episode reward: [(0, '0.166')] [2022-07-10 21:20:45,275][26022] Updated weights on worker 0-0, policy_version 896183 (0.00093) [2022-07-10 21:20:47,024][26022] Updated weights on worker 0-0, policy_version 896193 (0.00087) [2022-07-10 21:20:48,543][25689] Fps is (10 sec: 5678.9, 60 sec: 5567.3, 300 sec: 5546.7). Total num frames: 917709824. Throughput: 0: 5832.4. Samples: 917711728. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:48,544][25689] Avg episode reward: [(0, '-1.754')] [2022-07-10 21:20:48,871][26022] Updated weights on worker 0-0, policy_version 896203 (0.00091) [2022-07-10 21:20:50,971][26022] Updated weights on worker 0-0, policy_version 896213 (0.00084) [2022-07-10 21:20:52,621][26022] Updated weights on worker 0-0, policy_version 896223 (0.00091) [2022-07-10 21:20:53,549][25689] Fps is (10 sec: 5694.2, 60 sec: 5569.1, 300 sec: 5547.0). Total num frames: 917738496. Throughput: 0: 5843.7. Samples: 917745402. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:53,550][25689] Avg episode reward: [(0, '-1.860')] [2022-07-10 21:20:54,416][26022] Updated weights on worker 0-0, policy_version 896233 (0.00088) [2022-07-10 21:20:56,210][26022] Updated weights on worker 0-0, policy_version 896243 (0.00087) [2022-07-10 21:20:58,042][26022] Updated weights on worker 0-0, policy_version 896253 (0.00102) [2022-07-10 21:20:58,593][25689] Fps is (10 sec: 5604.1, 60 sec: 5588.7, 300 sec: 5546.6). Total num frames: 917766144. Throughput: 0: 5006.2. Samples: 917762366. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:20:58,594][25689] Avg episode reward: [(0, '-2.183')] [2022-07-10 21:20:59,969][26022] Updated weights on worker 0-0, policy_version 896263 (0.00086) [2022-07-10 21:21:02,244][26022] Updated weights on worker 0-0, policy_version 896273 (0.00089) [2022-07-10 21:21:03,614][25689] Fps is (10 sec: 5290.6, 60 sec: 5574.2, 300 sec: 5545.3). Total num frames: 917791744. Throughput: 0: 5732.3. Samples: 917793780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:03,615][25689] Avg episode reward: [(0, '-2.102')] [2022-07-10 21:21:03,901][26022] Updated weights on worker 0-0, policy_version 896283 (0.00084) [2022-07-10 21:21:05,744][26022] Updated weights on worker 0-0, policy_version 896293 (0.00092) [2022-07-10 21:21:06,275][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:21:06,298][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000896295_917806080.pth [2022-07-10 21:21:06,298][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000894346_915810304.pth [2022-07-10 21:21:07,652][26022] Updated weights on worker 0-0, policy_version 896303 (0.00085) [2022-07-10 21:21:08,725][25689] Fps is (10 sec: 5255.3, 60 sec: 5552.1, 300 sec: 5544.8). Total num frames: 917819392. Throughput: 0: 5728.2. Samples: 917827336. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:08,726][25689] Avg episode reward: [(0, '-2.240')] [2022-07-10 21:21:09,335][26022] Updated weights on worker 0-0, policy_version 896313 (0.00085) [2022-07-10 21:21:11,336][26022] Updated weights on worker 0-0, policy_version 896323 (0.00194) [2022-07-10 21:21:13,166][26022] Updated weights on worker 0-0, policy_version 896333 (0.00098) [2022-07-10 21:21:13,755][25689] Fps is (10 sec: 5553.8, 60 sec: 5566.9, 300 sec: 5544.5). Total num frames: 917848064. Throughput: 0: 4875.4. Samples: 917843912. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:13,755][25689] Avg episode reward: [(0, '-0.946')] [2022-07-10 21:21:15,066][26022] Updated weights on worker 0-0, policy_version 896343 (0.00101) [2022-07-10 21:21:16,872][26022] Updated weights on worker 0-0, policy_version 896353 (0.00093) [2022-07-10 21:21:18,560][26022] Updated weights on worker 0-0, policy_version 896363 (0.00093) [2022-07-10 21:21:18,770][25689] Fps is (10 sec: 5709.1, 60 sec: 5568.6, 300 sec: 5548.2). Total num frames: 917876736. Throughput: 0: 5722.9. Samples: 917877834. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:18,772][25689] Avg episode reward: [(0, '0.198')] [2022-07-10 21:21:20,385][26022] Updated weights on worker 0-0, policy_version 896373 (0.00084) [2022-07-10 21:21:22,571][26022] Updated weights on worker 0-0, policy_version 896384 (0.00097) [2022-07-10 21:21:23,779][25689] Fps is (10 sec: 5618.5, 60 sec: 5568.9, 300 sec: 5553.7). Total num frames: 917904384. Throughput: 0: 5839.8. Samples: 917911538. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:23,780][25689] Avg episode reward: [(0, '0.612')] [2022-07-10 21:21:24,193][26022] Updated weights on worker 0-0, policy_version 896394 (0.00089) [2022-07-10 21:21:26,332][26022] Updated weights on worker 0-0, policy_version 896404 (0.00085) [2022-07-10 21:21:27,906][26022] Updated weights on worker 0-0, policy_version 896414 (0.00094) [2022-07-10 21:21:28,873][25689] Fps is (10 sec: 5473.3, 60 sec: 5570.4, 300 sec: 5546.2). Total num frames: 917932032. Throughput: 0: 5003.7. Samples: 917928146. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:28,873][25689] Avg episode reward: [(0, '0.353')] [2022-07-10 21:21:29,924][26022] Updated weights on worker 0-0, policy_version 896424 (0.00092) [2022-07-10 21:21:31,960][26022] Updated weights on worker 0-0, policy_version 896434 (0.00090) [2022-07-10 21:21:33,568][26022] Updated weights on worker 0-0, policy_version 896444 (0.00085) [2022-07-10 21:21:33,891][25689] Fps is (10 sec: 5670.6, 60 sec: 5570.0, 300 sec: 5553.5). Total num frames: 917961728. Throughput: 0: 5832.6. Samples: 917961360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:33,892][25689] Avg episode reward: [(0, '-0.613')] [2022-07-10 21:21:35,653][26022] Updated weights on worker 0-0, policy_version 896454 (0.00095) [2022-07-10 21:21:37,039][26022] Updated weights on worker 0-0, policy_version 896464 (0.00090) [2022-07-10 21:21:38,896][25689] Fps is (10 sec: 5516.7, 60 sec: 5539.2, 300 sec: 5550.2). Total num frames: 917987328. Throughput: 0: 5820.4. Samples: 917994976. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-10 21:21:38,896][25689] Avg episode reward: [(0, '-0.467')] [2022-07-10 21:21:39,116][26022] Updated weights on worker 0-0, policy_version 896474 (0.00091) [2022-07-10 21:21:40,809][26022] Updated weights on worker 0-0, policy_version 896484 (0.00085) [2022-07-10 21:21:43,008][26022] Updated weights on worker 0-0, policy_version 896494 (0.00078) [2022-07-10 21:21:43,916][25689] Fps is (10 sec: 5413.6, 60 sec: 5540.4, 300 sec: 5544.8). Total num frames: 918016000. Throughput: 0: 4978.5. Samples: 918011794. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:21:43,917][25689] Avg episode reward: [(0, '-1.084')] [2022-07-10 21:21:44,451][26022] Updated weights on worker 0-0, policy_version 896504 (0.00100) [2022-07-10 21:21:46,573][26022] Updated weights on worker 0-0, policy_version 896514 (0.00088) [2022-07-10 21:21:47,971][26022] Updated weights on worker 0-0, policy_version 896524 (0.00090) [2022-07-10 21:21:49,046][25689] Fps is (10 sec: 5649.5, 60 sec: 5534.4, 300 sec: 5549.9). Total num frames: 918044672. Throughput: 0: 5814.3. Samples: 918045440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:21:49,048][25689] Avg episode reward: [(0, '-0.792')] [2022-07-10 21:21:50,332][26022] Updated weights on worker 0-0, policy_version 896534 (0.00087) [2022-07-10 21:21:51,918][26022] Updated weights on worker 0-0, policy_version 896544 (0.00100) [2022-07-10 21:21:53,856][26022] Updated weights on worker 0-0, policy_version 896554 (0.00097) [2022-07-10 21:21:54,052][25689] Fps is (10 sec: 5556.4, 60 sec: 5517.5, 300 sec: 5543.5). Total num frames: 918072320. Throughput: 0: 5841.0. Samples: 918079122. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:21:54,054][25689] Avg episode reward: [(0, '-1.320')] [2022-07-10 21:21:55,457][26022] Updated weights on worker 0-0, policy_version 896564 (0.00100) [2022-07-10 21:21:57,604][26022] Updated weights on worker 0-0, policy_version 896574 (0.00089) [2022-07-10 21:21:58,908][26022] Updated weights on worker 0-0, policy_version 896584 (0.00090) [2022-07-10 21:21:59,118][25689] Fps is (10 sec: 5693.7, 60 sec: 5549.4, 300 sec: 5556.2). Total num frames: 918102016. Throughput: 0: 4991.1. Samples: 918095904. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:21:59,118][25689] Avg episode reward: [(0, '-0.835')] [2022-07-10 21:22:01,199][26022] Updated weights on worker 0-0, policy_version 896594 (0.00093) [2022-07-10 21:22:03,175][26022] Updated weights on worker 0-0, policy_version 896604 (0.00094) [2022-07-10 21:22:04,170][25689] Fps is (10 sec: 5364.1, 60 sec: 5529.5, 300 sec: 5546.5). Total num frames: 918126592. Throughput: 0: 5690.5. Samples: 918127048. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:04,171][25689] Avg episode reward: [(0, '-0.113')] [2022-07-10 21:22:05,318][26022] Updated weights on worker 0-0, policy_version 896614 (0.00087) [2022-07-10 21:22:07,026][26022] Updated weights on worker 0-0, policy_version 896624 (0.00085) [2022-07-10 21:22:08,859][26022] Updated weights on worker 0-0, policy_version 896634 (0.00087) [2022-07-10 21:22:09,274][25689] Fps is (10 sec: 5243.0, 60 sec: 5547.2, 300 sec: 5551.9). Total num frames: 918155264. Throughput: 0: 5672.1. Samples: 918160172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:09,275][25689] Avg episode reward: [(0, '-0.353')] [2022-07-10 21:22:10,646][26022] Updated weights on worker 0-0, policy_version 896644 (0.00092) [2022-07-10 21:22:12,518][26022] Updated weights on worker 0-0, policy_version 896654 (0.00094) [2022-07-10 21:22:14,292][25689] Fps is (10 sec: 5564.2, 60 sec: 5531.3, 300 sec: 5544.9). Total num frames: 918182912. Throughput: 0: 4837.2. Samples: 918177024. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:14,294][25689] Avg episode reward: [(0, '0.139')] [2022-07-10 21:22:14,365][26022] Updated weights on worker 0-0, policy_version 896664 (0.00093) [2022-07-10 21:22:16,251][26022] Updated weights on worker 0-0, policy_version 896674 (0.00089) [2022-07-10 21:22:18,108][26022] Updated weights on worker 0-0, policy_version 896684 (0.00097) [2022-07-10 21:22:19,336][25689] Fps is (10 sec: 5495.5, 60 sec: 5511.8, 300 sec: 5544.5). Total num frames: 918210560. Throughput: 0: 5668.7. Samples: 918210514. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:19,337][25689] Avg episode reward: [(0, '-0.712')] [2022-07-10 21:22:19,949][26022] Updated weights on worker 0-0, policy_version 896694 (0.00081) [2022-07-10 21:22:21,746][26022] Updated weights on worker 0-0, policy_version 896704 (0.00086) [2022-07-10 21:22:23,627][26022] Updated weights on worker 0-0, policy_version 896714 (0.00090) [2022-07-10 21:22:24,358][25689] Fps is (10 sec: 5595.2, 60 sec: 5527.4, 300 sec: 5545.6). Total num frames: 918239232. Throughput: 0: 5799.6. Samples: 918244128. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:24,358][25689] Avg episode reward: [(0, '0.046')] [2022-07-10 21:22:25,391][26022] Updated weights on worker 0-0, policy_version 896724 (0.00098) [2022-07-10 21:22:27,313][26022] Updated weights on worker 0-0, policy_version 896734 (0.00089) [2022-07-10 21:22:28,975][26022] Updated weights on worker 0-0, policy_version 896744 (0.00084) [2022-07-10 21:22:29,435][25689] Fps is (10 sec: 5678.1, 60 sec: 5545.9, 300 sec: 5551.2). Total num frames: 918267904. Throughput: 0: 4996.3. Samples: 918260902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:29,435][25689] Avg episode reward: [(0, '-0.095')] [2022-07-10 21:22:30,916][26022] Updated weights on worker 0-0, policy_version 896754 (0.00083) [2022-07-10 21:22:32,671][26022] Updated weights on worker 0-0, policy_version 896764 (0.00053) [2022-07-10 21:22:34,450][25689] Fps is (10 sec: 5580.5, 60 sec: 5512.4, 300 sec: 5547.8). Total num frames: 918295552. Throughput: 0: 5829.3. Samples: 918294532. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:34,451][25689] Avg episode reward: [(0, '-0.691')] [2022-07-10 21:22:34,683][26022] Updated weights on worker 0-0, policy_version 896774 (0.00095) [2022-07-10 21:22:36,324][26022] Updated weights on worker 0-0, policy_version 896784 (0.00088) [2022-07-10 21:22:38,215][26022] Updated weights on worker 0-0, policy_version 896794 (0.00088) [2022-07-10 21:22:39,475][25689] Fps is (10 sec: 5609.6, 60 sec: 5561.3, 300 sec: 5547.4). Total num frames: 918324224. Throughput: 0: 5849.8. Samples: 918328324. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:39,475][25689] Avg episode reward: [(0, '-0.920')] [2022-07-10 21:22:39,939][26022] Updated weights on worker 0-0, policy_version 896804 (0.00085) [2022-07-10 21:22:42,031][26022] Updated weights on worker 0-0, policy_version 896814 (0.00087) [2022-07-10 21:22:43,606][26022] Updated weights on worker 0-0, policy_version 896824 (0.00090) [2022-07-10 21:22:44,495][25689] Fps is (10 sec: 5607.0, 60 sec: 5544.4, 300 sec: 5548.7). Total num frames: 918351872. Throughput: 0: 5018.8. Samples: 918345190. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:44,495][25689] Avg episode reward: [(0, '-0.985')] [2022-07-10 21:22:45,695][26022] Updated weights on worker 0-0, policy_version 896834 (0.00086) [2022-07-10 21:22:47,164][26022] Updated weights on worker 0-0, policy_version 896844 (0.00092) [2022-07-10 21:22:49,169][26022] Updated weights on worker 0-0, policy_version 896854 (0.00085) [2022-07-10 21:22:49,553][25689] Fps is (10 sec: 5588.0, 60 sec: 5550.9, 300 sec: 5548.0). Total num frames: 918380544. Throughput: 0: 5862.9. Samples: 918378856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:49,554][25689] Avg episode reward: [(0, '-0.287')] [2022-07-10 21:22:50,949][26022] Updated weights on worker 0-0, policy_version 896864 (0.00080) [2022-07-10 21:22:52,642][26022] Updated weights on worker 0-0, policy_version 896874 (0.00090) [2022-07-10 21:22:54,619][25689] Fps is (10 sec: 5461.9, 60 sec: 5528.6, 300 sec: 5540.8). Total num frames: 918407168. Throughput: 0: 5868.3. Samples: 918412888. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:54,619][25689] Avg episode reward: [(0, '-0.682')] [2022-07-10 21:22:54,795][26022] Updated weights on worker 0-0, policy_version 896884 (0.00086) [2022-07-10 21:22:56,302][26022] Updated weights on worker 0-0, policy_version 896894 (0.00083) [2022-07-10 21:22:58,330][26022] Updated weights on worker 0-0, policy_version 896904 (0.00090) [2022-07-10 21:22:59,667][25689] Fps is (10 sec: 5568.5, 60 sec: 5530.1, 300 sec: 5550.4). Total num frames: 918436864. Throughput: 0: 5861.0. Samples: 918446674. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:22:59,669][25689] Avg episode reward: [(0, '-0.804')] [2022-07-10 21:23:00,051][26022] Updated weights on worker 0-0, policy_version 896914 (0.00090) [2022-07-10 21:23:01,996][26022] Updated weights on worker 0-0, policy_version 896924 (0.00104) [2022-07-10 21:23:04,197][26022] Updated weights on worker 0-0, policy_version 896934 (0.00087) [2022-07-10 21:23:04,671][25689] Fps is (10 sec: 5603.0, 60 sec: 5568.5, 300 sec: 5552.0). Total num frames: 918463488. Throughput: 0: 5754.4. Samples: 918461290. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:04,671][25689] Avg episode reward: [(0, '-0.620')] [2022-07-10 21:23:06,122][26022] Updated weights on worker 0-0, policy_version 896944 (0.00081) [2022-07-10 21:23:06,456][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:23:06,473][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000896946_918472704.pth [2022-07-10 21:23:06,473][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000894994_916473856.pth [2022-07-10 21:23:07,724][26022] Updated weights on worker 0-0, policy_version 896954 (0.00086) [2022-07-10 21:23:09,756][25689] Fps is (10 sec: 5277.9, 60 sec: 5536.3, 300 sec: 5547.2). Total num frames: 918490112. Throughput: 0: 5735.8. Samples: 918494736. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:09,757][25689] Avg episode reward: [(0, '-0.417')] [2022-07-10 21:23:09,765][26022] Updated weights on worker 0-0, policy_version 896964 (0.00095) [2022-07-10 21:23:11,255][26022] Updated weights on worker 0-0, policy_version 896974 (0.00081) [2022-07-10 21:23:13,388][26022] Updated weights on worker 0-0, policy_version 896984 (0.00086) [2022-07-10 21:23:14,772][25689] Fps is (10 sec: 5677.0, 60 sec: 5587.3, 300 sec: 5550.7). Total num frames: 918520832. Throughput: 0: 5758.5. Samples: 918528940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:14,772][25689] Avg episode reward: [(0, '-0.503')] [2022-07-10 21:23:14,884][26022] Updated weights on worker 0-0, policy_version 896994 (0.00093) [2022-07-10 21:23:16,892][26022] Updated weights on worker 0-0, policy_version 897004 (0.00082) [2022-07-10 21:23:18,756][26022] Updated weights on worker 0-0, policy_version 897014 (0.00086) [2022-07-10 21:23:19,821][25689] Fps is (10 sec: 5799.0, 60 sec: 5586.8, 300 sec: 5553.7). Total num frames: 918548480. Throughput: 0: 4916.8. Samples: 918545766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:19,822][25689] Avg episode reward: [(0, '-0.558')] [2022-07-10 21:23:20,493][26022] Updated weights on worker 0-0, policy_version 897024 (0.00080) [2022-07-10 21:23:22,409][26022] Updated weights on worker 0-0, policy_version 897034 (0.00095) [2022-07-10 21:23:24,290][26022] Updated weights on worker 0-0, policy_version 897044 (0.00083) [2022-07-10 21:23:24,898][25689] Fps is (10 sec: 5561.7, 60 sec: 5581.7, 300 sec: 5557.9). Total num frames: 918577152. Throughput: 0: 5832.1. Samples: 918579260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:24,899][25689] Avg episode reward: [(0, '-0.548')] [2022-07-10 21:23:26,042][26022] Updated weights on worker 0-0, policy_version 897054 (0.00096) [2022-07-10 21:23:28,006][26022] Updated weights on worker 0-0, policy_version 897064 (0.00085) [2022-07-10 21:23:29,649][26022] Updated weights on worker 0-0, policy_version 897074 (0.00091) [2022-07-10 21:23:30,072][25689] Fps is (10 sec: 5594.2, 60 sec: 5572.8, 300 sec: 5551.6). Total num frames: 918605824. Throughput: 0: 5807.5. Samples: 918612720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:30,074][25689] Avg episode reward: [(0, '-0.655')] [2022-07-10 21:23:31,622][26022] Updated weights on worker 0-0, policy_version 897084 (0.00085) [2022-07-10 21:23:33,529][26022] Updated weights on worker 0-0, policy_version 897094 (0.00092) [2022-07-10 21:23:35,082][25689] Fps is (10 sec: 5429.5, 60 sec: 5556.4, 300 sec: 5552.1). Total num frames: 918632448. Throughput: 0: 4941.8. Samples: 918629312. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:35,083][25689] Avg episode reward: [(0, '-0.801')] [2022-07-10 21:23:35,279][26022] Updated weights on worker 0-0, policy_version 897104 (0.00084) [2022-07-10 21:23:37,186][26022] Updated weights on worker 0-0, policy_version 897114 (0.00091) [2022-07-10 21:23:38,869][26022] Updated weights on worker 0-0, policy_version 897124 (0.00087) [2022-07-10 21:23:40,123][25689] Fps is (10 sec: 5501.3, 60 sec: 5554.9, 300 sec: 5555.9). Total num frames: 918661120. Throughput: 0: 5770.8. Samples: 918662926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:40,124][25689] Avg episode reward: [(0, '-0.662')] [2022-07-10 21:23:40,863][26022] Updated weights on worker 0-0, policy_version 897134 (0.00098) [2022-07-10 21:23:42,668][26022] Updated weights on worker 0-0, policy_version 897144 (0.00093) [2022-07-10 21:23:44,317][26022] Updated weights on worker 0-0, policy_version 897154 (0.00088) [2022-07-10 21:23:45,134][25689] Fps is (10 sec: 5704.9, 60 sec: 5572.6, 300 sec: 5554.4). Total num frames: 918689792. Throughput: 0: 5803.8. Samples: 918696706. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:45,136][25689] Avg episode reward: [(0, '-1.122')] [2022-07-10 21:23:46,326][26022] Updated weights on worker 0-0, policy_version 897164 (0.00083) [2022-07-10 21:23:47,986][26022] Updated weights on worker 0-0, policy_version 897174 (0.00103) [2022-07-10 21:23:49,918][26022] Updated weights on worker 0-0, policy_version 897184 (0.00091) [2022-07-10 21:23:50,212][25689] Fps is (10 sec: 5582.7, 60 sec: 5554.0, 300 sec: 5553.1). Total num frames: 918717440. Throughput: 0: 4995.8. Samples: 918713334. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:50,212][25689] Avg episode reward: [(0, '-1.159')] [2022-07-10 21:23:51,667][26022] Updated weights on worker 0-0, policy_version 897194 (0.00087) [2022-07-10 21:23:53,613][26022] Updated weights on worker 0-0, policy_version 897204 (0.01504) [2022-07-10 21:23:55,259][25689] Fps is (10 sec: 5562.5, 60 sec: 5589.4, 300 sec: 5559.2). Total num frames: 918746112. Throughput: 0: 5826.8. Samples: 918746876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:23:55,260][25689] Avg episode reward: [(0, '-0.843')] [2022-07-10 21:23:55,333][26022] Updated weights on worker 0-0, policy_version 897214 (0.00087) [2022-07-10 21:23:57,125][26022] Updated weights on worker 0-0, policy_version 897224 (0.00082) [2022-07-10 21:23:58,899][26022] Updated weights on worker 0-0, policy_version 897234 (0.00082) [2022-07-10 21:24:00,326][25689] Fps is (10 sec: 5568.4, 60 sec: 5554.0, 300 sec: 5555.2). Total num frames: 918773760. Throughput: 0: 5833.1. Samples: 918780770. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:00,326][25689] Avg episode reward: [(0, '-0.908')] [2022-07-10 21:24:00,801][26022] Updated weights on worker 0-0, policy_version 897244 (0.00084) [2022-07-10 21:24:03,072][26022] Updated weights on worker 0-0, policy_version 897254 (0.00091) [2022-07-10 21:24:04,879][26022] Updated weights on worker 0-0, policy_version 897264 (0.00072) [2022-07-10 21:24:05,372][25689] Fps is (10 sec: 5467.8, 60 sec: 5566.9, 300 sec: 5555.8). Total num frames: 918801408. Throughput: 0: 4880.7. Samples: 918795480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:05,373][25689] Avg episode reward: [(0, '-1.211')] [2022-07-10 21:24:06,904][26022] Updated weights on worker 0-0, policy_version 897274 (0.00090) [2022-07-10 21:24:08,411][26022] Updated weights on worker 0-0, policy_version 897284 (0.00083) [2022-07-10 21:24:10,478][25689] Fps is (10 sec: 5345.9, 60 sec: 5565.0, 300 sec: 5551.4). Total num frames: 918828032. Throughput: 0: 5704.7. Samples: 918828948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:10,479][25689] Avg episode reward: [(0, '-1.660')] [2022-07-10 21:24:10,716][26022] Updated weights on worker 0-0, policy_version 897294 (0.00087) [2022-07-10 21:24:12,085][26022] Updated weights on worker 0-0, policy_version 897304 (0.00092) [2022-07-10 21:24:14,221][26022] Updated weights on worker 0-0, policy_version 897314 (0.00081) [2022-07-10 21:24:15,521][25689] Fps is (10 sec: 5549.5, 60 sec: 5545.7, 300 sec: 5550.8). Total num frames: 918857728. Throughput: 0: 5701.6. Samples: 918862400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:15,531][25689] Avg episode reward: [(0, '-2.197')] [2022-07-10 21:24:16,011][26022] Updated weights on worker 0-0, policy_version 897324 (0.00093) [2022-07-10 21:24:17,671][26022] Updated weights on worker 0-0, policy_version 897334 (0.00084) [2022-07-10 21:24:19,707][26022] Updated weights on worker 0-0, policy_version 897344 (0.00087) [2022-07-10 21:24:20,534][25689] Fps is (10 sec: 5804.5, 60 sec: 5565.9, 300 sec: 5558.9). Total num frames: 918886400. Throughput: 0: 4873.3. Samples: 918879248. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:20,534][25689] Avg episode reward: [(0, '-2.023')] [2022-07-10 21:24:21,346][26022] Updated weights on worker 0-0, policy_version 897354 (0.00091) [2022-07-10 21:24:23,325][26022] Updated weights on worker 0-0, policy_version 897364 (0.00091) [2022-07-10 21:24:25,070][26022] Updated weights on worker 0-0, policy_version 897374 (0.00093) [2022-07-10 21:24:25,555][25689] Fps is (10 sec: 5510.7, 60 sec: 5537.3, 300 sec: 5554.2). Total num frames: 918913024. Throughput: 0: 5821.4. Samples: 918912974. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:25,555][25689] Avg episode reward: [(0, '-1.993')] [2022-07-10 21:24:26,767][26022] Updated weights on worker 0-0, policy_version 897384 (0.00090) [2022-07-10 21:24:28,787][26022] Updated weights on worker 0-0, policy_version 897394 (0.00079) [2022-07-10 21:24:30,589][26022] Updated weights on worker 0-0, policy_version 897404 (0.00087) [2022-07-10 21:24:30,673][25689] Fps is (10 sec: 5554.6, 60 sec: 5559.2, 300 sec: 5559.2). Total num frames: 918942720. Throughput: 0: 5840.2. Samples: 918946892. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:30,674][25689] Avg episode reward: [(0, '-2.046')] [2022-07-10 21:24:32,372][26022] Updated weights on worker 0-0, policy_version 897414 (0.00089) [2022-07-10 21:24:34,360][26022] Updated weights on worker 0-0, policy_version 897424 (0.00086) [2022-07-10 21:24:35,681][25689] Fps is (10 sec: 5663.0, 60 sec: 5576.4, 300 sec: 5553.9). Total num frames: 918970368. Throughput: 0: 5013.7. Samples: 918963478. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:35,682][25689] Avg episode reward: [(0, '-1.731')] [2022-07-10 21:24:35,922][26022] Updated weights on worker 0-0, policy_version 897434 (0.00086) [2022-07-10 21:24:37,919][26022] Updated weights on worker 0-0, policy_version 897444 (0.00088) [2022-07-10 21:24:39,637][26022] Updated weights on worker 0-0, policy_version 897454 (0.00088) [2022-07-10 21:24:40,708][25689] Fps is (10 sec: 5306.6, 60 sec: 5527.0, 300 sec: 5550.9). Total num frames: 918995968. Throughput: 0: 5826.7. Samples: 918996794. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:40,708][25689] Avg episode reward: [(0, '-1.691')] [2022-07-10 21:24:41,666][26022] Updated weights on worker 0-0, policy_version 897464 (0.00082) [2022-07-10 21:24:43,544][26022] Updated weights on worker 0-0, policy_version 897474 (0.00094) [2022-07-10 21:24:45,123][26022] Updated weights on worker 0-0, policy_version 897484 (0.00088) [2022-07-10 21:24:45,726][25689] Fps is (10 sec: 5709.0, 60 sec: 5577.0, 300 sec: 5558.7). Total num frames: 919027712. Throughput: 0: 5823.6. Samples: 919030440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:45,726][25689] Avg episode reward: [(0, '-0.764')] [2022-07-10 21:24:47,226][26022] Updated weights on worker 0-0, policy_version 897494 (0.00087) [2022-07-10 21:24:48,836][26022] Updated weights on worker 0-0, policy_version 897504 (0.00084) [2022-07-10 21:24:50,837][25689] Fps is (10 sec: 5661.4, 60 sec: 5540.2, 300 sec: 5546.7). Total num frames: 919053312. Throughput: 0: 4980.4. Samples: 919047314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:50,837][25689] Avg episode reward: [(0, '-0.295')] [2022-07-10 21:24:50,943][26022] Updated weights on worker 0-0, policy_version 897514 (0.00080) [2022-07-10 21:24:52,451][26022] Updated weights on worker 0-0, policy_version 897524 (0.00101) [2022-07-10 21:24:54,470][26022] Updated weights on worker 0-0, policy_version 897534 (0.00080) [2022-07-10 21:24:55,852][25689] Fps is (10 sec: 5460.3, 60 sec: 5559.9, 300 sec: 5558.1). Total num frames: 919083008. Throughput: 0: 5830.5. Samples: 919081088. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:24:55,853][25689] Avg episode reward: [(0, '-0.753')] [2022-07-10 21:24:56,046][26022] Updated weights on worker 0-0, policy_version 897544 (0.00091) [2022-07-10 21:24:58,071][26022] Updated weights on worker 0-0, policy_version 897554 (0.00089) [2022-07-10 21:24:59,747][26022] Updated weights on worker 0-0, policy_version 897564 (0.00094) [2022-07-10 21:25:00,873][25689] Fps is (10 sec: 5713.6, 60 sec: 5564.2, 300 sec: 5562.0). Total num frames: 919110656. Throughput: 0: 5844.7. Samples: 919114654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:25:00,873][25689] Avg episode reward: [(0, '-0.492')] [2022-07-10 21:25:01,964][26022] Updated weights on worker 0-0, policy_version 897574 (0.00086) [2022-07-10 21:25:03,956][26022] Updated weights on worker 0-0, policy_version 897584 (0.00092) [2022-07-10 21:25:05,822][26022] Updated weights on worker 0-0, policy_version 897594 (0.00086) [2022-07-10 21:25:05,969][25689] Fps is (10 sec: 5263.5, 60 sec: 5525.8, 300 sec: 5551.0). Total num frames: 919136256. Throughput: 0: 5682.8. Samples: 919145480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:25:05,969][25689] Avg episode reward: [(0, '-0.197')] [2022-07-10 21:25:06,660][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:25:06,669][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000897598_919140352.pth [2022-07-10 21:25:06,670][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000895644_917139456.pth [2022-07-10 21:25:07,762][26022] Updated weights on worker 0-0, policy_version 897604 (0.00088) [2022-07-10 21:25:09,597][26022] Updated weights on worker 0-0, policy_version 897614 (0.00084) [2022-07-10 21:25:11,065][25689] Fps is (10 sec: 5224.4, 60 sec: 5543.6, 300 sec: 5549.3). Total num frames: 919163904. Throughput: 0: 5677.8. Samples: 919162168. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:25:11,066][25689] Avg episode reward: [(0, '0.225')] [2022-07-10 21:25:11,395][26022] Updated weights on worker 0-0, policy_version 897624 (0.00059) [2022-07-10 21:25:13,282][26022] Updated weights on worker 0-0, policy_version 897634 (0.00086) [2022-07-10 21:25:14,904][26022] Updated weights on worker 0-0, policy_version 897644 (0.00089) [2022-07-10 21:25:16,132][25689] Fps is (10 sec: 5541.6, 60 sec: 5524.5, 300 sec: 5548.7). Total num frames: 919192576. Throughput: 0: 5663.7. Samples: 919195946. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:25:16,134][25689] Avg episode reward: [(0, '0.347')] [2022-07-10 21:25:16,859][26022] Updated weights on worker 0-0, policy_version 897654 (0.00086) [2022-07-10 21:25:18,564][26022] Updated weights on worker 0-0, policy_version 897664 (0.00094) [2022-07-10 21:25:20,489][26022] Updated weights on worker 0-0, policy_version 897674 (0.00088) [2022-07-10 21:25:21,194][25689] Fps is (10 sec: 5762.6, 60 sec: 5537.0, 300 sec: 5554.6). Total num frames: 919222272. Throughput: 0: 5663.4. Samples: 919229740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:25:21,195][25689] Avg episode reward: [(0, '-0.644')] [2022-07-10 21:25:22,478][26022] Updated weights on worker 0-0, policy_version 897684 (0.00090) [2022-07-10 21:25:24,124][26022] Updated weights on worker 0-0, policy_version 897694 (0.00086) [2022-07-10 21:25:26,104][26022] Updated weights on worker 0-0, policy_version 897704 (0.00091) [2022-07-10 21:25:26,229][25689] Fps is (10 sec: 5679.4, 60 sec: 5552.6, 300 sec: 5556.0). Total num frames: 919249920. Throughput: 0: 4984.6. Samples: 919246468. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-10 21:25:26,229][25689] Avg episode reward: [(0, '-0.547')] [2022-07-10 21:25:27,812][26022] Updated weights on worker 0-0, policy_version 897714 (0.00083) [2022-07-10 21:25:29,799][26022] Updated weights on worker 0-0, policy_version 897724 (0.00080) [2022-07-10 21:25:31,276][25689] Fps is (10 sec: 5586.0, 60 sec: 5542.2, 300 sec: 5552.0). Total num frames: 919278592. Throughput: 0: 5822.3. Samples: 919279844. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:25:31,279][25689] Avg episode reward: [(0, '-0.247')] [2022-07-10 21:25:31,690][26022] Updated weights on worker 0-0, policy_version 897734 (0.00090) [2022-07-10 21:25:33,416][26022] Updated weights on worker 0-0, policy_version 897744 (0.00085) [2022-07-10 21:25:35,318][26022] Updated weights on worker 0-0, policy_version 897754 (0.00084) [2022-07-10 21:25:36,286][25689] Fps is (10 sec: 5497.9, 60 sec: 5525.0, 300 sec: 5549.1). Total num frames: 919305216. Throughput: 0: 5815.8. Samples: 919313160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:25:36,287][25689] Avg episode reward: [(0, '-0.694')] [2022-07-10 21:25:36,947][26022] Updated weights on worker 0-0, policy_version 897764 (0.00087) [2022-07-10 21:25:38,892][26022] Updated weights on worker 0-0, policy_version 897774 (0.00091) [2022-07-10 21:25:40,912][26022] Updated weights on worker 0-0, policy_version 897784 (0.00091) [2022-07-10 21:25:41,311][25689] Fps is (10 sec: 5408.6, 60 sec: 5559.1, 300 sec: 5545.8). Total num frames: 919332864. Throughput: 0: 4975.7. Samples: 919329834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:25:41,312][25689] Avg episode reward: [(0, '-0.433')] [2022-07-10 21:25:42,514][26022] Updated weights on worker 0-0, policy_version 897794 (0.00085) [2022-07-10 21:25:44,528][26022] Updated weights on worker 0-0, policy_version 897804 (0.00086) [2022-07-10 21:25:46,123][26022] Updated weights on worker 0-0, policy_version 897814 (0.00090) [2022-07-10 21:25:46,320][25689] Fps is (10 sec: 5612.9, 60 sec: 5509.1, 300 sec: 5546.8). Total num frames: 919361536. Throughput: 0: 5815.5. Samples: 919363310. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:25:46,321][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 21:25:48,119][26022] Updated weights on worker 0-0, policy_version 897824 (0.00090) [2022-07-10 21:25:49,915][26022] Updated weights on worker 0-0, policy_version 897834 (0.00084) [2022-07-10 21:25:51,358][25689] Fps is (10 sec: 5605.6, 60 sec: 5549.7, 300 sec: 5542.8). Total num frames: 919389184. Throughput: 0: 5821.6. Samples: 919396750. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:25:51,359][25689] Avg episode reward: [(0, '-0.133')] [2022-07-10 21:25:51,765][26022] Updated weights on worker 0-0, policy_version 897844 (0.00082) [2022-07-10 21:25:53,552][26022] Updated weights on worker 0-0, policy_version 897854 (0.00089) [2022-07-10 21:25:55,431][26022] Updated weights on worker 0-0, policy_version 897864 (0.00088) [2022-07-10 21:25:56,378][25689] Fps is (10 sec: 5497.8, 60 sec: 5515.4, 300 sec: 5543.2). Total num frames: 919416832. Throughput: 0: 5002.9. Samples: 919413678. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:25:56,379][25689] Avg episode reward: [(0, '-0.461')] [2022-07-10 21:25:57,099][26022] Updated weights on worker 0-0, policy_version 897874 (0.00082) [2022-07-10 21:25:59,105][26022] Updated weights on worker 0-0, policy_version 897884 (0.00099) [2022-07-10 21:26:00,892][26022] Updated weights on worker 0-0, policy_version 897894 (0.00088) [2022-07-10 21:26:01,415][25689] Fps is (10 sec: 5600.0, 60 sec: 5530.8, 300 sec: 5553.2). Total num frames: 919445504. Throughput: 0: 5844.4. Samples: 919447332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:01,416][25689] Avg episode reward: [(0, '-0.213')] [2022-07-10 21:26:03,092][26022] Updated weights on worker 0-0, policy_version 897904 (0.00087) [2022-07-10 21:26:05,044][26022] Updated weights on worker 0-0, policy_version 897914 (0.00095) [2022-07-10 21:26:06,435][25689] Fps is (10 sec: 5396.4, 60 sec: 5537.7, 300 sec: 5548.1). Total num frames: 919471104. Throughput: 0: 5723.8. Samples: 919478444. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:06,436][25689] Avg episode reward: [(0, '-0.003')] [2022-07-10 21:26:06,882][26022] Updated weights on worker 0-0, policy_version 897924 (0.00092) [2022-07-10 21:26:08,597][26022] Updated weights on worker 0-0, policy_version 897934 (0.00088) [2022-07-10 21:26:10,503][26022] Updated weights on worker 0-0, policy_version 897944 (0.00085) [2022-07-10 21:26:11,503][25689] Fps is (10 sec: 5379.6, 60 sec: 5557.3, 300 sec: 5547.3). Total num frames: 919499776. Throughput: 0: 4891.1. Samples: 919495284. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:11,504][25689] Avg episode reward: [(0, '-0.102')] [2022-07-10 21:26:12,294][26022] Updated weights on worker 0-0, policy_version 897954 (0.00083) [2022-07-10 21:26:14,269][26022] Updated weights on worker 0-0, policy_version 897964 (0.00087) [2022-07-10 21:26:15,914][26022] Updated weights on worker 0-0, policy_version 897974 (0.00091) [2022-07-10 21:26:16,545][25689] Fps is (10 sec: 5571.1, 60 sec: 5542.7, 300 sec: 5543.4). Total num frames: 919527424. Throughput: 0: 5709.1. Samples: 919528810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:16,545][25689] Avg episode reward: [(0, '0.207')] [2022-07-10 21:26:17,952][26022] Updated weights on worker 0-0, policy_version 897984 (0.00086) [2022-07-10 21:26:19,562][26022] Updated weights on worker 0-0, policy_version 897994 (0.00094) [2022-07-10 21:26:21,451][26022] Updated weights on worker 0-0, policy_version 898004 (0.00089) [2022-07-10 21:26:21,549][25689] Fps is (10 sec: 5606.4, 60 sec: 5531.0, 300 sec: 5546.9). Total num frames: 919556096. Throughput: 0: 5732.8. Samples: 919562758. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:21,550][25689] Avg episode reward: [(0, '0.227')] [2022-07-10 21:26:23,381][26022] Updated weights on worker 0-0, policy_version 898014 (0.00085) [2022-07-10 21:26:25,110][26022] Updated weights on worker 0-0, policy_version 898024 (0.00088) [2022-07-10 21:26:26,574][25689] Fps is (10 sec: 5615.8, 60 sec: 5532.0, 300 sec: 5548.2). Total num frames: 919583744. Throughput: 0: 5024.5. Samples: 919579628. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:26,574][25689] Avg episode reward: [(0, '0.279')] [2022-07-10 21:26:26,839][26022] Updated weights on worker 0-0, policy_version 898034 (0.00089) [2022-07-10 21:26:28,808][26022] Updated weights on worker 0-0, policy_version 898044 (0.00083) [2022-07-10 21:26:30,807][26022] Updated weights on worker 0-0, policy_version 898054 (0.00402) [2022-07-10 21:26:31,678][25689] Fps is (10 sec: 5560.2, 60 sec: 5526.7, 300 sec: 5543.2). Total num frames: 919612416. Throughput: 0: 5831.1. Samples: 919612926. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:31,679][25689] Avg episode reward: [(0, '0.411')] [2022-07-10 21:26:32,461][26022] Updated weights on worker 0-0, policy_version 898064 (0.00091) [2022-07-10 21:26:34,324][26022] Updated weights on worker 0-0, policy_version 898074 (0.01363) [2022-07-10 21:26:36,177][26022] Updated weights on worker 0-0, policy_version 898084 (0.00086) [2022-07-10 21:26:36,681][25689] Fps is (10 sec: 5572.0, 60 sec: 5544.3, 300 sec: 5550.1). Total num frames: 919640064. Throughput: 0: 5826.4. Samples: 919646132. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:36,682][25689] Avg episode reward: [(0, '-0.414')] [2022-07-10 21:26:38,153][26022] Updated weights on worker 0-0, policy_version 898094 (0.00087) [2022-07-10 21:26:39,943][26022] Updated weights on worker 0-0, policy_version 898104 (0.00090) [2022-07-10 21:26:41,678][26022] Updated weights on worker 0-0, policy_version 898114 (0.00086) [2022-07-10 21:26:41,695][25689] Fps is (10 sec: 5622.7, 60 sec: 5562.3, 300 sec: 5550.2). Total num frames: 919668736. Throughput: 0: 4976.1. Samples: 919663004. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:41,695][25689] Avg episode reward: [(0, '-0.646')] [2022-07-10 21:26:43,600][26022] Updated weights on worker 0-0, policy_version 898124 (0.00093) [2022-07-10 21:26:45,333][26022] Updated weights on worker 0-0, policy_version 898134 (0.00084) [2022-07-10 21:26:46,711][25689] Fps is (10 sec: 5513.2, 60 sec: 5527.8, 300 sec: 5545.5). Total num frames: 919695360. Throughput: 0: 5814.2. Samples: 919696710. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:46,712][25689] Avg episode reward: [(0, '-0.541')] [2022-07-10 21:26:47,238][26022] Updated weights on worker 0-0, policy_version 898144 (0.00092) [2022-07-10 21:26:48,995][26022] Updated weights on worker 0-0, policy_version 898154 (0.00083) [2022-07-10 21:26:50,873][26022] Updated weights on worker 0-0, policy_version 898164 (0.00093) [2022-07-10 21:26:51,748][25689] Fps is (10 sec: 5500.1, 60 sec: 5544.7, 300 sec: 5548.3). Total num frames: 919724032. Throughput: 0: 5851.3. Samples: 919730362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:51,749][25689] Avg episode reward: [(0, '-1.353')] [2022-07-10 21:26:52,700][26022] Updated weights on worker 0-0, policy_version 898174 (0.00091) [2022-07-10 21:26:54,624][26022] Updated weights on worker 0-0, policy_version 898184 (0.00081) [2022-07-10 21:26:56,327][26022] Updated weights on worker 0-0, policy_version 898194 (0.00088) [2022-07-10 21:26:56,754][25689] Fps is (10 sec: 5709.7, 60 sec: 5563.0, 300 sec: 5546.0). Total num frames: 919752704. Throughput: 0: 5041.2. Samples: 919747324. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:26:56,755][25689] Avg episode reward: [(0, '-0.979')] [2022-07-10 21:26:58,318][26022] Updated weights on worker 0-0, policy_version 898204 (0.00085) [2022-07-10 21:26:59,899][26022] Updated weights on worker 0-0, policy_version 898214 (0.00067) [2022-07-10 21:27:01,761][25689] Fps is (10 sec: 5522.5, 60 sec: 5531.8, 300 sec: 5553.7). Total num frames: 919779328. Throughput: 0: 5889.1. Samples: 919781178. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:01,762][25689] Avg episode reward: [(0, '-0.494')] [2022-07-10 21:27:02,191][26022] Updated weights on worker 0-0, policy_version 898224 (0.00087) [2022-07-10 21:27:04,098][26022] Updated weights on worker 0-0, policy_version 898234 (0.00085) [2022-07-10 21:27:05,927][26022] Updated weights on worker 0-0, policy_version 898244 (0.00094) [2022-07-10 21:27:06,768][25689] Fps is (10 sec: 5317.7, 60 sec: 5550.1, 300 sec: 5548.6). Total num frames: 919805952. Throughput: 0: 5752.6. Samples: 919812088. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:06,768][25689] Avg episode reward: [(0, '-0.277')] [2022-07-10 21:27:06,911][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:27:06,924][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000898249_919806976.pth [2022-07-10 21:27:06,924][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000896295_917806080.pth [2022-07-10 21:27:07,872][26022] Updated weights on worker 0-0, policy_version 898254 (0.00052) [2022-07-10 21:27:09,788][26022] Updated weights on worker 0-0, policy_version 898264 (0.00086) [2022-07-10 21:27:11,359][26022] Updated weights on worker 0-0, policy_version 898274 (0.00093) [2022-07-10 21:27:11,831][25689] Fps is (10 sec: 5491.3, 60 sec: 5550.5, 300 sec: 5551.2). Total num frames: 919834624. Throughput: 0: 4892.6. Samples: 919828618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:11,831][25689] Avg episode reward: [(0, '-0.088')] [2022-07-10 21:27:13,513][26022] Updated weights on worker 0-0, policy_version 898284 (0.00083) [2022-07-10 21:27:14,937][26022] Updated weights on worker 0-0, policy_version 898294 (0.00085) [2022-07-10 21:27:16,896][25689] Fps is (10 sec: 5560.5, 60 sec: 5548.3, 300 sec: 5550.8). Total num frames: 919862272. Throughput: 0: 5709.7. Samples: 919862328. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:16,897][25689] Avg episode reward: [(0, '-0.363')] [2022-07-10 21:27:16,991][26022] Updated weights on worker 0-0, policy_version 898304 (0.00087) [2022-07-10 21:27:18,592][26022] Updated weights on worker 0-0, policy_version 898314 (0.00088) [2022-07-10 21:27:20,575][26022] Updated weights on worker 0-0, policy_version 898324 (0.00092) [2022-07-10 21:27:21,926][25689] Fps is (10 sec: 5680.6, 60 sec: 5562.9, 300 sec: 5554.1). Total num frames: 919891968. Throughput: 0: 5698.0. Samples: 919896076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:21,927][25689] Avg episode reward: [(0, '0.757')] [2022-07-10 21:27:22,302][26022] Updated weights on worker 0-0, policy_version 898334 (0.00088) [2022-07-10 21:27:24,149][26022] Updated weights on worker 0-0, policy_version 898344 (0.00088) [2022-07-10 21:27:25,907][26022] Updated weights on worker 0-0, policy_version 898354 (0.00087) [2022-07-10 21:27:26,963][25689] Fps is (10 sec: 5493.0, 60 sec: 5527.9, 300 sec: 5544.6). Total num frames: 919917568. Throughput: 0: 4997.7. Samples: 919913018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:26,963][25689] Avg episode reward: [(0, '0.758')] [2022-07-10 21:27:28,084][26022] Updated weights on worker 0-0, policy_version 898364 (0.00097) [2022-07-10 21:27:29,809][26022] Updated weights on worker 0-0, policy_version 898374 (0.00081) [2022-07-10 21:27:31,622][26022] Updated weights on worker 0-0, policy_version 898384 (0.00093) [2022-07-10 21:27:32,097][25689] Fps is (10 sec: 5436.3, 60 sec: 5542.1, 300 sec: 5549.2). Total num frames: 919947264. Throughput: 0: 5787.1. Samples: 919945902. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:32,098][25689] Avg episode reward: [(0, '-0.785')] [2022-07-10 21:27:33,685][26022] Updated weights on worker 0-0, policy_version 898394 (0.00054) [2022-07-10 21:27:35,320][26022] Updated weights on worker 0-0, policy_version 898404 (0.00084) [2022-07-10 21:27:37,189][25689] Fps is (10 sec: 5507.5, 60 sec: 5517.1, 300 sec: 5541.1). Total num frames: 919973888. Throughput: 0: 5753.1. Samples: 919979074. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:37,189][25689] Avg episode reward: [(0, '-0.753')] [2022-07-10 21:27:37,345][26022] Updated weights on worker 0-0, policy_version 898414 (0.00079) [2022-07-10 21:27:39,008][26022] Updated weights on worker 0-0, policy_version 898424 (0.00089) [2022-07-10 21:27:40,925][26022] Updated weights on worker 0-0, policy_version 898434 (0.00086) [2022-07-10 21:27:42,243][25689] Fps is (10 sec: 5450.3, 60 sec: 5513.4, 300 sec: 5543.9). Total num frames: 920002560. Throughput: 0: 5720.0. Samples: 920012290. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:42,243][25689] Avg episode reward: [(0, '-1.520')] [2022-07-10 21:27:42,831][26022] Updated weights on worker 0-0, policy_version 898444 (0.00087) [2022-07-10 21:27:44,491][26022] Updated weights on worker 0-0, policy_version 898454 (0.00084) [2022-07-10 21:27:46,455][26022] Updated weights on worker 0-0, policy_version 898464 (0.00093) [2022-07-10 21:27:47,278][25689] Fps is (10 sec: 5683.7, 60 sec: 5545.5, 300 sec: 5544.3). Total num frames: 920031232. Throughput: 0: 5716.3. Samples: 920029146. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:47,278][25689] Avg episode reward: [(0, '-1.415')] [2022-07-10 21:27:48,265][26022] Updated weights on worker 0-0, policy_version 898474 (0.00082) [2022-07-10 21:27:50,195][26022] Updated weights on worker 0-0, policy_version 898484 (0.00093) [2022-07-10 21:27:51,955][26022] Updated weights on worker 0-0, policy_version 898494 (0.00091) [2022-07-10 21:27:52,379][25689] Fps is (10 sec: 5556.5, 60 sec: 5522.8, 300 sec: 5547.1). Total num frames: 920058880. Throughput: 0: 5755.3. Samples: 920062628. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:52,379][25689] Avg episode reward: [(0, '-1.512')] [2022-07-10 21:27:53,879][26022] Updated weights on worker 0-0, policy_version 898504 (0.00085) [2022-07-10 21:27:55,498][26022] Updated weights on worker 0-0, policy_version 898514 (0.00760) [2022-07-10 21:27:57,440][25689] Fps is (10 sec: 5542.2, 60 sec: 5517.7, 300 sec: 5543.5). Total num frames: 920087552. Throughput: 0: 5796.6. Samples: 920096462. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:27:57,443][25689] Avg episode reward: [(0, '0.118')] [2022-07-10 21:27:57,517][26022] Updated weights on worker 0-0, policy_version 898524 (0.00092) [2022-07-10 21:27:59,236][26022] Updated weights on worker 0-0, policy_version 898534 (0.00083) [2022-07-10 21:28:01,130][26022] Updated weights on worker 0-0, policy_version 898544 (0.00098) [2022-07-10 21:28:02,475][25689] Fps is (10 sec: 5578.5, 60 sec: 5532.1, 300 sec: 5546.3). Total num frames: 920115200. Throughput: 0: 4996.8. Samples: 920113384. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:02,475][25689] Avg episode reward: [(0, '0.350')] [2022-07-10 21:28:03,264][26022] Updated weights on worker 0-0, policy_version 898554 (0.00088) [2022-07-10 21:28:04,973][26022] Updated weights on worker 0-0, policy_version 898564 (0.00076) [2022-07-10 21:28:06,829][26022] Updated weights on worker 0-0, policy_version 898574 (0.00096) [2022-07-10 21:28:07,478][25689] Fps is (10 sec: 5508.8, 60 sec: 5549.3, 300 sec: 5551.3). Total num frames: 920142848. Throughput: 0: 5748.3. Samples: 920145260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:07,478][25689] Avg episode reward: [(0, '1.463')] [2022-07-10 21:28:08,684][26022] Updated weights on worker 0-0, policy_version 898584 (0.00091) [2022-07-10 21:28:10,633][26022] Updated weights on worker 0-0, policy_version 898594 (0.00095) [2022-07-10 21:28:12,559][25689] Fps is (10 sec: 5483.4, 60 sec: 5530.8, 300 sec: 5539.7). Total num frames: 920170496. Throughput: 0: 5752.5. Samples: 920178714. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:12,560][25689] Avg episode reward: [(0, '0.642')] [2022-07-10 21:28:12,567][26022] Updated weights on worker 0-0, policy_version 898604 (0.00094) [2022-07-10 21:28:14,201][26022] Updated weights on worker 0-0, policy_version 898614 (0.00079) [2022-07-10 21:28:15,933][26022] Updated weights on worker 0-0, policy_version 898624 (0.00088) [2022-07-10 21:28:17,607][25689] Fps is (10 sec: 5458.9, 60 sec: 5532.3, 300 sec: 5539.8). Total num frames: 920198144. Throughput: 0: 4910.8. Samples: 920195498. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:17,608][25689] Avg episode reward: [(0, '0.754')] [2022-07-10 21:28:18,045][26022] Updated weights on worker 0-0, policy_version 898634 (0.00089) [2022-07-10 21:28:19,522][26022] Updated weights on worker 0-0, policy_version 898644 (0.00093) [2022-07-10 21:28:21,651][26022] Updated weights on worker 0-0, policy_version 898654 (0.00085) [2022-07-10 21:28:22,610][25689] Fps is (10 sec: 5603.3, 60 sec: 5517.8, 300 sec: 5541.1). Total num frames: 920226816. Throughput: 0: 5742.2. Samples: 920229006. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:22,612][25689] Avg episode reward: [(0, '0.740')] [2022-07-10 21:28:23,295][26022] Updated weights on worker 0-0, policy_version 898664 (0.00090) [2022-07-10 21:28:25,159][26022] Updated weights on worker 0-0, policy_version 898674 (0.00078) [2022-07-10 21:28:27,284][26022] Updated weights on worker 0-0, policy_version 898684 (0.00090) [2022-07-10 21:28:27,620][25689] Fps is (10 sec: 5624.6, 60 sec: 5554.1, 300 sec: 5540.7). Total num frames: 920254464. Throughput: 0: 5816.7. Samples: 920262424. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:27,622][25689] Avg episode reward: [(0, '0.423')] [2022-07-10 21:28:29,015][26022] Updated weights on worker 0-0, policy_version 898694 (0.00096) [2022-07-10 21:28:30,767][26022] Updated weights on worker 0-0, policy_version 898704 (0.00254) [2022-07-10 21:28:32,642][26022] Updated weights on worker 0-0, policy_version 898714 (0.00096) [2022-07-10 21:28:32,694][25689] Fps is (10 sec: 5585.1, 60 sec: 5542.7, 300 sec: 5546.4). Total num frames: 920283136. Throughput: 0: 4988.4. Samples: 920279156. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:32,702][25689] Avg episode reward: [(0, '-0.602')] [2022-07-10 21:28:34,422][26022] Updated weights on worker 0-0, policy_version 898724 (0.00094) [2022-07-10 21:28:36,443][26022] Updated weights on worker 0-0, policy_version 898734 (0.00089) [2022-07-10 21:28:37,717][25689] Fps is (10 sec: 5578.3, 60 sec: 5566.0, 300 sec: 5543.4). Total num frames: 920310784. Throughput: 0: 5814.6. Samples: 920312426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:37,717][25689] Avg episode reward: [(0, '0.075')] [2022-07-10 21:28:38,291][26022] Updated weights on worker 0-0, policy_version 898744 (0.00095) [2022-07-10 21:28:39,947][26022] Updated weights on worker 0-0, policy_version 898754 (0.00088) [2022-07-10 21:28:41,870][26022] Updated weights on worker 0-0, policy_version 898764 (0.00100) [2022-07-10 21:28:42,737][25689] Fps is (10 sec: 5506.0, 60 sec: 5552.1, 300 sec: 5539.7). Total num frames: 920338432. Throughput: 0: 5815.3. Samples: 920346050. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:42,739][25689] Avg episode reward: [(0, '-0.692')] [2022-07-10 21:28:43,732][26022] Updated weights on worker 0-0, policy_version 898774 (0.00091) [2022-07-10 21:28:45,415][26022] Updated weights on worker 0-0, policy_version 898784 (0.00080) [2022-07-10 21:28:47,347][26022] Updated weights on worker 0-0, policy_version 898794 (0.00089) [2022-07-10 21:28:47,778][25689] Fps is (10 sec: 5597.7, 60 sec: 5551.6, 300 sec: 5543.9). Total num frames: 920367104. Throughput: 0: 4994.1. Samples: 920363092. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:47,779][25689] Avg episode reward: [(0, '-0.643')] [2022-07-10 21:28:48,891][26022] Updated weights on worker 0-0, policy_version 898804 (0.00097) [2022-07-10 21:28:51,090][26022] Updated weights on worker 0-0, policy_version 898814 (0.00092) [2022-07-10 21:28:52,803][26022] Updated weights on worker 0-0, policy_version 898824 (0.00086) [2022-07-10 21:28:52,859][25689] Fps is (10 sec: 5665.6, 60 sec: 5570.4, 300 sec: 5543.2). Total num frames: 920395776. Throughput: 0: 5817.7. Samples: 920396466. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:52,859][25689] Avg episode reward: [(0, '-0.742')] [2022-07-10 21:28:54,591][26022] Updated weights on worker 0-0, policy_version 898834 (0.00089) [2022-07-10 21:28:56,683][26022] Updated weights on worker 0-0, policy_version 898844 (0.00083) [2022-07-10 21:28:57,863][25689] Fps is (10 sec: 5686.1, 60 sec: 5575.7, 300 sec: 5547.8). Total num frames: 920424448. Throughput: 0: 5839.4. Samples: 920430068. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:28:57,863][25689] Avg episode reward: [(0, '-0.596')] [2022-07-10 21:28:58,144][26022] Updated weights on worker 0-0, policy_version 898854 (0.00096) [2022-07-10 21:29:00,292][26022] Updated weights on worker 0-0, policy_version 898864 (0.00051) [2022-07-10 21:29:01,950][26022] Updated weights on worker 0-0, policy_version 898874 (0.00094) [2022-07-10 21:29:02,882][25689] Fps is (10 sec: 5414.6, 60 sec: 5543.2, 300 sec: 5541.5). Total num frames: 920450048. Throughput: 0: 5005.9. Samples: 920446892. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:29:02,882][25689] Avg episode reward: [(0, '0.212')] [2022-07-10 21:29:04,235][26022] Updated weights on worker 0-0, policy_version 898884 (0.00093) [2022-07-10 21:29:06,040][26022] Updated weights on worker 0-0, policy_version 898894 (0.00096) [2022-07-10 21:29:06,943][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:29:06,954][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000898899_920472576.pth [2022-07-10 21:29:06,954][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000896946_918472704.pth [2022-07-10 21:29:07,791][26022] Updated weights on worker 0-0, policy_version 898904 (0.00098) [2022-07-10 21:29:07,900][25689] Fps is (10 sec: 5305.1, 60 sec: 5541.8, 300 sec: 5546.5). Total num frames: 920477696. Throughput: 0: 5722.7. Samples: 920478244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:29:07,903][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 21:29:09,778][26022] Updated weights on worker 0-0, policy_version 898914 (0.00093) [2022-07-10 21:29:11,722][26022] Updated weights on worker 0-0, policy_version 898924 (0.00090) [2022-07-10 21:29:12,943][25689] Fps is (10 sec: 5496.3, 60 sec: 5545.4, 300 sec: 5539.7). Total num frames: 920505344. Throughput: 0: 5726.3. Samples: 920511472. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 21:29:12,943][25689] Avg episode reward: [(0, '0.545')] [2022-07-10 21:29:13,355][26022] Updated weights on worker 0-0, policy_version 898934 (0.00088) [2022-07-10 21:29:15,459][26022] Updated weights on worker 0-0, policy_version 898944 (0.00087) [2022-07-10 21:29:17,085][26022] Updated weights on worker 0-0, policy_version 898954 (0.00086) [2022-07-10 21:29:18,002][25689] Fps is (10 sec: 5473.6, 60 sec: 5544.3, 300 sec: 5535.4). Total num frames: 920532992. Throughput: 0: 4866.6. Samples: 920528078. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:18,003][25689] Avg episode reward: [(0, '-0.165')] [2022-07-10 21:29:18,934][26022] Updated weights on worker 0-0, policy_version 898964 (0.00092) [2022-07-10 21:29:20,556][26022] Updated weights on worker 0-0, policy_version 898974 (0.00086) [2022-07-10 21:29:22,741][26022] Updated weights on worker 0-0, policy_version 898984 (0.00064) [2022-07-10 21:29:23,004][25689] Fps is (10 sec: 5496.1, 60 sec: 5527.5, 300 sec: 5539.2). Total num frames: 920560640. Throughput: 0: 5703.1. Samples: 920561648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:23,004][25689] Avg episode reward: [(0, '-1.695')] [2022-07-10 21:29:24,506][26022] Updated weights on worker 0-0, policy_version 898994 (0.00095) [2022-07-10 21:29:26,365][26022] Updated weights on worker 0-0, policy_version 899004 (0.00080) [2022-07-10 21:29:28,019][25689] Fps is (10 sec: 5622.5, 60 sec: 5544.0, 300 sec: 5537.6). Total num frames: 920589312. Throughput: 0: 5830.5. Samples: 920595548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:28,019][25689] Avg episode reward: [(0, '-1.391')] [2022-07-10 21:29:28,135][26022] Updated weights on worker 0-0, policy_version 899014 (0.00087) [2022-07-10 21:29:30,105][26022] Updated weights on worker 0-0, policy_version 899024 (0.00085) [2022-07-10 21:29:31,655][26022] Updated weights on worker 0-0, policy_version 899034 (0.00088) [2022-07-10 21:29:33,065][25689] Fps is (10 sec: 5699.4, 60 sec: 5546.5, 300 sec: 5540.4). Total num frames: 920617984. Throughput: 0: 5009.3. Samples: 920612270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:33,067][25689] Avg episode reward: [(0, '-1.103')] [2022-07-10 21:29:33,564][26022] Updated weights on worker 0-0, policy_version 899044 (0.00090) [2022-07-10 21:29:35,385][26022] Updated weights on worker 0-0, policy_version 899054 (0.00074) [2022-07-10 21:29:37,242][26022] Updated weights on worker 0-0, policy_version 899064 (0.00088) [2022-07-10 21:29:38,110][25689] Fps is (10 sec: 5581.5, 60 sec: 5544.5, 300 sec: 5546.9). Total num frames: 920645632. Throughput: 0: 5888.2. Samples: 920646474. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:38,111][25689] Avg episode reward: [(0, '-1.922')] [2022-07-10 21:29:38,997][26022] Updated weights on worker 0-0, policy_version 899074 (0.00087) [2022-07-10 21:29:41,124][26022] Updated weights on worker 0-0, policy_version 899084 (0.00088) [2022-07-10 21:29:42,539][26022] Updated weights on worker 0-0, policy_version 899094 (0.00087) [2022-07-10 21:29:43,124][25689] Fps is (10 sec: 5598.7, 60 sec: 5562.0, 300 sec: 5536.6). Total num frames: 920674304. Throughput: 0: 5870.2. Samples: 920679762. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:43,126][25689] Avg episode reward: [(0, '-1.764')] [2022-07-10 21:29:44,779][26022] Updated weights on worker 0-0, policy_version 899104 (0.00088) [2022-07-10 21:29:46,084][26022] Updated weights on worker 0-0, policy_version 899114 (0.00089) [2022-07-10 21:29:48,132][25689] Fps is (10 sec: 5517.2, 60 sec: 5531.1, 300 sec: 5542.0). Total num frames: 920700928. Throughput: 0: 5020.9. Samples: 920696534. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:48,134][25689] Avg episode reward: [(0, '-1.711')] [2022-07-10 21:29:48,353][26022] Updated weights on worker 0-0, policy_version 899124 (0.00094) [2022-07-10 21:29:50,095][26022] Updated weights on worker 0-0, policy_version 899134 (0.00086) [2022-07-10 21:29:51,940][26022] Updated weights on worker 0-0, policy_version 899144 (0.00093) [2022-07-10 21:29:53,244][25689] Fps is (10 sec: 5464.4, 60 sec: 5528.3, 300 sec: 5536.8). Total num frames: 920729600. Throughput: 0: 5823.9. Samples: 920729790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:53,244][25689] Avg episode reward: [(0, '-0.162')] [2022-07-10 21:29:53,762][26022] Updated weights on worker 0-0, policy_version 899154 (0.00084) [2022-07-10 21:29:55,636][26022] Updated weights on worker 0-0, policy_version 899164 (0.00088) [2022-07-10 21:29:57,458][26022] Updated weights on worker 0-0, policy_version 899174 (0.00092) [2022-07-10 21:29:58,263][25689] Fps is (10 sec: 5660.4, 60 sec: 5526.9, 300 sec: 5540.3). Total num frames: 920758272. Throughput: 0: 5789.1. Samples: 920763144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:29:58,263][25689] Avg episode reward: [(0, '-0.501')] [2022-07-10 21:29:59,367][26022] Updated weights on worker 0-0, policy_version 899184 (0.00083) [2022-07-10 21:30:01,039][26022] Updated weights on worker 0-0, policy_version 899194 (0.00094) [2022-07-10 21:30:03,296][25689] Fps is (10 sec: 5297.1, 60 sec: 5508.7, 300 sec: 5538.0). Total num frames: 920782848. Throughput: 0: 4961.0. Samples: 920779834. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:03,297][25689] Avg episode reward: [(0, '0.251')] [2022-07-10 21:30:03,525][26022] Updated weights on worker 0-0, policy_version 899204 (0.00091) [2022-07-10 21:30:05,019][26022] Updated weights on worker 0-0, policy_version 899214 (0.00501) [2022-07-10 21:30:07,131][26022] Updated weights on worker 0-0, policy_version 899224 (0.00085) [2022-07-10 21:30:08,300][25689] Fps is (10 sec: 5305.1, 60 sec: 5526.9, 300 sec: 5543.1). Total num frames: 920811520. Throughput: 0: 5669.1. Samples: 920810868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:08,301][25689] Avg episode reward: [(0, '-0.886')] [2022-07-10 21:30:08,956][26022] Updated weights on worker 0-0, policy_version 899234 (0.00105) [2022-07-10 21:30:10,714][26022] Updated weights on worker 0-0, policy_version 899244 (0.00095) [2022-07-10 21:30:12,647][26022] Updated weights on worker 0-0, policy_version 899254 (0.00088) [2022-07-10 21:30:13,396][25689] Fps is (10 sec: 5677.6, 60 sec: 5538.9, 300 sec: 5542.6). Total num frames: 920840192. Throughput: 0: 5664.1. Samples: 920843934. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:13,396][25689] Avg episode reward: [(0, '-0.684')] [2022-07-10 21:30:14,477][26022] Updated weights on worker 0-0, policy_version 899264 (0.00086) [2022-07-10 21:30:16,228][26022] Updated weights on worker 0-0, policy_version 899274 (0.00089) [2022-07-10 21:30:18,206][26022] Updated weights on worker 0-0, policy_version 899284 (0.00094) [2022-07-10 21:30:18,438][25689] Fps is (10 sec: 5454.6, 60 sec: 5523.6, 300 sec: 5532.6). Total num frames: 920866816. Throughput: 0: 4850.1. Samples: 920860994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:18,438][25689] Avg episode reward: [(0, '0.045')] [2022-07-10 21:30:19,839][26022] Updated weights on worker 0-0, policy_version 899294 (0.00081) [2022-07-10 21:30:21,796][26022] Updated weights on worker 0-0, policy_version 899304 (0.00091) [2022-07-10 21:30:23,493][25689] Fps is (10 sec: 5577.8, 60 sec: 5552.6, 300 sec: 5539.2). Total num frames: 920896512. Throughput: 0: 5677.8. Samples: 920894508. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:23,494][25689] Avg episode reward: [(0, '-0.004')] [2022-07-10 21:30:23,587][26022] Updated weights on worker 0-0, policy_version 899314 (0.00087) [2022-07-10 21:30:25,553][26022] Updated weights on worker 0-0, policy_version 899324 (0.00082) [2022-07-10 21:30:27,371][26022] Updated weights on worker 0-0, policy_version 899334 (0.00088) [2022-07-10 21:30:28,505][25689] Fps is (10 sec: 5695.8, 60 sec: 5535.9, 300 sec: 5536.4). Total num frames: 920924160. Throughput: 0: 5806.0. Samples: 920928180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:28,506][25689] Avg episode reward: [(0, '0.041')] [2022-07-10 21:30:29,154][26022] Updated weights on worker 0-0, policy_version 899344 (0.00081) [2022-07-10 21:30:30,980][26022] Updated weights on worker 0-0, policy_version 899354 (0.00083) [2022-07-10 21:30:32,914][26022] Updated weights on worker 0-0, policy_version 899364 (0.00085) [2022-07-10 21:30:33,612][25689] Fps is (10 sec: 5464.4, 60 sec: 5513.4, 300 sec: 5538.0). Total num frames: 920951808. Throughput: 0: 5826.1. Samples: 920961716. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:33,613][25689] Avg episode reward: [(0, '0.263')] [2022-07-10 21:30:34,648][26022] Updated weights on worker 0-0, policy_version 899374 (0.00091) [2022-07-10 21:30:36,466][26022] Updated weights on worker 0-0, policy_version 899384 (0.00092) [2022-07-10 21:30:38,415][26022] Updated weights on worker 0-0, policy_version 899394 (0.00086) [2022-07-10 21:30:38,646][25689] Fps is (10 sec: 5553.7, 60 sec: 5531.3, 300 sec: 5541.3). Total num frames: 920980480. Throughput: 0: 5824.7. Samples: 920978704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:38,647][25689] Avg episode reward: [(0, '0.347')] [2022-07-10 21:30:40,222][26022] Updated weights on worker 0-0, policy_version 899404 (0.00086) [2022-07-10 21:30:42,156][26022] Updated weights on worker 0-0, policy_version 899414 (0.00095) [2022-07-10 21:30:43,668][25689] Fps is (10 sec: 5702.4, 60 sec: 5530.7, 300 sec: 5541.0). Total num frames: 921009152. Throughput: 0: 5817.0. Samples: 921011868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:43,669][25689] Avg episode reward: [(0, '-0.155')] [2022-07-10 21:30:43,957][26022] Updated weights on worker 0-0, policy_version 899424 (0.00093) [2022-07-10 21:30:45,670][26022] Updated weights on worker 0-0, policy_version 899434 (0.00093) [2022-07-10 21:30:47,755][26022] Updated weights on worker 0-0, policy_version 899444 (0.00370) [2022-07-10 21:30:48,685][25689] Fps is (10 sec: 5610.5, 60 sec: 5546.8, 300 sec: 5541.4). Total num frames: 921036800. Throughput: 0: 5799.3. Samples: 921045206. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:48,685][25689] Avg episode reward: [(0, '0.178')] [2022-07-10 21:30:49,469][26022] Updated weights on worker 0-0, policy_version 899454 (0.00100) [2022-07-10 21:30:51,299][26022] Updated weights on worker 0-0, policy_version 899464 (0.00088) [2022-07-10 21:30:53,351][26022] Updated weights on worker 0-0, policy_version 899474 (0.00115) [2022-07-10 21:30:53,754][25689] Fps is (10 sec: 5381.0, 60 sec: 5516.8, 300 sec: 5537.1). Total num frames: 921063424. Throughput: 0: 4968.0. Samples: 921061782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:53,755][25689] Avg episode reward: [(0, '0.502')] [2022-07-10 21:30:54,955][26022] Updated weights on worker 0-0, policy_version 899484 (0.00081) [2022-07-10 21:30:56,963][26022] Updated weights on worker 0-0, policy_version 899494 (0.00085) [2022-07-10 21:30:58,566][26022] Updated weights on worker 0-0, policy_version 899504 (0.00077) [2022-07-10 21:30:58,787][25689] Fps is (10 sec: 5473.7, 60 sec: 5515.6, 300 sec: 5537.1). Total num frames: 921092096. Throughput: 0: 5792.0. Samples: 921095358. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:30:58,787][25689] Avg episode reward: [(0, '-0.497')] [2022-07-10 21:31:00,680][26022] Updated weights on worker 0-0, policy_version 899514 (0.00084) [2022-07-10 21:31:02,734][26022] Updated weights on worker 0-0, policy_version 899524 (0.00086) [2022-07-10 21:31:03,794][25689] Fps is (10 sec: 5507.7, 60 sec: 5551.8, 300 sec: 5540.8). Total num frames: 921118720. Throughput: 0: 5712.9. Samples: 921126844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:03,801][25689] Avg episode reward: [(0, '-0.202')] [2022-07-10 21:31:04,479][26022] Updated weights on worker 0-0, policy_version 899534 (0.00095) [2022-07-10 21:31:06,584][26022] Updated weights on worker 0-0, policy_version 899544 (0.00090) [2022-07-10 21:31:06,973][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:31:06,986][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000899548_921137152.pth [2022-07-10 21:31:06,987][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000897598_919140352.pth [2022-07-10 21:31:08,330][26022] Updated weights on worker 0-0, policy_version 899554 (0.00094) [2022-07-10 21:31:08,823][25689] Fps is (10 sec: 5305.7, 60 sec: 5515.7, 300 sec: 5534.7). Total num frames: 921145344. Throughput: 0: 4883.4. Samples: 921143548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:08,824][25689] Avg episode reward: [(0, '0.051')] [2022-07-10 21:31:10,207][26022] Updated weights on worker 0-0, policy_version 899564 (0.00100) [2022-07-10 21:31:12,027][26022] Updated weights on worker 0-0, policy_version 899574 (0.00083) [2022-07-10 21:31:13,917][25689] Fps is (10 sec: 5462.6, 60 sec: 5515.9, 300 sec: 5537.1). Total num frames: 921174016. Throughput: 0: 5682.9. Samples: 921176362. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:13,917][25689] Avg episode reward: [(0, '-0.231')] [2022-07-10 21:31:13,935][26022] Updated weights on worker 0-0, policy_version 899584 (0.00090) [2022-07-10 21:31:15,599][26022] Updated weights on worker 0-0, policy_version 899594 (0.00089) [2022-07-10 21:31:17,660][26022] Updated weights on worker 0-0, policy_version 899604 (0.00088) [2022-07-10 21:31:18,943][25689] Fps is (10 sec: 5565.2, 60 sec: 5534.2, 300 sec: 5533.3). Total num frames: 921201664. Throughput: 0: 5680.1. Samples: 921209846. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:18,943][25689] Avg episode reward: [(0, '-0.432')] [2022-07-10 21:31:19,392][26022] Updated weights on worker 0-0, policy_version 899614 (0.00090) [2022-07-10 21:31:21,263][26022] Updated weights on worker 0-0, policy_version 899624 (0.00083) [2022-07-10 21:31:23,000][26022] Updated weights on worker 0-0, policy_version 899634 (0.00090) [2022-07-10 21:31:23,953][25689] Fps is (10 sec: 5509.7, 60 sec: 5504.5, 300 sec: 5533.6). Total num frames: 921229312. Throughput: 0: 4949.3. Samples: 921226616. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:23,953][25689] Avg episode reward: [(0, '-0.960')] [2022-07-10 21:31:24,710][26022] Updated weights on worker 0-0, policy_version 899644 (0.00098) [2022-07-10 21:31:26,750][26022] Updated weights on worker 0-0, policy_version 899654 (0.00097) [2022-07-10 21:31:28,480][26022] Updated weights on worker 0-0, policy_version 899664 (0.00092) [2022-07-10 21:31:28,975][25689] Fps is (10 sec: 5511.9, 60 sec: 5503.6, 300 sec: 5531.7). Total num frames: 921256960. Throughput: 0: 5777.1. Samples: 921259968. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:28,975][25689] Avg episode reward: [(0, '-0.568')] [2022-07-10 21:31:30,358][26022] Updated weights on worker 0-0, policy_version 899674 (0.00081) [2022-07-10 21:31:32,248][26022] Updated weights on worker 0-0, policy_version 899684 (0.00083) [2022-07-10 21:31:34,031][25689] Fps is (10 sec: 5689.8, 60 sec: 5542.1, 300 sec: 5537.6). Total num frames: 921286656. Throughput: 0: 5821.6. Samples: 921293460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:34,032][25689] Avg episode reward: [(0, '0.018')] [2022-07-10 21:31:34,033][26022] Updated weights on worker 0-0, policy_version 899694 (0.00127) [2022-07-10 21:31:35,966][26022] Updated weights on worker 0-0, policy_version 899704 (0.00088) [2022-07-10 21:31:37,775][26022] Updated weights on worker 0-0, policy_version 899714 (0.00083) [2022-07-10 21:31:39,043][25689] Fps is (10 sec: 5695.6, 60 sec: 5527.2, 300 sec: 5534.1). Total num frames: 921314304. Throughput: 0: 4990.6. Samples: 921310158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:39,043][25689] Avg episode reward: [(0, '0.380')] [2022-07-10 21:31:39,490][26022] Updated weights on worker 0-0, policy_version 899724 (0.00080) [2022-07-10 21:31:41,530][26022] Updated weights on worker 0-0, policy_version 899734 (0.00098) [2022-07-10 21:31:43,211][26022] Updated weights on worker 0-0, policy_version 899744 (0.00094) [2022-07-10 21:31:44,120][25689] Fps is (10 sec: 5379.3, 60 sec: 5488.3, 300 sec: 5533.0). Total num frames: 921340928. Throughput: 0: 5801.9. Samples: 921343626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:44,121][25689] Avg episode reward: [(0, '0.598')] [2022-07-10 21:31:45,307][26022] Updated weights on worker 0-0, policy_version 899754 (0.00083) [2022-07-10 21:31:46,743][26022] Updated weights on worker 0-0, policy_version 899764 (0.00091) [2022-07-10 21:31:48,906][26022] Updated weights on worker 0-0, policy_version 899774 (0.00086) [2022-07-10 21:31:49,161][25689] Fps is (10 sec: 5464.8, 60 sec: 5502.9, 300 sec: 5532.9). Total num frames: 921369600. Throughput: 0: 5806.7. Samples: 921377186. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:49,162][25689] Avg episode reward: [(0, '0.915')] [2022-07-10 21:31:50,579][26022] Updated weights on worker 0-0, policy_version 899784 (0.00091) [2022-07-10 21:31:52,422][26022] Updated weights on worker 0-0, policy_version 899794 (0.00089) [2022-07-10 21:31:54,243][25689] Fps is (10 sec: 5664.9, 60 sec: 5535.7, 300 sec: 5531.5). Total num frames: 921398272. Throughput: 0: 4966.9. Samples: 921393850. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:54,243][25689] Avg episode reward: [(0, '0.999')] [2022-07-10 21:31:54,459][26022] Updated weights on worker 0-0, policy_version 899804 (0.00092) [2022-07-10 21:31:55,929][26022] Updated weights on worker 0-0, policy_version 899814 (0.00091) [2022-07-10 21:31:58,147][26022] Updated weights on worker 0-0, policy_version 899824 (0.00060) [2022-07-10 21:31:59,273][25689] Fps is (10 sec: 5671.0, 60 sec: 5535.9, 300 sec: 5538.0). Total num frames: 921426944. Throughput: 0: 5798.1. Samples: 921427456. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:31:59,274][25689] Avg episode reward: [(0, '0.908')] [2022-07-10 21:31:59,587][26022] Updated weights on worker 0-0, policy_version 899834 (0.00090) [2022-07-10 21:32:02,176][26022] Updated weights on worker 0-0, policy_version 899844 (0.00088) [2022-07-10 21:32:03,753][26022] Updated weights on worker 0-0, policy_version 899854 (0.00078) [2022-07-10 21:32:04,296][25689] Fps is (10 sec: 5398.5, 60 sec: 5517.6, 300 sec: 5534.2). Total num frames: 921452544. Throughput: 0: 5729.3. Samples: 921459220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:04,299][25689] Avg episode reward: [(0, '1.236')] [2022-07-10 21:32:05,520][26022] Updated weights on worker 0-0, policy_version 899864 (0.00093) [2022-07-10 21:32:07,443][26022] Updated weights on worker 0-0, policy_version 899874 (0.00084) [2022-07-10 21:32:09,262][26022] Updated weights on worker 0-0, policy_version 899884 (0.00092) [2022-07-10 21:32:09,347][25689] Fps is (10 sec: 5387.4, 60 sec: 5549.4, 300 sec: 5534.5). Total num frames: 921481216. Throughput: 0: 4897.3. Samples: 921476040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:09,347][25689] Avg episode reward: [(0, '1.362')] [2022-07-10 21:32:11,185][26022] Updated weights on worker 0-0, policy_version 899894 (0.00097) [2022-07-10 21:32:12,945][26022] Updated weights on worker 0-0, policy_version 899904 (0.00092) [2022-07-10 21:32:14,473][25689] Fps is (10 sec: 5533.9, 60 sec: 5529.5, 300 sec: 5533.3). Total num frames: 921508864. Throughput: 0: 5702.2. Samples: 921509208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:14,474][25689] Avg episode reward: [(0, '1.366')] [2022-07-10 21:32:14,847][26022] Updated weights on worker 0-0, policy_version 899914 (0.00093) [2022-07-10 21:32:16,744][26022] Updated weights on worker 0-0, policy_version 899924 (0.00088) [2022-07-10 21:32:18,457][26022] Updated weights on worker 0-0, policy_version 899934 (0.00095) [2022-07-10 21:32:19,539][25689] Fps is (10 sec: 5525.6, 60 sec: 5542.7, 300 sec: 5529.2). Total num frames: 921537536. Throughput: 0: 5677.5. Samples: 921542518. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:19,540][25689] Avg episode reward: [(0, '-0.030')] [2022-07-10 21:32:20,379][26022] Updated weights on worker 0-0, policy_version 899944 (0.00095) [2022-07-10 21:32:22,235][26022] Updated weights on worker 0-0, policy_version 899954 (0.00086) [2022-07-10 21:32:24,220][26022] Updated weights on worker 0-0, policy_version 899964 (0.00083) [2022-07-10 21:32:24,559][25689] Fps is (10 sec: 5584.3, 60 sec: 5541.9, 300 sec: 5536.4). Total num frames: 921565184. Throughput: 0: 4927.9. Samples: 921559072. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:24,560][25689] Avg episode reward: [(0, '0.115')] [2022-07-10 21:32:25,771][26022] Updated weights on worker 0-0, policy_version 899974 (0.00082) [2022-07-10 21:32:27,827][26022] Updated weights on worker 0-0, policy_version 899984 (0.00094) [2022-07-10 21:32:29,476][26022] Updated weights on worker 0-0, policy_version 899994 (0.00088) [2022-07-10 21:32:29,560][25689] Fps is (10 sec: 5620.3, 60 sec: 5560.7, 300 sec: 5535.5). Total num frames: 921593856. Throughput: 0: 5756.1. Samples: 921592390. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:29,561][25689] Avg episode reward: [(0, '0.029')] [2022-07-10 21:32:31,520][26022] Updated weights on worker 0-0, policy_version 900004 (0.00084) [2022-07-10 21:32:33,419][26022] Updated weights on worker 0-0, policy_version 900014 (0.00087) [2022-07-10 21:32:34,664][25689] Fps is (10 sec: 5472.1, 60 sec: 5505.7, 300 sec: 5535.2). Total num frames: 921620480. Throughput: 0: 5762.9. Samples: 921625564. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:34,664][25689] Avg episode reward: [(0, '-0.317')] [2022-07-10 21:32:35,096][26022] Updated weights on worker 0-0, policy_version 900024 (0.00093) [2022-07-10 21:32:37,167][26022] Updated weights on worker 0-0, policy_version 900034 (0.00056) [2022-07-10 21:32:38,824][26022] Updated weights on worker 0-0, policy_version 900044 (0.00087) [2022-07-10 21:32:39,711][25689] Fps is (10 sec: 5346.7, 60 sec: 5502.5, 300 sec: 5531.9). Total num frames: 921648128. Throughput: 0: 4944.9. Samples: 921642264. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:39,711][25689] Avg episode reward: [(0, '-0.289')] [2022-07-10 21:32:40,853][26022] Updated weights on worker 0-0, policy_version 900054 (0.00084) [2022-07-10 21:32:42,706][26022] Updated weights on worker 0-0, policy_version 900064 (0.00085) [2022-07-10 21:32:44,444][26022] Updated weights on worker 0-0, policy_version 900074 (0.00085) [2022-07-10 21:32:44,744][25689] Fps is (10 sec: 5586.9, 60 sec: 5540.2, 300 sec: 5531.9). Total num frames: 921676800. Throughput: 0: 5755.5. Samples: 921675252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:44,745][25689] Avg episode reward: [(0, '-0.131')] [2022-07-10 21:32:46,461][26022] Updated weights on worker 0-0, policy_version 900084 (0.00088) [2022-07-10 21:32:48,119][26022] Updated weights on worker 0-0, policy_version 900094 (0.00108) [2022-07-10 21:32:49,799][25689] Fps is (10 sec: 5582.8, 60 sec: 5522.1, 300 sec: 5532.8). Total num frames: 921704448. Throughput: 0: 5735.7. Samples: 921708474. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:49,800][25689] Avg episode reward: [(0, '0.554')] [2022-07-10 21:32:50,230][26022] Updated weights on worker 0-0, policy_version 900104 (0.00094) [2022-07-10 21:32:52,062][26022] Updated weights on worker 0-0, policy_version 900114 (0.00084) [2022-07-10 21:32:53,747][26022] Updated weights on worker 0-0, policy_version 900124 (0.00095) [2022-07-10 21:32:54,906][25689] Fps is (10 sec: 5542.5, 60 sec: 5519.8, 300 sec: 5532.0). Total num frames: 921733120. Throughput: 0: 5751.8. Samples: 921741994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:54,906][25689] Avg episode reward: [(0, '0.788')] [2022-07-10 21:32:55,599][26022] Updated weights on worker 0-0, policy_version 900134 (0.00097) [2022-07-10 21:32:57,387][26022] Updated weights on worker 0-0, policy_version 900144 (0.00095) [2022-07-10 21:32:59,259][26022] Updated weights on worker 0-0, policy_version 900154 (0.00094) [2022-07-10 21:32:59,916][25689] Fps is (10 sec: 5465.8, 60 sec: 5487.9, 300 sec: 5529.0). Total num frames: 921759744. Throughput: 0: 5768.2. Samples: 921758810. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-10 21:32:59,916][25689] Avg episode reward: [(0, '0.156')] [2022-07-10 21:33:01,009][26022] Updated weights on worker 0-0, policy_version 900164 (0.00414) [2022-07-10 21:33:03,424][26022] Updated weights on worker 0-0, policy_version 900174 (0.00082) [2022-07-10 21:33:04,935][25689] Fps is (10 sec: 5411.4, 60 sec: 5522.0, 300 sec: 5528.7). Total num frames: 921787392. Throughput: 0: 5702.2. Samples: 921790384. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:04,936][25689] Avg episode reward: [(0, '-1.895')] [2022-07-10 21:33:05,017][26022] Updated weights on worker 0-0, policy_version 900184 (0.00087) [2022-07-10 21:33:06,888][26022] Updated weights on worker 0-0, policy_version 900194 (0.00085) [2022-07-10 21:33:07,126][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:33:07,143][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000900195_921799680.pth [2022-07-10 21:33:07,143][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000898249_919806976.pth [2022-07-10 21:33:08,719][26022] Updated weights on worker 0-0, policy_version 900204 (0.00094) [2022-07-10 21:33:09,946][25689] Fps is (10 sec: 5615.1, 60 sec: 5525.6, 300 sec: 5533.4). Total num frames: 921816064. Throughput: 0: 5743.3. Samples: 921824184. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:09,946][25689] Avg episode reward: [(0, '-2.207')] [2022-07-10 21:33:10,666][26022] Updated weights on worker 0-0, policy_version 900214 (0.00088) [2022-07-10 21:33:12,587][26022] Updated weights on worker 0-0, policy_version 900224 (0.00096) [2022-07-10 21:33:14,083][26022] Updated weights on worker 0-0, policy_version 900234 (0.00085) [2022-07-10 21:33:15,009][25689] Fps is (10 sec: 5489.1, 60 sec: 5514.5, 300 sec: 5529.7). Total num frames: 921842688. Throughput: 0: 4914.4. Samples: 921840788. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:15,011][25689] Avg episode reward: [(0, '-1.997')] [2022-07-10 21:33:16,203][26022] Updated weights on worker 0-0, policy_version 900244 (0.00084) [2022-07-10 21:33:17,983][26022] Updated weights on worker 0-0, policy_version 900254 (0.00083) [2022-07-10 21:33:19,895][26022] Updated weights on worker 0-0, policy_version 900264 (0.00103) [2022-07-10 21:33:20,026][25689] Fps is (10 sec: 5485.8, 60 sec: 5519.0, 300 sec: 5529.4). Total num frames: 921871360. Throughput: 0: 5738.7. Samples: 921874216. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:20,028][25689] Avg episode reward: [(0, '-2.285')] [2022-07-10 21:33:21,565][26022] Updated weights on worker 0-0, policy_version 900274 (0.00085) [2022-07-10 21:33:23,625][26022] Updated weights on worker 0-0, policy_version 900284 (0.00082) [2022-07-10 21:33:25,055][25689] Fps is (10 sec: 5707.9, 60 sec: 5535.0, 300 sec: 5532.5). Total num frames: 921900032. Throughput: 0: 5832.4. Samples: 921907732. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:25,056][25689] Avg episode reward: [(0, '-2.129')] [2022-07-10 21:33:25,190][26022] Updated weights on worker 0-0, policy_version 900294 (0.00087) [2022-07-10 21:33:27,092][26022] Updated weights on worker 0-0, policy_version 900304 (0.00089) [2022-07-10 21:33:28,892][26022] Updated weights on worker 0-0, policy_version 900314 (0.00086) [2022-07-10 21:33:30,080][25689] Fps is (10 sec: 5397.8, 60 sec: 5482.1, 300 sec: 5523.1). Total num frames: 921925632. Throughput: 0: 4965.8. Samples: 921924166. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:30,082][25689] Avg episode reward: [(0, '-1.881')] [2022-07-10 21:33:30,989][26022] Updated weights on worker 0-0, policy_version 900324 (0.00094) [2022-07-10 21:33:32,615][26022] Updated weights on worker 0-0, policy_version 900334 (0.00096) [2022-07-10 21:33:34,581][26022] Updated weights on worker 0-0, policy_version 900344 (0.00092) [2022-07-10 21:33:35,156][25689] Fps is (10 sec: 5474.5, 60 sec: 5535.4, 300 sec: 5529.0). Total num frames: 921955328. Throughput: 0: 5786.8. Samples: 921957376. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:35,158][25689] Avg episode reward: [(0, '0.193')] [2022-07-10 21:33:36,447][26022] Updated weights on worker 0-0, policy_version 900354 (0.00090) [2022-07-10 21:33:38,344][26022] Updated weights on worker 0-0, policy_version 900364 (0.00091) [2022-07-10 21:33:40,042][26022] Updated weights on worker 0-0, policy_version 900374 (0.00085) [2022-07-10 21:33:40,160][25689] Fps is (10 sec: 5790.8, 60 sec: 5556.3, 300 sec: 5532.8). Total num frames: 921984000. Throughput: 0: 5783.5. Samples: 921990660. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:40,161][25689] Avg episode reward: [(0, '0.185')] [2022-07-10 21:33:42,112][26022] Updated weights on worker 0-0, policy_version 900384 (0.00091) [2022-07-10 21:33:43,588][26022] Updated weights on worker 0-0, policy_version 900394 (0.00092) [2022-07-10 21:33:45,166][25689] Fps is (10 sec: 5422.0, 60 sec: 5508.0, 300 sec: 5523.1). Total num frames: 922009600. Throughput: 0: 4959.9. Samples: 922007476. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:45,167][25689] Avg episode reward: [(0, '-0.781')] [2022-07-10 21:33:45,717][26022] Updated weights on worker 0-0, policy_version 900404 (0.00095) [2022-07-10 21:33:47,784][26022] Updated weights on worker 0-0, policy_version 900414 (0.00096) [2022-07-10 21:33:49,388][26022] Updated weights on worker 0-0, policy_version 900424 (0.00114) [2022-07-10 21:33:50,197][25689] Fps is (10 sec: 5406.9, 60 sec: 5527.1, 300 sec: 5524.0). Total num frames: 922038272. Throughput: 0: 5794.8. Samples: 922040740. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:50,199][25689] Avg episode reward: [(0, '-1.018')] [2022-07-10 21:33:51,315][26022] Updated weights on worker 0-0, policy_version 900434 (0.00092) [2022-07-10 21:33:53,126][26022] Updated weights on worker 0-0, policy_version 900444 (0.00084) [2022-07-10 21:33:54,717][26022] Updated weights on worker 0-0, policy_version 900454 (0.00225) [2022-07-10 21:33:55,277][25689] Fps is (10 sec: 5671.3, 60 sec: 5529.6, 300 sec: 5522.6). Total num frames: 922066944. Throughput: 0: 5809.5. Samples: 922074268. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:33:55,277][25689] Avg episode reward: [(0, '-1.264')] [2022-07-10 21:33:56,862][26022] Updated weights on worker 0-0, policy_version 900464 (0.00089) [2022-07-10 21:33:58,629][26022] Updated weights on worker 0-0, policy_version 900474 (0.00085) [2022-07-10 21:34:00,275][26022] Updated weights on worker 0-0, policy_version 900484 (0.00088) [2022-07-10 21:34:00,373][25689] Fps is (10 sec: 5635.5, 60 sec: 5555.6, 300 sec: 5531.5). Total num frames: 922095616. Throughput: 0: 4964.1. Samples: 922091000. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:00,373][25689] Avg episode reward: [(0, '-1.442')] [2022-07-10 21:34:02,673][26022] Updated weights on worker 0-0, policy_version 900494 (0.00091) [2022-07-10 21:34:04,415][26022] Updated weights on worker 0-0, policy_version 900504 (0.00090) [2022-07-10 21:34:05,386][25689] Fps is (10 sec: 5267.4, 60 sec: 5505.3, 300 sec: 5521.3). Total num frames: 922120192. Throughput: 0: 5678.4. Samples: 922122294. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:05,386][25689] Avg episode reward: [(0, '-1.587')] [2022-07-10 21:34:06,271][26022] Updated weights on worker 0-0, policy_version 900514 (0.00096) [2022-07-10 21:34:08,072][26022] Updated weights on worker 0-0, policy_version 900524 (0.00085) [2022-07-10 21:34:09,924][26022] Updated weights on worker 0-0, policy_version 900534 (0.00078) [2022-07-10 21:34:10,437][25689] Fps is (10 sec: 5290.7, 60 sec: 5501.6, 300 sec: 5524.5). Total num frames: 922148864. Throughput: 0: 5695.9. Samples: 922156024. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:10,437][25689] Avg episode reward: [(0, '-1.964')] [2022-07-10 21:34:11,904][26022] Updated weights on worker 0-0, policy_version 900544 (0.00086) [2022-07-10 21:34:13,723][26022] Updated weights on worker 0-0, policy_version 900554 (0.00086) [2022-07-10 21:34:15,405][26022] Updated weights on worker 0-0, policy_version 900564 (0.00093) [2022-07-10 21:34:15,495][25689] Fps is (10 sec: 5672.6, 60 sec: 5536.0, 300 sec: 5528.0). Total num frames: 922177536. Throughput: 0: 4862.9. Samples: 922172586. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:15,495][25689] Avg episode reward: [(0, '-1.197')] [2022-07-10 21:34:17,430][26022] Updated weights on worker 0-0, policy_version 900574 (0.00090) [2022-07-10 21:34:19,159][26022] Updated weights on worker 0-0, policy_version 900584 (0.00089) [2022-07-10 21:34:20,516][25689] Fps is (10 sec: 5486.0, 60 sec: 5501.7, 300 sec: 5524.2). Total num frames: 922204160. Throughput: 0: 5704.6. Samples: 922205912. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:20,517][25689] Avg episode reward: [(0, '-0.362')] [2022-07-10 21:34:21,243][26022] Updated weights on worker 0-0, policy_version 900594 (0.00093) [2022-07-10 21:34:22,963][26022] Updated weights on worker 0-0, policy_version 900604 (0.00094) [2022-07-10 21:34:24,622][26022] Updated weights on worker 0-0, policy_version 900614 (0.00088) [2022-07-10 21:34:25,534][25689] Fps is (10 sec: 5507.8, 60 sec: 5502.7, 300 sec: 5524.2). Total num frames: 922232832. Throughput: 0: 5808.5. Samples: 922239328. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:25,535][25689] Avg episode reward: [(0, '-0.080')] [2022-07-10 21:34:26,735][26022] Updated weights on worker 0-0, policy_version 900624 (0.00085) [2022-07-10 21:34:28,458][26022] Updated weights on worker 0-0, policy_version 900634 (0.00092) [2022-07-10 21:34:30,430][26022] Updated weights on worker 0-0, policy_version 900644 (0.00092) [2022-07-10 21:34:30,561][25689] Fps is (10 sec: 5607.0, 60 sec: 5536.4, 300 sec: 5521.1). Total num frames: 922260480. Throughput: 0: 4973.4. Samples: 922256110. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:30,562][25689] Avg episode reward: [(0, '-0.649')] [2022-07-10 21:34:32,054][26022] Updated weights on worker 0-0, policy_version 900654 (0.00086) [2022-07-10 21:34:34,144][26022] Updated weights on worker 0-0, policy_version 900664 (0.00096) [2022-07-10 21:34:35,624][25689] Fps is (10 sec: 5480.6, 60 sec: 5503.8, 300 sec: 5520.7). Total num frames: 922288128. Throughput: 0: 5785.1. Samples: 922289036. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:35,624][25689] Avg episode reward: [(0, '-0.269')] [2022-07-10 21:34:35,820][26022] Updated weights on worker 0-0, policy_version 900674 (0.00087) [2022-07-10 21:34:37,881][26022] Updated weights on worker 0-0, policy_version 900684 (0.00079) [2022-07-10 21:34:39,438][26022] Updated weights on worker 0-0, policy_version 900694 (0.00088) [2022-07-10 21:34:40,625][25689] Fps is (10 sec: 5494.3, 60 sec: 5487.0, 300 sec: 5517.5). Total num frames: 922315776. Throughput: 0: 5797.7. Samples: 922322496. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:40,627][25689] Avg episode reward: [(0, '-0.498')] [2022-07-10 21:34:41,512][26022] Updated weights on worker 0-0, policy_version 900704 (0.00090) [2022-07-10 21:34:43,173][26022] Updated weights on worker 0-0, policy_version 900714 (0.00088) [2022-07-10 21:34:45,151][26022] Updated weights on worker 0-0, policy_version 900724 (0.00088) [2022-07-10 21:34:45,630][25689] Fps is (10 sec: 5628.7, 60 sec: 5538.0, 300 sec: 5524.5). Total num frames: 922344448. Throughput: 0: 4967.4. Samples: 922339150. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:45,630][25689] Avg episode reward: [(0, '-0.650')] [2022-07-10 21:34:46,883][26022] Updated weights on worker 0-0, policy_version 900734 (0.00084) [2022-07-10 21:34:48,591][26022] Updated weights on worker 0-0, policy_version 900744 (0.00091) [2022-07-10 21:34:50,608][26022] Updated weights on worker 0-0, policy_version 900754 (0.00088) [2022-07-10 21:34:50,632][25689] Fps is (10 sec: 5628.0, 60 sec: 5523.7, 300 sec: 5523.1). Total num frames: 922372096. Throughput: 0: 5811.2. Samples: 922372748. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:50,633][25689] Avg episode reward: [(0, '-0.554')] [2022-07-10 21:34:52,182][26022] Updated weights on worker 0-0, policy_version 900764 (0.00088) [2022-07-10 21:34:54,105][26022] Updated weights on worker 0-0, policy_version 900774 (0.00091) [2022-07-10 21:34:55,703][25689] Fps is (10 sec: 5591.1, 60 sec: 5524.5, 300 sec: 5522.1). Total num frames: 922400768. Throughput: 0: 5839.6. Samples: 922406290. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:34:55,703][25689] Avg episode reward: [(0, '-0.214')] [2022-07-10 21:34:56,339][26022] Updated weights on worker 0-0, policy_version 900784 (0.00210) [2022-07-10 21:34:57,921][26022] Updated weights on worker 0-0, policy_version 900794 (0.00093) [2022-07-10 21:34:59,752][26022] Updated weights on worker 0-0, policy_version 900804 (0.00087) [2022-07-10 21:35:00,731][25689] Fps is (10 sec: 5475.8, 60 sec: 5496.8, 300 sec: 5529.1). Total num frames: 922427392. Throughput: 0: 5001.5. Samples: 922423054. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:00,731][25689] Avg episode reward: [(0, '-1.152')] [2022-07-10 21:35:01,427][26022] Updated weights on worker 0-0, policy_version 900814 (0.00084) [2022-07-10 21:35:03,746][26022] Updated weights on worker 0-0, policy_version 900824 (0.00092) [2022-07-10 21:35:05,734][25689] Fps is (10 sec: 5206.0, 60 sec: 5514.6, 300 sec: 5518.8). Total num frames: 922452992. Throughput: 0: 5739.2. Samples: 922454536. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:05,735][25689] Avg episode reward: [(0, '-0.945')] [2022-07-10 21:35:05,822][26022] Updated weights on worker 0-0, policy_version 900834 (0.00082) [2022-07-10 21:35:07,287][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:35:07,303][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000900843_922463232.pth [2022-07-10 21:35:07,304][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000898899_920472576.pth [2022-07-10 21:35:07,417][26022] Updated weights on worker 0-0, policy_version 900844 (0.00084) [2022-07-10 21:35:09,412][26022] Updated weights on worker 0-0, policy_version 900854 (0.00093) [2022-07-10 21:35:10,780][25689] Fps is (10 sec: 5502.6, 60 sec: 5532.1, 300 sec: 5523.2). Total num frames: 922482688. Throughput: 0: 5715.7. Samples: 922487904. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:10,780][25689] Avg episode reward: [(0, '-0.865')] [2022-07-10 21:35:10,999][26022] Updated weights on worker 0-0, policy_version 900864 (0.00086) [2022-07-10 21:35:13,162][26022] Updated weights on worker 0-0, policy_version 900874 (0.00094) [2022-07-10 21:35:15,095][26022] Updated weights on worker 0-0, policy_version 900884 (0.00094) [2022-07-10 21:35:15,839][25689] Fps is (10 sec: 5675.2, 60 sec: 5515.0, 300 sec: 5526.3). Total num frames: 922510336. Throughput: 0: 4893.7. Samples: 922504830. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:15,839][25689] Avg episode reward: [(0, '-1.253')] [2022-07-10 21:35:16,704][26022] Updated weights on worker 0-0, policy_version 900894 (0.00088) [2022-07-10 21:35:18,712][26022] Updated weights on worker 0-0, policy_version 900904 (0.00094) [2022-07-10 21:35:20,245][26022] Updated weights on worker 0-0, policy_version 900914 (0.00090) [2022-07-10 21:35:20,896][25689] Fps is (10 sec: 5567.0, 60 sec: 5545.7, 300 sec: 5522.8). Total num frames: 922539008. Throughput: 0: 5713.0. Samples: 922538262. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:20,897][25689] Avg episode reward: [(0, '-2.206')] [2022-07-10 21:35:22,499][26022] Updated weights on worker 0-0, policy_version 900924 (0.00083) [2022-07-10 21:35:23,987][26022] Updated weights on worker 0-0, policy_version 900934 (0.00516) [2022-07-10 21:35:25,913][25689] Fps is (10 sec: 5489.0, 60 sec: 5511.9, 300 sec: 5519.3). Total num frames: 922565632. Throughput: 0: 5804.8. Samples: 922571668. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:25,913][25689] Avg episode reward: [(0, '-2.101')] [2022-07-10 21:35:25,963][26022] Updated weights on worker 0-0, policy_version 900944 (0.00086) [2022-07-10 21:35:27,761][26022] Updated weights on worker 0-0, policy_version 900954 (0.00087) [2022-07-10 21:35:29,590][26022] Updated weights on worker 0-0, policy_version 900964 (0.00090) [2022-07-10 21:35:30,927][25689] Fps is (10 sec: 5512.6, 60 sec: 5530.0, 300 sec: 5524.4). Total num frames: 922594304. Throughput: 0: 4974.7. Samples: 922588132. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:30,928][25689] Avg episode reward: [(0, '-1.532')] [2022-07-10 21:35:31,321][26022] Updated weights on worker 0-0, policy_version 900974 (0.00085) [2022-07-10 21:35:33,295][26022] Updated weights on worker 0-0, policy_version 900984 (0.00086) [2022-07-10 21:35:35,038][26022] Updated weights on worker 0-0, policy_version 900994 (0.00087) [2022-07-10 21:35:35,974][25689] Fps is (10 sec: 5394.2, 60 sec: 5497.5, 300 sec: 5513.9). Total num frames: 922619904. Throughput: 0: 5773.1. Samples: 922621072. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:35,974][25689] Avg episode reward: [(0, '-2.474')] [2022-07-10 21:35:36,970][26022] Updated weights on worker 0-0, policy_version 901004 (0.00080) [2022-07-10 21:35:38,996][26022] Updated weights on worker 0-0, policy_version 901014 (0.00085) [2022-07-10 21:35:40,544][26022] Updated weights on worker 0-0, policy_version 901024 (0.00093) [2022-07-10 21:35:40,993][25689] Fps is (10 sec: 5595.1, 60 sec: 5546.8, 300 sec: 5520.8). Total num frames: 922650624. Throughput: 0: 5799.7. Samples: 922654816. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:40,994][25689] Avg episode reward: [(0, '-2.608')] [2022-07-10 21:35:42,550][26022] Updated weights on worker 0-0, policy_version 901034 (0.00087) [2022-07-10 21:35:44,310][26022] Updated weights on worker 0-0, policy_version 901044 (0.00094) [2022-07-10 21:35:46,019][25689] Fps is (10 sec: 5810.6, 60 sec: 5527.9, 300 sec: 5520.6). Total num frames: 922678272. Throughput: 0: 4965.4. Samples: 922671504. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:46,019][25689] Avg episode reward: [(0, '-1.125')] [2022-07-10 21:35:46,219][26022] Updated weights on worker 0-0, policy_version 901054 (0.00094) [2022-07-10 21:35:48,183][26022] Updated weights on worker 0-0, policy_version 901064 (0.00097) [2022-07-10 21:35:49,749][26022] Updated weights on worker 0-0, policy_version 901074 (0.00164) [2022-07-10 21:35:51,031][25689] Fps is (10 sec: 5406.9, 60 sec: 5510.1, 300 sec: 5521.7). Total num frames: 922704896. Throughput: 0: 5804.6. Samples: 922704824. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:51,031][25689] Avg episode reward: [(0, '-0.486')] [2022-07-10 21:35:51,741][26022] Updated weights on worker 0-0, policy_version 901084 (0.00088) [2022-07-10 21:35:53,619][26022] Updated weights on worker 0-0, policy_version 901094 (0.00083) [2022-07-10 21:35:55,317][26022] Updated weights on worker 0-0, policy_version 901104 (0.00088) [2022-07-10 21:35:56,074][25689] Fps is (10 sec: 5601.4, 60 sec: 5529.6, 300 sec: 5525.0). Total num frames: 922734592. Throughput: 0: 5841.1. Samples: 922738476. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:35:56,074][25689] Avg episode reward: [(0, '-0.254')] [2022-07-10 21:35:57,386][26022] Updated weights on worker 0-0, policy_version 901114 (0.00091) [2022-07-10 21:35:59,074][26022] Updated weights on worker 0-0, policy_version 901124 (0.00087) [2022-07-10 21:36:00,997][26022] Updated weights on worker 0-0, policy_version 901134 (0.00084) [2022-07-10 21:36:01,179][25689] Fps is (10 sec: 5549.4, 60 sec: 5522.4, 300 sec: 5523.1). Total num frames: 922761216. Throughput: 0: 4968.8. Samples: 922755118. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:01,181][25689] Avg episode reward: [(0, '-0.143')] [2022-07-10 21:36:03,241][26022] Updated weights on worker 0-0, policy_version 901144 (0.00093) [2022-07-10 21:36:04,919][26022] Updated weights on worker 0-0, policy_version 901154 (0.00092) [2022-07-10 21:36:06,267][25689] Fps is (10 sec: 5223.6, 60 sec: 5531.7, 300 sec: 5522.0). Total num frames: 922787840. Throughput: 0: 5663.1. Samples: 922786172. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:06,269][25689] Avg episode reward: [(0, '0.712')] [2022-07-10 21:36:07,014][26022] Updated weights on worker 0-0, policy_version 901164 (0.00377) [2022-07-10 21:36:08,646][26022] Updated weights on worker 0-0, policy_version 901174 (0.00097) [2022-07-10 21:36:10,759][26022] Updated weights on worker 0-0, policy_version 901184 (0.00098) [2022-07-10 21:36:11,296][25689] Fps is (10 sec: 5364.6, 60 sec: 5499.3, 300 sec: 5519.8). Total num frames: 922815488. Throughput: 0: 5654.6. Samples: 922819418. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:11,297][25689] Avg episode reward: [(0, '0.455')] [2022-07-10 21:36:12,287][26022] Updated weights on worker 0-0, policy_version 901194 (0.00087) [2022-07-10 21:36:14,477][26022] Updated weights on worker 0-0, policy_version 901204 (0.00092) [2022-07-10 21:36:16,023][26022] Updated weights on worker 0-0, policy_version 901214 (0.00093) [2022-07-10 21:36:16,356][25689] Fps is (10 sec: 5582.2, 60 sec: 5516.2, 300 sec: 5522.6). Total num frames: 922844160. Throughput: 0: 5647.4. Samples: 922853022. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:16,357][25689] Avg episode reward: [(0, '0.348')] [2022-07-10 21:36:18,107][26022] Updated weights on worker 0-0, policy_version 901224 (0.00467) [2022-07-10 21:36:19,663][26022] Updated weights on worker 0-0, policy_version 901234 (0.00093) [2022-07-10 21:36:21,391][25689] Fps is (10 sec: 5579.1, 60 sec: 5501.3, 300 sec: 5522.1). Total num frames: 922871808. Throughput: 0: 5671.8. Samples: 922869754. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:21,399][25689] Avg episode reward: [(0, '0.368')] [2022-07-10 21:36:21,735][26022] Updated weights on worker 0-0, policy_version 901244 (0.00087) [2022-07-10 21:36:23,371][26022] Updated weights on worker 0-0, policy_version 901254 (0.00084) [2022-07-10 21:36:25,219][26022] Updated weights on worker 0-0, policy_version 901264 (0.00085) [2022-07-10 21:36:26,405][25689] Fps is (10 sec: 5605.0, 60 sec: 5535.4, 300 sec: 5525.7). Total num frames: 922900480. Throughput: 0: 5844.7. Samples: 922903868. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:26,405][25689] Avg episode reward: [(0, '-1.173')] [2022-07-10 21:36:27,156][26022] Updated weights on worker 0-0, policy_version 901274 (0.00090) [2022-07-10 21:36:28,918][26022] Updated weights on worker 0-0, policy_version 901284 (0.00094) [2022-07-10 21:36:30,739][26022] Updated weights on worker 0-0, policy_version 901294 (0.00090) [2022-07-10 21:36:31,440][25689] Fps is (10 sec: 5706.4, 60 sec: 5533.5, 300 sec: 5522.7). Total num frames: 922929152. Throughput: 0: 5842.5. Samples: 922937110. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:31,441][25689] Avg episode reward: [(0, '-0.890')] [2022-07-10 21:36:32,780][26022] Updated weights on worker 0-0, policy_version 901304 (0.00089) [2022-07-10 21:36:34,363][26022] Updated weights on worker 0-0, policy_version 901314 (0.00085) [2022-07-10 21:36:36,322][26022] Updated weights on worker 0-0, policy_version 901324 (0.00087) [2022-07-10 21:36:36,550][25689] Fps is (10 sec: 5652.4, 60 sec: 5578.5, 300 sec: 5524.3). Total num frames: 922957824. Throughput: 0: 5002.5. Samples: 922954040. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:36,550][25689] Avg episode reward: [(0, '-0.395')] [2022-07-10 21:36:38,035][26022] Updated weights on worker 0-0, policy_version 901334 (0.00086) [2022-07-10 21:36:39,912][26022] Updated weights on worker 0-0, policy_version 901344 (0.00086) [2022-07-10 21:36:41,574][25689] Fps is (10 sec: 5557.6, 60 sec: 5527.3, 300 sec: 5528.7). Total num frames: 922985472. Throughput: 0: 5835.8. Samples: 922987538. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:41,582][25689] Avg episode reward: [(0, '-0.054')] [2022-07-10 21:36:41,773][26022] Updated weights on worker 0-0, policy_version 901354 (0.00090) [2022-07-10 21:36:43,524][26022] Updated weights on worker 0-0, policy_version 901364 (0.00092) [2022-07-10 21:36:45,402][26022] Updated weights on worker 0-0, policy_version 901374 (0.00088) [2022-07-10 21:36:46,614][25689] Fps is (10 sec: 5494.5, 60 sec: 5526.0, 300 sec: 5525.3). Total num frames: 923013120. Throughput: 0: 5797.8. Samples: 923021036. Policy #0 lag: (min: 0.0, avg: 7.2, max: 19.0) [2022-07-10 21:36:46,614][25689] Avg episode reward: [(0, '0.176')] [2022-07-10 21:36:47,147][26022] Updated weights on worker 0-0, policy_version 901384 (0.00085) [2022-07-10 21:36:49,038][26022] Updated weights on worker 0-0, policy_version 901394 (0.00102) [2022-07-10 21:36:51,003][26022] Updated weights on worker 0-0, policy_version 901404 (0.00095) [2022-07-10 21:36:51,635][25689] Fps is (10 sec: 5496.1, 60 sec: 5542.0, 300 sec: 5523.0). Total num frames: 923040768. Throughput: 0: 4975.1. Samples: 923037582. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:36:51,635][25689] Avg episode reward: [(0, '0.112')] [2022-07-10 21:36:52,872][26022] Updated weights on worker 0-0, policy_version 901414 (0.00082) [2022-07-10 21:36:54,490][26022] Updated weights on worker 0-0, policy_version 901424 (0.00536) [2022-07-10 21:36:56,558][26022] Updated weights on worker 0-0, policy_version 901434 (0.00087) [2022-07-10 21:36:56,687][25689] Fps is (10 sec: 5489.5, 60 sec: 5507.5, 300 sec: 5519.2). Total num frames: 923068416. Throughput: 0: 5835.1. Samples: 923071540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:36:56,687][25689] Avg episode reward: [(0, '0.843')] [2022-07-10 21:36:58,119][26022] Updated weights on worker 0-0, policy_version 901444 (0.00094) [2022-07-10 21:37:00,133][26022] Updated weights on worker 0-0, policy_version 901454 (0.00089) [2022-07-10 21:37:01,759][25689] Fps is (10 sec: 5563.3, 60 sec: 5544.3, 300 sec: 5528.6). Total num frames: 923097088. Throughput: 0: 5812.1. Samples: 923104852. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:01,759][25689] Avg episode reward: [(0, '0.160')] [2022-07-10 21:37:02,059][26022] Updated weights on worker 0-0, policy_version 901464 (0.00084) [2022-07-10 21:37:04,151][26022] Updated weights on worker 0-0, policy_version 901474 (0.00447) [2022-07-10 21:37:05,908][26022] Updated weights on worker 0-0, policy_version 901484 (0.00095) [2022-07-10 21:37:06,817][25689] Fps is (10 sec: 5559.4, 60 sec: 5563.9, 300 sec: 5525.0). Total num frames: 923124736. Throughput: 0: 4899.5. Samples: 923120024. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:06,818][25689] Avg episode reward: [(0, '0.055')] [2022-07-10 21:37:07,326][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:37:07,339][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000901492_923127808.pth [2022-07-10 21:37:07,339][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000899548_921137152.pth [2022-07-10 21:37:07,652][26022] Updated weights on worker 0-0, policy_version 901494 (0.00090) [2022-07-10 21:37:09,619][26022] Updated weights on worker 0-0, policy_version 901504 (0.00084) [2022-07-10 21:37:11,506][26022] Updated weights on worker 0-0, policy_version 901514 (0.00094) [2022-07-10 21:37:11,850][25689] Fps is (10 sec: 5378.1, 60 sec: 5546.7, 300 sec: 5523.3). Total num frames: 923151360. Throughput: 0: 5722.4. Samples: 923153260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:11,851][25689] Avg episode reward: [(0, '-0.217')] [2022-07-10 21:37:13,339][26022] Updated weights on worker 0-0, policy_version 901524 (0.00085) [2022-07-10 21:37:15,283][26022] Updated weights on worker 0-0, policy_version 901534 (0.00086) [2022-07-10 21:37:16,979][25689] Fps is (10 sec: 5441.7, 60 sec: 5540.4, 300 sec: 5522.1). Total num frames: 923180032. Throughput: 0: 5678.7. Samples: 923186772. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:16,980][25689] Avg episode reward: [(0, '-0.282')] [2022-07-10 21:37:17,032][26022] Updated weights on worker 0-0, policy_version 901544 (0.00093) [2022-07-10 21:37:18,756][26022] Updated weights on worker 0-0, policy_version 901554 (0.00084) [2022-07-10 21:37:20,881][26022] Updated weights on worker 0-0, policy_version 901564 (0.00081) [2022-07-10 21:37:22,057][25689] Fps is (10 sec: 5818.8, 60 sec: 5587.1, 300 sec: 5531.4). Total num frames: 923210752. Throughput: 0: 4853.4. Samples: 923203368. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:22,058][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 21:37:22,396][26022] Updated weights on worker 0-0, policy_version 901574 (0.00089) [2022-07-10 21:37:24,426][26022] Updated weights on worker 0-0, policy_version 901584 (0.00090) [2022-07-10 21:37:26,088][26022] Updated weights on worker 0-0, policy_version 901594 (0.00099) [2022-07-10 21:37:27,120][25689] Fps is (10 sec: 5553.4, 60 sec: 5531.9, 300 sec: 5519.9). Total num frames: 923236352. Throughput: 0: 5758.7. Samples: 923236942. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:27,122][25689] Avg episode reward: [(0, '-0.503')] [2022-07-10 21:37:27,995][26022] Updated weights on worker 0-0, policy_version 901604 (0.00092) [2022-07-10 21:37:29,816][26022] Updated weights on worker 0-0, policy_version 901614 (0.00092) [2022-07-10 21:37:31,668][26022] Updated weights on worker 0-0, policy_version 901624 (0.00098) [2022-07-10 21:37:32,214][25689] Fps is (10 sec: 5343.1, 60 sec: 5526.6, 300 sec: 5526.9). Total num frames: 923265024. Throughput: 0: 5744.0. Samples: 923270232. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:32,215][25689] Avg episode reward: [(0, '0.442')] [2022-07-10 21:37:33,603][26022] Updated weights on worker 0-0, policy_version 901634 (0.00089) [2022-07-10 21:37:35,368][26022] Updated weights on worker 0-0, policy_version 901644 (0.00092) [2022-07-10 21:37:37,221][26022] Updated weights on worker 0-0, policy_version 901654 (0.00091) [2022-07-10 21:37:37,271][25689] Fps is (10 sec: 5649.1, 60 sec: 5531.4, 300 sec: 5530.2). Total num frames: 923293696. Throughput: 0: 4935.0. Samples: 923286916. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:37,272][25689] Avg episode reward: [(0, '-0.447')] [2022-07-10 21:37:38,986][26022] Updated weights on worker 0-0, policy_version 901664 (0.00083) [2022-07-10 21:37:40,951][26022] Updated weights on worker 0-0, policy_version 901674 (0.00085) [2022-07-10 21:37:42,278][25689] Fps is (10 sec: 5596.5, 60 sec: 5533.0, 300 sec: 5527.3). Total num frames: 923321344. Throughput: 0: 5789.6. Samples: 923320438. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:42,278][25689] Avg episode reward: [(0, '-0.727')] [2022-07-10 21:37:42,769][26022] Updated weights on worker 0-0, policy_version 901684 (0.00269) [2022-07-10 21:37:44,705][26022] Updated weights on worker 0-0, policy_version 901694 (0.00090) [2022-07-10 21:37:46,415][26022] Updated weights on worker 0-0, policy_version 901704 (0.00092) [2022-07-10 21:37:47,288][25689] Fps is (10 sec: 5520.6, 60 sec: 5535.7, 300 sec: 5528.1). Total num frames: 923348992. Throughput: 0: 5800.5. Samples: 923353922. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:47,288][25689] Avg episode reward: [(0, '-0.780')] [2022-07-10 21:37:48,298][26022] Updated weights on worker 0-0, policy_version 901714 (0.00089) [2022-07-10 21:37:50,147][26022] Updated weights on worker 0-0, policy_version 901724 (0.00085) [2022-07-10 21:37:52,011][26022] Updated weights on worker 0-0, policy_version 901734 (0.00087) [2022-07-10 21:37:52,303][25689] Fps is (10 sec: 5515.9, 60 sec: 5536.3, 300 sec: 5526.4). Total num frames: 923376640. Throughput: 0: 4992.5. Samples: 923370524. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:52,303][25689] Avg episode reward: [(0, '-0.460')] [2022-07-10 21:37:53,850][26022] Updated weights on worker 0-0, policy_version 901744 (0.00100) [2022-07-10 21:37:55,752][26022] Updated weights on worker 0-0, policy_version 901754 (0.00088) [2022-07-10 21:37:57,422][25689] Fps is (10 sec: 5557.3, 60 sec: 5547.0, 300 sec: 5531.2). Total num frames: 923405312. Throughput: 0: 5817.5. Samples: 923404142. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:37:57,422][25689] Avg episode reward: [(0, '-0.332')] [2022-07-10 21:37:57,522][26022] Updated weights on worker 0-0, policy_version 901764 (0.00086) [2022-07-10 21:37:59,476][26022] Updated weights on worker 0-0, policy_version 901774 (0.00091) [2022-07-10 21:38:01,072][26022] Updated weights on worker 0-0, policy_version 901784 (0.00084) [2022-07-10 21:38:02,438][25689] Fps is (10 sec: 5657.7, 60 sec: 5552.0, 300 sec: 5534.7). Total num frames: 923433984. Throughput: 0: 5822.3. Samples: 923437818. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:02,439][25689] Avg episode reward: [(0, '-0.255')] [2022-07-10 21:38:03,551][26022] Updated weights on worker 0-0, policy_version 901794 (0.00196) [2022-07-10 21:38:05,093][26022] Updated weights on worker 0-0, policy_version 901804 (0.00083) [2022-07-10 21:38:06,971][26022] Updated weights on worker 0-0, policy_version 901814 (0.00082) [2022-07-10 21:38:07,460][25689] Fps is (10 sec: 5406.8, 60 sec: 5521.7, 300 sec: 5524.2). Total num frames: 923459584. Throughput: 0: 5721.2. Samples: 923469332. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:07,460][25689] Avg episode reward: [(0, '0.188')] [2022-07-10 21:38:08,739][26022] Updated weights on worker 0-0, policy_version 901824 (0.00087) [2022-07-10 21:38:10,731][26022] Updated weights on worker 0-0, policy_version 901834 (0.00943) [2022-07-10 21:38:12,504][25689] Fps is (10 sec: 5289.9, 60 sec: 5537.5, 300 sec: 5528.0). Total num frames: 923487232. Throughput: 0: 5725.0. Samples: 923486178. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:12,505][25689] Avg episode reward: [(0, '0.058')] [2022-07-10 21:38:12,513][26022] Updated weights on worker 0-0, policy_version 901844 (0.00086) [2022-07-10 21:38:14,275][26022] Updated weights on worker 0-0, policy_version 901854 (0.00092) [2022-07-10 21:38:16,220][26022] Updated weights on worker 0-0, policy_version 901864 (0.00090) [2022-07-10 21:38:17,566][25689] Fps is (10 sec: 5674.2, 60 sec: 5560.5, 300 sec: 5530.6). Total num frames: 923516928. Throughput: 0: 5744.4. Samples: 923519856. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:17,566][25689] Avg episode reward: [(0, '-0.123')] [2022-07-10 21:38:17,888][26022] Updated weights on worker 0-0, policy_version 901874 (0.00092) [2022-07-10 21:38:19,812][26022] Updated weights on worker 0-0, policy_version 901884 (0.00088) [2022-07-10 21:38:21,548][26022] Updated weights on worker 0-0, policy_version 901894 (0.00086) [2022-07-10 21:38:22,574][25689] Fps is (10 sec: 5593.0, 60 sec: 5499.3, 300 sec: 5524.1). Total num frames: 923543552. Throughput: 0: 5740.1. Samples: 923553398. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:22,574][25689] Avg episode reward: [(0, '-0.793')] [2022-07-10 21:38:23,507][26022] Updated weights on worker 0-0, policy_version 901904 (0.00092) [2022-07-10 21:38:25,259][26022] Updated weights on worker 0-0, policy_version 901914 (0.00085) [2022-07-10 21:38:27,232][26022] Updated weights on worker 0-0, policy_version 901924 (0.00087) [2022-07-10 21:38:27,604][25689] Fps is (10 sec: 5406.5, 60 sec: 5536.1, 300 sec: 5530.9). Total num frames: 923571200. Throughput: 0: 5005.6. Samples: 923570168. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:27,605][25689] Avg episode reward: [(0, '-0.698')] [2022-07-10 21:38:28,891][26022] Updated weights on worker 0-0, policy_version 901934 (0.00097) [2022-07-10 21:38:30,937][26022] Updated weights on worker 0-0, policy_version 901944 (0.00098) [2022-07-10 21:38:32,633][25689] Fps is (10 sec: 5598.9, 60 sec: 5542.1, 300 sec: 5528.3). Total num frames: 923599872. Throughput: 0: 5826.4. Samples: 923603458. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:32,634][25689] Avg episode reward: [(0, '-0.219')] [2022-07-10 21:38:32,714][26022] Updated weights on worker 0-0, policy_version 901954 (0.00088) [2022-07-10 21:38:34,774][26022] Updated weights on worker 0-0, policy_version 901964 (0.00081) [2022-07-10 21:38:36,249][26022] Updated weights on worker 0-0, policy_version 901974 (0.00090) [2022-07-10 21:38:37,755][25689] Fps is (10 sec: 5548.3, 60 sec: 5519.2, 300 sec: 5522.7). Total num frames: 923627520. Throughput: 0: 5811.6. Samples: 923637188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:37,756][25689] Avg episode reward: [(0, '-0.246')] [2022-07-10 21:38:38,285][26022] Updated weights on worker 0-0, policy_version 901984 (0.00092) [2022-07-10 21:38:40,038][26022] Updated weights on worker 0-0, policy_version 901994 (0.00086) [2022-07-10 21:38:41,742][26022] Updated weights on worker 0-0, policy_version 902004 (0.00090) [2022-07-10 21:38:42,775][25689] Fps is (10 sec: 5654.3, 60 sec: 5551.9, 300 sec: 5536.2). Total num frames: 923657216. Throughput: 0: 4986.5. Samples: 923654130. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:42,775][25689] Avg episode reward: [(0, '-0.603')] [2022-07-10 21:38:43,717][26022] Updated weights on worker 0-0, policy_version 902014 (0.00082) [2022-07-10 21:38:45,332][26022] Updated weights on worker 0-0, policy_version 902024 (0.00086) [2022-07-10 21:38:47,296][26022] Updated weights on worker 0-0, policy_version 902034 (0.00085) [2022-07-10 21:38:47,821][25689] Fps is (10 sec: 5900.3, 60 sec: 5582.4, 300 sec: 5539.3). Total num frames: 923686912. Throughput: 0: 5836.5. Samples: 923688164. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:47,822][25689] Avg episode reward: [(0, '0.706')] [2022-07-10 21:38:49,142][26022] Updated weights on worker 0-0, policy_version 902044 (0.00094) [2022-07-10 21:38:50,836][26022] Updated weights on worker 0-0, policy_version 902054 (0.00096) [2022-07-10 21:38:52,899][25689] Fps is (10 sec: 5461.9, 60 sec: 5542.8, 300 sec: 5529.1). Total num frames: 923712512. Throughput: 0: 5824.5. Samples: 923721494. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:52,899][25689] Avg episode reward: [(0, '0.458')] [2022-07-10 21:38:52,925][26022] Updated weights on worker 0-0, policy_version 902064 (0.00078) [2022-07-10 21:38:54,680][26022] Updated weights on worker 0-0, policy_version 902074 (0.00088) [2022-07-10 21:38:56,642][26022] Updated weights on worker 0-0, policy_version 902084 (0.00093) [2022-07-10 21:38:57,962][25689] Fps is (10 sec: 5452.7, 60 sec: 5564.8, 300 sec: 5533.1). Total num frames: 923742208. Throughput: 0: 4996.9. Samples: 923738160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:38:57,963][25689] Avg episode reward: [(0, '0.004')] [2022-07-10 21:38:58,374][26022] Updated weights on worker 0-0, policy_version 902094 (0.00093) [2022-07-10 21:39:00,269][26022] Updated weights on worker 0-0, policy_version 902104 (0.00091) [2022-07-10 21:39:02,079][26022] Updated weights on worker 0-0, policy_version 902114 (0.00091) [2022-07-10 21:39:03,009][25689] Fps is (10 sec: 5570.6, 60 sec: 5528.2, 300 sec: 5539.4). Total num frames: 923768832. Throughput: 0: 5820.1. Samples: 923771894. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:03,009][25689] Avg episode reward: [(0, '-0.238')] [2022-07-10 21:39:04,257][26022] Updated weights on worker 0-0, policy_version 902124 (0.00079) [2022-07-10 21:39:05,785][26022] Updated weights on worker 0-0, policy_version 902134 (0.00090) [2022-07-10 21:39:07,426][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:39:07,438][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000902141_923792384.pth [2022-07-10 21:39:07,438][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000900195_921799680.pth [2022-07-10 21:39:07,762][26022] Updated weights on worker 0-0, policy_version 902144 (0.00088) [2022-07-10 21:39:08,010][25689] Fps is (10 sec: 5401.1, 60 sec: 5563.9, 300 sec: 5536.8). Total num frames: 923796480. Throughput: 0: 5722.4. Samples: 923803696. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:08,011][25689] Avg episode reward: [(0, '-0.182')] [2022-07-10 21:39:09,327][26022] Updated weights on worker 0-0, policy_version 902154 (0.00094) [2022-07-10 21:39:11,460][26022] Updated weights on worker 0-0, policy_version 902164 (0.00085) [2022-07-10 21:39:13,031][25689] Fps is (10 sec: 5619.7, 60 sec: 5583.0, 300 sec: 5537.5). Total num frames: 923825152. Throughput: 0: 4915.5. Samples: 923820452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:13,031][25689] Avg episode reward: [(0, '0.396')] [2022-07-10 21:39:13,076][26022] Updated weights on worker 0-0, policy_version 902174 (0.00092) [2022-07-10 21:39:15,050][26022] Updated weights on worker 0-0, policy_version 902184 (0.00090) [2022-07-10 21:39:16,906][26022] Updated weights on worker 0-0, policy_version 902194 (0.00093) [2022-07-10 21:39:18,079][25689] Fps is (10 sec: 5593.9, 60 sec: 5550.4, 300 sec: 5540.5). Total num frames: 923852800. Throughput: 0: 5752.3. Samples: 923853876. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:18,079][25689] Avg episode reward: [(0, '-0.593')] [2022-07-10 21:39:18,994][26022] Updated weights on worker 0-0, policy_version 902204 (0.00080) [2022-07-10 21:39:20,425][26022] Updated weights on worker 0-0, policy_version 902214 (0.00084) [2022-07-10 21:39:22,655][26022] Updated weights on worker 0-0, policy_version 902224 (0.00091) [2022-07-10 21:39:23,083][25689] Fps is (10 sec: 5398.9, 60 sec: 5550.8, 300 sec: 5533.9). Total num frames: 923879424. Throughput: 0: 5749.8. Samples: 923887316. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:23,084][25689] Avg episode reward: [(0, '0.014')] [2022-07-10 21:39:24,219][26022] Updated weights on worker 0-0, policy_version 902234 (0.00086) [2022-07-10 21:39:26,327][26022] Updated weights on worker 0-0, policy_version 902244 (0.00084) [2022-07-10 21:39:27,845][26022] Updated weights on worker 0-0, policy_version 902254 (0.00086) [2022-07-10 21:39:28,088][25689] Fps is (10 sec: 5626.5, 60 sec: 5586.9, 300 sec: 5541.1). Total num frames: 923909120. Throughput: 0: 5007.2. Samples: 923904230. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:28,089][25689] Avg episode reward: [(0, '0.167')] [2022-07-10 21:39:29,706][26022] Updated weights on worker 0-0, policy_version 902264 (0.00086) [2022-07-10 21:39:31,733][26022] Updated weights on worker 0-0, policy_version 902274 (0.00085) [2022-07-10 21:39:33,091][25689] Fps is (10 sec: 5730.1, 60 sec: 5572.4, 300 sec: 5542.3). Total num frames: 923936768. Throughput: 0: 5851.4. Samples: 923937832. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:33,091][25689] Avg episode reward: [(0, '0.351')] [2022-07-10 21:39:33,400][26022] Updated weights on worker 0-0, policy_version 902284 (0.00086) [2022-07-10 21:39:35,206][26022] Updated weights on worker 0-0, policy_version 902294 (0.00082) [2022-07-10 21:39:37,126][26022] Updated weights on worker 0-0, policy_version 902304 (0.00088) [2022-07-10 21:39:38,173][25689] Fps is (10 sec: 5483.2, 60 sec: 5576.1, 300 sec: 5540.8). Total num frames: 923964416. Throughput: 0: 5851.8. Samples: 923971464. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:38,174][25689] Avg episode reward: [(0, '0.693')] [2022-07-10 21:39:38,847][26022] Updated weights on worker 0-0, policy_version 902314 (0.00092) [2022-07-10 21:39:40,721][26022] Updated weights on worker 0-0, policy_version 902324 (0.00087) [2022-07-10 21:39:42,494][26022] Updated weights on worker 0-0, policy_version 902334 (0.00091) [2022-07-10 21:39:43,186][25689] Fps is (10 sec: 5578.6, 60 sec: 5559.7, 300 sec: 5540.6). Total num frames: 923993088. Throughput: 0: 5027.9. Samples: 923988394. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:43,187][25689] Avg episode reward: [(0, '0.434')] [2022-07-10 21:39:44,311][26022] Updated weights on worker 0-0, policy_version 902344 (0.00097) [2022-07-10 21:39:45,951][26022] Updated weights on worker 0-0, policy_version 902354 (0.00090) [2022-07-10 21:39:47,966][26022] Updated weights on worker 0-0, policy_version 902364 (0.00093) [2022-07-10 21:39:48,211][25689] Fps is (10 sec: 5610.9, 60 sec: 5527.8, 300 sec: 5540.2). Total num frames: 924020736. Throughput: 0: 5858.6. Samples: 924022118. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:48,211][25689] Avg episode reward: [(0, '1.053')] [2022-07-10 21:39:49,739][26022] Updated weights on worker 0-0, policy_version 902374 (0.00081) [2022-07-10 21:39:51,707][26022] Updated weights on worker 0-0, policy_version 902384 (0.00087) [2022-07-10 21:39:53,227][25689] Fps is (10 sec: 5711.5, 60 sec: 5601.4, 300 sec: 5544.6). Total num frames: 924050432. Throughput: 0: 5869.8. Samples: 924056026. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:53,227][25689] Avg episode reward: [(0, '1.005')] [2022-07-10 21:39:53,283][26022] Updated weights on worker 0-0, policy_version 902394 (0.00090) [2022-07-10 21:39:55,240][26022] Updated weights on worker 0-0, policy_version 902404 (0.00088) [2022-07-10 21:39:57,104][26022] Updated weights on worker 0-0, policy_version 902414 (0.00082) [2022-07-10 21:39:58,276][25689] Fps is (10 sec: 5697.2, 60 sec: 5568.7, 300 sec: 5547.7). Total num frames: 924078080. Throughput: 0: 5044.4. Samples: 924072870. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:39:58,276][25689] Avg episode reward: [(0, '0.987')] [2022-07-10 21:39:58,938][26022] Updated weights on worker 0-0, policy_version 902424 (0.00087) [2022-07-10 21:40:00,721][26022] Updated weights on worker 0-0, policy_version 902434 (0.00092) [2022-07-10 21:40:03,044][26022] Updated weights on worker 0-0, policy_version 902444 (0.00111) [2022-07-10 21:40:03,287][25689] Fps is (10 sec: 5394.8, 60 sec: 5572.1, 300 sec: 5551.0). Total num frames: 924104704. Throughput: 0: 5877.2. Samples: 924106528. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:03,287][25689] Avg episode reward: [(0, '1.051')] [2022-07-10 21:40:04,935][26022] Updated weights on worker 0-0, policy_version 902454 (0.00088) [2022-07-10 21:40:06,571][26022] Updated weights on worker 0-0, policy_version 902464 (0.00088) [2022-07-10 21:40:08,296][25689] Fps is (10 sec: 5416.3, 60 sec: 5571.3, 300 sec: 5544.8). Total num frames: 924132352. Throughput: 0: 5783.0. Samples: 924138272. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:08,297][25689] Avg episode reward: [(0, '0.661')] [2022-07-10 21:40:08,344][26022] Updated weights on worker 0-0, policy_version 902474 (0.00085) [2022-07-10 21:40:10,291][26022] Updated weights on worker 0-0, policy_version 902484 (0.00086) [2022-07-10 21:40:12,060][26022] Updated weights on worker 0-0, policy_version 902494 (0.00085) [2022-07-10 21:40:13,387][25689] Fps is (10 sec: 5576.3, 60 sec: 5564.8, 300 sec: 5547.6). Total num frames: 924161024. Throughput: 0: 4927.0. Samples: 924155354. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:13,387][25689] Avg episode reward: [(0, '0.929')] [2022-07-10 21:40:14,015][26022] Updated weights on worker 0-0, policy_version 902504 (0.00080) [2022-07-10 21:40:15,737][26022] Updated weights on worker 0-0, policy_version 902514 (0.00087) [2022-07-10 21:40:17,608][26022] Updated weights on worker 0-0, policy_version 902524 (0.00839) [2022-07-10 21:40:18,485][25689] Fps is (10 sec: 5628.2, 60 sec: 5577.2, 300 sec: 5546.9). Total num frames: 924189696. Throughput: 0: 5743.4. Samples: 924188938. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:18,485][25689] Avg episode reward: [(0, '0.619')] [2022-07-10 21:40:19,399][26022] Updated weights on worker 0-0, policy_version 902534 (0.00095) [2022-07-10 21:40:21,235][26022] Updated weights on worker 0-0, policy_version 902544 (0.00083) [2022-07-10 21:40:23,100][26022] Updated weights on worker 0-0, policy_version 902554 (0.00090) [2022-07-10 21:40:23,495][25689] Fps is (10 sec: 5571.5, 60 sec: 5593.6, 300 sec: 5550.4). Total num frames: 924217344. Throughput: 0: 5719.2. Samples: 924222104. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:23,496][25689] Avg episode reward: [(0, '0.357')] [2022-07-10 21:40:25,069][26022] Updated weights on worker 0-0, policy_version 902564 (0.00087) [2022-07-10 21:40:26,949][26022] Updated weights on worker 0-0, policy_version 902574 (0.00093) [2022-07-10 21:40:28,514][25689] Fps is (10 sec: 5513.4, 60 sec: 5558.4, 300 sec: 5546.9). Total num frames: 924244992. Throughput: 0: 4978.1. Samples: 924238920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:28,515][25689] Avg episode reward: [(0, '0.281')] [2022-07-10 21:40:28,770][26022] Updated weights on worker 0-0, policy_version 902584 (0.00093) [2022-07-10 21:40:30,539][26022] Updated weights on worker 0-0, policy_version 902594 (0.00088) [2022-07-10 21:40:32,479][26022] Updated weights on worker 0-0, policy_version 902604 (0.00093) [2022-07-10 21:40:33,533][25689] Fps is (10 sec: 5406.8, 60 sec: 5540.0, 300 sec: 5550.9). Total num frames: 924271616. Throughput: 0: 5786.2. Samples: 924271924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:40:33,533][25689] Avg episode reward: [(0, '0.263')] [2022-07-10 21:40:34,220][26022] Updated weights on worker 0-0, policy_version 902614 (0.00092) [2022-07-10 21:40:36,387][26022] Updated weights on worker 0-0, policy_version 902624 (0.00087) [2022-07-10 21:40:37,791][26022] Updated weights on worker 0-0, policy_version 902634 (0.00085) [2022-07-10 21:40:38,592][25689] Fps is (10 sec: 5588.3, 60 sec: 5576.0, 300 sec: 5546.7). Total num frames: 924301312. Throughput: 0: 5796.7. Samples: 924305496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:40:38,594][25689] Avg episode reward: [(0, '0.762')] [2022-07-10 21:40:39,949][26022] Updated weights on worker 0-0, policy_version 902644 (0.00085) [2022-07-10 21:40:41,357][26022] Updated weights on worker 0-0, policy_version 902654 (0.00089) [2022-07-10 21:40:43,528][26022] Updated weights on worker 0-0, policy_version 902664 (0.00089) [2022-07-10 21:40:43,666][25689] Fps is (10 sec: 5558.2, 60 sec: 5536.6, 300 sec: 5542.4). Total num frames: 924327936. Throughput: 0: 4965.1. Samples: 924322254. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:40:43,667][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 21:40:45,151][26022] Updated weights on worker 0-0, policy_version 902674 (0.00100) [2022-07-10 21:40:47,132][26022] Updated weights on worker 0-0, policy_version 902684 (0.00088) [2022-07-10 21:40:48,694][25689] Fps is (10 sec: 5575.3, 60 sec: 5570.1, 300 sec: 5552.4). Total num frames: 924357632. Throughput: 0: 5816.4. Samples: 924356294. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:40:48,695][25689] Avg episode reward: [(0, '0.232')] [2022-07-10 21:40:48,759][26022] Updated weights on worker 0-0, policy_version 902694 (0.00089) [2022-07-10 21:40:50,812][26022] Updated weights on worker 0-0, policy_version 902704 (0.00092) [2022-07-10 21:40:52,389][26022] Updated weights on worker 0-0, policy_version 902714 (0.00088) [2022-07-10 21:40:53,735][25689] Fps is (10 sec: 5695.3, 60 sec: 5534.0, 300 sec: 5545.5). Total num frames: 924385280. Throughput: 0: 5839.1. Samples: 924389884. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:40:53,735][25689] Avg episode reward: [(0, '-0.341')] [2022-07-10 21:40:54,457][26022] Updated weights on worker 0-0, policy_version 902724 (0.00098) [2022-07-10 21:40:55,950][26022] Updated weights on worker 0-0, policy_version 902734 (0.00092) [2022-07-10 21:40:58,223][26022] Updated weights on worker 0-0, policy_version 902744 (0.00081) [2022-07-10 21:40:58,835][25689] Fps is (10 sec: 5553.8, 60 sec: 5546.2, 300 sec: 5552.5). Total num frames: 924413952. Throughput: 0: 5816.3. Samples: 924423234. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:40:58,837][25689] Avg episode reward: [(0, '-0.446')] [2022-07-10 21:40:59,799][26022] Updated weights on worker 0-0, policy_version 902754 (0.00091) [2022-07-10 21:41:02,250][26022] Updated weights on worker 0-0, policy_version 902764 (0.00090) [2022-07-10 21:41:03,783][26022] Updated weights on worker 0-0, policy_version 902774 (0.00097) [2022-07-10 21:41:03,851][25689] Fps is (10 sec: 5567.1, 60 sec: 5562.6, 300 sec: 5557.3). Total num frames: 924441600. Throughput: 0: 5778.2. Samples: 924438888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:03,852][25689] Avg episode reward: [(0, '-0.452')] [2022-07-10 21:41:05,971][26022] Updated weights on worker 0-0, policy_version 902784 (0.00087) [2022-07-10 21:41:07,453][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:41:07,478][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000902794_924461056.pth [2022-07-10 21:41:07,478][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000900843_922463232.pth [2022-07-10 21:41:07,481][26022] Updated weights on worker 0-0, policy_version 902794 (0.00091) [2022-07-10 21:41:08,868][25689] Fps is (10 sec: 5307.0, 60 sec: 5528.1, 300 sec: 5550.6). Total num frames: 924467200. Throughput: 0: 5718.6. Samples: 924471664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:08,870][25689] Avg episode reward: [(0, '-0.161')] [2022-07-10 21:41:09,612][26022] Updated weights on worker 0-0, policy_version 902804 (0.00611) [2022-07-10 21:41:11,194][26022] Updated weights on worker 0-0, policy_version 902814 (0.00092) [2022-07-10 21:41:13,119][26022] Updated weights on worker 0-0, policy_version 902824 (0.00062) [2022-07-10 21:41:13,883][25689] Fps is (10 sec: 5410.0, 60 sec: 5535.0, 300 sec: 5551.5). Total num frames: 924495872. Throughput: 0: 5730.3. Samples: 924505340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:13,885][25689] Avg episode reward: [(0, '0.108')] [2022-07-10 21:41:14,785][26022] Updated weights on worker 0-0, policy_version 902834 (0.00087) [2022-07-10 21:41:16,520][26022] Updated weights on worker 0-0, policy_version 902844 (0.00092) [2022-07-10 21:41:18,634][26022] Updated weights on worker 0-0, policy_version 902854 (0.00092) [2022-07-10 21:41:18,991][25689] Fps is (10 sec: 5563.6, 60 sec: 5517.2, 300 sec: 5550.1). Total num frames: 924523520. Throughput: 0: 4902.3. Samples: 924522046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:18,992][25689] Avg episode reward: [(0, '-0.084')] [2022-07-10 21:41:20,274][26022] Updated weights on worker 0-0, policy_version 902864 (0.00088) [2022-07-10 21:41:22,248][26022] Updated weights on worker 0-0, policy_version 902874 (0.00629) [2022-07-10 21:41:23,930][26022] Updated weights on worker 0-0, policy_version 902884 (0.00098) [2022-07-10 21:41:24,087][25689] Fps is (10 sec: 5619.8, 60 sec: 5543.2, 300 sec: 5552.0). Total num frames: 924553216. Throughput: 0: 5785.0. Samples: 924555952. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:24,087][25689] Avg episode reward: [(0, '0.599')] [2022-07-10 21:41:25,866][26022] Updated weights on worker 0-0, policy_version 902894 (0.00086) [2022-07-10 21:41:27,893][26022] Updated weights on worker 0-0, policy_version 902904 (0.00089) [2022-07-10 21:41:29,122][25689] Fps is (10 sec: 5660.2, 60 sec: 5541.7, 300 sec: 5548.6). Total num frames: 924580864. Throughput: 0: 5806.3. Samples: 924589264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:29,123][25689] Avg episode reward: [(0, '0.696')] [2022-07-10 21:41:29,449][26022] Updated weights on worker 0-0, policy_version 902914 (0.00088) [2022-07-10 21:41:31,532][26022] Updated weights on worker 0-0, policy_version 902924 (0.00088) [2022-07-10 21:41:33,110][26022] Updated weights on worker 0-0, policy_version 902934 (0.00087) [2022-07-10 21:41:34,135][25689] Fps is (10 sec: 5503.0, 60 sec: 5559.1, 300 sec: 5546.9). Total num frames: 924608512. Throughput: 0: 4971.8. Samples: 924606030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:34,136][25689] Avg episode reward: [(0, '-0.256')] [2022-07-10 21:41:35,337][26022] Updated weights on worker 0-0, policy_version 902944 (0.00084) [2022-07-10 21:41:36,851][26022] Updated weights on worker 0-0, policy_version 902954 (0.00086) [2022-07-10 21:41:38,839][26022] Updated weights on worker 0-0, policy_version 902964 (0.00087) [2022-07-10 21:41:39,207][25689] Fps is (10 sec: 5483.1, 60 sec: 5524.2, 300 sec: 5546.1). Total num frames: 924636160. Throughput: 0: 5816.2. Samples: 924639626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:39,208][25689] Avg episode reward: [(0, '-0.052')] [2022-07-10 21:41:40,444][26022] Updated weights on worker 0-0, policy_version 902974 (0.00095) [2022-07-10 21:41:42,535][26022] Updated weights on worker 0-0, policy_version 902984 (0.00088) [2022-07-10 21:41:44,065][26022] Updated weights on worker 0-0, policy_version 902994 (0.00087) [2022-07-10 21:41:44,229][25689] Fps is (10 sec: 5782.7, 60 sec: 5596.5, 300 sec: 5556.7). Total num frames: 924666880. Throughput: 0: 5832.8. Samples: 924673436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:44,229][25689] Avg episode reward: [(0, '-0.649')] [2022-07-10 21:41:46,075][26022] Updated weights on worker 0-0, policy_version 903004 (0.00094) [2022-07-10 21:41:47,760][26022] Updated weights on worker 0-0, policy_version 903014 (0.00093) [2022-07-10 21:41:49,319][25689] Fps is (10 sec: 5670.9, 60 sec: 5540.2, 300 sec: 5552.0). Total num frames: 924693504. Throughput: 0: 4999.0. Samples: 924690228. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:49,320][25689] Avg episode reward: [(0, '-0.208')] [2022-07-10 21:41:49,882][26022] Updated weights on worker 0-0, policy_version 903024 (0.00084) [2022-07-10 21:41:51,395][26022] Updated weights on worker 0-0, policy_version 903034 (0.00089) [2022-07-10 21:41:53,424][26022] Updated weights on worker 0-0, policy_version 903044 (0.00090) [2022-07-10 21:41:54,379][25689] Fps is (10 sec: 5448.0, 60 sec: 5555.3, 300 sec: 5555.3). Total num frames: 924722176. Throughput: 0: 5825.5. Samples: 924723958. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:54,379][25689] Avg episode reward: [(0, '-1.284')] [2022-07-10 21:41:55,132][26022] Updated weights on worker 0-0, policy_version 903054 (0.00091) [2022-07-10 21:41:57,150][26022] Updated weights on worker 0-0, policy_version 903064 (0.00088) [2022-07-10 21:41:58,859][26022] Updated weights on worker 0-0, policy_version 903074 (0.00089) [2022-07-10 21:41:59,451][25689] Fps is (10 sec: 5558.8, 60 sec: 5541.0, 300 sec: 5551.8). Total num frames: 924749824. Throughput: 0: 5821.9. Samples: 924757482. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:41:59,451][25689] Avg episode reward: [(0, '-0.433')] [2022-07-10 21:42:00,684][26022] Updated weights on worker 0-0, policy_version 903084 (0.00090) [2022-07-10 21:42:02,846][26022] Updated weights on worker 0-0, policy_version 903094 (0.00087) [2022-07-10 21:42:04,510][25689] Fps is (10 sec: 5357.0, 60 sec: 5520.2, 300 sec: 5548.4). Total num frames: 924776448. Throughput: 0: 4875.7. Samples: 924772320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:04,510][25689] Avg episode reward: [(0, '-0.655')] [2022-07-10 21:42:04,852][26022] Updated weights on worker 0-0, policy_version 903104 (0.00088) [2022-07-10 21:42:06,709][26022] Updated weights on worker 0-0, policy_version 903114 (0.00090) [2022-07-10 21:42:08,179][26022] Updated weights on worker 0-0, policy_version 903124 (0.00095) [2022-07-10 21:42:09,532][25689] Fps is (10 sec: 5383.6, 60 sec: 5553.5, 300 sec: 5552.0). Total num frames: 924804096. Throughput: 0: 5714.1. Samples: 924805726. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:09,532][25689] Avg episode reward: [(0, '-0.704')] [2022-07-10 21:42:10,413][26022] Updated weights on worker 0-0, policy_version 903134 (0.00081) [2022-07-10 21:42:12,103][26022] Updated weights on worker 0-0, policy_version 903144 (0.00083) [2022-07-10 21:42:13,974][26022] Updated weights on worker 0-0, policy_version 903154 (0.00086) [2022-07-10 21:42:14,568][25689] Fps is (10 sec: 5701.4, 60 sec: 5568.5, 300 sec: 5557.2). Total num frames: 924833792. Throughput: 0: 5731.0. Samples: 924839662. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:14,568][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 21:42:15,774][26022] Updated weights on worker 0-0, policy_version 903164 (0.00087) [2022-07-10 21:42:17,374][26022] Updated weights on worker 0-0, policy_version 903174 (0.00082) [2022-07-10 21:42:19,539][26022] Updated weights on worker 0-0, policy_version 903184 (0.00093) [2022-07-10 21:42:19,607][25689] Fps is (10 sec: 5590.1, 60 sec: 5557.9, 300 sec: 5544.2). Total num frames: 924860416. Throughput: 0: 4912.8. Samples: 924856504. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:19,607][25689] Avg episode reward: [(0, '0.134')] [2022-07-10 21:42:20,935][26022] Updated weights on worker 0-0, policy_version 903194 (0.00084) [2022-07-10 21:42:23,108][26022] Updated weights on worker 0-0, policy_version 903204 (0.00090) [2022-07-10 21:42:24,660][25689] Fps is (10 sec: 5681.8, 60 sec: 5578.7, 300 sec: 5561.6). Total num frames: 924891136. Throughput: 0: 5851.8. Samples: 924890238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:24,662][25689] Avg episode reward: [(0, '-0.080')] [2022-07-10 21:42:24,664][26022] Updated weights on worker 0-0, policy_version 903214 (0.00092) [2022-07-10 21:42:26,804][26022] Updated weights on worker 0-0, policy_version 903224 (0.00087) [2022-07-10 21:42:28,623][26022] Updated weights on worker 0-0, policy_version 903234 (0.00088) [2022-07-10 21:42:29,747][25689] Fps is (10 sec: 5655.0, 60 sec: 5557.1, 300 sec: 5554.8). Total num frames: 924917760. Throughput: 0: 5831.5. Samples: 924923612. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:29,748][25689] Avg episode reward: [(0, '-0.382')] [2022-07-10 21:42:30,420][26022] Updated weights on worker 0-0, policy_version 903244 (0.00087) [2022-07-10 21:42:32,215][26022] Updated weights on worker 0-0, policy_version 903254 (0.00097) [2022-07-10 21:42:34,221][26022] Updated weights on worker 0-0, policy_version 903264 (0.00090) [2022-07-10 21:42:34,844][25689] Fps is (10 sec: 5429.6, 60 sec: 5566.2, 300 sec: 5554.0). Total num frames: 924946432. Throughput: 0: 4968.5. Samples: 924940406. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:34,846][25689] Avg episode reward: [(0, '-0.752')] [2022-07-10 21:42:35,796][26022] Updated weights on worker 0-0, policy_version 903274 (0.00087) [2022-07-10 21:42:37,894][26022] Updated weights on worker 0-0, policy_version 903284 (0.00089) [2022-07-10 21:42:39,595][26022] Updated weights on worker 0-0, policy_version 903294 (0.00085) [2022-07-10 21:42:39,936][25689] Fps is (10 sec: 5628.1, 60 sec: 5581.3, 300 sec: 5555.9). Total num frames: 924975104. Throughput: 0: 5766.1. Samples: 924973724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:39,937][25689] Avg episode reward: [(0, '-0.723')] [2022-07-10 21:42:41,439][26022] Updated weights on worker 0-0, policy_version 903304 (0.00088) [2022-07-10 21:42:43,277][26022] Updated weights on worker 0-0, policy_version 903314 (0.00087) [2022-07-10 21:42:45,021][25689] Fps is (10 sec: 5635.0, 60 sec: 5541.8, 300 sec: 5557.9). Total num frames: 925003776. Throughput: 0: 5755.5. Samples: 925007424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:45,021][25689] Avg episode reward: [(0, '-0.766')] [2022-07-10 21:42:45,021][26022] Updated weights on worker 0-0, policy_version 903324 (0.00084) [2022-07-10 21:42:46,871][26022] Updated weights on worker 0-0, policy_version 903334 (0.00097) [2022-07-10 21:42:48,853][26022] Updated weights on worker 0-0, policy_version 903344 (0.00084) [2022-07-10 21:42:50,071][25689] Fps is (10 sec: 5658.0, 60 sec: 5579.1, 300 sec: 5560.7). Total num frames: 925032448. Throughput: 0: 5778.6. Samples: 925041056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:50,072][25689] Avg episode reward: [(0, '-0.598')] [2022-07-10 21:42:50,440][26022] Updated weights on worker 0-0, policy_version 903354 (0.00090) [2022-07-10 21:42:52,504][26022] Updated weights on worker 0-0, policy_version 903364 (0.00089) [2022-07-10 21:42:54,134][26022] Updated weights on worker 0-0, policy_version 903374 (0.00093) [2022-07-10 21:42:55,107][25689] Fps is (10 sec: 5482.4, 60 sec: 5547.6, 300 sec: 5555.4). Total num frames: 925059072. Throughput: 0: 5796.1. Samples: 925057850. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:42:55,107][25689] Avg episode reward: [(0, '0.225')] [2022-07-10 21:42:56,156][26022] Updated weights on worker 0-0, policy_version 903384 (0.00095) [2022-07-10 21:42:58,135][26022] Updated weights on worker 0-0, policy_version 903394 (0.00403) [2022-07-10 21:42:59,779][26022] Updated weights on worker 0-0, policy_version 903404 (0.00082) [2022-07-10 21:43:00,178][25689] Fps is (10 sec: 5470.9, 60 sec: 5564.5, 300 sec: 5554.4). Total num frames: 925087744. Throughput: 0: 5795.3. Samples: 925091034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:00,179][25689] Avg episode reward: [(0, '0.386')] [2022-07-10 21:43:01,527][26022] Updated weights on worker 0-0, policy_version 903414 (0.00092) [2022-07-10 21:43:03,861][26022] Updated weights on worker 0-0, policy_version 903424 (0.00093) [2022-07-10 21:43:05,198][25689] Fps is (10 sec: 5479.9, 60 sec: 5568.2, 300 sec: 5557.9). Total num frames: 925114368. Throughput: 0: 5716.7. Samples: 925122768. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:05,198][25689] Avg episode reward: [(0, '0.700')] [2022-07-10 21:43:05,639][26022] Updated weights on worker 0-0, policy_version 903434 (0.00091) [2022-07-10 21:43:07,531][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:43:07,542][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000903443_925125632.pth [2022-07-10 21:43:07,542][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000901492_923127808.pth [2022-07-10 21:43:07,769][26022] Updated weights on worker 0-0, policy_version 903444 (0.00086) [2022-07-10 21:43:09,305][26022] Updated weights on worker 0-0, policy_version 903454 (0.00058) [2022-07-10 21:43:10,200][25689] Fps is (10 sec: 5313.2, 60 sec: 5553.1, 300 sec: 5555.2). Total num frames: 925140992. Throughput: 0: 4891.5. Samples: 925139518. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:10,201][25689] Avg episode reward: [(0, '0.224')] [2022-07-10 21:43:11,304][26022] Updated weights on worker 0-0, policy_version 903464 (0.00100) [2022-07-10 21:43:12,955][26022] Updated weights on worker 0-0, policy_version 903474 (0.00097) [2022-07-10 21:43:15,047][26022] Updated weights on worker 0-0, policy_version 903484 (0.00085) [2022-07-10 21:43:15,267][25689] Fps is (10 sec: 5389.7, 60 sec: 5516.5, 300 sec: 5548.2). Total num frames: 925168640. Throughput: 0: 5710.2. Samples: 925172970. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:15,268][25689] Avg episode reward: [(0, '0.461')] [2022-07-10 21:43:16,805][26022] Updated weights on worker 0-0, policy_version 903494 (0.00094) [2022-07-10 21:43:18,574][26022] Updated weights on worker 0-0, policy_version 903504 (0.00571) [2022-07-10 21:43:20,334][25689] Fps is (10 sec: 5658.9, 60 sec: 5564.6, 300 sec: 5557.4). Total num frames: 925198336. Throughput: 0: 5721.4. Samples: 925206350. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:20,335][25689] Avg episode reward: [(0, '0.931')] [2022-07-10 21:43:20,339][26022] Updated weights on worker 0-0, policy_version 903514 (0.00092) [2022-07-10 21:43:22,346][26022] Updated weights on worker 0-0, policy_version 903524 (0.00072) [2022-07-10 21:43:24,108][26022] Updated weights on worker 0-0, policy_version 903534 (0.00088) [2022-07-10 21:43:25,391][25689] Fps is (10 sec: 5563.2, 60 sec: 5496.8, 300 sec: 5553.5). Total num frames: 925224960. Throughput: 0: 4965.6. Samples: 925223040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:25,391][25689] Avg episode reward: [(0, '0.681')] [2022-07-10 21:43:26,056][26022] Updated weights on worker 0-0, policy_version 903544 (0.00080) [2022-07-10 21:43:27,764][26022] Updated weights on worker 0-0, policy_version 903554 (0.00090) [2022-07-10 21:43:29,903][26022] Updated weights on worker 0-0, policy_version 903564 (0.00088) [2022-07-10 21:43:30,441][25689] Fps is (10 sec: 5470.7, 60 sec: 5533.8, 300 sec: 5553.1). Total num frames: 925253632. Throughput: 0: 5758.9. Samples: 925256084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:30,443][25689] Avg episode reward: [(0, '0.678')] [2022-07-10 21:43:31,472][26022] Updated weights on worker 0-0, policy_version 903574 (0.00088) [2022-07-10 21:43:33,560][26022] Updated weights on worker 0-0, policy_version 903584 (0.00088) [2022-07-10 21:43:34,952][26022] Updated weights on worker 0-0, policy_version 903594 (0.00094) [2022-07-10 21:43:35,461][25689] Fps is (10 sec: 5593.0, 60 sec: 5524.1, 300 sec: 5555.0). Total num frames: 925281280. Throughput: 0: 5764.7. Samples: 925289378. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:35,461][25689] Avg episode reward: [(0, '0.022')] [2022-07-10 21:43:37,267][26022] Updated weights on worker 0-0, policy_version 903604 (0.00090) [2022-07-10 21:43:38,873][26022] Updated weights on worker 0-0, policy_version 903614 (0.00092) [2022-07-10 21:43:40,532][25689] Fps is (10 sec: 5480.1, 60 sec: 5509.0, 300 sec: 5547.2). Total num frames: 925308928. Throughput: 0: 4929.6. Samples: 925305918. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:40,532][25689] Avg episode reward: [(0, '0.080')] [2022-07-10 21:43:40,806][26022] Updated weights on worker 0-0, policy_version 903624 (0.00089) [2022-07-10 21:43:42,556][26022] Updated weights on worker 0-0, policy_version 903634 (0.00085) [2022-07-10 21:43:44,468][26022] Updated weights on worker 0-0, policy_version 903644 (0.00084) [2022-07-10 21:43:45,571][25689] Fps is (10 sec: 5671.5, 60 sec: 5530.1, 300 sec: 5547.3). Total num frames: 925338624. Throughput: 0: 5770.7. Samples: 925339496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:45,572][25689] Avg episode reward: [(0, '-0.473')] [2022-07-10 21:43:46,341][26022] Updated weights on worker 0-0, policy_version 903654 (0.00095) [2022-07-10 21:43:48,099][26022] Updated weights on worker 0-0, policy_version 903664 (0.00411) [2022-07-10 21:43:49,876][26022] Updated weights on worker 0-0, policy_version 903674 (0.00092) [2022-07-10 21:43:50,599][25689] Fps is (10 sec: 5594.1, 60 sec: 5498.3, 300 sec: 5551.7). Total num frames: 925365248. Throughput: 0: 5813.5. Samples: 925373272. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:50,600][25689] Avg episode reward: [(0, '-0.364')] [2022-07-10 21:43:51,856][26022] Updated weights on worker 0-0, policy_version 903684 (0.00095) [2022-07-10 21:43:53,737][26022] Updated weights on worker 0-0, policy_version 903694 (0.00088) [2022-07-10 21:43:55,427][26022] Updated weights on worker 0-0, policy_version 903704 (0.00104) [2022-07-10 21:43:55,628][25689] Fps is (10 sec: 5396.6, 60 sec: 5515.8, 300 sec: 5545.4). Total num frames: 925392896. Throughput: 0: 4973.2. Samples: 925389676. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:43:55,630][25689] Avg episode reward: [(0, '-0.435')] [2022-07-10 21:43:57,240][26022] Updated weights on worker 0-0, policy_version 903714 (0.00085) [2022-07-10 21:43:59,204][26022] Updated weights on worker 0-0, policy_version 903724 (0.00085) [2022-07-10 21:44:00,689][25689] Fps is (10 sec: 5581.8, 60 sec: 5516.8, 300 sec: 5552.0). Total num frames: 925421568. Throughput: 0: 5806.1. Samples: 925422956. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:44:00,690][25689] Avg episode reward: [(0, '-0.466')] [2022-07-10 21:44:01,015][26022] Updated weights on worker 0-0, policy_version 903734 (0.00091) [2022-07-10 21:44:03,284][26022] Updated weights on worker 0-0, policy_version 903744 (0.00093) [2022-07-10 21:44:04,950][26022] Updated weights on worker 0-0, policy_version 903754 (0.00083) [2022-07-10 21:44:05,778][25689] Fps is (10 sec: 5347.3, 60 sec: 5493.6, 300 sec: 5543.6). Total num frames: 925447168. Throughput: 0: 5693.8. Samples: 925454546. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:44:05,778][25689] Avg episode reward: [(0, '0.420')] [2022-07-10 21:44:07,007][26022] Updated weights on worker 0-0, policy_version 903764 (0.00094) [2022-07-10 21:44:08,667][26022] Updated weights on worker 0-0, policy_version 903774 (0.00089) [2022-07-10 21:44:10,762][26022] Updated weights on worker 0-0, policy_version 903784 (0.00085) [2022-07-10 21:44:10,785][25689] Fps is (10 sec: 5274.1, 60 sec: 5510.0, 300 sec: 5540.3). Total num frames: 925474816. Throughput: 0: 4853.8. Samples: 925471252. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:44:10,786][25689] Avg episode reward: [(0, '0.505')] [2022-07-10 21:44:12,303][26022] Updated weights on worker 0-0, policy_version 903794 (0.00092) [2022-07-10 21:44:14,248][26022] Updated weights on worker 0-0, policy_version 903804 (0.00090) [2022-07-10 21:44:15,883][25689] Fps is (10 sec: 5674.7, 60 sec: 5541.0, 300 sec: 5546.3). Total num frames: 925504512. Throughput: 0: 5702.6. Samples: 925505180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:44:15,883][25689] Avg episode reward: [(0, '0.273')] [2022-07-10 21:44:15,938][26022] Updated weights on worker 0-0, policy_version 903814 (0.00099) [2022-07-10 21:44:17,689][26022] Updated weights on worker 0-0, policy_version 903824 (0.00080) [2022-07-10 21:44:19,506][26022] Updated weights on worker 0-0, policy_version 903834 (0.00085) [2022-07-10 21:44:20,961][25689] Fps is (10 sec: 5736.0, 60 sec: 5523.0, 300 sec: 5551.8). Total num frames: 925533184. Throughput: 0: 5734.3. Samples: 925539200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-10 21:44:20,962][25689] Avg episode reward: [(0, '0.478')] [2022-07-10 21:44:21,483][26022] Updated weights on worker 0-0, policy_version 903844 (0.00085) [2022-07-10 21:44:23,106][26022] Updated weights on worker 0-0, policy_version 903854 (0.00094) [2022-07-10 21:44:25,175][26022] Updated weights on worker 0-0, policy_version 903864 (0.00084) [2022-07-10 21:44:25,999][25689] Fps is (10 sec: 5668.7, 60 sec: 5558.6, 300 sec: 5547.8). Total num frames: 925561856. Throughput: 0: 5019.5. Samples: 925556046. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:25,999][25689] Avg episode reward: [(0, '0.529')] [2022-07-10 21:44:27,088][26022] Updated weights on worker 0-0, policy_version 903874 (0.00089) [2022-07-10 21:44:28,611][26022] Updated weights on worker 0-0, policy_version 903884 (0.00086) [2022-07-10 21:44:30,656][26022] Updated weights on worker 0-0, policy_version 903894 (0.00098) [2022-07-10 21:44:31,033][25689] Fps is (10 sec: 5490.4, 60 sec: 5526.3, 300 sec: 5543.7). Total num frames: 925588480. Throughput: 0: 5836.2. Samples: 925589418. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:31,033][25689] Avg episode reward: [(0, '0.790')] [2022-07-10 21:44:32,436][26022] Updated weights on worker 0-0, policy_version 903904 (0.00087) [2022-07-10 21:44:34,385][26022] Updated weights on worker 0-0, policy_version 903914 (0.00089) [2022-07-10 21:44:36,060][25689] Fps is (10 sec: 5394.2, 60 sec: 5525.6, 300 sec: 5544.8). Total num frames: 925616128. Throughput: 0: 5837.5. Samples: 925622964. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:36,061][25689] Avg episode reward: [(0, '0.657')] [2022-07-10 21:44:36,324][26022] Updated weights on worker 0-0, policy_version 903924 (0.00092) [2022-07-10 21:44:37,798][26022] Updated weights on worker 0-0, policy_version 903934 (0.00082) [2022-07-10 21:44:40,131][26022] Updated weights on worker 0-0, policy_version 903944 (0.00093) [2022-07-10 21:44:41,128][25689] Fps is (10 sec: 5781.5, 60 sec: 5576.6, 300 sec: 5550.6). Total num frames: 925646848. Throughput: 0: 4982.9. Samples: 925639690. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:41,129][25689] Avg episode reward: [(0, '0.890')] [2022-07-10 21:44:41,529][26022] Updated weights on worker 0-0, policy_version 903954 (0.00085) [2022-07-10 21:44:43,550][26022] Updated weights on worker 0-0, policy_version 903964 (0.00093) [2022-07-10 21:44:45,457][26022] Updated weights on worker 0-0, policy_version 903974 (0.00085) [2022-07-10 21:44:46,141][25689] Fps is (10 sec: 5688.3, 60 sec: 5528.3, 300 sec: 5547.4). Total num frames: 925673472. Throughput: 0: 5818.5. Samples: 925673244. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:46,141][25689] Avg episode reward: [(0, '1.115')] [2022-07-10 21:44:47,052][26022] Updated weights on worker 0-0, policy_version 903984 (0.00081) [2022-07-10 21:44:49,167][26022] Updated weights on worker 0-0, policy_version 903994 (0.00091) [2022-07-10 21:44:50,933][26022] Updated weights on worker 0-0, policy_version 904004 (0.00092) [2022-07-10 21:44:51,211][25689] Fps is (10 sec: 5382.6, 60 sec: 5541.4, 300 sec: 5539.5). Total num frames: 925701120. Throughput: 0: 5810.8. Samples: 925706672. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:51,212][25689] Avg episode reward: [(0, '1.112')] [2022-07-10 21:44:52,779][26022] Updated weights on worker 0-0, policy_version 904014 (0.00085) [2022-07-10 21:44:54,692][26022] Updated weights on worker 0-0, policy_version 904024 (0.00099) [2022-07-10 21:44:56,225][25689] Fps is (10 sec: 5584.7, 60 sec: 5559.6, 300 sec: 5543.6). Total num frames: 925729792. Throughput: 0: 4979.0. Samples: 925723368. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:44:56,227][25689] Avg episode reward: [(0, '1.092')] [2022-07-10 21:44:56,447][26022] Updated weights on worker 0-0, policy_version 904034 (0.00086) [2022-07-10 21:44:58,306][26022] Updated weights on worker 0-0, policy_version 904044 (0.00090) [2022-07-10 21:45:00,519][26022] Updated weights on worker 0-0, policy_version 904054 (0.00083) [2022-07-10 21:45:01,311][25689] Fps is (10 sec: 5576.4, 60 sec: 5540.5, 300 sec: 5545.6). Total num frames: 925757440. Throughput: 0: 5773.1. Samples: 925756206. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:01,311][25689] Avg episode reward: [(0, '0.903')] [2022-07-10 21:45:02,315][26022] Updated weights on worker 0-0, policy_version 904064 (0.00103) [2022-07-10 21:45:04,543][26022] Updated weights on worker 0-0, policy_version 904074 (0.00093) [2022-07-10 21:45:06,222][26022] Updated weights on worker 0-0, policy_version 904084 (0.00092) [2022-07-10 21:45:06,339][25689] Fps is (10 sec: 5265.2, 60 sec: 5546.0, 300 sec: 5538.4). Total num frames: 925783040. Throughput: 0: 5633.6. Samples: 925787030. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:06,339][25689] Avg episode reward: [(0, '0.589')] [2022-07-10 21:45:07,613][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:45:07,626][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000904092_925790208.pth [2022-07-10 21:45:07,626][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000902141_923792384.pth [2022-07-10 21:45:07,969][26022] Updated weights on worker 0-0, policy_version 904094 (0.00091) [2022-07-10 21:45:09,655][26022] Updated weights on worker 0-0, policy_version 904104 (0.00095) [2022-07-10 21:45:11,342][25689] Fps is (10 sec: 5308.2, 60 sec: 5546.4, 300 sec: 5536.6). Total num frames: 925810688. Throughput: 0: 4820.8. Samples: 925803718. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:11,342][25689] Avg episode reward: [(0, '0.451')] [2022-07-10 21:45:11,642][26022] Updated weights on worker 0-0, policy_version 904114 (0.00091) [2022-07-10 21:45:13,412][26022] Updated weights on worker 0-0, policy_version 904124 (0.00090) [2022-07-10 21:45:15,412][26022] Updated weights on worker 0-0, policy_version 904134 (0.00639) [2022-07-10 21:45:16,359][25689] Fps is (10 sec: 5620.6, 60 sec: 5536.9, 300 sec: 5538.1). Total num frames: 925839360. Throughput: 0: 5669.1. Samples: 925837506. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:16,361][25689] Avg episode reward: [(0, '0.532')] [2022-07-10 21:45:17,005][26022] Updated weights on worker 0-0, policy_version 904144 (0.00084) [2022-07-10 21:45:19,125][26022] Updated weights on worker 0-0, policy_version 904154 (0.00092) [2022-07-10 21:45:20,777][26022] Updated weights on worker 0-0, policy_version 904164 (0.00085) [2022-07-10 21:45:21,469][25689] Fps is (10 sec: 5561.2, 60 sec: 5517.0, 300 sec: 5536.3). Total num frames: 925867008. Throughput: 0: 5693.3. Samples: 925870974. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:21,470][25689] Avg episode reward: [(0, '-0.163')] [2022-07-10 21:45:22,613][26022] Updated weights on worker 0-0, policy_version 904174 (0.00086) [2022-07-10 21:45:24,547][26022] Updated weights on worker 0-0, policy_version 904184 (0.00082) [2022-07-10 21:45:26,230][26022] Updated weights on worker 0-0, policy_version 904194 (0.00105) [2022-07-10 21:45:26,480][25689] Fps is (10 sec: 5463.6, 60 sec: 5502.5, 300 sec: 5536.4). Total num frames: 925894656. Throughput: 0: 5824.6. Samples: 925904344. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:26,487][25689] Avg episode reward: [(0, '0.269')] [2022-07-10 21:45:28,203][26022] Updated weights on worker 0-0, policy_version 904204 (0.00088) [2022-07-10 21:45:30,088][26022] Updated weights on worker 0-0, policy_version 904214 (0.00090) [2022-07-10 21:45:31,495][25689] Fps is (10 sec: 5515.6, 60 sec: 5521.2, 300 sec: 5539.9). Total num frames: 925922304. Throughput: 0: 5820.0. Samples: 925921008. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:31,497][25689] Avg episode reward: [(0, '0.005')] [2022-07-10 21:45:31,827][26022] Updated weights on worker 0-0, policy_version 904224 (0.00084) [2022-07-10 21:45:33,989][26022] Updated weights on worker 0-0, policy_version 904234 (0.00083) [2022-07-10 21:45:35,386][26022] Updated weights on worker 0-0, policy_version 904244 (0.00094) [2022-07-10 21:45:36,527][25689] Fps is (10 sec: 5605.7, 60 sec: 5537.7, 300 sec: 5537.0). Total num frames: 925950976. Throughput: 0: 5797.1. Samples: 925954420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:36,528][25689] Avg episode reward: [(0, '0.344')] [2022-07-10 21:45:37,562][26022] Updated weights on worker 0-0, policy_version 904254 (0.00091) [2022-07-10 21:45:39,059][26022] Updated weights on worker 0-0, policy_version 904264 (0.00089) [2022-07-10 21:45:41,291][26022] Updated weights on worker 0-0, policy_version 904274 (0.00089) [2022-07-10 21:45:41,560][25689] Fps is (10 sec: 5595.8, 60 sec: 5490.1, 300 sec: 5541.2). Total num frames: 925978624. Throughput: 0: 5824.9. Samples: 925987998. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:41,561][25689] Avg episode reward: [(0, '0.372')] [2022-07-10 21:45:42,840][26022] Updated weights on worker 0-0, policy_version 904284 (0.00093) [2022-07-10 21:45:44,763][26022] Updated weights on worker 0-0, policy_version 904294 (0.00085) [2022-07-10 21:45:46,564][25689] Fps is (10 sec: 5611.2, 60 sec: 5524.8, 300 sec: 5538.2). Total num frames: 926007296. Throughput: 0: 5001.7. Samples: 926004798. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:46,564][25689] Avg episode reward: [(0, '0.697')] [2022-07-10 21:45:46,573][26022] Updated weights on worker 0-0, policy_version 904304 (0.00089) [2022-07-10 21:45:48,313][26022] Updated weights on worker 0-0, policy_version 904314 (0.00087) [2022-07-10 21:45:50,285][26022] Updated weights on worker 0-0, policy_version 904324 (0.00114) [2022-07-10 21:45:51,574][25689] Fps is (10 sec: 5521.8, 60 sec: 5513.3, 300 sec: 5535.3). Total num frames: 926033920. Throughput: 0: 5840.3. Samples: 926038276. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:51,583][25689] Avg episode reward: [(0, '0.291')] [2022-07-10 21:45:52,074][26022] Updated weights on worker 0-0, policy_version 904334 (0.00083) [2022-07-10 21:45:53,774][26022] Updated weights on worker 0-0, policy_version 904344 (0.00082) [2022-07-10 21:45:55,692][26022] Updated weights on worker 0-0, policy_version 904354 (0.00086) [2022-07-10 21:45:56,608][25689] Fps is (10 sec: 5505.5, 60 sec: 5511.6, 300 sec: 5536.6). Total num frames: 926062592. Throughput: 0: 5840.6. Samples: 926071704. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:45:56,608][25689] Avg episode reward: [(0, '-0.460')] [2022-07-10 21:45:57,655][26022] Updated weights on worker 0-0, policy_version 904364 (0.00096) [2022-07-10 21:45:59,482][26022] Updated weights on worker 0-0, policy_version 904374 (0.00100) [2022-07-10 21:46:01,452][26022] Updated weights on worker 0-0, policy_version 904384 (0.00091) [2022-07-10 21:46:01,650][25689] Fps is (10 sec: 5589.2, 60 sec: 5515.4, 300 sec: 5536.1). Total num frames: 926090240. Throughput: 0: 4997.4. Samples: 926088400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:01,655][25689] Avg episode reward: [(0, '-0.215')] [2022-07-10 21:46:03,342][26022] Updated weights on worker 0-0, policy_version 904394 (0.00084) [2022-07-10 21:46:05,507][26022] Updated weights on worker 0-0, policy_version 904404 (0.00088) [2022-07-10 21:46:06,713][25689] Fps is (10 sec: 5370.4, 60 sec: 5529.2, 300 sec: 5538.7). Total num frames: 926116864. Throughput: 0: 5704.6. Samples: 926119744. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:06,714][25689] Avg episode reward: [(0, '-0.317')] [2022-07-10 21:46:07,250][26022] Updated weights on worker 0-0, policy_version 904414 (0.00100) [2022-07-10 21:46:09,099][26022] Updated weights on worker 0-0, policy_version 904424 (0.00082) [2022-07-10 21:46:11,052][26022] Updated weights on worker 0-0, policy_version 904434 (0.00088) [2022-07-10 21:46:11,726][25689] Fps is (10 sec: 5386.4, 60 sec: 5528.3, 300 sec: 5535.3). Total num frames: 926144512. Throughput: 0: 5685.5. Samples: 926152852. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:11,728][25689] Avg episode reward: [(0, '-1.071')] [2022-07-10 21:46:12,675][26022] Updated weights on worker 0-0, policy_version 904444 (0.00089) [2022-07-10 21:46:14,656][26022] Updated weights on worker 0-0, policy_version 904454 (0.00105) [2022-07-10 21:46:16,615][26022] Updated weights on worker 0-0, policy_version 904464 (0.00087) [2022-07-10 21:46:16,732][25689] Fps is (10 sec: 5519.5, 60 sec: 5512.4, 300 sec: 5537.2). Total num frames: 926172160. Throughput: 0: 4847.1. Samples: 926169248. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:16,733][25689] Avg episode reward: [(0, '-0.544')] [2022-07-10 21:46:18,301][26022] Updated weights on worker 0-0, policy_version 904474 (0.00083) [2022-07-10 21:46:20,456][26022] Updated weights on worker 0-0, policy_version 904484 (0.00084) [2022-07-10 21:46:21,795][25689] Fps is (10 sec: 5593.7, 60 sec: 5533.7, 300 sec: 5534.3). Total num frames: 926200832. Throughput: 0: 5657.1. Samples: 926202358. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:21,795][25689] Avg episode reward: [(0, '-0.489')] [2022-07-10 21:46:22,028][26022] Updated weights on worker 0-0, policy_version 904494 (0.00085) [2022-07-10 21:46:24,048][26022] Updated weights on worker 0-0, policy_version 904504 (0.00084) [2022-07-10 21:46:25,923][26022] Updated weights on worker 0-0, policy_version 904514 (0.00089) [2022-07-10 21:46:26,805][25689] Fps is (10 sec: 5387.7, 60 sec: 5499.8, 300 sec: 5527.9). Total num frames: 926226432. Throughput: 0: 5772.0. Samples: 926235714. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:26,806][25689] Avg episode reward: [(0, '-0.080')] [2022-07-10 21:46:27,585][26022] Updated weights on worker 0-0, policy_version 904524 (0.00093) [2022-07-10 21:46:29,709][26022] Updated weights on worker 0-0, policy_version 904534 (0.00056) [2022-07-10 21:46:31,304][26022] Updated weights on worker 0-0, policy_version 904544 (0.00082) [2022-07-10 21:46:31,815][25689] Fps is (10 sec: 5518.3, 60 sec: 5534.2, 300 sec: 5534.9). Total num frames: 926256128. Throughput: 0: 4954.8. Samples: 926252390. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:31,816][25689] Avg episode reward: [(0, '0.158')] [2022-07-10 21:46:33,351][26022] Updated weights on worker 0-0, policy_version 904554 (0.00088) [2022-07-10 21:46:35,092][26022] Updated weights on worker 0-0, policy_version 904564 (0.00094) [2022-07-10 21:46:36,807][26022] Updated weights on worker 0-0, policy_version 904574 (0.00091) [2022-07-10 21:46:36,874][25689] Fps is (10 sec: 5695.4, 60 sec: 5514.8, 300 sec: 5535.1). Total num frames: 926283776. Throughput: 0: 5792.1. Samples: 926285912. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:36,874][25689] Avg episode reward: [(0, '-0.382')] [2022-07-10 21:46:38,871][26022] Updated weights on worker 0-0, policy_version 904584 (0.00086) [2022-07-10 21:46:40,408][26022] Updated weights on worker 0-0, policy_version 904594 (0.00094) [2022-07-10 21:46:41,967][25689] Fps is (10 sec: 5446.7, 60 sec: 5509.2, 300 sec: 5523.4). Total num frames: 926311424. Throughput: 0: 5802.7. Samples: 926319414. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:41,968][25689] Avg episode reward: [(0, '0.229')] [2022-07-10 21:46:42,347][26022] Updated weights on worker 0-0, policy_version 904604 (0.00089) [2022-07-10 21:46:44,227][26022] Updated weights on worker 0-0, policy_version 904614 (0.00085) [2022-07-10 21:46:45,975][26022] Updated weights on worker 0-0, policy_version 904624 (0.00085) [2022-07-10 21:46:46,975][25689] Fps is (10 sec: 5575.3, 60 sec: 5508.9, 300 sec: 5531.9). Total num frames: 926340096. Throughput: 0: 4970.6. Samples: 926335972. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:46,976][25689] Avg episode reward: [(0, '0.022')] [2022-07-10 21:46:47,743][26022] Updated weights on worker 0-0, policy_version 904634 (0.00085) [2022-07-10 21:46:49,771][26022] Updated weights on worker 0-0, policy_version 904644 (0.00095) [2022-07-10 21:46:51,487][26022] Updated weights on worker 0-0, policy_version 904654 (0.00100) [2022-07-10 21:46:51,999][25689] Fps is (10 sec: 5716.1, 60 sec: 5541.5, 300 sec: 5532.5). Total num frames: 926368768. Throughput: 0: 5819.1. Samples: 926369844. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:52,000][25689] Avg episode reward: [(0, '-1.106')] [2022-07-10 21:46:53,466][26022] Updated weights on worker 0-0, policy_version 904664 (0.00084) [2022-07-10 21:46:55,195][26022] Updated weights on worker 0-0, policy_version 904674 (0.00095) [2022-07-10 21:46:57,023][25689] Fps is (10 sec: 5503.4, 60 sec: 5508.5, 300 sec: 5530.0). Total num frames: 926395392. Throughput: 0: 5821.3. Samples: 926403206. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:46:57,023][25689] Avg episode reward: [(0, '-0.950')] [2022-07-10 21:46:57,088][26022] Updated weights on worker 0-0, policy_version 904684 (0.00086) [2022-07-10 21:46:58,961][26022] Updated weights on worker 0-0, policy_version 904694 (0.00086) [2022-07-10 21:47:00,831][26022] Updated weights on worker 0-0, policy_version 904704 (0.00093) [2022-07-10 21:47:02,133][25689] Fps is (10 sec: 5254.6, 60 sec: 5485.5, 300 sec: 5529.0). Total num frames: 926422016. Throughput: 0: 4983.3. Samples: 926419906. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:02,135][25689] Avg episode reward: [(0, '-1.529')] [2022-07-10 21:47:03,055][26022] Updated weights on worker 0-0, policy_version 904715 (0.00088) [2022-07-10 21:47:04,880][26022] Updated weights on worker 0-0, policy_version 904725 (0.00087) [2022-07-10 21:47:06,964][26022] Updated weights on worker 0-0, policy_version 904735 (0.00121) [2022-07-10 21:47:07,180][25689] Fps is (10 sec: 5343.4, 60 sec: 5503.9, 300 sec: 5528.6). Total num frames: 926449664. Throughput: 0: 5702.5. Samples: 926451188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:07,181][25689] Avg episode reward: [(0, '-1.057')] [2022-07-10 21:47:07,721][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:47:07,735][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000904739_926452736.pth [2022-07-10 21:47:07,736][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000902794_924461056.pth [2022-07-10 21:47:08,663][26022] Updated weights on worker 0-0, policy_version 904745 (0.00100) [2022-07-10 21:47:10,572][26022] Updated weights on worker 0-0, policy_version 904755 (0.00089) [2022-07-10 21:47:12,200][25689] Fps is (10 sec: 5594.8, 60 sec: 5520.2, 300 sec: 5525.4). Total num frames: 926478336. Throughput: 0: 5666.7. Samples: 926484312. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:12,200][25689] Avg episode reward: [(0, '-0.913')] [2022-07-10 21:47:12,423][26022] Updated weights on worker 0-0, policy_version 904765 (0.00084) [2022-07-10 21:47:14,199][26022] Updated weights on worker 0-0, policy_version 904775 (0.00086) [2022-07-10 21:47:15,944][26022] Updated weights on worker 0-0, policy_version 904785 (0.00093) [2022-07-10 21:47:17,202][25689] Fps is (10 sec: 5517.0, 60 sec: 5503.5, 300 sec: 5526.1). Total num frames: 926504960. Throughput: 0: 4849.1. Samples: 926501058. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:17,206][25689] Avg episode reward: [(0, '-0.586')] [2022-07-10 21:47:17,825][26022] Updated weights on worker 0-0, policy_version 904795 (0.00081) [2022-07-10 21:47:19,754][26022] Updated weights on worker 0-0, policy_version 904805 (0.00087) [2022-07-10 21:47:21,668][26022] Updated weights on worker 0-0, policy_version 904815 (0.00086) [2022-07-10 21:47:22,303][25689] Fps is (10 sec: 5675.8, 60 sec: 5533.9, 300 sec: 5525.2). Total num frames: 926535680. Throughput: 0: 5688.3. Samples: 926534638. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:22,305][25689] Avg episode reward: [(0, '-0.462')] [2022-07-10 21:47:23,506][26022] Updated weights on worker 0-0, policy_version 904825 (0.00087) [2022-07-10 21:47:25,112][26022] Updated weights on worker 0-0, policy_version 904835 (0.00092) [2022-07-10 21:47:27,307][26022] Updated weights on worker 0-0, policy_version 904845 (0.00085) [2022-07-10 21:47:27,338][25689] Fps is (10 sec: 5556.6, 60 sec: 5531.7, 300 sec: 5522.7). Total num frames: 926561280. Throughput: 0: 5797.7. Samples: 926568060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:27,339][25689] Avg episode reward: [(0, '-0.430')] [2022-07-10 21:47:28,872][26022] Updated weights on worker 0-0, policy_version 904855 (0.00093) [2022-07-10 21:47:30,888][26022] Updated weights on worker 0-0, policy_version 904865 (0.00074) [2022-07-10 21:47:32,351][25689] Fps is (10 sec: 5401.4, 60 sec: 5514.5, 300 sec: 5524.3). Total num frames: 926589952. Throughput: 0: 4977.7. Samples: 926584618. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:32,351][25689] Avg episode reward: [(0, '-1.106')] [2022-07-10 21:47:32,555][26022] Updated weights on worker 0-0, policy_version 904875 (0.00099) [2022-07-10 21:47:34,716][26022] Updated weights on worker 0-0, policy_version 904885 (0.00083) [2022-07-10 21:47:36,200][26022] Updated weights on worker 0-0, policy_version 904895 (0.00494) [2022-07-10 21:47:37,384][25689] Fps is (10 sec: 5606.3, 60 sec: 5516.8, 300 sec: 5522.0). Total num frames: 926617600. Throughput: 0: 5791.0. Samples: 926617928. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:37,385][25689] Avg episode reward: [(0, '-0.932')] [2022-07-10 21:47:38,221][26022] Updated weights on worker 0-0, policy_version 904905 (0.00094) [2022-07-10 21:47:39,887][26022] Updated weights on worker 0-0, policy_version 904915 (0.00085) [2022-07-10 21:47:41,953][26022] Updated weights on worker 0-0, policy_version 904925 (0.00084) [2022-07-10 21:47:42,473][25689] Fps is (10 sec: 5665.3, 60 sec: 5551.1, 300 sec: 5525.3). Total num frames: 926647296. Throughput: 0: 5785.1. Samples: 926651322. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:42,473][25689] Avg episode reward: [(0, '-1.076')] [2022-07-10 21:47:43,772][26022] Updated weights on worker 0-0, policy_version 904935 (0.00085) [2022-07-10 21:47:45,554][26022] Updated weights on worker 0-0, policy_version 904945 (0.00092) [2022-07-10 21:47:47,562][25689] Fps is (10 sec: 5433.1, 60 sec: 5493.0, 300 sec: 5514.3). Total num frames: 926672896. Throughput: 0: 4932.7. Samples: 926667816. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:47,562][25689] Avg episode reward: [(0, '-0.850')] [2022-07-10 21:47:47,592][26022] Updated weights on worker 0-0, policy_version 904955 (0.00092) [2022-07-10 21:47:49,210][26022] Updated weights on worker 0-0, policy_version 904965 (0.00092) [2022-07-10 21:47:51,063][26022] Updated weights on worker 0-0, policy_version 904975 (0.00095) [2022-07-10 21:47:52,570][25689] Fps is (10 sec: 5375.1, 60 sec: 5494.5, 300 sec: 5521.7). Total num frames: 926701568. Throughput: 0: 5758.3. Samples: 926701044. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:52,570][25689] Avg episode reward: [(0, '-0.158')] [2022-07-10 21:47:53,067][26022] Updated weights on worker 0-0, policy_version 904985 (0.00096) [2022-07-10 21:47:54,759][26022] Updated weights on worker 0-0, policy_version 904995 (0.00095) [2022-07-10 21:47:56,806][26022] Updated weights on worker 0-0, policy_version 905005 (0.00094) [2022-07-10 21:47:57,582][25689] Fps is (10 sec: 5722.5, 60 sec: 5529.2, 300 sec: 5522.8). Total num frames: 926730240. Throughput: 0: 5767.7. Samples: 926734426. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:47:57,583][25689] Avg episode reward: [(0, '0.044')] [2022-07-10 21:47:58,683][26022] Updated weights on worker 0-0, policy_version 905015 (0.00083) [2022-07-10 21:48:00,300][26022] Updated weights on worker 0-0, policy_version 905025 (0.00086) [2022-07-10 21:48:02,635][25689] Fps is (10 sec: 5188.5, 60 sec: 5483.7, 300 sec: 5511.8). Total num frames: 926753792. Throughput: 0: 4945.1. Samples: 926751026. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:48:02,635][25689] Avg episode reward: [(0, '1.196')] [2022-07-10 21:48:02,812][26022] Updated weights on worker 0-0, policy_version 905035 (0.00086) [2022-07-10 21:48:04,312][26022] Updated weights on worker 0-0, policy_version 905045 (0.00089) [2022-07-10 21:48:06,268][26022] Updated weights on worker 0-0, policy_version 905055 (0.00091) [2022-07-10 21:48:07,654][25689] Fps is (10 sec: 5388.8, 60 sec: 5537.1, 300 sec: 5525.3). Total num frames: 926784512. Throughput: 0: 5694.2. Samples: 926782224. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-10 21:48:07,654][25689] Avg episode reward: [(0, '1.842')] [2022-07-10 21:48:08,270][26022] Updated weights on worker 0-0, policy_version 905065 (0.00090) [2022-07-10 21:48:09,830][26022] Updated weights on worker 0-0, policy_version 905075 (0.00093) [2022-07-10 21:48:12,042][26022] Updated weights on worker 0-0, policy_version 905085 (0.00086) [2022-07-10 21:48:12,680][25689] Fps is (10 sec: 5708.5, 60 sec: 5502.6, 300 sec: 5522.6). Total num frames: 926811136. Throughput: 0: 5683.2. Samples: 926815338. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:12,681][25689] Avg episode reward: [(0, '1.494')] [2022-07-10 21:48:13,621][26022] Updated weights on worker 0-0, policy_version 905095 (0.00344) [2022-07-10 21:48:15,506][26022] Updated weights on worker 0-0, policy_version 905105 (0.00086) [2022-07-10 21:48:17,467][26022] Updated weights on worker 0-0, policy_version 905115 (0.00088) [2022-07-10 21:48:17,710][25689] Fps is (10 sec: 5295.0, 60 sec: 5500.1, 300 sec: 5513.0). Total num frames: 926837760. Throughput: 0: 5680.5. Samples: 926848760. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:17,710][25689] Avg episode reward: [(0, '1.625')] [2022-07-10 21:48:19,141][26022] Updated weights on worker 0-0, policy_version 905125 (0.00089) [2022-07-10 21:48:21,174][26022] Updated weights on worker 0-0, policy_version 905135 (0.00091) [2022-07-10 21:48:22,771][25689] Fps is (10 sec: 5683.1, 60 sec: 5503.8, 300 sec: 5526.7). Total num frames: 926868480. Throughput: 0: 5677.0. Samples: 926865336. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:22,771][25689] Avg episode reward: [(0, '1.038')] [2022-07-10 21:48:22,774][26022] Updated weights on worker 0-0, policy_version 905145 (0.00082) [2022-07-10 21:48:24,807][26022] Updated weights on worker 0-0, policy_version 905155 (0.00088) [2022-07-10 21:48:26,662][26022] Updated weights on worker 0-0, policy_version 905165 (0.00100) [2022-07-10 21:48:27,828][25689] Fps is (10 sec: 5566.2, 60 sec: 5501.7, 300 sec: 5516.2). Total num frames: 926894080. Throughput: 0: 5785.8. Samples: 926898950. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:27,829][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 21:48:28,359][26022] Updated weights on worker 0-0, policy_version 905175 (0.00083) [2022-07-10 21:48:30,249][26022] Updated weights on worker 0-0, policy_version 905185 (0.00091) [2022-07-10 21:48:32,195][26022] Updated weights on worker 0-0, policy_version 905195 (0.00093) [2022-07-10 21:48:32,845][25689] Fps is (10 sec: 5488.7, 60 sec: 5518.3, 300 sec: 5523.1). Total num frames: 926923776. Throughput: 0: 5788.6. Samples: 926932064. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:32,847][25689] Avg episode reward: [(0, '0.177')] [2022-07-10 21:48:34,040][26022] Updated weights on worker 0-0, policy_version 905205 (0.00093) [2022-07-10 21:48:35,978][26022] Updated weights on worker 0-0, policy_version 905215 (0.00068) [2022-07-10 21:48:37,670][26022] Updated weights on worker 0-0, policy_version 905225 (0.00094) [2022-07-10 21:48:37,864][25689] Fps is (10 sec: 5714.1, 60 sec: 5519.6, 300 sec: 5524.1). Total num frames: 926951424. Throughput: 0: 4966.5. Samples: 926948854. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:37,864][25689] Avg episode reward: [(0, '0.196')] [2022-07-10 21:48:39,676][26022] Updated weights on worker 0-0, policy_version 905235 (0.00091) [2022-07-10 21:48:41,373][26022] Updated weights on worker 0-0, policy_version 905245 (0.00089) [2022-07-10 21:48:42,971][25689] Fps is (10 sec: 5461.0, 60 sec: 5484.1, 300 sec: 5516.0). Total num frames: 926979072. Throughput: 0: 5797.0. Samples: 926982436. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:42,971][25689] Avg episode reward: [(0, '0.168')] [2022-07-10 21:48:43,265][26022] Updated weights on worker 0-0, policy_version 905255 (0.00084) [2022-07-10 21:48:44,949][26022] Updated weights on worker 0-0, policy_version 905265 (0.00086) [2022-07-10 21:48:46,914][26022] Updated weights on worker 0-0, policy_version 905275 (0.00084) [2022-07-10 21:48:48,009][25689] Fps is (10 sec: 5551.5, 60 sec: 5539.5, 300 sec: 5522.7). Total num frames: 927007744. Throughput: 0: 5804.2. Samples: 927016082. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:48,009][25689] Avg episode reward: [(0, '-0.281')] [2022-07-10 21:48:48,509][26022] Updated weights on worker 0-0, policy_version 905285 (0.00370) [2022-07-10 21:48:50,571][26022] Updated weights on worker 0-0, policy_version 905295 (0.00095) [2022-07-10 21:48:52,154][26022] Updated weights on worker 0-0, policy_version 905305 (0.00085) [2022-07-10 21:48:53,095][25689] Fps is (10 sec: 5461.7, 60 sec: 5498.5, 300 sec: 5518.2). Total num frames: 927034368. Throughput: 0: 4971.3. Samples: 927032732. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:53,096][25689] Avg episode reward: [(0, '-0.165')] [2022-07-10 21:48:54,066][26022] Updated weights on worker 0-0, policy_version 905315 (0.00085) [2022-07-10 21:48:56,334][26022] Updated weights on worker 0-0, policy_version 905325 (0.00092) [2022-07-10 21:48:57,708][26022] Updated weights on worker 0-0, policy_version 905335 (0.00094) [2022-07-10 21:48:58,169][25689] Fps is (10 sec: 5643.8, 60 sec: 5526.7, 300 sec: 5524.8). Total num frames: 927065088. Throughput: 0: 5761.3. Samples: 927065840. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:48:58,171][25689] Avg episode reward: [(0, '-0.794')] [2022-07-10 21:48:59,945][26022] Updated weights on worker 0-0, policy_version 905345 (0.00071) [2022-07-10 21:49:01,523][26022] Updated weights on worker 0-0, policy_version 905355 (0.00082) [2022-07-10 21:49:03,227][25689] Fps is (10 sec: 5255.4, 60 sec: 5509.3, 300 sec: 5515.0). Total num frames: 927087616. Throughput: 0: 5666.1. Samples: 927097210. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:03,228][25689] Avg episode reward: [(0, '-1.010')] [2022-07-10 21:49:03,899][26022] Updated weights on worker 0-0, policy_version 905365 (0.00085) [2022-07-10 21:49:05,719][26022] Updated weights on worker 0-0, policy_version 905375 (0.00098) [2022-07-10 21:49:07,570][26022] Updated weights on worker 0-0, policy_version 905385 (0.00094) [2022-07-10 21:49:07,895][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:49:07,910][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000905387_927116288.pth [2022-07-10 21:49:07,910][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000903443_925125632.pth [2022-07-10 21:49:08,247][25689] Fps is (10 sec: 5182.2, 60 sec: 5492.4, 300 sec: 5521.7). Total num frames: 927117312. Throughput: 0: 4822.3. Samples: 927113676. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:08,247][25689] Avg episode reward: [(0, '-0.894')] [2022-07-10 21:49:09,403][26022] Updated weights on worker 0-0, policy_version 905395 (0.00086) [2022-07-10 21:49:11,391][26022] Updated weights on worker 0-0, policy_version 905405 (0.00088) [2022-07-10 21:49:12,954][26022] Updated weights on worker 0-0, policy_version 905415 (0.00097) [2022-07-10 21:49:13,328][25689] Fps is (10 sec: 5778.6, 60 sec: 5521.2, 300 sec: 5518.6). Total num frames: 927145984. Throughput: 0: 5656.3. Samples: 927147174. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:13,328][25689] Avg episode reward: [(0, '-0.927')] [2022-07-10 21:49:15,056][26022] Updated weights on worker 0-0, policy_version 905425 (0.00092) [2022-07-10 21:49:16,654][26022] Updated weights on worker 0-0, policy_version 905435 (0.00088) [2022-07-10 21:49:18,330][25689] Fps is (10 sec: 5585.2, 60 sec: 5540.6, 300 sec: 5516.5). Total num frames: 927173632. Throughput: 0: 5708.0. Samples: 927180920. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:18,331][25689] Avg episode reward: [(0, '-0.231')] [2022-07-10 21:49:18,640][26022] Updated weights on worker 0-0, policy_version 905445 (0.00096) [2022-07-10 21:49:20,353][26022] Updated weights on worker 0-0, policy_version 905455 (0.00750) [2022-07-10 21:49:22,281][26022] Updated weights on worker 0-0, policy_version 905465 (0.00083) [2022-07-10 21:49:23,474][25689] Fps is (10 sec: 5551.0, 60 sec: 5499.3, 300 sec: 5514.6). Total num frames: 927202304. Throughput: 0: 4965.3. Samples: 927197740. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:23,474][25689] Avg episode reward: [(0, '-0.714')] [2022-07-10 21:49:23,921][26022] Updated weights on worker 0-0, policy_version 905475 (0.00084) [2022-07-10 21:49:25,961][26022] Updated weights on worker 0-0, policy_version 905485 (0.00088) [2022-07-10 21:49:27,691][26022] Updated weights on worker 0-0, policy_version 905495 (0.00096) [2022-07-10 21:49:28,519][25689] Fps is (10 sec: 5628.5, 60 sec: 5551.0, 300 sec: 5521.3). Total num frames: 927230976. Throughput: 0: 5799.6. Samples: 927231246. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:28,519][25689] Avg episode reward: [(0, '0.140')] [2022-07-10 21:49:29,485][26022] Updated weights on worker 0-0, policy_version 905505 (0.00088) [2022-07-10 21:49:31,190][26022] Updated weights on worker 0-0, policy_version 905515 (0.00090) [2022-07-10 21:49:33,159][26022] Updated weights on worker 0-0, policy_version 905525 (0.00093) [2022-07-10 21:49:33,524][25689] Fps is (10 sec: 5603.7, 60 sec: 5518.3, 300 sec: 5521.7). Total num frames: 927258624. Throughput: 0: 5825.6. Samples: 927264830. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:33,525][25689] Avg episode reward: [(0, '0.417')] [2022-07-10 21:49:34,979][26022] Updated weights on worker 0-0, policy_version 905535 (0.00086) [2022-07-10 21:49:36,795][26022] Updated weights on worker 0-0, policy_version 905545 (0.00113) [2022-07-10 21:49:38,532][25689] Fps is (10 sec: 5624.6, 60 sec: 5536.2, 300 sec: 5515.9). Total num frames: 927287296. Throughput: 0: 4975.0. Samples: 927281422. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:38,532][25689] Avg episode reward: [(0, '-0.480')] [2022-07-10 21:49:38,675][26022] Updated weights on worker 0-0, policy_version 905555 (0.00087) [2022-07-10 21:49:40,575][26022] Updated weights on worker 0-0, policy_version 905565 (0.00088) [2022-07-10 21:49:42,340][26022] Updated weights on worker 0-0, policy_version 905575 (0.00085) [2022-07-10 21:49:43,589][25689] Fps is (10 sec: 5595.7, 60 sec: 5540.8, 300 sec: 5518.5). Total num frames: 927314944. Throughput: 0: 5829.4. Samples: 927315000. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:43,590][25689] Avg episode reward: [(0, '-1.239')] [2022-07-10 21:49:44,143][26022] Updated weights on worker 0-0, policy_version 905585 (0.00089) [2022-07-10 21:49:45,908][26022] Updated weights on worker 0-0, policy_version 905595 (0.00091) [2022-07-10 21:49:48,075][26022] Updated weights on worker 0-0, policy_version 905605 (0.00091) [2022-07-10 21:49:48,612][25689] Fps is (10 sec: 5384.2, 60 sec: 5508.4, 300 sec: 5516.0). Total num frames: 927341568. Throughput: 0: 5848.0. Samples: 927348750. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:48,614][25689] Avg episode reward: [(0, '-1.595')] [2022-07-10 21:49:49,621][26022] Updated weights on worker 0-0, policy_version 905615 (0.00084) [2022-07-10 21:49:51,751][26022] Updated weights on worker 0-0, policy_version 905625 (0.00094) [2022-07-10 21:49:53,422][26022] Updated weights on worker 0-0, policy_version 905635 (0.00106) [2022-07-10 21:49:53,615][25689] Fps is (10 sec: 5515.5, 60 sec: 5549.8, 300 sec: 5516.2). Total num frames: 927370240. Throughput: 0: 5002.8. Samples: 927365338. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:53,615][25689] Avg episode reward: [(0, '-1.654')] [2022-07-10 21:49:55,124][26022] Updated weights on worker 0-0, policy_version 905645 (0.00087) [2022-07-10 21:49:57,245][26022] Updated weights on worker 0-0, policy_version 905655 (0.00087) [2022-07-10 21:49:58,636][25689] Fps is (10 sec: 5822.4, 60 sec: 5537.7, 300 sec: 5524.2). Total num frames: 927399936. Throughput: 0: 5820.9. Samples: 927398448. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:49:58,637][25689] Avg episode reward: [(0, '-2.532')] [2022-07-10 21:49:58,699][26022] Updated weights on worker 0-0, policy_version 905665 (0.00084) [2022-07-10 21:50:00,926][26022] Updated weights on worker 0-0, policy_version 905675 (0.00087) [2022-07-10 21:50:03,111][26022] Updated weights on worker 0-0, policy_version 905685 (0.00086) [2022-07-10 21:50:03,703][25689] Fps is (10 sec: 5379.5, 60 sec: 5570.7, 300 sec: 5520.1). Total num frames: 927424512. Throughput: 0: 5718.3. Samples: 927430018. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:03,704][25689] Avg episode reward: [(0, '-2.526')] [2022-07-10 21:50:04,924][26022] Updated weights on worker 0-0, policy_version 905695 (0.00093) [2022-07-10 21:50:06,947][26022] Updated weights on worker 0-0, policy_version 905705 (0.00087) [2022-07-10 21:50:08,554][26022] Updated weights on worker 0-0, policy_version 905715 (0.00084) [2022-07-10 21:50:08,732][25689] Fps is (10 sec: 5274.5, 60 sec: 5553.0, 300 sec: 5523.0). Total num frames: 927453184. Throughput: 0: 4860.1. Samples: 927446534. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:08,732][25689] Avg episode reward: [(0, '-2.018')] [2022-07-10 21:50:10,494][26022] Updated weights on worker 0-0, policy_version 905725 (0.00090) [2022-07-10 21:50:12,452][26022] Updated weights on worker 0-0, policy_version 905735 (0.00091) [2022-07-10 21:50:13,734][25689] Fps is (10 sec: 5614.6, 60 sec: 5543.3, 300 sec: 5519.9). Total num frames: 927480832. Throughput: 0: 5689.8. Samples: 927479814. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:13,736][25689] Avg episode reward: [(0, '-1.209')] [2022-07-10 21:50:14,255][26022] Updated weights on worker 0-0, policy_version 905745 (0.00086) [2022-07-10 21:50:16,103][26022] Updated weights on worker 0-0, policy_version 905755 (0.00085) [2022-07-10 21:50:17,929][26022] Updated weights on worker 0-0, policy_version 905765 (0.00095) [2022-07-10 21:50:18,798][25689] Fps is (10 sec: 5391.5, 60 sec: 5520.7, 300 sec: 5517.3). Total num frames: 927507456. Throughput: 0: 5688.8. Samples: 927513142. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:18,799][25689] Avg episode reward: [(0, '-1.278')] [2022-07-10 21:50:19,705][26022] Updated weights on worker 0-0, policy_version 905775 (0.00085) [2022-07-10 21:50:21,639][26022] Updated weights on worker 0-0, policy_version 905785 (0.00091) [2022-07-10 21:50:23,239][26022] Updated weights on worker 0-0, policy_version 905795 (0.00091) [2022-07-10 21:50:23,838][25689] Fps is (10 sec: 5573.9, 60 sec: 5547.1, 300 sec: 5523.6). Total num frames: 927537152. Throughput: 0: 4958.3. Samples: 927529854. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:23,839][25689] Avg episode reward: [(0, '0.313')] [2022-07-10 21:50:25,227][26022] Updated weights on worker 0-0, policy_version 905805 (0.00089) [2022-07-10 21:50:27,088][26022] Updated weights on worker 0-0, policy_version 905815 (0.00088) [2022-07-10 21:50:28,841][25689] Fps is (10 sec: 5709.9, 60 sec: 5534.0, 300 sec: 5523.9). Total num frames: 927564800. Throughput: 0: 5804.8. Samples: 927563264. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:28,842][25689] Avg episode reward: [(0, '0.160')] [2022-07-10 21:50:28,851][26022] Updated weights on worker 0-0, policy_version 905825 (0.00089) [2022-07-10 21:50:30,803][26022] Updated weights on worker 0-0, policy_version 905835 (0.00088) [2022-07-10 21:50:32,568][26022] Updated weights on worker 0-0, policy_version 905845 (0.00086) [2022-07-10 21:50:33,848][25689] Fps is (10 sec: 5524.4, 60 sec: 5533.9, 300 sec: 5520.9). Total num frames: 927592448. Throughput: 0: 5811.2. Samples: 927596698. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:33,850][25689] Avg episode reward: [(0, '-0.415')] [2022-07-10 21:50:34,326][26022] Updated weights on worker 0-0, policy_version 905855 (0.00090) [2022-07-10 21:50:36,272][26022] Updated weights on worker 0-0, policy_version 905865 (0.00085) [2022-07-10 21:50:37,958][26022] Updated weights on worker 0-0, policy_version 905875 (0.00087) [2022-07-10 21:50:38,903][25689] Fps is (10 sec: 5495.7, 60 sec: 5512.6, 300 sec: 5520.5). Total num frames: 927620096. Throughput: 0: 4984.9. Samples: 927613362. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:38,904][25689] Avg episode reward: [(0, '-0.449')] [2022-07-10 21:50:40,012][26022] Updated weights on worker 0-0, policy_version 905885 (0.00085) [2022-07-10 21:50:41,795][26022] Updated weights on worker 0-0, policy_version 905895 (0.00087) [2022-07-10 21:50:43,544][26022] Updated weights on worker 0-0, policy_version 905905 (0.00086) [2022-07-10 21:50:43,959][25689] Fps is (10 sec: 5468.7, 60 sec: 5512.7, 300 sec: 5516.0). Total num frames: 927647744. Throughput: 0: 5794.1. Samples: 927646436. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:43,961][25689] Avg episode reward: [(0, '-0.089')] [2022-07-10 21:50:45,582][26022] Updated weights on worker 0-0, policy_version 905915 (0.00095) [2022-07-10 21:50:47,296][26022] Updated weights on worker 0-0, policy_version 905925 (0.00090) [2022-07-10 21:50:48,969][25689] Fps is (10 sec: 5493.1, 60 sec: 5530.8, 300 sec: 5519.5). Total num frames: 927675392. Throughput: 0: 5792.3. Samples: 927679852. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:48,970][25689] Avg episode reward: [(0, '0.228')] [2022-07-10 21:50:49,190][26022] Updated weights on worker 0-0, policy_version 905935 (0.00090) [2022-07-10 21:50:51,084][26022] Updated weights on worker 0-0, policy_version 905945 (0.00091) [2022-07-10 21:50:52,844][26022] Updated weights on worker 0-0, policy_version 905955 (0.00089) [2022-07-10 21:50:53,982][25689] Fps is (10 sec: 5517.2, 60 sec: 5512.9, 300 sec: 5516.4). Total num frames: 927703040. Throughput: 0: 4965.8. Samples: 927696678. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:53,982][25689] Avg episode reward: [(0, '-0.001')] [2022-07-10 21:50:54,872][26022] Updated weights on worker 0-0, policy_version 905965 (0.00091) [2022-07-10 21:50:56,400][26022] Updated weights on worker 0-0, policy_version 905975 (0.00092) [2022-07-10 21:50:58,261][26022] Updated weights on worker 0-0, policy_version 905985 (0.00092) [2022-07-10 21:50:59,003][25689] Fps is (10 sec: 5715.0, 60 sec: 5513.0, 300 sec: 5523.7). Total num frames: 927732736. Throughput: 0: 5831.9. Samples: 927730586. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:50:59,004][25689] Avg episode reward: [(0, '-0.300')] [2022-07-10 21:51:00,295][26022] Updated weights on worker 0-0, policy_version 905995 (0.00092) [2022-07-10 21:51:02,244][26022] Updated weights on worker 0-0, policy_version 906005 (0.00097) [2022-07-10 21:51:04,049][25689] Fps is (10 sec: 5492.5, 60 sec: 5531.8, 300 sec: 5520.6). Total num frames: 927758336. Throughput: 0: 5765.8. Samples: 927762270. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:04,050][25689] Avg episode reward: [(0, '-0.332')] [2022-07-10 21:51:04,317][26022] Updated weights on worker 0-0, policy_version 906015 (0.00092) [2022-07-10 21:51:06,037][26022] Updated weights on worker 0-0, policy_version 906025 (0.00086) [2022-07-10 21:51:07,721][26022] Updated weights on worker 0-0, policy_version 906035 (0.00083) [2022-07-10 21:51:08,103][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:51:08,118][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000906037_927781888.pth [2022-07-10 21:51:08,119][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000904092_925790208.pth [2022-07-10 21:51:09,052][25689] Fps is (10 sec: 5298.9, 60 sec: 5517.2, 300 sec: 5520.8). Total num frames: 927785984. Throughput: 0: 5750.1. Samples: 927795330. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:09,052][25689] Avg episode reward: [(0, '-0.457')] [2022-07-10 21:51:10,112][26022] Updated weights on worker 0-0, policy_version 906045 (0.00084) [2022-07-10 21:51:11,419][26022] Updated weights on worker 0-0, policy_version 906055 (0.00080) [2022-07-10 21:51:13,572][26022] Updated weights on worker 0-0, policy_version 906065 (0.00091) [2022-07-10 21:51:14,059][25689] Fps is (10 sec: 5524.0, 60 sec: 5516.8, 300 sec: 5520.7). Total num frames: 927813632. Throughput: 0: 5739.5. Samples: 927811914. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:14,071][25689] Avg episode reward: [(0, '-0.685')] [2022-07-10 21:51:15,215][26022] Updated weights on worker 0-0, policy_version 906075 (0.00094) [2022-07-10 21:51:17,147][26022] Updated weights on worker 0-0, policy_version 906085 (0.00088) [2022-07-10 21:51:19,072][25689] Fps is (10 sec: 5416.4, 60 sec: 5521.5, 300 sec: 5514.8). Total num frames: 927840256. Throughput: 0: 5717.4. Samples: 927845326. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:19,073][25689] Avg episode reward: [(0, '-1.166')] [2022-07-10 21:51:19,157][26022] Updated weights on worker 0-0, policy_version 906095 (0.00091) [2022-07-10 21:51:20,874][26022] Updated weights on worker 0-0, policy_version 906105 (0.00097) [2022-07-10 21:51:22,666][26022] Updated weights on worker 0-0, policy_version 906115 (0.00092) [2022-07-10 21:51:24,130][25689] Fps is (10 sec: 5592.6, 60 sec: 5519.9, 300 sec: 5527.7). Total num frames: 927869952. Throughput: 0: 5814.9. Samples: 927879036. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:24,130][25689] Avg episode reward: [(0, '-1.005')] [2022-07-10 21:51:24,407][26022] Updated weights on worker 0-0, policy_version 906125 (0.00089) [2022-07-10 21:51:26,397][26022] Updated weights on worker 0-0, policy_version 906135 (0.00357) [2022-07-10 21:51:28,118][26022] Updated weights on worker 0-0, policy_version 906145 (0.00086) [2022-07-10 21:51:29,159][25689] Fps is (10 sec: 5786.0, 60 sec: 5534.3, 300 sec: 5523.9). Total num frames: 927898624. Throughput: 0: 5003.8. Samples: 927895946. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:29,160][25689] Avg episode reward: [(0, '-0.556')] [2022-07-10 21:51:30,135][26022] Updated weights on worker 0-0, policy_version 906155 (0.00092) [2022-07-10 21:51:31,847][26022] Updated weights on worker 0-0, policy_version 906165 (0.00090) [2022-07-10 21:51:33,774][26022] Updated weights on worker 0-0, policy_version 906175 (0.00089) [2022-07-10 21:51:34,184][25689] Fps is (10 sec: 5601.5, 60 sec: 5532.7, 300 sec: 5524.5). Total num frames: 927926272. Throughput: 0: 5856.3. Samples: 927929772. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:34,185][25689] Avg episode reward: [(0, '-0.502')] [2022-07-10 21:51:35,358][26022] Updated weights on worker 0-0, policy_version 906185 (0.00085) [2022-07-10 21:51:37,301][26022] Updated weights on worker 0-0, policy_version 906195 (0.00085) [2022-07-10 21:51:39,051][26022] Updated weights on worker 0-0, policy_version 906205 (0.00095) [2022-07-10 21:51:39,211][25689] Fps is (10 sec: 5501.2, 60 sec: 5535.3, 300 sec: 5525.8). Total num frames: 927953920. Throughput: 0: 5841.4. Samples: 927962968. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:39,211][25689] Avg episode reward: [(0, '-1.357')] [2022-07-10 21:51:41,032][26022] Updated weights on worker 0-0, policy_version 906215 (0.00081) [2022-07-10 21:51:42,951][26022] Updated weights on worker 0-0, policy_version 906225 (0.00086) [2022-07-10 21:51:44,315][25689] Fps is (10 sec: 5559.2, 60 sec: 5547.9, 300 sec: 5524.0). Total num frames: 927982592. Throughput: 0: 4998.4. Samples: 927979932. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:44,316][25689] Avg episode reward: [(0, '-1.044')] [2022-07-10 21:51:44,742][26022] Updated weights on worker 0-0, policy_version 906235 (0.00086) [2022-07-10 21:51:46,496][26022] Updated weights on worker 0-0, policy_version 906245 (0.00087) [2022-07-10 21:51:48,387][26022] Updated weights on worker 0-0, policy_version 906255 (0.00087) [2022-07-10 21:51:49,326][25689] Fps is (10 sec: 5669.5, 60 sec: 5564.8, 300 sec: 5524.2). Total num frames: 928011264. Throughput: 0: 5834.1. Samples: 928013596. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:49,327][25689] Avg episode reward: [(0, '-1.239')] [2022-07-10 21:51:50,234][26022] Updated weights on worker 0-0, policy_version 906265 (0.00149) [2022-07-10 21:51:51,955][26022] Updated weights on worker 0-0, policy_version 906275 (0.00086) [2022-07-10 21:51:53,908][26022] Updated weights on worker 0-0, policy_version 906285 (0.00087) [2022-07-10 21:51:54,367][25689] Fps is (10 sec: 5501.2, 60 sec: 5545.2, 300 sec: 5523.9). Total num frames: 928037888. Throughput: 0: 5823.2. Samples: 928047300. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-10 21:51:54,369][25689] Avg episode reward: [(0, '-0.896')] [2022-07-10 21:51:55,544][26022] Updated weights on worker 0-0, policy_version 906295 (0.00089) [2022-07-10 21:51:57,503][26022] Updated weights on worker 0-0, policy_version 906305 (0.00085) [2022-07-10 21:51:59,217][26022] Updated weights on worker 0-0, policy_version 906315 (0.00083) [2022-07-10 21:51:59,410][25689] Fps is (10 sec: 5483.1, 60 sec: 5526.2, 300 sec: 5532.0). Total num frames: 928066560. Throughput: 0: 5002.3. Samples: 928064010. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:51:59,412][25689] Avg episode reward: [(0, '-1.003')] [2022-07-10 21:52:01,139][26022] Updated weights on worker 0-0, policy_version 906325 (0.00084) [2022-07-10 21:52:03,474][26022] Updated weights on worker 0-0, policy_version 906335 (0.00087) [2022-07-10 21:52:04,479][25689] Fps is (10 sec: 5468.1, 60 sec: 5541.1, 300 sec: 5528.2). Total num frames: 928093184. Throughput: 0: 5736.0. Samples: 928095594. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:04,480][25689] Avg episode reward: [(0, '-1.016')] [2022-07-10 21:52:04,937][26022] Updated weights on worker 0-0, policy_version 906345 (0.00092) [2022-07-10 21:52:07,143][26022] Updated weights on worker 0-0, policy_version 906355 (0.00086) [2022-07-10 21:52:08,732][26022] Updated weights on worker 0-0, policy_version 906365 (0.00085) [2022-07-10 21:52:09,488][25689] Fps is (10 sec: 5385.5, 60 sec: 5540.5, 300 sec: 5524.9). Total num frames: 928120832. Throughput: 0: 5723.5. Samples: 928128996. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:09,490][25689] Avg episode reward: [(0, '0.305')] [2022-07-10 21:52:10,675][26022] Updated weights on worker 0-0, policy_version 906375 (0.00082) [2022-07-10 21:52:12,618][26022] Updated weights on worker 0-0, policy_version 906385 (0.00432) [2022-07-10 21:52:14,428][26022] Updated weights on worker 0-0, policy_version 906395 (0.00088) [2022-07-10 21:52:14,499][25689] Fps is (10 sec: 5518.8, 60 sec: 5540.2, 300 sec: 5528.2). Total num frames: 928148480. Throughput: 0: 4894.9. Samples: 928145844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:14,501][25689] Avg episode reward: [(0, '-0.968')] [2022-07-10 21:52:16,299][26022] Updated weights on worker 0-0, policy_version 906405 (0.00087) [2022-07-10 21:52:18,055][26022] Updated weights on worker 0-0, policy_version 906415 (0.00087) [2022-07-10 21:52:19,503][25689] Fps is (10 sec: 5725.9, 60 sec: 5591.8, 300 sec: 5526.6). Total num frames: 928178176. Throughput: 0: 5736.2. Samples: 928179264. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:19,503][25689] Avg episode reward: [(0, '-0.425')] [2022-07-10 21:52:19,776][26022] Updated weights on worker 0-0, policy_version 906425 (0.00084) [2022-07-10 21:52:22,003][26022] Updated weights on worker 0-0, policy_version 906435 (0.00100) [2022-07-10 21:52:23,494][26022] Updated weights on worker 0-0, policy_version 906445 (0.00086) [2022-07-10 21:52:24,622][25689] Fps is (10 sec: 5563.8, 60 sec: 5535.4, 300 sec: 5528.5). Total num frames: 928204800. Throughput: 0: 5802.3. Samples: 928212464. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:24,622][25689] Avg episode reward: [(0, '-0.203')] [2022-07-10 21:52:25,344][26022] Updated weights on worker 0-0, policy_version 906455 (0.00094) [2022-07-10 21:52:27,194][26022] Updated weights on worker 0-0, policy_version 906465 (0.00086) [2022-07-10 21:52:29,236][26022] Updated weights on worker 0-0, policy_version 906475 (0.00089) [2022-07-10 21:52:29,634][25689] Fps is (10 sec: 5457.8, 60 sec: 5537.0, 300 sec: 5528.5). Total num frames: 928233472. Throughput: 0: 4968.5. Samples: 928229094. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:29,635][25689] Avg episode reward: [(0, '-0.729')] [2022-07-10 21:52:30,987][26022] Updated weights on worker 0-0, policy_version 906485 (0.00086) [2022-07-10 21:52:32,914][26022] Updated weights on worker 0-0, policy_version 906495 (0.00092) [2022-07-10 21:52:34,544][26022] Updated weights on worker 0-0, policy_version 906505 (0.00091) [2022-07-10 21:52:34,644][25689] Fps is (10 sec: 5619.2, 60 sec: 5538.3, 300 sec: 5528.9). Total num frames: 928261120. Throughput: 0: 5808.6. Samples: 928262860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:34,646][25689] Avg episode reward: [(0, '-1.206')] [2022-07-10 21:52:36,466][26022] Updated weights on worker 0-0, policy_version 906515 (0.00088) [2022-07-10 21:52:38,051][26022] Updated weights on worker 0-0, policy_version 906525 (0.00092) [2022-07-10 21:52:39,675][25689] Fps is (10 sec: 5507.4, 60 sec: 5538.0, 300 sec: 5523.1). Total num frames: 928288768. Throughput: 0: 5812.4. Samples: 928296512. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:39,676][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 21:52:40,032][26022] Updated weights on worker 0-0, policy_version 906535 (0.00089) [2022-07-10 21:52:42,014][26022] Updated weights on worker 0-0, policy_version 906545 (0.00081) [2022-07-10 21:52:43,707][26022] Updated weights on worker 0-0, policy_version 906555 (0.00084) [2022-07-10 21:52:44,764][25689] Fps is (10 sec: 5565.3, 60 sec: 5539.4, 300 sec: 5533.4). Total num frames: 928317440. Throughput: 0: 4998.3. Samples: 928313144. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:44,765][25689] Avg episode reward: [(0, '-0.316')] [2022-07-10 21:52:45,735][26022] Updated weights on worker 0-0, policy_version 906565 (0.00817) [2022-07-10 21:52:47,552][26022] Updated weights on worker 0-0, policy_version 906575 (0.00091) [2022-07-10 21:52:49,255][26022] Updated weights on worker 0-0, policy_version 906585 (0.00085) [2022-07-10 21:52:49,796][25689] Fps is (10 sec: 5665.7, 60 sec: 5537.4, 300 sec: 5533.0). Total num frames: 928346112. Throughput: 0: 5826.5. Samples: 928346566. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:49,796][25689] Avg episode reward: [(0, '-0.280')] [2022-07-10 21:52:51,315][26022] Updated weights on worker 0-0, policy_version 906595 (0.00088) [2022-07-10 21:52:52,899][26022] Updated weights on worker 0-0, policy_version 906605 (0.00083) [2022-07-10 21:52:54,858][25689] Fps is (10 sec: 5478.1, 60 sec: 5535.5, 300 sec: 5525.2). Total num frames: 928372736. Throughput: 0: 5791.6. Samples: 928379930. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:54,859][25689] Avg episode reward: [(0, '-1.150')] [2022-07-10 21:52:55,020][26022] Updated weights on worker 0-0, policy_version 906615 (0.00087) [2022-07-10 21:52:56,729][26022] Updated weights on worker 0-0, policy_version 906625 (0.00088) [2022-07-10 21:52:58,533][26022] Updated weights on worker 0-0, policy_version 906635 (0.00055) [2022-07-10 21:52:59,870][25689] Fps is (10 sec: 5387.5, 60 sec: 5521.5, 300 sec: 5539.7). Total num frames: 928400384. Throughput: 0: 4950.2. Samples: 928396482. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:52:59,870][25689] Avg episode reward: [(0, '-0.456')] [2022-07-10 21:53:00,535][26022] Updated weights on worker 0-0, policy_version 906645 (0.00077) [2022-07-10 21:53:02,028][26022] Updated weights on worker 0-0, policy_version 906655 (0.00085) [2022-07-10 21:53:04,670][26022] Updated weights on worker 0-0, policy_version 906665 (0.00093) [2022-07-10 21:53:04,976][25689] Fps is (10 sec: 5363.8, 60 sec: 5518.0, 300 sec: 5524.3). Total num frames: 928427008. Throughput: 0: 5664.9. Samples: 928427644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:04,977][25689] Avg episode reward: [(0, '-0.512')] [2022-07-10 21:53:06,251][26022] Updated weights on worker 0-0, policy_version 906675 (0.00085) [2022-07-10 21:53:08,181][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:53:08,194][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000906685_928445440.pth [2022-07-10 21:53:08,195][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000904739_926452736.pth [2022-07-10 21:53:08,198][26022] Updated weights on worker 0-0, policy_version 906685 (0.00094) [2022-07-10 21:53:09,999][25689] Fps is (10 sec: 5256.9, 60 sec: 5499.8, 300 sec: 5524.4). Total num frames: 928453632. Throughput: 0: 5653.6. Samples: 928460784. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:09,999][25689] Avg episode reward: [(0, '-0.255')] [2022-07-10 21:53:10,316][26022] Updated weights on worker 0-0, policy_version 906695 (0.00090) [2022-07-10 21:53:11,829][26022] Updated weights on worker 0-0, policy_version 906705 (0.00086) [2022-07-10 21:53:13,834][26022] Updated weights on worker 0-0, policy_version 906715 (0.00088) [2022-07-10 21:53:15,019][25689] Fps is (10 sec: 5506.3, 60 sec: 5516.0, 300 sec: 5531.4). Total num frames: 928482304. Throughput: 0: 4818.6. Samples: 928477076. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:15,019][25689] Avg episode reward: [(0, '0.020')] [2022-07-10 21:53:15,813][26022] Updated weights on worker 0-0, policy_version 906725 (0.00092) [2022-07-10 21:53:17,638][26022] Updated weights on worker 0-0, policy_version 906735 (0.00085) [2022-07-10 21:53:19,464][26022] Updated weights on worker 0-0, policy_version 906745 (0.00086) [2022-07-10 21:53:20,022][25689] Fps is (10 sec: 5619.2, 60 sec: 5482.2, 300 sec: 5522.2). Total num frames: 928509952. Throughput: 0: 5640.4. Samples: 928510146. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:20,022][25689] Avg episode reward: [(0, '0.622')] [2022-07-10 21:53:21,231][26022] Updated weights on worker 0-0, policy_version 906755 (0.00083) [2022-07-10 21:53:23,090][26022] Updated weights on worker 0-0, policy_version 906765 (0.00097) [2022-07-10 21:53:25,081][25689] Fps is (10 sec: 5495.4, 60 sec: 5504.5, 300 sec: 5529.0). Total num frames: 928537600. Throughput: 0: 5757.9. Samples: 928543404. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:25,082][25689] Avg episode reward: [(0, '0.515')] [2022-07-10 21:53:25,088][26022] Updated weights on worker 0-0, policy_version 906775 (0.00093) [2022-07-10 21:53:26,847][26022] Updated weights on worker 0-0, policy_version 906785 (0.00088) [2022-07-10 21:53:28,588][26022] Updated weights on worker 0-0, policy_version 906795 (0.00086) [2022-07-10 21:53:30,119][25689] Fps is (10 sec: 5476.3, 60 sec: 5485.3, 300 sec: 5521.8). Total num frames: 928565248. Throughput: 0: 4921.7. Samples: 928559808. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:30,121][25689] Avg episode reward: [(0, '0.667')] [2022-07-10 21:53:30,731][26022] Updated weights on worker 0-0, policy_version 906805 (0.00090) [2022-07-10 21:53:32,439][26022] Updated weights on worker 0-0, policy_version 906815 (0.00083) [2022-07-10 21:53:34,327][26022] Updated weights on worker 0-0, policy_version 906825 (0.00092) [2022-07-10 21:53:35,128][25689] Fps is (10 sec: 5503.5, 60 sec: 5485.4, 300 sec: 5521.9). Total num frames: 928592896. Throughput: 0: 5768.3. Samples: 928593074. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:35,129][25689] Avg episode reward: [(0, '1.266')] [2022-07-10 21:53:36,169][26022] Updated weights on worker 0-0, policy_version 906835 (0.00085) [2022-07-10 21:53:38,070][26022] Updated weights on worker 0-0, policy_version 906845 (0.00092) [2022-07-10 21:53:40,051][26022] Updated weights on worker 0-0, policy_version 906855 (0.00113) [2022-07-10 21:53:40,139][25689] Fps is (10 sec: 5416.1, 60 sec: 5470.2, 300 sec: 5520.3). Total num frames: 928619520. Throughput: 0: 5774.2. Samples: 928626310. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:40,141][25689] Avg episode reward: [(0, '0.694')] [2022-07-10 21:53:41,628][26022] Updated weights on worker 0-0, policy_version 906865 (0.00089) [2022-07-10 21:53:43,597][26022] Updated weights on worker 0-0, policy_version 906875 (0.00086) [2022-07-10 21:53:45,272][25689] Fps is (10 sec: 5552.0, 60 sec: 5483.2, 300 sec: 5521.9). Total num frames: 928649216. Throughput: 0: 4920.8. Samples: 928642762. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:45,273][25689] Avg episode reward: [(0, '0.918')] [2022-07-10 21:53:45,422][26022] Updated weights on worker 0-0, policy_version 906885 (0.00089) [2022-07-10 21:53:47,193][26022] Updated weights on worker 0-0, policy_version 906895 (0.00093) [2022-07-10 21:53:49,298][26022] Updated weights on worker 0-0, policy_version 906905 (0.00081) [2022-07-10 21:53:50,343][25689] Fps is (10 sec: 5620.1, 60 sec: 5462.7, 300 sec: 5525.7). Total num frames: 928676864. Throughput: 0: 5744.1. Samples: 928675976. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:50,344][25689] Avg episode reward: [(0, '0.454')] [2022-07-10 21:53:50,949][26022] Updated weights on worker 0-0, policy_version 906915 (0.00093) [2022-07-10 21:53:52,790][26022] Updated weights on worker 0-0, policy_version 906925 (0.00090) [2022-07-10 21:53:54,767][26022] Updated weights on worker 0-0, policy_version 906935 (0.00109) [2022-07-10 21:53:55,396][25689] Fps is (10 sec: 5563.3, 60 sec: 5497.4, 300 sec: 5519.2). Total num frames: 928705536. Throughput: 0: 5729.1. Samples: 928709188. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:53:55,398][25689] Avg episode reward: [(0, '0.369')] [2022-07-10 21:53:56,573][26022] Updated weights on worker 0-0, policy_version 906945 (0.00092) [2022-07-10 21:53:58,438][26022] Updated weights on worker 0-0, policy_version 906955 (0.00091) [2022-07-10 21:54:00,042][26022] Updated weights on worker 0-0, policy_version 906965 (0.00091) [2022-07-10 21:54:00,454][25689] Fps is (10 sec: 5468.8, 60 sec: 5476.3, 300 sec: 5533.0). Total num frames: 928732160. Throughput: 0: 5724.8. Samples: 928742608. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:00,455][25689] Avg episode reward: [(0, '-0.299')] [2022-07-10 21:54:02,614][26022] Updated weights on worker 0-0, policy_version 906975 (0.00079) [2022-07-10 21:54:04,466][26022] Updated weights on worker 0-0, policy_version 906985 (0.00088) [2022-07-10 21:54:05,576][25689] Fps is (10 sec: 5130.0, 60 sec: 5458.0, 300 sec: 5517.3). Total num frames: 928757760. Throughput: 0: 5631.9. Samples: 928757110. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:05,576][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 21:54:06,207][26022] Updated weights on worker 0-0, policy_version 906995 (0.00089) [2022-07-10 21:54:07,801][26022] Updated weights on worker 0-0, policy_version 907005 (0.00098) [2022-07-10 21:54:09,878][26022] Updated weights on worker 0-0, policy_version 907015 (0.00082) [2022-07-10 21:54:10,656][25689] Fps is (10 sec: 5520.7, 60 sec: 5520.4, 300 sec: 5524.2). Total num frames: 928788480. Throughput: 0: 5645.2. Samples: 928790646. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:10,656][25689] Avg episode reward: [(0, '-0.316')] [2022-07-10 21:54:11,640][26022] Updated weights on worker 0-0, policy_version 907025 (0.00091) [2022-07-10 21:54:13,462][26022] Updated weights on worker 0-0, policy_version 907035 (0.00085) [2022-07-10 21:54:15,560][26022] Updated weights on worker 0-0, policy_version 907045 (0.00090) [2022-07-10 21:54:15,660][25689] Fps is (10 sec: 5585.3, 60 sec: 5471.2, 300 sec: 5517.3). Total num frames: 928814080. Throughput: 0: 5652.0. Samples: 928823720. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:15,660][25689] Avg episode reward: [(0, '-0.463')] [2022-07-10 21:54:17,215][26022] Updated weights on worker 0-0, policy_version 907055 (0.00091) [2022-07-10 21:54:19,175][26022] Updated weights on worker 0-0, policy_version 907065 (0.00090) [2022-07-10 21:54:20,665][25689] Fps is (10 sec: 5421.9, 60 sec: 5487.8, 300 sec: 5519.8). Total num frames: 928842752. Throughput: 0: 4833.4. Samples: 928840298. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:20,666][25689] Avg episode reward: [(0, '-0.989')] [2022-07-10 21:54:21,113][26022] Updated weights on worker 0-0, policy_version 907075 (0.00084) [2022-07-10 21:54:22,822][26022] Updated weights on worker 0-0, policy_version 907085 (0.00090) [2022-07-10 21:54:24,628][26022] Updated weights on worker 0-0, policy_version 907095 (0.00091) [2022-07-10 21:54:25,717][25689] Fps is (10 sec: 5701.8, 60 sec: 5505.4, 300 sec: 5519.7). Total num frames: 928871424. Throughput: 0: 5781.5. Samples: 928873554. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:25,717][25689] Avg episode reward: [(0, '-0.191')] [2022-07-10 21:54:26,532][26022] Updated weights on worker 0-0, policy_version 907105 (0.00100) [2022-07-10 21:54:28,436][26022] Updated weights on worker 0-0, policy_version 907115 (0.00092) [2022-07-10 21:54:30,267][26022] Updated weights on worker 0-0, policy_version 907125 (0.00392) [2022-07-10 21:54:30,739][25689] Fps is (10 sec: 5489.3, 60 sec: 5489.9, 300 sec: 5516.0). Total num frames: 928898048. Throughput: 0: 5783.5. Samples: 928906796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:30,739][25689] Avg episode reward: [(0, '-0.073')] [2022-07-10 21:54:32,086][26022] Updated weights on worker 0-0, policy_version 907135 (0.00089) [2022-07-10 21:54:33,863][26022] Updated weights on worker 0-0, policy_version 907145 (0.00083) [2022-07-10 21:54:35,743][25689] Fps is (10 sec: 5413.1, 60 sec: 5490.4, 300 sec: 5512.6). Total num frames: 928925696. Throughput: 0: 4962.4. Samples: 928923380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:35,743][25689] Avg episode reward: [(0, '0.178')] [2022-07-10 21:54:35,835][26022] Updated weights on worker 0-0, policy_version 907155 (0.00079) [2022-07-10 21:54:37,564][26022] Updated weights on worker 0-0, policy_version 907165 (0.00086) [2022-07-10 21:54:39,419][26022] Updated weights on worker 0-0, policy_version 907175 (0.00088) [2022-07-10 21:54:40,824][25689] Fps is (10 sec: 5584.1, 60 sec: 5517.8, 300 sec: 5515.6). Total num frames: 928954368. Throughput: 0: 5786.1. Samples: 928956942. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:40,825][25689] Avg episode reward: [(0, '-0.115')] [2022-07-10 21:54:41,344][26022] Updated weights on worker 0-0, policy_version 907185 (0.00100) [2022-07-10 21:54:43,254][26022] Updated weights on worker 0-0, policy_version 907195 (0.00090) [2022-07-10 21:54:45,069][26022] Updated weights on worker 0-0, policy_version 907205 (0.00084) [2022-07-10 21:54:45,939][25689] Fps is (10 sec: 5523.8, 60 sec: 5485.7, 300 sec: 5517.3). Total num frames: 928982016. Throughput: 0: 5752.5. Samples: 928989880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:45,939][25689] Avg episode reward: [(0, '-0.327')] [2022-07-10 21:54:47,020][26022] Updated weights on worker 0-0, policy_version 907215 (0.00080) [2022-07-10 21:54:48,618][26022] Updated weights on worker 0-0, policy_version 907225 (0.00085) [2022-07-10 21:54:50,619][26022] Updated weights on worker 0-0, policy_version 907235 (0.00089) [2022-07-10 21:54:50,989][25689] Fps is (10 sec: 5540.9, 60 sec: 5504.5, 300 sec: 5516.4). Total num frames: 929010688. Throughput: 0: 4931.7. Samples: 929006664. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:50,989][25689] Avg episode reward: [(0, '-0.525')] [2022-07-10 21:54:52,242][26022] Updated weights on worker 0-0, policy_version 907245 (0.00092) [2022-07-10 21:54:54,247][26022] Updated weights on worker 0-0, policy_version 907255 (0.00085) [2022-07-10 21:54:56,019][25689] Fps is (10 sec: 5587.3, 60 sec: 5489.7, 300 sec: 5509.4). Total num frames: 929038336. Throughput: 0: 5760.6. Samples: 929040182. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:54:56,020][25689] Avg episode reward: [(0, '-0.888')] [2022-07-10 21:54:56,191][26022] Updated weights on worker 0-0, policy_version 907265 (0.00083) [2022-07-10 21:54:57,764][26022] Updated weights on worker 0-0, policy_version 907275 (0.00084) [2022-07-10 21:54:59,833][26022] Updated weights on worker 0-0, policy_version 907285 (0.00090) [2022-07-10 21:55:01,062][25689] Fps is (10 sec: 5591.3, 60 sec: 5524.8, 300 sec: 5523.6). Total num frames: 929067008. Throughput: 0: 5769.3. Samples: 929073698. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:01,062][25689] Avg episode reward: [(0, '-1.827')] [2022-07-10 21:55:01,479][26022] Updated weights on worker 0-0, policy_version 907295 (0.00085) [2022-07-10 21:55:04,077][26022] Updated weights on worker 0-0, policy_version 907305 (0.00087) [2022-07-10 21:55:05,695][26022] Updated weights on worker 0-0, policy_version 907315 (0.00096) [2022-07-10 21:55:06,123][25689] Fps is (10 sec: 5371.4, 60 sec: 5530.4, 300 sec: 5512.7). Total num frames: 929092608. Throughput: 0: 4868.8. Samples: 929088150. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:06,123][25689] Avg episode reward: [(0, '-1.344')] [2022-07-10 21:55:07,455][26022] Updated weights on worker 0-0, policy_version 907325 (0.00092) [2022-07-10 21:55:08,286][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:55:08,299][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000907329_929104896.pth [2022-07-10 21:55:08,299][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000905387_927116288.pth [2022-07-10 21:55:09,359][26022] Updated weights on worker 0-0, policy_version 907335 (0.00088) [2022-07-10 21:55:11,016][26022] Updated weights on worker 0-0, policy_version 907345 (0.00085) [2022-07-10 21:55:11,166][25689] Fps is (10 sec: 5371.1, 60 sec: 5499.8, 300 sec: 5515.4). Total num frames: 929121280. Throughput: 0: 5707.4. Samples: 929121824. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:11,167][25689] Avg episode reward: [(0, '-0.947')] [2022-07-10 21:55:12,956][26022] Updated weights on worker 0-0, policy_version 907355 (0.00089) [2022-07-10 21:55:14,744][26022] Updated weights on worker 0-0, policy_version 907365 (0.00094) [2022-07-10 21:55:16,175][25689] Fps is (10 sec: 5501.0, 60 sec: 5516.4, 300 sec: 5516.4). Total num frames: 929147904. Throughput: 0: 5700.5. Samples: 929155080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:16,175][25689] Avg episode reward: [(0, '-0.723')] [2022-07-10 21:55:16,744][26022] Updated weights on worker 0-0, policy_version 907375 (0.00087) [2022-07-10 21:55:18,636][26022] Updated weights on worker 0-0, policy_version 907385 (0.00091) [2022-07-10 21:55:20,335][26022] Updated weights on worker 0-0, policy_version 907395 (0.00093) [2022-07-10 21:55:21,199][25689] Fps is (10 sec: 5511.6, 60 sec: 5514.7, 300 sec: 5513.3). Total num frames: 929176576. Throughput: 0: 4862.4. Samples: 929171614. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:21,199][25689] Avg episode reward: [(0, '-0.171')] [2022-07-10 21:55:22,358][26022] Updated weights on worker 0-0, policy_version 907405 (0.00085) [2022-07-10 21:55:24,154][26022] Updated weights on worker 0-0, policy_version 907415 (0.00086) [2022-07-10 21:55:25,939][26022] Updated weights on worker 0-0, policy_version 907425 (0.00092) [2022-07-10 21:55:26,335][25689] Fps is (10 sec: 5543.1, 60 sec: 5490.1, 300 sec: 5510.8). Total num frames: 929204224. Throughput: 0: 5781.8. Samples: 929205014. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:26,335][25689] Avg episode reward: [(0, '0.289')] [2022-07-10 21:55:27,736][26022] Updated weights on worker 0-0, policy_version 907435 (0.00092) [2022-07-10 21:55:29,658][26022] Updated weights on worker 0-0, policy_version 907445 (0.00096) [2022-07-10 21:55:31,286][26022] Updated weights on worker 0-0, policy_version 907455 (0.00093) [2022-07-10 21:55:31,376][25689] Fps is (10 sec: 5634.6, 60 sec: 5539.0, 300 sec: 5517.0). Total num frames: 929233920. Throughput: 0: 5778.2. Samples: 929238600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:31,377][25689] Avg episode reward: [(0, '0.679')] [2022-07-10 21:55:33,268][26022] Updated weights on worker 0-0, policy_version 907465 (0.00088) [2022-07-10 21:55:35,278][26022] Updated weights on worker 0-0, policy_version 907475 (0.00082) [2022-07-10 21:55:36,387][25689] Fps is (10 sec: 5602.8, 60 sec: 5521.5, 300 sec: 5514.4). Total num frames: 929260544. Throughput: 0: 4957.9. Samples: 929255292. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:36,389][25689] Avg episode reward: [(0, '0.326')] [2022-07-10 21:55:36,886][26022] Updated weights on worker 0-0, policy_version 907485 (0.00086) [2022-07-10 21:55:38,931][26022] Updated weights on worker 0-0, policy_version 907495 (0.00083) [2022-07-10 21:55:40,608][26022] Updated weights on worker 0-0, policy_version 907505 (0.00079) [2022-07-10 21:55:41,446][25689] Fps is (10 sec: 5389.4, 60 sec: 5506.7, 300 sec: 5514.4). Total num frames: 929288192. Throughput: 0: 5777.1. Samples: 929288584. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 21:55:41,448][25689] Avg episode reward: [(0, '0.577')] [2022-07-10 21:55:42,508][26022] Updated weights on worker 0-0, policy_version 907515 (0.00097) [2022-07-10 21:55:44,395][26022] Updated weights on worker 0-0, policy_version 907525 (0.00095) [2022-07-10 21:55:46,028][26022] Updated weights on worker 0-0, policy_version 907535 (0.00091) [2022-07-10 21:55:46,594][25689] Fps is (10 sec: 5618.1, 60 sec: 5537.4, 300 sec: 5518.7). Total num frames: 929317888. Throughput: 0: 5763.2. Samples: 929321772. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:55:46,595][25689] Avg episode reward: [(0, '-0.134')] [2022-07-10 21:55:48,273][26022] Updated weights on worker 0-0, policy_version 907545 (0.00086) [2022-07-10 21:55:49,785][26022] Updated weights on worker 0-0, policy_version 907555 (0.00082) [2022-07-10 21:55:51,627][25689] Fps is (10 sec: 5532.2, 60 sec: 5505.2, 300 sec: 5514.9). Total num frames: 929344512. Throughput: 0: 4929.2. Samples: 929338420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:55:51,628][25689] Avg episode reward: [(0, '-0.405')] [2022-07-10 21:55:51,913][26022] Updated weights on worker 0-0, policy_version 907565 (0.00094) [2022-07-10 21:55:53,601][26022] Updated weights on worker 0-0, policy_version 907575 (0.00089) [2022-07-10 21:55:55,410][26022] Updated weights on worker 0-0, policy_version 907585 (0.00088) [2022-07-10 21:55:56,668][25689] Fps is (10 sec: 5488.8, 60 sec: 5521.0, 300 sec: 5511.0). Total num frames: 929373184. Throughput: 0: 5752.2. Samples: 929371954. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:55:56,670][25689] Avg episode reward: [(0, '-0.197')] [2022-07-10 21:55:57,554][26022] Updated weights on worker 0-0, policy_version 907595 (0.00088) [2022-07-10 21:55:59,119][26022] Updated weights on worker 0-0, policy_version 907605 (0.00089) [2022-07-10 21:56:01,104][26022] Updated weights on worker 0-0, policy_version 907615 (0.00086) [2022-07-10 21:56:01,745][25689] Fps is (10 sec: 5667.2, 60 sec: 5517.9, 300 sec: 5520.8). Total num frames: 929401856. Throughput: 0: 5763.1. Samples: 929405570. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:01,747][25689] Avg episode reward: [(0, '0.167')] [2022-07-10 21:56:03,255][26022] Updated weights on worker 0-0, policy_version 907625 (0.00087) [2022-07-10 21:56:04,995][26022] Updated weights on worker 0-0, policy_version 907635 (0.00107) [2022-07-10 21:56:06,850][25689] Fps is (10 sec: 5330.2, 60 sec: 5513.9, 300 sec: 5512.0). Total num frames: 929427456. Throughput: 0: 5679.5. Samples: 929436816. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:06,851][25689] Avg episode reward: [(0, '0.022')] [2022-07-10 21:56:06,892][26022] Updated weights on worker 0-0, policy_version 907645 (0.00084) [2022-07-10 21:56:08,757][26022] Updated weights on worker 0-0, policy_version 907655 (0.00086) [2022-07-10 21:56:10,638][26022] Updated weights on worker 0-0, policy_version 907665 (0.00092) [2022-07-10 21:56:11,863][25689] Fps is (10 sec: 5262.9, 60 sec: 5499.9, 300 sec: 5511.9). Total num frames: 929455104. Throughput: 0: 5694.0. Samples: 929453646. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:11,863][25689] Avg episode reward: [(0, '-0.850')] [2022-07-10 21:56:12,307][26022] Updated weights on worker 0-0, policy_version 907675 (0.00088) [2022-07-10 21:56:14,309][26022] Updated weights on worker 0-0, policy_version 907685 (0.00099) [2022-07-10 21:56:15,994][26022] Updated weights on worker 0-0, policy_version 907695 (0.00084) [2022-07-10 21:56:16,874][25689] Fps is (10 sec: 5618.9, 60 sec: 5533.4, 300 sec: 5518.8). Total num frames: 929483776. Throughput: 0: 5695.2. Samples: 929487026. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:16,874][25689] Avg episode reward: [(0, '-0.205')] [2022-07-10 21:56:18,012][26022] Updated weights on worker 0-0, policy_version 907705 (0.00090) [2022-07-10 21:56:19,764][26022] Updated weights on worker 0-0, policy_version 907715 (0.00095) [2022-07-10 21:56:21,759][26022] Updated weights on worker 0-0, policy_version 907725 (0.00089) [2022-07-10 21:56:21,885][25689] Fps is (10 sec: 5619.7, 60 sec: 5517.7, 300 sec: 5512.8). Total num frames: 929511424. Throughput: 0: 5717.3. Samples: 929520714. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:21,885][25689] Avg episode reward: [(0, '-0.664')] [2022-07-10 21:56:23,446][26022] Updated weights on worker 0-0, policy_version 907735 (0.00081) [2022-07-10 21:56:25,301][26022] Updated weights on worker 0-0, policy_version 907745 (0.00087) [2022-07-10 21:56:26,939][25689] Fps is (10 sec: 5493.7, 60 sec: 5525.2, 300 sec: 5508.9). Total num frames: 929539072. Throughput: 0: 5013.0. Samples: 929537520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:26,940][25689] Avg episode reward: [(0, '-0.823')] [2022-07-10 21:56:27,154][26022] Updated weights on worker 0-0, policy_version 907755 (0.00085) [2022-07-10 21:56:28,918][26022] Updated weights on worker 0-0, policy_version 907765 (0.00093) [2022-07-10 21:56:30,667][26022] Updated weights on worker 0-0, policy_version 907775 (0.00086) [2022-07-10 21:56:31,940][25689] Fps is (10 sec: 5600.8, 60 sec: 5511.9, 300 sec: 5512.8). Total num frames: 929567744. Throughput: 0: 5848.6. Samples: 929571072. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:31,941][25689] Avg episode reward: [(0, '-0.457')] [2022-07-10 21:56:32,609][26022] Updated weights on worker 0-0, policy_version 907785 (0.00091) [2022-07-10 21:56:34,399][26022] Updated weights on worker 0-0, policy_version 907795 (0.00085) [2022-07-10 21:56:36,404][26022] Updated weights on worker 0-0, policy_version 907805 (0.00090) [2022-07-10 21:56:36,968][25689] Fps is (10 sec: 5717.6, 60 sec: 5544.2, 300 sec: 5516.2). Total num frames: 929596416. Throughput: 0: 5860.7. Samples: 929604794. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:36,969][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 21:56:38,089][26022] Updated weights on worker 0-0, policy_version 907815 (0.00092) [2022-07-10 21:56:39,994][26022] Updated weights on worker 0-0, policy_version 907825 (0.00092) [2022-07-10 21:56:41,749][26022] Updated weights on worker 0-0, policy_version 907835 (0.00093) [2022-07-10 21:56:41,981][25689] Fps is (10 sec: 5609.2, 60 sec: 5548.4, 300 sec: 5514.5). Total num frames: 929624064. Throughput: 0: 5019.0. Samples: 929621578. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:41,982][25689] Avg episode reward: [(0, '0.346')] [2022-07-10 21:56:43,635][26022] Updated weights on worker 0-0, policy_version 907845 (0.00098) [2022-07-10 21:56:45,503][26022] Updated weights on worker 0-0, policy_version 907855 (0.00101) [2022-07-10 21:56:47,124][25689] Fps is (10 sec: 5444.7, 60 sec: 5515.0, 300 sec: 5508.6). Total num frames: 929651712. Throughput: 0: 5820.0. Samples: 929654998. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:47,125][25689] Avg episode reward: [(0, '-0.627')] [2022-07-10 21:56:47,479][26022] Updated weights on worker 0-0, policy_version 907865 (0.00093) [2022-07-10 21:56:49,112][26022] Updated weights on worker 0-0, policy_version 907875 (0.00089) [2022-07-10 21:56:51,065][26022] Updated weights on worker 0-0, policy_version 907885 (0.00099) [2022-07-10 21:56:52,126][25689] Fps is (10 sec: 5450.8, 60 sec: 5534.8, 300 sec: 5512.8). Total num frames: 929679360. Throughput: 0: 5818.5. Samples: 929688520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:52,127][25689] Avg episode reward: [(0, '0.233')] [2022-07-10 21:56:52,783][26022] Updated weights on worker 0-0, policy_version 907895 (0.00082) [2022-07-10 21:56:54,745][26022] Updated weights on worker 0-0, policy_version 907905 (0.00085) [2022-07-10 21:56:56,421][26022] Updated weights on worker 0-0, policy_version 907915 (0.00082) [2022-07-10 21:56:57,190][25689] Fps is (10 sec: 5595.3, 60 sec: 5532.7, 300 sec: 5512.4). Total num frames: 929708032. Throughput: 0: 4969.1. Samples: 929705276. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:56:57,191][25689] Avg episode reward: [(0, '0.506')] [2022-07-10 21:56:58,390][26022] Updated weights on worker 0-0, policy_version 907925 (0.00099) [2022-07-10 21:56:59,995][26022] Updated weights on worker 0-0, policy_version 907935 (0.00083) [2022-07-10 21:57:02,218][25689] Fps is (10 sec: 5377.7, 60 sec: 5486.4, 300 sec: 5509.7). Total num frames: 929733632. Throughput: 0: 5771.7. Samples: 929738378. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:02,220][25689] Avg episode reward: [(0, '0.171')] [2022-07-10 21:57:02,582][26022] Updated weights on worker 0-0, policy_version 907945 (0.00052) [2022-07-10 21:57:04,071][26022] Updated weights on worker 0-0, policy_version 907955 (0.00085) [2022-07-10 21:57:06,170][26022] Updated weights on worker 0-0, policy_version 907965 (0.00088) [2022-07-10 21:57:07,287][25689] Fps is (10 sec: 5476.5, 60 sec: 5557.4, 300 sec: 5515.5). Total num frames: 929763328. Throughput: 0: 5689.5. Samples: 929769712. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:07,288][25689] Avg episode reward: [(0, '-0.095')] [2022-07-10 21:57:07,996][26022] Updated weights on worker 0-0, policy_version 907975 (0.00098) [2022-07-10 21:57:08,436][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:57:08,447][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000907977_929768448.pth [2022-07-10 21:57:08,447][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000906037_927781888.pth [2022-07-10 21:57:09,839][26022] Updated weights on worker 0-0, policy_version 907985 (0.00097) [2022-07-10 21:57:11,738][26022] Updated weights on worker 0-0, policy_version 907995 (0.00080) [2022-07-10 21:57:12,327][25689] Fps is (10 sec: 5571.2, 60 sec: 5538.0, 300 sec: 5511.5). Total num frames: 929789952. Throughput: 0: 4842.0. Samples: 929786334. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:12,329][25689] Avg episode reward: [(0, '0.559')] [2022-07-10 21:57:13,504][26022] Updated weights on worker 0-0, policy_version 908005 (0.00082) [2022-07-10 21:57:15,376][26022] Updated weights on worker 0-0, policy_version 908015 (0.00097) [2022-07-10 21:57:17,176][26022] Updated weights on worker 0-0, policy_version 908025 (0.00091) [2022-07-10 21:57:17,331][25689] Fps is (10 sec: 5403.4, 60 sec: 5521.7, 300 sec: 5504.6). Total num frames: 929817600. Throughput: 0: 5677.0. Samples: 929819616. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:17,333][25689] Avg episode reward: [(0, '-0.244')] [2022-07-10 21:57:18,920][26022] Updated weights on worker 0-0, policy_version 908035 (0.00091) [2022-07-10 21:57:21,003][26022] Updated weights on worker 0-0, policy_version 908045 (0.00084) [2022-07-10 21:57:22,352][25689] Fps is (10 sec: 5618.1, 60 sec: 5537.7, 300 sec: 5513.3). Total num frames: 929846272. Throughput: 0: 5705.0. Samples: 929853242. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:22,354][25689] Avg episode reward: [(0, '0.014')] [2022-07-10 21:57:22,584][26022] Updated weights on worker 0-0, policy_version 908055 (0.00094) [2022-07-10 21:57:24,628][26022] Updated weights on worker 0-0, policy_version 908065 (0.00086) [2022-07-10 21:57:26,436][26022] Updated weights on worker 0-0, policy_version 908075 (0.00089) [2022-07-10 21:57:27,435][25689] Fps is (10 sec: 5472.6, 60 sec: 5518.1, 300 sec: 5505.1). Total num frames: 929872896. Throughput: 0: 4975.4. Samples: 929869958. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:27,436][25689] Avg episode reward: [(0, '-0.274')] [2022-07-10 21:57:28,323][26022] Updated weights on worker 0-0, policy_version 908085 (0.00080) [2022-07-10 21:57:30,225][26022] Updated weights on worker 0-0, policy_version 908095 (0.00089) [2022-07-10 21:57:31,823][26022] Updated weights on worker 0-0, policy_version 908105 (0.00085) [2022-07-10 21:57:32,472][25689] Fps is (10 sec: 5463.8, 60 sec: 5514.9, 300 sec: 5508.0). Total num frames: 929901568. Throughput: 0: 5787.2. Samples: 929902918. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:32,474][25689] Avg episode reward: [(0, '-0.169')] [2022-07-10 21:57:33,792][26022] Updated weights on worker 0-0, policy_version 908115 (0.00080) [2022-07-10 21:57:35,646][26022] Updated weights on worker 0-0, policy_version 908125 (0.00087) [2022-07-10 21:57:37,524][25689] Fps is (10 sec: 5582.3, 60 sec: 5495.8, 300 sec: 5507.6). Total num frames: 929929216. Throughput: 0: 5776.6. Samples: 929936264. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:37,525][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 21:57:37,588][26022] Updated weights on worker 0-0, policy_version 908135 (0.00089) [2022-07-10 21:57:39,510][26022] Updated weights on worker 0-0, policy_version 908145 (0.00091) [2022-07-10 21:57:41,167][26022] Updated weights on worker 0-0, policy_version 908155 (0.00098) [2022-07-10 21:57:42,550][25689] Fps is (10 sec: 5588.5, 60 sec: 5511.5, 300 sec: 5508.8). Total num frames: 929957888. Throughput: 0: 4939.5. Samples: 929953012. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:42,551][25689] Avg episode reward: [(0, '-0.666')] [2022-07-10 21:57:43,131][26022] Updated weights on worker 0-0, policy_version 908165 (0.00090) [2022-07-10 21:57:44,826][26022] Updated weights on worker 0-0, policy_version 908175 (0.00093) [2022-07-10 21:57:46,799][26022] Updated weights on worker 0-0, policy_version 908185 (0.00365) [2022-07-10 21:57:47,606][25689] Fps is (10 sec: 5586.1, 60 sec: 5519.4, 300 sec: 5504.9). Total num frames: 929985536. Throughput: 0: 5761.7. Samples: 929986178. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:47,607][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 21:57:48,514][26022] Updated weights on worker 0-0, policy_version 908195 (0.00087) [2022-07-10 21:57:50,557][26022] Updated weights on worker 0-0, policy_version 908205 (0.00087) [2022-07-10 21:57:52,175][26022] Updated weights on worker 0-0, policy_version 908215 (0.00100) [2022-07-10 21:57:52,657][25689] Fps is (10 sec: 5572.4, 60 sec: 5531.8, 300 sec: 5512.0). Total num frames: 930014208. Throughput: 0: 5795.9. Samples: 930019908. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:52,658][25689] Avg episode reward: [(0, '0.306')] [2022-07-10 21:57:54,172][26022] Updated weights on worker 0-0, policy_version 908225 (0.00088) [2022-07-10 21:57:55,950][26022] Updated weights on worker 0-0, policy_version 908235 (0.00097) [2022-07-10 21:57:57,712][25689] Fps is (10 sec: 5573.2, 60 sec: 5515.8, 300 sec: 5511.2). Total num frames: 930041856. Throughput: 0: 4965.4. Samples: 930036504. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:57:57,712][25689] Avg episode reward: [(0, '0.025')] [2022-07-10 21:57:57,805][26022] Updated weights on worker 0-0, policy_version 908245 (0.00089) [2022-07-10 21:57:59,725][26022] Updated weights on worker 0-0, policy_version 908255 (0.00093) [2022-07-10 21:58:01,855][26022] Updated weights on worker 0-0, policy_version 908265 (0.00092) [2022-07-10 21:58:02,787][25689] Fps is (10 sec: 5357.3, 60 sec: 5528.3, 300 sec: 5511.8). Total num frames: 930068480. Throughput: 0: 5773.6. Samples: 930069854. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:02,788][25689] Avg episode reward: [(0, '-0.139')] [2022-07-10 21:58:03,866][26022] Updated weights on worker 0-0, policy_version 908275 (0.00098) [2022-07-10 21:58:05,520][26022] Updated weights on worker 0-0, policy_version 908285 (0.00084) [2022-07-10 21:58:07,401][26022] Updated weights on worker 0-0, policy_version 908295 (0.00091) [2022-07-10 21:58:07,857][25689] Fps is (10 sec: 5450.5, 60 sec: 5511.4, 300 sec: 5517.8). Total num frames: 930097152. Throughput: 0: 5682.8. Samples: 930101258. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:07,857][25689] Avg episode reward: [(0, '0.169')] [2022-07-10 21:58:09,388][26022] Updated weights on worker 0-0, policy_version 908305 (0.00090) [2022-07-10 21:58:11,165][26022] Updated weights on worker 0-0, policy_version 908315 (0.00081) [2022-07-10 21:58:12,875][25689] Fps is (10 sec: 5380.0, 60 sec: 5496.4, 300 sec: 5507.5). Total num frames: 930122752. Throughput: 0: 4836.8. Samples: 930117700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:12,876][25689] Avg episode reward: [(0, '0.388')] [2022-07-10 21:58:13,035][26022] Updated weights on worker 0-0, policy_version 908325 (0.00083) [2022-07-10 21:58:14,885][26022] Updated weights on worker 0-0, policy_version 908335 (0.00085) [2022-07-10 21:58:16,553][26022] Updated weights on worker 0-0, policy_version 908345 (0.00086) [2022-07-10 21:58:17,913][25689] Fps is (10 sec: 5396.7, 60 sec: 5510.2, 300 sec: 5510.3). Total num frames: 930151424. Throughput: 0: 5665.6. Samples: 930150958. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:17,914][25689] Avg episode reward: [(0, '-0.057')] [2022-07-10 21:58:18,714][26022] Updated weights on worker 0-0, policy_version 908355 (0.00092) [2022-07-10 21:58:20,303][26022] Updated weights on worker 0-0, policy_version 908365 (0.00080) [2022-07-10 21:58:22,265][26022] Updated weights on worker 0-0, policy_version 908375 (0.00095) [2022-07-10 21:58:22,932][25689] Fps is (10 sec: 5701.9, 60 sec: 5510.4, 300 sec: 5514.5). Total num frames: 930180096. Throughput: 0: 5674.5. Samples: 930184166. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:22,933][25689] Avg episode reward: [(0, '-0.268')] [2022-07-10 21:58:24,198][26022] Updated weights on worker 0-0, policy_version 908385 (0.00084) [2022-07-10 21:58:25,865][26022] Updated weights on worker 0-0, policy_version 908395 (0.00093) [2022-07-10 21:58:27,913][26022] Updated weights on worker 0-0, policy_version 908405 (0.00088) [2022-07-10 21:58:28,000][25689] Fps is (10 sec: 5482.0, 60 sec: 5511.8, 300 sec: 5510.5). Total num frames: 930206720. Throughput: 0: 4930.7. Samples: 930200578. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:28,001][25689] Avg episode reward: [(0, '0.091')] [2022-07-10 21:58:29,646][26022] Updated weights on worker 0-0, policy_version 908415 (0.00091) [2022-07-10 21:58:31,567][26022] Updated weights on worker 0-0, policy_version 908425 (0.00058) [2022-07-10 21:58:33,028][25689] Fps is (10 sec: 5375.9, 60 sec: 5495.8, 300 sec: 5510.2). Total num frames: 930234368. Throughput: 0: 5749.4. Samples: 930233564. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:33,028][25689] Avg episode reward: [(0, '-0.048')] [2022-07-10 21:58:33,498][26022] Updated weights on worker 0-0, policy_version 908435 (0.00087) [2022-07-10 21:58:35,406][26022] Updated weights on worker 0-0, policy_version 908445 (0.00083) [2022-07-10 21:58:37,079][26022] Updated weights on worker 0-0, policy_version 908455 (0.00088) [2022-07-10 21:58:38,054][25689] Fps is (10 sec: 5499.9, 60 sec: 5498.1, 300 sec: 5513.3). Total num frames: 930262016. Throughput: 0: 5741.3. Samples: 930266594. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:38,056][25689] Avg episode reward: [(0, '-0.053')] [2022-07-10 21:58:38,921][26022] Updated weights on worker 0-0, policy_version 908465 (0.00085) [2022-07-10 21:58:40,880][26022] Updated weights on worker 0-0, policy_version 908475 (0.00090) [2022-07-10 21:58:42,791][26022] Updated weights on worker 0-0, policy_version 908485 (0.00601) [2022-07-10 21:58:43,057][25689] Fps is (10 sec: 5615.5, 60 sec: 5500.2, 300 sec: 5512.3). Total num frames: 930290688. Throughput: 0: 4922.8. Samples: 930283236. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:43,058][25689] Avg episode reward: [(0, '0.275')] [2022-07-10 21:58:44,708][26022] Updated weights on worker 0-0, policy_version 908495 (0.00089) [2022-07-10 21:58:46,400][26022] Updated weights on worker 0-0, policy_version 908505 (0.00079) [2022-07-10 21:58:48,111][25689] Fps is (10 sec: 5498.2, 60 sec: 5483.4, 300 sec: 5509.1). Total num frames: 930317312. Throughput: 0: 5748.3. Samples: 930316182. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:48,112][25689] Avg episode reward: [(0, '0.670')] [2022-07-10 21:58:48,274][26022] Updated weights on worker 0-0, policy_version 908515 (0.00095) [2022-07-10 21:58:50,126][26022] Updated weights on worker 0-0, policy_version 908525 (0.00058) [2022-07-10 21:58:51,946][26022] Updated weights on worker 0-0, policy_version 908535 (0.00084) [2022-07-10 21:58:53,147][25689] Fps is (10 sec: 5379.1, 60 sec: 5467.9, 300 sec: 5506.0). Total num frames: 930344960. Throughput: 0: 5747.2. Samples: 930349190. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:53,147][25689] Avg episode reward: [(0, '0.498')] [2022-07-10 21:58:54,151][26022] Updated weights on worker 0-0, policy_version 908545 (0.00085) [2022-07-10 21:58:55,611][26022] Updated weights on worker 0-0, policy_version 908555 (0.00086) [2022-07-10 21:58:57,645][26022] Updated weights on worker 0-0, policy_version 908565 (0.00094) [2022-07-10 21:58:58,168][25689] Fps is (10 sec: 5600.3, 60 sec: 5487.9, 300 sec: 5513.6). Total num frames: 930373632. Throughput: 0: 5784.6. Samples: 930382944. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:58:58,168][25689] Avg episode reward: [(0, '0.145')] [2022-07-10 21:58:59,418][26022] Updated weights on worker 0-0, policy_version 908575 (0.00091) [2022-07-10 21:59:01,390][26022] Updated weights on worker 0-0, policy_version 908585 (0.00091) [2022-07-10 21:59:03,195][25689] Fps is (10 sec: 5400.8, 60 sec: 5475.3, 300 sec: 5515.3). Total num frames: 930399232. Throughput: 0: 5779.5. Samples: 930399626. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:59:03,196][25689] Avg episode reward: [(0, '0.425')] [2022-07-10 21:59:03,476][26022] Updated weights on worker 0-0, policy_version 908595 (0.00079) [2022-07-10 21:59:05,274][26022] Updated weights on worker 0-0, policy_version 908605 (0.00090) [2022-07-10 21:59:07,177][26022] Updated weights on worker 0-0, policy_version 908615 (0.00091) [2022-07-10 21:59:08,302][25689] Fps is (10 sec: 5355.5, 60 sec: 5472.0, 300 sec: 5508.0). Total num frames: 930427904. Throughput: 0: 5697.2. Samples: 930431212. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:59:08,302][25689] Avg episode reward: [(0, '0.417')] [2022-07-10 21:59:08,640][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 21:59:08,649][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000908623_930429952.pth [2022-07-10 21:59:08,649][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000906685_928445440.pth [2022-07-10 21:59:09,167][26022] Updated weights on worker 0-0, policy_version 908625 (0.00093) [2022-07-10 21:59:10,792][26022] Updated weights on worker 0-0, policy_version 908635 (0.00086) [2022-07-10 21:59:12,892][26022] Updated weights on worker 0-0, policy_version 908645 (0.00092) [2022-07-10 21:59:13,383][25689] Fps is (10 sec: 5528.3, 60 sec: 5500.1, 300 sec: 5513.4). Total num frames: 930455552. Throughput: 0: 5693.0. Samples: 930464396. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:59:13,383][25689] Avg episode reward: [(0, '0.109')] [2022-07-10 21:59:14,622][26022] Updated weights on worker 0-0, policy_version 908655 (0.00096) [2022-07-10 21:59:16,450][26022] Updated weights on worker 0-0, policy_version 908665 (0.00092) [2022-07-10 21:59:18,196][26022] Updated weights on worker 0-0, policy_version 908675 (0.00089) [2022-07-10 21:59:18,396][25689] Fps is (10 sec: 5579.4, 60 sec: 5502.4, 300 sec: 5513.3). Total num frames: 930484224. Throughput: 0: 4854.8. Samples: 930481148. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:59:18,396][25689] Avg episode reward: [(0, '0.470')] [2022-07-10 21:59:20,100][26022] Updated weights on worker 0-0, policy_version 908685 (0.00087) [2022-07-10 21:59:22,027][26022] Updated weights on worker 0-0, policy_version 908695 (0.00093) [2022-07-10 21:59:23,488][25689] Fps is (10 sec: 5472.0, 60 sec: 5461.9, 300 sec: 5505.6). Total num frames: 930510848. Throughput: 0: 5658.9. Samples: 930514462. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:59:23,489][25689] Avg episode reward: [(0, '0.482')] [2022-07-10 21:59:23,859][26022] Updated weights on worker 0-0, policy_version 908705 (0.00086) [2022-07-10 21:59:25,752][26022] Updated weights on worker 0-0, policy_version 908715 (0.00092) [2022-07-10 21:59:27,460][26022] Updated weights on worker 0-0, policy_version 908725 (0.00509) [2022-07-10 21:59:28,614][25689] Fps is (10 sec: 5511.8, 60 sec: 5507.4, 300 sec: 5514.0). Total num frames: 930540544. Throughput: 0: 5740.6. Samples: 930547818. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-10 21:59:28,615][25689] Avg episode reward: [(0, '0.162')] [2022-07-10 21:59:29,316][26022] Updated weights on worker 0-0, policy_version 908735 (0.00090) [2022-07-10 21:59:31,169][26022] Updated weights on worker 0-0, policy_version 908745 (0.00085) [2022-07-10 21:59:33,103][26022] Updated weights on worker 0-0, policy_version 908755 (0.00087) [2022-07-10 21:59:33,701][25689] Fps is (10 sec: 5615.2, 60 sec: 5502.0, 300 sec: 5512.5). Total num frames: 930568192. Throughput: 0: 4928.3. Samples: 930564528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 21:59:33,701][25689] Avg episode reward: [(0, '-0.460')] [2022-07-10 21:59:34,862][26022] Updated weights on worker 0-0, policy_version 908765 (0.00090) [2022-07-10 21:59:36,843][26022] Updated weights on worker 0-0, policy_version 908775 (0.00089) [2022-07-10 21:59:38,495][26022] Updated weights on worker 0-0, policy_version 908785 (0.00077) [2022-07-10 21:59:38,710][25689] Fps is (10 sec: 5578.7, 60 sec: 5520.5, 300 sec: 5513.9). Total num frames: 930596864. Throughput: 0: 5749.3. Samples: 930597938. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 21:59:38,710][25689] Avg episode reward: [(0, '-0.212')] [2022-07-10 21:59:40,443][26022] Updated weights on worker 0-0, policy_version 908795 (0.00094) [2022-07-10 21:59:42,218][26022] Updated weights on worker 0-0, policy_version 908805 (0.00093) [2022-07-10 21:59:43,715][25689] Fps is (10 sec: 5624.0, 60 sec: 5503.4, 300 sec: 5515.9). Total num frames: 930624512. Throughput: 0: 5782.0. Samples: 930631412. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 21:59:43,715][25689] Avg episode reward: [(0, '-0.369')] [2022-07-10 21:59:44,044][26022] Updated weights on worker 0-0, policy_version 908815 (0.00088) [2022-07-10 21:59:45,918][26022] Updated weights on worker 0-0, policy_version 908825 (0.00090) [2022-07-10 21:59:47,847][26022] Updated weights on worker 0-0, policy_version 908835 (0.00085) [2022-07-10 21:59:48,777][25689] Fps is (10 sec: 5391.0, 60 sec: 5502.7, 300 sec: 5508.8). Total num frames: 930651136. Throughput: 0: 4964.0. Samples: 930647906. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 21:59:48,777][25689] Avg episode reward: [(0, '-0.735')] [2022-07-10 21:59:49,679][26022] Updated weights on worker 0-0, policy_version 908845 (0.00089) [2022-07-10 21:59:51,750][26022] Updated weights on worker 0-0, policy_version 908855 (0.00099) [2022-07-10 21:59:53,185][26022] Updated weights on worker 0-0, policy_version 908865 (0.00084) [2022-07-10 21:59:53,813][25689] Fps is (10 sec: 5577.5, 60 sec: 5536.4, 300 sec: 5515.5). Total num frames: 930680832. Throughput: 0: 5799.8. Samples: 930681174. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 21:59:53,814][25689] Avg episode reward: [(0, '-0.749')] [2022-07-10 21:59:55,413][26022] Updated weights on worker 0-0, policy_version 908875 (0.00087) [2022-07-10 21:59:57,011][26022] Updated weights on worker 0-0, policy_version 908885 (0.00093) [2022-07-10 21:59:58,808][26022] Updated weights on worker 0-0, policy_version 908895 (0.00087) [2022-07-10 21:59:58,874][25689] Fps is (10 sec: 5679.3, 60 sec: 5515.9, 300 sec: 5511.8). Total num frames: 930708480. Throughput: 0: 5806.6. Samples: 930715024. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 21:59:58,874][25689] Avg episode reward: [(0, '0.094')] [2022-07-10 22:00:00,591][26022] Updated weights on worker 0-0, policy_version 908905 (0.00081) [2022-07-10 22:00:02,942][26022] Updated weights on worker 0-0, policy_version 908915 (0.00092) [2022-07-10 22:00:03,932][25689] Fps is (10 sec: 5363.2, 60 sec: 5530.0, 300 sec: 5515.3). Total num frames: 930735104. Throughput: 0: 4964.4. Samples: 930731778. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:03,932][25689] Avg episode reward: [(0, '0.238')] [2022-07-10 22:00:04,564][26022] Updated weights on worker 0-0, policy_version 908925 (0.00089) [2022-07-10 22:00:06,445][26022] Updated weights on worker 0-0, policy_version 908935 (0.00086) [2022-07-10 22:00:08,262][26022] Updated weights on worker 0-0, policy_version 908945 (0.00056) [2022-07-10 22:00:09,018][25689] Fps is (10 sec: 5349.9, 60 sec: 5514.9, 300 sec: 5511.0). Total num frames: 930762752. Throughput: 0: 5705.3. Samples: 930763390. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:09,019][25689] Avg episode reward: [(0, '-0.057')] [2022-07-10 22:00:10,228][26022] Updated weights on worker 0-0, policy_version 908955 (0.00087) [2022-07-10 22:00:11,919][26022] Updated weights on worker 0-0, policy_version 908965 (0.00105) [2022-07-10 22:00:13,875][26022] Updated weights on worker 0-0, policy_version 908975 (0.00085) [2022-07-10 22:00:14,052][25689] Fps is (10 sec: 5564.9, 60 sec: 5536.1, 300 sec: 5517.4). Total num frames: 930791424. Throughput: 0: 5714.5. Samples: 930796834. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:14,053][25689] Avg episode reward: [(0, '-0.180')] [2022-07-10 22:00:15,322][26022] Updated weights on worker 0-0, policy_version 908985 (0.00095) [2022-07-10 22:00:17,503][26022] Updated weights on worker 0-0, policy_version 908995 (0.00087) [2022-07-10 22:00:19,075][25689] Fps is (10 sec: 5600.5, 60 sec: 5518.4, 300 sec: 5514.0). Total num frames: 930819072. Throughput: 0: 4884.6. Samples: 930813696. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:19,075][25689] Avg episode reward: [(0, '0.096')] [2022-07-10 22:00:19,438][26022] Updated weights on worker 0-0, policy_version 909005 (0.00090) [2022-07-10 22:00:21,155][26022] Updated weights on worker 0-0, policy_version 909015 (0.00087) [2022-07-10 22:00:23,317][26022] Updated weights on worker 0-0, policy_version 909025 (0.00090) [2022-07-10 22:00:24,122][25689] Fps is (10 sec: 5389.6, 60 sec: 5522.5, 300 sec: 5512.2). Total num frames: 930845696. Throughput: 0: 5690.5. Samples: 930846670. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:24,123][25689] Avg episode reward: [(0, '0.203')] [2022-07-10 22:00:24,718][26022] Updated weights on worker 0-0, policy_version 909035 (0.00090) [2022-07-10 22:00:26,785][26022] Updated weights on worker 0-0, policy_version 909045 (0.00084) [2022-07-10 22:00:28,582][26022] Updated weights on worker 0-0, policy_version 909055 (0.00085) [2022-07-10 22:00:29,188][25689] Fps is (10 sec: 5568.8, 60 sec: 5527.9, 300 sec: 5511.8). Total num frames: 930875392. Throughput: 0: 5775.7. Samples: 930879884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:29,188][25689] Avg episode reward: [(0, '0.204')] [2022-07-10 22:00:30,546][26022] Updated weights on worker 0-0, policy_version 909065 (0.00088) [2022-07-10 22:00:32,311][26022] Updated weights on worker 0-0, policy_version 909075 (0.00082) [2022-07-10 22:00:34,187][26022] Updated weights on worker 0-0, policy_version 909085 (0.00081) [2022-07-10 22:00:34,281][25689] Fps is (10 sec: 5644.5, 60 sec: 5527.3, 300 sec: 5513.7). Total num frames: 930903040. Throughput: 0: 4933.5. Samples: 930896634. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:34,282][25689] Avg episode reward: [(0, '0.414')] [2022-07-10 22:00:35,972][26022] Updated weights on worker 0-0, policy_version 909095 (0.00088) [2022-07-10 22:00:37,995][26022] Updated weights on worker 0-0, policy_version 909105 (0.00096) [2022-07-10 22:00:39,323][25689] Fps is (10 sec: 5455.9, 60 sec: 5507.4, 300 sec: 5514.0). Total num frames: 930930688. Throughput: 0: 5734.3. Samples: 930929806. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:39,324][25689] Avg episode reward: [(0, '0.345')] [2022-07-10 22:00:39,809][26022] Updated weights on worker 0-0, policy_version 909115 (0.00090) [2022-07-10 22:00:41,786][26022] Updated weights on worker 0-0, policy_version 909125 (0.00087) [2022-07-10 22:00:43,403][26022] Updated weights on worker 0-0, policy_version 909135 (0.00089) [2022-07-10 22:00:44,376][25689] Fps is (10 sec: 5579.2, 60 sec: 5520.0, 300 sec: 5512.3). Total num frames: 930959360. Throughput: 0: 5751.5. Samples: 930963160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:44,377][25689] Avg episode reward: [(0, '-0.633')] [2022-07-10 22:00:45,299][26022] Updated weights on worker 0-0, policy_version 909145 (0.00080) [2022-07-10 22:00:47,103][26022] Updated weights on worker 0-0, policy_version 909155 (0.00090) [2022-07-10 22:00:49,035][26022] Updated weights on worker 0-0, policy_version 909165 (0.00093) [2022-07-10 22:00:49,414][25689] Fps is (10 sec: 5479.7, 60 sec: 5522.1, 300 sec: 5512.2). Total num frames: 930985984. Throughput: 0: 4947.5. Samples: 930979952. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:49,415][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 22:00:50,926][26022] Updated weights on worker 0-0, policy_version 909175 (0.00087) [2022-07-10 22:00:52,764][26022] Updated weights on worker 0-0, policy_version 909185 (0.00096) [2022-07-10 22:00:54,419][25689] Fps is (10 sec: 5505.8, 60 sec: 5508.0, 300 sec: 5512.9). Total num frames: 931014656. Throughput: 0: 5788.7. Samples: 931013206. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:54,421][25689] Avg episode reward: [(0, '-0.452')] [2022-07-10 22:00:54,554][26022] Updated weights on worker 0-0, policy_version 909195 (0.00087) [2022-07-10 22:00:56,359][26022] Updated weights on worker 0-0, policy_version 909205 (0.00088) [2022-07-10 22:00:58,244][26022] Updated weights on worker 0-0, policy_version 909215 (0.00091) [2022-07-10 22:00:59,443][25689] Fps is (10 sec: 5718.2, 60 sec: 5528.4, 300 sec: 5513.9). Total num frames: 931043328. Throughput: 0: 5799.3. Samples: 931046484. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:00:59,445][25689] Avg episode reward: [(0, '-1.158')] [2022-07-10 22:01:00,156][26022] Updated weights on worker 0-0, policy_version 909225 (0.00086) [2022-07-10 22:01:02,402][26022] Updated weights on worker 0-0, policy_version 909235 (0.00087) [2022-07-10 22:01:04,208][26022] Updated weights on worker 0-0, policy_version 909245 (0.00088) [2022-07-10 22:01:04,451][25689] Fps is (10 sec: 5410.0, 60 sec: 5516.0, 300 sec: 5515.7). Total num frames: 931068928. Throughput: 0: 5693.1. Samples: 931077450. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:04,452][25689] Avg episode reward: [(0, '-1.017')] [2022-07-10 22:01:06,069][26022] Updated weights on worker 0-0, policy_version 909255 (0.00091) [2022-07-10 22:01:07,850][26022] Updated weights on worker 0-0, policy_version 909265 (0.00145) [2022-07-10 22:01:08,659][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:01:08,674][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000909269_931091456.pth [2022-07-10 22:01:08,675][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000907329_929104896.pth [2022-07-10 22:01:09,554][25689] Fps is (10 sec: 5164.9, 60 sec: 5497.6, 300 sec: 5510.6). Total num frames: 931095552. Throughput: 0: 5672.6. Samples: 931094196. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:09,556][25689] Avg episode reward: [(0, '-0.727')] [2022-07-10 22:01:09,716][26022] Updated weights on worker 0-0, policy_version 909275 (0.00107) [2022-07-10 22:01:11,670][26022] Updated weights on worker 0-0, policy_version 909285 (0.00094) [2022-07-10 22:01:13,383][26022] Updated weights on worker 0-0, policy_version 909295 (0.00093) [2022-07-10 22:01:14,563][25689] Fps is (10 sec: 5367.2, 60 sec: 5483.0, 300 sec: 5507.2). Total num frames: 931123200. Throughput: 0: 5684.6. Samples: 931127714. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:14,563][25689] Avg episode reward: [(0, '-0.338')] [2022-07-10 22:01:15,215][26022] Updated weights on worker 0-0, policy_version 909305 (0.00090) [2022-07-10 22:01:17,024][26022] Updated weights on worker 0-0, policy_version 909315 (0.00086) [2022-07-10 22:01:18,868][26022] Updated weights on worker 0-0, policy_version 909325 (0.00087) [2022-07-10 22:01:19,594][25689] Fps is (10 sec: 5609.3, 60 sec: 5499.0, 300 sec: 5510.2). Total num frames: 931151872. Throughput: 0: 5703.0. Samples: 931161410. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:19,595][25689] Avg episode reward: [(0, '-0.216')] [2022-07-10 22:01:20,900][26022] Updated weights on worker 0-0, policy_version 909335 (0.00095) [2022-07-10 22:01:22,557][26022] Updated weights on worker 0-0, policy_version 909345 (0.00086) [2022-07-10 22:01:24,251][26022] Updated weights on worker 0-0, policy_version 909355 (0.00083) [2022-07-10 22:01:24,603][25689] Fps is (10 sec: 5609.4, 60 sec: 5519.5, 300 sec: 5511.1). Total num frames: 931179520. Throughput: 0: 4990.3. Samples: 931178014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:24,604][25689] Avg episode reward: [(0, '-0.478')] [2022-07-10 22:01:26,397][26022] Updated weights on worker 0-0, policy_version 909365 (0.00091) [2022-07-10 22:01:28,172][26022] Updated weights on worker 0-0, policy_version 909375 (0.00088) [2022-07-10 22:01:29,641][25689] Fps is (10 sec: 5605.9, 60 sec: 5505.1, 300 sec: 5510.4). Total num frames: 931208192. Throughput: 0: 5841.0. Samples: 931211524. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:29,642][25689] Avg episode reward: [(0, '-0.349')] [2022-07-10 22:01:29,951][26022] Updated weights on worker 0-0, policy_version 909385 (0.00088) [2022-07-10 22:01:31,750][26022] Updated weights on worker 0-0, policy_version 909395 (0.00089) [2022-07-10 22:01:33,674][26022] Updated weights on worker 0-0, policy_version 909405 (0.00091) [2022-07-10 22:01:34,671][25689] Fps is (10 sec: 5594.2, 60 sec: 5510.9, 300 sec: 5506.9). Total num frames: 931235840. Throughput: 0: 5837.3. Samples: 931245090. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:34,671][25689] Avg episode reward: [(0, '0.193')] [2022-07-10 22:01:35,343][26022] Updated weights on worker 0-0, policy_version 909415 (0.00091) [2022-07-10 22:01:37,519][26022] Updated weights on worker 0-0, policy_version 909425 (0.00087) [2022-07-10 22:01:39,059][26022] Updated weights on worker 0-0, policy_version 909435 (0.00087) [2022-07-10 22:01:39,678][25689] Fps is (10 sec: 5509.3, 60 sec: 5514.1, 300 sec: 5507.0). Total num frames: 931263488. Throughput: 0: 4994.7. Samples: 931261720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:39,681][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 22:01:41,028][26022] Updated weights on worker 0-0, policy_version 909445 (0.00095) [2022-07-10 22:01:42,765][26022] Updated weights on worker 0-0, policy_version 909455 (0.00091) [2022-07-10 22:01:44,507][26022] Updated weights on worker 0-0, policy_version 909465 (0.00087) [2022-07-10 22:01:44,682][25689] Fps is (10 sec: 5625.5, 60 sec: 5518.5, 300 sec: 5513.0). Total num frames: 931292160. Throughput: 0: 5845.3. Samples: 931295382. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:44,683][25689] Avg episode reward: [(0, '-0.094')] [2022-07-10 22:01:46,467][26022] Updated weights on worker 0-0, policy_version 909475 (0.00923) [2022-07-10 22:01:48,216][26022] Updated weights on worker 0-0, policy_version 909485 (0.00086) [2022-07-10 22:01:49,793][25689] Fps is (10 sec: 5770.5, 60 sec: 5562.8, 300 sec: 5517.9). Total num frames: 931321856. Throughput: 0: 5838.8. Samples: 931329186. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:49,795][25689] Avg episode reward: [(0, '0.176')] [2022-07-10 22:01:50,011][26022] Updated weights on worker 0-0, policy_version 909495 (0.00085) [2022-07-10 22:01:51,875][26022] Updated weights on worker 0-0, policy_version 909505 (0.00093) [2022-07-10 22:01:53,795][26022] Updated weights on worker 0-0, policy_version 909515 (0.00090) [2022-07-10 22:01:54,806][25689] Fps is (10 sec: 5563.1, 60 sec: 5528.1, 300 sec: 5512.0). Total num frames: 931348480. Throughput: 0: 4993.7. Samples: 931345640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:54,807][25689] Avg episode reward: [(0, '-0.064')] [2022-07-10 22:01:55,590][26022] Updated weights on worker 0-0, policy_version 909525 (0.00088) [2022-07-10 22:01:57,549][26022] Updated weights on worker 0-0, policy_version 909535 (0.00112) [2022-07-10 22:01:59,272][26022] Updated weights on worker 0-0, policy_version 909545 (0.00093) [2022-07-10 22:01:59,840][25689] Fps is (10 sec: 5503.3, 60 sec: 5527.1, 300 sec: 5522.2). Total num frames: 931377152. Throughput: 0: 5818.0. Samples: 931379026. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:01:59,841][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 22:02:01,244][26022] Updated weights on worker 0-0, policy_version 909555 (0.00089) [2022-07-10 22:02:03,124][26022] Updated weights on worker 0-0, policy_version 909565 (0.00089) [2022-07-10 22:02:04,851][25689] Fps is (10 sec: 5402.9, 60 sec: 5526.9, 300 sec: 5509.5). Total num frames: 931402752. Throughput: 0: 5703.7. Samples: 931410420. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:04,852][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 22:02:05,309][26022] Updated weights on worker 0-0, policy_version 909575 (0.00089) [2022-07-10 22:02:07,080][26022] Updated weights on worker 0-0, policy_version 909585 (0.00096) [2022-07-10 22:02:08,989][26022] Updated weights on worker 0-0, policy_version 909595 (0.00088) [2022-07-10 22:02:09,900][25689] Fps is (10 sec: 5191.6, 60 sec: 5531.9, 300 sec: 5509.3). Total num frames: 931429376. Throughput: 0: 4866.2. Samples: 931427032. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:09,900][25689] Avg episode reward: [(0, '-0.162')] [2022-07-10 22:02:10,779][26022] Updated weights on worker 0-0, policy_version 909605 (0.00094) [2022-07-10 22:02:12,453][26022] Updated weights on worker 0-0, policy_version 909615 (0.00095) [2022-07-10 22:02:14,470][26022] Updated weights on worker 0-0, policy_version 909625 (0.00098) [2022-07-10 22:02:14,929][25689] Fps is (10 sec: 5486.9, 60 sec: 5547.0, 300 sec: 5512.3). Total num frames: 931458048. Throughput: 0: 5713.3. Samples: 931460608. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:14,930][25689] Avg episode reward: [(0, '-1.508')] [2022-07-10 22:02:16,424][26022] Updated weights on worker 0-0, policy_version 909635 (0.00090) [2022-07-10 22:02:18,069][26022] Updated weights on worker 0-0, policy_version 909645 (0.00086) [2022-07-10 22:02:19,961][25689] Fps is (10 sec: 5597.7, 60 sec: 5529.9, 300 sec: 5508.6). Total num frames: 931485696. Throughput: 0: 5716.2. Samples: 931494040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:19,962][25689] Avg episode reward: [(0, '-2.090')] [2022-07-10 22:02:20,054][26022] Updated weights on worker 0-0, policy_version 909655 (0.00086) [2022-07-10 22:02:21,879][26022] Updated weights on worker 0-0, policy_version 909665 (0.00085) [2022-07-10 22:02:23,764][26022] Updated weights on worker 0-0, policy_version 909675 (0.00090) [2022-07-10 22:02:24,988][25689] Fps is (10 sec: 5497.2, 60 sec: 5528.3, 300 sec: 5513.1). Total num frames: 931513344. Throughput: 0: 4979.4. Samples: 931510688. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:24,990][25689] Avg episode reward: [(0, '-1.930')] [2022-07-10 22:02:25,564][26022] Updated weights on worker 0-0, policy_version 909685 (0.00082) [2022-07-10 22:02:27,349][26022] Updated weights on worker 0-0, policy_version 909695 (0.00089) [2022-07-10 22:02:29,315][26022] Updated weights on worker 0-0, policy_version 909705 (0.00088) [2022-07-10 22:02:30,053][25689] Fps is (10 sec: 5580.6, 60 sec: 5525.8, 300 sec: 5512.6). Total num frames: 931542016. Throughput: 0: 5794.9. Samples: 931543820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:30,054][25689] Avg episode reward: [(0, '-2.135')] [2022-07-10 22:02:31,191][26022] Updated weights on worker 0-0, policy_version 909715 (0.00089) [2022-07-10 22:02:32,932][26022] Updated weights on worker 0-0, policy_version 909725 (0.00053) [2022-07-10 22:02:34,735][26022] Updated weights on worker 0-0, policy_version 909735 (0.00090) [2022-07-10 22:02:35,082][25689] Fps is (10 sec: 5680.9, 60 sec: 5542.8, 300 sec: 5516.5). Total num frames: 931570688. Throughput: 0: 5789.8. Samples: 931577292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:35,083][25689] Avg episode reward: [(0, '-2.047')] [2022-07-10 22:02:36,754][26022] Updated weights on worker 0-0, policy_version 909745 (0.00089) [2022-07-10 22:02:38,459][26022] Updated weights on worker 0-0, policy_version 909755 (0.00094) [2022-07-10 22:02:40,106][25689] Fps is (10 sec: 5398.7, 60 sec: 5507.4, 300 sec: 5506.2). Total num frames: 931596288. Throughput: 0: 4956.1. Samples: 931593880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:40,106][25689] Avg episode reward: [(0, '-1.902')] [2022-07-10 22:02:40,484][26022] Updated weights on worker 0-0, policy_version 909765 (0.00088) [2022-07-10 22:02:41,840][26022] Updated weights on worker 0-0, policy_version 909775 (0.00083) [2022-07-10 22:02:44,064][26022] Updated weights on worker 0-0, policy_version 909785 (0.00093) [2022-07-10 22:02:45,135][25689] Fps is (10 sec: 5602.1, 60 sec: 5539.0, 300 sec: 5517.0). Total num frames: 931627008. Throughput: 0: 5794.0. Samples: 931627424. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:45,136][25689] Avg episode reward: [(0, '-0.660')] [2022-07-10 22:02:45,622][26022] Updated weights on worker 0-0, policy_version 909795 (0.00086) [2022-07-10 22:02:47,745][26022] Updated weights on worker 0-0, policy_version 909805 (0.01132) [2022-07-10 22:02:49,350][26022] Updated weights on worker 0-0, policy_version 909815 (0.00093) [2022-07-10 22:02:50,239][25689] Fps is (10 sec: 5659.0, 60 sec: 5488.8, 300 sec: 5509.1). Total num frames: 931653632. Throughput: 0: 5807.8. Samples: 931661056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:50,240][25689] Avg episode reward: [(0, '-0.245')] [2022-07-10 22:02:51,355][26022] Updated weights on worker 0-0, policy_version 909825 (0.00083) [2022-07-10 22:02:52,944][26022] Updated weights on worker 0-0, policy_version 909835 (0.00093) [2022-07-10 22:02:55,091][26022] Updated weights on worker 0-0, policy_version 909845 (0.00087) [2022-07-10 22:02:55,306][25689] Fps is (10 sec: 5436.7, 60 sec: 5517.7, 300 sec: 5512.4). Total num frames: 931682304. Throughput: 0: 4954.5. Samples: 931677494. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:02:55,307][25689] Avg episode reward: [(0, '-0.320')] [2022-07-10 22:02:56,714][26022] Updated weights on worker 0-0, policy_version 909855 (0.00086) [2022-07-10 22:02:58,953][26022] Updated weights on worker 0-0, policy_version 909865 (0.00084) [2022-07-10 22:03:00,320][25689] Fps is (10 sec: 5688.2, 60 sec: 5519.6, 300 sec: 5520.4). Total num frames: 931710976. Throughput: 0: 5794.4. Samples: 931711010. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:03:00,321][25689] Avg episode reward: [(0, '0.407')] [2022-07-10 22:03:00,495][26022] Updated weights on worker 0-0, policy_version 909875 (0.00098) [2022-07-10 22:03:02,925][26022] Updated weights on worker 0-0, policy_version 909885 (0.00091) [2022-07-10 22:03:04,555][26022] Updated weights on worker 0-0, policy_version 909895 (0.00094) [2022-07-10 22:03:05,407][25689] Fps is (10 sec: 5271.7, 60 sec: 5495.7, 300 sec: 5506.3). Total num frames: 931735552. Throughput: 0: 5658.8. Samples: 931742136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:03:05,408][25689] Avg episode reward: [(0, '0.480')] [2022-07-10 22:03:06,647][26022] Updated weights on worker 0-0, policy_version 909905 (0.00092) [2022-07-10 22:03:08,340][26022] Updated weights on worker 0-0, policy_version 909915 (0.00091) [2022-07-10 22:03:08,870][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:03:08,882][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000909917_931755008.pth [2022-07-10 22:03:08,882][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000907977_929768448.pth [2022-07-10 22:03:10,193][26022] Updated weights on worker 0-0, policy_version 909925 (0.00082) [2022-07-10 22:03:10,470][25689] Fps is (10 sec: 5246.2, 60 sec: 5528.3, 300 sec: 5515.8). Total num frames: 931764224. Throughput: 0: 4829.0. Samples: 931758752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:03:10,471][25689] Avg episode reward: [(0, '-0.068')] [2022-07-10 22:03:12,077][26022] Updated weights on worker 0-0, policy_version 909935 (0.00088) [2022-07-10 22:03:13,679][26022] Updated weights on worker 0-0, policy_version 909945 (0.00078) [2022-07-10 22:03:15,481][25689] Fps is (10 sec: 5590.7, 60 sec: 5513.0, 300 sec: 5512.9). Total num frames: 931791872. Throughput: 0: 5683.5. Samples: 931792156. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-10 22:03:15,481][25689] Avg episode reward: [(0, '0.098')] [2022-07-10 22:03:15,720][26022] Updated weights on worker 0-0, policy_version 909955 (0.00090) [2022-07-10 22:03:17,548][26022] Updated weights on worker 0-0, policy_version 909965 (0.00094) [2022-07-10 22:03:19,508][26022] Updated weights on worker 0-0, policy_version 909975 (0.00087) [2022-07-10 22:03:20,521][25689] Fps is (10 sec: 5603.4, 60 sec: 5529.2, 300 sec: 5512.5). Total num frames: 931820544. Throughput: 0: 5668.2. Samples: 931825512. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:20,521][25689] Avg episode reward: [(0, '0.008')] [2022-07-10 22:03:21,318][26022] Updated weights on worker 0-0, policy_version 909985 (0.00106) [2022-07-10 22:03:23,188][26022] Updated weights on worker 0-0, policy_version 909995 (0.00091) [2022-07-10 22:03:25,234][26022] Updated weights on worker 0-0, policy_version 910005 (0.00084) [2022-07-10 22:03:25,531][25689] Fps is (10 sec: 5502.2, 60 sec: 5513.8, 300 sec: 5513.5). Total num frames: 931847168. Throughput: 0: 4947.6. Samples: 931841700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:25,531][25689] Avg episode reward: [(0, '0.493')] [2022-07-10 22:03:26,740][26022] Updated weights on worker 0-0, policy_version 910015 (0.00087) [2022-07-10 22:03:28,899][26022] Updated weights on worker 0-0, policy_version 910025 (0.00088) [2022-07-10 22:03:30,561][25689] Fps is (10 sec: 5507.4, 60 sec: 5517.0, 300 sec: 5516.9). Total num frames: 931875840. Throughput: 0: 5781.8. Samples: 931874916. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:30,562][25689] Avg episode reward: [(0, '0.632')] [2022-07-10 22:03:30,573][26022] Updated weights on worker 0-0, policy_version 910035 (0.00091) [2022-07-10 22:03:32,587][26022] Updated weights on worker 0-0, policy_version 910045 (0.00099) [2022-07-10 22:03:34,175][26022] Updated weights on worker 0-0, policy_version 910055 (0.00090) [2022-07-10 22:03:35,575][25689] Fps is (10 sec: 5505.4, 60 sec: 5484.6, 300 sec: 5513.7). Total num frames: 931902464. Throughput: 0: 5773.0. Samples: 931908158. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:35,575][25689] Avg episode reward: [(0, '0.845')] [2022-07-10 22:03:36,167][26022] Updated weights on worker 0-0, policy_version 910065 (0.00086) [2022-07-10 22:03:38,107][26022] Updated weights on worker 0-0, policy_version 910075 (0.00092) [2022-07-10 22:03:39,810][26022] Updated weights on worker 0-0, policy_version 910085 (0.00100) [2022-07-10 22:03:40,610][25689] Fps is (10 sec: 5400.8, 60 sec: 5517.4, 300 sec: 5509.7). Total num frames: 931930112. Throughput: 0: 4936.7. Samples: 931924686. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:40,611][25689] Avg episode reward: [(0, '0.868')] [2022-07-10 22:03:41,732][26022] Updated weights on worker 0-0, policy_version 910095 (0.00090) [2022-07-10 22:03:43,679][26022] Updated weights on worker 0-0, policy_version 910105 (0.00091) [2022-07-10 22:03:45,358][26022] Updated weights on worker 0-0, policy_version 910115 (0.00095) [2022-07-10 22:03:45,615][25689] Fps is (10 sec: 5711.5, 60 sec: 5502.7, 300 sec: 5520.9). Total num frames: 931959808. Throughput: 0: 5788.9. Samples: 931957964. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:45,615][25689] Avg episode reward: [(0, '0.682')] [2022-07-10 22:03:47,317][26022] Updated weights on worker 0-0, policy_version 910125 (0.00093) [2022-07-10 22:03:48,895][26022] Updated weights on worker 0-0, policy_version 910135 (0.00096) [2022-07-10 22:03:50,748][25689] Fps is (10 sec: 5353.5, 60 sec: 5466.2, 300 sec: 5508.8). Total num frames: 931984384. Throughput: 0: 5769.3. Samples: 931991380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:50,749][25689] Avg episode reward: [(0, '0.477')] [2022-07-10 22:03:51,091][26022] Updated weights on worker 0-0, policy_version 910145 (0.00096) [2022-07-10 22:03:52,954][26022] Updated weights on worker 0-0, policy_version 910155 (0.00092) [2022-07-10 22:03:54,686][26022] Updated weights on worker 0-0, policy_version 910165 (0.00088) [2022-07-10 22:03:55,771][25689] Fps is (10 sec: 5343.9, 60 sec: 5487.1, 300 sec: 5512.2). Total num frames: 932014080. Throughput: 0: 4920.8. Samples: 932007540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:03:55,772][25689] Avg episode reward: [(0, '0.628')] [2022-07-10 22:03:56,761][26022] Updated weights on worker 0-0, policy_version 910175 (0.00094) [2022-07-10 22:03:58,374][26022] Updated weights on worker 0-0, policy_version 910185 (0.00087) [2022-07-10 22:04:00,334][26022] Updated weights on worker 0-0, policy_version 910195 (0.00071) [2022-07-10 22:04:00,822][25689] Fps is (10 sec: 5793.8, 60 sec: 5483.7, 300 sec: 5522.1). Total num frames: 932042752. Throughput: 0: 5748.5. Samples: 932040874. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:00,823][25689] Avg episode reward: [(0, '-0.251')] [2022-07-10 22:04:02,506][26022] Updated weights on worker 0-0, policy_version 910205 (0.00088) [2022-07-10 22:04:04,314][26022] Updated weights on worker 0-0, policy_version 910215 (0.00104) [2022-07-10 22:04:05,838][25689] Fps is (10 sec: 5289.5, 60 sec: 5490.2, 300 sec: 5510.0). Total num frames: 932067328. Throughput: 0: 5652.9. Samples: 932072282. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:05,838][25689] Avg episode reward: [(0, '-0.170')] [2022-07-10 22:04:06,239][26022] Updated weights on worker 0-0, policy_version 910225 (0.00089) [2022-07-10 22:04:08,035][26022] Updated weights on worker 0-0, policy_version 910235 (0.00092) [2022-07-10 22:04:10,049][26022] Updated weights on worker 0-0, policy_version 910245 (0.00093) [2022-07-10 22:04:10,889][25689] Fps is (10 sec: 5289.3, 60 sec: 5491.2, 300 sec: 5514.0). Total num frames: 932096000. Throughput: 0: 5664.9. Samples: 932105478. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:10,890][25689] Avg episode reward: [(0, '0.252')] [2022-07-10 22:04:11,744][26022] Updated weights on worker 0-0, policy_version 910255 (0.00087) [2022-07-10 22:04:13,743][26022] Updated weights on worker 0-0, policy_version 910265 (0.00089) [2022-07-10 22:04:15,263][26022] Updated weights on worker 0-0, policy_version 910275 (0.00090) [2022-07-10 22:04:15,929][25689] Fps is (10 sec: 5682.3, 60 sec: 5505.5, 300 sec: 5513.5). Total num frames: 932124672. Throughput: 0: 5690.6. Samples: 932122254. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:15,931][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 22:04:17,419][26022] Updated weights on worker 0-0, policy_version 910285 (0.00090) [2022-07-10 22:04:18,996][26022] Updated weights on worker 0-0, policy_version 910295 (0.00094) [2022-07-10 22:04:20,803][26022] Updated weights on worker 0-0, policy_version 910305 (0.00098) [2022-07-10 22:04:20,979][25689] Fps is (10 sec: 5582.0, 60 sec: 5487.7, 300 sec: 5517.7). Total num frames: 932152320. Throughput: 0: 5715.2. Samples: 932156074. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:20,979][25689] Avg episode reward: [(0, '-0.404')] [2022-07-10 22:04:22,807][26022] Updated weights on worker 0-0, policy_version 910315 (0.00086) [2022-07-10 22:04:24,572][26022] Updated weights on worker 0-0, policy_version 910325 (0.00089) [2022-07-10 22:04:26,022][25689] Fps is (10 sec: 5377.3, 60 sec: 5484.7, 300 sec: 5508.9). Total num frames: 932178944. Throughput: 0: 5781.3. Samples: 932188974. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:26,024][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 22:04:26,534][26022] Updated weights on worker 0-0, policy_version 910335 (0.00084) [2022-07-10 22:04:28,387][26022] Updated weights on worker 0-0, policy_version 910345 (0.00096) [2022-07-10 22:04:30,129][26022] Updated weights on worker 0-0, policy_version 910355 (0.00085) [2022-07-10 22:04:31,122][25689] Fps is (10 sec: 5552.4, 60 sec: 5495.3, 300 sec: 5515.6). Total num frames: 932208640. Throughput: 0: 4948.4. Samples: 932205598. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:31,123][25689] Avg episode reward: [(0, '0.254')] [2022-07-10 22:04:32,275][26022] Updated weights on worker 0-0, policy_version 910365 (0.00090) [2022-07-10 22:04:33,754][26022] Updated weights on worker 0-0, policy_version 910375 (0.00085) [2022-07-10 22:04:35,880][26022] Updated weights on worker 0-0, policy_version 910385 (0.00089) [2022-07-10 22:04:36,129][25689] Fps is (10 sec: 5674.0, 60 sec: 5512.8, 300 sec: 5512.2). Total num frames: 932236288. Throughput: 0: 5775.3. Samples: 932238910. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:36,129][25689] Avg episode reward: [(0, '0.083')] [2022-07-10 22:04:37,648][26022] Updated weights on worker 0-0, policy_version 910395 (0.00090) [2022-07-10 22:04:39,639][26022] Updated weights on worker 0-0, policy_version 910405 (0.00099) [2022-07-10 22:04:41,184][25689] Fps is (10 sec: 5393.8, 60 sec: 5494.1, 300 sec: 5507.8). Total num frames: 932262912. Throughput: 0: 5703.9. Samples: 932271322. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:41,185][25689] Avg episode reward: [(0, '0.062')] [2022-07-10 22:04:41,543][26022] Updated weights on worker 0-0, policy_version 910415 (0.00101) [2022-07-10 22:04:43,344][26022] Updated weights on worker 0-0, policy_version 910425 (0.00096) [2022-07-10 22:04:45,400][26022] Updated weights on worker 0-0, policy_version 910435 (0.00093) [2022-07-10 22:04:46,250][25689] Fps is (10 sec: 5362.0, 60 sec: 5454.8, 300 sec: 5511.2). Total num frames: 932290560. Throughput: 0: 4855.3. Samples: 932287188. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:46,251][25689] Avg episode reward: [(0, '1.339')] [2022-07-10 22:04:47,175][26022] Updated weights on worker 0-0, policy_version 910445 (0.00093) [2022-07-10 22:04:49,295][26022] Updated weights on worker 0-0, policy_version 910455 (0.00093) [2022-07-10 22:04:51,190][26022] Updated weights on worker 0-0, policy_version 910465 (0.00084) [2022-07-10 22:04:51,308][25689] Fps is (10 sec: 5259.6, 60 sec: 5478.5, 300 sec: 5497.0). Total num frames: 932316160. Throughput: 0: 5643.1. Samples: 932319508. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:51,309][25689] Avg episode reward: [(0, '1.284')] [2022-07-10 22:04:52,850][26022] Updated weights on worker 0-0, policy_version 910475 (0.00096) [2022-07-10 22:04:54,933][26022] Updated weights on worker 0-0, policy_version 910485 (0.00093) [2022-07-10 22:04:56,324][25689] Fps is (10 sec: 5489.0, 60 sec: 5479.1, 300 sec: 5504.7). Total num frames: 932345856. Throughput: 0: 5627.5. Samples: 932352560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:04:56,325][25689] Avg episode reward: [(0, '1.459')] [2022-07-10 22:04:56,499][26022] Updated weights on worker 0-0, policy_version 910495 (0.00085) [2022-07-10 22:04:58,583][26022] Updated weights on worker 0-0, policy_version 910505 (0.00088) [2022-07-10 22:05:00,289][26022] Updated weights on worker 0-0, policy_version 910515 (0.00050) [2022-07-10 22:05:01,351][25689] Fps is (10 sec: 5506.0, 60 sec: 5430.5, 300 sec: 5501.8). Total num frames: 932371456. Throughput: 0: 4849.5. Samples: 932369120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:01,351][25689] Avg episode reward: [(0, '1.351')] [2022-07-10 22:05:02,824][26022] Updated weights on worker 0-0, policy_version 910525 (0.00083) [2022-07-10 22:05:04,355][26022] Updated weights on worker 0-0, policy_version 910535 (0.00097) [2022-07-10 22:05:06,381][25689] Fps is (10 sec: 5090.9, 60 sec: 5446.1, 300 sec: 5496.0). Total num frames: 932397056. Throughput: 0: 5604.6. Samples: 932400014. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:06,383][25689] Avg episode reward: [(0, '1.186')] [2022-07-10 22:05:06,392][26022] Updated weights on worker 0-0, policy_version 910545 (0.00086) [2022-07-10 22:05:07,954][26022] Updated weights on worker 0-0, policy_version 910555 (0.00086) [2022-07-10 22:05:09,188][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:05:09,200][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000910560_932413440.pth [2022-07-10 22:05:09,201][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000908623_930429952.pth [2022-07-10 22:05:10,119][26022] Updated weights on worker 0-0, policy_version 910565 (0.00087) [2022-07-10 22:05:11,483][25689] Fps is (10 sec: 5558.5, 60 sec: 5475.4, 300 sec: 5501.6). Total num frames: 932427776. Throughput: 0: 5636.3. Samples: 932433222. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:11,484][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 22:05:11,725][26022] Updated weights on worker 0-0, policy_version 910575 (0.00084) [2022-07-10 22:05:13,762][26022] Updated weights on worker 0-0, policy_version 910585 (0.00091) [2022-07-10 22:05:15,513][26022] Updated weights on worker 0-0, policy_version 910595 (0.00089) [2022-07-10 22:05:16,544][25689] Fps is (10 sec: 5642.8, 60 sec: 5439.8, 300 sec: 5497.5). Total num frames: 932454400. Throughput: 0: 4816.3. Samples: 932449944. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:16,544][25689] Avg episode reward: [(0, '0.061')] [2022-07-10 22:05:17,436][26022] Updated weights on worker 0-0, policy_version 910605 (0.00094) [2022-07-10 22:05:19,290][26022] Updated weights on worker 0-0, policy_version 910615 (0.00083) [2022-07-10 22:05:21,186][26022] Updated weights on worker 0-0, policy_version 910625 (0.00081) [2022-07-10 22:05:21,564][25689] Fps is (10 sec: 5383.7, 60 sec: 5442.4, 300 sec: 5501.4). Total num frames: 932482048. Throughput: 0: 5628.0. Samples: 932482878. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:21,565][25689] Avg episode reward: [(0, '-0.165')] [2022-07-10 22:05:23,006][26022] Updated weights on worker 0-0, policy_version 910635 (0.00087) [2022-07-10 22:05:24,926][26022] Updated weights on worker 0-0, policy_version 910645 (0.00085) [2022-07-10 22:05:26,594][25689] Fps is (10 sec: 5501.7, 60 sec: 5460.5, 300 sec: 5495.2). Total num frames: 932509696. Throughput: 0: 5737.9. Samples: 932515994. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:26,595][25689] Avg episode reward: [(0, '-0.322')] [2022-07-10 22:05:26,754][26022] Updated weights on worker 0-0, policy_version 910655 (0.00149) [2022-07-10 22:05:28,468][26022] Updated weights on worker 0-0, policy_version 910665 (0.00083) [2022-07-10 22:05:30,484][26022] Updated weights on worker 0-0, policy_version 910675 (0.00101) [2022-07-10 22:05:31,655][25689] Fps is (10 sec: 5479.8, 60 sec: 5430.2, 300 sec: 5495.8). Total num frames: 932537344. Throughput: 0: 4927.2. Samples: 932532612. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:31,657][25689] Avg episode reward: [(0, '-0.597')] [2022-07-10 22:05:32,174][26022] Updated weights on worker 0-0, policy_version 910685 (0.00091) [2022-07-10 22:05:34,138][26022] Updated weights on worker 0-0, policy_version 910695 (0.00084) [2022-07-10 22:05:35,873][26022] Updated weights on worker 0-0, policy_version 910705 (0.00093) [2022-07-10 22:05:36,695][25689] Fps is (10 sec: 5474.7, 60 sec: 5427.2, 300 sec: 5495.8). Total num frames: 932564992. Throughput: 0: 5769.2. Samples: 932566200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:36,695][25689] Avg episode reward: [(0, '-0.527')] [2022-07-10 22:05:37,664][26022] Updated weights on worker 0-0, policy_version 910715 (0.00087) [2022-07-10 22:05:39,853][26022] Updated weights on worker 0-0, policy_version 910725 (0.00099) [2022-07-10 22:05:41,421][26022] Updated weights on worker 0-0, policy_version 910735 (0.00094) [2022-07-10 22:05:41,698][25689] Fps is (10 sec: 5607.9, 60 sec: 5465.7, 300 sec: 5496.7). Total num frames: 932593664. Throughput: 0: 5800.9. Samples: 932599672. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:41,699][25689] Avg episode reward: [(0, '0.535')] [2022-07-10 22:05:43,403][26022] Updated weights on worker 0-0, policy_version 910745 (0.00085) [2022-07-10 22:05:45,060][26022] Updated weights on worker 0-0, policy_version 910755 (0.00080) [2022-07-10 22:05:46,707][25689] Fps is (10 sec: 5625.3, 60 sec: 5470.9, 300 sec: 5500.7). Total num frames: 932621312. Throughput: 0: 4994.0. Samples: 932616434. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:46,707][25689] Avg episode reward: [(0, '0.664')] [2022-07-10 22:05:46,898][26022] Updated weights on worker 0-0, policy_version 910765 (0.00092) [2022-07-10 22:05:48,720][26022] Updated weights on worker 0-0, policy_version 910775 (0.00092) [2022-07-10 22:05:50,616][26022] Updated weights on worker 0-0, policy_version 910785 (0.00087) [2022-07-10 22:05:51,771][25689] Fps is (10 sec: 5692.9, 60 sec: 5538.1, 300 sec: 5503.1). Total num frames: 932651008. Throughput: 0: 5841.4. Samples: 932650118. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:51,772][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 22:05:52,444][26022] Updated weights on worker 0-0, policy_version 910795 (0.00086) [2022-07-10 22:05:54,337][26022] Updated weights on worker 0-0, policy_version 910805 (0.00090) [2022-07-10 22:05:56,132][26022] Updated weights on worker 0-0, policy_version 910815 (0.00091) [2022-07-10 22:05:56,804][25689] Fps is (10 sec: 5679.4, 60 sec: 5502.7, 300 sec: 5499.5). Total num frames: 932678656. Throughput: 0: 5827.5. Samples: 932683384. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:05:56,805][25689] Avg episode reward: [(0, '0.231')] [2022-07-10 22:05:57,935][26022] Updated weights on worker 0-0, policy_version 910825 (0.00082) [2022-07-10 22:05:59,736][26022] Updated weights on worker 0-0, policy_version 910835 (0.00089) [2022-07-10 22:06:01,806][25689] Fps is (10 sec: 5306.2, 60 sec: 5504.9, 300 sec: 5499.6). Total num frames: 932704256. Throughput: 0: 5000.0. Samples: 932700212. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:01,808][25689] Avg episode reward: [(0, '0.662')] [2022-07-10 22:06:01,949][26022] Updated weights on worker 0-0, policy_version 910845 (0.00095) [2022-07-10 22:06:03,975][26022] Updated weights on worker 0-0, policy_version 910855 (0.00086) [2022-07-10 22:06:05,666][26022] Updated weights on worker 0-0, policy_version 910865 (0.00088) [2022-07-10 22:06:06,833][25689] Fps is (10 sec: 5207.3, 60 sec: 5522.2, 300 sec: 5501.0). Total num frames: 932730880. Throughput: 0: 5729.0. Samples: 932731736. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:06,835][25689] Avg episode reward: [(0, '0.756')] [2022-07-10 22:06:07,712][26022] Updated weights on worker 0-0, policy_version 910875 (0.00088) [2022-07-10 22:06:09,387][26022] Updated weights on worker 0-0, policy_version 910885 (0.00097) [2022-07-10 22:06:11,276][26022] Updated weights on worker 0-0, policy_version 910895 (0.00092) [2022-07-10 22:06:11,885][25689] Fps is (10 sec: 5587.6, 60 sec: 5509.8, 300 sec: 5507.1). Total num frames: 932760576. Throughput: 0: 5720.1. Samples: 932765174. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:11,888][25689] Avg episode reward: [(0, '0.540')] [2022-07-10 22:06:13,120][26022] Updated weights on worker 0-0, policy_version 910905 (0.00083) [2022-07-10 22:06:14,920][26022] Updated weights on worker 0-0, policy_version 910915 (0.00084) [2022-07-10 22:06:16,905][26022] Updated weights on worker 0-0, policy_version 910925 (0.00091) [2022-07-10 22:06:16,924][25689] Fps is (10 sec: 5581.1, 60 sec: 5511.7, 300 sec: 5500.1). Total num frames: 932787200. Throughput: 0: 4898.8. Samples: 932781948. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:16,924][25689] Avg episode reward: [(0, '0.627')] [2022-07-10 22:06:18,718][26022] Updated weights on worker 0-0, policy_version 910935 (0.00087) [2022-07-10 22:06:20,491][26022] Updated weights on worker 0-0, policy_version 910945 (0.00095) [2022-07-10 22:06:21,950][25689] Fps is (10 sec: 5290.8, 60 sec: 5494.3, 300 sec: 5496.3). Total num frames: 932813824. Throughput: 0: 5703.3. Samples: 932815098. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:21,952][25689] Avg episode reward: [(0, '1.039')] [2022-07-10 22:06:22,355][26022] Updated weights on worker 0-0, policy_version 910955 (0.00094) [2022-07-10 22:06:24,117][26022] Updated weights on worker 0-0, policy_version 910965 (0.00100) [2022-07-10 22:06:26,229][26022] Updated weights on worker 0-0, policy_version 910975 (0.00088) [2022-07-10 22:06:26,971][25689] Fps is (10 sec: 5605.7, 60 sec: 5529.1, 300 sec: 5500.1). Total num frames: 932843520. Throughput: 0: 5778.1. Samples: 932848096. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:26,971][25689] Avg episode reward: [(0, '0.045')] [2022-07-10 22:06:28,010][26022] Updated weights on worker 0-0, policy_version 910985 (0.00090) [2022-07-10 22:06:29,955][26022] Updated weights on worker 0-0, policy_version 910995 (0.00094) [2022-07-10 22:06:31,459][26022] Updated weights on worker 0-0, policy_version 911005 (0.00080) [2022-07-10 22:06:32,018][25689] Fps is (10 sec: 5695.3, 60 sec: 5530.2, 300 sec: 5499.7). Total num frames: 932871168. Throughput: 0: 4942.4. Samples: 932864680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:32,019][25689] Avg episode reward: [(0, '-0.474')] [2022-07-10 22:06:33,542][26022] Updated weights on worker 0-0, policy_version 911015 (0.00081) [2022-07-10 22:06:35,169][26022] Updated weights on worker 0-0, policy_version 911025 (0.00082) [2022-07-10 22:06:37,035][25689] Fps is (10 sec: 5392.6, 60 sec: 5515.4, 300 sec: 5496.1). Total num frames: 932897792. Throughput: 0: 5797.3. Samples: 932898538. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:37,035][25689] Avg episode reward: [(0, '-1.090')] [2022-07-10 22:06:37,255][26022] Updated weights on worker 0-0, policy_version 911035 (0.00094) [2022-07-10 22:06:38,780][26022] Updated weights on worker 0-0, policy_version 911045 (0.00085) [2022-07-10 22:06:40,800][26022] Updated weights on worker 0-0, policy_version 911055 (0.00092) [2022-07-10 22:06:42,056][25689] Fps is (10 sec: 5610.8, 60 sec: 5530.7, 300 sec: 5499.2). Total num frames: 932927488. Throughput: 0: 5808.6. Samples: 932931888. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:42,057][25689] Avg episode reward: [(0, '-1.066')] [2022-07-10 22:06:42,612][26022] Updated weights on worker 0-0, policy_version 911065 (0.00093) [2022-07-10 22:06:44,580][26022] Updated weights on worker 0-0, policy_version 911075 (0.00086) [2022-07-10 22:06:46,288][26022] Updated weights on worker 0-0, policy_version 911085 (0.00102) [2022-07-10 22:06:47,075][25689] Fps is (10 sec: 5609.3, 60 sec: 5512.8, 300 sec: 5490.6). Total num frames: 932954112. Throughput: 0: 4993.7. Samples: 932948492. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:47,076][25689] Avg episode reward: [(0, '-1.480')] [2022-07-10 22:06:48,183][26022] Updated weights on worker 0-0, policy_version 911095 (0.00084) [2022-07-10 22:06:50,075][26022] Updated weights on worker 0-0, policy_version 911105 (0.00059) [2022-07-10 22:06:51,781][26022] Updated weights on worker 0-0, policy_version 911115 (0.00087) [2022-07-10 22:06:52,120][25689] Fps is (10 sec: 5494.6, 60 sec: 5497.6, 300 sec: 5496.9). Total num frames: 932982784. Throughput: 0: 5838.7. Samples: 932982046. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:52,120][25689] Avg episode reward: [(0, '-1.471')] [2022-07-10 22:06:53,652][26022] Updated weights on worker 0-0, policy_version 911125 (0.00086) [2022-07-10 22:06:55,632][26022] Updated weights on worker 0-0, policy_version 911135 (0.00097) [2022-07-10 22:06:57,136][25689] Fps is (10 sec: 5598.0, 60 sec: 5499.1, 300 sec: 5493.8). Total num frames: 933010432. Throughput: 0: 5810.9. Samples: 933015344. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:06:57,137][25689] Avg episode reward: [(0, '-0.368')] [2022-07-10 22:06:57,354][26022] Updated weights on worker 0-0, policy_version 911145 (0.00086) [2022-07-10 22:06:59,220][26022] Updated weights on worker 0-0, policy_version 911155 (0.00085) [2022-07-10 22:07:01,180][26022] Updated weights on worker 0-0, policy_version 911165 (0.00085) [2022-07-10 22:07:02,161][25689] Fps is (10 sec: 5302.8, 60 sec: 5497.0, 300 sec: 5493.5). Total num frames: 933036032. Throughput: 0: 4986.4. Samples: 933032140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:07:02,162][25689] Avg episode reward: [(0, '0.012')] [2022-07-10 22:07:03,197][26022] Updated weights on worker 0-0, policy_version 911175 (0.00787) [2022-07-10 22:07:05,309][26022] Updated weights on worker 0-0, policy_version 911185 (0.00087) [2022-07-10 22:07:06,811][26022] Updated weights on worker 0-0, policy_version 911195 (0.00092) [2022-07-10 22:07:07,176][25689] Fps is (10 sec: 5405.7, 60 sec: 5532.1, 300 sec: 5501.0). Total num frames: 933064704. Throughput: 0: 5742.0. Samples: 933063910. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2022-07-10 22:07:07,176][25689] Avg episode reward: [(0, '0.694')] [2022-07-10 22:07:08,773][26022] Updated weights on worker 0-0, policy_version 911205 (0.00087) [2022-07-10 22:07:09,408][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:07:09,421][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000911208_933076992.pth [2022-07-10 22:07:09,421][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000909269_931091456.pth [2022-07-10 22:07:10,527][26022] Updated weights on worker 0-0, policy_version 911215 (0.00092) [2022-07-10 22:07:12,275][25689] Fps is (10 sec: 5670.1, 60 sec: 5510.9, 300 sec: 5499.8). Total num frames: 933093376. Throughput: 0: 5710.2. Samples: 933097134. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:12,275][25689] Avg episode reward: [(0, '0.499')] [2022-07-10 22:07:12,350][26022] Updated weights on worker 0-0, policy_version 911225 (0.00087) [2022-07-10 22:07:14,299][26022] Updated weights on worker 0-0, policy_version 911235 (0.00097) [2022-07-10 22:07:16,191][26022] Updated weights on worker 0-0, policy_version 911245 (0.00295) [2022-07-10 22:07:17,312][25689] Fps is (10 sec: 5455.0, 60 sec: 5511.0, 300 sec: 5496.2). Total num frames: 933120000. Throughput: 0: 4883.9. Samples: 933113884. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:17,319][25689] Avg episode reward: [(0, '1.232')] [2022-07-10 22:07:18,127][26022] Updated weights on worker 0-0, policy_version 911255 (0.00085) [2022-07-10 22:07:19,852][26022] Updated weights on worker 0-0, policy_version 911265 (0.00089) [2022-07-10 22:07:21,680][26022] Updated weights on worker 0-0, policy_version 911275 (0.00086) [2022-07-10 22:07:22,321][25689] Fps is (10 sec: 5504.0, 60 sec: 5546.4, 300 sec: 5500.0). Total num frames: 933148672. Throughput: 0: 5713.7. Samples: 933147328. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:22,323][25689] Avg episode reward: [(0, '1.235')] [2022-07-10 22:07:23,503][26022] Updated weights on worker 0-0, policy_version 911285 (0.00087) [2022-07-10 22:07:25,401][26022] Updated weights on worker 0-0, policy_version 911295 (0.00091) [2022-07-10 22:07:27,196][26022] Updated weights on worker 0-0, policy_version 911305 (0.00095) [2022-07-10 22:07:27,352][25689] Fps is (10 sec: 5609.8, 60 sec: 5511.6, 300 sec: 5497.2). Total num frames: 933176320. Throughput: 0: 5772.6. Samples: 933180380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:27,352][25689] Avg episode reward: [(0, '1.066')] [2022-07-10 22:07:29,258][26022] Updated weights on worker 0-0, policy_version 911315 (0.00088) [2022-07-10 22:07:31,053][26022] Updated weights on worker 0-0, policy_version 911325 (0.00055) [2022-07-10 22:07:32,449][25689] Fps is (10 sec: 5460.0, 60 sec: 5507.2, 300 sec: 5492.5). Total num frames: 933203968. Throughput: 0: 5753.1. Samples: 933213198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:32,449][25689] Avg episode reward: [(0, '0.450')] [2022-07-10 22:07:32,799][26022] Updated weights on worker 0-0, policy_version 911335 (0.00090) [2022-07-10 22:07:34,699][26022] Updated weights on worker 0-0, policy_version 911345 (0.00083) [2022-07-10 22:07:36,406][26022] Updated weights on worker 0-0, policy_version 911355 (0.00087) [2022-07-10 22:07:37,480][25689] Fps is (10 sec: 5459.7, 60 sec: 5522.8, 300 sec: 5499.2). Total num frames: 933231616. Throughput: 0: 5760.6. Samples: 933230060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:37,480][25689] Avg episode reward: [(0, '0.404')] [2022-07-10 22:07:38,397][26022] Updated weights on worker 0-0, policy_version 911365 (0.00092) [2022-07-10 22:07:40,164][26022] Updated weights on worker 0-0, policy_version 911375 (0.00089) [2022-07-10 22:07:42,051][26022] Updated weights on worker 0-0, policy_version 911385 (0.00090) [2022-07-10 22:07:42,489][25689] Fps is (10 sec: 5609.6, 60 sec: 5507.0, 300 sec: 5492.7). Total num frames: 933260288. Throughput: 0: 5753.3. Samples: 933263358. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:42,490][25689] Avg episode reward: [(0, '-0.012')] [2022-07-10 22:07:44,061][26022] Updated weights on worker 0-0, policy_version 911395 (0.00052) [2022-07-10 22:07:45,747][26022] Updated weights on worker 0-0, policy_version 911405 (0.00086) [2022-07-10 22:07:47,534][25689] Fps is (10 sec: 5601.8, 60 sec: 5521.5, 300 sec: 5497.3). Total num frames: 933287936. Throughput: 0: 5772.4. Samples: 933296878. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:47,534][25689] Avg episode reward: [(0, '0.138')] [2022-07-10 22:07:47,656][26022] Updated weights on worker 0-0, policy_version 911415 (0.00088) [2022-07-10 22:07:49,408][26022] Updated weights on worker 0-0, policy_version 911425 (0.00083) [2022-07-10 22:07:51,159][26022] Updated weights on worker 0-0, policy_version 911435 (0.00092) [2022-07-10 22:07:52,625][25689] Fps is (10 sec: 5556.4, 60 sec: 5517.3, 300 sec: 5496.8). Total num frames: 933316608. Throughput: 0: 4983.3. Samples: 933313742. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:52,626][25689] Avg episode reward: [(0, '0.139')] [2022-07-10 22:07:52,963][26022] Updated weights on worker 0-0, policy_version 911445 (0.00083) [2022-07-10 22:07:54,792][26022] Updated weights on worker 0-0, policy_version 911455 (0.00087) [2022-07-10 22:07:56,581][26022] Updated weights on worker 0-0, policy_version 911465 (0.00083) [2022-07-10 22:07:57,650][25689] Fps is (10 sec: 5567.6, 60 sec: 5516.5, 300 sec: 5493.2). Total num frames: 933344256. Throughput: 0: 5822.1. Samples: 933347490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:07:57,650][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 22:07:58,537][26022] Updated weights on worker 0-0, policy_version 911475 (0.00087) [2022-07-10 22:08:00,449][26022] Updated weights on worker 0-0, policy_version 911485 (0.00085) [2022-07-10 22:08:02,523][26022] Updated weights on worker 0-0, policy_version 911495 (0.00087) [2022-07-10 22:08:02,665][25689] Fps is (10 sec: 5507.5, 60 sec: 5551.3, 300 sec: 5504.8). Total num frames: 933371904. Throughput: 0: 5728.1. Samples: 933378930. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:02,665][25689] Avg episode reward: [(0, '0.315')] [2022-07-10 22:08:04,499][26022] Updated weights on worker 0-0, policy_version 911505 (0.00117) [2022-07-10 22:08:06,348][26022] Updated weights on worker 0-0, policy_version 911515 (0.00094) [2022-07-10 22:08:07,694][25689] Fps is (10 sec: 5403.1, 60 sec: 5516.1, 300 sec: 5498.6). Total num frames: 933398528. Throughput: 0: 4905.7. Samples: 933395778. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:07,695][25689] Avg episode reward: [(0, '0.231')] [2022-07-10 22:08:08,145][26022] Updated weights on worker 0-0, policy_version 911525 (0.00085) [2022-07-10 22:08:10,050][26022] Updated weights on worker 0-0, policy_version 911535 (0.00097) [2022-07-10 22:08:11,594][26022] Updated weights on worker 0-0, policy_version 911545 (0.00085) [2022-07-10 22:08:12,725][25689] Fps is (10 sec: 5394.7, 60 sec: 5505.3, 300 sec: 5498.2). Total num frames: 933426176. Throughput: 0: 5748.3. Samples: 933429284. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:12,725][25689] Avg episode reward: [(0, '0.554')] [2022-07-10 22:08:13,650][26022] Updated weights on worker 0-0, policy_version 911555 (0.00088) [2022-07-10 22:08:15,476][26022] Updated weights on worker 0-0, policy_version 911565 (0.00091) [2022-07-10 22:08:17,283][26022] Updated weights on worker 0-0, policy_version 911575 (0.00088) [2022-07-10 22:08:17,732][25689] Fps is (10 sec: 5815.1, 60 sec: 5576.0, 300 sec: 5505.7). Total num frames: 933456896. Throughput: 0: 5752.6. Samples: 933463014. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:17,732][25689] Avg episode reward: [(0, '0.347')] [2022-07-10 22:08:19,200][26022] Updated weights on worker 0-0, policy_version 911585 (0.00084) [2022-07-10 22:08:20,759][26022] Updated weights on worker 0-0, policy_version 911595 (0.00090) [2022-07-10 22:08:22,753][25689] Fps is (10 sec: 5616.0, 60 sec: 5523.9, 300 sec: 5502.0). Total num frames: 933482496. Throughput: 0: 5034.3. Samples: 933480062. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:22,754][25689] Avg episode reward: [(0, '0.249')] [2022-07-10 22:08:22,805][26022] Updated weights on worker 0-0, policy_version 911605 (0.00091) [2022-07-10 22:08:24,434][26022] Updated weights on worker 0-0, policy_version 911615 (0.00089) [2022-07-10 22:08:26,443][26022] Updated weights on worker 0-0, policy_version 911625 (0.00085) [2022-07-10 22:08:27,780][25689] Fps is (10 sec: 5401.1, 60 sec: 5541.3, 300 sec: 5502.1). Total num frames: 933511168. Throughput: 0: 5857.7. Samples: 933513434. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:27,782][25689] Avg episode reward: [(0, '-0.383')] [2022-07-10 22:08:28,326][26022] Updated weights on worker 0-0, policy_version 911635 (0.00086) [2022-07-10 22:08:29,941][26022] Updated weights on worker 0-0, policy_version 911645 (0.00086) [2022-07-10 22:08:32,070][26022] Updated weights on worker 0-0, policy_version 911655 (0.00083) [2022-07-10 22:08:32,905][25689] Fps is (10 sec: 5648.9, 60 sec: 5555.6, 300 sec: 5506.9). Total num frames: 933539840. Throughput: 0: 5833.9. Samples: 933547012. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:32,907][25689] Avg episode reward: [(0, '-0.456')] [2022-07-10 22:08:33,628][26022] Updated weights on worker 0-0, policy_version 911665 (0.00088) [2022-07-10 22:08:35,775][26022] Updated weights on worker 0-0, policy_version 911675 (0.00089) [2022-07-10 22:08:37,171][26022] Updated weights on worker 0-0, policy_version 911685 (0.00086) [2022-07-10 22:08:37,938][25689] Fps is (10 sec: 5645.3, 60 sec: 5572.4, 300 sec: 5510.4). Total num frames: 933568512. Throughput: 0: 4989.8. Samples: 933563838. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:37,938][25689] Avg episode reward: [(0, '-0.749')] [2022-07-10 22:08:39,298][26022] Updated weights on worker 0-0, policy_version 911695 (0.00084) [2022-07-10 22:08:40,938][26022] Updated weights on worker 0-0, policy_version 911705 (0.00084) [2022-07-10 22:08:42,901][26022] Updated weights on worker 0-0, policy_version 911715 (0.00089) [2022-07-10 22:08:42,995][25689] Fps is (10 sec: 5582.0, 60 sec: 5551.1, 300 sec: 5502.6). Total num frames: 933596160. Throughput: 0: 5792.4. Samples: 933597306. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:42,996][25689] Avg episode reward: [(0, '-0.477')] [2022-07-10 22:08:44,726][26022] Updated weights on worker 0-0, policy_version 911725 (0.00089) [2022-07-10 22:08:46,780][26022] Updated weights on worker 0-0, policy_version 911735 (0.00095) [2022-07-10 22:08:48,004][25689] Fps is (10 sec: 5595.0, 60 sec: 5571.3, 300 sec: 5518.6). Total num frames: 933624832. Throughput: 0: 5789.1. Samples: 933630512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:48,006][25689] Avg episode reward: [(0, '-0.796')] [2022-07-10 22:08:48,480][26022] Updated weights on worker 0-0, policy_version 911745 (0.00086) [2022-07-10 22:08:50,604][26022] Updated weights on worker 0-0, policy_version 911755 (0.00103) [2022-07-10 22:08:52,209][26022] Updated weights on worker 0-0, policy_version 911765 (0.00086) [2022-07-10 22:08:53,119][25689] Fps is (10 sec: 5461.5, 60 sec: 5535.2, 300 sec: 5506.6). Total num frames: 933651456. Throughput: 0: 4954.1. Samples: 933647156. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:53,121][25689] Avg episode reward: [(0, '-0.495')] [2022-07-10 22:08:54,374][26022] Updated weights on worker 0-0, policy_version 911775 (0.00081) [2022-07-10 22:08:55,914][26022] Updated weights on worker 0-0, policy_version 911785 (0.00087) [2022-07-10 22:08:57,921][26022] Updated weights on worker 0-0, policy_version 911795 (0.00082) [2022-07-10 22:08:58,138][25689] Fps is (10 sec: 5355.6, 60 sec: 5535.8, 300 sec: 5503.8). Total num frames: 933679104. Throughput: 0: 5750.0. Samples: 933679988. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:08:58,138][25689] Avg episode reward: [(0, '0.469')] [2022-07-10 22:08:59,662][26022] Updated weights on worker 0-0, policy_version 911805 (0.00086) [2022-07-10 22:09:01,903][26022] Updated weights on worker 0-0, policy_version 911815 (0.00087) [2022-07-10 22:09:03,148][25689] Fps is (10 sec: 5309.4, 60 sec: 5502.3, 300 sec: 5507.3). Total num frames: 933704704. Throughput: 0: 5655.2. Samples: 933711280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:03,150][25689] Avg episode reward: [(0, '0.843')] [2022-07-10 22:09:03,755][26022] Updated weights on worker 0-0, policy_version 911825 (0.00091) [2022-07-10 22:09:05,746][26022] Updated weights on worker 0-0, policy_version 911835 (0.00090) [2022-07-10 22:09:07,294][26022] Updated weights on worker 0-0, policy_version 911845 (0.00089) [2022-07-10 22:09:08,167][25689] Fps is (10 sec: 5411.1, 60 sec: 5537.1, 300 sec: 5507.9). Total num frames: 933733376. Throughput: 0: 4838.2. Samples: 933728066. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:08,168][25689] Avg episode reward: [(0, '0.484')] [2022-07-10 22:09:09,385][26022] Updated weights on worker 0-0, policy_version 911855 (0.00090) [2022-07-10 22:09:09,512][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:09:09,523][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000911856_933740544.pth [2022-07-10 22:09:09,523][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000909917_931755008.pth [2022-07-10 22:09:11,014][26022] Updated weights on worker 0-0, policy_version 911865 (0.00087) [2022-07-10 22:09:12,823][26022] Updated weights on worker 0-0, policy_version 911875 (0.00096) [2022-07-10 22:09:13,211][25689] Fps is (10 sec: 5495.3, 60 sec: 5519.1, 300 sec: 5500.9). Total num frames: 933760000. Throughput: 0: 5691.0. Samples: 933761496. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:13,211][25689] Avg episode reward: [(0, '0.531')] [2022-07-10 22:09:14,627][26022] Updated weights on worker 0-0, policy_version 911885 (0.00092) [2022-07-10 22:09:16,483][26022] Updated weights on worker 0-0, policy_version 911895 (0.00562) [2022-07-10 22:09:18,116][26022] Updated weights on worker 0-0, policy_version 911905 (0.00089) [2022-07-10 22:09:18,212][25689] Fps is (10 sec: 5708.9, 60 sec: 5519.5, 300 sec: 5512.2). Total num frames: 933790720. Throughput: 0: 5755.6. Samples: 933795528. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:18,213][25689] Avg episode reward: [(0, '0.622')] [2022-07-10 22:09:20,284][26022] Updated weights on worker 0-0, policy_version 911915 (0.00082) [2022-07-10 22:09:21,730][26022] Updated weights on worker 0-0, policy_version 911925 (0.00090) [2022-07-10 22:09:23,214][25689] Fps is (10 sec: 5732.4, 60 sec: 5538.3, 300 sec: 5512.9). Total num frames: 933817344. Throughput: 0: 5035.8. Samples: 933812328. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:23,215][25689] Avg episode reward: [(0, '0.882')] [2022-07-10 22:09:23,909][26022] Updated weights on worker 0-0, policy_version 911935 (0.00085) [2022-07-10 22:09:25,600][26022] Updated weights on worker 0-0, policy_version 911945 (0.00095) [2022-07-10 22:09:27,362][26022] Updated weights on worker 0-0, policy_version 911955 (0.00087) [2022-07-10 22:09:28,234][25689] Fps is (10 sec: 5415.2, 60 sec: 5521.9, 300 sec: 5507.5). Total num frames: 933844992. Throughput: 0: 5861.9. Samples: 933845696. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:28,236][25689] Avg episode reward: [(0, '1.065')] [2022-07-10 22:09:29,392][26022] Updated weights on worker 0-0, policy_version 911965 (0.00090) [2022-07-10 22:09:31,273][26022] Updated weights on worker 0-0, policy_version 911975 (0.00085) [2022-07-10 22:09:32,987][26022] Updated weights on worker 0-0, policy_version 911985 (0.00081) [2022-07-10 22:09:33,290][25689] Fps is (10 sec: 5691.3, 60 sec: 5545.2, 300 sec: 5513.5). Total num frames: 933874688. Throughput: 0: 5866.7. Samples: 933879296. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:33,291][25689] Avg episode reward: [(0, '0.898')] [2022-07-10 22:09:34,984][26022] Updated weights on worker 0-0, policy_version 911995 (0.00084) [2022-07-10 22:09:36,334][26022] Updated weights on worker 0-0, policy_version 912005 (0.00086) [2022-07-10 22:09:38,296][25689] Fps is (10 sec: 5495.6, 60 sec: 5496.8, 300 sec: 5511.0). Total num frames: 933900288. Throughput: 0: 5018.4. Samples: 933896318. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:38,298][25689] Avg episode reward: [(0, '1.126')] [2022-07-10 22:09:38,693][26022] Updated weights on worker 0-0, policy_version 912015 (0.00089) [2022-07-10 22:09:40,314][26022] Updated weights on worker 0-0, policy_version 912025 (0.00092) [2022-07-10 22:09:42,268][26022] Updated weights on worker 0-0, policy_version 912035 (0.00088) [2022-07-10 22:09:43,311][25689] Fps is (10 sec: 5620.0, 60 sec: 5551.5, 300 sec: 5522.3). Total num frames: 933931008. Throughput: 0: 5848.0. Samples: 933929856. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:43,312][25689] Avg episode reward: [(0, '1.385')] [2022-07-10 22:09:44,084][26022] Updated weights on worker 0-0, policy_version 912045 (0.00089) [2022-07-10 22:09:45,747][26022] Updated weights on worker 0-0, policy_version 912055 (0.00097) [2022-07-10 22:09:47,472][26022] Updated weights on worker 0-0, policy_version 912065 (0.00085) [2022-07-10 22:09:48,320][25689] Fps is (10 sec: 5720.7, 60 sec: 5517.6, 300 sec: 5526.6). Total num frames: 933957632. Throughput: 0: 5875.7. Samples: 933963714. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:48,320][25689] Avg episode reward: [(0, '0.979')] [2022-07-10 22:09:49,478][26022] Updated weights on worker 0-0, policy_version 912075 (0.00089) [2022-07-10 22:09:51,069][26022] Updated weights on worker 0-0, policy_version 912085 (0.00053) [2022-07-10 22:09:53,118][26022] Updated weights on worker 0-0, policy_version 912095 (0.00090) [2022-07-10 22:09:53,437][25689] Fps is (10 sec: 5562.0, 60 sec: 5568.4, 300 sec: 5524.7). Total num frames: 933987328. Throughput: 0: 5025.7. Samples: 933980550. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:53,438][25689] Avg episode reward: [(0, '0.783')] [2022-07-10 22:09:54,831][26022] Updated weights on worker 0-0, policy_version 912105 (0.00100) [2022-07-10 22:09:56,680][26022] Updated weights on worker 0-0, policy_version 912115 (0.00096) [2022-07-10 22:09:58,462][25689] Fps is (10 sec: 5654.1, 60 sec: 5567.8, 300 sec: 5531.7). Total num frames: 934014976. Throughput: 0: 5854.9. Samples: 934014386. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:09:58,462][25689] Avg episode reward: [(0, '0.699')] [2022-07-10 22:09:58,561][26022] Updated weights on worker 0-0, policy_version 912125 (0.00097) [2022-07-10 22:10:00,408][26022] Updated weights on worker 0-0, policy_version 912135 (0.00088) [2022-07-10 22:10:02,797][26022] Updated weights on worker 0-0, policy_version 912145 (0.00092) [2022-07-10 22:10:03,513][25689] Fps is (10 sec: 5284.9, 60 sec: 5564.1, 300 sec: 5531.3). Total num frames: 934040576. Throughput: 0: 5725.1. Samples: 934045510. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:03,513][25689] Avg episode reward: [(0, '-0.353')] [2022-07-10 22:10:04,458][26022] Updated weights on worker 0-0, policy_version 912155 (0.00089) [2022-07-10 22:10:06,312][26022] Updated weights on worker 0-0, policy_version 912165 (0.00100) [2022-07-10 22:10:08,164][26022] Updated weights on worker 0-0, policy_version 912175 (0.00085) [2022-07-10 22:10:08,543][25689] Fps is (10 sec: 5383.5, 60 sec: 5563.0, 300 sec: 5525.7). Total num frames: 934069248. Throughput: 0: 4880.3. Samples: 934062408. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:08,545][25689] Avg episode reward: [(0, '-0.313')] [2022-07-10 22:10:09,963][26022] Updated weights on worker 0-0, policy_version 912185 (0.00090) [2022-07-10 22:10:11,875][26022] Updated weights on worker 0-0, policy_version 912195 (0.00088) [2022-07-10 22:10:13,586][26022] Updated weights on worker 0-0, policy_version 912205 (0.00086) [2022-07-10 22:10:13,627][25689] Fps is (10 sec: 5669.4, 60 sec: 5593.2, 300 sec: 5532.2). Total num frames: 934097920. Throughput: 0: 5708.4. Samples: 934095804. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:13,628][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 22:10:15,669][26022] Updated weights on worker 0-0, policy_version 912215 (0.00090) [2022-07-10 22:10:17,251][26022] Updated weights on worker 0-0, policy_version 912225 (0.00090) [2022-07-10 22:10:18,661][25689] Fps is (10 sec: 5465.3, 60 sec: 5522.4, 300 sec: 5528.5). Total num frames: 934124544. Throughput: 0: 5692.8. Samples: 934129374. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:18,661][25689] Avg episode reward: [(0, '-0.212')] [2022-07-10 22:10:19,467][26022] Updated weights on worker 0-0, policy_version 912235 (0.00094) [2022-07-10 22:10:20,835][26022] Updated weights on worker 0-0, policy_version 912245 (0.00085) [2022-07-10 22:10:23,161][26022] Updated weights on worker 0-0, policy_version 912255 (0.00086) [2022-07-10 22:10:23,681][25689] Fps is (10 sec: 5499.8, 60 sec: 5554.6, 300 sec: 5532.1). Total num frames: 934153216. Throughput: 0: 5808.6. Samples: 934162662. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:23,682][25689] Avg episode reward: [(0, '0.106')] [2022-07-10 22:10:24,572][26022] Updated weights on worker 0-0, policy_version 912265 (0.00083) [2022-07-10 22:10:26,812][26022] Updated weights on worker 0-0, policy_version 912275 (0.00090) [2022-07-10 22:10:28,186][26022] Updated weights on worker 0-0, policy_version 912285 (0.00087) [2022-07-10 22:10:28,703][25689] Fps is (10 sec: 5710.1, 60 sec: 5571.4, 300 sec: 5536.3). Total num frames: 934181888. Throughput: 0: 5798.9. Samples: 934179316. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:28,704][25689] Avg episode reward: [(0, '0.523')] [2022-07-10 22:10:30,515][26022] Updated weights on worker 0-0, policy_version 912295 (0.00091) [2022-07-10 22:10:31,928][26022] Updated weights on worker 0-0, policy_version 912305 (0.00088) [2022-07-10 22:10:33,813][25689] Fps is (10 sec: 5559.1, 60 sec: 5532.6, 300 sec: 5535.0). Total num frames: 934209536. Throughput: 0: 5785.7. Samples: 934212590. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:33,813][25689] Avg episode reward: [(0, '1.362')] [2022-07-10 22:10:33,947][26022] Updated weights on worker 0-0, policy_version 912315 (0.00089) [2022-07-10 22:10:35,677][26022] Updated weights on worker 0-0, policy_version 912325 (0.00096) [2022-07-10 22:10:37,690][26022] Updated weights on worker 0-0, policy_version 912335 (0.00102) [2022-07-10 22:10:38,880][25689] Fps is (10 sec: 5433.4, 60 sec: 5560.8, 300 sec: 5530.3). Total num frames: 934237184. Throughput: 0: 5760.9. Samples: 934245858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:38,881][25689] Avg episode reward: [(0, '1.154')] [2022-07-10 22:10:39,529][26022] Updated weights on worker 0-0, policy_version 912345 (0.00089) [2022-07-10 22:10:41,408][26022] Updated weights on worker 0-0, policy_version 912355 (0.00086) [2022-07-10 22:10:43,232][26022] Updated weights on worker 0-0, policy_version 912365 (0.00092) [2022-07-10 22:10:43,967][25689] Fps is (10 sec: 5546.2, 60 sec: 5520.4, 300 sec: 5532.3). Total num frames: 934265856. Throughput: 0: 4919.3. Samples: 934262454. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:43,968][25689] Avg episode reward: [(0, '1.160')] [2022-07-10 22:10:45,224][26022] Updated weights on worker 0-0, policy_version 912375 (0.00091) [2022-07-10 22:10:46,772][26022] Updated weights on worker 0-0, policy_version 912385 (0.00085) [2022-07-10 22:10:48,935][26022] Updated weights on worker 0-0, policy_version 912395 (0.00093) [2022-07-10 22:10:48,973][25689] Fps is (10 sec: 5478.6, 60 sec: 5520.7, 300 sec: 5523.1). Total num frames: 934292480. Throughput: 0: 5749.5. Samples: 934295860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:48,975][25689] Avg episode reward: [(0, '0.919')] [2022-07-10 22:10:50,368][26022] Updated weights on worker 0-0, policy_version 912405 (0.00087) [2022-07-10 22:10:52,667][26022] Updated weights on worker 0-0, policy_version 912415 (0.00093) [2022-07-10 22:10:54,020][25689] Fps is (10 sec: 5602.6, 60 sec: 5527.1, 300 sec: 5529.7). Total num frames: 934322176. Throughput: 0: 5789.6. Samples: 934329582. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-10 22:10:54,020][25689] Avg episode reward: [(0, '0.761')] [2022-07-10 22:10:54,091][26022] Updated weights on worker 0-0, policy_version 912425 (0.00082) [2022-07-10 22:10:56,127][26022] Updated weights on worker 0-0, policy_version 912435 (0.00099) [2022-07-10 22:10:57,715][26022] Updated weights on worker 0-0, policy_version 912445 (0.00090) [2022-07-10 22:10:59,030][25689] Fps is (10 sec: 5600.4, 60 sec: 5511.6, 300 sec: 5533.0). Total num frames: 934348800. Throughput: 0: 4981.8. Samples: 934346240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:10:59,030][25689] Avg episode reward: [(0, '0.692')] [2022-07-10 22:10:59,855][26022] Updated weights on worker 0-0, policy_version 912455 (0.00101) [2022-07-10 22:11:01,495][26022] Updated weights on worker 0-0, policy_version 912465 (0.00111) [2022-07-10 22:11:04,028][26022] Updated weights on worker 0-0, policy_version 912475 (0.00087) [2022-07-10 22:11:04,068][25689] Fps is (10 sec: 5197.7, 60 sec: 5512.8, 300 sec: 5529.4). Total num frames: 934374400. Throughput: 0: 5740.9. Samples: 934377846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:04,068][25689] Avg episode reward: [(0, '0.743')] [2022-07-10 22:11:05,593][26022] Updated weights on worker 0-0, policy_version 912485 (0.00095) [2022-07-10 22:11:07,567][26022] Updated weights on worker 0-0, policy_version 912495 (0.00107) [2022-07-10 22:11:09,083][25689] Fps is (10 sec: 5500.6, 60 sec: 5531.1, 300 sec: 5530.1). Total num frames: 934404096. Throughput: 0: 5741.5. Samples: 934411318. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:09,084][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 22:11:09,205][26022] Updated weights on worker 0-0, policy_version 912505 (0.00106) [2022-07-10 22:11:09,741][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:11:09,756][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000912508_934408192.pth [2022-07-10 22:11:09,756][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000910560_932413440.pth [2022-07-10 22:11:11,291][26022] Updated weights on worker 0-0, policy_version 912515 (0.00092) [2022-07-10 22:11:12,911][26022] Updated weights on worker 0-0, policy_version 912525 (0.00092) [2022-07-10 22:11:14,187][25689] Fps is (10 sec: 5666.9, 60 sec: 5512.4, 300 sec: 5532.3). Total num frames: 934431744. Throughput: 0: 4888.0. Samples: 934428156. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:14,187][25689] Avg episode reward: [(0, '0.375')] [2022-07-10 22:11:14,856][26022] Updated weights on worker 0-0, policy_version 912535 (0.00091) [2022-07-10 22:11:16,632][26022] Updated weights on worker 0-0, policy_version 912545 (0.00084) [2022-07-10 22:11:18,499][26022] Updated weights on worker 0-0, policy_version 912555 (0.00093) [2022-07-10 22:11:19,196][25689] Fps is (10 sec: 5467.8, 60 sec: 5531.5, 300 sec: 5536.0). Total num frames: 934459392. Throughput: 0: 5728.1. Samples: 934461752. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:19,197][25689] Avg episode reward: [(0, '0.435')] [2022-07-10 22:11:20,155][26022] Updated weights on worker 0-0, policy_version 912565 (0.00084) [2022-07-10 22:11:22,087][26022] Updated weights on worker 0-0, policy_version 912575 (0.00089) [2022-07-10 22:11:23,948][26022] Updated weights on worker 0-0, policy_version 912585 (0.00090) [2022-07-10 22:11:24,211][25689] Fps is (10 sec: 5618.5, 60 sec: 5532.0, 300 sec: 5532.7). Total num frames: 934488064. Throughput: 0: 5848.1. Samples: 934495646. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:24,213][25689] Avg episode reward: [(0, '-0.461')] [2022-07-10 22:11:25,763][26022] Updated weights on worker 0-0, policy_version 912595 (0.00084) [2022-07-10 22:11:27,722][26022] Updated weights on worker 0-0, policy_version 912605 (0.00091) [2022-07-10 22:11:29,228][25689] Fps is (10 sec: 5511.7, 60 sec: 5498.6, 300 sec: 5529.8). Total num frames: 934514688. Throughput: 0: 5014.8. Samples: 934512344. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:29,230][25689] Avg episode reward: [(0, '-0.387')] [2022-07-10 22:11:29,684][26022] Updated weights on worker 0-0, policy_version 912615 (0.00092) [2022-07-10 22:11:31,306][26022] Updated weights on worker 0-0, policy_version 912625 (0.01178) [2022-07-10 22:11:33,181][26022] Updated weights on worker 0-0, policy_version 912635 (0.00082) [2022-07-10 22:11:34,341][25689] Fps is (10 sec: 5559.3, 60 sec: 5532.1, 300 sec: 5538.4). Total num frames: 934544384. Throughput: 0: 5818.7. Samples: 934545430. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:34,342][25689] Avg episode reward: [(0, '-0.415')] [2022-07-10 22:11:34,951][26022] Updated weights on worker 0-0, policy_version 912645 (0.00093) [2022-07-10 22:11:36,941][26022] Updated weights on worker 0-0, policy_version 912655 (0.00087) [2022-07-10 22:11:38,791][26022] Updated weights on worker 0-0, policy_version 912665 (0.00093) [2022-07-10 22:11:39,390][25689] Fps is (10 sec: 5643.3, 60 sec: 5533.8, 300 sec: 5531.0). Total num frames: 934572032. Throughput: 0: 5786.5. Samples: 934578604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:39,390][25689] Avg episode reward: [(0, '-0.475')] [2022-07-10 22:11:40,647][26022] Updated weights on worker 0-0, policy_version 912675 (0.00090) [2022-07-10 22:11:42,335][26022] Updated weights on worker 0-0, policy_version 912685 (0.00086) [2022-07-10 22:11:44,317][26022] Updated weights on worker 0-0, policy_version 912695 (0.00087) [2022-07-10 22:11:44,398][25689] Fps is (10 sec: 5498.2, 60 sec: 5524.1, 300 sec: 5534.6). Total num frames: 934599680. Throughput: 0: 4934.1. Samples: 934595254. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:44,399][25689] Avg episode reward: [(0, '-0.186')] [2022-07-10 22:11:46,143][26022] Updated weights on worker 0-0, policy_version 912705 (0.00094) [2022-07-10 22:11:47,895][26022] Updated weights on worker 0-0, policy_version 912715 (0.00091) [2022-07-10 22:11:49,423][25689] Fps is (10 sec: 5511.4, 60 sec: 5539.3, 300 sec: 5531.5). Total num frames: 934627328. Throughput: 0: 5757.5. Samples: 934628614. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:49,423][25689] Avg episode reward: [(0, '0.550')] [2022-07-10 22:11:49,910][26022] Updated weights on worker 0-0, policy_version 912725 (0.00096) [2022-07-10 22:11:51,681][26022] Updated weights on worker 0-0, policy_version 912735 (0.00087) [2022-07-10 22:11:53,565][26022] Updated weights on worker 0-0, policy_version 912745 (0.00091) [2022-07-10 22:11:54,473][25689] Fps is (10 sec: 5590.3, 60 sec: 5522.0, 300 sec: 5534.3). Total num frames: 934656000. Throughput: 0: 5779.9. Samples: 934661790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:54,474][25689] Avg episode reward: [(0, '0.820')] [2022-07-10 22:11:55,276][26022] Updated weights on worker 0-0, policy_version 912755 (0.00088) [2022-07-10 22:11:57,239][26022] Updated weights on worker 0-0, policy_version 912765 (0.00089) [2022-07-10 22:11:58,975][26022] Updated weights on worker 0-0, policy_version 912775 (0.00087) [2022-07-10 22:11:59,488][25689] Fps is (10 sec: 5595.5, 60 sec: 5538.5, 300 sec: 5541.4). Total num frames: 934683648. Throughput: 0: 4969.6. Samples: 934678484. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:11:59,488][25689] Avg episode reward: [(0, '1.064')] [2022-07-10 22:12:00,945][26022] Updated weights on worker 0-0, policy_version 912785 (0.00092) [2022-07-10 22:12:03,119][26022] Updated weights on worker 0-0, policy_version 912795 (0.00085) [2022-07-10 22:12:04,510][25689] Fps is (10 sec: 5305.2, 60 sec: 5539.9, 300 sec: 5531.0). Total num frames: 934709248. Throughput: 0: 5680.2. Samples: 934709492. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:04,511][25689] Avg episode reward: [(0, '0.954')] [2022-07-10 22:12:05,226][26022] Updated weights on worker 0-0, policy_version 912805 (0.00085) [2022-07-10 22:12:06,802][26022] Updated weights on worker 0-0, policy_version 912815 (0.00100) [2022-07-10 22:12:08,783][26022] Updated weights on worker 0-0, policy_version 912825 (0.00090) [2022-07-10 22:12:09,529][25689] Fps is (10 sec: 5303.1, 60 sec: 5505.7, 300 sec: 5529.0). Total num frames: 934736896. Throughput: 0: 5674.2. Samples: 934742700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:09,529][25689] Avg episode reward: [(0, '0.562')] [2022-07-10 22:12:10,527][26022] Updated weights on worker 0-0, policy_version 912835 (0.00096) [2022-07-10 22:12:12,615][26022] Updated weights on worker 0-0, policy_version 912845 (0.00085) [2022-07-10 22:12:14,347][26022] Updated weights on worker 0-0, policy_version 912855 (0.00090) [2022-07-10 22:12:14,651][25689] Fps is (10 sec: 5452.8, 60 sec: 5504.1, 300 sec: 5530.9). Total num frames: 934764544. Throughput: 0: 4833.5. Samples: 934759320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:14,651][25689] Avg episode reward: [(0, '0.529')] [2022-07-10 22:12:16,171][26022] Updated weights on worker 0-0, policy_version 912865 (0.00077) [2022-07-10 22:12:18,185][26022] Updated weights on worker 0-0, policy_version 912876 (0.00056) [2022-07-10 22:12:19,673][25689] Fps is (10 sec: 5652.9, 60 sec: 5536.8, 300 sec: 5534.1). Total num frames: 934794240. Throughput: 0: 5656.0. Samples: 934792650. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:19,674][25689] Avg episode reward: [(0, '0.122')] [2022-07-10 22:12:19,890][26022] Updated weights on worker 0-0, policy_version 912886 (0.00114) [2022-07-10 22:12:21,939][26022] Updated weights on worker 0-0, policy_version 912896 (0.00085) [2022-07-10 22:12:23,508][26022] Updated weights on worker 0-0, policy_version 912906 (0.00084) [2022-07-10 22:12:24,727][25689] Fps is (10 sec: 5589.5, 60 sec: 5499.4, 300 sec: 5530.2). Total num frames: 934820864. Throughput: 0: 5779.0. Samples: 934826324. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:24,729][25689] Avg episode reward: [(0, '0.181')] [2022-07-10 22:12:25,432][26022] Updated weights on worker 0-0, policy_version 912916 (0.00084) [2022-07-10 22:12:27,541][26022] Updated weights on worker 0-0, policy_version 912926 (0.00085) [2022-07-10 22:12:29,039][26022] Updated weights on worker 0-0, policy_version 912936 (0.00085) [2022-07-10 22:12:29,771][25689] Fps is (10 sec: 5476.1, 60 sec: 5530.8, 300 sec: 5534.6). Total num frames: 934849536. Throughput: 0: 4958.1. Samples: 934843062. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:29,772][25689] Avg episode reward: [(0, '-0.207')] [2022-07-10 22:12:31,041][26022] Updated weights on worker 0-0, policy_version 912946 (0.00081) [2022-07-10 22:12:32,881][26022] Updated weights on worker 0-0, policy_version 912956 (0.00088) [2022-07-10 22:12:34,558][26022] Updated weights on worker 0-0, policy_version 912966 (0.00083) [2022-07-10 22:12:34,827][25689] Fps is (10 sec: 5677.7, 60 sec: 5519.1, 300 sec: 5537.6). Total num frames: 934878208. Throughput: 0: 5815.6. Samples: 934876654. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:34,827][25689] Avg episode reward: [(0, '-0.143')] [2022-07-10 22:12:36,489][26022] Updated weights on worker 0-0, policy_version 912976 (0.00085) [2022-07-10 22:12:38,249][26022] Updated weights on worker 0-0, policy_version 912986 (0.00087) [2022-07-10 22:12:39,830][25689] Fps is (10 sec: 5598.9, 60 sec: 5523.2, 300 sec: 5534.3). Total num frames: 934905856. Throughput: 0: 5847.9. Samples: 934910526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:39,831][25689] Avg episode reward: [(0, '0.206')] [2022-07-10 22:12:40,156][26022] Updated weights on worker 0-0, policy_version 912996 (0.00094) [2022-07-10 22:12:41,803][26022] Updated weights on worker 0-0, policy_version 913006 (0.00896) [2022-07-10 22:12:43,631][26022] Updated weights on worker 0-0, policy_version 913016 (0.00083) [2022-07-10 22:12:44,836][25689] Fps is (10 sec: 5729.3, 60 sec: 5557.4, 300 sec: 5541.9). Total num frames: 934935552. Throughput: 0: 5033.2. Samples: 934927536. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:44,837][25689] Avg episode reward: [(0, '-0.703')] [2022-07-10 22:12:45,480][26022] Updated weights on worker 0-0, policy_version 913026 (0.00081) [2022-07-10 22:12:47,427][26022] Updated weights on worker 0-0, policy_version 913036 (0.00086) [2022-07-10 22:12:49,077][26022] Updated weights on worker 0-0, policy_version 913046 (0.00081) [2022-07-10 22:12:49,845][25689] Fps is (10 sec: 5623.6, 60 sec: 5541.8, 300 sec: 5536.5). Total num frames: 934962176. Throughput: 0: 5905.4. Samples: 934961606. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:49,846][25689] Avg episode reward: [(0, '-1.020')] [2022-07-10 22:12:51,046][26022] Updated weights on worker 0-0, policy_version 913056 (0.00086) [2022-07-10 22:12:52,939][26022] Updated weights on worker 0-0, policy_version 913066 (0.00635) [2022-07-10 22:12:54,702][26022] Updated weights on worker 0-0, policy_version 913076 (0.00092) [2022-07-10 22:12:54,903][25689] Fps is (10 sec: 5594.1, 60 sec: 5558.0, 300 sec: 5542.8). Total num frames: 934991872. Throughput: 0: 5913.8. Samples: 934995384. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:54,904][25689] Avg episode reward: [(0, '-0.410')] [2022-07-10 22:12:56,585][26022] Updated weights on worker 0-0, policy_version 913086 (0.00105) [2022-07-10 22:12:58,074][26022] Updated weights on worker 0-0, policy_version 913096 (0.00087) [2022-07-10 22:12:59,950][25689] Fps is (10 sec: 5573.2, 60 sec: 5538.1, 300 sec: 5538.8). Total num frames: 935018496. Throughput: 0: 5053.2. Samples: 935012198. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:12:59,951][25689] Avg episode reward: [(0, '-1.025')] [2022-07-10 22:13:00,104][26022] Updated weights on worker 0-0, policy_version 913106 (0.00091) [2022-07-10 22:13:02,345][26022] Updated weights on worker 0-0, policy_version 913116 (0.00086) [2022-07-10 22:13:04,071][26022] Updated weights on worker 0-0, policy_version 913126 (0.01329) [2022-07-10 22:13:04,951][25689] Fps is (10 sec: 5197.5, 60 sec: 5540.1, 300 sec: 5535.8). Total num frames: 935044096. Throughput: 0: 5782.2. Samples: 935043848. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:04,952][25689] Avg episode reward: [(0, '-0.816')] [2022-07-10 22:13:06,137][26022] Updated weights on worker 0-0, policy_version 913136 (0.00095) [2022-07-10 22:13:07,645][26022] Updated weights on worker 0-0, policy_version 913146 (0.00083) [2022-07-10 22:13:09,903][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:13:09,919][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000913156_935071744.pth [2022-07-10 22:13:09,919][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000911208_933076992.pth [2022-07-10 22:13:09,925][26022] Updated weights on worker 0-0, policy_version 913156 (0.00091) [2022-07-10 22:13:09,971][25689] Fps is (10 sec: 5415.9, 60 sec: 5556.9, 300 sec: 5539.5). Total num frames: 935072768. Throughput: 0: 5743.9. Samples: 935077210. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:09,971][25689] Avg episode reward: [(0, '-0.448')] [2022-07-10 22:13:11,339][26022] Updated weights on worker 0-0, policy_version 913166 (0.00081) [2022-07-10 22:13:13,302][26022] Updated weights on worker 0-0, policy_version 913176 (0.00094) [2022-07-10 22:13:15,138][25689] Fps is (10 sec: 5629.5, 60 sec: 5569.7, 300 sec: 5529.6). Total num frames: 935101440. Throughput: 0: 5703.5. Samples: 935110790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:15,138][25689] Avg episode reward: [(0, '0.595')] [2022-07-10 22:13:15,271][26022] Updated weights on worker 0-0, policy_version 913186 (0.00086) [2022-07-10 22:13:16,847][26022] Updated weights on worker 0-0, policy_version 913196 (0.00093) [2022-07-10 22:13:18,877][26022] Updated weights on worker 0-0, policy_version 913206 (0.00053) [2022-07-10 22:13:20,140][25689] Fps is (10 sec: 5740.0, 60 sec: 5571.6, 300 sec: 5543.8). Total num frames: 935131136. Throughput: 0: 5728.8. Samples: 935127858. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:20,140][25689] Avg episode reward: [(0, '0.376')] [2022-07-10 22:13:20,467][26022] Updated weights on worker 0-0, policy_version 913216 (0.00087) [2022-07-10 22:13:22,422][26022] Updated weights on worker 0-0, policy_version 913226 (0.00085) [2022-07-10 22:13:24,222][26022] Updated weights on worker 0-0, policy_version 913236 (0.00087) [2022-07-10 22:13:25,195][25689] Fps is (10 sec: 5701.9, 60 sec: 5588.4, 300 sec: 5539.8). Total num frames: 935158784. Throughput: 0: 5814.0. Samples: 935161542. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:25,195][25689] Avg episode reward: [(0, '-0.170')] [2022-07-10 22:13:26,088][26022] Updated weights on worker 0-0, policy_version 913246 (0.00094) [2022-07-10 22:13:28,004][26022] Updated weights on worker 0-0, policy_version 913256 (0.00092) [2022-07-10 22:13:29,588][26022] Updated weights on worker 0-0, policy_version 913266 (0.00091) [2022-07-10 22:13:30,223][25689] Fps is (10 sec: 5382.4, 60 sec: 5556.0, 300 sec: 5534.7). Total num frames: 935185408. Throughput: 0: 5802.4. Samples: 935194720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:30,223][25689] Avg episode reward: [(0, '-0.196')] [2022-07-10 22:13:31,702][26022] Updated weights on worker 0-0, policy_version 913276 (0.00090) [2022-07-10 22:13:33,503][26022] Updated weights on worker 0-0, policy_version 913286 (0.00086) [2022-07-10 22:13:35,263][26022] Updated weights on worker 0-0, policy_version 913296 (0.00087) [2022-07-10 22:13:35,292][25689] Fps is (10 sec: 5577.5, 60 sec: 5571.7, 300 sec: 5537.5). Total num frames: 935215104. Throughput: 0: 4992.7. Samples: 935211416. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:35,293][25689] Avg episode reward: [(0, '-0.243')] [2022-07-10 22:13:37,260][26022] Updated weights on worker 0-0, policy_version 913306 (0.00099) [2022-07-10 22:13:38,919][26022] Updated weights on worker 0-0, policy_version 913316 (0.00092) [2022-07-10 22:13:40,295][25689] Fps is (10 sec: 5693.2, 60 sec: 5571.7, 300 sec: 5538.5). Total num frames: 935242752. Throughput: 0: 5812.6. Samples: 935245014. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:40,296][25689] Avg episode reward: [(0, '-0.392')] [2022-07-10 22:13:40,843][26022] Updated weights on worker 0-0, policy_version 913326 (0.00094) [2022-07-10 22:13:42,696][26022] Updated weights on worker 0-0, policy_version 913336 (0.00085) [2022-07-10 22:13:44,447][26022] Updated weights on worker 0-0, policy_version 913346 (0.00088) [2022-07-10 22:13:45,302][25689] Fps is (10 sec: 5524.1, 60 sec: 5537.7, 300 sec: 5535.1). Total num frames: 935270400. Throughput: 0: 5823.0. Samples: 935278628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:45,303][25689] Avg episode reward: [(0, '-0.876')] [2022-07-10 22:13:46,423][26022] Updated weights on worker 0-0, policy_version 913356 (0.00088) [2022-07-10 22:13:48,087][26022] Updated weights on worker 0-0, policy_version 913366 (0.00089) [2022-07-10 22:13:50,241][26022] Updated weights on worker 0-0, policy_version 913376 (0.00089) [2022-07-10 22:13:50,325][25689] Fps is (10 sec: 5410.8, 60 sec: 5536.4, 300 sec: 5536.8). Total num frames: 935297024. Throughput: 0: 4997.9. Samples: 935295192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:50,332][25689] Avg episode reward: [(0, '-0.247')] [2022-07-10 22:13:51,866][26022] Updated weights on worker 0-0, policy_version 913386 (0.00095) [2022-07-10 22:13:53,837][26022] Updated weights on worker 0-0, policy_version 913396 (0.00091) [2022-07-10 22:13:55,457][25689] Fps is (10 sec: 5445.4, 60 sec: 5512.9, 300 sec: 5538.1). Total num frames: 935325696. Throughput: 0: 5793.1. Samples: 935328232. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:13:55,459][25689] Avg episode reward: [(0, '-0.077')] [2022-07-10 22:13:55,650][26022] Updated weights on worker 0-0, policy_version 913406 (0.00096) [2022-07-10 22:13:57,312][26022] Updated weights on worker 0-0, policy_version 913416 (0.00084) [2022-07-10 22:13:59,327][26022] Updated weights on worker 0-0, policy_version 913426 (0.00084) [2022-07-10 22:14:00,545][25689] Fps is (10 sec: 5611.4, 60 sec: 5542.9, 300 sec: 5547.0). Total num frames: 935354368. Throughput: 0: 5754.4. Samples: 935361538. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:00,546][25689] Avg episode reward: [(0, '-0.497')] [2022-07-10 22:14:00,940][26022] Updated weights on worker 0-0, policy_version 913436 (0.00088) [2022-07-10 22:14:03,284][26022] Updated weights on worker 0-0, policy_version 913446 (0.00084) [2022-07-10 22:14:05,479][26022] Updated weights on worker 0-0, policy_version 913456 (0.00088) [2022-07-10 22:14:05,578][25689] Fps is (10 sec: 5261.2, 60 sec: 5523.1, 300 sec: 5533.0). Total num frames: 935378944. Throughput: 0: 4805.6. Samples: 935376064. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:05,578][25689] Avg episode reward: [(0, '-0.295')] [2022-07-10 22:14:06,986][26022] Updated weights on worker 0-0, policy_version 913466 (0.00082) [2022-07-10 22:14:09,002][26022] Updated weights on worker 0-0, policy_version 913476 (0.00092) [2022-07-10 22:14:10,629][25689] Fps is (10 sec: 5381.6, 60 sec: 5537.1, 300 sec: 5543.2). Total num frames: 935408640. Throughput: 0: 5619.4. Samples: 935409288. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:10,631][25689] Avg episode reward: [(0, '-0.341')] [2022-07-10 22:14:10,696][26022] Updated weights on worker 0-0, policy_version 913486 (0.00098) [2022-07-10 22:14:12,786][26022] Updated weights on worker 0-0, policy_version 913496 (0.00087) [2022-07-10 22:14:14,477][26022] Updated weights on worker 0-0, policy_version 913506 (0.00086) [2022-07-10 22:14:15,710][25689] Fps is (10 sec: 5659.7, 60 sec: 5528.1, 300 sec: 5531.4). Total num frames: 935436288. Throughput: 0: 5654.9. Samples: 935442760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:15,710][25689] Avg episode reward: [(0, '-0.114')] [2022-07-10 22:14:16,337][26022] Updated weights on worker 0-0, policy_version 913516 (0.00093) [2022-07-10 22:14:18,068][26022] Updated weights on worker 0-0, policy_version 913526 (0.00090) [2022-07-10 22:14:19,939][26022] Updated weights on worker 0-0, policy_version 913536 (0.00083) [2022-07-10 22:14:20,758][25689] Fps is (10 sec: 5560.4, 60 sec: 5506.9, 300 sec: 5537.4). Total num frames: 935464960. Throughput: 0: 4845.3. Samples: 935459478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:20,759][25689] Avg episode reward: [(0, '-0.251')] [2022-07-10 22:14:21,921][26022] Updated weights on worker 0-0, policy_version 913546 (0.00087) [2022-07-10 22:14:23,691][26022] Updated weights on worker 0-0, policy_version 913556 (0.00084) [2022-07-10 22:14:25,384][26022] Updated weights on worker 0-0, policy_version 913566 (0.00091) [2022-07-10 22:14:25,800][25689] Fps is (10 sec: 5581.7, 60 sec: 5508.2, 300 sec: 5537.0). Total num frames: 935492608. Throughput: 0: 5782.8. Samples: 935493004. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:25,800][25689] Avg episode reward: [(0, '-0.724')] [2022-07-10 22:14:27,434][26022] Updated weights on worker 0-0, policy_version 913576 (0.00087) [2022-07-10 22:14:29,150][26022] Updated weights on worker 0-0, policy_version 913586 (0.00093) [2022-07-10 22:14:30,808][25689] Fps is (10 sec: 5502.3, 60 sec: 5526.9, 300 sec: 5531.0). Total num frames: 935520256. Throughput: 0: 5803.7. Samples: 935526396. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:30,808][25689] Avg episode reward: [(0, '-0.202')] [2022-07-10 22:14:31,150][26022] Updated weights on worker 0-0, policy_version 913596 (0.00092) [2022-07-10 22:14:32,970][26022] Updated weights on worker 0-0, policy_version 913606 (0.00090) [2022-07-10 22:14:34,749][26022] Updated weights on worker 0-0, policy_version 913616 (0.00089) [2022-07-10 22:14:35,943][25689] Fps is (10 sec: 5552.5, 60 sec: 5504.0, 300 sec: 5538.9). Total num frames: 935548928. Throughput: 0: 4959.2. Samples: 935543102. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:35,944][25689] Avg episode reward: [(0, '-0.173')] [2022-07-10 22:14:36,650][26022] Updated weights on worker 0-0, policy_version 913626 (0.00111) [2022-07-10 22:14:38,342][26022] Updated weights on worker 0-0, policy_version 913636 (0.00089) [2022-07-10 22:14:40,105][26022] Updated weights on worker 0-0, policy_version 913646 (0.00098) [2022-07-10 22:14:40,969][25689] Fps is (10 sec: 5643.3, 60 sec: 5518.8, 300 sec: 5531.8). Total num frames: 935577600. Throughput: 0: 5800.1. Samples: 935576704. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 22:14:40,970][25689] Avg episode reward: [(0, '0.276')] [2022-07-10 22:14:42,053][26022] Updated weights on worker 0-0, policy_version 913656 (0.00080) [2022-07-10 22:14:43,874][26022] Updated weights on worker 0-0, policy_version 913666 (0.00090) [2022-07-10 22:14:45,767][26022] Updated weights on worker 0-0, policy_version 913676 (0.00096) [2022-07-10 22:14:46,000][25689] Fps is (10 sec: 5600.2, 60 sec: 5516.7, 300 sec: 5534.9). Total num frames: 935605248. Throughput: 0: 5824.0. Samples: 935610648. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:14:46,000][25689] Avg episode reward: [(0, '0.134')] [2022-07-10 22:14:47,492][26022] Updated weights on worker 0-0, policy_version 913686 (0.00093) [2022-07-10 22:14:49,367][26022] Updated weights on worker 0-0, policy_version 913696 (0.00093) [2022-07-10 22:14:51,029][25689] Fps is (10 sec: 5496.6, 60 sec: 5533.0, 300 sec: 5529.6). Total num frames: 935632896. Throughput: 0: 4988.5. Samples: 935627272. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:14:51,033][25689] Avg episode reward: [(0, '0.620')] [2022-07-10 22:14:51,305][26022] Updated weights on worker 0-0, policy_version 913706 (0.00092) [2022-07-10 22:14:53,027][26022] Updated weights on worker 0-0, policy_version 913716 (0.00091) [2022-07-10 22:14:54,962][26022] Updated weights on worker 0-0, policy_version 913726 (0.00087) [2022-07-10 22:14:56,083][25689] Fps is (10 sec: 5687.3, 60 sec: 5557.0, 300 sec: 5536.0). Total num frames: 935662592. Throughput: 0: 5829.5. Samples: 935660506. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:14:56,083][25689] Avg episode reward: [(0, '1.059')] [2022-07-10 22:14:56,724][26022] Updated weights on worker 0-0, policy_version 913736 (0.00087) [2022-07-10 22:14:58,546][26022] Updated weights on worker 0-0, policy_version 913746 (0.00091) [2022-07-10 22:15:00,528][26022] Updated weights on worker 0-0, policy_version 913756 (0.00093) [2022-07-10 22:15:01,085][25689] Fps is (10 sec: 5498.8, 60 sec: 5514.1, 300 sec: 5536.9). Total num frames: 935688192. Throughput: 0: 5835.1. Samples: 935694082. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:01,086][25689] Avg episode reward: [(0, '0.940')] [2022-07-10 22:15:02,392][26022] Updated weights on worker 0-0, policy_version 913766 (0.00121) [2022-07-10 22:15:04,595][26022] Updated weights on worker 0-0, policy_version 913776 (0.00089) [2022-07-10 22:15:06,116][25689] Fps is (10 sec: 5307.3, 60 sec: 5565.0, 300 sec: 5533.4). Total num frames: 935715840. Throughput: 0: 4877.6. Samples: 935708764. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:06,116][25689] Avg episode reward: [(0, '0.806')] [2022-07-10 22:15:06,198][26022] Updated weights on worker 0-0, policy_version 913786 (0.00108) [2022-07-10 22:15:08,305][26022] Updated weights on worker 0-0, policy_version 913796 (0.00089) [2022-07-10 22:15:09,957][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:15:09,970][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000913806_935737344.pth [2022-07-10 22:15:09,970][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000911856_933740544.pth [2022-07-10 22:15:09,977][26022] Updated weights on worker 0-0, policy_version 913806 (0.00085) [2022-07-10 22:15:11,134][25689] Fps is (10 sec: 5502.9, 60 sec: 5534.3, 300 sec: 5531.2). Total num frames: 935743488. Throughput: 0: 5711.9. Samples: 935742106. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:11,136][25689] Avg episode reward: [(0, '0.440')] [2022-07-10 22:15:11,958][26022] Updated weights on worker 0-0, policy_version 913816 (0.00090) [2022-07-10 22:15:13,500][26022] Updated weights on worker 0-0, policy_version 913826 (0.00085) [2022-07-10 22:15:15,589][26022] Updated weights on worker 0-0, policy_version 913836 (0.00095) [2022-07-10 22:15:16,195][25689] Fps is (10 sec: 5587.6, 60 sec: 5552.9, 300 sec: 5537.6). Total num frames: 935772160. Throughput: 0: 5750.9. Samples: 935776170. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:16,195][25689] Avg episode reward: [(0, '0.834')] [2022-07-10 22:15:17,162][26022] Updated weights on worker 0-0, policy_version 913846 (0.00090) [2022-07-10 22:15:19,026][26022] Updated weights on worker 0-0, policy_version 913856 (0.00087) [2022-07-10 22:15:20,946][26022] Updated weights on worker 0-0, policy_version 913866 (0.00094) [2022-07-10 22:15:21,198][25689] Fps is (10 sec: 5697.6, 60 sec: 5557.1, 300 sec: 5537.9). Total num frames: 935800832. Throughput: 0: 4919.5. Samples: 935793028. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:21,199][25689] Avg episode reward: [(0, '-0.124')] [2022-07-10 22:15:22,646][26022] Updated weights on worker 0-0, policy_version 913876 (0.00086) [2022-07-10 22:15:24,599][26022] Updated weights on worker 0-0, policy_version 913886 (0.00090) [2022-07-10 22:15:26,208][25689] Fps is (10 sec: 5624.8, 60 sec: 5560.1, 300 sec: 5534.7). Total num frames: 935828480. Throughput: 0: 5860.1. Samples: 935826506. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:26,208][25689] Avg episode reward: [(0, '0.043')] [2022-07-10 22:15:26,316][26022] Updated weights on worker 0-0, policy_version 913896 (0.00089) [2022-07-10 22:15:28,239][26022] Updated weights on worker 0-0, policy_version 913906 (0.00091) [2022-07-10 22:15:30,263][26022] Updated weights on worker 0-0, policy_version 913916 (0.00088) [2022-07-10 22:15:31,213][25689] Fps is (10 sec: 5521.7, 60 sec: 5560.3, 300 sec: 5536.6). Total num frames: 935856128. Throughput: 0: 5862.7. Samples: 935859822. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:31,213][25689] Avg episode reward: [(0, '-0.055')] [2022-07-10 22:15:31,892][26022] Updated weights on worker 0-0, policy_version 913926 (0.00089) [2022-07-10 22:15:33,866][26022] Updated weights on worker 0-0, policy_version 913936 (0.00075) [2022-07-10 22:15:35,589][26022] Updated weights on worker 0-0, policy_version 913946 (0.00090) [2022-07-10 22:15:36,264][25689] Fps is (10 sec: 5396.8, 60 sec: 5534.1, 300 sec: 5533.5). Total num frames: 935882752. Throughput: 0: 5841.0. Samples: 935893394. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:36,265][25689] Avg episode reward: [(0, '-0.654')] [2022-07-10 22:15:37,586][26022] Updated weights on worker 0-0, policy_version 913956 (0.01014) [2022-07-10 22:15:39,454][26022] Updated weights on worker 0-0, policy_version 913966 (0.00084) [2022-07-10 22:15:41,173][26022] Updated weights on worker 0-0, policy_version 913976 (0.00614) [2022-07-10 22:15:41,270][25689] Fps is (10 sec: 5498.3, 60 sec: 5536.0, 300 sec: 5535.0). Total num frames: 935911424. Throughput: 0: 5820.0. Samples: 935909844. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:41,270][25689] Avg episode reward: [(0, '-1.192')] [2022-07-10 22:15:42,880][26022] Updated weights on worker 0-0, policy_version 913986 (0.00088) [2022-07-10 22:15:44,867][26022] Updated weights on worker 0-0, policy_version 913996 (0.00089) [2022-07-10 22:15:46,273][25689] Fps is (10 sec: 5729.5, 60 sec: 5555.5, 300 sec: 5541.9). Total num frames: 935940096. Throughput: 0: 5841.6. Samples: 935943718. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:46,273][25689] Avg episode reward: [(0, '-1.873')] [2022-07-10 22:15:46,547][26022] Updated weights on worker 0-0, policy_version 914006 (0.00084) [2022-07-10 22:15:48,566][26022] Updated weights on worker 0-0, policy_version 914016 (0.00094) [2022-07-10 22:15:50,130][26022] Updated weights on worker 0-0, policy_version 914026 (0.00094) [2022-07-10 22:15:51,290][25689] Fps is (10 sec: 5518.3, 60 sec: 5539.7, 300 sec: 5532.2). Total num frames: 935966720. Throughput: 0: 5849.9. Samples: 935977272. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:51,291][25689] Avg episode reward: [(0, '-1.283')] [2022-07-10 22:15:52,120][26022] Updated weights on worker 0-0, policy_version 914036 (0.00084) [2022-07-10 22:15:54,104][26022] Updated weights on worker 0-0, policy_version 914046 (0.00088) [2022-07-10 22:15:55,839][26022] Updated weights on worker 0-0, policy_version 914056 (0.00082) [2022-07-10 22:15:56,336][25689] Fps is (10 sec: 5596.6, 60 sec: 5540.3, 300 sec: 5541.8). Total num frames: 935996416. Throughput: 0: 5010.6. Samples: 935993966. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:15:56,336][25689] Avg episode reward: [(0, '-1.777')] [2022-07-10 22:15:57,688][26022] Updated weights on worker 0-0, policy_version 914066 (0.00082) [2022-07-10 22:15:59,346][26022] Updated weights on worker 0-0, policy_version 914076 (0.00085) [2022-07-10 22:16:01,227][26022] Updated weights on worker 0-0, policy_version 914086 (0.00084) [2022-07-10 22:16:01,370][25689] Fps is (10 sec: 5689.2, 60 sec: 5571.5, 300 sec: 5548.8). Total num frames: 936024064. Throughput: 0: 5879.5. Samples: 936028022. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:01,370][25689] Avg episode reward: [(0, '-1.713')] [2022-07-10 22:16:03,530][26022] Updated weights on worker 0-0, policy_version 914096 (0.00086) [2022-07-10 22:16:05,284][26022] Updated weights on worker 0-0, policy_version 914106 (0.00086) [2022-07-10 22:16:06,378][25689] Fps is (10 sec: 5302.4, 60 sec: 5539.5, 300 sec: 5535.1). Total num frames: 936049664. Throughput: 0: 5781.6. Samples: 936059960. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:06,379][25689] Avg episode reward: [(0, '-0.750')] [2022-07-10 22:16:06,995][26022] Updated weights on worker 0-0, policy_version 914116 (0.00081) [2022-07-10 22:16:09,037][26022] Updated weights on worker 0-0, policy_version 914126 (0.00088) [2022-07-10 22:16:10,711][26022] Updated weights on worker 0-0, policy_version 914136 (0.00087) [2022-07-10 22:16:11,399][25689] Fps is (10 sec: 5411.5, 60 sec: 5556.3, 300 sec: 5540.1). Total num frames: 936078336. Throughput: 0: 4930.1. Samples: 936076410. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:11,400][25689] Avg episode reward: [(0, '-1.433')] [2022-07-10 22:16:12,602][26022] Updated weights on worker 0-0, policy_version 914146 (0.00089) [2022-07-10 22:16:14,379][26022] Updated weights on worker 0-0, policy_version 914156 (0.00093) [2022-07-10 22:16:16,284][26022] Updated weights on worker 0-0, policy_version 914166 (0.00088) [2022-07-10 22:16:16,523][25689] Fps is (10 sec: 5652.6, 60 sec: 5550.5, 300 sec: 5541.4). Total num frames: 936107008. Throughput: 0: 5757.0. Samples: 936110182. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:16,523][25689] Avg episode reward: [(0, '-0.847')] [2022-07-10 22:16:18,063][26022] Updated weights on worker 0-0, policy_version 914176 (0.00094) [2022-07-10 22:16:19,934][26022] Updated weights on worker 0-0, policy_version 914186 (0.00086) [2022-07-10 22:16:21,600][25689] Fps is (10 sec: 5621.2, 60 sec: 5543.7, 300 sec: 5540.3). Total num frames: 936135680. Throughput: 0: 5726.6. Samples: 936143874. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:21,601][25689] Avg episode reward: [(0, '-0.828')] [2022-07-10 22:16:21,701][26022] Updated weights on worker 0-0, policy_version 914196 (0.00086) [2022-07-10 22:16:23,529][26022] Updated weights on worker 0-0, policy_version 914206 (0.00087) [2022-07-10 22:16:25,286][26022] Updated weights on worker 0-0, policy_version 914216 (0.00102) [2022-07-10 22:16:26,617][25689] Fps is (10 sec: 5579.7, 60 sec: 5543.0, 300 sec: 5543.7). Total num frames: 936163328. Throughput: 0: 4996.7. Samples: 936161086. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:26,618][25689] Avg episode reward: [(0, '-0.107')] [2022-07-10 22:16:27,046][26022] Updated weights on worker 0-0, policy_version 914226 (0.00081) [2022-07-10 22:16:28,959][26022] Updated weights on worker 0-0, policy_version 914236 (0.00083) [2022-07-10 22:16:30,915][26022] Updated weights on worker 0-0, policy_version 914246 (0.00092) [2022-07-10 22:16:31,665][25689] Fps is (10 sec: 5595.8, 60 sec: 5556.0, 300 sec: 5541.5). Total num frames: 936192000. Throughput: 0: 5828.5. Samples: 936194532. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:31,665][25689] Avg episode reward: [(0, '-0.152')] [2022-07-10 22:16:32,677][26022] Updated weights on worker 0-0, policy_version 914256 (0.00088) [2022-07-10 22:16:34,603][26022] Updated weights on worker 0-0, policy_version 914266 (0.00089) [2022-07-10 22:16:36,233][26022] Updated weights on worker 0-0, policy_version 914276 (0.00088) [2022-07-10 22:16:36,713][25689] Fps is (10 sec: 5781.3, 60 sec: 5607.2, 300 sec: 5548.4). Total num frames: 936221696. Throughput: 0: 5834.9. Samples: 936227988. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:36,713][25689] Avg episode reward: [(0, '-0.019')] [2022-07-10 22:16:38,196][26022] Updated weights on worker 0-0, policy_version 914286 (0.00086) [2022-07-10 22:16:40,022][26022] Updated weights on worker 0-0, policy_version 914296 (0.00099) [2022-07-10 22:16:41,767][25689] Fps is (10 sec: 5575.2, 60 sec: 5568.8, 300 sec: 5544.1). Total num frames: 936248320. Throughput: 0: 4988.6. Samples: 936244478. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:41,767][25689] Avg episode reward: [(0, '0.909')] [2022-07-10 22:16:41,969][26022] Updated weights on worker 0-0, policy_version 914306 (0.00082) [2022-07-10 22:16:43,875][26022] Updated weights on worker 0-0, policy_version 914316 (0.00096) [2022-07-10 22:16:45,730][26022] Updated weights on worker 0-0, policy_version 914326 (0.00093) [2022-07-10 22:16:46,780][25689] Fps is (10 sec: 5289.0, 60 sec: 5534.0, 300 sec: 5540.9). Total num frames: 936274944. Throughput: 0: 5793.0. Samples: 936277894. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:46,781][25689] Avg episode reward: [(0, '0.512')] [2022-07-10 22:16:47,480][26022] Updated weights on worker 0-0, policy_version 914336 (0.00339) [2022-07-10 22:16:49,448][26022] Updated weights on worker 0-0, policy_version 914346 (0.00097) [2022-07-10 22:16:51,135][26022] Updated weights on worker 0-0, policy_version 914356 (0.00098) [2022-07-10 22:16:51,787][25689] Fps is (10 sec: 5620.5, 60 sec: 5585.7, 300 sec: 5545.1). Total num frames: 936304640. Throughput: 0: 5786.3. Samples: 936310966. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:51,788][25689] Avg episode reward: [(0, '0.976')] [2022-07-10 22:16:53,207][26022] Updated weights on worker 0-0, policy_version 914366 (0.00088) [2022-07-10 22:16:54,825][26022] Updated weights on worker 0-0, policy_version 914376 (0.00111) [2022-07-10 22:16:56,713][26022] Updated weights on worker 0-0, policy_version 914386 (0.00085) [2022-07-10 22:16:56,923][25689] Fps is (10 sec: 5552.8, 60 sec: 5526.7, 300 sec: 5539.4). Total num frames: 936331264. Throughput: 0: 4933.4. Samples: 936327694. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:16:56,923][25689] Avg episode reward: [(0, '0.864')] [2022-07-10 22:16:58,422][26022] Updated weights on worker 0-0, policy_version 914396 (0.00086) [2022-07-10 22:17:00,504][26022] Updated weights on worker 0-0, policy_version 914406 (0.00084) [2022-07-10 22:17:01,975][25689] Fps is (10 sec: 5327.3, 60 sec: 5525.1, 300 sec: 5545.7). Total num frames: 936358912. Throughput: 0: 5785.5. Samples: 936361392. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:01,975][25689] Avg episode reward: [(0, '0.342')] [2022-07-10 22:17:02,564][26022] Updated weights on worker 0-0, policy_version 914416 (0.00085) [2022-07-10 22:17:04,428][26022] Updated weights on worker 0-0, policy_version 914426 (0.00088) [2022-07-10 22:17:06,315][26022] Updated weights on worker 0-0, policy_version 914436 (0.00079) [2022-07-10 22:17:06,989][25689] Fps is (10 sec: 5493.1, 60 sec: 5558.3, 300 sec: 5545.8). Total num frames: 936386560. Throughput: 0: 5681.4. Samples: 936392710. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:06,990][25689] Avg episode reward: [(0, '0.101')] [2022-07-10 22:17:08,214][26022] Updated weights on worker 0-0, policy_version 914446 (0.00088) [2022-07-10 22:17:09,904][26022] Updated weights on worker 0-0, policy_version 914456 (0.00096) [2022-07-10 22:17:10,011][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:17:10,025][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000914457_936403968.pth [2022-07-10 22:17:10,025][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000912508_934408192.pth [2022-07-10 22:17:12,020][25689] Fps is (10 sec: 5300.6, 60 sec: 5506.7, 300 sec: 5540.6). Total num frames: 936412160. Throughput: 0: 4869.0. Samples: 936409482. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:12,021][25689] Avg episode reward: [(0, '-0.119')] [2022-07-10 22:17:12,059][26022] Updated weights on worker 0-0, policy_version 914466 (0.00084) [2022-07-10 22:17:13,531][26022] Updated weights on worker 0-0, policy_version 914476 (0.00082) [2022-07-10 22:17:15,533][26022] Updated weights on worker 0-0, policy_version 914486 (0.00082) [2022-07-10 22:17:17,101][25689] Fps is (10 sec: 5569.8, 60 sec: 5544.4, 300 sec: 5543.0). Total num frames: 936442880. Throughput: 0: 5719.3. Samples: 936443098. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:17,101][25689] Avg episode reward: [(0, '-0.447')] [2022-07-10 22:17:17,124][26022] Updated weights on worker 0-0, policy_version 914496 (0.00088) [2022-07-10 22:17:19,039][26022] Updated weights on worker 0-0, policy_version 914506 (0.00088) [2022-07-10 22:17:20,974][26022] Updated weights on worker 0-0, policy_version 914516 (0.00086) [2022-07-10 22:17:22,106][25689] Fps is (10 sec: 5787.3, 60 sec: 5534.2, 300 sec: 5547.3). Total num frames: 936470528. Throughput: 0: 5736.0. Samples: 936476864. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:22,106][25689] Avg episode reward: [(0, '-0.821')] [2022-07-10 22:17:22,436][26022] Updated weights on worker 0-0, policy_version 914526 (0.00087) [2022-07-10 22:17:24,613][26022] Updated weights on worker 0-0, policy_version 914536 (0.00086) [2022-07-10 22:17:26,466][26022] Updated weights on worker 0-0, policy_version 914546 (0.00081) [2022-07-10 22:17:27,113][25689] Fps is (10 sec: 5625.1, 60 sec: 5551.9, 300 sec: 5548.0). Total num frames: 936499200. Throughput: 0: 5026.7. Samples: 936493866. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:27,115][25689] Avg episode reward: [(0, '-0.719')] [2022-07-10 22:17:28,152][26022] Updated weights on worker 0-0, policy_version 914556 (0.00088) [2022-07-10 22:17:29,960][26022] Updated weights on worker 0-0, policy_version 914566 (0.00089) [2022-07-10 22:17:31,860][26022] Updated weights on worker 0-0, policy_version 914576 (0.00092) [2022-07-10 22:17:32,127][25689] Fps is (10 sec: 5518.0, 60 sec: 5521.2, 300 sec: 5541.9). Total num frames: 936525824. Throughput: 0: 5855.6. Samples: 936527218. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:32,128][25689] Avg episode reward: [(0, '-0.313')] [2022-07-10 22:17:33,829][26022] Updated weights on worker 0-0, policy_version 914586 (0.00088) [2022-07-10 22:17:35,646][26022] Updated weights on worker 0-0, policy_version 914596 (0.00086) [2022-07-10 22:17:37,226][25689] Fps is (10 sec: 5569.1, 60 sec: 5516.5, 300 sec: 5547.0). Total num frames: 936555520. Throughput: 0: 5857.1. Samples: 936560974. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:37,228][25689] Avg episode reward: [(0, '-0.166')] [2022-07-10 22:17:37,273][26022] Updated weights on worker 0-0, policy_version 914606 (0.00088) [2022-07-10 22:17:39,143][26022] Updated weights on worker 0-0, policy_version 914616 (0.00087) [2022-07-10 22:17:40,954][26022] Updated weights on worker 0-0, policy_version 914626 (0.00082) [2022-07-10 22:17:42,263][25689] Fps is (10 sec: 5657.4, 60 sec: 5535.0, 300 sec: 5539.6). Total num frames: 936583168. Throughput: 0: 5012.5. Samples: 936577900. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:42,265][25689] Avg episode reward: [(0, '0.115')] [2022-07-10 22:17:42,699][26022] Updated weights on worker 0-0, policy_version 914636 (0.00084) [2022-07-10 22:17:44,813][26022] Updated weights on worker 0-0, policy_version 914646 (0.00086) [2022-07-10 22:17:46,407][26022] Updated weights on worker 0-0, policy_version 914656 (0.00085) [2022-07-10 22:17:47,288][25689] Fps is (10 sec: 5495.5, 60 sec: 5550.9, 300 sec: 5542.7). Total num frames: 936610816. Throughput: 0: 5823.4. Samples: 936611352. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:47,289][25689] Avg episode reward: [(0, '0.199')] [2022-07-10 22:17:48,407][26022] Updated weights on worker 0-0, policy_version 914666 (0.00087) [2022-07-10 22:17:50,146][26022] Updated weights on worker 0-0, policy_version 914676 (0.00084) [2022-07-10 22:17:52,039][26022] Updated weights on worker 0-0, policy_version 914686 (0.00087) [2022-07-10 22:17:52,335][25689] Fps is (10 sec: 5693.4, 60 sec: 5547.2, 300 sec: 5542.9). Total num frames: 936640512. Throughput: 0: 5826.8. Samples: 936644966. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:52,335][25689] Avg episode reward: [(0, '0.268')] [2022-07-10 22:17:53,896][26022] Updated weights on worker 0-0, policy_version 914696 (0.00093) [2022-07-10 22:17:55,510][26022] Updated weights on worker 0-0, policy_version 914706 (0.00087) [2022-07-10 22:17:57,435][25689] Fps is (10 sec: 5651.3, 60 sec: 5567.4, 300 sec: 5545.4). Total num frames: 936668160. Throughput: 0: 4992.4. Samples: 936661866. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:17:57,436][25689] Avg episode reward: [(0, '-0.223')] [2022-07-10 22:17:57,548][26022] Updated weights on worker 0-0, policy_version 914716 (0.00053) [2022-07-10 22:17:59,063][26022] Updated weights on worker 0-0, policy_version 914726 (0.00084) [2022-07-10 22:18:01,216][26022] Updated weights on worker 0-0, policy_version 914736 (0.00079) [2022-07-10 22:18:02,507][25689] Fps is (10 sec: 5234.7, 60 sec: 5531.7, 300 sec: 5544.1). Total num frames: 936693760. Throughput: 0: 5817.5. Samples: 936695670. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:02,507][25689] Avg episode reward: [(0, '-0.105')] [2022-07-10 22:18:03,102][26022] Updated weights on worker 0-0, policy_version 914746 (0.00087) [2022-07-10 22:18:05,267][26022] Updated weights on worker 0-0, policy_version 914756 (0.00087) [2022-07-10 22:18:06,764][26022] Updated weights on worker 0-0, policy_version 914766 (0.00088) [2022-07-10 22:18:07,515][25689] Fps is (10 sec: 5485.8, 60 sec: 5566.2, 300 sec: 5547.7). Total num frames: 936723456. Throughput: 0: 5727.7. Samples: 936727206. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:07,515][25689] Avg episode reward: [(0, '-0.400')] [2022-07-10 22:18:08,889][26022] Updated weights on worker 0-0, policy_version 914776 (0.00086) [2022-07-10 22:18:10,594][26022] Updated weights on worker 0-0, policy_version 914786 (0.00090) [2022-07-10 22:18:12,544][25689] Fps is (10 sec: 5509.0, 60 sec: 5566.3, 300 sec: 5539.9). Total num frames: 936749056. Throughput: 0: 4901.4. Samples: 936744022. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:12,545][25689] Avg episode reward: [(0, '-1.507')] [2022-07-10 22:18:12,747][26022] Updated weights on worker 0-0, policy_version 914796 (0.00087) [2022-07-10 22:18:14,500][26022] Updated weights on worker 0-0, policy_version 914806 (0.00086) [2022-07-10 22:18:16,344][26022] Updated weights on worker 0-0, policy_version 914816 (0.00090) [2022-07-10 22:18:17,663][25689] Fps is (10 sec: 5549.9, 60 sec: 5562.8, 300 sec: 5541.2). Total num frames: 936779776. Throughput: 0: 5703.7. Samples: 936777240. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:17,663][25689] Avg episode reward: [(0, '-1.644')] [2022-07-10 22:18:18,028][26022] Updated weights on worker 0-0, policy_version 914826 (0.00090) [2022-07-10 22:18:19,927][26022] Updated weights on worker 0-0, policy_version 914836 (0.00086) [2022-07-10 22:18:21,751][26022] Updated weights on worker 0-0, policy_version 914846 (0.00093) [2022-07-10 22:18:22,715][25689] Fps is (10 sec: 5638.2, 60 sec: 5541.6, 300 sec: 5537.8). Total num frames: 936806400. Throughput: 0: 5693.2. Samples: 936810718. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:22,716][25689] Avg episode reward: [(0, '-2.139')] [2022-07-10 22:18:23,669][26022] Updated weights on worker 0-0, policy_version 914856 (0.00089) [2022-07-10 22:18:25,392][26022] Updated weights on worker 0-0, policy_version 914866 (0.00086) [2022-07-10 22:18:27,586][26022] Updated weights on worker 0-0, policy_version 914876 (0.00092) [2022-07-10 22:18:27,800][25689] Fps is (10 sec: 5354.0, 60 sec: 5517.7, 300 sec: 5540.2). Total num frames: 936834048. Throughput: 0: 5750.2. Samples: 936843846. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:27,800][25689] Avg episode reward: [(0, '-1.462')] [2022-07-10 22:18:29,098][26022] Updated weights on worker 0-0, policy_version 914886 (0.00085) [2022-07-10 22:18:30,978][26022] Updated weights on worker 0-0, policy_version 914896 (0.00086) [2022-07-10 22:18:32,739][26022] Updated weights on worker 0-0, policy_version 914906 (0.00079) [2022-07-10 22:18:32,836][25689] Fps is (10 sec: 5665.7, 60 sec: 5566.2, 300 sec: 5540.8). Total num frames: 936863744. Throughput: 0: 5744.6. Samples: 936860590. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2022-07-10 22:18:32,837][25689] Avg episode reward: [(0, '-1.257')] [2022-07-10 22:18:34,755][26022] Updated weights on worker 0-0, policy_version 914916 (0.00083) [2022-07-10 22:18:36,455][26022] Updated weights on worker 0-0, policy_version 914926 (0.00087) [2022-07-10 22:18:37,884][25689] Fps is (10 sec: 5787.9, 60 sec: 5554.0, 300 sec: 5543.4). Total num frames: 936892416. Throughput: 0: 5796.2. Samples: 936894446. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:18:37,885][25689] Avg episode reward: [(0, '-1.188')] [2022-07-10 22:18:38,200][26022] Updated weights on worker 0-0, policy_version 914936 (0.00099) [2022-07-10 22:18:40,089][26022] Updated weights on worker 0-0, policy_version 914946 (0.00096) [2022-07-10 22:18:42,183][26022] Updated weights on worker 0-0, policy_version 914956 (0.00081) [2022-07-10 22:18:42,927][25689] Fps is (10 sec: 5378.3, 60 sec: 5519.7, 300 sec: 5535.9). Total num frames: 936918016. Throughput: 0: 5794.8. Samples: 936927844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:18:42,928][25689] Avg episode reward: [(0, '0.210')] [2022-07-10 22:18:43,711][26022] Updated weights on worker 0-0, policy_version 914966 (0.00089) [2022-07-10 22:18:45,709][26022] Updated weights on worker 0-0, policy_version 914976 (0.00088) [2022-07-10 22:18:47,518][26022] Updated weights on worker 0-0, policy_version 914986 (0.00091) [2022-07-10 22:18:47,982][25689] Fps is (10 sec: 5475.9, 60 sec: 5550.7, 300 sec: 5545.6). Total num frames: 936947712. Throughput: 0: 4995.2. Samples: 936944662. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:18:47,983][25689] Avg episode reward: [(0, '0.728')] [2022-07-10 22:18:49,414][26022] Updated weights on worker 0-0, policy_version 914996 (0.00084) [2022-07-10 22:18:51,312][26022] Updated weights on worker 0-0, policy_version 915006 (0.00081) [2022-07-10 22:18:53,009][25689] Fps is (10 sec: 5586.0, 60 sec: 5501.9, 300 sec: 5540.7). Total num frames: 936974336. Throughput: 0: 5803.0. Samples: 936977656. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:18:53,010][25689] Avg episode reward: [(0, '0.658')] [2022-07-10 22:18:53,163][26022] Updated weights on worker 0-0, policy_version 915016 (0.00935) [2022-07-10 22:18:55,093][26022] Updated weights on worker 0-0, policy_version 915026 (0.00091) [2022-07-10 22:18:56,868][26022] Updated weights on worker 0-0, policy_version 915036 (0.00093) [2022-07-10 22:18:58,053][25689] Fps is (10 sec: 5490.6, 60 sec: 5523.9, 300 sec: 5541.5). Total num frames: 937003008. Throughput: 0: 5784.8. Samples: 937011122. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:18:58,055][25689] Avg episode reward: [(0, '0.880')] [2022-07-10 22:18:58,737][26022] Updated weights on worker 0-0, policy_version 915046 (0.00095) [2022-07-10 22:19:00,659][26022] Updated weights on worker 0-0, policy_version 915056 (0.00105) [2022-07-10 22:19:02,840][26022] Updated weights on worker 0-0, policy_version 915066 (0.00102) [2022-07-10 22:19:03,090][25689] Fps is (10 sec: 5384.0, 60 sec: 5527.1, 300 sec: 5544.9). Total num frames: 937028608. Throughput: 0: 4965.2. Samples: 937027956. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:03,090][25689] Avg episode reward: [(0, '1.014')] [2022-07-10 22:19:04,723][26022] Updated weights on worker 0-0, policy_version 915076 (0.00085) [2022-07-10 22:19:06,497][26022] Updated weights on worker 0-0, policy_version 915086 (0.00088) [2022-07-10 22:19:08,109][25689] Fps is (10 sec: 5397.0, 60 sec: 5509.2, 300 sec: 5542.0). Total num frames: 937057280. Throughput: 0: 5682.2. Samples: 937059028. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:08,110][25689] Avg episode reward: [(0, '1.062')] [2022-07-10 22:19:08,266][26022] Updated weights on worker 0-0, policy_version 915096 (0.00087) [2022-07-10 22:19:09,998][26022] Updated weights on worker 0-0, policy_version 915106 (0.00094) [2022-07-10 22:19:10,264][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:19:10,278][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000915107_937069568.pth [2022-07-10 22:19:10,279][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000913156_935071744.pth [2022-07-10 22:19:12,077][26022] Updated weights on worker 0-0, policy_version 915116 (0.00092) [2022-07-10 22:19:13,184][25689] Fps is (10 sec: 5579.3, 60 sec: 5538.8, 300 sec: 5542.1). Total num frames: 937084928. Throughput: 0: 5675.8. Samples: 937092164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:13,185][25689] Avg episode reward: [(0, '0.851')] [2022-07-10 22:19:13,812][26022] Updated weights on worker 0-0, policy_version 915126 (0.00089) [2022-07-10 22:19:15,651][26022] Updated weights on worker 0-0, policy_version 915136 (0.00095) [2022-07-10 22:19:17,579][26022] Updated weights on worker 0-0, policy_version 915146 (0.00086) [2022-07-10 22:19:18,259][25689] Fps is (10 sec: 5447.9, 60 sec: 5492.1, 300 sec: 5538.2). Total num frames: 937112576. Throughput: 0: 4842.5. Samples: 937108968. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:18,260][25689] Avg episode reward: [(0, '0.853')] [2022-07-10 22:19:19,232][26022] Updated weights on worker 0-0, policy_version 915156 (0.00080) [2022-07-10 22:19:21,261][26022] Updated weights on worker 0-0, policy_version 915166 (0.00095) [2022-07-10 22:19:23,105][26022] Updated weights on worker 0-0, policy_version 915176 (0.00083) [2022-07-10 22:19:23,328][25689] Fps is (10 sec: 5754.2, 60 sec: 5558.2, 300 sec: 5548.0). Total num frames: 937143296. Throughput: 0: 5656.5. Samples: 937142432. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:23,328][25689] Avg episode reward: [(0, '1.044')] [2022-07-10 22:19:24,891][26022] Updated weights on worker 0-0, policy_version 915186 (0.00090) [2022-07-10 22:19:26,580][26022] Updated weights on worker 0-0, policy_version 915196 (0.00092) [2022-07-10 22:19:28,356][25689] Fps is (10 sec: 5577.9, 60 sec: 5529.5, 300 sec: 5540.7). Total num frames: 937168896. Throughput: 0: 5764.3. Samples: 937175738. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:28,358][25689] Avg episode reward: [(0, '1.206')] [2022-07-10 22:19:28,572][26022] Updated weights on worker 0-0, policy_version 915206 (0.00089) [2022-07-10 22:19:30,422][26022] Updated weights on worker 0-0, policy_version 915216 (0.00085) [2022-07-10 22:19:32,241][26022] Updated weights on worker 0-0, policy_version 915226 (0.00080) [2022-07-10 22:19:33,391][25689] Fps is (10 sec: 5291.4, 60 sec: 5495.9, 300 sec: 5539.2). Total num frames: 937196544. Throughput: 0: 4952.0. Samples: 937192228. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:33,391][25689] Avg episode reward: [(0, '1.078')] [2022-07-10 22:19:34,206][26022] Updated weights on worker 0-0, policy_version 915236 (0.00101) [2022-07-10 22:19:35,921][26022] Updated weights on worker 0-0, policy_version 915246 (0.00089) [2022-07-10 22:19:37,753][26022] Updated weights on worker 0-0, policy_version 915256 (0.00083) [2022-07-10 22:19:38,499][25689] Fps is (10 sec: 5552.5, 60 sec: 5490.4, 300 sec: 5537.6). Total num frames: 937225216. Throughput: 0: 5755.8. Samples: 937225466. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:38,500][25689] Avg episode reward: [(0, '0.404')] [2022-07-10 22:19:39,678][26022] Updated weights on worker 0-0, policy_version 915266 (0.00090) [2022-07-10 22:19:41,435][26022] Updated weights on worker 0-0, policy_version 915276 (0.00093) [2022-07-10 22:19:43,492][26022] Updated weights on worker 0-0, policy_version 915286 (0.00091) [2022-07-10 22:19:43,539][25689] Fps is (10 sec: 5549.6, 60 sec: 5524.4, 300 sec: 5537.5). Total num frames: 937252864. Throughput: 0: 5770.3. Samples: 937259058. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:43,540][25689] Avg episode reward: [(0, '0.617')] [2022-07-10 22:19:45,191][26022] Updated weights on worker 0-0, policy_version 915296 (0.00091) [2022-07-10 22:19:47,021][26022] Updated weights on worker 0-0, policy_version 915306 (0.00081) [2022-07-10 22:19:48,568][25689] Fps is (10 sec: 5593.8, 60 sec: 5510.0, 300 sec: 5540.9). Total num frames: 937281536. Throughput: 0: 4956.9. Samples: 937275924. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:48,568][25689] Avg episode reward: [(0, '0.447')] [2022-07-10 22:19:48,946][26022] Updated weights on worker 0-0, policy_version 915316 (0.00088) [2022-07-10 22:19:50,516][26022] Updated weights on worker 0-0, policy_version 915326 (0.00086) [2022-07-10 22:19:52,689][26022] Updated weights on worker 0-0, policy_version 915336 (0.00086) [2022-07-10 22:19:53,619][25689] Fps is (10 sec: 5587.3, 60 sec: 5524.6, 300 sec: 5534.1). Total num frames: 937309184. Throughput: 0: 5794.5. Samples: 937309442. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:53,621][25689] Avg episode reward: [(0, '0.741')] [2022-07-10 22:19:54,074][26022] Updated weights on worker 0-0, policy_version 915346 (0.00086) [2022-07-10 22:19:56,223][26022] Updated weights on worker 0-0, policy_version 915356 (0.00081) [2022-07-10 22:19:57,732][26022] Updated weights on worker 0-0, policy_version 915366 (0.00091) [2022-07-10 22:19:58,759][25689] Fps is (10 sec: 5526.1, 60 sec: 5515.9, 300 sec: 5541.8). Total num frames: 937337856. Throughput: 0: 5805.3. Samples: 937343082. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:19:58,760][25689] Avg episode reward: [(0, '-0.150')] [2022-07-10 22:19:59,841][26022] Updated weights on worker 0-0, policy_version 915376 (0.00086) [2022-07-10 22:20:01,790][26022] Updated weights on worker 0-0, policy_version 915386 (0.00087) [2022-07-10 22:20:03,804][25689] Fps is (10 sec: 5429.5, 60 sec: 5532.0, 300 sec: 5538.1). Total num frames: 937364480. Throughput: 0: 4978.1. Samples: 937359936. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:03,804][25689] Avg episode reward: [(0, '-0.177')] [2022-07-10 22:20:03,872][26022] Updated weights on worker 0-0, policy_version 915396 (0.00088) [2022-07-10 22:20:05,674][26022] Updated weights on worker 0-0, policy_version 915406 (0.00077) [2022-07-10 22:20:07,755][26022] Updated weights on worker 0-0, policy_version 915416 (0.00266) [2022-07-10 22:20:08,853][25689] Fps is (10 sec: 5376.6, 60 sec: 5512.4, 300 sec: 5537.6). Total num frames: 937392128. Throughput: 0: 5679.5. Samples: 937391136. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:08,854][25689] Avg episode reward: [(0, '-0.914')] [2022-07-10 22:20:09,263][26022] Updated weights on worker 0-0, policy_version 915426 (0.00089) [2022-07-10 22:20:11,499][26022] Updated weights on worker 0-0, policy_version 915436 (0.00087) [2022-07-10 22:20:13,058][26022] Updated weights on worker 0-0, policy_version 915446 (0.00086) [2022-07-10 22:20:13,888][25689] Fps is (10 sec: 5483.4, 60 sec: 5516.1, 300 sec: 5534.6). Total num frames: 937419776. Throughput: 0: 5664.1. Samples: 937424246. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:13,888][25689] Avg episode reward: [(0, '-1.412')] [2022-07-10 22:20:15,195][26022] Updated weights on worker 0-0, policy_version 915456 (0.00444) [2022-07-10 22:20:16,762][26022] Updated weights on worker 0-0, policy_version 915466 (0.00092) [2022-07-10 22:20:18,699][26022] Updated weights on worker 0-0, policy_version 915476 (0.00096) [2022-07-10 22:20:18,953][25689] Fps is (10 sec: 5576.4, 60 sec: 5533.9, 300 sec: 5533.5). Total num frames: 937448448. Throughput: 0: 5672.4. Samples: 937457630. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:18,954][25689] Avg episode reward: [(0, '-1.462')] [2022-07-10 22:20:20,555][26022] Updated weights on worker 0-0, policy_version 915486 (0.00090) [2022-07-10 22:20:22,327][26022] Updated weights on worker 0-0, policy_version 915496 (0.00084) [2022-07-10 22:20:23,959][26022] Updated weights on worker 0-0, policy_version 915506 (0.00095) [2022-07-10 22:20:24,033][25689] Fps is (10 sec: 5753.2, 60 sec: 5516.0, 300 sec: 5539.0). Total num frames: 937478144. Throughput: 0: 5657.8. Samples: 937474390. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:24,033][25689] Avg episode reward: [(0, '-1.222')] [2022-07-10 22:20:26,163][26022] Updated weights on worker 0-0, policy_version 915516 (0.00090) [2022-07-10 22:20:27,811][26022] Updated weights on worker 0-0, policy_version 915526 (0.00086) [2022-07-10 22:20:29,091][25689] Fps is (10 sec: 5555.6, 60 sec: 5530.2, 300 sec: 5534.6). Total num frames: 937504768. Throughput: 0: 5765.1. Samples: 937507806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:29,091][25689] Avg episode reward: [(0, '-1.732')] [2022-07-10 22:20:29,724][26022] Updated weights on worker 0-0, policy_version 915536 (0.00099) [2022-07-10 22:20:31,531][26022] Updated weights on worker 0-0, policy_version 915546 (0.00082) [2022-07-10 22:20:33,360][26022] Updated weights on worker 0-0, policy_version 915556 (0.00085) [2022-07-10 22:20:34,112][25689] Fps is (10 sec: 5587.6, 60 sec: 5565.1, 300 sec: 5545.5). Total num frames: 937534464. Throughput: 0: 5801.5. Samples: 937541580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:34,113][25689] Avg episode reward: [(0, '-2.467')] [2022-07-10 22:20:35,419][26022] Updated weights on worker 0-0, policy_version 915566 (0.00086) [2022-07-10 22:20:36,926][26022] Updated weights on worker 0-0, policy_version 915576 (0.00089) [2022-07-10 22:20:38,914][26022] Updated weights on worker 0-0, policy_version 915586 (0.00091) [2022-07-10 22:20:39,198][25689] Fps is (10 sec: 5673.4, 60 sec: 5550.3, 300 sec: 5540.6). Total num frames: 937562112. Throughput: 0: 4966.4. Samples: 937558176. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:39,199][25689] Avg episode reward: [(0, '-1.564')] [2022-07-10 22:20:40,848][26022] Updated weights on worker 0-0, policy_version 915596 (0.00084) [2022-07-10 22:20:42,614][26022] Updated weights on worker 0-0, policy_version 915606 (0.00093) [2022-07-10 22:20:44,210][25689] Fps is (10 sec: 5476.5, 60 sec: 5552.9, 300 sec: 5537.0). Total num frames: 937589760. Throughput: 0: 5826.4. Samples: 937591946. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:44,210][25689] Avg episode reward: [(0, '-1.473')] [2022-07-10 22:20:44,363][26022] Updated weights on worker 0-0, policy_version 915616 (0.00093) [2022-07-10 22:20:46,301][26022] Updated weights on worker 0-0, policy_version 915626 (0.00086) [2022-07-10 22:20:48,012][26022] Updated weights on worker 0-0, policy_version 915636 (0.00093) [2022-07-10 22:20:49,238][25689] Fps is (10 sec: 5507.8, 60 sec: 5536.0, 300 sec: 5540.2). Total num frames: 937617408. Throughput: 0: 5843.0. Samples: 937625526. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:49,238][25689] Avg episode reward: [(0, '-2.338')] [2022-07-10 22:20:49,935][26022] Updated weights on worker 0-0, policy_version 915646 (0.00083) [2022-07-10 22:20:51,742][26022] Updated weights on worker 0-0, policy_version 915656 (0.00082) [2022-07-10 22:20:53,498][26022] Updated weights on worker 0-0, policy_version 915666 (0.00084) [2022-07-10 22:20:54,248][25689] Fps is (10 sec: 5610.5, 60 sec: 5556.7, 300 sec: 5537.4). Total num frames: 937646080. Throughput: 0: 4994.7. Samples: 937642150. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:54,248][25689] Avg episode reward: [(0, '-2.312')] [2022-07-10 22:20:55,356][26022] Updated weights on worker 0-0, policy_version 915676 (0.00089) [2022-07-10 22:20:57,235][26022] Updated weights on worker 0-0, policy_version 915686 (0.00089) [2022-07-10 22:20:58,799][26022] Updated weights on worker 0-0, policy_version 915696 (0.00091) [2022-07-10 22:20:59,347][25689] Fps is (10 sec: 5672.6, 60 sec: 5560.5, 300 sec: 5539.6). Total num frames: 937674752. Throughput: 0: 5853.4. Samples: 937676114. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:20:59,347][25689] Avg episode reward: [(0, '-1.862')] [2022-07-10 22:21:00,884][26022] Updated weights on worker 0-0, policy_version 915706 (0.00095) [2022-07-10 22:21:02,916][26022] Updated weights on worker 0-0, policy_version 915716 (0.00087) [2022-07-10 22:21:04,349][25689] Fps is (10 sec: 5271.3, 60 sec: 5530.5, 300 sec: 5536.3). Total num frames: 937699328. Throughput: 0: 5744.2. Samples: 937707634. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:04,350][25689] Avg episode reward: [(0, '-2.004')] [2022-07-10 22:21:04,854][26022] Updated weights on worker 0-0, policy_version 915726 (0.00085) [2022-07-10 22:21:06,850][26022] Updated weights on worker 0-0, policy_version 915736 (0.00097) [2022-07-10 22:21:08,535][26022] Updated weights on worker 0-0, policy_version 915746 (0.00101) [2022-07-10 22:21:09,395][25689] Fps is (10 sec: 5299.4, 60 sec: 5547.8, 300 sec: 5535.9). Total num frames: 937728000. Throughput: 0: 4893.7. Samples: 937724168. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:09,395][25689] Avg episode reward: [(0, '-2.067')] [2022-07-10 22:21:10,535][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:21:10,550][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000915756_937734144.pth [2022-07-10 22:21:10,551][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000913806_935737344.pth [2022-07-10 22:21:10,560][26022] Updated weights on worker 0-0, policy_version 915756 (0.00091) [2022-07-10 22:21:12,299][26022] Updated weights on worker 0-0, policy_version 915766 (0.00085) [2022-07-10 22:21:14,055][26022] Updated weights on worker 0-0, policy_version 915776 (0.00089) [2022-07-10 22:21:14,413][25689] Fps is (10 sec: 5596.3, 60 sec: 5549.3, 300 sec: 5534.4). Total num frames: 937755648. Throughput: 0: 5727.5. Samples: 937757646. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:14,414][25689] Avg episode reward: [(0, '-2.051')] [2022-07-10 22:21:15,952][26022] Updated weights on worker 0-0, policy_version 915786 (0.00095) [2022-07-10 22:21:17,716][26022] Updated weights on worker 0-0, policy_version 915796 (0.00090) [2022-07-10 22:21:19,538][25689] Fps is (10 sec: 5552.5, 60 sec: 5543.8, 300 sec: 5533.5). Total num frames: 937784320. Throughput: 0: 5700.6. Samples: 937791216. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:19,538][25689] Avg episode reward: [(0, '-0.690')] [2022-07-10 22:21:19,728][26022] Updated weights on worker 0-0, policy_version 915806 (0.00095) [2022-07-10 22:21:21,501][26022] Updated weights on worker 0-0, policy_version 915816 (0.00089) [2022-07-10 22:21:23,161][26022] Updated weights on worker 0-0, policy_version 915826 (0.00053) [2022-07-10 22:21:24,571][25689] Fps is (10 sec: 5544.5, 60 sec: 5514.3, 300 sec: 5533.2). Total num frames: 937811968. Throughput: 0: 4968.8. Samples: 937808110. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:24,571][25689] Avg episode reward: [(0, '-0.695')] [2022-07-10 22:21:25,025][26022] Updated weights on worker 0-0, policy_version 915836 (0.00088) [2022-07-10 22:21:26,807][26022] Updated weights on worker 0-0, policy_version 915846 (0.00089) [2022-07-10 22:21:28,696][26022] Updated weights on worker 0-0, policy_version 915856 (0.00089) [2022-07-10 22:21:29,576][25689] Fps is (10 sec: 5610.3, 60 sec: 5552.9, 300 sec: 5534.0). Total num frames: 937840640. Throughput: 0: 5827.9. Samples: 937841786. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:29,578][25689] Avg episode reward: [(0, '-1.452')] [2022-07-10 22:21:30,616][26022] Updated weights on worker 0-0, policy_version 915866 (0.00089) [2022-07-10 22:21:32,407][26022] Updated weights on worker 0-0, policy_version 915876 (0.00092) [2022-07-10 22:21:34,183][26022] Updated weights on worker 0-0, policy_version 915886 (0.00088) [2022-07-10 22:21:34,594][25689] Fps is (10 sec: 5721.0, 60 sec: 5536.3, 300 sec: 5531.1). Total num frames: 937869312. Throughput: 0: 5816.7. Samples: 937875034. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:34,595][25689] Avg episode reward: [(0, '-0.880')] [2022-07-10 22:21:36,312][26022] Updated weights on worker 0-0, policy_version 915896 (0.00082) [2022-07-10 22:21:37,813][26022] Updated weights on worker 0-0, policy_version 915906 (0.00086) [2022-07-10 22:21:39,640][25689] Fps is (10 sec: 5494.8, 60 sec: 5523.1, 300 sec: 5531.3). Total num frames: 937895936. Throughput: 0: 4993.9. Samples: 937891602. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:39,640][25689] Avg episode reward: [(0, '-1.402')] [2022-07-10 22:21:40,015][26022] Updated weights on worker 0-0, policy_version 915916 (0.00092) [2022-07-10 22:21:41,607][26022] Updated weights on worker 0-0, policy_version 915926 (0.00082) [2022-07-10 22:21:43,464][26022] Updated weights on worker 0-0, policy_version 915936 (0.00089) [2022-07-10 22:21:44,648][25689] Fps is (10 sec: 5500.1, 60 sec: 5540.3, 300 sec: 5538.2). Total num frames: 937924608. Throughput: 0: 5834.4. Samples: 937925248. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:44,648][25689] Avg episode reward: [(0, '-1.890')] [2022-07-10 22:21:45,381][26022] Updated weights on worker 0-0, policy_version 915946 (0.00092) [2022-07-10 22:21:47,123][26022] Updated weights on worker 0-0, policy_version 915956 (0.00091) [2022-07-10 22:21:49,062][26022] Updated weights on worker 0-0, policy_version 915966 (0.00092) [2022-07-10 22:21:49,666][25689] Fps is (10 sec: 5617.4, 60 sec: 5541.3, 300 sec: 5531.2). Total num frames: 937952256. Throughput: 0: 5813.9. Samples: 937958582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:49,666][25689] Avg episode reward: [(0, '-1.519')] [2022-07-10 22:21:50,714][26022] Updated weights on worker 0-0, policy_version 915976 (0.00084) [2022-07-10 22:21:52,687][26022] Updated weights on worker 0-0, policy_version 915986 (0.00089) [2022-07-10 22:21:54,391][26022] Updated weights on worker 0-0, policy_version 915996 (0.00086) [2022-07-10 22:21:54,682][25689] Fps is (10 sec: 5612.9, 60 sec: 5540.7, 300 sec: 5540.3). Total num frames: 937980928. Throughput: 0: 4992.7. Samples: 937975326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:54,682][25689] Avg episode reward: [(0, '-1.009')] [2022-07-10 22:21:56,282][26022] Updated weights on worker 0-0, policy_version 916006 (0.00088) [2022-07-10 22:21:58,288][26022] Updated weights on worker 0-0, policy_version 916016 (0.00093) [2022-07-10 22:21:59,787][25689] Fps is (10 sec: 5564.3, 60 sec: 5523.2, 300 sec: 5539.3). Total num frames: 938008576. Throughput: 0: 5822.7. Samples: 938008916. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:21:59,788][25689] Avg episode reward: [(0, '-0.476')] [2022-07-10 22:21:59,975][26022] Updated weights on worker 0-0, policy_version 916026 (0.00081) [2022-07-10 22:22:02,357][26022] Updated weights on worker 0-0, policy_version 916036 (0.00281) [2022-07-10 22:22:03,975][26022] Updated weights on worker 0-0, policy_version 916046 (0.00086) [2022-07-10 22:22:04,800][25689] Fps is (10 sec: 5161.3, 60 sec: 5522.2, 300 sec: 5529.0). Total num frames: 938033152. Throughput: 0: 5706.6. Samples: 938040250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:22:04,801][25689] Avg episode reward: [(0, '-0.233')] [2022-07-10 22:22:05,888][26022] Updated weights on worker 0-0, policy_version 916056 (0.00090) [2022-07-10 22:22:07,824][26022] Updated weights on worker 0-0, policy_version 916066 (0.00089) [2022-07-10 22:22:09,424][26022] Updated weights on worker 0-0, policy_version 916076 (0.00087) [2022-07-10 22:22:09,836][25689] Fps is (10 sec: 5502.6, 60 sec: 5557.0, 300 sec: 5546.1). Total num frames: 938063872. Throughput: 0: 4869.2. Samples: 938056796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:22:09,837][25689] Avg episode reward: [(0, '-0.474')] [2022-07-10 22:22:11,425][26022] Updated weights on worker 0-0, policy_version 916086 (0.00092) [2022-07-10 22:22:13,051][26022] Updated weights on worker 0-0, policy_version 916096 (0.00087) [2022-07-10 22:22:14,845][25689] Fps is (10 sec: 5709.1, 60 sec: 5540.9, 300 sec: 5533.7). Total num frames: 938090496. Throughput: 0: 5709.6. Samples: 938090448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:22:14,845][25689] Avg episode reward: [(0, '-1.100')] [2022-07-10 22:22:15,255][26022] Updated weights on worker 0-0, policy_version 916106 (0.00086) [2022-07-10 22:22:16,943][26022] Updated weights on worker 0-0, policy_version 916116 (0.00090) [2022-07-10 22:22:18,672][26022] Updated weights on worker 0-0, policy_version 916126 (0.00091) [2022-07-10 22:22:19,882][25689] Fps is (10 sec: 5402.5, 60 sec: 5532.0, 300 sec: 5533.1). Total num frames: 938118144. Throughput: 0: 5709.3. Samples: 938123642. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 22:22:19,883][25689] Avg episode reward: [(0, '-0.806')] [2022-07-10 22:22:20,738][26022] Updated weights on worker 0-0, policy_version 916136 (0.00086) [2022-07-10 22:22:22,520][26022] Updated weights on worker 0-0, policy_version 916146 (0.00086) [2022-07-10 22:22:24,650][26022] Updated weights on worker 0-0, policy_version 916156 (0.00095) [2022-07-10 22:22:24,898][25689] Fps is (10 sec: 5500.0, 60 sec: 5533.5, 300 sec: 5529.5). Total num frames: 938145792. Throughput: 0: 4974.2. Samples: 938140222. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:24,899][25689] Avg episode reward: [(0, '-0.986')] [2022-07-10 22:22:26,222][26022] Updated weights on worker 0-0, policy_version 916166 (0.00081) [2022-07-10 22:22:28,191][26022] Updated weights on worker 0-0, policy_version 916176 (0.00110) [2022-07-10 22:22:29,824][26022] Updated weights on worker 0-0, policy_version 916186 (0.00095) [2022-07-10 22:22:29,904][25689] Fps is (10 sec: 5619.5, 60 sec: 5533.5, 300 sec: 5536.5). Total num frames: 938174464. Throughput: 0: 5825.2. Samples: 938173696. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:29,905][25689] Avg episode reward: [(0, '-1.549')] [2022-07-10 22:22:31,745][26022] Updated weights on worker 0-0, policy_version 916196 (0.00093) [2022-07-10 22:22:33,670][26022] Updated weights on worker 0-0, policy_version 916206 (0.00098) [2022-07-10 22:22:34,917][25689] Fps is (10 sec: 5621.7, 60 sec: 5517.0, 300 sec: 5531.2). Total num frames: 938202112. Throughput: 0: 5814.6. Samples: 938207160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:34,917][25689] Avg episode reward: [(0, '-1.411')] [2022-07-10 22:22:35,438][26022] Updated weights on worker 0-0, policy_version 916216 (0.00088) [2022-07-10 22:22:37,331][26022] Updated weights on worker 0-0, policy_version 916226 (0.00086) [2022-07-10 22:22:39,230][26022] Updated weights on worker 0-0, policy_version 916236 (0.00090) [2022-07-10 22:22:40,048][25689] Fps is (10 sec: 5451.1, 60 sec: 5526.1, 300 sec: 5529.5). Total num frames: 938229760. Throughput: 0: 4969.4. Samples: 938223856. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:40,050][25689] Avg episode reward: [(0, '-1.167')] [2022-07-10 22:22:40,775][26022] Updated weights on worker 0-0, policy_version 916246 (0.00101) [2022-07-10 22:22:42,904][26022] Updated weights on worker 0-0, policy_version 916256 (0.00089) [2022-07-10 22:22:44,414][26022] Updated weights on worker 0-0, policy_version 916266 (0.00093) [2022-07-10 22:22:45,080][25689] Fps is (10 sec: 5743.3, 60 sec: 5557.9, 300 sec: 5539.7). Total num frames: 938260480. Throughput: 0: 5831.5. Samples: 938257908. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:45,081][25689] Avg episode reward: [(0, '-1.050')] [2022-07-10 22:22:46,355][26022] Updated weights on worker 0-0, policy_version 916276 (0.00084) [2022-07-10 22:22:48,255][26022] Updated weights on worker 0-0, policy_version 916286 (0.00092) [2022-07-10 22:22:50,103][25689] Fps is (10 sec: 5703.5, 60 sec: 5540.5, 300 sec: 5529.8). Total num frames: 938287104. Throughput: 0: 5834.2. Samples: 938291536. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:50,103][25689] Avg episode reward: [(0, '-1.178')] [2022-07-10 22:22:50,112][26022] Updated weights on worker 0-0, policy_version 916296 (0.00105) [2022-07-10 22:22:51,931][26022] Updated weights on worker 0-0, policy_version 916306 (0.00085) [2022-07-10 22:22:53,641][26022] Updated weights on worker 0-0, policy_version 916316 (0.00092) [2022-07-10 22:22:55,106][25689] Fps is (10 sec: 5413.1, 60 sec: 5524.7, 300 sec: 5531.6). Total num frames: 938314752. Throughput: 0: 5007.2. Samples: 938308250. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:22:55,107][25689] Avg episode reward: [(0, '-0.819')] [2022-07-10 22:22:55,593][26022] Updated weights on worker 0-0, policy_version 916326 (0.00086) [2022-07-10 22:22:57,204][26022] Updated weights on worker 0-0, policy_version 916336 (0.00088) [2022-07-10 22:22:59,220][26022] Updated weights on worker 0-0, policy_version 916346 (0.00084) [2022-07-10 22:23:00,186][25689] Fps is (10 sec: 5788.4, 60 sec: 5577.9, 300 sec: 5548.6). Total num frames: 938345472. Throughput: 0: 5883.5. Samples: 938342338. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:00,187][25689] Avg episode reward: [(0, '-0.401')] [2022-07-10 22:23:00,950][26022] Updated weights on worker 0-0, policy_version 916356 (0.00087) [2022-07-10 22:23:03,161][26022] Updated weights on worker 0-0, policy_version 916366 (0.00067) [2022-07-10 22:23:05,026][26022] Updated weights on worker 0-0, policy_version 916376 (0.00089) [2022-07-10 22:23:05,193][25689] Fps is (10 sec: 5481.4, 60 sec: 5578.4, 300 sec: 5531.4). Total num frames: 938370048. Throughput: 0: 5785.4. Samples: 938374276. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:05,194][25689] Avg episode reward: [(0, '-0.185')] [2022-07-10 22:23:06,643][26022] Updated weights on worker 0-0, policy_version 916386 (0.00079) [2022-07-10 22:23:08,630][26022] Updated weights on worker 0-0, policy_version 916396 (0.00090) [2022-07-10 22:23:10,213][25689] Fps is (10 sec: 5106.2, 60 sec: 5512.0, 300 sec: 5535.1). Total num frames: 938396672. Throughput: 0: 4951.2. Samples: 938391108. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:10,214][25689] Avg episode reward: [(0, '-0.144')] [2022-07-10 22:23:10,421][26022] Updated weights on worker 0-0, policy_version 916406 (0.00086) [2022-07-10 22:23:10,967][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:23:10,977][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000916408_938401792.pth [2022-07-10 22:23:10,977][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000914457_936403968.pth [2022-07-10 22:23:12,205][26022] Updated weights on worker 0-0, policy_version 916416 (0.00084) [2022-07-10 22:23:14,041][26022] Updated weights on worker 0-0, policy_version 916426 (0.00082) [2022-07-10 22:23:15,251][25689] Fps is (10 sec: 5599.9, 60 sec: 5560.2, 300 sec: 5533.1). Total num frames: 938426368. Throughput: 0: 5782.7. Samples: 938424744. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:15,251][25689] Avg episode reward: [(0, '0.537')] [2022-07-10 22:23:15,923][26022] Updated weights on worker 0-0, policy_version 916436 (0.00078) [2022-07-10 22:23:17,703][26022] Updated weights on worker 0-0, policy_version 916446 (0.00097) [2022-07-10 22:23:19,679][26022] Updated weights on worker 0-0, policy_version 916456 (0.00087) [2022-07-10 22:23:20,301][25689] Fps is (10 sec: 5684.4, 60 sec: 5559.0, 300 sec: 5536.6). Total num frames: 938454016. Throughput: 0: 5765.9. Samples: 938458320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:20,302][25689] Avg episode reward: [(0, '-0.376')] [2022-07-10 22:23:21,242][26022] Updated weights on worker 0-0, policy_version 916466 (0.00083) [2022-07-10 22:23:23,180][26022] Updated weights on worker 0-0, policy_version 916476 (0.00092) [2022-07-10 22:23:24,953][26022] Updated weights on worker 0-0, policy_version 916486 (0.00091) [2022-07-10 22:23:25,315][25689] Fps is (10 sec: 5596.0, 60 sec: 5576.2, 300 sec: 5541.4). Total num frames: 938482688. Throughput: 0: 5863.6. Samples: 938492262. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:25,316][25689] Avg episode reward: [(0, '-0.594')] [2022-07-10 22:23:27,004][26022] Updated weights on worker 0-0, policy_version 916496 (0.00090) [2022-07-10 22:23:28,674][26022] Updated weights on worker 0-0, policy_version 916506 (0.00084) [2022-07-10 22:23:30,323][25689] Fps is (10 sec: 5619.5, 60 sec: 5559.0, 300 sec: 5535.0). Total num frames: 938510336. Throughput: 0: 5856.3. Samples: 938508880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:30,324][25689] Avg episode reward: [(0, '-1.077')] [2022-07-10 22:23:30,518][26022] Updated weights on worker 0-0, policy_version 916516 (0.00091) [2022-07-10 22:23:32,284][26022] Updated weights on worker 0-0, policy_version 916526 (0.00089) [2022-07-10 22:23:34,125][26022] Updated weights on worker 0-0, policy_version 916536 (0.00489) [2022-07-10 22:23:35,338][25689] Fps is (10 sec: 5618.9, 60 sec: 5575.7, 300 sec: 5535.6). Total num frames: 938539008. Throughput: 0: 5870.0. Samples: 938542658. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:35,340][25689] Avg episode reward: [(0, '-0.904')] [2022-07-10 22:23:36,005][26022] Updated weights on worker 0-0, policy_version 916546 (0.00089) [2022-07-10 22:23:37,884][26022] Updated weights on worker 0-0, policy_version 916556 (0.00092) [2022-07-10 22:23:39,707][26022] Updated weights on worker 0-0, policy_version 916566 (0.00086) [2022-07-10 22:23:40,418][25689] Fps is (10 sec: 5579.3, 60 sec: 5580.6, 300 sec: 5541.8). Total num frames: 938566656. Throughput: 0: 5835.7. Samples: 938575714. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:40,420][25689] Avg episode reward: [(0, '-1.772')] [2022-07-10 22:23:41,646][26022] Updated weights on worker 0-0, policy_version 916576 (0.00084) [2022-07-10 22:23:43,303][26022] Updated weights on worker 0-0, policy_version 916586 (0.00086) [2022-07-10 22:23:45,109][26022] Updated weights on worker 0-0, policy_version 916596 (0.00087) [2022-07-10 22:23:45,424][25689] Fps is (10 sec: 5584.3, 60 sec: 5549.0, 300 sec: 5539.3). Total num frames: 938595328. Throughput: 0: 4987.0. Samples: 938592544. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:45,425][25689] Avg episode reward: [(0, '-0.747')] [2022-07-10 22:23:47,150][26022] Updated weights on worker 0-0, policy_version 916606 (0.00096) [2022-07-10 22:23:48,927][26022] Updated weights on worker 0-0, policy_version 916616 (0.00088) [2022-07-10 22:23:50,444][25689] Fps is (10 sec: 5413.1, 60 sec: 5532.2, 300 sec: 5536.0). Total num frames: 938620928. Throughput: 0: 5816.7. Samples: 938625916. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:50,446][25689] Avg episode reward: [(0, '-0.476')] [2022-07-10 22:23:50,799][26022] Updated weights on worker 0-0, policy_version 916626 (0.00092) [2022-07-10 22:23:52,675][26022] Updated weights on worker 0-0, policy_version 916636 (0.00084) [2022-07-10 22:23:54,443][26022] Updated weights on worker 0-0, policy_version 916646 (0.00082) [2022-07-10 22:23:55,455][25689] Fps is (10 sec: 5410.3, 60 sec: 5548.5, 300 sec: 5536.6). Total num frames: 938649600. Throughput: 0: 5762.2. Samples: 938658574. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:23:55,455][25689] Avg episode reward: [(0, '-0.420')] [2022-07-10 22:23:56,673][26022] Updated weights on worker 0-0, policy_version 916656 (0.00093) [2022-07-10 22:23:58,228][26022] Updated weights on worker 0-0, policy_version 916666 (0.00095) [2022-07-10 22:24:00,107][26022] Updated weights on worker 0-0, policy_version 916676 (0.00089) [2022-07-10 22:24:00,552][25689] Fps is (10 sec: 5673.4, 60 sec: 5513.1, 300 sec: 5545.8). Total num frames: 938678272. Throughput: 0: 4946.6. Samples: 938675306. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:00,552][25689] Avg episode reward: [(0, '-0.563')] [2022-07-10 22:24:02,282][26022] Updated weights on worker 0-0, policy_version 916686 (0.00087) [2022-07-10 22:24:04,079][26022] Updated weights on worker 0-0, policy_version 916696 (0.00109) [2022-07-10 22:24:05,556][25689] Fps is (10 sec: 5372.8, 60 sec: 5530.3, 300 sec: 5535.7). Total num frames: 938703872. Throughput: 0: 5692.9. Samples: 938707156. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:05,558][25689] Avg episode reward: [(0, '-0.318')] [2022-07-10 22:24:06,109][26022] Updated weights on worker 0-0, policy_version 916706 (0.00089) [2022-07-10 22:24:07,750][26022] Updated weights on worker 0-0, policy_version 916716 (0.00085) [2022-07-10 22:24:09,657][26022] Updated weights on worker 0-0, policy_version 916726 (0.00092) [2022-07-10 22:24:10,592][25689] Fps is (10 sec: 5405.4, 60 sec: 5562.8, 300 sec: 5539.9). Total num frames: 938732544. Throughput: 0: 5699.8. Samples: 938740754. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:10,592][25689] Avg episode reward: [(0, '-0.196')] [2022-07-10 22:24:11,419][26022] Updated weights on worker 0-0, policy_version 916736 (0.00086) [2022-07-10 22:24:13,368][26022] Updated weights on worker 0-0, policy_version 916746 (0.00085) [2022-07-10 22:24:15,227][26022] Updated weights on worker 0-0, policy_version 916756 (0.00092) [2022-07-10 22:24:15,619][25689] Fps is (10 sec: 5597.0, 60 sec: 5529.8, 300 sec: 5540.8). Total num frames: 938760192. Throughput: 0: 4899.3. Samples: 938757362. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:15,619][25689] Avg episode reward: [(0, '-1.135')] [2022-07-10 22:24:16,937][26022] Updated weights on worker 0-0, policy_version 916766 (0.00107) [2022-07-10 22:24:18,855][26022] Updated weights on worker 0-0, policy_version 916776 (0.00096) [2022-07-10 22:24:20,698][25689] Fps is (10 sec: 5572.7, 60 sec: 5544.1, 300 sec: 5533.7). Total num frames: 938788864. Throughput: 0: 5719.5. Samples: 938790536. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:20,699][25689] Avg episode reward: [(0, '-0.740')] [2022-07-10 22:24:20,700][26022] Updated weights on worker 0-0, policy_version 916786 (0.00086) [2022-07-10 22:24:22,800][26022] Updated weights on worker 0-0, policy_version 916796 (0.00085) [2022-07-10 22:24:24,287][26022] Updated weights on worker 0-0, policy_version 916806 (0.00086) [2022-07-10 22:24:25,723][25689] Fps is (10 sec: 5270.0, 60 sec: 5475.3, 300 sec: 5530.4). Total num frames: 938813440. Throughput: 0: 5784.7. Samples: 938823814. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:25,724][25689] Avg episode reward: [(0, '-0.356')] [2022-07-10 22:24:26,322][26022] Updated weights on worker 0-0, policy_version 916816 (0.00085) [2022-07-10 22:24:28,298][26022] Updated weights on worker 0-0, policy_version 916826 (0.00088) [2022-07-10 22:24:30,175][26022] Updated weights on worker 0-0, policy_version 916836 (0.00096) [2022-07-10 22:24:30,735][25689] Fps is (10 sec: 5509.5, 60 sec: 5525.9, 300 sec: 5541.1). Total num frames: 938844160. Throughput: 0: 4933.7. Samples: 938840132. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:30,737][25689] Avg episode reward: [(0, '-0.422')] [2022-07-10 22:24:32,037][26022] Updated weights on worker 0-0, policy_version 916846 (0.00090) [2022-07-10 22:24:33,723][26022] Updated weights on worker 0-0, policy_version 916856 (0.00086) [2022-07-10 22:24:35,566][26022] Updated weights on worker 0-0, policy_version 916866 (0.00087) [2022-07-10 22:24:35,766][25689] Fps is (10 sec: 5709.7, 60 sec: 5490.5, 300 sec: 5535.7). Total num frames: 938870784. Throughput: 0: 5771.9. Samples: 938873650. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:35,767][25689] Avg episode reward: [(0, '-1.319')] [2022-07-10 22:24:37,372][26022] Updated weights on worker 0-0, policy_version 916876 (0.00091) [2022-07-10 22:24:39,250][26022] Updated weights on worker 0-0, policy_version 916886 (0.00087) [2022-07-10 22:24:40,889][25689] Fps is (10 sec: 5445.8, 60 sec: 5503.5, 300 sec: 5537.6). Total num frames: 938899456. Throughput: 0: 5789.7. Samples: 938907432. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:40,890][25689] Avg episode reward: [(0, '-1.461')] [2022-07-10 22:24:41,047][26022] Updated weights on worker 0-0, policy_version 916896 (0.00091) [2022-07-10 22:24:42,895][26022] Updated weights on worker 0-0, policy_version 916906 (0.00093) [2022-07-10 22:24:44,654][26022] Updated weights on worker 0-0, policy_version 916916 (0.00048) [2022-07-10 22:24:45,906][25689] Fps is (10 sec: 5756.1, 60 sec: 5519.4, 300 sec: 5541.2). Total num frames: 938929152. Throughput: 0: 4985.3. Samples: 938924434. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:45,907][25689] Avg episode reward: [(0, '-1.189')] [2022-07-10 22:24:46,399][26022] Updated weights on worker 0-0, policy_version 916926 (0.00085) [2022-07-10 22:24:48,308][26022] Updated weights on worker 0-0, policy_version 916936 (0.00089) [2022-07-10 22:24:50,015][26022] Updated weights on worker 0-0, policy_version 916946 (0.00091) [2022-07-10 22:24:50,931][25689] Fps is (10 sec: 5608.5, 60 sec: 5535.9, 300 sec: 5538.3). Total num frames: 938955776. Throughput: 0: 5848.4. Samples: 938958246. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:50,931][25689] Avg episode reward: [(0, '-0.366')] [2022-07-10 22:24:51,997][26022] Updated weights on worker 0-0, policy_version 916956 (0.00089) [2022-07-10 22:24:53,727][26022] Updated weights on worker 0-0, policy_version 916966 (0.00094) [2022-07-10 22:24:55,769][26022] Updated weights on worker 0-0, policy_version 916976 (0.00088) [2022-07-10 22:24:55,950][25689] Fps is (10 sec: 5505.6, 60 sec: 5535.2, 300 sec: 5540.5). Total num frames: 938984448. Throughput: 0: 5840.8. Samples: 938991540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:24:55,951][25689] Avg episode reward: [(0, '-0.148')] [2022-07-10 22:24:57,451][26022] Updated weights on worker 0-0, policy_version 916986 (0.00085) [2022-07-10 22:24:59,487][26022] Updated weights on worker 0-0, policy_version 916996 (0.00088) [2022-07-10 22:25:00,994][25689] Fps is (10 sec: 5698.5, 60 sec: 5540.0, 300 sec: 5547.4). Total num frames: 939013120. Throughput: 0: 5026.5. Samples: 939008490. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:00,996][25689] Avg episode reward: [(0, '-0.054')] [2022-07-10 22:25:01,006][26022] Updated weights on worker 0-0, policy_version 917006 (0.00086) [2022-07-10 22:25:03,538][26022] Updated weights on worker 0-0, policy_version 917016 (0.00086) [2022-07-10 22:25:05,068][26022] Updated weights on worker 0-0, policy_version 917026 (0.00101) [2022-07-10 22:25:06,019][25689] Fps is (10 sec: 5288.2, 60 sec: 5521.2, 300 sec: 5537.5). Total num frames: 939037696. Throughput: 0: 5746.7. Samples: 939040016. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:06,021][25689] Avg episode reward: [(0, '0.926')] [2022-07-10 22:25:07,106][26022] Updated weights on worker 0-0, policy_version 917036 (0.00087) [2022-07-10 22:25:08,748][26022] Updated weights on worker 0-0, policy_version 917046 (0.00090) [2022-07-10 22:25:10,703][26022] Updated weights on worker 0-0, policy_version 917056 (0.00107) [2022-07-10 22:25:11,048][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:25:11,059][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000917058_939067392.pth [2022-07-10 22:25:11,059][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000915107_937069568.pth [2022-07-10 22:25:11,060][25689] Fps is (10 sec: 5391.2, 60 sec: 5537.5, 300 sec: 5544.3). Total num frames: 939067392. Throughput: 0: 5739.0. Samples: 939073772. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:11,061][25689] Avg episode reward: [(0, '1.588')] [2022-07-10 22:25:12,406][26022] Updated weights on worker 0-0, policy_version 917066 (0.00086) [2022-07-10 22:25:14,305][26022] Updated weights on worker 0-0, policy_version 917076 (0.00083) [2022-07-10 22:25:16,096][25689] Fps is (10 sec: 5690.3, 60 sec: 5536.7, 300 sec: 5541.4). Total num frames: 939095040. Throughput: 0: 4908.9. Samples: 939090438. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:16,098][25689] Avg episode reward: [(0, '1.772')] [2022-07-10 22:25:16,114][26022] Updated weights on worker 0-0, policy_version 917086 (0.00090) [2022-07-10 22:25:18,063][26022] Updated weights on worker 0-0, policy_version 917096 (0.00082) [2022-07-10 22:25:19,885][26022] Updated weights on worker 0-0, policy_version 917106 (0.00089) [2022-07-10 22:25:21,178][25689] Fps is (10 sec: 5465.5, 60 sec: 5519.6, 300 sec: 5534.5). Total num frames: 939122688. Throughput: 0: 5720.3. Samples: 939123948. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:21,178][25689] Avg episode reward: [(0, '0.922')] [2022-07-10 22:25:21,773][26022] Updated weights on worker 0-0, policy_version 917116 (0.00087) [2022-07-10 22:25:23,624][26022] Updated weights on worker 0-0, policy_version 917126 (0.00094) [2022-07-10 22:25:25,362][26022] Updated weights on worker 0-0, policy_version 917136 (0.00082) [2022-07-10 22:25:26,197][25689] Fps is (10 sec: 5575.8, 60 sec: 5587.8, 300 sec: 5542.1). Total num frames: 939151360. Throughput: 0: 5816.0. Samples: 939157374. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:26,198][25689] Avg episode reward: [(0, '0.046')] [2022-07-10 22:25:27,180][26022] Updated weights on worker 0-0, policy_version 917146 (0.00088) [2022-07-10 22:25:29,033][26022] Updated weights on worker 0-0, policy_version 917156 (0.00085) [2022-07-10 22:25:30,833][26022] Updated weights on worker 0-0, policy_version 917166 (0.00088) [2022-07-10 22:25:31,214][25689] Fps is (10 sec: 5611.6, 60 sec: 5536.6, 300 sec: 5535.3). Total num frames: 939179008. Throughput: 0: 4975.9. Samples: 939174054. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:31,215][25689] Avg episode reward: [(0, '-0.242')] [2022-07-10 22:25:32,859][26022] Updated weights on worker 0-0, policy_version 917176 (0.00087) [2022-07-10 22:25:34,447][26022] Updated weights on worker 0-0, policy_version 917186 (0.00104) [2022-07-10 22:25:36,227][25689] Fps is (10 sec: 5513.4, 60 sec: 5555.2, 300 sec: 5536.7). Total num frames: 939206656. Throughput: 0: 5823.8. Samples: 939207672. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:36,227][25689] Avg episode reward: [(0, '-0.304')] [2022-07-10 22:25:36,487][26022] Updated weights on worker 0-0, policy_version 917196 (0.00091) [2022-07-10 22:25:38,017][26022] Updated weights on worker 0-0, policy_version 917206 (0.00090) [2022-07-10 22:25:40,211][26022] Updated weights on worker 0-0, policy_version 917216 (0.00085) [2022-07-10 22:25:41,299][25689] Fps is (10 sec: 5787.6, 60 sec: 5593.7, 300 sec: 5545.9). Total num frames: 939237376. Throughput: 0: 5836.0. Samples: 939241376. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:41,301][25689] Avg episode reward: [(0, '-1.911')] [2022-07-10 22:25:41,764][26022] Updated weights on worker 0-0, policy_version 917226 (0.00089) [2022-07-10 22:25:43,833][26022] Updated weights on worker 0-0, policy_version 917236 (0.00091) [2022-07-10 22:25:45,292][26022] Updated weights on worker 0-0, policy_version 917246 (0.00095) [2022-07-10 22:25:46,380][25689] Fps is (10 sec: 5547.1, 60 sec: 5520.2, 300 sec: 5538.0). Total num frames: 939262976. Throughput: 0: 4991.6. Samples: 939258116. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:46,380][25689] Avg episode reward: [(0, '-2.176')] [2022-07-10 22:25:47,523][26022] Updated weights on worker 0-0, policy_version 917256 (0.00092) [2022-07-10 22:25:49,045][26022] Updated weights on worker 0-0, policy_version 917266 (0.00094) [2022-07-10 22:25:50,968][26022] Updated weights on worker 0-0, policy_version 917276 (0.00090) [2022-07-10 22:25:51,395][25689] Fps is (10 sec: 5477.1, 60 sec: 5571.8, 300 sec: 5541.3). Total num frames: 939292672. Throughput: 0: 5855.5. Samples: 939292224. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:51,397][25689] Avg episode reward: [(0, '-2.150')] [2022-07-10 22:25:52,816][26022] Updated weights on worker 0-0, policy_version 917286 (0.00085) [2022-07-10 22:25:54,677][26022] Updated weights on worker 0-0, policy_version 917296 (0.00614) [2022-07-10 22:25:56,403][25689] Fps is (10 sec: 5823.4, 60 sec: 5572.9, 300 sec: 5543.0). Total num frames: 939321344. Throughput: 0: 5853.5. Samples: 939325772. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:25:56,403][25689] Avg episode reward: [(0, '-1.642')] [2022-07-10 22:25:56,403][26022] Updated weights on worker 0-0, policy_version 917306 (0.00074) [2022-07-10 22:25:58,404][26022] Updated weights on worker 0-0, policy_version 917316 (0.00091) [2022-07-10 22:25:59,918][26022] Updated weights on worker 0-0, policy_version 917326 (0.00086) [2022-07-10 22:26:01,490][25689] Fps is (10 sec: 5578.7, 60 sec: 5551.9, 300 sec: 5551.8). Total num frames: 939348992. Throughput: 0: 5009.4. Samples: 939342522. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:26:01,496][25689] Avg episode reward: [(0, '-1.388')] [2022-07-10 22:26:02,574][26022] Updated weights on worker 0-0, policy_version 917336 (0.00093) [2022-07-10 22:26:04,105][26022] Updated weights on worker 0-0, policy_version 917346 (0.00089) [2022-07-10 22:26:05,955][26022] Updated weights on worker 0-0, policy_version 917356 (0.00086) [2022-07-10 22:26:06,506][25689] Fps is (10 sec: 5371.6, 60 sec: 5586.6, 300 sec: 5545.4). Total num frames: 939375616. Throughput: 0: 5761.0. Samples: 939374064. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:26:06,508][25689] Avg episode reward: [(0, '-1.642')] [2022-07-10 22:26:07,817][26022] Updated weights on worker 0-0, policy_version 917366 (0.00084) [2022-07-10 22:26:09,575][26022] Updated weights on worker 0-0, policy_version 917376 (0.00083) [2022-07-10 22:26:11,534][25689] Fps is (10 sec: 5403.6, 60 sec: 5554.0, 300 sec: 5545.3). Total num frames: 939403264. Throughput: 0: 5739.2. Samples: 939407806. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2022-07-10 22:26:11,534][25689] Avg episode reward: [(0, '-1.302')] [2022-07-10 22:26:11,548][26022] Updated weights on worker 0-0, policy_version 917386 (0.00171) [2022-07-10 22:26:13,363][26022] Updated weights on worker 0-0, policy_version 917396 (0.00089) [2022-07-10 22:26:14,901][26022] Updated weights on worker 0-0, policy_version 917406 (0.00085) [2022-07-10 22:26:16,597][25689] Fps is (10 sec: 5479.9, 60 sec: 5551.6, 300 sec: 5543.0). Total num frames: 939430912. Throughput: 0: 4907.0. Samples: 939424866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:16,599][25689] Avg episode reward: [(0, '-1.849')] [2022-07-10 22:26:17,034][26022] Updated weights on worker 0-0, policy_version 917416 (0.00091) [2022-07-10 22:26:18,622][26022] Updated weights on worker 0-0, policy_version 917426 (0.00376) [2022-07-10 22:26:20,796][26022] Updated weights on worker 0-0, policy_version 917436 (0.00090) [2022-07-10 22:26:21,652][25689] Fps is (10 sec: 5667.6, 60 sec: 5587.8, 300 sec: 5549.5). Total num frames: 939460608. Throughput: 0: 5753.1. Samples: 939458512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:21,652][25689] Avg episode reward: [(0, '-3.382')] [2022-07-10 22:26:22,294][26022] Updated weights on worker 0-0, policy_version 917446 (0.00086) [2022-07-10 22:26:24,283][26022] Updated weights on worker 0-0, policy_version 917456 (0.00085) [2022-07-10 22:26:26,111][26022] Updated weights on worker 0-0, policy_version 917466 (0.00079) [2022-07-10 22:26:26,705][25689] Fps is (10 sec: 5571.7, 60 sec: 5550.9, 300 sec: 5541.7). Total num frames: 939487232. Throughput: 0: 5845.2. Samples: 939492132. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:26,707][25689] Avg episode reward: [(0, '-3.362')] [2022-07-10 22:26:28,006][26022] Updated weights on worker 0-0, policy_version 917476 (0.00613) [2022-07-10 22:26:29,680][26022] Updated weights on worker 0-0, policy_version 917486 (0.00094) [2022-07-10 22:26:31,498][26022] Updated weights on worker 0-0, policy_version 917496 (0.00089) [2022-07-10 22:26:31,741][25689] Fps is (10 sec: 5480.4, 60 sec: 5566.0, 300 sec: 5541.3). Total num frames: 939515904. Throughput: 0: 5831.2. Samples: 939525640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:31,742][25689] Avg episode reward: [(0, '-3.139')] [2022-07-10 22:26:33,384][26022] Updated weights on worker 0-0, policy_version 917506 (0.00088) [2022-07-10 22:26:35,105][26022] Updated weights on worker 0-0, policy_version 917516 (0.00085) [2022-07-10 22:26:36,746][25689] Fps is (10 sec: 5710.9, 60 sec: 5583.6, 300 sec: 5549.0). Total num frames: 939544576. Throughput: 0: 5841.2. Samples: 939542562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:36,747][25689] Avg episode reward: [(0, '-3.194')] [2022-07-10 22:26:37,240][26022] Updated weights on worker 0-0, policy_version 917526 (0.00087) [2022-07-10 22:26:38,904][26022] Updated weights on worker 0-0, policy_version 917536 (0.00095) [2022-07-10 22:26:40,860][26022] Updated weights on worker 0-0, policy_version 917546 (0.00093) [2022-07-10 22:26:41,812][25689] Fps is (10 sec: 5694.4, 60 sec: 5550.4, 300 sec: 5547.9). Total num frames: 939573248. Throughput: 0: 5839.2. Samples: 939576230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:41,812][25689] Avg episode reward: [(0, '-2.085')] [2022-07-10 22:26:42,495][26022] Updated weights on worker 0-0, policy_version 917556 (0.00086) [2022-07-10 22:26:44,466][26022] Updated weights on worker 0-0, policy_version 917566 (0.00361) [2022-07-10 22:26:46,304][26022] Updated weights on worker 0-0, policy_version 917576 (0.00088) [2022-07-10 22:26:46,859][25689] Fps is (10 sec: 5569.2, 60 sec: 5587.4, 300 sec: 5547.4). Total num frames: 939600896. Throughput: 0: 5835.8. Samples: 939609746. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:46,860][25689] Avg episode reward: [(0, '-1.805')] [2022-07-10 22:26:48,039][26022] Updated weights on worker 0-0, policy_version 917586 (0.00087) [2022-07-10 22:26:49,936][26022] Updated weights on worker 0-0, policy_version 917596 (0.00090) [2022-07-10 22:26:51,789][26022] Updated weights on worker 0-0, policy_version 917606 (0.00090) [2022-07-10 22:26:51,885][25689] Fps is (10 sec: 5489.1, 60 sec: 5552.5, 300 sec: 5543.7). Total num frames: 939628544. Throughput: 0: 5013.8. Samples: 939626638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:51,886][25689] Avg episode reward: [(0, '-0.649')] [2022-07-10 22:26:53,528][26022] Updated weights on worker 0-0, policy_version 917616 (0.00092) [2022-07-10 22:26:55,394][26022] Updated weights on worker 0-0, policy_version 917626 (0.00094) [2022-07-10 22:26:56,911][25689] Fps is (10 sec: 5602.8, 60 sec: 5550.8, 300 sec: 5548.7). Total num frames: 939657216. Throughput: 0: 5830.4. Samples: 939660132. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:26:56,912][25689] Avg episode reward: [(0, '-0.992')] [2022-07-10 22:26:57,192][26022] Updated weights on worker 0-0, policy_version 917636 (0.00085) [2022-07-10 22:26:59,057][26022] Updated weights on worker 0-0, policy_version 917646 (0.00084) [2022-07-10 22:27:01,064][26022] Updated weights on worker 0-0, policy_version 917656 (0.00087) [2022-07-10 22:27:01,994][25689] Fps is (10 sec: 5470.1, 60 sec: 5534.3, 300 sec: 5554.2). Total num frames: 939683840. Throughput: 0: 5777.4. Samples: 939692832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:01,995][25689] Avg episode reward: [(0, '-1.488')] [2022-07-10 22:27:03,024][26022] Updated weights on worker 0-0, policy_version 917666 (0.00102) [2022-07-10 22:27:04,959][26022] Updated weights on worker 0-0, policy_version 917676 (0.00093) [2022-07-10 22:27:06,556][26022] Updated weights on worker 0-0, policy_version 917686 (0.00090) [2022-07-10 22:27:06,996][25689] Fps is (10 sec: 5483.2, 60 sec: 5569.5, 300 sec: 5548.0). Total num frames: 939712512. Throughput: 0: 4918.6. Samples: 939708790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:06,996][25689] Avg episode reward: [(0, '-0.472')] [2022-07-10 22:27:08,518][26022] Updated weights on worker 0-0, policy_version 917696 (0.00089) [2022-07-10 22:27:10,149][26022] Updated weights on worker 0-0, policy_version 917706 (0.00090) [2022-07-10 22:27:11,126][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:27:11,140][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000917710_939735040.pth [2022-07-10 22:27:11,141][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000915756_937734144.pth [2022-07-10 22:27:12,001][25689] Fps is (10 sec: 5525.7, 60 sec: 5554.6, 300 sec: 5548.1). Total num frames: 939739136. Throughput: 0: 5749.3. Samples: 939742290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:12,002][25689] Avg episode reward: [(0, '-0.645')] [2022-07-10 22:27:12,216][26022] Updated weights on worker 0-0, policy_version 917716 (0.00081) [2022-07-10 22:27:13,946][26022] Updated weights on worker 0-0, policy_version 917726 (0.00092) [2022-07-10 22:27:15,852][26022] Updated weights on worker 0-0, policy_version 917736 (0.00087) [2022-07-10 22:27:17,013][25689] Fps is (10 sec: 5622.5, 60 sec: 5593.2, 300 sec: 5555.4). Total num frames: 939768832. Throughput: 0: 5769.6. Samples: 939776110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:17,013][25689] Avg episode reward: [(0, '-0.271')] [2022-07-10 22:27:17,511][26022] Updated weights on worker 0-0, policy_version 917746 (0.00080) [2022-07-10 22:27:19,387][26022] Updated weights on worker 0-0, policy_version 917756 (0.00090) [2022-07-10 22:27:21,293][26022] Updated weights on worker 0-0, policy_version 917766 (0.00082) [2022-07-10 22:27:22,119][25689] Fps is (10 sec: 5667.7, 60 sec: 5554.6, 300 sec: 5553.8). Total num frames: 939796480. Throughput: 0: 4972.3. Samples: 939792898. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:22,119][25689] Avg episode reward: [(0, '0.148')] [2022-07-10 22:27:23,098][26022] Updated weights on worker 0-0, policy_version 917776 (0.00084) [2022-07-10 22:27:24,915][26022] Updated weights on worker 0-0, policy_version 917786 (0.00092) [2022-07-10 22:27:26,846][26022] Updated weights on worker 0-0, policy_version 917796 (0.00080) [2022-07-10 22:27:27,132][25689] Fps is (10 sec: 5464.5, 60 sec: 5575.3, 300 sec: 5550.2). Total num frames: 939824128. Throughput: 0: 5834.2. Samples: 939826268. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:27,132][25689] Avg episode reward: [(0, '0.171')] [2022-07-10 22:27:28,677][26022] Updated weights on worker 0-0, policy_version 917806 (0.00088) [2022-07-10 22:27:30,423][26022] Updated weights on worker 0-0, policy_version 917816 (0.00091) [2022-07-10 22:27:32,153][25689] Fps is (10 sec: 5510.6, 60 sec: 5559.7, 300 sec: 5550.0). Total num frames: 939851776. Throughput: 0: 5839.1. Samples: 939859960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:32,154][25689] Avg episode reward: [(0, '0.611')] [2022-07-10 22:27:32,353][26022] Updated weights on worker 0-0, policy_version 917826 (0.00090) [2022-07-10 22:27:33,977][26022] Updated weights on worker 0-0, policy_version 917836 (0.00085) [2022-07-10 22:27:36,138][26022] Updated weights on worker 0-0, policy_version 917846 (0.00094) [2022-07-10 22:27:37,211][25689] Fps is (10 sec: 5587.9, 60 sec: 5554.9, 300 sec: 5554.8). Total num frames: 939880448. Throughput: 0: 4981.7. Samples: 939876730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:37,211][25689] Avg episode reward: [(0, '0.564')] [2022-07-10 22:27:37,777][26022] Updated weights on worker 0-0, policy_version 917856 (0.00079) [2022-07-10 22:27:39,656][26022] Updated weights on worker 0-0, policy_version 917866 (0.00094) [2022-07-10 22:27:41,551][26022] Updated weights on worker 0-0, policy_version 917876 (0.00097) [2022-07-10 22:27:42,354][25689] Fps is (10 sec: 5521.4, 60 sec: 5530.8, 300 sec: 5542.4). Total num frames: 939908096. Throughput: 0: 5792.0. Samples: 939910098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:42,354][25689] Avg episode reward: [(0, '0.134')] [2022-07-10 22:27:43,200][26022] Updated weights on worker 0-0, policy_version 917886 (0.00089) [2022-07-10 22:27:45,243][26022] Updated weights on worker 0-0, policy_version 917896 (0.00091) [2022-07-10 22:27:47,056][26022] Updated weights on worker 0-0, policy_version 917906 (0.00096) [2022-07-10 22:27:47,369][25689] Fps is (10 sec: 5544.3, 60 sec: 5550.7, 300 sec: 5549.4). Total num frames: 939936768. Throughput: 0: 5806.9. Samples: 939943784. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:47,369][25689] Avg episode reward: [(0, '0.258')] [2022-07-10 22:27:48,753][26022] Updated weights on worker 0-0, policy_version 917916 (0.00380) [2022-07-10 22:27:50,890][26022] Updated weights on worker 0-0, policy_version 917926 (0.00097) [2022-07-10 22:27:52,296][26022] Updated weights on worker 0-0, policy_version 917936 (0.00085) [2022-07-10 22:27:52,423][25689] Fps is (10 sec: 5796.6, 60 sec: 5582.0, 300 sec: 5555.4). Total num frames: 939966464. Throughput: 0: 4968.3. Samples: 939960666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:52,423][25689] Avg episode reward: [(0, '0.711')] [2022-07-10 22:27:54,496][26022] Updated weights on worker 0-0, policy_version 917946 (0.00085) [2022-07-10 22:27:56,247][26022] Updated weights on worker 0-0, policy_version 917956 (0.00077) [2022-07-10 22:27:57,486][25689] Fps is (10 sec: 5465.7, 60 sec: 5527.9, 300 sec: 5538.5). Total num frames: 939992064. Throughput: 0: 5771.2. Samples: 939993742. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:27:57,486][25689] Avg episode reward: [(0, '0.619')] [2022-07-10 22:27:58,073][26022] Updated weights on worker 0-0, policy_version 917966 (0.00094) [2022-07-10 22:28:00,023][26022] Updated weights on worker 0-0, policy_version 917976 (0.00103) [2022-07-10 22:28:02,108][26022] Updated weights on worker 0-0, policy_version 917986 (0.00087) [2022-07-10 22:28:02,558][25689] Fps is (10 sec: 5253.8, 60 sec: 5545.7, 300 sec: 5547.6). Total num frames: 940019712. Throughput: 0: 5676.4. Samples: 940024788. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:02,559][25689] Avg episode reward: [(0, '0.618')] [2022-07-10 22:28:04,066][26022] Updated weights on worker 0-0, policy_version 917996 (0.00078) [2022-07-10 22:28:05,690][26022] Updated weights on worker 0-0, policy_version 918006 (0.00095) [2022-07-10 22:28:07,589][25689] Fps is (10 sec: 5472.8, 60 sec: 5526.1, 300 sec: 5550.8). Total num frames: 940047360. Throughput: 0: 4844.0. Samples: 940041732. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:07,590][25689] Avg episode reward: [(0, '0.509')] [2022-07-10 22:28:07,625][26022] Updated weights on worker 0-0, policy_version 918016 (0.00088) [2022-07-10 22:28:09,565][26022] Updated weights on worker 0-0, policy_version 918026 (0.00087) [2022-07-10 22:28:11,290][26022] Updated weights on worker 0-0, policy_version 918036 (0.00083) [2022-07-10 22:28:12,621][25689] Fps is (10 sec: 5495.3, 60 sec: 5540.7, 300 sec: 5544.1). Total num frames: 940075008. Throughput: 0: 5680.2. Samples: 940075392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:12,621][25689] Avg episode reward: [(0, '0.352')] [2022-07-10 22:28:13,202][26022] Updated weights on worker 0-0, policy_version 918046 (0.00091) [2022-07-10 22:28:14,874][26022] Updated weights on worker 0-0, policy_version 918056 (0.00079) [2022-07-10 22:28:16,692][26022] Updated weights on worker 0-0, policy_version 918066 (0.00090) [2022-07-10 22:28:17,659][25689] Fps is (10 sec: 5593.0, 60 sec: 5521.3, 300 sec: 5547.7). Total num frames: 940103680. Throughput: 0: 5713.5. Samples: 940109004. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:17,660][25689] Avg episode reward: [(0, '0.056')] [2022-07-10 22:28:18,671][26022] Updated weights on worker 0-0, policy_version 918076 (0.00089) [2022-07-10 22:28:20,490][26022] Updated weights on worker 0-0, policy_version 918086 (0.00088) [2022-07-10 22:28:22,270][26022] Updated weights on worker 0-0, policy_version 918096 (0.00088) [2022-07-10 22:28:22,805][25689] Fps is (10 sec: 5630.6, 60 sec: 5534.6, 300 sec: 5545.3). Total num frames: 940132352. Throughput: 0: 4985.6. Samples: 940125730. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:22,805][25689] Avg episode reward: [(0, '-0.073')] [2022-07-10 22:28:24,203][26022] Updated weights on worker 0-0, policy_version 918106 (0.00091) [2022-07-10 22:28:25,948][26022] Updated weights on worker 0-0, policy_version 918116 (0.00100) [2022-07-10 22:28:27,774][26022] Updated weights on worker 0-0, policy_version 918126 (0.00086) [2022-07-10 22:28:27,872][25689] Fps is (10 sec: 5614.9, 60 sec: 5546.5, 300 sec: 5547.6). Total num frames: 940161024. Throughput: 0: 5789.6. Samples: 940159160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:27,873][25689] Avg episode reward: [(0, '-0.050')] [2022-07-10 22:28:29,567][26022] Updated weights on worker 0-0, policy_version 918136 (0.00088) [2022-07-10 22:28:31,463][26022] Updated weights on worker 0-0, policy_version 918146 (0.00089) [2022-07-10 22:28:32,955][25689] Fps is (10 sec: 5549.0, 60 sec: 5540.9, 300 sec: 5542.9). Total num frames: 940188672. Throughput: 0: 5775.5. Samples: 940192830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:32,955][25689] Avg episode reward: [(0, '0.272')] [2022-07-10 22:28:33,364][26022] Updated weights on worker 0-0, policy_version 918156 (0.00081) [2022-07-10 22:28:35,091][26022] Updated weights on worker 0-0, policy_version 918166 (0.00088) [2022-07-10 22:28:37,071][26022] Updated weights on worker 0-0, policy_version 918176 (0.01297) [2022-07-10 22:28:37,957][25689] Fps is (10 sec: 5584.6, 60 sec: 5545.9, 300 sec: 5547.8). Total num frames: 940217344. Throughput: 0: 5786.7. Samples: 940226462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:37,957][25689] Avg episode reward: [(0, '0.455')] [2022-07-10 22:28:38,854][26022] Updated weights on worker 0-0, policy_version 918186 (0.00104) [2022-07-10 22:28:40,735][26022] Updated weights on worker 0-0, policy_version 918196 (0.00088) [2022-07-10 22:28:42,526][26022] Updated weights on worker 0-0, policy_version 918206 (0.00623) [2022-07-10 22:28:43,096][25689] Fps is (10 sec: 5553.5, 60 sec: 5546.3, 300 sec: 5541.9). Total num frames: 940244992. Throughput: 0: 5786.4. Samples: 940243142. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:43,097][25689] Avg episode reward: [(0, '-0.106')] [2022-07-10 22:28:44,119][26022] Updated weights on worker 0-0, policy_version 918216 (0.00085) [2022-07-10 22:28:46,276][26022] Updated weights on worker 0-0, policy_version 918226 (0.00090) [2022-07-10 22:28:47,952][26022] Updated weights on worker 0-0, policy_version 918236 (0.00271) [2022-07-10 22:28:48,115][25689] Fps is (10 sec: 5544.4, 60 sec: 5546.0, 300 sec: 5552.2). Total num frames: 940273664. Throughput: 0: 5815.3. Samples: 940276878. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:48,115][25689] Avg episode reward: [(0, '-0.563')] [2022-07-10 22:28:49,806][26022] Updated weights on worker 0-0, policy_version 918246 (0.00105) [2022-07-10 22:28:51,683][26022] Updated weights on worker 0-0, policy_version 918256 (0.00104) [2022-07-10 22:28:53,159][25689] Fps is (10 sec: 5698.7, 60 sec: 5530.1, 300 sec: 5551.6). Total num frames: 940302336. Throughput: 0: 5812.1. Samples: 940310258. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:53,159][25689] Avg episode reward: [(0, '-0.664')] [2022-07-10 22:28:53,686][26022] Updated weights on worker 0-0, policy_version 918266 (0.00094) [2022-07-10 22:28:55,514][26022] Updated weights on worker 0-0, policy_version 918276 (0.00083) [2022-07-10 22:28:57,240][26022] Updated weights on worker 0-0, policy_version 918286 (0.00093) [2022-07-10 22:28:58,183][25689] Fps is (10 sec: 5492.3, 60 sec: 5550.4, 300 sec: 5546.1). Total num frames: 940328960. Throughput: 0: 4961.8. Samples: 940326818. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:28:58,183][25689] Avg episode reward: [(0, '-0.742')] [2022-07-10 22:28:59,084][26022] Updated weights on worker 0-0, policy_version 918296 (0.00092) [2022-07-10 22:29:00,907][26022] Updated weights on worker 0-0, policy_version 918306 (0.00094) [2022-07-10 22:29:03,097][26022] Updated weights on worker 0-0, policy_version 918316 (0.00091) [2022-07-10 22:29:03,245][25689] Fps is (10 sec: 5380.9, 60 sec: 5551.4, 300 sec: 5551.9). Total num frames: 940356608. Throughput: 0: 5809.2. Samples: 940360190. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:03,245][25689] Avg episode reward: [(0, '-1.028')] [2022-07-10 22:29:04,962][26022] Updated weights on worker 0-0, policy_version 918326 (0.00094) [2022-07-10 22:29:06,702][26022] Updated weights on worker 0-0, policy_version 918336 (0.00090) [2022-07-10 22:29:08,263][25689] Fps is (10 sec: 5384.2, 60 sec: 5535.8, 300 sec: 5545.3). Total num frames: 940383232. Throughput: 0: 5738.2. Samples: 940392490. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:08,263][25689] Avg episode reward: [(0, '-1.204')] [2022-07-10 22:29:08,559][26022] Updated weights on worker 0-0, policy_version 918346 (0.00089) [2022-07-10 22:29:10,509][26022] Updated weights on worker 0-0, policy_version 918356 (0.00094) [2022-07-10 22:29:11,438][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:29:11,456][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000918362_940402688.pth [2022-07-10 22:29:11,471][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000916408_938401792.pth [2022-07-10 22:29:12,104][26022] Updated weights on worker 0-0, policy_version 918366 (0.00089) [2022-07-10 22:29:13,272][25689] Fps is (10 sec: 5514.8, 60 sec: 5554.7, 300 sec: 5549.1). Total num frames: 940411904. Throughput: 0: 4929.9. Samples: 940409412. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:13,272][25689] Avg episode reward: [(0, '-0.483')] [2022-07-10 22:29:14,128][26022] Updated weights on worker 0-0, policy_version 918376 (0.00096) [2022-07-10 22:29:15,608][26022] Updated weights on worker 0-0, policy_version 918386 (0.00095) [2022-07-10 22:29:17,815][26022] Updated weights on worker 0-0, policy_version 918396 (0.00106) [2022-07-10 22:29:18,295][25689] Fps is (10 sec: 5716.1, 60 sec: 5556.1, 300 sec: 5550.2). Total num frames: 940440576. Throughput: 0: 5780.4. Samples: 940443072. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:18,295][25689] Avg episode reward: [(0, '-1.281')] [2022-07-10 22:29:19,461][26022] Updated weights on worker 0-0, policy_version 918406 (0.00092) [2022-07-10 22:29:21,339][26022] Updated weights on worker 0-0, policy_version 918416 (0.00082) [2022-07-10 22:29:23,233][26022] Updated weights on worker 0-0, policy_version 918426 (0.00084) [2022-07-10 22:29:23,379][25689] Fps is (10 sec: 5673.5, 60 sec: 5561.7, 300 sec: 5562.8). Total num frames: 940469248. Throughput: 0: 5766.5. Samples: 940476294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:23,379][25689] Avg episode reward: [(0, '-0.552')] [2022-07-10 22:29:24,865][26022] Updated weights on worker 0-0, policy_version 918436 (0.00086) [2022-07-10 22:29:27,126][26022] Updated weights on worker 0-0, policy_version 918446 (0.00087) [2022-07-10 22:29:28,451][25689] Fps is (10 sec: 5545.4, 60 sec: 5544.4, 300 sec: 5551.4). Total num frames: 940496896. Throughput: 0: 4970.1. Samples: 940492826. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:28,451][25689] Avg episode reward: [(0, '-0.615')] [2022-07-10 22:29:28,779][26022] Updated weights on worker 0-0, policy_version 918456 (0.00091) [2022-07-10 22:29:30,604][26022] Updated weights on worker 0-0, policy_version 918466 (0.00086) [2022-07-10 22:29:32,471][26022] Updated weights on worker 0-0, policy_version 918476 (0.00096) [2022-07-10 22:29:33,465][25689] Fps is (10 sec: 5380.8, 60 sec: 5533.7, 300 sec: 5551.7). Total num frames: 940523520. Throughput: 0: 5783.5. Samples: 940526202. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:33,467][25689] Avg episode reward: [(0, '-0.822')] [2022-07-10 22:29:34,191][26022] Updated weights on worker 0-0, policy_version 918486 (0.00099) [2022-07-10 22:29:36,276][26022] Updated weights on worker 0-0, policy_version 918496 (0.00093) [2022-07-10 22:29:37,864][26022] Updated weights on worker 0-0, policy_version 918506 (0.00087) [2022-07-10 22:29:38,538][25689] Fps is (10 sec: 5583.1, 60 sec: 5544.1, 300 sec: 5556.1). Total num frames: 940553216. Throughput: 0: 5761.9. Samples: 940559714. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:38,539][25689] Avg episode reward: [(0, '-0.906')] [2022-07-10 22:29:39,817][26022] Updated weights on worker 0-0, policy_version 918516 (0.00085) [2022-07-10 22:29:41,617][26022] Updated weights on worker 0-0, policy_version 918526 (0.00089) [2022-07-10 22:29:43,606][25689] Fps is (10 sec: 5654.9, 60 sec: 5550.7, 300 sec: 5548.2). Total num frames: 940580864. Throughput: 0: 4951.9. Samples: 940576456. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:43,606][25689] Avg episode reward: [(0, '0.124')] [2022-07-10 22:29:43,606][26022] Updated weights on worker 0-0, policy_version 918536 (0.00060) [2022-07-10 22:29:45,386][26022] Updated weights on worker 0-0, policy_version 918546 (0.00089) [2022-07-10 22:29:47,189][26022] Updated weights on worker 0-0, policy_version 918556 (0.00082) [2022-07-10 22:29:48,645][25689] Fps is (10 sec: 5471.4, 60 sec: 5531.9, 300 sec: 5551.4). Total num frames: 940608512. Throughput: 0: 5802.2. Samples: 940609996. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:48,645][25689] Avg episode reward: [(0, '-0.965')] [2022-07-10 22:29:48,905][26022] Updated weights on worker 0-0, policy_version 918566 (0.00086) [2022-07-10 22:29:50,774][26022] Updated weights on worker 0-0, policy_version 918576 (0.00094) [2022-07-10 22:29:52,723][26022] Updated weights on worker 0-0, policy_version 918586 (0.00092) [2022-07-10 22:29:53,655][25689] Fps is (10 sec: 5502.6, 60 sec: 5518.1, 300 sec: 5548.1). Total num frames: 940636160. Throughput: 0: 5804.8. Samples: 940643398. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:53,655][25689] Avg episode reward: [(0, '-0.995')] [2022-07-10 22:29:54,635][26022] Updated weights on worker 0-0, policy_version 918596 (0.00087) [2022-07-10 22:29:56,343][26022] Updated weights on worker 0-0, policy_version 918606 (0.00083) [2022-07-10 22:29:58,354][26022] Updated weights on worker 0-0, policy_version 918616 (0.00083) [2022-07-10 22:29:58,688][25689] Fps is (10 sec: 5709.9, 60 sec: 5568.1, 300 sec: 5551.8). Total num frames: 940665856. Throughput: 0: 4966.0. Samples: 940659776. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 22:29:58,688][25689] Avg episode reward: [(0, '-1.291')] [2022-07-10 22:30:00,160][26022] Updated weights on worker 0-0, policy_version 918626 (0.00108) [2022-07-10 22:30:02,154][26022] Updated weights on worker 0-0, policy_version 918636 (0.00088) [2022-07-10 22:30:03,756][25689] Fps is (10 sec: 5372.9, 60 sec: 5516.8, 300 sec: 5551.0). Total num frames: 940690432. Throughput: 0: 5737.5. Samples: 940692066. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:03,756][25689] Avg episode reward: [(0, '-2.010')] [2022-07-10 22:30:04,284][26022] Updated weights on worker 0-0, policy_version 918646 (0.00091) [2022-07-10 22:30:05,866][26022] Updated weights on worker 0-0, policy_version 918656 (0.00088) [2022-07-10 22:30:08,155][26022] Updated weights on worker 0-0, policy_version 918666 (0.00093) [2022-07-10 22:30:08,771][25689] Fps is (10 sec: 5382.6, 60 sec: 5567.8, 300 sec: 5551.5). Total num frames: 940720128. Throughput: 0: 5702.3. Samples: 940724760. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:08,771][25689] Avg episode reward: [(0, '-2.061')] [2022-07-10 22:30:09,514][26022] Updated weights on worker 0-0, policy_version 918676 (0.01188) [2022-07-10 22:30:11,611][26022] Updated weights on worker 0-0, policy_version 918686 (0.00093) [2022-07-10 22:30:13,477][26022] Updated weights on worker 0-0, policy_version 918696 (0.00088) [2022-07-10 22:30:13,785][25689] Fps is (10 sec: 5615.4, 60 sec: 5533.4, 300 sec: 5548.4). Total num frames: 940746752. Throughput: 0: 4869.9. Samples: 940741432. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:13,786][25689] Avg episode reward: [(0, '-3.487')] [2022-07-10 22:30:15,290][26022] Updated weights on worker 0-0, policy_version 918706 (0.00089) [2022-07-10 22:30:17,042][26022] Updated weights on worker 0-0, policy_version 918716 (0.00083) [2022-07-10 22:30:18,804][25689] Fps is (10 sec: 5409.0, 60 sec: 5516.9, 300 sec: 5549.6). Total num frames: 940774400. Throughput: 0: 5732.8. Samples: 940775100. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:18,805][25689] Avg episode reward: [(0, '-2.340')] [2022-07-10 22:30:18,837][26022] Updated weights on worker 0-0, policy_version 918726 (0.00096) [2022-07-10 22:30:20,740][26022] Updated weights on worker 0-0, policy_version 918736 (0.00097) [2022-07-10 22:30:22,492][26022] Updated weights on worker 0-0, policy_version 918746 (0.00088) [2022-07-10 22:30:23,848][25689] Fps is (10 sec: 5495.1, 60 sec: 5503.6, 300 sec: 5545.7). Total num frames: 940802048. Throughput: 0: 5795.1. Samples: 940808504. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:23,849][25689] Avg episode reward: [(0, '-2.579')] [2022-07-10 22:30:24,381][26022] Updated weights on worker 0-0, policy_version 918756 (0.00096) [2022-07-10 22:30:26,377][26022] Updated weights on worker 0-0, policy_version 918766 (0.00091) [2022-07-10 22:30:28,103][26022] Updated weights on worker 0-0, policy_version 918776 (0.00082) [2022-07-10 22:30:28,884][25689] Fps is (10 sec: 5587.4, 60 sec: 5523.8, 300 sec: 5548.8). Total num frames: 940830720. Throughput: 0: 4982.7. Samples: 940824980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:28,885][25689] Avg episode reward: [(0, '-1.553')] [2022-07-10 22:30:29,959][26022] Updated weights on worker 0-0, policy_version 918786 (0.00096) [2022-07-10 22:30:31,878][26022] Updated weights on worker 0-0, policy_version 918796 (0.00096) [2022-07-10 22:30:33,520][26022] Updated weights on worker 0-0, policy_version 918806 (0.00083) [2022-07-10 22:30:33,892][25689] Fps is (10 sec: 5607.4, 60 sec: 5541.4, 300 sec: 5548.9). Total num frames: 940858368. Throughput: 0: 5818.3. Samples: 940858418. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:33,893][25689] Avg episode reward: [(0, '-1.639')] [2022-07-10 22:30:35,496][26022] Updated weights on worker 0-0, policy_version 918816 (0.00094) [2022-07-10 22:30:37,242][26022] Updated weights on worker 0-0, policy_version 918826 (0.00089) [2022-07-10 22:30:38,899][25689] Fps is (10 sec: 5521.6, 60 sec: 5513.6, 300 sec: 5539.8). Total num frames: 940886016. Throughput: 0: 5823.1. Samples: 940892112. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:38,899][25689] Avg episode reward: [(0, '-1.173')] [2022-07-10 22:30:39,074][26022] Updated weights on worker 0-0, policy_version 918836 (0.00083) [2022-07-10 22:30:40,908][26022] Updated weights on worker 0-0, policy_version 918846 (0.00095) [2022-07-10 22:30:42,752][26022] Updated weights on worker 0-0, policy_version 918856 (0.00085) [2022-07-10 22:30:43,950][25689] Fps is (10 sec: 5701.1, 60 sec: 5548.9, 300 sec: 5554.1). Total num frames: 940915712. Throughput: 0: 4988.7. Samples: 940908788. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:43,951][25689] Avg episode reward: [(0, '0.191')] [2022-07-10 22:30:44,688][26022] Updated weights on worker 0-0, policy_version 918866 (0.00084) [2022-07-10 22:30:46,415][26022] Updated weights on worker 0-0, policy_version 918876 (0.00086) [2022-07-10 22:30:48,272][26022] Updated weights on worker 0-0, policy_version 918886 (0.00084) [2022-07-10 22:30:49,003][25689] Fps is (10 sec: 5573.8, 60 sec: 5530.7, 300 sec: 5543.1). Total num frames: 940942336. Throughput: 0: 5830.9. Samples: 940942292. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:49,004][25689] Avg episode reward: [(0, '0.627')] [2022-07-10 22:30:50,087][26022] Updated weights on worker 0-0, policy_version 918896 (0.00088) [2022-07-10 22:30:52,036][26022] Updated weights on worker 0-0, policy_version 918906 (0.00087) [2022-07-10 22:30:53,647][26022] Updated weights on worker 0-0, policy_version 918916 (0.00108) [2022-07-10 22:30:54,015][25689] Fps is (10 sec: 5392.5, 60 sec: 5530.6, 300 sec: 5539.6). Total num frames: 940969984. Throughput: 0: 5827.4. Samples: 940975680. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:54,015][25689] Avg episode reward: [(0, '1.042')] [2022-07-10 22:30:55,739][26022] Updated weights on worker 0-0, policy_version 918926 (0.00087) [2022-07-10 22:30:57,603][26022] Updated weights on worker 0-0, policy_version 918936 (0.00089) [2022-07-10 22:30:59,030][25689] Fps is (10 sec: 5514.8, 60 sec: 5498.2, 300 sec: 5540.9). Total num frames: 940997632. Throughput: 0: 4974.4. Samples: 940992254. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:30:59,031][25689] Avg episode reward: [(0, '0.239')] [2022-07-10 22:30:59,586][26022] Updated weights on worker 0-0, policy_version 918946 (0.00083) [2022-07-10 22:31:01,266][26022] Updated weights on worker 0-0, policy_version 918956 (0.00092) [2022-07-10 22:31:03,659][26022] Updated weights on worker 0-0, policy_version 918966 (0.00099) [2022-07-10 22:31:04,096][25689] Fps is (10 sec: 5383.6, 60 sec: 5532.4, 300 sec: 5540.0). Total num frames: 941024256. Throughput: 0: 5678.1. Samples: 941023174. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:04,096][25689] Avg episode reward: [(0, '-0.007')] [2022-07-10 22:31:05,187][26022] Updated weights on worker 0-0, policy_version 918976 (0.00977) [2022-07-10 22:31:07,256][26022] Updated weights on worker 0-0, policy_version 918986 (0.00087) [2022-07-10 22:31:08,975][26022] Updated weights on worker 0-0, policy_version 918996 (0.00067) [2022-07-10 22:31:09,142][25689] Fps is (10 sec: 5366.9, 60 sec: 5495.6, 300 sec: 5539.6). Total num frames: 941051904. Throughput: 0: 5686.9. Samples: 941056822. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:09,143][25689] Avg episode reward: [(0, '0.068')] [2022-07-10 22:31:10,787][26022] Updated weights on worker 0-0, policy_version 919006 (0.00086) [2022-07-10 22:31:11,524][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:31:11,539][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000919010_941066240.pth [2022-07-10 22:31:11,539][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000917058_939067392.pth [2022-07-10 22:31:12,722][26022] Updated weights on worker 0-0, policy_version 919016 (0.00087) [2022-07-10 22:31:14,200][25689] Fps is (10 sec: 5573.9, 60 sec: 5525.6, 300 sec: 5543.2). Total num frames: 941080576. Throughput: 0: 4858.5. Samples: 941073750. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:14,200][25689] Avg episode reward: [(0, '-0.035')] [2022-07-10 22:31:14,493][26022] Updated weights on worker 0-0, policy_version 919026 (0.00100) [2022-07-10 22:31:16,161][26022] Updated weights on worker 0-0, policy_version 919036 (0.00088) [2022-07-10 22:31:17,945][26022] Updated weights on worker 0-0, policy_version 919046 (0.00088) [2022-07-10 22:31:19,203][25689] Fps is (10 sec: 5700.1, 60 sec: 5544.0, 300 sec: 5540.7). Total num frames: 941109248. Throughput: 0: 5709.9. Samples: 941107436. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:19,203][25689] Avg episode reward: [(0, '-0.603')] [2022-07-10 22:31:19,862][26022] Updated weights on worker 0-0, policy_version 919056 (0.00090) [2022-07-10 22:31:21,631][26022] Updated weights on worker 0-0, policy_version 919066 (0.00098) [2022-07-10 22:31:23,734][26022] Updated weights on worker 0-0, policy_version 919076 (0.00089) [2022-07-10 22:31:24,300][25689] Fps is (10 sec: 5475.0, 60 sec: 5522.2, 300 sec: 5539.9). Total num frames: 941135872. Throughput: 0: 5830.7. Samples: 941140976. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:24,300][25689] Avg episode reward: [(0, '-0.851')] [2022-07-10 22:31:25,360][26022] Updated weights on worker 0-0, policy_version 919086 (0.00083) [2022-07-10 22:31:27,447][26022] Updated weights on worker 0-0, policy_version 919096 (0.00085) [2022-07-10 22:31:28,916][26022] Updated weights on worker 0-0, policy_version 919106 (0.00101) [2022-07-10 22:31:29,309][25689] Fps is (10 sec: 5572.6, 60 sec: 5541.6, 300 sec: 5543.8). Total num frames: 941165568. Throughput: 0: 5826.1. Samples: 941174316. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:29,315][25689] Avg episode reward: [(0, '0.288')] [2022-07-10 22:31:31,034][26022] Updated weights on worker 0-0, policy_version 919116 (0.00084) [2022-07-10 22:31:32,919][26022] Updated weights on worker 0-0, policy_version 919126 (0.00092) [2022-07-10 22:31:34,334][25689] Fps is (10 sec: 5612.6, 60 sec: 5523.0, 300 sec: 5536.6). Total num frames: 941192192. Throughput: 0: 5821.2. Samples: 941190956. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:34,335][25689] Avg episode reward: [(0, '0.497')] [2022-07-10 22:31:34,632][26022] Updated weights on worker 0-0, policy_version 919136 (0.00090) [2022-07-10 22:31:36,728][26022] Updated weights on worker 0-0, policy_version 919146 (0.00097) [2022-07-10 22:31:38,209][26022] Updated weights on worker 0-0, policy_version 919156 (0.00089) [2022-07-10 22:31:39,357][25689] Fps is (10 sec: 5503.3, 60 sec: 5538.5, 300 sec: 5537.4). Total num frames: 941220864. Throughput: 0: 5806.8. Samples: 941224468. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:39,358][25689] Avg episode reward: [(0, '0.196')] [2022-07-10 22:31:40,275][26022] Updated weights on worker 0-0, policy_version 919166 (0.00084) [2022-07-10 22:31:41,963][26022] Updated weights on worker 0-0, policy_version 919176 (0.00083) [2022-07-10 22:31:44,020][26022] Updated weights on worker 0-0, policy_version 919186 (0.00089) [2022-07-10 22:31:44,397][25689] Fps is (10 sec: 5596.7, 60 sec: 5505.7, 300 sec: 5537.5). Total num frames: 941248512. Throughput: 0: 5814.5. Samples: 941257834. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:44,398][25689] Avg episode reward: [(0, '0.446')] [2022-07-10 22:31:45,773][26022] Updated weights on worker 0-0, policy_version 919196 (0.00091) [2022-07-10 22:31:47,569][26022] Updated weights on worker 0-0, policy_version 919206 (0.00092) [2022-07-10 22:31:49,370][26022] Updated weights on worker 0-0, policy_version 919216 (0.00056) [2022-07-10 22:31:49,416][25689] Fps is (10 sec: 5598.8, 60 sec: 5542.7, 300 sec: 5541.1). Total num frames: 941277184. Throughput: 0: 4987.8. Samples: 941274606. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:49,416][25689] Avg episode reward: [(0, '0.690')] [2022-07-10 22:31:51,286][26022] Updated weights on worker 0-0, policy_version 919226 (0.00086) [2022-07-10 22:31:53,149][26022] Updated weights on worker 0-0, policy_version 919236 (0.00084) [2022-07-10 22:31:54,452][25689] Fps is (10 sec: 5601.4, 60 sec: 5540.4, 300 sec: 5537.4). Total num frames: 941304832. Throughput: 0: 5840.3. Samples: 941308450. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:54,452][25689] Avg episode reward: [(0, '0.300')] [2022-07-10 22:31:54,837][26022] Updated weights on worker 0-0, policy_version 919246 (0.00087) [2022-07-10 22:31:56,741][26022] Updated weights on worker 0-0, policy_version 919256 (0.00085) [2022-07-10 22:31:58,484][26022] Updated weights on worker 0-0, policy_version 919266 (0.00089) [2022-07-10 22:31:59,456][25689] Fps is (10 sec: 5609.7, 60 sec: 5558.4, 300 sec: 5545.8). Total num frames: 941333504. Throughput: 0: 5850.1. Samples: 941342050. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:31:59,456][25689] Avg episode reward: [(0, '0.480')] [2022-07-10 22:32:00,579][26022] Updated weights on worker 0-0, policy_version 919276 (0.00088) [2022-07-10 22:32:02,467][26022] Updated weights on worker 0-0, policy_version 919286 (0.00085) [2022-07-10 22:32:04,443][26022] Updated weights on worker 0-0, policy_version 919296 (0.00090) [2022-07-10 22:32:04,514][25689] Fps is (10 sec: 5393.6, 60 sec: 5542.1, 300 sec: 5534.4). Total num frames: 941359104. Throughput: 0: 4901.9. Samples: 941356444. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:04,515][25689] Avg episode reward: [(0, '0.852')] [2022-07-10 22:32:06,130][26022] Updated weights on worker 0-0, policy_version 919306 (0.00088) [2022-07-10 22:32:08,076][26022] Updated weights on worker 0-0, policy_version 919316 (0.00089) [2022-07-10 22:32:09,551][25689] Fps is (10 sec: 5376.1, 60 sec: 5560.0, 300 sec: 5540.7). Total num frames: 941387776. Throughput: 0: 5733.7. Samples: 941390052. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:09,551][25689] Avg episode reward: [(0, '0.032')] [2022-07-10 22:32:10,005][26022] Updated weights on worker 0-0, policy_version 919326 (0.00092) [2022-07-10 22:32:11,809][26022] Updated weights on worker 0-0, policy_version 919336 (0.00086) [2022-07-10 22:32:13,696][26022] Updated weights on worker 0-0, policy_version 919346 (0.00085) [2022-07-10 22:32:14,572][25689] Fps is (10 sec: 5599.8, 60 sec: 5546.4, 300 sec: 5533.7). Total num frames: 941415424. Throughput: 0: 5718.3. Samples: 941423500. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:14,572][25689] Avg episode reward: [(0, '-0.195')] [2022-07-10 22:32:15,446][26022] Updated weights on worker 0-0, policy_version 919356 (0.00085) [2022-07-10 22:32:17,226][26022] Updated weights on worker 0-0, policy_version 919366 (0.00092) [2022-07-10 22:32:19,166][26022] Updated weights on worker 0-0, policy_version 919376 (0.00086) [2022-07-10 22:32:19,596][25689] Fps is (10 sec: 5504.9, 60 sec: 5527.5, 300 sec: 5535.2). Total num frames: 941443072. Throughput: 0: 4880.2. Samples: 941440336. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:19,596][25689] Avg episode reward: [(0, '-0.575')] [2022-07-10 22:32:20,982][26022] Updated weights on worker 0-0, policy_version 919386 (0.00093) [2022-07-10 22:32:22,913][26022] Updated weights on worker 0-0, policy_version 919396 (0.00107) [2022-07-10 22:32:24,655][26022] Updated weights on worker 0-0, policy_version 919406 (0.00080) [2022-07-10 22:32:24,660][25689] Fps is (10 sec: 5582.5, 60 sec: 5564.4, 300 sec: 5537.7). Total num frames: 941471744. Throughput: 0: 5833.4. Samples: 941473964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:24,661][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 22:32:26,488][26022] Updated weights on worker 0-0, policy_version 919416 (0.00086) [2022-07-10 22:32:28,355][26022] Updated weights on worker 0-0, policy_version 919426 (0.00091) [2022-07-10 22:32:29,710][25689] Fps is (10 sec: 5466.9, 60 sec: 5509.8, 300 sec: 5533.7). Total num frames: 941498368. Throughput: 0: 5824.6. Samples: 941507474. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:29,712][25689] Avg episode reward: [(0, '-1.166')] [2022-07-10 22:32:30,028][26022] Updated weights on worker 0-0, policy_version 919436 (0.00088) [2022-07-10 22:32:32,123][26022] Updated weights on worker 0-0, policy_version 919446 (0.00088) [2022-07-10 22:32:33,756][26022] Updated weights on worker 0-0, policy_version 919456 (0.00089) [2022-07-10 22:32:34,720][25689] Fps is (10 sec: 5598.7, 60 sec: 5562.1, 300 sec: 5538.0). Total num frames: 941528064. Throughput: 0: 4992.1. Samples: 941524080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:34,721][25689] Avg episode reward: [(0, '-0.480')] [2022-07-10 22:32:35,808][26022] Updated weights on worker 0-0, policy_version 919466 (0.00086) [2022-07-10 22:32:37,359][26022] Updated weights on worker 0-0, policy_version 919476 (0.00089) [2022-07-10 22:32:39,439][26022] Updated weights on worker 0-0, policy_version 919486 (0.00094) [2022-07-10 22:32:39,739][25689] Fps is (10 sec: 5718.2, 60 sec: 5545.5, 300 sec: 5540.3). Total num frames: 941555712. Throughput: 0: 5839.4. Samples: 941557958. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:39,739][25689] Avg episode reward: [(0, '-0.265')] [2022-07-10 22:32:41,034][26022] Updated weights on worker 0-0, policy_version 919496 (0.00088) [2022-07-10 22:32:42,996][26022] Updated weights on worker 0-0, policy_version 919506 (0.00093) [2022-07-10 22:32:44,824][25689] Fps is (10 sec: 5573.6, 60 sec: 5558.3, 300 sec: 5539.0). Total num frames: 941584384. Throughput: 0: 5841.2. Samples: 941591746. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:44,825][25689] Avg episode reward: [(0, '-0.479')] [2022-07-10 22:32:44,832][26022] Updated weights on worker 0-0, policy_version 919516 (0.00092) [2022-07-10 22:32:46,643][26022] Updated weights on worker 0-0, policy_version 919526 (0.00099) [2022-07-10 22:32:48,426][26022] Updated weights on worker 0-0, policy_version 919536 (0.00079) [2022-07-10 22:32:49,845][25689] Fps is (10 sec: 5572.7, 60 sec: 5541.2, 300 sec: 5532.8). Total num frames: 941612032. Throughput: 0: 5016.1. Samples: 941608472. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:49,847][25689] Avg episode reward: [(0, '-0.276')] [2022-07-10 22:32:50,203][26022] Updated weights on worker 0-0, policy_version 919546 (0.00084) [2022-07-10 22:32:51,995][26022] Updated weights on worker 0-0, policy_version 919556 (0.00086) [2022-07-10 22:32:54,054][26022] Updated weights on worker 0-0, policy_version 919566 (0.00089) [2022-07-10 22:32:54,860][25689] Fps is (10 sec: 5612.2, 60 sec: 5560.1, 300 sec: 5544.0). Total num frames: 941640704. Throughput: 0: 5868.2. Samples: 941642266. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:54,860][25689] Avg episode reward: [(0, '0.043')] [2022-07-10 22:32:55,501][26022] Updated weights on worker 0-0, policy_version 919576 (0.00088) [2022-07-10 22:32:57,674][26022] Updated weights on worker 0-0, policy_version 919586 (0.00093) [2022-07-10 22:32:59,141][26022] Updated weights on worker 0-0, policy_version 919596 (0.00096) [2022-07-10 22:32:59,868][25689] Fps is (10 sec: 5618.9, 60 sec: 5542.7, 300 sec: 5545.2). Total num frames: 941668352. Throughput: 0: 5848.0. Samples: 941675676. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:32:59,869][25689] Avg episode reward: [(0, '0.514')] [2022-07-10 22:33:01,701][26022] Updated weights on worker 0-0, policy_version 919606 (0.00086) [2022-07-10 22:33:03,582][26022] Updated weights on worker 0-0, policy_version 919616 (0.00089) [2022-07-10 22:33:04,959][25689] Fps is (10 sec: 5272.5, 60 sec: 5539.7, 300 sec: 5537.2). Total num frames: 941693952. Throughput: 0: 4878.5. Samples: 941689972. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:04,959][25689] Avg episode reward: [(0, '0.239')] [2022-07-10 22:33:05,443][26022] Updated weights on worker 0-0, policy_version 919626 (0.00094) [2022-07-10 22:33:07,319][26022] Updated weights on worker 0-0, policy_version 919636 (0.00090) [2022-07-10 22:33:09,017][26022] Updated weights on worker 0-0, policy_version 919646 (0.00086) [2022-07-10 22:33:09,970][25689] Fps is (10 sec: 5271.1, 60 sec: 5525.1, 300 sec: 5537.6). Total num frames: 941721600. Throughput: 0: 5716.0. Samples: 941723508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:09,971][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 22:33:10,966][26022] Updated weights on worker 0-0, policy_version 919656 (0.00098) [2022-07-10 22:33:11,577][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:33:11,592][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000919660_941731840.pth [2022-07-10 22:33:11,593][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000917710_939735040.pth [2022-07-10 22:33:12,829][26022] Updated weights on worker 0-0, policy_version 919666 (0.00084) [2022-07-10 22:33:14,650][26022] Updated weights on worker 0-0, policy_version 919676 (0.00093) [2022-07-10 22:33:14,974][25689] Fps is (10 sec: 5623.6, 60 sec: 5543.6, 300 sec: 5538.2). Total num frames: 941750272. Throughput: 0: 5710.8. Samples: 941757134. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:14,974][25689] Avg episode reward: [(0, '0.154')] [2022-07-10 22:33:16,509][26022] Updated weights on worker 0-0, policy_version 919686 (0.00086) [2022-07-10 22:33:18,202][26022] Updated weights on worker 0-0, policy_version 919696 (0.00086) [2022-07-10 22:33:19,987][25689] Fps is (10 sec: 5520.4, 60 sec: 5527.7, 300 sec: 5533.8). Total num frames: 941776896. Throughput: 0: 4885.4. Samples: 941773962. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:19,987][25689] Avg episode reward: [(0, '-0.106')] [2022-07-10 22:33:20,092][26022] Updated weights on worker 0-0, policy_version 919706 (0.00091) [2022-07-10 22:33:21,906][26022] Updated weights on worker 0-0, policy_version 919716 (0.00093) [2022-07-10 22:33:23,867][26022] Updated weights on worker 0-0, policy_version 919726 (0.00089) [2022-07-10 22:33:25,029][25689] Fps is (10 sec: 5601.3, 60 sec: 5546.7, 300 sec: 5537.7). Total num frames: 941806592. Throughput: 0: 5857.5. Samples: 941807530. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:25,029][25689] Avg episode reward: [(0, '-0.060')] [2022-07-10 22:33:25,433][26022] Updated weights on worker 0-0, policy_version 919736 (0.00086) [2022-07-10 22:33:27,427][26022] Updated weights on worker 0-0, policy_version 919746 (0.00086) [2022-07-10 22:33:29,227][26022] Updated weights on worker 0-0, policy_version 919756 (0.00084) [2022-07-10 22:33:30,033][25689] Fps is (10 sec: 5605.9, 60 sec: 5550.9, 300 sec: 5535.7). Total num frames: 941833216. Throughput: 0: 5848.2. Samples: 941840842. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:30,034][25689] Avg episode reward: [(0, '-0.189')] [2022-07-10 22:33:31,163][26022] Updated weights on worker 0-0, policy_version 919766 (0.00093) [2022-07-10 22:33:32,890][26022] Updated weights on worker 0-0, policy_version 919776 (0.00090) [2022-07-10 22:33:34,927][26022] Updated weights on worker 0-0, policy_version 919786 (0.00094) [2022-07-10 22:33:35,100][25689] Fps is (10 sec: 5490.4, 60 sec: 5528.7, 300 sec: 5534.5). Total num frames: 941861888. Throughput: 0: 4988.0. Samples: 941857524. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:35,101][25689] Avg episode reward: [(0, '0.186')] [2022-07-10 22:33:36,642][26022] Updated weights on worker 0-0, policy_version 919796 (0.00085) [2022-07-10 22:33:38,579][26022] Updated weights on worker 0-0, policy_version 919806 (0.00086) [2022-07-10 22:33:40,128][25689] Fps is (10 sec: 5579.2, 60 sec: 5527.9, 300 sec: 5536.6). Total num frames: 941889536. Throughput: 0: 5833.6. Samples: 941891458. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:40,130][25689] Avg episode reward: [(0, '0.132')] [2022-07-10 22:33:40,234][26022] Updated weights on worker 0-0, policy_version 919816 (0.00088) [2022-07-10 22:33:42,060][26022] Updated weights on worker 0-0, policy_version 919826 (0.00086) [2022-07-10 22:33:43,770][26022] Updated weights on worker 0-0, policy_version 919836 (0.00087) [2022-07-10 22:33:45,242][25689] Fps is (10 sec: 5654.3, 60 sec: 5542.2, 300 sec: 5538.3). Total num frames: 941919232. Throughput: 0: 5823.9. Samples: 941925250. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:45,242][25689] Avg episode reward: [(0, '0.278')] [2022-07-10 22:33:45,757][26022] Updated weights on worker 0-0, policy_version 919846 (0.00112) [2022-07-10 22:33:47,499][26022] Updated weights on worker 0-0, policy_version 919856 (0.00098) [2022-07-10 22:33:49,455][26022] Updated weights on worker 0-0, policy_version 919866 (0.00087) [2022-07-10 22:33:50,245][25689] Fps is (10 sec: 5870.4, 60 sec: 5577.7, 300 sec: 5542.5). Total num frames: 941948928. Throughput: 0: 5012.4. Samples: 941942152. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-10 22:33:50,246][25689] Avg episode reward: [(0, '0.522')] [2022-07-10 22:33:51,301][26022] Updated weights on worker 0-0, policy_version 919876 (0.00248) [2022-07-10 22:33:52,952][26022] Updated weights on worker 0-0, policy_version 919886 (0.00084) [2022-07-10 22:33:54,917][26022] Updated weights on worker 0-0, policy_version 919896 (0.00090) [2022-07-10 22:33:55,267][25689] Fps is (10 sec: 5515.6, 60 sec: 5526.2, 300 sec: 5539.1). Total num frames: 941974528. Throughput: 0: 5853.0. Samples: 941975562. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:33:55,269][25689] Avg episode reward: [(0, '0.798')] [2022-07-10 22:33:56,460][26022] Updated weights on worker 0-0, policy_version 919906 (0.00085) [2022-07-10 22:33:58,527][26022] Updated weights on worker 0-0, policy_version 919916 (0.00090) [2022-07-10 22:34:00,278][25689] Fps is (10 sec: 5409.4, 60 sec: 5542.9, 300 sec: 5543.5). Total num frames: 942003200. Throughput: 0: 5851.3. Samples: 942009362. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:00,280][25689] Avg episode reward: [(0, '-0.021')] [2022-07-10 22:34:00,405][26022] Updated weights on worker 0-0, policy_version 919926 (0.00092) [2022-07-10 22:34:02,559][26022] Updated weights on worker 0-0, policy_version 919936 (0.00086) [2022-07-10 22:34:04,432][26022] Updated weights on worker 0-0, policy_version 919946 (0.00087) [2022-07-10 22:34:05,407][25689] Fps is (10 sec: 5453.5, 60 sec: 5556.4, 300 sec: 5541.4). Total num frames: 942029824. Throughput: 0: 4895.3. Samples: 942023964. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:05,407][25689] Avg episode reward: [(0, '-0.379')] [2022-07-10 22:34:06,180][26022] Updated weights on worker 0-0, policy_version 919956 (0.00094) [2022-07-10 22:34:08,016][26022] Updated weights on worker 0-0, policy_version 919966 (0.00091) [2022-07-10 22:34:09,759][26022] Updated weights on worker 0-0, policy_version 919976 (0.00088) [2022-07-10 22:34:10,447][25689] Fps is (10 sec: 5438.0, 60 sec: 5570.7, 300 sec: 5540.8). Total num frames: 942058496. Throughput: 0: 5712.2. Samples: 942057546. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:10,447][25689] Avg episode reward: [(0, '-0.228')] [2022-07-10 22:34:11,727][26022] Updated weights on worker 0-0, policy_version 919986 (0.00093) [2022-07-10 22:34:13,459][26022] Updated weights on worker 0-0, policy_version 919996 (0.00087) [2022-07-10 22:34:15,347][26022] Updated weights on worker 0-0, policy_version 920006 (0.00088) [2022-07-10 22:34:15,515][25689] Fps is (10 sec: 5672.8, 60 sec: 5564.7, 300 sec: 5540.0). Total num frames: 942087168. Throughput: 0: 5720.6. Samples: 942091394. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:15,516][25689] Avg episode reward: [(0, '-0.690')] [2022-07-10 22:34:17,016][26022] Updated weights on worker 0-0, policy_version 920016 (0.00092) [2022-07-10 22:34:19,131][26022] Updated weights on worker 0-0, policy_version 920026 (0.00085) [2022-07-10 22:34:20,538][25689] Fps is (10 sec: 5581.2, 60 sec: 5580.8, 300 sec: 5537.7). Total num frames: 942114816. Throughput: 0: 4885.1. Samples: 942108332. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:20,538][25689] Avg episode reward: [(0, '-0.553')] [2022-07-10 22:34:20,872][26022] Updated weights on worker 0-0, policy_version 920036 (0.00080) [2022-07-10 22:34:22,711][26022] Updated weights on worker 0-0, policy_version 920046 (0.00084) [2022-07-10 22:34:24,452][26022] Updated weights on worker 0-0, policy_version 920056 (0.00089) [2022-07-10 22:34:25,592][25689] Fps is (10 sec: 5589.3, 60 sec: 5562.7, 300 sec: 5541.5). Total num frames: 942143488. Throughput: 0: 5832.8. Samples: 942141698. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:25,592][25689] Avg episode reward: [(0, '-0.672')] [2022-07-10 22:34:26,331][26022] Updated weights on worker 0-0, policy_version 920066 (0.00084) [2022-07-10 22:34:28,110][26022] Updated weights on worker 0-0, policy_version 920076 (0.00087) [2022-07-10 22:34:30,133][26022] Updated weights on worker 0-0, policy_version 920086 (0.00095) [2022-07-10 22:34:30,608][25689] Fps is (10 sec: 5287.4, 60 sec: 5527.8, 300 sec: 5534.5). Total num frames: 942168064. Throughput: 0: 5832.6. Samples: 942175142. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:30,609][25689] Avg episode reward: [(0, '-0.291')] [2022-07-10 22:34:31,844][26022] Updated weights on worker 0-0, policy_version 920096 (0.00092) [2022-07-10 22:34:33,979][26022] Updated weights on worker 0-0, policy_version 920106 (0.00097) [2022-07-10 22:34:35,459][26022] Updated weights on worker 0-0, policy_version 920116 (0.00091) [2022-07-10 22:34:35,636][25689] Fps is (10 sec: 5505.1, 60 sec: 5565.2, 300 sec: 5538.8). Total num frames: 942198784. Throughput: 0: 4980.9. Samples: 942191614. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:35,637][25689] Avg episode reward: [(0, '0.224')] [2022-07-10 22:34:37,598][26022] Updated weights on worker 0-0, policy_version 920126 (0.00094) [2022-07-10 22:34:39,396][26022] Updated weights on worker 0-0, policy_version 920136 (0.00089) [2022-07-10 22:34:40,643][25689] Fps is (10 sec: 5816.5, 60 sec: 5567.1, 300 sec: 5539.9). Total num frames: 942226432. Throughput: 0: 5801.0. Samples: 942224966. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:40,644][25689] Avg episode reward: [(0, '0.088')] [2022-07-10 22:34:41,107][26022] Updated weights on worker 0-0, policy_version 920146 (0.00091) [2022-07-10 22:34:43,164][26022] Updated weights on worker 0-0, policy_version 920156 (0.00081) [2022-07-10 22:34:44,785][26022] Updated weights on worker 0-0, policy_version 920166 (0.00087) [2022-07-10 22:34:45,698][25689] Fps is (10 sec: 5495.5, 60 sec: 5538.7, 300 sec: 5539.6). Total num frames: 942254080. Throughput: 0: 5795.5. Samples: 942258226. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:45,700][25689] Avg episode reward: [(0, '0.457')] [2022-07-10 22:34:46,783][26022] Updated weights on worker 0-0, policy_version 920176 (0.00087) [2022-07-10 22:34:48,439][26022] Updated weights on worker 0-0, policy_version 920186 (0.00088) [2022-07-10 22:34:50,342][26022] Updated weights on worker 0-0, policy_version 920196 (0.00077) [2022-07-10 22:34:50,745][25689] Fps is (10 sec: 5575.1, 60 sec: 5517.7, 300 sec: 5542.4). Total num frames: 942282752. Throughput: 0: 5791.5. Samples: 942291766. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:50,747][25689] Avg episode reward: [(0, '0.255')] [2022-07-10 22:34:52,454][26022] Updated weights on worker 0-0, policy_version 920206 (0.00082) [2022-07-10 22:34:53,932][26022] Updated weights on worker 0-0, policy_version 920216 (0.00090) [2022-07-10 22:34:55,755][25689] Fps is (10 sec: 5498.3, 60 sec: 5535.8, 300 sec: 5532.5). Total num frames: 942309376. Throughput: 0: 5819.7. Samples: 942308700. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:34:55,756][25689] Avg episode reward: [(0, '0.507')] [2022-07-10 22:34:55,901][26022] Updated weights on worker 0-0, policy_version 920226 (0.00086) [2022-07-10 22:34:57,582][26022] Updated weights on worker 0-0, policy_version 920236 (0.00086) [2022-07-10 22:34:59,460][26022] Updated weights on worker 0-0, policy_version 920246 (0.00092) [2022-07-10 22:35:00,759][25689] Fps is (10 sec: 5624.5, 60 sec: 5553.4, 300 sec: 5550.9). Total num frames: 942339072. Throughput: 0: 5832.4. Samples: 942342288. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:00,759][25689] Avg episode reward: [(0, '-0.466')] [2022-07-10 22:35:01,232][26022] Updated weights on worker 0-0, policy_version 920256 (0.00093) [2022-07-10 22:35:03,740][26022] Updated weights on worker 0-0, policy_version 920266 (0.00092) [2022-07-10 22:35:05,455][26022] Updated weights on worker 0-0, policy_version 920276 (0.00089) [2022-07-10 22:35:05,791][25689] Fps is (10 sec: 5611.8, 60 sec: 5562.2, 300 sec: 5540.3). Total num frames: 942365696. Throughput: 0: 5739.7. Samples: 942373554. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:05,792][25689] Avg episode reward: [(0, '-0.393')] [2022-07-10 22:35:07,353][26022] Updated weights on worker 0-0, policy_version 920286 (0.00088) [2022-07-10 22:35:08,960][26022] Updated weights on worker 0-0, policy_version 920296 (0.00093) [2022-07-10 22:35:10,815][25689] Fps is (10 sec: 5193.4, 60 sec: 5512.9, 300 sec: 5536.6). Total num frames: 942391296. Throughput: 0: 4906.7. Samples: 942390240. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:10,815][25689] Avg episode reward: [(0, '-0.580')] [2022-07-10 22:35:11,074][26022] Updated weights on worker 0-0, policy_version 920306 (0.00090) [2022-07-10 22:35:11,640][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:35:11,660][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000920311_942398464.pth [2022-07-10 22:35:11,660][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000918362_940402688.pth [2022-07-10 22:35:12,677][26022] Updated weights on worker 0-0, policy_version 920316 (0.00085) [2022-07-10 22:35:14,571][26022] Updated weights on worker 0-0, policy_version 920326 (0.00088) [2022-07-10 22:35:15,829][25689] Fps is (10 sec: 5406.8, 60 sec: 5517.8, 300 sec: 5540.2). Total num frames: 942419968. Throughput: 0: 5742.6. Samples: 942423976. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:15,829][25689] Avg episode reward: [(0, '-0.456')] [2022-07-10 22:35:16,225][26022] Updated weights on worker 0-0, policy_version 920336 (0.00082) [2022-07-10 22:35:18,223][26022] Updated weights on worker 0-0, policy_version 920346 (0.00080) [2022-07-10 22:35:19,955][26022] Updated weights on worker 0-0, policy_version 920356 (0.00085) [2022-07-10 22:35:20,844][25689] Fps is (10 sec: 5717.8, 60 sec: 5535.5, 300 sec: 5544.2). Total num frames: 942448640. Throughput: 0: 5755.1. Samples: 942457878. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:20,846][25689] Avg episode reward: [(0, '-0.801')] [2022-07-10 22:35:21,767][26022] Updated weights on worker 0-0, policy_version 920366 (0.00086) [2022-07-10 22:35:23,616][26022] Updated weights on worker 0-0, policy_version 920376 (0.00088) [2022-07-10 22:35:25,371][26022] Updated weights on worker 0-0, policy_version 920386 (0.00089) [2022-07-10 22:35:25,949][25689] Fps is (10 sec: 5666.4, 60 sec: 5530.8, 300 sec: 5542.9). Total num frames: 942477312. Throughput: 0: 5023.1. Samples: 942474808. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:25,949][25689] Avg episode reward: [(0, '-0.476')] [2022-07-10 22:35:27,407][26022] Updated weights on worker 0-0, policy_version 920396 (0.00079) [2022-07-10 22:35:29,206][26022] Updated weights on worker 0-0, policy_version 920406 (0.00093) [2022-07-10 22:35:30,986][25689] Fps is (10 sec: 5553.2, 60 sec: 5579.9, 300 sec: 5542.3). Total num frames: 942504960. Throughput: 0: 5866.9. Samples: 942508582. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:30,986][25689] Avg episode reward: [(0, '0.230')] [2022-07-10 22:35:31,044][26022] Updated weights on worker 0-0, policy_version 920416 (0.00091) [2022-07-10 22:35:32,924][26022] Updated weights on worker 0-0, policy_version 920426 (0.00092) [2022-07-10 22:35:34,583][26022] Updated weights on worker 0-0, policy_version 920436 (0.00084) [2022-07-10 22:35:36,027][25689] Fps is (10 sec: 5486.9, 60 sec: 5527.7, 300 sec: 5541.7). Total num frames: 942532608. Throughput: 0: 5826.0. Samples: 942541650. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:36,027][25689] Avg episode reward: [(0, '-0.305')] [2022-07-10 22:35:36,635][26022] Updated weights on worker 0-0, policy_version 920446 (0.00087) [2022-07-10 22:35:38,245][26022] Updated weights on worker 0-0, policy_version 920456 (0.00089) [2022-07-10 22:35:40,161][26022] Updated weights on worker 0-0, policy_version 920466 (0.00080) [2022-07-10 22:35:41,065][25689] Fps is (10 sec: 5587.8, 60 sec: 5541.9, 300 sec: 5538.5). Total num frames: 942561280. Throughput: 0: 4973.2. Samples: 942558440. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:41,065][25689] Avg episode reward: [(0, '-0.182')] [2022-07-10 22:35:41,931][26022] Updated weights on worker 0-0, policy_version 920476 (0.00080) [2022-07-10 22:35:43,758][26022] Updated weights on worker 0-0, policy_version 920486 (0.00083) [2022-07-10 22:35:45,603][26022] Updated weights on worker 0-0, policy_version 920496 (0.00095) [2022-07-10 22:35:46,157][25689] Fps is (10 sec: 5761.8, 60 sec: 5572.3, 300 sec: 5548.1). Total num frames: 942590976. Throughput: 0: 5802.2. Samples: 942592060. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:46,158][25689] Avg episode reward: [(0, '0.088')] [2022-07-10 22:35:47,470][26022] Updated weights on worker 0-0, policy_version 920506 (0.00083) [2022-07-10 22:35:49,409][26022] Updated weights on worker 0-0, policy_version 920516 (0.00086) [2022-07-10 22:35:51,159][25689] Fps is (10 sec: 5680.9, 60 sec: 5559.6, 300 sec: 5548.3). Total num frames: 942618624. Throughput: 0: 5801.5. Samples: 942625618. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:51,159][25689] Avg episode reward: [(0, '0.170')] [2022-07-10 22:35:51,161][26022] Updated weights on worker 0-0, policy_version 920526 (0.00088) [2022-07-10 22:35:53,114][26022] Updated weights on worker 0-0, policy_version 920536 (0.00082) [2022-07-10 22:35:54,766][26022] Updated weights on worker 0-0, policy_version 920546 (0.00087) [2022-07-10 22:35:56,207][25689] Fps is (10 sec: 5400.0, 60 sec: 5556.0, 300 sec: 5544.2). Total num frames: 942645248. Throughput: 0: 4994.4. Samples: 942642440. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:35:56,208][25689] Avg episode reward: [(0, '0.152')] [2022-07-10 22:35:56,709][26022] Updated weights on worker 0-0, policy_version 920556 (0.00085) [2022-07-10 22:35:58,420][26022] Updated weights on worker 0-0, policy_version 920566 (0.00161) [2022-07-10 22:36:00,312][26022] Updated weights on worker 0-0, policy_version 920576 (0.00095) [2022-07-10 22:36:01,239][25689] Fps is (10 sec: 5485.8, 60 sec: 5536.5, 300 sec: 5551.8). Total num frames: 942673920. Throughput: 0: 5833.6. Samples: 942676128. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:01,242][25689] Avg episode reward: [(0, '0.584')] [2022-07-10 22:36:02,479][26022] Updated weights on worker 0-0, policy_version 920586 (0.00083) [2022-07-10 22:36:04,607][26022] Updated weights on worker 0-0, policy_version 920596 (0.00083) [2022-07-10 22:36:06,148][26022] Updated weights on worker 0-0, policy_version 920606 (0.00089) [2022-07-10 22:36:06,307][25689] Fps is (10 sec: 5576.3, 60 sec: 5550.1, 300 sec: 5551.3). Total num frames: 942701568. Throughput: 0: 5715.4. Samples: 942707226. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:06,309][25689] Avg episode reward: [(0, '1.231')] [2022-07-10 22:36:08,241][26022] Updated weights on worker 0-0, policy_version 920616 (0.00603) [2022-07-10 22:36:09,883][26022] Updated weights on worker 0-0, policy_version 920626 (0.00084) [2022-07-10 22:36:11,391][25689] Fps is (10 sec: 5345.8, 60 sec: 5561.5, 300 sec: 5544.0). Total num frames: 942728192. Throughput: 0: 4865.7. Samples: 942724062. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:11,392][25689] Avg episode reward: [(0, '0.502')] [2022-07-10 22:36:11,804][26022] Updated weights on worker 0-0, policy_version 920636 (0.00095) [2022-07-10 22:36:13,522][26022] Updated weights on worker 0-0, policy_version 920646 (0.00088) [2022-07-10 22:36:15,545][26022] Updated weights on worker 0-0, policy_version 920656 (0.00086) [2022-07-10 22:36:16,413][25689] Fps is (10 sec: 5471.9, 60 sec: 5560.8, 300 sec: 5543.6). Total num frames: 942756864. Throughput: 0: 5704.7. Samples: 942757706. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:16,413][25689] Avg episode reward: [(0, '0.197')] [2022-07-10 22:36:17,177][26022] Updated weights on worker 0-0, policy_version 920666 (0.00089) [2022-07-10 22:36:19,152][26022] Updated weights on worker 0-0, policy_version 920676 (0.00084) [2022-07-10 22:36:20,807][26022] Updated weights on worker 0-0, policy_version 920686 (0.00095) [2022-07-10 22:36:21,431][25689] Fps is (10 sec: 5711.4, 60 sec: 5560.5, 300 sec: 5552.0). Total num frames: 942785536. Throughput: 0: 5712.0. Samples: 942791468. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:21,432][25689] Avg episode reward: [(0, '0.157')] [2022-07-10 22:36:22,766][26022] Updated weights on worker 0-0, policy_version 920696 (0.00087) [2022-07-10 22:36:24,668][26022] Updated weights on worker 0-0, policy_version 920706 (0.00087) [2022-07-10 22:36:26,344][26022] Updated weights on worker 0-0, policy_version 920716 (0.00089) [2022-07-10 22:36:26,511][25689] Fps is (10 sec: 5577.2, 60 sec: 5545.9, 300 sec: 5543.8). Total num frames: 942813184. Throughput: 0: 4996.0. Samples: 942808164. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:26,511][25689] Avg episode reward: [(0, '0.053')] [2022-07-10 22:36:28,341][26022] Updated weights on worker 0-0, policy_version 920726 (0.00085) [2022-07-10 22:36:30,083][26022] Updated weights on worker 0-0, policy_version 920736 (0.00085) [2022-07-10 22:36:31,556][25689] Fps is (10 sec: 5663.9, 60 sec: 5579.0, 300 sec: 5553.7). Total num frames: 942842880. Throughput: 0: 5835.6. Samples: 942841736. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:31,556][25689] Avg episode reward: [(0, '-0.067')] [2022-07-10 22:36:31,753][26022] Updated weights on worker 0-0, policy_version 920746 (0.00086) [2022-07-10 22:36:33,725][26022] Updated weights on worker 0-0, policy_version 920756 (0.00088) [2022-07-10 22:36:35,677][26022] Updated weights on worker 0-0, policy_version 920766 (0.00084) [2022-07-10 22:36:36,578][25689] Fps is (10 sec: 5594.3, 60 sec: 5563.8, 300 sec: 5546.8). Total num frames: 942869504. Throughput: 0: 5831.9. Samples: 942875310. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:36,579][25689] Avg episode reward: [(0, '-0.133')] [2022-07-10 22:36:37,321][26022] Updated weights on worker 0-0, policy_version 920776 (0.00082) [2022-07-10 22:36:39,289][26022] Updated weights on worker 0-0, policy_version 920786 (0.00081) [2022-07-10 22:36:40,958][26022] Updated weights on worker 0-0, policy_version 920796 (0.00090) [2022-07-10 22:36:41,587][25689] Fps is (10 sec: 5512.4, 60 sec: 5566.5, 300 sec: 5550.9). Total num frames: 942898176. Throughput: 0: 5000.3. Samples: 942892252. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:41,588][25689] Avg episode reward: [(0, '0.417')] [2022-07-10 22:36:42,878][26022] Updated weights on worker 0-0, policy_version 920806 (0.00091) [2022-07-10 22:36:44,553][26022] Updated weights on worker 0-0, policy_version 920816 (0.00089) [2022-07-10 22:36:46,496][26022] Updated weights on worker 0-0, policy_version 920826 (0.00092) [2022-07-10 22:36:46,711][25689] Fps is (10 sec: 5558.2, 60 sec: 5529.7, 300 sec: 5545.5). Total num frames: 942925824. Throughput: 0: 5839.8. Samples: 942926128. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:46,711][25689] Avg episode reward: [(0, '1.002')] [2022-07-10 22:36:48,191][26022] Updated weights on worker 0-0, policy_version 920836 (0.00609) [2022-07-10 22:36:50,148][26022] Updated weights on worker 0-0, policy_version 920846 (0.00093) [2022-07-10 22:36:51,762][25689] Fps is (10 sec: 5635.8, 60 sec: 5559.1, 300 sec: 5552.1). Total num frames: 942955520. Throughput: 0: 5853.5. Samples: 942960012. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:51,762][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 22:36:51,925][26022] Updated weights on worker 0-0, policy_version 920856 (0.00095) [2022-07-10 22:36:53,752][26022] Updated weights on worker 0-0, policy_version 920866 (0.00094) [2022-07-10 22:36:55,542][26022] Updated weights on worker 0-0, policy_version 920876 (0.00088) [2022-07-10 22:36:56,768][25689] Fps is (10 sec: 5701.7, 60 sec: 5579.8, 300 sec: 5548.6). Total num frames: 942983168. Throughput: 0: 5031.6. Samples: 942976898. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:36:56,769][25689] Avg episode reward: [(0, '-0.630')] [2022-07-10 22:36:57,427][26022] Updated weights on worker 0-0, policy_version 920886 (0.00081) [2022-07-10 22:36:59,279][26022] Updated weights on worker 0-0, policy_version 920896 (0.00088) [2022-07-10 22:37:01,155][26022] Updated weights on worker 0-0, policy_version 920906 (0.00089) [2022-07-10 22:37:01,793][25689] Fps is (10 sec: 5308.5, 60 sec: 5529.8, 300 sec: 5549.2). Total num frames: 943008768. Throughput: 0: 5850.0. Samples: 943010456. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:01,793][25689] Avg episode reward: [(0, '-0.854')] [2022-07-10 22:37:03,239][26022] Updated weights on worker 0-0, policy_version 920916 (0.00086) [2022-07-10 22:37:05,183][26022] Updated weights on worker 0-0, policy_version 920926 (0.00084) [2022-07-10 22:37:06,927][25689] Fps is (10 sec: 5342.6, 60 sec: 5540.6, 300 sec: 5547.4). Total num frames: 943037440. Throughput: 0: 5714.3. Samples: 943041648. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:06,927][25689] Avg episode reward: [(0, '-0.741')] [2022-07-10 22:37:07,036][26022] Updated weights on worker 0-0, policy_version 920936 (0.00081) [2022-07-10 22:37:08,739][26022] Updated weights on worker 0-0, policy_version 920946 (0.00088) [2022-07-10 22:37:10,699][26022] Updated weights on worker 0-0, policy_version 920956 (0.00080) [2022-07-10 22:37:11,933][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:37:11,934][25689] Fps is (10 sec: 5553.3, 60 sec: 5564.5, 300 sec: 5547.6). Total num frames: 943065088. Throughput: 0: 5726.9. Samples: 943075538. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:11,935][25689] Avg episode reward: [(0, '-0.791')] [2022-07-10 22:37:11,944][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000920963_943066112.pth [2022-07-10 22:37:11,945][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000919010_941066240.pth [2022-07-10 22:37:12,425][26022] Updated weights on worker 0-0, policy_version 920966 (0.00086) [2022-07-10 22:37:14,093][26022] Updated weights on worker 0-0, policy_version 920976 (0.00085) [2022-07-10 22:37:16,241][26022] Updated weights on worker 0-0, policy_version 920986 (0.00086) [2022-07-10 22:37:16,960][25689] Fps is (10 sec: 5817.3, 60 sec: 5598.0, 300 sec: 5557.9). Total num frames: 943095808. Throughput: 0: 5719.2. Samples: 943092380. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:16,961][25689] Avg episode reward: [(0, '-1.691')] [2022-07-10 22:37:17,635][26022] Updated weights on worker 0-0, policy_version 920996 (0.00089) [2022-07-10 22:37:19,911][26022] Updated weights on worker 0-0, policy_version 921006 (0.00097) [2022-07-10 22:37:21,718][26022] Updated weights on worker 0-0, policy_version 921016 (0.00084) [2022-07-10 22:37:21,982][25689] Fps is (10 sec: 5503.2, 60 sec: 5530.0, 300 sec: 5545.0). Total num frames: 943120384. Throughput: 0: 5717.6. Samples: 943125892. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:21,984][25689] Avg episode reward: [(0, '-2.517')] [2022-07-10 22:37:23,330][26022] Updated weights on worker 0-0, policy_version 921026 (0.00090) [2022-07-10 22:37:25,482][26022] Updated weights on worker 0-0, policy_version 921036 (0.00053) [2022-07-10 22:37:27,010][26022] Updated weights on worker 0-0, policy_version 921046 (0.00084) [2022-07-10 22:37:27,055][25689] Fps is (10 sec: 5478.0, 60 sec: 5581.4, 300 sec: 5558.3). Total num frames: 943151104. Throughput: 0: 5849.6. Samples: 943159388. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:27,055][25689] Avg episode reward: [(0, '-2.142')] [2022-07-10 22:37:28,987][26022] Updated weights on worker 0-0, policy_version 921056 (0.00088) [2022-07-10 22:37:30,792][26022] Updated weights on worker 0-0, policy_version 921066 (0.00092) [2022-07-10 22:37:32,143][25689] Fps is (10 sec: 5845.1, 60 sec: 5560.5, 300 sec: 5553.4). Total num frames: 943179776. Throughput: 0: 4991.1. Samples: 943176402. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:32,144][25689] Avg episode reward: [(0, '-2.093')] [2022-07-10 22:37:32,654][26022] Updated weights on worker 0-0, policy_version 921076 (0.00088) [2022-07-10 22:37:34,462][26022] Updated weights on worker 0-0, policy_version 921086 (0.00087) [2022-07-10 22:37:36,338][26022] Updated weights on worker 0-0, policy_version 921096 (0.00085) [2022-07-10 22:37:37,210][25689] Fps is (10 sec: 5445.0, 60 sec: 5556.4, 300 sec: 5549.1). Total num frames: 943206400. Throughput: 0: 5794.9. Samples: 943209724. Policy #0 lag: (min: 0.0, avg: 7.8, max: 17.0) [2022-07-10 22:37:37,210][25689] Avg episode reward: [(0, '-2.022')] [2022-07-10 22:37:38,096][26022] Updated weights on worker 0-0, policy_version 921106 (0.00088) [2022-07-10 22:37:40,129][26022] Updated weights on worker 0-0, policy_version 921116 (0.00087) [2022-07-10 22:37:41,747][26022] Updated weights on worker 0-0, policy_version 921126 (0.00089) [2022-07-10 22:37:42,218][25689] Fps is (10 sec: 5590.5, 60 sec: 5573.4, 300 sec: 5554.0). Total num frames: 943236096. Throughput: 0: 5803.9. Samples: 943243336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:37:42,218][25689] Avg episode reward: [(0, '-1.104')] [2022-07-10 22:37:43,671][26022] Updated weights on worker 0-0, policy_version 921136 (0.00096) [2022-07-10 22:37:45,476][26022] Updated weights on worker 0-0, policy_version 921146 (0.00072) [2022-07-10 22:37:47,310][25689] Fps is (10 sec: 5677.3, 60 sec: 5576.3, 300 sec: 5552.6). Total num frames: 943263744. Throughput: 0: 4975.2. Samples: 943260166. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:37:47,311][25689] Avg episode reward: [(0, '-1.108')] [2022-07-10 22:37:47,311][26022] Updated weights on worker 0-0, policy_version 921156 (0.00089) [2022-07-10 22:37:49,066][26022] Updated weights on worker 0-0, policy_version 921166 (0.00085) [2022-07-10 22:37:50,875][26022] Updated weights on worker 0-0, policy_version 921176 (0.00093) [2022-07-10 22:37:52,313][25689] Fps is (10 sec: 5477.2, 60 sec: 5546.9, 300 sec: 5549.4). Total num frames: 943291392. Throughput: 0: 5818.1. Samples: 943293752. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:37:52,314][25689] Avg episode reward: [(0, '-0.365')] [2022-07-10 22:37:52,853][26022] Updated weights on worker 0-0, policy_version 921186 (0.00081) [2022-07-10 22:37:54,644][26022] Updated weights on worker 0-0, policy_version 921196 (0.00091) [2022-07-10 22:37:56,399][26022] Updated weights on worker 0-0, policy_version 921206 (0.00083) [2022-07-10 22:37:57,397][25689] Fps is (10 sec: 5583.7, 60 sec: 5556.7, 300 sec: 5551.4). Total num frames: 943320064. Throughput: 0: 5834.8. Samples: 943327512. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:37:57,397][25689] Avg episode reward: [(0, '-1.270')] [2022-07-10 22:37:58,303][26022] Updated weights on worker 0-0, policy_version 921216 (0.00085) [2022-07-10 22:37:59,973][26022] Updated weights on worker 0-0, policy_version 921226 (0.00083) [2022-07-10 22:38:02,024][26022] Updated weights on worker 0-0, policy_version 921236 (0.00089) [2022-07-10 22:38:02,487][25689] Fps is (10 sec: 5435.3, 60 sec: 5567.6, 300 sec: 5554.9). Total num frames: 943346688. Throughput: 0: 4980.7. Samples: 943344290. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:02,487][25689] Avg episode reward: [(0, '-1.996')] [2022-07-10 22:38:04,149][26022] Updated weights on worker 0-0, policy_version 921246 (0.00087) [2022-07-10 22:38:06,087][26022] Updated weights on worker 0-0, policy_version 921256 (0.00093) [2022-07-10 22:38:07,623][25689] Fps is (10 sec: 5307.3, 60 sec: 5550.5, 300 sec: 5552.6). Total num frames: 943374336. Throughput: 0: 5672.8. Samples: 943375394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:07,623][25689] Avg episode reward: [(0, '-1.300')] [2022-07-10 22:38:07,972][26022] Updated weights on worker 0-0, policy_version 921266 (0.00089) [2022-07-10 22:38:09,815][26022] Updated weights on worker 0-0, policy_version 921276 (0.00093) [2022-07-10 22:38:11,565][26022] Updated weights on worker 0-0, policy_version 921286 (0.00089) [2022-07-10 22:38:12,718][25689] Fps is (10 sec: 5504.5, 60 sec: 5559.4, 300 sec: 5550.9). Total num frames: 943403008. Throughput: 0: 5646.1. Samples: 943408962. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:12,719][25689] Avg episode reward: [(0, '-0.460')] [2022-07-10 22:38:13,533][26022] Updated weights on worker 0-0, policy_version 921296 (0.00088) [2022-07-10 22:38:15,352][26022] Updated weights on worker 0-0, policy_version 921306 (0.00063) [2022-07-10 22:38:17,090][26022] Updated weights on worker 0-0, policy_version 921316 (0.00102) [2022-07-10 22:38:17,733][25689] Fps is (10 sec: 5570.4, 60 sec: 5509.8, 300 sec: 5554.3). Total num frames: 943430656. Throughput: 0: 4830.5. Samples: 943425748. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:17,734][25689] Avg episode reward: [(0, '-0.189')] [2022-07-10 22:38:18,895][26022] Updated weights on worker 0-0, policy_version 921326 (0.00086) [2022-07-10 22:38:20,801][26022] Updated weights on worker 0-0, policy_version 921336 (0.00088) [2022-07-10 22:38:22,554][26022] Updated weights on worker 0-0, policy_version 921346 (0.00082) [2022-07-10 22:38:22,808][25689] Fps is (10 sec: 5581.9, 60 sec: 5572.4, 300 sec: 5550.2). Total num frames: 943459328. Throughput: 0: 5660.6. Samples: 943459320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:22,809][25689] Avg episode reward: [(0, '-0.337')] [2022-07-10 22:38:24,683][26022] Updated weights on worker 0-0, policy_version 921356 (0.00096) [2022-07-10 22:38:26,221][26022] Updated weights on worker 0-0, policy_version 921366 (0.00086) [2022-07-10 22:38:27,868][25689] Fps is (10 sec: 5557.1, 60 sec: 5523.0, 300 sec: 5552.6). Total num frames: 943486976. Throughput: 0: 5787.6. Samples: 943492566. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:27,869][25689] Avg episode reward: [(0, '0.193')] [2022-07-10 22:38:28,129][26022] Updated weights on worker 0-0, policy_version 921376 (0.00087) [2022-07-10 22:38:30,024][26022] Updated weights on worker 0-0, policy_version 921386 (0.00086) [2022-07-10 22:38:31,563][26022] Updated weights on worker 0-0, policy_version 921396 (0.00084) [2022-07-10 22:38:32,913][25689] Fps is (10 sec: 5472.1, 60 sec: 5510.1, 300 sec: 5549.6). Total num frames: 943514624. Throughput: 0: 4964.5. Samples: 943509220. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:32,914][25689] Avg episode reward: [(0, '0.092')] [2022-07-10 22:38:33,815][26022] Updated weights on worker 0-0, policy_version 921406 (0.00085) [2022-07-10 22:38:35,520][26022] Updated weights on worker 0-0, policy_version 921416 (0.00089) [2022-07-10 22:38:37,478][26022] Updated weights on worker 0-0, policy_version 921426 (0.00094) [2022-07-10 22:38:37,962][25689] Fps is (10 sec: 5579.8, 60 sec: 5545.5, 300 sec: 5552.6). Total num frames: 943543296. Throughput: 0: 5767.2. Samples: 943542408. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:37,963][25689] Avg episode reward: [(0, '0.095')] [2022-07-10 22:38:39,271][26022] Updated weights on worker 0-0, policy_version 921436 (0.00096) [2022-07-10 22:38:41,009][26022] Updated weights on worker 0-0, policy_version 921446 (0.00091) [2022-07-10 22:38:43,027][25689] Fps is (10 sec: 5467.2, 60 sec: 5489.7, 300 sec: 5543.2). Total num frames: 943569920. Throughput: 0: 5775.3. Samples: 943576092. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:43,028][25689] Avg episode reward: [(0, '-0.914')] [2022-07-10 22:38:43,051][26022] Updated weights on worker 0-0, policy_version 921456 (0.00093) [2022-07-10 22:38:44,571][26022] Updated weights on worker 0-0, policy_version 921466 (0.00093) [2022-07-10 22:38:46,510][26022] Updated weights on worker 0-0, policy_version 921476 (0.00090) [2022-07-10 22:38:48,119][25689] Fps is (10 sec: 5544.5, 60 sec: 5523.4, 300 sec: 5541.6). Total num frames: 943599616. Throughput: 0: 4955.7. Samples: 943592926. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:48,120][25689] Avg episode reward: [(0, '-0.749')] [2022-07-10 22:38:48,296][26022] Updated weights on worker 0-0, policy_version 921486 (0.00094) [2022-07-10 22:38:50,215][26022] Updated weights on worker 0-0, policy_version 921496 (0.00080) [2022-07-10 22:38:52,101][26022] Updated weights on worker 0-0, policy_version 921506 (0.00082) [2022-07-10 22:38:53,158][25689] Fps is (10 sec: 5660.6, 60 sec: 5520.2, 300 sec: 5548.1). Total num frames: 943627264. Throughput: 0: 5773.5. Samples: 943626100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:53,158][25689] Avg episode reward: [(0, '-0.807')] [2022-07-10 22:38:53,939][26022] Updated weights on worker 0-0, policy_version 921516 (0.00094) [2022-07-10 22:38:55,698][26022] Updated weights on worker 0-0, policy_version 921526 (0.00090) [2022-07-10 22:38:57,639][26022] Updated weights on worker 0-0, policy_version 921536 (0.00096) [2022-07-10 22:38:58,182][25689] Fps is (10 sec: 5597.3, 60 sec: 5525.6, 300 sec: 5547.9). Total num frames: 943655936. Throughput: 0: 5800.3. Samples: 943659690. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:38:58,182][25689] Avg episode reward: [(0, '-1.151')] [2022-07-10 22:38:59,530][26022] Updated weights on worker 0-0, policy_version 921546 (0.00088) [2022-07-10 22:39:01,222][26022] Updated weights on worker 0-0, policy_version 921556 (0.00088) [2022-07-10 22:39:03,202][25689] Fps is (10 sec: 5403.4, 60 sec: 5515.1, 300 sec: 5546.5). Total num frames: 943681536. Throughput: 0: 5697.3. Samples: 943691032. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:03,204][25689] Avg episode reward: [(0, '-1.388')] [2022-07-10 22:39:03,496][26022] Updated weights on worker 0-0, policy_version 921566 (0.00089) [2022-07-10 22:39:05,405][26022] Updated weights on worker 0-0, policy_version 921576 (0.00086) [2022-07-10 22:39:07,349][26022] Updated weights on worker 0-0, policy_version 921586 (0.00083) [2022-07-10 22:39:08,301][25689] Fps is (10 sec: 5261.8, 60 sec: 5518.4, 300 sec: 5541.9). Total num frames: 943709184. Throughput: 0: 5693.7. Samples: 943707836. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:08,302][25689] Avg episode reward: [(0, '-1.302')] [2022-07-10 22:39:09,095][26022] Updated weights on worker 0-0, policy_version 921596 (0.00085) [2022-07-10 22:39:11,046][26022] Updated weights on worker 0-0, policy_version 921606 (0.00085) [2022-07-10 22:39:12,128][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:39:12,145][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000921612_943730688.pth [2022-07-10 22:39:12,146][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000919660_941731840.pth [2022-07-10 22:39:12,683][26022] Updated weights on worker 0-0, policy_version 921616 (0.00085) [2022-07-10 22:39:13,309][25689] Fps is (10 sec: 5572.3, 60 sec: 5526.4, 300 sec: 5543.1). Total num frames: 943737856. Throughput: 0: 5699.4. Samples: 943740950. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:13,311][25689] Avg episode reward: [(0, '-0.569')] [2022-07-10 22:39:14,656][26022] Updated weights on worker 0-0, policy_version 921626 (0.00085) [2022-07-10 22:39:16,369][26022] Updated weights on worker 0-0, policy_version 921636 (0.00091) [2022-07-10 22:39:18,293][26022] Updated weights on worker 0-0, policy_version 921646 (0.00107) [2022-07-10 22:39:18,392][25689] Fps is (10 sec: 5581.3, 60 sec: 5520.2, 300 sec: 5541.9). Total num frames: 943765504. Throughput: 0: 5686.1. Samples: 943774610. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:18,393][25689] Avg episode reward: [(0, '-0.583')] [2022-07-10 22:39:20,047][26022] Updated weights on worker 0-0, policy_version 921656 (0.00081) [2022-07-10 22:39:21,889][26022] Updated weights on worker 0-0, policy_version 921666 (0.00108) [2022-07-10 22:39:23,400][25689] Fps is (10 sec: 5479.7, 60 sec: 5509.4, 300 sec: 5539.3). Total num frames: 943793152. Throughput: 0: 4968.2. Samples: 943791380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:23,400][25689] Avg episode reward: [(0, '-0.545')] [2022-07-10 22:39:23,815][26022] Updated weights on worker 0-0, policy_version 921676 (0.00084) [2022-07-10 22:39:25,573][26022] Updated weights on worker 0-0, policy_version 921686 (0.00082) [2022-07-10 22:39:27,539][26022] Updated weights on worker 0-0, policy_version 921696 (0.00083) [2022-07-10 22:39:28,446][25689] Fps is (10 sec: 5398.0, 60 sec: 5493.7, 300 sec: 5545.7). Total num frames: 943819776. Throughput: 0: 5794.6. Samples: 943824568. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:28,447][25689] Avg episode reward: [(0, '0.496')] [2022-07-10 22:39:29,424][26022] Updated weights on worker 0-0, policy_version 921706 (0.00087) [2022-07-10 22:39:31,223][26022] Updated weights on worker 0-0, policy_version 921716 (0.00083) [2022-07-10 22:39:33,089][26022] Updated weights on worker 0-0, policy_version 921726 (0.00097) [2022-07-10 22:39:33,466][25689] Fps is (10 sec: 5594.9, 60 sec: 5529.8, 300 sec: 5542.4). Total num frames: 943849472. Throughput: 0: 5797.4. Samples: 943857810. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:33,467][25689] Avg episode reward: [(0, '-0.093')] [2022-07-10 22:39:34,848][26022] Updated weights on worker 0-0, policy_version 921736 (0.00080) [2022-07-10 22:39:36,581][26022] Updated weights on worker 0-0, policy_version 921746 (0.00091) [2022-07-10 22:39:38,509][25689] Fps is (10 sec: 5698.4, 60 sec: 5513.4, 300 sec: 5541.7). Total num frames: 943877120. Throughput: 0: 4975.3. Samples: 943874700. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:38,510][25689] Avg episode reward: [(0, '-0.250')] [2022-07-10 22:39:38,671][26022] Updated weights on worker 0-0, policy_version 921756 (0.00088) [2022-07-10 22:39:40,201][26022] Updated weights on worker 0-0, policy_version 921766 (0.00055) [2022-07-10 22:39:42,337][26022] Updated weights on worker 0-0, policy_version 921776 (0.00085) [2022-07-10 22:39:43,544][25689] Fps is (10 sec: 5588.5, 60 sec: 5550.1, 300 sec: 5545.5). Total num frames: 943905792. Throughput: 0: 5782.9. Samples: 943907872. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:43,545][25689] Avg episode reward: [(0, '-0.490')] [2022-07-10 22:39:44,104][26022] Updated weights on worker 0-0, policy_version 921786 (0.00093) [2022-07-10 22:39:46,020][26022] Updated weights on worker 0-0, policy_version 921796 (0.00093) [2022-07-10 22:39:47,843][26022] Updated weights on worker 0-0, policy_version 921806 (0.00087) [2022-07-10 22:39:48,641][25689] Fps is (10 sec: 5558.9, 60 sec: 5515.8, 300 sec: 5541.2). Total num frames: 943933440. Throughput: 0: 5758.3. Samples: 943940856. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:48,641][25689] Avg episode reward: [(0, '-1.221')] [2022-07-10 22:39:49,887][26022] Updated weights on worker 0-0, policy_version 921816 (0.00093) [2022-07-10 22:39:51,468][26022] Updated weights on worker 0-0, policy_version 921826 (0.00084) [2022-07-10 22:39:53,452][26022] Updated weights on worker 0-0, policy_version 921836 (0.00083) [2022-07-10 22:39:53,686][25689] Fps is (10 sec: 5452.2, 60 sec: 5515.2, 300 sec: 5543.9). Total num frames: 943961088. Throughput: 0: 4942.5. Samples: 943957746. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:53,686][25689] Avg episode reward: [(0, '-1.997')] [2022-07-10 22:39:54,981][26022] Updated weights on worker 0-0, policy_version 921846 (0.00087) [2022-07-10 22:39:57,272][26022] Updated weights on worker 0-0, policy_version 921856 (0.00089) [2022-07-10 22:39:58,719][25689] Fps is (10 sec: 5588.3, 60 sec: 5514.3, 300 sec: 5540.0). Total num frames: 943989760. Throughput: 0: 5766.0. Samples: 943991230. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:39:58,720][25689] Avg episode reward: [(0, '-1.654')] [2022-07-10 22:39:59,032][26022] Updated weights on worker 0-0, policy_version 921866 (0.00094) [2022-07-10 22:40:00,942][26022] Updated weights on worker 0-0, policy_version 921876 (0.00096) [2022-07-10 22:40:02,870][26022] Updated weights on worker 0-0, policy_version 921886 (0.00092) [2022-07-10 22:40:03,756][25689] Fps is (10 sec: 5389.2, 60 sec: 5512.8, 300 sec: 5536.4). Total num frames: 944015360. Throughput: 0: 5662.8. Samples: 944022332. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:03,757][25689] Avg episode reward: [(0, '-1.073')] [2022-07-10 22:40:04,882][26022] Updated weights on worker 0-0, policy_version 921896 (0.00086) [2022-07-10 22:40:06,570][26022] Updated weights on worker 0-0, policy_version 921906 (0.00089) [2022-07-10 22:40:08,537][26022] Updated weights on worker 0-0, policy_version 921916 (0.00088) [2022-07-10 22:40:08,847][25689] Fps is (10 sec: 5257.7, 60 sec: 5513.6, 300 sec: 5542.1). Total num frames: 944043008. Throughput: 0: 4859.7. Samples: 944039050. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:08,847][25689] Avg episode reward: [(0, '-1.153')] [2022-07-10 22:40:10,275][26022] Updated weights on worker 0-0, policy_version 921926 (0.00090) [2022-07-10 22:40:12,175][26022] Updated weights on worker 0-0, policy_version 921936 (0.00090) [2022-07-10 22:40:13,861][25689] Fps is (10 sec: 5573.8, 60 sec: 5513.0, 300 sec: 5542.1). Total num frames: 944071680. Throughput: 0: 5671.1. Samples: 944072160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:13,861][25689] Avg episode reward: [(0, '-0.751')] [2022-07-10 22:40:14,042][26022] Updated weights on worker 0-0, policy_version 921946 (0.00094) [2022-07-10 22:40:16,096][26022] Updated weights on worker 0-0, policy_version 921956 (0.00052) [2022-07-10 22:40:17,783][26022] Updated weights on worker 0-0, policy_version 921966 (0.00081) [2022-07-10 22:40:18,893][25689] Fps is (10 sec: 5504.0, 60 sec: 5500.7, 300 sec: 5534.9). Total num frames: 944098304. Throughput: 0: 5665.2. Samples: 944105520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:18,895][25689] Avg episode reward: [(0, '-0.459')] [2022-07-10 22:40:19,825][26022] Updated weights on worker 0-0, policy_version 921976 (0.00086) [2022-07-10 22:40:21,418][26022] Updated weights on worker 0-0, policy_version 921986 (0.00091) [2022-07-10 22:40:23,469][26022] Updated weights on worker 0-0, policy_version 921996 (0.01458) [2022-07-10 22:40:23,910][25689] Fps is (10 sec: 5502.4, 60 sec: 5516.8, 300 sec: 5536.5). Total num frames: 944126976. Throughput: 0: 4959.0. Samples: 944122274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:23,911][25689] Avg episode reward: [(0, '-0.149')] [2022-07-10 22:40:25,231][26022] Updated weights on worker 0-0, policy_version 922006 (0.00089) [2022-07-10 22:40:26,911][26022] Updated weights on worker 0-0, policy_version 922016 (0.00085) [2022-07-10 22:40:28,848][26022] Updated weights on worker 0-0, policy_version 922026 (0.00089) [2022-07-10 22:40:28,958][25689] Fps is (10 sec: 5595.9, 60 sec: 5533.6, 300 sec: 5536.3). Total num frames: 944154624. Throughput: 0: 5807.6. Samples: 944155848. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:28,958][25689] Avg episode reward: [(0, '-0.144')] [2022-07-10 22:40:30,554][26022] Updated weights on worker 0-0, policy_version 922036 (0.00085) [2022-07-10 22:40:32,527][26022] Updated weights on worker 0-0, policy_version 922046 (0.00086) [2022-07-10 22:40:33,968][25689] Fps is (10 sec: 5599.8, 60 sec: 5517.6, 300 sec: 5540.3). Total num frames: 944183296. Throughput: 0: 5833.5. Samples: 944189454. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:33,968][25689] Avg episode reward: [(0, '-0.887')] [2022-07-10 22:40:34,241][26022] Updated weights on worker 0-0, policy_version 922056 (0.00090) [2022-07-10 22:40:36,041][26022] Updated weights on worker 0-0, policy_version 922066 (0.00087) [2022-07-10 22:40:37,998][26022] Updated weights on worker 0-0, policy_version 922076 (0.00090) [2022-07-10 22:40:38,999][25689] Fps is (10 sec: 5507.1, 60 sec: 5501.8, 300 sec: 5533.6). Total num frames: 944209920. Throughput: 0: 5013.9. Samples: 944206326. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:38,999][25689] Avg episode reward: [(0, '-0.965')] [2022-07-10 22:40:39,682][26022] Updated weights on worker 0-0, policy_version 922086 (0.00088) [2022-07-10 22:40:41,745][26022] Updated weights on worker 0-0, policy_version 922096 (0.00093) [2022-07-10 22:40:43,527][26022] Updated weights on worker 0-0, policy_version 922106 (0.00084) [2022-07-10 22:40:44,031][25689] Fps is (10 sec: 5495.0, 60 sec: 5502.0, 300 sec: 5531.3). Total num frames: 944238592. Throughput: 0: 5822.4. Samples: 944239424. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:44,031][25689] Avg episode reward: [(0, '-0.512')] [2022-07-10 22:40:45,400][26022] Updated weights on worker 0-0, policy_version 922116 (0.00089) [2022-07-10 22:40:47,249][26022] Updated weights on worker 0-0, policy_version 922126 (0.00085) [2022-07-10 22:40:48,920][26022] Updated weights on worker 0-0, policy_version 922136 (0.00090) [2022-07-10 22:40:49,084][25689] Fps is (10 sec: 5787.5, 60 sec: 5539.9, 300 sec: 5537.2). Total num frames: 944268288. Throughput: 0: 5829.3. Samples: 944273170. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:49,084][25689] Avg episode reward: [(0, '-0.808')] [2022-07-10 22:40:50,873][26022] Updated weights on worker 0-0, policy_version 922146 (0.00085) [2022-07-10 22:40:52,667][26022] Updated weights on worker 0-0, policy_version 922156 (0.00089) [2022-07-10 22:40:54,087][25689] Fps is (10 sec: 5600.7, 60 sec: 5526.8, 300 sec: 5538.0). Total num frames: 944294912. Throughput: 0: 4996.6. Samples: 944289980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:54,087][25689] Avg episode reward: [(0, '-0.633')] [2022-07-10 22:40:54,583][26022] Updated weights on worker 0-0, policy_version 922166 (0.00083) [2022-07-10 22:40:56,356][26022] Updated weights on worker 0-0, policy_version 922176 (0.00090) [2022-07-10 22:40:58,203][26022] Updated weights on worker 0-0, policy_version 922186 (0.00081) [2022-07-10 22:40:59,100][25689] Fps is (10 sec: 5418.7, 60 sec: 5511.7, 300 sec: 5534.9). Total num frames: 944322560. Throughput: 0: 5832.8. Samples: 944323570. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:40:59,107][25689] Avg episode reward: [(0, '-0.715')] [2022-07-10 22:41:00,066][26022] Updated weights on worker 0-0, policy_version 922196 (0.00086) [2022-07-10 22:41:02,296][26022] Updated weights on worker 0-0, policy_version 922206 (0.00086) [2022-07-10 22:41:04,027][26022] Updated weights on worker 0-0, policy_version 922216 (0.00090) [2022-07-10 22:41:04,110][25689] Fps is (10 sec: 5414.3, 60 sec: 5531.1, 300 sec: 5532.6). Total num frames: 944349184. Throughput: 0: 5748.4. Samples: 944354850. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:41:04,111][25689] Avg episode reward: [(0, '0.026')] [2022-07-10 22:41:05,817][26022] Updated weights on worker 0-0, policy_version 922226 (0.00085) [2022-07-10 22:41:07,723][26022] Updated weights on worker 0-0, policy_version 922236 (0.00084) [2022-07-10 22:41:09,171][25689] Fps is (10 sec: 5490.2, 60 sec: 5550.8, 300 sec: 5539.9). Total num frames: 944377856. Throughput: 0: 4893.0. Samples: 944371458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:41:09,173][25689] Avg episode reward: [(0, '0.370')] [2022-07-10 22:41:09,545][26022] Updated weights on worker 0-0, policy_version 922246 (0.00091) [2022-07-10 22:41:11,495][26022] Updated weights on worker 0-0, policy_version 922256 (0.00091) [2022-07-10 22:41:12,359][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:41:12,378][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000922261_944395264.pth [2022-07-10 22:41:12,378][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000920311_942398464.pth [2022-07-10 22:41:13,433][26022] Updated weights on worker 0-0, policy_version 922266 (0.00088) [2022-07-10 22:41:14,187][25689] Fps is (10 sec: 5386.1, 60 sec: 5499.7, 300 sec: 5529.7). Total num frames: 944403456. Throughput: 0: 5703.1. Samples: 944404614. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:41:14,187][25689] Avg episode reward: [(0, '0.435')] [2022-07-10 22:41:15,079][26022] Updated weights on worker 0-0, policy_version 922276 (0.00084) [2022-07-10 22:41:17,012][26022] Updated weights on worker 0-0, policy_version 922286 (0.00088) [2022-07-10 22:41:18,600][26022] Updated weights on worker 0-0, policy_version 922296 (0.00093) [2022-07-10 22:41:19,200][25689] Fps is (10 sec: 5513.8, 60 sec: 5552.4, 300 sec: 5533.2). Total num frames: 944433152. Throughput: 0: 5711.0. Samples: 944438364. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:41:19,201][25689] Avg episode reward: [(0, '0.609')] [2022-07-10 22:41:20,834][26022] Updated weights on worker 0-0, policy_version 922306 (0.00088) [2022-07-10 22:41:22,499][26022] Updated weights on worker 0-0, policy_version 922316 (0.00083) [2022-07-10 22:41:24,210][25689] Fps is (10 sec: 5720.9, 60 sec: 5536.0, 300 sec: 5534.5). Total num frames: 944460800. Throughput: 0: 4979.9. Samples: 944454946. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:41:24,211][25689] Avg episode reward: [(0, '0.493')] [2022-07-10 22:41:24,491][26022] Updated weights on worker 0-0, policy_version 922326 (0.00091) [2022-07-10 22:41:26,371][26022] Updated weights on worker 0-0, policy_version 922336 (0.00094) [2022-07-10 22:41:28,080][26022] Updated weights on worker 0-0, policy_version 922346 (0.00086) [2022-07-10 22:41:29,345][25689] Fps is (10 sec: 5450.6, 60 sec: 5528.1, 300 sec: 5525.9). Total num frames: 944488448. Throughput: 0: 5788.9. Samples: 944488242. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-10 22:41:29,346][25689] Avg episode reward: [(0, '0.230')] [2022-07-10 22:41:29,919][26022] Updated weights on worker 0-0, policy_version 922356 (0.00089) [2022-07-10 22:41:32,011][26022] Updated weights on worker 0-0, policy_version 922366 (0.00089) [2022-07-10 22:41:33,375][26022] Updated weights on worker 0-0, policy_version 922376 (0.00084) [2022-07-10 22:41:34,381][25689] Fps is (10 sec: 5537.3, 60 sec: 5525.6, 300 sec: 5532.6). Total num frames: 944517120. Throughput: 0: 5809.1. Samples: 944521928. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:41:34,382][25689] Avg episode reward: [(0, '-0.108')] [2022-07-10 22:41:35,526][26022] Updated weights on worker 0-0, policy_version 922386 (0.00088) [2022-07-10 22:41:37,195][26022] Updated weights on worker 0-0, policy_version 922396 (0.00092) [2022-07-10 22:41:39,162][26022] Updated weights on worker 0-0, policy_version 922406 (0.00087) [2022-07-10 22:41:39,403][25689] Fps is (10 sec: 5701.1, 60 sec: 5560.4, 300 sec: 5532.3). Total num frames: 944545792. Throughput: 0: 4962.6. Samples: 944538626. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:41:39,405][25689] Avg episode reward: [(0, '-0.346')] [2022-07-10 22:41:41,134][26022] Updated weights on worker 0-0, policy_version 922416 (0.00090) [2022-07-10 22:41:42,755][26022] Updated weights on worker 0-0, policy_version 922426 (0.00079) [2022-07-10 22:41:44,434][25689] Fps is (10 sec: 5500.6, 60 sec: 5526.6, 300 sec: 5530.6). Total num frames: 944572416. Throughput: 0: 5788.7. Samples: 944572016. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:41:44,435][25689] Avg episode reward: [(0, '-0.395')] [2022-07-10 22:41:44,788][26022] Updated weights on worker 0-0, policy_version 922436 (0.00095) [2022-07-10 22:41:46,411][26022] Updated weights on worker 0-0, policy_version 922446 (0.00087) [2022-07-10 22:41:48,199][26022] Updated weights on worker 0-0, policy_version 922456 (0.00088) [2022-07-10 22:41:49,475][25689] Fps is (10 sec: 5591.7, 60 sec: 5527.7, 300 sec: 5530.8). Total num frames: 944602112. Throughput: 0: 5826.8. Samples: 944605538. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:41:49,477][25689] Avg episode reward: [(0, '-0.227')] [2022-07-10 22:41:50,263][26022] Updated weights on worker 0-0, policy_version 922466 (0.00088) [2022-07-10 22:41:51,874][26022] Updated weights on worker 0-0, policy_version 922476 (0.00080) [2022-07-10 22:41:53,875][26022] Updated weights on worker 0-0, policy_version 922486 (0.00094) [2022-07-10 22:41:54,480][25689] Fps is (10 sec: 5708.1, 60 sec: 5544.4, 300 sec: 5530.8). Total num frames: 944629760. Throughput: 0: 5000.2. Samples: 944622426. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:41:54,481][25689] Avg episode reward: [(0, '-0.011')] [2022-07-10 22:41:55,412][26022] Updated weights on worker 0-0, policy_version 922496 (0.00084) [2022-07-10 22:41:57,516][26022] Updated weights on worker 0-0, policy_version 922506 (0.00089) [2022-07-10 22:41:59,243][26022] Updated weights on worker 0-0, policy_version 922516 (0.00082) [2022-07-10 22:41:59,531][25689] Fps is (10 sec: 5499.2, 60 sec: 5541.0, 300 sec: 5537.2). Total num frames: 944657408. Throughput: 0: 5851.8. Samples: 944656408. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:41:59,531][25689] Avg episode reward: [(0, '0.323')] [2022-07-10 22:42:01,104][26022] Updated weights on worker 0-0, policy_version 922526 (0.00090) [2022-07-10 22:42:03,349][26022] Updated weights on worker 0-0, policy_version 922536 (0.00086) [2022-07-10 22:42:04,609][25689] Fps is (10 sec: 5358.3, 60 sec: 5534.8, 300 sec: 5531.4). Total num frames: 944684032. Throughput: 0: 5747.3. Samples: 944687966. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:04,610][25689] Avg episode reward: [(0, '0.476')] [2022-07-10 22:42:05,053][26022] Updated weights on worker 0-0, policy_version 922546 (0.00206) [2022-07-10 22:42:06,863][26022] Updated weights on worker 0-0, policy_version 922556 (0.00108) [2022-07-10 22:42:08,693][26022] Updated weights on worker 0-0, policy_version 922566 (0.00080) [2022-07-10 22:42:09,720][25689] Fps is (10 sec: 5527.3, 60 sec: 5547.1, 300 sec: 5536.3). Total num frames: 944713728. Throughput: 0: 5726.7. Samples: 944721474. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:09,722][25689] Avg episode reward: [(0, '0.845')] [2022-07-10 22:42:10,496][26022] Updated weights on worker 0-0, policy_version 922576 (0.00090) [2022-07-10 22:42:12,458][26022] Updated weights on worker 0-0, policy_version 922586 (0.00088) [2022-07-10 22:42:14,444][26022] Updated weights on worker 0-0, policy_version 922596 (0.00092) [2022-07-10 22:42:14,819][25689] Fps is (10 sec: 5415.7, 60 sec: 5539.5, 300 sec: 5517.8). Total num frames: 944739328. Throughput: 0: 5690.8. Samples: 944738170. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:14,820][25689] Avg episode reward: [(0, '0.844')] [2022-07-10 22:42:15,958][26022] Updated weights on worker 0-0, policy_version 922606 (0.00093) [2022-07-10 22:42:18,099][26022] Updated weights on worker 0-0, policy_version 922616 (0.00092) [2022-07-10 22:42:19,603][26022] Updated weights on worker 0-0, policy_version 922626 (0.00093) [2022-07-10 22:42:19,885][25689] Fps is (10 sec: 5540.7, 60 sec: 5551.6, 300 sec: 5537.6). Total num frames: 944770048. Throughput: 0: 5660.8. Samples: 944771630. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:19,887][25689] Avg episode reward: [(0, '0.624')] [2022-07-10 22:42:21,683][26022] Updated weights on worker 0-0, policy_version 922636 (0.00090) [2022-07-10 22:42:23,255][26022] Updated weights on worker 0-0, policy_version 922646 (0.00091) [2022-07-10 22:42:24,903][25689] Fps is (10 sec: 5585.3, 60 sec: 5517.1, 300 sec: 5521.4). Total num frames: 944795648. Throughput: 0: 5782.3. Samples: 944805310. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:24,904][25689] Avg episode reward: [(0, '0.239')] [2022-07-10 22:42:25,366][26022] Updated weights on worker 0-0, policy_version 922656 (0.00088) [2022-07-10 22:42:27,183][26022] Updated weights on worker 0-0, policy_version 922666 (0.00087) [2022-07-10 22:42:28,883][26022] Updated weights on worker 0-0, policy_version 922676 (0.00080) [2022-07-10 22:42:29,969][25689] Fps is (10 sec: 5483.4, 60 sec: 5557.1, 300 sec: 5525.3). Total num frames: 944825344. Throughput: 0: 4963.7. Samples: 944821984. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:29,971][25689] Avg episode reward: [(0, '0.354')] [2022-07-10 22:42:30,781][26022] Updated weights on worker 0-0, policy_version 922686 (0.00080) [2022-07-10 22:42:32,596][26022] Updated weights on worker 0-0, policy_version 922696 (0.00092) [2022-07-10 22:42:34,550][26022] Updated weights on worker 0-0, policy_version 922706 (0.00098) [2022-07-10 22:42:34,985][25689] Fps is (10 sec: 5687.3, 60 sec: 5542.1, 300 sec: 5529.6). Total num frames: 944852992. Throughput: 0: 5822.7. Samples: 944855592. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:34,987][25689] Avg episode reward: [(0, '0.534')] [2022-07-10 22:42:36,311][26022] Updated weights on worker 0-0, policy_version 922716 (0.00095) [2022-07-10 22:42:38,142][26022] Updated weights on worker 0-0, policy_version 922726 (0.00083) [2022-07-10 22:42:39,992][25689] Fps is (10 sec: 5517.1, 60 sec: 5526.6, 300 sec: 5522.8). Total num frames: 944880640. Throughput: 0: 5834.4. Samples: 944888942. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:39,994][25689] Avg episode reward: [(0, '0.951')] [2022-07-10 22:42:39,996][26022] Updated weights on worker 0-0, policy_version 922736 (0.00090) [2022-07-10 22:42:41,870][26022] Updated weights on worker 0-0, policy_version 922746 (0.00099) [2022-07-10 22:42:43,431][26022] Updated weights on worker 0-0, policy_version 922756 (0.00085) [2022-07-10 22:42:45,028][25689] Fps is (10 sec: 5506.1, 60 sec: 5543.0, 300 sec: 5523.8). Total num frames: 944908288. Throughput: 0: 4984.4. Samples: 944905624. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:45,030][25689] Avg episode reward: [(0, '0.787')] [2022-07-10 22:42:45,535][26022] Updated weights on worker 0-0, policy_version 922766 (0.00097) [2022-07-10 22:42:47,294][26022] Updated weights on worker 0-0, policy_version 922776 (0.00090) [2022-07-10 22:42:49,201][26022] Updated weights on worker 0-0, policy_version 922786 (0.00085) [2022-07-10 22:42:50,073][25689] Fps is (10 sec: 5586.6, 60 sec: 5525.8, 300 sec: 5526.5). Total num frames: 944936960. Throughput: 0: 5822.3. Samples: 944939034. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:50,075][25689] Avg episode reward: [(0, '0.790')] [2022-07-10 22:42:51,064][26022] Updated weights on worker 0-0, policy_version 922796 (0.00090) [2022-07-10 22:42:52,656][26022] Updated weights on worker 0-0, policy_version 922806 (0.00090) [2022-07-10 22:42:54,650][26022] Updated weights on worker 0-0, policy_version 922816 (0.00089) [2022-07-10 22:42:55,081][25689] Fps is (10 sec: 5703.9, 60 sec: 5542.3, 300 sec: 5527.9). Total num frames: 944965632. Throughput: 0: 5819.2. Samples: 944972534. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:42:55,082][25689] Avg episode reward: [(0, '0.841')] [2022-07-10 22:42:56,588][26022] Updated weights on worker 0-0, policy_version 922826 (0.00086) [2022-07-10 22:42:58,346][26022] Updated weights on worker 0-0, policy_version 922836 (0.00093) [2022-07-10 22:43:00,132][25689] Fps is (10 sec: 5599.0, 60 sec: 5542.3, 300 sec: 5532.1). Total num frames: 944993280. Throughput: 0: 4986.5. Samples: 944989370. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:00,134][25689] Avg episode reward: [(0, '1.147')] [2022-07-10 22:43:00,340][26022] Updated weights on worker 0-0, policy_version 922846 (0.00087) [2022-07-10 22:43:02,015][26022] Updated weights on worker 0-0, policy_version 922856 (0.00098) [2022-07-10 22:43:04,376][26022] Updated weights on worker 0-0, policy_version 922866 (0.00088) [2022-07-10 22:43:05,151][25689] Fps is (10 sec: 5389.5, 60 sec: 5547.7, 300 sec: 5530.8). Total num frames: 945019904. Throughput: 0: 5700.6. Samples: 945020340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:05,152][25689] Avg episode reward: [(0, '0.861')] [2022-07-10 22:43:06,240][26022] Updated weights on worker 0-0, policy_version 922876 (0.00080) [2022-07-10 22:43:08,125][26022] Updated weights on worker 0-0, policy_version 922886 (0.00087) [2022-07-10 22:43:09,904][26022] Updated weights on worker 0-0, policy_version 922896 (0.00091) [2022-07-10 22:43:10,195][25689] Fps is (10 sec: 5291.8, 60 sec: 5503.2, 300 sec: 5524.9). Total num frames: 945046528. Throughput: 0: 5707.3. Samples: 945053872. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:10,195][25689] Avg episode reward: [(0, '0.403')] [2022-07-10 22:43:11,670][26022] Updated weights on worker 0-0, policy_version 922906 (0.00060) [2022-07-10 22:43:12,434][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:43:12,444][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000922910_945059840.pth [2022-07-10 22:43:12,444][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000920963_943066112.pth [2022-07-10 22:43:13,783][26022] Updated weights on worker 0-0, policy_version 922916 (0.00077) [2022-07-10 22:43:15,206][25689] Fps is (10 sec: 5296.1, 60 sec: 5528.1, 300 sec: 5521.5). Total num frames: 945073152. Throughput: 0: 4869.8. Samples: 945070534. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:15,206][25689] Avg episode reward: [(0, '0.429')] [2022-07-10 22:43:15,489][26022] Updated weights on worker 0-0, policy_version 922926 (0.00094) [2022-07-10 22:43:17,398][26022] Updated weights on worker 0-0, policy_version 922936 (0.00083) [2022-07-10 22:43:19,070][26022] Updated weights on worker 0-0, policy_version 922946 (0.00098) [2022-07-10 22:43:20,236][25689] Fps is (10 sec: 5608.7, 60 sec: 5514.4, 300 sec: 5525.8). Total num frames: 945102848. Throughput: 0: 5685.3. Samples: 945103668. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:20,237][25689] Avg episode reward: [(0, '0.319')] [2022-07-10 22:43:20,980][26022] Updated weights on worker 0-0, policy_version 922956 (0.00088) [2022-07-10 22:43:22,941][26022] Updated weights on worker 0-0, policy_version 922966 (0.00087) [2022-07-10 22:43:24,560][26022] Updated weights on worker 0-0, policy_version 922976 (0.00094) [2022-07-10 22:43:25,263][25689] Fps is (10 sec: 5701.9, 60 sec: 5547.5, 300 sec: 5526.4). Total num frames: 945130496. Throughput: 0: 5809.4. Samples: 945137174. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:25,263][25689] Avg episode reward: [(0, '0.838')] [2022-07-10 22:43:26,589][26022] Updated weights on worker 0-0, policy_version 922986 (0.00090) [2022-07-10 22:43:28,270][26022] Updated weights on worker 0-0, policy_version 922996 (0.00089) [2022-07-10 22:43:30,327][25689] Fps is (10 sec: 5480.0, 60 sec: 5513.8, 300 sec: 5526.0). Total num frames: 945158144. Throughput: 0: 4953.0. Samples: 945153586. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:30,327][25689] Avg episode reward: [(0, '0.266')] [2022-07-10 22:43:30,333][26022] Updated weights on worker 0-0, policy_version 923006 (0.00087) [2022-07-10 22:43:31,994][26022] Updated weights on worker 0-0, policy_version 923016 (0.00091) [2022-07-10 22:43:33,869][26022] Updated weights on worker 0-0, policy_version 923026 (0.00085) [2022-07-10 22:43:35,346][25689] Fps is (10 sec: 5585.5, 60 sec: 5530.5, 300 sec: 5526.6). Total num frames: 945186816. Throughput: 0: 5792.5. Samples: 945187196. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:35,346][25689] Avg episode reward: [(0, '-0.146')] [2022-07-10 22:43:35,844][26022] Updated weights on worker 0-0, policy_version 923036 (0.00089) [2022-07-10 22:43:37,339][26022] Updated weights on worker 0-0, policy_version 923046 (0.00083) [2022-07-10 22:43:39,547][26022] Updated weights on worker 0-0, policy_version 923056 (0.00096) [2022-07-10 22:43:40,379][25689] Fps is (10 sec: 5704.4, 60 sec: 5545.0, 300 sec: 5534.1). Total num frames: 945215488. Throughput: 0: 5815.4. Samples: 945220808. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:40,380][25689] Avg episode reward: [(0, '-0.655')] [2022-07-10 22:43:41,071][26022] Updated weights on worker 0-0, policy_version 923066 (0.00086) [2022-07-10 22:43:42,989][26022] Updated weights on worker 0-0, policy_version 923076 (0.00086) [2022-07-10 22:43:44,806][26022] Updated weights on worker 0-0, policy_version 923086 (0.00095) [2022-07-10 22:43:45,394][25689] Fps is (10 sec: 5401.4, 60 sec: 5513.1, 300 sec: 5521.7). Total num frames: 945241088. Throughput: 0: 4985.9. Samples: 945237546. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:45,394][25689] Avg episode reward: [(0, '-1.043')] [2022-07-10 22:43:46,884][26022] Updated weights on worker 0-0, policy_version 923096 (0.00098) [2022-07-10 22:43:48,706][26022] Updated weights on worker 0-0, policy_version 923106 (0.00081) [2022-07-10 22:43:50,496][25689] Fps is (10 sec: 5465.8, 60 sec: 5524.8, 300 sec: 5527.4). Total num frames: 945270784. Throughput: 0: 5792.4. Samples: 945270416. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:50,497][25689] Avg episode reward: [(0, '-1.137')] [2022-07-10 22:43:50,503][26022] Updated weights on worker 0-0, policy_version 923116 (0.00097) [2022-07-10 22:43:52,276][26022] Updated weights on worker 0-0, policy_version 923126 (0.00088) [2022-07-10 22:43:54,221][26022] Updated weights on worker 0-0, policy_version 923136 (0.00087) [2022-07-10 22:43:55,546][25689] Fps is (10 sec: 5648.2, 60 sec: 5504.0, 300 sec: 5523.5). Total num frames: 945298432. Throughput: 0: 5765.8. Samples: 945303666. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:43:55,548][25689] Avg episode reward: [(0, '-1.016')] [2022-07-10 22:43:56,106][26022] Updated weights on worker 0-0, policy_version 923146 (0.00082) [2022-07-10 22:43:57,908][26022] Updated weights on worker 0-0, policy_version 923156 (0.00087) [2022-07-10 22:43:59,838][26022] Updated weights on worker 0-0, policy_version 923166 (0.00086) [2022-07-10 22:44:00,633][25689] Fps is (10 sec: 5353.7, 60 sec: 5483.8, 300 sec: 5525.7). Total num frames: 945325056. Throughput: 0: 4914.6. Samples: 945320348. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:00,635][25689] Avg episode reward: [(0, '-0.440')] [2022-07-10 22:44:01,632][26022] Updated weights on worker 0-0, policy_version 923176 (0.00086) [2022-07-10 22:44:03,843][26022] Updated weights on worker 0-0, policy_version 923186 (0.00087) [2022-07-10 22:44:05,638][25689] Fps is (10 sec: 5377.9, 60 sec: 5502.1, 300 sec: 5527.5). Total num frames: 945352704. Throughput: 0: 5648.9. Samples: 945351904. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:05,638][25689] Avg episode reward: [(0, '0.526')] [2022-07-10 22:44:05,648][26022] Updated weights on worker 0-0, policy_version 923196 (0.00090) [2022-07-10 22:44:07,581][26022] Updated weights on worker 0-0, policy_version 923206 (0.00087) [2022-07-10 22:44:09,331][26022] Updated weights on worker 0-0, policy_version 923216 (0.00091) [2022-07-10 22:44:10,771][25689] Fps is (10 sec: 5454.6, 60 sec: 5510.8, 300 sec: 5521.7). Total num frames: 945380352. Throughput: 0: 5664.7. Samples: 945385266. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:10,771][25689] Avg episode reward: [(0, '0.904')] [2022-07-10 22:44:11,237][26022] Updated weights on worker 0-0, policy_version 923226 (0.00087) [2022-07-10 22:44:13,113][26022] Updated weights on worker 0-0, policy_version 923236 (0.00086) [2022-07-10 22:44:14,890][26022] Updated weights on worker 0-0, policy_version 923246 (0.00096) [2022-07-10 22:44:15,823][25689] Fps is (10 sec: 5529.7, 60 sec: 5540.9, 300 sec: 5525.7). Total num frames: 945409024. Throughput: 0: 4850.5. Samples: 945402018. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:15,823][25689] Avg episode reward: [(0, '0.515')] [2022-07-10 22:44:16,738][26022] Updated weights on worker 0-0, policy_version 923256 (0.00090) [2022-07-10 22:44:18,520][26022] Updated weights on worker 0-0, policy_version 923266 (0.00087) [2022-07-10 22:44:20,474][26022] Updated weights on worker 0-0, policy_version 923276 (0.00092) [2022-07-10 22:44:20,836][25689] Fps is (10 sec: 5494.0, 60 sec: 5491.8, 300 sec: 5522.2). Total num frames: 945435648. Throughput: 0: 5705.3. Samples: 945435608. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:20,836][25689] Avg episode reward: [(0, '0.423')] [2022-07-10 22:44:22,059][26022] Updated weights on worker 0-0, policy_version 923286 (0.00094) [2022-07-10 22:44:24,155][26022] Updated weights on worker 0-0, policy_version 923296 (0.00081) [2022-07-10 22:44:25,705][26022] Updated weights on worker 0-0, policy_version 923306 (0.00091) [2022-07-10 22:44:25,852][25689] Fps is (10 sec: 5717.8, 60 sec: 5543.4, 300 sec: 5536.5). Total num frames: 945466368. Throughput: 0: 5810.1. Samples: 945469350. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:25,852][25689] Avg episode reward: [(0, '-0.088')] [2022-07-10 22:44:27,873][26022] Updated weights on worker 0-0, policy_version 923316 (0.00086) [2022-07-10 22:44:29,223][26022] Updated weights on worker 0-0, policy_version 923326 (0.00083) [2022-07-10 22:44:30,901][25689] Fps is (10 sec: 5595.3, 60 sec: 5510.9, 300 sec: 5522.2). Total num frames: 945491968. Throughput: 0: 5847.8. Samples: 945502986. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:30,902][25689] Avg episode reward: [(0, '-0.113')] [2022-07-10 22:44:31,480][26022] Updated weights on worker 0-0, policy_version 923336 (0.00085) [2022-07-10 22:44:33,012][26022] Updated weights on worker 0-0, policy_version 923346 (0.00091) [2022-07-10 22:44:34,959][26022] Updated weights on worker 0-0, policy_version 923356 (0.00087) [2022-07-10 22:44:35,919][25689] Fps is (10 sec: 5594.7, 60 sec: 5544.9, 300 sec: 5533.0). Total num frames: 945522688. Throughput: 0: 5864.4. Samples: 945519868. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:35,919][25689] Avg episode reward: [(0, '-0.302')] [2022-07-10 22:44:36,887][26022] Updated weights on worker 0-0, policy_version 923366 (0.00083) [2022-07-10 22:44:38,555][26022] Updated weights on worker 0-0, policy_version 923376 (0.00083) [2022-07-10 22:44:40,641][26022] Updated weights on worker 0-0, policy_version 923386 (0.00088) [2022-07-10 22:44:40,923][25689] Fps is (10 sec: 5619.8, 60 sec: 5496.8, 300 sec: 5523.2). Total num frames: 945548288. Throughput: 0: 5861.0. Samples: 945553340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:40,923][25689] Avg episode reward: [(0, '-0.595')] [2022-07-10 22:44:42,257][26022] Updated weights on worker 0-0, policy_version 923396 (0.00085) [2022-07-10 22:44:44,351][26022] Updated weights on worker 0-0, policy_version 923406 (0.00085) [2022-07-10 22:44:45,870][26022] Updated weights on worker 0-0, policy_version 923416 (0.00085) [2022-07-10 22:44:45,938][25689] Fps is (10 sec: 5518.8, 60 sec: 5564.4, 300 sec: 5531.7). Total num frames: 945577984. Throughput: 0: 5843.3. Samples: 945586720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:45,939][25689] Avg episode reward: [(0, '0.153')] [2022-07-10 22:44:48,036][26022] Updated weights on worker 0-0, policy_version 923426 (0.00089) [2022-07-10 22:44:49,761][26022] Updated weights on worker 0-0, policy_version 923436 (0.00091) [2022-07-10 22:44:51,049][25689] Fps is (10 sec: 5662.9, 60 sec: 5529.8, 300 sec: 5530.4). Total num frames: 945605632. Throughput: 0: 4989.4. Samples: 945603512. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:51,050][25689] Avg episode reward: [(0, '0.505')] [2022-07-10 22:44:51,643][26022] Updated weights on worker 0-0, policy_version 923446 (0.00085) [2022-07-10 22:44:53,259][26022] Updated weights on worker 0-0, policy_version 923456 (0.00089) [2022-07-10 22:44:55,395][26022] Updated weights on worker 0-0, policy_version 923466 (0.00089) [2022-07-10 22:44:56,140][25689] Fps is (10 sec: 5420.3, 60 sec: 5526.1, 300 sec: 5525.9). Total num frames: 945633280. Throughput: 0: 5792.3. Samples: 945636996. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:44:56,141][25689] Avg episode reward: [(0, '0.271')] [2022-07-10 22:44:57,054][26022] Updated weights on worker 0-0, policy_version 923476 (0.00087) [2022-07-10 22:44:59,001][26022] Updated weights on worker 0-0, policy_version 923486 (0.00096) [2022-07-10 22:45:00,599][26022] Updated weights on worker 0-0, policy_version 923496 (0.00091) [2022-07-10 22:45:01,178][25689] Fps is (10 sec: 5560.6, 60 sec: 5564.4, 300 sec: 5536.2). Total num frames: 945661952. Throughput: 0: 5781.0. Samples: 945670432. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:45:01,178][25689] Avg episode reward: [(0, '0.361')] [2022-07-10 22:45:03,069][26022] Updated weights on worker 0-0, policy_version 923506 (0.00086) [2022-07-10 22:45:04,844][26022] Updated weights on worker 0-0, policy_version 923516 (0.00089) [2022-07-10 22:45:06,229][25689] Fps is (10 sec: 5277.9, 60 sec: 5509.5, 300 sec: 5526.6). Total num frames: 945686528. Throughput: 0: 4833.9. Samples: 945684802. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:45:06,235][25689] Avg episode reward: [(0, '1.146')] [2022-07-10 22:45:06,573][26022] Updated weights on worker 0-0, policy_version 923526 (0.00087) [2022-07-10 22:45:08,623][26022] Updated weights on worker 0-0, policy_version 923536 (0.00087) [2022-07-10 22:45:10,283][26022] Updated weights on worker 0-0, policy_version 923546 (0.00088) [2022-07-10 22:45:11,323][25689] Fps is (10 sec: 5349.5, 60 sec: 5546.8, 300 sec: 5528.6). Total num frames: 945716224. Throughput: 0: 5637.0. Samples: 945717796. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:45:11,325][25689] Avg episode reward: [(0, '1.419')] [2022-07-10 22:45:12,469][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:45:12,482][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000923556_945721344.pth [2022-07-10 22:45:12,483][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000921612_943730688.pth [2022-07-10 22:45:12,487][26022] Updated weights on worker 0-0, policy_version 923556 (0.00089) [2022-07-10 22:45:14,003][26022] Updated weights on worker 0-0, policy_version 923566 (0.00095) [2022-07-10 22:45:16,007][26022] Updated weights on worker 0-0, policy_version 923576 (0.00091) [2022-07-10 22:45:16,381][25689] Fps is (10 sec: 5648.5, 60 sec: 5529.4, 300 sec: 5531.6). Total num frames: 945743872. Throughput: 0: 5625.2. Samples: 945750854. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2022-07-10 22:45:16,383][25689] Avg episode reward: [(0, '1.104')] [2022-07-10 22:45:18,067][26022] Updated weights on worker 0-0, policy_version 923586 (0.00088) [2022-07-10 22:45:19,592][26022] Updated weights on worker 0-0, policy_version 923596 (0.00092) [2022-07-10 22:45:21,391][25689] Fps is (10 sec: 5390.7, 60 sec: 5529.7, 300 sec: 5524.8). Total num frames: 945770496. Throughput: 0: 4807.6. Samples: 945767608. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:21,393][25689] Avg episode reward: [(0, '0.971')] [2022-07-10 22:45:21,611][26022] Updated weights on worker 0-0, policy_version 923606 (0.00094) [2022-07-10 22:45:23,213][26022] Updated weights on worker 0-0, policy_version 923616 (0.00089) [2022-07-10 22:45:25,519][26022] Updated weights on worker 0-0, policy_version 923626 (0.00094) [2022-07-10 22:45:26,403][25689] Fps is (10 sec: 5619.6, 60 sec: 5513.1, 300 sec: 5532.3). Total num frames: 945800192. Throughput: 0: 5754.5. Samples: 945800894. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:26,403][25689] Avg episode reward: [(0, '0.843')] [2022-07-10 22:45:27,134][26022] Updated weights on worker 0-0, policy_version 923636 (0.00095) [2022-07-10 22:45:29,067][26022] Updated weights on worker 0-0, policy_version 923646 (0.00084) [2022-07-10 22:45:30,812][26022] Updated weights on worker 0-0, policy_version 923656 (0.00086) [2022-07-10 22:45:31,469][25689] Fps is (10 sec: 5486.8, 60 sec: 5511.6, 300 sec: 5521.0). Total num frames: 945825792. Throughput: 0: 5757.0. Samples: 945833774. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:31,469][25689] Avg episode reward: [(0, '0.672')] [2022-07-10 22:45:32,671][26022] Updated weights on worker 0-0, policy_version 923666 (0.00095) [2022-07-10 22:45:34,833][26022] Updated weights on worker 0-0, policy_version 923676 (0.00080) [2022-07-10 22:45:36,287][26022] Updated weights on worker 0-0, policy_version 923686 (0.00079) [2022-07-10 22:45:36,491][25689] Fps is (10 sec: 5582.8, 60 sec: 5511.2, 300 sec: 5534.9). Total num frames: 945856512. Throughput: 0: 4954.8. Samples: 945850496. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:36,491][25689] Avg episode reward: [(0, '0.464')] [2022-07-10 22:45:38,249][26022] Updated weights on worker 0-0, policy_version 923696 (0.00085) [2022-07-10 22:45:39,837][26022] Updated weights on worker 0-0, policy_version 923706 (0.00088) [2022-07-10 22:45:41,520][25689] Fps is (10 sec: 5705.2, 60 sec: 5525.8, 300 sec: 5528.1). Total num frames: 945883136. Throughput: 0: 5788.8. Samples: 945884132. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:41,523][25689] Avg episode reward: [(0, '0.605')] [2022-07-10 22:45:41,951][26022] Updated weights on worker 0-0, policy_version 923716 (0.00086) [2022-07-10 22:45:43,754][26022] Updated weights on worker 0-0, policy_version 923726 (0.00087) [2022-07-10 22:45:45,508][26022] Updated weights on worker 0-0, policy_version 923736 (0.00089) [2022-07-10 22:45:46,611][25689] Fps is (10 sec: 5362.9, 60 sec: 5485.2, 300 sec: 5520.5). Total num frames: 945910784. Throughput: 0: 5771.0. Samples: 945917514. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:46,611][25689] Avg episode reward: [(0, '0.710')] [2022-07-10 22:45:47,415][26022] Updated weights on worker 0-0, policy_version 923746 (0.00090) [2022-07-10 22:45:49,332][26022] Updated weights on worker 0-0, policy_version 923756 (0.00093) [2022-07-10 22:45:50,923][26022] Updated weights on worker 0-0, policy_version 923766 (0.00084) [2022-07-10 22:45:51,681][25689] Fps is (10 sec: 5643.2, 60 sec: 5522.6, 300 sec: 5529.6). Total num frames: 945940480. Throughput: 0: 4977.7. Samples: 945934388. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:51,682][25689] Avg episode reward: [(0, '0.757')] [2022-07-10 22:45:52,876][26022] Updated weights on worker 0-0, policy_version 923776 (0.00091) [2022-07-10 22:45:54,504][26022] Updated weights on worker 0-0, policy_version 923786 (0.00087) [2022-07-10 22:45:56,698][25689] Fps is (10 sec: 5583.0, 60 sec: 5512.4, 300 sec: 5526.0). Total num frames: 945967104. Throughput: 0: 5830.6. Samples: 945968318. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:45:56,699][25689] Avg episode reward: [(0, '1.050')] [2022-07-10 22:45:56,700][26022] Updated weights on worker 0-0, policy_version 923796 (0.00086) [2022-07-10 22:45:57,990][26022] Updated weights on worker 0-0, policy_version 923806 (0.01574) [2022-07-10 22:46:00,189][26022] Updated weights on worker 0-0, policy_version 923816 (0.00087) [2022-07-10 22:46:01,783][25689] Fps is (10 sec: 5372.8, 60 sec: 5491.3, 300 sec: 5528.1). Total num frames: 945994752. Throughput: 0: 5812.2. Samples: 946001904. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:01,783][25689] Avg episode reward: [(0, '1.304')] [2022-07-10 22:46:02,413][26022] Updated weights on worker 0-0, policy_version 923826 (0.00082) [2022-07-10 22:46:04,085][26022] Updated weights on worker 0-0, policy_version 923836 (0.00094) [2022-07-10 22:46:06,168][26022] Updated weights on worker 0-0, policy_version 923846 (0.00086) [2022-07-10 22:46:06,787][25689] Fps is (10 sec: 5481.2, 60 sec: 5546.3, 300 sec: 5525.7). Total num frames: 946022400. Throughput: 0: 4923.3. Samples: 946016846. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:06,789][25689] Avg episode reward: [(0, '0.881')] [2022-07-10 22:46:07,558][26022] Updated weights on worker 0-0, policy_version 923856 (0.00083) [2022-07-10 22:46:09,710][26022] Updated weights on worker 0-0, policy_version 923866 (0.00087) [2022-07-10 22:46:11,373][26022] Updated weights on worker 0-0, policy_version 923876 (0.00085) [2022-07-10 22:46:11,855][25689] Fps is (10 sec: 5591.7, 60 sec: 5531.8, 300 sec: 5535.1). Total num frames: 946051072. Throughput: 0: 5777.0. Samples: 946050930. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:11,857][25689] Avg episode reward: [(0, '1.179')] [2022-07-10 22:46:13,253][26022] Updated weights on worker 0-0, policy_version 923886 (0.00080) [2022-07-10 22:46:15,128][26022] Updated weights on worker 0-0, policy_version 923896 (0.00093) [2022-07-10 22:46:16,857][26022] Updated weights on worker 0-0, policy_version 923906 (0.00088) [2022-07-10 22:46:16,910][25689] Fps is (10 sec: 5664.7, 60 sec: 5548.9, 300 sec: 5530.9). Total num frames: 946079744. Throughput: 0: 5750.5. Samples: 946084542. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:16,910][25689] Avg episode reward: [(0, '1.550')] [2022-07-10 22:46:18,887][26022] Updated weights on worker 0-0, policy_version 923916 (0.00087) [2022-07-10 22:46:20,564][26022] Updated weights on worker 0-0, policy_version 923926 (0.00089) [2022-07-10 22:46:21,915][25689] Fps is (10 sec: 5496.5, 60 sec: 5549.4, 300 sec: 5527.5). Total num frames: 946106368. Throughput: 0: 4937.7. Samples: 946101310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:21,915][25689] Avg episode reward: [(0, '1.818')] [2022-07-10 22:46:22,469][26022] Updated weights on worker 0-0, policy_version 923936 (0.00077) [2022-07-10 22:46:24,179][26022] Updated weights on worker 0-0, policy_version 923946 (0.00084) [2022-07-10 22:46:26,318][26022] Updated weights on worker 0-0, policy_version 923956 (0.00088) [2022-07-10 22:46:26,959][25689] Fps is (10 sec: 5604.5, 60 sec: 5546.5, 300 sec: 5536.1). Total num frames: 946136064. Throughput: 0: 5845.7. Samples: 946134766. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:26,960][25689] Avg episode reward: [(0, '1.850')] [2022-07-10 22:46:27,942][26022] Updated weights on worker 0-0, policy_version 923966 (0.00084) [2022-07-10 22:46:29,975][26022] Updated weights on worker 0-0, policy_version 923976 (0.00095) [2022-07-10 22:46:31,911][26022] Updated weights on worker 0-0, policy_version 923986 (0.00088) [2022-07-10 22:46:32,001][25689] Fps is (10 sec: 5583.8, 60 sec: 5565.5, 300 sec: 5529.1). Total num frames: 946162688. Throughput: 0: 5815.1. Samples: 946168084. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:32,003][25689] Avg episode reward: [(0, '1.599')] [2022-07-10 22:46:33,479][26022] Updated weights on worker 0-0, policy_version 923996 (0.00088) [2022-07-10 22:46:35,465][26022] Updated weights on worker 0-0, policy_version 924006 (0.00089) [2022-07-10 22:46:37,005][25689] Fps is (10 sec: 5503.9, 60 sec: 5533.3, 300 sec: 5529.4). Total num frames: 946191360. Throughput: 0: 4993.5. Samples: 946184890. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:37,007][25689] Avg episode reward: [(0, '-0.371')] [2022-07-10 22:46:37,123][26022] Updated weights on worker 0-0, policy_version 924016 (0.00469) [2022-07-10 22:46:39,028][26022] Updated weights on worker 0-0, policy_version 924026 (0.00099) [2022-07-10 22:46:40,901][26022] Updated weights on worker 0-0, policy_version 924036 (0.00084) [2022-07-10 22:46:42,012][25689] Fps is (10 sec: 5523.5, 60 sec: 5535.4, 300 sec: 5529.9). Total num frames: 946217984. Throughput: 0: 5826.1. Samples: 946218398. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:42,013][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 22:46:42,659][26022] Updated weights on worker 0-0, policy_version 924046 (0.00087) [2022-07-10 22:46:44,476][26022] Updated weights on worker 0-0, policy_version 924056 (0.00088) [2022-07-10 22:46:46,476][26022] Updated weights on worker 0-0, policy_version 924066 (0.00085) [2022-07-10 22:46:47,025][25689] Fps is (10 sec: 5518.9, 60 sec: 5559.5, 300 sec: 5527.0). Total num frames: 946246656. Throughput: 0: 5829.5. Samples: 946251740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:47,025][25689] Avg episode reward: [(0, '-1.708')] [2022-07-10 22:46:48,268][26022] Updated weights on worker 0-0, policy_version 924076 (0.00050) [2022-07-10 22:46:50,206][26022] Updated weights on worker 0-0, policy_version 924086 (0.00095) [2022-07-10 22:46:51,927][26022] Updated weights on worker 0-0, policy_version 924096 (0.00088) [2022-07-10 22:46:52,101][25689] Fps is (10 sec: 5683.7, 60 sec: 5542.0, 300 sec: 5529.1). Total num frames: 946275328. Throughput: 0: 4979.1. Samples: 946268166. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:52,102][25689] Avg episode reward: [(0, '-2.118')] [2022-07-10 22:46:53,955][26022] Updated weights on worker 0-0, policy_version 924106 (0.00086) [2022-07-10 22:46:55,615][26022] Updated weights on worker 0-0, policy_version 924116 (0.00082) [2022-07-10 22:46:57,164][25689] Fps is (10 sec: 5554.8, 60 sec: 5554.8, 300 sec: 5528.9). Total num frames: 946302976. Throughput: 0: 5807.4. Samples: 946301958. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:46:57,164][25689] Avg episode reward: [(0, '-2.211')] [2022-07-10 22:46:57,396][26022] Updated weights on worker 0-0, policy_version 924126 (0.00099) [2022-07-10 22:46:59,432][26022] Updated weights on worker 0-0, policy_version 924136 (0.00093) [2022-07-10 22:47:01,198][26022] Updated weights on worker 0-0, policy_version 924146 (0.00084) [2022-07-10 22:47:02,169][25689] Fps is (10 sec: 5391.0, 60 sec: 5545.1, 300 sec: 5530.2). Total num frames: 946329600. Throughput: 0: 5814.8. Samples: 946335604. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:02,169][25689] Avg episode reward: [(0, '-1.043')] [2022-07-10 22:47:03,389][26022] Updated weights on worker 0-0, policy_version 924156 (0.00090) [2022-07-10 22:47:05,155][26022] Updated weights on worker 0-0, policy_version 924166 (0.00081) [2022-07-10 22:47:06,891][26022] Updated weights on worker 0-0, policy_version 924176 (0.00091) [2022-07-10 22:47:07,210][25689] Fps is (10 sec: 5402.4, 60 sec: 5541.7, 300 sec: 5524.6). Total num frames: 946357248. Throughput: 0: 4882.3. Samples: 946350290. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:07,210][25689] Avg episode reward: [(0, '0.076')] [2022-07-10 22:47:08,963][26022] Updated weights on worker 0-0, policy_version 924186 (0.00535) [2022-07-10 22:47:10,786][26022] Updated weights on worker 0-0, policy_version 924196 (0.00092) [2022-07-10 22:47:12,303][25689] Fps is (10 sec: 5557.3, 60 sec: 5539.4, 300 sec: 5535.1). Total num frames: 946385920. Throughput: 0: 5725.3. Samples: 946383826. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:12,303][25689] Avg episode reward: [(0, '0.334')] [2022-07-10 22:47:12,456][26022] Updated weights on worker 0-0, policy_version 924206 (0.00094) [2022-07-10 22:47:12,620][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:47:12,634][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000924207_946387968.pth [2022-07-10 22:47:12,634][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000922261_944395264.pth [2022-07-10 22:47:14,350][26022] Updated weights on worker 0-0, policy_version 924216 (0.00069) [2022-07-10 22:47:16,289][26022] Updated weights on worker 0-0, policy_version 924226 (0.00090) [2022-07-10 22:47:17,329][25689] Fps is (10 sec: 5565.7, 60 sec: 5525.1, 300 sec: 5525.5). Total num frames: 946413568. Throughput: 0: 5711.9. Samples: 946417140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:17,329][25689] Avg episode reward: [(0, '0.373')] [2022-07-10 22:47:18,012][26022] Updated weights on worker 0-0, policy_version 924236 (0.00094) [2022-07-10 22:47:20,032][26022] Updated weights on worker 0-0, policy_version 924246 (0.00086) [2022-07-10 22:47:21,756][26022] Updated weights on worker 0-0, policy_version 924256 (0.00091) [2022-07-10 22:47:22,337][25689] Fps is (10 sec: 5510.9, 60 sec: 5541.8, 300 sec: 5532.5). Total num frames: 946441216. Throughput: 0: 5683.8. Samples: 946450236. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:22,337][25689] Avg episode reward: [(0, '0.668')] [2022-07-10 22:47:23,687][26022] Updated weights on worker 0-0, policy_version 924266 (0.00478) [2022-07-10 22:47:25,528][26022] Updated weights on worker 0-0, policy_version 924276 (0.00089) [2022-07-10 22:47:27,202][26022] Updated weights on worker 0-0, policy_version 924286 (0.00088) [2022-07-10 22:47:27,366][25689] Fps is (10 sec: 5610.9, 60 sec: 5526.2, 300 sec: 5529.8). Total num frames: 946469888. Throughput: 0: 5799.5. Samples: 946467188. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:27,367][25689] Avg episode reward: [(0, '0.669')] [2022-07-10 22:47:29,125][26022] Updated weights on worker 0-0, policy_version 924296 (0.00090) [2022-07-10 22:47:30,921][26022] Updated weights on worker 0-0, policy_version 924306 (0.00086) [2022-07-10 22:47:32,409][25689] Fps is (10 sec: 5591.7, 60 sec: 5543.1, 300 sec: 5529.3). Total num frames: 946497536. Throughput: 0: 5812.6. Samples: 946500694. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:32,411][25689] Avg episode reward: [(0, '-0.039')] [2022-07-10 22:47:32,753][26022] Updated weights on worker 0-0, policy_version 924316 (0.00085) [2022-07-10 22:47:34,681][26022] Updated weights on worker 0-0, policy_version 924326 (0.00081) [2022-07-10 22:47:36,241][26022] Updated weights on worker 0-0, policy_version 924336 (0.00086) [2022-07-10 22:47:37,411][25689] Fps is (10 sec: 5504.8, 60 sec: 5526.3, 300 sec: 5529.4). Total num frames: 946525184. Throughput: 0: 5829.9. Samples: 946534220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:37,412][25689] Avg episode reward: [(0, '-0.105')] [2022-07-10 22:47:38,471][26022] Updated weights on worker 0-0, policy_version 924346 (0.00085) [2022-07-10 22:47:39,997][26022] Updated weights on worker 0-0, policy_version 924356 (0.00077) [2022-07-10 22:47:41,927][26022] Updated weights on worker 0-0, policy_version 924366 (0.00088) [2022-07-10 22:47:42,430][25689] Fps is (10 sec: 5517.8, 60 sec: 5542.2, 300 sec: 5529.7). Total num frames: 946552832. Throughput: 0: 5013.2. Samples: 946550968. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:42,431][25689] Avg episode reward: [(0, '0.131')] [2022-07-10 22:47:43,627][26022] Updated weights on worker 0-0, policy_version 924376 (0.00090) [2022-07-10 22:47:45,658][26022] Updated weights on worker 0-0, policy_version 924386 (0.01192) [2022-07-10 22:47:47,447][25689] Fps is (10 sec: 5509.9, 60 sec: 5524.8, 300 sec: 5526.8). Total num frames: 946580480. Throughput: 0: 5845.7. Samples: 946584574. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:47,448][25689] Avg episode reward: [(0, '-0.353')] [2022-07-10 22:47:47,459][26022] Updated weights on worker 0-0, policy_version 924396 (0.00095) [2022-07-10 22:47:49,280][26022] Updated weights on worker 0-0, policy_version 924406 (0.00081) [2022-07-10 22:47:50,955][26022] Updated weights on worker 0-0, policy_version 924416 (0.00091) [2022-07-10 22:47:52,514][25689] Fps is (10 sec: 5585.2, 60 sec: 5525.7, 300 sec: 5525.7). Total num frames: 946609152. Throughput: 0: 5829.8. Samples: 946617904. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:52,515][25689] Avg episode reward: [(0, '-0.667')] [2022-07-10 22:47:52,972][26022] Updated weights on worker 0-0, policy_version 924426 (0.00089) [2022-07-10 22:47:54,710][26022] Updated weights on worker 0-0, policy_version 924436 (0.00090) [2022-07-10 22:47:56,678][26022] Updated weights on worker 0-0, policy_version 924446 (0.00093) [2022-07-10 22:47:57,592][25689] Fps is (10 sec: 5652.4, 60 sec: 5541.2, 300 sec: 5528.6). Total num frames: 946637824. Throughput: 0: 4979.6. Samples: 946634712. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:47:57,593][25689] Avg episode reward: [(0, '-1.877')] [2022-07-10 22:47:58,585][26022] Updated weights on worker 0-0, policy_version 924456 (0.00090) [2022-07-10 22:48:00,115][26022] Updated weights on worker 0-0, policy_version 924466 (0.00086) [2022-07-10 22:48:02,557][26022] Updated weights on worker 0-0, policy_version 924476 (0.00086) [2022-07-10 22:48:02,650][25689] Fps is (10 sec: 5354.5, 60 sec: 5519.5, 300 sec: 5524.5). Total num frames: 946663424. Throughput: 0: 5775.0. Samples: 946667736. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:02,650][25689] Avg episode reward: [(0, '-1.005')] [2022-07-10 22:48:04,216][26022] Updated weights on worker 0-0, policy_version 924486 (0.00088) [2022-07-10 22:48:06,137][26022] Updated weights on worker 0-0, policy_version 924496 (0.00090) [2022-07-10 22:48:07,732][25689] Fps is (10 sec: 5352.2, 60 sec: 5532.6, 300 sec: 5530.6). Total num frames: 946692096. Throughput: 0: 5676.7. Samples: 946699728. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:07,733][25689] Avg episode reward: [(0, '-0.762')] [2022-07-10 22:48:08,221][26022] Updated weights on worker 0-0, policy_version 924506 (0.00090) [2022-07-10 22:48:09,827][26022] Updated weights on worker 0-0, policy_version 924516 (0.00090) [2022-07-10 22:48:11,748][26022] Updated weights on worker 0-0, policy_version 924526 (0.00091) [2022-07-10 22:48:12,786][25689] Fps is (10 sec: 5556.6, 60 sec: 5519.3, 300 sec: 5533.3). Total num frames: 946719744. Throughput: 0: 4869.5. Samples: 946716616. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:12,786][25689] Avg episode reward: [(0, '-0.654')] [2022-07-10 22:48:13,358][26022] Updated weights on worker 0-0, policy_version 924536 (0.00094) [2022-07-10 22:48:15,475][26022] Updated weights on worker 0-0, policy_version 924546 (0.00093) [2022-07-10 22:48:17,122][26022] Updated weights on worker 0-0, policy_version 924556 (0.00088) [2022-07-10 22:48:17,794][25689] Fps is (10 sec: 5699.4, 60 sec: 5554.8, 300 sec: 5533.7). Total num frames: 946749440. Throughput: 0: 5723.2. Samples: 946750330. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:17,795][25689] Avg episode reward: [(0, '-0.298')] [2022-07-10 22:48:19,198][26022] Updated weights on worker 0-0, policy_version 924566 (0.00090) [2022-07-10 22:48:20,759][26022] Updated weights on worker 0-0, policy_version 924576 (0.00085) [2022-07-10 22:48:22,820][25689] Fps is (10 sec: 5408.9, 60 sec: 5502.4, 300 sec: 5523.4). Total num frames: 946774016. Throughput: 0: 5748.3. Samples: 946783678. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:22,820][25689] Avg episode reward: [(0, '0.661')] [2022-07-10 22:48:22,899][26022] Updated weights on worker 0-0, policy_version 924586 (0.00085) [2022-07-10 22:48:24,292][26022] Updated weights on worker 0-0, policy_version 924596 (0.00082) [2022-07-10 22:48:26,519][26022] Updated weights on worker 0-0, policy_version 924606 (0.00090) [2022-07-10 22:48:27,831][25689] Fps is (10 sec: 5509.1, 60 sec: 5537.9, 300 sec: 5534.7). Total num frames: 946804736. Throughput: 0: 5011.3. Samples: 946800448. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:27,831][25689] Avg episode reward: [(0, '1.165')] [2022-07-10 22:48:28,114][26022] Updated weights on worker 0-0, policy_version 924616 (0.00087) [2022-07-10 22:48:30,199][26022] Updated weights on worker 0-0, policy_version 924626 (0.00502) [2022-07-10 22:48:31,940][26022] Updated weights on worker 0-0, policy_version 924636 (0.00090) [2022-07-10 22:48:32,913][25689] Fps is (10 sec: 5782.8, 60 sec: 5534.3, 300 sec: 5530.1). Total num frames: 946832384. Throughput: 0: 5826.1. Samples: 946833880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:32,913][25689] Avg episode reward: [(0, '0.874')] [2022-07-10 22:48:33,708][26022] Updated weights on worker 0-0, policy_version 924646 (0.00086) [2022-07-10 22:48:35,523][26022] Updated weights on worker 0-0, policy_version 924656 (0.00084) [2022-07-10 22:48:37,427][26022] Updated weights on worker 0-0, policy_version 924666 (0.00088) [2022-07-10 22:48:37,936][25689] Fps is (10 sec: 5573.4, 60 sec: 5549.3, 300 sec: 5530.3). Total num frames: 946861056. Throughput: 0: 5834.0. Samples: 946867840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:37,937][25689] Avg episode reward: [(0, '1.006')] [2022-07-10 22:48:38,931][26022] Updated weights on worker 0-0, policy_version 924676 (0.00086) [2022-07-10 22:48:41,198][26022] Updated weights on worker 0-0, policy_version 924686 (0.00087) [2022-07-10 22:48:42,656][26022] Updated weights on worker 0-0, policy_version 924696 (0.00087) [2022-07-10 22:48:42,964][25689] Fps is (10 sec: 5705.0, 60 sec: 5565.4, 300 sec: 5540.3). Total num frames: 946889728. Throughput: 0: 5008.9. Samples: 946884582. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:42,965][25689] Avg episode reward: [(0, '1.079')] [2022-07-10 22:48:44,723][26022] Updated weights on worker 0-0, policy_version 924706 (0.00079) [2022-07-10 22:48:46,586][26022] Updated weights on worker 0-0, policy_version 924716 (0.00088) [2022-07-10 22:48:47,970][25689] Fps is (10 sec: 5612.8, 60 sec: 5566.4, 300 sec: 5535.2). Total num frames: 946917376. Throughput: 0: 5847.9. Samples: 946918220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:47,971][25689] Avg episode reward: [(0, '1.026')] [2022-07-10 22:48:48,192][26022] Updated weights on worker 0-0, policy_version 924726 (0.00089) [2022-07-10 22:48:50,263][26022] Updated weights on worker 0-0, policy_version 924736 (0.00092) [2022-07-10 22:48:52,001][26022] Updated weights on worker 0-0, policy_version 924746 (0.00084) [2022-07-10 22:48:53,092][25689] Fps is (10 sec: 5358.7, 60 sec: 5527.5, 300 sec: 5530.5). Total num frames: 946944000. Throughput: 0: 5819.1. Samples: 946951306. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:53,093][25689] Avg episode reward: [(0, '1.140')] [2022-07-10 22:48:54,032][26022] Updated weights on worker 0-0, policy_version 924756 (0.00087) [2022-07-10 22:48:55,920][26022] Updated weights on worker 0-0, policy_version 924766 (0.00088) [2022-07-10 22:48:57,477][26022] Updated weights on worker 0-0, policy_version 924776 (0.00085) [2022-07-10 22:48:58,131][25689] Fps is (10 sec: 5542.8, 60 sec: 5548.0, 300 sec: 5541.7). Total num frames: 946973696. Throughput: 0: 4965.2. Samples: 946968112. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:48:58,132][25689] Avg episode reward: [(0, '0.810')] [2022-07-10 22:48:59,646][26022] Updated weights on worker 0-0, policy_version 924786 (0.00088) [2022-07-10 22:49:01,053][26022] Updated weights on worker 0-0, policy_version 924796 (0.00083) [2022-07-10 22:49:03,174][25689] Fps is (10 sec: 5383.3, 60 sec: 5532.5, 300 sec: 5530.6). Total num frames: 946998272. Throughput: 0: 5806.1. Samples: 947001920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:49:03,176][25689] Avg episode reward: [(0, '1.154')] [2022-07-10 22:49:03,495][26022] Updated weights on worker 0-0, policy_version 924806 (0.00091) [2022-07-10 22:49:05,423][26022] Updated weights on worker 0-0, policy_version 924816 (0.00085) [2022-07-10 22:49:06,926][26022] Updated weights on worker 0-0, policy_version 924826 (0.00478) [2022-07-10 22:49:08,220][25689] Fps is (10 sec: 5379.3, 60 sec: 5552.7, 300 sec: 5539.1). Total num frames: 947027968. Throughput: 0: 5699.5. Samples: 947033634. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 22:49:08,222][25689] Avg episode reward: [(0, '0.409')] [2022-07-10 22:49:08,875][26022] Updated weights on worker 0-0, policy_version 924836 (0.00095) [2022-07-10 22:49:10,754][26022] Updated weights on worker 0-0, policy_version 924846 (0.00098) [2022-07-10 22:49:12,455][26022] Updated weights on worker 0-0, policy_version 924856 (0.00092) [2022-07-10 22:49:12,717][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:49:12,730][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000924857_947053568.pth [2022-07-10 22:49:12,730][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000922910_945059840.pth [2022-07-10 22:49:13,371][25689] Fps is (10 sec: 5724.3, 60 sec: 5560.7, 300 sec: 5537.3). Total num frames: 947056640. Throughput: 0: 4884.5. Samples: 947050354. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:13,371][25689] Avg episode reward: [(0, '0.138')] [2022-07-10 22:49:14,606][26022] Updated weights on worker 0-0, policy_version 924866 (0.00081) [2022-07-10 22:49:16,039][26022] Updated weights on worker 0-0, policy_version 924876 (0.00095) [2022-07-10 22:49:18,129][26022] Updated weights on worker 0-0, policy_version 924886 (0.00081) [2022-07-10 22:49:18,401][25689] Fps is (10 sec: 5632.8, 60 sec: 5541.8, 300 sec: 5543.9). Total num frames: 947085312. Throughput: 0: 5709.3. Samples: 947083836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:18,401][25689] Avg episode reward: [(0, '0.428')] [2022-07-10 22:49:19,904][26022] Updated weights on worker 0-0, policy_version 924896 (0.00086) [2022-07-10 22:49:21,808][26022] Updated weights on worker 0-0, policy_version 924906 (0.00083) [2022-07-10 22:49:23,414][25689] Fps is (10 sec: 5505.6, 60 sec: 5576.7, 300 sec: 5530.2). Total num frames: 947111936. Throughput: 0: 5698.2. Samples: 947117254. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:23,415][25689] Avg episode reward: [(0, '-0.695')] [2022-07-10 22:49:23,623][26022] Updated weights on worker 0-0, policy_version 924916 (0.00084) [2022-07-10 22:49:25,308][26022] Updated weights on worker 0-0, policy_version 924926 (0.00089) [2022-07-10 22:49:27,296][26022] Updated weights on worker 0-0, policy_version 924936 (0.00085) [2022-07-10 22:49:28,437][25689] Fps is (10 sec: 5509.4, 60 sec: 5541.9, 300 sec: 5541.0). Total num frames: 947140608. Throughput: 0: 4967.0. Samples: 947134052. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:28,438][25689] Avg episode reward: [(0, '-0.430')] [2022-07-10 22:49:29,211][26022] Updated weights on worker 0-0, policy_version 924946 (0.00086) [2022-07-10 22:49:30,948][26022] Updated weights on worker 0-0, policy_version 924956 (0.00085) [2022-07-10 22:49:32,947][26022] Updated weights on worker 0-0, policy_version 924966 (0.00093) [2022-07-10 22:49:33,546][25689] Fps is (10 sec: 5659.7, 60 sec: 5556.3, 300 sec: 5532.4). Total num frames: 947169280. Throughput: 0: 5793.0. Samples: 947167232. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:33,547][25689] Avg episode reward: [(0, '-1.125')] [2022-07-10 22:49:34,510][26022] Updated weights on worker 0-0, policy_version 924976 (0.00095) [2022-07-10 22:49:36,423][26022] Updated weights on worker 0-0, policy_version 924986 (0.00084) [2022-07-10 22:49:38,349][26022] Updated weights on worker 0-0, policy_version 924996 (0.00094) [2022-07-10 22:49:38,595][25689] Fps is (10 sec: 5544.3, 60 sec: 5537.0, 300 sec: 5538.4). Total num frames: 947196928. Throughput: 0: 5788.1. Samples: 947200726. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:38,596][25689] Avg episode reward: [(0, '-0.588')] [2022-07-10 22:49:40,298][26022] Updated weights on worker 0-0, policy_version 925006 (0.00091) [2022-07-10 22:49:41,909][26022] Updated weights on worker 0-0, policy_version 925016 (0.00081) [2022-07-10 22:49:43,653][25689] Fps is (10 sec: 5471.2, 60 sec: 5517.5, 300 sec: 5530.8). Total num frames: 947224576. Throughput: 0: 5770.0. Samples: 947234032. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:43,655][25689] Avg episode reward: [(0, '-0.350')] [2022-07-10 22:49:43,988][26022] Updated weights on worker 0-0, policy_version 925026 (0.00089) [2022-07-10 22:49:45,737][26022] Updated weights on worker 0-0, policy_version 925036 (0.00088) [2022-07-10 22:49:47,567][26022] Updated weights on worker 0-0, policy_version 925046 (0.00082) [2022-07-10 22:49:48,705][25689] Fps is (10 sec: 5570.9, 60 sec: 5530.1, 300 sec: 5535.3). Total num frames: 947253248. Throughput: 0: 5767.6. Samples: 947250948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:48,706][25689] Avg episode reward: [(0, '0.057')] [2022-07-10 22:49:49,462][26022] Updated weights on worker 0-0, policy_version 925056 (0.00087) [2022-07-10 22:49:51,146][26022] Updated weights on worker 0-0, policy_version 925066 (0.00093) [2022-07-10 22:49:53,178][26022] Updated weights on worker 0-0, policy_version 925076 (0.00088) [2022-07-10 22:49:53,796][25689] Fps is (10 sec: 5552.7, 60 sec: 5549.9, 300 sec: 5535.3). Total num frames: 947280896. Throughput: 0: 5776.4. Samples: 947284202. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:53,796][25689] Avg episode reward: [(0, '0.356')] [2022-07-10 22:49:54,824][26022] Updated weights on worker 0-0, policy_version 925086 (0.00086) [2022-07-10 22:49:56,813][26022] Updated weights on worker 0-0, policy_version 925096 (0.00086) [2022-07-10 22:49:58,663][26022] Updated weights on worker 0-0, policy_version 925106 (0.00091) [2022-07-10 22:49:58,805][25689] Fps is (10 sec: 5474.7, 60 sec: 5518.8, 300 sec: 5532.4). Total num frames: 947308544. Throughput: 0: 5795.5. Samples: 947317852. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:49:58,806][25689] Avg episode reward: [(0, '0.613')] [2022-07-10 22:50:00,243][26022] Updated weights on worker 0-0, policy_version 925116 (0.00093) [2022-07-10 22:50:02,639][26022] Updated weights on worker 0-0, policy_version 925126 (0.00092) [2022-07-10 22:50:03,843][25689] Fps is (10 sec: 5401.9, 60 sec: 5553.0, 300 sec: 5539.5). Total num frames: 947335168. Throughput: 0: 4986.5. Samples: 947334708. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:03,843][25689] Avg episode reward: [(0, '0.966')] [2022-07-10 22:50:04,340][26022] Updated weights on worker 0-0, policy_version 925136 (0.00089) [2022-07-10 22:50:06,307][26022] Updated weights on worker 0-0, policy_version 925146 (0.00089) [2022-07-10 22:50:08,114][26022] Updated weights on worker 0-0, policy_version 925156 (0.00061) [2022-07-10 22:50:08,853][25689] Fps is (10 sec: 5401.4, 60 sec: 5522.5, 300 sec: 5534.2). Total num frames: 947362816. Throughput: 0: 5723.3. Samples: 947366262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:08,854][25689] Avg episode reward: [(0, '0.574')] [2022-07-10 22:50:09,877][26022] Updated weights on worker 0-0, policy_version 925166 (0.00086) [2022-07-10 22:50:11,690][26022] Updated weights on worker 0-0, policy_version 925176 (0.00105) [2022-07-10 22:50:13,543][26022] Updated weights on worker 0-0, policy_version 925186 (0.00089) [2022-07-10 22:50:13,977][25689] Fps is (10 sec: 5557.5, 60 sec: 5525.0, 300 sec: 5536.4). Total num frames: 947391488. Throughput: 0: 5717.9. Samples: 947399594. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:13,977][25689] Avg episode reward: [(0, '0.544')] [2022-07-10 22:50:15,391][26022] Updated weights on worker 0-0, policy_version 925196 (0.00087) [2022-07-10 22:50:17,304][26022] Updated weights on worker 0-0, policy_version 925206 (0.00081) [2022-07-10 22:50:18,952][26022] Updated weights on worker 0-0, policy_version 925216 (0.00096) [2022-07-10 22:50:19,043][25689] Fps is (10 sec: 5728.3, 60 sec: 5538.6, 300 sec: 5545.7). Total num frames: 947421184. Throughput: 0: 4873.7. Samples: 947416482. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:19,043][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 22:50:21,031][26022] Updated weights on worker 0-0, policy_version 925226 (0.00087) [2022-07-10 22:50:22,853][26022] Updated weights on worker 0-0, policy_version 925236 (0.00089) [2022-07-10 22:50:24,048][25689] Fps is (10 sec: 5490.3, 60 sec: 5522.4, 300 sec: 5532.1). Total num frames: 947446784. Throughput: 0: 5692.4. Samples: 947449726. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:24,049][25689] Avg episode reward: [(0, '0.543')] [2022-07-10 22:50:24,707][26022] Updated weights on worker 0-0, policy_version 925246 (0.00085) [2022-07-10 22:50:26,606][26022] Updated weights on worker 0-0, policy_version 925256 (0.00090) [2022-07-10 22:50:28,319][26022] Updated weights on worker 0-0, policy_version 925266 (0.00095) [2022-07-10 22:50:29,059][25689] Fps is (10 sec: 5418.5, 60 sec: 5523.6, 300 sec: 5543.4). Total num frames: 947475456. Throughput: 0: 5769.2. Samples: 947482832. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:29,059][25689] Avg episode reward: [(0, '0.346')] [2022-07-10 22:50:30,130][26022] Updated weights on worker 0-0, policy_version 925276 (0.00094) [2022-07-10 22:50:32,035][26022] Updated weights on worker 0-0, policy_version 925286 (0.00086) [2022-07-10 22:50:33,903][26022] Updated weights on worker 0-0, policy_version 925296 (0.00093) [2022-07-10 22:50:34,118][25689] Fps is (10 sec: 5592.7, 60 sec: 5511.2, 300 sec: 5532.4). Total num frames: 947503104. Throughput: 0: 4961.4. Samples: 947499526. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:34,119][25689] Avg episode reward: [(0, '-0.127')] [2022-07-10 22:50:35,771][26022] Updated weights on worker 0-0, policy_version 925306 (0.00090) [2022-07-10 22:50:37,625][26022] Updated weights on worker 0-0, policy_version 925316 (0.00084) [2022-07-10 22:50:39,130][25689] Fps is (10 sec: 5490.2, 60 sec: 5514.6, 300 sec: 5536.1). Total num frames: 947530752. Throughput: 0: 5803.9. Samples: 947533070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:39,131][25689] Avg episode reward: [(0, '0.093')] [2022-07-10 22:50:39,597][26022] Updated weights on worker 0-0, policy_version 925326 (0.00088) [2022-07-10 22:50:41,295][26022] Updated weights on worker 0-0, policy_version 925336 (0.00083) [2022-07-10 22:50:43,235][26022] Updated weights on worker 0-0, policy_version 925346 (0.00083) [2022-07-10 22:50:44,137][25689] Fps is (10 sec: 5519.6, 60 sec: 5519.2, 300 sec: 5537.7). Total num frames: 947558400. Throughput: 0: 5807.9. Samples: 947566396. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:44,137][25689] Avg episode reward: [(0, '0.189')] [2022-07-10 22:50:45,023][26022] Updated weights on worker 0-0, policy_version 925356 (0.00086) [2022-07-10 22:50:47,099][26022] Updated weights on worker 0-0, policy_version 925366 (0.00092) [2022-07-10 22:50:48,630][26022] Updated weights on worker 0-0, policy_version 925376 (0.00091) [2022-07-10 22:50:49,161][25689] Fps is (10 sec: 5716.8, 60 sec: 5538.7, 300 sec: 5538.6). Total num frames: 947588096. Throughput: 0: 4994.3. Samples: 947583230. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:49,161][25689] Avg episode reward: [(0, '0.437')] [2022-07-10 22:50:50,635][26022] Updated weights on worker 0-0, policy_version 925386 (0.00090) [2022-07-10 22:50:52,146][26022] Updated weights on worker 0-0, policy_version 925396 (0.00091) [2022-07-10 22:50:54,272][25689] Fps is (10 sec: 5556.8, 60 sec: 5519.9, 300 sec: 5536.8). Total num frames: 947614720. Throughput: 0: 5819.2. Samples: 947616804. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:54,272][25689] Avg episode reward: [(0, '0.070')] [2022-07-10 22:50:54,295][26022] Updated weights on worker 0-0, policy_version 925406 (0.00093) [2022-07-10 22:50:55,930][26022] Updated weights on worker 0-0, policy_version 925416 (0.00091) [2022-07-10 22:50:58,008][26022] Updated weights on worker 0-0, policy_version 925426 (0.00094) [2022-07-10 22:50:59,282][25689] Fps is (10 sec: 5564.7, 60 sec: 5553.8, 300 sec: 5545.1). Total num frames: 947644416. Throughput: 0: 5810.7. Samples: 947650166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:50:59,282][25689] Avg episode reward: [(0, '-0.363')] [2022-07-10 22:50:59,787][26022] Updated weights on worker 0-0, policy_version 925436 (0.00088) [2022-07-10 22:51:02,033][26022] Updated weights on worker 0-0, policy_version 925446 (0.00102) [2022-07-10 22:51:03,595][26022] Updated weights on worker 0-0, policy_version 925456 (0.00087) [2022-07-10 22:51:04,305][25689] Fps is (10 sec: 5511.2, 60 sec: 5538.1, 300 sec: 5537.9). Total num frames: 947670016. Throughput: 0: 4937.9. Samples: 947665988. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:04,306][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 22:51:05,567][26022] Updated weights on worker 0-0, policy_version 925466 (0.00085) [2022-07-10 22:51:07,365][26022] Updated weights on worker 0-0, policy_version 925476 (0.00084) [2022-07-10 22:51:09,321][25689] Fps is (10 sec: 5202.2, 60 sec: 5520.7, 300 sec: 5532.0). Total num frames: 947696640. Throughput: 0: 5723.9. Samples: 947698622. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:09,321][25689] Avg episode reward: [(0, '0.252')] [2022-07-10 22:51:09,347][26022] Updated weights on worker 0-0, policy_version 925486 (0.00080) [2022-07-10 22:51:10,961][26022] Updated weights on worker 0-0, policy_version 925496 (0.00089) [2022-07-10 22:51:12,754][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:51:12,766][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000925505_947717120.pth [2022-07-10 22:51:12,766][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000923556_945721344.pth [2022-07-10 22:51:12,876][26022] Updated weights on worker 0-0, policy_version 925506 (0.00084) [2022-07-10 22:51:14,377][25689] Fps is (10 sec: 5591.7, 60 sec: 5543.8, 300 sec: 5535.4). Total num frames: 947726336. Throughput: 0: 5748.4. Samples: 947732378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:14,378][25689] Avg episode reward: [(0, '0.357')] [2022-07-10 22:51:14,639][26022] Updated weights on worker 0-0, policy_version 925516 (0.00424) [2022-07-10 22:51:16,505][26022] Updated weights on worker 0-0, policy_version 925526 (0.00081) [2022-07-10 22:51:18,279][26022] Updated weights on worker 0-0, policy_version 925536 (0.00094) [2022-07-10 22:51:19,449][25689] Fps is (10 sec: 5661.8, 60 sec: 5509.4, 300 sec: 5537.6). Total num frames: 947753984. Throughput: 0: 4909.2. Samples: 947749170. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:19,449][25689] Avg episode reward: [(0, '-0.450')] [2022-07-10 22:51:20,138][26022] Updated weights on worker 0-0, policy_version 925546 (0.00082) [2022-07-10 22:51:22,075][26022] Updated weights on worker 0-0, policy_version 925556 (0.00072) [2022-07-10 22:51:23,738][26022] Updated weights on worker 0-0, policy_version 925566 (0.00093) [2022-07-10 22:51:24,464][25689] Fps is (10 sec: 5482.1, 60 sec: 5542.4, 300 sec: 5531.2). Total num frames: 947781632. Throughput: 0: 5790.3. Samples: 947782712. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:24,464][25689] Avg episode reward: [(0, '-0.451')] [2022-07-10 22:51:25,828][26022] Updated weights on worker 0-0, policy_version 925576 (0.00092) [2022-07-10 22:51:27,559][26022] Updated weights on worker 0-0, policy_version 925586 (0.00084) [2022-07-10 22:51:29,384][26022] Updated weights on worker 0-0, policy_version 925596 (0.00096) [2022-07-10 22:51:29,526][25689] Fps is (10 sec: 5791.9, 60 sec: 5571.5, 300 sec: 5544.6). Total num frames: 947812352. Throughput: 0: 5812.1. Samples: 947816058. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:29,527][25689] Avg episode reward: [(0, '-0.694')] [2022-07-10 22:51:31,594][26022] Updated weights on worker 0-0, policy_version 925606 (0.00089) [2022-07-10 22:51:32,887][26022] Updated weights on worker 0-0, policy_version 925616 (0.00084) [2022-07-10 22:51:34,616][25689] Fps is (10 sec: 5547.7, 60 sec: 5534.9, 300 sec: 5532.7). Total num frames: 947837952. Throughput: 0: 4960.3. Samples: 947832768. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:34,618][25689] Avg episode reward: [(0, '-0.495')] [2022-07-10 22:51:35,197][26022] Updated weights on worker 0-0, policy_version 925627 (0.00085) [2022-07-10 22:51:36,703][26022] Updated weights on worker 0-0, policy_version 925637 (0.00086) [2022-07-10 22:51:38,855][26022] Updated weights on worker 0-0, policy_version 925647 (0.00109) [2022-07-10 22:51:39,639][25689] Fps is (10 sec: 5467.7, 60 sec: 5567.7, 300 sec: 5542.7). Total num frames: 947867648. Throughput: 0: 5807.8. Samples: 947866432. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:39,640][25689] Avg episode reward: [(0, '-0.488')] [2022-07-10 22:51:40,500][26022] Updated weights on worker 0-0, policy_version 925657 (0.00085) [2022-07-10 22:51:42,485][26022] Updated weights on worker 0-0, policy_version 925667 (0.00085) [2022-07-10 22:51:44,211][26022] Updated weights on worker 0-0, policy_version 925677 (0.00089) [2022-07-10 22:51:44,683][25689] Fps is (10 sec: 5594.7, 60 sec: 5547.4, 300 sec: 5535.3). Total num frames: 947894272. Throughput: 0: 5795.3. Samples: 947899886. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:44,683][25689] Avg episode reward: [(0, '-1.994')] [2022-07-10 22:51:46,076][26022] Updated weights on worker 0-0, policy_version 925687 (0.00088) [2022-07-10 22:51:47,884][26022] Updated weights on worker 0-0, policy_version 925697 (0.00081) [2022-07-10 22:51:49,696][25689] Fps is (10 sec: 5498.7, 60 sec: 5531.5, 300 sec: 5536.5). Total num frames: 947922944. Throughput: 0: 5005.2. Samples: 947917010. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:49,696][25689] Avg episode reward: [(0, '-2.181')] [2022-07-10 22:51:49,740][26022] Updated weights on worker 0-0, policy_version 925707 (0.00087) [2022-07-10 22:51:51,505][26022] Updated weights on worker 0-0, policy_version 925717 (0.00090) [2022-07-10 22:51:53,434][26022] Updated weights on worker 0-0, policy_version 925727 (0.00084) [2022-07-10 22:51:54,746][25689] Fps is (10 sec: 5698.0, 60 sec: 5570.9, 300 sec: 5540.1). Total num frames: 947951616. Throughput: 0: 5872.8. Samples: 947950990. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:54,747][25689] Avg episode reward: [(0, '-2.068')] [2022-07-10 22:51:55,078][26022] Updated weights on worker 0-0, policy_version 925737 (0.00086) [2022-07-10 22:51:57,068][26022] Updated weights on worker 0-0, policy_version 925747 (0.00093) [2022-07-10 22:51:58,875][26022] Updated weights on worker 0-0, policy_version 925757 (0.00093) [2022-07-10 22:51:59,754][25689] Fps is (10 sec: 5701.1, 60 sec: 5554.1, 300 sec: 5546.9). Total num frames: 947980288. Throughput: 0: 5883.2. Samples: 947984772. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:51:59,755][25689] Avg episode reward: [(0, '-1.533')] [2022-07-10 22:52:00,658][26022] Updated weights on worker 0-0, policy_version 925767 (0.00085) [2022-07-10 22:52:02,734][26022] Updated weights on worker 0-0, policy_version 925777 (0.00089) [2022-07-10 22:52:04,725][26022] Updated weights on worker 0-0, policy_version 925787 (0.00087) [2022-07-10 22:52:04,781][25689] Fps is (10 sec: 5408.3, 60 sec: 5553.8, 300 sec: 5540.3). Total num frames: 948005888. Throughput: 0: 4966.9. Samples: 947999714. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:04,782][25689] Avg episode reward: [(0, '-2.285')] [2022-07-10 22:52:06,294][26022] Updated weights on worker 0-0, policy_version 925797 (0.00086) [2022-07-10 22:52:08,455][26022] Updated weights on worker 0-0, policy_version 925807 (0.00089) [2022-07-10 22:52:09,807][25689] Fps is (10 sec: 5500.7, 60 sec: 5603.7, 300 sec: 5545.0). Total num frames: 948035584. Throughput: 0: 5788.9. Samples: 948033432. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:09,807][25689] Avg episode reward: [(0, '-2.120')] [2022-07-10 22:52:10,060][26022] Updated weights on worker 0-0, policy_version 925817 (0.00095) [2022-07-10 22:52:12,059][26022] Updated weights on worker 0-0, policy_version 925827 (0.01374) [2022-07-10 22:52:13,611][26022] Updated weights on worker 0-0, policy_version 925837 (0.00109) [2022-07-10 22:52:14,934][25689] Fps is (10 sec: 5547.5, 60 sec: 5546.4, 300 sec: 5539.7). Total num frames: 948062208. Throughput: 0: 5749.0. Samples: 948067046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:14,934][25689] Avg episode reward: [(0, '-0.720')] [2022-07-10 22:52:15,770][26022] Updated weights on worker 0-0, policy_version 925847 (0.00091) [2022-07-10 22:52:17,478][26022] Updated weights on worker 0-0, policy_version 925857 (0.00090) [2022-07-10 22:52:19,234][26022] Updated weights on worker 0-0, policy_version 925867 (0.00085) [2022-07-10 22:52:19,951][25689] Fps is (10 sec: 5551.9, 60 sec: 5585.3, 300 sec: 5546.4). Total num frames: 948091904. Throughput: 0: 5748.8. Samples: 948100880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:19,951][25689] Avg episode reward: [(0, '-0.033')] [2022-07-10 22:52:20,946][26022] Updated weights on worker 0-0, policy_version 925877 (0.00086) [2022-07-10 22:52:22,940][26022] Updated weights on worker 0-0, policy_version 925887 (0.00103) [2022-07-10 22:52:24,723][26022] Updated weights on worker 0-0, policy_version 925897 (0.00090) [2022-07-10 22:52:24,975][25689] Fps is (10 sec: 5609.0, 60 sec: 5567.5, 300 sec: 5539.6). Total num frames: 948118528. Throughput: 0: 5852.9. Samples: 948117904. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:24,975][25689] Avg episode reward: [(0, '0.151')] [2022-07-10 22:52:26,623][26022] Updated weights on worker 0-0, policy_version 925907 (0.00082) [2022-07-10 22:52:28,523][26022] Updated weights on worker 0-0, policy_version 925917 (0.00096) [2022-07-10 22:52:29,983][25689] Fps is (10 sec: 5511.7, 60 sec: 5538.6, 300 sec: 5543.7). Total num frames: 948147200. Throughput: 0: 5840.7. Samples: 948151278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:29,984][25689] Avg episode reward: [(0, '-0.978')] [2022-07-10 22:52:30,087][26022] Updated weights on worker 0-0, policy_version 925927 (0.00084) [2022-07-10 22:52:32,087][26022] Updated weights on worker 0-0, policy_version 925937 (0.00092) [2022-07-10 22:52:33,764][26022] Updated weights on worker 0-0, policy_version 925947 (0.00090) [2022-07-10 22:52:35,119][25689] Fps is (10 sec: 5653.0, 60 sec: 5585.2, 300 sec: 5544.7). Total num frames: 948175872. Throughput: 0: 5838.6. Samples: 948184898. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:35,119][25689] Avg episode reward: [(0, '-1.293')] [2022-07-10 22:52:35,737][26022] Updated weights on worker 0-0, policy_version 925957 (0.00091) [2022-07-10 22:52:37,418][26022] Updated weights on worker 0-0, policy_version 925967 (0.00086) [2022-07-10 22:52:39,335][26022] Updated weights on worker 0-0, policy_version 925977 (0.00083) [2022-07-10 22:52:40,148][25689] Fps is (10 sec: 5641.4, 60 sec: 5567.7, 300 sec: 5547.9). Total num frames: 948204544. Throughput: 0: 4998.1. Samples: 948201828. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:40,150][25689] Avg episode reward: [(0, '-1.181')] [2022-07-10 22:52:41,255][26022] Updated weights on worker 0-0, policy_version 925987 (0.00066) [2022-07-10 22:52:43,040][26022] Updated weights on worker 0-0, policy_version 925997 (0.00090) [2022-07-10 22:52:44,804][26022] Updated weights on worker 0-0, policy_version 926007 (0.00093) [2022-07-10 22:52:45,153][25689] Fps is (10 sec: 5714.7, 60 sec: 5605.1, 300 sec: 5551.6). Total num frames: 948233216. Throughput: 0: 5824.2. Samples: 948235428. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:45,154][25689] Avg episode reward: [(0, '-1.629')] [2022-07-10 22:52:46,770][26022] Updated weights on worker 0-0, policy_version 926017 (0.00090) [2022-07-10 22:52:48,327][26022] Updated weights on worker 0-0, policy_version 926027 (0.00087) [2022-07-10 22:52:50,163][25689] Fps is (10 sec: 5521.3, 60 sec: 5571.5, 300 sec: 5545.7). Total num frames: 948259840. Throughput: 0: 5844.2. Samples: 948269214. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:50,164][25689] Avg episode reward: [(0, '-1.473')] [2022-07-10 22:52:50,544][26022] Updated weights on worker 0-0, policy_version 926037 (0.00091) [2022-07-10 22:52:52,140][26022] Updated weights on worker 0-0, policy_version 926047 (0.00096) [2022-07-10 22:52:54,127][26022] Updated weights on worker 0-0, policy_version 926057 (0.00098) [2022-07-10 22:52:55,227][25689] Fps is (10 sec: 5489.3, 60 sec: 5570.3, 300 sec: 5546.0). Total num frames: 948288512. Throughput: 0: 5035.7. Samples: 948286156. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:52:55,227][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 22:52:55,827][26022] Updated weights on worker 0-0, policy_version 926067 (0.00087) [2022-07-10 22:52:57,641][26022] Updated weights on worker 0-0, policy_version 926077 (0.00090) [2022-07-10 22:52:59,484][26022] Updated weights on worker 0-0, policy_version 926087 (0.00087) [2022-07-10 22:53:00,246][25689] Fps is (10 sec: 5585.9, 60 sec: 5552.4, 300 sec: 5553.6). Total num frames: 948316160. Throughput: 0: 5876.5. Samples: 948319932. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:53:00,246][25689] Avg episode reward: [(0, '0.477')] [2022-07-10 22:53:01,308][26022] Updated weights on worker 0-0, policy_version 926097 (0.00098) [2022-07-10 22:53:03,498][26022] Updated weights on worker 0-0, policy_version 926107 (0.00085) [2022-07-10 22:53:05,271][25689] Fps is (10 sec: 5403.4, 60 sec: 5569.5, 300 sec: 5547.8). Total num frames: 948342784. Throughput: 0: 5761.3. Samples: 948351330. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:05,271][25689] Avg episode reward: [(0, '0.523')] [2022-07-10 22:53:05,313][26022] Updated weights on worker 0-0, policy_version 926117 (0.00084) [2022-07-10 22:53:07,070][26022] Updated weights on worker 0-0, policy_version 926127 (0.00087) [2022-07-10 22:53:09,005][26022] Updated weights on worker 0-0, policy_version 926137 (0.00085) [2022-07-10 22:53:10,300][25689] Fps is (10 sec: 5601.6, 60 sec: 5569.1, 300 sec: 5555.1). Total num frames: 948372480. Throughput: 0: 4915.5. Samples: 948368196. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:10,301][25689] Avg episode reward: [(0, '0.120')] [2022-07-10 22:53:10,852][26022] Updated weights on worker 0-0, policy_version 926147 (0.00090) [2022-07-10 22:53:12,705][26022] Updated weights on worker 0-0, policy_version 926157 (0.00083) [2022-07-10 22:53:13,141][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:53:13,154][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000926158_948385792.pth [2022-07-10 22:53:13,155][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000924207_946387968.pth [2022-07-10 22:53:14,547][26022] Updated weights on worker 0-0, policy_version 926167 (0.00091) [2022-07-10 22:53:15,351][25689] Fps is (10 sec: 5587.1, 60 sec: 5576.1, 300 sec: 5544.0). Total num frames: 948399104. Throughput: 0: 5733.4. Samples: 948401538. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:15,352][25689] Avg episode reward: [(0, '0.008')] [2022-07-10 22:53:16,442][26022] Updated weights on worker 0-0, policy_version 926177 (0.00089) [2022-07-10 22:53:18,111][26022] Updated weights on worker 0-0, policy_version 926187 (0.00089) [2022-07-10 22:53:20,019][26022] Updated weights on worker 0-0, policy_version 926197 (0.00087) [2022-07-10 22:53:20,370][25689] Fps is (10 sec: 5491.1, 60 sec: 5559.0, 300 sec: 5557.9). Total num frames: 948427776. Throughput: 0: 5735.9. Samples: 948435364. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:20,371][25689] Avg episode reward: [(0, '-0.396')] [2022-07-10 22:53:21,860][26022] Updated weights on worker 0-0, policy_version 926207 (0.00085) [2022-07-10 22:53:23,470][26022] Updated weights on worker 0-0, policy_version 926217 (0.00097) [2022-07-10 22:53:25,438][25689] Fps is (10 sec: 5583.6, 60 sec: 5571.9, 300 sec: 5546.5). Total num frames: 948455424. Throughput: 0: 5004.3. Samples: 948452252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:25,438][25689] Avg episode reward: [(0, '-0.433')] [2022-07-10 22:53:25,518][26022] Updated weights on worker 0-0, policy_version 926227 (0.00090) [2022-07-10 22:53:27,379][26022] Updated weights on worker 0-0, policy_version 926237 (0.00085) [2022-07-10 22:53:29,044][26022] Updated weights on worker 0-0, policy_version 926247 (0.00091) [2022-07-10 22:53:30,447][25689] Fps is (10 sec: 5487.7, 60 sec: 5555.0, 300 sec: 5547.9). Total num frames: 948483072. Throughput: 0: 5806.4. Samples: 948485174. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:30,447][25689] Avg episode reward: [(0, '-0.188')] [2022-07-10 22:53:31,365][26022] Updated weights on worker 0-0, policy_version 926257 (0.00082) [2022-07-10 22:53:32,707][26022] Updated weights on worker 0-0, policy_version 926267 (0.00096) [2022-07-10 22:53:34,886][26022] Updated weights on worker 0-0, policy_version 926277 (0.00092) [2022-07-10 22:53:35,516][25689] Fps is (10 sec: 5588.3, 60 sec: 5561.0, 300 sec: 5547.0). Total num frames: 948511744. Throughput: 0: 5806.5. Samples: 948518626. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:35,517][25689] Avg episode reward: [(0, '0.287')] [2022-07-10 22:53:36,751][26022] Updated weights on worker 0-0, policy_version 926287 (0.00086) [2022-07-10 22:53:38,317][26022] Updated weights on worker 0-0, policy_version 926297 (0.00101) [2022-07-10 22:53:40,538][25689] Fps is (10 sec: 5377.9, 60 sec: 5510.8, 300 sec: 5536.8). Total num frames: 948537344. Throughput: 0: 4958.0. Samples: 948535358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:40,539][25689] Avg episode reward: [(0, '-0.037')] [2022-07-10 22:53:40,545][26022] Updated weights on worker 0-0, policy_version 926307 (0.00086) [2022-07-10 22:53:41,786][26022] Updated weights on worker 0-0, policy_version 926317 (0.00091) [2022-07-10 22:53:44,131][26022] Updated weights on worker 0-0, policy_version 926327 (0.00088) [2022-07-10 22:53:45,544][25689] Fps is (10 sec: 5616.4, 60 sec: 5544.7, 300 sec: 5547.2). Total num frames: 948568064. Throughput: 0: 5809.4. Samples: 948569056. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:45,544][25689] Avg episode reward: [(0, '0.983')] [2022-07-10 22:53:45,562][26022] Updated weights on worker 0-0, policy_version 926337 (0.00091) [2022-07-10 22:53:47,673][26022] Updated weights on worker 0-0, policy_version 926347 (0.00091) [2022-07-10 22:53:49,599][26022] Updated weights on worker 0-0, policy_version 926357 (0.00084) [2022-07-10 22:53:50,554][25689] Fps is (10 sec: 5725.1, 60 sec: 5544.7, 300 sec: 5549.2). Total num frames: 948594688. Throughput: 0: 5847.4. Samples: 948602754. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:50,555][25689] Avg episode reward: [(0, '0.806')] [2022-07-10 22:53:51,110][26022] Updated weights on worker 0-0, policy_version 926367 (0.00106) [2022-07-10 22:53:53,169][26022] Updated weights on worker 0-0, policy_version 926377 (0.00086) [2022-07-10 22:53:54,978][26022] Updated weights on worker 0-0, policy_version 926387 (0.00084) [2022-07-10 22:53:55,622][25689] Fps is (10 sec: 5486.6, 60 sec: 5544.3, 300 sec: 5545.3). Total num frames: 948623360. Throughput: 0: 4995.6. Samples: 948619068. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:53:55,623][25689] Avg episode reward: [(0, '0.493')] [2022-07-10 22:53:56,824][26022] Updated weights on worker 0-0, policy_version 926397 (0.00103) [2022-07-10 22:53:58,720][26022] Updated weights on worker 0-0, policy_version 926407 (0.00083) [2022-07-10 22:54:00,519][26022] Updated weights on worker 0-0, policy_version 926417 (0.00097) [2022-07-10 22:54:00,664][25689] Fps is (10 sec: 5571.1, 60 sec: 5542.2, 300 sec: 5555.6). Total num frames: 948651008. Throughput: 0: 5820.1. Samples: 948652490. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:00,664][25689] Avg episode reward: [(0, '-0.173')] [2022-07-10 22:54:02,643][26022] Updated weights on worker 0-0, policy_version 926427 (0.00086) [2022-07-10 22:54:04,494][26022] Updated weights on worker 0-0, policy_version 926437 (0.00080) [2022-07-10 22:54:05,693][25689] Fps is (10 sec: 5389.0, 60 sec: 5541.8, 300 sec: 5545.6). Total num frames: 948677632. Throughput: 0: 5711.5. Samples: 948684138. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:05,693][25689] Avg episode reward: [(0, '-1.136')] [2022-07-10 22:54:06,539][26022] Updated weights on worker 0-0, policy_version 926447 (0.00081) [2022-07-10 22:54:08,010][26022] Updated weights on worker 0-0, policy_version 926457 (0.00100) [2022-07-10 22:54:10,278][26022] Updated weights on worker 0-0, policy_version 926467 (0.00090) [2022-07-10 22:54:10,711][25689] Fps is (10 sec: 5401.8, 60 sec: 5509.0, 300 sec: 5544.6). Total num frames: 948705280. Throughput: 0: 4863.8. Samples: 948700790. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:10,711][25689] Avg episode reward: [(0, '-2.132')] [2022-07-10 22:54:11,651][26022] Updated weights on worker 0-0, policy_version 926477 (0.00089) [2022-07-10 22:54:13,911][26022] Updated weights on worker 0-0, policy_version 926487 (0.00084) [2022-07-10 22:54:15,588][26022] Updated weights on worker 0-0, policy_version 926497 (0.00092) [2022-07-10 22:54:15,764][25689] Fps is (10 sec: 5490.5, 60 sec: 5525.7, 300 sec: 5540.7). Total num frames: 948732928. Throughput: 0: 5706.7. Samples: 948734012. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:15,765][25689] Avg episode reward: [(0, '-2.869')] [2022-07-10 22:54:17,449][26022] Updated weights on worker 0-0, policy_version 926507 (0.00089) [2022-07-10 22:54:19,504][26022] Updated weights on worker 0-0, policy_version 926517 (0.00084) [2022-07-10 22:54:20,792][25689] Fps is (10 sec: 5586.5, 60 sec: 5524.9, 300 sec: 5547.4). Total num frames: 948761600. Throughput: 0: 5718.4. Samples: 948767592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:20,793][25689] Avg episode reward: [(0, '-2.455')] [2022-07-10 22:54:21,225][26022] Updated weights on worker 0-0, policy_version 926527 (0.00090) [2022-07-10 22:54:22,948][26022] Updated weights on worker 0-0, policy_version 926537 (0.00092) [2022-07-10 22:54:24,905][26022] Updated weights on worker 0-0, policy_version 926547 (0.00082) [2022-07-10 22:54:25,825][25689] Fps is (10 sec: 5496.2, 60 sec: 5511.1, 300 sec: 5540.3). Total num frames: 948788224. Throughput: 0: 4963.9. Samples: 948784072. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:25,826][25689] Avg episode reward: [(0, '-2.446')] [2022-07-10 22:54:26,446][26022] Updated weights on worker 0-0, policy_version 926557 (0.00100) [2022-07-10 22:54:28,754][26022] Updated weights on worker 0-0, policy_version 926567 (0.00089) [2022-07-10 22:54:30,442][26022] Updated weights on worker 0-0, policy_version 926577 (0.00095) [2022-07-10 22:54:30,837][25689] Fps is (10 sec: 5300.7, 60 sec: 5493.8, 300 sec: 5535.2). Total num frames: 948814848. Throughput: 0: 5790.5. Samples: 948817334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:30,838][25689] Avg episode reward: [(0, '-1.046')] [2022-07-10 22:54:32,200][26022] Updated weights on worker 0-0, policy_version 926587 (0.00089) [2022-07-10 22:54:34,315][26022] Updated weights on worker 0-0, policy_version 926597 (0.00084) [2022-07-10 22:54:35,875][26022] Updated weights on worker 0-0, policy_version 926607 (0.00054) [2022-07-10 22:54:35,971][25689] Fps is (10 sec: 5651.9, 60 sec: 5521.9, 300 sec: 5544.0). Total num frames: 948845568. Throughput: 0: 5782.8. Samples: 948850864. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:35,971][25689] Avg episode reward: [(0, '-1.176')] [2022-07-10 22:54:37,921][26022] Updated weights on worker 0-0, policy_version 926617 (0.00088) [2022-07-10 22:54:39,530][26022] Updated weights on worker 0-0, policy_version 926627 (0.00084) [2022-07-10 22:54:41,028][25689] Fps is (10 sec: 5828.0, 60 sec: 5569.4, 300 sec: 5547.4). Total num frames: 948874240. Throughput: 0: 4950.1. Samples: 948867764. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:41,029][25689] Avg episode reward: [(0, '0.051')] [2022-07-10 22:54:41,423][26022] Updated weights on worker 0-0, policy_version 926637 (0.00086) [2022-07-10 22:54:43,361][26022] Updated weights on worker 0-0, policy_version 926647 (0.00092) [2022-07-10 22:54:44,908][26022] Updated weights on worker 0-0, policy_version 926657 (0.00082) [2022-07-10 22:54:46,053][25689] Fps is (10 sec: 5585.9, 60 sec: 5516.9, 300 sec: 5544.5). Total num frames: 948901888. Throughput: 0: 5798.6. Samples: 948901368. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:46,054][25689] Avg episode reward: [(0, '1.018')] [2022-07-10 22:54:47,016][26022] Updated weights on worker 0-0, policy_version 926667 (0.00084) [2022-07-10 22:54:48,579][26022] Updated weights on worker 0-0, policy_version 926677 (0.00089) [2022-07-10 22:54:50,655][26022] Updated weights on worker 0-0, policy_version 926687 (0.00090) [2022-07-10 22:54:51,056][25689] Fps is (10 sec: 5616.2, 60 sec: 5551.4, 300 sec: 5549.6). Total num frames: 948930560. Throughput: 0: 5828.6. Samples: 948935182. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:51,057][25689] Avg episode reward: [(0, '0.504')] [2022-07-10 22:54:52,285][26022] Updated weights on worker 0-0, policy_version 926697 (0.00083) [2022-07-10 22:54:54,298][26022] Updated weights on worker 0-0, policy_version 926707 (0.00087) [2022-07-10 22:54:55,953][26022] Updated weights on worker 0-0, policy_version 926717 (0.00091) [2022-07-10 22:54:56,206][25689] Fps is (10 sec: 5648.0, 60 sec: 5543.9, 300 sec: 5550.4). Total num frames: 948959232. Throughput: 0: 4994.0. Samples: 948951920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:54:56,207][25689] Avg episode reward: [(0, '0.850')] [2022-07-10 22:54:58,021][26022] Updated weights on worker 0-0, policy_version 926727 (0.00088) [2022-07-10 22:54:59,756][26022] Updated weights on worker 0-0, policy_version 926737 (0.00090) [2022-07-10 22:55:01,291][25689] Fps is (10 sec: 5402.9, 60 sec: 5523.1, 300 sec: 5549.5). Total num frames: 948985856. Throughput: 0: 5799.3. Samples: 948985272. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:01,291][25689] Avg episode reward: [(0, '0.938')] [2022-07-10 22:55:01,529][26022] Updated weights on worker 0-0, policy_version 926747 (0.00080) [2022-07-10 22:55:03,867][26022] Updated weights on worker 0-0, policy_version 926757 (0.00085) [2022-07-10 22:55:05,552][26022] Updated weights on worker 0-0, policy_version 926767 (0.00093) [2022-07-10 22:55:06,334][25689] Fps is (10 sec: 5358.6, 60 sec: 5538.7, 300 sec: 5548.9). Total num frames: 949013504. Throughput: 0: 5698.5. Samples: 949016940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:06,335][25689] Avg episode reward: [(0, '1.285')] [2022-07-10 22:55:07,550][26022] Updated weights on worker 0-0, policy_version 926777 (0.00085) [2022-07-10 22:55:09,160][26022] Updated weights on worker 0-0, policy_version 926787 (0.00092) [2022-07-10 22:55:11,121][26022] Updated weights on worker 0-0, policy_version 926797 (0.00095) [2022-07-10 22:55:11,399][25689] Fps is (10 sec: 5470.7, 60 sec: 5534.4, 300 sec: 5546.5). Total num frames: 949041152. Throughput: 0: 5682.0. Samples: 949050766. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:11,400][25689] Avg episode reward: [(0, '1.269')] [2022-07-10 22:55:12,867][26022] Updated weights on worker 0-0, policy_version 926807 (0.00052) [2022-07-10 22:55:13,338][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:55:13,360][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000926809_949052416.pth [2022-07-10 22:55:13,360][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000924857_947053568.pth [2022-07-10 22:55:14,707][26022] Updated weights on worker 0-0, policy_version 926817 (0.00090) [2022-07-10 22:55:16,508][25689] Fps is (10 sec: 5535.9, 60 sec: 5546.2, 300 sec: 5542.3). Total num frames: 949069824. Throughput: 0: 5692.1. Samples: 949067480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:16,509][25689] Avg episode reward: [(0, '1.063')] [2022-07-10 22:55:16,646][26022] Updated weights on worker 0-0, policy_version 926827 (0.00106) [2022-07-10 22:55:18,359][26022] Updated weights on worker 0-0, policy_version 926837 (0.00093) [2022-07-10 22:55:20,127][26022] Updated weights on worker 0-0, policy_version 926847 (0.00081) [2022-07-10 22:55:21,557][25689] Fps is (10 sec: 5644.9, 60 sec: 5544.2, 300 sec: 5551.8). Total num frames: 949098496. Throughput: 0: 5715.2. Samples: 949101098. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:21,558][25689] Avg episode reward: [(0, '-0.242')] [2022-07-10 22:55:22,065][26022] Updated weights on worker 0-0, policy_version 926857 (0.00091) [2022-07-10 22:55:23,823][26022] Updated weights on worker 0-0, policy_version 926867 (0.00086) [2022-07-10 22:55:25,949][26022] Updated weights on worker 0-0, policy_version 926877 (0.00087) [2022-07-10 22:55:26,586][25689] Fps is (10 sec: 5588.3, 60 sec: 5561.4, 300 sec: 5548.0). Total num frames: 949126144. Throughput: 0: 5797.4. Samples: 949134348. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:26,587][25689] Avg episode reward: [(0, '-0.181')] [2022-07-10 22:55:27,566][26022] Updated weights on worker 0-0, policy_version 926887 (0.00095) [2022-07-10 22:55:29,476][26022] Updated weights on worker 0-0, policy_version 926897 (0.00085) [2022-07-10 22:55:31,287][26022] Updated weights on worker 0-0, policy_version 926907 (0.00088) [2022-07-10 22:55:31,631][25689] Fps is (10 sec: 5489.2, 60 sec: 5575.4, 300 sec: 5548.3). Total num frames: 949153792. Throughput: 0: 4950.6. Samples: 949150928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:31,631][25689] Avg episode reward: [(0, '-0.203')] [2022-07-10 22:55:33,122][26022] Updated weights on worker 0-0, policy_version 926917 (0.00087) [2022-07-10 22:55:34,923][26022] Updated weights on worker 0-0, policy_version 926927 (0.00081) [2022-07-10 22:55:36,728][25689] Fps is (10 sec: 5553.6, 60 sec: 5545.0, 300 sec: 5550.2). Total num frames: 949182464. Throughput: 0: 5782.1. Samples: 949184390. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:36,728][25689] Avg episode reward: [(0, '-0.143')] [2022-07-10 22:55:36,859][26022] Updated weights on worker 0-0, policy_version 926937 (0.00088) [2022-07-10 22:55:38,514][26022] Updated weights on worker 0-0, policy_version 926947 (0.00081) [2022-07-10 22:55:40,488][26022] Updated weights on worker 0-0, policy_version 926957 (0.00091) [2022-07-10 22:55:41,730][25689] Fps is (10 sec: 5677.9, 60 sec: 5550.0, 300 sec: 5553.7). Total num frames: 949211136. Throughput: 0: 5821.5. Samples: 949218536. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:41,731][25689] Avg episode reward: [(0, '0.130')] [2022-07-10 22:55:42,008][26022] Updated weights on worker 0-0, policy_version 926967 (0.00096) [2022-07-10 22:55:44,102][26022] Updated weights on worker 0-0, policy_version 926977 (0.00086) [2022-07-10 22:55:45,830][26022] Updated weights on worker 0-0, policy_version 926987 (0.00092) [2022-07-10 22:55:46,759][25689] Fps is (10 sec: 5614.4, 60 sec: 5549.7, 300 sec: 5546.7). Total num frames: 949238784. Throughput: 0: 5015.1. Samples: 949235516. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:46,759][25689] Avg episode reward: [(0, '0.654')] [2022-07-10 22:55:47,700][26022] Updated weights on worker 0-0, policy_version 926997 (0.00083) [2022-07-10 22:55:49,566][26022] Updated weights on worker 0-0, policy_version 927007 (0.00087) [2022-07-10 22:55:51,366][26022] Updated weights on worker 0-0, policy_version 927017 (0.00090) [2022-07-10 22:55:51,818][25689] Fps is (10 sec: 5582.9, 60 sec: 5544.5, 300 sec: 5554.6). Total num frames: 949267456. Throughput: 0: 5859.4. Samples: 949269214. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:51,819][25689] Avg episode reward: [(0, '1.619')] [2022-07-10 22:55:53,118][26022] Updated weights on worker 0-0, policy_version 927027 (0.00090) [2022-07-10 22:55:55,158][26022] Updated weights on worker 0-0, policy_version 927037 (0.00052) [2022-07-10 22:55:56,541][26022] Updated weights on worker 0-0, policy_version 927047 (0.00086) [2022-07-10 22:55:56,934][25689] Fps is (10 sec: 5736.3, 60 sec: 5564.5, 300 sec: 5552.6). Total num frames: 949297152. Throughput: 0: 5875.6. Samples: 949303116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:55:56,935][25689] Avg episode reward: [(0, '1.555')] [2022-07-10 22:55:58,774][26022] Updated weights on worker 0-0, policy_version 927057 (0.00091) [2022-07-10 22:56:00,336][26022] Updated weights on worker 0-0, policy_version 927067 (0.00093) [2022-07-10 22:56:01,987][25689] Fps is (10 sec: 5236.5, 60 sec: 5516.8, 300 sec: 5545.1). Total num frames: 949320704. Throughput: 0: 4993.6. Samples: 949319690. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:01,988][25689] Avg episode reward: [(0, '1.143')] [2022-07-10 22:56:02,761][26022] Updated weights on worker 0-0, policy_version 927077 (0.00085) [2022-07-10 22:56:04,426][26022] Updated weights on worker 0-0, policy_version 927087 (0.00104) [2022-07-10 22:56:06,438][26022] Updated weights on worker 0-0, policy_version 927097 (0.00085) [2022-07-10 22:56:07,057][25689] Fps is (10 sec: 5361.6, 60 sec: 5565.0, 300 sec: 5557.9). Total num frames: 949351424. Throughput: 0: 5708.7. Samples: 949351390. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:07,057][25689] Avg episode reward: [(0, '1.645')] [2022-07-10 22:56:08,164][26022] Updated weights on worker 0-0, policy_version 927107 (0.00499) [2022-07-10 22:56:10,105][26022] Updated weights on worker 0-0, policy_version 927117 (0.00090) [2022-07-10 22:56:11,785][26022] Updated weights on worker 0-0, policy_version 927127 (0.00084) [2022-07-10 22:56:12,097][25689] Fps is (10 sec: 5773.3, 60 sec: 5567.2, 300 sec: 5551.3). Total num frames: 949379072. Throughput: 0: 5717.1. Samples: 949385150. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:12,097][25689] Avg episode reward: [(0, '1.672')] [2022-07-10 22:56:13,669][26022] Updated weights on worker 0-0, policy_version 927137 (0.00093) [2022-07-10 22:56:15,526][26022] Updated weights on worker 0-0, policy_version 927147 (0.00082) [2022-07-10 22:56:17,231][25689] Fps is (10 sec: 5434.7, 60 sec: 5548.1, 300 sec: 5550.1). Total num frames: 949406720. Throughput: 0: 4868.3. Samples: 949401928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:17,232][25689] Avg episode reward: [(0, '1.209')] [2022-07-10 22:56:17,335][26022] Updated weights on worker 0-0, policy_version 927157 (0.00088) [2022-07-10 22:56:19,163][26022] Updated weights on worker 0-0, policy_version 927167 (0.00086) [2022-07-10 22:56:21,015][26022] Updated weights on worker 0-0, policy_version 927177 (0.00079) [2022-07-10 22:56:22,326][25689] Fps is (10 sec: 5606.2, 60 sec: 5560.8, 300 sec: 5555.5). Total num frames: 949436416. Throughput: 0: 5709.2. Samples: 949435806. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:22,326][25689] Avg episode reward: [(0, '0.391')] [2022-07-10 22:56:22,658][26022] Updated weights on worker 0-0, policy_version 927187 (0.00085) [2022-07-10 22:56:24,671][26022] Updated weights on worker 0-0, policy_version 927197 (0.00082) [2022-07-10 22:56:26,330][26022] Updated weights on worker 0-0, policy_version 927207 (0.00085) [2022-07-10 22:56:27,334][25689] Fps is (10 sec: 5676.0, 60 sec: 5562.7, 300 sec: 5546.2). Total num frames: 949464064. Throughput: 0: 5835.5. Samples: 949469720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:27,335][25689] Avg episode reward: [(0, '0.224')] [2022-07-10 22:56:28,259][26022] Updated weights on worker 0-0, policy_version 927217 (0.00082) [2022-07-10 22:56:30,047][26022] Updated weights on worker 0-0, policy_version 927227 (0.00096) [2022-07-10 22:56:31,980][26022] Updated weights on worker 0-0, policy_version 927237 (0.00083) [2022-07-10 22:56:32,353][25689] Fps is (10 sec: 5514.6, 60 sec: 5565.0, 300 sec: 5554.4). Total num frames: 949491712. Throughput: 0: 4998.2. Samples: 949486392. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:32,354][25689] Avg episode reward: [(0, '-0.351')] [2022-07-10 22:56:33,692][26022] Updated weights on worker 0-0, policy_version 927247 (0.00091) [2022-07-10 22:56:35,615][26022] Updated weights on worker 0-0, policy_version 927257 (0.00081) [2022-07-10 22:56:37,271][26022] Updated weights on worker 0-0, policy_version 927267 (0.00083) [2022-07-10 22:56:37,478][25689] Fps is (10 sec: 5653.1, 60 sec: 5579.3, 300 sec: 5552.5). Total num frames: 949521408. Throughput: 0: 5826.4. Samples: 949519894. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:37,478][25689] Avg episode reward: [(0, '-1.389')] [2022-07-10 22:56:39,262][26022] Updated weights on worker 0-0, policy_version 927277 (0.00091) [2022-07-10 22:56:40,997][26022] Updated weights on worker 0-0, policy_version 927287 (0.00081) [2022-07-10 22:56:42,499][25689] Fps is (10 sec: 5651.9, 60 sec: 5560.8, 300 sec: 5556.4). Total num frames: 949549056. Throughput: 0: 5830.1. Samples: 949553418. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:42,499][25689] Avg episode reward: [(0, '-1.364')] [2022-07-10 22:56:43,072][26022] Updated weights on worker 0-0, policy_version 927297 (0.00103) [2022-07-10 22:56:44,585][26022] Updated weights on worker 0-0, policy_version 927307 (0.00086) [2022-07-10 22:56:46,674][26022] Updated weights on worker 0-0, policy_version 927317 (0.00091) [2022-07-10 22:56:47,504][25689] Fps is (10 sec: 5515.0, 60 sec: 5562.9, 300 sec: 5553.1). Total num frames: 949576704. Throughput: 0: 4987.7. Samples: 949570324. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-10 22:56:47,505][25689] Avg episode reward: [(0, '-1.102')] [2022-07-10 22:56:48,312][26022] Updated weights on worker 0-0, policy_version 927327 (0.00087) [2022-07-10 22:56:50,496][26022] Updated weights on worker 0-0, policy_version 927337 (0.00086) [2022-07-10 22:56:51,950][26022] Updated weights on worker 0-0, policy_version 927347 (0.00087) [2022-07-10 22:56:52,521][25689] Fps is (10 sec: 5619.4, 60 sec: 5566.8, 300 sec: 5553.7). Total num frames: 949605376. Throughput: 0: 5824.3. Samples: 949603858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:56:52,523][25689] Avg episode reward: [(0, '-1.513')] [2022-07-10 22:56:53,934][26022] Updated weights on worker 0-0, policy_version 927357 (0.00086) [2022-07-10 22:56:55,609][26022] Updated weights on worker 0-0, policy_version 927367 (0.00087) [2022-07-10 22:56:57,547][26022] Updated weights on worker 0-0, policy_version 927377 (0.00091) [2022-07-10 22:56:57,581][25689] Fps is (10 sec: 5690.8, 60 sec: 5555.1, 300 sec: 5552.8). Total num frames: 949634048. Throughput: 0: 5865.7. Samples: 949637812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:56:57,581][25689] Avg episode reward: [(0, '-1.398')] [2022-07-10 22:56:59,377][26022] Updated weights on worker 0-0, policy_version 927387 (0.00094) [2022-07-10 22:57:01,541][26022] Updated weights on worker 0-0, policy_version 927397 (0.00093) [2022-07-10 22:57:02,603][25689] Fps is (10 sec: 5281.5, 60 sec: 5574.8, 300 sec: 5549.4). Total num frames: 949658624. Throughput: 0: 5759.4. Samples: 949669206. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:02,603][25689] Avg episode reward: [(0, '-0.274')] [2022-07-10 22:57:03,362][26022] Updated weights on worker 0-0, policy_version 927407 (0.00089) [2022-07-10 22:57:05,319][26022] Updated weights on worker 0-0, policy_version 927417 (0.00087) [2022-07-10 22:57:06,955][26022] Updated weights on worker 0-0, policy_version 927427 (0.00103) [2022-07-10 22:57:07,614][25689] Fps is (10 sec: 5409.1, 60 sec: 5563.2, 300 sec: 5549.7). Total num frames: 949688320. Throughput: 0: 5761.1. Samples: 949686180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:07,615][25689] Avg episode reward: [(0, '-0.531')] [2022-07-10 22:57:09,169][26022] Updated weights on worker 0-0, policy_version 927437 (0.00082) [2022-07-10 22:57:10,543][26022] Updated weights on worker 0-0, policy_version 927447 (0.00098) [2022-07-10 22:57:12,624][25689] Fps is (10 sec: 5620.3, 60 sec: 5549.1, 300 sec: 5551.9). Total num frames: 949714944. Throughput: 0: 5759.8. Samples: 949719646. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:12,624][25689] Avg episode reward: [(0, '-0.340')] [2022-07-10 22:57:12,652][26022] Updated weights on worker 0-0, policy_version 927457 (0.00099) [2022-07-10 22:57:13,499][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:57:13,510][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000927462_949721088.pth [2022-07-10 22:57:13,510][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000925505_947717120.pth [2022-07-10 22:57:14,401][26022] Updated weights on worker 0-0, policy_version 927467 (0.00090) [2022-07-10 22:57:16,241][26022] Updated weights on worker 0-0, policy_version 927477 (0.00079) [2022-07-10 22:57:17,733][25689] Fps is (10 sec: 5464.7, 60 sec: 5568.3, 300 sec: 5546.7). Total num frames: 949743616. Throughput: 0: 5725.4. Samples: 949753192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:17,734][25689] Avg episode reward: [(0, '0.788')] [2022-07-10 22:57:18,067][26022] Updated weights on worker 0-0, policy_version 927487 (0.00097) [2022-07-10 22:57:20,110][26022] Updated weights on worker 0-0, policy_version 927497 (0.00086) [2022-07-10 22:57:21,631][26022] Updated weights on worker 0-0, policy_version 927507 (0.00088) [2022-07-10 22:57:22,737][25689] Fps is (10 sec: 5670.2, 60 sec: 5559.7, 300 sec: 5554.0). Total num frames: 949772288. Throughput: 0: 5003.9. Samples: 949769954. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:22,738][25689] Avg episode reward: [(0, '0.967')] [2022-07-10 22:57:23,810][26022] Updated weights on worker 0-0, policy_version 927517 (0.00096) [2022-07-10 22:57:25,474][26022] Updated weights on worker 0-0, policy_version 927527 (0.00085) [2022-07-10 22:57:27,392][26022] Updated weights on worker 0-0, policy_version 927537 (0.00087) [2022-07-10 22:57:27,757][25689] Fps is (10 sec: 5720.7, 60 sec: 5575.6, 300 sec: 5553.7). Total num frames: 949800960. Throughput: 0: 5816.9. Samples: 949803348. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:27,757][25689] Avg episode reward: [(0, '1.464')] [2022-07-10 22:57:29,085][26022] Updated weights on worker 0-0, policy_version 927547 (0.00096) [2022-07-10 22:57:30,996][26022] Updated weights on worker 0-0, policy_version 927557 (0.00089) [2022-07-10 22:57:32,764][25689] Fps is (10 sec: 5514.7, 60 sec: 5559.7, 300 sec: 5549.3). Total num frames: 949827584. Throughput: 0: 5815.2. Samples: 949836766. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:32,765][25689] Avg episode reward: [(0, '1.242')] [2022-07-10 22:57:32,851][26022] Updated weights on worker 0-0, policy_version 927567 (0.00085) [2022-07-10 22:57:34,671][26022] Updated weights on worker 0-0, policy_version 927577 (0.00894) [2022-07-10 22:57:36,565][26022] Updated weights on worker 0-0, policy_version 927587 (0.00088) [2022-07-10 22:57:37,887][25689] Fps is (10 sec: 5357.6, 60 sec: 5526.1, 300 sec: 5544.1). Total num frames: 949855232. Throughput: 0: 4977.0. Samples: 949853498. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:37,888][25689] Avg episode reward: [(0, '0.411')] [2022-07-10 22:57:38,340][26022] Updated weights on worker 0-0, policy_version 927597 (0.00643) [2022-07-10 22:57:40,053][26022] Updated weights on worker 0-0, policy_version 927607 (0.00092) [2022-07-10 22:57:42,050][26022] Updated weights on worker 0-0, policy_version 927617 (0.00090) [2022-07-10 22:57:42,973][25689] Fps is (10 sec: 5616.9, 60 sec: 5553.9, 300 sec: 5546.0). Total num frames: 949884928. Throughput: 0: 5786.7. Samples: 949887054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:42,974][25689] Avg episode reward: [(0, '-0.758')] [2022-07-10 22:57:43,970][26022] Updated weights on worker 0-0, policy_version 927627 (0.00088) [2022-07-10 22:57:45,624][26022] Updated weights on worker 0-0, policy_version 927637 (0.00092) [2022-07-10 22:57:47,643][26022] Updated weights on worker 0-0, policy_version 927647 (0.00089) [2022-07-10 22:57:48,002][25689] Fps is (10 sec: 5669.2, 60 sec: 5551.8, 300 sec: 5549.1). Total num frames: 949912576. Throughput: 0: 5807.3. Samples: 949920916. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:48,002][25689] Avg episode reward: [(0, '-0.870')] [2022-07-10 22:57:49,255][26022] Updated weights on worker 0-0, policy_version 927657 (0.00096) [2022-07-10 22:57:51,153][26022] Updated weights on worker 0-0, policy_version 927667 (0.00087) [2022-07-10 22:57:53,002][26022] Updated weights on worker 0-0, policy_version 927677 (0.00092) [2022-07-10 22:57:53,004][25689] Fps is (10 sec: 5614.6, 60 sec: 5553.1, 300 sec: 5550.2). Total num frames: 949941248. Throughput: 0: 4986.7. Samples: 949937698. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:53,005][25689] Avg episode reward: [(0, '-1.177')] [2022-07-10 22:57:54,892][26022] Updated weights on worker 0-0, policy_version 927687 (0.00088) [2022-07-10 22:57:56,593][26022] Updated weights on worker 0-0, policy_version 927697 (0.00050) [2022-07-10 22:57:58,107][25689] Fps is (10 sec: 5573.3, 60 sec: 5532.3, 300 sec: 5548.7). Total num frames: 949968896. Throughput: 0: 5847.2. Samples: 949971730. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:57:58,108][25689] Avg episode reward: [(0, '-1.326')] [2022-07-10 22:57:58,470][26022] Updated weights on worker 0-0, policy_version 927707 (0.00080) [2022-07-10 22:58:00,111][26022] Updated weights on worker 0-0, policy_version 927717 (0.00083) [2022-07-10 22:58:02,530][26022] Updated weights on worker 0-0, policy_version 927727 (0.00085) [2022-07-10 22:58:03,112][25689] Fps is (10 sec: 5369.3, 60 sec: 5567.7, 300 sec: 5549.0). Total num frames: 949995520. Throughput: 0: 5763.6. Samples: 950003126. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:03,114][25689] Avg episode reward: [(0, '-1.475')] [2022-07-10 22:58:04,272][26022] Updated weights on worker 0-0, policy_version 927737 (0.00090) [2022-07-10 22:58:06,208][26022] Updated weights on worker 0-0, policy_version 927747 (0.00093) [2022-07-10 22:58:07,974][26022] Updated weights on worker 0-0, policy_version 927757 (0.00085) [2022-07-10 22:58:08,157][25689] Fps is (10 sec: 5400.2, 60 sec: 5530.7, 300 sec: 5541.9). Total num frames: 950023168. Throughput: 0: 4915.6. Samples: 950019992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:08,158][25689] Avg episode reward: [(0, '-0.554')] [2022-07-10 22:58:09,608][26022] Updated weights on worker 0-0, policy_version 927767 (0.00087) [2022-07-10 22:58:11,536][26022] Updated weights on worker 0-0, policy_version 927777 (0.00088) [2022-07-10 22:58:13,186][25689] Fps is (10 sec: 5692.4, 60 sec: 5579.7, 300 sec: 5552.6). Total num frames: 950052864. Throughput: 0: 5746.1. Samples: 950053664. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:13,186][25689] Avg episode reward: [(0, '0.560')] [2022-07-10 22:58:13,253][26022] Updated weights on worker 0-0, policy_version 927787 (0.00084) [2022-07-10 22:58:15,152][26022] Updated weights on worker 0-0, policy_version 927797 (0.00081) [2022-07-10 22:58:17,183][26022] Updated weights on worker 0-0, policy_version 927807 (0.00095) [2022-07-10 22:58:18,252][25689] Fps is (10 sec: 5579.1, 60 sec: 5549.9, 300 sec: 5544.9). Total num frames: 950079488. Throughput: 0: 5757.7. Samples: 950087718. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:18,253][25689] Avg episode reward: [(0, '0.790')] [2022-07-10 22:58:18,882][26022] Updated weights on worker 0-0, policy_version 927817 (0.00087) [2022-07-10 22:58:20,729][26022] Updated weights on worker 0-0, policy_version 927827 (0.00087) [2022-07-10 22:58:22,547][26022] Updated weights on worker 0-0, policy_version 927837 (0.00090) [2022-07-10 22:58:23,280][25689] Fps is (10 sec: 5681.0, 60 sec: 5581.5, 300 sec: 5555.9). Total num frames: 950110208. Throughput: 0: 5029.6. Samples: 950104562. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:23,280][25689] Avg episode reward: [(0, '0.713')] [2022-07-10 22:58:24,292][26022] Updated weights on worker 0-0, policy_version 927847 (0.00088) [2022-07-10 22:58:26,110][26022] Updated weights on worker 0-0, policy_version 927857 (0.00095) [2022-07-10 22:58:28,039][26022] Updated weights on worker 0-0, policy_version 927867 (0.00080) [2022-07-10 22:58:28,299][25689] Fps is (10 sec: 5707.8, 60 sec: 5547.8, 300 sec: 5552.3). Total num frames: 950136832. Throughput: 0: 5870.5. Samples: 950138232. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:28,299][25689] Avg episode reward: [(0, '1.317')] [2022-07-10 22:58:29,871][26022] Updated weights on worker 0-0, policy_version 927877 (0.00086) [2022-07-10 22:58:31,785][26022] Updated weights on worker 0-0, policy_version 927887 (0.00083) [2022-07-10 22:58:33,313][25689] Fps is (10 sec: 5511.6, 60 sec: 5581.0, 300 sec: 5553.3). Total num frames: 950165504. Throughput: 0: 5856.1. Samples: 950171528. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:33,313][25689] Avg episode reward: [(0, '1.585')] [2022-07-10 22:58:33,546][26022] Updated weights on worker 0-0, policy_version 927897 (0.00093) [2022-07-10 22:58:35,514][26022] Updated weights on worker 0-0, policy_version 927907 (0.00086) [2022-07-10 22:58:37,252][26022] Updated weights on worker 0-0, policy_version 927917 (0.00089) [2022-07-10 22:58:38,356][25689] Fps is (10 sec: 5599.7, 60 sec: 5588.3, 300 sec: 5559.8). Total num frames: 950193152. Throughput: 0: 5008.5. Samples: 950188410. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:38,357][25689] Avg episode reward: [(0, '1.095')] [2022-07-10 22:58:39,209][26022] Updated weights on worker 0-0, policy_version 927927 (0.00089) [2022-07-10 22:58:40,896][26022] Updated weights on worker 0-0, policy_version 927937 (0.00089) [2022-07-10 22:58:42,670][26022] Updated weights on worker 0-0, policy_version 927947 (0.00092) [2022-07-10 22:58:43,372][25689] Fps is (10 sec: 5497.0, 60 sec: 5560.9, 300 sec: 5549.3). Total num frames: 950220800. Throughput: 0: 5849.8. Samples: 950222096. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:43,373][25689] Avg episode reward: [(0, '1.382')] [2022-07-10 22:58:44,549][26022] Updated weights on worker 0-0, policy_version 927957 (0.00083) [2022-07-10 22:58:46,287][26022] Updated weights on worker 0-0, policy_version 927967 (0.00084) [2022-07-10 22:58:48,178][26022] Updated weights on worker 0-0, policy_version 927977 (0.00085) [2022-07-10 22:58:48,403][25689] Fps is (10 sec: 5504.1, 60 sec: 5560.7, 300 sec: 5552.4). Total num frames: 950248448. Throughput: 0: 5851.3. Samples: 950255866. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:48,403][25689] Avg episode reward: [(0, '1.255')] [2022-07-10 22:58:49,874][26022] Updated weights on worker 0-0, policy_version 927987 (0.00083) [2022-07-10 22:58:52,055][26022] Updated weights on worker 0-0, policy_version 927997 (0.00088) [2022-07-10 22:58:53,450][25689] Fps is (10 sec: 5588.6, 60 sec: 5556.6, 300 sec: 5552.7). Total num frames: 950277120. Throughput: 0: 5017.2. Samples: 950272558. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:53,450][25689] Avg episode reward: [(0, '1.112')] [2022-07-10 22:58:53,592][26022] Updated weights on worker 0-0, policy_version 928007 (0.00087) [2022-07-10 22:58:55,574][26022] Updated weights on worker 0-0, policy_version 928017 (0.00084) [2022-07-10 22:58:57,267][26022] Updated weights on worker 0-0, policy_version 928027 (0.00091) [2022-07-10 22:58:58,479][25689] Fps is (10 sec: 5589.5, 60 sec: 5563.4, 300 sec: 5553.0). Total num frames: 950304768. Throughput: 0: 5836.7. Samples: 950305858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:58:58,480][25689] Avg episode reward: [(0, '1.210')] [2022-07-10 22:58:59,262][26022] Updated weights on worker 0-0, policy_version 928037 (0.00090) [2022-07-10 22:59:01,259][26022] Updated weights on worker 0-0, policy_version 928047 (0.00103) [2022-07-10 22:59:03,393][26022] Updated weights on worker 0-0, policy_version 928057 (0.00087) [2022-07-10 22:59:03,482][25689] Fps is (10 sec: 5409.4, 60 sec: 5563.5, 300 sec: 5553.5). Total num frames: 950331392. Throughput: 0: 5713.3. Samples: 950336992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:03,483][25689] Avg episode reward: [(0, '-0.617')] [2022-07-10 22:59:05,227][26022] Updated weights on worker 0-0, policy_version 928067 (0.00097) [2022-07-10 22:59:06,964][26022] Updated weights on worker 0-0, policy_version 928077 (0.00094) [2022-07-10 22:59:08,506][25689] Fps is (10 sec: 5412.6, 60 sec: 5565.6, 300 sec: 5553.4). Total num frames: 950359040. Throughput: 0: 4862.3. Samples: 950353612. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:08,507][25689] Avg episode reward: [(0, '-0.096')] [2022-07-10 22:59:08,794][26022] Updated weights on worker 0-0, policy_version 928087 (0.00087) [2022-07-10 22:59:10,875][26022] Updated weights on worker 0-0, policy_version 928097 (0.00095) [2022-07-10 22:59:12,572][26022] Updated weights on worker 0-0, policy_version 928107 (0.00086) [2022-07-10 22:59:13,515][25689] Fps is (10 sec: 5409.3, 60 sec: 5516.4, 300 sec: 5550.7). Total num frames: 950385664. Throughput: 0: 5709.2. Samples: 950387118. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:13,516][25689] Avg episode reward: [(0, '-0.171')] [2022-07-10 22:59:13,518][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 22:59:13,530][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000928112_950386688.pth [2022-07-10 22:59:13,531][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000926158_948385792.pth [2022-07-10 22:59:14,328][26022] Updated weights on worker 0-0, policy_version 928117 (0.00090) [2022-07-10 22:59:16,216][26022] Updated weights on worker 0-0, policy_version 928127 (0.00087) [2022-07-10 22:59:17,876][26022] Updated weights on worker 0-0, policy_version 928137 (0.00089) [2022-07-10 22:59:18,624][25689] Fps is (10 sec: 5464.9, 60 sec: 5546.4, 300 sec: 5549.2). Total num frames: 950414336. Throughput: 0: 5686.8. Samples: 950420420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:18,626][25689] Avg episode reward: [(0, '-0.388')] [2022-07-10 22:59:20,170][26022] Updated weights on worker 0-0, policy_version 928147 (0.00090) [2022-07-10 22:59:21,742][26022] Updated weights on worker 0-0, policy_version 928157 (0.00090) [2022-07-10 22:59:23,626][25689] Fps is (10 sec: 5671.5, 60 sec: 5514.8, 300 sec: 5556.7). Total num frames: 950443008. Throughput: 0: 4965.7. Samples: 950437020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:23,627][25689] Avg episode reward: [(0, '-1.105')] [2022-07-10 22:59:23,630][26022] Updated weights on worker 0-0, policy_version 928167 (0.00094) [2022-07-10 22:59:25,486][26022] Updated weights on worker 0-0, policy_version 928177 (0.00088) [2022-07-10 22:59:27,346][26022] Updated weights on worker 0-0, policy_version 928187 (0.00084) [2022-07-10 22:59:28,704][25689] Fps is (10 sec: 5587.3, 60 sec: 5526.4, 300 sec: 5558.9). Total num frames: 950470656. Throughput: 0: 5779.6. Samples: 950470350. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:28,705][25689] Avg episode reward: [(0, '-0.097')] [2022-07-10 22:59:29,136][26022] Updated weights on worker 0-0, policy_version 928197 (0.00104) [2022-07-10 22:59:30,866][26022] Updated weights on worker 0-0, policy_version 928207 (0.00085) [2022-07-10 22:59:32,937][26022] Updated weights on worker 0-0, policy_version 928217 (0.00095) [2022-07-10 22:59:33,709][25689] Fps is (10 sec: 5382.8, 60 sec: 5493.3, 300 sec: 5547.5). Total num frames: 950497280. Throughput: 0: 5782.1. Samples: 950503878. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:33,709][25689] Avg episode reward: [(0, '0.566')] [2022-07-10 22:59:34,668][26022] Updated weights on worker 0-0, policy_version 928227 (0.00084) [2022-07-10 22:59:36,678][26022] Updated weights on worker 0-0, policy_version 928237 (0.00088) [2022-07-10 22:59:38,326][26022] Updated weights on worker 0-0, policy_version 928247 (0.00094) [2022-07-10 22:59:38,805][25689] Fps is (10 sec: 5677.0, 60 sec: 5539.4, 300 sec: 5553.7). Total num frames: 950528000. Throughput: 0: 4970.2. Samples: 950520724. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:38,806][25689] Avg episode reward: [(0, '0.717')] [2022-07-10 22:59:40,279][26022] Updated weights on worker 0-0, policy_version 928257 (0.00091) [2022-07-10 22:59:41,779][26022] Updated weights on worker 0-0, policy_version 928267 (0.00106) [2022-07-10 22:59:43,809][25689] Fps is (10 sec: 5677.4, 60 sec: 5523.5, 300 sec: 5550.6). Total num frames: 950554624. Throughput: 0: 5816.6. Samples: 950554416. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:43,810][25689] Avg episode reward: [(0, '0.777')] [2022-07-10 22:59:43,830][26022] Updated weights on worker 0-0, policy_version 928277 (0.00086) [2022-07-10 22:59:45,711][26022] Updated weights on worker 0-0, policy_version 928287 (0.00053) [2022-07-10 22:59:47,328][26022] Updated weights on worker 0-0, policy_version 928297 (0.00096) [2022-07-10 22:59:48,856][25689] Fps is (10 sec: 5501.3, 60 sec: 5538.9, 300 sec: 5549.8). Total num frames: 950583296. Throughput: 0: 5845.8. Samples: 950588158. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:48,858][25689] Avg episode reward: [(0, '-0.317')] [2022-07-10 22:59:49,374][26022] Updated weights on worker 0-0, policy_version 928307 (0.00084) [2022-07-10 22:59:51,146][26022] Updated weights on worker 0-0, policy_version 928317 (0.00093) [2022-07-10 22:59:52,885][26022] Updated weights on worker 0-0, policy_version 928327 (0.00085) [2022-07-10 22:59:53,868][25689] Fps is (10 sec: 5701.1, 60 sec: 5542.2, 300 sec: 5552.4). Total num frames: 950611968. Throughput: 0: 5023.3. Samples: 950605142. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:53,869][25689] Avg episode reward: [(0, '0.259')] [2022-07-10 22:59:54,938][26022] Updated weights on worker 0-0, policy_version 928337 (0.00089) [2022-07-10 22:59:56,639][26022] Updated weights on worker 0-0, policy_version 928347 (0.00085) [2022-07-10 22:59:58,505][26022] Updated weights on worker 0-0, policy_version 928357 (0.00088) [2022-07-10 22:59:58,998][25689] Fps is (10 sec: 5553.6, 60 sec: 5533.0, 300 sec: 5555.0). Total num frames: 950639616. Throughput: 0: 5830.3. Samples: 950638452. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 22:59:58,998][25689] Avg episode reward: [(0, '0.542')] [2022-07-10 23:00:00,499][26022] Updated weights on worker 0-0, policy_version 928367 (0.00081) [2022-07-10 23:00:02,642][26022] Updated weights on worker 0-0, policy_version 928377 (0.00087) [2022-07-10 23:00:04,033][25689] Fps is (10 sec: 5238.4, 60 sec: 5513.2, 300 sec: 5548.3). Total num frames: 950665216. Throughput: 0: 5688.4. Samples: 950669454. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:04,033][25689] Avg episode reward: [(0, '0.427')] [2022-07-10 23:00:04,464][26022] Updated weights on worker 0-0, policy_version 928387 (0.00082) [2022-07-10 23:00:06,388][26022] Updated weights on worker 0-0, policy_version 928397 (0.00092) [2022-07-10 23:00:08,029][26022] Updated weights on worker 0-0, policy_version 928407 (0.00092) [2022-07-10 23:00:09,091][25689] Fps is (10 sec: 5377.0, 60 sec: 5526.9, 300 sec: 5551.8). Total num frames: 950693888. Throughput: 0: 5660.3. Samples: 950702692. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:09,093][25689] Avg episode reward: [(0, '0.423')] [2022-07-10 23:00:10,258][26022] Updated weights on worker 0-0, policy_version 928417 (0.00095) [2022-07-10 23:00:11,678][26022] Updated weights on worker 0-0, policy_version 928427 (0.00087) [2022-07-10 23:00:13,743][26022] Updated weights on worker 0-0, policy_version 928437 (0.00088) [2022-07-10 23:00:14,173][25689] Fps is (10 sec: 5554.2, 60 sec: 5537.2, 300 sec: 5548.9). Total num frames: 950721536. Throughput: 0: 5626.2. Samples: 950719382. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:14,173][25689] Avg episode reward: [(0, '0.313')] [2022-07-10 23:00:15,522][26022] Updated weights on worker 0-0, policy_version 928447 (0.00088) [2022-07-10 23:00:17,504][26022] Updated weights on worker 0-0, policy_version 928457 (0.00087) [2022-07-10 23:00:19,229][25689] Fps is (10 sec: 5454.1, 60 sec: 5525.1, 300 sec: 5545.3). Total num frames: 950749184. Throughput: 0: 5643.2. Samples: 950752624. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:19,230][25689] Avg episode reward: [(0, '1.579')] [2022-07-10 23:00:19,295][26022] Updated weights on worker 0-0, policy_version 928467 (0.00092) [2022-07-10 23:00:21,271][26022] Updated weights on worker 0-0, policy_version 928477 (0.00517) [2022-07-10 23:00:22,767][26022] Updated weights on worker 0-0, policy_version 928487 (0.00092) [2022-07-10 23:00:24,263][25689] Fps is (10 sec: 5480.0, 60 sec: 5505.3, 300 sec: 5545.2). Total num frames: 950776832. Throughput: 0: 5770.8. Samples: 950786200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:24,264][25689] Avg episode reward: [(0, '1.882')] [2022-07-10 23:00:24,925][26022] Updated weights on worker 0-0, policy_version 928497 (0.00082) [2022-07-10 23:00:26,343][26022] Updated weights on worker 0-0, policy_version 928507 (0.00089) [2022-07-10 23:00:28,452][26022] Updated weights on worker 0-0, policy_version 928517 (0.00087) [2022-07-10 23:00:29,355][25689] Fps is (10 sec: 5764.5, 60 sec: 5554.7, 300 sec: 5554.7). Total num frames: 950807552. Throughput: 0: 4938.1. Samples: 950802758. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:29,357][25689] Avg episode reward: [(0, '1.977')] [2022-07-10 23:00:30,263][26022] Updated weights on worker 0-0, policy_version 928527 (0.00094) [2022-07-10 23:00:32,191][26022] Updated weights on worker 0-0, policy_version 928537 (0.00093) [2022-07-10 23:00:33,987][26022] Updated weights on worker 0-0, policy_version 928547 (0.00083) [2022-07-10 23:00:34,457][25689] Fps is (10 sec: 5625.2, 60 sec: 5545.8, 300 sec: 5547.7). Total num frames: 950834176. Throughput: 0: 5746.9. Samples: 950835956. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:34,458][25689] Avg episode reward: [(0, '2.013')] [2022-07-10 23:00:35,810][26022] Updated weights on worker 0-0, policy_version 928557 (0.00092) [2022-07-10 23:00:37,667][26022] Updated weights on worker 0-0, policy_version 928567 (0.00086) [2022-07-10 23:00:39,510][25689] Fps is (10 sec: 5344.1, 60 sec: 5499.1, 300 sec: 5543.3). Total num frames: 950861824. Throughput: 0: 5745.0. Samples: 950869138. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-10 23:00:39,511][25689] Avg episode reward: [(0, '1.862')] [2022-07-10 23:00:39,642][26022] Updated weights on worker 0-0, policy_version 928577 (0.00087) [2022-07-10 23:00:41,318][26022] Updated weights on worker 0-0, policy_version 928587 (0.00091) [2022-07-10 23:00:43,250][26022] Updated weights on worker 0-0, policy_version 928597 (0.00084) [2022-07-10 23:00:44,552][25689] Fps is (10 sec: 5680.2, 60 sec: 5546.2, 300 sec: 5549.9). Total num frames: 950891520. Throughput: 0: 4931.8. Samples: 950886264. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:00:44,553][25689] Avg episode reward: [(0, '1.215')] [2022-07-10 23:00:45,223][26022] Updated weights on worker 0-0, policy_version 928607 (0.00099) [2022-07-10 23:00:46,800][26022] Updated weights on worker 0-0, policy_version 928618 (0.00087) [2022-07-10 23:00:48,967][26022] Updated weights on worker 0-0, policy_version 928628 (0.00094) [2022-07-10 23:00:49,559][25689] Fps is (10 sec: 5706.5, 60 sec: 5533.1, 300 sec: 5547.5). Total num frames: 950919168. Throughput: 0: 5799.1. Samples: 950919926. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:00:49,562][25689] Avg episode reward: [(0, '-0.052')] [2022-07-10 23:00:50,428][26022] Updated weights on worker 0-0, policy_version 928638 (0.00085) [2022-07-10 23:00:52,471][26022] Updated weights on worker 0-0, policy_version 928648 (0.00085) [2022-07-10 23:00:54,262][26022] Updated weights on worker 0-0, policy_version 928658 (0.00077) [2022-07-10 23:00:54,575][25689] Fps is (10 sec: 5414.9, 60 sec: 5498.9, 300 sec: 5539.0). Total num frames: 950945792. Throughput: 0: 5846.8. Samples: 950953582. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:00:54,576][25689] Avg episode reward: [(0, '-1.135')] [2022-07-10 23:00:56,091][26022] Updated weights on worker 0-0, policy_version 928668 (0.00094) [2022-07-10 23:00:57,987][26022] Updated weights on worker 0-0, policy_version 928678 (0.00079) [2022-07-10 23:00:59,703][25689] Fps is (10 sec: 5552.3, 60 sec: 5532.8, 300 sec: 5558.3). Total num frames: 950975488. Throughput: 0: 5015.4. Samples: 950970410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:00:59,703][25689] Avg episode reward: [(0, '-1.075')] [2022-07-10 23:00:59,735][26022] Updated weights on worker 0-0, policy_version 928688 (0.00085) [2022-07-10 23:01:01,612][26022] Updated weights on worker 0-0, policy_version 928698 (0.00103) [2022-07-10 23:01:04,228][26022] Updated weights on worker 0-0, policy_version 928708 (0.00092) [2022-07-10 23:01:04,734][25689] Fps is (10 sec: 5443.1, 60 sec: 5533.2, 300 sec: 5541.8). Total num frames: 951001088. Throughput: 0: 5734.1. Samples: 951001988. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:04,735][25689] Avg episode reward: [(0, '-1.233')] [2022-07-10 23:01:05,334][26022] Updated weights on worker 0-0, policy_version 928718 (0.00087) [2022-07-10 23:01:07,639][26022] Updated weights on worker 0-0, policy_version 928728 (0.00087) [2022-07-10 23:01:09,310][26022] Updated weights on worker 0-0, policy_version 928738 (0.00084) [2022-07-10 23:01:09,755][25689] Fps is (10 sec: 5297.1, 60 sec: 5519.7, 300 sec: 5542.1). Total num frames: 951028736. Throughput: 0: 5705.4. Samples: 951035150. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:09,757][25689] Avg episode reward: [(0, '-0.521')] [2022-07-10 23:01:11,475][26022] Updated weights on worker 0-0, policy_version 928748 (0.00086) [2022-07-10 23:01:13,020][26022] Updated weights on worker 0-0, policy_version 928758 (0.00088) [2022-07-10 23:01:13,697][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:01:13,711][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000928762_951052288.pth [2022-07-10 23:01:13,712][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000926809_949052416.pth [2022-07-10 23:01:14,762][25689] Fps is (10 sec: 5412.3, 60 sec: 5509.6, 300 sec: 5541.1). Total num frames: 951055360. Throughput: 0: 4877.7. Samples: 951052046. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:14,762][25689] Avg episode reward: [(0, '0.533')] [2022-07-10 23:01:15,035][26022] Updated weights on worker 0-0, policy_version 928768 (0.00084) [2022-07-10 23:01:16,607][26022] Updated weights on worker 0-0, policy_version 928778 (0.00115) [2022-07-10 23:01:18,592][26022] Updated weights on worker 0-0, policy_version 928788 (0.00089) [2022-07-10 23:01:19,883][25689] Fps is (10 sec: 5662.2, 60 sec: 5554.5, 300 sec: 5544.0). Total num frames: 951086080. Throughput: 0: 5705.6. Samples: 951085548. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:19,883][25689] Avg episode reward: [(0, '1.049')] [2022-07-10 23:01:20,270][26022] Updated weights on worker 0-0, policy_version 928798 (0.00089) [2022-07-10 23:01:22,172][26022] Updated weights on worker 0-0, policy_version 928808 (0.00091) [2022-07-10 23:01:24,048][26022] Updated weights on worker 0-0, policy_version 928818 (0.00090) [2022-07-10 23:01:24,892][25689] Fps is (10 sec: 5862.5, 60 sec: 5573.5, 300 sec: 5547.4). Total num frames: 951114752. Throughput: 0: 5813.7. Samples: 951119182. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:24,894][25689] Avg episode reward: [(0, '1.463')] [2022-07-10 23:01:25,803][26022] Updated weights on worker 0-0, policy_version 928828 (0.00092) [2022-07-10 23:01:27,864][26022] Updated weights on worker 0-0, policy_version 928838 (0.00090) [2022-07-10 23:01:29,468][26022] Updated weights on worker 0-0, policy_version 928848 (0.00086) [2022-07-10 23:01:29,905][25689] Fps is (10 sec: 5517.1, 60 sec: 5513.2, 300 sec: 5544.1). Total num frames: 951141376. Throughput: 0: 4999.4. Samples: 951135888. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:29,906][25689] Avg episode reward: [(0, '0.792')] [2022-07-10 23:01:31,265][26022] Updated weights on worker 0-0, policy_version 928858 (0.00085) [2022-07-10 23:01:33,452][26022] Updated weights on worker 0-0, policy_version 928868 (0.00095) [2022-07-10 23:01:34,917][25689] Fps is (10 sec: 5516.1, 60 sec: 5555.3, 300 sec: 5542.8). Total num frames: 951170048. Throughput: 0: 5823.4. Samples: 951169418. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:34,918][25689] Avg episode reward: [(0, '0.659')] [2022-07-10 23:01:35,037][26022] Updated weights on worker 0-0, policy_version 928878 (0.00095) [2022-07-10 23:01:37,096][26022] Updated weights on worker 0-0, policy_version 928888 (0.00089) [2022-07-10 23:01:38,806][26022] Updated weights on worker 0-0, policy_version 928898 (0.00096) [2022-07-10 23:01:40,021][25689] Fps is (10 sec: 5466.3, 60 sec: 5533.7, 300 sec: 5537.8). Total num frames: 951196672. Throughput: 0: 5815.6. Samples: 951202666. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:40,022][25689] Avg episode reward: [(0, '0.773')] [2022-07-10 23:01:40,514][26022] Updated weights on worker 0-0, policy_version 928908 (0.00088) [2022-07-10 23:01:42,575][26022] Updated weights on worker 0-0, policy_version 928918 (0.00078) [2022-07-10 23:01:44,238][26022] Updated weights on worker 0-0, policy_version 928928 (0.00092) [2022-07-10 23:01:45,071][25689] Fps is (10 sec: 5546.7, 60 sec: 5533.0, 300 sec: 5543.9). Total num frames: 951226368. Throughput: 0: 4975.6. Samples: 951219582. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:45,073][25689] Avg episode reward: [(0, '0.776')] [2022-07-10 23:01:46,260][26022] Updated weights on worker 0-0, policy_version 928938 (0.00087) [2022-07-10 23:01:47,907][26022] Updated weights on worker 0-0, policy_version 928948 (0.00058) [2022-07-10 23:01:49,776][26022] Updated weights on worker 0-0, policy_version 928958 (0.00109) [2022-07-10 23:01:50,079][25689] Fps is (10 sec: 5803.3, 60 sec: 5549.8, 300 sec: 5544.0). Total num frames: 951255040. Throughput: 0: 5821.1. Samples: 951253320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:50,081][25689] Avg episode reward: [(0, '0.050')] [2022-07-10 23:01:51,654][26022] Updated weights on worker 0-0, policy_version 928968 (0.00094) [2022-07-10 23:01:53,184][26022] Updated weights on worker 0-0, policy_version 928978 (0.00089) [2022-07-10 23:01:55,117][25689] Fps is (10 sec: 5504.3, 60 sec: 5547.8, 300 sec: 5537.5). Total num frames: 951281664. Throughput: 0: 5820.4. Samples: 951286988. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:01:55,118][25689] Avg episode reward: [(0, '0.291')] [2022-07-10 23:01:55,398][26022] Updated weights on worker 0-0, policy_version 928988 (0.00092) [2022-07-10 23:01:57,112][26022] Updated weights on worker 0-0, policy_version 928998 (0.00089) [2022-07-10 23:01:59,169][26022] Updated weights on worker 0-0, policy_version 929008 (0.00086) [2022-07-10 23:02:00,178][25689] Fps is (10 sec: 5475.3, 60 sec: 5536.9, 300 sec: 5550.6). Total num frames: 951310336. Throughput: 0: 5005.9. Samples: 951303570. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:00,178][25689] Avg episode reward: [(0, '0.566')] [2022-07-10 23:02:00,715][26022] Updated weights on worker 0-0, policy_version 929018 (0.00082) [2022-07-10 23:02:03,020][26022] Updated weights on worker 0-0, policy_version 929028 (0.00095) [2022-07-10 23:02:04,656][26022] Updated weights on worker 0-0, policy_version 929038 (0.00093) [2022-07-10 23:02:05,193][25689] Fps is (10 sec: 5589.3, 60 sec: 5572.3, 300 sec: 5543.6). Total num frames: 951337984. Throughput: 0: 5753.1. Samples: 951335348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:05,194][25689] Avg episode reward: [(0, '-0.164')] [2022-07-10 23:02:06,826][26022] Updated weights on worker 0-0, policy_version 929048 (0.00089) [2022-07-10 23:02:08,297][26022] Updated weights on worker 0-0, policy_version 929058 (0.00086) [2022-07-10 23:02:10,236][25689] Fps is (10 sec: 5294.2, 60 sec: 5536.4, 300 sec: 5539.6). Total num frames: 951363584. Throughput: 0: 5724.8. Samples: 951368716. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:10,236][25689] Avg episode reward: [(0, '-0.446')] [2022-07-10 23:02:10,475][26022] Updated weights on worker 0-0, policy_version 929068 (0.00088) [2022-07-10 23:02:12,183][26022] Updated weights on worker 0-0, policy_version 929078 (0.00093) [2022-07-10 23:02:14,032][26022] Updated weights on worker 0-0, policy_version 929088 (0.00088) [2022-07-10 23:02:15,255][25689] Fps is (10 sec: 5393.9, 60 sec: 5569.2, 300 sec: 5541.2). Total num frames: 951392256. Throughput: 0: 5721.8. Samples: 951402214. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:15,255][25689] Avg episode reward: [(0, '-0.221')] [2022-07-10 23:02:15,904][26022] Updated weights on worker 0-0, policy_version 929098 (0.00094) [2022-07-10 23:02:17,881][26022] Updated weights on worker 0-0, policy_version 929108 (0.00083) [2022-07-10 23:02:19,337][26022] Updated weights on worker 0-0, policy_version 929118 (0.00089) [2022-07-10 23:02:20,387][25689] Fps is (10 sec: 5750.0, 60 sec: 5551.2, 300 sec: 5542.3). Total num frames: 951421952. Throughput: 0: 5715.2. Samples: 951419068. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:20,387][25689] Avg episode reward: [(0, '-0.015')] [2022-07-10 23:02:21,516][26022] Updated weights on worker 0-0, policy_version 929128 (0.00079) [2022-07-10 23:02:22,883][26022] Updated weights on worker 0-0, policy_version 929138 (0.00086) [2022-07-10 23:02:25,107][26022] Updated weights on worker 0-0, policy_version 929148 (0.00080) [2022-07-10 23:02:25,471][25689] Fps is (10 sec: 5713.2, 60 sec: 5544.4, 300 sec: 5541.1). Total num frames: 951450624. Throughput: 0: 5807.1. Samples: 951453106. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:25,472][25689] Avg episode reward: [(0, '0.229')] [2022-07-10 23:02:26,599][26022] Updated weights on worker 0-0, policy_version 929158 (0.00085) [2022-07-10 23:02:28,639][26022] Updated weights on worker 0-0, policy_version 929168 (0.00086) [2022-07-10 23:02:30,359][26022] Updated weights on worker 0-0, policy_version 929178 (0.00083) [2022-07-10 23:02:30,496][25689] Fps is (10 sec: 5571.1, 60 sec: 5560.2, 300 sec: 5544.2). Total num frames: 951478272. Throughput: 0: 5822.9. Samples: 951486690. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:30,497][25689] Avg episode reward: [(0, '1.040')] [2022-07-10 23:02:32,153][26022] Updated weights on worker 0-0, policy_version 929188 (0.00094) [2022-07-10 23:02:33,958][26022] Updated weights on worker 0-0, policy_version 929198 (0.00084) [2022-07-10 23:02:35,595][25689] Fps is (10 sec: 5563.2, 60 sec: 5552.2, 300 sec: 5548.1). Total num frames: 951506944. Throughput: 0: 4991.3. Samples: 951503738. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:35,596][25689] Avg episode reward: [(0, '1.155')] [2022-07-10 23:02:35,741][26022] Updated weights on worker 0-0, policy_version 929208 (0.00079) [2022-07-10 23:02:37,734][26022] Updated weights on worker 0-0, policy_version 929218 (0.00095) [2022-07-10 23:02:39,407][26022] Updated weights on worker 0-0, policy_version 929228 (0.00093) [2022-07-10 23:02:40,651][25689] Fps is (10 sec: 5445.5, 60 sec: 5556.6, 300 sec: 5538.3). Total num frames: 951533568. Throughput: 0: 5817.7. Samples: 951536958. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:40,652][25689] Avg episode reward: [(0, '1.021')] [2022-07-10 23:02:41,306][26022] Updated weights on worker 0-0, policy_version 929238 (0.00081) [2022-07-10 23:02:43,277][26022] Updated weights on worker 0-0, policy_version 929248 (0.00453) [2022-07-10 23:02:44,988][26022] Updated weights on worker 0-0, policy_version 929258 (0.00084) [2022-07-10 23:02:45,655][25689] Fps is (10 sec: 5598.2, 60 sec: 5560.8, 300 sec: 5545.7). Total num frames: 951563264. Throughput: 0: 5825.2. Samples: 951570682. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:45,656][25689] Avg episode reward: [(0, '1.783')] [2022-07-10 23:02:47,055][26022] Updated weights on worker 0-0, policy_version 929268 (0.00610) [2022-07-10 23:02:48,676][26022] Updated weights on worker 0-0, policy_version 929278 (0.00086) [2022-07-10 23:02:50,482][26022] Updated weights on worker 0-0, policy_version 929288 (0.00085) [2022-07-10 23:02:50,668][25689] Fps is (10 sec: 5724.7, 60 sec: 5543.5, 300 sec: 5542.0). Total num frames: 951590912. Throughput: 0: 5000.5. Samples: 951587558. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:50,668][25689] Avg episode reward: [(0, '1.500')] [2022-07-10 23:02:52,502][26022] Updated weights on worker 0-0, policy_version 929298 (0.00701) [2022-07-10 23:02:54,068][26022] Updated weights on worker 0-0, policy_version 929308 (0.00093) [2022-07-10 23:02:55,695][25689] Fps is (10 sec: 5507.9, 60 sec: 5561.4, 300 sec: 5543.5). Total num frames: 951618560. Throughput: 0: 5845.0. Samples: 951621222. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:02:55,695][25689] Avg episode reward: [(0, '1.409')] [2022-07-10 23:02:56,025][26022] Updated weights on worker 0-0, policy_version 929318 (0.00078) [2022-07-10 23:02:58,312][26022] Updated weights on worker 0-0, policy_version 929329 (0.00086) [2022-07-10 23:02:59,830][26022] Updated weights on worker 0-0, policy_version 929339 (0.00093) [2022-07-10 23:03:00,798][25689] Fps is (10 sec: 5559.7, 60 sec: 5557.5, 300 sec: 5548.5). Total num frames: 951647232. Throughput: 0: 5855.2. Samples: 951654922. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:00,799][25689] Avg episode reward: [(0, '1.415')] [2022-07-10 23:03:01,875][26022] Updated weights on worker 0-0, policy_version 929349 (0.00076) [2022-07-10 23:03:03,883][26022] Updated weights on worker 0-0, policy_version 929359 (0.00086) [2022-07-10 23:03:05,653][26022] Updated weights on worker 0-0, policy_version 929369 (0.00085) [2022-07-10 23:03:05,807][25689] Fps is (10 sec: 5468.3, 60 sec: 5541.2, 300 sec: 5545.7). Total num frames: 951673856. Throughput: 0: 4912.6. Samples: 951669678. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:05,809][25689] Avg episode reward: [(0, '1.165')] [2022-07-10 23:03:07,525][26022] Updated weights on worker 0-0, policy_version 929379 (0.00087) [2022-07-10 23:03:09,481][26022] Updated weights on worker 0-0, policy_version 929389 (0.00084) [2022-07-10 23:03:10,821][25689] Fps is (10 sec: 5516.6, 60 sec: 5594.5, 300 sec: 5542.6). Total num frames: 951702528. Throughput: 0: 5744.0. Samples: 951703320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:10,823][25689] Avg episode reward: [(0, '0.923')] [2022-07-10 23:03:11,095][26022] Updated weights on worker 0-0, policy_version 929399 (0.00123) [2022-07-10 23:03:13,227][26022] Updated weights on worker 0-0, policy_version 929409 (0.00094) [2022-07-10 23:03:13,854][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:03:13,868][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000929413_951718912.pth [2022-07-10 23:03:13,869][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000927462_949721088.pth [2022-07-10 23:03:14,723][26022] Updated weights on worker 0-0, policy_version 929419 (0.00100) [2022-07-10 23:03:15,839][25689] Fps is (10 sec: 5512.0, 60 sec: 5560.8, 300 sec: 5543.5). Total num frames: 951729152. Throughput: 0: 5763.9. Samples: 951737330. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:15,841][25689] Avg episode reward: [(0, '1.103')] [2022-07-10 23:03:16,663][26022] Updated weights on worker 0-0, policy_version 929429 (0.00093) [2022-07-10 23:03:18,523][26022] Updated weights on worker 0-0, policy_version 929439 (0.00085) [2022-07-10 23:03:20,307][26022] Updated weights on worker 0-0, policy_version 929449 (0.00082) [2022-07-10 23:03:20,962][25689] Fps is (10 sec: 5452.7, 60 sec: 5544.7, 300 sec: 5534.8). Total num frames: 951757824. Throughput: 0: 4919.6. Samples: 951754124. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:20,963][25689] Avg episode reward: [(0, '0.989')] [2022-07-10 23:03:22,342][26022] Updated weights on worker 0-0, policy_version 929459 (0.00096) [2022-07-10 23:03:24,046][26022] Updated weights on worker 0-0, policy_version 929469 (0.00093) [2022-07-10 23:03:25,938][26022] Updated weights on worker 0-0, policy_version 929479 (0.00093) [2022-07-10 23:03:26,036][25689] Fps is (10 sec: 5623.3, 60 sec: 5545.7, 300 sec: 5540.7). Total num frames: 951786496. Throughput: 0: 5835.5. Samples: 951787726. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:26,037][25689] Avg episode reward: [(0, '0.906')] [2022-07-10 23:03:27,846][26022] Updated weights on worker 0-0, policy_version 929489 (0.00082) [2022-07-10 23:03:29,565][26022] Updated weights on worker 0-0, policy_version 929499 (0.00088) [2022-07-10 23:03:31,045][25689] Fps is (10 sec: 5687.4, 60 sec: 5564.1, 300 sec: 5540.8). Total num frames: 951815168. Throughput: 0: 5818.5. Samples: 951820990. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:31,045][25689] Avg episode reward: [(0, '0.767')] [2022-07-10 23:03:31,508][26022] Updated weights on worker 0-0, policy_version 929509 (0.00092) [2022-07-10 23:03:33,255][26022] Updated weights on worker 0-0, policy_version 929519 (0.00090) [2022-07-10 23:03:35,171][26022] Updated weights on worker 0-0, policy_version 929529 (0.00080) [2022-07-10 23:03:36,083][25689] Fps is (10 sec: 5503.9, 60 sec: 5535.8, 300 sec: 5537.4). Total num frames: 951841792. Throughput: 0: 4949.0. Samples: 951837518. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:36,083][25689] Avg episode reward: [(0, '0.929')] [2022-07-10 23:03:36,963][26022] Updated weights on worker 0-0, policy_version 929539 (0.00086) [2022-07-10 23:03:38,816][26022] Updated weights on worker 0-0, policy_version 929549 (0.00089) [2022-07-10 23:03:40,687][26022] Updated weights on worker 0-0, policy_version 929559 (0.00081) [2022-07-10 23:03:41,137][25689] Fps is (10 sec: 5479.0, 60 sec: 5569.8, 300 sec: 5540.2). Total num frames: 951870464. Throughput: 0: 5777.1. Samples: 951870676. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:41,138][25689] Avg episode reward: [(0, '1.252')] [2022-07-10 23:03:42,497][26022] Updated weights on worker 0-0, policy_version 929569 (0.00081) [2022-07-10 23:03:44,422][26022] Updated weights on worker 0-0, policy_version 929579 (0.00086) [2022-07-10 23:03:46,178][25689] Fps is (10 sec: 5578.8, 60 sec: 5532.6, 300 sec: 5540.0). Total num frames: 951898112. Throughput: 0: 5787.7. Samples: 951904302. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:46,179][25689] Avg episode reward: [(0, '0.418')] [2022-07-10 23:03:46,268][26022] Updated weights on worker 0-0, policy_version 929589 (0.00088) [2022-07-10 23:03:48,072][26022] Updated weights on worker 0-0, policy_version 929599 (0.00088) [2022-07-10 23:03:49,823][26022] Updated weights on worker 0-0, policy_version 929609 (0.00089) [2022-07-10 23:03:51,185][25689] Fps is (10 sec: 5503.0, 60 sec: 5533.1, 300 sec: 5537.3). Total num frames: 951925760. Throughput: 0: 4973.6. Samples: 951921162. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:51,186][25689] Avg episode reward: [(0, '0.593')] [2022-07-10 23:03:51,729][26022] Updated weights on worker 0-0, policy_version 929619 (0.00096) [2022-07-10 23:03:53,435][26022] Updated weights on worker 0-0, policy_version 929629 (0.00082) [2022-07-10 23:03:55,156][26022] Updated weights on worker 0-0, policy_version 929639 (0.00097) [2022-07-10 23:03:56,251][25689] Fps is (10 sec: 5692.8, 60 sec: 5563.3, 300 sec: 5543.5). Total num frames: 951955456. Throughput: 0: 5823.4. Samples: 951954968. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:03:56,252][25689] Avg episode reward: [(0, '1.040')] [2022-07-10 23:03:57,104][26022] Updated weights on worker 0-0, policy_version 929649 (0.00086) [2022-07-10 23:03:59,070][26022] Updated weights on worker 0-0, policy_version 929659 (0.00091) [2022-07-10 23:04:00,671][26022] Updated weights on worker 0-0, policy_version 929669 (0.00093) [2022-07-10 23:04:01,298][25689] Fps is (10 sec: 5670.5, 60 sec: 5551.6, 300 sec: 5546.1). Total num frames: 951983104. Throughput: 0: 5838.4. Samples: 951988384. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:01,299][25689] Avg episode reward: [(0, '1.556')] [2022-07-10 23:04:03,018][26022] Updated weights on worker 0-0, policy_version 929679 (0.00092) [2022-07-10 23:04:04,714][26022] Updated weights on worker 0-0, policy_version 929689 (0.00055) [2022-07-10 23:04:06,321][25689] Fps is (10 sec: 5186.1, 60 sec: 5516.4, 300 sec: 5535.8). Total num frames: 952007680. Throughput: 0: 4904.3. Samples: 952003090. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:06,322][25689] Avg episode reward: [(0, '1.416')] [2022-07-10 23:04:06,776][26022] Updated weights on worker 0-0, policy_version 929699 (0.00086) [2022-07-10 23:04:08,459][26022] Updated weights on worker 0-0, policy_version 929709 (0.00084) [2022-07-10 23:04:10,486][26022] Updated weights on worker 0-0, policy_version 929719 (0.00096) [2022-07-10 23:04:11,336][25689] Fps is (10 sec: 5406.6, 60 sec: 5533.3, 300 sec: 5546.0). Total num frames: 952037376. Throughput: 0: 5725.9. Samples: 952036544. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:11,337][25689] Avg episode reward: [(0, '0.947')] [2022-07-10 23:04:12,269][26022] Updated weights on worker 0-0, policy_version 929729 (0.00088) [2022-07-10 23:04:14,172][26022] Updated weights on worker 0-0, policy_version 929739 (0.00091) [2022-07-10 23:04:16,019][26022] Updated weights on worker 0-0, policy_version 929749 (0.00083) [2022-07-10 23:04:16,347][25689] Fps is (10 sec: 5617.6, 60 sec: 5533.9, 300 sec: 5540.9). Total num frames: 952064000. Throughput: 0: 5702.3. Samples: 952069560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:16,348][25689] Avg episode reward: [(0, '1.383')] [2022-07-10 23:04:17,746][26022] Updated weights on worker 0-0, policy_version 929759 (0.00078) [2022-07-10 23:04:19,690][26022] Updated weights on worker 0-0, policy_version 929769 (0.00082) [2022-07-10 23:04:21,403][25689] Fps is (10 sec: 5493.0, 60 sec: 5540.1, 300 sec: 5539.9). Total num frames: 952092672. Throughput: 0: 4868.7. Samples: 952086268. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:21,403][25689] Avg episode reward: [(0, '1.115')] [2022-07-10 23:04:21,560][26022] Updated weights on worker 0-0, policy_version 929779 (0.00092) [2022-07-10 23:04:23,406][26022] Updated weights on worker 0-0, policy_version 929789 (0.00084) [2022-07-10 23:04:25,211][26022] Updated weights on worker 0-0, policy_version 929799 (0.00086) [2022-07-10 23:04:26,410][25689] Fps is (10 sec: 5596.9, 60 sec: 5529.3, 300 sec: 5541.3). Total num frames: 952120320. Throughput: 0: 5817.4. Samples: 952119952. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:26,410][25689] Avg episode reward: [(0, '0.826')] [2022-07-10 23:04:26,943][26022] Updated weights on worker 0-0, policy_version 929809 (0.00095) [2022-07-10 23:04:28,835][26022] Updated weights on worker 0-0, policy_version 929819 (0.00094) [2022-07-10 23:04:30,612][26022] Updated weights on worker 0-0, policy_version 929829 (0.00103) [2022-07-10 23:04:31,416][25689] Fps is (10 sec: 5624.8, 60 sec: 5529.6, 300 sec: 5548.1). Total num frames: 952148992. Throughput: 0: 5829.6. Samples: 952153600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-10 23:04:31,416][25689] Avg episode reward: [(0, '0.709')] [2022-07-10 23:04:32,464][26022] Updated weights on worker 0-0, policy_version 929839 (0.00091) [2022-07-10 23:04:34,259][26022] Updated weights on worker 0-0, policy_version 929849 (0.00086) [2022-07-10 23:04:36,007][26022] Updated weights on worker 0-0, policy_version 929859 (0.00078) [2022-07-10 23:04:36,436][25689] Fps is (10 sec: 5719.2, 60 sec: 5565.1, 300 sec: 5542.6). Total num frames: 952177664. Throughput: 0: 5023.6. Samples: 952170480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:04:36,437][25689] Avg episode reward: [(0, '-0.479')] [2022-07-10 23:04:38,007][26022] Updated weights on worker 0-0, policy_version 929869 (0.00083) [2022-07-10 23:04:39,729][26022] Updated weights on worker 0-0, policy_version 929879 (0.00084) [2022-07-10 23:04:41,483][25689] Fps is (10 sec: 5492.7, 60 sec: 5531.9, 300 sec: 5541.9). Total num frames: 952204288. Throughput: 0: 5873.7. Samples: 952204212. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:04:41,483][25689] Avg episode reward: [(0, '-0.568')] [2022-07-10 23:04:41,587][26022] Updated weights on worker 0-0, policy_version 929889 (0.00080) [2022-07-10 23:04:43,444][26022] Updated weights on worker 0-0, policy_version 929899 (0.00090) [2022-07-10 23:04:45,258][26022] Updated weights on worker 0-0, policy_version 929909 (0.00087) [2022-07-10 23:04:46,507][25689] Fps is (10 sec: 5592.5, 60 sec: 5567.4, 300 sec: 5545.7). Total num frames: 952233984. Throughput: 0: 5858.3. Samples: 952237688. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:04:46,508][25689] Avg episode reward: [(0, '-1.192')] [2022-07-10 23:04:47,050][26022] Updated weights on worker 0-0, policy_version 929919 (0.00094) [2022-07-10 23:04:48,875][26022] Updated weights on worker 0-0, policy_version 929929 (0.00087) [2022-07-10 23:04:50,683][26022] Updated weights on worker 0-0, policy_version 929939 (0.00086) [2022-07-10 23:04:51,537][25689] Fps is (10 sec: 5703.5, 60 sec: 5565.2, 300 sec: 5541.9). Total num frames: 952261632. Throughput: 0: 5025.1. Samples: 952254712. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:04:51,537][25689] Avg episode reward: [(0, '-0.981')] [2022-07-10 23:04:52,539][26022] Updated weights on worker 0-0, policy_version 929949 (0.00094) [2022-07-10 23:04:54,331][26022] Updated weights on worker 0-0, policy_version 929959 (0.00097) [2022-07-10 23:04:56,055][26022] Updated weights on worker 0-0, policy_version 929969 (0.00094) [2022-07-10 23:04:56,563][25689] Fps is (10 sec: 5600.8, 60 sec: 5552.0, 300 sec: 5547.3). Total num frames: 952290304. Throughput: 0: 5856.5. Samples: 952288350. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:04:56,563][25689] Avg episode reward: [(0, '-0.956')] [2022-07-10 23:04:58,187][26022] Updated weights on worker 0-0, policy_version 929979 (0.00079) [2022-07-10 23:04:59,683][26022] Updated weights on worker 0-0, policy_version 929989 (0.00088) [2022-07-10 23:05:01,695][25689] Fps is (10 sec: 5544.6, 60 sec: 5544.1, 300 sec: 5552.4). Total num frames: 952317952. Throughput: 0: 5824.9. Samples: 952321944. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:01,696][25689] Avg episode reward: [(0, '-0.767')] [2022-07-10 23:05:01,711][26022] Updated weights on worker 0-0, policy_version 929999 (0.00087) [2022-07-10 23:05:03,798][26022] Updated weights on worker 0-0, policy_version 930009 (0.00107) [2022-07-10 23:05:05,627][26022] Updated weights on worker 0-0, policy_version 930019 (0.00089) [2022-07-10 23:05:06,766][25689] Fps is (10 sec: 5319.0, 60 sec: 5573.6, 300 sec: 5545.3). Total num frames: 952344576. Throughput: 0: 5718.2. Samples: 952353534. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:06,768][25689] Avg episode reward: [(0, '-0.587')] [2022-07-10 23:05:07,730][26022] Updated weights on worker 0-0, policy_version 930029 (0.00083) [2022-07-10 23:05:09,227][26022] Updated weights on worker 0-0, policy_version 930039 (0.00618) [2022-07-10 23:05:11,439][26022] Updated weights on worker 0-0, policy_version 930049 (0.01291) [2022-07-10 23:05:11,775][25689] Fps is (10 sec: 5384.1, 60 sec: 5540.3, 300 sec: 5546.6). Total num frames: 952372224. Throughput: 0: 5699.6. Samples: 952370058. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:11,776][25689] Avg episode reward: [(0, '-0.224')] [2022-07-10 23:05:13,059][26022] Updated weights on worker 0-0, policy_version 930059 (0.00088) [2022-07-10 23:05:14,026][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:05:14,039][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000930062_952383488.pth [2022-07-10 23:05:14,039][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000928112_950386688.pth [2022-07-10 23:05:14,805][26022] Updated weights on worker 0-0, policy_version 930069 (0.00086) [2022-07-10 23:05:16,727][26022] Updated weights on worker 0-0, policy_version 930079 (0.00094) [2022-07-10 23:05:16,782][25689] Fps is (10 sec: 5622.8, 60 sec: 5574.5, 300 sec: 5551.0). Total num frames: 952400896. Throughput: 0: 5701.2. Samples: 952403626. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:16,784][25689] Avg episode reward: [(0, '0.250')] [2022-07-10 23:05:18,641][26022] Updated weights on worker 0-0, policy_version 930089 (0.00096) [2022-07-10 23:05:20,328][26022] Updated weights on worker 0-0, policy_version 930099 (0.00082) [2022-07-10 23:05:21,857][25689] Fps is (10 sec: 5687.4, 60 sec: 5572.7, 300 sec: 5553.7). Total num frames: 952429568. Throughput: 0: 5713.7. Samples: 952437148. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:21,859][25689] Avg episode reward: [(0, '0.270')] [2022-07-10 23:05:22,318][26022] Updated weights on worker 0-0, policy_version 930109 (0.00084) [2022-07-10 23:05:23,862][26022] Updated weights on worker 0-0, policy_version 930119 (0.00086) [2022-07-10 23:05:25,827][26022] Updated weights on worker 0-0, policy_version 930129 (0.00092) [2022-07-10 23:05:26,867][25689] Fps is (10 sec: 5584.5, 60 sec: 5572.4, 300 sec: 5544.9). Total num frames: 952457216. Throughput: 0: 4997.9. Samples: 952453998. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:26,869][25689] Avg episode reward: [(0, '0.520')] [2022-07-10 23:05:27,998][26022] Updated weights on worker 0-0, policy_version 930139 (0.00087) [2022-07-10 23:05:29,508][26022] Updated weights on worker 0-0, policy_version 930149 (0.00090) [2022-07-10 23:05:31,774][26022] Updated weights on worker 0-0, policy_version 930159 (0.00090) [2022-07-10 23:05:31,899][25689] Fps is (10 sec: 5302.8, 60 sec: 5519.3, 300 sec: 5542.7). Total num frames: 952482816. Throughput: 0: 5824.0. Samples: 952487260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:31,899][25689] Avg episode reward: [(0, '0.033')] [2022-07-10 23:05:33,012][26022] Updated weights on worker 0-0, policy_version 930169 (0.00087) [2022-07-10 23:05:35,431][26022] Updated weights on worker 0-0, policy_version 930179 (0.00090) [2022-07-10 23:05:36,762][26022] Updated weights on worker 0-0, policy_version 930189 (0.00085) [2022-07-10 23:05:36,913][25689] Fps is (10 sec: 5606.1, 60 sec: 5553.7, 300 sec: 5553.8). Total num frames: 952513536. Throughput: 0: 5804.4. Samples: 952520476. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:36,916][25689] Avg episode reward: [(0, '-0.294')] [2022-07-10 23:05:38,972][26022] Updated weights on worker 0-0, policy_version 930199 (0.00092) [2022-07-10 23:05:40,719][26022] Updated weights on worker 0-0, policy_version 930209 (0.00100) [2022-07-10 23:05:41,973][25689] Fps is (10 sec: 5590.4, 60 sec: 5535.6, 300 sec: 5539.7). Total num frames: 952539136. Throughput: 0: 4968.4. Samples: 952537092. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:41,974][25689] Avg episode reward: [(0, '-1.672')] [2022-07-10 23:05:42,568][26022] Updated weights on worker 0-0, policy_version 930219 (0.00092) [2022-07-10 23:05:44,487][26022] Updated weights on worker 0-0, policy_version 930229 (0.00084) [2022-07-10 23:05:46,433][26022] Updated weights on worker 0-0, policy_version 930239 (0.00093) [2022-07-10 23:05:46,988][25689] Fps is (10 sec: 5387.2, 60 sec: 5519.5, 300 sec: 5543.0). Total num frames: 952567808. Throughput: 0: 5771.5. Samples: 952570122. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:46,988][25689] Avg episode reward: [(0, '-1.816')] [2022-07-10 23:05:47,854][26022] Updated weights on worker 0-0, policy_version 930249 (0.00083) [2022-07-10 23:05:50,241][26022] Updated weights on worker 0-0, policy_version 930259 (0.00091) [2022-07-10 23:05:51,656][26022] Updated weights on worker 0-0, policy_version 930269 (0.00096) [2022-07-10 23:05:52,023][25689] Fps is (10 sec: 5705.9, 60 sec: 5536.0, 300 sec: 5549.5). Total num frames: 952596480. Throughput: 0: 5794.7. Samples: 952603874. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:52,023][25689] Avg episode reward: [(0, '-2.198')] [2022-07-10 23:05:53,641][26022] Updated weights on worker 0-0, policy_version 930279 (0.00086) [2022-07-10 23:05:55,353][26022] Updated weights on worker 0-0, policy_version 930289 (0.00094) [2022-07-10 23:05:57,042][25689] Fps is (10 sec: 5601.4, 60 sec: 5519.6, 300 sec: 5544.6). Total num frames: 952624128. Throughput: 0: 4980.0. Samples: 952620718. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:05:57,043][25689] Avg episode reward: [(0, '-2.507')] [2022-07-10 23:05:57,367][26022] Updated weights on worker 0-0, policy_version 930299 (0.00087) [2022-07-10 23:05:58,886][26022] Updated weights on worker 0-0, policy_version 930309 (0.00090) [2022-07-10 23:06:01,087][26022] Updated weights on worker 0-0, policy_version 930319 (0.00091) [2022-07-10 23:06:02,077][25689] Fps is (10 sec: 5397.8, 60 sec: 5511.5, 300 sec: 5548.0). Total num frames: 952650752. Throughput: 0: 5818.8. Samples: 952654076. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:02,078][25689] Avg episode reward: [(0, '-1.391')] [2022-07-10 23:06:02,988][26022] Updated weights on worker 0-0, policy_version 930329 (0.00089) [2022-07-10 23:06:05,144][26022] Updated weights on worker 0-0, policy_version 930339 (0.00094) [2022-07-10 23:06:06,649][26022] Updated weights on worker 0-0, policy_version 930349 (0.00086) [2022-07-10 23:06:07,093][25689] Fps is (10 sec: 5501.7, 60 sec: 5550.5, 300 sec: 5551.6). Total num frames: 952679424. Throughput: 0: 5739.8. Samples: 952685524. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:07,094][25689] Avg episode reward: [(0, '-0.765')] [2022-07-10 23:06:08,877][26022] Updated weights on worker 0-0, policy_version 930359 (0.00095) [2022-07-10 23:06:10,340][26022] Updated weights on worker 0-0, policy_version 930369 (0.00086) [2022-07-10 23:06:12,096][25689] Fps is (10 sec: 5519.3, 60 sec: 5534.1, 300 sec: 5551.6). Total num frames: 952706048. Throughput: 0: 4903.1. Samples: 952702296. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:12,097][25689] Avg episode reward: [(0, '0.231')] [2022-07-10 23:06:12,654][26022] Updated weights on worker 0-0, policy_version 930379 (0.00092) [2022-07-10 23:06:14,148][26022] Updated weights on worker 0-0, policy_version 930389 (0.00092) [2022-07-10 23:06:16,124][26022] Updated weights on worker 0-0, policy_version 930399 (0.00094) [2022-07-10 23:06:17,117][25689] Fps is (10 sec: 5516.4, 60 sec: 5532.9, 300 sec: 5546.6). Total num frames: 952734720. Throughput: 0: 5725.2. Samples: 952735650. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:17,118][25689] Avg episode reward: [(0, '0.685')] [2022-07-10 23:06:17,854][26022] Updated weights on worker 0-0, policy_version 930409 (0.00086) [2022-07-10 23:06:19,823][26022] Updated weights on worker 0-0, policy_version 930419 (0.00080) [2022-07-10 23:06:21,447][26022] Updated weights on worker 0-0, policy_version 930429 (0.00087) [2022-07-10 23:06:22,188][25689] Fps is (10 sec: 5682.1, 60 sec: 5533.2, 300 sec: 5545.5). Total num frames: 952763392. Throughput: 0: 5733.3. Samples: 952769378. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:22,189][25689] Avg episode reward: [(0, '-0.270')] [2022-07-10 23:06:23,455][26022] Updated weights on worker 0-0, policy_version 930439 (0.00083) [2022-07-10 23:06:25,055][26022] Updated weights on worker 0-0, policy_version 930449 (0.00084) [2022-07-10 23:06:27,109][26022] Updated weights on worker 0-0, policy_version 930459 (0.00096) [2022-07-10 23:06:27,209][25689] Fps is (10 sec: 5479.2, 60 sec: 5515.2, 300 sec: 5545.3). Total num frames: 952790016. Throughput: 0: 5021.5. Samples: 952786536. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:27,210][25689] Avg episode reward: [(0, '-0.102')] [2022-07-10 23:06:28,748][26022] Updated weights on worker 0-0, policy_version 930469 (0.00088) [2022-07-10 23:06:30,639][26022] Updated weights on worker 0-0, policy_version 930479 (0.00089) [2022-07-10 23:06:32,219][25689] Fps is (10 sec: 5512.9, 60 sec: 5568.2, 300 sec: 5545.3). Total num frames: 952818688. Throughput: 0: 5854.3. Samples: 952820098. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:32,219][25689] Avg episode reward: [(0, '0.323')] [2022-07-10 23:06:32,374][26022] Updated weights on worker 0-0, policy_version 930489 (0.00087) [2022-07-10 23:06:34,094][26022] Updated weights on worker 0-0, policy_version 930499 (0.00090) [2022-07-10 23:06:36,306][26022] Updated weights on worker 0-0, policy_version 930509 (0.00082) [2022-07-10 23:06:37,246][25689] Fps is (10 sec: 5611.4, 60 sec: 5516.1, 300 sec: 5550.2). Total num frames: 952846336. Throughput: 0: 5881.7. Samples: 952854042. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:37,246][25689] Avg episode reward: [(0, '0.380')] [2022-07-10 23:06:37,795][26022] Updated weights on worker 0-0, policy_version 930519 (0.00082) [2022-07-10 23:06:39,794][26022] Updated weights on worker 0-0, policy_version 930529 (0.00089) [2022-07-10 23:06:41,486][26022] Updated weights on worker 0-0, policy_version 930539 (0.00086) [2022-07-10 23:06:42,294][25689] Fps is (10 sec: 5590.1, 60 sec: 5568.1, 300 sec: 5546.8). Total num frames: 952875008. Throughput: 0: 5037.6. Samples: 952870660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:42,294][25689] Avg episode reward: [(0, '0.394')] [2022-07-10 23:06:43,310][26022] Updated weights on worker 0-0, policy_version 930549 (0.00091) [2022-07-10 23:06:45,331][26022] Updated weights on worker 0-0, policy_version 930559 (0.00083) [2022-07-10 23:06:47,074][26022] Updated weights on worker 0-0, policy_version 930569 (0.00514) [2022-07-10 23:06:47,312][25689] Fps is (10 sec: 5696.8, 60 sec: 5567.8, 300 sec: 5546.6). Total num frames: 952903680. Throughput: 0: 5857.1. Samples: 952904280. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:47,313][25689] Avg episode reward: [(0, '0.948')] [2022-07-10 23:06:48,986][26022] Updated weights on worker 0-0, policy_version 930579 (0.00085) [2022-07-10 23:06:50,820][26022] Updated weights on worker 0-0, policy_version 930589 (0.00092) [2022-07-10 23:06:52,331][25689] Fps is (10 sec: 5610.7, 60 sec: 5552.3, 300 sec: 5550.4). Total num frames: 952931328. Throughput: 0: 5850.5. Samples: 952937768. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:52,332][25689] Avg episode reward: [(0, '0.497')] [2022-07-10 23:06:52,608][26022] Updated weights on worker 0-0, policy_version 930599 (0.00085) [2022-07-10 23:06:54,435][26022] Updated weights on worker 0-0, policy_version 930609 (0.00080) [2022-07-10 23:06:56,232][26022] Updated weights on worker 0-0, policy_version 930619 (0.00088) [2022-07-10 23:06:57,362][25689] Fps is (10 sec: 5502.0, 60 sec: 5551.2, 300 sec: 5547.5). Total num frames: 952958976. Throughput: 0: 5005.9. Samples: 952954742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:06:57,364][25689] Avg episode reward: [(0, '0.361')] [2022-07-10 23:06:58,080][26022] Updated weights on worker 0-0, policy_version 930629 (0.00092) [2022-07-10 23:06:59,945][26022] Updated weights on worker 0-0, policy_version 930639 (0.00101) [2022-07-10 23:07:02,027][26022] Updated weights on worker 0-0, policy_version 930649 (0.00078) [2022-07-10 23:07:02,492][25689] Fps is (10 sec: 5442.4, 60 sec: 5559.5, 300 sec: 5545.4). Total num frames: 952986624. Throughput: 0: 5827.7. Samples: 952988368. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:02,492][25689] Avg episode reward: [(0, '0.158')] [2022-07-10 23:07:03,844][26022] Updated weights on worker 0-0, policy_version 930659 (0.00088) [2022-07-10 23:07:05,802][26022] Updated weights on worker 0-0, policy_version 930669 (0.00087) [2022-07-10 23:07:07,499][25689] Fps is (10 sec: 5455.0, 60 sec: 5543.3, 300 sec: 5553.0). Total num frames: 953014272. Throughput: 0: 5736.2. Samples: 953020076. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:07,499][25689] Avg episode reward: [(0, '0.209')] [2022-07-10 23:07:07,599][26022] Updated weights on worker 0-0, policy_version 930679 (0.00087) [2022-07-10 23:07:09,471][26022] Updated weights on worker 0-0, policy_version 930689 (0.00088) [2022-07-10 23:07:11,326][26022] Updated weights on worker 0-0, policy_version 930699 (0.00090) [2022-07-10 23:07:12,591][25689] Fps is (10 sec: 5576.7, 60 sec: 5569.0, 300 sec: 5551.6). Total num frames: 953042944. Throughput: 0: 4892.8. Samples: 953036894. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:12,591][25689] Avg episode reward: [(0, '0.053')] [2022-07-10 23:07:12,933][26022] Updated weights on worker 0-0, policy_version 930709 (0.00083) [2022-07-10 23:07:14,109][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:07:14,124][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000930715_953052160.pth [2022-07-10 23:07:14,125][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000928762_951052288.pth [2022-07-10 23:07:15,075][26022] Updated weights on worker 0-0, policy_version 930719 (0.00097) [2022-07-10 23:07:16,852][26022] Updated weights on worker 0-0, policy_version 930729 (0.00086) [2022-07-10 23:07:17,619][25689] Fps is (10 sec: 5565.2, 60 sec: 5551.5, 300 sec: 5546.6). Total num frames: 953070592. Throughput: 0: 5694.7. Samples: 953070098. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:17,619][25689] Avg episode reward: [(0, '0.309')] [2022-07-10 23:07:18,678][26022] Updated weights on worker 0-0, policy_version 930739 (0.00086) [2022-07-10 23:07:20,259][26022] Updated weights on worker 0-0, policy_version 930749 (0.00462) [2022-07-10 23:07:22,138][26022] Updated weights on worker 0-0, policy_version 930759 (0.00095) [2022-07-10 23:07:22,726][25689] Fps is (10 sec: 5556.7, 60 sec: 5548.1, 300 sec: 5546.2). Total num frames: 953099264. Throughput: 0: 5725.6. Samples: 953104224. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:22,727][25689] Avg episode reward: [(0, '0.554')] [2022-07-10 23:07:24,043][26022] Updated weights on worker 0-0, policy_version 930769 (0.00086) [2022-07-10 23:07:25,771][26022] Updated weights on worker 0-0, policy_version 930779 (0.00086) [2022-07-10 23:07:27,752][25689] Fps is (10 sec: 5659.2, 60 sec: 5581.5, 300 sec: 5549.6). Total num frames: 953127936. Throughput: 0: 4986.4. Samples: 953121066. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:27,752][25689] Avg episode reward: [(0, '0.199')] [2022-07-10 23:07:27,764][26022] Updated weights on worker 0-0, policy_version 930789 (0.00064) [2022-07-10 23:07:29,515][26022] Updated weights on worker 0-0, policy_version 930799 (0.00094) [2022-07-10 23:07:31,438][26022] Updated weights on worker 0-0, policy_version 930809 (0.00084) [2022-07-10 23:07:32,790][25689] Fps is (10 sec: 5596.5, 60 sec: 5562.0, 300 sec: 5547.3). Total num frames: 953155584. Throughput: 0: 5814.9. Samples: 953154350. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:32,791][25689] Avg episode reward: [(0, '-0.661')] [2022-07-10 23:07:33,285][26022] Updated weights on worker 0-0, policy_version 930819 (0.00493) [2022-07-10 23:07:34,943][26022] Updated weights on worker 0-0, policy_version 930829 (0.00077) [2022-07-10 23:07:36,907][26022] Updated weights on worker 0-0, policy_version 930839 (0.00088) [2022-07-10 23:07:37,831][25689] Fps is (10 sec: 5587.5, 60 sec: 5577.6, 300 sec: 5554.5). Total num frames: 953184256. Throughput: 0: 5835.0. Samples: 953188038. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:37,839][25689] Avg episode reward: [(0, '-0.367')] [2022-07-10 23:07:38,877][26022] Updated weights on worker 0-0, policy_version 930849 (0.00089) [2022-07-10 23:07:40,476][26022] Updated weights on worker 0-0, policy_version 930859 (0.00087) [2022-07-10 23:07:42,595][26022] Updated weights on worker 0-0, policy_version 930869 (0.00085) [2022-07-10 23:07:42,907][25689] Fps is (10 sec: 5465.4, 60 sec: 5541.2, 300 sec: 5542.8). Total num frames: 953210880. Throughput: 0: 5819.2. Samples: 953221662. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:42,908][25689] Avg episode reward: [(0, '-0.428')] [2022-07-10 23:07:43,949][26022] Updated weights on worker 0-0, policy_version 930879 (0.00086) [2022-07-10 23:07:46,180][26022] Updated weights on worker 0-0, policy_version 930889 (0.00085) [2022-07-10 23:07:47,703][26022] Updated weights on worker 0-0, policy_version 930899 (0.00090) [2022-07-10 23:07:47,919][25689] Fps is (10 sec: 5582.8, 60 sec: 5558.7, 300 sec: 5549.7). Total num frames: 953240576. Throughput: 0: 5816.7. Samples: 953238376. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:47,920][25689] Avg episode reward: [(0, '-0.050')] [2022-07-10 23:07:49,750][26022] Updated weights on worker 0-0, policy_version 930909 (0.00095) [2022-07-10 23:07:51,522][26022] Updated weights on worker 0-0, policy_version 930919 (0.00544) [2022-07-10 23:07:53,010][25689] Fps is (10 sec: 5777.3, 60 sec: 5569.0, 300 sec: 5552.0). Total num frames: 953269248. Throughput: 0: 5829.9. Samples: 953272234. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:53,012][25689] Avg episode reward: [(0, '0.138')] [2022-07-10 23:07:53,420][26022] Updated weights on worker 0-0, policy_version 930929 (0.00089) [2022-07-10 23:07:55,098][26022] Updated weights on worker 0-0, policy_version 930939 (0.00088) [2022-07-10 23:07:57,055][26022] Updated weights on worker 0-0, policy_version 930949 (0.00080) [2022-07-10 23:07:58,027][25689] Fps is (10 sec: 5572.0, 60 sec: 5570.3, 300 sec: 5550.1). Total num frames: 953296896. Throughput: 0: 5837.3. Samples: 953305926. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:07:58,028][25689] Avg episode reward: [(0, '-0.644')] [2022-07-10 23:07:58,803][26022] Updated weights on worker 0-0, policy_version 930959 (0.00083) [2022-07-10 23:08:00,786][26022] Updated weights on worker 0-0, policy_version 930969 (0.00049) [2022-07-10 23:08:02,742][26022] Updated weights on worker 0-0, policy_version 930979 (0.00078) [2022-07-10 23:08:03,119][25689] Fps is (10 sec: 5369.1, 60 sec: 5556.9, 300 sec: 5548.6). Total num frames: 953323520. Throughput: 0: 5003.8. Samples: 953322796. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:08:03,119][25689] Avg episode reward: [(0, '0.212')] [2022-07-10 23:08:04,649][26022] Updated weights on worker 0-0, policy_version 930989 (0.00093) [2022-07-10 23:08:06,456][26022] Updated weights on worker 0-0, policy_version 930999 (0.00087) [2022-07-10 23:08:08,124][25689] Fps is (10 sec: 5476.4, 60 sec: 5573.9, 300 sec: 5548.8). Total num frames: 953352192. Throughput: 0: 5741.3. Samples: 953354378. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:08:08,125][25689] Avg episode reward: [(0, '0.084')] [2022-07-10 23:08:08,343][26022] Updated weights on worker 0-0, policy_version 931009 (0.00094) [2022-07-10 23:08:10,158][26022] Updated weights on worker 0-0, policy_version 931019 (0.00089) [2022-07-10 23:08:11,943][26022] Updated weights on worker 0-0, policy_version 931029 (0.00092) [2022-07-10 23:08:13,142][25689] Fps is (10 sec: 5516.7, 60 sec: 5546.9, 300 sec: 5548.8). Total num frames: 953378816. Throughput: 0: 5746.6. Samples: 953387922. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:08:13,143][25689] Avg episode reward: [(0, '-0.463')] [2022-07-10 23:08:13,755][26022] Updated weights on worker 0-0, policy_version 931039 (0.00081) [2022-07-10 23:08:15,717][26022] Updated weights on worker 0-0, policy_version 931049 (0.00083) [2022-07-10 23:08:17,493][26022] Updated weights on worker 0-0, policy_version 931059 (0.00092) [2022-07-10 23:08:18,153][25689] Fps is (10 sec: 5513.6, 60 sec: 5565.4, 300 sec: 5550.9). Total num frames: 953407488. Throughput: 0: 4899.0. Samples: 953404524. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:08:18,154][25689] Avg episode reward: [(0, '-1.027')] [2022-07-10 23:08:19,211][26022] Updated weights on worker 0-0, policy_version 931069 (0.00090) [2022-07-10 23:08:21,104][26022] Updated weights on worker 0-0, policy_version 931079 (0.00084) [2022-07-10 23:08:22,896][26022] Updated weights on worker 0-0, policy_version 931089 (0.00088) [2022-07-10 23:08:23,276][25689] Fps is (10 sec: 5557.4, 60 sec: 5547.0, 300 sec: 5546.5). Total num frames: 953435136. Throughput: 0: 5722.3. Samples: 953438144. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-10 23:08:23,277][25689] Avg episode reward: [(0, '-1.241')] [2022-07-10 23:08:24,854][26022] Updated weights on worker 0-0, policy_version 931099 (0.00091) [2022-07-10 23:08:26,805][26022] Updated weights on worker 0-0, policy_version 931109 (0.00084) [2022-07-10 23:08:28,346][25689] Fps is (10 sec: 5625.7, 60 sec: 5559.8, 300 sec: 5548.8). Total num frames: 953464832. Throughput: 0: 5811.9. Samples: 953471908. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:28,347][25689] Avg episode reward: [(0, '-0.339')] [2022-07-10 23:08:28,419][26022] Updated weights on worker 0-0, policy_version 931119 (0.00087) [2022-07-10 23:08:30,439][26022] Updated weights on worker 0-0, policy_version 931129 (0.00086) [2022-07-10 23:08:31,929][26022] Updated weights on worker 0-0, policy_version 931139 (0.00086) [2022-07-10 23:08:33,414][25689] Fps is (10 sec: 5656.5, 60 sec: 5557.1, 300 sec: 5551.7). Total num frames: 953492480. Throughput: 0: 4972.6. Samples: 953488726. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:33,414][25689] Avg episode reward: [(0, '-0.166')] [2022-07-10 23:08:34,027][26022] Updated weights on worker 0-0, policy_version 931149 (0.00081) [2022-07-10 23:08:35,655][26022] Updated weights on worker 0-0, policy_version 931159 (0.00095) [2022-07-10 23:08:37,755][26022] Updated weights on worker 0-0, policy_version 931169 (0.00091) [2022-07-10 23:08:38,485][25689] Fps is (10 sec: 5554.9, 60 sec: 5554.4, 300 sec: 5551.4). Total num frames: 953521152. Throughput: 0: 5798.5. Samples: 953522418. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:38,486][25689] Avg episode reward: [(0, '-0.008')] [2022-07-10 23:08:39,227][26022] Updated weights on worker 0-0, policy_version 931179 (0.00084) [2022-07-10 23:08:41,340][26022] Updated weights on worker 0-0, policy_version 931189 (0.00089) [2022-07-10 23:08:43,131][26022] Updated weights on worker 0-0, policy_version 931199 (0.00089) [2022-07-10 23:08:43,571][25689] Fps is (10 sec: 5645.9, 60 sec: 5587.3, 300 sec: 5554.0). Total num frames: 953549824. Throughput: 0: 5805.9. Samples: 953555970. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:43,571][25689] Avg episode reward: [(0, '0.229')] [2022-07-10 23:08:44,944][26022] Updated weights on worker 0-0, policy_version 931209 (0.00089) [2022-07-10 23:08:46,985][26022] Updated weights on worker 0-0, policy_version 931219 (0.00083) [2022-07-10 23:08:48,505][26022] Updated weights on worker 0-0, policy_version 931229 (0.00092) [2022-07-10 23:08:48,590][25689] Fps is (10 sec: 5675.0, 60 sec: 5569.8, 300 sec: 5557.2). Total num frames: 953578496. Throughput: 0: 4971.2. Samples: 953572542. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:48,590][25689] Avg episode reward: [(0, '0.225')] [2022-07-10 23:08:50,652][26022] Updated weights on worker 0-0, policy_version 931239 (0.00091) [2022-07-10 23:08:52,532][26022] Updated weights on worker 0-0, policy_version 931249 (0.00090) [2022-07-10 23:08:53,628][25689] Fps is (10 sec: 5294.4, 60 sec: 5507.0, 300 sec: 5540.5). Total num frames: 953603072. Throughput: 0: 5772.8. Samples: 953605418. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:53,629][25689] Avg episode reward: [(0, '0.769')] [2022-07-10 23:08:54,267][26022] Updated weights on worker 0-0, policy_version 931259 (0.00091) [2022-07-10 23:08:56,151][26022] Updated weights on worker 0-0, policy_version 931269 (0.00088) [2022-07-10 23:08:57,856][26022] Updated weights on worker 0-0, policy_version 931279 (0.00091) [2022-07-10 23:08:58,645][25689] Fps is (10 sec: 5499.4, 60 sec: 5557.7, 300 sec: 5551.4). Total num frames: 953633792. Throughput: 0: 5779.1. Samples: 953638922. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:08:58,645][25689] Avg episode reward: [(0, '0.485')] [2022-07-10 23:08:59,896][26022] Updated weights on worker 0-0, policy_version 931289 (0.00093) [2022-07-10 23:09:02,053][26022] Updated weights on worker 0-0, policy_version 931299 (0.00089) [2022-07-10 23:09:03,736][25689] Fps is (10 sec: 5470.7, 60 sec: 5524.0, 300 sec: 5550.1). Total num frames: 953658368. Throughput: 0: 4944.3. Samples: 953655672. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:03,736][25689] Avg episode reward: [(0, '0.362')] [2022-07-10 23:09:03,853][26022] Updated weights on worker 0-0, policy_version 931309 (0.00092) [2022-07-10 23:09:05,601][26022] Updated weights on worker 0-0, policy_version 931319 (0.00079) [2022-07-10 23:09:07,478][26022] Updated weights on worker 0-0, policy_version 931329 (0.00086) [2022-07-10 23:09:08,802][25689] Fps is (10 sec: 5242.1, 60 sec: 5518.4, 300 sec: 5545.7). Total num frames: 953687040. Throughput: 0: 5670.8. Samples: 953687164. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:08,803][25689] Avg episode reward: [(0, '0.450')] [2022-07-10 23:09:09,443][26022] Updated weights on worker 0-0, policy_version 931339 (0.00091) [2022-07-10 23:09:11,072][26022] Updated weights on worker 0-0, policy_version 931349 (0.00084) [2022-07-10 23:09:13,041][26022] Updated weights on worker 0-0, policy_version 931359 (0.00092) [2022-07-10 23:09:13,831][25689] Fps is (10 sec: 5781.6, 60 sec: 5568.0, 300 sec: 5555.7). Total num frames: 953716736. Throughput: 0: 5717.2. Samples: 953720924. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:13,832][25689] Avg episode reward: [(0, '0.532')] [2022-07-10 23:09:14,297][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:09:14,314][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000931367_953719808.pth [2022-07-10 23:09:14,315][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000929413_951718912.pth [2022-07-10 23:09:14,952][26022] Updated weights on worker 0-0, policy_version 931369 (0.00084) [2022-07-10 23:09:16,827][26022] Updated weights on worker 0-0, policy_version 931379 (0.00096) [2022-07-10 23:09:18,729][26022] Updated weights on worker 0-0, policy_version 931389 (0.00087) [2022-07-10 23:09:18,855][25689] Fps is (10 sec: 5500.8, 60 sec: 5516.3, 300 sec: 5546.0). Total num frames: 953742336. Throughput: 0: 4859.9. Samples: 953737142. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:18,855][25689] Avg episode reward: [(0, '0.405')] [2022-07-10 23:09:20,425][26022] Updated weights on worker 0-0, policy_version 931399 (0.00089) [2022-07-10 23:09:22,310][26022] Updated weights on worker 0-0, policy_version 931409 (0.00097) [2022-07-10 23:09:23,925][25689] Fps is (10 sec: 5377.0, 60 sec: 5538.0, 300 sec: 5548.3). Total num frames: 953771008. Throughput: 0: 5699.8. Samples: 953770744. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:23,925][25689] Avg episode reward: [(0, '0.626')] [2022-07-10 23:09:24,255][26022] Updated weights on worker 0-0, policy_version 931419 (0.00090) [2022-07-10 23:09:25,875][26022] Updated weights on worker 0-0, policy_version 931429 (0.00092) [2022-07-10 23:09:27,963][26022] Updated weights on worker 0-0, policy_version 931439 (0.00078) [2022-07-10 23:09:28,963][25689] Fps is (10 sec: 5774.1, 60 sec: 5540.9, 300 sec: 5551.1). Total num frames: 953800704. Throughput: 0: 5803.6. Samples: 953804170. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:28,964][25689] Avg episode reward: [(0, '-0.195')] [2022-07-10 23:09:29,534][26022] Updated weights on worker 0-0, policy_version 931449 (0.00088) [2022-07-10 23:09:31,471][26022] Updated weights on worker 0-0, policy_version 931459 (0.00085) [2022-07-10 23:09:33,436][26022] Updated weights on worker 0-0, policy_version 931469 (0.00631) [2022-07-10 23:09:33,997][25689] Fps is (10 sec: 5591.6, 60 sec: 5527.1, 300 sec: 5544.0). Total num frames: 953827328. Throughput: 0: 4955.5. Samples: 953820856. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:33,997][25689] Avg episode reward: [(0, '-0.185')] [2022-07-10 23:09:35,138][26022] Updated weights on worker 0-0, policy_version 931479 (0.00092) [2022-07-10 23:09:37,000][26022] Updated weights on worker 0-0, policy_version 931489 (0.00087) [2022-07-10 23:09:38,845][26022] Updated weights on worker 0-0, policy_version 931499 (0.00091) [2022-07-10 23:09:39,026][25689] Fps is (10 sec: 5495.5, 60 sec: 5531.0, 300 sec: 5551.2). Total num frames: 953856000. Throughput: 0: 5820.9. Samples: 953854554. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:39,026][25689] Avg episode reward: [(0, '-0.349')] [2022-07-10 23:09:40,646][26022] Updated weights on worker 0-0, policy_version 931509 (0.00095) [2022-07-10 23:09:42,499][26022] Updated weights on worker 0-0, policy_version 931519 (0.00092) [2022-07-10 23:09:44,140][25689] Fps is (10 sec: 5653.8, 60 sec: 5528.4, 300 sec: 5546.1). Total num frames: 953884672. Throughput: 0: 5813.0. Samples: 953888254. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:44,140][25689] Avg episode reward: [(0, '0.201')] [2022-07-10 23:09:44,219][26022] Updated weights on worker 0-0, policy_version 931529 (0.00095) [2022-07-10 23:09:46,259][26022] Updated weights on worker 0-0, policy_version 931539 (0.00088) [2022-07-10 23:09:47,954][26022] Updated weights on worker 0-0, policy_version 931549 (0.00094) [2022-07-10 23:09:49,141][25689] Fps is (10 sec: 5567.6, 60 sec: 5513.1, 300 sec: 5546.6). Total num frames: 953912320. Throughput: 0: 5812.5. Samples: 953921454. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:49,142][25689] Avg episode reward: [(0, '0.048')] [2022-07-10 23:09:50,010][26022] Updated weights on worker 0-0, policy_version 931559 (0.01517) [2022-07-10 23:09:51,576][26022] Updated weights on worker 0-0, policy_version 931569 (0.00090) [2022-07-10 23:09:53,625][26022] Updated weights on worker 0-0, policy_version 931579 (0.00090) [2022-07-10 23:09:54,160][25689] Fps is (10 sec: 5518.5, 60 sec: 5565.6, 300 sec: 5543.3). Total num frames: 953939968. Throughput: 0: 5828.6. Samples: 953938376. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:54,160][25689] Avg episode reward: [(0, '1.172')] [2022-07-10 23:09:55,183][26022] Updated weights on worker 0-0, policy_version 931589 (0.00084) [2022-07-10 23:09:57,211][26022] Updated weights on worker 0-0, policy_version 931599 (0.00092) [2022-07-10 23:09:58,904][26022] Updated weights on worker 0-0, policy_version 931609 (0.00087) [2022-07-10 23:09:59,200][25689] Fps is (10 sec: 5598.9, 60 sec: 5529.6, 300 sec: 5548.4). Total num frames: 953968640. Throughput: 0: 5831.2. Samples: 953972196. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:09:59,201][25689] Avg episode reward: [(0, '1.395')] [2022-07-10 23:10:00,811][26022] Updated weights on worker 0-0, policy_version 931619 (0.00088) [2022-07-10 23:10:02,918][26022] Updated weights on worker 0-0, policy_version 931629 (0.00092) [2022-07-10 23:10:04,287][25689] Fps is (10 sec: 5359.2, 60 sec: 5546.9, 300 sec: 5544.7). Total num frames: 953994240. Throughput: 0: 5731.1. Samples: 954003718. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:04,287][25689] Avg episode reward: [(0, '1.511')] [2022-07-10 23:10:04,921][26022] Updated weights on worker 0-0, policy_version 931639 (0.00094) [2022-07-10 23:10:06,539][26022] Updated weights on worker 0-0, policy_version 931649 (0.00086) [2022-07-10 23:10:08,637][26022] Updated weights on worker 0-0, policy_version 931659 (0.00092) [2022-07-10 23:10:09,289][25689] Fps is (10 sec: 5379.2, 60 sec: 5552.8, 300 sec: 5548.3). Total num frames: 954022912. Throughput: 0: 4919.8. Samples: 954020580. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:09,290][25689] Avg episode reward: [(0, '1.298')] [2022-07-10 23:10:10,206][26022] Updated weights on worker 0-0, policy_version 931669 (0.00081) [2022-07-10 23:10:12,317][26022] Updated weights on worker 0-0, policy_version 931679 (0.00084) [2022-07-10 23:10:13,870][26022] Updated weights on worker 0-0, policy_version 931689 (0.00111) [2022-07-10 23:10:14,293][25689] Fps is (10 sec: 5628.2, 60 sec: 5521.2, 300 sec: 5544.9). Total num frames: 954050560. Throughput: 0: 5766.1. Samples: 954054470. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:14,294][25689] Avg episode reward: [(0, '1.310')] [2022-07-10 23:10:15,957][26022] Updated weights on worker 0-0, policy_version 931699 (0.00088) [2022-07-10 23:10:17,599][26022] Updated weights on worker 0-0, policy_version 931709 (0.00089) [2022-07-10 23:10:19,335][25689] Fps is (10 sec: 5504.6, 60 sec: 5553.4, 300 sec: 5542.1). Total num frames: 954078208. Throughput: 0: 5742.1. Samples: 954087810. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:19,336][25689] Avg episode reward: [(0, '1.379')] [2022-07-10 23:10:19,415][26022] Updated weights on worker 0-0, policy_version 931719 (0.00091) [2022-07-10 23:10:21,163][26022] Updated weights on worker 0-0, policy_version 931729 (0.00091) [2022-07-10 23:10:23,204][26022] Updated weights on worker 0-0, policy_version 931739 (0.00089) [2022-07-10 23:10:24,396][25689] Fps is (10 sec: 5676.2, 60 sec: 5571.2, 300 sec: 5548.0). Total num frames: 954107904. Throughput: 0: 5010.7. Samples: 954104480. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:24,398][25689] Avg episode reward: [(0, '1.118')] [2022-07-10 23:10:25,147][26022] Updated weights on worker 0-0, policy_version 931749 (0.00090) [2022-07-10 23:10:26,906][26022] Updated weights on worker 0-0, policy_version 931759 (0.00085) [2022-07-10 23:10:28,649][26022] Updated weights on worker 0-0, policy_version 931769 (0.00089) [2022-07-10 23:10:29,475][25689] Fps is (10 sec: 5553.9, 60 sec: 5516.7, 300 sec: 5550.5). Total num frames: 954134528. Throughput: 0: 5801.1. Samples: 954137680. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:29,476][25689] Avg episode reward: [(0, '0.195')] [2022-07-10 23:10:30,738][26022] Updated weights on worker 0-0, policy_version 931779 (0.00090) [2022-07-10 23:10:32,390][26022] Updated weights on worker 0-0, policy_version 931789 (0.00092) [2022-07-10 23:10:34,362][26022] Updated weights on worker 0-0, policy_version 931799 (0.00088) [2022-07-10 23:10:34,510][25689] Fps is (10 sec: 5467.1, 60 sec: 5550.4, 300 sec: 5543.3). Total num frames: 954163200. Throughput: 0: 5766.0. Samples: 954171040. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:34,511][25689] Avg episode reward: [(0, '0.085')] [2022-07-10 23:10:35,929][26022] Updated weights on worker 0-0, policy_version 931809 (0.00094) [2022-07-10 23:10:38,028][26022] Updated weights on worker 0-0, policy_version 931819 (0.00088) [2022-07-10 23:10:39,609][25689] Fps is (10 sec: 5658.7, 60 sec: 5543.9, 300 sec: 5552.9). Total num frames: 954191872. Throughput: 0: 4936.8. Samples: 954187900. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:39,610][25689] Avg episode reward: [(0, '0.025')] [2022-07-10 23:10:39,692][26022] Updated weights on worker 0-0, policy_version 931829 (0.00084) [2022-07-10 23:10:41,720][26022] Updated weights on worker 0-0, policy_version 931839 (0.00085) [2022-07-10 23:10:43,355][26022] Updated weights on worker 0-0, policy_version 931849 (0.00091) [2022-07-10 23:10:44,702][25689] Fps is (10 sec: 5526.2, 60 sec: 5529.0, 300 sec: 5548.0). Total num frames: 954219520. Throughput: 0: 5761.8. Samples: 954221478. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:44,703][25689] Avg episode reward: [(0, '-0.572')] [2022-07-10 23:10:45,318][26022] Updated weights on worker 0-0, policy_version 931859 (0.00446) [2022-07-10 23:10:47,148][26022] Updated weights on worker 0-0, policy_version 931869 (0.00091) [2022-07-10 23:10:49,009][26022] Updated weights on worker 0-0, policy_version 931879 (0.00079) [2022-07-10 23:10:49,731][25689] Fps is (10 sec: 5564.3, 60 sec: 5543.4, 300 sec: 5548.1). Total num frames: 954248192. Throughput: 0: 5773.6. Samples: 954254626. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:49,731][25689] Avg episode reward: [(0, '-0.853')] [2022-07-10 23:10:50,979][26022] Updated weights on worker 0-0, policy_version 931889 (0.00114) [2022-07-10 23:10:52,660][26022] Updated weights on worker 0-0, policy_version 931899 (0.00085) [2022-07-10 23:10:54,497][26022] Updated weights on worker 0-0, policy_version 931909 (0.00087) [2022-07-10 23:10:54,740][25689] Fps is (10 sec: 5610.6, 60 sec: 5544.2, 300 sec: 5548.3). Total num frames: 954275840. Throughput: 0: 4955.1. Samples: 954271280. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:54,740][25689] Avg episode reward: [(0, '-0.456')] [2022-07-10 23:10:56,562][26022] Updated weights on worker 0-0, policy_version 931919 (0.00090) [2022-07-10 23:10:58,215][26022] Updated weights on worker 0-0, policy_version 931929 (0.00083) [2022-07-10 23:10:59,760][25689] Fps is (10 sec: 5411.4, 60 sec: 5512.3, 300 sec: 5548.6). Total num frames: 954302464. Throughput: 0: 5796.3. Samples: 954304702. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:10:59,761][25689] Avg episode reward: [(0, '0.513')] [2022-07-10 23:11:00,173][26022] Updated weights on worker 0-0, policy_version 931939 (0.00093) [2022-07-10 23:11:01,840][26022] Updated weights on worker 0-0, policy_version 931949 (0.00094) [2022-07-10 23:11:04,112][26022] Updated weights on worker 0-0, policy_version 931959 (0.00088) [2022-07-10 23:11:04,815][25689] Fps is (10 sec: 5386.8, 60 sec: 5549.0, 300 sec: 5544.4). Total num frames: 954330112. Throughput: 0: 5694.5. Samples: 954336012. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:04,816][25689] Avg episode reward: [(0, '0.318')] [2022-07-10 23:11:05,974][26022] Updated weights on worker 0-0, policy_version 931969 (0.00088) [2022-07-10 23:11:07,893][26022] Updated weights on worker 0-0, policy_version 931979 (0.00089) [2022-07-10 23:11:09,617][26022] Updated weights on worker 0-0, policy_version 931989 (0.00085) [2022-07-10 23:11:09,827][25689] Fps is (10 sec: 5492.7, 60 sec: 5531.2, 300 sec: 5547.7). Total num frames: 954357760. Throughput: 0: 4889.3. Samples: 954352884. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:09,828][25689] Avg episode reward: [(0, '0.484')] [2022-07-10 23:11:11,537][26022] Updated weights on worker 0-0, policy_version 931999 (0.00096) [2022-07-10 23:11:13,318][26022] Updated weights on worker 0-0, policy_version 932009 (0.00093) [2022-07-10 23:11:14,332][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:11:14,352][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000932014_954382336.pth [2022-07-10 23:11:14,353][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000930062_952383488.pth [2022-07-10 23:11:14,867][25689] Fps is (10 sec: 5399.1, 60 sec: 5511.0, 300 sec: 5540.4). Total num frames: 954384384. Throughput: 0: 5705.0. Samples: 954386106. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:14,868][25689] Avg episode reward: [(0, '1.297')] [2022-07-10 23:11:15,211][26022] Updated weights on worker 0-0, policy_version 932019 (0.00100) [2022-07-10 23:11:16,863][26022] Updated weights on worker 0-0, policy_version 932029 (0.00089) [2022-07-10 23:11:18,893][26022] Updated weights on worker 0-0, policy_version 932039 (0.00094) [2022-07-10 23:11:19,881][25689] Fps is (10 sec: 5500.1, 60 sec: 5530.5, 300 sec: 5541.5). Total num frames: 954413056. Throughput: 0: 5685.1. Samples: 954419092. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:19,881][25689] Avg episode reward: [(0, '1.161')] [2022-07-10 23:11:20,768][26022] Updated weights on worker 0-0, policy_version 932049 (0.00088) [2022-07-10 23:11:22,631][26022] Updated weights on worker 0-0, policy_version 932059 (0.00086) [2022-07-10 23:11:24,475][26022] Updated weights on worker 0-0, policy_version 932069 (0.00083) [2022-07-10 23:11:25,010][25689] Fps is (10 sec: 5653.8, 60 sec: 5507.4, 300 sec: 5546.4). Total num frames: 954441728. Throughput: 0: 4946.0. Samples: 954435894. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:25,010][25689] Avg episode reward: [(0, '0.626')] [2022-07-10 23:11:26,553][26022] Updated weights on worker 0-0, policy_version 932079 (0.00087) [2022-07-10 23:11:28,113][26022] Updated weights on worker 0-0, policy_version 932089 (0.00552) [2022-07-10 23:11:30,041][25689] Fps is (10 sec: 5442.7, 60 sec: 5511.8, 300 sec: 5539.1). Total num frames: 954468352. Throughput: 0: 5764.8. Samples: 954469410. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:30,041][25689] Avg episode reward: [(0, '0.794')] [2022-07-10 23:11:30,136][26022] Updated weights on worker 0-0, policy_version 932099 (0.00084) [2022-07-10 23:11:31,727][26022] Updated weights on worker 0-0, policy_version 932109 (0.00092) [2022-07-10 23:11:33,762][26022] Updated weights on worker 0-0, policy_version 932119 (0.00085) [2022-07-10 23:11:35,086][25689] Fps is (10 sec: 5589.4, 60 sec: 5527.8, 300 sec: 5545.6). Total num frames: 954498048. Throughput: 0: 5783.6. Samples: 954503044. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:35,086][25689] Avg episode reward: [(0, '0.558')] [2022-07-10 23:11:35,411][26022] Updated weights on worker 0-0, policy_version 932129 (0.00088) [2022-07-10 23:11:37,211][26022] Updated weights on worker 0-0, policy_version 932139 (0.00091) [2022-07-10 23:11:38,935][26022] Updated weights on worker 0-0, policy_version 932149 (0.00092) [2022-07-10 23:11:40,096][25689] Fps is (10 sec: 5702.9, 60 sec: 5519.0, 300 sec: 5542.9). Total num frames: 954525696. Throughput: 0: 4991.8. Samples: 954520000. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:40,098][25689] Avg episode reward: [(0, '0.185')] [2022-07-10 23:11:41,097][26022] Updated weights on worker 0-0, policy_version 932159 (0.00095) [2022-07-10 23:11:42,643][26022] Updated weights on worker 0-0, policy_version 932169 (0.00084) [2022-07-10 23:11:44,864][26022] Updated weights on worker 0-0, policy_version 932179 (0.00087) [2022-07-10 23:11:45,238][25689] Fps is (10 sec: 5446.7, 60 sec: 5514.4, 300 sec: 5537.2). Total num frames: 954553344. Throughput: 0: 5814.6. Samples: 954553514. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:45,239][25689] Avg episode reward: [(0, '0.232')] [2022-07-10 23:11:46,283][26022] Updated weights on worker 0-0, policy_version 932189 (0.00090) [2022-07-10 23:11:48,484][26022] Updated weights on worker 0-0, policy_version 932199 (0.00087) [2022-07-10 23:11:50,030][26022] Updated weights on worker 0-0, policy_version 932209 (0.00095) [2022-07-10 23:11:50,248][25689] Fps is (10 sec: 5648.5, 60 sec: 5533.1, 300 sec: 5544.2). Total num frames: 954583040. Throughput: 0: 5824.8. Samples: 954587114. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:50,250][25689] Avg episode reward: [(0, '0.021')] [2022-07-10 23:11:51,997][26022] Updated weights on worker 0-0, policy_version 932219 (0.00089) [2022-07-10 23:11:53,643][26022] Updated weights on worker 0-0, policy_version 932229 (0.00088) [2022-07-10 23:11:55,277][25689] Fps is (10 sec: 5711.9, 60 sec: 5531.3, 300 sec: 5544.2). Total num frames: 954610688. Throughput: 0: 4993.0. Samples: 954603860. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:11:55,278][25689] Avg episode reward: [(0, '0.195')] [2022-07-10 23:11:55,583][26022] Updated weights on worker 0-0, policy_version 932239 (0.00092) [2022-07-10 23:11:57,374][26022] Updated weights on worker 0-0, policy_version 932249 (0.00085) [2022-07-10 23:11:59,221][26022] Updated weights on worker 0-0, policy_version 932259 (0.00101) [2022-07-10 23:12:00,305][25689] Fps is (10 sec: 5498.1, 60 sec: 5547.5, 300 sec: 5546.1). Total num frames: 954638336. Throughput: 0: 5816.1. Samples: 954637540. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:12:00,306][25689] Avg episode reward: [(0, '0.659')] [2022-07-10 23:12:00,930][26022] Updated weights on worker 0-0, policy_version 932269 (0.00091) [2022-07-10 23:12:03,464][26022] Updated weights on worker 0-0, policy_version 932279 (0.00086) [2022-07-10 23:12:05,032][26022] Updated weights on worker 0-0, policy_version 932289 (0.00091) [2022-07-10 23:12:05,353][25689] Fps is (10 sec: 5386.4, 60 sec: 5531.2, 300 sec: 5541.9). Total num frames: 954664960. Throughput: 0: 5753.9. Samples: 954669254. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:12:05,354][25689] Avg episode reward: [(0, '-0.402')] [2022-07-10 23:12:06,886][26022] Updated weights on worker 0-0, policy_version 932299 (0.00085) [2022-07-10 23:12:08,853][26022] Updated weights on worker 0-0, policy_version 932309 (0.00084) [2022-07-10 23:12:10,442][25689] Fps is (10 sec: 5454.9, 60 sec: 5541.1, 300 sec: 5542.0). Total num frames: 954693632. Throughput: 0: 5742.2. Samples: 954703074. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-10 23:12:10,443][25689] Avg episode reward: [(0, '-0.508')] [2022-07-10 23:12:10,465][26022] Updated weights on worker 0-0, policy_version 932319 (0.00076) [2022-07-10 23:12:12,454][26022] Updated weights on worker 0-0, policy_version 932329 (0.00097) [2022-07-10 23:12:14,132][26022] Updated weights on worker 0-0, policy_version 932339 (0.00095) [2022-07-10 23:12:15,481][25689] Fps is (10 sec: 5560.7, 60 sec: 5558.0, 300 sec: 5541.8). Total num frames: 954721280. Throughput: 0: 5739.3. Samples: 954719816. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:15,482][25689] Avg episode reward: [(0, '-1.181')] [2022-07-10 23:12:16,057][26022] Updated weights on worker 0-0, policy_version 932349 (0.00085) [2022-07-10 23:12:18,311][26022] Updated weights on worker 0-0, policy_version 932359 (0.00100) [2022-07-10 23:12:19,677][26022] Updated weights on worker 0-0, policy_version 932369 (0.00105) [2022-07-10 23:12:20,557][25689] Fps is (10 sec: 5467.0, 60 sec: 5535.5, 300 sec: 5538.9). Total num frames: 954748928. Throughput: 0: 5703.1. Samples: 954753036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:20,557][25689] Avg episode reward: [(0, '-1.083')] [2022-07-10 23:12:21,899][26022] Updated weights on worker 0-0, policy_version 932379 (0.00086) [2022-07-10 23:12:23,456][26022] Updated weights on worker 0-0, policy_version 932389 (0.00088) [2022-07-10 23:12:25,493][26022] Updated weights on worker 0-0, policy_version 932399 (0.00087) [2022-07-10 23:12:25,618][25689] Fps is (10 sec: 5454.9, 60 sec: 5524.8, 300 sec: 5534.8). Total num frames: 954776576. Throughput: 0: 5791.8. Samples: 954786626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:25,619][25689] Avg episode reward: [(0, '-1.168')] [2022-07-10 23:12:27,077][26022] Updated weights on worker 0-0, policy_version 932409 (0.00091) [2022-07-10 23:12:29,098][26022] Updated weights on worker 0-0, policy_version 932419 (0.00083) [2022-07-10 23:12:30,681][25689] Fps is (10 sec: 5664.2, 60 sec: 5572.5, 300 sec: 5541.3). Total num frames: 954806272. Throughput: 0: 4959.2. Samples: 954803440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:30,681][25689] Avg episode reward: [(0, '0.179')] [2022-07-10 23:12:30,720][26022] Updated weights on worker 0-0, policy_version 932429 (0.00088) [2022-07-10 23:12:32,924][26022] Updated weights on worker 0-0, policy_version 932439 (0.00111) [2022-07-10 23:12:34,413][26022] Updated weights on worker 0-0, policy_version 932449 (0.00086) [2022-07-10 23:12:35,736][25689] Fps is (10 sec: 5566.4, 60 sec: 5521.0, 300 sec: 5534.1). Total num frames: 954832896. Throughput: 0: 5769.6. Samples: 954836678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:35,737][25689] Avg episode reward: [(0, '0.175')] [2022-07-10 23:12:36,644][26022] Updated weights on worker 0-0, policy_version 932459 (0.00091) [2022-07-10 23:12:38,068][26022] Updated weights on worker 0-0, policy_version 932469 (0.00063) [2022-07-10 23:12:40,268][26022] Updated weights on worker 0-0, policy_version 932479 (0.00087) [2022-07-10 23:12:40,767][25689] Fps is (10 sec: 5482.2, 60 sec: 5535.9, 300 sec: 5541.8). Total num frames: 954861568. Throughput: 0: 5795.3. Samples: 954870160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:40,768][25689] Avg episode reward: [(0, '0.837')] [2022-07-10 23:12:41,902][26022] Updated weights on worker 0-0, policy_version 932489 (0.00088) [2022-07-10 23:12:43,928][26022] Updated weights on worker 0-0, policy_version 932499 (0.00089) [2022-07-10 23:12:45,512][26022] Updated weights on worker 0-0, policy_version 932509 (0.00085) [2022-07-10 23:12:45,873][25689] Fps is (10 sec: 5758.2, 60 sec: 5573.1, 300 sec: 5540.1). Total num frames: 954891264. Throughput: 0: 4953.8. Samples: 954886962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:45,873][25689] Avg episode reward: [(0, '0.653')] [2022-07-10 23:12:47,541][26022] Updated weights on worker 0-0, policy_version 932519 (0.00093) [2022-07-10 23:12:49,016][26022] Updated weights on worker 0-0, policy_version 932529 (0.00090) [2022-07-10 23:12:50,892][25689] Fps is (10 sec: 5461.2, 60 sec: 5504.6, 300 sec: 5531.1). Total num frames: 954916864. Throughput: 0: 5776.8. Samples: 954920198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:50,893][25689] Avg episode reward: [(0, '0.066')] [2022-07-10 23:12:51,346][26022] Updated weights on worker 0-0, policy_version 932539 (0.00092) [2022-07-10 23:12:52,976][26022] Updated weights on worker 0-0, policy_version 932549 (0.00089) [2022-07-10 23:12:54,849][26022] Updated weights on worker 0-0, policy_version 932559 (0.00090) [2022-07-10 23:12:55,927][25689] Fps is (10 sec: 5397.9, 60 sec: 5521.0, 300 sec: 5534.2). Total num frames: 954945536. Throughput: 0: 5776.1. Samples: 954953302. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:12:55,928][25689] Avg episode reward: [(0, '0.055')] [2022-07-10 23:12:56,669][26022] Updated weights on worker 0-0, policy_version 932569 (0.00081) [2022-07-10 23:12:58,525][26022] Updated weights on worker 0-0, policy_version 932579 (0.00094) [2022-07-10 23:13:00,335][26022] Updated weights on worker 0-0, policy_version 932589 (0.00087) [2022-07-10 23:13:01,000][25689] Fps is (10 sec: 5775.0, 60 sec: 5550.7, 300 sec: 5544.9). Total num frames: 954975232. Throughput: 0: 4937.9. Samples: 954970066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:01,000][25689] Avg episode reward: [(0, '0.034')] [2022-07-10 23:13:02,800][26022] Updated weights on worker 0-0, policy_version 932599 (0.00096) [2022-07-10 23:13:04,175][26022] Updated weights on worker 0-0, policy_version 932609 (0.01043) [2022-07-10 23:13:06,120][25689] Fps is (10 sec: 5223.9, 60 sec: 5493.5, 300 sec: 5525.6). Total num frames: 954998784. Throughput: 0: 5671.5. Samples: 955001794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:06,121][25689] Avg episode reward: [(0, '0.146')] [2022-07-10 23:13:06,480][26022] Updated weights on worker 0-0, policy_version 932619 (0.00090) [2022-07-10 23:13:07,941][26022] Updated weights on worker 0-0, policy_version 932629 (0.00089) [2022-07-10 23:13:09,896][26022] Updated weights on worker 0-0, policy_version 932639 (0.00089) [2022-07-10 23:13:11,150][25689] Fps is (10 sec: 5346.8, 60 sec: 5532.6, 300 sec: 5539.1). Total num frames: 955029504. Throughput: 0: 5678.9. Samples: 955035238. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:11,151][25689] Avg episode reward: [(0, '0.515')] [2022-07-10 23:13:11,915][26022] Updated weights on worker 0-0, policy_version 932649 (0.00083) [2022-07-10 23:13:13,461][26022] Updated weights on worker 0-0, policy_version 932659 (0.00087) [2022-07-10 23:13:14,636][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:13:14,645][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000932664_955047936.pth [2022-07-10 23:13:14,646][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000930715_953052160.pth [2022-07-10 23:13:15,674][26022] Updated weights on worker 0-0, policy_version 932669 (0.00094) [2022-07-10 23:13:16,179][25689] Fps is (10 sec: 5802.5, 60 sec: 5533.5, 300 sec: 5535.3). Total num frames: 955057152. Throughput: 0: 4877.6. Samples: 955052084. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:16,179][25689] Avg episode reward: [(0, '1.032')] [2022-07-10 23:13:17,345][26022] Updated weights on worker 0-0, policy_version 932679 (0.00071) [2022-07-10 23:13:19,199][26022] Updated weights on worker 0-0, policy_version 932689 (0.00094) [2022-07-10 23:13:21,104][26022] Updated weights on worker 0-0, policy_version 932699 (0.00086) [2022-07-10 23:13:21,196][25689] Fps is (10 sec: 5402.1, 60 sec: 5521.9, 300 sec: 5533.8). Total num frames: 955083776. Throughput: 0: 5709.6. Samples: 955085382. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:21,197][25689] Avg episode reward: [(0, '0.779')] [2022-07-10 23:13:22,851][26022] Updated weights on worker 0-0, policy_version 932709 (0.00087) [2022-07-10 23:13:24,631][26022] Updated weights on worker 0-0, policy_version 932719 (0.00084) [2022-07-10 23:13:26,253][25689] Fps is (10 sec: 5590.6, 60 sec: 5556.2, 300 sec: 5534.1). Total num frames: 955113472. Throughput: 0: 5813.6. Samples: 955118840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:26,254][25689] Avg episode reward: [(0, '0.281')] [2022-07-10 23:13:26,506][26022] Updated weights on worker 0-0, policy_version 932729 (0.00093) [2022-07-10 23:13:28,570][26022] Updated weights on worker 0-0, policy_version 932739 (0.00096) [2022-07-10 23:13:30,185][26022] Updated weights on worker 0-0, policy_version 932749 (0.00094) [2022-07-10 23:13:31,346][25689] Fps is (10 sec: 5548.8, 60 sec: 5502.7, 300 sec: 5530.2). Total num frames: 955140096. Throughput: 0: 4961.1. Samples: 955135434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:31,347][25689] Avg episode reward: [(0, '-0.588')] [2022-07-10 23:13:32,160][26022] Updated weights on worker 0-0, policy_version 932759 (0.00095) [2022-07-10 23:13:33,864][26022] Updated weights on worker 0-0, policy_version 932769 (0.00098) [2022-07-10 23:13:35,686][26022] Updated weights on worker 0-0, policy_version 932779 (0.00090) [2022-07-10 23:13:36,403][25689] Fps is (10 sec: 5549.1, 60 sec: 5553.3, 300 sec: 5533.9). Total num frames: 955169792. Throughput: 0: 5780.3. Samples: 955168982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:36,403][25689] Avg episode reward: [(0, '-0.606')] [2022-07-10 23:13:37,866][26022] Updated weights on worker 0-0, policy_version 932789 (0.00085) [2022-07-10 23:13:39,303][26022] Updated weights on worker 0-0, policy_version 932799 (0.00091) [2022-07-10 23:13:41,415][25689] Fps is (10 sec: 5492.1, 60 sec: 5504.3, 300 sec: 5524.9). Total num frames: 955195392. Throughput: 0: 5788.3. Samples: 955202412. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:41,415][25689] Avg episode reward: [(0, '-0.766')] [2022-07-10 23:13:41,420][26022] Updated weights on worker 0-0, policy_version 932809 (0.00092) [2022-07-10 23:13:43,067][26022] Updated weights on worker 0-0, policy_version 932819 (0.00095) [2022-07-10 23:13:44,888][26022] Updated weights on worker 0-0, policy_version 932829 (0.00092) [2022-07-10 23:13:46,529][25689] Fps is (10 sec: 5460.8, 60 sec: 5503.6, 300 sec: 5526.6). Total num frames: 955225088. Throughput: 0: 4953.3. Samples: 955219270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:46,529][25689] Avg episode reward: [(0, '-0.007')] [2022-07-10 23:13:46,724][26022] Updated weights on worker 0-0, policy_version 932839 (0.00087) [2022-07-10 23:13:48,429][26022] Updated weights on worker 0-0, policy_version 932849 (0.00086) [2022-07-10 23:13:50,456][26022] Updated weights on worker 0-0, policy_version 932859 (0.00085) [2022-07-10 23:13:51,543][25689] Fps is (10 sec: 5762.8, 60 sec: 5554.7, 300 sec: 5540.8). Total num frames: 955253760. Throughput: 0: 5820.5. Samples: 955252990. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:51,544][25689] Avg episode reward: [(0, '0.556')] [2022-07-10 23:13:52,180][26022] Updated weights on worker 0-0, policy_version 932869 (0.00086) [2022-07-10 23:13:54,161][26022] Updated weights on worker 0-0, policy_version 932879 (0.00091) [2022-07-10 23:13:55,859][26022] Updated weights on worker 0-0, policy_version 932889 (0.00089) [2022-07-10 23:13:56,589][25689] Fps is (10 sec: 5394.8, 60 sec: 5503.0, 300 sec: 5523.1). Total num frames: 955279360. Throughput: 0: 5787.8. Samples: 955285814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:13:56,589][25689] Avg episode reward: [(0, '1.182')] [2022-07-10 23:13:57,791][26022] Updated weights on worker 0-0, policy_version 932899 (0.00097) [2022-07-10 23:13:59,908][26022] Updated weights on worker 0-0, policy_version 932909 (0.00096) [2022-07-10 23:14:01,394][26022] Updated weights on worker 0-0, policy_version 932919 (0.00086) [2022-07-10 23:14:01,644][25689] Fps is (10 sec: 5575.8, 60 sec: 5521.5, 300 sec: 5544.4). Total num frames: 955310080. Throughput: 0: 4956.7. Samples: 955302682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:01,644][25689] Avg episode reward: [(0, '1.525')] [2022-07-10 23:14:03,809][26022] Updated weights on worker 0-0, policy_version 932929 (0.00094) [2022-07-10 23:14:05,403][26022] Updated weights on worker 0-0, policy_version 932939 (0.00077) [2022-07-10 23:14:06,758][25689] Fps is (10 sec: 5638.9, 60 sec: 5572.7, 300 sec: 5536.6). Total num frames: 955336704. Throughput: 0: 5683.7. Samples: 955334248. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:06,759][25689] Avg episode reward: [(0, '1.313')] [2022-07-10 23:14:07,506][26022] Updated weights on worker 0-0, policy_version 932949 (0.00088) [2022-07-10 23:14:09,107][26022] Updated weights on worker 0-0, policy_version 932959 (0.00086) [2022-07-10 23:14:11,055][26022] Updated weights on worker 0-0, policy_version 932969 (0.00079) [2022-07-10 23:14:11,766][25689] Fps is (10 sec: 5361.5, 60 sec: 5524.1, 300 sec: 5530.1). Total num frames: 955364352. Throughput: 0: 5669.9. Samples: 955367654. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:11,768][25689] Avg episode reward: [(0, '0.613')] [2022-07-10 23:14:12,648][26022] Updated weights on worker 0-0, policy_version 932979 (0.00092) [2022-07-10 23:14:14,678][26022] Updated weights on worker 0-0, policy_version 932989 (0.00093) [2022-07-10 23:14:16,443][26022] Updated weights on worker 0-0, policy_version 932999 (0.00093) [2022-07-10 23:14:16,794][25689] Fps is (10 sec: 5509.6, 60 sec: 5524.1, 300 sec: 5536.9). Total num frames: 955392000. Throughput: 0: 5720.3. Samples: 955401396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:16,796][25689] Avg episode reward: [(0, '-0.221')] [2022-07-10 23:14:18,426][26022] Updated weights on worker 0-0, policy_version 933009 (0.00084) [2022-07-10 23:14:20,082][26022] Updated weights on worker 0-0, policy_version 933019 (0.00087) [2022-07-10 23:14:21,799][25689] Fps is (10 sec: 5511.8, 60 sec: 5542.2, 300 sec: 5534.7). Total num frames: 955419648. Throughput: 0: 5727.8. Samples: 955418124. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:21,800][25689] Avg episode reward: [(0, '-1.083')] [2022-07-10 23:14:22,087][26022] Updated weights on worker 0-0, policy_version 933029 (0.00085) [2022-07-10 23:14:23,778][26022] Updated weights on worker 0-0, policy_version 933039 (0.00085) [2022-07-10 23:14:25,828][26022] Updated weights on worker 0-0, policy_version 933049 (0.00090) [2022-07-10 23:14:26,886][25689] Fps is (10 sec: 5581.0, 60 sec: 5522.6, 300 sec: 5530.4). Total num frames: 955448320. Throughput: 0: 5825.9. Samples: 955451510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:26,886][25689] Avg episode reward: [(0, '-1.177')] [2022-07-10 23:14:27,531][26022] Updated weights on worker 0-0, policy_version 933059 (0.00087) [2022-07-10 23:14:29,460][26022] Updated weights on worker 0-0, policy_version 933069 (0.00086) [2022-07-10 23:14:31,117][26022] Updated weights on worker 0-0, policy_version 933079 (0.00086) [2022-07-10 23:14:31,906][25689] Fps is (10 sec: 5572.0, 60 sec: 5546.1, 300 sec: 5534.0). Total num frames: 955475968. Throughput: 0: 5815.7. Samples: 955484782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:31,907][25689] Avg episode reward: [(0, '-1.407')] [2022-07-10 23:14:33,114][26022] Updated weights on worker 0-0, policy_version 933089 (0.00094) [2022-07-10 23:14:34,944][26022] Updated weights on worker 0-0, policy_version 933099 (0.00089) [2022-07-10 23:14:36,684][26022] Updated weights on worker 0-0, policy_version 933109 (0.00088) [2022-07-10 23:14:36,912][25689] Fps is (10 sec: 5617.3, 60 sec: 5533.8, 300 sec: 5534.5). Total num frames: 955504640. Throughput: 0: 4982.5. Samples: 955501632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:36,912][25689] Avg episode reward: [(0, '-1.640')] [2022-07-10 23:14:38,536][26022] Updated weights on worker 0-0, policy_version 933119 (0.00087) [2022-07-10 23:14:40,411][26022] Updated weights on worker 0-0, policy_version 933129 (0.00086) [2022-07-10 23:14:41,914][25689] Fps is (10 sec: 5730.2, 60 sec: 5585.6, 300 sec: 5536.5). Total num frames: 955533312. Throughput: 0: 5814.4. Samples: 955535080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:41,914][25689] Avg episode reward: [(0, '-1.013')] [2022-07-10 23:14:42,035][26022] Updated weights on worker 0-0, policy_version 933139 (0.00091) [2022-07-10 23:14:44,241][26022] Updated weights on worker 0-0, policy_version 933149 (0.00088) [2022-07-10 23:14:45,796][26022] Updated weights on worker 0-0, policy_version 933159 (0.00084) [2022-07-10 23:14:47,039][25689] Fps is (10 sec: 5460.2, 60 sec: 5533.7, 300 sec: 5530.8). Total num frames: 955559936. Throughput: 0: 5827.7. Samples: 955568956. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:47,039][25689] Avg episode reward: [(0, '-0.558')] [2022-07-10 23:14:47,876][26022] Updated weights on worker 0-0, policy_version 933169 (0.00089) [2022-07-10 23:14:49,363][26022] Updated weights on worker 0-0, policy_version 933179 (0.00084) [2022-07-10 23:14:51,426][26022] Updated weights on worker 0-0, policy_version 933189 (0.00423) [2022-07-10 23:14:52,057][25689] Fps is (10 sec: 5451.2, 60 sec: 5533.4, 300 sec: 5534.2). Total num frames: 955588608. Throughput: 0: 5018.3. Samples: 955585906. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:52,058][25689] Avg episode reward: [(0, '-0.479')] [2022-07-10 23:14:53,210][26022] Updated weights on worker 0-0, policy_version 933199 (0.00089) [2022-07-10 23:14:55,129][26022] Updated weights on worker 0-0, policy_version 933209 (0.00086) [2022-07-10 23:14:56,854][26022] Updated weights on worker 0-0, policy_version 933219 (0.00086) [2022-07-10 23:14:57,119][25689] Fps is (10 sec: 5688.6, 60 sec: 5582.6, 300 sec: 5533.9). Total num frames: 955617280. Throughput: 0: 5834.4. Samples: 955619532. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:14:57,120][25689] Avg episode reward: [(0, '-1.199')] [2022-07-10 23:14:58,797][26022] Updated weights on worker 0-0, policy_version 933229 (0.00092) [2022-07-10 23:15:00,465][26022] Updated weights on worker 0-0, policy_version 933239 (0.00084) [2022-07-10 23:15:02,131][25689] Fps is (10 sec: 5489.4, 60 sec: 5519.0, 300 sec: 5538.7). Total num frames: 955643904. Throughput: 0: 5835.1. Samples: 955653048. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:02,131][25689] Avg episode reward: [(0, '-0.667')] [2022-07-10 23:15:02,776][26022] Updated weights on worker 0-0, policy_version 933249 (0.00111) [2022-07-10 23:15:04,495][26022] Updated weights on worker 0-0, policy_version 933259 (0.00082) [2022-07-10 23:15:06,554][26022] Updated weights on worker 0-0, policy_version 933269 (0.00090) [2022-07-10 23:15:07,225][25689] Fps is (10 sec: 5269.2, 60 sec: 5520.8, 300 sec: 5530.1). Total num frames: 955670528. Throughput: 0: 4886.2. Samples: 955667590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:07,225][25689] Avg episode reward: [(0, '-1.866')] [2022-07-10 23:15:08,095][26022] Updated weights on worker 0-0, policy_version 933279 (0.00087) [2022-07-10 23:15:10,241][26022] Updated weights on worker 0-0, policy_version 933289 (0.00091) [2022-07-10 23:15:11,715][26022] Updated weights on worker 0-0, policy_version 933299 (0.00078) [2022-07-10 23:15:12,291][25689] Fps is (10 sec: 5543.0, 60 sec: 5549.3, 300 sec: 5535.8). Total num frames: 955700224. Throughput: 0: 5693.6. Samples: 955701110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:12,292][25689] Avg episode reward: [(0, '-1.622')] [2022-07-10 23:15:13,899][26022] Updated weights on worker 0-0, policy_version 933309 (0.00101) [2022-07-10 23:15:14,650][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:15:14,664][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000933314_955713536.pth [2022-07-10 23:15:14,664][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000931367_953719808.pth [2022-07-10 23:15:15,532][26022] Updated weights on worker 0-0, policy_version 933319 (0.00086) [2022-07-10 23:15:17,297][25689] Fps is (10 sec: 5693.5, 60 sec: 5551.4, 300 sec: 5536.5). Total num frames: 955727872. Throughput: 0: 5706.3. Samples: 955734670. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:17,297][25689] Avg episode reward: [(0, '-0.641')] [2022-07-10 23:15:17,436][26022] Updated weights on worker 0-0, policy_version 933329 (0.00086) [2022-07-10 23:15:19,335][26022] Updated weights on worker 0-0, policy_version 933339 (0.00090) [2022-07-10 23:15:21,082][26022] Updated weights on worker 0-0, policy_version 933349 (0.00086) [2022-07-10 23:15:22,315][25689] Fps is (10 sec: 5414.5, 60 sec: 5533.2, 300 sec: 5527.0). Total num frames: 955754496. Throughput: 0: 4886.6. Samples: 955751680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:22,315][25689] Avg episode reward: [(0, '0.279')] [2022-07-10 23:15:22,963][26022] Updated weights on worker 0-0, policy_version 933359 (0.00085) [2022-07-10 23:15:24,738][26022] Updated weights on worker 0-0, policy_version 933369 (0.00363) [2022-07-10 23:15:26,536][26022] Updated weights on worker 0-0, policy_version 933379 (0.00086) [2022-07-10 23:15:27,386][25689] Fps is (10 sec: 5582.0, 60 sec: 5551.5, 300 sec: 5537.4). Total num frames: 955784192. Throughput: 0: 5833.7. Samples: 955785206. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:27,387][25689] Avg episode reward: [(0, '-0.024')] [2022-07-10 23:15:28,710][26022] Updated weights on worker 0-0, policy_version 933389 (0.00091) [2022-07-10 23:15:30,367][26022] Updated weights on worker 0-0, policy_version 933399 (0.00085) [2022-07-10 23:15:32,301][26022] Updated weights on worker 0-0, policy_version 933409 (0.00085) [2022-07-10 23:15:32,455][25689] Fps is (10 sec: 5756.3, 60 sec: 5564.1, 300 sec: 5536.8). Total num frames: 955812864. Throughput: 0: 5824.6. Samples: 955818552. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:32,456][25689] Avg episode reward: [(0, '0.269')] [2022-07-10 23:15:33,894][26022] Updated weights on worker 0-0, policy_version 933419 (0.00089) [2022-07-10 23:15:35,853][26022] Updated weights on worker 0-0, policy_version 933429 (0.00085) [2022-07-10 23:15:37,548][25689] Fps is (10 sec: 5441.5, 60 sec: 5522.2, 300 sec: 5530.0). Total num frames: 955839488. Throughput: 0: 4966.4. Samples: 955835246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:37,549][25689] Avg episode reward: [(0, '0.438')] [2022-07-10 23:15:37,803][26022] Updated weights on worker 0-0, policy_version 933439 (0.00090) [2022-07-10 23:15:39,487][26022] Updated weights on worker 0-0, policy_version 933449 (0.00088) [2022-07-10 23:15:41,416][26022] Updated weights on worker 0-0, policy_version 933459 (0.00081) [2022-07-10 23:15:42,599][25689] Fps is (10 sec: 5551.5, 60 sec: 5534.6, 300 sec: 5537.7). Total num frames: 955869184. Throughput: 0: 5772.0. Samples: 955868764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:42,600][25689] Avg episode reward: [(0, '0.126')] [2022-07-10 23:15:43,178][26022] Updated weights on worker 0-0, policy_version 933469 (0.00089) [2022-07-10 23:15:45,168][26022] Updated weights on worker 0-0, policy_version 933479 (0.00098) [2022-07-10 23:15:46,891][26022] Updated weights on worker 0-0, policy_version 933489 (0.00098) [2022-07-10 23:15:47,687][25689] Fps is (10 sec: 5656.0, 60 sec: 5554.9, 300 sec: 5533.2). Total num frames: 955896832. Throughput: 0: 5763.0. Samples: 955902198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:47,687][25689] Avg episode reward: [(0, '0.102')] [2022-07-10 23:15:48,641][26022] Updated weights on worker 0-0, policy_version 933499 (0.00092) [2022-07-10 23:15:50,614][26022] Updated weights on worker 0-0, policy_version 933509 (0.00093) [2022-07-10 23:15:52,548][26022] Updated weights on worker 0-0, policy_version 933519 (0.00085) [2022-07-10 23:15:52,694][25689] Fps is (10 sec: 5477.8, 60 sec: 5539.1, 300 sec: 5533.2). Total num frames: 955924480. Throughput: 0: 4952.3. Samples: 955918784. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:52,695][25689] Avg episode reward: [(0, '0.204')] [2022-07-10 23:15:54,310][26022] Updated weights on worker 0-0, policy_version 933529 (0.00093) [2022-07-10 23:15:56,184][26022] Updated weights on worker 0-0, policy_version 933539 (0.00086) [2022-07-10 23:15:57,711][25689] Fps is (10 sec: 5516.1, 60 sec: 5526.3, 300 sec: 5536.7). Total num frames: 955952128. Throughput: 0: 5788.9. Samples: 955951968. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:15:57,712][25689] Avg episode reward: [(0, '0.338')] [2022-07-10 23:15:58,035][26022] Updated weights on worker 0-0, policy_version 933549 (0.00084) [2022-07-10 23:15:59,673][26022] Updated weights on worker 0-0, policy_version 933559 (0.00091) [2022-07-10 23:16:01,887][26022] Updated weights on worker 0-0, policy_version 933569 (0.00101) [2022-07-10 23:16:02,721][25689] Fps is (10 sec: 5412.7, 60 sec: 5526.4, 300 sec: 5534.1). Total num frames: 955978752. Throughput: 0: 5751.3. Samples: 955984488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-10 23:16:02,721][25689] Avg episode reward: [(0, '0.288')] [2022-07-10 23:16:03,955][26022] Updated weights on worker 0-0, policy_version 933579 (0.00094) [2022-07-10 23:16:05,700][26022] Updated weights on worker 0-0, policy_version 933589 (0.00085) [2022-07-10 23:16:07,606][26022] Updated weights on worker 0-0, policy_version 933599 (0.00087) [2022-07-10 23:16:07,781][25689] Fps is (10 sec: 5389.5, 60 sec: 5546.4, 300 sec: 5533.2). Total num frames: 956006400. Throughput: 0: 4867.3. Samples: 956000004. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:07,782][25689] Avg episode reward: [(0, '0.010')] [2022-07-10 23:16:09,249][26022] Updated weights on worker 0-0, policy_version 933609 (0.00081) [2022-07-10 23:16:11,314][26022] Updated weights on worker 0-0, policy_version 933619 (0.00081) [2022-07-10 23:16:12,879][25689] Fps is (10 sec: 5443.3, 60 sec: 5509.7, 300 sec: 5535.6). Total num frames: 956034048. Throughput: 0: 5680.9. Samples: 956033456. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:12,880][25689] Avg episode reward: [(0, '0.394')] [2022-07-10 23:16:13,060][26022] Updated weights on worker 0-0, policy_version 933629 (0.00090) [2022-07-10 23:16:14,706][26022] Updated weights on worker 0-0, policy_version 933639 (0.00080) [2022-07-10 23:16:16,853][26022] Updated weights on worker 0-0, policy_version 933649 (0.00090) [2022-07-10 23:16:17,951][25689] Fps is (10 sec: 5638.8, 60 sec: 5537.5, 300 sec: 5537.9). Total num frames: 956063744. Throughput: 0: 5688.8. Samples: 956067106. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:17,951][25689] Avg episode reward: [(0, '0.244')] [2022-07-10 23:16:18,566][26022] Updated weights on worker 0-0, policy_version 933659 (0.00081) [2022-07-10 23:16:20,523][26022] Updated weights on worker 0-0, policy_version 933669 (0.00091) [2022-07-10 23:16:22,015][26022] Updated weights on worker 0-0, policy_version 933679 (0.00081) [2022-07-10 23:16:22,956][25689] Fps is (10 sec: 5589.2, 60 sec: 5538.7, 300 sec: 5533.3). Total num frames: 956090368. Throughput: 0: 4907.4. Samples: 956083790. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:22,957][25689] Avg episode reward: [(0, '-0.273')] [2022-07-10 23:16:24,172][26022] Updated weights on worker 0-0, policy_version 933689 (0.00095) [2022-07-10 23:16:26,155][26022] Updated weights on worker 0-0, policy_version 933699 (0.00084) [2022-07-10 23:16:27,749][26022] Updated weights on worker 0-0, policy_version 933709 (0.00082) [2022-07-10 23:16:28,080][25689] Fps is (10 sec: 5459.0, 60 sec: 5517.0, 300 sec: 5538.5). Total num frames: 956119040. Throughput: 0: 5764.0. Samples: 956117006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:28,080][25689] Avg episode reward: [(0, '0.052')] [2022-07-10 23:16:29,605][26022] Updated weights on worker 0-0, policy_version 933719 (0.00086) [2022-07-10 23:16:31,507][26022] Updated weights on worker 0-0, policy_version 933729 (0.00090) [2022-07-10 23:16:33,110][25689] Fps is (10 sec: 5647.7, 60 sec: 5520.5, 300 sec: 5535.3). Total num frames: 956147712. Throughput: 0: 5799.2. Samples: 956150774. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:33,110][25689] Avg episode reward: [(0, '0.338')] [2022-07-10 23:16:33,169][26022] Updated weights on worker 0-0, policy_version 933739 (0.00090) [2022-07-10 23:16:35,117][26022] Updated weights on worker 0-0, policy_version 933749 (0.00089) [2022-07-10 23:16:36,855][26022] Updated weights on worker 0-0, policy_version 933759 (0.00111) [2022-07-10 23:16:38,146][25689] Fps is (10 sec: 5595.0, 60 sec: 5542.6, 300 sec: 5534.9). Total num frames: 956175360. Throughput: 0: 5817.8. Samples: 956184598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:38,147][25689] Avg episode reward: [(0, '0.287')] [2022-07-10 23:16:38,642][26022] Updated weights on worker 0-0, policy_version 933769 (0.00080) [2022-07-10 23:16:40,463][26022] Updated weights on worker 0-0, policy_version 933779 (0.00097) [2022-07-10 23:16:42,358][26022] Updated weights on worker 0-0, policy_version 933789 (0.00086) [2022-07-10 23:16:43,222][25689] Fps is (10 sec: 5670.4, 60 sec: 5540.3, 300 sec: 5542.9). Total num frames: 956205056. Throughput: 0: 5809.1. Samples: 956201520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:43,223][25689] Avg episode reward: [(0, '0.329')] [2022-07-10 23:16:44,259][26022] Updated weights on worker 0-0, policy_version 933799 (0.00085) [2022-07-10 23:16:45,957][26022] Updated weights on worker 0-0, policy_version 933809 (0.00084) [2022-07-10 23:16:47,852][26022] Updated weights on worker 0-0, policy_version 933819 (0.00079) [2022-07-10 23:16:48,294][25689] Fps is (10 sec: 5650.8, 60 sec: 5541.8, 300 sec: 5534.9). Total num frames: 956232704. Throughput: 0: 5861.3. Samples: 956235486. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:48,295][25689] Avg episode reward: [(0, '0.454')] [2022-07-10 23:16:49,580][26022] Updated weights on worker 0-0, policy_version 933829 (0.00091) [2022-07-10 23:16:51,490][26022] Updated weights on worker 0-0, policy_version 933839 (0.00092) [2022-07-10 23:16:53,322][25689] Fps is (10 sec: 5474.9, 60 sec: 5539.9, 300 sec: 5535.0). Total num frames: 956260352. Throughput: 0: 5876.0. Samples: 956269544. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:53,323][25689] Avg episode reward: [(0, '1.062')] [2022-07-10 23:16:53,394][26022] Updated weights on worker 0-0, policy_version 933849 (0.00083) [2022-07-10 23:16:54,881][26022] Updated weights on worker 0-0, policy_version 933859 (0.00094) [2022-07-10 23:16:57,052][26022] Updated weights on worker 0-0, policy_version 933869 (0.00084) [2022-07-10 23:16:58,339][25689] Fps is (10 sec: 5708.6, 60 sec: 5573.7, 300 sec: 5542.0). Total num frames: 956290048. Throughput: 0: 5033.4. Samples: 956286238. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:16:58,340][25689] Avg episode reward: [(0, '0.713')] [2022-07-10 23:16:58,544][26022] Updated weights on worker 0-0, policy_version 933879 (0.00082) [2022-07-10 23:17:00,617][26022] Updated weights on worker 0-0, policy_version 933889 (0.00093) [2022-07-10 23:17:02,714][26022] Updated weights on worker 0-0, policy_version 933899 (0.00088) [2022-07-10 23:17:03,349][25689] Fps is (10 sec: 5310.6, 60 sec: 5523.0, 300 sec: 5532.4). Total num frames: 956313600. Throughput: 0: 5793.3. Samples: 956318120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:03,350][25689] Avg episode reward: [(0, '0.040')] [2022-07-10 23:17:04,469][26022] Updated weights on worker 0-0, policy_version 933909 (0.00089) [2022-07-10 23:17:06,524][26022] Updated weights on worker 0-0, policy_version 933919 (0.00087) [2022-07-10 23:17:08,214][26022] Updated weights on worker 0-0, policy_version 933929 (0.00086) [2022-07-10 23:17:08,460][25689] Fps is (10 sec: 5463.7, 60 sec: 5585.9, 300 sec: 5542.3). Total num frames: 956345344. Throughput: 0: 5752.2. Samples: 956351484. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:08,460][25689] Avg episode reward: [(0, '0.043')] [2022-07-10 23:17:10,271][26022] Updated weights on worker 0-0, policy_version 933939 (0.00084) [2022-07-10 23:17:11,991][26022] Updated weights on worker 0-0, policy_version 933949 (0.00120) [2022-07-10 23:17:13,488][25689] Fps is (10 sec: 5756.6, 60 sec: 5575.4, 300 sec: 5539.1). Total num frames: 956371968. Throughput: 0: 4879.9. Samples: 956367952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:13,489][25689] Avg episode reward: [(0, '-0.230')] [2022-07-10 23:17:13,962][26022] Updated weights on worker 0-0, policy_version 933959 (0.00088) [2022-07-10 23:17:14,721][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:17:14,739][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000933964_956379136.pth [2022-07-10 23:17:14,740][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000932014_954382336.pth [2022-07-10 23:17:15,784][26022] Updated weights on worker 0-0, policy_version 933969 (0.00090) [2022-07-10 23:17:17,371][26022] Updated weights on worker 0-0, policy_version 933979 (0.00079) [2022-07-10 23:17:18,532][25689] Fps is (10 sec: 5388.2, 60 sec: 5544.1, 300 sec: 5539.7). Total num frames: 956399616. Throughput: 0: 5717.3. Samples: 956401688. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:18,533][25689] Avg episode reward: [(0, '-0.073')] [2022-07-10 23:17:19,324][26022] Updated weights on worker 0-0, policy_version 933989 (0.00084) [2022-07-10 23:17:20,863][26022] Updated weights on worker 0-0, policy_version 933999 (0.00078) [2022-07-10 23:17:23,027][26022] Updated weights on worker 0-0, policy_version 934009 (0.00083) [2022-07-10 23:17:23,559][25689] Fps is (10 sec: 5592.2, 60 sec: 5575.9, 300 sec: 5543.8). Total num frames: 956428288. Throughput: 0: 5801.4. Samples: 956435368. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:23,568][25689] Avg episode reward: [(0, '0.116')] [2022-07-10 23:17:24,903][26022] Updated weights on worker 0-0, policy_version 934019 (0.00087) [2022-07-10 23:17:26,750][26022] Updated weights on worker 0-0, policy_version 934029 (0.00088) [2022-07-10 23:17:28,597][26022] Updated weights on worker 0-0, policy_version 934039 (0.00089) [2022-07-10 23:17:28,663][25689] Fps is (10 sec: 5660.1, 60 sec: 5577.8, 300 sec: 5539.5). Total num frames: 956456960. Throughput: 0: 4971.3. Samples: 956451922. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:28,664][25689] Avg episode reward: [(0, '0.131')] [2022-07-10 23:17:30,505][26022] Updated weights on worker 0-0, policy_version 934049 (0.00092) [2022-07-10 23:17:32,139][26022] Updated weights on worker 0-0, policy_version 934059 (0.00098) [2022-07-10 23:17:33,687][25689] Fps is (10 sec: 5460.2, 60 sec: 5544.5, 300 sec: 5540.1). Total num frames: 956483584. Throughput: 0: 5804.3. Samples: 956485188. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:33,687][25689] Avg episode reward: [(0, '0.852')] [2022-07-10 23:17:34,205][26022] Updated weights on worker 0-0, policy_version 934069 (0.00086) [2022-07-10 23:17:35,918][26022] Updated weights on worker 0-0, policy_version 934079 (0.00086) [2022-07-10 23:17:37,767][26022] Updated weights on worker 0-0, policy_version 934089 (0.00089) [2022-07-10 23:17:38,702][25689] Fps is (10 sec: 5610.3, 60 sec: 5580.3, 300 sec: 5543.9). Total num frames: 956513280. Throughput: 0: 5806.1. Samples: 956518796. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:38,704][25689] Avg episode reward: [(0, '0.804')] [2022-07-10 23:17:39,540][26022] Updated weights on worker 0-0, policy_version 934099 (0.00088) [2022-07-10 23:17:41,376][26022] Updated weights on worker 0-0, policy_version 934109 (0.00086) [2022-07-10 23:17:43,316][26022] Updated weights on worker 0-0, policy_version 934119 (0.00085) [2022-07-10 23:17:43,712][25689] Fps is (10 sec: 5617.7, 60 sec: 5535.6, 300 sec: 5535.3). Total num frames: 956539904. Throughput: 0: 4973.6. Samples: 956535598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:43,713][25689] Avg episode reward: [(0, '1.160')] [2022-07-10 23:17:45,044][26022] Updated weights on worker 0-0, policy_version 934129 (0.00104) [2022-07-10 23:17:46,884][26022] Updated weights on worker 0-0, policy_version 934139 (0.00089) [2022-07-10 23:17:48,721][26022] Updated weights on worker 0-0, policy_version 934149 (0.00095) [2022-07-10 23:17:48,788][25689] Fps is (10 sec: 5482.4, 60 sec: 5552.1, 300 sec: 5544.6). Total num frames: 956568576. Throughput: 0: 5827.8. Samples: 956569206. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:48,789][25689] Avg episode reward: [(0, '0.805')] [2022-07-10 23:17:50,531][26022] Updated weights on worker 0-0, policy_version 934159 (0.00096) [2022-07-10 23:17:52,314][26022] Updated weights on worker 0-0, policy_version 934169 (0.00079) [2022-07-10 23:17:53,867][25689] Fps is (10 sec: 5646.8, 60 sec: 5564.4, 300 sec: 5543.8). Total num frames: 956597248. Throughput: 0: 5840.5. Samples: 956603052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:53,868][25689] Avg episode reward: [(0, '0.464')] [2022-07-10 23:17:54,380][26022] Updated weights on worker 0-0, policy_version 934179 (0.00084) [2022-07-10 23:17:55,789][26022] Updated weights on worker 0-0, policy_version 934189 (0.00089) [2022-07-10 23:17:57,985][26022] Updated weights on worker 0-0, policy_version 934199 (0.00084) [2022-07-10 23:17:58,889][25689] Fps is (10 sec: 5677.3, 60 sec: 5547.1, 300 sec: 5541.3). Total num frames: 956625920. Throughput: 0: 5011.2. Samples: 956619956. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:17:58,889][25689] Avg episode reward: [(0, '0.503')] [2022-07-10 23:17:59,611][26022] Updated weights on worker 0-0, policy_version 934209 (0.00087) [2022-07-10 23:18:01,661][26022] Updated weights on worker 0-0, policy_version 934219 (0.00091) [2022-07-10 23:18:03,696][26022] Updated weights on worker 0-0, policy_version 934229 (0.00084) [2022-07-10 23:18:03,951][25689] Fps is (10 sec: 5280.8, 60 sec: 5559.2, 300 sec: 5545.8). Total num frames: 956650496. Throughput: 0: 5733.1. Samples: 956651628. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:03,952][25689] Avg episode reward: [(0, '0.835')] [2022-07-10 23:18:05,426][26022] Updated weights on worker 0-0, policy_version 934239 (0.00085) [2022-07-10 23:18:07,228][26022] Updated weights on worker 0-0, policy_version 934249 (0.00080) [2022-07-10 23:18:09,018][25689] Fps is (10 sec: 5358.0, 60 sec: 5529.4, 300 sec: 5541.7). Total num frames: 956680192. Throughput: 0: 5756.2. Samples: 956685652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:09,018][25689] Avg episode reward: [(0, '0.717')] [2022-07-10 23:18:09,153][26022] Updated weights on worker 0-0, policy_version 934259 (0.00093) [2022-07-10 23:18:10,795][26022] Updated weights on worker 0-0, policy_version 934269 (0.00089) [2022-07-10 23:18:12,957][26022] Updated weights on worker 0-0, policy_version 934279 (0.00079) [2022-07-10 23:18:14,022][25689] Fps is (10 sec: 5795.5, 60 sec: 5565.5, 300 sec: 5545.6). Total num frames: 956708864. Throughput: 0: 4929.6. Samples: 956702406. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:14,022][25689] Avg episode reward: [(0, '0.657')] [2022-07-10 23:18:14,367][26022] Updated weights on worker 0-0, policy_version 934289 (0.00086) [2022-07-10 23:18:16,430][26022] Updated weights on worker 0-0, policy_version 934299 (0.00089) [2022-07-10 23:18:18,135][26022] Updated weights on worker 0-0, policy_version 934309 (0.00082) [2022-07-10 23:18:19,028][25689] Fps is (10 sec: 5626.5, 60 sec: 5568.9, 300 sec: 5549.2). Total num frames: 956736512. Throughput: 0: 5788.4. Samples: 956736528. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:19,028][25689] Avg episode reward: [(0, '0.675')] [2022-07-10 23:18:19,875][26022] Updated weights on worker 0-0, policy_version 934319 (0.00083) [2022-07-10 23:18:21,808][26022] Updated weights on worker 0-0, policy_version 934329 (0.00087) [2022-07-10 23:18:23,522][26022] Updated weights on worker 0-0, policy_version 934339 (0.00186) [2022-07-10 23:18:24,043][25689] Fps is (10 sec: 5517.9, 60 sec: 5553.1, 300 sec: 5543.1). Total num frames: 956764160. Throughput: 0: 5912.0. Samples: 956770414. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:24,043][25689] Avg episode reward: [(0, '0.669')] [2022-07-10 23:18:25,344][26022] Updated weights on worker 0-0, policy_version 934349 (0.00088) [2022-07-10 23:18:27,446][26022] Updated weights on worker 0-0, policy_version 934359 (0.00086) [2022-07-10 23:18:29,072][26022] Updated weights on worker 0-0, policy_version 934369 (0.00090) [2022-07-10 23:18:29,120][25689] Fps is (10 sec: 5682.0, 60 sec: 5572.5, 300 sec: 5553.7). Total num frames: 956793856. Throughput: 0: 5046.7. Samples: 956787100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:29,120][25689] Avg episode reward: [(0, '-0.040')] [2022-07-10 23:18:31,066][26022] Updated weights on worker 0-0, policy_version 934379 (0.00072) [2022-07-10 23:18:32,783][26022] Updated weights on worker 0-0, policy_version 934389 (0.00088) [2022-07-10 23:18:34,139][25689] Fps is (10 sec: 5578.6, 60 sec: 5572.9, 300 sec: 5544.1). Total num frames: 956820480. Throughput: 0: 5861.3. Samples: 956820318. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:34,139][25689] Avg episode reward: [(0, '0.141')] [2022-07-10 23:18:34,591][26022] Updated weights on worker 0-0, policy_version 934399 (0.00078) [2022-07-10 23:18:36,669][26022] Updated weights on worker 0-0, policy_version 934409 (0.00085) [2022-07-10 23:18:38,238][26022] Updated weights on worker 0-0, policy_version 934419 (0.00089) [2022-07-10 23:18:39,155][25689] Fps is (10 sec: 5510.2, 60 sec: 5555.9, 300 sec: 5554.4). Total num frames: 956849152. Throughput: 0: 5843.6. Samples: 956854144. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:39,156][25689] Avg episode reward: [(0, '-0.048')] [2022-07-10 23:18:40,314][26022] Updated weights on worker 0-0, policy_version 934429 (0.00084) [2022-07-10 23:18:41,851][26022] Updated weights on worker 0-0, policy_version 934439 (0.00084) [2022-07-10 23:18:43,729][26022] Updated weights on worker 0-0, policy_version 934449 (0.00086) [2022-07-10 23:18:44,179][25689] Fps is (10 sec: 5711.5, 60 sec: 5588.5, 300 sec: 5552.6). Total num frames: 956877824. Throughput: 0: 4998.2. Samples: 956871056. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:44,179][25689] Avg episode reward: [(0, '0.166')] [2022-07-10 23:18:45,539][26022] Updated weights on worker 0-0, policy_version 934459 (0.00085) [2022-07-10 23:18:47,583][26022] Updated weights on worker 0-0, policy_version 934469 (0.00091) [2022-07-10 23:18:49,165][26022] Updated weights on worker 0-0, policy_version 934479 (0.00090) [2022-07-10 23:18:49,225][25689] Fps is (10 sec: 5694.6, 60 sec: 5591.3, 300 sec: 5552.0). Total num frames: 956906496. Throughput: 0: 5849.8. Samples: 956904712. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:49,226][25689] Avg episode reward: [(0, '0.327')] [2022-07-10 23:18:51,226][26022] Updated weights on worker 0-0, policy_version 934489 (0.00097) [2022-07-10 23:18:52,756][26022] Updated weights on worker 0-0, policy_version 934499 (0.00090) [2022-07-10 23:18:54,232][25689] Fps is (10 sec: 5500.3, 60 sec: 5564.0, 300 sec: 5556.2). Total num frames: 956933120. Throughput: 0: 5857.7. Samples: 956938020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:54,234][25689] Avg episode reward: [(0, '0.580')] [2022-07-10 23:18:54,752][26022] Updated weights on worker 0-0, policy_version 934509 (0.00092) [2022-07-10 23:18:56,735][26022] Updated weights on worker 0-0, policy_version 934519 (0.00082) [2022-07-10 23:18:58,429][26022] Updated weights on worker 0-0, policy_version 934529 (0.00096) [2022-07-10 23:18:59,236][25689] Fps is (10 sec: 5421.1, 60 sec: 5548.7, 300 sec: 5546.8). Total num frames: 956960768. Throughput: 0: 5010.5. Samples: 956954760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:18:59,237][25689] Avg episode reward: [(0, '1.106')] [2022-07-10 23:19:00,495][26022] Updated weights on worker 0-0, policy_version 934539 (0.00084) [2022-07-10 23:19:02,372][26022] Updated weights on worker 0-0, policy_version 934549 (0.00088) [2022-07-10 23:19:04,249][25689] Fps is (10 sec: 5418.0, 60 sec: 5587.1, 300 sec: 5548.7). Total num frames: 956987392. Throughput: 0: 5754.8. Samples: 956986556. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:04,250][25689] Avg episode reward: [(0, '0.326')] [2022-07-10 23:19:04,478][26022] Updated weights on worker 0-0, policy_version 934559 (0.00088) [2022-07-10 23:19:06,395][26022] Updated weights on worker 0-0, policy_version 934569 (0.00090) [2022-07-10 23:19:07,950][26022] Updated weights on worker 0-0, policy_version 934579 (0.00088) [2022-07-10 23:19:09,303][25689] Fps is (10 sec: 5391.6, 60 sec: 5554.5, 300 sec: 5547.9). Total num frames: 957015040. Throughput: 0: 5706.8. Samples: 957019290. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:09,303][25689] Avg episode reward: [(0, '0.463')] [2022-07-10 23:19:10,014][26022] Updated weights on worker 0-0, policy_version 934589 (0.00093) [2022-07-10 23:19:11,753][26022] Updated weights on worker 0-0, policy_version 934599 (0.00081) [2022-07-10 23:19:13,555][26022] Updated weights on worker 0-0, policy_version 934609 (0.00092) [2022-07-10 23:19:14,356][25689] Fps is (10 sec: 5673.5, 60 sec: 5566.8, 300 sec: 5554.3). Total num frames: 957044736. Throughput: 0: 4875.5. Samples: 957036136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:14,357][25689] Avg episode reward: [(0, '0.661')] [2022-07-10 23:19:14,999][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:19:15,011][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000934616_957046784.pth [2022-07-10 23:19:15,012][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000932664_955047936.pth [2022-07-10 23:19:15,710][26022] Updated weights on worker 0-0, policy_version 934619 (0.00084) [2022-07-10 23:19:17,112][26022] Updated weights on worker 0-0, policy_version 934629 (0.00086) [2022-07-10 23:19:19,286][26022] Updated weights on worker 0-0, policy_version 934639 (0.00096) [2022-07-10 23:19:19,383][25689] Fps is (10 sec: 5587.0, 60 sec: 5547.9, 300 sec: 5550.4). Total num frames: 957071360. Throughput: 0: 5698.1. Samples: 957069558. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:19,385][25689] Avg episode reward: [(0, '0.435')] [2022-07-10 23:19:20,848][26022] Updated weights on worker 0-0, policy_version 934649 (0.00087) [2022-07-10 23:19:22,926][26022] Updated weights on worker 0-0, policy_version 934659 (0.00095) [2022-07-10 23:19:24,406][25689] Fps is (10 sec: 5400.6, 60 sec: 5547.3, 300 sec: 5548.2). Total num frames: 957099008. Throughput: 0: 5791.8. Samples: 957103298. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:24,409][25689] Avg episode reward: [(0, '-0.820')] [2022-07-10 23:19:24,726][26022] Updated weights on worker 0-0, policy_version 934669 (0.00084) [2022-07-10 23:19:26,392][26022] Updated weights on worker 0-0, policy_version 934679 (0.00086) [2022-07-10 23:19:28,343][26022] Updated weights on worker 0-0, policy_version 934689 (0.00086) [2022-07-10 23:19:29,470][25689] Fps is (10 sec: 5583.6, 60 sec: 5531.5, 300 sec: 5550.8). Total num frames: 957127680. Throughput: 0: 5807.3. Samples: 957136408. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:29,470][25689] Avg episode reward: [(0, '-1.024')] [2022-07-10 23:19:30,321][26022] Updated weights on worker 0-0, policy_version 934699 (0.00087) [2022-07-10 23:19:32,140][26022] Updated weights on worker 0-0, policy_version 934709 (0.00086) [2022-07-10 23:19:34,045][26022] Updated weights on worker 0-0, policy_version 934719 (0.00088) [2022-07-10 23:19:34,486][25689] Fps is (10 sec: 5485.8, 60 sec: 5531.8, 300 sec: 5543.7). Total num frames: 957154304. Throughput: 0: 5798.3. Samples: 957152850. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:34,486][25689] Avg episode reward: [(0, '-1.976')] [2022-07-10 23:19:35,683][26022] Updated weights on worker 0-0, policy_version 934729 (0.00095) [2022-07-10 23:19:37,648][26022] Updated weights on worker 0-0, policy_version 934739 (0.00090) [2022-07-10 23:19:39,416][26022] Updated weights on worker 0-0, policy_version 934749 (0.00085) [2022-07-10 23:19:39,503][25689] Fps is (10 sec: 5511.5, 60 sec: 5531.7, 300 sec: 5543.4). Total num frames: 957182976. Throughput: 0: 5793.5. Samples: 957186120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:39,504][25689] Avg episode reward: [(0, '-1.874')] [2022-07-10 23:19:41,563][26022] Updated weights on worker 0-0, policy_version 934759 (0.00080) [2022-07-10 23:19:43,131][26022] Updated weights on worker 0-0, policy_version 934769 (0.00095) [2022-07-10 23:19:44,525][25689] Fps is (10 sec: 5507.8, 60 sec: 5497.9, 300 sec: 5545.3). Total num frames: 957209600. Throughput: 0: 5794.1. Samples: 957219872. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:44,527][25689] Avg episode reward: [(0, '-2.877')] [2022-07-10 23:19:45,031][26022] Updated weights on worker 0-0, policy_version 934779 (0.00086) [2022-07-10 23:19:46,765][26022] Updated weights on worker 0-0, policy_version 934789 (0.00685) [2022-07-10 23:19:48,769][26022] Updated weights on worker 0-0, policy_version 934799 (0.00090) [2022-07-10 23:19:49,572][25689] Fps is (10 sec: 5491.6, 60 sec: 5497.8, 300 sec: 5544.8). Total num frames: 957238272. Throughput: 0: 4980.4. Samples: 957236524. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:49,572][25689] Avg episode reward: [(0, '-2.704')] [2022-07-10 23:19:50,542][26022] Updated weights on worker 0-0, policy_version 934809 (0.00087) [2022-07-10 23:19:52,339][26022] Updated weights on worker 0-0, policy_version 934819 (0.00093) [2022-07-10 23:19:54,164][26022] Updated weights on worker 0-0, policy_version 934829 (0.00085) [2022-07-10 23:19:54,589][25689] Fps is (10 sec: 5800.0, 60 sec: 5547.8, 300 sec: 5549.1). Total num frames: 957267968. Throughput: 0: 5840.8. Samples: 957270268. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:19:54,590][25689] Avg episode reward: [(0, '-2.294')] [2022-07-10 23:19:56,143][26022] Updated weights on worker 0-0, policy_version 934839 (0.00082) [2022-07-10 23:19:57,767][26022] Updated weights on worker 0-0, policy_version 934849 (0.00104) [2022-07-10 23:19:59,616][25689] Fps is (10 sec: 5505.4, 60 sec: 5511.8, 300 sec: 5545.4). Total num frames: 957293568. Throughput: 0: 5835.1. Samples: 957303482. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:19:59,617][25689] Avg episode reward: [(0, '-1.390')] [2022-07-10 23:19:59,840][26022] Updated weights on worker 0-0, policy_version 934859 (0.00086) [2022-07-10 23:20:01,423][26022] Updated weights on worker 0-0, policy_version 934869 (0.00087) [2022-07-10 23:20:03,907][26022] Updated weights on worker 0-0, policy_version 934879 (0.00096) [2022-07-10 23:20:04,625][25689] Fps is (10 sec: 5203.4, 60 sec: 5512.1, 300 sec: 5547.0). Total num frames: 957320192. Throughput: 0: 4885.7. Samples: 957318074. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:04,627][25689] Avg episode reward: [(0, '-0.594')] [2022-07-10 23:20:05,712][26022] Updated weights on worker 0-0, policy_version 934889 (0.00095) [2022-07-10 23:20:07,470][26022] Updated weights on worker 0-0, policy_version 934899 (0.00087) [2022-07-10 23:20:09,400][26022] Updated weights on worker 0-0, policy_version 934909 (0.00089) [2022-07-10 23:20:09,730][25689] Fps is (10 sec: 5467.4, 60 sec: 5524.4, 300 sec: 5542.8). Total num frames: 957348864. Throughput: 0: 5682.4. Samples: 957351068. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:09,731][25689] Avg episode reward: [(0, '-0.045')] [2022-07-10 23:20:11,012][26022] Updated weights on worker 0-0, policy_version 934919 (0.00087) [2022-07-10 23:20:13,127][26022] Updated weights on worker 0-0, policy_version 934929 (0.00092) [2022-07-10 23:20:14,762][25689] Fps is (10 sec: 5556.0, 60 sec: 5492.5, 300 sec: 5542.3). Total num frames: 957376512. Throughput: 0: 5674.4. Samples: 957384738. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:14,762][25689] Avg episode reward: [(0, '0.328')] [2022-07-10 23:20:14,784][26022] Updated weights on worker 0-0, policy_version 934939 (0.00087) [2022-07-10 23:20:16,692][26022] Updated weights on worker 0-0, policy_version 934949 (0.00096) [2022-07-10 23:20:18,427][26022] Updated weights on worker 0-0, policy_version 934959 (0.00086) [2022-07-10 23:20:19,774][25689] Fps is (10 sec: 5505.3, 60 sec: 5510.8, 300 sec: 5545.9). Total num frames: 957404160. Throughput: 0: 4874.2. Samples: 957401736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:19,774][25689] Avg episode reward: [(0, '0.382')] [2022-07-10 23:20:20,222][26022] Updated weights on worker 0-0, policy_version 934969 (0.00086) [2022-07-10 23:20:22,364][26022] Updated weights on worker 0-0, policy_version 934979 (0.00089) [2022-07-10 23:20:23,761][26022] Updated weights on worker 0-0, policy_version 934989 (0.00082) [2022-07-10 23:20:24,782][25689] Fps is (10 sec: 5620.9, 60 sec: 5529.1, 300 sec: 5543.6). Total num frames: 957432832. Throughput: 0: 5819.3. Samples: 957435372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:24,782][25689] Avg episode reward: [(0, '1.257')] [2022-07-10 23:20:26,131][26022] Updated weights on worker 0-0, policy_version 934999 (0.00084) [2022-07-10 23:20:27,615][26022] Updated weights on worker 0-0, policy_version 935009 (0.00087) [2022-07-10 23:20:29,553][26022] Updated weights on worker 0-0, policy_version 935019 (0.00092) [2022-07-10 23:20:29,831][25689] Fps is (10 sec: 5600.1, 60 sec: 5513.5, 300 sec: 5540.5). Total num frames: 957460480. Throughput: 0: 5859.9. Samples: 957468858. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:29,832][25689] Avg episode reward: [(0, '1.072')] [2022-07-10 23:20:31,345][26022] Updated weights on worker 0-0, policy_version 935029 (0.00091) [2022-07-10 23:20:33,085][26022] Updated weights on worker 0-0, policy_version 935039 (0.00080) [2022-07-10 23:20:34,843][25689] Fps is (10 sec: 5597.8, 60 sec: 5547.8, 300 sec: 5548.9). Total num frames: 957489152. Throughput: 0: 5024.3. Samples: 957485630. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:34,844][25689] Avg episode reward: [(0, '1.201')] [2022-07-10 23:20:35,027][26022] Updated weights on worker 0-0, policy_version 935049 (0.00085) [2022-07-10 23:20:36,867][26022] Updated weights on worker 0-0, policy_version 935059 (0.00095) [2022-07-10 23:20:38,684][26022] Updated weights on worker 0-0, policy_version 935069 (0.00092) [2022-07-10 23:20:39,847][25689] Fps is (10 sec: 5622.8, 60 sec: 5532.0, 300 sec: 5542.9). Total num frames: 957516800. Throughput: 0: 5857.1. Samples: 957519308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:39,855][25689] Avg episode reward: [(0, '1.200')] [2022-07-10 23:20:40,560][26022] Updated weights on worker 0-0, policy_version 935079 (0.00093) [2022-07-10 23:20:42,298][26022] Updated weights on worker 0-0, policy_version 935089 (0.00086) [2022-07-10 23:20:44,174][26022] Updated weights on worker 0-0, policy_version 935099 (0.00084) [2022-07-10 23:20:44,861][25689] Fps is (10 sec: 5621.5, 60 sec: 5566.7, 300 sec: 5547.7). Total num frames: 957545472. Throughput: 0: 5863.4. Samples: 957553108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:44,863][25689] Avg episode reward: [(0, '1.111')] [2022-07-10 23:20:46,029][26022] Updated weights on worker 0-0, policy_version 935109 (0.00091) [2022-07-10 23:20:47,776][26022] Updated weights on worker 0-0, policy_version 935119 (0.00096) [2022-07-10 23:20:49,681][26022] Updated weights on worker 0-0, policy_version 935129 (0.00086) [2022-07-10 23:20:49,923][25689] Fps is (10 sec: 5488.1, 60 sec: 5531.4, 300 sec: 5543.3). Total num frames: 957572096. Throughput: 0: 5030.4. Samples: 957569928. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:49,923][25689] Avg episode reward: [(0, '0.379')] [2022-07-10 23:20:51,364][26022] Updated weights on worker 0-0, policy_version 935139 (0.00087) [2022-07-10 23:20:53,465][26022] Updated weights on worker 0-0, policy_version 935149 (0.00087) [2022-07-10 23:20:54,925][25689] Fps is (10 sec: 5698.1, 60 sec: 5549.7, 300 sec: 5553.9). Total num frames: 957602816. Throughput: 0: 5863.9. Samples: 957603392. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:54,926][25689] Avg episode reward: [(0, '-0.265')] [2022-07-10 23:20:54,927][26022] Updated weights on worker 0-0, policy_version 935159 (0.00099) [2022-07-10 23:20:57,127][26022] Updated weights on worker 0-0, policy_version 935169 (0.00566) [2022-07-10 23:20:58,877][26022] Updated weights on worker 0-0, policy_version 935179 (0.00086) [2022-07-10 23:20:59,996][25689] Fps is (10 sec: 5590.9, 60 sec: 5545.6, 300 sec: 5549.3). Total num frames: 957628416. Throughput: 0: 5839.9. Samples: 957636976. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:20:59,997][25689] Avg episode reward: [(0, '-0.230')] [2022-07-10 23:21:00,599][26022] Updated weights on worker 0-0, policy_version 935189 (0.00093) [2022-07-10 23:21:03,064][26022] Updated weights on worker 0-0, policy_version 935199 (0.00083) [2022-07-10 23:21:04,579][26022] Updated weights on worker 0-0, policy_version 935209 (0.00090) [2022-07-10 23:21:04,998][25689] Fps is (10 sec: 5184.8, 60 sec: 5546.3, 300 sec: 5546.9). Total num frames: 957655040. Throughput: 0: 4882.0. Samples: 957651414. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:04,998][25689] Avg episode reward: [(0, '-0.114')] [2022-07-10 23:21:06,787][26022] Updated weights on worker 0-0, policy_version 935219 (0.00087) [2022-07-10 23:21:08,369][26022] Updated weights on worker 0-0, policy_version 935229 (0.00095) [2022-07-10 23:21:10,107][25689] Fps is (10 sec: 5367.9, 60 sec: 5529.0, 300 sec: 5546.7). Total num frames: 957682688. Throughput: 0: 5677.8. Samples: 957684528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:10,107][25689] Avg episode reward: [(0, '-0.556')] [2022-07-10 23:21:10,481][26022] Updated weights on worker 0-0, policy_version 935239 (0.00086) [2022-07-10 23:21:12,040][26022] Updated weights on worker 0-0, policy_version 935249 (0.00090) [2022-07-10 23:21:14,200][26022] Updated weights on worker 0-0, policy_version 935259 (0.00090) [2022-07-10 23:21:15,123][25689] Fps is (10 sec: 5461.2, 60 sec: 5530.4, 300 sec: 5540.9). Total num frames: 957710336. Throughput: 0: 5673.2. Samples: 957717978. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:15,123][25689] Avg episode reward: [(0, '-0.476')] [2022-07-10 23:21:15,233][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:21:15,246][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000935265_957711360.pth [2022-07-10 23:21:15,246][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000933314_955713536.pth [2022-07-10 23:21:15,735][26022] Updated weights on worker 0-0, policy_version 935269 (0.00087) [2022-07-10 23:21:17,826][26022] Updated weights on worker 0-0, policy_version 935279 (0.00100) [2022-07-10 23:21:19,515][26022] Updated weights on worker 0-0, policy_version 935289 (0.00091) [2022-07-10 23:21:20,138][25689] Fps is (10 sec: 5614.6, 60 sec: 5547.2, 300 sec: 5547.6). Total num frames: 957739008. Throughput: 0: 4850.0. Samples: 957734662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:20,138][25689] Avg episode reward: [(0, '0.677')] [2022-07-10 23:21:21,387][26022] Updated weights on worker 0-0, policy_version 935299 (0.00087) [2022-07-10 23:21:23,164][26022] Updated weights on worker 0-0, policy_version 935309 (0.00096) [2022-07-10 23:21:24,967][26022] Updated weights on worker 0-0, policy_version 935319 (0.00086) [2022-07-10 23:21:25,165][25689] Fps is (10 sec: 5608.3, 60 sec: 5528.4, 300 sec: 5545.9). Total num frames: 957766656. Throughput: 0: 5799.6. Samples: 957768376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:25,166][25689] Avg episode reward: [(0, '0.591')] [2022-07-10 23:21:26,876][26022] Updated weights on worker 0-0, policy_version 935329 (0.00104) [2022-07-10 23:21:28,804][26022] Updated weights on worker 0-0, policy_version 935339 (0.00085) [2022-07-10 23:21:30,215][25689] Fps is (10 sec: 5588.9, 60 sec: 5545.3, 300 sec: 5545.6). Total num frames: 957795328. Throughput: 0: 5828.4. Samples: 957801726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:30,215][25689] Avg episode reward: [(0, '0.391')] [2022-07-10 23:21:30,419][26022] Updated weights on worker 0-0, policy_version 935349 (0.00093) [2022-07-10 23:21:32,562][26022] Updated weights on worker 0-0, policy_version 935359 (0.00095) [2022-07-10 23:21:33,983][26022] Updated weights on worker 0-0, policy_version 935369 (0.00084) [2022-07-10 23:21:35,229][25689] Fps is (10 sec: 5596.4, 60 sec: 5528.2, 300 sec: 5546.0). Total num frames: 957822976. Throughput: 0: 4999.0. Samples: 957818486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:35,229][25689] Avg episode reward: [(0, '0.355')] [2022-07-10 23:21:36,277][26022] Updated weights on worker 0-0, policy_version 935379 (0.00090) [2022-07-10 23:21:37,727][26022] Updated weights on worker 0-0, policy_version 935389 (0.00088) [2022-07-10 23:21:39,713][26022] Updated weights on worker 0-0, policy_version 935399 (0.00085) [2022-07-10 23:21:40,259][25689] Fps is (10 sec: 5607.3, 60 sec: 5542.8, 300 sec: 5543.4). Total num frames: 957851648. Throughput: 0: 5850.9. Samples: 957852388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:40,259][25689] Avg episode reward: [(0, '0.329')] [2022-07-10 23:21:41,223][26022] Updated weights on worker 0-0, policy_version 935409 (0.00090) [2022-07-10 23:21:43,172][26022] Updated weights on worker 0-0, policy_version 935419 (0.00089) [2022-07-10 23:21:45,007][26022] Updated weights on worker 0-0, policy_version 935429 (0.00089) [2022-07-10 23:21:45,261][25689] Fps is (10 sec: 5716.1, 60 sec: 5543.9, 300 sec: 5548.2). Total num frames: 957880320. Throughput: 0: 5873.5. Samples: 957886408. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:45,261][25689] Avg episode reward: [(0, '0.399')] [2022-07-10 23:21:46,952][26022] Updated weights on worker 0-0, policy_version 935439 (0.00090) [2022-07-10 23:21:48,563][26022] Updated weights on worker 0-0, policy_version 935449 (0.00082) [2022-07-10 23:21:50,311][25689] Fps is (10 sec: 5704.7, 60 sec: 5578.8, 300 sec: 5551.2). Total num frames: 957908992. Throughput: 0: 5053.9. Samples: 957903290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:50,311][25689] Avg episode reward: [(0, '-0.610')] [2022-07-10 23:21:50,734][26022] Updated weights on worker 0-0, policy_version 935459 (0.00086) [2022-07-10 23:21:52,043][26022] Updated weights on worker 0-0, policy_version 935469 (0.00089) [2022-07-10 23:21:54,408][26022] Updated weights on worker 0-0, policy_version 935479 (0.00083) [2022-07-10 23:21:55,343][25689] Fps is (10 sec: 5687.5, 60 sec: 5542.2, 300 sec: 5547.5). Total num frames: 957937664. Throughput: 0: 5893.2. Samples: 957937024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:21:55,345][25689] Avg episode reward: [(0, '0.060')] [2022-07-10 23:21:55,719][26022] Updated weights on worker 0-0, policy_version 935489 (0.00086) [2022-07-10 23:21:58,002][26022] Updated weights on worker 0-0, policy_version 935499 (0.00086) [2022-07-10 23:21:59,421][26022] Updated weights on worker 0-0, policy_version 935509 (0.00087) [2022-07-10 23:22:00,366][25689] Fps is (10 sec: 5601.1, 60 sec: 5580.5, 300 sec: 5561.0). Total num frames: 957965312. Throughput: 0: 5896.1. Samples: 957970942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:00,367][25689] Avg episode reward: [(0, '-0.424')] [2022-07-10 23:22:01,728][26022] Updated weights on worker 0-0, policy_version 935519 (0.00090) [2022-07-10 23:22:03,574][26022] Updated weights on worker 0-0, policy_version 935529 (0.00389) [2022-07-10 23:22:05,379][25689] Fps is (10 sec: 5305.9, 60 sec: 5562.5, 300 sec: 5542.1). Total num frames: 957990912. Throughput: 0: 4939.6. Samples: 957985782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:05,379][25689] Avg episode reward: [(0, '0.459')] [2022-07-10 23:22:05,596][26022] Updated weights on worker 0-0, policy_version 935539 (0.00092) [2022-07-10 23:22:07,222][26022] Updated weights on worker 0-0, policy_version 935549 (0.00102) [2022-07-10 23:22:09,363][26022] Updated weights on worker 0-0, policy_version 935559 (0.00082) [2022-07-10 23:22:10,440][25689] Fps is (10 sec: 5387.4, 60 sec: 5583.9, 300 sec: 5548.4). Total num frames: 958019584. Throughput: 0: 5741.1. Samples: 958018852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:10,440][25689] Avg episode reward: [(0, '0.076')] [2022-07-10 23:22:10,848][26022] Updated weights on worker 0-0, policy_version 935569 (0.00095) [2022-07-10 23:22:13,095][26022] Updated weights on worker 0-0, policy_version 935579 (0.00085) [2022-07-10 23:22:14,756][26022] Updated weights on worker 0-0, policy_version 935589 (0.00088) [2022-07-10 23:22:15,494][25689] Fps is (10 sec: 5466.4, 60 sec: 5563.4, 300 sec: 5544.8). Total num frames: 958046208. Throughput: 0: 5729.8. Samples: 958052486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:15,495][25689] Avg episode reward: [(0, '-0.734')] [2022-07-10 23:22:16,517][26022] Updated weights on worker 0-0, policy_version 935599 (0.00088) [2022-07-10 23:22:18,282][26022] Updated weights on worker 0-0, policy_version 935609 (0.00088) [2022-07-10 23:22:20,155][26022] Updated weights on worker 0-0, policy_version 935619 (0.00095) [2022-07-10 23:22:20,519][25689] Fps is (10 sec: 5486.5, 60 sec: 5562.6, 300 sec: 5544.8). Total num frames: 958074880. Throughput: 0: 4875.7. Samples: 958069200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:20,519][25689] Avg episode reward: [(0, '-0.259')] [2022-07-10 23:22:22,054][26022] Updated weights on worker 0-0, policy_version 935629 (0.00090) [2022-07-10 23:22:23,932][26022] Updated weights on worker 0-0, policy_version 935639 (0.01096) [2022-07-10 23:22:25,540][25689] Fps is (10 sec: 5708.4, 60 sec: 5580.1, 300 sec: 5546.4). Total num frames: 958103552. Throughput: 0: 5788.6. Samples: 958102486. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:25,540][25689] Avg episode reward: [(0, '-0.586')] [2022-07-10 23:22:25,726][26022] Updated weights on worker 0-0, policy_version 935649 (0.00085) [2022-07-10 23:22:27,547][26022] Updated weights on worker 0-0, policy_version 935659 (0.00099) [2022-07-10 23:22:29,283][26022] Updated weights on worker 0-0, policy_version 935669 (0.00092) [2022-07-10 23:22:30,636][25689] Fps is (10 sec: 5667.7, 60 sec: 5575.8, 300 sec: 5551.9). Total num frames: 958132224. Throughput: 0: 5792.4. Samples: 958135836. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:30,637][25689] Avg episode reward: [(0, '-0.256')] [2022-07-10 23:22:31,431][26022] Updated weights on worker 0-0, policy_version 935679 (0.00085) [2022-07-10 23:22:33,331][26022] Updated weights on worker 0-0, policy_version 935689 (0.00091) [2022-07-10 23:22:35,000][26022] Updated weights on worker 0-0, policy_version 935699 (0.00090) [2022-07-10 23:22:35,682][25689] Fps is (10 sec: 5451.8, 60 sec: 5555.8, 300 sec: 5541.0). Total num frames: 958158848. Throughput: 0: 4949.7. Samples: 958152412. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:35,684][25689] Avg episode reward: [(0, '-0.922')] [2022-07-10 23:22:36,951][26022] Updated weights on worker 0-0, policy_version 935709 (0.00084) [2022-07-10 23:22:38,413][26022] Updated weights on worker 0-0, policy_version 935719 (0.00093) [2022-07-10 23:22:40,667][26022] Updated weights on worker 0-0, policy_version 935729 (0.00087) [2022-07-10 23:22:40,762][25689] Fps is (10 sec: 5360.0, 60 sec: 5534.4, 300 sec: 5543.2). Total num frames: 958186496. Throughput: 0: 5756.3. Samples: 958185724. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:40,762][25689] Avg episode reward: [(0, '-0.753')] [2022-07-10 23:22:42,403][26022] Updated weights on worker 0-0, policy_version 935739 (0.00071) [2022-07-10 23:22:44,150][26022] Updated weights on worker 0-0, policy_version 935749 (0.00090) [2022-07-10 23:22:45,781][25689] Fps is (10 sec: 5577.1, 60 sec: 5532.8, 300 sec: 5544.2). Total num frames: 958215168. Throughput: 0: 5760.6. Samples: 958219086. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:45,781][25689] Avg episode reward: [(0, '-0.054')] [2022-07-10 23:22:46,158][26022] Updated weights on worker 0-0, policy_version 935759 (0.00329) [2022-07-10 23:22:47,655][26022] Updated weights on worker 0-0, policy_version 935769 (0.00090) [2022-07-10 23:22:49,817][26022] Updated weights on worker 0-0, policy_version 935779 (0.00080) [2022-07-10 23:22:50,871][25689] Fps is (10 sec: 5672.5, 60 sec: 5529.2, 300 sec: 5544.0). Total num frames: 958243840. Throughput: 0: 5765.1. Samples: 958252490. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:50,872][25689] Avg episode reward: [(0, '-0.198')] [2022-07-10 23:22:51,632][26022] Updated weights on worker 0-0, policy_version 935789 (0.00089) [2022-07-10 23:22:53,580][26022] Updated weights on worker 0-0, policy_version 935799 (0.00087) [2022-07-10 23:22:55,370][26022] Updated weights on worker 0-0, policy_version 935809 (0.00101) [2022-07-10 23:22:55,874][25689] Fps is (10 sec: 5478.7, 60 sec: 5498.0, 300 sec: 5537.5). Total num frames: 958270464. Throughput: 0: 5784.0. Samples: 958269198. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:22:55,875][25689] Avg episode reward: [(0, '-0.908')] [2022-07-10 23:22:57,275][26022] Updated weights on worker 0-0, policy_version 935819 (0.00089) [2022-07-10 23:22:58,962][26022] Updated weights on worker 0-0, policy_version 935829 (0.00090) [2022-07-10 23:23:00,863][26022] Updated weights on worker 0-0, policy_version 935839 (0.00085) [2022-07-10 23:23:00,935][25689] Fps is (10 sec: 5494.4, 60 sec: 5511.5, 300 sec: 5551.3). Total num frames: 958299136. Throughput: 0: 5781.0. Samples: 958302346. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:00,935][25689] Avg episode reward: [(0, '-1.053')] [2022-07-10 23:23:02,961][26022] Updated weights on worker 0-0, policy_version 935849 (0.00093) [2022-07-10 23:23:05,026][26022] Updated weights on worker 0-0, policy_version 935859 (0.00095) [2022-07-10 23:23:05,946][25689] Fps is (10 sec: 5388.1, 60 sec: 5511.6, 300 sec: 5538.5). Total num frames: 958324736. Throughput: 0: 5661.9. Samples: 958333260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:05,949][25689] Avg episode reward: [(0, '-0.987')] [2022-07-10 23:23:06,784][26022] Updated weights on worker 0-0, policy_version 935869 (0.00084) [2022-07-10 23:23:08,746][26022] Updated weights on worker 0-0, policy_version 935879 (0.00090) [2022-07-10 23:23:10,438][26022] Updated weights on worker 0-0, policy_version 935889 (0.00093) [2022-07-10 23:23:11,027][25689] Fps is (10 sec: 5276.2, 60 sec: 5492.9, 300 sec: 5533.6). Total num frames: 958352384. Throughput: 0: 4834.2. Samples: 958349928. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:11,027][25689] Avg episode reward: [(0, '-0.418')] [2022-07-10 23:23:12,380][26022] Updated weights on worker 0-0, policy_version 935899 (0.00093) [2022-07-10 23:23:14,129][26022] Updated weights on worker 0-0, policy_version 935909 (0.00090) [2022-07-10 23:23:15,361][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:23:15,377][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000935915_958376960.pth [2022-07-10 23:23:15,377][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000933964_956379136.pth [2022-07-10 23:23:16,057][25689] Fps is (10 sec: 5468.8, 60 sec: 5512.0, 300 sec: 5533.2). Total num frames: 958380032. Throughput: 0: 5666.7. Samples: 958383574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:16,060][25689] Avg episode reward: [(0, '-0.175')] [2022-07-10 23:23:16,191][26022] Updated weights on worker 0-0, policy_version 935919 (0.00092) [2022-07-10 23:23:17,873][26022] Updated weights on worker 0-0, policy_version 935929 (0.00085) [2022-07-10 23:23:19,695][26022] Updated weights on worker 0-0, policy_version 935939 (0.00085) [2022-07-10 23:23:21,088][25689] Fps is (10 sec: 5597.5, 60 sec: 5511.4, 300 sec: 5536.3). Total num frames: 958408704. Throughput: 0: 5707.7. Samples: 958417378. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:21,090][25689] Avg episode reward: [(0, '-0.193')] [2022-07-10 23:23:21,576][26022] Updated weights on worker 0-0, policy_version 935949 (0.00089) [2022-07-10 23:23:23,132][26022] Updated weights on worker 0-0, policy_version 935959 (0.00087) [2022-07-10 23:23:24,957][26022] Updated weights on worker 0-0, policy_version 935969 (0.00092) [2022-07-10 23:23:26,127][25689] Fps is (10 sec: 5593.3, 60 sec: 5492.9, 300 sec: 5530.2). Total num frames: 958436352. Throughput: 0: 5018.1. Samples: 958434528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:26,128][25689] Avg episode reward: [(0, '0.309')] [2022-07-10 23:23:27,079][26022] Updated weights on worker 0-0, policy_version 935979 (0.00091) [2022-07-10 23:23:28,647][26022] Updated weights on worker 0-0, policy_version 935989 (0.00086) [2022-07-10 23:23:30,734][26022] Updated weights on worker 0-0, policy_version 935999 (0.00090) [2022-07-10 23:23:31,257][25689] Fps is (10 sec: 5739.7, 60 sec: 5523.6, 300 sec: 5541.9). Total num frames: 958467072. Throughput: 0: 5834.9. Samples: 958467972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:31,258][25689] Avg episode reward: [(0, '0.377')] [2022-07-10 23:23:32,346][26022] Updated weights on worker 0-0, policy_version 936009 (0.00087) [2022-07-10 23:23:34,181][26022] Updated weights on worker 0-0, policy_version 936019 (0.00086) [2022-07-10 23:23:36,148][26022] Updated weights on worker 0-0, policy_version 936029 (0.00087) [2022-07-10 23:23:36,303][25689] Fps is (10 sec: 5635.1, 60 sec: 5523.7, 300 sec: 5534.4). Total num frames: 958493696. Throughput: 0: 5813.3. Samples: 958501268. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:36,304][25689] Avg episode reward: [(0, '1.282')] [2022-07-10 23:23:37,989][26022] Updated weights on worker 0-0, policy_version 936039 (0.00086) [2022-07-10 23:23:39,866][26022] Updated weights on worker 0-0, policy_version 936049 (0.00091) [2022-07-10 23:23:41,344][25689] Fps is (10 sec: 5482.0, 60 sec: 5544.0, 300 sec: 5534.1). Total num frames: 958522368. Throughput: 0: 4964.7. Samples: 958517942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:41,345][25689] Avg episode reward: [(0, '0.974')] [2022-07-10 23:23:41,684][26022] Updated weights on worker 0-0, policy_version 936059 (0.00089) [2022-07-10 23:23:43,572][26022] Updated weights on worker 0-0, policy_version 936069 (0.00084) [2022-07-10 23:23:45,375][26022] Updated weights on worker 0-0, policy_version 936079 (0.00089) [2022-07-10 23:23:46,357][25689] Fps is (10 sec: 5601.9, 60 sec: 5527.7, 300 sec: 5531.3). Total num frames: 958550016. Throughput: 0: 5792.9. Samples: 958551718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-10 23:23:46,357][25689] Avg episode reward: [(0, '0.795')] [2022-07-10 23:23:47,115][26022] Updated weights on worker 0-0, policy_version 936089 (0.00083) [2022-07-10 23:23:48,838][26022] Updated weights on worker 0-0, policy_version 936099 (0.00088) [2022-07-10 23:23:50,884][26022] Updated weights on worker 0-0, policy_version 936109 (0.00092) [2022-07-10 23:23:51,485][25689] Fps is (10 sec: 5553.7, 60 sec: 5524.2, 300 sec: 5535.9). Total num frames: 958578688. Throughput: 0: 5803.1. Samples: 958585356. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:23:51,486][25689] Avg episode reward: [(0, '0.876')] [2022-07-10 23:23:52,564][26022] Updated weights on worker 0-0, policy_version 936119 (0.00085) [2022-07-10 23:23:54,604][26022] Updated weights on worker 0-0, policy_version 936129 (0.00090) [2022-07-10 23:23:56,139][26022] Updated weights on worker 0-0, policy_version 936139 (0.00084) [2022-07-10 23:23:56,505][25689] Fps is (10 sec: 5650.8, 60 sec: 5556.5, 300 sec: 5539.1). Total num frames: 958607360. Throughput: 0: 4989.9. Samples: 958602072. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:23:56,505][25689] Avg episode reward: [(0, '1.032')] [2022-07-10 23:23:58,098][26022] Updated weights on worker 0-0, policy_version 936149 (0.00090) [2022-07-10 23:24:00,011][26022] Updated weights on worker 0-0, policy_version 936159 (0.00086) [2022-07-10 23:24:01,589][25689] Fps is (10 sec: 5675.5, 60 sec: 5554.4, 300 sec: 5544.6). Total num frames: 958636032. Throughput: 0: 5826.1. Samples: 958635890. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:01,590][25689] Avg episode reward: [(0, '0.092')] [2022-07-10 23:24:01,808][26022] Updated weights on worker 0-0, policy_version 936169 (0.00082) [2022-07-10 23:24:03,918][26022] Updated weights on worker 0-0, policy_version 936179 (0.00089) [2022-07-10 23:24:05,719][26022] Updated weights on worker 0-0, policy_version 936189 (0.00091) [2022-07-10 23:24:06,637][25689] Fps is (10 sec: 5255.3, 60 sec: 5534.2, 300 sec: 5534.4). Total num frames: 958660608. Throughput: 0: 5686.3. Samples: 958667038. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:06,638][25689] Avg episode reward: [(0, '-0.023')] [2022-07-10 23:24:07,555][26022] Updated weights on worker 0-0, policy_version 936199 (0.00084) [2022-07-10 23:24:09,498][26022] Updated weights on worker 0-0, policy_version 936209 (0.00090) [2022-07-10 23:24:11,155][26022] Updated weights on worker 0-0, policy_version 936219 (0.00084) [2022-07-10 23:24:11,755][25689] Fps is (10 sec: 5338.7, 60 sec: 5564.5, 300 sec: 5533.2). Total num frames: 958690304. Throughput: 0: 4875.4. Samples: 958684178. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:11,763][25689] Avg episode reward: [(0, '-0.731')] [2022-07-10 23:24:13,063][26022] Updated weights on worker 0-0, policy_version 936229 (0.00096) [2022-07-10 23:24:15,048][26022] Updated weights on worker 0-0, policy_version 936239 (0.00097) [2022-07-10 23:24:16,748][26022] Updated weights on worker 0-0, policy_version 936249 (0.00095) [2022-07-10 23:24:16,843][25689] Fps is (10 sec: 5718.9, 60 sec: 5576.1, 300 sec: 5538.9). Total num frames: 958718976. Throughput: 0: 5704.2. Samples: 958718086. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:16,844][25689] Avg episode reward: [(0, '-0.443')] [2022-07-10 23:24:18,500][26022] Updated weights on worker 0-0, policy_version 936259 (0.00087) [2022-07-10 23:24:20,376][26022] Updated weights on worker 0-0, policy_version 936269 (0.00085) [2022-07-10 23:24:21,862][25689] Fps is (10 sec: 5673.8, 60 sec: 5577.2, 300 sec: 5542.5). Total num frames: 958747648. Throughput: 0: 5728.2. Samples: 958752016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:21,862][25689] Avg episode reward: [(0, '-0.442')] [2022-07-10 23:24:22,172][26022] Updated weights on worker 0-0, policy_version 936279 (0.00092) [2022-07-10 23:24:23,967][26022] Updated weights on worker 0-0, policy_version 936289 (0.00084) [2022-07-10 23:24:25,754][26022] Updated weights on worker 0-0, policy_version 936299 (0.00093) [2022-07-10 23:24:26,938][25689] Fps is (10 sec: 5578.9, 60 sec: 5573.7, 300 sec: 5538.8). Total num frames: 958775296. Throughput: 0: 5021.1. Samples: 958768972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:26,939][25689] Avg episode reward: [(0, '-0.318')] [2022-07-10 23:24:27,622][26022] Updated weights on worker 0-0, policy_version 936309 (0.00087) [2022-07-10 23:24:29,746][26022] Updated weights on worker 0-0, policy_version 936319 (0.00095) [2022-07-10 23:24:31,375][26022] Updated weights on worker 0-0, policy_version 936329 (0.00086) [2022-07-10 23:24:32,091][25689] Fps is (10 sec: 5505.7, 60 sec: 5538.0, 300 sec: 5543.1). Total num frames: 958803968. Throughput: 0: 5810.2. Samples: 958802334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:32,091][25689] Avg episode reward: [(0, '-0.391')] [2022-07-10 23:24:33,173][26022] Updated weights on worker 0-0, policy_version 936339 (0.00092) [2022-07-10 23:24:34,997][26022] Updated weights on worker 0-0, policy_version 936349 (0.00088) [2022-07-10 23:24:36,836][26022] Updated weights on worker 0-0, policy_version 936359 (0.00107) [2022-07-10 23:24:37,139][25689] Fps is (10 sec: 5721.8, 60 sec: 5588.3, 300 sec: 5546.0). Total num frames: 958833664. Throughput: 0: 5804.2. Samples: 958835888. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:37,140][25689] Avg episode reward: [(0, '0.352')] [2022-07-10 23:24:38,672][26022] Updated weights on worker 0-0, policy_version 936369 (0.00089) [2022-07-10 23:24:40,553][26022] Updated weights on worker 0-0, policy_version 936379 (0.00086) [2022-07-10 23:24:42,168][25689] Fps is (10 sec: 5588.9, 60 sec: 5555.7, 300 sec: 5545.9). Total num frames: 958860288. Throughput: 0: 5799.7. Samples: 958869788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:42,168][25689] Avg episode reward: [(0, '0.166')] [2022-07-10 23:24:42,296][26022] Updated weights on worker 0-0, policy_version 936389 (0.00088) [2022-07-10 23:24:44,188][26022] Updated weights on worker 0-0, policy_version 936399 (0.00085) [2022-07-10 23:24:45,839][26022] Updated weights on worker 0-0, policy_version 936409 (0.00091) [2022-07-10 23:24:47,170][25689] Fps is (10 sec: 5512.5, 60 sec: 5573.5, 300 sec: 5546.7). Total num frames: 958888960. Throughput: 0: 5824.7. Samples: 958886818. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:47,171][25689] Avg episode reward: [(0, '-0.626')] [2022-07-10 23:24:47,849][26022] Updated weights on worker 0-0, policy_version 936419 (0.00091) [2022-07-10 23:24:49,551][26022] Updated weights on worker 0-0, policy_version 936429 (0.00088) [2022-07-10 23:24:51,494][26022] Updated weights on worker 0-0, policy_version 936439 (0.00089) [2022-07-10 23:24:52,215][25689] Fps is (10 sec: 5707.6, 60 sec: 5581.2, 300 sec: 5542.7). Total num frames: 958917632. Throughput: 0: 5867.9. Samples: 958920420. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:52,215][25689] Avg episode reward: [(0, '-0.484')] [2022-07-10 23:24:53,338][26022] Updated weights on worker 0-0, policy_version 936449 (0.00092) [2022-07-10 23:24:54,863][26022] Updated weights on worker 0-0, policy_version 936459 (0.00093) [2022-07-10 23:24:57,211][26022] Updated weights on worker 0-0, policy_version 936469 (0.00096) [2022-07-10 23:24:57,256][25689] Fps is (10 sec: 5584.3, 60 sec: 5562.4, 300 sec: 5549.4). Total num frames: 958945280. Throughput: 0: 5869.8. Samples: 958953968. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:24:57,256][25689] Avg episode reward: [(0, '0.297')] [2022-07-10 23:24:58,689][26022] Updated weights on worker 0-0, policy_version 936479 (0.00091) [2022-07-10 23:25:00,827][26022] Updated weights on worker 0-0, policy_version 936489 (0.00087) [2022-07-10 23:25:02,340][25689] Fps is (10 sec: 5360.4, 60 sec: 5528.7, 300 sec: 5548.0). Total num frames: 958971904. Throughput: 0: 5001.6. Samples: 958970678. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:02,340][25689] Avg episode reward: [(0, '-0.159')] [2022-07-10 23:25:02,613][26022] Updated weights on worker 0-0, policy_version 936499 (0.00093) [2022-07-10 23:25:04,688][26022] Updated weights on worker 0-0, policy_version 936509 (0.00088) [2022-07-10 23:25:06,487][26022] Updated weights on worker 0-0, policy_version 936519 (0.00088) [2022-07-10 23:25:07,354][25689] Fps is (10 sec: 5475.5, 60 sec: 5599.2, 300 sec: 5549.6). Total num frames: 959000576. Throughput: 0: 5700.6. Samples: 959001880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:07,355][25689] Avg episode reward: [(0, '-0.442')] [2022-07-10 23:25:08,680][26022] Updated weights on worker 0-0, policy_version 936529 (0.00089) [2022-07-10 23:25:10,075][26022] Updated weights on worker 0-0, policy_version 936539 (0.00105) [2022-07-10 23:25:12,445][25689] Fps is (10 sec: 5370.7, 60 sec: 5534.2, 300 sec: 5541.7). Total num frames: 959026176. Throughput: 0: 5664.7. Samples: 959035016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:12,445][25689] Avg episode reward: [(0, '0.144')] [2022-07-10 23:25:12,445][26022] Updated weights on worker 0-0, policy_version 936549 (0.00086) [2022-07-10 23:25:13,671][26022] Updated weights on worker 0-0, policy_version 936559 (0.00086) [2022-07-10 23:25:15,468][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:25:15,484][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000936567_959044608.pth [2022-07-10 23:25:15,484][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000934616_957046784.pth [2022-07-10 23:25:15,801][26022] Updated weights on worker 0-0, policy_version 936569 (0.00083) [2022-07-10 23:25:17,337][26022] Updated weights on worker 0-0, policy_version 936579 (0.00085) [2022-07-10 23:25:17,483][25689] Fps is (10 sec: 5560.2, 60 sec: 5572.5, 300 sec: 5551.5). Total num frames: 959056896. Throughput: 0: 4852.3. Samples: 959052120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:17,485][25689] Avg episode reward: [(0, '0.171')] [2022-07-10 23:25:19,479][26022] Updated weights on worker 0-0, policy_version 936589 (0.00429) [2022-07-10 23:25:21,144][26022] Updated weights on worker 0-0, policy_version 936599 (0.00086) [2022-07-10 23:25:22,513][25689] Fps is (10 sec: 5695.4, 60 sec: 5537.7, 300 sec: 5544.2). Total num frames: 959083520. Throughput: 0: 5716.7. Samples: 959086004. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:22,515][25689] Avg episode reward: [(0, '-0.107')] [2022-07-10 23:25:22,911][26022] Updated weights on worker 0-0, policy_version 936609 (0.00085) [2022-07-10 23:25:24,668][26022] Updated weights on worker 0-0, policy_version 936619 (0.00083) [2022-07-10 23:25:26,681][26022] Updated weights on worker 0-0, policy_version 936629 (0.00089) [2022-07-10 23:25:27,532][25689] Fps is (10 sec: 5503.1, 60 sec: 5560.0, 300 sec: 5548.2). Total num frames: 959112192. Throughput: 0: 5847.2. Samples: 959119860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:27,533][25689] Avg episode reward: [(0, '-0.190')] [2022-07-10 23:25:28,433][26022] Updated weights on worker 0-0, policy_version 936639 (0.00088) [2022-07-10 23:25:30,409][26022] Updated weights on worker 0-0, policy_version 936649 (0.00092) [2022-07-10 23:25:32,099][26022] Updated weights on worker 0-0, policy_version 936659 (0.00083) [2022-07-10 23:25:32,583][25689] Fps is (10 sec: 5694.4, 60 sec: 5569.2, 300 sec: 5547.5). Total num frames: 959140864. Throughput: 0: 5040.6. Samples: 959136528. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:32,584][25689] Avg episode reward: [(0, '0.369')] [2022-07-10 23:25:33,952][26022] Updated weights on worker 0-0, policy_version 936669 (0.00090) [2022-07-10 23:25:35,938][26022] Updated weights on worker 0-0, policy_version 936679 (0.00084) [2022-07-10 23:25:37,575][26022] Updated weights on worker 0-0, policy_version 936689 (0.00086) [2022-07-10 23:25:37,613][25689] Fps is (10 sec: 5687.8, 60 sec: 5554.0, 300 sec: 5550.4). Total num frames: 959169536. Throughput: 0: 5874.2. Samples: 959170368. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:37,615][25689] Avg episode reward: [(0, '0.536')] [2022-07-10 23:25:39,578][26022] Updated weights on worker 0-0, policy_version 936699 (0.00087) [2022-07-10 23:25:41,484][26022] Updated weights on worker 0-0, policy_version 936709 (0.00085) [2022-07-10 23:25:42,619][25689] Fps is (10 sec: 5612.0, 60 sec: 5573.0, 300 sec: 5547.2). Total num frames: 959197184. Throughput: 0: 5858.2. Samples: 959203788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:42,621][25689] Avg episode reward: [(0, '0.689')] [2022-07-10 23:25:43,274][26022] Updated weights on worker 0-0, policy_version 936719 (0.00088) [2022-07-10 23:25:45,078][26022] Updated weights on worker 0-0, policy_version 936729 (0.00089) [2022-07-10 23:25:46,835][26022] Updated weights on worker 0-0, policy_version 936739 (0.00094) [2022-07-10 23:25:47,632][25689] Fps is (10 sec: 5519.2, 60 sec: 5555.1, 300 sec: 5551.5). Total num frames: 959224832. Throughput: 0: 5014.4. Samples: 959220652. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:47,634][25689] Avg episode reward: [(0, '0.453')] [2022-07-10 23:25:48,688][26022] Updated weights on worker 0-0, policy_version 936749 (0.00095) [2022-07-10 23:25:50,606][26022] Updated weights on worker 0-0, policy_version 936759 (0.00086) [2022-07-10 23:25:52,433][26022] Updated weights on worker 0-0, policy_version 936769 (0.00086) [2022-07-10 23:25:52,734][25689] Fps is (10 sec: 5568.0, 60 sec: 5549.8, 300 sec: 5542.8). Total num frames: 959253504. Throughput: 0: 5828.3. Samples: 959253972. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:52,734][25689] Avg episode reward: [(0, '0.564')] [2022-07-10 23:25:54,136][26022] Updated weights on worker 0-0, policy_version 936779 (0.00087) [2022-07-10 23:25:55,851][26022] Updated weights on worker 0-0, policy_version 936789 (0.00095) [2022-07-10 23:25:57,755][25689] Fps is (10 sec: 5563.8, 60 sec: 5551.7, 300 sec: 5550.6). Total num frames: 959281152. Throughput: 0: 5819.8. Samples: 959287588. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:25:57,755][25689] Avg episode reward: [(0, '0.348')] [2022-07-10 23:25:57,822][26022] Updated weights on worker 0-0, policy_version 936799 (0.00091) [2022-07-10 23:25:59,730][26022] Updated weights on worker 0-0, policy_version 936809 (0.00087) [2022-07-10 23:26:01,515][26022] Updated weights on worker 0-0, policy_version 936819 (0.00494) [2022-07-10 23:26:02,783][25689] Fps is (10 sec: 5400.9, 60 sec: 5556.8, 300 sec: 5550.1). Total num frames: 959307776. Throughput: 0: 4990.1. Samples: 959304406. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:02,783][25689] Avg episode reward: [(0, '0.530')] [2022-07-10 23:26:03,668][26022] Updated weights on worker 0-0, policy_version 936829 (0.00049) [2022-07-10 23:26:05,326][26022] Updated weights on worker 0-0, policy_version 936839 (0.00052) [2022-07-10 23:26:07,450][26022] Updated weights on worker 0-0, policy_version 936849 (0.00104) [2022-07-10 23:26:07,795][25689] Fps is (10 sec: 5405.8, 60 sec: 5540.2, 300 sec: 5551.9). Total num frames: 959335424. Throughput: 0: 5754.7. Samples: 959336680. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:07,795][25689] Avg episode reward: [(0, '-0.033')] [2022-07-10 23:26:09,167][26022] Updated weights on worker 0-0, policy_version 936859 (0.00101) [2022-07-10 23:26:11,047][26022] Updated weights on worker 0-0, policy_version 936869 (0.00096) [2022-07-10 23:26:12,770][26022] Updated weights on worker 0-0, policy_version 936879 (0.00074) [2022-07-10 23:26:12,906][25689] Fps is (10 sec: 5563.6, 60 sec: 5589.1, 300 sec: 5553.6). Total num frames: 959364096. Throughput: 0: 5743.5. Samples: 959369828. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:12,906][25689] Avg episode reward: [(0, '0.029')] [2022-07-10 23:26:14,574][26022] Updated weights on worker 0-0, policy_version 936889 (0.00093) [2022-07-10 23:26:16,405][26022] Updated weights on worker 0-0, policy_version 936899 (0.00090) [2022-07-10 23:26:17,950][25689] Fps is (10 sec: 5646.5, 60 sec: 5554.6, 300 sec: 5553.0). Total num frames: 959392768. Throughput: 0: 4920.3. Samples: 959386956. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:17,951][25689] Avg episode reward: [(0, '-0.292')] [2022-07-10 23:26:18,296][26022] Updated weights on worker 0-0, policy_version 936909 (0.00082) [2022-07-10 23:26:20,050][26022] Updated weights on worker 0-0, policy_version 936919 (0.00093) [2022-07-10 23:26:22,085][26022] Updated weights on worker 0-0, policy_version 936929 (0.00095) [2022-07-10 23:26:22,958][25689] Fps is (10 sec: 5602.6, 60 sec: 5573.6, 300 sec: 5553.4). Total num frames: 959420416. Throughput: 0: 5756.5. Samples: 959420546. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:22,959][25689] Avg episode reward: [(0, '-0.296')] [2022-07-10 23:26:23,722][26022] Updated weights on worker 0-0, policy_version 936939 (0.00085) [2022-07-10 23:26:25,695][26022] Updated weights on worker 0-0, policy_version 936949 (0.00416) [2022-07-10 23:26:27,516][26022] Updated weights on worker 0-0, policy_version 936959 (0.00094) [2022-07-10 23:26:28,039][25689] Fps is (10 sec: 5480.8, 60 sec: 5550.9, 300 sec: 5549.4). Total num frames: 959448064. Throughput: 0: 5773.1. Samples: 959453556. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:28,040][25689] Avg episode reward: [(0, '-0.903')] [2022-07-10 23:26:29,257][26022] Updated weights on worker 0-0, policy_version 936969 (0.00083) [2022-07-10 23:26:31,270][26022] Updated weights on worker 0-0, policy_version 936979 (0.00083) [2022-07-10 23:26:33,077][26022] Updated weights on worker 0-0, policy_version 936989 (0.00084) [2022-07-10 23:26:33,115][25689] Fps is (10 sec: 5545.0, 60 sec: 5548.7, 300 sec: 5551.6). Total num frames: 959476736. Throughput: 0: 4958.7. Samples: 959470042. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:33,116][25689] Avg episode reward: [(0, '-0.616')] [2022-07-10 23:26:34,787][26022] Updated weights on worker 0-0, policy_version 936999 (0.00095) [2022-07-10 23:26:36,708][26022] Updated weights on worker 0-0, policy_version 937009 (0.00089) [2022-07-10 23:26:38,120][25689] Fps is (10 sec: 5688.5, 60 sec: 5551.0, 300 sec: 5552.1). Total num frames: 959505408. Throughput: 0: 5806.3. Samples: 959504068. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:38,121][25689] Avg episode reward: [(0, '-0.510')] [2022-07-10 23:26:38,335][26022] Updated weights on worker 0-0, policy_version 937019 (0.00100) [2022-07-10 23:26:40,454][26022] Updated weights on worker 0-0, policy_version 937029 (0.00081) [2022-07-10 23:26:42,101][26022] Updated weights on worker 0-0, policy_version 937039 (0.00087) [2022-07-10 23:26:43,142][25689] Fps is (10 sec: 5514.9, 60 sec: 5532.6, 300 sec: 5544.9). Total num frames: 959532032. Throughput: 0: 5786.9. Samples: 959537346. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:43,142][25689] Avg episode reward: [(0, '-0.260')] [2022-07-10 23:26:44,000][26022] Updated weights on worker 0-0, policy_version 937049 (0.00096) [2022-07-10 23:26:45,779][26022] Updated weights on worker 0-0, policy_version 937059 (0.00083) [2022-07-10 23:26:47,600][26022] Updated weights on worker 0-0, policy_version 937069 (0.00096) [2022-07-10 23:26:48,190][25689] Fps is (10 sec: 5389.7, 60 sec: 5529.4, 300 sec: 5541.5). Total num frames: 959559680. Throughput: 0: 5003.7. Samples: 959554382. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:48,190][25689] Avg episode reward: [(0, '-0.102')] [2022-07-10 23:26:49,405][26022] Updated weights on worker 0-0, policy_version 937079 (0.00083) [2022-07-10 23:26:51,304][26022] Updated weights on worker 0-0, policy_version 937089 (0.00087) [2022-07-10 23:26:53,041][26022] Updated weights on worker 0-0, policy_version 937099 (0.00084) [2022-07-10 23:26:53,270][25689] Fps is (10 sec: 5661.7, 60 sec: 5548.3, 300 sec: 5544.0). Total num frames: 959589376. Throughput: 0: 5852.4. Samples: 959587998. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:53,271][25689] Avg episode reward: [(0, '-0.798')] [2022-07-10 23:26:54,995][26022] Updated weights on worker 0-0, policy_version 937109 (0.00093) [2022-07-10 23:26:56,865][26022] Updated weights on worker 0-0, policy_version 937119 (0.00088) [2022-07-10 23:26:58,272][25689] Fps is (10 sec: 5789.2, 60 sec: 5567.0, 300 sec: 5547.8). Total num frames: 959618048. Throughput: 0: 5844.0. Samples: 959621836. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:26:58,272][25689] Avg episode reward: [(0, '-0.148')] [2022-07-10 23:26:58,638][26022] Updated weights on worker 0-0, policy_version 937129 (0.00088) [2022-07-10 23:27:00,482][26022] Updated weights on worker 0-0, policy_version 937139 (0.00087) [2022-07-10 23:27:02,699][26022] Updated weights on worker 0-0, policy_version 937149 (0.00088) [2022-07-10 23:27:03,283][25689] Fps is (10 sec: 5420.4, 60 sec: 5551.6, 300 sec: 5547.9). Total num frames: 959643648. Throughput: 0: 5755.6. Samples: 959653270. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:03,283][25689] Avg episode reward: [(0, '-0.019')] [2022-07-10 23:27:04,491][26022] Updated weights on worker 0-0, policy_version 937159 (0.00096) [2022-07-10 23:27:06,203][26022] Updated weights on worker 0-0, policy_version 937169 (0.00082) [2022-07-10 23:27:07,970][26022] Updated weights on worker 0-0, policy_version 937179 (0.00099) [2022-07-10 23:27:08,298][25689] Fps is (10 sec: 5515.3, 60 sec: 5585.1, 300 sec: 5552.2). Total num frames: 959673344. Throughput: 0: 5761.9. Samples: 959670244. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:08,298][25689] Avg episode reward: [(0, '-0.679')] [2022-07-10 23:27:09,952][26022] Updated weights on worker 0-0, policy_version 937189 (0.00085) [2022-07-10 23:27:11,793][26022] Updated weights on worker 0-0, policy_version 937199 (0.00086) [2022-07-10 23:27:13,361][25689] Fps is (10 sec: 5690.1, 60 sec: 5572.7, 300 sec: 5555.5). Total num frames: 959700992. Throughput: 0: 5771.6. Samples: 959703952. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:13,361][25689] Avg episode reward: [(0, '-1.004')] [2022-07-10 23:27:13,464][26022] Updated weights on worker 0-0, policy_version 937209 (0.00085) [2022-07-10 23:27:15,407][26022] Updated weights on worker 0-0, policy_version 937219 (0.00088) [2022-07-10 23:27:15,569][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:27:15,582][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000937220_959713280.pth [2022-07-10 23:27:15,583][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000935265_957711360.pth [2022-07-10 23:27:17,119][26022] Updated weights on worker 0-0, policy_version 937229 (0.00097) [2022-07-10 23:27:18,403][25689] Fps is (10 sec: 5573.3, 60 sec: 5572.9, 300 sec: 5555.1). Total num frames: 959729664. Throughput: 0: 5773.1. Samples: 959738056. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:18,404][25689] Avg episode reward: [(0, '-0.806')] [2022-07-10 23:27:18,978][26022] Updated weights on worker 0-0, policy_version 937239 (0.00084) [2022-07-10 23:27:20,882][26022] Updated weights on worker 0-0, policy_version 937249 (0.00080) [2022-07-10 23:27:22,469][26022] Updated weights on worker 0-0, policy_version 937259 (0.00086) [2022-07-10 23:27:23,466][25689] Fps is (10 sec: 5573.1, 60 sec: 5567.8, 300 sec: 5550.9). Total num frames: 959757312. Throughput: 0: 5045.7. Samples: 959755110. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:23,469][25689] Avg episode reward: [(0, '-1.188')] [2022-07-10 23:27:24,470][26022] Updated weights on worker 0-0, policy_version 937269 (0.00098) [2022-07-10 23:27:26,267][26022] Updated weights on worker 0-0, policy_version 937279 (0.00084) [2022-07-10 23:27:27,979][26022] Updated weights on worker 0-0, policy_version 937289 (0.00091) [2022-07-10 23:27:28,476][25689] Fps is (10 sec: 5489.8, 60 sec: 5574.4, 300 sec: 5549.1). Total num frames: 959784960. Throughput: 0: 5873.1. Samples: 959788750. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:28,476][25689] Avg episode reward: [(0, '-2.155')] [2022-07-10 23:27:29,994][26022] Updated weights on worker 0-0, policy_version 937299 (0.00082) [2022-07-10 23:27:31,838][26022] Updated weights on worker 0-0, policy_version 937309 (0.00089) [2022-07-10 23:27:33,514][26022] Updated weights on worker 0-0, policy_version 937319 (0.00088) [2022-07-10 23:27:33,580][25689] Fps is (10 sec: 5669.8, 60 sec: 5588.7, 300 sec: 5558.3). Total num frames: 959814656. Throughput: 0: 5860.6. Samples: 959822450. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:33,581][25689] Avg episode reward: [(0, '-2.383')] [2022-07-10 23:27:35,498][26022] Updated weights on worker 0-0, policy_version 937329 (0.00082) [2022-07-10 23:27:37,119][26022] Updated weights on worker 0-0, policy_version 937339 (0.00081) [2022-07-10 23:27:38,678][25689] Fps is (10 sec: 5620.5, 60 sec: 5563.2, 300 sec: 5558.0). Total num frames: 959842304. Throughput: 0: 5010.1. Samples: 959839636. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-10 23:27:38,679][25689] Avg episode reward: [(0, '-0.849')] [2022-07-10 23:27:39,168][26022] Updated weights on worker 0-0, policy_version 937349 (0.00093) [2022-07-10 23:27:40,890][26022] Updated weights on worker 0-0, policy_version 937359 (0.00091) [2022-07-10 23:27:42,728][26022] Updated weights on worker 0-0, policy_version 937369 (0.00091) [2022-07-10 23:27:43,738][25689] Fps is (10 sec: 5544.4, 60 sec: 5593.5, 300 sec: 5557.2). Total num frames: 959870976. Throughput: 0: 5832.8. Samples: 959873352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:27:43,739][25689] Avg episode reward: [(0, '-0.798')] [2022-07-10 23:27:44,522][26022] Updated weights on worker 0-0, policy_version 937379 (0.00087) [2022-07-10 23:27:46,298][26022] Updated weights on worker 0-0, policy_version 937389 (0.00084) [2022-07-10 23:27:48,191][26022] Updated weights on worker 0-0, policy_version 937399 (0.00096) [2022-07-10 23:27:48,778][25689] Fps is (10 sec: 5677.6, 60 sec: 5611.1, 300 sec: 5558.1). Total num frames: 959899648. Throughput: 0: 5836.4. Samples: 959907244. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:27:48,779][25689] Avg episode reward: [(0, '0.272')] [2022-07-10 23:27:50,025][26022] Updated weights on worker 0-0, policy_version 937409 (0.00086) [2022-07-10 23:27:51,737][26022] Updated weights on worker 0-0, policy_version 937419 (0.00089) [2022-07-10 23:27:53,578][26022] Updated weights on worker 0-0, policy_version 937429 (0.00086) [2022-07-10 23:27:53,930][25689] Fps is (10 sec: 5626.8, 60 sec: 5587.7, 300 sec: 5562.2). Total num frames: 959928320. Throughput: 0: 4993.1. Samples: 959924056. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:27:53,930][25689] Avg episode reward: [(0, '0.367')] [2022-07-10 23:27:55,524][26022] Updated weights on worker 0-0, policy_version 937439 (0.00089) [2022-07-10 23:27:57,200][26022] Updated weights on worker 0-0, policy_version 937449 (0.00085) [2022-07-10 23:27:59,017][25689] Fps is (10 sec: 5600.6, 60 sec: 5579.7, 300 sec: 5561.7). Total num frames: 959956992. Throughput: 0: 5802.7. Samples: 959957658. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:27:59,018][25689] Avg episode reward: [(0, '1.296')] [2022-07-10 23:27:59,121][26022] Updated weights on worker 0-0, policy_version 937459 (0.00097) [2022-07-10 23:28:00,845][26022] Updated weights on worker 0-0, policy_version 937469 (0.00087) [2022-07-10 23:28:02,967][26022] Updated weights on worker 0-0, policy_version 937479 (0.00086) [2022-07-10 23:28:04,068][25689] Fps is (10 sec: 5555.0, 60 sec: 5609.8, 300 sec: 5567.9). Total num frames: 959984640. Throughput: 0: 5724.1. Samples: 959989722. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:04,069][25689] Avg episode reward: [(0, '1.412')] [2022-07-10 23:28:04,867][26022] Updated weights on worker 0-0, policy_version 937489 (0.00096) [2022-07-10 23:28:06,639][26022] Updated weights on worker 0-0, policy_version 937499 (0.00088) [2022-07-10 23:28:08,631][26022] Updated weights on worker 0-0, policy_version 937509 (0.00084) [2022-07-10 23:28:09,126][25689] Fps is (10 sec: 5571.3, 60 sec: 5589.0, 300 sec: 5571.7). Total num frames: 960013312. Throughput: 0: 4888.4. Samples: 960006712. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:09,127][25689] Avg episode reward: [(0, '1.348')] [2022-07-10 23:28:10,356][26022] Updated weights on worker 0-0, policy_version 937519 (0.00094) [2022-07-10 23:28:12,266][26022] Updated weights on worker 0-0, policy_version 937529 (0.00088) [2022-07-10 23:28:14,097][26022] Updated weights on worker 0-0, policy_version 937539 (0.00088) [2022-07-10 23:28:14,196][25689] Fps is (10 sec: 5459.5, 60 sec: 5571.5, 300 sec: 5567.6). Total num frames: 960039936. Throughput: 0: 5714.3. Samples: 960039868. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:14,197][25689] Avg episode reward: [(0, '1.389')] [2022-07-10 23:28:15,825][26022] Updated weights on worker 0-0, policy_version 937549 (0.00089) [2022-07-10 23:28:17,623][26022] Updated weights on worker 0-0, policy_version 937559 (0.00088) [2022-07-10 23:28:19,230][25689] Fps is (10 sec: 5472.9, 60 sec: 5572.3, 300 sec: 5567.5). Total num frames: 960068608. Throughput: 0: 5753.2. Samples: 960073944. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:19,231][25689] Avg episode reward: [(0, '1.338')] [2022-07-10 23:28:19,387][26022] Updated weights on worker 0-0, policy_version 937569 (0.00084) [2022-07-10 23:28:21,270][26022] Updated weights on worker 0-0, policy_version 937579 (0.00098) [2022-07-10 23:28:23,105][26022] Updated weights on worker 0-0, policy_version 937589 (0.00083) [2022-07-10 23:28:24,282][25689] Fps is (10 sec: 5686.0, 60 sec: 5590.2, 300 sec: 5570.7). Total num frames: 960097280. Throughput: 0: 5002.7. Samples: 960090844. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:24,282][25689] Avg episode reward: [(0, '1.092')] [2022-07-10 23:28:24,915][26022] Updated weights on worker 0-0, policy_version 937599 (0.00084) [2022-07-10 23:28:26,753][26022] Updated weights on worker 0-0, policy_version 937609 (0.00084) [2022-07-10 23:28:28,606][26022] Updated weights on worker 0-0, policy_version 937619 (0.00085) [2022-07-10 23:28:29,334][25689] Fps is (10 sec: 5675.3, 60 sec: 5603.0, 300 sec: 5565.3). Total num frames: 960125952. Throughput: 0: 5819.9. Samples: 960124320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:29,335][25689] Avg episode reward: [(0, '0.739')] [2022-07-10 23:28:30,484][26022] Updated weights on worker 0-0, policy_version 937629 (0.00089) [2022-07-10 23:28:32,385][26022] Updated weights on worker 0-0, policy_version 937639 (0.00083) [2022-07-10 23:28:34,094][26022] Updated weights on worker 0-0, policy_version 937649 (0.00076) [2022-07-10 23:28:34,415][25689] Fps is (10 sec: 5557.7, 60 sec: 5571.5, 300 sec: 5568.1). Total num frames: 960153600. Throughput: 0: 5846.1. Samples: 960158068. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:34,416][25689] Avg episode reward: [(0, '0.140')] [2022-07-10 23:28:35,931][26022] Updated weights on worker 0-0, policy_version 937659 (0.00083) [2022-07-10 23:28:37,611][26022] Updated weights on worker 0-0, policy_version 937669 (0.00094) [2022-07-10 23:28:39,354][26022] Updated weights on worker 0-0, policy_version 937679 (0.00086) [2022-07-10 23:28:39,444][25689] Fps is (10 sec: 5672.1, 60 sec: 5611.6, 300 sec: 5571.7). Total num frames: 960183296. Throughput: 0: 5854.4. Samples: 960192284. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:39,445][25689] Avg episode reward: [(0, '-0.628')] [2022-07-10 23:28:41,432][26022] Updated weights on worker 0-0, policy_version 937689 (0.00079) [2022-07-10 23:28:43,247][26022] Updated weights on worker 0-0, policy_version 937699 (0.00087) [2022-07-10 23:28:44,505][25689] Fps is (10 sec: 5784.8, 60 sec: 5611.5, 300 sec: 5574.2). Total num frames: 960211968. Throughput: 0: 5848.3. Samples: 960209118. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:44,506][25689] Avg episode reward: [(0, '-0.526')] [2022-07-10 23:28:44,861][26022] Updated weights on worker 0-0, policy_version 937709 (0.00086) [2022-07-10 23:28:46,814][26022] Updated weights on worker 0-0, policy_version 937719 (0.00107) [2022-07-10 23:28:48,744][26022] Updated weights on worker 0-0, policy_version 937729 (0.00106) [2022-07-10 23:28:49,533][25689] Fps is (10 sec: 5683.6, 60 sec: 5612.6, 300 sec: 5576.1). Total num frames: 960240640. Throughput: 0: 5881.1. Samples: 960243114. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:49,534][25689] Avg episode reward: [(0, '-0.400')] [2022-07-10 23:28:50,380][26022] Updated weights on worker 0-0, policy_version 937739 (0.00088) [2022-07-10 23:28:52,352][26022] Updated weights on worker 0-0, policy_version 937749 (0.00087) [2022-07-10 23:28:53,950][26022] Updated weights on worker 0-0, policy_version 937759 (0.00104) [2022-07-10 23:28:54,616][25689] Fps is (10 sec: 5570.4, 60 sec: 5602.1, 300 sec: 5571.5). Total num frames: 960268288. Throughput: 0: 5877.0. Samples: 960276786. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:54,616][25689] Avg episode reward: [(0, '-0.973')] [2022-07-10 23:28:55,845][26022] Updated weights on worker 0-0, policy_version 937769 (0.00087) [2022-07-10 23:28:57,623][26022] Updated weights on worker 0-0, policy_version 937779 (0.00088) [2022-07-10 23:28:59,289][26022] Updated weights on worker 0-0, policy_version 937789 (0.00098) [2022-07-10 23:28:59,657][25689] Fps is (10 sec: 5664.6, 60 sec: 5623.3, 300 sec: 5575.8). Total num frames: 960297984. Throughput: 0: 5042.5. Samples: 960294206. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:28:59,657][25689] Avg episode reward: [(0, '-0.681')] [2022-07-10 23:29:01,215][26022] Updated weights on worker 0-0, policy_version 937799 (0.00086) [2022-07-10 23:29:03,272][26022] Updated weights on worker 0-0, policy_version 937809 (0.00124) [2022-07-10 23:29:04,729][25689] Fps is (10 sec: 5366.6, 60 sec: 5570.7, 300 sec: 5575.3). Total num frames: 960322560. Throughput: 0: 5799.8. Samples: 960326410. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:04,729][25689] Avg episode reward: [(0, '0.228')] [2022-07-10 23:29:05,231][26022] Updated weights on worker 0-0, policy_version 937819 (0.00087) [2022-07-10 23:29:06,988][26022] Updated weights on worker 0-0, policy_version 937829 (0.00086) [2022-07-10 23:29:08,714][26022] Updated weights on worker 0-0, policy_version 937839 (0.00086) [2022-07-10 23:29:09,770][25689] Fps is (10 sec: 5366.1, 60 sec: 5589.1, 300 sec: 5576.7). Total num frames: 960352256. Throughput: 0: 5806.5. Samples: 960360620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:09,771][25689] Avg episode reward: [(0, '0.741')] [2022-07-10 23:29:10,655][26022] Updated weights on worker 0-0, policy_version 937849 (0.00094) [2022-07-10 23:29:12,476][26022] Updated weights on worker 0-0, policy_version 937859 (0.00081) [2022-07-10 23:29:14,225][26022] Updated weights on worker 0-0, policy_version 937869 (0.00090) [2022-07-10 23:29:14,869][25689] Fps is (10 sec: 5856.9, 60 sec: 5637.1, 300 sec: 5580.0). Total num frames: 960381952. Throughput: 0: 4967.5. Samples: 960377396. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:14,870][25689] Avg episode reward: [(0, '0.126')] [2022-07-10 23:29:15,623][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:29:15,634][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000937877_960386048.pth [2022-07-10 23:29:15,635][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000935915_958376960.pth [2022-07-10 23:29:16,090][26022] Updated weights on worker 0-0, policy_version 937879 (0.00098) [2022-07-10 23:29:17,703][26022] Updated weights on worker 0-0, policy_version 937889 (0.00088) [2022-07-10 23:29:19,644][26022] Updated weights on worker 0-0, policy_version 937899 (0.00087) [2022-07-10 23:29:19,928][25689] Fps is (10 sec: 5645.4, 60 sec: 5617.8, 300 sec: 5575.8). Total num frames: 960409600. Throughput: 0: 5795.4. Samples: 960411688. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:19,930][25689] Avg episode reward: [(0, '0.596')] [2022-07-10 23:29:21,431][26022] Updated weights on worker 0-0, policy_version 937909 (0.00085) [2022-07-10 23:29:23,196][26022] Updated weights on worker 0-0, policy_version 937919 (0.00085) [2022-07-10 23:29:25,003][25689] Fps is (10 sec: 5557.8, 60 sec: 5615.7, 300 sec: 5579.3). Total num frames: 960438272. Throughput: 0: 5895.6. Samples: 960445938. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:25,004][25689] Avg episode reward: [(0, '0.277')] [2022-07-10 23:29:25,022][26022] Updated weights on worker 0-0, policy_version 937929 (0.00085) [2022-07-10 23:29:26,723][26022] Updated weights on worker 0-0, policy_version 937939 (0.00079) [2022-07-10 23:29:28,664][26022] Updated weights on worker 0-0, policy_version 937949 (0.00093) [2022-07-10 23:29:30,013][25689] Fps is (10 sec: 5686.2, 60 sec: 5619.6, 300 sec: 5581.9). Total num frames: 960466944. Throughput: 0: 5057.4. Samples: 960462994. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:30,015][25689] Avg episode reward: [(0, '0.129')] [2022-07-10 23:29:30,512][26022] Updated weights on worker 0-0, policy_version 937959 (0.00086) [2022-07-10 23:29:32,399][26022] Updated weights on worker 0-0, policy_version 937969 (0.00083) [2022-07-10 23:29:34,115][26022] Updated weights on worker 0-0, policy_version 937979 (0.00047) [2022-07-10 23:29:35,088][25689] Fps is (10 sec: 5584.5, 60 sec: 5620.2, 300 sec: 5574.5). Total num frames: 960494592. Throughput: 0: 5882.7. Samples: 960496336. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:35,088][25689] Avg episode reward: [(0, '-0.486')] [2022-07-10 23:29:35,907][26022] Updated weights on worker 0-0, policy_version 937989 (0.00054) [2022-07-10 23:29:38,028][26022] Updated weights on worker 0-0, policy_version 937999 (0.00092) [2022-07-10 23:29:39,557][26022] Updated weights on worker 0-0, policy_version 938009 (0.00084) [2022-07-10 23:29:40,112][25689] Fps is (10 sec: 5678.2, 60 sec: 5620.6, 300 sec: 5584.9). Total num frames: 960524288. Throughput: 0: 5884.2. Samples: 960530454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:40,112][25689] Avg episode reward: [(0, '-0.475')] [2022-07-10 23:29:41,586][26022] Updated weights on worker 0-0, policy_version 938019 (0.00086) [2022-07-10 23:29:43,271][26022] Updated weights on worker 0-0, policy_version 938029 (0.00095) [2022-07-10 23:29:45,114][25689] Fps is (10 sec: 5719.6, 60 sec: 5609.2, 300 sec: 5581.5). Total num frames: 960551936. Throughput: 0: 5025.2. Samples: 960547000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:45,114][26022] Updated weights on worker 0-0, policy_version 938039 (0.00075) [2022-07-10 23:29:45,114][25689] Avg episode reward: [(0, '0.001')] [2022-07-10 23:29:46,969][26022] Updated weights on worker 0-0, policy_version 938049 (0.00087) [2022-07-10 23:29:48,832][26022] Updated weights on worker 0-0, policy_version 938059 (0.00087) [2022-07-10 23:29:50,134][25689] Fps is (10 sec: 5619.5, 60 sec: 5610.0, 300 sec: 5582.0). Total num frames: 960580608. Throughput: 0: 5852.7. Samples: 960580758. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:50,135][25689] Avg episode reward: [(0, '-0.000')] [2022-07-10 23:29:50,570][26022] Updated weights on worker 0-0, policy_version 938069 (0.00086) [2022-07-10 23:29:52,347][26022] Updated weights on worker 0-0, policy_version 938079 (0.00093) [2022-07-10 23:29:54,086][26022] Updated weights on worker 0-0, policy_version 938089 (0.00086) [2022-07-10 23:29:55,251][25689] Fps is (10 sec: 5555.6, 60 sec: 5606.8, 300 sec: 5580.5). Total num frames: 960608256. Throughput: 0: 5870.5. Samples: 960614706. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:29:55,252][25689] Avg episode reward: [(0, '0.407')] [2022-07-10 23:29:56,054][26022] Updated weights on worker 0-0, policy_version 938099 (0.00092) [2022-07-10 23:29:57,763][26022] Updated weights on worker 0-0, policy_version 938109 (0.00095) [2022-07-10 23:29:59,628][26022] Updated weights on worker 0-0, policy_version 938119 (0.00091) [2022-07-10 23:30:00,322][25689] Fps is (10 sec: 5729.3, 60 sec: 5620.9, 300 sec: 5594.5). Total num frames: 960638976. Throughput: 0: 5019.7. Samples: 960631902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:00,323][25689] Avg episode reward: [(0, '1.241')] [2022-07-10 23:30:01,398][26022] Updated weights on worker 0-0, policy_version 938129 (0.00079) [2022-07-10 23:30:03,579][26022] Updated weights on worker 0-0, policy_version 938139 (0.00106) [2022-07-10 23:30:05,360][25689] Fps is (10 sec: 5470.1, 60 sec: 5624.0, 300 sec: 5580.4). Total num frames: 960663552. Throughput: 0: 5770.4. Samples: 960663830. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:05,360][25689] Avg episode reward: [(0, '0.972')] [2022-07-10 23:30:05,451][26022] Updated weights on worker 0-0, policy_version 938149 (0.00085) [2022-07-10 23:30:07,221][26022] Updated weights on worker 0-0, policy_version 938159 (0.00097) [2022-07-10 23:30:08,892][26022] Updated weights on worker 0-0, policy_version 938169 (0.00103) [2022-07-10 23:30:10,379][25689] Fps is (10 sec: 5294.7, 60 sec: 5609.3, 300 sec: 5592.0). Total num frames: 960692224. Throughput: 0: 5774.8. Samples: 960697666. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:10,379][25689] Avg episode reward: [(0, '0.947')] [2022-07-10 23:30:10,901][26022] Updated weights on worker 0-0, policy_version 938179 (0.00090) [2022-07-10 23:30:12,614][26022] Updated weights on worker 0-0, policy_version 938189 (0.00087) [2022-07-10 23:30:14,639][26022] Updated weights on worker 0-0, policy_version 938199 (0.00087) [2022-07-10 23:30:15,423][25689] Fps is (10 sec: 5698.7, 60 sec: 5597.4, 300 sec: 5585.0). Total num frames: 960720896. Throughput: 0: 4945.4. Samples: 960714462. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:15,423][25689] Avg episode reward: [(0, '0.519')] [2022-07-10 23:30:16,370][26022] Updated weights on worker 0-0, policy_version 938209 (0.00078) [2022-07-10 23:30:18,216][26022] Updated weights on worker 0-0, policy_version 938219 (0.00091) [2022-07-10 23:30:20,123][26022] Updated weights on worker 0-0, policy_version 938229 (0.00087) [2022-07-10 23:30:20,436][25689] Fps is (10 sec: 5498.4, 60 sec: 5584.8, 300 sec: 5585.3). Total num frames: 960747520. Throughput: 0: 5773.8. Samples: 960748034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:20,436][25689] Avg episode reward: [(0, '0.435')] [2022-07-10 23:30:21,793][26022] Updated weights on worker 0-0, policy_version 938239 (0.00088) [2022-07-10 23:30:23,772][26022] Updated weights on worker 0-0, policy_version 938249 (0.00082) [2022-07-10 23:30:25,441][25689] Fps is (10 sec: 5519.3, 60 sec: 5591.1, 300 sec: 5585.6). Total num frames: 960776192. Throughput: 0: 5889.0. Samples: 960782090. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:25,442][25689] Avg episode reward: [(0, '0.374')] [2022-07-10 23:30:25,475][26022] Updated weights on worker 0-0, policy_version 938259 (0.00106) [2022-07-10 23:30:27,327][26022] Updated weights on worker 0-0, policy_version 938269 (0.00090) [2022-07-10 23:30:29,027][26022] Updated weights on worker 0-0, policy_version 938279 (0.00088) [2022-07-10 23:30:30,451][25689] Fps is (10 sec: 5725.7, 60 sec: 5591.2, 300 sec: 5586.4). Total num frames: 960804864. Throughput: 0: 5047.6. Samples: 960798982. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:30,451][25689] Avg episode reward: [(0, '-0.585')] [2022-07-10 23:30:30,787][26022] Updated weights on worker 0-0, policy_version 938289 (0.00085) [2022-07-10 23:30:32,740][26022] Updated weights on worker 0-0, policy_version 938299 (0.00089) [2022-07-10 23:30:34,482][26022] Updated weights on worker 0-0, policy_version 938309 (0.00090) [2022-07-10 23:30:35,547][25689] Fps is (10 sec: 5775.5, 60 sec: 5623.1, 300 sec: 5588.6). Total num frames: 960834560. Throughput: 0: 5869.2. Samples: 960832578. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:35,548][25689] Avg episode reward: [(0, '-0.539')] [2022-07-10 23:30:36,420][26022] Updated weights on worker 0-0, policy_version 938319 (0.00082) [2022-07-10 23:30:38,071][26022] Updated weights on worker 0-0, policy_version 938329 (0.00084) [2022-07-10 23:30:40,068][26022] Updated weights on worker 0-0, policy_version 938339 (0.00095) [2022-07-10 23:30:40,592][25689] Fps is (10 sec: 5553.4, 60 sec: 5570.3, 300 sec: 5584.4). Total num frames: 960861184. Throughput: 0: 5885.9. Samples: 960866674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:40,593][25689] Avg episode reward: [(0, '-0.879')] [2022-07-10 23:30:41,742][26022] Updated weights on worker 0-0, policy_version 938349 (0.00085) [2022-07-10 23:30:43,868][26022] Updated weights on worker 0-0, policy_version 938359 (0.00086) [2022-07-10 23:30:45,679][25689] Fps is (10 sec: 5457.8, 60 sec: 5579.4, 300 sec: 5586.5). Total num frames: 960889856. Throughput: 0: 5841.9. Samples: 960900316. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:45,679][25689] Avg episode reward: [(0, '-0.326')] [2022-07-10 23:30:45,716][26022] Updated weights on worker 0-0, policy_version 938370 (0.00087) [2022-07-10 23:30:47,297][26022] Updated weights on worker 0-0, policy_version 938380 (0.00091) [2022-07-10 23:30:49,409][26022] Updated weights on worker 0-0, policy_version 938390 (0.00083) [2022-07-10 23:30:50,751][25689] Fps is (10 sec: 5745.6, 60 sec: 5591.6, 300 sec: 5590.4). Total num frames: 960919552. Throughput: 0: 5821.2. Samples: 960917154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:50,751][25689] Avg episode reward: [(0, '-0.021')] [2022-07-10 23:30:51,151][26022] Updated weights on worker 0-0, policy_version 938400 (0.00084) [2022-07-10 23:30:53,106][26022] Updated weights on worker 0-0, policy_version 938410 (0.00085) [2022-07-10 23:30:54,718][26022] Updated weights on worker 0-0, policy_version 938420 (0.00087) [2022-07-10 23:30:55,863][25689] Fps is (10 sec: 5630.5, 60 sec: 5592.0, 300 sec: 5588.7). Total num frames: 960947200. Throughput: 0: 5824.0. Samples: 960950898. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:30:55,864][25689] Avg episode reward: [(0, '-0.276')] [2022-07-10 23:30:56,542][26022] Updated weights on worker 0-0, policy_version 938430 (0.00086) [2022-07-10 23:30:58,433][26022] Updated weights on worker 0-0, policy_version 938440 (0.00087) [2022-07-10 23:31:00,347][26022] Updated weights on worker 0-0, policy_version 938450 (0.00090) [2022-07-10 23:31:00,892][25689] Fps is (10 sec: 5654.5, 60 sec: 5579.0, 300 sec: 5599.0). Total num frames: 960976896. Throughput: 0: 5811.9. Samples: 960984656. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:00,893][25689] Avg episode reward: [(0, '1.011')] [2022-07-10 23:31:02,350][26022] Updated weights on worker 0-0, policy_version 938460 (0.00098) [2022-07-10 23:31:04,415][26022] Updated weights on worker 0-0, policy_version 938470 (0.00092) [2022-07-10 23:31:05,914][25689] Fps is (10 sec: 5603.5, 60 sec: 5614.3, 300 sec: 5595.4). Total num frames: 961003520. Throughput: 0: 4893.8. Samples: 960999344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:05,915][25689] Avg episode reward: [(0, '0.727')] [2022-07-10 23:31:05,923][26022] Updated weights on worker 0-0, policy_version 938480 (0.00081) [2022-07-10 23:31:07,928][26022] Updated weights on worker 0-0, policy_version 938490 (0.00085) [2022-07-10 23:31:09,737][26022] Updated weights on worker 0-0, policy_version 938500 (0.00084) [2022-07-10 23:31:10,942][25689] Fps is (10 sec: 5196.4, 60 sec: 5562.7, 300 sec: 5586.6). Total num frames: 961029120. Throughput: 0: 5752.8. Samples: 961033310. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:10,943][25689] Avg episode reward: [(0, '0.453')] [2022-07-10 23:31:11,472][26022] Updated weights on worker 0-0, policy_version 938510 (0.00085) [2022-07-10 23:31:13,289][26022] Updated weights on worker 0-0, policy_version 938520 (0.00093) [2022-07-10 23:31:15,234][26022] Updated weights on worker 0-0, policy_version 938530 (0.00088) [2022-07-10 23:31:15,642][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:31:15,654][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000938534_961058816.pth [2022-07-10 23:31:15,654][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000936567_959044608.pth [2022-07-10 23:31:16,071][25689] Fps is (10 sec: 5545.1, 60 sec: 5588.7, 300 sec: 5591.9). Total num frames: 961059840. Throughput: 0: 5747.4. Samples: 961067040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:16,071][25689] Avg episode reward: [(0, '0.378')] [2022-07-10 23:31:16,981][26022] Updated weights on worker 0-0, policy_version 938540 (0.00095) [2022-07-10 23:31:18,686][26022] Updated weights on worker 0-0, policy_version 938550 (0.00084) [2022-07-10 23:31:20,603][26022] Updated weights on worker 0-0, policy_version 938560 (0.00086) [2022-07-10 23:31:21,169][25689] Fps is (10 sec: 5707.6, 60 sec: 5597.7, 300 sec: 5590.3). Total num frames: 961087488. Throughput: 0: 4903.3. Samples: 961084078. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:21,169][25689] Avg episode reward: [(0, '0.503')] [2022-07-10 23:31:22,491][26022] Updated weights on worker 0-0, policy_version 938570 (0.00059) [2022-07-10 23:31:24,367][26022] Updated weights on worker 0-0, policy_version 938580 (0.00097) [2022-07-10 23:31:25,984][26022] Updated weights on worker 0-0, policy_version 938590 (0.00107) [2022-07-10 23:31:26,190][25689] Fps is (10 sec: 5565.5, 60 sec: 5596.3, 300 sec: 5594.8). Total num frames: 961116160. Throughput: 0: 5855.8. Samples: 961118078. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:26,191][25689] Avg episode reward: [(0, '0.269')] [2022-07-10 23:31:28,007][26022] Updated weights on worker 0-0, policy_version 938600 (0.00063) [2022-07-10 23:31:29,847][26022] Updated weights on worker 0-0, policy_version 938610 (0.00084) [2022-07-10 23:31:31,203][25689] Fps is (10 sec: 5613.0, 60 sec: 5579.2, 300 sec: 5592.6). Total num frames: 961143808. Throughput: 0: 5827.2. Samples: 961151370. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-10 23:31:31,203][25689] Avg episode reward: [(0, '0.056')] [2022-07-10 23:31:31,687][26022] Updated weights on worker 0-0, policy_version 938620 (0.00063) [2022-07-10 23:31:33,472][26022] Updated weights on worker 0-0, policy_version 938630 (0.00096) [2022-07-10 23:31:35,422][26022] Updated weights on worker 0-0, policy_version 938640 (0.00090) [2022-07-10 23:31:36,336][25689] Fps is (10 sec: 5450.2, 60 sec: 5542.1, 300 sec: 5586.7). Total num frames: 961171456. Throughput: 0: 4978.4. Samples: 961167928. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:31:36,337][25689] Avg episode reward: [(0, '0.265')] [2022-07-10 23:31:37,087][26022] Updated weights on worker 0-0, policy_version 938650 (0.00085) [2022-07-10 23:31:39,135][26022] Updated weights on worker 0-0, policy_version 938660 (0.00100) [2022-07-10 23:31:40,747][26022] Updated weights on worker 0-0, policy_version 938670 (0.00090) [2022-07-10 23:31:41,371][25689] Fps is (10 sec: 5539.0, 60 sec: 5576.7, 300 sec: 5593.3). Total num frames: 961200128. Throughput: 0: 5819.8. Samples: 961201648. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:31:41,371][25689] Avg episode reward: [(0, '0.786')] [2022-07-10 23:31:42,624][26022] Updated weights on worker 0-0, policy_version 938680 (0.00086) [2022-07-10 23:31:44,467][26022] Updated weights on worker 0-0, policy_version 938690 (0.00087) [2022-07-10 23:31:46,112][26022] Updated weights on worker 0-0, policy_version 938700 (0.00088) [2022-07-10 23:31:46,460][25689] Fps is (10 sec: 5866.5, 60 sec: 5610.1, 300 sec: 5602.9). Total num frames: 961230848. Throughput: 0: 5777.6. Samples: 961235188. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:31:46,461][25689] Avg episode reward: [(0, '-0.179')] [2022-07-10 23:31:48,061][26022] Updated weights on worker 0-0, policy_version 938710 (0.00769) [2022-07-10 23:31:49,825][26022] Updated weights on worker 0-0, policy_version 938720 (0.00089) [2022-07-10 23:31:51,510][25689] Fps is (10 sec: 5554.6, 60 sec: 5544.8, 300 sec: 5589.7). Total num frames: 961256448. Throughput: 0: 4963.8. Samples: 961252172. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:31:51,512][25689] Avg episode reward: [(0, '-0.504')] [2022-07-10 23:31:51,772][26022] Updated weights on worker 0-0, policy_version 938730 (0.00085) [2022-07-10 23:31:53,532][26022] Updated weights on worker 0-0, policy_version 938740 (0.00088) [2022-07-10 23:31:55,567][26022] Updated weights on worker 0-0, policy_version 938750 (0.00088) [2022-07-10 23:31:56,579][25689] Fps is (10 sec: 5464.8, 60 sec: 5582.5, 300 sec: 5591.9). Total num frames: 961286144. Throughput: 0: 5829.8. Samples: 961285938. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:31:56,579][25689] Avg episode reward: [(0, '-0.358')] [2022-07-10 23:31:56,947][26022] Updated weights on worker 0-0, policy_version 938760 (0.00088) [2022-07-10 23:31:59,090][26022] Updated weights on worker 0-0, policy_version 938770 (0.00113) [2022-07-10 23:32:00,582][26022] Updated weights on worker 0-0, policy_version 938780 (0.00086) [2022-07-10 23:32:01,625][25689] Fps is (10 sec: 5568.0, 60 sec: 5530.3, 300 sec: 5594.7). Total num frames: 961312768. Throughput: 0: 5851.6. Samples: 961320168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:01,625][25689] Avg episode reward: [(0, '-0.862')] [2022-07-10 23:32:03,037][26022] Updated weights on worker 0-0, policy_version 938790 (0.00086) [2022-07-10 23:32:04,675][26022] Updated weights on worker 0-0, policy_version 938800 (0.00081) [2022-07-10 23:32:06,568][26022] Updated weights on worker 0-0, policy_version 938810 (0.00090) [2022-07-10 23:32:06,646][25689] Fps is (10 sec: 5492.8, 60 sec: 5564.2, 300 sec: 5591.1). Total num frames: 961341440. Throughput: 0: 4953.5. Samples: 961335178. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:06,646][25689] Avg episode reward: [(0, '-1.011')] [2022-07-10 23:32:08,391][26022] Updated weights on worker 0-0, policy_version 938820 (0.00094) [2022-07-10 23:32:10,062][26022] Updated weights on worker 0-0, policy_version 938830 (0.00085) [2022-07-10 23:32:11,660][25689] Fps is (10 sec: 5714.5, 60 sec: 5616.0, 300 sec: 5595.5). Total num frames: 961370112. Throughput: 0: 5805.1. Samples: 961369144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:11,660][25689] Avg episode reward: [(0, '0.161')] [2022-07-10 23:32:12,191][26022] Updated weights on worker 0-0, policy_version 938840 (0.00091) [2022-07-10 23:32:13,714][26022] Updated weights on worker 0-0, policy_version 938850 (0.00081) [2022-07-10 23:32:15,552][26022] Updated weights on worker 0-0, policy_version 938860 (0.00095) [2022-07-10 23:32:16,798][25689] Fps is (10 sec: 5547.3, 60 sec: 5564.5, 300 sec: 5590.2). Total num frames: 961397760. Throughput: 0: 5786.7. Samples: 961402944. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:16,799][25689] Avg episode reward: [(0, '0.251')] [2022-07-10 23:32:17,469][26022] Updated weights on worker 0-0, policy_version 938870 (0.00084) [2022-07-10 23:32:19,043][26022] Updated weights on worker 0-0, policy_version 938880 (0.00092) [2022-07-10 23:32:21,104][26022] Updated weights on worker 0-0, policy_version 938890 (0.00079) [2022-07-10 23:32:21,820][25689] Fps is (10 sec: 5643.7, 60 sec: 5605.2, 300 sec: 5597.9). Total num frames: 961427456. Throughput: 0: 5791.4. Samples: 961437130. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:21,821][25689] Avg episode reward: [(0, '0.643')] [2022-07-10 23:32:22,732][26022] Updated weights on worker 0-0, policy_version 938900 (0.00083) [2022-07-10 23:32:24,587][26022] Updated weights on worker 0-0, policy_version 938910 (0.00086) [2022-07-10 23:32:26,400][26022] Updated weights on worker 0-0, policy_version 938920 (0.00087) [2022-07-10 23:32:26,875][25689] Fps is (10 sec: 5792.4, 60 sec: 5602.2, 300 sec: 5600.4). Total num frames: 961456128. Throughput: 0: 5885.1. Samples: 961454230. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:26,876][25689] Avg episode reward: [(0, '0.422')] [2022-07-10 23:32:28,358][26022] Updated weights on worker 0-0, policy_version 938930 (0.00086) [2022-07-10 23:32:30,097][26022] Updated weights on worker 0-0, policy_version 938940 (0.00092) [2022-07-10 23:32:31,893][25689] Fps is (10 sec: 5591.3, 60 sec: 5601.6, 300 sec: 5595.2). Total num frames: 961483776. Throughput: 0: 5878.0. Samples: 961488076. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:31,894][25689] Avg episode reward: [(0, '1.212')] [2022-07-10 23:32:32,134][26022] Updated weights on worker 0-0, policy_version 938950 (0.00083) [2022-07-10 23:32:33,651][26022] Updated weights on worker 0-0, policy_version 938960 (0.00982) [2022-07-10 23:32:35,687][26022] Updated weights on worker 0-0, policy_version 938970 (0.00086) [2022-07-10 23:32:36,953][25689] Fps is (10 sec: 5791.5, 60 sec: 5659.1, 300 sec: 5606.2). Total num frames: 961514496. Throughput: 0: 5916.5. Samples: 961522190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:36,954][25689] Avg episode reward: [(0, '0.999')] [2022-07-10 23:32:37,105][26022] Updated weights on worker 0-0, policy_version 938980 (0.00063) [2022-07-10 23:32:39,183][26022] Updated weights on worker 0-0, policy_version 938990 (0.00086) [2022-07-10 23:32:41,054][26022] Updated weights on worker 0-0, policy_version 939000 (0.00091) [2022-07-10 23:32:42,010][25689] Fps is (10 sec: 5769.5, 60 sec: 5640.2, 300 sec: 5602.8). Total num frames: 961542144. Throughput: 0: 5071.7. Samples: 961539524. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:42,010][25689] Avg episode reward: [(0, '0.936')] [2022-07-10 23:32:42,599][26022] Updated weights on worker 0-0, policy_version 939010 (0.00095) [2022-07-10 23:32:44,615][26022] Updated weights on worker 0-0, policy_version 939020 (0.00085) [2022-07-10 23:32:46,174][26022] Updated weights on worker 0-0, policy_version 939030 (0.00094) [2022-07-10 23:32:47,040][25689] Fps is (10 sec: 5583.3, 60 sec: 5611.8, 300 sec: 5603.0). Total num frames: 961570816. Throughput: 0: 5903.1. Samples: 961573268. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:47,041][25689] Avg episode reward: [(0, '1.242')] [2022-07-10 23:32:48,291][26022] Updated weights on worker 0-0, policy_version 939040 (0.00090) [2022-07-10 23:32:50,040][26022] Updated weights on worker 0-0, policy_version 939050 (0.00089) [2022-07-10 23:32:51,743][26022] Updated weights on worker 0-0, policy_version 939060 (0.00090) [2022-07-10 23:32:52,054][25689] Fps is (10 sec: 5505.5, 60 sec: 5632.2, 300 sec: 5598.7). Total num frames: 961597440. Throughput: 0: 5906.6. Samples: 961607156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:52,054][25689] Avg episode reward: [(0, '1.345')] [2022-07-10 23:32:53,531][26022] Updated weights on worker 0-0, policy_version 939070 (0.00080) [2022-07-10 23:32:55,579][26022] Updated weights on worker 0-0, policy_version 939080 (0.00082) [2022-07-10 23:32:57,121][25689] Fps is (10 sec: 5688.6, 60 sec: 5649.2, 300 sec: 5606.0). Total num frames: 961628160. Throughput: 0: 5050.8. Samples: 961624054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:32:57,122][25689] Avg episode reward: [(0, '-0.032')] [2022-07-10 23:32:57,132][26022] Updated weights on worker 0-0, policy_version 939090 (0.00089) [2022-07-10 23:32:59,131][26022] Updated weights on worker 0-0, policy_version 939100 (0.00088) [2022-07-10 23:33:01,007][26022] Updated weights on worker 0-0, policy_version 939110 (0.00087) [2022-07-10 23:33:02,203][25689] Fps is (10 sec: 5650.2, 60 sec: 5645.9, 300 sec: 5602.0). Total num frames: 961654784. Throughput: 0: 5867.6. Samples: 961658008. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:02,203][25689] Avg episode reward: [(0, '0.015')] [2022-07-10 23:33:03,161][26022] Updated weights on worker 0-0, policy_version 939120 (0.00088) [2022-07-10 23:33:04,998][26022] Updated weights on worker 0-0, policy_version 939130 (0.00083) [2022-07-10 23:33:06,655][26022] Updated weights on worker 0-0, policy_version 939140 (0.00087) [2022-07-10 23:33:07,211][25689] Fps is (10 sec: 5277.5, 60 sec: 5613.3, 300 sec: 5596.1). Total num frames: 961681408. Throughput: 0: 5767.7. Samples: 961689604. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:07,213][25689] Avg episode reward: [(0, '-0.014')] [2022-07-10 23:33:08,575][26022] Updated weights on worker 0-0, policy_version 939150 (0.00088) [2022-07-10 23:33:10,397][26022] Updated weights on worker 0-0, policy_version 939160 (0.00082) [2022-07-10 23:33:12,120][26022] Updated weights on worker 0-0, policy_version 939170 (0.00092) [2022-07-10 23:33:12,222][25689] Fps is (10 sec: 5519.1, 60 sec: 5613.6, 300 sec: 5604.0). Total num frames: 961710080. Throughput: 0: 4926.2. Samples: 961706508. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:12,223][25689] Avg episode reward: [(0, '-0.078')] [2022-07-10 23:33:14,038][26022] Updated weights on worker 0-0, policy_version 939180 (0.00110) [2022-07-10 23:33:15,929][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:33:15,938][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000939190_961730560.pth [2022-07-10 23:33:15,940][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000937220_959713280.pth [2022-07-10 23:33:15,947][26022] Updated weights on worker 0-0, policy_version 939190 (0.00085) [2022-07-10 23:33:17,271][25689] Fps is (10 sec: 5699.9, 60 sec: 5638.8, 300 sec: 5603.7). Total num frames: 961738752. Throughput: 0: 5784.9. Samples: 961740620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:17,273][25689] Avg episode reward: [(0, '-0.423')] [2022-07-10 23:33:17,573][26022] Updated weights on worker 0-0, policy_version 939200 (0.00091) [2022-07-10 23:33:19,668][26022] Updated weights on worker 0-0, policy_version 939210 (0.00086) [2022-07-10 23:33:21,363][26022] Updated weights on worker 0-0, policy_version 939220 (0.00089) [2022-07-10 23:33:22,281][25689] Fps is (10 sec: 5700.7, 60 sec: 5623.0, 300 sec: 5604.5). Total num frames: 961767424. Throughput: 0: 5785.4. Samples: 961774168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:22,281][25689] Avg episode reward: [(0, '-0.659')] [2022-07-10 23:33:23,157][26022] Updated weights on worker 0-0, policy_version 939230 (0.00083) [2022-07-10 23:33:24,897][26022] Updated weights on worker 0-0, policy_version 939240 (0.00082) [2022-07-10 23:33:26,774][26022] Updated weights on worker 0-0, policy_version 939250 (0.00090) [2022-07-10 23:33:27,290][25689] Fps is (10 sec: 5621.3, 60 sec: 5610.2, 300 sec: 5601.9). Total num frames: 961795072. Throughput: 0: 5067.1. Samples: 961791348. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:27,292][25689] Avg episode reward: [(0, '0.542')] [2022-07-10 23:33:28,632][26022] Updated weights on worker 0-0, policy_version 939260 (0.00085) [2022-07-10 23:33:30,439][26022] Updated weights on worker 0-0, policy_version 939270 (0.00086) [2022-07-10 23:33:32,296][25689] Fps is (10 sec: 5418.8, 60 sec: 5594.4, 300 sec: 5599.9). Total num frames: 961821696. Throughput: 0: 5888.0. Samples: 961824708. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:32,297][25689] Avg episode reward: [(0, '0.388')] [2022-07-10 23:33:32,407][26022] Updated weights on worker 0-0, policy_version 939280 (0.00088) [2022-07-10 23:33:34,014][26022] Updated weights on worker 0-0, policy_version 939290 (0.00088) [2022-07-10 23:33:36,016][26022] Updated weights on worker 0-0, policy_version 939300 (0.00092) [2022-07-10 23:33:37,414][25689] Fps is (10 sec: 5664.3, 60 sec: 5589.1, 300 sec: 5601.6). Total num frames: 961852416. Throughput: 0: 5861.0. Samples: 961858676. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:37,417][25689] Avg episode reward: [(0, '0.389')] [2022-07-10 23:33:37,620][26022] Updated weights on worker 0-0, policy_version 939310 (0.00081) [2022-07-10 23:33:39,611][26022] Updated weights on worker 0-0, policy_version 939320 (0.00096) [2022-07-10 23:33:41,322][26022] Updated weights on worker 0-0, policy_version 939330 (0.00086) [2022-07-10 23:33:42,431][25689] Fps is (10 sec: 5658.1, 60 sec: 5575.8, 300 sec: 5595.6). Total num frames: 961879040. Throughput: 0: 5033.9. Samples: 961875602. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:42,433][25689] Avg episode reward: [(0, '0.480')] [2022-07-10 23:33:43,059][26022] Updated weights on worker 0-0, policy_version 939340 (0.00081) [2022-07-10 23:33:44,823][26022] Updated weights on worker 0-0, policy_version 939350 (0.00103) [2022-07-10 23:33:46,699][26022] Updated weights on worker 0-0, policy_version 939360 (0.00089) [2022-07-10 23:33:47,434][25689] Fps is (10 sec: 5416.5, 60 sec: 5561.4, 300 sec: 5592.6). Total num frames: 961906688. Throughput: 0: 5874.2. Samples: 961909674. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:47,434][25689] Avg episode reward: [(0, '0.911')] [2022-07-10 23:33:48,315][26022] Updated weights on worker 0-0, policy_version 939370 (0.00095) [2022-07-10 23:33:50,580][26022] Updated weights on worker 0-0, policy_version 939380 (0.00092) [2022-07-10 23:33:52,218][26022] Updated weights on worker 0-0, policy_version 939390 (0.00083) [2022-07-10 23:33:52,485][25689] Fps is (10 sec: 5703.6, 60 sec: 5608.7, 300 sec: 5600.1). Total num frames: 961936384. Throughput: 0: 5861.5. Samples: 961943044. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:52,486][25689] Avg episode reward: [(0, '0.848')] [2022-07-10 23:33:54,090][26022] Updated weights on worker 0-0, policy_version 939400 (0.00102) [2022-07-10 23:33:55,838][26022] Updated weights on worker 0-0, policy_version 939410 (0.00436) [2022-07-10 23:33:57,593][25689] Fps is (10 sec: 5745.1, 60 sec: 5571.1, 300 sec: 5595.4). Total num frames: 961965056. Throughput: 0: 5019.8. Samples: 961959974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:33:57,595][25689] Avg episode reward: [(0, '0.737')] [2022-07-10 23:33:57,782][26022] Updated weights on worker 0-0, policy_version 939420 (0.00081) [2022-07-10 23:33:59,421][26022] Updated weights on worker 0-0, policy_version 939430 (0.00092) [2022-07-10 23:34:01,567][26022] Updated weights on worker 0-0, policy_version 939440 (0.00091) [2022-07-10 23:34:02,681][25689] Fps is (10 sec: 5423.6, 60 sec: 5570.6, 300 sec: 5602.0). Total num frames: 961991680. Throughput: 0: 5855.7. Samples: 961994176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:02,681][25689] Avg episode reward: [(0, '1.193')] [2022-07-10 23:34:03,357][26022] Updated weights on worker 0-0, policy_version 939450 (0.00086) [2022-07-10 23:34:05,306][26022] Updated weights on worker 0-0, policy_version 939460 (0.00087) [2022-07-10 23:34:07,057][26022] Updated weights on worker 0-0, policy_version 939470 (0.00088) [2022-07-10 23:34:07,703][25689] Fps is (10 sec: 5368.4, 60 sec: 5586.2, 300 sec: 5595.5). Total num frames: 962019328. Throughput: 0: 5737.6. Samples: 962025970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:07,704][25689] Avg episode reward: [(0, '1.007')] [2022-07-10 23:34:08,903][26022] Updated weights on worker 0-0, policy_version 939480 (0.00082) [2022-07-10 23:34:10,858][26022] Updated weights on worker 0-0, policy_version 939490 (0.00088) [2022-07-10 23:34:12,528][26022] Updated weights on worker 0-0, policy_version 939500 (0.00086) [2022-07-10 23:34:12,727][25689] Fps is (10 sec: 5606.1, 60 sec: 5585.0, 300 sec: 5593.4). Total num frames: 962048000. Throughput: 0: 4929.6. Samples: 962042828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:12,727][25689] Avg episode reward: [(0, '-0.148')] [2022-07-10 23:34:14,435][26022] Updated weights on worker 0-0, policy_version 939510 (0.00092) [2022-07-10 23:34:16,113][26022] Updated weights on worker 0-0, policy_version 939520 (0.00095) [2022-07-10 23:34:17,787][25689] Fps is (10 sec: 5787.9, 60 sec: 5600.9, 300 sec: 5600.3). Total num frames: 962077696. Throughput: 0: 5800.7. Samples: 962077114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:17,788][25689] Avg episode reward: [(0, '-0.129')] [2022-07-10 23:34:17,914][26022] Updated weights on worker 0-0, policy_version 939530 (0.00088) [2022-07-10 23:34:20,070][26022] Updated weights on worker 0-0, policy_version 939540 (0.00087) [2022-07-10 23:34:21,371][26022] Updated weights on worker 0-0, policy_version 939550 (0.00078) [2022-07-10 23:34:22,792][25689] Fps is (10 sec: 5697.1, 60 sec: 5584.4, 300 sec: 5598.2). Total num frames: 962105344. Throughput: 0: 5825.1. Samples: 962111328. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:22,793][25689] Avg episode reward: [(0, '-0.812')] [2022-07-10 23:34:23,454][26022] Updated weights on worker 0-0, policy_version 939560 (0.00080) [2022-07-10 23:34:25,081][26022] Updated weights on worker 0-0, policy_version 939570 (0.00093) [2022-07-10 23:34:26,881][26022] Updated weights on worker 0-0, policy_version 939580 (0.00087) [2022-07-10 23:34:27,816][25689] Fps is (10 sec: 5718.0, 60 sec: 5616.9, 300 sec: 5601.3). Total num frames: 962135040. Throughput: 0: 5096.4. Samples: 962128474. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:27,817][25689] Avg episode reward: [(0, '-0.687')] [2022-07-10 23:34:28,918][26022] Updated weights on worker 0-0, policy_version 939590 (0.00094) [2022-07-10 23:34:30,472][26022] Updated weights on worker 0-0, policy_version 939600 (0.00098) [2022-07-10 23:34:32,565][26022] Updated weights on worker 0-0, policy_version 939610 (0.00086) [2022-07-10 23:34:32,834][25689] Fps is (10 sec: 5710.3, 60 sec: 5632.7, 300 sec: 5602.4). Total num frames: 962162688. Throughput: 0: 5957.4. Samples: 962162618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:32,835][25689] Avg episode reward: [(0, '-0.463')] [2022-07-10 23:34:34,284][26022] Updated weights on worker 0-0, policy_version 939620 (0.00084) [2022-07-10 23:34:36,059][26022] Updated weights on worker 0-0, policy_version 939630 (0.00089) [2022-07-10 23:34:37,864][26022] Updated weights on worker 0-0, policy_version 939640 (0.00078) [2022-07-10 23:34:37,973][25689] Fps is (10 sec: 5545.0, 60 sec: 5596.9, 300 sec: 5596.8). Total num frames: 962191360. Throughput: 0: 5905.7. Samples: 962196324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:37,973][25689] Avg episode reward: [(0, '-0.205')] [2022-07-10 23:34:39,704][26022] Updated weights on worker 0-0, policy_version 939650 (0.00084) [2022-07-10 23:34:41,532][26022] Updated weights on worker 0-0, policy_version 939660 (0.00087) [2022-07-10 23:34:43,012][25689] Fps is (10 sec: 5634.5, 60 sec: 5628.7, 300 sec: 5599.6). Total num frames: 962220032. Throughput: 0: 5036.2. Samples: 962213158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:43,012][25689] Avg episode reward: [(0, '0.555')] [2022-07-10 23:34:43,431][26022] Updated weights on worker 0-0, policy_version 939670 (0.00088) [2022-07-10 23:34:45,055][26022] Updated weights on worker 0-0, policy_version 939680 (0.00078) [2022-07-10 23:34:46,835][26022] Updated weights on worker 0-0, policy_version 939690 (0.00087) [2022-07-10 23:34:48,023][25689] Fps is (10 sec: 5705.9, 60 sec: 5644.8, 300 sec: 5599.7). Total num frames: 962248704. Throughput: 0: 5870.1. Samples: 962247090. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:48,023][25689] Avg episode reward: [(0, '0.637')] [2022-07-10 23:34:48,848][26022] Updated weights on worker 0-0, policy_version 939700 (0.00085) [2022-07-10 23:34:50,429][26022] Updated weights on worker 0-0, policy_version 939710 (0.00085) [2022-07-10 23:34:52,387][26022] Updated weights on worker 0-0, policy_version 939720 (0.00077) [2022-07-10 23:34:53,093][25689] Fps is (10 sec: 5586.8, 60 sec: 5609.3, 300 sec: 5600.6). Total num frames: 962276352. Throughput: 0: 5839.3. Samples: 962280912. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:53,093][25689] Avg episode reward: [(0, '1.431')] [2022-07-10 23:34:54,132][26022] Updated weights on worker 0-0, policy_version 939730 (0.00084) [2022-07-10 23:34:56,005][26022] Updated weights on worker 0-0, policy_version 939740 (0.00091) [2022-07-10 23:34:57,775][26022] Updated weights on worker 0-0, policy_version 939750 (0.00084) [2022-07-10 23:34:58,155][25689] Fps is (10 sec: 5659.7, 60 sec: 5630.5, 300 sec: 5597.3). Total num frames: 962306048. Throughput: 0: 5879.4. Samples: 962314982. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:34:58,155][25689] Avg episode reward: [(0, '1.463')] [2022-07-10 23:34:59,727][26022] Updated weights on worker 0-0, policy_version 939760 (0.00087) [2022-07-10 23:35:01,380][26022] Updated weights on worker 0-0, policy_version 939770 (0.00092) [2022-07-10 23:35:03,169][25689] Fps is (10 sec: 5386.0, 60 sec: 5603.4, 300 sec: 5597.8). Total num frames: 962330624. Throughput: 0: 5900.3. Samples: 962332092. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:35:03,170][25689] Avg episode reward: [(0, '1.483')] [2022-07-10 23:35:03,850][26022] Updated weights on worker 0-0, policy_version 939780 (0.00095) [2022-07-10 23:35:05,304][26022] Updated weights on worker 0-0, policy_version 939790 (0.00096) [2022-07-10 23:35:07,305][26022] Updated weights on worker 0-0, policy_version 939800 (0.00091) [2022-07-10 23:35:08,181][25689] Fps is (10 sec: 5413.1, 60 sec: 5638.3, 300 sec: 5601.4). Total num frames: 962360320. Throughput: 0: 5787.0. Samples: 962363744. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:35:08,181][25689] Avg episode reward: [(0, '1.464')] [2022-07-10 23:35:08,901][26022] Updated weights on worker 0-0, policy_version 939810 (0.00090) [2022-07-10 23:35:11,007][26022] Updated weights on worker 0-0, policy_version 939820 (0.00088) [2022-07-10 23:35:12,427][26022] Updated weights on worker 0-0, policy_version 939830 (0.00083) [2022-07-10 23:35:13,211][25689] Fps is (10 sec: 5710.7, 60 sec: 5620.8, 300 sec: 5598.2). Total num frames: 962387968. Throughput: 0: 5807.8. Samples: 962397754. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:35:13,211][25689] Avg episode reward: [(0, '0.519')] [2022-07-10 23:35:14,459][26022] Updated weights on worker 0-0, policy_version 939840 (0.00078) [2022-07-10 23:35:16,013][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:35:16,022][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000939849_962405376.pth [2022-07-10 23:35:16,023][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000937877_960386048.pth [2022-07-10 23:35:16,213][26022] Updated weights on worker 0-0, policy_version 939850 (0.00081) [2022-07-10 23:35:18,174][26022] Updated weights on worker 0-0, policy_version 939860 (0.00086) [2022-07-10 23:35:18,302][25689] Fps is (10 sec: 5564.5, 60 sec: 5601.0, 300 sec: 5603.6). Total num frames: 962416640. Throughput: 0: 4949.0. Samples: 962414690. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:35:18,304][25689] Avg episode reward: [(0, '0.682')] [2022-07-10 23:35:19,899][26022] Updated weights on worker 0-0, policy_version 939870 (0.00091) [2022-07-10 23:35:21,654][26022] Updated weights on worker 0-0, policy_version 939880 (0.00083) [2022-07-10 23:35:23,336][25689] Fps is (10 sec: 5764.7, 60 sec: 5632.2, 300 sec: 5606.5). Total num frames: 962446336. Throughput: 0: 5770.5. Samples: 962448464. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-10 23:35:23,336][25689] Avg episode reward: [(0, '0.389')] [2022-07-10 23:35:23,535][26022] Updated weights on worker 0-0, policy_version 939890 (0.00085) [2022-07-10 23:35:25,338][26022] Updated weights on worker 0-0, policy_version 939900 (0.00079) [2022-07-10 23:35:27,232][26022] Updated weights on worker 0-0, policy_version 939910 (0.00096) [2022-07-10 23:35:28,407][25689] Fps is (10 sec: 5776.4, 60 sec: 5610.9, 300 sec: 5605.3). Total num frames: 962475008. Throughput: 0: 5889.5. Samples: 962482864. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:28,407][25689] Avg episode reward: [(0, '0.068')] [2022-07-10 23:35:28,772][26022] Updated weights on worker 0-0, policy_version 939920 (0.00091) [2022-07-10 23:35:30,977][26022] Updated weights on worker 0-0, policy_version 939930 (0.00093) [2022-07-10 23:35:32,409][26022] Updated weights on worker 0-0, policy_version 939940 (0.00086) [2022-07-10 23:35:33,427][25689] Fps is (10 sec: 5581.2, 60 sec: 5610.7, 300 sec: 5599.9). Total num frames: 962502656. Throughput: 0: 5044.1. Samples: 962499726. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:33,427][25689] Avg episode reward: [(0, '0.130')] [2022-07-10 23:35:34,453][26022] Updated weights on worker 0-0, policy_version 939950 (0.00096) [2022-07-10 23:35:36,331][26022] Updated weights on worker 0-0, policy_version 939960 (0.00088) [2022-07-10 23:35:37,936][26022] Updated weights on worker 0-0, policy_version 939970 (0.00085) [2022-07-10 23:35:38,481][25689] Fps is (10 sec: 5691.9, 60 sec: 5635.4, 300 sec: 5610.0). Total num frames: 962532352. Throughput: 0: 5893.2. Samples: 962533610. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:38,482][25689] Avg episode reward: [(0, '0.527')] [2022-07-10 23:35:39,768][26022] Updated weights on worker 0-0, policy_version 939980 (0.00094) [2022-07-10 23:35:41,744][26022] Updated weights on worker 0-0, policy_version 939990 (0.00084) [2022-07-10 23:35:43,210][26022] Updated weights on worker 0-0, policy_version 940000 (0.00801) [2022-07-10 23:35:43,496][25689] Fps is (10 sec: 5898.5, 60 sec: 5654.6, 300 sec: 5614.8). Total num frames: 962562048. Throughput: 0: 5931.9. Samples: 962568052. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:43,496][25689] Avg episode reward: [(0, '0.567')] [2022-07-10 23:35:45,259][26022] Updated weights on worker 0-0, policy_version 940010 (0.00218) [2022-07-10 23:35:46,695][26022] Updated weights on worker 0-0, policy_version 940020 (0.00087) [2022-07-10 23:35:48,506][25689] Fps is (10 sec: 5617.9, 60 sec: 5620.8, 300 sec: 5605.7). Total num frames: 962588672. Throughput: 0: 5100.8. Samples: 962585390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:48,507][25689] Avg episode reward: [(0, '0.591')] [2022-07-10 23:35:48,748][26022] Updated weights on worker 0-0, policy_version 940030 (0.00087) [2022-07-10 23:35:50,622][26022] Updated weights on worker 0-0, policy_version 940040 (0.00086) [2022-07-10 23:35:52,214][26022] Updated weights on worker 0-0, policy_version 940050 (0.00086) [2022-07-10 23:35:53,535][25689] Fps is (10 sec: 5508.1, 60 sec: 5641.6, 300 sec: 5610.7). Total num frames: 962617344. Throughput: 0: 5942.8. Samples: 962619224. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:53,536][25689] Avg episode reward: [(0, '0.425')] [2022-07-10 23:35:54,241][26022] Updated weights on worker 0-0, policy_version 940060 (0.00083) [2022-07-10 23:35:56,004][26022] Updated weights on worker 0-0, policy_version 940070 (0.00095) [2022-07-10 23:35:57,791][26022] Updated weights on worker 0-0, policy_version 940080 (0.00081) [2022-07-10 23:35:58,632][25689] Fps is (10 sec: 5764.4, 60 sec: 5638.3, 300 sec: 5609.4). Total num frames: 962647040. Throughput: 0: 5948.1. Samples: 962653468. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:35:58,632][25689] Avg episode reward: [(0, '-0.018')] [2022-07-10 23:35:59,731][26022] Updated weights on worker 0-0, policy_version 940090 (0.00085) [2022-07-10 23:36:01,268][26022] Updated weights on worker 0-0, policy_version 940100 (0.00085) [2022-07-10 23:36:03,664][25689] Fps is (10 sec: 5459.1, 60 sec: 5653.6, 300 sec: 5605.8). Total num frames: 962672640. Throughput: 0: 5088.9. Samples: 962670686. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:03,665][25689] Avg episode reward: [(0, '0.034')] [2022-07-10 23:36:03,670][26022] Updated weights on worker 0-0, policy_version 940110 (0.00085) [2022-07-10 23:36:05,307][26022] Updated weights on worker 0-0, policy_version 940120 (0.00084) [2022-07-10 23:36:07,116][26022] Updated weights on worker 0-0, policy_version 940130 (0.00084) [2022-07-10 23:36:08,702][25689] Fps is (10 sec: 5389.3, 60 sec: 5634.2, 300 sec: 5615.9). Total num frames: 962701312. Throughput: 0: 5830.6. Samples: 962703144. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:08,703][25689] Avg episode reward: [(0, '-0.540')] [2022-07-10 23:36:08,874][26022] Updated weights on worker 0-0, policy_version 940140 (0.00089) [2022-07-10 23:36:10,704][26022] Updated weights on worker 0-0, policy_version 940150 (0.00093) [2022-07-10 23:36:12,474][26022] Updated weights on worker 0-0, policy_version 940160 (0.00089) [2022-07-10 23:36:13,709][25689] Fps is (10 sec: 5708.6, 60 sec: 5653.3, 300 sec: 5611.3). Total num frames: 962729984. Throughput: 0: 5840.2. Samples: 962737046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:13,710][25689] Avg episode reward: [(0, '0.154')] [2022-07-10 23:36:14,302][26022] Updated weights on worker 0-0, policy_version 940170 (0.00099) [2022-07-10 23:36:16,226][26022] Updated weights on worker 0-0, policy_version 940180 (0.00090) [2022-07-10 23:36:17,999][26022] Updated weights on worker 0-0, policy_version 940190 (0.00098) [2022-07-10 23:36:18,770][25689] Fps is (10 sec: 5594.2, 60 sec: 5639.2, 300 sec: 5612.0). Total num frames: 962757632. Throughput: 0: 4995.5. Samples: 962754066. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:18,770][25689] Avg episode reward: [(0, '0.421')] [2022-07-10 23:36:19,911][26022] Updated weights on worker 0-0, policy_version 940200 (0.00082) [2022-07-10 23:36:21,514][26022] Updated weights on worker 0-0, policy_version 940210 (0.00085) [2022-07-10 23:36:23,379][26022] Updated weights on worker 0-0, policy_version 940220 (0.00088) [2022-07-10 23:36:23,825][25689] Fps is (10 sec: 5567.6, 60 sec: 5620.3, 300 sec: 5611.4). Total num frames: 962786304. Throughput: 0: 5817.0. Samples: 962787962. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:23,826][25689] Avg episode reward: [(0, '0.537')] [2022-07-10 23:36:25,267][26022] Updated weights on worker 0-0, policy_version 940230 (0.00091) [2022-07-10 23:36:27,081][26022] Updated weights on worker 0-0, policy_version 940240 (0.00082) [2022-07-10 23:36:28,679][26022] Updated weights on worker 0-0, policy_version 940250 (0.00086) [2022-07-10 23:36:28,844][25689] Fps is (10 sec: 5794.1, 60 sec: 5642.1, 300 sec: 5618.2). Total num frames: 962816000. Throughput: 0: 5913.2. Samples: 962822242. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:28,844][25689] Avg episode reward: [(0, '0.743')] [2022-07-10 23:36:30,625][26022] Updated weights on worker 0-0, policy_version 940260 (0.00082) [2022-07-10 23:36:32,291][26022] Updated weights on worker 0-0, policy_version 940270 (0.00090) [2022-07-10 23:36:33,906][25689] Fps is (10 sec: 5688.5, 60 sec: 5638.2, 300 sec: 5619.5). Total num frames: 962843648. Throughput: 0: 5050.8. Samples: 962839054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:33,906][25689] Avg episode reward: [(0, '1.594')] [2022-07-10 23:36:34,269][26022] Updated weights on worker 0-0, policy_version 940280 (0.00085) [2022-07-10 23:36:36,039][26022] Updated weights on worker 0-0, policy_version 940290 (0.00087) [2022-07-10 23:36:38,038][26022] Updated weights on worker 0-0, policy_version 940300 (0.00084) [2022-07-10 23:36:39,019][25689] Fps is (10 sec: 5635.6, 60 sec: 5632.7, 300 sec: 5621.5). Total num frames: 962873344. Throughput: 0: 5877.7. Samples: 962873084. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:39,020][25689] Avg episode reward: [(0, '1.415')] [2022-07-10 23:36:39,861][26022] Updated weights on worker 0-0, policy_version 940310 (0.00087) [2022-07-10 23:36:41,672][26022] Updated weights on worker 0-0, policy_version 940320 (0.00079) [2022-07-10 23:36:43,420][26022] Updated weights on worker 0-0, policy_version 940330 (0.00098) [2022-07-10 23:36:44,087][25689] Fps is (10 sec: 5733.2, 60 sec: 5610.9, 300 sec: 5615.0). Total num frames: 962902016. Throughput: 0: 5896.0. Samples: 962907424. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:44,087][25689] Avg episode reward: [(0, '1.432')] [2022-07-10 23:36:45,088][26022] Updated weights on worker 0-0, policy_version 940340 (0.00362) [2022-07-10 23:36:47,006][26022] Updated weights on worker 0-0, policy_version 940350 (0.00091) [2022-07-10 23:36:48,679][26022] Updated weights on worker 0-0, policy_version 940360 (0.00088) [2022-07-10 23:36:49,152][25689] Fps is (10 sec: 5557.9, 60 sec: 5622.7, 300 sec: 5621.6). Total num frames: 962929664. Throughput: 0: 5030.9. Samples: 962924414. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:49,153][25689] Avg episode reward: [(0, '1.718')] [2022-07-10 23:36:50,647][26022] Updated weights on worker 0-0, policy_version 940370 (0.00099) [2022-07-10 23:36:52,403][26022] Updated weights on worker 0-0, policy_version 940380 (0.00102) [2022-07-10 23:36:54,175][25689] Fps is (10 sec: 5582.5, 60 sec: 5623.2, 300 sec: 5619.0). Total num frames: 962958336. Throughput: 0: 5855.9. Samples: 962957750. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:54,176][25689] Avg episode reward: [(0, '1.503')] [2022-07-10 23:36:54,284][26022] Updated weights on worker 0-0, policy_version 940390 (0.00099) [2022-07-10 23:36:56,107][26022] Updated weights on worker 0-0, policy_version 940400 (0.00077) [2022-07-10 23:36:57,658][26022] Updated weights on worker 0-0, policy_version 940410 (0.00082) [2022-07-10 23:36:59,283][25689] Fps is (10 sec: 5660.5, 60 sec: 5605.3, 300 sec: 5624.7). Total num frames: 962987008. Throughput: 0: 5869.5. Samples: 962992024. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:36:59,285][25689] Avg episode reward: [(0, '0.777')] [2022-07-10 23:36:59,764][26022] Updated weights on worker 0-0, policy_version 940420 (0.00081) [2022-07-10 23:37:01,380][26022] Updated weights on worker 0-0, policy_version 940430 (0.00094) [2022-07-10 23:37:03,678][26022] Updated weights on worker 0-0, policy_version 940440 (0.00088) [2022-07-10 23:37:04,360][25689] Fps is (10 sec: 5530.0, 60 sec: 5634.9, 300 sec: 5620.2). Total num frames: 963014656. Throughput: 0: 5743.3. Samples: 963023858. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:04,360][25689] Avg episode reward: [(0, '0.648')] [2022-07-10 23:37:05,406][26022] Updated weights on worker 0-0, policy_version 940450 (0.00087) [2022-07-10 23:37:07,212][26022] Updated weights on worker 0-0, policy_version 940460 (0.00087) [2022-07-10 23:37:09,045][26022] Updated weights on worker 0-0, policy_version 940470 (0.00078) [2022-07-10 23:37:09,387][25689] Fps is (10 sec: 5573.8, 60 sec: 5635.9, 300 sec: 5620.0). Total num frames: 963043328. Throughput: 0: 5751.0. Samples: 963040786. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:09,388][25689] Avg episode reward: [(0, '0.184')] [2022-07-10 23:37:10,979][26022] Updated weights on worker 0-0, policy_version 940480 (0.00086) [2022-07-10 23:37:12,640][26022] Updated weights on worker 0-0, policy_version 940490 (0.00093) [2022-07-10 23:37:14,411][25689] Fps is (10 sec: 5603.4, 60 sec: 5617.5, 300 sec: 5622.1). Total num frames: 963070976. Throughput: 0: 5787.3. Samples: 963074860. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:14,412][25689] Avg episode reward: [(0, '0.059')] [2022-07-10 23:37:14,647][26022] Updated weights on worker 0-0, policy_version 940500 (0.00099) [2022-07-10 23:37:16,064][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:37:16,077][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000940509_963081216.pth [2022-07-10 23:37:16,077][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000938534_961058816.pth [2022-07-10 23:37:16,372][26022] Updated weights on worker 0-0, policy_version 940511 (0.00604) [2022-07-10 23:37:18,455][26022] Updated weights on worker 0-0, policy_version 940521 (0.00085) [2022-07-10 23:37:19,538][25689] Fps is (10 sec: 5548.6, 60 sec: 5628.2, 300 sec: 5616.7). Total num frames: 963099648. Throughput: 0: 5765.8. Samples: 963108810. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:19,538][25689] Avg episode reward: [(0, '0.127')] [2022-07-10 23:37:19,912][26022] Updated weights on worker 0-0, policy_version 940531 (0.00089) [2022-07-10 23:37:22,335][26022] Updated weights on worker 0-0, policy_version 940541 (0.00070) [2022-07-10 23:37:23,627][26022] Updated weights on worker 0-0, policy_version 940551 (0.00085) [2022-07-10 23:37:24,544][25689] Fps is (10 sec: 5659.1, 60 sec: 5632.8, 300 sec: 5617.6). Total num frames: 963128320. Throughput: 0: 5037.8. Samples: 963125542. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:24,544][25689] Avg episode reward: [(0, '0.283')] [2022-07-10 23:37:25,845][26022] Updated weights on worker 0-0, policy_version 940561 (0.00096) [2022-07-10 23:37:27,241][26022] Updated weights on worker 0-0, policy_version 940571 (0.00084) [2022-07-10 23:37:29,514][26022] Updated weights on worker 0-0, policy_version 940581 (0.00084) [2022-07-10 23:37:29,561][25689] Fps is (10 sec: 5618.8, 60 sec: 5599.2, 300 sec: 5617.6). Total num frames: 963155968. Throughput: 0: 5889.0. Samples: 963159590. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:29,562][25689] Avg episode reward: [(0, '0.767')] [2022-07-10 23:37:30,958][26022] Updated weights on worker 0-0, policy_version 940591 (0.00082) [2022-07-10 23:37:32,995][26022] Updated weights on worker 0-0, policy_version 940601 (0.00088) [2022-07-10 23:37:34,570][25689] Fps is (10 sec: 5515.4, 60 sec: 5604.1, 300 sec: 5608.3). Total num frames: 963183616. Throughput: 0: 5878.4. Samples: 963193362. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:34,570][25689] Avg episode reward: [(0, '0.750')] [2022-07-10 23:37:34,739][26022] Updated weights on worker 0-0, policy_version 940611 (0.00090) [2022-07-10 23:37:36,459][26022] Updated weights on worker 0-0, policy_version 940621 (0.00084) [2022-07-10 23:37:38,357][26022] Updated weights on worker 0-0, policy_version 940631 (0.00092) [2022-07-10 23:37:39,707][25689] Fps is (10 sec: 5652.1, 60 sec: 5601.9, 300 sec: 5613.7). Total num frames: 963213312. Throughput: 0: 5039.4. Samples: 963210450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:39,707][25689] Avg episode reward: [(0, '0.174')] [2022-07-10 23:37:40,156][26022] Updated weights on worker 0-0, policy_version 940641 (0.00085) [2022-07-10 23:37:41,911][26022] Updated weights on worker 0-0, policy_version 940651 (0.00084) [2022-07-10 23:37:43,586][26022] Updated weights on worker 0-0, policy_version 940661 (0.00083) [2022-07-10 23:37:44,716][25689] Fps is (10 sec: 5752.4, 60 sec: 5607.3, 300 sec: 5614.1). Total num frames: 963241984. Throughput: 0: 5891.5. Samples: 963244388. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:44,717][25689] Avg episode reward: [(0, '-0.696')] [2022-07-10 23:37:45,681][26022] Updated weights on worker 0-0, policy_version 940671 (0.00094) [2022-07-10 23:37:47,262][26022] Updated weights on worker 0-0, policy_version 940681 (0.00090) [2022-07-10 23:37:49,210][26022] Updated weights on worker 0-0, policy_version 940691 (0.00095) [2022-07-10 23:37:49,804][25689] Fps is (10 sec: 5780.9, 60 sec: 5639.1, 300 sec: 5623.0). Total num frames: 963271680. Throughput: 0: 5865.0. Samples: 963278312. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:49,804][25689] Avg episode reward: [(0, '-2.089')] [2022-07-10 23:37:51,083][26022] Updated weights on worker 0-0, policy_version 940701 (0.00087) [2022-07-10 23:37:52,770][26022] Updated weights on worker 0-0, policy_version 940711 (0.00097) [2022-07-10 23:37:54,604][26022] Updated weights on worker 0-0, policy_version 940721 (0.00084) [2022-07-10 23:37:54,808][25689] Fps is (10 sec: 5682.4, 60 sec: 5623.9, 300 sec: 5613.8). Total num frames: 963299328. Throughput: 0: 5031.5. Samples: 963295190. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:54,808][25689] Avg episode reward: [(0, '-2.085')] [2022-07-10 23:37:56,455][26022] Updated weights on worker 0-0, policy_version 940731 (0.00094) [2022-07-10 23:37:58,219][26022] Updated weights on worker 0-0, policy_version 940741 (0.00089) [2022-07-10 23:37:59,864][25689] Fps is (10 sec: 5496.1, 60 sec: 5611.7, 300 sec: 5617.8). Total num frames: 963326976. Throughput: 0: 5908.6. Samples: 963329554. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:37:59,865][25689] Avg episode reward: [(0, '-2.242')] [2022-07-10 23:38:00,063][26022] Updated weights on worker 0-0, policy_version 940751 (0.00087) [2022-07-10 23:38:01,698][26022] Updated weights on worker 0-0, policy_version 940761 (0.00084) [2022-07-10 23:38:03,960][26022] Updated weights on worker 0-0, policy_version 940771 (0.00091) [2022-07-10 23:38:04,898][25689] Fps is (10 sec: 5581.6, 60 sec: 5632.6, 300 sec: 5624.2). Total num frames: 963355648. Throughput: 0: 5817.9. Samples: 963361804. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:04,899][25689] Avg episode reward: [(0, '-1.558')] [2022-07-10 23:38:05,962][26022] Updated weights on worker 0-0, policy_version 940781 (0.00090) [2022-07-10 23:38:07,386][26022] Updated weights on worker 0-0, policy_version 940791 (0.00087) [2022-07-10 23:38:09,526][26022] Updated weights on worker 0-0, policy_version 940801 (0.00086) [2022-07-10 23:38:09,935][25689] Fps is (10 sec: 5694.0, 60 sec: 5631.7, 300 sec: 5623.7). Total num frames: 963384320. Throughput: 0: 5007.9. Samples: 963379130. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:09,936][25689] Avg episode reward: [(0, '-1.021')] [2022-07-10 23:38:10,912][26022] Updated weights on worker 0-0, policy_version 940811 (0.00085) [2022-07-10 23:38:12,944][26022] Updated weights on worker 0-0, policy_version 940821 (0.00094) [2022-07-10 23:38:14,487][26022] Updated weights on worker 0-0, policy_version 940831 (0.00419) [2022-07-10 23:38:15,029][25689] Fps is (10 sec: 5559.2, 60 sec: 5625.2, 300 sec: 5619.4). Total num frames: 963411968. Throughput: 0: 5848.7. Samples: 963413458. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:15,030][25689] Avg episode reward: [(0, '0.017')] [2022-07-10 23:38:16,359][26022] Updated weights on worker 0-0, policy_version 940841 (0.00086) [2022-07-10 23:38:18,312][26022] Updated weights on worker 0-0, policy_version 940851 (0.00093) [2022-07-10 23:38:20,035][26022] Updated weights on worker 0-0, policy_version 940861 (0.00083) [2022-07-10 23:38:20,100][25689] Fps is (10 sec: 5742.2, 60 sec: 5664.2, 300 sec: 5625.1). Total num frames: 963442688. Throughput: 0: 5855.3. Samples: 963448040. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:20,101][25689] Avg episode reward: [(0, '0.129')] [2022-07-10 23:38:21,795][26022] Updated weights on worker 0-0, policy_version 940871 (0.00082) [2022-07-10 23:38:23,724][26022] Updated weights on worker 0-0, policy_version 940881 (0.00087) [2022-07-10 23:38:25,113][25689] Fps is (10 sec: 5686.8, 60 sec: 5629.8, 300 sec: 5621.6). Total num frames: 963469312. Throughput: 0: 5100.2. Samples: 963464902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:25,114][25689] Avg episode reward: [(0, '-0.933')] [2022-07-10 23:38:25,453][26022] Updated weights on worker 0-0, policy_version 940891 (0.00086) [2022-07-10 23:38:27,310][26022] Updated weights on worker 0-0, policy_version 940901 (0.00087) [2022-07-10 23:38:29,076][26022] Updated weights on worker 0-0, policy_version 940911 (0.00087) [2022-07-10 23:38:30,183][25689] Fps is (10 sec: 5484.3, 60 sec: 5641.8, 300 sec: 5627.3). Total num frames: 963497984. Throughput: 0: 5907.3. Samples: 963498738. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:30,183][25689] Avg episode reward: [(0, '-0.545')] [2022-07-10 23:38:30,955][26022] Updated weights on worker 0-0, policy_version 940921 (0.00084) [2022-07-10 23:38:32,863][26022] Updated weights on worker 0-0, policy_version 940931 (0.00093) [2022-07-10 23:38:34,444][26022] Updated weights on worker 0-0, policy_version 940941 (0.00085) [2022-07-10 23:38:35,199][25689] Fps is (10 sec: 5583.6, 60 sec: 5641.0, 300 sec: 5618.9). Total num frames: 963525632. Throughput: 0: 5909.7. Samples: 963532658. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:35,200][25689] Avg episode reward: [(0, '-0.562')] [2022-07-10 23:38:36,298][26022] Updated weights on worker 0-0, policy_version 940951 (0.00096) [2022-07-10 23:38:38,236][26022] Updated weights on worker 0-0, policy_version 940961 (0.00090) [2022-07-10 23:38:39,865][26022] Updated weights on worker 0-0, policy_version 940971 (0.00086) [2022-07-10 23:38:40,264][25689] Fps is (10 sec: 5891.1, 60 sec: 5681.6, 300 sec: 5635.2). Total num frames: 963557376. Throughput: 0: 5042.0. Samples: 963549708. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:40,265][25689] Avg episode reward: [(0, '-0.388')] [2022-07-10 23:38:41,949][26022] Updated weights on worker 0-0, policy_version 940981 (0.00082) [2022-07-10 23:38:43,455][26022] Updated weights on worker 0-0, policy_version 940991 (0.00087) [2022-07-10 23:38:45,330][25689] Fps is (10 sec: 5761.7, 60 sec: 5642.6, 300 sec: 5630.5). Total num frames: 963584000. Throughput: 0: 5877.3. Samples: 963583724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:45,330][25689] Avg episode reward: [(0, '-0.305')] [2022-07-10 23:38:45,413][26022] Updated weights on worker 0-0, policy_version 941001 (0.00087) [2022-07-10 23:38:47,339][26022] Updated weights on worker 0-0, policy_version 941011 (0.00096) [2022-07-10 23:38:48,892][26022] Updated weights on worker 0-0, policy_version 941021 (0.00090) [2022-07-10 23:38:50,356][25689] Fps is (10 sec: 5377.7, 60 sec: 5614.4, 300 sec: 5624.1). Total num frames: 963611648. Throughput: 0: 5907.7. Samples: 963617918. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:50,357][25689] Avg episode reward: [(0, '-0.138')] [2022-07-10 23:38:50,910][26022] Updated weights on worker 0-0, policy_version 941031 (0.00090) [2022-07-10 23:38:52,726][26022] Updated weights on worker 0-0, policy_version 941041 (0.00093) [2022-07-10 23:38:54,323][26022] Updated weights on worker 0-0, policy_version 941051 (0.00084) [2022-07-10 23:38:55,367][25689] Fps is (10 sec: 5611.3, 60 sec: 5630.7, 300 sec: 5626.0). Total num frames: 963640320. Throughput: 0: 5067.3. Samples: 963634852. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:38:55,367][25689] Avg episode reward: [(0, '1.084')] [2022-07-10 23:38:56,451][26022] Updated weights on worker 0-0, policy_version 941061 (0.00084) [2022-07-10 23:38:57,918][26022] Updated weights on worker 0-0, policy_version 941071 (0.00085) [2022-07-10 23:39:00,004][26022] Updated weights on worker 0-0, policy_version 941081 (0.00088) [2022-07-10 23:39:00,491][25689] Fps is (10 sec: 5759.5, 60 sec: 5658.3, 300 sec: 5635.6). Total num frames: 963670016. Throughput: 0: 5864.6. Samples: 963668326. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:39:00,491][25689] Avg episode reward: [(0, '0.921')] [2022-07-10 23:39:02,053][26022] Updated weights on worker 0-0, policy_version 941091 (0.00081) [2022-07-10 23:39:03,651][26022] Updated weights on worker 0-0, policy_version 941101 (0.00093) [2022-07-10 23:39:05,538][25689] Fps is (10 sec: 5335.7, 60 sec: 5589.4, 300 sec: 5624.8). Total num frames: 963694592. Throughput: 0: 5755.5. Samples: 963700034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:39:05,539][25689] Avg episode reward: [(0, '0.187')] [2022-07-10 23:39:06,020][26022] Updated weights on worker 0-0, policy_version 941111 (0.00090) [2022-07-10 23:39:07,413][26022] Updated weights on worker 0-0, policy_version 941121 (0.00084) [2022-07-10 23:39:09,528][26022] Updated weights on worker 0-0, policy_version 941131 (0.00088) [2022-07-10 23:39:10,549][25689] Fps is (10 sec: 5395.9, 60 sec: 5608.8, 300 sec: 5628.5). Total num frames: 963724288. Throughput: 0: 4899.5. Samples: 963716850. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:39:10,549][25689] Avg episode reward: [(0, '0.286')] [2022-07-10 23:39:11,125][26022] Updated weights on worker 0-0, policy_version 941141 (0.00083) [2022-07-10 23:39:13,028][26022] Updated weights on worker 0-0, policy_version 941151 (0.00091) [2022-07-10 23:39:14,701][26022] Updated weights on worker 0-0, policy_version 941161 (0.00090) [2022-07-10 23:39:15,559][25689] Fps is (10 sec: 5824.9, 60 sec: 5633.5, 300 sec: 5626.0). Total num frames: 963752960. Throughput: 0: 5743.7. Samples: 963750828. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-10 23:39:15,559][25689] Avg episode reward: [(0, '0.356')] [2022-07-10 23:39:16,164][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:39:16,176][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000941168_963756032.pth [2022-07-10 23:39:16,176][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000939190_961730560.pth [2022-07-10 23:39:16,939][26022] Updated weights on worker 0-0, policy_version 941171 (0.00083) [2022-07-10 23:39:18,422][26022] Updated weights on worker 0-0, policy_version 941181 (0.00618) [2022-07-10 23:39:20,327][26022] Updated weights on worker 0-0, policy_version 941191 (0.00092) [2022-07-10 23:39:20,621][25689] Fps is (10 sec: 5693.0, 60 sec: 5600.4, 300 sec: 5628.4). Total num frames: 963781632. Throughput: 0: 5769.6. Samples: 963784472. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:20,622][25689] Avg episode reward: [(0, '0.543')] [2022-07-10 23:39:22,174][26022] Updated weights on worker 0-0, policy_version 941201 (0.00086) [2022-07-10 23:39:23,931][26022] Updated weights on worker 0-0, policy_version 941211 (0.00092) [2022-07-10 23:39:25,624][25689] Fps is (10 sec: 5493.6, 60 sec: 5601.3, 300 sec: 5618.4). Total num frames: 963808256. Throughput: 0: 5896.4. Samples: 963818468. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:25,626][25689] Avg episode reward: [(0, '0.532')] [2022-07-10 23:39:25,991][26022] Updated weights on worker 0-0, policy_version 941221 (0.00623) [2022-07-10 23:39:27,544][26022] Updated weights on worker 0-0, policy_version 941231 (0.00087) [2022-07-10 23:39:29,413][26022] Updated weights on worker 0-0, policy_version 941241 (0.00091) [2022-07-10 23:39:30,635][25689] Fps is (10 sec: 5624.4, 60 sec: 5623.8, 300 sec: 5625.5). Total num frames: 963837952. Throughput: 0: 5909.9. Samples: 963835558. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:30,636][25689] Avg episode reward: [(0, '0.521')] [2022-07-10 23:39:31,188][26022] Updated weights on worker 0-0, policy_version 941251 (0.00087) [2022-07-10 23:39:32,906][26022] Updated weights on worker 0-0, policy_version 941261 (0.00084) [2022-07-10 23:39:34,835][26022] Updated weights on worker 0-0, policy_version 941271 (0.00054) [2022-07-10 23:39:35,665][25689] Fps is (10 sec: 5813.0, 60 sec: 5639.4, 300 sec: 5627.5). Total num frames: 963866624. Throughput: 0: 5919.7. Samples: 963869852. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:35,667][25689] Avg episode reward: [(0, '1.421')] [2022-07-10 23:39:36,639][26022] Updated weights on worker 0-0, policy_version 941281 (0.00084) [2022-07-10 23:39:38,495][26022] Updated weights on worker 0-0, policy_version 941291 (0.00094) [2022-07-10 23:39:40,331][26022] Updated weights on worker 0-0, policy_version 941301 (0.00091) [2022-07-10 23:39:40,733][25689] Fps is (10 sec: 5577.3, 60 sec: 5571.4, 300 sec: 5623.5). Total num frames: 963894272. Throughput: 0: 5939.7. Samples: 963903928. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:40,733][25689] Avg episode reward: [(0, '1.168')] [2022-07-10 23:39:41,904][26022] Updated weights on worker 0-0, policy_version 941311 (0.00086) [2022-07-10 23:39:43,843][26022] Updated weights on worker 0-0, policy_version 941321 (0.00086) [2022-07-10 23:39:45,703][26022] Updated weights on worker 0-0, policy_version 941331 (0.00385) [2022-07-10 23:39:45,744][25689] Fps is (10 sec: 5587.8, 60 sec: 5610.3, 300 sec: 5623.5). Total num frames: 963922944. Throughput: 0: 5100.9. Samples: 963921098. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:45,745][25689] Avg episode reward: [(0, '1.293')] [2022-07-10 23:39:47,435][26022] Updated weights on worker 0-0, policy_version 941341 (0.00227) [2022-07-10 23:39:49,271][26022] Updated weights on worker 0-0, policy_version 941351 (0.00086) [2022-07-10 23:39:50,748][25689] Fps is (10 sec: 5828.0, 60 sec: 5646.4, 300 sec: 5631.6). Total num frames: 963952640. Throughput: 0: 5937.4. Samples: 963954978. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:50,748][25689] Avg episode reward: [(0, '1.354')] [2022-07-10 23:39:50,977][26022] Updated weights on worker 0-0, policy_version 941361 (0.00086) [2022-07-10 23:39:52,882][26022] Updated weights on worker 0-0, policy_version 941371 (0.00080) [2022-07-10 23:39:54,790][26022] Updated weights on worker 0-0, policy_version 941381 (0.00080) [2022-07-10 23:39:55,788][25689] Fps is (10 sec: 5505.3, 60 sec: 5592.7, 300 sec: 5618.3). Total num frames: 963978240. Throughput: 0: 5908.5. Samples: 963988750. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:39:55,789][25689] Avg episode reward: [(0, '1.440')] [2022-07-10 23:39:56,489][26022] Updated weights on worker 0-0, policy_version 941391 (0.00083) [2022-07-10 23:39:58,270][26022] Updated weights on worker 0-0, policy_version 941401 (0.00086) [2022-07-10 23:40:00,246][26022] Updated weights on worker 0-0, policy_version 941411 (0.00081) [2022-07-10 23:40:00,908][25689] Fps is (10 sec: 5543.3, 60 sec: 5610.1, 300 sec: 5636.9). Total num frames: 964008960. Throughput: 0: 5038.7. Samples: 964005584. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:00,909][25689] Avg episode reward: [(0, '1.387')] [2022-07-10 23:40:02,356][26022] Updated weights on worker 0-0, policy_version 941421 (0.00106) [2022-07-10 23:40:04,348][26022] Updated weights on worker 0-0, policy_version 941431 (0.00091) [2022-07-10 23:40:05,931][25689] Fps is (10 sec: 5552.6, 60 sec: 5629.3, 300 sec: 5623.0). Total num frames: 964034560. Throughput: 0: 5742.6. Samples: 964037024. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:05,932][25689] Avg episode reward: [(0, '1.242')] [2022-07-10 23:40:06,088][26022] Updated weights on worker 0-0, policy_version 941441 (0.00086) [2022-07-10 23:40:07,863][26022] Updated weights on worker 0-0, policy_version 941451 (0.00097) [2022-07-10 23:40:09,822][26022] Updated weights on worker 0-0, policy_version 941461 (0.00090) [2022-07-10 23:40:10,961][25689] Fps is (10 sec: 5195.0, 60 sec: 5576.6, 300 sec: 5619.5). Total num frames: 964061184. Throughput: 0: 5722.2. Samples: 964070640. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:10,961][25689] Avg episode reward: [(0, '1.437')] [2022-07-10 23:40:11,517][26022] Updated weights on worker 0-0, policy_version 941471 (0.00082) [2022-07-10 23:40:13,454][26022] Updated weights on worker 0-0, policy_version 941481 (0.00050) [2022-07-10 23:40:15,303][26022] Updated weights on worker 0-0, policy_version 941491 (0.00091) [2022-07-10 23:40:15,968][25689] Fps is (10 sec: 5509.5, 60 sec: 5576.9, 300 sec: 5621.1). Total num frames: 964089856. Throughput: 0: 4895.4. Samples: 964087536. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:15,969][25689] Avg episode reward: [(0, '1.017')] [2022-07-10 23:40:17,075][26022] Updated weights on worker 0-0, policy_version 941501 (0.00088) [2022-07-10 23:40:19,078][26022] Updated weights on worker 0-0, policy_version 941511 (0.00084) [2022-07-10 23:40:20,748][26022] Updated weights on worker 0-0, policy_version 941521 (0.00087) [2022-07-10 23:40:21,101][25689] Fps is (10 sec: 5755.8, 60 sec: 5587.3, 300 sec: 5619.2). Total num frames: 964119552. Throughput: 0: 5732.9. Samples: 964121352. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:21,108][25689] Avg episode reward: [(0, '0.884')] [2022-07-10 23:40:22,660][26022] Updated weights on worker 0-0, policy_version 941531 (0.00085) [2022-07-10 23:40:24,459][26022] Updated weights on worker 0-0, policy_version 941541 (0.00085) [2022-07-10 23:40:26,121][25689] Fps is (10 sec: 5547.1, 60 sec: 5585.8, 300 sec: 5613.3). Total num frames: 964146176. Throughput: 0: 5816.1. Samples: 964154448. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:26,122][25689] Avg episode reward: [(0, '0.517')] [2022-07-10 23:40:26,426][26022] Updated weights on worker 0-0, policy_version 941551 (0.00095) [2022-07-10 23:40:28,184][26022] Updated weights on worker 0-0, policy_version 941561 (0.00092) [2022-07-10 23:40:30,211][26022] Updated weights on worker 0-0, policy_version 941571 (0.00090) [2022-07-10 23:40:31,159][25689] Fps is (10 sec: 5497.8, 60 sec: 5566.3, 300 sec: 5616.4). Total num frames: 964174848. Throughput: 0: 4965.8. Samples: 964170942. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:31,160][25689] Avg episode reward: [(0, '-0.077')] [2022-07-10 23:40:31,784][26022] Updated weights on worker 0-0, policy_version 941581 (0.00084) [2022-07-10 23:40:33,966][26022] Updated weights on worker 0-0, policy_version 941591 (0.00088) [2022-07-10 23:40:35,407][26022] Updated weights on worker 0-0, policy_version 941601 (0.00093) [2022-07-10 23:40:36,187][25689] Fps is (10 sec: 5595.1, 60 sec: 5549.6, 300 sec: 5610.1). Total num frames: 964202496. Throughput: 0: 5775.5. Samples: 964204310. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:36,187][25689] Avg episode reward: [(0, '-0.473')] [2022-07-10 23:40:37,515][26022] Updated weights on worker 0-0, policy_version 941611 (0.00094) [2022-07-10 23:40:39,274][26022] Updated weights on worker 0-0, policy_version 941621 (0.00088) [2022-07-10 23:40:41,239][25689] Fps is (10 sec: 5485.7, 60 sec: 5551.0, 300 sec: 5602.5). Total num frames: 964230144. Throughput: 0: 5772.0. Samples: 964237588. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:41,241][25689] Avg episode reward: [(0, '0.212')] [2022-07-10 23:40:41,244][26022] Updated weights on worker 0-0, policy_version 941631 (0.00083) [2022-07-10 23:40:43,018][26022] Updated weights on worker 0-0, policy_version 941641 (0.00087) [2022-07-10 23:40:44,883][26022] Updated weights on worker 0-0, policy_version 941651 (0.00531) [2022-07-10 23:40:46,266][25689] Fps is (10 sec: 5587.8, 60 sec: 5549.6, 300 sec: 5609.0). Total num frames: 964258816. Throughput: 0: 4962.4. Samples: 964254416. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:46,266][25689] Avg episode reward: [(0, '0.506')] [2022-07-10 23:40:46,661][26022] Updated weights on worker 0-0, policy_version 941661 (0.00094) [2022-07-10 23:40:48,509][26022] Updated weights on worker 0-0, policy_version 941671 (0.00089) [2022-07-10 23:40:50,242][26022] Updated weights on worker 0-0, policy_version 941681 (0.00087) [2022-07-10 23:40:51,335][25689] Fps is (10 sec: 5477.3, 60 sec: 5492.9, 300 sec: 5601.4). Total num frames: 964285440. Throughput: 0: 5803.9. Samples: 964288038. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:51,338][25689] Avg episode reward: [(0, '-0.045')] [2022-07-10 23:40:52,214][26022] Updated weights on worker 0-0, policy_version 941691 (0.00085) [2022-07-10 23:40:54,103][26022] Updated weights on worker 0-0, policy_version 941701 (0.00083) [2022-07-10 23:40:55,828][26022] Updated weights on worker 0-0, policy_version 941711 (0.00084) [2022-07-10 23:40:56,339][25689] Fps is (10 sec: 5794.3, 60 sec: 5597.7, 300 sec: 5610.0). Total num frames: 964317184. Throughput: 0: 5825.4. Samples: 964321706. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:40:56,340][25689] Avg episode reward: [(0, '0.278')] [2022-07-10 23:40:57,755][26022] Updated weights on worker 0-0, policy_version 941721 (0.00089) [2022-07-10 23:40:59,457][26022] Updated weights on worker 0-0, policy_version 941731 (0.00082) [2022-07-10 23:41:01,382][25689] Fps is (10 sec: 5503.4, 60 sec: 5486.2, 300 sec: 5602.9). Total num frames: 964340736. Throughput: 0: 4994.6. Samples: 964338194. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:01,383][25689] Avg episode reward: [(0, '0.546')] [2022-07-10 23:41:01,812][26022] Updated weights on worker 0-0, policy_version 941741 (0.00099) [2022-07-10 23:41:03,360][26022] Updated weights on worker 0-0, policy_version 941751 (0.00088) [2022-07-10 23:41:05,566][26022] Updated weights on worker 0-0, policy_version 941761 (0.00088) [2022-07-10 23:41:06,384][25689] Fps is (10 sec: 5097.2, 60 sec: 5522.1, 300 sec: 5600.2). Total num frames: 964368384. Throughput: 0: 5727.4. Samples: 964369640. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:06,384][25689] Avg episode reward: [(0, '-0.078')] [2022-07-10 23:41:07,094][26022] Updated weights on worker 0-0, policy_version 941771 (0.00087) [2022-07-10 23:41:09,165][26022] Updated weights on worker 0-0, policy_version 941781 (0.00088) [2022-07-10 23:41:10,948][26022] Updated weights on worker 0-0, policy_version 941791 (0.00090) [2022-07-10 23:41:11,401][25689] Fps is (10 sec: 5621.4, 60 sec: 5557.1, 300 sec: 5600.0). Total num frames: 964397056. Throughput: 0: 5736.6. Samples: 964403150. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:11,402][25689] Avg episode reward: [(0, '-0.618')] [2022-07-10 23:41:12,763][26022] Updated weights on worker 0-0, policy_version 941801 (0.00082) [2022-07-10 23:41:14,633][26022] Updated weights on worker 0-0, policy_version 941811 (0.00085) [2022-07-10 23:41:16,219][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:41:16,237][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000941821_964424704.pth [2022-07-10 23:41:16,238][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000939849_962405376.pth [2022-07-10 23:41:16,238][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000941821_964424704.pth.milestone [2022-07-10 23:41:16,240][26022] Updated weights on worker 0-0, policy_version 941821 (0.00085) [2022-07-10 23:41:16,407][25689] Fps is (10 sec: 5619.2, 60 sec: 5540.3, 300 sec: 5601.0). Total num frames: 964424704. Throughput: 0: 4892.4. Samples: 964419884. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:16,407][25689] Avg episode reward: [(0, '-0.618')] [2022-07-10 23:41:18,430][26022] Updated weights on worker 0-0, policy_version 941831 (0.00093) [2022-07-10 23:41:20,041][26022] Updated weights on worker 0-0, policy_version 941841 (0.00086) [2022-07-10 23:41:21,459][25689] Fps is (10 sec: 5395.7, 60 sec: 5496.8, 300 sec: 5594.2). Total num frames: 964451328. Throughput: 0: 5728.6. Samples: 964453208. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:21,460][25689] Avg episode reward: [(0, '-0.118')] [2022-07-10 23:41:21,957][26022] Updated weights on worker 0-0, policy_version 941851 (0.00087) [2022-07-10 23:41:23,927][26022] Updated weights on worker 0-0, policy_version 941861 (0.00099) [2022-07-10 23:41:25,592][26022] Updated weights on worker 0-0, policy_version 941871 (0.00087) [2022-07-10 23:41:26,464][25689] Fps is (10 sec: 5599.7, 60 sec: 5549.0, 300 sec: 5594.4). Total num frames: 964481024. Throughput: 0: 5833.8. Samples: 964486784. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:26,465][25689] Avg episode reward: [(0, '-0.065')] [2022-07-10 23:41:27,703][26022] Updated weights on worker 0-0, policy_version 941881 (0.00092) [2022-07-10 23:41:29,295][26022] Updated weights on worker 0-0, policy_version 941891 (0.00085) [2022-07-10 23:41:31,258][26022] Updated weights on worker 0-0, policy_version 941901 (0.00092) [2022-07-10 23:41:31,483][25689] Fps is (10 sec: 5618.9, 60 sec: 5516.9, 300 sec: 5591.8). Total num frames: 964507648. Throughput: 0: 4990.0. Samples: 964503358. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:31,483][25689] Avg episode reward: [(0, '0.722')] [2022-07-10 23:41:32,968][26022] Updated weights on worker 0-0, policy_version 941911 (0.00090) [2022-07-10 23:41:34,879][26022] Updated weights on worker 0-0, policy_version 941921 (0.00085) [2022-07-10 23:41:36,510][25689] Fps is (10 sec: 5504.6, 60 sec: 5534.0, 300 sec: 5590.0). Total num frames: 964536320. Throughput: 0: 5822.3. Samples: 964536930. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:36,510][25689] Avg episode reward: [(0, '0.749')] [2022-07-10 23:41:36,686][26022] Updated weights on worker 0-0, policy_version 941931 (0.00086) [2022-07-10 23:41:38,607][26022] Updated weights on worker 0-0, policy_version 941941 (0.00085) [2022-07-10 23:41:40,304][26022] Updated weights on worker 0-0, policy_version 941951 (0.00100) [2022-07-10 23:41:41,563][25689] Fps is (10 sec: 5587.4, 60 sec: 5533.9, 300 sec: 5586.8). Total num frames: 964563968. Throughput: 0: 5828.7. Samples: 964570384. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:41,563][25689] Avg episode reward: [(0, '0.734')] [2022-07-10 23:41:42,129][26022] Updated weights on worker 0-0, policy_version 941961 (0.00085) [2022-07-10 23:41:44,019][26022] Updated weights on worker 0-0, policy_version 941971 (0.00093) [2022-07-10 23:41:45,840][26022] Updated weights on worker 0-0, policy_version 941981 (0.00093) [2022-07-10 23:41:46,578][25689] Fps is (10 sec: 5593.8, 60 sec: 5534.9, 300 sec: 5591.2). Total num frames: 964592640. Throughput: 0: 4995.4. Samples: 964587260. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:46,579][25689] Avg episode reward: [(0, '0.473')] [2022-07-10 23:41:47,789][26022] Updated weights on worker 0-0, policy_version 941991 (0.00094) [2022-07-10 23:41:49,384][26022] Updated weights on worker 0-0, policy_version 942001 (0.00088) [2022-07-10 23:41:51,467][26022] Updated weights on worker 0-0, policy_version 942011 (0.00092) [2022-07-10 23:41:51,587][25689] Fps is (10 sec: 5516.4, 60 sec: 5540.5, 300 sec: 5584.6). Total num frames: 964619264. Throughput: 0: 5840.6. Samples: 964620776. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:51,588][25689] Avg episode reward: [(0, '0.267')] [2022-07-10 23:41:53,049][26022] Updated weights on worker 0-0, policy_version 942021 (0.00085) [2022-07-10 23:41:55,082][26022] Updated weights on worker 0-0, policy_version 942031 (0.00086) [2022-07-10 23:41:56,603][25689] Fps is (10 sec: 5515.6, 60 sec: 5488.4, 300 sec: 5586.3). Total num frames: 964647936. Throughput: 0: 5826.5. Samples: 964654006. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:41:56,604][25689] Avg episode reward: [(0, '0.736')] [2022-07-10 23:41:57,000][26022] Updated weights on worker 0-0, policy_version 942041 (0.00092) [2022-07-10 23:41:59,057][26022] Updated weights on worker 0-0, policy_version 942051 (0.00091) [2022-07-10 23:42:00,648][26022] Updated weights on worker 0-0, policy_version 942061 (0.00081) [2022-07-10 23:42:01,741][25689] Fps is (10 sec: 5445.6, 60 sec: 5530.7, 300 sec: 5581.7). Total num frames: 964674560. Throughput: 0: 4960.6. Samples: 964670480. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:01,742][25689] Avg episode reward: [(0, '0.678')] [2022-07-10 23:42:03,072][26022] Updated weights on worker 0-0, policy_version 942071 (0.00081) [2022-07-10 23:42:04,534][26022] Updated weights on worker 0-0, policy_version 942081 (0.00606) [2022-07-10 23:42:06,562][26022] Updated weights on worker 0-0, policy_version 942091 (0.00080) [2022-07-10 23:42:06,780][25689] Fps is (10 sec: 5333.3, 60 sec: 5527.3, 300 sec: 5578.1). Total num frames: 964702208. Throughput: 0: 5671.1. Samples: 964701824. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:06,782][25689] Avg episode reward: [(0, '0.900')] [2022-07-10 23:42:08,386][26022] Updated weights on worker 0-0, policy_version 942101 (0.01418) [2022-07-10 23:42:10,199][26022] Updated weights on worker 0-0, policy_version 942111 (0.00087) [2022-07-10 23:42:11,835][25689] Fps is (10 sec: 5579.7, 60 sec: 5523.8, 300 sec: 5580.9). Total num frames: 964730880. Throughput: 0: 5661.1. Samples: 964735402. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:11,835][25689] Avg episode reward: [(0, '1.094')] [2022-07-10 23:42:12,095][26022] Updated weights on worker 0-0, policy_version 942121 (0.00083) [2022-07-10 23:42:13,902][26022] Updated weights on worker 0-0, policy_version 942131 (0.00091) [2022-07-10 23:42:15,763][26022] Updated weights on worker 0-0, policy_version 942141 (0.00053) [2022-07-10 23:42:16,866][25689] Fps is (10 sec: 5583.9, 60 sec: 5521.5, 300 sec: 5579.3). Total num frames: 964758528. Throughput: 0: 4846.4. Samples: 964752204. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:16,866][25689] Avg episode reward: [(0, '1.530')] [2022-07-10 23:42:17,483][26022] Updated weights on worker 0-0, policy_version 942151 (0.00080) [2022-07-10 23:42:19,359][26022] Updated weights on worker 0-0, policy_version 942161 (0.00087) [2022-07-10 23:42:21,131][26022] Updated weights on worker 0-0, policy_version 942171 (0.00088) [2022-07-10 23:42:21,986][25689] Fps is (10 sec: 5547.9, 60 sec: 5549.1, 300 sec: 5577.1). Total num frames: 964787200. Throughput: 0: 5707.1. Samples: 964786022. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:21,987][25689] Avg episode reward: [(0, '1.279')] [2022-07-10 23:42:22,930][26022] Updated weights on worker 0-0, policy_version 942181 (0.00090) [2022-07-10 23:42:24,883][26022] Updated weights on worker 0-0, policy_version 942191 (0.00085) [2022-07-10 23:42:26,707][26022] Updated weights on worker 0-0, policy_version 942201 (0.00087) [2022-07-10 23:42:27,014][25689] Fps is (10 sec: 5650.3, 60 sec: 5530.1, 300 sec: 5580.4). Total num frames: 964815872. Throughput: 0: 5812.9. Samples: 964819448. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:27,015][25689] Avg episode reward: [(0, '0.967')] [2022-07-10 23:42:28,609][26022] Updated weights on worker 0-0, policy_version 942211 (0.00085) [2022-07-10 23:42:30,286][26022] Updated weights on worker 0-0, policy_version 942221 (0.00087) [2022-07-10 23:42:32,025][25689] Fps is (10 sec: 5610.3, 60 sec: 5547.8, 300 sec: 5580.3). Total num frames: 964843520. Throughput: 0: 5819.3. Samples: 964852894. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:32,025][25689] Avg episode reward: [(0, '0.755')] [2022-07-10 23:42:32,294][26022] Updated weights on worker 0-0, policy_version 942231 (0.00092) [2022-07-10 23:42:34,131][26022] Updated weights on worker 0-0, policy_version 942241 (0.00087) [2022-07-10 23:42:35,858][26022] Updated weights on worker 0-0, policy_version 942251 (0.00089) [2022-07-10 23:42:37,047][25689] Fps is (10 sec: 5511.8, 60 sec: 5531.3, 300 sec: 5575.6). Total num frames: 964871168. Throughput: 0: 5829.6. Samples: 964869852. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:37,048][25689] Avg episode reward: [(0, '0.153')] [2022-07-10 23:42:37,656][26022] Updated weights on worker 0-0, policy_version 942261 (0.00056) [2022-07-10 23:42:39,380][26022] Updated weights on worker 0-0, policy_version 942271 (0.00084) [2022-07-10 23:42:41,429][26022] Updated weights on worker 0-0, policy_version 942281 (0.00088) [2022-07-10 23:42:42,172][25689] Fps is (10 sec: 5449.6, 60 sec: 5524.7, 300 sec: 5570.0). Total num frames: 964898816. Throughput: 0: 5805.8. Samples: 964903214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:42,178][25689] Avg episode reward: [(0, '-0.031')] [2022-07-10 23:42:43,203][26022] Updated weights on worker 0-0, policy_version 942291 (0.00091) [2022-07-10 23:42:45,007][26022] Updated weights on worker 0-0, policy_version 942301 (0.00091) [2022-07-10 23:42:46,846][26022] Updated weights on worker 0-0, policy_version 942311 (0.00081) [2022-07-10 23:42:47,197][25689] Fps is (10 sec: 5649.7, 60 sec: 5540.7, 300 sec: 5571.2). Total num frames: 964928512. Throughput: 0: 5810.2. Samples: 964936710. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:47,197][25689] Avg episode reward: [(0, '-0.147')] [2022-07-10 23:42:48,532][26022] Updated weights on worker 0-0, policy_version 942321 (0.00090) [2022-07-10 23:42:50,585][26022] Updated weights on worker 0-0, policy_version 942331 (0.00086) [2022-07-10 23:42:52,247][25689] Fps is (10 sec: 5691.3, 60 sec: 5553.8, 300 sec: 5570.3). Total num frames: 964956160. Throughput: 0: 4975.9. Samples: 964953520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:52,248][25689] Avg episode reward: [(0, '0.436')] [2022-07-10 23:42:52,307][26022] Updated weights on worker 0-0, policy_version 942341 (0.00081) [2022-07-10 23:42:54,103][26022] Updated weights on worker 0-0, policy_version 942351 (0.00087) [2022-07-10 23:42:55,951][26022] Updated weights on worker 0-0, policy_version 942361 (0.00081) [2022-07-10 23:42:57,333][25689] Fps is (10 sec: 5556.3, 60 sec: 5547.5, 300 sec: 5573.2). Total num frames: 964984832. Throughput: 0: 5789.1. Samples: 964987292. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:42:57,333][25689] Avg episode reward: [(0, '0.702')] [2022-07-10 23:42:57,724][26022] Updated weights on worker 0-0, policy_version 942371 (0.00085) [2022-07-10 23:42:59,690][26022] Updated weights on worker 0-0, policy_version 942381 (0.00087) [2022-07-10 23:43:01,498][26022] Updated weights on worker 0-0, policy_version 942391 (0.00087) [2022-07-10 23:43:02,423][25689] Fps is (10 sec: 5434.3, 60 sec: 5551.9, 300 sec: 5565.3). Total num frames: 965011456. Throughput: 0: 5787.0. Samples: 965020406. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:43:02,423][25689] Avg episode reward: [(0, '1.088')] [2022-07-10 23:43:03,612][26022] Updated weights on worker 0-0, policy_version 942401 (0.00085) [2022-07-10 23:43:05,589][26022] Updated weights on worker 0-0, policy_version 942411 (0.00091) [2022-07-10 23:43:07,224][26022] Updated weights on worker 0-0, policy_version 942421 (0.00085) [2022-07-10 23:43:07,445][25689] Fps is (10 sec: 5367.3, 60 sec: 5553.4, 300 sec: 5562.1). Total num frames: 965039104. Throughput: 0: 4890.5. Samples: 965035732. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-10 23:43:07,445][25689] Avg episode reward: [(0, '1.296')] [2022-07-10 23:43:09,193][26022] Updated weights on worker 0-0, policy_version 942431 (0.00085) [2022-07-10 23:43:11,052][26022] Updated weights on worker 0-0, policy_version 942441 (0.00092) [2022-07-10 23:43:12,491][25689] Fps is (10 sec: 5492.2, 60 sec: 5537.3, 300 sec: 5563.0). Total num frames: 965066752. Throughput: 0: 5732.9. Samples: 965069574. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:12,491][25689] Avg episode reward: [(0, '1.356')] [2022-07-10 23:43:12,779][26022] Updated weights on worker 0-0, policy_version 942451 (0.00084) [2022-07-10 23:43:14,607][26022] Updated weights on worker 0-0, policy_version 942461 (0.00080) [2022-07-10 23:43:16,409][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:43:16,422][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000942470_965089280.pth [2022-07-10 23:43:16,422][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000940509_963081216.pth [2022-07-10 23:43:16,542][26022] Updated weights on worker 0-0, policy_version 942471 (0.00084) [2022-07-10 23:43:17,493][25689] Fps is (10 sec: 5502.9, 60 sec: 5539.9, 300 sec: 5554.0). Total num frames: 965094400. Throughput: 0: 5747.5. Samples: 965103164. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:17,494][25689] Avg episode reward: [(0, '0.611')] [2022-07-10 23:43:18,239][26022] Updated weights on worker 0-0, policy_version 942481 (0.00085) [2022-07-10 23:43:20,171][26022] Updated weights on worker 0-0, policy_version 942491 (0.00099) [2022-07-10 23:43:21,747][26022] Updated weights on worker 0-0, policy_version 942501 (0.00085) [2022-07-10 23:43:22,576][25689] Fps is (10 sec: 5685.8, 60 sec: 5560.3, 300 sec: 5563.0). Total num frames: 965124096. Throughput: 0: 4944.2. Samples: 965120050. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:22,577][25689] Avg episode reward: [(0, '0.621')] [2022-07-10 23:43:23,917][26022] Updated weights on worker 0-0, policy_version 942511 (0.00091) [2022-07-10 23:43:25,576][26022] Updated weights on worker 0-0, policy_version 942521 (0.00086) [2022-07-10 23:43:27,506][26022] Updated weights on worker 0-0, policy_version 942531 (0.00089) [2022-07-10 23:43:27,673][25689] Fps is (10 sec: 5733.5, 60 sec: 5553.9, 300 sec: 5562.5). Total num frames: 965152768. Throughput: 0: 5827.2. Samples: 965153608. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:27,674][25689] Avg episode reward: [(0, '0.396')] [2022-07-10 23:43:29,179][26022] Updated weights on worker 0-0, policy_version 942541 (0.00088) [2022-07-10 23:43:31,306][26022] Updated weights on worker 0-0, policy_version 942551 (0.00085) [2022-07-10 23:43:32,758][25689] Fps is (10 sec: 5631.8, 60 sec: 5564.0, 300 sec: 5564.6). Total num frames: 965181440. Throughput: 0: 5787.8. Samples: 965186878. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:32,759][25689] Avg episode reward: [(0, '0.275')] [2022-07-10 23:43:32,777][26022] Updated weights on worker 0-0, policy_version 942561 (0.00085) [2022-07-10 23:43:35,043][26022] Updated weights on worker 0-0, policy_version 942571 (0.00089) [2022-07-10 23:43:36,618][26022] Updated weights on worker 0-0, policy_version 942581 (0.00088) [2022-07-10 23:43:37,815][25689] Fps is (10 sec: 5250.5, 60 sec: 5510.3, 300 sec: 5540.7). Total num frames: 965206016. Throughput: 0: 4948.8. Samples: 965203732. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:37,815][25689] Avg episode reward: [(0, '0.409')] [2022-07-10 23:43:38,492][26022] Updated weights on worker 0-0, policy_version 942591 (0.00084) [2022-07-10 23:43:40,249][26022] Updated weights on worker 0-0, policy_version 942601 (0.00094) [2022-07-10 23:43:41,990][26022] Updated weights on worker 0-0, policy_version 942611 (0.00083) [2022-07-10 23:43:42,930][25689] Fps is (10 sec: 5637.9, 60 sec: 5595.4, 300 sec: 5560.4). Total num frames: 965238784. Throughput: 0: 5773.6. Samples: 965237562. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:42,930][25689] Avg episode reward: [(0, '0.948')] [2022-07-10 23:43:44,035][26022] Updated weights on worker 0-0, policy_version 942621 (0.00083) [2022-07-10 23:43:45,756][26022] Updated weights on worker 0-0, policy_version 942631 (0.00092) [2022-07-10 23:43:47,538][26022] Updated weights on worker 0-0, policy_version 942641 (0.00054) [2022-07-10 23:43:47,978][25689] Fps is (10 sec: 5844.2, 60 sec: 5542.8, 300 sec: 5556.6). Total num frames: 965265408. Throughput: 0: 5801.2. Samples: 965271396. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:47,978][25689] Avg episode reward: [(0, '0.672')] [2022-07-10 23:43:49,405][26022] Updated weights on worker 0-0, policy_version 942651 (0.00095) [2022-07-10 23:43:51,366][26022] Updated weights on worker 0-0, policy_version 942661 (0.00088) [2022-07-10 23:43:52,995][25689] Fps is (10 sec: 5493.9, 60 sec: 5562.7, 300 sec: 5556.5). Total num frames: 965294080. Throughput: 0: 5009.6. Samples: 965288254. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:52,995][25689] Avg episode reward: [(0, '0.587')] [2022-07-10 23:43:53,036][26022] Updated weights on worker 0-0, policy_version 942671 (0.00091) [2022-07-10 23:43:55,120][26022] Updated weights on worker 0-0, policy_version 942681 (0.00087) [2022-07-10 23:43:56,709][26022] Updated weights on worker 0-0, policy_version 942691 (0.00091) [2022-07-10 23:43:58,082][25689] Fps is (10 sec: 5573.6, 60 sec: 5545.7, 300 sec: 5550.3). Total num frames: 965321728. Throughput: 0: 5801.7. Samples: 965321316. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:43:58,083][25689] Avg episode reward: [(0, '0.685')] [2022-07-10 23:43:58,587][26022] Updated weights on worker 0-0, policy_version 942701 (0.00086) [2022-07-10 23:44:00,499][26022] Updated weights on worker 0-0, policy_version 942711 (0.00086) [2022-07-10 23:44:02,760][26022] Updated weights on worker 0-0, policy_version 942721 (0.00087) [2022-07-10 23:44:03,188][25689] Fps is (10 sec: 5223.8, 60 sec: 5527.4, 300 sec: 5552.6). Total num frames: 965347328. Throughput: 0: 5698.5. Samples: 965353004. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:03,189][25689] Avg episode reward: [(0, '1.285')] [2022-07-10 23:44:04,667][26022] Updated weights on worker 0-0, policy_version 942731 (0.00085) [2022-07-10 23:44:06,586][26022] Updated weights on worker 0-0, policy_version 942741 (0.00082) [2022-07-10 23:44:08,044][26022] Updated weights on worker 0-0, policy_version 942751 (0.00091) [2022-07-10 23:44:08,215][25689] Fps is (10 sec: 5457.2, 60 sec: 5560.6, 300 sec: 5552.3). Total num frames: 965377024. Throughput: 0: 4826.1. Samples: 965369064. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:08,216][25689] Avg episode reward: [(0, '1.046')] [2022-07-10 23:44:10,183][26022] Updated weights on worker 0-0, policy_version 942761 (0.00086) [2022-07-10 23:44:11,823][26022] Updated weights on worker 0-0, policy_version 942771 (0.00088) [2022-07-10 23:44:13,224][25689] Fps is (10 sec: 5509.9, 60 sec: 5530.3, 300 sec: 5542.0). Total num frames: 965402624. Throughput: 0: 5647.5. Samples: 965402498. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:13,224][25689] Avg episode reward: [(0, '1.346')] [2022-07-10 23:44:13,879][26022] Updated weights on worker 0-0, policy_version 942781 (0.00088) [2022-07-10 23:44:15,639][26022] Updated weights on worker 0-0, policy_version 942791 (0.00090) [2022-07-10 23:44:17,510][26022] Updated weights on worker 0-0, policy_version 942801 (0.00094) [2022-07-10 23:44:18,236][25689] Fps is (10 sec: 5518.1, 60 sec: 5563.2, 300 sec: 5546.4). Total num frames: 965432320. Throughput: 0: 5689.6. Samples: 965435980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:18,237][25689] Avg episode reward: [(0, '0.261')] [2022-07-10 23:44:19,263][26022] Updated weights on worker 0-0, policy_version 942811 (0.00098) [2022-07-10 23:44:21,115][26022] Updated weights on worker 0-0, policy_version 942821 (0.00085) [2022-07-10 23:44:23,074][26022] Updated weights on worker 0-0, policy_version 942831 (0.00079) [2022-07-10 23:44:23,328][25689] Fps is (10 sec: 5675.0, 60 sec: 5528.6, 300 sec: 5548.2). Total num frames: 965459968. Throughput: 0: 5786.2. Samples: 965469540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:23,329][25689] Avg episode reward: [(0, '-0.215')] [2022-07-10 23:44:24,661][26022] Updated weights on worker 0-0, policy_version 942841 (0.00083) [2022-07-10 23:44:26,852][26022] Updated weights on worker 0-0, policy_version 942851 (0.00085) [2022-07-10 23:44:28,361][25689] Fps is (10 sec: 5663.6, 60 sec: 5551.3, 300 sec: 5547.8). Total num frames: 965489664. Throughput: 0: 5822.2. Samples: 965486356. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:28,361][25689] Avg episode reward: [(0, '-0.519')] [2022-07-10 23:44:28,370][26022] Updated weights on worker 0-0, policy_version 942861 (0.00095) [2022-07-10 23:44:30,398][26022] Updated weights on worker 0-0, policy_version 942871 (0.00092) [2022-07-10 23:44:32,380][26022] Updated weights on worker 0-0, policy_version 942881 (0.00084) [2022-07-10 23:44:33,429][25689] Fps is (10 sec: 5576.0, 60 sec: 5519.1, 300 sec: 5540.2). Total num frames: 965516288. Throughput: 0: 5788.0. Samples: 965519446. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:33,429][25689] Avg episode reward: [(0, '-0.445')] [2022-07-10 23:44:33,964][26022] Updated weights on worker 0-0, policy_version 942891 (0.00092) [2022-07-10 23:44:35,985][26022] Updated weights on worker 0-0, policy_version 942901 (0.00091) [2022-07-10 23:44:37,599][26022] Updated weights on worker 0-0, policy_version 942911 (0.00084) [2022-07-10 23:44:38,524][25689] Fps is (10 sec: 5340.2, 60 sec: 5566.3, 300 sec: 5539.7). Total num frames: 965543936. Throughput: 0: 5748.4. Samples: 965552604. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:38,524][25689] Avg episode reward: [(0, '-0.280')] [2022-07-10 23:44:39,709][26022] Updated weights on worker 0-0, policy_version 942921 (0.00091) [2022-07-10 23:44:41,572][26022] Updated weights on worker 0-0, policy_version 942931 (0.00088) [2022-07-10 23:44:43,352][26022] Updated weights on worker 0-0, policy_version 942941 (0.00092) [2022-07-10 23:44:43,618][25689] Fps is (10 sec: 5628.1, 60 sec: 5517.5, 300 sec: 5541.6). Total num frames: 965573632. Throughput: 0: 4918.7. Samples: 965569338. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:43,618][25689] Avg episode reward: [(0, '-0.414')] [2022-07-10 23:44:45,096][26022] Updated weights on worker 0-0, policy_version 942951 (0.00088) [2022-07-10 23:44:47,088][26022] Updated weights on worker 0-0, policy_version 942961 (0.00091) [2022-07-10 23:44:48,632][25689] Fps is (10 sec: 5571.6, 60 sec: 5520.6, 300 sec: 5531.1). Total num frames: 965600256. Throughput: 0: 5742.6. Samples: 965602766. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:48,632][25689] Avg episode reward: [(0, '0.796')] [2022-07-10 23:44:48,930][26022] Updated weights on worker 0-0, policy_version 942971 (0.00092) [2022-07-10 23:44:50,778][26022] Updated weights on worker 0-0, policy_version 942981 (0.00086) [2022-07-10 23:44:52,420][26022] Updated weights on worker 0-0, policy_version 942991 (0.00055) [2022-07-10 23:44:53,636][25689] Fps is (10 sec: 5417.1, 60 sec: 5504.9, 300 sec: 5538.6). Total num frames: 965627904. Throughput: 0: 5778.7. Samples: 965636220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:53,637][25689] Avg episode reward: [(0, '0.301')] [2022-07-10 23:44:54,505][26022] Updated weights on worker 0-0, policy_version 943001 (0.00090) [2022-07-10 23:44:56,241][26022] Updated weights on worker 0-0, policy_version 943011 (0.00092) [2022-07-10 23:44:58,179][26022] Updated weights on worker 0-0, policy_version 943021 (0.00083) [2022-07-10 23:44:58,642][25689] Fps is (10 sec: 5524.2, 60 sec: 5512.3, 300 sec: 5530.4). Total num frames: 965655552. Throughput: 0: 4981.5. Samples: 965652822. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:44:58,642][25689] Avg episode reward: [(0, '0.704')] [2022-07-10 23:44:59,953][26022] Updated weights on worker 0-0, policy_version 943031 (0.00090) [2022-07-10 23:45:02,274][26022] Updated weights on worker 0-0, policy_version 943041 (0.00092) [2022-07-10 23:45:03,730][25689] Fps is (10 sec: 5478.5, 60 sec: 5547.8, 300 sec: 5536.1). Total num frames: 965683200. Throughput: 0: 5723.1. Samples: 965684440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:03,730][25689] Avg episode reward: [(0, '0.356')] [2022-07-10 23:45:03,775][26022] Updated weights on worker 0-0, policy_version 943051 (0.00104) [2022-07-10 23:45:05,888][26022] Updated weights on worker 0-0, policy_version 943061 (0.00091) [2022-07-10 23:45:07,623][26022] Updated weights on worker 0-0, policy_version 943071 (0.00081) [2022-07-10 23:45:08,763][25689] Fps is (10 sec: 5463.5, 60 sec: 5513.4, 300 sec: 5539.5). Total num frames: 965710848. Throughput: 0: 5729.2. Samples: 965718100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:08,763][25689] Avg episode reward: [(0, '0.418')] [2022-07-10 23:45:09,435][26022] Updated weights on worker 0-0, policy_version 943081 (0.00081) [2022-07-10 23:45:11,349][26022] Updated weights on worker 0-0, policy_version 943091 (0.00094) [2022-07-10 23:45:12,960][26022] Updated weights on worker 0-0, policy_version 943101 (0.00087) [2022-07-10 23:45:13,775][25689] Fps is (10 sec: 5606.7, 60 sec: 5563.8, 300 sec: 5539.4). Total num frames: 965739520. Throughput: 0: 4898.6. Samples: 965734870. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:13,775][25689] Avg episode reward: [(0, '0.362')] [2022-07-10 23:45:15,056][26022] Updated weights on worker 0-0, policy_version 943111 (0.00098) [2022-07-10 23:45:16,469][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:45:16,480][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000943120_965754880.pth [2022-07-10 23:45:16,480][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000941168_963756032.pth [2022-07-10 23:45:16,595][26022] Updated weights on worker 0-0, policy_version 943121 (0.00088) [2022-07-10 23:45:18,682][26022] Updated weights on worker 0-0, policy_version 943131 (0.00088) [2022-07-10 23:45:18,786][25689] Fps is (10 sec: 5619.1, 60 sec: 5530.1, 300 sec: 5534.8). Total num frames: 965767168. Throughput: 0: 5745.5. Samples: 965768560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:18,786][25689] Avg episode reward: [(0, '0.746')] [2022-07-10 23:45:20,216][26022] Updated weights on worker 0-0, policy_version 943141 (0.00080) [2022-07-10 23:45:22,146][26022] Updated weights on worker 0-0, policy_version 943151 (0.00089) [2022-07-10 23:45:23,893][25689] Fps is (10 sec: 5667.6, 60 sec: 5562.6, 300 sec: 5543.5). Total num frames: 965796864. Throughput: 0: 5860.6. Samples: 965802608. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:23,893][25689] Avg episode reward: [(0, '0.755')] [2022-07-10 23:45:23,895][26022] Updated weights on worker 0-0, policy_version 943161 (0.00087) [2022-07-10 23:45:25,851][26022] Updated weights on worker 0-0, policy_version 943171 (0.00098) [2022-07-10 23:45:27,503][26022] Updated weights on worker 0-0, policy_version 943181 (0.00091) [2022-07-10 23:45:28,944][25689] Fps is (10 sec: 5645.1, 60 sec: 5527.1, 300 sec: 5539.8). Total num frames: 965824512. Throughput: 0: 5016.9. Samples: 965819346. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:28,944][25689] Avg episode reward: [(0, '0.532')] [2022-07-10 23:45:29,458][26022] Updated weights on worker 0-0, policy_version 943191 (0.00084) [2022-07-10 23:45:31,224][26022] Updated weights on worker 0-0, policy_version 943201 (0.00092) [2022-07-10 23:45:33,234][26022] Updated weights on worker 0-0, policy_version 943211 (0.00089) [2022-07-10 23:45:34,016][25689] Fps is (10 sec: 5462.3, 60 sec: 5543.6, 300 sec: 5539.0). Total num frames: 965852160. Throughput: 0: 5835.4. Samples: 965852984. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:34,018][25689] Avg episode reward: [(0, '-0.202')] [2022-07-10 23:45:34,936][26022] Updated weights on worker 0-0, policy_version 943221 (0.00084) [2022-07-10 23:45:36,893][26022] Updated weights on worker 0-0, policy_version 943231 (0.00098) [2022-07-10 23:45:38,538][26022] Updated weights on worker 0-0, policy_version 943241 (0.00097) [2022-07-10 23:45:39,060][25689] Fps is (10 sec: 5567.6, 60 sec: 5565.2, 300 sec: 5542.6). Total num frames: 965880832. Throughput: 0: 5808.9. Samples: 965886330. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:39,060][25689] Avg episode reward: [(0, '-0.173')] [2022-07-10 23:45:40,796][26022] Updated weights on worker 0-0, policy_version 943251 (0.00085) [2022-07-10 23:45:42,254][26022] Updated weights on worker 0-0, policy_version 943261 (0.00091) [2022-07-10 23:45:44,117][25689] Fps is (10 sec: 5575.5, 60 sec: 5534.7, 300 sec: 5538.6). Total num frames: 965908480. Throughput: 0: 4964.7. Samples: 965903016. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:44,118][25689] Avg episode reward: [(0, '-1.228')] [2022-07-10 23:45:44,283][26022] Updated weights on worker 0-0, policy_version 943271 (0.00085) [2022-07-10 23:45:46,048][26022] Updated weights on worker 0-0, policy_version 943281 (0.00093) [2022-07-10 23:45:47,983][26022] Updated weights on worker 0-0, policy_version 943291 (0.00087) [2022-07-10 23:45:49,141][25689] Fps is (10 sec: 5484.7, 60 sec: 5550.7, 300 sec: 5542.8). Total num frames: 965936128. Throughput: 0: 5794.1. Samples: 965936374. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:49,142][25689] Avg episode reward: [(0, '-0.730')] [2022-07-10 23:45:49,823][26022] Updated weights on worker 0-0, policy_version 943301 (0.00091) [2022-07-10 23:45:51,671][26022] Updated weights on worker 0-0, policy_version 943311 (0.00091) [2022-07-10 23:45:53,609][26022] Updated weights on worker 0-0, policy_version 943321 (0.00085) [2022-07-10 23:45:54,186][25689] Fps is (10 sec: 5593.7, 60 sec: 5564.0, 300 sec: 5531.8). Total num frames: 965964800. Throughput: 0: 5798.0. Samples: 965969928. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:54,186][25689] Avg episode reward: [(0, '-0.925')] [2022-07-10 23:45:55,261][26022] Updated weights on worker 0-0, policy_version 943331 (0.00082) [2022-07-10 23:45:57,068][26022] Updated weights on worker 0-0, policy_version 943341 (0.00084) [2022-07-10 23:45:59,107][26022] Updated weights on worker 0-0, policy_version 943351 (0.00093) [2022-07-10 23:45:59,201][25689] Fps is (10 sec: 5496.6, 60 sec: 5546.1, 300 sec: 5542.6). Total num frames: 965991424. Throughput: 0: 4981.1. Samples: 965986658. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:45:59,203][25689] Avg episode reward: [(0, '-0.695')] [2022-07-10 23:46:00,574][26022] Updated weights on worker 0-0, policy_version 943361 (0.00079) [2022-07-10 23:46:03,136][26022] Updated weights on worker 0-0, policy_version 943371 (0.00109) [2022-07-10 23:46:04,329][25689] Fps is (10 sec: 5552.5, 60 sec: 5576.3, 300 sec: 5547.1). Total num frames: 966021120. Throughput: 0: 5696.4. Samples: 966018150. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:04,330][25689] Avg episode reward: [(0, '0.174')] [2022-07-10 23:46:04,635][26022] Updated weights on worker 0-0, policy_version 943381 (0.00085) [2022-07-10 23:46:06,829][26022] Updated weights on worker 0-0, policy_version 943391 (0.00614) [2022-07-10 23:46:08,460][26022] Updated weights on worker 0-0, policy_version 943401 (0.00092) [2022-07-10 23:46:09,398][25689] Fps is (10 sec: 5422.9, 60 sec: 5539.2, 300 sec: 5535.8). Total num frames: 966046720. Throughput: 0: 5676.6. Samples: 966051364. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:09,399][25689] Avg episode reward: [(0, '0.340')] [2022-07-10 23:46:10,505][26022] Updated weights on worker 0-0, policy_version 943411 (0.00090) [2022-07-10 23:46:12,176][26022] Updated weights on worker 0-0, policy_version 943421 (0.00089) [2022-07-10 23:46:14,391][26022] Updated weights on worker 0-0, policy_version 943431 (0.00089) [2022-07-10 23:46:14,458][25689] Fps is (10 sec: 5155.7, 60 sec: 5501.1, 300 sec: 5531.4). Total num frames: 966073344. Throughput: 0: 4830.3. Samples: 966067850. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:14,459][25689] Avg episode reward: [(0, '0.893')] [2022-07-10 23:46:15,892][26022] Updated weights on worker 0-0, policy_version 943441 (0.00086) [2022-07-10 23:46:17,980][26022] Updated weights on worker 0-0, policy_version 943451 (0.00093) [2022-07-10 23:46:19,434][26022] Updated weights on worker 0-0, policy_version 943461 (0.00086) [2022-07-10 23:46:19,467][25689] Fps is (10 sec: 5695.4, 60 sec: 5551.9, 300 sec: 5546.0). Total num frames: 966104064. Throughput: 0: 5643.2. Samples: 966101018. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:19,468][25689] Avg episode reward: [(0, '0.643')] [2022-07-10 23:46:21,743][26022] Updated weights on worker 0-0, policy_version 943471 (0.00089) [2022-07-10 23:46:23,606][26022] Updated weights on worker 0-0, policy_version 943481 (0.00085) [2022-07-10 23:46:24,613][25689] Fps is (10 sec: 5647.1, 60 sec: 5497.7, 300 sec: 5533.0). Total num frames: 966130688. Throughput: 0: 5723.5. Samples: 966134246. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:24,613][25689] Avg episode reward: [(0, '-0.464')] [2022-07-10 23:46:25,282][26022] Updated weights on worker 0-0, policy_version 943491 (0.00094) [2022-07-10 23:46:27,176][26022] Updated weights on worker 0-0, policy_version 943501 (0.00086) [2022-07-10 23:46:29,248][26022] Updated weights on worker 0-0, policy_version 943511 (0.00084) [2022-07-10 23:46:29,622][25689] Fps is (10 sec: 5243.3, 60 sec: 5484.7, 300 sec: 5533.2). Total num frames: 966157312. Throughput: 0: 5737.6. Samples: 966167400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:29,623][25689] Avg episode reward: [(0, '-1.325')] [2022-07-10 23:46:30,741][26022] Updated weights on worker 0-0, policy_version 943521 (0.00102) [2022-07-10 23:46:32,812][26022] Updated weights on worker 0-0, policy_version 943531 (0.00086) [2022-07-10 23:46:34,418][26022] Updated weights on worker 0-0, policy_version 943541 (0.00095) [2022-07-10 23:46:34,676][25689] Fps is (10 sec: 5494.8, 60 sec: 5503.2, 300 sec: 5532.7). Total num frames: 966185984. Throughput: 0: 5744.9. Samples: 966184000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:34,677][25689] Avg episode reward: [(0, '-1.322')] [2022-07-10 23:46:36,580][26022] Updated weights on worker 0-0, policy_version 943551 (0.00085) [2022-07-10 23:46:38,298][26022] Updated weights on worker 0-0, policy_version 943561 (0.00087) [2022-07-10 23:46:39,678][25689] Fps is (10 sec: 5600.5, 60 sec: 5490.0, 300 sec: 5533.6). Total num frames: 966213632. Throughput: 0: 5743.6. Samples: 966217106. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:39,679][25689] Avg episode reward: [(0, '-1.469')] [2022-07-10 23:46:40,277][26022] Updated weights on worker 0-0, policy_version 943571 (0.00083) [2022-07-10 23:46:41,938][26022] Updated weights on worker 0-0, policy_version 943581 (0.00085) [2022-07-10 23:46:43,884][26022] Updated weights on worker 0-0, policy_version 943591 (0.00091) [2022-07-10 23:46:44,795][25689] Fps is (10 sec: 5464.4, 60 sec: 5484.7, 300 sec: 5528.3). Total num frames: 966241280. Throughput: 0: 5766.0. Samples: 966250618. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:44,796][25689] Avg episode reward: [(0, '-1.178')] [2022-07-10 23:46:45,594][26022] Updated weights on worker 0-0, policy_version 943601 (0.00087) [2022-07-10 23:46:47,674][26022] Updated weights on worker 0-0, policy_version 943611 (0.00088) [2022-07-10 23:46:49,242][26022] Updated weights on worker 0-0, policy_version 943621 (0.00097) [2022-07-10 23:46:49,845][25689] Fps is (10 sec: 5640.4, 60 sec: 5516.1, 300 sec: 5537.9). Total num frames: 966270976. Throughput: 0: 4937.2. Samples: 966267244. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:49,846][25689] Avg episode reward: [(0, '-0.710')] [2022-07-10 23:46:51,313][26022] Updated weights on worker 0-0, policy_version 943631 (0.00088) [2022-07-10 23:46:52,768][26022] Updated weights on worker 0-0, policy_version 943641 (0.00092) [2022-07-10 23:46:54,823][26022] Updated weights on worker 0-0, policy_version 943651 (0.00083) [2022-07-10 23:46:54,920][25689] Fps is (10 sec: 5664.0, 60 sec: 5496.5, 300 sec: 5533.4). Total num frames: 966298624. Throughput: 0: 5762.8. Samples: 966300660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:54,920][25689] Avg episode reward: [(0, '0.818')] [2022-07-10 23:46:56,767][26022] Updated weights on worker 0-0, policy_version 943661 (0.00084) [2022-07-10 23:46:58,491][26022] Updated weights on worker 0-0, policy_version 943671 (0.00085) [2022-07-10 23:46:59,985][25689] Fps is (10 sec: 5453.2, 60 sec: 5508.9, 300 sec: 5538.1). Total num frames: 966326272. Throughput: 0: 5772.2. Samples: 966334320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-10 23:46:59,986][25689] Avg episode reward: [(0, '0.305')] [2022-07-10 23:47:00,239][26022] Updated weights on worker 0-0, policy_version 943681 (0.00085) [2022-07-10 23:47:02,507][26022] Updated weights on worker 0-0, policy_version 943691 (0.00078) [2022-07-10 23:47:04,300][26022] Updated weights on worker 0-0, policy_version 943701 (0.00092) [2022-07-10 23:47:05,099][25689] Fps is (10 sec: 5331.5, 60 sec: 5459.5, 300 sec: 5533.3). Total num frames: 966352896. Throughput: 0: 4859.8. Samples: 966349290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:05,100][25689] Avg episode reward: [(0, '0.209')] [2022-07-10 23:47:06,458][26022] Updated weights on worker 0-0, policy_version 943711 (0.00089) [2022-07-10 23:47:08,172][26022] Updated weights on worker 0-0, policy_version 943721 (0.00091) [2022-07-10 23:47:09,832][26022] Updated weights on worker 0-0, policy_version 943731 (0.00086) [2022-07-10 23:47:10,132][25689] Fps is (10 sec: 5348.4, 60 sec: 5496.5, 300 sec: 5530.3). Total num frames: 966380544. Throughput: 0: 5665.8. Samples: 966382188. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:10,133][25689] Avg episode reward: [(0, '0.017')] [2022-07-10 23:47:11,924][26022] Updated weights on worker 0-0, policy_version 943741 (0.00085) [2022-07-10 23:47:13,624][26022] Updated weights on worker 0-0, policy_version 943751 (0.00084) [2022-07-10 23:47:15,155][25689] Fps is (10 sec: 5499.0, 60 sec: 5516.8, 300 sec: 5530.4). Total num frames: 966408192. Throughput: 0: 5673.7. Samples: 966415468. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:15,155][25689] Avg episode reward: [(0, '0.470')] [2022-07-10 23:47:15,636][26022] Updated weights on worker 0-0, policy_version 943761 (0.00087) [2022-07-10 23:47:16,612][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:47:16,632][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000943767_966417408.pth [2022-07-10 23:47:16,633][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000941821_964424704.pth [2022-07-10 23:47:17,209][26022] Updated weights on worker 0-0, policy_version 943771 (0.00087) [2022-07-10 23:47:19,167][26022] Updated weights on worker 0-0, policy_version 943781 (0.00097) [2022-07-10 23:47:20,202][25689] Fps is (10 sec: 5593.0, 60 sec: 5479.5, 300 sec: 5531.8). Total num frames: 966436864. Throughput: 0: 4841.2. Samples: 966432194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:20,203][25689] Avg episode reward: [(0, '0.245')] [2022-07-10 23:47:21,053][26022] Updated weights on worker 0-0, policy_version 943791 (0.00094) [2022-07-10 23:47:22,833][26022] Updated weights on worker 0-0, policy_version 943801 (0.00090) [2022-07-10 23:47:24,664][26022] Updated weights on worker 0-0, policy_version 943811 (0.00084) [2022-07-10 23:47:25,255][25689] Fps is (10 sec: 5677.4, 60 sec: 5521.7, 300 sec: 5531.3). Total num frames: 966465536. Throughput: 0: 5779.0. Samples: 966465770. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:25,256][25689] Avg episode reward: [(0, '0.060')] [2022-07-10 23:47:26,603][26022] Updated weights on worker 0-0, policy_version 943821 (0.00092) [2022-07-10 23:47:28,266][26022] Updated weights on worker 0-0, policy_version 943831 (0.00091) [2022-07-10 23:47:30,257][25689] Fps is (10 sec: 5397.5, 60 sec: 5505.5, 300 sec: 5524.6). Total num frames: 966491136. Throughput: 0: 5809.5. Samples: 966499102. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:30,258][25689] Avg episode reward: [(0, '0.206')] [2022-07-10 23:47:30,368][26022] Updated weights on worker 0-0, policy_version 943841 (0.00088) [2022-07-10 23:47:32,051][26022] Updated weights on worker 0-0, policy_version 943851 (0.00089) [2022-07-10 23:47:33,928][26022] Updated weights on worker 0-0, policy_version 943861 (0.00097) [2022-07-10 23:47:35,270][25689] Fps is (10 sec: 5521.2, 60 sec: 5526.1, 300 sec: 5531.6). Total num frames: 966520832. Throughput: 0: 4985.7. Samples: 966515758. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:35,271][25689] Avg episode reward: [(0, '0.859')] [2022-07-10 23:47:35,815][26022] Updated weights on worker 0-0, policy_version 943871 (0.00091) [2022-07-10 23:47:37,512][26022] Updated weights on worker 0-0, policy_version 943881 (0.00088) [2022-07-10 23:47:39,390][26022] Updated weights on worker 0-0, policy_version 943891 (0.00100) [2022-07-10 23:47:40,289][25689] Fps is (10 sec: 5716.4, 60 sec: 5524.6, 300 sec: 5533.6). Total num frames: 966548480. Throughput: 0: 5811.1. Samples: 966548918. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:40,289][25689] Avg episode reward: [(0, '1.249')] [2022-07-10 23:47:41,531][26022] Updated weights on worker 0-0, policy_version 943901 (0.00094) [2022-07-10 23:47:43,231][26022] Updated weights on worker 0-0, policy_version 943911 (0.00086) [2022-07-10 23:47:45,114][26022] Updated weights on worker 0-0, policy_version 943921 (0.00086) [2022-07-10 23:47:45,345][25689] Fps is (10 sec: 5488.4, 60 sec: 5530.1, 300 sec: 5526.2). Total num frames: 966576128. Throughput: 0: 5797.0. Samples: 966582232. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:45,346][25689] Avg episode reward: [(0, '1.057')] [2022-07-10 23:47:46,792][26022] Updated weights on worker 0-0, policy_version 943931 (0.00083) [2022-07-10 23:47:48,748][26022] Updated weights on worker 0-0, policy_version 943941 (0.00089) [2022-07-10 23:47:50,382][25689] Fps is (10 sec: 5579.8, 60 sec: 5514.4, 300 sec: 5529.8). Total num frames: 966604800. Throughput: 0: 4965.6. Samples: 966599034. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:50,383][25689] Avg episode reward: [(0, '1.272')] [2022-07-10 23:47:50,530][26022] Updated weights on worker 0-0, policy_version 943951 (0.00082) [2022-07-10 23:47:52,279][26022] Updated weights on worker 0-0, policy_version 943961 (0.00089) [2022-07-10 23:47:54,475][26022] Updated weights on worker 0-0, policy_version 943971 (0.00094) [2022-07-10 23:47:55,390][25689] Fps is (10 sec: 5606.7, 60 sec: 5520.4, 300 sec: 5527.8). Total num frames: 966632448. Throughput: 0: 5805.3. Samples: 966632562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:47:55,391][25689] Avg episode reward: [(0, '1.174')] [2022-07-10 23:47:56,117][26022] Updated weights on worker 0-0, policy_version 943981 (0.00091) [2022-07-10 23:47:58,083][26022] Updated weights on worker 0-0, policy_version 943991 (0.00092) [2022-07-10 23:47:59,688][26022] Updated weights on worker 0-0, policy_version 944001 (0.00092) [2022-07-10 23:48:00,430][25689] Fps is (10 sec: 5503.1, 60 sec: 5522.8, 300 sec: 5532.2). Total num frames: 966660096. Throughput: 0: 5805.6. Samples: 966665852. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:00,431][25689] Avg episode reward: [(0, '1.228')] [2022-07-10 23:48:01,478][26022] Updated weights on worker 0-0, policy_version 944011 (0.00096) [2022-07-10 23:48:03,935][26022] Updated weights on worker 0-0, policy_version 944021 (0.00085) [2022-07-10 23:48:05,527][25689] Fps is (10 sec: 5354.3, 60 sec: 5524.4, 300 sec: 5527.4). Total num frames: 966686720. Throughput: 0: 4864.8. Samples: 966680410. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:05,527][25689] Avg episode reward: [(0, '0.917')] [2022-07-10 23:48:05,829][26022] Updated weights on worker 0-0, policy_version 944031 (0.00613) [2022-07-10 23:48:07,686][26022] Updated weights on worker 0-0, policy_version 944041 (0.00092) [2022-07-10 23:48:09,423][26022] Updated weights on worker 0-0, policy_version 944051 (0.00080) [2022-07-10 23:48:10,559][25689] Fps is (10 sec: 5358.4, 60 sec: 5524.5, 300 sec: 5527.7). Total num frames: 966714368. Throughput: 0: 5679.5. Samples: 966713626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:10,559][25689] Avg episode reward: [(0, '0.833')] [2022-07-10 23:48:11,385][26022] Updated weights on worker 0-0, policy_version 944061 (0.00090) [2022-07-10 23:48:12,898][26022] Updated weights on worker 0-0, policy_version 944071 (0.00091) [2022-07-10 23:48:14,919][26022] Updated weights on worker 0-0, policy_version 944081 (0.00090) [2022-07-10 23:48:15,612][25689] Fps is (10 sec: 5584.2, 60 sec: 5538.6, 300 sec: 5530.1). Total num frames: 966743040. Throughput: 0: 5679.9. Samples: 966747420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:15,613][25689] Avg episode reward: [(0, '0.369')] [2022-07-10 23:48:16,681][26022] Updated weights on worker 0-0, policy_version 944091 (0.00086) [2022-07-10 23:48:18,678][26022] Updated weights on worker 0-0, policy_version 944101 (0.00091) [2022-07-10 23:48:20,327][26022] Updated weights on worker 0-0, policy_version 944111 (0.00094) [2022-07-10 23:48:20,632][25689] Fps is (10 sec: 5489.5, 60 sec: 5507.2, 300 sec: 5521.0). Total num frames: 966769664. Throughput: 0: 4875.4. Samples: 966764342. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:20,633][25689] Avg episode reward: [(0, '-0.079')] [2022-07-10 23:48:22,305][26022] Updated weights on worker 0-0, policy_version 944121 (0.00087) [2022-07-10 23:48:24,101][26022] Updated weights on worker 0-0, policy_version 944131 (0.00086) [2022-07-10 23:48:25,685][25689] Fps is (10 sec: 5489.7, 60 sec: 5507.2, 300 sec: 5521.8). Total num frames: 966798336. Throughput: 0: 5819.8. Samples: 966797728. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:25,687][25689] Avg episode reward: [(0, '0.122')] [2022-07-10 23:48:25,894][26022] Updated weights on worker 0-0, policy_version 944141 (0.00098) [2022-07-10 23:48:27,723][26022] Updated weights on worker 0-0, policy_version 944151 (0.00083) [2022-07-10 23:48:29,608][26022] Updated weights on worker 0-0, policy_version 944161 (0.00086) [2022-07-10 23:48:30,775][25689] Fps is (10 sec: 5754.1, 60 sec: 5566.9, 300 sec: 5525.2). Total num frames: 966828032. Throughput: 0: 5809.5. Samples: 966831076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:30,776][25689] Avg episode reward: [(0, '0.109')] [2022-07-10 23:48:31,494][26022] Updated weights on worker 0-0, policy_version 944171 (0.00090) [2022-07-10 23:48:33,255][26022] Updated weights on worker 0-0, policy_version 944181 (0.00084) [2022-07-10 23:48:34,995][26022] Updated weights on worker 0-0, policy_version 944191 (0.00088) [2022-07-10 23:48:35,853][25689] Fps is (10 sec: 5437.9, 60 sec: 5493.3, 300 sec: 5528.2). Total num frames: 966853632. Throughput: 0: 5789.0. Samples: 966864596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:35,854][25689] Avg episode reward: [(0, '0.073')] [2022-07-10 23:48:36,971][26022] Updated weights on worker 0-0, policy_version 944201 (0.00086) [2022-07-10 23:48:39,019][26022] Updated weights on worker 0-0, policy_version 944211 (0.00097) [2022-07-10 23:48:40,636][26022] Updated weights on worker 0-0, policy_version 944221 (0.00091) [2022-07-10 23:48:40,859][25689] Fps is (10 sec: 5585.2, 60 sec: 5545.1, 300 sec: 5523.3). Total num frames: 966884352. Throughput: 0: 5764.5. Samples: 966880944. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:40,860][25689] Avg episode reward: [(0, '0.328')] [2022-07-10 23:48:42,478][26022] Updated weights on worker 0-0, policy_version 944231 (0.00080) [2022-07-10 23:48:44,245][26022] Updated weights on worker 0-0, policy_version 944241 (0.00087) [2022-07-10 23:48:45,909][25689] Fps is (10 sec: 5702.8, 60 sec: 5528.9, 300 sec: 5523.3). Total num frames: 966910976. Throughput: 0: 5778.0. Samples: 966914582. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:45,909][25689] Avg episode reward: [(0, '0.617')] [2022-07-10 23:48:46,145][26022] Updated weights on worker 0-0, policy_version 944251 (0.00088) [2022-07-10 23:48:48,098][26022] Updated weights on worker 0-0, policy_version 944261 (0.00087) [2022-07-10 23:48:49,727][26022] Updated weights on worker 0-0, policy_version 944271 (0.00050) [2022-07-10 23:48:50,916][25689] Fps is (10 sec: 5294.7, 60 sec: 5497.8, 300 sec: 5516.6). Total num frames: 966937600. Throughput: 0: 5808.8. Samples: 966948068. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:50,916][25689] Avg episode reward: [(0, '0.944')] [2022-07-10 23:48:51,796][26022] Updated weights on worker 0-0, policy_version 944281 (0.00088) [2022-07-10 23:48:53,583][26022] Updated weights on worker 0-0, policy_version 944291 (0.00080) [2022-07-10 23:48:55,457][26022] Updated weights on worker 0-0, policy_version 944301 (0.00087) [2022-07-10 23:48:55,918][25689] Fps is (10 sec: 5729.1, 60 sec: 5549.1, 300 sec: 5528.5). Total num frames: 966968320. Throughput: 0: 4990.0. Samples: 966964718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:48:55,918][25689] Avg episode reward: [(0, '0.752')] [2022-07-10 23:48:57,152][26022] Updated weights on worker 0-0, policy_version 944311 (0.00090) [2022-07-10 23:48:59,004][26022] Updated weights on worker 0-0, policy_version 944321 (0.00084) [2022-07-10 23:49:00,955][25689] Fps is (10 sec: 5609.9, 60 sec: 5515.5, 300 sec: 5529.8). Total num frames: 966993920. Throughput: 0: 5842.2. Samples: 966998350. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:00,956][25689] Avg episode reward: [(0, '0.918')] [2022-07-10 23:49:01,070][26022] Updated weights on worker 0-0, policy_version 944331 (0.00090) [2022-07-10 23:49:03,061][26022] Updated weights on worker 0-0, policy_version 944341 (0.00082) [2022-07-10 23:49:05,101][26022] Updated weights on worker 0-0, policy_version 944351 (0.00090) [2022-07-10 23:49:06,007][25689] Fps is (10 sec: 5176.5, 60 sec: 5519.6, 300 sec: 5519.0). Total num frames: 967020544. Throughput: 0: 5726.9. Samples: 967029680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:06,007][25689] Avg episode reward: [(0, '0.675')] [2022-07-10 23:49:06,823][26022] Updated weights on worker 0-0, policy_version 944361 (0.00089) [2022-07-10 23:49:08,703][26022] Updated weights on worker 0-0, policy_version 944371 (0.00086) [2022-07-10 23:49:10,550][26022] Updated weights on worker 0-0, policy_version 944381 (0.00090) [2022-07-10 23:49:11,059][25689] Fps is (10 sec: 5472.9, 60 sec: 5534.6, 300 sec: 5528.5). Total num frames: 967049216. Throughput: 0: 4879.0. Samples: 967046346. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:11,059][25689] Avg episode reward: [(0, '0.670')] [2022-07-10 23:49:12,301][26022] Updated weights on worker 0-0, policy_version 944391 (0.00091) [2022-07-10 23:49:14,220][26022] Updated weights on worker 0-0, policy_version 944401 (0.00090) [2022-07-10 23:49:15,859][26022] Updated weights on worker 0-0, policy_version 944411 (0.00095) [2022-07-10 23:49:16,072][25689] Fps is (10 sec: 5697.1, 60 sec: 5538.3, 300 sec: 5525.1). Total num frames: 967077888. Throughput: 0: 5718.0. Samples: 967079960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:16,073][25689] Avg episode reward: [(0, '0.921')] [2022-07-10 23:49:16,637][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:49:16,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000944415_967080960.pth [2022-07-10 23:49:16,648][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000942470_965089280.pth [2022-07-10 23:49:17,959][26022] Updated weights on worker 0-0, policy_version 944421 (0.01133) [2022-07-10 23:49:19,450][26022] Updated weights on worker 0-0, policy_version 944431 (0.00096) [2022-07-10 23:49:21,109][25689] Fps is (10 sec: 5502.2, 60 sec: 5536.8, 300 sec: 5522.7). Total num frames: 967104512. Throughput: 0: 5712.3. Samples: 967113472. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:21,111][25689] Avg episode reward: [(0, '0.888')] [2022-07-10 23:49:21,557][26022] Updated weights on worker 0-0, policy_version 944441 (0.00088) [2022-07-10 23:49:23,190][26022] Updated weights on worker 0-0, policy_version 944451 (0.00089) [2022-07-10 23:49:25,121][26022] Updated weights on worker 0-0, policy_version 944461 (0.00085) [2022-07-10 23:49:26,178][25689] Fps is (10 sec: 5572.8, 60 sec: 5552.2, 300 sec: 5522.0). Total num frames: 967134208. Throughput: 0: 4990.3. Samples: 967130344. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:26,179][25689] Avg episode reward: [(0, '0.230')] [2022-07-10 23:49:26,924][26022] Updated weights on worker 0-0, policy_version 944471 (0.00090) [2022-07-10 23:49:28,701][26022] Updated weights on worker 0-0, policy_version 944481 (0.00085) [2022-07-10 23:49:30,601][26022] Updated weights on worker 0-0, policy_version 944491 (0.00087) [2022-07-10 23:49:31,207][25689] Fps is (10 sec: 5577.0, 60 sec: 5507.0, 300 sec: 5522.7). Total num frames: 967160832. Throughput: 0: 5838.9. Samples: 967163990. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:31,208][25689] Avg episode reward: [(0, '0.080')] [2022-07-10 23:49:32,515][26022] Updated weights on worker 0-0, policy_version 944501 (0.00081) [2022-07-10 23:49:34,327][26022] Updated weights on worker 0-0, policy_version 944511 (0.00086) [2022-07-10 23:49:36,104][26022] Updated weights on worker 0-0, policy_version 944521 (0.00091) [2022-07-10 23:49:36,221][25689] Fps is (10 sec: 5608.0, 60 sec: 5580.7, 300 sec: 5531.1). Total num frames: 967190528. Throughput: 0: 5854.9. Samples: 967197930. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:36,222][25689] Avg episode reward: [(0, '0.602')] [2022-07-10 23:49:38,030][26022] Updated weights on worker 0-0, policy_version 944531 (0.00092) [2022-07-10 23:49:39,778][26022] Updated weights on worker 0-0, policy_version 944541 (0.00083) [2022-07-10 23:49:41,224][25689] Fps is (10 sec: 5622.7, 60 sec: 5513.1, 300 sec: 5522.5). Total num frames: 967217152. Throughput: 0: 5012.5. Samples: 967214300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:41,224][25689] Avg episode reward: [(0, '0.589')] [2022-07-10 23:49:41,828][26022] Updated weights on worker 0-0, policy_version 944551 (0.00089) [2022-07-10 23:49:43,432][26022] Updated weights on worker 0-0, policy_version 944561 (0.00088) [2022-07-10 23:49:45,395][26022] Updated weights on worker 0-0, policy_version 944571 (0.00089) [2022-07-10 23:49:46,344][25689] Fps is (10 sec: 5462.6, 60 sec: 5540.6, 300 sec: 5527.4). Total num frames: 967245824. Throughput: 0: 5827.9. Samples: 967247866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:46,345][25689] Avg episode reward: [(0, '0.457')] [2022-07-10 23:49:47,048][26022] Updated weights on worker 0-0, policy_version 944581 (0.00084) [2022-07-10 23:49:49,096][26022] Updated weights on worker 0-0, policy_version 944591 (0.00095) [2022-07-10 23:49:50,695][26022] Updated weights on worker 0-0, policy_version 944601 (0.00097) [2022-07-10 23:49:51,388][25689] Fps is (10 sec: 5641.9, 60 sec: 5571.1, 300 sec: 5530.1). Total num frames: 967274496. Throughput: 0: 5821.1. Samples: 967281462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:51,388][25689] Avg episode reward: [(0, '0.556')] [2022-07-10 23:49:52,703][26022] Updated weights on worker 0-0, policy_version 944611 (0.00085) [2022-07-10 23:49:54,267][26022] Updated weights on worker 0-0, policy_version 944621 (0.00093) [2022-07-10 23:49:56,329][26022] Updated weights on worker 0-0, policy_version 944631 (0.00095) [2022-07-10 23:49:56,417][25689] Fps is (10 sec: 5591.2, 60 sec: 5517.9, 300 sec: 5529.6). Total num frames: 967302144. Throughput: 0: 4971.5. Samples: 967298332. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:49:56,417][25689] Avg episode reward: [(0, '1.182')] [2022-07-10 23:49:58,182][26022] Updated weights on worker 0-0, policy_version 944641 (0.00085) [2022-07-10 23:50:00,029][26022] Updated weights on worker 0-0, policy_version 944651 (0.00083) [2022-07-10 23:50:01,422][25689] Fps is (10 sec: 5510.8, 60 sec: 5554.7, 300 sec: 5531.2). Total num frames: 967329792. Throughput: 0: 5817.5. Samples: 967331802. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:01,423][25689] Avg episode reward: [(0, '0.966')] [2022-07-10 23:50:02,182][26022] Updated weights on worker 0-0, policy_version 944661 (0.00088) [2022-07-10 23:50:04,178][26022] Updated weights on worker 0-0, policy_version 944671 (0.00094) [2022-07-10 23:50:05,819][26022] Updated weights on worker 0-0, policy_version 944681 (0.00089) [2022-07-10 23:50:06,515][25689] Fps is (10 sec: 5374.6, 60 sec: 5550.9, 300 sec: 5526.6). Total num frames: 967356416. Throughput: 0: 5716.8. Samples: 967363178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:06,515][25689] Avg episode reward: [(0, '0.726')] [2022-07-10 23:50:07,680][26022] Updated weights on worker 0-0, policy_version 944691 (0.00095) [2022-07-10 23:50:09,563][26022] Updated weights on worker 0-0, policy_version 944701 (0.00087) [2022-07-10 23:50:11,533][25689] Fps is (10 sec: 5266.4, 60 sec: 5520.1, 300 sec: 5519.6). Total num frames: 967383040. Throughput: 0: 4867.6. Samples: 967379520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:11,533][25689] Avg episode reward: [(0, '0.676')] [2022-07-10 23:50:11,657][26022] Updated weights on worker 0-0, policy_version 944711 (0.00093) [2022-07-10 23:50:13,236][26022] Updated weights on worker 0-0, policy_version 944721 (0.00087) [2022-07-10 23:50:15,210][26022] Updated weights on worker 0-0, policy_version 944731 (0.00085) [2022-07-10 23:50:16,539][25689] Fps is (10 sec: 5618.5, 60 sec: 5537.7, 300 sec: 5526.6). Total num frames: 967412736. Throughput: 0: 5693.0. Samples: 967412886. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:16,539][25689] Avg episode reward: [(0, '0.771')] [2022-07-10 23:50:16,847][26022] Updated weights on worker 0-0, policy_version 944741 (0.00080) [2022-07-10 23:50:18,964][26022] Updated weights on worker 0-0, policy_version 944751 (0.00089) [2022-07-10 23:50:20,569][26022] Updated weights on worker 0-0, policy_version 944761 (0.00092) [2022-07-10 23:50:21,548][25689] Fps is (10 sec: 5725.9, 60 sec: 5557.2, 300 sec: 5521.6). Total num frames: 967440384. Throughput: 0: 5711.1. Samples: 967446742. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:21,548][25689] Avg episode reward: [(0, '1.063')] [2022-07-10 23:50:22,457][26022] Updated weights on worker 0-0, policy_version 944771 (0.00084) [2022-07-10 23:50:24,239][26022] Updated weights on worker 0-0, policy_version 944781 (0.00091) [2022-07-10 23:50:26,329][26022] Updated weights on worker 0-0, policy_version 944791 (0.00094) [2022-07-10 23:50:26,604][25689] Fps is (10 sec: 5494.0, 60 sec: 5524.6, 300 sec: 5521.5). Total num frames: 967468032. Throughput: 0: 4999.2. Samples: 967463606. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:26,604][25689] Avg episode reward: [(0, '0.999')] [2022-07-10 23:50:28,019][26022] Updated weights on worker 0-0, policy_version 944801 (0.00578) [2022-07-10 23:50:29,991][26022] Updated weights on worker 0-0, policy_version 944811 (0.00089) [2022-07-10 23:50:31,474][26022] Updated weights on worker 0-0, policy_version 944821 (0.00090) [2022-07-10 23:50:31,611][25689] Fps is (10 sec: 5596.8, 60 sec: 5560.5, 300 sec: 5526.1). Total num frames: 967496704. Throughput: 0: 5836.7. Samples: 967496708. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:31,611][25689] Avg episode reward: [(0, '1.230')] [2022-07-10 23:50:33,824][26022] Updated weights on worker 0-0, policy_version 944831 (0.00091) [2022-07-10 23:50:35,131][26022] Updated weights on worker 0-0, policy_version 944841 (0.00091) [2022-07-10 23:50:36,614][25689] Fps is (10 sec: 5524.0, 60 sec: 5510.6, 300 sec: 5520.0). Total num frames: 967523328. Throughput: 0: 5844.7. Samples: 967530218. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:36,615][25689] Avg episode reward: [(0, '1.452')] [2022-07-10 23:50:37,354][26022] Updated weights on worker 0-0, policy_version 944851 (0.00082) [2022-07-10 23:50:38,844][26022] Updated weights on worker 0-0, policy_version 944861 (0.00085) [2022-07-10 23:50:41,062][26022] Updated weights on worker 0-0, policy_version 944871 (0.00081) [2022-07-10 23:50:41,624][25689] Fps is (10 sec: 5522.3, 60 sec: 5543.8, 300 sec: 5524.3). Total num frames: 967552000. Throughput: 0: 4995.0. Samples: 967547022. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:41,625][25689] Avg episode reward: [(0, '1.100')] [2022-07-10 23:50:42,587][26022] Updated weights on worker 0-0, policy_version 944881 (0.00079) [2022-07-10 23:50:44,615][26022] Updated weights on worker 0-0, policy_version 944891 (0.00086) [2022-07-10 23:50:46,221][26022] Updated weights on worker 0-0, policy_version 944901 (0.00090) [2022-07-10 23:50:46,688][25689] Fps is (10 sec: 5692.2, 60 sec: 5549.0, 300 sec: 5527.0). Total num frames: 967580672. Throughput: 0: 5815.2. Samples: 967580400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:46,690][25689] Avg episode reward: [(0, '1.066')] [2022-07-10 23:50:48,307][26022] Updated weights on worker 0-0, policy_version 944911 (0.00093) [2022-07-10 23:50:50,021][26022] Updated weights on worker 0-0, policy_version 944921 (0.00090) [2022-07-10 23:50:51,732][25689] Fps is (10 sec: 5368.9, 60 sec: 5498.0, 300 sec: 5516.7). Total num frames: 967606272. Throughput: 0: 5820.2. Samples: 967613820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-10 23:50:51,735][25689] Avg episode reward: [(0, '0.694')] [2022-07-10 23:50:52,001][26022] Updated weights on worker 0-0, policy_version 944931 (0.00090) [2022-07-10 23:50:53,535][26022] Updated weights on worker 0-0, policy_version 944941 (0.00090) [2022-07-10 23:50:55,607][26022] Updated weights on worker 0-0, policy_version 944951 (0.00093) [2022-07-10 23:50:56,737][25689] Fps is (10 sec: 5604.6, 60 sec: 5551.2, 300 sec: 5530.7). Total num frames: 967636992. Throughput: 0: 4988.3. Samples: 967630598. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:50:56,737][25689] Avg episode reward: [(0, '0.676')] [2022-07-10 23:50:57,162][26022] Updated weights on worker 0-0, policy_version 944961 (0.00093) [2022-07-10 23:50:59,426][26022] Updated weights on worker 0-0, policy_version 944971 (0.00124) [2022-07-10 23:51:01,075][26022] Updated weights on worker 0-0, policy_version 944981 (0.00086) [2022-07-10 23:51:01,775][25689] Fps is (10 sec: 5508.6, 60 sec: 5497.7, 300 sec: 5515.2). Total num frames: 967661568. Throughput: 0: 5800.0. Samples: 967663872. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:01,775][25689] Avg episode reward: [(0, '0.550')] [2022-07-10 23:51:03,435][26022] Updated weights on worker 0-0, policy_version 944991 (0.00082) [2022-07-10 23:51:05,224][26022] Updated weights on worker 0-0, policy_version 945001 (0.00085) [2022-07-10 23:51:06,928][25689] Fps is (10 sec: 5325.5, 60 sec: 5542.7, 300 sec: 5527.3). Total num frames: 967691264. Throughput: 0: 5688.0. Samples: 967695524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:06,928][25689] Avg episode reward: [(0, '0.585')] [2022-07-10 23:51:06,932][26022] Updated weights on worker 0-0, policy_version 945011 (0.00087) [2022-07-10 23:51:08,832][26022] Updated weights on worker 0-0, policy_version 945021 (0.00099) [2022-07-10 23:51:10,792][26022] Updated weights on worker 0-0, policy_version 945031 (0.00101) [2022-07-10 23:51:11,947][25689] Fps is (10 sec: 5635.1, 60 sec: 5559.5, 300 sec: 5531.5). Total num frames: 967718912. Throughput: 0: 4872.5. Samples: 967712314. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:11,947][25689] Avg episode reward: [(0, '0.667')] [2022-07-10 23:51:12,698][26022] Updated weights on worker 0-0, policy_version 945041 (0.00088) [2022-07-10 23:51:14,451][26022] Updated weights on worker 0-0, policy_version 945051 (0.00100) [2022-07-10 23:51:16,102][26022] Updated weights on worker 0-0, policy_version 945061 (0.00458) [2022-07-10 23:51:16,728][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:51:16,748][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000945064_967745536.pth [2022-07-10 23:51:16,749][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000943120_965754880.pth [2022-07-10 23:51:16,950][25689] Fps is (10 sec: 5414.9, 60 sec: 5508.9, 300 sec: 5517.9). Total num frames: 967745536. Throughput: 0: 5693.0. Samples: 967745674. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:16,951][25689] Avg episode reward: [(0, '0.616')] [2022-07-10 23:51:18,039][26022] Updated weights on worker 0-0, policy_version 945071 (0.00085) [2022-07-10 23:51:19,873][26022] Updated weights on worker 0-0, policy_version 945081 (0.00087) [2022-07-10 23:51:21,755][26022] Updated weights on worker 0-0, policy_version 945091 (0.00086) [2022-07-10 23:51:21,988][25689] Fps is (10 sec: 5506.8, 60 sec: 5523.2, 300 sec: 5526.8). Total num frames: 967774208. Throughput: 0: 5707.2. Samples: 967779258. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:21,989][25689] Avg episode reward: [(0, '0.930')] [2022-07-10 23:51:23,620][26022] Updated weights on worker 0-0, policy_version 945101 (0.00087) [2022-07-10 23:51:25,325][26022] Updated weights on worker 0-0, policy_version 945111 (0.00088) [2022-07-10 23:51:27,028][25689] Fps is (10 sec: 5690.0, 60 sec: 5541.6, 300 sec: 5533.1). Total num frames: 967802880. Throughput: 0: 5003.0. Samples: 967796088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:27,029][25689] Avg episode reward: [(0, '0.571')] [2022-07-10 23:51:27,418][26022] Updated weights on worker 0-0, policy_version 945121 (0.00105) [2022-07-10 23:51:29,128][26022] Updated weights on worker 0-0, policy_version 945131 (0.00094) [2022-07-10 23:51:31,078][26022] Updated weights on worker 0-0, policy_version 945141 (0.00092) [2022-07-10 23:51:32,054][25689] Fps is (10 sec: 5595.0, 60 sec: 5522.9, 300 sec: 5530.1). Total num frames: 967830528. Throughput: 0: 5828.2. Samples: 967829504. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:32,055][25689] Avg episode reward: [(0, '0.383')] [2022-07-10 23:51:32,731][26022] Updated weights on worker 0-0, policy_version 945151 (0.00088) [2022-07-10 23:51:34,648][26022] Updated weights on worker 0-0, policy_version 945161 (0.00086) [2022-07-10 23:51:36,365][26022] Updated weights on worker 0-0, policy_version 945171 (0.00091) [2022-07-10 23:51:37,089][25689] Fps is (10 sec: 5496.4, 60 sec: 5537.0, 300 sec: 5529.5). Total num frames: 967858176. Throughput: 0: 5827.6. Samples: 967863034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:37,091][25689] Avg episode reward: [(0, '0.241')] [2022-07-10 23:51:38,242][26022] Updated weights on worker 0-0, policy_version 945181 (0.00088) [2022-07-10 23:51:40,020][26022] Updated weights on worker 0-0, policy_version 945191 (0.00091) [2022-07-10 23:51:41,920][26022] Updated weights on worker 0-0, policy_version 945201 (0.00087) [2022-07-10 23:51:42,139][25689] Fps is (10 sec: 5585.0, 60 sec: 5533.3, 300 sec: 5534.2). Total num frames: 967886848. Throughput: 0: 5802.5. Samples: 967896180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:42,139][25689] Avg episode reward: [(0, '0.611')] [2022-07-10 23:51:44,031][26022] Updated weights on worker 0-0, policy_version 945211 (0.00090) [2022-07-10 23:51:45,439][26022] Updated weights on worker 0-0, policy_version 945221 (0.00089) [2022-07-10 23:51:47,273][25689] Fps is (10 sec: 5530.1, 60 sec: 5510.0, 300 sec: 5525.8). Total num frames: 967914496. Throughput: 0: 5761.7. Samples: 967912732. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:47,275][25689] Avg episode reward: [(0, '0.680')] [2022-07-10 23:51:47,644][26022] Updated weights on worker 0-0, policy_version 945231 (0.00086) [2022-07-10 23:51:49,205][26022] Updated weights on worker 0-0, policy_version 945241 (0.00102) [2022-07-10 23:51:51,255][26022] Updated weights on worker 0-0, policy_version 945251 (0.00089) [2022-07-10 23:51:52,303][25689] Fps is (10 sec: 5540.9, 60 sec: 5562.0, 300 sec: 5530.0). Total num frames: 967943168. Throughput: 0: 5762.2. Samples: 967946180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:52,305][25689] Avg episode reward: [(0, '0.381')] [2022-07-10 23:51:52,928][26022] Updated weights on worker 0-0, policy_version 945261 (0.00091) [2022-07-10 23:51:54,831][26022] Updated weights on worker 0-0, policy_version 945271 (0.00083) [2022-07-10 23:51:56,553][26022] Updated weights on worker 0-0, policy_version 945281 (0.00081) [2022-07-10 23:51:57,340][25689] Fps is (10 sec: 5594.2, 60 sec: 5508.3, 300 sec: 5530.6). Total num frames: 967970816. Throughput: 0: 5764.3. Samples: 967979772. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:51:57,342][25689] Avg episode reward: [(0, '1.197')] [2022-07-10 23:51:58,488][26022] Updated weights on worker 0-0, policy_version 945291 (0.00091) [2022-07-10 23:52:00,334][26022] Updated weights on worker 0-0, policy_version 945301 (0.00082) [2022-07-10 23:52:02,406][25689] Fps is (10 sec: 5371.8, 60 sec: 5539.2, 300 sec: 5531.5). Total num frames: 967997440. Throughput: 0: 4951.3. Samples: 967996530. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:02,407][25689] Avg episode reward: [(0, '1.053')] [2022-07-10 23:52:02,560][26022] Updated weights on worker 0-0, policy_version 945311 (0.00077) [2022-07-10 23:52:04,379][26022] Updated weights on worker 0-0, policy_version 945321 (0.00095) [2022-07-10 23:52:06,280][26022] Updated weights on worker 0-0, policy_version 945331 (0.00089) [2022-07-10 23:52:07,523][25689] Fps is (10 sec: 5329.9, 60 sec: 5509.1, 300 sec: 5529.9). Total num frames: 968025088. Throughput: 0: 5687.9. Samples: 968027912. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:07,524][25689] Avg episode reward: [(0, '0.754')] [2022-07-10 23:52:08,013][26022] Updated weights on worker 0-0, policy_version 945341 (0.00083) [2022-07-10 23:52:09,968][26022] Updated weights on worker 0-0, policy_version 945351 (0.00088) [2022-07-10 23:52:11,767][26022] Updated weights on worker 0-0, policy_version 945361 (0.00085) [2022-07-10 23:52:12,545][25689] Fps is (10 sec: 5555.0, 60 sec: 5525.7, 300 sec: 5533.4). Total num frames: 968053760. Throughput: 0: 5684.0. Samples: 968061234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:12,545][25689] Avg episode reward: [(0, '0.610')] [2022-07-10 23:52:13,691][26022] Updated weights on worker 0-0, policy_version 945371 (0.00080) [2022-07-10 23:52:15,504][26022] Updated weights on worker 0-0, policy_version 945381 (0.00089) [2022-07-10 23:52:17,326][26022] Updated weights on worker 0-0, policy_version 945391 (0.00088) [2022-07-10 23:52:17,552][25689] Fps is (10 sec: 5615.7, 60 sec: 5542.3, 300 sec: 5530.7). Total num frames: 968081408. Throughput: 0: 4852.4. Samples: 968077846. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:17,553][25689] Avg episode reward: [(0, '0.416')] [2022-07-10 23:52:19,071][26022] Updated weights on worker 0-0, policy_version 945401 (0.00084) [2022-07-10 23:52:21,248][26022] Updated weights on worker 0-0, policy_version 945411 (0.00093) [2022-07-10 23:52:22,558][25689] Fps is (10 sec: 5522.1, 60 sec: 5528.3, 300 sec: 5528.1). Total num frames: 968109056. Throughput: 0: 5682.5. Samples: 968111046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:22,559][25689] Avg episode reward: [(0, '0.363')] [2022-07-10 23:52:22,862][26022] Updated weights on worker 0-0, policy_version 945421 (0.00093) [2022-07-10 23:52:24,822][26022] Updated weights on worker 0-0, policy_version 945431 (0.00115) [2022-07-10 23:52:26,546][26022] Updated weights on worker 0-0, policy_version 945441 (0.00082) [2022-07-10 23:52:27,642][25689] Fps is (10 sec: 5480.6, 60 sec: 5507.4, 300 sec: 5533.5). Total num frames: 968136704. Throughput: 0: 5796.3. Samples: 968144524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:27,642][25689] Avg episode reward: [(0, '0.249')] [2022-07-10 23:52:28,419][26022] Updated weights on worker 0-0, policy_version 945451 (0.00086) [2022-07-10 23:52:30,240][26022] Updated weights on worker 0-0, policy_version 945461 (0.00091) [2022-07-10 23:52:32,091][26022] Updated weights on worker 0-0, policy_version 945471 (0.00094) [2022-07-10 23:52:32,659][25689] Fps is (10 sec: 5576.1, 60 sec: 5525.2, 300 sec: 5530.0). Total num frames: 968165376. Throughput: 0: 4969.7. Samples: 968161194. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:32,659][25689] Avg episode reward: [(0, '-0.569')] [2022-07-10 23:52:33,989][26022] Updated weights on worker 0-0, policy_version 945481 (0.00087) [2022-07-10 23:52:35,722][26022] Updated weights on worker 0-0, policy_version 945491 (0.00087) [2022-07-10 23:52:37,558][26022] Updated weights on worker 0-0, policy_version 945501 (0.00088) [2022-07-10 23:52:37,719][25689] Fps is (10 sec: 5588.7, 60 sec: 5522.8, 300 sec: 5529.2). Total num frames: 968193024. Throughput: 0: 5787.5. Samples: 968194562. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:37,720][25689] Avg episode reward: [(0, '-0.731')] [2022-07-10 23:52:39,516][26022] Updated weights on worker 0-0, policy_version 945511 (0.00086) [2022-07-10 23:52:41,155][26022] Updated weights on worker 0-0, policy_version 945521 (0.00089) [2022-07-10 23:52:42,756][25689] Fps is (10 sec: 5374.8, 60 sec: 5490.2, 300 sec: 5526.1). Total num frames: 968219648. Throughput: 0: 5807.2. Samples: 968228338. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:42,758][25689] Avg episode reward: [(0, '-0.924')] [2022-07-10 23:52:43,273][26022] Updated weights on worker 0-0, policy_version 945531 (0.00085) [2022-07-10 23:52:44,842][26022] Updated weights on worker 0-0, policy_version 945541 (0.00414) [2022-07-10 23:52:46,971][26022] Updated weights on worker 0-0, policy_version 945551 (0.00091) [2022-07-10 23:52:47,803][25689] Fps is (10 sec: 5585.0, 60 sec: 5531.9, 300 sec: 5529.3). Total num frames: 968249344. Throughput: 0: 4965.7. Samples: 968244640. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:47,805][25689] Avg episode reward: [(0, '-0.907')] [2022-07-10 23:52:48,579][26022] Updated weights on worker 0-0, policy_version 945561 (0.00092) [2022-07-10 23:52:50,608][26022] Updated weights on worker 0-0, policy_version 945571 (0.00085) [2022-07-10 23:52:52,336][26022] Updated weights on worker 0-0, policy_version 945581 (0.00054) [2022-07-10 23:52:52,807][25689] Fps is (10 sec: 5705.4, 60 sec: 5517.4, 300 sec: 5529.4). Total num frames: 968276992. Throughput: 0: 5816.7. Samples: 968278390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:52,808][25689] Avg episode reward: [(0, '-0.534')] [2022-07-10 23:52:54,264][26022] Updated weights on worker 0-0, policy_version 945591 (0.00085) [2022-07-10 23:52:55,955][26022] Updated weights on worker 0-0, policy_version 945601 (0.00084) [2022-07-10 23:52:57,820][25689] Fps is (10 sec: 5418.0, 60 sec: 5502.7, 300 sec: 5526.5). Total num frames: 968303616. Throughput: 0: 5841.6. Samples: 968311984. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:52:57,821][25689] Avg episode reward: [(0, '0.058')] [2022-07-10 23:52:58,005][26022] Updated weights on worker 0-0, policy_version 945611 (0.00085) [2022-07-10 23:52:59,331][26022] Updated weights on worker 0-0, policy_version 945621 (0.00088) [2022-07-10 23:53:01,627][26022] Updated weights on worker 0-0, policy_version 945631 (0.00093) [2022-07-10 23:53:02,825][25689] Fps is (10 sec: 5519.3, 60 sec: 5542.1, 300 sec: 5535.1). Total num frames: 968332288. Throughput: 0: 5005.4. Samples: 968328794. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:02,827][25689] Avg episode reward: [(0, '0.137')] [2022-07-10 23:53:03,939][26022] Updated weights on worker 0-0, policy_version 945641 (0.00084) [2022-07-10 23:53:05,427][26022] Updated weights on worker 0-0, policy_version 945651 (0.00078) [2022-07-10 23:53:07,583][26022] Updated weights on worker 0-0, policy_version 945661 (0.01332) [2022-07-10 23:53:07,875][25689] Fps is (10 sec: 5601.0, 60 sec: 5548.2, 300 sec: 5534.7). Total num frames: 968359936. Throughput: 0: 5751.6. Samples: 968360088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:07,876][25689] Avg episode reward: [(0, '0.258')] [2022-07-10 23:53:09,403][26022] Updated weights on worker 0-0, policy_version 945671 (0.00094) [2022-07-10 23:53:11,050][26022] Updated weights on worker 0-0, policy_version 945681 (0.00085) [2022-07-10 23:53:12,927][25689] Fps is (10 sec: 5372.5, 60 sec: 5511.5, 300 sec: 5527.9). Total num frames: 968386560. Throughput: 0: 5727.1. Samples: 968393622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:12,928][25689] Avg episode reward: [(0, '0.346')] [2022-07-10 23:53:12,989][26022] Updated weights on worker 0-0, policy_version 945691 (0.00085) [2022-07-10 23:53:14,624][26022] Updated weights on worker 0-0, policy_version 945701 (0.00080) [2022-07-10 23:53:16,767][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:53:16,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000945711_968408064.pth [2022-07-10 23:53:16,789][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000943767_966417408.pth [2022-07-10 23:53:16,792][26022] Updated weights on worker 0-0, policy_version 945711 (0.00085) [2022-07-10 23:53:17,943][25689] Fps is (10 sec: 5594.4, 60 sec: 5544.7, 300 sec: 5538.3). Total num frames: 968416256. Throughput: 0: 4878.7. Samples: 968410156. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:17,943][25689] Avg episode reward: [(0, '0.696')] [2022-07-10 23:53:18,274][26022] Updated weights on worker 0-0, policy_version 945721 (0.00090) [2022-07-10 23:53:20,199][26022] Updated weights on worker 0-0, policy_version 945731 (0.00085) [2022-07-10 23:53:22,268][26022] Updated weights on worker 0-0, policy_version 945741 (0.00088) [2022-07-10 23:53:22,983][25689] Fps is (10 sec: 5601.0, 60 sec: 5524.7, 300 sec: 5531.6). Total num frames: 968442880. Throughput: 0: 5709.5. Samples: 968443882. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:22,984][25689] Avg episode reward: [(0, '0.294')] [2022-07-10 23:53:23,838][26022] Updated weights on worker 0-0, policy_version 945751 (0.00080) [2022-07-10 23:53:26,004][26022] Updated weights on worker 0-0, policy_version 945761 (0.00094) [2022-07-10 23:53:27,511][26022] Updated weights on worker 0-0, policy_version 945771 (0.00085) [2022-07-10 23:53:28,033][25689] Fps is (10 sec: 5378.8, 60 sec: 5527.7, 300 sec: 5525.5). Total num frames: 968470528. Throughput: 0: 5801.1. Samples: 968477020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:28,035][25689] Avg episode reward: [(0, '0.393')] [2022-07-10 23:53:29,691][26022] Updated weights on worker 0-0, policy_version 945781 (0.00093) [2022-07-10 23:53:31,351][26022] Updated weights on worker 0-0, policy_version 945791 (0.00084) [2022-07-10 23:53:33,028][26022] Updated weights on worker 0-0, policy_version 945801 (0.00085) [2022-07-10 23:53:33,055][25689] Fps is (10 sec: 5693.4, 60 sec: 5544.2, 300 sec: 5540.3). Total num frames: 968500224. Throughput: 0: 4966.2. Samples: 968493578. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:33,055][25689] Avg episode reward: [(0, '0.699')] [2022-07-10 23:53:35,079][26022] Updated weights on worker 0-0, policy_version 945811 (0.00089) [2022-07-10 23:53:36,704][26022] Updated weights on worker 0-0, policy_version 945821 (0.00087) [2022-07-10 23:53:38,079][25689] Fps is (10 sec: 5504.0, 60 sec: 5513.6, 300 sec: 5522.8). Total num frames: 968525824. Throughput: 0: 5825.9. Samples: 968527470. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:38,080][25689] Avg episode reward: [(0, '0.328')] [2022-07-10 23:53:38,619][26022] Updated weights on worker 0-0, policy_version 945831 (0.00086) [2022-07-10 23:53:40,667][26022] Updated weights on worker 0-0, policy_version 945841 (0.00107) [2022-07-10 23:53:42,482][26022] Updated weights on worker 0-0, policy_version 945851 (0.00088) [2022-07-10 23:53:43,081][25689] Fps is (10 sec: 5617.4, 60 sec: 5584.7, 300 sec: 5537.4). Total num frames: 968556544. Throughput: 0: 5838.2. Samples: 968561218. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:43,081][25689] Avg episode reward: [(0, '0.289')] [2022-07-10 23:53:44,258][26022] Updated weights on worker 0-0, policy_version 945861 (0.00092) [2022-07-10 23:53:46,194][26022] Updated weights on worker 0-0, policy_version 945871 (0.00087) [2022-07-10 23:53:47,745][26022] Updated weights on worker 0-0, policy_version 945881 (0.00087) [2022-07-10 23:53:48,165][25689] Fps is (10 sec: 5787.3, 60 sec: 5547.4, 300 sec: 5539.4). Total num frames: 968584192. Throughput: 0: 4992.6. Samples: 968577532. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:48,165][25689] Avg episode reward: [(0, '-0.288')] [2022-07-10 23:53:49,751][26022] Updated weights on worker 0-0, policy_version 945891 (0.00081) [2022-07-10 23:53:51,440][26022] Updated weights on worker 0-0, policy_version 945901 (0.00083) [2022-07-10 23:53:53,177][25689] Fps is (10 sec: 5172.6, 60 sec: 5495.7, 300 sec: 5518.6). Total num frames: 968608768. Throughput: 0: 5816.9. Samples: 968610628. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:53,178][25689] Avg episode reward: [(0, '-0.141')] [2022-07-10 23:53:53,412][26022] Updated weights on worker 0-0, policy_version 945911 (0.00090) [2022-07-10 23:53:55,151][26022] Updated weights on worker 0-0, policy_version 945921 (0.00086) [2022-07-10 23:53:57,033][26022] Updated weights on worker 0-0, policy_version 945931 (0.00049) [2022-07-10 23:53:58,183][25689] Fps is (10 sec: 5621.4, 60 sec: 5581.1, 300 sec: 5539.8). Total num frames: 968640512. Throughput: 0: 5828.4. Samples: 968644646. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:53:58,184][25689] Avg episode reward: [(0, '-0.637')] [2022-07-10 23:53:58,851][26022] Updated weights on worker 0-0, policy_version 945941 (0.00092) [2022-07-10 23:54:00,667][26022] Updated weights on worker 0-0, policy_version 945951 (0.00097) [2022-07-10 23:54:02,789][26022] Updated weights on worker 0-0, policy_version 945961 (0.00079) [2022-07-10 23:54:03,187][25689] Fps is (10 sec: 5728.9, 60 sec: 5530.4, 300 sec: 5537.3). Total num frames: 968666112. Throughput: 0: 4987.8. Samples: 968661502. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:03,187][25689] Avg episode reward: [(0, '-0.756')] [2022-07-10 23:54:04,843][26022] Updated weights on worker 0-0, policy_version 945971 (0.00087) [2022-07-10 23:54:06,508][26022] Updated weights on worker 0-0, policy_version 945981 (0.00096) [2022-07-10 23:54:08,232][25689] Fps is (10 sec: 5197.3, 60 sec: 5513.9, 300 sec: 5530.5). Total num frames: 968692736. Throughput: 0: 5758.5. Samples: 968693090. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:08,232][25689] Avg episode reward: [(0, '-0.825')] [2022-07-10 23:54:08,416][26022] Updated weights on worker 0-0, policy_version 945991 (0.00091) [2022-07-10 23:54:09,987][26022] Updated weights on worker 0-0, policy_version 946001 (0.00089) [2022-07-10 23:54:11,936][26022] Updated weights on worker 0-0, policy_version 946011 (0.00086) [2022-07-10 23:54:13,233][25689] Fps is (10 sec: 5606.1, 60 sec: 5569.5, 300 sec: 5534.2). Total num frames: 968722432. Throughput: 0: 5808.3. Samples: 968727118. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:13,233][25689] Avg episode reward: [(0, '-0.931')] [2022-07-10 23:54:13,951][26022] Updated weights on worker 0-0, policy_version 946021 (0.00086) [2022-07-10 23:54:15,592][26022] Updated weights on worker 0-0, policy_version 946031 (0.00084) [2022-07-10 23:54:17,589][26022] Updated weights on worker 0-0, policy_version 946041 (0.00094) [2022-07-10 23:54:18,244][25689] Fps is (10 sec: 5727.2, 60 sec: 5535.9, 300 sec: 5538.1). Total num frames: 968750080. Throughput: 0: 4938.1. Samples: 968743710. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:18,245][25689] Avg episode reward: [(0, '-0.648')] [2022-07-10 23:54:19,255][26022] Updated weights on worker 0-0, policy_version 946051 (0.00087) [2022-07-10 23:54:21,064][26022] Updated weights on worker 0-0, policy_version 946061 (0.00103) [2022-07-10 23:54:23,006][26022] Updated weights on worker 0-0, policy_version 946071 (0.00084) [2022-07-10 23:54:23,248][25689] Fps is (10 sec: 5521.2, 60 sec: 5556.2, 300 sec: 5532.4). Total num frames: 968777728. Throughput: 0: 5795.5. Samples: 968777768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:23,249][25689] Avg episode reward: [(0, '-0.691')] [2022-07-10 23:54:24,609][26022] Updated weights on worker 0-0, policy_version 946081 (0.00614) [2022-07-10 23:54:26,598][26022] Updated weights on worker 0-0, policy_version 946091 (0.00087) [2022-07-10 23:54:28,342][25689] Fps is (10 sec: 5577.5, 60 sec: 5569.1, 300 sec: 5538.1). Total num frames: 968806400. Throughput: 0: 5889.9. Samples: 968811538. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:28,343][25689] Avg episode reward: [(0, '-0.496')] [2022-07-10 23:54:28,570][26022] Updated weights on worker 0-0, policy_version 946101 (0.00092) [2022-07-10 23:54:29,980][26022] Updated weights on worker 0-0, policy_version 946111 (0.00090) [2022-07-10 23:54:32,134][26022] Updated weights on worker 0-0, policy_version 946121 (0.00082) [2022-07-10 23:54:33,355][25689] Fps is (10 sec: 5775.3, 60 sec: 5570.0, 300 sec: 5538.1). Total num frames: 968836096. Throughput: 0: 5028.0. Samples: 968828290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:33,355][25689] Avg episode reward: [(0, '-0.424')] [2022-07-10 23:54:33,769][26022] Updated weights on worker 0-0, policy_version 946131 (0.00092) [2022-07-10 23:54:35,643][26022] Updated weights on worker 0-0, policy_version 946141 (0.00088) [2022-07-10 23:54:37,292][26022] Updated weights on worker 0-0, policy_version 946151 (0.00608) [2022-07-10 23:54:38,395][25689] Fps is (10 sec: 5704.1, 60 sec: 5602.5, 300 sec: 5540.9). Total num frames: 968863744. Throughput: 0: 5900.6. Samples: 968862612. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:38,397][25689] Avg episode reward: [(0, '-0.503')] [2022-07-10 23:54:39,239][26022] Updated weights on worker 0-0, policy_version 946161 (0.00085) [2022-07-10 23:54:41,034][26022] Updated weights on worker 0-0, policy_version 946171 (0.00084) [2022-07-10 23:54:42,718][26022] Updated weights on worker 0-0, policy_version 946181 (0.00082) [2022-07-10 23:54:43,431][25689] Fps is (10 sec: 5487.8, 60 sec: 5548.4, 300 sec: 5539.0). Total num frames: 968891392. Throughput: 0: 5906.8. Samples: 968896982. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-10 23:54:43,431][25689] Avg episode reward: [(0, '-0.222')] [2022-07-10 23:54:44,522][26022] Updated weights on worker 0-0, policy_version 946191 (0.00085) [2022-07-10 23:54:46,511][26022] Updated weights on worker 0-0, policy_version 946201 (0.00086) [2022-07-10 23:54:48,340][26022] Updated weights on worker 0-0, policy_version 946211 (0.00084) [2022-07-10 23:54:48,509][25689] Fps is (10 sec: 5568.7, 60 sec: 5565.9, 300 sec: 5538.4). Total num frames: 968920064. Throughput: 0: 5907.2. Samples: 968930668. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:54:48,509][25689] Avg episode reward: [(0, '-0.204')] [2022-07-10 23:54:50,100][26022] Updated weights on worker 0-0, policy_version 946221 (0.00088) [2022-07-10 23:54:51,881][26022] Updated weights on worker 0-0, policy_version 946231 (0.00086) [2022-07-10 23:54:53,538][25689] Fps is (10 sec: 5775.0, 60 sec: 5649.2, 300 sec: 5545.3). Total num frames: 968949760. Throughput: 0: 5913.2. Samples: 968947636. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:54:53,538][25689] Avg episode reward: [(0, '-0.321')] [2022-07-10 23:54:53,900][26022] Updated weights on worker 0-0, policy_version 946241 (0.00088) [2022-07-10 23:54:55,394][26022] Updated weights on worker 0-0, policy_version 946251 (0.00092) [2022-07-10 23:54:57,460][26022] Updated weights on worker 0-0, policy_version 946261 (0.00087) [2022-07-10 23:54:58,546][25689] Fps is (10 sec: 5815.2, 60 sec: 5598.2, 300 sec: 5548.7). Total num frames: 968978432. Throughput: 0: 5903.6. Samples: 968981574. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:54:58,547][25689] Avg episode reward: [(0, '0.300')] [2022-07-10 23:54:58,966][26022] Updated weights on worker 0-0, policy_version 946271 (0.00082) [2022-07-10 23:55:01,060][26022] Updated weights on worker 0-0, policy_version 946281 (0.00094) [2022-07-10 23:55:03,100][26022] Updated weights on worker 0-0, policy_version 946291 (0.00085) [2022-07-10 23:55:03,631][25689] Fps is (10 sec: 5377.0, 60 sec: 5590.6, 300 sec: 5545.4). Total num frames: 969004032. Throughput: 0: 5770.4. Samples: 969013546. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:03,632][25689] Avg episode reward: [(0, '0.590')] [2022-07-10 23:55:04,990][26022] Updated weights on worker 0-0, policy_version 946301 (0.00093) [2022-07-10 23:55:06,622][26022] Updated weights on worker 0-0, policy_version 946311 (0.00093) [2022-07-10 23:55:08,594][26022] Updated weights on worker 0-0, policy_version 946321 (0.00085) [2022-07-10 23:55:08,693][25689] Fps is (10 sec: 5348.6, 60 sec: 5622.9, 300 sec: 5551.4). Total num frames: 969032704. Throughput: 0: 4943.8. Samples: 969030454. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:08,694][25689] Avg episode reward: [(0, '0.427')] [2022-07-10 23:55:10,236][26022] Updated weights on worker 0-0, policy_version 946331 (0.00086) [2022-07-10 23:55:12,054][26022] Updated weights on worker 0-0, policy_version 946341 (0.00093) [2022-07-10 23:55:13,781][25689] Fps is (10 sec: 5750.7, 60 sec: 5614.9, 300 sec: 5549.9). Total num frames: 969062400. Throughput: 0: 5778.0. Samples: 969064600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:13,781][25689] Avg episode reward: [(0, '0.919')] [2022-07-10 23:55:13,993][26022] Updated weights on worker 0-0, policy_version 946351 (0.00088) [2022-07-10 23:55:15,708][26022] Updated weights on worker 0-0, policy_version 946361 (0.00094) [2022-07-10 23:55:16,900][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:55:16,922][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000946366_969078784.pth [2022-07-10 23:55:16,922][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000944415_967080960.pth [2022-07-10 23:55:17,566][26022] Updated weights on worker 0-0, policy_version 946371 (0.00083) [2022-07-10 23:55:18,785][25689] Fps is (10 sec: 5682.0, 60 sec: 5615.5, 300 sec: 5550.0). Total num frames: 969090048. Throughput: 0: 5772.9. Samples: 969098412. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:18,786][25689] Avg episode reward: [(0, '0.700')] [2022-07-10 23:55:19,419][26022] Updated weights on worker 0-0, policy_version 946381 (0.00081) [2022-07-10 23:55:21,154][26022] Updated weights on worker 0-0, policy_version 946391 (0.00089) [2022-07-10 23:55:23,192][26022] Updated weights on worker 0-0, policy_version 946401 (0.00083) [2022-07-10 23:55:23,816][25689] Fps is (10 sec: 5612.0, 60 sec: 5629.9, 300 sec: 5553.9). Total num frames: 969118720. Throughput: 0: 5047.6. Samples: 969115434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:23,817][25689] Avg episode reward: [(0, '1.109')] [2022-07-10 23:55:24,702][26022] Updated weights on worker 0-0, policy_version 946411 (0.00088) [2022-07-10 23:55:26,751][26022] Updated weights on worker 0-0, policy_version 946421 (0.00084) [2022-07-10 23:55:28,612][26022] Updated weights on worker 0-0, policy_version 946431 (0.00083) [2022-07-10 23:55:28,886][25689] Fps is (10 sec: 5576.1, 60 sec: 5615.3, 300 sec: 5549.3). Total num frames: 969146368. Throughput: 0: 5895.6. Samples: 969149500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:28,886][25689] Avg episode reward: [(0, '0.944')] [2022-07-10 23:55:30,193][26022] Updated weights on worker 0-0, policy_version 946441 (0.00086) [2022-07-10 23:55:32,266][26022] Updated weights on worker 0-0, policy_version 946451 (0.00092) [2022-07-10 23:55:33,738][26022] Updated weights on worker 0-0, policy_version 946461 (0.00078) [2022-07-10 23:55:33,929][25689] Fps is (10 sec: 5670.7, 60 sec: 5612.4, 300 sec: 5558.9). Total num frames: 969176064. Throughput: 0: 5909.1. Samples: 969183656. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:33,929][25689] Avg episode reward: [(0, '1.008')] [2022-07-10 23:55:35,716][26022] Updated weights on worker 0-0, policy_version 946471 (0.00090) [2022-07-10 23:55:37,466][26022] Updated weights on worker 0-0, policy_version 946481 (0.00086) [2022-07-10 23:55:38,959][25689] Fps is (10 sec: 5692.7, 60 sec: 5613.4, 300 sec: 5555.1). Total num frames: 969203712. Throughput: 0: 5064.3. Samples: 969200576. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:38,959][25689] Avg episode reward: [(0, '0.657')] [2022-07-10 23:55:39,294][26022] Updated weights on worker 0-0, policy_version 946491 (0.00082) [2022-07-10 23:55:41,160][26022] Updated weights on worker 0-0, policy_version 946501 (0.00087) [2022-07-10 23:55:42,931][26022] Updated weights on worker 0-0, policy_version 946511 (0.00083) [2022-07-10 23:55:43,990][25689] Fps is (10 sec: 5495.7, 60 sec: 5613.8, 300 sec: 5552.2). Total num frames: 969231360. Throughput: 0: 5894.2. Samples: 969234344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:43,991][25689] Avg episode reward: [(0, '1.176')] [2022-07-10 23:55:44,746][26022] Updated weights on worker 0-0, policy_version 946521 (0.00090) [2022-07-10 23:55:46,724][26022] Updated weights on worker 0-0, policy_version 946531 (0.00090) [2022-07-10 23:55:48,262][26022] Updated weights on worker 0-0, policy_version 946541 (0.00088) [2022-07-10 23:55:49,106][25689] Fps is (10 sec: 5651.4, 60 sec: 5627.2, 300 sec: 5564.7). Total num frames: 969261056. Throughput: 0: 5850.1. Samples: 969267790. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:49,106][25689] Avg episode reward: [(0, '0.070')] [2022-07-10 23:55:50,415][26022] Updated weights on worker 0-0, policy_version 946551 (0.00083) [2022-07-10 23:55:52,015][26022] Updated weights on worker 0-0, policy_version 946561 (0.00088) [2022-07-10 23:55:54,058][26022] Updated weights on worker 0-0, policy_version 946571 (0.00090) [2022-07-10 23:55:54,119][25689] Fps is (10 sec: 5863.8, 60 sec: 5628.7, 300 sec: 5561.1). Total num frames: 969290752. Throughput: 0: 5015.1. Samples: 969284912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:54,120][25689] Avg episode reward: [(0, '-0.033')] [2022-07-10 23:55:55,705][26022] Updated weights on worker 0-0, policy_version 946581 (0.00087) [2022-07-10 23:55:57,539][26022] Updated weights on worker 0-0, policy_version 946591 (0.00089) [2022-07-10 23:55:59,147][25689] Fps is (10 sec: 5710.6, 60 sec: 5609.9, 300 sec: 5571.5). Total num frames: 969318400. Throughput: 0: 5880.9. Samples: 969319304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:55:59,148][25689] Avg episode reward: [(0, '0.012')] [2022-07-10 23:55:59,175][26022] Updated weights on worker 0-0, policy_version 946601 (0.00093) [2022-07-10 23:56:01,230][26022] Updated weights on worker 0-0, policy_version 946611 (0.00088) [2022-07-10 23:56:03,126][26022] Updated weights on worker 0-0, policy_version 946621 (0.00081) [2022-07-10 23:56:04,155][25689] Fps is (10 sec: 5305.5, 60 sec: 5617.0, 300 sec: 5560.5). Total num frames: 969344000. Throughput: 0: 5790.4. Samples: 969351108. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:04,156][25689] Avg episode reward: [(0, '-0.681')] [2022-07-10 23:56:05,238][26022] Updated weights on worker 0-0, policy_version 946631 (0.00086) [2022-07-10 23:56:06,841][26022] Updated weights on worker 0-0, policy_version 946641 (0.00082) [2022-07-10 23:56:08,695][26022] Updated weights on worker 0-0, policy_version 946651 (0.00091) [2022-07-10 23:56:09,222][25689] Fps is (10 sec: 5387.1, 60 sec: 5616.6, 300 sec: 5563.1). Total num frames: 969372672. Throughput: 0: 4987.8. Samples: 969368126. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:09,223][25689] Avg episode reward: [(0, '-0.877')] [2022-07-10 23:56:10,544][26022] Updated weights on worker 0-0, policy_version 946661 (0.00086) [2022-07-10 23:56:12,416][26022] Updated weights on worker 0-0, policy_version 946671 (0.00087) [2022-07-10 23:56:14,066][26022] Updated weights on worker 0-0, policy_version 946681 (0.00085) [2022-07-10 23:56:14,249][25689] Fps is (10 sec: 5782.6, 60 sec: 5622.3, 300 sec: 5573.0). Total num frames: 969402368. Throughput: 0: 5824.5. Samples: 969402160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:14,250][25689] Avg episode reward: [(0, '-0.932')] [2022-07-10 23:56:16,004][26022] Updated weights on worker 0-0, policy_version 946691 (0.00088) [2022-07-10 23:56:17,615][26022] Updated weights on worker 0-0, policy_version 946701 (0.00082) [2022-07-10 23:56:19,268][25689] Fps is (10 sec: 5606.2, 60 sec: 5604.0, 300 sec: 5566.4). Total num frames: 969428992. Throughput: 0: 5796.0. Samples: 969435922. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:19,268][25689] Avg episode reward: [(0, '-0.312')] [2022-07-10 23:56:19,939][26022] Updated weights on worker 0-0, policy_version 946711 (0.00093) [2022-07-10 23:56:21,224][26022] Updated weights on worker 0-0, policy_version 946721 (0.00084) [2022-07-10 23:56:23,398][26022] Updated weights on worker 0-0, policy_version 946731 (0.00089) [2022-07-10 23:56:24,302][25689] Fps is (10 sec: 5602.3, 60 sec: 5620.6, 300 sec: 5570.0). Total num frames: 969458688. Throughput: 0: 5047.5. Samples: 969452798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:24,303][25689] Avg episode reward: [(0, '-0.551')] [2022-07-10 23:56:25,130][26022] Updated weights on worker 0-0, policy_version 946741 (0.00098) [2022-07-10 23:56:27,130][26022] Updated weights on worker 0-0, policy_version 946751 (0.00090) [2022-07-10 23:56:28,900][26022] Updated weights on worker 0-0, policy_version 946761 (0.00092) [2022-07-10 23:56:29,437][25689] Fps is (10 sec: 5538.3, 60 sec: 5597.6, 300 sec: 5564.5). Total num frames: 969485312. Throughput: 0: 5847.0. Samples: 969486324. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:29,438][25689] Avg episode reward: [(0, '-0.538')] [2022-07-10 23:56:30,580][26022] Updated weights on worker 0-0, policy_version 946771 (0.00094) [2022-07-10 23:56:32,527][26022] Updated weights on worker 0-0, policy_version 946781 (0.00083) [2022-07-10 23:56:34,193][26022] Updated weights on worker 0-0, policy_version 946791 (0.00087) [2022-07-10 23:56:34,439][25689] Fps is (10 sec: 5455.0, 60 sec: 5584.5, 300 sec: 5568.5). Total num frames: 969513984. Throughput: 0: 5844.7. Samples: 969520164. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:34,440][25689] Avg episode reward: [(0, '0.024')] [2022-07-10 23:56:36,261][26022] Updated weights on worker 0-0, policy_version 946801 (0.00100) [2022-07-10 23:56:37,939][26022] Updated weights on worker 0-0, policy_version 946811 (0.00087) [2022-07-10 23:56:39,492][25689] Fps is (10 sec: 5703.0, 60 sec: 5599.3, 300 sec: 5568.5). Total num frames: 969542656. Throughput: 0: 4993.3. Samples: 969536908. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:39,493][25689] Avg episode reward: [(0, '-0.429')] [2022-07-10 23:56:39,831][26022] Updated weights on worker 0-0, policy_version 946821 (0.00087) [2022-07-10 23:56:41,572][26022] Updated weights on worker 0-0, policy_version 946831 (0.00091) [2022-07-10 23:56:43,584][26022] Updated weights on worker 0-0, policy_version 946841 (0.00089) [2022-07-10 23:56:44,516][25689] Fps is (10 sec: 5792.5, 60 sec: 5633.9, 300 sec: 5577.5). Total num frames: 969572352. Throughput: 0: 5835.3. Samples: 969570748. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:44,516][25689] Avg episode reward: [(0, '-0.091')] [2022-07-10 23:56:45,227][26022] Updated weights on worker 0-0, policy_version 946851 (0.00090) [2022-07-10 23:56:47,246][26022] Updated weights on worker 0-0, policy_version 946861 (0.00100) [2022-07-10 23:56:48,833][26022] Updated weights on worker 0-0, policy_version 946871 (0.00090) [2022-07-10 23:56:49,581][25689] Fps is (10 sec: 5582.5, 60 sec: 5587.7, 300 sec: 5569.9). Total num frames: 969598976. Throughput: 0: 5840.5. Samples: 969603974. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:49,581][25689] Avg episode reward: [(0, '0.264')] [2022-07-10 23:56:50,907][26022] Updated weights on worker 0-0, policy_version 946881 (0.00086) [2022-07-10 23:56:52,740][26022] Updated weights on worker 0-0, policy_version 946891 (0.00090) [2022-07-10 23:56:54,278][26022] Updated weights on worker 0-0, policy_version 946901 (0.00095) [2022-07-10 23:56:54,614][25689] Fps is (10 sec: 5475.4, 60 sec: 5569.0, 300 sec: 5573.4). Total num frames: 969627648. Throughput: 0: 4995.2. Samples: 969620944. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:54,615][25689] Avg episode reward: [(0, '0.308')] [2022-07-10 23:56:56,265][26022] Updated weights on worker 0-0, policy_version 946911 (0.00085) [2022-07-10 23:56:58,078][26022] Updated weights on worker 0-0, policy_version 946921 (0.00084) [2022-07-10 23:56:59,699][25689] Fps is (10 sec: 5667.5, 60 sec: 5580.7, 300 sec: 5580.0). Total num frames: 969656320. Throughput: 0: 5836.7. Samples: 969654848. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:56:59,699][25689] Avg episode reward: [(0, '0.308')] [2022-07-10 23:56:59,913][26022] Updated weights on worker 0-0, policy_version 946931 (0.00086) [2022-07-10 23:57:02,110][26022] Updated weights on worker 0-0, policy_version 946941 (0.00097) [2022-07-10 23:57:03,747][26022] Updated weights on worker 0-0, policy_version 946951 (0.00080) [2022-07-10 23:57:04,764][25689] Fps is (10 sec: 5448.1, 60 sec: 5592.3, 300 sec: 5577.5). Total num frames: 969682944. Throughput: 0: 5716.9. Samples: 969686508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:04,765][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 23:57:05,811][26022] Updated weights on worker 0-0, policy_version 946961 (0.00095) [2022-07-10 23:57:07,367][26022] Updated weights on worker 0-0, policy_version 946971 (0.00088) [2022-07-10 23:57:09,359][26022] Updated weights on worker 0-0, policy_version 946981 (0.00078) [2022-07-10 23:57:09,840][25689] Fps is (10 sec: 5351.8, 60 sec: 5574.6, 300 sec: 5573.0). Total num frames: 969710592. Throughput: 0: 5741.1. Samples: 969720282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:09,840][25689] Avg episode reward: [(0, '0.144')] [2022-07-10 23:57:11,214][26022] Updated weights on worker 0-0, policy_version 946991 (0.00092) [2022-07-10 23:57:13,138][26022] Updated weights on worker 0-0, policy_version 947001 (0.00090) [2022-07-10 23:57:14,899][25689] Fps is (10 sec: 5557.1, 60 sec: 5554.8, 300 sec: 5575.5). Total num frames: 969739264. Throughput: 0: 5726.2. Samples: 969737096. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:14,899][25689] Avg episode reward: [(0, '0.005')] [2022-07-10 23:57:14,906][26022] Updated weights on worker 0-0, policy_version 947011 (0.00094) [2022-07-10 23:57:16,693][26022] Updated weights on worker 0-0, policy_version 947021 (0.00092) [2022-07-10 23:57:17,091][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:57:17,102][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000947023_969751552.pth [2022-07-10 23:57:17,102][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000945064_967745536.pth [2022-07-10 23:57:18,471][26022] Updated weights on worker 0-0, policy_version 947031 (0.00086) [2022-07-10 23:57:19,922][25689] Fps is (10 sec: 5586.3, 60 sec: 5571.3, 300 sec: 5575.2). Total num frames: 969766912. Throughput: 0: 5696.7. Samples: 969770050. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:19,922][25689] Avg episode reward: [(0, '-0.210')] [2022-07-10 23:57:20,472][26022] Updated weights on worker 0-0, policy_version 947041 (0.00087) [2022-07-10 23:57:22,104][26022] Updated weights on worker 0-0, policy_version 947051 (0.00085) [2022-07-10 23:57:24,249][26022] Updated weights on worker 0-0, policy_version 947061 (0.00085) [2022-07-10 23:57:24,991][25689] Fps is (10 sec: 5479.3, 60 sec: 5534.4, 300 sec: 5575.5). Total num frames: 969794560. Throughput: 0: 5779.5. Samples: 969803408. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:24,991][25689] Avg episode reward: [(0, '-0.227')] [2022-07-10 23:57:25,941][26022] Updated weights on worker 0-0, policy_version 947071 (0.00081) [2022-07-10 23:57:28,116][26022] Updated weights on worker 0-0, policy_version 947081 (0.00091) [2022-07-10 23:57:29,626][26022] Updated weights on worker 0-0, policy_version 947091 (0.00088) [2022-07-10 23:57:30,063][25689] Fps is (10 sec: 5553.6, 60 sec: 5573.9, 300 sec: 5574.4). Total num frames: 969823232. Throughput: 0: 4919.9. Samples: 969819778. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:30,063][25689] Avg episode reward: [(0, '-0.476')] [2022-07-10 23:57:31,642][26022] Updated weights on worker 0-0, policy_version 947101 (0.00087) [2022-07-10 23:57:33,436][26022] Updated weights on worker 0-0, policy_version 947111 (0.00094) [2022-07-10 23:57:35,071][25689] Fps is (10 sec: 5485.6, 60 sec: 5539.5, 300 sec: 5572.0). Total num frames: 969849856. Throughput: 0: 5762.8. Samples: 969853344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:35,071][25689] Avg episode reward: [(0, '0.568')] [2022-07-10 23:57:35,233][26022] Updated weights on worker 0-0, policy_version 947121 (0.00081) [2022-07-10 23:57:36,885][26022] Updated weights on worker 0-0, policy_version 947131 (0.00085) [2022-07-10 23:57:38,611][26022] Updated weights on worker 0-0, policy_version 947141 (0.00084) [2022-07-10 23:57:40,091][25689] Fps is (10 sec: 5615.9, 60 sec: 5559.4, 300 sec: 5582.6). Total num frames: 969879552. Throughput: 0: 5803.0. Samples: 969887096. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:40,094][25689] Avg episode reward: [(0, '0.412')] [2022-07-10 23:57:40,849][26022] Updated weights on worker 0-0, policy_version 947151 (0.00090) [2022-07-10 23:57:42,366][26022] Updated weights on worker 0-0, policy_version 947161 (0.00090) [2022-07-10 23:57:44,432][26022] Updated weights on worker 0-0, policy_version 947171 (0.00086) [2022-07-10 23:57:45,174][25689] Fps is (10 sec: 5675.7, 60 sec: 5520.2, 300 sec: 5575.0). Total num frames: 969907200. Throughput: 0: 4980.3. Samples: 969903928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:45,175][25689] Avg episode reward: [(0, '0.321')] [2022-07-10 23:57:45,993][26022] Updated weights on worker 0-0, policy_version 947181 (0.00095) [2022-07-10 23:57:47,997][26022] Updated weights on worker 0-0, policy_version 947191 (0.00089) [2022-07-10 23:57:50,049][26022] Updated weights on worker 0-0, policy_version 947201 (0.00087) [2022-07-10 23:57:50,231][25689] Fps is (10 sec: 5453.4, 60 sec: 5537.8, 300 sec: 5574.0). Total num frames: 969934848. Throughput: 0: 5810.0. Samples: 969936956. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:50,232][25689] Avg episode reward: [(0, '0.625')] [2022-07-10 23:57:51,783][26022] Updated weights on worker 0-0, policy_version 947211 (0.00085) [2022-07-10 23:57:53,685][26022] Updated weights on worker 0-0, policy_version 947221 (0.00086) [2022-07-10 23:57:55,246][25689] Fps is (10 sec: 5591.8, 60 sec: 5539.5, 300 sec: 5580.9). Total num frames: 969963520. Throughput: 0: 5815.3. Samples: 969970670. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:57:55,248][25689] Avg episode reward: [(0, '0.404')] [2022-07-10 23:57:55,357][26022] Updated weights on worker 0-0, policy_version 947231 (0.00096) [2022-07-10 23:57:57,367][26022] Updated weights on worker 0-0, policy_version 947241 (0.00096) [2022-07-10 23:57:59,293][26022] Updated weights on worker 0-0, policy_version 947251 (0.00081) [2022-07-10 23:58:00,281][25689] Fps is (10 sec: 5502.4, 60 sec: 5510.3, 300 sec: 5573.5). Total num frames: 969990144. Throughput: 0: 4962.8. Samples: 969987292. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:00,295][25689] Avg episode reward: [(0, '0.577')] [2022-07-10 23:58:00,975][26022] Updated weights on worker 0-0, policy_version 947261 (0.00081) [2022-07-10 23:58:03,221][26022] Updated weights on worker 0-0, policy_version 947271 (0.00083) [2022-07-10 23:58:05,173][26022] Updated weights on worker 0-0, policy_version 947281 (0.00091) [2022-07-10 23:58:05,307][25689] Fps is (10 sec: 5190.9, 60 sec: 5497.0, 300 sec: 5567.0). Total num frames: 970015744. Throughput: 0: 5685.2. Samples: 970018386. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:05,308][25689] Avg episode reward: [(0, '0.591')] [2022-07-10 23:58:06,982][26022] Updated weights on worker 0-0, policy_version 947291 (0.00053) [2022-07-10 23:58:08,981][26022] Updated weights on worker 0-0, policy_version 947301 (0.00331) [2022-07-10 23:58:10,371][25689] Fps is (10 sec: 5378.7, 60 sec: 5514.9, 300 sec: 5573.7). Total num frames: 970044416. Throughput: 0: 5702.1. Samples: 970051794. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:10,373][25689] Avg episode reward: [(0, '0.028')] [2022-07-10 23:58:10,505][26022] Updated weights on worker 0-0, policy_version 947311 (0.00083) [2022-07-10 23:58:12,551][26022] Updated weights on worker 0-0, policy_version 947321 (0.00089) [2022-07-10 23:58:14,254][26022] Updated weights on worker 0-0, policy_version 947331 (0.00085) [2022-07-10 23:58:15,446][25689] Fps is (10 sec: 5756.4, 60 sec: 5530.3, 300 sec: 5572.6). Total num frames: 970074112. Throughput: 0: 4850.5. Samples: 970068652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:15,447][25689] Avg episode reward: [(0, '-0.013')] [2022-07-10 23:58:16,065][26022] Updated weights on worker 0-0, policy_version 947341 (0.00082) [2022-07-10 23:58:18,067][26022] Updated weights on worker 0-0, policy_version 947351 (0.00060) [2022-07-10 23:58:19,731][26022] Updated weights on worker 0-0, policy_version 947361 (0.00089) [2022-07-10 23:58:20,541][25689] Fps is (10 sec: 5537.6, 60 sec: 5506.9, 300 sec: 5571.5). Total num frames: 970100736. Throughput: 0: 5671.0. Samples: 970102190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:20,543][25689] Avg episode reward: [(0, '0.030')] [2022-07-10 23:58:21,691][26022] Updated weights on worker 0-0, policy_version 947371 (0.00086) [2022-07-10 23:58:23,447][26022] Updated weights on worker 0-0, policy_version 947381 (0.00091) [2022-07-10 23:58:25,231][26022] Updated weights on worker 0-0, policy_version 947391 (0.00089) [2022-07-10 23:58:25,548][25689] Fps is (10 sec: 5575.3, 60 sec: 5546.3, 300 sec: 5579.2). Total num frames: 970130432. Throughput: 0: 5794.5. Samples: 970135674. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:25,549][25689] Avg episode reward: [(0, '-0.067')] [2022-07-10 23:58:27,099][26022] Updated weights on worker 0-0, policy_version 947401 (0.00085) [2022-07-10 23:58:28,944][26022] Updated weights on worker 0-0, policy_version 947411 (0.00097) [2022-07-10 23:58:30,682][25689] Fps is (10 sec: 5654.9, 60 sec: 5523.8, 300 sec: 5570.2). Total num frames: 970158080. Throughput: 0: 4934.9. Samples: 970152028. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:30,684][25689] Avg episode reward: [(0, '0.205')] [2022-07-10 23:58:30,749][26022] Updated weights on worker 0-0, policy_version 947421 (0.00080) [2022-07-10 23:58:32,626][26022] Updated weights on worker 0-0, policy_version 947431 (0.00057) [2022-07-10 23:58:34,598][26022] Updated weights on worker 0-0, policy_version 947441 (0.00091) [2022-07-10 23:58:35,699][25689] Fps is (10 sec: 5447.4, 60 sec: 5539.9, 300 sec: 5577.3). Total num frames: 970185728. Throughput: 0: 5781.7. Samples: 970185746. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:35,699][25689] Avg episode reward: [(0, '0.504')] [2022-07-10 23:58:36,308][26022] Updated weights on worker 0-0, policy_version 947451 (0.00093) [2022-07-10 23:58:38,319][26022] Updated weights on worker 0-0, policy_version 947461 (0.00090) [2022-07-10 23:58:40,015][26022] Updated weights on worker 0-0, policy_version 947471 (0.00087) [2022-07-10 23:58:40,704][25689] Fps is (10 sec: 5619.4, 60 sec: 5524.4, 300 sec: 5570.3). Total num frames: 970214400. Throughput: 0: 5804.3. Samples: 970219222. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-10 23:58:40,705][25689] Avg episode reward: [(0, '1.079')] [2022-07-10 23:58:41,902][26022] Updated weights on worker 0-0, policy_version 947481 (0.00087) [2022-07-10 23:58:43,707][26022] Updated weights on worker 0-0, policy_version 947491 (0.00091) [2022-07-10 23:58:45,536][26022] Updated weights on worker 0-0, policy_version 947501 (0.00090) [2022-07-10 23:58:45,731][25689] Fps is (10 sec: 5511.7, 60 sec: 5512.5, 300 sec: 5567.9). Total num frames: 970241024. Throughput: 0: 4979.0. Samples: 970236166. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:58:45,732][25689] Avg episode reward: [(0, '0.999')] [2022-07-10 23:58:47,354][26022] Updated weights on worker 0-0, policy_version 947511 (0.00083) [2022-07-10 23:58:49,167][26022] Updated weights on worker 0-0, policy_version 947521 (0.00086) [2022-07-10 23:58:50,781][25689] Fps is (10 sec: 5487.6, 60 sec: 5530.1, 300 sec: 5581.0). Total num frames: 970269696. Throughput: 0: 5852.6. Samples: 970269660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:58:50,781][25689] Avg episode reward: [(0, '1.164')] [2022-07-10 23:58:51,111][26022] Updated weights on worker 0-0, policy_version 947531 (0.00084) [2022-07-10 23:58:52,779][26022] Updated weights on worker 0-0, policy_version 947541 (0.00085) [2022-07-10 23:58:54,884][26022] Updated weights on worker 0-0, policy_version 947551 (0.00655) [2022-07-10 23:58:55,786][25689] Fps is (10 sec: 5601.5, 60 sec: 5514.1, 300 sec: 5567.3). Total num frames: 970297344. Throughput: 0: 5834.5. Samples: 970302944. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:58:55,787][25689] Avg episode reward: [(0, '1.038')] [2022-07-10 23:58:56,518][26022] Updated weights on worker 0-0, policy_version 947561 (0.00089) [2022-07-10 23:58:58,463][26022] Updated weights on worker 0-0, policy_version 947571 (0.00090) [2022-07-10 23:59:00,106][26022] Updated weights on worker 0-0, policy_version 947581 (0.00090) [2022-07-10 23:59:00,789][25689] Fps is (10 sec: 5627.5, 60 sec: 5550.9, 300 sec: 5577.6). Total num frames: 970326016. Throughput: 0: 5009.1. Samples: 970319828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:00,789][25689] Avg episode reward: [(0, '0.811')] [2022-07-10 23:59:02,487][26022] Updated weights on worker 0-0, policy_version 947591 (0.00118) [2022-07-10 23:59:04,227][26022] Updated weights on worker 0-0, policy_version 947601 (0.00083) [2022-07-10 23:59:05,808][25689] Fps is (10 sec: 5313.0, 60 sec: 5534.6, 300 sec: 5571.2). Total num frames: 970350592. Throughput: 0: 5741.0. Samples: 970351426. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:05,809][25689] Avg episode reward: [(0, '0.144')] [2022-07-10 23:59:06,136][26022] Updated weights on worker 0-0, policy_version 947611 (0.00090) [2022-07-10 23:59:07,909][26022] Updated weights on worker 0-0, policy_version 947621 (0.00081) [2022-07-10 23:59:09,850][26022] Updated weights on worker 0-0, policy_version 947631 (0.00092) [2022-07-10 23:59:10,942][25689] Fps is (10 sec: 5345.2, 60 sec: 5545.1, 300 sec: 5568.7). Total num frames: 970380288. Throughput: 0: 5723.2. Samples: 970385048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:10,944][25689] Avg episode reward: [(0, '0.018')] [2022-07-10 23:59:11,644][26022] Updated weights on worker 0-0, policy_version 947641 (0.00086) [2022-07-10 23:59:13,435][26022] Updated weights on worker 0-0, policy_version 947651 (0.00090) [2022-07-10 23:59:15,446][26022] Updated weights on worker 0-0, policy_version 947661 (0.00088) [2022-07-10 23:59:15,965][25689] Fps is (10 sec: 5645.6, 60 sec: 5516.0, 300 sec: 5568.5). Total num frames: 970407936. Throughput: 0: 4907.0. Samples: 970401966. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:15,968][25689] Avg episode reward: [(0, '-0.702')] [2022-07-10 23:59:17,030][26022] Updated weights on worker 0-0, policy_version 947671 (0.00099) [2022-07-10 23:59:17,150][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-10 23:59:17,156][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000947672_970416128.pth [2022-07-10 23:59:17,157][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000945711_968408064.pth [2022-07-10 23:59:19,092][26022] Updated weights on worker 0-0, policy_version 947681 (0.00059) [2022-07-10 23:59:20,869][26022] Updated weights on worker 0-0, policy_version 947691 (0.00084) [2022-07-10 23:59:20,980][25689] Fps is (10 sec: 5610.7, 60 sec: 5557.2, 300 sec: 5571.7). Total num frames: 970436608. Throughput: 0: 5719.9. Samples: 970435320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:20,981][25689] Avg episode reward: [(0, '-1.576')] [2022-07-10 23:59:22,666][26022] Updated weights on worker 0-0, policy_version 947701 (0.00086) [2022-07-10 23:59:24,423][26022] Updated weights on worker 0-0, policy_version 947711 (0.00089) [2022-07-10 23:59:25,995][25689] Fps is (10 sec: 5513.1, 60 sec: 5505.6, 300 sec: 5566.3). Total num frames: 970463232. Throughput: 0: 5800.0. Samples: 970468512. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:25,996][25689] Avg episode reward: [(0, '-1.383')] [2022-07-10 23:59:26,485][26022] Updated weights on worker 0-0, policy_version 947721 (0.00682) [2022-07-10 23:59:28,299][26022] Updated weights on worker 0-0, policy_version 947731 (0.00086) [2022-07-10 23:59:30,017][26022] Updated weights on worker 0-0, policy_version 947741 (0.00050) [2022-07-10 23:59:31,056][25689] Fps is (10 sec: 5386.6, 60 sec: 5512.3, 300 sec: 5558.5). Total num frames: 970490880. Throughput: 0: 4974.4. Samples: 970485100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:31,056][25689] Avg episode reward: [(0, '-1.531')] [2022-07-10 23:59:32,005][26022] Updated weights on worker 0-0, policy_version 947751 (0.00088) [2022-07-10 23:59:33,777][26022] Updated weights on worker 0-0, policy_version 947761 (0.00093) [2022-07-10 23:59:35,852][26022] Updated weights on worker 0-0, policy_version 947771 (0.00088) [2022-07-10 23:59:36,072][25689] Fps is (10 sec: 5487.4, 60 sec: 5512.4, 300 sec: 5559.0). Total num frames: 970518528. Throughput: 0: 5791.3. Samples: 970518410. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:36,073][25689] Avg episode reward: [(0, '-1.691')] [2022-07-10 23:59:37,329][26022] Updated weights on worker 0-0, policy_version 947781 (0.00084) [2022-07-10 23:59:39,395][26022] Updated weights on worker 0-0, policy_version 947791 (0.00099) [2022-07-10 23:59:40,877][26022] Updated weights on worker 0-0, policy_version 947801 (0.00094) [2022-07-10 23:59:41,107][25689] Fps is (10 sec: 5807.2, 60 sec: 5543.6, 300 sec: 5569.3). Total num frames: 970549248. Throughput: 0: 5797.0. Samples: 970551992. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:41,107][25689] Avg episode reward: [(0, '-0.522')] [2022-07-10 23:59:42,980][26022] Updated weights on worker 0-0, policy_version 947811 (0.00085) [2022-07-10 23:59:44,654][26022] Updated weights on worker 0-0, policy_version 947821 (0.00085) [2022-07-10 23:59:46,128][25689] Fps is (10 sec: 5600.9, 60 sec: 5527.2, 300 sec: 5560.1). Total num frames: 970574848. Throughput: 0: 4997.5. Samples: 970569124. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:46,129][25689] Avg episode reward: [(0, '0.271')] [2022-07-10 23:59:46,570][26022] Updated weights on worker 0-0, policy_version 947831 (0.00089) [2022-07-10 23:59:48,273][26022] Updated weights on worker 0-0, policy_version 947841 (0.00084) [2022-07-10 23:59:50,238][26022] Updated weights on worker 0-0, policy_version 947851 (0.00086) [2022-07-10 23:59:51,179][25689] Fps is (10 sec: 5489.9, 60 sec: 5544.0, 300 sec: 5559.7). Total num frames: 970604544. Throughput: 0: 5845.8. Samples: 970602736. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:51,179][25689] Avg episode reward: [(0, '-0.239')] [2022-07-10 23:59:52,011][26022] Updated weights on worker 0-0, policy_version 947861 (0.00083) [2022-07-10 23:59:53,949][26022] Updated weights on worker 0-0, policy_version 947871 (0.00087) [2022-07-10 23:59:55,702][26022] Updated weights on worker 0-0, policy_version 947881 (0.00089) [2022-07-10 23:59:56,184][25689] Fps is (10 sec: 5702.4, 60 sec: 5544.0, 300 sec: 5556.3). Total num frames: 970632192. Throughput: 0: 5857.6. Samples: 970636216. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-10 23:59:56,184][25689] Avg episode reward: [(0, '-0.506')] [2022-07-10 23:59:57,630][26022] Updated weights on worker 0-0, policy_version 947891 (0.00084) [2022-07-10 23:59:59,488][26022] Updated weights on worker 0-0, policy_version 947901 (0.00090) [2022-07-11 00:00:01,193][25689] Fps is (10 sec: 5419.8, 60 sec: 5509.6, 300 sec: 5561.2). Total num frames: 970658816. Throughput: 0: 5845.0. Samples: 970669394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:01,193][25689] Avg episode reward: [(0, '-0.001')] [2022-07-11 00:00:01,457][26022] Updated weights on worker 0-0, policy_version 947911 (0.00080) [2022-07-11 00:00:03,483][26022] Updated weights on worker 0-0, policy_version 947921 (0.00082) [2022-07-11 00:00:05,436][26022] Updated weights on worker 0-0, policy_version 947931 (0.00082) [2022-07-11 00:00:06,219][25689] Fps is (10 sec: 5204.4, 60 sec: 5525.9, 300 sec: 5551.5). Total num frames: 970684416. Throughput: 0: 5692.3. Samples: 970683486. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:06,219][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 00:00:07,365][26022] Updated weights on worker 0-0, policy_version 947941 (0.00084) [2022-07-11 00:00:09,047][26022] Updated weights on worker 0-0, policy_version 947951 (0.00089) [2022-07-11 00:00:11,138][26022] Updated weights on worker 0-0, policy_version 947961 (0.00090) [2022-07-11 00:00:11,271][25689] Fps is (10 sec: 5385.1, 60 sec: 5516.5, 300 sec: 5548.7). Total num frames: 970713088. Throughput: 0: 5682.9. Samples: 970716916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:11,271][25689] Avg episode reward: [(0, '0.428')] [2022-07-11 00:00:12,570][26022] Updated weights on worker 0-0, policy_version 947971 (0.00090) [2022-07-11 00:00:14,657][26022] Updated weights on worker 0-0, policy_version 947981 (0.00083) [2022-07-11 00:00:16,279][25689] Fps is (10 sec: 5700.1, 60 sec: 5534.8, 300 sec: 5552.1). Total num frames: 970741760. Throughput: 0: 5693.6. Samples: 970750628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:16,279][25689] Avg episode reward: [(0, '1.341')] [2022-07-11 00:00:16,486][26022] Updated weights on worker 0-0, policy_version 947991 (0.00093) [2022-07-11 00:00:18,206][26022] Updated weights on worker 0-0, policy_version 948001 (0.00085) [2022-07-11 00:00:20,371][26022] Updated weights on worker 0-0, policy_version 948011 (0.00113) [2022-07-11 00:00:21,310][25689] Fps is (10 sec: 5610.0, 60 sec: 5516.4, 300 sec: 5548.7). Total num frames: 970769408. Throughput: 0: 4872.5. Samples: 970767416. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:21,310][25689] Avg episode reward: [(0, '1.615')] [2022-07-11 00:00:21,935][26022] Updated weights on worker 0-0, policy_version 948021 (0.00089) [2022-07-11 00:00:23,963][26022] Updated weights on worker 0-0, policy_version 948031 (0.00086) [2022-07-11 00:00:25,584][26022] Updated weights on worker 0-0, policy_version 948041 (0.00088) [2022-07-11 00:00:26,336][25689] Fps is (10 sec: 5497.9, 60 sec: 5532.3, 300 sec: 5549.5). Total num frames: 970797056. Throughput: 0: 5824.6. Samples: 970800664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:26,337][25689] Avg episode reward: [(0, '1.303')] [2022-07-11 00:00:27,676][26022] Updated weights on worker 0-0, policy_version 948051 (0.00108) [2022-07-11 00:00:29,396][26022] Updated weights on worker 0-0, policy_version 948061 (0.00096) [2022-07-11 00:00:31,152][26022] Updated weights on worker 0-0, policy_version 948071 (0.00082) [2022-07-11 00:00:31,458][25689] Fps is (10 sec: 5549.8, 60 sec: 5543.6, 300 sec: 5544.5). Total num frames: 970825728. Throughput: 0: 5795.5. Samples: 970833912. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:31,459][25689] Avg episode reward: [(0, '0.888')] [2022-07-11 00:00:33,257][26022] Updated weights on worker 0-0, policy_version 948081 (0.00087) [2022-07-11 00:00:34,932][26022] Updated weights on worker 0-0, policy_version 948091 (0.00088) [2022-07-11 00:00:36,478][25689] Fps is (10 sec: 5654.4, 60 sec: 5560.3, 300 sec: 5548.2). Total num frames: 970854400. Throughput: 0: 4952.3. Samples: 970850660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:36,478][25689] Avg episode reward: [(0, '0.802')] [2022-07-11 00:00:36,735][26022] Updated weights on worker 0-0, policy_version 948101 (0.00086) [2022-07-11 00:00:38,672][26022] Updated weights on worker 0-0, policy_version 948111 (0.00090) [2022-07-11 00:00:40,548][26022] Updated weights on worker 0-0, policy_version 948121 (0.00081) [2022-07-11 00:00:41,479][25689] Fps is (10 sec: 5517.8, 60 sec: 5495.5, 300 sec: 5545.3). Total num frames: 970881024. Throughput: 0: 5779.1. Samples: 970883978. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:41,480][25689] Avg episode reward: [(0, '0.556')] [2022-07-11 00:00:42,359][26022] Updated weights on worker 0-0, policy_version 948131 (0.00097) [2022-07-11 00:00:44,046][26022] Updated weights on worker 0-0, policy_version 948141 (0.00082) [2022-07-11 00:00:45,934][26022] Updated weights on worker 0-0, policy_version 948151 (0.00541) [2022-07-11 00:00:46,504][25689] Fps is (10 sec: 5412.8, 60 sec: 5529.0, 300 sec: 5540.1). Total num frames: 970908672. Throughput: 0: 5794.6. Samples: 970917530. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:46,505][25689] Avg episode reward: [(0, '0.651')] [2022-07-11 00:00:47,884][26022] Updated weights on worker 0-0, policy_version 948161 (0.00092) [2022-07-11 00:00:49,730][26022] Updated weights on worker 0-0, policy_version 948171 (0.00088) [2022-07-11 00:00:51,488][26022] Updated weights on worker 0-0, policy_version 948181 (0.00090) [2022-07-11 00:00:51,612][25689] Fps is (10 sec: 5558.6, 60 sec: 5507.0, 300 sec: 5534.9). Total num frames: 970937344. Throughput: 0: 4970.7. Samples: 970934090. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:51,612][25689] Avg episode reward: [(0, '-0.347')] [2022-07-11 00:00:53,286][26022] Updated weights on worker 0-0, policy_version 948191 (0.00078) [2022-07-11 00:00:55,337][26022] Updated weights on worker 0-0, policy_version 948201 (0.00083) [2022-07-11 00:00:56,628][25689] Fps is (10 sec: 5563.4, 60 sec: 5505.9, 300 sec: 5535.1). Total num frames: 970964992. Throughput: 0: 5773.4. Samples: 970966994. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:00:56,628][25689] Avg episode reward: [(0, '-0.419')] [2022-07-11 00:00:57,020][26022] Updated weights on worker 0-0, policy_version 948211 (0.00092) [2022-07-11 00:00:58,916][26022] Updated weights on worker 0-0, policy_version 948221 (0.00086) [2022-07-11 00:01:00,647][26022] Updated weights on worker 0-0, policy_version 948231 (0.00090) [2022-07-11 00:01:01,674][25689] Fps is (10 sec: 5495.5, 60 sec: 5519.5, 300 sec: 5541.3). Total num frames: 970992640. Throughput: 0: 5778.2. Samples: 971000664. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:01,675][25689] Avg episode reward: [(0, '-0.898')] [2022-07-11 00:01:03,083][26022] Updated weights on worker 0-0, policy_version 948241 (0.00088) [2022-07-11 00:01:04,666][26022] Updated weights on worker 0-0, policy_version 948251 (0.00087) [2022-07-11 00:01:06,678][25689] Fps is (10 sec: 5298.1, 60 sec: 5521.5, 300 sec: 5532.1). Total num frames: 971018240. Throughput: 0: 4830.2. Samples: 971014974. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:06,678][25689] Avg episode reward: [(0, '-0.966')] [2022-07-11 00:01:06,766][26022] Updated weights on worker 0-0, policy_version 948261 (0.00089) [2022-07-11 00:01:08,472][26022] Updated weights on worker 0-0, policy_version 948271 (0.00095) [2022-07-11 00:01:10,361][26022] Updated weights on worker 0-0, policy_version 948281 (0.00092) [2022-07-11 00:01:11,788][25689] Fps is (10 sec: 5467.3, 60 sec: 5533.1, 300 sec: 5530.6). Total num frames: 971047936. Throughput: 0: 5669.2. Samples: 971048472. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:11,788][25689] Avg episode reward: [(0, '-0.888')] [2022-07-11 00:01:12,235][26022] Updated weights on worker 0-0, policy_version 948291 (0.00088) [2022-07-11 00:01:14,060][26022] Updated weights on worker 0-0, policy_version 948301 (0.00088) [2022-07-11 00:01:15,956][26022] Updated weights on worker 0-0, policy_version 948311 (0.00092) [2022-07-11 00:01:16,814][25689] Fps is (10 sec: 5657.4, 60 sec: 5514.5, 300 sec: 5533.9). Total num frames: 971075584. Throughput: 0: 5702.3. Samples: 971082104. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:16,815][25689] Avg episode reward: [(0, '-0.943')] [2022-07-11 00:01:17,192][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:01:17,211][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000948318_971077632.pth [2022-07-11 00:01:17,211][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000946366_969078784.pth [2022-07-11 00:01:17,580][26022] Updated weights on worker 0-0, policy_version 948321 (0.00090) [2022-07-11 00:01:19,407][26022] Updated weights on worker 0-0, policy_version 948331 (0.00085) [2022-07-11 00:01:21,510][26022] Updated weights on worker 0-0, policy_version 948341 (0.00079) [2022-07-11 00:01:21,831][25689] Fps is (10 sec: 5505.9, 60 sec: 5515.9, 300 sec: 5527.4). Total num frames: 971103232. Throughput: 0: 4880.7. Samples: 971099044. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:21,831][25689] Avg episode reward: [(0, '-0.353')] [2022-07-11 00:01:23,095][26022] Updated weights on worker 0-0, policy_version 948351 (0.00086) [2022-07-11 00:01:25,030][26022] Updated weights on worker 0-0, policy_version 948361 (0.00092) [2022-07-11 00:01:26,855][25689] Fps is (10 sec: 5507.3, 60 sec: 5516.1, 300 sec: 5532.9). Total num frames: 971130880. Throughput: 0: 5823.1. Samples: 971132466. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:26,855][25689] Avg episode reward: [(0, '0.707')] [2022-07-11 00:01:26,876][26022] Updated weights on worker 0-0, policy_version 948371 (0.00091) [2022-07-11 00:01:28,664][26022] Updated weights on worker 0-0, policy_version 948381 (0.00093) [2022-07-11 00:01:30,703][26022] Updated weights on worker 0-0, policy_version 948391 (0.00081) [2022-07-11 00:01:31,960][25689] Fps is (10 sec: 5661.4, 60 sec: 5534.5, 300 sec: 5534.4). Total num frames: 971160576. Throughput: 0: 5823.7. Samples: 971165948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:31,963][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 00:01:32,410][26022] Updated weights on worker 0-0, policy_version 948401 (0.00094) [2022-07-11 00:01:34,313][26022] Updated weights on worker 0-0, policy_version 948411 (0.00095) [2022-07-11 00:01:36,139][26022] Updated weights on worker 0-0, policy_version 948421 (0.00087) [2022-07-11 00:01:36,981][25689] Fps is (10 sec: 5662.7, 60 sec: 5517.4, 300 sec: 5531.5). Total num frames: 971188224. Throughput: 0: 4989.8. Samples: 971182734. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:36,982][25689] Avg episode reward: [(0, '0.618')] [2022-07-11 00:01:37,623][26022] Updated weights on worker 0-0, policy_version 948431 (0.00083) [2022-07-11 00:01:39,827][26022] Updated weights on worker 0-0, policy_version 948441 (0.00089) [2022-07-11 00:01:41,492][26022] Updated weights on worker 0-0, policy_version 948451 (0.00091) [2022-07-11 00:01:42,059][25689] Fps is (10 sec: 5475.3, 60 sec: 5527.4, 300 sec: 5523.6). Total num frames: 971215872. Throughput: 0: 5799.9. Samples: 971216366. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:42,059][25689] Avg episode reward: [(0, '-0.075')] [2022-07-11 00:01:43,362][26022] Updated weights on worker 0-0, policy_version 948461 (0.00093) [2022-07-11 00:01:45,274][26022] Updated weights on worker 0-0, policy_version 948471 (0.00084) [2022-07-11 00:01:46,939][26022] Updated weights on worker 0-0, policy_version 948481 (0.00086) [2022-07-11 00:01:47,085][25689] Fps is (10 sec: 5574.4, 60 sec: 5544.2, 300 sec: 5531.2). Total num frames: 971244544. Throughput: 0: 5816.5. Samples: 971250134. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:47,085][25689] Avg episode reward: [(0, '-0.774')] [2022-07-11 00:01:48,823][26022] Updated weights on worker 0-0, policy_version 948491 (0.00083) [2022-07-11 00:01:50,683][26022] Updated weights on worker 0-0, policy_version 948501 (0.00088) [2022-07-11 00:01:52,163][25689] Fps is (10 sec: 5675.6, 60 sec: 5546.9, 300 sec: 5530.4). Total num frames: 971273216. Throughput: 0: 4994.4. Samples: 971266850. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:52,163][25689] Avg episode reward: [(0, '-0.627')] [2022-07-11 00:01:52,538][26022] Updated weights on worker 0-0, policy_version 948511 (0.00103) [2022-07-11 00:01:54,382][26022] Updated weights on worker 0-0, policy_version 948521 (0.00088) [2022-07-11 00:01:56,418][26022] Updated weights on worker 0-0, policy_version 948531 (0.01159) [2022-07-11 00:01:57,170][25689] Fps is (10 sec: 5483.0, 60 sec: 5530.8, 300 sec: 5525.0). Total num frames: 971299840. Throughput: 0: 5821.4. Samples: 971300258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:01:57,170][25689] Avg episode reward: [(0, '-0.491')] [2022-07-11 00:01:58,047][26022] Updated weights on worker 0-0, policy_version 948541 (0.00083) [2022-07-11 00:02:00,146][26022] Updated weights on worker 0-0, policy_version 948551 (0.00089) [2022-07-11 00:02:01,625][26022] Updated weights on worker 0-0, policy_version 948561 (0.00092) [2022-07-11 00:02:02,259][25689] Fps is (10 sec: 5274.1, 60 sec: 5510.0, 300 sec: 5524.5). Total num frames: 971326464. Throughput: 0: 5807.1. Samples: 971333668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:02,259][25689] Avg episode reward: [(0, '-1.232')] [2022-07-11 00:02:03,897][26022] Updated weights on worker 0-0, policy_version 948571 (0.00086) [2022-07-11 00:02:05,812][26022] Updated weights on worker 0-0, policy_version 948581 (0.00088) [2022-07-11 00:02:07,323][25689] Fps is (10 sec: 5445.9, 60 sec: 5555.2, 300 sec: 5528.2). Total num frames: 971355136. Throughput: 0: 4865.2. Samples: 971348600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:07,324][25689] Avg episode reward: [(0, '-0.648')] [2022-07-11 00:02:07,603][26022] Updated weights on worker 0-0, policy_version 948591 (0.00098) [2022-07-11 00:02:09,428][26022] Updated weights on worker 0-0, policy_version 948601 (0.00111) [2022-07-11 00:02:11,321][26022] Updated weights on worker 0-0, policy_version 948611 (0.00098) [2022-07-11 00:02:12,372][25689] Fps is (10 sec: 5569.1, 60 sec: 5527.0, 300 sec: 5524.9). Total num frames: 971382784. Throughput: 0: 5693.7. Samples: 971381914. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:12,373][25689] Avg episode reward: [(0, '-0.272')] [2022-07-11 00:02:13,002][26022] Updated weights on worker 0-0, policy_version 948621 (0.00084) [2022-07-11 00:02:14,945][26022] Updated weights on worker 0-0, policy_version 948631 (0.00292) [2022-07-11 00:02:16,646][26022] Updated weights on worker 0-0, policy_version 948641 (0.00085) [2022-07-11 00:02:17,386][25689] Fps is (10 sec: 5698.7, 60 sec: 5561.9, 300 sec: 5532.0). Total num frames: 971412480. Throughput: 0: 5709.6. Samples: 971415684. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:17,388][25689] Avg episode reward: [(0, '-0.754')] [2022-07-11 00:02:18,722][26022] Updated weights on worker 0-0, policy_version 948651 (0.00088) [2022-07-11 00:02:20,159][26022] Updated weights on worker 0-0, policy_version 948661 (0.00084) [2022-07-11 00:02:22,277][26022] Updated weights on worker 0-0, policy_version 948671 (0.00087) [2022-07-11 00:02:22,435][25689] Fps is (10 sec: 5596.6, 60 sec: 5542.1, 300 sec: 5528.9). Total num frames: 971439104. Throughput: 0: 4900.8. Samples: 971432548. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:22,435][25689] Avg episode reward: [(0, '-0.934')] [2022-07-11 00:02:24,111][26022] Updated weights on worker 0-0, policy_version 948681 (0.00092) [2022-07-11 00:02:25,890][26022] Updated weights on worker 0-0, policy_version 948691 (0.00101) [2022-07-11 00:02:27,438][25689] Fps is (10 sec: 5501.1, 60 sec: 5560.9, 300 sec: 5530.2). Total num frames: 971467776. Throughput: 0: 5854.8. Samples: 971466366. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:27,440][25689] Avg episode reward: [(0, '-1.354')] [2022-07-11 00:02:28,012][26022] Updated weights on worker 0-0, policy_version 948701 (0.00101) [2022-07-11 00:02:29,559][26022] Updated weights on worker 0-0, policy_version 948711 (0.00089) [2022-07-11 00:02:31,611][26022] Updated weights on worker 0-0, policy_version 948721 (0.00095) [2022-07-11 00:02:32,547][25689] Fps is (10 sec: 5771.9, 60 sec: 5560.5, 300 sec: 5538.6). Total num frames: 971497472. Throughput: 0: 5822.0. Samples: 971499376. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 00:02:32,548][25689] Avg episode reward: [(0, '-2.144')] [2022-07-11 00:02:33,377][26022] Updated weights on worker 0-0, policy_version 948731 (0.00648) [2022-07-11 00:02:35,150][26022] Updated weights on worker 0-0, policy_version 948741 (0.00092) [2022-07-11 00:02:37,089][26022] Updated weights on worker 0-0, policy_version 948751 (0.00092) [2022-07-11 00:02:37,585][25689] Fps is (10 sec: 5449.7, 60 sec: 5525.3, 300 sec: 5524.6). Total num frames: 971523072. Throughput: 0: 5789.4. Samples: 971532622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:02:37,585][25689] Avg episode reward: [(0, '-2.161')] [2022-07-11 00:02:39,009][26022] Updated weights on worker 0-0, policy_version 948761 (0.00087) [2022-07-11 00:02:40,780][26022] Updated weights on worker 0-0, policy_version 948771 (0.00086) [2022-07-11 00:02:42,601][25689] Fps is (10 sec: 5296.5, 60 sec: 5530.9, 300 sec: 5525.8). Total num frames: 971550720. Throughput: 0: 5791.7. Samples: 971549342. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:02:42,601][25689] Avg episode reward: [(0, '-2.146')] [2022-07-11 00:02:42,761][26022] Updated weights on worker 0-0, policy_version 948781 (0.00089) [2022-07-11 00:02:44,227][26022] Updated weights on worker 0-0, policy_version 948791 (0.00087) [2022-07-11 00:02:46,320][26022] Updated weights on worker 0-0, policy_version 948801 (0.00093) [2022-07-11 00:02:47,606][25689] Fps is (10 sec: 5722.2, 60 sec: 5549.7, 300 sec: 5533.7). Total num frames: 971580416. Throughput: 0: 5787.5. Samples: 971583088. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:02:47,607][25689] Avg episode reward: [(0, '-1.574')] [2022-07-11 00:02:48,024][26022] Updated weights on worker 0-0, policy_version 948811 (0.00091) [2022-07-11 00:02:49,926][26022] Updated weights on worker 0-0, policy_version 948821 (0.00091) [2022-07-11 00:02:51,696][26022] Updated weights on worker 0-0, policy_version 948831 (0.00088) [2022-07-11 00:02:52,652][25689] Fps is (10 sec: 5704.8, 60 sec: 5535.6, 300 sec: 5529.6). Total num frames: 971608064. Throughput: 0: 5822.7. Samples: 971616444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:02:52,653][25689] Avg episode reward: [(0, '-0.708')] [2022-07-11 00:02:53,655][26022] Updated weights on worker 0-0, policy_version 948841 (0.00093) [2022-07-11 00:02:55,396][26022] Updated weights on worker 0-0, policy_version 948851 (0.00125) [2022-07-11 00:02:57,466][26022] Updated weights on worker 0-0, policy_version 948861 (0.00088) [2022-07-11 00:02:57,689][25689] Fps is (10 sec: 5484.2, 60 sec: 5549.9, 300 sec: 5533.0). Total num frames: 971635712. Throughput: 0: 4999.9. Samples: 971633140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:02:57,689][25689] Avg episode reward: [(0, '-0.501')] [2022-07-11 00:02:58,870][26022] Updated weights on worker 0-0, policy_version 948871 (0.00087) [2022-07-11 00:03:00,974][26022] Updated weights on worker 0-0, policy_version 948881 (0.00088) [2022-07-11 00:03:02,716][25689] Fps is (10 sec: 5393.0, 60 sec: 5555.5, 300 sec: 5536.5). Total num frames: 971662336. Throughput: 0: 5806.4. Samples: 971666140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:02,716][25689] Avg episode reward: [(0, '0.208')] [2022-07-11 00:03:03,022][26022] Updated weights on worker 0-0, policy_version 948891 (0.00088) [2022-07-11 00:03:04,916][26022] Updated weights on worker 0-0, policy_version 948901 (0.00084) [2022-07-11 00:03:07,015][26022] Updated weights on worker 0-0, policy_version 948911 (0.00092) [2022-07-11 00:03:07,754][25689] Fps is (10 sec: 5290.0, 60 sec: 5524.1, 300 sec: 5530.0). Total num frames: 971688960. Throughput: 0: 5718.3. Samples: 971698304. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:07,755][25689] Avg episode reward: [(0, '0.014')] [2022-07-11 00:03:08,514][26022] Updated weights on worker 0-0, policy_version 948921 (0.00097) [2022-07-11 00:03:10,633][26022] Updated weights on worker 0-0, policy_version 948931 (0.00089) [2022-07-11 00:03:12,192][26022] Updated weights on worker 0-0, policy_version 948941 (0.00087) [2022-07-11 00:03:12,812][25689] Fps is (10 sec: 5578.2, 60 sec: 5557.1, 300 sec: 5530.4). Total num frames: 971718656. Throughput: 0: 4889.3. Samples: 971715012. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:12,813][25689] Avg episode reward: [(0, '0.139')] [2022-07-11 00:03:14,140][26022] Updated weights on worker 0-0, policy_version 948951 (0.00092) [2022-07-11 00:03:15,943][26022] Updated weights on worker 0-0, policy_version 948961 (0.00099) [2022-07-11 00:03:17,283][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:03:17,296][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000948969_971744256.pth [2022-07-11 00:03:17,296][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000947023_969751552.pth [2022-07-11 00:03:17,872][25689] Fps is (10 sec: 5667.7, 60 sec: 5519.0, 300 sec: 5534.5). Total num frames: 971746304. Throughput: 0: 5723.2. Samples: 971748654. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:17,873][25689] Avg episode reward: [(0, '1.112')] [2022-07-11 00:03:17,874][26022] Updated weights on worker 0-0, policy_version 948971 (0.00088) [2022-07-11 00:03:19,807][26022] Updated weights on worker 0-0, policy_version 948981 (0.00089) [2022-07-11 00:03:21,515][26022] Updated weights on worker 0-0, policy_version 948991 (0.00094) [2022-07-11 00:03:22,935][25689] Fps is (10 sec: 5361.4, 60 sec: 5517.7, 300 sec: 5523.1). Total num frames: 971772928. Throughput: 0: 5730.4. Samples: 971782004. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:22,935][25689] Avg episode reward: [(0, '0.590')] [2022-07-11 00:03:23,225][26022] Updated weights on worker 0-0, policy_version 949001 (0.00089) [2022-07-11 00:03:25,410][26022] Updated weights on worker 0-0, policy_version 949011 (0.00094) [2022-07-11 00:03:26,969][26022] Updated weights on worker 0-0, policy_version 949021 (0.00090) [2022-07-11 00:03:27,973][25689] Fps is (10 sec: 5575.9, 60 sec: 5531.5, 300 sec: 5531.8). Total num frames: 971802624. Throughput: 0: 4972.3. Samples: 971798836. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:27,975][25689] Avg episode reward: [(0, '-0.138')] [2022-07-11 00:03:29,114][26022] Updated weights on worker 0-0, policy_version 949031 (0.00107) [2022-07-11 00:03:30,719][26022] Updated weights on worker 0-0, policy_version 949041 (0.00085) [2022-07-11 00:03:32,599][26022] Updated weights on worker 0-0, policy_version 949051 (0.00080) [2022-07-11 00:03:33,060][25689] Fps is (10 sec: 5663.6, 60 sec: 5499.7, 300 sec: 5530.5). Total num frames: 971830272. Throughput: 0: 5775.5. Samples: 971831952. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:33,061][25689] Avg episode reward: [(0, '0.024')] [2022-07-11 00:03:34,407][26022] Updated weights on worker 0-0, policy_version 949061 (0.00096) [2022-07-11 00:03:36,138][26022] Updated weights on worker 0-0, policy_version 949071 (0.00081) [2022-07-11 00:03:37,988][26022] Updated weights on worker 0-0, policy_version 949081 (0.00085) [2022-07-11 00:03:38,089][25689] Fps is (10 sec: 5567.1, 60 sec: 5551.2, 300 sec: 5530.0). Total num frames: 971858944. Throughput: 0: 5796.7. Samples: 971865846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:38,090][25689] Avg episode reward: [(0, '-0.139')] [2022-07-11 00:03:39,725][26022] Updated weights on worker 0-0, policy_version 949091 (0.00082) [2022-07-11 00:03:41,716][26022] Updated weights on worker 0-0, policy_version 949101 (0.00092) [2022-07-11 00:03:43,095][25689] Fps is (10 sec: 5612.4, 60 sec: 5552.1, 300 sec: 5533.9). Total num frames: 971886592. Throughput: 0: 5006.1. Samples: 971882926. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:43,095][25689] Avg episode reward: [(0, '-0.183')] [2022-07-11 00:03:43,523][26022] Updated weights on worker 0-0, policy_version 949111 (0.00097) [2022-07-11 00:03:45,342][26022] Updated weights on worker 0-0, policy_version 949121 (0.00088) [2022-07-11 00:03:47,157][26022] Updated weights on worker 0-0, policy_version 949131 (0.00088) [2022-07-11 00:03:48,187][25689] Fps is (10 sec: 5678.8, 60 sec: 5544.2, 300 sec: 5536.5). Total num frames: 971916288. Throughput: 0: 5817.1. Samples: 971916426. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:48,188][25689] Avg episode reward: [(0, '-1.481')] [2022-07-11 00:03:49,038][26022] Updated weights on worker 0-0, policy_version 949141 (0.00091) [2022-07-11 00:03:50,838][26022] Updated weights on worker 0-0, policy_version 949151 (0.00112) [2022-07-11 00:03:52,775][26022] Updated weights on worker 0-0, policy_version 949161 (0.00088) [2022-07-11 00:03:53,271][25689] Fps is (10 sec: 5433.7, 60 sec: 5506.9, 300 sec: 5528.1). Total num frames: 971941888. Throughput: 0: 5825.6. Samples: 971949696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:53,272][25689] Avg episode reward: [(0, '-1.080')] [2022-07-11 00:03:54,335][26022] Updated weights on worker 0-0, policy_version 949171 (0.00086) [2022-07-11 00:03:56,521][26022] Updated weights on worker 0-0, policy_version 949181 (0.00084) [2022-07-11 00:03:58,085][26022] Updated weights on worker 0-0, policy_version 949191 (0.00081) [2022-07-11 00:03:58,275][25689] Fps is (10 sec: 5481.6, 60 sec: 5543.7, 300 sec: 5531.6). Total num frames: 971971584. Throughput: 0: 4985.9. Samples: 971966488. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:03:58,275][25689] Avg episode reward: [(0, '-0.349')] [2022-07-11 00:04:00,274][26022] Updated weights on worker 0-0, policy_version 949201 (0.00083) [2022-07-11 00:04:02,107][26022] Updated weights on worker 0-0, policy_version 949211 (0.00083) [2022-07-11 00:04:03,305][25689] Fps is (10 sec: 5510.9, 60 sec: 5526.5, 300 sec: 5534.8). Total num frames: 971997184. Throughput: 0: 5706.5. Samples: 971998256. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:03,307][25689] Avg episode reward: [(0, '-0.105')] [2022-07-11 00:04:04,111][26022] Updated weights on worker 0-0, policy_version 949221 (0.00086) [2022-07-11 00:04:05,930][26022] Updated weights on worker 0-0, policy_version 949231 (0.00091) [2022-07-11 00:04:07,785][26022] Updated weights on worker 0-0, policy_version 949241 (0.00084) [2022-07-11 00:04:08,313][25689] Fps is (10 sec: 5304.3, 60 sec: 5546.2, 300 sec: 5530.3). Total num frames: 972024832. Throughput: 0: 5705.2. Samples: 972031250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:08,315][25689] Avg episode reward: [(0, '-0.071')] [2022-07-11 00:04:09,617][26022] Updated weights on worker 0-0, policy_version 949251 (0.00089) [2022-07-11 00:04:11,533][26022] Updated weights on worker 0-0, policy_version 949261 (0.00086) [2022-07-11 00:04:13,347][26022] Updated weights on worker 0-0, policy_version 949271 (0.00087) [2022-07-11 00:04:13,442][25689] Fps is (10 sec: 5556.0, 60 sec: 5522.8, 300 sec: 5531.8). Total num frames: 972053504. Throughput: 0: 4870.1. Samples: 972047930. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:13,443][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 00:04:15,217][26022] Updated weights on worker 0-0, policy_version 949281 (0.00082) [2022-07-11 00:04:16,795][26022] Updated weights on worker 0-0, policy_version 949291 (0.00089) [2022-07-11 00:04:18,542][25689] Fps is (10 sec: 5606.0, 60 sec: 5536.0, 300 sec: 5530.2). Total num frames: 972082176. Throughput: 0: 5684.6. Samples: 972081702. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:18,544][25689] Avg episode reward: [(0, '0.803')] [2022-07-11 00:04:18,888][26022] Updated weights on worker 0-0, policy_version 949301 (0.00091) [2022-07-11 00:04:20,642][26022] Updated weights on worker 0-0, policy_version 949311 (0.00088) [2022-07-11 00:04:22,471][26022] Updated weights on worker 0-0, policy_version 949321 (0.00098) [2022-07-11 00:04:23,587][25689] Fps is (10 sec: 5551.7, 60 sec: 5554.6, 300 sec: 5533.1). Total num frames: 972109824. Throughput: 0: 5764.0. Samples: 972115158. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:23,588][25689] Avg episode reward: [(0, '0.112')] [2022-07-11 00:04:24,224][26022] Updated weights on worker 0-0, policy_version 949331 (0.00088) [2022-07-11 00:04:25,981][26022] Updated weights on worker 0-0, policy_version 949341 (0.00095) [2022-07-11 00:04:28,095][26022] Updated weights on worker 0-0, policy_version 949351 (0.00089) [2022-07-11 00:04:28,590][25689] Fps is (10 sec: 5503.6, 60 sec: 5524.0, 300 sec: 5534.1). Total num frames: 972137472. Throughput: 0: 4979.5. Samples: 972132216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:28,590][25689] Avg episode reward: [(0, '0.305')] [2022-07-11 00:04:29,797][26022] Updated weights on worker 0-0, policy_version 949361 (0.00085) [2022-07-11 00:04:31,663][26022] Updated weights on worker 0-0, policy_version 949371 (0.00083) [2022-07-11 00:04:33,579][26022] Updated weights on worker 0-0, policy_version 949381 (0.00082) [2022-07-11 00:04:33,659][25689] Fps is (10 sec: 5591.5, 60 sec: 5542.5, 300 sec: 5536.6). Total num frames: 972166144. Throughput: 0: 5803.5. Samples: 972165258. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:33,670][25689] Avg episode reward: [(0, '-0.987')] [2022-07-11 00:04:35,249][26022] Updated weights on worker 0-0, policy_version 949391 (0.00093) [2022-07-11 00:04:37,218][26022] Updated weights on worker 0-0, policy_version 949401 (0.00091) [2022-07-11 00:04:38,757][25689] Fps is (10 sec: 5539.4, 60 sec: 5519.4, 300 sec: 5525.1). Total num frames: 972193792. Throughput: 0: 5790.4. Samples: 972198752. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:38,758][25689] Avg episode reward: [(0, '-1.027')] [2022-07-11 00:04:39,239][26022] Updated weights on worker 0-0, policy_version 949411 (0.00087) [2022-07-11 00:04:40,781][26022] Updated weights on worker 0-0, policy_version 949421 (0.00092) [2022-07-11 00:04:42,749][26022] Updated weights on worker 0-0, policy_version 949431 (0.00092) [2022-07-11 00:04:43,853][25689] Fps is (10 sec: 5625.6, 60 sec: 5544.9, 300 sec: 5537.5). Total num frames: 972223488. Throughput: 0: 5788.1. Samples: 972232458. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:43,854][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 00:04:44,615][26022] Updated weights on worker 0-0, policy_version 949441 (0.00083) [2022-07-11 00:04:46,279][26022] Updated weights on worker 0-0, policy_version 949451 (0.00089) [2022-07-11 00:04:48,273][26022] Updated weights on worker 0-0, policy_version 949461 (0.00095) [2022-07-11 00:04:48,941][25689] Fps is (10 sec: 5630.8, 60 sec: 5511.6, 300 sec: 5529.9). Total num frames: 972251136. Throughput: 0: 5749.0. Samples: 972249216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:48,941][25689] Avg episode reward: [(0, '-0.093')] [2022-07-11 00:04:50,042][26022] Updated weights on worker 0-0, policy_version 949471 (0.00086) [2022-07-11 00:04:51,862][26022] Updated weights on worker 0-0, policy_version 949481 (0.00098) [2022-07-11 00:04:53,784][26022] Updated weights on worker 0-0, policy_version 949491 (0.00089) [2022-07-11 00:04:54,043][25689] Fps is (10 sec: 5426.0, 60 sec: 5543.6, 300 sec: 5528.1). Total num frames: 972278784. Throughput: 0: 5772.3. Samples: 972282922. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:54,044][25689] Avg episode reward: [(0, '0.464')] [2022-07-11 00:04:55,324][26022] Updated weights on worker 0-0, policy_version 949501 (0.00087) [2022-07-11 00:04:57,403][26022] Updated weights on worker 0-0, policy_version 949511 (0.00085) [2022-07-11 00:04:59,079][25689] Fps is (10 sec: 5555.4, 60 sec: 5523.8, 300 sec: 5534.5). Total num frames: 972307456. Throughput: 0: 5769.3. Samples: 972315996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:04:59,079][25689] Avg episode reward: [(0, '0.988')] [2022-07-11 00:04:59,276][26022] Updated weights on worker 0-0, policy_version 949521 (0.00086) [2022-07-11 00:05:01,030][26022] Updated weights on worker 0-0, policy_version 949531 (0.00100) [2022-07-11 00:05:03,470][26022] Updated weights on worker 0-0, policy_version 949541 (0.00084) [2022-07-11 00:05:04,080][25689] Fps is (10 sec: 5509.4, 60 sec: 5543.4, 300 sec: 5538.4). Total num frames: 972334080. Throughput: 0: 4945.7. Samples: 972332498. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:04,081][25689] Avg episode reward: [(0, '1.108')] [2022-07-11 00:05:05,058][26022] Updated weights on worker 0-0, policy_version 949551 (0.00091) [2022-07-11 00:05:06,967][26022] Updated weights on worker 0-0, policy_version 949561 (0.00951) [2022-07-11 00:05:08,808][26022] Updated weights on worker 0-0, policy_version 949571 (0.00091) [2022-07-11 00:05:09,115][25689] Fps is (10 sec: 5407.6, 60 sec: 5540.9, 300 sec: 5535.2). Total num frames: 972361728. Throughput: 0: 5683.3. Samples: 972363872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:09,116][25689] Avg episode reward: [(0, '1.250')] [2022-07-11 00:05:10,488][26022] Updated weights on worker 0-0, policy_version 949581 (0.00087) [2022-07-11 00:05:12,651][26022] Updated weights on worker 0-0, policy_version 949591 (0.00077) [2022-07-11 00:05:14,080][26022] Updated weights on worker 0-0, policy_version 949601 (0.00094) [2022-07-11 00:05:14,187][25689] Fps is (10 sec: 5673.8, 60 sec: 5562.9, 300 sec: 5537.5). Total num frames: 972391424. Throughput: 0: 5686.7. Samples: 972397472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:14,187][25689] Avg episode reward: [(0, '1.044')] [2022-07-11 00:05:16,385][26022] Updated weights on worker 0-0, policy_version 949611 (0.00084) [2022-07-11 00:05:17,343][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:05:17,351][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000949618_972408832.pth [2022-07-11 00:05:17,352][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000947672_970416128.pth [2022-07-11 00:05:18,177][26022] Updated weights on worker 0-0, policy_version 949621 (0.00093) [2022-07-11 00:05:19,229][25689] Fps is (10 sec: 5568.8, 60 sec: 5534.6, 300 sec: 5533.9). Total num frames: 972418048. Throughput: 0: 4873.8. Samples: 972414202. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:19,229][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 00:05:19,911][26022] Updated weights on worker 0-0, policy_version 949631 (0.00091) [2022-07-11 00:05:21,703][26022] Updated weights on worker 0-0, policy_version 949641 (0.00087) [2022-07-11 00:05:23,384][26022] Updated weights on worker 0-0, policy_version 949651 (0.00085) [2022-07-11 00:05:24,232][25689] Fps is (10 sec: 5402.9, 60 sec: 5538.3, 300 sec: 5534.3). Total num frames: 972445696. Throughput: 0: 5725.9. Samples: 972447886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:24,232][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 00:05:25,348][26022] Updated weights on worker 0-0, policy_version 949661 (0.00092) [2022-07-11 00:05:27,265][26022] Updated weights on worker 0-0, policy_version 949671 (0.00087) [2022-07-11 00:05:29,066][26022] Updated weights on worker 0-0, policy_version 949681 (0.00415) [2022-07-11 00:05:29,257][25689] Fps is (10 sec: 5616.0, 60 sec: 5553.2, 300 sec: 5536.1). Total num frames: 972474368. Throughput: 0: 5838.2. Samples: 972481464. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:29,257][25689] Avg episode reward: [(0, '0.383')] [2022-07-11 00:05:30,910][26022] Updated weights on worker 0-0, policy_version 949691 (0.00086) [2022-07-11 00:05:32,651][26022] Updated weights on worker 0-0, policy_version 949701 (0.00067) [2022-07-11 00:05:34,393][25689] Fps is (10 sec: 5643.4, 60 sec: 5547.0, 300 sec: 5533.9). Total num frames: 972503040. Throughput: 0: 4985.6. Samples: 972498212. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:34,394][25689] Avg episode reward: [(0, '0.345')] [2022-07-11 00:05:34,484][26022] Updated weights on worker 0-0, policy_version 949711 (0.00093) [2022-07-11 00:05:36,212][26022] Updated weights on worker 0-0, policy_version 949721 (0.00088) [2022-07-11 00:05:38,143][26022] Updated weights on worker 0-0, policy_version 949731 (0.00094) [2022-07-11 00:05:39,397][25689] Fps is (10 sec: 5554.3, 60 sec: 5555.7, 300 sec: 5537.3). Total num frames: 972530688. Throughput: 0: 5831.9. Samples: 972531822. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:39,397][25689] Avg episode reward: [(0, '-0.813')] [2022-07-11 00:05:39,983][26022] Updated weights on worker 0-0, policy_version 949741 (0.00090) [2022-07-11 00:05:41,884][26022] Updated weights on worker 0-0, policy_version 949751 (0.00082) [2022-07-11 00:05:43,731][26022] Updated weights on worker 0-0, policy_version 949761 (0.00084) [2022-07-11 00:05:44,410][25689] Fps is (10 sec: 5520.4, 60 sec: 5529.4, 300 sec: 5537.5). Total num frames: 972558336. Throughput: 0: 5828.5. Samples: 972565494. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:44,411][25689] Avg episode reward: [(0, '-0.728')] [2022-07-11 00:05:45,525][26022] Updated weights on worker 0-0, policy_version 949771 (0.00091) [2022-07-11 00:05:47,432][26022] Updated weights on worker 0-0, policy_version 949781 (0.00091) [2022-07-11 00:05:49,228][26022] Updated weights on worker 0-0, policy_version 949791 (0.00090) [2022-07-11 00:05:49,427][25689] Fps is (10 sec: 5512.8, 60 sec: 5535.9, 300 sec: 5535.8). Total num frames: 972585984. Throughput: 0: 5002.1. Samples: 972582358. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:49,428][25689] Avg episode reward: [(0, '-0.277')] [2022-07-11 00:05:51,088][26022] Updated weights on worker 0-0, policy_version 949801 (0.00087) [2022-07-11 00:05:52,940][26022] Updated weights on worker 0-0, policy_version 949811 (0.00089) [2022-07-11 00:05:54,471][25689] Fps is (10 sec: 5496.3, 60 sec: 5541.3, 300 sec: 5535.3). Total num frames: 972613632. Throughput: 0: 5834.2. Samples: 972615348. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:54,472][25689] Avg episode reward: [(0, '0.042')] [2022-07-11 00:05:54,747][26022] Updated weights on worker 0-0, policy_version 949821 (0.00090) [2022-07-11 00:05:56,571][26022] Updated weights on worker 0-0, policy_version 949831 (0.00094) [2022-07-11 00:05:58,505][26022] Updated weights on worker 0-0, policy_version 949841 (0.00090) [2022-07-11 00:05:59,513][25689] Fps is (10 sec: 5685.9, 60 sec: 5557.7, 300 sec: 5542.2). Total num frames: 972643328. Throughput: 0: 5805.5. Samples: 972648604. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:05:59,513][25689] Avg episode reward: [(0, '-0.618')] [2022-07-11 00:06:00,462][26022] Updated weights on worker 0-0, policy_version 949851 (0.00096) [2022-07-11 00:06:02,620][26022] Updated weights on worker 0-0, policy_version 949861 (0.00082) [2022-07-11 00:06:04,469][26022] Updated weights on worker 0-0, policy_version 949871 (0.00088) [2022-07-11 00:06:04,565][25689] Fps is (10 sec: 5376.5, 60 sec: 5519.1, 300 sec: 5537.9). Total num frames: 972667904. Throughput: 0: 4900.7. Samples: 972664268. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:06:04,567][25689] Avg episode reward: [(0, '0.324')] [2022-07-11 00:06:06,367][26022] Updated weights on worker 0-0, policy_version 949881 (0.00502) [2022-07-11 00:06:07,944][26022] Updated weights on worker 0-0, policy_version 949891 (0.00092) [2022-07-11 00:06:09,574][25689] Fps is (10 sec: 5089.0, 60 sec: 5504.6, 300 sec: 5529.4). Total num frames: 972694528. Throughput: 0: 5671.1. Samples: 972696610. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:06:09,575][25689] Avg episode reward: [(0, '0.388')] [2022-07-11 00:06:10,119][26022] Updated weights on worker 0-0, policy_version 949901 (0.00083) [2022-07-11 00:06:11,523][26022] Updated weights on worker 0-0, policy_version 949911 (0.00087) [2022-07-11 00:06:13,798][26022] Updated weights on worker 0-0, policy_version 949921 (0.00086) [2022-07-11 00:06:14,665][25689] Fps is (10 sec: 5779.3, 60 sec: 5536.7, 300 sec: 5542.0). Total num frames: 972726272. Throughput: 0: 5700.3. Samples: 972730460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:06:14,665][25689] Avg episode reward: [(0, '0.313')] [2022-07-11 00:06:15,383][26022] Updated weights on worker 0-0, policy_version 949931 (0.00493) [2022-07-11 00:06:17,280][26022] Updated weights on worker 0-0, policy_version 949941 (0.00085) [2022-07-11 00:06:19,011][26022] Updated weights on worker 0-0, policy_version 949951 (0.00092) [2022-07-11 00:06:19,673][25689] Fps is (10 sec: 5779.4, 60 sec: 5539.8, 300 sec: 5538.7). Total num frames: 972752896. Throughput: 0: 4892.5. Samples: 972747238. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:06:19,673][25689] Avg episode reward: [(0, '0.383')] [2022-07-11 00:06:21,000][26022] Updated weights on worker 0-0, policy_version 949961 (0.00079) [2022-07-11 00:06:22,675][26022] Updated weights on worker 0-0, policy_version 949971 (0.00089) [2022-07-11 00:06:24,720][25689] Fps is (10 sec: 5397.1, 60 sec: 5535.7, 300 sec: 5538.3). Total num frames: 972780544. Throughput: 0: 5799.0. Samples: 972781148. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 00:06:24,721][25689] Avg episode reward: [(0, '0.183')] [2022-07-11 00:06:24,728][26022] Updated weights on worker 0-0, policy_version 949981 (0.00090) [2022-07-11 00:06:26,189][26022] Updated weights on worker 0-0, policy_version 949991 (0.00086) [2022-07-11 00:06:28,308][26022] Updated weights on worker 0-0, policy_version 950001 (0.00093) [2022-07-11 00:06:29,732][25689] Fps is (10 sec: 5599.2, 60 sec: 5537.0, 300 sec: 5536.6). Total num frames: 972809216. Throughput: 0: 5871.6. Samples: 972814968. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:06:29,732][25689] Avg episode reward: [(0, '0.586')] [2022-07-11 00:06:30,027][26022] Updated weights on worker 0-0, policy_version 950011 (0.00086) [2022-07-11 00:06:32,028][26022] Updated weights on worker 0-0, policy_version 950021 (0.00090) [2022-07-11 00:06:33,589][26022] Updated weights on worker 0-0, policy_version 950031 (0.00084) [2022-07-11 00:06:34,888][25689] Fps is (10 sec: 5639.9, 60 sec: 5535.2, 300 sec: 5537.5). Total num frames: 972837888. Throughput: 0: 4994.7. Samples: 972831470. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:06:34,888][25689] Avg episode reward: [(0, '0.676')] [2022-07-11 00:06:35,667][26022] Updated weights on worker 0-0, policy_version 950041 (0.00099) [2022-07-11 00:06:37,248][26022] Updated weights on worker 0-0, policy_version 950051 (0.00085) [2022-07-11 00:06:39,355][26022] Updated weights on worker 0-0, policy_version 950061 (0.00083) [2022-07-11 00:06:39,912][25689] Fps is (10 sec: 5733.2, 60 sec: 5567.1, 300 sec: 5545.4). Total num frames: 972867584. Throughput: 0: 5848.7. Samples: 972865612. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:06:39,913][25689] Avg episode reward: [(0, '-0.203')] [2022-07-11 00:06:41,003][26022] Updated weights on worker 0-0, policy_version 950071 (0.00094) [2022-07-11 00:06:42,799][26022] Updated weights on worker 0-0, policy_version 950081 (0.00089) [2022-07-11 00:06:44,754][26022] Updated weights on worker 0-0, policy_version 950091 (0.00088) [2022-07-11 00:06:44,952][25689] Fps is (10 sec: 5596.5, 60 sec: 5547.8, 300 sec: 5538.3). Total num frames: 972894208. Throughput: 0: 5823.4. Samples: 972898962. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:06:44,952][25689] Avg episode reward: [(0, '-0.492')] [2022-07-11 00:06:46,433][26022] Updated weights on worker 0-0, policy_version 950101 (0.00086) [2022-07-11 00:06:48,321][26022] Updated weights on worker 0-0, policy_version 950111 (0.00095) [2022-07-11 00:06:49,994][25689] Fps is (10 sec: 5383.3, 60 sec: 5545.5, 300 sec: 5535.5). Total num frames: 972921856. Throughput: 0: 4979.4. Samples: 972915862. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:06:49,995][25689] Avg episode reward: [(0, '-0.895')] [2022-07-11 00:06:50,342][26022] Updated weights on worker 0-0, policy_version 950121 (0.00107) [2022-07-11 00:06:51,868][26022] Updated weights on worker 0-0, policy_version 950131 (0.00091) [2022-07-11 00:06:53,878][26022] Updated weights on worker 0-0, policy_version 950141 (0.00087) [2022-07-11 00:06:55,115][25689] Fps is (10 sec: 5642.2, 60 sec: 5572.2, 300 sec: 5543.7). Total num frames: 972951552. Throughput: 0: 5847.6. Samples: 972949748. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:06:55,115][25689] Avg episode reward: [(0, '-0.860')] [2022-07-11 00:06:55,501][26022] Updated weights on worker 0-0, policy_version 950151 (0.00370) [2022-07-11 00:06:57,431][26022] Updated weights on worker 0-0, policy_version 950161 (0.00086) [2022-07-11 00:06:59,107][26022] Updated weights on worker 0-0, policy_version 950171 (0.00087) [2022-07-11 00:07:00,205][25689] Fps is (10 sec: 5716.0, 60 sec: 5550.9, 300 sec: 5550.5). Total num frames: 972980224. Throughput: 0: 5840.6. Samples: 972984134. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:00,207][25689] Avg episode reward: [(0, '-0.587')] [2022-07-11 00:07:01,083][26022] Updated weights on worker 0-0, policy_version 950181 (0.00084) [2022-07-11 00:07:03,147][26022] Updated weights on worker 0-0, policy_version 950191 (0.00087) [2022-07-11 00:07:05,076][26022] Updated weights on worker 0-0, policy_version 950201 (0.00086) [2022-07-11 00:07:05,272][25689] Fps is (10 sec: 5444.0, 60 sec: 5583.3, 300 sec: 5543.6). Total num frames: 973006848. Throughput: 0: 5754.2. Samples: 973015890. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:05,272][25689] Avg episode reward: [(0, '-0.771')] [2022-07-11 00:07:06,744][26022] Updated weights on worker 0-0, policy_version 950211 (0.00089) [2022-07-11 00:07:08,695][26022] Updated weights on worker 0-0, policy_version 950221 (0.00097) [2022-07-11 00:07:10,282][25689] Fps is (10 sec: 5487.4, 60 sec: 5616.9, 300 sec: 5547.8). Total num frames: 973035520. Throughput: 0: 5778.2. Samples: 973033090. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:10,282][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 00:07:10,368][26022] Updated weights on worker 0-0, policy_version 950231 (0.00091) [2022-07-11 00:07:12,243][26022] Updated weights on worker 0-0, policy_version 950241 (0.00089) [2022-07-11 00:07:14,020][26022] Updated weights on worker 0-0, policy_version 950251 (0.00087) [2022-07-11 00:07:15,357][25689] Fps is (10 sec: 5584.3, 60 sec: 5550.8, 300 sec: 5539.8). Total num frames: 973063168. Throughput: 0: 5781.7. Samples: 973066784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:15,358][25689] Avg episode reward: [(0, '0.627')] [2022-07-11 00:07:15,771][26022] Updated weights on worker 0-0, policy_version 950261 (0.00084) [2022-07-11 00:07:17,443][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:07:17,452][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000950271_973077504.pth [2022-07-11 00:07:17,453][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000948318_971077632.pth [2022-07-11 00:07:17,459][26022] Updated weights on worker 0-0, policy_version 950271 (0.00089) [2022-07-11 00:07:19,594][26022] Updated weights on worker 0-0, policy_version 950281 (0.00084) [2022-07-11 00:07:20,374][25689] Fps is (10 sec: 5580.4, 60 sec: 5583.8, 300 sec: 5547.2). Total num frames: 973091840. Throughput: 0: 5779.6. Samples: 973100704. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:20,377][25689] Avg episode reward: [(0, '0.648')] [2022-07-11 00:07:21,208][26022] Updated weights on worker 0-0, policy_version 950291 (0.00098) [2022-07-11 00:07:23,268][26022] Updated weights on worker 0-0, policy_version 950301 (0.00096) [2022-07-11 00:07:24,678][26022] Updated weights on worker 0-0, policy_version 950311 (0.00092) [2022-07-11 00:07:25,447][25689] Fps is (10 sec: 5683.4, 60 sec: 5598.3, 300 sec: 5545.9). Total num frames: 973120512. Throughput: 0: 5029.9. Samples: 973117368. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:25,448][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 00:07:26,992][26022] Updated weights on worker 0-0, policy_version 950321 (0.00090) [2022-07-11 00:07:28,713][26022] Updated weights on worker 0-0, policy_version 950331 (0.00085) [2022-07-11 00:07:30,378][26022] Updated weights on worker 0-0, policy_version 950341 (0.00095) [2022-07-11 00:07:30,480][25689] Fps is (10 sec: 5674.5, 60 sec: 5596.3, 300 sec: 5543.9). Total num frames: 973149184. Throughput: 0: 5828.2. Samples: 973150808. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:30,480][25689] Avg episode reward: [(0, '0.253')] [2022-07-11 00:07:32,435][26022] Updated weights on worker 0-0, policy_version 950351 (0.00079) [2022-07-11 00:07:33,962][26022] Updated weights on worker 0-0, policy_version 950361 (0.00085) [2022-07-11 00:07:35,541][25689] Fps is (10 sec: 5579.9, 60 sec: 5588.3, 300 sec: 5550.4). Total num frames: 973176832. Throughput: 0: 5841.8. Samples: 973184690. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:35,541][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 00:07:36,186][26022] Updated weights on worker 0-0, policy_version 950371 (0.00095) [2022-07-11 00:07:37,683][26022] Updated weights on worker 0-0, policy_version 950381 (0.00090) [2022-07-11 00:07:39,610][26022] Updated weights on worker 0-0, policy_version 950391 (0.00101) [2022-07-11 00:07:40,558][25689] Fps is (10 sec: 5588.5, 60 sec: 5572.1, 300 sec: 5553.8). Total num frames: 973205504. Throughput: 0: 4990.6. Samples: 973201432. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:40,558][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 00:07:41,469][26022] Updated weights on worker 0-0, policy_version 950401 (0.00083) [2022-07-11 00:07:43,285][26022] Updated weights on worker 0-0, policy_version 950411 (0.00092) [2022-07-11 00:07:45,200][26022] Updated weights on worker 0-0, policy_version 950421 (0.00086) [2022-07-11 00:07:45,587][25689] Fps is (10 sec: 5606.2, 60 sec: 5589.9, 300 sec: 5546.5). Total num frames: 973233152. Throughput: 0: 5841.7. Samples: 973235018. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:45,587][25689] Avg episode reward: [(0, '0.370')] [2022-07-11 00:07:46,804][26022] Updated weights on worker 0-0, policy_version 950431 (0.00089) [2022-07-11 00:07:48,751][26022] Updated weights on worker 0-0, policy_version 950441 (0.00087) [2022-07-11 00:07:50,589][25689] Fps is (10 sec: 5512.6, 60 sec: 5593.6, 300 sec: 5547.3). Total num frames: 973260800. Throughput: 0: 5857.4. Samples: 973268592. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:50,589][25689] Avg episode reward: [(0, '0.280')] [2022-07-11 00:07:50,611][26022] Updated weights on worker 0-0, policy_version 950451 (0.00089) [2022-07-11 00:07:52,556][26022] Updated weights on worker 0-0, policy_version 950461 (0.00084) [2022-07-11 00:07:54,213][26022] Updated weights on worker 0-0, policy_version 950471 (0.00080) [2022-07-11 00:07:55,708][25689] Fps is (10 sec: 5564.6, 60 sec: 5576.9, 300 sec: 5549.2). Total num frames: 973289472. Throughput: 0: 4984.4. Samples: 973285212. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:07:55,717][25689] Avg episode reward: [(0, '0.070')] [2022-07-11 00:07:56,163][26022] Updated weights on worker 0-0, policy_version 950481 (0.00087) [2022-07-11 00:07:57,935][26022] Updated weights on worker 0-0, policy_version 950491 (0.00101) [2022-07-11 00:07:59,799][26022] Updated weights on worker 0-0, policy_version 950501 (0.00097) [2022-07-11 00:08:00,740][25689] Fps is (10 sec: 5548.0, 60 sec: 5565.3, 300 sec: 5552.5). Total num frames: 973317120. Throughput: 0: 5819.9. Samples: 973318890. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:00,742][25689] Avg episode reward: [(0, '0.562')] [2022-07-11 00:08:02,085][26022] Updated weights on worker 0-0, policy_version 950511 (0.00086) [2022-07-11 00:08:03,627][26022] Updated weights on worker 0-0, policy_version 950521 (0.00091) [2022-07-11 00:08:05,587][26022] Updated weights on worker 0-0, policy_version 950531 (0.00089) [2022-07-11 00:08:05,815][25689] Fps is (10 sec: 5471.1, 60 sec: 5581.5, 300 sec: 5555.3). Total num frames: 973344768. Throughput: 0: 5702.2. Samples: 973350362. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:05,817][25689] Avg episode reward: [(0, '0.600')] [2022-07-11 00:08:07,691][26022] Updated weights on worker 0-0, policy_version 950541 (0.00092) [2022-07-11 00:08:09,181][26022] Updated weights on worker 0-0, policy_version 950551 (0.00090) [2022-07-11 00:08:10,828][25689] Fps is (10 sec: 5379.7, 60 sec: 5547.4, 300 sec: 5545.8). Total num frames: 973371392. Throughput: 0: 4872.8. Samples: 973367216. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:10,830][25689] Avg episode reward: [(0, '-0.416')] [2022-07-11 00:08:11,214][26022] Updated weights on worker 0-0, policy_version 950561 (0.00087) [2022-07-11 00:08:12,920][26022] Updated weights on worker 0-0, policy_version 950571 (0.00096) [2022-07-11 00:08:14,769][26022] Updated weights on worker 0-0, policy_version 950581 (0.00084) [2022-07-11 00:08:15,934][25689] Fps is (10 sec: 5565.5, 60 sec: 5578.4, 300 sec: 5551.8). Total num frames: 973401088. Throughput: 0: 5718.6. Samples: 973400878. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:15,937][25689] Avg episode reward: [(0, '-0.008')] [2022-07-11 00:08:16,609][26022] Updated weights on worker 0-0, policy_version 950591 (0.00087) [2022-07-11 00:08:18,363][26022] Updated weights on worker 0-0, policy_version 950601 (0.00089) [2022-07-11 00:08:20,333][26022] Updated weights on worker 0-0, policy_version 950611 (0.00086) [2022-07-11 00:08:20,944][25689] Fps is (10 sec: 5567.6, 60 sec: 5545.2, 300 sec: 5552.8). Total num frames: 973427712. Throughput: 0: 5743.9. Samples: 973434938. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:20,946][25689] Avg episode reward: [(0, '-0.018')] [2022-07-11 00:08:21,947][26022] Updated weights on worker 0-0, policy_version 950621 (0.00089) [2022-07-11 00:08:24,012][26022] Updated weights on worker 0-0, policy_version 950631 (0.00091) [2022-07-11 00:08:25,499][26022] Updated weights on worker 0-0, policy_version 950641 (0.00085) [2022-07-11 00:08:25,988][25689] Fps is (10 sec: 5602.1, 60 sec: 5564.8, 300 sec: 5552.7). Total num frames: 973457408. Throughput: 0: 5025.5. Samples: 973451742. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:25,988][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 00:08:27,657][26022] Updated weights on worker 0-0, policy_version 950651 (0.00094) [2022-07-11 00:08:29,452][26022] Updated weights on worker 0-0, policy_version 950661 (0.00090) [2022-07-11 00:08:31,000][25689] Fps is (10 sec: 5702.4, 60 sec: 5549.8, 300 sec: 5554.1). Total num frames: 973485056. Throughput: 0: 5846.6. Samples: 973485152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:31,001][25689] Avg episode reward: [(0, '0.304')] [2022-07-11 00:08:31,420][26022] Updated weights on worker 0-0, policy_version 950671 (0.00091) [2022-07-11 00:08:33,115][26022] Updated weights on worker 0-0, policy_version 950681 (0.00875) [2022-07-11 00:08:35,146][26022] Updated weights on worker 0-0, policy_version 950691 (0.00085) [2022-07-11 00:08:36,126][25689] Fps is (10 sec: 5555.2, 60 sec: 5560.7, 300 sec: 5552.3). Total num frames: 973513728. Throughput: 0: 5827.8. Samples: 973518552. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:36,127][25689] Avg episode reward: [(0, '0.331')] [2022-07-11 00:08:36,482][26022] Updated weights on worker 0-0, policy_version 950701 (0.00076) [2022-07-11 00:08:38,976][26022] Updated weights on worker 0-0, policy_version 950711 (0.00082) [2022-07-11 00:08:40,403][26022] Updated weights on worker 0-0, policy_version 950721 (0.00234) [2022-07-11 00:08:41,151][25689] Fps is (10 sec: 5547.9, 60 sec: 5543.0, 300 sec: 5551.9). Total num frames: 973541376. Throughput: 0: 4971.8. Samples: 973535406. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:41,153][25689] Avg episode reward: [(0, '0.297')] [2022-07-11 00:08:42,593][26022] Updated weights on worker 0-0, policy_version 950731 (0.00086) [2022-07-11 00:08:44,077][26022] Updated weights on worker 0-0, policy_version 950741 (0.00087) [2022-07-11 00:08:46,142][26022] Updated weights on worker 0-0, policy_version 950751 (0.00087) [2022-07-11 00:08:46,167][25689] Fps is (10 sec: 5506.9, 60 sec: 5544.2, 300 sec: 5546.5). Total num frames: 973569024. Throughput: 0: 5809.1. Samples: 973568966. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:46,168][25689] Avg episode reward: [(0, '0.146')] [2022-07-11 00:08:47,594][26022] Updated weights on worker 0-0, policy_version 950761 (0.00091) [2022-07-11 00:08:49,562][26022] Updated weights on worker 0-0, policy_version 950771 (0.00086) [2022-07-11 00:08:51,185][25689] Fps is (10 sec: 5612.9, 60 sec: 5559.6, 300 sec: 5558.0). Total num frames: 973597696. Throughput: 0: 5832.9. Samples: 973602892. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:51,186][25689] Avg episode reward: [(0, '-0.756')] [2022-07-11 00:08:51,455][26022] Updated weights on worker 0-0, policy_version 950781 (0.00100) [2022-07-11 00:08:53,386][26022] Updated weights on worker 0-0, policy_version 950791 (0.00089) [2022-07-11 00:08:55,111][26022] Updated weights on worker 0-0, policy_version 950801 (0.00092) [2022-07-11 00:08:56,291][25689] Fps is (10 sec: 5664.4, 60 sec: 5560.9, 300 sec: 5552.7). Total num frames: 973626368. Throughput: 0: 4997.1. Samples: 973619318. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:08:56,291][25689] Avg episode reward: [(0, '-0.953')] [2022-07-11 00:08:57,036][26022] Updated weights on worker 0-0, policy_version 950811 (0.00085) [2022-07-11 00:08:58,559][26022] Updated weights on worker 0-0, policy_version 950821 (0.00090) [2022-07-11 00:09:00,857][26022] Updated weights on worker 0-0, policy_version 950831 (0.00084) [2022-07-11 00:09:01,313][25689] Fps is (10 sec: 5560.9, 60 sec: 5561.8, 300 sec: 5559.7). Total num frames: 973654016. Throughput: 0: 5838.9. Samples: 973653130. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:01,314][25689] Avg episode reward: [(0, '-0.667')] [2022-07-11 00:09:02,708][26022] Updated weights on worker 0-0, policy_version 950841 (0.00089) [2022-07-11 00:09:04,718][26022] Updated weights on worker 0-0, policy_version 950851 (0.00092) [2022-07-11 00:09:06,366][25689] Fps is (10 sec: 5386.5, 60 sec: 5546.9, 300 sec: 5555.4). Total num frames: 973680640. Throughput: 0: 5711.8. Samples: 973684340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:06,367][25689] Avg episode reward: [(0, '-0.743')] [2022-07-11 00:09:06,371][26022] Updated weights on worker 0-0, policy_version 950861 (0.00086) [2022-07-11 00:09:08,216][26022] Updated weights on worker 0-0, policy_version 950871 (0.00087) [2022-07-11 00:09:10,276][26022] Updated weights on worker 0-0, policy_version 950881 (0.00097) [2022-07-11 00:09:11,371][25689] Fps is (10 sec: 5395.8, 60 sec: 5564.6, 300 sec: 5554.3). Total num frames: 973708288. Throughput: 0: 5707.7. Samples: 973718108. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:11,373][25689] Avg episode reward: [(0, '-1.598')] [2022-07-11 00:09:11,807][26022] Updated weights on worker 0-0, policy_version 950891 (0.00083) [2022-07-11 00:09:13,761][26022] Updated weights on worker 0-0, policy_version 950901 (0.00089) [2022-07-11 00:09:15,704][26022] Updated weights on worker 0-0, policy_version 950911 (0.00084) [2022-07-11 00:09:16,479][25689] Fps is (10 sec: 5467.8, 60 sec: 5530.6, 300 sec: 5550.7). Total num frames: 973735936. Throughput: 0: 5725.6. Samples: 973734910. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:16,480][25689] Avg episode reward: [(0, '-1.604')] [2022-07-11 00:09:17,311][26022] Updated weights on worker 0-0, policy_version 950921 (0.00087) [2022-07-11 00:09:17,466][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:09:17,482][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000950923_973745152.pth [2022-07-11 00:09:17,483][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000948969_971744256.pth [2022-07-11 00:09:19,582][26022] Updated weights on worker 0-0, policy_version 950931 (0.00082) [2022-07-11 00:09:20,967][26022] Updated weights on worker 0-0, policy_version 950941 (0.00093) [2022-07-11 00:09:21,496][25689] Fps is (10 sec: 5562.8, 60 sec: 5563.7, 300 sec: 5554.7). Total num frames: 973764608. Throughput: 0: 5717.7. Samples: 973768528. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:21,496][25689] Avg episode reward: [(0, '-0.957')] [2022-07-11 00:09:23,053][26022] Updated weights on worker 0-0, policy_version 950951 (0.00082) [2022-07-11 00:09:24,866][26022] Updated weights on worker 0-0, policy_version 950961 (0.00084) [2022-07-11 00:09:26,505][25689] Fps is (10 sec: 5617.7, 60 sec: 5533.1, 300 sec: 5554.5). Total num frames: 973792256. Throughput: 0: 5829.0. Samples: 973801728. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:26,506][25689] Avg episode reward: [(0, '-0.710')] [2022-07-11 00:09:26,785][26022] Updated weights on worker 0-0, policy_version 950971 (0.00083) [2022-07-11 00:09:28,557][26022] Updated weights on worker 0-0, policy_version 950981 (0.00091) [2022-07-11 00:09:30,448][26022] Updated weights on worker 0-0, policy_version 950991 (0.00092) [2022-07-11 00:09:31,533][25689] Fps is (10 sec: 5509.3, 60 sec: 5531.7, 300 sec: 5551.9). Total num frames: 973819904. Throughput: 0: 4971.8. Samples: 973818346. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:31,535][25689] Avg episode reward: [(0, '0.309')] [2022-07-11 00:09:32,115][26022] Updated weights on worker 0-0, policy_version 951001 (0.00084) [2022-07-11 00:09:34,155][26022] Updated weights on worker 0-0, policy_version 951011 (0.00087) [2022-07-11 00:09:35,992][26022] Updated weights on worker 0-0, policy_version 951021 (0.00094) [2022-07-11 00:09:36,568][25689] Fps is (10 sec: 5494.9, 60 sec: 5523.0, 300 sec: 5553.0). Total num frames: 973847552. Throughput: 0: 5812.1. Samples: 973851668. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:36,569][25689] Avg episode reward: [(0, '1.041')] [2022-07-11 00:09:37,845][26022] Updated weights on worker 0-0, policy_version 951031 (0.00092) [2022-07-11 00:09:39,742][26022] Updated weights on worker 0-0, policy_version 951041 (0.00088) [2022-07-11 00:09:41,400][26022] Updated weights on worker 0-0, policy_version 951051 (0.00084) [2022-07-11 00:09:41,619][25689] Fps is (10 sec: 5685.2, 60 sec: 5554.5, 300 sec: 5553.9). Total num frames: 973877248. Throughput: 0: 5809.8. Samples: 973885442. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:41,620][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 00:09:43,441][26022] Updated weights on worker 0-0, policy_version 951061 (0.00077) [2022-07-11 00:09:45,094][26022] Updated weights on worker 0-0, policy_version 951071 (0.00093) [2022-07-11 00:09:46,666][25689] Fps is (10 sec: 5577.3, 60 sec: 5534.8, 300 sec: 5551.2). Total num frames: 973903872. Throughput: 0: 4986.2. Samples: 973902256. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:46,667][25689] Avg episode reward: [(0, '0.269')] [2022-07-11 00:09:47,036][26022] Updated weights on worker 0-0, policy_version 951081 (0.00088) [2022-07-11 00:09:48,905][26022] Updated weights on worker 0-0, policy_version 951091 (0.00087) [2022-07-11 00:09:50,693][26022] Updated weights on worker 0-0, policy_version 951101 (0.00087) [2022-07-11 00:09:51,759][25689] Fps is (10 sec: 5453.6, 60 sec: 5528.0, 300 sec: 5554.8). Total num frames: 973932544. Throughput: 0: 5814.1. Samples: 973935942. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:51,759][25689] Avg episode reward: [(0, '-0.668')] [2022-07-11 00:09:52,410][26022] Updated weights on worker 0-0, policy_version 951111 (0.00093) [2022-07-11 00:09:54,315][26022] Updated weights on worker 0-0, policy_version 951121 (0.00080) [2022-07-11 00:09:56,123][26022] Updated weights on worker 0-0, policy_version 951131 (0.00097) [2022-07-11 00:09:56,830][25689] Fps is (10 sec: 5641.9, 60 sec: 5531.1, 300 sec: 5554.2). Total num frames: 973961216. Throughput: 0: 5809.5. Samples: 973969380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:09:56,831][25689] Avg episode reward: [(0, '-0.560')] [2022-07-11 00:09:58,015][26022] Updated weights on worker 0-0, policy_version 951141 (0.00086) [2022-07-11 00:09:59,888][26022] Updated weights on worker 0-0, policy_version 951151 (0.00091) [2022-07-11 00:10:01,620][26022] Updated weights on worker 0-0, policy_version 951161 (0.00093) [2022-07-11 00:10:01,833][25689] Fps is (10 sec: 5692.3, 60 sec: 5549.8, 300 sec: 5561.0). Total num frames: 973989888. Throughput: 0: 4981.1. Samples: 973986128. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:10:01,834][25689] Avg episode reward: [(0, '-1.556')] [2022-07-11 00:10:03,935][26022] Updated weights on worker 0-0, policy_version 951171 (0.00087) [2022-07-11 00:10:05,656][26022] Updated weights on worker 0-0, policy_version 951181 (0.00087) [2022-07-11 00:10:06,871][25689] Fps is (10 sec: 5405.4, 60 sec: 5534.3, 300 sec: 5554.1). Total num frames: 974015488. Throughput: 0: 5704.3. Samples: 974017508. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:10:06,871][25689] Avg episode reward: [(0, '-2.078')] [2022-07-11 00:10:07,467][26022] Updated weights on worker 0-0, policy_version 951191 (0.00094) [2022-07-11 00:10:09,419][26022] Updated weights on worker 0-0, policy_version 951201 (0.00088) [2022-07-11 00:10:11,176][26022] Updated weights on worker 0-0, policy_version 951211 (0.00093) [2022-07-11 00:10:11,879][25689] Fps is (10 sec: 5300.5, 60 sec: 5534.0, 300 sec: 5548.4). Total num frames: 974043136. Throughput: 0: 5734.3. Samples: 974051316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:10:11,879][25689] Avg episode reward: [(0, '-2.739')] [2022-07-11 00:10:12,939][26022] Updated weights on worker 0-0, policy_version 951221 (0.00081) [2022-07-11 00:10:14,911][26022] Updated weights on worker 0-0, policy_version 951231 (0.00095) [2022-07-11 00:10:16,515][26022] Updated weights on worker 0-0, policy_version 951241 (0.00102) [2022-07-11 00:10:17,012][25689] Fps is (10 sec: 5654.4, 60 sec: 5565.5, 300 sec: 5557.0). Total num frames: 974072832. Throughput: 0: 4895.0. Samples: 974068170. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 00:10:17,012][25689] Avg episode reward: [(0, '-1.231')] [2022-07-11 00:10:18,630][26022] Updated weights on worker 0-0, policy_version 951251 (0.00091) [2022-07-11 00:10:20,216][26022] Updated weights on worker 0-0, policy_version 951261 (0.00097) [2022-07-11 00:10:22,042][25689] Fps is (10 sec: 5541.7, 60 sec: 5530.5, 300 sec: 5553.1). Total num frames: 974099456. Throughput: 0: 5724.6. Samples: 974101816. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:22,043][25689] Avg episode reward: [(0, '-0.821')] [2022-07-11 00:10:22,184][26022] Updated weights on worker 0-0, policy_version 951271 (0.00091) [2022-07-11 00:10:24,151][26022] Updated weights on worker 0-0, policy_version 951281 (0.00096) [2022-07-11 00:10:25,683][26022] Updated weights on worker 0-0, policy_version 951291 (0.00095) [2022-07-11 00:10:27,053][25689] Fps is (10 sec: 5609.2, 60 sec: 5564.1, 300 sec: 5556.8). Total num frames: 974129152. Throughput: 0: 5830.3. Samples: 974135176. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:27,053][25689] Avg episode reward: [(0, '-0.162')] [2022-07-11 00:10:27,699][26022] Updated weights on worker 0-0, policy_version 951301 (0.00086) [2022-07-11 00:10:29,548][26022] Updated weights on worker 0-0, policy_version 951311 (0.00092) [2022-07-11 00:10:31,198][26022] Updated weights on worker 0-0, policy_version 951321 (0.00088) [2022-07-11 00:10:32,063][25689] Fps is (10 sec: 5722.4, 60 sec: 5565.8, 300 sec: 5555.7). Total num frames: 974156800. Throughput: 0: 4979.0. Samples: 974151812. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:32,063][25689] Avg episode reward: [(0, '0.060')] [2022-07-11 00:10:33,333][26022] Updated weights on worker 0-0, policy_version 951331 (0.00095) [2022-07-11 00:10:34,843][26022] Updated weights on worker 0-0, policy_version 951341 (0.00093) [2022-07-11 00:10:36,868][26022] Updated weights on worker 0-0, policy_version 951351 (0.00084) [2022-07-11 00:10:37,170][25689] Fps is (10 sec: 5465.7, 60 sec: 5559.2, 300 sec: 5553.8). Total num frames: 974184448. Throughput: 0: 5798.1. Samples: 974185046. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:37,170][25689] Avg episode reward: [(0, '0.267')] [2022-07-11 00:10:38,631][26022] Updated weights on worker 0-0, policy_version 951361 (0.00081) [2022-07-11 00:10:40,536][26022] Updated weights on worker 0-0, policy_version 951371 (0.00090) [2022-07-11 00:10:42,221][25689] Fps is (10 sec: 5544.1, 60 sec: 5542.3, 300 sec: 5556.5). Total num frames: 974213120. Throughput: 0: 5794.6. Samples: 974218748. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:42,222][25689] Avg episode reward: [(0, '0.723')] [2022-07-11 00:10:42,326][26022] Updated weights on worker 0-0, policy_version 951381 (0.00085) [2022-07-11 00:10:44,084][26022] Updated weights on worker 0-0, policy_version 951391 (0.00086) [2022-07-11 00:10:45,886][26022] Updated weights on worker 0-0, policy_version 951401 (0.00083) [2022-07-11 00:10:47,231][25689] Fps is (10 sec: 5496.0, 60 sec: 5545.7, 300 sec: 5553.2). Total num frames: 974239744. Throughput: 0: 4988.9. Samples: 974235842. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:47,231][25689] Avg episode reward: [(0, '0.606')] [2022-07-11 00:10:47,855][26022] Updated weights on worker 0-0, policy_version 951411 (0.00099) [2022-07-11 00:10:49,783][26022] Updated weights on worker 0-0, policy_version 951421 (0.00086) [2022-07-11 00:10:51,427][26022] Updated weights on worker 0-0, policy_version 951431 (0.00093) [2022-07-11 00:10:52,236][25689] Fps is (10 sec: 5623.8, 60 sec: 5570.6, 300 sec: 5560.8). Total num frames: 974269440. Throughput: 0: 5821.1. Samples: 974269242. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:52,236][25689] Avg episode reward: [(0, '0.319')] [2022-07-11 00:10:53,404][26022] Updated weights on worker 0-0, policy_version 951441 (0.00085) [2022-07-11 00:10:55,111][26022] Updated weights on worker 0-0, policy_version 951451 (0.00088) [2022-07-11 00:10:56,928][26022] Updated weights on worker 0-0, policy_version 951461 (0.00087) [2022-07-11 00:10:57,271][25689] Fps is (10 sec: 5813.5, 60 sec: 5574.0, 300 sec: 5557.5). Total num frames: 974298112. Throughput: 0: 5849.2. Samples: 974302622. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:10:57,271][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 00:10:58,939][26022] Updated weights on worker 0-0, policy_version 951471 (0.00083) [2022-07-11 00:11:00,411][26022] Updated weights on worker 0-0, policy_version 951481 (0.00089) [2022-07-11 00:11:02,286][25689] Fps is (10 sec: 5196.3, 60 sec: 5488.1, 300 sec: 5554.7). Total num frames: 974321664. Throughput: 0: 5028.6. Samples: 974319646. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:02,287][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 00:11:03,027][26022] Updated weights on worker 0-0, policy_version 951491 (0.00080) [2022-07-11 00:11:04,524][26022] Updated weights on worker 0-0, policy_version 951501 (0.00120) [2022-07-11 00:11:06,576][26022] Updated weights on worker 0-0, policy_version 951511 (0.00089) [2022-07-11 00:11:07,315][25689] Fps is (10 sec: 5403.2, 60 sec: 5573.6, 300 sec: 5568.1). Total num frames: 974352384. Throughput: 0: 5743.4. Samples: 974351194. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:07,316][25689] Avg episode reward: [(0, '0.697')] [2022-07-11 00:11:08,656][26022] Updated weights on worker 0-0, policy_version 951521 (0.00085) [2022-07-11 00:11:10,103][26022] Updated weights on worker 0-0, policy_version 951531 (0.00088) [2022-07-11 00:11:12,222][26022] Updated weights on worker 0-0, policy_version 951541 (0.00085) [2022-07-11 00:11:12,326][25689] Fps is (10 sec: 5609.5, 60 sec: 5539.5, 300 sec: 5549.0). Total num frames: 974377984. Throughput: 0: 5737.8. Samples: 974384516. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:12,327][25689] Avg episode reward: [(0, '0.737')] [2022-07-11 00:11:13,835][26022] Updated weights on worker 0-0, policy_version 951551 (0.00093) [2022-07-11 00:11:15,802][26022] Updated weights on worker 0-0, policy_version 951561 (0.00086) [2022-07-11 00:11:17,398][25689] Fps is (10 sec: 5382.8, 60 sec: 5528.2, 300 sec: 5554.7). Total num frames: 974406656. Throughput: 0: 4903.0. Samples: 974401304. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:17,398][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 00:11:17,706][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:11:17,724][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000951571_974408704.pth [2022-07-11 00:11:17,724][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000949618_972408832.pth [2022-07-11 00:11:17,743][26022] Updated weights on worker 0-0, policy_version 951571 (0.00096) [2022-07-11 00:11:19,401][26022] Updated weights on worker 0-0, policy_version 951581 (0.00083) [2022-07-11 00:11:21,206][26022] Updated weights on worker 0-0, policy_version 951591 (0.00083) [2022-07-11 00:11:22,417][25689] Fps is (10 sec: 5683.0, 60 sec: 5563.1, 300 sec: 5558.6). Total num frames: 974435328. Throughput: 0: 5725.1. Samples: 974434894. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:22,417][25689] Avg episode reward: [(0, '1.111')] [2022-07-11 00:11:23,150][26022] Updated weights on worker 0-0, policy_version 951601 (0.00098) [2022-07-11 00:11:24,850][26022] Updated weights on worker 0-0, policy_version 951611 (0.00097) [2022-07-11 00:11:26,826][26022] Updated weights on worker 0-0, policy_version 951621 (0.00081) [2022-07-11 00:11:27,418][25689] Fps is (10 sec: 5722.6, 60 sec: 5547.0, 300 sec: 5558.8). Total num frames: 974464000. Throughput: 0: 5828.5. Samples: 974468366. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:27,419][25689] Avg episode reward: [(0, '1.074')] [2022-07-11 00:11:28,569][26022] Updated weights on worker 0-0, policy_version 951631 (0.00084) [2022-07-11 00:11:30,242][26022] Updated weights on worker 0-0, policy_version 951641 (0.00096) [2022-07-11 00:11:32,339][26022] Updated weights on worker 0-0, policy_version 951651 (0.00094) [2022-07-11 00:11:32,421][25689] Fps is (10 sec: 5629.4, 60 sec: 5547.7, 300 sec: 5558.2). Total num frames: 974491648. Throughput: 0: 5007.9. Samples: 974485150. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:32,422][25689] Avg episode reward: [(0, '1.061')] [2022-07-11 00:11:34,146][26022] Updated weights on worker 0-0, policy_version 951661 (0.00082) [2022-07-11 00:11:36,003][26022] Updated weights on worker 0-0, policy_version 951671 (0.00088) [2022-07-11 00:11:37,459][25689] Fps is (10 sec: 5507.3, 60 sec: 5554.0, 300 sec: 5551.1). Total num frames: 974519296. Throughput: 0: 5858.3. Samples: 974518828. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:37,459][25689] Avg episode reward: [(0, '1.109')] [2022-07-11 00:11:37,888][26022] Updated weights on worker 0-0, policy_version 951681 (0.00092) [2022-07-11 00:11:39,456][26022] Updated weights on worker 0-0, policy_version 951691 (0.00084) [2022-07-11 00:11:41,413][26022] Updated weights on worker 0-0, policy_version 951701 (0.00498) [2022-07-11 00:11:42,477][25689] Fps is (10 sec: 5498.9, 60 sec: 5540.1, 300 sec: 5554.9). Total num frames: 974546944. Throughput: 0: 5857.9. Samples: 974552406. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:42,477][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 00:11:43,251][26022] Updated weights on worker 0-0, policy_version 951711 (0.00086) [2022-07-11 00:11:45,026][26022] Updated weights on worker 0-0, policy_version 951721 (0.00080) [2022-07-11 00:11:46,853][26022] Updated weights on worker 0-0, policy_version 951731 (0.00087) [2022-07-11 00:11:47,485][25689] Fps is (10 sec: 5515.2, 60 sec: 5557.3, 300 sec: 5555.6). Total num frames: 974574592. Throughput: 0: 5026.3. Samples: 974569226. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:47,485][25689] Avg episode reward: [(0, '1.486')] [2022-07-11 00:11:48,691][26022] Updated weights on worker 0-0, policy_version 951741 (0.00102) [2022-07-11 00:11:50,595][26022] Updated weights on worker 0-0, policy_version 951751 (0.00086) [2022-07-11 00:11:52,524][25689] Fps is (10 sec: 5503.7, 60 sec: 5520.2, 300 sec: 5550.2). Total num frames: 974602240. Throughput: 0: 5851.2. Samples: 974602776. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:52,524][25689] Avg episode reward: [(0, '1.681')] [2022-07-11 00:11:52,644][26022] Updated weights on worker 0-0, policy_version 951761 (0.00091) [2022-07-11 00:11:54,215][26022] Updated weights on worker 0-0, policy_version 951771 (0.00087) [2022-07-11 00:11:56,149][26022] Updated weights on worker 0-0, policy_version 951781 (0.00090) [2022-07-11 00:11:57,561][25689] Fps is (10 sec: 5690.9, 60 sec: 5536.9, 300 sec: 5554.7). Total num frames: 974631936. Throughput: 0: 5827.0. Samples: 974635966. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:11:57,563][25689] Avg episode reward: [(0, '1.649')] [2022-07-11 00:11:57,950][26022] Updated weights on worker 0-0, policy_version 951791 (0.00087) [2022-07-11 00:11:59,848][26022] Updated weights on worker 0-0, policy_version 951801 (0.00083) [2022-07-11 00:12:01,945][26022] Updated weights on worker 0-0, policy_version 951811 (0.00092) [2022-07-11 00:12:02,579][25689] Fps is (10 sec: 5499.1, 60 sec: 5570.6, 300 sec: 5552.1). Total num frames: 974657536. Throughput: 0: 5007.1. Samples: 974653062. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:02,582][25689] Avg episode reward: [(0, '1.624')] [2022-07-11 00:12:03,742][26022] Updated weights on worker 0-0, policy_version 951821 (0.00086) [2022-07-11 00:12:05,644][26022] Updated weights on worker 0-0, policy_version 951831 (0.00095) [2022-07-11 00:12:07,211][26022] Updated weights on worker 0-0, policy_version 951841 (0.00084) [2022-07-11 00:12:07,629][25689] Fps is (10 sec: 5492.3, 60 sec: 5551.7, 300 sec: 5554.8). Total num frames: 974687232. Throughput: 0: 5736.3. Samples: 974684780. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:07,630][25689] Avg episode reward: [(0, '1.670')] [2022-07-11 00:12:09,335][26022] Updated weights on worker 0-0, policy_version 951851 (0.00094) [2022-07-11 00:12:10,967][26022] Updated weights on worker 0-0, policy_version 951861 (0.00086) [2022-07-11 00:12:12,643][25689] Fps is (10 sec: 5596.5, 60 sec: 5568.5, 300 sec: 5552.5). Total num frames: 974713856. Throughput: 0: 5744.7. Samples: 974718354. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:12,645][25689] Avg episode reward: [(0, '1.623')] [2022-07-11 00:12:13,075][26022] Updated weights on worker 0-0, policy_version 951871 (0.00089) [2022-07-11 00:12:14,616][26022] Updated weights on worker 0-0, policy_version 951881 (0.00093) [2022-07-11 00:12:16,761][26022] Updated weights on worker 0-0, policy_version 951891 (0.00088) [2022-07-11 00:12:17,695][25689] Fps is (10 sec: 5493.3, 60 sec: 5570.2, 300 sec: 5551.9). Total num frames: 974742528. Throughput: 0: 5742.9. Samples: 974751594. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:17,697][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 00:12:18,357][26022] Updated weights on worker 0-0, policy_version 951901 (0.00084) [2022-07-11 00:12:20,395][26022] Updated weights on worker 0-0, policy_version 951911 (0.00087) [2022-07-11 00:12:22,057][26022] Updated weights on worker 0-0, policy_version 951921 (0.00086) [2022-07-11 00:12:22,708][25689] Fps is (10 sec: 5595.5, 60 sec: 5553.8, 300 sec: 5549.5). Total num frames: 974770176. Throughput: 0: 5726.0. Samples: 974768318. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:22,710][25689] Avg episode reward: [(0, '0.344')] [2022-07-11 00:12:24,047][26022] Updated weights on worker 0-0, policy_version 951931 (0.00088) [2022-07-11 00:12:25,731][26022] Updated weights on worker 0-0, policy_version 951941 (0.00085) [2022-07-11 00:12:27,649][26022] Updated weights on worker 0-0, policy_version 951951 (0.00084) [2022-07-11 00:12:27,774][25689] Fps is (10 sec: 5486.6, 60 sec: 5531.0, 300 sec: 5545.5). Total num frames: 974797824. Throughput: 0: 5808.5. Samples: 974801790. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:27,774][25689] Avg episode reward: [(0, '0.378')] [2022-07-11 00:12:29,578][26022] Updated weights on worker 0-0, policy_version 951961 (0.00091) [2022-07-11 00:12:31,437][26022] Updated weights on worker 0-0, policy_version 951971 (0.00080) [2022-07-11 00:12:32,819][25689] Fps is (10 sec: 5468.8, 60 sec: 5527.1, 300 sec: 5545.8). Total num frames: 974825472. Throughput: 0: 5780.4. Samples: 974834982. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:32,820][25689] Avg episode reward: [(0, '0.255')] [2022-07-11 00:12:33,231][26022] Updated weights on worker 0-0, policy_version 951981 (0.00085) [2022-07-11 00:12:35,278][26022] Updated weights on worker 0-0, policy_version 951991 (0.00087) [2022-07-11 00:12:36,776][26022] Updated weights on worker 0-0, policy_version 952001 (0.00085) [2022-07-11 00:12:37,928][25689] Fps is (10 sec: 5546.2, 60 sec: 5537.4, 300 sec: 5544.1). Total num frames: 974854144. Throughput: 0: 4945.1. Samples: 974851648. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:37,929][25689] Avg episode reward: [(0, '0.122')] [2022-07-11 00:12:38,901][26022] Updated weights on worker 0-0, policy_version 952011 (0.00094) [2022-07-11 00:12:40,645][26022] Updated weights on worker 0-0, policy_version 952021 (0.00084) [2022-07-11 00:12:42,391][26022] Updated weights on worker 0-0, policy_version 952031 (0.00096) [2022-07-11 00:12:43,017][25689] Fps is (10 sec: 5723.6, 60 sec: 5564.8, 300 sec: 5549.8). Total num frames: 974883840. Throughput: 0: 5739.8. Samples: 974884886. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:43,018][25689] Avg episode reward: [(0, '0.342')] [2022-07-11 00:12:44,307][26022] Updated weights on worker 0-0, policy_version 952041 (0.00082) [2022-07-11 00:12:46,139][26022] Updated weights on worker 0-0, policy_version 952051 (0.00087) [2022-07-11 00:12:47,916][26022] Updated weights on worker 0-0, policy_version 952061 (0.00113) [2022-07-11 00:12:48,096][25689] Fps is (10 sec: 5639.8, 60 sec: 5558.3, 300 sec: 5548.4). Total num frames: 974911488. Throughput: 0: 5737.8. Samples: 974918396. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:48,097][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 00:12:49,826][26022] Updated weights on worker 0-0, policy_version 952071 (0.00093) [2022-07-11 00:12:51,571][26022] Updated weights on worker 0-0, policy_version 952081 (0.00097) [2022-07-11 00:12:53,110][25689] Fps is (10 sec: 5377.0, 60 sec: 5543.7, 300 sec: 5543.5). Total num frames: 974938112. Throughput: 0: 4941.7. Samples: 974935254. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:53,111][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 00:12:53,441][26022] Updated weights on worker 0-0, policy_version 952091 (0.00084) [2022-07-11 00:12:55,259][26022] Updated weights on worker 0-0, policy_version 952101 (0.00087) [2022-07-11 00:12:57,079][26022] Updated weights on worker 0-0, policy_version 952111 (0.00084) [2022-07-11 00:12:58,160][25689] Fps is (10 sec: 5596.2, 60 sec: 5542.6, 300 sec: 5550.0). Total num frames: 974967808. Throughput: 0: 5782.7. Samples: 974968642. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:12:58,161][25689] Avg episode reward: [(0, '1.342')] [2022-07-11 00:12:58,993][26022] Updated weights on worker 0-0, policy_version 952121 (0.00091) [2022-07-11 00:13:00,764][26022] Updated weights on worker 0-0, policy_version 952131 (0.00085) [2022-07-11 00:13:03,071][26022] Updated weights on worker 0-0, policy_version 952141 (0.00089) [2022-07-11 00:13:03,170][25689] Fps is (10 sec: 5395.1, 60 sec: 5526.4, 300 sec: 5540.9). Total num frames: 974992384. Throughput: 0: 5690.5. Samples: 974999566. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:03,170][25689] Avg episode reward: [(0, '1.234')] [2022-07-11 00:13:04,859][26022] Updated weights on worker 0-0, policy_version 952151 (0.00094) [2022-07-11 00:13:06,709][26022] Updated weights on worker 0-0, policy_version 952161 (0.00086) [2022-07-11 00:13:08,223][25689] Fps is (10 sec: 5189.7, 60 sec: 5492.3, 300 sec: 5543.6). Total num frames: 975020032. Throughput: 0: 4865.8. Samples: 975016330. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:08,224][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 00:13:08,524][26022] Updated weights on worker 0-0, policy_version 952171 (0.00083) [2022-07-11 00:13:10,408][26022] Updated weights on worker 0-0, policy_version 952181 (0.00070) [2022-07-11 00:13:12,475][26022] Updated weights on worker 0-0, policy_version 952191 (0.00087) [2022-07-11 00:13:13,280][25689] Fps is (10 sec: 5570.9, 60 sec: 5522.2, 300 sec: 5541.1). Total num frames: 975048704. Throughput: 0: 5665.3. Samples: 975049518. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:13,280][25689] Avg episode reward: [(0, '0.569')] [2022-07-11 00:13:14,185][26022] Updated weights on worker 0-0, policy_version 952201 (0.00090) [2022-07-11 00:13:16,114][26022] Updated weights on worker 0-0, policy_version 952211 (0.00086) [2022-07-11 00:13:17,785][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:13:17,798][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000952221_975074304.pth [2022-07-11 00:13:17,798][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000950271_973077504.pth [2022-07-11 00:13:17,803][26022] Updated weights on worker 0-0, policy_version 952221 (0.00091) [2022-07-11 00:13:18,401][25689] Fps is (10 sec: 5533.4, 60 sec: 5499.0, 300 sec: 5542.4). Total num frames: 975076352. Throughput: 0: 5659.1. Samples: 975083188. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:18,402][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 00:13:19,718][26022] Updated weights on worker 0-0, policy_version 952231 (0.00091) [2022-07-11 00:13:21,733][26022] Updated weights on worker 0-0, policy_version 952241 (0.00084) [2022-07-11 00:13:23,317][26022] Updated weights on worker 0-0, policy_version 952251 (0.00096) [2022-07-11 00:13:23,442][25689] Fps is (10 sec: 5542.0, 60 sec: 5513.4, 300 sec: 5539.1). Total num frames: 975105024. Throughput: 0: 4944.0. Samples: 975099792. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:23,443][25689] Avg episode reward: [(0, '0.537')] [2022-07-11 00:13:25,314][26022] Updated weights on worker 0-0, policy_version 952261 (0.00086) [2022-07-11 00:13:26,942][26022] Updated weights on worker 0-0, policy_version 952271 (0.00099) [2022-07-11 00:13:28,460][25689] Fps is (10 sec: 5599.4, 60 sec: 5517.7, 300 sec: 5539.0). Total num frames: 975132672. Throughput: 0: 5771.9. Samples: 975133132. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:28,460][25689] Avg episode reward: [(0, '0.355')] [2022-07-11 00:13:28,912][26022] Updated weights on worker 0-0, policy_version 952281 (0.00082) [2022-07-11 00:13:30,649][26022] Updated weights on worker 0-0, policy_version 952291 (0.00089) [2022-07-11 00:13:32,575][26022] Updated weights on worker 0-0, policy_version 952301 (0.00083) [2022-07-11 00:13:33,494][25689] Fps is (10 sec: 5500.9, 60 sec: 5518.7, 300 sec: 5537.2). Total num frames: 975160320. Throughput: 0: 5794.8. Samples: 975166658. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:33,495][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 00:13:34,361][26022] Updated weights on worker 0-0, policy_version 952311 (0.00086) [2022-07-11 00:13:36,308][26022] Updated weights on worker 0-0, policy_version 952321 (0.00085) [2022-07-11 00:13:38,020][26022] Updated weights on worker 0-0, policy_version 952331 (0.00088) [2022-07-11 00:13:38,554][25689] Fps is (10 sec: 5579.1, 60 sec: 5523.2, 300 sec: 5540.0). Total num frames: 975188992. Throughput: 0: 4973.7. Samples: 975183422. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:38,555][25689] Avg episode reward: [(0, '0.220')] [2022-07-11 00:13:40,183][26022] Updated weights on worker 0-0, policy_version 952341 (0.00085) [2022-07-11 00:13:41,678][26022] Updated weights on worker 0-0, policy_version 952351 (0.00092) [2022-07-11 00:13:43,598][25689] Fps is (10 sec: 5574.0, 60 sec: 5493.4, 300 sec: 5539.5). Total num frames: 975216640. Throughput: 0: 5802.3. Samples: 975216744. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:43,599][25689] Avg episode reward: [(0, '0.012')] [2022-07-11 00:13:43,722][26022] Updated weights on worker 0-0, policy_version 952361 (0.00885) [2022-07-11 00:13:45,326][26022] Updated weights on worker 0-0, policy_version 952371 (0.00086) [2022-07-11 00:13:47,305][26022] Updated weights on worker 0-0, policy_version 952381 (0.00084) [2022-07-11 00:13:48,653][25689] Fps is (10 sec: 5475.6, 60 sec: 5495.7, 300 sec: 5535.4). Total num frames: 975244288. Throughput: 0: 5801.4. Samples: 975250282. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:48,653][25689] Avg episode reward: [(0, '0.266')] [2022-07-11 00:13:49,318][26022] Updated weights on worker 0-0, policy_version 952391 (0.00088) [2022-07-11 00:13:51,000][26022] Updated weights on worker 0-0, policy_version 952401 (0.00097) [2022-07-11 00:13:52,911][26022] Updated weights on worker 0-0, policy_version 952411 (0.00094) [2022-07-11 00:13:53,657][25689] Fps is (10 sec: 5599.2, 60 sec: 5530.4, 300 sec: 5537.3). Total num frames: 975272960. Throughput: 0: 4961.6. Samples: 975266696. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:53,657][25689] Avg episode reward: [(0, '-0.084')] [2022-07-11 00:13:54,743][26022] Updated weights on worker 0-0, policy_version 952421 (0.00122) [2022-07-11 00:13:56,669][26022] Updated weights on worker 0-0, policy_version 952431 (0.00079) [2022-07-11 00:13:58,624][26022] Updated weights on worker 0-0, policy_version 952441 (0.00091) [2022-07-11 00:13:58,757][25689] Fps is (10 sec: 5573.9, 60 sec: 5492.0, 300 sec: 5535.8). Total num frames: 975300608. Throughput: 0: 5770.4. Samples: 975300000. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:13:58,758][25689] Avg episode reward: [(0, '0.106')] [2022-07-11 00:14:00,356][26022] Updated weights on worker 0-0, policy_version 952451 (0.00099) [2022-07-11 00:14:02,384][26022] Updated weights on worker 0-0, policy_version 952461 (0.00091) [2022-07-11 00:14:03,816][25689] Fps is (10 sec: 5443.2, 60 sec: 5538.3, 300 sec: 5539.1). Total num frames: 975328256. Throughput: 0: 5667.7. Samples: 975331330. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:14:03,816][25689] Avg episode reward: [(0, '0.172')] [2022-07-11 00:14:04,444][26022] Updated weights on worker 0-0, policy_version 952471 (0.00087) [2022-07-11 00:14:06,274][26022] Updated weights on worker 0-0, policy_version 952481 (0.00092) [2022-07-11 00:14:08,184][26022] Updated weights on worker 0-0, policy_version 952491 (0.00090) [2022-07-11 00:14:08,905][25689] Fps is (10 sec: 5449.2, 60 sec: 5535.0, 300 sec: 5537.6). Total num frames: 975355904. Throughput: 0: 4832.8. Samples: 975348154. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:14:08,905][25689] Avg episode reward: [(0, '-0.196')] [2022-07-11 00:14:09,780][26022] Updated weights on worker 0-0, policy_version 952501 (0.00085) [2022-07-11 00:14:11,728][26022] Updated weights on worker 0-0, policy_version 952511 (0.00087) [2022-07-11 00:14:13,400][26022] Updated weights on worker 0-0, policy_version 952521 (0.00083) [2022-07-11 00:14:13,930][25689] Fps is (10 sec: 5365.6, 60 sec: 5504.0, 300 sec: 5535.7). Total num frames: 975382528. Throughput: 0: 5673.2. Samples: 975381712. Policy #0 lag: (min: 0.0, avg: 7.1, max: 19.0) [2022-07-11 00:14:13,931][25689] Avg episode reward: [(0, '-0.234')] [2022-07-11 00:14:15,259][26022] Updated weights on worker 0-0, policy_version 952531 (0.00088) [2022-07-11 00:14:17,132][26022] Updated weights on worker 0-0, policy_version 952541 (0.00090) [2022-07-11 00:14:19,019][25689] Fps is (10 sec: 5467.1, 60 sec: 5523.9, 300 sec: 5534.3). Total num frames: 975411200. Throughput: 0: 5693.1. Samples: 975415352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:19,020][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 00:14:19,164][26022] Updated weights on worker 0-0, policy_version 952551 (0.00082) [2022-07-11 00:14:20,912][26022] Updated weights on worker 0-0, policy_version 952561 (0.00106) [2022-07-11 00:14:22,783][26022] Updated weights on worker 0-0, policy_version 952571 (0.00091) [2022-07-11 00:14:24,026][25689] Fps is (10 sec: 5680.2, 60 sec: 5527.0, 300 sec: 5537.8). Total num frames: 975439872. Throughput: 0: 5813.7. Samples: 975448826. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:24,027][25689] Avg episode reward: [(0, '0.268')] [2022-07-11 00:14:24,578][26022] Updated weights on worker 0-0, policy_version 952581 (0.00083) [2022-07-11 00:14:26,421][26022] Updated weights on worker 0-0, policy_version 952591 (0.00084) [2022-07-11 00:14:28,335][26022] Updated weights on worker 0-0, policy_version 952601 (0.00075) [2022-07-11 00:14:29,045][25689] Fps is (10 sec: 5719.4, 60 sec: 5543.8, 300 sec: 5541.4). Total num frames: 975468544. Throughput: 0: 5826.4. Samples: 975465500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:29,046][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 00:14:30,152][26022] Updated weights on worker 0-0, policy_version 952611 (0.00088) [2022-07-11 00:14:31,791][26022] Updated weights on worker 0-0, policy_version 952621 (0.00084) [2022-07-11 00:14:33,715][26022] Updated weights on worker 0-0, policy_version 952631 (0.00084) [2022-07-11 00:14:34,077][25689] Fps is (10 sec: 5501.4, 60 sec: 5527.1, 300 sec: 5538.0). Total num frames: 975495168. Throughput: 0: 5831.3. Samples: 975499194. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:34,079][25689] Avg episode reward: [(0, '-0.419')] [2022-07-11 00:14:35,695][26022] Updated weights on worker 0-0, policy_version 952641 (0.00085) [2022-07-11 00:14:37,347][26022] Updated weights on worker 0-0, policy_version 952651 (0.00085) [2022-07-11 00:14:39,154][25689] Fps is (10 sec: 5571.3, 60 sec: 5542.5, 300 sec: 5537.6). Total num frames: 975524864. Throughput: 0: 5835.3. Samples: 975532846. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:39,155][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 00:14:39,169][26022] Updated weights on worker 0-0, policy_version 952661 (0.01125) [2022-07-11 00:14:41,134][26022] Updated weights on worker 0-0, policy_version 952671 (0.00092) [2022-07-11 00:14:42,784][26022] Updated weights on worker 0-0, policy_version 952681 (0.00087) [2022-07-11 00:14:44,248][25689] Fps is (10 sec: 5638.3, 60 sec: 5537.9, 300 sec: 5540.1). Total num frames: 975552512. Throughput: 0: 4982.2. Samples: 975549574. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:44,249][25689] Avg episode reward: [(0, '0.082')] [2022-07-11 00:14:44,780][26022] Updated weights on worker 0-0, policy_version 952691 (0.00091) [2022-07-11 00:14:46,435][26022] Updated weights on worker 0-0, policy_version 952701 (0.00094) [2022-07-11 00:14:48,276][26022] Updated weights on worker 0-0, policy_version 952711 (0.00094) [2022-07-11 00:14:49,317][25689] Fps is (10 sec: 5541.5, 60 sec: 5553.4, 300 sec: 5540.6). Total num frames: 975581184. Throughput: 0: 5818.0. Samples: 975583444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:49,319][25689] Avg episode reward: [(0, '0.000')] [2022-07-11 00:14:50,339][26022] Updated weights on worker 0-0, policy_version 952721 (0.00089) [2022-07-11 00:14:51,855][26022] Updated weights on worker 0-0, policy_version 952731 (0.00092) [2022-07-11 00:14:53,858][26022] Updated weights on worker 0-0, policy_version 952741 (0.01060) [2022-07-11 00:14:54,356][25689] Fps is (10 sec: 5673.0, 60 sec: 5550.3, 300 sec: 5541.2). Total num frames: 975609856. Throughput: 0: 5823.2. Samples: 975617280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:54,356][25689] Avg episode reward: [(0, '0.201')] [2022-07-11 00:14:55,554][26022] Updated weights on worker 0-0, policy_version 952751 (0.00094) [2022-07-11 00:14:57,510][26022] Updated weights on worker 0-0, policy_version 952761 (0.00091) [2022-07-11 00:14:59,299][26022] Updated weights on worker 0-0, policy_version 952771 (0.00090) [2022-07-11 00:14:59,438][25689] Fps is (10 sec: 5564.8, 60 sec: 5551.9, 300 sec: 5536.2). Total num frames: 975637504. Throughput: 0: 4981.9. Samples: 975633900. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:14:59,439][25689] Avg episode reward: [(0, '0.215')] [2022-07-11 00:15:01,220][26022] Updated weights on worker 0-0, policy_version 952781 (0.00080) [2022-07-11 00:15:03,234][26022] Updated weights on worker 0-0, policy_version 952791 (0.00083) [2022-07-11 00:15:04,456][25689] Fps is (10 sec: 5373.3, 60 sec: 5538.7, 300 sec: 5540.1). Total num frames: 975664128. Throughput: 0: 5732.1. Samples: 975665410. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:04,457][25689] Avg episode reward: [(0, '1.065')] [2022-07-11 00:15:05,444][26022] Updated weights on worker 0-0, policy_version 952801 (0.00093) [2022-07-11 00:15:06,979][26022] Updated weights on worker 0-0, policy_version 952811 (0.00092) [2022-07-11 00:15:08,928][26022] Updated weights on worker 0-0, policy_version 952821 (0.00090) [2022-07-11 00:15:09,466][25689] Fps is (10 sec: 5412.4, 60 sec: 5546.0, 300 sec: 5540.0). Total num frames: 975691776. Throughput: 0: 5743.7. Samples: 975699168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:09,466][25689] Avg episode reward: [(0, '0.963')] [2022-07-11 00:15:10,792][26022] Updated weights on worker 0-0, policy_version 952831 (0.00096) [2022-07-11 00:15:12,487][26022] Updated weights on worker 0-0, policy_version 952841 (0.00097) [2022-07-11 00:15:14,458][26022] Updated weights on worker 0-0, policy_version 952851 (0.00091) [2022-07-11 00:15:14,498][25689] Fps is (10 sec: 5506.4, 60 sec: 5562.3, 300 sec: 5535.0). Total num frames: 975719424. Throughput: 0: 4891.1. Samples: 975715798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:14,499][25689] Avg episode reward: [(0, '0.858')] [2022-07-11 00:15:16,242][26022] Updated weights on worker 0-0, policy_version 952861 (0.00093) [2022-07-11 00:15:17,975][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:15:17,986][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000952871_975739904.pth [2022-07-11 00:15:17,986][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000950923_973745152.pth [2022-07-11 00:15:17,991][26022] Updated weights on worker 0-0, policy_version 952871 (0.00083) [2022-07-11 00:15:19,558][25689] Fps is (10 sec: 5580.5, 60 sec: 5565.0, 300 sec: 5541.3). Total num frames: 975748096. Throughput: 0: 5722.8. Samples: 975749040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:19,558][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 00:15:19,928][26022] Updated weights on worker 0-0, policy_version 952881 (0.00090) [2022-07-11 00:15:21,625][26022] Updated weights on worker 0-0, policy_version 952891 (0.00084) [2022-07-11 00:15:23,454][26022] Updated weights on worker 0-0, policy_version 952901 (0.00089) [2022-07-11 00:15:24,559][25689] Fps is (10 sec: 5598.0, 60 sec: 5548.6, 300 sec: 5534.6). Total num frames: 975775744. Throughput: 0: 5833.0. Samples: 975782670. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:24,559][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 00:15:25,399][26022] Updated weights on worker 0-0, policy_version 952911 (0.00087) [2022-07-11 00:15:27,270][26022] Updated weights on worker 0-0, policy_version 952921 (0.00096) [2022-07-11 00:15:29,131][26022] Updated weights on worker 0-0, policy_version 952931 (0.00100) [2022-07-11 00:15:29,568][25689] Fps is (10 sec: 5523.9, 60 sec: 5532.6, 300 sec: 5534.7). Total num frames: 975803392. Throughput: 0: 4983.8. Samples: 975799354. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:29,569][25689] Avg episode reward: [(0, '0.342')] [2022-07-11 00:15:30,879][26022] Updated weights on worker 0-0, policy_version 952941 (0.00085) [2022-07-11 00:15:32,706][26022] Updated weights on worker 0-0, policy_version 952951 (0.00085) [2022-07-11 00:15:34,592][25689] Fps is (10 sec: 5511.7, 60 sec: 5550.3, 300 sec: 5536.2). Total num frames: 975831040. Throughput: 0: 5818.1. Samples: 975832702. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:34,599][25689] Avg episode reward: [(0, '0.176')] [2022-07-11 00:15:34,626][26022] Updated weights on worker 0-0, policy_version 952961 (0.00089) [2022-07-11 00:15:36,488][26022] Updated weights on worker 0-0, policy_version 952971 (0.00102) [2022-07-11 00:15:38,161][26022] Updated weights on worker 0-0, policy_version 952981 (0.00079) [2022-07-11 00:15:39,676][25689] Fps is (10 sec: 5571.6, 60 sec: 5532.6, 300 sec: 5535.6). Total num frames: 975859712. Throughput: 0: 5841.4. Samples: 975866562. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:39,678][25689] Avg episode reward: [(0, '0.098')] [2022-07-11 00:15:40,128][26022] Updated weights on worker 0-0, policy_version 952991 (0.00089) [2022-07-11 00:15:41,754][26022] Updated weights on worker 0-0, policy_version 953001 (0.00090) [2022-07-11 00:15:43,672][26022] Updated weights on worker 0-0, policy_version 953011 (0.00089) [2022-07-11 00:15:44,696][25689] Fps is (10 sec: 5674.9, 60 sec: 5556.3, 300 sec: 5542.3). Total num frames: 975888384. Throughput: 0: 4994.9. Samples: 975883254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:44,704][25689] Avg episode reward: [(0, '0.285')] [2022-07-11 00:15:45,615][26022] Updated weights on worker 0-0, policy_version 953021 (0.00085) [2022-07-11 00:15:47,334][26022] Updated weights on worker 0-0, policy_version 953031 (0.00082) [2022-07-11 00:15:49,120][26022] Updated weights on worker 0-0, policy_version 953041 (0.00088) [2022-07-11 00:15:49,801][25689] Fps is (10 sec: 5562.6, 60 sec: 5536.2, 300 sec: 5533.5). Total num frames: 975916032. Throughput: 0: 5802.6. Samples: 975916758. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:49,802][25689] Avg episode reward: [(0, '-0.098')] [2022-07-11 00:15:50,996][26022] Updated weights on worker 0-0, policy_version 953051 (0.00087) [2022-07-11 00:15:52,892][26022] Updated weights on worker 0-0, policy_version 953061 (0.00087) [2022-07-11 00:15:54,690][26022] Updated weights on worker 0-0, policy_version 953071 (0.00096) [2022-07-11 00:15:54,827][25689] Fps is (10 sec: 5660.7, 60 sec: 5554.3, 300 sec: 5537.2). Total num frames: 975945728. Throughput: 0: 5797.3. Samples: 975950012. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:54,828][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 00:15:56,670][26022] Updated weights on worker 0-0, policy_version 953081 (0.00105) [2022-07-11 00:15:58,532][26022] Updated weights on worker 0-0, policy_version 953091 (0.00094) [2022-07-11 00:15:59,907][25689] Fps is (10 sec: 5471.9, 60 sec: 5520.6, 300 sec: 5542.8). Total num frames: 975971328. Throughput: 0: 4946.0. Samples: 975966620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:15:59,907][25689] Avg episode reward: [(0, '-0.102')] [2022-07-11 00:16:00,312][26022] Updated weights on worker 0-0, policy_version 953101 (0.00083) [2022-07-11 00:16:02,631][26022] Updated weights on worker 0-0, policy_version 953111 (0.00093) [2022-07-11 00:16:04,517][26022] Updated weights on worker 0-0, policy_version 953121 (0.00620) [2022-07-11 00:16:04,926][25689] Fps is (10 sec: 5171.2, 60 sec: 5520.5, 300 sec: 5529.3). Total num frames: 975997952. Throughput: 0: 5664.1. Samples: 975997834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:04,926][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 00:16:06,191][26022] Updated weights on worker 0-0, policy_version 953131 (0.00093) [2022-07-11 00:16:08,216][26022] Updated weights on worker 0-0, policy_version 953141 (0.00092) [2022-07-11 00:16:09,698][26022] Updated weights on worker 0-0, policy_version 953151 (0.00085) [2022-07-11 00:16:09,958][25689] Fps is (10 sec: 5501.6, 60 sec: 5535.4, 300 sec: 5539.2). Total num frames: 976026624. Throughput: 0: 5688.3. Samples: 976031414. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:09,958][25689] Avg episode reward: [(0, '-0.131')] [2022-07-11 00:16:11,874][26022] Updated weights on worker 0-0, policy_version 953161 (0.00085) [2022-07-11 00:16:13,784][26022] Updated weights on worker 0-0, policy_version 953171 (0.00089) [2022-07-11 00:16:14,981][25689] Fps is (10 sec: 5499.1, 60 sec: 5519.3, 300 sec: 5533.2). Total num frames: 976053248. Throughput: 0: 4857.2. Samples: 976047906. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:14,982][25689] Avg episode reward: [(0, '-0.795')] [2022-07-11 00:16:15,362][26022] Updated weights on worker 0-0, policy_version 953181 (0.00089) [2022-07-11 00:16:17,429][26022] Updated weights on worker 0-0, policy_version 953191 (0.00089) [2022-07-11 00:16:19,067][26022] Updated weights on worker 0-0, policy_version 953201 (0.00087) [2022-07-11 00:16:20,015][25689] Fps is (10 sec: 5396.3, 60 sec: 5504.7, 300 sec: 5529.5). Total num frames: 976080896. Throughput: 0: 5695.6. Samples: 976081148. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:20,015][25689] Avg episode reward: [(0, '-0.512')] [2022-07-11 00:16:21,071][26022] Updated weights on worker 0-0, policy_version 953211 (0.00088) [2022-07-11 00:16:22,890][26022] Updated weights on worker 0-0, policy_version 953221 (0.00092) [2022-07-11 00:16:24,718][26022] Updated weights on worker 0-0, policy_version 953231 (0.00091) [2022-07-11 00:16:25,020][25689] Fps is (10 sec: 5610.2, 60 sec: 5521.3, 300 sec: 5529.4). Total num frames: 976109568. Throughput: 0: 5786.8. Samples: 976114116. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:25,021][25689] Avg episode reward: [(0, '0.221')] [2022-07-11 00:16:26,736][26022] Updated weights on worker 0-0, policy_version 953241 (0.00087) [2022-07-11 00:16:28,577][26022] Updated weights on worker 0-0, policy_version 953251 (0.00091) [2022-07-11 00:16:30,032][25689] Fps is (10 sec: 5724.8, 60 sec: 5538.0, 300 sec: 5532.7). Total num frames: 976138240. Throughput: 0: 4948.5. Samples: 976130750. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:30,032][25689] Avg episode reward: [(0, '-0.147')] [2022-07-11 00:16:30,164][26022] Updated weights on worker 0-0, policy_version 953261 (0.00091) [2022-07-11 00:16:32,362][26022] Updated weights on worker 0-0, policy_version 953271 (0.00087) [2022-07-11 00:16:33,862][26022] Updated weights on worker 0-0, policy_version 953281 (0.00091) [2022-07-11 00:16:35,061][25689] Fps is (10 sec: 5405.1, 60 sec: 5503.6, 300 sec: 5526.0). Total num frames: 976163840. Throughput: 0: 5777.2. Samples: 976163912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:35,062][25689] Avg episode reward: [(0, '-0.057')] [2022-07-11 00:16:35,999][26022] Updated weights on worker 0-0, policy_version 953291 (0.00084) [2022-07-11 00:16:37,815][26022] Updated weights on worker 0-0, policy_version 953301 (0.00085) [2022-07-11 00:16:39,495][26022] Updated weights on worker 0-0, policy_version 953311 (0.00087) [2022-07-11 00:16:40,122][25689] Fps is (10 sec: 5378.6, 60 sec: 5505.8, 300 sec: 5528.6). Total num frames: 976192512. Throughput: 0: 5776.9. Samples: 976197304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:40,122][25689] Avg episode reward: [(0, '-0.525')] [2022-07-11 00:16:41,468][26022] Updated weights on worker 0-0, policy_version 953321 (0.00092) [2022-07-11 00:16:43,354][26022] Updated weights on worker 0-0, policy_version 953331 (0.00086) [2022-07-11 00:16:45,119][26022] Updated weights on worker 0-0, policy_version 953341 (0.00084) [2022-07-11 00:16:45,139][25689] Fps is (10 sec: 5690.2, 60 sec: 5506.1, 300 sec: 5531.9). Total num frames: 976221184. Throughput: 0: 4959.6. Samples: 976213896. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:45,139][25689] Avg episode reward: [(0, '-0.800')] [2022-07-11 00:16:47,083][26022] Updated weights on worker 0-0, policy_version 953351 (0.00088) [2022-07-11 00:16:48,658][26022] Updated weights on worker 0-0, policy_version 953361 (0.00088) [2022-07-11 00:16:50,161][25689] Fps is (10 sec: 5508.2, 60 sec: 5496.6, 300 sec: 5528.8). Total num frames: 976247808. Throughput: 0: 5776.8. Samples: 976247034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:50,161][25689] Avg episode reward: [(0, '-0.459')] [2022-07-11 00:16:50,983][26022] Updated weights on worker 0-0, policy_version 953371 (0.00080) [2022-07-11 00:16:52,752][26022] Updated weights on worker 0-0, policy_version 953381 (0.00092) [2022-07-11 00:16:54,518][26022] Updated weights on worker 0-0, policy_version 953391 (0.00088) [2022-07-11 00:16:55,178][25689] Fps is (10 sec: 5507.9, 60 sec: 5480.4, 300 sec: 5525.7). Total num frames: 976276480. Throughput: 0: 5780.4. Samples: 976280196. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:16:55,179][25689] Avg episode reward: [(0, '-0.524')] [2022-07-11 00:16:56,513][26022] Updated weights on worker 0-0, policy_version 953401 (0.00096) [2022-07-11 00:16:58,052][26022] Updated weights on worker 0-0, policy_version 953411 (0.00092) [2022-07-11 00:16:59,982][26022] Updated weights on worker 0-0, policy_version 953421 (0.00091) [2022-07-11 00:17:00,259][25689] Fps is (10 sec: 5577.4, 60 sec: 5514.3, 300 sec: 5531.4). Total num frames: 976304128. Throughput: 0: 5767.5. Samples: 976313442. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:00,259][25689] Avg episode reward: [(0, '-0.254')] [2022-07-11 00:17:02,247][26022] Updated weights on worker 0-0, policy_version 953431 (0.00082) [2022-07-11 00:17:03,904][26022] Updated weights on worker 0-0, policy_version 953441 (0.00091) [2022-07-11 00:17:05,263][25689] Fps is (10 sec: 5279.9, 60 sec: 5498.7, 300 sec: 5518.5). Total num frames: 976329728. Throughput: 0: 5683.7. Samples: 976328276. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:05,264][25689] Avg episode reward: [(0, '-0.925')] [2022-07-11 00:17:06,052][26022] Updated weights on worker 0-0, policy_version 953451 (0.00084) [2022-07-11 00:17:07,596][26022] Updated weights on worker 0-0, policy_version 953461 (0.00084) [2022-07-11 00:17:09,639][26022] Updated weights on worker 0-0, policy_version 953471 (0.00088) [2022-07-11 00:17:10,273][25689] Fps is (10 sec: 5419.3, 60 sec: 5500.7, 300 sec: 5525.5). Total num frames: 976358400. Throughput: 0: 5711.2. Samples: 976361900. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:10,278][25689] Avg episode reward: [(0, '0.564')] [2022-07-11 00:17:11,278][26022] Updated weights on worker 0-0, policy_version 953481 (0.00086) [2022-07-11 00:17:13,186][26022] Updated weights on worker 0-0, policy_version 953491 (0.00091) [2022-07-11 00:17:15,098][26022] Updated weights on worker 0-0, policy_version 953501 (0.00087) [2022-07-11 00:17:15,285][25689] Fps is (10 sec: 5517.6, 60 sec: 5501.8, 300 sec: 5519.3). Total num frames: 976385024. Throughput: 0: 5717.2. Samples: 976395150. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:15,285][25689] Avg episode reward: [(0, '0.362')] [2022-07-11 00:17:16,779][26022] Updated weights on worker 0-0, policy_version 953511 (0.00088) [2022-07-11 00:17:18,056][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:17:18,071][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000953517_976401408.pth [2022-07-11 00:17:18,071][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000951571_974408704.pth [2022-07-11 00:17:18,829][26022] Updated weights on worker 0-0, policy_version 953521 (0.00086) [2022-07-11 00:17:20,411][25689] Fps is (10 sec: 5555.2, 60 sec: 5527.2, 300 sec: 5524.1). Total num frames: 976414720. Throughput: 0: 4889.9. Samples: 976411988. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:20,412][25689] Avg episode reward: [(0, '0.604')] [2022-07-11 00:17:20,738][26022] Updated weights on worker 0-0, policy_version 953531 (0.00091) [2022-07-11 00:17:22,342][26022] Updated weights on worker 0-0, policy_version 953541 (0.00099) [2022-07-11 00:17:24,214][26022] Updated weights on worker 0-0, policy_version 953551 (0.00085) [2022-07-11 00:17:25,510][25689] Fps is (10 sec: 5608.2, 60 sec: 5501.8, 300 sec: 5523.5). Total num frames: 976442368. Throughput: 0: 5780.2. Samples: 976445306. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:25,511][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 00:17:26,208][26022] Updated weights on worker 0-0, policy_version 953561 (0.00087) [2022-07-11 00:17:27,966][26022] Updated weights on worker 0-0, policy_version 953571 (0.00083) [2022-07-11 00:17:29,956][26022] Updated weights on worker 0-0, policy_version 953581 (0.00090) [2022-07-11 00:17:30,514][25689] Fps is (10 sec: 5473.7, 60 sec: 5485.5, 300 sec: 5524.3). Total num frames: 976470016. Throughput: 0: 5767.4. Samples: 976478634. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:30,514][25689] Avg episode reward: [(0, '0.494')] [2022-07-11 00:17:31,479][26022] Updated weights on worker 0-0, policy_version 953591 (0.00093) [2022-07-11 00:17:33,531][26022] Updated weights on worker 0-0, policy_version 953601 (0.00081) [2022-07-11 00:17:35,151][26022] Updated weights on worker 0-0, policy_version 953611 (0.00085) [2022-07-11 00:17:35,545][25689] Fps is (10 sec: 5510.1, 60 sec: 5519.2, 300 sec: 5522.3). Total num frames: 976497664. Throughput: 0: 4942.2. Samples: 976495282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:35,546][25689] Avg episode reward: [(0, '1.126')] [2022-07-11 00:17:37,263][26022] Updated weights on worker 0-0, policy_version 953621 (0.00090) [2022-07-11 00:17:39,188][26022] Updated weights on worker 0-0, policy_version 953631 (0.00096) [2022-07-11 00:17:40,623][25689] Fps is (10 sec: 5571.1, 60 sec: 5517.7, 300 sec: 5519.0). Total num frames: 976526336. Throughput: 0: 5766.9. Samples: 976528546. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:40,623][25689] Avg episode reward: [(0, '0.354')] [2022-07-11 00:17:40,915][26022] Updated weights on worker 0-0, policy_version 953641 (0.00085) [2022-07-11 00:17:42,879][26022] Updated weights on worker 0-0, policy_version 953651 (0.00095) [2022-07-11 00:17:44,619][26022] Updated weights on worker 0-0, policy_version 953661 (0.00094) [2022-07-11 00:17:45,644][25689] Fps is (10 sec: 5475.4, 60 sec: 5483.4, 300 sec: 5516.7). Total num frames: 976552960. Throughput: 0: 5775.2. Samples: 976561586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:45,645][25689] Avg episode reward: [(0, '0.756')] [2022-07-11 00:17:46,566][26022] Updated weights on worker 0-0, policy_version 953671 (0.00089) [2022-07-11 00:17:48,474][26022] Updated weights on worker 0-0, policy_version 953681 (0.00079) [2022-07-11 00:17:50,082][26022] Updated weights on worker 0-0, policy_version 953691 (0.00082) [2022-07-11 00:17:50,647][25689] Fps is (10 sec: 5414.3, 60 sec: 5502.1, 300 sec: 5520.3). Total num frames: 976580608. Throughput: 0: 4949.9. Samples: 976578290. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:50,647][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 00:17:52,007][26022] Updated weights on worker 0-0, policy_version 953701 (0.00088) [2022-07-11 00:17:54,018][26022] Updated weights on worker 0-0, policy_version 953711 (0.00085) [2022-07-11 00:17:55,588][26022] Updated weights on worker 0-0, policy_version 953721 (0.00090) [2022-07-11 00:17:55,664][25689] Fps is (10 sec: 5723.3, 60 sec: 5519.1, 300 sec: 5520.9). Total num frames: 976610304. Throughput: 0: 5796.0. Samples: 976611888. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:17:55,664][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 00:17:57,636][26022] Updated weights on worker 0-0, policy_version 953731 (0.00088) [2022-07-11 00:17:59,249][26022] Updated weights on worker 0-0, policy_version 953741 (0.00052) [2022-07-11 00:18:00,771][25689] Fps is (10 sec: 5663.9, 60 sec: 5516.6, 300 sec: 5529.4). Total num frames: 976637952. Throughput: 0: 5795.7. Samples: 976645318. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:18:00,772][25689] Avg episode reward: [(0, '0.774')] [2022-07-11 00:18:01,195][26022] Updated weights on worker 0-0, policy_version 953751 (0.00094) [2022-07-11 00:18:03,313][26022] Updated weights on worker 0-0, policy_version 953761 (0.00081) [2022-07-11 00:18:05,135][26022] Updated weights on worker 0-0, policy_version 953771 (0.00082) [2022-07-11 00:18:05,785][25689] Fps is (10 sec: 5260.8, 60 sec: 5515.7, 300 sec: 5523.3). Total num frames: 976663552. Throughput: 0: 4877.9. Samples: 976659832. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2022-07-11 00:18:05,787][25689] Avg episode reward: [(0, '1.335')] [2022-07-11 00:18:07,012][26022] Updated weights on worker 0-0, policy_version 953781 (0.00094) [2022-07-11 00:18:08,955][26022] Updated weights on worker 0-0, policy_version 953791 (0.00083) [2022-07-11 00:18:10,721][26022] Updated weights on worker 0-0, policy_version 953801 (0.00080) [2022-07-11 00:18:10,815][25689] Fps is (10 sec: 5403.6, 60 sec: 5514.0, 300 sec: 5523.8). Total num frames: 976692224. Throughput: 0: 5701.3. Samples: 976693274. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:10,817][25689] Avg episode reward: [(0, '1.257')] [2022-07-11 00:18:12,603][26022] Updated weights on worker 0-0, policy_version 953811 (0.00096) [2022-07-11 00:18:14,498][26022] Updated weights on worker 0-0, policy_version 953821 (0.00090) [2022-07-11 00:18:15,889][25689] Fps is (10 sec: 5472.9, 60 sec: 5508.3, 300 sec: 5521.2). Total num frames: 976718848. Throughput: 0: 5667.7. Samples: 976726518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:15,891][25689] Avg episode reward: [(0, '0.253')] [2022-07-11 00:18:16,470][26022] Updated weights on worker 0-0, policy_version 953831 (0.00085) [2022-07-11 00:18:18,120][26022] Updated weights on worker 0-0, policy_version 953841 (0.00086) [2022-07-11 00:18:20,075][26022] Updated weights on worker 0-0, policy_version 953851 (0.00473) [2022-07-11 00:18:20,955][25689] Fps is (10 sec: 5553.8, 60 sec: 5513.7, 300 sec: 5524.2). Total num frames: 976748544. Throughput: 0: 4852.1. Samples: 976743254. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:20,957][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 00:18:21,731][26022] Updated weights on worker 0-0, policy_version 953861 (0.00092) [2022-07-11 00:18:23,737][26022] Updated weights on worker 0-0, policy_version 953871 (0.00091) [2022-07-11 00:18:25,394][26022] Updated weights on worker 0-0, policy_version 953881 (0.00088) [2022-07-11 00:18:26,020][25689] Fps is (10 sec: 5659.9, 60 sec: 5516.8, 300 sec: 5523.3). Total num frames: 976776192. Throughput: 0: 5778.7. Samples: 976776766. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:26,022][25689] Avg episode reward: [(0, '0.504')] [2022-07-11 00:18:27,446][26022] Updated weights on worker 0-0, policy_version 953891 (0.00114) [2022-07-11 00:18:29,130][26022] Updated weights on worker 0-0, policy_version 953901 (0.00089) [2022-07-11 00:18:31,053][25689] Fps is (10 sec: 5374.8, 60 sec: 5497.2, 300 sec: 5519.9). Total num frames: 976802816. Throughput: 0: 5769.5. Samples: 976810038. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:31,054][25689] Avg episode reward: [(0, '-0.411')] [2022-07-11 00:18:31,146][26022] Updated weights on worker 0-0, policy_version 953911 (0.00086) [2022-07-11 00:18:32,863][26022] Updated weights on worker 0-0, policy_version 953921 (0.00094) [2022-07-11 00:18:34,662][26022] Updated weights on worker 0-0, policy_version 953931 (0.00091) [2022-07-11 00:18:36,084][25689] Fps is (10 sec: 5698.1, 60 sec: 5548.0, 300 sec: 5527.3). Total num frames: 976833536. Throughput: 0: 4968.2. Samples: 976826854. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:36,085][25689] Avg episode reward: [(0, '-0.192')] [2022-07-11 00:18:36,414][26022] Updated weights on worker 0-0, policy_version 953941 (0.00059) [2022-07-11 00:18:38,571][26022] Updated weights on worker 0-0, policy_version 953951 (0.00086) [2022-07-11 00:18:40,044][26022] Updated weights on worker 0-0, policy_version 953961 (0.00089) [2022-07-11 00:18:41,215][25689] Fps is (10 sec: 5643.1, 60 sec: 5509.4, 300 sec: 5522.3). Total num frames: 976860160. Throughput: 0: 5795.1. Samples: 976860658. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:41,215][25689] Avg episode reward: [(0, '0.678')] [2022-07-11 00:18:42,247][26022] Updated weights on worker 0-0, policy_version 953971 (0.00086) [2022-07-11 00:18:43,666][26022] Updated weights on worker 0-0, policy_version 953981 (0.00092) [2022-07-11 00:18:45,635][26022] Updated weights on worker 0-0, policy_version 953991 (0.00443) [2022-07-11 00:18:46,273][25689] Fps is (10 sec: 5527.4, 60 sec: 5556.7, 300 sec: 5529.1). Total num frames: 976889856. Throughput: 0: 5809.8. Samples: 976894432. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:46,274][25689] Avg episode reward: [(0, '0.356')] [2022-07-11 00:18:47,626][26022] Updated weights on worker 0-0, policy_version 954001 (0.00095) [2022-07-11 00:18:49,224][26022] Updated weights on worker 0-0, policy_version 954011 (0.00084) [2022-07-11 00:18:51,247][26022] Updated weights on worker 0-0, policy_version 954021 (0.00099) [2022-07-11 00:18:51,331][25689] Fps is (10 sec: 5769.6, 60 sec: 5568.5, 300 sec: 5528.1). Total num frames: 976918528. Throughput: 0: 4998.6. Samples: 976911400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:51,332][25689] Avg episode reward: [(0, '0.037')] [2022-07-11 00:18:53,162][26022] Updated weights on worker 0-0, policy_version 954031 (0.00094) [2022-07-11 00:18:54,933][26022] Updated weights on worker 0-0, policy_version 954041 (0.00080) [2022-07-11 00:18:56,343][25689] Fps is (10 sec: 5592.9, 60 sec: 5535.1, 300 sec: 5529.7). Total num frames: 976946176. Throughput: 0: 5806.5. Samples: 976944490. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:18:56,344][25689] Avg episode reward: [(0, '-0.124')] [2022-07-11 00:18:56,796][26022] Updated weights on worker 0-0, policy_version 954051 (0.00094) [2022-07-11 00:18:58,619][26022] Updated weights on worker 0-0, policy_version 954061 (0.00084) [2022-07-11 00:19:00,503][26022] Updated weights on worker 0-0, policy_version 954071 (0.00084) [2022-07-11 00:19:01,387][25689] Fps is (10 sec: 5499.0, 60 sec: 5541.0, 300 sec: 5530.0). Total num frames: 976973824. Throughput: 0: 5832.8. Samples: 976978318. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:01,387][25689] Avg episode reward: [(0, '0.592')] [2022-07-11 00:19:02,495][26022] Updated weights on worker 0-0, policy_version 954081 (0.00102) [2022-07-11 00:19:04,575][26022] Updated weights on worker 0-0, policy_version 954091 (0.00079) [2022-07-11 00:19:06,242][26022] Updated weights on worker 0-0, policy_version 954101 (0.00829) [2022-07-11 00:19:06,434][25689] Fps is (10 sec: 5277.2, 60 sec: 5538.0, 300 sec: 5523.9). Total num frames: 976999424. Throughput: 0: 4888.2. Samples: 976992978. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:06,434][25689] Avg episode reward: [(0, '0.459')] [2022-07-11 00:19:08,131][26022] Updated weights on worker 0-0, policy_version 954111 (0.00086) [2022-07-11 00:19:09,807][26022] Updated weights on worker 0-0, policy_version 954121 (0.00094) [2022-07-11 00:19:11,443][25689] Fps is (10 sec: 5295.2, 60 sec: 5523.0, 300 sec: 5527.7). Total num frames: 977027072. Throughput: 0: 5727.7. Samples: 977026592. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:11,443][25689] Avg episode reward: [(0, '0.002')] [2022-07-11 00:19:11,680][26022] Updated weights on worker 0-0, policy_version 954131 (0.00090) [2022-07-11 00:19:13,676][26022] Updated weights on worker 0-0, policy_version 954141 (0.00090) [2022-07-11 00:19:15,228][26022] Updated weights on worker 0-0, policy_version 954151 (0.00094) [2022-07-11 00:19:16,458][25689] Fps is (10 sec: 5618.2, 60 sec: 5562.1, 300 sec: 5529.0). Total num frames: 977055744. Throughput: 0: 5746.9. Samples: 977060088. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:16,459][25689] Avg episode reward: [(0, '0.104')] [2022-07-11 00:19:17,385][26022] Updated weights on worker 0-0, policy_version 954161 (0.00087) [2022-07-11 00:19:18,297][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:19:18,310][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000954167_977067008.pth [2022-07-11 00:19:18,311][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000952221_975074304.pth [2022-07-11 00:19:19,071][26022] Updated weights on worker 0-0, policy_version 954171 (0.00095) [2022-07-11 00:19:20,791][26022] Updated weights on worker 0-0, policy_version 954181 (0.00092) [2022-07-11 00:19:21,502][25689] Fps is (10 sec: 5598.6, 60 sec: 5530.4, 300 sec: 5524.9). Total num frames: 977083392. Throughput: 0: 5719.4. Samples: 977093366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:21,503][25689] Avg episode reward: [(0, '0.002')] [2022-07-11 00:19:22,864][26022] Updated weights on worker 0-0, policy_version 954191 (0.00088) [2022-07-11 00:19:24,674][26022] Updated weights on worker 0-0, policy_version 954201 (0.00091) [2022-07-11 00:19:26,506][25689] Fps is (10 sec: 5503.5, 60 sec: 5536.0, 300 sec: 5521.7). Total num frames: 977111040. Throughput: 0: 5837.4. Samples: 977110148. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:26,506][25689] Avg episode reward: [(0, '0.317')] [2022-07-11 00:19:26,633][26022] Updated weights on worker 0-0, policy_version 954211 (0.00090) [2022-07-11 00:19:28,493][26022] Updated weights on worker 0-0, policy_version 954221 (0.00086) [2022-07-11 00:19:30,170][26022] Updated weights on worker 0-0, policy_version 954231 (0.00097) [2022-07-11 00:19:31,513][25689] Fps is (10 sec: 5523.6, 60 sec: 5555.3, 300 sec: 5525.6). Total num frames: 977138688. Throughput: 0: 5818.5. Samples: 977143372. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:31,514][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 00:19:32,273][26022] Updated weights on worker 0-0, policy_version 954241 (0.00088) [2022-07-11 00:19:33,855][26022] Updated weights on worker 0-0, policy_version 954251 (0.00088) [2022-07-11 00:19:35,997][26022] Updated weights on worker 0-0, policy_version 954261 (0.00087) [2022-07-11 00:19:36,516][25689] Fps is (10 sec: 5626.0, 60 sec: 5523.9, 300 sec: 5523.6). Total num frames: 977167360. Throughput: 0: 4983.6. Samples: 977160050. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:36,517][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 00:19:37,327][26022] Updated weights on worker 0-0, policy_version 954271 (0.00089) [2022-07-11 00:19:39,639][26022] Updated weights on worker 0-0, policy_version 954281 (0.00085) [2022-07-11 00:19:40,980][26022] Updated weights on worker 0-0, policy_version 954291 (0.00091) [2022-07-11 00:19:41,637][25689] Fps is (10 sec: 5664.2, 60 sec: 5558.7, 300 sec: 5526.5). Total num frames: 977196032. Throughput: 0: 4979.1. Samples: 977193620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:41,639][25689] Avg episode reward: [(0, '-0.497')] [2022-07-11 00:19:43,063][26022] Updated weights on worker 0-0, policy_version 954301 (0.00088) [2022-07-11 00:19:44,775][26022] Updated weights on worker 0-0, policy_version 954311 (0.00098) [2022-07-11 00:19:46,640][25689] Fps is (10 sec: 5563.4, 60 sec: 5530.0, 300 sec: 5524.3). Total num frames: 977223680. Throughput: 0: 5827.6. Samples: 977227480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:46,641][25689] Avg episode reward: [(0, '-0.264')] [2022-07-11 00:19:46,644][26022] Updated weights on worker 0-0, policy_version 954321 (0.00084) [2022-07-11 00:19:48,590][26022] Updated weights on worker 0-0, policy_version 954331 (0.00095) [2022-07-11 00:19:50,319][26022] Updated weights on worker 0-0, policy_version 954341 (0.00058) [2022-07-11 00:19:51,723][25689] Fps is (10 sec: 5482.7, 60 sec: 5510.7, 300 sec: 5520.0). Total num frames: 977251328. Throughput: 0: 5821.3. Samples: 977261018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:51,723][25689] Avg episode reward: [(0, '-1.041')] [2022-07-11 00:19:52,177][26022] Updated weights on worker 0-0, policy_version 954351 (0.00082) [2022-07-11 00:19:54,064][26022] Updated weights on worker 0-0, policy_version 954361 (0.00090) [2022-07-11 00:19:55,732][26022] Updated weights on worker 0-0, policy_version 954371 (0.00087) [2022-07-11 00:19:56,758][25689] Fps is (10 sec: 5566.4, 60 sec: 5525.6, 300 sec: 5524.3). Total num frames: 977280000. Throughput: 0: 5811.2. Samples: 977277676. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:19:56,760][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 00:19:58,003][26022] Updated weights on worker 0-0, policy_version 954381 (0.00085) [2022-07-11 00:19:59,310][26022] Updated weights on worker 0-0, policy_version 954391 (0.00090) [2022-07-11 00:20:01,522][26022] Updated weights on worker 0-0, policy_version 954401 (0.00086) [2022-07-11 00:20:01,811][25689] Fps is (10 sec: 5481.4, 60 sec: 5507.8, 300 sec: 5523.7). Total num frames: 977306624. Throughput: 0: 5826.3. Samples: 977311158. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:01,811][25689] Avg episode reward: [(0, '-0.654')] [2022-07-11 00:20:03,648][26022] Updated weights on worker 0-0, policy_version 954411 (0.00091) [2022-07-11 00:20:05,415][26022] Updated weights on worker 0-0, policy_version 954421 (0.00083) [2022-07-11 00:20:06,812][25689] Fps is (10 sec: 5398.1, 60 sec: 5545.9, 300 sec: 5523.9). Total num frames: 977334272. Throughput: 0: 5700.7. Samples: 977342474. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:06,812][25689] Avg episode reward: [(0, '0.363')] [2022-07-11 00:20:07,344][26022] Updated weights on worker 0-0, policy_version 954431 (0.00084) [2022-07-11 00:20:09,059][26022] Updated weights on worker 0-0, policy_version 954441 (0.00084) [2022-07-11 00:20:11,115][26022] Updated weights on worker 0-0, policy_version 954451 (0.00090) [2022-07-11 00:20:11,816][25689] Fps is (10 sec: 5526.7, 60 sec: 5546.3, 300 sec: 5524.4). Total num frames: 977361920. Throughput: 0: 4884.0. Samples: 977359154. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:11,816][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 00:20:13,000][26022] Updated weights on worker 0-0, policy_version 954461 (0.00054) [2022-07-11 00:20:14,777][26022] Updated weights on worker 0-0, policy_version 954471 (0.00083) [2022-07-11 00:20:16,609][26022] Updated weights on worker 0-0, policy_version 954481 (0.00091) [2022-07-11 00:20:16,821][25689] Fps is (10 sec: 5524.4, 60 sec: 5530.3, 300 sec: 5521.9). Total num frames: 977389568. Throughput: 0: 5738.0. Samples: 977392800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:16,821][25689] Avg episode reward: [(0, '1.444')] [2022-07-11 00:20:18,272][26022] Updated weights on worker 0-0, policy_version 954491 (0.00088) [2022-07-11 00:20:20,394][26022] Updated weights on worker 0-0, policy_version 954501 (0.00371) [2022-07-11 00:20:21,898][25689] Fps is (10 sec: 5687.9, 60 sec: 5561.2, 300 sec: 5527.4). Total num frames: 977419264. Throughput: 0: 5716.7. Samples: 977425990. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:21,898][25689] Avg episode reward: [(0, '-0.001')] [2022-07-11 00:20:21,902][26022] Updated weights on worker 0-0, policy_version 954511 (0.00092) [2022-07-11 00:20:24,150][26022] Updated weights on worker 0-0, policy_version 954521 (0.00087) [2022-07-11 00:20:25,582][26022] Updated weights on worker 0-0, policy_version 954531 (0.00088) [2022-07-11 00:20:26,921][25689] Fps is (10 sec: 5576.0, 60 sec: 5542.4, 300 sec: 5523.7). Total num frames: 977445888. Throughput: 0: 4987.2. Samples: 977442766. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:26,922][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 00:20:27,685][26022] Updated weights on worker 0-0, policy_version 954541 (0.00087) [2022-07-11 00:20:29,416][26022] Updated weights on worker 0-0, policy_version 954551 (0.00100) [2022-07-11 00:20:31,247][26022] Updated weights on worker 0-0, policy_version 954561 (0.00084) [2022-07-11 00:20:31,940][25689] Fps is (10 sec: 5608.4, 60 sec: 5575.3, 300 sec: 5530.7). Total num frames: 977475584. Throughput: 0: 5828.8. Samples: 977476452. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:31,940][25689] Avg episode reward: [(0, '-0.391')] [2022-07-11 00:20:33,124][26022] Updated weights on worker 0-0, policy_version 954571 (0.00093) [2022-07-11 00:20:34,899][26022] Updated weights on worker 0-0, policy_version 954581 (0.00092) [2022-07-11 00:20:36,799][26022] Updated weights on worker 0-0, policy_version 954591 (0.00084) [2022-07-11 00:20:36,998][25689] Fps is (10 sec: 5690.9, 60 sec: 5553.3, 300 sec: 5527.8). Total num frames: 977503232. Throughput: 0: 5803.8. Samples: 977509902. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:36,998][25689] Avg episode reward: [(0, '-0.105')] [2022-07-11 00:20:38,538][26022] Updated weights on worker 0-0, policy_version 954601 (0.00096) [2022-07-11 00:20:40,361][26022] Updated weights on worker 0-0, policy_version 954611 (0.00091) [2022-07-11 00:20:42,046][25689] Fps is (10 sec: 5370.1, 60 sec: 5526.1, 300 sec: 5520.3). Total num frames: 977529856. Throughput: 0: 4998.3. Samples: 977526698. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:42,048][25689] Avg episode reward: [(0, '-0.353')] [2022-07-11 00:20:42,351][26022] Updated weights on worker 0-0, policy_version 954621 (0.00081) [2022-07-11 00:20:44,047][26022] Updated weights on worker 0-0, policy_version 954631 (0.00085) [2022-07-11 00:20:46,038][26022] Updated weights on worker 0-0, policy_version 954641 (0.00086) [2022-07-11 00:20:47,071][25689] Fps is (10 sec: 5591.1, 60 sec: 5557.9, 300 sec: 5528.7). Total num frames: 977559552. Throughput: 0: 5818.8. Samples: 977560012. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:47,072][25689] Avg episode reward: [(0, '-0.444')] [2022-07-11 00:20:47,668][26022] Updated weights on worker 0-0, policy_version 954651 (0.00082) [2022-07-11 00:20:49,806][26022] Updated weights on worker 0-0, policy_version 954661 (0.00093) [2022-07-11 00:20:51,719][26022] Updated weights on worker 0-0, policy_version 954671 (0.00095) [2022-07-11 00:20:52,137][25689] Fps is (10 sec: 5479.7, 60 sec: 5525.6, 300 sec: 5514.2). Total num frames: 977585152. Throughput: 0: 5778.5. Samples: 977593162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:52,138][25689] Avg episode reward: [(0, '0.077')] [2022-07-11 00:20:53,294][26022] Updated weights on worker 0-0, policy_version 954681 (0.00087) [2022-07-11 00:20:55,281][26022] Updated weights on worker 0-0, policy_version 954691 (0.00095) [2022-07-11 00:20:56,972][26022] Updated weights on worker 0-0, policy_version 954701 (0.00085) [2022-07-11 00:20:57,151][25689] Fps is (10 sec: 5384.3, 60 sec: 5527.5, 300 sec: 5525.8). Total num frames: 977613824. Throughput: 0: 4950.4. Samples: 977609666. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:20:57,152][25689] Avg episode reward: [(0, '0.435')] [2022-07-11 00:20:59,010][26022] Updated weights on worker 0-0, policy_version 954711 (0.00107) [2022-07-11 00:21:00,812][26022] Updated weights on worker 0-0, policy_version 954721 (0.00089) [2022-07-11 00:21:02,240][25689] Fps is (10 sec: 5371.7, 60 sec: 5507.2, 300 sec: 5521.0). Total num frames: 977639424. Throughput: 0: 5760.9. Samples: 977643036. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:02,241][25689] Avg episode reward: [(0, '-0.725')] [2022-07-11 00:21:02,968][26022] Updated weights on worker 0-0, policy_version 954731 (0.00092) [2022-07-11 00:21:04,951][26022] Updated weights on worker 0-0, policy_version 954741 (0.00089) [2022-07-11 00:21:06,678][26022] Updated weights on worker 0-0, policy_version 954751 (0.00094) [2022-07-11 00:21:07,259][25689] Fps is (10 sec: 5267.9, 60 sec: 5505.6, 300 sec: 5517.8). Total num frames: 977667072. Throughput: 0: 5666.4. Samples: 977674404. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:07,260][25689] Avg episode reward: [(0, '-1.113')] [2022-07-11 00:21:08,479][26022] Updated weights on worker 0-0, policy_version 954761 (0.00096) [2022-07-11 00:21:10,239][26022] Updated weights on worker 0-0, policy_version 954771 (0.00094) [2022-07-11 00:21:12,041][26022] Updated weights on worker 0-0, policy_version 954781 (0.00086) [2022-07-11 00:21:12,340][25689] Fps is (10 sec: 5779.2, 60 sec: 5549.4, 300 sec: 5530.5). Total num frames: 977697792. Throughput: 0: 4851.3. Samples: 977691172. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:12,341][25689] Avg episode reward: [(0, '-1.080')] [2022-07-11 00:21:14,306][26022] Updated weights on worker 0-0, policy_version 954791 (0.00086) [2022-07-11 00:21:15,613][26022] Updated weights on worker 0-0, policy_version 954801 (0.00082) [2022-07-11 00:21:17,395][25689] Fps is (10 sec: 5556.4, 60 sec: 5511.0, 300 sec: 5523.2). Total num frames: 977723392. Throughput: 0: 5691.9. Samples: 977724894. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:17,395][25689] Avg episode reward: [(0, '-1.075')] [2022-07-11 00:21:17,901][26022] Updated weights on worker 0-0, policy_version 954811 (0.00088) [2022-07-11 00:21:18,360][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:21:18,373][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000954814_977729536.pth [2022-07-11 00:21:18,373][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000952871_975739904.pth [2022-07-11 00:21:19,303][26022] Updated weights on worker 0-0, policy_version 954821 (0.00087) [2022-07-11 00:21:21,567][26022] Updated weights on worker 0-0, policy_version 954831 (0.00092) [2022-07-11 00:21:22,445][25689] Fps is (10 sec: 5370.7, 60 sec: 5496.5, 300 sec: 5522.4). Total num frames: 977752064. Throughput: 0: 5680.5. Samples: 977757810. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:22,446][25689] Avg episode reward: [(0, '-0.439')] [2022-07-11 00:21:23,221][26022] Updated weights on worker 0-0, policy_version 954841 (0.00089) [2022-07-11 00:21:25,080][26022] Updated weights on worker 0-0, policy_version 954851 (0.00082) [2022-07-11 00:21:26,904][26022] Updated weights on worker 0-0, policy_version 954861 (0.00094) [2022-07-11 00:21:27,451][25689] Fps is (10 sec: 5600.4, 60 sec: 5515.0, 300 sec: 5519.0). Total num frames: 977779712. Throughput: 0: 4957.3. Samples: 977774504. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:27,452][25689] Avg episode reward: [(0, '-0.037')] [2022-07-11 00:21:29,080][26022] Updated weights on worker 0-0, policy_version 954871 (0.00106) [2022-07-11 00:21:30,570][26022] Updated weights on worker 0-0, policy_version 954881 (0.00094) [2022-07-11 00:21:32,467][25689] Fps is (10 sec: 5517.7, 60 sec: 5481.4, 300 sec: 5526.2). Total num frames: 977807360. Throughput: 0: 5806.0. Samples: 977808028. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:32,467][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 00:21:32,647][26022] Updated weights on worker 0-0, policy_version 954891 (0.00087) [2022-07-11 00:21:34,110][26022] Updated weights on worker 0-0, policy_version 954901 (0.00091) [2022-07-11 00:21:36,302][26022] Updated weights on worker 0-0, policy_version 954911 (0.00092) [2022-07-11 00:21:37,476][25689] Fps is (10 sec: 5618.3, 60 sec: 5502.8, 300 sec: 5527.2). Total num frames: 977836032. Throughput: 0: 5820.8. Samples: 977841780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:37,476][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 00:21:37,886][26022] Updated weights on worker 0-0, policy_version 954921 (0.00086) [2022-07-11 00:21:39,830][26022] Updated weights on worker 0-0, policy_version 954931 (0.00091) [2022-07-11 00:21:41,743][26022] Updated weights on worker 0-0, policy_version 954941 (0.00085) [2022-07-11 00:21:42,623][25689] Fps is (10 sec: 5646.5, 60 sec: 5527.7, 300 sec: 5524.7). Total num frames: 977864704. Throughput: 0: 4984.6. Samples: 977858386. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:42,623][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 00:21:43,548][26022] Updated weights on worker 0-0, policy_version 954951 (0.00098) [2022-07-11 00:21:45,421][26022] Updated weights on worker 0-0, policy_version 954961 (0.00091) [2022-07-11 00:21:47,366][26022] Updated weights on worker 0-0, policy_version 954971 (0.00094) [2022-07-11 00:21:47,656][25689] Fps is (10 sec: 5532.1, 60 sec: 5493.1, 300 sec: 5528.0). Total num frames: 977892352. Throughput: 0: 5793.8. Samples: 977891568. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:47,657][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 00:21:49,050][26022] Updated weights on worker 0-0, policy_version 954981 (0.00093) [2022-07-11 00:21:51,091][26022] Updated weights on worker 0-0, policy_version 954991 (0.00075) [2022-07-11 00:21:52,587][26022] Updated weights on worker 0-0, policy_version 955001 (0.00085) [2022-07-11 00:21:52,682][25689] Fps is (10 sec: 5598.6, 60 sec: 5547.5, 300 sec: 5527.8). Total num frames: 977921024. Throughput: 0: 5786.3. Samples: 977925000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:52,683][25689] Avg episode reward: [(0, '1.431')] [2022-07-11 00:21:54,687][26022] Updated weights on worker 0-0, policy_version 955011 (0.00091) [2022-07-11 00:21:56,412][26022] Updated weights on worker 0-0, policy_version 955021 (0.00087) [2022-07-11 00:21:57,741][25689] Fps is (10 sec: 5482.9, 60 sec: 5509.5, 300 sec: 5524.8). Total num frames: 977947648. Throughput: 0: 4931.3. Samples: 977941716. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:21:57,745][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 00:21:58,274][26022] Updated weights on worker 0-0, policy_version 955031 (0.00092) [2022-07-11 00:22:00,043][26022] Updated weights on worker 0-0, policy_version 955041 (0.00086) [2022-07-11 00:22:02,579][26022] Updated weights on worker 0-0, policy_version 955051 (0.00083) [2022-07-11 00:22:02,819][25689] Fps is (10 sec: 5152.1, 60 sec: 5510.6, 300 sec: 5523.4). Total num frames: 977973248. Throughput: 0: 5766.4. Samples: 977974844. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2022-07-11 00:22:02,819][25689] Avg episode reward: [(0, '0.650')] [2022-07-11 00:22:04,243][26022] Updated weights on worker 0-0, policy_version 955061 (0.00092) [2022-07-11 00:22:06,004][26022] Updated weights on worker 0-0, policy_version 955071 (0.00088) [2022-07-11 00:22:07,815][26022] Updated weights on worker 0-0, policy_version 955081 (0.00084) [2022-07-11 00:22:07,835][25689] Fps is (10 sec: 5478.2, 60 sec: 5544.6, 300 sec: 5526.7). Total num frames: 978002944. Throughput: 0: 5679.7. Samples: 978006178. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:07,836][25689] Avg episode reward: [(0, '0.566')] [2022-07-11 00:22:09,821][26022] Updated weights on worker 0-0, policy_version 955091 (0.00085) [2022-07-11 00:22:11,615][26022] Updated weights on worker 0-0, policy_version 955101 (0.00085) [2022-07-11 00:22:12,842][25689] Fps is (10 sec: 5618.9, 60 sec: 5483.7, 300 sec: 5526.8). Total num frames: 978029568. Throughput: 0: 5680.5. Samples: 978039518. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:12,844][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 00:22:13,435][26022] Updated weights on worker 0-0, policy_version 955111 (0.00086) [2022-07-11 00:22:15,315][26022] Updated weights on worker 0-0, policy_version 955121 (0.00093) [2022-07-11 00:22:17,112][26022] Updated weights on worker 0-0, policy_version 955131 (0.00090) [2022-07-11 00:22:17,859][25689] Fps is (10 sec: 5312.3, 60 sec: 5504.1, 300 sec: 5518.5). Total num frames: 978056192. Throughput: 0: 5693.1. Samples: 978056246. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:17,861][25689] Avg episode reward: [(0, '0.077')] [2022-07-11 00:22:19,003][26022] Updated weights on worker 0-0, policy_version 955141 (0.00087) [2022-07-11 00:22:21,011][26022] Updated weights on worker 0-0, policy_version 955151 (0.00087) [2022-07-11 00:22:22,643][26022] Updated weights on worker 0-0, policy_version 955161 (0.00092) [2022-07-11 00:22:22,922][25689] Fps is (10 sec: 5689.1, 60 sec: 5536.8, 300 sec: 5529.5). Total num frames: 978086912. Throughput: 0: 5696.7. Samples: 978089364. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:22,922][25689] Avg episode reward: [(0, '-1.582')] [2022-07-11 00:22:24,567][26022] Updated weights on worker 0-0, policy_version 955171 (0.00106) [2022-07-11 00:22:26,299][26022] Updated weights on worker 0-0, policy_version 955181 (0.00052) [2022-07-11 00:22:27,991][25689] Fps is (10 sec: 5760.8, 60 sec: 5531.1, 300 sec: 5528.3). Total num frames: 978114560. Throughput: 0: 5795.5. Samples: 978122990. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:27,991][25689] Avg episode reward: [(0, '-2.056')] [2022-07-11 00:22:28,156][26022] Updated weights on worker 0-0, policy_version 955191 (0.00093) [2022-07-11 00:22:30,075][26022] Updated weights on worker 0-0, policy_version 955201 (0.00092) [2022-07-11 00:22:31,785][26022] Updated weights on worker 0-0, policy_version 955211 (0.00085) [2022-07-11 00:22:33,061][25689] Fps is (10 sec: 5352.6, 60 sec: 5509.2, 300 sec: 5524.1). Total num frames: 978141184. Throughput: 0: 4947.8. Samples: 978139556. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:33,061][25689] Avg episode reward: [(0, '-1.162')] [2022-07-11 00:22:33,695][26022] Updated weights on worker 0-0, policy_version 955221 (0.00094) [2022-07-11 00:22:35,583][26022] Updated weights on worker 0-0, policy_version 955231 (0.00100) [2022-07-11 00:22:37,455][26022] Updated weights on worker 0-0, policy_version 955241 (0.00088) [2022-07-11 00:22:38,125][25689] Fps is (10 sec: 5557.1, 60 sec: 5521.0, 300 sec: 5527.8). Total num frames: 978170880. Throughput: 0: 5772.3. Samples: 978173232. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:38,126][25689] Avg episode reward: [(0, '-0.894')] [2022-07-11 00:22:39,363][26022] Updated weights on worker 0-0, policy_version 955251 (0.00085) [2022-07-11 00:22:41,079][26022] Updated weights on worker 0-0, policy_version 955261 (0.00087) [2022-07-11 00:22:42,983][26022] Updated weights on worker 0-0, policy_version 955271 (0.00092) [2022-07-11 00:22:43,179][25689] Fps is (10 sec: 5566.2, 60 sec: 5495.7, 300 sec: 5527.2). Total num frames: 978197504. Throughput: 0: 5784.3. Samples: 978206540. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:43,181][25689] Avg episode reward: [(0, '-0.893')] [2022-07-11 00:22:44,846][26022] Updated weights on worker 0-0, policy_version 955281 (0.00091) [2022-07-11 00:22:46,577][26022] Updated weights on worker 0-0, policy_version 955291 (0.00088) [2022-07-11 00:22:48,204][25689] Fps is (10 sec: 5486.3, 60 sec: 5513.4, 300 sec: 5530.2). Total num frames: 978226176. Throughput: 0: 4951.2. Samples: 978223072. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:48,205][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 00:22:48,594][26022] Updated weights on worker 0-0, policy_version 955301 (0.00081) [2022-07-11 00:22:50,352][26022] Updated weights on worker 0-0, policy_version 955311 (0.00091) [2022-07-11 00:22:52,260][26022] Updated weights on worker 0-0, policy_version 955321 (0.00107) [2022-07-11 00:22:53,255][25689] Fps is (10 sec: 5691.1, 60 sec: 5511.1, 300 sec: 5526.2). Total num frames: 978254848. Throughput: 0: 5772.4. Samples: 978256124. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:53,257][25689] Avg episode reward: [(0, '-0.634')] [2022-07-11 00:22:54,107][26022] Updated weights on worker 0-0, policy_version 955331 (0.00093) [2022-07-11 00:22:56,017][26022] Updated weights on worker 0-0, policy_version 955341 (0.00095) [2022-07-11 00:22:57,826][26022] Updated weights on worker 0-0, policy_version 955351 (0.00091) [2022-07-11 00:22:58,331][25689] Fps is (10 sec: 5460.2, 60 sec: 5509.5, 300 sec: 5523.3). Total num frames: 978281472. Throughput: 0: 5762.6. Samples: 978289672. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:22:58,332][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 00:22:59,584][26022] Updated weights on worker 0-0, policy_version 955361 (0.00093) [2022-07-11 00:23:01,588][26022] Updated weights on worker 0-0, policy_version 955371 (0.00094) [2022-07-11 00:23:03,402][25689] Fps is (10 sec: 5147.0, 60 sec: 5510.2, 300 sec: 5522.3). Total num frames: 978307072. Throughput: 0: 4927.2. Samples: 978306180. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:03,404][25689] Avg episode reward: [(0, '-0.257')] [2022-07-11 00:23:03,751][26022] Updated weights on worker 0-0, policy_version 955381 (0.00088) [2022-07-11 00:23:05,500][26022] Updated weights on worker 0-0, policy_version 955391 (0.00050) [2022-07-11 00:23:07,433][26022] Updated weights on worker 0-0, policy_version 955401 (0.00087) [2022-07-11 00:23:08,427][25689] Fps is (10 sec: 5375.8, 60 sec: 5492.5, 300 sec: 5522.4). Total num frames: 978335744. Throughput: 0: 5658.7. Samples: 978337506. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:08,428][25689] Avg episode reward: [(0, '-0.923')] [2022-07-11 00:23:09,019][26022] Updated weights on worker 0-0, policy_version 955411 (0.00087) [2022-07-11 00:23:11,046][26022] Updated weights on worker 0-0, policy_version 955421 (0.00086) [2022-07-11 00:23:12,991][26022] Updated weights on worker 0-0, policy_version 955431 (0.00095) [2022-07-11 00:23:13,460][25689] Fps is (10 sec: 5497.4, 60 sec: 5490.1, 300 sec: 5523.1). Total num frames: 978362368. Throughput: 0: 5669.4. Samples: 978370674. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:13,464][25689] Avg episode reward: [(0, '-1.295')] [2022-07-11 00:23:14,744][26022] Updated weights on worker 0-0, policy_version 955441 (0.00092) [2022-07-11 00:23:16,683][26022] Updated weights on worker 0-0, policy_version 955451 (0.00094) [2022-07-11 00:23:18,464][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:23:18,465][25689] Fps is (10 sec: 5406.5, 60 sec: 5508.0, 300 sec: 5517.4). Total num frames: 978390016. Throughput: 0: 4846.8. Samples: 978387256. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:18,466][25689] Avg episode reward: [(0, '-0.592')] [2022-07-11 00:23:18,475][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000955460_978391040.pth [2022-07-11 00:23:18,475][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000953517_976401408.pth [2022-07-11 00:23:18,602][26022] Updated weights on worker 0-0, policy_version 955461 (0.00096) [2022-07-11 00:23:20,344][26022] Updated weights on worker 0-0, policy_version 955471 (0.00086) [2022-07-11 00:23:22,433][26022] Updated weights on worker 0-0, policy_version 955481 (0.00398) [2022-07-11 00:23:23,542][25689] Fps is (10 sec: 5586.5, 60 sec: 5473.0, 300 sec: 5520.6). Total num frames: 978418688. Throughput: 0: 5648.0. Samples: 978419930. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:23,542][25689] Avg episode reward: [(0, '-1.260')] [2022-07-11 00:23:24,176][26022] Updated weights on worker 0-0, policy_version 955491 (0.00088) [2022-07-11 00:23:26,172][26022] Updated weights on worker 0-0, policy_version 955501 (0.00086) [2022-07-11 00:23:27,870][26022] Updated weights on worker 0-0, policy_version 955511 (0.00098) [2022-07-11 00:23:28,556][25689] Fps is (10 sec: 5479.7, 60 sec: 5461.0, 300 sec: 5520.9). Total num frames: 978445312. Throughput: 0: 5726.0. Samples: 978452766. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:28,557][25689] Avg episode reward: [(0, '-1.071')] [2022-07-11 00:23:29,787][26022] Updated weights on worker 0-0, policy_version 955521 (0.00089) [2022-07-11 00:23:31,876][26022] Updated weights on worker 0-0, policy_version 955531 (0.00088) [2022-07-11 00:23:33,365][26022] Updated weights on worker 0-0, policy_version 955541 (0.00095) [2022-07-11 00:23:33,563][25689] Fps is (10 sec: 5517.7, 60 sec: 5500.6, 300 sec: 5514.5). Total num frames: 978473984. Throughput: 0: 4911.3. Samples: 978469406. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:33,564][25689] Avg episode reward: [(0, '-0.464')] [2022-07-11 00:23:35,383][26022] Updated weights on worker 0-0, policy_version 955551 (0.00105) [2022-07-11 00:23:37,175][26022] Updated weights on worker 0-0, policy_version 955561 (0.00087) [2022-07-11 00:23:38,574][25689] Fps is (10 sec: 5621.9, 60 sec: 5471.6, 300 sec: 5520.2). Total num frames: 978501632. Throughput: 0: 5751.8. Samples: 978502918. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:38,576][25689] Avg episode reward: [(0, '0.060')] [2022-07-11 00:23:38,984][26022] Updated weights on worker 0-0, policy_version 955571 (0.00090) [2022-07-11 00:23:40,901][26022] Updated weights on worker 0-0, policy_version 955581 (0.00091) [2022-07-11 00:23:42,681][26022] Updated weights on worker 0-0, policy_version 955591 (0.00090) [2022-07-11 00:23:43,652][25689] Fps is (10 sec: 5582.7, 60 sec: 5503.3, 300 sec: 5516.4). Total num frames: 978530304. Throughput: 0: 5781.4. Samples: 978536194. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:43,652][25689] Avg episode reward: [(0, '-0.053')] [2022-07-11 00:23:44,757][26022] Updated weights on worker 0-0, policy_version 955601 (0.00092) [2022-07-11 00:23:46,272][26022] Updated weights on worker 0-0, policy_version 955611 (0.00093) [2022-07-11 00:23:48,304][26022] Updated weights on worker 0-0, policy_version 955621 (0.00092) [2022-07-11 00:23:48,666][25689] Fps is (10 sec: 5580.8, 60 sec: 5487.4, 300 sec: 5513.7). Total num frames: 978557952. Throughput: 0: 4982.3. Samples: 978552958. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:48,666][25689] Avg episode reward: [(0, '0.416')] [2022-07-11 00:23:49,987][26022] Updated weights on worker 0-0, policy_version 955631 (0.00093) [2022-07-11 00:23:52,021][26022] Updated weights on worker 0-0, policy_version 955641 (0.00092) [2022-07-11 00:23:53,681][25689] Fps is (10 sec: 5513.5, 60 sec: 5473.7, 300 sec: 5513.7). Total num frames: 978585600. Throughput: 0: 5810.6. Samples: 978586300. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:53,681][25689] Avg episode reward: [(0, '0.497')] [2022-07-11 00:23:53,754][26022] Updated weights on worker 0-0, policy_version 955651 (0.00084) [2022-07-11 00:23:55,895][26022] Updated weights on worker 0-0, policy_version 955661 (0.00090) [2022-07-11 00:23:57,468][26022] Updated weights on worker 0-0, policy_version 955671 (0.00086) [2022-07-11 00:23:58,695][25689] Fps is (10 sec: 5513.3, 60 sec: 5496.3, 300 sec: 5514.2). Total num frames: 978613248. Throughput: 0: 5781.5. Samples: 978619248. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:23:58,696][25689] Avg episode reward: [(0, '-0.908')] [2022-07-11 00:23:59,423][26022] Updated weights on worker 0-0, policy_version 955681 (0.00084) [2022-07-11 00:24:01,123][26022] Updated weights on worker 0-0, policy_version 955691 (0.00084) [2022-07-11 00:24:03,403][26022] Updated weights on worker 0-0, policy_version 955701 (0.00087) [2022-07-11 00:24:03,733][25689] Fps is (10 sec: 5195.1, 60 sec: 5482.2, 300 sec: 5510.9). Total num frames: 978637824. Throughput: 0: 4964.5. Samples: 978635890. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:03,734][25689] Avg episode reward: [(0, '-1.121')] [2022-07-11 00:24:05,251][26022] Updated weights on worker 0-0, policy_version 955711 (0.00088) [2022-07-11 00:24:07,250][26022] Updated weights on worker 0-0, policy_version 955721 (0.00085) [2022-07-11 00:24:08,755][25689] Fps is (10 sec: 5293.5, 60 sec: 5482.6, 300 sec: 5514.2). Total num frames: 978666496. Throughput: 0: 5686.9. Samples: 978667200. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:08,755][25689] Avg episode reward: [(0, '-2.236')] [2022-07-11 00:24:09,032][26022] Updated weights on worker 0-0, policy_version 955731 (0.00088) [2022-07-11 00:24:10,864][26022] Updated weights on worker 0-0, policy_version 955741 (0.00378) [2022-07-11 00:24:12,724][26022] Updated weights on worker 0-0, policy_version 955751 (0.00087) [2022-07-11 00:24:13,773][25689] Fps is (10 sec: 5711.5, 60 sec: 5517.9, 300 sec: 5514.1). Total num frames: 978695168. Throughput: 0: 5680.5. Samples: 978700436. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:13,774][25689] Avg episode reward: [(0, '-2.180')] [2022-07-11 00:24:14,466][26022] Updated weights on worker 0-0, policy_version 955761 (0.00082) [2022-07-11 00:24:16,388][26022] Updated weights on worker 0-0, policy_version 955771 (0.00086) [2022-07-11 00:24:18,182][26022] Updated weights on worker 0-0, policy_version 955781 (0.00088) [2022-07-11 00:24:18,783][25689] Fps is (10 sec: 5514.1, 60 sec: 5500.5, 300 sec: 5511.3). Total num frames: 978721792. Throughput: 0: 4871.7. Samples: 978717106. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:18,784][25689] Avg episode reward: [(0, '-2.166')] [2022-07-11 00:24:19,971][26022] Updated weights on worker 0-0, policy_version 955791 (0.00081) [2022-07-11 00:24:21,968][26022] Updated weights on worker 0-0, policy_version 955801 (0.00085) [2022-07-11 00:24:23,743][26022] Updated weights on worker 0-0, policy_version 955811 (0.00088) [2022-07-11 00:24:23,881][25689] Fps is (10 sec: 5571.9, 60 sec: 5515.4, 300 sec: 5516.4). Total num frames: 978751488. Throughput: 0: 5695.7. Samples: 978750648. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:23,882][25689] Avg episode reward: [(0, '-2.084')] [2022-07-11 00:24:25,747][26022] Updated weights on worker 0-0, policy_version 955821 (0.00088) [2022-07-11 00:24:27,488][26022] Updated weights on worker 0-0, policy_version 955831 (0.00092) [2022-07-11 00:24:28,888][25689] Fps is (10 sec: 5674.9, 60 sec: 5533.2, 300 sec: 5516.4). Total num frames: 978779136. Throughput: 0: 5783.6. Samples: 978783640. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:28,888][25689] Avg episode reward: [(0, '-1.112')] [2022-07-11 00:24:29,490][26022] Updated weights on worker 0-0, policy_version 955841 (0.00087) [2022-07-11 00:24:31,082][26022] Updated weights on worker 0-0, policy_version 955851 (0.00085) [2022-07-11 00:24:33,075][26022] Updated weights on worker 0-0, policy_version 955861 (0.00094) [2022-07-11 00:24:33,949][25689] Fps is (10 sec: 5492.2, 60 sec: 5511.2, 300 sec: 5511.9). Total num frames: 978806784. Throughput: 0: 4948.6. Samples: 978800278. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:33,950][25689] Avg episode reward: [(0, '0.046')] [2022-07-11 00:24:34,964][26022] Updated weights on worker 0-0, policy_version 955871 (0.00090) [2022-07-11 00:24:36,832][26022] Updated weights on worker 0-0, policy_version 955881 (0.00090) [2022-07-11 00:24:38,511][26022] Updated weights on worker 0-0, policy_version 955891 (0.00097) [2022-07-11 00:24:39,015][25689] Fps is (10 sec: 5560.9, 60 sec: 5523.1, 300 sec: 5512.9). Total num frames: 978835456. Throughput: 0: 5771.5. Samples: 978833880. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:39,017][25689] Avg episode reward: [(0, '0.708')] [2022-07-11 00:24:40,440][26022] Updated weights on worker 0-0, policy_version 955901 (0.00079) [2022-07-11 00:24:42,123][26022] Updated weights on worker 0-0, policy_version 955911 (0.00089) [2022-07-11 00:24:44,079][25689] Fps is (10 sec: 5458.6, 60 sec: 5490.4, 300 sec: 5508.3). Total num frames: 978862080. Throughput: 0: 5774.0. Samples: 978867272. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:44,081][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 00:24:44,141][26022] Updated weights on worker 0-0, policy_version 955921 (0.00087) [2022-07-11 00:24:45,653][26022] Updated weights on worker 0-0, policy_version 955931 (0.00087) [2022-07-11 00:24:47,784][26022] Updated weights on worker 0-0, policy_version 955941 (0.00082) [2022-07-11 00:24:49,158][25689] Fps is (10 sec: 5552.6, 60 sec: 5518.4, 300 sec: 5515.3). Total num frames: 978891776. Throughput: 0: 4955.3. Samples: 978884088. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:49,162][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 00:24:49,554][26022] Updated weights on worker 0-0, policy_version 955951 (0.00082) [2022-07-11 00:24:51,432][26022] Updated weights on worker 0-0, policy_version 955961 (0.00088) [2022-07-11 00:24:53,082][26022] Updated weights on worker 0-0, policy_version 955971 (0.00085) [2022-07-11 00:24:54,166][25689] Fps is (10 sec: 5684.9, 60 sec: 5519.1, 300 sec: 5512.4). Total num frames: 978919424. Throughput: 0: 5805.3. Samples: 978917644. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:54,168][25689] Avg episode reward: [(0, '0.165')] [2022-07-11 00:24:55,100][26022] Updated weights on worker 0-0, policy_version 955981 (0.00086) [2022-07-11 00:24:56,752][26022] Updated weights on worker 0-0, policy_version 955991 (0.00090) [2022-07-11 00:24:58,961][26022] Updated weights on worker 0-0, policy_version 956001 (0.00087) [2022-07-11 00:24:59,189][25689] Fps is (10 sec: 5410.4, 60 sec: 5501.4, 300 sec: 5512.9). Total num frames: 978946048. Throughput: 0: 5809.5. Samples: 978951080. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:24:59,190][25689] Avg episode reward: [(0, '0.081')] [2022-07-11 00:25:00,551][26022] Updated weights on worker 0-0, policy_version 956011 (0.00086) [2022-07-11 00:25:02,887][26022] Updated weights on worker 0-0, policy_version 956021 (0.00091) [2022-07-11 00:25:04,238][25689] Fps is (10 sec: 5388.5, 60 sec: 5551.1, 300 sec: 5512.0). Total num frames: 978973696. Throughput: 0: 5706.6. Samples: 978982310. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:04,239][25689] Avg episode reward: [(0, '-0.412')] [2022-07-11 00:25:04,623][26022] Updated weights on worker 0-0, policy_version 956031 (0.00092) [2022-07-11 00:25:06,659][26022] Updated weights on worker 0-0, policy_version 956041 (0.00093) [2022-07-11 00:25:08,212][26022] Updated weights on worker 0-0, policy_version 956051 (0.00091) [2022-07-11 00:25:09,291][25689] Fps is (10 sec: 5473.8, 60 sec: 5531.3, 300 sec: 5511.1). Total num frames: 979001344. Throughput: 0: 5707.4. Samples: 978998996. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:09,292][25689] Avg episode reward: [(0, '-0.506')] [2022-07-11 00:25:10,282][26022] Updated weights on worker 0-0, policy_version 956061 (0.00084) [2022-07-11 00:25:11,846][26022] Updated weights on worker 0-0, policy_version 956071 (0.00080) [2022-07-11 00:25:13,982][26022] Updated weights on worker 0-0, policy_version 956081 (0.00082) [2022-07-11 00:25:14,388][25689] Fps is (10 sec: 5549.0, 60 sec: 5524.2, 300 sec: 5512.9). Total num frames: 979030016. Throughput: 0: 5683.4. Samples: 979032570. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:14,388][25689] Avg episode reward: [(0, '0.001')] [2022-07-11 00:25:15,559][26022] Updated weights on worker 0-0, policy_version 956091 (0.00098) [2022-07-11 00:25:17,503][26022] Updated weights on worker 0-0, policy_version 956101 (0.00079) [2022-07-11 00:25:18,511][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:25:18,522][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000956106_979052544.pth [2022-07-11 00:25:18,523][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000954167_977067008.pth [2022-07-11 00:25:19,149][26022] Updated weights on worker 0-0, policy_version 956111 (0.00089) [2022-07-11 00:25:19,400][25689] Fps is (10 sec: 5571.7, 60 sec: 5540.9, 300 sec: 5507.2). Total num frames: 979057664. Throughput: 0: 5695.3. Samples: 979066182. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:19,400][25689] Avg episode reward: [(0, '-0.740')] [2022-07-11 00:25:21,378][26022] Updated weights on worker 0-0, policy_version 956121 (0.00097) [2022-07-11 00:25:23,047][26022] Updated weights on worker 0-0, policy_version 956131 (0.00096) [2022-07-11 00:25:24,443][25689] Fps is (10 sec: 5499.2, 60 sec: 5512.1, 300 sec: 5510.3). Total num frames: 979085312. Throughput: 0: 4985.5. Samples: 979083040. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:24,444][25689] Avg episode reward: [(0, '-1.121')] [2022-07-11 00:25:25,068][26022] Updated weights on worker 0-0, policy_version 956141 (0.00087) [2022-07-11 00:25:26,603][26022] Updated weights on worker 0-0, policy_version 956151 (0.01117) [2022-07-11 00:25:28,877][26022] Updated weights on worker 0-0, policy_version 956161 (0.00087) [2022-07-11 00:25:29,515][25689] Fps is (10 sec: 5567.6, 60 sec: 5523.0, 300 sec: 5505.8). Total num frames: 979113984. Throughput: 0: 5794.2. Samples: 979116178. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:29,516][25689] Avg episode reward: [(0, '-0.706')] [2022-07-11 00:25:30,321][26022] Updated weights on worker 0-0, policy_version 956171 (0.00092) [2022-07-11 00:25:32,337][26022] Updated weights on worker 0-0, policy_version 956181 (0.00093) [2022-07-11 00:25:34,037][26022] Updated weights on worker 0-0, policy_version 956191 (0.00087) [2022-07-11 00:25:34,589][25689] Fps is (10 sec: 5550.8, 60 sec: 5521.9, 300 sec: 5505.5). Total num frames: 979141632. Throughput: 0: 5775.0. Samples: 979149234. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:34,590][25689] Avg episode reward: [(0, '0.225')] [2022-07-11 00:25:36,046][26022] Updated weights on worker 0-0, policy_version 956201 (0.00111) [2022-07-11 00:25:37,860][26022] Updated weights on worker 0-0, policy_version 956211 (0.00089) [2022-07-11 00:25:39,684][25689] Fps is (10 sec: 5437.7, 60 sec: 5502.4, 300 sec: 5508.1). Total num frames: 979169280. Throughput: 0: 4904.5. Samples: 979165668. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:39,685][25689] Avg episode reward: [(0, '0.127')] [2022-07-11 00:25:39,729][26022] Updated weights on worker 0-0, policy_version 956221 (0.00093) [2022-07-11 00:25:41,484][26022] Updated weights on worker 0-0, policy_version 956231 (0.00096) [2022-07-11 00:25:43,711][26022] Updated weights on worker 0-0, policy_version 956241 (0.00095) [2022-07-11 00:25:44,825][25689] Fps is (10 sec: 5502.5, 60 sec: 5529.2, 300 sec: 5502.5). Total num frames: 979197952. Throughput: 0: 5683.7. Samples: 979198882. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:44,826][25689] Avg episode reward: [(0, '0.241')] [2022-07-11 00:25:45,227][26022] Updated weights on worker 0-0, policy_version 956251 (0.00091) [2022-07-11 00:25:47,226][26022] Updated weights on worker 0-0, policy_version 956261 (0.00089) [2022-07-11 00:25:49,116][26022] Updated weights on worker 0-0, policy_version 956271 (0.00087) [2022-07-11 00:25:49,852][25689] Fps is (10 sec: 5539.2, 60 sec: 5500.2, 300 sec: 5510.1). Total num frames: 979225600. Throughput: 0: 5697.4. Samples: 979232044. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:49,852][25689] Avg episode reward: [(0, '1.081')] [2022-07-11 00:25:50,686][26022] Updated weights on worker 0-0, policy_version 956281 (0.00082) [2022-07-11 00:25:52,883][26022] Updated weights on worker 0-0, policy_version 956291 (0.00093) [2022-07-11 00:25:54,415][26022] Updated weights on worker 0-0, policy_version 956301 (0.00086) [2022-07-11 00:25:54,889][25689] Fps is (10 sec: 5494.0, 60 sec: 5497.5, 300 sec: 5506.2). Total num frames: 979253248. Throughput: 0: 4901.3. Samples: 979248730. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 00:25:54,890][25689] Avg episode reward: [(0, '0.638')] [2022-07-11 00:25:56,501][26022] Updated weights on worker 0-0, policy_version 956311 (0.00086) [2022-07-11 00:25:58,403][26022] Updated weights on worker 0-0, policy_version 956321 (0.00092) [2022-07-11 00:25:59,910][25689] Fps is (10 sec: 5599.7, 60 sec: 5531.5, 300 sec: 5517.9). Total num frames: 979281920. Throughput: 0: 5761.2. Samples: 979282190. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:25:59,910][25689] Avg episode reward: [(0, '0.198')] [2022-07-11 00:26:00,048][26022] Updated weights on worker 0-0, policy_version 956331 (0.00087) [2022-07-11 00:26:02,429][26022] Updated weights on worker 0-0, policy_version 956341 (0.00089) [2022-07-11 00:26:04,193][26022] Updated weights on worker 0-0, policy_version 956351 (0.00089) [2022-07-11 00:26:05,021][25689] Fps is (10 sec: 5356.7, 60 sec: 5492.1, 300 sec: 5509.3). Total num frames: 979307520. Throughput: 0: 5668.1. Samples: 979313358. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:05,023][25689] Avg episode reward: [(0, '0.207')] [2022-07-11 00:26:06,045][26022] Updated weights on worker 0-0, policy_version 956361 (0.00569) [2022-07-11 00:26:07,875][26022] Updated weights on worker 0-0, policy_version 956371 (0.00086) [2022-07-11 00:26:09,633][26022] Updated weights on worker 0-0, policy_version 956381 (0.00052) [2022-07-11 00:26:10,049][25689] Fps is (10 sec: 5352.7, 60 sec: 5511.2, 300 sec: 5503.4). Total num frames: 979336192. Throughput: 0: 4857.2. Samples: 979330144. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:10,050][25689] Avg episode reward: [(0, '0.075')] [2022-07-11 00:26:11,556][26022] Updated weights on worker 0-0, policy_version 956391 (0.00089) [2022-07-11 00:26:13,308][26022] Updated weights on worker 0-0, policy_version 956401 (0.00083) [2022-07-11 00:26:15,107][25689] Fps is (10 sec: 5482.6, 60 sec: 5481.0, 300 sec: 5506.7). Total num frames: 979362816. Throughput: 0: 5682.4. Samples: 979363614. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:15,107][25689] Avg episode reward: [(0, '0.028')] [2022-07-11 00:26:15,308][26022] Updated weights on worker 0-0, policy_version 956411 (0.00086) [2022-07-11 00:26:16,979][26022] Updated weights on worker 0-0, policy_version 956421 (0.00089) [2022-07-11 00:26:18,921][26022] Updated weights on worker 0-0, policy_version 956431 (0.00087) [2022-07-11 00:26:20,140][25689] Fps is (10 sec: 5479.6, 60 sec: 5495.9, 300 sec: 5507.1). Total num frames: 979391488. Throughput: 0: 5687.3. Samples: 979397246. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:20,141][25689] Avg episode reward: [(0, '0.186')] [2022-07-11 00:26:20,704][26022] Updated weights on worker 0-0, policy_version 956441 (0.00091) [2022-07-11 00:26:22,605][26022] Updated weights on worker 0-0, policy_version 956451 (0.00090) [2022-07-11 00:26:24,506][26022] Updated weights on worker 0-0, policy_version 956461 (0.00085) [2022-07-11 00:26:25,227][25689] Fps is (10 sec: 5767.6, 60 sec: 5525.7, 300 sec: 5512.4). Total num frames: 979421184. Throughput: 0: 4977.1. Samples: 979413924. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:25,227][25689] Avg episode reward: [(0, '0.950')] [2022-07-11 00:26:26,200][26022] Updated weights on worker 0-0, policy_version 956471 (0.00088) [2022-07-11 00:26:28,262][26022] Updated weights on worker 0-0, policy_version 956481 (0.00084) [2022-07-11 00:26:29,940][26022] Updated weights on worker 0-0, policy_version 956491 (0.00094) [2022-07-11 00:26:30,245][25689] Fps is (10 sec: 5573.3, 60 sec: 5496.8, 300 sec: 5509.0). Total num frames: 979447808. Throughput: 0: 5794.1. Samples: 979447166. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:30,246][25689] Avg episode reward: [(0, '1.413')] [2022-07-11 00:26:31,883][26022] Updated weights on worker 0-0, policy_version 956501 (0.00089) [2022-07-11 00:26:33,410][26022] Updated weights on worker 0-0, policy_version 956511 (0.00086) [2022-07-11 00:26:35,279][25689] Fps is (10 sec: 5399.1, 60 sec: 5500.5, 300 sec: 5505.1). Total num frames: 979475456. Throughput: 0: 5809.5. Samples: 979480804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:35,279][25689] Avg episode reward: [(0, '1.075')] [2022-07-11 00:26:35,523][26022] Updated weights on worker 0-0, policy_version 956521 (0.00085) [2022-07-11 00:26:37,305][26022] Updated weights on worker 0-0, policy_version 956531 (0.00094) [2022-07-11 00:26:38,928][26022] Updated weights on worker 0-0, policy_version 956541 (0.00090) [2022-07-11 00:26:40,314][25689] Fps is (10 sec: 5593.6, 60 sec: 5522.8, 300 sec: 5507.1). Total num frames: 979504128. Throughput: 0: 4980.9. Samples: 979497732. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:40,315][25689] Avg episode reward: [(0, '1.162')] [2022-07-11 00:26:40,991][26022] Updated weights on worker 0-0, policy_version 956551 (0.00083) [2022-07-11 00:26:42,646][26022] Updated weights on worker 0-0, policy_version 956561 (0.00094) [2022-07-11 00:26:44,654][26022] Updated weights on worker 0-0, policy_version 956571 (0.00097) [2022-07-11 00:26:45,396][25689] Fps is (10 sec: 5768.9, 60 sec: 5545.0, 300 sec: 5513.1). Total num frames: 979533824. Throughput: 0: 5832.9. Samples: 979531572. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:45,397][25689] Avg episode reward: [(0, '-0.575')] [2022-07-11 00:26:46,322][26022] Updated weights on worker 0-0, policy_version 956581 (0.00086) [2022-07-11 00:26:48,288][26022] Updated weights on worker 0-0, policy_version 956591 (0.00097) [2022-07-11 00:26:49,981][26022] Updated weights on worker 0-0, policy_version 956601 (0.00098) [2022-07-11 00:26:50,399][25689] Fps is (10 sec: 5686.2, 60 sec: 5547.3, 300 sec: 5510.1). Total num frames: 979561472. Throughput: 0: 5866.4. Samples: 979565394. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:50,400][25689] Avg episode reward: [(0, '-0.711')] [2022-07-11 00:26:51,909][26022] Updated weights on worker 0-0, policy_version 956611 (0.00088) [2022-07-11 00:26:53,511][26022] Updated weights on worker 0-0, policy_version 956621 (0.00085) [2022-07-11 00:26:55,446][26022] Updated weights on worker 0-0, policy_version 956631 (0.00095) [2022-07-11 00:26:55,447][25689] Fps is (10 sec: 5501.6, 60 sec: 5546.3, 300 sec: 5513.7). Total num frames: 979589120. Throughput: 0: 5857.0. Samples: 979598932. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:26:55,448][25689] Avg episode reward: [(0, '-0.757')] [2022-07-11 00:26:57,191][26022] Updated weights on worker 0-0, policy_version 956641 (0.00085) [2022-07-11 00:26:59,282][26022] Updated weights on worker 0-0, policy_version 956651 (0.00089) [2022-07-11 00:27:00,469][25689] Fps is (10 sec: 5593.0, 60 sec: 5546.2, 300 sec: 5525.1). Total num frames: 979617792. Throughput: 0: 5841.5. Samples: 979615464. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:00,469][25689] Avg episode reward: [(0, '-1.595')] [2022-07-11 00:27:00,967][26022] Updated weights on worker 0-0, policy_version 956661 (0.00103) [2022-07-11 00:27:03,175][26022] Updated weights on worker 0-0, policy_version 956671 (0.00089) [2022-07-11 00:27:04,959][26022] Updated weights on worker 0-0, policy_version 956681 (0.00090) [2022-07-11 00:27:05,509][25689] Fps is (10 sec: 5393.7, 60 sec: 5552.7, 300 sec: 5510.9). Total num frames: 979643392. Throughput: 0: 5743.1. Samples: 979647084. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:05,510][25689] Avg episode reward: [(0, '-1.333')] [2022-07-11 00:27:06,931][26022] Updated weights on worker 0-0, policy_version 956691 (0.00091) [2022-07-11 00:27:08,714][26022] Updated weights on worker 0-0, policy_version 956701 (0.00087) [2022-07-11 00:27:10,486][26022] Updated weights on worker 0-0, policy_version 956711 (0.00086) [2022-07-11 00:27:10,539][25689] Fps is (10 sec: 5389.4, 60 sec: 5552.5, 300 sec: 5517.3). Total num frames: 979672064. Throughput: 0: 5723.8. Samples: 979680670. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:10,539][25689] Avg episode reward: [(0, '-1.555')] [2022-07-11 00:27:12,294][26022] Updated weights on worker 0-0, policy_version 956721 (0.00086) [2022-07-11 00:27:14,332][26022] Updated weights on worker 0-0, policy_version 956731 (0.00088) [2022-07-11 00:27:15,581][25689] Fps is (10 sec: 5592.1, 60 sec: 5570.9, 300 sec: 5520.3). Total num frames: 979699712. Throughput: 0: 4894.8. Samples: 979697482. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:15,581][25689] Avg episode reward: [(0, '-0.201')] [2022-07-11 00:27:16,152][26022] Updated weights on worker 0-0, policy_version 956741 (0.00093) [2022-07-11 00:27:17,949][26022] Updated weights on worker 0-0, policy_version 956751 (0.00090) [2022-07-11 00:27:18,602][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:27:18,614][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000956754_979716096.pth [2022-07-11 00:27:18,615][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000954814_977729536.pth [2022-07-11 00:27:19,595][26022] Updated weights on worker 0-0, policy_version 956761 (0.00086) [2022-07-11 00:27:20,583][25689] Fps is (10 sec: 5607.2, 60 sec: 5573.8, 300 sec: 5514.6). Total num frames: 979728384. Throughput: 0: 5733.2. Samples: 979730784. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:20,583][25689] Avg episode reward: [(0, '-0.167')] [2022-07-11 00:27:21,827][26022] Updated weights on worker 0-0, policy_version 956771 (0.00115) [2022-07-11 00:27:23,175][26022] Updated weights on worker 0-0, policy_version 956781 (0.00084) [2022-07-11 00:27:25,461][26022] Updated weights on worker 0-0, policy_version 956791 (0.00090) [2022-07-11 00:27:25,637][25689] Fps is (10 sec: 5498.8, 60 sec: 5526.0, 300 sec: 5511.4). Total num frames: 979755008. Throughput: 0: 5824.5. Samples: 979764316. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:25,637][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 00:27:27,035][26022] Updated weights on worker 0-0, policy_version 956801 (0.00085) [2022-07-11 00:27:29,200][26022] Updated weights on worker 0-0, policy_version 956811 (0.00091) [2022-07-11 00:27:30,667][25689] Fps is (10 sec: 5585.1, 60 sec: 5575.8, 300 sec: 5522.5). Total num frames: 979784704. Throughput: 0: 4971.8. Samples: 979780744. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:30,668][25689] Avg episode reward: [(0, '0.736')] [2022-07-11 00:27:30,680][26022] Updated weights on worker 0-0, policy_version 956821 (0.00825) [2022-07-11 00:27:32,884][26022] Updated weights on worker 0-0, policy_version 956831 (0.00082) [2022-07-11 00:27:34,615][26022] Updated weights on worker 0-0, policy_version 956841 (0.00084) [2022-07-11 00:27:35,740][25689] Fps is (10 sec: 5574.8, 60 sec: 5555.2, 300 sec: 5512.0). Total num frames: 979811328. Throughput: 0: 5799.7. Samples: 979814396. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:35,740][25689] Avg episode reward: [(0, '0.742')] [2022-07-11 00:27:36,515][26022] Updated weights on worker 0-0, policy_version 956851 (0.00084) [2022-07-11 00:27:38,103][26022] Updated weights on worker 0-0, policy_version 956861 (0.00086) [2022-07-11 00:27:40,137][26022] Updated weights on worker 0-0, policy_version 956871 (0.00087) [2022-07-11 00:27:40,767][25689] Fps is (10 sec: 5373.4, 60 sec: 5539.0, 300 sec: 5515.9). Total num frames: 979838976. Throughput: 0: 5808.3. Samples: 979848020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:40,768][25689] Avg episode reward: [(0, '0.771')] [2022-07-11 00:27:41,690][26022] Updated weights on worker 0-0, policy_version 956881 (0.00092) [2022-07-11 00:27:43,619][26022] Updated weights on worker 0-0, policy_version 956891 (0.00090) [2022-07-11 00:27:45,308][26022] Updated weights on worker 0-0, policy_version 956901 (0.00086) [2022-07-11 00:27:45,826][25689] Fps is (10 sec: 5583.9, 60 sec: 5524.2, 300 sec: 5515.3). Total num frames: 979867648. Throughput: 0: 4977.4. Samples: 979864804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:45,826][25689] Avg episode reward: [(0, '0.981')] [2022-07-11 00:27:47,409][26022] Updated weights on worker 0-0, policy_version 956911 (0.00086) [2022-07-11 00:27:49,112][26022] Updated weights on worker 0-0, policy_version 956921 (0.00085) [2022-07-11 00:27:50,855][25689] Fps is (10 sec: 5582.8, 60 sec: 5521.7, 300 sec: 5512.3). Total num frames: 979895296. Throughput: 0: 5830.3. Samples: 979898448. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:50,856][25689] Avg episode reward: [(0, '-0.446')] [2022-07-11 00:27:50,998][26022] Updated weights on worker 0-0, policy_version 956931 (0.00084) [2022-07-11 00:27:52,519][26022] Updated weights on worker 0-0, policy_version 956941 (0.00087) [2022-07-11 00:27:54,569][26022] Updated weights on worker 0-0, policy_version 956951 (0.00091) [2022-07-11 00:27:55,864][25689] Fps is (10 sec: 5610.6, 60 sec: 5542.3, 300 sec: 5520.4). Total num frames: 979923968. Throughput: 0: 5866.8. Samples: 979932460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:27:55,864][25689] Avg episode reward: [(0, '-1.430')] [2022-07-11 00:27:56,420][26022] Updated weights on worker 0-0, policy_version 956961 (0.00089) [2022-07-11 00:27:58,231][26022] Updated weights on worker 0-0, policy_version 956971 (0.00093) [2022-07-11 00:28:00,150][26022] Updated weights on worker 0-0, policy_version 956981 (0.00087) [2022-07-11 00:28:00,870][25689] Fps is (10 sec: 5725.8, 60 sec: 5543.7, 300 sec: 5531.9). Total num frames: 979952640. Throughput: 0: 5021.1. Samples: 979948960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:00,871][25689] Avg episode reward: [(0, '-1.265')] [2022-07-11 00:28:01,705][26022] Updated weights on worker 0-0, policy_version 956991 (0.00091) [2022-07-11 00:28:04,097][26022] Updated weights on worker 0-0, policy_version 957001 (0.00094) [2022-07-11 00:28:05,925][25689] Fps is (10 sec: 5496.0, 60 sec: 5559.4, 300 sec: 5524.5). Total num frames: 979979264. Throughput: 0: 5756.4. Samples: 979980504. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:05,926][25689] Avg episode reward: [(0, '-1.266')] [2022-07-11 00:28:05,927][26022] Updated weights on worker 0-0, policy_version 957011 (0.00081) [2022-07-11 00:28:07,871][26022] Updated weights on worker 0-0, policy_version 957021 (0.00082) [2022-07-11 00:28:09,647][26022] Updated weights on worker 0-0, policy_version 957031 (0.00087) [2022-07-11 00:28:11,013][25689] Fps is (10 sec: 5350.8, 60 sec: 5537.0, 300 sec: 5526.9). Total num frames: 980006912. Throughput: 0: 5749.5. Samples: 980014346. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:11,019][25689] Avg episode reward: [(0, '-0.627')] [2022-07-11 00:28:11,580][26022] Updated weights on worker 0-0, policy_version 957041 (0.00091) [2022-07-11 00:28:13,163][26022] Updated weights on worker 0-0, policy_version 957051 (0.00087) [2022-07-11 00:28:15,184][26022] Updated weights on worker 0-0, policy_version 957061 (0.00094) [2022-07-11 00:28:16,027][25689] Fps is (10 sec: 5574.9, 60 sec: 5556.5, 300 sec: 5530.2). Total num frames: 980035584. Throughput: 0: 4887.0. Samples: 980031000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:16,028][25689] Avg episode reward: [(0, '0.874')] [2022-07-11 00:28:16,921][26022] Updated weights on worker 0-0, policy_version 957071 (0.00085) [2022-07-11 00:28:18,733][26022] Updated weights on worker 0-0, policy_version 957081 (0.00086) [2022-07-11 00:28:20,688][26022] Updated weights on worker 0-0, policy_version 957091 (0.00087) [2022-07-11 00:28:21,046][25689] Fps is (10 sec: 5613.5, 60 sec: 5538.1, 300 sec: 5527.8). Total num frames: 980063232. Throughput: 0: 5729.9. Samples: 980064564. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:21,047][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 00:28:22,461][26022] Updated weights on worker 0-0, policy_version 957101 (0.00090) [2022-07-11 00:28:24,354][26022] Updated weights on worker 0-0, policy_version 957112 (0.00099) [2022-07-11 00:28:26,110][25689] Fps is (10 sec: 5382.7, 60 sec: 5537.1, 300 sec: 5526.9). Total num frames: 980089856. Throughput: 0: 5817.8. Samples: 980097936. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:26,110][25689] Avg episode reward: [(0, '0.992')] [2022-07-11 00:28:26,398][26022] Updated weights on worker 0-0, policy_version 957122 (0.00086) [2022-07-11 00:28:28,159][26022] Updated weights on worker 0-0, policy_version 957132 (0.00092) [2022-07-11 00:28:30,108][26022] Updated weights on worker 0-0, policy_version 957142 (0.00088) [2022-07-11 00:28:31,123][25689] Fps is (10 sec: 5487.5, 60 sec: 5521.8, 300 sec: 5526.8). Total num frames: 980118528. Throughput: 0: 4960.7. Samples: 980114104. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:31,124][25689] Avg episode reward: [(0, '1.501')] [2022-07-11 00:28:32,076][26022] Updated weights on worker 0-0, policy_version 957152 (0.00092) [2022-07-11 00:28:33,747][26022] Updated weights on worker 0-0, policy_version 957162 (0.00095) [2022-07-11 00:28:35,963][26022] Updated weights on worker 0-0, policy_version 957172 (0.00082) [2022-07-11 00:28:36,139][25689] Fps is (10 sec: 5513.8, 60 sec: 5527.0, 300 sec: 5523.2). Total num frames: 980145152. Throughput: 0: 5784.9. Samples: 980147342. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:36,139][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 00:28:37,257][26022] Updated weights on worker 0-0, policy_version 957182 (0.00087) [2022-07-11 00:28:39,460][26022] Updated weights on worker 0-0, policy_version 957192 (0.00082) [2022-07-11 00:28:41,075][26022] Updated weights on worker 0-0, policy_version 957202 (0.00088) [2022-07-11 00:28:41,146][25689] Fps is (10 sec: 5618.9, 60 sec: 5562.7, 300 sec: 5528.0). Total num frames: 980174848. Throughput: 0: 5787.8. Samples: 980180900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:41,147][25689] Avg episode reward: [(0, '-0.127')] [2022-07-11 00:28:43,105][26022] Updated weights on worker 0-0, policy_version 957212 (0.00082) [2022-07-11 00:28:44,830][26022] Updated weights on worker 0-0, policy_version 957222 (0.00090) [2022-07-11 00:28:46,218][25689] Fps is (10 sec: 5689.1, 60 sec: 5544.5, 300 sec: 5526.9). Total num frames: 980202496. Throughput: 0: 4964.7. Samples: 980197768. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:46,219][25689] Avg episode reward: [(0, '-0.378')] [2022-07-11 00:28:46,708][26022] Updated weights on worker 0-0, policy_version 957232 (0.00088) [2022-07-11 00:28:48,334][26022] Updated weights on worker 0-0, policy_version 957242 (0.00092) [2022-07-11 00:28:50,419][26022] Updated weights on worker 0-0, policy_version 957252 (0.00090) [2022-07-11 00:28:51,303][25689] Fps is (10 sec: 5343.2, 60 sec: 5522.5, 300 sec: 5522.2). Total num frames: 980229120. Throughput: 0: 5802.4. Samples: 980231200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:51,304][25689] Avg episode reward: [(0, '-0.398')] [2022-07-11 00:28:52,185][26022] Updated weights on worker 0-0, policy_version 957262 (0.00092) [2022-07-11 00:28:54,347][26022] Updated weights on worker 0-0, policy_version 957272 (0.00312) [2022-07-11 00:28:55,763][26022] Updated weights on worker 0-0, policy_version 957282 (0.00090) [2022-07-11 00:28:56,323][25689] Fps is (10 sec: 5472.1, 60 sec: 5521.4, 300 sec: 5525.5). Total num frames: 980257792. Throughput: 0: 5786.2. Samples: 980264134. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:28:56,324][25689] Avg episode reward: [(0, '-0.057')] [2022-07-11 00:28:57,924][26022] Updated weights on worker 0-0, policy_version 957292 (0.00085) [2022-07-11 00:28:59,640][26022] Updated weights on worker 0-0, policy_version 957302 (0.00091) [2022-07-11 00:29:01,342][25689] Fps is (10 sec: 5712.3, 60 sec: 5520.3, 300 sec: 5539.6). Total num frames: 980286464. Throughput: 0: 4936.5. Samples: 980280598. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:01,343][25689] Avg episode reward: [(0, '0.065')] [2022-07-11 00:29:01,679][26022] Updated weights on worker 0-0, policy_version 957312 (0.00080) [2022-07-11 00:29:03,787][26022] Updated weights on worker 0-0, policy_version 957322 (0.00718) [2022-07-11 00:29:05,552][26022] Updated weights on worker 0-0, policy_version 957332 (0.00085) [2022-07-11 00:29:06,470][25689] Fps is (10 sec: 5348.9, 60 sec: 5496.8, 300 sec: 5527.3). Total num frames: 980312064. Throughput: 0: 5655.1. Samples: 980312292. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:06,470][25689] Avg episode reward: [(0, '1.072')] [2022-07-11 00:29:07,348][26022] Updated weights on worker 0-0, policy_version 957342 (0.00087) [2022-07-11 00:29:09,224][26022] Updated weights on worker 0-0, policy_version 957352 (0.00089) [2022-07-11 00:29:11,050][26022] Updated weights on worker 0-0, policy_version 957362 (0.00082) [2022-07-11 00:29:11,510][25689] Fps is (10 sec: 5438.2, 60 sec: 5535.0, 300 sec: 5530.4). Total num frames: 980341760. Throughput: 0: 5676.3. Samples: 980345900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:11,511][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 00:29:12,975][26022] Updated weights on worker 0-0, policy_version 957372 (0.00085) [2022-07-11 00:29:14,741][26022] Updated weights on worker 0-0, policy_version 957382 (0.00089) [2022-07-11 00:29:16,440][26022] Updated weights on worker 0-0, policy_version 957392 (0.00094) [2022-07-11 00:29:16,572][25689] Fps is (10 sec: 5676.5, 60 sec: 5513.7, 300 sec: 5532.8). Total num frames: 980369408. Throughput: 0: 4871.0. Samples: 980362764. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:16,572][25689] Avg episode reward: [(0, '1.574')] [2022-07-11 00:29:18,502][26022] Updated weights on worker 0-0, policy_version 957402 (0.00088) [2022-07-11 00:29:18,648][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:29:18,661][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000957404_980381696.pth [2022-07-11 00:29:18,662][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000955460_978391040.pth [2022-07-11 00:29:20,124][26022] Updated weights on worker 0-0, policy_version 957412 (0.00093) [2022-07-11 00:29:21,574][25689] Fps is (10 sec: 5392.7, 60 sec: 5498.3, 300 sec: 5524.3). Total num frames: 980396032. Throughput: 0: 5714.6. Samples: 980396216. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:21,576][25689] Avg episode reward: [(0, '1.312')] [2022-07-11 00:29:22,185][26022] Updated weights on worker 0-0, policy_version 957422 (0.00092) [2022-07-11 00:29:24,036][26022] Updated weights on worker 0-0, policy_version 957432 (0.00085) [2022-07-11 00:29:25,719][26022] Updated weights on worker 0-0, policy_version 957442 (0.00089) [2022-07-11 00:29:26,662][25689] Fps is (10 sec: 5581.4, 60 sec: 5546.8, 300 sec: 5529.7). Total num frames: 980425728. Throughput: 0: 5807.3. Samples: 980429556. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:26,663][25689] Avg episode reward: [(0, '1.128')] [2022-07-11 00:29:27,767][26022] Updated weights on worker 0-0, policy_version 957452 (0.00096) [2022-07-11 00:29:29,387][26022] Updated weights on worker 0-0, policy_version 957462 (0.00088) [2022-07-11 00:29:31,547][26022] Updated weights on worker 0-0, policy_version 957472 (0.00083) [2022-07-11 00:29:31,670][25689] Fps is (10 sec: 5679.8, 60 sec: 5530.3, 300 sec: 5530.7). Total num frames: 980453376. Throughput: 0: 4970.0. Samples: 980446096. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:31,671][25689] Avg episode reward: [(0, '1.060')] [2022-07-11 00:29:33,311][26022] Updated weights on worker 0-0, policy_version 957482 (0.00092) [2022-07-11 00:29:34,897][26022] Updated weights on worker 0-0, policy_version 957492 (0.00093) [2022-07-11 00:29:36,686][25689] Fps is (10 sec: 5516.6, 60 sec: 5547.3, 300 sec: 5528.2). Total num frames: 980481024. Throughput: 0: 5799.4. Samples: 980479414. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:36,686][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 00:29:36,807][26022] Updated weights on worker 0-0, policy_version 957502 (0.00080) [2022-07-11 00:29:38,583][26022] Updated weights on worker 0-0, policy_version 957512 (0.00099) [2022-07-11 00:29:40,427][26022] Updated weights on worker 0-0, policy_version 957522 (0.00081) [2022-07-11 00:29:41,697][25689] Fps is (10 sec: 5616.6, 60 sec: 5530.0, 300 sec: 5536.0). Total num frames: 980509696. Throughput: 0: 5814.0. Samples: 980513214. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:41,699][25689] Avg episode reward: [(0, '0.074')] [2022-07-11 00:29:42,263][26022] Updated weights on worker 0-0, policy_version 957532 (0.00077) [2022-07-11 00:29:44,017][26022] Updated weights on worker 0-0, policy_version 957542 (0.00102) [2022-07-11 00:29:46,027][26022] Updated weights on worker 0-0, policy_version 957552 (0.00088) [2022-07-11 00:29:46,742][25689] Fps is (10 sec: 5600.4, 60 sec: 5532.5, 300 sec: 5529.8). Total num frames: 980537344. Throughput: 0: 5009.7. Samples: 980530152. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:46,743][25689] Avg episode reward: [(0, '0.705')] [2022-07-11 00:29:47,895][26022] Updated weights on worker 0-0, policy_version 957562 (0.00086) [2022-07-11 00:29:49,736][26022] Updated weights on worker 0-0, policy_version 957572 (0.00091) [2022-07-11 00:29:51,417][26022] Updated weights on worker 0-0, policy_version 957582 (0.00089) [2022-07-11 00:29:51,767][25689] Fps is (10 sec: 5592.8, 60 sec: 5571.8, 300 sec: 5532.9). Total num frames: 980566016. Throughput: 0: 5858.9. Samples: 980563844. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 00:29:51,769][25689] Avg episode reward: [(0, '0.761')] [2022-07-11 00:29:53,321][26022] Updated weights on worker 0-0, policy_version 957592 (0.00086) [2022-07-11 00:29:54,970][26022] Updated weights on worker 0-0, policy_version 957602 (0.00095) [2022-07-11 00:29:56,776][25689] Fps is (10 sec: 5511.3, 60 sec: 5539.1, 300 sec: 5533.2). Total num frames: 980592640. Throughput: 0: 5876.7. Samples: 980597474. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:29:56,776][25689] Avg episode reward: [(0, '0.443')] [2022-07-11 00:29:57,014][26022] Updated weights on worker 0-0, policy_version 957612 (0.00092) [2022-07-11 00:29:58,679][26022] Updated weights on worker 0-0, policy_version 957622 (0.00095) [2022-07-11 00:30:00,497][26022] Updated weights on worker 0-0, policy_version 957632 (0.00085) [2022-07-11 00:30:01,785][25689] Fps is (10 sec: 5417.7, 60 sec: 5523.0, 300 sec: 5533.9). Total num frames: 980620288. Throughput: 0: 5813.0. Samples: 980629982. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:01,785][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 00:30:02,746][26022] Updated weights on worker 0-0, policy_version 957642 (0.00085) [2022-07-11 00:30:04,675][26022] Updated weights on worker 0-0, policy_version 957652 (0.00090) [2022-07-11 00:30:06,415][26022] Updated weights on worker 0-0, policy_version 957662 (0.00092) [2022-07-11 00:30:06,955][25689] Fps is (10 sec: 5532.6, 60 sec: 5569.9, 300 sec: 5535.2). Total num frames: 980648960. Throughput: 0: 5721.8. Samples: 980645806. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:06,956][25689] Avg episode reward: [(0, '1.238')] [2022-07-11 00:30:08,351][26022] Updated weights on worker 0-0, policy_version 957672 (0.00096) [2022-07-11 00:30:09,990][26022] Updated weights on worker 0-0, policy_version 957682 (0.00082) [2022-07-11 00:30:11,846][26022] Updated weights on worker 0-0, policy_version 957692 (0.00081) [2022-07-11 00:30:11,962][25689] Fps is (10 sec: 5634.4, 60 sec: 5556.0, 300 sec: 5536.8). Total num frames: 980677632. Throughput: 0: 5736.2. Samples: 980679686. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:11,963][25689] Avg episode reward: [(0, '0.083')] [2022-07-11 00:30:13,821][26022] Updated weights on worker 0-0, policy_version 957702 (0.00081) [2022-07-11 00:30:15,443][26022] Updated weights on worker 0-0, policy_version 957712 (0.00085) [2022-07-11 00:30:16,991][25689] Fps is (10 sec: 5509.8, 60 sec: 5542.1, 300 sec: 5533.1). Total num frames: 980704256. Throughput: 0: 5743.0. Samples: 980713572. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:16,992][25689] Avg episode reward: [(0, '-0.096')] [2022-07-11 00:30:17,429][26022] Updated weights on worker 0-0, policy_version 957722 (0.00090) [2022-07-11 00:30:19,214][26022] Updated weights on worker 0-0, policy_version 957732 (0.00089) [2022-07-11 00:30:21,000][26022] Updated weights on worker 0-0, policy_version 957742 (0.00108) [2022-07-11 00:30:22,069][25689] Fps is (10 sec: 5572.6, 60 sec: 5585.9, 300 sec: 5539.3). Total num frames: 980733952. Throughput: 0: 4940.1. Samples: 980730188. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:22,070][25689] Avg episode reward: [(0, '-0.733')] [2022-07-11 00:30:22,915][26022] Updated weights on worker 0-0, policy_version 957752 (0.00087) [2022-07-11 00:30:24,648][26022] Updated weights on worker 0-0, policy_version 957762 (0.00097) [2022-07-11 00:30:26,595][26022] Updated weights on worker 0-0, policy_version 957772 (0.00090) [2022-07-11 00:30:27,157][25689] Fps is (10 sec: 5539.8, 60 sec: 5535.1, 300 sec: 5532.1). Total num frames: 980760576. Throughput: 0: 5837.1. Samples: 980763728. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:27,158][25689] Avg episode reward: [(0, '0.012')] [2022-07-11 00:30:28,339][26022] Updated weights on worker 0-0, policy_version 957782 (0.00095) [2022-07-11 00:30:30,224][26022] Updated weights on worker 0-0, policy_version 957792 (0.00093) [2022-07-11 00:30:32,038][26022] Updated weights on worker 0-0, policy_version 957802 (0.00085) [2022-07-11 00:30:32,237][25689] Fps is (10 sec: 5438.1, 60 sec: 5545.4, 300 sec: 5535.5). Total num frames: 980789248. Throughput: 0: 5783.5. Samples: 980796946. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:32,238][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 00:30:33,920][26022] Updated weights on worker 0-0, policy_version 957812 (0.00085) [2022-07-11 00:30:35,818][26022] Updated weights on worker 0-0, policy_version 957822 (0.00419) [2022-07-11 00:30:37,309][25689] Fps is (10 sec: 5548.1, 60 sec: 5540.4, 300 sec: 5535.9). Total num frames: 980816896. Throughput: 0: 4937.8. Samples: 980813896. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:37,309][25689] Avg episode reward: [(0, '0.241')] [2022-07-11 00:30:37,525][26022] Updated weights on worker 0-0, policy_version 957832 (0.00091) [2022-07-11 00:30:39,470][26022] Updated weights on worker 0-0, policy_version 957842 (0.00092) [2022-07-11 00:30:41,223][26022] Updated weights on worker 0-0, policy_version 957852 (0.00091) [2022-07-11 00:30:42,372][25689] Fps is (10 sec: 5658.1, 60 sec: 5552.5, 300 sec: 5540.8). Total num frames: 980846592. Throughput: 0: 5779.9. Samples: 980847538. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:42,373][25689] Avg episode reward: [(0, '0.824')] [2022-07-11 00:30:43,057][26022] Updated weights on worker 0-0, policy_version 957862 (0.00089) [2022-07-11 00:30:44,840][26022] Updated weights on worker 0-0, policy_version 957872 (0.00083) [2022-07-11 00:30:46,841][26022] Updated weights on worker 0-0, policy_version 957882 (0.00089) [2022-07-11 00:30:47,434][25689] Fps is (10 sec: 5663.2, 60 sec: 5550.9, 300 sec: 5540.1). Total num frames: 980874240. Throughput: 0: 5793.1. Samples: 980881194. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:47,435][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 00:30:48,661][26022] Updated weights on worker 0-0, policy_version 957892 (0.00089) [2022-07-11 00:30:50,386][26022] Updated weights on worker 0-0, policy_version 957902 (0.00054) [2022-07-11 00:30:52,186][26022] Updated weights on worker 0-0, policy_version 957912 (0.00086) [2022-07-11 00:30:52,446][25689] Fps is (10 sec: 5590.9, 60 sec: 5552.2, 300 sec: 5544.0). Total num frames: 980902912. Throughput: 0: 5004.5. Samples: 980898080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:52,446][25689] Avg episode reward: [(0, '1.328')] [2022-07-11 00:30:54,170][26022] Updated weights on worker 0-0, policy_version 957922 (0.00088) [2022-07-11 00:30:55,718][26022] Updated weights on worker 0-0, policy_version 957932 (0.00092) [2022-07-11 00:30:57,468][25689] Fps is (10 sec: 5613.3, 60 sec: 5567.8, 300 sec: 5540.6). Total num frames: 980930560. Throughput: 0: 5841.2. Samples: 980931648. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:30:57,468][25689] Avg episode reward: [(0, '-0.032')] [2022-07-11 00:30:57,861][26022] Updated weights on worker 0-0, policy_version 957942 (0.00083) [2022-07-11 00:30:59,598][26022] Updated weights on worker 0-0, policy_version 957952 (0.00089) [2022-07-11 00:31:01,899][26022] Updated weights on worker 0-0, policy_version 957962 (0.00083) [2022-07-11 00:31:02,495][25689] Fps is (10 sec: 5400.8, 60 sec: 5549.3, 300 sec: 5545.6). Total num frames: 980957184. Throughput: 0: 5749.5. Samples: 980963232. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:02,497][25689] Avg episode reward: [(0, '0.034')] [2022-07-11 00:31:03,507][26022] Updated weights on worker 0-0, policy_version 957972 (0.00088) [2022-07-11 00:31:05,378][26022] Updated weights on worker 0-0, policy_version 957982 (0.00085) [2022-07-11 00:31:07,177][26022] Updated weights on worker 0-0, policy_version 957992 (0.00084) [2022-07-11 00:31:07,627][25689] Fps is (10 sec: 5443.0, 60 sec: 5552.8, 300 sec: 5543.6). Total num frames: 980985856. Throughput: 0: 4888.9. Samples: 980979912. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:07,627][25689] Avg episode reward: [(0, '0.060')] [2022-07-11 00:31:09,101][26022] Updated weights on worker 0-0, policy_version 958002 (0.00079) [2022-07-11 00:31:10,780][26022] Updated weights on worker 0-0, policy_version 958012 (0.00089) [2022-07-11 00:31:12,642][25689] Fps is (10 sec: 5449.2, 60 sec: 5518.3, 300 sec: 5544.4). Total num frames: 981012480. Throughput: 0: 5724.0. Samples: 981013684. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:12,644][25689] Avg episode reward: [(0, '0.338')] [2022-07-11 00:31:12,742][26022] Updated weights on worker 0-0, policy_version 958022 (0.00084) [2022-07-11 00:31:14,483][26022] Updated weights on worker 0-0, policy_version 958032 (0.00090) [2022-07-11 00:31:16,155][26022] Updated weights on worker 0-0, policy_version 958042 (0.00081) [2022-07-11 00:31:17,746][25689] Fps is (10 sec: 5565.8, 60 sec: 5562.0, 300 sec: 5546.6). Total num frames: 981042176. Throughput: 0: 5705.4. Samples: 981047342. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:17,749][25689] Avg episode reward: [(0, '0.646')] [2022-07-11 00:31:18,159][26022] Updated weights on worker 0-0, policy_version 958052 (0.00096) [2022-07-11 00:31:18,869][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:31:18,879][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000958055_981048320.pth [2022-07-11 00:31:18,880][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000956106_979052544.pth [2022-07-11 00:31:19,995][26022] Updated weights on worker 0-0, policy_version 958062 (0.00085) [2022-07-11 00:31:21,860][26022] Updated weights on worker 0-0, policy_version 958072 (0.00093) [2022-07-11 00:31:22,825][25689] Fps is (10 sec: 5732.0, 60 sec: 5545.1, 300 sec: 5543.3). Total num frames: 981070848. Throughput: 0: 4963.3. Samples: 981064138. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:22,826][25689] Avg episode reward: [(0, '0.732')] [2022-07-11 00:31:23,763][26022] Updated weights on worker 0-0, policy_version 958082 (0.00076) [2022-07-11 00:31:25,426][26022] Updated weights on worker 0-0, policy_version 958092 (0.00088) [2022-07-11 00:31:27,348][26022] Updated weights on worker 0-0, policy_version 958102 (0.00087) [2022-07-11 00:31:27,895][25689] Fps is (10 sec: 5549.4, 60 sec: 5563.6, 300 sec: 5545.7). Total num frames: 981098496. Throughput: 0: 5806.1. Samples: 981097590. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:27,895][25689] Avg episode reward: [(0, '1.692')] [2022-07-11 00:31:29,213][26022] Updated weights on worker 0-0, policy_version 958112 (0.00094) [2022-07-11 00:31:31,041][26022] Updated weights on worker 0-0, policy_version 958122 (0.00082) [2022-07-11 00:31:32,930][25689] Fps is (10 sec: 5472.3, 60 sec: 5550.9, 300 sec: 5545.7). Total num frames: 981126144. Throughput: 0: 5787.5. Samples: 981131098. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:32,930][25689] Avg episode reward: [(0, '1.596')] [2022-07-11 00:31:33,071][26022] Updated weights on worker 0-0, policy_version 958132 (0.00098) [2022-07-11 00:31:34,868][26022] Updated weights on worker 0-0, policy_version 958142 (0.00083) [2022-07-11 00:31:36,539][26022] Updated weights on worker 0-0, policy_version 958152 (0.00095) [2022-07-11 00:31:37,955][25689] Fps is (10 sec: 5496.4, 60 sec: 5555.1, 300 sec: 5542.5). Total num frames: 981153792. Throughput: 0: 4963.4. Samples: 981147650. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:37,956][25689] Avg episode reward: [(0, '0.862')] [2022-07-11 00:31:38,560][26022] Updated weights on worker 0-0, policy_version 958162 (0.00090) [2022-07-11 00:31:40,041][26022] Updated weights on worker 0-0, policy_version 958172 (0.00094) [2022-07-11 00:31:42,310][26022] Updated weights on worker 0-0, policy_version 958182 (0.00094) [2022-07-11 00:31:42,967][25689] Fps is (10 sec: 5713.1, 60 sec: 5559.8, 300 sec: 5543.8). Total num frames: 981183488. Throughput: 0: 5804.2. Samples: 981181046. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:42,968][25689] Avg episode reward: [(0, '0.634')] [2022-07-11 00:31:44,067][26022] Updated weights on worker 0-0, policy_version 958192 (0.00089) [2022-07-11 00:31:45,730][26022] Updated weights on worker 0-0, policy_version 958202 (0.00089) [2022-07-11 00:31:47,700][26022] Updated weights on worker 0-0, policy_version 958212 (0.00089) [2022-07-11 00:31:48,011][25689] Fps is (10 sec: 5499.1, 60 sec: 5527.7, 300 sec: 5536.1). Total num frames: 981209088. Throughput: 0: 5809.6. Samples: 981214456. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:48,011][25689] Avg episode reward: [(0, '0.467')] [2022-07-11 00:31:49,419][26022] Updated weights on worker 0-0, policy_version 958222 (0.00079) [2022-07-11 00:31:51,248][26022] Updated weights on worker 0-0, policy_version 958232 (0.00087) [2022-07-11 00:31:53,014][25689] Fps is (10 sec: 5503.9, 60 sec: 5545.4, 300 sec: 5543.9). Total num frames: 981238784. Throughput: 0: 5824.4. Samples: 981248074. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:53,014][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 00:31:53,059][26022] Updated weights on worker 0-0, policy_version 958242 (0.00088) [2022-07-11 00:31:54,840][26022] Updated weights on worker 0-0, policy_version 958252 (0.00088) [2022-07-11 00:31:56,868][26022] Updated weights on worker 0-0, policy_version 958262 (0.00086) [2022-07-11 00:31:58,019][25689] Fps is (10 sec: 5729.8, 60 sec: 5546.9, 300 sec: 5540.7). Total num frames: 981266432. Throughput: 0: 5840.8. Samples: 981264838. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:31:58,019][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 00:31:58,796][26022] Updated weights on worker 0-0, policy_version 958272 (0.00088) [2022-07-11 00:32:00,394][26022] Updated weights on worker 0-0, policy_version 958282 (0.00090) [2022-07-11 00:32:02,787][26022] Updated weights on worker 0-0, policy_version 958292 (0.00080) [2022-07-11 00:32:03,055][25689] Fps is (10 sec: 5303.0, 60 sec: 5529.2, 300 sec: 5540.8). Total num frames: 981292032. Throughput: 0: 5841.8. Samples: 981298394. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:03,055][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 00:32:04,563][26022] Updated weights on worker 0-0, policy_version 958302 (0.00088) [2022-07-11 00:32:06,508][26022] Updated weights on worker 0-0, policy_version 958312 (0.00092) [2022-07-11 00:32:08,098][25689] Fps is (10 sec: 5384.4, 60 sec: 5537.3, 300 sec: 5540.5). Total num frames: 981320704. Throughput: 0: 5725.4. Samples: 981329464. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:08,099][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 00:32:08,348][26022] Updated weights on worker 0-0, policy_version 958322 (0.00097) [2022-07-11 00:32:10,354][26022] Updated weights on worker 0-0, policy_version 958332 (0.00092) [2022-07-11 00:32:11,915][26022] Updated weights on worker 0-0, policy_version 958342 (0.00093) [2022-07-11 00:32:13,106][25689] Fps is (10 sec: 5501.4, 60 sec: 5538.0, 300 sec: 5537.7). Total num frames: 981347328. Throughput: 0: 4870.1. Samples: 981345926. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:13,106][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 00:32:14,006][26022] Updated weights on worker 0-0, policy_version 958352 (0.00089) [2022-07-11 00:32:15,486][26022] Updated weights on worker 0-0, policy_version 958362 (0.00078) [2022-07-11 00:32:17,695][26022] Updated weights on worker 0-0, policy_version 958372 (0.00092) [2022-07-11 00:32:18,121][25689] Fps is (10 sec: 5414.6, 60 sec: 5512.2, 300 sec: 5534.0). Total num frames: 981374976. Throughput: 0: 5696.3. Samples: 981379350. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:18,123][25689] Avg episode reward: [(0, '0.513')] [2022-07-11 00:32:19,337][26022] Updated weights on worker 0-0, policy_version 958382 (0.00086) [2022-07-11 00:32:21,202][26022] Updated weights on worker 0-0, policy_version 958392 (0.00089) [2022-07-11 00:32:23,014][26022] Updated weights on worker 0-0, policy_version 958402 (0.00090) [2022-07-11 00:32:23,126][25689] Fps is (10 sec: 5620.9, 60 sec: 5519.0, 300 sec: 5541.9). Total num frames: 981403648. Throughput: 0: 5711.9. Samples: 981413038. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:23,126][25689] Avg episode reward: [(0, '0.504')] [2022-07-11 00:32:24,907][26022] Updated weights on worker 0-0, policy_version 958412 (0.00089) [2022-07-11 00:32:26,796][26022] Updated weights on worker 0-0, policy_version 958422 (0.00089) [2022-07-11 00:32:28,171][25689] Fps is (10 sec: 5706.3, 60 sec: 5538.3, 300 sec: 5538.1). Total num frames: 981432320. Throughput: 0: 4991.6. Samples: 981429658. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:28,173][25689] Avg episode reward: [(0, '0.520')] [2022-07-11 00:32:28,482][26022] Updated weights on worker 0-0, policy_version 958432 (0.00093) [2022-07-11 00:32:30,423][26022] Updated weights on worker 0-0, policy_version 958442 (0.00091) [2022-07-11 00:32:32,172][26022] Updated weights on worker 0-0, policy_version 958452 (0.00085) [2022-07-11 00:32:33,182][25689] Fps is (10 sec: 5600.4, 60 sec: 5540.4, 300 sec: 5542.7). Total num frames: 981459968. Throughput: 0: 5848.2. Samples: 981463336. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:33,184][25689] Avg episode reward: [(0, '0.904')] [2022-07-11 00:32:34,004][26022] Updated weights on worker 0-0, policy_version 958462 (0.00094) [2022-07-11 00:32:35,858][26022] Updated weights on worker 0-0, policy_version 958472 (0.00093) [2022-07-11 00:32:37,797][26022] Updated weights on worker 0-0, policy_version 958482 (0.00095) [2022-07-11 00:32:38,203][25689] Fps is (10 sec: 5613.7, 60 sec: 5557.8, 300 sec: 5546.3). Total num frames: 981488640. Throughput: 0: 5846.7. Samples: 981496762. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:38,204][25689] Avg episode reward: [(0, '1.195')] [2022-07-11 00:32:39,642][26022] Updated weights on worker 0-0, policy_version 958492 (0.00092) [2022-07-11 00:32:41,324][26022] Updated weights on worker 0-0, policy_version 958502 (0.00087) [2022-07-11 00:32:43,210][25689] Fps is (10 sec: 5514.3, 60 sec: 5507.3, 300 sec: 5540.4). Total num frames: 981515264. Throughput: 0: 4993.1. Samples: 981513320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:43,210][25689] Avg episode reward: [(0, '1.496')] [2022-07-11 00:32:43,362][26022] Updated weights on worker 0-0, policy_version 958512 (0.00089) [2022-07-11 00:32:45,127][26022] Updated weights on worker 0-0, policy_version 958522 (0.00088) [2022-07-11 00:32:47,073][26022] Updated weights on worker 0-0, policy_version 958532 (0.00085) [2022-07-11 00:32:48,249][25689] Fps is (10 sec: 5504.4, 60 sec: 5558.7, 300 sec: 5543.6). Total num frames: 981543936. Throughput: 0: 5833.5. Samples: 981546786. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:48,250][25689] Avg episode reward: [(0, '1.618')] [2022-07-11 00:32:48,679][26022] Updated weights on worker 0-0, policy_version 958542 (0.00082) [2022-07-11 00:32:50,699][26022] Updated weights on worker 0-0, policy_version 958552 (0.00083) [2022-07-11 00:32:52,338][26022] Updated weights on worker 0-0, policy_version 958562 (0.00093) [2022-07-11 00:32:53,258][25689] Fps is (10 sec: 5605.2, 60 sec: 5524.2, 300 sec: 5540.2). Total num frames: 981571584. Throughput: 0: 5832.8. Samples: 981580432. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:53,258][25689] Avg episode reward: [(0, '1.464')] [2022-07-11 00:32:54,379][26022] Updated weights on worker 0-0, policy_version 958572 (0.00096) [2022-07-11 00:32:56,060][26022] Updated weights on worker 0-0, policy_version 958582 (0.00091) [2022-07-11 00:32:57,864][26022] Updated weights on worker 0-0, policy_version 958592 (0.00078) [2022-07-11 00:32:58,279][25689] Fps is (10 sec: 5513.2, 60 sec: 5522.7, 300 sec: 5536.5). Total num frames: 981599232. Throughput: 0: 4997.0. Samples: 981597080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:32:58,280][25689] Avg episode reward: [(0, '1.626')] [2022-07-11 00:32:59,762][26022] Updated weights on worker 0-0, policy_version 958602 (0.00089) [2022-07-11 00:33:01,766][26022] Updated weights on worker 0-0, policy_version 958612 (0.00081) [2022-07-11 00:33:03,286][25689] Fps is (10 sec: 5411.7, 60 sec: 5542.3, 300 sec: 5537.3). Total num frames: 981625856. Throughput: 0: 5769.7. Samples: 981629156. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:03,287][25689] Avg episode reward: [(0, '1.505')] [2022-07-11 00:33:04,122][26022] Updated weights on worker 0-0, policy_version 958622 (0.00089) [2022-07-11 00:33:05,648][26022] Updated weights on worker 0-0, policy_version 958632 (0.00090) [2022-07-11 00:33:07,713][26022] Updated weights on worker 0-0, policy_version 958642 (0.00094) [2022-07-11 00:33:08,371][25689] Fps is (10 sec: 5377.7, 60 sec: 5521.6, 300 sec: 5537.4). Total num frames: 981653504. Throughput: 0: 5702.6. Samples: 981661532. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:08,372][25689] Avg episode reward: [(0, '1.141')] [2022-07-11 00:33:09,394][26022] Updated weights on worker 0-0, policy_version 958652 (0.00087) [2022-07-11 00:33:11,488][26022] Updated weights on worker 0-0, policy_version 958662 (0.00091) [2022-07-11 00:33:13,042][26022] Updated weights on worker 0-0, policy_version 958672 (0.00082) [2022-07-11 00:33:13,393][25689] Fps is (10 sec: 5471.3, 60 sec: 5537.2, 300 sec: 5533.8). Total num frames: 981681152. Throughput: 0: 4841.0. Samples: 981677908. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:13,394][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 00:33:15,195][26022] Updated weights on worker 0-0, policy_version 958682 (0.00088) [2022-07-11 00:33:16,935][26022] Updated weights on worker 0-0, policy_version 958692 (0.00270) [2022-07-11 00:33:18,427][25689] Fps is (10 sec: 5397.0, 60 sec: 5518.6, 300 sec: 5530.1). Total num frames: 981707776. Throughput: 0: 5679.3. Samples: 981711506. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:18,427][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 00:33:18,775][26022] Updated weights on worker 0-0, policy_version 958702 (0.00087) [2022-07-11 00:33:18,925][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:33:18,940][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000958703_981711872.pth [2022-07-11 00:33:18,941][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000956754_979716096.pth [2022-07-11 00:33:20,424][26022] Updated weights on worker 0-0, policy_version 958712 (0.00091) [2022-07-11 00:33:22,375][26022] Updated weights on worker 0-0, policy_version 958722 (0.00082) [2022-07-11 00:33:23,434][25689] Fps is (10 sec: 5711.2, 60 sec: 5552.3, 300 sec: 5544.9). Total num frames: 981738496. Throughput: 0: 5756.3. Samples: 981745130. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:23,434][25689] Avg episode reward: [(0, '0.796')] [2022-07-11 00:33:24,115][26022] Updated weights on worker 0-0, policy_version 958732 (0.00085) [2022-07-11 00:33:26,288][26022] Updated weights on worker 0-0, policy_version 958742 (0.00088) [2022-07-11 00:33:27,850][26022] Updated weights on worker 0-0, policy_version 958752 (0.00092) [2022-07-11 00:33:28,514][25689] Fps is (10 sec: 5583.4, 60 sec: 5498.1, 300 sec: 5533.3). Total num frames: 981764096. Throughput: 0: 4969.3. Samples: 981761628. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:28,515][25689] Avg episode reward: [(0, '1.120')] [2022-07-11 00:33:29,832][26022] Updated weights on worker 0-0, policy_version 958762 (0.00088) [2022-07-11 00:33:31,693][26022] Updated weights on worker 0-0, policy_version 958772 (0.00096) [2022-07-11 00:33:33,334][26022] Updated weights on worker 0-0, policy_version 958782 (0.00086) [2022-07-11 00:33:33,572][25689] Fps is (10 sec: 5555.0, 60 sec: 5544.7, 300 sec: 5546.3). Total num frames: 981794816. Throughput: 0: 5800.2. Samples: 981794952. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:33,573][25689] Avg episode reward: [(0, '0.958')] [2022-07-11 00:33:35,400][26022] Updated weights on worker 0-0, policy_version 958792 (0.00096) [2022-07-11 00:33:37,126][26022] Updated weights on worker 0-0, policy_version 958802 (0.00086) [2022-07-11 00:33:38,624][25689] Fps is (10 sec: 5570.7, 60 sec: 5491.1, 300 sec: 5531.7). Total num frames: 981820416. Throughput: 0: 5785.8. Samples: 981828362. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:38,627][25689] Avg episode reward: [(0, '0.438')] [2022-07-11 00:33:39,022][26022] Updated weights on worker 0-0, policy_version 958812 (0.00088) [2022-07-11 00:33:40,951][26022] Updated weights on worker 0-0, policy_version 958822 (0.00088) [2022-07-11 00:33:42,659][26022] Updated weights on worker 0-0, policy_version 958832 (0.00087) [2022-07-11 00:33:43,646][25689] Fps is (10 sec: 5285.8, 60 sec: 5506.6, 300 sec: 5532.7). Total num frames: 981848064. Throughput: 0: 4934.5. Samples: 981844868. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2022-07-11 00:33:43,648][25689] Avg episode reward: [(0, '0.566')] [2022-07-11 00:33:44,567][26022] Updated weights on worker 0-0, policy_version 958842 (0.00091) [2022-07-11 00:33:46,457][26022] Updated weights on worker 0-0, policy_version 958852 (0.00085) [2022-07-11 00:33:48,157][26022] Updated weights on worker 0-0, policy_version 958862 (0.00082) [2022-07-11 00:33:48,743][25689] Fps is (10 sec: 5667.2, 60 sec: 5518.4, 300 sec: 5542.8). Total num frames: 981877760. Throughput: 0: 5760.9. Samples: 981878164. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:33:48,743][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 00:33:50,247][26022] Updated weights on worker 0-0, policy_version 958872 (0.00099) [2022-07-11 00:33:51,875][26022] Updated weights on worker 0-0, policy_version 958882 (0.00084) [2022-07-11 00:33:53,792][25689] Fps is (10 sec: 5652.1, 60 sec: 5514.7, 300 sec: 5538.8). Total num frames: 981905408. Throughput: 0: 5776.9. Samples: 981911756. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:33:53,792][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 00:33:53,792][26022] Updated weights on worker 0-0, policy_version 958892 (0.00090) [2022-07-11 00:33:55,590][26022] Updated weights on worker 0-0, policy_version 958902 (0.00092) [2022-07-11 00:33:57,371][26022] Updated weights on worker 0-0, policy_version 958912 (0.00415) [2022-07-11 00:33:58,830][25689] Fps is (10 sec: 5379.9, 60 sec: 5496.1, 300 sec: 5531.5). Total num frames: 981932032. Throughput: 0: 4951.0. Samples: 981928404. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:33:58,831][25689] Avg episode reward: [(0, '0.263')] [2022-07-11 00:33:59,431][26022] Updated weights on worker 0-0, policy_version 958922 (0.00089) [2022-07-11 00:34:01,024][26022] Updated weights on worker 0-0, policy_version 958932 (0.00090) [2022-07-11 00:34:03,347][26022] Updated weights on worker 0-0, policy_version 958942 (0.00087) [2022-07-11 00:34:03,929][25689] Fps is (10 sec: 5353.4, 60 sec: 5504.7, 300 sec: 5539.0). Total num frames: 981959680. Throughput: 0: 5674.9. Samples: 981959974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:03,932][25689] Avg episode reward: [(0, '0.172')] [2022-07-11 00:34:05,227][26022] Updated weights on worker 0-0, policy_version 958952 (0.00082) [2022-07-11 00:34:06,956][26022] Updated weights on worker 0-0, policy_version 958962 (0.00087) [2022-07-11 00:34:08,911][26022] Updated weights on worker 0-0, policy_version 958972 (0.00090) [2022-07-11 00:34:09,006][25689] Fps is (10 sec: 5434.3, 60 sec: 5505.5, 300 sec: 5531.4). Total num frames: 981987328. Throughput: 0: 5692.6. Samples: 981993514. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:09,006][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 00:34:10,687][26022] Updated weights on worker 0-0, policy_version 958982 (0.00091) [2022-07-11 00:34:12,560][26022] Updated weights on worker 0-0, policy_version 958992 (0.00113) [2022-07-11 00:34:14,015][25689] Fps is (10 sec: 5584.3, 60 sec: 5523.5, 300 sec: 5535.8). Total num frames: 982016000. Throughput: 0: 4865.0. Samples: 982010142. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:14,016][25689] Avg episode reward: [(0, '1.316')] [2022-07-11 00:34:14,532][26022] Updated weights on worker 0-0, policy_version 959002 (0.00093) [2022-07-11 00:34:16,232][26022] Updated weights on worker 0-0, policy_version 959012 (0.00085) [2022-07-11 00:34:18,080][26022] Updated weights on worker 0-0, policy_version 959022 (0.00087) [2022-07-11 00:34:19,034][25689] Fps is (10 sec: 5615.9, 60 sec: 5541.8, 300 sec: 5538.9). Total num frames: 982043648. Throughput: 0: 5705.9. Samples: 982043686. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:19,035][25689] Avg episode reward: [(0, '1.223')] [2022-07-11 00:34:19,976][26022] Updated weights on worker 0-0, policy_version 959032 (0.00072) [2022-07-11 00:34:21,667][26022] Updated weights on worker 0-0, policy_version 959042 (0.00085) [2022-07-11 00:34:23,472][26022] Updated weights on worker 0-0, policy_version 959052 (0.00089) [2022-07-11 00:34:24,091][25689] Fps is (10 sec: 5589.4, 60 sec: 5503.4, 300 sec: 5536.1). Total num frames: 982072320. Throughput: 0: 5827.2. Samples: 982077460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:24,092][25689] Avg episode reward: [(0, '0.959')] [2022-07-11 00:34:25,391][26022] Updated weights on worker 0-0, policy_version 959062 (0.00096) [2022-07-11 00:34:27,108][26022] Updated weights on worker 0-0, policy_version 959072 (0.00090) [2022-07-11 00:34:29,105][26022] Updated weights on worker 0-0, policy_version 959082 (0.00092) [2022-07-11 00:34:29,139][25689] Fps is (10 sec: 5573.6, 60 sec: 5540.1, 300 sec: 5535.3). Total num frames: 982099968. Throughput: 0: 4996.2. Samples: 982094104. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:29,140][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 00:34:30,931][26022] Updated weights on worker 0-0, policy_version 959092 (0.00089) [2022-07-11 00:34:32,595][26022] Updated weights on worker 0-0, policy_version 959102 (0.00083) [2022-07-11 00:34:34,141][25689] Fps is (10 sec: 5502.2, 60 sec: 5494.6, 300 sec: 5535.6). Total num frames: 982127616. Throughput: 0: 5846.5. Samples: 982127810. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:34,143][25689] Avg episode reward: [(0, '0.704')] [2022-07-11 00:34:34,654][26022] Updated weights on worker 0-0, policy_version 959112 (0.00090) [2022-07-11 00:34:36,181][26022] Updated weights on worker 0-0, policy_version 959122 (0.00092) [2022-07-11 00:34:38,247][26022] Updated weights on worker 0-0, policy_version 959132 (0.00091) [2022-07-11 00:34:39,149][25689] Fps is (10 sec: 5626.3, 60 sec: 5549.3, 300 sec: 5535.7). Total num frames: 982156288. Throughput: 0: 5846.3. Samples: 982161284. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:39,151][25689] Avg episode reward: [(0, '0.706')] [2022-07-11 00:34:39,953][26022] Updated weights on worker 0-0, policy_version 959142 (0.00087) [2022-07-11 00:34:41,827][26022] Updated weights on worker 0-0, policy_version 959152 (0.00387) [2022-07-11 00:34:43,901][26022] Updated weights on worker 0-0, policy_version 959162 (0.00083) [2022-07-11 00:34:44,243][25689] Fps is (10 sec: 5575.3, 60 sec: 5542.7, 300 sec: 5534.8). Total num frames: 982183936. Throughput: 0: 4991.5. Samples: 982178046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:44,245][25689] Avg episode reward: [(0, '0.596')] [2022-07-11 00:34:45,424][26022] Updated weights on worker 0-0, policy_version 959172 (0.00097) [2022-07-11 00:34:47,429][26022] Updated weights on worker 0-0, policy_version 959182 (0.00087) [2022-07-11 00:34:49,199][26022] Updated weights on worker 0-0, policy_version 959192 (0.00090) [2022-07-11 00:34:49,345][25689] Fps is (10 sec: 5523.9, 60 sec: 5525.3, 300 sec: 5533.3). Total num frames: 982212608. Throughput: 0: 5822.7. Samples: 982211756. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:49,346][25689] Avg episode reward: [(0, '0.672')] [2022-07-11 00:34:51,007][26022] Updated weights on worker 0-0, policy_version 959202 (0.00089) [2022-07-11 00:34:52,785][26022] Updated weights on worker 0-0, policy_version 959212 (0.00087) [2022-07-11 00:34:54,371][25689] Fps is (10 sec: 5661.8, 60 sec: 5544.3, 300 sec: 5539.9). Total num frames: 982241280. Throughput: 0: 5827.1. Samples: 982245692. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:54,371][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 00:34:54,610][26022] Updated weights on worker 0-0, policy_version 959222 (0.00088) [2022-07-11 00:34:56,447][26022] Updated weights on worker 0-0, policy_version 959232 (0.00087) [2022-07-11 00:34:58,302][26022] Updated weights on worker 0-0, policy_version 959242 (0.00086) [2022-07-11 00:34:59,405][25689] Fps is (10 sec: 5598.3, 60 sec: 5561.7, 300 sec: 5539.4). Total num frames: 982268928. Throughput: 0: 5813.7. Samples: 982279046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:34:59,408][25689] Avg episode reward: [(0, '1.021')] [2022-07-11 00:35:00,126][26022] Updated weights on worker 0-0, policy_version 959252 (0.00086) [2022-07-11 00:35:02,326][26022] Updated weights on worker 0-0, policy_version 959262 (0.00090) [2022-07-11 00:35:04,191][26022] Updated weights on worker 0-0, policy_version 959272 (0.00081) [2022-07-11 00:35:04,459][25689] Fps is (10 sec: 5278.4, 60 sec: 5532.0, 300 sec: 5531.3). Total num frames: 982294528. Throughput: 0: 5711.9. Samples: 982293518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:04,462][25689] Avg episode reward: [(0, '1.307')] [2022-07-11 00:35:06,084][26022] Updated weights on worker 0-0, policy_version 959282 (0.00085) [2022-07-11 00:35:07,854][26022] Updated weights on worker 0-0, policy_version 959292 (0.00088) [2022-07-11 00:35:09,564][25689] Fps is (10 sec: 5342.5, 60 sec: 5546.3, 300 sec: 5529.4). Total num frames: 982323200. Throughput: 0: 5692.6. Samples: 982326854. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:09,564][25689] Avg episode reward: [(0, '1.413')] [2022-07-11 00:35:09,856][26022] Updated weights on worker 0-0, policy_version 959302 (0.00094) [2022-07-11 00:35:11,456][26022] Updated weights on worker 0-0, policy_version 959312 (0.00094) [2022-07-11 00:35:13,674][26022] Updated weights on worker 0-0, policy_version 959322 (0.00085) [2022-07-11 00:35:14,624][25689] Fps is (10 sec: 5741.8, 60 sec: 5558.5, 300 sec: 5539.2). Total num frames: 982352896. Throughput: 0: 5674.6. Samples: 982360622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:14,625][25689] Avg episode reward: [(0, '1.351')] [2022-07-11 00:35:15,335][26022] Updated weights on worker 0-0, policy_version 959332 (0.00093) [2022-07-11 00:35:17,067][26022] Updated weights on worker 0-0, policy_version 959342 (0.00100) [2022-07-11 00:35:18,793][26022] Updated weights on worker 0-0, policy_version 959352 (0.00088) [2022-07-11 00:35:19,141][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:35:19,151][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000959353_982377472.pth [2022-07-11 00:35:19,151][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000957404_980381696.pth [2022-07-11 00:35:19,659][25689] Fps is (10 sec: 5579.0, 60 sec: 5540.2, 300 sec: 5529.7). Total num frames: 982379520. Throughput: 0: 4859.2. Samples: 982377462. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:19,659][25689] Avg episode reward: [(0, '1.711')] [2022-07-11 00:35:20,713][26022] Updated weights on worker 0-0, policy_version 959362 (0.00088) [2022-07-11 00:35:22,576][26022] Updated weights on worker 0-0, policy_version 959372 (0.00091) [2022-07-11 00:35:24,488][26022] Updated weights on worker 0-0, policy_version 959382 (0.00099) [2022-07-11 00:35:24,663][25689] Fps is (10 sec: 5508.0, 60 sec: 5545.0, 300 sec: 5538.1). Total num frames: 982408192. Throughput: 0: 5813.3. Samples: 982410974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:24,664][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 00:35:26,422][26022] Updated weights on worker 0-0, policy_version 959392 (0.00082) [2022-07-11 00:35:28,056][26022] Updated weights on worker 0-0, policy_version 959402 (0.00087) [2022-07-11 00:35:29,739][25689] Fps is (10 sec: 5587.4, 60 sec: 5542.5, 300 sec: 5534.7). Total num frames: 982435840. Throughput: 0: 5807.5. Samples: 982444020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:29,739][25689] Avg episode reward: [(0, '1.931')] [2022-07-11 00:35:30,047][26022] Updated weights on worker 0-0, policy_version 959412 (0.00088) [2022-07-11 00:35:31,820][26022] Updated weights on worker 0-0, policy_version 959422 (0.00101) [2022-07-11 00:35:33,659][26022] Updated weights on worker 0-0, policy_version 959432 (0.00091) [2022-07-11 00:35:34,754][25689] Fps is (10 sec: 5581.3, 60 sec: 5558.1, 300 sec: 5539.2). Total num frames: 982464512. Throughput: 0: 4975.3. Samples: 982460774. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:34,755][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 00:35:35,701][26022] Updated weights on worker 0-0, policy_version 959442 (0.00086) [2022-07-11 00:35:37,268][26022] Updated weights on worker 0-0, policy_version 959452 (0.00091) [2022-07-11 00:35:39,271][26022] Updated weights on worker 0-0, policy_version 959462 (0.00083) [2022-07-11 00:35:39,759][25689] Fps is (10 sec: 5518.6, 60 sec: 5524.7, 300 sec: 5530.0). Total num frames: 982491136. Throughput: 0: 5804.0. Samples: 982494122. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:39,759][25689] Avg episode reward: [(0, '1.168')] [2022-07-11 00:35:41,133][26022] Updated weights on worker 0-0, policy_version 959472 (0.00093) [2022-07-11 00:35:43,018][26022] Updated weights on worker 0-0, policy_version 959482 (0.00086) [2022-07-11 00:35:44,805][25689] Fps is (10 sec: 5399.6, 60 sec: 5529.0, 300 sec: 5530.3). Total num frames: 982518784. Throughput: 0: 5785.8. Samples: 982527510. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:44,806][25689] Avg episode reward: [(0, '1.001')] [2022-07-11 00:35:44,842][26022] Updated weights on worker 0-0, policy_version 959492 (0.00094) [2022-07-11 00:35:46,491][26022] Updated weights on worker 0-0, policy_version 959502 (0.00091) [2022-07-11 00:35:48,595][26022] Updated weights on worker 0-0, policy_version 959512 (0.00085) [2022-07-11 00:35:49,924][25689] Fps is (10 sec: 5641.1, 60 sec: 5544.4, 300 sec: 5531.7). Total num frames: 982548480. Throughput: 0: 4964.3. Samples: 982544226. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:49,925][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 00:35:50,411][26022] Updated weights on worker 0-0, policy_version 959522 (0.00053) [2022-07-11 00:35:52,175][26022] Updated weights on worker 0-0, policy_version 959532 (0.00094) [2022-07-11 00:35:54,202][26022] Updated weights on worker 0-0, policy_version 959542 (0.00087) [2022-07-11 00:35:54,990][25689] Fps is (10 sec: 5529.7, 60 sec: 5506.9, 300 sec: 5527.5). Total num frames: 982575104. Throughput: 0: 5754.1. Samples: 982577214. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:35:54,991][25689] Avg episode reward: [(0, '0.723')] [2022-07-11 00:35:55,953][26022] Updated weights on worker 0-0, policy_version 959552 (0.00088) [2022-07-11 00:35:57,909][26022] Updated weights on worker 0-0, policy_version 959562 (0.00085) [2022-07-11 00:35:59,711][26022] Updated weights on worker 0-0, policy_version 959572 (0.00053) [2022-07-11 00:36:00,001][25689] Fps is (10 sec: 5487.4, 60 sec: 5525.9, 300 sec: 5534.6). Total num frames: 982603776. Throughput: 0: 5758.8. Samples: 982610694. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:00,001][25689] Avg episode reward: [(0, '0.643')] [2022-07-11 00:36:01,762][26022] Updated weights on worker 0-0, policy_version 959582 (0.00081) [2022-07-11 00:36:03,723][26022] Updated weights on worker 0-0, policy_version 959592 (0.00084) [2022-07-11 00:36:05,045][25689] Fps is (10 sec: 5397.4, 60 sec: 5526.8, 300 sec: 5525.9). Total num frames: 982629376. Throughput: 0: 4844.7. Samples: 982625568. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:05,046][25689] Avg episode reward: [(0, '1.401')] [2022-07-11 00:36:05,416][26022] Updated weights on worker 0-0, policy_version 959602 (0.00084) [2022-07-11 00:36:07,283][26022] Updated weights on worker 0-0, policy_version 959612 (0.00089) [2022-07-11 00:36:09,175][26022] Updated weights on worker 0-0, policy_version 959622 (0.00087) [2022-07-11 00:36:10,100][25689] Fps is (10 sec: 5374.0, 60 sec: 5531.4, 300 sec: 5532.1). Total num frames: 982658048. Throughput: 0: 5686.0. Samples: 982658946. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:10,113][25689] Avg episode reward: [(0, '1.393')] [2022-07-11 00:36:10,967][26022] Updated weights on worker 0-0, policy_version 959632 (0.00096) [2022-07-11 00:36:12,735][26022] Updated weights on worker 0-0, policy_version 959642 (0.00089) [2022-07-11 00:36:14,668][26022] Updated weights on worker 0-0, policy_version 959652 (0.00091) [2022-07-11 00:36:15,153][25689] Fps is (10 sec: 5572.0, 60 sec: 5498.2, 300 sec: 5526.2). Total num frames: 982685696. Throughput: 0: 5721.8. Samples: 982692582. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:15,154][25689] Avg episode reward: [(0, '1.606')] [2022-07-11 00:36:16,523][26022] Updated weights on worker 0-0, policy_version 959662 (0.00083) [2022-07-11 00:36:18,308][26022] Updated weights on worker 0-0, policy_version 959672 (0.00083) [2022-07-11 00:36:20,191][25689] Fps is (10 sec: 5581.3, 60 sec: 5531.8, 300 sec: 5526.9). Total num frames: 982714368. Throughput: 0: 4885.9. Samples: 982709340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:20,191][25689] Avg episode reward: [(0, '2.005')] [2022-07-11 00:36:20,195][26022] Updated weights on worker 0-0, policy_version 959682 (0.00092) [2022-07-11 00:36:21,831][26022] Updated weights on worker 0-0, policy_version 959692 (0.00627) [2022-07-11 00:36:23,807][26022] Updated weights on worker 0-0, policy_version 959702 (0.00094) [2022-07-11 00:36:25,275][25689] Fps is (10 sec: 5665.2, 60 sec: 5524.5, 300 sec: 5530.1). Total num frames: 982743040. Throughput: 0: 5810.1. Samples: 982743106. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:25,276][25689] Avg episode reward: [(0, '1.951')] [2022-07-11 00:36:25,766][26022] Updated weights on worker 0-0, policy_version 959712 (0.00100) [2022-07-11 00:36:27,316][26022] Updated weights on worker 0-0, policy_version 959722 (0.00054) [2022-07-11 00:36:29,439][26022] Updated weights on worker 0-0, policy_version 959732 (0.00091) [2022-07-11 00:36:30,399][25689] Fps is (10 sec: 5617.5, 60 sec: 5536.9, 300 sec: 5531.9). Total num frames: 982771712. Throughput: 0: 5800.2. Samples: 982776684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:30,400][25689] Avg episode reward: [(0, '1.735')] [2022-07-11 00:36:31,064][26022] Updated weights on worker 0-0, policy_version 959742 (0.00094) [2022-07-11 00:36:33,079][26022] Updated weights on worker 0-0, policy_version 959752 (0.00088) [2022-07-11 00:36:34,797][26022] Updated weights on worker 0-0, policy_version 959762 (0.00088) [2022-07-11 00:36:35,413][25689] Fps is (10 sec: 5555.6, 60 sec: 5520.2, 300 sec: 5532.1). Total num frames: 982799360. Throughput: 0: 4977.8. Samples: 982793430. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:35,413][25689] Avg episode reward: [(0, '1.770')] [2022-07-11 00:36:36,663][26022] Updated weights on worker 0-0, policy_version 959772 (0.00096) [2022-07-11 00:36:38,556][26022] Updated weights on worker 0-0, policy_version 959782 (0.00091) [2022-07-11 00:36:40,431][25689] Fps is (10 sec: 5409.9, 60 sec: 5519.0, 300 sec: 5521.7). Total num frames: 982825984. Throughput: 0: 5798.0. Samples: 982826692. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:40,431][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 00:36:40,441][26022] Updated weights on worker 0-0, policy_version 959792 (0.00088) [2022-07-11 00:36:42,241][26022] Updated weights on worker 0-0, policy_version 959802 (0.00095) [2022-07-11 00:36:44,138][26022] Updated weights on worker 0-0, policy_version 959812 (0.00085) [2022-07-11 00:36:45,451][25689] Fps is (10 sec: 5508.5, 60 sec: 5538.2, 300 sec: 5532.4). Total num frames: 982854656. Throughput: 0: 5808.1. Samples: 982860290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:45,452][25689] Avg episode reward: [(0, '0.441')] [2022-07-11 00:36:45,749][26022] Updated weights on worker 0-0, policy_version 959822 (0.00080) [2022-07-11 00:36:47,640][26022] Updated weights on worker 0-0, policy_version 959832 (0.00097) [2022-07-11 00:36:49,659][26022] Updated weights on worker 0-0, policy_version 959842 (0.00093) [2022-07-11 00:36:50,544][25689] Fps is (10 sec: 5771.3, 60 sec: 5540.6, 300 sec: 5530.7). Total num frames: 982884352. Throughput: 0: 4991.2. Samples: 982877234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:50,545][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 00:36:51,133][26022] Updated weights on worker 0-0, policy_version 959852 (0.00085) [2022-07-11 00:36:53,300][26022] Updated weights on worker 0-0, policy_version 959862 (0.00089) [2022-07-11 00:36:54,818][26022] Updated weights on worker 0-0, policy_version 959872 (0.00085) [2022-07-11 00:36:55,579][25689] Fps is (10 sec: 5662.2, 60 sec: 5560.4, 300 sec: 5530.2). Total num frames: 982912000. Throughput: 0: 5839.0. Samples: 982911178. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:36:55,579][25689] Avg episode reward: [(0, '0.067')] [2022-07-11 00:36:56,844][26022] Updated weights on worker 0-0, policy_version 959882 (0.00085) [2022-07-11 00:36:58,610][26022] Updated weights on worker 0-0, policy_version 959892 (0.00094) [2022-07-11 00:37:00,467][26022] Updated weights on worker 0-0, policy_version 959902 (0.00098) [2022-07-11 00:37:00,594][25689] Fps is (10 sec: 5502.1, 60 sec: 5543.0, 300 sec: 5537.5). Total num frames: 982939648. Throughput: 0: 5853.0. Samples: 982944710. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:00,595][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 00:37:02,515][26022] Updated weights on worker 0-0, policy_version 959912 (0.00903) [2022-07-11 00:37:04,573][26022] Updated weights on worker 0-0, policy_version 959922 (0.00063) [2022-07-11 00:37:05,602][25689] Fps is (10 sec: 5414.7, 60 sec: 5563.3, 300 sec: 5531.3). Total num frames: 982966272. Throughput: 0: 4919.9. Samples: 982959430. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:05,602][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 00:37:06,307][26022] Updated weights on worker 0-0, policy_version 959932 (0.00090) [2022-07-11 00:37:08,232][26022] Updated weights on worker 0-0, policy_version 959942 (0.00087) [2022-07-11 00:37:09,940][26022] Updated weights on worker 0-0, policy_version 959952 (0.00091) [2022-07-11 00:37:10,696][25689] Fps is (10 sec: 5372.8, 60 sec: 5542.8, 300 sec: 5533.1). Total num frames: 982993920. Throughput: 0: 5728.5. Samples: 982992670. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:10,696][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 00:37:11,933][26022] Updated weights on worker 0-0, policy_version 959962 (0.00082) [2022-07-11 00:37:13,845][26022] Updated weights on worker 0-0, policy_version 959972 (0.00092) [2022-07-11 00:37:15,413][26022] Updated weights on worker 0-0, policy_version 959982 (0.00082) [2022-07-11 00:37:15,744][25689] Fps is (10 sec: 5654.1, 60 sec: 5577.1, 300 sec: 5539.4). Total num frames: 983023616. Throughput: 0: 5698.7. Samples: 983026094. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:15,744][25689] Avg episode reward: [(0, '1.542')] [2022-07-11 00:37:17,344][26022] Updated weights on worker 0-0, policy_version 959992 (0.00087) [2022-07-11 00:37:19,023][26022] Updated weights on worker 0-0, policy_version 960002 (0.00088) [2022-07-11 00:37:19,154][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:37:19,169][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000960003_983043072.pth [2022-07-11 00:37:19,170][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000958055_981048320.pth [2022-07-11 00:37:20,759][25689] Fps is (10 sec: 5596.6, 60 sec: 5545.4, 300 sec: 5532.3). Total num frames: 983050240. Throughput: 0: 5714.9. Samples: 983059948. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:20,760][25689] Avg episode reward: [(0, '1.479')] [2022-07-11 00:37:21,075][26022] Updated weights on worker 0-0, policy_version 960012 (0.00093) [2022-07-11 00:37:22,856][26022] Updated weights on worker 0-0, policy_version 960022 (0.00095) [2022-07-11 00:37:24,850][26022] Updated weights on worker 0-0, policy_version 960032 (0.00095) [2022-07-11 00:37:25,780][25689] Fps is (10 sec: 5407.8, 60 sec: 5534.3, 300 sec: 5529.3). Total num frames: 983077888. Throughput: 0: 5804.1. Samples: 983076546. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:25,780][25689] Avg episode reward: [(0, '1.629')] [2022-07-11 00:37:26,435][26022] Updated weights on worker 0-0, policy_version 960042 (0.00087) [2022-07-11 00:37:28,431][26022] Updated weights on worker 0-0, policy_version 960052 (0.00093) [2022-07-11 00:37:30,261][26022] Updated weights on worker 0-0, policy_version 960062 (0.00088) [2022-07-11 00:37:30,829][25689] Fps is (10 sec: 5593.0, 60 sec: 5541.1, 300 sec: 5532.0). Total num frames: 983106560. Throughput: 0: 5821.5. Samples: 983109874. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:30,829][25689] Avg episode reward: [(0, '1.528')] [2022-07-11 00:37:32,170][26022] Updated weights on worker 0-0, policy_version 960072 (0.00090) [2022-07-11 00:37:33,933][26022] Updated weights on worker 0-0, policy_version 960082 (0.00083) [2022-07-11 00:37:35,712][26022] Updated weights on worker 0-0, policy_version 960092 (0.00091) [2022-07-11 00:37:35,843][25689] Fps is (10 sec: 5698.1, 60 sec: 5558.0, 300 sec: 5532.2). Total num frames: 983135232. Throughput: 0: 5837.3. Samples: 983143422. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:35,844][25689] Avg episode reward: [(0, '1.632')] [2022-07-11 00:37:37,697][26022] Updated weights on worker 0-0, policy_version 960102 (0.00080) [2022-07-11 00:37:39,405][26022] Updated weights on worker 0-0, policy_version 960112 (0.00095) [2022-07-11 00:37:40,943][25689] Fps is (10 sec: 5467.2, 60 sec: 5550.5, 300 sec: 5530.4). Total num frames: 983161856. Throughput: 0: 4977.2. Samples: 983160408. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 00:37:40,944][25689] Avg episode reward: [(0, '1.519')] [2022-07-11 00:37:41,248][26022] Updated weights on worker 0-0, policy_version 960122 (0.00093) [2022-07-11 00:37:43,175][26022] Updated weights on worker 0-0, policy_version 960132 (0.00092) [2022-07-11 00:37:44,871][26022] Updated weights on worker 0-0, policy_version 960142 (0.00086) [2022-07-11 00:37:45,947][25689] Fps is (10 sec: 5574.2, 60 sec: 5568.9, 300 sec: 5534.5). Total num frames: 983191552. Throughput: 0: 5821.1. Samples: 983193942. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:37:45,948][25689] Avg episode reward: [(0, '1.080')] [2022-07-11 00:37:46,924][26022] Updated weights on worker 0-0, policy_version 960152 (0.00087) [2022-07-11 00:37:48,518][26022] Updated weights on worker 0-0, policy_version 960162 (0.00084) [2022-07-11 00:37:50,460][26022] Updated weights on worker 0-0, policy_version 960172 (0.00086) [2022-07-11 00:37:51,068][25689] Fps is (10 sec: 5764.8, 60 sec: 5549.5, 300 sec: 5535.9). Total num frames: 983220224. Throughput: 0: 5812.2. Samples: 983227508. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:37:51,068][25689] Avg episode reward: [(0, '1.540')] [2022-07-11 00:37:52,199][26022] Updated weights on worker 0-0, policy_version 960182 (0.00089) [2022-07-11 00:37:54,047][26022] Updated weights on worker 0-0, policy_version 960192 (0.00087) [2022-07-11 00:37:55,731][26022] Updated weights on worker 0-0, policy_version 960202 (0.00087) [2022-07-11 00:37:56,071][25689] Fps is (10 sec: 5563.0, 60 sec: 5552.3, 300 sec: 5536.2). Total num frames: 983247872. Throughput: 0: 4993.7. Samples: 983244434. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:37:56,072][25689] Avg episode reward: [(0, '1.302')] [2022-07-11 00:37:57,656][26022] Updated weights on worker 0-0, policy_version 960212 (0.00097) [2022-07-11 00:37:59,610][26022] Updated weights on worker 0-0, policy_version 960222 (0.00111) [2022-07-11 00:38:01,094][25689] Fps is (10 sec: 5514.9, 60 sec: 5551.6, 300 sec: 5539.4). Total num frames: 983275520. Throughput: 0: 5836.2. Samples: 983278016. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:01,095][25689] Avg episode reward: [(0, '1.737')] [2022-07-11 00:38:01,434][26022] Updated weights on worker 0-0, policy_version 960232 (0.00087) [2022-07-11 00:38:03,616][26022] Updated weights on worker 0-0, policy_version 960242 (0.00373) [2022-07-11 00:38:05,582][26022] Updated weights on worker 0-0, policy_version 960252 (0.00087) [2022-07-11 00:38:06,188][25689] Fps is (10 sec: 5364.4, 60 sec: 5543.7, 300 sec: 5535.7). Total num frames: 983302144. Throughput: 0: 5714.9. Samples: 983309618. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:06,189][25689] Avg episode reward: [(0, '0.800')] [2022-07-11 00:38:07,215][26022] Updated weights on worker 0-0, policy_version 960262 (0.00095) [2022-07-11 00:38:09,155][26022] Updated weights on worker 0-0, policy_version 960272 (0.00098) [2022-07-11 00:38:11,014][26022] Updated weights on worker 0-0, policy_version 960282 (0.00091) [2022-07-11 00:38:11,247][25689] Fps is (10 sec: 5345.7, 60 sec: 5546.9, 300 sec: 5535.1). Total num frames: 983329792. Throughput: 0: 4896.0. Samples: 983326304. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:11,247][25689] Avg episode reward: [(0, '0.897')] [2022-07-11 00:38:12,919][26022] Updated weights on worker 0-0, policy_version 960292 (0.00087) [2022-07-11 00:38:14,581][26022] Updated weights on worker 0-0, policy_version 960302 (0.00086) [2022-07-11 00:38:16,338][25689] Fps is (10 sec: 5448.1, 60 sec: 5509.2, 300 sec: 5537.4). Total num frames: 983357440. Throughput: 0: 5671.4. Samples: 983359374. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:16,338][25689] Avg episode reward: [(0, '0.818')] [2022-07-11 00:38:16,487][26022] Updated weights on worker 0-0, policy_version 960312 (0.00083) [2022-07-11 00:38:18,574][26022] Updated weights on worker 0-0, policy_version 960322 (0.00091) [2022-07-11 00:38:20,136][26022] Updated weights on worker 0-0, policy_version 960332 (0.00092) [2022-07-11 00:38:21,343][25689] Fps is (10 sec: 5578.3, 60 sec: 5543.9, 300 sec: 5530.6). Total num frames: 983386112. Throughput: 0: 5673.8. Samples: 983392904. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:21,344][25689] Avg episode reward: [(0, '0.679')] [2022-07-11 00:38:21,967][26022] Updated weights on worker 0-0, policy_version 960342 (0.00081) [2022-07-11 00:38:23,778][26022] Updated weights on worker 0-0, policy_version 960352 (0.00095) [2022-07-11 00:38:25,735][26022] Updated weights on worker 0-0, policy_version 960362 (0.00091) [2022-07-11 00:38:26,389][25689] Fps is (10 sec: 5501.2, 60 sec: 5524.7, 300 sec: 5534.7). Total num frames: 983412736. Throughput: 0: 4954.6. Samples: 983409706. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:26,390][25689] Avg episode reward: [(0, '0.604')] [2022-07-11 00:38:27,672][26022] Updated weights on worker 0-0, policy_version 960372 (0.00088) [2022-07-11 00:38:29,535][26022] Updated weights on worker 0-0, policy_version 960382 (0.00094) [2022-07-11 00:38:31,319][26022] Updated weights on worker 0-0, policy_version 960392 (0.00082) [2022-07-11 00:38:31,458][25689] Fps is (10 sec: 5567.8, 60 sec: 5539.7, 300 sec: 5531.0). Total num frames: 983442432. Throughput: 0: 5765.6. Samples: 983442836. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:31,459][25689] Avg episode reward: [(0, '0.120')] [2022-07-11 00:38:33,115][26022] Updated weights on worker 0-0, policy_version 960402 (0.00086) [2022-07-11 00:38:35,291][26022] Updated weights on worker 0-0, policy_version 960413 (0.00085) [2022-07-11 00:38:36,550][25689] Fps is (10 sec: 5643.7, 60 sec: 5515.8, 300 sec: 5537.2). Total num frames: 983470080. Throughput: 0: 5782.1. Samples: 983476244. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:36,550][25689] Avg episode reward: [(0, '0.810')] [2022-07-11 00:38:36,949][26022] Updated weights on worker 0-0, policy_version 960423 (0.00087) [2022-07-11 00:38:38,907][26022] Updated weights on worker 0-0, policy_version 960433 (0.00090) [2022-07-11 00:38:40,547][26022] Updated weights on worker 0-0, policy_version 960443 (0.00098) [2022-07-11 00:38:41,579][25689] Fps is (10 sec: 5362.7, 60 sec: 5522.3, 300 sec: 5533.6). Total num frames: 983496704. Throughput: 0: 4949.1. Samples: 983493052. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:41,579][25689] Avg episode reward: [(0, '0.947')] [2022-07-11 00:38:42,586][26022] Updated weights on worker 0-0, policy_version 960453 (0.00091) [2022-07-11 00:38:44,519][26022] Updated weights on worker 0-0, policy_version 960463 (0.00086) [2022-07-11 00:38:46,346][26022] Updated weights on worker 0-0, policy_version 960473 (0.00087) [2022-07-11 00:38:46,611][25689] Fps is (10 sec: 5598.0, 60 sec: 5519.8, 300 sec: 5534.8). Total num frames: 983526400. Throughput: 0: 5756.4. Samples: 983526108. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:46,611][25689] Avg episode reward: [(0, '0.986')] [2022-07-11 00:38:48,096][26022] Updated weights on worker 0-0, policy_version 960483 (0.00086) [2022-07-11 00:38:50,083][26022] Updated weights on worker 0-0, policy_version 960493 (0.00081) [2022-07-11 00:38:51,621][26022] Updated weights on worker 0-0, policy_version 960503 (0.00085) [2022-07-11 00:38:51,730][25689] Fps is (10 sec: 5750.0, 60 sec: 5519.9, 300 sec: 5536.9). Total num frames: 983555072. Throughput: 0: 5740.3. Samples: 983559198. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:51,730][25689] Avg episode reward: [(0, '1.561')] [2022-07-11 00:38:53,752][26022] Updated weights on worker 0-0, policy_version 960513 (0.00090) [2022-07-11 00:38:55,563][26022] Updated weights on worker 0-0, policy_version 960523 (0.00084) [2022-07-11 00:38:56,764][25689] Fps is (10 sec: 5446.2, 60 sec: 5500.2, 300 sec: 5537.0). Total num frames: 983581696. Throughput: 0: 4919.7. Samples: 983575690. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:38:56,764][25689] Avg episode reward: [(0, '1.943')] [2022-07-11 00:38:57,346][26022] Updated weights on worker 0-0, policy_version 960533 (0.00083) [2022-07-11 00:38:59,298][26022] Updated weights on worker 0-0, policy_version 960543 (0.00083) [2022-07-11 00:39:00,986][26022] Updated weights on worker 0-0, policy_version 960553 (0.00093) [2022-07-11 00:39:01,851][25689] Fps is (10 sec: 5362.3, 60 sec: 5494.5, 300 sec: 5537.2). Total num frames: 983609344. Throughput: 0: 5730.4. Samples: 983609218. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:01,851][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 00:39:03,361][26022] Updated weights on worker 0-0, policy_version 960563 (0.00086) [2022-07-11 00:39:05,067][26022] Updated weights on worker 0-0, policy_version 960573 (0.00096) [2022-07-11 00:39:06,815][26022] Updated weights on worker 0-0, policy_version 960583 (0.00093) [2022-07-11 00:39:06,895][25689] Fps is (10 sec: 5458.2, 60 sec: 5515.8, 300 sec: 5537.8). Total num frames: 983636992. Throughput: 0: 5645.0. Samples: 983640612. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:06,895][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 00:39:08,708][26022] Updated weights on worker 0-0, policy_version 960593 (0.00084) [2022-07-11 00:39:10,568][26022] Updated weights on worker 0-0, policy_version 960603 (0.00084) [2022-07-11 00:39:12,052][25689] Fps is (10 sec: 5320.2, 60 sec: 5490.1, 300 sec: 5528.2). Total num frames: 983663616. Throughput: 0: 5638.5. Samples: 983673784. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:12,052][25689] Avg episode reward: [(0, '0.872')] [2022-07-11 00:39:12,446][26022] Updated weights on worker 0-0, policy_version 960613 (0.00086) [2022-07-11 00:39:14,276][26022] Updated weights on worker 0-0, policy_version 960623 (0.00360) [2022-07-11 00:39:16,158][26022] Updated weights on worker 0-0, policy_version 960633 (0.00090) [2022-07-11 00:39:17,065][25689] Fps is (10 sec: 5437.0, 60 sec: 5514.0, 300 sec: 5531.8). Total num frames: 983692288. Throughput: 0: 5664.9. Samples: 983690694. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:17,066][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 00:39:18,034][26022] Updated weights on worker 0-0, policy_version 960643 (0.00083) [2022-07-11 00:39:19,279][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:39:19,295][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000960649_983704576.pth [2022-07-11 00:39:19,295][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000958703_981711872.pth [2022-07-11 00:39:19,895][26022] Updated weights on worker 0-0, policy_version 960653 (0.00089) [2022-07-11 00:39:21,809][26022] Updated weights on worker 0-0, policy_version 960663 (0.00091) [2022-07-11 00:39:22,163][25689] Fps is (10 sec: 5671.4, 60 sec: 5505.6, 300 sec: 5531.0). Total num frames: 983720960. Throughput: 0: 5648.9. Samples: 983723960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:22,163][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 00:39:23,495][26022] Updated weights on worker 0-0, policy_version 960673 (0.00087) [2022-07-11 00:39:25,515][26022] Updated weights on worker 0-0, policy_version 960683 (0.00085) [2022-07-11 00:39:27,165][25689] Fps is (10 sec: 5474.6, 60 sec: 5509.5, 300 sec: 5528.4). Total num frames: 983747584. Throughput: 0: 5748.5. Samples: 983757138. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:27,167][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 00:39:27,320][26022] Updated weights on worker 0-0, policy_version 960693 (0.00084) [2022-07-11 00:39:29,271][26022] Updated weights on worker 0-0, policy_version 960703 (0.00082) [2022-07-11 00:39:31,032][26022] Updated weights on worker 0-0, policy_version 960713 (0.00091) [2022-07-11 00:39:32,272][25689] Fps is (10 sec: 5470.1, 60 sec: 5489.3, 300 sec: 5529.9). Total num frames: 983776256. Throughput: 0: 4945.4. Samples: 983773778. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:32,273][25689] Avg episode reward: [(0, '0.677')] [2022-07-11 00:39:32,875][26022] Updated weights on worker 0-0, policy_version 960723 (0.00089) [2022-07-11 00:39:34,718][26022] Updated weights on worker 0-0, policy_version 960733 (0.00081) [2022-07-11 00:39:36,675][26022] Updated weights on worker 0-0, policy_version 960743 (0.00092) [2022-07-11 00:39:37,295][25689] Fps is (10 sec: 5660.9, 60 sec: 5512.3, 300 sec: 5529.6). Total num frames: 983804928. Throughput: 0: 5761.3. Samples: 983807246. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:37,303][25689] Avg episode reward: [(0, '0.984')] [2022-07-11 00:39:38,297][26022] Updated weights on worker 0-0, policy_version 960753 (0.00087) [2022-07-11 00:39:40,403][26022] Updated weights on worker 0-0, policy_version 960763 (0.00084) [2022-07-11 00:39:41,976][26022] Updated weights on worker 0-0, policy_version 960773 (0.00083) [2022-07-11 00:39:42,350][25689] Fps is (10 sec: 5689.8, 60 sec: 5543.7, 300 sec: 5533.8). Total num frames: 983833600. Throughput: 0: 5781.1. Samples: 983840664. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:42,352][25689] Avg episode reward: [(0, '1.181')] [2022-07-11 00:39:43,947][26022] Updated weights on worker 0-0, policy_version 960783 (0.00084) [2022-07-11 00:39:45,660][26022] Updated weights on worker 0-0, policy_version 960793 (0.00092) [2022-07-11 00:39:47,414][25689] Fps is (10 sec: 5464.6, 60 sec: 5490.2, 300 sec: 5527.6). Total num frames: 983860224. Throughput: 0: 4953.9. Samples: 983857448. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:47,416][25689] Avg episode reward: [(0, '0.291')] [2022-07-11 00:39:47,710][26022] Updated weights on worker 0-0, policy_version 960803 (0.00087) [2022-07-11 00:39:49,441][26022] Updated weights on worker 0-0, policy_version 960813 (0.00087) [2022-07-11 00:39:51,336][26022] Updated weights on worker 0-0, policy_version 960823 (0.00090) [2022-07-11 00:39:52,485][25689] Fps is (10 sec: 5456.0, 60 sec: 5494.6, 300 sec: 5526.8). Total num frames: 983888896. Throughput: 0: 5787.2. Samples: 983890756. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:52,485][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 00:39:52,864][26022] Updated weights on worker 0-0, policy_version 960833 (0.00089) [2022-07-11 00:39:55,102][26022] Updated weights on worker 0-0, policy_version 960843 (0.00092) [2022-07-11 00:39:56,720][26022] Updated weights on worker 0-0, policy_version 960853 (0.00084) [2022-07-11 00:39:57,492][25689] Fps is (10 sec: 5487.1, 60 sec: 5497.0, 300 sec: 5523.9). Total num frames: 983915520. Throughput: 0: 5783.1. Samples: 983924044. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:39:57,492][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 00:39:58,544][26022] Updated weights on worker 0-0, policy_version 960863 (0.00093) [2022-07-11 00:40:00,641][26022] Updated weights on worker 0-0, policy_version 960873 (0.00094) [2022-07-11 00:40:02,503][25689] Fps is (10 sec: 5417.6, 60 sec: 5503.9, 300 sec: 5531.5). Total num frames: 983943168. Throughput: 0: 4960.9. Samples: 983940640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:02,504][25689] Avg episode reward: [(0, '0.902')] [2022-07-11 00:40:02,719][26022] Updated weights on worker 0-0, policy_version 960883 (0.00089) [2022-07-11 00:40:04,644][26022] Updated weights on worker 0-0, policy_version 960893 (0.00097) [2022-07-11 00:40:06,385][26022] Updated weights on worker 0-0, policy_version 960903 (0.00090) [2022-07-11 00:40:07,517][25689] Fps is (10 sec: 5515.7, 60 sec: 5506.6, 300 sec: 5529.8). Total num frames: 983970816. Throughput: 0: 5696.6. Samples: 983971966. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:07,519][25689] Avg episode reward: [(0, '-0.072')] [2022-07-11 00:40:08,329][26022] Updated weights on worker 0-0, policy_version 960913 (0.00094) [2022-07-11 00:40:10,271][26022] Updated weights on worker 0-0, policy_version 960923 (0.00086) [2022-07-11 00:40:11,830][26022] Updated weights on worker 0-0, policy_version 960933 (0.00087) [2022-07-11 00:40:12,573][25689] Fps is (10 sec: 5491.1, 60 sec: 5532.8, 300 sec: 5523.0). Total num frames: 983998464. Throughput: 0: 5709.0. Samples: 984005438. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:12,573][25689] Avg episode reward: [(0, '-0.184')] [2022-07-11 00:40:13,732][26022] Updated weights on worker 0-0, policy_version 960943 (0.00084) [2022-07-11 00:40:15,764][26022] Updated weights on worker 0-0, policy_version 960953 (0.00091) [2022-07-11 00:40:17,496][26022] Updated weights on worker 0-0, policy_version 960963 (0.00099) [2022-07-11 00:40:17,580][25689] Fps is (10 sec: 5494.8, 60 sec: 5516.3, 300 sec: 5526.9). Total num frames: 984026112. Throughput: 0: 4874.7. Samples: 984021970. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:17,581][25689] Avg episode reward: [(0, '0.894')] [2022-07-11 00:40:19,469][26022] Updated weights on worker 0-0, policy_version 960973 (0.00083) [2022-07-11 00:40:21,151][26022] Updated weights on worker 0-0, policy_version 960983 (0.00089) [2022-07-11 00:40:22,610][25689] Fps is (10 sec: 5509.1, 60 sec: 5505.7, 300 sec: 5523.0). Total num frames: 984053760. Throughput: 0: 5696.6. Samples: 984055182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:22,610][25689] Avg episode reward: [(0, '0.057')] [2022-07-11 00:40:23,151][26022] Updated weights on worker 0-0, policy_version 960993 (0.00099) [2022-07-11 00:40:25,107][26022] Updated weights on worker 0-0, policy_version 961003 (0.00091) [2022-07-11 00:40:26,775][26022] Updated weights on worker 0-0, policy_version 961013 (0.00087) [2022-07-11 00:40:27,625][25689] Fps is (10 sec: 5504.8, 60 sec: 5521.4, 300 sec: 5524.1). Total num frames: 984081408. Throughput: 0: 5794.1. Samples: 984088476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:27,626][25689] Avg episode reward: [(0, '0.007')] [2022-07-11 00:40:28,692][26022] Updated weights on worker 0-0, policy_version 961023 (0.00098) [2022-07-11 00:40:30,453][26022] Updated weights on worker 0-0, policy_version 961033 (0.00094) [2022-07-11 00:40:32,196][26022] Updated weights on worker 0-0, policy_version 961043 (0.00090) [2022-07-11 00:40:32,744][25689] Fps is (10 sec: 5557.2, 60 sec: 5520.2, 300 sec: 5522.2). Total num frames: 984110080. Throughput: 0: 4942.1. Samples: 984105130. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:32,745][25689] Avg episode reward: [(0, '0.806')] [2022-07-11 00:40:34,276][26022] Updated weights on worker 0-0, policy_version 961053 (0.00084) [2022-07-11 00:40:35,894][26022] Updated weights on worker 0-0, policy_version 961063 (0.00082) [2022-07-11 00:40:37,749][25689] Fps is (10 sec: 5462.0, 60 sec: 5488.1, 300 sec: 5522.2). Total num frames: 984136704. Throughput: 0: 5778.8. Samples: 984138520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:37,750][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 00:40:37,956][26022] Updated weights on worker 0-0, policy_version 961073 (0.00084) [2022-07-11 00:40:39,453][26022] Updated weights on worker 0-0, policy_version 961083 (0.00089) [2022-07-11 00:40:41,603][26022] Updated weights on worker 0-0, policy_version 961093 (0.00089) [2022-07-11 00:40:42,776][25689] Fps is (10 sec: 5614.4, 60 sec: 5507.6, 300 sec: 5529.5). Total num frames: 984166400. Throughput: 0: 5801.2. Samples: 984172168. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:42,776][25689] Avg episode reward: [(0, '0.789')] [2022-07-11 00:40:43,213][26022] Updated weights on worker 0-0, policy_version 961103 (0.00090) [2022-07-11 00:40:45,230][26022] Updated weights on worker 0-0, policy_version 961113 (0.00090) [2022-07-11 00:40:46,968][26022] Updated weights on worker 0-0, policy_version 961123 (0.00085) [2022-07-11 00:40:47,826][25689] Fps is (10 sec: 5690.5, 60 sec: 5525.8, 300 sec: 5523.9). Total num frames: 984194048. Throughput: 0: 4967.1. Samples: 984188816. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:47,827][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 00:40:48,943][26022] Updated weights on worker 0-0, policy_version 961133 (0.00087) [2022-07-11 00:40:50,648][26022] Updated weights on worker 0-0, policy_version 961143 (0.00091) [2022-07-11 00:40:52,651][26022] Updated weights on worker 0-0, policy_version 961153 (0.00086) [2022-07-11 00:40:52,888][25689] Fps is (10 sec: 5468.0, 60 sec: 5509.6, 300 sec: 5527.4). Total num frames: 984221696. Throughput: 0: 5818.4. Samples: 984222336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:52,889][25689] Avg episode reward: [(0, '1.572')] [2022-07-11 00:40:54,213][26022] Updated weights on worker 0-0, policy_version 961163 (0.00088) [2022-07-11 00:40:56,199][26022] Updated weights on worker 0-0, policy_version 961173 (0.00095) [2022-07-11 00:40:57,885][26022] Updated weights on worker 0-0, policy_version 961183 (0.00087) [2022-07-11 00:40:57,984][25689] Fps is (10 sec: 5645.6, 60 sec: 5552.4, 300 sec: 5529.2). Total num frames: 984251392. Throughput: 0: 5803.7. Samples: 984255954. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:40:57,984][25689] Avg episode reward: [(0, '1.791')] [2022-07-11 00:40:59,851][26022] Updated weights on worker 0-0, policy_version 961193 (0.00097) [2022-07-11 00:41:02,255][26022] Updated weights on worker 0-0, policy_version 961203 (0.00085) [2022-07-11 00:41:03,036][25689] Fps is (10 sec: 5449.0, 60 sec: 5514.7, 300 sec: 5529.1). Total num frames: 984276992. Throughput: 0: 4961.3. Samples: 984272686. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:03,038][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 00:41:04,018][26022] Updated weights on worker 0-0, policy_version 961213 (0.00074) [2022-07-11 00:41:05,815][26022] Updated weights on worker 0-0, policy_version 961223 (0.00094) [2022-07-11 00:41:07,646][26022] Updated weights on worker 0-0, policy_version 961233 (0.00084) [2022-07-11 00:41:08,129][25689] Fps is (10 sec: 5248.7, 60 sec: 5507.6, 300 sec: 5524.9). Total num frames: 984304640. Throughput: 0: 5675.1. Samples: 984304034. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:08,129][25689] Avg episode reward: [(0, '1.575')] [2022-07-11 00:41:09,263][26022] Updated weights on worker 0-0, policy_version 961243 (0.00093) [2022-07-11 00:41:11,549][26022] Updated weights on worker 0-0, policy_version 961253 (0.00086) [2022-07-11 00:41:13,001][26022] Updated weights on worker 0-0, policy_version 961263 (0.00094) [2022-07-11 00:41:13,261][25689] Fps is (10 sec: 5608.2, 60 sec: 5534.3, 300 sec: 5530.3). Total num frames: 984334336. Throughput: 0: 5657.6. Samples: 984337598. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:13,263][25689] Avg episode reward: [(0, '1.973')] [2022-07-11 00:41:14,998][26022] Updated weights on worker 0-0, policy_version 961273 (0.00112) [2022-07-11 00:41:16,915][26022] Updated weights on worker 0-0, policy_version 961283 (0.00084) [2022-07-11 00:41:18,290][25689] Fps is (10 sec: 5542.7, 60 sec: 5515.6, 300 sec: 5523.6). Total num frames: 984360960. Throughput: 0: 5654.6. Samples: 984370776. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:18,290][25689] Avg episode reward: [(0, '1.806')] [2022-07-11 00:41:18,577][26022] Updated weights on worker 0-0, policy_version 961293 (0.00088) [2022-07-11 00:41:19,647][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:41:19,664][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000961298_984369152.pth [2022-07-11 00:41:19,665][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000959353_982377472.pth [2022-07-11 00:41:20,544][26022] Updated weights on worker 0-0, policy_version 961303 (0.00087) [2022-07-11 00:41:22,240][26022] Updated weights on worker 0-0, policy_version 961313 (0.00084) [2022-07-11 00:41:23,303][25689] Fps is (10 sec: 5506.8, 60 sec: 5533.9, 300 sec: 5525.0). Total num frames: 984389632. Throughput: 0: 5672.8. Samples: 984387654. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:23,304][25689] Avg episode reward: [(0, '1.726')] [2022-07-11 00:41:24,256][26022] Updated weights on worker 0-0, policy_version 961323 (0.00084) [2022-07-11 00:41:26,112][26022] Updated weights on worker 0-0, policy_version 961333 (0.00216) [2022-07-11 00:41:27,844][26022] Updated weights on worker 0-0, policy_version 961343 (0.00089) [2022-07-11 00:41:28,314][25689] Fps is (10 sec: 5618.4, 60 sec: 5534.4, 300 sec: 5523.6). Total num frames: 984417280. Throughput: 0: 5804.2. Samples: 984421194. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:28,314][25689] Avg episode reward: [(0, '1.173')] [2022-07-11 00:41:29,873][26022] Updated weights on worker 0-0, policy_version 961353 (0.00086) [2022-07-11 00:41:31,440][26022] Updated weights on worker 0-0, policy_version 961363 (0.00084) [2022-07-11 00:41:33,397][25689] Fps is (10 sec: 5579.7, 60 sec: 5537.7, 300 sec: 5525.8). Total num frames: 984445952. Throughput: 0: 5798.1. Samples: 984454344. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 00:41:33,397][25689] Avg episode reward: [(0, '1.352')] [2022-07-11 00:41:33,404][26022] Updated weights on worker 0-0, policy_version 961373 (0.00088) [2022-07-11 00:41:35,273][26022] Updated weights on worker 0-0, policy_version 961383 (0.00088) [2022-07-11 00:41:36,965][26022] Updated weights on worker 0-0, policy_version 961393 (0.00089) [2022-07-11 00:41:38,495][25689] Fps is (10 sec: 5531.9, 60 sec: 5546.0, 300 sec: 5527.7). Total num frames: 984473600. Throughput: 0: 4977.0. Samples: 984471338. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:41:38,495][25689] Avg episode reward: [(0, '1.124')] [2022-07-11 00:41:38,903][26022] Updated weights on worker 0-0, policy_version 961403 (0.00094) [2022-07-11 00:41:40,522][26022] Updated weights on worker 0-0, policy_version 961413 (0.00083) [2022-07-11 00:41:42,612][26022] Updated weights on worker 0-0, policy_version 961423 (0.00086) [2022-07-11 00:41:43,527][25689] Fps is (10 sec: 5559.6, 60 sec: 5528.7, 300 sec: 5527.5). Total num frames: 984502272. Throughput: 0: 5796.7. Samples: 984504886. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:41:43,527][25689] Avg episode reward: [(0, '1.107')] [2022-07-11 00:41:44,322][26022] Updated weights on worker 0-0, policy_version 961433 (0.00083) [2022-07-11 00:41:46,284][26022] Updated weights on worker 0-0, policy_version 961443 (0.00086) [2022-07-11 00:41:47,981][26022] Updated weights on worker 0-0, policy_version 961453 (0.00084) [2022-07-11 00:41:48,543][25689] Fps is (10 sec: 5605.1, 60 sec: 5531.8, 300 sec: 5522.1). Total num frames: 984529920. Throughput: 0: 5787.2. Samples: 984538262. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:41:48,543][25689] Avg episode reward: [(0, '0.785')] [2022-07-11 00:41:49,966][26022] Updated weights on worker 0-0, policy_version 961463 (0.00088) [2022-07-11 00:41:51,782][26022] Updated weights on worker 0-0, policy_version 961473 (0.00087) [2022-07-11 00:41:53,565][26022] Updated weights on worker 0-0, policy_version 961483 (0.00088) [2022-07-11 00:41:53,631][25689] Fps is (10 sec: 5573.8, 60 sec: 5546.3, 300 sec: 5524.5). Total num frames: 984558592. Throughput: 0: 4976.6. Samples: 984555044. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:41:53,632][25689] Avg episode reward: [(0, '1.446')] [2022-07-11 00:41:55,467][26022] Updated weights on worker 0-0, policy_version 961493 (0.00086) [2022-07-11 00:41:56,995][26022] Updated weights on worker 0-0, policy_version 961503 (0.00092) [2022-07-11 00:41:58,647][25689] Fps is (10 sec: 5574.1, 60 sec: 5519.8, 300 sec: 5524.5). Total num frames: 984586240. Throughput: 0: 5840.0. Samples: 984589026. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:41:58,647][25689] Avg episode reward: [(0, '1.447')] [2022-07-11 00:41:59,013][26022] Updated weights on worker 0-0, policy_version 961513 (0.00080) [2022-07-11 00:42:00,799][26022] Updated weights on worker 0-0, policy_version 961523 (0.00091) [2022-07-11 00:42:02,923][26022] Updated weights on worker 0-0, policy_version 961533 (0.00088) [2022-07-11 00:42:03,667][25689] Fps is (10 sec: 5510.1, 60 sec: 5556.6, 300 sec: 5527.7). Total num frames: 984613888. Throughput: 0: 5768.2. Samples: 984621056. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:03,667][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 00:42:04,996][26022] Updated weights on worker 0-0, policy_version 961543 (0.00091) [2022-07-11 00:42:06,456][26022] Updated weights on worker 0-0, policy_version 961553 (0.00094) [2022-07-11 00:42:08,616][26022] Updated weights on worker 0-0, policy_version 961563 (0.00084) [2022-07-11 00:42:08,668][25689] Fps is (10 sec: 5415.9, 60 sec: 5548.0, 300 sec: 5526.0). Total num frames: 984640512. Throughput: 0: 4952.1. Samples: 984637922. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:08,668][25689] Avg episode reward: [(0, '0.908')] [2022-07-11 00:42:10,239][26022] Updated weights on worker 0-0, policy_version 961573 (0.00089) [2022-07-11 00:42:12,044][26022] Updated weights on worker 0-0, policy_version 961583 (0.00094) [2022-07-11 00:42:13,743][25689] Fps is (10 sec: 5589.3, 60 sec: 5553.3, 300 sec: 5525.5). Total num frames: 984670208. Throughput: 0: 5786.7. Samples: 984671424. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:13,743][25689] Avg episode reward: [(0, '0.759')] [2022-07-11 00:42:13,933][26022] Updated weights on worker 0-0, policy_version 961593 (0.00092) [2022-07-11 00:42:15,896][26022] Updated weights on worker 0-0, policy_version 961603 (0.00094) [2022-07-11 00:42:17,706][26022] Updated weights on worker 0-0, policy_version 961613 (0.00091) [2022-07-11 00:42:18,757][25689] Fps is (10 sec: 5785.1, 60 sec: 5588.5, 300 sec: 5532.4). Total num frames: 984698880. Throughput: 0: 5758.2. Samples: 984704824. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:18,757][25689] Avg episode reward: [(0, '0.860')] [2022-07-11 00:42:19,561][26022] Updated weights on worker 0-0, policy_version 961623 (0.00083) [2022-07-11 00:42:21,177][26022] Updated weights on worker 0-0, policy_version 961633 (0.00086) [2022-07-11 00:42:23,388][26022] Updated weights on worker 0-0, policy_version 961643 (0.00087) [2022-07-11 00:42:23,806][25689] Fps is (10 sec: 5393.1, 60 sec: 5534.4, 300 sec: 5525.0). Total num frames: 984724480. Throughput: 0: 4998.3. Samples: 984721720. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:23,807][25689] Avg episode reward: [(0, '0.859')] [2022-07-11 00:42:24,798][26022] Updated weights on worker 0-0, policy_version 961653 (0.00095) [2022-07-11 00:42:26,860][26022] Updated weights on worker 0-0, policy_version 961663 (0.00078) [2022-07-11 00:42:28,511][26022] Updated weights on worker 0-0, policy_version 961673 (0.00096) [2022-07-11 00:42:28,812][25689] Fps is (10 sec: 5499.3, 60 sec: 5568.7, 300 sec: 5529.2). Total num frames: 984754176. Throughput: 0: 5832.4. Samples: 984755412. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:28,812][25689] Avg episode reward: [(0, '0.734')] [2022-07-11 00:42:30,531][26022] Updated weights on worker 0-0, policy_version 961683 (0.00090) [2022-07-11 00:42:32,456][26022] Updated weights on worker 0-0, policy_version 961693 (0.00085) [2022-07-11 00:42:33,904][25689] Fps is (10 sec: 5780.3, 60 sec: 5567.8, 300 sec: 5527.8). Total num frames: 984782848. Throughput: 0: 5820.7. Samples: 984788776. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:33,904][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 00:42:34,115][26022] Updated weights on worker 0-0, policy_version 961703 (0.00082) [2022-07-11 00:42:35,930][26022] Updated weights on worker 0-0, policy_version 961713 (0.00096) [2022-07-11 00:42:37,679][26022] Updated weights on worker 0-0, policy_version 961723 (0.00085) [2022-07-11 00:42:38,927][25689] Fps is (10 sec: 5466.4, 60 sec: 5557.8, 300 sec: 5529.2). Total num frames: 984809472. Throughput: 0: 4997.3. Samples: 984805622. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:38,928][25689] Avg episode reward: [(0, '0.707')] [2022-07-11 00:42:39,615][26022] Updated weights on worker 0-0, policy_version 961733 (0.00075) [2022-07-11 00:42:41,654][26022] Updated weights on worker 0-0, policy_version 961743 (0.00092) [2022-07-11 00:42:43,263][26022] Updated weights on worker 0-0, policy_version 961753 (0.00087) [2022-07-11 00:42:43,975][25689] Fps is (10 sec: 5694.0, 60 sec: 5590.2, 300 sec: 5531.8). Total num frames: 984840192. Throughput: 0: 5832.9. Samples: 984839362. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:43,975][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 00:42:44,935][26022] Updated weights on worker 0-0, policy_version 961763 (0.00097) [2022-07-11 00:42:46,986][26022] Updated weights on worker 0-0, policy_version 961773 (0.00085) [2022-07-11 00:42:48,788][26022] Updated weights on worker 0-0, policy_version 961783 (0.00087) [2022-07-11 00:42:48,979][25689] Fps is (10 sec: 5705.0, 60 sec: 5574.4, 300 sec: 5527.1). Total num frames: 984866816. Throughput: 0: 5825.1. Samples: 984872888. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:48,979][25689] Avg episode reward: [(0, '0.796')] [2022-07-11 00:42:50,693][26022] Updated weights on worker 0-0, policy_version 961793 (0.00086) [2022-07-11 00:42:52,289][26022] Updated weights on worker 0-0, policy_version 961803 (0.00093) [2022-07-11 00:42:54,089][25689] Fps is (10 sec: 5366.0, 60 sec: 5555.5, 300 sec: 5525.1). Total num frames: 984894464. Throughput: 0: 4996.5. Samples: 984889630. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:54,090][25689] Avg episode reward: [(0, '0.823')] [2022-07-11 00:42:54,254][26022] Updated weights on worker 0-0, policy_version 961813 (0.00084) [2022-07-11 00:42:56,129][26022] Updated weights on worker 0-0, policy_version 961823 (0.00095) [2022-07-11 00:42:57,732][26022] Updated weights on worker 0-0, policy_version 961833 (0.00084) [2022-07-11 00:42:59,100][25689] Fps is (10 sec: 5564.9, 60 sec: 5572.9, 300 sec: 5528.8). Total num frames: 984923136. Throughput: 0: 5847.3. Samples: 984923574. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:42:59,100][25689] Avg episode reward: [(0, '1.184')] [2022-07-11 00:42:59,771][26022] Updated weights on worker 0-0, policy_version 961843 (0.00085) [2022-07-11 00:43:01,343][26022] Updated weights on worker 0-0, policy_version 961853 (0.00089) [2022-07-11 00:43:03,904][26022] Updated weights on worker 0-0, policy_version 961863 (0.00086) [2022-07-11 00:43:04,120][25689] Fps is (10 sec: 5308.3, 60 sec: 5522.0, 300 sec: 5523.3). Total num frames: 984947712. Throughput: 0: 5755.8. Samples: 984955314. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:04,121][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 00:43:05,372][26022] Updated weights on worker 0-0, policy_version 961873 (0.00095) [2022-07-11 00:43:07,744][26022] Updated weights on worker 0-0, policy_version 961883 (0.00095) [2022-07-11 00:43:09,128][25689] Fps is (10 sec: 5514.0, 60 sec: 5589.1, 300 sec: 5534.6). Total num frames: 984978432. Throughput: 0: 4915.2. Samples: 984971924. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:09,128][25689] Avg episode reward: [(0, '1.770')] [2022-07-11 00:43:09,133][26022] Updated weights on worker 0-0, policy_version 961893 (0.00091) [2022-07-11 00:43:11,408][26022] Updated weights on worker 0-0, policy_version 961903 (0.00093) [2022-07-11 00:43:12,752][26022] Updated weights on worker 0-0, policy_version 961913 (0.00086) [2022-07-11 00:43:14,170][25689] Fps is (10 sec: 5706.0, 60 sec: 5541.4, 300 sec: 5532.0). Total num frames: 985005056. Throughput: 0: 5765.8. Samples: 985005412. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:14,170][25689] Avg episode reward: [(0, '1.686')] [2022-07-11 00:43:14,857][26022] Updated weights on worker 0-0, policy_version 961923 (0.00082) [2022-07-11 00:43:16,605][26022] Updated weights on worker 0-0, policy_version 961933 (0.00089) [2022-07-11 00:43:18,551][26022] Updated weights on worker 0-0, policy_version 961943 (0.00090) [2022-07-11 00:43:19,179][25689] Fps is (10 sec: 5399.3, 60 sec: 5524.8, 300 sec: 5528.5). Total num frames: 985032704. Throughput: 0: 5741.5. Samples: 985038862. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:19,181][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 00:43:19,800][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:43:19,814][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000961950_985036800.pth [2022-07-11 00:43:19,814][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000960003_983043072.pth [2022-07-11 00:43:20,249][26022] Updated weights on worker 0-0, policy_version 961953 (0.00072) [2022-07-11 00:43:22,298][26022] Updated weights on worker 0-0, policy_version 961963 (0.00088) [2022-07-11 00:43:23,811][26022] Updated weights on worker 0-0, policy_version 961973 (0.00090) [2022-07-11 00:43:24,190][25689] Fps is (10 sec: 5620.9, 60 sec: 5579.3, 300 sec: 5536.1). Total num frames: 985061376. Throughput: 0: 5001.8. Samples: 985055698. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:24,191][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 00:43:26,138][26022] Updated weights on worker 0-0, policy_version 961983 (0.00090) [2022-07-11 00:43:27,683][26022] Updated weights on worker 0-0, policy_version 961993 (0.00084) [2022-07-11 00:43:29,196][25689] Fps is (10 sec: 5520.5, 60 sec: 5528.4, 300 sec: 5526.9). Total num frames: 985088000. Throughput: 0: 5820.9. Samples: 985088738. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:29,196][25689] Avg episode reward: [(0, '0.435')] [2022-07-11 00:43:29,598][26022] Updated weights on worker 0-0, policy_version 962003 (0.00089) [2022-07-11 00:43:31,473][26022] Updated weights on worker 0-0, policy_version 962013 (0.00088) [2022-07-11 00:43:33,294][26022] Updated weights on worker 0-0, policy_version 962023 (0.00095) [2022-07-11 00:43:34,321][25689] Fps is (10 sec: 5356.6, 60 sec: 5508.4, 300 sec: 5526.3). Total num frames: 985115648. Throughput: 0: 5793.2. Samples: 985122154. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:34,323][25689] Avg episode reward: [(0, '0.179')] [2022-07-11 00:43:35,066][26022] Updated weights on worker 0-0, policy_version 962033 (0.00087) [2022-07-11 00:43:37,045][26022] Updated weights on worker 0-0, policy_version 962043 (0.00092) [2022-07-11 00:43:38,550][26022] Updated weights on worker 0-0, policy_version 962053 (0.00094) [2022-07-11 00:43:39,340][25689] Fps is (10 sec: 5653.2, 60 sec: 5559.7, 300 sec: 5536.8). Total num frames: 985145344. Throughput: 0: 4962.6. Samples: 985138910. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:39,340][25689] Avg episode reward: [(0, '0.098')] [2022-07-11 00:43:40,790][26022] Updated weights on worker 0-0, policy_version 962063 (0.00095) [2022-07-11 00:43:42,286][26022] Updated weights on worker 0-0, policy_version 962073 (0.00088) [2022-07-11 00:43:44,366][25689] Fps is (10 sec: 5607.1, 60 sec: 5493.8, 300 sec: 5526.6). Total num frames: 985171968. Throughput: 0: 5778.5. Samples: 985172286. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:44,366][25689] Avg episode reward: [(0, '0.117')] [2022-07-11 00:43:44,407][26022] Updated weights on worker 0-0, policy_version 962083 (0.00088) [2022-07-11 00:43:46,168][26022] Updated weights on worker 0-0, policy_version 962093 (0.00089) [2022-07-11 00:43:47,834][26022] Updated weights on worker 0-0, policy_version 962103 (0.00090) [2022-07-11 00:43:49,433][25689] Fps is (10 sec: 5478.4, 60 sec: 5521.9, 300 sec: 5527.5). Total num frames: 985200640. Throughput: 0: 5773.8. Samples: 985205582. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:49,435][25689] Avg episode reward: [(0, '-0.293')] [2022-07-11 00:43:50,067][26022] Updated weights on worker 0-0, policy_version 962113 (0.00082) [2022-07-11 00:43:51,823][26022] Updated weights on worker 0-0, policy_version 962123 (0.00090) [2022-07-11 00:43:53,676][26022] Updated weights on worker 0-0, policy_version 962133 (0.00089) [2022-07-11 00:43:54,497][25689] Fps is (10 sec: 5761.2, 60 sec: 5560.1, 300 sec: 5537.3). Total num frames: 985230336. Throughput: 0: 4942.8. Samples: 985221878. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:54,498][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 00:43:55,606][26022] Updated weights on worker 0-0, policy_version 962143 (0.00086) [2022-07-11 00:43:57,149][26022] Updated weights on worker 0-0, policy_version 962153 (0.00086) [2022-07-11 00:43:59,308][26022] Updated weights on worker 0-0, policy_version 962163 (0.00093) [2022-07-11 00:43:59,502][25689] Fps is (10 sec: 5390.2, 60 sec: 5492.8, 300 sec: 5528.5). Total num frames: 985254912. Throughput: 0: 5784.1. Samples: 985255530. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:43:59,502][25689] Avg episode reward: [(0, '0.469')] [2022-07-11 00:44:00,808][26022] Updated weights on worker 0-0, policy_version 962173 (0.00093) [2022-07-11 00:44:03,082][26022] Updated weights on worker 0-0, policy_version 962183 (0.00088) [2022-07-11 00:44:04,503][25689] Fps is (10 sec: 5219.2, 60 sec: 5545.4, 300 sec: 5529.3). Total num frames: 985282560. Throughput: 0: 5689.2. Samples: 985286852. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:04,504][25689] Avg episode reward: [(0, '0.369')] [2022-07-11 00:44:05,015][26022] Updated weights on worker 0-0, policy_version 962193 (0.00085) [2022-07-11 00:44:06,787][26022] Updated weights on worker 0-0, policy_version 962203 (0.00098) [2022-07-11 00:44:08,667][26022] Updated weights on worker 0-0, policy_version 962213 (0.00095) [2022-07-11 00:44:09,505][25689] Fps is (10 sec: 5630.2, 60 sec: 5512.0, 300 sec: 5539.1). Total num frames: 985311232. Throughput: 0: 4884.5. Samples: 985303624. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:09,505][25689] Avg episode reward: [(0, '0.736')] [2022-07-11 00:44:10,431][26022] Updated weights on worker 0-0, policy_version 962223 (0.00081) [2022-07-11 00:44:12,285][26022] Updated weights on worker 0-0, policy_version 962233 (0.00089) [2022-07-11 00:44:14,262][26022] Updated weights on worker 0-0, policy_version 962243 (0.00090) [2022-07-11 00:44:14,570][25689] Fps is (10 sec: 5594.4, 60 sec: 5526.8, 300 sec: 5534.7). Total num frames: 985338880. Throughput: 0: 5755.3. Samples: 985337406. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:14,571][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 00:44:15,877][26022] Updated weights on worker 0-0, policy_version 962253 (0.00090) [2022-07-11 00:44:17,702][26022] Updated weights on worker 0-0, policy_version 962263 (0.00088) [2022-07-11 00:44:19,597][25689] Fps is (10 sec: 5478.9, 60 sec: 5525.2, 300 sec: 5532.5). Total num frames: 985366528. Throughput: 0: 5751.4. Samples: 985371108. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:19,598][25689] Avg episode reward: [(0, '-0.706')] [2022-07-11 00:44:19,617][26022] Updated weights on worker 0-0, policy_version 962273 (0.00092) [2022-07-11 00:44:21,550][26022] Updated weights on worker 0-0, policy_version 962283 (0.00089) [2022-07-11 00:44:23,246][26022] Updated weights on worker 0-0, policy_version 962293 (0.00605) [2022-07-11 00:44:24,624][25689] Fps is (10 sec: 5500.2, 60 sec: 5506.8, 300 sec: 5535.5). Total num frames: 985394176. Throughput: 0: 5018.7. Samples: 985387832. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:24,625][25689] Avg episode reward: [(0, '-0.472')] [2022-07-11 00:44:25,352][26022] Updated weights on worker 0-0, policy_version 962303 (0.00083) [2022-07-11 00:44:26,903][26022] Updated weights on worker 0-0, policy_version 962313 (0.00092) [2022-07-11 00:44:29,044][26022] Updated weights on worker 0-0, policy_version 962323 (0.00058) [2022-07-11 00:44:29,638][25689] Fps is (10 sec: 5609.3, 60 sec: 5540.0, 300 sec: 5537.3). Total num frames: 985422848. Throughput: 0: 5834.9. Samples: 985421098. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:29,638][25689] Avg episode reward: [(0, '-0.535')] [2022-07-11 00:44:30,638][26022] Updated weights on worker 0-0, policy_version 962333 (0.00094) [2022-07-11 00:44:32,788][26022] Updated weights on worker 0-0, policy_version 962343 (0.00092) [2022-07-11 00:44:34,448][26022] Updated weights on worker 0-0, policy_version 962353 (0.00093) [2022-07-11 00:44:34,677][25689] Fps is (10 sec: 5704.3, 60 sec: 5564.9, 300 sec: 5537.0). Total num frames: 985451520. Throughput: 0: 5820.2. Samples: 985454428. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:34,677][25689] Avg episode reward: [(0, '-0.676')] [2022-07-11 00:44:36,347][26022] Updated weights on worker 0-0, policy_version 962363 (0.00082) [2022-07-11 00:44:37,999][26022] Updated weights on worker 0-0, policy_version 962373 (0.00100) [2022-07-11 00:44:39,682][25689] Fps is (10 sec: 5505.1, 60 sec: 5515.2, 300 sec: 5531.0). Total num frames: 985478144. Throughput: 0: 5827.2. Samples: 985488146. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:39,683][25689] Avg episode reward: [(0, '-0.666')] [2022-07-11 00:44:40,142][26022] Updated weights on worker 0-0, policy_version 962383 (0.00097) [2022-07-11 00:44:41,567][26022] Updated weights on worker 0-0, policy_version 962393 (0.00084) [2022-07-11 00:44:43,758][26022] Updated weights on worker 0-0, policy_version 962403 (0.00084) [2022-07-11 00:44:44,715][25689] Fps is (10 sec: 5508.6, 60 sec: 5548.5, 300 sec: 5538.5). Total num frames: 985506816. Throughput: 0: 5838.1. Samples: 985505124. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:44,715][25689] Avg episode reward: [(0, '0.089')] [2022-07-11 00:44:45,187][26022] Updated weights on worker 0-0, policy_version 962413 (0.00081) [2022-07-11 00:44:47,135][26022] Updated weights on worker 0-0, policy_version 962423 (0.00080) [2022-07-11 00:44:48,870][26022] Updated weights on worker 0-0, policy_version 962433 (0.00086) [2022-07-11 00:44:49,729][25689] Fps is (10 sec: 5503.7, 60 sec: 5519.4, 300 sec: 5532.6). Total num frames: 985533440. Throughput: 0: 5858.4. Samples: 985538802. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:49,730][25689] Avg episode reward: [(0, '0.086')] [2022-07-11 00:44:50,719][26022] Updated weights on worker 0-0, policy_version 962443 (0.00090) [2022-07-11 00:44:52,728][26022] Updated weights on worker 0-0, policy_version 962453 (0.00094) [2022-07-11 00:44:54,580][26022] Updated weights on worker 0-0, policy_version 962463 (0.00089) [2022-07-11 00:44:54,843][25689] Fps is (10 sec: 5661.6, 60 sec: 5531.8, 300 sec: 5544.4). Total num frames: 985564160. Throughput: 0: 5846.4. Samples: 985572330. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:54,844][25689] Avg episode reward: [(0, '0.354')] [2022-07-11 00:44:56,457][26022] Updated weights on worker 0-0, policy_version 962473 (0.00087) [2022-07-11 00:44:58,080][26022] Updated weights on worker 0-0, policy_version 962483 (0.00090) [2022-07-11 00:44:59,873][25689] Fps is (10 sec: 5753.9, 60 sec: 5580.4, 300 sec: 5544.1). Total num frames: 985591808. Throughput: 0: 4993.2. Samples: 985588964. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:44:59,874][25689] Avg episode reward: [(0, '-0.902')] [2022-07-11 00:45:00,209][26022] Updated weights on worker 0-0, policy_version 962493 (0.00089) [2022-07-11 00:45:02,128][26022] Updated weights on worker 0-0, policy_version 962503 (0.00090) [2022-07-11 00:45:04,159][26022] Updated weights on worker 0-0, policy_version 962513 (0.00088) [2022-07-11 00:45:04,900][25689] Fps is (10 sec: 5396.3, 60 sec: 5561.0, 300 sec: 5540.4). Total num frames: 985618432. Throughput: 0: 5711.0. Samples: 985620406. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:45:04,901][25689] Avg episode reward: [(0, '-1.057')] [2022-07-11 00:45:06,052][26022] Updated weights on worker 0-0, policy_version 962523 (0.00087) [2022-07-11 00:45:07,699][26022] Updated weights on worker 0-0, policy_version 962533 (0.00087) [2022-07-11 00:45:09,555][26022] Updated weights on worker 0-0, policy_version 962543 (0.00092) [2022-07-11 00:45:09,919][25689] Fps is (10 sec: 5300.7, 60 sec: 5525.6, 300 sec: 5537.6). Total num frames: 985645056. Throughput: 0: 5701.2. Samples: 985653906. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:45:09,920][25689] Avg episode reward: [(0, '-1.657')] [2022-07-11 00:45:11,411][26022] Updated weights on worker 0-0, policy_version 962553 (0.00083) [2022-07-11 00:45:13,232][26022] Updated weights on worker 0-0, policy_version 962563 (0.00089) [2022-07-11 00:45:15,035][25689] Fps is (10 sec: 5355.1, 60 sec: 5521.0, 300 sec: 5535.6). Total num frames: 985672704. Throughput: 0: 4866.9. Samples: 985670602. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:45:15,036][25689] Avg episode reward: [(0, '-1.029')] [2022-07-11 00:45:15,285][26022] Updated weights on worker 0-0, policy_version 962573 (0.00091) [2022-07-11 00:45:16,915][26022] Updated weights on worker 0-0, policy_version 962583 (0.00090) [2022-07-11 00:45:18,910][26022] Updated weights on worker 0-0, policy_version 962593 (0.00084) [2022-07-11 00:45:19,977][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:45:19,990][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000962599_985701376.pth [2022-07-11 00:45:19,990][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000960649_983704576.pth [2022-07-11 00:45:20,051][25689] Fps is (10 sec: 5558.7, 60 sec: 5538.9, 300 sec: 5539.3). Total num frames: 985701376. Throughput: 0: 5698.5. Samples: 985703944. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:45:20,051][25689] Avg episode reward: [(0, '-0.570')] [2022-07-11 00:45:20,664][26022] Updated weights on worker 0-0, policy_version 962603 (0.00092) [2022-07-11 00:45:22,533][26022] Updated weights on worker 0-0, policy_version 962613 (0.00087) [2022-07-11 00:45:24,273][26022] Updated weights on worker 0-0, policy_version 962623 (0.00090) [2022-07-11 00:45:25,066][25689] Fps is (10 sec: 5716.8, 60 sec: 5556.9, 300 sec: 5542.8). Total num frames: 985730048. Throughput: 0: 5797.0. Samples: 985737306. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:45:25,067][25689] Avg episode reward: [(0, '-0.464')] [2022-07-11 00:45:26,124][26022] Updated weights on worker 0-0, policy_version 962633 (0.00094) [2022-07-11 00:45:27,976][26022] Updated weights on worker 0-0, policy_version 962643 (0.00091) [2022-07-11 00:45:29,881][26022] Updated weights on worker 0-0, policy_version 962653 (0.00086) [2022-07-11 00:45:30,075][25689] Fps is (10 sec: 5618.3, 60 sec: 5540.4, 300 sec: 5541.4). Total num frames: 985757696. Throughput: 0: 4956.9. Samples: 985753818. Policy #0 lag: (min: 0.0, avg: 7.4, max: 18.0) [2022-07-11 00:45:30,076][25689] Avg episode reward: [(0, '0.930')] [2022-07-11 00:45:31,676][26022] Updated weights on worker 0-0, policy_version 962663 (0.00093) [2022-07-11 00:45:33,605][26022] Updated weights on worker 0-0, policy_version 962673 (0.00091) [2022-07-11 00:45:35,122][25689] Fps is (10 sec: 5498.8, 60 sec: 5522.7, 300 sec: 5544.0). Total num frames: 985785344. Throughput: 0: 5798.0. Samples: 985787066. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:45:35,123][25689] Avg episode reward: [(0, '0.617')] [2022-07-11 00:45:35,559][26022] Updated weights on worker 0-0, policy_version 962683 (0.00085) [2022-07-11 00:45:37,396][26022] Updated weights on worker 0-0, policy_version 962693 (0.00092) [2022-07-11 00:45:39,095][26022] Updated weights on worker 0-0, policy_version 962703 (0.00088) [2022-07-11 00:45:40,131][25689] Fps is (10 sec: 5397.2, 60 sec: 5522.5, 300 sec: 5534.0). Total num frames: 985811968. Throughput: 0: 5799.9. Samples: 985820406. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:45:40,131][25689] Avg episode reward: [(0, '0.910')] [2022-07-11 00:45:41,099][26022] Updated weights on worker 0-0, policy_version 962713 (0.00089) [2022-07-11 00:45:42,817][26022] Updated weights on worker 0-0, policy_version 962723 (0.00084) [2022-07-11 00:45:44,712][26022] Updated weights on worker 0-0, policy_version 962733 (0.00084) [2022-07-11 00:45:45,149][25689] Fps is (10 sec: 5514.7, 60 sec: 5523.7, 300 sec: 5538.1). Total num frames: 985840640. Throughput: 0: 4965.4. Samples: 985837026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:45:45,149][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 00:45:46,493][26022] Updated weights on worker 0-0, policy_version 962743 (0.00085) [2022-07-11 00:45:48,219][26022] Updated weights on worker 0-0, policy_version 962753 (0.00083) [2022-07-11 00:45:50,158][25689] Fps is (10 sec: 5616.9, 60 sec: 5541.2, 300 sec: 5539.1). Total num frames: 985868288. Throughput: 0: 5806.0. Samples: 985870418. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:45:50,160][25689] Avg episode reward: [(0, '1.008')] [2022-07-11 00:45:50,449][26022] Updated weights on worker 0-0, policy_version 962763 (0.00086) [2022-07-11 00:45:52,103][26022] Updated weights on worker 0-0, policy_version 962773 (0.00087) [2022-07-11 00:45:54,022][26022] Updated weights on worker 0-0, policy_version 962783 (0.00086) [2022-07-11 00:45:55,267][25689] Fps is (10 sec: 5566.5, 60 sec: 5507.8, 300 sec: 5535.4). Total num frames: 985896960. Throughput: 0: 5793.7. Samples: 985903778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:45:55,267][25689] Avg episode reward: [(0, '1.304')] [2022-07-11 00:45:55,492][26022] Updated weights on worker 0-0, policy_version 962793 (0.00082) [2022-07-11 00:45:57,747][26022] Updated weights on worker 0-0, policy_version 962803 (0.00082) [2022-07-11 00:45:59,392][26022] Updated weights on worker 0-0, policy_version 962813 (0.00090) [2022-07-11 00:46:00,272][25689] Fps is (10 sec: 5568.6, 60 sec: 5510.1, 300 sec: 5543.1). Total num frames: 985924608. Throughput: 0: 4972.3. Samples: 985920552. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:00,273][25689] Avg episode reward: [(0, '1.550')] [2022-07-11 00:46:01,390][26022] Updated weights on worker 0-0, policy_version 962823 (0.00100) [2022-07-11 00:46:03,607][26022] Updated weights on worker 0-0, policy_version 962833 (0.00089) [2022-07-11 00:46:05,203][26022] Updated weights on worker 0-0, policy_version 962843 (0.00101) [2022-07-11 00:46:05,293][25689] Fps is (10 sec: 5413.0, 60 sec: 5510.6, 300 sec: 5541.0). Total num frames: 985951232. Throughput: 0: 5707.6. Samples: 985952000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:05,295][25689] Avg episode reward: [(0, '1.789')] [2022-07-11 00:46:07,352][26022] Updated weights on worker 0-0, policy_version 962853 (0.00090) [2022-07-11 00:46:08,983][26022] Updated weights on worker 0-0, policy_version 962863 (0.00087) [2022-07-11 00:46:10,325][25689] Fps is (10 sec: 5296.3, 60 sec: 5509.3, 300 sec: 5532.5). Total num frames: 985977856. Throughput: 0: 5713.6. Samples: 985985648. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:10,326][25689] Avg episode reward: [(0, '1.869')] [2022-07-11 00:46:10,979][26022] Updated weights on worker 0-0, policy_version 962873 (0.00087) [2022-07-11 00:46:12,531][26022] Updated weights on worker 0-0, policy_version 962883 (0.00089) [2022-07-11 00:46:14,611][26022] Updated weights on worker 0-0, policy_version 962893 (0.00094) [2022-07-11 00:46:15,412][25689] Fps is (10 sec: 5464.6, 60 sec: 5529.0, 300 sec: 5538.4). Total num frames: 986006528. Throughput: 0: 4903.2. Samples: 986002554. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:15,413][25689] Avg episode reward: [(0, '1.859')] [2022-07-11 00:46:16,441][26022] Updated weights on worker 0-0, policy_version 962903 (0.00087) [2022-07-11 00:46:18,492][26022] Updated weights on worker 0-0, policy_version 962913 (0.00088) [2022-07-11 00:46:19,987][26022] Updated weights on worker 0-0, policy_version 962923 (0.00059) [2022-07-11 00:46:20,419][25689] Fps is (10 sec: 5579.9, 60 sec: 5512.9, 300 sec: 5535.0). Total num frames: 986034176. Throughput: 0: 5722.4. Samples: 986035842. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:20,419][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 00:46:22,006][26022] Updated weights on worker 0-0, policy_version 962933 (0.00103) [2022-07-11 00:46:23,724][26022] Updated weights on worker 0-0, policy_version 962943 (0.00091) [2022-07-11 00:46:25,422][25689] Fps is (10 sec: 5524.3, 60 sec: 5497.0, 300 sec: 5535.2). Total num frames: 986061824. Throughput: 0: 5831.5. Samples: 986069380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:25,422][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 00:46:25,584][26022] Updated weights on worker 0-0, policy_version 962953 (0.00090) [2022-07-11 00:46:27,420][26022] Updated weights on worker 0-0, policy_version 962963 (0.00091) [2022-07-11 00:46:29,239][26022] Updated weights on worker 0-0, policy_version 962973 (0.00093) [2022-07-11 00:46:30,430][25689] Fps is (10 sec: 5523.1, 60 sec: 5497.0, 300 sec: 5533.1). Total num frames: 986089472. Throughput: 0: 4986.1. Samples: 986085894. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:30,431][25689] Avg episode reward: [(0, '1.003')] [2022-07-11 00:46:31,069][26022] Updated weights on worker 0-0, policy_version 962983 (0.00088) [2022-07-11 00:46:33,011][26022] Updated weights on worker 0-0, policy_version 962993 (0.00081) [2022-07-11 00:46:34,682][26022] Updated weights on worker 0-0, policy_version 963003 (0.00095) [2022-07-11 00:46:35,529][25689] Fps is (10 sec: 5673.7, 60 sec: 5526.3, 300 sec: 5540.0). Total num frames: 986119168. Throughput: 0: 5793.5. Samples: 986119100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:35,529][25689] Avg episode reward: [(0, '0.305')] [2022-07-11 00:46:36,726][26022] Updated weights on worker 0-0, policy_version 963013 (0.00084) [2022-07-11 00:46:38,339][26022] Updated weights on worker 0-0, policy_version 963023 (0.00105) [2022-07-11 00:46:40,517][26022] Updated weights on worker 0-0, policy_version 963033 (0.00093) [2022-07-11 00:46:40,611][25689] Fps is (10 sec: 5532.1, 60 sec: 5519.5, 300 sec: 5532.2). Total num frames: 986145792. Throughput: 0: 5786.0. Samples: 986152674. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:40,612][25689] Avg episode reward: [(0, '0.127')] [2022-07-11 00:46:42,179][26022] Updated weights on worker 0-0, policy_version 963043 (0.00099) [2022-07-11 00:46:43,824][26022] Updated weights on worker 0-0, policy_version 963053 (0.00095) [2022-07-11 00:46:45,636][25689] Fps is (10 sec: 5572.0, 60 sec: 5535.8, 300 sec: 5538.9). Total num frames: 986175488. Throughput: 0: 4950.1. Samples: 986169444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:45,637][25689] Avg episode reward: [(0, '0.126')] [2022-07-11 00:46:45,855][26022] Updated weights on worker 0-0, policy_version 963063 (0.00100) [2022-07-11 00:46:47,643][26022] Updated weights on worker 0-0, policy_version 963073 (0.00093) [2022-07-11 00:46:49,509][26022] Updated weights on worker 0-0, policy_version 963083 (0.00097) [2022-07-11 00:46:50,647][25689] Fps is (10 sec: 5713.9, 60 sec: 5535.7, 300 sec: 5536.9). Total num frames: 986203136. Throughput: 0: 5789.4. Samples: 986202934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:50,647][25689] Avg episode reward: [(0, '1.039')] [2022-07-11 00:46:51,515][26022] Updated weights on worker 0-0, policy_version 963093 (0.00083) [2022-07-11 00:46:53,007][26022] Updated weights on worker 0-0, policy_version 963103 (0.00088) [2022-07-11 00:46:55,144][26022] Updated weights on worker 0-0, policy_version 963113 (0.00086) [2022-07-11 00:46:55,776][25689] Fps is (10 sec: 5453.6, 60 sec: 5516.9, 300 sec: 5534.8). Total num frames: 986230784. Throughput: 0: 5794.6. Samples: 986236424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:46:55,776][25689] Avg episode reward: [(0, '0.927')] [2022-07-11 00:46:56,884][26022] Updated weights on worker 0-0, policy_version 963123 (0.00086) [2022-07-11 00:46:58,617][26022] Updated weights on worker 0-0, policy_version 963133 (0.00084) [2022-07-11 00:47:00,428][26022] Updated weights on worker 0-0, policy_version 963143 (0.00091) [2022-07-11 00:47:00,811][25689] Fps is (10 sec: 5440.3, 60 sec: 5514.1, 300 sec: 5534.5). Total num frames: 986258432. Throughput: 0: 4964.1. Samples: 986252950. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:00,812][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 00:47:02,488][26022] Updated weights on worker 0-0, policy_version 963153 (0.00183) [2022-07-11 00:47:04,641][26022] Updated weights on worker 0-0, policy_version 963163 (0.00086) [2022-07-11 00:47:05,864][25689] Fps is (10 sec: 5481.4, 60 sec: 5528.2, 300 sec: 5537.0). Total num frames: 986286080. Throughput: 0: 5688.6. Samples: 986284510. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:05,864][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 00:47:06,275][26022] Updated weights on worker 0-0, policy_version 963173 (0.00088) [2022-07-11 00:47:08,074][26022] Updated weights on worker 0-0, policy_version 963183 (0.00088) [2022-07-11 00:47:10,016][26022] Updated weights on worker 0-0, policy_version 963193 (0.00088) [2022-07-11 00:47:10,878][25689] Fps is (10 sec: 5390.9, 60 sec: 5529.8, 300 sec: 5527.8). Total num frames: 986312704. Throughput: 0: 5700.2. Samples: 986318258. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:10,879][25689] Avg episode reward: [(0, '0.945')] [2022-07-11 00:47:11,944][26022] Updated weights on worker 0-0, policy_version 963203 (0.00089) [2022-07-11 00:47:13,703][26022] Updated weights on worker 0-0, policy_version 963213 (0.00089) [2022-07-11 00:47:15,466][26022] Updated weights on worker 0-0, policy_version 963223 (0.00094) [2022-07-11 00:47:15,947][25689] Fps is (10 sec: 5484.0, 60 sec: 5531.5, 300 sec: 5526.8). Total num frames: 986341376. Throughput: 0: 4886.1. Samples: 986334982. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:15,948][25689] Avg episode reward: [(0, '-0.003')] [2022-07-11 00:47:17,399][26022] Updated weights on worker 0-0, policy_version 963233 (0.00086) [2022-07-11 00:47:19,360][26022] Updated weights on worker 0-0, policy_version 963243 (0.00059) [2022-07-11 00:47:20,051][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:47:20,066][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000963248_986365952.pth [2022-07-11 00:47:20,066][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000961298_984369152.pth [2022-07-11 00:47:21,004][25689] Fps is (10 sec: 5764.4, 60 sec: 5560.7, 300 sec: 5540.4). Total num frames: 986371072. Throughput: 0: 5712.0. Samples: 986368292. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:21,006][25689] Avg episode reward: [(0, '-0.634')] [2022-07-11 00:47:21,007][26022] Updated weights on worker 0-0, policy_version 963253 (0.00113) [2022-07-11 00:47:22,958][26022] Updated weights on worker 0-0, policy_version 963263 (0.00096) [2022-07-11 00:47:25,022][26022] Updated weights on worker 0-0, policy_version 963273 (0.00090) [2022-07-11 00:47:26,089][25689] Fps is (10 sec: 5653.9, 60 sec: 5553.1, 300 sec: 5532.0). Total num frames: 986398720. Throughput: 0: 5785.8. Samples: 986401530. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:26,090][25689] Avg episode reward: [(0, '-0.864')] [2022-07-11 00:47:26,813][26022] Updated weights on worker 0-0, policy_version 963283 (0.00090) [2022-07-11 00:47:28,748][26022] Updated weights on worker 0-0, policy_version 963293 (0.00102) [2022-07-11 00:47:30,405][26022] Updated weights on worker 0-0, policy_version 963303 (0.00088) [2022-07-11 00:47:31,127][25689] Fps is (10 sec: 5361.3, 60 sec: 5533.6, 300 sec: 5526.1). Total num frames: 986425344. Throughput: 0: 5749.5. Samples: 986434676. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:31,129][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 00:47:32,285][26022] Updated weights on worker 0-0, policy_version 963313 (0.00088) [2022-07-11 00:47:34,205][26022] Updated weights on worker 0-0, policy_version 963323 (0.00086) [2022-07-11 00:47:35,819][26022] Updated weights on worker 0-0, policy_version 963333 (0.00089) [2022-07-11 00:47:36,229][25689] Fps is (10 sec: 5554.2, 60 sec: 5533.2, 300 sec: 5535.0). Total num frames: 986455040. Throughput: 0: 5735.5. Samples: 986451312. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:36,231][25689] Avg episode reward: [(0, '-0.280')] [2022-07-11 00:47:37,818][26022] Updated weights on worker 0-0, policy_version 963343 (0.00089) [2022-07-11 00:47:39,417][26022] Updated weights on worker 0-0, policy_version 963353 (0.00088) [2022-07-11 00:47:41,321][25689] Fps is (10 sec: 5625.2, 60 sec: 5549.3, 300 sec: 5523.9). Total num frames: 986482688. Throughput: 0: 5750.3. Samples: 986485120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:41,321][25689] Avg episode reward: [(0, '-0.691')] [2022-07-11 00:47:41,415][26022] Updated weights on worker 0-0, policy_version 963363 (0.00093) [2022-07-11 00:47:43,349][26022] Updated weights on worker 0-0, policy_version 963373 (0.00089) [2022-07-11 00:47:45,223][26022] Updated weights on worker 0-0, policy_version 963383 (0.00081) [2022-07-11 00:47:46,412][25689] Fps is (10 sec: 5531.1, 60 sec: 5526.4, 300 sec: 5529.1). Total num frames: 986511360. Throughput: 0: 5756.3. Samples: 986518512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:46,412][25689] Avg episode reward: [(0, '0.071')] [2022-07-11 00:47:46,947][26022] Updated weights on worker 0-0, policy_version 963393 (0.00087) [2022-07-11 00:47:48,917][26022] Updated weights on worker 0-0, policy_version 963403 (0.00087) [2022-07-11 00:47:50,672][26022] Updated weights on worker 0-0, policy_version 963413 (0.00093) [2022-07-11 00:47:51,485][25689] Fps is (10 sec: 5541.0, 60 sec: 5520.7, 300 sec: 5529.8). Total num frames: 986539008. Throughput: 0: 4944.0. Samples: 986535328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:51,486][25689] Avg episode reward: [(0, '0.860')] [2022-07-11 00:47:52,592][26022] Updated weights on worker 0-0, policy_version 963423 (0.00097) [2022-07-11 00:47:54,482][26022] Updated weights on worker 0-0, policy_version 963433 (0.00082) [2022-07-11 00:47:56,027][26022] Updated weights on worker 0-0, policy_version 963443 (0.00101) [2022-07-11 00:47:56,614][25689] Fps is (10 sec: 5620.7, 60 sec: 5554.4, 300 sec: 5531.1). Total num frames: 986568704. Throughput: 0: 5741.7. Samples: 986568354. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:47:56,614][25689] Avg episode reward: [(0, '1.044')] [2022-07-11 00:47:58,242][26022] Updated weights on worker 0-0, policy_version 963453 (0.00081) [2022-07-11 00:47:59,779][26022] Updated weights on worker 0-0, policy_version 963463 (0.00083) [2022-07-11 00:48:01,627][25689] Fps is (10 sec: 5553.3, 60 sec: 5539.6, 300 sec: 5538.1). Total num frames: 986595328. Throughput: 0: 5774.1. Samples: 986602368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:01,628][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 00:48:01,642][26022] Updated weights on worker 0-0, policy_version 963473 (0.00096) [2022-07-11 00:48:03,846][26022] Updated weights on worker 0-0, policy_version 963483 (0.00091) [2022-07-11 00:48:05,582][26022] Updated weights on worker 0-0, policy_version 963493 (0.00082) [2022-07-11 00:48:06,639][25689] Fps is (10 sec: 5311.3, 60 sec: 5526.4, 300 sec: 5524.2). Total num frames: 986621952. Throughput: 0: 4874.3. Samples: 986617108. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:06,641][25689] Avg episode reward: [(0, '1.643')] [2022-07-11 00:48:07,699][26022] Updated weights on worker 0-0, policy_version 963503 (0.01314) [2022-07-11 00:48:09,377][26022] Updated weights on worker 0-0, policy_version 963513 (0.00090) [2022-07-11 00:48:11,123][26022] Updated weights on worker 0-0, policy_version 963523 (0.00084) [2022-07-11 00:48:11,675][25689] Fps is (10 sec: 5401.1, 60 sec: 5541.4, 300 sec: 5527.8). Total num frames: 986649600. Throughput: 0: 5697.3. Samples: 986650356. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:11,676][25689] Avg episode reward: [(0, '1.676')] [2022-07-11 00:48:13,169][26022] Updated weights on worker 0-0, policy_version 963533 (0.00082) [2022-07-11 00:48:14,927][26022] Updated weights on worker 0-0, policy_version 963543 (0.00091) [2022-07-11 00:48:16,742][25689] Fps is (10 sec: 5575.0, 60 sec: 5541.5, 300 sec: 5530.2). Total num frames: 986678272. Throughput: 0: 5746.8. Samples: 986684024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:16,742][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 00:48:16,743][26022] Updated weights on worker 0-0, policy_version 963553 (0.00093) [2022-07-11 00:48:18,807][26022] Updated weights on worker 0-0, policy_version 963563 (0.00098) [2022-07-11 00:48:20,210][26022] Updated weights on worker 0-0, policy_version 963573 (0.00093) [2022-07-11 00:48:21,777][25689] Fps is (10 sec: 5473.9, 60 sec: 5493.0, 300 sec: 5522.8). Total num frames: 986704896. Throughput: 0: 4881.1. Samples: 986700722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:21,778][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 00:48:22,523][26022] Updated weights on worker 0-0, policy_version 963583 (0.00122) [2022-07-11 00:48:23,978][26022] Updated weights on worker 0-0, policy_version 963593 (0.00092) [2022-07-11 00:48:25,976][26022] Updated weights on worker 0-0, policy_version 963603 (0.00090) [2022-07-11 00:48:26,781][25689] Fps is (10 sec: 5508.0, 60 sec: 5517.2, 300 sec: 5529.7). Total num frames: 986733568. Throughput: 0: 5810.4. Samples: 986734138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:26,783][25689] Avg episode reward: [(0, '-0.251')] [2022-07-11 00:48:27,655][26022] Updated weights on worker 0-0, policy_version 963613 (0.00083) [2022-07-11 00:48:29,591][26022] Updated weights on worker 0-0, policy_version 963623 (0.00091) [2022-07-11 00:48:31,549][26022] Updated weights on worker 0-0, policy_version 963633 (0.00095) [2022-07-11 00:48:31,812][25689] Fps is (10 sec: 5714.1, 60 sec: 5551.5, 300 sec: 5534.9). Total num frames: 986762240. Throughput: 0: 5802.7. Samples: 986767206. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:31,813][25689] Avg episode reward: [(0, '-0.123')] [2022-07-11 00:48:33,334][26022] Updated weights on worker 0-0, policy_version 963643 (0.00079) [2022-07-11 00:48:35,070][26022] Updated weights on worker 0-0, policy_version 963653 (0.00095) [2022-07-11 00:48:36,922][25689] Fps is (10 sec: 5553.6, 60 sec: 5517.1, 300 sec: 5526.3). Total num frames: 986789888. Throughput: 0: 4955.6. Samples: 986784030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:36,923][25689] Avg episode reward: [(0, '-0.246')] [2022-07-11 00:48:37,002][26022] Updated weights on worker 0-0, policy_version 963663 (0.00093) [2022-07-11 00:48:38,853][26022] Updated weights on worker 0-0, policy_version 963673 (0.00085) [2022-07-11 00:48:40,597][26022] Updated weights on worker 0-0, policy_version 963683 (0.00091) [2022-07-11 00:48:41,970][25689] Fps is (10 sec: 5544.6, 60 sec: 5538.0, 300 sec: 5532.8). Total num frames: 986818560. Throughput: 0: 5793.2. Samples: 986817704. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:41,970][25689] Avg episode reward: [(0, '-0.407')] [2022-07-11 00:48:42,469][26022] Updated weights on worker 0-0, policy_version 963693 (0.00087) [2022-07-11 00:48:44,247][26022] Updated weights on worker 0-0, policy_version 963703 (0.00089) [2022-07-11 00:48:46,191][26022] Updated weights on worker 0-0, policy_version 963713 (0.00085) [2022-07-11 00:48:47,019][25689] Fps is (10 sec: 5679.5, 60 sec: 5541.8, 300 sec: 5533.2). Total num frames: 986847232. Throughput: 0: 5791.4. Samples: 986851342. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:47,020][25689] Avg episode reward: [(0, '-0.398')] [2022-07-11 00:48:47,899][26022] Updated weights on worker 0-0, policy_version 963723 (0.00095) [2022-07-11 00:48:49,827][26022] Updated weights on worker 0-0, policy_version 963733 (0.00092) [2022-07-11 00:48:51,755][26022] Updated weights on worker 0-0, policy_version 963743 (0.00087) [2022-07-11 00:48:52,039][25689] Fps is (10 sec: 5491.8, 60 sec: 5529.8, 300 sec: 5523.7). Total num frames: 986873856. Throughput: 0: 4982.5. Samples: 986867984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:52,039][25689] Avg episode reward: [(0, '1.550')] [2022-07-11 00:48:53,464][26022] Updated weights on worker 0-0, policy_version 963753 (0.00089) [2022-07-11 00:48:55,626][26022] Updated weights on worker 0-0, policy_version 963763 (0.00087) [2022-07-11 00:48:56,966][26022] Updated weights on worker 0-0, policy_version 963773 (0.00084) [2022-07-11 00:48:57,130][25689] Fps is (10 sec: 5569.8, 60 sec: 5533.2, 300 sec: 5539.3). Total num frames: 986903552. Throughput: 0: 5782.3. Samples: 986900880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:48:57,131][25689] Avg episode reward: [(0, '1.373')] [2022-07-11 00:48:59,278][26022] Updated weights on worker 0-0, policy_version 963783 (0.00088) [2022-07-11 00:49:01,017][26022] Updated weights on worker 0-0, policy_version 963793 (0.00084) [2022-07-11 00:49:02,208][25689] Fps is (10 sec: 5336.8, 60 sec: 5493.5, 300 sec: 5527.5). Total num frames: 986928128. Throughput: 0: 5755.5. Samples: 986934184. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:49:02,209][25689] Avg episode reward: [(0, '1.489')] [2022-07-11 00:49:03,197][26022] Updated weights on worker 0-0, policy_version 963803 (0.00088) [2022-07-11 00:49:05,118][26022] Updated weights on worker 0-0, policy_version 963813 (0.00090) [2022-07-11 00:49:06,674][26022] Updated weights on worker 0-0, policy_version 963823 (0.00051) [2022-07-11 00:49:07,224][25689] Fps is (10 sec: 5174.0, 60 sec: 5510.1, 300 sec: 5523.8). Total num frames: 986955776. Throughput: 0: 5645.2. Samples: 986965404. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:49:07,224][25689] Avg episode reward: [(0, '1.752')] [2022-07-11 00:49:08,932][26022] Updated weights on worker 0-0, policy_version 963833 (0.00089) [2022-07-11 00:49:10,670][26022] Updated weights on worker 0-0, policy_version 963843 (0.00064) [2022-07-11 00:49:12,237][25689] Fps is (10 sec: 5513.5, 60 sec: 5512.1, 300 sec: 5524.8). Total num frames: 986983424. Throughput: 0: 5640.4. Samples: 986981912. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:49:12,238][25689] Avg episode reward: [(0, '2.041')] [2022-07-11 00:49:12,516][26022] Updated weights on worker 0-0, policy_version 963853 (0.00091) [2022-07-11 00:49:14,439][26022] Updated weights on worker 0-0, policy_version 963863 (0.00098) [2022-07-11 00:49:16,424][26022] Updated weights on worker 0-0, policy_version 963873 (0.00084) [2022-07-11 00:49:17,298][25689] Fps is (10 sec: 5488.9, 60 sec: 5495.7, 300 sec: 5524.2). Total num frames: 987011072. Throughput: 0: 5669.8. Samples: 987015226. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:49:17,299][25689] Avg episode reward: [(0, '1.871')] [2022-07-11 00:49:17,934][26022] Updated weights on worker 0-0, policy_version 963883 (0.00087) [2022-07-11 00:49:20,059][26022] Updated weights on worker 0-0, policy_version 963893 (0.00086) [2022-07-11 00:49:20,129][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:49:20,139][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000963894_987027456.pth [2022-07-11 00:49:20,139][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000961950_985036800.pth [2022-07-11 00:49:21,570][26022] Updated weights on worker 0-0, policy_version 963903 (0.00083) [2022-07-11 00:49:22,375][25689] Fps is (10 sec: 5454.5, 60 sec: 5508.8, 300 sec: 5523.2). Total num frames: 987038720. Throughput: 0: 5667.7. Samples: 987048484. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 00:49:22,375][25689] Avg episode reward: [(0, '0.555')] [2022-07-11 00:49:23,590][26022] Updated weights on worker 0-0, policy_version 963913 (0.00100) [2022-07-11 00:49:25,537][26022] Updated weights on worker 0-0, policy_version 963923 (0.00088) [2022-07-11 00:49:27,289][26022] Updated weights on worker 0-0, policy_version 963933 (0.00090) [2022-07-11 00:49:27,417][25689] Fps is (10 sec: 5565.6, 60 sec: 5505.3, 300 sec: 5522.7). Total num frames: 987067392. Throughput: 0: 4924.7. Samples: 987064850. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:27,418][25689] Avg episode reward: [(0, '0.583')] [2022-07-11 00:49:29,285][26022] Updated weights on worker 0-0, policy_version 963943 (0.00099) [2022-07-11 00:49:31,131][26022] Updated weights on worker 0-0, policy_version 963953 (0.00096) [2022-07-11 00:49:32,418][25689] Fps is (10 sec: 5505.7, 60 sec: 5474.3, 300 sec: 5516.5). Total num frames: 987094016. Throughput: 0: 5757.7. Samples: 987098108. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:32,420][25689] Avg episode reward: [(0, '-0.433')] [2022-07-11 00:49:32,870][26022] Updated weights on worker 0-0, policy_version 963963 (0.00085) [2022-07-11 00:49:34,836][26022] Updated weights on worker 0-0, policy_version 963973 (0.00090) [2022-07-11 00:49:36,603][26022] Updated weights on worker 0-0, policy_version 963983 (0.00087) [2022-07-11 00:49:37,489][25689] Fps is (10 sec: 5591.9, 60 sec: 5511.6, 300 sec: 5525.6). Total num frames: 987123712. Throughput: 0: 5761.7. Samples: 987131560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:37,491][25689] Avg episode reward: [(0, '-2.625')] [2022-07-11 00:49:38,380][26022] Updated weights on worker 0-0, policy_version 963993 (0.00097) [2022-07-11 00:49:40,304][26022] Updated weights on worker 0-0, policy_version 964003 (0.00091) [2022-07-11 00:49:42,024][26022] Updated weights on worker 0-0, policy_version 964013 (0.00089) [2022-07-11 00:49:42,520][25689] Fps is (10 sec: 5676.4, 60 sec: 5496.2, 300 sec: 5522.2). Total num frames: 987151360. Throughput: 0: 4954.7. Samples: 987148298. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:42,521][25689] Avg episode reward: [(0, '-2.336')] [2022-07-11 00:49:43,918][26022] Updated weights on worker 0-0, policy_version 964023 (0.00090) [2022-07-11 00:49:45,809][26022] Updated weights on worker 0-0, policy_version 964033 (0.00088) [2022-07-11 00:49:47,568][25689] Fps is (10 sec: 5588.0, 60 sec: 5496.4, 300 sec: 5528.5). Total num frames: 987180032. Throughput: 0: 5792.8. Samples: 987181578. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:47,568][25689] Avg episode reward: [(0, '-2.389')] [2022-07-11 00:49:47,570][26022] Updated weights on worker 0-0, policy_version 964043 (0.00088) [2022-07-11 00:49:49,576][26022] Updated weights on worker 0-0, policy_version 964053 (0.00084) [2022-07-11 00:49:51,147][26022] Updated weights on worker 0-0, policy_version 964063 (0.00087) [2022-07-11 00:49:52,609][25689] Fps is (10 sec: 5582.3, 60 sec: 5511.3, 300 sec: 5519.5). Total num frames: 987207680. Throughput: 0: 5798.9. Samples: 987215194. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:52,610][25689] Avg episode reward: [(0, '-2.050')] [2022-07-11 00:49:53,194][26022] Updated weights on worker 0-0, policy_version 964073 (0.00088) [2022-07-11 00:49:54,984][26022] Updated weights on worker 0-0, policy_version 964083 (0.00838) [2022-07-11 00:49:56,824][26022] Updated weights on worker 0-0, policy_version 964093 (0.00089) [2022-07-11 00:49:57,670][25689] Fps is (10 sec: 5473.9, 60 sec: 5480.4, 300 sec: 5518.9). Total num frames: 987235328. Throughput: 0: 4965.3. Samples: 987231762. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:49:57,670][25689] Avg episode reward: [(0, '-2.235')] [2022-07-11 00:49:58,678][26022] Updated weights on worker 0-0, policy_version 964103 (0.00087) [2022-07-11 00:50:00,432][26022] Updated weights on worker 0-0, policy_version 964113 (0.00085) [2022-07-11 00:50:02,705][25689] Fps is (10 sec: 5376.0, 60 sec: 5518.1, 300 sec: 5518.8). Total num frames: 987261952. Throughput: 0: 5793.1. Samples: 987265228. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:02,705][25689] Avg episode reward: [(0, '-1.190')] [2022-07-11 00:50:02,715][26022] Updated weights on worker 0-0, policy_version 964123 (0.00089) [2022-07-11 00:50:04,574][26022] Updated weights on worker 0-0, policy_version 964133 (0.00087) [2022-07-11 00:50:06,394][26022] Updated weights on worker 0-0, policy_version 964143 (0.00096) [2022-07-11 00:50:07,790][25689] Fps is (10 sec: 5362.7, 60 sec: 5511.7, 300 sec: 5521.0). Total num frames: 987289600. Throughput: 0: 5695.2. Samples: 987296746. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:07,790][25689] Avg episode reward: [(0, '0.435')] [2022-07-11 00:50:08,330][26022] Updated weights on worker 0-0, policy_version 964153 (0.00100) [2022-07-11 00:50:10,029][26022] Updated weights on worker 0-0, policy_version 964163 (0.00089) [2022-07-11 00:50:11,960][26022] Updated weights on worker 0-0, policy_version 964173 (0.00085) [2022-07-11 00:50:12,870][25689] Fps is (10 sec: 5439.6, 60 sec: 5505.7, 300 sec: 5521.7). Total num frames: 987317248. Throughput: 0: 4843.4. Samples: 987313324. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:12,872][25689] Avg episode reward: [(0, '0.349')] [2022-07-11 00:50:13,732][26022] Updated weights on worker 0-0, policy_version 964183 (0.00092) [2022-07-11 00:50:15,519][26022] Updated weights on worker 0-0, policy_version 964193 (0.00088) [2022-07-11 00:50:17,358][26022] Updated weights on worker 0-0, policy_version 964203 (0.00097) [2022-07-11 00:50:17,956][25689] Fps is (10 sec: 5640.7, 60 sec: 5537.2, 300 sec: 5523.8). Total num frames: 987346944. Throughput: 0: 5673.3. Samples: 987346852. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:17,956][25689] Avg episode reward: [(0, '-0.910')] [2022-07-11 00:50:19,426][26022] Updated weights on worker 0-0, policy_version 964213 (0.00092) [2022-07-11 00:50:21,020][26022] Updated weights on worker 0-0, policy_version 964223 (0.00094) [2022-07-11 00:50:22,967][25689] Fps is (10 sec: 5578.0, 60 sec: 5526.3, 300 sec: 5517.0). Total num frames: 987373568. Throughput: 0: 5683.5. Samples: 987380388. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:22,967][25689] Avg episode reward: [(0, '-0.110')] [2022-07-11 00:50:23,012][26022] Updated weights on worker 0-0, policy_version 964233 (0.00096) [2022-07-11 00:50:24,828][26022] Updated weights on worker 0-0, policy_version 964243 (0.00091) [2022-07-11 00:50:26,696][26022] Updated weights on worker 0-0, policy_version 964253 (0.00099) [2022-07-11 00:50:27,974][25689] Fps is (10 sec: 5519.5, 60 sec: 5529.5, 300 sec: 5520.5). Total num frames: 987402240. Throughput: 0: 4949.8. Samples: 987396654. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:27,975][25689] Avg episode reward: [(0, '-0.588')] [2022-07-11 00:50:28,606][26022] Updated weights on worker 0-0, policy_version 964263 (0.00084) [2022-07-11 00:50:30,511][26022] Updated weights on worker 0-0, policy_version 964273 (0.00086) [2022-07-11 00:50:32,075][26022] Updated weights on worker 0-0, policy_version 964283 (0.00084) [2022-07-11 00:50:32,976][25689] Fps is (10 sec: 5627.0, 60 sec: 5546.3, 300 sec: 5521.3). Total num frames: 987429888. Throughput: 0: 5799.7. Samples: 987429932. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:32,976][25689] Avg episode reward: [(0, '-0.746')] [2022-07-11 00:50:34,316][26022] Updated weights on worker 0-0, policy_version 964293 (0.00087) [2022-07-11 00:50:35,813][26022] Updated weights on worker 0-0, policy_version 964303 (0.00081) [2022-07-11 00:50:37,935][26022] Updated weights on worker 0-0, policy_version 964313 (0.00088) [2022-07-11 00:50:38,054][25689] Fps is (10 sec: 5485.6, 60 sec: 5511.8, 300 sec: 5523.5). Total num frames: 987457536. Throughput: 0: 5808.2. Samples: 987463588. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:38,055][25689] Avg episode reward: [(0, '-0.827')] [2022-07-11 00:50:39,434][26022] Updated weights on worker 0-0, policy_version 964323 (0.00094) [2022-07-11 00:50:41,431][26022] Updated weights on worker 0-0, policy_version 964333 (0.00087) [2022-07-11 00:50:43,084][25689] Fps is (10 sec: 5470.5, 60 sec: 5512.0, 300 sec: 5519.8). Total num frames: 987485184. Throughput: 0: 4967.5. Samples: 987480318. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:43,084][25689] Avg episode reward: [(0, '-0.587')] [2022-07-11 00:50:43,395][26022] Updated weights on worker 0-0, policy_version 964343 (0.00097) [2022-07-11 00:50:44,999][26022] Updated weights on worker 0-0, policy_version 964353 (0.00082) [2022-07-11 00:50:46,999][26022] Updated weights on worker 0-0, policy_version 964363 (0.00089) [2022-07-11 00:50:48,172][25689] Fps is (10 sec: 5769.1, 60 sec: 5542.1, 300 sec: 5528.7). Total num frames: 987515904. Throughput: 0: 5807.2. Samples: 987513946. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:48,172][25689] Avg episode reward: [(0, '0.974')] [2022-07-11 00:50:48,700][26022] Updated weights on worker 0-0, policy_version 964373 (0.00089) [2022-07-11 00:50:50,508][26022] Updated weights on worker 0-0, policy_version 964383 (0.00091) [2022-07-11 00:50:52,368][26022] Updated weights on worker 0-0, policy_version 964393 (0.00091) [2022-07-11 00:50:53,217][25689] Fps is (10 sec: 5759.9, 60 sec: 5541.7, 300 sec: 5526.4). Total num frames: 987543552. Throughput: 0: 5816.3. Samples: 987547662. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:53,218][25689] Avg episode reward: [(0, '1.163')] [2022-07-11 00:50:54,215][26022] Updated weights on worker 0-0, policy_version 964403 (0.00094) [2022-07-11 00:50:56,084][26022] Updated weights on worker 0-0, policy_version 964413 (0.00092) [2022-07-11 00:50:58,073][26022] Updated weights on worker 0-0, policy_version 964423 (0.00101) [2022-07-11 00:50:58,283][25689] Fps is (10 sec: 5367.4, 60 sec: 5524.3, 300 sec: 5521.8). Total num frames: 987570176. Throughput: 0: 4984.0. Samples: 987564408. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:50:58,284][25689] Avg episode reward: [(0, '1.545')] [2022-07-11 00:50:59,938][26022] Updated weights on worker 0-0, policy_version 964433 (0.00084) [2022-07-11 00:51:02,142][26022] Updated weights on worker 0-0, policy_version 964443 (0.00092) [2022-07-11 00:51:03,316][25689] Fps is (10 sec: 5171.3, 60 sec: 5507.6, 300 sec: 5518.2). Total num frames: 987595776. Throughput: 0: 5744.0. Samples: 987596532. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:03,316][25689] Avg episode reward: [(0, '1.899')] [2022-07-11 00:51:04,027][26022] Updated weights on worker 0-0, policy_version 964453 (0.00085) [2022-07-11 00:51:05,701][26022] Updated weights on worker 0-0, policy_version 964463 (0.00090) [2022-07-11 00:51:07,556][26022] Updated weights on worker 0-0, policy_version 964473 (0.00089) [2022-07-11 00:51:08,355][25689] Fps is (10 sec: 5490.3, 60 sec: 5545.6, 300 sec: 5528.4). Total num frames: 987625472. Throughput: 0: 5713.1. Samples: 987629254. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:08,355][25689] Avg episode reward: [(0, '2.056')] [2022-07-11 00:51:09,467][26022] Updated weights on worker 0-0, policy_version 964483 (0.00095) [2022-07-11 00:51:11,197][26022] Updated weights on worker 0-0, policy_version 964493 (0.00089) [2022-07-11 00:51:12,982][26022] Updated weights on worker 0-0, policy_version 964503 (0.00091) [2022-07-11 00:51:13,385][25689] Fps is (10 sec: 5593.4, 60 sec: 5533.3, 300 sec: 5522.5). Total num frames: 987652096. Throughput: 0: 4865.1. Samples: 987645780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:13,387][25689] Avg episode reward: [(0, '1.366')] [2022-07-11 00:51:14,857][26022] Updated weights on worker 0-0, policy_version 964513 (0.00089) [2022-07-11 00:51:16,770][26022] Updated weights on worker 0-0, policy_version 964523 (0.00092) [2022-07-11 00:51:18,512][25689] Fps is (10 sec: 5443.8, 60 sec: 5512.6, 300 sec: 5523.7). Total num frames: 987680768. Throughput: 0: 5680.1. Samples: 987679312. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:18,513][25689] Avg episode reward: [(0, '1.268')] [2022-07-11 00:51:18,560][26022] Updated weights on worker 0-0, policy_version 964533 (0.00496) [2022-07-11 00:51:20,427][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:51:20,437][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000964543_987692032.pth [2022-07-11 00:51:20,441][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000962599_985701376.pth [2022-07-11 00:51:20,446][26022] Updated weights on worker 0-0, policy_version 964543 (0.00086) [2022-07-11 00:51:22,373][26022] Updated weights on worker 0-0, policy_version 964553 (0.00094) [2022-07-11 00:51:23,589][25689] Fps is (10 sec: 5619.6, 60 sec: 5540.4, 300 sec: 5525.8). Total num frames: 987709440. Throughput: 0: 5734.7. Samples: 987712794. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:23,590][25689] Avg episode reward: [(0, '1.410')] [2022-07-11 00:51:24,019][26022] Updated weights on worker 0-0, policy_version 964563 (0.00082) [2022-07-11 00:51:26,035][26022] Updated weights on worker 0-0, policy_version 964573 (0.00085) [2022-07-11 00:51:27,515][26022] Updated weights on worker 0-0, policy_version 964583 (0.00090) [2022-07-11 00:51:28,662][25689] Fps is (10 sec: 5549.0, 60 sec: 5517.5, 300 sec: 5524.6). Total num frames: 987737088. Throughput: 0: 5750.4. Samples: 987746028. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:28,663][25689] Avg episode reward: [(0, '1.760')] [2022-07-11 00:51:29,718][26022] Updated weights on worker 0-0, policy_version 964593 (0.00098) [2022-07-11 00:51:31,574][26022] Updated weights on worker 0-0, policy_version 964603 (0.00628) [2022-07-11 00:51:33,316][26022] Updated weights on worker 0-0, policy_version 964613 (0.00091) [2022-07-11 00:51:33,677][25689] Fps is (10 sec: 5481.6, 60 sec: 5516.3, 300 sec: 5519.3). Total num frames: 987764736. Throughput: 0: 5756.3. Samples: 987762586. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:33,677][25689] Avg episode reward: [(0, '0.948')] [2022-07-11 00:51:35,243][26022] Updated weights on worker 0-0, policy_version 964623 (0.00099) [2022-07-11 00:51:37,071][26022] Updated weights on worker 0-0, policy_version 964633 (0.00085) [2022-07-11 00:51:38,735][25689] Fps is (10 sec: 5591.3, 60 sec: 5535.1, 300 sec: 5526.6). Total num frames: 987793408. Throughput: 0: 5766.1. Samples: 987795916. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:38,735][25689] Avg episode reward: [(0, '0.766')] [2022-07-11 00:51:38,811][26022] Updated weights on worker 0-0, policy_version 964643 (0.00089) [2022-07-11 00:51:40,669][26022] Updated weights on worker 0-0, policy_version 964653 (0.00091) [2022-07-11 00:51:42,372][26022] Updated weights on worker 0-0, policy_version 964663 (0.00089) [2022-07-11 00:51:43,780][25689] Fps is (10 sec: 5574.5, 60 sec: 5533.7, 300 sec: 5519.4). Total num frames: 987821056. Throughput: 0: 5774.8. Samples: 987829392. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:43,780][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 00:51:44,355][26022] Updated weights on worker 0-0, policy_version 964673 (0.00105) [2022-07-11 00:51:46,333][26022] Updated weights on worker 0-0, policy_version 964683 (0.00090) [2022-07-11 00:51:47,915][26022] Updated weights on worker 0-0, policy_version 964693 (0.00103) [2022-07-11 00:51:48,878][25689] Fps is (10 sec: 5451.7, 60 sec: 5482.2, 300 sec: 5517.8). Total num frames: 987848704. Throughput: 0: 4952.4. Samples: 987846144. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:48,878][25689] Avg episode reward: [(0, '1.059')] [2022-07-11 00:51:49,789][26022] Updated weights on worker 0-0, policy_version 964703 (0.00086) [2022-07-11 00:51:51,660][26022] Updated weights on worker 0-0, policy_version 964713 (0.00089) [2022-07-11 00:51:53,401][26022] Updated weights on worker 0-0, policy_version 964723 (0.00596) [2022-07-11 00:51:53,887][25689] Fps is (10 sec: 5572.6, 60 sec: 5502.4, 300 sec: 5523.4). Total num frames: 987877376. Throughput: 0: 5799.8. Samples: 987879798. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:53,887][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 00:51:55,599][26022] Updated weights on worker 0-0, policy_version 964733 (0.00077) [2022-07-11 00:51:57,202][26022] Updated weights on worker 0-0, policy_version 964743 (0.00096) [2022-07-11 00:51:58,957][25689] Fps is (10 sec: 5689.3, 60 sec: 5535.7, 300 sec: 5526.2). Total num frames: 987906048. Throughput: 0: 5806.1. Samples: 987913328. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:51:58,958][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 00:51:59,008][26022] Updated weights on worker 0-0, policy_version 964753 (0.00088) [2022-07-11 00:52:01,014][26022] Updated weights on worker 0-0, policy_version 964763 (0.00085) [2022-07-11 00:52:02,977][26022] Updated weights on worker 0-0, policy_version 964773 (0.00095) [2022-07-11 00:52:03,988][25689] Fps is (10 sec: 5473.8, 60 sec: 5552.7, 300 sec: 5523.2). Total num frames: 987932672. Throughput: 0: 4977.3. Samples: 987929976. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:03,989][25689] Avg episode reward: [(0, '0.877')] [2022-07-11 00:52:04,883][26022] Updated weights on worker 0-0, policy_version 964783 (0.00099) [2022-07-11 00:52:06,706][26022] Updated weights on worker 0-0, policy_version 964793 (0.00087) [2022-07-11 00:52:08,564][26022] Updated weights on worker 0-0, policy_version 964803 (0.00088) [2022-07-11 00:52:08,997][25689] Fps is (10 sec: 5303.5, 60 sec: 5504.8, 300 sec: 5523.3). Total num frames: 987959296. Throughput: 0: 5757.7. Samples: 987961986. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:08,998][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 00:52:10,239][26022] Updated weights on worker 0-0, policy_version 964813 (0.00092) [2022-07-11 00:52:12,339][26022] Updated weights on worker 0-0, policy_version 964823 (0.00075) [2022-07-11 00:52:13,838][26022] Updated weights on worker 0-0, policy_version 964833 (0.00095) [2022-07-11 00:52:14,036][25689] Fps is (10 sec: 5605.6, 60 sec: 5554.7, 300 sec: 5527.3). Total num frames: 987988992. Throughput: 0: 5753.3. Samples: 987995722. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:14,036][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 00:52:15,996][26022] Updated weights on worker 0-0, policy_version 964843 (0.00087) [2022-07-11 00:52:17,623][26022] Updated weights on worker 0-0, policy_version 964853 (0.00083) [2022-07-11 00:52:19,099][25689] Fps is (10 sec: 5676.8, 60 sec: 5543.7, 300 sec: 5520.3). Total num frames: 988016640. Throughput: 0: 4919.5. Samples: 988012414. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:19,099][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 00:52:19,661][26022] Updated weights on worker 0-0, policy_version 964863 (0.00084) [2022-07-11 00:52:21,411][26022] Updated weights on worker 0-0, policy_version 964873 (0.00084) [2022-07-11 00:52:23,193][26022] Updated weights on worker 0-0, policy_version 964883 (0.00080) [2022-07-11 00:52:24,117][25689] Fps is (10 sec: 5586.3, 60 sec: 5549.0, 300 sec: 5525.0). Total num frames: 988045312. Throughput: 0: 5774.3. Samples: 988046206. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:24,119][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 00:52:25,136][26022] Updated weights on worker 0-0, policy_version 964893 (0.00086) [2022-07-11 00:52:26,910][26022] Updated weights on worker 0-0, policy_version 964903 (0.00092) [2022-07-11 00:52:28,751][26022] Updated weights on worker 0-0, policy_version 964913 (0.00085) [2022-07-11 00:52:29,144][25689] Fps is (10 sec: 5708.4, 60 sec: 5570.2, 300 sec: 5532.0). Total num frames: 988073984. Throughput: 0: 5837.8. Samples: 988079600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:29,146][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 00:52:30,655][26022] Updated weights on worker 0-0, policy_version 964923 (0.00083) [2022-07-11 00:52:32,387][26022] Updated weights on worker 0-0, policy_version 964933 (0.00096) [2022-07-11 00:52:34,168][25689] Fps is (10 sec: 5501.6, 60 sec: 5552.4, 300 sec: 5523.2). Total num frames: 988100608. Throughput: 0: 4987.5. Samples: 988096134. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:34,170][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 00:52:34,236][26022] Updated weights on worker 0-0, policy_version 964943 (0.00097) [2022-07-11 00:52:36,219][26022] Updated weights on worker 0-0, policy_version 964953 (0.00091) [2022-07-11 00:52:38,035][26022] Updated weights on worker 0-0, policy_version 964963 (0.00087) [2022-07-11 00:52:39,234][25689] Fps is (10 sec: 5378.8, 60 sec: 5534.7, 300 sec: 5523.6). Total num frames: 988128256. Throughput: 0: 5813.5. Samples: 988129472. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:39,235][25689] Avg episode reward: [(0, '1.180')] [2022-07-11 00:52:39,890][26022] Updated weights on worker 0-0, policy_version 964973 (0.00096) [2022-07-11 00:52:41,641][26022] Updated weights on worker 0-0, policy_version 964983 (0.00090) [2022-07-11 00:52:43,568][26022] Updated weights on worker 0-0, policy_version 964993 (0.00086) [2022-07-11 00:52:44,252][25689] Fps is (10 sec: 5483.7, 60 sec: 5537.2, 300 sec: 5521.6). Total num frames: 988155904. Throughput: 0: 5792.3. Samples: 988162832. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:44,255][25689] Avg episode reward: [(0, '1.372')] [2022-07-11 00:52:45,306][26022] Updated weights on worker 0-0, policy_version 965003 (0.00087) [2022-07-11 00:52:47,284][26022] Updated weights on worker 0-0, policy_version 965013 (0.00084) [2022-07-11 00:52:48,901][26022] Updated weights on worker 0-0, policy_version 965023 (0.00089) [2022-07-11 00:52:49,295][25689] Fps is (10 sec: 5699.9, 60 sec: 5576.2, 300 sec: 5529.0). Total num frames: 988185600. Throughput: 0: 4958.4. Samples: 988179514. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:49,295][25689] Avg episode reward: [(0, '1.520')] [2022-07-11 00:52:50,942][26022] Updated weights on worker 0-0, policy_version 965033 (0.00088) [2022-07-11 00:52:52,647][26022] Updated weights on worker 0-0, policy_version 965043 (0.00089) [2022-07-11 00:52:54,303][25689] Fps is (10 sec: 5603.5, 60 sec: 5542.4, 300 sec: 5520.9). Total num frames: 988212224. Throughput: 0: 5792.2. Samples: 988212756. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:54,304][25689] Avg episode reward: [(0, '1.435')] [2022-07-11 00:52:54,631][26022] Updated weights on worker 0-0, policy_version 965053 (0.00088) [2022-07-11 00:52:56,405][26022] Updated weights on worker 0-0, policy_version 965063 (0.00085) [2022-07-11 00:52:58,396][26022] Updated weights on worker 0-0, policy_version 965073 (0.00088) [2022-07-11 00:52:59,418][25689] Fps is (10 sec: 5462.4, 60 sec: 5538.3, 300 sec: 5525.9). Total num frames: 988240896. Throughput: 0: 5775.3. Samples: 988246038. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:52:59,418][25689] Avg episode reward: [(0, '1.489')] [2022-07-11 00:52:59,960][26022] Updated weights on worker 0-0, policy_version 965083 (0.00091) [2022-07-11 00:53:02,597][26022] Updated weights on worker 0-0, policy_version 965093 (0.00086) [2022-07-11 00:53:04,204][26022] Updated weights on worker 0-0, policy_version 965103 (0.00090) [2022-07-11 00:53:04,427][25689] Fps is (10 sec: 5360.3, 60 sec: 5523.3, 300 sec: 5522.5). Total num frames: 988266496. Throughput: 0: 4919.2. Samples: 988262080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:53:04,428][25689] Avg episode reward: [(0, '1.196')] [2022-07-11 00:53:06,331][26022] Updated weights on worker 0-0, policy_version 965113 (0.00096) [2022-07-11 00:53:07,951][26022] Updated weights on worker 0-0, policy_version 965123 (0.00094) [2022-07-11 00:53:09,438][25689] Fps is (10 sec: 5212.0, 60 sec: 5523.2, 300 sec: 5519.6). Total num frames: 988293120. Throughput: 0: 5654.8. Samples: 988293418. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:53:09,439][25689] Avg episode reward: [(0, '1.198')] [2022-07-11 00:53:09,893][26022] Updated weights on worker 0-0, policy_version 965133 (0.00093) [2022-07-11 00:53:11,718][26022] Updated weights on worker 0-0, policy_version 965143 (0.00099) [2022-07-11 00:53:13,581][26022] Updated weights on worker 0-0, policy_version 965153 (0.00091) [2022-07-11 00:53:14,455][25689] Fps is (10 sec: 5412.4, 60 sec: 5491.2, 300 sec: 5517.0). Total num frames: 988320768. Throughput: 0: 5654.1. Samples: 988326698. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:53:14,455][25689] Avg episode reward: [(0, '0.387')] [2022-07-11 00:53:15,387][26022] Updated weights on worker 0-0, policy_version 965163 (0.00082) [2022-07-11 00:53:17,147][26022] Updated weights on worker 0-0, policy_version 965173 (0.00092) [2022-07-11 00:53:19,151][26022] Updated weights on worker 0-0, policy_version 965183 (0.00085) [2022-07-11 00:53:19,537][25689] Fps is (10 sec: 5576.9, 60 sec: 5506.5, 300 sec: 5523.1). Total num frames: 988349440. Throughput: 0: 4840.1. Samples: 988343416. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2022-07-11 00:53:19,537][25689] Avg episode reward: [(0, '-0.254')] [2022-07-11 00:53:20,463][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:53:20,480][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000965191_988355584.pth [2022-07-11 00:53:20,480][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000963248_986365952.pth [2022-07-11 00:53:20,918][26022] Updated weights on worker 0-0, policy_version 965193 (0.00086) [2022-07-11 00:53:22,839][26022] Updated weights on worker 0-0, policy_version 965203 (0.00082) [2022-07-11 00:53:24,558][25689] Fps is (10 sec: 5675.9, 60 sec: 5506.2, 300 sec: 5522.7). Total num frames: 988378112. Throughput: 0: 5702.1. Samples: 988376866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:24,558][25689] Avg episode reward: [(0, '-0.195')] [2022-07-11 00:53:24,565][26022] Updated weights on worker 0-0, policy_version 965213 (0.00090) [2022-07-11 00:53:26,432][26022] Updated weights on worker 0-0, policy_version 965223 (0.00088) [2022-07-11 00:53:28,233][26022] Updated weights on worker 0-0, policy_version 965233 (0.00091) [2022-07-11 00:53:29,603][25689] Fps is (10 sec: 5594.7, 60 sec: 5487.6, 300 sec: 5519.0). Total num frames: 988405760. Throughput: 0: 5784.5. Samples: 988410066. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:29,605][25689] Avg episode reward: [(0, '-1.185')] [2022-07-11 00:53:30,138][26022] Updated weights on worker 0-0, policy_version 965243 (0.00091) [2022-07-11 00:53:32,090][26022] Updated weights on worker 0-0, policy_version 965253 (0.00104) [2022-07-11 00:53:33,849][26022] Updated weights on worker 0-0, policy_version 965263 (0.00090) [2022-07-11 00:53:34,611][25689] Fps is (10 sec: 5500.3, 60 sec: 5506.0, 300 sec: 5520.9). Total num frames: 988433408. Throughput: 0: 4951.8. Samples: 988426508. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:34,612][25689] Avg episode reward: [(0, '-0.500')] [2022-07-11 00:53:35,704][26022] Updated weights on worker 0-0, policy_version 965273 (0.00099) [2022-07-11 00:53:37,380][26022] Updated weights on worker 0-0, policy_version 965283 (0.00094) [2022-07-11 00:53:39,572][26022] Updated weights on worker 0-0, policy_version 965293 (0.00088) [2022-07-11 00:53:39,753][25689] Fps is (10 sec: 5346.9, 60 sec: 5482.2, 300 sec: 5512.3). Total num frames: 988460032. Throughput: 0: 5764.7. Samples: 988459960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:39,754][25689] Avg episode reward: [(0, '-1.305')] [2022-07-11 00:53:41,083][26022] Updated weights on worker 0-0, policy_version 965303 (0.00094) [2022-07-11 00:53:43,079][26022] Updated weights on worker 0-0, policy_version 965313 (0.00087) [2022-07-11 00:53:44,684][26022] Updated weights on worker 0-0, policy_version 965323 (0.00084) [2022-07-11 00:53:44,776][25689] Fps is (10 sec: 5641.2, 60 sec: 5532.4, 300 sec: 5519.7). Total num frames: 988490752. Throughput: 0: 5778.0. Samples: 988493688. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:44,777][25689] Avg episode reward: [(0, '-1.351')] [2022-07-11 00:53:46,624][26022] Updated weights on worker 0-0, policy_version 965333 (0.00091) [2022-07-11 00:53:48,550][26022] Updated weights on worker 0-0, policy_version 965343 (0.00089) [2022-07-11 00:53:49,819][25689] Fps is (10 sec: 5798.7, 60 sec: 5498.6, 300 sec: 5522.7). Total num frames: 988518400. Throughput: 0: 5793.8. Samples: 988527192. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:49,821][25689] Avg episode reward: [(0, '-1.005')] [2022-07-11 00:53:50,388][26022] Updated weights on worker 0-0, policy_version 965353 (0.00085) [2022-07-11 00:53:52,226][26022] Updated weights on worker 0-0, policy_version 965363 (0.00085) [2022-07-11 00:53:54,244][26022] Updated weights on worker 0-0, policy_version 965373 (0.00087) [2022-07-11 00:53:54,832][25689] Fps is (10 sec: 5397.1, 60 sec: 5498.2, 300 sec: 5513.8). Total num frames: 988545024. Throughput: 0: 5787.1. Samples: 988543528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:54,832][25689] Avg episode reward: [(0, '-1.081')] [2022-07-11 00:53:55,744][26022] Updated weights on worker 0-0, policy_version 965383 (0.00092) [2022-07-11 00:53:57,909][26022] Updated weights on worker 0-0, policy_version 965393 (0.00091) [2022-07-11 00:53:59,576][26022] Updated weights on worker 0-0, policy_version 965403 (0.00093) [2022-07-11 00:53:59,983][25689] Fps is (10 sec: 5440.5, 60 sec: 5494.9, 300 sec: 5526.2). Total num frames: 988573696. Throughput: 0: 5775.1. Samples: 988576788. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:53:59,983][25689] Avg episode reward: [(0, '-1.371')] [2022-07-11 00:54:01,598][26022] Updated weights on worker 0-0, policy_version 965413 (0.00079) [2022-07-11 00:54:03,877][26022] Updated weights on worker 0-0, policy_version 965423 (0.00087) [2022-07-11 00:54:05,040][25689] Fps is (10 sec: 5416.9, 60 sec: 5507.5, 300 sec: 5522.0). Total num frames: 988600320. Throughput: 0: 5653.4. Samples: 988608248. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:05,041][25689] Avg episode reward: [(0, '-0.249')] [2022-07-11 00:54:05,435][26022] Updated weights on worker 0-0, policy_version 965433 (0.00092) [2022-07-11 00:54:07,559][26022] Updated weights on worker 0-0, policy_version 965443 (0.00082) [2022-07-11 00:54:09,127][26022] Updated weights on worker 0-0, policy_version 965453 (0.00090) [2022-07-11 00:54:10,055][25689] Fps is (10 sec: 5388.1, 60 sec: 5523.9, 300 sec: 5522.0). Total num frames: 988627968. Throughput: 0: 4839.6. Samples: 988625128. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:10,056][25689] Avg episode reward: [(0, '0.518')] [2022-07-11 00:54:11,121][26022] Updated weights on worker 0-0, policy_version 965463 (0.00091) [2022-07-11 00:54:13,068][26022] Updated weights on worker 0-0, policy_version 965473 (0.00087) [2022-07-11 00:54:14,690][26022] Updated weights on worker 0-0, policy_version 965483 (0.00094) [2022-07-11 00:54:15,076][25689] Fps is (10 sec: 5509.6, 60 sec: 5523.6, 300 sec: 5522.7). Total num frames: 988655616. Throughput: 0: 5662.2. Samples: 988658156. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:15,077][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 00:54:16,712][26022] Updated weights on worker 0-0, policy_version 965493 (0.00088) [2022-07-11 00:54:18,430][26022] Updated weights on worker 0-0, policy_version 965503 (0.00097) [2022-07-11 00:54:20,141][25689] Fps is (10 sec: 5482.9, 60 sec: 5508.3, 300 sec: 5522.9). Total num frames: 988683264. Throughput: 0: 5685.2. Samples: 988691388. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:20,141][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 00:54:20,346][26022] Updated weights on worker 0-0, policy_version 965513 (0.00088) [2022-07-11 00:54:22,229][26022] Updated weights on worker 0-0, policy_version 965523 (0.00095) [2022-07-11 00:54:23,959][26022] Updated weights on worker 0-0, policy_version 965533 (0.00092) [2022-07-11 00:54:25,174][25689] Fps is (10 sec: 5577.4, 60 sec: 5507.1, 300 sec: 5523.1). Total num frames: 988711936. Throughput: 0: 4967.7. Samples: 988708270. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:25,175][25689] Avg episode reward: [(0, '1.362')] [2022-07-11 00:54:25,885][26022] Updated weights on worker 0-0, policy_version 965543 (0.00087) [2022-07-11 00:54:27,811][26022] Updated weights on worker 0-0, policy_version 965553 (0.00098) [2022-07-11 00:54:29,747][26022] Updated weights on worker 0-0, policy_version 965563 (0.00101) [2022-07-11 00:54:30,178][25689] Fps is (10 sec: 5509.0, 60 sec: 5494.0, 300 sec: 5523.1). Total num frames: 988738560. Throughput: 0: 5781.7. Samples: 988741472. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:30,180][25689] Avg episode reward: [(0, '1.227')] [2022-07-11 00:54:31,450][26022] Updated weights on worker 0-0, policy_version 965573 (0.00094) [2022-07-11 00:54:33,527][26022] Updated weights on worker 0-0, policy_version 965583 (0.00086) [2022-07-11 00:54:35,018][26022] Updated weights on worker 0-0, policy_version 965593 (0.00086) [2022-07-11 00:54:35,203][25689] Fps is (10 sec: 5514.0, 60 sec: 5509.4, 300 sec: 5520.5). Total num frames: 988767232. Throughput: 0: 5795.2. Samples: 988774792. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:35,203][25689] Avg episode reward: [(0, '1.006')] [2022-07-11 00:54:37,151][26022] Updated weights on worker 0-0, policy_version 965603 (0.00086) [2022-07-11 00:54:38,680][26022] Updated weights on worker 0-0, policy_version 965613 (0.00091) [2022-07-11 00:54:40,265][25689] Fps is (10 sec: 5583.5, 60 sec: 5533.6, 300 sec: 5519.9). Total num frames: 988794880. Throughput: 0: 4972.9. Samples: 988791464. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:40,265][25689] Avg episode reward: [(0, '0.584')] [2022-07-11 00:54:40,706][26022] Updated weights on worker 0-0, policy_version 965623 (0.00092) [2022-07-11 00:54:42,606][26022] Updated weights on worker 0-0, policy_version 965633 (0.00089) [2022-07-11 00:54:44,367][26022] Updated weights on worker 0-0, policy_version 965643 (0.00087) [2022-07-11 00:54:45,333][25689] Fps is (10 sec: 5559.4, 60 sec: 5495.6, 300 sec: 5519.5). Total num frames: 988823552. Throughput: 0: 5785.2. Samples: 988824894. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:45,334][25689] Avg episode reward: [(0, '0.440')] [2022-07-11 00:54:46,211][26022] Updated weights on worker 0-0, policy_version 965653 (0.00090) [2022-07-11 00:54:47,905][26022] Updated weights on worker 0-0, policy_version 965663 (0.00057) [2022-07-11 00:54:49,712][26022] Updated weights on worker 0-0, policy_version 965673 (0.00096) [2022-07-11 00:54:50,362][25689] Fps is (10 sec: 5679.1, 60 sec: 5513.8, 300 sec: 5523.2). Total num frames: 988852224. Throughput: 0: 5811.6. Samples: 988858776. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:50,363][25689] Avg episode reward: [(0, '0.688')] [2022-07-11 00:54:51,770][26022] Updated weights on worker 0-0, policy_version 965683 (0.00085) [2022-07-11 00:54:53,540][26022] Updated weights on worker 0-0, policy_version 965693 (0.00086) [2022-07-11 00:54:55,314][26022] Updated weights on worker 0-0, policy_version 965703 (0.00087) [2022-07-11 00:54:55,371][25689] Fps is (10 sec: 5610.6, 60 sec: 5531.1, 300 sec: 5524.1). Total num frames: 988879872. Throughput: 0: 4999.5. Samples: 988875626. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:54:55,373][25689] Avg episode reward: [(0, '0.031')] [2022-07-11 00:54:57,175][26022] Updated weights on worker 0-0, policy_version 965713 (0.00100) [2022-07-11 00:54:59,035][26022] Updated weights on worker 0-0, policy_version 965723 (0.00095) [2022-07-11 00:55:00,413][25689] Fps is (10 sec: 5501.7, 60 sec: 5524.1, 300 sec: 5527.4). Total num frames: 988907520. Throughput: 0: 5823.3. Samples: 988908792. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:00,414][25689] Avg episode reward: [(0, '-0.057')] [2022-07-11 00:55:00,952][26022] Updated weights on worker 0-0, policy_version 965733 (0.00091) [2022-07-11 00:55:03,230][26022] Updated weights on worker 0-0, policy_version 965743 (0.00093) [2022-07-11 00:55:04,953][26022] Updated weights on worker 0-0, policy_version 965753 (0.00085) [2022-07-11 00:55:05,436][25689] Fps is (10 sec: 5290.4, 60 sec: 5510.3, 300 sec: 5521.7). Total num frames: 988933120. Throughput: 0: 5711.0. Samples: 988939702. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:05,437][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 00:55:06,736][26022] Updated weights on worker 0-0, policy_version 965763 (0.00096) [2022-07-11 00:55:08,817][26022] Updated weights on worker 0-0, policy_version 965773 (0.00091) [2022-07-11 00:55:10,272][26022] Updated weights on worker 0-0, policy_version 965783 (0.00088) [2022-07-11 00:55:10,463][25689] Fps is (10 sec: 5502.2, 60 sec: 5543.2, 300 sec: 5529.6). Total num frames: 988962816. Throughput: 0: 4866.0. Samples: 988956586. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:10,463][25689] Avg episode reward: [(0, '0.294')] [2022-07-11 00:55:12,439][26022] Updated weights on worker 0-0, policy_version 965793 (0.00084) [2022-07-11 00:55:13,879][26022] Updated weights on worker 0-0, policy_version 965803 (0.00085) [2022-07-11 00:55:15,467][25689] Fps is (10 sec: 5512.7, 60 sec: 5510.8, 300 sec: 5517.3). Total num frames: 988988416. Throughput: 0: 5696.8. Samples: 988990106. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:15,467][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 00:55:16,027][26022] Updated weights on worker 0-0, policy_version 965813 (0.00093) [2022-07-11 00:55:18,199][26022] Updated weights on worker 0-0, policy_version 965823 (0.00087) [2022-07-11 00:55:19,425][26022] Updated weights on worker 0-0, policy_version 965833 (0.00088) [2022-07-11 00:55:20,595][25689] Fps is (10 sec: 5457.5, 60 sec: 5538.9, 300 sec: 5525.5). Total num frames: 989018112. Throughput: 0: 5683.7. Samples: 989023500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:20,595][25689] Avg episode reward: [(0, '0.279')] [2022-07-11 00:55:20,678][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:55:20,692][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000965839_989019136.pth [2022-07-11 00:55:20,692][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000963894_987027456.pth [2022-07-11 00:55:21,776][26022] Updated weights on worker 0-0, policy_version 965843 (0.00082) [2022-07-11 00:55:23,081][26022] Updated weights on worker 0-0, policy_version 965853 (0.00085) [2022-07-11 00:55:25,346][26022] Updated weights on worker 0-0, policy_version 965863 (0.00091) [2022-07-11 00:55:25,642][25689] Fps is (10 sec: 5534.9, 60 sec: 5503.7, 300 sec: 5517.9). Total num frames: 989044736. Throughput: 0: 4974.3. Samples: 989040212. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:25,643][25689] Avg episode reward: [(0, '0.974')] [2022-07-11 00:55:26,893][26022] Updated weights on worker 0-0, policy_version 965873 (0.00092) [2022-07-11 00:55:28,998][26022] Updated weights on worker 0-0, policy_version 965883 (0.00085) [2022-07-11 00:55:30,564][26022] Updated weights on worker 0-0, policy_version 965893 (0.00087) [2022-07-11 00:55:30,662][25689] Fps is (10 sec: 5594.6, 60 sec: 5553.1, 300 sec: 5524.4). Total num frames: 989074432. Throughput: 0: 5788.0. Samples: 989073498. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:30,662][25689] Avg episode reward: [(0, '0.820')] [2022-07-11 00:55:32,678][26022] Updated weights on worker 0-0, policy_version 965903 (0.00093) [2022-07-11 00:55:34,450][26022] Updated weights on worker 0-0, policy_version 965913 (0.00088) [2022-07-11 00:55:35,685][25689] Fps is (10 sec: 5506.2, 60 sec: 5502.4, 300 sec: 5518.6). Total num frames: 989100032. Throughput: 0: 5757.4. Samples: 989106508. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:35,685][25689] Avg episode reward: [(0, '1.396')] [2022-07-11 00:55:36,326][26022] Updated weights on worker 0-0, policy_version 965923 (0.00094) [2022-07-11 00:55:38,206][26022] Updated weights on worker 0-0, policy_version 965933 (0.00098) [2022-07-11 00:55:39,906][26022] Updated weights on worker 0-0, policy_version 965943 (0.00091) [2022-07-11 00:55:40,726][25689] Fps is (10 sec: 5596.0, 60 sec: 5555.2, 300 sec: 5528.7). Total num frames: 989130752. Throughput: 0: 4950.9. Samples: 989123168. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:40,727][25689] Avg episode reward: [(0, '0.122')] [2022-07-11 00:55:41,850][26022] Updated weights on worker 0-0, policy_version 965953 (0.00089) [2022-07-11 00:55:43,575][26022] Updated weights on worker 0-0, policy_version 965963 (0.00084) [2022-07-11 00:55:45,546][26022] Updated weights on worker 0-0, policy_version 965973 (0.00091) [2022-07-11 00:55:45,738][25689] Fps is (10 sec: 5602.2, 60 sec: 5509.5, 300 sec: 5512.9). Total num frames: 989156352. Throughput: 0: 5792.5. Samples: 989156616. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:45,739][25689] Avg episode reward: [(0, '0.247')] [2022-07-11 00:55:47,455][26022] Updated weights on worker 0-0, policy_version 965983 (0.00049) [2022-07-11 00:55:49,456][26022] Updated weights on worker 0-0, policy_version 965993 (0.00085) [2022-07-11 00:55:50,747][25689] Fps is (10 sec: 5415.9, 60 sec: 5511.3, 300 sec: 5517.0). Total num frames: 989185024. Throughput: 0: 5803.2. Samples: 989190056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:50,747][25689] Avg episode reward: [(0, '-0.746')] [2022-07-11 00:55:50,981][26022] Updated weights on worker 0-0, policy_version 966003 (0.00087) [2022-07-11 00:55:53,119][26022] Updated weights on worker 0-0, policy_version 966013 (0.00085) [2022-07-11 00:55:54,528][26022] Updated weights on worker 0-0, policy_version 966023 (0.00094) [2022-07-11 00:55:55,791][25689] Fps is (10 sec: 5602.6, 60 sec: 5508.2, 300 sec: 5520.9). Total num frames: 989212672. Throughput: 0: 4991.5. Samples: 989206866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:55:55,791][25689] Avg episode reward: [(0, '-0.628')] [2022-07-11 00:55:56,750][26022] Updated weights on worker 0-0, policy_version 966033 (0.00098) [2022-07-11 00:55:58,161][26022] Updated weights on worker 0-0, policy_version 966043 (0.00092) [2022-07-11 00:56:00,395][26022] Updated weights on worker 0-0, policy_version 966053 (0.00087) [2022-07-11 00:56:00,919][25689] Fps is (10 sec: 5637.2, 60 sec: 5534.1, 300 sec: 5532.8). Total num frames: 989242368. Throughput: 0: 5784.6. Samples: 989239978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:00,920][25689] Avg episode reward: [(0, '-0.503')] [2022-07-11 00:56:02,698][26022] Updated weights on worker 0-0, policy_version 966063 (0.00088) [2022-07-11 00:56:04,338][26022] Updated weights on worker 0-0, policy_version 966073 (0.00088) [2022-07-11 00:56:05,946][25689] Fps is (10 sec: 5344.2, 60 sec: 5516.9, 300 sec: 5515.9). Total num frames: 989266944. Throughput: 0: 5672.6. Samples: 989271246. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:05,947][25689] Avg episode reward: [(0, '-0.500')] [2022-07-11 00:56:06,275][26022] Updated weights on worker 0-0, policy_version 966083 (0.00083) [2022-07-11 00:56:07,882][26022] Updated weights on worker 0-0, policy_version 966093 (0.00092) [2022-07-11 00:56:09,893][26022] Updated weights on worker 0-0, policy_version 966103 (0.00078) [2022-07-11 00:56:10,968][25689] Fps is (10 sec: 5197.1, 60 sec: 5483.4, 300 sec: 5519.5). Total num frames: 989294592. Throughput: 0: 5686.8. Samples: 989305048. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:10,970][25689] Avg episode reward: [(0, '0.860')] [2022-07-11 00:56:11,743][26022] Updated weights on worker 0-0, policy_version 966113 (0.00089) [2022-07-11 00:56:13,482][26022] Updated weights on worker 0-0, policy_version 966123 (0.00086) [2022-07-11 00:56:15,398][26022] Updated weights on worker 0-0, policy_version 966133 (0.00092) [2022-07-11 00:56:15,991][25689] Fps is (10 sec: 5606.9, 60 sec: 5532.5, 300 sec: 5521.4). Total num frames: 989323264. Throughput: 0: 5687.5. Samples: 989321754. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:15,992][25689] Avg episode reward: [(0, '0.914')] [2022-07-11 00:56:17,382][26022] Updated weights on worker 0-0, policy_version 966143 (0.00083) [2022-07-11 00:56:18,978][26022] Updated weights on worker 0-0, policy_version 966153 (0.00091) [2022-07-11 00:56:21,029][25689] Fps is (10 sec: 5495.8, 60 sec: 5489.8, 300 sec: 5515.2). Total num frames: 989349888. Throughput: 0: 5720.4. Samples: 989355016. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:21,030][25689] Avg episode reward: [(0, '1.923')] [2022-07-11 00:56:21,058][26022] Updated weights on worker 0-0, policy_version 966163 (0.00088) [2022-07-11 00:56:22,774][26022] Updated weights on worker 0-0, policy_version 966173 (0.00090) [2022-07-11 00:56:24,687][26022] Updated weights on worker 0-0, policy_version 966183 (0.00088) [2022-07-11 00:56:26,056][25689] Fps is (10 sec: 5595.2, 60 sec: 5542.6, 300 sec: 5523.0). Total num frames: 989379584. Throughput: 0: 5856.5. Samples: 989389022. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:26,057][25689] Avg episode reward: [(0, '1.444')] [2022-07-11 00:56:26,285][26022] Updated weights on worker 0-0, policy_version 966193 (0.00088) [2022-07-11 00:56:28,189][26022] Updated weights on worker 0-0, policy_version 966203 (0.00090) [2022-07-11 00:56:29,959][26022] Updated weights on worker 0-0, policy_version 966213 (0.00100) [2022-07-11 00:56:31,151][25689] Fps is (10 sec: 5665.3, 60 sec: 5501.8, 300 sec: 5521.5). Total num frames: 989407232. Throughput: 0: 4990.0. Samples: 989405762. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:31,153][25689] Avg episode reward: [(0, '1.526')] [2022-07-11 00:56:32,097][26022] Updated weights on worker 0-0, policy_version 966223 (0.00095) [2022-07-11 00:56:33,628][26022] Updated weights on worker 0-0, policy_version 966233 (0.00055) [2022-07-11 00:56:35,615][26022] Updated weights on worker 0-0, policy_version 966243 (0.00086) [2022-07-11 00:56:36,233][25689] Fps is (10 sec: 5534.5, 60 sec: 5547.2, 300 sec: 5521.0). Total num frames: 989435904. Throughput: 0: 5790.4. Samples: 989438964. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:36,233][25689] Avg episode reward: [(0, '1.015')] [2022-07-11 00:56:37,365][26022] Updated weights on worker 0-0, policy_version 966253 (0.00087) [2022-07-11 00:56:39,295][26022] Updated weights on worker 0-0, policy_version 966263 (0.00091) [2022-07-11 00:56:41,054][26022] Updated weights on worker 0-0, policy_version 966273 (0.00084) [2022-07-11 00:56:41,351][25689] Fps is (10 sec: 5622.3, 60 sec: 5506.4, 300 sec: 5523.1). Total num frames: 989464576. Throughput: 0: 5790.8. Samples: 989472692. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:41,352][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 00:56:42,775][26022] Updated weights on worker 0-0, policy_version 966283 (0.00085) [2022-07-11 00:56:44,816][26022] Updated weights on worker 0-0, policy_version 966293 (0.00088) [2022-07-11 00:56:46,439][25689] Fps is (10 sec: 5618.5, 60 sec: 5550.1, 300 sec: 5526.7). Total num frames: 989493248. Throughput: 0: 4929.7. Samples: 989489502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:46,439][25689] Avg episode reward: [(0, '-0.037')] [2022-07-11 00:56:46,535][26022] Updated weights on worker 0-0, policy_version 966303 (0.00084) [2022-07-11 00:56:48,244][26022] Updated weights on worker 0-0, policy_version 966313 (0.00088) [2022-07-11 00:56:50,312][26022] Updated weights on worker 0-0, policy_version 966323 (0.00087) [2022-07-11 00:56:51,475][25689] Fps is (10 sec: 5664.2, 60 sec: 5547.6, 300 sec: 5526.2). Total num frames: 989521920. Throughput: 0: 5779.3. Samples: 989523216. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:51,475][25689] Avg episode reward: [(0, '-0.790')] [2022-07-11 00:56:51,840][26022] Updated weights on worker 0-0, policy_version 966333 (0.00095) [2022-07-11 00:56:53,844][26022] Updated weights on worker 0-0, policy_version 966343 (0.00085) [2022-07-11 00:56:55,665][26022] Updated weights on worker 0-0, policy_version 966353 (0.00084) [2022-07-11 00:56:56,549][25689] Fps is (10 sec: 5570.9, 60 sec: 5544.9, 300 sec: 5522.7). Total num frames: 989549568. Throughput: 0: 5800.4. Samples: 989556804. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:56:56,549][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 00:56:57,502][26022] Updated weights on worker 0-0, policy_version 966363 (0.00085) [2022-07-11 00:56:59,368][26022] Updated weights on worker 0-0, policy_version 966373 (0.00104) [2022-07-11 00:57:01,183][26022] Updated weights on worker 0-0, policy_version 966383 (0.00090) [2022-07-11 00:57:01,618][25689] Fps is (10 sec: 5653.4, 60 sec: 5550.3, 300 sec: 5532.4). Total num frames: 989579264. Throughput: 0: 4971.5. Samples: 989573450. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:57:01,619][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 00:57:03,356][26022] Updated weights on worker 0-0, policy_version 966393 (0.00092) [2022-07-11 00:57:05,219][26022] Updated weights on worker 0-0, policy_version 966403 (0.00110) [2022-07-11 00:57:06,663][25689] Fps is (10 sec: 5365.7, 60 sec: 5548.6, 300 sec: 5524.8). Total num frames: 989603840. Throughput: 0: 5691.8. Samples: 989604614. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:57:06,664][25689] Avg episode reward: [(0, '-0.216')] [2022-07-11 00:57:07,147][26022] Updated weights on worker 0-0, policy_version 966413 (0.00088) [2022-07-11 00:57:09,037][26022] Updated weights on worker 0-0, policy_version 966423 (0.00103) [2022-07-11 00:57:10,674][26022] Updated weights on worker 0-0, policy_version 966433 (0.00094) [2022-07-11 00:57:11,755][25689] Fps is (10 sec: 5151.9, 60 sec: 5542.3, 300 sec: 5516.9). Total num frames: 989631488. Throughput: 0: 5658.9. Samples: 989637980. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:57:11,755][25689] Avg episode reward: [(0, '0.260')] [2022-07-11 00:57:12,729][26022] Updated weights on worker 0-0, policy_version 966443 (0.00083) [2022-07-11 00:57:14,499][26022] Updated weights on worker 0-0, policy_version 966453 (0.00059) [2022-07-11 00:57:16,526][26022] Updated weights on worker 0-0, policy_version 966463 (0.00082) [2022-07-11 00:57:16,761][25689] Fps is (10 sec: 5475.7, 60 sec: 5526.9, 300 sec: 5518.0). Total num frames: 989659136. Throughput: 0: 4838.5. Samples: 989654604. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 00:57:16,762][25689] Avg episode reward: [(0, '0.150')] [2022-07-11 00:57:18,055][26022] Updated weights on worker 0-0, policy_version 966473 (0.00094) [2022-07-11 00:57:20,147][26022] Updated weights on worker 0-0, policy_version 966483 (0.00092) [2022-07-11 00:57:20,881][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:57:20,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000966487_989682688.pth [2022-07-11 00:57:20,898][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000964543_987692032.pth [2022-07-11 00:57:21,774][26022] Updated weights on worker 0-0, policy_version 966493 (0.00110) [2022-07-11 00:57:21,877][25689] Fps is (10 sec: 5665.1, 60 sec: 5570.4, 300 sec: 5519.6). Total num frames: 989688832. Throughput: 0: 5647.0. Samples: 989687854. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:21,878][25689] Avg episode reward: [(0, '-0.025')] [2022-07-11 00:57:23,922][26022] Updated weights on worker 0-0, policy_version 966503 (0.00083) [2022-07-11 00:57:25,511][26022] Updated weights on worker 0-0, policy_version 966513 (0.00083) [2022-07-11 00:57:26,890][25689] Fps is (10 sec: 5560.5, 60 sec: 5521.2, 300 sec: 5513.0). Total num frames: 989715456. Throughput: 0: 5772.8. Samples: 989721380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:26,891][25689] Avg episode reward: [(0, '0.170')] [2022-07-11 00:57:27,563][26022] Updated weights on worker 0-0, policy_version 966523 (0.00095) [2022-07-11 00:57:29,216][26022] Updated weights on worker 0-0, policy_version 966533 (0.00093) [2022-07-11 00:57:31,222][26022] Updated weights on worker 0-0, policy_version 966543 (0.00088) [2022-07-11 00:57:31,905][25689] Fps is (10 sec: 5514.1, 60 sec: 5545.3, 300 sec: 5520.0). Total num frames: 989744128. Throughput: 0: 4956.6. Samples: 989737858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:31,906][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 00:57:33,002][26022] Updated weights on worker 0-0, policy_version 966553 (0.00091) [2022-07-11 00:57:34,936][26022] Updated weights on worker 0-0, policy_version 966563 (0.00101) [2022-07-11 00:57:36,789][26022] Updated weights on worker 0-0, policy_version 966573 (0.00088) [2022-07-11 00:57:36,964][25689] Fps is (10 sec: 5489.1, 60 sec: 5513.6, 300 sec: 5516.7). Total num frames: 989770752. Throughput: 0: 5761.7. Samples: 989771004. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:36,964][25689] Avg episode reward: [(0, '0.403')] [2022-07-11 00:57:38,613][26022] Updated weights on worker 0-0, policy_version 966583 (0.00082) [2022-07-11 00:57:40,403][26022] Updated weights on worker 0-0, policy_version 966593 (0.00090) [2022-07-11 00:57:42,060][25689] Fps is (10 sec: 5445.4, 60 sec: 5515.6, 300 sec: 5518.7). Total num frames: 989799424. Throughput: 0: 5755.4. Samples: 989804014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:42,061][25689] Avg episode reward: [(0, '0.144')] [2022-07-11 00:57:42,444][26022] Updated weights on worker 0-0, policy_version 966603 (0.00089) [2022-07-11 00:57:43,937][26022] Updated weights on worker 0-0, policy_version 966613 (0.00087) [2022-07-11 00:57:46,092][26022] Updated weights on worker 0-0, policy_version 966623 (0.00090) [2022-07-11 00:57:47,135][25689] Fps is (10 sec: 5638.2, 60 sec: 5516.8, 300 sec: 5514.7). Total num frames: 989828096. Throughput: 0: 5728.6. Samples: 989837352. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:47,135][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 00:57:47,598][26022] Updated weights on worker 0-0, policy_version 966633 (0.00089) [2022-07-11 00:57:49,611][26022] Updated weights on worker 0-0, policy_version 966643 (0.00079) [2022-07-11 00:57:51,670][26022] Updated weights on worker 0-0, policy_version 966653 (0.00092) [2022-07-11 00:57:52,159][25689] Fps is (10 sec: 5475.3, 60 sec: 5484.1, 300 sec: 5514.4). Total num frames: 989854720. Throughput: 0: 5739.3. Samples: 989854100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:52,160][25689] Avg episode reward: [(0, '-0.155')] [2022-07-11 00:57:53,268][26022] Updated weights on worker 0-0, policy_version 966663 (0.00095) [2022-07-11 00:57:55,336][26022] Updated weights on worker 0-0, policy_version 966673 (0.00088) [2022-07-11 00:57:57,011][26022] Updated weights on worker 0-0, policy_version 966683 (0.00091) [2022-07-11 00:57:57,228][25689] Fps is (10 sec: 5580.0, 60 sec: 5518.3, 300 sec: 5518.7). Total num frames: 989884416. Throughput: 0: 5738.7. Samples: 989887292. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:57:57,228][25689] Avg episode reward: [(0, '0.010')] [2022-07-11 00:57:59,034][26022] Updated weights on worker 0-0, policy_version 966693 (0.00102) [2022-07-11 00:58:00,883][26022] Updated weights on worker 0-0, policy_version 966703 (0.00085) [2022-07-11 00:58:02,290][25689] Fps is (10 sec: 5357.2, 60 sec: 5434.6, 300 sec: 5514.3). Total num frames: 989908992. Throughput: 0: 5655.2. Samples: 989918418. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:02,290][25689] Avg episode reward: [(0, '0.030')] [2022-07-11 00:58:03,087][26022] Updated weights on worker 0-0, policy_version 966713 (0.00088) [2022-07-11 00:58:04,748][26022] Updated weights on worker 0-0, policy_version 966723 (0.00089) [2022-07-11 00:58:06,816][26022] Updated weights on worker 0-0, policy_version 966733 (0.00094) [2022-07-11 00:58:07,297][25689] Fps is (10 sec: 5186.3, 60 sec: 5488.6, 300 sec: 5517.8). Total num frames: 989936640. Throughput: 0: 4845.3. Samples: 989935048. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:07,298][25689] Avg episode reward: [(0, '-0.332')] [2022-07-11 00:58:08,580][26022] Updated weights on worker 0-0, policy_version 966743 (0.00048) [2022-07-11 00:58:10,413][26022] Updated weights on worker 0-0, policy_version 966753 (0.00086) [2022-07-11 00:58:12,325][25689] Fps is (10 sec: 5510.3, 60 sec: 5494.5, 300 sec: 5517.6). Total num frames: 989964288. Throughput: 0: 5667.0. Samples: 989968378. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:12,325][25689] Avg episode reward: [(0, '-0.439')] [2022-07-11 00:58:12,363][26022] Updated weights on worker 0-0, policy_version 966763 (0.00085) [2022-07-11 00:58:14,014][26022] Updated weights on worker 0-0, policy_version 966773 (0.00088) [2022-07-11 00:58:15,950][26022] Updated weights on worker 0-0, policy_version 966783 (0.00090) [2022-07-11 00:58:17,346][25689] Fps is (10 sec: 5706.5, 60 sec: 5526.9, 300 sec: 5522.1). Total num frames: 989993984. Throughput: 0: 5703.1. Samples: 990002030. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:17,347][25689] Avg episode reward: [(0, '-0.237')] [2022-07-11 00:58:17,546][26022] Updated weights on worker 0-0, policy_version 966793 (0.00091) [2022-07-11 00:58:19,724][26022] Updated weights on worker 0-0, policy_version 966803 (0.00092) [2022-07-11 00:58:21,275][26022] Updated weights on worker 0-0, policy_version 966813 (0.00086) [2022-07-11 00:58:22,478][25689] Fps is (10 sec: 5647.7, 60 sec: 5491.7, 300 sec: 5516.6). Total num frames: 990021632. Throughput: 0: 4975.8. Samples: 990018872. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:22,479][25689] Avg episode reward: [(0, '0.698')] [2022-07-11 00:58:23,371][26022] Updated weights on worker 0-0, policy_version 966823 (0.00086) [2022-07-11 00:58:25,070][26022] Updated weights on worker 0-0, policy_version 966833 (0.00099) [2022-07-11 00:58:27,135][26022] Updated weights on worker 0-0, policy_version 966843 (0.00096) [2022-07-11 00:58:27,483][25689] Fps is (10 sec: 5455.1, 60 sec: 5509.3, 300 sec: 5517.4). Total num frames: 990049280. Throughput: 0: 5796.8. Samples: 990052060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:27,483][25689] Avg episode reward: [(0, '0.613')] [2022-07-11 00:58:28,874][26022] Updated weights on worker 0-0, policy_version 966853 (0.00094) [2022-07-11 00:58:30,788][26022] Updated weights on worker 0-0, policy_version 966863 (0.00089) [2022-07-11 00:58:32,543][25689] Fps is (10 sec: 5595.5, 60 sec: 5505.2, 300 sec: 5519.9). Total num frames: 990077952. Throughput: 0: 5764.5. Samples: 990084928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:32,544][26022] Updated weights on worker 0-0, policy_version 966873 (0.00082) [2022-07-11 00:58:32,545][25689] Avg episode reward: [(0, '0.411')] [2022-07-11 00:58:34,503][26022] Updated weights on worker 0-0, policy_version 966883 (0.00085) [2022-07-11 00:58:36,395][26022] Updated weights on worker 0-0, policy_version 966893 (0.00088) [2022-07-11 00:58:37,578][25689] Fps is (10 sec: 5376.0, 60 sec: 5490.5, 300 sec: 5518.4). Total num frames: 990103552. Throughput: 0: 4911.9. Samples: 990101406. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:37,580][25689] Avg episode reward: [(0, '0.480')] [2022-07-11 00:58:38,149][26022] Updated weights on worker 0-0, policy_version 966903 (0.00088) [2022-07-11 00:58:40,031][26022] Updated weights on worker 0-0, policy_version 966913 (0.00090) [2022-07-11 00:58:42,035][26022] Updated weights on worker 0-0, policy_version 966923 (0.00089) [2022-07-11 00:58:42,643][25689] Fps is (10 sec: 5474.9, 60 sec: 5510.1, 300 sec: 5514.2). Total num frames: 990133248. Throughput: 0: 5724.7. Samples: 990134312. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:42,645][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 00:58:43,829][26022] Updated weights on worker 0-0, policy_version 966933 (0.00088) [2022-07-11 00:58:45,643][26022] Updated weights on worker 0-0, policy_version 966943 (0.00092) [2022-07-11 00:58:47,622][26022] Updated weights on worker 0-0, policy_version 966953 (0.00089) [2022-07-11 00:58:47,674][25689] Fps is (10 sec: 5578.3, 60 sec: 5480.3, 300 sec: 5511.0). Total num frames: 990159872. Throughput: 0: 5717.4. Samples: 990167502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:47,675][25689] Avg episode reward: [(0, '0.403')] [2022-07-11 00:58:49,458][26022] Updated weights on worker 0-0, policy_version 966963 (0.00087) [2022-07-11 00:58:51,099][26022] Updated weights on worker 0-0, policy_version 966973 (0.00091) [2022-07-11 00:58:52,711][25689] Fps is (10 sec: 5492.4, 60 sec: 5513.0, 300 sec: 5517.4). Total num frames: 990188544. Throughput: 0: 4929.6. Samples: 990184346. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:52,712][25689] Avg episode reward: [(0, '0.533')] [2022-07-11 00:58:53,040][26022] Updated weights on worker 0-0, policy_version 966983 (0.00088) [2022-07-11 00:58:54,846][26022] Updated weights on worker 0-0, policy_version 966993 (0.00087) [2022-07-11 00:58:56,852][26022] Updated weights on worker 0-0, policy_version 967003 (0.00088) [2022-07-11 00:58:57,734][25689] Fps is (10 sec: 5497.0, 60 sec: 5466.4, 300 sec: 5512.9). Total num frames: 990215168. Throughput: 0: 5757.6. Samples: 990217454. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:58:57,736][25689] Avg episode reward: [(0, '0.593')] [2022-07-11 00:58:58,297][26022] Updated weights on worker 0-0, policy_version 967013 (0.00091) [2022-07-11 00:59:00,490][26022] Updated weights on worker 0-0, policy_version 967023 (0.00085) [2022-07-11 00:59:02,419][26022] Updated weights on worker 0-0, policy_version 967033 (0.00079) [2022-07-11 00:59:02,779][25689] Fps is (10 sec: 5390.8, 60 sec: 5518.8, 300 sec: 5516.6). Total num frames: 990242816. Throughput: 0: 5692.0. Samples: 990248922. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:02,787][25689] Avg episode reward: [(0, '0.514')] [2022-07-11 00:59:04,636][26022] Updated weights on worker 0-0, policy_version 967043 (0.00084) [2022-07-11 00:59:06,107][26022] Updated weights on worker 0-0, policy_version 967053 (0.00094) [2022-07-11 00:59:07,827][25689] Fps is (10 sec: 5377.3, 60 sec: 5498.2, 300 sec: 5512.5). Total num frames: 990269440. Throughput: 0: 4879.7. Samples: 990265838. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:07,827][25689] Avg episode reward: [(0, '1.067')] [2022-07-11 00:59:08,250][26022] Updated weights on worker 0-0, policy_version 967063 (0.00086) [2022-07-11 00:59:09,776][26022] Updated weights on worker 0-0, policy_version 967073 (0.00086) [2022-07-11 00:59:11,896][26022] Updated weights on worker 0-0, policy_version 967083 (0.00093) [2022-07-11 00:59:12,850][25689] Fps is (10 sec: 5591.9, 60 sec: 5532.3, 300 sec: 5519.4). Total num frames: 990299136. Throughput: 0: 5718.2. Samples: 990299506. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:12,851][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 00:59:13,393][26022] Updated weights on worker 0-0, policy_version 967093 (0.00085) [2022-07-11 00:59:15,392][26022] Updated weights on worker 0-0, policy_version 967103 (0.00078) [2022-07-11 00:59:17,161][26022] Updated weights on worker 0-0, policy_version 967113 (0.00086) [2022-07-11 00:59:17,855][25689] Fps is (10 sec: 5717.9, 60 sec: 5500.0, 300 sec: 5520.4). Total num frames: 990326784. Throughput: 0: 5768.7. Samples: 990333530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:17,856][25689] Avg episode reward: [(0, '1.071')] [2022-07-11 00:59:18,988][26022] Updated weights on worker 0-0, policy_version 967123 (0.00095) [2022-07-11 00:59:20,847][26022] Updated weights on worker 0-0, policy_version 967133 (0.00081) [2022-07-11 00:59:21,191][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 00:59:21,200][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000967134_990345216.pth [2022-07-11 00:59:21,200][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000965191_988355584.pth [2022-07-11 00:59:22,747][26022] Updated weights on worker 0-0, policy_version 967143 (0.00083) [2022-07-11 00:59:22,897][25689] Fps is (10 sec: 5605.6, 60 sec: 5525.1, 300 sec: 5520.3). Total num frames: 990355456. Throughput: 0: 5028.6. Samples: 990350092. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:22,898][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 00:59:24,568][26022] Updated weights on worker 0-0, policy_version 967153 (0.00090) [2022-07-11 00:59:26,501][26022] Updated weights on worker 0-0, policy_version 967163 (0.00094) [2022-07-11 00:59:27,924][25689] Fps is (10 sec: 5492.1, 60 sec: 5506.2, 300 sec: 5519.9). Total num frames: 990382080. Throughput: 0: 5849.6. Samples: 990383398. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:27,924][25689] Avg episode reward: [(0, '0.126')] [2022-07-11 00:59:28,216][26022] Updated weights on worker 0-0, policy_version 967173 (0.00105) [2022-07-11 00:59:30,327][26022] Updated weights on worker 0-0, policy_version 967183 (0.00093) [2022-07-11 00:59:32,028][26022] Updated weights on worker 0-0, policy_version 967193 (0.00093) [2022-07-11 00:59:32,948][25689] Fps is (10 sec: 5501.6, 60 sec: 5509.5, 300 sec: 5519.9). Total num frames: 990410752. Throughput: 0: 5804.0. Samples: 990416154. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:32,949][25689] Avg episode reward: [(0, '0.753')] [2022-07-11 00:59:34,003][26022] Updated weights on worker 0-0, policy_version 967203 (0.00091) [2022-07-11 00:59:35,825][26022] Updated weights on worker 0-0, policy_version 967213 (0.00094) [2022-07-11 00:59:37,735][26022] Updated weights on worker 0-0, policy_version 967223 (0.00084) [2022-07-11 00:59:37,957][25689] Fps is (10 sec: 5511.1, 60 sec: 5528.8, 300 sec: 5517.4). Total num frames: 990437376. Throughput: 0: 4938.6. Samples: 990432804. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:37,958][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 00:59:39,640][26022] Updated weights on worker 0-0, policy_version 967233 (0.00092) [2022-07-11 00:59:41,478][26022] Updated weights on worker 0-0, policy_version 967243 (0.00085) [2022-07-11 00:59:43,064][25689] Fps is (10 sec: 5466.3, 60 sec: 5508.0, 300 sec: 5516.7). Total num frames: 990466048. Throughput: 0: 5738.1. Samples: 990465810. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:43,065][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 00:59:43,124][26022] Updated weights on worker 0-0, policy_version 967253 (0.00087) [2022-07-11 00:59:45,155][26022] Updated weights on worker 0-0, policy_version 967263 (0.00087) [2022-07-11 00:59:46,838][26022] Updated weights on worker 0-0, policy_version 967273 (0.00100) [2022-07-11 00:59:48,074][25689] Fps is (10 sec: 5566.8, 60 sec: 5526.9, 300 sec: 5513.6). Total num frames: 990493696. Throughput: 0: 5729.9. Samples: 990498858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:48,076][25689] Avg episode reward: [(0, '1.354')] [2022-07-11 00:59:48,844][26022] Updated weights on worker 0-0, policy_version 967283 (0.00090) [2022-07-11 00:59:50,711][26022] Updated weights on worker 0-0, policy_version 967293 (0.00094) [2022-07-11 00:59:52,484][26022] Updated weights on worker 0-0, policy_version 967303 (0.00087) [2022-07-11 00:59:53,098][25689] Fps is (10 sec: 5510.8, 60 sec: 5511.1, 300 sec: 5513.4). Total num frames: 990521344. Throughput: 0: 4930.3. Samples: 990515498. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:53,099][25689] Avg episode reward: [(0, '1.794')] [2022-07-11 00:59:54,291][26022] Updated weights on worker 0-0, policy_version 967313 (0.00097) [2022-07-11 00:59:56,095][26022] Updated weights on worker 0-0, policy_version 967323 (0.00092) [2022-07-11 00:59:57,927][26022] Updated weights on worker 0-0, policy_version 967333 (0.00092) [2022-07-11 00:59:58,130][25689] Fps is (10 sec: 5499.3, 60 sec: 5527.3, 300 sec: 5513.5). Total num frames: 990548992. Throughput: 0: 5761.0. Samples: 990549016. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 00:59:58,130][25689] Avg episode reward: [(0, '1.897')] [2022-07-11 00:59:59,916][26022] Updated weights on worker 0-0, policy_version 967343 (0.00090) [2022-07-11 01:00:01,854][26022] Updated weights on worker 0-0, policy_version 967353 (0.00096) [2022-07-11 01:00:03,195][25689] Fps is (10 sec: 5273.9, 60 sec: 5491.5, 300 sec: 5512.8). Total num frames: 990574592. Throughput: 0: 5726.1. Samples: 990581080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:03,197][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 01:00:03,968][26022] Updated weights on worker 0-0, policy_version 967363 (0.00095) [2022-07-11 01:00:05,695][26022] Updated weights on worker 0-0, policy_version 967373 (0.00084) [2022-07-11 01:00:07,751][26022] Updated weights on worker 0-0, policy_version 967383 (0.00096) [2022-07-11 01:00:08,208][25689] Fps is (10 sec: 5181.5, 60 sec: 5494.6, 300 sec: 5502.7). Total num frames: 990601216. Throughput: 0: 4856.8. Samples: 990596646. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:08,213][25689] Avg episode reward: [(0, '0.037')] [2022-07-11 01:00:09,444][26022] Updated weights on worker 0-0, policy_version 967393 (0.00090) [2022-07-11 01:00:11,551][26022] Updated weights on worker 0-0, policy_version 967403 (0.00087) [2022-07-11 01:00:13,066][26022] Updated weights on worker 0-0, policy_version 967413 (0.00090) [2022-07-11 01:00:13,236][25689] Fps is (10 sec: 5608.8, 60 sec: 5494.3, 300 sec: 5516.0). Total num frames: 990630912. Throughput: 0: 5686.0. Samples: 990630002. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:13,238][25689] Avg episode reward: [(0, '-1.094')] [2022-07-11 01:00:15,134][26022] Updated weights on worker 0-0, policy_version 967423 (0.00088) [2022-07-11 01:00:16,801][26022] Updated weights on worker 0-0, policy_version 967433 (0.00092) [2022-07-11 01:00:18,244][25689] Fps is (10 sec: 5713.9, 60 sec: 5494.0, 300 sec: 5511.3). Total num frames: 990658560. Throughput: 0: 5694.4. Samples: 990663558. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:18,246][25689] Avg episode reward: [(0, '-0.933')] [2022-07-11 01:00:18,672][26022] Updated weights on worker 0-0, policy_version 967443 (0.00084) [2022-07-11 01:00:20,455][26022] Updated weights on worker 0-0, policy_version 967453 (0.00086) [2022-07-11 01:00:22,427][26022] Updated weights on worker 0-0, policy_version 967463 (0.00102) [2022-07-11 01:00:23,357][25689] Fps is (10 sec: 5463.7, 60 sec: 5470.7, 300 sec: 5513.6). Total num frames: 990686208. Throughput: 0: 4921.8. Samples: 990680312. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:23,357][25689] Avg episode reward: [(0, '-0.921')] [2022-07-11 01:00:24,123][26022] Updated weights on worker 0-0, policy_version 967473 (0.00086) [2022-07-11 01:00:26,167][26022] Updated weights on worker 0-0, policy_version 967483 (0.00096) [2022-07-11 01:00:27,760][26022] Updated weights on worker 0-0, policy_version 967493 (0.00096) [2022-07-11 01:00:28,372][25689] Fps is (10 sec: 5662.0, 60 sec: 5522.5, 300 sec: 5513.7). Total num frames: 990715904. Throughput: 0: 5815.8. Samples: 990713914. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:28,373][25689] Avg episode reward: [(0, '-1.094')] [2022-07-11 01:00:29,880][26022] Updated weights on worker 0-0, policy_version 967503 (0.00087) [2022-07-11 01:00:31,515][26022] Updated weights on worker 0-0, policy_version 967513 (0.00091) [2022-07-11 01:00:33,331][26022] Updated weights on worker 0-0, policy_version 967523 (0.00084) [2022-07-11 01:00:33,429][25689] Fps is (10 sec: 5693.4, 60 sec: 5502.6, 300 sec: 5519.9). Total num frames: 990743552. Throughput: 0: 5807.3. Samples: 990747266. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:33,429][25689] Avg episode reward: [(0, '-0.970')] [2022-07-11 01:00:35,215][26022] Updated weights on worker 0-0, policy_version 967533 (0.00091) [2022-07-11 01:00:36,967][26022] Updated weights on worker 0-0, policy_version 967543 (0.00053) [2022-07-11 01:00:38,472][25689] Fps is (10 sec: 5576.6, 60 sec: 5533.4, 300 sec: 5513.0). Total num frames: 990772224. Throughput: 0: 5822.3. Samples: 990781328. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:38,472][25689] Avg episode reward: [(0, '-0.421')] [2022-07-11 01:00:38,799][26022] Updated weights on worker 0-0, policy_version 967553 (0.00084) [2022-07-11 01:00:40,795][26022] Updated weights on worker 0-0, policy_version 967563 (0.00091) [2022-07-11 01:00:42,510][26022] Updated weights on worker 0-0, policy_version 967573 (0.00092) [2022-07-11 01:00:43,518][25689] Fps is (10 sec: 5581.9, 60 sec: 5521.9, 300 sec: 5519.2). Total num frames: 990799872. Throughput: 0: 5830.9. Samples: 990797874. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:43,519][25689] Avg episode reward: [(0, '0.409')] [2022-07-11 01:00:44,353][26022] Updated weights on worker 0-0, policy_version 967583 (0.00094) [2022-07-11 01:00:46,128][26022] Updated weights on worker 0-0, policy_version 967593 (0.00087) [2022-07-11 01:00:47,966][26022] Updated weights on worker 0-0, policy_version 967603 (0.00083) [2022-07-11 01:00:48,551][25689] Fps is (10 sec: 5587.7, 60 sec: 5536.8, 300 sec: 5518.8). Total num frames: 990828544. Throughput: 0: 5846.1. Samples: 990831882. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:48,552][25689] Avg episode reward: [(0, '0.197')] [2022-07-11 01:00:49,817][26022] Updated weights on worker 0-0, policy_version 967613 (0.00088) [2022-07-11 01:00:51,430][26022] Updated weights on worker 0-0, policy_version 967623 (0.00085) [2022-07-11 01:00:53,331][26022] Updated weights on worker 0-0, policy_version 967633 (0.00739) [2022-07-11 01:00:53,558][25689] Fps is (10 sec: 5712.1, 60 sec: 5555.3, 300 sec: 5522.9). Total num frames: 990857216. Throughput: 0: 5898.0. Samples: 990865986. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:53,558][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 01:00:55,193][26022] Updated weights on worker 0-0, policy_version 967643 (0.00086) [2022-07-11 01:00:56,960][26022] Updated weights on worker 0-0, policy_version 967653 (0.00087) [2022-07-11 01:00:58,623][25689] Fps is (10 sec: 5591.7, 60 sec: 5552.2, 300 sec: 5517.2). Total num frames: 990884864. Throughput: 0: 5041.7. Samples: 990882926. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:00:58,624][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 01:00:58,819][26022] Updated weights on worker 0-0, policy_version 967663 (0.00081) [2022-07-11 01:01:00,475][26022] Updated weights on worker 0-0, policy_version 967673 (0.00083) [2022-07-11 01:01:02,820][26022] Updated weights on worker 0-0, policy_version 967683 (0.00088) [2022-07-11 01:01:03,667][25689] Fps is (10 sec: 5267.0, 60 sec: 5554.1, 300 sec: 5520.3). Total num frames: 990910464. Throughput: 0: 5785.8. Samples: 990914450. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:01:03,668][25689] Avg episode reward: [(0, '0.911')] [2022-07-11 01:01:04,533][26022] Updated weights on worker 0-0, policy_version 967693 (0.00096) [2022-07-11 01:01:06,699][26022] Updated weights on worker 0-0, policy_version 967703 (0.00087) [2022-07-11 01:01:08,348][26022] Updated weights on worker 0-0, policy_version 967713 (0.00085) [2022-07-11 01:01:08,685][25689] Fps is (10 sec: 5495.6, 60 sec: 5604.6, 300 sec: 5527.3). Total num frames: 990940160. Throughput: 0: 5757.4. Samples: 990947802. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 01:01:08,687][25689] Avg episode reward: [(0, '0.792')] [2022-07-11 01:01:10,403][26022] Updated weights on worker 0-0, policy_version 967723 (0.00095) [2022-07-11 01:01:11,987][26022] Updated weights on worker 0-0, policy_version 967733 (0.00082) [2022-07-11 01:01:13,703][25689] Fps is (10 sec: 5612.1, 60 sec: 5554.7, 300 sec: 5520.5). Total num frames: 990966784. Throughput: 0: 4902.8. Samples: 990964754. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:13,704][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 01:01:14,030][26022] Updated weights on worker 0-0, policy_version 967743 (0.00096) [2022-07-11 01:01:15,540][26022] Updated weights on worker 0-0, policy_version 967753 (0.00087) [2022-07-11 01:01:17,572][26022] Updated weights on worker 0-0, policy_version 967763 (0.00094) [2022-07-11 01:01:18,708][25689] Fps is (10 sec: 5516.8, 60 sec: 5571.8, 300 sec: 5528.0). Total num frames: 990995456. Throughput: 0: 5747.2. Samples: 990998360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:18,709][25689] Avg episode reward: [(0, '0.672')] [2022-07-11 01:01:19,363][26022] Updated weights on worker 0-0, policy_version 967773 (0.00093) [2022-07-11 01:01:21,215][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:01:21,226][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000967782_991008768.pth [2022-07-11 01:01:21,226][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000965839_989019136.pth [2022-07-11 01:01:21,280][26022] Updated weights on worker 0-0, policy_version 967783 (0.00097) [2022-07-11 01:01:22,958][26022] Updated weights on worker 0-0, policy_version 967793 (0.00083) [2022-07-11 01:01:23,762][25689] Fps is (10 sec: 5700.7, 60 sec: 5594.2, 300 sec: 5524.1). Total num frames: 991024128. Throughput: 0: 5839.8. Samples: 991031798. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:23,762][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 01:01:24,706][26022] Updated weights on worker 0-0, policy_version 967803 (0.00085) [2022-07-11 01:01:26,569][26022] Updated weights on worker 0-0, policy_version 967813 (0.00082) [2022-07-11 01:01:28,645][26022] Updated weights on worker 0-0, policy_version 967823 (0.00091) [2022-07-11 01:01:28,768][25689] Fps is (10 sec: 5496.6, 60 sec: 5544.2, 300 sec: 5522.3). Total num frames: 991050752. Throughput: 0: 5021.6. Samples: 991048652. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:28,769][25689] Avg episode reward: [(0, '0.485')] [2022-07-11 01:01:30,296][26022] Updated weights on worker 0-0, policy_version 967833 (0.00086) [2022-07-11 01:01:32,175][26022] Updated weights on worker 0-0, policy_version 967843 (0.00088) [2022-07-11 01:01:33,782][25689] Fps is (10 sec: 5518.7, 60 sec: 5565.1, 300 sec: 5523.5). Total num frames: 991079424. Throughput: 0: 5859.4. Samples: 991082404. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:33,786][25689] Avg episode reward: [(0, '-0.277')] [2022-07-11 01:01:33,994][26022] Updated weights on worker 0-0, policy_version 967853 (0.00091) [2022-07-11 01:01:35,751][26022] Updated weights on worker 0-0, policy_version 967863 (0.00089) [2022-07-11 01:01:37,679][26022] Updated weights on worker 0-0, policy_version 967873 (0.00089) [2022-07-11 01:01:38,792][25689] Fps is (10 sec: 5720.7, 60 sec: 5568.1, 300 sec: 5525.5). Total num frames: 991108096. Throughput: 0: 5868.1. Samples: 991116216. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:38,793][25689] Avg episode reward: [(0, '-0.891')] [2022-07-11 01:01:39,439][26022] Updated weights on worker 0-0, policy_version 967883 (0.00079) [2022-07-11 01:01:41,376][26022] Updated weights on worker 0-0, policy_version 967893 (0.00108) [2022-07-11 01:01:43,244][26022] Updated weights on worker 0-0, policy_version 967903 (0.00088) [2022-07-11 01:01:43,834][25689] Fps is (10 sec: 5602.4, 60 sec: 5568.5, 300 sec: 5522.9). Total num frames: 991135744. Throughput: 0: 5023.7. Samples: 991132636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:43,835][25689] Avg episode reward: [(0, '-1.139')] [2022-07-11 01:01:44,941][26022] Updated weights on worker 0-0, policy_version 967913 (0.00086) [2022-07-11 01:01:46,922][26022] Updated weights on worker 0-0, policy_version 967923 (0.00087) [2022-07-11 01:01:48,826][26022] Updated weights on worker 0-0, policy_version 967933 (0.00088) [2022-07-11 01:01:48,850][25689] Fps is (10 sec: 5497.7, 60 sec: 5553.1, 300 sec: 5519.9). Total num frames: 991163392. Throughput: 0: 5849.1. Samples: 991166114. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:48,851][25689] Avg episode reward: [(0, '-0.923')] [2022-07-11 01:01:50,335][26022] Updated weights on worker 0-0, policy_version 967943 (0.00060) [2022-07-11 01:01:52,450][26022] Updated weights on worker 0-0, policy_version 967953 (0.00095) [2022-07-11 01:01:53,883][25689] Fps is (10 sec: 5604.8, 60 sec: 5550.7, 300 sec: 5524.1). Total num frames: 991192064. Throughput: 0: 5834.0. Samples: 991199674. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:53,883][25689] Avg episode reward: [(0, '-0.877')] [2022-07-11 01:01:54,255][26022] Updated weights on worker 0-0, policy_version 967963 (0.00084) [2022-07-11 01:01:56,024][26022] Updated weights on worker 0-0, policy_version 967973 (0.00096) [2022-07-11 01:01:57,817][26022] Updated weights on worker 0-0, policy_version 967983 (0.00079) [2022-07-11 01:01:58,902][25689] Fps is (10 sec: 5500.9, 60 sec: 5538.0, 300 sec: 5514.7). Total num frames: 991218688. Throughput: 0: 4987.1. Samples: 991216508. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:01:58,904][25689] Avg episode reward: [(0, '-1.039')] [2022-07-11 01:01:59,583][26022] Updated weights on worker 0-0, policy_version 967993 (0.00087) [2022-07-11 01:02:01,761][26022] Updated weights on worker 0-0, policy_version 968003 (0.00093) [2022-07-11 01:02:03,600][26022] Updated weights on worker 0-0, policy_version 968013 (0.00085) [2022-07-11 01:02:03,983][25689] Fps is (10 sec: 5474.9, 60 sec: 5585.6, 300 sec: 5527.8). Total num frames: 991247360. Throughput: 0: 5736.1. Samples: 991248210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:03,983][25689] Avg episode reward: [(0, '-0.222')] [2022-07-11 01:02:05,604][26022] Updated weights on worker 0-0, policy_version 968023 (0.00093) [2022-07-11 01:02:07,203][26022] Updated weights on worker 0-0, policy_version 968033 (0.00085) [2022-07-11 01:02:09,023][25689] Fps is (10 sec: 5463.5, 60 sec: 5532.5, 300 sec: 5525.3). Total num frames: 991273984. Throughput: 0: 5738.6. Samples: 991281880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:09,024][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 01:02:09,229][26022] Updated weights on worker 0-0, policy_version 968043 (0.00393) [2022-07-11 01:02:10,897][26022] Updated weights on worker 0-0, policy_version 968053 (0.00087) [2022-07-11 01:02:12,941][26022] Updated weights on worker 0-0, policy_version 968063 (0.00090) [2022-07-11 01:02:14,039][25689] Fps is (10 sec: 5600.7, 60 sec: 5583.7, 300 sec: 5532.0). Total num frames: 991303680. Throughput: 0: 4917.0. Samples: 991298780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:14,040][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 01:02:14,544][26022] Updated weights on worker 0-0, policy_version 968073 (0.00089) [2022-07-11 01:02:16,524][26022] Updated weights on worker 0-0, policy_version 968083 (0.00114) [2022-07-11 01:02:18,385][26022] Updated weights on worker 0-0, policy_version 968093 (0.00088) [2022-07-11 01:02:19,056][25689] Fps is (10 sec: 5613.9, 60 sec: 5548.7, 300 sec: 5523.5). Total num frames: 991330304. Throughput: 0: 5745.9. Samples: 991332306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:19,057][25689] Avg episode reward: [(0, '-0.237')] [2022-07-11 01:02:20,277][26022] Updated weights on worker 0-0, policy_version 968103 (0.00091) [2022-07-11 01:02:22,087][26022] Updated weights on worker 0-0, policy_version 968113 (0.00088) [2022-07-11 01:02:23,808][26022] Updated weights on worker 0-0, policy_version 968123 (0.00088) [2022-07-11 01:02:24,104][25689] Fps is (10 sec: 5595.5, 60 sec: 5566.1, 300 sec: 5533.2). Total num frames: 991360000. Throughput: 0: 5853.3. Samples: 991365984. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:24,104][25689] Avg episode reward: [(0, '-0.248')] [2022-07-11 01:02:25,816][26022] Updated weights on worker 0-0, policy_version 968133 (0.00091) [2022-07-11 01:02:27,485][26022] Updated weights on worker 0-0, policy_version 968143 (0.00089) [2022-07-11 01:02:29,139][25689] Fps is (10 sec: 5484.1, 60 sec: 5546.6, 300 sec: 5522.5). Total num frames: 991385600. Throughput: 0: 5013.0. Samples: 991382718. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:29,139][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 01:02:29,423][26022] Updated weights on worker 0-0, policy_version 968153 (0.00087) [2022-07-11 01:02:31,067][26022] Updated weights on worker 0-0, policy_version 968163 (0.00099) [2022-07-11 01:02:33,053][26022] Updated weights on worker 0-0, policy_version 968173 (0.00093) [2022-07-11 01:02:34,146][25689] Fps is (10 sec: 5506.2, 60 sec: 5564.0, 300 sec: 5533.8). Total num frames: 991415296. Throughput: 0: 5840.9. Samples: 991416228. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:34,147][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 01:02:34,916][26022] Updated weights on worker 0-0, policy_version 968183 (0.00086) [2022-07-11 01:02:36,672][26022] Updated weights on worker 0-0, policy_version 968193 (0.00092) [2022-07-11 01:02:38,667][26022] Updated weights on worker 0-0, policy_version 968203 (0.00089) [2022-07-11 01:02:39,176][25689] Fps is (10 sec: 5712.8, 60 sec: 5545.3, 300 sec: 5531.6). Total num frames: 991442944. Throughput: 0: 5826.9. Samples: 991449548. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:39,177][25689] Avg episode reward: [(0, '0.225')] [2022-07-11 01:02:40,486][26022] Updated weights on worker 0-0, policy_version 968213 (0.00086) [2022-07-11 01:02:42,270][26022] Updated weights on worker 0-0, policy_version 968223 (0.00094) [2022-07-11 01:02:44,234][25689] Fps is (10 sec: 5380.0, 60 sec: 5526.9, 300 sec: 5525.0). Total num frames: 991469568. Throughput: 0: 4972.5. Samples: 991466082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:44,236][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 01:02:44,367][26022] Updated weights on worker 0-0, policy_version 968233 (0.00090) [2022-07-11 01:02:45,948][26022] Updated weights on worker 0-0, policy_version 968243 (0.00085) [2022-07-11 01:02:47,975][26022] Updated weights on worker 0-0, policy_version 968253 (0.00089) [2022-07-11 01:02:49,250][25689] Fps is (10 sec: 5489.1, 60 sec: 5543.9, 300 sec: 5532.0). Total num frames: 991498240. Throughput: 0: 5790.9. Samples: 991499182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:49,251][25689] Avg episode reward: [(0, '1.128')] [2022-07-11 01:02:49,761][26022] Updated weights on worker 0-0, policy_version 968263 (0.00093) [2022-07-11 01:02:51,446][26022] Updated weights on worker 0-0, policy_version 968273 (0.00093) [2022-07-11 01:02:53,311][26022] Updated weights on worker 0-0, policy_version 968283 (0.00085) [2022-07-11 01:02:54,291][25689] Fps is (10 sec: 5702.2, 60 sec: 5543.1, 300 sec: 5529.1). Total num frames: 991526912. Throughput: 0: 5782.8. Samples: 991532718. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:54,291][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 01:02:55,519][26022] Updated weights on worker 0-0, policy_version 968293 (0.00098) [2022-07-11 01:02:56,874][26022] Updated weights on worker 0-0, policy_version 968303 (0.00087) [2022-07-11 01:02:59,109][26022] Updated weights on worker 0-0, policy_version 968313 (0.00085) [2022-07-11 01:02:59,306][25689] Fps is (10 sec: 5498.7, 60 sec: 5543.5, 300 sec: 5536.9). Total num frames: 991553536. Throughput: 0: 4971.5. Samples: 991549624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:02:59,307][25689] Avg episode reward: [(0, '1.236')] [2022-07-11 01:03:00,587][26022] Updated weights on worker 0-0, policy_version 968323 (0.00091) [2022-07-11 01:03:02,951][26022] Updated weights on worker 0-0, policy_version 968333 (0.00083) [2022-07-11 01:03:04,378][25689] Fps is (10 sec: 5380.1, 60 sec: 5527.3, 300 sec: 5535.7). Total num frames: 991581184. Throughput: 0: 5718.0. Samples: 991581266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:04,379][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 01:03:04,594][26022] Updated weights on worker 0-0, policy_version 968343 (0.00083) [2022-07-11 01:03:06,465][26022] Updated weights on worker 0-0, policy_version 968353 (0.00080) [2022-07-11 01:03:08,199][26022] Updated weights on worker 0-0, policy_version 968363 (0.00098) [2022-07-11 01:03:09,400][25689] Fps is (10 sec: 5579.7, 60 sec: 5562.9, 300 sec: 5539.2). Total num frames: 991609856. Throughput: 0: 5741.0. Samples: 991614864. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:09,401][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 01:03:10,256][26022] Updated weights on worker 0-0, policy_version 968373 (0.00090) [2022-07-11 01:03:11,979][26022] Updated weights on worker 0-0, policy_version 968383 (0.00088) [2022-07-11 01:03:13,818][26022] Updated weights on worker 0-0, policy_version 968393 (0.00091) [2022-07-11 01:03:14,419][25689] Fps is (10 sec: 5507.3, 60 sec: 5511.8, 300 sec: 5528.9). Total num frames: 991636480. Throughput: 0: 4916.8. Samples: 991631684. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:14,420][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 01:03:15,401][26022] Updated weights on worker 0-0, policy_version 968403 (0.00088) [2022-07-11 01:03:17,513][26022] Updated weights on worker 0-0, policy_version 968413 (0.00088) [2022-07-11 01:03:19,239][26022] Updated weights on worker 0-0, policy_version 968423 (0.00084) [2022-07-11 01:03:19,423][25689] Fps is (10 sec: 5721.4, 60 sec: 5580.8, 300 sec: 5541.7). Total num frames: 991667200. Throughput: 0: 5774.0. Samples: 991665778. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:19,423][25689] Avg episode reward: [(0, '0.191')] [2022-07-11 01:03:21,171][26022] Updated weights on worker 0-0, policy_version 968433 (0.00093) [2022-07-11 01:03:21,275][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:03:21,286][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000968434_991676416.pth [2022-07-11 01:03:21,286][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000966487_989682688.pth [2022-07-11 01:03:22,791][26022] Updated weights on worker 0-0, policy_version 968443 (0.00083) [2022-07-11 01:03:24,528][25689] Fps is (10 sec: 5672.6, 60 sec: 5524.7, 300 sec: 5536.4). Total num frames: 991693824. Throughput: 0: 5857.3. Samples: 991699288. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:24,530][25689] Avg episode reward: [(0, '0.047')] [2022-07-11 01:03:24,810][26022] Updated weights on worker 0-0, policy_version 968453 (0.00088) [2022-07-11 01:03:26,683][26022] Updated weights on worker 0-0, policy_version 968463 (0.00087) [2022-07-11 01:03:28,543][26022] Updated weights on worker 0-0, policy_version 968473 (0.00086) [2022-07-11 01:03:29,550][25689] Fps is (10 sec: 5359.3, 60 sec: 5559.8, 300 sec: 5533.6). Total num frames: 991721472. Throughput: 0: 5010.1. Samples: 991715814. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:29,550][25689] Avg episode reward: [(0, '0.118')] [2022-07-11 01:03:30,255][26022] Updated weights on worker 0-0, policy_version 968483 (0.00085) [2022-07-11 01:03:32,083][26022] Updated weights on worker 0-0, policy_version 968493 (0.00083) [2022-07-11 01:03:33,988][26022] Updated weights on worker 0-0, policy_version 968503 (0.00093) [2022-07-11 01:03:34,575][25689] Fps is (10 sec: 5401.4, 60 sec: 5507.3, 300 sec: 5537.2). Total num frames: 991748096. Throughput: 0: 5825.2. Samples: 991749100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:34,576][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 01:03:35,651][26022] Updated weights on worker 0-0, policy_version 968513 (0.00084) [2022-07-11 01:03:37,879][26022] Updated weights on worker 0-0, policy_version 968523 (0.00094) [2022-07-11 01:03:39,568][26022] Updated weights on worker 0-0, policy_version 968533 (0.00056) [2022-07-11 01:03:39,606][25689] Fps is (10 sec: 5600.6, 60 sec: 5541.2, 300 sec: 5537.9). Total num frames: 991777792. Throughput: 0: 5792.3. Samples: 991782682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:39,606][25689] Avg episode reward: [(0, '0.311')] [2022-07-11 01:03:41,448][26022] Updated weights on worker 0-0, policy_version 968543 (0.00087) [2022-07-11 01:03:43,418][26022] Updated weights on worker 0-0, policy_version 968553 (0.00086) [2022-07-11 01:03:44,694][25689] Fps is (10 sec: 5768.1, 60 sec: 5572.2, 300 sec: 5543.7). Total num frames: 991806464. Throughput: 0: 4971.4. Samples: 991799544. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:44,695][25689] Avg episode reward: [(0, '0.194')] [2022-07-11 01:03:45,058][26022] Updated weights on worker 0-0, policy_version 968563 (0.00112) [2022-07-11 01:03:47,034][26022] Updated weights on worker 0-0, policy_version 968573 (0.00083) [2022-07-11 01:03:48,626][26022] Updated weights on worker 0-0, policy_version 968583 (0.00089) [2022-07-11 01:03:49,730][25689] Fps is (10 sec: 5562.7, 60 sec: 5553.4, 300 sec: 5540.3). Total num frames: 991834112. Throughput: 0: 5813.1. Samples: 991833126. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:49,733][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 01:03:50,532][26022] Updated weights on worker 0-0, policy_version 968593 (0.00089) [2022-07-11 01:03:52,355][26022] Updated weights on worker 0-0, policy_version 968603 (0.00082) [2022-07-11 01:03:54,260][26022] Updated weights on worker 0-0, policy_version 968613 (0.00085) [2022-07-11 01:03:54,763][25689] Fps is (10 sec: 5593.7, 60 sec: 5554.2, 300 sec: 5547.0). Total num frames: 991862784. Throughput: 0: 5817.3. Samples: 991866536. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:54,765][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 01:03:55,949][26022] Updated weights on worker 0-0, policy_version 968623 (0.00092) [2022-07-11 01:03:57,893][26022] Updated weights on worker 0-0, policy_version 968633 (0.00093) [2022-07-11 01:03:59,647][26022] Updated weights on worker 0-0, policy_version 968643 (0.00080) [2022-07-11 01:03:59,766][25689] Fps is (10 sec: 5612.2, 60 sec: 5572.3, 300 sec: 5547.8). Total num frames: 991890432. Throughput: 0: 4989.4. Samples: 991883268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:03:59,770][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 01:04:01,967][26022] Updated weights on worker 0-0, policy_version 968653 (0.00094) [2022-07-11 01:04:03,797][26022] Updated weights on worker 0-0, policy_version 968663 (0.00096) [2022-07-11 01:04:04,881][25689] Fps is (10 sec: 5262.8, 60 sec: 5534.5, 300 sec: 5543.1). Total num frames: 991916032. Throughput: 0: 5693.7. Samples: 991914478. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:04,881][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 01:04:05,675][26022] Updated weights on worker 0-0, policy_version 968673 (0.00087) [2022-07-11 01:04:07,616][26022] Updated weights on worker 0-0, policy_version 968683 (0.00090) [2022-07-11 01:04:09,084][26022] Updated weights on worker 0-0, policy_version 968693 (0.00086) [2022-07-11 01:04:09,906][25689] Fps is (10 sec: 5352.3, 60 sec: 5534.2, 300 sec: 5539.6). Total num frames: 991944704. Throughput: 0: 5698.5. Samples: 991948094. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:09,906][25689] Avg episode reward: [(0, '0.964')] [2022-07-11 01:04:11,238][26022] Updated weights on worker 0-0, policy_version 968703 (0.00091) [2022-07-11 01:04:12,879][26022] Updated weights on worker 0-0, policy_version 968713 (0.00085) [2022-07-11 01:04:14,924][25689] Fps is (10 sec: 5505.9, 60 sec: 5534.2, 300 sec: 5535.9). Total num frames: 991971328. Throughput: 0: 5703.4. Samples: 991981522. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:14,924][25689] Avg episode reward: [(0, '1.070')] [2022-07-11 01:04:14,935][26022] Updated weights on worker 0-0, policy_version 968723 (0.00086) [2022-07-11 01:04:16,497][26022] Updated weights on worker 0-0, policy_version 968733 (0.00088) [2022-07-11 01:04:18,784][26022] Updated weights on worker 0-0, policy_version 968743 (0.00088) [2022-07-11 01:04:19,931][25689] Fps is (10 sec: 5515.5, 60 sec: 5500.1, 300 sec: 5536.6). Total num frames: 992000000. Throughput: 0: 5704.9. Samples: 991998310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:19,932][25689] Avg episode reward: [(0, '1.101')] [2022-07-11 01:04:20,183][26022] Updated weights on worker 0-0, policy_version 968753 (0.00088) [2022-07-11 01:04:22,318][26022] Updated weights on worker 0-0, policy_version 968763 (0.00084) [2022-07-11 01:04:23,754][26022] Updated weights on worker 0-0, policy_version 968773 (0.00084) [2022-07-11 01:04:25,004][25689] Fps is (10 sec: 5688.5, 60 sec: 5536.8, 300 sec: 5542.6). Total num frames: 992028672. Throughput: 0: 5841.7. Samples: 992032034. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:25,005][25689] Avg episode reward: [(0, '0.994')] [2022-07-11 01:04:25,984][26022] Updated weights on worker 0-0, policy_version 968783 (0.00092) [2022-07-11 01:04:27,655][26022] Updated weights on worker 0-0, policy_version 968793 (0.00089) [2022-07-11 01:04:29,574][26022] Updated weights on worker 0-0, policy_version 968803 (0.00086) [2022-07-11 01:04:30,022][25689] Fps is (10 sec: 5682.8, 60 sec: 5554.1, 300 sec: 5542.7). Total num frames: 992057344. Throughput: 0: 5861.4. Samples: 992066004. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:30,023][25689] Avg episode reward: [(0, '1.001')] [2022-07-11 01:04:31,220][26022] Updated weights on worker 0-0, policy_version 968813 (0.00089) [2022-07-11 01:04:33,231][26022] Updated weights on worker 0-0, policy_version 968823 (0.00098) [2022-07-11 01:04:34,904][26022] Updated weights on worker 0-0, policy_version 968833 (0.00088) [2022-07-11 01:04:35,095][25689] Fps is (10 sec: 5683.0, 60 sec: 5583.7, 300 sec: 5548.4). Total num frames: 992086016. Throughput: 0: 5011.2. Samples: 992082604. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:35,095][25689] Avg episode reward: [(0, '1.013')] [2022-07-11 01:04:36,819][26022] Updated weights on worker 0-0, policy_version 968843 (0.00086) [2022-07-11 01:04:38,546][26022] Updated weights on worker 0-0, policy_version 968853 (0.00092) [2022-07-11 01:04:40,102][25689] Fps is (10 sec: 5485.8, 60 sec: 5535.0, 300 sec: 5543.4). Total num frames: 992112640. Throughput: 0: 5820.2. Samples: 992115708. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:40,104][25689] Avg episode reward: [(0, '0.611')] [2022-07-11 01:04:40,428][26022] Updated weights on worker 0-0, policy_version 968863 (0.00087) [2022-07-11 01:04:42,338][26022] Updated weights on worker 0-0, policy_version 968873 (0.00085) [2022-07-11 01:04:44,088][26022] Updated weights on worker 0-0, policy_version 968883 (0.00086) [2022-07-11 01:04:45,172][25689] Fps is (10 sec: 5487.1, 60 sec: 5536.7, 300 sec: 5545.7). Total num frames: 992141312. Throughput: 0: 5818.8. Samples: 992149388. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:45,174][25689] Avg episode reward: [(0, '0.121')] [2022-07-11 01:04:46,007][26022] Updated weights on worker 0-0, policy_version 968893 (0.00128) [2022-07-11 01:04:47,681][26022] Updated weights on worker 0-0, policy_version 968903 (0.00082) [2022-07-11 01:04:49,762][26022] Updated weights on worker 0-0, policy_version 968913 (0.00092) [2022-07-11 01:04:50,184][25689] Fps is (10 sec: 5789.1, 60 sec: 5572.8, 300 sec: 5552.8). Total num frames: 992171008. Throughput: 0: 4968.7. Samples: 992166184. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:50,186][25689] Avg episode reward: [(0, '0.264')] [2022-07-11 01:04:51,578][26022] Updated weights on worker 0-0, policy_version 968923 (0.00089) [2022-07-11 01:04:53,269][26022] Updated weights on worker 0-0, policy_version 968933 (0.00087) [2022-07-11 01:04:55,230][25689] Fps is (10 sec: 5599.9, 60 sec: 5537.7, 300 sec: 5549.1). Total num frames: 992197632. Throughput: 0: 5816.2. Samples: 992199712. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:04:55,231][25689] Avg episode reward: [(0, '0.180')] [2022-07-11 01:04:55,234][26022] Updated weights on worker 0-0, policy_version 968943 (0.00083) [2022-07-11 01:04:56,831][26022] Updated weights on worker 0-0, policy_version 968953 (0.00083) [2022-07-11 01:04:58,785][26022] Updated weights on worker 0-0, policy_version 968963 (0.00096) [2022-07-11 01:05:00,281][25689] Fps is (10 sec: 5476.4, 60 sec: 5550.1, 300 sec: 5559.7). Total num frames: 992226304. Throughput: 0: 5830.8. Samples: 992233370. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:05:00,282][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 01:05:00,510][26022] Updated weights on worker 0-0, policy_version 968973 (0.00081) [2022-07-11 01:05:02,839][26022] Updated weights on worker 0-0, policy_version 968983 (0.00085) [2022-07-11 01:05:04,446][26022] Updated weights on worker 0-0, policy_version 968993 (0.00078) [2022-07-11 01:05:05,392][25689] Fps is (10 sec: 5340.5, 60 sec: 5550.6, 300 sec: 5554.4). Total num frames: 992251904. Throughput: 0: 4885.1. Samples: 992248164. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 01:05:05,393][25689] Avg episode reward: [(0, '0.394')] [2022-07-11 01:05:06,266][26022] Updated weights on worker 0-0, policy_version 969003 (0.00088) [2022-07-11 01:05:08,444][26022] Updated weights on worker 0-0, policy_version 969013 (0.00090) [2022-07-11 01:05:10,039][26022] Updated weights on worker 0-0, policy_version 969023 (0.00085) [2022-07-11 01:05:10,470][25689] Fps is (10 sec: 5427.5, 60 sec: 5562.6, 300 sec: 5553.5). Total num frames: 992281600. Throughput: 0: 5717.3. Samples: 992282162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:10,471][25689] Avg episode reward: [(0, '1.114')] [2022-07-11 01:05:11,941][26022] Updated weights on worker 0-0, policy_version 969033 (0.00084) [2022-07-11 01:05:13,623][26022] Updated weights on worker 0-0, policy_version 969043 (0.00053) [2022-07-11 01:05:15,558][25689] Fps is (10 sec: 5640.9, 60 sec: 5573.1, 300 sec: 5552.0). Total num frames: 992309248. Throughput: 0: 5715.2. Samples: 992315892. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:15,559][25689] Avg episode reward: [(0, '1.756')] [2022-07-11 01:05:15,575][26022] Updated weights on worker 0-0, policy_version 969053 (0.00091) [2022-07-11 01:05:17,503][26022] Updated weights on worker 0-0, policy_version 969063 (0.00083) [2022-07-11 01:05:19,052][26022] Updated weights on worker 0-0, policy_version 969073 (0.00092) [2022-07-11 01:05:20,576][25689] Fps is (10 sec: 5471.5, 60 sec: 5555.2, 300 sec: 5553.8). Total num frames: 992336896. Throughput: 0: 4906.4. Samples: 992332942. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:20,578][25689] Avg episode reward: [(0, '1.980')] [2022-07-11 01:05:21,097][26022] Updated weights on worker 0-0, policy_version 969083 (0.00092) [2022-07-11 01:05:21,358][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:05:21,374][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000969086_992344064.pth [2022-07-11 01:05:21,375][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000967134_990345216.pth [2022-07-11 01:05:22,867][26022] Updated weights on worker 0-0, policy_version 969093 (0.00095) [2022-07-11 01:05:24,482][26022] Updated weights on worker 0-0, policy_version 969103 (0.00086) [2022-07-11 01:05:25,676][25689] Fps is (10 sec: 5768.8, 60 sec: 5586.6, 300 sec: 5555.6). Total num frames: 992367616. Throughput: 0: 5835.2. Samples: 992366524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:25,676][25689] Avg episode reward: [(0, '1.773')] [2022-07-11 01:05:26,719][26022] Updated weights on worker 0-0, policy_version 969113 (0.00093) [2022-07-11 01:05:28,382][26022] Updated weights on worker 0-0, policy_version 969123 (0.00081) [2022-07-11 01:05:30,225][26022] Updated weights on worker 0-0, policy_version 969133 (0.00086) [2022-07-11 01:05:30,684][25689] Fps is (10 sec: 5774.2, 60 sec: 5570.5, 300 sec: 5556.5). Total num frames: 992395264. Throughput: 0: 5820.5. Samples: 992399820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:30,685][25689] Avg episode reward: [(0, '1.318')] [2022-07-11 01:05:32,114][26022] Updated weights on worker 0-0, policy_version 969143 (0.00614) [2022-07-11 01:05:33,579][26022] Updated weights on worker 0-0, policy_version 969153 (0.00080) [2022-07-11 01:05:35,701][25689] Fps is (10 sec: 5413.3, 60 sec: 5541.8, 300 sec: 5550.1). Total num frames: 992421888. Throughput: 0: 5011.2. Samples: 992416836. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:35,702][25689] Avg episode reward: [(0, '1.477')] [2022-07-11 01:05:35,762][26022] Updated weights on worker 0-0, policy_version 969163 (0.00055) [2022-07-11 01:05:37,355][26022] Updated weights on worker 0-0, policy_version 969173 (0.00084) [2022-07-11 01:05:39,435][26022] Updated weights on worker 0-0, policy_version 969183 (0.00094) [2022-07-11 01:05:40,713][25689] Fps is (10 sec: 5513.8, 60 sec: 5575.2, 300 sec: 5554.2). Total num frames: 992450560. Throughput: 0: 5834.7. Samples: 992450434. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:40,713][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 01:05:40,998][26022] Updated weights on worker 0-0, policy_version 969193 (0.00091) [2022-07-11 01:05:43,110][26022] Updated weights on worker 0-0, policy_version 969203 (0.00090) [2022-07-11 01:05:44,775][26022] Updated weights on worker 0-0, policy_version 969213 (0.00094) [2022-07-11 01:05:45,754][25689] Fps is (10 sec: 5602.5, 60 sec: 5561.0, 300 sec: 5550.6). Total num frames: 992478208. Throughput: 0: 5847.9. Samples: 992483940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:45,754][25689] Avg episode reward: [(0, '-0.410')] [2022-07-11 01:05:46,792][26022] Updated weights on worker 0-0, policy_version 969223 (0.00091) [2022-07-11 01:05:48,398][26022] Updated weights on worker 0-0, policy_version 969233 (0.00092) [2022-07-11 01:05:50,380][26022] Updated weights on worker 0-0, policy_version 969243 (0.00085) [2022-07-11 01:05:50,762][25689] Fps is (10 sec: 5502.8, 60 sec: 5527.6, 300 sec: 5547.2). Total num frames: 992505856. Throughput: 0: 5033.0. Samples: 992500870. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:50,762][25689] Avg episode reward: [(0, '-0.315')] [2022-07-11 01:05:52,301][26022] Updated weights on worker 0-0, policy_version 969253 (0.00262) [2022-07-11 01:05:54,029][26022] Updated weights on worker 0-0, policy_version 969263 (0.00085) [2022-07-11 01:05:55,769][25689] Fps is (10 sec: 5521.2, 60 sec: 5548.0, 300 sec: 5548.2). Total num frames: 992533504. Throughput: 0: 5865.6. Samples: 992534546. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:05:55,770][25689] Avg episode reward: [(0, '-0.074')] [2022-07-11 01:05:55,928][26022] Updated weights on worker 0-0, policy_version 969273 (0.00086) [2022-07-11 01:05:57,603][26022] Updated weights on worker 0-0, policy_version 969283 (0.00088) [2022-07-11 01:05:59,588][26022] Updated weights on worker 0-0, policy_version 969293 (0.00090) [2022-07-11 01:06:00,774][25689] Fps is (10 sec: 5727.1, 60 sec: 5569.2, 300 sec: 5562.8). Total num frames: 992563200. Throughput: 0: 5885.4. Samples: 992568504. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:00,775][25689] Avg episode reward: [(0, '0.433')] [2022-07-11 01:06:01,163][26022] Updated weights on worker 0-0, policy_version 969303 (0.00097) [2022-07-11 01:06:03,595][26022] Updated weights on worker 0-0, policy_version 969313 (0.00086) [2022-07-11 01:06:05,505][26022] Updated weights on worker 0-0, policy_version 969323 (0.00085) [2022-07-11 01:06:05,821][25689] Fps is (10 sec: 5501.2, 60 sec: 5575.1, 300 sec: 5548.4). Total num frames: 992588800. Throughput: 0: 4949.6. Samples: 992583264. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:05,821][25689] Avg episode reward: [(0, '0.387')] [2022-07-11 01:06:07,159][26022] Updated weights on worker 0-0, policy_version 969333 (0.00085) [2022-07-11 01:06:09,123][26022] Updated weights on worker 0-0, policy_version 969343 (0.00092) [2022-07-11 01:06:10,809][26022] Updated weights on worker 0-0, policy_version 969353 (0.00622) [2022-07-11 01:06:10,901][25689] Fps is (10 sec: 5359.5, 60 sec: 5557.9, 300 sec: 5554.2). Total num frames: 992617472. Throughput: 0: 5762.0. Samples: 992616912. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:10,901][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 01:06:12,623][26022] Updated weights on worker 0-0, policy_version 969363 (0.00087) [2022-07-11 01:06:14,568][26022] Updated weights on worker 0-0, policy_version 969373 (0.00080) [2022-07-11 01:06:15,919][25689] Fps is (10 sec: 5577.4, 60 sec: 5564.4, 300 sec: 5550.5). Total num frames: 992645120. Throughput: 0: 5754.0. Samples: 992650486. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:15,920][25689] Avg episode reward: [(0, '1.240')] [2022-07-11 01:06:16,382][26022] Updated weights on worker 0-0, policy_version 969383 (0.00089) [2022-07-11 01:06:18,231][26022] Updated weights on worker 0-0, policy_version 969393 (0.00085) [2022-07-11 01:06:20,217][26022] Updated weights on worker 0-0, policy_version 969403 (0.00089) [2022-07-11 01:06:20,923][25689] Fps is (10 sec: 5517.2, 60 sec: 5565.7, 300 sec: 5548.0). Total num frames: 992672768. Throughput: 0: 4907.5. Samples: 992667384. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:20,924][25689] Avg episode reward: [(0, '0.669')] [2022-07-11 01:06:21,898][26022] Updated weights on worker 0-0, policy_version 969413 (0.00087) [2022-07-11 01:06:24,019][26022] Updated weights on worker 0-0, policy_version 969423 (0.00096) [2022-07-11 01:06:25,208][26022] Updated weights on worker 0-0, policy_version 969433 (0.00082) [2022-07-11 01:06:26,021][25689] Fps is (10 sec: 5575.1, 60 sec: 5531.9, 300 sec: 5553.2). Total num frames: 992701440. Throughput: 0: 5820.0. Samples: 992700828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:26,021][25689] Avg episode reward: [(0, '0.431')] [2022-07-11 01:06:27,553][26022] Updated weights on worker 0-0, policy_version 969443 (0.00085) [2022-07-11 01:06:29,267][26022] Updated weights on worker 0-0, policy_version 969453 (0.00091) [2022-07-11 01:06:30,965][26022] Updated weights on worker 0-0, policy_version 969463 (0.00097) [2022-07-11 01:06:31,063][25689] Fps is (10 sec: 5655.0, 60 sec: 5545.7, 300 sec: 5552.6). Total num frames: 992730112. Throughput: 0: 5810.9. Samples: 992734076. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:31,064][25689] Avg episode reward: [(0, '0.494')] [2022-07-11 01:06:33,148][26022] Updated weights on worker 0-0, policy_version 969473 (0.00088) [2022-07-11 01:06:34,590][26022] Updated weights on worker 0-0, policy_version 969483 (0.00092) [2022-07-11 01:06:36,125][25689] Fps is (10 sec: 5472.6, 60 sec: 5541.7, 300 sec: 5544.8). Total num frames: 992756736. Throughput: 0: 5799.9. Samples: 992767680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:36,127][25689] Avg episode reward: [(0, '0.383')] [2022-07-11 01:06:36,714][26022] Updated weights on worker 0-0, policy_version 969493 (0.00092) [2022-07-11 01:06:38,445][26022] Updated weights on worker 0-0, policy_version 969503 (0.00103) [2022-07-11 01:06:40,256][26022] Updated weights on worker 0-0, policy_version 969513 (0.00089) [2022-07-11 01:06:41,164][25689] Fps is (10 sec: 5677.5, 60 sec: 5573.1, 300 sec: 5555.2). Total num frames: 992787456. Throughput: 0: 5787.1. Samples: 992784518. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:41,164][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 01:06:42,229][26022] Updated weights on worker 0-0, policy_version 969523 (0.00089) [2022-07-11 01:06:43,904][26022] Updated weights on worker 0-0, policy_version 969533 (0.00090) [2022-07-11 01:06:45,708][26022] Updated weights on worker 0-0, policy_version 969543 (0.00094) [2022-07-11 01:06:46,259][25689] Fps is (10 sec: 5658.8, 60 sec: 5551.2, 300 sec: 5550.2). Total num frames: 992814080. Throughput: 0: 5788.0. Samples: 992817964. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:46,259][25689] Avg episode reward: [(0, '0.193')] [2022-07-11 01:06:47,692][26022] Updated weights on worker 0-0, policy_version 969553 (0.00091) [2022-07-11 01:06:49,255][26022] Updated weights on worker 0-0, policy_version 969563 (0.00093) [2022-07-11 01:06:51,261][25689] Fps is (10 sec: 5375.0, 60 sec: 5551.7, 300 sec: 5547.4). Total num frames: 992841728. Throughput: 0: 5813.0. Samples: 992851484. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:51,261][25689] Avg episode reward: [(0, '-0.086')] [2022-07-11 01:06:51,420][26022] Updated weights on worker 0-0, policy_version 969573 (0.00098) [2022-07-11 01:06:53,190][26022] Updated weights on worker 0-0, policy_version 969583 (0.00084) [2022-07-11 01:06:54,886][26022] Updated weights on worker 0-0, policy_version 969593 (0.00431) [2022-07-11 01:06:56,293][25689] Fps is (10 sec: 5612.9, 60 sec: 5566.4, 300 sec: 5554.0). Total num frames: 992870400. Throughput: 0: 4991.7. Samples: 992868350. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:06:56,293][25689] Avg episode reward: [(0, '0.083')] [2022-07-11 01:06:56,941][26022] Updated weights on worker 0-0, policy_version 969603 (0.00090) [2022-07-11 01:06:58,397][26022] Updated weights on worker 0-0, policy_version 969613 (0.00085) [2022-07-11 01:07:00,550][26022] Updated weights on worker 0-0, policy_version 969623 (0.00085) [2022-07-11 01:07:01,335][25689] Fps is (10 sec: 5692.3, 60 sec: 5546.1, 300 sec: 5554.7). Total num frames: 992899072. Throughput: 0: 5822.3. Samples: 992901960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:01,335][25689] Avg episode reward: [(0, '0.111')] [2022-07-11 01:07:02,700][26022] Updated weights on worker 0-0, policy_version 969633 (0.00087) [2022-07-11 01:07:04,307][26022] Updated weights on worker 0-0, policy_version 969643 (0.00084) [2022-07-11 01:07:06,292][26022] Updated weights on worker 0-0, policy_version 969653 (0.00090) [2022-07-11 01:07:06,392][25689] Fps is (10 sec: 5373.8, 60 sec: 5545.1, 300 sec: 5551.0). Total num frames: 992924672. Throughput: 0: 5744.5. Samples: 992933620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:06,393][25689] Avg episode reward: [(0, '-0.571')] [2022-07-11 01:07:07,762][26022] Updated weights on worker 0-0, policy_version 969663 (0.00092) [2022-07-11 01:07:09,901][26022] Updated weights on worker 0-0, policy_version 969673 (0.00088) [2022-07-11 01:07:11,394][25689] Fps is (10 sec: 5395.2, 60 sec: 5552.2, 300 sec: 5547.8). Total num frames: 992953344. Throughput: 0: 4922.4. Samples: 992950588. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:11,395][25689] Avg episode reward: [(0, '-1.404')] [2022-07-11 01:07:11,755][26022] Updated weights on worker 0-0, policy_version 969683 (0.00089) [2022-07-11 01:07:13,353][26022] Updated weights on worker 0-0, policy_version 969693 (0.00086) [2022-07-11 01:07:15,399][26022] Updated weights on worker 0-0, policy_version 969703 (0.00085) [2022-07-11 01:07:16,422][25689] Fps is (10 sec: 5615.3, 60 sec: 5551.3, 300 sec: 5551.0). Total num frames: 992980992. Throughput: 0: 5774.5. Samples: 992984586. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:16,422][25689] Avg episode reward: [(0, '-2.826')] [2022-07-11 01:07:17,264][26022] Updated weights on worker 0-0, policy_version 969713 (0.00080) [2022-07-11 01:07:19,041][26022] Updated weights on worker 0-0, policy_version 969723 (0.00078) [2022-07-11 01:07:20,892][26022] Updated weights on worker 0-0, policy_version 969733 (0.00096) [2022-07-11 01:07:21,390][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:07:21,406][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000969736_993009664.pth [2022-07-11 01:07:21,407][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000967782_991008768.pth [2022-07-11 01:07:21,432][25689] Fps is (10 sec: 5610.8, 60 sec: 5567.7, 300 sec: 5548.3). Total num frames: 993009664. Throughput: 0: 5758.3. Samples: 993017686. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:21,432][25689] Avg episode reward: [(0, '-1.833')] [2022-07-11 01:07:22,800][26022] Updated weights on worker 0-0, policy_version 969743 (0.00097) [2022-07-11 01:07:24,415][26022] Updated weights on worker 0-0, policy_version 969753 (0.00108) [2022-07-11 01:07:26,405][26022] Updated weights on worker 0-0, policy_version 969763 (0.00091) [2022-07-11 01:07:26,511][25689] Fps is (10 sec: 5683.9, 60 sec: 5569.5, 300 sec: 5557.8). Total num frames: 993038336. Throughput: 0: 5016.8. Samples: 993034550. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:26,511][25689] Avg episode reward: [(0, '-1.874')] [2022-07-11 01:07:28,266][26022] Updated weights on worker 0-0, policy_version 969773 (0.00087) [2022-07-11 01:07:30,306][26022] Updated weights on worker 0-0, policy_version 969783 (0.00089) [2022-07-11 01:07:31,548][25689] Fps is (10 sec: 5567.5, 60 sec: 5553.0, 300 sec: 5550.4). Total num frames: 993065984. Throughput: 0: 5830.6. Samples: 993068096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:31,549][25689] Avg episode reward: [(0, '-3.220')] [2022-07-11 01:07:31,726][26022] Updated weights on worker 0-0, policy_version 969793 (0.00087) [2022-07-11 01:07:33,880][26022] Updated weights on worker 0-0, policy_version 969803 (0.00086) [2022-07-11 01:07:35,435][26022] Updated weights on worker 0-0, policy_version 969813 (0.00080) [2022-07-11 01:07:36,554][25689] Fps is (10 sec: 5505.7, 60 sec: 5575.1, 300 sec: 5550.8). Total num frames: 993093632. Throughput: 0: 5820.3. Samples: 993101762. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:36,559][25689] Avg episode reward: [(0, '-1.689')] [2022-07-11 01:07:37,625][26022] Updated weights on worker 0-0, policy_version 969823 (0.00092) [2022-07-11 01:07:39,093][26022] Updated weights on worker 0-0, policy_version 969833 (0.00092) [2022-07-11 01:07:41,151][26022] Updated weights on worker 0-0, policy_version 969843 (0.00089) [2022-07-11 01:07:41,563][25689] Fps is (10 sec: 5521.4, 60 sec: 5527.0, 300 sec: 5555.2). Total num frames: 993121280. Throughput: 0: 5014.2. Samples: 993118626. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:41,564][25689] Avg episode reward: [(0, '-1.427')] [2022-07-11 01:07:42,748][26022] Updated weights on worker 0-0, policy_version 969853 (0.00081) [2022-07-11 01:07:44,752][26022] Updated weights on worker 0-0, policy_version 969863 (0.00086) [2022-07-11 01:07:46,574][26022] Updated weights on worker 0-0, policy_version 969873 (0.00086) [2022-07-11 01:07:46,667][25689] Fps is (10 sec: 5568.9, 60 sec: 5560.0, 300 sec: 5553.5). Total num frames: 993149952. Throughput: 0: 5832.6. Samples: 993152116. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:46,668][25689] Avg episode reward: [(0, '0.090')] [2022-07-11 01:07:48,497][26022] Updated weights on worker 0-0, policy_version 969883 (0.00083) [2022-07-11 01:07:50,105][26022] Updated weights on worker 0-0, policy_version 969893 (0.00084) [2022-07-11 01:07:51,676][25689] Fps is (10 sec: 5670.2, 60 sec: 5576.4, 300 sec: 5554.1). Total num frames: 993178624. Throughput: 0: 5852.2. Samples: 993185890. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:51,676][25689] Avg episode reward: [(0, '0.060')] [2022-07-11 01:07:51,996][26022] Updated weights on worker 0-0, policy_version 969903 (0.00089) [2022-07-11 01:07:53,900][26022] Updated weights on worker 0-0, policy_version 969913 (0.00081) [2022-07-11 01:07:55,514][26022] Updated weights on worker 0-0, policy_version 969923 (0.00087) [2022-07-11 01:07:56,763][25689] Fps is (10 sec: 5578.5, 60 sec: 5554.3, 300 sec: 5556.2). Total num frames: 993206272. Throughput: 0: 4992.1. Samples: 993202648. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:07:56,765][25689] Avg episode reward: [(0, '0.590')] [2022-07-11 01:07:57,612][26022] Updated weights on worker 0-0, policy_version 969933 (0.00082) [2022-07-11 01:07:59,246][26022] Updated weights on worker 0-0, policy_version 969943 (0.00084) [2022-07-11 01:08:01,285][26022] Updated weights on worker 0-0, policy_version 969953 (0.00086) [2022-07-11 01:08:01,791][25689] Fps is (10 sec: 5466.5, 60 sec: 5538.7, 300 sec: 5557.0). Total num frames: 993233920. Throughput: 0: 5810.5. Samples: 993236164. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:01,793][25689] Avg episode reward: [(0, '1.449')] [2022-07-11 01:08:03,482][26022] Updated weights on worker 0-0, policy_version 969963 (0.00085) [2022-07-11 01:08:05,282][26022] Updated weights on worker 0-0, policy_version 969973 (0.00090) [2022-07-11 01:08:06,925][25689] Fps is (10 sec: 5340.8, 60 sec: 5548.6, 300 sec: 5548.1). Total num frames: 993260544. Throughput: 0: 5694.7. Samples: 993267478. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:06,925][25689] Avg episode reward: [(0, '1.102')] [2022-07-11 01:08:07,311][26022] Updated weights on worker 0-0, policy_version 969983 (0.00093) [2022-07-11 01:08:08,962][26022] Updated weights on worker 0-0, policy_version 969993 (0.00090) [2022-07-11 01:08:10,729][26022] Updated weights on worker 0-0, policy_version 970003 (0.00086) [2022-07-11 01:08:11,965][25689] Fps is (10 sec: 5435.5, 60 sec: 5545.2, 300 sec: 5554.6). Total num frames: 993289216. Throughput: 0: 4857.2. Samples: 993284436. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:11,965][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 01:08:12,725][26022] Updated weights on worker 0-0, policy_version 970013 (0.00092) [2022-07-11 01:08:14,254][26022] Updated weights on worker 0-0, policy_version 970023 (0.00089) [2022-07-11 01:08:16,391][26022] Updated weights on worker 0-0, policy_version 970033 (0.00096) [2022-07-11 01:08:16,979][25689] Fps is (10 sec: 5805.7, 60 sec: 5580.2, 300 sec: 5550.9). Total num frames: 993318912. Throughput: 0: 5701.6. Samples: 993317908. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:16,979][25689] Avg episode reward: [(0, '1.143')] [2022-07-11 01:08:17,997][26022] Updated weights on worker 0-0, policy_version 970043 (0.00089) [2022-07-11 01:08:19,923][26022] Updated weights on worker 0-0, policy_version 970053 (0.00100) [2022-07-11 01:08:21,809][26022] Updated weights on worker 0-0, policy_version 970063 (0.00093) [2022-07-11 01:08:22,007][25689] Fps is (10 sec: 5506.3, 60 sec: 5527.8, 300 sec: 5548.9). Total num frames: 993344512. Throughput: 0: 5709.8. Samples: 993351590. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:22,007][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 01:08:23,442][26022] Updated weights on worker 0-0, policy_version 970073 (0.00097) [2022-07-11 01:08:25,652][26022] Updated weights on worker 0-0, policy_version 970083 (0.00091) [2022-07-11 01:08:27,092][25689] Fps is (10 sec: 5467.8, 60 sec: 5544.2, 300 sec: 5554.6). Total num frames: 993374208. Throughput: 0: 5007.0. Samples: 993368452. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:27,093][25689] Avg episode reward: [(0, '0.976')] [2022-07-11 01:08:27,283][26022] Updated weights on worker 0-0, policy_version 970093 (0.00096) [2022-07-11 01:08:29,031][26022] Updated weights on worker 0-0, policy_version 970103 (0.00094) [2022-07-11 01:08:30,846][26022] Updated weights on worker 0-0, policy_version 970113 (0.00092) [2022-07-11 01:08:32,121][25689] Fps is (10 sec: 5771.0, 60 sec: 5561.8, 300 sec: 5561.4). Total num frames: 993402880. Throughput: 0: 5820.3. Samples: 993401752. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:32,122][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 01:08:32,685][26022] Updated weights on worker 0-0, policy_version 970123 (0.00090) [2022-07-11 01:08:34,591][26022] Updated weights on worker 0-0, policy_version 970133 (0.00094) [2022-07-11 01:08:36,300][26022] Updated weights on worker 0-0, policy_version 970143 (0.00087) [2022-07-11 01:08:37,138][25689] Fps is (10 sec: 5503.9, 60 sec: 5543.9, 300 sec: 5551.4). Total num frames: 993429504. Throughput: 0: 5824.9. Samples: 993435336. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:37,139][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 01:08:38,213][26022] Updated weights on worker 0-0, policy_version 970153 (0.00088) [2022-07-11 01:08:40,137][26022] Updated weights on worker 0-0, policy_version 970163 (0.00085) [2022-07-11 01:08:41,866][26022] Updated weights on worker 0-0, policy_version 970173 (0.00092) [2022-07-11 01:08:42,142][25689] Fps is (10 sec: 5517.8, 60 sec: 5561.2, 300 sec: 5552.9). Total num frames: 993458176. Throughput: 0: 4983.5. Samples: 993451934. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:42,144][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 01:08:43,826][26022] Updated weights on worker 0-0, policy_version 970183 (0.00091) [2022-07-11 01:08:45,528][26022] Updated weights on worker 0-0, policy_version 970193 (0.00090) [2022-07-11 01:08:47,241][25689] Fps is (10 sec: 5676.3, 60 sec: 5561.8, 300 sec: 5555.2). Total num frames: 993486848. Throughput: 0: 5796.1. Samples: 993485238. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:47,241][25689] Avg episode reward: [(0, '-0.261')] [2022-07-11 01:08:47,456][26022] Updated weights on worker 0-0, policy_version 970203 (0.00083) [2022-07-11 01:08:49,353][26022] Updated weights on worker 0-0, policy_version 970213 (0.00084) [2022-07-11 01:08:51,280][26022] Updated weights on worker 0-0, policy_version 970223 (0.00085) [2022-07-11 01:08:52,283][25689] Fps is (10 sec: 5351.9, 60 sec: 5508.0, 300 sec: 5544.7). Total num frames: 993512448. Throughput: 0: 5786.9. Samples: 993518428. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:52,283][25689] Avg episode reward: [(0, '-0.133')] [2022-07-11 01:08:53,019][26022] Updated weights on worker 0-0, policy_version 970233 (0.00085) [2022-07-11 01:08:54,740][26022] Updated weights on worker 0-0, policy_version 970243 (0.00092) [2022-07-11 01:08:56,630][26022] Updated weights on worker 0-0, policy_version 970253 (0.00087) [2022-07-11 01:08:57,341][25689] Fps is (10 sec: 5575.9, 60 sec: 5561.3, 300 sec: 5554.0). Total num frames: 993543168. Throughput: 0: 4954.6. Samples: 993535434. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:08:57,343][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 01:08:58,564][26022] Updated weights on worker 0-0, policy_version 970263 (0.00094) [2022-07-11 01:09:00,269][26022] Updated weights on worker 0-0, policy_version 970273 (0.00087) [2022-07-11 01:09:02,374][25689] Fps is (10 sec: 5580.9, 60 sec: 5527.1, 300 sec: 5555.5). Total num frames: 993568768. Throughput: 0: 5805.4. Samples: 993569390. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 01:09:02,375][25689] Avg episode reward: [(0, '-0.094')] [2022-07-11 01:09:02,607][26022] Updated weights on worker 0-0, policy_version 970283 (0.00086) [2022-07-11 01:09:04,418][26022] Updated weights on worker 0-0, policy_version 970293 (0.00091) [2022-07-11 01:09:06,298][26022] Updated weights on worker 0-0, policy_version 970303 (0.00083) [2022-07-11 01:09:07,429][25689] Fps is (10 sec: 5278.5, 60 sec: 5551.2, 300 sec: 5551.5). Total num frames: 993596416. Throughput: 0: 5717.1. Samples: 993600656. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:07,430][25689] Avg episode reward: [(0, '-0.820')] [2022-07-11 01:09:08,099][26022] Updated weights on worker 0-0, policy_version 970313 (0.00088) [2022-07-11 01:09:10,015][26022] Updated weights on worker 0-0, policy_version 970323 (0.00094) [2022-07-11 01:09:11,787][26022] Updated weights on worker 0-0, policy_version 970333 (0.00094) [2022-07-11 01:09:12,441][25689] Fps is (10 sec: 5594.7, 60 sec: 5553.8, 300 sec: 5558.5). Total num frames: 993625088. Throughput: 0: 4911.7. Samples: 993617440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:12,442][25689] Avg episode reward: [(0, '-0.179')] [2022-07-11 01:09:13,637][26022] Updated weights on worker 0-0, policy_version 970343 (0.00087) [2022-07-11 01:09:15,274][26022] Updated weights on worker 0-0, policy_version 970353 (0.00316) [2022-07-11 01:09:17,442][26022] Updated weights on worker 0-0, policy_version 970363 (0.00091) [2022-07-11 01:09:17,455][25689] Fps is (10 sec: 5515.3, 60 sec: 5503.0, 300 sec: 5551.5). Total num frames: 993651712. Throughput: 0: 5745.9. Samples: 993651004. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:17,455][25689] Avg episode reward: [(0, '0.540')] [2022-07-11 01:09:18,879][26022] Updated weights on worker 0-0, policy_version 970373 (0.00100) [2022-07-11 01:09:21,059][26022] Updated weights on worker 0-0, policy_version 970383 (0.00087) [2022-07-11 01:09:21,498][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:09:21,510][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000970386_993675264.pth [2022-07-11 01:09:21,511][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000968434_991676416.pth [2022-07-11 01:09:22,489][25689] Fps is (10 sec: 5605.1, 60 sec: 5570.2, 300 sec: 5555.7). Total num frames: 993681408. Throughput: 0: 5705.1. Samples: 993684146. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:22,490][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 01:09:22,681][26022] Updated weights on worker 0-0, policy_version 970393 (0.00099) [2022-07-11 01:09:24,767][26022] Updated weights on worker 0-0, policy_version 970403 (0.00090) [2022-07-11 01:09:26,503][26022] Updated weights on worker 0-0, policy_version 970413 (0.00085) [2022-07-11 01:09:27,564][25689] Fps is (10 sec: 5672.6, 60 sec: 5537.2, 300 sec: 5551.2). Total num frames: 993709056. Throughput: 0: 5798.0. Samples: 993717398. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:27,564][25689] Avg episode reward: [(0, '0.398')] [2022-07-11 01:09:28,241][26022] Updated weights on worker 0-0, policy_version 970423 (0.00087) [2022-07-11 01:09:30,075][26022] Updated weights on worker 0-0, policy_version 970433 (0.00085) [2022-07-11 01:09:32,123][26022] Updated weights on worker 0-0, policy_version 970443 (0.00087) [2022-07-11 01:09:32,591][25689] Fps is (10 sec: 5270.9, 60 sec: 5486.6, 300 sec: 5541.7). Total num frames: 993734656. Throughput: 0: 5784.3. Samples: 993733996. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:32,592][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 01:09:33,916][26022] Updated weights on worker 0-0, policy_version 970453 (0.00094) [2022-07-11 01:09:35,790][26022] Updated weights on worker 0-0, policy_version 970463 (0.00090) [2022-07-11 01:09:37,621][25689] Fps is (10 sec: 5498.1, 60 sec: 5536.2, 300 sec: 5551.6). Total num frames: 993764352. Throughput: 0: 5781.9. Samples: 993767604. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:37,622][25689] Avg episode reward: [(0, '0.625')] [2022-07-11 01:09:37,627][26022] Updated weights on worker 0-0, policy_version 970473 (0.00084) [2022-07-11 01:09:39,511][26022] Updated weights on worker 0-0, policy_version 970483 (0.00082) [2022-07-11 01:09:41,323][26022] Updated weights on worker 0-0, policy_version 970493 (0.00088) [2022-07-11 01:09:42,672][25689] Fps is (10 sec: 5688.4, 60 sec: 5515.0, 300 sec: 5548.5). Total num frames: 993792000. Throughput: 0: 5771.8. Samples: 993800640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:42,673][25689] Avg episode reward: [(0, '0.679')] [2022-07-11 01:09:43,333][26022] Updated weights on worker 0-0, policy_version 970503 (0.00080) [2022-07-11 01:09:44,940][26022] Updated weights on worker 0-0, policy_version 970513 (0.00079) [2022-07-11 01:09:47,100][26022] Updated weights on worker 0-0, policy_version 970523 (0.00091) [2022-07-11 01:09:47,760][25689] Fps is (10 sec: 5453.6, 60 sec: 5499.0, 300 sec: 5540.2). Total num frames: 993819648. Throughput: 0: 4953.6. Samples: 993817444. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:47,761][25689] Avg episode reward: [(0, '-0.102')] [2022-07-11 01:09:48,707][26022] Updated weights on worker 0-0, policy_version 970533 (0.00085) [2022-07-11 01:09:50,696][26022] Updated weights on worker 0-0, policy_version 970543 (0.00090) [2022-07-11 01:09:52,381][26022] Updated weights on worker 0-0, policy_version 970553 (0.00086) [2022-07-11 01:09:52,768][25689] Fps is (10 sec: 5578.8, 60 sec: 5553.0, 300 sec: 5547.8). Total num frames: 993848320. Throughput: 0: 5767.1. Samples: 993850354. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:52,768][25689] Avg episode reward: [(0, '-0.281')] [2022-07-11 01:09:54,231][26022] Updated weights on worker 0-0, policy_version 970563 (0.00097) [2022-07-11 01:09:56,250][26022] Updated weights on worker 0-0, policy_version 970573 (0.00092) [2022-07-11 01:09:57,801][25689] Fps is (10 sec: 5507.5, 60 sec: 5487.6, 300 sec: 5541.3). Total num frames: 993874944. Throughput: 0: 5749.7. Samples: 993883630. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:09:57,802][25689] Avg episode reward: [(0, '-0.096')] [2022-07-11 01:09:58,241][26022] Updated weights on worker 0-0, policy_version 970583 (0.00086) [2022-07-11 01:09:59,725][26022] Updated weights on worker 0-0, policy_version 970593 (0.00091) [2022-07-11 01:10:02,028][26022] Updated weights on worker 0-0, policy_version 970603 (0.00659) [2022-07-11 01:10:02,805][25689] Fps is (10 sec: 5304.7, 60 sec: 5507.1, 300 sec: 5546.7). Total num frames: 993901568. Throughput: 0: 4952.0. Samples: 993900340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:02,806][25689] Avg episode reward: [(0, '0.072')] [2022-07-11 01:10:03,655][26022] Updated weights on worker 0-0, policy_version 970613 (0.00089) [2022-07-11 01:10:05,835][26022] Updated weights on worker 0-0, policy_version 970623 (0.00092) [2022-07-11 01:10:07,538][26022] Updated weights on worker 0-0, policy_version 970633 (0.00076) [2022-07-11 01:10:07,913][25689] Fps is (10 sec: 5366.9, 60 sec: 5502.3, 300 sec: 5539.3). Total num frames: 993929216. Throughput: 0: 5683.7. Samples: 993931984. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:07,913][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 01:10:09,410][26022] Updated weights on worker 0-0, policy_version 970643 (0.00103) [2022-07-11 01:10:11,372][26022] Updated weights on worker 0-0, policy_version 970653 (0.00092) [2022-07-11 01:10:12,955][25689] Fps is (10 sec: 5549.2, 60 sec: 5499.6, 300 sec: 5543.6). Total num frames: 993957888. Throughput: 0: 5693.9. Samples: 993965298. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:12,955][25689] Avg episode reward: [(0, '0.252')] [2022-07-11 01:10:13,006][26022] Updated weights on worker 0-0, policy_version 970663 (0.00088) [2022-07-11 01:10:14,926][26022] Updated weights on worker 0-0, policy_version 970673 (0.00106) [2022-07-11 01:10:16,555][26022] Updated weights on worker 0-0, policy_version 970683 (0.00496) [2022-07-11 01:10:17,962][25689] Fps is (10 sec: 5604.7, 60 sec: 5517.1, 300 sec: 5543.8). Total num frames: 993985536. Throughput: 0: 4888.4. Samples: 993982186. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:17,963][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 01:10:18,490][26022] Updated weights on worker 0-0, policy_version 970693 (0.00094) [2022-07-11 01:10:20,482][26022] Updated weights on worker 0-0, policy_version 970703 (0.00083) [2022-07-11 01:10:22,330][26022] Updated weights on worker 0-0, policy_version 970713 (0.00087) [2022-07-11 01:10:22,963][25689] Fps is (10 sec: 5524.7, 60 sec: 5486.2, 300 sec: 5535.3). Total num frames: 994013184. Throughput: 0: 5697.1. Samples: 994015182. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:22,964][25689] Avg episode reward: [(0, '0.040')] [2022-07-11 01:10:24,124][26022] Updated weights on worker 0-0, policy_version 970723 (0.00080) [2022-07-11 01:10:26,089][26022] Updated weights on worker 0-0, policy_version 970733 (0.00202) [2022-07-11 01:10:27,887][26022] Updated weights on worker 0-0, policy_version 970744 (0.00089) [2022-07-11 01:10:27,997][25689] Fps is (10 sec: 5612.2, 60 sec: 5506.9, 300 sec: 5538.3). Total num frames: 994041856. Throughput: 0: 5818.9. Samples: 994048850. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:27,997][25689] Avg episode reward: [(0, '0.003')] [2022-07-11 01:10:29,767][26022] Updated weights on worker 0-0, policy_version 970754 (0.00088) [2022-07-11 01:10:31,533][26022] Updated weights on worker 0-0, policy_version 970764 (0.00086) [2022-07-11 01:10:33,019][25689] Fps is (10 sec: 5601.1, 60 sec: 5541.3, 300 sec: 5541.6). Total num frames: 994069504. Throughput: 0: 4997.3. Samples: 994065564. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:33,019][25689] Avg episode reward: [(0, '-0.717')] [2022-07-11 01:10:33,518][26022] Updated weights on worker 0-0, policy_version 970774 (0.00084) [2022-07-11 01:10:35,243][26022] Updated weights on worker 0-0, policy_version 970784 (0.00089) [2022-07-11 01:10:37,117][26022] Updated weights on worker 0-0, policy_version 970794 (0.00091) [2022-07-11 01:10:38,046][25689] Fps is (10 sec: 5502.7, 60 sec: 5507.7, 300 sec: 5537.9). Total num frames: 994097152. Throughput: 0: 5840.0. Samples: 994099478. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:38,046][25689] Avg episode reward: [(0, '-0.750')] [2022-07-11 01:10:38,863][26022] Updated weights on worker 0-0, policy_version 970804 (0.00085) [2022-07-11 01:10:40,824][26022] Updated weights on worker 0-0, policy_version 970814 (0.00090) [2022-07-11 01:10:42,694][26022] Updated weights on worker 0-0, policy_version 970824 (0.00083) [2022-07-11 01:10:43,083][25689] Fps is (10 sec: 5596.1, 60 sec: 5525.9, 300 sec: 5541.4). Total num frames: 994125824. Throughput: 0: 5857.4. Samples: 994133028. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:43,083][25689] Avg episode reward: [(0, '-1.905')] [2022-07-11 01:10:44,456][26022] Updated weights on worker 0-0, policy_version 970834 (0.00078) [2022-07-11 01:10:46,250][26022] Updated weights on worker 0-0, policy_version 970844 (0.00095) [2022-07-11 01:10:48,125][25689] Fps is (10 sec: 5587.9, 60 sec: 5530.2, 300 sec: 5540.8). Total num frames: 994153472. Throughput: 0: 5023.4. Samples: 994149960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:48,125][25689] Avg episode reward: [(0, '-1.374')] [2022-07-11 01:10:48,160][26022] Updated weights on worker 0-0, policy_version 970854 (0.00085) [2022-07-11 01:10:49,961][26022] Updated weights on worker 0-0, policy_version 970864 (0.00092) [2022-07-11 01:10:51,836][26022] Updated weights on worker 0-0, policy_version 970874 (0.00093) [2022-07-11 01:10:53,134][25689] Fps is (10 sec: 5501.5, 60 sec: 5513.0, 300 sec: 5540.7). Total num frames: 994181120. Throughput: 0: 5871.3. Samples: 994183666. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:53,134][25689] Avg episode reward: [(0, '-1.203')] [2022-07-11 01:10:53,635][26022] Updated weights on worker 0-0, policy_version 970884 (0.00095) [2022-07-11 01:10:55,363][26022] Updated weights on worker 0-0, policy_version 970894 (0.00091) [2022-07-11 01:10:57,261][26022] Updated weights on worker 0-0, policy_version 970904 (0.00090) [2022-07-11 01:10:58,140][25689] Fps is (10 sec: 5725.4, 60 sec: 5566.3, 300 sec: 5540.7). Total num frames: 994210816. Throughput: 0: 5850.1. Samples: 994217034. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:10:58,141][25689] Avg episode reward: [(0, '-1.873')] [2022-07-11 01:10:59,176][26022] Updated weights on worker 0-0, policy_version 970914 (0.00083) [2022-07-11 01:11:00,771][26022] Updated weights on worker 0-0, policy_version 970924 (0.00087) [2022-07-11 01:11:03,145][25689] Fps is (10 sec: 5421.0, 60 sec: 5532.4, 300 sec: 5538.0). Total num frames: 994235392. Throughput: 0: 5022.2. Samples: 994233786. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:03,146][25689] Avg episode reward: [(0, '-0.751')] [2022-07-11 01:11:03,404][26022] Updated weights on worker 0-0, policy_version 970934 (0.00086) [2022-07-11 01:11:04,819][26022] Updated weights on worker 0-0, policy_version 970944 (0.00095) [2022-07-11 01:11:06,819][26022] Updated weights on worker 0-0, policy_version 970954 (0.00093) [2022-07-11 01:11:08,224][25689] Fps is (10 sec: 5382.2, 60 sec: 5569.0, 300 sec: 5541.5). Total num frames: 994265088. Throughput: 0: 5741.2. Samples: 994265354. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:08,225][25689] Avg episode reward: [(0, '-1.429')] [2022-07-11 01:11:08,544][26022] Updated weights on worker 0-0, policy_version 970964 (0.00085) [2022-07-11 01:11:10,415][26022] Updated weights on worker 0-0, policy_version 970974 (0.00090) [2022-07-11 01:11:12,193][26022] Updated weights on worker 0-0, policy_version 970984 (0.00082) [2022-07-11 01:11:13,237][25689] Fps is (10 sec: 5581.0, 60 sec: 5537.7, 300 sec: 5538.2). Total num frames: 994291712. Throughput: 0: 5737.3. Samples: 994299002. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:13,238][25689] Avg episode reward: [(0, '-0.282')] [2022-07-11 01:11:14,177][26022] Updated weights on worker 0-0, policy_version 970994 (0.00090) [2022-07-11 01:11:15,783][26022] Updated weights on worker 0-0, policy_version 971004 (0.00087) [2022-07-11 01:11:17,912][26022] Updated weights on worker 0-0, policy_version 971014 (0.00089) [2022-07-11 01:11:18,249][25689] Fps is (10 sec: 5515.8, 60 sec: 5554.2, 300 sec: 5541.4). Total num frames: 994320384. Throughput: 0: 4917.0. Samples: 994315908. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:18,251][25689] Avg episode reward: [(0, '-0.278')] [2022-07-11 01:11:19,426][26022] Updated weights on worker 0-0, policy_version 971024 (0.00087) [2022-07-11 01:11:21,437][26022] Updated weights on worker 0-0, policy_version 971034 (0.00087) [2022-07-11 01:11:21,621][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:11:21,646][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000971035_994339840.pth [2022-07-11 01:11:21,656][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000969086_992344064.pth [2022-07-11 01:11:23,295][25689] Fps is (10 sec: 5599.1, 60 sec: 5550.1, 300 sec: 5539.0). Total num frames: 994348032. Throughput: 0: 5747.2. Samples: 994349592. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:23,296][25689] Avg episode reward: [(0, '-0.037')] [2022-07-11 01:11:23,444][26022] Updated weights on worker 0-0, policy_version 971044 (0.00087) [2022-07-11 01:11:25,105][26022] Updated weights on worker 0-0, policy_version 971054 (0.00082) [2022-07-11 01:11:27,027][26022] Updated weights on worker 0-0, policy_version 971064 (0.00084) [2022-07-11 01:11:28,411][25689] Fps is (10 sec: 5542.1, 60 sec: 5542.5, 300 sec: 5537.6). Total num frames: 994376704. Throughput: 0: 5819.7. Samples: 994382838. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:28,412][25689] Avg episode reward: [(0, '-0.077')] [2022-07-11 01:11:28,843][26022] Updated weights on worker 0-0, policy_version 971074 (0.00093) [2022-07-11 01:11:30,576][26022] Updated weights on worker 0-0, policy_version 971084 (0.00912) [2022-07-11 01:11:32,527][26022] Updated weights on worker 0-0, policy_version 971094 (0.00090) [2022-07-11 01:11:33,417][25689] Fps is (10 sec: 5766.8, 60 sec: 5577.9, 300 sec: 5549.0). Total num frames: 994406400. Throughput: 0: 4978.7. Samples: 994399472. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:33,418][25689] Avg episode reward: [(0, '-0.263')] [2022-07-11 01:11:34,263][26022] Updated weights on worker 0-0, policy_version 971104 (0.00087) [2022-07-11 01:11:36,084][26022] Updated weights on worker 0-0, policy_version 971114 (0.00084) [2022-07-11 01:11:37,964][26022] Updated weights on worker 0-0, policy_version 971124 (0.00094) [2022-07-11 01:11:38,467][25689] Fps is (10 sec: 5600.7, 60 sec: 5558.8, 300 sec: 5535.0). Total num frames: 994433024. Throughput: 0: 5802.0. Samples: 994433214. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:38,469][25689] Avg episode reward: [(0, '0.253')] [2022-07-11 01:11:39,778][26022] Updated weights on worker 0-0, policy_version 971134 (0.00085) [2022-07-11 01:11:41,695][26022] Updated weights on worker 0-0, policy_version 971144 (0.00091) [2022-07-11 01:11:43,449][26022] Updated weights on worker 0-0, policy_version 971154 (0.00091) [2022-07-11 01:11:43,548][25689] Fps is (10 sec: 5457.8, 60 sec: 5554.8, 300 sec: 5542.1). Total num frames: 994461696. Throughput: 0: 5775.0. Samples: 994466552. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:43,549][25689] Avg episode reward: [(0, '0.613')] [2022-07-11 01:11:45,325][26022] Updated weights on worker 0-0, policy_version 971164 (0.00087) [2022-07-11 01:11:47,260][26022] Updated weights on worker 0-0, policy_version 971174 (0.00088) [2022-07-11 01:11:48,594][25689] Fps is (10 sec: 5662.4, 60 sec: 5571.3, 300 sec: 5544.8). Total num frames: 994490368. Throughput: 0: 4968.1. Samples: 994483110. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:48,595][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 01:11:48,895][26022] Updated weights on worker 0-0, policy_version 971184 (0.00088) [2022-07-11 01:11:50,995][26022] Updated weights on worker 0-0, policy_version 971194 (0.00081) [2022-07-11 01:11:52,676][26022] Updated weights on worker 0-0, policy_version 971204 (0.00087) [2022-07-11 01:11:53,683][25689] Fps is (10 sec: 5557.0, 60 sec: 5564.0, 300 sec: 5540.2). Total num frames: 994518016. Throughput: 0: 5801.5. Samples: 994517048. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:53,684][25689] Avg episode reward: [(0, '-0.129')] [2022-07-11 01:11:54,318][26022] Updated weights on worker 0-0, policy_version 971214 (0.00086) [2022-07-11 01:11:56,316][26022] Updated weights on worker 0-0, policy_version 971224 (0.00083) [2022-07-11 01:11:58,138][26022] Updated weights on worker 0-0, policy_version 971234 (0.00082) [2022-07-11 01:11:58,747][25689] Fps is (10 sec: 5446.2, 60 sec: 5524.9, 300 sec: 5536.4). Total num frames: 994545664. Throughput: 0: 5769.1. Samples: 994550212. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:11:58,748][25689] Avg episode reward: [(0, '0.175')] [2022-07-11 01:12:00,023][26022] Updated weights on worker 0-0, policy_version 971244 (0.00088) [2022-07-11 01:12:02,064][26022] Updated weights on worker 0-0, policy_version 971254 (0.00085) [2022-07-11 01:12:03,759][25689] Fps is (10 sec: 5386.3, 60 sec: 5558.0, 300 sec: 5540.7). Total num frames: 994572288. Throughput: 0: 5689.4. Samples: 994581540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:03,760][25689] Avg episode reward: [(0, '0.238')] [2022-07-11 01:12:04,322][26022] Updated weights on worker 0-0, policy_version 971264 (0.00090) [2022-07-11 01:12:05,967][26022] Updated weights on worker 0-0, policy_version 971274 (0.00092) [2022-07-11 01:12:07,836][26022] Updated weights on worker 0-0, policy_version 971284 (0.00086) [2022-07-11 01:12:08,805][25689] Fps is (10 sec: 5396.3, 60 sec: 5527.3, 300 sec: 5536.4). Total num frames: 994599936. Throughput: 0: 5692.4. Samples: 994598156. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:08,805][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 01:12:09,612][26022] Updated weights on worker 0-0, policy_version 971294 (0.00094) [2022-07-11 01:12:11,496][26022] Updated weights on worker 0-0, policy_version 971304 (0.00089) [2022-07-11 01:12:13,466][26022] Updated weights on worker 0-0, policy_version 971314 (0.00084) [2022-07-11 01:12:13,836][25689] Fps is (10 sec: 5487.4, 60 sec: 5542.5, 300 sec: 5536.4). Total num frames: 994627584. Throughput: 0: 5685.7. Samples: 994631632. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:13,837][25689] Avg episode reward: [(0, '0.746')] [2022-07-11 01:12:15,081][26022] Updated weights on worker 0-0, policy_version 971324 (0.00286) [2022-07-11 01:12:16,962][26022] Updated weights on worker 0-0, policy_version 971334 (0.00090) [2022-07-11 01:12:18,685][26022] Updated weights on worker 0-0, policy_version 971344 (0.00093) [2022-07-11 01:12:18,871][25689] Fps is (10 sec: 5595.0, 60 sec: 5540.4, 300 sec: 5535.9). Total num frames: 994656256. Throughput: 0: 5712.7. Samples: 994665172. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:18,871][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 01:12:20,764][26022] Updated weights on worker 0-0, policy_version 971354 (0.00091) [2022-07-11 01:12:22,546][26022] Updated weights on worker 0-0, policy_version 971364 (0.00090) [2022-07-11 01:12:23,888][25689] Fps is (10 sec: 5602.8, 60 sec: 5543.1, 300 sec: 5533.6). Total num frames: 994683904. Throughput: 0: 4980.8. Samples: 994681800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:23,892][25689] Avg episode reward: [(0, '0.274')] [2022-07-11 01:12:24,635][26022] Updated weights on worker 0-0, policy_version 971374 (0.00087) [2022-07-11 01:12:26,356][26022] Updated weights on worker 0-0, policy_version 971384 (0.00087) [2022-07-11 01:12:28,208][26022] Updated weights on worker 0-0, policy_version 971394 (0.00089) [2022-07-11 01:12:28,950][25689] Fps is (10 sec: 5486.3, 60 sec: 5531.2, 300 sec: 5533.1). Total num frames: 994711552. Throughput: 0: 5774.4. Samples: 994714482. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:28,950][25689] Avg episode reward: [(0, '0.922')] [2022-07-11 01:12:29,994][26022] Updated weights on worker 0-0, policy_version 971404 (0.00079) [2022-07-11 01:12:31,956][26022] Updated weights on worker 0-0, policy_version 971414 (0.00084) [2022-07-11 01:12:33,572][26022] Updated weights on worker 0-0, policy_version 971424 (0.00089) [2022-07-11 01:12:33,984][25689] Fps is (10 sec: 5477.5, 60 sec: 5494.8, 300 sec: 5532.6). Total num frames: 994739200. Throughput: 0: 5760.1. Samples: 994747682. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:33,985][25689] Avg episode reward: [(0, '0.598')] [2022-07-11 01:12:35,667][26022] Updated weights on worker 0-0, policy_version 971434 (0.00095) [2022-07-11 01:12:37,479][26022] Updated weights on worker 0-0, policy_version 971444 (0.00049) [2022-07-11 01:12:39,074][25689] Fps is (10 sec: 5461.9, 60 sec: 5508.1, 300 sec: 5531.1). Total num frames: 994766848. Throughput: 0: 4906.0. Samples: 994764288. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:39,074][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 01:12:39,291][26022] Updated weights on worker 0-0, policy_version 971454 (0.00083) [2022-07-11 01:12:41,264][26022] Updated weights on worker 0-0, policy_version 971464 (0.00089) [2022-07-11 01:12:42,792][26022] Updated weights on worker 0-0, policy_version 971474 (0.00091) [2022-07-11 01:12:44,099][25689] Fps is (10 sec: 5365.2, 60 sec: 5479.3, 300 sec: 5525.7). Total num frames: 994793472. Throughput: 0: 5734.7. Samples: 994797702. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:44,100][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 01:12:45,055][26022] Updated weights on worker 0-0, policy_version 971484 (0.00089) [2022-07-11 01:12:46,478][26022] Updated weights on worker 0-0, policy_version 971494 (0.00084) [2022-07-11 01:12:48,538][26022] Updated weights on worker 0-0, policy_version 971504 (0.00089) [2022-07-11 01:12:49,221][25689] Fps is (10 sec: 5651.2, 60 sec: 5506.2, 300 sec: 5530.5). Total num frames: 994824192. Throughput: 0: 5751.1. Samples: 994831064. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:49,222][25689] Avg episode reward: [(0, '1.254')] [2022-07-11 01:12:50,291][26022] Updated weights on worker 0-0, policy_version 971514 (0.00084) [2022-07-11 01:12:52,149][26022] Updated weights on worker 0-0, policy_version 971524 (0.00088) [2022-07-11 01:12:53,817][26022] Updated weights on worker 0-0, policy_version 971534 (0.00090) [2022-07-11 01:12:54,250][25689] Fps is (10 sec: 5750.0, 60 sec: 5511.7, 300 sec: 5531.6). Total num frames: 994851840. Throughput: 0: 4952.3. Samples: 994848048. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:54,251][25689] Avg episode reward: [(0, '0.819')] [2022-07-11 01:12:55,946][26022] Updated weights on worker 0-0, policy_version 971544 (0.00086) [2022-07-11 01:12:57,561][26022] Updated weights on worker 0-0, policy_version 971554 (0.00085) [2022-07-11 01:12:59,342][25689] Fps is (10 sec: 5362.4, 60 sec: 5492.3, 300 sec: 5526.9). Total num frames: 994878464. Throughput: 0: 5777.6. Samples: 994881390. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2022-07-11 01:12:59,342][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 01:12:59,671][26022] Updated weights on worker 0-0, policy_version 971564 (0.00087) [2022-07-11 01:13:01,119][26022] Updated weights on worker 0-0, policy_version 971574 (0.00086) [2022-07-11 01:13:03,542][26022] Updated weights on worker 0-0, policy_version 971584 (0.00089) [2022-07-11 01:13:04,404][25689] Fps is (10 sec: 5546.8, 60 sec: 5538.4, 300 sec: 5538.6). Total num frames: 994908160. Throughput: 0: 5678.3. Samples: 994912998. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:04,404][25689] Avg episode reward: [(0, '0.014')] [2022-07-11 01:13:05,435][26022] Updated weights on worker 0-0, policy_version 971594 (0.00090) [2022-07-11 01:13:07,124][26022] Updated weights on worker 0-0, policy_version 971604 (0.00088) [2022-07-11 01:13:09,118][26022] Updated weights on worker 0-0, policy_version 971614 (0.00085) [2022-07-11 01:13:09,454][25689] Fps is (10 sec: 5569.6, 60 sec: 5521.1, 300 sec: 5531.5). Total num frames: 994934784. Throughput: 0: 4891.8. Samples: 994930040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:09,454][25689] Avg episode reward: [(0, '-0.101')] [2022-07-11 01:13:11,047][26022] Updated weights on worker 0-0, policy_version 971624 (0.00088) [2022-07-11 01:13:12,618][26022] Updated weights on worker 0-0, policy_version 971634 (0.00087) [2022-07-11 01:13:14,486][25689] Fps is (10 sec: 5281.4, 60 sec: 5504.1, 300 sec: 5520.9). Total num frames: 994961408. Throughput: 0: 5708.6. Samples: 994963566. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:14,487][25689] Avg episode reward: [(0, '-0.909')] [2022-07-11 01:13:14,783][26022] Updated weights on worker 0-0, policy_version 971644 (0.00087) [2022-07-11 01:13:16,144][26022] Updated weights on worker 0-0, policy_version 971654 (0.00089) [2022-07-11 01:13:18,243][26022] Updated weights on worker 0-0, policy_version 971664 (0.00085) [2022-07-11 01:13:19,496][25689] Fps is (10 sec: 5608.6, 60 sec: 5523.3, 300 sec: 5535.0). Total num frames: 994991104. Throughput: 0: 5755.2. Samples: 994997380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:19,496][25689] Avg episode reward: [(0, '-0.266')] [2022-07-11 01:13:20,099][26022] Updated weights on worker 0-0, policy_version 971674 (0.00086) [2022-07-11 01:13:21,721][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:13:21,733][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000971684_995004416.pth [2022-07-11 01:13:21,734][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000969736_993009664.pth [2022-07-11 01:13:21,740][26022] Updated weights on worker 0-0, policy_version 971684 (0.00085) [2022-07-11 01:13:23,867][26022] Updated weights on worker 0-0, policy_version 971694 (0.00088) [2022-07-11 01:13:24,523][25689] Fps is (10 sec: 5713.4, 60 sec: 5522.4, 300 sec: 5529.2). Total num frames: 995018752. Throughput: 0: 5028.4. Samples: 995014164. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:24,523][25689] Avg episode reward: [(0, '0.221')] [2022-07-11 01:13:25,519][26022] Updated weights on worker 0-0, policy_version 971704 (0.00094) [2022-07-11 01:13:27,428][26022] Updated weights on worker 0-0, policy_version 971714 (0.00094) [2022-07-11 01:13:29,231][26022] Updated weights on worker 0-0, policy_version 971724 (0.00091) [2022-07-11 01:13:29,663][25689] Fps is (10 sec: 5338.0, 60 sec: 5498.4, 300 sec: 5520.2). Total num frames: 995045376. Throughput: 0: 5796.4. Samples: 995047178. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:29,663][25689] Avg episode reward: [(0, '0.201')] [2022-07-11 01:13:31,133][26022] Updated weights on worker 0-0, policy_version 971734 (0.00085) [2022-07-11 01:13:33,111][26022] Updated weights on worker 0-0, policy_version 971744 (0.00085) [2022-07-11 01:13:34,675][25689] Fps is (10 sec: 5547.3, 60 sec: 5534.1, 300 sec: 5530.6). Total num frames: 995075072. Throughput: 0: 5783.5. Samples: 995080330. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:34,676][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 01:13:34,852][26022] Updated weights on worker 0-0, policy_version 971754 (0.00092) [2022-07-11 01:13:36,678][26022] Updated weights on worker 0-0, policy_version 971764 (0.00116) [2022-07-11 01:13:38,656][26022] Updated weights on worker 0-0, policy_version 971774 (0.00091) [2022-07-11 01:13:39,693][25689] Fps is (10 sec: 5716.8, 60 sec: 5540.6, 300 sec: 5526.9). Total num frames: 995102720. Throughput: 0: 4937.8. Samples: 995097114. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:39,694][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 01:13:40,413][26022] Updated weights on worker 0-0, policy_version 971784 (0.00086) [2022-07-11 01:13:42,275][26022] Updated weights on worker 0-0, policy_version 971794 (0.00091) [2022-07-11 01:13:44,035][26022] Updated weights on worker 0-0, policy_version 971804 (0.00095) [2022-07-11 01:13:44,714][25689] Fps is (10 sec: 5508.4, 60 sec: 5558.0, 300 sec: 5524.9). Total num frames: 995130368. Throughput: 0: 5778.2. Samples: 995130832. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:44,714][25689] Avg episode reward: [(0, '1.233')] [2022-07-11 01:13:45,713][26022] Updated weights on worker 0-0, policy_version 971814 (0.00078) [2022-07-11 01:13:47,760][26022] Updated weights on worker 0-0, policy_version 971824 (0.00079) [2022-07-11 01:13:49,497][26022] Updated weights on worker 0-0, policy_version 971834 (0.00086) [2022-07-11 01:13:49,782][25689] Fps is (10 sec: 5582.5, 60 sec: 5529.1, 300 sec: 5534.8). Total num frames: 995159040. Throughput: 0: 5832.2. Samples: 995164518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:49,783][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 01:13:51,496][26022] Updated weights on worker 0-0, policy_version 971844 (0.00086) [2022-07-11 01:13:53,209][26022] Updated weights on worker 0-0, policy_version 971854 (0.00094) [2022-07-11 01:13:54,808][25689] Fps is (10 sec: 5680.7, 60 sec: 5546.3, 300 sec: 5528.5). Total num frames: 995187712. Throughput: 0: 5001.1. Samples: 995181016. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:54,809][25689] Avg episode reward: [(0, '0.401')] [2022-07-11 01:13:54,959][26022] Updated weights on worker 0-0, policy_version 971864 (0.00092) [2022-07-11 01:13:57,130][26022] Updated weights on worker 0-0, policy_version 971874 (0.00097) [2022-07-11 01:13:58,585][26022] Updated weights on worker 0-0, policy_version 971884 (0.00087) [2022-07-11 01:13:59,838][25689] Fps is (10 sec: 5498.6, 60 sec: 5551.9, 300 sec: 5532.0). Total num frames: 995214336. Throughput: 0: 5827.5. Samples: 995214510. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:13:59,839][25689] Avg episode reward: [(0, '-0.224')] [2022-07-11 01:14:00,642][26022] Updated weights on worker 0-0, policy_version 971894 (0.00092) [2022-07-11 01:14:02,710][26022] Updated weights on worker 0-0, policy_version 971904 (0.00091) [2022-07-11 01:14:04,568][26022] Updated weights on worker 0-0, policy_version 971914 (0.00091) [2022-07-11 01:14:04,846][25689] Fps is (10 sec: 5406.8, 60 sec: 5523.0, 300 sec: 5532.9). Total num frames: 995241984. Throughput: 0: 5711.5. Samples: 995245816. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:04,848][25689] Avg episode reward: [(0, '-0.401')] [2022-07-11 01:14:06,559][26022] Updated weights on worker 0-0, policy_version 971924 (0.00087) [2022-07-11 01:14:08,137][26022] Updated weights on worker 0-0, policy_version 971934 (0.00092) [2022-07-11 01:14:09,885][25689] Fps is (10 sec: 5401.9, 60 sec: 5524.0, 300 sec: 5525.5). Total num frames: 995268608. Throughput: 0: 4882.7. Samples: 995262676. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:09,886][25689] Avg episode reward: [(0, '-0.408')] [2022-07-11 01:14:10,152][26022] Updated weights on worker 0-0, policy_version 971944 (0.00094) [2022-07-11 01:14:11,886][26022] Updated weights on worker 0-0, policy_version 971954 (0.00086) [2022-07-11 01:14:13,857][26022] Updated weights on worker 0-0, policy_version 971964 (0.00085) [2022-07-11 01:14:14,929][25689] Fps is (10 sec: 5382.6, 60 sec: 5539.9, 300 sec: 5528.4). Total num frames: 995296256. Throughput: 0: 5721.4. Samples: 995296134. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:14,929][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 01:14:15,507][26022] Updated weights on worker 0-0, policy_version 971974 (0.00088) [2022-07-11 01:14:17,427][26022] Updated weights on worker 0-0, policy_version 971984 (0.00090) [2022-07-11 01:14:19,187][26022] Updated weights on worker 0-0, policy_version 971994 (0.01106) [2022-07-11 01:14:19,948][25689] Fps is (10 sec: 5698.4, 60 sec: 5539.0, 300 sec: 5528.6). Total num frames: 995325952. Throughput: 0: 5758.6. Samples: 995330316. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:19,949][25689] Avg episode reward: [(0, '0.148')] [2022-07-11 01:14:21,030][26022] Updated weights on worker 0-0, policy_version 972004 (0.00091) [2022-07-11 01:14:22,931][26022] Updated weights on worker 0-0, policy_version 972014 (0.00084) [2022-07-11 01:14:24,738][26022] Updated weights on worker 0-0, policy_version 972024 (0.00074) [2022-07-11 01:14:24,963][25689] Fps is (10 sec: 5613.0, 60 sec: 5523.2, 300 sec: 5526.3). Total num frames: 995352576. Throughput: 0: 5039.4. Samples: 995347192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:24,963][25689] Avg episode reward: [(0, '-0.106')] [2022-07-11 01:14:26,764][26022] Updated weights on worker 0-0, policy_version 972034 (0.00096) [2022-07-11 01:14:28,336][26022] Updated weights on worker 0-0, policy_version 972044 (0.00085) [2022-07-11 01:14:30,007][25689] Fps is (10 sec: 5497.6, 60 sec: 5566.0, 300 sec: 5536.3). Total num frames: 995381248. Throughput: 0: 5836.6. Samples: 995380116. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:30,007][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 01:14:30,448][26022] Updated weights on worker 0-0, policy_version 972054 (0.00084) [2022-07-11 01:14:32,075][26022] Updated weights on worker 0-0, policy_version 972064 (0.00092) [2022-07-11 01:14:33,992][26022] Updated weights on worker 0-0, policy_version 972074 (0.00095) [2022-07-11 01:14:35,015][25689] Fps is (10 sec: 5704.8, 60 sec: 5549.4, 300 sec: 5533.3). Total num frames: 995409920. Throughput: 0: 5863.6. Samples: 995413908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:35,015][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 01:14:35,881][26022] Updated weights on worker 0-0, policy_version 972084 (0.00088) [2022-07-11 01:14:37,475][26022] Updated weights on worker 0-0, policy_version 972094 (0.00088) [2022-07-11 01:14:39,633][26022] Updated weights on worker 0-0, policy_version 972104 (0.00089) [2022-07-11 01:14:40,032][25689] Fps is (10 sec: 5617.7, 60 sec: 5549.5, 300 sec: 5533.9). Total num frames: 995437568. Throughput: 0: 4999.2. Samples: 995430718. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:40,033][25689] Avg episode reward: [(0, '-0.204')] [2022-07-11 01:14:41,225][26022] Updated weights on worker 0-0, policy_version 972114 (0.00086) [2022-07-11 01:14:43,041][26022] Updated weights on worker 0-0, policy_version 972124 (0.00084) [2022-07-11 01:14:45,008][26022] Updated weights on worker 0-0, policy_version 972134 (0.00092) [2022-07-11 01:14:45,051][25689] Fps is (10 sec: 5509.7, 60 sec: 5549.6, 300 sec: 5535.2). Total num frames: 995465216. Throughput: 0: 5834.8. Samples: 995464402. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:45,051][25689] Avg episode reward: [(0, '-0.045')] [2022-07-11 01:14:46,815][26022] Updated weights on worker 0-0, policy_version 972144 (0.00082) [2022-07-11 01:14:48,763][26022] Updated weights on worker 0-0, policy_version 972154 (0.00083) [2022-07-11 01:14:50,216][25689] Fps is (10 sec: 5530.2, 60 sec: 5540.7, 300 sec: 5532.3). Total num frames: 995493888. Throughput: 0: 5808.5. Samples: 995497504. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:50,217][25689] Avg episode reward: [(0, '0.281')] [2022-07-11 01:14:50,552][26022] Updated weights on worker 0-0, policy_version 972164 (0.00096) [2022-07-11 01:14:52,376][26022] Updated weights on worker 0-0, policy_version 972174 (0.00095) [2022-07-11 01:14:54,313][26022] Updated weights on worker 0-0, policy_version 972184 (0.01162) [2022-07-11 01:14:55,282][25689] Fps is (10 sec: 5504.6, 60 sec: 5520.1, 300 sec: 5535.1). Total num frames: 995521536. Throughput: 0: 5756.0. Samples: 995530568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:14:55,284][25689] Avg episode reward: [(0, '0.755')] [2022-07-11 01:14:56,088][26022] Updated weights on worker 0-0, policy_version 972194 (0.00087) [2022-07-11 01:14:58,046][26022] Updated weights on worker 0-0, policy_version 972204 (0.00087) [2022-07-11 01:14:59,791][26022] Updated weights on worker 0-0, policy_version 972214 (0.00093) [2022-07-11 01:15:00,378][25689] Fps is (10 sec: 5441.7, 60 sec: 5531.1, 300 sec: 5536.9). Total num frames: 995549184. Throughput: 0: 5709.7. Samples: 995546886. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:00,378][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 01:15:01,639][26022] Updated weights on worker 0-0, policy_version 972224 (0.00092) [2022-07-11 01:15:03,897][26022] Updated weights on worker 0-0, policy_version 972234 (0.00086) [2022-07-11 01:15:05,472][25689] Fps is (10 sec: 5325.9, 60 sec: 5506.2, 300 sec: 5533.7). Total num frames: 995575808. Throughput: 0: 5555.7. Samples: 995577858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:05,474][25689] Avg episode reward: [(0, '1.712')] [2022-07-11 01:15:05,856][26022] Updated weights on worker 0-0, policy_version 972244 (0.00091) [2022-07-11 01:15:07,812][26022] Updated weights on worker 0-0, policy_version 972254 (0.00088) [2022-07-11 01:15:09,635][26022] Updated weights on worker 0-0, policy_version 972264 (0.00086) [2022-07-11 01:15:10,524][25689] Fps is (10 sec: 5247.7, 60 sec: 5505.1, 300 sec: 5526.6). Total num frames: 995602432. Throughput: 0: 5584.4. Samples: 995610914. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:10,526][25689] Avg episode reward: [(0, '0.744')] [2022-07-11 01:15:11,354][26022] Updated weights on worker 0-0, policy_version 972274 (0.00087) [2022-07-11 01:15:13,258][26022] Updated weights on worker 0-0, policy_version 972284 (0.00086) [2022-07-11 01:15:15,143][26022] Updated weights on worker 0-0, policy_version 972294 (0.00096) [2022-07-11 01:15:15,551][25689] Fps is (10 sec: 5588.2, 60 sec: 5540.4, 300 sec: 5533.1). Total num frames: 995632128. Throughput: 0: 4791.0. Samples: 995627674. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:15,551][25689] Avg episode reward: [(0, '0.689')] [2022-07-11 01:15:16,994][26022] Updated weights on worker 0-0, policy_version 972304 (0.00085) [2022-07-11 01:15:18,848][26022] Updated weights on worker 0-0, policy_version 972314 (0.00085) [2022-07-11 01:15:20,487][26022] Updated weights on worker 0-0, policy_version 972324 (0.00091) [2022-07-11 01:15:20,555][25689] Fps is (10 sec: 5717.1, 60 sec: 5508.1, 300 sec: 5533.1). Total num frames: 995659776. Throughput: 0: 5667.2. Samples: 995661234. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:20,555][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 01:15:21,871][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:15:21,880][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000972331_995666944.pth [2022-07-11 01:15:21,881][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000970386_993675264.pth [2022-07-11 01:15:22,297][26022] Updated weights on worker 0-0, policy_version 972334 (0.00089) [2022-07-11 01:15:24,496][26022] Updated weights on worker 0-0, policy_version 972344 (0.00097) [2022-07-11 01:15:25,568][25689] Fps is (10 sec: 5520.0, 60 sec: 5525.0, 300 sec: 5530.0). Total num frames: 995687424. Throughput: 0: 5838.5. Samples: 995695188. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:25,570][25689] Avg episode reward: [(0, '0.250')] [2022-07-11 01:15:25,983][26022] Updated weights on worker 0-0, policy_version 972354 (0.00093) [2022-07-11 01:15:28,140][26022] Updated weights on worker 0-0, policy_version 972364 (0.00093) [2022-07-11 01:15:29,861][26022] Updated weights on worker 0-0, policy_version 972374 (0.00089) [2022-07-11 01:15:30,695][25689] Fps is (10 sec: 5452.9, 60 sec: 5500.6, 300 sec: 5528.0). Total num frames: 995715072. Throughput: 0: 4973.5. Samples: 995711234. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:30,696][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 01:15:31,759][26022] Updated weights on worker 0-0, policy_version 972384 (0.00086) [2022-07-11 01:15:33,496][26022] Updated weights on worker 0-0, policy_version 972394 (0.00088) [2022-07-11 01:15:35,337][26022] Updated weights on worker 0-0, policy_version 972404 (0.00086) [2022-07-11 01:15:35,745][25689] Fps is (10 sec: 5433.4, 60 sec: 5479.9, 300 sec: 5527.6). Total num frames: 995742720. Throughput: 0: 5794.4. Samples: 995744688. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:35,746][25689] Avg episode reward: [(0, '0.742')] [2022-07-11 01:15:37,153][26022] Updated weights on worker 0-0, policy_version 972414 (0.00094) [2022-07-11 01:15:39,200][26022] Updated weights on worker 0-0, policy_version 972424 (0.00087) [2022-07-11 01:15:40,748][25689] Fps is (10 sec: 5602.5, 60 sec: 5498.1, 300 sec: 5528.2). Total num frames: 995771392. Throughput: 0: 5785.4. Samples: 995778060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:40,749][25689] Avg episode reward: [(0, '1.599')] [2022-07-11 01:15:40,831][26022] Updated weights on worker 0-0, policy_version 972434 (0.00108) [2022-07-11 01:15:42,816][26022] Updated weights on worker 0-0, policy_version 972444 (0.00088) [2022-07-11 01:15:44,538][26022] Updated weights on worker 0-0, policy_version 972454 (0.00091) [2022-07-11 01:15:45,810][25689] Fps is (10 sec: 5493.7, 60 sec: 5477.3, 300 sec: 5524.4). Total num frames: 995798016. Throughput: 0: 4906.4. Samples: 995794504. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:45,811][25689] Avg episode reward: [(0, '1.685')] [2022-07-11 01:15:46,482][26022] Updated weights on worker 0-0, policy_version 972464 (0.00086) [2022-07-11 01:15:48,387][26022] Updated weights on worker 0-0, policy_version 972474 (0.00084) [2022-07-11 01:15:49,922][26022] Updated weights on worker 0-0, policy_version 972484 (0.00088) [2022-07-11 01:15:50,876][25689] Fps is (10 sec: 5560.9, 60 sec: 5503.2, 300 sec: 5530.3). Total num frames: 995827712. Throughput: 0: 5783.2. Samples: 995827942. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:50,876][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 01:15:52,033][26022] Updated weights on worker 0-0, policy_version 972494 (0.00094) [2022-07-11 01:15:53,821][26022] Updated weights on worker 0-0, policy_version 972504 (0.00088) [2022-07-11 01:15:55,555][26022] Updated weights on worker 0-0, policy_version 972514 (0.00085) [2022-07-11 01:15:55,896][25689] Fps is (10 sec: 5787.0, 60 sec: 5524.2, 300 sec: 5526.6). Total num frames: 995856384. Throughput: 0: 5794.3. Samples: 995861450. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:15:55,897][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 01:15:57,559][26022] Updated weights on worker 0-0, policy_version 972524 (0.00094) [2022-07-11 01:15:59,202][26022] Updated weights on worker 0-0, policy_version 972534 (0.00086) [2022-07-11 01:16:00,997][25689] Fps is (10 sec: 5463.5, 60 sec: 5506.9, 300 sec: 5531.7). Total num frames: 995883008. Throughput: 0: 4937.8. Samples: 995878044. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:00,997][25689] Avg episode reward: [(0, '0.141')] [2022-07-11 01:16:01,270][26022] Updated weights on worker 0-0, policy_version 972544 (0.00097) [2022-07-11 01:16:03,323][26022] Updated weights on worker 0-0, policy_version 972554 (0.00083) [2022-07-11 01:16:05,254][26022] Updated weights on worker 0-0, policy_version 972564 (0.00084) [2022-07-11 01:16:06,028][25689] Fps is (10 sec: 5154.7, 60 sec: 5495.8, 300 sec: 5518.8). Total num frames: 995908608. Throughput: 0: 5684.1. Samples: 995909422. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:06,028][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 01:16:06,941][26022] Updated weights on worker 0-0, policy_version 972574 (0.00094) [2022-07-11 01:16:08,975][26022] Updated weights on worker 0-0, policy_version 972584 (0.00085) [2022-07-11 01:16:10,752][26022] Updated weights on worker 0-0, policy_version 972594 (0.00088) [2022-07-11 01:16:11,103][25689] Fps is (10 sec: 5370.3, 60 sec: 5527.5, 300 sec: 5524.5). Total num frames: 995937280. Throughput: 0: 5683.9. Samples: 995942910. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:11,103][25689] Avg episode reward: [(0, '-0.681')] [2022-07-11 01:16:12,703][26022] Updated weights on worker 0-0, policy_version 972604 (0.00092) [2022-07-11 01:16:14,466][26022] Updated weights on worker 0-0, policy_version 972614 (0.00084) [2022-07-11 01:16:16,123][25689] Fps is (10 sec: 5680.3, 60 sec: 5511.2, 300 sec: 5524.4). Total num frames: 995965952. Throughput: 0: 4854.7. Samples: 995959644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:16,123][25689] Avg episode reward: [(0, '-0.798')] [2022-07-11 01:16:16,233][26022] Updated weights on worker 0-0, policy_version 972624 (0.00081) [2022-07-11 01:16:18,228][26022] Updated weights on worker 0-0, policy_version 972634 (0.00089) [2022-07-11 01:16:19,997][26022] Updated weights on worker 0-0, policy_version 972644 (0.00088) [2022-07-11 01:16:21,138][25689] Fps is (10 sec: 5510.2, 60 sec: 5493.3, 300 sec: 5521.5). Total num frames: 995992576. Throughput: 0: 5726.0. Samples: 995993372. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:21,139][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 01:16:21,836][26022] Updated weights on worker 0-0, policy_version 972654 (0.00085) [2022-07-11 01:16:23,555][26022] Updated weights on worker 0-0, policy_version 972664 (0.00084) [2022-07-11 01:16:25,263][26022] Updated weights on worker 0-0, policy_version 972674 (0.00091) [2022-07-11 01:16:26,215][25689] Fps is (10 sec: 5681.8, 60 sec: 5538.1, 300 sec: 5529.1). Total num frames: 996023296. Throughput: 0: 5830.3. Samples: 996027122. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:26,216][25689] Avg episode reward: [(0, '0.106')] [2022-07-11 01:16:27,400][26022] Updated weights on worker 0-0, policy_version 972684 (0.00086) [2022-07-11 01:16:29,058][26022] Updated weights on worker 0-0, policy_version 972694 (0.00084) [2022-07-11 01:16:31,034][26022] Updated weights on worker 0-0, policy_version 972704 (0.00090) [2022-07-11 01:16:31,279][25689] Fps is (10 sec: 5654.6, 60 sec: 5527.0, 300 sec: 5517.7). Total num frames: 996049920. Throughput: 0: 4993.8. Samples: 996043666. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:31,280][25689] Avg episode reward: [(0, '1.031')] [2022-07-11 01:16:32,656][26022] Updated weights on worker 0-0, policy_version 972714 (0.00090) [2022-07-11 01:16:34,607][26022] Updated weights on worker 0-0, policy_version 972724 (0.00086) [2022-07-11 01:16:36,295][25689] Fps is (10 sec: 5384.4, 60 sec: 5530.1, 300 sec: 5521.8). Total num frames: 996077568. Throughput: 0: 5826.1. Samples: 996077168. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:36,295][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 01:16:36,535][26022] Updated weights on worker 0-0, policy_version 972734 (0.00089) [2022-07-11 01:16:38,083][26022] Updated weights on worker 0-0, policy_version 972744 (0.00084) [2022-07-11 01:16:40,088][26022] Updated weights on worker 0-0, policy_version 972754 (0.00086) [2022-07-11 01:16:41,302][25689] Fps is (10 sec: 5823.6, 60 sec: 5563.6, 300 sec: 5530.0). Total num frames: 996108288. Throughput: 0: 5838.1. Samples: 996111090. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:41,302][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 01:16:41,627][26022] Updated weights on worker 0-0, policy_version 972764 (0.00086) [2022-07-11 01:16:43,729][26022] Updated weights on worker 0-0, policy_version 972774 (0.00088) [2022-07-11 01:16:45,890][26022] Updated weights on worker 0-0, policy_version 972784 (0.00097) [2022-07-11 01:16:46,359][25689] Fps is (10 sec: 5596.3, 60 sec: 5547.2, 300 sec: 5519.5). Total num frames: 996133888. Throughput: 0: 5828.3. Samples: 996144524. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:46,359][25689] Avg episode reward: [(0, '0.712')] [2022-07-11 01:16:47,355][26022] Updated weights on worker 0-0, policy_version 972794 (0.00090) [2022-07-11 01:16:49,267][26022] Updated weights on worker 0-0, policy_version 972804 (0.00094) [2022-07-11 01:16:50,936][26022] Updated weights on worker 0-0, policy_version 972814 (0.00090) [2022-07-11 01:16:51,484][25689] Fps is (10 sec: 5430.9, 60 sec: 5541.7, 300 sec: 5525.7). Total num frames: 996163584. Throughput: 0: 5817.2. Samples: 996161200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:51,484][25689] Avg episode reward: [(0, '-0.917')] [2022-07-11 01:16:53,002][26022] Updated weights on worker 0-0, policy_version 972824 (0.00090) [2022-07-11 01:16:54,800][26022] Updated weights on worker 0-0, policy_version 972834 (0.00087) [2022-07-11 01:16:56,559][25689] Fps is (10 sec: 5622.1, 60 sec: 5519.9, 300 sec: 5525.5). Total num frames: 996191232. Throughput: 0: 5808.1. Samples: 996194862. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 01:16:56,559][25689] Avg episode reward: [(0, '-0.647')] [2022-07-11 01:16:56,574][26022] Updated weights on worker 0-0, policy_version 972844 (0.00087) [2022-07-11 01:16:58,391][26022] Updated weights on worker 0-0, policy_version 972854 (0.00090) [2022-07-11 01:17:00,254][26022] Updated weights on worker 0-0, policy_version 972864 (0.00086) [2022-07-11 01:17:01,586][25689] Fps is (10 sec: 5473.3, 60 sec: 5543.4, 300 sec: 5528.7). Total num frames: 996218880. Throughput: 0: 5776.5. Samples: 996228264. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:01,587][25689] Avg episode reward: [(0, '-1.106')] [2022-07-11 01:17:02,499][26022] Updated weights on worker 0-0, policy_version 972874 (0.00086) [2022-07-11 01:17:04,275][26022] Updated weights on worker 0-0, policy_version 972884 (0.00094) [2022-07-11 01:17:06,265][26022] Updated weights on worker 0-0, policy_version 972894 (0.00082) [2022-07-11 01:17:06,639][25689] Fps is (10 sec: 5282.5, 60 sec: 5541.4, 300 sec: 5521.7). Total num frames: 996244480. Throughput: 0: 4856.0. Samples: 996243004. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:06,639][25689] Avg episode reward: [(0, '-0.861')] [2022-07-11 01:17:07,989][26022] Updated weights on worker 0-0, policy_version 972904 (0.00088) [2022-07-11 01:17:09,939][26022] Updated weights on worker 0-0, policy_version 972914 (0.00084) [2022-07-11 01:17:11,507][26022] Updated weights on worker 0-0, policy_version 972924 (0.00095) [2022-07-11 01:17:11,688][25689] Fps is (10 sec: 5575.5, 60 sec: 5577.6, 300 sec: 5531.7). Total num frames: 996275200. Throughput: 0: 5698.1. Samples: 996276326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:11,688][25689] Avg episode reward: [(0, '-1.309')] [2022-07-11 01:17:13,694][26022] Updated weights on worker 0-0, policy_version 972934 (0.00083) [2022-07-11 01:17:15,251][26022] Updated weights on worker 0-0, policy_version 972944 (0.00083) [2022-07-11 01:17:16,707][25689] Fps is (10 sec: 5797.5, 60 sec: 5560.8, 300 sec: 5528.5). Total num frames: 996302848. Throughput: 0: 5712.5. Samples: 996309958. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:16,707][25689] Avg episode reward: [(0, '0.215')] [2022-07-11 01:17:17,168][26022] Updated weights on worker 0-0, policy_version 972954 (0.00088) [2022-07-11 01:17:18,863][26022] Updated weights on worker 0-0, policy_version 972964 (0.00099) [2022-07-11 01:17:20,838][26022] Updated weights on worker 0-0, policy_version 972974 (0.00090) [2022-07-11 01:17:21,728][25689] Fps is (10 sec: 5405.3, 60 sec: 5560.2, 300 sec: 5525.0). Total num frames: 996329472. Throughput: 0: 4890.6. Samples: 996326770. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:21,729][25689] Avg episode reward: [(0, '0.063')] [2022-07-11 01:17:22,064][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:17:22,073][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000972980_996331520.pth [2022-07-11 01:17:22,091][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000971035_994339840.pth [2022-07-11 01:17:22,558][26022] Updated weights on worker 0-0, policy_version 972984 (0.00086) [2022-07-11 01:17:24,541][26022] Updated weights on worker 0-0, policy_version 972994 (0.00087) [2022-07-11 01:17:26,137][26022] Updated weights on worker 0-0, policy_version 973004 (0.00090) [2022-07-11 01:17:26,731][25689] Fps is (10 sec: 5618.4, 60 sec: 5550.2, 300 sec: 5533.0). Total num frames: 996359168. Throughput: 0: 5855.0. Samples: 996360642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:26,731][25689] Avg episode reward: [(0, '0.139')] [2022-07-11 01:17:28,186][26022] Updated weights on worker 0-0, policy_version 973014 (0.00087) [2022-07-11 01:17:29,911][26022] Updated weights on worker 0-0, policy_version 973024 (0.00088) [2022-07-11 01:17:31,728][26022] Updated weights on worker 0-0, policy_version 973034 (0.00088) [2022-07-11 01:17:31,765][25689] Fps is (10 sec: 5713.6, 60 sec: 5569.9, 300 sec: 5533.0). Total num frames: 996386816. Throughput: 0: 5858.9. Samples: 996393954. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:31,765][25689] Avg episode reward: [(0, '0.254')] [2022-07-11 01:17:33,608][26022] Updated weights on worker 0-0, policy_version 973044 (0.00089) [2022-07-11 01:17:35,354][26022] Updated weights on worker 0-0, policy_version 973054 (0.00094) [2022-07-11 01:17:36,776][25689] Fps is (10 sec: 5300.7, 60 sec: 5536.4, 300 sec: 5527.6). Total num frames: 996412416. Throughput: 0: 5024.8. Samples: 996410804. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:36,777][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 01:17:37,438][26022] Updated weights on worker 0-0, policy_version 973064 (0.00087) [2022-07-11 01:17:38,856][26022] Updated weights on worker 0-0, policy_version 973074 (0.00094) [2022-07-11 01:17:40,958][26022] Updated weights on worker 0-0, policy_version 973084 (0.00087) [2022-07-11 01:17:41,809][25689] Fps is (10 sec: 5607.0, 60 sec: 5534.0, 300 sec: 5541.2). Total num frames: 996443136. Throughput: 0: 5870.3. Samples: 996444650. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:41,810][25689] Avg episode reward: [(0, '-0.053')] [2022-07-11 01:17:42,857][26022] Updated weights on worker 0-0, policy_version 973094 (0.00087) [2022-07-11 01:17:44,683][26022] Updated weights on worker 0-0, policy_version 973104 (0.00096) [2022-07-11 01:17:46,511][26022] Updated weights on worker 0-0, policy_version 973114 (0.00085) [2022-07-11 01:17:46,814][25689] Fps is (10 sec: 5611.0, 60 sec: 5538.8, 300 sec: 5526.2). Total num frames: 996468736. Throughput: 0: 5863.0. Samples: 996478388. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:46,814][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 01:17:48,359][26022] Updated weights on worker 0-0, policy_version 973124 (0.00108) [2022-07-11 01:17:50,178][26022] Updated weights on worker 0-0, policy_version 973134 (0.00083) [2022-07-11 01:17:51,850][25689] Fps is (10 sec: 5404.9, 60 sec: 5529.9, 300 sec: 5529.5). Total num frames: 996497408. Throughput: 0: 5043.0. Samples: 996495246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:51,851][25689] Avg episode reward: [(0, '-0.174')] [2022-07-11 01:17:52,014][26022] Updated weights on worker 0-0, policy_version 973144 (0.00819) [2022-07-11 01:17:53,687][26022] Updated weights on worker 0-0, policy_version 973154 (0.00086) [2022-07-11 01:17:55,565][26022] Updated weights on worker 0-0, policy_version 973164 (0.00091) [2022-07-11 01:17:56,865][25689] Fps is (10 sec: 5806.8, 60 sec: 5569.4, 300 sec: 5541.2). Total num frames: 996527104. Throughput: 0: 5881.7. Samples: 996528962. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:17:56,866][25689] Avg episode reward: [(0, '-0.304')] [2022-07-11 01:17:57,408][26022] Updated weights on worker 0-0, policy_version 973174 (0.00076) [2022-07-11 01:17:59,279][26022] Updated weights on worker 0-0, policy_version 973184 (0.00087) [2022-07-11 01:18:01,262][26022] Updated weights on worker 0-0, policy_version 973194 (0.00094) [2022-07-11 01:18:01,879][25689] Fps is (10 sec: 5411.8, 60 sec: 5519.8, 300 sec: 5524.9). Total num frames: 996551680. Throughput: 0: 5810.8. Samples: 996561272. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:01,879][25689] Avg episode reward: [(0, '-0.528')] [2022-07-11 01:18:03,360][26022] Updated weights on worker 0-0, policy_version 973204 (0.00091) [2022-07-11 01:18:05,203][26022] Updated weights on worker 0-0, policy_version 973214 (0.00075) [2022-07-11 01:18:06,889][25689] Fps is (10 sec: 5312.3, 60 sec: 5574.6, 300 sec: 5532.5). Total num frames: 996580352. Throughput: 0: 4909.9. Samples: 996576956. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:06,890][25689] Avg episode reward: [(0, '-0.355')] [2022-07-11 01:18:07,107][26022] Updated weights on worker 0-0, policy_version 973224 (0.00087) [2022-07-11 01:18:08,947][26022] Updated weights on worker 0-0, policy_version 973234 (0.00095) [2022-07-11 01:18:10,680][26022] Updated weights on worker 0-0, policy_version 973244 (0.00086) [2022-07-11 01:18:11,936][25689] Fps is (10 sec: 5600.2, 60 sec: 5523.8, 300 sec: 5535.7). Total num frames: 996608000. Throughput: 0: 5741.5. Samples: 996610564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:11,937][25689] Avg episode reward: [(0, '-0.212')] [2022-07-11 01:18:12,583][26022] Updated weights on worker 0-0, policy_version 973254 (0.00085) [2022-07-11 01:18:14,306][26022] Updated weights on worker 0-0, policy_version 973264 (0.00845) [2022-07-11 01:18:16,240][26022] Updated weights on worker 0-0, policy_version 973274 (0.00085) [2022-07-11 01:18:16,968][25689] Fps is (10 sec: 5486.2, 60 sec: 5522.6, 300 sec: 5528.4). Total num frames: 996635648. Throughput: 0: 5744.4. Samples: 996644440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:16,968][25689] Avg episode reward: [(0, '0.103')] [2022-07-11 01:18:17,951][26022] Updated weights on worker 0-0, policy_version 973284 (0.00088) [2022-07-11 01:18:19,841][26022] Updated weights on worker 0-0, policy_version 973294 (0.00083) [2022-07-11 01:18:21,807][26022] Updated weights on worker 0-0, policy_version 973304 (0.00084) [2022-07-11 01:18:21,976][25689] Fps is (10 sec: 5711.2, 60 sec: 5574.8, 300 sec: 5535.6). Total num frames: 996665344. Throughput: 0: 4966.6. Samples: 996661088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:21,977][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 01:18:23,431][26022] Updated weights on worker 0-0, policy_version 973314 (0.00096) [2022-07-11 01:18:25,188][26022] Updated weights on worker 0-0, policy_version 973324 (0.00088) [2022-07-11 01:18:26,992][25689] Fps is (10 sec: 5720.9, 60 sec: 5539.6, 300 sec: 5541.4). Total num frames: 996692992. Throughput: 0: 5880.3. Samples: 996695166. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:26,992][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 01:18:27,110][26022] Updated weights on worker 0-0, policy_version 973334 (0.00094) [2022-07-11 01:18:29,028][26022] Updated weights on worker 0-0, policy_version 973344 (0.00091) [2022-07-11 01:18:30,939][26022] Updated weights on worker 0-0, policy_version 973354 (0.00086) [2022-07-11 01:18:32,039][25689] Fps is (10 sec: 5393.6, 60 sec: 5521.4, 300 sec: 5530.4). Total num frames: 996719616. Throughput: 0: 5846.0. Samples: 996728086. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:32,039][25689] Avg episode reward: [(0, '-1.002')] [2022-07-11 01:18:32,679][26022] Updated weights on worker 0-0, policy_version 973364 (0.00087) [2022-07-11 01:18:34,618][26022] Updated weights on worker 0-0, policy_version 973374 (0.00096) [2022-07-11 01:18:36,351][26022] Updated weights on worker 0-0, policy_version 973384 (0.00085) [2022-07-11 01:18:37,051][25689] Fps is (10 sec: 5497.1, 60 sec: 5572.4, 300 sec: 5534.0). Total num frames: 996748288. Throughput: 0: 5003.8. Samples: 996744930. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:37,051][25689] Avg episode reward: [(0, '-1.557')] [2022-07-11 01:18:38,259][26022] Updated weights on worker 0-0, policy_version 973394 (0.00079) [2022-07-11 01:18:39,951][26022] Updated weights on worker 0-0, policy_version 973404 (0.00094) [2022-07-11 01:18:41,942][26022] Updated weights on worker 0-0, policy_version 973414 (0.00087) [2022-07-11 01:18:42,061][25689] Fps is (10 sec: 5619.2, 60 sec: 5523.4, 300 sec: 5534.2). Total num frames: 996775936. Throughput: 0: 5841.6. Samples: 996778418. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:42,063][25689] Avg episode reward: [(0, '-1.498')] [2022-07-11 01:18:43,673][26022] Updated weights on worker 0-0, policy_version 973424 (0.00090) [2022-07-11 01:18:45,733][26022] Updated weights on worker 0-0, policy_version 973434 (0.00083) [2022-07-11 01:18:47,080][25689] Fps is (10 sec: 5513.7, 60 sec: 5556.2, 300 sec: 5531.6). Total num frames: 996803584. Throughput: 0: 5802.5. Samples: 996811726. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:47,081][25689] Avg episode reward: [(0, '-1.449')] [2022-07-11 01:18:47,370][26022] Updated weights on worker 0-0, policy_version 973444 (0.00082) [2022-07-11 01:18:49,351][26022] Updated weights on worker 0-0, policy_version 973454 (0.00086) [2022-07-11 01:18:50,991][26022] Updated weights on worker 0-0, policy_version 973464 (0.00090) [2022-07-11 01:18:52,147][25689] Fps is (10 sec: 5685.9, 60 sec: 5570.4, 300 sec: 5534.3). Total num frames: 996833280. Throughput: 0: 4990.1. Samples: 996828428. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:52,147][25689] Avg episode reward: [(0, '-2.046')] [2022-07-11 01:18:53,138][26022] Updated weights on worker 0-0, policy_version 973474 (0.00106) [2022-07-11 01:18:54,690][26022] Updated weights on worker 0-0, policy_version 973484 (0.00087) [2022-07-11 01:18:56,704][26022] Updated weights on worker 0-0, policy_version 973494 (0.00088) [2022-07-11 01:18:57,160][25689] Fps is (10 sec: 5587.0, 60 sec: 5519.5, 300 sec: 5534.6). Total num frames: 996859904. Throughput: 0: 5794.8. Samples: 996861460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:18:57,162][25689] Avg episode reward: [(0, '-0.556')] [2022-07-11 01:18:58,536][26022] Updated weights on worker 0-0, policy_version 973504 (0.00087) [2022-07-11 01:19:00,390][26022] Updated weights on worker 0-0, policy_version 973514 (0.00097) [2022-07-11 01:19:02,177][25689] Fps is (10 sec: 5104.4, 60 sec: 5519.2, 300 sec: 5524.1). Total num frames: 996884480. Throughput: 0: 5688.7. Samples: 996892852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:02,179][25689] Avg episode reward: [(0, '-0.290')] [2022-07-11 01:19:02,659][26022] Updated weights on worker 0-0, policy_version 973524 (0.00081) [2022-07-11 01:19:04,573][26022] Updated weights on worker 0-0, policy_version 973534 (0.00084) [2022-07-11 01:19:06,318][26022] Updated weights on worker 0-0, policy_version 973544 (0.00083) [2022-07-11 01:19:07,191][25689] Fps is (10 sec: 5410.5, 60 sec: 5535.9, 300 sec: 5534.9). Total num frames: 996914176. Throughput: 0: 4858.9. Samples: 996909444. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:07,192][25689] Avg episode reward: [(0, '0.117')] [2022-07-11 01:19:08,392][26022] Updated weights on worker 0-0, policy_version 973554 (0.00096) [2022-07-11 01:19:09,887][26022] Updated weights on worker 0-0, policy_version 973564 (0.00094) [2022-07-11 01:19:12,005][26022] Updated weights on worker 0-0, policy_version 973574 (0.00082) [2022-07-11 01:19:12,256][25689] Fps is (10 sec: 5587.9, 60 sec: 5517.2, 300 sec: 5531.1). Total num frames: 996940800. Throughput: 0: 5685.5. Samples: 996942760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:12,257][25689] Avg episode reward: [(0, '0.238')] [2022-07-11 01:19:13,559][26022] Updated weights on worker 0-0, policy_version 973584 (0.00088) [2022-07-11 01:19:15,624][26022] Updated weights on worker 0-0, policy_version 973594 (0.00090) [2022-07-11 01:19:17,185][26022] Updated weights on worker 0-0, policy_version 973604 (0.00084) [2022-07-11 01:19:17,271][25689] Fps is (10 sec: 5587.3, 60 sec: 5552.8, 300 sec: 5531.2). Total num frames: 996970496. Throughput: 0: 5713.8. Samples: 996976370. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:17,271][25689] Avg episode reward: [(0, '-0.584')] [2022-07-11 01:19:19,201][26022] Updated weights on worker 0-0, policy_version 973614 (0.00120) [2022-07-11 01:19:21,029][26022] Updated weights on worker 0-0, policy_version 973624 (0.00093) [2022-07-11 01:19:22,278][25689] Fps is (10 sec: 5619.8, 60 sec: 5501.9, 300 sec: 5531.3). Total num frames: 996997120. Throughput: 0: 4978.7. Samples: 996992926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:22,279][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 01:19:22,306][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:19:22,318][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000973631_996998144.pth [2022-07-11 01:19:22,318][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000971684_995004416.pth [2022-07-11 01:19:22,788][26022] Updated weights on worker 0-0, policy_version 973634 (0.00091) [2022-07-11 01:19:24,658][26022] Updated weights on worker 0-0, policy_version 973644 (0.00090) [2022-07-11 01:19:26,571][26022] Updated weights on worker 0-0, policy_version 973654 (0.00087) [2022-07-11 01:19:27,288][25689] Fps is (10 sec: 5417.9, 60 sec: 5502.4, 300 sec: 5528.5). Total num frames: 997024768. Throughput: 0: 5824.7. Samples: 997026504. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:27,289][25689] Avg episode reward: [(0, '0.005')] [2022-07-11 01:19:28,255][26022] Updated weights on worker 0-0, policy_version 973664 (0.00088) [2022-07-11 01:19:30,369][26022] Updated weights on worker 0-0, policy_version 973674 (0.00106) [2022-07-11 01:19:31,999][26022] Updated weights on worker 0-0, policy_version 973684 (0.00089) [2022-07-11 01:19:32,392][25689] Fps is (10 sec: 5568.4, 60 sec: 5531.1, 300 sec: 5526.7). Total num frames: 997053440. Throughput: 0: 5801.8. Samples: 997059584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:32,393][25689] Avg episode reward: [(0, '0.324')] [2022-07-11 01:19:34,098][26022] Updated weights on worker 0-0, policy_version 973694 (0.00089) [2022-07-11 01:19:35,815][26022] Updated weights on worker 0-0, policy_version 973704 (0.00096) [2022-07-11 01:19:37,407][25689] Fps is (10 sec: 5565.6, 60 sec: 5513.9, 300 sec: 5526.8). Total num frames: 997081088. Throughput: 0: 4975.4. Samples: 997076558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:37,408][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 01:19:37,479][26022] Updated weights on worker 0-0, policy_version 973714 (0.00092) [2022-07-11 01:19:39,403][26022] Updated weights on worker 0-0, policy_version 973724 (0.00092) [2022-07-11 01:19:41,167][26022] Updated weights on worker 0-0, policy_version 973734 (0.00097) [2022-07-11 01:19:42,431][25689] Fps is (10 sec: 5712.4, 60 sec: 5546.7, 300 sec: 5533.6). Total num frames: 997110784. Throughput: 0: 5834.5. Samples: 997110508. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:42,431][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 01:19:43,289][26022] Updated weights on worker 0-0, policy_version 973744 (0.00092) [2022-07-11 01:19:44,864][26022] Updated weights on worker 0-0, policy_version 973754 (0.00100) [2022-07-11 01:19:46,672][26022] Updated weights on worker 0-0, policy_version 973764 (0.00085) [2022-07-11 01:19:47,457][25689] Fps is (10 sec: 5706.0, 60 sec: 5545.9, 300 sec: 5532.7). Total num frames: 997138432. Throughput: 0: 5824.0. Samples: 997143970. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:47,467][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 01:19:48,588][26022] Updated weights on worker 0-0, policy_version 973774 (0.00085) [2022-07-11 01:19:50,243][26022] Updated weights on worker 0-0, policy_version 973784 (0.00087) [2022-07-11 01:19:52,371][26022] Updated weights on worker 0-0, policy_version 973794 (0.00092) [2022-07-11 01:19:52,552][25689] Fps is (10 sec: 5463.3, 60 sec: 5509.5, 300 sec: 5532.2). Total num frames: 997166080. Throughput: 0: 5008.6. Samples: 997160556. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:52,553][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 01:19:54,007][26022] Updated weights on worker 0-0, policy_version 973804 (0.00090) [2022-07-11 01:19:55,801][26022] Updated weights on worker 0-0, policy_version 973814 (0.00092) [2022-07-11 01:19:57,555][25689] Fps is (10 sec: 5577.7, 60 sec: 5544.3, 300 sec: 5537.3). Total num frames: 997194752. Throughput: 0: 5844.5. Samples: 997194308. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:19:57,555][25689] Avg episode reward: [(0, '1.376')] [2022-07-11 01:19:57,778][26022] Updated weights on worker 0-0, policy_version 973824 (0.00088) [2022-07-11 01:19:59,539][26022] Updated weights on worker 0-0, policy_version 973834 (0.00089) [2022-07-11 01:20:01,400][26022] Updated weights on worker 0-0, policy_version 973844 (0.00094) [2022-07-11 01:20:02,563][25689] Fps is (10 sec: 5319.1, 60 sec: 5545.2, 300 sec: 5532.1). Total num frames: 997219328. Throughput: 0: 5829.9. Samples: 997227876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:02,563][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 01:20:03,525][26022] Updated weights on worker 0-0, policy_version 973854 (0.00087) [2022-07-11 01:20:05,731][26022] Updated weights on worker 0-0, policy_version 973864 (0.00087) [2022-07-11 01:20:07,273][26022] Updated weights on worker 0-0, policy_version 973874 (0.00051) [2022-07-11 01:20:07,569][25689] Fps is (10 sec: 5419.3, 60 sec: 5545.8, 300 sec: 5543.2). Total num frames: 997249024. Throughput: 0: 4895.0. Samples: 997242418. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:07,570][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 01:20:09,383][26022] Updated weights on worker 0-0, policy_version 973884 (0.00083) [2022-07-11 01:20:11,024][26022] Updated weights on worker 0-0, policy_version 973894 (0.00131) [2022-07-11 01:20:12,705][25689] Fps is (10 sec: 5653.8, 60 sec: 5556.3, 300 sec: 5534.3). Total num frames: 997276672. Throughput: 0: 5713.8. Samples: 997275708. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:12,706][25689] Avg episode reward: [(0, '0.320')] [2022-07-11 01:20:12,926][26022] Updated weights on worker 0-0, policy_version 973904 (0.00088) [2022-07-11 01:20:14,589][26022] Updated weights on worker 0-0, policy_version 973914 (0.00088) [2022-07-11 01:20:16,666][26022] Updated weights on worker 0-0, policy_version 973924 (0.00855) [2022-07-11 01:20:17,730][25689] Fps is (10 sec: 5341.4, 60 sec: 5504.6, 300 sec: 5530.5). Total num frames: 997303296. Throughput: 0: 5688.5. Samples: 997309074. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:17,730][25689] Avg episode reward: [(0, '-0.265')] [2022-07-11 01:20:18,266][26022] Updated weights on worker 0-0, policy_version 973934 (0.00089) [2022-07-11 01:20:20,119][26022] Updated weights on worker 0-0, policy_version 973944 (0.00093) [2022-07-11 01:20:21,892][26022] Updated weights on worker 0-0, policy_version 973954 (0.00095) [2022-07-11 01:20:22,756][25689] Fps is (10 sec: 5603.7, 60 sec: 5553.6, 300 sec: 5537.1). Total num frames: 997332992. Throughput: 0: 4863.0. Samples: 997326076. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:22,758][25689] Avg episode reward: [(0, '-0.311')] [2022-07-11 01:20:23,835][26022] Updated weights on worker 0-0, policy_version 973964 (0.00096) [2022-07-11 01:20:25,559][26022] Updated weights on worker 0-0, policy_version 973974 (0.00087) [2022-07-11 01:20:27,423][26022] Updated weights on worker 0-0, policy_version 973984 (0.00089) [2022-07-11 01:20:27,797][25689] Fps is (10 sec: 5695.9, 60 sec: 5550.7, 300 sec: 5538.7). Total num frames: 997360640. Throughput: 0: 5807.0. Samples: 997359884. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:27,798][25689] Avg episode reward: [(0, '-0.445')] [2022-07-11 01:20:29,274][26022] Updated weights on worker 0-0, policy_version 973994 (0.00091) [2022-07-11 01:20:31,311][26022] Updated weights on worker 0-0, policy_version 974004 (0.00090) [2022-07-11 01:20:32,879][25689] Fps is (10 sec: 5563.8, 60 sec: 5552.9, 300 sec: 5541.6). Total num frames: 997389312. Throughput: 0: 5829.5. Samples: 997393308. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:32,879][25689] Avg episode reward: [(0, '-0.104')] [2022-07-11 01:20:33,021][26022] Updated weights on worker 0-0, policy_version 974014 (0.00103) [2022-07-11 01:20:34,936][26022] Updated weights on worker 0-0, policy_version 974024 (0.00091) [2022-07-11 01:20:36,544][26022] Updated weights on worker 0-0, policy_version 974034 (0.00096) [2022-07-11 01:20:37,897][25689] Fps is (10 sec: 5576.3, 60 sec: 5552.5, 300 sec: 5537.9). Total num frames: 997416960. Throughput: 0: 5838.6. Samples: 997426824. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:37,898][25689] Avg episode reward: [(0, '-1.634')] [2022-07-11 01:20:38,640][26022] Updated weights on worker 0-0, policy_version 974044 (0.00082) [2022-07-11 01:20:40,279][26022] Updated weights on worker 0-0, policy_version 974054 (0.00086) [2022-07-11 01:20:42,254][26022] Updated weights on worker 0-0, policy_version 974064 (0.00089) [2022-07-11 01:20:42,903][25689] Fps is (10 sec: 5618.5, 60 sec: 5537.2, 300 sec: 5545.8). Total num frames: 997445632. Throughput: 0: 5840.4. Samples: 997443742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:42,903][25689] Avg episode reward: [(0, '-1.891')] [2022-07-11 01:20:43,951][26022] Updated weights on worker 0-0, policy_version 974074 (0.00051) [2022-07-11 01:20:45,931][26022] Updated weights on worker 0-0, policy_version 974084 (0.00089) [2022-07-11 01:20:47,468][26022] Updated weights on worker 0-0, policy_version 974094 (0.00086) [2022-07-11 01:20:47,917][25689] Fps is (10 sec: 5722.8, 60 sec: 5555.2, 300 sec: 5543.3). Total num frames: 997474304. Throughput: 0: 5845.8. Samples: 997477502. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:47,918][25689] Avg episode reward: [(0, '-1.376')] [2022-07-11 01:20:49,647][26022] Updated weights on worker 0-0, policy_version 974104 (0.00093) [2022-07-11 01:20:51,234][26022] Updated weights on worker 0-0, policy_version 974114 (0.00093) [2022-07-11 01:20:53,012][25689] Fps is (10 sec: 5469.8, 60 sec: 5538.3, 300 sec: 5535.0). Total num frames: 997500928. Throughput: 0: 5841.4. Samples: 997510916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 01:20:53,012][25689] Avg episode reward: [(0, '-1.390')] [2022-07-11 01:20:53,220][26022] Updated weights on worker 0-0, policy_version 974124 (0.00092) [2022-07-11 01:20:55,121][26022] Updated weights on worker 0-0, policy_version 974134 (0.00088) [2022-07-11 01:20:56,627][26022] Updated weights on worker 0-0, policy_version 974144 (0.00091) [2022-07-11 01:20:58,060][25689] Fps is (10 sec: 5451.9, 60 sec: 5534.2, 300 sec: 5542.9). Total num frames: 997529600. Throughput: 0: 5010.3. Samples: 997527846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:20:58,062][25689] Avg episode reward: [(0, '-2.154')] [2022-07-11 01:20:58,832][26022] Updated weights on worker 0-0, policy_version 974154 (0.00608) [2022-07-11 01:21:00,367][26022] Updated weights on worker 0-0, policy_version 974164 (0.00096) [2022-07-11 01:21:02,636][26022] Updated weights on worker 0-0, policy_version 974174 (0.00086) [2022-07-11 01:21:03,136][25689] Fps is (10 sec: 5562.9, 60 sec: 5578.7, 300 sec: 5548.9). Total num frames: 997557248. Throughput: 0: 5788.9. Samples: 997560872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:03,138][25689] Avg episode reward: [(0, '-0.642')] [2022-07-11 01:21:04,544][26022] Updated weights on worker 0-0, policy_version 974184 (0.00089) [2022-07-11 01:21:06,225][26022] Updated weights on worker 0-0, policy_version 974194 (0.00091) [2022-07-11 01:21:08,149][25689] Fps is (10 sec: 5379.5, 60 sec: 5527.4, 300 sec: 5543.2). Total num frames: 997583872. Throughput: 0: 5694.8. Samples: 997592714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:08,149][25689] Avg episode reward: [(0, '-0.294')] [2022-07-11 01:21:08,348][26022] Updated weights on worker 0-0, policy_version 974204 (0.00096) [2022-07-11 01:21:09,940][26022] Updated weights on worker 0-0, policy_version 974214 (0.00084) [2022-07-11 01:21:11,900][26022] Updated weights on worker 0-0, policy_version 974224 (0.00089) [2022-07-11 01:21:13,222][25689] Fps is (10 sec: 5482.5, 60 sec: 5550.1, 300 sec: 5542.2). Total num frames: 997612544. Throughput: 0: 4886.8. Samples: 997609678. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:13,223][25689] Avg episode reward: [(0, '-0.038')] [2022-07-11 01:21:13,775][26022] Updated weights on worker 0-0, policy_version 974234 (0.00082) [2022-07-11 01:21:15,366][26022] Updated weights on worker 0-0, policy_version 974244 (0.00084) [2022-07-11 01:21:17,459][26022] Updated weights on worker 0-0, policy_version 974254 (0.00079) [2022-07-11 01:21:18,241][25689] Fps is (10 sec: 5681.6, 60 sec: 5584.4, 300 sec: 5549.0). Total num frames: 997641216. Throughput: 0: 5725.4. Samples: 997643394. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:18,242][25689] Avg episode reward: [(0, '-0.548')] [2022-07-11 01:21:19,130][26022] Updated weights on worker 0-0, policy_version 974264 (0.00096) [2022-07-11 01:21:21,106][26022] Updated weights on worker 0-0, policy_version 974274 (0.00093) [2022-07-11 01:21:22,529][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:21:22,541][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000974282_997664768.pth [2022-07-11 01:21:22,541][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000972331_995666944.pth [2022-07-11 01:21:22,873][26022] Updated weights on worker 0-0, policy_version 974284 (0.00090) [2022-07-11 01:21:23,261][25689] Fps is (10 sec: 5610.4, 60 sec: 5551.2, 300 sec: 5539.8). Total num frames: 997668864. Throughput: 0: 5763.6. Samples: 997676860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:23,261][25689] Avg episode reward: [(0, '-0.459')] [2022-07-11 01:21:24,720][26022] Updated weights on worker 0-0, policy_version 974294 (0.00096) [2022-07-11 01:21:26,508][26022] Updated weights on worker 0-0, policy_version 974304 (0.00092) [2022-07-11 01:21:28,263][25689] Fps is (10 sec: 5517.9, 60 sec: 5554.8, 300 sec: 5544.4). Total num frames: 997696512. Throughput: 0: 5018.3. Samples: 997693652. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:28,263][25689] Avg episode reward: [(0, '-0.543')] [2022-07-11 01:21:28,472][26022] Updated weights on worker 0-0, policy_version 974314 (0.00087) [2022-07-11 01:21:30,179][26022] Updated weights on worker 0-0, policy_version 974324 (0.00085) [2022-07-11 01:21:32,195][26022] Updated weights on worker 0-0, policy_version 974334 (0.00094) [2022-07-11 01:21:33,330][25689] Fps is (10 sec: 5491.7, 60 sec: 5539.2, 300 sec: 5543.4). Total num frames: 997724160. Throughput: 0: 5826.4. Samples: 997726832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:33,330][25689] Avg episode reward: [(0, '0.572')] [2022-07-11 01:21:33,855][26022] Updated weights on worker 0-0, policy_version 974344 (0.00100) [2022-07-11 01:21:35,947][26022] Updated weights on worker 0-0, policy_version 974354 (0.00091) [2022-07-11 01:21:37,503][26022] Updated weights on worker 0-0, policy_version 974364 (0.00082) [2022-07-11 01:21:38,414][25689] Fps is (10 sec: 5648.9, 60 sec: 5567.0, 300 sec: 5538.5). Total num frames: 997753856. Throughput: 0: 5791.8. Samples: 997760230. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:38,415][25689] Avg episode reward: [(0, '0.692')] [2022-07-11 01:21:39,616][26022] Updated weights on worker 0-0, policy_version 974374 (0.00091) [2022-07-11 01:21:40,956][26022] Updated weights on worker 0-0, policy_version 974384 (0.00097) [2022-07-11 01:21:43,146][26022] Updated weights on worker 0-0, policy_version 974394 (0.00090) [2022-07-11 01:21:43,433][25689] Fps is (10 sec: 5574.3, 60 sec: 5531.9, 300 sec: 5542.7). Total num frames: 997780480. Throughput: 0: 4968.6. Samples: 997777090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:43,434][25689] Avg episode reward: [(0, '-0.250')] [2022-07-11 01:21:44,735][26022] Updated weights on worker 0-0, policy_version 974404 (0.00083) [2022-07-11 01:21:46,838][26022] Updated weights on worker 0-0, policy_version 974414 (0.00088) [2022-07-11 01:21:48,439][25689] Fps is (10 sec: 5515.9, 60 sec: 5532.7, 300 sec: 5541.5). Total num frames: 997809152. Throughput: 0: 5812.4. Samples: 997810924. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:48,439][25689] Avg episode reward: [(0, '0.858')] [2022-07-11 01:21:48,538][26022] Updated weights on worker 0-0, policy_version 974424 (0.00090) [2022-07-11 01:21:50,570][26022] Updated weights on worker 0-0, policy_version 974434 (0.00090) [2022-07-11 01:21:52,211][26022] Updated weights on worker 0-0, policy_version 974444 (0.00088) [2022-07-11 01:21:53,528][25689] Fps is (10 sec: 5579.1, 60 sec: 5550.2, 300 sec: 5541.2). Total num frames: 997836800. Throughput: 0: 5821.0. Samples: 997844406. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:53,528][25689] Avg episode reward: [(0, '0.642')] [2022-07-11 01:21:54,219][26022] Updated weights on worker 0-0, policy_version 974454 (0.00086) [2022-07-11 01:21:55,719][26022] Updated weights on worker 0-0, policy_version 974464 (0.00090) [2022-07-11 01:21:57,800][26022] Updated weights on worker 0-0, policy_version 974474 (0.00114) [2022-07-11 01:21:58,544][25689] Fps is (10 sec: 5573.2, 60 sec: 5553.0, 300 sec: 5544.9). Total num frames: 997865472. Throughput: 0: 5021.7. Samples: 997861318. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:21:58,545][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 01:21:59,403][26022] Updated weights on worker 0-0, policy_version 974484 (0.00084) [2022-07-11 01:22:01,311][26022] Updated weights on worker 0-0, policy_version 974494 (0.00098) [2022-07-11 01:22:03,499][26022] Updated weights on worker 0-0, policy_version 974504 (0.00093) [2022-07-11 01:22:03,566][25689] Fps is (10 sec: 5508.7, 60 sec: 5541.1, 300 sec: 5548.9). Total num frames: 997892096. Throughput: 0: 5779.2. Samples: 997893440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:03,568][25689] Avg episode reward: [(0, '0.123')] [2022-07-11 01:22:05,383][26022] Updated weights on worker 0-0, policy_version 974514 (0.00089) [2022-07-11 01:22:07,262][26022] Updated weights on worker 0-0, policy_version 974524 (0.00094) [2022-07-11 01:22:08,577][25689] Fps is (10 sec: 5409.7, 60 sec: 5558.2, 300 sec: 5539.3). Total num frames: 997919744. Throughput: 0: 5733.4. Samples: 997926382. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:08,578][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 01:22:09,163][26022] Updated weights on worker 0-0, policy_version 974534 (0.00085) [2022-07-11 01:22:10,740][26022] Updated weights on worker 0-0, policy_version 974544 (0.00080) [2022-07-11 01:22:12,800][26022] Updated weights on worker 0-0, policy_version 974554 (0.00094) [2022-07-11 01:22:13,674][25689] Fps is (10 sec: 5571.7, 60 sec: 5556.0, 300 sec: 5541.2). Total num frames: 997948416. Throughput: 0: 4903.5. Samples: 997943192. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:13,675][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 01:22:14,636][26022] Updated weights on worker 0-0, policy_version 974564 (0.00085) [2022-07-11 01:22:16,275][26022] Updated weights on worker 0-0, policy_version 974574 (0.00084) [2022-07-11 01:22:18,175][26022] Updated weights on worker 0-0, policy_version 974584 (0.00083) [2022-07-11 01:22:18,681][25689] Fps is (10 sec: 5573.7, 60 sec: 5540.2, 300 sec: 5545.0). Total num frames: 997976064. Throughput: 0: 5738.0. Samples: 997976864. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:18,683][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 01:22:20,031][26022] Updated weights on worker 0-0, policy_version 974594 (0.00089) [2022-07-11 01:22:21,791][26022] Updated weights on worker 0-0, policy_version 974604 (0.00086) [2022-07-11 01:22:23,709][25689] Fps is (10 sec: 5510.3, 60 sec: 5539.4, 300 sec: 5537.6). Total num frames: 998003712. Throughput: 0: 5834.2. Samples: 998010960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:23,711][25689] Avg episode reward: [(0, '-0.495')] [2022-07-11 01:22:23,819][26022] Updated weights on worker 0-0, policy_version 974614 (0.00089) [2022-07-11 01:22:25,247][26022] Updated weights on worker 0-0, policy_version 974624 (0.00096) [2022-07-11 01:22:27,482][26022] Updated weights on worker 0-0, policy_version 974634 (0.00091) [2022-07-11 01:22:28,715][25689] Fps is (10 sec: 5715.0, 60 sec: 5572.9, 300 sec: 5545.0). Total num frames: 998033408. Throughput: 0: 5032.3. Samples: 998027726. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:28,716][25689] Avg episode reward: [(0, '0.165')] [2022-07-11 01:22:28,948][26022] Updated weights on worker 0-0, policy_version 974644 (0.00087) [2022-07-11 01:22:31,058][26022] Updated weights on worker 0-0, policy_version 974654 (0.00092) [2022-07-11 01:22:32,864][26022] Updated weights on worker 0-0, policy_version 974664 (0.00090) [2022-07-11 01:22:33,799][25689] Fps is (10 sec: 5581.9, 60 sec: 5554.4, 300 sec: 5547.1). Total num frames: 998060032. Throughput: 0: 5843.3. Samples: 998060788. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:33,799][25689] Avg episode reward: [(0, '0.397')] [2022-07-11 01:22:34,775][26022] Updated weights on worker 0-0, policy_version 974674 (0.00095) [2022-07-11 01:22:36,446][26022] Updated weights on worker 0-0, policy_version 974684 (0.00087) [2022-07-11 01:22:38,659][26022] Updated weights on worker 0-0, policy_version 974694 (0.00084) [2022-07-11 01:22:38,824][25689] Fps is (10 sec: 5267.6, 60 sec: 5509.1, 300 sec: 5533.5). Total num frames: 998086656. Throughput: 0: 5823.4. Samples: 998094162. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:38,824][25689] Avg episode reward: [(0, '-0.008')] [2022-07-11 01:22:40,117][26022] Updated weights on worker 0-0, policy_version 974704 (0.00088) [2022-07-11 01:22:42,102][26022] Updated weights on worker 0-0, policy_version 974714 (0.00088) [2022-07-11 01:22:43,828][25689] Fps is (10 sec: 5615.7, 60 sec: 5561.3, 300 sec: 5547.3). Total num frames: 998116352. Throughput: 0: 4966.2. Samples: 998110874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:43,830][25689] Avg episode reward: [(0, '0.163')] [2022-07-11 01:22:43,859][26022] Updated weights on worker 0-0, policy_version 974724 (0.00090) [2022-07-11 01:22:45,764][26022] Updated weights on worker 0-0, policy_version 974734 (0.00093) [2022-07-11 01:22:47,695][26022] Updated weights on worker 0-0, policy_version 974744 (0.00093) [2022-07-11 01:22:48,867][25689] Fps is (10 sec: 5709.5, 60 sec: 5541.2, 300 sec: 5543.8). Total num frames: 998144000. Throughput: 0: 5787.2. Samples: 998144350. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:48,868][25689] Avg episode reward: [(0, '-1.220')] [2022-07-11 01:22:49,384][26022] Updated weights on worker 0-0, policy_version 974754 (0.00085) [2022-07-11 01:22:51,220][26022] Updated weights on worker 0-0, policy_version 974764 (0.00089) [2022-07-11 01:22:53,037][26022] Updated weights on worker 0-0, policy_version 974774 (0.00081) [2022-07-11 01:22:53,935][25689] Fps is (10 sec: 5369.6, 60 sec: 5526.2, 300 sec: 5532.4). Total num frames: 998170624. Throughput: 0: 5803.5. Samples: 998177648. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:53,936][25689] Avg episode reward: [(0, '-0.338')] [2022-07-11 01:22:55,175][26022] Updated weights on worker 0-0, policy_version 974784 (0.00088) [2022-07-11 01:22:56,803][26022] Updated weights on worker 0-0, policy_version 974794 (0.00087) [2022-07-11 01:22:58,690][26022] Updated weights on worker 0-0, policy_version 974804 (0.00105) [2022-07-11 01:22:58,970][25689] Fps is (10 sec: 5575.0, 60 sec: 5541.5, 300 sec: 5549.3). Total num frames: 998200320. Throughput: 0: 4982.8. Samples: 998194544. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:22:58,970][25689] Avg episode reward: [(0, '-0.999')] [2022-07-11 01:23:00,510][26022] Updated weights on worker 0-0, policy_version 974814 (0.01139) [2022-07-11 01:23:02,894][26022] Updated weights on worker 0-0, policy_version 974824 (0.00085) [2022-07-11 01:23:03,980][25689] Fps is (10 sec: 5606.8, 60 sec: 5542.5, 300 sec: 5542.4). Total num frames: 998226944. Throughput: 0: 5708.6. Samples: 998225914. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:03,981][25689] Avg episode reward: [(0, '-1.010')] [2022-07-11 01:23:04,479][26022] Updated weights on worker 0-0, policy_version 974834 (0.00494) [2022-07-11 01:23:06,229][26022] Updated weights on worker 0-0, policy_version 974844 (0.00088) [2022-07-11 01:23:08,331][26022] Updated weights on worker 0-0, policy_version 974854 (0.00089) [2022-07-11 01:23:08,987][25689] Fps is (10 sec: 5315.3, 60 sec: 5525.9, 300 sec: 5539.7). Total num frames: 998253568. Throughput: 0: 5719.4. Samples: 998259424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:08,988][25689] Avg episode reward: [(0, '-0.779')] [2022-07-11 01:23:09,976][26022] Updated weights on worker 0-0, policy_version 974864 (0.00094) [2022-07-11 01:23:12,049][26022] Updated weights on worker 0-0, policy_version 974874 (0.00087) [2022-07-11 01:23:13,709][26022] Updated weights on worker 0-0, policy_version 974884 (0.00062) [2022-07-11 01:23:14,082][25689] Fps is (10 sec: 5473.7, 60 sec: 5526.1, 300 sec: 5541.9). Total num frames: 998282240. Throughput: 0: 4881.6. Samples: 998275998. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:14,083][25689] Avg episode reward: [(0, '-1.365')] [2022-07-11 01:23:15,711][26022] Updated weights on worker 0-0, policy_version 974894 (0.00083) [2022-07-11 01:23:17,580][26022] Updated weights on worker 0-0, policy_version 974904 (0.00087) [2022-07-11 01:23:19,179][25689] Fps is (10 sec: 5526.1, 60 sec: 5517.9, 300 sec: 5533.4). Total num frames: 998309888. Throughput: 0: 5682.3. Samples: 998309380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:19,180][25689] Avg episode reward: [(0, '-0.447')] [2022-07-11 01:23:19,339][26022] Updated weights on worker 0-0, policy_version 974914 (0.00093) [2022-07-11 01:23:20,964][26022] Updated weights on worker 0-0, policy_version 974924 (0.00089) [2022-07-11 01:23:22,583][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:23:22,598][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000974931_998329344.pth [2022-07-11 01:23:22,598][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000972980_996331520.pth [2022-07-11 01:23:23,197][26022] Updated weights on worker 0-0, policy_version 974934 (0.00090) [2022-07-11 01:23:24,199][25689] Fps is (10 sec: 5769.3, 60 sec: 5569.4, 300 sec: 5543.7). Total num frames: 998340608. Throughput: 0: 5789.1. Samples: 998342964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:24,201][25689] Avg episode reward: [(0, '-0.824')] [2022-07-11 01:23:24,828][26022] Updated weights on worker 0-0, policy_version 974944 (0.00090) [2022-07-11 01:23:26,794][26022] Updated weights on worker 0-0, policy_version 974954 (0.00094) [2022-07-11 01:23:28,488][26022] Updated weights on worker 0-0, policy_version 974964 (0.00086) [2022-07-11 01:23:29,202][25689] Fps is (10 sec: 5619.1, 60 sec: 5502.0, 300 sec: 5541.0). Total num frames: 998366208. Throughput: 0: 4947.2. Samples: 998359430. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:29,204][25689] Avg episode reward: [(0, '-0.788')] [2022-07-11 01:23:30,383][26022] Updated weights on worker 0-0, policy_version 974974 (0.00094) [2022-07-11 01:23:32,254][26022] Updated weights on worker 0-0, policy_version 974984 (0.00082) [2022-07-11 01:23:34,079][26022] Updated weights on worker 0-0, policy_version 974994 (0.00094) [2022-07-11 01:23:34,272][25689] Fps is (10 sec: 5387.6, 60 sec: 5537.1, 300 sec: 5539.9). Total num frames: 998394880. Throughput: 0: 5788.6. Samples: 998392872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:34,274][25689] Avg episode reward: [(0, '-0.789')] [2022-07-11 01:23:35,822][26022] Updated weights on worker 0-0, policy_version 975004 (0.00086) [2022-07-11 01:23:37,900][26022] Updated weights on worker 0-0, policy_version 975014 (0.00057) [2022-07-11 01:23:39,288][25689] Fps is (10 sec: 5584.2, 60 sec: 5554.9, 300 sec: 5539.9). Total num frames: 998422528. Throughput: 0: 5817.9. Samples: 998426370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:39,288][25689] Avg episode reward: [(0, '-0.396')] [2022-07-11 01:23:39,345][26022] Updated weights on worker 0-0, policy_version 975024 (0.00085) [2022-07-11 01:23:41,497][26022] Updated weights on worker 0-0, policy_version 975034 (0.00085) [2022-07-11 01:23:43,179][26022] Updated weights on worker 0-0, policy_version 975044 (0.00078) [2022-07-11 01:23:44,308][25689] Fps is (10 sec: 5510.0, 60 sec: 5519.6, 300 sec: 5539.8). Total num frames: 998450176. Throughput: 0: 5811.4. Samples: 998459826. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:44,310][25689] Avg episode reward: [(0, '0.069')] [2022-07-11 01:23:45,124][26022] Updated weights on worker 0-0, policy_version 975054 (0.00117) [2022-07-11 01:23:46,708][26022] Updated weights on worker 0-0, policy_version 975064 (0.00089) [2022-07-11 01:23:48,896][26022] Updated weights on worker 0-0, policy_version 975074 (0.00093) [2022-07-11 01:23:49,311][25689] Fps is (10 sec: 5516.9, 60 sec: 5522.9, 300 sec: 5534.1). Total num frames: 998477824. Throughput: 0: 5828.2. Samples: 998476628. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:49,311][25689] Avg episode reward: [(0, '0.428')] [2022-07-11 01:23:50,607][26022] Updated weights on worker 0-0, policy_version 975084 (0.00094) [2022-07-11 01:23:52,525][26022] Updated weights on worker 0-0, policy_version 975094 (0.00088) [2022-07-11 01:23:54,189][26022] Updated weights on worker 0-0, policy_version 975104 (0.00086) [2022-07-11 01:23:54,388][25689] Fps is (10 sec: 5688.9, 60 sec: 5572.9, 300 sec: 5543.3). Total num frames: 998507520. Throughput: 0: 5803.3. Samples: 998509608. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:54,388][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 01:23:56,186][26022] Updated weights on worker 0-0, policy_version 975114 (0.00086) [2022-07-11 01:23:58,012][26022] Updated weights on worker 0-0, policy_version 975124 (0.00096) [2022-07-11 01:23:59,424][25689] Fps is (10 sec: 5670.0, 60 sec: 5538.8, 300 sec: 5553.3). Total num frames: 998535168. Throughput: 0: 5808.7. Samples: 998543336. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:23:59,425][25689] Avg episode reward: [(0, '1.393')] [2022-07-11 01:23:59,836][26022] Updated weights on worker 0-0, policy_version 975134 (0.00079) [2022-07-11 01:24:02,015][26022] Updated weights on worker 0-0, policy_version 975144 (0.00084) [2022-07-11 01:24:03,974][26022] Updated weights on worker 0-0, policy_version 975154 (0.00858) [2022-07-11 01:24:04,439][25689] Fps is (10 sec: 5297.8, 60 sec: 5521.5, 300 sec: 5539.5). Total num frames: 998560768. Throughput: 0: 4879.9. Samples: 998558062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:04,441][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 01:24:05,812][26022] Updated weights on worker 0-0, policy_version 975164 (0.00092) [2022-07-11 01:24:07,591][26022] Updated weights on worker 0-0, policy_version 975174 (0.00096) [2022-07-11 01:24:09,375][26022] Updated weights on worker 0-0, policy_version 975184 (0.00085) [2022-07-11 01:24:09,522][25689] Fps is (10 sec: 5273.1, 60 sec: 5531.5, 300 sec: 5542.6). Total num frames: 998588416. Throughput: 0: 5693.8. Samples: 998591708. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:09,524][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 01:24:11,176][26022] Updated weights on worker 0-0, policy_version 975194 (0.00089) [2022-07-11 01:24:13,060][26022] Updated weights on worker 0-0, policy_version 975204 (0.00091) [2022-07-11 01:24:14,616][25689] Fps is (10 sec: 5634.4, 60 sec: 5548.5, 300 sec: 5541.1). Total num frames: 998618112. Throughput: 0: 5705.5. Samples: 998625020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:14,617][25689] Avg episode reward: [(0, '0.409')] [2022-07-11 01:24:14,836][26022] Updated weights on worker 0-0, policy_version 975214 (0.00084) [2022-07-11 01:24:16,719][26022] Updated weights on worker 0-0, policy_version 975224 (0.00098) [2022-07-11 01:24:18,550][26022] Updated weights on worker 0-0, policy_version 975234 (0.00089) [2022-07-11 01:24:19,687][25689] Fps is (10 sec: 5641.6, 60 sec: 5550.9, 300 sec: 5543.3). Total num frames: 998645760. Throughput: 0: 4848.3. Samples: 998641574. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:19,688][25689] Avg episode reward: [(0, '0.035')] [2022-07-11 01:24:20,560][26022] Updated weights on worker 0-0, policy_version 975244 (0.00090) [2022-07-11 01:24:22,196][26022] Updated weights on worker 0-0, policy_version 975254 (0.00087) [2022-07-11 01:24:24,155][26022] Updated weights on worker 0-0, policy_version 975264 (0.00105) [2022-07-11 01:24:24,696][25689] Fps is (10 sec: 5587.3, 60 sec: 5518.0, 300 sec: 5546.8). Total num frames: 998674432. Throughput: 0: 5788.8. Samples: 998675328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:24,697][25689] Avg episode reward: [(0, '-0.011')] [2022-07-11 01:24:25,780][26022] Updated weights on worker 0-0, policy_version 975274 (0.00093) [2022-07-11 01:24:27,796][26022] Updated weights on worker 0-0, policy_version 975284 (0.00088) [2022-07-11 01:24:29,720][25689] Fps is (10 sec: 5409.0, 60 sec: 5516.1, 300 sec: 5538.0). Total num frames: 998700032. Throughput: 0: 5795.9. Samples: 998708774. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:29,721][25689] Avg episode reward: [(0, '-0.078')] [2022-07-11 01:24:29,838][26022] Updated weights on worker 0-0, policy_version 975294 (0.00089) [2022-07-11 01:24:31,323][26022] Updated weights on worker 0-0, policy_version 975304 (0.00089) [2022-07-11 01:24:33,444][26022] Updated weights on worker 0-0, policy_version 975314 (0.00090) [2022-07-11 01:24:34,764][25689] Fps is (10 sec: 5594.0, 60 sec: 5552.4, 300 sec: 5547.8). Total num frames: 998730752. Throughput: 0: 4987.4. Samples: 998725506. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:34,765][25689] Avg episode reward: [(0, '0.440')] [2022-07-11 01:24:34,932][26022] Updated weights on worker 0-0, policy_version 975324 (0.00083) [2022-07-11 01:24:36,999][26022] Updated weights on worker 0-0, policy_version 975334 (0.00091) [2022-07-11 01:24:38,800][26022] Updated weights on worker 0-0, policy_version 975344 (0.00091) [2022-07-11 01:24:39,769][25689] Fps is (10 sec: 5604.6, 60 sec: 5519.4, 300 sec: 5534.3). Total num frames: 998756352. Throughput: 0: 5844.9. Samples: 998758954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:39,770][25689] Avg episode reward: [(0, '-0.279')] [2022-07-11 01:24:40,803][26022] Updated weights on worker 0-0, policy_version 975354 (0.00082) [2022-07-11 01:24:42,595][26022] Updated weights on worker 0-0, policy_version 975364 (0.00095) [2022-07-11 01:24:44,407][26022] Updated weights on worker 0-0, policy_version 975374 (0.00099) [2022-07-11 01:24:44,779][25689] Fps is (10 sec: 5419.3, 60 sec: 5537.4, 300 sec: 5538.1). Total num frames: 998785024. Throughput: 0: 5823.7. Samples: 998792282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:44,779][25689] Avg episode reward: [(0, '0.140')] [2022-07-11 01:24:46,127][26022] Updated weights on worker 0-0, policy_version 975384 (0.00082) [2022-07-11 01:24:48,098][26022] Updated weights on worker 0-0, policy_version 975394 (0.00088) [2022-07-11 01:24:49,783][25689] Fps is (10 sec: 5624.0, 60 sec: 5537.2, 300 sec: 5539.8). Total num frames: 998812672. Throughput: 0: 4993.3. Samples: 998808954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 01:24:49,784][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 01:24:50,011][26022] Updated weights on worker 0-0, policy_version 975404 (0.00083) [2022-07-11 01:24:51,767][26022] Updated weights on worker 0-0, policy_version 975414 (0.00074) [2022-07-11 01:24:53,584][26022] Updated weights on worker 0-0, policy_version 975424 (0.00093) [2022-07-11 01:24:54,919][25689] Fps is (10 sec: 5554.3, 60 sec: 5514.9, 300 sec: 5537.3). Total num frames: 998841344. Throughput: 0: 5807.6. Samples: 998842556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:24:54,920][25689] Avg episode reward: [(0, '0.251')] [2022-07-11 01:24:55,423][26022] Updated weights on worker 0-0, policy_version 975434 (0.00093) [2022-07-11 01:24:57,207][26022] Updated weights on worker 0-0, policy_version 975444 (0.00087) [2022-07-11 01:24:58,960][26022] Updated weights on worker 0-0, policy_version 975454 (0.00090) [2022-07-11 01:24:59,929][25689] Fps is (10 sec: 5551.2, 60 sec: 5517.3, 300 sec: 5547.6). Total num frames: 998868992. Throughput: 0: 5809.2. Samples: 998876066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:24:59,930][25689] Avg episode reward: [(0, '0.483')] [2022-07-11 01:25:00,802][26022] Updated weights on worker 0-0, policy_version 975464 (0.01035) [2022-07-11 01:25:02,948][26022] Updated weights on worker 0-0, policy_version 975474 (0.00095) [2022-07-11 01:25:04,911][26022] Updated weights on worker 0-0, policy_version 975484 (0.00092) [2022-07-11 01:25:04,945][25689] Fps is (10 sec: 5412.7, 60 sec: 5534.1, 300 sec: 5537.1). Total num frames: 998895616. Throughput: 0: 4898.1. Samples: 998891060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:04,946][25689] Avg episode reward: [(0, '0.963')] [2022-07-11 01:25:06,739][26022] Updated weights on worker 0-0, policy_version 975494 (0.00097) [2022-07-11 01:25:08,625][26022] Updated weights on worker 0-0, policy_version 975504 (0.00084) [2022-07-11 01:25:09,955][25689] Fps is (10 sec: 5412.8, 60 sec: 5540.8, 300 sec: 5539.4). Total num frames: 998923264. Throughput: 0: 5731.2. Samples: 998924564. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:09,956][25689] Avg episode reward: [(0, '1.469')] [2022-07-11 01:25:10,519][26022] Updated weights on worker 0-0, policy_version 975514 (0.00095) [2022-07-11 01:25:12,467][26022] Updated weights on worker 0-0, policy_version 975524 (0.00093) [2022-07-11 01:25:14,168][26022] Updated weights on worker 0-0, policy_version 975534 (0.00092) [2022-07-11 01:25:15,085][25689] Fps is (10 sec: 5453.3, 60 sec: 5503.6, 300 sec: 5540.9). Total num frames: 998950912. Throughput: 0: 5693.3. Samples: 998957372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:15,086][25689] Avg episode reward: [(0, '1.375')] [2022-07-11 01:25:16,024][26022] Updated weights on worker 0-0, policy_version 975544 (0.00079) [2022-07-11 01:25:17,860][26022] Updated weights on worker 0-0, policy_version 975554 (0.00083) [2022-07-11 01:25:19,795][26022] Updated weights on worker 0-0, policy_version 975564 (0.00831) [2022-07-11 01:25:20,103][25689] Fps is (10 sec: 5448.9, 60 sec: 5508.4, 300 sec: 5534.2). Total num frames: 998978560. Throughput: 0: 4853.5. Samples: 998973984. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:20,104][25689] Avg episode reward: [(0, '1.545')] [2022-07-11 01:25:21,719][26022] Updated weights on worker 0-0, policy_version 975574 (0.00084) [2022-07-11 01:25:22,635][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:25:22,650][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000975579_998992896.pth [2022-07-11 01:25:22,651][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000973631_996998144.pth [2022-07-11 01:25:23,588][26022] Updated weights on worker 0-0, policy_version 975584 (0.00091) [2022-07-11 01:25:25,206][25689] Fps is (10 sec: 5564.7, 60 sec: 5499.9, 300 sec: 5536.5). Total num frames: 999007232. Throughput: 0: 5730.0. Samples: 999007156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:25,207][25689] Avg episode reward: [(0, '0.545')] [2022-07-11 01:25:25,474][26022] Updated weights on worker 0-0, policy_version 975594 (0.00086) [2022-07-11 01:25:27,344][26022] Updated weights on worker 0-0, policy_version 975604 (0.00083) [2022-07-11 01:25:29,093][26022] Updated weights on worker 0-0, policy_version 975614 (0.00092) [2022-07-11 01:25:30,219][25689] Fps is (10 sec: 5567.8, 60 sec: 5534.8, 300 sec: 5534.3). Total num frames: 999034880. Throughput: 0: 5698.4. Samples: 999040032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:30,219][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 01:25:31,064][26022] Updated weights on worker 0-0, policy_version 975624 (0.00081) [2022-07-11 01:25:32,778][26022] Updated weights on worker 0-0, policy_version 975634 (0.00093) [2022-07-11 01:25:34,796][26022] Updated weights on worker 0-0, policy_version 975644 (0.00088) [2022-07-11 01:25:35,294][25689] Fps is (10 sec: 5380.0, 60 sec: 5464.3, 300 sec: 5529.8). Total num frames: 999061504. Throughput: 0: 4908.0. Samples: 999056556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:35,295][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 01:25:36,461][26022] Updated weights on worker 0-0, policy_version 975654 (0.00085) [2022-07-11 01:25:38,365][26022] Updated weights on worker 0-0, policy_version 975664 (0.00094) [2022-07-11 01:25:40,226][26022] Updated weights on worker 0-0, policy_version 975674 (0.00088) [2022-07-11 01:25:40,311][25689] Fps is (10 sec: 5580.3, 60 sec: 5530.8, 300 sec: 5533.0). Total num frames: 999091200. Throughput: 0: 5748.3. Samples: 999090146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:40,312][25689] Avg episode reward: [(0, '0.522')] [2022-07-11 01:25:42,017][26022] Updated weights on worker 0-0, policy_version 975684 (0.00080) [2022-07-11 01:25:43,963][26022] Updated weights on worker 0-0, policy_version 975694 (0.00092) [2022-07-11 01:25:45,390][25689] Fps is (10 sec: 5781.4, 60 sec: 5524.5, 300 sec: 5531.8). Total num frames: 999119872. Throughput: 0: 5763.5. Samples: 999123484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:45,390][25689] Avg episode reward: [(0, '0.771')] [2022-07-11 01:25:45,745][26022] Updated weights on worker 0-0, policy_version 975704 (0.00086) [2022-07-11 01:25:47,623][26022] Updated weights on worker 0-0, policy_version 975714 (0.00085) [2022-07-11 01:25:49,307][26022] Updated weights on worker 0-0, policy_version 975724 (0.00094) [2022-07-11 01:25:50,484][25689] Fps is (10 sec: 5435.6, 60 sec: 5499.5, 300 sec: 5531.8). Total num frames: 999146496. Throughput: 0: 4944.6. Samples: 999140236. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:50,485][25689] Avg episode reward: [(0, '1.388')] [2022-07-11 01:25:51,262][26022] Updated weights on worker 0-0, policy_version 975734 (0.00089) [2022-07-11 01:25:52,960][26022] Updated weights on worker 0-0, policy_version 975744 (0.00108) [2022-07-11 01:25:54,908][26022] Updated weights on worker 0-0, policy_version 975754 (0.00094) [2022-07-11 01:25:55,558][25689] Fps is (10 sec: 5438.2, 60 sec: 5505.1, 300 sec: 5531.4). Total num frames: 999175168. Throughput: 0: 5785.8. Samples: 999173798. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:25:55,559][25689] Avg episode reward: [(0, '1.766')] [2022-07-11 01:25:56,507][26022] Updated weights on worker 0-0, policy_version 975764 (0.00087) [2022-07-11 01:25:58,676][26022] Updated weights on worker 0-0, policy_version 975774 (0.00083) [2022-07-11 01:26:00,132][26022] Updated weights on worker 0-0, policy_version 975784 (0.00093) [2022-07-11 01:26:00,559][25689] Fps is (10 sec: 5691.9, 60 sec: 5522.8, 300 sec: 5536.2). Total num frames: 999203840. Throughput: 0: 5808.6. Samples: 999207756. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:00,560][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 01:26:02,537][26022] Updated weights on worker 0-0, policy_version 975794 (0.00101) [2022-07-11 01:26:04,327][26022] Updated weights on worker 0-0, policy_version 975804 (0.00084) [2022-07-11 01:26:05,607][25689] Fps is (10 sec: 5400.9, 60 sec: 5503.1, 300 sec: 5532.1). Total num frames: 999229440. Throughput: 0: 5724.9. Samples: 999239224. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:05,608][25689] Avg episode reward: [(0, '0.783')] [2022-07-11 01:26:06,150][26022] Updated weights on worker 0-0, policy_version 975814 (0.00094) [2022-07-11 01:26:07,736][26022] Updated weights on worker 0-0, policy_version 975824 (0.00090) [2022-07-11 01:26:09,878][26022] Updated weights on worker 0-0, policy_version 975834 (0.00093) [2022-07-11 01:26:10,635][25689] Fps is (10 sec: 5284.8, 60 sec: 5501.5, 300 sec: 5529.5). Total num frames: 999257088. Throughput: 0: 5747.0. Samples: 999256040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:10,635][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 01:26:11,672][26022] Updated weights on worker 0-0, policy_version 975844 (0.00086) [2022-07-11 01:26:13,635][26022] Updated weights on worker 0-0, policy_version 975854 (0.00089) [2022-07-11 01:26:15,368][26022] Updated weights on worker 0-0, policy_version 975864 (0.00087) [2022-07-11 01:26:15,762][25689] Fps is (10 sec: 5646.5, 60 sec: 5535.4, 300 sec: 5530.9). Total num frames: 999286784. Throughput: 0: 5721.7. Samples: 999289402. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:15,763][25689] Avg episode reward: [(0, '0.579')] [2022-07-11 01:26:17,206][26022] Updated weights on worker 0-0, policy_version 975874 (0.00089) [2022-07-11 01:26:18,971][26022] Updated weights on worker 0-0, policy_version 975884 (0.00087) [2022-07-11 01:26:20,792][25689] Fps is (10 sec: 5746.5, 60 sec: 5551.2, 300 sec: 5534.2). Total num frames: 999315456. Throughput: 0: 5704.6. Samples: 999323176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:20,792][25689] Avg episode reward: [(0, '-1.371')] [2022-07-11 01:26:20,795][26022] Updated weights on worker 0-0, policy_version 975894 (0.00096) [2022-07-11 01:26:22,665][26022] Updated weights on worker 0-0, policy_version 975904 (0.00092) [2022-07-11 01:26:24,408][26022] Updated weights on worker 0-0, policy_version 975914 (0.00088) [2022-07-11 01:26:25,841][25689] Fps is (10 sec: 5588.2, 60 sec: 5539.3, 300 sec: 5533.3). Total num frames: 999343104. Throughput: 0: 4988.4. Samples: 999340160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:25,843][25689] Avg episode reward: [(0, '-0.731')] [2022-07-11 01:26:26,565][26022] Updated weights on worker 0-0, policy_version 975924 (0.00084) [2022-07-11 01:26:27,925][26022] Updated weights on worker 0-0, policy_version 975934 (0.00093) [2022-07-11 01:26:30,113][26022] Updated weights on worker 0-0, policy_version 975944 (0.00093) [2022-07-11 01:26:30,880][25689] Fps is (10 sec: 5481.4, 60 sec: 5536.9, 300 sec: 5533.8). Total num frames: 999370752. Throughput: 0: 5812.1. Samples: 999373708. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:30,880][25689] Avg episode reward: [(0, '-0.470')] [2022-07-11 01:26:31,689][26022] Updated weights on worker 0-0, policy_version 975954 (0.00085) [2022-07-11 01:26:33,687][26022] Updated weights on worker 0-0, policy_version 975964 (0.00085) [2022-07-11 01:26:35,357][26022] Updated weights on worker 0-0, policy_version 975974 (0.00094) [2022-07-11 01:26:35,996][25689] Fps is (10 sec: 5445.4, 60 sec: 5550.1, 300 sec: 5526.4). Total num frames: 999398400. Throughput: 0: 5830.1. Samples: 999407364. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:35,996][25689] Avg episode reward: [(0, '-0.467')] [2022-07-11 01:26:37,167][26022] Updated weights on worker 0-0, policy_version 975984 (0.00084) [2022-07-11 01:26:39,154][26022] Updated weights on worker 0-0, policy_version 975994 (0.00089) [2022-07-11 01:26:41,015][25689] Fps is (10 sec: 5557.0, 60 sec: 5533.0, 300 sec: 5533.2). Total num frames: 999427072. Throughput: 0: 4991.0. Samples: 999424108. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:41,015][25689] Avg episode reward: [(0, '-0.436')] [2022-07-11 01:26:41,023][26022] Updated weights on worker 0-0, policy_version 976004 (0.00092) [2022-07-11 01:26:42,663][26022] Updated weights on worker 0-0, policy_version 976014 (0.00093) [2022-07-11 01:26:44,766][26022] Updated weights on worker 0-0, policy_version 976024 (0.00089) [2022-07-11 01:26:46,050][25689] Fps is (10 sec: 5805.2, 60 sec: 5553.8, 300 sec: 5536.1). Total num frames: 999456768. Throughput: 0: 5816.8. Samples: 999457712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:46,052][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 01:26:46,476][26022] Updated weights on worker 0-0, policy_version 976034 (0.00092) [2022-07-11 01:26:48,375][26022] Updated weights on worker 0-0, policy_version 976044 (0.00083) [2022-07-11 01:26:50,287][26022] Updated weights on worker 0-0, policy_version 976054 (0.00083) [2022-07-11 01:26:51,053][25689] Fps is (10 sec: 5508.6, 60 sec: 5545.3, 300 sec: 5530.9). Total num frames: 999482368. Throughput: 0: 5819.8. Samples: 999491112. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:51,054][25689] Avg episode reward: [(0, '0.648')] [2022-07-11 01:26:52,027][26022] Updated weights on worker 0-0, policy_version 976064 (0.00083) [2022-07-11 01:26:53,996][26022] Updated weights on worker 0-0, policy_version 976074 (0.00085) [2022-07-11 01:26:55,636][26022] Updated weights on worker 0-0, policy_version 976084 (0.00093) [2022-07-11 01:26:56,190][25689] Fps is (10 sec: 5554.3, 60 sec: 5573.3, 300 sec: 5535.5). Total num frames: 999513088. Throughput: 0: 4986.8. Samples: 999508070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:26:56,190][25689] Avg episode reward: [(0, '0.448')] [2022-07-11 01:26:57,559][26022] Updated weights on worker 0-0, policy_version 976094 (0.00094) [2022-07-11 01:26:59,356][26022] Updated weights on worker 0-0, policy_version 976104 (0.00087) [2022-07-11 01:27:01,209][25689] Fps is (10 sec: 5646.5, 60 sec: 5537.9, 300 sec: 5535.5). Total num frames: 999539712. Throughput: 0: 5836.5. Samples: 999541968. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:01,209][25689] Avg episode reward: [(0, '0.255')] [2022-07-11 01:27:01,222][26022] Updated weights on worker 0-0, policy_version 976114 (0.00086) [2022-07-11 01:27:03,334][26022] Updated weights on worker 0-0, policy_version 976124 (0.00101) [2022-07-11 01:27:05,267][26022] Updated weights on worker 0-0, policy_version 976134 (0.00088) [2022-07-11 01:27:06,224][25689] Fps is (10 sec: 5306.6, 60 sec: 5557.7, 300 sec: 5532.0). Total num frames: 999566336. Throughput: 0: 5730.2. Samples: 999573314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:06,225][25689] Avg episode reward: [(0, '0.529')] [2022-07-11 01:27:06,779][26022] Updated weights on worker 0-0, policy_version 976144 (0.00084) [2022-07-11 01:27:08,972][26022] Updated weights on worker 0-0, policy_version 976154 (0.00086) [2022-07-11 01:27:10,708][26022] Updated weights on worker 0-0, policy_version 976164 (0.00088) [2022-07-11 01:27:11,265][25689] Fps is (10 sec: 5498.4, 60 sec: 5573.4, 300 sec: 5533.1). Total num frames: 999595008. Throughput: 0: 4900.7. Samples: 999590166. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:11,266][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 01:27:12,500][26022] Updated weights on worker 0-0, policy_version 976174 (0.00091) [2022-07-11 01:27:14,478][26022] Updated weights on worker 0-0, policy_version 976184 (0.00962) [2022-07-11 01:27:16,155][26022] Updated weights on worker 0-0, policy_version 976194 (0.00093) [2022-07-11 01:27:16,337][25689] Fps is (10 sec: 5670.6, 60 sec: 5561.7, 300 sec: 5535.3). Total num frames: 999623680. Throughput: 0: 5727.9. Samples: 999623470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:16,337][25689] Avg episode reward: [(0, '1.397')] [2022-07-11 01:27:18,207][26022] Updated weights on worker 0-0, policy_version 976204 (0.00087) [2022-07-11 01:27:19,908][26022] Updated weights on worker 0-0, policy_version 976214 (0.00094) [2022-07-11 01:27:21,377][25689] Fps is (10 sec: 5569.5, 60 sec: 5543.8, 300 sec: 5535.1). Total num frames: 999651328. Throughput: 0: 5702.9. Samples: 999656990. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:21,378][25689] Avg episode reward: [(0, '1.445')] [2022-07-11 01:27:21,690][26022] Updated weights on worker 0-0, policy_version 976224 (0.00091) [2022-07-11 01:27:22,753][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:27:22,773][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000976230_999659520.pth [2022-07-11 01:27:22,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000974282_997664768.pth [2022-07-11 01:27:23,678][26022] Updated weights on worker 0-0, policy_version 976234 (0.00088) [2022-07-11 01:27:25,325][26022] Updated weights on worker 0-0, policy_version 976244 (0.00094) [2022-07-11 01:27:26,422][25689] Fps is (10 sec: 5381.5, 60 sec: 5527.3, 300 sec: 5524.0). Total num frames: 999677952. Throughput: 0: 4988.4. Samples: 999674068. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:26,422][25689] Avg episode reward: [(0, '1.518')] [2022-07-11 01:27:27,243][26022] Updated weights on worker 0-0, policy_version 976254 (0.00088) [2022-07-11 01:27:29,080][26022] Updated weights on worker 0-0, policy_version 976264 (0.00104) [2022-07-11 01:27:30,751][26022] Updated weights on worker 0-0, policy_version 976274 (0.00093) [2022-07-11 01:27:31,497][25689] Fps is (10 sec: 5464.1, 60 sec: 5540.8, 300 sec: 5531.1). Total num frames: 999706624. Throughput: 0: 5794.3. Samples: 999707396. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:31,498][25689] Avg episode reward: [(0, '-0.068')] [2022-07-11 01:27:32,789][26022] Updated weights on worker 0-0, policy_version 976284 (0.00112) [2022-07-11 01:27:34,608][26022] Updated weights on worker 0-0, policy_version 976294 (0.00091) [2022-07-11 01:27:36,379][26022] Updated weights on worker 0-0, policy_version 976304 (0.00084) [2022-07-11 01:27:36,560][25689] Fps is (10 sec: 5757.2, 60 sec: 5579.5, 300 sec: 5540.7). Total num frames: 999736320. Throughput: 0: 5803.9. Samples: 999740844. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:36,561][25689] Avg episode reward: [(0, '-0.409')] [2022-07-11 01:27:38,413][26022] Updated weights on worker 0-0, policy_version 976314 (0.00094) [2022-07-11 01:27:40,018][26022] Updated weights on worker 0-0, policy_version 976324 (0.00085) [2022-07-11 01:27:41,595][25689] Fps is (10 sec: 5679.1, 60 sec: 5561.2, 300 sec: 5533.2). Total num frames: 999763968. Throughput: 0: 4982.7. Samples: 999757726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:41,595][25689] Avg episode reward: [(0, '-0.354')] [2022-07-11 01:27:41,994][26022] Updated weights on worker 0-0, policy_version 976334 (0.00085) [2022-07-11 01:27:43,699][26022] Updated weights on worker 0-0, policy_version 976344 (0.00090) [2022-07-11 01:27:45,627][26022] Updated weights on worker 0-0, policy_version 976354 (0.00091) [2022-07-11 01:27:46,621][25689] Fps is (10 sec: 5496.3, 60 sec: 5528.2, 300 sec: 5533.5). Total num frames: 999791616. Throughput: 0: 5789.7. Samples: 999791014. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:46,622][25689] Avg episode reward: [(0, '0.008')] [2022-07-11 01:27:47,353][26022] Updated weights on worker 0-0, policy_version 976364 (0.00542) [2022-07-11 01:27:49,385][26022] Updated weights on worker 0-0, policy_version 976374 (0.00086) [2022-07-11 01:27:51,101][26022] Updated weights on worker 0-0, policy_version 976384 (0.00102) [2022-07-11 01:27:51,635][25689] Fps is (10 sec: 5609.6, 60 sec: 5577.9, 300 sec: 5541.4). Total num frames: 999820288. Throughput: 0: 5820.9. Samples: 999824612. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:51,635][25689] Avg episode reward: [(0, '0.245')] [2022-07-11 01:27:53,039][26022] Updated weights on worker 0-0, policy_version 976394 (0.00969) [2022-07-11 01:27:54,709][26022] Updated weights on worker 0-0, policy_version 976404 (0.00093) [2022-07-11 01:27:56,734][25689] Fps is (10 sec: 5467.8, 60 sec: 5513.8, 300 sec: 5529.9). Total num frames: 999846912. Throughput: 0: 4981.3. Samples: 999841334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:27:56,735][25689] Avg episode reward: [(0, '0.904')] [2022-07-11 01:27:56,863][26022] Updated weights on worker 0-0, policy_version 976414 (0.00090) [2022-07-11 01:27:58,363][26022] Updated weights on worker 0-0, policy_version 976424 (0.00081) [2022-07-11 01:28:00,349][26022] Updated weights on worker 0-0, policy_version 976434 (0.00086) [2022-07-11 01:28:01,755][25689] Fps is (10 sec: 5362.8, 60 sec: 5530.5, 300 sec: 5533.1). Total num frames: 999874560. Throughput: 0: 5820.0. Samples: 999875056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:01,755][25689] Avg episode reward: [(0, '1.355')] [2022-07-11 01:28:02,475][26022] Updated weights on worker 0-0, policy_version 976444 (0.00086) [2022-07-11 01:28:04,381][26022] Updated weights on worker 0-0, policy_version 976454 (0.00092) [2022-07-11 01:28:06,267][26022] Updated weights on worker 0-0, policy_version 976464 (0.00093) [2022-07-11 01:28:06,770][25689] Fps is (10 sec: 5407.8, 60 sec: 5530.5, 300 sec: 5533.0). Total num frames: 999901184. Throughput: 0: 5725.8. Samples: 999906382. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:06,770][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 01:28:07,869][26022] Updated weights on worker 0-0, policy_version 976474 (0.00082) [2022-07-11 01:28:09,913][26022] Updated weights on worker 0-0, policy_version 976484 (0.00088) [2022-07-11 01:28:11,735][26022] Updated weights on worker 0-0, policy_version 976494 (0.00068) [2022-07-11 01:28:11,825][25689] Fps is (10 sec: 5491.1, 60 sec: 5529.2, 300 sec: 5533.7). Total num frames: 999929856. Throughput: 0: 5709.7. Samples: 999939892. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:11,825][25689] Avg episode reward: [(0, '0.751')] [2022-07-11 01:28:13,494][26022] Updated weights on worker 0-0, policy_version 976504 (0.00095) [2022-07-11 01:28:15,571][26022] Updated weights on worker 0-0, policy_version 976514 (0.00090) [2022-07-11 01:28:16,878][25689] Fps is (10 sec: 5571.7, 60 sec: 5514.0, 300 sec: 5534.5). Total num frames: 999957504. Throughput: 0: 5707.6. Samples: 999956308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:16,878][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 01:28:17,181][26022] Updated weights on worker 0-0, policy_version 976524 (0.00091) [2022-07-11 01:28:18,977][26022] Updated weights on worker 0-0, policy_version 976534 (0.00092) [2022-07-11 01:28:21,154][26022] Updated weights on worker 0-0, policy_version 976544 (0.00084) [2022-07-11 01:28:21,892][25689] Fps is (10 sec: 5594.5, 60 sec: 5533.4, 300 sec: 5527.7). Total num frames: 999986176. Throughput: 0: 5694.5. Samples: 999989726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:21,892][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 01:28:22,524][26022] Updated weights on worker 0-0, policy_version 976554 (0.00094) [2022-07-11 01:28:24,654][26022] Updated weights on worker 0-0, policy_version 976564 (0.00087) [2022-07-11 01:28:26,371][26022] Updated weights on worker 0-0, policy_version 976574 (0.00090) [2022-07-11 01:28:26,930][25689] Fps is (10 sec: 5500.8, 60 sec: 5533.9, 300 sec: 5530.5). Total num frames: 1000012800. Throughput: 0: 5788.0. Samples: 1000023072. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:26,931][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 01:28:28,269][26022] Updated weights on worker 0-0, policy_version 976584 (0.00088) [2022-07-11 01:28:30,219][26022] Updated weights on worker 0-0, policy_version 976594 (0.00087) [2022-07-11 01:28:31,943][25689] Fps is (10 sec: 5501.3, 60 sec: 5539.6, 300 sec: 5531.6). Total num frames: 1000041472. Throughput: 0: 4968.5. Samples: 1000039850. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:31,945][25689] Avg episode reward: [(0, '0.663')] [2022-07-11 01:28:32,093][26022] Updated weights on worker 0-0, policy_version 976604 (0.00088) [2022-07-11 01:28:34,010][26022] Updated weights on worker 0-0, policy_version 976614 (0.00091) [2022-07-11 01:28:35,660][26022] Updated weights on worker 0-0, policy_version 976624 (0.00091) [2022-07-11 01:28:37,021][25689] Fps is (10 sec: 5682.6, 60 sec: 5521.3, 300 sec: 5533.9). Total num frames: 1000070144. Throughput: 0: 5810.0. Samples: 1000073342. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:37,022][25689] Avg episode reward: [(0, '0.092')] [2022-07-11 01:28:37,460][26022] Updated weights on worker 0-0, policy_version 976634 (0.00083) [2022-07-11 01:28:39,272][26022] Updated weights on worker 0-0, policy_version 976644 (0.00086) [2022-07-11 01:28:41,222][26022] Updated weights on worker 0-0, policy_version 976654 (0.00086) [2022-07-11 01:28:42,066][25689] Fps is (10 sec: 5563.4, 60 sec: 5520.3, 300 sec: 5533.4). Total num frames: 1000097792. Throughput: 0: 5819.0. Samples: 1000107122. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:42,067][25689] Avg episode reward: [(0, '0.003')] [2022-07-11 01:28:42,866][26022] Updated weights on worker 0-0, policy_version 976664 (0.00092) [2022-07-11 01:28:44,832][26022] Updated weights on worker 0-0, policy_version 976674 (0.00085) [2022-07-11 01:28:46,565][26022] Updated weights on worker 0-0, policy_version 976684 (0.00083) [2022-07-11 01:28:47,087][25689] Fps is (10 sec: 5595.5, 60 sec: 5537.8, 300 sec: 5536.5). Total num frames: 1000126464. Throughput: 0: 4991.4. Samples: 1000123680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 01:28:47,087][25689] Avg episode reward: [(0, '0.005')] [2022-07-11 01:28:48,631][26022] Updated weights on worker 0-0, policy_version 976694 (0.00095) [2022-07-11 01:28:50,382][26022] Updated weights on worker 0-0, policy_version 976704 (0.00086) [2022-07-11 01:28:52,095][25689] Fps is (10 sec: 5513.9, 60 sec: 5504.4, 300 sec: 5527.5). Total num frames: 1000153088. Throughput: 0: 5812.7. Samples: 1000156986. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:28:52,097][25689] Avg episode reward: [(0, '-0.169')] [2022-07-11 01:28:52,284][26022] Updated weights on worker 0-0, policy_version 976714 (0.00108) [2022-07-11 01:28:54,030][26022] Updated weights on worker 0-0, policy_version 976724 (0.00090) [2022-07-11 01:28:56,071][26022] Updated weights on worker 0-0, policy_version 976734 (0.00091) [2022-07-11 01:28:57,158][25689] Fps is (10 sec: 5490.6, 60 sec: 5541.6, 300 sec: 5530.4). Total num frames: 1000181760. Throughput: 0: 5788.1. Samples: 1000189892. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:28:57,158][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 01:28:57,780][26022] Updated weights on worker 0-0, policy_version 976744 (0.00086) [2022-07-11 01:28:59,611][26022] Updated weights on worker 0-0, policy_version 976754 (0.00095) [2022-07-11 01:29:01,520][26022] Updated weights on worker 0-0, policy_version 976764 (0.00090) [2022-07-11 01:29:02,163][25689] Fps is (10 sec: 5492.3, 60 sec: 5526.2, 300 sec: 5534.0). Total num frames: 1000208384. Throughput: 0: 4951.8. Samples: 1000206634. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:02,163][25689] Avg episode reward: [(0, '0.001')] [2022-07-11 01:29:03,652][26022] Updated weights on worker 0-0, policy_version 976774 (0.00086) [2022-07-11 01:29:05,700][26022] Updated weights on worker 0-0, policy_version 976784 (0.00088) [2022-07-11 01:29:07,190][25689] Fps is (10 sec: 5307.8, 60 sec: 5525.0, 300 sec: 5531.6). Total num frames: 1000235008. Throughput: 0: 5680.3. Samples: 1000237872. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:07,192][25689] Avg episode reward: [(0, '-0.578')] [2022-07-11 01:29:07,504][26022] Updated weights on worker 0-0, policy_version 976794 (0.00093) [2022-07-11 01:29:09,095][26022] Updated weights on worker 0-0, policy_version 976804 (0.00090) [2022-07-11 01:29:11,169][26022] Updated weights on worker 0-0, policy_version 976814 (0.00092) [2022-07-11 01:29:12,193][25689] Fps is (10 sec: 5614.9, 60 sec: 5546.7, 300 sec: 5533.3). Total num frames: 1000264704. Throughput: 0: 5698.4. Samples: 1000271514. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:12,194][25689] Avg episode reward: [(0, '0.627')] [2022-07-11 01:29:12,882][26022] Updated weights on worker 0-0, policy_version 976824 (0.00089) [2022-07-11 01:29:14,622][26022] Updated weights on worker 0-0, policy_version 976834 (0.00090) [2022-07-11 01:29:16,942][26022] Updated weights on worker 0-0, policy_version 976844 (0.00083) [2022-07-11 01:29:17,278][25689] Fps is (10 sec: 5582.6, 60 sec: 5526.8, 300 sec: 5529.6). Total num frames: 1000291328. Throughput: 0: 4875.7. Samples: 1000287994. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:17,279][25689] Avg episode reward: [(0, '0.058')] [2022-07-11 01:29:18,280][26022] Updated weights on worker 0-0, policy_version 976854 (0.00081) [2022-07-11 01:29:20,313][26022] Updated weights on worker 0-0, policy_version 976864 (0.00082) [2022-07-11 01:29:22,036][26022] Updated weights on worker 0-0, policy_version 976874 (0.00089) [2022-07-11 01:29:22,357][25689] Fps is (10 sec: 5440.3, 60 sec: 5520.9, 300 sec: 5528.3). Total num frames: 1000320000. Throughput: 0: 5703.6. Samples: 1000321816. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:22,358][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 01:29:22,850][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:29:22,864][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000976877_1000322048.pth [2022-07-11 01:29:22,864][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000974931_998329344.pth [2022-07-11 01:29:23,756][26022] Updated weights on worker 0-0, policy_version 976884 (0.00090) [2022-07-11 01:29:25,891][26022] Updated weights on worker 0-0, policy_version 976894 (0.00092) [2022-07-11 01:29:27,332][26022] Updated weights on worker 0-0, policy_version 976904 (0.00100) [2022-07-11 01:29:27,426][25689] Fps is (10 sec: 5751.9, 60 sec: 5568.9, 300 sec: 5541.3). Total num frames: 1000349696. Throughput: 0: 5818.1. Samples: 1000355608. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:27,426][25689] Avg episode reward: [(0, '-0.692')] [2022-07-11 01:29:29,487][26022] Updated weights on worker 0-0, policy_version 976914 (0.00095) [2022-07-11 01:29:31,284][26022] Updated weights on worker 0-0, policy_version 976924 (0.00082) [2022-07-11 01:29:32,457][25689] Fps is (10 sec: 5474.9, 60 sec: 5516.5, 300 sec: 5524.3). Total num frames: 1000375296. Throughput: 0: 4978.0. Samples: 1000372388. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:32,458][25689] Avg episode reward: [(0, '-0.317')] [2022-07-11 01:29:32,820][26022] Updated weights on worker 0-0, policy_version 976934 (0.00090) [2022-07-11 01:29:34,918][26022] Updated weights on worker 0-0, policy_version 976944 (0.00084) [2022-07-11 01:29:36,599][26022] Updated weights on worker 0-0, policy_version 976954 (0.00095) [2022-07-11 01:29:37,537][25689] Fps is (10 sec: 5468.7, 60 sec: 5533.2, 300 sec: 5536.7). Total num frames: 1000404992. Throughput: 0: 5822.6. Samples: 1000405954. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:37,538][25689] Avg episode reward: [(0, '-0.458')] [2022-07-11 01:29:38,629][26022] Updated weights on worker 0-0, policy_version 976964 (0.00086) [2022-07-11 01:29:40,487][26022] Updated weights on worker 0-0, policy_version 976974 (0.00105) [2022-07-11 01:29:42,165][26022] Updated weights on worker 0-0, policy_version 976984 (0.00093) [2022-07-11 01:29:42,574][25689] Fps is (10 sec: 5769.3, 60 sec: 5550.9, 300 sec: 5536.1). Total num frames: 1000433664. Throughput: 0: 5818.3. Samples: 1000439444. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:42,575][25689] Avg episode reward: [(0, '-0.298')] [2022-07-11 01:29:44,055][26022] Updated weights on worker 0-0, policy_version 976994 (0.00083) [2022-07-11 01:29:45,912][26022] Updated weights on worker 0-0, policy_version 977004 (0.00091) [2022-07-11 01:29:47,593][25689] Fps is (10 sec: 5600.8, 60 sec: 5534.1, 300 sec: 5535.9). Total num frames: 1000461312. Throughput: 0: 4978.0. Samples: 1000456000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:47,593][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 01:29:47,764][26022] Updated weights on worker 0-0, policy_version 977014 (0.00092) [2022-07-11 01:29:49,671][26022] Updated weights on worker 0-0, policy_version 977024 (0.00091) [2022-07-11 01:29:51,324][26022] Updated weights on worker 0-0, policy_version 977034 (0.00086) [2022-07-11 01:29:52,628][25689] Fps is (10 sec: 5499.9, 60 sec: 5548.5, 300 sec: 5534.3). Total num frames: 1000488960. Throughput: 0: 5801.4. Samples: 1000489408. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:52,629][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 01:29:53,524][26022] Updated weights on worker 0-0, policy_version 977044 (0.00093) [2022-07-11 01:29:54,947][26022] Updated weights on worker 0-0, policy_version 977054 (0.00088) [2022-07-11 01:29:57,064][26022] Updated weights on worker 0-0, policy_version 977064 (0.00086) [2022-07-11 01:29:57,700][25689] Fps is (10 sec: 5673.4, 60 sec: 5564.6, 300 sec: 5540.0). Total num frames: 1000518656. Throughput: 0: 5803.2. Samples: 1000522966. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:29:57,701][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 01:29:58,654][26022] Updated weights on worker 0-0, policy_version 977074 (0.00082) [2022-07-11 01:30:00,648][26022] Updated weights on worker 0-0, policy_version 977084 (0.00374) [2022-07-11 01:30:02,798][25689] Fps is (10 sec: 5336.6, 60 sec: 5522.3, 300 sec: 5531.7). Total num frames: 1000543232. Throughput: 0: 4973.6. Samples: 1000540028. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:02,798][25689] Avg episode reward: [(0, '1.629')] [2022-07-11 01:30:02,907][26022] Updated weights on worker 0-0, policy_version 977094 (0.00094) [2022-07-11 01:30:04,441][26022] Updated weights on worker 0-0, policy_version 977104 (0.00085) [2022-07-11 01:30:06,461][26022] Updated weights on worker 0-0, policy_version 977114 (0.00081) [2022-07-11 01:30:07,830][25689] Fps is (10 sec: 5459.0, 60 sec: 5589.4, 300 sec: 5541.6). Total num frames: 1000573952. Throughput: 0: 5727.3. Samples: 1000571902. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:07,830][25689] Avg episode reward: [(0, '0.945')] [2022-07-11 01:30:07,938][26022] Updated weights on worker 0-0, policy_version 977124 (0.00095) [2022-07-11 01:30:10,155][26022] Updated weights on worker 0-0, policy_version 977134 (0.00093) [2022-07-11 01:30:11,774][26022] Updated weights on worker 0-0, policy_version 977144 (0.00085) [2022-07-11 01:30:12,840][25689] Fps is (10 sec: 5608.6, 60 sec: 5521.3, 300 sec: 5536.9). Total num frames: 1000599552. Throughput: 0: 5745.7. Samples: 1000605538. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:12,840][25689] Avg episode reward: [(0, '0.721')] [2022-07-11 01:30:13,767][26022] Updated weights on worker 0-0, policy_version 977154 (0.00089) [2022-07-11 01:30:15,715][26022] Updated weights on worker 0-0, policy_version 977164 (0.00076) [2022-07-11 01:30:17,512][26022] Updated weights on worker 0-0, policy_version 977174 (0.00088) [2022-07-11 01:30:17,877][25689] Fps is (10 sec: 5503.7, 60 sec: 5576.4, 300 sec: 5543.4). Total num frames: 1000629248. Throughput: 0: 4911.9. Samples: 1000622070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:17,877][25689] Avg episode reward: [(0, '-1.162')] [2022-07-11 01:30:19,380][26022] Updated weights on worker 0-0, policy_version 977184 (0.00098) [2022-07-11 01:30:21,073][26022] Updated weights on worker 0-0, policy_version 977194 (0.00095) [2022-07-11 01:30:22,896][25689] Fps is (10 sec: 5600.6, 60 sec: 5548.1, 300 sec: 5538.1). Total num frames: 1000655872. Throughput: 0: 5729.1. Samples: 1000655170. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:22,897][25689] Avg episode reward: [(0, '-0.855')] [2022-07-11 01:30:23,070][26022] Updated weights on worker 0-0, policy_version 977204 (0.00096) [2022-07-11 01:30:24,695][26022] Updated weights on worker 0-0, policy_version 977214 (0.00085) [2022-07-11 01:30:26,680][26022] Updated weights on worker 0-0, policy_version 977224 (0.00080) [2022-07-11 01:30:27,962][25689] Fps is (10 sec: 5482.9, 60 sec: 5531.4, 300 sec: 5540.6). Total num frames: 1000684544. Throughput: 0: 5797.3. Samples: 1000688616. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:27,963][25689] Avg episode reward: [(0, '-1.170')] [2022-07-11 01:30:28,506][26022] Updated weights on worker 0-0, policy_version 977234 (0.00092) [2022-07-11 01:30:30,484][26022] Updated weights on worker 0-0, policy_version 977244 (0.00092) [2022-07-11 01:30:32,240][26022] Updated weights on worker 0-0, policy_version 977254 (0.00080) [2022-07-11 01:30:33,009][25689] Fps is (10 sec: 5467.4, 60 sec: 5546.8, 300 sec: 5541.1). Total num frames: 1000711168. Throughput: 0: 4937.2. Samples: 1000705120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:33,010][25689] Avg episode reward: [(0, '-1.244')] [2022-07-11 01:30:34,075][26022] Updated weights on worker 0-0, policy_version 977264 (0.00082) [2022-07-11 01:30:36,181][26022] Updated weights on worker 0-0, policy_version 977274 (0.00068) [2022-07-11 01:30:37,832][26022] Updated weights on worker 0-0, policy_version 977284 (0.00088) [2022-07-11 01:30:38,113][25689] Fps is (10 sec: 5447.5, 60 sec: 5527.8, 300 sec: 5536.0). Total num frames: 1000739840. Throughput: 0: 5734.5. Samples: 1000738112. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:38,113][25689] Avg episode reward: [(0, '-0.731')] [2022-07-11 01:30:39,811][26022] Updated weights on worker 0-0, policy_version 977294 (0.00088) [2022-07-11 01:30:41,566][26022] Updated weights on worker 0-0, policy_version 977304 (0.00086) [2022-07-11 01:30:43,193][25689] Fps is (10 sec: 5430.0, 60 sec: 5490.1, 300 sec: 5529.1). Total num frames: 1000766464. Throughput: 0: 5735.2. Samples: 1000771578. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:43,193][25689] Avg episode reward: [(0, '-0.815')] [2022-07-11 01:30:43,413][26022] Updated weights on worker 0-0, policy_version 977314 (0.00085) [2022-07-11 01:30:45,167][26022] Updated weights on worker 0-0, policy_version 977324 (0.00089) [2022-07-11 01:30:47,063][26022] Updated weights on worker 0-0, policy_version 977334 (0.00089) [2022-07-11 01:30:48,196][25689] Fps is (10 sec: 5483.9, 60 sec: 5508.4, 300 sec: 5537.7). Total num frames: 1000795136. Throughput: 0: 5755.8. Samples: 1000805078. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:48,196][25689] Avg episode reward: [(0, '0.617')] [2022-07-11 01:30:48,878][26022] Updated weights on worker 0-0, policy_version 977344 (0.00084) [2022-07-11 01:30:50,895][26022] Updated weights on worker 0-0, policy_version 977354 (0.00092) [2022-07-11 01:30:52,724][26022] Updated weights on worker 0-0, policy_version 977364 (0.00091) [2022-07-11 01:30:53,227][25689] Fps is (10 sec: 5714.6, 60 sec: 5525.6, 300 sec: 5538.5). Total num frames: 1000823808. Throughput: 0: 5758.2. Samples: 1000821538. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:53,228][25689] Avg episode reward: [(0, '0.816')] [2022-07-11 01:30:54,430][26022] Updated weights on worker 0-0, policy_version 977374 (0.00091) [2022-07-11 01:30:56,376][26022] Updated weights on worker 0-0, policy_version 977384 (0.00086) [2022-07-11 01:30:58,151][26022] Updated weights on worker 0-0, policy_version 977394 (0.00089) [2022-07-11 01:30:58,281][25689] Fps is (10 sec: 5685.9, 60 sec: 5510.4, 300 sec: 5537.5). Total num frames: 1000852480. Throughput: 0: 5798.0. Samples: 1000855050. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:30:58,282][25689] Avg episode reward: [(0, '0.384')] [2022-07-11 01:30:59,944][26022] Updated weights on worker 0-0, policy_version 977404 (0.00094) [2022-07-11 01:31:02,287][26022] Updated weights on worker 0-0, policy_version 977414 (0.00088) [2022-07-11 01:31:03,300][25689] Fps is (10 sec: 5489.9, 60 sec: 5551.4, 300 sec: 5541.5). Total num frames: 1000879104. Throughput: 0: 5720.6. Samples: 1000886602. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:03,300][25689] Avg episode reward: [(0, '0.036')] [2022-07-11 01:31:03,968][26022] Updated weights on worker 0-0, policy_version 977424 (0.00093) [2022-07-11 01:31:05,947][26022] Updated weights on worker 0-0, policy_version 977434 (0.00088) [2022-07-11 01:31:07,839][26022] Updated weights on worker 0-0, policy_version 977444 (0.00100) [2022-07-11 01:31:08,319][25689] Fps is (10 sec: 5304.9, 60 sec: 5484.9, 300 sec: 5538.2). Total num frames: 1000905728. Throughput: 0: 4885.8. Samples: 1000903394. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:08,319][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 01:31:09,371][26022] Updated weights on worker 0-0, policy_version 977454 (0.00091) [2022-07-11 01:31:11,482][26022] Updated weights on worker 0-0, policy_version 977464 (0.00087) [2022-07-11 01:31:13,201][26022] Updated weights on worker 0-0, policy_version 977474 (0.00084) [2022-07-11 01:31:13,352][25689] Fps is (10 sec: 5398.9, 60 sec: 5516.6, 300 sec: 5533.1). Total num frames: 1000933376. Throughput: 0: 5739.4. Samples: 1000937042. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:13,353][25689] Avg episode reward: [(0, '0.347')] [2022-07-11 01:31:15,047][26022] Updated weights on worker 0-0, policy_version 977484 (0.00087) [2022-07-11 01:31:16,999][26022] Updated weights on worker 0-0, policy_version 977494 (0.00090) [2022-07-11 01:31:18,443][25689] Fps is (10 sec: 5664.1, 60 sec: 5511.7, 300 sec: 5535.4). Total num frames: 1000963072. Throughput: 0: 5747.9. Samples: 1000970936. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:18,443][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 01:31:18,647][26022] Updated weights on worker 0-0, policy_version 977504 (0.00088) [2022-07-11 01:31:20,555][26022] Updated weights on worker 0-0, policy_version 977514 (0.00088) [2022-07-11 01:31:22,139][26022] Updated weights on worker 0-0, policy_version 977524 (0.00090) [2022-07-11 01:31:23,197][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:31:23,207][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000977529_1000989696.pth [2022-07-11 01:31:23,210][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000975579_998992896.pth [2022-07-11 01:31:23,490][25689] Fps is (10 sec: 5656.4, 60 sec: 5526.1, 300 sec: 5535.4). Total num frames: 1000990720. Throughput: 0: 5008.2. Samples: 1000987720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:23,491][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 01:31:24,121][26022] Updated weights on worker 0-0, policy_version 977534 (0.00087) [2022-07-11 01:31:25,795][26022] Updated weights on worker 0-0, policy_version 977544 (0.00093) [2022-07-11 01:31:27,865][26022] Updated weights on worker 0-0, policy_version 977554 (0.00087) [2022-07-11 01:31:28,508][25689] Fps is (10 sec: 5595.6, 60 sec: 5530.5, 300 sec: 5539.3). Total num frames: 1001019392. Throughput: 0: 5839.2. Samples: 1001021284. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:28,508][25689] Avg episode reward: [(0, '0.809')] [2022-07-11 01:31:29,569][26022] Updated weights on worker 0-0, policy_version 977564 (0.00079) [2022-07-11 01:31:31,527][26022] Updated weights on worker 0-0, policy_version 977574 (0.00089) [2022-07-11 01:31:33,211][26022] Updated weights on worker 0-0, policy_version 977584 (0.00099) [2022-07-11 01:31:33,533][25689] Fps is (10 sec: 5607.7, 60 sec: 5549.4, 300 sec: 5540.9). Total num frames: 1001047040. Throughput: 0: 5840.1. Samples: 1001054902. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:33,534][25689] Avg episode reward: [(0, '0.948')] [2022-07-11 01:31:35,188][26022] Updated weights on worker 0-0, policy_version 977594 (0.00092) [2022-07-11 01:31:36,836][26022] Updated weights on worker 0-0, policy_version 977604 (0.00093) [2022-07-11 01:31:38,584][25689] Fps is (10 sec: 5487.9, 60 sec: 5537.3, 300 sec: 5536.9). Total num frames: 1001074688. Throughput: 0: 5004.4. Samples: 1001071730. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:38,584][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 01:31:39,086][26022] Updated weights on worker 0-0, policy_version 977614 (0.00085) [2022-07-11 01:31:40,525][26022] Updated weights on worker 0-0, policy_version 977624 (0.00093) [2022-07-11 01:31:42,554][26022] Updated weights on worker 0-0, policy_version 977634 (0.00089) [2022-07-11 01:31:43,613][25689] Fps is (10 sec: 5689.0, 60 sec: 5592.8, 300 sec: 5537.0). Total num frames: 1001104384. Throughput: 0: 5841.9. Samples: 1001105278. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:43,614][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 01:31:44,259][26022] Updated weights on worker 0-0, policy_version 977644 (0.00090) [2022-07-11 01:31:46,143][26022] Updated weights on worker 0-0, policy_version 977654 (0.00087) [2022-07-11 01:31:47,889][26022] Updated weights on worker 0-0, policy_version 977664 (0.00095) [2022-07-11 01:31:48,691][25689] Fps is (10 sec: 5572.1, 60 sec: 5552.0, 300 sec: 5539.1). Total num frames: 1001131008. Throughput: 0: 5827.9. Samples: 1001138912. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:48,692][25689] Avg episode reward: [(0, '1.475')] [2022-07-11 01:31:49,713][26022] Updated weights on worker 0-0, policy_version 977674 (0.00091) [2022-07-11 01:31:51,684][26022] Updated weights on worker 0-0, policy_version 977684 (0.00093) [2022-07-11 01:31:53,277][26022] Updated weights on worker 0-0, policy_version 977694 (0.00095) [2022-07-11 01:31:53,734][25689] Fps is (10 sec: 5463.8, 60 sec: 5551.0, 300 sec: 5534.0). Total num frames: 1001159680. Throughput: 0: 4984.6. Samples: 1001155592. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:53,734][25689] Avg episode reward: [(0, '1.167')] [2022-07-11 01:31:55,339][26022] Updated weights on worker 0-0, policy_version 977704 (0.00087) [2022-07-11 01:31:56,974][26022] Updated weights on worker 0-0, policy_version 977714 (0.00091) [2022-07-11 01:31:58,846][25689] Fps is (10 sec: 5647.3, 60 sec: 5545.7, 300 sec: 5539.1). Total num frames: 1001188352. Throughput: 0: 5795.3. Samples: 1001189152. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:31:58,846][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 01:31:58,945][26022] Updated weights on worker 0-0, policy_version 977724 (0.00093) [2022-07-11 01:32:00,839][26022] Updated weights on worker 0-0, policy_version 977734 (0.00086) [2022-07-11 01:32:02,948][26022] Updated weights on worker 0-0, policy_version 977744 (0.00090) [2022-07-11 01:32:03,865][25689] Fps is (10 sec: 5458.1, 60 sec: 5545.7, 300 sec: 5539.0). Total num frames: 1001214976. Throughput: 0: 5705.0. Samples: 1001220812. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:03,865][25689] Avg episode reward: [(0, '1.895')] [2022-07-11 01:32:04,866][26022] Updated weights on worker 0-0, policy_version 977754 (0.00084) [2022-07-11 01:32:06,616][26022] Updated weights on worker 0-0, policy_version 977764 (0.00093) [2022-07-11 01:32:08,392][26022] Updated weights on worker 0-0, policy_version 977774 (0.00109) [2022-07-11 01:32:08,872][25689] Fps is (10 sec: 5412.8, 60 sec: 5563.6, 300 sec: 5536.2). Total num frames: 1001242624. Throughput: 0: 4889.8. Samples: 1001237592. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:08,873][25689] Avg episode reward: [(0, '1.657')] [2022-07-11 01:32:10,272][26022] Updated weights on worker 0-0, policy_version 977784 (0.00091) [2022-07-11 01:32:12,020][26022] Updated weights on worker 0-0, policy_version 977794 (0.00087) [2022-07-11 01:32:13,889][25689] Fps is (10 sec: 5516.3, 60 sec: 5565.2, 300 sec: 5533.8). Total num frames: 1001270272. Throughput: 0: 5738.9. Samples: 1001271260. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:13,889][25689] Avg episode reward: [(0, '1.507')] [2022-07-11 01:32:14,043][26022] Updated weights on worker 0-0, policy_version 977804 (0.00092) [2022-07-11 01:32:15,660][26022] Updated weights on worker 0-0, policy_version 977814 (0.00087) [2022-07-11 01:32:17,568][26022] Updated weights on worker 0-0, policy_version 977824 (0.00086) [2022-07-11 01:32:18,943][25689] Fps is (10 sec: 5694.3, 60 sec: 5568.5, 300 sec: 5540.4). Total num frames: 1001299968. Throughput: 0: 5744.4. Samples: 1001304598. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:18,943][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 01:32:19,562][26022] Updated weights on worker 0-0, policy_version 977834 (0.00092) [2022-07-11 01:32:21,264][26022] Updated weights on worker 0-0, policy_version 977844 (0.00083) [2022-07-11 01:32:23,052][26022] Updated weights on worker 0-0, policy_version 977854 (0.00086) [2022-07-11 01:32:23,973][25689] Fps is (10 sec: 5483.2, 60 sec: 5536.2, 300 sec: 5537.2). Total num frames: 1001325568. Throughput: 0: 5004.1. Samples: 1001321438. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:23,974][25689] Avg episode reward: [(0, '0.571')] [2022-07-11 01:32:24,944][26022] Updated weights on worker 0-0, policy_version 977864 (0.00089) [2022-07-11 01:32:26,763][26022] Updated weights on worker 0-0, policy_version 977874 (0.00083) [2022-07-11 01:32:28,545][26022] Updated weights on worker 0-0, policy_version 977884 (0.00084) [2022-07-11 01:32:29,015][25689] Fps is (10 sec: 5388.5, 60 sec: 5534.1, 300 sec: 5537.9). Total num frames: 1001354240. Throughput: 0: 5830.6. Samples: 1001355034. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:29,015][25689] Avg episode reward: [(0, '0.414')] [2022-07-11 01:32:30,505][26022] Updated weights on worker 0-0, policy_version 977894 (0.00086) [2022-07-11 01:32:32,261][26022] Updated weights on worker 0-0, policy_version 977904 (0.00093) [2022-07-11 01:32:34,037][25689] Fps is (10 sec: 5697.8, 60 sec: 5551.2, 300 sec: 5535.2). Total num frames: 1001382912. Throughput: 0: 5821.3. Samples: 1001388552. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:34,038][25689] Avg episode reward: [(0, '-0.014')] [2022-07-11 01:32:34,080][26022] Updated weights on worker 0-0, policy_version 977914 (0.00082) [2022-07-11 01:32:35,922][26022] Updated weights on worker 0-0, policy_version 977924 (0.00083) [2022-07-11 01:32:37,774][26022] Updated weights on worker 0-0, policy_version 977934 (0.00089) [2022-07-11 01:32:39,079][25689] Fps is (10 sec: 5596.1, 60 sec: 5552.1, 300 sec: 5535.1). Total num frames: 1001410560. Throughput: 0: 5012.3. Samples: 1001405526. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:39,079][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 01:32:39,688][26022] Updated weights on worker 0-0, policy_version 977944 (0.00090) [2022-07-11 01:32:41,385][26022] Updated weights on worker 0-0, policy_version 977954 (0.00085) [2022-07-11 01:32:43,370][26022] Updated weights on worker 0-0, policy_version 977964 (0.00430) [2022-07-11 01:32:44,094][25689] Fps is (10 sec: 5702.2, 60 sec: 5553.4, 300 sec: 5542.1). Total num frames: 1001440256. Throughput: 0: 5843.5. Samples: 1001439014. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 01:32:44,094][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 01:32:44,970][26022] Updated weights on worker 0-0, policy_version 977974 (0.00879) [2022-07-11 01:32:46,953][26022] Updated weights on worker 0-0, policy_version 977984 (0.00092) [2022-07-11 01:32:48,815][26022] Updated weights on worker 0-0, policy_version 977994 (0.00088) [2022-07-11 01:32:49,124][25689] Fps is (10 sec: 5606.8, 60 sec: 5557.8, 300 sec: 5535.0). Total num frames: 1001466880. Throughput: 0: 5847.6. Samples: 1001472624. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:32:49,124][25689] Avg episode reward: [(0, '0.334')] [2022-07-11 01:32:50,616][26022] Updated weights on worker 0-0, policy_version 978004 (0.00081) [2022-07-11 01:32:52,516][26022] Updated weights on worker 0-0, policy_version 978014 (0.00083) [2022-07-11 01:32:54,142][25689] Fps is (10 sec: 5503.1, 60 sec: 5560.0, 300 sec: 5543.4). Total num frames: 1001495552. Throughput: 0: 5005.2. Samples: 1001489180. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:32:54,143][25689] Avg episode reward: [(0, '0.279')] [2022-07-11 01:32:54,265][26022] Updated weights on worker 0-0, policy_version 978024 (0.00089) [2022-07-11 01:32:56,128][26022] Updated weights on worker 0-0, policy_version 978034 (0.00095) [2022-07-11 01:32:57,954][26022] Updated weights on worker 0-0, policy_version 978044 (0.00088) [2022-07-11 01:32:59,260][25689] Fps is (10 sec: 5556.2, 60 sec: 5542.5, 300 sec: 5541.5). Total num frames: 1001523200. Throughput: 0: 5814.4. Samples: 1001522868. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:32:59,261][25689] Avg episode reward: [(0, '0.301')] [2022-07-11 01:32:59,861][26022] Updated weights on worker 0-0, policy_version 978054 (0.00091) [2022-07-11 01:33:01,420][26022] Updated weights on worker 0-0, policy_version 978064 (0.00089) [2022-07-11 01:33:03,775][26022] Updated weights on worker 0-0, policy_version 978074 (0.00093) [2022-07-11 01:33:04,295][25689] Fps is (10 sec: 5345.3, 60 sec: 5541.0, 300 sec: 5541.2). Total num frames: 1001549824. Throughput: 0: 5715.1. Samples: 1001554468. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:04,296][25689] Avg episode reward: [(0, '0.251')] [2022-07-11 01:33:05,614][26022] Updated weights on worker 0-0, policy_version 978084 (0.00087) [2022-07-11 01:33:07,594][26022] Updated weights on worker 0-0, policy_version 978094 (0.00086) [2022-07-11 01:33:09,299][26022] Updated weights on worker 0-0, policy_version 978104 (0.00094) [2022-07-11 01:33:09,338][25689] Fps is (10 sec: 5486.8, 60 sec: 5554.7, 300 sec: 5541.4). Total num frames: 1001578496. Throughput: 0: 4876.8. Samples: 1001571206. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:09,339][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 01:33:11,334][26022] Updated weights on worker 0-0, policy_version 978114 (0.00064) [2022-07-11 01:33:12,841][26022] Updated weights on worker 0-0, policy_version 978124 (0.00084) [2022-07-11 01:33:14,341][25689] Fps is (10 sec: 5402.6, 60 sec: 5522.1, 300 sec: 5535.4). Total num frames: 1001604096. Throughput: 0: 5720.6. Samples: 1001604730. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:14,341][25689] Avg episode reward: [(0, '0.874')] [2022-07-11 01:33:15,065][26022] Updated weights on worker 0-0, policy_version 978134 (0.00096) [2022-07-11 01:33:16,587][26022] Updated weights on worker 0-0, policy_version 978144 (0.00087) [2022-07-11 01:33:18,575][26022] Updated weights on worker 0-0, policy_version 978154 (0.00080) [2022-07-11 01:33:19,402][25689] Fps is (10 sec: 5698.1, 60 sec: 5555.4, 300 sec: 5544.9). Total num frames: 1001635840. Throughput: 0: 5734.1. Samples: 1001638362. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:19,402][25689] Avg episode reward: [(0, '1.197')] [2022-07-11 01:33:20,303][26022] Updated weights on worker 0-0, policy_version 978164 (0.00081) [2022-07-11 01:33:22,014][26022] Updated weights on worker 0-0, policy_version 978174 (0.00090) [2022-07-11 01:33:23,352][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:33:23,363][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000978180_1001656320.pth [2022-07-11 01:33:23,364][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000976230_999659520.pth [2022-07-11 01:33:24,055][26022] Updated weights on worker 0-0, policy_version 978184 (0.00090) [2022-07-11 01:33:24,404][25689] Fps is (10 sec: 5800.1, 60 sec: 5574.9, 300 sec: 5545.6). Total num frames: 1001662464. Throughput: 0: 5010.7. Samples: 1001655226. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:24,405][25689] Avg episode reward: [(0, '1.090')] [2022-07-11 01:33:25,681][26022] Updated weights on worker 0-0, policy_version 978194 (0.00085) [2022-07-11 01:33:27,610][26022] Updated weights on worker 0-0, policy_version 978204 (0.00099) [2022-07-11 01:33:29,459][25689] Fps is (10 sec: 5294.8, 60 sec: 5539.8, 300 sec: 5537.9). Total num frames: 1001689088. Throughput: 0: 5838.4. Samples: 1001688676. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:29,459][25689] Avg episode reward: [(0, '1.064')] [2022-07-11 01:33:29,755][26022] Updated weights on worker 0-0, policy_version 978214 (0.00090) [2022-07-11 01:33:31,165][26022] Updated weights on worker 0-0, policy_version 978224 (0.00082) [2022-07-11 01:33:33,442][26022] Updated weights on worker 0-0, policy_version 978234 (0.00088) [2022-07-11 01:33:34,477][25689] Fps is (10 sec: 5693.2, 60 sec: 5574.1, 300 sec: 5545.9). Total num frames: 1001719808. Throughput: 0: 5834.8. Samples: 1001722218. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:34,477][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 01:33:34,989][26022] Updated weights on worker 0-0, policy_version 978244 (0.00092) [2022-07-11 01:33:36,924][26022] Updated weights on worker 0-0, policy_version 978254 (0.00091) [2022-07-11 01:33:38,707][26022] Updated weights on worker 0-0, policy_version 978264 (0.00079) [2022-07-11 01:33:39,535][25689] Fps is (10 sec: 5792.8, 60 sec: 5572.6, 300 sec: 5545.7). Total num frames: 1001747456. Throughput: 0: 5834.1. Samples: 1001755818. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:39,535][25689] Avg episode reward: [(0, '1.302')] [2022-07-11 01:33:40,501][26022] Updated weights on worker 0-0, policy_version 978274 (0.00092) [2022-07-11 01:33:42,263][26022] Updated weights on worker 0-0, policy_version 978284 (0.00080) [2022-07-11 01:33:44,265][26022] Updated weights on worker 0-0, policy_version 978294 (0.00081) [2022-07-11 01:33:44,559][25689] Fps is (10 sec: 5383.0, 60 sec: 5521.0, 300 sec: 5538.7). Total num frames: 1001774080. Throughput: 0: 5822.5. Samples: 1001772576. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:44,559][25689] Avg episode reward: [(0, '1.633')] [2022-07-11 01:33:45,727][26022] Updated weights on worker 0-0, policy_version 978304 (0.00098) [2022-07-11 01:33:47,970][26022] Updated weights on worker 0-0, policy_version 978314 (0.00095) [2022-07-11 01:33:49,403][26022] Updated weights on worker 0-0, policy_version 978324 (0.00087) [2022-07-11 01:33:49,564][25689] Fps is (10 sec: 5615.6, 60 sec: 5574.1, 300 sec: 5549.1). Total num frames: 1001803776. Throughput: 0: 5850.6. Samples: 1001806304. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:49,564][25689] Avg episode reward: [(0, '0.112')] [2022-07-11 01:33:51,720][26022] Updated weights on worker 0-0, policy_version 978334 (0.00082) [2022-07-11 01:33:53,258][26022] Updated weights on worker 0-0, policy_version 978344 (0.00084) [2022-07-11 01:33:54,587][25689] Fps is (10 sec: 5616.3, 60 sec: 5539.8, 300 sec: 5543.0). Total num frames: 1001830400. Throughput: 0: 5833.3. Samples: 1001839526. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:54,587][25689] Avg episode reward: [(0, '-0.406')] [2022-07-11 01:33:55,301][26022] Updated weights on worker 0-0, policy_version 978354 (0.00087) [2022-07-11 01:33:57,155][26022] Updated weights on worker 0-0, policy_version 978364 (0.00078) [2022-07-11 01:33:58,864][26022] Updated weights on worker 0-0, policy_version 978374 (0.00090) [2022-07-11 01:33:59,723][25689] Fps is (10 sec: 5443.1, 60 sec: 5555.0, 300 sec: 5547.4). Total num frames: 1001859072. Throughput: 0: 4968.8. Samples: 1001856128. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:33:59,723][25689] Avg episode reward: [(0, '-0.747')] [2022-07-11 01:34:00,750][26022] Updated weights on worker 0-0, policy_version 978384 (0.00092) [2022-07-11 01:34:02,989][26022] Updated weights on worker 0-0, policy_version 978394 (0.00092) [2022-07-11 01:34:04,759][25689] Fps is (10 sec: 5335.4, 60 sec: 5538.0, 300 sec: 5543.8). Total num frames: 1001884672. Throughput: 0: 5685.6. Samples: 1001887426. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:04,761][25689] Avg episode reward: [(0, '-0.695')] [2022-07-11 01:34:04,887][26022] Updated weights on worker 0-0, policy_version 978404 (0.00085) [2022-07-11 01:34:06,655][26022] Updated weights on worker 0-0, policy_version 978414 (0.00080) [2022-07-11 01:34:08,591][26022] Updated weights on worker 0-0, policy_version 978424 (0.00093) [2022-07-11 01:34:09,784][25689] Fps is (10 sec: 5394.1, 60 sec: 5539.6, 300 sec: 5539.9). Total num frames: 1001913344. Throughput: 0: 5676.8. Samples: 1001921090. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:09,786][25689] Avg episode reward: [(0, '-0.647')] [2022-07-11 01:34:10,259][26022] Updated weights on worker 0-0, policy_version 978434 (0.00097) [2022-07-11 01:34:12,227][26022] Updated weights on worker 0-0, policy_version 978444 (0.00080) [2022-07-11 01:34:13,985][26022] Updated weights on worker 0-0, policy_version 978454 (0.00086) [2022-07-11 01:34:14,830][25689] Fps is (10 sec: 5490.5, 60 sec: 5552.6, 300 sec: 5540.7). Total num frames: 1001939968. Throughput: 0: 4861.4. Samples: 1001937938. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:14,832][25689] Avg episode reward: [(0, '-0.431')] [2022-07-11 01:34:15,819][26022] Updated weights on worker 0-0, policy_version 978464 (0.00083) [2022-07-11 01:34:17,758][26022] Updated weights on worker 0-0, policy_version 978474 (0.00094) [2022-07-11 01:34:19,338][26022] Updated weights on worker 0-0, policy_version 978484 (0.00089) [2022-07-11 01:34:19,920][25689] Fps is (10 sec: 5657.4, 60 sec: 5533.0, 300 sec: 5547.4). Total num frames: 1001970688. Throughput: 0: 5710.5. Samples: 1001971466. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:19,921][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 01:34:21,314][26022] Updated weights on worker 0-0, policy_version 978494 (0.00090) [2022-07-11 01:34:23,156][26022] Updated weights on worker 0-0, policy_version 978504 (0.00087) [2022-07-11 01:34:24,979][25689] Fps is (10 sec: 5650.3, 60 sec: 5527.9, 300 sec: 5537.2). Total num frames: 1001997312. Throughput: 0: 5816.4. Samples: 1002005034. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:24,979][25689] Avg episode reward: [(0, '0.866')] [2022-07-11 01:34:25,058][26022] Updated weights on worker 0-0, policy_version 978514 (0.00084) [2022-07-11 01:34:26,918][26022] Updated weights on worker 0-0, policy_version 978524 (0.00090) [2022-07-11 01:34:28,851][26022] Updated weights on worker 0-0, policy_version 978534 (0.00091) [2022-07-11 01:34:29,990][25689] Fps is (10 sec: 5491.5, 60 sec: 5565.7, 300 sec: 5547.9). Total num frames: 1002025984. Throughput: 0: 4972.7. Samples: 1002021566. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:29,990][25689] Avg episode reward: [(0, '1.660')] [2022-07-11 01:34:30,461][26022] Updated weights on worker 0-0, policy_version 978544 (0.00087) [2022-07-11 01:34:32,591][26022] Updated weights on worker 0-0, policy_version 978554 (0.00091) [2022-07-11 01:34:34,156][26022] Updated weights on worker 0-0, policy_version 978564 (0.00089) [2022-07-11 01:34:35,026][25689] Fps is (10 sec: 5503.4, 60 sec: 5496.3, 300 sec: 5538.4). Total num frames: 1002052608. Throughput: 0: 5807.3. Samples: 1002055224. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:35,027][25689] Avg episode reward: [(0, '0.756')] [2022-07-11 01:34:36,025][26022] Updated weights on worker 0-0, policy_version 978574 (0.00093) [2022-07-11 01:34:38,040][26022] Updated weights on worker 0-0, policy_version 978584 (0.00085) [2022-07-11 01:34:39,667][26022] Updated weights on worker 0-0, policy_version 978594 (0.00098) [2022-07-11 01:34:40,088][25689] Fps is (10 sec: 5577.0, 60 sec: 5529.8, 300 sec: 5541.4). Total num frames: 1002082304. Throughput: 0: 5810.6. Samples: 1002088654. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:40,089][25689] Avg episode reward: [(0, '0.723')] [2022-07-11 01:34:41,602][26022] Updated weights on worker 0-0, policy_version 978604 (0.00083) [2022-07-11 01:34:43,394][26022] Updated weights on worker 0-0, policy_version 978614 (0.00087) [2022-07-11 01:34:45,040][26022] Updated weights on worker 0-0, policy_version 978624 (0.00114) [2022-07-11 01:34:45,124][25689] Fps is (10 sec: 5780.3, 60 sec: 5562.5, 300 sec: 5544.5). Total num frames: 1002110976. Throughput: 0: 4982.3. Samples: 1002105404. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:45,125][25689] Avg episode reward: [(0, '-0.044')] [2022-07-11 01:34:46,859][26022] Updated weights on worker 0-0, policy_version 978634 (0.00091) [2022-07-11 01:34:48,890][26022] Updated weights on worker 0-0, policy_version 978644 (0.00099) [2022-07-11 01:34:50,179][25689] Fps is (10 sec: 5479.9, 60 sec: 5507.3, 300 sec: 5540.7). Total num frames: 1002137600. Throughput: 0: 5814.3. Samples: 1002138952. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:50,179][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 01:34:50,702][26022] Updated weights on worker 0-0, policy_version 978654 (0.00087) [2022-07-11 01:34:52,652][26022] Updated weights on worker 0-0, policy_version 978664 (0.00094) [2022-07-11 01:34:54,392][26022] Updated weights on worker 0-0, policy_version 978674 (0.00088) [2022-07-11 01:34:55,195][25689] Fps is (10 sec: 5287.2, 60 sec: 5507.9, 300 sec: 5531.4). Total num frames: 1002164224. Throughput: 0: 5800.6. Samples: 1002172216. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:34:55,196][25689] Avg episode reward: [(0, '0.204')] [2022-07-11 01:34:56,283][26022] Updated weights on worker 0-0, policy_version 978684 (0.00079) [2022-07-11 01:34:58,192][26022] Updated weights on worker 0-0, policy_version 978694 (0.00095) [2022-07-11 01:34:59,919][26022] Updated weights on worker 0-0, policy_version 978704 (0.00083) [2022-07-11 01:35:00,276][25689] Fps is (10 sec: 5679.1, 60 sec: 5546.7, 300 sec: 5552.4). Total num frames: 1002194944. Throughput: 0: 4964.4. Samples: 1002188876. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:00,277][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 01:35:01,885][26022] Updated weights on worker 0-0, policy_version 978714 (0.00097) [2022-07-11 01:35:03,992][26022] Updated weights on worker 0-0, policy_version 978724 (0.00087) [2022-07-11 01:35:05,295][25689] Fps is (10 sec: 5576.4, 60 sec: 5548.3, 300 sec: 5535.4). Total num frames: 1002220544. Throughput: 0: 5716.9. Samples: 1002220718. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:05,295][25689] Avg episode reward: [(0, '1.241')] [2022-07-11 01:35:05,436][26022] Updated weights on worker 0-0, policy_version 978734 (0.00084) [2022-07-11 01:35:07,744][26022] Updated weights on worker 0-0, policy_version 978744 (0.00098) [2022-07-11 01:35:09,280][26022] Updated weights on worker 0-0, policy_version 978754 (0.00094) [2022-07-11 01:35:10,361][25689] Fps is (10 sec: 5280.0, 60 sec: 5527.6, 300 sec: 5541.2). Total num frames: 1002248192. Throughput: 0: 5723.2. Samples: 1002254460. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:10,362][25689] Avg episode reward: [(0, '1.707')] [2022-07-11 01:35:11,266][26022] Updated weights on worker 0-0, policy_version 978764 (0.00082) [2022-07-11 01:35:13,135][26022] Updated weights on worker 0-0, policy_version 978774 (0.00084) [2022-07-11 01:35:14,847][26022] Updated weights on worker 0-0, policy_version 978784 (0.00081) [2022-07-11 01:35:15,434][25689] Fps is (10 sec: 5655.8, 60 sec: 5575.9, 300 sec: 5540.6). Total num frames: 1002277888. Throughput: 0: 4891.5. Samples: 1002271212. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:15,436][25689] Avg episode reward: [(0, '2.164')] [2022-07-11 01:35:16,820][26022] Updated weights on worker 0-0, policy_version 978794 (0.00093) [2022-07-11 01:35:18,411][26022] Updated weights on worker 0-0, policy_version 978804 (0.00089) [2022-07-11 01:35:20,508][25689] Fps is (10 sec: 5651.5, 60 sec: 5526.7, 300 sec: 5543.0). Total num frames: 1002305536. Throughput: 0: 5738.7. Samples: 1002304980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:20,509][25689] Avg episode reward: [(0, '1.851')] [2022-07-11 01:35:20,514][26022] Updated weights on worker 0-0, policy_version 978814 (0.00091) [2022-07-11 01:35:22,149][26022] Updated weights on worker 0-0, policy_version 978824 (0.00086) [2022-07-11 01:35:23,437][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:35:23,457][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000978830_1002321920.pth [2022-07-11 01:35:23,458][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000976877_1000322048.pth [2022-07-11 01:35:24,165][26022] Updated weights on worker 0-0, policy_version 978834 (0.00081) [2022-07-11 01:35:25,558][25689] Fps is (10 sec: 5461.6, 60 sec: 5544.3, 300 sec: 5539.9). Total num frames: 1002333184. Throughput: 0: 5805.1. Samples: 1002338348. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:25,559][25689] Avg episode reward: [(0, '1.904')] [2022-07-11 01:35:25,939][26022] Updated weights on worker 0-0, policy_version 978844 (0.00091) [2022-07-11 01:35:27,877][26022] Updated weights on worker 0-0, policy_version 978854 (0.00089) [2022-07-11 01:35:29,659][26022] Updated weights on worker 0-0, policy_version 978864 (0.00088) [2022-07-11 01:35:30,579][25689] Fps is (10 sec: 5490.8, 60 sec: 5526.6, 300 sec: 5543.8). Total num frames: 1002360832. Throughput: 0: 4972.4. Samples: 1002354988. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:30,580][25689] Avg episode reward: [(0, '1.660')] [2022-07-11 01:35:31,661][26022] Updated weights on worker 0-0, policy_version 978874 (0.00085) [2022-07-11 01:35:33,176][26022] Updated weights on worker 0-0, policy_version 978884 (0.00097) [2022-07-11 01:35:35,213][26022] Updated weights on worker 0-0, policy_version 978894 (0.00081) [2022-07-11 01:35:35,661][25689] Fps is (10 sec: 5575.0, 60 sec: 5556.2, 300 sec: 5544.2). Total num frames: 1002389504. Throughput: 0: 5792.7. Samples: 1002388380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:35,661][25689] Avg episode reward: [(0, '1.506')] [2022-07-11 01:35:36,892][26022] Updated weights on worker 0-0, policy_version 978904 (0.00085) [2022-07-11 01:35:38,854][26022] Updated weights on worker 0-0, policy_version 978914 (0.00083) [2022-07-11 01:35:40,701][25689] Fps is (10 sec: 5564.0, 60 sec: 5524.4, 300 sec: 5548.4). Total num frames: 1002417152. Throughput: 0: 5778.8. Samples: 1002421670. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:40,701][25689] Avg episode reward: [(0, '0.773')] [2022-07-11 01:35:40,783][26022] Updated weights on worker 0-0, policy_version 978924 (0.00088) [2022-07-11 01:35:42,410][26022] Updated weights on worker 0-0, policy_version 978934 (0.00095) [2022-07-11 01:35:44,592][26022] Updated weights on worker 0-0, policy_version 978944 (0.00095) [2022-07-11 01:35:45,792][25689] Fps is (10 sec: 5660.1, 60 sec: 5536.3, 300 sec: 5550.2). Total num frames: 1002446848. Throughput: 0: 5768.7. Samples: 1002455070. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:45,792][25689] Avg episode reward: [(0, '0.628')] [2022-07-11 01:35:46,014][26022] Updated weights on worker 0-0, policy_version 978954 (0.00078) [2022-07-11 01:35:48,076][26022] Updated weights on worker 0-0, policy_version 978964 (0.00081) [2022-07-11 01:35:49,841][26022] Updated weights on worker 0-0, policy_version 978974 (0.00086) [2022-07-11 01:35:50,890][25689] Fps is (10 sec: 5527.6, 60 sec: 5532.3, 300 sec: 5542.1). Total num frames: 1002473472. Throughput: 0: 5755.6. Samples: 1002471890. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:50,890][25689] Avg episode reward: [(0, '0.734')] [2022-07-11 01:35:51,602][26022] Updated weights on worker 0-0, policy_version 978984 (0.00090) [2022-07-11 01:35:53,746][26022] Updated weights on worker 0-0, policy_version 978994 (0.00083) [2022-07-11 01:35:55,510][26022] Updated weights on worker 0-0, policy_version 979004 (0.00089) [2022-07-11 01:35:55,918][25689] Fps is (10 sec: 5460.6, 60 sec: 5564.9, 300 sec: 5542.5). Total num frames: 1002502144. Throughput: 0: 5769.8. Samples: 1002505262. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:35:55,919][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 01:35:57,345][26022] Updated weights on worker 0-0, policy_version 979014 (0.00086) [2022-07-11 01:35:59,194][26022] Updated weights on worker 0-0, policy_version 979024 (0.00094) [2022-07-11 01:36:00,948][26022] Updated weights on worker 0-0, policy_version 979034 (0.00081) [2022-07-11 01:36:01,031][25689] Fps is (10 sec: 5654.2, 60 sec: 5528.3, 300 sec: 5547.7). Total num frames: 1002530816. Throughput: 0: 5768.3. Samples: 1002538944. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:01,032][25689] Avg episode reward: [(0, '0.747')] [2022-07-11 01:36:03,095][26022] Updated weights on worker 0-0, policy_version 979044 (0.00084) [2022-07-11 01:36:04,895][26022] Updated weights on worker 0-0, policy_version 979054 (0.00087) [2022-07-11 01:36:06,047][25689] Fps is (10 sec: 5459.4, 60 sec: 5545.5, 300 sec: 5547.7). Total num frames: 1002557440. Throughput: 0: 4870.2. Samples: 1002553714. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:06,047][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 01:36:06,804][26022] Updated weights on worker 0-0, policy_version 979064 (0.00093) [2022-07-11 01:36:08,481][26022] Updated weights on worker 0-0, policy_version 979074 (0.00086) [2022-07-11 01:36:10,547][26022] Updated weights on worker 0-0, policy_version 979084 (0.00091) [2022-07-11 01:36:11,049][25689] Fps is (10 sec: 5315.7, 60 sec: 5534.5, 300 sec: 5544.9). Total num frames: 1002584064. Throughput: 0: 5733.8. Samples: 1002587478. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:11,053][25689] Avg episode reward: [(0, '-0.061')] [2022-07-11 01:36:12,143][26022] Updated weights on worker 0-0, policy_version 979094 (0.00083) [2022-07-11 01:36:14,212][26022] Updated weights on worker 0-0, policy_version 979104 (0.00086) [2022-07-11 01:36:15,818][26022] Updated weights on worker 0-0, policy_version 979114 (0.00098) [2022-07-11 01:36:16,065][25689] Fps is (10 sec: 5519.3, 60 sec: 5522.7, 300 sec: 5542.8). Total num frames: 1002612736. Throughput: 0: 5759.4. Samples: 1002621298. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:16,066][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 01:36:17,736][26022] Updated weights on worker 0-0, policy_version 979124 (0.00095) [2022-07-11 01:36:19,714][26022] Updated weights on worker 0-0, policy_version 979134 (0.00086) [2022-07-11 01:36:21,143][25689] Fps is (10 sec: 5680.8, 60 sec: 5539.2, 300 sec: 5545.7). Total num frames: 1002641408. Throughput: 0: 4904.2. Samples: 1002637574. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:21,144][25689] Avg episode reward: [(0, '-0.195')] [2022-07-11 01:36:21,438][26022] Updated weights on worker 0-0, policy_version 979144 (0.00086) [2022-07-11 01:36:23,475][26022] Updated weights on worker 0-0, policy_version 979154 (0.00090) [2022-07-11 01:36:25,230][26022] Updated weights on worker 0-0, policy_version 979164 (0.00093) [2022-07-11 01:36:26,157][25689] Fps is (10 sec: 5377.7, 60 sec: 5508.7, 300 sec: 5535.4). Total num frames: 1002667008. Throughput: 0: 5810.6. Samples: 1002670570. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:26,158][25689] Avg episode reward: [(0, '-0.261')] [2022-07-11 01:36:27,061][26022] Updated weights on worker 0-0, policy_version 979174 (0.00094) [2022-07-11 01:36:29,146][26022] Updated weights on worker 0-0, policy_version 979184 (0.00092) [2022-07-11 01:36:30,751][26022] Updated weights on worker 0-0, policy_version 979194 (0.00085) [2022-07-11 01:36:31,204][25689] Fps is (10 sec: 5496.0, 60 sec: 5540.1, 300 sec: 5541.9). Total num frames: 1002696704. Throughput: 0: 5753.3. Samples: 1002703440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:31,205][25689] Avg episode reward: [(0, '-1.308')] [2022-07-11 01:36:32,789][26022] Updated weights on worker 0-0, policy_version 979204 (0.00087) [2022-07-11 01:36:34,567][26022] Updated weights on worker 0-0, policy_version 979214 (0.00097) [2022-07-11 01:36:36,209][25689] Fps is (10 sec: 5704.9, 60 sec: 5530.2, 300 sec: 5542.8). Total num frames: 1002724352. Throughput: 0: 4908.2. Samples: 1002720168. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:36,211][25689] Avg episode reward: [(0, '-0.927')] [2022-07-11 01:36:36,332][26022] Updated weights on worker 0-0, policy_version 979224 (0.00093) [2022-07-11 01:36:38,231][26022] Updated weights on worker 0-0, policy_version 979234 (0.00084) [2022-07-11 01:36:39,960][26022] Updated weights on worker 0-0, policy_version 979244 (0.00091) [2022-07-11 01:36:41,269][25689] Fps is (10 sec: 5392.2, 60 sec: 5511.5, 300 sec: 5531.9). Total num frames: 1002750976. Throughput: 0: 5760.1. Samples: 1002753502. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 01:36:41,271][25689] Avg episode reward: [(0, '0.027')] [2022-07-11 01:36:42,044][26022] Updated weights on worker 0-0, policy_version 979254 (0.00088) [2022-07-11 01:36:43,612][26022] Updated weights on worker 0-0, policy_version 979264 (0.00087) [2022-07-11 01:36:45,618][26022] Updated weights on worker 0-0, policy_version 979274 (0.00088) [2022-07-11 01:36:46,311][25689] Fps is (10 sec: 5575.3, 60 sec: 5516.0, 300 sec: 5542.9). Total num frames: 1002780672. Throughput: 0: 5768.5. Samples: 1002786826. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:36:46,311][25689] Avg episode reward: [(0, '0.081')] [2022-07-11 01:36:47,546][26022] Updated weights on worker 0-0, policy_version 979284 (0.00090) [2022-07-11 01:36:49,283][26022] Updated weights on worker 0-0, policy_version 979294 (0.00087) [2022-07-11 01:36:51,181][26022] Updated weights on worker 0-0, policy_version 979304 (0.00086) [2022-07-11 01:36:51,339][25689] Fps is (10 sec: 5694.7, 60 sec: 5539.3, 300 sec: 5539.7). Total num frames: 1002808320. Throughput: 0: 4967.0. Samples: 1002803448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:36:51,339][25689] Avg episode reward: [(0, '0.050')] [2022-07-11 01:36:52,899][26022] Updated weights on worker 0-0, policy_version 979314 (0.00083) [2022-07-11 01:36:54,982][26022] Updated weights on worker 0-0, policy_version 979324 (0.00091) [2022-07-11 01:36:56,358][25689] Fps is (10 sec: 5402.0, 60 sec: 5506.3, 300 sec: 5534.5). Total num frames: 1002834944. Throughput: 0: 5777.7. Samples: 1002836580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:36:56,358][25689] Avg episode reward: [(0, '-0.515')] [2022-07-11 01:36:56,548][26022] Updated weights on worker 0-0, policy_version 979334 (0.00087) [2022-07-11 01:36:58,680][26022] Updated weights on worker 0-0, policy_version 979344 (0.00092) [2022-07-11 01:37:00,254][26022] Updated weights on worker 0-0, policy_version 979354 (0.00086) [2022-07-11 01:37:01,414][25689] Fps is (10 sec: 5488.4, 60 sec: 5511.5, 300 sec: 5540.7). Total num frames: 1002863616. Throughput: 0: 5773.5. Samples: 1002869808. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:01,414][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 01:37:02,666][26022] Updated weights on worker 0-0, policy_version 979364 (0.00091) [2022-07-11 01:37:04,361][26022] Updated weights on worker 0-0, policy_version 979374 (0.00087) [2022-07-11 01:37:06,322][26022] Updated weights on worker 0-0, policy_version 979384 (0.00096) [2022-07-11 01:37:06,451][25689] Fps is (10 sec: 5478.6, 60 sec: 5509.6, 300 sec: 5536.7). Total num frames: 1002890240. Throughput: 0: 4860.8. Samples: 1002884722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:06,451][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 01:37:07,959][26022] Updated weights on worker 0-0, policy_version 979394 (0.00096) [2022-07-11 01:37:09,948][26022] Updated weights on worker 0-0, policy_version 979404 (0.00085) [2022-07-11 01:37:11,471][25689] Fps is (10 sec: 5498.3, 60 sec: 5541.8, 300 sec: 5540.1). Total num frames: 1002918912. Throughput: 0: 5714.7. Samples: 1002918496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:11,471][25689] Avg episode reward: [(0, '-0.919')] [2022-07-11 01:37:11,674][26022] Updated weights on worker 0-0, policy_version 979414 (0.00092) [2022-07-11 01:37:13,471][26022] Updated weights on worker 0-0, policy_version 979424 (0.00094) [2022-07-11 01:37:15,329][26022] Updated weights on worker 0-0, policy_version 979434 (0.00083) [2022-07-11 01:37:16,476][25689] Fps is (10 sec: 5413.3, 60 sec: 5492.0, 300 sec: 5527.3). Total num frames: 1002944512. Throughput: 0: 5745.2. Samples: 1002952166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:16,478][25689] Avg episode reward: [(0, '-1.432')] [2022-07-11 01:37:17,162][26022] Updated weights on worker 0-0, policy_version 979444 (0.00089) [2022-07-11 01:37:19,090][26022] Updated weights on worker 0-0, policy_version 979454 (0.00097) [2022-07-11 01:37:20,632][26022] Updated weights on worker 0-0, policy_version 979464 (0.00091) [2022-07-11 01:37:21,608][25689] Fps is (10 sec: 5556.0, 60 sec: 5521.0, 300 sec: 5542.6). Total num frames: 1002975232. Throughput: 0: 4921.2. Samples: 1002969186. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:21,609][25689] Avg episode reward: [(0, '-1.238')] [2022-07-11 01:37:22,726][26022] Updated weights on worker 0-0, policy_version 979474 (0.00086) [2022-07-11 01:37:23,501][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:37:23,513][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000979480_1002987520.pth [2022-07-11 01:37:23,513][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000977529_1000989696.pth [2022-07-11 01:37:24,326][26022] Updated weights on worker 0-0, policy_version 979484 (0.00091) [2022-07-11 01:37:26,355][26022] Updated weights on worker 0-0, policy_version 979494 (0.00087) [2022-07-11 01:37:26,623][25689] Fps is (10 sec: 5853.5, 60 sec: 5571.7, 300 sec: 5543.1). Total num frames: 1003003904. Throughput: 0: 5869.1. Samples: 1003003114. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:26,624][25689] Avg episode reward: [(0, '-0.871')] [2022-07-11 01:37:28,480][26022] Updated weights on worker 0-0, policy_version 979504 (0.00089) [2022-07-11 01:37:29,999][26022] Updated weights on worker 0-0, policy_version 979514 (0.00078) [2022-07-11 01:37:31,654][25689] Fps is (10 sec: 5503.9, 60 sec: 5522.3, 300 sec: 5536.0). Total num frames: 1003030528. Throughput: 0: 5853.4. Samples: 1003036638. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:31,655][25689] Avg episode reward: [(0, '-1.449')] [2022-07-11 01:37:31,960][26022] Updated weights on worker 0-0, policy_version 979524 (0.00082) [2022-07-11 01:37:33,627][26022] Updated weights on worker 0-0, policy_version 979534 (0.00087) [2022-07-11 01:37:35,477][26022] Updated weights on worker 0-0, policy_version 979544 (0.00086) [2022-07-11 01:37:36,679][25689] Fps is (10 sec: 5396.6, 60 sec: 5520.5, 300 sec: 5536.3). Total num frames: 1003058176. Throughput: 0: 5027.3. Samples: 1003053734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:36,681][25689] Avg episode reward: [(0, '-1.041')] [2022-07-11 01:37:37,218][26022] Updated weights on worker 0-0, policy_version 979554 (0.00085) [2022-07-11 01:37:39,143][26022] Updated weights on worker 0-0, policy_version 979564 (0.00094) [2022-07-11 01:37:40,748][26022] Updated weights on worker 0-0, policy_version 979574 (0.00092) [2022-07-11 01:37:41,774][25689] Fps is (10 sec: 5767.9, 60 sec: 5585.0, 300 sec: 5538.3). Total num frames: 1003088896. Throughput: 0: 5877.9. Samples: 1003087720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:41,774][25689] Avg episode reward: [(0, '0.018')] [2022-07-11 01:37:42,776][26022] Updated weights on worker 0-0, policy_version 979584 (0.00089) [2022-07-11 01:37:44,555][26022] Updated weights on worker 0-0, policy_version 979594 (0.00082) [2022-07-11 01:37:46,431][26022] Updated weights on worker 0-0, policy_version 979604 (0.00081) [2022-07-11 01:37:46,822][25689] Fps is (10 sec: 5755.0, 60 sec: 5550.6, 300 sec: 5541.4). Total num frames: 1003116544. Throughput: 0: 5847.9. Samples: 1003121236. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:46,822][25689] Avg episode reward: [(0, '0.217')] [2022-07-11 01:37:48,248][26022] Updated weights on worker 0-0, policy_version 979614 (0.00085) [2022-07-11 01:37:49,852][26022] Updated weights on worker 0-0, policy_version 979624 (0.00085) [2022-07-11 01:37:51,910][25689] Fps is (10 sec: 5455.4, 60 sec: 5545.1, 300 sec: 5536.7). Total num frames: 1003144192. Throughput: 0: 5009.7. Samples: 1003138112. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:51,910][25689] Avg episode reward: [(0, '0.285')] [2022-07-11 01:37:51,947][26022] Updated weights on worker 0-0, policy_version 979634 (0.00094) [2022-07-11 01:37:53,539][26022] Updated weights on worker 0-0, policy_version 979644 (0.00082) [2022-07-11 01:37:55,610][26022] Updated weights on worker 0-0, policy_version 979654 (0.00107) [2022-07-11 01:37:56,957][25689] Fps is (10 sec: 5557.0, 60 sec: 5576.3, 300 sec: 5541.4). Total num frames: 1003172864. Throughput: 0: 5830.2. Samples: 1003171954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:37:56,957][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 01:37:57,450][26022] Updated weights on worker 0-0, policy_version 979664 (0.00086) [2022-07-11 01:37:59,127][26022] Updated weights on worker 0-0, policy_version 979674 (0.00084) [2022-07-11 01:38:01,433][26022] Updated weights on worker 0-0, policy_version 979684 (0.00091) [2022-07-11 01:38:02,021][25689] Fps is (10 sec: 5468.9, 60 sec: 5541.8, 300 sec: 5540.9). Total num frames: 1003199488. Throughput: 0: 5793.2. Samples: 1003205016. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:02,021][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 01:38:03,148][26022] Updated weights on worker 0-0, policy_version 979694 (0.00081) [2022-07-11 01:38:05,201][26022] Updated weights on worker 0-0, policy_version 979704 (0.00090) [2022-07-11 01:38:07,052][25689] Fps is (10 sec: 5376.2, 60 sec: 5559.3, 300 sec: 5537.7). Total num frames: 1003227136. Throughput: 0: 5701.9. Samples: 1003236586. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:07,052][25689] Avg episode reward: [(0, '0.344')] [2022-07-11 01:38:07,059][26022] Updated weights on worker 0-0, policy_version 979714 (0.00092) [2022-07-11 01:38:08,748][26022] Updated weights on worker 0-0, policy_version 979724 (0.00613) [2022-07-11 01:38:10,694][26022] Updated weights on worker 0-0, policy_version 979734 (0.00091) [2022-07-11 01:38:12,123][25689] Fps is (10 sec: 5575.1, 60 sec: 5554.5, 300 sec: 5546.7). Total num frames: 1003255808. Throughput: 0: 5705.5. Samples: 1003253438. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:12,123][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 01:38:12,456][26022] Updated weights on worker 0-0, policy_version 979744 (0.00094) [2022-07-11 01:38:14,375][26022] Updated weights on worker 0-0, policy_version 979754 (0.00088) [2022-07-11 01:38:16,185][26022] Updated weights on worker 0-0, policy_version 979764 (0.00089) [2022-07-11 01:38:17,146][25689] Fps is (10 sec: 5680.8, 60 sec: 5603.6, 300 sec: 5537.1). Total num frames: 1003284480. Throughput: 0: 5701.9. Samples: 1003287072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:17,146][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 01:38:17,888][26022] Updated weights on worker 0-0, policy_version 979774 (0.00093) [2022-07-11 01:38:19,877][26022] Updated weights on worker 0-0, policy_version 979784 (0.00088) [2022-07-11 01:38:21,315][26022] Updated weights on worker 0-0, policy_version 979794 (0.00083) [2022-07-11 01:38:22,202][25689] Fps is (10 sec: 5689.3, 60 sec: 5576.7, 300 sec: 5543.0). Total num frames: 1003313152. Throughput: 0: 5750.0. Samples: 1003321058. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:22,203][25689] Avg episode reward: [(0, '0.856')] [2022-07-11 01:38:23,542][26022] Updated weights on worker 0-0, policy_version 979804 (0.00088) [2022-07-11 01:38:24,950][26022] Updated weights on worker 0-0, policy_version 979814 (0.00084) [2022-07-11 01:38:27,090][26022] Updated weights on worker 0-0, policy_version 979824 (0.00097) [2022-07-11 01:38:27,215][25689] Fps is (10 sec: 5593.6, 60 sec: 5560.1, 300 sec: 5547.2). Total num frames: 1003340800. Throughput: 0: 5027.7. Samples: 1003337960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:27,215][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 01:38:28,808][26022] Updated weights on worker 0-0, policy_version 979834 (0.00087) [2022-07-11 01:38:30,652][26022] Updated weights on worker 0-0, policy_version 979844 (0.00088) [2022-07-11 01:38:32,219][25689] Fps is (10 sec: 5520.1, 60 sec: 5579.4, 300 sec: 5537.1). Total num frames: 1003368448. Throughput: 0: 5873.3. Samples: 1003371470. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:32,220][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 01:38:32,607][26022] Updated weights on worker 0-0, policy_version 979854 (0.00084) [2022-07-11 01:38:34,221][26022] Updated weights on worker 0-0, policy_version 979864 (0.00088) [2022-07-11 01:38:36,148][26022] Updated weights on worker 0-0, policy_version 979874 (0.00091) [2022-07-11 01:38:37,235][25689] Fps is (10 sec: 5518.6, 60 sec: 5580.4, 300 sec: 5537.9). Total num frames: 1003396096. Throughput: 0: 5876.9. Samples: 1003405132. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:37,235][25689] Avg episode reward: [(0, '1.040')] [2022-07-11 01:38:37,943][26022] Updated weights on worker 0-0, policy_version 979884 (0.00085) [2022-07-11 01:38:39,805][26022] Updated weights on worker 0-0, policy_version 979894 (0.00084) [2022-07-11 01:38:41,725][26022] Updated weights on worker 0-0, policy_version 979904 (0.00088) [2022-07-11 01:38:42,363][25689] Fps is (10 sec: 5552.3, 60 sec: 5543.4, 300 sec: 5542.9). Total num frames: 1003424768. Throughput: 0: 4990.2. Samples: 1003421664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:42,363][25689] Avg episode reward: [(0, '0.720')] [2022-07-11 01:38:43,555][26022] Updated weights on worker 0-0, policy_version 979914 (0.00082) [2022-07-11 01:38:45,452][26022] Updated weights on worker 0-0, policy_version 979924 (0.00093) [2022-07-11 01:38:47,291][26022] Updated weights on worker 0-0, policy_version 979934 (0.00088) [2022-07-11 01:38:47,388][25689] Fps is (10 sec: 5546.8, 60 sec: 5545.5, 300 sec: 5535.6). Total num frames: 1003452416. Throughput: 0: 5812.5. Samples: 1003455218. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:47,389][25689] Avg episode reward: [(0, '0.321')] [2022-07-11 01:38:48,912][26022] Updated weights on worker 0-0, policy_version 979944 (0.00080) [2022-07-11 01:38:51,195][26022] Updated weights on worker 0-0, policy_version 979954 (0.00086) [2022-07-11 01:38:52,411][25689] Fps is (10 sec: 5605.1, 60 sec: 5568.4, 300 sec: 5542.5). Total num frames: 1003481088. Throughput: 0: 5792.9. Samples: 1003488438. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:52,411][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 01:38:52,636][26022] Updated weights on worker 0-0, policy_version 979964 (0.00091) [2022-07-11 01:38:54,847][26022] Updated weights on worker 0-0, policy_version 979974 (0.00087) [2022-07-11 01:38:56,432][26022] Updated weights on worker 0-0, policy_version 979984 (0.00093) [2022-07-11 01:38:57,416][25689] Fps is (10 sec: 5514.3, 60 sec: 5538.4, 300 sec: 5538.1). Total num frames: 1003507712. Throughput: 0: 4957.2. Samples: 1003505174. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:38:57,416][25689] Avg episode reward: [(0, '0.367')] [2022-07-11 01:38:58,534][26022] Updated weights on worker 0-0, policy_version 979994 (0.00084) [2022-07-11 01:39:00,082][26022] Updated weights on worker 0-0, policy_version 980004 (0.00086) [2022-07-11 01:39:02,486][25689] Fps is (10 sec: 5285.3, 60 sec: 5537.9, 300 sec: 5540.9). Total num frames: 1003534336. Throughput: 0: 5803.6. Samples: 1003538448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:02,486][25689] Avg episode reward: [(0, '0.356')] [2022-07-11 01:39:02,495][26022] Updated weights on worker 0-0, policy_version 980014 (0.00101) [2022-07-11 01:39:04,230][26022] Updated weights on worker 0-0, policy_version 980024 (0.00089) [2022-07-11 01:39:05,987][26022] Updated weights on worker 0-0, policy_version 980034 (0.00098) [2022-07-11 01:39:07,501][25689] Fps is (10 sec: 5483.1, 60 sec: 5556.3, 300 sec: 5541.1). Total num frames: 1003563008. Throughput: 0: 5702.5. Samples: 1003569910. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:07,503][25689] Avg episode reward: [(0, '-0.117')] [2022-07-11 01:39:07,947][26022] Updated weights on worker 0-0, policy_version 980044 (0.00087) [2022-07-11 01:39:09,620][26022] Updated weights on worker 0-0, policy_version 980054 (0.00090) [2022-07-11 01:39:11,588][26022] Updated weights on worker 0-0, policy_version 980064 (0.00089) [2022-07-11 01:39:12,510][25689] Fps is (10 sec: 5516.1, 60 sec: 5528.1, 300 sec: 5541.7). Total num frames: 1003589632. Throughput: 0: 4887.6. Samples: 1003586674. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:12,512][25689] Avg episode reward: [(0, '-0.288')] [2022-07-11 01:39:13,503][26022] Updated weights on worker 0-0, policy_version 980074 (0.00087) [2022-07-11 01:39:15,217][26022] Updated weights on worker 0-0, policy_version 980084 (0.00083) [2022-07-11 01:39:17,113][26022] Updated weights on worker 0-0, policy_version 980094 (0.00087) [2022-07-11 01:39:17,520][25689] Fps is (10 sec: 5518.8, 60 sec: 5529.2, 300 sec: 5536.3). Total num frames: 1003618304. Throughput: 0: 5723.0. Samples: 1003620232. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:17,521][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 01:39:18,826][26022] Updated weights on worker 0-0, policy_version 980104 (0.00079) [2022-07-11 01:39:20,791][26022] Updated weights on worker 0-0, policy_version 980114 (0.00089) [2022-07-11 01:39:22,525][26022] Updated weights on worker 0-0, policy_version 980124 (0.00087) [2022-07-11 01:39:22,568][25689] Fps is (10 sec: 5803.4, 60 sec: 5547.0, 300 sec: 5546.9). Total num frames: 1003648000. Throughput: 0: 5754.6. Samples: 1003654012. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:22,568][25689] Avg episode reward: [(0, '-0.104')] [2022-07-11 01:39:23,623][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:39:23,634][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000980130_1003653120.pth [2022-07-11 01:39:23,634][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000978180_1001656320.pth [2022-07-11 01:39:24,249][26022] Updated weights on worker 0-0, policy_version 980134 (0.00081) [2022-07-11 01:39:26,024][26022] Updated weights on worker 0-0, policy_version 980144 (0.00092) [2022-07-11 01:39:27,610][25689] Fps is (10 sec: 5683.3, 60 sec: 5544.2, 300 sec: 5542.8). Total num frames: 1003675648. Throughput: 0: 5030.9. Samples: 1003671080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:27,611][25689] Avg episode reward: [(0, '-0.201')] [2022-07-11 01:39:28,095][26022] Updated weights on worker 0-0, policy_version 980154 (0.00090) [2022-07-11 01:39:29,733][26022] Updated weights on worker 0-0, policy_version 980164 (0.00094) [2022-07-11 01:39:31,781][26022] Updated weights on worker 0-0, policy_version 980174 (0.00089) [2022-07-11 01:39:32,643][25689] Fps is (10 sec: 5589.7, 60 sec: 5558.6, 300 sec: 5549.8). Total num frames: 1003704320. Throughput: 0: 5854.2. Samples: 1003704538. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:32,644][25689] Avg episode reward: [(0, '0.038')] [2022-07-11 01:39:33,540][26022] Updated weights on worker 0-0, policy_version 980184 (0.00086) [2022-07-11 01:39:35,242][26022] Updated weights on worker 0-0, policy_version 980194 (0.00091) [2022-07-11 01:39:37,296][26022] Updated weights on worker 0-0, policy_version 980204 (0.00085) [2022-07-11 01:39:37,666][25689] Fps is (10 sec: 5498.7, 60 sec: 5540.9, 300 sec: 5540.2). Total num frames: 1003730944. Throughput: 0: 5863.1. Samples: 1003738350. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:37,667][25689] Avg episode reward: [(0, '0.331')] [2022-07-11 01:39:38,744][26022] Updated weights on worker 0-0, policy_version 980214 (0.00085) [2022-07-11 01:39:40,859][26022] Updated weights on worker 0-0, policy_version 980224 (0.00085) [2022-07-11 01:39:42,335][26022] Updated weights on worker 0-0, policy_version 980234 (0.00084) [2022-07-11 01:39:42,711][25689] Fps is (10 sec: 5696.0, 60 sec: 5582.6, 300 sec: 5546.9). Total num frames: 1003761664. Throughput: 0: 5031.2. Samples: 1003755356. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:42,711][25689] Avg episode reward: [(0, '0.478')] [2022-07-11 01:39:44,447][26022] Updated weights on worker 0-0, policy_version 980244 (0.00091) [2022-07-11 01:39:46,128][26022] Updated weights on worker 0-0, policy_version 980254 (0.00092) [2022-07-11 01:39:47,719][25689] Fps is (10 sec: 5704.3, 60 sec: 5567.2, 300 sec: 5547.8). Total num frames: 1003788288. Throughput: 0: 5862.1. Samples: 1003788960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:47,720][25689] Avg episode reward: [(0, '0.562')] [2022-07-11 01:39:48,018][26022] Updated weights on worker 0-0, policy_version 980264 (0.00089) [2022-07-11 01:39:50,081][26022] Updated weights on worker 0-0, policy_version 980274 (0.00089) [2022-07-11 01:39:51,804][26022] Updated weights on worker 0-0, policy_version 980284 (0.00091) [2022-07-11 01:39:52,725][25689] Fps is (10 sec: 5419.3, 60 sec: 5551.7, 300 sec: 5551.4). Total num frames: 1003815936. Throughput: 0: 5879.0. Samples: 1003822600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:52,726][25689] Avg episode reward: [(0, '-0.269')] [2022-07-11 01:39:53,534][26022] Updated weights on worker 0-0, policy_version 980294 (0.00090) [2022-07-11 01:39:55,357][26022] Updated weights on worker 0-0, policy_version 980304 (0.00086) [2022-07-11 01:39:57,118][26022] Updated weights on worker 0-0, policy_version 980314 (0.00095) [2022-07-11 01:39:57,731][25689] Fps is (10 sec: 5523.2, 60 sec: 5568.7, 300 sec: 5542.5). Total num frames: 1003843584. Throughput: 0: 5049.7. Samples: 1003839668. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:39:57,732][25689] Avg episode reward: [(0, '0.019')] [2022-07-11 01:39:59,115][26022] Updated weights on worker 0-0, policy_version 980324 (0.00086) [2022-07-11 01:40:00,945][26022] Updated weights on worker 0-0, policy_version 980334 (0.00092) [2022-07-11 01:40:02,787][25689] Fps is (10 sec: 5394.1, 60 sec: 5569.9, 300 sec: 5545.2). Total num frames: 1003870208. Throughput: 0: 5852.4. Samples: 1003872850. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:02,789][25689] Avg episode reward: [(0, '0.133')] [2022-07-11 01:40:02,971][26022] Updated weights on worker 0-0, policy_version 980344 (0.00089) [2022-07-11 01:40:04,990][26022] Updated weights on worker 0-0, policy_version 980354 (0.00089) [2022-07-11 01:40:06,609][26022] Updated weights on worker 0-0, policy_version 980364 (0.00087) [2022-07-11 01:40:07,790][25689] Fps is (10 sec: 5293.7, 60 sec: 5537.1, 300 sec: 5543.0). Total num frames: 1003896832. Throughput: 0: 5759.7. Samples: 1003904560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:07,790][25689] Avg episode reward: [(0, '0.338')] [2022-07-11 01:40:08,789][26022] Updated weights on worker 0-0, policy_version 980374 (0.00088) [2022-07-11 01:40:10,503][26022] Updated weights on worker 0-0, policy_version 980384 (0.00081) [2022-07-11 01:40:12,267][26022] Updated weights on worker 0-0, policy_version 980394 (0.00399) [2022-07-11 01:40:12,796][25689] Fps is (10 sec: 5627.0, 60 sec: 5588.3, 300 sec: 5544.2). Total num frames: 1003926528. Throughput: 0: 4916.0. Samples: 1003921266. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:12,796][25689] Avg episode reward: [(0, '0.024')] [2022-07-11 01:40:14,043][26022] Updated weights on worker 0-0, policy_version 980404 (0.00098) [2022-07-11 01:40:15,857][26022] Updated weights on worker 0-0, policy_version 980414 (0.00087) [2022-07-11 01:40:17,795][26022] Updated weights on worker 0-0, policy_version 980424 (0.00087) [2022-07-11 01:40:17,816][25689] Fps is (10 sec: 5719.5, 60 sec: 5570.5, 300 sec: 5545.2). Total num frames: 1003954176. Throughput: 0: 5727.9. Samples: 1003954712. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:17,816][25689] Avg episode reward: [(0, '0.902')] [2022-07-11 01:40:19,671][26022] Updated weights on worker 0-0, policy_version 980434 (0.00082) [2022-07-11 01:40:21,468][26022] Updated weights on worker 0-0, policy_version 980444 (0.00092) [2022-07-11 01:40:22,879][25689] Fps is (10 sec: 5484.1, 60 sec: 5535.0, 300 sec: 5545.0). Total num frames: 1003981824. Throughput: 0: 5731.6. Samples: 1003988010. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:22,881][25689] Avg episode reward: [(0, '1.622')] [2022-07-11 01:40:23,317][26022] Updated weights on worker 0-0, policy_version 980454 (0.00081) [2022-07-11 01:40:25,176][26022] Updated weights on worker 0-0, policy_version 980464 (0.00088) [2022-07-11 01:40:26,854][26022] Updated weights on worker 0-0, policy_version 980474 (0.00083) [2022-07-11 01:40:27,891][25689] Fps is (10 sec: 5488.4, 60 sec: 5537.9, 300 sec: 5545.2). Total num frames: 1004009472. Throughput: 0: 5005.2. Samples: 1004005170. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:27,891][25689] Avg episode reward: [(0, '1.491')] [2022-07-11 01:40:28,807][26022] Updated weights on worker 0-0, policy_version 980484 (0.00088) [2022-07-11 01:40:30,523][26022] Updated weights on worker 0-0, policy_version 980494 (0.00090) [2022-07-11 01:40:32,448][26022] Updated weights on worker 0-0, policy_version 980504 (0.00087) [2022-07-11 01:40:32,910][25689] Fps is (10 sec: 5716.3, 60 sec: 5556.1, 300 sec: 5549.8). Total num frames: 1004039168. Throughput: 0: 5840.3. Samples: 1004038742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:32,911][25689] Avg episode reward: [(0, '1.383')] [2022-07-11 01:40:34,382][26022] Updated weights on worker 0-0, policy_version 980514 (0.00086) [2022-07-11 01:40:36,079][26022] Updated weights on worker 0-0, policy_version 980524 (0.00087) [2022-07-11 01:40:37,918][25689] Fps is (10 sec: 5616.7, 60 sec: 5557.5, 300 sec: 5546.9). Total num frames: 1004065792. Throughput: 0: 5846.0. Samples: 1004072230. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 01:40:37,918][25689] Avg episode reward: [(0, '1.536')] [2022-07-11 01:40:38,045][26022] Updated weights on worker 0-0, policy_version 980534 (0.00092) [2022-07-11 01:40:39,791][26022] Updated weights on worker 0-0, policy_version 980544 (0.00089) [2022-07-11 01:40:41,663][26022] Updated weights on worker 0-0, policy_version 980554 (0.00089) [2022-07-11 01:40:43,000][25689] Fps is (10 sec: 5480.3, 60 sec: 5520.0, 300 sec: 5543.6). Total num frames: 1004094464. Throughput: 0: 5029.9. Samples: 1004089220. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:40:43,001][25689] Avg episode reward: [(0, '1.735')] [2022-07-11 01:40:43,461][26022] Updated weights on worker 0-0, policy_version 980564 (0.00089) [2022-07-11 01:40:45,305][26022] Updated weights on worker 0-0, policy_version 980574 (0.00091) [2022-07-11 01:40:47,179][26022] Updated weights on worker 0-0, policy_version 980584 (0.00087) [2022-07-11 01:40:48,004][25689] Fps is (10 sec: 5685.5, 60 sec: 5554.4, 300 sec: 5552.3). Total num frames: 1004123136. Throughput: 0: 5840.6. Samples: 1004122644. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:40:48,004][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 01:40:49,115][26022] Updated weights on worker 0-0, policy_version 980594 (0.00090) [2022-07-11 01:40:50,648][26022] Updated weights on worker 0-0, policy_version 980604 (0.00089) [2022-07-11 01:40:52,910][26022] Updated weights on worker 0-0, policy_version 980614 (0.00098) [2022-07-11 01:40:53,033][25689] Fps is (10 sec: 5409.7, 60 sec: 5518.4, 300 sec: 5541.9). Total num frames: 1004148736. Throughput: 0: 5829.8. Samples: 1004156052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:40:53,033][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 01:40:54,638][26022] Updated weights on worker 0-0, policy_version 980624 (0.00089) [2022-07-11 01:40:56,363][26022] Updated weights on worker 0-0, policy_version 980634 (0.00086) [2022-07-11 01:40:58,054][25689] Fps is (10 sec: 5603.8, 60 sec: 5567.8, 300 sec: 5550.5). Total num frames: 1004179456. Throughput: 0: 4988.3. Samples: 1004172676. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:40:58,055][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 01:40:58,062][26022] Updated weights on worker 0-0, policy_version 980644 (0.00079) [2022-07-11 01:40:59,983][26022] Updated weights on worker 0-0, policy_version 980654 (0.00085) [2022-07-11 01:41:02,375][26022] Updated weights on worker 0-0, policy_version 980664 (0.00099) [2022-07-11 01:41:03,169][25689] Fps is (10 sec: 5455.3, 60 sec: 5528.5, 300 sec: 5541.8). Total num frames: 1004204032. Throughput: 0: 5792.4. Samples: 1004206044. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:03,170][25689] Avg episode reward: [(0, '1.194')] [2022-07-11 01:41:04,136][26022] Updated weights on worker 0-0, policy_version 980674 (0.00084) [2022-07-11 01:41:05,903][26022] Updated weights on worker 0-0, policy_version 980684 (0.00089) [2022-07-11 01:41:07,776][26022] Updated weights on worker 0-0, policy_version 980694 (0.00091) [2022-07-11 01:41:08,205][25689] Fps is (10 sec: 5246.0, 60 sec: 5559.4, 300 sec: 5548.1). Total num frames: 1004232704. Throughput: 0: 5691.8. Samples: 1004237622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:08,205][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 01:41:09,631][26022] Updated weights on worker 0-0, policy_version 980704 (0.00089) [2022-07-11 01:41:11,479][26022] Updated weights on worker 0-0, policy_version 980714 (0.00091) [2022-07-11 01:41:13,215][25689] Fps is (10 sec: 5606.0, 60 sec: 5525.1, 300 sec: 5544.7). Total num frames: 1004260352. Throughput: 0: 4875.3. Samples: 1004254448. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:13,216][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 01:41:13,219][26022] Updated weights on worker 0-0, policy_version 980724 (0.00096) [2022-07-11 01:41:15,216][26022] Updated weights on worker 0-0, policy_version 980734 (0.00088) [2022-07-11 01:41:16,832][26022] Updated weights on worker 0-0, policy_version 980744 (0.00094) [2022-07-11 01:41:18,225][25689] Fps is (10 sec: 5620.5, 60 sec: 5543.0, 300 sec: 5546.0). Total num frames: 1004289024. Throughput: 0: 5722.2. Samples: 1004288096. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:18,227][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 01:41:18,806][26022] Updated weights on worker 0-0, policy_version 980754 (0.00086) [2022-07-11 01:41:20,559][26022] Updated weights on worker 0-0, policy_version 980764 (0.00096) [2022-07-11 01:41:22,709][26022] Updated weights on worker 0-0, policy_version 980774 (0.00087) [2022-07-11 01:41:23,312][25689] Fps is (10 sec: 5578.0, 60 sec: 5540.8, 300 sec: 5551.5). Total num frames: 1004316672. Throughput: 0: 5740.4. Samples: 1004321674. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:23,313][25689] Avg episode reward: [(0, '0.074')] [2022-07-11 01:41:23,662][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:41:23,668][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000980782_1004320768.pth [2022-07-11 01:41:23,669][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000978830_1002321920.pth [2022-07-11 01:41:24,004][26022] Updated weights on worker 0-0, policy_version 980784 (0.00084) [2022-07-11 01:41:26,195][26022] Updated weights on worker 0-0, policy_version 980794 (0.00087) [2022-07-11 01:41:27,747][26022] Updated weights on worker 0-0, policy_version 980804 (0.00091) [2022-07-11 01:41:28,351][25689] Fps is (10 sec: 5561.7, 60 sec: 5555.2, 300 sec: 5548.2). Total num frames: 1004345344. Throughput: 0: 5010.7. Samples: 1004338572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:28,353][25689] Avg episode reward: [(0, '0.698')] [2022-07-11 01:41:29,738][26022] Updated weights on worker 0-0, policy_version 980814 (0.00087) [2022-07-11 01:41:31,537][26022] Updated weights on worker 0-0, policy_version 980824 (0.00084) [2022-07-11 01:41:33,348][26022] Updated weights on worker 0-0, policy_version 980834 (0.00085) [2022-07-11 01:41:33,373][25689] Fps is (10 sec: 5699.4, 60 sec: 5538.1, 300 sec: 5551.4). Total num frames: 1004374016. Throughput: 0: 5850.0. Samples: 1004372374. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:33,374][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 01:41:35,306][26022] Updated weights on worker 0-0, policy_version 980844 (0.00096) [2022-07-11 01:41:37,078][26022] Updated weights on worker 0-0, policy_version 980854 (0.00087) [2022-07-11 01:41:38,392][25689] Fps is (10 sec: 5507.0, 60 sec: 5537.0, 300 sec: 5552.1). Total num frames: 1004400640. Throughput: 0: 5837.4. Samples: 1004405822. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:38,394][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 01:41:38,876][26022] Updated weights on worker 0-0, policy_version 980864 (0.00086) [2022-07-11 01:41:40,777][26022] Updated weights on worker 0-0, policy_version 980874 (0.00087) [2022-07-11 01:41:42,460][26022] Updated weights on worker 0-0, policy_version 980884 (0.00087) [2022-07-11 01:41:43,423][25689] Fps is (10 sec: 5604.1, 60 sec: 5558.7, 300 sec: 5552.3). Total num frames: 1004430336. Throughput: 0: 5850.6. Samples: 1004439336. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:43,431][25689] Avg episode reward: [(0, '0.636')] [2022-07-11 01:41:44,569][26022] Updated weights on worker 0-0, policy_version 980894 (0.00089) [2022-07-11 01:41:46,212][26022] Updated weights on worker 0-0, policy_version 980904 (0.00087) [2022-07-11 01:41:48,035][26022] Updated weights on worker 0-0, policy_version 980914 (0.00088) [2022-07-11 01:41:48,460][25689] Fps is (10 sec: 5696.2, 60 sec: 5538.7, 300 sec: 5552.2). Total num frames: 1004457984. Throughput: 0: 5846.0. Samples: 1004456124. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:48,460][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 01:41:50,116][26022] Updated weights on worker 0-0, policy_version 980924 (0.00094) [2022-07-11 01:41:51,651][26022] Updated weights on worker 0-0, policy_version 980934 (0.00087) [2022-07-11 01:41:53,473][25689] Fps is (10 sec: 5502.3, 60 sec: 5574.1, 300 sec: 5555.7). Total num frames: 1004485632. Throughput: 0: 5834.3. Samples: 1004489640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:53,473][25689] Avg episode reward: [(0, '-0.018')] [2022-07-11 01:41:53,658][26022] Updated weights on worker 0-0, policy_version 980944 (0.00082) [2022-07-11 01:41:55,308][26022] Updated weights on worker 0-0, policy_version 980954 (0.00084) [2022-07-11 01:41:57,204][26022] Updated weights on worker 0-0, policy_version 980964 (0.00089) [2022-07-11 01:41:58,497][25689] Fps is (10 sec: 5610.9, 60 sec: 5539.9, 300 sec: 5556.3). Total num frames: 1004514304. Throughput: 0: 5848.0. Samples: 1004523394. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:41:58,498][25689] Avg episode reward: [(0, '-0.061')] [2022-07-11 01:41:59,082][26022] Updated weights on worker 0-0, policy_version 980974 (0.00088) [2022-07-11 01:42:00,958][26022] Updated weights on worker 0-0, policy_version 980984 (0.00096) [2022-07-11 01:42:03,192][26022] Updated weights on worker 0-0, policy_version 980994 (0.00091) [2022-07-11 01:42:03,568][25689] Fps is (10 sec: 5477.5, 60 sec: 5577.8, 300 sec: 5555.7). Total num frames: 1004540928. Throughput: 0: 4977.2. Samples: 1004539600. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:03,570][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 01:42:05,016][26022] Updated weights on worker 0-0, policy_version 981004 (0.00096) [2022-07-11 01:42:06,840][26022] Updated weights on worker 0-0, policy_version 981014 (0.00092) [2022-07-11 01:42:08,590][25689] Fps is (10 sec: 5275.7, 60 sec: 5545.2, 300 sec: 5548.7). Total num frames: 1004567552. Throughput: 0: 5749.1. Samples: 1004571858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:08,591][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 01:42:08,722][26022] Updated weights on worker 0-0, policy_version 981024 (0.00079) [2022-07-11 01:42:10,456][26022] Updated weights on worker 0-0, policy_version 981034 (0.00087) [2022-07-11 01:42:12,217][26022] Updated weights on worker 0-0, policy_version 981044 (0.00087) [2022-07-11 01:42:13,601][25689] Fps is (10 sec: 5511.3, 60 sec: 5562.1, 300 sec: 5559.0). Total num frames: 1004596224. Throughput: 0: 5774.6. Samples: 1004605872. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:13,603][25689] Avg episode reward: [(0, '-0.385')] [2022-07-11 01:42:14,066][26022] Updated weights on worker 0-0, policy_version 981054 (0.00086) [2022-07-11 01:42:15,912][26022] Updated weights on worker 0-0, policy_version 981064 (0.00085) [2022-07-11 01:42:17,719][26022] Updated weights on worker 0-0, policy_version 981074 (0.00092) [2022-07-11 01:42:18,614][25689] Fps is (10 sec: 5618.5, 60 sec: 5544.8, 300 sec: 5550.8). Total num frames: 1004623872. Throughput: 0: 4946.5. Samples: 1004622904. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:18,615][25689] Avg episode reward: [(0, '-0.932')] [2022-07-11 01:42:19,392][26022] Updated weights on worker 0-0, policy_version 981084 (0.00092) [2022-07-11 01:42:21,603][26022] Updated weights on worker 0-0, policy_version 981094 (0.00090) [2022-07-11 01:42:23,011][26022] Updated weights on worker 0-0, policy_version 981104 (0.00081) [2022-07-11 01:42:23,695][25689] Fps is (10 sec: 5681.2, 60 sec: 5579.4, 300 sec: 5553.1). Total num frames: 1004653568. Throughput: 0: 5811.8. Samples: 1004656572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:23,695][25689] Avg episode reward: [(0, '-0.668')] [2022-07-11 01:42:25,199][26022] Updated weights on worker 0-0, policy_version 981114 (0.00085) [2022-07-11 01:42:26,644][26022] Updated weights on worker 0-0, policy_version 981124 (0.00086) [2022-07-11 01:42:28,705][25689] Fps is (10 sec: 5581.1, 60 sec: 5548.1, 300 sec: 5553.5). Total num frames: 1004680192. Throughput: 0: 5896.0. Samples: 1004690458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:28,706][25689] Avg episode reward: [(0, '-0.419')] [2022-07-11 01:42:28,791][26022] Updated weights on worker 0-0, policy_version 981134 (0.00084) [2022-07-11 01:42:30,134][26022] Updated weights on worker 0-0, policy_version 981144 (0.00095) [2022-07-11 01:42:32,445][26022] Updated weights on worker 0-0, policy_version 981154 (0.00508) [2022-07-11 01:42:33,715][25689] Fps is (10 sec: 5722.7, 60 sec: 5583.2, 300 sec: 5564.1). Total num frames: 1004710912. Throughput: 0: 5042.7. Samples: 1004707300. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:33,716][25689] Avg episode reward: [(0, '-0.620')] [2022-07-11 01:42:33,946][26022] Updated weights on worker 0-0, policy_version 981164 (0.00090) [2022-07-11 01:42:36,025][26022] Updated weights on worker 0-0, policy_version 981174 (0.00071) [2022-07-11 01:42:37,735][26022] Updated weights on worker 0-0, policy_version 981184 (0.00087) [2022-07-11 01:42:38,740][25689] Fps is (10 sec: 5612.5, 60 sec: 5565.6, 300 sec: 5548.2). Total num frames: 1004736512. Throughput: 0: 5859.0. Samples: 1004740822. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:38,742][25689] Avg episode reward: [(0, '-0.282')] [2022-07-11 01:42:39,529][26022] Updated weights on worker 0-0, policy_version 981194 (0.00091) [2022-07-11 01:42:41,513][26022] Updated weights on worker 0-0, policy_version 981204 (0.00085) [2022-07-11 01:42:43,134][26022] Updated weights on worker 0-0, policy_version 981214 (0.00086) [2022-07-11 01:42:43,869][25689] Fps is (10 sec: 5445.9, 60 sec: 5556.6, 300 sec: 5553.5). Total num frames: 1004766208. Throughput: 0: 5851.8. Samples: 1004774626. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:43,871][25689] Avg episode reward: [(0, '-0.199')] [2022-07-11 01:42:45,279][26022] Updated weights on worker 0-0, policy_version 981224 (0.00090) [2022-07-11 01:42:46,925][26022] Updated weights on worker 0-0, policy_version 981234 (0.00086) [2022-07-11 01:42:48,696][26022] Updated weights on worker 0-0, policy_version 981244 (0.00088) [2022-07-11 01:42:48,883][25689] Fps is (10 sec: 5855.6, 60 sec: 5592.5, 300 sec: 5561.8). Total num frames: 1004795904. Throughput: 0: 5009.2. Samples: 1004791532. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:48,884][25689] Avg episode reward: [(0, '0.780')] [2022-07-11 01:42:50,482][26022] Updated weights on worker 0-0, policy_version 981254 (0.00086) [2022-07-11 01:42:52,263][26022] Updated weights on worker 0-0, policy_version 981264 (0.00090) [2022-07-11 01:42:53,888][25689] Fps is (10 sec: 5723.5, 60 sec: 5593.3, 300 sec: 5559.2). Total num frames: 1004823552. Throughput: 0: 5871.8. Samples: 1004825750. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:53,888][25689] Avg episode reward: [(0, '1.299')] [2022-07-11 01:42:54,015][26022] Updated weights on worker 0-0, policy_version 981274 (0.00095) [2022-07-11 01:42:56,119][26022] Updated weights on worker 0-0, policy_version 981284 (0.00087) [2022-07-11 01:42:57,605][26022] Updated weights on worker 0-0, policy_version 981294 (0.00086) [2022-07-11 01:42:58,921][25689] Fps is (10 sec: 5406.9, 60 sec: 5558.6, 300 sec: 5559.7). Total num frames: 1004850176. Throughput: 0: 5892.9. Samples: 1004859742. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:42:58,922][25689] Avg episode reward: [(0, '0.437')] [2022-07-11 01:42:59,595][26022] Updated weights on worker 0-0, policy_version 981304 (0.00095) [2022-07-11 01:43:01,181][26022] Updated weights on worker 0-0, policy_version 981314 (0.00092) [2022-07-11 01:43:03,576][26022] Updated weights on worker 0-0, policy_version 981324 (0.00087) [2022-07-11 01:43:04,013][25689] Fps is (10 sec: 5360.1, 60 sec: 5573.6, 300 sec: 5558.6). Total num frames: 1004877824. Throughput: 0: 5056.1. Samples: 1004876476. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:04,013][25689] Avg episode reward: [(0, '0.038')] [2022-07-11 01:43:05,570][26022] Updated weights on worker 0-0, policy_version 981334 (0.00090) [2022-07-11 01:43:07,381][26022] Updated weights on worker 0-0, policy_version 981344 (0.00085) [2022-07-11 01:43:08,950][26022] Updated weights on worker 0-0, policy_version 981354 (0.00097) [2022-07-11 01:43:09,047][25689] Fps is (10 sec: 5561.6, 60 sec: 5606.4, 300 sec: 5559.3). Total num frames: 1004906496. Throughput: 0: 5768.3. Samples: 1004907844. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:09,048][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 01:43:10,987][26022] Updated weights on worker 0-0, policy_version 981364 (0.00091) [2022-07-11 01:43:12,652][26022] Updated weights on worker 0-0, policy_version 981374 (0.00095) [2022-07-11 01:43:14,051][25689] Fps is (10 sec: 5610.8, 60 sec: 5590.1, 300 sec: 5556.2). Total num frames: 1004934144. Throughput: 0: 5739.9. Samples: 1004941484. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:14,051][25689] Avg episode reward: [(0, '0.852')] [2022-07-11 01:43:14,736][26022] Updated weights on worker 0-0, policy_version 981384 (0.00088) [2022-07-11 01:43:16,350][26022] Updated weights on worker 0-0, policy_version 981394 (0.00081) [2022-07-11 01:43:18,258][26022] Updated weights on worker 0-0, policy_version 981404 (0.00115) [2022-07-11 01:43:19,095][25689] Fps is (10 sec: 5503.4, 60 sec: 5587.3, 300 sec: 5553.0). Total num frames: 1004961792. Throughput: 0: 4883.2. Samples: 1004958256. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:19,095][25689] Avg episode reward: [(0, '-0.958')] [2022-07-11 01:43:20,085][26022] Updated weights on worker 0-0, policy_version 981414 (0.00089) [2022-07-11 01:43:21,839][26022] Updated weights on worker 0-0, policy_version 981424 (0.00088) [2022-07-11 01:43:23,569][26022] Updated weights on worker 0-0, policy_version 981434 (0.00086) [2022-07-11 01:43:23,951][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:43:23,961][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000981435_1004989440.pth [2022-07-11 01:43:23,962][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000979480_1002987520.pth [2022-07-11 01:43:24,139][25689] Fps is (10 sec: 5582.6, 60 sec: 5573.6, 300 sec: 5555.8). Total num frames: 1004990464. Throughput: 0: 5736.2. Samples: 1004991924. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:24,140][25689] Avg episode reward: [(0, '-1.850')] [2022-07-11 01:43:25,580][26022] Updated weights on worker 0-0, policy_version 981444 (0.00083) [2022-07-11 01:43:27,350][26022] Updated weights on worker 0-0, policy_version 981454 (0.00092) [2022-07-11 01:43:29,141][25689] Fps is (10 sec: 5504.0, 60 sec: 5574.4, 300 sec: 5552.4). Total num frames: 1005017088. Throughput: 0: 5862.1. Samples: 1005025638. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:29,142][25689] Avg episode reward: [(0, '-1.417')] [2022-07-11 01:43:29,415][26022] Updated weights on worker 0-0, policy_version 981464 (0.00089) [2022-07-11 01:43:30,853][26022] Updated weights on worker 0-0, policy_version 981474 (0.00086) [2022-07-11 01:43:33,016][26022] Updated weights on worker 0-0, policy_version 981484 (0.00089) [2022-07-11 01:43:34,181][25689] Fps is (10 sec: 5710.2, 60 sec: 5571.6, 300 sec: 5562.3). Total num frames: 1005047808. Throughput: 0: 5017.1. Samples: 1005042482. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:34,182][25689] Avg episode reward: [(0, '-0.499')] [2022-07-11 01:43:34,634][26022] Updated weights on worker 0-0, policy_version 981494 (0.00087) [2022-07-11 01:43:36,657][26022] Updated weights on worker 0-0, policy_version 981504 (0.00089) [2022-07-11 01:43:38,307][26022] Updated weights on worker 0-0, policy_version 981514 (0.00084) [2022-07-11 01:43:39,194][25689] Fps is (10 sec: 5704.2, 60 sec: 5589.7, 300 sec: 5557.6). Total num frames: 1005074432. Throughput: 0: 5860.1. Samples: 1005076042. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:39,195][25689] Avg episode reward: [(0, '-0.777')] [2022-07-11 01:43:40,240][26022] Updated weights on worker 0-0, policy_version 981524 (0.00089) [2022-07-11 01:43:41,931][26022] Updated weights on worker 0-0, policy_version 981534 (0.00086) [2022-07-11 01:43:43,853][26022] Updated weights on worker 0-0, policy_version 981544 (0.00091) [2022-07-11 01:43:44,336][25689] Fps is (10 sec: 5445.5, 60 sec: 5571.5, 300 sec: 5558.8). Total num frames: 1005103104. Throughput: 0: 5831.6. Samples: 1005109704. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:44,337][25689] Avg episode reward: [(0, '-0.300')] [2022-07-11 01:43:45,712][26022] Updated weights on worker 0-0, policy_version 981554 (0.00091) [2022-07-11 01:43:47,481][26022] Updated weights on worker 0-0, policy_version 981564 (0.00086) [2022-07-11 01:43:49,202][26022] Updated weights on worker 0-0, policy_version 981574 (0.00087) [2022-07-11 01:43:49,368][25689] Fps is (10 sec: 5737.2, 60 sec: 5569.9, 300 sec: 5562.1). Total num frames: 1005132800. Throughput: 0: 4994.5. Samples: 1005126660. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:49,368][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 01:43:51,312][26022] Updated weights on worker 0-0, policy_version 981584 (0.00089) [2022-07-11 01:43:52,839][26022] Updated weights on worker 0-0, policy_version 981594 (0.00093) [2022-07-11 01:43:54,389][25689] Fps is (10 sec: 5500.6, 60 sec: 5534.6, 300 sec: 5558.4). Total num frames: 1005158400. Throughput: 0: 5827.2. Samples: 1005160234. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:54,389][25689] Avg episode reward: [(0, '1.543')] [2022-07-11 01:43:55,011][26022] Updated weights on worker 0-0, policy_version 981604 (0.00084) [2022-07-11 01:43:56,456][26022] Updated weights on worker 0-0, policy_version 981614 (0.00089) [2022-07-11 01:43:58,522][26022] Updated weights on worker 0-0, policy_version 981624 (0.00092) [2022-07-11 01:43:59,407][25689] Fps is (10 sec: 5508.2, 60 sec: 5586.7, 300 sec: 5569.7). Total num frames: 1005188096. Throughput: 0: 5835.4. Samples: 1005193990. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:43:59,414][25689] Avg episode reward: [(0, '1.144')] [2022-07-11 01:44:00,080][26022] Updated weights on worker 0-0, policy_version 981634 (0.00087) [2022-07-11 01:44:02,565][26022] Updated weights on worker 0-0, policy_version 981644 (0.00086) [2022-07-11 01:44:04,219][26022] Updated weights on worker 0-0, policy_version 981654 (0.00066) [2022-07-11 01:44:04,536][25689] Fps is (10 sec: 5550.0, 60 sec: 5566.4, 300 sec: 5560.7). Total num frames: 1005214720. Throughput: 0: 4994.6. Samples: 1005210596. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:04,537][25689] Avg episode reward: [(0, '0.924')] [2022-07-11 01:44:06,323][26022] Updated weights on worker 0-0, policy_version 981664 (0.00087) [2022-07-11 01:44:07,969][26022] Updated weights on worker 0-0, policy_version 981674 (0.00087) [2022-07-11 01:44:09,617][25689] Fps is (10 sec: 5315.1, 60 sec: 5545.2, 300 sec: 5562.8). Total num frames: 1005242368. Throughput: 0: 5692.3. Samples: 1005241928. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:09,618][25689] Avg episode reward: [(0, '0.194')] [2022-07-11 01:44:10,019][26022] Updated weights on worker 0-0, policy_version 981684 (0.00093) [2022-07-11 01:44:11,668][26022] Updated weights on worker 0-0, policy_version 981694 (0.00093) [2022-07-11 01:44:13,765][26022] Updated weights on worker 0-0, policy_version 981704 (0.00092) [2022-07-11 01:44:14,714][25689] Fps is (10 sec: 5433.1, 60 sec: 5536.7, 300 sec: 5557.7). Total num frames: 1005270016. Throughput: 0: 5674.6. Samples: 1005275572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:14,714][25689] Avg episode reward: [(0, '0.362')] [2022-07-11 01:44:15,283][26022] Updated weights on worker 0-0, policy_version 981714 (0.00087) [2022-07-11 01:44:17,306][26022] Updated weights on worker 0-0, policy_version 981724 (0.00089) [2022-07-11 01:44:18,863][26022] Updated weights on worker 0-0, policy_version 981734 (0.00087) [2022-07-11 01:44:19,811][25689] Fps is (10 sec: 5525.1, 60 sec: 5548.7, 300 sec: 5553.3). Total num frames: 1005298688. Throughput: 0: 4817.4. Samples: 1005292286. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:19,813][25689] Avg episode reward: [(0, '0.103')] [2022-07-11 01:44:21,030][26022] Updated weights on worker 0-0, policy_version 981744 (0.00092) [2022-07-11 01:44:22,672][26022] Updated weights on worker 0-0, policy_version 981754 (0.00092) [2022-07-11 01:44:24,658][26022] Updated weights on worker 0-0, policy_version 981764 (0.00094) [2022-07-11 01:44:24,907][25689] Fps is (10 sec: 5625.8, 60 sec: 5544.0, 300 sec: 5555.8). Total num frames: 1005327360. Throughput: 0: 5649.6. Samples: 1005325682. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:24,907][25689] Avg episode reward: [(0, '0.299')] [2022-07-11 01:44:26,249][26022] Updated weights on worker 0-0, policy_version 981774 (0.00085) [2022-07-11 01:44:28,332][26022] Updated weights on worker 0-0, policy_version 981784 (0.00089) [2022-07-11 01:44:29,937][25689] Fps is (10 sec: 5764.2, 60 sec: 5592.0, 300 sec: 5559.3). Total num frames: 1005357056. Throughput: 0: 5787.8. Samples: 1005359534. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:29,937][25689] Avg episode reward: [(0, '-0.368')] [2022-07-11 01:44:29,947][26022] Updated weights on worker 0-0, policy_version 981794 (0.00097) [2022-07-11 01:44:31,971][26022] Updated weights on worker 0-0, policy_version 981804 (0.00089) [2022-07-11 01:44:33,491][26022] Updated weights on worker 0-0, policy_version 981814 (0.00078) [2022-07-11 01:44:34,987][25689] Fps is (10 sec: 5689.1, 60 sec: 5540.6, 300 sec: 5562.2). Total num frames: 1005384704. Throughput: 0: 5798.2. Samples: 1005393118. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 01:44:34,987][25689] Avg episode reward: [(0, '0.015')] [2022-07-11 01:44:35,674][26022] Updated weights on worker 0-0, policy_version 981824 (0.00086) [2022-07-11 01:44:37,130][26022] Updated weights on worker 0-0, policy_version 981834 (0.00090) [2022-07-11 01:44:39,430][26022] Updated weights on worker 0-0, policy_version 981844 (0.00088) [2022-07-11 01:44:40,035][25689] Fps is (10 sec: 5374.7, 60 sec: 5537.4, 300 sec: 5548.4). Total num frames: 1005411328. Throughput: 0: 5810.0. Samples: 1005409786. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:44:40,035][25689] Avg episode reward: [(0, '0.937')] [2022-07-11 01:44:40,847][26022] Updated weights on worker 0-0, policy_version 981854 (0.00080) [2022-07-11 01:44:43,149][26022] Updated weights on worker 0-0, policy_version 981864 (0.00923) [2022-07-11 01:44:44,618][26022] Updated weights on worker 0-0, policy_version 981874 (0.00090) [2022-07-11 01:44:45,181][25689] Fps is (10 sec: 5524.5, 60 sec: 5553.7, 300 sec: 5556.1). Total num frames: 1005441024. Throughput: 0: 5812.2. Samples: 1005443520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:44:45,182][25689] Avg episode reward: [(0, '0.947')] [2022-07-11 01:44:46,567][26022] Updated weights on worker 0-0, policy_version 981884 (0.00096) [2022-07-11 01:44:48,376][26022] Updated weights on worker 0-0, policy_version 981894 (0.00091) [2022-07-11 01:44:50,224][25689] Fps is (10 sec: 5628.0, 60 sec: 5519.1, 300 sec: 5555.4). Total num frames: 1005468672. Throughput: 0: 5801.9. Samples: 1005477236. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:44:50,224][25689] Avg episode reward: [(0, '0.636')] [2022-07-11 01:44:50,275][26022] Updated weights on worker 0-0, policy_version 981904 (0.00087) [2022-07-11 01:44:52,053][26022] Updated weights on worker 0-0, policy_version 981914 (0.00086) [2022-07-11 01:44:54,148][26022] Updated weights on worker 0-0, policy_version 981924 (0.00083) [2022-07-11 01:44:55,274][25689] Fps is (10 sec: 5681.7, 60 sec: 5583.7, 300 sec: 5561.5). Total num frames: 1005498368. Throughput: 0: 4971.4. Samples: 1005493974. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:44:55,275][25689] Avg episode reward: [(0, '1.502')] [2022-07-11 01:44:55,561][26022] Updated weights on worker 0-0, policy_version 981934 (0.00085) [2022-07-11 01:44:57,563][26022] Updated weights on worker 0-0, policy_version 981944 (0.00084) [2022-07-11 01:44:59,329][26022] Updated weights on worker 0-0, policy_version 981954 (0.00086) [2022-07-11 01:45:00,287][25689] Fps is (10 sec: 5596.8, 60 sec: 5533.8, 300 sec: 5562.3). Total num frames: 1005524992. Throughput: 0: 5824.5. Samples: 1005527744. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:00,287][25689] Avg episode reward: [(0, '0.970')] [2022-07-11 01:45:01,150][26022] Updated weights on worker 0-0, policy_version 981964 (0.00095) [2022-07-11 01:45:03,392][26022] Updated weights on worker 0-0, policy_version 981974 (0.00081) [2022-07-11 01:45:05,159][26022] Updated weights on worker 0-0, policy_version 981984 (0.00092) [2022-07-11 01:45:05,342][25689] Fps is (10 sec: 5390.8, 60 sec: 5557.4, 300 sec: 5564.8). Total num frames: 1005552640. Throughput: 0: 5762.6. Samples: 1005559696. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:05,342][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 01:45:06,900][26022] Updated weights on worker 0-0, policy_version 981994 (0.00085) [2022-07-11 01:45:08,911][26022] Updated weights on worker 0-0, policy_version 982004 (0.00085) [2022-07-11 01:45:10,347][25689] Fps is (10 sec: 5496.6, 60 sec: 5564.4, 300 sec: 5557.9). Total num frames: 1005580288. Throughput: 0: 4927.7. Samples: 1005576396. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:10,347][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 01:45:10,606][26022] Updated weights on worker 0-0, policy_version 982014 (0.00087) [2022-07-11 01:45:12,557][26022] Updated weights on worker 0-0, policy_version 982024 (0.00097) [2022-07-11 01:45:14,108][26022] Updated weights on worker 0-0, policy_version 982034 (0.00087) [2022-07-11 01:45:15,355][25689] Fps is (10 sec: 5420.2, 60 sec: 5555.6, 300 sec: 5554.7). Total num frames: 1005606912. Throughput: 0: 5790.8. Samples: 1005610256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:15,357][25689] Avg episode reward: [(0, '0.154')] [2022-07-11 01:45:16,036][26022] Updated weights on worker 0-0, policy_version 982044 (0.00092) [2022-07-11 01:45:18,060][26022] Updated weights on worker 0-0, policy_version 982054 (0.00087) [2022-07-11 01:45:19,871][26022] Updated weights on worker 0-0, policy_version 982064 (0.00085) [2022-07-11 01:45:20,372][25689] Fps is (10 sec: 5617.7, 60 sec: 5579.8, 300 sec: 5562.4). Total num frames: 1005636608. Throughput: 0: 5780.9. Samples: 1005643856. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:20,373][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 01:45:21,708][26022] Updated weights on worker 0-0, policy_version 982074 (0.00090) [2022-07-11 01:45:23,611][26022] Updated weights on worker 0-0, policy_version 982084 (0.00084) [2022-07-11 01:45:24,138][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:45:24,150][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000982087_1005657088.pth [2022-07-11 01:45:24,151][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000980130_1003653120.pth [2022-07-11 01:45:25,300][26022] Updated weights on worker 0-0, policy_version 982094 (0.00075) [2022-07-11 01:45:25,494][25689] Fps is (10 sec: 5655.8, 60 sec: 5560.6, 300 sec: 5560.4). Total num frames: 1005664256. Throughput: 0: 4997.9. Samples: 1005660414. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:25,495][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 01:45:27,368][26022] Updated weights on worker 0-0, policy_version 982104 (0.00090) [2022-07-11 01:45:29,194][26022] Updated weights on worker 0-0, policy_version 982114 (0.00088) [2022-07-11 01:45:30,536][25689] Fps is (10 sec: 5440.6, 60 sec: 5525.7, 300 sec: 5553.1). Total num frames: 1005691904. Throughput: 0: 5807.3. Samples: 1005693640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:30,541][25689] Avg episode reward: [(0, '-0.168')] [2022-07-11 01:45:30,876][26022] Updated weights on worker 0-0, policy_version 982124 (0.00108) [2022-07-11 01:45:32,921][26022] Updated weights on worker 0-0, policy_version 982134 (0.00090) [2022-07-11 01:45:34,618][26022] Updated weights on worker 0-0, policy_version 982144 (0.00086) [2022-07-11 01:45:35,611][25689] Fps is (10 sec: 5566.5, 60 sec: 5540.2, 300 sec: 5558.7). Total num frames: 1005720576. Throughput: 0: 5761.6. Samples: 1005726966. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:35,612][25689] Avg episode reward: [(0, '0.610')] [2022-07-11 01:45:36,574][26022] Updated weights on worker 0-0, policy_version 982154 (0.00086) [2022-07-11 01:45:38,470][26022] Updated weights on worker 0-0, policy_version 982164 (0.00096) [2022-07-11 01:45:39,930][26022] Updated weights on worker 0-0, policy_version 982174 (0.00100) [2022-07-11 01:45:40,658][25689] Fps is (10 sec: 5664.9, 60 sec: 5574.1, 300 sec: 5559.3). Total num frames: 1005749248. Throughput: 0: 4910.6. Samples: 1005743474. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:40,660][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 01:45:42,096][26022] Updated weights on worker 0-0, policy_version 982184 (0.00092) [2022-07-11 01:45:43,923][26022] Updated weights on worker 0-0, policy_version 982194 (0.00084) [2022-07-11 01:45:45,716][25689] Fps is (10 sec: 5472.3, 60 sec: 5531.5, 300 sec: 5551.5). Total num frames: 1005775872. Throughput: 0: 5770.1. Samples: 1005777100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:45,717][25689] Avg episode reward: [(0, '0.829')] [2022-07-11 01:45:45,722][26022] Updated weights on worker 0-0, policy_version 982204 (0.00096) [2022-07-11 01:45:47,552][26022] Updated weights on worker 0-0, policy_version 982214 (0.00094) [2022-07-11 01:45:49,190][26022] Updated weights on worker 0-0, policy_version 982224 (0.00090) [2022-07-11 01:45:50,733][25689] Fps is (10 sec: 5488.6, 60 sec: 5550.7, 300 sec: 5562.0). Total num frames: 1005804544. Throughput: 0: 5803.7. Samples: 1005810860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:50,734][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 01:45:51,191][26022] Updated weights on worker 0-0, policy_version 982234 (0.00087) [2022-07-11 01:45:53,003][26022] Updated weights on worker 0-0, policy_version 982244 (0.00097) [2022-07-11 01:45:54,669][26022] Updated weights on worker 0-0, policy_version 982254 (0.00086) [2022-07-11 01:45:55,783][25689] Fps is (10 sec: 5696.4, 60 sec: 5533.9, 300 sec: 5554.6). Total num frames: 1005833216. Throughput: 0: 4984.1. Samples: 1005827502. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:45:55,783][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 01:45:56,678][26022] Updated weights on worker 0-0, policy_version 982264 (0.00087) [2022-07-11 01:45:58,613][26022] Updated weights on worker 0-0, policy_version 982274 (0.00083) [2022-07-11 01:46:00,327][26022] Updated weights on worker 0-0, policy_version 982284 (0.00088) [2022-07-11 01:46:00,802][25689] Fps is (10 sec: 5695.0, 60 sec: 5567.1, 300 sec: 5570.1). Total num frames: 1005861888. Throughput: 0: 5833.9. Samples: 1005860992. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:00,803][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 01:46:02,528][26022] Updated weights on worker 0-0, policy_version 982294 (0.00086) [2022-07-11 01:46:04,179][26022] Updated weights on worker 0-0, policy_version 982304 (0.00099) [2022-07-11 01:46:05,915][25689] Fps is (10 sec: 5255.3, 60 sec: 5511.1, 300 sec: 5554.9). Total num frames: 1005886464. Throughput: 0: 5710.7. Samples: 1005892450. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:05,915][25689] Avg episode reward: [(0, '-0.305')] [2022-07-11 01:46:06,407][26022] Updated weights on worker 0-0, policy_version 982314 (0.00098) [2022-07-11 01:46:08,194][26022] Updated weights on worker 0-0, policy_version 982324 (0.00086) [2022-07-11 01:46:09,964][26022] Updated weights on worker 0-0, policy_version 982334 (0.00086) [2022-07-11 01:46:10,956][25689] Fps is (10 sec: 5244.1, 60 sec: 5524.7, 300 sec: 5557.8). Total num frames: 1005915136. Throughput: 0: 4844.0. Samples: 1005908820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:10,957][25689] Avg episode reward: [(0, '-0.339')] [2022-07-11 01:46:11,702][26022] Updated weights on worker 0-0, policy_version 982344 (0.00095) [2022-07-11 01:46:13,751][26022] Updated weights on worker 0-0, policy_version 982354 (0.00089) [2022-07-11 01:46:15,383][26022] Updated weights on worker 0-0, policy_version 982364 (0.00090) [2022-07-11 01:46:15,971][25689] Fps is (10 sec: 5702.7, 60 sec: 5557.9, 300 sec: 5557.7). Total num frames: 1005943808. Throughput: 0: 5700.2. Samples: 1005942576. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:15,971][25689] Avg episode reward: [(0, '0.508')] [2022-07-11 01:46:17,352][26022] Updated weights on worker 0-0, policy_version 982374 (0.00087) [2022-07-11 01:46:18,999][26022] Updated weights on worker 0-0, policy_version 982384 (0.00113) [2022-07-11 01:46:20,990][25689] Fps is (10 sec: 5510.7, 60 sec: 5507.0, 300 sec: 5555.5). Total num frames: 1005970432. Throughput: 0: 5715.6. Samples: 1005976378. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:20,991][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 01:46:21,059][26022] Updated weights on worker 0-0, policy_version 982394 (0.00086) [2022-07-11 01:46:22,763][26022] Updated weights on worker 0-0, policy_version 982404 (0.00079) [2022-07-11 01:46:24,408][26022] Updated weights on worker 0-0, policy_version 982414 (0.00089) [2022-07-11 01:46:26,120][25689] Fps is (10 sec: 5549.1, 60 sec: 5540.0, 300 sec: 5557.3). Total num frames: 1006000128. Throughput: 0: 4986.3. Samples: 1005993198. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:26,122][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 01:46:26,386][26022] Updated weights on worker 0-0, policy_version 982424 (0.01180) [2022-07-11 01:46:28,084][26022] Updated weights on worker 0-0, policy_version 982434 (0.00084) [2022-07-11 01:46:29,932][26022] Updated weights on worker 0-0, policy_version 982444 (0.00088) [2022-07-11 01:46:31,127][25689] Fps is (10 sec: 5758.0, 60 sec: 5560.1, 300 sec: 5557.6). Total num frames: 1006028800. Throughput: 0: 5858.6. Samples: 1006026994. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:31,129][25689] Avg episode reward: [(0, '1.463')] [2022-07-11 01:46:32,095][26022] Updated weights on worker 0-0, policy_version 982454 (0.00085) [2022-07-11 01:46:33,544][26022] Updated weights on worker 0-0, policy_version 982464 (0.00088) [2022-07-11 01:46:35,391][26022] Updated weights on worker 0-0, policy_version 982474 (0.00085) [2022-07-11 01:46:36,140][25689] Fps is (10 sec: 5620.8, 60 sec: 5549.0, 300 sec: 5561.1). Total num frames: 1006056448. Throughput: 0: 5862.9. Samples: 1006060826. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:36,140][25689] Avg episode reward: [(0, '1.318')] [2022-07-11 01:46:37,308][26022] Updated weights on worker 0-0, policy_version 982484 (0.00100) [2022-07-11 01:46:39,194][26022] Updated weights on worker 0-0, policy_version 982494 (0.00085) [2022-07-11 01:46:41,104][26022] Updated weights on worker 0-0, policy_version 982504 (0.00093) [2022-07-11 01:46:41,156][25689] Fps is (10 sec: 5616.1, 60 sec: 5551.9, 300 sec: 5558.0). Total num frames: 1006085120. Throughput: 0: 5007.7. Samples: 1006077360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:41,156][25689] Avg episode reward: [(0, '1.414')] [2022-07-11 01:46:42,701][26022] Updated weights on worker 0-0, policy_version 982514 (0.00089) [2022-07-11 01:46:44,708][26022] Updated weights on worker 0-0, policy_version 982524 (0.00085) [2022-07-11 01:46:46,199][25689] Fps is (10 sec: 5700.8, 60 sec: 5587.1, 300 sec: 5561.3). Total num frames: 1006113792. Throughput: 0: 5868.0. Samples: 1006111020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:46,204][25689] Avg episode reward: [(0, '1.253')] [2022-07-11 01:46:46,779][26022] Updated weights on worker 0-0, policy_version 982534 (0.00085) [2022-07-11 01:46:48,252][26022] Updated weights on worker 0-0, policy_version 982544 (0.00085) [2022-07-11 01:46:50,295][26022] Updated weights on worker 0-0, policy_version 982554 (0.00098) [2022-07-11 01:46:51,269][25689] Fps is (10 sec: 5569.1, 60 sec: 5565.3, 300 sec: 5560.2). Total num frames: 1006141440. Throughput: 0: 5844.0. Samples: 1006144700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:51,269][25689] Avg episode reward: [(0, '1.138')] [2022-07-11 01:46:51,851][26022] Updated weights on worker 0-0, policy_version 982564 (0.00099) [2022-07-11 01:46:53,951][26022] Updated weights on worker 0-0, policy_version 982574 (0.00093) [2022-07-11 01:46:55,731][26022] Updated weights on worker 0-0, policy_version 982584 (0.00098) [2022-07-11 01:46:56,322][25689] Fps is (10 sec: 5462.4, 60 sec: 5548.0, 300 sec: 5556.2). Total num frames: 1006169088. Throughput: 0: 5832.5. Samples: 1006178538. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:46:56,322][25689] Avg episode reward: [(0, '1.727')] [2022-07-11 01:46:57,345][26022] Updated weights on worker 0-0, policy_version 982594 (0.00085) [2022-07-11 01:46:59,348][26022] Updated weights on worker 0-0, policy_version 982604 (0.00090) [2022-07-11 01:47:01,350][25689] Fps is (10 sec: 5484.8, 60 sec: 5530.3, 300 sec: 5560.5). Total num frames: 1006196736. Throughput: 0: 5844.5. Samples: 1006195388. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:01,359][25689] Avg episode reward: [(0, '1.788')] [2022-07-11 01:47:01,383][26022] Updated weights on worker 0-0, policy_version 982614 (0.00097) [2022-07-11 01:47:03,228][26022] Updated weights on worker 0-0, policy_version 982624 (0.00095) [2022-07-11 01:47:05,265][26022] Updated weights on worker 0-0, policy_version 982634 (0.00089) [2022-07-11 01:47:06,491][25689] Fps is (10 sec: 5437.6, 60 sec: 5578.5, 300 sec: 5561.7). Total num frames: 1006224384. Throughput: 0: 5704.3. Samples: 1006226772. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:06,492][25689] Avg episode reward: [(0, '1.785')] [2022-07-11 01:47:06,933][26022] Updated weights on worker 0-0, policy_version 982644 (0.00089) [2022-07-11 01:47:08,922][26022] Updated weights on worker 0-0, policy_version 982654 (0.00085) [2022-07-11 01:47:10,726][26022] Updated weights on worker 0-0, policy_version 982664 (0.00080) [2022-07-11 01:47:11,517][25689] Fps is (10 sec: 5439.0, 60 sec: 5562.9, 300 sec: 5558.0). Total num frames: 1006252032. Throughput: 0: 5703.2. Samples: 1006260180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:11,518][25689] Avg episode reward: [(0, '1.658')] [2022-07-11 01:47:12,272][26022] Updated weights on worker 0-0, policy_version 982674 (0.00090) [2022-07-11 01:47:14,415][26022] Updated weights on worker 0-0, policy_version 982684 (0.00089) [2022-07-11 01:47:15,946][26022] Updated weights on worker 0-0, policy_version 982694 (0.00086) [2022-07-11 01:47:16,543][25689] Fps is (10 sec: 5602.9, 60 sec: 5561.9, 300 sec: 5561.2). Total num frames: 1006280704. Throughput: 0: 4880.7. Samples: 1006277230. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:16,543][25689] Avg episode reward: [(0, '1.193')] [2022-07-11 01:47:18,083][26022] Updated weights on worker 0-0, policy_version 982704 (0.00083) [2022-07-11 01:47:19,565][26022] Updated weights on worker 0-0, policy_version 982714 (0.00081) [2022-07-11 01:47:21,565][25689] Fps is (10 sec: 5604.8, 60 sec: 5578.5, 300 sec: 5555.4). Total num frames: 1006308352. Throughput: 0: 5725.1. Samples: 1006311120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:21,566][25689] Avg episode reward: [(0, '0.711')] [2022-07-11 01:47:21,671][26022] Updated weights on worker 0-0, policy_version 982724 (0.00085) [2022-07-11 01:47:23,587][26022] Updated weights on worker 0-0, policy_version 982734 (0.00086) [2022-07-11 01:47:24,170][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:47:24,182][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000982738_1006323712.pth [2022-07-11 01:47:24,182][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000980782_1004320768.pth [2022-07-11 01:47:25,245][26022] Updated weights on worker 0-0, policy_version 982744 (0.00086) [2022-07-11 01:47:26,626][25689] Fps is (10 sec: 5687.1, 60 sec: 5584.9, 300 sec: 5564.8). Total num frames: 1006338048. Throughput: 0: 5850.3. Samples: 1006344568. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:26,626][25689] Avg episode reward: [(0, '0.228')] [2022-07-11 01:47:26,959][26022] Updated weights on worker 0-0, policy_version 982754 (0.00078) [2022-07-11 01:47:29,041][26022] Updated weights on worker 0-0, policy_version 982764 (0.00095) [2022-07-11 01:47:30,677][26022] Updated weights on worker 0-0, policy_version 982774 (0.00086) [2022-07-11 01:47:31,646][25689] Fps is (10 sec: 5485.6, 60 sec: 5533.0, 300 sec: 5547.4). Total num frames: 1006363648. Throughput: 0: 5031.9. Samples: 1006361466. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:31,646][25689] Avg episode reward: [(0, '0.172')] [2022-07-11 01:47:32,593][26022] Updated weights on worker 0-0, policy_version 982784 (0.00088) [2022-07-11 01:47:34,440][26022] Updated weights on worker 0-0, policy_version 982794 (0.00082) [2022-07-11 01:47:35,982][26022] Updated weights on worker 0-0, policy_version 982804 (0.00085) [2022-07-11 01:47:36,654][25689] Fps is (10 sec: 5616.1, 60 sec: 5584.1, 300 sec: 5564.9). Total num frames: 1006394368. Throughput: 0: 5870.7. Samples: 1006395298. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:36,655][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 01:47:38,285][26022] Updated weights on worker 0-0, policy_version 982814 (0.00095) [2022-07-11 01:47:39,754][26022] Updated weights on worker 0-0, policy_version 982824 (0.00092) [2022-07-11 01:47:41,686][25689] Fps is (10 sec: 5609.4, 60 sec: 5531.9, 300 sec: 5552.9). Total num frames: 1006419968. Throughput: 0: 5851.7. Samples: 1006428860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:41,688][25689] Avg episode reward: [(0, '0.606')] [2022-07-11 01:47:41,786][26022] Updated weights on worker 0-0, policy_version 982834 (0.00086) [2022-07-11 01:47:43,352][26022] Updated weights on worker 0-0, policy_version 982844 (0.00087) [2022-07-11 01:47:45,323][26022] Updated weights on worker 0-0, policy_version 982854 (0.00090) [2022-07-11 01:47:46,731][25689] Fps is (10 sec: 5588.9, 60 sec: 5565.5, 300 sec: 5555.8). Total num frames: 1006450688. Throughput: 0: 5033.8. Samples: 1006445776. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:46,733][25689] Avg episode reward: [(0, '1.348')] [2022-07-11 01:47:47,077][26022] Updated weights on worker 0-0, policy_version 982864 (0.00090) [2022-07-11 01:47:48,929][26022] Updated weights on worker 0-0, policy_version 982874 (0.00086) [2022-07-11 01:47:50,647][26022] Updated weights on worker 0-0, policy_version 982884 (0.00086) [2022-07-11 01:47:51,759][25689] Fps is (10 sec: 5794.7, 60 sec: 5569.4, 300 sec: 5555.4). Total num frames: 1006478336. Throughput: 0: 5889.0. Samples: 1006479912. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:51,760][25689] Avg episode reward: [(0, '1.857')] [2022-07-11 01:47:52,596][26022] Updated weights on worker 0-0, policy_version 982894 (0.00084) [2022-07-11 01:47:54,341][26022] Updated weights on worker 0-0, policy_version 982904 (0.00088) [2022-07-11 01:47:56,173][26022] Updated weights on worker 0-0, policy_version 982914 (0.00080) [2022-07-11 01:47:56,786][25689] Fps is (10 sec: 5703.2, 60 sec: 5605.7, 300 sec: 5565.8). Total num frames: 1006508032. Throughput: 0: 5898.8. Samples: 1006514052. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:47:56,787][25689] Avg episode reward: [(0, '1.914')] [2022-07-11 01:47:58,081][26022] Updated weights on worker 0-0, policy_version 982924 (0.00090) [2022-07-11 01:47:59,781][26022] Updated weights on worker 0-0, policy_version 982934 (0.01076) [2022-07-11 01:48:01,807][25689] Fps is (10 sec: 5503.2, 60 sec: 5572.6, 300 sec: 5560.2). Total num frames: 1006533632. Throughput: 0: 5076.3. Samples: 1006530998. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:01,808][25689] Avg episode reward: [(0, '1.712')] [2022-07-11 01:48:01,902][26022] Updated weights on worker 0-0, policy_version 982944 (0.00088) [2022-07-11 01:48:03,732][26022] Updated weights on worker 0-0, policy_version 982954 (0.00092) [2022-07-11 01:48:05,823][26022] Updated weights on worker 0-0, policy_version 982964 (0.00092) [2022-07-11 01:48:06,912][25689] Fps is (10 sec: 5359.9, 60 sec: 5592.8, 300 sec: 5558.9). Total num frames: 1006562304. Throughput: 0: 5781.3. Samples: 1006562444. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:06,912][25689] Avg episode reward: [(0, '1.356')] [2022-07-11 01:48:07,382][26022] Updated weights on worker 0-0, policy_version 982974 (0.00082) [2022-07-11 01:48:09,399][26022] Updated weights on worker 0-0, policy_version 982984 (0.00082) [2022-07-11 01:48:11,177][26022] Updated weights on worker 0-0, policy_version 982994 (0.00084) [2022-07-11 01:48:11,968][25689] Fps is (10 sec: 5542.3, 60 sec: 5589.9, 300 sec: 5557.9). Total num frames: 1006589952. Throughput: 0: 5741.1. Samples: 1006595938. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:11,969][25689] Avg episode reward: [(0, '0.296')] [2022-07-11 01:48:12,921][26022] Updated weights on worker 0-0, policy_version 983004 (0.00093) [2022-07-11 01:48:14,839][26022] Updated weights on worker 0-0, policy_version 983014 (0.00081) [2022-07-11 01:48:16,534][26022] Updated weights on worker 0-0, policy_version 983024 (0.00094) [2022-07-11 01:48:16,997][25689] Fps is (10 sec: 5381.0, 60 sec: 5555.8, 300 sec: 5554.8). Total num frames: 1006616576. Throughput: 0: 4890.3. Samples: 1006612892. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:16,998][25689] Avg episode reward: [(0, '0.268')] [2022-07-11 01:48:18,512][26022] Updated weights on worker 0-0, policy_version 983034 (0.00095) [2022-07-11 01:48:20,402][26022] Updated weights on worker 0-0, policy_version 983044 (0.00099) [2022-07-11 01:48:22,031][25689] Fps is (10 sec: 5597.1, 60 sec: 5588.7, 300 sec: 5558.4). Total num frames: 1006646272. Throughput: 0: 5720.5. Samples: 1006646690. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:22,032][25689] Avg episode reward: [(0, '0.077')] [2022-07-11 01:48:22,043][26022] Updated weights on worker 0-0, policy_version 983054 (0.00091) [2022-07-11 01:48:23,991][26022] Updated weights on worker 0-0, policy_version 983064 (0.00086) [2022-07-11 01:48:25,776][26022] Updated weights on worker 0-0, policy_version 983074 (0.00098) [2022-07-11 01:48:27,092][25689] Fps is (10 sec: 5680.7, 60 sec: 5554.7, 300 sec: 5560.8). Total num frames: 1006673920. Throughput: 0: 5822.5. Samples: 1006679944. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:27,092][25689] Avg episode reward: [(0, '0.052')] [2022-07-11 01:48:27,656][26022] Updated weights on worker 0-0, policy_version 983084 (0.00087) [2022-07-11 01:48:29,509][26022] Updated weights on worker 0-0, policy_version 983094 (0.00090) [2022-07-11 01:48:31,306][26022] Updated weights on worker 0-0, policy_version 983104 (0.00086) [2022-07-11 01:48:32,118][25689] Fps is (10 sec: 5481.7, 60 sec: 5588.0, 300 sec: 5550.7). Total num frames: 1006701568. Throughput: 0: 5002.6. Samples: 1006696738. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 01:48:32,119][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 01:48:33,163][26022] Updated weights on worker 0-0, policy_version 983114 (0.00087) [2022-07-11 01:48:34,966][26022] Updated weights on worker 0-0, policy_version 983124 (0.00104) [2022-07-11 01:48:36,756][26022] Updated weights on worker 0-0, policy_version 983134 (0.00090) [2022-07-11 01:48:37,124][25689] Fps is (10 sec: 5614.1, 60 sec: 5554.4, 300 sec: 5557.7). Total num frames: 1006730240. Throughput: 0: 5835.3. Samples: 1006730336. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:48:37,125][25689] Avg episode reward: [(0, '-0.045')] [2022-07-11 01:48:38,744][26022] Updated weights on worker 0-0, policy_version 983144 (0.00091) [2022-07-11 01:48:40,569][26022] Updated weights on worker 0-0, policy_version 983154 (0.00091) [2022-07-11 01:48:42,163][25689] Fps is (10 sec: 5708.7, 60 sec: 5604.5, 300 sec: 5559.6). Total num frames: 1006758912. Throughput: 0: 5815.4. Samples: 1006763768. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:48:42,164][25689] Avg episode reward: [(0, '-0.654')] [2022-07-11 01:48:42,423][26022] Updated weights on worker 0-0, policy_version 983164 (0.00095) [2022-07-11 01:48:44,118][26022] Updated weights on worker 0-0, policy_version 983174 (0.00088) [2022-07-11 01:48:46,103][26022] Updated weights on worker 0-0, policy_version 983184 (0.00083) [2022-07-11 01:48:47,291][25689] Fps is (10 sec: 5539.4, 60 sec: 5546.1, 300 sec: 5551.0). Total num frames: 1006786560. Throughput: 0: 4992.4. Samples: 1006780786. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:48:47,291][25689] Avg episode reward: [(0, '-0.800')] [2022-07-11 01:48:47,757][26022] Updated weights on worker 0-0, policy_version 983194 (0.00084) [2022-07-11 01:48:49,696][26022] Updated weights on worker 0-0, policy_version 983204 (0.00089) [2022-07-11 01:48:51,500][26022] Updated weights on worker 0-0, policy_version 983214 (0.00087) [2022-07-11 01:48:52,301][25689] Fps is (10 sec: 5555.5, 60 sec: 5564.6, 300 sec: 5561.5). Total num frames: 1006815232. Throughput: 0: 5838.1. Samples: 1006814566. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:48:52,302][25689] Avg episode reward: [(0, '-0.913')] [2022-07-11 01:48:53,393][26022] Updated weights on worker 0-0, policy_version 983224 (0.00092) [2022-07-11 01:48:55,176][26022] Updated weights on worker 0-0, policy_version 983234 (0.00048) [2022-07-11 01:48:56,974][26022] Updated weights on worker 0-0, policy_version 983244 (0.00092) [2022-07-11 01:48:57,331][25689] Fps is (10 sec: 5711.7, 60 sec: 5547.5, 300 sec: 5557.8). Total num frames: 1006843904. Throughput: 0: 5835.3. Samples: 1006848250. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:48:57,332][25689] Avg episode reward: [(0, '-0.748')] [2022-07-11 01:48:58,778][26022] Updated weights on worker 0-0, policy_version 983254 (0.00085) [2022-07-11 01:49:00,428][26022] Updated weights on worker 0-0, policy_version 983264 (0.00095) [2022-07-11 01:49:02,379][25689] Fps is (10 sec: 5385.4, 60 sec: 5545.0, 300 sec: 5555.9). Total num frames: 1006869504. Throughput: 0: 5029.2. Samples: 1006865436. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:02,381][25689] Avg episode reward: [(0, '-0.915')] [2022-07-11 01:49:02,764][26022] Updated weights on worker 0-0, policy_version 983274 (0.00088) [2022-07-11 01:49:04,618][26022] Updated weights on worker 0-0, policy_version 983284 (0.00085) [2022-07-11 01:49:06,431][26022] Updated weights on worker 0-0, policy_version 983294 (0.00091) [2022-07-11 01:49:07,471][25689] Fps is (10 sec: 5352.3, 60 sec: 5546.2, 300 sec: 5559.1). Total num frames: 1006898176. Throughput: 0: 5770.7. Samples: 1006897236. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:07,472][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 01:49:08,255][26022] Updated weights on worker 0-0, policy_version 983304 (0.00084) [2022-07-11 01:49:10,275][26022] Updated weights on worker 0-0, policy_version 983314 (0.00086) [2022-07-11 01:49:12,014][26022] Updated weights on worker 0-0, policy_version 983324 (0.00085) [2022-07-11 01:49:12,522][25689] Fps is (10 sec: 5653.1, 60 sec: 5563.6, 300 sec: 5563.4). Total num frames: 1006926848. Throughput: 0: 5745.5. Samples: 1006930748. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:12,524][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 01:49:13,754][26022] Updated weights on worker 0-0, policy_version 983334 (0.00064) [2022-07-11 01:49:15,596][26022] Updated weights on worker 0-0, policy_version 983344 (0.00052) [2022-07-11 01:49:17,127][26022] Updated weights on worker 0-0, policy_version 983354 (0.00084) [2022-07-11 01:49:17,529][25689] Fps is (10 sec: 5599.5, 60 sec: 5582.6, 300 sec: 5561.7). Total num frames: 1006954496. Throughput: 0: 5763.5. Samples: 1006964662. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:17,530][25689] Avg episode reward: [(0, '1.694')] [2022-07-11 01:49:19,142][26022] Updated weights on worker 0-0, policy_version 983364 (0.00092) [2022-07-11 01:49:21,102][26022] Updated weights on worker 0-0, policy_version 983374 (0.00093) [2022-07-11 01:49:22,597][25689] Fps is (10 sec: 5488.6, 60 sec: 5545.5, 300 sec: 5558.7). Total num frames: 1006982144. Throughput: 0: 5740.6. Samples: 1006981502. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:22,597][25689] Avg episode reward: [(0, '1.345')] [2022-07-11 01:49:22,865][26022] Updated weights on worker 0-0, policy_version 983384 (0.00096) [2022-07-11 01:49:24,237][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:49:24,262][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000983392_1006993408.pth [2022-07-11 01:49:24,263][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000981435_1004989440.pth [2022-07-11 01:49:24,770][26022] Updated weights on worker 0-0, policy_version 983394 (0.00096) [2022-07-11 01:49:26,557][26022] Updated weights on worker 0-0, policy_version 983404 (0.00089) [2022-07-11 01:49:27,689][25689] Fps is (10 sec: 5643.9, 60 sec: 5576.5, 300 sec: 5557.6). Total num frames: 1007011840. Throughput: 0: 5822.7. Samples: 1007014962. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:27,689][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 01:49:28,426][26022] Updated weights on worker 0-0, policy_version 983414 (0.00087) [2022-07-11 01:49:30,245][26022] Updated weights on worker 0-0, policy_version 983424 (0.00093) [2022-07-11 01:49:32,178][26022] Updated weights on worker 0-0, policy_version 983434 (0.00091) [2022-07-11 01:49:32,752][25689] Fps is (10 sec: 5545.8, 60 sec: 5556.2, 300 sec: 5553.9). Total num frames: 1007038464. Throughput: 0: 5812.2. Samples: 1007048330. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:32,753][25689] Avg episode reward: [(0, '-0.675')] [2022-07-11 01:49:33,848][26022] Updated weights on worker 0-0, policy_version 983444 (0.00082) [2022-07-11 01:49:35,678][26022] Updated weights on worker 0-0, policy_version 983454 (0.00086) [2022-07-11 01:49:37,439][26022] Updated weights on worker 0-0, policy_version 983464 (0.00095) [2022-07-11 01:49:37,758][25689] Fps is (10 sec: 5593.4, 60 sec: 5573.1, 300 sec: 5565.0). Total num frames: 1007068160. Throughput: 0: 4971.9. Samples: 1007065240. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:37,758][25689] Avg episode reward: [(0, '-1.060')] [2022-07-11 01:49:39,518][26022] Updated weights on worker 0-0, policy_version 983474 (0.00084) [2022-07-11 01:49:41,120][26022] Updated weights on worker 0-0, policy_version 983484 (0.00080) [2022-07-11 01:49:42,836][25689] Fps is (10 sec: 5585.0, 60 sec: 5535.7, 300 sec: 5556.0). Total num frames: 1007094784. Throughput: 0: 5794.2. Samples: 1007098774. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:42,837][25689] Avg episode reward: [(0, '-2.157')] [2022-07-11 01:49:43,066][26022] Updated weights on worker 0-0, policy_version 983494 (0.00094) [2022-07-11 01:49:44,807][26022] Updated weights on worker 0-0, policy_version 983504 (0.00085) [2022-07-11 01:49:46,808][26022] Updated weights on worker 0-0, policy_version 983514 (0.00093) [2022-07-11 01:49:47,943][25689] Fps is (10 sec: 5529.8, 60 sec: 5571.5, 300 sec: 5561.6). Total num frames: 1007124480. Throughput: 0: 5814.1. Samples: 1007132720. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:47,943][25689] Avg episode reward: [(0, '-2.522')] [2022-07-11 01:49:48,480][26022] Updated weights on worker 0-0, policy_version 983524 (0.00092) [2022-07-11 01:49:50,314][26022] Updated weights on worker 0-0, policy_version 983534 (0.00088) [2022-07-11 01:49:52,061][26022] Updated weights on worker 0-0, policy_version 983544 (0.00084) [2022-07-11 01:49:52,954][25689] Fps is (10 sec: 5768.9, 60 sec: 5571.3, 300 sec: 5558.9). Total num frames: 1007153152. Throughput: 0: 5020.9. Samples: 1007149762. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:52,955][25689] Avg episode reward: [(0, '-1.800')] [2022-07-11 01:49:54,203][26022] Updated weights on worker 0-0, policy_version 983554 (0.00093) [2022-07-11 01:49:55,554][26022] Updated weights on worker 0-0, policy_version 983564 (0.00099) [2022-07-11 01:49:57,657][26022] Updated weights on worker 0-0, policy_version 983574 (0.00086) [2022-07-11 01:49:57,977][25689] Fps is (10 sec: 5510.5, 60 sec: 5538.2, 300 sec: 5558.7). Total num frames: 1007179776. Throughput: 0: 5857.9. Samples: 1007183684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:49:57,978][25689] Avg episode reward: [(0, '-1.192')] [2022-07-11 01:49:59,451][26022] Updated weights on worker 0-0, policy_version 983584 (0.00085) [2022-07-11 01:50:01,297][26022] Updated weights on worker 0-0, policy_version 983594 (0.00094) [2022-07-11 01:50:03,003][25689] Fps is (10 sec: 5400.9, 60 sec: 5574.0, 300 sec: 5559.3). Total num frames: 1007207424. Throughput: 0: 5781.0. Samples: 1007215356. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:03,003][25689] Avg episode reward: [(0, '0.063')] [2022-07-11 01:50:03,534][26022] Updated weights on worker 0-0, policy_version 983604 (0.00096) [2022-07-11 01:50:05,232][26022] Updated weights on worker 0-0, policy_version 983614 (0.00088) [2022-07-11 01:50:06,982][26022] Updated weights on worker 0-0, policy_version 983624 (0.00093) [2022-07-11 01:50:08,095][25689] Fps is (10 sec: 5667.8, 60 sec: 5590.9, 300 sec: 5564.5). Total num frames: 1007237120. Throughput: 0: 4944.7. Samples: 1007232368. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:08,095][25689] Avg episode reward: [(0, '0.440')] [2022-07-11 01:50:08,918][26022] Updated weights on worker 0-0, policy_version 983634 (0.00088) [2022-07-11 01:50:10,594][26022] Updated weights on worker 0-0, policy_version 983644 (0.00085) [2022-07-11 01:50:12,535][26022] Updated weights on worker 0-0, policy_version 983654 (0.00085) [2022-07-11 01:50:13,173][25689] Fps is (10 sec: 5537.5, 60 sec: 5554.6, 300 sec: 5563.2). Total num frames: 1007263744. Throughput: 0: 5745.4. Samples: 1007265930. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:13,174][25689] Avg episode reward: [(0, '-0.275')] [2022-07-11 01:50:14,233][26022] Updated weights on worker 0-0, policy_version 983664 (0.00089) [2022-07-11 01:50:16,244][26022] Updated weights on worker 0-0, policy_version 983674 (0.00080) [2022-07-11 01:50:17,950][26022] Updated weights on worker 0-0, policy_version 983684 (0.00090) [2022-07-11 01:50:18,188][25689] Fps is (10 sec: 5580.2, 60 sec: 5587.7, 300 sec: 5563.3). Total num frames: 1007293440. Throughput: 0: 5729.3. Samples: 1007299476. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:18,189][25689] Avg episode reward: [(0, '-0.019')] [2022-07-11 01:50:19,995][26022] Updated weights on worker 0-0, policy_version 983694 (0.00052) [2022-07-11 01:50:21,539][26022] Updated weights on worker 0-0, policy_version 983704 (0.00087) [2022-07-11 01:50:23,190][25689] Fps is (10 sec: 5622.9, 60 sec: 5576.9, 300 sec: 5562.1). Total num frames: 1007320064. Throughput: 0: 5008.3. Samples: 1007316458. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:23,190][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 01:50:23,630][26022] Updated weights on worker 0-0, policy_version 983714 (0.00085) [2022-07-11 01:50:25,264][26022] Updated weights on worker 0-0, policy_version 983724 (0.00090) [2022-07-11 01:50:27,201][26022] Updated weights on worker 0-0, policy_version 983734 (0.00088) [2022-07-11 01:50:28,242][25689] Fps is (10 sec: 5499.7, 60 sec: 5563.6, 300 sec: 5565.3). Total num frames: 1007348736. Throughput: 0: 5834.9. Samples: 1007349926. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:28,243][25689] Avg episode reward: [(0, '0.168')] [2022-07-11 01:50:29,041][26022] Updated weights on worker 0-0, policy_version 983744 (0.00081) [2022-07-11 01:50:30,931][26022] Updated weights on worker 0-0, policy_version 983754 (0.00083) [2022-07-11 01:50:32,664][26022] Updated weights on worker 0-0, policy_version 983764 (0.00092) [2022-07-11 01:50:33,254][25689] Fps is (10 sec: 5697.7, 60 sec: 5602.2, 300 sec: 5566.5). Total num frames: 1007377408. Throughput: 0: 5869.1. Samples: 1007383784. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:33,255][25689] Avg episode reward: [(0, '0.234')] [2022-07-11 01:50:34,567][26022] Updated weights on worker 0-0, policy_version 983774 (0.00842) [2022-07-11 01:50:36,323][26022] Updated weights on worker 0-0, policy_version 983784 (0.00081) [2022-07-11 01:50:38,150][26022] Updated weights on worker 0-0, policy_version 983794 (0.00089) [2022-07-11 01:50:38,269][25689] Fps is (10 sec: 5719.2, 60 sec: 5584.4, 300 sec: 5567.1). Total num frames: 1007406080. Throughput: 0: 5049.1. Samples: 1007400866. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:38,275][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 01:50:40,024][26022] Updated weights on worker 0-0, policy_version 983804 (0.00092) [2022-07-11 01:50:41,639][26022] Updated weights on worker 0-0, policy_version 983814 (0.00091) [2022-07-11 01:50:43,295][25689] Fps is (10 sec: 5507.2, 60 sec: 5589.3, 300 sec: 5567.7). Total num frames: 1007432704. Throughput: 0: 5866.5. Samples: 1007434404. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:43,295][25689] Avg episode reward: [(0, '1.643')] [2022-07-11 01:50:43,695][26022] Updated weights on worker 0-0, policy_version 983824 (0.00097) [2022-07-11 01:50:45,415][26022] Updated weights on worker 0-0, policy_version 983834 (0.00070) [2022-07-11 01:50:47,303][26022] Updated weights on worker 0-0, policy_version 983844 (0.00098) [2022-07-11 01:50:48,355][25689] Fps is (10 sec: 5482.4, 60 sec: 5576.6, 300 sec: 5566.9). Total num frames: 1007461376. Throughput: 0: 5868.2. Samples: 1007467952. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:48,356][25689] Avg episode reward: [(0, '1.548')] [2022-07-11 01:50:49,099][26022] Updated weights on worker 0-0, policy_version 983854 (0.00095) [2022-07-11 01:50:50,905][26022] Updated weights on worker 0-0, policy_version 983864 (0.00085) [2022-07-11 01:50:52,613][26022] Updated weights on worker 0-0, policy_version 983874 (0.00082) [2022-07-11 01:50:53,358][25689] Fps is (10 sec: 5800.3, 60 sec: 5594.4, 300 sec: 5571.2). Total num frames: 1007491072. Throughput: 0: 5031.0. Samples: 1007484926. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:53,358][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 01:50:54,747][26022] Updated weights on worker 0-0, policy_version 983884 (0.00089) [2022-07-11 01:50:56,226][26022] Updated weights on worker 0-0, policy_version 983894 (0.00085) [2022-07-11 01:50:58,275][26022] Updated weights on worker 0-0, policy_version 983904 (0.00085) [2022-07-11 01:50:58,396][25689] Fps is (10 sec: 5609.0, 60 sec: 5593.0, 300 sec: 5564.0). Total num frames: 1007517696. Throughput: 0: 5845.4. Samples: 1007518516. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:50:58,396][25689] Avg episode reward: [(0, '1.666')] [2022-07-11 01:50:59,869][26022] Updated weights on worker 0-0, policy_version 983914 (0.00086) [2022-07-11 01:51:02,372][26022] Updated weights on worker 0-0, policy_version 983924 (0.00093) [2022-07-11 01:51:03,412][25689] Fps is (10 sec: 5194.2, 60 sec: 5560.0, 300 sec: 5569.2). Total num frames: 1007543296. Throughput: 0: 5727.6. Samples: 1007549628. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:03,412][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 01:51:04,222][26022] Updated weights on worker 0-0, policy_version 983934 (0.00082) [2022-07-11 01:51:05,996][26022] Updated weights on worker 0-0, policy_version 983944 (0.00109) [2022-07-11 01:51:07,936][26022] Updated weights on worker 0-0, policy_version 983954 (0.00106) [2022-07-11 01:51:08,519][25689] Fps is (10 sec: 5259.8, 60 sec: 5524.7, 300 sec: 5564.6). Total num frames: 1007570944. Throughput: 0: 4877.3. Samples: 1007566300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:08,520][25689] Avg episode reward: [(0, '1.905')] [2022-07-11 01:51:09,766][26022] Updated weights on worker 0-0, policy_version 983964 (0.00089) [2022-07-11 01:51:11,723][26022] Updated weights on worker 0-0, policy_version 983974 (0.00085) [2022-07-11 01:51:13,520][25689] Fps is (10 sec: 5571.6, 60 sec: 5565.7, 300 sec: 5564.8). Total num frames: 1007599616. Throughput: 0: 5672.0. Samples: 1007599290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:13,521][25689] Avg episode reward: [(0, '1.134')] [2022-07-11 01:51:13,532][26022] Updated weights on worker 0-0, policy_version 983984 (0.00089) [2022-07-11 01:51:15,364][26022] Updated weights on worker 0-0, policy_version 983994 (0.00087) [2022-07-11 01:51:17,320][26022] Updated weights on worker 0-0, policy_version 984004 (0.00092) [2022-07-11 01:51:18,560][25689] Fps is (10 sec: 5609.0, 60 sec: 5529.4, 300 sec: 5567.9). Total num frames: 1007627264. Throughput: 0: 5656.1. Samples: 1007632568. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:18,561][25689] Avg episode reward: [(0, '0.400')] [2022-07-11 01:51:18,983][26022] Updated weights on worker 0-0, policy_version 984014 (0.00096) [2022-07-11 01:51:21,026][26022] Updated weights on worker 0-0, policy_version 984024 (0.00053) [2022-07-11 01:51:22,667][26022] Updated weights on worker 0-0, policy_version 984034 (0.00090) [2022-07-11 01:51:23,611][25689] Fps is (10 sec: 5581.6, 60 sec: 5558.8, 300 sec: 5565.9). Total num frames: 1007655936. Throughput: 0: 4941.8. Samples: 1007649446. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:23,611][25689] Avg episode reward: [(0, '-0.739')] [2022-07-11 01:51:24,271][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:51:24,282][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000984042_1007659008.pth [2022-07-11 01:51:24,282][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000982087_1005657088.pth [2022-07-11 01:51:24,666][26022] Updated weights on worker 0-0, policy_version 984044 (0.00080) [2022-07-11 01:51:26,405][26022] Updated weights on worker 0-0, policy_version 984054 (0.00086) [2022-07-11 01:51:28,452][26022] Updated weights on worker 0-0, policy_version 984064 (0.00092) [2022-07-11 01:51:28,665][25689] Fps is (10 sec: 5472.2, 60 sec: 5524.8, 300 sec: 5558.1). Total num frames: 1007682560. Throughput: 0: 5774.1. Samples: 1007682624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:28,666][25689] Avg episode reward: [(0, '-0.887')] [2022-07-11 01:51:29,995][26022] Updated weights on worker 0-0, policy_version 984074 (0.00094) [2022-07-11 01:51:32,030][26022] Updated weights on worker 0-0, policy_version 984084 (0.00092) [2022-07-11 01:51:33,716][25689] Fps is (10 sec: 5472.2, 60 sec: 5521.3, 300 sec: 5560.9). Total num frames: 1007711232. Throughput: 0: 5785.2. Samples: 1007716124. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:33,716][25689] Avg episode reward: [(0, '-1.113')] [2022-07-11 01:51:33,824][26022] Updated weights on worker 0-0, policy_version 984094 (0.00093) [2022-07-11 01:51:35,555][26022] Updated weights on worker 0-0, policy_version 984104 (0.00087) [2022-07-11 01:51:37,516][26022] Updated weights on worker 0-0, policy_version 984114 (0.00078) [2022-07-11 01:51:38,742][25689] Fps is (10 sec: 5690.5, 60 sec: 5520.2, 300 sec: 5560.7). Total num frames: 1007739904. Throughput: 0: 4976.4. Samples: 1007733002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:38,743][25689] Avg episode reward: [(0, '-0.548')] [2022-07-11 01:51:39,081][26022] Updated weights on worker 0-0, policy_version 984124 (0.00091) [2022-07-11 01:51:40,954][26022] Updated weights on worker 0-0, policy_version 984134 (0.00080) [2022-07-11 01:51:42,914][26022] Updated weights on worker 0-0, policy_version 984144 (0.00092) [2022-07-11 01:51:43,771][25689] Fps is (10 sec: 5600.9, 60 sec: 5536.8, 300 sec: 5557.5). Total num frames: 1007767552. Throughput: 0: 5820.2. Samples: 1007766784. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:43,773][25689] Avg episode reward: [(0, '-0.420')] [2022-07-11 01:51:44,581][26022] Updated weights on worker 0-0, policy_version 984154 (0.00079) [2022-07-11 01:51:46,609][26022] Updated weights on worker 0-0, policy_version 984164 (0.00089) [2022-07-11 01:51:48,270][26022] Updated weights on worker 0-0, policy_version 984174 (0.00095) [2022-07-11 01:51:48,818][25689] Fps is (10 sec: 5589.9, 60 sec: 5538.1, 300 sec: 5561.4). Total num frames: 1007796224. Throughput: 0: 5844.5. Samples: 1007800406. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:48,819][25689] Avg episode reward: [(0, '-0.529')] [2022-07-11 01:51:50,326][26022] Updated weights on worker 0-0, policy_version 984184 (0.00087) [2022-07-11 01:51:51,982][26022] Updated weights on worker 0-0, policy_version 984194 (0.00083) [2022-07-11 01:51:53,764][26022] Updated weights on worker 0-0, policy_version 984204 (0.00084) [2022-07-11 01:51:53,829][25689] Fps is (10 sec: 5701.6, 60 sec: 5520.4, 300 sec: 5565.6). Total num frames: 1007824896. Throughput: 0: 5035.8. Samples: 1007817410. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:53,829][25689] Avg episode reward: [(0, '0.021')] [2022-07-11 01:51:55,605][26022] Updated weights on worker 0-0, policy_version 984214 (0.00093) [2022-07-11 01:51:57,380][26022] Updated weights on worker 0-0, policy_version 984224 (0.00082) [2022-07-11 01:51:58,920][25689] Fps is (10 sec: 5777.5, 60 sec: 5566.3, 300 sec: 5571.3). Total num frames: 1007854592. Throughput: 0: 5878.9. Samples: 1007851626. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:51:58,921][25689] Avg episode reward: [(0, '-0.522')] [2022-07-11 01:51:59,037][26022] Updated weights on worker 0-0, policy_version 984234 (0.00352) [2022-07-11 01:52:01,007][26022] Updated weights on worker 0-0, policy_version 984244 (0.00094) [2022-07-11 01:52:03,357][26022] Updated weights on worker 0-0, policy_version 984254 (0.00089) [2022-07-11 01:52:03,948][25689] Fps is (10 sec: 5464.4, 60 sec: 5565.2, 300 sec: 5566.5). Total num frames: 1007880192. Throughput: 0: 5784.0. Samples: 1007883486. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:52:03,948][25689] Avg episode reward: [(0, '-0.707')] [2022-07-11 01:52:05,075][26022] Updated weights on worker 0-0, policy_version 984264 (0.00087) [2022-07-11 01:52:06,934][26022] Updated weights on worker 0-0, policy_version 984274 (0.00081) [2022-07-11 01:52:08,764][26022] Updated weights on worker 0-0, policy_version 984284 (0.00134) [2022-07-11 01:52:09,029][25689] Fps is (10 sec: 5267.5, 60 sec: 5567.6, 300 sec: 5565.5). Total num frames: 1007907840. Throughput: 0: 5765.6. Samples: 1007916938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:52:09,030][25689] Avg episode reward: [(0, '0.078')] [2022-07-11 01:52:10,703][26022] Updated weights on worker 0-0, policy_version 984294 (0.00083) [2022-07-11 01:52:12,383][26022] Updated weights on worker 0-0, policy_version 984304 (0.00080) [2022-07-11 01:52:14,095][25689] Fps is (10 sec: 5449.6, 60 sec: 5544.8, 300 sec: 5561.3). Total num frames: 1007935488. Throughput: 0: 5743.1. Samples: 1007933800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:52:14,095][25689] Avg episode reward: [(0, '-0.161')] [2022-07-11 01:52:14,301][26022] Updated weights on worker 0-0, policy_version 984314 (0.00087) [2022-07-11 01:52:15,869][26022] Updated weights on worker 0-0, policy_version 984324 (0.00087) [2022-07-11 01:52:18,062][26022] Updated weights on worker 0-0, policy_version 984334 (0.00092) [2022-07-11 01:52:19,168][25689] Fps is (10 sec: 5656.0, 60 sec: 5575.5, 300 sec: 5567.3). Total num frames: 1007965184. Throughput: 0: 5720.4. Samples: 1007967450. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:52:19,168][25689] Avg episode reward: [(0, '0.133')] [2022-07-11 01:52:19,481][26022] Updated weights on worker 0-0, policy_version 984344 (0.00097) [2022-07-11 01:52:21,740][26022] Updated weights on worker 0-0, policy_version 984354 (0.00088) [2022-07-11 01:52:23,268][26022] Updated weights on worker 0-0, policy_version 984364 (0.00086) [2022-07-11 01:52:24,211][25689] Fps is (10 sec: 5567.2, 60 sec: 5542.4, 300 sec: 5557.3). Total num frames: 1007991808. Throughput: 0: 5801.7. Samples: 1008001048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:52:24,212][25689] Avg episode reward: [(0, '0.610')] [2022-07-11 01:52:25,399][26022] Updated weights on worker 0-0, policy_version 984374 (0.00085) [2022-07-11 01:52:26,963][26022] Updated weights on worker 0-0, policy_version 984384 (0.00088) [2022-07-11 01:52:28,904][26022] Updated weights on worker 0-0, policy_version 984394 (0.00091) [2022-07-11 01:52:29,309][25689] Fps is (10 sec: 5553.6, 60 sec: 5589.1, 300 sec: 5569.6). Total num frames: 1008021504. Throughput: 0: 4963.9. Samples: 1008017606. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 01:52:29,310][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 01:52:30,730][26022] Updated weights on worker 0-0, policy_version 984404 (0.00083) [2022-07-11 01:52:32,475][26022] Updated weights on worker 0-0, policy_version 984414 (0.00093) [2022-07-11 01:52:34,349][26022] Updated weights on worker 0-0, policy_version 984424 (0.00370) [2022-07-11 01:52:34,363][25689] Fps is (10 sec: 5749.3, 60 sec: 5588.7, 300 sec: 5561.8). Total num frames: 1008050176. Throughput: 0: 5798.7. Samples: 1008051332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:52:34,365][25689] Avg episode reward: [(0, '0.448')] [2022-07-11 01:52:36,341][26022] Updated weights on worker 0-0, policy_version 984434 (0.00098) [2022-07-11 01:52:37,877][26022] Updated weights on worker 0-0, policy_version 984444 (0.00088) [2022-07-11 01:52:39,399][25689] Fps is (10 sec: 5582.0, 60 sec: 5571.1, 300 sec: 5568.6). Total num frames: 1008077824. Throughput: 0: 5830.5. Samples: 1008085406. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:52:39,399][25689] Avg episode reward: [(0, '0.191')] [2022-07-11 01:52:39,875][26022] Updated weights on worker 0-0, policy_version 984454 (0.00089) [2022-07-11 01:52:41,601][26022] Updated weights on worker 0-0, policy_version 984464 (0.00093) [2022-07-11 01:52:43,459][26022] Updated weights on worker 0-0, policy_version 984474 (0.00093) [2022-07-11 01:52:44,413][25689] Fps is (10 sec: 5604.0, 60 sec: 5589.2, 300 sec: 5562.3). Total num frames: 1008106496. Throughput: 0: 5004.1. Samples: 1008102142. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:52:44,415][25689] Avg episode reward: [(0, '-0.067')] [2022-07-11 01:52:45,278][26022] Updated weights on worker 0-0, policy_version 984484 (0.00084) [2022-07-11 01:52:47,256][26022] Updated weights on worker 0-0, policy_version 984494 (0.00087) [2022-07-11 01:52:49,154][26022] Updated weights on worker 0-0, policy_version 984504 (0.00087) [2022-07-11 01:52:49,485][25689] Fps is (10 sec: 5685.1, 60 sec: 5586.9, 300 sec: 5564.9). Total num frames: 1008135168. Throughput: 0: 5844.5. Samples: 1008135526. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:52:49,486][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 01:52:50,947][26022] Updated weights on worker 0-0, policy_version 984514 (0.00096) [2022-07-11 01:52:52,627][26022] Updated weights on worker 0-0, policy_version 984524 (0.00093) [2022-07-11 01:52:54,563][25689] Fps is (10 sec: 5448.3, 60 sec: 5547.1, 300 sec: 5553.7). Total num frames: 1008161792. Throughput: 0: 5835.3. Samples: 1008169200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:52:54,564][25689] Avg episode reward: [(0, '-0.655')] [2022-07-11 01:52:54,593][26022] Updated weights on worker 0-0, policy_version 984534 (0.00079) [2022-07-11 01:52:56,234][26022] Updated weights on worker 0-0, policy_version 984544 (0.00070) [2022-07-11 01:52:58,239][26022] Updated weights on worker 0-0, policy_version 984554 (0.00088) [2022-07-11 01:52:59,640][25689] Fps is (10 sec: 5546.2, 60 sec: 5548.4, 300 sec: 5566.4). Total num frames: 1008191488. Throughput: 0: 4978.2. Samples: 1008186172. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:52:59,641][25689] Avg episode reward: [(0, '-0.573')] [2022-07-11 01:52:59,874][26022] Updated weights on worker 0-0, policy_version 984564 (0.00084) [2022-07-11 01:53:02,234][26022] Updated weights on worker 0-0, policy_version 984574 (0.00087) [2022-07-11 01:53:03,835][26022] Updated weights on worker 0-0, policy_version 984584 (0.00088) [2022-07-11 01:53:04,651][25689] Fps is (10 sec: 5480.9, 60 sec: 5549.9, 300 sec: 5557.8). Total num frames: 1008217088. Throughput: 0: 5727.2. Samples: 1008218048. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:04,652][25689] Avg episode reward: [(0, '-0.429')] [2022-07-11 01:53:05,757][26022] Updated weights on worker 0-0, policy_version 984594 (0.00096) [2022-07-11 01:53:07,570][26022] Updated weights on worker 0-0, policy_version 984604 (0.00091) [2022-07-11 01:53:09,322][26022] Updated weights on worker 0-0, policy_version 984614 (0.00090) [2022-07-11 01:53:09,735][25689] Fps is (10 sec: 5376.3, 60 sec: 5566.5, 300 sec: 5560.8). Total num frames: 1008245760. Throughput: 0: 5753.4. Samples: 1008252030. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:09,735][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 01:53:11,119][26022] Updated weights on worker 0-0, policy_version 984624 (0.00092) [2022-07-11 01:53:13,123][26022] Updated weights on worker 0-0, policy_version 984634 (0.00092) [2022-07-11 01:53:14,671][26022] Updated weights on worker 0-0, policy_version 984644 (0.00086) [2022-07-11 01:53:14,743][25689] Fps is (10 sec: 5783.9, 60 sec: 5605.6, 300 sec: 5571.5). Total num frames: 1008275456. Throughput: 0: 4936.6. Samples: 1008268822. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:14,743][25689] Avg episode reward: [(0, '-0.432')] [2022-07-11 01:53:16,832][26022] Updated weights on worker 0-0, policy_version 984654 (0.00089) [2022-07-11 01:53:18,456][26022] Updated weights on worker 0-0, policy_version 984664 (0.00092) [2022-07-11 01:53:19,838][25689] Fps is (10 sec: 5574.4, 60 sec: 5552.9, 300 sec: 5560.0). Total num frames: 1008302080. Throughput: 0: 5727.7. Samples: 1008301860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:19,839][25689] Avg episode reward: [(0, '-0.720')] [2022-07-11 01:53:20,572][26022] Updated weights on worker 0-0, policy_version 984674 (0.00092) [2022-07-11 01:53:22,115][26022] Updated weights on worker 0-0, policy_version 984684 (0.00083) [2022-07-11 01:53:24,178][26022] Updated weights on worker 0-0, policy_version 984694 (0.00088) [2022-07-11 01:53:24,435][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:53:24,450][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000984696_1008328704.pth [2022-07-11 01:53:24,450][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000982738_1006323712.pth [2022-07-11 01:53:24,926][25689] Fps is (10 sec: 5430.3, 60 sec: 5582.6, 300 sec: 5562.9). Total num frames: 1008330752. Throughput: 0: 5819.1. Samples: 1008336026. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:24,926][25689] Avg episode reward: [(0, '-0.013')] [2022-07-11 01:53:25,735][26022] Updated weights on worker 0-0, policy_version 984704 (0.00097) [2022-07-11 01:53:27,895][26022] Updated weights on worker 0-0, policy_version 984714 (0.00086) [2022-07-11 01:53:29,425][26022] Updated weights on worker 0-0, policy_version 984724 (0.00082) [2022-07-11 01:53:29,991][25689] Fps is (10 sec: 5648.3, 60 sec: 5568.7, 300 sec: 5565.7). Total num frames: 1008359424. Throughput: 0: 4964.6. Samples: 1008352592. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:29,991][25689] Avg episode reward: [(0, '-0.002')] [2022-07-11 01:53:31,382][26022] Updated weights on worker 0-0, policy_version 984734 (0.00083) [2022-07-11 01:53:33,041][26022] Updated weights on worker 0-0, policy_version 984744 (0.00085) [2022-07-11 01:53:34,998][25689] Fps is (10 sec: 5591.3, 60 sec: 5556.1, 300 sec: 5562.2). Total num frames: 1008387072. Throughput: 0: 5793.9. Samples: 1008386180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:34,999][25689] Avg episode reward: [(0, '0.168')] [2022-07-11 01:53:35,014][26022] Updated weights on worker 0-0, policy_version 984754 (0.00085) [2022-07-11 01:53:36,603][26022] Updated weights on worker 0-0, policy_version 984764 (0.00084) [2022-07-11 01:53:38,688][26022] Updated weights on worker 0-0, policy_version 984774 (0.00082) [2022-07-11 01:53:40,016][25689] Fps is (10 sec: 5719.8, 60 sec: 5591.5, 300 sec: 5566.0). Total num frames: 1008416768. Throughput: 0: 5872.8. Samples: 1008420362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:40,017][25689] Avg episode reward: [(0, '0.006')] [2022-07-11 01:53:40,307][26022] Updated weights on worker 0-0, policy_version 984784 (0.00082) [2022-07-11 01:53:42,453][26022] Updated weights on worker 0-0, policy_version 984794 (0.00083) [2022-07-11 01:53:43,964][26022] Updated weights on worker 0-0, policy_version 984804 (0.00087) [2022-07-11 01:53:45,024][25689] Fps is (10 sec: 5617.6, 60 sec: 5558.4, 300 sec: 5564.8). Total num frames: 1008443392. Throughput: 0: 5029.7. Samples: 1008437114. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:45,024][25689] Avg episode reward: [(0, '0.160')] [2022-07-11 01:53:46,068][26022] Updated weights on worker 0-0, policy_version 984814 (0.00093) [2022-07-11 01:53:47,589][26022] Updated weights on worker 0-0, policy_version 984824 (0.00095) [2022-07-11 01:53:49,844][26022] Updated weights on worker 0-0, policy_version 984834 (0.00086) [2022-07-11 01:53:50,107][25689] Fps is (10 sec: 5378.2, 60 sec: 5540.4, 300 sec: 5560.0). Total num frames: 1008471040. Throughput: 0: 5846.5. Samples: 1008470204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:50,108][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 01:53:51,678][26022] Updated weights on worker 0-0, policy_version 984844 (0.00085) [2022-07-11 01:53:53,460][26022] Updated weights on worker 0-0, policy_version 984854 (0.00094) [2022-07-11 01:53:55,087][26022] Updated weights on worker 0-0, policy_version 984864 (0.00089) [2022-07-11 01:53:55,138][25689] Fps is (10 sec: 5669.8, 60 sec: 5595.4, 300 sec: 5563.4). Total num frames: 1008500736. Throughput: 0: 5812.4. Samples: 1008503238. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:53:55,138][25689] Avg episode reward: [(0, '0.734')] [2022-07-11 01:53:57,382][26022] Updated weights on worker 0-0, policy_version 984874 (0.00091) [2022-07-11 01:53:58,841][26022] Updated weights on worker 0-0, policy_version 984884 (0.00079) [2022-07-11 01:54:00,148][25689] Fps is (10 sec: 5711.3, 60 sec: 5567.8, 300 sec: 5571.0). Total num frames: 1008528384. Throughput: 0: 5793.7. Samples: 1008536998. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:00,148][25689] Avg episode reward: [(0, '1.868')] [2022-07-11 01:54:00,844][26022] Updated weights on worker 0-0, policy_version 984894 (0.00087) [2022-07-11 01:54:02,735][26022] Updated weights on worker 0-0, policy_version 984904 (0.00087) [2022-07-11 01:54:05,088][26022] Updated weights on worker 0-0, policy_version 984914 (0.00426) [2022-07-11 01:54:05,162][25689] Fps is (10 sec: 5107.8, 60 sec: 5533.7, 300 sec: 5555.3). Total num frames: 1008551936. Throughput: 0: 5700.3. Samples: 1008551906. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:05,164][25689] Avg episode reward: [(0, '1.387')] [2022-07-11 01:54:06,460][26022] Updated weights on worker 0-0, policy_version 984924 (0.00086) [2022-07-11 01:54:08,696][26022] Updated weights on worker 0-0, policy_version 984934 (0.00095) [2022-07-11 01:54:10,062][26022] Updated weights on worker 0-0, policy_version 984944 (0.00085) [2022-07-11 01:54:10,247][25689] Fps is (10 sec: 5374.0, 60 sec: 5567.4, 300 sec: 5561.5). Total num frames: 1008582656. Throughput: 0: 5726.3. Samples: 1008585528. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:10,247][25689] Avg episode reward: [(0, '1.043')] [2022-07-11 01:54:12,144][26022] Updated weights on worker 0-0, policy_version 984954 (0.00092) [2022-07-11 01:54:13,639][26022] Updated weights on worker 0-0, policy_version 984964 (0.00096) [2022-07-11 01:54:15,274][25689] Fps is (10 sec: 5771.9, 60 sec: 5531.8, 300 sec: 5561.1). Total num frames: 1008610304. Throughput: 0: 5775.7. Samples: 1008619540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:15,275][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 01:54:15,648][26022] Updated weights on worker 0-0, policy_version 984974 (0.00088) [2022-07-11 01:54:17,482][26022] Updated weights on worker 0-0, policy_version 984984 (0.00095) [2022-07-11 01:54:19,290][26022] Updated weights on worker 0-0, policy_version 984994 (0.00081) [2022-07-11 01:54:20,277][25689] Fps is (10 sec: 5614.9, 60 sec: 5574.2, 300 sec: 5565.8). Total num frames: 1008638976. Throughput: 0: 4923.9. Samples: 1008636114. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:20,284][25689] Avg episode reward: [(0, '-0.870')] [2022-07-11 01:54:21,092][26022] Updated weights on worker 0-0, policy_version 985004 (0.00092) [2022-07-11 01:54:23,113][26022] Updated weights on worker 0-0, policy_version 985014 (0.00088) [2022-07-11 01:54:24,655][26022] Updated weights on worker 0-0, policy_version 985024 (0.00086) [2022-07-11 01:54:25,314][25689] Fps is (10 sec: 5711.8, 60 sec: 5578.8, 300 sec: 5563.4). Total num frames: 1008667648. Throughput: 0: 5871.1. Samples: 1008670220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:25,316][25689] Avg episode reward: [(0, '-1.022')] [2022-07-11 01:54:26,780][26022] Updated weights on worker 0-0, policy_version 985034 (0.00051) [2022-07-11 01:54:28,285][26022] Updated weights on worker 0-0, policy_version 985044 (0.00094) [2022-07-11 01:54:30,331][26022] Updated weights on worker 0-0, policy_version 985054 (0.00086) [2022-07-11 01:54:30,432][25689] Fps is (10 sec: 5647.2, 60 sec: 5574.0, 300 sec: 5569.3). Total num frames: 1008696320. Throughput: 0: 5873.6. Samples: 1008704086. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:30,432][25689] Avg episode reward: [(0, '-1.245')] [2022-07-11 01:54:31,984][26022] Updated weights on worker 0-0, policy_version 985064 (0.00095) [2022-07-11 01:54:33,954][26022] Updated weights on worker 0-0, policy_version 985074 (0.00089) [2022-07-11 01:54:35,506][25689] Fps is (10 sec: 5626.5, 60 sec: 5584.8, 300 sec: 5564.6). Total num frames: 1008724992. Throughput: 0: 5010.0. Samples: 1008720898. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:35,506][25689] Avg episode reward: [(0, '-0.959')] [2022-07-11 01:54:35,609][26022] Updated weights on worker 0-0, policy_version 985084 (0.00087) [2022-07-11 01:54:37,631][26022] Updated weights on worker 0-0, policy_version 985094 (0.00081) [2022-07-11 01:54:39,160][26022] Updated weights on worker 0-0, policy_version 985104 (0.00092) [2022-07-11 01:54:40,606][25689] Fps is (10 sec: 5434.7, 60 sec: 5526.4, 300 sec: 5564.1). Total num frames: 1008751616. Throughput: 0: 5845.2. Samples: 1008754942. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:40,607][25689] Avg episode reward: [(0, '-0.198')] [2022-07-11 01:54:41,291][26022] Updated weights on worker 0-0, policy_version 985114 (0.00090) [2022-07-11 01:54:42,871][26022] Updated weights on worker 0-0, policy_version 985124 (0.00097) [2022-07-11 01:54:44,799][26022] Updated weights on worker 0-0, policy_version 985134 (0.00080) [2022-07-11 01:54:45,671][25689] Fps is (10 sec: 5641.3, 60 sec: 5588.8, 300 sec: 5568.4). Total num frames: 1008782336. Throughput: 0: 5820.5. Samples: 1008788706. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:45,671][25689] Avg episode reward: [(0, '1.144')] [2022-07-11 01:54:46,727][26022] Updated weights on worker 0-0, policy_version 985144 (0.00085) [2022-07-11 01:54:48,526][26022] Updated weights on worker 0-0, policy_version 985154 (0.00089) [2022-07-11 01:54:50,303][26022] Updated weights on worker 0-0, policy_version 985164 (0.00085) [2022-07-11 01:54:50,731][25689] Fps is (10 sec: 5765.1, 60 sec: 5591.0, 300 sec: 5564.0). Total num frames: 1008809984. Throughput: 0: 4987.7. Samples: 1008805332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:50,731][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 01:54:52,391][26022] Updated weights on worker 0-0, policy_version 985174 (0.00085) [2022-07-11 01:54:53,645][26022] Updated weights on worker 0-0, policy_version 985184 (0.00091) [2022-07-11 01:54:55,737][25689] Fps is (10 sec: 5289.6, 60 sec: 5525.6, 300 sec: 5560.9). Total num frames: 1008835584. Throughput: 0: 5846.4. Samples: 1008839180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:54:55,738][25689] Avg episode reward: [(0, '0.787')] [2022-07-11 01:54:55,957][26022] Updated weights on worker 0-0, policy_version 985194 (0.00084) [2022-07-11 01:54:57,235][26022] Updated weights on worker 0-0, policy_version 985204 (0.00079) [2022-07-11 01:54:59,595][26022] Updated weights on worker 0-0, policy_version 985214 (0.00089) [2022-07-11 01:55:00,747][25689] Fps is (10 sec: 5623.0, 60 sec: 5576.3, 300 sec: 5571.5). Total num frames: 1008866304. Throughput: 0: 5851.3. Samples: 1008872790. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:00,751][25689] Avg episode reward: [(0, '0.446')] [2022-07-11 01:55:01,326][26022] Updated weights on worker 0-0, policy_version 985224 (0.00087) [2022-07-11 01:55:03,367][26022] Updated weights on worker 0-0, policy_version 985234 (0.00093) [2022-07-11 01:55:05,384][26022] Updated weights on worker 0-0, policy_version 985244 (0.00087) [2022-07-11 01:55:05,775][25689] Fps is (10 sec: 5611.0, 60 sec: 5608.8, 300 sec: 5558.9). Total num frames: 1008891904. Throughput: 0: 4918.8. Samples: 1008887594. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:05,775][25689] Avg episode reward: [(0, '0.309')] [2022-07-11 01:55:07,138][26022] Updated weights on worker 0-0, policy_version 985254 (0.00083) [2022-07-11 01:55:09,012][26022] Updated weights on worker 0-0, policy_version 985264 (0.00090) [2022-07-11 01:55:10,767][26022] Updated weights on worker 0-0, policy_version 985274 (0.00094) [2022-07-11 01:55:10,835][25689] Fps is (10 sec: 5379.7, 60 sec: 5577.3, 300 sec: 5566.2). Total num frames: 1008920576. Throughput: 0: 5760.5. Samples: 1008921142. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:10,837][25689] Avg episode reward: [(0, '0.251')] [2022-07-11 01:55:12,519][26022] Updated weights on worker 0-0, policy_version 985284 (0.00084) [2022-07-11 01:55:14,443][26022] Updated weights on worker 0-0, policy_version 985294 (0.00088) [2022-07-11 01:55:15,843][25689] Fps is (10 sec: 5695.6, 60 sec: 5596.0, 300 sec: 5562.8). Total num frames: 1008949248. Throughput: 0: 5768.9. Samples: 1008955168. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:15,844][25689] Avg episode reward: [(0, '-0.204')] [2022-07-11 01:55:16,210][26022] Updated weights on worker 0-0, policy_version 985304 (0.00096) [2022-07-11 01:55:18,210][26022] Updated weights on worker 0-0, policy_version 985314 (0.00085) [2022-07-11 01:55:19,838][26022] Updated weights on worker 0-0, policy_version 985324 (0.00089) [2022-07-11 01:55:20,871][25689] Fps is (10 sec: 5611.9, 60 sec: 5576.8, 300 sec: 5565.8). Total num frames: 1008976896. Throughput: 0: 4934.9. Samples: 1008972100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:20,873][25689] Avg episode reward: [(0, '-0.819')] [2022-07-11 01:55:21,729][26022] Updated weights on worker 0-0, policy_version 985334 (0.00083) [2022-07-11 01:55:23,452][26022] Updated weights on worker 0-0, policy_version 985344 (0.00083) [2022-07-11 01:55:24,456][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:55:24,468][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000985348_1008996352.pth [2022-07-11 01:55:24,469][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000983392_1006993408.pth [2022-07-11 01:55:25,204][26022] Updated weights on worker 0-0, policy_version 985354 (0.00086) [2022-07-11 01:55:25,881][25689] Fps is (10 sec: 5508.7, 60 sec: 5562.4, 300 sec: 5563.2). Total num frames: 1009004544. Throughput: 0: 5873.3. Samples: 1009005682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:25,881][25689] Avg episode reward: [(0, '-0.715')] [2022-07-11 01:55:27,383][26022] Updated weights on worker 0-0, policy_version 985364 (0.00340) [2022-07-11 01:55:28,921][26022] Updated weights on worker 0-0, policy_version 985374 (0.00086) [2022-07-11 01:55:30,867][26022] Updated weights on worker 0-0, policy_version 985384 (0.00087) [2022-07-11 01:55:30,938][25689] Fps is (10 sec: 5696.3, 60 sec: 5584.9, 300 sec: 5565.7). Total num frames: 1009034240. Throughput: 0: 5886.5. Samples: 1009039476. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:30,938][25689] Avg episode reward: [(0, '-0.252')] [2022-07-11 01:55:32,633][26022] Updated weights on worker 0-0, policy_version 985394 (0.00087) [2022-07-11 01:55:34,423][26022] Updated weights on worker 0-0, policy_version 985404 (0.00080) [2022-07-11 01:55:35,971][25689] Fps is (10 sec: 5581.6, 60 sec: 5554.8, 300 sec: 5558.5). Total num frames: 1009060864. Throughput: 0: 5020.1. Samples: 1009056214. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:35,971][25689] Avg episode reward: [(0, '-0.289')] [2022-07-11 01:55:36,374][26022] Updated weights on worker 0-0, policy_version 985414 (0.00091) [2022-07-11 01:55:38,013][26022] Updated weights on worker 0-0, policy_version 985424 (0.00092) [2022-07-11 01:55:40,021][26022] Updated weights on worker 0-0, policy_version 985434 (0.00111) [2022-07-11 01:55:40,981][25689] Fps is (10 sec: 5505.4, 60 sec: 5597.0, 300 sec: 5565.7). Total num frames: 1009089536. Throughput: 0: 5870.9. Samples: 1009090168. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:40,982][25689] Avg episode reward: [(0, '0.397')] [2022-07-11 01:55:41,768][26022] Updated weights on worker 0-0, policy_version 985444 (0.00089) [2022-07-11 01:55:43,642][26022] Updated weights on worker 0-0, policy_version 985454 (0.00085) [2022-07-11 01:55:45,190][26022] Updated weights on worker 0-0, policy_version 985464 (0.00095) [2022-07-11 01:55:45,994][25689] Fps is (10 sec: 5721.5, 60 sec: 5567.9, 300 sec: 5566.6). Total num frames: 1009118208. Throughput: 0: 5883.6. Samples: 1009124018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:45,994][25689] Avg episode reward: [(0, '-0.187')] [2022-07-11 01:55:47,398][26022] Updated weights on worker 0-0, policy_version 985474 (0.00088) [2022-07-11 01:55:48,929][26022] Updated weights on worker 0-0, policy_version 985484 (0.00089) [2022-07-11 01:55:50,971][26022] Updated weights on worker 0-0, policy_version 985494 (0.00090) [2022-07-11 01:55:51,067][25689] Fps is (10 sec: 5584.4, 60 sec: 5566.7, 300 sec: 5558.4). Total num frames: 1009145856. Throughput: 0: 5030.0. Samples: 1009140726. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:51,067][25689] Avg episode reward: [(0, '0.844')] [2022-07-11 01:55:52,644][26022] Updated weights on worker 0-0, policy_version 985504 (0.00092) [2022-07-11 01:55:54,603][26022] Updated weights on worker 0-0, policy_version 985514 (0.00086) [2022-07-11 01:55:56,071][25689] Fps is (10 sec: 5487.0, 60 sec: 5600.8, 300 sec: 5562.5). Total num frames: 1009173504. Throughput: 0: 5875.0. Samples: 1009174302. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:55:56,072][25689] Avg episode reward: [(0, '0.963')] [2022-07-11 01:55:56,301][26022] Updated weights on worker 0-0, policy_version 985524 (0.00085) [2022-07-11 01:55:58,266][26022] Updated weights on worker 0-0, policy_version 985534 (0.00085) [2022-07-11 01:55:59,930][26022] Updated weights on worker 0-0, policy_version 985544 (0.00086) [2022-07-11 01:56:01,099][25689] Fps is (10 sec: 5613.6, 60 sec: 5565.1, 300 sec: 5572.6). Total num frames: 1009202176. Throughput: 0: 5859.4. Samples: 1009208048. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:01,100][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 01:56:02,279][26022] Updated weights on worker 0-0, policy_version 985554 (0.00090) [2022-07-11 01:56:03,942][26022] Updated weights on worker 0-0, policy_version 985564 (0.00084) [2022-07-11 01:56:05,971][26022] Updated weights on worker 0-0, policy_version 985574 (0.00091) [2022-07-11 01:56:06,118][25689] Fps is (10 sec: 5402.1, 60 sec: 5566.1, 300 sec: 5567.3). Total num frames: 1009227776. Throughput: 0: 4907.5. Samples: 1009222776. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:06,118][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 01:56:07,586][26022] Updated weights on worker 0-0, policy_version 985584 (0.00106) [2022-07-11 01:56:09,600][26022] Updated weights on worker 0-0, policy_version 985594 (0.00085) [2022-07-11 01:56:11,173][26022] Updated weights on worker 0-0, policy_version 985604 (0.00090) [2022-07-11 01:56:11,190][25689] Fps is (10 sec: 5480.0, 60 sec: 5581.9, 300 sec: 5569.4). Total num frames: 1009257472. Throughput: 0: 5749.8. Samples: 1009256432. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:11,191][25689] Avg episode reward: [(0, '0.937')] [2022-07-11 01:56:13,411][26022] Updated weights on worker 0-0, policy_version 985614 (0.00062) [2022-07-11 01:56:14,765][26022] Updated weights on worker 0-0, policy_version 985624 (0.00089) [2022-07-11 01:56:16,214][25689] Fps is (10 sec: 5679.9, 60 sec: 5563.5, 300 sec: 5569.8). Total num frames: 1009285120. Throughput: 0: 5757.4. Samples: 1009290270. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:16,214][25689] Avg episode reward: [(0, '0.529')] [2022-07-11 01:56:17,086][26022] Updated weights on worker 0-0, policy_version 985634 (0.00082) [2022-07-11 01:56:18,484][26022] Updated weights on worker 0-0, policy_version 985644 (0.00089) [2022-07-11 01:56:20,690][26022] Updated weights on worker 0-0, policy_version 985654 (0.00099) [2022-07-11 01:56:21,248][25689] Fps is (10 sec: 5497.9, 60 sec: 5562.9, 300 sec: 5566.6). Total num frames: 1009312768. Throughput: 0: 4917.0. Samples: 1009307116. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:21,248][25689] Avg episode reward: [(0, '-0.037')] [2022-07-11 01:56:22,115][26022] Updated weights on worker 0-0, policy_version 985664 (0.00092) [2022-07-11 01:56:24,471][26022] Updated weights on worker 0-0, policy_version 985674 (0.00084) [2022-07-11 01:56:25,894][26022] Updated weights on worker 0-0, policy_version 985684 (0.00084) [2022-07-11 01:56:26,249][25689] Fps is (10 sec: 5611.9, 60 sec: 5580.6, 300 sec: 5574.5). Total num frames: 1009341440. Throughput: 0: 5831.1. Samples: 1009340166. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:26,250][25689] Avg episode reward: [(0, '-0.024')] [2022-07-11 01:56:28,251][26022] Updated weights on worker 0-0, policy_version 985694 (0.00087) [2022-07-11 01:56:29,639][26022] Updated weights on worker 0-0, policy_version 985704 (0.00091) [2022-07-11 01:56:31,371][25689] Fps is (10 sec: 5462.4, 60 sec: 5523.8, 300 sec: 5566.3). Total num frames: 1009368064. Throughput: 0: 5800.0. Samples: 1009373480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:31,371][25689] Avg episode reward: [(0, '0.359')] [2022-07-11 01:56:31,763][26022] Updated weights on worker 0-0, policy_version 985714 (0.00081) [2022-07-11 01:56:33,348][26022] Updated weights on worker 0-0, policy_version 985724 (0.00092) [2022-07-11 01:56:35,430][26022] Updated weights on worker 0-0, policy_version 985734 (0.00097) [2022-07-11 01:56:36,401][25689] Fps is (10 sec: 5547.8, 60 sec: 5575.0, 300 sec: 5569.7). Total num frames: 1009397760. Throughput: 0: 4962.6. Samples: 1009390452. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:36,402][25689] Avg episode reward: [(0, '0.204')] [2022-07-11 01:56:36,908][26022] Updated weights on worker 0-0, policy_version 985744 (0.00636) [2022-07-11 01:56:38,971][26022] Updated weights on worker 0-0, policy_version 985754 (0.00082) [2022-07-11 01:56:40,590][26022] Updated weights on worker 0-0, policy_version 985764 (0.00087) [2022-07-11 01:56:41,431][25689] Fps is (10 sec: 5801.8, 60 sec: 5573.2, 300 sec: 5573.1). Total num frames: 1009426432. Throughput: 0: 5805.9. Samples: 1009424298. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:41,433][25689] Avg episode reward: [(0, '-0.081')] [2022-07-11 01:56:42,658][26022] Updated weights on worker 0-0, policy_version 985774 (0.00080) [2022-07-11 01:56:44,289][26022] Updated weights on worker 0-0, policy_version 985784 (0.00092) [2022-07-11 01:56:46,214][26022] Updated weights on worker 0-0, policy_version 985794 (0.00084) [2022-07-11 01:56:46,448][25689] Fps is (10 sec: 5605.5, 60 sec: 5555.7, 300 sec: 5570.2). Total num frames: 1009454080. Throughput: 0: 5835.0. Samples: 1009458028. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:46,449][25689] Avg episode reward: [(0, '-0.100')] [2022-07-11 01:56:47,975][26022] Updated weights on worker 0-0, policy_version 985804 (0.00087) [2022-07-11 01:56:49,901][26022] Updated weights on worker 0-0, policy_version 985814 (0.00088) [2022-07-11 01:56:51,487][25689] Fps is (10 sec: 5498.8, 60 sec: 5558.9, 300 sec: 5566.2). Total num frames: 1009481728. Throughput: 0: 5877.3. Samples: 1009491710. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:51,489][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 01:56:51,759][26022] Updated weights on worker 0-0, policy_version 985824 (0.00087) [2022-07-11 01:56:53,577][26022] Updated weights on worker 0-0, policy_version 985834 (0.00084) [2022-07-11 01:56:55,308][26022] Updated weights on worker 0-0, policy_version 985844 (0.00090) [2022-07-11 01:56:56,509][25689] Fps is (10 sec: 5598.1, 60 sec: 5574.3, 300 sec: 5564.1). Total num frames: 1009510400. Throughput: 0: 5873.0. Samples: 1009508546. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:56:56,510][25689] Avg episode reward: [(0, '-0.326')] [2022-07-11 01:56:57,246][26022] Updated weights on worker 0-0, policy_version 985854 (0.00090) [2022-07-11 01:56:58,921][26022] Updated weights on worker 0-0, policy_version 985864 (0.00093) [2022-07-11 01:57:00,859][26022] Updated weights on worker 0-0, policy_version 985874 (0.00089) [2022-07-11 01:57:01,579][25689] Fps is (10 sec: 5682.2, 60 sec: 5570.4, 300 sec: 5573.6). Total num frames: 1009539072. Throughput: 0: 5862.2. Samples: 1009542410. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:01,579][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 01:57:02,856][26022] Updated weights on worker 0-0, policy_version 985884 (0.00091) [2022-07-11 01:57:04,812][26022] Updated weights on worker 0-0, policy_version 985894 (0.00088) [2022-07-11 01:57:06,558][26022] Updated weights on worker 0-0, policy_version 985904 (0.00093) [2022-07-11 01:57:06,607][25689] Fps is (10 sec: 5476.1, 60 sec: 5586.5, 300 sec: 5571.2). Total num frames: 1009565696. Throughput: 0: 5759.5. Samples: 1009574130. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:06,607][25689] Avg episode reward: [(0, '-0.282')] [2022-07-11 01:57:08,386][26022] Updated weights on worker 0-0, policy_version 985914 (0.00095) [2022-07-11 01:57:10,185][26022] Updated weights on worker 0-0, policy_version 985924 (0.00088) [2022-07-11 01:57:11,662][25689] Fps is (10 sec: 5382.7, 60 sec: 5554.2, 300 sec: 5571.4). Total num frames: 1009593344. Throughput: 0: 4922.0. Samples: 1009591008. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:11,662][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 01:57:12,119][26022] Updated weights on worker 0-0, policy_version 985934 (0.00096) [2022-07-11 01:57:13,851][26022] Updated weights on worker 0-0, policy_version 985944 (0.00086) [2022-07-11 01:57:15,837][26022] Updated weights on worker 0-0, policy_version 985954 (0.00082) [2022-07-11 01:57:16,686][25689] Fps is (10 sec: 5587.8, 60 sec: 5571.1, 300 sec: 5568.8). Total num frames: 1009622016. Throughput: 0: 5764.1. Samples: 1009624846. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:16,686][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 01:57:17,535][26022] Updated weights on worker 0-0, policy_version 985964 (0.00085) [2022-07-11 01:57:19,405][26022] Updated weights on worker 0-0, policy_version 985974 (0.00085) [2022-07-11 01:57:21,273][26022] Updated weights on worker 0-0, policy_version 985984 (0.01149) [2022-07-11 01:57:21,721][25689] Fps is (10 sec: 5496.7, 60 sec: 5554.0, 300 sec: 5569.0). Total num frames: 1009648640. Throughput: 0: 5769.4. Samples: 1009658620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:21,722][25689] Avg episode reward: [(0, '1.312')] [2022-07-11 01:57:22,837][26022] Updated weights on worker 0-0, policy_version 985994 (0.00086) [2022-07-11 01:57:24,546][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:57:24,562][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000986001_1009665024.pth [2022-07-11 01:57:24,563][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000984042_1007659008.pth [2022-07-11 01:57:25,001][26022] Updated weights on worker 0-0, policy_version 986004 (0.00085) [2022-07-11 01:57:26,483][26022] Updated weights on worker 0-0, policy_version 986014 (0.00095) [2022-07-11 01:57:26,763][25689] Fps is (10 sec: 5589.0, 60 sec: 5567.3, 300 sec: 5570.1). Total num frames: 1009678336. Throughput: 0: 5026.1. Samples: 1009675436. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:26,763][25689] Avg episode reward: [(0, '1.490')] [2022-07-11 01:57:28,520][26022] Updated weights on worker 0-0, policy_version 986024 (0.00088) [2022-07-11 01:57:30,597][26022] Updated weights on worker 0-0, policy_version 986034 (0.00423) [2022-07-11 01:57:31,868][25689] Fps is (10 sec: 5651.7, 60 sec: 5585.7, 300 sec: 5565.7). Total num frames: 1009705984. Throughput: 0: 5817.1. Samples: 1009708550. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:31,868][25689] Avg episode reward: [(0, '1.691')] [2022-07-11 01:57:32,284][26022] Updated weights on worker 0-0, policy_version 986044 (0.00098) [2022-07-11 01:57:34,043][26022] Updated weights on worker 0-0, policy_version 986054 (0.00085) [2022-07-11 01:57:35,903][26022] Updated weights on worker 0-0, policy_version 986064 (0.00085) [2022-07-11 01:57:36,879][25689] Fps is (10 sec: 5668.3, 60 sec: 5587.5, 300 sec: 5573.0). Total num frames: 1009735680. Throughput: 0: 5823.1. Samples: 1009742436. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:36,881][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 01:57:37,587][26022] Updated weights on worker 0-0, policy_version 986074 (0.00086) [2022-07-11 01:57:39,670][26022] Updated weights on worker 0-0, policy_version 986084 (0.00095) [2022-07-11 01:57:41,122][26022] Updated weights on worker 0-0, policy_version 986094 (0.00096) [2022-07-11 01:57:41,923][25689] Fps is (10 sec: 5601.2, 60 sec: 5552.4, 300 sec: 5565.6). Total num frames: 1009762304. Throughput: 0: 4978.3. Samples: 1009759188. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:41,923][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 01:57:43,188][26022] Updated weights on worker 0-0, policy_version 986104 (0.00082) [2022-07-11 01:57:45,103][26022] Updated weights on worker 0-0, policy_version 986114 (0.00088) [2022-07-11 01:57:46,787][26022] Updated weights on worker 0-0, policy_version 986124 (0.00084) [2022-07-11 01:57:46,936][25689] Fps is (10 sec: 5498.3, 60 sec: 5569.7, 300 sec: 5566.7). Total num frames: 1009790976. Throughput: 0: 5814.9. Samples: 1009792742. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:46,937][25689] Avg episode reward: [(0, '0.651')] [2022-07-11 01:57:48,549][26022] Updated weights on worker 0-0, policy_version 986134 (0.00093) [2022-07-11 01:57:50,423][26022] Updated weights on worker 0-0, policy_version 986144 (0.00081) [2022-07-11 01:57:52,040][25689] Fps is (10 sec: 5667.8, 60 sec: 5580.6, 300 sec: 5573.0). Total num frames: 1009819648. Throughput: 0: 5854.2. Samples: 1009826644. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:52,041][25689] Avg episode reward: [(0, '0.552')] [2022-07-11 01:57:52,219][26022] Updated weights on worker 0-0, policy_version 986154 (0.00084) [2022-07-11 01:57:54,302][26022] Updated weights on worker 0-0, policy_version 986164 (0.00083) [2022-07-11 01:57:55,979][26022] Updated weights on worker 0-0, policy_version 986174 (0.00093) [2022-07-11 01:57:57,079][25689] Fps is (10 sec: 5552.8, 60 sec: 5562.1, 300 sec: 5566.9). Total num frames: 1009847296. Throughput: 0: 5001.7. Samples: 1009843468. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:57:57,079][25689] Avg episode reward: [(0, '0.279')] [2022-07-11 01:57:57,872][26022] Updated weights on worker 0-0, policy_version 986184 (0.00086) [2022-07-11 01:57:59,573][26022] Updated weights on worker 0-0, policy_version 986194 (0.00085) [2022-07-11 01:58:01,882][26022] Updated weights on worker 0-0, policy_version 986204 (0.00084) [2022-07-11 01:58:02,102][25689] Fps is (10 sec: 5393.7, 60 sec: 5532.5, 300 sec: 5570.1). Total num frames: 1009873920. Throughput: 0: 5856.4. Samples: 1009877368. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:02,104][25689] Avg episode reward: [(0, '1.063')] [2022-07-11 01:58:03,574][26022] Updated weights on worker 0-0, policy_version 986214 (0.00082) [2022-07-11 01:58:05,550][26022] Updated weights on worker 0-0, policy_version 986224 (0.00092) [2022-07-11 01:58:07,075][26022] Updated weights on worker 0-0, policy_version 986234 (0.00083) [2022-07-11 01:58:07,111][25689] Fps is (10 sec: 5614.1, 60 sec: 5585.1, 300 sec: 5575.0). Total num frames: 1009903616. Throughput: 0: 5768.3. Samples: 1009909114. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:07,114][25689] Avg episode reward: [(0, '1.118')] [2022-07-11 01:58:09,247][26022] Updated weights on worker 0-0, policy_version 986244 (0.00055) [2022-07-11 01:58:10,685][26022] Updated weights on worker 0-0, policy_version 986254 (0.00083) [2022-07-11 01:58:12,155][25689] Fps is (10 sec: 5602.6, 60 sec: 5569.2, 300 sec: 5563.9). Total num frames: 1009930240. Throughput: 0: 4944.1. Samples: 1009926092. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:12,155][25689] Avg episode reward: [(0, '0.953')] [2022-07-11 01:58:12,590][26022] Updated weights on worker 0-0, policy_version 986264 (0.00089) [2022-07-11 01:58:14,352][26022] Updated weights on worker 0-0, policy_version 986274 (0.00087) [2022-07-11 01:58:16,570][26022] Updated weights on worker 0-0, policy_version 986284 (0.00093) [2022-07-11 01:58:17,169][25689] Fps is (10 sec: 5497.2, 60 sec: 5570.0, 300 sec: 5572.3). Total num frames: 1009958912. Throughput: 0: 5806.3. Samples: 1009960120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:17,170][25689] Avg episode reward: [(0, '0.958')] [2022-07-11 01:58:18,152][26022] Updated weights on worker 0-0, policy_version 986294 (0.00081) [2022-07-11 01:58:20,080][26022] Updated weights on worker 0-0, policy_version 986304 (0.00086) [2022-07-11 01:58:21,685][26022] Updated weights on worker 0-0, policy_version 986314 (0.00082) [2022-07-11 01:58:22,187][25689] Fps is (10 sec: 5716.0, 60 sec: 5605.6, 300 sec: 5573.7). Total num frames: 1009987584. Throughput: 0: 5791.0. Samples: 1009993678. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:22,187][25689] Avg episode reward: [(0, '0.583')] [2022-07-11 01:58:23,728][26022] Updated weights on worker 0-0, policy_version 986324 (0.00085) [2022-07-11 01:58:25,477][26022] Updated weights on worker 0-0, policy_version 986334 (0.00085) [2022-07-11 01:58:27,195][25689] Fps is (10 sec: 5617.4, 60 sec: 5574.7, 300 sec: 5571.3). Total num frames: 1010015232. Throughput: 0: 5045.4. Samples: 1010010450. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:27,196][25689] Avg episode reward: [(0, '0.950')] [2022-07-11 01:58:27,271][26022] Updated weights on worker 0-0, policy_version 986344 (0.00089) [2022-07-11 01:58:29,354][26022] Updated weights on worker 0-0, policy_version 986354 (0.00088) [2022-07-11 01:58:30,934][26022] Updated weights on worker 0-0, policy_version 986364 (0.00089) [2022-07-11 01:58:32,307][25689] Fps is (10 sec: 5464.1, 60 sec: 5574.2, 300 sec: 5569.3). Total num frames: 1010042880. Throughput: 0: 5843.9. Samples: 1010043858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:32,307][25689] Avg episode reward: [(0, '0.765')] [2022-07-11 01:58:32,908][26022] Updated weights on worker 0-0, policy_version 986374 (0.00086) [2022-07-11 01:58:34,571][26022] Updated weights on worker 0-0, policy_version 986384 (0.00085) [2022-07-11 01:58:36,377][26022] Updated weights on worker 0-0, policy_version 986394 (0.00084) [2022-07-11 01:58:37,325][25689] Fps is (10 sec: 5559.9, 60 sec: 5556.6, 300 sec: 5565.9). Total num frames: 1010071552. Throughput: 0: 5817.2. Samples: 1010077370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:37,326][25689] Avg episode reward: [(0, '0.728')] [2022-07-11 01:58:38,382][26022] Updated weights on worker 0-0, policy_version 986404 (0.00085) [2022-07-11 01:58:40,171][26022] Updated weights on worker 0-0, policy_version 986414 (0.00086) [2022-07-11 01:58:42,033][26022] Updated weights on worker 0-0, policy_version 986424 (0.00715) [2022-07-11 01:58:42,330][25689] Fps is (10 sec: 5823.1, 60 sec: 5611.0, 300 sec: 5576.3). Total num frames: 1010101248. Throughput: 0: 4989.9. Samples: 1010094192. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:42,331][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 01:58:43,604][26022] Updated weights on worker 0-0, policy_version 986434 (0.00090) [2022-07-11 01:58:45,504][26022] Updated weights on worker 0-0, policy_version 986444 (0.00100) [2022-07-11 01:58:47,335][25689] Fps is (10 sec: 5626.1, 60 sec: 5577.8, 300 sec: 5574.3). Total num frames: 1010127872. Throughput: 0: 5849.0. Samples: 1010128250. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:47,336][25689] Avg episode reward: [(0, '0.826')] [2022-07-11 01:58:47,375][26022] Updated weights on worker 0-0, policy_version 986454 (0.00084) [2022-07-11 01:58:49,350][26022] Updated weights on worker 0-0, policy_version 986464 (0.00089) [2022-07-11 01:58:51,035][26022] Updated weights on worker 0-0, policy_version 986474 (0.00090) [2022-07-11 01:58:52,466][25689] Fps is (10 sec: 5455.6, 60 sec: 5575.4, 300 sec: 5569.0). Total num frames: 1010156544. Throughput: 0: 5840.4. Samples: 1010161596. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:52,466][25689] Avg episode reward: [(0, '1.123')] [2022-07-11 01:58:52,989][26022] Updated weights on worker 0-0, policy_version 986484 (0.00088) [2022-07-11 01:58:54,693][26022] Updated weights on worker 0-0, policy_version 986494 (0.00086) [2022-07-11 01:58:56,798][26022] Updated weights on worker 0-0, policy_version 986504 (0.00082) [2022-07-11 01:58:57,498][25689] Fps is (10 sec: 5542.0, 60 sec: 5576.0, 300 sec: 5568.5). Total num frames: 1010184192. Throughput: 0: 5000.1. Samples: 1010178236. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:58:57,498][25689] Avg episode reward: [(0, '0.896')] [2022-07-11 01:58:58,364][26022] Updated weights on worker 0-0, policy_version 986514 (0.00090) [2022-07-11 01:59:00,343][26022] Updated weights on worker 0-0, policy_version 986524 (0.00092) [2022-07-11 01:59:02,123][26022] Updated weights on worker 0-0, policy_version 986534 (0.00080) [2022-07-11 01:59:02,518][25689] Fps is (10 sec: 5500.9, 60 sec: 5593.2, 300 sec: 5582.2). Total num frames: 1010211840. Throughput: 0: 5843.3. Samples: 1010212154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:02,518][25689] Avg episode reward: [(0, '1.044')] [2022-07-11 01:59:04,421][26022] Updated weights on worker 0-0, policy_version 986544 (0.00087) [2022-07-11 01:59:05,990][26022] Updated weights on worker 0-0, policy_version 986554 (0.00096) [2022-07-11 01:59:07,551][25689] Fps is (10 sec: 5500.3, 60 sec: 5557.1, 300 sec: 5572.8). Total num frames: 1010239488. Throughput: 0: 5709.2. Samples: 1010243666. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:07,552][25689] Avg episode reward: [(0, '1.192')] [2022-07-11 01:59:08,126][26022] Updated weights on worker 0-0, policy_version 986564 (0.00091) [2022-07-11 01:59:09,682][26022] Updated weights on worker 0-0, policy_version 986574 (0.00092) [2022-07-11 01:59:11,644][26022] Updated weights on worker 0-0, policy_version 986584 (0.00092) [2022-07-11 01:59:12,672][25689] Fps is (10 sec: 5445.8, 60 sec: 5567.0, 300 sec: 5571.1). Total num frames: 1010267136. Throughput: 0: 4888.6. Samples: 1010260376. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:12,672][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 01:59:13,291][26022] Updated weights on worker 0-0, policy_version 986594 (0.00087) [2022-07-11 01:59:15,197][26022] Updated weights on worker 0-0, policy_version 986604 (0.00079) [2022-07-11 01:59:16,910][26022] Updated weights on worker 0-0, policy_version 986614 (0.00086) [2022-07-11 01:59:17,688][25689] Fps is (10 sec: 5656.9, 60 sec: 5583.7, 300 sec: 5574.3). Total num frames: 1010296832. Throughput: 0: 5763.1. Samples: 1010294594. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:17,689][25689] Avg episode reward: [(0, '0.092')] [2022-07-11 01:59:18,943][26022] Updated weights on worker 0-0, policy_version 986624 (0.00093) [2022-07-11 01:59:20,565][26022] Updated weights on worker 0-0, policy_version 986634 (0.00089) [2022-07-11 01:59:22,695][25689] Fps is (10 sec: 5618.7, 60 sec: 5550.8, 300 sec: 5568.0). Total num frames: 1010323456. Throughput: 0: 5758.1. Samples: 1010328338. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:22,696][25689] Avg episode reward: [(0, '-0.909')] [2022-07-11 01:59:22,711][26022] Updated weights on worker 0-0, policy_version 986644 (0.00098) [2022-07-11 01:59:24,243][26022] Updated weights on worker 0-0, policy_version 986654 (0.00094) [2022-07-11 01:59:24,654][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 01:59:24,663][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000986656_1010335744.pth [2022-07-11 01:59:24,664][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000984696_1008328704.pth [2022-07-11 01:59:26,028][26022] Updated weights on worker 0-0, policy_version 986664 (0.00085) [2022-07-11 01:59:27,710][25689] Fps is (10 sec: 5415.5, 60 sec: 5550.3, 300 sec: 5566.5). Total num frames: 1010351104. Throughput: 0: 5040.5. Samples: 1010345276. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:27,710][25689] Avg episode reward: [(0, '-0.837')] [2022-07-11 01:59:28,022][26022] Updated weights on worker 0-0, policy_version 986674 (0.00093) [2022-07-11 01:59:29,646][26022] Updated weights on worker 0-0, policy_version 986684 (0.00097) [2022-07-11 01:59:31,796][26022] Updated weights on worker 0-0, policy_version 986694 (0.00091) [2022-07-11 01:59:32,770][25689] Fps is (10 sec: 5692.1, 60 sec: 5588.9, 300 sec: 5570.2). Total num frames: 1010380800. Throughput: 0: 5889.8. Samples: 1010378750. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:32,770][25689] Avg episode reward: [(0, '-1.619')] [2022-07-11 01:59:33,505][26022] Updated weights on worker 0-0, policy_version 986704 (0.00087) [2022-07-11 01:59:35,386][26022] Updated weights on worker 0-0, policy_version 986714 (0.00088) [2022-07-11 01:59:37,287][26022] Updated weights on worker 0-0, policy_version 986724 (0.00052) [2022-07-11 01:59:37,785][25689] Fps is (10 sec: 5589.7, 60 sec: 5555.2, 300 sec: 5571.8). Total num frames: 1010407424. Throughput: 0: 5854.9. Samples: 1010412262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:37,786][25689] Avg episode reward: [(0, '-1.400')] [2022-07-11 01:59:39,004][26022] Updated weights on worker 0-0, policy_version 986734 (0.00082) [2022-07-11 01:59:40,854][26022] Updated weights on worker 0-0, policy_version 986744 (0.00086) [2022-07-11 01:59:42,698][26022] Updated weights on worker 0-0, policy_version 986754 (0.00087) [2022-07-11 01:59:42,851][25689] Fps is (10 sec: 5484.9, 60 sec: 5532.7, 300 sec: 5564.8). Total num frames: 1010436096. Throughput: 0: 5841.3. Samples: 1010446074. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:42,852][25689] Avg episode reward: [(0, '-0.391')] [2022-07-11 01:59:44,391][26022] Updated weights on worker 0-0, policy_version 986764 (0.00086) [2022-07-11 01:59:46,318][26022] Updated weights on worker 0-0, policy_version 986774 (0.00090) [2022-07-11 01:59:47,877][25689] Fps is (10 sec: 5783.7, 60 sec: 5581.6, 300 sec: 5572.4). Total num frames: 1010465792. Throughput: 0: 5833.0. Samples: 1010462914. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:47,878][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 01:59:48,091][26022] Updated weights on worker 0-0, policy_version 986784 (0.00085) [2022-07-11 01:59:50,015][26022] Updated weights on worker 0-0, policy_version 986794 (0.00082) [2022-07-11 01:59:51,642][26022] Updated weights on worker 0-0, policy_version 986804 (0.00086) [2022-07-11 01:59:52,933][25689] Fps is (10 sec: 5687.7, 60 sec: 5571.5, 300 sec: 5578.3). Total num frames: 1010493440. Throughput: 0: 5849.5. Samples: 1010496698. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:52,934][25689] Avg episode reward: [(0, '0.110')] [2022-07-11 01:59:53,736][26022] Updated weights on worker 0-0, policy_version 986814 (0.00097) [2022-07-11 01:59:55,208][26022] Updated weights on worker 0-0, policy_version 986824 (0.00083) [2022-07-11 01:59:57,515][26022] Updated weights on worker 0-0, policy_version 986834 (0.00086) [2022-07-11 01:59:58,010][25689] Fps is (10 sec: 5457.1, 60 sec: 5567.4, 300 sec: 5566.7). Total num frames: 1010521088. Throughput: 0: 5841.7. Samples: 1010530410. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 01:59:58,011][25689] Avg episode reward: [(0, '-0.848')] [2022-07-11 01:59:58,993][26022] Updated weights on worker 0-0, policy_version 986844 (0.00089) [2022-07-11 02:00:01,076][26022] Updated weights on worker 0-0, policy_version 986854 (0.00085) [2022-07-11 02:00:02,975][26022] Updated weights on worker 0-0, policy_version 986864 (0.00088) [2022-07-11 02:00:03,070][25689] Fps is (10 sec: 5455.4, 60 sec: 5563.8, 300 sec: 5573.0). Total num frames: 1010548736. Throughput: 0: 5008.4. Samples: 1010547334. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:00:03,070][25689] Avg episode reward: [(0, '-0.013')] [2022-07-11 02:00:04,844][26022] Updated weights on worker 0-0, policy_version 986874 (0.00086) [2022-07-11 02:00:06,830][26022] Updated weights on worker 0-0, policy_version 986884 (0.00078) [2022-07-11 02:00:08,153][25689] Fps is (10 sec: 5552.6, 60 sec: 5576.0, 300 sec: 5572.6). Total num frames: 1010577408. Throughput: 0: 5731.5. Samples: 1010579126. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:00:08,154][25689] Avg episode reward: [(0, '-0.836')] [2022-07-11 02:00:08,500][26022] Updated weights on worker 0-0, policy_version 986894 (0.00082) [2022-07-11 02:00:10,354][26022] Updated weights on worker 0-0, policy_version 986904 (0.00082) [2022-07-11 02:00:12,184][26022] Updated weights on worker 0-0, policy_version 986914 (0.00090) [2022-07-11 02:00:13,198][25689] Fps is (10 sec: 5560.3, 60 sec: 5583.0, 300 sec: 5568.4). Total num frames: 1010605056. Throughput: 0: 5733.8. Samples: 1010612894. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:00:13,199][25689] Avg episode reward: [(0, '-1.136')] [2022-07-11 02:00:13,863][26022] Updated weights on worker 0-0, policy_version 986924 (0.00106) [2022-07-11 02:00:15,900][26022] Updated weights on worker 0-0, policy_version 986934 (0.00092) [2022-07-11 02:00:17,471][26022] Updated weights on worker 0-0, policy_version 986944 (0.00087) [2022-07-11 02:00:18,271][25689] Fps is (10 sec: 5566.6, 60 sec: 5560.9, 300 sec: 5571.1). Total num frames: 1010633728. Throughput: 0: 4906.2. Samples: 1010629812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:00:18,271][25689] Avg episode reward: [(0, '-0.956')] [2022-07-11 02:00:19,418][26022] Updated weights on worker 0-0, policy_version 986954 (0.00092) [2022-07-11 02:00:21,110][26022] Updated weights on worker 0-0, policy_version 986964 (0.00100) [2022-07-11 02:00:23,178][26022] Updated weights on worker 0-0, policy_version 986974 (0.00090) [2022-07-11 02:00:23,314][25689] Fps is (10 sec: 5668.8, 60 sec: 5591.4, 300 sec: 5573.9). Total num frames: 1010662400. Throughput: 0: 5730.1. Samples: 1010663338. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:00:23,315][25689] Avg episode reward: [(0, '-0.346')] [2022-07-11 02:00:24,990][26022] Updated weights on worker 0-0, policy_version 986984 (0.00099) [2022-07-11 02:00:26,875][26022] Updated weights on worker 0-0, policy_version 986994 (0.00085) [2022-07-11 02:00:28,330][25689] Fps is (10 sec: 5598.7, 60 sec: 5591.2, 300 sec: 5567.7). Total num frames: 1010690048. Throughput: 0: 5813.1. Samples: 1010696418. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:00:28,331][25689] Avg episode reward: [(0, '-0.536')] [2022-07-11 02:00:28,768][26022] Updated weights on worker 0-0, policy_version 987004 (0.00096) [2022-07-11 02:00:30,553][26022] Updated weights on worker 0-0, policy_version 987014 (0.00089) [2022-07-11 02:00:32,521][26022] Updated weights on worker 0-0, policy_version 987024 (0.00097) [2022-07-11 02:00:33,400][25689] Fps is (10 sec: 5482.4, 60 sec: 5556.6, 300 sec: 5570.5). Total num frames: 1010717696. Throughput: 0: 4958.5. Samples: 1010713064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:00:33,401][25689] Avg episode reward: [(0, '-0.651')] [2022-07-11 02:00:34,323][26022] Updated weights on worker 0-0, policy_version 987034 (0.00090) [2022-07-11 02:00:36,056][26022] Updated weights on worker 0-0, policy_version 987044 (0.00086) [2022-07-11 02:00:38,044][26022] Updated weights on worker 0-0, policy_version 987054 (0.00161) [2022-07-11 02:00:38,469][25689] Fps is (10 sec: 5353.1, 60 sec: 5551.7, 300 sec: 5562.5). Total num frames: 1010744320. Throughput: 0: 5774.4. Samples: 1010746442. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:00:38,469][25689] Avg episode reward: [(0, '-0.443')] [2022-07-11 02:00:39,573][26022] Updated weights on worker 0-0, policy_version 987064 (0.00081) [2022-07-11 02:00:41,836][26022] Updated weights on worker 0-0, policy_version 987074 (0.00088) [2022-07-11 02:00:43,175][26022] Updated weights on worker 0-0, policy_version 987084 (0.00084) [2022-07-11 02:00:43,530][25689] Fps is (10 sec: 5660.9, 60 sec: 5585.9, 300 sec: 5568.5). Total num frames: 1010775040. Throughput: 0: 5779.6. Samples: 1010780178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:00:43,530][25689] Avg episode reward: [(0, '-0.009')] [2022-07-11 02:00:45,390][26022] Updated weights on worker 0-0, policy_version 987094 (0.00089) [2022-07-11 02:00:46,927][26022] Updated weights on worker 0-0, policy_version 987104 (0.00080) [2022-07-11 02:00:48,548][25689] Fps is (10 sec: 5689.2, 60 sec: 5536.0, 300 sec: 5566.1). Total num frames: 1010801664. Throughput: 0: 4964.7. Samples: 1010796792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:00:48,550][25689] Avg episode reward: [(0, '0.159')] [2022-07-11 02:00:49,044][26022] Updated weights on worker 0-0, policy_version 987114 (0.00091) [2022-07-11 02:00:50,768][26022] Updated weights on worker 0-0, policy_version 987124 (0.00083) [2022-07-11 02:00:52,858][26022] Updated weights on worker 0-0, policy_version 987134 (0.00090) [2022-07-11 02:00:53,590][25689] Fps is (10 sec: 5394.9, 60 sec: 5537.3, 300 sec: 5565.4). Total num frames: 1010829312. Throughput: 0: 5794.8. Samples: 1010830058. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:00:53,590][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 02:00:54,401][26022] Updated weights on worker 0-0, policy_version 987144 (0.00086) [2022-07-11 02:00:56,432][26022] Updated weights on worker 0-0, policy_version 987154 (0.00091) [2022-07-11 02:00:58,199][26022] Updated weights on worker 0-0, policy_version 987164 (0.00086) [2022-07-11 02:00:58,691][25689] Fps is (10 sec: 5552.4, 60 sec: 5551.9, 300 sec: 5564.0). Total num frames: 1010857984. Throughput: 0: 5780.9. Samples: 1010863348. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:00:58,692][25689] Avg episode reward: [(0, '1.358')] [2022-07-11 02:01:00,093][26022] Updated weights on worker 0-0, policy_version 987174 (0.00082) [2022-07-11 02:01:02,243][26022] Updated weights on worker 0-0, policy_version 987184 (0.00096) [2022-07-11 02:01:03,722][25689] Fps is (10 sec: 5356.6, 60 sec: 5520.8, 300 sec: 5563.8). Total num frames: 1010883584. Throughput: 0: 4954.3. Samples: 1010880214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:03,722][25689] Avg episode reward: [(0, '2.129')] [2022-07-11 02:01:04,001][26022] Updated weights on worker 0-0, policy_version 987194 (0.00082) [2022-07-11 02:01:05,917][26022] Updated weights on worker 0-0, policy_version 987204 (0.00087) [2022-07-11 02:01:07,725][26022] Updated weights on worker 0-0, policy_version 987214 (0.00088) [2022-07-11 02:01:08,819][25689] Fps is (10 sec: 5358.9, 60 sec: 5519.6, 300 sec: 5559.9). Total num frames: 1010912256. Throughput: 0: 5659.7. Samples: 1010911518. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:08,819][25689] Avg episode reward: [(0, '2.148')] [2022-07-11 02:01:09,597][26022] Updated weights on worker 0-0, policy_version 987224 (0.00089) [2022-07-11 02:01:11,607][26022] Updated weights on worker 0-0, policy_version 987234 (0.00084) [2022-07-11 02:01:13,210][26022] Updated weights on worker 0-0, policy_version 987244 (0.00088) [2022-07-11 02:01:13,872][25689] Fps is (10 sec: 5548.3, 60 sec: 5518.8, 300 sec: 5559.3). Total num frames: 1010939904. Throughput: 0: 5670.2. Samples: 1010945064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:13,873][25689] Avg episode reward: [(0, '1.826')] [2022-07-11 02:01:14,864][26022] Updated weights on worker 0-0, policy_version 987254 (0.00087) [2022-07-11 02:01:16,858][26022] Updated weights on worker 0-0, policy_version 987264 (0.00086) [2022-07-11 02:01:18,639][26022] Updated weights on worker 0-0, policy_version 987274 (0.00097) [2022-07-11 02:01:18,882][25689] Fps is (10 sec: 5596.6, 60 sec: 5524.5, 300 sec: 5563.2). Total num frames: 1010968576. Throughput: 0: 4890.1. Samples: 1010962082. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:18,884][25689] Avg episode reward: [(0, '1.846')] [2022-07-11 02:01:20,716][26022] Updated weights on worker 0-0, policy_version 987284 (0.00094) [2022-07-11 02:01:22,301][26022] Updated weights on worker 0-0, policy_version 987294 (0.00086) [2022-07-11 02:01:23,919][25689] Fps is (10 sec: 5708.0, 60 sec: 5525.1, 300 sec: 5562.6). Total num frames: 1010997248. Throughput: 0: 5729.9. Samples: 1010995942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:23,920][25689] Avg episode reward: [(0, '1.905')] [2022-07-11 02:01:24,220][26022] Updated weights on worker 0-0, policy_version 987304 (0.00086) [2022-07-11 02:01:24,766][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:01:24,782][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000987307_1011002368.pth [2022-07-11 02:01:24,783][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000985348_1008996352.pth [2022-07-11 02:01:26,010][26022] Updated weights on worker 0-0, policy_version 987314 (0.00103) [2022-07-11 02:01:28,068][26022] Updated weights on worker 0-0, policy_version 987324 (0.00095) [2022-07-11 02:01:29,014][25689] Fps is (10 sec: 5659.6, 60 sec: 5534.8, 300 sec: 5569.9). Total num frames: 1011025920. Throughput: 0: 5829.8. Samples: 1011029252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:29,015][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 02:01:29,724][26022] Updated weights on worker 0-0, policy_version 987334 (0.00096) [2022-07-11 02:01:31,799][26022] Updated weights on worker 0-0, policy_version 987344 (0.00091) [2022-07-11 02:01:33,471][26022] Updated weights on worker 0-0, policy_version 987354 (0.00096) [2022-07-11 02:01:34,079][25689] Fps is (10 sec: 5442.3, 60 sec: 5518.4, 300 sec: 5559.0). Total num frames: 1011052544. Throughput: 0: 5795.3. Samples: 1011062168. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:34,080][25689] Avg episode reward: [(0, '1.490')] [2022-07-11 02:01:35,429][26022] Updated weights on worker 0-0, policy_version 987364 (0.00092) [2022-07-11 02:01:37,219][26022] Updated weights on worker 0-0, policy_version 987374 (0.00098) [2022-07-11 02:01:39,006][26022] Updated weights on worker 0-0, policy_version 987384 (0.00088) [2022-07-11 02:01:39,081][25689] Fps is (10 sec: 5492.8, 60 sec: 5558.2, 300 sec: 5559.5). Total num frames: 1011081216. Throughput: 0: 5784.1. Samples: 1011078916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:39,083][25689] Avg episode reward: [(0, '0.766')] [2022-07-11 02:01:40,854][26022] Updated weights on worker 0-0, policy_version 987394 (0.00099) [2022-07-11 02:01:42,691][26022] Updated weights on worker 0-0, policy_version 987404 (0.00101) [2022-07-11 02:01:44,087][25689] Fps is (10 sec: 5729.8, 60 sec: 5529.5, 300 sec: 5563.2). Total num frames: 1011109888. Throughput: 0: 5783.9. Samples: 1011112594. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:44,087][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 02:01:44,449][26022] Updated weights on worker 0-0, policy_version 987414 (0.00084) [2022-07-11 02:01:46,477][26022] Updated weights on worker 0-0, policy_version 987424 (0.00096) [2022-07-11 02:01:47,865][26022] Updated weights on worker 0-0, policy_version 987434 (0.00090) [2022-07-11 02:01:49,132][25689] Fps is (10 sec: 5501.4, 60 sec: 5527.0, 300 sec: 5559.6). Total num frames: 1011136512. Throughput: 0: 5801.2. Samples: 1011145962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:49,133][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 02:01:50,117][26022] Updated weights on worker 0-0, policy_version 987444 (0.00091) [2022-07-11 02:01:51,969][26022] Updated weights on worker 0-0, policy_version 987454 (0.00082) [2022-07-11 02:01:53,563][26022] Updated weights on worker 0-0, policy_version 987464 (0.00060) [2022-07-11 02:01:54,183][25689] Fps is (10 sec: 5477.2, 60 sec: 5543.1, 300 sec: 5559.1). Total num frames: 1011165184. Throughput: 0: 5005.3. Samples: 1011162792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:54,183][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 02:01:55,617][26022] Updated weights on worker 0-0, policy_version 987474 (0.00088) [2022-07-11 02:01:57,328][26022] Updated weights on worker 0-0, policy_version 987484 (0.00097) [2022-07-11 02:01:58,923][26022] Updated weights on worker 0-0, policy_version 987494 (0.00099) [2022-07-11 02:01:59,195][25689] Fps is (10 sec: 5800.3, 60 sec: 5568.2, 300 sec: 5563.6). Total num frames: 1011194880. Throughput: 0: 5867.0. Samples: 1011196926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:01:59,196][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 02:02:01,354][26022] Updated weights on worker 0-0, policy_version 987504 (0.00087) [2022-07-11 02:02:03,103][26022] Updated weights on worker 0-0, policy_version 987514 (0.00094) [2022-07-11 02:02:04,215][25689] Fps is (10 sec: 5409.5, 60 sec: 5552.2, 300 sec: 5556.8). Total num frames: 1011219456. Throughput: 0: 5732.6. Samples: 1011227982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:04,216][25689] Avg episode reward: [(0, '1.497')] [2022-07-11 02:02:05,194][26022] Updated weights on worker 0-0, policy_version 987524 (0.00104) [2022-07-11 02:02:06,695][26022] Updated weights on worker 0-0, policy_version 987534 (0.00086) [2022-07-11 02:02:08,663][26022] Updated weights on worker 0-0, policy_version 987544 (0.00091) [2022-07-11 02:02:09,238][25689] Fps is (10 sec: 5302.3, 60 sec: 5559.0, 300 sec: 5560.9). Total num frames: 1011248128. Throughput: 0: 4918.3. Samples: 1011244846. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:09,238][25689] Avg episode reward: [(0, '1.417')] [2022-07-11 02:02:10,691][26022] Updated weights on worker 0-0, policy_version 987554 (0.00090) [2022-07-11 02:02:12,219][26022] Updated weights on worker 0-0, policy_version 987564 (0.00079) [2022-07-11 02:02:14,215][26022] Updated weights on worker 0-0, policy_version 987574 (0.00095) [2022-07-11 02:02:14,339][25689] Fps is (10 sec: 5563.4, 60 sec: 5554.7, 300 sec: 5556.0). Total num frames: 1011275776. Throughput: 0: 5742.9. Samples: 1011278546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:14,339][25689] Avg episode reward: [(0, '1.523')] [2022-07-11 02:02:15,963][26022] Updated weights on worker 0-0, policy_version 987584 (0.00090) [2022-07-11 02:02:17,882][26022] Updated weights on worker 0-0, policy_version 987594 (0.00081) [2022-07-11 02:02:19,343][25689] Fps is (10 sec: 5674.9, 60 sec: 5572.2, 300 sec: 5566.9). Total num frames: 1011305472. Throughput: 0: 5735.5. Samples: 1011312480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:19,344][25689] Avg episode reward: [(0, '1.261')] [2022-07-11 02:02:19,559][26022] Updated weights on worker 0-0, policy_version 987604 (0.00092) [2022-07-11 02:02:21,373][26022] Updated weights on worker 0-0, policy_version 987614 (0.00086) [2022-07-11 02:02:23,228][26022] Updated weights on worker 0-0, policy_version 987624 (0.00086) [2022-07-11 02:02:24,377][25689] Fps is (10 sec: 5712.5, 60 sec: 5555.4, 300 sec: 5560.2). Total num frames: 1011333120. Throughput: 0: 5028.8. Samples: 1011329372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:24,378][25689] Avg episode reward: [(0, '1.456')] [2022-07-11 02:02:25,028][26022] Updated weights on worker 0-0, policy_version 987634 (0.00090) [2022-07-11 02:02:26,879][26022] Updated weights on worker 0-0, policy_version 987644 (0.00088) [2022-07-11 02:02:28,772][26022] Updated weights on worker 0-0, policy_version 987654 (0.00088) [2022-07-11 02:02:29,423][25689] Fps is (10 sec: 5384.0, 60 sec: 5526.1, 300 sec: 5557.8). Total num frames: 1011359744. Throughput: 0: 5848.0. Samples: 1011362888. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:29,423][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 02:02:30,705][26022] Updated weights on worker 0-0, policy_version 987664 (0.00087) [2022-07-11 02:02:32,448][26022] Updated weights on worker 0-0, policy_version 987674 (0.00084) [2022-07-11 02:02:34,331][26022] Updated weights on worker 0-0, policy_version 987684 (0.00056) [2022-07-11 02:02:34,520][25689] Fps is (10 sec: 5653.5, 60 sec: 5590.9, 300 sec: 5559.7). Total num frames: 1011390464. Throughput: 0: 5837.7. Samples: 1011396360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:34,521][25689] Avg episode reward: [(0, '0.909')] [2022-07-11 02:02:36,035][26022] Updated weights on worker 0-0, policy_version 987694 (0.00081) [2022-07-11 02:02:37,923][26022] Updated weights on worker 0-0, policy_version 987704 (0.00093) [2022-07-11 02:02:39,533][25689] Fps is (10 sec: 5773.0, 60 sec: 5572.9, 300 sec: 5563.7). Total num frames: 1011418112. Throughput: 0: 4986.5. Samples: 1011413162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:39,534][25689] Avg episode reward: [(0, '-0.138')] [2022-07-11 02:02:39,752][26022] Updated weights on worker 0-0, policy_version 987714 (0.00092) [2022-07-11 02:02:41,667][26022] Updated weights on worker 0-0, policy_version 987724 (0.00087) [2022-07-11 02:02:43,185][26022] Updated weights on worker 0-0, policy_version 987734 (0.00086) [2022-07-11 02:02:44,589][25689] Fps is (10 sec: 5491.8, 60 sec: 5551.4, 300 sec: 5559.4). Total num frames: 1011445760. Throughput: 0: 5832.6. Samples: 1011447260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:44,590][25689] Avg episode reward: [(0, '-0.220')] [2022-07-11 02:02:45,414][26022] Updated weights on worker 0-0, policy_version 987744 (0.00085) [2022-07-11 02:02:46,845][26022] Updated weights on worker 0-0, policy_version 987754 (0.00051) [2022-07-11 02:02:49,019][26022] Updated weights on worker 0-0, policy_version 987764 (0.00097) [2022-07-11 02:02:49,625][25689] Fps is (10 sec: 5581.0, 60 sec: 5586.2, 300 sec: 5560.7). Total num frames: 1011474432. Throughput: 0: 5852.5. Samples: 1011481120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:49,625][25689] Avg episode reward: [(0, '-0.283')] [2022-07-11 02:02:50,568][26022] Updated weights on worker 0-0, policy_version 987774 (0.00089) [2022-07-11 02:02:52,575][26022] Updated weights on worker 0-0, policy_version 987784 (0.00089) [2022-07-11 02:02:54,216][26022] Updated weights on worker 0-0, policy_version 987794 (0.00093) [2022-07-11 02:02:54,672][25689] Fps is (10 sec: 5585.4, 60 sec: 5569.5, 300 sec: 5560.5). Total num frames: 1011502080. Throughput: 0: 5020.3. Samples: 1011497530. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:54,673][25689] Avg episode reward: [(0, '-0.460')] [2022-07-11 02:02:56,151][26022] Updated weights on worker 0-0, policy_version 987804 (0.00094) [2022-07-11 02:02:58,130][26022] Updated weights on worker 0-0, policy_version 987814 (0.00094) [2022-07-11 02:02:59,683][25689] Fps is (10 sec: 5599.5, 60 sec: 5552.8, 300 sec: 5567.7). Total num frames: 1011530752. Throughput: 0: 5857.0. Samples: 1011531178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:02:59,683][25689] Avg episode reward: [(0, '-0.462')] [2022-07-11 02:02:59,826][26022] Updated weights on worker 0-0, policy_version 987824 (0.00093) [2022-07-11 02:03:02,123][26022] Updated weights on worker 0-0, policy_version 987834 (0.00089) [2022-07-11 02:03:03,803][26022] Updated weights on worker 0-0, policy_version 987844 (0.00092) [2022-07-11 02:03:04,715][25689] Fps is (10 sec: 5302.4, 60 sec: 5551.7, 300 sec: 5550.0). Total num frames: 1011555328. Throughput: 0: 5713.4. Samples: 1011562246. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:04,715][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 02:03:05,599][26022] Updated weights on worker 0-0, policy_version 987854 (0.00087) [2022-07-11 02:03:07,579][26022] Updated weights on worker 0-0, policy_version 987864 (0.00089) [2022-07-11 02:03:09,444][26022] Updated weights on worker 0-0, policy_version 987874 (0.00062) [2022-07-11 02:03:09,724][25689] Fps is (10 sec: 5302.8, 60 sec: 5552.8, 300 sec: 5557.6). Total num frames: 1011584000. Throughput: 0: 4872.3. Samples: 1011579050. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:09,726][25689] Avg episode reward: [(0, '1.402')] [2022-07-11 02:03:11,353][26022] Updated weights on worker 0-0, policy_version 987884 (0.00081) [2022-07-11 02:03:13,129][26022] Updated weights on worker 0-0, policy_version 987894 (0.00108) [2022-07-11 02:03:14,790][25689] Fps is (10 sec: 5691.3, 60 sec: 5573.0, 300 sec: 5556.6). Total num frames: 1011612672. Throughput: 0: 5694.1. Samples: 1011612084. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:14,791][25689] Avg episode reward: [(0, '1.869')] [2022-07-11 02:03:15,055][26022] Updated weights on worker 0-0, policy_version 987904 (0.00086) [2022-07-11 02:03:16,977][26022] Updated weights on worker 0-0, policy_version 987914 (0.00091) [2022-07-11 02:03:18,695][26022] Updated weights on worker 0-0, policy_version 987924 (0.00098) [2022-07-11 02:03:19,843][25689] Fps is (10 sec: 5464.9, 60 sec: 5517.7, 300 sec: 5549.1). Total num frames: 1011639296. Throughput: 0: 5676.6. Samples: 1011645618. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:19,843][25689] Avg episode reward: [(0, '1.766')] [2022-07-11 02:03:20,621][26022] Updated weights on worker 0-0, policy_version 987934 (0.00089) [2022-07-11 02:03:22,307][26022] Updated weights on worker 0-0, policy_version 987944 (0.00086) [2022-07-11 02:03:24,289][26022] Updated weights on worker 0-0, policy_version 987954 (0.00417) [2022-07-11 02:03:24,851][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:03:24,863][25689] Fps is (10 sec: 5489.7, 60 sec: 5536.0, 300 sec: 5552.3). Total num frames: 1011667968. Throughput: 0: 4969.6. Samples: 1011662376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:24,864][25689] Avg episode reward: [(0, '1.868')] [2022-07-11 02:03:24,867][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000987958_1011668992.pth [2022-07-11 02:03:24,868][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000986001_1009665024.pth [2022-07-11 02:03:26,024][26022] Updated weights on worker 0-0, policy_version 987964 (0.00092) [2022-07-11 02:03:27,736][26022] Updated weights on worker 0-0, policy_version 987974 (0.00090) [2022-07-11 02:03:29,606][26022] Updated weights on worker 0-0, policy_version 987984 (0.00086) [2022-07-11 02:03:29,904][25689] Fps is (10 sec: 5597.8, 60 sec: 5553.3, 300 sec: 5553.6). Total num frames: 1011695616. Throughput: 0: 5806.0. Samples: 1011696214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:29,904][25689] Avg episode reward: [(0, '1.852')] [2022-07-11 02:03:31,582][26022] Updated weights on worker 0-0, policy_version 987994 (0.00073) [2022-07-11 02:03:33,690][26022] Updated weights on worker 0-0, policy_version 988004 (0.00091) [2022-07-11 02:03:34,957][25689] Fps is (10 sec: 5579.8, 60 sec: 5523.6, 300 sec: 5553.0). Total num frames: 1011724288. Throughput: 0: 5819.3. Samples: 1011729438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:34,957][25689] Avg episode reward: [(0, '1.917')] [2022-07-11 02:03:35,163][26022] Updated weights on worker 0-0, policy_version 988014 (0.00096) [2022-07-11 02:03:37,150][26022] Updated weights on worker 0-0, policy_version 988024 (0.00085) [2022-07-11 02:03:38,797][26022] Updated weights on worker 0-0, policy_version 988034 (0.00086) [2022-07-11 02:03:39,958][25689] Fps is (10 sec: 5703.6, 60 sec: 5541.6, 300 sec: 5549.6). Total num frames: 1011752960. Throughput: 0: 4999.2. Samples: 1011746178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:39,958][25689] Avg episode reward: [(0, '1.934')] [2022-07-11 02:03:40,856][26022] Updated weights on worker 0-0, policy_version 988044 (0.00093) [2022-07-11 02:03:42,527][26022] Updated weights on worker 0-0, policy_version 988054 (0.00077) [2022-07-11 02:03:44,490][26022] Updated weights on worker 0-0, policy_version 988064 (0.00095) [2022-07-11 02:03:44,973][25689] Fps is (10 sec: 5418.1, 60 sec: 5511.4, 300 sec: 5546.0). Total num frames: 1011778560. Throughput: 0: 5854.8. Samples: 1011780118. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:44,974][25689] Avg episode reward: [(0, '1.651')] [2022-07-11 02:03:46,008][26022] Updated weights on worker 0-0, policy_version 988074 (0.00093) [2022-07-11 02:03:48,208][26022] Updated weights on worker 0-0, policy_version 988084 (0.00091) [2022-07-11 02:03:49,919][26022] Updated weights on worker 0-0, policy_version 988094 (0.00089) [2022-07-11 02:03:50,015][25689] Fps is (10 sec: 5498.3, 60 sec: 5527.8, 300 sec: 5551.1). Total num frames: 1011808256. Throughput: 0: 5839.3. Samples: 1011813648. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:50,016][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 02:03:51,872][26022] Updated weights on worker 0-0, policy_version 988104 (0.00094) [2022-07-11 02:03:53,549][26022] Updated weights on worker 0-0, policy_version 988114 (0.00082) [2022-07-11 02:03:55,094][25689] Fps is (10 sec: 5767.5, 60 sec: 5541.9, 300 sec: 5553.6). Total num frames: 1011836928. Throughput: 0: 4995.7. Samples: 1011830036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:03:55,094][25689] Avg episode reward: [(0, '0.275')] [2022-07-11 02:03:55,402][26022] Updated weights on worker 0-0, policy_version 988124 (0.00091) [2022-07-11 02:03:57,153][26022] Updated weights on worker 0-0, policy_version 988134 (0.00093) [2022-07-11 02:03:59,158][26022] Updated weights on worker 0-0, policy_version 988144 (0.00096) [2022-07-11 02:04:00,168][25689] Fps is (10 sec: 5547.0, 60 sec: 5519.1, 300 sec: 5552.6). Total num frames: 1011864576. Throughput: 0: 5819.5. Samples: 1011863790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:04:00,168][25689] Avg episode reward: [(0, '0.274')] [2022-07-11 02:04:00,863][26022] Updated weights on worker 0-0, policy_version 988154 (0.00097) [2022-07-11 02:04:03,243][26022] Updated weights on worker 0-0, policy_version 988164 (0.00089) [2022-07-11 02:04:04,994][26022] Updated weights on worker 0-0, policy_version 988174 (0.00078) [2022-07-11 02:04:05,170][25689] Fps is (10 sec: 5385.9, 60 sec: 5555.7, 300 sec: 5549.8). Total num frames: 1011891200. Throughput: 0: 5679.4. Samples: 1011894824. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:04:05,171][25689] Avg episode reward: [(0, '-0.034')] [2022-07-11 02:04:06,856][26022] Updated weights on worker 0-0, policy_version 988184 (0.00094) [2022-07-11 02:04:08,716][26022] Updated weights on worker 0-0, policy_version 988194 (0.00084) [2022-07-11 02:04:10,192][25689] Fps is (10 sec: 5516.4, 60 sec: 5554.6, 300 sec: 5555.0). Total num frames: 1011919872. Throughput: 0: 4854.2. Samples: 1011911592. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:04:10,192][25689] Avg episode reward: [(0, '-0.683')] [2022-07-11 02:04:10,392][26022] Updated weights on worker 0-0, policy_version 988204 (0.00096) [2022-07-11 02:04:12,517][26022] Updated weights on worker 0-0, policy_version 988214 (0.00094) [2022-07-11 02:04:14,073][26022] Updated weights on worker 0-0, policy_version 988224 (0.00089) [2022-07-11 02:04:15,295][25689] Fps is (10 sec: 5461.5, 60 sec: 5517.3, 300 sec: 5543.1). Total num frames: 1011946496. Throughput: 0: 5693.5. Samples: 1011945052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:04:15,295][25689] Avg episode reward: [(0, '-0.736')] [2022-07-11 02:04:16,157][26022] Updated weights on worker 0-0, policy_version 988234 (0.00084) [2022-07-11 02:04:17,937][26022] Updated weights on worker 0-0, policy_version 988244 (0.00079) [2022-07-11 02:04:19,586][26022] Updated weights on worker 0-0, policy_version 988254 (0.00084) [2022-07-11 02:04:20,392][25689] Fps is (10 sec: 5421.2, 60 sec: 5547.1, 300 sec: 5548.3). Total num frames: 1011975168. Throughput: 0: 5676.6. Samples: 1011978592. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:04:20,392][25689] Avg episode reward: [(0, '-0.907')] [2022-07-11 02:04:21,699][26022] Updated weights on worker 0-0, policy_version 988264 (0.00089) [2022-07-11 02:04:23,434][26022] Updated weights on worker 0-0, policy_version 988274 (0.00087) [2022-07-11 02:04:25,282][26022] Updated weights on worker 0-0, policy_version 988284 (0.00091) [2022-07-11 02:04:25,431][25689] Fps is (10 sec: 5657.2, 60 sec: 5545.3, 300 sec: 5551.3). Total num frames: 1012003840. Throughput: 0: 5797.9. Samples: 1012012296. Policy #0 lag: (min: 0.0, avg: 8.9, max: 18.0) [2022-07-11 02:04:25,433][25689] Avg episode reward: [(0, '-0.607')] [2022-07-11 02:04:27,187][26022] Updated weights on worker 0-0, policy_version 988294 (0.00090) [2022-07-11 02:04:28,834][26022] Updated weights on worker 0-0, policy_version 988304 (0.00088) [2022-07-11 02:04:30,467][25689] Fps is (10 sec: 5589.7, 60 sec: 5545.8, 300 sec: 5544.8). Total num frames: 1012031488. Throughput: 0: 5797.3. Samples: 1012029136. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:04:30,468][25689] Avg episode reward: [(0, '-0.885')] [2022-07-11 02:04:30,765][26022] Updated weights on worker 0-0, policy_version 988314 (0.00088) [2022-07-11 02:04:32,435][26022] Updated weights on worker 0-0, policy_version 988324 (0.00087) [2022-07-11 02:04:34,553][26022] Updated weights on worker 0-0, policy_version 988334 (0.00085) [2022-07-11 02:04:35,567][25689] Fps is (10 sec: 5556.4, 60 sec: 5541.5, 300 sec: 5550.2). Total num frames: 1012060160. Throughput: 0: 5798.5. Samples: 1012062602. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:04:35,568][25689] Avg episode reward: [(0, '-0.644')] [2022-07-11 02:04:36,035][26022] Updated weights on worker 0-0, policy_version 988344 (0.00091) [2022-07-11 02:04:38,028][26022] Updated weights on worker 0-0, policy_version 988354 (0.00085) [2022-07-11 02:04:39,759][26022] Updated weights on worker 0-0, policy_version 988364 (0.00087) [2022-07-11 02:04:40,663][25689] Fps is (10 sec: 5624.1, 60 sec: 5532.8, 300 sec: 5549.6). Total num frames: 1012088832. Throughput: 0: 5816.6. Samples: 1012096504. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:04:40,664][25689] Avg episode reward: [(0, '0.600')] [2022-07-11 02:04:41,823][26022] Updated weights on worker 0-0, policy_version 988374 (0.00086) [2022-07-11 02:04:43,414][26022] Updated weights on worker 0-0, policy_version 988384 (0.00083) [2022-07-11 02:04:45,479][26022] Updated weights on worker 0-0, policy_version 988394 (0.00089) [2022-07-11 02:04:45,665][25689] Fps is (10 sec: 5577.7, 60 sec: 5567.8, 300 sec: 5543.2). Total num frames: 1012116480. Throughput: 0: 5002.4. Samples: 1012113512. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:04:45,665][25689] Avg episode reward: [(0, '1.441')] [2022-07-11 02:04:46,937][26022] Updated weights on worker 0-0, policy_version 988404 (0.00089) [2022-07-11 02:04:49,072][26022] Updated weights on worker 0-0, policy_version 988414 (0.00092) [2022-07-11 02:04:50,715][25689] Fps is (10 sec: 5603.2, 60 sec: 5550.2, 300 sec: 5546.7). Total num frames: 1012145152. Throughput: 0: 5844.0. Samples: 1012147462. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:04:50,715][25689] Avg episode reward: [(0, '0.240')] [2022-07-11 02:04:50,783][26022] Updated weights on worker 0-0, policy_version 988424 (0.00079) [2022-07-11 02:04:52,587][26022] Updated weights on worker 0-0, policy_version 988434 (0.00083) [2022-07-11 02:04:54,350][26022] Updated weights on worker 0-0, policy_version 988444 (0.00081) [2022-07-11 02:04:55,805][25689] Fps is (10 sec: 5655.1, 60 sec: 5549.1, 300 sec: 5549.9). Total num frames: 1012173824. Throughput: 0: 5857.7. Samples: 1012181148. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:04:55,806][25689] Avg episode reward: [(0, '-0.214')] [2022-07-11 02:04:56,349][26022] Updated weights on worker 0-0, policy_version 988454 (0.00086) [2022-07-11 02:04:57,994][26022] Updated weights on worker 0-0, policy_version 988464 (0.00090) [2022-07-11 02:04:59,906][26022] Updated weights on worker 0-0, policy_version 988474 (0.00080) [2022-07-11 02:05:00,821][25689] Fps is (10 sec: 5775.8, 60 sec: 5588.3, 300 sec: 5557.6). Total num frames: 1012203520. Throughput: 0: 5043.7. Samples: 1012198170. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:00,821][25689] Avg episode reward: [(0, '-0.568')] [2022-07-11 02:05:01,757][26022] Updated weights on worker 0-0, policy_version 988484 (0.00100) [2022-07-11 02:05:04,037][26022] Updated weights on worker 0-0, policy_version 988494 (0.00091) [2022-07-11 02:05:05,542][26022] Updated weights on worker 0-0, policy_version 988504 (0.00091) [2022-07-11 02:05:05,916][25689] Fps is (10 sec: 5469.1, 60 sec: 5562.9, 300 sec: 5547.1). Total num frames: 1012229120. Throughput: 0: 5736.2. Samples: 1012229676. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:05,916][25689] Avg episode reward: [(0, '-1.397')] [2022-07-11 02:05:07,543][26022] Updated weights on worker 0-0, policy_version 988514 (0.00348) [2022-07-11 02:05:09,107][26022] Updated weights on worker 0-0, policy_version 988524 (0.00093) [2022-07-11 02:05:10,947][25689] Fps is (10 sec: 5258.2, 60 sec: 5545.1, 300 sec: 5547.3). Total num frames: 1012256768. Throughput: 0: 5728.9. Samples: 1012263372. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:10,951][25689] Avg episode reward: [(0, '-1.509')] [2022-07-11 02:05:11,293][26022] Updated weights on worker 0-0, policy_version 988534 (0.00088) [2022-07-11 02:05:13,114][26022] Updated weights on worker 0-0, policy_version 988544 (0.00091) [2022-07-11 02:05:14,794][26022] Updated weights on worker 0-0, policy_version 988554 (0.00092) [2022-07-11 02:05:16,034][25689] Fps is (10 sec: 5566.0, 60 sec: 5580.3, 300 sec: 5547.1). Total num frames: 1012285440. Throughput: 0: 4888.1. Samples: 1012280030. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:16,035][25689] Avg episode reward: [(0, '-1.313')] [2022-07-11 02:05:16,691][26022] Updated weights on worker 0-0, policy_version 988564 (0.00091) [2022-07-11 02:05:18,438][26022] Updated weights on worker 0-0, policy_version 988574 (0.00092) [2022-07-11 02:05:20,355][26022] Updated weights on worker 0-0, policy_version 988584 (0.00096) [2022-07-11 02:05:21,057][25689] Fps is (10 sec: 5672.0, 60 sec: 5587.1, 300 sec: 5547.4). Total num frames: 1012314112. Throughput: 0: 5713.7. Samples: 1012313796. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:21,058][25689] Avg episode reward: [(0, '-0.190')] [2022-07-11 02:05:22,374][26022] Updated weights on worker 0-0, policy_version 988594 (0.00086) [2022-07-11 02:05:23,776][26022] Updated weights on worker 0-0, policy_version 988604 (0.00096) [2022-07-11 02:05:25,184][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:05:25,193][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000988610_1012336640.pth [2022-07-11 02:05:25,193][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000986656_1010335744.pth [2022-07-11 02:05:26,059][25689] Fps is (10 sec: 5413.7, 60 sec: 5539.9, 300 sec: 5540.8). Total num frames: 1012339712. Throughput: 0: 5860.4. Samples: 1012347724. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:26,061][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 02:05:26,113][26022] Updated weights on worker 0-0, policy_version 988614 (0.00087) [2022-07-11 02:05:27,396][26022] Updated weights on worker 0-0, policy_version 988624 (0.00088) [2022-07-11 02:05:29,605][26022] Updated weights on worker 0-0, policy_version 988634 (0.00615) [2022-07-11 02:05:31,085][25689] Fps is (10 sec: 5616.6, 60 sec: 5591.5, 300 sec: 5552.0). Total num frames: 1012370432. Throughput: 0: 5017.5. Samples: 1012364412. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:31,085][25689] Avg episode reward: [(0, '1.106')] [2022-07-11 02:05:31,207][26022] Updated weights on worker 0-0, policy_version 988644 (0.00085) [2022-07-11 02:05:33,426][26022] Updated weights on worker 0-0, policy_version 988654 (0.01059) [2022-07-11 02:05:35,053][26022] Updated weights on worker 0-0, policy_version 988664 (0.00088) [2022-07-11 02:05:36,230][25689] Fps is (10 sec: 5638.2, 60 sec: 5553.6, 300 sec: 5550.5). Total num frames: 1012397056. Throughput: 0: 5820.0. Samples: 1012397568. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:36,230][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 02:05:36,819][26022] Updated weights on worker 0-0, policy_version 988674 (0.00087) [2022-07-11 02:05:38,596][26022] Updated weights on worker 0-0, policy_version 988684 (0.00086) [2022-07-11 02:05:40,564][26022] Updated weights on worker 0-0, policy_version 988694 (0.00090) [2022-07-11 02:05:41,275][25689] Fps is (10 sec: 5426.5, 60 sec: 5558.3, 300 sec: 5544.0). Total num frames: 1012425728. Throughput: 0: 5808.9. Samples: 1012431238. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:41,275][25689] Avg episode reward: [(0, '0.520')] [2022-07-11 02:05:42,281][26022] Updated weights on worker 0-0, policy_version 988704 (0.00086) [2022-07-11 02:05:44,282][26022] Updated weights on worker 0-0, policy_version 988714 (0.00086) [2022-07-11 02:05:46,022][26022] Updated weights on worker 0-0, policy_version 988724 (0.00088) [2022-07-11 02:05:46,299][25689] Fps is (10 sec: 5695.3, 60 sec: 5573.1, 300 sec: 5550.7). Total num frames: 1012454400. Throughput: 0: 4956.5. Samples: 1012448040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:46,299][25689] Avg episode reward: [(0, '0.150')] [2022-07-11 02:05:47,783][26022] Updated weights on worker 0-0, policy_version 988734 (0.00092) [2022-07-11 02:05:49,815][26022] Updated weights on worker 0-0, policy_version 988744 (0.00098) [2022-07-11 02:05:51,303][25689] Fps is (10 sec: 5718.2, 60 sec: 5577.3, 300 sec: 5554.9). Total num frames: 1012483072. Throughput: 0: 5799.3. Samples: 1012481664. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:51,303][25689] Avg episode reward: [(0, '0.222')] [2022-07-11 02:05:51,427][26022] Updated weights on worker 0-0, policy_version 988754 (0.00091) [2022-07-11 02:05:53,471][26022] Updated weights on worker 0-0, policy_version 988764 (0.00082) [2022-07-11 02:05:55,080][26022] Updated weights on worker 0-0, policy_version 988774 (0.00086) [2022-07-11 02:05:56,419][25689] Fps is (10 sec: 5564.8, 60 sec: 5558.0, 300 sec: 5551.2). Total num frames: 1012510720. Throughput: 0: 5829.4. Samples: 1012515260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:05:56,420][25689] Avg episode reward: [(0, '0.049')] [2022-07-11 02:05:57,299][26022] Updated weights on worker 0-0, policy_version 988784 (0.00088) [2022-07-11 02:05:59,025][26022] Updated weights on worker 0-0, policy_version 988794 (0.00087) [2022-07-11 02:06:00,691][26022] Updated weights on worker 0-0, policy_version 988804 (0.00753) [2022-07-11 02:06:01,451][25689] Fps is (10 sec: 5650.9, 60 sec: 5556.6, 300 sec: 5564.9). Total num frames: 1012540416. Throughput: 0: 5003.0. Samples: 1012532180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:01,451][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 02:06:02,885][26022] Updated weights on worker 0-0, policy_version 988814 (0.00089) [2022-07-11 02:06:04,674][26022] Updated weights on worker 0-0, policy_version 988824 (0.00085) [2022-07-11 02:06:06,513][25689] Fps is (10 sec: 5376.9, 60 sec: 5542.7, 300 sec: 5551.8). Total num frames: 1012564992. Throughput: 0: 5723.6. Samples: 1012563738. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:06,513][25689] Avg episode reward: [(0, '0.598')] [2022-07-11 02:06:06,563][26022] Updated weights on worker 0-0, policy_version 988834 (0.00094) [2022-07-11 02:06:08,293][26022] Updated weights on worker 0-0, policy_version 988844 (0.00086) [2022-07-11 02:06:10,290][26022] Updated weights on worker 0-0, policy_version 988854 (0.00088) [2022-07-11 02:06:11,588][25689] Fps is (10 sec: 5252.6, 60 sec: 5555.6, 300 sec: 5554.8). Total num frames: 1012593664. Throughput: 0: 5702.9. Samples: 1012597350. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:11,590][25689] Avg episode reward: [(0, '1.202')] [2022-07-11 02:06:12,060][26022] Updated weights on worker 0-0, policy_version 988864 (0.00090) [2022-07-11 02:06:13,879][26022] Updated weights on worker 0-0, policy_version 988874 (0.00091) [2022-07-11 02:06:15,491][26022] Updated weights on worker 0-0, policy_version 988884 (0.00083) [2022-07-11 02:06:16,654][25689] Fps is (10 sec: 5553.5, 60 sec: 5540.6, 300 sec: 5550.3). Total num frames: 1012621312. Throughput: 0: 5719.1. Samples: 1012630984. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:16,656][25689] Avg episode reward: [(0, '-0.048')] [2022-07-11 02:06:17,508][26022] Updated weights on worker 0-0, policy_version 988894 (0.00086) [2022-07-11 02:06:19,435][26022] Updated weights on worker 0-0, policy_version 988904 (0.00089) [2022-07-11 02:06:21,018][26022] Updated weights on worker 0-0, policy_version 988914 (0.00087) [2022-07-11 02:06:21,662][25689] Fps is (10 sec: 5692.4, 60 sec: 5558.9, 300 sec: 5554.3). Total num frames: 1012651008. Throughput: 0: 5729.2. Samples: 1012647974. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:21,663][25689] Avg episode reward: [(0, '0.092')] [2022-07-11 02:06:23,052][26022] Updated weights on worker 0-0, policy_version 988924 (0.00090) [2022-07-11 02:06:24,656][26022] Updated weights on worker 0-0, policy_version 988934 (0.00086) [2022-07-11 02:06:26,520][26022] Updated weights on worker 0-0, policy_version 988944 (0.00088) [2022-07-11 02:06:26,670][25689] Fps is (10 sec: 5725.4, 60 sec: 5592.1, 300 sec: 5552.5). Total num frames: 1012678656. Throughput: 0: 5850.8. Samples: 1012681672. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:26,670][25689] Avg episode reward: [(0, '-0.427')] [2022-07-11 02:06:28,622][26022] Updated weights on worker 0-0, policy_version 988954 (0.00089) [2022-07-11 02:06:30,150][26022] Updated weights on worker 0-0, policy_version 988964 (0.00380) [2022-07-11 02:06:31,683][25689] Fps is (10 sec: 5416.0, 60 sec: 5525.7, 300 sec: 5553.5). Total num frames: 1012705280. Throughput: 0: 5868.3. Samples: 1012715270. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:31,683][25689] Avg episode reward: [(0, '-0.804')] [2022-07-11 02:06:32,181][26022] Updated weights on worker 0-0, policy_version 988974 (0.00085) [2022-07-11 02:06:34,002][26022] Updated weights on worker 0-0, policy_version 988984 (0.00089) [2022-07-11 02:06:35,660][26022] Updated weights on worker 0-0, policy_version 988994 (0.00088) [2022-07-11 02:06:36,815][25689] Fps is (10 sec: 5551.6, 60 sec: 5577.6, 300 sec: 5554.5). Total num frames: 1012734976. Throughput: 0: 4997.6. Samples: 1012731738. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:36,815][25689] Avg episode reward: [(0, '-0.947')] [2022-07-11 02:06:37,939][26022] Updated weights on worker 0-0, policy_version 989004 (0.00085) [2022-07-11 02:06:39,285][26022] Updated weights on worker 0-0, policy_version 989014 (0.00092) [2022-07-11 02:06:41,429][26022] Updated weights on worker 0-0, policy_version 989024 (0.00087) [2022-07-11 02:06:41,834][25689] Fps is (10 sec: 5749.9, 60 sec: 5580.0, 300 sec: 5554.2). Total num frames: 1012763648. Throughput: 0: 5803.2. Samples: 1012765036. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:41,834][25689] Avg episode reward: [(0, '0.058')] [2022-07-11 02:06:43,163][26022] Updated weights on worker 0-0, policy_version 989034 (0.00095) [2022-07-11 02:06:45,003][26022] Updated weights on worker 0-0, policy_version 989044 (0.00089) [2022-07-11 02:06:46,875][25689] Fps is (10 sec: 5496.7, 60 sec: 5544.6, 300 sec: 5554.3). Total num frames: 1012790272. Throughput: 0: 5784.2. Samples: 1012798540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:46,875][25689] Avg episode reward: [(0, '-0.048')] [2022-07-11 02:06:46,919][26022] Updated weights on worker 0-0, policy_version 989054 (0.00096) [2022-07-11 02:06:48,702][26022] Updated weights on worker 0-0, policy_version 989064 (0.00052) [2022-07-11 02:06:50,552][26022] Updated weights on worker 0-0, policy_version 989074 (0.00093) [2022-07-11 02:06:51,890][25689] Fps is (10 sec: 5600.4, 60 sec: 5560.5, 300 sec: 5558.4). Total num frames: 1012819968. Throughput: 0: 4939.6. Samples: 1012815086. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:51,891][25689] Avg episode reward: [(0, '-0.323')] [2022-07-11 02:06:52,615][26022] Updated weights on worker 0-0, policy_version 989084 (0.00085) [2022-07-11 02:06:54,270][26022] Updated weights on worker 0-0, policy_version 989094 (0.00088) [2022-07-11 02:06:56,196][26022] Updated weights on worker 0-0, policy_version 989104 (0.00093) [2022-07-11 02:06:56,970][25689] Fps is (10 sec: 5680.0, 60 sec: 5563.9, 300 sec: 5550.3). Total num frames: 1012847616. Throughput: 0: 5802.0. Samples: 1012848680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:06:56,971][25689] Avg episode reward: [(0, '0.578')] [2022-07-11 02:06:57,718][26022] Updated weights on worker 0-0, policy_version 989114 (0.00087) [2022-07-11 02:06:59,849][26022] Updated weights on worker 0-0, policy_version 989124 (0.00085) [2022-07-11 02:07:01,596][26022] Updated weights on worker 0-0, policy_version 989134 (0.00097) [2022-07-11 02:07:02,031][25689] Fps is (10 sec: 5352.0, 60 sec: 5510.5, 300 sec: 5556.4). Total num frames: 1012874240. Throughput: 0: 5777.1. Samples: 1012881716. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:02,031][25689] Avg episode reward: [(0, '0.670')] [2022-07-11 02:07:03,741][26022] Updated weights on worker 0-0, policy_version 989144 (0.00086) [2022-07-11 02:07:05,555][26022] Updated weights on worker 0-0, policy_version 989154 (0.00098) [2022-07-11 02:07:07,047][25689] Fps is (10 sec: 5182.5, 60 sec: 5531.5, 300 sec: 5546.2). Total num frames: 1012899840. Throughput: 0: 4890.3. Samples: 1012897190. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:07,048][25689] Avg episode reward: [(0, '0.529')] [2022-07-11 02:07:07,479][26022] Updated weights on worker 0-0, policy_version 989164 (0.00084) [2022-07-11 02:07:09,147][26022] Updated weights on worker 0-0, policy_version 989174 (0.00113) [2022-07-11 02:07:11,197][26022] Updated weights on worker 0-0, policy_version 989184 (0.00086) [2022-07-11 02:07:12,064][25689] Fps is (10 sec: 5511.4, 60 sec: 5553.8, 300 sec: 5554.7). Total num frames: 1012929536. Throughput: 0: 5732.7. Samples: 1012930736. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:12,064][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 02:07:12,817][26022] Updated weights on worker 0-0, policy_version 989194 (0.00356) [2022-07-11 02:07:14,810][26022] Updated weights on worker 0-0, policy_version 989204 (0.00091) [2022-07-11 02:07:16,806][26022] Updated weights on worker 0-0, policy_version 989214 (0.00085) [2022-07-11 02:07:17,133][25689] Fps is (10 sec: 5685.9, 60 sec: 5553.6, 300 sec: 5546.6). Total num frames: 1012957184. Throughput: 0: 5731.0. Samples: 1012964230. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:17,133][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 02:07:18,486][26022] Updated weights on worker 0-0, policy_version 989224 (0.00088) [2022-07-11 02:07:20,388][26022] Updated weights on worker 0-0, policy_version 989234 (0.00091) [2022-07-11 02:07:22,078][26022] Updated weights on worker 0-0, policy_version 989244 (0.00368) [2022-07-11 02:07:22,160][25689] Fps is (10 sec: 5578.5, 60 sec: 5534.9, 300 sec: 5550.1). Total num frames: 1012985856. Throughput: 0: 4923.3. Samples: 1012980818. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:22,160][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 02:07:23,831][26022] Updated weights on worker 0-0, policy_version 989254 (0.00094) [2022-07-11 02:07:25,307][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:07:25,322][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000989261_1013003264.pth [2022-07-11 02:07:25,325][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000987307_1011002368.pth [2022-07-11 02:07:26,255][26022] Updated weights on worker 0-0, policy_version 989264 (0.00092) [2022-07-11 02:07:27,172][25689] Fps is (10 sec: 5508.0, 60 sec: 5517.6, 300 sec: 5550.8). Total num frames: 1013012480. Throughput: 0: 5831.8. Samples: 1013014552. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:27,172][25689] Avg episode reward: [(0, '0.186')] [2022-07-11 02:07:27,378][26022] Updated weights on worker 0-0, policy_version 989274 (0.00089) [2022-07-11 02:07:29,692][26022] Updated weights on worker 0-0, policy_version 989284 (0.00091) [2022-07-11 02:07:31,159][26022] Updated weights on worker 0-0, policy_version 989294 (0.00094) [2022-07-11 02:07:32,179][25689] Fps is (10 sec: 5416.6, 60 sec: 5535.0, 300 sec: 5542.1). Total num frames: 1013040128. Throughput: 0: 5829.9. Samples: 1013048008. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:32,180][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 02:07:33,257][26022] Updated weights on worker 0-0, policy_version 989304 (0.00088) [2022-07-11 02:07:35,242][26022] Updated weights on worker 0-0, policy_version 989314 (0.00086) [2022-07-11 02:07:36,969][26022] Updated weights on worker 0-0, policy_version 989324 (0.00084) [2022-07-11 02:07:37,289][25689] Fps is (10 sec: 5566.7, 60 sec: 5520.1, 300 sec: 5543.8). Total num frames: 1013068800. Throughput: 0: 4974.3. Samples: 1013064496. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:37,290][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 02:07:38,703][26022] Updated weights on worker 0-0, policy_version 989334 (0.00084) [2022-07-11 02:07:40,707][26022] Updated weights on worker 0-0, policy_version 989344 (0.00095) [2022-07-11 02:07:42,267][26022] Updated weights on worker 0-0, policy_version 989354 (0.00095) [2022-07-11 02:07:42,364][25689] Fps is (10 sec: 5730.9, 60 sec: 5531.9, 300 sec: 5550.3). Total num frames: 1013098496. Throughput: 0: 5806.1. Samples: 1013098126. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:42,365][25689] Avg episode reward: [(0, '1.279')] [2022-07-11 02:07:44,503][26022] Updated weights on worker 0-0, policy_version 989364 (0.00088) [2022-07-11 02:07:46,019][26022] Updated weights on worker 0-0, policy_version 989374 (0.00092) [2022-07-11 02:07:47,425][25689] Fps is (10 sec: 5657.8, 60 sec: 5547.0, 300 sec: 5546.4). Total num frames: 1013126144. Throughput: 0: 5785.2. Samples: 1013131718. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:47,425][25689] Avg episode reward: [(0, '1.780')] [2022-07-11 02:07:48,021][26022] Updated weights on worker 0-0, policy_version 989384 (0.00086) [2022-07-11 02:07:49,702][26022] Updated weights on worker 0-0, policy_version 989394 (0.00085) [2022-07-11 02:07:51,785][26022] Updated weights on worker 0-0, policy_version 989404 (0.00084) [2022-07-11 02:07:52,454][25689] Fps is (10 sec: 5480.5, 60 sec: 5512.0, 300 sec: 5546.7). Total num frames: 1013153792. Throughput: 0: 4962.0. Samples: 1013148610. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:52,456][25689] Avg episode reward: [(0, '2.453')] [2022-07-11 02:07:53,264][26022] Updated weights on worker 0-0, policy_version 989414 (0.00093) [2022-07-11 02:07:55,339][26022] Updated weights on worker 0-0, policy_version 989424 (0.00097) [2022-07-11 02:07:57,011][26022] Updated weights on worker 0-0, policy_version 989434 (0.00087) [2022-07-11 02:07:57,522][25689] Fps is (10 sec: 5577.5, 60 sec: 5529.9, 300 sec: 5545.6). Total num frames: 1013182464. Throughput: 0: 5816.7. Samples: 1013182186. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:07:57,524][25689] Avg episode reward: [(0, '1.894')] [2022-07-11 02:07:59,046][26022] Updated weights on worker 0-0, policy_version 989444 (0.00088) [2022-07-11 02:08:00,837][26022] Updated weights on worker 0-0, policy_version 989454 (0.00089) [2022-07-11 02:08:02,525][25689] Fps is (10 sec: 5388.6, 60 sec: 5518.2, 300 sec: 5549.6). Total num frames: 1013208064. Throughput: 0: 5744.6. Samples: 1013213944. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:08:02,527][25689] Avg episode reward: [(0, '1.463')] [2022-07-11 02:08:02,880][26022] Updated weights on worker 0-0, policy_version 989464 (0.00081) [2022-07-11 02:08:05,094][26022] Updated weights on worker 0-0, policy_version 989474 (0.00063) [2022-07-11 02:08:06,520][26022] Updated weights on worker 0-0, policy_version 989484 (0.00095) [2022-07-11 02:08:07,532][25689] Fps is (10 sec: 5422.2, 60 sec: 5570.0, 300 sec: 5549.7). Total num frames: 1013236736. Throughput: 0: 4912.6. Samples: 1013230496. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:08:07,532][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 02:08:08,605][26022] Updated weights on worker 0-0, policy_version 989494 (0.00094) [2022-07-11 02:08:10,365][26022] Updated weights on worker 0-0, policy_version 989504 (0.00083) [2022-07-11 02:08:12,086][26022] Updated weights on worker 0-0, policy_version 989514 (0.00086) [2022-07-11 02:08:12,551][25689] Fps is (10 sec: 5617.8, 60 sec: 5535.9, 300 sec: 5547.1). Total num frames: 1013264384. Throughput: 0: 5730.2. Samples: 1013263770. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:08:12,552][25689] Avg episode reward: [(0, '0.889')] [2022-07-11 02:08:14,235][26022] Updated weights on worker 0-0, policy_version 989524 (0.00094) [2022-07-11 02:08:15,684][26022] Updated weights on worker 0-0, policy_version 989534 (0.00084) [2022-07-11 02:08:17,635][25689] Fps is (10 sec: 5473.2, 60 sec: 5534.5, 300 sec: 5550.0). Total num frames: 1013292032. Throughput: 0: 5738.7. Samples: 1013297604. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:08:17,635][25689] Avg episode reward: [(0, '0.029')] [2022-07-11 02:08:17,667][26022] Updated weights on worker 0-0, policy_version 989544 (0.00108) [2022-07-11 02:08:19,541][26022] Updated weights on worker 0-0, policy_version 989554 (0.00083) [2022-07-11 02:08:21,275][26022] Updated weights on worker 0-0, policy_version 989564 (0.00091) [2022-07-11 02:08:22,655][25689] Fps is (10 sec: 5472.5, 60 sec: 5518.2, 300 sec: 5546.5). Total num frames: 1013319680. Throughput: 0: 4982.9. Samples: 1013314248. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:08:22,655][25689] Avg episode reward: [(0, '-0.308')] [2022-07-11 02:08:23,196][26022] Updated weights on worker 0-0, policy_version 989574 (0.00088) [2022-07-11 02:08:24,719][26022] Updated weights on worker 0-0, policy_version 989584 (0.00080) [2022-07-11 02:08:26,703][26022] Updated weights on worker 0-0, policy_version 989594 (0.00086) [2022-07-11 02:08:27,693][25689] Fps is (10 sec: 5599.6, 60 sec: 5549.7, 300 sec: 5550.0). Total num frames: 1013348352. Throughput: 0: 5821.4. Samples: 1013347860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 02:08:27,693][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 02:08:28,448][26022] Updated weights on worker 0-0, policy_version 989604 (0.00085) [2022-07-11 02:08:30,609][26022] Updated weights on worker 0-0, policy_version 989614 (0.00096) [2022-07-11 02:08:32,444][26022] Updated weights on worker 0-0, policy_version 989624 (0.00087) [2022-07-11 02:08:32,711][25689] Fps is (10 sec: 5600.6, 60 sec: 5548.7, 300 sec: 5547.2). Total num frames: 1013376000. Throughput: 0: 5819.6. Samples: 1013381096. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:08:32,712][25689] Avg episode reward: [(0, '0.531')] [2022-07-11 02:08:34,175][26022] Updated weights on worker 0-0, policy_version 989634 (0.00091) [2022-07-11 02:08:36,020][26022] Updated weights on worker 0-0, policy_version 989644 (0.00085) [2022-07-11 02:08:37,804][25689] Fps is (10 sec: 5569.9, 60 sec: 5550.2, 300 sec: 5545.5). Total num frames: 1013404672. Throughput: 0: 5795.7. Samples: 1013414500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:08:37,805][25689] Avg episode reward: [(0, '-0.409')] [2022-07-11 02:08:37,863][26022] Updated weights on worker 0-0, policy_version 989654 (0.00094) [2022-07-11 02:08:39,775][26022] Updated weights on worker 0-0, policy_version 989664 (0.00095) [2022-07-11 02:08:41,481][26022] Updated weights on worker 0-0, policy_version 989674 (0.00094) [2022-07-11 02:08:42,808][25689] Fps is (10 sec: 5679.6, 60 sec: 5539.9, 300 sec: 5556.1). Total num frames: 1013433344. Throughput: 0: 5806.3. Samples: 1013431260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:08:42,808][25689] Avg episode reward: [(0, '-0.134')] [2022-07-11 02:08:43,620][26022] Updated weights on worker 0-0, policy_version 989684 (0.00084) [2022-07-11 02:08:45,130][26022] Updated weights on worker 0-0, policy_version 989694 (0.00084) [2022-07-11 02:08:47,186][26022] Updated weights on worker 0-0, policy_version 989704 (0.00085) [2022-07-11 02:08:47,816][25689] Fps is (10 sec: 5522.9, 60 sec: 5527.7, 300 sec: 5546.4). Total num frames: 1013459968. Throughput: 0: 5811.0. Samples: 1013464798. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:08:47,817][25689] Avg episode reward: [(0, '0.392')] [2022-07-11 02:08:48,949][26022] Updated weights on worker 0-0, policy_version 989714 (0.00090) [2022-07-11 02:08:50,577][26022] Updated weights on worker 0-0, policy_version 989724 (0.00087) [2022-07-11 02:08:52,689][26022] Updated weights on worker 0-0, policy_version 989734 (0.00087) [2022-07-11 02:08:52,822][25689] Fps is (10 sec: 5419.2, 60 sec: 5529.8, 300 sec: 5544.3). Total num frames: 1013487616. Throughput: 0: 5823.4. Samples: 1013498212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:08:52,823][25689] Avg episode reward: [(0, '0.613')] [2022-07-11 02:08:54,174][26022] Updated weights on worker 0-0, policy_version 989744 (0.00086) [2022-07-11 02:08:56,287][26022] Updated weights on worker 0-0, policy_version 989754 (0.00088) [2022-07-11 02:08:57,956][25689] Fps is (10 sec: 5655.6, 60 sec: 5540.8, 300 sec: 5550.1). Total num frames: 1013517312. Throughput: 0: 4990.8. Samples: 1013515072. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:08:57,956][25689] Avg episode reward: [(0, '0.634')] [2022-07-11 02:08:58,384][26022] Updated weights on worker 0-0, policy_version 989764 (0.00087) [2022-07-11 02:08:59,884][26022] Updated weights on worker 0-0, policy_version 989774 (0.00091) [2022-07-11 02:09:02,164][26022] Updated weights on worker 0-0, policy_version 989784 (0.00086) [2022-07-11 02:09:02,959][25689] Fps is (10 sec: 5354.1, 60 sec: 5523.8, 300 sec: 5543.2). Total num frames: 1013541888. Throughput: 0: 5812.0. Samples: 1013548378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:02,959][25689] Avg episode reward: [(0, '1.519')] [2022-07-11 02:09:03,788][26022] Updated weights on worker 0-0, policy_version 989794 (0.00084) [2022-07-11 02:09:05,910][26022] Updated weights on worker 0-0, policy_version 989804 (0.00096) [2022-07-11 02:09:07,644][26022] Updated weights on worker 0-0, policy_version 989814 (0.00093) [2022-07-11 02:09:08,029][25689] Fps is (10 sec: 5286.2, 60 sec: 5518.0, 300 sec: 5542.3). Total num frames: 1013570560. Throughput: 0: 5678.8. Samples: 1013579580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:08,029][25689] Avg episode reward: [(0, '1.879')] [2022-07-11 02:09:09,495][26022] Updated weights on worker 0-0, policy_version 989824 (0.00092) [2022-07-11 02:09:11,476][26022] Updated weights on worker 0-0, policy_version 989834 (0.00089) [2022-07-11 02:09:13,032][25689] Fps is (10 sec: 5693.0, 60 sec: 5536.4, 300 sec: 5551.0). Total num frames: 1013599232. Throughput: 0: 4855.4. Samples: 1013596336. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:13,032][25689] Avg episode reward: [(0, '1.672')] [2022-07-11 02:09:13,188][26022] Updated weights on worker 0-0, policy_version 989844 (0.00085) [2022-07-11 02:09:14,992][26022] Updated weights on worker 0-0, policy_version 989854 (0.00088) [2022-07-11 02:09:17,042][26022] Updated weights on worker 0-0, policy_version 989864 (0.00092) [2022-07-11 02:09:18,067][25689] Fps is (10 sec: 5610.4, 60 sec: 5540.9, 300 sec: 5548.7). Total num frames: 1013626880. Throughput: 0: 5712.9. Samples: 1013629966. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:18,067][25689] Avg episode reward: [(0, '1.668')] [2022-07-11 02:09:18,811][26022] Updated weights on worker 0-0, policy_version 989874 (0.00085) [2022-07-11 02:09:20,463][26022] Updated weights on worker 0-0, policy_version 989884 (0.00088) [2022-07-11 02:09:22,434][26022] Updated weights on worker 0-0, policy_version 989894 (0.00088) [2022-07-11 02:09:23,079][25689] Fps is (10 sec: 5503.7, 60 sec: 5541.7, 300 sec: 5545.8). Total num frames: 1013654528. Throughput: 0: 5720.1. Samples: 1013663466. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:23,079][25689] Avg episode reward: [(0, '1.607')] [2022-07-11 02:09:24,072][26022] Updated weights on worker 0-0, policy_version 989904 (0.00087) [2022-07-11 02:09:25,340][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:09:25,353][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000989909_1013666816.pth [2022-07-11 02:09:25,354][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000987958_1011668992.pth [2022-07-11 02:09:26,167][26022] Updated weights on worker 0-0, policy_version 989914 (0.00094) [2022-07-11 02:09:27,873][26022] Updated weights on worker 0-0, policy_version 989924 (0.00084) [2022-07-11 02:09:28,096][25689] Fps is (10 sec: 5513.3, 60 sec: 5526.5, 300 sec: 5546.1). Total num frames: 1013682176. Throughput: 0: 5006.9. Samples: 1013680058. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:28,097][25689] Avg episode reward: [(0, '1.147')] [2022-07-11 02:09:29,695][26022] Updated weights on worker 0-0, policy_version 989934 (0.00086) [2022-07-11 02:09:31,630][26022] Updated weights on worker 0-0, policy_version 989944 (0.00091) [2022-07-11 02:09:33,127][25689] Fps is (10 sec: 5706.8, 60 sec: 5559.3, 300 sec: 5550.9). Total num frames: 1013711872. Throughput: 0: 5831.9. Samples: 1013713530. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:33,129][25689] Avg episode reward: [(0, '1.003')] [2022-07-11 02:09:33,292][26022] Updated weights on worker 0-0, policy_version 989954 (0.00092) [2022-07-11 02:09:35,287][26022] Updated weights on worker 0-0, policy_version 989964 (0.00082) [2022-07-11 02:09:37,247][26022] Updated weights on worker 0-0, policy_version 989974 (0.00084) [2022-07-11 02:09:38,268][25689] Fps is (10 sec: 5536.8, 60 sec: 5521.0, 300 sec: 5543.1). Total num frames: 1013738496. Throughput: 0: 5777.6. Samples: 1013746682. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:38,269][25689] Avg episode reward: [(0, '1.026')] [2022-07-11 02:09:38,940][26022] Updated weights on worker 0-0, policy_version 989984 (0.00093) [2022-07-11 02:09:40,905][26022] Updated weights on worker 0-0, policy_version 989994 (0.00094) [2022-07-11 02:09:42,802][26022] Updated weights on worker 0-0, policy_version 990004 (0.00068) [2022-07-11 02:09:43,314][25689] Fps is (10 sec: 5428.2, 60 sec: 5517.2, 300 sec: 5545.8). Total num frames: 1013767168. Throughput: 0: 4947.5. Samples: 1013763580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:43,314][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 02:09:44,552][26022] Updated weights on worker 0-0, policy_version 990014 (0.00086) [2022-07-11 02:09:46,392][26022] Updated weights on worker 0-0, policy_version 990024 (0.00087) [2022-07-11 02:09:48,019][26022] Updated weights on worker 0-0, policy_version 990034 (0.00086) [2022-07-11 02:09:48,364][25689] Fps is (10 sec: 5680.2, 60 sec: 5547.2, 300 sec: 5545.8). Total num frames: 1013795840. Throughput: 0: 5774.9. Samples: 1013797100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:48,364][25689] Avg episode reward: [(0, '0.723')] [2022-07-11 02:09:49,885][26022] Updated weights on worker 0-0, policy_version 990044 (0.00086) [2022-07-11 02:09:51,903][26022] Updated weights on worker 0-0, policy_version 990054 (0.00085) [2022-07-11 02:09:53,433][25689] Fps is (10 sec: 5565.7, 60 sec: 5541.5, 300 sec: 5542.7). Total num frames: 1013823488. Throughput: 0: 5788.7. Samples: 1013831076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:53,433][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 02:09:53,630][26022] Updated weights on worker 0-0, policy_version 990064 (0.00092) [2022-07-11 02:09:55,504][26022] Updated weights on worker 0-0, policy_version 990074 (0.00088) [2022-07-11 02:09:57,294][26022] Updated weights on worker 0-0, policy_version 990084 (0.00078) [2022-07-11 02:09:58,515][25689] Fps is (10 sec: 5548.2, 60 sec: 5529.3, 300 sec: 5538.0). Total num frames: 1013852160. Throughput: 0: 4987.9. Samples: 1013847670. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:09:58,515][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 02:09:59,063][26022] Updated weights on worker 0-0, policy_version 990094 (0.00093) [2022-07-11 02:10:01,197][26022] Updated weights on worker 0-0, policy_version 990104 (0.00093) [2022-07-11 02:10:03,236][26022] Updated weights on worker 0-0, policy_version 990114 (0.00094) [2022-07-11 02:10:03,527][25689] Fps is (10 sec: 5376.7, 60 sec: 5545.4, 300 sec: 5539.6). Total num frames: 1013877760. Throughput: 0: 5779.5. Samples: 1013880402. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:03,527][25689] Avg episode reward: [(0, '0.237')] [2022-07-11 02:10:05,016][26022] Updated weights on worker 0-0, policy_version 990124 (0.00085) [2022-07-11 02:10:06,939][26022] Updated weights on worker 0-0, policy_version 990134 (0.00098) [2022-07-11 02:10:08,546][25689] Fps is (10 sec: 5308.3, 60 sec: 5533.1, 300 sec: 5539.8). Total num frames: 1013905408. Throughput: 0: 5724.7. Samples: 1013912636. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:08,546][25689] Avg episode reward: [(0, '0.529')] [2022-07-11 02:10:08,658][26022] Updated weights on worker 0-0, policy_version 990144 (0.00084) [2022-07-11 02:10:10,583][26022] Updated weights on worker 0-0, policy_version 990154 (0.01421) [2022-07-11 02:10:12,408][26022] Updated weights on worker 0-0, policy_version 990164 (0.00089) [2022-07-11 02:10:13,585][25689] Fps is (10 sec: 5599.5, 60 sec: 5529.8, 300 sec: 5540.7). Total num frames: 1013934080. Throughput: 0: 4875.3. Samples: 1013929322. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:13,585][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 02:10:14,234][26022] Updated weights on worker 0-0, policy_version 990174 (0.00093) [2022-07-11 02:10:16,118][26022] Updated weights on worker 0-0, policy_version 990184 (0.00095) [2022-07-11 02:10:17,949][26022] Updated weights on worker 0-0, policy_version 990194 (0.00088) [2022-07-11 02:10:18,646][25689] Fps is (10 sec: 5677.8, 60 sec: 5544.4, 300 sec: 5540.0). Total num frames: 1013962752. Throughput: 0: 5710.7. Samples: 1013962630. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:18,646][25689] Avg episode reward: [(0, '0.647')] [2022-07-11 02:10:19,727][26022] Updated weights on worker 0-0, policy_version 990204 (0.00085) [2022-07-11 02:10:21,624][26022] Updated weights on worker 0-0, policy_version 990214 (0.00085) [2022-07-11 02:10:23,395][26022] Updated weights on worker 0-0, policy_version 990224 (0.00091) [2022-07-11 02:10:23,723][25689] Fps is (10 sec: 5555.4, 60 sec: 5538.4, 300 sec: 5545.5). Total num frames: 1013990400. Throughput: 0: 5748.2. Samples: 1013996492. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:23,723][25689] Avg episode reward: [(0, '0.938')] [2022-07-11 02:10:25,228][26022] Updated weights on worker 0-0, policy_version 990234 (0.00100) [2022-07-11 02:10:27,202][26022] Updated weights on worker 0-0, policy_version 990244 (0.00093) [2022-07-11 02:10:28,800][25689] Fps is (10 sec: 5546.5, 60 sec: 5549.9, 300 sec: 5537.6). Total num frames: 1014019072. Throughput: 0: 4975.4. Samples: 1014013408. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:28,800][25689] Avg episode reward: [(0, '0.927')] [2022-07-11 02:10:28,951][26022] Updated weights on worker 0-0, policy_version 990254 (0.00720) [2022-07-11 02:10:30,671][26022] Updated weights on worker 0-0, policy_version 990264 (0.00096) [2022-07-11 02:10:32,565][26022] Updated weights on worker 0-0, policy_version 990274 (0.00093) [2022-07-11 02:10:33,809][25689] Fps is (10 sec: 5685.1, 60 sec: 5534.9, 300 sec: 5547.0). Total num frames: 1014047744. Throughput: 0: 5820.5. Samples: 1014047040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:33,810][25689] Avg episode reward: [(0, '1.529')] [2022-07-11 02:10:34,520][26022] Updated weights on worker 0-0, policy_version 990284 (0.00084) [2022-07-11 02:10:36,335][26022] Updated weights on worker 0-0, policy_version 990294 (0.00569) [2022-07-11 02:10:37,918][26022] Updated weights on worker 0-0, policy_version 990304 (0.00087) [2022-07-11 02:10:38,892][25689] Fps is (10 sec: 5682.2, 60 sec: 5574.1, 300 sec: 5546.4). Total num frames: 1014076416. Throughput: 0: 5846.2. Samples: 1014080992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:38,894][25689] Avg episode reward: [(0, '1.038')] [2022-07-11 02:10:40,019][26022] Updated weights on worker 0-0, policy_version 990314 (0.00088) [2022-07-11 02:10:41,465][26022] Updated weights on worker 0-0, policy_version 990324 (0.00087) [2022-07-11 02:10:43,516][26022] Updated weights on worker 0-0, policy_version 990334 (0.00079) [2022-07-11 02:10:43,918][25689] Fps is (10 sec: 5571.3, 60 sec: 5558.9, 300 sec: 5542.9). Total num frames: 1014104064. Throughput: 0: 5011.6. Samples: 1014097704. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:43,919][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 02:10:45,280][26022] Updated weights on worker 0-0, policy_version 990344 (0.00093) [2022-07-11 02:10:47,101][26022] Updated weights on worker 0-0, policy_version 990354 (0.00090) [2022-07-11 02:10:48,954][25689] Fps is (10 sec: 5495.3, 60 sec: 5543.3, 300 sec: 5538.8). Total num frames: 1014131712. Throughput: 0: 5865.6. Samples: 1014131624. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:48,955][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 02:10:49,023][26022] Updated weights on worker 0-0, policy_version 990364 (0.00081) [2022-07-11 02:10:50,899][26022] Updated weights on worker 0-0, policy_version 990374 (0.00085) [2022-07-11 02:10:52,686][26022] Updated weights on worker 0-0, policy_version 990384 (0.00093) [2022-07-11 02:10:53,959][25689] Fps is (10 sec: 5507.1, 60 sec: 5549.2, 300 sec: 5540.9). Total num frames: 1014159360. Throughput: 0: 5860.2. Samples: 1014165122. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:53,960][25689] Avg episode reward: [(0, '0.986')] [2022-07-11 02:10:54,517][26022] Updated weights on worker 0-0, policy_version 990394 (0.00093) [2022-07-11 02:10:56,341][26022] Updated weights on worker 0-0, policy_version 990404 (0.00097) [2022-07-11 02:10:58,323][26022] Updated weights on worker 0-0, policy_version 990414 (0.00094) [2022-07-11 02:10:59,069][25689] Fps is (10 sec: 5669.2, 60 sec: 5563.5, 300 sec: 5539.4). Total num frames: 1014189056. Throughput: 0: 4985.6. Samples: 1014181588. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:10:59,070][25689] Avg episode reward: [(0, '0.775')] [2022-07-11 02:11:00,025][26022] Updated weights on worker 0-0, policy_version 990424 (0.00086) [2022-07-11 02:11:02,084][26022] Updated weights on worker 0-0, policy_version 990434 (0.00097) [2022-07-11 02:11:04,102][25689] Fps is (10 sec: 5350.7, 60 sec: 5544.7, 300 sec: 5540.0). Total num frames: 1014213632. Throughput: 0: 5722.3. Samples: 1014213202. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:04,103][25689] Avg episode reward: [(0, '0.895')] [2022-07-11 02:11:04,225][26022] Updated weights on worker 0-0, policy_version 990444 (0.00093) [2022-07-11 02:11:05,924][26022] Updated weights on worker 0-0, policy_version 990454 (0.00087) [2022-07-11 02:11:07,803][26022] Updated weights on worker 0-0, policy_version 990464 (0.00088) [2022-07-11 02:11:09,120][25689] Fps is (10 sec: 5297.7, 60 sec: 5561.7, 300 sec: 5541.1). Total num frames: 1014242304. Throughput: 0: 5678.8. Samples: 1014246144. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:09,121][25689] Avg episode reward: [(0, '1.510')] [2022-07-11 02:11:09,722][26022] Updated weights on worker 0-0, policy_version 990474 (0.00085) [2022-07-11 02:11:11,431][26022] Updated weights on worker 0-0, policy_version 990484 (0.00088) [2022-07-11 02:11:13,514][26022] Updated weights on worker 0-0, policy_version 990494 (0.00085) [2022-07-11 02:11:14,147][25689] Fps is (10 sec: 5606.9, 60 sec: 5545.9, 300 sec: 5541.8). Total num frames: 1014269952. Throughput: 0: 5659.1. Samples: 1014279366. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:14,147][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 02:11:15,211][26022] Updated weights on worker 0-0, policy_version 990504 (0.00089) [2022-07-11 02:11:17,111][26022] Updated weights on worker 0-0, policy_version 990514 (0.00116) [2022-07-11 02:11:18,975][26022] Updated weights on worker 0-0, policy_version 990524 (0.00090) [2022-07-11 02:11:19,227][25689] Fps is (10 sec: 5471.4, 60 sec: 5527.3, 300 sec: 5533.6). Total num frames: 1014297600. Throughput: 0: 5679.6. Samples: 1014296076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:19,227][25689] Avg episode reward: [(0, '1.618')] [2022-07-11 02:11:20,640][26022] Updated weights on worker 0-0, policy_version 990534 (0.00098) [2022-07-11 02:11:22,607][26022] Updated weights on worker 0-0, policy_version 990544 (0.00086) [2022-07-11 02:11:24,271][25689] Fps is (10 sec: 5563.0, 60 sec: 5547.2, 300 sec: 5536.3). Total num frames: 1014326272. Throughput: 0: 5774.0. Samples: 1014329656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:24,271][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 02:11:24,326][26022] Updated weights on worker 0-0, policy_version 990554 (0.00084) [2022-07-11 02:11:25,439][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:11:25,448][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000990559_1014332416.pth [2022-07-11 02:11:25,449][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000988610_1012336640.pth [2022-07-11 02:11:26,113][26022] Updated weights on worker 0-0, policy_version 990564 (0.00084) [2022-07-11 02:11:28,232][26022] Updated weights on worker 0-0, policy_version 990574 (0.00086) [2022-07-11 02:11:29,303][25689] Fps is (10 sec: 5589.1, 60 sec: 5534.4, 300 sec: 5539.4). Total num frames: 1014353920. Throughput: 0: 5797.4. Samples: 1014363154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:29,304][25689] Avg episode reward: [(0, '1.242')] [2022-07-11 02:11:29,648][26022] Updated weights on worker 0-0, policy_version 990584 (0.00086) [2022-07-11 02:11:31,751][26022] Updated weights on worker 0-0, policy_version 990594 (0.00086) [2022-07-11 02:11:33,496][26022] Updated weights on worker 0-0, policy_version 990604 (0.00091) [2022-07-11 02:11:34,306][25689] Fps is (10 sec: 5408.2, 60 sec: 5501.1, 300 sec: 5531.5). Total num frames: 1014380544. Throughput: 0: 4985.7. Samples: 1014379874. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:34,308][25689] Avg episode reward: [(0, '1.027')] [2022-07-11 02:11:35,360][26022] Updated weights on worker 0-0, policy_version 990614 (0.00088) [2022-07-11 02:11:37,533][26022] Updated weights on worker 0-0, policy_version 990624 (0.00083) [2022-07-11 02:11:38,892][26022] Updated weights on worker 0-0, policy_version 990634 (0.00089) [2022-07-11 02:11:39,378][25689] Fps is (10 sec: 5692.1, 60 sec: 5536.0, 300 sec: 5537.4). Total num frames: 1014411264. Throughput: 0: 5819.8. Samples: 1014413352. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:39,379][25689] Avg episode reward: [(0, '1.192')] [2022-07-11 02:11:41,038][26022] Updated weights on worker 0-0, policy_version 990644 (0.00096) [2022-07-11 02:11:42,841][26022] Updated weights on worker 0-0, policy_version 990654 (0.00100) [2022-07-11 02:11:44,385][25689] Fps is (10 sec: 5689.3, 60 sec: 5520.8, 300 sec: 5538.0). Total num frames: 1014437888. Throughput: 0: 5831.9. Samples: 1014446962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:44,394][25689] Avg episode reward: [(0, '1.085')] [2022-07-11 02:11:44,600][26022] Updated weights on worker 0-0, policy_version 990664 (0.00087) [2022-07-11 02:11:46,500][26022] Updated weights on worker 0-0, policy_version 990674 (0.00093) [2022-07-11 02:11:48,243][26022] Updated weights on worker 0-0, policy_version 990684 (0.00091) [2022-07-11 02:11:49,411][25689] Fps is (10 sec: 5613.2, 60 sec: 5555.5, 300 sec: 5537.8). Total num frames: 1014467584. Throughput: 0: 4995.0. Samples: 1014463592. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:49,412][25689] Avg episode reward: [(0, '1.198')] [2022-07-11 02:11:50,250][26022] Updated weights on worker 0-0, policy_version 990694 (0.00096) [2022-07-11 02:11:51,985][26022] Updated weights on worker 0-0, policy_version 990704 (0.00084) [2022-07-11 02:11:53,929][26022] Updated weights on worker 0-0, policy_version 990714 (0.00098) [2022-07-11 02:11:54,424][25689] Fps is (10 sec: 5610.1, 60 sec: 5537.9, 300 sec: 5535.6). Total num frames: 1014494208. Throughput: 0: 5824.9. Samples: 1014497060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:54,425][25689] Avg episode reward: [(0, '1.617')] [2022-07-11 02:11:55,661][26022] Updated weights on worker 0-0, policy_version 990724 (0.00080) [2022-07-11 02:11:57,469][26022] Updated weights on worker 0-0, policy_version 990734 (0.00092) [2022-07-11 02:11:59,267][26022] Updated weights on worker 0-0, policy_version 990744 (0.00638) [2022-07-11 02:11:59,510][25689] Fps is (10 sec: 5577.1, 60 sec: 5540.1, 300 sec: 5545.5). Total num frames: 1014523904. Throughput: 0: 5824.1. Samples: 1014530602. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:11:59,510][25689] Avg episode reward: [(0, '0.494')] [2022-07-11 02:12:01,081][26022] Updated weights on worker 0-0, policy_version 990754 (0.00089) [2022-07-11 02:12:03,266][26022] Updated weights on worker 0-0, policy_version 990764 (0.00085) [2022-07-11 02:12:04,574][25689] Fps is (10 sec: 5448.2, 60 sec: 5554.2, 300 sec: 5544.6). Total num frames: 1014549504. Throughput: 0: 4876.1. Samples: 1014545402. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:12:04,575][25689] Avg episode reward: [(0, '0.552')] [2022-07-11 02:12:05,120][26022] Updated weights on worker 0-0, policy_version 990774 (0.00096) [2022-07-11 02:12:06,823][26022] Updated weights on worker 0-0, policy_version 990784 (0.00086) [2022-07-11 02:12:08,747][26022] Updated weights on worker 0-0, policy_version 990794 (0.00086) [2022-07-11 02:12:09,630][25689] Fps is (10 sec: 5362.8, 60 sec: 5550.7, 300 sec: 5540.4). Total num frames: 1014578176. Throughput: 0: 5724.9. Samples: 1014579342. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:12:09,630][25689] Avg episode reward: [(0, '0.708')] [2022-07-11 02:12:10,531][26022] Updated weights on worker 0-0, policy_version 990804 (0.00094) [2022-07-11 02:12:12,488][26022] Updated weights on worker 0-0, policy_version 990814 (0.00091) [2022-07-11 02:12:14,308][26022] Updated weights on worker 0-0, policy_version 990824 (0.00088) [2022-07-11 02:12:14,643][25689] Fps is (10 sec: 5491.6, 60 sec: 5535.0, 300 sec: 5538.0). Total num frames: 1014604800. Throughput: 0: 5696.4. Samples: 1014612232. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:12:14,643][25689] Avg episode reward: [(0, '0.382')] [2022-07-11 02:12:16,307][26022] Updated weights on worker 0-0, policy_version 990834 (0.00085) [2022-07-11 02:12:18,055][26022] Updated weights on worker 0-0, policy_version 990844 (0.00083) [2022-07-11 02:12:19,721][25689] Fps is (10 sec: 5479.4, 60 sec: 5552.1, 300 sec: 5537.1). Total num frames: 1014633472. Throughput: 0: 4863.0. Samples: 1014628892. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:12:19,722][25689] Avg episode reward: [(0, '0.028')] [2022-07-11 02:12:19,946][26022] Updated weights on worker 0-0, policy_version 990854 (0.00086) [2022-07-11 02:12:21,528][26022] Updated weights on worker 0-0, policy_version 990864 (0.00087) [2022-07-11 02:12:23,438][26022] Updated weights on worker 0-0, policy_version 990874 (0.00086) [2022-07-11 02:12:24,783][25689] Fps is (10 sec: 5655.1, 60 sec: 5550.5, 300 sec: 5543.0). Total num frames: 1014662144. Throughput: 0: 5829.9. Samples: 1014663220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 02:12:24,784][25689] Avg episode reward: [(0, '0.069')] [2022-07-11 02:12:25,403][26022] Updated weights on worker 0-0, policy_version 990884 (0.00093) [2022-07-11 02:12:26,972][26022] Updated weights on worker 0-0, policy_version 990894 (0.00095) [2022-07-11 02:12:29,043][26022] Updated weights on worker 0-0, policy_version 990904 (0.00090) [2022-07-11 02:12:29,817][25689] Fps is (10 sec: 5578.5, 60 sec: 5550.3, 300 sec: 5542.5). Total num frames: 1014689792. Throughput: 0: 5820.2. Samples: 1014696838. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:12:29,818][25689] Avg episode reward: [(0, '0.778')] [2022-07-11 02:12:30,677][26022] Updated weights on worker 0-0, policy_version 990914 (0.00080) [2022-07-11 02:12:32,669][26022] Updated weights on worker 0-0, policy_version 990924 (0.00084) [2022-07-11 02:12:34,585][26022] Updated weights on worker 0-0, policy_version 990934 (0.00090) [2022-07-11 02:12:34,847][25689] Fps is (10 sec: 5494.6, 60 sec: 5564.8, 300 sec: 5540.6). Total num frames: 1014717440. Throughput: 0: 5014.7. Samples: 1014713548. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:12:34,847][25689] Avg episode reward: [(0, '1.438')] [2022-07-11 02:12:36,241][26022] Updated weights on worker 0-0, policy_version 990944 (0.00091) [2022-07-11 02:12:38,223][26022] Updated weights on worker 0-0, policy_version 990954 (0.00088) [2022-07-11 02:12:39,716][26022] Updated weights on worker 0-0, policy_version 990964 (0.00086) [2022-07-11 02:12:39,902][25689] Fps is (10 sec: 5686.4, 60 sec: 5549.4, 300 sec: 5540.9). Total num frames: 1014747136. Throughput: 0: 5850.9. Samples: 1014746966. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:12:39,902][25689] Avg episode reward: [(0, '1.070')] [2022-07-11 02:12:41,744][26022] Updated weights on worker 0-0, policy_version 990974 (0.00080) [2022-07-11 02:12:43,473][26022] Updated weights on worker 0-0, policy_version 990984 (0.00091) [2022-07-11 02:12:44,905][25689] Fps is (10 sec: 5701.1, 60 sec: 5566.7, 300 sec: 5542.0). Total num frames: 1014774784. Throughput: 0: 5841.0. Samples: 1014780754. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:12:44,908][25689] Avg episode reward: [(0, '1.193')] [2022-07-11 02:12:45,466][26022] Updated weights on worker 0-0, policy_version 990994 (0.00083) [2022-07-11 02:12:47,195][26022] Updated weights on worker 0-0, policy_version 991004 (0.00081) [2022-07-11 02:12:49,213][26022] Updated weights on worker 0-0, policy_version 991014 (0.00090) [2022-07-11 02:12:49,919][25689] Fps is (10 sec: 5520.3, 60 sec: 5534.0, 300 sec: 5542.3). Total num frames: 1014802432. Throughput: 0: 5008.6. Samples: 1014797518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:12:49,920][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 02:12:50,762][26022] Updated weights on worker 0-0, policy_version 991024 (0.00088) [2022-07-11 02:12:52,796][26022] Updated weights on worker 0-0, policy_version 991034 (0.00091) [2022-07-11 02:12:54,335][26022] Updated weights on worker 0-0, policy_version 991044 (0.00085) [2022-07-11 02:12:54,940][25689] Fps is (10 sec: 5612.5, 60 sec: 5567.1, 300 sec: 5543.2). Total num frames: 1014831104. Throughput: 0: 5872.4. Samples: 1014831544. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:12:54,942][25689] Avg episode reward: [(0, '1.398')] [2022-07-11 02:12:56,399][26022] Updated weights on worker 0-0, policy_version 991054 (0.01132) [2022-07-11 02:12:58,168][26022] Updated weights on worker 0-0, policy_version 991064 (0.00100) [2022-07-11 02:12:59,953][26022] Updated weights on worker 0-0, policy_version 991074 (0.00087) [2022-07-11 02:13:00,045][25689] Fps is (10 sec: 5663.1, 60 sec: 5548.4, 300 sec: 5551.6). Total num frames: 1014859776. Throughput: 0: 5877.3. Samples: 1014865352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:00,045][25689] Avg episode reward: [(0, '1.592')] [2022-07-11 02:13:01,740][26022] Updated weights on worker 0-0, policy_version 991084 (0.00109) [2022-07-11 02:13:04,018][26022] Updated weights on worker 0-0, policy_version 991094 (0.00103) [2022-07-11 02:13:05,051][25689] Fps is (10 sec: 5469.1, 60 sec: 5570.7, 300 sec: 5544.7). Total num frames: 1014886400. Throughput: 0: 5007.7. Samples: 1014881636. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:05,051][25689] Avg episode reward: [(0, '1.452')] [2022-07-11 02:13:05,803][26022] Updated weights on worker 0-0, policy_version 991104 (0.00086) [2022-07-11 02:13:07,504][26022] Updated weights on worker 0-0, policy_version 991114 (0.00092) [2022-07-11 02:13:09,595][26022] Updated weights on worker 0-0, policy_version 991124 (0.00087) [2022-07-11 02:13:10,136][25689] Fps is (10 sec: 5377.8, 60 sec: 5551.0, 300 sec: 5543.5). Total num frames: 1014914048. Throughput: 0: 5777.8. Samples: 1014914332. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:10,138][25689] Avg episode reward: [(0, '2.045')] [2022-07-11 02:13:11,075][26022] Updated weights on worker 0-0, policy_version 991134 (0.00083) [2022-07-11 02:13:13,392][26022] Updated weights on worker 0-0, policy_version 991144 (0.00102) [2022-07-11 02:13:14,590][26022] Updated weights on worker 0-0, policy_version 991154 (0.00089) [2022-07-11 02:13:15,163][25689] Fps is (10 sec: 5670.9, 60 sec: 5600.5, 300 sec: 5551.5). Total num frames: 1014943744. Throughput: 0: 5742.2. Samples: 1014947666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:15,165][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 02:13:16,847][26022] Updated weights on worker 0-0, policy_version 991164 (0.00086) [2022-07-11 02:13:18,426][26022] Updated weights on worker 0-0, policy_version 991174 (0.00090) [2022-07-11 02:13:20,271][25689] Fps is (10 sec: 5557.0, 60 sec: 5563.9, 300 sec: 5546.4). Total num frames: 1014970368. Throughput: 0: 4907.3. Samples: 1014964612. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:20,272][25689] Avg episode reward: [(0, '1.413')] [2022-07-11 02:13:20,472][26022] Updated weights on worker 0-0, policy_version 991184 (0.00080) [2022-07-11 02:13:22,289][26022] Updated weights on worker 0-0, policy_version 991194 (0.00093) [2022-07-11 02:13:24,211][26022] Updated weights on worker 0-0, policy_version 991204 (0.00083) [2022-07-11 02:13:25,304][25689] Fps is (10 sec: 5553.5, 60 sec: 5583.5, 300 sec: 5549.9). Total num frames: 1015000064. Throughput: 0: 5770.2. Samples: 1014998502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:25,304][25689] Avg episode reward: [(0, '1.436')] [2022-07-11 02:13:25,531][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:13:25,550][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000991213_1015002112.pth [2022-07-11 02:13:25,551][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000989261_1013003264.pth [2022-07-11 02:13:25,854][26022] Updated weights on worker 0-0, policy_version 991214 (0.00096) [2022-07-11 02:13:27,984][26022] Updated weights on worker 0-0, policy_version 991224 (0.00091) [2022-07-11 02:13:29,425][26022] Updated weights on worker 0-0, policy_version 991234 (0.00083) [2022-07-11 02:13:30,383][25689] Fps is (10 sec: 5671.0, 60 sec: 5579.4, 300 sec: 5548.8). Total num frames: 1015027712. Throughput: 0: 5801.7. Samples: 1015031798. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:30,384][25689] Avg episode reward: [(0, '1.445')] [2022-07-11 02:13:31,393][26022] Updated weights on worker 0-0, policy_version 991244 (0.00092) [2022-07-11 02:13:33,243][26022] Updated weights on worker 0-0, policy_version 991254 (0.00091) [2022-07-11 02:13:35,106][26022] Updated weights on worker 0-0, policy_version 991264 (0.00086) [2022-07-11 02:13:35,404][25689] Fps is (10 sec: 5474.9, 60 sec: 5580.2, 300 sec: 5546.7). Total num frames: 1015055360. Throughput: 0: 5806.3. Samples: 1015065192. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:35,404][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 02:13:37,004][26022] Updated weights on worker 0-0, policy_version 991274 (0.00085) [2022-07-11 02:13:38,708][26022] Updated weights on worker 0-0, policy_version 991284 (0.00088) [2022-07-11 02:13:40,443][25689] Fps is (10 sec: 5496.6, 60 sec: 5547.8, 300 sec: 5542.6). Total num frames: 1015083008. Throughput: 0: 5816.8. Samples: 1015081948. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:40,444][25689] Avg episode reward: [(0, '1.136')] [2022-07-11 02:13:40,576][26022] Updated weights on worker 0-0, policy_version 991294 (0.00088) [2022-07-11 02:13:42,407][26022] Updated weights on worker 0-0, policy_version 991304 (0.00107) [2022-07-11 02:13:44,336][26022] Updated weights on worker 0-0, policy_version 991314 (0.00092) [2022-07-11 02:13:45,451][25689] Fps is (10 sec: 5605.8, 60 sec: 5564.4, 300 sec: 5549.5). Total num frames: 1015111680. Throughput: 0: 5812.6. Samples: 1015115606. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:45,451][25689] Avg episode reward: [(0, '1.892')] [2022-07-11 02:13:46,000][26022] Updated weights on worker 0-0, policy_version 991324 (0.00092) [2022-07-11 02:13:48,259][26022] Updated weights on worker 0-0, policy_version 991334 (0.00087) [2022-07-11 02:13:49,791][26022] Updated weights on worker 0-0, policy_version 991344 (0.00097) [2022-07-11 02:13:50,457][25689] Fps is (10 sec: 5522.1, 60 sec: 5548.1, 300 sec: 5546.0). Total num frames: 1015138304. Throughput: 0: 5830.8. Samples: 1015148842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:50,457][25689] Avg episode reward: [(0, '1.727')] [2022-07-11 02:13:51,695][26022] Updated weights on worker 0-0, policy_version 991354 (0.00091) [2022-07-11 02:13:53,526][26022] Updated weights on worker 0-0, policy_version 991364 (0.00092) [2022-07-11 02:13:55,356][26022] Updated weights on worker 0-0, policy_version 991374 (0.00084) [2022-07-11 02:13:55,470][25689] Fps is (10 sec: 5518.9, 60 sec: 5548.8, 300 sec: 5544.8). Total num frames: 1015166976. Throughput: 0: 4981.0. Samples: 1015165142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:13:55,471][25689] Avg episode reward: [(0, '1.279')] [2022-07-11 02:13:57,313][26022] Updated weights on worker 0-0, policy_version 991384 (0.00091) [2022-07-11 02:13:59,122][26022] Updated weights on worker 0-0, policy_version 991394 (0.00086) [2022-07-11 02:14:00,540][25689] Fps is (10 sec: 5585.7, 60 sec: 5535.1, 300 sec: 5553.9). Total num frames: 1015194624. Throughput: 0: 5811.4. Samples: 1015198736. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:00,541][25689] Avg episode reward: [(0, '1.286')] [2022-07-11 02:14:00,763][26022] Updated weights on worker 0-0, policy_version 991404 (0.00083) [2022-07-11 02:14:03,166][26022] Updated weights on worker 0-0, policy_version 991414 (0.00086) [2022-07-11 02:14:04,893][26022] Updated weights on worker 0-0, policy_version 991424 (0.00091) [2022-07-11 02:14:05,561][25689] Fps is (10 sec: 5479.9, 60 sec: 5550.7, 300 sec: 5551.4). Total num frames: 1015222272. Throughput: 0: 5704.0. Samples: 1015230314. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:05,562][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 02:14:06,856][26022] Updated weights on worker 0-0, policy_version 991434 (0.00094) [2022-07-11 02:14:08,577][26022] Updated weights on worker 0-0, policy_version 991444 (0.00078) [2022-07-11 02:14:10,499][26022] Updated weights on worker 0-0, policy_version 991454 (0.00093) [2022-07-11 02:14:10,652][25689] Fps is (10 sec: 5468.0, 60 sec: 5550.1, 300 sec: 5546.3). Total num frames: 1015249920. Throughput: 0: 4875.0. Samples: 1015247296. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:10,654][25689] Avg episode reward: [(0, '1.060')] [2022-07-11 02:14:12,407][26022] Updated weights on worker 0-0, policy_version 991464 (0.00082) [2022-07-11 02:14:13,972][26022] Updated weights on worker 0-0, policy_version 991474 (0.00093) [2022-07-11 02:14:15,694][25689] Fps is (10 sec: 5456.7, 60 sec: 5514.9, 300 sec: 5546.2). Total num frames: 1015277568. Throughput: 0: 5710.6. Samples: 1015280634. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:15,695][25689] Avg episode reward: [(0, '0.927')] [2022-07-11 02:14:15,952][26022] Updated weights on worker 0-0, policy_version 991484 (0.00093) [2022-07-11 02:14:17,678][26022] Updated weights on worker 0-0, policy_version 991494 (0.00086) [2022-07-11 02:14:19,663][26022] Updated weights on worker 0-0, policy_version 991504 (0.00086) [2022-07-11 02:14:20,767][25689] Fps is (10 sec: 5568.4, 60 sec: 5552.0, 300 sec: 5548.5). Total num frames: 1015306240. Throughput: 0: 5711.6. Samples: 1015314264. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:20,767][25689] Avg episode reward: [(0, '0.994')] [2022-07-11 02:14:21,390][26022] Updated weights on worker 0-0, policy_version 991514 (0.00093) [2022-07-11 02:14:23,313][26022] Updated weights on worker 0-0, policy_version 991524 (0.00085) [2022-07-11 02:14:24,956][26022] Updated weights on worker 0-0, policy_version 991534 (0.00091) [2022-07-11 02:14:25,788][25689] Fps is (10 sec: 5681.3, 60 sec: 5536.2, 300 sec: 5551.8). Total num frames: 1015334912. Throughput: 0: 4984.7. Samples: 1015331140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:25,789][25689] Avg episode reward: [(0, '0.926')] [2022-07-11 02:14:27,097][26022] Updated weights on worker 0-0, policy_version 991544 (0.00085) [2022-07-11 02:14:28,771][26022] Updated weights on worker 0-0, policy_version 991554 (0.00088) [2022-07-11 02:14:30,740][26022] Updated weights on worker 0-0, policy_version 991564 (0.00083) [2022-07-11 02:14:30,838][25689] Fps is (10 sec: 5490.6, 60 sec: 5521.9, 300 sec: 5541.2). Total num frames: 1015361536. Throughput: 0: 5804.7. Samples: 1015364466. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:30,838][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 02:14:32,397][26022] Updated weights on worker 0-0, policy_version 991574 (0.00088) [2022-07-11 02:14:34,254][26022] Updated weights on worker 0-0, policy_version 991584 (0.00092) [2022-07-11 02:14:35,881][25689] Fps is (10 sec: 5478.7, 60 sec: 5536.8, 300 sec: 5549.9). Total num frames: 1015390208. Throughput: 0: 5828.6. Samples: 1015398292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:35,883][25689] Avg episode reward: [(0, '0.269')] [2022-07-11 02:14:35,951][26022] Updated weights on worker 0-0, policy_version 991594 (0.00087) [2022-07-11 02:14:37,851][26022] Updated weights on worker 0-0, policy_version 991604 (0.00087) [2022-07-11 02:14:39,703][26022] Updated weights on worker 0-0, policy_version 991614 (0.00093) [2022-07-11 02:14:40,943][25689] Fps is (10 sec: 5573.3, 60 sec: 5534.7, 300 sec: 5546.1). Total num frames: 1015417856. Throughput: 0: 4985.2. Samples: 1015414846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:40,943][25689] Avg episode reward: [(0, '0.286')] [2022-07-11 02:14:41,626][26022] Updated weights on worker 0-0, policy_version 991624 (0.00089) [2022-07-11 02:14:43,454][26022] Updated weights on worker 0-0, policy_version 991634 (0.00253) [2022-07-11 02:14:45,167][26022] Updated weights on worker 0-0, policy_version 991644 (0.00082) [2022-07-11 02:14:45,946][25689] Fps is (10 sec: 5493.8, 60 sec: 5518.2, 300 sec: 5543.6). Total num frames: 1015445504. Throughput: 0: 5806.2. Samples: 1015448182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:45,947][25689] Avg episode reward: [(0, '0.225')] [2022-07-11 02:14:47,236][26022] Updated weights on worker 0-0, policy_version 991654 (0.00103) [2022-07-11 02:14:48,887][26022] Updated weights on worker 0-0, policy_version 991664 (0.00084) [2022-07-11 02:14:50,860][26022] Updated weights on worker 0-0, policy_version 991674 (0.00091) [2022-07-11 02:14:50,960][25689] Fps is (10 sec: 5724.4, 60 sec: 5568.2, 300 sec: 5551.5). Total num frames: 1015475200. Throughput: 0: 5831.1. Samples: 1015481804. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:50,961][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 02:14:52,416][26022] Updated weights on worker 0-0, policy_version 991684 (0.00092) [2022-07-11 02:14:54,405][26022] Updated weights on worker 0-0, policy_version 991694 (0.00088) [2022-07-11 02:14:55,976][25689] Fps is (10 sec: 5717.5, 60 sec: 5551.1, 300 sec: 5549.3). Total num frames: 1015502848. Throughput: 0: 4995.9. Samples: 1015498684. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:14:55,976][25689] Avg episode reward: [(0, '1.060')] [2022-07-11 02:14:56,281][26022] Updated weights on worker 0-0, policy_version 991704 (0.00090) [2022-07-11 02:14:58,210][26022] Updated weights on worker 0-0, policy_version 991714 (0.00086) [2022-07-11 02:14:59,839][26022] Updated weights on worker 0-0, policy_version 991724 (0.00083) [2022-07-11 02:15:01,035][25689] Fps is (10 sec: 5387.2, 60 sec: 5535.1, 300 sec: 5551.8). Total num frames: 1015529472. Throughput: 0: 5832.3. Samples: 1015532028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:01,035][25689] Avg episode reward: [(0, '1.424')] [2022-07-11 02:15:02,231][26022] Updated weights on worker 0-0, policy_version 991734 (0.00076) [2022-07-11 02:15:03,962][26022] Updated weights on worker 0-0, policy_version 991744 (0.00089) [2022-07-11 02:15:06,048][25689] Fps is (10 sec: 5286.3, 60 sec: 5518.9, 300 sec: 5548.5). Total num frames: 1015556096. Throughput: 0: 5736.9. Samples: 1015563506. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:06,049][25689] Avg episode reward: [(0, '1.583')] [2022-07-11 02:15:06,050][26022] Updated weights on worker 0-0, policy_version 991754 (0.00088) [2022-07-11 02:15:07,567][26022] Updated weights on worker 0-0, policy_version 991764 (0.00088) [2022-07-11 02:15:09,637][26022] Updated weights on worker 0-0, policy_version 991774 (0.00094) [2022-07-11 02:15:11,056][25689] Fps is (10 sec: 5517.9, 60 sec: 5543.5, 300 sec: 5549.1). Total num frames: 1015584768. Throughput: 0: 4892.4. Samples: 1015580120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:11,056][25689] Avg episode reward: [(0, '1.486')] [2022-07-11 02:15:11,491][26022] Updated weights on worker 0-0, policy_version 991784 (0.00083) [2022-07-11 02:15:13,415][26022] Updated weights on worker 0-0, policy_version 991794 (0.00410) [2022-07-11 02:15:15,058][26022] Updated weights on worker 0-0, policy_version 991804 (0.00101) [2022-07-11 02:15:16,068][25689] Fps is (10 sec: 5518.7, 60 sec: 5529.3, 300 sec: 5543.1). Total num frames: 1015611392. Throughput: 0: 5723.4. Samples: 1015613682. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:16,070][25689] Avg episode reward: [(0, '1.949')] [2022-07-11 02:15:17,057][26022] Updated weights on worker 0-0, policy_version 991814 (0.00086) [2022-07-11 02:15:18,624][26022] Updated weights on worker 0-0, policy_version 991824 (0.00091) [2022-07-11 02:15:20,740][26022] Updated weights on worker 0-0, policy_version 991834 (0.00086) [2022-07-11 02:15:21,121][25689] Fps is (10 sec: 5493.6, 60 sec: 5531.0, 300 sec: 5547.0). Total num frames: 1015640064. Throughput: 0: 5718.7. Samples: 1015646898. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:21,122][25689] Avg episode reward: [(0, '1.641')] [2022-07-11 02:15:22,365][26022] Updated weights on worker 0-0, policy_version 991844 (0.00092) [2022-07-11 02:15:24,402][26022] Updated weights on worker 0-0, policy_version 991854 (0.00809) [2022-07-11 02:15:25,708][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:15:25,727][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000991862_1015666688.pth [2022-07-11 02:15:25,728][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000989909_1013666816.pth [2022-07-11 02:15:26,018][26022] Updated weights on worker 0-0, policy_version 991864 (0.00095) [2022-07-11 02:15:26,129][25689] Fps is (10 sec: 5699.5, 60 sec: 5532.2, 300 sec: 5548.3). Total num frames: 1015668736. Throughput: 0: 4990.5. Samples: 1015663720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:26,130][25689] Avg episode reward: [(0, '0.966')] [2022-07-11 02:15:28,020][26022] Updated weights on worker 0-0, policy_version 991874 (0.00088) [2022-07-11 02:15:29,787][26022] Updated weights on worker 0-0, policy_version 991884 (0.00099) [2022-07-11 02:15:31,161][25689] Fps is (10 sec: 5711.6, 60 sec: 5567.8, 300 sec: 5547.9). Total num frames: 1015697408. Throughput: 0: 5824.3. Samples: 1015697220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:31,163][25689] Avg episode reward: [(0, '0.689')] [2022-07-11 02:15:31,743][26022] Updated weights on worker 0-0, policy_version 991894 (0.00077) [2022-07-11 02:15:33,372][26022] Updated weights on worker 0-0, policy_version 991904 (0.00090) [2022-07-11 02:15:35,474][26022] Updated weights on worker 0-0, policy_version 991914 (0.00093) [2022-07-11 02:15:36,177][25689] Fps is (10 sec: 5503.3, 60 sec: 5536.4, 300 sec: 5542.2). Total num frames: 1015724032. Throughput: 0: 5810.6. Samples: 1015730528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:36,178][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 02:15:37,194][26022] Updated weights on worker 0-0, policy_version 991924 (0.00087) [2022-07-11 02:15:39,155][26022] Updated weights on worker 0-0, policy_version 991934 (0.00088) [2022-07-11 02:15:41,023][26022] Updated weights on worker 0-0, policy_version 991944 (0.00096) [2022-07-11 02:15:41,227][25689] Fps is (10 sec: 5391.6, 60 sec: 5537.5, 300 sec: 5541.8). Total num frames: 1015751680. Throughput: 0: 4987.0. Samples: 1015747166. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:41,228][25689] Avg episode reward: [(0, '-0.507')] [2022-07-11 02:15:42,875][26022] Updated weights on worker 0-0, policy_version 991954 (0.00088) [2022-07-11 02:15:44,614][26022] Updated weights on worker 0-0, policy_version 991964 (0.00086) [2022-07-11 02:15:46,237][25689] Fps is (10 sec: 5496.7, 60 sec: 5536.9, 300 sec: 5542.2). Total num frames: 1015779328. Throughput: 0: 5812.8. Samples: 1015780602. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:46,242][25689] Avg episode reward: [(0, '-0.628')] [2022-07-11 02:15:46,469][26022] Updated weights on worker 0-0, policy_version 991974 (0.00091) [2022-07-11 02:15:48,339][26022] Updated weights on worker 0-0, policy_version 991984 (0.00091) [2022-07-11 02:15:50,119][26022] Updated weights on worker 0-0, policy_version 991994 (0.00094) [2022-07-11 02:15:51,250][25689] Fps is (10 sec: 5619.2, 60 sec: 5520.0, 300 sec: 5545.5). Total num frames: 1015808000. Throughput: 0: 5804.1. Samples: 1015813818. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:51,252][25689] Avg episode reward: [(0, '-0.444')] [2022-07-11 02:15:51,994][26022] Updated weights on worker 0-0, policy_version 992004 (0.00084) [2022-07-11 02:15:53,907][26022] Updated weights on worker 0-0, policy_version 992014 (0.00089) [2022-07-11 02:15:55,578][26022] Updated weights on worker 0-0, policy_version 992024 (0.00092) [2022-07-11 02:15:56,299][25689] Fps is (10 sec: 5597.3, 60 sec: 5516.9, 300 sec: 5539.8). Total num frames: 1015835648. Throughput: 0: 4980.0. Samples: 1015830736. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:15:56,299][25689] Avg episode reward: [(0, '-0.528')] [2022-07-11 02:15:57,420][26022] Updated weights on worker 0-0, policy_version 992034 (0.00088) [2022-07-11 02:15:59,267][26022] Updated weights on worker 0-0, policy_version 992044 (0.00093) [2022-07-11 02:16:01,247][26022] Updated weights on worker 0-0, policy_version 992054 (0.00085) [2022-07-11 02:16:01,335][25689] Fps is (10 sec: 5482.9, 60 sec: 5536.0, 300 sec: 5550.1). Total num frames: 1015863296. Throughput: 0: 5826.6. Samples: 1015864328. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:16:01,336][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 02:16:03,200][26022] Updated weights on worker 0-0, policy_version 992064 (0.00089) [2022-07-11 02:16:05,278][26022] Updated weights on worker 0-0, policy_version 992074 (0.00085) [2022-07-11 02:16:06,337][25689] Fps is (10 sec: 5406.4, 60 sec: 5537.0, 300 sec: 5543.5). Total num frames: 1015889920. Throughput: 0: 5735.6. Samples: 1015895890. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:16:06,338][25689] Avg episode reward: [(0, '-0.187')] [2022-07-11 02:16:06,816][26022] Updated weights on worker 0-0, policy_version 992084 (0.00087) [2022-07-11 02:16:09,023][26022] Updated weights on worker 0-0, policy_version 992094 (0.00082) [2022-07-11 02:16:10,719][26022] Updated weights on worker 0-0, policy_version 992104 (0.00088) [2022-07-11 02:16:11,370][25689] Fps is (10 sec: 5306.3, 60 sec: 5500.8, 300 sec: 5539.9). Total num frames: 1015916544. Throughput: 0: 4904.6. Samples: 1015912498. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:16:11,371][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 02:16:12,591][26022] Updated weights on worker 0-0, policy_version 992114 (0.00085) [2022-07-11 02:16:14,361][26022] Updated weights on worker 0-0, policy_version 992124 (0.00092) [2022-07-11 02:16:16,373][25689] Fps is (10 sec: 5407.8, 60 sec: 5518.6, 300 sec: 5541.3). Total num frames: 1015944192. Throughput: 0: 5751.9. Samples: 1015946202. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:16:16,374][25689] Avg episode reward: [(0, '0.688')] [2022-07-11 02:16:16,386][26022] Updated weights on worker 0-0, policy_version 992134 (0.00091) [2022-07-11 02:16:18,052][26022] Updated weights on worker 0-0, policy_version 992144 (0.00090) [2022-07-11 02:16:19,991][26022] Updated weights on worker 0-0, policy_version 992154 (0.00089) [2022-07-11 02:16:21,480][25689] Fps is (10 sec: 5672.1, 60 sec: 5530.7, 300 sec: 5543.6). Total num frames: 1015973888. Throughput: 0: 5712.0. Samples: 1015979394. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 02:16:21,481][25689] Avg episode reward: [(0, '0.682')] [2022-07-11 02:16:21,735][26022] Updated weights on worker 0-0, policy_version 992164 (0.00090) [2022-07-11 02:16:23,598][26022] Updated weights on worker 0-0, policy_version 992174 (0.00597) [2022-07-11 02:16:25,358][26022] Updated weights on worker 0-0, policy_version 992184 (0.00078) [2022-07-11 02:16:26,503][25689] Fps is (10 sec: 5762.0, 60 sec: 5529.3, 300 sec: 5547.2). Total num frames: 1016002560. Throughput: 0: 4973.1. Samples: 1015996176. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:26,505][25689] Avg episode reward: [(0, '0.926')] [2022-07-11 02:16:27,507][26022] Updated weights on worker 0-0, policy_version 992194 (0.00353) [2022-07-11 02:16:29,159][26022] Updated weights on worker 0-0, policy_version 992204 (0.00091) [2022-07-11 02:16:31,247][26022] Updated weights on worker 0-0, policy_version 992214 (0.00091) [2022-07-11 02:16:31,509][25689] Fps is (10 sec: 5513.4, 60 sec: 5497.7, 300 sec: 5547.2). Total num frames: 1016029184. Throughput: 0: 5809.5. Samples: 1016029496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:31,511][25689] Avg episode reward: [(0, '0.649')] [2022-07-11 02:16:32,882][26022] Updated weights on worker 0-0, policy_version 992224 (0.00088) [2022-07-11 02:16:34,664][26022] Updated weights on worker 0-0, policy_version 992234 (0.00105) [2022-07-11 02:16:36,447][26022] Updated weights on worker 0-0, policy_version 992244 (0.00083) [2022-07-11 02:16:36,519][25689] Fps is (10 sec: 5520.9, 60 sec: 5532.2, 300 sec: 5541.4). Total num frames: 1016057856. Throughput: 0: 5791.7. Samples: 1016062880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:36,520][25689] Avg episode reward: [(0, '1.278')] [2022-07-11 02:16:38,571][26022] Updated weights on worker 0-0, policy_version 992254 (0.00090) [2022-07-11 02:16:40,239][26022] Updated weights on worker 0-0, policy_version 992264 (0.00095) [2022-07-11 02:16:41,604][25689] Fps is (10 sec: 5579.1, 60 sec: 5529.0, 300 sec: 5543.4). Total num frames: 1016085504. Throughput: 0: 4979.5. Samples: 1016079602. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:41,608][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 02:16:42,109][26022] Updated weights on worker 0-0, policy_version 992274 (0.00089) [2022-07-11 02:16:43,824][26022] Updated weights on worker 0-0, policy_version 992284 (0.00088) [2022-07-11 02:16:45,527][26022] Updated weights on worker 0-0, policy_version 992294 (0.01295) [2022-07-11 02:16:46,623][25689] Fps is (10 sec: 5472.8, 60 sec: 5528.2, 300 sec: 5536.7). Total num frames: 1016113152. Throughput: 0: 5836.7. Samples: 1016113608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:46,623][25689] Avg episode reward: [(0, '1.401')] [2022-07-11 02:16:47,491][26022] Updated weights on worker 0-0, policy_version 992304 (0.00083) [2022-07-11 02:16:49,409][26022] Updated weights on worker 0-0, policy_version 992314 (0.00089) [2022-07-11 02:16:50,946][26022] Updated weights on worker 0-0, policy_version 992324 (0.00089) [2022-07-11 02:16:51,636][25689] Fps is (10 sec: 5716.3, 60 sec: 5545.1, 300 sec: 5547.0). Total num frames: 1016142848. Throughput: 0: 5863.7. Samples: 1016147512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:51,636][25689] Avg episode reward: [(0, '1.348')] [2022-07-11 02:16:52,951][26022] Updated weights on worker 0-0, policy_version 992334 (0.00087) [2022-07-11 02:16:54,621][26022] Updated weights on worker 0-0, policy_version 992344 (0.00094) [2022-07-11 02:16:56,650][25689] Fps is (10 sec: 5616.9, 60 sec: 5531.4, 300 sec: 5538.0). Total num frames: 1016169472. Throughput: 0: 5052.1. Samples: 1016164582. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:16:56,650][25689] Avg episode reward: [(0, '1.509')] [2022-07-11 02:16:56,875][26022] Updated weights on worker 0-0, policy_version 992354 (0.00084) [2022-07-11 02:16:58,367][26022] Updated weights on worker 0-0, policy_version 992364 (0.00092) [2022-07-11 02:17:00,232][26022] Updated weights on worker 0-0, policy_version 992374 (0.00095) [2022-07-11 02:17:01,700][25689] Fps is (10 sec: 5392.8, 60 sec: 5530.1, 300 sec: 5545.2). Total num frames: 1016197120. Throughput: 0: 5883.3. Samples: 1016197830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:01,700][25689] Avg episode reward: [(0, '1.704')] [2022-07-11 02:17:02,355][26022] Updated weights on worker 0-0, policy_version 992384 (0.00086) [2022-07-11 02:17:04,483][26022] Updated weights on worker 0-0, policy_version 992394 (0.00054) [2022-07-11 02:17:06,084][26022] Updated weights on worker 0-0, policy_version 992404 (0.00089) [2022-07-11 02:17:06,705][25689] Fps is (10 sec: 5397.1, 60 sec: 5529.8, 300 sec: 5539.2). Total num frames: 1016223744. Throughput: 0: 5749.5. Samples: 1016229072. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:06,706][25689] Avg episode reward: [(0, '1.740')] [2022-07-11 02:17:08,095][26022] Updated weights on worker 0-0, policy_version 992414 (0.00087) [2022-07-11 02:17:09,865][26022] Updated weights on worker 0-0, policy_version 992424 (0.00109) [2022-07-11 02:17:11,711][25689] Fps is (10 sec: 5421.0, 60 sec: 5549.3, 300 sec: 5542.8). Total num frames: 1016251392. Throughput: 0: 4895.7. Samples: 1016245794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:11,712][25689] Avg episode reward: [(0, '1.475')] [2022-07-11 02:17:11,723][26022] Updated weights on worker 0-0, policy_version 992434 (0.00082) [2022-07-11 02:17:13,418][26022] Updated weights on worker 0-0, policy_version 992444 (0.00087) [2022-07-11 02:17:15,415][26022] Updated weights on worker 0-0, policy_version 992454 (0.00093) [2022-07-11 02:17:16,757][25689] Fps is (10 sec: 5602.7, 60 sec: 5562.2, 300 sec: 5543.4). Total num frames: 1016280064. Throughput: 0: 5702.2. Samples: 1016279242. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:16,758][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 02:17:17,259][26022] Updated weights on worker 0-0, policy_version 992464 (0.00087) [2022-07-11 02:17:19,169][26022] Updated weights on worker 0-0, policy_version 992474 (0.00088) [2022-07-11 02:17:20,835][26022] Updated weights on worker 0-0, policy_version 992484 (0.00078) [2022-07-11 02:17:21,898][25689] Fps is (10 sec: 5528.7, 60 sec: 5525.2, 300 sec: 5538.5). Total num frames: 1016307712. Throughput: 0: 5674.5. Samples: 1016312444. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:21,898][25689] Avg episode reward: [(0, '1.658')] [2022-07-11 02:17:22,900][26022] Updated weights on worker 0-0, policy_version 992494 (0.00089) [2022-07-11 02:17:24,386][26022] Updated weights on worker 0-0, policy_version 992504 (0.00085) [2022-07-11 02:17:25,748][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:17:25,758][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000992510_1016330240.pth [2022-07-11 02:17:25,763][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000990559_1014332416.pth [2022-07-11 02:17:26,445][26022] Updated weights on worker 0-0, policy_version 992514 (0.00083) [2022-07-11 02:17:26,903][25689] Fps is (10 sec: 5651.9, 60 sec: 5543.8, 300 sec: 5545.9). Total num frames: 1016337408. Throughput: 0: 5817.5. Samples: 1016346574. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:26,904][25689] Avg episode reward: [(0, '1.745')] [2022-07-11 02:17:28,230][26022] Updated weights on worker 0-0, policy_version 992524 (0.00090) [2022-07-11 02:17:30,029][26022] Updated weights on worker 0-0, policy_version 992534 (0.00086) [2022-07-11 02:17:31,905][25689] Fps is (10 sec: 5730.1, 60 sec: 5561.2, 300 sec: 5546.4). Total num frames: 1016365056. Throughput: 0: 5823.5. Samples: 1016363396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:31,906][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 02:17:31,907][26022] Updated weights on worker 0-0, policy_version 992544 (0.00087) [2022-07-11 02:17:33,699][26022] Updated weights on worker 0-0, policy_version 992554 (0.00082) [2022-07-11 02:17:35,479][26022] Updated weights on worker 0-0, policy_version 992564 (0.00084) [2022-07-11 02:17:36,919][25689] Fps is (10 sec: 5521.0, 60 sec: 5543.8, 300 sec: 5540.3). Total num frames: 1016392704. Throughput: 0: 5822.8. Samples: 1016396638. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:36,920][25689] Avg episode reward: [(0, '1.427')] [2022-07-11 02:17:37,558][26022] Updated weights on worker 0-0, policy_version 992574 (0.00094) [2022-07-11 02:17:39,135][26022] Updated weights on worker 0-0, policy_version 992584 (0.00086) [2022-07-11 02:17:41,240][26022] Updated weights on worker 0-0, policy_version 992594 (0.00092) [2022-07-11 02:17:42,032][25689] Fps is (10 sec: 5460.2, 60 sec: 5541.3, 300 sec: 5538.3). Total num frames: 1016420352. Throughput: 0: 5836.4. Samples: 1016429958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:42,034][25689] Avg episode reward: [(0, '1.490')] [2022-07-11 02:17:42,948][26022] Updated weights on worker 0-0, policy_version 992604 (0.00090) [2022-07-11 02:17:44,695][26022] Updated weights on worker 0-0, policy_version 992614 (0.00085) [2022-07-11 02:17:46,471][26022] Updated weights on worker 0-0, policy_version 992624 (0.00091) [2022-07-11 02:17:47,042][25689] Fps is (10 sec: 5563.7, 60 sec: 5559.0, 300 sec: 5541.8). Total num frames: 1016449024. Throughput: 0: 4969.1. Samples: 1016446644. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:47,042][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 02:17:48,425][26022] Updated weights on worker 0-0, policy_version 992634 (0.00085) [2022-07-11 02:17:50,363][26022] Updated weights on worker 0-0, policy_version 992644 (0.00085) [2022-07-11 02:17:52,052][25689] Fps is (10 sec: 5621.1, 60 sec: 5525.4, 300 sec: 5538.5). Total num frames: 1016476672. Throughput: 0: 5795.9. Samples: 1016480164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:52,052][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 02:17:52,215][26022] Updated weights on worker 0-0, policy_version 992654 (0.00091) [2022-07-11 02:17:53,835][26022] Updated weights on worker 0-0, policy_version 992664 (0.00082) [2022-07-11 02:17:55,876][26022] Updated weights on worker 0-0, policy_version 992674 (0.00091) [2022-07-11 02:17:57,054][25689] Fps is (10 sec: 5522.8, 60 sec: 5543.4, 300 sec: 5537.0). Total num frames: 1016504320. Throughput: 0: 5829.9. Samples: 1016514024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:17:57,055][25689] Avg episode reward: [(0, '1.566')] [2022-07-11 02:17:57,514][26022] Updated weights on worker 0-0, policy_version 992684 (0.00088) [2022-07-11 02:17:59,531][26022] Updated weights on worker 0-0, policy_version 992694 (0.00099) [2022-07-11 02:18:01,303][26022] Updated weights on worker 0-0, policy_version 992704 (0.00089) [2022-07-11 02:18:02,124][25689] Fps is (10 sec: 5591.6, 60 sec: 5558.6, 300 sec: 5542.7). Total num frames: 1016532992. Throughput: 0: 5016.4. Samples: 1016530746. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:02,125][25689] Avg episode reward: [(0, '1.813')] [2022-07-11 02:18:03,799][26022] Updated weights on worker 0-0, policy_version 992714 (0.00091) [2022-07-11 02:18:05,092][26022] Updated weights on worker 0-0, policy_version 992724 (0.00094) [2022-07-11 02:18:07,145][25689] Fps is (10 sec: 5276.7, 60 sec: 5523.2, 300 sec: 5533.6). Total num frames: 1016557568. Throughput: 0: 5753.4. Samples: 1016562308. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:07,146][25689] Avg episode reward: [(0, '0.656')] [2022-07-11 02:18:07,341][26022] Updated weights on worker 0-0, policy_version 992734 (0.00091) [2022-07-11 02:18:08,800][26022] Updated weights on worker 0-0, policy_version 992744 (0.00090) [2022-07-11 02:18:10,768][26022] Updated weights on worker 0-0, policy_version 992754 (0.00091) [2022-07-11 02:18:12,230][25689] Fps is (10 sec: 5370.1, 60 sec: 5549.8, 300 sec: 5532.5). Total num frames: 1016587264. Throughput: 0: 5736.4. Samples: 1016595916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:12,232][25689] Avg episode reward: [(0, '0.584')] [2022-07-11 02:18:12,540][26022] Updated weights on worker 0-0, policy_version 992764 (0.00091) [2022-07-11 02:18:14,458][26022] Updated weights on worker 0-0, policy_version 992774 (0.00092) [2022-07-11 02:18:16,462][26022] Updated weights on worker 0-0, policy_version 992784 (0.00237) [2022-07-11 02:18:17,259][25689] Fps is (10 sec: 5872.5, 60 sec: 5568.4, 300 sec: 5544.3). Total num frames: 1016616960. Throughput: 0: 4885.7. Samples: 1016612738. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:17,260][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 02:18:18,127][26022] Updated weights on worker 0-0, policy_version 992794 (0.00088) [2022-07-11 02:18:19,926][26022] Updated weights on worker 0-0, policy_version 992804 (0.00087) [2022-07-11 02:18:21,864][26022] Updated weights on worker 0-0, policy_version 992814 (0.00054) [2022-07-11 02:18:22,400][25689] Fps is (10 sec: 5537.8, 60 sec: 5551.4, 300 sec: 5531.9). Total num frames: 1016643584. Throughput: 0: 5694.2. Samples: 1016646204. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:22,401][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 02:18:23,615][26022] Updated weights on worker 0-0, policy_version 992824 (0.00091) [2022-07-11 02:18:25,518][26022] Updated weights on worker 0-0, policy_version 992834 (0.00089) [2022-07-11 02:18:27,274][26022] Updated weights on worker 0-0, policy_version 992844 (0.00090) [2022-07-11 02:18:27,468][25689] Fps is (10 sec: 5416.2, 60 sec: 5528.7, 300 sec: 5535.6). Total num frames: 1016672256. Throughput: 0: 5773.3. Samples: 1016679640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:27,469][25689] Avg episode reward: [(0, '0.266')] [2022-07-11 02:18:29,015][26022] Updated weights on worker 0-0, policy_version 992854 (0.00087) [2022-07-11 02:18:31,081][26022] Updated weights on worker 0-0, policy_version 992864 (0.00086) [2022-07-11 02:18:32,531][25689] Fps is (10 sec: 5660.5, 60 sec: 5540.1, 300 sec: 5538.3). Total num frames: 1016700928. Throughput: 0: 4957.2. Samples: 1016696550. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:32,532][25689] Avg episode reward: [(0, '0.039')] [2022-07-11 02:18:32,708][26022] Updated weights on worker 0-0, policy_version 992874 (0.00083) [2022-07-11 02:18:34,636][26022] Updated weights on worker 0-0, policy_version 992884 (0.00087) [2022-07-11 02:18:36,681][26022] Updated weights on worker 0-0, policy_version 992894 (0.00084) [2022-07-11 02:18:37,605][25689] Fps is (10 sec: 5758.2, 60 sec: 5568.3, 300 sec: 5544.5). Total num frames: 1016730624. Throughput: 0: 5763.4. Samples: 1016730000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:37,605][25689] Avg episode reward: [(0, '1.039')] [2022-07-11 02:18:38,116][26022] Updated weights on worker 0-0, policy_version 992904 (0.00085) [2022-07-11 02:18:40,219][26022] Updated weights on worker 0-0, policy_version 992914 (0.00061) [2022-07-11 02:18:41,912][26022] Updated weights on worker 0-0, policy_version 992924 (0.00091) [2022-07-11 02:18:42,669][25689] Fps is (10 sec: 5555.4, 60 sec: 5556.0, 300 sec: 5536.6). Total num frames: 1016757248. Throughput: 0: 5795.9. Samples: 1016763678. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:42,669][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 02:18:43,830][26022] Updated weights on worker 0-0, policy_version 992934 (0.00107) [2022-07-11 02:18:45,804][26022] Updated weights on worker 0-0, policy_version 992944 (0.00087) [2022-07-11 02:18:47,464][26022] Updated weights on worker 0-0, policy_version 992954 (0.00090) [2022-07-11 02:18:47,684][25689] Fps is (10 sec: 5486.0, 60 sec: 5555.4, 300 sec: 5543.3). Total num frames: 1016785920. Throughput: 0: 4987.1. Samples: 1016780460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:47,685][25689] Avg episode reward: [(0, '0.637')] [2022-07-11 02:18:49,230][26022] Updated weights on worker 0-0, policy_version 992964 (0.00097) [2022-07-11 02:18:51,209][26022] Updated weights on worker 0-0, policy_version 992974 (0.00093) [2022-07-11 02:18:52,693][25689] Fps is (10 sec: 5720.7, 60 sec: 5572.5, 300 sec: 5543.4). Total num frames: 1016814592. Throughput: 0: 5819.4. Samples: 1016813880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:52,693][25689] Avg episode reward: [(0, '0.711')] [2022-07-11 02:18:52,840][26022] Updated weights on worker 0-0, policy_version 992984 (0.00088) [2022-07-11 02:18:55,006][26022] Updated weights on worker 0-0, policy_version 992994 (0.00090) [2022-07-11 02:18:56,652][26022] Updated weights on worker 0-0, policy_version 993004 (0.00093) [2022-07-11 02:18:57,700][25689] Fps is (10 sec: 5520.9, 60 sec: 5555.1, 300 sec: 5541.1). Total num frames: 1016841216. Throughput: 0: 5835.1. Samples: 1016847258. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:18:57,701][25689] Avg episode reward: [(0, '0.539')] [2022-07-11 02:18:58,554][26022] Updated weights on worker 0-0, policy_version 993014 (0.00099) [2022-07-11 02:19:00,354][26022] Updated weights on worker 0-0, policy_version 993024 (0.00085) [2022-07-11 02:19:02,619][26022] Updated weights on worker 0-0, policy_version 993034 (0.00087) [2022-07-11 02:19:02,776][25689] Fps is (10 sec: 5281.1, 60 sec: 5520.9, 300 sec: 5536.6). Total num frames: 1016867840. Throughput: 0: 4995.1. Samples: 1016864114. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:02,776][25689] Avg episode reward: [(0, '0.701')] [2022-07-11 02:19:04,417][26022] Updated weights on worker 0-0, policy_version 993044 (0.00053) [2022-07-11 02:19:06,432][26022] Updated weights on worker 0-0, policy_version 993054 (0.00088) [2022-07-11 02:19:07,789][25689] Fps is (10 sec: 5481.0, 60 sec: 5589.1, 300 sec: 5541.5). Total num frames: 1016896512. Throughput: 0: 5735.8. Samples: 1016895776. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:07,789][25689] Avg episode reward: [(0, '0.401')] [2022-07-11 02:19:07,977][26022] Updated weights on worker 0-0, policy_version 993064 (0.00088) [2022-07-11 02:19:09,973][26022] Updated weights on worker 0-0, policy_version 993074 (0.00085) [2022-07-11 02:19:11,900][26022] Updated weights on worker 0-0, policy_version 993084 (0.00094) [2022-07-11 02:19:12,826][25689] Fps is (10 sec: 5400.1, 60 sec: 5526.0, 300 sec: 5534.7). Total num frames: 1016922112. Throughput: 0: 5723.0. Samples: 1016929100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:12,826][25689] Avg episode reward: [(0, '0.970')] [2022-07-11 02:19:13,639][26022] Updated weights on worker 0-0, policy_version 993094 (0.00086) [2022-07-11 02:19:15,415][26022] Updated weights on worker 0-0, policy_version 993104 (0.00087) [2022-07-11 02:19:17,064][26022] Updated weights on worker 0-0, policy_version 993114 (0.00084) [2022-07-11 02:19:17,831][25689] Fps is (10 sec: 5506.5, 60 sec: 5528.1, 300 sec: 5539.4). Total num frames: 1016951808. Throughput: 0: 4907.8. Samples: 1016946056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:17,832][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 02:19:19,348][26022] Updated weights on worker 0-0, policy_version 993124 (0.00083) [2022-07-11 02:19:20,963][26022] Updated weights on worker 0-0, policy_version 993134 (0.00088) [2022-07-11 02:19:22,903][25689] Fps is (10 sec: 5588.9, 60 sec: 5534.5, 300 sec: 5531.6). Total num frames: 1016978432. Throughput: 0: 5709.9. Samples: 1016979038. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:22,904][25689] Avg episode reward: [(0, '0.317')] [2022-07-11 02:19:22,934][26022] Updated weights on worker 0-0, policy_version 993144 (0.00086) [2022-07-11 02:19:24,488][26022] Updated weights on worker 0-0, policy_version 993154 (0.00364) [2022-07-11 02:19:25,892][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:19:25,905][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000993160_1016995840.pth [2022-07-11 02:19:25,906][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000991213_1015002112.pth [2022-07-11 02:19:26,387][26022] Updated weights on worker 0-0, policy_version 993164 (0.00083) [2022-07-11 02:19:27,932][25689] Fps is (10 sec: 5575.6, 60 sec: 5554.9, 300 sec: 5542.3). Total num frames: 1017008128. Throughput: 0: 5813.0. Samples: 1017012868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:27,933][25689] Avg episode reward: [(0, '-0.140')] [2022-07-11 02:19:28,156][26022] Updated weights on worker 0-0, policy_version 993174 (0.00081) [2022-07-11 02:19:30,003][26022] Updated weights on worker 0-0, policy_version 993184 (0.00066) [2022-07-11 02:19:31,928][26022] Updated weights on worker 0-0, policy_version 993194 (0.00091) [2022-07-11 02:19:32,985][25689] Fps is (10 sec: 5789.2, 60 sec: 5555.8, 300 sec: 5542.1). Total num frames: 1017036800. Throughput: 0: 5001.6. Samples: 1017029928. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:32,986][25689] Avg episode reward: [(0, '-0.563')] [2022-07-11 02:19:33,674][26022] Updated weights on worker 0-0, policy_version 993204 (0.00604) [2022-07-11 02:19:35,400][26022] Updated weights on worker 0-0, policy_version 993214 (0.00091) [2022-07-11 02:19:37,285][26022] Updated weights on worker 0-0, policy_version 993224 (0.00085) [2022-07-11 02:19:37,998][25689] Fps is (10 sec: 5696.9, 60 sec: 5544.5, 300 sec: 5546.5). Total num frames: 1017065472. Throughput: 0: 5843.0. Samples: 1017063892. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:38,000][25689] Avg episode reward: [(0, '-0.164')] [2022-07-11 02:19:39,021][26022] Updated weights on worker 0-0, policy_version 993234 (0.00085) [2022-07-11 02:19:40,976][26022] Updated weights on worker 0-0, policy_version 993244 (0.00084) [2022-07-11 02:19:42,712][26022] Updated weights on worker 0-0, policy_version 993254 (0.00086) [2022-07-11 02:19:43,061][25689] Fps is (10 sec: 5589.3, 60 sec: 5561.5, 300 sec: 5545.4). Total num frames: 1017093120. Throughput: 0: 5883.2. Samples: 1017097636. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:43,062][25689] Avg episode reward: [(0, '0.299')] [2022-07-11 02:19:44,516][26022] Updated weights on worker 0-0, policy_version 993264 (0.00083) [2022-07-11 02:19:46,531][26022] Updated weights on worker 0-0, policy_version 993274 (0.00085) [2022-07-11 02:19:48,063][25689] Fps is (10 sec: 5595.8, 60 sec: 5562.8, 300 sec: 5542.2). Total num frames: 1017121792. Throughput: 0: 5049.9. Samples: 1017114526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:48,064][25689] Avg episode reward: [(0, '0.248')] [2022-07-11 02:19:48,205][26022] Updated weights on worker 0-0, policy_version 993284 (0.00091) [2022-07-11 02:19:50,156][26022] Updated weights on worker 0-0, policy_version 993294 (0.00090) [2022-07-11 02:19:51,896][26022] Updated weights on worker 0-0, policy_version 993304 (0.00092) [2022-07-11 02:19:53,091][25689] Fps is (10 sec: 5615.6, 60 sec: 5544.0, 300 sec: 5541.9). Total num frames: 1017149440. Throughput: 0: 5878.2. Samples: 1017148112. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:53,091][25689] Avg episode reward: [(0, '1.533')] [2022-07-11 02:19:53,720][26022] Updated weights on worker 0-0, policy_version 993314 (0.00087) [2022-07-11 02:19:55,620][26022] Updated weights on worker 0-0, policy_version 993324 (0.00098) [2022-07-11 02:19:57,423][26022] Updated weights on worker 0-0, policy_version 993334 (0.00088) [2022-07-11 02:19:58,106][25689] Fps is (10 sec: 5506.0, 60 sec: 5560.3, 300 sec: 5546.2). Total num frames: 1017177088. Throughput: 0: 5869.3. Samples: 1017181910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:19:58,106][25689] Avg episode reward: [(0, '1.475')] [2022-07-11 02:19:59,182][26022] Updated weights on worker 0-0, policy_version 993344 (0.00089) [2022-07-11 02:20:01,156][26022] Updated weights on worker 0-0, policy_version 993354 (0.00083) [2022-07-11 02:20:03,086][26022] Updated weights on worker 0-0, policy_version 993364 (0.00093) [2022-07-11 02:20:03,178][25689] Fps is (10 sec: 5583.0, 60 sec: 5594.4, 300 sec: 5552.0). Total num frames: 1017205760. Throughput: 0: 5753.1. Samples: 1017213372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:20:03,179][25689] Avg episode reward: [(0, '1.616')] [2022-07-11 02:20:05,200][26022] Updated weights on worker 0-0, policy_version 993374 (0.00088) [2022-07-11 02:20:06,840][26022] Updated weights on worker 0-0, policy_version 993384 (0.00087) [2022-07-11 02:20:08,201][25689] Fps is (10 sec: 5274.5, 60 sec: 5525.8, 300 sec: 5537.9). Total num frames: 1017230336. Throughput: 0: 5749.8. Samples: 1017230318. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:20:08,201][25689] Avg episode reward: [(0, '1.660')] [2022-07-11 02:20:08,669][26022] Updated weights on worker 0-0, policy_version 993394 (0.00085) [2022-07-11 02:20:10,720][26022] Updated weights on worker 0-0, policy_version 993404 (0.00087) [2022-07-11 02:20:12,284][26022] Updated weights on worker 0-0, policy_version 993414 (0.00095) [2022-07-11 02:20:13,207][25689] Fps is (10 sec: 5513.7, 60 sec: 5613.4, 300 sec: 5551.8). Total num frames: 1017261056. Throughput: 0: 5775.4. Samples: 1017264294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:20:13,208][25689] Avg episode reward: [(0, '1.868')] [2022-07-11 02:20:14,212][26022] Updated weights on worker 0-0, policy_version 993424 (0.00094) [2022-07-11 02:20:16,179][26022] Updated weights on worker 0-0, policy_version 993434 (0.00082) [2022-07-11 02:20:17,599][26022] Updated weights on worker 0-0, policy_version 993444 (0.00092) [2022-07-11 02:20:18,263][25689] Fps is (10 sec: 5801.1, 60 sec: 5574.8, 300 sec: 5548.3). Total num frames: 1017288704. Throughput: 0: 5757.0. Samples: 1017297954. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:20:18,263][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 02:20:19,984][26022] Updated weights on worker 0-0, policy_version 993454 (0.00094) [2022-07-11 02:20:21,434][26022] Updated weights on worker 0-0, policy_version 993464 (0.00084) [2022-07-11 02:20:23,402][25689] Fps is (10 sec: 5424.1, 60 sec: 5585.5, 300 sec: 5542.4). Total num frames: 1017316352. Throughput: 0: 4991.9. Samples: 1017314326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 02:20:23,402][25689] Avg episode reward: [(0, '0.835')] [2022-07-11 02:20:23,450][26022] Updated weights on worker 0-0, policy_version 993474 (0.00085) [2022-07-11 02:20:25,164][26022] Updated weights on worker 0-0, policy_version 993484 (0.00090) [2022-07-11 02:20:27,213][26022] Updated weights on worker 0-0, policy_version 993494 (0.00095) [2022-07-11 02:20:28,464][25689] Fps is (10 sec: 5620.9, 60 sec: 5582.5, 300 sec: 5545.3). Total num frames: 1017346048. Throughput: 0: 5830.2. Samples: 1017348458. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:28,466][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 02:20:28,945][26022] Updated weights on worker 0-0, policy_version 993504 (0.00090) [2022-07-11 02:20:30,576][26022] Updated weights on worker 0-0, policy_version 993514 (0.00086) [2022-07-11 02:20:32,320][26022] Updated weights on worker 0-0, policy_version 993524 (0.00082) [2022-07-11 02:20:33,493][25689] Fps is (10 sec: 5682.3, 60 sec: 5567.8, 300 sec: 5548.5). Total num frames: 1017373696. Throughput: 0: 5815.0. Samples: 1017382258. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:33,494][25689] Avg episode reward: [(0, '0.859')] [2022-07-11 02:20:34,427][26022] Updated weights on worker 0-0, policy_version 993534 (0.00088) [2022-07-11 02:20:35,974][26022] Updated weights on worker 0-0, policy_version 993544 (0.00092) [2022-07-11 02:20:38,115][26022] Updated weights on worker 0-0, policy_version 993554 (0.00085) [2022-07-11 02:20:38,531][25689] Fps is (10 sec: 5594.9, 60 sec: 5565.5, 300 sec: 5552.2). Total num frames: 1017402368. Throughput: 0: 4987.1. Samples: 1017399034. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:38,531][25689] Avg episode reward: [(0, '0.686')] [2022-07-11 02:20:39,750][26022] Updated weights on worker 0-0, policy_version 993564 (0.00085) [2022-07-11 02:20:41,535][26022] Updated weights on worker 0-0, policy_version 993574 (0.00086) [2022-07-11 02:20:43,455][26022] Updated weights on worker 0-0, policy_version 993584 (0.00089) [2022-07-11 02:20:43,654][25689] Fps is (10 sec: 5543.1, 60 sec: 5560.0, 300 sec: 5550.1). Total num frames: 1017430016. Throughput: 0: 5845.7. Samples: 1017432714. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:43,654][25689] Avg episode reward: [(0, '-0.069')] [2022-07-11 02:20:45,361][26022] Updated weights on worker 0-0, policy_version 993594 (0.00093) [2022-07-11 02:20:47,254][26022] Updated weights on worker 0-0, policy_version 993604 (0.00095) [2022-07-11 02:20:48,683][25689] Fps is (10 sec: 5648.3, 60 sec: 5574.4, 300 sec: 5553.2). Total num frames: 1017459712. Throughput: 0: 5821.1. Samples: 1017466154. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:48,684][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 02:20:48,950][26022] Updated weights on worker 0-0, policy_version 993614 (0.00095) [2022-07-11 02:20:50,756][26022] Updated weights on worker 0-0, policy_version 993624 (0.00084) [2022-07-11 02:20:52,811][26022] Updated weights on worker 0-0, policy_version 993634 (0.00090) [2022-07-11 02:20:53,707][25689] Fps is (10 sec: 5602.4, 60 sec: 5557.9, 300 sec: 5550.3). Total num frames: 1017486336. Throughput: 0: 4975.0. Samples: 1017482818. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:53,707][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 02:20:54,426][26022] Updated weights on worker 0-0, policy_version 993644 (0.00082) [2022-07-11 02:20:56,269][26022] Updated weights on worker 0-0, policy_version 993654 (0.00091) [2022-07-11 02:20:58,224][26022] Updated weights on worker 0-0, policy_version 993664 (0.00086) [2022-07-11 02:20:58,716][25689] Fps is (10 sec: 5511.4, 60 sec: 5575.3, 300 sec: 5554.2). Total num frames: 1017515008. Throughput: 0: 5813.1. Samples: 1017516374. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:20:58,717][25689] Avg episode reward: [(0, '-1.014')] [2022-07-11 02:21:00,025][26022] Updated weights on worker 0-0, policy_version 993674 (0.00094) [2022-07-11 02:21:02,071][26022] Updated weights on worker 0-0, policy_version 993684 (0.00085) [2022-07-11 02:21:03,770][25689] Fps is (10 sec: 5393.0, 60 sec: 5526.3, 300 sec: 5549.8). Total num frames: 1017540608. Throughput: 0: 5725.5. Samples: 1017547890. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:03,772][25689] Avg episode reward: [(0, '-0.959')] [2022-07-11 02:21:03,942][26022] Updated weights on worker 0-0, policy_version 993694 (0.00090) [2022-07-11 02:21:05,804][26022] Updated weights on worker 0-0, policy_version 993704 (0.00084) [2022-07-11 02:21:07,662][26022] Updated weights on worker 0-0, policy_version 993714 (0.00086) [2022-07-11 02:21:08,778][25689] Fps is (10 sec: 5292.2, 60 sec: 5578.4, 300 sec: 5553.7). Total num frames: 1017568256. Throughput: 0: 4905.5. Samples: 1017564728. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:08,778][25689] Avg episode reward: [(0, '-0.006')] [2022-07-11 02:21:09,434][26022] Updated weights on worker 0-0, policy_version 993724 (0.00075) [2022-07-11 02:21:11,233][26022] Updated weights on worker 0-0, policy_version 993734 (0.00084) [2022-07-11 02:21:13,108][26022] Updated weights on worker 0-0, policy_version 993744 (0.00090) [2022-07-11 02:21:13,800][25689] Fps is (10 sec: 5615.0, 60 sec: 5543.1, 300 sec: 5556.8). Total num frames: 1017596928. Throughput: 0: 5754.8. Samples: 1017598454. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:13,801][25689] Avg episode reward: [(0, '-0.472')] [2022-07-11 02:21:14,855][26022] Updated weights on worker 0-0, policy_version 993754 (0.00096) [2022-07-11 02:21:16,726][26022] Updated weights on worker 0-0, policy_version 993764 (0.00091) [2022-07-11 02:21:18,539][26022] Updated weights on worker 0-0, policy_version 993774 (0.00091) [2022-07-11 02:21:18,812][25689] Fps is (10 sec: 5612.5, 60 sec: 5547.0, 300 sec: 5551.7). Total num frames: 1017624576. Throughput: 0: 5768.6. Samples: 1017632302. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:18,813][25689] Avg episode reward: [(0, '-0.262')] [2022-07-11 02:21:20,443][26022] Updated weights on worker 0-0, policy_version 993784 (0.00086) [2022-07-11 02:21:22,345][26022] Updated weights on worker 0-0, policy_version 993794 (0.00089) [2022-07-11 02:21:23,934][25689] Fps is (10 sec: 5557.5, 60 sec: 5565.5, 300 sec: 5549.8). Total num frames: 1017653248. Throughput: 0: 5015.6. Samples: 1017649026. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:23,935][25689] Avg episode reward: [(0, '-0.268')] [2022-07-11 02:21:24,184][26022] Updated weights on worker 0-0, policy_version 993804 (0.00087) [2022-07-11 02:21:26,006][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:21:26,026][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000993814_1017665536.pth [2022-07-11 02:21:26,027][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000991862_1015666688.pth [2022-07-11 02:21:26,030][26022] Updated weights on worker 0-0, policy_version 993814 (0.00099) [2022-07-11 02:21:27,931][26022] Updated weights on worker 0-0, policy_version 993824 (0.00085) [2022-07-11 02:21:28,936][25689] Fps is (10 sec: 5664.1, 60 sec: 5554.2, 300 sec: 5556.8). Total num frames: 1017681920. Throughput: 0: 5836.4. Samples: 1017682384. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:28,938][25689] Avg episode reward: [(0, '1.272')] [2022-07-11 02:21:29,557][26022] Updated weights on worker 0-0, policy_version 993834 (0.00087) [2022-07-11 02:21:31,550][26022] Updated weights on worker 0-0, policy_version 993844 (0.00088) [2022-07-11 02:21:33,223][26022] Updated weights on worker 0-0, policy_version 993854 (0.00084) [2022-07-11 02:21:33,957][25689] Fps is (10 sec: 5619.4, 60 sec: 5555.0, 300 sec: 5553.1). Total num frames: 1017709568. Throughput: 0: 5837.7. Samples: 1017716122. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:33,958][25689] Avg episode reward: [(0, '0.231')] [2022-07-11 02:21:34,999][26022] Updated weights on worker 0-0, policy_version 993864 (0.00092) [2022-07-11 02:21:37,056][26022] Updated weights on worker 0-0, policy_version 993874 (0.00079) [2022-07-11 02:21:38,789][26022] Updated weights on worker 0-0, policy_version 993884 (0.00093) [2022-07-11 02:21:38,980][25689] Fps is (10 sec: 5505.4, 60 sec: 5539.3, 300 sec: 5554.3). Total num frames: 1017737216. Throughput: 0: 4975.0. Samples: 1017732638. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:38,981][25689] Avg episode reward: [(0, '0.110')] [2022-07-11 02:21:40,858][26022] Updated weights on worker 0-0, policy_version 993894 (0.00070) [2022-07-11 02:21:42,599][26022] Updated weights on worker 0-0, policy_version 993904 (0.00051) [2022-07-11 02:21:44,056][25689] Fps is (10 sec: 5576.8, 60 sec: 5560.6, 300 sec: 5556.7). Total num frames: 1017765888. Throughput: 0: 5816.2. Samples: 1017766056. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:44,056][25689] Avg episode reward: [(0, '0.417')] [2022-07-11 02:21:44,391][26022] Updated weights on worker 0-0, policy_version 993914 (0.00098) [2022-07-11 02:21:46,187][26022] Updated weights on worker 0-0, policy_version 993924 (0.00093) [2022-07-11 02:21:47,856][26022] Updated weights on worker 0-0, policy_version 993934 (0.00092) [2022-07-11 02:21:49,119][25689] Fps is (10 sec: 5555.1, 60 sec: 5523.6, 300 sec: 5548.8). Total num frames: 1017793536. Throughput: 0: 5812.5. Samples: 1017799694. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:49,119][25689] Avg episode reward: [(0, '0.370')] [2022-07-11 02:21:49,875][26022] Updated weights on worker 0-0, policy_version 993944 (0.00086) [2022-07-11 02:21:51,456][26022] Updated weights on worker 0-0, policy_version 993954 (0.00083) [2022-07-11 02:21:53,695][26022] Updated weights on worker 0-0, policy_version 993964 (0.00089) [2022-07-11 02:21:54,131][25689] Fps is (10 sec: 5590.2, 60 sec: 5558.6, 300 sec: 5555.8). Total num frames: 1017822208. Throughput: 0: 4978.4. Samples: 1017816556. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:54,131][25689] Avg episode reward: [(0, '0.314')] [2022-07-11 02:21:55,372][26022] Updated weights on worker 0-0, policy_version 993974 (0.00084) [2022-07-11 02:21:57,218][26022] Updated weights on worker 0-0, policy_version 993984 (0.00096) [2022-07-11 02:21:58,964][26022] Updated weights on worker 0-0, policy_version 993994 (0.00078) [2022-07-11 02:21:59,160][25689] Fps is (10 sec: 5608.9, 60 sec: 5539.8, 300 sec: 5556.2). Total num frames: 1017849856. Throughput: 0: 5810.4. Samples: 1017849890. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:21:59,163][25689] Avg episode reward: [(0, '0.359')] [2022-07-11 02:22:00,862][26022] Updated weights on worker 0-0, policy_version 994004 (0.00085) [2022-07-11 02:22:03,042][26022] Updated weights on worker 0-0, policy_version 994014 (0.00084) [2022-07-11 02:22:04,284][25689] Fps is (10 sec: 5345.5, 60 sec: 5550.4, 300 sec: 5554.0). Total num frames: 1017876480. Throughput: 0: 5690.9. Samples: 1017881170. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:04,284][25689] Avg episode reward: [(0, '0.304')] [2022-07-11 02:22:04,988][26022] Updated weights on worker 0-0, policy_version 994024 (0.00083) [2022-07-11 02:22:06,724][26022] Updated weights on worker 0-0, policy_version 994034 (0.00089) [2022-07-11 02:22:08,623][26022] Updated weights on worker 0-0, policy_version 994044 (0.00080) [2022-07-11 02:22:09,363][25689] Fps is (10 sec: 5319.5, 60 sec: 5543.8, 300 sec: 5552.6). Total num frames: 1017904128. Throughput: 0: 4848.4. Samples: 1017897844. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:09,363][25689] Avg episode reward: [(0, '0.370')] [2022-07-11 02:22:10,391][26022] Updated weights on worker 0-0, policy_version 994054 (0.00092) [2022-07-11 02:22:12,206][26022] Updated weights on worker 0-0, policy_version 994064 (0.00087) [2022-07-11 02:22:14,146][26022] Updated weights on worker 0-0, policy_version 994074 (0.00089) [2022-07-11 02:22:14,389][25689] Fps is (10 sec: 5573.5, 60 sec: 5543.5, 300 sec: 5553.0). Total num frames: 1017932800. Throughput: 0: 5677.2. Samples: 1017931564. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:14,389][25689] Avg episode reward: [(0, '0.271')] [2022-07-11 02:22:15,900][26022] Updated weights on worker 0-0, policy_version 994084 (0.00087) [2022-07-11 02:22:18,035][26022] Updated weights on worker 0-0, policy_version 994094 (0.00089) [2022-07-11 02:22:19,357][26022] Updated weights on worker 0-0, policy_version 994104 (0.00083) [2022-07-11 02:22:19,483][25689] Fps is (10 sec: 5767.5, 60 sec: 5569.8, 300 sec: 5560.7). Total num frames: 1017962496. Throughput: 0: 5671.2. Samples: 1017965144. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:19,483][25689] Avg episode reward: [(0, '0.232')] [2022-07-11 02:22:21,557][26022] Updated weights on worker 0-0, policy_version 994114 (0.00085) [2022-07-11 02:22:22,955][26022] Updated weights on worker 0-0, policy_version 994124 (0.00081) [2022-07-11 02:22:24,547][25689] Fps is (10 sec: 5544.3, 60 sec: 5541.3, 300 sec: 5549.3). Total num frames: 1017989120. Throughput: 0: 5798.2. Samples: 1017998660. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:24,547][25689] Avg episode reward: [(0, '0.433')] [2022-07-11 02:22:25,099][26022] Updated weights on worker 0-0, policy_version 994134 (0.00084) [2022-07-11 02:22:26,886][26022] Updated weights on worker 0-0, policy_version 994144 (0.00082) [2022-07-11 02:22:28,676][26022] Updated weights on worker 0-0, policy_version 994154 (0.00086) [2022-07-11 02:22:29,564][25689] Fps is (10 sec: 5383.5, 60 sec: 5523.0, 300 sec: 5549.0). Total num frames: 1018016768. Throughput: 0: 5823.6. Samples: 1018015488. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:29,564][25689] Avg episode reward: [(0, '-0.201')] [2022-07-11 02:22:30,649][26022] Updated weights on worker 0-0, policy_version 994164 (0.00088) [2022-07-11 02:22:32,623][26022] Updated weights on worker 0-0, policy_version 994174 (0.00090) [2022-07-11 02:22:34,164][26022] Updated weights on worker 0-0, policy_version 994184 (0.00090) [2022-07-11 02:22:34,622][25689] Fps is (10 sec: 5691.4, 60 sec: 5553.3, 300 sec: 5555.1). Total num frames: 1018046464. Throughput: 0: 5802.9. Samples: 1018048980. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:34,623][25689] Avg episode reward: [(0, '0.485')] [2022-07-11 02:22:36,228][26022] Updated weights on worker 0-0, policy_version 994194 (0.00096) [2022-07-11 02:22:37,771][26022] Updated weights on worker 0-0, policy_version 994204 (0.00101) [2022-07-11 02:22:39,701][25689] Fps is (10 sec: 5656.9, 60 sec: 5548.3, 300 sec: 5555.7). Total num frames: 1018074112. Throughput: 0: 5801.1. Samples: 1018082432. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:39,701][25689] Avg episode reward: [(0, '0.390')] [2022-07-11 02:22:39,812][26022] Updated weights on worker 0-0, policy_version 994214 (0.00082) [2022-07-11 02:22:41,523][26022] Updated weights on worker 0-0, policy_version 994224 (0.00089) [2022-07-11 02:22:43,538][26022] Updated weights on worker 0-0, policy_version 994234 (0.00086) [2022-07-11 02:22:44,796][25689] Fps is (10 sec: 5535.8, 60 sec: 5546.5, 300 sec: 5554.1). Total num frames: 1018102784. Throughput: 0: 4966.9. Samples: 1018099236. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:44,797][25689] Avg episode reward: [(0, '0.609')] [2022-07-11 02:22:45,207][26022] Updated weights on worker 0-0, policy_version 994244 (0.00085) [2022-07-11 02:22:47,148][26022] Updated weights on worker 0-0, policy_version 994254 (0.00086) [2022-07-11 02:22:48,867][26022] Updated weights on worker 0-0, policy_version 994264 (0.00086) [2022-07-11 02:22:49,800][25689] Fps is (10 sec: 5576.6, 60 sec: 5551.9, 300 sec: 5554.2). Total num frames: 1018130432. Throughput: 0: 5805.5. Samples: 1018132970. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:49,803][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 02:22:50,816][26022] Updated weights on worker 0-0, policy_version 994274 (0.00089) [2022-07-11 02:22:52,691][26022] Updated weights on worker 0-0, policy_version 994284 (0.00107) [2022-07-11 02:22:54,420][26022] Updated weights on worker 0-0, policy_version 994294 (0.00093) [2022-07-11 02:22:54,817][25689] Fps is (10 sec: 5518.0, 60 sec: 5534.5, 300 sec: 5553.9). Total num frames: 1018158080. Throughput: 0: 5820.8. Samples: 1018166530. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:54,818][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 02:22:56,405][26022] Updated weights on worker 0-0, policy_version 994304 (0.00079) [2022-07-11 02:22:58,401][26022] Updated weights on worker 0-0, policy_version 994314 (0.00087) [2022-07-11 02:22:59,905][25689] Fps is (10 sec: 5573.9, 60 sec: 5546.1, 300 sec: 5553.6). Total num frames: 1018186752. Throughput: 0: 4979.0. Samples: 1018183026. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:22:59,905][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 02:22:59,961][26022] Updated weights on worker 0-0, policy_version 994324 (0.00089) [2022-07-11 02:23:02,276][26022] Updated weights on worker 0-0, policy_version 994334 (0.00086) [2022-07-11 02:23:03,900][26022] Updated weights on worker 0-0, policy_version 994344 (0.00092) [2022-07-11 02:23:04,947][25689] Fps is (10 sec: 5357.8, 60 sec: 5536.7, 300 sec: 5556.7). Total num frames: 1018212352. Throughput: 0: 5714.0. Samples: 1018214376. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:04,947][25689] Avg episode reward: [(0, '0.989')] [2022-07-11 02:23:06,031][26022] Updated weights on worker 0-0, policy_version 994354 (0.00078) [2022-07-11 02:23:07,630][26022] Updated weights on worker 0-0, policy_version 994364 (0.00083) [2022-07-11 02:23:09,688][26022] Updated weights on worker 0-0, policy_version 994374 (0.00091) [2022-07-11 02:23:09,973][25689] Fps is (10 sec: 5288.5, 60 sec: 5541.5, 300 sec: 5550.9). Total num frames: 1018240000. Throughput: 0: 5698.7. Samples: 1018247930. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:09,975][25689] Avg episode reward: [(0, '1.415')] [2022-07-11 02:23:11,321][26022] Updated weights on worker 0-0, policy_version 994384 (0.00094) [2022-07-11 02:23:13,242][26022] Updated weights on worker 0-0, policy_version 994394 (0.00083) [2022-07-11 02:23:14,984][25689] Fps is (10 sec: 5713.0, 60 sec: 5559.7, 300 sec: 5551.2). Total num frames: 1018269696. Throughput: 0: 4872.3. Samples: 1018264790. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:14,985][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 02:23:14,991][26022] Updated weights on worker 0-0, policy_version 994404 (0.00095) [2022-07-11 02:23:16,843][26022] Updated weights on worker 0-0, policy_version 994414 (0.00078) [2022-07-11 02:23:18,860][26022] Updated weights on worker 0-0, policy_version 994424 (0.00081) [2022-07-11 02:23:20,015][25689] Fps is (10 sec: 5710.7, 60 sec: 5531.7, 300 sec: 5556.7). Total num frames: 1018297344. Throughput: 0: 5730.4. Samples: 1018298266. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:20,015][25689] Avg episode reward: [(0, '1.103')] [2022-07-11 02:23:20,535][26022] Updated weights on worker 0-0, policy_version 994434 (0.00081) [2022-07-11 02:23:22,438][26022] Updated weights on worker 0-0, policy_version 994444 (0.00091) [2022-07-11 02:23:24,130][26022] Updated weights on worker 0-0, policy_version 994454 (0.00090) [2022-07-11 02:23:25,138][25689] Fps is (10 sec: 5446.2, 60 sec: 5543.3, 300 sec: 5552.3). Total num frames: 1018324992. Throughput: 0: 5818.2. Samples: 1018331850. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:25,138][25689] Avg episode reward: [(0, '1.104')] [2022-07-11 02:23:26,093][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:23:26,112][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000994464_1018331136.pth [2022-07-11 02:23:26,112][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000992510_1016330240.pth [2022-07-11 02:23:26,119][26022] Updated weights on worker 0-0, policy_version 994464 (0.00080) [2022-07-11 02:23:27,887][26022] Updated weights on worker 0-0, policy_version 994474 (0.00085) [2022-07-11 02:23:29,736][26022] Updated weights on worker 0-0, policy_version 994484 (0.00090) [2022-07-11 02:23:30,141][25689] Fps is (10 sec: 5561.9, 60 sec: 5561.5, 300 sec: 5553.4). Total num frames: 1018353664. Throughput: 0: 4990.2. Samples: 1018348570. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:30,141][25689] Avg episode reward: [(0, '1.352')] [2022-07-11 02:23:31,527][26022] Updated weights on worker 0-0, policy_version 994494 (0.00081) [2022-07-11 02:23:33,367][26022] Updated weights on worker 0-0, policy_version 994504 (0.00085) [2022-07-11 02:23:35,166][25689] Fps is (10 sec: 5513.6, 60 sec: 5513.7, 300 sec: 5543.9). Total num frames: 1018380288. Throughput: 0: 5818.9. Samples: 1018382228. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:35,167][25689] Avg episode reward: [(0, '0.810')] [2022-07-11 02:23:35,358][26022] Updated weights on worker 0-0, policy_version 994514 (0.00089) [2022-07-11 02:23:36,911][26022] Updated weights on worker 0-0, policy_version 994524 (0.00086) [2022-07-11 02:23:38,864][26022] Updated weights on worker 0-0, policy_version 994534 (0.00110) [2022-07-11 02:23:40,168][25689] Fps is (10 sec: 5616.9, 60 sec: 5554.6, 300 sec: 5555.4). Total num frames: 1018409984. Throughput: 0: 5854.8. Samples: 1018416258. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:40,168][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 02:23:40,846][26022] Updated weights on worker 0-0, policy_version 994544 (0.00092) [2022-07-11 02:23:42,375][26022] Updated weights on worker 0-0, policy_version 994554 (0.00084) [2022-07-11 02:23:44,424][26022] Updated weights on worker 0-0, policy_version 994564 (0.00084) [2022-07-11 02:23:45,202][25689] Fps is (10 sec: 5918.0, 60 sec: 5577.2, 300 sec: 5558.5). Total num frames: 1018439680. Throughput: 0: 5039.9. Samples: 1018432974. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:45,203][25689] Avg episode reward: [(0, '1.165')] [2022-07-11 02:23:45,959][26022] Updated weights on worker 0-0, policy_version 994574 (0.00094) [2022-07-11 02:23:47,922][26022] Updated weights on worker 0-0, policy_version 994584 (0.00084) [2022-07-11 02:23:49,601][26022] Updated weights on worker 0-0, policy_version 994594 (0.00089) [2022-07-11 02:23:50,212][25689] Fps is (10 sec: 5607.3, 60 sec: 5559.7, 300 sec: 5551.6). Total num frames: 1018466304. Throughput: 0: 5898.0. Samples: 1018466950. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:50,213][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 02:23:51,528][26022] Updated weights on worker 0-0, policy_version 994604 (0.00090) [2022-07-11 02:23:53,374][26022] Updated weights on worker 0-0, policy_version 994614 (0.00090) [2022-07-11 02:23:55,225][25689] Fps is (10 sec: 5415.1, 60 sec: 5560.1, 300 sec: 5555.0). Total num frames: 1018493952. Throughput: 0: 5891.9. Samples: 1018500408. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:23:55,225][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 02:23:55,406][26022] Updated weights on worker 0-0, policy_version 994624 (0.00092) [2022-07-11 02:23:57,178][26022] Updated weights on worker 0-0, policy_version 994634 (0.00082) [2022-07-11 02:23:59,067][26022] Updated weights on worker 0-0, policy_version 994644 (0.00426) [2022-07-11 02:24:00,231][25689] Fps is (10 sec: 5518.9, 60 sec: 5550.6, 300 sec: 5559.7). Total num frames: 1018521600. Throughput: 0: 5022.3. Samples: 1018517024. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:24:00,232][25689] Avg episode reward: [(0, '0.242')] [2022-07-11 02:24:00,646][26022] Updated weights on worker 0-0, policy_version 994654 (0.00083) [2022-07-11 02:24:02,978][26022] Updated weights on worker 0-0, policy_version 994664 (0.00090) [2022-07-11 02:24:04,955][26022] Updated weights on worker 0-0, policy_version 994674 (0.00509) [2022-07-11 02:24:05,337][25689] Fps is (10 sec: 5265.8, 60 sec: 5544.8, 300 sec: 5547.6). Total num frames: 1018547200. Throughput: 0: 5728.7. Samples: 1018548318. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:24:05,337][25689] Avg episode reward: [(0, '0.397')] [2022-07-11 02:24:06,726][26022] Updated weights on worker 0-0, policy_version 994684 (0.00092) [2022-07-11 02:24:08,544][26022] Updated weights on worker 0-0, policy_version 994694 (0.00085) [2022-07-11 02:24:10,278][26022] Updated weights on worker 0-0, policy_version 994704 (0.00087) [2022-07-11 02:24:10,409][25689] Fps is (10 sec: 5533.9, 60 sec: 5591.4, 300 sec: 5564.2). Total num frames: 1018577920. Throughput: 0: 5708.0. Samples: 1018582234. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:24:10,409][25689] Avg episode reward: [(0, '-0.301')] [2022-07-11 02:24:12,058][26022] Updated weights on worker 0-0, policy_version 994714 (0.00086) [2022-07-11 02:24:13,870][26022] Updated weights on worker 0-0, policy_version 994724 (0.00094) [2022-07-11 02:24:15,464][25689] Fps is (10 sec: 5763.4, 60 sec: 5553.5, 300 sec: 5556.4). Total num frames: 1018605568. Throughput: 0: 4884.0. Samples: 1018599258. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:24:15,465][25689] Avg episode reward: [(0, '-0.727')] [2022-07-11 02:24:15,812][26022] Updated weights on worker 0-0, policy_version 994734 (0.00091) [2022-07-11 02:24:17,417][26022] Updated weights on worker 0-0, policy_version 994744 (0.00089) [2022-07-11 02:24:19,483][26022] Updated weights on worker 0-0, policy_version 994754 (0.00096) [2022-07-11 02:24:20,474][25689] Fps is (10 sec: 5493.8, 60 sec: 5555.4, 300 sec: 5561.0). Total num frames: 1018633216. Throughput: 0: 5727.0. Samples: 1018632954. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 02:24:20,475][25689] Avg episode reward: [(0, '-0.621')] [2022-07-11 02:24:21,302][26022] Updated weights on worker 0-0, policy_version 994764 (0.00090) [2022-07-11 02:24:23,303][26022] Updated weights on worker 0-0, policy_version 994774 (0.00093) [2022-07-11 02:24:24,951][26022] Updated weights on worker 0-0, policy_version 994784 (0.00389) [2022-07-11 02:24:25,570][25689] Fps is (10 sec: 5471.7, 60 sec: 5557.8, 300 sec: 5552.8). Total num frames: 1018660864. Throughput: 0: 5825.6. Samples: 1018666188. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:25,570][25689] Avg episode reward: [(0, '0.037')] [2022-07-11 02:24:26,953][26022] Updated weights on worker 0-0, policy_version 994794 (0.00088) [2022-07-11 02:24:28,804][26022] Updated weights on worker 0-0, policy_version 994804 (0.00088) [2022-07-11 02:24:30,622][25689] Fps is (10 sec: 5448.7, 60 sec: 5536.4, 300 sec: 5549.4). Total num frames: 1018688512. Throughput: 0: 4953.2. Samples: 1018682356. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:30,624][25689] Avg episode reward: [(0, '0.097')] [2022-07-11 02:24:30,811][26022] Updated weights on worker 0-0, policy_version 994814 (0.00089) [2022-07-11 02:24:32,524][26022] Updated weights on worker 0-0, policy_version 994824 (0.00089) [2022-07-11 02:24:34,436][26022] Updated weights on worker 0-0, policy_version 994834 (0.00088) [2022-07-11 02:24:35,626][25689] Fps is (10 sec: 5600.6, 60 sec: 5572.3, 300 sec: 5549.6). Total num frames: 1018717184. Throughput: 0: 5767.5. Samples: 1018715542. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:35,626][25689] Avg episode reward: [(0, '-0.109')] [2022-07-11 02:24:36,255][26022] Updated weights on worker 0-0, policy_version 994844 (0.00090) [2022-07-11 02:24:38,097][26022] Updated weights on worker 0-0, policy_version 994854 (0.00084) [2022-07-11 02:24:39,973][26022] Updated weights on worker 0-0, policy_version 994864 (0.00082) [2022-07-11 02:24:40,663][25689] Fps is (10 sec: 5609.3, 60 sec: 5535.1, 300 sec: 5550.1). Total num frames: 1018744832. Throughput: 0: 5749.3. Samples: 1018749026. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:40,664][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 02:24:41,782][26022] Updated weights on worker 0-0, policy_version 994874 (0.00085) [2022-07-11 02:24:43,586][26022] Updated weights on worker 0-0, policy_version 994884 (0.00080) [2022-07-11 02:24:45,400][26022] Updated weights on worker 0-0, policy_version 994894 (0.00090) [2022-07-11 02:24:45,705][25689] Fps is (10 sec: 5587.9, 60 sec: 5517.5, 300 sec: 5549.3). Total num frames: 1018773504. Throughput: 0: 4949.1. Samples: 1018765838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:45,705][25689] Avg episode reward: [(0, '1.234')] [2022-07-11 02:24:47,298][26022] Updated weights on worker 0-0, policy_version 994904 (0.00086) [2022-07-11 02:24:48,978][26022] Updated weights on worker 0-0, policy_version 994914 (0.00087) [2022-07-11 02:24:50,754][25689] Fps is (10 sec: 5581.1, 60 sec: 5530.8, 300 sec: 5548.9). Total num frames: 1018801152. Throughput: 0: 5821.7. Samples: 1018799556. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:50,755][25689] Avg episode reward: [(0, '1.008')] [2022-07-11 02:24:50,848][26022] Updated weights on worker 0-0, policy_version 994924 (0.00088) [2022-07-11 02:24:52,721][26022] Updated weights on worker 0-0, policy_version 994934 (0.00049) [2022-07-11 02:24:54,712][26022] Updated weights on worker 0-0, policy_version 994944 (0.00090) [2022-07-11 02:24:55,790][25689] Fps is (10 sec: 5482.7, 60 sec: 5528.7, 300 sec: 5548.5). Total num frames: 1018828800. Throughput: 0: 5827.8. Samples: 1018833056. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:24:55,792][25689] Avg episode reward: [(0, '0.827')] [2022-07-11 02:24:56,241][26022] Updated weights on worker 0-0, policy_version 994954 (0.00093) [2022-07-11 02:24:58,247][26022] Updated weights on worker 0-0, policy_version 994964 (0.00086) [2022-07-11 02:25:00,071][26022] Updated weights on worker 0-0, policy_version 994974 (0.00090) [2022-07-11 02:25:00,807][25689] Fps is (10 sec: 5500.4, 60 sec: 5527.8, 300 sec: 5546.1). Total num frames: 1018856448. Throughput: 0: 5834.6. Samples: 1018866560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:00,809][25689] Avg episode reward: [(0, '1.033')] [2022-07-11 02:25:02,360][26022] Updated weights on worker 0-0, policy_version 994984 (0.00087) [2022-07-11 02:25:04,135][26022] Updated weights on worker 0-0, policy_version 994994 (0.00083) [2022-07-11 02:25:05,843][26022] Updated weights on worker 0-0, policy_version 995004 (0.00093) [2022-07-11 02:25:05,919][25689] Fps is (10 sec: 5459.5, 60 sec: 5561.0, 300 sec: 5554.8). Total num frames: 1018884096. Throughput: 0: 5708.2. Samples: 1018881224. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:05,919][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 02:25:07,857][26022] Updated weights on worker 0-0, policy_version 995014 (0.00096) [2022-07-11 02:25:09,457][26022] Updated weights on worker 0-0, policy_version 995024 (0.00084) [2022-07-11 02:25:10,922][25689] Fps is (10 sec: 5365.5, 60 sec: 5499.6, 300 sec: 5541.1). Total num frames: 1018910720. Throughput: 0: 5728.6. Samples: 1018915090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:10,924][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 02:25:11,632][26022] Updated weights on worker 0-0, policy_version 995034 (0.00084) [2022-07-11 02:25:13,049][26022] Updated weights on worker 0-0, policy_version 995044 (0.00080) [2022-07-11 02:25:15,191][26022] Updated weights on worker 0-0, policy_version 995054 (0.00086) [2022-07-11 02:25:15,943][25689] Fps is (10 sec: 5720.6, 60 sec: 5553.6, 300 sec: 5552.0). Total num frames: 1018941440. Throughput: 0: 5742.4. Samples: 1018948780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:15,943][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 02:25:16,931][26022] Updated weights on worker 0-0, policy_version 995064 (0.00085) [2022-07-11 02:25:18,687][26022] Updated weights on worker 0-0, policy_version 995074 (0.00087) [2022-07-11 02:25:20,579][26022] Updated weights on worker 0-0, policy_version 995084 (0.00091) [2022-07-11 02:25:20,959][25689] Fps is (10 sec: 5611.5, 60 sec: 5519.1, 300 sec: 5547.5). Total num frames: 1018967040. Throughput: 0: 4921.4. Samples: 1018965734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:20,959][25689] Avg episode reward: [(0, '0.900')] [2022-07-11 02:25:22,214][26022] Updated weights on worker 0-0, policy_version 995094 (0.00085) [2022-07-11 02:25:24,191][26022] Updated weights on worker 0-0, policy_version 995104 (0.00079) [2022-07-11 02:25:26,075][25689] Fps is (10 sec: 5356.4, 60 sec: 5534.2, 300 sec: 5543.0). Total num frames: 1018995712. Throughput: 0: 5867.1. Samples: 1018999484. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:26,076][25689] Avg episode reward: [(0, '1.385')] [2022-07-11 02:25:26,115][26022] Updated weights on worker 0-0, policy_version 995114 (0.00084) [2022-07-11 02:25:26,263][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:25:26,279][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000995115_1018997760.pth [2022-07-11 02:25:26,280][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000993160_1016995840.pth [2022-07-11 02:25:27,864][26022] Updated weights on worker 0-0, policy_version 995124 (0.00098) [2022-07-11 02:25:29,834][26022] Updated weights on worker 0-0, policy_version 995134 (0.00084) [2022-07-11 02:25:31,099][25689] Fps is (10 sec: 5655.1, 60 sec: 5553.8, 300 sec: 5546.6). Total num frames: 1019024384. Throughput: 0: 5833.8. Samples: 1019032796. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:31,100][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 02:25:31,519][26022] Updated weights on worker 0-0, policy_version 995144 (0.00091) [2022-07-11 02:25:33,310][26022] Updated weights on worker 0-0, policy_version 995154 (0.00089) [2022-07-11 02:25:35,326][26022] Updated weights on worker 0-0, policy_version 995164 (0.00084) [2022-07-11 02:25:36,167][25689] Fps is (10 sec: 5580.8, 60 sec: 5530.9, 300 sec: 5542.6). Total num frames: 1019052032. Throughput: 0: 4975.9. Samples: 1019049412. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:36,168][25689] Avg episode reward: [(0, '0.882')] [2022-07-11 02:25:37,086][26022] Updated weights on worker 0-0, policy_version 995174 (0.00086) [2022-07-11 02:25:38,988][26022] Updated weights on worker 0-0, policy_version 995184 (0.00085) [2022-07-11 02:25:40,844][26022] Updated weights on worker 0-0, policy_version 995194 (0.00088) [2022-07-11 02:25:41,195][25689] Fps is (10 sec: 5578.5, 60 sec: 5548.7, 300 sec: 5547.8). Total num frames: 1019080704. Throughput: 0: 5793.1. Samples: 1019082964. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:41,195][25689] Avg episode reward: [(0, '1.084')] [2022-07-11 02:25:42,541][26022] Updated weights on worker 0-0, policy_version 995204 (0.00083) [2022-07-11 02:25:44,358][26022] Updated weights on worker 0-0, policy_version 995214 (0.00085) [2022-07-11 02:25:46,183][26022] Updated weights on worker 0-0, policy_version 995224 (0.00088) [2022-07-11 02:25:46,311][25689] Fps is (10 sec: 5652.9, 60 sec: 5541.9, 300 sec: 5542.7). Total num frames: 1019109376. Throughput: 0: 5779.6. Samples: 1019116438. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:46,311][25689] Avg episode reward: [(0, '0.810')] [2022-07-11 02:25:48,219][26022] Updated weights on worker 0-0, policy_version 995234 (0.00087) [2022-07-11 02:25:49,842][26022] Updated weights on worker 0-0, policy_version 995244 (0.00088) [2022-07-11 02:25:51,366][25689] Fps is (10 sec: 5537.0, 60 sec: 5541.3, 300 sec: 5545.6). Total num frames: 1019137024. Throughput: 0: 4961.6. Samples: 1019133354. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:51,367][25689] Avg episode reward: [(0, '0.712')] [2022-07-11 02:25:51,786][26022] Updated weights on worker 0-0, policy_version 995254 (0.00061) [2022-07-11 02:25:53,507][26022] Updated weights on worker 0-0, policy_version 995264 (0.00104) [2022-07-11 02:25:55,581][26022] Updated weights on worker 0-0, policy_version 995274 (0.00090) [2022-07-11 02:25:56,449][25689] Fps is (10 sec: 5656.4, 60 sec: 5570.9, 300 sec: 5547.6). Total num frames: 1019166720. Throughput: 0: 5815.9. Samples: 1019167370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:25:56,449][25689] Avg episode reward: [(0, '0.608')] [2022-07-11 02:25:56,915][26022] Updated weights on worker 0-0, policy_version 995284 (0.00097) [2022-07-11 02:25:59,170][26022] Updated weights on worker 0-0, policy_version 995294 (0.00055) [2022-07-11 02:26:00,980][26022] Updated weights on worker 0-0, policy_version 995304 (0.00088) [2022-07-11 02:26:01,500][25689] Fps is (10 sec: 5557.7, 60 sec: 5550.9, 300 sec: 5551.1). Total num frames: 1019193344. Throughput: 0: 5791.0. Samples: 1019200552. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:01,500][25689] Avg episode reward: [(0, '0.339')] [2022-07-11 02:26:03,128][26022] Updated weights on worker 0-0, policy_version 995314 (0.00095) [2022-07-11 02:26:05,119][26022] Updated weights on worker 0-0, policy_version 995324 (0.00087) [2022-07-11 02:26:06,589][25689] Fps is (10 sec: 5453.2, 60 sec: 5569.8, 300 sec: 5553.0). Total num frames: 1019222016. Throughput: 0: 4873.6. Samples: 1019215264. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:06,589][25689] Avg episode reward: [(0, '-0.405')] [2022-07-11 02:26:06,599][26022] Updated weights on worker 0-0, policy_version 995334 (0.00086) [2022-07-11 02:26:08,570][26022] Updated weights on worker 0-0, policy_version 995344 (0.00087) [2022-07-11 02:26:10,380][26022] Updated weights on worker 0-0, policy_version 995354 (0.00084) [2022-07-11 02:26:11,595][25689] Fps is (10 sec: 5477.7, 60 sec: 5569.6, 300 sec: 5546.5). Total num frames: 1019248640. Throughput: 0: 5722.6. Samples: 1019249112. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:11,595][25689] Avg episode reward: [(0, '0.421')] [2022-07-11 02:26:12,343][26022] Updated weights on worker 0-0, policy_version 995364 (0.00095) [2022-07-11 02:26:14,066][26022] Updated weights on worker 0-0, policy_version 995374 (0.00092) [2022-07-11 02:26:15,734][26022] Updated weights on worker 0-0, policy_version 995384 (0.00092) [2022-07-11 02:26:16,644][25689] Fps is (10 sec: 5499.5, 60 sec: 5533.3, 300 sec: 5549.2). Total num frames: 1019277312. Throughput: 0: 5723.0. Samples: 1019282946. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:16,644][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 02:26:17,819][26022] Updated weights on worker 0-0, policy_version 995394 (0.00089) [2022-07-11 02:26:19,359][26022] Updated weights on worker 0-0, policy_version 995404 (0.00077) [2022-07-11 02:26:21,340][26022] Updated weights on worker 0-0, policy_version 995414 (0.00083) [2022-07-11 02:26:21,658][25689] Fps is (10 sec: 5596.6, 60 sec: 5567.1, 300 sec: 5547.8). Total num frames: 1019304960. Throughput: 0: 4920.7. Samples: 1019299742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:21,658][25689] Avg episode reward: [(0, '0.754')] [2022-07-11 02:26:23,091][26022] Updated weights on worker 0-0, policy_version 995424 (0.00089) [2022-07-11 02:26:25,037][26022] Updated weights on worker 0-0, policy_version 995434 (0.00086) [2022-07-11 02:26:26,716][25689] Fps is (10 sec: 5591.4, 60 sec: 5572.5, 300 sec: 5546.8). Total num frames: 1019333632. Throughput: 0: 5869.1. Samples: 1019333394. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:26,717][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 02:26:26,786][26022] Updated weights on worker 0-0, policy_version 995444 (0.00088) [2022-07-11 02:26:28,670][26022] Updated weights on worker 0-0, policy_version 995454 (0.00345) [2022-07-11 02:26:30,555][26022] Updated weights on worker 0-0, policy_version 995464 (0.00097) [2022-07-11 02:26:31,733][25689] Fps is (10 sec: 5590.0, 60 sec: 5556.2, 300 sec: 5546.8). Total num frames: 1019361280. Throughput: 0: 5835.8. Samples: 1019366636. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:31,733][25689] Avg episode reward: [(0, '1.459')] [2022-07-11 02:26:32,497][26022] Updated weights on worker 0-0, policy_version 995474 (0.00090) [2022-07-11 02:26:34,188][26022] Updated weights on worker 0-0, policy_version 995484 (0.00089) [2022-07-11 02:26:36,024][26022] Updated weights on worker 0-0, policy_version 995494 (0.00109) [2022-07-11 02:26:36,781][25689] Fps is (10 sec: 5595.6, 60 sec: 5574.9, 300 sec: 5549.8). Total num frames: 1019389952. Throughput: 0: 4997.7. Samples: 1019383588. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:36,782][25689] Avg episode reward: [(0, '0.809')] [2022-07-11 02:26:37,752][26022] Updated weights on worker 0-0, policy_version 995504 (0.00094) [2022-07-11 02:26:39,684][26022] Updated weights on worker 0-0, policy_version 995514 (0.00088) [2022-07-11 02:26:41,464][26022] Updated weights on worker 0-0, policy_version 995524 (0.00078) [2022-07-11 02:26:41,808][25689] Fps is (10 sec: 5691.9, 60 sec: 5575.1, 300 sec: 5550.7). Total num frames: 1019418624. Throughput: 0: 5826.2. Samples: 1019417138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:41,808][25689] Avg episode reward: [(0, '0.709')] [2022-07-11 02:26:43,399][26022] Updated weights on worker 0-0, policy_version 995534 (0.00094) [2022-07-11 02:26:45,117][26022] Updated weights on worker 0-0, policy_version 995544 (0.00083) [2022-07-11 02:26:46,868][25689] Fps is (10 sec: 5583.7, 60 sec: 5563.3, 300 sec: 5550.8). Total num frames: 1019446272. Throughput: 0: 5824.6. Samples: 1019450768. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:46,870][25689] Avg episode reward: [(0, '0.120')] [2022-07-11 02:26:47,140][26022] Updated weights on worker 0-0, policy_version 995554 (0.00091) [2022-07-11 02:26:49,024][26022] Updated weights on worker 0-0, policy_version 995564 (0.00084) [2022-07-11 02:26:50,707][26022] Updated weights on worker 0-0, policy_version 995574 (0.00083) [2022-07-11 02:26:51,874][25689] Fps is (10 sec: 5391.2, 60 sec: 5550.9, 300 sec: 5544.0). Total num frames: 1019472896. Throughput: 0: 5000.8. Samples: 1019467360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:51,876][25689] Avg episode reward: [(0, '0.256')] [2022-07-11 02:26:52,505][26022] Updated weights on worker 0-0, policy_version 995584 (0.00095) [2022-07-11 02:26:54,261][26022] Updated weights on worker 0-0, policy_version 995594 (0.00083) [2022-07-11 02:26:56,197][26022] Updated weights on worker 0-0, policy_version 995604 (0.00083) [2022-07-11 02:26:56,903][25689] Fps is (10 sec: 5612.4, 60 sec: 5555.8, 300 sec: 5550.9). Total num frames: 1019502592. Throughput: 0: 5836.0. Samples: 1019501018. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:26:56,904][25689] Avg episode reward: [(0, '0.415')] [2022-07-11 02:26:57,967][26022] Updated weights on worker 0-0, policy_version 995614 (0.00093) [2022-07-11 02:26:59,869][26022] Updated weights on worker 0-0, policy_version 995624 (0.00090) [2022-07-11 02:27:01,582][26022] Updated weights on worker 0-0, policy_version 995634 (0.00087) [2022-07-11 02:27:01,923][25689] Fps is (10 sec: 5706.8, 60 sec: 5575.6, 300 sec: 5556.3). Total num frames: 1019530240. Throughput: 0: 5839.1. Samples: 1019534592. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:01,923][25689] Avg episode reward: [(0, '0.329')] [2022-07-11 02:27:04,000][26022] Updated weights on worker 0-0, policy_version 995644 (0.00092) [2022-07-11 02:27:05,823][26022] Updated weights on worker 0-0, policy_version 995654 (0.00089) [2022-07-11 02:27:07,014][25689] Fps is (10 sec: 5265.9, 60 sec: 5524.5, 300 sec: 5549.1). Total num frames: 1019555840. Throughput: 0: 4878.8. Samples: 1019549060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:07,015][25689] Avg episode reward: [(0, '0.294')] [2022-07-11 02:27:07,517][26022] Updated weights on worker 0-0, policy_version 995664 (0.00092) [2022-07-11 02:27:09,386][26022] Updated weights on worker 0-0, policy_version 995674 (0.00088) [2022-07-11 02:27:11,465][26022] Updated weights on worker 0-0, policy_version 995684 (0.00092) [2022-07-11 02:27:12,063][25689] Fps is (10 sec: 5351.9, 60 sec: 5554.5, 300 sec: 5548.7). Total num frames: 1019584512. Throughput: 0: 5703.3. Samples: 1019582504. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:12,064][25689] Avg episode reward: [(0, '-0.007')] [2022-07-11 02:27:13,171][26022] Updated weights on worker 0-0, policy_version 995694 (0.00081) [2022-07-11 02:27:14,997][26022] Updated weights on worker 0-0, policy_version 995704 (0.00083) [2022-07-11 02:27:16,860][26022] Updated weights on worker 0-0, policy_version 995714 (0.00088) [2022-07-11 02:27:17,156][25689] Fps is (10 sec: 5654.4, 60 sec: 5550.5, 300 sec: 5545.3). Total num frames: 1019613184. Throughput: 0: 5694.4. Samples: 1019616348. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:17,156][25689] Avg episode reward: [(0, '-0.066')] [2022-07-11 02:27:18,657][26022] Updated weights on worker 0-0, policy_version 995724 (0.00083) [2022-07-11 02:27:20,468][26022] Updated weights on worker 0-0, policy_version 995734 (0.00092) [2022-07-11 02:27:22,179][25689] Fps is (10 sec: 5567.5, 60 sec: 5549.6, 300 sec: 5549.5). Total num frames: 1019640832. Throughput: 0: 5684.6. Samples: 1019649742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:22,180][25689] Avg episode reward: [(0, '-1.021')] [2022-07-11 02:27:22,325][26022] Updated weights on worker 0-0, policy_version 995744 (0.00090) [2022-07-11 02:27:24,171][26022] Updated weights on worker 0-0, policy_version 995754 (0.00092) [2022-07-11 02:27:25,955][26022] Updated weights on worker 0-0, policy_version 995764 (0.00086) [2022-07-11 02:27:26,560][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:27:26,574][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000995766_1019664384.pth [2022-07-11 02:27:26,579][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000993814_1017665536.pth [2022-07-11 02:27:27,244][25689] Fps is (10 sec: 5481.4, 60 sec: 5532.1, 300 sec: 5548.6). Total num frames: 1019668480. Throughput: 0: 5804.7. Samples: 1019666486. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:27,244][25689] Avg episode reward: [(0, '-0.730')] [2022-07-11 02:27:27,845][26022] Updated weights on worker 0-0, policy_version 995774 (0.00051) [2022-07-11 02:27:29,848][26022] Updated weights on worker 0-0, policy_version 995784 (0.00088) [2022-07-11 02:27:31,437][26022] Updated weights on worker 0-0, policy_version 995794 (0.00093) [2022-07-11 02:27:32,313][25689] Fps is (10 sec: 5456.8, 60 sec: 5527.4, 300 sec: 5541.5). Total num frames: 1019696128. Throughput: 0: 5789.0. Samples: 1019699726. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:32,313][25689] Avg episode reward: [(0, '-0.566')] [2022-07-11 02:27:33,507][26022] Updated weights on worker 0-0, policy_version 995804 (0.00081) [2022-07-11 02:27:35,184][26022] Updated weights on worker 0-0, policy_version 995814 (0.00091) [2022-07-11 02:27:37,165][26022] Updated weights on worker 0-0, policy_version 995824 (0.00084) [2022-07-11 02:27:37,373][25689] Fps is (10 sec: 5661.1, 60 sec: 5543.1, 300 sec: 5548.8). Total num frames: 1019725824. Throughput: 0: 5787.7. Samples: 1019733360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:37,374][25689] Avg episode reward: [(0, '-0.697')] [2022-07-11 02:27:38,846][26022] Updated weights on worker 0-0, policy_version 995834 (0.00086) [2022-07-11 02:27:40,732][26022] Updated weights on worker 0-0, policy_version 995844 (0.00090) [2022-07-11 02:27:42,441][25689] Fps is (10 sec: 5661.7, 60 sec: 5522.5, 300 sec: 5545.8). Total num frames: 1019753472. Throughput: 0: 4957.2. Samples: 1019750184. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:42,441][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 02:27:42,659][26022] Updated weights on worker 0-0, policy_version 995854 (0.00090) [2022-07-11 02:27:44,230][26022] Updated weights on worker 0-0, policy_version 995864 (0.00097) [2022-07-11 02:27:46,152][26022] Updated weights on worker 0-0, policy_version 995874 (0.00088) [2022-07-11 02:27:47,566][25689] Fps is (10 sec: 5525.8, 60 sec: 5533.5, 300 sec: 5547.0). Total num frames: 1019782144. Throughput: 0: 5770.3. Samples: 1019783748. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:47,567][25689] Avg episode reward: [(0, '0.811')] [2022-07-11 02:27:48,039][26022] Updated weights on worker 0-0, policy_version 995884 (0.00085) [2022-07-11 02:27:49,771][26022] Updated weights on worker 0-0, policy_version 995894 (0.00095) [2022-07-11 02:27:51,741][26022] Updated weights on worker 0-0, policy_version 995904 (0.00093) [2022-07-11 02:27:52,586][25689] Fps is (10 sec: 5551.4, 60 sec: 5549.1, 300 sec: 5547.0). Total num frames: 1019809792. Throughput: 0: 5813.4. Samples: 1019817584. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:52,586][25689] Avg episode reward: [(0, '1.186')] [2022-07-11 02:27:53,395][26022] Updated weights on worker 0-0, policy_version 995914 (0.00090) [2022-07-11 02:27:55,497][26022] Updated weights on worker 0-0, policy_version 995924 (0.00086) [2022-07-11 02:27:57,003][26022] Updated weights on worker 0-0, policy_version 995934 (0.00087) [2022-07-11 02:27:57,597][25689] Fps is (10 sec: 5512.3, 60 sec: 5517.0, 300 sec: 5544.9). Total num frames: 1019837440. Throughput: 0: 5001.7. Samples: 1019834512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:27:57,599][25689] Avg episode reward: [(0, '1.356')] [2022-07-11 02:27:59,092][26022] Updated weights on worker 0-0, policy_version 995944 (0.00092) [2022-07-11 02:28:00,769][26022] Updated weights on worker 0-0, policy_version 995954 (0.00515) [2022-07-11 02:28:02,633][25689] Fps is (10 sec: 5300.1, 60 sec: 5481.8, 300 sec: 5545.1). Total num frames: 1019863040. Throughput: 0: 5821.8. Samples: 1019867736. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:28:02,639][25689] Avg episode reward: [(0, '0.016')] [2022-07-11 02:28:03,098][26022] Updated weights on worker 0-0, policy_version 995964 (0.00082) [2022-07-11 02:28:04,916][26022] Updated weights on worker 0-0, policy_version 995974 (0.00093) [2022-07-11 02:28:06,749][26022] Updated weights on worker 0-0, policy_version 995984 (0.00083) [2022-07-11 02:28:07,749][25689] Fps is (10 sec: 5446.7, 60 sec: 5547.0, 300 sec: 5550.3). Total num frames: 1019892736. Throughput: 0: 5723.5. Samples: 1019899268. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:28:07,750][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 02:28:08,486][26022] Updated weights on worker 0-0, policy_version 995994 (0.00089) [2022-07-11 02:28:10,396][26022] Updated weights on worker 0-0, policy_version 996004 (0.00091) [2022-07-11 02:28:12,210][26022] Updated weights on worker 0-0, policy_version 996014 (0.00089) [2022-07-11 02:28:12,790][25689] Fps is (10 sec: 5746.3, 60 sec: 5547.7, 300 sec: 5546.3). Total num frames: 1019921408. Throughput: 0: 4873.5. Samples: 1019916048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:28:12,791][25689] Avg episode reward: [(0, '0.294')] [2022-07-11 02:28:13,995][26022] Updated weights on worker 0-0, policy_version 996024 (0.00088) [2022-07-11 02:28:15,911][26022] Updated weights on worker 0-0, policy_version 996034 (0.00090) [2022-07-11 02:28:17,636][26022] Updated weights on worker 0-0, policy_version 996044 (0.00084) [2022-07-11 02:28:17,819][25689] Fps is (10 sec: 5694.7, 60 sec: 5553.6, 300 sec: 5549.7). Total num frames: 1019950080. Throughput: 0: 5708.8. Samples: 1019949954. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:28:17,819][25689] Avg episode reward: [(0, '-0.549')] [2022-07-11 02:28:19,551][26022] Updated weights on worker 0-0, policy_version 996054 (0.00082) [2022-07-11 02:28:21,232][26022] Updated weights on worker 0-0, policy_version 996064 (0.00086) [2022-07-11 02:28:22,918][25689] Fps is (10 sec: 5459.6, 60 sec: 5529.8, 300 sec: 5546.7). Total num frames: 1019976704. Throughput: 0: 5706.7. Samples: 1019983500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 02:28:22,919][25689] Avg episode reward: [(0, '-0.491')] [2022-07-11 02:28:23,359][26022] Updated weights on worker 0-0, policy_version 996074 (0.00093) [2022-07-11 02:28:25,007][26022] Updated weights on worker 0-0, policy_version 996084 (0.00087) [2022-07-11 02:28:27,112][26022] Updated weights on worker 0-0, policy_version 996094 (0.00089) [2022-07-11 02:28:28,032][25689] Fps is (10 sec: 5514.3, 60 sec: 5558.9, 300 sec: 5548.1). Total num frames: 1020006400. Throughput: 0: 4968.5. Samples: 1020000046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:28,033][25689] Avg episode reward: [(0, '-0.478')] [2022-07-11 02:28:28,575][26022] Updated weights on worker 0-0, policy_version 996104 (0.00095) [2022-07-11 02:28:30,675][26022] Updated weights on worker 0-0, policy_version 996114 (0.00085) [2022-07-11 02:28:32,322][26022] Updated weights on worker 0-0, policy_version 996124 (0.00083) [2022-07-11 02:28:33,039][25689] Fps is (10 sec: 5666.1, 60 sec: 5564.7, 300 sec: 5551.9). Total num frames: 1020034048. Throughput: 0: 5795.6. Samples: 1020033402. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:33,039][25689] Avg episode reward: [(0, '0.187')] [2022-07-11 02:28:34,319][26022] Updated weights on worker 0-0, policy_version 996134 (0.00086) [2022-07-11 02:28:35,996][26022] Updated weights on worker 0-0, policy_version 996144 (0.00085) [2022-07-11 02:28:37,929][26022] Updated weights on worker 0-0, policy_version 996154 (0.00082) [2022-07-11 02:28:38,053][25689] Fps is (10 sec: 5518.0, 60 sec: 5535.1, 300 sec: 5544.8). Total num frames: 1020061696. Throughput: 0: 5785.4. Samples: 1020067018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:38,054][25689] Avg episode reward: [(0, '0.133')] [2022-07-11 02:28:39,867][26022] Updated weights on worker 0-0, policy_version 996164 (0.00081) [2022-07-11 02:28:41,496][26022] Updated weights on worker 0-0, policy_version 996174 (0.00090) [2022-07-11 02:28:43,097][25689] Fps is (10 sec: 5599.4, 60 sec: 5554.2, 300 sec: 5541.2). Total num frames: 1020090368. Throughput: 0: 4975.4. Samples: 1020083896. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:43,097][25689] Avg episode reward: [(0, '-0.186')] [2022-07-11 02:28:43,425][26022] Updated weights on worker 0-0, policy_version 996184 (0.00084) [2022-07-11 02:28:44,998][26022] Updated weights on worker 0-0, policy_version 996194 (0.00085) [2022-07-11 02:28:47,008][26022] Updated weights on worker 0-0, policy_version 996204 (0.00084) [2022-07-11 02:28:48,198][25689] Fps is (10 sec: 5652.8, 60 sec: 5556.4, 300 sec: 5546.4). Total num frames: 1020119040. Throughput: 0: 5822.6. Samples: 1020117460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:48,198][25689] Avg episode reward: [(0, '0.128')] [2022-07-11 02:28:48,861][26022] Updated weights on worker 0-0, policy_version 996214 (0.00090) [2022-07-11 02:28:50,624][26022] Updated weights on worker 0-0, policy_version 996224 (0.00099) [2022-07-11 02:28:52,563][26022] Updated weights on worker 0-0, policy_version 996234 (0.00049) [2022-07-11 02:28:53,230][25689] Fps is (10 sec: 5558.3, 60 sec: 5555.4, 300 sec: 5546.0). Total num frames: 1020146688. Throughput: 0: 5812.4. Samples: 1020150760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:53,230][25689] Avg episode reward: [(0, '0.362')] [2022-07-11 02:28:54,613][26022] Updated weights on worker 0-0, policy_version 996244 (0.00095) [2022-07-11 02:28:56,311][26022] Updated weights on worker 0-0, policy_version 996254 (0.00089) [2022-07-11 02:28:58,238][25689] Fps is (10 sec: 5507.7, 60 sec: 5555.6, 300 sec: 5546.0). Total num frames: 1020174336. Throughput: 0: 5805.2. Samples: 1020184190. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:28:58,238][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 02:28:58,241][26022] Updated weights on worker 0-0, policy_version 996264 (0.00087) [2022-07-11 02:28:59,893][26022] Updated weights on worker 0-0, policy_version 996274 (0.00087) [2022-07-11 02:29:02,320][26022] Updated weights on worker 0-0, policy_version 996284 (0.00087) [2022-07-11 02:29:03,259][25689] Fps is (10 sec: 5309.3, 60 sec: 5556.9, 300 sec: 5547.5). Total num frames: 1020199936. Throughput: 0: 5807.4. Samples: 1020200984. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:03,259][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 02:29:03,823][26022] Updated weights on worker 0-0, policy_version 996294 (0.00082) [2022-07-11 02:29:06,127][26022] Updated weights on worker 0-0, policy_version 996304 (0.00085) [2022-07-11 02:29:07,596][26022] Updated weights on worker 0-0, policy_version 996314 (0.00082) [2022-07-11 02:29:08,384][25689] Fps is (10 sec: 5348.6, 60 sec: 5539.2, 300 sec: 5539.7). Total num frames: 1020228608. Throughput: 0: 5686.8. Samples: 1020232258. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:08,385][25689] Avg episode reward: [(0, '0.050')] [2022-07-11 02:29:09,525][26022] Updated weights on worker 0-0, policy_version 996324 (0.00050) [2022-07-11 02:29:11,188][26022] Updated weights on worker 0-0, policy_version 996334 (0.00095) [2022-07-11 02:29:13,200][26022] Updated weights on worker 0-0, policy_version 996344 (0.00088) [2022-07-11 02:29:13,427][25689] Fps is (10 sec: 5639.6, 60 sec: 5539.1, 300 sec: 5543.4). Total num frames: 1020257280. Throughput: 0: 5710.9. Samples: 1020266104. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:13,427][25689] Avg episode reward: [(0, '0.725')] [2022-07-11 02:29:15,108][26022] Updated weights on worker 0-0, policy_version 996354 (0.00097) [2022-07-11 02:29:16,666][26022] Updated weights on worker 0-0, policy_version 996364 (0.00084) [2022-07-11 02:29:18,496][25689] Fps is (10 sec: 5569.8, 60 sec: 5518.5, 300 sec: 5542.2). Total num frames: 1020284928. Throughput: 0: 4884.2. Samples: 1020283136. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:18,497][25689] Avg episode reward: [(0, '0.687')] [2022-07-11 02:29:18,645][26022] Updated weights on worker 0-0, policy_version 996374 (0.00088) [2022-07-11 02:29:20,323][26022] Updated weights on worker 0-0, policy_version 996384 (0.00098) [2022-07-11 02:29:22,081][26022] Updated weights on worker 0-0, policy_version 996394 (0.00081) [2022-07-11 02:29:23,517][25689] Fps is (10 sec: 5581.6, 60 sec: 5559.4, 300 sec: 5547.1). Total num frames: 1020313600. Throughput: 0: 5729.8. Samples: 1020317060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:23,518][25689] Avg episode reward: [(0, '0.251')] [2022-07-11 02:29:24,186][26022] Updated weights on worker 0-0, policy_version 996404 (0.00090) [2022-07-11 02:29:25,823][26022] Updated weights on worker 0-0, policy_version 996414 (0.00093) [2022-07-11 02:29:26,738][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:29:26,750][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000996419_1020333056.pth [2022-07-11 02:29:26,751][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000994464_1018331136.pth [2022-07-11 02:29:26,751][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_000996419_1020333056.pth.milestone [2022-07-11 02:29:27,864][26022] Updated weights on worker 0-0, policy_version 996424 (0.00088) [2022-07-11 02:29:28,591][25689] Fps is (10 sec: 5680.5, 60 sec: 5546.2, 300 sec: 5550.1). Total num frames: 1020342272. Throughput: 0: 5854.9. Samples: 1020350566. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:28,593][25689] Avg episode reward: [(0, '0.033')] [2022-07-11 02:29:29,451][26022] Updated weights on worker 0-0, policy_version 996434 (0.00096) [2022-07-11 02:29:31,424][26022] Updated weights on worker 0-0, policy_version 996444 (0.00082) [2022-07-11 02:29:33,131][26022] Updated weights on worker 0-0, policy_version 996454 (0.00088) [2022-07-11 02:29:33,619][25689] Fps is (10 sec: 5676.7, 60 sec: 5561.2, 300 sec: 5549.7). Total num frames: 1020370944. Throughput: 0: 5012.1. Samples: 1020367308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:33,619][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 02:29:35,131][26022] Updated weights on worker 0-0, policy_version 996464 (0.00086) [2022-07-11 02:29:36,864][26022] Updated weights on worker 0-0, policy_version 996474 (0.00085) [2022-07-11 02:29:38,641][25689] Fps is (10 sec: 5603.7, 60 sec: 5560.4, 300 sec: 5549.9). Total num frames: 1020398592. Throughput: 0: 5846.4. Samples: 1020400914. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:38,642][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 02:29:38,683][26022] Updated weights on worker 0-0, policy_version 996484 (0.00086) [2022-07-11 02:29:40,469][26022] Updated weights on worker 0-0, policy_version 996494 (0.00089) [2022-07-11 02:29:42,336][26022] Updated weights on worker 0-0, policy_version 996504 (0.00094) [2022-07-11 02:29:43,689][25689] Fps is (10 sec: 5491.2, 60 sec: 5543.2, 300 sec: 5546.4). Total num frames: 1020426240. Throughput: 0: 5824.8. Samples: 1020434556. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:43,689][25689] Avg episode reward: [(0, '0.484')] [2022-07-11 02:29:44,214][26022] Updated weights on worker 0-0, policy_version 996514 (0.00092) [2022-07-11 02:29:46,084][26022] Updated weights on worker 0-0, policy_version 996524 (0.00093) [2022-07-11 02:29:47,853][26022] Updated weights on worker 0-0, policy_version 996534 (0.00090) [2022-07-11 02:29:48,816][25689] Fps is (10 sec: 5535.5, 60 sec: 5540.8, 300 sec: 5548.4). Total num frames: 1020454912. Throughput: 0: 4963.0. Samples: 1020450942. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:48,816][25689] Avg episode reward: [(0, '0.940')] [2022-07-11 02:29:49,822][26022] Updated weights on worker 0-0, policy_version 996544 (0.00087) [2022-07-11 02:29:51,377][26022] Updated weights on worker 0-0, policy_version 996554 (0.00083) [2022-07-11 02:29:53,675][26022] Updated weights on worker 0-0, policy_version 996564 (0.00537) [2022-07-11 02:29:53,834][25689] Fps is (10 sec: 5551.5, 60 sec: 5542.1, 300 sec: 5548.7). Total num frames: 1020482560. Throughput: 0: 5778.7. Samples: 1020484124. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:53,835][25689] Avg episode reward: [(0, '0.903')] [2022-07-11 02:29:55,152][26022] Updated weights on worker 0-0, policy_version 996574 (0.00086) [2022-07-11 02:29:57,287][26022] Updated weights on worker 0-0, policy_version 996584 (0.00092) [2022-07-11 02:29:58,843][25689] Fps is (10 sec: 5616.9, 60 sec: 5558.9, 300 sec: 5552.3). Total num frames: 1020511232. Throughput: 0: 5775.9. Samples: 1020517594. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:29:58,844][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 02:29:58,924][26022] Updated weights on worker 0-0, policy_version 996594 (0.00080) [2022-07-11 02:30:00,849][26022] Updated weights on worker 0-0, policy_version 996604 (0.00089) [2022-07-11 02:30:03,095][26022] Updated weights on worker 0-0, policy_version 996614 (0.00081) [2022-07-11 02:30:03,896][25689] Fps is (10 sec: 5393.6, 60 sec: 5555.9, 300 sec: 5546.5). Total num frames: 1020536832. Throughput: 0: 4917.9. Samples: 1020533934. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:03,898][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 02:30:05,011][26022] Updated weights on worker 0-0, policy_version 996624 (0.00085) [2022-07-11 02:30:06,864][26022] Updated weights on worker 0-0, policy_version 996634 (0.00088) [2022-07-11 02:30:08,591][26022] Updated weights on worker 0-0, policy_version 996644 (0.00085) [2022-07-11 02:30:08,937][25689] Fps is (10 sec: 5173.8, 60 sec: 5529.9, 300 sec: 5545.8). Total num frames: 1020563456. Throughput: 0: 5691.4. Samples: 1020565458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:08,937][25689] Avg episode reward: [(0, '0.880')] [2022-07-11 02:30:10,589][26022] Updated weights on worker 0-0, policy_version 996654 (0.00102) [2022-07-11 02:30:12,563][26022] Updated weights on worker 0-0, policy_version 996664 (0.00091) [2022-07-11 02:30:13,953][25689] Fps is (10 sec: 5599.8, 60 sec: 5549.2, 300 sec: 5542.5). Total num frames: 1020593152. Throughput: 0: 5697.3. Samples: 1020598752. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:13,955][25689] Avg episode reward: [(0, '0.765')] [2022-07-11 02:30:14,149][26022] Updated weights on worker 0-0, policy_version 996674 (0.00087) [2022-07-11 02:30:16,084][26022] Updated weights on worker 0-0, policy_version 996684 (0.00090) [2022-07-11 02:30:17,639][26022] Updated weights on worker 0-0, policy_version 996694 (0.00084) [2022-07-11 02:30:18,991][25689] Fps is (10 sec: 5499.5, 60 sec: 5518.2, 300 sec: 5542.1). Total num frames: 1020618752. Throughput: 0: 4872.3. Samples: 1020615764. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:18,992][25689] Avg episode reward: [(0, '0.870')] [2022-07-11 02:30:19,653][26022] Updated weights on worker 0-0, policy_version 996704 (0.00093) [2022-07-11 02:30:21,482][26022] Updated weights on worker 0-0, policy_version 996714 (0.00087) [2022-07-11 02:30:23,328][26022] Updated weights on worker 0-0, policy_version 996724 (0.00087) [2022-07-11 02:30:24,008][25689] Fps is (10 sec: 5397.8, 60 sec: 5518.6, 300 sec: 5543.9). Total num frames: 1020647424. Throughput: 0: 5735.9. Samples: 1020649294. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:24,009][25689] Avg episode reward: [(0, '-0.146')] [2022-07-11 02:30:25,159][26022] Updated weights on worker 0-0, policy_version 996734 (0.00079) [2022-07-11 02:30:27,168][26022] Updated weights on worker 0-0, policy_version 996744 (0.00090) [2022-07-11 02:30:28,702][26022] Updated weights on worker 0-0, policy_version 996754 (0.00096) [2022-07-11 02:30:29,066][25689] Fps is (10 sec: 5691.8, 60 sec: 5520.0, 300 sec: 5543.3). Total num frames: 1020676096. Throughput: 0: 5825.5. Samples: 1020682724. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:29,068][25689] Avg episode reward: [(0, '-0.062')] [2022-07-11 02:30:30,847][26022] Updated weights on worker 0-0, policy_version 996764 (0.00085) [2022-07-11 02:30:32,454][26022] Updated weights on worker 0-0, policy_version 996774 (0.00088) [2022-07-11 02:30:34,075][25689] Fps is (10 sec: 5696.4, 60 sec: 5521.8, 300 sec: 5547.8). Total num frames: 1020704768. Throughput: 0: 5002.6. Samples: 1020699412. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:34,075][25689] Avg episode reward: [(0, '-0.195')] [2022-07-11 02:30:34,340][26022] Updated weights on worker 0-0, policy_version 996784 (0.00089) [2022-07-11 02:30:36,326][26022] Updated weights on worker 0-0, policy_version 996794 (0.00091) [2022-07-11 02:30:38,187][26022] Updated weights on worker 0-0, policy_version 996804 (0.00083) [2022-07-11 02:30:39,082][25689] Fps is (10 sec: 5623.1, 60 sec: 5523.2, 300 sec: 5544.8). Total num frames: 1020732416. Throughput: 0: 5810.5. Samples: 1020732502. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:39,084][25689] Avg episode reward: [(0, '-0.198')] [2022-07-11 02:30:40,023][26022] Updated weights on worker 0-0, policy_version 996814 (0.00087) [2022-07-11 02:30:41,859][26022] Updated weights on worker 0-0, policy_version 996824 (0.00093) [2022-07-11 02:30:43,506][26022] Updated weights on worker 0-0, policy_version 996834 (0.00087) [2022-07-11 02:30:44,087][25689] Fps is (10 sec: 5522.9, 60 sec: 5527.1, 300 sec: 5543.4). Total num frames: 1020760064. Throughput: 0: 5827.2. Samples: 1020766298. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:44,089][25689] Avg episode reward: [(0, '0.504')] [2022-07-11 02:30:45,567][26022] Updated weights on worker 0-0, policy_version 996844 (0.00081) [2022-07-11 02:30:47,288][26022] Updated weights on worker 0-0, policy_version 996854 (0.00092) [2022-07-11 02:30:49,052][26022] Updated weights on worker 0-0, policy_version 996864 (0.00426) [2022-07-11 02:30:49,160][25689] Fps is (10 sec: 5689.9, 60 sec: 5548.9, 300 sec: 5549.9). Total num frames: 1020789760. Throughput: 0: 4988.8. Samples: 1020782970. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:49,162][25689] Avg episode reward: [(0, '0.170')] [2022-07-11 02:30:51,131][26022] Updated weights on worker 0-0, policy_version 996874 (0.00088) [2022-07-11 02:30:52,787][26022] Updated weights on worker 0-0, policy_version 996884 (0.00085) [2022-07-11 02:30:54,174][25689] Fps is (10 sec: 5481.8, 60 sec: 5515.4, 300 sec: 5537.5). Total num frames: 1020815360. Throughput: 0: 5813.6. Samples: 1020816262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:54,174][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 02:30:54,656][26022] Updated weights on worker 0-0, policy_version 996894 (0.00085) [2022-07-11 02:30:56,674][26022] Updated weights on worker 0-0, policy_version 996904 (0.00091) [2022-07-11 02:30:58,252][26022] Updated weights on worker 0-0, policy_version 996914 (0.00093) [2022-07-11 02:30:59,203][25689] Fps is (10 sec: 5404.3, 60 sec: 5513.6, 300 sec: 5544.8). Total num frames: 1020844032. Throughput: 0: 5827.5. Samples: 1020849754. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:30:59,203][25689] Avg episode reward: [(0, '1.377')] [2022-07-11 02:31:00,317][26022] Updated weights on worker 0-0, policy_version 996924 (0.00090) [2022-07-11 02:31:02,342][26022] Updated weights on worker 0-0, policy_version 996934 (0.00081) [2022-07-11 02:31:04,302][25689] Fps is (10 sec: 5358.5, 60 sec: 5509.4, 300 sec: 5534.2). Total num frames: 1020869632. Throughput: 0: 4903.6. Samples: 1020865426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:04,303][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 02:31:04,328][26022] Updated weights on worker 0-0, policy_version 996944 (0.00094) [2022-07-11 02:31:06,124][26022] Updated weights on worker 0-0, policy_version 996954 (0.00149) [2022-07-11 02:31:07,911][26022] Updated weights on worker 0-0, policy_version 996964 (0.00077) [2022-07-11 02:31:09,367][25689] Fps is (10 sec: 5339.5, 60 sec: 5541.0, 300 sec: 5540.0). Total num frames: 1020898304. Throughput: 0: 5669.3. Samples: 1020897528. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:09,367][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 02:31:09,907][26022] Updated weights on worker 0-0, policy_version 996974 (0.00099) [2022-07-11 02:31:11,656][26022] Updated weights on worker 0-0, policy_version 996984 (0.00083) [2022-07-11 02:31:13,537][26022] Updated weights on worker 0-0, policy_version 996994 (0.00083) [2022-07-11 02:31:14,387][25689] Fps is (10 sec: 5584.4, 60 sec: 5506.8, 300 sec: 5537.1). Total num frames: 1020925952. Throughput: 0: 5689.7. Samples: 1020931270. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:14,389][25689] Avg episode reward: [(0, '0.728')] [2022-07-11 02:31:15,289][26022] Updated weights on worker 0-0, policy_version 997004 (0.00097) [2022-07-11 02:31:17,156][26022] Updated weights on worker 0-0, policy_version 997014 (0.00088) [2022-07-11 02:31:19,101][26022] Updated weights on worker 0-0, policy_version 997024 (0.00087) [2022-07-11 02:31:19,399][25689] Fps is (10 sec: 5715.8, 60 sec: 5577.0, 300 sec: 5544.0). Total num frames: 1020955648. Throughput: 0: 4862.7. Samples: 1020947964. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:19,405][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 02:31:20,928][26022] Updated weights on worker 0-0, policy_version 997034 (0.00085) [2022-07-11 02:31:22,679][26022] Updated weights on worker 0-0, policy_version 997044 (0.00092) [2022-07-11 02:31:24,422][25689] Fps is (10 sec: 5612.7, 60 sec: 5542.5, 300 sec: 5537.8). Total num frames: 1020982272. Throughput: 0: 5770.0. Samples: 1020981516. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:24,423][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 02:31:24,595][26022] Updated weights on worker 0-0, policy_version 997054 (0.00090) [2022-07-11 02:31:26,631][26022] Updated weights on worker 0-0, policy_version 997064 (0.00084) [2022-07-11 02:31:26,845][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:31:26,853][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000997065_1020994560.pth [2022-07-11 02:31:26,854][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000995115_1018997760.pth [2022-07-11 02:31:28,227][26022] Updated weights on worker 0-0, policy_version 997074 (0.00498) [2022-07-11 02:31:29,535][25689] Fps is (10 sec: 5455.8, 60 sec: 5537.5, 300 sec: 5539.5). Total num frames: 1021010944. Throughput: 0: 5806.4. Samples: 1021014630. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:29,537][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 02:31:30,190][26022] Updated weights on worker 0-0, policy_version 997084 (0.00095) [2022-07-11 02:31:31,673][26022] Updated weights on worker 0-0, policy_version 997094 (0.00100) [2022-07-11 02:31:33,941][26022] Updated weights on worker 0-0, policy_version 997104 (0.00085) [2022-07-11 02:31:34,579][25689] Fps is (10 sec: 5746.5, 60 sec: 5551.2, 300 sec: 5543.0). Total num frames: 1021040640. Throughput: 0: 5786.8. Samples: 1021048114. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:34,580][25689] Avg episode reward: [(0, '-0.194')] [2022-07-11 02:31:35,655][26022] Updated weights on worker 0-0, policy_version 997114 (0.00457) [2022-07-11 02:31:37,514][26022] Updated weights on worker 0-0, policy_version 997124 (0.00095) [2022-07-11 02:31:39,271][26022] Updated weights on worker 0-0, policy_version 997134 (0.00082) [2022-07-11 02:31:39,604][25689] Fps is (10 sec: 5491.3, 60 sec: 5515.7, 300 sec: 5532.7). Total num frames: 1021066240. Throughput: 0: 5777.2. Samples: 1021064692. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:39,605][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 02:31:41,128][26022] Updated weights on worker 0-0, policy_version 997144 (0.00083) [2022-07-11 02:31:42,957][26022] Updated weights on worker 0-0, policy_version 997154 (0.00090) [2022-07-11 02:31:44,631][25689] Fps is (10 sec: 5399.3, 60 sec: 5530.7, 300 sec: 5536.8). Total num frames: 1021094912. Throughput: 0: 5792.9. Samples: 1021098582. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:44,631][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 02:31:44,780][26022] Updated weights on worker 0-0, policy_version 997164 (0.00108) [2022-07-11 02:31:46,505][26022] Updated weights on worker 0-0, policy_version 997174 (0.00085) [2022-07-11 02:31:48,482][26022] Updated weights on worker 0-0, policy_version 997184 (0.00093) [2022-07-11 02:31:49,673][25689] Fps is (10 sec: 5796.9, 60 sec: 5533.5, 300 sec: 5546.4). Total num frames: 1021124608. Throughput: 0: 5844.3. Samples: 1021132326. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:49,675][25689] Avg episode reward: [(0, '0.814')] [2022-07-11 02:31:50,077][26022] Updated weights on worker 0-0, policy_version 997194 (0.00087) [2022-07-11 02:31:52,030][26022] Updated weights on worker 0-0, policy_version 997204 (0.00090) [2022-07-11 02:31:53,796][26022] Updated weights on worker 0-0, policy_version 997214 (0.00085) [2022-07-11 02:31:54,712][25689] Fps is (10 sec: 5586.3, 60 sec: 5548.1, 300 sec: 5535.9). Total num frames: 1021151232. Throughput: 0: 5029.9. Samples: 1021149384. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:54,715][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 02:31:55,656][26022] Updated weights on worker 0-0, policy_version 997224 (0.00085) [2022-07-11 02:31:57,488][26022] Updated weights on worker 0-0, policy_version 997234 (0.00090) [2022-07-11 02:31:59,163][26022] Updated weights on worker 0-0, policy_version 997244 (0.00093) [2022-07-11 02:31:59,761][25689] Fps is (10 sec: 5684.5, 60 sec: 5580.1, 300 sec: 5545.7). Total num frames: 1021181952. Throughput: 0: 5885.6. Samples: 1021183326. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:31:59,763][25689] Avg episode reward: [(0, '1.158')] [2022-07-11 02:32:01,095][26022] Updated weights on worker 0-0, policy_version 997254 (0.00096) [2022-07-11 02:32:03,280][26022] Updated weights on worker 0-0, policy_version 997264 (0.00086) [2022-07-11 02:32:04,823][25689] Fps is (10 sec: 5469.0, 60 sec: 5566.6, 300 sec: 5542.8). Total num frames: 1021206528. Throughput: 0: 5759.1. Samples: 1021214874. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:32:04,825][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 02:32:05,164][26022] Updated weights on worker 0-0, policy_version 997274 (0.00089) [2022-07-11 02:32:06,997][26022] Updated weights on worker 0-0, policy_version 997284 (0.00094) [2022-07-11 02:32:08,831][26022] Updated weights on worker 0-0, policy_version 997294 (0.00081) [2022-07-11 02:32:09,864][25689] Fps is (10 sec: 5169.2, 60 sec: 5551.9, 300 sec: 5539.5). Total num frames: 1021234176. Throughput: 0: 4908.8. Samples: 1021231438. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:32:09,865][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 02:32:10,691][26022] Updated weights on worker 0-0, policy_version 997304 (0.00094) [2022-07-11 02:32:12,589][26022] Updated weights on worker 0-0, policy_version 997314 (0.00086) [2022-07-11 02:32:14,175][26022] Updated weights on worker 0-0, policy_version 997324 (0.00083) [2022-07-11 02:32:14,883][25689] Fps is (10 sec: 5700.5, 60 sec: 5585.9, 300 sec: 5544.3). Total num frames: 1021263872. Throughput: 0: 5742.5. Samples: 1021265212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:32:14,883][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 02:32:16,176][26022] Updated weights on worker 0-0, policy_version 997334 (0.00082) [2022-07-11 02:32:17,848][26022] Updated weights on worker 0-0, policy_version 997344 (0.00088) [2022-07-11 02:32:19,895][25689] Fps is (10 sec: 5512.8, 60 sec: 5518.2, 300 sec: 5537.6). Total num frames: 1021289472. Throughput: 0: 5733.4. Samples: 1021298760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 02:32:19,895][25689] Avg episode reward: [(0, '0.891')] [2022-07-11 02:32:19,971][26022] Updated weights on worker 0-0, policy_version 997354 (0.00090) [2022-07-11 02:32:21,623][26022] Updated weights on worker 0-0, policy_version 997364 (0.00079) [2022-07-11 02:32:23,369][26022] Updated weights on worker 0-0, policy_version 997374 (0.00086) [2022-07-11 02:32:24,909][25689] Fps is (10 sec: 5515.4, 60 sec: 5569.8, 300 sec: 5545.5). Total num frames: 1021319168. Throughput: 0: 5018.7. Samples: 1021315676. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:24,909][25689] Avg episode reward: [(0, '0.857')] [2022-07-11 02:32:25,329][26022] Updated weights on worker 0-0, policy_version 997384 (0.00090) [2022-07-11 02:32:26,977][26022] Updated weights on worker 0-0, policy_version 997394 (0.00092) [2022-07-11 02:32:29,134][26022] Updated weights on worker 0-0, policy_version 997404 (0.00090) [2022-07-11 02:32:29,979][25689] Fps is (10 sec: 5788.1, 60 sec: 5573.7, 300 sec: 5548.9). Total num frames: 1021347840. Throughput: 0: 5828.8. Samples: 1021348682. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:29,980][25689] Avg episode reward: [(0, '1.305')] [2022-07-11 02:32:30,850][26022] Updated weights on worker 0-0, policy_version 997414 (0.00096) [2022-07-11 02:32:32,749][26022] Updated weights on worker 0-0, policy_version 997424 (0.00080) [2022-07-11 02:32:34,599][26022] Updated weights on worker 0-0, policy_version 997434 (0.00084) [2022-07-11 02:32:35,006][25689] Fps is (10 sec: 5476.1, 60 sec: 5524.4, 300 sec: 5539.2). Total num frames: 1021374464. Throughput: 0: 5813.6. Samples: 1021382202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:35,007][25689] Avg episode reward: [(0, '1.483')] [2022-07-11 02:32:36,418][26022] Updated weights on worker 0-0, policy_version 997444 (0.00094) [2022-07-11 02:32:38,224][26022] Updated weights on worker 0-0, policy_version 997454 (0.00091) [2022-07-11 02:32:40,014][25689] Fps is (10 sec: 5306.0, 60 sec: 5542.9, 300 sec: 5536.8). Total num frames: 1021401088. Throughput: 0: 4967.2. Samples: 1021398698. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:40,015][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 02:32:40,290][26022] Updated weights on worker 0-0, policy_version 997464 (0.00096) [2022-07-11 02:32:41,903][26022] Updated weights on worker 0-0, policy_version 997474 (0.00088) [2022-07-11 02:32:44,104][26022] Updated weights on worker 0-0, policy_version 997484 (0.00107) [2022-07-11 02:32:45,024][25689] Fps is (10 sec: 5622.1, 60 sec: 5561.4, 300 sec: 5542.4). Total num frames: 1021430784. Throughput: 0: 5763.8. Samples: 1021431616. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:45,024][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 02:32:45,660][26022] Updated weights on worker 0-0, policy_version 997494 (0.00084) [2022-07-11 02:32:47,673][26022] Updated weights on worker 0-0, policy_version 997504 (0.00085) [2022-07-11 02:32:49,272][26022] Updated weights on worker 0-0, policy_version 997514 (0.00086) [2022-07-11 02:32:50,155][25689] Fps is (10 sec: 5452.7, 60 sec: 5485.5, 300 sec: 5533.5). Total num frames: 1021456384. Throughput: 0: 5773.2. Samples: 1021465164. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:50,157][25689] Avg episode reward: [(0, '0.974')] [2022-07-11 02:32:51,099][26022] Updated weights on worker 0-0, policy_version 997524 (0.00102) [2022-07-11 02:32:53,178][26022] Updated weights on worker 0-0, policy_version 997534 (0.00084) [2022-07-11 02:32:54,748][26022] Updated weights on worker 0-0, policy_version 997544 (0.00090) [2022-07-11 02:32:55,175][25689] Fps is (10 sec: 5447.3, 60 sec: 5538.1, 300 sec: 5540.2). Total num frames: 1021486080. Throughput: 0: 4950.9. Samples: 1021482054. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:32:55,176][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 02:32:56,740][26022] Updated weights on worker 0-0, policy_version 997554 (0.00086) [2022-07-11 02:32:58,571][26022] Updated weights on worker 0-0, policy_version 997564 (0.00087) [2022-07-11 02:33:00,216][25689] Fps is (10 sec: 5699.9, 60 sec: 5488.0, 300 sec: 5547.0). Total num frames: 1021513728. Throughput: 0: 5792.4. Samples: 1021515714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:00,216][25689] Avg episode reward: [(0, '0.737')] [2022-07-11 02:33:00,368][26022] Updated weights on worker 0-0, policy_version 997574 (0.00094) [2022-07-11 02:33:02,614][26022] Updated weights on worker 0-0, policy_version 997584 (0.00087) [2022-07-11 02:33:04,391][26022] Updated weights on worker 0-0, policy_version 997594 (0.00086) [2022-07-11 02:33:05,241][25689] Fps is (10 sec: 5493.4, 60 sec: 5542.2, 300 sec: 5541.8). Total num frames: 1021541376. Throughput: 0: 5710.5. Samples: 1021547064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:05,243][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 02:33:06,367][26022] Updated weights on worker 0-0, policy_version 997604 (0.00094) [2022-07-11 02:33:08,144][26022] Updated weights on worker 0-0, policy_version 997614 (0.00091) [2022-07-11 02:33:10,018][26022] Updated weights on worker 0-0, policy_version 997624 (0.00090) [2022-07-11 02:33:10,303][25689] Fps is (10 sec: 5380.2, 60 sec: 5523.3, 300 sec: 5534.5). Total num frames: 1021568000. Throughput: 0: 4890.5. Samples: 1021563698. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:10,305][25689] Avg episode reward: [(0, '1.035')] [2022-07-11 02:33:11,832][26022] Updated weights on worker 0-0, policy_version 997634 (0.00085) [2022-07-11 02:33:13,643][26022] Updated weights on worker 0-0, policy_version 997644 (0.00085) [2022-07-11 02:33:15,335][25689] Fps is (10 sec: 5376.6, 60 sec: 5488.3, 300 sec: 5531.0). Total num frames: 1021595648. Throughput: 0: 5691.6. Samples: 1021596796. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:15,335][25689] Avg episode reward: [(0, '0.880')] [2022-07-11 02:33:15,528][26022] Updated weights on worker 0-0, policy_version 997654 (0.00085) [2022-07-11 02:33:17,260][26022] Updated weights on worker 0-0, policy_version 997664 (0.00087) [2022-07-11 02:33:19,354][26022] Updated weights on worker 0-0, policy_version 997674 (0.00084) [2022-07-11 02:33:20,367][25689] Fps is (10 sec: 5596.5, 60 sec: 5537.2, 300 sec: 5539.2). Total num frames: 1021624320. Throughput: 0: 5702.1. Samples: 1021630616. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:20,368][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 02:33:20,940][26022] Updated weights on worker 0-0, policy_version 997684 (0.00091) [2022-07-11 02:33:22,913][26022] Updated weights on worker 0-0, policy_version 997694 (0.00055) [2022-07-11 02:33:24,735][26022] Updated weights on worker 0-0, policy_version 997704 (0.00100) [2022-07-11 02:33:25,385][25689] Fps is (10 sec: 5603.7, 60 sec: 5502.9, 300 sec: 5534.1). Total num frames: 1021651968. Throughput: 0: 4976.6. Samples: 1021647316. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:25,389][25689] Avg episode reward: [(0, '1.020')] [2022-07-11 02:33:26,527][26022] Updated weights on worker 0-0, policy_version 997714 (0.00091) [2022-07-11 02:33:27,101][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:33:27,123][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000997717_1021662208.pth [2022-07-11 02:33:27,123][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000995766_1019664384.pth [2022-07-11 02:33:28,550][26022] Updated weights on worker 0-0, policy_version 997724 (0.00089) [2022-07-11 02:33:30,291][26022] Updated weights on worker 0-0, policy_version 997734 (0.00084) [2022-07-11 02:33:30,447][25689] Fps is (10 sec: 5587.3, 60 sec: 5503.8, 300 sec: 5536.5). Total num frames: 1021680640. Throughput: 0: 5790.2. Samples: 1021680332. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:30,447][25689] Avg episode reward: [(0, '0.939')] [2022-07-11 02:33:32,168][26022] Updated weights on worker 0-0, policy_version 997744 (0.00087) [2022-07-11 02:33:33,967][26022] Updated weights on worker 0-0, policy_version 997754 (0.00090) [2022-07-11 02:33:35,452][25689] Fps is (10 sec: 5493.2, 60 sec: 5505.8, 300 sec: 5533.2). Total num frames: 1021707264. Throughput: 0: 5837.6. Samples: 1021714230. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:35,452][25689] Avg episode reward: [(0, '1.108')] [2022-07-11 02:33:35,791][26022] Updated weights on worker 0-0, policy_version 997764 (0.00084) [2022-07-11 02:33:37,532][26022] Updated weights on worker 0-0, policy_version 997774 (0.00089) [2022-07-11 02:33:39,622][26022] Updated weights on worker 0-0, policy_version 997784 (0.00092) [2022-07-11 02:33:40,459][25689] Fps is (10 sec: 5522.8, 60 sec: 5539.7, 300 sec: 5533.9). Total num frames: 1021735936. Throughput: 0: 4983.2. Samples: 1021730740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:40,459][25689] Avg episode reward: [(0, '0.683')] [2022-07-11 02:33:41,216][26022] Updated weights on worker 0-0, policy_version 997794 (0.00089) [2022-07-11 02:33:43,343][26022] Updated weights on worker 0-0, policy_version 997804 (0.00056) [2022-07-11 02:33:45,041][26022] Updated weights on worker 0-0, policy_version 997814 (0.00087) [2022-07-11 02:33:45,463][25689] Fps is (10 sec: 5625.4, 60 sec: 5506.3, 300 sec: 5532.2). Total num frames: 1021763584. Throughput: 0: 5801.3. Samples: 1021763794. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:45,464][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 02:33:46,933][26022] Updated weights on worker 0-0, policy_version 997824 (0.00084) [2022-07-11 02:33:48,793][26022] Updated weights on worker 0-0, policy_version 997834 (0.00088) [2022-07-11 02:33:50,539][25689] Fps is (10 sec: 5485.7, 60 sec: 5545.3, 300 sec: 5531.4). Total num frames: 1021791232. Throughput: 0: 5817.2. Samples: 1021797212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:50,540][25689] Avg episode reward: [(0, '0.701')] [2022-07-11 02:33:50,579][26022] Updated weights on worker 0-0, policy_version 997844 (0.00104) [2022-07-11 02:33:52,361][26022] Updated weights on worker 0-0, policy_version 997854 (0.00087) [2022-07-11 02:33:54,360][26022] Updated weights on worker 0-0, policy_version 997864 (0.00088) [2022-07-11 02:33:55,549][25689] Fps is (10 sec: 5584.3, 60 sec: 5529.3, 300 sec: 5534.8). Total num frames: 1021819904. Throughput: 0: 4945.0. Samples: 1021813610. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:33:55,549][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 02:33:55,918][26022] Updated weights on worker 0-0, policy_version 997874 (0.00085) [2022-07-11 02:33:58,105][26022] Updated weights on worker 0-0, policy_version 997884 (0.00082) [2022-07-11 02:33:59,878][26022] Updated weights on worker 0-0, policy_version 997894 (0.00082) [2022-07-11 02:34:00,571][25689] Fps is (10 sec: 5511.9, 60 sec: 5514.0, 300 sec: 5538.3). Total num frames: 1021846528. Throughput: 0: 5792.6. Samples: 1021847242. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:00,571][25689] Avg episode reward: [(0, '1.172')] [2022-07-11 02:34:01,617][26022] Updated weights on worker 0-0, policy_version 997904 (0.00082) [2022-07-11 02:34:04,086][26022] Updated weights on worker 0-0, policy_version 997914 (0.00094) [2022-07-11 02:34:05,397][26022] Updated weights on worker 0-0, policy_version 997924 (0.00084) [2022-07-11 02:34:05,600][25689] Fps is (10 sec: 5501.5, 60 sec: 5530.6, 300 sec: 5540.1). Total num frames: 1021875200. Throughput: 0: 5721.8. Samples: 1021879012. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:05,600][25689] Avg episode reward: [(0, '1.446')] [2022-07-11 02:34:07,683][26022] Updated weights on worker 0-0, policy_version 997934 (0.00087) [2022-07-11 02:34:09,215][26022] Updated weights on worker 0-0, policy_version 997944 (0.00095) [2022-07-11 02:34:10,638][25689] Fps is (10 sec: 5289.4, 60 sec: 5498.9, 300 sec: 5526.4). Total num frames: 1021899776. Throughput: 0: 4901.5. Samples: 1021895728. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:10,638][25689] Avg episode reward: [(0, '1.972')] [2022-07-11 02:34:11,208][26022] Updated weights on worker 0-0, policy_version 997954 (0.00082) [2022-07-11 02:34:13,111][26022] Updated weights on worker 0-0, policy_version 997964 (0.00086) [2022-07-11 02:34:14,777][26022] Updated weights on worker 0-0, policy_version 997974 (0.00084) [2022-07-11 02:34:15,656][25689] Fps is (10 sec: 5397.1, 60 sec: 5534.1, 300 sec: 5534.2). Total num frames: 1021929472. Throughput: 0: 5752.4. Samples: 1021929274. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:15,656][25689] Avg episode reward: [(0, '2.123')] [2022-07-11 02:34:16,895][26022] Updated weights on worker 0-0, policy_version 997984 (0.00093) [2022-07-11 02:34:18,347][26022] Updated weights on worker 0-0, policy_version 997994 (0.00095) [2022-07-11 02:34:20,526][26022] Updated weights on worker 0-0, policy_version 998004 (0.00082) [2022-07-11 02:34:20,685][25689] Fps is (10 sec: 5605.8, 60 sec: 5500.4, 300 sec: 5527.2). Total num frames: 1021956096. Throughput: 0: 5760.2. Samples: 1021963100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:20,686][25689] Avg episode reward: [(0, '1.932')] [2022-07-11 02:34:22,100][26022] Updated weights on worker 0-0, policy_version 998014 (0.00093) [2022-07-11 02:34:24,030][26022] Updated weights on worker 0-0, policy_version 998024 (0.00084) [2022-07-11 02:34:25,700][25689] Fps is (10 sec: 5505.1, 60 sec: 5517.7, 300 sec: 5528.3). Total num frames: 1021984768. Throughput: 0: 5017.4. Samples: 1021979864. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:25,702][25689] Avg episode reward: [(0, '1.913')] [2022-07-11 02:34:25,919][26022] Updated weights on worker 0-0, policy_version 998034 (0.00082) [2022-07-11 02:34:27,791][26022] Updated weights on worker 0-0, policy_version 998044 (0.00087) [2022-07-11 02:34:29,552][26022] Updated weights on worker 0-0, policy_version 998054 (0.00090) [2022-07-11 02:34:30,764][25689] Fps is (10 sec: 5790.9, 60 sec: 5534.5, 300 sec: 5531.0). Total num frames: 1022014464. Throughput: 0: 5849.0. Samples: 1022013446. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:30,764][25689] Avg episode reward: [(0, '0.916')] [2022-07-11 02:34:31,456][26022] Updated weights on worker 0-0, policy_version 998064 (0.00095) [2022-07-11 02:34:33,043][26022] Updated weights on worker 0-0, policy_version 998074 (0.00085) [2022-07-11 02:34:35,144][26022] Updated weights on worker 0-0, policy_version 998084 (0.00086) [2022-07-11 02:34:35,770][25689] Fps is (10 sec: 5694.9, 60 sec: 5551.4, 300 sec: 5531.4). Total num frames: 1022042112. Throughput: 0: 5853.1. Samples: 1022047002. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:35,770][25689] Avg episode reward: [(0, '0.596')] [2022-07-11 02:34:36,917][26022] Updated weights on worker 0-0, policy_version 998094 (0.00092) [2022-07-11 02:34:38,723][26022] Updated weights on worker 0-0, policy_version 998104 (0.00084) [2022-07-11 02:34:40,559][26022] Updated weights on worker 0-0, policy_version 998114 (0.00082) [2022-07-11 02:34:40,775][25689] Fps is (10 sec: 5523.7, 60 sec: 5534.6, 300 sec: 5532.1). Total num frames: 1022069760. Throughput: 0: 5014.9. Samples: 1022063848. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:40,776][25689] Avg episode reward: [(0, '-0.118')] [2022-07-11 02:34:42,316][26022] Updated weights on worker 0-0, policy_version 998124 (0.00090) [2022-07-11 02:34:44,131][26022] Updated weights on worker 0-0, policy_version 998134 (0.00088) [2022-07-11 02:34:45,788][25689] Fps is (10 sec: 5417.1, 60 sec: 5516.8, 300 sec: 5527.4). Total num frames: 1022096384. Throughput: 0: 5840.7. Samples: 1022097192. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:45,790][25689] Avg episode reward: [(0, '-0.078')] [2022-07-11 02:34:46,043][26022] Updated weights on worker 0-0, policy_version 998144 (0.00084) [2022-07-11 02:34:47,864][26022] Updated weights on worker 0-0, policy_version 998154 (0.00080) [2022-07-11 02:34:49,718][26022] Updated weights on worker 0-0, policy_version 998164 (0.00091) [2022-07-11 02:34:50,825][25689] Fps is (10 sec: 5705.6, 60 sec: 5571.3, 300 sec: 5537.3). Total num frames: 1022127104. Throughput: 0: 5846.5. Samples: 1022130734. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:50,826][25689] Avg episode reward: [(0, '-0.094')] [2022-07-11 02:34:51,426][26022] Updated weights on worker 0-0, policy_version 998174 (0.00086) [2022-07-11 02:34:53,434][26022] Updated weights on worker 0-0, policy_version 998184 (0.00086) [2022-07-11 02:34:55,307][26022] Updated weights on worker 0-0, policy_version 998194 (0.00085) [2022-07-11 02:34:55,909][25689] Fps is (10 sec: 5665.7, 60 sec: 5530.5, 300 sec: 5529.0). Total num frames: 1022153728. Throughput: 0: 4999.1. Samples: 1022147682. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:34:55,910][25689] Avg episode reward: [(0, '-0.039')] [2022-07-11 02:34:56,939][26022] Updated weights on worker 0-0, policy_version 998204 (0.00079) [2022-07-11 02:34:58,914][26022] Updated weights on worker 0-0, policy_version 998214 (0.00095) [2022-07-11 02:35:00,710][26022] Updated weights on worker 0-0, policy_version 998224 (0.00083) [2022-07-11 02:35:00,980][25689] Fps is (10 sec: 5445.4, 60 sec: 5560.0, 300 sec: 5539.1). Total num frames: 1022182400. Throughput: 0: 5799.2. Samples: 1022181022. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:00,980][25689] Avg episode reward: [(0, '0.878')] [2022-07-11 02:35:02,835][26022] Updated weights on worker 0-0, policy_version 998234 (0.00093) [2022-07-11 02:35:04,930][26022] Updated weights on worker 0-0, policy_version 998244 (0.00082) [2022-07-11 02:35:05,987][25689] Fps is (10 sec: 5385.6, 60 sec: 5511.1, 300 sec: 5536.2). Total num frames: 1022208000. Throughput: 0: 5720.0. Samples: 1022212728. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:05,987][25689] Avg episode reward: [(0, '0.578')] [2022-07-11 02:35:06,379][26022] Updated weights on worker 0-0, policy_version 998254 (0.00091) [2022-07-11 02:35:08,652][26022] Updated weights on worker 0-0, policy_version 998264 (0.00087) [2022-07-11 02:35:10,253][26022] Updated weights on worker 0-0, policy_version 998274 (0.00095) [2022-07-11 02:35:11,138][25689] Fps is (10 sec: 5242.0, 60 sec: 5551.6, 300 sec: 5526.9). Total num frames: 1022235648. Throughput: 0: 5657.3. Samples: 1022245648. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:11,138][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 02:35:12,222][26022] Updated weights on worker 0-0, policy_version 998284 (0.00085) [2022-07-11 02:35:14,052][26022] Updated weights on worker 0-0, policy_version 998294 (0.00096) [2022-07-11 02:35:15,907][26022] Updated weights on worker 0-0, policy_version 998304 (0.00095) [2022-07-11 02:35:16,211][25689] Fps is (10 sec: 5608.4, 60 sec: 5546.5, 300 sec: 5540.0). Total num frames: 1022265344. Throughput: 0: 5655.1. Samples: 1022262492. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:16,212][25689] Avg episode reward: [(0, '1.073')] [2022-07-11 02:35:17,735][26022] Updated weights on worker 0-0, policy_version 998314 (0.00090) [2022-07-11 02:35:19,478][26022] Updated weights on worker 0-0, policy_version 998324 (0.00092) [2022-07-11 02:35:21,243][25689] Fps is (10 sec: 5776.0, 60 sec: 5580.1, 300 sec: 5539.7). Total num frames: 1022294016. Throughput: 0: 5678.7. Samples: 1022296090. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:21,243][26022] Updated weights on worker 0-0, policy_version 998334 (0.00088) [2022-07-11 02:35:21,243][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 02:35:23,198][26022] Updated weights on worker 0-0, policy_version 998344 (0.00086) [2022-07-11 02:35:24,978][26022] Updated weights on worker 0-0, policy_version 998354 (0.00085) [2022-07-11 02:35:26,250][25689] Fps is (10 sec: 5508.4, 60 sec: 5547.1, 300 sec: 5533.8). Total num frames: 1022320640. Throughput: 0: 5751.5. Samples: 1022329272. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:26,250][25689] Avg episode reward: [(0, '-0.033')] [2022-07-11 02:35:27,041][26022] Updated weights on worker 0-0, policy_version 998364 (0.00082) [2022-07-11 02:35:27,313][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:35:27,329][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000998366_1022326784.pth [2022-07-11 02:35:27,330][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000996419_1020333056.pth [2022-07-11 02:35:28,740][26022] Updated weights on worker 0-0, policy_version 998374 (0.00088) [2022-07-11 02:35:30,625][26022] Updated weights on worker 0-0, policy_version 998384 (0.00086) [2022-07-11 02:35:31,343][25689] Fps is (10 sec: 5474.9, 60 sec: 5527.5, 300 sec: 5532.2). Total num frames: 1022349312. Throughput: 0: 4962.5. Samples: 1022345916. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:31,343][25689] Avg episode reward: [(0, '0.322')] [2022-07-11 02:35:32,560][26022] Updated weights on worker 0-0, policy_version 998394 (0.00076) [2022-07-11 02:35:34,172][26022] Updated weights on worker 0-0, policy_version 998404 (0.00090) [2022-07-11 02:35:36,274][26022] Updated weights on worker 0-0, policy_version 998414 (0.00082) [2022-07-11 02:35:36,364][25689] Fps is (10 sec: 5568.5, 60 sec: 5526.1, 300 sec: 5531.9). Total num frames: 1022376960. Throughput: 0: 5794.3. Samples: 1022379262. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:36,365][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 02:35:37,864][26022] Updated weights on worker 0-0, policy_version 998424 (0.00545) [2022-07-11 02:35:39,762][26022] Updated weights on worker 0-0, policy_version 998434 (0.00095) [2022-07-11 02:35:41,384][25689] Fps is (10 sec: 5608.9, 60 sec: 5541.6, 300 sec: 5535.1). Total num frames: 1022405632. Throughput: 0: 5797.3. Samples: 1022412854. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:41,386][25689] Avg episode reward: [(0, '0.804')] [2022-07-11 02:35:41,578][26022] Updated weights on worker 0-0, policy_version 998444 (0.00083) [2022-07-11 02:35:43,596][26022] Updated weights on worker 0-0, policy_version 998454 (0.00082) [2022-07-11 02:35:45,239][26022] Updated weights on worker 0-0, policy_version 998464 (0.00083) [2022-07-11 02:35:46,422][25689] Fps is (10 sec: 5497.5, 60 sec: 5539.3, 300 sec: 5525.4). Total num frames: 1022432256. Throughput: 0: 4982.6. Samples: 1022429784. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:46,423][25689] Avg episode reward: [(0, '0.503')] [2022-07-11 02:35:47,179][26022] Updated weights on worker 0-0, policy_version 998474 (0.00086) [2022-07-11 02:35:48,902][26022] Updated weights on worker 0-0, policy_version 998484 (0.00090) [2022-07-11 02:35:50,853][26022] Updated weights on worker 0-0, policy_version 998494 (0.00089) [2022-07-11 02:35:51,552][25689] Fps is (10 sec: 5337.5, 60 sec: 5480.3, 300 sec: 5530.1). Total num frames: 1022459904. Throughput: 0: 5801.7. Samples: 1022463164. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:51,552][25689] Avg episode reward: [(0, '0.518')] [2022-07-11 02:35:52,683][26022] Updated weights on worker 0-0, policy_version 998504 (0.00091) [2022-07-11 02:35:54,446][26022] Updated weights on worker 0-0, policy_version 998514 (0.00093) [2022-07-11 02:35:56,409][26022] Updated weights on worker 0-0, policy_version 998524 (0.00089) [2022-07-11 02:35:56,612][25689] Fps is (10 sec: 5627.8, 60 sec: 5533.1, 300 sec: 5533.0). Total num frames: 1022489600. Throughput: 0: 5786.5. Samples: 1022496426. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:35:56,613][25689] Avg episode reward: [(0, '1.321')] [2022-07-11 02:35:58,145][26022] Updated weights on worker 0-0, policy_version 998534 (0.00081) [2022-07-11 02:35:59,907][26022] Updated weights on worker 0-0, policy_version 998544 (0.00087) [2022-07-11 02:36:01,631][25689] Fps is (10 sec: 5587.8, 60 sec: 5504.0, 300 sec: 5537.9). Total num frames: 1022516224. Throughput: 0: 4966.1. Samples: 1022513404. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:36:01,632][25689] Avg episode reward: [(0, '1.228')] [2022-07-11 02:36:02,192][26022] Updated weights on worker 0-0, policy_version 998554 (0.00103) [2022-07-11 02:36:03,906][26022] Updated weights on worker 0-0, policy_version 998564 (0.00084) [2022-07-11 02:36:06,102][26022] Updated weights on worker 0-0, policy_version 998574 (0.00082) [2022-07-11 02:36:06,693][25689] Fps is (10 sec: 5383.6, 60 sec: 5532.8, 300 sec: 5534.6). Total num frames: 1022543872. Throughput: 0: 5670.8. Samples: 1022544732. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:36:06,693][25689] Avg episode reward: [(0, '0.752')] [2022-07-11 02:36:07,610][26022] Updated weights on worker 0-0, policy_version 998584 (0.00095) [2022-07-11 02:36:09,634][26022] Updated weights on worker 0-0, policy_version 998594 (0.00082) [2022-07-11 02:36:11,339][26022] Updated weights on worker 0-0, policy_version 998604 (0.00094) [2022-07-11 02:36:11,754][25689] Fps is (10 sec: 5563.7, 60 sec: 5557.9, 300 sec: 5537.3). Total num frames: 1022572544. Throughput: 0: 5698.9. Samples: 1022578292. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:36:11,754][25689] Avg episode reward: [(0, '1.055')] [2022-07-11 02:36:13,362][26022] Updated weights on worker 0-0, policy_version 998614 (0.00090) [2022-07-11 02:36:15,048][26022] Updated weights on worker 0-0, policy_version 998624 (0.00082) [2022-07-11 02:36:16,776][25689] Fps is (10 sec: 5585.7, 60 sec: 5528.8, 300 sec: 5530.2). Total num frames: 1022600192. Throughput: 0: 4883.6. Samples: 1022594898. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:36:16,776][25689] Avg episode reward: [(0, '0.944')] [2022-07-11 02:36:17,055][26022] Updated weights on worker 0-0, policy_version 998634 (0.00088) [2022-07-11 02:36:18,598][26022] Updated weights on worker 0-0, policy_version 998644 (0.00079) [2022-07-11 02:36:20,656][26022] Updated weights on worker 0-0, policy_version 998654 (0.00091) [2022-07-11 02:36:21,794][25689] Fps is (10 sec: 5609.7, 60 sec: 5530.0, 300 sec: 5537.2). Total num frames: 1022628864. Throughput: 0: 5712.6. Samples: 1022628584. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 02:36:21,794][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 02:36:22,220][26022] Updated weights on worker 0-0, policy_version 998664 (0.00086) [2022-07-11 02:36:24,520][26022] Updated weights on worker 0-0, policy_version 998674 (0.00083) [2022-07-11 02:36:25,858][26022] Updated weights on worker 0-0, policy_version 998684 (0.00622) [2022-07-11 02:36:26,819][25689] Fps is (10 sec: 5607.5, 60 sec: 5545.2, 300 sec: 5535.3). Total num frames: 1022656512. Throughput: 0: 5827.1. Samples: 1022662012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:26,820][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 02:36:28,115][26022] Updated weights on worker 0-0, policy_version 998694 (0.00092) [2022-07-11 02:36:29,750][26022] Updated weights on worker 0-0, policy_version 998704 (0.00090) [2022-07-11 02:36:31,639][26022] Updated weights on worker 0-0, policy_version 998714 (0.00100) [2022-07-11 02:36:31,912][25689] Fps is (10 sec: 5566.1, 60 sec: 5545.3, 300 sec: 5531.0). Total num frames: 1022685184. Throughput: 0: 4975.5. Samples: 1022678588. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:31,913][25689] Avg episode reward: [(0, '0.892')] [2022-07-11 02:36:33,343][26022] Updated weights on worker 0-0, policy_version 998724 (0.00090) [2022-07-11 02:36:35,253][26022] Updated weights on worker 0-0, policy_version 998734 (0.00095) [2022-07-11 02:36:36,954][25689] Fps is (10 sec: 5556.9, 60 sec: 5543.3, 300 sec: 5537.6). Total num frames: 1022712832. Throughput: 0: 5823.9. Samples: 1022712418. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:36,955][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 02:36:37,166][26022] Updated weights on worker 0-0, policy_version 998744 (0.00088) [2022-07-11 02:36:38,980][26022] Updated weights on worker 0-0, policy_version 998754 (0.00086) [2022-07-11 02:36:40,733][26022] Updated weights on worker 0-0, policy_version 998764 (0.00083) [2022-07-11 02:36:41,976][25689] Fps is (10 sec: 5494.5, 60 sec: 5526.3, 300 sec: 5534.2). Total num frames: 1022740480. Throughput: 0: 5810.0. Samples: 1022745844. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:41,976][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 02:36:42,787][26022] Updated weights on worker 0-0, policy_version 998774 (0.00093) [2022-07-11 02:36:44,433][26022] Updated weights on worker 0-0, policy_version 998784 (0.00085) [2022-07-11 02:36:46,403][26022] Updated weights on worker 0-0, policy_version 998794 (0.00087) [2022-07-11 02:36:46,991][25689] Fps is (10 sec: 5407.6, 60 sec: 5528.4, 300 sec: 5524.4). Total num frames: 1022767104. Throughput: 0: 4981.2. Samples: 1022762490. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:46,991][25689] Avg episode reward: [(0, '0.305')] [2022-07-11 02:36:47,979][26022] Updated weights on worker 0-0, policy_version 998804 (0.00082) [2022-07-11 02:36:50,123][26022] Updated weights on worker 0-0, policy_version 998814 (0.00081) [2022-07-11 02:36:51,785][26022] Updated weights on worker 0-0, policy_version 998824 (0.00359) [2022-07-11 02:36:52,094][25689] Fps is (10 sec: 5667.3, 60 sec: 5581.5, 300 sec: 5537.0). Total num frames: 1022797824. Throughput: 0: 5816.6. Samples: 1022795980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:52,095][25689] Avg episode reward: [(0, '0.453')] [2022-07-11 02:36:53,699][26022] Updated weights on worker 0-0, policy_version 998834 (0.00091) [2022-07-11 02:36:55,303][26022] Updated weights on worker 0-0, policy_version 998844 (0.00091) [2022-07-11 02:36:57,146][25689] Fps is (10 sec: 5646.6, 60 sec: 5531.5, 300 sec: 5523.1). Total num frames: 1022824448. Throughput: 0: 5805.3. Samples: 1022829638. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:36:57,147][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 02:36:57,231][26022] Updated weights on worker 0-0, policy_version 998854 (0.00087) [2022-07-11 02:36:59,014][26022] Updated weights on worker 0-0, policy_version 998864 (0.00082) [2022-07-11 02:37:01,007][26022] Updated weights on worker 0-0, policy_version 998874 (0.00088) [2022-07-11 02:37:02,194][25689] Fps is (10 sec: 5272.4, 60 sec: 5528.9, 300 sec: 5530.3). Total num frames: 1022851072. Throughput: 0: 4979.4. Samples: 1022846516. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:02,194][25689] Avg episode reward: [(0, '-0.435')] [2022-07-11 02:37:03,147][26022] Updated weights on worker 0-0, policy_version 998884 (0.00083) [2022-07-11 02:37:05,107][26022] Updated weights on worker 0-0, policy_version 998894 (0.00082) [2022-07-11 02:37:06,649][26022] Updated weights on worker 0-0, policy_version 998904 (0.00084) [2022-07-11 02:37:07,209][25689] Fps is (10 sec: 5495.0, 60 sec: 5550.1, 300 sec: 5534.2). Total num frames: 1022879744. Throughput: 0: 5720.1. Samples: 1022878142. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:07,210][25689] Avg episode reward: [(0, '-0.058')] [2022-07-11 02:37:08,901][26022] Updated weights on worker 0-0, policy_version 998914 (0.00082) [2022-07-11 02:37:10,354][26022] Updated weights on worker 0-0, policy_version 998924 (0.00084) [2022-07-11 02:37:12,319][25689] Fps is (10 sec: 5461.4, 60 sec: 5511.8, 300 sec: 5522.2). Total num frames: 1022906368. Throughput: 0: 5723.6. Samples: 1022911736. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:12,319][25689] Avg episode reward: [(0, '-0.228')] [2022-07-11 02:37:12,551][26022] Updated weights on worker 0-0, policy_version 998934 (0.00088) [2022-07-11 02:37:13,923][26022] Updated weights on worker 0-0, policy_version 998944 (0.00084) [2022-07-11 02:37:16,012][26022] Updated weights on worker 0-0, policy_version 998954 (0.00083) [2022-07-11 02:37:17,342][25689] Fps is (10 sec: 5659.2, 60 sec: 5562.4, 300 sec: 5539.2). Total num frames: 1022937088. Throughput: 0: 5730.0. Samples: 1022945360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:17,343][25689] Avg episode reward: [(0, '0.026')] [2022-07-11 02:37:17,679][26022] Updated weights on worker 0-0, policy_version 998964 (0.00092) [2022-07-11 02:37:19,590][26022] Updated weights on worker 0-0, policy_version 998974 (0.00090) [2022-07-11 02:37:21,339][26022] Updated weights on worker 0-0, policy_version 998984 (0.00086) [2022-07-11 02:37:22,365][25689] Fps is (10 sec: 5809.7, 60 sec: 5545.0, 300 sec: 5532.1). Total num frames: 1022964736. Throughput: 0: 5735.0. Samples: 1022962200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:22,366][25689] Avg episode reward: [(0, '-0.193')] [2022-07-11 02:37:23,322][26022] Updated weights on worker 0-0, policy_version 998994 (0.00090) [2022-07-11 02:37:25,115][26022] Updated weights on worker 0-0, policy_version 999004 (0.00095) [2022-07-11 02:37:26,976][26022] Updated weights on worker 0-0, policy_version 999014 (0.00087) [2022-07-11 02:37:27,345][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:37:27,364][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000999017_1022993408.pth [2022-07-11 02:37:27,365][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000997065_1020994560.pth [2022-07-11 02:37:27,371][25689] Fps is (10 sec: 5616.0, 60 sec: 5563.8, 300 sec: 5533.3). Total num frames: 1022993408. Throughput: 0: 5840.0. Samples: 1022995884. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:27,371][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 02:37:28,754][26022] Updated weights on worker 0-0, policy_version 999024 (0.00085) [2022-07-11 02:37:30,647][26022] Updated weights on worker 0-0, policy_version 999034 (0.00094) [2022-07-11 02:37:32,427][25689] Fps is (10 sec: 5495.7, 60 sec: 5533.3, 300 sec: 5532.8). Total num frames: 1023020032. Throughput: 0: 5841.8. Samples: 1023029206. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:32,428][25689] Avg episode reward: [(0, '0.825')] [2022-07-11 02:37:32,600][26022] Updated weights on worker 0-0, policy_version 999044 (0.00089) [2022-07-11 02:37:34,291][26022] Updated weights on worker 0-0, policy_version 999054 (0.00084) [2022-07-11 02:37:36,098][26022] Updated weights on worker 0-0, policy_version 999064 (0.00065) [2022-07-11 02:37:37,437][25689] Fps is (10 sec: 5594.6, 60 sec: 5570.1, 300 sec: 5543.1). Total num frames: 1023049728. Throughput: 0: 5012.5. Samples: 1023046088. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:37,438][25689] Avg episode reward: [(0, '1.200')] [2022-07-11 02:37:37,864][26022] Updated weights on worker 0-0, policy_version 999074 (0.00091) [2022-07-11 02:37:39,989][26022] Updated weights on worker 0-0, policy_version 999084 (0.00088) [2022-07-11 02:37:41,572][26022] Updated weights on worker 0-0, policy_version 999094 (0.00093) [2022-07-11 02:37:42,439][25689] Fps is (10 sec: 5625.1, 60 sec: 5554.9, 300 sec: 5532.9). Total num frames: 1023076352. Throughput: 0: 5850.2. Samples: 1023079638. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:42,440][25689] Avg episode reward: [(0, '1.802')] [2022-07-11 02:37:43,530][26022] Updated weights on worker 0-0, policy_version 999104 (0.00093) [2022-07-11 02:37:45,244][26022] Updated weights on worker 0-0, policy_version 999114 (0.00085) [2022-07-11 02:37:47,171][26022] Updated weights on worker 0-0, policy_version 999124 (0.00100) [2022-07-11 02:37:47,522][25689] Fps is (10 sec: 5381.9, 60 sec: 5565.7, 300 sec: 5540.7). Total num frames: 1023104000. Throughput: 0: 5805.7. Samples: 1023112876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:47,522][25689] Avg episode reward: [(0, '2.034')] [2022-07-11 02:37:49,179][26022] Updated weights on worker 0-0, policy_version 999134 (0.00086) [2022-07-11 02:37:50,748][26022] Updated weights on worker 0-0, policy_version 999144 (0.00092) [2022-07-11 02:37:52,684][25689] Fps is (10 sec: 5497.8, 60 sec: 5526.5, 300 sec: 5534.6). Total num frames: 1023132672. Throughput: 0: 4949.8. Samples: 1023129472. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:52,684][25689] Avg episode reward: [(0, '1.933')] [2022-07-11 02:37:52,838][26022] Updated weights on worker 0-0, policy_version 999154 (0.00082) [2022-07-11 02:37:54,451][26022] Updated weights on worker 0-0, policy_version 999164 (0.00082) [2022-07-11 02:37:56,503][26022] Updated weights on worker 0-0, policy_version 999174 (0.00079) [2022-07-11 02:37:57,690][25689] Fps is (10 sec: 5740.5, 60 sec: 5581.5, 300 sec: 5542.1). Total num frames: 1023162368. Throughput: 0: 5785.1. Samples: 1023163248. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:37:57,690][25689] Avg episode reward: [(0, '1.773')] [2022-07-11 02:37:58,170][26022] Updated weights on worker 0-0, policy_version 999184 (0.00093) [2022-07-11 02:38:00,205][26022] Updated weights on worker 0-0, policy_version 999194 (0.00089) [2022-07-11 02:38:02,233][26022] Updated weights on worker 0-0, policy_version 999204 (0.00094) [2022-07-11 02:38:02,763][25689] Fps is (10 sec: 5486.1, 60 sec: 5562.2, 300 sec: 5534.4). Total num frames: 1023187968. Throughput: 0: 5657.9. Samples: 1023194626. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:02,763][25689] Avg episode reward: [(0, '1.919')] [2022-07-11 02:38:04,159][26022] Updated weights on worker 0-0, policy_version 999214 (0.00085) [2022-07-11 02:38:05,916][26022] Updated weights on worker 0-0, policy_version 999224 (0.00083) [2022-07-11 02:38:07,826][25689] Fps is (10 sec: 5152.1, 60 sec: 5524.0, 300 sec: 5534.3). Total num frames: 1023214592. Throughput: 0: 4853.2. Samples: 1023211416. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:07,826][25689] Avg episode reward: [(0, '1.479')] [2022-07-11 02:38:08,002][26022] Updated weights on worker 0-0, policy_version 999234 (0.00082) [2022-07-11 02:38:09,594][26022] Updated weights on worker 0-0, policy_version 999244 (0.00086) [2022-07-11 02:38:11,411][26022] Updated weights on worker 0-0, policy_version 999254 (0.00086) [2022-07-11 02:38:12,909][25689] Fps is (10 sec: 5550.6, 60 sec: 5577.1, 300 sec: 5540.3). Total num frames: 1023244288. Throughput: 0: 5705.7. Samples: 1023244874. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:12,910][25689] Avg episode reward: [(0, '1.514')] [2022-07-11 02:38:13,295][26022] Updated weights on worker 0-0, policy_version 999264 (0.00091) [2022-07-11 02:38:15,120][26022] Updated weights on worker 0-0, policy_version 999274 (0.00085) [2022-07-11 02:38:16,999][26022] Updated weights on worker 0-0, policy_version 999284 (0.00090) [2022-07-11 02:38:17,927][25689] Fps is (10 sec: 5677.0, 60 sec: 5526.9, 300 sec: 5537.1). Total num frames: 1023271936. Throughput: 0: 5701.0. Samples: 1023278622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:17,927][25689] Avg episode reward: [(0, '0.940')] [2022-07-11 02:38:18,704][26022] Updated weights on worker 0-0, policy_version 999294 (0.00079) [2022-07-11 02:38:20,671][26022] Updated weights on worker 0-0, policy_version 999304 (0.00080) [2022-07-11 02:38:22,533][26022] Updated weights on worker 0-0, policy_version 999314 (0.00086) [2022-07-11 02:38:22,957][25689] Fps is (10 sec: 5503.5, 60 sec: 5526.3, 300 sec: 5536.9). Total num frames: 1023299584. Throughput: 0: 4989.3. Samples: 1023295380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:22,958][25689] Avg episode reward: [(0, '1.086')] [2022-07-11 02:38:24,264][26022] Updated weights on worker 0-0, policy_version 999324 (0.00085) [2022-07-11 02:38:26,180][26022] Updated weights on worker 0-0, policy_version 999334 (0.00090) [2022-07-11 02:38:27,976][25689] Fps is (10 sec: 5604.4, 60 sec: 5525.0, 300 sec: 5537.7). Total num frames: 1023328256. Throughput: 0: 5827.1. Samples: 1023328834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:27,977][25689] Avg episode reward: [(0, '0.191')] [2022-07-11 02:38:27,981][26022] Updated weights on worker 0-0, policy_version 999344 (0.00093) [2022-07-11 02:38:29,648][26022] Updated weights on worker 0-0, policy_version 999354 (0.00091) [2022-07-11 02:38:31,844][26022] Updated weights on worker 0-0, policy_version 999364 (0.00084) [2022-07-11 02:38:33,043][25689] Fps is (10 sec: 5685.7, 60 sec: 5557.9, 300 sec: 5543.4). Total num frames: 1023356928. Throughput: 0: 5826.9. Samples: 1023362188. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:33,043][25689] Avg episode reward: [(0, '0.459')] [2022-07-11 02:38:33,367][26022] Updated weights on worker 0-0, policy_version 999374 (0.00091) [2022-07-11 02:38:35,519][26022] Updated weights on worker 0-0, policy_version 999384 (0.00089) [2022-07-11 02:38:36,938][26022] Updated weights on worker 0-0, policy_version 999394 (0.00088) [2022-07-11 02:38:38,097][25689] Fps is (10 sec: 5463.4, 60 sec: 5503.2, 300 sec: 5535.6). Total num frames: 1023383552. Throughput: 0: 4976.2. Samples: 1023378994. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:38,098][25689] Avg episode reward: [(0, '0.589')] [2022-07-11 02:38:39,035][26022] Updated weights on worker 0-0, policy_version 999404 (0.00084) [2022-07-11 02:38:40,912][26022] Updated weights on worker 0-0, policy_version 999414 (0.00087) [2022-07-11 02:38:42,695][26022] Updated weights on worker 0-0, policy_version 999424 (0.00086) [2022-07-11 02:38:43,126][25689] Fps is (10 sec: 5484.1, 60 sec: 5534.6, 300 sec: 5538.6). Total num frames: 1023412224. Throughput: 0: 5801.2. Samples: 1023412382. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:43,126][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 02:38:44,621][26022] Updated weights on worker 0-0, policy_version 999434 (0.00094) [2022-07-11 02:38:46,511][26022] Updated weights on worker 0-0, policy_version 999444 (0.00092) [2022-07-11 02:38:48,137][25689] Fps is (10 sec: 5609.9, 60 sec: 5541.1, 300 sec: 5539.8). Total num frames: 1023439872. Throughput: 0: 5806.1. Samples: 1023445888. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:48,137][25689] Avg episode reward: [(0, '0.483')] [2022-07-11 02:38:48,163][26022] Updated weights on worker 0-0, policy_version 999454 (0.00095) [2022-07-11 02:38:50,043][26022] Updated weights on worker 0-0, policy_version 999464 (0.00086) [2022-07-11 02:38:51,804][26022] Updated weights on worker 0-0, policy_version 999474 (0.00112) [2022-07-11 02:38:53,283][25689] Fps is (10 sec: 5443.9, 60 sec: 5525.6, 300 sec: 5533.8). Total num frames: 1023467520. Throughput: 0: 4957.1. Samples: 1023462522. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:53,284][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 02:38:53,963][26022] Updated weights on worker 0-0, policy_version 999484 (0.00090) [2022-07-11 02:38:55,491][26022] Updated weights on worker 0-0, policy_version 999494 (0.00088) [2022-07-11 02:38:57,487][26022] Updated weights on worker 0-0, policy_version 999504 (0.00087) [2022-07-11 02:38:58,321][25689] Fps is (10 sec: 5630.5, 60 sec: 5522.7, 300 sec: 5543.9). Total num frames: 1023497216. Throughput: 0: 5777.4. Samples: 1023495836. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:38:58,322][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 02:38:59,160][26022] Updated weights on worker 0-0, policy_version 999514 (0.00085) [2022-07-11 02:39:01,073][26022] Updated weights on worker 0-0, policy_version 999524 (0.00085) [2022-07-11 02:39:03,355][25689] Fps is (10 sec: 5286.6, 60 sec: 5492.4, 300 sec: 5526.6). Total num frames: 1023520768. Throughput: 0: 5682.7. Samples: 1023527342. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:03,356][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 02:39:03,600][26022] Updated weights on worker 0-0, policy_version 999534 (0.00091) [2022-07-11 02:39:04,999][26022] Updated weights on worker 0-0, policy_version 999544 (0.00091) [2022-07-11 02:39:07,187][26022] Updated weights on worker 0-0, policy_version 999554 (0.00089) [2022-07-11 02:39:08,371][25689] Fps is (10 sec: 5400.4, 60 sec: 5564.4, 300 sec: 5547.6). Total num frames: 1023551488. Throughput: 0: 4867.5. Samples: 1023544378. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:08,371][25689] Avg episode reward: [(0, '1.085')] [2022-07-11 02:39:08,533][26022] Updated weights on worker 0-0, policy_version 999564 (0.00087) [2022-07-11 02:39:10,652][26022] Updated weights on worker 0-0, policy_version 999574 (0.00101) [2022-07-11 02:39:12,231][26022] Updated weights on worker 0-0, policy_version 999584 (0.00082) [2022-07-11 02:39:13,504][25689] Fps is (10 sec: 5751.1, 60 sec: 5526.0, 300 sec: 5538.6). Total num frames: 1023579136. Throughput: 0: 5707.1. Samples: 1023577926. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:13,505][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 02:39:14,152][26022] Updated weights on worker 0-0, policy_version 999594 (0.00079) [2022-07-11 02:39:16,129][26022] Updated weights on worker 0-0, policy_version 999604 (0.00085) [2022-07-11 02:39:17,895][26022] Updated weights on worker 0-0, policy_version 999614 (0.00089) [2022-07-11 02:39:18,564][25689] Fps is (10 sec: 5525.0, 60 sec: 5539.0, 300 sec: 5544.9). Total num frames: 1023607808. Throughput: 0: 5725.1. Samples: 1023611730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:18,565][25689] Avg episode reward: [(0, '1.204')] [2022-07-11 02:39:19,734][26022] Updated weights on worker 0-0, policy_version 999624 (0.00083) [2022-07-11 02:39:21,603][26022] Updated weights on worker 0-0, policy_version 999634 (0.00088) [2022-07-11 02:39:23,408][26022] Updated weights on worker 0-0, policy_version 999644 (0.00088) [2022-07-11 02:39:23,608][25689] Fps is (10 sec: 5675.7, 60 sec: 5554.7, 300 sec: 5544.4). Total num frames: 1023636480. Throughput: 0: 5824.6. Samples: 1023645304. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:23,608][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 02:39:25,259][26022] Updated weights on worker 0-0, policy_version 999654 (0.00088) [2022-07-11 02:39:27,078][26022] Updated weights on worker 0-0, policy_version 999664 (0.00089) [2022-07-11 02:39:27,509][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:39:27,522][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000999666_1023657984.pth [2022-07-11 02:39:27,523][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000997717_1021662208.pth [2022-07-11 02:39:28,637][25689] Fps is (10 sec: 5693.1, 60 sec: 5553.8, 300 sec: 5541.6). Total num frames: 1023665152. Throughput: 0: 5814.1. Samples: 1023662208. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:28,637][25689] Avg episode reward: [(0, '1.839')] [2022-07-11 02:39:28,819][26022] Updated weights on worker 0-0, policy_version 999674 (0.00088) [2022-07-11 02:39:30,787][26022] Updated weights on worker 0-0, policy_version 999684 (0.00086) [2022-07-11 02:39:32,395][26022] Updated weights on worker 0-0, policy_version 999694 (0.00093) [2022-07-11 02:39:33,736][25689] Fps is (10 sec: 5459.7, 60 sec: 5517.1, 300 sec: 5536.4). Total num frames: 1023691776. Throughput: 0: 5834.7. Samples: 1023695970. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:33,736][25689] Avg episode reward: [(0, '1.885')] [2022-07-11 02:39:34,441][26022] Updated weights on worker 0-0, policy_version 999704 (0.00085) [2022-07-11 02:39:36,220][26022] Updated weights on worker 0-0, policy_version 999714 (0.00080) [2022-07-11 02:39:37,937][26022] Updated weights on worker 0-0, policy_version 999724 (0.00087) [2022-07-11 02:39:38,766][25689] Fps is (10 sec: 5560.1, 60 sec: 5569.9, 300 sec: 5542.8). Total num frames: 1023721472. Throughput: 0: 5818.8. Samples: 1023729280. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:38,767][25689] Avg episode reward: [(0, '1.665')] [2022-07-11 02:39:40,009][26022] Updated weights on worker 0-0, policy_version 999734 (0.00084) [2022-07-11 02:39:41,516][26022] Updated weights on worker 0-0, policy_version 999744 (0.00084) [2022-07-11 02:39:43,599][26022] Updated weights on worker 0-0, policy_version 999754 (0.00086) [2022-07-11 02:39:43,792][25689] Fps is (10 sec: 5702.3, 60 sec: 5553.3, 300 sec: 5546.0). Total num frames: 1023749120. Throughput: 0: 4991.6. Samples: 1023746056. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:43,794][25689] Avg episode reward: [(0, '1.849')] [2022-07-11 02:39:45,410][26022] Updated weights on worker 0-0, policy_version 999764 (0.00095) [2022-07-11 02:39:47,414][26022] Updated weights on worker 0-0, policy_version 999774 (0.00083) [2022-07-11 02:39:48,869][25689] Fps is (10 sec: 5473.1, 60 sec: 5547.2, 300 sec: 5535.0). Total num frames: 1023776768. Throughput: 0: 5799.7. Samples: 1023779550. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:48,872][25689] Avg episode reward: [(0, '1.563')] [2022-07-11 02:39:49,181][26022] Updated weights on worker 0-0, policy_version 999784 (0.00083) [2022-07-11 02:39:51,106][26022] Updated weights on worker 0-0, policy_version 999794 (0.00082) [2022-07-11 02:39:52,626][26022] Updated weights on worker 0-0, policy_version 999804 (0.00085) [2022-07-11 02:39:53,936][25689] Fps is (10 sec: 5552.3, 60 sec: 5571.4, 300 sec: 5542.2). Total num frames: 1023805440. Throughput: 0: 5810.7. Samples: 1023813346. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:53,936][25689] Avg episode reward: [(0, '1.646')] [2022-07-11 02:39:54,651][26022] Updated weights on worker 0-0, policy_version 999814 (0.00089) [2022-07-11 02:39:56,337][26022] Updated weights on worker 0-0, policy_version 999824 (0.00092) [2022-07-11 02:39:58,303][26022] Updated weights on worker 0-0, policy_version 999834 (0.00088) [2022-07-11 02:39:59,010][25689] Fps is (10 sec: 5756.1, 60 sec: 5568.1, 300 sec: 5545.5). Total num frames: 1023835136. Throughput: 0: 4982.1. Samples: 1023830136. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:39:59,010][25689] Avg episode reward: [(0, '0.919')] [2022-07-11 02:39:59,877][26022] Updated weights on worker 0-0, policy_version 999844 (0.00081) [2022-07-11 02:40:02,208][26022] Updated weights on worker 0-0, policy_version 999854 (0.00090) [2022-07-11 02:40:04,059][25689] Fps is (10 sec: 5461.9, 60 sec: 5600.4, 300 sec: 5544.7). Total num frames: 1023860736. Throughput: 0: 5723.1. Samples: 1023862050. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:40:04,060][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 02:40:04,060][26022] Updated weights on worker 0-0, policy_version 999864 (0.00081) [2022-07-11 02:40:05,814][26022] Updated weights on worker 0-0, policy_version 999874 (0.00052) [2022-07-11 02:40:07,582][26022] Updated weights on worker 0-0, policy_version 999884 (0.00086) [2022-07-11 02:40:09,142][25689] Fps is (10 sec: 5255.2, 60 sec: 5543.7, 300 sec: 5546.0). Total num frames: 1023888384. Throughput: 0: 5717.9. Samples: 1023895466. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:40:09,143][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 02:40:09,519][26022] Updated weights on worker 0-0, policy_version 999894 (0.00121) [2022-07-11 02:40:11,402][26022] Updated weights on worker 0-0, policy_version 999904 (0.00079) [2022-07-11 02:40:13,044][26022] Updated weights on worker 0-0, policy_version 999914 (0.00092) [2022-07-11 02:40:14,202][25689] Fps is (10 sec: 5653.8, 60 sec: 5584.1, 300 sec: 5546.3). Total num frames: 1023918080. Throughput: 0: 4888.0. Samples: 1023912408. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:40:14,202][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 02:40:14,977][26022] Updated weights on worker 0-0, policy_version 999924 (0.00085) [2022-07-11 02:40:16,827][26022] Updated weights on worker 0-0, policy_version 999934 (0.00082) [2022-07-11 02:40:18,643][26022] Updated weights on worker 0-0, policy_version 999944 (0.00428) [2022-07-11 02:40:19,267][25689] Fps is (10 sec: 5663.7, 60 sec: 5566.8, 300 sec: 5542.2). Total num frames: 1023945728. Throughput: 0: 5729.0. Samples: 1023946190. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 02:40:19,267][25689] Avg episode reward: [(0, '0.437')] [2022-07-11 02:40:20,471][26022] Updated weights on worker 0-0, policy_version 999954 (0.00119) [2022-07-11 02:40:22,202][26022] Updated weights on worker 0-0, policy_version 999964 (0.00083) [2022-07-11 02:40:24,146][26022] Updated weights on worker 0-0, policy_version 999974 (0.00075) [2022-07-11 02:40:24,287][25689] Fps is (10 sec: 5584.4, 60 sec: 5568.9, 300 sec: 5548.8). Total num frames: 1023974400. Throughput: 0: 5831.4. Samples: 1023980006. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:24,288][25689] Avg episode reward: [(0, '0.564')] [2022-07-11 02:40:25,995][26022] Updated weights on worker 0-0, policy_version 999984 (0.00715) [2022-07-11 02:40:27,862][26022] Updated weights on worker 0-0, policy_version 999994 (0.00088) [2022-07-11 02:40:29,351][25689] Fps is (10 sec: 5585.1, 60 sec: 5548.9, 300 sec: 5545.9). Total num frames: 1024002048. Throughput: 0: 5016.5. Samples: 1023996844. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:29,351][25689] Avg episode reward: [(0, '1.584')] [2022-07-11 02:40:29,571][26022] Updated weights on worker 0-0, policy_version 1000004 (0.00086) [2022-07-11 02:40:31,445][26022] Updated weights on worker 0-0, policy_version 1000014 (0.00093) [2022-07-11 02:40:33,119][26022] Updated weights on worker 0-0, policy_version 1000024 (0.00049) [2022-07-11 02:40:34,443][25689] Fps is (10 sec: 5444.7, 60 sec: 5566.4, 300 sec: 5544.6). Total num frames: 1024029696. Throughput: 0: 5826.9. Samples: 1024030350. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:34,444][25689] Avg episode reward: [(0, '1.498')] [2022-07-11 02:40:35,203][26022] Updated weights on worker 0-0, policy_version 1000034 (0.00089) [2022-07-11 02:40:36,899][26022] Updated weights on worker 0-0, policy_version 1000044 (0.00089) [2022-07-11 02:40:38,811][26022] Updated weights on worker 0-0, policy_version 1000054 (0.00086) [2022-07-11 02:40:39,466][25689] Fps is (10 sec: 5669.0, 60 sec: 5567.0, 300 sec: 5548.0). Total num frames: 1024059392. Throughput: 0: 5837.6. Samples: 1024064106. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:39,467][25689] Avg episode reward: [(0, '1.408')] [2022-07-11 02:40:40,479][26022] Updated weights on worker 0-0, policy_version 1000064 (0.00091) [2022-07-11 02:40:42,344][26022] Updated weights on worker 0-0, policy_version 1000074 (0.00088) [2022-07-11 02:40:44,142][26022] Updated weights on worker 0-0, policy_version 1000084 (0.00080) [2022-07-11 02:40:44,480][25689] Fps is (10 sec: 5713.3, 60 sec: 5568.1, 300 sec: 5551.9). Total num frames: 1024087040. Throughput: 0: 5007.3. Samples: 1024081118. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:44,482][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 02:40:46,018][26022] Updated weights on worker 0-0, policy_version 1000094 (0.00088) [2022-07-11 02:40:47,808][26022] Updated weights on worker 0-0, policy_version 1000104 (0.00087) [2022-07-11 02:40:49,567][25689] Fps is (10 sec: 5474.6, 60 sec: 5567.3, 300 sec: 5552.7). Total num frames: 1024114688. Throughput: 0: 5834.9. Samples: 1024114802. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:49,568][25689] Avg episode reward: [(0, '0.835')] [2022-07-11 02:40:49,712][26022] Updated weights on worker 0-0, policy_version 1000114 (0.00092) [2022-07-11 02:40:51,631][26022] Updated weights on worker 0-0, policy_version 1000124 (0.00086) [2022-07-11 02:40:53,371][26022] Updated weights on worker 0-0, policy_version 1000134 (0.00089) [2022-07-11 02:40:54,669][25689] Fps is (10 sec: 5527.5, 60 sec: 5564.0, 300 sec: 5548.5). Total num frames: 1024143360. Throughput: 0: 5829.5. Samples: 1024148258. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:54,670][25689] Avg episode reward: [(0, '-0.525')] [2022-07-11 02:40:55,082][26022] Updated weights on worker 0-0, policy_version 1000144 (0.00095) [2022-07-11 02:40:56,905][26022] Updated weights on worker 0-0, policy_version 1000154 (0.00084) [2022-07-11 02:40:58,780][26022] Updated weights on worker 0-0, policy_version 1000164 (0.00084) [2022-07-11 02:40:59,691][25689] Fps is (10 sec: 5663.7, 60 sec: 5551.8, 300 sec: 5555.3). Total num frames: 1024172032. Throughput: 0: 5829.7. Samples: 1024182014. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:40:59,692][25689] Avg episode reward: [(0, '-0.697')] [2022-07-11 02:41:00,647][26022] Updated weights on worker 0-0, policy_version 1000174 (0.00091) [2022-07-11 02:41:02,860][26022] Updated weights on worker 0-0, policy_version 1000184 (0.00091) [2022-07-11 02:41:04,706][25689] Fps is (10 sec: 5509.2, 60 sec: 5571.9, 300 sec: 5552.7). Total num frames: 1024198656. Throughput: 0: 5724.7. Samples: 1024196906. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:04,707][25689] Avg episode reward: [(0, '-0.424')] [2022-07-11 02:41:04,708][26022] Updated weights on worker 0-0, policy_version 1000194 (0.00081) [2022-07-11 02:41:06,396][26022] Updated weights on worker 0-0, policy_version 1000204 (0.00092) [2022-07-11 02:41:08,403][26022] Updated weights on worker 0-0, policy_version 1000214 (0.00088) [2022-07-11 02:41:09,739][25689] Fps is (10 sec: 5503.6, 60 sec: 5593.4, 300 sec: 5553.2). Total num frames: 1024227328. Throughput: 0: 5739.3. Samples: 1024230576. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:09,739][25689] Avg episode reward: [(0, '-0.481')] [2022-07-11 02:41:10,139][26022] Updated weights on worker 0-0, policy_version 1000224 (0.00090) [2022-07-11 02:41:12,070][26022] Updated weights on worker 0-0, policy_version 1000234 (0.00087) [2022-07-11 02:41:13,682][26022] Updated weights on worker 0-0, policy_version 1000244 (0.00084) [2022-07-11 02:41:14,792][25689] Fps is (10 sec: 5584.2, 60 sec: 5560.3, 300 sec: 5552.7). Total num frames: 1024254976. Throughput: 0: 5768.8. Samples: 1024264342. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:14,792][25689] Avg episode reward: [(0, '-1.175')] [2022-07-11 02:41:15,636][26022] Updated weights on worker 0-0, policy_version 1000254 (0.00087) [2022-07-11 02:41:17,417][26022] Updated weights on worker 0-0, policy_version 1000264 (0.00790) [2022-07-11 02:41:19,265][26022] Updated weights on worker 0-0, policy_version 1000274 (0.00084) [2022-07-11 02:41:19,863][25689] Fps is (10 sec: 5563.0, 60 sec: 5576.6, 300 sec: 5551.7). Total num frames: 1024283648. Throughput: 0: 4924.4. Samples: 1024281346. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:19,863][25689] Avg episode reward: [(0, '-0.190')] [2022-07-11 02:41:20,899][26022] Updated weights on worker 0-0, policy_version 1000284 (0.00089) [2022-07-11 02:41:22,910][26022] Updated weights on worker 0-0, policy_version 1000294 (0.00107) [2022-07-11 02:41:24,720][26022] Updated weights on worker 0-0, policy_version 1000304 (0.00095) [2022-07-11 02:41:24,879][25689] Fps is (10 sec: 5583.4, 60 sec: 5560.1, 300 sec: 5551.9). Total num frames: 1024311296. Throughput: 0: 5849.7. Samples: 1024314910. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:24,879][25689] Avg episode reward: [(0, '-0.121')] [2022-07-11 02:41:26,618][26022] Updated weights on worker 0-0, policy_version 1000314 (0.00081) [2022-07-11 02:41:27,625][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:41:27,636][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001000319_1024326656.pth [2022-07-11 02:41:27,637][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000998366_1022326784.pth [2022-07-11 02:41:28,423][26022] Updated weights on worker 0-0, policy_version 1000324 (0.00094) [2022-07-11 02:41:29,911][25689] Fps is (10 sec: 5503.3, 60 sec: 5563.0, 300 sec: 5549.6). Total num frames: 1024338944. Throughput: 0: 5838.4. Samples: 1024348348. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:29,911][25689] Avg episode reward: [(0, '-0.890')] [2022-07-11 02:41:30,249][26022] Updated weights on worker 0-0, policy_version 1000334 (0.00096) [2022-07-11 02:41:31,942][26022] Updated weights on worker 0-0, policy_version 1000344 (0.00088) [2022-07-11 02:41:33,985][26022] Updated weights on worker 0-0, policy_version 1000354 (0.00614) [2022-07-11 02:41:34,963][25689] Fps is (10 sec: 5686.4, 60 sec: 5600.5, 300 sec: 5556.2). Total num frames: 1024368640. Throughput: 0: 4991.8. Samples: 1024365034. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:34,965][25689] Avg episode reward: [(0, '-0.983')] [2022-07-11 02:41:35,764][26022] Updated weights on worker 0-0, policy_version 1000364 (0.00100) [2022-07-11 02:41:37,605][26022] Updated weights on worker 0-0, policy_version 1000374 (0.00083) [2022-07-11 02:41:39,457][26022] Updated weights on worker 0-0, policy_version 1000384 (0.00079) [2022-07-11 02:41:40,001][25689] Fps is (10 sec: 5581.6, 60 sec: 5548.4, 300 sec: 5552.5). Total num frames: 1024395264. Throughput: 0: 5828.3. Samples: 1024398716. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:40,003][25689] Avg episode reward: [(0, '-0.669')] [2022-07-11 02:41:41,257][26022] Updated weights on worker 0-0, policy_version 1000394 (0.00082) [2022-07-11 02:41:43,068][26022] Updated weights on worker 0-0, policy_version 1000404 (0.00085) [2022-07-11 02:41:44,869][26022] Updated weights on worker 0-0, policy_version 1000414 (0.00086) [2022-07-11 02:41:45,003][25689] Fps is (10 sec: 5507.5, 60 sec: 5566.4, 300 sec: 5559.6). Total num frames: 1024423936. Throughput: 0: 5853.5. Samples: 1024432708. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:45,005][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 02:41:46,658][26022] Updated weights on worker 0-0, policy_version 1000424 (0.00107) [2022-07-11 02:41:48,514][26022] Updated weights on worker 0-0, policy_version 1000434 (0.00090) [2022-07-11 02:41:50,018][25689] Fps is (10 sec: 5724.4, 60 sec: 5589.9, 300 sec: 5554.4). Total num frames: 1024452608. Throughput: 0: 5038.0. Samples: 1024449650. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:50,019][25689] Avg episode reward: [(0, '0.862')] [2022-07-11 02:41:50,179][26022] Updated weights on worker 0-0, policy_version 1000444 (0.00085) [2022-07-11 02:41:52,175][26022] Updated weights on worker 0-0, policy_version 1000454 (0.00086) [2022-07-11 02:41:54,059][26022] Updated weights on worker 0-0, policy_version 1000464 (0.00085) [2022-07-11 02:41:55,082][25689] Fps is (10 sec: 5486.6, 60 sec: 5559.6, 300 sec: 5554.2). Total num frames: 1024479232. Throughput: 0: 5863.8. Samples: 1024483004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:41:55,083][25689] Avg episode reward: [(0, '1.443')] [2022-07-11 02:41:55,798][26022] Updated weights on worker 0-0, policy_version 1000474 (0.00085) [2022-07-11 02:41:57,879][26022] Updated weights on worker 0-0, policy_version 1000484 (0.00083) [2022-07-11 02:41:59,391][26022] Updated weights on worker 0-0, policy_version 1000494 (0.00093) [2022-07-11 02:42:00,091][25689] Fps is (10 sec: 5591.3, 60 sec: 5577.8, 300 sec: 5565.2). Total num frames: 1024508928. Throughput: 0: 5864.2. Samples: 1024516530. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:00,092][25689] Avg episode reward: [(0, '1.446')] [2022-07-11 02:42:01,505][26022] Updated weights on worker 0-0, policy_version 1000504 (0.00084) [2022-07-11 02:42:03,618][26022] Updated weights on worker 0-0, policy_version 1000514 (0.00089) [2022-07-11 02:42:05,102][25689] Fps is (10 sec: 5416.2, 60 sec: 5544.2, 300 sec: 5551.5). Total num frames: 1024533504. Throughput: 0: 4901.6. Samples: 1024531222. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:05,104][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 02:42:05,583][26022] Updated weights on worker 0-0, policy_version 1000524 (0.00096) [2022-07-11 02:42:07,179][26022] Updated weights on worker 0-0, policy_version 1000534 (0.00089) [2022-07-11 02:42:09,156][26022] Updated weights on worker 0-0, policy_version 1000544 (0.00090) [2022-07-11 02:42:10,141][25689] Fps is (10 sec: 5196.3, 60 sec: 5526.6, 300 sec: 5556.3). Total num frames: 1024561152. Throughput: 0: 5721.3. Samples: 1024564780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:10,143][25689] Avg episode reward: [(0, '1.391')] [2022-07-11 02:42:10,960][26022] Updated weights on worker 0-0, policy_version 1000554 (0.00085) [2022-07-11 02:42:12,883][26022] Updated weights on worker 0-0, policy_version 1000564 (0.00083) [2022-07-11 02:42:14,735][26022] Updated weights on worker 0-0, policy_version 1000574 (0.00086) [2022-07-11 02:42:15,211][25689] Fps is (10 sec: 5672.7, 60 sec: 5559.1, 300 sec: 5552.0). Total num frames: 1024590848. Throughput: 0: 5728.8. Samples: 1024598320. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:15,211][25689] Avg episode reward: [(0, '1.213')] [2022-07-11 02:42:16,444][26022] Updated weights on worker 0-0, policy_version 1000584 (0.00090) [2022-07-11 02:42:18,306][26022] Updated weights on worker 0-0, policy_version 1000594 (0.00091) [2022-07-11 02:42:19,997][26022] Updated weights on worker 0-0, policy_version 1000604 (0.00098) [2022-07-11 02:42:20,216][25689] Fps is (10 sec: 5793.4, 60 sec: 5565.1, 300 sec: 5555.7). Total num frames: 1024619520. Throughput: 0: 4904.9. Samples: 1024615242. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:20,217][25689] Avg episode reward: [(0, '0.152')] [2022-07-11 02:42:22,033][26022] Updated weights on worker 0-0, policy_version 1000614 (0.00079) [2022-07-11 02:42:23,601][26022] Updated weights on worker 0-0, policy_version 1000624 (0.00086) [2022-07-11 02:42:25,227][25689] Fps is (10 sec: 5520.8, 60 sec: 5548.6, 300 sec: 5548.8). Total num frames: 1024646144. Throughput: 0: 5845.3. Samples: 1024648858. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:25,227][25689] Avg episode reward: [(0, '-0.223')] [2022-07-11 02:42:25,638][26022] Updated weights on worker 0-0, policy_version 1000634 (0.00104) [2022-07-11 02:42:27,492][26022] Updated weights on worker 0-0, policy_version 1000644 (0.00092) [2022-07-11 02:42:29,076][26022] Updated weights on worker 0-0, policy_version 1000654 (0.00083) [2022-07-11 02:42:30,247][25689] Fps is (10 sec: 5512.7, 60 sec: 5566.7, 300 sec: 5556.3). Total num frames: 1024674816. Throughput: 0: 5865.4. Samples: 1024682708. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:30,248][25689] Avg episode reward: [(0, '0.100')] [2022-07-11 02:42:31,103][26022] Updated weights on worker 0-0, policy_version 1000664 (0.00082) [2022-07-11 02:42:32,821][26022] Updated weights on worker 0-0, policy_version 1000674 (0.00094) [2022-07-11 02:42:34,778][26022] Updated weights on worker 0-0, policy_version 1000684 (0.00088) [2022-07-11 02:42:35,392][25689] Fps is (10 sec: 5641.4, 60 sec: 5541.2, 300 sec: 5550.4). Total num frames: 1024703488. Throughput: 0: 5011.1. Samples: 1024699448. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:35,392][25689] Avg episode reward: [(0, '-0.242')] [2022-07-11 02:42:36,684][26022] Updated weights on worker 0-0, policy_version 1000694 (0.00084) [2022-07-11 02:42:38,447][26022] Updated weights on worker 0-0, policy_version 1000704 (0.00081) [2022-07-11 02:42:40,284][26022] Updated weights on worker 0-0, policy_version 1000714 (0.00090) [2022-07-11 02:42:40,420][25689] Fps is (10 sec: 5636.9, 60 sec: 5576.0, 300 sec: 5556.8). Total num frames: 1024732160. Throughput: 0: 5828.4. Samples: 1024732996. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:40,421][25689] Avg episode reward: [(0, '-1.232')] [2022-07-11 02:42:42,282][26022] Updated weights on worker 0-0, policy_version 1000724 (0.00085) [2022-07-11 02:42:43,948][26022] Updated weights on worker 0-0, policy_version 1000734 (0.00089) [2022-07-11 02:42:45,495][25689] Fps is (10 sec: 5472.8, 60 sec: 5535.4, 300 sec: 5553.5). Total num frames: 1024758784. Throughput: 0: 5807.9. Samples: 1024766576. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:45,496][25689] Avg episode reward: [(0, '-0.056')] [2022-07-11 02:42:45,880][26022] Updated weights on worker 0-0, policy_version 1000744 (0.00089) [2022-07-11 02:42:47,483][26022] Updated weights on worker 0-0, policy_version 1000754 (0.00089) [2022-07-11 02:42:49,624][26022] Updated weights on worker 0-0, policy_version 1000764 (0.00082) [2022-07-11 02:42:50,511][25689] Fps is (10 sec: 5479.9, 60 sec: 5535.4, 300 sec: 5556.2). Total num frames: 1024787456. Throughput: 0: 4962.6. Samples: 1024783268. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:50,511][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 02:42:51,408][26022] Updated weights on worker 0-0, policy_version 1000774 (0.00092) [2022-07-11 02:42:53,076][26022] Updated weights on worker 0-0, policy_version 1000784 (0.00084) [2022-07-11 02:42:55,052][26022] Updated weights on worker 0-0, policy_version 1000794 (0.00083) [2022-07-11 02:42:55,644][25689] Fps is (10 sec: 5650.2, 60 sec: 5562.8, 300 sec: 5550.4). Total num frames: 1024816128. Throughput: 0: 5774.9. Samples: 1024816406. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:42:55,645][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 02:42:56,798][26022] Updated weights on worker 0-0, policy_version 1000804 (0.00096) [2022-07-11 02:42:58,708][26022] Updated weights on worker 0-0, policy_version 1000814 (0.00095) [2022-07-11 02:43:00,623][26022] Updated weights on worker 0-0, policy_version 1000824 (0.00080) [2022-07-11 02:43:00,691][25689] Fps is (10 sec: 5532.0, 60 sec: 5525.5, 300 sec: 5557.7). Total num frames: 1024843776. Throughput: 0: 5759.9. Samples: 1024849758. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:00,692][25689] Avg episode reward: [(0, '0.109')] [2022-07-11 02:43:02,686][26022] Updated weights on worker 0-0, policy_version 1000834 (0.00088) [2022-07-11 02:43:04,726][26022] Updated weights on worker 0-0, policy_version 1000844 (0.00085) [2022-07-11 02:43:05,701][25689] Fps is (10 sec: 5396.7, 60 sec: 5559.4, 300 sec: 5558.7). Total num frames: 1024870400. Throughput: 0: 4849.2. Samples: 1024864558. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:05,701][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 02:43:06,270][26022] Updated weights on worker 0-0, policy_version 1000854 (0.00093) [2022-07-11 02:43:08,099][26022] Updated weights on worker 0-0, policy_version 1000864 (0.00083) [2022-07-11 02:43:09,961][26022] Updated weights on worker 0-0, policy_version 1000874 (0.00085) [2022-07-11 02:43:10,729][25689] Fps is (10 sec: 5509.0, 60 sec: 5577.4, 300 sec: 5556.3). Total num frames: 1024899072. Throughput: 0: 5694.6. Samples: 1024898402. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:10,731][25689] Avg episode reward: [(0, '1.676')] [2022-07-11 02:43:12,000][26022] Updated weights on worker 0-0, policy_version 1000884 (0.00087) [2022-07-11 02:43:13,662][26022] Updated weights on worker 0-0, policy_version 1000894 (0.00087) [2022-07-11 02:43:15,438][26022] Updated weights on worker 0-0, policy_version 1000904 (0.00077) [2022-07-11 02:43:15,768][25689] Fps is (10 sec: 5594.4, 60 sec: 5546.3, 300 sec: 5555.9). Total num frames: 1024926720. Throughput: 0: 5759.6. Samples: 1024932310. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:15,769][25689] Avg episode reward: [(0, '1.266')] [2022-07-11 02:43:17,233][26022] Updated weights on worker 0-0, policy_version 1000914 (0.00100) [2022-07-11 02:43:19,223][26022] Updated weights on worker 0-0, policy_version 1000924 (0.00090) [2022-07-11 02:43:20,799][25689] Fps is (10 sec: 5593.0, 60 sec: 5544.1, 300 sec: 5559.4). Total num frames: 1024955392. Throughput: 0: 4947.4. Samples: 1024949236. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:20,799][25689] Avg episode reward: [(0, '0.467')] [2022-07-11 02:43:20,840][26022] Updated weights on worker 0-0, policy_version 1000934 (0.00091) [2022-07-11 02:43:22,689][26022] Updated weights on worker 0-0, policy_version 1000944 (0.00083) [2022-07-11 02:43:24,682][26022] Updated weights on worker 0-0, policy_version 1000954 (0.00706) [2022-07-11 02:43:25,823][25689] Fps is (10 sec: 5703.3, 60 sec: 5576.6, 300 sec: 5559.3). Total num frames: 1024984064. Throughput: 0: 5884.3. Samples: 1024982960. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:25,824][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 02:43:26,367][26022] Updated weights on worker 0-0, policy_version 1000964 (0.00088) [2022-07-11 02:43:27,737][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:43:27,747][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001000971_1024994304.pth [2022-07-11 02:43:27,747][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000999017_1022993408.pth [2022-07-11 02:43:28,299][26022] Updated weights on worker 0-0, policy_version 1000974 (0.00079) [2022-07-11 02:43:30,070][26022] Updated weights on worker 0-0, policy_version 1000984 (0.00097) [2022-07-11 02:43:30,840][25689] Fps is (10 sec: 5506.7, 60 sec: 5543.1, 300 sec: 5553.3). Total num frames: 1025010688. Throughput: 0: 5868.1. Samples: 1025016418. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:30,841][25689] Avg episode reward: [(0, '0.547')] [2022-07-11 02:43:32,075][26022] Updated weights on worker 0-0, policy_version 1000994 (0.00082) [2022-07-11 02:43:33,774][26022] Updated weights on worker 0-0, policy_version 1001004 (0.00087) [2022-07-11 02:43:35,644][26022] Updated weights on worker 0-0, policy_version 1001014 (0.00087) [2022-07-11 02:43:35,962][25689] Fps is (10 sec: 5453.6, 60 sec: 5545.1, 300 sec: 5558.9). Total num frames: 1025039360. Throughput: 0: 4989.7. Samples: 1025033072. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:35,963][25689] Avg episode reward: [(0, '0.199')] [2022-07-11 02:43:37,293][26022] Updated weights on worker 0-0, policy_version 1001024 (0.00089) [2022-07-11 02:43:39,398][26022] Updated weights on worker 0-0, policy_version 1001034 (0.00113) [2022-07-11 02:43:40,980][25689] Fps is (10 sec: 5655.5, 60 sec: 5546.1, 300 sec: 5559.1). Total num frames: 1025068032. Throughput: 0: 5820.8. Samples: 1025066706. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:40,980][25689] Avg episode reward: [(0, '0.034')] [2022-07-11 02:43:41,056][26022] Updated weights on worker 0-0, policy_version 1001044 (0.00081) [2022-07-11 02:43:43,101][26022] Updated weights on worker 0-0, policy_version 1001054 (0.00084) [2022-07-11 02:43:44,600][26022] Updated weights on worker 0-0, policy_version 1001064 (0.00087) [2022-07-11 02:43:46,048][25689] Fps is (10 sec: 5685.8, 60 sec: 5580.6, 300 sec: 5561.5). Total num frames: 1025096704. Throughput: 0: 5811.1. Samples: 1025100490. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:46,048][25689] Avg episode reward: [(0, '-0.018')] [2022-07-11 02:43:46,528][26022] Updated weights on worker 0-0, policy_version 1001074 (0.00051) [2022-07-11 02:43:48,551][26022] Updated weights on worker 0-0, policy_version 1001084 (0.00088) [2022-07-11 02:43:50,147][26022] Updated weights on worker 0-0, policy_version 1001094 (0.00086) [2022-07-11 02:43:51,055][25689] Fps is (10 sec: 5691.8, 60 sec: 5581.4, 300 sec: 5567.5). Total num frames: 1025125376. Throughput: 0: 5821.1. Samples: 1025134088. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:51,055][25689] Avg episode reward: [(0, '0.458')] [2022-07-11 02:43:52,087][26022] Updated weights on worker 0-0, policy_version 1001104 (0.00518) [2022-07-11 02:43:54,021][26022] Updated weights on worker 0-0, policy_version 1001114 (0.00092) [2022-07-11 02:43:55,759][26022] Updated weights on worker 0-0, policy_version 1001124 (0.00093) [2022-07-11 02:43:56,142][25689] Fps is (10 sec: 5680.8, 60 sec: 5585.7, 300 sec: 5563.2). Total num frames: 1025154048. Throughput: 0: 5823.5. Samples: 1025150590. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:43:56,143][25689] Avg episode reward: [(0, '0.339')] [2022-07-11 02:43:57,579][26022] Updated weights on worker 0-0, policy_version 1001134 (0.00088) [2022-07-11 02:43:59,238][26022] Updated weights on worker 0-0, policy_version 1001144 (0.00087) [2022-07-11 02:44:01,169][25689] Fps is (10 sec: 5467.1, 60 sec: 5570.6, 300 sec: 5573.6). Total num frames: 1025180672. Throughput: 0: 5849.9. Samples: 1025184810. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:44:01,170][25689] Avg episode reward: [(0, '0.392')] [2022-07-11 02:44:01,263][26022] Updated weights on worker 0-0, policy_version 1001154 (0.00080) [2022-07-11 02:44:03,308][26022] Updated weights on worker 0-0, policy_version 1001164 (0.00085) [2022-07-11 02:44:05,191][26022] Updated weights on worker 0-0, policy_version 1001174 (0.00082) [2022-07-11 02:44:06,193][25689] Fps is (10 sec: 5297.6, 60 sec: 5569.2, 300 sec: 5559.7). Total num frames: 1025207296. Throughput: 0: 5774.7. Samples: 1025216826. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:44:06,194][25689] Avg episode reward: [(0, '0.447')] [2022-07-11 02:44:06,919][26022] Updated weights on worker 0-0, policy_version 1001184 (0.00095) [2022-07-11 02:44:08,879][26022] Updated weights on worker 0-0, policy_version 1001194 (0.00080) [2022-07-11 02:44:10,847][26022] Updated weights on worker 0-0, policy_version 1001204 (0.00056) [2022-07-11 02:44:11,283][25689] Fps is (10 sec: 5467.0, 60 sec: 5563.5, 300 sec: 5564.0). Total num frames: 1025235968. Throughput: 0: 4922.5. Samples: 1025233664. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:44:11,284][25689] Avg episode reward: [(0, '-0.156')] [2022-07-11 02:44:12,400][26022] Updated weights on worker 0-0, policy_version 1001214 (0.00091) [2022-07-11 02:44:14,257][26022] Updated weights on worker 0-0, policy_version 1001224 (0.00090) [2022-07-11 02:44:15,982][26022] Updated weights on worker 0-0, policy_version 1001234 (0.00083) [2022-07-11 02:44:16,397][25689] Fps is (10 sec: 5619.7, 60 sec: 5573.5, 300 sec: 5563.0). Total num frames: 1025264640. Throughput: 0: 5765.3. Samples: 1025267368. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:44:16,398][25689] Avg episode reward: [(0, '0.441')] [2022-07-11 02:44:18,016][26022] Updated weights on worker 0-0, policy_version 1001244 (0.00089) [2022-07-11 02:44:19,888][26022] Updated weights on worker 0-0, policy_version 1001254 (0.00090) [2022-07-11 02:44:21,445][25689] Fps is (10 sec: 5744.2, 60 sec: 5588.9, 300 sec: 5566.3). Total num frames: 1025294336. Throughput: 0: 5741.0. Samples: 1025301212. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 02:44:21,446][25689] Avg episode reward: [(0, '0.560')] [2022-07-11 02:44:21,451][26022] Updated weights on worker 0-0, policy_version 1001264 (0.00089) [2022-07-11 02:44:23,526][26022] Updated weights on worker 0-0, policy_version 1001274 (0.00084) [2022-07-11 02:44:25,337][26022] Updated weights on worker 0-0, policy_version 1001284 (0.00106) [2022-07-11 02:44:26,510][25689] Fps is (10 sec: 5569.5, 60 sec: 5551.4, 300 sec: 5558.8). Total num frames: 1025320960. Throughput: 0: 4977.5. Samples: 1025317950. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:26,511][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 02:44:27,075][26022] Updated weights on worker 0-0, policy_version 1001294 (0.00086) [2022-07-11 02:44:28,802][26022] Updated weights on worker 0-0, policy_version 1001304 (0.00085) [2022-07-11 02:44:30,731][26022] Updated weights on worker 0-0, policy_version 1001314 (0.00097) [2022-07-11 02:44:31,520][25689] Fps is (10 sec: 5488.3, 60 sec: 5585.8, 300 sec: 5567.3). Total num frames: 1025349632. Throughput: 0: 5827.1. Samples: 1025351584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:31,520][25689] Avg episode reward: [(0, '0.709')] [2022-07-11 02:44:32,639][26022] Updated weights on worker 0-0, policy_version 1001324 (0.00079) [2022-07-11 02:44:34,439][26022] Updated weights on worker 0-0, policy_version 1001334 (0.00090) [2022-07-11 02:44:36,096][26022] Updated weights on worker 0-0, policy_version 1001344 (0.00090) [2022-07-11 02:44:36,574][25689] Fps is (10 sec: 5697.9, 60 sec: 5592.1, 300 sec: 5563.4). Total num frames: 1025378304. Throughput: 0: 5850.2. Samples: 1025385404. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:36,574][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 02:44:38,254][26022] Updated weights on worker 0-0, policy_version 1001354 (0.00085) [2022-07-11 02:44:39,812][26022] Updated weights on worker 0-0, policy_version 1001364 (0.00089) [2022-07-11 02:44:41,588][25689] Fps is (10 sec: 5492.2, 60 sec: 5558.6, 300 sec: 5560.2). Total num frames: 1025404928. Throughput: 0: 5011.5. Samples: 1025402160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:41,589][25689] Avg episode reward: [(0, '1.913')] [2022-07-11 02:44:41,935][26022] Updated weights on worker 0-0, policy_version 1001374 (0.00070) [2022-07-11 02:44:43,435][26022] Updated weights on worker 0-0, policy_version 1001384 (0.00087) [2022-07-11 02:44:45,435][26022] Updated weights on worker 0-0, policy_version 1001394 (0.00082) [2022-07-11 02:44:46,597][25689] Fps is (10 sec: 5619.1, 60 sec: 5580.9, 300 sec: 5568.4). Total num frames: 1025434624. Throughput: 0: 5871.7. Samples: 1025435896. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:46,597][25689] Avg episode reward: [(0, '1.629')] [2022-07-11 02:44:47,125][26022] Updated weights on worker 0-0, policy_version 1001404 (0.00090) [2022-07-11 02:44:49,039][26022] Updated weights on worker 0-0, policy_version 1001414 (0.00087) [2022-07-11 02:44:50,762][26022] Updated weights on worker 0-0, policy_version 1001424 (0.00087) [2022-07-11 02:44:51,606][25689] Fps is (10 sec: 5621.8, 60 sec: 5546.9, 300 sec: 5562.6). Total num frames: 1025461248. Throughput: 0: 5872.6. Samples: 1025469542. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:51,607][25689] Avg episode reward: [(0, '1.843')] [2022-07-11 02:44:52,635][26022] Updated weights on worker 0-0, policy_version 1001434 (0.00095) [2022-07-11 02:44:54,603][26022] Updated weights on worker 0-0, policy_version 1001444 (0.00083) [2022-07-11 02:44:56,343][26022] Updated weights on worker 0-0, policy_version 1001454 (0.00089) [2022-07-11 02:44:56,643][25689] Fps is (10 sec: 5606.2, 60 sec: 5568.5, 300 sec: 5563.3). Total num frames: 1025490944. Throughput: 0: 5014.0. Samples: 1025486030. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:44:56,643][25689] Avg episode reward: [(0, '1.068')] [2022-07-11 02:44:58,245][26022] Updated weights on worker 0-0, policy_version 1001464 (0.00082) [2022-07-11 02:45:00,032][26022] Updated weights on worker 0-0, policy_version 1001474 (0.00088) [2022-07-11 02:45:01,663][25689] Fps is (10 sec: 5396.3, 60 sec: 5535.2, 300 sec: 5560.4). Total num frames: 1025515520. Throughput: 0: 5854.2. Samples: 1025519686. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:01,664][25689] Avg episode reward: [(0, '0.715')] [2022-07-11 02:45:02,247][26022] Updated weights on worker 0-0, policy_version 1001484 (0.00086) [2022-07-11 02:45:04,087][26022] Updated weights on worker 0-0, policy_version 1001494 (0.00084) [2022-07-11 02:45:05,773][26022] Updated weights on worker 0-0, policy_version 1001504 (0.00270) [2022-07-11 02:45:06,691][25689] Fps is (10 sec: 5299.4, 60 sec: 5568.8, 300 sec: 5564.8). Total num frames: 1025544192. Throughput: 0: 5760.8. Samples: 1025551654. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:06,692][25689] Avg episode reward: [(0, '0.527')] [2022-07-11 02:45:07,754][26022] Updated weights on worker 0-0, policy_version 1001514 (0.00083) [2022-07-11 02:45:09,318][26022] Updated weights on worker 0-0, policy_version 1001524 (0.00087) [2022-07-11 02:45:11,438][26022] Updated weights on worker 0-0, policy_version 1001534 (0.00085) [2022-07-11 02:45:11,709][25689] Fps is (10 sec: 5810.5, 60 sec: 5592.4, 300 sec: 5565.6). Total num frames: 1025573888. Throughput: 0: 4929.7. Samples: 1025568642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:11,709][25689] Avg episode reward: [(0, '0.426')] [2022-07-11 02:45:13,047][26022] Updated weights on worker 0-0, policy_version 1001544 (0.00089) [2022-07-11 02:45:14,947][26022] Updated weights on worker 0-0, policy_version 1001554 (0.00083) [2022-07-11 02:45:16,801][25689] Fps is (10 sec: 5672.0, 60 sec: 5577.5, 300 sec: 5565.1). Total num frames: 1025601536. Throughput: 0: 5780.3. Samples: 1025602548. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:16,801][25689] Avg episode reward: [(0, '0.506')] [2022-07-11 02:45:16,808][26022] Updated weights on worker 0-0, policy_version 1001564 (0.00083) [2022-07-11 02:45:18,608][26022] Updated weights on worker 0-0, policy_version 1001574 (0.00084) [2022-07-11 02:45:20,324][26022] Updated weights on worker 0-0, policy_version 1001584 (0.00080) [2022-07-11 02:45:21,839][25689] Fps is (10 sec: 5559.3, 60 sec: 5561.3, 300 sec: 5564.8). Total num frames: 1025630208. Throughput: 0: 5767.5. Samples: 1025636052. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:21,840][25689] Avg episode reward: [(0, '0.857')] [2022-07-11 02:45:22,355][26022] Updated weights on worker 0-0, policy_version 1001594 (0.00096) [2022-07-11 02:45:24,042][26022] Updated weights on worker 0-0, policy_version 1001604 (0.00089) [2022-07-11 02:45:25,991][26022] Updated weights on worker 0-0, policy_version 1001614 (0.00090) [2022-07-11 02:45:26,907][25689] Fps is (10 sec: 5572.8, 60 sec: 5578.0, 300 sec: 5564.7). Total num frames: 1025657856. Throughput: 0: 4993.2. Samples: 1025652598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:26,907][25689] Avg episode reward: [(0, '1.265')] [2022-07-11 02:45:27,640][26022] Updated weights on worker 0-0, policy_version 1001624 (0.00087) [2022-07-11 02:45:27,763][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:45:27,785][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001001625_1025664000.pth [2022-07-11 02:45:27,800][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_000999666_1023657984.pth [2022-07-11 02:45:29,759][26022] Updated weights on worker 0-0, policy_version 1001634 (0.00096) [2022-07-11 02:45:31,247][26022] Updated weights on worker 0-0, policy_version 1001644 (0.00082) [2022-07-11 02:45:31,991][25689] Fps is (10 sec: 5547.9, 60 sec: 5571.2, 300 sec: 5568.3). Total num frames: 1025686528. Throughput: 0: 5799.8. Samples: 1025686276. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:31,992][25689] Avg episode reward: [(0, '1.759')] [2022-07-11 02:45:33,523][26022] Updated weights on worker 0-0, policy_version 1001654 (0.00090) [2022-07-11 02:45:34,929][26022] Updated weights on worker 0-0, policy_version 1001664 (0.00088) [2022-07-11 02:45:37,049][26022] Updated weights on worker 0-0, policy_version 1001674 (0.01062) [2022-07-11 02:45:37,123][25689] Fps is (10 sec: 5512.9, 60 sec: 5547.1, 300 sec: 5559.4). Total num frames: 1025714176. Throughput: 0: 5793.5. Samples: 1025720284. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:37,123][25689] Avg episode reward: [(0, '1.761')] [2022-07-11 02:45:38,675][26022] Updated weights on worker 0-0, policy_version 1001684 (0.00080) [2022-07-11 02:45:40,555][26022] Updated weights on worker 0-0, policy_version 1001694 (0.00089) [2022-07-11 02:45:42,166][25689] Fps is (10 sec: 5535.4, 60 sec: 5578.3, 300 sec: 5562.3). Total num frames: 1025742848. Throughput: 0: 4961.5. Samples: 1025736900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:42,166][25689] Avg episode reward: [(0, '1.590')] [2022-07-11 02:45:42,327][26022] Updated weights on worker 0-0, policy_version 1001704 (0.00085) [2022-07-11 02:45:44,091][26022] Updated weights on worker 0-0, policy_version 1001714 (0.00093) [2022-07-11 02:45:46,029][26022] Updated weights on worker 0-0, policy_version 1001724 (0.00089) [2022-07-11 02:45:47,190][25689] Fps is (10 sec: 5696.1, 60 sec: 5560.0, 300 sec: 5566.9). Total num frames: 1025771520. Throughput: 0: 5817.2. Samples: 1025770590. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:47,191][25689] Avg episode reward: [(0, '0.837')] [2022-07-11 02:45:47,752][26022] Updated weights on worker 0-0, policy_version 1001734 (0.00091) [2022-07-11 02:45:49,596][26022] Updated weights on worker 0-0, policy_version 1001744 (0.00085) [2022-07-11 02:45:51,724][26022] Updated weights on worker 0-0, policy_version 1001754 (0.00085) [2022-07-11 02:45:52,196][25689] Fps is (10 sec: 5615.1, 60 sec: 5577.2, 300 sec: 5565.2). Total num frames: 1025799168. Throughput: 0: 5828.4. Samples: 1025804036. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:52,196][25689] Avg episode reward: [(0, '0.821')] [2022-07-11 02:45:53,285][26022] Updated weights on worker 0-0, policy_version 1001764 (0.00090) [2022-07-11 02:45:55,325][26022] Updated weights on worker 0-0, policy_version 1001774 (0.00089) [2022-07-11 02:45:57,047][26022] Updated weights on worker 0-0, policy_version 1001784 (0.00084) [2022-07-11 02:45:57,233][25689] Fps is (10 sec: 5506.1, 60 sec: 5543.3, 300 sec: 5561.5). Total num frames: 1025826816. Throughput: 0: 4987.9. Samples: 1025820590. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:45:57,233][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 02:45:58,791][26022] Updated weights on worker 0-0, policy_version 1001794 (0.00085) [2022-07-11 02:46:00,709][26022] Updated weights on worker 0-0, policy_version 1001804 (0.00091) [2022-07-11 02:46:02,235][25689] Fps is (10 sec: 5610.0, 60 sec: 5612.7, 300 sec: 5568.6). Total num frames: 1025855488. Throughput: 0: 5866.5. Samples: 1025854636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:02,235][25689] Avg episode reward: [(0, '-0.139')] [2022-07-11 02:46:02,675][26022] Updated weights on worker 0-0, policy_version 1001814 (0.00089) [2022-07-11 02:46:04,623][26022] Updated weights on worker 0-0, policy_version 1001824 (0.00082) [2022-07-11 02:46:06,431][26022] Updated weights on worker 0-0, policy_version 1001834 (0.00088) [2022-07-11 02:46:07,271][25689] Fps is (10 sec: 5406.3, 60 sec: 5561.1, 300 sec: 5558.2). Total num frames: 1025881088. Throughput: 0: 5759.7. Samples: 1025886254. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:07,276][25689] Avg episode reward: [(0, '-0.150')] [2022-07-11 02:46:08,196][26022] Updated weights on worker 0-0, policy_version 1001844 (0.00094) [2022-07-11 02:46:10,244][26022] Updated weights on worker 0-0, policy_version 1001854 (0.00093) [2022-07-11 02:46:11,632][26022] Updated weights on worker 0-0, policy_version 1001864 (0.00083) [2022-07-11 02:46:12,300][25689] Fps is (10 sec: 5392.3, 60 sec: 5543.3, 300 sec: 5562.2). Total num frames: 1025909760. Throughput: 0: 4929.2. Samples: 1025903132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:12,300][25689] Avg episode reward: [(0, '0.005')] [2022-07-11 02:46:13,731][26022] Updated weights on worker 0-0, policy_version 1001874 (0.00087) [2022-07-11 02:46:15,695][26022] Updated weights on worker 0-0, policy_version 1001884 (0.00096) [2022-07-11 02:46:17,375][25689] Fps is (10 sec: 5675.9, 60 sec: 5561.7, 300 sec: 5562.1). Total num frames: 1025938432. Throughput: 0: 5779.8. Samples: 1025937006. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:17,375][25689] Avg episode reward: [(0, '0.738')] [2022-07-11 02:46:17,396][26022] Updated weights on worker 0-0, policy_version 1001894 (0.00086) [2022-07-11 02:46:19,282][26022] Updated weights on worker 0-0, policy_version 1001904 (0.00087) [2022-07-11 02:46:20,839][26022] Updated weights on worker 0-0, policy_version 1001914 (0.00082) [2022-07-11 02:46:22,379][25689] Fps is (10 sec: 5689.1, 60 sec: 5564.9, 300 sec: 5565.7). Total num frames: 1025967104. Throughput: 0: 5781.5. Samples: 1025971100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:22,380][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 02:46:22,751][26022] Updated weights on worker 0-0, policy_version 1001924 (0.00086) [2022-07-11 02:46:24,381][26022] Updated weights on worker 0-0, policy_version 1001934 (0.00090) [2022-07-11 02:46:26,572][26022] Updated weights on worker 0-0, policy_version 1001944 (0.00083) [2022-07-11 02:46:27,413][25689] Fps is (10 sec: 5610.3, 60 sec: 5567.9, 300 sec: 5565.7). Total num frames: 1025994752. Throughput: 0: 5896.4. Samples: 1026005018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:27,416][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 02:46:28,055][26022] Updated weights on worker 0-0, policy_version 1001954 (0.00085) [2022-07-11 02:46:30,298][26022] Updated weights on worker 0-0, policy_version 1001964 (0.00093) [2022-07-11 02:46:31,903][26022] Updated weights on worker 0-0, policy_version 1001974 (0.00095) [2022-07-11 02:46:32,427][25689] Fps is (10 sec: 5604.9, 60 sec: 5574.4, 300 sec: 5563.0). Total num frames: 1026023424. Throughput: 0: 5890.4. Samples: 1026021694. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:32,433][25689] Avg episode reward: [(0, '1.384')] [2022-07-11 02:46:33,969][26022] Updated weights on worker 0-0, policy_version 1001984 (0.00085) [2022-07-11 02:46:35,646][26022] Updated weights on worker 0-0, policy_version 1001994 (0.00089) [2022-07-11 02:46:37,475][25689] Fps is (10 sec: 5597.2, 60 sec: 5582.1, 300 sec: 5566.2). Total num frames: 1026051072. Throughput: 0: 5883.2. Samples: 1026055264. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:37,476][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 02:46:37,547][26022] Updated weights on worker 0-0, policy_version 1002004 (0.00086) [2022-07-11 02:46:39,391][26022] Updated weights on worker 0-0, policy_version 1002014 (0.00088) [2022-07-11 02:46:41,158][26022] Updated weights on worker 0-0, policy_version 1002024 (0.00093) [2022-07-11 02:46:42,499][25689] Fps is (10 sec: 5490.4, 60 sec: 5567.0, 300 sec: 5562.4). Total num frames: 1026078720. Throughput: 0: 5865.3. Samples: 1026089108. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:42,501][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 02:46:42,673][26022] Updated weights on worker 0-0, policy_version 1002034 (0.00083) [2022-07-11 02:46:44,757][26022] Updated weights on worker 0-0, policy_version 1002044 (0.00080) [2022-07-11 02:46:46,463][26022] Updated weights on worker 0-0, policy_version 1002054 (0.00088) [2022-07-11 02:46:47,525][25689] Fps is (10 sec: 5502.3, 60 sec: 5549.8, 300 sec: 5558.7). Total num frames: 1026106368. Throughput: 0: 5022.1. Samples: 1026106022. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:47,526][25689] Avg episode reward: [(0, '1.487')] [2022-07-11 02:46:48,306][26022] Updated weights on worker 0-0, policy_version 1002064 (0.00087) [2022-07-11 02:46:50,236][26022] Updated weights on worker 0-0, policy_version 1002074 (0.00085) [2022-07-11 02:46:51,867][26022] Updated weights on worker 0-0, policy_version 1002084 (0.00085) [2022-07-11 02:46:52,543][25689] Fps is (10 sec: 5811.1, 60 sec: 5599.6, 300 sec: 5573.3). Total num frames: 1026137088. Throughput: 0: 5863.0. Samples: 1026139632. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:52,544][25689] Avg episode reward: [(0, '1.773')] [2022-07-11 02:46:54,127][26022] Updated weights on worker 0-0, policy_version 1002094 (0.00087) [2022-07-11 02:46:55,608][26022] Updated weights on worker 0-0, policy_version 1002104 (0.00085) [2022-07-11 02:46:57,622][25689] Fps is (10 sec: 5780.9, 60 sec: 5595.7, 300 sec: 5565.2). Total num frames: 1026164736. Throughput: 0: 5845.2. Samples: 1026173024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:46:57,623][26022] Updated weights on worker 0-0, policy_version 1002114 (0.00091) [2022-07-11 02:46:57,624][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 02:46:59,505][26022] Updated weights on worker 0-0, policy_version 1002124 (0.00335) [2022-07-11 02:47:01,302][26022] Updated weights on worker 0-0, policy_version 1002134 (0.00085) [2022-07-11 02:47:02,711][25689] Fps is (10 sec: 5136.3, 60 sec: 5520.0, 300 sec: 5563.7). Total num frames: 1026189312. Throughput: 0: 4981.9. Samples: 1026189800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:02,711][25689] Avg episode reward: [(0, '1.065')] [2022-07-11 02:47:03,465][26022] Updated weights on worker 0-0, policy_version 1002144 (0.00084) [2022-07-11 02:47:05,310][26022] Updated weights on worker 0-0, policy_version 1002154 (0.00083) [2022-07-11 02:47:07,221][26022] Updated weights on worker 0-0, policy_version 1002164 (0.00086) [2022-07-11 02:47:07,757][25689] Fps is (10 sec: 5354.9, 60 sec: 5586.8, 300 sec: 5570.5). Total num frames: 1026219008. Throughput: 0: 5678.7. Samples: 1026220912. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:07,757][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 02:47:09,039][26022] Updated weights on worker 0-0, policy_version 1002174 (0.00088) [2022-07-11 02:47:10,901][26022] Updated weights on worker 0-0, policy_version 1002184 (0.00092) [2022-07-11 02:47:12,385][26022] Updated weights on worker 0-0, policy_version 1002194 (0.00089) [2022-07-11 02:47:12,771][25689] Fps is (10 sec: 5801.8, 60 sec: 5588.1, 300 sec: 5568.1). Total num frames: 1026247680. Throughput: 0: 5694.2. Samples: 1026254814. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:12,771][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 02:47:14,685][26022] Updated weights on worker 0-0, policy_version 1002204 (0.00098) [2022-07-11 02:47:16,175][26022] Updated weights on worker 0-0, policy_version 1002214 (0.00089) [2022-07-11 02:47:17,834][25689] Fps is (10 sec: 5487.5, 60 sec: 5555.4, 300 sec: 5560.1). Total num frames: 1026274304. Throughput: 0: 4877.0. Samples: 1026271596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:17,834][25689] Avg episode reward: [(0, '0.897')] [2022-07-11 02:47:18,235][26022] Updated weights on worker 0-0, policy_version 1002224 (0.00080) [2022-07-11 02:47:19,749][26022] Updated weights on worker 0-0, policy_version 1002234 (0.00087) [2022-07-11 02:47:21,942][26022] Updated weights on worker 0-0, policy_version 1002244 (0.00105) [2022-07-11 02:47:22,859][25689] Fps is (10 sec: 5684.5, 60 sec: 5587.3, 300 sec: 5573.6). Total num frames: 1026305024. Throughput: 0: 5748.6. Samples: 1026305624. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:22,859][25689] Avg episode reward: [(0, '1.059')] [2022-07-11 02:47:23,599][26022] Updated weights on worker 0-0, policy_version 1002254 (0.00086) [2022-07-11 02:47:25,407][26022] Updated weights on worker 0-0, policy_version 1002264 (0.00089) [2022-07-11 02:47:27,213][26022] Updated weights on worker 0-0, policy_version 1002274 (0.00085) [2022-07-11 02:47:27,845][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:47:27,858][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001002277_1026331648.pth [2022-07-11 02:47:27,859][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001000319_1024326656.pth [2022-07-11 02:47:27,860][25689] Fps is (10 sec: 5719.0, 60 sec: 5573.4, 300 sec: 5567.1). Total num frames: 1026331648. Throughput: 0: 5874.9. Samples: 1026339018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:27,861][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 02:47:29,265][26022] Updated weights on worker 0-0, policy_version 1002284 (0.00082) [2022-07-11 02:47:30,913][26022] Updated weights on worker 0-0, policy_version 1002294 (0.00092) [2022-07-11 02:47:32,790][26022] Updated weights on worker 0-0, policy_version 1002304 (0.00092) [2022-07-11 02:47:32,885][25689] Fps is (10 sec: 5413.1, 60 sec: 5555.5, 300 sec: 5565.9). Total num frames: 1026359296. Throughput: 0: 5017.4. Samples: 1026355732. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:32,885][25689] Avg episode reward: [(0, '1.048')] [2022-07-11 02:47:34,643][26022] Updated weights on worker 0-0, policy_version 1002314 (0.00096) [2022-07-11 02:47:36,434][26022] Updated weights on worker 0-0, policy_version 1002324 (0.00087) [2022-07-11 02:47:37,961][25689] Fps is (10 sec: 5575.9, 60 sec: 5569.9, 300 sec: 5565.0). Total num frames: 1026387968. Throughput: 0: 5855.4. Samples: 1026389450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:37,962][25689] Avg episode reward: [(0, '1.068')] [2022-07-11 02:47:38,373][26022] Updated weights on worker 0-0, policy_version 1002334 (0.00092) [2022-07-11 02:47:40,037][26022] Updated weights on worker 0-0, policy_version 1002344 (0.00086) [2022-07-11 02:47:42,043][26022] Updated weights on worker 0-0, policy_version 1002354 (0.00097) [2022-07-11 02:47:42,985][25689] Fps is (10 sec: 5677.5, 60 sec: 5586.7, 300 sec: 5572.8). Total num frames: 1026416640. Throughput: 0: 5825.3. Samples: 1026422866. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:42,985][25689] Avg episode reward: [(0, '0.298')] [2022-07-11 02:47:43,759][26022] Updated weights on worker 0-0, policy_version 1002364 (0.00091) [2022-07-11 02:47:45,526][26022] Updated weights on worker 0-0, policy_version 1002374 (0.00094) [2022-07-11 02:47:47,536][26022] Updated weights on worker 0-0, policy_version 1002384 (0.00086) [2022-07-11 02:47:47,994][25689] Fps is (10 sec: 5511.5, 60 sec: 5571.4, 300 sec: 5566.1). Total num frames: 1026443264. Throughput: 0: 5001.5. Samples: 1026439716. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:47,995][25689] Avg episode reward: [(0, '-0.050')] [2022-07-11 02:47:49,252][26022] Updated weights on worker 0-0, policy_version 1002394 (0.00088) [2022-07-11 02:47:51,157][26022] Updated weights on worker 0-0, policy_version 1002404 (0.00089) [2022-07-11 02:47:52,996][25689] Fps is (10 sec: 5523.4, 60 sec: 5539.0, 300 sec: 5568.5). Total num frames: 1026471936. Throughput: 0: 5849.7. Samples: 1026473378. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:52,997][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 02:47:53,002][26022] Updated weights on worker 0-0, policy_version 1002414 (0.00250) [2022-07-11 02:47:54,762][26022] Updated weights on worker 0-0, policy_version 1002424 (0.00085) [2022-07-11 02:47:56,619][26022] Updated weights on worker 0-0, policy_version 1002434 (0.00082) [2022-07-11 02:47:58,087][25689] Fps is (10 sec: 5579.9, 60 sec: 5537.8, 300 sec: 5567.7). Total num frames: 1026499584. Throughput: 0: 5831.1. Samples: 1026506808. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:47:58,088][25689] Avg episode reward: [(0, '0.438')] [2022-07-11 02:47:58,334][26022] Updated weights on worker 0-0, policy_version 1002444 (0.00090) [2022-07-11 02:48:00,251][26022] Updated weights on worker 0-0, policy_version 1002454 (0.00094) [2022-07-11 02:48:02,571][26022] Updated weights on worker 0-0, policy_version 1002464 (0.00097) [2022-07-11 02:48:03,095][25689] Fps is (10 sec: 5374.1, 60 sec: 5579.2, 300 sec: 5567.7). Total num frames: 1026526208. Throughput: 0: 5019.0. Samples: 1026523798. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:48:03,095][25689] Avg episode reward: [(0, '0.802')] [2022-07-11 02:48:04,236][26022] Updated weights on worker 0-0, policy_version 1002474 (0.00087) [2022-07-11 02:48:06,276][26022] Updated weights on worker 0-0, policy_version 1002484 (0.00111) [2022-07-11 02:48:08,080][26022] Updated weights on worker 0-0, policy_version 1002494 (0.00085) [2022-07-11 02:48:08,098][25689] Fps is (10 sec: 5421.6, 60 sec: 5549.3, 300 sec: 5564.8). Total num frames: 1026553856. Throughput: 0: 5731.7. Samples: 1026554944. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:48:08,098][25689] Avg episode reward: [(0, '1.039')] [2022-07-11 02:48:09,903][26022] Updated weights on worker 0-0, policy_version 1002504 (0.00084) [2022-07-11 02:48:11,937][26022] Updated weights on worker 0-0, policy_version 1002514 (0.00090) [2022-07-11 02:48:13,121][25689] Fps is (10 sec: 5412.9, 60 sec: 5514.5, 300 sec: 5561.6). Total num frames: 1026580480. Throughput: 0: 5703.1. Samples: 1026588154. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:48:13,122][25689] Avg episode reward: [(0, '1.064')] [2022-07-11 02:48:13,619][26022] Updated weights on worker 0-0, policy_version 1002524 (0.00084) [2022-07-11 02:48:15,426][26022] Updated weights on worker 0-0, policy_version 1002534 (0.00090) [2022-07-11 02:48:17,272][26022] Updated weights on worker 0-0, policy_version 1002544 (0.00089) [2022-07-11 02:48:18,168][25689] Fps is (10 sec: 5490.7, 60 sec: 5549.8, 300 sec: 5561.3). Total num frames: 1026609152. Throughput: 0: 4892.0. Samples: 1026605046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:48:18,169][25689] Avg episode reward: [(0, '1.147')] [2022-07-11 02:48:19,134][26022] Updated weights on worker 0-0, policy_version 1002554 (0.00086) [2022-07-11 02:48:20,878][26022] Updated weights on worker 0-0, policy_version 1002564 (0.00086) [2022-07-11 02:48:22,646][26022] Updated weights on worker 0-0, policy_version 1002574 (0.00092) [2022-07-11 02:48:23,177][25689] Fps is (10 sec: 5702.4, 60 sec: 5517.3, 300 sec: 5561.6). Total num frames: 1026637824. Throughput: 0: 5728.9. Samples: 1026638848. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2022-07-11 02:48:23,178][25689] Avg episode reward: [(0, '1.290')] [2022-07-11 02:48:24,591][26022] Updated weights on worker 0-0, policy_version 1002584 (0.00094) [2022-07-11 02:48:26,384][26022] Updated weights on worker 0-0, policy_version 1002594 (0.00080) [2022-07-11 02:48:28,055][26022] Updated weights on worker 0-0, policy_version 1002604 (0.00080) [2022-07-11 02:48:28,214][25689] Fps is (10 sec: 5708.6, 60 sec: 5548.1, 300 sec: 5568.1). Total num frames: 1026666496. Throughput: 0: 5831.7. Samples: 1026672254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:28,214][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 02:48:30,137][26022] Updated weights on worker 0-0, policy_version 1002614 (0.00085) [2022-07-11 02:48:31,679][26022] Updated weights on worker 0-0, policy_version 1002624 (0.00085) [2022-07-11 02:48:33,218][25689] Fps is (10 sec: 5609.1, 60 sec: 5549.9, 300 sec: 5566.9). Total num frames: 1026694144. Throughput: 0: 5032.4. Samples: 1026689290. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:33,219][25689] Avg episode reward: [(0, '1.027')] [2022-07-11 02:48:33,601][26022] Updated weights on worker 0-0, policy_version 1002634 (0.00081) [2022-07-11 02:48:35,383][26022] Updated weights on worker 0-0, policy_version 1002644 (0.00085) [2022-07-11 02:48:37,277][26022] Updated weights on worker 0-0, policy_version 1002654 (0.00086) [2022-07-11 02:48:38,273][25689] Fps is (10 sec: 5598.6, 60 sec: 5551.9, 300 sec: 5566.2). Total num frames: 1026722816. Throughput: 0: 5874.1. Samples: 1026723144. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:38,274][25689] Avg episode reward: [(0, '0.916')] [2022-07-11 02:48:39,141][26022] Updated weights on worker 0-0, policy_version 1002664 (0.00077) [2022-07-11 02:48:41,122][26022] Updated weights on worker 0-0, policy_version 1002674 (0.00091) [2022-07-11 02:48:42,743][26022] Updated weights on worker 0-0, policy_version 1002684 (0.00085) [2022-07-11 02:48:43,319][25689] Fps is (10 sec: 5677.6, 60 sec: 5549.9, 300 sec: 5566.6). Total num frames: 1026751488. Throughput: 0: 5864.0. Samples: 1026756954. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:43,319][25689] Avg episode reward: [(0, '1.671')] [2022-07-11 02:48:44,680][26022] Updated weights on worker 0-0, policy_version 1002694 (0.00085) [2022-07-11 02:48:46,307][26022] Updated weights on worker 0-0, policy_version 1002704 (0.00090) [2022-07-11 02:48:48,227][26022] Updated weights on worker 0-0, policy_version 1002714 (0.00440) [2022-07-11 02:48:48,338][25689] Fps is (10 sec: 5595.6, 60 sec: 5565.9, 300 sec: 5562.9). Total num frames: 1026779136. Throughput: 0: 5048.1. Samples: 1026773846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:48,339][25689] Avg episode reward: [(0, '1.163')] [2022-07-11 02:48:50,077][26022] Updated weights on worker 0-0, policy_version 1002724 (0.00098) [2022-07-11 02:48:51,786][26022] Updated weights on worker 0-0, policy_version 1002734 (0.00087) [2022-07-11 02:48:53,396][25689] Fps is (10 sec: 5385.3, 60 sec: 5526.9, 300 sec: 5556.6). Total num frames: 1026805760. Throughput: 0: 5840.7. Samples: 1026807144. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:53,397][25689] Avg episode reward: [(0, '1.073')] [2022-07-11 02:48:53,783][26022] Updated weights on worker 0-0, policy_version 1002744 (0.00082) [2022-07-11 02:48:55,618][26022] Updated weights on worker 0-0, policy_version 1002754 (0.00095) [2022-07-11 02:48:57,696][26022] Updated weights on worker 0-0, policy_version 1002764 (0.00100) [2022-07-11 02:48:58,500][25689] Fps is (10 sec: 5542.2, 60 sec: 5559.6, 300 sec: 5565.5). Total num frames: 1026835456. Throughput: 0: 5806.6. Samples: 1026840594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:48:58,501][25689] Avg episode reward: [(0, '1.025')] [2022-07-11 02:48:59,179][26022] Updated weights on worker 0-0, policy_version 1002774 (0.00089) [2022-07-11 02:49:01,178][26022] Updated weights on worker 0-0, policy_version 1002784 (0.00086) [2022-07-11 02:49:03,346][26022] Updated weights on worker 0-0, policy_version 1002794 (0.00097) [2022-07-11 02:49:03,519][25689] Fps is (10 sec: 5462.7, 60 sec: 5541.7, 300 sec: 5562.1). Total num frames: 1026861056. Throughput: 0: 4973.6. Samples: 1026857422. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:03,521][25689] Avg episode reward: [(0, '0.135')] [2022-07-11 02:49:05,247][26022] Updated weights on worker 0-0, policy_version 1002804 (0.00086) [2022-07-11 02:49:06,986][26022] Updated weights on worker 0-0, policy_version 1002814 (0.00085) [2022-07-11 02:49:08,584][25689] Fps is (10 sec: 5382.2, 60 sec: 5552.9, 300 sec: 5562.6). Total num frames: 1026889728. Throughput: 0: 5701.4. Samples: 1026889274. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:08,586][25689] Avg episode reward: [(0, '-0.302')] [2022-07-11 02:49:08,911][26022] Updated weights on worker 0-0, policy_version 1002824 (0.00085) [2022-07-11 02:49:10,426][26022] Updated weights on worker 0-0, policy_version 1002834 (0.00083) [2022-07-11 02:49:12,463][26022] Updated weights on worker 0-0, policy_version 1002844 (0.00080) [2022-07-11 02:49:13,681][25689] Fps is (10 sec: 5743.5, 60 sec: 5596.8, 300 sec: 5566.3). Total num frames: 1026919424. Throughput: 0: 5718.7. Samples: 1026923148. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:13,683][25689] Avg episode reward: [(0, '-0.415')] [2022-07-11 02:49:14,211][26022] Updated weights on worker 0-0, policy_version 1002854 (0.00081) [2022-07-11 02:49:15,898][26022] Updated weights on worker 0-0, policy_version 1002864 (0.00092) [2022-07-11 02:49:17,823][26022] Updated weights on worker 0-0, policy_version 1002874 (0.00094) [2022-07-11 02:49:18,781][25689] Fps is (10 sec: 5523.2, 60 sec: 5558.2, 300 sec: 5555.0). Total num frames: 1026946048. Throughput: 0: 5725.7. Samples: 1026956714. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:18,781][25689] Avg episode reward: [(0, '0.103')] [2022-07-11 02:49:19,715][26022] Updated weights on worker 0-0, policy_version 1002884 (0.00086) [2022-07-11 02:49:21,449][26022] Updated weights on worker 0-0, policy_version 1002894 (0.00088) [2022-07-11 02:49:23,425][26022] Updated weights on worker 0-0, policy_version 1002904 (0.00084) [2022-07-11 02:49:23,809][25689] Fps is (10 sec: 5560.9, 60 sec: 5573.3, 300 sec: 5566.1). Total num frames: 1026975744. Throughput: 0: 5725.9. Samples: 1026973602. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:23,811][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 02:49:25,164][26022] Updated weights on worker 0-0, policy_version 1002914 (0.00075) [2022-07-11 02:49:27,008][26022] Updated weights on worker 0-0, policy_version 1002924 (0.00090) [2022-07-11 02:49:27,938][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:49:27,950][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001002929_1026999296.pth [2022-07-11 02:49:27,950][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001000971_1024994304.pth [2022-07-11 02:49:28,858][25689] Fps is (10 sec: 5690.4, 60 sec: 5555.3, 300 sec: 5561.9). Total num frames: 1027003392. Throughput: 0: 5823.2. Samples: 1027007336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:28,859][25689] Avg episode reward: [(0, '0.294')] [2022-07-11 02:49:28,939][26022] Updated weights on worker 0-0, policy_version 1002934 (0.00095) [2022-07-11 02:49:30,880][26022] Updated weights on worker 0-0, policy_version 1002944 (0.00088) [2022-07-11 02:49:32,399][26022] Updated weights on worker 0-0, policy_version 1002954 (0.00104) [2022-07-11 02:49:33,900][25689] Fps is (10 sec: 5581.5, 60 sec: 5568.8, 300 sec: 5562.1). Total num frames: 1027032064. Throughput: 0: 5838.1. Samples: 1027041186. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:33,902][25689] Avg episode reward: [(0, '1.114')] [2022-07-11 02:49:34,426][26022] Updated weights on worker 0-0, policy_version 1002964 (0.00087) [2022-07-11 02:49:36,076][26022] Updated weights on worker 0-0, policy_version 1002974 (0.00085) [2022-07-11 02:49:37,995][26022] Updated weights on worker 0-0, policy_version 1002984 (0.00080) [2022-07-11 02:49:38,990][25689] Fps is (10 sec: 5760.8, 60 sec: 5582.4, 300 sec: 5571.0). Total num frames: 1027061760. Throughput: 0: 5021.4. Samples: 1027058194. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:38,991][25689] Avg episode reward: [(0, '1.349')] [2022-07-11 02:49:39,736][26022] Updated weights on worker 0-0, policy_version 1002994 (0.00088) [2022-07-11 02:49:41,614][26022] Updated weights on worker 0-0, policy_version 1003004 (0.00091) [2022-07-11 02:49:43,400][26022] Updated weights on worker 0-0, policy_version 1003014 (0.00094) [2022-07-11 02:49:44,006][25689] Fps is (10 sec: 5673.8, 60 sec: 5568.2, 300 sec: 5564.0). Total num frames: 1027089408. Throughput: 0: 5848.9. Samples: 1027091734. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:44,007][25689] Avg episode reward: [(0, '1.242')] [2022-07-11 02:49:45,388][26022] Updated weights on worker 0-0, policy_version 1003024 (0.00082) [2022-07-11 02:49:47,037][26022] Updated weights on worker 0-0, policy_version 1003034 (0.00081) [2022-07-11 02:49:48,944][26022] Updated weights on worker 0-0, policy_version 1003044 (0.00090) [2022-07-11 02:49:49,037][25689] Fps is (10 sec: 5503.8, 60 sec: 5567.2, 300 sec: 5567.0). Total num frames: 1027117056. Throughput: 0: 5844.3. Samples: 1027125268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:49,037][25689] Avg episode reward: [(0, '1.365')] [2022-07-11 02:49:50,624][26022] Updated weights on worker 0-0, policy_version 1003054 (0.00085) [2022-07-11 02:49:52,574][26022] Updated weights on worker 0-0, policy_version 1003064 (0.00089) [2022-07-11 02:49:54,045][25689] Fps is (10 sec: 5712.5, 60 sec: 5622.4, 300 sec: 5567.6). Total num frames: 1027146752. Throughput: 0: 5015.9. Samples: 1027142232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:54,045][25689] Avg episode reward: [(0, '1.297')] [2022-07-11 02:49:54,373][26022] Updated weights on worker 0-0, policy_version 1003074 (0.00082) [2022-07-11 02:49:56,275][26022] Updated weights on worker 0-0, policy_version 1003084 (0.00092) [2022-07-11 02:49:57,955][26022] Updated weights on worker 0-0, policy_version 1003094 (0.00095) [2022-07-11 02:49:59,083][25689] Fps is (10 sec: 5504.4, 60 sec: 5561.0, 300 sec: 5570.7). Total num frames: 1027172352. Throughput: 0: 5860.0. Samples: 1027175936. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:49:59,083][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 02:49:59,874][26022] Updated weights on worker 0-0, policy_version 1003104 (0.00087) [2022-07-11 02:50:01,579][26022] Updated weights on worker 0-0, policy_version 1003114 (0.00095) [2022-07-11 02:50:03,920][26022] Updated weights on worker 0-0, policy_version 1003124 (0.00090) [2022-07-11 02:50:04,103][25689] Fps is (10 sec: 5192.2, 60 sec: 5577.7, 300 sec: 5563.9). Total num frames: 1027198976. Throughput: 0: 5756.2. Samples: 1027207414. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:04,103][25689] Avg episode reward: [(0, '1.685')] [2022-07-11 02:50:05,854][26022] Updated weights on worker 0-0, policy_version 1003134 (0.00084) [2022-07-11 02:50:07,632][26022] Updated weights on worker 0-0, policy_version 1003144 (0.00088) [2022-07-11 02:50:09,105][25689] Fps is (10 sec: 5517.4, 60 sec: 5583.5, 300 sec: 5560.8). Total num frames: 1027227648. Throughput: 0: 4934.1. Samples: 1027224284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:09,105][25689] Avg episode reward: [(0, '1.895')] [2022-07-11 02:50:09,362][26022] Updated weights on worker 0-0, policy_version 1003154 (0.00094) [2022-07-11 02:50:11,375][26022] Updated weights on worker 0-0, policy_version 1003164 (0.00087) [2022-07-11 02:50:13,159][26022] Updated weights on worker 0-0, policy_version 1003174 (0.00094) [2022-07-11 02:50:14,115][25689] Fps is (10 sec: 5625.5, 60 sec: 5557.7, 300 sec: 5562.3). Total num frames: 1027255296. Throughput: 0: 5761.9. Samples: 1027257872. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:14,115][25689] Avg episode reward: [(0, '1.836')] [2022-07-11 02:50:15,023][26022] Updated weights on worker 0-0, policy_version 1003184 (0.00086) [2022-07-11 02:50:16,763][26022] Updated weights on worker 0-0, policy_version 1003194 (0.00144) [2022-07-11 02:50:18,507][26022] Updated weights on worker 0-0, policy_version 1003204 (0.00614) [2022-07-11 02:50:19,181][25689] Fps is (10 sec: 5589.1, 60 sec: 5594.6, 300 sec: 5561.8). Total num frames: 1027283968. Throughput: 0: 5743.9. Samples: 1027291382. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:19,182][25689] Avg episode reward: [(0, '1.823')] [2022-07-11 02:50:20,470][26022] Updated weights on worker 0-0, policy_version 1003214 (0.00089) [2022-07-11 02:50:22,222][26022] Updated weights on worker 0-0, policy_version 1003224 (0.00086) [2022-07-11 02:50:23,871][26022] Updated weights on worker 0-0, policy_version 1003234 (0.00089) [2022-07-11 02:50:24,218][25689] Fps is (10 sec: 5675.7, 60 sec: 5576.9, 300 sec: 5565.8). Total num frames: 1027312640. Throughput: 0: 5016.3. Samples: 1027308318. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:24,219][25689] Avg episode reward: [(0, '1.641')] [2022-07-11 02:50:26,048][26022] Updated weights on worker 0-0, policy_version 1003244 (0.00096) [2022-07-11 02:50:27,554][26022] Updated weights on worker 0-0, policy_version 1003254 (0.00083) [2022-07-11 02:50:29,221][25689] Fps is (10 sec: 5507.7, 60 sec: 5564.2, 300 sec: 5560.5). Total num frames: 1027339264. Throughput: 0: 5866.6. Samples: 1027342300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:29,222][25689] Avg episode reward: [(0, '1.723')] [2022-07-11 02:50:29,501][26022] Updated weights on worker 0-0, policy_version 1003264 (0.00087) [2022-07-11 02:50:31,232][26022] Updated weights on worker 0-0, policy_version 1003274 (0.00088) [2022-07-11 02:50:33,035][26022] Updated weights on worker 0-0, policy_version 1003284 (0.00092) [2022-07-11 02:50:34,244][25689] Fps is (10 sec: 5515.3, 60 sec: 5565.9, 300 sec: 5565.9). Total num frames: 1027367936. Throughput: 0: 5874.3. Samples: 1027376118. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:34,247][25689] Avg episode reward: [(0, '1.291')] [2022-07-11 02:50:35,006][26022] Updated weights on worker 0-0, policy_version 1003294 (0.00091) [2022-07-11 02:50:36,670][26022] Updated weights on worker 0-0, policy_version 1003304 (0.00096) [2022-07-11 02:50:38,604][26022] Updated weights on worker 0-0, policy_version 1003314 (0.00096) [2022-07-11 02:50:39,370][25689] Fps is (10 sec: 5852.0, 60 sec: 5579.6, 300 sec: 5571.3). Total num frames: 1027398656. Throughput: 0: 5034.5. Samples: 1027393022. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:39,371][25689] Avg episode reward: [(0, '1.493')] [2022-07-11 02:50:40,638][26022] Updated weights on worker 0-0, policy_version 1003324 (0.00093) [2022-07-11 02:50:42,027][26022] Updated weights on worker 0-0, policy_version 1003334 (0.00087) [2022-07-11 02:50:44,151][26022] Updated weights on worker 0-0, policy_version 1003344 (0.00052) [2022-07-11 02:50:44,398][25689] Fps is (10 sec: 5647.2, 60 sec: 5561.5, 300 sec: 5564.3). Total num frames: 1027425280. Throughput: 0: 5869.2. Samples: 1027426760. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:44,399][25689] Avg episode reward: [(0, '1.769')] [2022-07-11 02:50:45,713][26022] Updated weights on worker 0-0, policy_version 1003354 (0.00089) [2022-07-11 02:50:47,761][26022] Updated weights on worker 0-0, policy_version 1003364 (0.00084) [2022-07-11 02:50:49,427][25689] Fps is (10 sec: 5498.6, 60 sec: 5578.7, 300 sec: 5567.3). Total num frames: 1027453952. Throughput: 0: 5861.8. Samples: 1027460740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:49,432][25689] Avg episode reward: [(0, '1.583')] [2022-07-11 02:50:49,484][26022] Updated weights on worker 0-0, policy_version 1003374 (0.00082) [2022-07-11 02:50:51,346][26022] Updated weights on worker 0-0, policy_version 1003384 (0.00086) [2022-07-11 02:50:53,110][26022] Updated weights on worker 0-0, policy_version 1003394 (0.00094) [2022-07-11 02:50:54,463][25689] Fps is (10 sec: 5697.7, 60 sec: 5559.1, 300 sec: 5570.8). Total num frames: 1027482624. Throughput: 0: 5017.3. Samples: 1027477562. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:54,464][25689] Avg episode reward: [(0, '1.450')] [2022-07-11 02:50:54,945][26022] Updated weights on worker 0-0, policy_version 1003404 (0.00086) [2022-07-11 02:50:56,743][26022] Updated weights on worker 0-0, policy_version 1003414 (0.00085) [2022-07-11 02:50:58,682][26022] Updated weights on worker 0-0, policy_version 1003424 (0.00080) [2022-07-11 02:50:59,525][25689] Fps is (10 sec: 5678.6, 60 sec: 5607.7, 300 sec: 5569.7). Total num frames: 1027511296. Throughput: 0: 5872.1. Samples: 1027511372. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:50:59,526][25689] Avg episode reward: [(0, '0.859')] [2022-07-11 02:51:00,522][26022] Updated weights on worker 0-0, policy_version 1003434 (0.00088) [2022-07-11 02:51:02,537][26022] Updated weights on worker 0-0, policy_version 1003444 (0.00085) [2022-07-11 02:51:04,532][25689] Fps is (10 sec: 5288.0, 60 sec: 5575.0, 300 sec: 5566.8). Total num frames: 1027535872. Throughput: 0: 5763.9. Samples: 1027542810. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:04,533][25689] Avg episode reward: [(0, '0.969')] [2022-07-11 02:51:04,670][26022] Updated weights on worker 0-0, policy_version 1003454 (0.00095) [2022-07-11 02:51:06,211][26022] Updated weights on worker 0-0, policy_version 1003464 (0.00089) [2022-07-11 02:51:08,087][26022] Updated weights on worker 0-0, policy_version 1003474 (0.00083) [2022-07-11 02:51:09,541][25689] Fps is (10 sec: 5316.1, 60 sec: 5574.4, 300 sec: 5567.1). Total num frames: 1027564544. Throughput: 0: 4926.7. Samples: 1027559838. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:09,542][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 02:51:09,935][26022] Updated weights on worker 0-0, policy_version 1003484 (0.00081) [2022-07-11 02:51:11,630][26022] Updated weights on worker 0-0, policy_version 1003494 (0.00090) [2022-07-11 02:51:13,731][26022] Updated weights on worker 0-0, policy_version 1003504 (0.00086) [2022-07-11 02:51:14,554][25689] Fps is (10 sec: 5824.0, 60 sec: 5608.0, 300 sec: 5571.7). Total num frames: 1027594240. Throughput: 0: 5774.7. Samples: 1027593584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:14,555][25689] Avg episode reward: [(0, '1.136')] [2022-07-11 02:51:15,305][26022] Updated weights on worker 0-0, policy_version 1003514 (0.00091) [2022-07-11 02:51:17,176][26022] Updated weights on worker 0-0, policy_version 1003524 (0.00088) [2022-07-11 02:51:19,141][26022] Updated weights on worker 0-0, policy_version 1003534 (0.00085) [2022-07-11 02:51:19,661][25689] Fps is (10 sec: 5565.2, 60 sec: 5570.4, 300 sec: 5562.9). Total num frames: 1027620864. Throughput: 0: 5764.1. Samples: 1027627440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:19,662][25689] Avg episode reward: [(0, '1.092')] [2022-07-11 02:51:20,719][26022] Updated weights on worker 0-0, policy_version 1003544 (0.00083) [2022-07-11 02:51:22,676][26022] Updated weights on worker 0-0, policy_version 1003554 (0.00091) [2022-07-11 02:51:24,670][25689] Fps is (10 sec: 5364.9, 60 sec: 5556.0, 300 sec: 5563.4). Total num frames: 1027648512. Throughput: 0: 5043.2. Samples: 1027644370. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:24,671][25689] Avg episode reward: [(0, '0.316')] [2022-07-11 02:51:24,769][26022] Updated weights on worker 0-0, policy_version 1003564 (0.00085) [2022-07-11 02:51:26,215][26022] Updated weights on worker 0-0, policy_version 1003574 (0.00086) [2022-07-11 02:51:28,043][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:51:28,054][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001003582_1027667968.pth [2022-07-11 02:51:28,055][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001001625_1025664000.pth [2022-07-11 02:51:28,315][26022] Updated weights on worker 0-0, policy_version 1003584 (0.00084) [2022-07-11 02:51:29,676][25689] Fps is (10 sec: 5828.3, 60 sec: 5623.5, 300 sec: 5570.5). Total num frames: 1027679232. Throughput: 0: 5868.2. Samples: 1027677992. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:29,676][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 02:51:29,720][26022] Updated weights on worker 0-0, policy_version 1003594 (0.00087) [2022-07-11 02:51:31,927][26022] Updated weights on worker 0-0, policy_version 1003604 (0.00083) [2022-07-11 02:51:33,606][26022] Updated weights on worker 0-0, policy_version 1003614 (0.00084) [2022-07-11 02:51:34,687][25689] Fps is (10 sec: 5725.0, 60 sec: 5590.7, 300 sec: 5567.7). Total num frames: 1027705856. Throughput: 0: 5849.8. Samples: 1027711354. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:34,687][25689] Avg episode reward: [(0, '0.750')] [2022-07-11 02:51:35,515][26022] Updated weights on worker 0-0, policy_version 1003624 (0.00092) [2022-07-11 02:51:37,493][26022] Updated weights on worker 0-0, policy_version 1003634 (0.00089) [2022-07-11 02:51:39,202][26022] Updated weights on worker 0-0, policy_version 1003644 (0.00086) [2022-07-11 02:51:39,807][25689] Fps is (10 sec: 5357.0, 60 sec: 5540.5, 300 sec: 5565.9). Total num frames: 1027733504. Throughput: 0: 4988.1. Samples: 1027727928. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:39,807][25689] Avg episode reward: [(0, '0.243')] [2022-07-11 02:51:41,082][26022] Updated weights on worker 0-0, policy_version 1003654 (0.00086) [2022-07-11 02:51:43,058][26022] Updated weights on worker 0-0, policy_version 1003664 (0.00079) [2022-07-11 02:51:44,665][26022] Updated weights on worker 0-0, policy_version 1003674 (0.00090) [2022-07-11 02:51:44,838][25689] Fps is (10 sec: 5548.3, 60 sec: 5574.1, 300 sec: 5569.3). Total num frames: 1027762176. Throughput: 0: 5798.3. Samples: 1027761306. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:44,838][25689] Avg episode reward: [(0, '0.157')] [2022-07-11 02:51:46,752][26022] Updated weights on worker 0-0, policy_version 1003684 (0.00085) [2022-07-11 02:51:48,347][26022] Updated weights on worker 0-0, policy_version 1003694 (0.00083) [2022-07-11 02:51:49,868][25689] Fps is (10 sec: 5597.5, 60 sec: 5556.9, 300 sec: 5558.7). Total num frames: 1027789824. Throughput: 0: 5794.6. Samples: 1027795000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:49,870][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 02:51:50,422][26022] Updated weights on worker 0-0, policy_version 1003704 (0.00084) [2022-07-11 02:51:52,005][26022] Updated weights on worker 0-0, policy_version 1003714 (0.00086) [2022-07-11 02:51:53,988][26022] Updated weights on worker 0-0, policy_version 1003724 (0.00093) [2022-07-11 02:51:54,904][25689] Fps is (10 sec: 5696.6, 60 sec: 5573.9, 300 sec: 5566.4). Total num frames: 1027819520. Throughput: 0: 4976.9. Samples: 1027811974. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:51:54,909][25689] Avg episode reward: [(0, '1.170')] [2022-07-11 02:51:55,736][26022] Updated weights on worker 0-0, policy_version 1003734 (0.00098) [2022-07-11 02:51:57,474][26022] Updated weights on worker 0-0, policy_version 1003744 (0.00089) [2022-07-11 02:51:59,331][26022] Updated weights on worker 0-0, policy_version 1003754 (0.00089) [2022-07-11 02:52:00,009][25689] Fps is (10 sec: 5553.8, 60 sec: 5536.1, 300 sec: 5573.0). Total num frames: 1027846144. Throughput: 0: 5824.3. Samples: 1027845594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:52:00,010][25689] Avg episode reward: [(0, '1.281')] [2022-07-11 02:52:01,180][26022] Updated weights on worker 0-0, policy_version 1003764 (0.00808) [2022-07-11 02:52:03,499][26022] Updated weights on worker 0-0, policy_version 1003774 (0.00086) [2022-07-11 02:52:05,031][25689] Fps is (10 sec: 5359.3, 60 sec: 5585.6, 300 sec: 5566.5). Total num frames: 1027873792. Throughput: 0: 5728.6. Samples: 1027876984. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:52:05,031][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 02:52:05,103][26022] Updated weights on worker 0-0, policy_version 1003784 (0.00082) [2022-07-11 02:52:07,199][26022] Updated weights on worker 0-0, policy_version 1003794 (0.00084) [2022-07-11 02:52:08,971][26022] Updated weights on worker 0-0, policy_version 1003804 (0.00890) [2022-07-11 02:52:10,040][25689] Fps is (10 sec: 5512.4, 60 sec: 5568.6, 300 sec: 5563.2). Total num frames: 1027901440. Throughput: 0: 5735.5. Samples: 1027910698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:52:10,041][25689] Avg episode reward: [(0, '0.506')] [2022-07-11 02:52:10,767][26022] Updated weights on worker 0-0, policy_version 1003814 (0.00103) [2022-07-11 02:52:12,590][26022] Updated weights on worker 0-0, policy_version 1003824 (0.00093) [2022-07-11 02:52:14,376][26022] Updated weights on worker 0-0, policy_version 1003834 (0.00090) [2022-07-11 02:52:15,082][25689] Fps is (10 sec: 5501.3, 60 sec: 5532.1, 300 sec: 5567.0). Total num frames: 1027929088. Throughput: 0: 5712.2. Samples: 1027927234. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:52:15,083][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 02:52:16,422][26022] Updated weights on worker 0-0, policy_version 1003844 (0.00087) [2022-07-11 02:52:18,155][26022] Updated weights on worker 0-0, policy_version 1003854 (0.00093) [2022-07-11 02:52:19,899][26022] Updated weights on worker 0-0, policy_version 1003864 (0.00091) [2022-07-11 02:52:20,182][25689] Fps is (10 sec: 5654.3, 60 sec: 5583.5, 300 sec: 5562.2). Total num frames: 1027958784. Throughput: 0: 5715.5. Samples: 1027960892. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 02:52:20,189][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 02:52:21,671][26022] Updated weights on worker 0-0, policy_version 1003874 (0.00082) [2022-07-11 02:52:23,509][26022] Updated weights on worker 0-0, policy_version 1003884 (0.00088) [2022-07-11 02:52:25,235][25689] Fps is (10 sec: 5647.8, 60 sec: 5579.5, 300 sec: 5564.6). Total num frames: 1027986432. Throughput: 0: 5840.0. Samples: 1027994980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:25,236][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 02:52:25,435][26022] Updated weights on worker 0-0, policy_version 1003894 (0.00091) [2022-07-11 02:52:27,070][26022] Updated weights on worker 0-0, policy_version 1003904 (0.00102) [2022-07-11 02:52:28,993][26022] Updated weights on worker 0-0, policy_version 1003914 (0.00082) [2022-07-11 02:52:30,315][25689] Fps is (10 sec: 5558.2, 60 sec: 5538.9, 300 sec: 5567.0). Total num frames: 1028015104. Throughput: 0: 4987.5. Samples: 1028011816. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:30,315][25689] Avg episode reward: [(0, '0.726')] [2022-07-11 02:52:30,987][26022] Updated weights on worker 0-0, policy_version 1003924 (0.00089) [2022-07-11 02:52:32,543][26022] Updated weights on worker 0-0, policy_version 1003934 (0.00091) [2022-07-11 02:52:34,648][26022] Updated weights on worker 0-0, policy_version 1003944 (0.00085) [2022-07-11 02:52:35,347][25689] Fps is (10 sec: 5671.4, 60 sec: 5570.7, 300 sec: 5567.9). Total num frames: 1028043776. Throughput: 0: 5824.8. Samples: 1028045270. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:35,347][25689] Avg episode reward: [(0, '0.566')] [2022-07-11 02:52:36,295][26022] Updated weights on worker 0-0, policy_version 1003954 (0.00080) [2022-07-11 02:52:38,032][26022] Updated weights on worker 0-0, policy_version 1003964 (0.00080) [2022-07-11 02:52:40,118][26022] Updated weights on worker 0-0, policy_version 1003974 (0.00082) [2022-07-11 02:52:40,413][25689] Fps is (10 sec: 5475.9, 60 sec: 5558.8, 300 sec: 5560.2). Total num frames: 1028070400. Throughput: 0: 5832.7. Samples: 1028078892. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:40,413][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 02:52:41,809][26022] Updated weights on worker 0-0, policy_version 1003984 (0.00087) [2022-07-11 02:52:43,822][26022] Updated weights on worker 0-0, policy_version 1003994 (0.00085) [2022-07-11 02:52:45,451][25689] Fps is (10 sec: 5472.5, 60 sec: 5558.1, 300 sec: 5566.5). Total num frames: 1028099072. Throughput: 0: 4986.9. Samples: 1028095792. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:45,451][25689] Avg episode reward: [(0, '1.156')] [2022-07-11 02:52:45,482][26022] Updated weights on worker 0-0, policy_version 1004004 (0.00085) [2022-07-11 02:52:47,361][26022] Updated weights on worker 0-0, policy_version 1004014 (0.00084) [2022-07-11 02:52:49,074][26022] Updated weights on worker 0-0, policy_version 1004024 (0.00084) [2022-07-11 02:52:50,471][25689] Fps is (10 sec: 5701.1, 60 sec: 5576.0, 300 sec: 5566.2). Total num frames: 1028127744. Throughput: 0: 5846.2. Samples: 1028129656. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:50,472][25689] Avg episode reward: [(0, '1.112')] [2022-07-11 02:52:51,102][26022] Updated weights on worker 0-0, policy_version 1004034 (0.00086) [2022-07-11 02:52:52,713][26022] Updated weights on worker 0-0, policy_version 1004044 (0.00087) [2022-07-11 02:52:54,835][26022] Updated weights on worker 0-0, policy_version 1004054 (0.00086) [2022-07-11 02:52:55,511][25689] Fps is (10 sec: 5699.9, 60 sec: 5558.7, 300 sec: 5570.6). Total num frames: 1028156416. Throughput: 0: 5849.9. Samples: 1028163232. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:52:55,513][25689] Avg episode reward: [(0, '1.176')] [2022-07-11 02:52:56,417][26022] Updated weights on worker 0-0, policy_version 1004064 (0.00092) [2022-07-11 02:52:58,295][26022] Updated weights on worker 0-0, policy_version 1004074 (0.00083) [2022-07-11 02:53:00,095][26022] Updated weights on worker 0-0, policy_version 1004084 (0.00083) [2022-07-11 02:53:00,577][25689] Fps is (10 sec: 5674.3, 60 sec: 5596.1, 300 sec: 5576.4). Total num frames: 1028185088. Throughput: 0: 5028.1. Samples: 1028180280. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:00,579][25689] Avg episode reward: [(0, '-0.002')] [2022-07-11 02:53:02,377][26022] Updated weights on worker 0-0, policy_version 1004094 (0.00085) [2022-07-11 02:53:04,090][26022] Updated weights on worker 0-0, policy_version 1004104 (0.00094) [2022-07-11 02:53:05,595][25689] Fps is (10 sec: 5280.6, 60 sec: 5545.7, 300 sec: 5565.8). Total num frames: 1028209664. Throughput: 0: 5750.1. Samples: 1028211624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:05,597][25689] Avg episode reward: [(0, '0.171')] [2022-07-11 02:53:05,970][26022] Updated weights on worker 0-0, policy_version 1004114 (0.00089) [2022-07-11 02:53:07,620][26022] Updated weights on worker 0-0, policy_version 1004124 (0.00092) [2022-07-11 02:53:09,538][26022] Updated weights on worker 0-0, policy_version 1004134 (0.00084) [2022-07-11 02:53:10,607][25689] Fps is (10 sec: 5308.8, 60 sec: 5562.4, 300 sec: 5572.9). Total num frames: 1028238336. Throughput: 0: 5732.8. Samples: 1028245092. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:10,609][25689] Avg episode reward: [(0, '0.071')] [2022-07-11 02:53:11,443][26022] Updated weights on worker 0-0, policy_version 1004144 (0.00086) [2022-07-11 02:53:13,233][26022] Updated weights on worker 0-0, policy_version 1004154 (0.00086) [2022-07-11 02:53:15,192][26022] Updated weights on worker 0-0, policy_version 1004164 (0.00088) [2022-07-11 02:53:15,615][25689] Fps is (10 sec: 5621.0, 60 sec: 5565.5, 300 sec: 5570.2). Total num frames: 1028265984. Throughput: 0: 4907.9. Samples: 1028261898. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:15,615][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 02:53:17,116][26022] Updated weights on worker 0-0, policy_version 1004174 (0.00087) [2022-07-11 02:53:18,598][26022] Updated weights on worker 0-0, policy_version 1004184 (0.00085) [2022-07-11 02:53:20,639][26022] Updated weights on worker 0-0, policy_version 1004194 (0.00085) [2022-07-11 02:53:20,736][25689] Fps is (10 sec: 5560.5, 60 sec: 5546.7, 300 sec: 5568.1). Total num frames: 1028294656. Throughput: 0: 5711.2. Samples: 1028295410. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:20,736][25689] Avg episode reward: [(0, '-0.118')] [2022-07-11 02:53:22,386][26022] Updated weights on worker 0-0, policy_version 1004204 (0.00090) [2022-07-11 02:53:24,274][26022] Updated weights on worker 0-0, policy_version 1004214 (0.00436) [2022-07-11 02:53:25,803][25689] Fps is (10 sec: 5628.4, 60 sec: 5562.3, 300 sec: 5567.5). Total num frames: 1028323328. Throughput: 0: 5815.4. Samples: 1028329142. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:25,805][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 02:53:26,300][26022] Updated weights on worker 0-0, policy_version 1004224 (0.00086) [2022-07-11 02:53:27,830][26022] Updated weights on worker 0-0, policy_version 1004234 (0.00098) [2022-07-11 02:53:28,285][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:53:28,295][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001004236_1028337664.pth [2022-07-11 02:53:28,306][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001002277_1026331648.pth [2022-07-11 02:53:29,852][26022] Updated weights on worker 0-0, policy_version 1004244 (0.00086) [2022-07-11 02:53:30,839][25689] Fps is (10 sec: 5675.9, 60 sec: 5566.3, 300 sec: 5570.4). Total num frames: 1028352000. Throughput: 0: 4987.2. Samples: 1028345992. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:30,839][25689] Avg episode reward: [(0, '0.895')] [2022-07-11 02:53:31,639][26022] Updated weights on worker 0-0, policy_version 1004254 (0.00085) [2022-07-11 02:53:33,395][26022] Updated weights on worker 0-0, policy_version 1004264 (0.00083) [2022-07-11 02:53:35,245][26022] Updated weights on worker 0-0, policy_version 1004274 (0.00087) [2022-07-11 02:53:35,856][25689] Fps is (10 sec: 5500.4, 60 sec: 5533.8, 300 sec: 5564.2). Total num frames: 1028378624. Throughput: 0: 5820.5. Samples: 1028379714. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:35,857][25689] Avg episode reward: [(0, '0.738')] [2022-07-11 02:53:37,070][26022] Updated weights on worker 0-0, policy_version 1004284 (0.00088) [2022-07-11 02:53:38,900][26022] Updated weights on worker 0-0, policy_version 1004294 (0.00086) [2022-07-11 02:53:40,852][26022] Updated weights on worker 0-0, policy_version 1004304 (0.00084) [2022-07-11 02:53:41,000][25689] Fps is (10 sec: 5542.8, 60 sec: 5577.4, 300 sec: 5565.8). Total num frames: 1028408320. Throughput: 0: 5816.2. Samples: 1028413270. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:41,000][25689] Avg episode reward: [(0, '0.694')] [2022-07-11 02:53:42,519][26022] Updated weights on worker 0-0, policy_version 1004314 (0.00090) [2022-07-11 02:53:44,504][26022] Updated weights on worker 0-0, policy_version 1004324 (0.00086) [2022-07-11 02:53:46,056][25689] Fps is (10 sec: 5722.4, 60 sec: 5575.8, 300 sec: 5568.5). Total num frames: 1028436992. Throughput: 0: 4994.2. Samples: 1028430290. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:46,057][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 02:53:46,191][26022] Updated weights on worker 0-0, policy_version 1004334 (0.00091) [2022-07-11 02:53:47,971][26022] Updated weights on worker 0-0, policy_version 1004344 (0.00093) [2022-07-11 02:53:49,855][26022] Updated weights on worker 0-0, policy_version 1004354 (0.00096) [2022-07-11 02:53:51,066][25689] Fps is (10 sec: 5594.9, 60 sec: 5559.8, 300 sec: 5572.9). Total num frames: 1028464640. Throughput: 0: 5851.5. Samples: 1028464352. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:51,067][25689] Avg episode reward: [(0, '0.585')] [2022-07-11 02:53:51,625][26022] Updated weights on worker 0-0, policy_version 1004364 (0.00090) [2022-07-11 02:53:53,367][26022] Updated weights on worker 0-0, policy_version 1004374 (0.00095) [2022-07-11 02:53:55,479][26022] Updated weights on worker 0-0, policy_version 1004384 (0.00089) [2022-07-11 02:53:56,073][25689] Fps is (10 sec: 5622.3, 60 sec: 5562.8, 300 sec: 5571.3). Total num frames: 1028493312. Throughput: 0: 5842.0. Samples: 1028497822. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:53:56,074][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 02:53:57,110][26022] Updated weights on worker 0-0, policy_version 1004394 (0.00097) [2022-07-11 02:53:59,075][26022] Updated weights on worker 0-0, policy_version 1004404 (0.00986) [2022-07-11 02:54:00,656][26022] Updated weights on worker 0-0, policy_version 1004414 (0.00091) [2022-07-11 02:54:01,113][25689] Fps is (10 sec: 5605.9, 60 sec: 5548.3, 300 sec: 5577.7). Total num frames: 1028520960. Throughput: 0: 5031.5. Samples: 1028514468. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:01,113][25689] Avg episode reward: [(0, '0.183')] [2022-07-11 02:54:03,180][26022] Updated weights on worker 0-0, policy_version 1004424 (0.00086) [2022-07-11 02:54:04,738][26022] Updated weights on worker 0-0, policy_version 1004434 (0.00085) [2022-07-11 02:54:06,132][25689] Fps is (10 sec: 5191.9, 60 sec: 5548.2, 300 sec: 5564.8). Total num frames: 1028545536. Throughput: 0: 5758.1. Samples: 1028545890. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:06,133][25689] Avg episode reward: [(0, '-0.006')] [2022-07-11 02:54:06,734][26022] Updated weights on worker 0-0, policy_version 1004444 (0.00082) [2022-07-11 02:54:08,657][26022] Updated weights on worker 0-0, policy_version 1004454 (0.00086) [2022-07-11 02:54:10,475][26022] Updated weights on worker 0-0, policy_version 1004464 (0.00082) [2022-07-11 02:54:11,147][25689] Fps is (10 sec: 5306.4, 60 sec: 5547.9, 300 sec: 5562.9). Total num frames: 1028574208. Throughput: 0: 5719.5. Samples: 1028579206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:11,148][25689] Avg episode reward: [(0, '0.774')] [2022-07-11 02:54:12,225][26022] Updated weights on worker 0-0, policy_version 1004474 (0.00085) [2022-07-11 02:54:14,193][26022] Updated weights on worker 0-0, policy_version 1004484 (0.00084) [2022-07-11 02:54:15,926][26022] Updated weights on worker 0-0, policy_version 1004494 (0.00099) [2022-07-11 02:54:16,163][25689] Fps is (10 sec: 5819.0, 60 sec: 5581.0, 300 sec: 5574.8). Total num frames: 1028603904. Throughput: 0: 4891.8. Samples: 1028596096. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:16,163][25689] Avg episode reward: [(0, '0.748')] [2022-07-11 02:54:17,993][26022] Updated weights on worker 0-0, policy_version 1004504 (0.00082) [2022-07-11 02:54:19,592][26022] Updated weights on worker 0-0, policy_version 1004514 (0.00088) [2022-07-11 02:54:21,216][25689] Fps is (10 sec: 5491.7, 60 sec: 5536.5, 300 sec: 5560.6). Total num frames: 1028629504. Throughput: 0: 5719.7. Samples: 1028629456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:21,217][25689] Avg episode reward: [(0, '0.689')] [2022-07-11 02:54:21,570][26022] Updated weights on worker 0-0, policy_version 1004524 (0.00084) [2022-07-11 02:54:23,325][26022] Updated weights on worker 0-0, policy_version 1004534 (0.00086) [2022-07-11 02:54:25,108][26022] Updated weights on worker 0-0, policy_version 1004544 (0.00089) [2022-07-11 02:54:26,230][25689] Fps is (10 sec: 5492.4, 60 sec: 5558.3, 300 sec: 5568.1). Total num frames: 1028659200. Throughput: 0: 5846.3. Samples: 1028663390. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:26,231][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 02:54:26,943][26022] Updated weights on worker 0-0, policy_version 1004554 (0.00083) [2022-07-11 02:54:28,823][26022] Updated weights on worker 0-0, policy_version 1004564 (0.00087) [2022-07-11 02:54:30,559][26022] Updated weights on worker 0-0, policy_version 1004574 (0.00095) [2022-07-11 02:54:31,234][25689] Fps is (10 sec: 5724.5, 60 sec: 5544.3, 300 sec: 5565.4). Total num frames: 1028686848. Throughput: 0: 5028.1. Samples: 1028680202. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:31,234][25689] Avg episode reward: [(0, '0.209')] [2022-07-11 02:54:32,508][26022] Updated weights on worker 0-0, policy_version 1004584 (0.00085) [2022-07-11 02:54:34,217][26022] Updated weights on worker 0-0, policy_version 1004594 (0.00095) [2022-07-11 02:54:36,119][26022] Updated weights on worker 0-0, policy_version 1004604 (0.00097) [2022-07-11 02:54:36,269][25689] Fps is (10 sec: 5610.4, 60 sec: 5576.6, 300 sec: 5563.0). Total num frames: 1028715520. Throughput: 0: 5848.0. Samples: 1028713676. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:36,269][25689] Avg episode reward: [(0, '0.756')] [2022-07-11 02:54:37,967][26022] Updated weights on worker 0-0, policy_version 1004614 (0.00089) [2022-07-11 02:54:39,726][26022] Updated weights on worker 0-0, policy_version 1004624 (0.00092) [2022-07-11 02:54:41,308][25689] Fps is (10 sec: 5590.3, 60 sec: 5552.3, 300 sec: 5562.6). Total num frames: 1028743168. Throughput: 0: 5845.9. Samples: 1028746912. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:41,309][25689] Avg episode reward: [(0, '0.628')] [2022-07-11 02:54:41,580][26022] Updated weights on worker 0-0, policy_version 1004634 (0.00088) [2022-07-11 02:54:43,554][26022] Updated weights on worker 0-0, policy_version 1004644 (0.00089) [2022-07-11 02:54:45,190][26022] Updated weights on worker 0-0, policy_version 1004654 (0.00846) [2022-07-11 02:54:46,321][25689] Fps is (10 sec: 5501.1, 60 sec: 5539.3, 300 sec: 5562.9). Total num frames: 1028770816. Throughput: 0: 4995.6. Samples: 1028763754. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:46,322][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 02:54:47,269][26022] Updated weights on worker 0-0, policy_version 1004664 (0.00099) [2022-07-11 02:54:48,719][26022] Updated weights on worker 0-0, policy_version 1004674 (0.00085) [2022-07-11 02:54:50,886][26022] Updated weights on worker 0-0, policy_version 1004684 (0.00085) [2022-07-11 02:54:51,328][25689] Fps is (10 sec: 5620.7, 60 sec: 5556.5, 300 sec: 5559.5). Total num frames: 1028799488. Throughput: 0: 5838.5. Samples: 1028797524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:51,329][25689] Avg episode reward: [(0, '1.368')] [2022-07-11 02:54:52,514][26022] Updated weights on worker 0-0, policy_version 1004694 (0.00087) [2022-07-11 02:54:54,363][26022] Updated weights on worker 0-0, policy_version 1004704 (0.00499) [2022-07-11 02:54:56,405][25689] Fps is (10 sec: 5585.1, 60 sec: 5533.2, 300 sec: 5565.6). Total num frames: 1028827136. Throughput: 0: 5836.3. Samples: 1028831194. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:54:56,405][25689] Avg episode reward: [(0, '1.781')] [2022-07-11 02:54:56,414][26022] Updated weights on worker 0-0, policy_version 1004714 (0.00087) [2022-07-11 02:54:57,955][26022] Updated weights on worker 0-0, policy_version 1004724 (0.00082) [2022-07-11 02:54:59,882][26022] Updated weights on worker 0-0, policy_version 1004734 (0.00090) [2022-07-11 02:55:01,533][25689] Fps is (10 sec: 5619.6, 60 sec: 5559.0, 300 sec: 5573.9). Total num frames: 1028856832. Throughput: 0: 5831.9. Samples: 1028864858. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:01,533][25689] Avg episode reward: [(0, '1.930')] [2022-07-11 02:55:01,674][26022] Updated weights on worker 0-0, policy_version 1004744 (0.00089) [2022-07-11 02:55:03,886][26022] Updated weights on worker 0-0, policy_version 1004754 (0.00085) [2022-07-11 02:55:05,952][26022] Updated weights on worker 0-0, policy_version 1004764 (0.00092) [2022-07-11 02:55:06,602][25689] Fps is (10 sec: 5422.4, 60 sec: 5571.3, 300 sec: 5562.3). Total num frames: 1028882432. Throughput: 0: 5713.1. Samples: 1028879624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:06,603][25689] Avg episode reward: [(0, '1.864')] [2022-07-11 02:55:07,373][26022] Updated weights on worker 0-0, policy_version 1004774 (0.00083) [2022-07-11 02:55:09,472][26022] Updated weights on worker 0-0, policy_version 1004784 (0.00094) [2022-07-11 02:55:11,215][26022] Updated weights on worker 0-0, policy_version 1004794 (0.00090) [2022-07-11 02:55:11,655][25689] Fps is (10 sec: 5260.6, 60 sec: 5550.9, 300 sec: 5561.5). Total num frames: 1028910080. Throughput: 0: 5689.5. Samples: 1028913170. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:11,655][25689] Avg episode reward: [(0, '1.891')] [2022-07-11 02:55:13,036][26022] Updated weights on worker 0-0, policy_version 1004804 (0.00085) [2022-07-11 02:55:14,969][26022] Updated weights on worker 0-0, policy_version 1004814 (0.00093) [2022-07-11 02:55:16,656][26022] Updated weights on worker 0-0, policy_version 1004824 (0.00086) [2022-07-11 02:55:16,688][25689] Fps is (10 sec: 5685.5, 60 sec: 5549.3, 300 sec: 5565.6). Total num frames: 1028939776. Throughput: 0: 5712.3. Samples: 1028947060. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:16,690][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 02:55:18,627][26022] Updated weights on worker 0-0, policy_version 1004834 (0.00081) [2022-07-11 02:55:20,539][26022] Updated weights on worker 0-0, policy_version 1004844 (0.00093) [2022-07-11 02:55:21,822][25689] Fps is (10 sec: 5639.9, 60 sec: 5575.7, 300 sec: 5560.3). Total num frames: 1028967424. Throughput: 0: 4867.7. Samples: 1028963622. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:21,823][25689] Avg episode reward: [(0, '0.656')] [2022-07-11 02:55:22,341][26022] Updated weights on worker 0-0, policy_version 1004854 (0.00085) [2022-07-11 02:55:24,102][26022] Updated weights on worker 0-0, policy_version 1004864 (0.00064) [2022-07-11 02:55:26,095][26022] Updated weights on worker 0-0, policy_version 1004874 (0.00083) [2022-07-11 02:55:26,865][25689] Fps is (10 sec: 5534.3, 60 sec: 5556.2, 300 sec: 5566.5). Total num frames: 1028996096. Throughput: 0: 5807.5. Samples: 1028997298. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:26,865][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 02:55:27,850][26022] Updated weights on worker 0-0, policy_version 1004884 (0.00089) [2022-07-11 02:55:28,330][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:55:28,357][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001004888_1029005312.pth [2022-07-11 02:55:28,358][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001002929_1026999296.pth [2022-07-11 02:55:29,615][26022] Updated weights on worker 0-0, policy_version 1004894 (0.00090) [2022-07-11 02:55:31,475][26022] Updated weights on worker 0-0, policy_version 1004904 (0.00055) [2022-07-11 02:55:31,922][25689] Fps is (10 sec: 5576.4, 60 sec: 5551.3, 300 sec: 5562.4). Total num frames: 1029023744. Throughput: 0: 5792.7. Samples: 1029030572. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:31,923][25689] Avg episode reward: [(0, '0.722')] [2022-07-11 02:55:33,227][26022] Updated weights on worker 0-0, policy_version 1004914 (0.00087) [2022-07-11 02:55:35,231][26022] Updated weights on worker 0-0, policy_version 1004924 (0.00095) [2022-07-11 02:55:37,020][25689] Fps is (10 sec: 5445.3, 60 sec: 5528.7, 300 sec: 5552.6). Total num frames: 1029051392. Throughput: 0: 4937.5. Samples: 1029047446. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:37,021][25689] Avg episode reward: [(0, '0.315')] [2022-07-11 02:55:37,192][26022] Updated weights on worker 0-0, policy_version 1004934 (0.00088) [2022-07-11 02:55:38,963][26022] Updated weights on worker 0-0, policy_version 1004944 (0.00084) [2022-07-11 02:55:40,731][26022] Updated weights on worker 0-0, policy_version 1004954 (0.00083) [2022-07-11 02:55:42,079][25689] Fps is (10 sec: 5544.8, 60 sec: 5543.7, 300 sec: 5558.9). Total num frames: 1029080064. Throughput: 0: 5777.6. Samples: 1029080658. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:42,080][25689] Avg episode reward: [(0, '0.511')] [2022-07-11 02:55:42,684][26022] Updated weights on worker 0-0, policy_version 1004964 (0.00093) [2022-07-11 02:55:44,280][26022] Updated weights on worker 0-0, policy_version 1004974 (0.00086) [2022-07-11 02:55:46,266][26022] Updated weights on worker 0-0, policy_version 1004984 (0.00089) [2022-07-11 02:55:47,081][25689] Fps is (10 sec: 5598.0, 60 sec: 5544.7, 300 sec: 5556.0). Total num frames: 1029107712. Throughput: 0: 5794.3. Samples: 1029114434. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:47,081][25689] Avg episode reward: [(0, '1.077')] [2022-07-11 02:55:47,977][26022] Updated weights on worker 0-0, policy_version 1004994 (0.00087) [2022-07-11 02:55:50,013][26022] Updated weights on worker 0-0, policy_version 1005004 (0.00081) [2022-07-11 02:55:51,547][26022] Updated weights on worker 0-0, policy_version 1005014 (0.00086) [2022-07-11 02:55:52,125][25689] Fps is (10 sec: 5606.6, 60 sec: 5541.4, 300 sec: 5555.8). Total num frames: 1029136384. Throughput: 0: 4983.9. Samples: 1029131260. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:52,125][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 02:55:53,712][26022] Updated weights on worker 0-0, policy_version 1005025 (0.00080) [2022-07-11 02:55:55,535][26022] Updated weights on worker 0-0, policy_version 1005035 (0.00095) [2022-07-11 02:55:57,134][25689] Fps is (10 sec: 5602.2, 60 sec: 5547.5, 300 sec: 5553.4). Total num frames: 1029164032. Throughput: 0: 5845.2. Samples: 1029165016. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:55:57,134][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 02:55:57,339][26022] Updated weights on worker 0-0, policy_version 1005045 (0.00090) [2022-07-11 02:55:59,302][26022] Updated weights on worker 0-0, policy_version 1005055 (0.00089) [2022-07-11 02:56:01,038][26022] Updated weights on worker 0-0, policy_version 1005065 (0.00083) [2022-07-11 02:56:02,217][25689] Fps is (10 sec: 5377.5, 60 sec: 5501.0, 300 sec: 5558.9). Total num frames: 1029190656. Throughput: 0: 5847.0. Samples: 1029198402. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:56:02,217][25689] Avg episode reward: [(0, '0.593')] [2022-07-11 02:56:03,245][26022] Updated weights on worker 0-0, policy_version 1005075 (0.00080) [2022-07-11 02:56:04,998][26022] Updated weights on worker 0-0, policy_version 1005085 (0.00092) [2022-07-11 02:56:06,639][26022] Updated weights on worker 0-0, policy_version 1005095 (0.00086) [2022-07-11 02:56:07,235][25689] Fps is (10 sec: 5474.3, 60 sec: 5556.4, 300 sec: 5558.7). Total num frames: 1029219328. Throughput: 0: 4904.1. Samples: 1029213272. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:56:07,235][25689] Avg episode reward: [(0, '0.723')] [2022-07-11 02:56:08,796][26022] Updated weights on worker 0-0, policy_version 1005105 (0.00087) [2022-07-11 02:56:10,516][26022] Updated weights on worker 0-0, policy_version 1005115 (0.00088) [2022-07-11 02:56:12,254][25689] Fps is (10 sec: 5611.1, 60 sec: 5559.4, 300 sec: 5551.7). Total num frames: 1029246976. Throughput: 0: 5739.9. Samples: 1029246802. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:56:12,255][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 02:56:12,455][26022] Updated weights on worker 0-0, policy_version 1005125 (0.00091) [2022-07-11 02:56:13,955][26022] Updated weights on worker 0-0, policy_version 1005135 (0.00094) [2022-07-11 02:56:16,079][26022] Updated weights on worker 0-0, policy_version 1005145 (0.00090) [2022-07-11 02:56:17,282][25689] Fps is (10 sec: 5707.4, 60 sec: 5559.9, 300 sec: 5563.5). Total num frames: 1029276672. Throughput: 0: 5739.4. Samples: 1029280656. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:56:17,283][25689] Avg episode reward: [(0, '0.018')] [2022-07-11 02:56:17,839][26022] Updated weights on worker 0-0, policy_version 1005155 (0.00099) [2022-07-11 02:56:19,663][26022] Updated weights on worker 0-0, policy_version 1005165 (0.00079) [2022-07-11 02:56:21,504][26022] Updated weights on worker 0-0, policy_version 1005175 (0.00100) [2022-07-11 02:56:22,391][25689] Fps is (10 sec: 5455.0, 60 sec: 5528.4, 300 sec: 5554.7). Total num frames: 1029302272. Throughput: 0: 4909.2. Samples: 1029297440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 02:56:22,393][25689] Avg episode reward: [(0, '0.665')] [2022-07-11 02:56:23,307][26022] Updated weights on worker 0-0, policy_version 1005185 (0.00102) [2022-07-11 02:56:25,241][26022] Updated weights on worker 0-0, policy_version 1005195 (0.00086) [2022-07-11 02:56:26,915][26022] Updated weights on worker 0-0, policy_version 1005205 (0.00091) [2022-07-11 02:56:27,466][25689] Fps is (10 sec: 5530.3, 60 sec: 5559.2, 300 sec: 5553.4). Total num frames: 1029332992. Throughput: 0: 5825.4. Samples: 1029331126. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:27,468][25689] Avg episode reward: [(0, '-0.714')] [2022-07-11 02:56:28,782][26022] Updated weights on worker 0-0, policy_version 1005215 (0.00092) [2022-07-11 02:56:30,659][26022] Updated weights on worker 0-0, policy_version 1005225 (0.00091) [2022-07-11 02:56:32,462][26022] Updated weights on worker 0-0, policy_version 1005235 (0.00093) [2022-07-11 02:56:32,560][25689] Fps is (10 sec: 5739.6, 60 sec: 5555.8, 300 sec: 5555.3). Total num frames: 1029360640. Throughput: 0: 5806.2. Samples: 1029364704. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:32,561][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 02:56:34,282][26022] Updated weights on worker 0-0, policy_version 1005245 (0.00086) [2022-07-11 02:56:36,034][26022] Updated weights on worker 0-0, policy_version 1005255 (0.00083) [2022-07-11 02:56:37,613][25689] Fps is (10 sec: 5550.2, 60 sec: 5576.8, 300 sec: 5560.0). Total num frames: 1029389312. Throughput: 0: 5801.2. Samples: 1029398602. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:37,614][25689] Avg episode reward: [(0, '-0.706')] [2022-07-11 02:56:38,004][26022] Updated weights on worker 0-0, policy_version 1005265 (0.00088) [2022-07-11 02:56:39,814][26022] Updated weights on worker 0-0, policy_version 1005275 (0.00087) [2022-07-11 02:56:41,613][26022] Updated weights on worker 0-0, policy_version 1005285 (0.00291) [2022-07-11 02:56:42,663][25689] Fps is (10 sec: 5574.8, 60 sec: 5560.8, 300 sec: 5556.2). Total num frames: 1029416960. Throughput: 0: 5801.7. Samples: 1029415052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:42,664][25689] Avg episode reward: [(0, '-0.537')] [2022-07-11 02:56:43,574][26022] Updated weights on worker 0-0, policy_version 1005295 (0.00081) [2022-07-11 02:56:45,336][26022] Updated weights on worker 0-0, policy_version 1005305 (0.00080) [2022-07-11 02:56:47,245][26022] Updated weights on worker 0-0, policy_version 1005315 (0.00085) [2022-07-11 02:56:47,712][25689] Fps is (10 sec: 5577.1, 60 sec: 5573.4, 300 sec: 5559.3). Total num frames: 1029445632. Throughput: 0: 5800.8. Samples: 1029448568. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:47,712][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 02:56:49,085][26022] Updated weights on worker 0-0, policy_version 1005325 (0.00084) [2022-07-11 02:56:50,734][26022] Updated weights on worker 0-0, policy_version 1005335 (0.00084) [2022-07-11 02:56:52,714][25689] Fps is (10 sec: 5603.5, 60 sec: 5560.3, 300 sec: 5553.1). Total num frames: 1029473280. Throughput: 0: 5836.6. Samples: 1029482332. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:52,714][25689] Avg episode reward: [(0, '0.329')] [2022-07-11 02:56:52,715][26022] Updated weights on worker 0-0, policy_version 1005345 (0.00078) [2022-07-11 02:56:54,302][26022] Updated weights on worker 0-0, policy_version 1005355 (0.00083) [2022-07-11 02:56:56,402][26022] Updated weights on worker 0-0, policy_version 1005365 (0.00092) [2022-07-11 02:56:57,758][25689] Fps is (10 sec: 5605.8, 60 sec: 5574.0, 300 sec: 5561.1). Total num frames: 1029501952. Throughput: 0: 4990.0. Samples: 1029499126. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:56:57,760][25689] Avg episode reward: [(0, '0.760')] [2022-07-11 02:56:58,012][26022] Updated weights on worker 0-0, policy_version 1005375 (0.00084) [2022-07-11 02:56:59,970][26022] Updated weights on worker 0-0, policy_version 1005385 (0.00475) [2022-07-11 02:57:02,103][26022] Updated weights on worker 0-0, policy_version 1005395 (0.00093) [2022-07-11 02:57:02,832][25689] Fps is (10 sec: 5363.5, 60 sec: 5557.9, 300 sec: 5553.2). Total num frames: 1029527552. Throughput: 0: 5804.6. Samples: 1029532128. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:02,833][25689] Avg episode reward: [(0, '1.454')] [2022-07-11 02:57:04,036][26022] Updated weights on worker 0-0, policy_version 1005405 (0.00087) [2022-07-11 02:57:05,959][26022] Updated weights on worker 0-0, policy_version 1005415 (0.00083) [2022-07-11 02:57:07,778][26022] Updated weights on worker 0-0, policy_version 1005425 (0.00090) [2022-07-11 02:57:07,872][25689] Fps is (10 sec: 5264.8, 60 sec: 5539.0, 300 sec: 5552.7). Total num frames: 1029555200. Throughput: 0: 5730.5. Samples: 1029564098. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:07,873][25689] Avg episode reward: [(0, '1.160')] [2022-07-11 02:57:09,539][26022] Updated weights on worker 0-0, policy_version 1005435 (0.00088) [2022-07-11 02:57:11,416][26022] Updated weights on worker 0-0, policy_version 1005445 (0.00090) [2022-07-11 02:57:12,934][25689] Fps is (10 sec: 5575.4, 60 sec: 5552.0, 300 sec: 5555.7). Total num frames: 1029583872. Throughput: 0: 4851.1. Samples: 1029580422. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:12,936][25689] Avg episode reward: [(0, '0.912')] [2022-07-11 02:57:13,416][26022] Updated weights on worker 0-0, policy_version 1005455 (0.00081) [2022-07-11 02:57:15,000][26022] Updated weights on worker 0-0, policy_version 1005465 (0.00105) [2022-07-11 02:57:16,830][26022] Updated weights on worker 0-0, policy_version 1005475 (0.00079) [2022-07-11 02:57:17,975][25689] Fps is (10 sec: 5574.5, 60 sec: 5517.0, 300 sec: 5549.9). Total num frames: 1029611520. Throughput: 0: 5697.1. Samples: 1029614304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:17,976][25689] Avg episode reward: [(0, '0.989')] [2022-07-11 02:57:18,569][26022] Updated weights on worker 0-0, policy_version 1005485 (0.00086) [2022-07-11 02:57:20,596][26022] Updated weights on worker 0-0, policy_version 1005495 (0.00089) [2022-07-11 02:57:22,424][26022] Updated weights on worker 0-0, policy_version 1005505 (0.00095) [2022-07-11 02:57:23,098][25689] Fps is (10 sec: 5742.6, 60 sec: 5600.1, 300 sec: 5559.0). Total num frames: 1029642240. Throughput: 0: 5713.3. Samples: 1029647912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:23,099][25689] Avg episode reward: [(0, '1.524')] [2022-07-11 02:57:24,404][26022] Updated weights on worker 0-0, policy_version 1005515 (0.00090) [2022-07-11 02:57:25,895][26022] Updated weights on worker 0-0, policy_version 1005525 (0.00093) [2022-07-11 02:57:27,957][26022] Updated weights on worker 0-0, policy_version 1005535 (0.00088) [2022-07-11 02:57:28,116][25689] Fps is (10 sec: 5654.9, 60 sec: 5537.8, 300 sec: 5553.2). Total num frames: 1029668864. Throughput: 0: 4972.3. Samples: 1029664756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:28,117][25689] Avg episode reward: [(0, '1.713')] [2022-07-11 02:57:28,437][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:57:28,450][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001005538_1029670912.pth [2022-07-11 02:57:28,450][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001003582_1027667968.pth [2022-07-11 02:57:29,700][26022] Updated weights on worker 0-0, policy_version 1005545 (0.00087) [2022-07-11 02:57:31,583][26022] Updated weights on worker 0-0, policy_version 1005555 (0.00087) [2022-07-11 02:57:33,175][25689] Fps is (10 sec: 5385.8, 60 sec: 5541.1, 300 sec: 5549.3). Total num frames: 1029696512. Throughput: 0: 5807.6. Samples: 1029697972. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:33,175][25689] Avg episode reward: [(0, '1.484')] [2022-07-11 02:57:33,447][26022] Updated weights on worker 0-0, policy_version 1005565 (0.00096) [2022-07-11 02:57:35,187][26022] Updated weights on worker 0-0, policy_version 1005575 (0.00488) [2022-07-11 02:57:37,160][26022] Updated weights on worker 0-0, policy_version 1005585 (0.00090) [2022-07-11 02:57:38,214][25689] Fps is (10 sec: 5577.5, 60 sec: 5542.4, 300 sec: 5556.7). Total num frames: 1029725184. Throughput: 0: 5802.3. Samples: 1029731732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:38,214][25689] Avg episode reward: [(0, '0.882')] [2022-07-11 02:57:38,992][26022] Updated weights on worker 0-0, policy_version 1005595 (0.00092) [2022-07-11 02:57:40,614][26022] Updated weights on worker 0-0, policy_version 1005605 (0.00094) [2022-07-11 02:57:42,542][26022] Updated weights on worker 0-0, policy_version 1005615 (0.00088) [2022-07-11 02:57:43,299][25689] Fps is (10 sec: 5461.9, 60 sec: 5522.3, 300 sec: 5548.9). Total num frames: 1029751808. Throughput: 0: 4966.7. Samples: 1029748244. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:43,299][25689] Avg episode reward: [(0, '1.002')] [2022-07-11 02:57:44,207][26022] Updated weights on worker 0-0, policy_version 1005625 (0.00086) [2022-07-11 02:57:46,331][26022] Updated weights on worker 0-0, policy_version 1005635 (0.00085) [2022-07-11 02:57:47,944][26022] Updated weights on worker 0-0, policy_version 1005645 (0.00091) [2022-07-11 02:57:48,327][25689] Fps is (10 sec: 5670.2, 60 sec: 5557.9, 300 sec: 5555.7). Total num frames: 1029782528. Throughput: 0: 5815.1. Samples: 1029782284. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:48,332][25689] Avg episode reward: [(0, '1.235')] [2022-07-11 02:57:49,828][26022] Updated weights on worker 0-0, policy_version 1005655 (0.00084) [2022-07-11 02:57:51,402][26022] Updated weights on worker 0-0, policy_version 1005665 (0.00069) [2022-07-11 02:57:53,340][25689] Fps is (10 sec: 5812.9, 60 sec: 5556.9, 300 sec: 5552.7). Total num frames: 1029810176. Throughput: 0: 5865.1. Samples: 1029816242. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:53,341][25689] Avg episode reward: [(0, '1.147')] [2022-07-11 02:57:53,539][26022] Updated weights on worker 0-0, policy_version 1005675 (0.00075) [2022-07-11 02:57:55,243][26022] Updated weights on worker 0-0, policy_version 1005685 (0.00083) [2022-07-11 02:57:57,170][26022] Updated weights on worker 0-0, policy_version 1005695 (0.00091) [2022-07-11 02:57:58,360][25689] Fps is (10 sec: 5613.6, 60 sec: 5559.2, 300 sec: 5553.6). Total num frames: 1029838848. Throughput: 0: 5041.1. Samples: 1029833290. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:57:58,360][25689] Avg episode reward: [(0, '1.062')] [2022-07-11 02:57:58,876][26022] Updated weights on worker 0-0, policy_version 1005705 (0.00096) [2022-07-11 02:58:00,860][26022] Updated weights on worker 0-0, policy_version 1005715 (0.00094) [2022-07-11 02:58:02,815][26022] Updated weights on worker 0-0, policy_version 1005725 (0.00086) [2022-07-11 02:58:03,453][25689] Fps is (10 sec: 5467.6, 60 sec: 5574.3, 300 sec: 5559.0). Total num frames: 1029865472. Throughput: 0: 5814.3. Samples: 1029865428. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:03,454][25689] Avg episode reward: [(0, '1.361')] [2022-07-11 02:58:04,717][26022] Updated weights on worker 0-0, policy_version 1005735 (0.00092) [2022-07-11 02:58:06,514][26022] Updated weights on worker 0-0, policy_version 1005745 (0.00090) [2022-07-11 02:58:08,464][25689] Fps is (10 sec: 5371.2, 60 sec: 5577.0, 300 sec: 5555.6). Total num frames: 1029893120. Throughput: 0: 5786.9. Samples: 1029898816. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:08,464][25689] Avg episode reward: [(0, '1.776')] [2022-07-11 02:58:08,469][26022] Updated weights on worker 0-0, policy_version 1005755 (0.00054) [2022-07-11 02:58:10,275][26022] Updated weights on worker 0-0, policy_version 1005765 (0.00086) [2022-07-11 02:58:12,161][26022] Updated weights on worker 0-0, policy_version 1005775 (0.00089) [2022-07-11 02:58:13,522][25689] Fps is (10 sec: 5491.7, 60 sec: 5560.4, 300 sec: 5554.7). Total num frames: 1029920768. Throughput: 0: 4917.2. Samples: 1029915482. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:13,523][25689] Avg episode reward: [(0, '1.896')] [2022-07-11 02:58:13,811][26022] Updated weights on worker 0-0, policy_version 1005785 (0.00081) [2022-07-11 02:58:15,834][26022] Updated weights on worker 0-0, policy_version 1005795 (0.00088) [2022-07-11 02:58:17,535][26022] Updated weights on worker 0-0, policy_version 1005805 (0.00086) [2022-07-11 02:58:18,555][25689] Fps is (10 sec: 5581.5, 60 sec: 5578.2, 300 sec: 5556.3). Total num frames: 1029949440. Throughput: 0: 5743.7. Samples: 1029949282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:18,555][25689] Avg episode reward: [(0, '1.735')] [2022-07-11 02:58:19,388][26022] Updated weights on worker 0-0, policy_version 1005815 (0.00086) [2022-07-11 02:58:21,051][26022] Updated weights on worker 0-0, policy_version 1005825 (0.00085) [2022-07-11 02:58:23,256][26022] Updated weights on worker 0-0, policy_version 1005835 (0.00085) [2022-07-11 02:58:23,657][25689] Fps is (10 sec: 5658.3, 60 sec: 5546.2, 300 sec: 5555.7). Total num frames: 1029978112. Throughput: 0: 5811.6. Samples: 1029982842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:23,659][25689] Avg episode reward: [(0, '1.669')] [2022-07-11 02:58:24,699][26022] Updated weights on worker 0-0, policy_version 1005845 (0.00547) [2022-07-11 02:58:26,740][26022] Updated weights on worker 0-0, policy_version 1005855 (0.00094) [2022-07-11 02:58:28,507][26022] Updated weights on worker 0-0, policy_version 1005865 (0.00095) [2022-07-11 02:58:28,682][25689] Fps is (10 sec: 5662.2, 60 sec: 5579.4, 300 sec: 5555.9). Total num frames: 1030006784. Throughput: 0: 4980.2. Samples: 1029999506. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:28,683][25689] Avg episode reward: [(0, '1.542')] [2022-07-11 02:58:30,595][26022] Updated weights on worker 0-0, policy_version 1005875 (0.00088) [2022-07-11 02:58:32,095][26022] Updated weights on worker 0-0, policy_version 1005885 (0.00084) [2022-07-11 02:58:33,731][25689] Fps is (10 sec: 5488.6, 60 sec: 5563.3, 300 sec: 5555.3). Total num frames: 1030033408. Throughput: 0: 5800.7. Samples: 1030032708. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:33,734][25689] Avg episode reward: [(0, '1.380')] [2022-07-11 02:58:34,321][26022] Updated weights on worker 0-0, policy_version 1005895 (0.00093) [2022-07-11 02:58:35,814][26022] Updated weights on worker 0-0, policy_version 1005905 (0.00090) [2022-07-11 02:58:37,895][26022] Updated weights on worker 0-0, policy_version 1005915 (0.00880) [2022-07-11 02:58:38,751][25689] Fps is (10 sec: 5593.3, 60 sec: 5582.0, 300 sec: 5557.6). Total num frames: 1030063104. Throughput: 0: 5801.2. Samples: 1030066446. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:38,752][25689] Avg episode reward: [(0, '1.273')] [2022-07-11 02:58:39,424][26022] Updated weights on worker 0-0, policy_version 1005925 (0.00090) [2022-07-11 02:58:41,600][26022] Updated weights on worker 0-0, policy_version 1005935 (0.00095) [2022-07-11 02:58:43,187][26022] Updated weights on worker 0-0, policy_version 1005945 (0.00089) [2022-07-11 02:58:43,887][25689] Fps is (10 sec: 5646.4, 60 sec: 5594.2, 300 sec: 5552.7). Total num frames: 1030090752. Throughput: 0: 5778.3. Samples: 1030099740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:43,888][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 02:58:45,170][26022] Updated weights on worker 0-0, policy_version 1005955 (0.00092) [2022-07-11 02:58:46,903][26022] Updated weights on worker 0-0, policy_version 1005965 (0.00082) [2022-07-11 02:58:48,746][26022] Updated weights on worker 0-0, policy_version 1005975 (0.00081) [2022-07-11 02:58:48,889][25689] Fps is (10 sec: 5454.5, 60 sec: 5545.9, 300 sec: 5552.8). Total num frames: 1030118400. Throughput: 0: 5797.3. Samples: 1030116652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:48,890][25689] Avg episode reward: [(0, '1.123')] [2022-07-11 02:58:50,569][26022] Updated weights on worker 0-0, policy_version 1005985 (0.00098) [2022-07-11 02:58:52,492][26022] Updated weights on worker 0-0, policy_version 1005995 (0.00093) [2022-07-11 02:58:53,923][25689] Fps is (10 sec: 5612.0, 60 sec: 5560.9, 300 sec: 5552.3). Total num frames: 1030147072. Throughput: 0: 5810.3. Samples: 1030150026. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:53,923][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 02:58:54,263][26022] Updated weights on worker 0-0, policy_version 1006005 (0.00085) [2022-07-11 02:58:56,215][26022] Updated weights on worker 0-0, policy_version 1006015 (0.00079) [2022-07-11 02:58:57,870][26022] Updated weights on worker 0-0, policy_version 1006025 (0.00095) [2022-07-11 02:58:58,959][25689] Fps is (10 sec: 5593.0, 60 sec: 5542.5, 300 sec: 5552.4). Total num frames: 1030174720. Throughput: 0: 5813.6. Samples: 1030183924. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:58:58,959][25689] Avg episode reward: [(0, '0.122')] [2022-07-11 02:58:59,862][26022] Updated weights on worker 0-0, policy_version 1006035 (0.00089) [2022-07-11 02:59:01,398][26022] Updated weights on worker 0-0, policy_version 1006045 (0.00080) [2022-07-11 02:59:03,892][26022] Updated weights on worker 0-0, policy_version 1006055 (0.00089) [2022-07-11 02:59:04,075][25689] Fps is (10 sec: 5245.0, 60 sec: 5523.6, 300 sec: 5554.0). Total num frames: 1030200320. Throughput: 0: 5003.7. Samples: 1030200754. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:04,076][25689] Avg episode reward: [(0, '0.482')] [2022-07-11 02:59:05,465][26022] Updated weights on worker 0-0, policy_version 1006065 (0.00090) [2022-07-11 02:59:07,495][26022] Updated weights on worker 0-0, policy_version 1006075 (0.00084) [2022-07-11 02:59:09,141][25689] Fps is (10 sec: 5430.4, 60 sec: 5552.3, 300 sec: 5556.5). Total num frames: 1030230016. Throughput: 0: 5721.9. Samples: 1030232534. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:09,142][25689] Avg episode reward: [(0, '0.638')] [2022-07-11 02:59:09,352][26022] Updated weights on worker 0-0, policy_version 1006085 (0.00095) [2022-07-11 02:59:11,171][26022] Updated weights on worker 0-0, policy_version 1006095 (0.00083) [2022-07-11 02:59:12,992][26022] Updated weights on worker 0-0, policy_version 1006105 (0.00087) [2022-07-11 02:59:14,177][25689] Fps is (10 sec: 5676.2, 60 sec: 5554.3, 300 sec: 5549.3). Total num frames: 1030257664. Throughput: 0: 5728.2. Samples: 1030266048. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:14,178][25689] Avg episode reward: [(0, '0.416')] [2022-07-11 02:59:14,724][26022] Updated weights on worker 0-0, policy_version 1006115 (0.00098) [2022-07-11 02:59:16,577][26022] Updated weights on worker 0-0, policy_version 1006125 (0.00092) [2022-07-11 02:59:18,486][26022] Updated weights on worker 0-0, policy_version 1006135 (0.00086) [2022-07-11 02:59:19,199][25689] Fps is (10 sec: 5599.9, 60 sec: 5555.3, 300 sec: 5560.2). Total num frames: 1030286336. Throughput: 0: 4893.0. Samples: 1030282958. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:19,203][25689] Avg episode reward: [(0, '0.856')] [2022-07-11 02:59:20,354][26022] Updated weights on worker 0-0, policy_version 1006145 (0.00092) [2022-07-11 02:59:22,132][26022] Updated weights on worker 0-0, policy_version 1006155 (0.00084) [2022-07-11 02:59:23,555][26022] Updated weights on worker 0-0, policy_version 1006165 (0.00087) [2022-07-11 02:59:24,294][25689] Fps is (10 sec: 5567.1, 60 sec: 5539.0, 300 sec: 5551.8). Total num frames: 1030313984. Throughput: 0: 5731.1. Samples: 1030316632. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:24,295][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 02:59:25,875][26022] Updated weights on worker 0-0, policy_version 1006175 (0.00086) [2022-07-11 02:59:27,498][26022] Updated weights on worker 0-0, policy_version 1006185 (0.00086) [2022-07-11 02:59:28,493][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 02:59:28,508][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001006189_1030337536.pth [2022-07-11 02:59:28,509][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001004236_1028337664.pth [2022-07-11 02:59:29,274][26022] Updated weights on worker 0-0, policy_version 1006195 (0.00616) [2022-07-11 02:59:29,333][25689] Fps is (10 sec: 5658.3, 60 sec: 5554.7, 300 sec: 5558.0). Total num frames: 1030343680. Throughput: 0: 5834.2. Samples: 1030350336. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:29,334][25689] Avg episode reward: [(0, '0.674')] [2022-07-11 02:59:31,281][26022] Updated weights on worker 0-0, policy_version 1006205 (0.00085) [2022-07-11 02:59:32,907][26022] Updated weights on worker 0-0, policy_version 1006215 (0.00091) [2022-07-11 02:59:34,340][25689] Fps is (10 sec: 5707.9, 60 sec: 5575.4, 300 sec: 5555.1). Total num frames: 1030371328. Throughput: 0: 5018.4. Samples: 1030367234. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:34,341][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 02:59:34,969][26022] Updated weights on worker 0-0, policy_version 1006225 (0.00084) [2022-07-11 02:59:36,756][26022] Updated weights on worker 0-0, policy_version 1006235 (0.00090) [2022-07-11 02:59:38,471][26022] Updated weights on worker 0-0, policy_version 1006245 (0.00085) [2022-07-11 02:59:39,357][25689] Fps is (10 sec: 5516.3, 60 sec: 5541.9, 300 sec: 5555.5). Total num frames: 1030398976. Throughput: 0: 5834.1. Samples: 1030400566. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:39,359][25689] Avg episode reward: [(0, '0.744')] [2022-07-11 02:59:40,554][26022] Updated weights on worker 0-0, policy_version 1006255 (0.00087) [2022-07-11 02:59:42,307][26022] Updated weights on worker 0-0, policy_version 1006265 (0.00087) [2022-07-11 02:59:44,115][26022] Updated weights on worker 0-0, policy_version 1006275 (0.00083) [2022-07-11 02:59:44,430][25689] Fps is (10 sec: 5582.2, 60 sec: 5564.6, 300 sec: 5557.8). Total num frames: 1030427648. Throughput: 0: 5838.3. Samples: 1030434190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:44,432][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 02:59:45,833][26022] Updated weights on worker 0-0, policy_version 1006285 (0.00088) [2022-07-11 02:59:47,739][26022] Updated weights on worker 0-0, policy_version 1006295 (0.00106) [2022-07-11 02:59:49,412][26022] Updated weights on worker 0-0, policy_version 1006305 (0.00085) [2022-07-11 02:59:49,500][25689] Fps is (10 sec: 5653.8, 60 sec: 5575.2, 300 sec: 5556.6). Total num frames: 1030456320. Throughput: 0: 4994.2. Samples: 1030451052. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:49,500][25689] Avg episode reward: [(0, '0.817')] [2022-07-11 02:59:51,435][26022] Updated weights on worker 0-0, policy_version 1006315 (0.00090) [2022-07-11 02:59:53,183][26022] Updated weights on worker 0-0, policy_version 1006325 (0.00084) [2022-07-11 02:59:54,540][25689] Fps is (10 sec: 5469.2, 60 sec: 5540.9, 300 sec: 5553.9). Total num frames: 1030482944. Throughput: 0: 5817.5. Samples: 1030484746. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:54,541][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 02:59:54,979][26022] Updated weights on worker 0-0, policy_version 1006335 (0.00084) [2022-07-11 02:59:56,905][26022] Updated weights on worker 0-0, policy_version 1006345 (0.00057) [2022-07-11 02:59:58,667][26022] Updated weights on worker 0-0, policy_version 1006355 (0.00093) [2022-07-11 02:59:59,549][25689] Fps is (10 sec: 5706.6, 60 sec: 5594.1, 300 sec: 5559.5). Total num frames: 1030513664. Throughput: 0: 5847.1. Samples: 1030518626. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 02:59:59,549][25689] Avg episode reward: [(0, '0.895')] [2022-07-11 03:00:00,801][26022] Updated weights on worker 0-0, policy_version 1006365 (0.01497) [2022-07-11 03:00:02,571][26022] Updated weights on worker 0-0, policy_version 1006375 (0.00092) [2022-07-11 03:00:04,608][25689] Fps is (10 sec: 5390.9, 60 sec: 5565.5, 300 sec: 5552.9). Total num frames: 1030537216. Throughput: 0: 4969.9. Samples: 1030534472. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 03:00:04,608][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 03:00:04,783][26022] Updated weights on worker 0-0, policy_version 1006385 (0.00087) [2022-07-11 03:00:06,511][26022] Updated weights on worker 0-0, policy_version 1006395 (0.00556) [2022-07-11 03:00:08,336][26022] Updated weights on worker 0-0, policy_version 1006405 (0.00087) [2022-07-11 03:00:09,616][25689] Fps is (10 sec: 5085.8, 60 sec: 5537.0, 300 sec: 5553.7). Total num frames: 1030564864. Throughput: 0: 5750.6. Samples: 1030566732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 03:00:09,616][25689] Avg episode reward: [(0, '1.175')] [2022-07-11 03:00:10,048][26022] Updated weights on worker 0-0, policy_version 1006415 (0.00085) [2022-07-11 03:00:11,824][26022] Updated weights on worker 0-0, policy_version 1006425 (0.00092) [2022-07-11 03:00:13,771][26022] Updated weights on worker 0-0, policy_version 1006435 (0.00091) [2022-07-11 03:00:14,704][25689] Fps is (10 sec: 5578.4, 60 sec: 5549.2, 300 sec: 5549.2). Total num frames: 1030593536. Throughput: 0: 5708.5. Samples: 1030599850. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 03:00:14,704][25689] Avg episode reward: [(0, '1.197')] [2022-07-11 03:00:15,791][26022] Updated weights on worker 0-0, policy_version 1006445 (0.00095) [2022-07-11 03:00:17,447][26022] Updated weights on worker 0-0, policy_version 1006455 (0.00087) [2022-07-11 03:00:19,453][26022] Updated weights on worker 0-0, policy_version 1006465 (0.00091) [2022-07-11 03:00:19,760][25689] Fps is (10 sec: 5552.0, 60 sec: 5529.1, 300 sec: 5550.7). Total num frames: 1030621184. Throughput: 0: 4856.1. Samples: 1030616774. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 03:00:19,761][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 03:00:20,984][26022] Updated weights on worker 0-0, policy_version 1006475 (0.00091) [2022-07-11 03:00:23,326][26022] Updated weights on worker 0-0, policy_version 1006485 (0.00096) [2022-07-11 03:00:24,642][26022] Updated weights on worker 0-0, policy_version 1006495 (0.00086) [2022-07-11 03:00:24,889][25689] Fps is (10 sec: 5730.5, 60 sec: 5576.7, 300 sec: 5556.0). Total num frames: 1030651904. Throughput: 0: 5711.3. Samples: 1030650306. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 03:00:24,891][25689] Avg episode reward: [(0, '1.211')] [2022-07-11 03:00:26,791][26022] Updated weights on worker 0-0, policy_version 1006505 (0.00085) [2022-07-11 03:00:28,235][26022] Updated weights on worker 0-0, policy_version 1006515 (0.00088) [2022-07-11 03:00:29,938][25689] Fps is (10 sec: 5734.5, 60 sec: 5542.0, 300 sec: 5556.1). Total num frames: 1030679552. Throughput: 0: 5778.4. Samples: 1030684164. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:00:29,940][25689] Avg episode reward: [(0, '0.984')] [2022-07-11 03:00:30,500][26022] Updated weights on worker 0-0, policy_version 1006525 (0.00093) [2022-07-11 03:00:31,904][26022] Updated weights on worker 0-0, policy_version 1006535 (0.00092) [2022-07-11 03:00:34,145][26022] Updated weights on worker 0-0, policy_version 1006545 (0.00086) [2022-07-11 03:00:34,975][25689] Fps is (10 sec: 5482.2, 60 sec: 5539.2, 300 sec: 5557.2). Total num frames: 1030707200. Throughput: 0: 4998.4. Samples: 1030701176. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:00:34,977][25689] Avg episode reward: [(0, '1.043')] [2022-07-11 03:00:35,563][26022] Updated weights on worker 0-0, policy_version 1006555 (0.00090) [2022-07-11 03:00:37,938][26022] Updated weights on worker 0-0, policy_version 1006565 (0.00083) [2022-07-11 03:00:39,119][26022] Updated weights on worker 0-0, policy_version 1006575 (0.00090) [2022-07-11 03:00:40,019][25689] Fps is (10 sec: 5587.1, 60 sec: 5553.7, 300 sec: 5557.5). Total num frames: 1030735872. Throughput: 0: 5835.1. Samples: 1030734986. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:00:40,019][25689] Avg episode reward: [(0, '1.127')] [2022-07-11 03:00:41,463][26022] Updated weights on worker 0-0, policy_version 1006585 (0.00093) [2022-07-11 03:00:42,880][26022] Updated weights on worker 0-0, policy_version 1006595 (0.00088) [2022-07-11 03:00:45,043][26022] Updated weights on worker 0-0, policy_version 1006605 (0.00089) [2022-07-11 03:00:45,134][25689] Fps is (10 sec: 5544.2, 60 sec: 5532.9, 300 sec: 5555.4). Total num frames: 1030763520. Throughput: 0: 5841.4. Samples: 1030768564. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:00:45,134][25689] Avg episode reward: [(0, '0.094')] [2022-07-11 03:00:46,614][26022] Updated weights on worker 0-0, policy_version 1006615 (0.00082) [2022-07-11 03:00:48,838][26022] Updated weights on worker 0-0, policy_version 1006625 (0.00086) [2022-07-11 03:00:50,165][25689] Fps is (10 sec: 5752.7, 60 sec: 5570.3, 300 sec: 5562.5). Total num frames: 1030794240. Throughput: 0: 5843.1. Samples: 1030802348. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:00:50,165][25689] Avg episode reward: [(0, '-0.242')] [2022-07-11 03:00:50,166][26022] Updated weights on worker 0-0, policy_version 1006635 (0.00099) [2022-07-11 03:00:52,323][26022] Updated weights on worker 0-0, policy_version 1006645 (0.00086) [2022-07-11 03:00:53,711][26022] Updated weights on worker 0-0, policy_version 1006655 (0.00097) [2022-07-11 03:00:55,208][25689] Fps is (10 sec: 5692.0, 60 sec: 5570.0, 300 sec: 5558.4). Total num frames: 1030820864. Throughput: 0: 5830.3. Samples: 1030819140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:00:55,209][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 03:00:55,859][26022] Updated weights on worker 0-0, policy_version 1006665 (0.00085) [2022-07-11 03:00:57,596][26022] Updated weights on worker 0-0, policy_version 1006675 (0.00081) [2022-07-11 03:00:59,478][26022] Updated weights on worker 0-0, policy_version 1006685 (0.00090) [2022-07-11 03:01:00,276][25689] Fps is (10 sec: 5367.6, 60 sec: 5514.0, 300 sec: 5562.2). Total num frames: 1030848512. Throughput: 0: 5825.3. Samples: 1030852990. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:00,278][25689] Avg episode reward: [(0, '-0.569')] [2022-07-11 03:01:01,349][26022] Updated weights on worker 0-0, policy_version 1006695 (0.00086) [2022-07-11 03:01:03,366][26022] Updated weights on worker 0-0, policy_version 1006705 (0.00088) [2022-07-11 03:01:05,362][25689] Fps is (10 sec: 5446.0, 60 sec: 5578.9, 300 sec: 5557.5). Total num frames: 1030876160. Throughput: 0: 5720.4. Samples: 1030884276. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:05,362][25689] Avg episode reward: [(0, '-0.444')] [2022-07-11 03:01:05,368][26022] Updated weights on worker 0-0, policy_version 1006715 (0.00087) [2022-07-11 03:01:07,290][26022] Updated weights on worker 0-0, policy_version 1006725 (0.00095) [2022-07-11 03:01:09,042][26022] Updated weights on worker 0-0, policy_version 1006735 (0.00089) [2022-07-11 03:01:10,436][25689] Fps is (10 sec: 5442.2, 60 sec: 5572.8, 300 sec: 5556.4). Total num frames: 1030903808. Throughput: 0: 4859.4. Samples: 1030900856. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:10,437][25689] Avg episode reward: [(0, '-0.262')] [2022-07-11 03:01:11,075][26022] Updated weights on worker 0-0, policy_version 1006745 (0.00086) [2022-07-11 03:01:12,744][26022] Updated weights on worker 0-0, policy_version 1006755 (0.00088) [2022-07-11 03:01:14,537][26022] Updated weights on worker 0-0, policy_version 1006765 (0.00087) [2022-07-11 03:01:15,513][25689] Fps is (10 sec: 5346.4, 60 sec: 5540.2, 300 sec: 5545.2). Total num frames: 1030930432. Throughput: 0: 5681.9. Samples: 1030934510. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:15,514][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 03:01:16,383][26022] Updated weights on worker 0-0, policy_version 1006775 (0.00087) [2022-07-11 03:01:18,192][26022] Updated weights on worker 0-0, policy_version 1006785 (0.00088) [2022-07-11 03:01:20,035][26022] Updated weights on worker 0-0, policy_version 1006795 (0.00082) [2022-07-11 03:01:20,533][25689] Fps is (10 sec: 5780.9, 60 sec: 5610.9, 300 sec: 5567.5). Total num frames: 1030962176. Throughput: 0: 5693.6. Samples: 1030968328. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:20,534][25689] Avg episode reward: [(0, '0.482')] [2022-07-11 03:01:22,015][26022] Updated weights on worker 0-0, policy_version 1006805 (0.00084) [2022-07-11 03:01:23,505][26022] Updated weights on worker 0-0, policy_version 1006815 (0.00092) [2022-07-11 03:01:25,472][26022] Updated weights on worker 0-0, policy_version 1006825 (0.00087) [2022-07-11 03:01:25,624][25689] Fps is (10 sec: 5772.8, 60 sec: 5547.0, 300 sec: 5553.5). Total num frames: 1030988800. Throughput: 0: 4976.6. Samples: 1030985112. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:25,624][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 03:01:27,339][26022] Updated weights on worker 0-0, policy_version 1006835 (0.00089) [2022-07-11 03:01:28,655][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:01:28,669][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001006842_1031006208.pth [2022-07-11 03:01:28,669][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001004888_1029005312.pth [2022-07-11 03:01:29,351][26022] Updated weights on worker 0-0, policy_version 1006845 (0.00093) [2022-07-11 03:01:30,637][25689] Fps is (10 sec: 5472.7, 60 sec: 5567.2, 300 sec: 5558.4). Total num frames: 1031017472. Throughput: 0: 5829.5. Samples: 1031018616. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:30,638][25689] Avg episode reward: [(0, '1.022')] [2022-07-11 03:01:31,075][26022] Updated weights on worker 0-0, policy_version 1006855 (0.00083) [2022-07-11 03:01:33,035][26022] Updated weights on worker 0-0, policy_version 1006865 (0.00096) [2022-07-11 03:01:34,524][26022] Updated weights on worker 0-0, policy_version 1006875 (0.00091) [2022-07-11 03:01:35,640][25689] Fps is (10 sec: 5622.6, 60 sec: 5570.3, 300 sec: 5555.9). Total num frames: 1031045120. Throughput: 0: 5851.0. Samples: 1031052276. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:35,641][25689] Avg episode reward: [(0, '0.939')] [2022-07-11 03:01:36,800][26022] Updated weights on worker 0-0, policy_version 1006885 (0.00087) [2022-07-11 03:01:38,124][26022] Updated weights on worker 0-0, policy_version 1006895 (0.00085) [2022-07-11 03:01:40,307][26022] Updated weights on worker 0-0, policy_version 1006905 (0.00085) [2022-07-11 03:01:40,655][25689] Fps is (10 sec: 5520.0, 60 sec: 5556.0, 300 sec: 5556.6). Total num frames: 1031072768. Throughput: 0: 5015.7. Samples: 1031069250. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:40,655][25689] Avg episode reward: [(0, '0.970')] [2022-07-11 03:01:41,827][26022] Updated weights on worker 0-0, policy_version 1006915 (0.00090) [2022-07-11 03:01:43,867][26022] Updated weights on worker 0-0, policy_version 1006925 (0.00093) [2022-07-11 03:01:45,684][26022] Updated weights on worker 0-0, policy_version 1006935 (0.00089) [2022-07-11 03:01:45,742][25689] Fps is (10 sec: 5575.2, 60 sec: 5575.5, 300 sec: 5555.8). Total num frames: 1031101440. Throughput: 0: 5855.3. Samples: 1031102912. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:45,743][25689] Avg episode reward: [(0, '1.155')] [2022-07-11 03:01:47,492][26022] Updated weights on worker 0-0, policy_version 1006945 (0.00086) [2022-07-11 03:01:49,355][26022] Updated weights on worker 0-0, policy_version 1006955 (0.00082) [2022-07-11 03:01:50,757][25689] Fps is (10 sec: 5574.9, 60 sec: 5526.2, 300 sec: 5555.6). Total num frames: 1031129088. Throughput: 0: 5859.3. Samples: 1031136504. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:50,758][25689] Avg episode reward: [(0, '1.270')] [2022-07-11 03:01:51,240][26022] Updated weights on worker 0-0, policy_version 1006965 (0.00080) [2022-07-11 03:01:52,899][26022] Updated weights on worker 0-0, policy_version 1006975 (0.00086) [2022-07-11 03:01:54,917][26022] Updated weights on worker 0-0, policy_version 1006985 (0.00401) [2022-07-11 03:01:55,762][25689] Fps is (10 sec: 5518.8, 60 sec: 5546.7, 300 sec: 5552.9). Total num frames: 1031156736. Throughput: 0: 5013.4. Samples: 1031153152. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:01:55,764][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 03:01:56,546][26022] Updated weights on worker 0-0, policy_version 1006995 (0.00092) [2022-07-11 03:01:58,581][26022] Updated weights on worker 0-0, policy_version 1007005 (0.00092) [2022-07-11 03:02:00,303][26022] Updated weights on worker 0-0, policy_version 1007015 (0.00080) [2022-07-11 03:02:00,778][25689] Fps is (10 sec: 5722.3, 60 sec: 5585.2, 300 sec: 5567.7). Total num frames: 1031186432. Throughput: 0: 5846.3. Samples: 1031186900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:00,779][25689] Avg episode reward: [(0, '1.741')] [2022-07-11 03:02:02,603][26022] Updated weights on worker 0-0, policy_version 1007025 (0.00089) [2022-07-11 03:02:04,244][26022] Updated weights on worker 0-0, policy_version 1007035 (0.00095) [2022-07-11 03:02:05,837][25689] Fps is (10 sec: 5488.6, 60 sec: 5553.9, 300 sec: 5560.5). Total num frames: 1031212032. Throughput: 0: 5746.4. Samples: 1031218382. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:05,837][25689] Avg episode reward: [(0, '1.748')] [2022-07-11 03:02:06,264][26022] Updated weights on worker 0-0, policy_version 1007045 (0.00089) [2022-07-11 03:02:07,855][26022] Updated weights on worker 0-0, policy_version 1007055 (0.00093) [2022-07-11 03:02:10,159][26022] Updated weights on worker 0-0, policy_version 1007065 (0.00091) [2022-07-11 03:02:10,935][25689] Fps is (10 sec: 5141.9, 60 sec: 5534.8, 300 sec: 5553.0). Total num frames: 1031238656. Throughput: 0: 4884.5. Samples: 1031235062. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:10,936][25689] Avg episode reward: [(0, '1.314')] [2022-07-11 03:02:11,555][26022] Updated weights on worker 0-0, policy_version 1007075 (0.00086) [2022-07-11 03:02:13,807][26022] Updated weights on worker 0-0, policy_version 1007085 (0.00084) [2022-07-11 03:02:15,282][26022] Updated weights on worker 0-0, policy_version 1007095 (0.00086) [2022-07-11 03:02:16,021][25689] Fps is (10 sec: 5630.5, 60 sec: 5601.6, 300 sec: 5562.4). Total num frames: 1031269376. Throughput: 0: 5690.2. Samples: 1031268432. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:16,022][25689] Avg episode reward: [(0, '1.256')] [2022-07-11 03:02:17,445][26022] Updated weights on worker 0-0, policy_version 1007105 (0.00086) [2022-07-11 03:02:18,862][26022] Updated weights on worker 0-0, policy_version 1007115 (0.00087) [2022-07-11 03:02:20,983][26022] Updated weights on worker 0-0, policy_version 1007125 (0.00560) [2022-07-11 03:02:21,033][25689] Fps is (10 sec: 5678.7, 60 sec: 5517.8, 300 sec: 5550.7). Total num frames: 1031296000. Throughput: 0: 5695.3. Samples: 1031302256. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:21,033][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 03:02:22,619][26022] Updated weights on worker 0-0, policy_version 1007135 (0.00094) [2022-07-11 03:02:24,667][26022] Updated weights on worker 0-0, policy_version 1007145 (0.00083) [2022-07-11 03:02:26,092][25689] Fps is (10 sec: 5592.3, 60 sec: 5571.4, 300 sec: 5560.3). Total num frames: 1031325696. Throughput: 0: 4962.5. Samples: 1031318900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:26,095][25689] Avg episode reward: [(0, '1.812')] [2022-07-11 03:02:26,189][26022] Updated weights on worker 0-0, policy_version 1007155 (0.00090) [2022-07-11 03:02:28,421][26022] Updated weights on worker 0-0, policy_version 1007165 (0.00089) [2022-07-11 03:02:29,980][26022] Updated weights on worker 0-0, policy_version 1007175 (0.00088) [2022-07-11 03:02:31,131][25689] Fps is (10 sec: 5577.6, 60 sec: 5535.3, 300 sec: 5557.2). Total num frames: 1031352320. Throughput: 0: 5807.1. Samples: 1031352340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:31,131][25689] Avg episode reward: [(0, '1.475')] [2022-07-11 03:02:32,133][26022] Updated weights on worker 0-0, policy_version 1007185 (0.00085) [2022-07-11 03:02:33,514][26022] Updated weights on worker 0-0, policy_version 1007195 (0.00086) [2022-07-11 03:02:35,759][26022] Updated weights on worker 0-0, policy_version 1007205 (0.00091) [2022-07-11 03:02:36,135][25689] Fps is (10 sec: 5506.0, 60 sec: 5552.1, 300 sec: 5557.9). Total num frames: 1031380992. Throughput: 0: 5832.9. Samples: 1031385754. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:36,136][25689] Avg episode reward: [(0, '1.588')] [2022-07-11 03:02:37,476][26022] Updated weights on worker 0-0, policy_version 1007215 (0.00096) [2022-07-11 03:02:39,332][26022] Updated weights on worker 0-0, policy_version 1007225 (0.00079) [2022-07-11 03:02:40,987][26022] Updated weights on worker 0-0, policy_version 1007235 (0.00086) [2022-07-11 03:02:41,185][25689] Fps is (10 sec: 5601.7, 60 sec: 5548.8, 300 sec: 5562.0). Total num frames: 1031408640. Throughput: 0: 4982.8. Samples: 1031402668. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:41,185][25689] Avg episode reward: [(0, '1.902')] [2022-07-11 03:02:43,031][26022] Updated weights on worker 0-0, policy_version 1007245 (0.00088) [2022-07-11 03:02:44,682][26022] Updated weights on worker 0-0, policy_version 1007255 (0.00072) [2022-07-11 03:02:46,275][25689] Fps is (10 sec: 5453.1, 60 sec: 5531.7, 300 sec: 5550.5). Total num frames: 1031436288. Throughput: 0: 5821.4. Samples: 1031436394. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:46,276][25689] Avg episode reward: [(0, '1.745')] [2022-07-11 03:02:46,588][26022] Updated weights on worker 0-0, policy_version 1007265 (0.00092) [2022-07-11 03:02:48,216][26022] Updated weights on worker 0-0, policy_version 1007275 (0.00085) [2022-07-11 03:02:50,350][26022] Updated weights on worker 0-0, policy_version 1007285 (0.00091) [2022-07-11 03:02:51,294][25689] Fps is (10 sec: 5773.8, 60 sec: 5582.1, 300 sec: 5560.7). Total num frames: 1031467008. Throughput: 0: 5843.4. Samples: 1031470162. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:51,294][25689] Avg episode reward: [(0, '1.743')] [2022-07-11 03:02:52,115][26022] Updated weights on worker 0-0, policy_version 1007295 (0.00075) [2022-07-11 03:02:53,683][26022] Updated weights on worker 0-0, policy_version 1007305 (0.00080) [2022-07-11 03:02:55,628][26022] Updated weights on worker 0-0, policy_version 1007315 (0.00092) [2022-07-11 03:02:56,332][25689] Fps is (10 sec: 5702.0, 60 sec: 5562.1, 300 sec: 5553.5). Total num frames: 1031493632. Throughput: 0: 5019.5. Samples: 1031487134. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:02:56,333][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 03:02:57,607][26022] Updated weights on worker 0-0, policy_version 1007325 (0.00084) [2022-07-11 03:02:59,322][26022] Updated weights on worker 0-0, policy_version 1007335 (0.00055) [2022-07-11 03:03:01,218][26022] Updated weights on worker 0-0, policy_version 1007345 (0.00064) [2022-07-11 03:03:01,356][25689] Fps is (10 sec: 5393.7, 60 sec: 5527.6, 300 sec: 5558.2). Total num frames: 1031521280. Throughput: 0: 5863.4. Samples: 1031520940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:01,357][25689] Avg episode reward: [(0, '1.441')] [2022-07-11 03:03:03,323][26022] Updated weights on worker 0-0, policy_version 1007355 (0.00889) [2022-07-11 03:03:05,106][26022] Updated weights on worker 0-0, policy_version 1007365 (0.00050) [2022-07-11 03:03:06,441][25689] Fps is (10 sec: 5469.9, 60 sec: 5558.9, 300 sec: 5556.8). Total num frames: 1031548928. Throughput: 0: 5761.5. Samples: 1031552580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:06,442][25689] Avg episode reward: [(0, '0.936')] [2022-07-11 03:03:06,983][26022] Updated weights on worker 0-0, policy_version 1007375 (0.00088) [2022-07-11 03:03:08,937][26022] Updated weights on worker 0-0, policy_version 1007385 (0.00089) [2022-07-11 03:03:10,712][26022] Updated weights on worker 0-0, policy_version 1007395 (0.00091) [2022-07-11 03:03:11,460][25689] Fps is (10 sec: 5472.9, 60 sec: 5583.2, 300 sec: 5557.6). Total num frames: 1031576576. Throughput: 0: 5740.5. Samples: 1031585924. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:11,460][25689] Avg episode reward: [(0, '0.854')] [2022-07-11 03:03:12,548][26022] Updated weights on worker 0-0, policy_version 1007405 (0.00090) [2022-07-11 03:03:14,360][26022] Updated weights on worker 0-0, policy_version 1007415 (0.00085) [2022-07-11 03:03:16,157][26022] Updated weights on worker 0-0, policy_version 1007425 (0.00085) [2022-07-11 03:03:16,479][25689] Fps is (10 sec: 5509.0, 60 sec: 5538.6, 300 sec: 5554.4). Total num frames: 1031604224. Throughput: 0: 5746.2. Samples: 1031602900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:16,479][25689] Avg episode reward: [(0, '1.028')] [2022-07-11 03:03:18,060][26022] Updated weights on worker 0-0, policy_version 1007435 (0.00084) [2022-07-11 03:03:19,856][26022] Updated weights on worker 0-0, policy_version 1007445 (0.00094) [2022-07-11 03:03:21,441][26022] Updated weights on worker 0-0, policy_version 1007455 (0.00087) [2022-07-11 03:03:21,480][25689] Fps is (10 sec: 5722.9, 60 sec: 5590.4, 300 sec: 5559.7). Total num frames: 1031633920. Throughput: 0: 5750.2. Samples: 1031636654. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:21,480][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 03:03:23,496][26022] Updated weights on worker 0-0, policy_version 1007465 (0.00099) [2022-07-11 03:03:25,159][26022] Updated weights on worker 0-0, policy_version 1007475 (0.00086) [2022-07-11 03:03:26,586][25689] Fps is (10 sec: 5673.5, 60 sec: 5552.2, 300 sec: 5554.8). Total num frames: 1031661568. Throughput: 0: 5842.4. Samples: 1031670272. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:26,588][25689] Avg episode reward: [(0, '1.415')] [2022-07-11 03:03:27,062][26022] Updated weights on worker 0-0, policy_version 1007485 (0.00088) [2022-07-11 03:03:28,820][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:03:28,845][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001007494_1031673856.pth [2022-07-11 03:03:28,846][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001005538_1029670912.pth [2022-07-11 03:03:28,916][26022] Updated weights on worker 0-0, policy_version 1007495 (0.00090) [2022-07-11 03:03:30,650][26022] Updated weights on worker 0-0, policy_version 1007505 (0.00094) [2022-07-11 03:03:31,631][25689] Fps is (10 sec: 5447.4, 60 sec: 5568.5, 300 sec: 5558.3). Total num frames: 1031689216. Throughput: 0: 5019.4. Samples: 1031687170. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:31,631][25689] Avg episode reward: [(0, '1.550')] [2022-07-11 03:03:32,639][26022] Updated weights on worker 0-0, policy_version 1007515 (0.00089) [2022-07-11 03:03:34,265][26022] Updated weights on worker 0-0, policy_version 1007525 (0.00085) [2022-07-11 03:03:36,186][26022] Updated weights on worker 0-0, policy_version 1007535 (0.00090) [2022-07-11 03:03:36,643][25689] Fps is (10 sec: 5599.9, 60 sec: 5567.8, 300 sec: 5555.0). Total num frames: 1031717888. Throughput: 0: 5864.1. Samples: 1031721148. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:36,645][25689] Avg episode reward: [(0, '1.818')] [2022-07-11 03:03:37,801][26022] Updated weights on worker 0-0, policy_version 1007545 (0.01075) [2022-07-11 03:03:39,841][26022] Updated weights on worker 0-0, policy_version 1007555 (0.00083) [2022-07-11 03:03:41,511][26022] Updated weights on worker 0-0, policy_version 1007565 (0.00095) [2022-07-11 03:03:41,659][25689] Fps is (10 sec: 5717.9, 60 sec: 5587.8, 300 sec: 5560.7). Total num frames: 1031746560. Throughput: 0: 5843.4. Samples: 1031754572. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:41,660][25689] Avg episode reward: [(0, '1.970')] [2022-07-11 03:03:43,684][26022] Updated weights on worker 0-0, policy_version 1007575 (0.00085) [2022-07-11 03:03:45,291][26022] Updated weights on worker 0-0, policy_version 1007585 (0.00081) [2022-07-11 03:03:46,751][25689] Fps is (10 sec: 5470.8, 60 sec: 5570.8, 300 sec: 5555.5). Total num frames: 1031773184. Throughput: 0: 5003.8. Samples: 1031771174. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:46,751][25689] Avg episode reward: [(0, '2.214')] [2022-07-11 03:03:47,388][26022] Updated weights on worker 0-0, policy_version 1007595 (0.00097) [2022-07-11 03:03:48,907][26022] Updated weights on worker 0-0, policy_version 1007605 (0.00092) [2022-07-11 03:03:50,892][26022] Updated weights on worker 0-0, policy_version 1007615 (0.00087) [2022-07-11 03:03:51,843][25689] Fps is (10 sec: 5429.8, 60 sec: 5530.2, 300 sec: 5554.4). Total num frames: 1031801856. Throughput: 0: 5827.4. Samples: 1031804958. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:51,844][25689] Avg episode reward: [(0, '2.263')] [2022-07-11 03:03:52,714][26022] Updated weights on worker 0-0, policy_version 1007625 (0.00089) [2022-07-11 03:03:54,433][26022] Updated weights on worker 0-0, policy_version 1007635 (0.00084) [2022-07-11 03:03:56,273][26022] Updated weights on worker 0-0, policy_version 1007645 (0.00086) [2022-07-11 03:03:56,874][25689] Fps is (10 sec: 5664.6, 60 sec: 5564.7, 300 sec: 5558.0). Total num frames: 1031830528. Throughput: 0: 5819.3. Samples: 1031838876. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:03:56,875][25689] Avg episode reward: [(0, '1.408')] [2022-07-11 03:03:58,029][26022] Updated weights on worker 0-0, policy_version 1007655 (0.00090) [2022-07-11 03:04:00,011][26022] Updated weights on worker 0-0, policy_version 1007665 (0.00080) [2022-07-11 03:04:01,758][26022] Updated weights on worker 0-0, policy_version 1007675 (0.00090) [2022-07-11 03:04:01,983][25689] Fps is (10 sec: 5655.6, 60 sec: 5573.8, 300 sec: 5568.4). Total num frames: 1031859200. Throughput: 0: 4990.7. Samples: 1031855998. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:04:01,983][25689] Avg episode reward: [(0, '1.274')] [2022-07-11 03:04:03,846][26022] Updated weights on worker 0-0, policy_version 1007685 (0.00079) [2022-07-11 03:04:05,560][26022] Updated weights on worker 0-0, policy_version 1007695 (0.00085) [2022-07-11 03:04:07,152][25689] Fps is (10 sec: 5479.9, 60 sec: 5566.1, 300 sec: 5559.6). Total num frames: 1031886848. Throughput: 0: 5716.4. Samples: 1031887794. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:04:07,152][25689] Avg episode reward: [(0, '1.251')] [2022-07-11 03:04:07,586][26022] Updated weights on worker 0-0, policy_version 1007705 (0.00087) [2022-07-11 03:04:09,268][26022] Updated weights on worker 0-0, policy_version 1007715 (0.00089) [2022-07-11 03:04:11,313][26022] Updated weights on worker 0-0, policy_version 1007725 (0.00058) [2022-07-11 03:04:12,209][25689] Fps is (10 sec: 5507.1, 60 sec: 5579.3, 300 sec: 5562.7). Total num frames: 1031915520. Throughput: 0: 5720.3. Samples: 1031921458. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:04:12,210][25689] Avg episode reward: [(0, '0.732')] [2022-07-11 03:04:12,944][26022] Updated weights on worker 0-0, policy_version 1007735 (0.00085) [2022-07-11 03:04:14,882][26022] Updated weights on worker 0-0, policy_version 1007745 (0.00084) [2022-07-11 03:04:16,662][26022] Updated weights on worker 0-0, policy_version 1007755 (0.00088) [2022-07-11 03:04:17,306][25689] Fps is (10 sec: 5647.0, 60 sec: 5589.0, 300 sec: 5561.3). Total num frames: 1031944192. Throughput: 0: 4877.5. Samples: 1031938534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:04:17,307][25689] Avg episode reward: [(0, '0.312')] [2022-07-11 03:04:18,436][26022] Updated weights on worker 0-0, policy_version 1007765 (0.00088) [2022-07-11 03:04:20,216][26022] Updated weights on worker 0-0, policy_version 1007775 (0.00086) [2022-07-11 03:04:22,047][26022] Updated weights on worker 0-0, policy_version 1007785 (0.00087) [2022-07-11 03:04:22,363][25689] Fps is (10 sec: 5647.8, 60 sec: 5567.1, 300 sec: 5565.4). Total num frames: 1031972864. Throughput: 0: 5729.7. Samples: 1031972770. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2022-07-11 03:04:22,363][25689] Avg episode reward: [(0, '0.295')] [2022-07-11 03:04:24,023][26022] Updated weights on worker 0-0, policy_version 1007795 (0.00086) [2022-07-11 03:04:25,694][26022] Updated weights on worker 0-0, policy_version 1007805 (0.00089) [2022-07-11 03:04:27,488][25689] Fps is (10 sec: 5632.2, 60 sec: 5582.2, 300 sec: 5560.4). Total num frames: 1032001536. Throughput: 0: 5837.4. Samples: 1032006504. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:27,488][25689] Avg episode reward: [(0, '0.738')] [2022-07-11 03:04:27,794][26022] Updated weights on worker 0-0, policy_version 1007815 (0.00070) [2022-07-11 03:04:29,254][26022] Updated weights on worker 0-0, policy_version 1007825 (0.00089) [2022-07-11 03:04:31,295][26022] Updated weights on worker 0-0, policy_version 1007835 (0.00079) [2022-07-11 03:04:32,498][25689] Fps is (10 sec: 5759.0, 60 sec: 5619.1, 300 sec: 5567.2). Total num frames: 1032031232. Throughput: 0: 5022.6. Samples: 1032023358. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:32,498][25689] Avg episode reward: [(0, '0.556')] [2022-07-11 03:04:32,811][26022] Updated weights on worker 0-0, policy_version 1007845 (0.00085) [2022-07-11 03:04:34,967][26022] Updated weights on worker 0-0, policy_version 1007855 (0.00085) [2022-07-11 03:04:36,537][26022] Updated weights on worker 0-0, policy_version 1007865 (0.00057) [2022-07-11 03:04:37,541][25689] Fps is (10 sec: 5500.2, 60 sec: 5565.8, 300 sec: 5559.8). Total num frames: 1032056832. Throughput: 0: 5872.2. Samples: 1032057358. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:37,542][25689] Avg episode reward: [(0, '1.127')] [2022-07-11 03:04:38,659][26022] Updated weights on worker 0-0, policy_version 1007875 (0.00084) [2022-07-11 03:04:40,383][26022] Updated weights on worker 0-0, policy_version 1007885 (0.00088) [2022-07-11 03:04:42,086][26022] Updated weights on worker 0-0, policy_version 1007895 (0.00085) [2022-07-11 03:04:42,573][25689] Fps is (10 sec: 5590.2, 60 sec: 5598.0, 300 sec: 5567.5). Total num frames: 1032087552. Throughput: 0: 5855.3. Samples: 1032091104. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:42,573][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 03:04:44,190][26022] Updated weights on worker 0-0, policy_version 1007905 (0.00087) [2022-07-11 03:04:45,606][26022] Updated weights on worker 0-0, policy_version 1007915 (0.00092) [2022-07-11 03:04:47,620][26022] Updated weights on worker 0-0, policy_version 1007925 (0.00109) [2022-07-11 03:04:47,653][25689] Fps is (10 sec: 5772.6, 60 sec: 5615.9, 300 sec: 5563.9). Total num frames: 1032115200. Throughput: 0: 5864.5. Samples: 1032124760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:47,653][25689] Avg episode reward: [(0, '1.348')] [2022-07-11 03:04:49,393][26022] Updated weights on worker 0-0, policy_version 1007935 (0.00083) [2022-07-11 03:04:51,152][26022] Updated weights on worker 0-0, policy_version 1007945 (0.00085) [2022-07-11 03:04:52,687][25689] Fps is (10 sec: 5467.1, 60 sec: 5604.4, 300 sec: 5567.4). Total num frames: 1032142848. Throughput: 0: 5861.3. Samples: 1032141692. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:52,687][25689] Avg episode reward: [(0, '1.631')] [2022-07-11 03:04:53,274][26022] Updated weights on worker 0-0, policy_version 1007955 (0.00054) [2022-07-11 03:04:54,801][26022] Updated weights on worker 0-0, policy_version 1007965 (0.00092) [2022-07-11 03:04:56,781][26022] Updated weights on worker 0-0, policy_version 1007975 (0.00087) [2022-07-11 03:04:57,699][25689] Fps is (10 sec: 5707.9, 60 sec: 5623.0, 300 sec: 5563.9). Total num frames: 1032172544. Throughput: 0: 5845.6. Samples: 1032175192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:04:57,700][25689] Avg episode reward: [(0, '1.363')] [2022-07-11 03:04:58,712][26022] Updated weights on worker 0-0, policy_version 1007985 (0.00087) [2022-07-11 03:05:00,279][26022] Updated weights on worker 0-0, policy_version 1007995 (0.00087) [2022-07-11 03:05:02,776][25689] Fps is (10 sec: 5379.3, 60 sec: 5558.5, 300 sec: 5567.0). Total num frames: 1032197120. Throughput: 0: 5729.4. Samples: 1032206858. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:02,776][25689] Avg episode reward: [(0, '1.596')] [2022-07-11 03:05:02,778][26022] Updated weights on worker 0-0, policy_version 1008005 (0.00087) [2022-07-11 03:05:04,400][26022] Updated weights on worker 0-0, policy_version 1008015 (0.00099) [2022-07-11 03:05:06,286][26022] Updated weights on worker 0-0, policy_version 1008025 (0.00083) [2022-07-11 03:05:07,859][25689] Fps is (10 sec: 5341.7, 60 sec: 5600.1, 300 sec: 5572.5). Total num frames: 1032226816. Throughput: 0: 4890.7. Samples: 1032223584. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:07,860][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 03:05:08,000][26022] Updated weights on worker 0-0, policy_version 1008035 (0.00086) [2022-07-11 03:05:09,781][26022] Updated weights on worker 0-0, policy_version 1008045 (0.00086) [2022-07-11 03:05:11,725][26022] Updated weights on worker 0-0, policy_version 1008055 (0.00084) [2022-07-11 03:05:12,883][25689] Fps is (10 sec: 5572.5, 60 sec: 5569.5, 300 sec: 5566.8). Total num frames: 1032253440. Throughput: 0: 5733.8. Samples: 1032257490. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:12,883][25689] Avg episode reward: [(0, '1.764')] [2022-07-11 03:05:13,536][26022] Updated weights on worker 0-0, policy_version 1008065 (0.00091) [2022-07-11 03:05:15,494][26022] Updated weights on worker 0-0, policy_version 1008075 (0.00084) [2022-07-11 03:05:17,101][26022] Updated weights on worker 0-0, policy_version 1008085 (0.00088) [2022-07-11 03:05:17,893][25689] Fps is (10 sec: 5612.9, 60 sec: 5594.4, 300 sec: 5574.5). Total num frames: 1032283136. Throughput: 0: 5744.3. Samples: 1032291194. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:17,894][25689] Avg episode reward: [(0, '1.957')] [2022-07-11 03:05:19,116][26022] Updated weights on worker 0-0, policy_version 1008095 (0.00090) [2022-07-11 03:05:20,787][26022] Updated weights on worker 0-0, policy_version 1008105 (0.00087) [2022-07-11 03:05:22,731][26022] Updated weights on worker 0-0, policy_version 1008115 (0.00047) [2022-07-11 03:05:22,901][25689] Fps is (10 sec: 5621.9, 60 sec: 5565.0, 300 sec: 5563.1). Total num frames: 1032309760. Throughput: 0: 5017.3. Samples: 1032307830. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:22,901][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 03:05:24,457][26022] Updated weights on worker 0-0, policy_version 1008125 (0.00082) [2022-07-11 03:05:26,402][26022] Updated weights on worker 0-0, policy_version 1008135 (0.00087) [2022-07-11 03:05:27,937][25689] Fps is (10 sec: 5505.3, 60 sec: 5573.2, 300 sec: 5566.7). Total num frames: 1032338432. Throughput: 0: 5870.1. Samples: 1032341446. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:27,939][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 03:05:28,290][26022] Updated weights on worker 0-0, policy_version 1008145 (0.00085) [2022-07-11 03:05:28,916][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:05:28,930][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001008148_1032343552.pth [2022-07-11 03:05:28,930][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001006189_1030337536.pth [2022-07-11 03:05:29,841][26022] Updated weights on worker 0-0, policy_version 1008155 (0.00086) [2022-07-11 03:05:32,035][26022] Updated weights on worker 0-0, policy_version 1008165 (0.00087) [2022-07-11 03:05:32,946][25689] Fps is (10 sec: 5708.6, 60 sec: 5556.4, 300 sec: 5570.7). Total num frames: 1032367104. Throughput: 0: 5864.9. Samples: 1032375158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:32,946][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 03:05:33,616][26022] Updated weights on worker 0-0, policy_version 1008175 (0.00050) [2022-07-11 03:05:35,635][26022] Updated weights on worker 0-0, policy_version 1008185 (0.00086) [2022-07-11 03:05:37,276][26022] Updated weights on worker 0-0, policy_version 1008195 (0.00089) [2022-07-11 03:05:37,974][25689] Fps is (10 sec: 5509.4, 60 sec: 5574.7, 300 sec: 5564.1). Total num frames: 1032393728. Throughput: 0: 5016.0. Samples: 1032391920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:37,975][25689] Avg episode reward: [(0, '0.704')] [2022-07-11 03:05:39,020][26022] Updated weights on worker 0-0, policy_version 1008205 (0.00095) [2022-07-11 03:05:41,110][26022] Updated weights on worker 0-0, policy_version 1008215 (0.00098) [2022-07-11 03:05:42,816][26022] Updated weights on worker 0-0, policy_version 1008225 (0.00088) [2022-07-11 03:05:42,988][25689] Fps is (10 sec: 5710.3, 60 sec: 5576.3, 300 sec: 5576.3). Total num frames: 1032424448. Throughput: 0: 5864.3. Samples: 1032425628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:42,989][25689] Avg episode reward: [(0, '0.728')] [2022-07-11 03:05:44,534][26022] Updated weights on worker 0-0, policy_version 1008235 (0.00083) [2022-07-11 03:05:46,415][26022] Updated weights on worker 0-0, policy_version 1008245 (0.00086) [2022-07-11 03:05:48,034][25689] Fps is (10 sec: 5700.2, 60 sec: 5562.5, 300 sec: 5562.3). Total num frames: 1032451072. Throughput: 0: 5865.6. Samples: 1032459326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:48,036][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 03:05:48,267][26022] Updated weights on worker 0-0, policy_version 1008255 (0.00096) [2022-07-11 03:05:50,054][26022] Updated weights on worker 0-0, policy_version 1008265 (0.00086) [2022-07-11 03:05:52,011][26022] Updated weights on worker 0-0, policy_version 1008275 (0.00085) [2022-07-11 03:05:53,071][25689] Fps is (10 sec: 5484.3, 60 sec: 5579.2, 300 sec: 5569.3). Total num frames: 1032479744. Throughput: 0: 5007.0. Samples: 1032475922. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:53,071][25689] Avg episode reward: [(0, '0.914')] [2022-07-11 03:05:53,970][26022] Updated weights on worker 0-0, policy_version 1008285 (0.00084) [2022-07-11 03:05:55,787][26022] Updated weights on worker 0-0, policy_version 1008295 (0.00091) [2022-07-11 03:05:57,421][26022] Updated weights on worker 0-0, policy_version 1008305 (0.00453) [2022-07-11 03:05:58,075][25689] Fps is (10 sec: 5710.9, 60 sec: 5563.0, 300 sec: 5573.9). Total num frames: 1032508416. Throughput: 0: 5849.2. Samples: 1032509496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:05:58,076][25689] Avg episode reward: [(0, '0.992')] [2022-07-11 03:05:59,440][26022] Updated weights on worker 0-0, policy_version 1008315 (0.00085) [2022-07-11 03:06:01,007][26022] Updated weights on worker 0-0, policy_version 1008325 (0.00097) [2022-07-11 03:06:03,078][25689] Fps is (10 sec: 5218.8, 60 sec: 5552.9, 300 sec: 5561.7). Total num frames: 1032531968. Throughput: 0: 5756.9. Samples: 1032541280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:03,080][25689] Avg episode reward: [(0, '1.008')] [2022-07-11 03:06:03,486][26022] Updated weights on worker 0-0, policy_version 1008335 (0.00088) [2022-07-11 03:06:05,163][26022] Updated weights on worker 0-0, policy_version 1008345 (0.00090) [2022-07-11 03:06:07,158][26022] Updated weights on worker 0-0, policy_version 1008355 (0.00090) [2022-07-11 03:06:08,135][25689] Fps is (10 sec: 5293.5, 60 sec: 5555.3, 300 sec: 5568.9). Total num frames: 1032561664. Throughput: 0: 4899.8. Samples: 1032557814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:08,135][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 03:06:08,940][26022] Updated weights on worker 0-0, policy_version 1008365 (0.00080) [2022-07-11 03:06:10,875][26022] Updated weights on worker 0-0, policy_version 1008375 (0.00095) [2022-07-11 03:06:12,431][26022] Updated weights on worker 0-0, policy_version 1008385 (0.00088) [2022-07-11 03:06:13,139][25689] Fps is (10 sec: 5699.6, 60 sec: 5574.1, 300 sec: 5573.7). Total num frames: 1032589312. Throughput: 0: 5763.8. Samples: 1032591590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:13,139][25689] Avg episode reward: [(0, '0.781')] [2022-07-11 03:06:14,311][26022] Updated weights on worker 0-0, policy_version 1008395 (0.00095) [2022-07-11 03:06:16,147][26022] Updated weights on worker 0-0, policy_version 1008405 (0.00087) [2022-07-11 03:06:17,977][26022] Updated weights on worker 0-0, policy_version 1008415 (0.00083) [2022-07-11 03:06:18,147][25689] Fps is (10 sec: 5522.9, 60 sec: 5540.3, 300 sec: 5560.2). Total num frames: 1032616960. Throughput: 0: 5766.1. Samples: 1032625228. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:18,147][25689] Avg episode reward: [(0, '0.773')] [2022-07-11 03:06:19,936][26022] Updated weights on worker 0-0, policy_version 1008425 (0.00090) [2022-07-11 03:06:21,787][26022] Updated weights on worker 0-0, policy_version 1008435 (0.00082) [2022-07-11 03:06:23,171][25689] Fps is (10 sec: 5511.9, 60 sec: 5555.8, 300 sec: 5564.9). Total num frames: 1032644608. Throughput: 0: 5010.3. Samples: 1032641954. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:23,171][25689] Avg episode reward: [(0, '0.670')] [2022-07-11 03:06:23,551][26022] Updated weights on worker 0-0, policy_version 1008445 (0.00092) [2022-07-11 03:06:25,503][26022] Updated weights on worker 0-0, policy_version 1008455 (0.00082) [2022-07-11 03:06:27,184][26022] Updated weights on worker 0-0, policy_version 1008465 (0.01002) [2022-07-11 03:06:28,217][25689] Fps is (10 sec: 5592.9, 60 sec: 5555.0, 300 sec: 5564.3). Total num frames: 1032673280. Throughput: 0: 5858.8. Samples: 1032675470. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:28,217][25689] Avg episode reward: [(0, '0.835')] [2022-07-11 03:06:29,069][26022] Updated weights on worker 0-0, policy_version 1008475 (0.00085) [2022-07-11 03:06:30,876][26022] Updated weights on worker 0-0, policy_version 1008485 (0.00078) [2022-07-11 03:06:32,742][26022] Updated weights on worker 0-0, policy_version 1008495 (0.00090) [2022-07-11 03:06:33,226][25689] Fps is (10 sec: 5702.7, 60 sec: 5554.8, 300 sec: 5567.6). Total num frames: 1032701952. Throughput: 0: 5860.1. Samples: 1032709306. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:33,227][25689] Avg episode reward: [(0, '0.548')] [2022-07-11 03:06:34,492][26022] Updated weights on worker 0-0, policy_version 1008505 (0.00087) [2022-07-11 03:06:36,407][26022] Updated weights on worker 0-0, policy_version 1008515 (0.00088) [2022-07-11 03:06:38,115][26022] Updated weights on worker 0-0, policy_version 1008525 (0.00084) [2022-07-11 03:06:38,234][25689] Fps is (10 sec: 5622.4, 60 sec: 5573.7, 300 sec: 5567.7). Total num frames: 1032729600. Throughput: 0: 5027.4. Samples: 1032726214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:38,234][25689] Avg episode reward: [(0, '0.788')] [2022-07-11 03:06:40,099][26022] Updated weights on worker 0-0, policy_version 1008535 (0.00399) [2022-07-11 03:06:41,986][26022] Updated weights on worker 0-0, policy_version 1008545 (0.00084) [2022-07-11 03:06:43,264][25689] Fps is (10 sec: 5407.0, 60 sec: 5504.3, 300 sec: 5561.9). Total num frames: 1032756224. Throughput: 0: 5868.4. Samples: 1032759868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:43,264][25689] Avg episode reward: [(0, '0.862')] [2022-07-11 03:06:43,811][26022] Updated weights on worker 0-0, policy_version 1008555 (0.00082) [2022-07-11 03:06:45,503][26022] Updated weights on worker 0-0, policy_version 1008565 (0.00083) [2022-07-11 03:06:47,379][26022] Updated weights on worker 0-0, policy_version 1008575 (0.00092) [2022-07-11 03:06:48,400][25689] Fps is (10 sec: 5640.8, 60 sec: 5564.0, 300 sec: 5570.0). Total num frames: 1032786944. Throughput: 0: 5838.0. Samples: 1032793298. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:48,400][25689] Avg episode reward: [(0, '1.594')] [2022-07-11 03:06:49,030][26022] Updated weights on worker 0-0, policy_version 1008585 (0.00447) [2022-07-11 03:06:51,166][26022] Updated weights on worker 0-0, policy_version 1008595 (0.00054) [2022-07-11 03:06:52,947][26022] Updated weights on worker 0-0, policy_version 1008605 (0.00091) [2022-07-11 03:06:53,455][25689] Fps is (10 sec: 5626.7, 60 sec: 5528.3, 300 sec: 5565.6). Total num frames: 1032813568. Throughput: 0: 4975.0. Samples: 1032809944. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:53,456][25689] Avg episode reward: [(0, '1.340')] [2022-07-11 03:06:54,675][26022] Updated weights on worker 0-0, policy_version 1008615 (0.00081) [2022-07-11 03:06:56,571][26022] Updated weights on worker 0-0, policy_version 1008625 (0.00083) [2022-07-11 03:06:58,192][26022] Updated weights on worker 0-0, policy_version 1008635 (0.00084) [2022-07-11 03:06:58,503][25689] Fps is (10 sec: 5574.5, 60 sec: 5541.3, 300 sec: 5565.0). Total num frames: 1032843264. Throughput: 0: 5799.2. Samples: 1032843758. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:06:58,503][25689] Avg episode reward: [(0, '1.402')] [2022-07-11 03:07:00,359][26022] Updated weights on worker 0-0, policy_version 1008645 (0.00084) [2022-07-11 03:07:02,195][26022] Updated weights on worker 0-0, policy_version 1008655 (0.00092) [2022-07-11 03:07:03,566][25689] Fps is (10 sec: 5570.2, 60 sec: 5586.5, 300 sec: 5568.3). Total num frames: 1032869888. Throughput: 0: 5696.0. Samples: 1032875510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:03,567][25689] Avg episode reward: [(0, '1.241')] [2022-07-11 03:07:04,266][26022] Updated weights on worker 0-0, policy_version 1008665 (0.00101) [2022-07-11 03:07:05,953][26022] Updated weights on worker 0-0, policy_version 1008675 (0.00087) [2022-07-11 03:07:07,860][26022] Updated weights on worker 0-0, policy_version 1008685 (0.00088) [2022-07-11 03:07:08,649][25689] Fps is (10 sec: 5248.1, 60 sec: 5533.4, 300 sec: 5568.6). Total num frames: 1032896512. Throughput: 0: 4883.7. Samples: 1032892190. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:08,650][25689] Avg episode reward: [(0, '1.256')] [2022-07-11 03:07:09,564][26022] Updated weights on worker 0-0, policy_version 1008695 (0.00089) [2022-07-11 03:07:11,766][26022] Updated weights on worker 0-0, policy_version 1008705 (0.00089) [2022-07-11 03:07:13,257][26022] Updated weights on worker 0-0, policy_version 1008715 (0.00094) [2022-07-11 03:07:13,719][25689] Fps is (10 sec: 5547.3, 60 sec: 5561.2, 300 sec: 5565.5). Total num frames: 1032926208. Throughput: 0: 5704.9. Samples: 1032925546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:13,721][25689] Avg episode reward: [(0, '1.235')] [2022-07-11 03:07:15,184][26022] Updated weights on worker 0-0, policy_version 1008725 (0.00090) [2022-07-11 03:07:16,935][26022] Updated weights on worker 0-0, policy_version 1008735 (0.00085) [2022-07-11 03:07:18,723][25689] Fps is (10 sec: 5590.7, 60 sec: 5544.6, 300 sec: 5565.6). Total num frames: 1032952832. Throughput: 0: 5705.7. Samples: 1032959128. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:18,723][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 03:07:18,831][26022] Updated weights on worker 0-0, policy_version 1008745 (0.00081) [2022-07-11 03:07:20,700][26022] Updated weights on worker 0-0, policy_version 1008755 (0.00091) [2022-07-11 03:07:22,600][26022] Updated weights on worker 0-0, policy_version 1008765 (0.00093) [2022-07-11 03:07:23,779][25689] Fps is (10 sec: 5598.3, 60 sec: 5575.5, 300 sec: 5565.7). Total num frames: 1032982528. Throughput: 0: 4962.1. Samples: 1032975808. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:23,780][25689] Avg episode reward: [(0, '1.507')] [2022-07-11 03:07:24,333][26022] Updated weights on worker 0-0, policy_version 1008775 (0.00086) [2022-07-11 03:07:26,466][26022] Updated weights on worker 0-0, policy_version 1008785 (0.00084) [2022-07-11 03:07:27,974][26022] Updated weights on worker 0-0, policy_version 1008795 (0.00090) [2022-07-11 03:07:28,852][25689] Fps is (10 sec: 5661.3, 60 sec: 5556.1, 300 sec: 5568.5). Total num frames: 1033010176. Throughput: 0: 5783.2. Samples: 1033009028. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:28,853][25689] Avg episode reward: [(0, '0.811')] [2022-07-11 03:07:29,136][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:07:29,147][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001008800_1033011200.pth [2022-07-11 03:07:29,147][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001006842_1031006208.pth [2022-07-11 03:07:29,992][26022] Updated weights on worker 0-0, policy_version 1008805 (0.00086) [2022-07-11 03:07:31,748][26022] Updated weights on worker 0-0, policy_version 1008815 (0.00092) [2022-07-11 03:07:33,740][26022] Updated weights on worker 0-0, policy_version 1008825 (0.00081) [2022-07-11 03:07:33,858][25689] Fps is (10 sec: 5486.1, 60 sec: 5539.5, 300 sec: 5565.0). Total num frames: 1033037824. Throughput: 0: 5792.7. Samples: 1033042208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:33,859][25689] Avg episode reward: [(0, '1.006')] [2022-07-11 03:07:35,578][26022] Updated weights on worker 0-0, policy_version 1008835 (0.00089) [2022-07-11 03:07:37,495][26022] Updated weights on worker 0-0, policy_version 1008845 (0.00084) [2022-07-11 03:07:38,904][25689] Fps is (10 sec: 5501.2, 60 sec: 5536.1, 300 sec: 5565.1). Total num frames: 1033065472. Throughput: 0: 4947.0. Samples: 1033058962. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:38,904][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 03:07:39,101][26022] Updated weights on worker 0-0, policy_version 1008855 (0.00087) [2022-07-11 03:07:41,115][26022] Updated weights on worker 0-0, policy_version 1008865 (0.00083) [2022-07-11 03:07:42,631][26022] Updated weights on worker 0-0, policy_version 1008875 (0.00095) [2022-07-11 03:07:43,914][25689] Fps is (10 sec: 5498.9, 60 sec: 5554.7, 300 sec: 5566.6). Total num frames: 1033093120. Throughput: 0: 5789.2. Samples: 1033092374. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:43,915][25689] Avg episode reward: [(0, '0.966')] [2022-07-11 03:07:45,043][26022] Updated weights on worker 0-0, policy_version 1008885 (0.00096) [2022-07-11 03:07:46,272][26022] Updated weights on worker 0-0, policy_version 1008895 (0.00101) [2022-07-11 03:07:48,456][26022] Updated weights on worker 0-0, policy_version 1008905 (0.00088) [2022-07-11 03:07:48,984][25689] Fps is (10 sec: 5587.1, 60 sec: 5527.0, 300 sec: 5558.7). Total num frames: 1033121792. Throughput: 0: 5804.0. Samples: 1033125874. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:48,985][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 03:07:49,947][26022] Updated weights on worker 0-0, policy_version 1008915 (0.00095) [2022-07-11 03:07:52,081][26022] Updated weights on worker 0-0, policy_version 1008925 (0.00108) [2022-07-11 03:07:53,911][26022] Updated weights on worker 0-0, policy_version 1008935 (0.00087) [2022-07-11 03:07:53,998][25689] Fps is (10 sec: 5585.3, 60 sec: 5547.7, 300 sec: 5562.6). Total num frames: 1033149440. Throughput: 0: 5822.9. Samples: 1033159478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:54,008][25689] Avg episode reward: [(0, '1.118')] [2022-07-11 03:07:55,688][26022] Updated weights on worker 0-0, policy_version 1008945 (0.00090) [2022-07-11 03:07:57,479][26022] Updated weights on worker 0-0, policy_version 1008955 (0.00082) [2022-07-11 03:07:59,028][25689] Fps is (10 sec: 5607.5, 60 sec: 5532.4, 300 sec: 5566.0). Total num frames: 1033178112. Throughput: 0: 5826.2. Samples: 1033176208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:07:59,028][25689] Avg episode reward: [(0, '1.655')] [2022-07-11 03:07:59,692][26022] Updated weights on worker 0-0, policy_version 1008965 (0.00080) [2022-07-11 03:08:01,083][26022] Updated weights on worker 0-0, policy_version 1008975 (0.00087) [2022-07-11 03:08:03,558][26022] Updated weights on worker 0-0, policy_version 1008985 (0.00084) [2022-07-11 03:08:04,055][25689] Fps is (10 sec: 5396.5, 60 sec: 5518.8, 300 sec: 5560.2). Total num frames: 1033203712. Throughput: 0: 5730.7. Samples: 1033207794. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:08:04,055][25689] Avg episode reward: [(0, '1.315')] [2022-07-11 03:08:04,963][26022] Updated weights on worker 0-0, policy_version 1008995 (0.00080) [2022-07-11 03:08:07,197][26022] Updated weights on worker 0-0, policy_version 1009005 (0.00086) [2022-07-11 03:08:08,554][26022] Updated weights on worker 0-0, policy_version 1009015 (0.00096) [2022-07-11 03:08:09,159][25689] Fps is (10 sec: 5356.7, 60 sec: 5550.7, 300 sec: 5562.0). Total num frames: 1033232384. Throughput: 0: 5741.5. Samples: 1033241710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:08:09,160][25689] Avg episode reward: [(0, '1.181')] [2022-07-11 03:08:10,697][26022] Updated weights on worker 0-0, policy_version 1009025 (0.00091) [2022-07-11 03:08:12,440][26022] Updated weights on worker 0-0, policy_version 1009035 (0.00089) [2022-07-11 03:08:14,170][25689] Fps is (10 sec: 5669.2, 60 sec: 5539.2, 300 sec: 5565.6). Total num frames: 1033261056. Throughput: 0: 4914.0. Samples: 1033258602. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:08:14,170][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 03:08:14,373][26022] Updated weights on worker 0-0, policy_version 1009045 (0.00094) [2022-07-11 03:08:16,028][26022] Updated weights on worker 0-0, policy_version 1009055 (0.01025) [2022-07-11 03:08:18,171][26022] Updated weights on worker 0-0, policy_version 1009065 (0.00092) [2022-07-11 03:08:19,183][25689] Fps is (10 sec: 5618.9, 60 sec: 5555.3, 300 sec: 5558.5). Total num frames: 1033288704. Throughput: 0: 5759.3. Samples: 1033292286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:08:19,183][25689] Avg episode reward: [(0, '0.859')] [2022-07-11 03:08:19,763][26022] Updated weights on worker 0-0, policy_version 1009075 (0.00084) [2022-07-11 03:08:21,755][26022] Updated weights on worker 0-0, policy_version 1009085 (0.00106) [2022-07-11 03:08:23,401][26022] Updated weights on worker 0-0, policy_version 1009095 (0.00087) [2022-07-11 03:08:24,195][25689] Fps is (10 sec: 5515.5, 60 sec: 5525.4, 300 sec: 5560.3). Total num frames: 1033316352. Throughput: 0: 5858.4. Samples: 1033325784. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2022-07-11 03:08:24,196][25689] Avg episode reward: [(0, '0.776')] [2022-07-11 03:08:25,348][26022] Updated weights on worker 0-0, policy_version 1009105 (0.00088) [2022-07-11 03:08:27,186][26022] Updated weights on worker 0-0, policy_version 1009115 (0.00089) [2022-07-11 03:08:29,079][26022] Updated weights on worker 0-0, policy_version 1009125 (0.00088) [2022-07-11 03:08:29,275][25689] Fps is (10 sec: 5580.6, 60 sec: 5541.8, 300 sec: 5563.0). Total num frames: 1033345024. Throughput: 0: 5000.0. Samples: 1033342286. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:29,275][25689] Avg episode reward: [(0, '1.016')] [2022-07-11 03:08:30,703][26022] Updated weights on worker 0-0, policy_version 1009135 (0.00088) [2022-07-11 03:08:32,763][26022] Updated weights on worker 0-0, policy_version 1009145 (0.00481) [2022-07-11 03:08:34,239][26022] Updated weights on worker 0-0, policy_version 1009155 (0.00086) [2022-07-11 03:08:34,287][25689] Fps is (10 sec: 5783.9, 60 sec: 5575.2, 300 sec: 5566.5). Total num frames: 1033374720. Throughput: 0: 5829.6. Samples: 1033375876. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:34,287][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 03:08:36,461][26022] Updated weights on worker 0-0, policy_version 1009165 (0.00495) [2022-07-11 03:08:38,011][26022] Updated weights on worker 0-0, policy_version 1009175 (0.00086) [2022-07-11 03:08:39,288][25689] Fps is (10 sec: 5419.7, 60 sec: 5528.3, 300 sec: 5553.0). Total num frames: 1033399296. Throughput: 0: 5830.4. Samples: 1033409510. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:39,289][25689] Avg episode reward: [(0, '0.148')] [2022-07-11 03:08:39,979][26022] Updated weights on worker 0-0, policy_version 1009185 (0.00086) [2022-07-11 03:08:42,098][26022] Updated weights on worker 0-0, policy_version 1009195 (0.00087) [2022-07-11 03:08:43,564][26022] Updated weights on worker 0-0, policy_version 1009205 (0.00087) [2022-07-11 03:08:44,298][25689] Fps is (10 sec: 5523.4, 60 sec: 5579.3, 300 sec: 5568.3). Total num frames: 1033430016. Throughput: 0: 4991.8. Samples: 1033426132. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:44,298][25689] Avg episode reward: [(0, '0.406')] [2022-07-11 03:08:45,727][26022] Updated weights on worker 0-0, policy_version 1009215 (0.00098) [2022-07-11 03:08:47,388][26022] Updated weights on worker 0-0, policy_version 1009225 (0.00094) [2022-07-11 03:08:49,214][26022] Updated weights on worker 0-0, policy_version 1009235 (0.00081) [2022-07-11 03:08:49,403][25689] Fps is (10 sec: 5669.7, 60 sec: 5542.1, 300 sec: 5561.2). Total num frames: 1033456640. Throughput: 0: 5814.8. Samples: 1033459326. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:49,403][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 03:08:51,336][26022] Updated weights on worker 0-0, policy_version 1009245 (0.00084) [2022-07-11 03:08:52,985][26022] Updated weights on worker 0-0, policy_version 1009255 (0.00097) [2022-07-11 03:08:54,413][25689] Fps is (10 sec: 5264.3, 60 sec: 5525.5, 300 sec: 5554.7). Total num frames: 1033483264. Throughput: 0: 5779.6. Samples: 1033492196. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:54,413][25689] Avg episode reward: [(0, '-0.234')] [2022-07-11 03:08:55,054][26022] Updated weights on worker 0-0, policy_version 1009265 (0.00088) [2022-07-11 03:08:56,699][26022] Updated weights on worker 0-0, policy_version 1009275 (0.00089) [2022-07-11 03:08:58,579][26022] Updated weights on worker 0-0, policy_version 1009285 (0.00768) [2022-07-11 03:08:59,421][25689] Fps is (10 sec: 5519.5, 60 sec: 5527.5, 300 sec: 5556.6). Total num frames: 1033511936. Throughput: 0: 4939.4. Samples: 1033508952. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:08:59,421][25689] Avg episode reward: [(0, '-0.427')] [2022-07-11 03:09:00,657][26022] Updated weights on worker 0-0, policy_version 1009295 (0.00083) [2022-07-11 03:09:02,586][26022] Updated weights on worker 0-0, policy_version 1009305 (0.00086) [2022-07-11 03:09:04,450][25689] Fps is (10 sec: 5406.7, 60 sec: 5527.3, 300 sec: 5552.3). Total num frames: 1033537536. Throughput: 0: 5666.6. Samples: 1033540330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:04,451][25689] Avg episode reward: [(0, '-0.929')] [2022-07-11 03:09:04,500][26022] Updated weights on worker 0-0, policy_version 1009315 (0.00090) [2022-07-11 03:09:06,386][26022] Updated weights on worker 0-0, policy_version 1009325 (0.00094) [2022-07-11 03:09:08,035][26022] Updated weights on worker 0-0, policy_version 1009335 (0.00085) [2022-07-11 03:09:09,506][25689] Fps is (10 sec: 5280.1, 60 sec: 5514.8, 300 sec: 5548.9). Total num frames: 1033565184. Throughput: 0: 5689.1. Samples: 1033573694. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:09,506][25689] Avg episode reward: [(0, '-0.296')] [2022-07-11 03:09:09,959][26022] Updated weights on worker 0-0, policy_version 1009345 (0.00079) [2022-07-11 03:09:11,805][26022] Updated weights on worker 0-0, policy_version 1009355 (0.00087) [2022-07-11 03:09:13,536][26022] Updated weights on worker 0-0, policy_version 1009365 (0.00093) [2022-07-11 03:09:14,510][25689] Fps is (10 sec: 5598.7, 60 sec: 5515.4, 300 sec: 5550.6). Total num frames: 1033593856. Throughput: 0: 4885.5. Samples: 1033590382. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:14,511][25689] Avg episode reward: [(0, '-0.338')] [2022-07-11 03:09:15,508][26022] Updated weights on worker 0-0, policy_version 1009375 (0.00089) [2022-07-11 03:09:17,388][26022] Updated weights on worker 0-0, policy_version 1009385 (0.00085) [2022-07-11 03:09:19,080][26022] Updated weights on worker 0-0, policy_version 1009395 (0.00093) [2022-07-11 03:09:19,513][25689] Fps is (10 sec: 5832.5, 60 sec: 5550.2, 300 sec: 5555.1). Total num frames: 1033623552. Throughput: 0: 5742.0. Samples: 1033624320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:19,513][25689] Avg episode reward: [(0, '-0.141')] [2022-07-11 03:09:21,154][26022] Updated weights on worker 0-0, policy_version 1009405 (0.00092) [2022-07-11 03:09:22,711][26022] Updated weights on worker 0-0, policy_version 1009415 (0.00094) [2022-07-11 03:09:24,565][25689] Fps is (10 sec: 5601.1, 60 sec: 5529.6, 300 sec: 5549.5). Total num frames: 1033650176. Throughput: 0: 5832.9. Samples: 1033657656. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:24,566][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 03:09:24,661][26022] Updated weights on worker 0-0, policy_version 1009425 (0.00087) [2022-07-11 03:09:26,575][26022] Updated weights on worker 0-0, policy_version 1009435 (0.00087) [2022-07-11 03:09:28,396][26022] Updated weights on worker 0-0, policy_version 1009445 (0.00094) [2022-07-11 03:09:29,275][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:09:29,289][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001009451_1033677824.pth [2022-07-11 03:09:29,289][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001007494_1031673856.pth [2022-07-11 03:09:29,628][25689] Fps is (10 sec: 5466.9, 60 sec: 5531.2, 300 sec: 5545.1). Total num frames: 1033678848. Throughput: 0: 5013.0. Samples: 1033674566. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:29,628][25689] Avg episode reward: [(0, '0.370')] [2022-07-11 03:09:30,309][26022] Updated weights on worker 0-0, policy_version 1009455 (0.00619) [2022-07-11 03:09:31,933][26022] Updated weights on worker 0-0, policy_version 1009465 (0.00087) [2022-07-11 03:09:33,784][26022] Updated weights on worker 0-0, policy_version 1009475 (0.00087) [2022-07-11 03:09:34,654][25689] Fps is (10 sec: 5582.4, 60 sec: 5496.0, 300 sec: 5552.3). Total num frames: 1033706496. Throughput: 0: 5855.2. Samples: 1033708328. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:34,655][25689] Avg episode reward: [(0, '0.922')] [2022-07-11 03:09:35,433][26022] Updated weights on worker 0-0, policy_version 1009485 (0.00095) [2022-07-11 03:09:37,500][26022] Updated weights on worker 0-0, policy_version 1009495 (0.00094) [2022-07-11 03:09:39,154][26022] Updated weights on worker 0-0, policy_version 1009505 (0.00089) [2022-07-11 03:09:39,663][25689] Fps is (10 sec: 5612.3, 60 sec: 5563.2, 300 sec: 5545.8). Total num frames: 1033735168. Throughput: 0: 5865.6. Samples: 1033742512. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:39,665][25689] Avg episode reward: [(0, '0.323')] [2022-07-11 03:09:41,026][26022] Updated weights on worker 0-0, policy_version 1009515 (0.00083) [2022-07-11 03:09:42,934][26022] Updated weights on worker 0-0, policy_version 1009525 (0.00087) [2022-07-11 03:09:44,623][26022] Updated weights on worker 0-0, policy_version 1009535 (0.00094) [2022-07-11 03:09:44,714][25689] Fps is (10 sec: 5700.2, 60 sec: 5525.4, 300 sec: 5549.8). Total num frames: 1033763840. Throughput: 0: 5056.3. Samples: 1033759534. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:44,714][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 03:09:46,522][26022] Updated weights on worker 0-0, policy_version 1009545 (0.00092) [2022-07-11 03:09:48,424][26022] Updated weights on worker 0-0, policy_version 1009555 (0.00082) [2022-07-11 03:09:49,776][25689] Fps is (10 sec: 5568.8, 60 sec: 5546.3, 300 sec: 5549.3). Total num frames: 1033791488. Throughput: 0: 5881.6. Samples: 1033793072. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:49,777][25689] Avg episode reward: [(0, '-0.358')] [2022-07-11 03:09:50,017][26022] Updated weights on worker 0-0, policy_version 1009565 (0.00086) [2022-07-11 03:09:52,187][26022] Updated weights on worker 0-0, policy_version 1009575 (0.00086) [2022-07-11 03:09:53,802][26022] Updated weights on worker 0-0, policy_version 1009585 (0.00092) [2022-07-11 03:09:54,796][25689] Fps is (10 sec: 5484.3, 60 sec: 5562.3, 300 sec: 5542.3). Total num frames: 1033819136. Throughput: 0: 5870.8. Samples: 1033826582. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:54,797][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 03:09:55,784][26022] Updated weights on worker 0-0, policy_version 1009595 (0.00047) [2022-07-11 03:09:57,560][26022] Updated weights on worker 0-0, policy_version 1009605 (0.00089) [2022-07-11 03:09:59,345][26022] Updated weights on worker 0-0, policy_version 1009615 (0.00086) [2022-07-11 03:09:59,815][25689] Fps is (10 sec: 5712.4, 60 sec: 5578.3, 300 sec: 5560.6). Total num frames: 1033848832. Throughput: 0: 4990.1. Samples: 1033843072. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:09:59,815][25689] Avg episode reward: [(0, '0.363')] [2022-07-11 03:10:01,369][26022] Updated weights on worker 0-0, policy_version 1009625 (0.00100) [2022-07-11 03:10:03,327][26022] Updated weights on worker 0-0, policy_version 1009635 (0.00089) [2022-07-11 03:10:04,880][25689] Fps is (10 sec: 5382.0, 60 sec: 5558.0, 300 sec: 5543.7). Total num frames: 1033873408. Throughput: 0: 5715.0. Samples: 1033874786. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:04,881][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 03:10:05,383][26022] Updated weights on worker 0-0, policy_version 1009645 (0.00086) [2022-07-11 03:10:07,159][26022] Updated weights on worker 0-0, policy_version 1009655 (0.00089) [2022-07-11 03:10:08,958][26022] Updated weights on worker 0-0, policy_version 1009665 (0.00088) [2022-07-11 03:10:09,920][25689] Fps is (10 sec: 5370.9, 60 sec: 5593.4, 300 sec: 5553.7). Total num frames: 1033903104. Throughput: 0: 5729.7. Samples: 1033908488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:09,920][25689] Avg episode reward: [(0, '0.158')] [2022-07-11 03:10:10,749][26022] Updated weights on worker 0-0, policy_version 1009675 (0.00083) [2022-07-11 03:10:12,584][26022] Updated weights on worker 0-0, policy_version 1009685 (0.00080) [2022-07-11 03:10:14,320][26022] Updated weights on worker 0-0, policy_version 1009695 (0.00094) [2022-07-11 03:10:15,008][25689] Fps is (10 sec: 5763.5, 60 sec: 5585.7, 300 sec: 5548.8). Total num frames: 1033931776. Throughput: 0: 4893.5. Samples: 1033925486. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:15,008][25689] Avg episode reward: [(0, '0.349')] [2022-07-11 03:10:16,165][26022] Updated weights on worker 0-0, policy_version 1009705 (0.00095) [2022-07-11 03:10:17,936][26022] Updated weights on worker 0-0, policy_version 1009715 (0.00087) [2022-07-11 03:10:19,832][26022] Updated weights on worker 0-0, policy_version 1009725 (0.00087) [2022-07-11 03:10:20,013][25689] Fps is (10 sec: 5579.8, 60 sec: 5551.6, 300 sec: 5552.3). Total num frames: 1033959424. Throughput: 0: 5755.2. Samples: 1033959318. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:20,015][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 03:10:21,547][26022] Updated weights on worker 0-0, policy_version 1009735 (0.00087) [2022-07-11 03:10:23,546][26022] Updated weights on worker 0-0, policy_version 1009745 (0.00092) [2022-07-11 03:10:25,062][25689] Fps is (10 sec: 5499.6, 60 sec: 5568.8, 300 sec: 5548.6). Total num frames: 1033987072. Throughput: 0: 5852.2. Samples: 1033992892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:25,064][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 03:10:25,283][26022] Updated weights on worker 0-0, policy_version 1009755 (0.00054) [2022-07-11 03:10:27,438][26022] Updated weights on worker 0-0, policy_version 1009765 (0.00085) [2022-07-11 03:10:28,862][26022] Updated weights on worker 0-0, policy_version 1009775 (0.00094) [2022-07-11 03:10:30,188][25689] Fps is (10 sec: 5535.2, 60 sec: 5563.0, 300 sec: 5546.5). Total num frames: 1034015744. Throughput: 0: 4973.7. Samples: 1034009298. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:30,190][25689] Avg episode reward: [(0, '0.595')] [2022-07-11 03:10:31,085][26022] Updated weights on worker 0-0, policy_version 1009785 (0.00088) [2022-07-11 03:10:32,499][26022] Updated weights on worker 0-0, policy_version 1009795 (0.00089) [2022-07-11 03:10:34,666][26022] Updated weights on worker 0-0, policy_version 1009805 (0.00089) [2022-07-11 03:10:35,279][25689] Fps is (10 sec: 5612.7, 60 sec: 5573.9, 300 sec: 5552.2). Total num frames: 1034044416. Throughput: 0: 5787.3. Samples: 1034042802. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:35,279][25689] Avg episode reward: [(0, '-0.014')] [2022-07-11 03:10:36,232][26022] Updated weights on worker 0-0, policy_version 1009815 (0.00108) [2022-07-11 03:10:38,052][26022] Updated weights on worker 0-0, policy_version 1009825 (0.00088) [2022-07-11 03:10:40,107][26022] Updated weights on worker 0-0, policy_version 1009835 (0.00081) [2022-07-11 03:10:40,301][25689] Fps is (10 sec: 5569.3, 60 sec: 5555.9, 300 sec: 5541.7). Total num frames: 1034072064. Throughput: 0: 5791.8. Samples: 1034076820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:40,303][25689] Avg episode reward: [(0, '1.026')] [2022-07-11 03:10:41,682][26022] Updated weights on worker 0-0, policy_version 1009845 (0.00092) [2022-07-11 03:10:43,662][26022] Updated weights on worker 0-0, policy_version 1009855 (0.00089) [2022-07-11 03:10:45,286][26022] Updated weights on worker 0-0, policy_version 1009865 (0.00084) [2022-07-11 03:10:45,330][25689] Fps is (10 sec: 5705.1, 60 sec: 5574.7, 300 sec: 5552.3). Total num frames: 1034101760. Throughput: 0: 5794.3. Samples: 1034110332. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:45,331][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 03:10:47,344][26022] Updated weights on worker 0-0, policy_version 1009875 (0.00089) [2022-07-11 03:10:49,266][26022] Updated weights on worker 0-0, policy_version 1009885 (0.00088) [2022-07-11 03:10:50,407][25689] Fps is (10 sec: 5674.3, 60 sec: 5573.4, 300 sec: 5548.1). Total num frames: 1034129408. Throughput: 0: 5832.8. Samples: 1034127228. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:50,407][25689] Avg episode reward: [(0, '0.965')] [2022-07-11 03:10:50,818][26022] Updated weights on worker 0-0, policy_version 1009895 (0.00087) [2022-07-11 03:10:52,751][26022] Updated weights on worker 0-0, policy_version 1009905 (0.00087) [2022-07-11 03:10:54,482][26022] Updated weights on worker 0-0, policy_version 1009915 (0.00088) [2022-07-11 03:10:55,449][25689] Fps is (10 sec: 5464.8, 60 sec: 5571.4, 300 sec: 5544.0). Total num frames: 1034157056. Throughput: 0: 5855.6. Samples: 1034160910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:10:55,451][25689] Avg episode reward: [(0, '1.071')] [2022-07-11 03:10:56,346][26022] Updated weights on worker 0-0, policy_version 1009925 (0.00081) [2022-07-11 03:10:58,328][26022] Updated weights on worker 0-0, policy_version 1009935 (0.00087) [2022-07-11 03:10:59,956][26022] Updated weights on worker 0-0, policy_version 1009945 (0.00085) [2022-07-11 03:11:00,462][25689] Fps is (10 sec: 5601.1, 60 sec: 5555.0, 300 sec: 5561.0). Total num frames: 1034185728. Throughput: 0: 5836.4. Samples: 1034194488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:00,463][25689] Avg episode reward: [(0, '1.450')] [2022-07-11 03:11:02,191][26022] Updated weights on worker 0-0, policy_version 1009955 (0.00081) [2022-07-11 03:11:04,190][26022] Updated weights on worker 0-0, policy_version 1009965 (0.00084) [2022-07-11 03:11:05,469][25689] Fps is (10 sec: 5518.5, 60 sec: 5594.2, 300 sec: 5551.6). Total num frames: 1034212352. Throughput: 0: 4912.5. Samples: 1034209264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:05,470][25689] Avg episode reward: [(0, '1.728')] [2022-07-11 03:11:05,819][26022] Updated weights on worker 0-0, policy_version 1009975 (0.00088) [2022-07-11 03:11:07,727][26022] Updated weights on worker 0-0, policy_version 1009985 (0.00086) [2022-07-11 03:11:09,359][26022] Updated weights on worker 0-0, policy_version 1009995 (0.00094) [2022-07-11 03:11:10,580][25689] Fps is (10 sec: 5363.6, 60 sec: 5553.8, 300 sec: 5549.6). Total num frames: 1034240000. Throughput: 0: 5743.1. Samples: 1034243088. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:10,581][25689] Avg episode reward: [(0, '1.024')] [2022-07-11 03:11:11,429][26022] Updated weights on worker 0-0, policy_version 1010005 (0.00081) [2022-07-11 03:11:13,186][26022] Updated weights on worker 0-0, policy_version 1010015 (0.00088) [2022-07-11 03:11:15,040][26022] Updated weights on worker 0-0, policy_version 1010025 (0.00089) [2022-07-11 03:11:15,619][25689] Fps is (10 sec: 5447.9, 60 sec: 5541.4, 300 sec: 5549.0). Total num frames: 1034267648. Throughput: 0: 5735.9. Samples: 1034276604. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:15,619][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 03:11:16,780][26022] Updated weights on worker 0-0, policy_version 1010035 (0.00094) [2022-07-11 03:11:18,791][26022] Updated weights on worker 0-0, policy_version 1010045 (0.00084) [2022-07-11 03:11:20,607][26022] Updated weights on worker 0-0, policy_version 1010055 (0.00084) [2022-07-11 03:11:20,678][25689] Fps is (10 sec: 5577.7, 60 sec: 5553.4, 300 sec: 5551.8). Total num frames: 1034296320. Throughput: 0: 4895.1. Samples: 1034293448. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:20,678][25689] Avg episode reward: [(0, '0.240')] [2022-07-11 03:11:22,493][26022] Updated weights on worker 0-0, policy_version 1010065 (0.00979) [2022-07-11 03:11:24,005][26022] Updated weights on worker 0-0, policy_version 1010075 (0.00089) [2022-07-11 03:11:25,704][25689] Fps is (10 sec: 5685.8, 60 sec: 5572.3, 300 sec: 5552.2). Total num frames: 1034324992. Throughput: 0: 5830.1. Samples: 1034327240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:25,705][25689] Avg episode reward: [(0, '0.590')] [2022-07-11 03:11:26,167][26022] Updated weights on worker 0-0, policy_version 1010085 (0.00082) [2022-07-11 03:11:27,704][26022] Updated weights on worker 0-0, policy_version 1010095 (0.00092) [2022-07-11 03:11:29,299][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:11:29,320][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001010102_1034344448.pth [2022-07-11 03:11:29,321][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001008148_1032343552.pth [2022-07-11 03:11:29,877][26022] Updated weights on worker 0-0, policy_version 1010105 (0.00092) [2022-07-11 03:11:30,774][25689] Fps is (10 sec: 5680.0, 60 sec: 5577.6, 300 sec: 5551.1). Total num frames: 1034353664. Throughput: 0: 5810.1. Samples: 1034360412. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:30,774][25689] Avg episode reward: [(0, '0.265')] [2022-07-11 03:11:31,557][26022] Updated weights on worker 0-0, policy_version 1010115 (0.00092) [2022-07-11 03:11:33,550][26022] Updated weights on worker 0-0, policy_version 1010125 (0.00074) [2022-07-11 03:11:35,245][26022] Updated weights on worker 0-0, policy_version 1010135 (0.00094) [2022-07-11 03:11:35,802][25689] Fps is (10 sec: 5476.5, 60 sec: 5549.5, 300 sec: 5547.3). Total num frames: 1034380288. Throughput: 0: 4990.1. Samples: 1034377318. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:35,802][25689] Avg episode reward: [(0, '0.625')] [2022-07-11 03:11:37,098][26022] Updated weights on worker 0-0, policy_version 1010145 (0.00093) [2022-07-11 03:11:39,012][26022] Updated weights on worker 0-0, policy_version 1010155 (0.00090) [2022-07-11 03:11:40,638][26022] Updated weights on worker 0-0, policy_version 1010165 (0.00088) [2022-07-11 03:11:40,811][25689] Fps is (10 sec: 5509.1, 60 sec: 5567.6, 300 sec: 5554.5). Total num frames: 1034408960. Throughput: 0: 5845.7. Samples: 1034411140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:40,811][25689] Avg episode reward: [(0, '1.240')] [2022-07-11 03:11:42,694][26022] Updated weights on worker 0-0, policy_version 1010175 (0.00086) [2022-07-11 03:11:44,372][26022] Updated weights on worker 0-0, policy_version 1010185 (0.00094) [2022-07-11 03:11:45,859][25689] Fps is (10 sec: 5600.0, 60 sec: 5532.1, 300 sec: 5545.9). Total num frames: 1034436608. Throughput: 0: 5836.1. Samples: 1034444862. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:45,859][25689] Avg episode reward: [(0, '0.981')] [2022-07-11 03:11:46,178][26022] Updated weights on worker 0-0, policy_version 1010195 (0.00085) [2022-07-11 03:11:48,102][26022] Updated weights on worker 0-0, policy_version 1010205 (0.00088) [2022-07-11 03:11:49,829][26022] Updated weights on worker 0-0, policy_version 1010215 (0.00090) [2022-07-11 03:11:50,918][25689] Fps is (10 sec: 5572.2, 60 sec: 5550.6, 300 sec: 5552.7). Total num frames: 1034465280. Throughput: 0: 5024.5. Samples: 1034461628. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:50,919][25689] Avg episode reward: [(0, '0.657')] [2022-07-11 03:11:51,652][26022] Updated weights on worker 0-0, policy_version 1010225 (0.00089) [2022-07-11 03:11:53,586][26022] Updated weights on worker 0-0, policy_version 1010235 (0.00089) [2022-07-11 03:11:55,327][26022] Updated weights on worker 0-0, policy_version 1010245 (0.00091) [2022-07-11 03:11:56,003][25689] Fps is (10 sec: 5652.9, 60 sec: 5563.6, 300 sec: 5548.5). Total num frames: 1034493952. Throughput: 0: 5852.7. Samples: 1034495550. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:11:56,004][25689] Avg episode reward: [(0, '0.623')] [2022-07-11 03:11:57,158][26022] Updated weights on worker 0-0, policy_version 1010255 (0.00080) [2022-07-11 03:11:58,892][26022] Updated weights on worker 0-0, policy_version 1010265 (0.00448) [2022-07-11 03:12:00,856][26022] Updated weights on worker 0-0, policy_version 1010275 (0.00096) [2022-07-11 03:12:01,022][25689] Fps is (10 sec: 5675.6, 60 sec: 5563.0, 300 sec: 5556.2). Total num frames: 1034522624. Throughput: 0: 5845.0. Samples: 1034529274. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:12:01,022][25689] Avg episode reward: [(0, '-0.133')] [2022-07-11 03:12:02,863][26022] Updated weights on worker 0-0, policy_version 1010285 (0.00087) [2022-07-11 03:12:04,958][26022] Updated weights on worker 0-0, policy_version 1010295 (0.00091) [2022-07-11 03:12:06,025][25689] Fps is (10 sec: 5517.5, 60 sec: 5563.4, 300 sec: 5557.7). Total num frames: 1034549248. Throughput: 0: 4920.1. Samples: 1034544082. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:12:06,025][25689] Avg episode reward: [(0, '0.148')] [2022-07-11 03:12:06,527][26022] Updated weights on worker 0-0, policy_version 1010305 (0.00089) [2022-07-11 03:12:08,549][26022] Updated weights on worker 0-0, policy_version 1010315 (0.00209) [2022-07-11 03:12:10,206][26022] Updated weights on worker 0-0, policy_version 1010325 (0.00086) [2022-07-11 03:12:11,136][25689] Fps is (10 sec: 5264.7, 60 sec: 5546.5, 300 sec: 5546.6). Total num frames: 1034575872. Throughput: 0: 5749.1. Samples: 1034577862. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:12:11,138][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 03:12:11,994][26022] Updated weights on worker 0-0, policy_version 1010335 (0.00086) [2022-07-11 03:12:14,126][26022] Updated weights on worker 0-0, policy_version 1010345 (0.00084) [2022-07-11 03:12:15,576][26022] Updated weights on worker 0-0, policy_version 1010355 (0.00092) [2022-07-11 03:12:16,186][25689] Fps is (10 sec: 5542.6, 60 sec: 5579.3, 300 sec: 5556.1). Total num frames: 1034605568. Throughput: 0: 5745.5. Samples: 1034611512. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:12:16,187][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 03:12:17,524][26022] Updated weights on worker 0-0, policy_version 1010365 (0.00086) [2022-07-11 03:12:19,575][26022] Updated weights on worker 0-0, policy_version 1010375 (0.00079) [2022-07-11 03:12:21,134][26022] Updated weights on worker 0-0, policy_version 1010385 (0.00075) [2022-07-11 03:12:21,274][25689] Fps is (10 sec: 5858.4, 60 sec: 5593.5, 300 sec: 5555.5). Total num frames: 1034635264. Throughput: 0: 4899.7. Samples: 1034628502. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 03:12:21,279][25689] Avg episode reward: [(0, '0.402')] [2022-07-11 03:12:23,088][26022] Updated weights on worker 0-0, policy_version 1010395 (0.00085) [2022-07-11 03:12:24,704][26022] Updated weights on worker 0-0, policy_version 1010405 (0.00083) [2022-07-11 03:12:26,312][25689] Fps is (10 sec: 5561.8, 60 sec: 5558.6, 300 sec: 5552.7). Total num frames: 1034661888. Throughput: 0: 5825.7. Samples: 1034662272. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:26,313][25689] Avg episode reward: [(0, '1.000')] [2022-07-11 03:12:26,798][26022] Updated weights on worker 0-0, policy_version 1010415 (0.00079) [2022-07-11 03:12:28,702][26022] Updated weights on worker 0-0, policy_version 1010425 (0.00089) [2022-07-11 03:12:30,429][26022] Updated weights on worker 0-0, policy_version 1010435 (0.00084) [2022-07-11 03:12:31,424][25689] Fps is (10 sec: 5447.4, 60 sec: 5554.7, 300 sec: 5554.2). Total num frames: 1034690560. Throughput: 0: 5791.2. Samples: 1034695358. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:31,425][25689] Avg episode reward: [(0, '-0.342')] [2022-07-11 03:12:32,214][26022] Updated weights on worker 0-0, policy_version 1010445 (0.00084) [2022-07-11 03:12:33,967][26022] Updated weights on worker 0-0, policy_version 1010455 (0.00084) [2022-07-11 03:12:35,940][26022] Updated weights on worker 0-0, policy_version 1010465 (0.00052) [2022-07-11 03:12:36,488][25689] Fps is (10 sec: 5534.8, 60 sec: 5568.3, 300 sec: 5553.8). Total num frames: 1034718208. Throughput: 0: 4959.0. Samples: 1034712190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:36,489][25689] Avg episode reward: [(0, '-0.632')] [2022-07-11 03:12:37,795][26022] Updated weights on worker 0-0, policy_version 1010475 (0.00086) [2022-07-11 03:12:39,757][26022] Updated weights on worker 0-0, policy_version 1010485 (0.00089) [2022-07-11 03:12:41,332][26022] Updated weights on worker 0-0, policy_version 1010495 (0.00090) [2022-07-11 03:12:41,501][25689] Fps is (10 sec: 5690.7, 60 sec: 5584.8, 300 sec: 5560.7). Total num frames: 1034747904. Throughput: 0: 5796.1. Samples: 1034745744. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:41,502][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 03:12:43,373][26022] Updated weights on worker 0-0, policy_version 1010505 (0.00092) [2022-07-11 03:12:45,009][26022] Updated weights on worker 0-0, policy_version 1010515 (0.00092) [2022-07-11 03:12:46,559][25689] Fps is (10 sec: 5592.3, 60 sec: 5567.0, 300 sec: 5554.0). Total num frames: 1034774528. Throughput: 0: 5785.2. Samples: 1034779404. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:46,560][25689] Avg episode reward: [(0, '-0.449')] [2022-07-11 03:12:46,967][26022] Updated weights on worker 0-0, policy_version 1010525 (0.00094) [2022-07-11 03:12:48,782][26022] Updated weights on worker 0-0, policy_version 1010535 (0.00088) [2022-07-11 03:12:50,506][26022] Updated weights on worker 0-0, policy_version 1010545 (0.00086) [2022-07-11 03:12:51,655][25689] Fps is (10 sec: 5445.7, 60 sec: 5563.6, 300 sec: 5555.9). Total num frames: 1034803200. Throughput: 0: 5817.0. Samples: 1034813040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:51,657][25689] Avg episode reward: [(0, '-0.837')] [2022-07-11 03:12:52,481][26022] Updated weights on worker 0-0, policy_version 1010555 (0.00087) [2022-07-11 03:12:54,195][26022] Updated weights on worker 0-0, policy_version 1010565 (0.00435) [2022-07-11 03:12:56,039][26022] Updated weights on worker 0-0, policy_version 1010575 (0.00092) [2022-07-11 03:12:56,720][25689] Fps is (10 sec: 5744.6, 60 sec: 5582.4, 300 sec: 5558.7). Total num frames: 1034832896. Throughput: 0: 5822.5. Samples: 1034829988. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:12:56,720][25689] Avg episode reward: [(0, '-0.068')] [2022-07-11 03:12:57,845][26022] Updated weights on worker 0-0, policy_version 1010585 (0.00094) [2022-07-11 03:12:59,667][26022] Updated weights on worker 0-0, policy_version 1010595 (0.00091) [2022-07-11 03:13:01,725][25689] Fps is (10 sec: 5592.8, 60 sec: 5549.9, 300 sec: 5562.6). Total num frames: 1034859520. Throughput: 0: 5806.6. Samples: 1034863176. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:01,726][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 03:13:01,737][26022] Updated weights on worker 0-0, policy_version 1010605 (0.00087) [2022-07-11 03:13:03,715][26022] Updated weights on worker 0-0, policy_version 1010615 (0.00082) [2022-07-11 03:13:05,636][26022] Updated weights on worker 0-0, policy_version 1010625 (0.00091) [2022-07-11 03:13:06,789][25689] Fps is (10 sec: 5288.0, 60 sec: 5544.3, 300 sec: 5556.4). Total num frames: 1034886144. Throughput: 0: 5708.1. Samples: 1034894880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:06,790][25689] Avg episode reward: [(0, '0.525')] [2022-07-11 03:13:07,319][26022] Updated weights on worker 0-0, policy_version 1010635 (0.00088) [2022-07-11 03:13:09,424][26022] Updated weights on worker 0-0, policy_version 1010645 (0.00092) [2022-07-11 03:13:11,199][26022] Updated weights on worker 0-0, policy_version 1010655 (0.00095) [2022-07-11 03:13:11,925][25689] Fps is (10 sec: 5421.1, 60 sec: 5575.7, 300 sec: 5554.1). Total num frames: 1034914816. Throughput: 0: 4856.3. Samples: 1034911478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:11,926][25689] Avg episode reward: [(0, '1.506')] [2022-07-11 03:13:12,883][26022] Updated weights on worker 0-0, policy_version 1010665 (0.00087) [2022-07-11 03:13:14,906][26022] Updated weights on worker 0-0, policy_version 1010675 (0.00089) [2022-07-11 03:13:16,475][26022] Updated weights on worker 0-0, policy_version 1010685 (0.00088) [2022-07-11 03:13:16,956][25689] Fps is (10 sec: 5640.6, 60 sec: 5560.7, 300 sec: 5557.2). Total num frames: 1034943488. Throughput: 0: 5686.0. Samples: 1034945050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:16,956][25689] Avg episode reward: [(0, '1.647')] [2022-07-11 03:13:18,589][26022] Updated weights on worker 0-0, policy_version 1010695 (0.00092) [2022-07-11 03:13:20,230][26022] Updated weights on worker 0-0, policy_version 1010705 (0.00088) [2022-07-11 03:13:21,961][25689] Fps is (10 sec: 5612.0, 60 sec: 5534.5, 300 sec: 5557.3). Total num frames: 1034971136. Throughput: 0: 5715.2. Samples: 1034978828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:21,963][25689] Avg episode reward: [(0, '1.636')] [2022-07-11 03:13:22,101][26022] Updated weights on worker 0-0, policy_version 1010715 (0.00087) [2022-07-11 03:13:23,834][26022] Updated weights on worker 0-0, policy_version 1010725 (0.00079) [2022-07-11 03:13:25,836][26022] Updated weights on worker 0-0, policy_version 1010735 (0.00092) [2022-07-11 03:13:27,015][25689] Fps is (10 sec: 5598.8, 60 sec: 5566.8, 300 sec: 5557.8). Total num frames: 1034999808. Throughput: 0: 4986.7. Samples: 1034995740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:27,017][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 03:13:27,532][26022] Updated weights on worker 0-0, policy_version 1010745 (0.00103) [2022-07-11 03:13:29,414][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:13:29,430][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001010755_1035013120.pth [2022-07-11 03:13:29,431][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001008800_1033011200.pth [2022-07-11 03:13:29,436][26022] Updated weights on worker 0-0, policy_version 1010755 (0.00087) [2022-07-11 03:13:31,214][26022] Updated weights on worker 0-0, policy_version 1010765 (0.00085) [2022-07-11 03:13:32,083][25689] Fps is (10 sec: 5462.7, 60 sec: 5537.1, 300 sec: 5546.4). Total num frames: 1035026432. Throughput: 0: 5824.5. Samples: 1035028888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:32,084][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 03:13:33,113][26022] Updated weights on worker 0-0, policy_version 1010775 (0.00995) [2022-07-11 03:13:35,041][26022] Updated weights on worker 0-0, policy_version 1010785 (0.00087) [2022-07-11 03:13:36,739][26022] Updated weights on worker 0-0, policy_version 1010795 (0.00083) [2022-07-11 03:13:37,101][25689] Fps is (10 sec: 5584.1, 60 sec: 5575.1, 300 sec: 5563.3). Total num frames: 1035056128. Throughput: 0: 5835.0. Samples: 1035062598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:37,101][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 03:13:38,688][26022] Updated weights on worker 0-0, policy_version 1010805 (0.00094) [2022-07-11 03:13:40,385][26022] Updated weights on worker 0-0, policy_version 1010815 (0.00091) [2022-07-11 03:13:42,176][25689] Fps is (10 sec: 5580.5, 60 sec: 5518.8, 300 sec: 5548.4). Total num frames: 1035082752. Throughput: 0: 4974.4. Samples: 1035079384. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:42,176][25689] Avg episode reward: [(0, '0.036')] [2022-07-11 03:13:42,312][26022] Updated weights on worker 0-0, policy_version 1010825 (0.00086) [2022-07-11 03:13:44,067][26022] Updated weights on worker 0-0, policy_version 1010835 (0.00087) [2022-07-11 03:13:45,962][26022] Updated weights on worker 0-0, policy_version 1010845 (0.00088) [2022-07-11 03:13:47,219][25689] Fps is (10 sec: 5566.3, 60 sec: 5570.7, 300 sec: 5559.8). Total num frames: 1035112448. Throughput: 0: 5808.5. Samples: 1035113094. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:47,219][25689] Avg episode reward: [(0, '-0.070')] [2022-07-11 03:13:47,646][26022] Updated weights on worker 0-0, policy_version 1010855 (0.00086) [2022-07-11 03:13:49,783][26022] Updated weights on worker 0-0, policy_version 1010865 (0.00088) [2022-07-11 03:13:51,457][26022] Updated weights on worker 0-0, policy_version 1010875 (0.00090) [2022-07-11 03:13:52,296][25689] Fps is (10 sec: 5565.3, 60 sec: 5538.8, 300 sec: 5558.6). Total num frames: 1035139072. Throughput: 0: 5786.3. Samples: 1035145842. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:52,296][25689] Avg episode reward: [(0, '-0.109')] [2022-07-11 03:13:53,539][26022] Updated weights on worker 0-0, policy_version 1010885 (0.00084) [2022-07-11 03:13:55,167][26022] Updated weights on worker 0-0, policy_version 1010895 (0.00089) [2022-07-11 03:13:57,031][26022] Updated weights on worker 0-0, policy_version 1010905 (0.00092) [2022-07-11 03:13:57,323][25689] Fps is (10 sec: 5574.1, 60 sec: 5542.2, 300 sec: 5561.7). Total num frames: 1035168768. Throughput: 0: 4943.2. Samples: 1035162558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:13:57,323][25689] Avg episode reward: [(0, '1.097')] [2022-07-11 03:13:58,947][26022] Updated weights on worker 0-0, policy_version 1010915 (0.00088) [2022-07-11 03:14:00,584][26022] Updated weights on worker 0-0, policy_version 1010925 (0.00089) [2022-07-11 03:14:02,324][25689] Fps is (10 sec: 5309.8, 60 sec: 5491.9, 300 sec: 5555.3). Total num frames: 1035192320. Throughput: 0: 5776.4. Samples: 1035195768. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:02,324][25689] Avg episode reward: [(0, '1.390')] [2022-07-11 03:14:03,202][26022] Updated weights on worker 0-0, policy_version 1010935 (0.00086) [2022-07-11 03:14:04,751][26022] Updated weights on worker 0-0, policy_version 1010945 (0.00084) [2022-07-11 03:14:06,625][26022] Updated weights on worker 0-0, policy_version 1010955 (0.00077) [2022-07-11 03:14:07,359][25689] Fps is (10 sec: 5203.7, 60 sec: 5528.3, 300 sec: 5559.1). Total num frames: 1035220992. Throughput: 0: 5665.8. Samples: 1035227202. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:07,359][25689] Avg episode reward: [(0, '1.683')] [2022-07-11 03:14:08,478][26022] Updated weights on worker 0-0, policy_version 1010965 (0.00091) [2022-07-11 03:14:10,292][26022] Updated weights on worker 0-0, policy_version 1010975 (0.00090) [2022-07-11 03:14:12,178][26022] Updated weights on worker 0-0, policy_version 1010985 (0.00092) [2022-07-11 03:14:12,436][25689] Fps is (10 sec: 5670.8, 60 sec: 5533.7, 300 sec: 5557.8). Total num frames: 1035249664. Throughput: 0: 4866.5. Samples: 1035243856. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:12,436][25689] Avg episode reward: [(0, '1.902')] [2022-07-11 03:14:14,086][26022] Updated weights on worker 0-0, policy_version 1010995 (0.00083) [2022-07-11 03:14:15,802][26022] Updated weights on worker 0-0, policy_version 1011005 (0.00093) [2022-07-11 03:14:17,448][25689] Fps is (10 sec: 5582.3, 60 sec: 5518.5, 300 sec: 5550.7). Total num frames: 1035277312. Throughput: 0: 5728.9. Samples: 1035277852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:17,448][25689] Avg episode reward: [(0, '1.690')] [2022-07-11 03:14:17,711][26022] Updated weights on worker 0-0, policy_version 1011015 (0.00089) [2022-07-11 03:14:19,345][26022] Updated weights on worker 0-0, policy_version 1011025 (0.00087) [2022-07-11 03:14:21,325][26022] Updated weights on worker 0-0, policy_version 1011035 (0.00113) [2022-07-11 03:14:22,459][25689] Fps is (10 sec: 5721.0, 60 sec: 5551.7, 300 sec: 5561.8). Total num frames: 1035307008. Throughput: 0: 5754.7. Samples: 1035311642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:22,460][25689] Avg episode reward: [(0, '1.893')] [2022-07-11 03:14:23,313][26022] Updated weights on worker 0-0, policy_version 1011045 (0.00086) [2022-07-11 03:14:24,849][26022] Updated weights on worker 0-0, policy_version 1011055 (0.00079) [2022-07-11 03:14:26,808][26022] Updated weights on worker 0-0, policy_version 1011065 (0.00088) [2022-07-11 03:14:27,475][25689] Fps is (10 sec: 5718.9, 60 sec: 5538.3, 300 sec: 5559.3). Total num frames: 1035334656. Throughput: 0: 5036.4. Samples: 1035328514. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:27,475][25689] Avg episode reward: [(0, '1.371')] [2022-07-11 03:14:28,539][26022] Updated weights on worker 0-0, policy_version 1011075 (0.00094) [2022-07-11 03:14:30,279][26022] Updated weights on worker 0-0, policy_version 1011085 (0.00094) [2022-07-11 03:14:32,312][26022] Updated weights on worker 0-0, policy_version 1011095 (0.00094) [2022-07-11 03:14:32,525][25689] Fps is (10 sec: 5493.5, 60 sec: 5557.0, 300 sec: 5558.8). Total num frames: 1035362304. Throughput: 0: 5871.0. Samples: 1035361800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:32,526][25689] Avg episode reward: [(0, '1.249')] [2022-07-11 03:14:34,025][26022] Updated weights on worker 0-0, policy_version 1011105 (0.00089) [2022-07-11 03:14:35,953][26022] Updated weights on worker 0-0, policy_version 1011115 (0.00082) [2022-07-11 03:14:37,541][25689] Fps is (10 sec: 5493.3, 60 sec: 5523.2, 300 sec: 5555.2). Total num frames: 1035389952. Throughput: 0: 5852.0. Samples: 1035395438. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:37,541][25689] Avg episode reward: [(0, '1.139')] [2022-07-11 03:14:37,805][26022] Updated weights on worker 0-0, policy_version 1011125 (0.00086) [2022-07-11 03:14:39,667][26022] Updated weights on worker 0-0, policy_version 1011135 (0.00088) [2022-07-11 03:14:41,614][26022] Updated weights on worker 0-0, policy_version 1011145 (0.00084) [2022-07-11 03:14:42,551][25689] Fps is (10 sec: 5617.2, 60 sec: 5563.0, 300 sec: 5556.0). Total num frames: 1035418624. Throughput: 0: 5012.6. Samples: 1035412354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:42,554][25689] Avg episode reward: [(0, '0.848')] [2022-07-11 03:14:43,143][26022] Updated weights on worker 0-0, policy_version 1011155 (0.00093) [2022-07-11 03:14:45,075][26022] Updated weights on worker 0-0, policy_version 1011165 (0.00087) [2022-07-11 03:14:46,886][26022] Updated weights on worker 0-0, policy_version 1011175 (0.00081) [2022-07-11 03:14:47,567][25689] Fps is (10 sec: 5515.4, 60 sec: 5514.7, 300 sec: 5553.4). Total num frames: 1035445248. Throughput: 0: 5844.6. Samples: 1035445942. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:47,567][25689] Avg episode reward: [(0, '0.694')] [2022-07-11 03:14:48,852][26022] Updated weights on worker 0-0, policy_version 1011185 (0.00085) [2022-07-11 03:14:50,691][26022] Updated weights on worker 0-0, policy_version 1011195 (0.00048) [2022-07-11 03:14:52,399][26022] Updated weights on worker 0-0, policy_version 1011205 (0.00082) [2022-07-11 03:14:52,619][25689] Fps is (10 sec: 5594.4, 60 sec: 5567.9, 300 sec: 5559.7). Total num frames: 1035474944. Throughput: 0: 5869.7. Samples: 1035479744. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:52,619][25689] Avg episode reward: [(0, '1.144')] [2022-07-11 03:14:54,204][26022] Updated weights on worker 0-0, policy_version 1011215 (0.00090) [2022-07-11 03:14:55,940][26022] Updated weights on worker 0-0, policy_version 1011225 (0.00084) [2022-07-11 03:14:57,658][25689] Fps is (10 sec: 5682.5, 60 sec: 5532.8, 300 sec: 5552.5). Total num frames: 1035502592. Throughput: 0: 5036.6. Samples: 1035496760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:14:57,658][25689] Avg episode reward: [(0, '1.067')] [2022-07-11 03:14:57,800][26022] Updated weights on worker 0-0, policy_version 1011235 (0.00090) [2022-07-11 03:14:59,570][26022] Updated weights on worker 0-0, policy_version 1011245 (0.00088) [2022-07-11 03:15:02,027][26022] Updated weights on worker 0-0, policy_version 1011255 (0.00091) [2022-07-11 03:15:02,671][25689] Fps is (10 sec: 5399.1, 60 sec: 5582.7, 300 sec: 5560.3). Total num frames: 1035529216. Throughput: 0: 5871.2. Samples: 1035530478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:02,671][25689] Avg episode reward: [(0, '1.091')] [2022-07-11 03:15:03,911][26022] Updated weights on worker 0-0, policy_version 1011265 (0.00082) [2022-07-11 03:15:05,466][26022] Updated weights on worker 0-0, policy_version 1011275 (0.00095) [2022-07-11 03:15:07,247][26022] Updated weights on worker 0-0, policy_version 1011285 (0.00086) [2022-07-11 03:15:07,686][25689] Fps is (10 sec: 5411.9, 60 sec: 5567.5, 300 sec: 5553.9). Total num frames: 1035556864. Throughput: 0: 5777.9. Samples: 1035562190. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:07,687][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 03:15:09,088][26022] Updated weights on worker 0-0, policy_version 1011295 (0.00085) [2022-07-11 03:15:10,994][26022] Updated weights on worker 0-0, policy_version 1011305 (0.00088) [2022-07-11 03:15:12,703][26022] Updated weights on worker 0-0, policy_version 1011315 (0.00086) [2022-07-11 03:15:12,739][25689] Fps is (10 sec: 5695.7, 60 sec: 5586.8, 300 sec: 5558.0). Total num frames: 1035586560. Throughput: 0: 4940.9. Samples: 1035579154. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:12,739][25689] Avg episode reward: [(0, '0.511')] [2022-07-11 03:15:14,603][26022] Updated weights on worker 0-0, policy_version 1011325 (0.00082) [2022-07-11 03:15:16,336][26022] Updated weights on worker 0-0, policy_version 1011335 (0.00087) [2022-07-11 03:15:17,783][25689] Fps is (10 sec: 5679.5, 60 sec: 5583.7, 300 sec: 5557.3). Total num frames: 1035614208. Throughput: 0: 5785.9. Samples: 1035613202. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:17,783][25689] Avg episode reward: [(0, '0.437')] [2022-07-11 03:15:18,175][26022] Updated weights on worker 0-0, policy_version 1011345 (0.00090) [2022-07-11 03:15:20,004][26022] Updated weights on worker 0-0, policy_version 1011355 (0.00088) [2022-07-11 03:15:21,735][26022] Updated weights on worker 0-0, policy_version 1011365 (0.00091) [2022-07-11 03:15:22,786][25689] Fps is (10 sec: 5605.5, 60 sec: 5567.6, 300 sec: 5561.6). Total num frames: 1035642880. Throughput: 0: 5806.6. Samples: 1035647278. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:22,786][25689] Avg episode reward: [(0, '0.176')] [2022-07-11 03:15:23,784][26022] Updated weights on worker 0-0, policy_version 1011375 (0.00084) [2022-07-11 03:15:25,480][26022] Updated weights on worker 0-0, policy_version 1011385 (0.00086) [2022-07-11 03:15:27,292][26022] Updated weights on worker 0-0, policy_version 1011395 (0.00085) [2022-07-11 03:15:27,828][25689] Fps is (10 sec: 5606.9, 60 sec: 5565.1, 300 sec: 5559.7). Total num frames: 1035670528. Throughput: 0: 5062.7. Samples: 1035664160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:27,829][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 03:15:29,260][26022] Updated weights on worker 0-0, policy_version 1011405 (0.00085) [2022-07-11 03:15:29,744][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:15:29,752][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001011407_1035680768.pth [2022-07-11 03:15:29,752][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001009451_1033677824.pth [2022-07-11 03:15:30,976][26022] Updated weights on worker 0-0, policy_version 1011415 (0.00098) [2022-07-11 03:15:32,883][25689] Fps is (10 sec: 5476.5, 60 sec: 5564.7, 300 sec: 5556.9). Total num frames: 1035698176. Throughput: 0: 5871.0. Samples: 1035697420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:32,883][25689] Avg episode reward: [(0, '0.152')] [2022-07-11 03:15:32,928][26022] Updated weights on worker 0-0, policy_version 1011425 (0.00089) [2022-07-11 03:15:34,672][26022] Updated weights on worker 0-0, policy_version 1011435 (0.00090) [2022-07-11 03:15:36,478][26022] Updated weights on worker 0-0, policy_version 1011445 (0.00091) [2022-07-11 03:15:37,898][25689] Fps is (10 sec: 5694.3, 60 sec: 5598.7, 300 sec: 5563.9). Total num frames: 1035727872. Throughput: 0: 5862.0. Samples: 1035731116. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:37,900][25689] Avg episode reward: [(0, '0.480')] [2022-07-11 03:15:38,207][26022] Updated weights on worker 0-0, policy_version 1011455 (0.00089) [2022-07-11 03:15:40,268][26022] Updated weights on worker 0-0, policy_version 1011465 (0.00095) [2022-07-11 03:15:41,982][26022] Updated weights on worker 0-0, policy_version 1011475 (0.00084) [2022-07-11 03:15:42,928][25689] Fps is (10 sec: 5606.6, 60 sec: 5562.9, 300 sec: 5553.6). Total num frames: 1035754496. Throughput: 0: 5812.6. Samples: 1035764356. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:42,930][25689] Avg episode reward: [(0, '0.786')] [2022-07-11 03:15:43,774][26022] Updated weights on worker 0-0, policy_version 1011485 (0.00080) [2022-07-11 03:15:45,723][26022] Updated weights on worker 0-0, policy_version 1011495 (0.00092) [2022-07-11 03:15:47,574][26022] Updated weights on worker 0-0, policy_version 1011505 (0.00085) [2022-07-11 03:15:47,940][25689] Fps is (10 sec: 5506.5, 60 sec: 5597.2, 300 sec: 5558.2). Total num frames: 1035783168. Throughput: 0: 5813.2. Samples: 1035781078. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:47,941][25689] Avg episode reward: [(0, '0.633')] [2022-07-11 03:15:49,486][26022] Updated weights on worker 0-0, policy_version 1011515 (0.00086) [2022-07-11 03:15:51,354][26022] Updated weights on worker 0-0, policy_version 1011525 (0.00089) [2022-07-11 03:15:53,012][25689] Fps is (10 sec: 5585.2, 60 sec: 5561.4, 300 sec: 5557.7). Total num frames: 1035810816. Throughput: 0: 5822.8. Samples: 1035814628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:53,012][25689] Avg episode reward: [(0, '0.506')] [2022-07-11 03:15:53,061][26022] Updated weights on worker 0-0, policy_version 1011535 (0.00086) [2022-07-11 03:15:54,913][26022] Updated weights on worker 0-0, policy_version 1011545 (0.00052) [2022-07-11 03:15:56,612][26022] Updated weights on worker 0-0, policy_version 1011555 (0.00090) [2022-07-11 03:15:58,094][25689] Fps is (10 sec: 5546.7, 60 sec: 5574.4, 300 sec: 5556.4). Total num frames: 1035839488. Throughput: 0: 5819.7. Samples: 1035848648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:15:58,094][25689] Avg episode reward: [(0, '-0.920')] [2022-07-11 03:15:58,552][26022] Updated weights on worker 0-0, policy_version 1011565 (0.00068) [2022-07-11 03:16:00,258][26022] Updated weights on worker 0-0, policy_version 1011575 (0.00052) [2022-07-11 03:16:02,147][26022] Updated weights on worker 0-0, policy_version 1011585 (0.00081) [2022-07-11 03:16:03,122][25689] Fps is (10 sec: 5367.8, 60 sec: 5556.0, 300 sec: 5552.6). Total num frames: 1035865088. Throughput: 0: 5011.3. Samples: 1035865556. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:16:03,123][25689] Avg episode reward: [(0, '-0.840')] [2022-07-11 03:16:04,273][26022] Updated weights on worker 0-0, policy_version 1011595 (0.00091) [2022-07-11 03:16:06,197][26022] Updated weights on worker 0-0, policy_version 1011605 (0.00070) [2022-07-11 03:16:07,851][26022] Updated weights on worker 0-0, policy_version 1011615 (0.00103) [2022-07-11 03:16:08,174][25689] Fps is (10 sec: 5485.8, 60 sec: 5586.6, 300 sec: 5560.6). Total num frames: 1035894784. Throughput: 0: 5733.5. Samples: 1035897088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:16:08,174][25689] Avg episode reward: [(0, '-1.304')] [2022-07-11 03:16:10,033][26022] Updated weights on worker 0-0, policy_version 1011625 (0.00084) [2022-07-11 03:16:11,524][26022] Updated weights on worker 0-0, policy_version 1011635 (0.00083) [2022-07-11 03:16:13,211][25689] Fps is (10 sec: 5684.3, 60 sec: 5554.2, 300 sec: 5560.6). Total num frames: 1035922432. Throughput: 0: 5749.1. Samples: 1035930752. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:16:13,211][25689] Avg episode reward: [(0, '-1.076')] [2022-07-11 03:16:13,580][26022] Updated weights on worker 0-0, policy_version 1011645 (0.00086) [2022-07-11 03:16:15,292][26022] Updated weights on worker 0-0, policy_version 1011655 (0.00088) [2022-07-11 03:16:17,190][26022] Updated weights on worker 0-0, policy_version 1011665 (0.00087) [2022-07-11 03:16:18,235][25689] Fps is (10 sec: 5496.0, 60 sec: 5556.0, 300 sec: 5557.8). Total num frames: 1035950080. Throughput: 0: 4915.5. Samples: 1035947652. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:16:18,235][25689] Avg episode reward: [(0, '-0.746')] [2022-07-11 03:16:18,973][26022] Updated weights on worker 0-0, policy_version 1011675 (0.00259) [2022-07-11 03:16:20,826][26022] Updated weights on worker 0-0, policy_version 1011685 (0.00080) [2022-07-11 03:16:22,622][26022] Updated weights on worker 0-0, policy_version 1011695 (0.00084) [2022-07-11 03:16:23,244][25689] Fps is (10 sec: 5715.2, 60 sec: 5572.4, 300 sec: 5561.6). Total num frames: 1035979776. Throughput: 0: 5771.4. Samples: 1035981688. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 03:16:23,245][25689] Avg episode reward: [(0, '-0.572')] [2022-07-11 03:16:24,638][26022] Updated weights on worker 0-0, policy_version 1011705 (0.00094) [2022-07-11 03:16:26,083][26022] Updated weights on worker 0-0, policy_version 1011715 (0.00093) [2022-07-11 03:16:28,243][26022] Updated weights on worker 0-0, policy_version 1011725 (0.00085) [2022-07-11 03:16:28,255][25689] Fps is (10 sec: 5620.8, 60 sec: 5558.3, 300 sec: 5555.8). Total num frames: 1036006400. Throughput: 0: 5886.2. Samples: 1036015288. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:28,255][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 03:16:29,755][26022] Updated weights on worker 0-0, policy_version 1011735 (0.00089) [2022-07-11 03:16:31,901][26022] Updated weights on worker 0-0, policy_version 1011745 (0.00083) [2022-07-11 03:16:33,315][25689] Fps is (10 sec: 5694.3, 60 sec: 5608.7, 300 sec: 5568.9). Total num frames: 1036037120. Throughput: 0: 5044.0. Samples: 1036032156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:33,315][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 03:16:33,319][26022] Updated weights on worker 0-0, policy_version 1011755 (0.00097) [2022-07-11 03:16:35,566][26022] Updated weights on worker 0-0, policy_version 1011765 (0.00089) [2022-07-11 03:16:37,171][26022] Updated weights on worker 0-0, policy_version 1011775 (0.00087) [2022-07-11 03:16:38,361][25689] Fps is (10 sec: 5572.6, 60 sec: 5538.0, 300 sec: 5557.9). Total num frames: 1036062720. Throughput: 0: 5856.9. Samples: 1036065532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:38,362][25689] Avg episode reward: [(0, '1.430')] [2022-07-11 03:16:39,052][26022] Updated weights on worker 0-0, policy_version 1011785 (0.00086) [2022-07-11 03:16:40,852][26022] Updated weights on worker 0-0, policy_version 1011795 (0.00092) [2022-07-11 03:16:42,746][26022] Updated weights on worker 0-0, policy_version 1011805 (0.00622) [2022-07-11 03:16:43,395][25689] Fps is (10 sec: 5282.3, 60 sec: 5554.6, 300 sec: 5558.2). Total num frames: 1036090368. Throughput: 0: 5823.3. Samples: 1036099032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:43,396][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 03:16:44,516][26022] Updated weights on worker 0-0, policy_version 1011815 (0.00095) [2022-07-11 03:16:46,413][26022] Updated weights on worker 0-0, policy_version 1011825 (0.00099) [2022-07-11 03:16:48,114][26022] Updated weights on worker 0-0, policy_version 1011835 (0.00097) [2022-07-11 03:16:48,397][25689] Fps is (10 sec: 5612.1, 60 sec: 5555.6, 300 sec: 5559.3). Total num frames: 1036119040. Throughput: 0: 4991.8. Samples: 1036115834. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:48,397][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 03:16:50,338][26022] Updated weights on worker 0-0, policy_version 1011845 (0.00052) [2022-07-11 03:16:51,711][26022] Updated weights on worker 0-0, policy_version 1011855 (0.00096) [2022-07-11 03:16:53,467][25689] Fps is (10 sec: 5592.1, 60 sec: 5555.8, 300 sec: 5556.1). Total num frames: 1036146688. Throughput: 0: 5811.1. Samples: 1036149260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:53,468][25689] Avg episode reward: [(0, '0.478')] [2022-07-11 03:16:53,733][26022] Updated weights on worker 0-0, policy_version 1011865 (0.00083) [2022-07-11 03:16:55,385][26022] Updated weights on worker 0-0, policy_version 1011875 (0.00471) [2022-07-11 03:16:57,413][26022] Updated weights on worker 0-0, policy_version 1011885 (0.00090) [2022-07-11 03:16:58,486][25689] Fps is (10 sec: 5683.8, 60 sec: 5578.5, 300 sec: 5559.5). Total num frames: 1036176384. Throughput: 0: 5846.2. Samples: 1036183184. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:16:58,488][25689] Avg episode reward: [(0, '0.749')] [2022-07-11 03:16:59,255][26022] Updated weights on worker 0-0, policy_version 1011895 (0.00082) [2022-07-11 03:17:01,178][26022] Updated weights on worker 0-0, policy_version 1011905 (0.00086) [2022-07-11 03:17:03,040][26022] Updated weights on worker 0-0, policy_version 1011915 (0.00084) [2022-07-11 03:17:03,519][25689] Fps is (10 sec: 5500.8, 60 sec: 5578.1, 300 sec: 5555.5). Total num frames: 1036201984. Throughput: 0: 5019.2. Samples: 1036200038. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:03,520][25689] Avg episode reward: [(0, '0.864')] [2022-07-11 03:17:05,195][26022] Updated weights on worker 0-0, policy_version 1011925 (0.00093) [2022-07-11 03:17:06,923][26022] Updated weights on worker 0-0, policy_version 1011935 (0.00088) [2022-07-11 03:17:08,614][25689] Fps is (10 sec: 5257.6, 60 sec: 5540.2, 300 sec: 5559.3). Total num frames: 1036229632. Throughput: 0: 5714.2. Samples: 1036231358. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:08,615][25689] Avg episode reward: [(0, '1.157')] [2022-07-11 03:17:08,885][26022] Updated weights on worker 0-0, policy_version 1011945 (0.00090) [2022-07-11 03:17:10,538][26022] Updated weights on worker 0-0, policy_version 1011955 (0.00086) [2022-07-11 03:17:12,347][26022] Updated weights on worker 0-0, policy_version 1011965 (0.00083) [2022-07-11 03:17:13,729][25689] Fps is (10 sec: 5516.4, 60 sec: 5550.0, 300 sec: 5554.6). Total num frames: 1036258304. Throughput: 0: 5705.9. Samples: 1036264872. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:13,729][25689] Avg episode reward: [(0, '1.166')] [2022-07-11 03:17:14,138][26022] Updated weights on worker 0-0, policy_version 1011975 (0.00078) [2022-07-11 03:17:16,199][26022] Updated weights on worker 0-0, policy_version 1011985 (0.00081) [2022-07-11 03:17:17,876][26022] Updated weights on worker 0-0, policy_version 1011995 (0.00091) [2022-07-11 03:17:18,765][25689] Fps is (10 sec: 5749.9, 60 sec: 5582.7, 300 sec: 5555.6). Total num frames: 1036288000. Throughput: 0: 4865.6. Samples: 1036281854. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:18,765][25689] Avg episode reward: [(0, '1.046')] [2022-07-11 03:17:19,922][26022] Updated weights on worker 0-0, policy_version 1012005 (0.00088) [2022-07-11 03:17:21,456][26022] Updated weights on worker 0-0, policy_version 1012015 (0.00075) [2022-07-11 03:17:23,405][26022] Updated weights on worker 0-0, policy_version 1012025 (0.00091) [2022-07-11 03:17:23,768][25689] Fps is (10 sec: 5609.7, 60 sec: 5532.5, 300 sec: 5556.2). Total num frames: 1036314624. Throughput: 0: 5713.7. Samples: 1036315736. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:23,769][25689] Avg episode reward: [(0, '1.451')] [2022-07-11 03:17:25,019][26022] Updated weights on worker 0-0, policy_version 1012035 (0.00085) [2022-07-11 03:17:26,954][26022] Updated weights on worker 0-0, policy_version 1012045 (0.00085) [2022-07-11 03:17:28,771][25689] Fps is (10 sec: 5423.9, 60 sec: 5550.1, 300 sec: 5554.8). Total num frames: 1036342272. Throughput: 0: 5842.0. Samples: 1036349116. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:28,771][25689] Avg episode reward: [(0, '0.749')] [2022-07-11 03:17:28,876][26022] Updated weights on worker 0-0, policy_version 1012055 (0.00084) [2022-07-11 03:17:29,899][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:17:29,914][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001012061_1036350464.pth [2022-07-11 03:17:29,914][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001010102_1034344448.pth [2022-07-11 03:17:30,741][26022] Updated weights on worker 0-0, policy_version 1012065 (0.00091) [2022-07-11 03:17:32,659][26022] Updated weights on worker 0-0, policy_version 1012075 (0.00094) [2022-07-11 03:17:33,865][25689] Fps is (10 sec: 5780.6, 60 sec: 5547.0, 300 sec: 5564.6). Total num frames: 1036372992. Throughput: 0: 5007.2. Samples: 1036365702. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:33,866][25689] Avg episode reward: [(0, '0.906')] [2022-07-11 03:17:34,448][26022] Updated weights on worker 0-0, policy_version 1012085 (0.00096) [2022-07-11 03:17:36,198][26022] Updated weights on worker 0-0, policy_version 1012095 (0.00091) [2022-07-11 03:17:38,212][26022] Updated weights on worker 0-0, policy_version 1012105 (0.00080) [2022-07-11 03:17:38,893][25689] Fps is (10 sec: 5462.9, 60 sec: 5531.8, 300 sec: 5547.1). Total num frames: 1036397568. Throughput: 0: 5830.4. Samples: 1036399208. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:38,893][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 03:17:39,776][26022] Updated weights on worker 0-0, policy_version 1012115 (0.00083) [2022-07-11 03:17:42,008][26022] Updated weights on worker 0-0, policy_version 1012125 (0.00092) [2022-07-11 03:17:43,254][26022] Updated weights on worker 0-0, policy_version 1012135 (0.00092) [2022-07-11 03:17:43,906][25689] Fps is (10 sec: 5609.0, 60 sec: 5601.3, 300 sec: 5565.2). Total num frames: 1036429312. Throughput: 0: 5828.7. Samples: 1036433116. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:43,907][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 03:17:45,615][26022] Updated weights on worker 0-0, policy_version 1012145 (0.00086) [2022-07-11 03:17:46,813][26022] Updated weights on worker 0-0, policy_version 1012155 (0.00085) [2022-07-11 03:17:48,912][25689] Fps is (10 sec: 5621.5, 60 sec: 5533.3, 300 sec: 5553.1). Total num frames: 1036453888. Throughput: 0: 5005.7. Samples: 1036449934. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:48,912][25689] Avg episode reward: [(0, '1.282')] [2022-07-11 03:17:49,246][26022] Updated weights on worker 0-0, policy_version 1012165 (0.00088) [2022-07-11 03:17:50,617][26022] Updated weights on worker 0-0, policy_version 1012175 (0.00090) [2022-07-11 03:17:52,802][26022] Updated weights on worker 0-0, policy_version 1012185 (0.00083) [2022-07-11 03:17:53,997][25689] Fps is (10 sec: 5479.7, 60 sec: 5582.6, 300 sec: 5556.1). Total num frames: 1036484608. Throughput: 0: 5854.2. Samples: 1036483558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:53,998][25689] Avg episode reward: [(0, '1.418')] [2022-07-11 03:17:54,300][26022] Updated weights on worker 0-0, policy_version 1012195 (0.00082) [2022-07-11 03:17:56,326][26022] Updated weights on worker 0-0, policy_version 1012205 (0.00087) [2022-07-11 03:17:57,946][26022] Updated weights on worker 0-0, policy_version 1012215 (0.00085) [2022-07-11 03:17:59,019][25689] Fps is (10 sec: 5775.0, 60 sec: 5548.6, 300 sec: 5559.3). Total num frames: 1036512256. Throughput: 0: 5881.5. Samples: 1036517578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:17:59,019][25689] Avg episode reward: [(0, '1.482')] [2022-07-11 03:18:00,032][26022] Updated weights on worker 0-0, policy_version 1012225 (0.00080) [2022-07-11 03:18:01,553][26022] Updated weights on worker 0-0, policy_version 1012235 (0.00090) [2022-07-11 03:18:04,035][25689] Fps is (10 sec: 5305.0, 60 sec: 5550.1, 300 sec: 5556.7). Total num frames: 1036537856. Throughput: 0: 5041.3. Samples: 1036534590. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:04,035][25689] Avg episode reward: [(0, '1.451')] [2022-07-11 03:18:04,039][26022] Updated weights on worker 0-0, policy_version 1012245 (0.00100) [2022-07-11 03:18:05,669][26022] Updated weights on worker 0-0, policy_version 1012255 (0.00091) [2022-07-11 03:18:07,749][26022] Updated weights on worker 0-0, policy_version 1012265 (0.00087) [2022-07-11 03:18:09,114][25689] Fps is (10 sec: 5477.1, 60 sec: 5585.4, 300 sec: 5561.2). Total num frames: 1036567552. Throughput: 0: 5748.5. Samples: 1036566070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:09,115][25689] Avg episode reward: [(0, '1.186')] [2022-07-11 03:18:09,502][26022] Updated weights on worker 0-0, policy_version 1012275 (0.00089) [2022-07-11 03:18:11,306][26022] Updated weights on worker 0-0, policy_version 1012285 (0.00083) [2022-07-11 03:18:13,238][26022] Updated weights on worker 0-0, policy_version 1012295 (0.00089) [2022-07-11 03:18:14,208][25689] Fps is (10 sec: 5636.6, 60 sec: 5570.4, 300 sec: 5556.6). Total num frames: 1036595200. Throughput: 0: 5742.8. Samples: 1036599626. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:14,209][25689] Avg episode reward: [(0, '1.272')] [2022-07-11 03:18:14,907][26022] Updated weights on worker 0-0, policy_version 1012305 (0.00092) [2022-07-11 03:18:16,736][26022] Updated weights on worker 0-0, policy_version 1012315 (0.00087) [2022-07-11 03:18:18,818][26022] Updated weights on worker 0-0, policy_version 1012325 (0.00089) [2022-07-11 03:18:19,289][25689] Fps is (10 sec: 5535.2, 60 sec: 5549.3, 300 sec: 5558.6). Total num frames: 1036623872. Throughput: 0: 5697.7. Samples: 1036633074. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:19,291][25689] Avg episode reward: [(0, '0.880')] [2022-07-11 03:18:20,659][26022] Updated weights on worker 0-0, policy_version 1012335 (0.00053) [2022-07-11 03:18:22,438][26022] Updated weights on worker 0-0, policy_version 1012345 (0.00087) [2022-07-11 03:18:24,096][26022] Updated weights on worker 0-0, policy_version 1012355 (0.00083) [2022-07-11 03:18:24,349][25689] Fps is (10 sec: 5553.8, 60 sec: 5561.1, 300 sec: 5555.1). Total num frames: 1036651520. Throughput: 0: 5672.8. Samples: 1036649828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:24,350][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 03:18:26,215][26022] Updated weights on worker 0-0, policy_version 1012365 (0.00088) [2022-07-11 03:18:27,805][26022] Updated weights on worker 0-0, policy_version 1012375 (0.00086) [2022-07-11 03:18:29,412][25689] Fps is (10 sec: 5462.7, 60 sec: 5555.6, 300 sec: 5558.6). Total num frames: 1036679168. Throughput: 0: 5764.8. Samples: 1036683080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:29,412][25689] Avg episode reward: [(0, '1.046')] [2022-07-11 03:18:29,980][26022] Updated weights on worker 0-0, policy_version 1012385 (0.00086) [2022-07-11 03:18:31,475][26022] Updated weights on worker 0-0, policy_version 1012395 (0.00085) [2022-07-11 03:18:33,663][26022] Updated weights on worker 0-0, policy_version 1012405 (0.00108) [2022-07-11 03:18:34,527][25689] Fps is (10 sec: 5634.4, 60 sec: 5536.9, 300 sec: 5556.8). Total num frames: 1036708864. Throughput: 0: 5758.3. Samples: 1036716624. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:34,529][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 03:18:35,042][26022] Updated weights on worker 0-0, policy_version 1012415 (0.00091) [2022-07-11 03:18:37,302][26022] Updated weights on worker 0-0, policy_version 1012425 (0.00095) [2022-07-11 03:18:38,815][26022] Updated weights on worker 0-0, policy_version 1012435 (0.00087) [2022-07-11 03:18:39,594][25689] Fps is (10 sec: 5531.1, 60 sec: 5566.9, 300 sec: 5556.9). Total num frames: 1036735488. Throughput: 0: 4925.8. Samples: 1036733092. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:39,595][25689] Avg episode reward: [(0, '1.051')] [2022-07-11 03:18:40,843][26022] Updated weights on worker 0-0, policy_version 1012445 (0.00088) [2022-07-11 03:18:42,549][26022] Updated weights on worker 0-0, policy_version 1012455 (0.00086) [2022-07-11 03:18:44,575][26022] Updated weights on worker 0-0, policy_version 1012465 (0.00098) [2022-07-11 03:18:44,620][25689] Fps is (10 sec: 5478.5, 60 sec: 5515.2, 300 sec: 5553.8). Total num frames: 1036764160. Throughput: 0: 5781.7. Samples: 1036767026. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:44,621][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 03:18:46,255][26022] Updated weights on worker 0-0, policy_version 1012475 (0.00085) [2022-07-11 03:18:48,133][26022] Updated weights on worker 0-0, policy_version 1012485 (0.00088) [2022-07-11 03:18:49,630][25689] Fps is (10 sec: 5713.7, 60 sec: 5582.2, 300 sec: 5561.9). Total num frames: 1036792832. Throughput: 0: 5808.4. Samples: 1036800518. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:49,631][25689] Avg episode reward: [(0, '1.307')] [2022-07-11 03:18:49,878][26022] Updated weights on worker 0-0, policy_version 1012495 (0.00094) [2022-07-11 03:18:51,822][26022] Updated weights on worker 0-0, policy_version 1012505 (0.00078) [2022-07-11 03:18:53,698][26022] Updated weights on worker 0-0, policy_version 1012515 (0.00088) [2022-07-11 03:18:54,699][25689] Fps is (10 sec: 5588.0, 60 sec: 5533.2, 300 sec: 5554.3). Total num frames: 1036820480. Throughput: 0: 4972.2. Samples: 1036816924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:54,699][25689] Avg episode reward: [(0, '1.141')] [2022-07-11 03:18:55,513][26022] Updated weights on worker 0-0, policy_version 1012525 (0.00083) [2022-07-11 03:18:57,532][26022] Updated weights on worker 0-0, policy_version 1012535 (0.00088) [2022-07-11 03:18:59,063][26022] Updated weights on worker 0-0, policy_version 1012545 (0.01395) [2022-07-11 03:18:59,703][25689] Fps is (10 sec: 5591.5, 60 sec: 5551.6, 300 sec: 5571.4). Total num frames: 1036849152. Throughput: 0: 5859.4. Samples: 1036850916. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:18:59,703][25689] Avg episode reward: [(0, '1.316')] [2022-07-11 03:19:01,068][26022] Updated weights on worker 0-0, policy_version 1012555 (0.00102) [2022-07-11 03:19:03,103][26022] Updated weights on worker 0-0, policy_version 1012565 (0.00084) [2022-07-11 03:19:04,706][25689] Fps is (10 sec: 5423.2, 60 sec: 5552.8, 300 sec: 5561.7). Total num frames: 1036874752. Throughput: 0: 5743.2. Samples: 1036882386. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:04,707][25689] Avg episode reward: [(0, '1.081')] [2022-07-11 03:19:04,962][26022] Updated weights on worker 0-0, policy_version 1012575 (0.00087) [2022-07-11 03:19:06,871][26022] Updated weights on worker 0-0, policy_version 1012585 (0.00097) [2022-07-11 03:19:08,543][26022] Updated weights on worker 0-0, policy_version 1012595 (0.00093) [2022-07-11 03:19:09,710][25689] Fps is (10 sec: 5218.4, 60 sec: 5509.0, 300 sec: 5556.2). Total num frames: 1036901376. Throughput: 0: 4901.1. Samples: 1036898932. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:09,711][25689] Avg episode reward: [(0, '1.003')] [2022-07-11 03:19:10,659][26022] Updated weights on worker 0-0, policy_version 1012605 (0.00088) [2022-07-11 03:19:12,358][26022] Updated weights on worker 0-0, policy_version 1012615 (0.00081) [2022-07-11 03:19:14,211][26022] Updated weights on worker 0-0, policy_version 1012625 (0.00088) [2022-07-11 03:19:14,776][25689] Fps is (10 sec: 5593.0, 60 sec: 5545.4, 300 sec: 5562.1). Total num frames: 1036931072. Throughput: 0: 5746.5. Samples: 1036932296. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:14,776][25689] Avg episode reward: [(0, '1.346')] [2022-07-11 03:19:16,099][26022] Updated weights on worker 0-0, policy_version 1012635 (0.00089) [2022-07-11 03:19:17,830][26022] Updated weights on worker 0-0, policy_version 1012645 (0.00084) [2022-07-11 03:19:19,754][26022] Updated weights on worker 0-0, policy_version 1012655 (0.00081) [2022-07-11 03:19:19,777][25689] Fps is (10 sec: 5696.5, 60 sec: 5535.8, 300 sec: 5555.4). Total num frames: 1036958720. Throughput: 0: 5729.1. Samples: 1036965924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:19,778][25689] Avg episode reward: [(0, '1.418')] [2022-07-11 03:19:21,707][26022] Updated weights on worker 0-0, policy_version 1012665 (0.00090) [2022-07-11 03:19:23,340][26022] Updated weights on worker 0-0, policy_version 1012675 (0.00088) [2022-07-11 03:19:24,789][25689] Fps is (10 sec: 5420.1, 60 sec: 5523.3, 300 sec: 5552.0). Total num frames: 1036985344. Throughput: 0: 5004.1. Samples: 1036982882. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:24,789][25689] Avg episode reward: [(0, '1.578')] [2022-07-11 03:19:25,282][26022] Updated weights on worker 0-0, policy_version 1012685 (0.00086) [2022-07-11 03:19:27,066][26022] Updated weights on worker 0-0, policy_version 1012695 (0.00099) [2022-07-11 03:19:28,909][26022] Updated weights on worker 0-0, policy_version 1012705 (0.00092) [2022-07-11 03:19:29,795][25689] Fps is (10 sec: 5724.3, 60 sec: 5579.3, 300 sec: 5563.2). Total num frames: 1037016064. Throughput: 0: 5861.9. Samples: 1037016664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:29,795][25689] Avg episode reward: [(0, '1.616')] [2022-07-11 03:19:30,163][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:19:30,179][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001012712_1037017088.pth [2022-07-11 03:19:30,179][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001010755_1035013120.pth [2022-07-11 03:19:30,724][26022] Updated weights on worker 0-0, policy_version 1012715 (0.00095) [2022-07-11 03:19:32,339][26022] Updated weights on worker 0-0, policy_version 1012725 (0.00085) [2022-07-11 03:19:34,424][26022] Updated weights on worker 0-0, policy_version 1012735 (0.00091) [2022-07-11 03:19:34,854][25689] Fps is (10 sec: 5697.4, 60 sec: 5533.6, 300 sec: 5558.9). Total num frames: 1037042688. Throughput: 0: 5881.6. Samples: 1037050388. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:34,854][25689] Avg episode reward: [(0, '2.080')] [2022-07-11 03:19:36,099][26022] Updated weights on worker 0-0, policy_version 1012745 (0.00103) [2022-07-11 03:19:37,945][26022] Updated weights on worker 0-0, policy_version 1012755 (0.00086) [2022-07-11 03:19:39,722][26022] Updated weights on worker 0-0, policy_version 1012765 (0.00103) [2022-07-11 03:19:39,872][25689] Fps is (10 sec: 5588.6, 60 sec: 5589.0, 300 sec: 5562.2). Total num frames: 1037072384. Throughput: 0: 5026.7. Samples: 1037066938. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:39,873][25689] Avg episode reward: [(0, '1.719')] [2022-07-11 03:19:41,822][26022] Updated weights on worker 0-0, policy_version 1012775 (0.00086) [2022-07-11 03:19:43,440][26022] Updated weights on worker 0-0, policy_version 1012785 (0.00084) [2022-07-11 03:19:44,878][25689] Fps is (10 sec: 5618.4, 60 sec: 5556.9, 300 sec: 5562.4). Total num frames: 1037099008. Throughput: 0: 5873.7. Samples: 1037100880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:44,880][25689] Avg episode reward: [(0, '1.886')] [2022-07-11 03:19:45,464][26022] Updated weights on worker 0-0, policy_version 1012795 (0.00092) [2022-07-11 03:19:46,910][26022] Updated weights on worker 0-0, policy_version 1012805 (0.00092) [2022-07-11 03:19:49,067][26022] Updated weights on worker 0-0, policy_version 1012815 (0.00097) [2022-07-11 03:19:49,886][25689] Fps is (10 sec: 5521.9, 60 sec: 5557.1, 300 sec: 5559.8). Total num frames: 1037127680. Throughput: 0: 5856.7. Samples: 1037134334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:49,887][25689] Avg episode reward: [(0, '1.455')] [2022-07-11 03:19:50,727][26022] Updated weights on worker 0-0, policy_version 1012825 (0.00092) [2022-07-11 03:19:52,727][26022] Updated weights on worker 0-0, policy_version 1012835 (0.00088) [2022-07-11 03:19:54,324][26022] Updated weights on worker 0-0, policy_version 1012845 (0.00089) [2022-07-11 03:19:54,947][25689] Fps is (10 sec: 5695.3, 60 sec: 5574.8, 300 sec: 5562.8). Total num frames: 1037156352. Throughput: 0: 5016.6. Samples: 1037151186. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:54,947][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 03:19:56,337][26022] Updated weights on worker 0-0, policy_version 1012855 (0.00084) [2022-07-11 03:19:57,994][26022] Updated weights on worker 0-0, policy_version 1012865 (0.00081) [2022-07-11 03:19:59,951][25689] Fps is (10 sec: 5493.9, 60 sec: 5540.8, 300 sec: 5563.0). Total num frames: 1037182976. Throughput: 0: 5880.4. Samples: 1037185010. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:19:59,952][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 03:20:00,056][26022] Updated weights on worker 0-0, policy_version 1012875 (0.00091) [2022-07-11 03:20:01,973][26022] Updated weights on worker 0-0, policy_version 1012885 (0.00091) [2022-07-11 03:20:04,102][26022] Updated weights on worker 0-0, policy_version 1012895 (0.00088) [2022-07-11 03:20:04,961][25689] Fps is (10 sec: 5419.3, 60 sec: 5574.2, 300 sec: 5563.1). Total num frames: 1037210624. Throughput: 0: 5755.1. Samples: 1037216462. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:20:04,962][25689] Avg episode reward: [(0, '1.977')] [2022-07-11 03:20:05,755][26022] Updated weights on worker 0-0, policy_version 1012905 (0.00086) [2022-07-11 03:20:07,754][26022] Updated weights on worker 0-0, policy_version 1012915 (0.00087) [2022-07-11 03:20:09,479][26022] Updated weights on worker 0-0, policy_version 1012925 (0.00086) [2022-07-11 03:20:09,964][25689] Fps is (10 sec: 5420.5, 60 sec: 5574.3, 300 sec: 5553.7). Total num frames: 1037237248. Throughput: 0: 4915.9. Samples: 1037233032. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:20:09,964][25689] Avg episode reward: [(0, '1.845')] [2022-07-11 03:20:11,308][26022] Updated weights on worker 0-0, policy_version 1012935 (0.00082) [2022-07-11 03:20:13,191][26022] Updated weights on worker 0-0, policy_version 1012945 (0.00091) [2022-07-11 03:20:15,033][25689] Fps is (10 sec: 5388.6, 60 sec: 5540.0, 300 sec: 5553.2). Total num frames: 1037264896. Throughput: 0: 5738.9. Samples: 1037266458. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:20:15,033][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 03:20:15,115][26022] Updated weights on worker 0-0, policy_version 1012955 (0.00077) [2022-07-11 03:20:16,857][26022] Updated weights on worker 0-0, policy_version 1012965 (0.00087) [2022-07-11 03:20:18,803][26022] Updated weights on worker 0-0, policy_version 1012975 (0.00093) [2022-07-11 03:20:20,049][25689] Fps is (10 sec: 5584.3, 60 sec: 5555.6, 300 sec: 5552.9). Total num frames: 1037293568. Throughput: 0: 5740.0. Samples: 1037300372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:20:20,050][25689] Avg episode reward: [(0, '1.830')] [2022-07-11 03:20:20,473][26022] Updated weights on worker 0-0, policy_version 1012985 (0.00088) [2022-07-11 03:20:22,354][26022] Updated weights on worker 0-0, policy_version 1012995 (0.00089) [2022-07-11 03:20:23,983][26022] Updated weights on worker 0-0, policy_version 1013005 (0.00084) [2022-07-11 03:20:25,059][25689] Fps is (10 sec: 5719.6, 60 sec: 5589.8, 300 sec: 5557.0). Total num frames: 1037322240. Throughput: 0: 5018.5. Samples: 1037317320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 03:20:25,059][25689] Avg episode reward: [(0, '1.793')] [2022-07-11 03:20:25,901][26022] Updated weights on worker 0-0, policy_version 1013015 (0.00085) [2022-07-11 03:20:27,622][26022] Updated weights on worker 0-0, policy_version 1013025 (0.00094) [2022-07-11 03:20:29,730][26022] Updated weights on worker 0-0, policy_version 1013035 (0.00096) [2022-07-11 03:20:30,067][25689] Fps is (10 sec: 5519.9, 60 sec: 5521.6, 300 sec: 5554.4). Total num frames: 1037348864. Throughput: 0: 5872.1. Samples: 1037351078. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:20:30,067][25689] Avg episode reward: [(0, '1.598')] [2022-07-11 03:20:31,425][26022] Updated weights on worker 0-0, policy_version 1013045 (0.00086) [2022-07-11 03:20:33,342][26022] Updated weights on worker 0-0, policy_version 1013055 (0.00090) [2022-07-11 03:20:34,978][26022] Updated weights on worker 0-0, policy_version 1013065 (0.00092) [2022-07-11 03:20:35,135][25689] Fps is (10 sec: 5691.2, 60 sec: 5588.8, 300 sec: 5556.9). Total num frames: 1037379584. Throughput: 0: 5887.8. Samples: 1037384814. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:20:35,135][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 03:20:36,907][26022] Updated weights on worker 0-0, policy_version 1013075 (0.00081) [2022-07-11 03:20:38,839][26022] Updated weights on worker 0-0, policy_version 1013085 (0.00085) [2022-07-11 03:20:40,168][25689] Fps is (10 sec: 5778.3, 60 sec: 5553.4, 300 sec: 5560.3). Total num frames: 1037407232. Throughput: 0: 5044.3. Samples: 1037401858. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:20:40,170][25689] Avg episode reward: [(0, '0.138')] [2022-07-11 03:20:40,471][26022] Updated weights on worker 0-0, policy_version 1013095 (0.00086) [2022-07-11 03:20:42,343][26022] Updated weights on worker 0-0, policy_version 1013105 (0.00087) [2022-07-11 03:20:44,130][26022] Updated weights on worker 0-0, policy_version 1013115 (0.00090) [2022-07-11 03:20:45,176][25689] Fps is (10 sec: 5506.9, 60 sec: 5570.2, 300 sec: 5556.9). Total num frames: 1037434880. Throughput: 0: 5865.1. Samples: 1037435310. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:20:45,177][25689] Avg episode reward: [(0, '0.159')] [2022-07-11 03:20:45,917][26022] Updated weights on worker 0-0, policy_version 1013125 (0.00072) [2022-07-11 03:20:47,881][26022] Updated weights on worker 0-0, policy_version 1013135 (0.00082) [2022-07-11 03:20:49,545][26022] Updated weights on worker 0-0, policy_version 1013145 (0.00087) [2022-07-11 03:20:50,187][25689] Fps is (10 sec: 5519.2, 60 sec: 5553.0, 300 sec: 5558.0). Total num frames: 1037462528. Throughput: 0: 5876.5. Samples: 1037469314. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:20:50,187][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 03:20:51,583][26022] Updated weights on worker 0-0, policy_version 1013155 (0.00109) [2022-07-11 03:20:53,359][26022] Updated weights on worker 0-0, policy_version 1013165 (0.00089) [2022-07-11 03:20:54,937][26022] Updated weights on worker 0-0, policy_version 1013175 (0.00084) [2022-07-11 03:20:55,251][25689] Fps is (10 sec: 5691.8, 60 sec: 5569.6, 300 sec: 5561.8). Total num frames: 1037492224. Throughput: 0: 5021.1. Samples: 1037485818. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:20:55,251][25689] Avg episode reward: [(0, '0.300')] [2022-07-11 03:20:57,105][26022] Updated weights on worker 0-0, policy_version 1013185 (0.00094) [2022-07-11 03:20:58,551][26022] Updated weights on worker 0-0, policy_version 1013195 (0.00084) [2022-07-11 03:21:00,283][25689] Fps is (10 sec: 5679.8, 60 sec: 5584.1, 300 sec: 5568.7). Total num frames: 1037519872. Throughput: 0: 5861.6. Samples: 1037519764. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:00,285][25689] Avg episode reward: [(0, '0.580')] [2022-07-11 03:21:00,672][26022] Updated weights on worker 0-0, policy_version 1013205 (0.00092) [2022-07-11 03:21:02,693][26022] Updated weights on worker 0-0, policy_version 1013215 (0.00083) [2022-07-11 03:21:04,513][26022] Updated weights on worker 0-0, policy_version 1013225 (0.00084) [2022-07-11 03:21:05,294][25689] Fps is (10 sec: 5199.7, 60 sec: 5533.0, 300 sec: 5552.2). Total num frames: 1037544448. Throughput: 0: 5770.7. Samples: 1037551408. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:05,296][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 03:21:06,456][26022] Updated weights on worker 0-0, policy_version 1013235 (0.00093) [2022-07-11 03:21:08,131][26022] Updated weights on worker 0-0, policy_version 1013245 (0.00103) [2022-07-11 03:21:10,140][26022] Updated weights on worker 0-0, policy_version 1013255 (0.00085) [2022-07-11 03:21:10,310][25689] Fps is (10 sec: 5412.6, 60 sec: 5582.7, 300 sec: 5559.5). Total num frames: 1037574144. Throughput: 0: 4915.0. Samples: 1037568220. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:10,312][25689] Avg episode reward: [(0, '0.263')] [2022-07-11 03:21:11,888][26022] Updated weights on worker 0-0, policy_version 1013265 (0.00088) [2022-07-11 03:21:13,604][26022] Updated weights on worker 0-0, policy_version 1013275 (0.00093) [2022-07-11 03:21:15,361][25689] Fps is (10 sec: 5696.2, 60 sec: 5584.4, 300 sec: 5558.9). Total num frames: 1037601792. Throughput: 0: 5759.7. Samples: 1037601650. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:15,362][25689] Avg episode reward: [(0, '-0.402')] [2022-07-11 03:21:15,716][26022] Updated weights on worker 0-0, policy_version 1013285 (0.00108) [2022-07-11 03:21:17,335][26022] Updated weights on worker 0-0, policy_version 1013295 (0.00459) [2022-07-11 03:21:19,370][26022] Updated weights on worker 0-0, policy_version 1013305 (0.00092) [2022-07-11 03:21:20,365][25689] Fps is (10 sec: 5702.8, 60 sec: 5602.5, 300 sec: 5559.1). Total num frames: 1037631488. Throughput: 0: 5750.8. Samples: 1037635254. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:20,366][25689] Avg episode reward: [(0, '-0.470')] [2022-07-11 03:21:21,071][26022] Updated weights on worker 0-0, policy_version 1013315 (0.00090) [2022-07-11 03:21:22,863][26022] Updated weights on worker 0-0, policy_version 1013325 (0.00083) [2022-07-11 03:21:24,804][26022] Updated weights on worker 0-0, policy_version 1013335 (0.00091) [2022-07-11 03:21:25,379][25689] Fps is (10 sec: 5520.1, 60 sec: 5551.2, 300 sec: 5555.6). Total num frames: 1037657088. Throughput: 0: 5007.5. Samples: 1037651980. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:25,379][25689] Avg episode reward: [(0, '-0.060')] [2022-07-11 03:21:26,734][26022] Updated weights on worker 0-0, policy_version 1013345 (0.00082) [2022-07-11 03:21:28,412][26022] Updated weights on worker 0-0, policy_version 1013355 (0.00080) [2022-07-11 03:21:30,206][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:21:30,220][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001013365_1037685760.pth [2022-07-11 03:21:30,220][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001011407_1035680768.pth [2022-07-11 03:21:30,228][26022] Updated weights on worker 0-0, policy_version 1013365 (0.00094) [2022-07-11 03:21:30,408][25689] Fps is (10 sec: 5506.1, 60 sec: 5600.1, 300 sec: 5552.7). Total num frames: 1037686784. Throughput: 0: 5853.6. Samples: 1037685866. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:30,409][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 03:21:32,002][26022] Updated weights on worker 0-0, policy_version 1013375 (0.00084) [2022-07-11 03:21:33,807][26022] Updated weights on worker 0-0, policy_version 1013385 (0.00089) [2022-07-11 03:21:35,465][25689] Fps is (10 sec: 5685.6, 60 sec: 5550.3, 300 sec: 5559.4). Total num frames: 1037714432. Throughput: 0: 5863.7. Samples: 1037719528. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:35,465][25689] Avg episode reward: [(0, '1.026')] [2022-07-11 03:21:35,786][26022] Updated weights on worker 0-0, policy_version 1013395 (0.00089) [2022-07-11 03:21:37,535][26022] Updated weights on worker 0-0, policy_version 1013405 (0.00093) [2022-07-11 03:21:39,421][26022] Updated weights on worker 0-0, policy_version 1013415 (0.00088) [2022-07-11 03:21:40,509][25689] Fps is (10 sec: 5575.9, 60 sec: 5566.3, 300 sec: 5562.6). Total num frames: 1037743104. Throughput: 0: 5020.3. Samples: 1037736378. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:40,509][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 03:21:41,340][26022] Updated weights on worker 0-0, policy_version 1013425 (0.00092) [2022-07-11 03:21:43,195][26022] Updated weights on worker 0-0, policy_version 1013435 (0.00087) [2022-07-11 03:21:44,921][26022] Updated weights on worker 0-0, policy_version 1013445 (0.00088) [2022-07-11 03:21:45,521][25689] Fps is (10 sec: 5600.3, 60 sec: 5565.8, 300 sec: 5559.0). Total num frames: 1037770752. Throughput: 0: 5847.7. Samples: 1037769768. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:45,522][25689] Avg episode reward: [(0, '1.268')] [2022-07-11 03:21:46,628][26022] Updated weights on worker 0-0, policy_version 1013455 (0.00087) [2022-07-11 03:21:48,670][26022] Updated weights on worker 0-0, policy_version 1013465 (0.00084) [2022-07-11 03:21:50,389][26022] Updated weights on worker 0-0, policy_version 1013475 (0.00093) [2022-07-11 03:21:50,525][25689] Fps is (10 sec: 5520.3, 60 sec: 5566.5, 300 sec: 5560.2). Total num frames: 1037798400. Throughput: 0: 5848.4. Samples: 1037803520. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:50,526][25689] Avg episode reward: [(0, '1.101')] [2022-07-11 03:21:51,997][26022] Updated weights on worker 0-0, policy_version 1013485 (0.00083) [2022-07-11 03:21:54,116][26022] Updated weights on worker 0-0, policy_version 1013495 (0.00105) [2022-07-11 03:21:55,569][25689] Fps is (10 sec: 5707.3, 60 sec: 5568.4, 300 sec: 5559.8). Total num frames: 1037828096. Throughput: 0: 5016.5. Samples: 1037820380. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:21:55,571][25689] Avg episode reward: [(0, '0.215')] [2022-07-11 03:21:55,700][26022] Updated weights on worker 0-0, policy_version 1013505 (0.00087) [2022-07-11 03:21:57,584][26022] Updated weights on worker 0-0, policy_version 1013515 (0.00093) [2022-07-11 03:21:59,411][26022] Updated weights on worker 0-0, policy_version 1013525 (0.00095) [2022-07-11 03:22:00,647][25689] Fps is (10 sec: 5665.6, 60 sec: 5564.1, 300 sec: 5565.8). Total num frames: 1037855744. Throughput: 0: 5851.7. Samples: 1037854222. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:00,647][25689] Avg episode reward: [(0, '0.131')] [2022-07-11 03:22:01,348][26022] Updated weights on worker 0-0, policy_version 1013535 (0.00091) [2022-07-11 03:22:03,599][26022] Updated weights on worker 0-0, policy_version 1013545 (0.00094) [2022-07-11 03:22:05,356][26022] Updated weights on worker 0-0, policy_version 1013555 (0.00090) [2022-07-11 03:22:05,663][25689] Fps is (10 sec: 5275.3, 60 sec: 5580.7, 300 sec: 5560.4). Total num frames: 1037881344. Throughput: 0: 5758.4. Samples: 1037885750. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:05,663][25689] Avg episode reward: [(0, '0.260')] [2022-07-11 03:22:07,051][26022] Updated weights on worker 0-0, policy_version 1013565 (0.00091) [2022-07-11 03:22:09,092][26022] Updated weights on worker 0-0, policy_version 1013575 (0.00053) [2022-07-11 03:22:10,675][25689] Fps is (10 sec: 5412.1, 60 sec: 5564.0, 300 sec: 5562.3). Total num frames: 1037910016. Throughput: 0: 5759.2. Samples: 1037919564. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:10,675][25689] Avg episode reward: [(0, '0.674')] [2022-07-11 03:22:10,831][26022] Updated weights on worker 0-0, policy_version 1013585 (0.00096) [2022-07-11 03:22:12,717][26022] Updated weights on worker 0-0, policy_version 1013595 (0.00088) [2022-07-11 03:22:14,441][26022] Updated weights on worker 0-0, policy_version 1013605 (0.01026) [2022-07-11 03:22:15,728][25689] Fps is (10 sec: 5595.2, 60 sec: 5563.8, 300 sec: 5555.1). Total num frames: 1037937664. Throughput: 0: 5735.9. Samples: 1037936014. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:15,729][25689] Avg episode reward: [(0, '0.760')] [2022-07-11 03:22:16,400][26022] Updated weights on worker 0-0, policy_version 1013615 (0.00089) [2022-07-11 03:22:17,987][26022] Updated weights on worker 0-0, policy_version 1013625 (0.00085) [2022-07-11 03:22:20,020][26022] Updated weights on worker 0-0, policy_version 1013635 (0.00089) [2022-07-11 03:22:20,733][25689] Fps is (10 sec: 5497.4, 60 sec: 5529.8, 300 sec: 5558.5). Total num frames: 1037965312. Throughput: 0: 5747.4. Samples: 1037969666. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:20,734][25689] Avg episode reward: [(0, '0.515')] [2022-07-11 03:22:21,651][26022] Updated weights on worker 0-0, policy_version 1013645 (0.00087) [2022-07-11 03:22:23,815][26022] Updated weights on worker 0-0, policy_version 1013655 (0.00084) [2022-07-11 03:22:25,453][26022] Updated weights on worker 0-0, policy_version 1013665 (0.00098) [2022-07-11 03:22:25,777][25689] Fps is (10 sec: 5604.8, 60 sec: 5577.9, 300 sec: 5561.2). Total num frames: 1037993984. Throughput: 0: 5833.0. Samples: 1038003078. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:25,779][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 03:22:27,481][26022] Updated weights on worker 0-0, policy_version 1013675 (0.00086) [2022-07-11 03:22:29,294][26022] Updated weights on worker 0-0, policy_version 1013685 (0.00082) [2022-07-11 03:22:30,799][25689] Fps is (10 sec: 5595.4, 60 sec: 5544.7, 300 sec: 5552.2). Total num frames: 1038021632. Throughput: 0: 4984.1. Samples: 1038019864. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:30,799][25689] Avg episode reward: [(0, '0.420')] [2022-07-11 03:22:30,959][26022] Updated weights on worker 0-0, policy_version 1013695 (0.00089) [2022-07-11 03:22:33,143][26022] Updated weights on worker 0-0, policy_version 1013705 (0.00247) [2022-07-11 03:22:34,746][26022] Updated weights on worker 0-0, policy_version 1013715 (0.00082) [2022-07-11 03:22:35,860][25689] Fps is (10 sec: 5585.7, 60 sec: 5561.2, 300 sec: 5565.4). Total num frames: 1038050304. Throughput: 0: 5846.9. Samples: 1038053720. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:35,860][25689] Avg episode reward: [(0, '0.421')] [2022-07-11 03:22:36,443][26022] Updated weights on worker 0-0, policy_version 1013725 (0.00098) [2022-07-11 03:22:38,235][26022] Updated weights on worker 0-0, policy_version 1013735 (0.00082) [2022-07-11 03:22:40,120][26022] Updated weights on worker 0-0, policy_version 1013745 (0.00113) [2022-07-11 03:22:40,869][25689] Fps is (10 sec: 5592.5, 60 sec: 5547.4, 300 sec: 5551.7). Total num frames: 1038077952. Throughput: 0: 5849.8. Samples: 1038087458. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:40,870][25689] Avg episode reward: [(0, '0.159')] [2022-07-11 03:22:42,138][26022] Updated weights on worker 0-0, policy_version 1013755 (0.00097) [2022-07-11 03:22:43,875][26022] Updated weights on worker 0-0, policy_version 1013765 (0.00092) [2022-07-11 03:22:45,698][26022] Updated weights on worker 0-0, policy_version 1013775 (0.00086) [2022-07-11 03:22:45,898][25689] Fps is (10 sec: 5610.9, 60 sec: 5563.0, 300 sec: 5565.0). Total num frames: 1038106624. Throughput: 0: 5024.0. Samples: 1038104164. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:45,898][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 03:22:47,516][26022] Updated weights on worker 0-0, policy_version 1013785 (0.00089) [2022-07-11 03:22:49,221][26022] Updated weights on worker 0-0, policy_version 1013795 (0.00085) [2022-07-11 03:22:50,904][25689] Fps is (10 sec: 5714.6, 60 sec: 5579.7, 300 sec: 5559.6). Total num frames: 1038135296. Throughput: 0: 5880.2. Samples: 1038138088. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:50,904][25689] Avg episode reward: [(0, '1.050')] [2022-07-11 03:22:51,098][26022] Updated weights on worker 0-0, policy_version 1013805 (0.00083) [2022-07-11 03:22:53,036][26022] Updated weights on worker 0-0, policy_version 1013815 (0.00086) [2022-07-11 03:22:54,774][26022] Updated weights on worker 0-0, policy_version 1013825 (0.00086) [2022-07-11 03:22:55,951][25689] Fps is (10 sec: 5602.0, 60 sec: 5545.5, 300 sec: 5559.1). Total num frames: 1038162944. Throughput: 0: 5881.6. Samples: 1038171888. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:22:55,952][25689] Avg episode reward: [(0, '1.323')] [2022-07-11 03:22:56,692][26022] Updated weights on worker 0-0, policy_version 1013835 (0.00085) [2022-07-11 03:22:58,323][26022] Updated weights on worker 0-0, policy_version 1013845 (0.00083) [2022-07-11 03:23:00,340][26022] Updated weights on worker 0-0, policy_version 1013855 (0.00084) [2022-07-11 03:23:00,955][25689] Fps is (10 sec: 5603.4, 60 sec: 5569.3, 300 sec: 5569.7). Total num frames: 1038191616. Throughput: 0: 5047.8. Samples: 1038188846. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:00,956][25689] Avg episode reward: [(0, '1.103')] [2022-07-11 03:23:02,343][26022] Updated weights on worker 0-0, policy_version 1013865 (0.00094) [2022-07-11 03:23:04,151][26022] Updated weights on worker 0-0, policy_version 1013875 (0.00084) [2022-07-11 03:23:05,981][25689] Fps is (10 sec: 5410.9, 60 sec: 5568.3, 300 sec: 5556.9). Total num frames: 1038217216. Throughput: 0: 5791.4. Samples: 1038220476. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:05,982][25689] Avg episode reward: [(0, '1.471')] [2022-07-11 03:23:06,071][26022] Updated weights on worker 0-0, policy_version 1013885 (0.00082) [2022-07-11 03:23:07,926][26022] Updated weights on worker 0-0, policy_version 1013895 (0.00091) [2022-07-11 03:23:09,723][26022] Updated weights on worker 0-0, policy_version 1013905 (0.00089) [2022-07-11 03:23:10,990][25689] Fps is (10 sec: 5306.4, 60 sec: 5551.6, 300 sec: 5558.5). Total num frames: 1038244864. Throughput: 0: 5781.3. Samples: 1038254210. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:10,992][25689] Avg episode reward: [(0, '1.651')] [2022-07-11 03:23:11,606][26022] Updated weights on worker 0-0, policy_version 1013915 (0.00082) [2022-07-11 03:23:13,531][26022] Updated weights on worker 0-0, policy_version 1013925 (0.00089) [2022-07-11 03:23:15,120][26022] Updated weights on worker 0-0, policy_version 1013935 (0.00092) [2022-07-11 03:23:16,079][25689] Fps is (10 sec: 5678.8, 60 sec: 5582.3, 300 sec: 5561.8). Total num frames: 1038274560. Throughput: 0: 4916.8. Samples: 1038270854. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:16,080][25689] Avg episode reward: [(0, '1.494')] [2022-07-11 03:23:17,029][26022] Updated weights on worker 0-0, policy_version 1013945 (0.00092) [2022-07-11 03:23:19,043][26022] Updated weights on worker 0-0, policy_version 1013955 (0.00087) [2022-07-11 03:23:20,757][26022] Updated weights on worker 0-0, policy_version 1013965 (0.00089) [2022-07-11 03:23:21,141][25689] Fps is (10 sec: 5749.7, 60 sec: 5594.0, 300 sec: 5565.2). Total num frames: 1038303232. Throughput: 0: 5742.3. Samples: 1038304762. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:21,143][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 03:23:22,563][26022] Updated weights on worker 0-0, policy_version 1013975 (0.00090) [2022-07-11 03:23:24,411][26022] Updated weights on worker 0-0, policy_version 1013985 (0.00086) [2022-07-11 03:23:26,149][25689] Fps is (10 sec: 5592.7, 60 sec: 5580.3, 300 sec: 5566.2). Total num frames: 1038330880. Throughput: 0: 5854.6. Samples: 1038338552. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:26,151][25689] Avg episode reward: [(0, '1.834')] [2022-07-11 03:23:26,155][26022] Updated weights on worker 0-0, policy_version 1013995 (0.00093) [2022-07-11 03:23:28,110][26022] Updated weights on worker 0-0, policy_version 1014005 (0.00083) [2022-07-11 03:23:29,726][26022] Updated weights on worker 0-0, policy_version 1014015 (0.00093) [2022-07-11 03:23:30,285][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:23:30,300][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001014018_1038354432.pth [2022-07-11 03:23:30,300][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001012061_1036350464.pth [2022-07-11 03:23:31,227][25689] Fps is (10 sec: 5381.1, 60 sec: 5558.2, 300 sec: 5556.6). Total num frames: 1038357504. Throughput: 0: 4998.3. Samples: 1038355360. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:31,227][25689] Avg episode reward: [(0, '2.030')] [2022-07-11 03:23:31,487][26022] Updated weights on worker 0-0, policy_version 1014025 (0.00084) [2022-07-11 03:23:33,351][26022] Updated weights on worker 0-0, policy_version 1014035 (0.00085) [2022-07-11 03:23:35,282][26022] Updated weights on worker 0-0, policy_version 1014045 (0.00090) [2022-07-11 03:23:36,368][25689] Fps is (10 sec: 5511.3, 60 sec: 5567.7, 300 sec: 5565.6). Total num frames: 1038387200. Throughput: 0: 5817.2. Samples: 1038388880. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:36,369][25689] Avg episode reward: [(0, '1.665')] [2022-07-11 03:23:37,258][26022] Updated weights on worker 0-0, policy_version 1014055 (0.00098) [2022-07-11 03:23:38,769][26022] Updated weights on worker 0-0, policy_version 1014065 (0.00092) [2022-07-11 03:23:40,859][26022] Updated weights on worker 0-0, policy_version 1014075 (0.00081) [2022-07-11 03:23:41,418][25689] Fps is (10 sec: 5727.2, 60 sec: 5580.9, 300 sec: 5565.1). Total num frames: 1038415872. Throughput: 0: 5810.8. Samples: 1038422588. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:41,420][25689] Avg episode reward: [(0, '1.434')] [2022-07-11 03:23:42,515][26022] Updated weights on worker 0-0, policy_version 1014085 (0.00087) [2022-07-11 03:23:44,564][26022] Updated weights on worker 0-0, policy_version 1014095 (0.00089) [2022-07-11 03:23:46,393][26022] Updated weights on worker 0-0, policy_version 1014105 (0.00086) [2022-07-11 03:23:46,435][25689] Fps is (10 sec: 5594.9, 60 sec: 5565.1, 300 sec: 5561.5). Total num frames: 1038443520. Throughput: 0: 4955.6. Samples: 1038439078. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:46,435][25689] Avg episode reward: [(0, '1.675')] [2022-07-11 03:23:48,196][26022] Updated weights on worker 0-0, policy_version 1014115 (0.00081) [2022-07-11 03:23:49,981][26022] Updated weights on worker 0-0, policy_version 1014125 (0.00085) [2022-07-11 03:23:51,439][25689] Fps is (10 sec: 5518.2, 60 sec: 5548.4, 300 sec: 5562.7). Total num frames: 1038471168. Throughput: 0: 5794.0. Samples: 1038472468. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:51,439][25689] Avg episode reward: [(0, '1.323')] [2022-07-11 03:23:51,887][26022] Updated weights on worker 0-0, policy_version 1014135 (0.00090) [2022-07-11 03:23:53,756][26022] Updated weights on worker 0-0, policy_version 1014145 (0.00082) [2022-07-11 03:23:55,582][26022] Updated weights on worker 0-0, policy_version 1014155 (0.00095) [2022-07-11 03:23:56,486][25689] Fps is (10 sec: 5501.3, 60 sec: 5548.4, 300 sec: 5558.5). Total num frames: 1038498816. Throughput: 0: 5803.2. Samples: 1038505628. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:23:56,487][25689] Avg episode reward: [(0, '1.258')] [2022-07-11 03:23:57,731][26022] Updated weights on worker 0-0, policy_version 1014165 (0.00090) [2022-07-11 03:23:59,245][26022] Updated weights on worker 0-0, policy_version 1014175 (0.00087) [2022-07-11 03:24:01,127][26022] Updated weights on worker 0-0, policy_version 1014185 (0.00092) [2022-07-11 03:24:01,498][25689] Fps is (10 sec: 5496.9, 60 sec: 5530.7, 300 sec: 5565.2). Total num frames: 1038526464. Throughput: 0: 4970.5. Samples: 1038522396. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:24:01,499][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 03:24:03,240][26022] Updated weights on worker 0-0, policy_version 1014195 (0.00088) [2022-07-11 03:24:05,046][26022] Updated weights on worker 0-0, policy_version 1014205 (0.00083) [2022-07-11 03:24:06,547][25689] Fps is (10 sec: 5292.7, 60 sec: 5528.7, 300 sec: 5560.9). Total num frames: 1038552064. Throughput: 0: 5698.4. Samples: 1038553684. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:24:06,547][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 03:24:07,195][26022] Updated weights on worker 0-0, policy_version 1014215 (0.00090) [2022-07-11 03:24:08,905][26022] Updated weights on worker 0-0, policy_version 1014225 (0.00081) [2022-07-11 03:24:10,660][26022] Updated weights on worker 0-0, policy_version 1014235 (0.00092) [2022-07-11 03:24:11,572][25689] Fps is (10 sec: 5488.9, 60 sec: 5560.9, 300 sec: 5561.7). Total num frames: 1038581760. Throughput: 0: 5711.1. Samples: 1038587452. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:24:11,573][25689] Avg episode reward: [(0, '0.485')] [2022-07-11 03:24:12,475][26022] Updated weights on worker 0-0, policy_version 1014245 (0.00080) [2022-07-11 03:24:14,201][26022] Updated weights on worker 0-0, policy_version 1014255 (0.00595) [2022-07-11 03:24:16,227][26022] Updated weights on worker 0-0, policy_version 1014265 (0.00086) [2022-07-11 03:24:16,620][25689] Fps is (10 sec: 5692.5, 60 sec: 5530.9, 300 sec: 5560.8). Total num frames: 1038609408. Throughput: 0: 4897.1. Samples: 1038604228. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:24:16,621][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 03:24:17,909][26022] Updated weights on worker 0-0, policy_version 1014275 (0.00086) [2022-07-11 03:24:19,946][26022] Updated weights on worker 0-0, policy_version 1014285 (0.00054) [2022-07-11 03:24:21,643][25689] Fps is (10 sec: 5491.1, 60 sec: 5517.6, 300 sec: 5564.1). Total num frames: 1038637056. Throughput: 0: 5712.1. Samples: 1038637462. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2022-07-11 03:24:21,643][25689] Avg episode reward: [(0, '0.757')] [2022-07-11 03:24:21,740][26022] Updated weights on worker 0-0, policy_version 1014295 (0.00083) [2022-07-11 03:24:23,442][26022] Updated weights on worker 0-0, policy_version 1014305 (0.00097) [2022-07-11 03:24:25,372][26022] Updated weights on worker 0-0, policy_version 1014315 (0.00091) [2022-07-11 03:24:26,645][25689] Fps is (10 sec: 5516.2, 60 sec: 5518.2, 300 sec: 5553.8). Total num frames: 1038664704. Throughput: 0: 5831.0. Samples: 1038670876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:26,645][25689] Avg episode reward: [(0, '-0.401')] [2022-07-11 03:24:27,110][26022] Updated weights on worker 0-0, policy_version 1014325 (0.00099) [2022-07-11 03:24:29,100][26022] Updated weights on worker 0-0, policy_version 1014335 (0.00091) [2022-07-11 03:24:30,927][26022] Updated weights on worker 0-0, policy_version 1014345 (0.00088) [2022-07-11 03:24:31,667][25689] Fps is (10 sec: 5516.3, 60 sec: 5540.2, 300 sec: 5557.9). Total num frames: 1038692352. Throughput: 0: 4983.7. Samples: 1038687596. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:31,667][25689] Avg episode reward: [(0, '-0.096')] [2022-07-11 03:24:32,758][26022] Updated weights on worker 0-0, policy_version 1014355 (0.00087) [2022-07-11 03:24:34,516][26022] Updated weights on worker 0-0, policy_version 1014365 (0.00086) [2022-07-11 03:24:36,469][26022] Updated weights on worker 0-0, policy_version 1014375 (0.00095) [2022-07-11 03:24:36,791][25689] Fps is (10 sec: 5550.8, 60 sec: 5524.8, 300 sec: 5552.5). Total num frames: 1038721024. Throughput: 0: 5801.5. Samples: 1038721248. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:36,792][25689] Avg episode reward: [(0, '-0.439')] [2022-07-11 03:24:38,179][26022] Updated weights on worker 0-0, policy_version 1014385 (0.00094) [2022-07-11 03:24:40,251][26022] Updated weights on worker 0-0, policy_version 1014395 (0.00082) [2022-07-11 03:24:41,815][25689] Fps is (10 sec: 5650.4, 60 sec: 5527.1, 300 sec: 5559.0). Total num frames: 1038749696. Throughput: 0: 5801.3. Samples: 1038754490. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:41,816][25689] Avg episode reward: [(0, '-0.682')] [2022-07-11 03:24:41,839][26022] Updated weights on worker 0-0, policy_version 1014405 (0.00082) [2022-07-11 03:24:43,792][26022] Updated weights on worker 0-0, policy_version 1014415 (0.00083) [2022-07-11 03:24:45,580][26022] Updated weights on worker 0-0, policy_version 1014425 (0.00087) [2022-07-11 03:24:46,849][25689] Fps is (10 sec: 5497.9, 60 sec: 5508.7, 300 sec: 5551.7). Total num frames: 1038776320. Throughput: 0: 4974.9. Samples: 1038771388. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:46,849][25689] Avg episode reward: [(0, '-0.484')] [2022-07-11 03:24:47,373][26022] Updated weights on worker 0-0, policy_version 1014435 (0.00084) [2022-07-11 03:24:49,343][26022] Updated weights on worker 0-0, policy_version 1014445 (0.00086) [2022-07-11 03:24:51,230][26022] Updated weights on worker 0-0, policy_version 1014455 (0.00100) [2022-07-11 03:24:51,866][25689] Fps is (10 sec: 5399.7, 60 sec: 5507.4, 300 sec: 5549.0). Total num frames: 1038803968. Throughput: 0: 5798.3. Samples: 1038804718. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:51,867][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 03:24:52,938][26022] Updated weights on worker 0-0, policy_version 1014465 (0.00089) [2022-07-11 03:24:55,075][26022] Updated weights on worker 0-0, policy_version 1014475 (0.00111) [2022-07-11 03:24:56,572][26022] Updated weights on worker 0-0, policy_version 1014485 (0.00084) [2022-07-11 03:24:56,916][25689] Fps is (10 sec: 5797.6, 60 sec: 5558.0, 300 sec: 5562.0). Total num frames: 1038834688. Throughput: 0: 5808.6. Samples: 1038838146. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:24:56,917][25689] Avg episode reward: [(0, '0.677')] [2022-07-11 03:24:58,715][26022] Updated weights on worker 0-0, policy_version 1014495 (0.00082) [2022-07-11 03:25:00,165][26022] Updated weights on worker 0-0, policy_version 1014505 (0.00083) [2022-07-11 03:25:01,931][25689] Fps is (10 sec: 5697.7, 60 sec: 5540.9, 300 sec: 5558.4). Total num frames: 1038861312. Throughput: 0: 4999.4. Samples: 1038855054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:01,931][25689] Avg episode reward: [(0, '0.722')] [2022-07-11 03:25:02,566][26022] Updated weights on worker 0-0, policy_version 1014515 (0.00097) [2022-07-11 03:25:04,603][26022] Updated weights on worker 0-0, policy_version 1014525 (0.00085) [2022-07-11 03:25:06,201][26022] Updated weights on worker 0-0, policy_version 1014535 (0.00089) [2022-07-11 03:25:06,951][25689] Fps is (10 sec: 5306.5, 60 sec: 5560.4, 300 sec: 5558.1). Total num frames: 1038887936. Throughput: 0: 5718.1. Samples: 1038886334. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:06,951][25689] Avg episode reward: [(0, '0.813')] [2022-07-11 03:25:08,175][26022] Updated weights on worker 0-0, policy_version 1014545 (0.00080) [2022-07-11 03:25:10,075][26022] Updated weights on worker 0-0, policy_version 1014555 (0.00085) [2022-07-11 03:25:11,916][26022] Updated weights on worker 0-0, policy_version 1014565 (0.00089) [2022-07-11 03:25:11,984][25689] Fps is (10 sec: 5296.5, 60 sec: 5508.9, 300 sec: 5555.3). Total num frames: 1038914560. Throughput: 0: 5718.3. Samples: 1038919758. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:11,985][25689] Avg episode reward: [(0, '1.477')] [2022-07-11 03:25:13,702][26022] Updated weights on worker 0-0, policy_version 1014575 (0.00091) [2022-07-11 03:25:15,609][26022] Updated weights on worker 0-0, policy_version 1014585 (0.00095) [2022-07-11 03:25:17,045][25689] Fps is (10 sec: 5477.7, 60 sec: 5524.6, 300 sec: 5554.5). Total num frames: 1038943232. Throughput: 0: 4885.0. Samples: 1038936478. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:17,046][25689] Avg episode reward: [(0, '1.448')] [2022-07-11 03:25:17,409][26022] Updated weights on worker 0-0, policy_version 1014595 (0.00085) [2022-07-11 03:25:19,155][26022] Updated weights on worker 0-0, policy_version 1014605 (0.00092) [2022-07-11 03:25:21,005][26022] Updated weights on worker 0-0, policy_version 1014615 (0.00090) [2022-07-11 03:25:22,056][25689] Fps is (10 sec: 5591.9, 60 sec: 5525.7, 300 sec: 5551.0). Total num frames: 1038970880. Throughput: 0: 5710.9. Samples: 1038969986. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:22,056][25689] Avg episode reward: [(0, '1.230')] [2022-07-11 03:25:22,771][26022] Updated weights on worker 0-0, policy_version 1014625 (0.00084) [2022-07-11 03:25:24,661][26022] Updated weights on worker 0-0, policy_version 1014635 (0.00084) [2022-07-11 03:25:26,530][26022] Updated weights on worker 0-0, policy_version 1014645 (0.00089) [2022-07-11 03:25:27,059][25689] Fps is (10 sec: 5624.5, 60 sec: 5542.6, 300 sec: 5558.0). Total num frames: 1038999552. Throughput: 0: 5837.3. Samples: 1039003710. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:27,059][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 03:25:28,262][26022] Updated weights on worker 0-0, policy_version 1014655 (0.00083) [2022-07-11 03:25:30,277][26022] Updated weights on worker 0-0, policy_version 1014665 (0.00096) [2022-07-11 03:25:30,338][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:25:30,346][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001014666_1039017984.pth [2022-07-11 03:25:30,351][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001012712_1037017088.pth [2022-07-11 03:25:31,963][26022] Updated weights on worker 0-0, policy_version 1014675 (0.00080) [2022-07-11 03:25:32,092][25689] Fps is (10 sec: 5611.7, 60 sec: 5541.5, 300 sec: 5548.3). Total num frames: 1039027200. Throughput: 0: 5835.2. Samples: 1039037092. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:32,093][25689] Avg episode reward: [(0, '1.251')] [2022-07-11 03:25:33,923][26022] Updated weights on worker 0-0, policy_version 1014685 (0.00080) [2022-07-11 03:25:35,487][26022] Updated weights on worker 0-0, policy_version 1014695 (0.00082) [2022-07-11 03:25:37,216][25689] Fps is (10 sec: 5544.7, 60 sec: 5541.5, 300 sec: 5550.1). Total num frames: 1039055872. Throughput: 0: 5820.2. Samples: 1039053876. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:37,217][25689] Avg episode reward: [(0, '1.206')] [2022-07-11 03:25:37,661][26022] Updated weights on worker 0-0, policy_version 1014705 (0.00085) [2022-07-11 03:25:39,260][26022] Updated weights on worker 0-0, policy_version 1014715 (0.00086) [2022-07-11 03:25:41,194][26022] Updated weights on worker 0-0, policy_version 1014725 (0.00082) [2022-07-11 03:25:42,230][25689] Fps is (10 sec: 5656.4, 60 sec: 5542.5, 300 sec: 5553.4). Total num frames: 1039084544. Throughput: 0: 5818.7. Samples: 1039087372. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:42,230][25689] Avg episode reward: [(0, '0.462')] [2022-07-11 03:25:42,963][26022] Updated weights on worker 0-0, policy_version 1014735 (0.00098) [2022-07-11 03:25:44,738][26022] Updated weights on worker 0-0, policy_version 1014745 (0.00083) [2022-07-11 03:25:46,889][26022] Updated weights on worker 0-0, policy_version 1014755 (0.00092) [2022-07-11 03:25:47,255][25689] Fps is (10 sec: 5508.2, 60 sec: 5543.3, 300 sec: 5549.7). Total num frames: 1039111168. Throughput: 0: 5803.3. Samples: 1039120916. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:47,257][25689] Avg episode reward: [(0, '0.000')] [2022-07-11 03:25:48,489][26022] Updated weights on worker 0-0, policy_version 1014765 (0.00101) [2022-07-11 03:25:50,422][26022] Updated weights on worker 0-0, policy_version 1014775 (0.00088) [2022-07-11 03:25:52,032][26022] Updated weights on worker 0-0, policy_version 1014785 (0.00084) [2022-07-11 03:25:52,323][25689] Fps is (10 sec: 5580.0, 60 sec: 5572.5, 300 sec: 5549.6). Total num frames: 1039140864. Throughput: 0: 4977.3. Samples: 1039137788. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:52,324][25689] Avg episode reward: [(0, '0.131')] [2022-07-11 03:25:53,943][26022] Updated weights on worker 0-0, policy_version 1014795 (0.00087) [2022-07-11 03:25:55,974][26022] Updated weights on worker 0-0, policy_version 1014805 (0.00094) [2022-07-11 03:25:57,393][25689] Fps is (10 sec: 5757.4, 60 sec: 5536.8, 300 sec: 5552.4). Total num frames: 1039169536. Throughput: 0: 5817.5. Samples: 1039171254. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:25:57,393][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 03:25:57,721][26022] Updated weights on worker 0-0, policy_version 1014815 (0.00087) [2022-07-11 03:25:59,439][26022] Updated weights on worker 0-0, policy_version 1014825 (0.00093) [2022-07-11 03:26:01,090][26022] Updated weights on worker 0-0, policy_version 1014835 (0.00095) [2022-07-11 03:26:02,399][25689] Fps is (10 sec: 5284.8, 60 sec: 5503.7, 300 sec: 5552.5). Total num frames: 1039194112. Throughput: 0: 5779.6. Samples: 1039203940. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:02,399][25689] Avg episode reward: [(0, '1.148')] [2022-07-11 03:26:03,495][26022] Updated weights on worker 0-0, policy_version 1014845 (0.00086) [2022-07-11 03:26:05,350][26022] Updated weights on worker 0-0, policy_version 1014855 (0.00086) [2022-07-11 03:26:07,259][26022] Updated weights on worker 0-0, policy_version 1014865 (0.00087) [2022-07-11 03:26:07,472][25689] Fps is (10 sec: 5283.3, 60 sec: 5532.8, 300 sec: 5548.0). Total num frames: 1039222784. Throughput: 0: 4906.4. Samples: 1039220106. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:07,472][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 03:26:08,804][26022] Updated weights on worker 0-0, policy_version 1014875 (0.00085) [2022-07-11 03:26:10,949][26022] Updated weights on worker 0-0, policy_version 1014885 (0.00086) [2022-07-11 03:26:12,519][25689] Fps is (10 sec: 5666.2, 60 sec: 5565.3, 300 sec: 5551.5). Total num frames: 1039251456. Throughput: 0: 5734.9. Samples: 1039253610. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:12,520][25689] Avg episode reward: [(0, '1.318')] [2022-07-11 03:26:12,574][26022] Updated weights on worker 0-0, policy_version 1014895 (0.00085) [2022-07-11 03:26:14,626][26022] Updated weights on worker 0-0, policy_version 1014905 (0.00088) [2022-07-11 03:26:16,360][26022] Updated weights on worker 0-0, policy_version 1014915 (0.00091) [2022-07-11 03:26:17,581][25689] Fps is (10 sec: 5571.0, 60 sec: 5548.3, 300 sec: 5543.5). Total num frames: 1039279104. Throughput: 0: 5727.1. Samples: 1039286874. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:17,582][25689] Avg episode reward: [(0, '1.396')] [2022-07-11 03:26:18,207][26022] Updated weights on worker 0-0, policy_version 1014925 (0.00094) [2022-07-11 03:26:20,077][26022] Updated weights on worker 0-0, policy_version 1014935 (0.00086) [2022-07-11 03:26:21,850][26022] Updated weights on worker 0-0, policy_version 1014945 (0.00053) [2022-07-11 03:26:22,665][25689] Fps is (10 sec: 5450.3, 60 sec: 5541.6, 300 sec: 5549.1). Total num frames: 1039306752. Throughput: 0: 4924.2. Samples: 1039303732. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:22,665][25689] Avg episode reward: [(0, '1.269')] [2022-07-11 03:26:23,515][26022] Updated weights on worker 0-0, policy_version 1014955 (0.00094) [2022-07-11 03:26:25,525][26022] Updated weights on worker 0-0, policy_version 1014965 (0.00100) [2022-07-11 03:26:27,311][26022] Updated weights on worker 0-0, policy_version 1014975 (0.00101) [2022-07-11 03:26:27,757][25689] Fps is (10 sec: 5534.5, 60 sec: 5533.4, 300 sec: 5544.5). Total num frames: 1039335424. Throughput: 0: 5779.6. Samples: 1039337350. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:27,758][25689] Avg episode reward: [(0, '1.365')] [2022-07-11 03:26:29,246][26022] Updated weights on worker 0-0, policy_version 1014986 (0.00082) [2022-07-11 03:26:31,088][26022] Updated weights on worker 0-0, policy_version 1014996 (0.00051) [2022-07-11 03:26:32,773][25689] Fps is (10 sec: 5774.3, 60 sec: 5568.8, 300 sec: 5552.1). Total num frames: 1039365120. Throughput: 0: 5802.9. Samples: 1039371140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:32,773][25689] Avg episode reward: [(0, '1.104')] [2022-07-11 03:26:32,966][26022] Updated weights on worker 0-0, policy_version 1015006 (0.00088) [2022-07-11 03:26:34,936][26022] Updated weights on worker 0-0, policy_version 1015016 (0.00090) [2022-07-11 03:26:36,538][26022] Updated weights on worker 0-0, policy_version 1015026 (0.00094) [2022-07-11 03:26:37,816][25689] Fps is (10 sec: 5701.1, 60 sec: 5559.4, 300 sec: 5548.7). Total num frames: 1039392768. Throughput: 0: 4998.4. Samples: 1039388014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:37,817][25689] Avg episode reward: [(0, '1.170')] [2022-07-11 03:26:38,554][26022] Updated weights on worker 0-0, policy_version 1015036 (0.00085) [2022-07-11 03:26:40,070][26022] Updated weights on worker 0-0, policy_version 1015046 (0.00082) [2022-07-11 03:26:42,268][26022] Updated weights on worker 0-0, policy_version 1015056 (0.00090) [2022-07-11 03:26:42,824][25689] Fps is (10 sec: 5501.7, 60 sec: 5543.0, 300 sec: 5548.8). Total num frames: 1039420416. Throughput: 0: 5854.8. Samples: 1039421758. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:42,824][25689] Avg episode reward: [(0, '1.440')] [2022-07-11 03:26:43,700][26022] Updated weights on worker 0-0, policy_version 1015066 (0.00613) [2022-07-11 03:26:45,894][26022] Updated weights on worker 0-0, policy_version 1015076 (0.00087) [2022-07-11 03:26:47,688][26022] Updated weights on worker 0-0, policy_version 1015086 (0.00093) [2022-07-11 03:26:47,827][25689] Fps is (10 sec: 5523.4, 60 sec: 5561.9, 300 sec: 5548.8). Total num frames: 1039448064. Throughput: 0: 5868.7. Samples: 1039455132. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:47,828][25689] Avg episode reward: [(0, '1.701')] [2022-07-11 03:26:49,327][26022] Updated weights on worker 0-0, policy_version 1015096 (0.00098) [2022-07-11 03:26:51,303][26022] Updated weights on worker 0-0, policy_version 1015106 (0.00087) [2022-07-11 03:26:52,853][25689] Fps is (10 sec: 5717.7, 60 sec: 5565.8, 300 sec: 5549.1). Total num frames: 1039477760. Throughput: 0: 5034.0. Samples: 1039472222. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:52,853][25689] Avg episode reward: [(0, '1.399')] [2022-07-11 03:26:53,131][26022] Updated weights on worker 0-0, policy_version 1015116 (0.00084) [2022-07-11 03:26:54,773][26022] Updated weights on worker 0-0, policy_version 1015126 (0.00081) [2022-07-11 03:26:57,027][26022] Updated weights on worker 0-0, policy_version 1015136 (0.00093) [2022-07-11 03:26:57,914][25689] Fps is (10 sec: 5786.2, 60 sec: 5566.5, 300 sec: 5552.9). Total num frames: 1039506432. Throughput: 0: 5861.9. Samples: 1039505832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:26:57,915][25689] Avg episode reward: [(0, '1.267')] [2022-07-11 03:26:58,319][26022] Updated weights on worker 0-0, policy_version 1015146 (0.00084) [2022-07-11 03:27:00,503][26022] Updated weights on worker 0-0, policy_version 1015156 (0.00091) [2022-07-11 03:27:02,335][26022] Updated weights on worker 0-0, policy_version 1015166 (0.00092) [2022-07-11 03:27:02,923][25689] Fps is (10 sec: 5287.4, 60 sec: 5566.3, 300 sec: 5549.5). Total num frames: 1039531008. Throughput: 0: 5762.8. Samples: 1039537590. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:02,924][25689] Avg episode reward: [(0, '1.405')] [2022-07-11 03:27:04,412][26022] Updated weights on worker 0-0, policy_version 1015176 (0.00334) [2022-07-11 03:27:06,142][26022] Updated weights on worker 0-0, policy_version 1015186 (0.00083) [2022-07-11 03:27:07,928][25689] Fps is (10 sec: 5317.5, 60 sec: 5572.5, 300 sec: 5549.7). Total num frames: 1039559680. Throughput: 0: 4931.8. Samples: 1039554268. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:07,929][25689] Avg episode reward: [(0, '1.759')] [2022-07-11 03:27:08,088][26022] Updated weights on worker 0-0, policy_version 1015196 (0.00090) [2022-07-11 03:27:09,860][26022] Updated weights on worker 0-0, policy_version 1015206 (0.00084) [2022-07-11 03:27:11,753][26022] Updated weights on worker 0-0, policy_version 1015216 (0.00088) [2022-07-11 03:27:12,939][25689] Fps is (10 sec: 5725.4, 60 sec: 5575.9, 300 sec: 5553.9). Total num frames: 1039588352. Throughput: 0: 5763.1. Samples: 1039587982. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:12,940][25689] Avg episode reward: [(0, '1.621')] [2022-07-11 03:27:13,564][26022] Updated weights on worker 0-0, policy_version 1015226 (0.00084) [2022-07-11 03:27:15,288][26022] Updated weights on worker 0-0, policy_version 1015236 (0.00089) [2022-07-11 03:27:17,206][26022] Updated weights on worker 0-0, policy_version 1015246 (0.00092) [2022-07-11 03:27:18,025][25689] Fps is (10 sec: 5476.3, 60 sec: 5556.7, 300 sec: 5549.0). Total num frames: 1039614976. Throughput: 0: 5757.0. Samples: 1039621610. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:18,026][25689] Avg episode reward: [(0, '1.444')] [2022-07-11 03:27:19,104][26022] Updated weights on worker 0-0, policy_version 1015256 (0.00086) [2022-07-11 03:27:21,018][26022] Updated weights on worker 0-0, policy_version 1015266 (0.00093) [2022-07-11 03:27:22,696][26022] Updated weights on worker 0-0, policy_version 1015276 (0.00049) [2022-07-11 03:27:23,037][25689] Fps is (10 sec: 5476.3, 60 sec: 5580.3, 300 sec: 5549.6). Total num frames: 1039643648. Throughput: 0: 5006.1. Samples: 1039638278. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:23,037][25689] Avg episode reward: [(0, '1.360')] [2022-07-11 03:27:24,497][26022] Updated weights on worker 0-0, policy_version 1015286 (0.00091) [2022-07-11 03:27:26,451][26022] Updated weights on worker 0-0, policy_version 1015296 (0.00087) [2022-07-11 03:27:28,052][25689] Fps is (10 sec: 5719.1, 60 sec: 5587.5, 300 sec: 5553.1). Total num frames: 1039672320. Throughput: 0: 5828.1. Samples: 1039671552. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:28,054][25689] Avg episode reward: [(0, '1.830')] [2022-07-11 03:27:28,178][26022] Updated weights on worker 0-0, policy_version 1015306 (0.00079) [2022-07-11 03:27:30,170][26022] Updated weights on worker 0-0, policy_version 1015316 (0.00093) [2022-07-11 03:27:30,380][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:27:30,390][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001015317_1039684608.pth [2022-07-11 03:27:30,391][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001013365_1037685760.pth [2022-07-11 03:27:31,995][26022] Updated weights on worker 0-0, policy_version 1015326 (0.00088) [2022-07-11 03:27:33,084][25689] Fps is (10 sec: 5503.5, 60 sec: 5535.0, 300 sec: 5546.8). Total num frames: 1039698944. Throughput: 0: 5797.7. Samples: 1039704774. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:33,086][25689] Avg episode reward: [(0, '1.694')] [2022-07-11 03:27:33,837][26022] Updated weights on worker 0-0, policy_version 1015336 (0.00084) [2022-07-11 03:27:35,617][26022] Updated weights on worker 0-0, policy_version 1015346 (0.00089) [2022-07-11 03:27:37,586][26022] Updated weights on worker 0-0, policy_version 1015356 (0.00091) [2022-07-11 03:27:38,210][25689] Fps is (10 sec: 5443.8, 60 sec: 5544.4, 300 sec: 5548.1). Total num frames: 1039727616. Throughput: 0: 4942.8. Samples: 1039721378. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:38,210][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 03:27:39,342][26022] Updated weights on worker 0-0, policy_version 1015366 (0.00092) [2022-07-11 03:27:41,253][26022] Updated weights on worker 0-0, policy_version 1015376 (0.00086) [2022-07-11 03:27:43,027][26022] Updated weights on worker 0-0, policy_version 1015386 (0.00087) [2022-07-11 03:27:43,212][25689] Fps is (10 sec: 5560.6, 60 sec: 5544.9, 300 sec: 5545.1). Total num frames: 1039755264. Throughput: 0: 5769.5. Samples: 1039754682. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:43,213][25689] Avg episode reward: [(0, '0.651')] [2022-07-11 03:27:44,968][26022] Updated weights on worker 0-0, policy_version 1015396 (0.00086) [2022-07-11 03:27:46,865][26022] Updated weights on worker 0-0, policy_version 1015406 (0.00090) [2022-07-11 03:27:48,214][25689] Fps is (10 sec: 5425.0, 60 sec: 5528.1, 300 sec: 5538.3). Total num frames: 1039781888. Throughput: 0: 5785.8. Samples: 1039788204. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:48,215][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 03:27:48,479][26022] Updated weights on worker 0-0, policy_version 1015416 (0.00087) [2022-07-11 03:27:50,375][26022] Updated weights on worker 0-0, policy_version 1015426 (0.00073) [2022-07-11 03:27:52,157][26022] Updated weights on worker 0-0, policy_version 1015436 (0.00085) [2022-07-11 03:27:53,257][25689] Fps is (10 sec: 5606.7, 60 sec: 5526.5, 300 sec: 5545.3). Total num frames: 1039811584. Throughput: 0: 4959.8. Samples: 1039804830. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:53,258][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 03:27:54,274][26022] Updated weights on worker 0-0, policy_version 1015446 (0.00082) [2022-07-11 03:27:55,941][26022] Updated weights on worker 0-0, policy_version 1015456 (0.00078) [2022-07-11 03:27:57,917][26022] Updated weights on worker 0-0, policy_version 1015466 (0.00082) [2022-07-11 03:27:58,363][25689] Fps is (10 sec: 5650.4, 60 sec: 5505.5, 300 sec: 5539.9). Total num frames: 1039839232. Throughput: 0: 5799.9. Samples: 1039838262. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:27:58,363][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 03:27:59,478][26022] Updated weights on worker 0-0, policy_version 1015476 (0.00092) [2022-07-11 03:28:01,866][26022] Updated weights on worker 0-0, policy_version 1015486 (0.00097) [2022-07-11 03:28:03,384][25689] Fps is (10 sec: 5359.5, 60 sec: 5538.3, 300 sec: 5543.5). Total num frames: 1039865856. Throughput: 0: 5708.9. Samples: 1039869838. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:28:03,384][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 03:28:03,610][26022] Updated weights on worker 0-0, policy_version 1015496 (0.00098) [2022-07-11 03:28:05,624][26022] Updated weights on worker 0-0, policy_version 1015506 (0.00088) [2022-07-11 03:28:07,155][26022] Updated weights on worker 0-0, policy_version 1015516 (0.00088) [2022-07-11 03:28:08,386][25689] Fps is (10 sec: 5414.7, 60 sec: 5521.6, 300 sec: 5543.6). Total num frames: 1039893504. Throughput: 0: 5714.6. Samples: 1039903478. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:28:08,386][25689] Avg episode reward: [(0, '1.385')] [2022-07-11 03:28:09,363][26022] Updated weights on worker 0-0, policy_version 1015526 (0.00086) [2022-07-11 03:28:10,930][26022] Updated weights on worker 0-0, policy_version 1015536 (0.00088) [2022-07-11 03:28:13,000][26022] Updated weights on worker 0-0, policy_version 1015546 (0.00092) [2022-07-11 03:28:13,424][25689] Fps is (10 sec: 5609.4, 60 sec: 5519.1, 300 sec: 5541.1). Total num frames: 1039922176. Throughput: 0: 5714.3. Samples: 1039920068. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:28:13,425][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 03:28:14,730][26022] Updated weights on worker 0-0, policy_version 1015556 (0.00082) [2022-07-11 03:28:16,634][26022] Updated weights on worker 0-0, policy_version 1015566 (0.00089) [2022-07-11 03:28:18,554][25689] Fps is (10 sec: 5538.9, 60 sec: 5532.1, 300 sec: 5536.4). Total num frames: 1039949824. Throughput: 0: 5698.9. Samples: 1039953330. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:28:18,555][25689] Avg episode reward: [(0, '0.025')] [2022-07-11 03:28:18,557][26022] Updated weights on worker 0-0, policy_version 1015576 (0.00093) [2022-07-11 03:28:20,148][26022] Updated weights on worker 0-0, policy_version 1015586 (0.00093) [2022-07-11 03:28:22,264][26022] Updated weights on worker 0-0, policy_version 1015596 (0.00092) [2022-07-11 03:28:23,580][25689] Fps is (10 sec: 5545.4, 60 sec: 5530.7, 300 sec: 5539.5). Total num frames: 1039978496. Throughput: 0: 5777.3. Samples: 1039986520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2022-07-11 03:28:23,581][25689] Avg episode reward: [(0, '-1.056')] [2022-07-11 03:28:23,951][26022] Updated weights on worker 0-0, policy_version 1015606 (0.00062) [2022-07-11 03:28:25,787][26022] Updated weights on worker 0-0, policy_version 1015616 (0.00085) [2022-07-11 03:28:27,677][26022] Updated weights on worker 0-0, policy_version 1015626 (0.00082) [2022-07-11 03:28:28,599][25689] Fps is (10 sec: 5504.7, 60 sec: 5496.6, 300 sec: 5540.6). Total num frames: 1040005120. Throughput: 0: 4939.4. Samples: 1040003320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:28,600][25689] Avg episode reward: [(0, '-1.040')] [2022-07-11 03:28:29,507][26022] Updated weights on worker 0-0, policy_version 1015636 (0.00092) [2022-07-11 03:28:31,535][26022] Updated weights on worker 0-0, policy_version 1015646 (0.00092) [2022-07-11 03:28:33,319][26022] Updated weights on worker 0-0, policy_version 1015656 (0.00100) [2022-07-11 03:28:33,700][25689] Fps is (10 sec: 5363.0, 60 sec: 5507.2, 300 sec: 5534.5). Total num frames: 1040032768. Throughput: 0: 5746.5. Samples: 1040036584. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:33,700][25689] Avg episode reward: [(0, '-0.849')] [2022-07-11 03:28:35,093][26022] Updated weights on worker 0-0, policy_version 1015666 (0.00090) [2022-07-11 03:28:37,031][26022] Updated weights on worker 0-0, policy_version 1015676 (0.00080) [2022-07-11 03:28:38,593][26022] Updated weights on worker 0-0, policy_version 1015686 (0.00086) [2022-07-11 03:28:38,832][25689] Fps is (10 sec: 5704.1, 60 sec: 5540.4, 300 sec: 5539.8). Total num frames: 1040063488. Throughput: 0: 5752.9. Samples: 1040069986. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:38,832][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 03:28:40,789][26022] Updated weights on worker 0-0, policy_version 1015696 (0.00085) [2022-07-11 03:28:42,362][26022] Updated weights on worker 0-0, policy_version 1015706 (0.00085) [2022-07-11 03:28:43,864][25689] Fps is (10 sec: 5641.8, 60 sec: 5520.8, 300 sec: 5536.1). Total num frames: 1040090112. Throughput: 0: 4939.7. Samples: 1040086716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:43,865][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 03:28:44,305][26022] Updated weights on worker 0-0, policy_version 1015716 (0.00093) [2022-07-11 03:28:46,088][26022] Updated weights on worker 0-0, policy_version 1015726 (0.00094) [2022-07-11 03:28:47,768][26022] Updated weights on worker 0-0, policy_version 1015736 (0.00089) [2022-07-11 03:28:48,897][25689] Fps is (10 sec: 5494.0, 60 sec: 5551.8, 300 sec: 5539.0). Total num frames: 1040118784. Throughput: 0: 5761.5. Samples: 1040120264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:48,897][25689] Avg episode reward: [(0, '0.691')] [2022-07-11 03:28:49,666][26022] Updated weights on worker 0-0, policy_version 1015746 (0.00056) [2022-07-11 03:28:51,789][26022] Updated weights on worker 0-0, policy_version 1015756 (0.00088) [2022-07-11 03:28:53,477][26022] Updated weights on worker 0-0, policy_version 1015766 (0.00056) [2022-07-11 03:28:53,909][25689] Fps is (10 sec: 5606.6, 60 sec: 5520.8, 300 sec: 5539.6). Total num frames: 1040146432. Throughput: 0: 5782.7. Samples: 1040153448. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:53,910][25689] Avg episode reward: [(0, '1.541')] [2022-07-11 03:28:55,332][26022] Updated weights on worker 0-0, policy_version 1015776 (0.00088) [2022-07-11 03:28:57,163][26022] Updated weights on worker 0-0, policy_version 1015786 (0.00090) [2022-07-11 03:28:58,939][26022] Updated weights on worker 0-0, policy_version 1015796 (0.00083) [2022-07-11 03:28:59,034][25689] Fps is (10 sec: 5555.6, 60 sec: 5535.9, 300 sec: 5541.0). Total num frames: 1040175104. Throughput: 0: 4952.7. Samples: 1040170042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:28:59,035][25689] Avg episode reward: [(0, '1.912')] [2022-07-11 03:29:00,815][26022] Updated weights on worker 0-0, policy_version 1015806 (0.00091) [2022-07-11 03:29:03,103][26022] Updated weights on worker 0-0, policy_version 1015816 (0.00090) [2022-07-11 03:29:04,051][25689] Fps is (10 sec: 5351.5, 60 sec: 5519.4, 300 sec: 5541.6). Total num frames: 1040200704. Throughput: 0: 5695.0. Samples: 1040201680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:04,051][25689] Avg episode reward: [(0, '1.592')] [2022-07-11 03:29:04,785][26022] Updated weights on worker 0-0, policy_version 1015826 (0.00086) [2022-07-11 03:29:06,688][26022] Updated weights on worker 0-0, policy_version 1015836 (0.00089) [2022-07-11 03:29:08,537][26022] Updated weights on worker 0-0, policy_version 1015846 (0.00101) [2022-07-11 03:29:09,083][25689] Fps is (10 sec: 5400.9, 60 sec: 5533.6, 300 sec: 5538.0). Total num frames: 1040229376. Throughput: 0: 5712.1. Samples: 1040235570. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:09,083][25689] Avg episode reward: [(0, '1.403')] [2022-07-11 03:29:10,271][26022] Updated weights on worker 0-0, policy_version 1015856 (0.00091) [2022-07-11 03:29:12,291][26022] Updated weights on worker 0-0, policy_version 1015866 (0.00096) [2022-07-11 03:29:14,085][26022] Updated weights on worker 0-0, policy_version 1015876 (0.00084) [2022-07-11 03:29:14,171][25689] Fps is (10 sec: 5565.1, 60 sec: 5512.2, 300 sec: 5537.3). Total num frames: 1040257024. Throughput: 0: 4876.2. Samples: 1040252248. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:14,171][25689] Avg episode reward: [(0, '1.028')] [2022-07-11 03:29:15,745][26022] Updated weights on worker 0-0, policy_version 1015886 (0.00085) [2022-07-11 03:29:17,761][26022] Updated weights on worker 0-0, policy_version 1015896 (0.00082) [2022-07-11 03:29:19,267][25689] Fps is (10 sec: 5530.3, 60 sec: 5532.1, 300 sec: 5539.3). Total num frames: 1040285696. Throughput: 0: 5710.9. Samples: 1040285588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:19,267][25689] Avg episode reward: [(0, '1.114')] [2022-07-11 03:29:19,475][26022] Updated weights on worker 0-0, policy_version 1015906 (0.00091) [2022-07-11 03:29:21,747][26022] Updated weights on worker 0-0, policy_version 1015916 (0.00077) [2022-07-11 03:29:23,306][26022] Updated weights on worker 0-0, policy_version 1015926 (0.00077) [2022-07-11 03:29:24,359][25689] Fps is (10 sec: 5628.4, 60 sec: 5526.1, 300 sec: 5541.1). Total num frames: 1040314368. Throughput: 0: 5763.9. Samples: 1040318736. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:24,360][25689] Avg episode reward: [(0, '1.022')] [2022-07-11 03:29:25,229][26022] Updated weights on worker 0-0, policy_version 1015936 (0.00086) [2022-07-11 03:29:26,880][26022] Updated weights on worker 0-0, policy_version 1015946 (0.00089) [2022-07-11 03:29:29,036][26022] Updated weights on worker 0-0, policy_version 1015956 (0.00086) [2022-07-11 03:29:29,388][25689] Fps is (10 sec: 5362.4, 60 sec: 5508.4, 300 sec: 5534.1). Total num frames: 1040339968. Throughput: 0: 4912.2. Samples: 1040335312. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:29,388][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 03:29:30,494][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:29:30,509][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001015965_1040348160.pth [2022-07-11 03:29:30,509][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001014018_1038354432.pth [2022-07-11 03:29:30,540][26022] Updated weights on worker 0-0, policy_version 1015966 (0.00090) [2022-07-11 03:29:32,760][26022] Updated weights on worker 0-0, policy_version 1015976 (0.00089) [2022-07-11 03:29:34,159][26022] Updated weights on worker 0-0, policy_version 1015986 (0.00100) [2022-07-11 03:29:34,390][25689] Fps is (10 sec: 5614.7, 60 sec: 5568.0, 300 sec: 5543.2). Total num frames: 1040370688. Throughput: 0: 5758.5. Samples: 1040368680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:34,391][25689] Avg episode reward: [(0, '1.253')] [2022-07-11 03:29:36,411][26022] Updated weights on worker 0-0, policy_version 1015996 (0.00079) [2022-07-11 03:29:37,886][26022] Updated weights on worker 0-0, policy_version 1016006 (0.00085) [2022-07-11 03:29:39,487][25689] Fps is (10 sec: 5678.3, 60 sec: 5503.7, 300 sec: 5535.0). Total num frames: 1040397312. Throughput: 0: 5760.4. Samples: 1040402062. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:39,487][25689] Avg episode reward: [(0, '1.469')] [2022-07-11 03:29:39,806][26022] Updated weights on worker 0-0, policy_version 1016016 (0.00086) [2022-07-11 03:29:41,731][26022] Updated weights on worker 0-0, policy_version 1016026 (0.00089) [2022-07-11 03:29:43,533][26022] Updated weights on worker 0-0, policy_version 1016036 (0.00083) [2022-07-11 03:29:44,521][25689] Fps is (10 sec: 5357.1, 60 sec: 5520.4, 300 sec: 5538.4). Total num frames: 1040424960. Throughput: 0: 4960.5. Samples: 1040418748. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:44,522][25689] Avg episode reward: [(0, '1.064')] [2022-07-11 03:29:45,524][26022] Updated weights on worker 0-0, policy_version 1016046 (0.00095) [2022-07-11 03:29:47,181][26022] Updated weights on worker 0-0, policy_version 1016056 (0.00091) [2022-07-11 03:29:49,057][26022] Updated weights on worker 0-0, policy_version 1016066 (0.00086) [2022-07-11 03:29:49,570][25689] Fps is (10 sec: 5687.2, 60 sec: 5535.8, 300 sec: 5544.7). Total num frames: 1040454656. Throughput: 0: 5803.4. Samples: 1040452434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:49,570][25689] Avg episode reward: [(0, '1.880')] [2022-07-11 03:29:50,835][26022] Updated weights on worker 0-0, policy_version 1016076 (0.00101) [2022-07-11 03:29:52,598][26022] Updated weights on worker 0-0, policy_version 1016086 (0.00083) [2022-07-11 03:29:54,603][25689] Fps is (10 sec: 5586.1, 60 sec: 5517.0, 300 sec: 5531.3). Total num frames: 1040481280. Throughput: 0: 5807.5. Samples: 1040486066. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:54,605][25689] Avg episode reward: [(0, '1.722')] [2022-07-11 03:29:54,735][26022] Updated weights on worker 0-0, policy_version 1016096 (0.00087) [2022-07-11 03:29:56,371][26022] Updated weights on worker 0-0, policy_version 1016106 (0.00087) [2022-07-11 03:29:58,328][26022] Updated weights on worker 0-0, policy_version 1016116 (0.00090) [2022-07-11 03:29:59,693][25689] Fps is (10 sec: 5563.0, 60 sec: 5537.0, 300 sec: 5540.2). Total num frames: 1040510976. Throughput: 0: 5800.8. Samples: 1040519278. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:29:59,694][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 03:30:00,035][26022] Updated weights on worker 0-0, policy_version 1016126 (0.00094) [2022-07-11 03:30:02,172][26022] Updated weights on worker 0-0, policy_version 1016136 (0.00089) [2022-07-11 03:30:04,078][26022] Updated weights on worker 0-0, policy_version 1016146 (0.00089) [2022-07-11 03:30:04,725][25689] Fps is (10 sec: 5462.7, 60 sec: 5535.6, 300 sec: 5536.5). Total num frames: 1040536576. Throughput: 0: 5697.5. Samples: 1040533864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:04,726][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 03:30:05,998][26022] Updated weights on worker 0-0, policy_version 1016156 (0.00086) [2022-07-11 03:30:07,802][26022] Updated weights on worker 0-0, policy_version 1016166 (0.00083) [2022-07-11 03:30:09,785][25689] Fps is (10 sec: 5174.9, 60 sec: 5499.4, 300 sec: 5536.0). Total num frames: 1040563200. Throughput: 0: 5678.7. Samples: 1040567232. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:09,786][25689] Avg episode reward: [(0, '1.077')] [2022-07-11 03:30:09,800][26022] Updated weights on worker 0-0, policy_version 1016176 (0.00086) [2022-07-11 03:30:11,490][26022] Updated weights on worker 0-0, policy_version 1016186 (0.00088) [2022-07-11 03:30:13,250][26022] Updated weights on worker 0-0, policy_version 1016196 (0.00092) [2022-07-11 03:30:14,865][25689] Fps is (10 sec: 5554.2, 60 sec: 5533.8, 300 sec: 5539.1). Total num frames: 1040592896. Throughput: 0: 5670.3. Samples: 1040600960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:14,866][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 03:30:15,219][26022] Updated weights on worker 0-0, policy_version 1016206 (0.00096) [2022-07-11 03:30:17,039][26022] Updated weights on worker 0-0, policy_version 1016216 (0.00087) [2022-07-11 03:30:18,778][26022] Updated weights on worker 0-0, policy_version 1016226 (0.00104) [2022-07-11 03:30:19,909][25689] Fps is (10 sec: 5664.2, 60 sec: 5521.7, 300 sec: 5538.5). Total num frames: 1040620544. Throughput: 0: 4870.3. Samples: 1040617734. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:19,910][25689] Avg episode reward: [(0, '0.453')] [2022-07-11 03:30:21,036][26022] Updated weights on worker 0-0, policy_version 1016236 (0.00088) [2022-07-11 03:30:22,606][26022] Updated weights on worker 0-0, policy_version 1016246 (0.00092) [2022-07-11 03:30:24,424][26022] Updated weights on worker 0-0, policy_version 1016256 (0.00088) [2022-07-11 03:30:24,946][25689] Fps is (10 sec: 5587.0, 60 sec: 5526.8, 300 sec: 5537.9). Total num frames: 1040649216. Throughput: 0: 5781.3. Samples: 1040650764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:24,946][25689] Avg episode reward: [(0, '0.286')] [2022-07-11 03:30:26,535][26022] Updated weights on worker 0-0, policy_version 1016266 (0.00094) [2022-07-11 03:30:27,911][26022] Updated weights on worker 0-0, policy_version 1016276 (0.00095) [2022-07-11 03:30:29,980][25689] Fps is (10 sec: 5389.0, 60 sec: 5526.2, 300 sec: 5531.0). Total num frames: 1040674816. Throughput: 0: 5801.3. Samples: 1040684388. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:29,981][25689] Avg episode reward: [(0, '0.957')] [2022-07-11 03:30:30,268][26022] Updated weights on worker 0-0, policy_version 1016286 (0.00090) [2022-07-11 03:30:31,628][26022] Updated weights on worker 0-0, policy_version 1016296 (0.00094) [2022-07-11 03:30:33,705][26022] Updated weights on worker 0-0, policy_version 1016306 (0.00094) [2022-07-11 03:30:34,999][25689] Fps is (10 sec: 5500.4, 60 sec: 5507.8, 300 sec: 5536.4). Total num frames: 1040704512. Throughput: 0: 4982.5. Samples: 1040701276. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:35,000][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 03:30:35,457][26022] Updated weights on worker 0-0, policy_version 1016316 (0.00084) [2022-07-11 03:30:37,123][26022] Updated weights on worker 0-0, policy_version 1016326 (0.00085) [2022-07-11 03:30:39,295][26022] Updated weights on worker 0-0, policy_version 1016336 (0.00095) [2022-07-11 03:30:40,104][25689] Fps is (10 sec: 5866.5, 60 sec: 5557.7, 300 sec: 5538.1). Total num frames: 1040734208. Throughput: 0: 5809.8. Samples: 1040735062. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:40,104][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 03:30:40,706][26022] Updated weights on worker 0-0, policy_version 1016346 (0.00085) [2022-07-11 03:30:42,787][26022] Updated weights on worker 0-0, policy_version 1016356 (0.00089) [2022-07-11 03:30:44,692][26022] Updated weights on worker 0-0, policy_version 1016366 (0.00092) [2022-07-11 03:30:45,186][25689] Fps is (10 sec: 5629.3, 60 sec: 5553.4, 300 sec: 5540.5). Total num frames: 1040761856. Throughput: 0: 5834.9. Samples: 1040768862. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:45,186][25689] Avg episode reward: [(0, '0.521')] [2022-07-11 03:30:46,326][26022] Updated weights on worker 0-0, policy_version 1016376 (0.00092) [2022-07-11 03:30:48,200][26022] Updated weights on worker 0-0, policy_version 1016386 (0.00089) [2022-07-11 03:30:50,134][26022] Updated weights on worker 0-0, policy_version 1016396 (0.00092) [2022-07-11 03:30:50,231][25689] Fps is (10 sec: 5460.4, 60 sec: 5520.0, 300 sec: 5534.1). Total num frames: 1040789504. Throughput: 0: 5006.7. Samples: 1040785776. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:50,231][25689] Avg episode reward: [(0, '0.813')] [2022-07-11 03:30:51,754][26022] Updated weights on worker 0-0, policy_version 1016406 (0.00091) [2022-07-11 03:30:54,031][26022] Updated weights on worker 0-0, policy_version 1016416 (0.00091) [2022-07-11 03:30:55,245][25689] Fps is (10 sec: 5700.8, 60 sec: 5572.4, 300 sec: 5538.5). Total num frames: 1040819200. Throughput: 0: 5807.6. Samples: 1040818856. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:30:55,245][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 03:30:55,594][26022] Updated weights on worker 0-0, policy_version 1016426 (0.00089) [2022-07-11 03:30:57,521][26022] Updated weights on worker 0-0, policy_version 1016436 (0.00082) [2022-07-11 03:30:59,237][26022] Updated weights on worker 0-0, policy_version 1016446 (0.00332) [2022-07-11 03:31:00,305][25689] Fps is (10 sec: 5590.3, 60 sec: 5524.5, 300 sec: 5544.4). Total num frames: 1040845824. Throughput: 0: 5803.5. Samples: 1040852300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:00,306][25689] Avg episode reward: [(0, '0.943')] [2022-07-11 03:31:01,171][26022] Updated weights on worker 0-0, policy_version 1016456 (0.00091) [2022-07-11 03:31:03,360][26022] Updated weights on worker 0-0, policy_version 1016466 (0.00088) [2022-07-11 03:31:05,188][26022] Updated weights on worker 0-0, policy_version 1016476 (0.00091) [2022-07-11 03:31:05,349][25689] Fps is (10 sec: 5269.9, 60 sec: 5540.3, 300 sec: 5538.1). Total num frames: 1040872448. Throughput: 0: 4866.1. Samples: 1040866978. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:05,349][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 03:31:06,803][26022] Updated weights on worker 0-0, policy_version 1016486 (0.00085) [2022-07-11 03:31:08,918][26022] Updated weights on worker 0-0, policy_version 1016496 (0.00097) [2022-07-11 03:31:10,378][25689] Fps is (10 sec: 5489.4, 60 sec: 5576.9, 300 sec: 5538.4). Total num frames: 1040901120. Throughput: 0: 5718.2. Samples: 1040900986. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:10,380][25689] Avg episode reward: [(0, '1.441')] [2022-07-11 03:31:10,564][26022] Updated weights on worker 0-0, policy_version 1016506 (0.00089) [2022-07-11 03:31:12,447][26022] Updated weights on worker 0-0, policy_version 1016516 (0.00086) [2022-07-11 03:31:14,316][26022] Updated weights on worker 0-0, policy_version 1016526 (0.00092) [2022-07-11 03:31:15,456][25689] Fps is (10 sec: 5470.8, 60 sec: 5526.4, 300 sec: 5534.7). Total num frames: 1040927744. Throughput: 0: 5724.6. Samples: 1040934560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:15,458][25689] Avg episode reward: [(0, '1.747')] [2022-07-11 03:31:16,059][26022] Updated weights on worker 0-0, policy_version 1016536 (0.00103) [2022-07-11 03:31:17,879][26022] Updated weights on worker 0-0, policy_version 1016546 (0.00086) [2022-07-11 03:31:19,785][26022] Updated weights on worker 0-0, policy_version 1016556 (0.00088) [2022-07-11 03:31:20,583][25689] Fps is (10 sec: 5418.9, 60 sec: 5535.7, 300 sec: 5537.3). Total num frames: 1040956416. Throughput: 0: 4882.6. Samples: 1040951306. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:20,583][25689] Avg episode reward: [(0, '2.064')] [2022-07-11 03:31:21,620][26022] Updated weights on worker 0-0, policy_version 1016566 (0.00091) [2022-07-11 03:31:23,576][26022] Updated weights on worker 0-0, policy_version 1016576 (0.00080) [2022-07-11 03:31:25,055][26022] Updated weights on worker 0-0, policy_version 1016586 (0.00097) [2022-07-11 03:31:25,681][25689] Fps is (10 sec: 5808.8, 60 sec: 5563.8, 300 sec: 5544.1). Total num frames: 1040987136. Throughput: 0: 5814.7. Samples: 1040985202. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:25,681][25689] Avg episode reward: [(0, '2.103')] [2022-07-11 03:31:27,185][26022] Updated weights on worker 0-0, policy_version 1016596 (0.00533) [2022-07-11 03:31:28,714][26022] Updated weights on worker 0-0, policy_version 1016606 (0.00086) [2022-07-11 03:31:30,678][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:31:30,688][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001016615_1041013760.pth [2022-07-11 03:31:30,689][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001014666_1039017984.pth [2022-07-11 03:31:30,690][25689] Fps is (10 sec: 5673.5, 60 sec: 5583.0, 300 sec: 5533.9). Total num frames: 1041013760. Throughput: 0: 5791.0. Samples: 1041018610. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:30,690][25689] Avg episode reward: [(0, '2.231')] [2022-07-11 03:31:30,931][26022] Updated weights on worker 0-0, policy_version 1016616 (0.00083) [2022-07-11 03:31:32,671][26022] Updated weights on worker 0-0, policy_version 1016626 (0.00097) [2022-07-11 03:31:34,443][26022] Updated weights on worker 0-0, policy_version 1016636 (0.00086) [2022-07-11 03:31:35,738][25689] Fps is (10 sec: 5498.2, 60 sec: 5563.5, 300 sec: 5537.2). Total num frames: 1041042432. Throughput: 0: 4968.5. Samples: 1041035330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:35,739][25689] Avg episode reward: [(0, '2.090')] [2022-07-11 03:31:36,154][26022] Updated weights on worker 0-0, policy_version 1016646 (0.00084) [2022-07-11 03:31:38,311][26022] Updated weights on worker 0-0, policy_version 1016656 (0.00100) [2022-07-11 03:31:39,817][26022] Updated weights on worker 0-0, policy_version 1016666 (0.00089) [2022-07-11 03:31:40,846][25689] Fps is (10 sec: 5646.2, 60 sec: 5546.3, 300 sec: 5538.8). Total num frames: 1041071104. Throughput: 0: 5800.5. Samples: 1041068846. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:40,847][25689] Avg episode reward: [(0, '2.258')] [2022-07-11 03:31:41,991][26022] Updated weights on worker 0-0, policy_version 1016676 (0.00093) [2022-07-11 03:31:43,270][26022] Updated weights on worker 0-0, policy_version 1016686 (0.00104) [2022-07-11 03:31:45,539][26022] Updated weights on worker 0-0, policy_version 1016696 (0.00092) [2022-07-11 03:31:45,903][25689] Fps is (10 sec: 5741.8, 60 sec: 5582.3, 300 sec: 5544.7). Total num frames: 1041100800. Throughput: 0: 5814.5. Samples: 1041102786. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:45,904][25689] Avg episode reward: [(0, '2.021')] [2022-07-11 03:31:47,196][26022] Updated weights on worker 0-0, policy_version 1016706 (0.00088) [2022-07-11 03:31:49,064][26022] Updated weights on worker 0-0, policy_version 1016716 (0.00089) [2022-07-11 03:31:50,945][25689] Fps is (10 sec: 5475.7, 60 sec: 5548.9, 300 sec: 5530.6). Total num frames: 1041126400. Throughput: 0: 5819.3. Samples: 1041136478. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:50,945][25689] Avg episode reward: [(0, '1.960')] [2022-07-11 03:31:50,968][26022] Updated weights on worker 0-0, policy_version 1016726 (0.00096) [2022-07-11 03:31:52,604][26022] Updated weights on worker 0-0, policy_version 1016736 (0.00084) [2022-07-11 03:31:54,428][26022] Updated weights on worker 0-0, policy_version 1016746 (0.00086) [2022-07-11 03:31:55,950][25689] Fps is (10 sec: 5402.1, 60 sec: 5532.8, 300 sec: 5531.7). Total num frames: 1041155072. Throughput: 0: 5836.3. Samples: 1041153292. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:31:55,950][25689] Avg episode reward: [(0, '1.490')] [2022-07-11 03:31:56,350][26022] Updated weights on worker 0-0, policy_version 1016756 (0.00097) [2022-07-11 03:31:58,138][26022] Updated weights on worker 0-0, policy_version 1016766 (0.00089) [2022-07-11 03:32:00,223][26022] Updated weights on worker 0-0, policy_version 1016776 (0.00456) [2022-07-11 03:32:01,043][25689] Fps is (10 sec: 5780.2, 60 sec: 5580.5, 300 sec: 5547.3). Total num frames: 1041184768. Throughput: 0: 5824.4. Samples: 1041186476. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:32:01,043][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 03:32:02,427][26022] Updated weights on worker 0-0, policy_version 1016786 (0.00086) [2022-07-11 03:32:04,175][26022] Updated weights on worker 0-0, policy_version 1016796 (0.00084) [2022-07-11 03:32:05,979][26022] Updated weights on worker 0-0, policy_version 1016806 (0.00421) [2022-07-11 03:32:06,065][25689] Fps is (10 sec: 5466.7, 60 sec: 5565.6, 300 sec: 5536.7). Total num frames: 1041210368. Throughput: 0: 5715.9. Samples: 1041218026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:32:06,065][25689] Avg episode reward: [(0, '1.092')] [2022-07-11 03:32:07,765][26022] Updated weights on worker 0-0, policy_version 1016816 (0.00085) [2022-07-11 03:32:09,488][26022] Updated weights on worker 0-0, policy_version 1016826 (0.00092) [2022-07-11 03:32:11,121][25689] Fps is (10 sec: 5283.6, 60 sec: 5546.3, 300 sec: 5532.4). Total num frames: 1041238016. Throughput: 0: 4881.2. Samples: 1041234960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:32:11,122][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 03:32:11,581][26022] Updated weights on worker 0-0, policy_version 1016836 (0.00084) [2022-07-11 03:32:13,144][26022] Updated weights on worker 0-0, policy_version 1016846 (0.00088) [2022-07-11 03:32:15,255][26022] Updated weights on worker 0-0, policy_version 1016856 (0.00080) [2022-07-11 03:32:16,189][25689] Fps is (10 sec: 5462.0, 60 sec: 5564.1, 300 sec: 5536.2). Total num frames: 1041265664. Throughput: 0: 5690.7. Samples: 1041268466. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:32:16,189][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 03:32:16,795][26022] Updated weights on worker 0-0, policy_version 1016866 (0.00083) [2022-07-11 03:32:18,772][26022] Updated weights on worker 0-0, policy_version 1016876 (0.00096) [2022-07-11 03:32:20,483][26022] Updated weights on worker 0-0, policy_version 1016886 (0.00091) [2022-07-11 03:32:21,255][25689] Fps is (10 sec: 5658.1, 60 sec: 5586.4, 300 sec: 5538.6). Total num frames: 1041295360. Throughput: 0: 5724.9. Samples: 1041302192. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 03:32:21,256][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 03:32:22,454][26022] Updated weights on worker 0-0, policy_version 1016896 (0.00098) [2022-07-11 03:32:24,227][26022] Updated weights on worker 0-0, policy_version 1016906 (0.00082) [2022-07-11 03:32:26,114][26022] Updated weights on worker 0-0, policy_version 1016916 (0.00091) [2022-07-11 03:32:26,269][25689] Fps is (10 sec: 5587.1, 60 sec: 5526.6, 300 sec: 5531.8). Total num frames: 1041321984. Throughput: 0: 5000.5. Samples: 1041319056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:26,270][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 03:32:27,974][26022] Updated weights on worker 0-0, policy_version 1016926 (0.00083) [2022-07-11 03:32:29,803][26022] Updated weights on worker 0-0, policy_version 1016936 (0.00089) [2022-07-11 03:32:31,279][25689] Fps is (10 sec: 5516.9, 60 sec: 5560.4, 300 sec: 5539.0). Total num frames: 1041350656. Throughput: 0: 5824.1. Samples: 1041352362. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:31,281][25689] Avg episode reward: [(0, '0.232')] [2022-07-11 03:32:31,623][26022] Updated weights on worker 0-0, policy_version 1016946 (0.00086) [2022-07-11 03:32:33,445][26022] Updated weights on worker 0-0, policy_version 1016956 (0.00092) [2022-07-11 03:32:35,505][26022] Updated weights on worker 0-0, policy_version 1016966 (0.00087) [2022-07-11 03:32:36,306][25689] Fps is (10 sec: 5509.2, 60 sec: 5528.5, 300 sec: 5534.0). Total num frames: 1041377280. Throughput: 0: 5829.9. Samples: 1041385748. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:36,308][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 03:32:36,974][26022] Updated weights on worker 0-0, policy_version 1016976 (0.00087) [2022-07-11 03:32:39,323][26022] Updated weights on worker 0-0, policy_version 1016986 (0.00089) [2022-07-11 03:32:40,581][26022] Updated weights on worker 0-0, policy_version 1016996 (0.00080) [2022-07-11 03:32:41,430][25689] Fps is (10 sec: 5547.9, 60 sec: 5543.9, 300 sec: 5538.6). Total num frames: 1041406976. Throughput: 0: 4969.8. Samples: 1041402456. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:41,432][25689] Avg episode reward: [(0, '1.010')] [2022-07-11 03:32:42,920][26022] Updated weights on worker 0-0, policy_version 1017006 (0.00089) [2022-07-11 03:32:44,252][26022] Updated weights on worker 0-0, policy_version 1017016 (0.00078) [2022-07-11 03:32:46,301][26022] Updated weights on worker 0-0, policy_version 1017026 (0.00081) [2022-07-11 03:32:46,511][25689] Fps is (10 sec: 5719.7, 60 sec: 5524.9, 300 sec: 5544.0). Total num frames: 1041435648. Throughput: 0: 5782.6. Samples: 1041436106. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:46,512][25689] Avg episode reward: [(0, '0.926')] [2022-07-11 03:32:48,048][26022] Updated weights on worker 0-0, policy_version 1017036 (0.00084) [2022-07-11 03:32:49,794][26022] Updated weights on worker 0-0, policy_version 1017046 (0.00085) [2022-07-11 03:32:51,517][25689] Fps is (10 sec: 5583.8, 60 sec: 5561.9, 300 sec: 5537.8). Total num frames: 1041463296. Throughput: 0: 5828.1. Samples: 1041470312. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:51,523][25689] Avg episode reward: [(0, '0.769')] [2022-07-11 03:32:51,740][26022] Updated weights on worker 0-0, policy_version 1017056 (0.00092) [2022-07-11 03:32:53,598][26022] Updated weights on worker 0-0, policy_version 1017066 (0.00087) [2022-07-11 03:32:55,299][26022] Updated weights on worker 0-0, policy_version 1017076 (0.00084) [2022-07-11 03:32:56,587][25689] Fps is (10 sec: 5487.8, 60 sec: 5539.0, 300 sec: 5538.5). Total num frames: 1041490944. Throughput: 0: 5000.5. Samples: 1041487162. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:32:56,588][25689] Avg episode reward: [(0, '1.972')] [2022-07-11 03:32:57,283][26022] Updated weights on worker 0-0, policy_version 1017086 (0.00095) [2022-07-11 03:32:59,060][26022] Updated weights on worker 0-0, policy_version 1017096 (0.00085) [2022-07-11 03:33:00,837][26022] Updated weights on worker 0-0, policy_version 1017106 (0.00089) [2022-07-11 03:33:01,651][25689] Fps is (10 sec: 5557.0, 60 sec: 5524.7, 300 sec: 5544.5). Total num frames: 1041519616. Throughput: 0: 5844.8. Samples: 1041520646. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:01,652][25689] Avg episode reward: [(0, '1.975')] [2022-07-11 03:33:03,289][26022] Updated weights on worker 0-0, policy_version 1017116 (0.00084) [2022-07-11 03:33:04,763][26022] Updated weights on worker 0-0, policy_version 1017126 (0.00088) [2022-07-11 03:33:06,655][25689] Fps is (10 sec: 5492.2, 60 sec: 5543.4, 300 sec: 5541.1). Total num frames: 1041546240. Throughput: 0: 5764.2. Samples: 1041552222. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:06,657][25689] Avg episode reward: [(0, '1.835')] [2022-07-11 03:33:06,902][26022] Updated weights on worker 0-0, policy_version 1017136 (0.00093) [2022-07-11 03:33:08,591][26022] Updated weights on worker 0-0, policy_version 1017146 (0.00089) [2022-07-11 03:33:10,503][26022] Updated weights on worker 0-0, policy_version 1017156 (0.00094) [2022-07-11 03:33:11,721][25689] Fps is (10 sec: 5491.4, 60 sec: 5559.3, 300 sec: 5540.6). Total num frames: 1041574912. Throughput: 0: 4888.5. Samples: 1041569082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:11,722][25689] Avg episode reward: [(0, '1.386')] [2022-07-11 03:33:12,255][26022] Updated weights on worker 0-0, policy_version 1017166 (0.00082) [2022-07-11 03:33:14,066][26022] Updated weights on worker 0-0, policy_version 1017176 (0.00085) [2022-07-11 03:33:15,842][26022] Updated weights on worker 0-0, policy_version 1017186 (0.00094) [2022-07-11 03:33:16,734][25689] Fps is (10 sec: 5486.3, 60 sec: 5547.5, 300 sec: 5539.3). Total num frames: 1041601536. Throughput: 0: 5729.9. Samples: 1041602602. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:16,734][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 03:33:17,971][26022] Updated weights on worker 0-0, policy_version 1017196 (0.00081) [2022-07-11 03:33:19,621][26022] Updated weights on worker 0-0, policy_version 1017206 (0.00107) [2022-07-11 03:33:21,362][26022] Updated weights on worker 0-0, policy_version 1017216 (0.00101) [2022-07-11 03:33:21,781][25689] Fps is (10 sec: 5598.2, 60 sec: 5549.2, 300 sec: 5542.4). Total num frames: 1041631232. Throughput: 0: 5743.1. Samples: 1041636254. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:21,783][25689] Avg episode reward: [(0, '0.685')] [2022-07-11 03:33:23,451][26022] Updated weights on worker 0-0, policy_version 1017226 (0.00093) [2022-07-11 03:33:24,996][26022] Updated weights on worker 0-0, policy_version 1017236 (0.00086) [2022-07-11 03:33:26,803][25689] Fps is (10 sec: 5491.8, 60 sec: 5531.6, 300 sec: 5538.9). Total num frames: 1041656832. Throughput: 0: 5005.1. Samples: 1041653064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:26,805][25689] Avg episode reward: [(0, '0.281')] [2022-07-11 03:33:27,113][26022] Updated weights on worker 0-0, policy_version 1017246 (0.00083) [2022-07-11 03:33:28,703][26022] Updated weights on worker 0-0, policy_version 1017256 (0.00078) [2022-07-11 03:33:30,785][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:33:30,798][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001017266_1041680384.pth [2022-07-11 03:33:30,799][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001015317_1039684608.pth [2022-07-11 03:33:30,801][26022] Updated weights on worker 0-0, policy_version 1017266 (0.00095) [2022-07-11 03:33:31,808][25689] Fps is (10 sec: 5514.7, 60 sec: 5548.9, 300 sec: 5547.5). Total num frames: 1041686528. Throughput: 0: 5838.0. Samples: 1041686352. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:31,810][25689] Avg episode reward: [(0, '0.545')] [2022-07-11 03:33:32,712][26022] Updated weights on worker 0-0, policy_version 1017276 (0.00089) [2022-07-11 03:33:34,495][26022] Updated weights on worker 0-0, policy_version 1017286 (0.00088) [2022-07-11 03:33:36,095][26022] Updated weights on worker 0-0, policy_version 1017296 (0.00094) [2022-07-11 03:33:36,835][25689] Fps is (10 sec: 5715.8, 60 sec: 5565.8, 300 sec: 5539.2). Total num frames: 1041714176. Throughput: 0: 5829.3. Samples: 1041719780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:36,837][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 03:33:38,086][26022] Updated weights on worker 0-0, policy_version 1017306 (0.00082) [2022-07-11 03:33:39,786][26022] Updated weights on worker 0-0, policy_version 1017316 (0.00106) [2022-07-11 03:33:41,762][26022] Updated weights on worker 0-0, policy_version 1017326 (0.00089) [2022-07-11 03:33:41,929][25689] Fps is (10 sec: 5564.8, 60 sec: 5551.7, 300 sec: 5544.9). Total num frames: 1041742848. Throughput: 0: 4973.3. Samples: 1041736454. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:41,930][25689] Avg episode reward: [(0, '0.432')] [2022-07-11 03:33:43,530][26022] Updated weights on worker 0-0, policy_version 1017336 (0.00090) [2022-07-11 03:33:45,596][26022] Updated weights on worker 0-0, policy_version 1017346 (0.00085) [2022-07-11 03:33:46,936][25689] Fps is (10 sec: 5575.6, 60 sec: 5541.5, 300 sec: 5541.9). Total num frames: 1041770496. Throughput: 0: 5814.0. Samples: 1041770122. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:46,937][25689] Avg episode reward: [(0, '0.296')] [2022-07-11 03:33:47,275][26022] Updated weights on worker 0-0, policy_version 1017356 (0.00085) [2022-07-11 03:33:49,143][26022] Updated weights on worker 0-0, policy_version 1017366 (0.00054) [2022-07-11 03:33:50,883][26022] Updated weights on worker 0-0, policy_version 1017376 (0.00094) [2022-07-11 03:33:51,953][25689] Fps is (10 sec: 5516.6, 60 sec: 5540.5, 300 sec: 5541.9). Total num frames: 1041798144. Throughput: 0: 5815.3. Samples: 1041803498. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:51,954][25689] Avg episode reward: [(0, '0.963')] [2022-07-11 03:33:52,809][26022] Updated weights on worker 0-0, policy_version 1017386 (0.00088) [2022-07-11 03:33:54,626][26022] Updated weights on worker 0-0, policy_version 1017396 (0.00083) [2022-07-11 03:33:56,654][26022] Updated weights on worker 0-0, policy_version 1017406 (0.00088) [2022-07-11 03:33:56,967][25689] Fps is (10 sec: 5512.5, 60 sec: 5545.6, 300 sec: 5540.5). Total num frames: 1041825792. Throughput: 0: 4992.0. Samples: 1041820280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:33:56,968][25689] Avg episode reward: [(0, '0.917')] [2022-07-11 03:33:58,298][26022] Updated weights on worker 0-0, policy_version 1017416 (0.00085) [2022-07-11 03:34:00,371][26022] Updated weights on worker 0-0, policy_version 1017426 (0.00089) [2022-07-11 03:34:01,887][26022] Updated weights on worker 0-0, policy_version 1017436 (0.00087) [2022-07-11 03:34:02,036][25689] Fps is (10 sec: 5686.9, 60 sec: 5562.2, 300 sec: 5553.2). Total num frames: 1041855488. Throughput: 0: 5820.3. Samples: 1041853484. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:02,037][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 03:34:04,465][26022] Updated weights on worker 0-0, policy_version 1017446 (0.00086) [2022-07-11 03:34:05,720][26022] Updated weights on worker 0-0, policy_version 1017456 (0.00092) [2022-07-11 03:34:07,098][25689] Fps is (10 sec: 5458.4, 60 sec: 5539.9, 300 sec: 5542.4). Total num frames: 1041881088. Throughput: 0: 5712.9. Samples: 1041885302. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:07,098][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 03:34:08,187][26022] Updated weights on worker 0-0, policy_version 1017466 (0.00098) [2022-07-11 03:34:09,439][26022] Updated weights on worker 0-0, policy_version 1017476 (0.00961) [2022-07-11 03:34:11,668][26022] Updated weights on worker 0-0, policy_version 1017486 (0.00086) [2022-07-11 03:34:12,175][25689] Fps is (10 sec: 5251.9, 60 sec: 5521.9, 300 sec: 5542.6). Total num frames: 1041908736. Throughput: 0: 4875.5. Samples: 1041902092. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:12,176][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 03:34:13,194][26022] Updated weights on worker 0-0, policy_version 1017496 (0.00088) [2022-07-11 03:34:15,227][26022] Updated weights on worker 0-0, policy_version 1017506 (0.00091) [2022-07-11 03:34:16,804][26022] Updated weights on worker 0-0, policy_version 1017516 (0.00084) [2022-07-11 03:34:17,192][25689] Fps is (10 sec: 5579.5, 60 sec: 5555.4, 300 sec: 5544.0). Total num frames: 1041937408. Throughput: 0: 5712.9. Samples: 1041935822. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:17,194][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 03:34:18,731][26022] Updated weights on worker 0-0, policy_version 1017526 (0.00090) [2022-07-11 03:34:20,677][26022] Updated weights on worker 0-0, policy_version 1017536 (0.00090) [2022-07-11 03:34:22,270][25689] Fps is (10 sec: 5680.6, 60 sec: 5535.7, 300 sec: 5544.3). Total num frames: 1041966080. Throughput: 0: 5732.6. Samples: 1041969476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:22,270][25689] Avg episode reward: [(0, '1.552')] [2022-07-11 03:34:22,415][26022] Updated weights on worker 0-0, policy_version 1017546 (0.00084) [2022-07-11 03:34:24,259][26022] Updated weights on worker 0-0, policy_version 1017556 (0.00089) [2022-07-11 03:34:25,900][26022] Updated weights on worker 0-0, policy_version 1017566 (0.00088) [2022-07-11 03:34:27,279][25689] Fps is (10 sec: 5583.6, 60 sec: 5570.7, 300 sec: 5551.6). Total num frames: 1041993728. Throughput: 0: 5834.1. Samples: 1042003038. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:27,279][25689] Avg episode reward: [(0, '1.829')] [2022-07-11 03:34:27,977][26022] Updated weights on worker 0-0, policy_version 1017576 (0.00095) [2022-07-11 03:34:29,787][26022] Updated weights on worker 0-0, policy_version 1017586 (0.00086) [2022-07-11 03:34:31,756][26022] Updated weights on worker 0-0, policy_version 1017596 (0.00087) [2022-07-11 03:34:32,291][25689] Fps is (10 sec: 5518.1, 60 sec: 5536.3, 300 sec: 5541.1). Total num frames: 1042021376. Throughput: 0: 5833.7. Samples: 1042019440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:32,291][25689] Avg episode reward: [(0, '1.703')] [2022-07-11 03:34:33,497][26022] Updated weights on worker 0-0, policy_version 1017606 (0.00086) [2022-07-11 03:34:35,362][26022] Updated weights on worker 0-0, policy_version 1017616 (0.00082) [2022-07-11 03:34:37,076][26022] Updated weights on worker 0-0, policy_version 1017626 (0.00086) [2022-07-11 03:34:37,306][25689] Fps is (10 sec: 5616.9, 60 sec: 5554.3, 300 sec: 5549.5). Total num frames: 1042050048. Throughput: 0: 5822.4. Samples: 1042052930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:37,306][25689] Avg episode reward: [(0, '2.004')] [2022-07-11 03:34:38,982][26022] Updated weights on worker 0-0, policy_version 1017636 (0.00093) [2022-07-11 03:34:40,939][26022] Updated weights on worker 0-0, policy_version 1017646 (0.00080) [2022-07-11 03:34:42,356][25689] Fps is (10 sec: 5595.3, 60 sec: 5541.3, 300 sec: 5549.2). Total num frames: 1042077696. Throughput: 0: 5812.6. Samples: 1042086230. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:42,357][25689] Avg episode reward: [(0, '2.247')] [2022-07-11 03:34:42,756][26022] Updated weights on worker 0-0, policy_version 1017656 (0.00091) [2022-07-11 03:34:44,703][26022] Updated weights on worker 0-0, policy_version 1017666 (0.00084) [2022-07-11 03:34:46,242][26022] Updated weights on worker 0-0, policy_version 1017676 (0.00090) [2022-07-11 03:34:47,386][25689] Fps is (10 sec: 5383.9, 60 sec: 5522.4, 300 sec: 5539.2). Total num frames: 1042104320. Throughput: 0: 4977.4. Samples: 1042103120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:47,386][25689] Avg episode reward: [(0, '2.253')] [2022-07-11 03:34:48,178][26022] Updated weights on worker 0-0, policy_version 1017686 (0.00081) [2022-07-11 03:34:49,972][26022] Updated weights on worker 0-0, policy_version 1017696 (0.00164) [2022-07-11 03:34:51,867][26022] Updated weights on worker 0-0, policy_version 1017706 (0.00086) [2022-07-11 03:34:52,404][25689] Fps is (10 sec: 5503.4, 60 sec: 5539.1, 300 sec: 5546.4). Total num frames: 1042132992. Throughput: 0: 5843.8. Samples: 1042136978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:52,404][25689] Avg episode reward: [(0, '2.078')] [2022-07-11 03:34:53,483][26022] Updated weights on worker 0-0, policy_version 1017716 (0.00086) [2022-07-11 03:34:55,632][26022] Updated weights on worker 0-0, policy_version 1017726 (0.00053) [2022-07-11 03:34:57,084][26022] Updated weights on worker 0-0, policy_version 1017736 (0.00091) [2022-07-11 03:34:57,435][25689] Fps is (10 sec: 5808.5, 60 sec: 5571.5, 300 sec: 5547.5). Total num frames: 1042162688. Throughput: 0: 5836.1. Samples: 1042170406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:34:57,435][25689] Avg episode reward: [(0, '2.071')] [2022-07-11 03:34:59,275][26022] Updated weights on worker 0-0, policy_version 1017746 (0.00078) [2022-07-11 03:35:00,799][26022] Updated weights on worker 0-0, policy_version 1017756 (0.00079) [2022-07-11 03:35:02,480][25689] Fps is (10 sec: 5182.8, 60 sec: 5455.1, 300 sec: 5536.9). Total num frames: 1042185216. Throughput: 0: 5018.1. Samples: 1042187214. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:02,481][25689] Avg episode reward: [(0, '1.580')] [2022-07-11 03:35:03,241][26022] Updated weights on worker 0-0, policy_version 1017766 (0.00084) [2022-07-11 03:35:04,977][26022] Updated weights on worker 0-0, policy_version 1017776 (0.00084) [2022-07-11 03:35:06,813][26022] Updated weights on worker 0-0, policy_version 1017786 (0.00085) [2022-07-11 03:35:07,512][25689] Fps is (10 sec: 5284.0, 60 sec: 5542.6, 300 sec: 5551.2). Total num frames: 1042215936. Throughput: 0: 5737.2. Samples: 1042218586. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:07,512][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 03:35:08,575][26022] Updated weights on worker 0-0, policy_version 1017796 (0.00086) [2022-07-11 03:35:10,667][26022] Updated weights on worker 0-0, policy_version 1017806 (0.00092) [2022-07-11 03:35:12,249][26022] Updated weights on worker 0-0, policy_version 1017816 (0.00089) [2022-07-11 03:35:12,522][25689] Fps is (10 sec: 5914.7, 60 sec: 5565.7, 300 sec: 5549.1). Total num frames: 1042244608. Throughput: 0: 5752.9. Samples: 1042252714. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:12,522][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 03:35:14,240][26022] Updated weights on worker 0-0, policy_version 1017826 (0.00087) [2022-07-11 03:35:15,979][26022] Updated weights on worker 0-0, policy_version 1017836 (0.00090) [2022-07-11 03:35:17,543][25689] Fps is (10 sec: 5716.5, 60 sec: 5565.3, 300 sec: 5552.9). Total num frames: 1042273280. Throughput: 0: 4935.0. Samples: 1042269642. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:17,544][25689] Avg episode reward: [(0, '0.888')] [2022-07-11 03:35:17,688][26022] Updated weights on worker 0-0, policy_version 1017846 (0.00091) [2022-07-11 03:35:19,753][26022] Updated weights on worker 0-0, policy_version 1017856 (0.00094) [2022-07-11 03:35:21,335][26022] Updated weights on worker 0-0, policy_version 1017866 (0.00086) [2022-07-11 03:35:22,576][25689] Fps is (10 sec: 5499.6, 60 sec: 5535.5, 300 sec: 5546.1). Total num frames: 1042299904. Throughput: 0: 5784.5. Samples: 1042303460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:22,577][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 03:35:23,365][26022] Updated weights on worker 0-0, policy_version 1017876 (0.00085) [2022-07-11 03:35:25,215][26022] Updated weights on worker 0-0, policy_version 1017886 (0.00086) [2022-07-11 03:35:26,828][26022] Updated weights on worker 0-0, policy_version 1017896 (0.00099) [2022-07-11 03:35:27,578][25689] Fps is (10 sec: 5714.6, 60 sec: 5587.1, 300 sec: 5563.9). Total num frames: 1042330624. Throughput: 0: 5916.1. Samples: 1042337298. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:27,581][25689] Avg episode reward: [(0, '0.689')] [2022-07-11 03:35:28,795][26022] Updated weights on worker 0-0, policy_version 1017906 (0.00083) [2022-07-11 03:35:30,435][26022] Updated weights on worker 0-0, policy_version 1017916 (0.00086) [2022-07-11 03:35:30,816][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:35:30,839][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001017918_1042348032.pth [2022-07-11 03:35:30,843][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001015965_1040348160.pth [2022-07-11 03:35:32,478][26022] Updated weights on worker 0-0, policy_version 1017926 (0.00086) [2022-07-11 03:35:32,603][25689] Fps is (10 sec: 5617.2, 60 sec: 5551.9, 300 sec: 5550.0). Total num frames: 1042356224. Throughput: 0: 5049.5. Samples: 1042354112. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:32,603][25689] Avg episode reward: [(0, '0.782')] [2022-07-11 03:35:34,099][26022] Updated weights on worker 0-0, policy_version 1017936 (0.00081) [2022-07-11 03:35:35,844][26022] Updated weights on worker 0-0, policy_version 1017946 (0.00088) [2022-07-11 03:35:37,587][26022] Updated weights on worker 0-0, policy_version 1017956 (0.00082) [2022-07-11 03:35:37,635][25689] Fps is (10 sec: 5600.1, 60 sec: 5584.3, 300 sec: 5554.8). Total num frames: 1042386944. Throughput: 0: 5912.0. Samples: 1042388424. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:37,635][25689] Avg episode reward: [(0, '0.823')] [2022-07-11 03:35:39,867][26022] Updated weights on worker 0-0, policy_version 1017966 (0.00081) [2022-07-11 03:35:41,313][26022] Updated weights on worker 0-0, policy_version 1017976 (0.00080) [2022-07-11 03:35:42,764][25689] Fps is (10 sec: 5643.3, 60 sec: 5560.1, 300 sec: 5550.5). Total num frames: 1042413568. Throughput: 0: 5871.5. Samples: 1042421992. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:42,765][25689] Avg episode reward: [(0, '-0.523')] [2022-07-11 03:35:43,340][26022] Updated weights on worker 0-0, policy_version 1017986 (0.00087) [2022-07-11 03:35:44,869][26022] Updated weights on worker 0-0, policy_version 1017996 (0.00089) [2022-07-11 03:35:47,068][26022] Updated weights on worker 0-0, policy_version 1018006 (0.00084) [2022-07-11 03:35:47,767][25689] Fps is (10 sec: 5558.7, 60 sec: 5613.5, 300 sec: 5558.2). Total num frames: 1042443264. Throughput: 0: 5021.2. Samples: 1042438670. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:47,767][25689] Avg episode reward: [(0, '-0.332')] [2022-07-11 03:35:48,692][26022] Updated weights on worker 0-0, policy_version 1018016 (0.00089) [2022-07-11 03:35:50,578][26022] Updated weights on worker 0-0, policy_version 1018026 (0.00085) [2022-07-11 03:35:52,382][26022] Updated weights on worker 0-0, policy_version 1018036 (0.00091) [2022-07-11 03:35:52,770][25689] Fps is (10 sec: 5628.9, 60 sec: 5580.9, 300 sec: 5548.1). Total num frames: 1042469888. Throughput: 0: 5860.6. Samples: 1042472302. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:52,770][25689] Avg episode reward: [(0, '0.620')] [2022-07-11 03:35:54,269][26022] Updated weights on worker 0-0, policy_version 1018046 (0.00086) [2022-07-11 03:35:56,113][26022] Updated weights on worker 0-0, policy_version 1018056 (0.00090) [2022-07-11 03:35:57,846][25689] Fps is (10 sec: 5485.9, 60 sec: 5559.7, 300 sec: 5554.6). Total num frames: 1042498560. Throughput: 0: 5820.2. Samples: 1042506060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:35:57,847][25689] Avg episode reward: [(0, '0.256')] [2022-07-11 03:35:57,999][26022] Updated weights on worker 0-0, policy_version 1018066 (0.00083) [2022-07-11 03:35:59,744][26022] Updated weights on worker 0-0, policy_version 1018076 (0.00091) [2022-07-11 03:36:01,875][26022] Updated weights on worker 0-0, policy_version 1018086 (0.00083) [2022-07-11 03:36:02,937][25689] Fps is (10 sec: 5438.8, 60 sec: 5623.4, 300 sec: 5553.8). Total num frames: 1042525184. Throughput: 0: 4994.4. Samples: 1042522742. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:36:02,937][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 03:36:03,824][26022] Updated weights on worker 0-0, policy_version 1018096 (0.00090) [2022-07-11 03:36:05,621][26022] Updated weights on worker 0-0, policy_version 1018106 (0.00091) [2022-07-11 03:36:07,390][26022] Updated weights on worker 0-0, policy_version 1018116 (0.00092) [2022-07-11 03:36:07,948][25689] Fps is (10 sec: 5473.8, 60 sec: 5591.3, 300 sec: 5554.1). Total num frames: 1042553856. Throughput: 0: 5751.0. Samples: 1042554734. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:36:07,949][25689] Avg episode reward: [(0, '0.848')] [2022-07-11 03:36:09,161][26022] Updated weights on worker 0-0, policy_version 1018126 (0.00087) [2022-07-11 03:36:10,827][26022] Updated weights on worker 0-0, policy_version 1018136 (0.00077) [2022-07-11 03:36:13,021][25689] Fps is (10 sec: 5483.6, 60 sec: 5551.7, 300 sec: 5554.2). Total num frames: 1042580480. Throughput: 0: 5741.5. Samples: 1042588572. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:36:13,022][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 03:36:13,032][26022] Updated weights on worker 0-0, policy_version 1018146 (0.00084) [2022-07-11 03:36:14,579][26022] Updated weights on worker 0-0, policy_version 1018156 (0.00089) [2022-07-11 03:36:16,603][26022] Updated weights on worker 0-0, policy_version 1018166 (0.00086) [2022-07-11 03:36:18,007][26022] Updated weights on worker 0-0, policy_version 1018176 (0.00079) [2022-07-11 03:36:18,103][25689] Fps is (10 sec: 5747.7, 60 sec: 5596.8, 300 sec: 5565.4). Total num frames: 1042612224. Throughput: 0: 4897.5. Samples: 1042605268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:36:18,104][25689] Avg episode reward: [(0, '1.283')] [2022-07-11 03:36:20,141][26022] Updated weights on worker 0-0, policy_version 1018186 (0.00087) [2022-07-11 03:36:21,866][26022] Updated weights on worker 0-0, policy_version 1018196 (0.00088) [2022-07-11 03:36:23,191][25689] Fps is (10 sec: 5738.9, 60 sec: 5591.8, 300 sec: 5551.8). Total num frames: 1042638848. Throughput: 0: 5770.1. Samples: 1042639612. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2022-07-11 03:36:23,192][25689] Avg episode reward: [(0, '1.146')] [2022-07-11 03:36:23,649][26022] Updated weights on worker 0-0, policy_version 1018206 (0.00089) [2022-07-11 03:36:25,755][26022] Updated weights on worker 0-0, policy_version 1018216 (0.00105) [2022-07-11 03:36:27,190][26022] Updated weights on worker 0-0, policy_version 1018226 (0.00054) [2022-07-11 03:36:28,193][25689] Fps is (10 sec: 5480.6, 60 sec: 5558.0, 300 sec: 5558.8). Total num frames: 1042667520. Throughput: 0: 5860.1. Samples: 1042673368. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:28,193][25689] Avg episode reward: [(0, '1.330')] [2022-07-11 03:36:29,276][26022] Updated weights on worker 0-0, policy_version 1018236 (0.00091) [2022-07-11 03:36:30,975][26022] Updated weights on worker 0-0, policy_version 1018246 (0.00093) [2022-07-11 03:36:32,928][26022] Updated weights on worker 0-0, policy_version 1018256 (0.00086) [2022-07-11 03:36:33,221][25689] Fps is (10 sec: 5717.2, 60 sec: 5608.3, 300 sec: 5559.2). Total num frames: 1042696192. Throughput: 0: 5018.0. Samples: 1042689938. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:33,222][25689] Avg episode reward: [(0, '0.901')] [2022-07-11 03:36:34,653][26022] Updated weights on worker 0-0, policy_version 1018266 (0.00087) [2022-07-11 03:36:36,436][26022] Updated weights on worker 0-0, policy_version 1018276 (0.00093) [2022-07-11 03:36:38,197][26022] Updated weights on worker 0-0, policy_version 1018286 (0.00088) [2022-07-11 03:36:38,313][25689] Fps is (10 sec: 5666.3, 60 sec: 5569.1, 300 sec: 5559.5). Total num frames: 1042724864. Throughput: 0: 5863.9. Samples: 1042723776. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:38,313][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 03:36:40,433][26022] Updated weights on worker 0-0, policy_version 1018296 (0.00084) [2022-07-11 03:36:41,990][26022] Updated weights on worker 0-0, policy_version 1018306 (0.00087) [2022-07-11 03:36:43,389][25689] Fps is (10 sec: 5740.3, 60 sec: 5624.6, 300 sec: 5559.1). Total num frames: 1042754560. Throughput: 0: 5834.7. Samples: 1042757462. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:43,390][25689] Avg episode reward: [(0, '1.235')] [2022-07-11 03:36:43,679][26022] Updated weights on worker 0-0, policy_version 1018316 (0.00113) [2022-07-11 03:36:45,623][26022] Updated weights on worker 0-0, policy_version 1018326 (0.00093) [2022-07-11 03:36:47,669][26022] Updated weights on worker 0-0, policy_version 1018336 (0.00091) [2022-07-11 03:36:48,446][25689] Fps is (10 sec: 5557.8, 60 sec: 5568.9, 300 sec: 5562.3). Total num frames: 1042781184. Throughput: 0: 4983.4. Samples: 1042774300. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:48,447][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 03:36:49,236][26022] Updated weights on worker 0-0, policy_version 1018346 (0.00084) [2022-07-11 03:36:51,163][26022] Updated weights on worker 0-0, policy_version 1018356 (0.00086) [2022-07-11 03:36:52,863][26022] Updated weights on worker 0-0, policy_version 1018366 (0.00086) [2022-07-11 03:36:53,467][25689] Fps is (10 sec: 5588.7, 60 sec: 5617.9, 300 sec: 5565.4). Total num frames: 1042810880. Throughput: 0: 5840.3. Samples: 1042808178. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:53,467][25689] Avg episode reward: [(0, '1.230')] [2022-07-11 03:36:54,624][26022] Updated weights on worker 0-0, policy_version 1018376 (0.00094) [2022-07-11 03:36:56,569][26022] Updated weights on worker 0-0, policy_version 1018386 (0.00088) [2022-07-11 03:36:58,384][26022] Updated weights on worker 0-0, policy_version 1018396 (0.00084) [2022-07-11 03:36:58,483][25689] Fps is (10 sec: 5611.2, 60 sec: 5589.7, 300 sec: 5556.5). Total num frames: 1042837504. Throughput: 0: 5860.8. Samples: 1042841992. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:36:58,484][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 03:37:00,220][26022] Updated weights on worker 0-0, policy_version 1018406 (0.00083) [2022-07-11 03:37:02,324][26022] Updated weights on worker 0-0, policy_version 1018416 (0.00096) [2022-07-11 03:37:03,539][25689] Fps is (10 sec: 5185.1, 60 sec: 5576.1, 300 sec: 5555.9). Total num frames: 1042863104. Throughput: 0: 5754.4. Samples: 1042873410. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:03,539][25689] Avg episode reward: [(0, '2.014')] [2022-07-11 03:37:04,203][26022] Updated weights on worker 0-0, policy_version 1018426 (0.00084) [2022-07-11 03:37:06,171][26022] Updated weights on worker 0-0, policy_version 1018436 (0.00083) [2022-07-11 03:37:07,839][26022] Updated weights on worker 0-0, policy_version 1018446 (0.00086) [2022-07-11 03:37:08,555][25689] Fps is (10 sec: 5591.9, 60 sec: 5609.4, 300 sec: 5567.0). Total num frames: 1042893824. Throughput: 0: 5774.3. Samples: 1042890414. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:08,556][25689] Avg episode reward: [(0, '2.135')] [2022-07-11 03:37:09,871][26022] Updated weights on worker 0-0, policy_version 1018456 (0.00087) [2022-07-11 03:37:11,360][26022] Updated weights on worker 0-0, policy_version 1018466 (0.00091) [2022-07-11 03:37:13,427][26022] Updated weights on worker 0-0, policy_version 1018476 (0.00085) [2022-07-11 03:37:13,570][25689] Fps is (10 sec: 5716.5, 60 sec: 5614.7, 300 sec: 5564.5). Total num frames: 1042920448. Throughput: 0: 5779.4. Samples: 1042924362. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:13,571][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 03:37:15,084][26022] Updated weights on worker 0-0, policy_version 1018486 (0.00086) [2022-07-11 03:37:16,882][26022] Updated weights on worker 0-0, policy_version 1018496 (0.00086) [2022-07-11 03:37:18,573][25689] Fps is (10 sec: 5417.4, 60 sec: 5554.3, 300 sec: 5558.8). Total num frames: 1042948096. Throughput: 0: 5793.6. Samples: 1042958384. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:18,574][25689] Avg episode reward: [(0, '1.114')] [2022-07-11 03:37:18,704][26022] Updated weights on worker 0-0, policy_version 1018506 (0.00088) [2022-07-11 03:37:20,398][26022] Updated weights on worker 0-0, policy_version 1018516 (0.00085) [2022-07-11 03:37:22,565][26022] Updated weights on worker 0-0, policy_version 1018526 (0.00084) [2022-07-11 03:37:23,644][25689] Fps is (10 sec: 5692.2, 60 sec: 5606.7, 300 sec: 5568.1). Total num frames: 1042977792. Throughput: 0: 5054.6. Samples: 1042975036. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:23,645][25689] Avg episode reward: [(0, '0.183')] [2022-07-11 03:37:24,216][26022] Updated weights on worker 0-0, policy_version 1018536 (0.00054) [2022-07-11 03:37:25,982][26022] Updated weights on worker 0-0, policy_version 1018546 (0.00101) [2022-07-11 03:37:28,007][26022] Updated weights on worker 0-0, policy_version 1018556 (0.00093) [2022-07-11 03:37:28,745][25689] Fps is (10 sec: 5637.8, 60 sec: 5580.7, 300 sec: 5562.9). Total num frames: 1043005440. Throughput: 0: 5869.1. Samples: 1043008908. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:28,746][25689] Avg episode reward: [(0, '0.063')] [2022-07-11 03:37:29,591][26022] Updated weights on worker 0-0, policy_version 1018566 (0.00085) [2022-07-11 03:37:30,889][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:37:30,901][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001018572_1043017728.pth [2022-07-11 03:37:30,902][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001016615_1041013760.pth [2022-07-11 03:37:31,621][26022] Updated weights on worker 0-0, policy_version 1018576 (0.00085) [2022-07-11 03:37:33,439][26022] Updated weights on worker 0-0, policy_version 1018586 (0.00081) [2022-07-11 03:37:33,763][25689] Fps is (10 sec: 5464.8, 60 sec: 5564.7, 300 sec: 5566.6). Total num frames: 1043033088. Throughput: 0: 5841.8. Samples: 1043042324. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:33,765][25689] Avg episode reward: [(0, '-0.227')] [2022-07-11 03:37:35,151][26022] Updated weights on worker 0-0, policy_version 1018596 (0.00091) [2022-07-11 03:37:36,938][26022] Updated weights on worker 0-0, policy_version 1018606 (0.00084) [2022-07-11 03:37:38,797][25689] Fps is (10 sec: 5602.5, 60 sec: 5570.0, 300 sec: 5564.8). Total num frames: 1043061760. Throughput: 0: 4991.7. Samples: 1043059336. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:38,798][25689] Avg episode reward: [(0, '-0.344')] [2022-07-11 03:37:38,853][26022] Updated weights on worker 0-0, policy_version 1018616 (0.00084) [2022-07-11 03:37:40,655][26022] Updated weights on worker 0-0, policy_version 1018626 (0.00098) [2022-07-11 03:37:42,545][26022] Updated weights on worker 0-0, policy_version 1018636 (0.00094) [2022-07-11 03:37:43,859][25689] Fps is (10 sec: 5781.4, 60 sec: 5571.4, 300 sec: 5568.6). Total num frames: 1043091456. Throughput: 0: 5840.5. Samples: 1043093098. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:43,859][25689] Avg episode reward: [(0, '0.479')] [2022-07-11 03:37:44,270][26022] Updated weights on worker 0-0, policy_version 1018646 (0.00082) [2022-07-11 03:37:46,134][26022] Updated weights on worker 0-0, policy_version 1018656 (0.00086) [2022-07-11 03:37:48,033][26022] Updated weights on worker 0-0, policy_version 1018666 (0.00204) [2022-07-11 03:37:48,861][25689] Fps is (10 sec: 5697.9, 60 sec: 5593.3, 300 sec: 5568.6). Total num frames: 1043119104. Throughput: 0: 5849.8. Samples: 1043126586. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:48,862][25689] Avg episode reward: [(0, '1.567')] [2022-07-11 03:37:49,839][26022] Updated weights on worker 0-0, policy_version 1018676 (0.00105) [2022-07-11 03:37:51,655][26022] Updated weights on worker 0-0, policy_version 1018686 (0.00085) [2022-07-11 03:37:53,436][26022] Updated weights on worker 0-0, policy_version 1018696 (0.00078) [2022-07-11 03:37:53,902][25689] Fps is (10 sec: 5505.8, 60 sec: 5557.6, 300 sec: 5569.2). Total num frames: 1043146752. Throughput: 0: 5035.2. Samples: 1043143728. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:53,902][25689] Avg episode reward: [(0, '1.649')] [2022-07-11 03:37:55,170][26022] Updated weights on worker 0-0, policy_version 1018706 (0.00090) [2022-07-11 03:37:57,087][26022] Updated weights on worker 0-0, policy_version 1018716 (0.00083) [2022-07-11 03:37:58,716][26022] Updated weights on worker 0-0, policy_version 1018726 (0.00082) [2022-07-11 03:37:58,941][25689] Fps is (10 sec: 5587.4, 60 sec: 5589.4, 300 sec: 5569.7). Total num frames: 1043175424. Throughput: 0: 5875.2. Samples: 1043177684. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:37:58,942][25689] Avg episode reward: [(0, '1.627')] [2022-07-11 03:38:00,783][26022] Updated weights on worker 0-0, policy_version 1018736 (0.00087) [2022-07-11 03:38:02,832][26022] Updated weights on worker 0-0, policy_version 1018746 (0.00085) [2022-07-11 03:38:03,991][25689] Fps is (10 sec: 5277.9, 60 sec: 5572.9, 300 sec: 5561.9). Total num frames: 1043200000. Throughput: 0: 5755.0. Samples: 1043208958. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:03,993][25689] Avg episode reward: [(0, '1.367')] [2022-07-11 03:38:04,761][26022] Updated weights on worker 0-0, policy_version 1018756 (0.00086) [2022-07-11 03:38:06,663][26022] Updated weights on worker 0-0, policy_version 1018766 (0.00474) [2022-07-11 03:38:08,425][26022] Updated weights on worker 0-0, policy_version 1018776 (0.00085) [2022-07-11 03:38:09,043][25689] Fps is (10 sec: 5372.4, 60 sec: 5552.7, 300 sec: 5565.6). Total num frames: 1043229696. Throughput: 0: 4922.8. Samples: 1043225936. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:09,043][25689] Avg episode reward: [(0, '1.473')] [2022-07-11 03:38:10,144][26022] Updated weights on worker 0-0, policy_version 1018786 (0.00090) [2022-07-11 03:38:12,046][26022] Updated weights on worker 0-0, policy_version 1018796 (0.00083) [2022-07-11 03:38:13,747][26022] Updated weights on worker 0-0, policy_version 1018806 (0.00092) [2022-07-11 03:38:14,059][25689] Fps is (10 sec: 5695.8, 60 sec: 5569.6, 300 sec: 5569.0). Total num frames: 1043257344. Throughput: 0: 5745.0. Samples: 1043259528. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:14,059][25689] Avg episode reward: [(0, '1.456')] [2022-07-11 03:38:15,703][26022] Updated weights on worker 0-0, policy_version 1018816 (0.00095) [2022-07-11 03:38:17,658][26022] Updated weights on worker 0-0, policy_version 1018826 (0.00086) [2022-07-11 03:38:19,062][25689] Fps is (10 sec: 5621.5, 60 sec: 5586.5, 300 sec: 5566.4). Total num frames: 1043286016. Throughput: 0: 5738.5. Samples: 1043293146. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:19,062][25689] Avg episode reward: [(0, '1.316')] [2022-07-11 03:38:19,413][26022] Updated weights on worker 0-0, policy_version 1018836 (0.00083) [2022-07-11 03:38:21,327][26022] Updated weights on worker 0-0, policy_version 1018846 (0.00090) [2022-07-11 03:38:22,898][26022] Updated weights on worker 0-0, policy_version 1018856 (0.00094) [2022-07-11 03:38:24,120][25689] Fps is (10 sec: 5699.4, 60 sec: 5570.8, 300 sec: 5576.1). Total num frames: 1043314688. Throughput: 0: 5007.6. Samples: 1043309756. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:24,120][25689] Avg episode reward: [(0, '1.201')] [2022-07-11 03:38:24,871][26022] Updated weights on worker 0-0, policy_version 1018866 (0.00087) [2022-07-11 03:38:26,890][26022] Updated weights on worker 0-0, policy_version 1018876 (0.00095) [2022-07-11 03:38:28,603][26022] Updated weights on worker 0-0, policy_version 1018886 (0.00091) [2022-07-11 03:38:29,143][25689] Fps is (10 sec: 5586.6, 60 sec: 5577.9, 300 sec: 5568.8). Total num frames: 1043342336. Throughput: 0: 5845.4. Samples: 1043343428. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:29,143][25689] Avg episode reward: [(0, '1.804')] [2022-07-11 03:38:30,366][26022] Updated weights on worker 0-0, policy_version 1018896 (0.00093) [2022-07-11 03:38:32,265][26022] Updated weights on worker 0-0, policy_version 1018906 (0.00083) [2022-07-11 03:38:34,062][26022] Updated weights on worker 0-0, policy_version 1018916 (0.00091) [2022-07-11 03:38:34,244][25689] Fps is (10 sec: 5461.8, 60 sec: 5570.3, 300 sec: 5567.4). Total num frames: 1043369984. Throughput: 0: 5823.7. Samples: 1043377082. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:34,255][25689] Avg episode reward: [(0, '1.409')] [2022-07-11 03:38:35,907][26022] Updated weights on worker 0-0, policy_version 1018926 (0.00089) [2022-07-11 03:38:37,685][26022] Updated weights on worker 0-0, policy_version 1018936 (0.00090) [2022-07-11 03:38:39,261][25689] Fps is (10 sec: 5566.2, 60 sec: 5571.9, 300 sec: 5568.9). Total num frames: 1043398656. Throughput: 0: 4994.3. Samples: 1043394028. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:39,262][25689] Avg episode reward: [(0, '1.407')] [2022-07-11 03:38:39,472][26022] Updated weights on worker 0-0, policy_version 1018946 (0.00099) [2022-07-11 03:38:41,486][26022] Updated weights on worker 0-0, policy_version 1018956 (0.00085) [2022-07-11 03:38:43,015][26022] Updated weights on worker 0-0, policy_version 1018966 (0.00085) [2022-07-11 03:38:44,320][25689] Fps is (10 sec: 5589.5, 60 sec: 5538.2, 300 sec: 5567.9). Total num frames: 1043426304. Throughput: 0: 5827.8. Samples: 1043427480. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:44,321][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 03:38:45,180][26022] Updated weights on worker 0-0, policy_version 1018976 (0.00078) [2022-07-11 03:38:46,712][26022] Updated weights on worker 0-0, policy_version 1018986 (0.00092) [2022-07-11 03:38:48,762][26022] Updated weights on worker 0-0, policy_version 1018996 (0.00084) [2022-07-11 03:38:49,342][25689] Fps is (10 sec: 5586.8, 60 sec: 5553.4, 300 sec: 5571.3). Total num frames: 1043454976. Throughput: 0: 5833.1. Samples: 1043461252. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:49,342][25689] Avg episode reward: [(0, '1.326')] [2022-07-11 03:38:50,469][26022] Updated weights on worker 0-0, policy_version 1019006 (0.00083) [2022-07-11 03:38:52,567][26022] Updated weights on worker 0-0, policy_version 1019016 (0.00094) [2022-07-11 03:38:54,095][26022] Updated weights on worker 0-0, policy_version 1019026 (0.00060) [2022-07-11 03:38:54,347][25689] Fps is (10 sec: 5719.1, 60 sec: 5573.6, 300 sec: 5574.9). Total num frames: 1043483648. Throughput: 0: 5001.2. Samples: 1043477620. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:54,347][25689] Avg episode reward: [(0, '1.478')] [2022-07-11 03:38:56,330][26022] Updated weights on worker 0-0, policy_version 1019036 (0.00084) [2022-07-11 03:38:57,744][26022] Updated weights on worker 0-0, policy_version 1019046 (0.00095) [2022-07-11 03:38:59,358][25689] Fps is (10 sec: 5316.3, 60 sec: 5508.4, 300 sec: 5558.7). Total num frames: 1043508224. Throughput: 0: 5808.8. Samples: 1043510766. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:38:59,358][25689] Avg episode reward: [(0, '0.942')] [2022-07-11 03:38:59,996][26022] Updated weights on worker 0-0, policy_version 1019056 (0.00082) [2022-07-11 03:39:02,059][26022] Updated weights on worker 0-0, policy_version 1019066 (0.00096) [2022-07-11 03:39:03,876][26022] Updated weights on worker 0-0, policy_version 1019076 (0.00086) [2022-07-11 03:39:04,405][25689] Fps is (10 sec: 5294.2, 60 sec: 5576.4, 300 sec: 5569.4). Total num frames: 1043536896. Throughput: 0: 5702.1. Samples: 1043542004. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:04,405][25689] Avg episode reward: [(0, '1.072')] [2022-07-11 03:39:05,828][26022] Updated weights on worker 0-0, policy_version 1019086 (0.00093) [2022-07-11 03:39:07,503][26022] Updated weights on worker 0-0, policy_version 1019096 (0.00081) [2022-07-11 03:39:09,410][26022] Updated weights on worker 0-0, policy_version 1019106 (0.00098) [2022-07-11 03:39:09,414][25689] Fps is (10 sec: 5600.4, 60 sec: 5546.5, 300 sec: 5570.6). Total num frames: 1043564544. Throughput: 0: 4856.2. Samples: 1043558728. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:09,415][25689] Avg episode reward: [(0, '1.355')] [2022-07-11 03:39:11,296][26022] Updated weights on worker 0-0, policy_version 1019116 (0.00091) [2022-07-11 03:39:13,138][26022] Updated weights on worker 0-0, policy_version 1019126 (0.00111) [2022-07-11 03:39:14,419][25689] Fps is (10 sec: 5624.2, 60 sec: 5564.5, 300 sec: 5570.9). Total num frames: 1043593216. Throughput: 0: 5720.9. Samples: 1043592448. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:14,419][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 03:39:14,914][26022] Updated weights on worker 0-0, policy_version 1019136 (0.00089) [2022-07-11 03:39:16,934][26022] Updated weights on worker 0-0, policy_version 1019146 (0.00087) [2022-07-11 03:39:18,579][26022] Updated weights on worker 0-0, policy_version 1019156 (0.00083) [2022-07-11 03:39:19,422][25689] Fps is (10 sec: 5627.9, 60 sec: 5547.5, 300 sec: 5568.8). Total num frames: 1043620864. Throughput: 0: 5752.4. Samples: 1043626180. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:19,422][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 03:39:20,444][26022] Updated weights on worker 0-0, policy_version 1019166 (0.00090) [2022-07-11 03:39:22,203][26022] Updated weights on worker 0-0, policy_version 1019176 (0.00086) [2022-07-11 03:39:24,034][26022] Updated weights on worker 0-0, policy_version 1019186 (0.00095) [2022-07-11 03:39:24,501][25689] Fps is (10 sec: 5484.2, 60 sec: 5528.6, 300 sec: 5567.5). Total num frames: 1043648512. Throughput: 0: 5021.8. Samples: 1043642926. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:24,502][25689] Avg episode reward: [(0, '0.207')] [2022-07-11 03:39:25,738][26022] Updated weights on worker 0-0, policy_version 1019196 (0.00092) [2022-07-11 03:39:27,536][26022] Updated weights on worker 0-0, policy_version 1019206 (0.00083) [2022-07-11 03:39:29,488][26022] Updated weights on worker 0-0, policy_version 1019216 (0.00092) [2022-07-11 03:39:29,526][25689] Fps is (10 sec: 5573.9, 60 sec: 5545.4, 300 sec: 5570.7). Total num frames: 1043677184. Throughput: 0: 5873.6. Samples: 1043676854. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:29,526][25689] Avg episode reward: [(0, '-0.015')] [2022-07-11 03:39:31,016][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:39:31,028][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001019224_1043685376.pth [2022-07-11 03:39:31,029][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001017266_1041680384.pth [2022-07-11 03:39:31,570][26022] Updated weights on worker 0-0, policy_version 1019226 (0.00089) [2022-07-11 03:39:33,129][26022] Updated weights on worker 0-0, policy_version 1019236 (0.00053) [2022-07-11 03:39:34,540][25689] Fps is (10 sec: 5508.4, 60 sec: 5536.4, 300 sec: 5563.8). Total num frames: 1043703808. Throughput: 0: 5837.6. Samples: 1043709908. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:34,540][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 03:39:35,123][26022] Updated weights on worker 0-0, policy_version 1019246 (0.00087) [2022-07-11 03:39:36,871][26022] Updated weights on worker 0-0, policy_version 1019256 (0.00092) [2022-07-11 03:39:38,853][26022] Updated weights on worker 0-0, policy_version 1019266 (0.00087) [2022-07-11 03:39:39,576][25689] Fps is (10 sec: 5501.9, 60 sec: 5534.7, 300 sec: 5567.5). Total num frames: 1043732480. Throughput: 0: 4989.1. Samples: 1043726732. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:39,577][25689] Avg episode reward: [(0, '-0.133')] [2022-07-11 03:39:40,684][26022] Updated weights on worker 0-0, policy_version 1019276 (0.00094) [2022-07-11 03:39:42,611][26022] Updated weights on worker 0-0, policy_version 1019286 (0.00096) [2022-07-11 03:39:44,345][26022] Updated weights on worker 0-0, policy_version 1019296 (0.00087) [2022-07-11 03:39:44,687][25689] Fps is (10 sec: 5550.4, 60 sec: 5529.9, 300 sec: 5569.5). Total num frames: 1043760128. Throughput: 0: 5791.3. Samples: 1043759828. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:44,689][25689] Avg episode reward: [(0, '1.445')] [2022-07-11 03:39:46,371][26022] Updated weights on worker 0-0, policy_version 1019306 (0.00093) [2022-07-11 03:39:47,911][26022] Updated weights on worker 0-0, policy_version 1019316 (0.00085) [2022-07-11 03:39:49,723][25689] Fps is (10 sec: 5449.4, 60 sec: 5511.6, 300 sec: 5565.7). Total num frames: 1043787776. Throughput: 0: 5749.9. Samples: 1043792988. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:49,724][25689] Avg episode reward: [(0, '1.442')] [2022-07-11 03:39:50,044][26022] Updated weights on worker 0-0, policy_version 1019326 (0.00091) [2022-07-11 03:39:51,667][26022] Updated weights on worker 0-0, policy_version 1019336 (0.00086) [2022-07-11 03:39:53,765][26022] Updated weights on worker 0-0, policy_version 1019346 (0.00081) [2022-07-11 03:39:54,751][25689] Fps is (10 sec: 5698.1, 60 sec: 5526.5, 300 sec: 5565.7). Total num frames: 1043817472. Throughput: 0: 4925.0. Samples: 1043809446. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:54,751][25689] Avg episode reward: [(0, '1.511')] [2022-07-11 03:39:55,308][26022] Updated weights on worker 0-0, policy_version 1019356 (0.00270) [2022-07-11 03:39:57,272][26022] Updated weights on worker 0-0, policy_version 1019366 (0.00089) [2022-07-11 03:39:59,070][26022] Updated weights on worker 0-0, policy_version 1019376 (0.00093) [2022-07-11 03:39:59,767][25689] Fps is (10 sec: 5505.7, 60 sec: 5543.0, 300 sec: 5576.6). Total num frames: 1043843072. Throughput: 0: 5775.0. Samples: 1043843332. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:39:59,767][25689] Avg episode reward: [(0, '1.614')] [2022-07-11 03:40:00,827][26022] Updated weights on worker 0-0, policy_version 1019386 (0.00086) [2022-07-11 03:40:03,126][26022] Updated weights on worker 0-0, policy_version 1019396 (0.00092) [2022-07-11 03:40:04,803][26022] Updated weights on worker 0-0, policy_version 1019406 (0.00087) [2022-07-11 03:40:04,808][25689] Fps is (10 sec: 5396.3, 60 sec: 5543.6, 300 sec: 5569.6). Total num frames: 1043871744. Throughput: 0: 5712.9. Samples: 1043874776. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:40:04,808][25689] Avg episode reward: [(0, '1.576')] [2022-07-11 03:40:06,918][26022] Updated weights on worker 0-0, policy_version 1019416 (0.00084) [2022-07-11 03:40:08,577][26022] Updated weights on worker 0-0, policy_version 1019426 (0.00085) [2022-07-11 03:40:09,817][25689] Fps is (10 sec: 5502.2, 60 sec: 5526.7, 300 sec: 5562.7). Total num frames: 1043898368. Throughput: 0: 4910.7. Samples: 1043891660. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:40:09,817][25689] Avg episode reward: [(0, '2.378')] [2022-07-11 03:40:10,504][26022] Updated weights on worker 0-0, policy_version 1019436 (0.00099) [2022-07-11 03:40:12,284][26022] Updated weights on worker 0-0, policy_version 1019446 (0.00085) [2022-07-11 03:40:14,186][26022] Updated weights on worker 0-0, policy_version 1019456 (0.00094) [2022-07-11 03:40:14,826][25689] Fps is (10 sec: 5417.2, 60 sec: 5509.2, 300 sec: 5559.5). Total num frames: 1043926016. Throughput: 0: 5752.7. Samples: 1043924936. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:40:14,827][25689] Avg episode reward: [(0, '1.904')] [2022-07-11 03:40:16,067][26022] Updated weights on worker 0-0, policy_version 1019466 (0.00089) [2022-07-11 03:40:17,808][26022] Updated weights on worker 0-0, policy_version 1019476 (0.00090) [2022-07-11 03:40:19,622][26022] Updated weights on worker 0-0, policy_version 1019486 (0.00089) [2022-07-11 03:40:19,831][25689] Fps is (10 sec: 5624.1, 60 sec: 5526.1, 300 sec: 5566.9). Total num frames: 1043954688. Throughput: 0: 5740.7. Samples: 1043958514. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:40:19,831][25689] Avg episode reward: [(0, '1.579')] [2022-07-11 03:40:21,546][26022] Updated weights on worker 0-0, policy_version 1019496 (0.00086) [2022-07-11 03:40:23,303][26022] Updated weights on worker 0-0, policy_version 1019506 (0.00087) [2022-07-11 03:40:24,927][25689] Fps is (10 sec: 5474.7, 60 sec: 5507.6, 300 sec: 5551.3). Total num frames: 1043981312. Throughput: 0: 5845.0. Samples: 1043992370. Policy #0 lag: (min: 1.0, avg: 7.7, max: 18.0) [2022-07-11 03:40:24,927][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 03:40:25,161][26022] Updated weights on worker 0-0, policy_version 1019516 (0.00085) [2022-07-11 03:40:27,018][26022] Updated weights on worker 0-0, policy_version 1019526 (0.00081) [2022-07-11 03:40:28,766][26022] Updated weights on worker 0-0, policy_version 1019536 (0.00072) [2022-07-11 03:40:29,932][25689] Fps is (10 sec: 5575.6, 60 sec: 5526.3, 300 sec: 5565.5). Total num frames: 1044011008. Throughput: 0: 5843.3. Samples: 1044009200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:40:29,932][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 03:40:30,869][26022] Updated weights on worker 0-0, policy_version 1019546 (0.00086) [2022-07-11 03:40:32,423][26022] Updated weights on worker 0-0, policy_version 1019556 (0.00091) [2022-07-11 03:40:34,368][26022] Updated weights on worker 0-0, policy_version 1019566 (0.00097) [2022-07-11 03:40:34,940][25689] Fps is (10 sec: 5726.6, 60 sec: 5543.8, 300 sec: 5555.6). Total num frames: 1044038656. Throughput: 0: 5854.0. Samples: 1044042684. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:40:34,941][25689] Avg episode reward: [(0, '0.579')] [2022-07-11 03:40:36,100][26022] Updated weights on worker 0-0, policy_version 1019576 (0.00089) [2022-07-11 03:40:38,103][26022] Updated weights on worker 0-0, policy_version 1019586 (0.00083) [2022-07-11 03:40:39,853][26022] Updated weights on worker 0-0, policy_version 1019596 (0.00099) [2022-07-11 03:40:39,965][25689] Fps is (10 sec: 5511.2, 60 sec: 5527.9, 300 sec: 5561.0). Total num frames: 1044066304. Throughput: 0: 5847.4. Samples: 1044076250. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:40:39,966][25689] Avg episode reward: [(0, '0.425')] [2022-07-11 03:40:41,689][26022] Updated weights on worker 0-0, policy_version 1019606 (0.00089) [2022-07-11 03:40:43,539][26022] Updated weights on worker 0-0, policy_version 1019616 (0.00092) [2022-07-11 03:40:45,019][25689] Fps is (10 sec: 5587.8, 60 sec: 5550.1, 300 sec: 5556.6). Total num frames: 1044094976. Throughput: 0: 5014.9. Samples: 1044093132. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:40:45,021][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 03:40:45,340][26022] Updated weights on worker 0-0, policy_version 1019626 (0.00090) [2022-07-11 03:40:47,337][26022] Updated weights on worker 0-0, policy_version 1019636 (0.00088) [2022-07-11 03:40:48,909][26022] Updated weights on worker 0-0, policy_version 1019646 (0.00084) [2022-07-11 03:40:50,026][25689] Fps is (10 sec: 5597.8, 60 sec: 5552.8, 300 sec: 5560.0). Total num frames: 1044122624. Throughput: 0: 5846.2. Samples: 1044126676. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:40:50,027][25689] Avg episode reward: [(0, '0.725')] [2022-07-11 03:40:50,857][26022] Updated weights on worker 0-0, policy_version 1019656 (0.00086) [2022-07-11 03:40:52,528][26022] Updated weights on worker 0-0, policy_version 1019666 (0.00112) [2022-07-11 03:40:54,668][26022] Updated weights on worker 0-0, policy_version 1019676 (0.00091) [2022-07-11 03:40:55,033][25689] Fps is (10 sec: 5521.9, 60 sec: 5520.7, 300 sec: 5557.8). Total num frames: 1044150272. Throughput: 0: 5849.6. Samples: 1044160220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:40:55,037][25689] Avg episode reward: [(0, '1.111')] [2022-07-11 03:40:56,388][26022] Updated weights on worker 0-0, policy_version 1019686 (0.00085) [2022-07-11 03:40:58,227][26022] Updated weights on worker 0-0, policy_version 1019696 (0.00086) [2022-07-11 03:41:00,000][26022] Updated weights on worker 0-0, policy_version 1019706 (0.00090) [2022-07-11 03:41:00,050][25689] Fps is (10 sec: 5618.6, 60 sec: 5571.5, 300 sec: 5566.1). Total num frames: 1044178944. Throughput: 0: 5004.4. Samples: 1044176762. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:00,052][25689] Avg episode reward: [(0, '1.177')] [2022-07-11 03:41:01,952][26022] Updated weights on worker 0-0, policy_version 1019716 (0.00091) [2022-07-11 03:41:04,191][26022] Updated weights on worker 0-0, policy_version 1019726 (0.00091) [2022-07-11 03:41:05,108][25689] Fps is (10 sec: 5488.7, 60 sec: 5536.1, 300 sec: 5558.3). Total num frames: 1044205568. Throughput: 0: 5716.7. Samples: 1044207972. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:05,110][25689] Avg episode reward: [(0, '1.170')] [2022-07-11 03:41:06,138][26022] Updated weights on worker 0-0, policy_version 1019736 (0.00089) [2022-07-11 03:41:07,851][26022] Updated weights on worker 0-0, policy_version 1019746 (0.00084) [2022-07-11 03:41:09,748][26022] Updated weights on worker 0-0, policy_version 1019756 (0.00084) [2022-07-11 03:41:10,116][25689] Fps is (10 sec: 5289.8, 60 sec: 5536.1, 300 sec: 5559.5). Total num frames: 1044232192. Throughput: 0: 5693.7. Samples: 1044241062. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:10,118][25689] Avg episode reward: [(0, '1.287')] [2022-07-11 03:41:11,647][26022] Updated weights on worker 0-0, policy_version 1019766 (0.00088) [2022-07-11 03:41:13,243][26022] Updated weights on worker 0-0, policy_version 1019776 (0.00093) [2022-07-11 03:41:15,126][25689] Fps is (10 sec: 5315.3, 60 sec: 5519.1, 300 sec: 5543.7). Total num frames: 1044258816. Throughput: 0: 4858.8. Samples: 1044257844. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:15,126][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 03:41:15,272][26022] Updated weights on worker 0-0, policy_version 1019786 (0.00098) [2022-07-11 03:41:16,906][26022] Updated weights on worker 0-0, policy_version 1019796 (0.00085) [2022-07-11 03:41:19,091][26022] Updated weights on worker 0-0, policy_version 1019806 (0.00085) [2022-07-11 03:41:20,155][25689] Fps is (10 sec: 5610.0, 60 sec: 5533.8, 300 sec: 5555.1). Total num frames: 1044288512. Throughput: 0: 5684.4. Samples: 1044291048. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:20,158][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 03:41:20,676][26022] Updated weights on worker 0-0, policy_version 1019816 (0.00086) [2022-07-11 03:41:22,741][26022] Updated weights on worker 0-0, policy_version 1019826 (0.00092) [2022-07-11 03:41:24,562][26022] Updated weights on worker 0-0, policy_version 1019836 (0.00072) [2022-07-11 03:41:25,232][25689] Fps is (10 sec: 5572.7, 60 sec: 5535.5, 300 sec: 5546.8). Total num frames: 1044315136. Throughput: 0: 5774.4. Samples: 1044324178. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:25,233][25689] Avg episode reward: [(0, '1.268')] [2022-07-11 03:41:26,412][26022] Updated weights on worker 0-0, policy_version 1019846 (0.00082) [2022-07-11 03:41:28,315][26022] Updated weights on worker 0-0, policy_version 1019856 (0.00084) [2022-07-11 03:41:30,100][26022] Updated weights on worker 0-0, policy_version 1019866 (0.00083) [2022-07-11 03:41:30,253][25689] Fps is (10 sec: 5475.9, 60 sec: 5517.1, 300 sec: 5546.9). Total num frames: 1044343808. Throughput: 0: 4957.9. Samples: 1044340900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:30,254][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 03:41:31,154][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:41:31,170][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001019872_1044348928.pth [2022-07-11 03:41:31,170][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001017918_1042348032.pth [2022-07-11 03:41:32,083][26022] Updated weights on worker 0-0, policy_version 1019876 (0.00083) [2022-07-11 03:41:33,573][26022] Updated weights on worker 0-0, policy_version 1019886 (0.00086) [2022-07-11 03:41:35,260][25689] Fps is (10 sec: 5616.3, 60 sec: 5517.3, 300 sec: 5545.1). Total num frames: 1044371456. Throughput: 0: 5803.7. Samples: 1044374698. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:35,261][25689] Avg episode reward: [(0, '0.785')] [2022-07-11 03:41:35,491][26022] Updated weights on worker 0-0, policy_version 1019896 (0.00089) [2022-07-11 03:41:37,382][26022] Updated weights on worker 0-0, policy_version 1019906 (0.00090) [2022-07-11 03:41:39,257][26022] Updated weights on worker 0-0, policy_version 1019916 (0.00083) [2022-07-11 03:41:40,269][25689] Fps is (10 sec: 5623.3, 60 sec: 5535.7, 300 sec: 5542.9). Total num frames: 1044400128. Throughput: 0: 5815.7. Samples: 1044408022. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:40,269][25689] Avg episode reward: [(0, '0.588')] [2022-07-11 03:41:41,168][26022] Updated weights on worker 0-0, policy_version 1019926 (0.00096) [2022-07-11 03:41:42,902][26022] Updated weights on worker 0-0, policy_version 1019936 (0.00090) [2022-07-11 03:41:44,724][26022] Updated weights on worker 0-0, policy_version 1019946 (0.00083) [2022-07-11 03:41:45,399][25689] Fps is (10 sec: 5554.8, 60 sec: 5511.8, 300 sec: 5545.0). Total num frames: 1044427776. Throughput: 0: 4991.6. Samples: 1044424842. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:45,400][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 03:41:46,403][26022] Updated weights on worker 0-0, policy_version 1019956 (0.00083) [2022-07-11 03:41:48,377][26022] Updated weights on worker 0-0, policy_version 1019966 (0.00086) [2022-07-11 03:41:50,260][26022] Updated weights on worker 0-0, policy_version 1019976 (0.00085) [2022-07-11 03:41:50,401][25689] Fps is (10 sec: 5659.8, 60 sec: 5546.2, 300 sec: 5545.3). Total num frames: 1044457472. Throughput: 0: 5847.6. Samples: 1044458714. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:50,401][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 03:41:52,168][26022] Updated weights on worker 0-0, policy_version 1019986 (0.00090) [2022-07-11 03:41:53,783][26022] Updated weights on worker 0-0, policy_version 1019996 (0.00093) [2022-07-11 03:41:55,449][25689] Fps is (10 sec: 5705.6, 60 sec: 5542.4, 300 sec: 5548.2). Total num frames: 1044485120. Throughput: 0: 5837.6. Samples: 1044492556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:41:55,450][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 03:41:55,655][26022] Updated weights on worker 0-0, policy_version 1020006 (0.00089) [2022-07-11 03:41:57,436][26022] Updated weights on worker 0-0, policy_version 1020016 (0.00087) [2022-07-11 03:41:59,236][26022] Updated weights on worker 0-0, policy_version 1020026 (0.00086) [2022-07-11 03:42:00,486][25689] Fps is (10 sec: 5381.2, 60 sec: 5506.6, 300 sec: 5552.0). Total num frames: 1044511744. Throughput: 0: 5010.1. Samples: 1044509312. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:00,487][25689] Avg episode reward: [(0, '0.647')] [2022-07-11 03:42:01,121][26022] Updated weights on worker 0-0, policy_version 1020036 (0.00082) [2022-07-11 03:42:03,123][26022] Updated weights on worker 0-0, policy_version 1020046 (0.00084) [2022-07-11 03:42:05,071][26022] Updated weights on worker 0-0, policy_version 1020056 (0.00089) [2022-07-11 03:42:05,596][25689] Fps is (10 sec: 5348.8, 60 sec: 5518.8, 300 sec: 5539.9). Total num frames: 1044539392. Throughput: 0: 5754.4. Samples: 1044541064. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:05,596][25689] Avg episode reward: [(0, '0.736')] [2022-07-11 03:42:06,899][26022] Updated weights on worker 0-0, policy_version 1020066 (0.00085) [2022-07-11 03:42:08,788][26022] Updated weights on worker 0-0, policy_version 1020076 (0.00085) [2022-07-11 03:42:10,662][25689] Fps is (10 sec: 5434.2, 60 sec: 5530.5, 300 sec: 5542.4). Total num frames: 1044567040. Throughput: 0: 5714.6. Samples: 1044574498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:10,663][25689] Avg episode reward: [(0, '1.102')] [2022-07-11 03:42:10,766][26022] Updated weights on worker 0-0, policy_version 1020086 (0.00085) [2022-07-11 03:42:12,545][26022] Updated weights on worker 0-0, policy_version 1020096 (0.00091) [2022-07-11 03:42:14,230][26022] Updated weights on worker 0-0, policy_version 1020106 (0.00081) [2022-07-11 03:42:15,668][25689] Fps is (10 sec: 5693.3, 60 sec: 5581.5, 300 sec: 5549.2). Total num frames: 1044596736. Throughput: 0: 4884.7. Samples: 1044591318. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:15,669][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 03:42:16,179][26022] Updated weights on worker 0-0, policy_version 1020116 (0.00089) [2022-07-11 03:42:18,046][26022] Updated weights on worker 0-0, policy_version 1020126 (0.00090) [2022-07-11 03:42:20,024][26022] Updated weights on worker 0-0, policy_version 1020136 (0.00078) [2022-07-11 03:42:20,683][25689] Fps is (10 sec: 5722.4, 60 sec: 5549.1, 300 sec: 5543.4). Total num frames: 1044624384. Throughput: 0: 5730.3. Samples: 1044625044. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:20,684][25689] Avg episode reward: [(0, '-0.386')] [2022-07-11 03:42:21,387][26022] Updated weights on worker 0-0, policy_version 1020146 (0.00095) [2022-07-11 03:42:23,588][26022] Updated weights on worker 0-0, policy_version 1020156 (0.00085) [2022-07-11 03:42:25,269][26022] Updated weights on worker 0-0, policy_version 1020166 (0.00083) [2022-07-11 03:42:25,803][25689] Fps is (10 sec: 5556.9, 60 sec: 5578.9, 300 sec: 5546.4). Total num frames: 1044653056. Throughput: 0: 5830.9. Samples: 1044658892. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:25,804][25689] Avg episode reward: [(0, '-1.120')] [2022-07-11 03:42:27,117][26022] Updated weights on worker 0-0, policy_version 1020176 (0.00089) [2022-07-11 03:42:28,881][26022] Updated weights on worker 0-0, policy_version 1020186 (0.00089) [2022-07-11 03:42:30,825][25689] Fps is (10 sec: 5553.1, 60 sec: 5562.0, 300 sec: 5546.4). Total num frames: 1044680704. Throughput: 0: 5001.9. Samples: 1044675352. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:30,826][26022] Updated weights on worker 0-0, policy_version 1020196 (0.00097) [2022-07-11 03:42:30,826][25689] Avg episode reward: [(0, '-0.888')] [2022-07-11 03:42:32,638][26022] Updated weights on worker 0-0, policy_version 1020206 (0.00091) [2022-07-11 03:42:34,731][26022] Updated weights on worker 0-0, policy_version 1020216 (0.00094) [2022-07-11 03:42:35,860][25689] Fps is (10 sec: 5600.1, 60 sec: 5576.2, 300 sec: 5546.4). Total num frames: 1044709376. Throughput: 0: 5811.0. Samples: 1044708654. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:35,861][25689] Avg episode reward: [(0, '-0.715')] [2022-07-11 03:42:36,339][26022] Updated weights on worker 0-0, policy_version 1020226 (0.00092) [2022-07-11 03:42:38,384][26022] Updated weights on worker 0-0, policy_version 1020236 (0.00090) [2022-07-11 03:42:39,943][26022] Updated weights on worker 0-0, policy_version 1020246 (0.00086) [2022-07-11 03:42:40,889][25689] Fps is (10 sec: 5392.7, 60 sec: 5523.7, 300 sec: 5533.2). Total num frames: 1044734976. Throughput: 0: 5783.7. Samples: 1044741910. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:40,889][25689] Avg episode reward: [(0, '-0.765')] [2022-07-11 03:42:41,941][26022] Updated weights on worker 0-0, policy_version 1020256 (0.00085) [2022-07-11 03:42:43,702][26022] Updated weights on worker 0-0, policy_version 1020266 (0.00091) [2022-07-11 03:42:45,675][26022] Updated weights on worker 0-0, policy_version 1020276 (0.00093) [2022-07-11 03:42:46,014][25689] Fps is (10 sec: 5344.8, 60 sec: 5541.0, 300 sec: 5534.3). Total num frames: 1044763648. Throughput: 0: 4928.6. Samples: 1044758504. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:46,015][25689] Avg episode reward: [(0, '0.042')] [2022-07-11 03:42:47,286][26022] Updated weights on worker 0-0, policy_version 1020286 (0.00086) [2022-07-11 03:42:49,279][26022] Updated weights on worker 0-0, policy_version 1020296 (0.00087) [2022-07-11 03:42:50,915][26022] Updated weights on worker 0-0, policy_version 1020306 (0.00099) [2022-07-11 03:42:51,092][25689] Fps is (10 sec: 5821.1, 60 sec: 5551.0, 300 sec: 5544.0). Total num frames: 1044794368. Throughput: 0: 5772.0. Samples: 1044792332. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:51,092][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 03:42:52,951][26022] Updated weights on worker 0-0, policy_version 1020316 (0.00087) [2022-07-11 03:42:54,497][26022] Updated weights on worker 0-0, policy_version 1020326 (0.00081) [2022-07-11 03:42:56,097][25689] Fps is (10 sec: 5687.3, 60 sec: 5538.1, 300 sec: 5537.7). Total num frames: 1044820992. Throughput: 0: 5812.5. Samples: 1044826280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:42:56,098][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 03:42:56,568][26022] Updated weights on worker 0-0, policy_version 1020336 (0.00089) [2022-07-11 03:42:58,291][26022] Updated weights on worker 0-0, policy_version 1020346 (0.00087) [2022-07-11 03:43:00,172][26022] Updated weights on worker 0-0, policy_version 1020356 (0.00088) [2022-07-11 03:43:01,103][25689] Fps is (10 sec: 5523.3, 60 sec: 5574.7, 300 sec: 5552.3). Total num frames: 1044849664. Throughput: 0: 5003.3. Samples: 1044843048. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:01,105][25689] Avg episode reward: [(0, '1.300')] [2022-07-11 03:43:02,411][26022] Updated weights on worker 0-0, policy_version 1020366 (0.00084) [2022-07-11 03:43:04,230][26022] Updated weights on worker 0-0, policy_version 1020376 (0.00096) [2022-07-11 03:43:06,013][26022] Updated weights on worker 0-0, policy_version 1020386 (0.00085) [2022-07-11 03:43:06,169][25689] Fps is (10 sec: 5388.4, 60 sec: 5544.9, 300 sec: 5538.3). Total num frames: 1044875264. Throughput: 0: 5758.7. Samples: 1044874568. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:06,171][25689] Avg episode reward: [(0, '1.354')] [2022-07-11 03:43:07,883][26022] Updated weights on worker 0-0, policy_version 1020396 (0.00085) [2022-07-11 03:43:09,696][26022] Updated weights on worker 0-0, policy_version 1020406 (0.00091) [2022-07-11 03:43:11,192][25689] Fps is (10 sec: 5379.3, 60 sec: 5565.8, 300 sec: 5541.6). Total num frames: 1044903936. Throughput: 0: 5767.5. Samples: 1044908260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:11,192][25689] Avg episode reward: [(0, '1.231')] [2022-07-11 03:43:11,476][26022] Updated weights on worker 0-0, policy_version 1020416 (0.00092) [2022-07-11 03:43:13,264][26022] Updated weights on worker 0-0, policy_version 1020426 (0.00097) [2022-07-11 03:43:15,220][26022] Updated weights on worker 0-0, policy_version 1020436 (0.00091) [2022-07-11 03:43:16,218][25689] Fps is (10 sec: 5604.7, 60 sec: 5530.1, 300 sec: 5537.7). Total num frames: 1044931584. Throughput: 0: 4907.6. Samples: 1044925024. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:16,220][25689] Avg episode reward: [(0, '1.488')] [2022-07-11 03:43:16,794][26022] Updated weights on worker 0-0, policy_version 1020446 (0.00081) [2022-07-11 03:43:18,816][26022] Updated weights on worker 0-0, policy_version 1020456 (0.00086) [2022-07-11 03:43:20,457][26022] Updated weights on worker 0-0, policy_version 1020466 (0.00088) [2022-07-11 03:43:21,229][25689] Fps is (10 sec: 5509.3, 60 sec: 5530.5, 300 sec: 5535.2). Total num frames: 1044959232. Throughput: 0: 5757.2. Samples: 1044958916. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:21,231][25689] Avg episode reward: [(0, '1.364')] [2022-07-11 03:43:22,388][26022] Updated weights on worker 0-0, policy_version 1020476 (0.00095) [2022-07-11 03:43:24,279][26022] Updated weights on worker 0-0, policy_version 1020486 (0.00085) [2022-07-11 03:43:25,920][26022] Updated weights on worker 0-0, policy_version 1020496 (0.00092) [2022-07-11 03:43:26,374][25689] Fps is (10 sec: 5646.5, 60 sec: 5545.2, 300 sec: 5539.8). Total num frames: 1044988928. Throughput: 0: 5840.4. Samples: 1044992570. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:26,374][25689] Avg episode reward: [(0, '1.407')] [2022-07-11 03:43:28,008][26022] Updated weights on worker 0-0, policy_version 1020506 (0.00086) [2022-07-11 03:43:29,700][26022] Updated weights on worker 0-0, policy_version 1020516 (0.00090) [2022-07-11 03:43:31,381][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:43:31,391][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001020525_1045017600.pth [2022-07-11 03:43:31,391][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001018572_1043017728.pth [2022-07-11 03:43:31,392][25689] Fps is (10 sec: 5742.9, 60 sec: 5562.3, 300 sec: 5544.8). Total num frames: 1045017600. Throughput: 0: 5841.8. Samples: 1045026266. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:31,393][25689] Avg episode reward: [(0, '0.806')] [2022-07-11 03:43:31,640][26022] Updated weights on worker 0-0, policy_version 1020526 (0.00094) [2022-07-11 03:43:33,554][26022] Updated weights on worker 0-0, policy_version 1020536 (0.00092) [2022-07-11 03:43:35,094][26022] Updated weights on worker 0-0, policy_version 1020546 (0.00096) [2022-07-11 03:43:36,485][25689] Fps is (10 sec: 5468.5, 60 sec: 5523.3, 300 sec: 5536.5). Total num frames: 1045044224. Throughput: 0: 5828.4. Samples: 1045043150. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:36,486][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 03:43:37,082][26022] Updated weights on worker 0-0, policy_version 1020556 (0.00091) [2022-07-11 03:43:39,168][26022] Updated weights on worker 0-0, policy_version 1020566 (0.00093) [2022-07-11 03:43:40,613][26022] Updated weights on worker 0-0, policy_version 1020576 (0.00093) [2022-07-11 03:43:41,499][25689] Fps is (10 sec: 5673.7, 60 sec: 5609.1, 300 sec: 5547.6). Total num frames: 1045074944. Throughput: 0: 5800.6. Samples: 1045076496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:41,500][25689] Avg episode reward: [(0, '0.163')] [2022-07-11 03:43:42,856][26022] Updated weights on worker 0-0, policy_version 1020586 (0.00092) [2022-07-11 03:43:44,352][26022] Updated weights on worker 0-0, policy_version 1020596 (0.00086) [2022-07-11 03:43:46,404][26022] Updated weights on worker 0-0, policy_version 1020606 (0.00086) [2022-07-11 03:43:46,586][25689] Fps is (10 sec: 5575.8, 60 sec: 5561.9, 300 sec: 5536.1). Total num frames: 1045100544. Throughput: 0: 5811.3. Samples: 1045110030. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:46,597][25689] Avg episode reward: [(0, '-0.453')] [2022-07-11 03:43:47,862][26022] Updated weights on worker 0-0, policy_version 1020616 (0.00089) [2022-07-11 03:43:50,079][26022] Updated weights on worker 0-0, policy_version 1020626 (0.00088) [2022-07-11 03:43:51,619][25689] Fps is (10 sec: 5363.2, 60 sec: 5532.2, 300 sec: 5535.6). Total num frames: 1045129216. Throughput: 0: 4977.6. Samples: 1045126944. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:51,619][25689] Avg episode reward: [(0, '0.099')] [2022-07-11 03:43:51,764][26022] Updated weights on worker 0-0, policy_version 1020636 (0.00093) [2022-07-11 03:43:53,834][26022] Updated weights on worker 0-0, policy_version 1020646 (0.00085) [2022-07-11 03:43:55,343][26022] Updated weights on worker 0-0, policy_version 1020656 (0.00095) [2022-07-11 03:43:56,655][25689] Fps is (10 sec: 5593.6, 60 sec: 5546.3, 300 sec: 5545.4). Total num frames: 1045156864. Throughput: 0: 5787.8. Samples: 1045159886. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:43:56,655][25689] Avg episode reward: [(0, '-0.183')] [2022-07-11 03:43:57,594][26022] Updated weights on worker 0-0, policy_version 1020666 (0.00097) [2022-07-11 03:43:59,140][26022] Updated weights on worker 0-0, policy_version 1020676 (0.00084) [2022-07-11 03:44:01,123][26022] Updated weights on worker 0-0, policy_version 1020686 (0.00092) [2022-07-11 03:44:01,659][25689] Fps is (10 sec: 5507.2, 60 sec: 5529.5, 300 sec: 5542.8). Total num frames: 1045184512. Throughput: 0: 5780.3. Samples: 1045193026. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:44:01,660][25689] Avg episode reward: [(0, '0.201')] [2022-07-11 03:44:03,338][26022] Updated weights on worker 0-0, policy_version 1020696 (0.00090) [2022-07-11 03:44:05,249][26022] Updated weights on worker 0-0, policy_version 1020706 (0.00086) [2022-07-11 03:44:06,788][25689] Fps is (10 sec: 5355.6, 60 sec: 5540.7, 300 sec: 5537.1). Total num frames: 1045211136. Throughput: 0: 4840.3. Samples: 1045207816. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:44:06,789][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 03:44:06,967][26022] Updated weights on worker 0-0, policy_version 1020716 (0.00089) [2022-07-11 03:44:08,999][26022] Updated weights on worker 0-0, policy_version 1020726 (0.00085) [2022-07-11 03:44:10,593][26022] Updated weights on worker 0-0, policy_version 1020736 (0.00096) [2022-07-11 03:44:11,795][25689] Fps is (10 sec: 5354.7, 60 sec: 5525.3, 300 sec: 5533.6). Total num frames: 1045238784. Throughput: 0: 5657.0. Samples: 1045241080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:44:11,795][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 03:44:12,635][26022] Updated weights on worker 0-0, policy_version 1020746 (0.00101) [2022-07-11 03:44:14,349][26022] Updated weights on worker 0-0, policy_version 1020756 (0.00088) [2022-07-11 03:44:16,259][26022] Updated weights on worker 0-0, policy_version 1020766 (0.00085) [2022-07-11 03:44:16,810][25689] Fps is (10 sec: 5619.9, 60 sec: 5543.2, 300 sec: 5536.8). Total num frames: 1045267456. Throughput: 0: 5698.6. Samples: 1045274742. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:44:16,810][25689] Avg episode reward: [(0, '0.789')] [2022-07-11 03:44:18,015][26022] Updated weights on worker 0-0, policy_version 1020776 (0.00091) [2022-07-11 03:44:19,779][26022] Updated weights on worker 0-0, policy_version 1020786 (0.00088) [2022-07-11 03:44:21,705][26022] Updated weights on worker 0-0, policy_version 1020796 (0.00088) [2022-07-11 03:44:21,816][25689] Fps is (10 sec: 5619.9, 60 sec: 5543.6, 300 sec: 5538.2). Total num frames: 1045295104. Throughput: 0: 4892.6. Samples: 1045291646. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 03:44:21,817][25689] Avg episode reward: [(0, '0.632')] [2022-07-11 03:44:23,558][26022] Updated weights on worker 0-0, policy_version 1020806 (0.00096) [2022-07-11 03:44:25,397][26022] Updated weights on worker 0-0, policy_version 1020816 (0.00088) [2022-07-11 03:44:26,891][25689] Fps is (10 sec: 5485.0, 60 sec: 5516.1, 300 sec: 5533.8). Total num frames: 1045322752. Throughput: 0: 5852.0. Samples: 1045325458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:26,892][25689] Avg episode reward: [(0, '0.877')] [2022-07-11 03:44:27,158][26022] Updated weights on worker 0-0, policy_version 1020826 (0.00089) [2022-07-11 03:44:28,943][26022] Updated weights on worker 0-0, policy_version 1020836 (0.00087) [2022-07-11 03:44:31,027][26022] Updated weights on worker 0-0, policy_version 1020846 (0.00092) [2022-07-11 03:44:31,912][25689] Fps is (10 sec: 5680.0, 60 sec: 5532.9, 300 sec: 5544.0). Total num frames: 1045352448. Throughput: 0: 5847.3. Samples: 1045358712. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:31,914][25689] Avg episode reward: [(0, '0.639')] [2022-07-11 03:44:32,808][26022] Updated weights on worker 0-0, policy_version 1020856 (0.00091) [2022-07-11 03:44:34,528][26022] Updated weights on worker 0-0, policy_version 1020866 (0.00093) [2022-07-11 03:44:36,260][26022] Updated weights on worker 0-0, policy_version 1020876 (0.00079) [2022-07-11 03:44:36,919][25689] Fps is (10 sec: 5514.5, 60 sec: 5523.8, 300 sec: 5534.2). Total num frames: 1045378048. Throughput: 0: 5010.4. Samples: 1045375494. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:36,920][25689] Avg episode reward: [(0, '1.251')] [2022-07-11 03:44:38,186][26022] Updated weights on worker 0-0, policy_version 1020886 (0.00095) [2022-07-11 03:44:40,200][26022] Updated weights on worker 0-0, policy_version 1020896 (0.00094) [2022-07-11 03:44:41,824][26022] Updated weights on worker 0-0, policy_version 1020906 (0.00083) [2022-07-11 03:44:41,929][25689] Fps is (10 sec: 5520.2, 60 sec: 5507.2, 300 sec: 5543.0). Total num frames: 1045407744. Throughput: 0: 5821.3. Samples: 1045408728. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:41,930][25689] Avg episode reward: [(0, '0.540')] [2022-07-11 03:44:43,783][26022] Updated weights on worker 0-0, policy_version 1020916 (0.00078) [2022-07-11 03:44:45,531][26022] Updated weights on worker 0-0, policy_version 1020926 (0.00083) [2022-07-11 03:44:47,050][25689] Fps is (10 sec: 5660.0, 60 sec: 5538.0, 300 sec: 5541.4). Total num frames: 1045435392. Throughput: 0: 5784.4. Samples: 1045442064. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:47,052][25689] Avg episode reward: [(0, '0.479')] [2022-07-11 03:44:47,539][26022] Updated weights on worker 0-0, policy_version 1020936 (0.00081) [2022-07-11 03:44:49,214][26022] Updated weights on worker 0-0, policy_version 1020946 (0.00082) [2022-07-11 03:44:51,023][26022] Updated weights on worker 0-0, policy_version 1020956 (0.00084) [2022-07-11 03:44:52,061][25689] Fps is (10 sec: 5558.7, 60 sec: 5540.0, 300 sec: 5538.3). Total num frames: 1045464064. Throughput: 0: 4977.8. Samples: 1045459008. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:52,061][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 03:44:53,160][26022] Updated weights on worker 0-0, policy_version 1020966 (0.00092) [2022-07-11 03:44:54,674][26022] Updated weights on worker 0-0, policy_version 1020976 (0.00090) [2022-07-11 03:44:56,788][26022] Updated weights on worker 0-0, policy_version 1020986 (0.00084) [2022-07-11 03:44:57,121][25689] Fps is (10 sec: 5592.4, 60 sec: 5537.8, 300 sec: 5544.4). Total num frames: 1045491712. Throughput: 0: 5782.8. Samples: 1045492320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:44:57,121][25689] Avg episode reward: [(0, '0.710')] [2022-07-11 03:44:58,483][26022] Updated weights on worker 0-0, policy_version 1020996 (0.00092) [2022-07-11 03:45:00,243][26022] Updated weights on worker 0-0, policy_version 1021006 (0.00084) [2022-07-11 03:45:02,131][25689] Fps is (10 sec: 5287.8, 60 sec: 5503.4, 300 sec: 5534.6). Total num frames: 1045517312. Throughput: 0: 5742.1. Samples: 1045524728. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:02,131][25689] Avg episode reward: [(0, '0.864')] [2022-07-11 03:45:02,576][26022] Updated weights on worker 0-0, policy_version 1021016 (0.00088) [2022-07-11 03:45:04,383][26022] Updated weights on worker 0-0, policy_version 1021026 (0.00085) [2022-07-11 03:45:06,456][26022] Updated weights on worker 0-0, policy_version 1021036 (0.00086) [2022-07-11 03:45:07,226][25689] Fps is (10 sec: 5370.6, 60 sec: 5540.4, 300 sec: 5539.9). Total num frames: 1045545984. Throughput: 0: 4861.4. Samples: 1045540148. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:07,227][25689] Avg episode reward: [(0, '0.076')] [2022-07-11 03:45:08,107][26022] Updated weights on worker 0-0, policy_version 1021046 (0.00099) [2022-07-11 03:45:09,992][26022] Updated weights on worker 0-0, policy_version 1021056 (0.00086) [2022-07-11 03:45:11,919][26022] Updated weights on worker 0-0, policy_version 1021066 (0.00089) [2022-07-11 03:45:12,234][25689] Fps is (10 sec: 5473.1, 60 sec: 5523.3, 300 sec: 5536.5). Total num frames: 1045572608. Throughput: 0: 5673.1. Samples: 1045573452. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:12,235][25689] Avg episode reward: [(0, '0.441')] [2022-07-11 03:45:13,663][26022] Updated weights on worker 0-0, policy_version 1021076 (0.00094) [2022-07-11 03:45:15,449][26022] Updated weights on worker 0-0, policy_version 1021086 (0.00085) [2022-07-11 03:45:17,244][25689] Fps is (10 sec: 5519.8, 60 sec: 5523.8, 300 sec: 5536.4). Total num frames: 1045601280. Throughput: 0: 5686.3. Samples: 1045606746. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:17,245][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 03:45:17,410][26022] Updated weights on worker 0-0, policy_version 1021096 (0.00090) [2022-07-11 03:45:19,120][26022] Updated weights on worker 0-0, policy_version 1021106 (0.00092) [2022-07-11 03:45:21,106][26022] Updated weights on worker 0-0, policy_version 1021116 (0.00086) [2022-07-11 03:45:22,258][25689] Fps is (10 sec: 5720.6, 60 sec: 5540.0, 300 sec: 5544.8). Total num frames: 1045629952. Throughput: 0: 4912.0. Samples: 1045623594. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:22,258][25689] Avg episode reward: [(0, '0.328')] [2022-07-11 03:45:22,725][26022] Updated weights on worker 0-0, policy_version 1021126 (0.00096) [2022-07-11 03:45:24,789][26022] Updated weights on worker 0-0, policy_version 1021136 (0.00086) [2022-07-11 03:45:26,432][26022] Updated weights on worker 0-0, policy_version 1021146 (0.00092) [2022-07-11 03:45:27,392][25689] Fps is (10 sec: 5549.8, 60 sec: 5534.6, 300 sec: 5535.5). Total num frames: 1045657600. Throughput: 0: 5793.1. Samples: 1045656970. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:27,393][25689] Avg episode reward: [(0, '-1.364')] [2022-07-11 03:45:28,623][26022] Updated weights on worker 0-0, policy_version 1021156 (0.00088) [2022-07-11 03:45:30,180][26022] Updated weights on worker 0-0, policy_version 1021166 (0.00091) [2022-07-11 03:45:31,402][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:45:31,416][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001021172_1045680128.pth [2022-07-11 03:45:31,417][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001019224_1043685376.pth [2022-07-11 03:45:32,301][26022] Updated weights on worker 0-0, policy_version 1021176 (0.00066) [2022-07-11 03:45:32,458][25689] Fps is (10 sec: 5421.3, 60 sec: 5496.7, 300 sec: 5534.4). Total num frames: 1045685248. Throughput: 0: 5774.0. Samples: 1045690224. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:32,458][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 03:45:34,063][26022] Updated weights on worker 0-0, policy_version 1021186 (0.00089) [2022-07-11 03:45:35,820][26022] Updated weights on worker 0-0, policy_version 1021196 (0.00094) [2022-07-11 03:45:37,495][25689] Fps is (10 sec: 5473.2, 60 sec: 5527.7, 300 sec: 5534.2). Total num frames: 1045712896. Throughput: 0: 4949.7. Samples: 1045706982. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:37,496][25689] Avg episode reward: [(0, '-1.593')] [2022-07-11 03:45:37,709][26022] Updated weights on worker 0-0, policy_version 1021206 (0.00093) [2022-07-11 03:45:39,368][26022] Updated weights on worker 0-0, policy_version 1021216 (0.00100) [2022-07-11 03:45:41,540][26022] Updated weights on worker 0-0, policy_version 1021226 (0.00090) [2022-07-11 03:45:42,509][25689] Fps is (10 sec: 5704.9, 60 sec: 5527.3, 300 sec: 5538.4). Total num frames: 1045742592. Throughput: 0: 5756.2. Samples: 1045740166. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:42,510][25689] Avg episode reward: [(0, '-0.697')] [2022-07-11 03:45:43,215][26022] Updated weights on worker 0-0, policy_version 1021236 (0.00085) [2022-07-11 03:45:44,987][26022] Updated weights on worker 0-0, policy_version 1021246 (0.00095) [2022-07-11 03:45:47,088][26022] Updated weights on worker 0-0, policy_version 1021256 (0.00083) [2022-07-11 03:45:47,560][25689] Fps is (10 sec: 5493.7, 60 sec: 5499.9, 300 sec: 5530.7). Total num frames: 1045768192. Throughput: 0: 5783.0. Samples: 1045773604. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:47,561][25689] Avg episode reward: [(0, '-0.872')] [2022-07-11 03:45:48,687][26022] Updated weights on worker 0-0, policy_version 1021266 (0.00096) [2022-07-11 03:45:50,683][26022] Updated weights on worker 0-0, policy_version 1021276 (0.00087) [2022-07-11 03:45:52,391][26022] Updated weights on worker 0-0, policy_version 1021286 (0.00093) [2022-07-11 03:45:52,563][25689] Fps is (10 sec: 5500.0, 60 sec: 5517.5, 300 sec: 5537.6). Total num frames: 1045797888. Throughput: 0: 4980.6. Samples: 1045790362. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:52,564][25689] Avg episode reward: [(0, '-0.929')] [2022-07-11 03:45:54,076][26022] Updated weights on worker 0-0, policy_version 1021296 (0.00081) [2022-07-11 03:45:56,080][26022] Updated weights on worker 0-0, policy_version 1021306 (0.00099) [2022-07-11 03:45:57,584][25689] Fps is (10 sec: 5720.9, 60 sec: 5521.1, 300 sec: 5534.1). Total num frames: 1045825536. Throughput: 0: 5826.3. Samples: 1045824026. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:45:57,585][25689] Avg episode reward: [(0, '-0.606')] [2022-07-11 03:45:57,984][26022] Updated weights on worker 0-0, policy_version 1021316 (0.00126) [2022-07-11 03:45:59,726][26022] Updated weights on worker 0-0, policy_version 1021326 (0.00096) [2022-07-11 03:46:01,676][26022] Updated weights on worker 0-0, policy_version 1021336 (0.00092) [2022-07-11 03:46:02,660][25689] Fps is (10 sec: 5273.8, 60 sec: 5515.1, 300 sec: 5530.4). Total num frames: 1045851136. Throughput: 0: 5825.0. Samples: 1045857542. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:02,660][25689] Avg episode reward: [(0, '-0.180')] [2022-07-11 03:46:03,759][26022] Updated weights on worker 0-0, policy_version 1021346 (0.00093) [2022-07-11 03:46:05,748][26022] Updated weights on worker 0-0, policy_version 1021356 (0.00090) [2022-07-11 03:46:07,498][26022] Updated weights on worker 0-0, policy_version 1021366 (0.00083) [2022-07-11 03:46:07,705][25689] Fps is (10 sec: 5463.6, 60 sec: 5536.6, 300 sec: 5540.0). Total num frames: 1045880832. Throughput: 0: 5730.7. Samples: 1045889046. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:07,705][25689] Avg episode reward: [(0, '-1.048')] [2022-07-11 03:46:09,437][26022] Updated weights on worker 0-0, policy_version 1021376 (0.00089) [2022-07-11 03:46:11,071][26022] Updated weights on worker 0-0, policy_version 1021386 (0.00086) [2022-07-11 03:46:12,793][25689] Fps is (10 sec: 5557.8, 60 sec: 5529.2, 300 sec: 5538.5). Total num frames: 1045907456. Throughput: 0: 5705.1. Samples: 1045905778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:12,794][25689] Avg episode reward: [(0, '-1.792')] [2022-07-11 03:46:13,216][26022] Updated weights on worker 0-0, policy_version 1021396 (0.00084) [2022-07-11 03:46:14,772][26022] Updated weights on worker 0-0, policy_version 1021406 (0.00462) [2022-07-11 03:46:16,713][26022] Updated weights on worker 0-0, policy_version 1021416 (0.00091) [2022-07-11 03:46:17,889][25689] Fps is (10 sec: 5529.9, 60 sec: 5538.3, 300 sec: 5537.3). Total num frames: 1045937152. Throughput: 0: 5679.7. Samples: 1045939354. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:17,890][25689] Avg episode reward: [(0, '-1.942')] [2022-07-11 03:46:18,499][26022] Updated weights on worker 0-0, policy_version 1021426 (0.00083) [2022-07-11 03:46:20,266][26022] Updated weights on worker 0-0, policy_version 1021436 (0.00091) [2022-07-11 03:46:22,071][26022] Updated weights on worker 0-0, policy_version 1021446 (0.00092) [2022-07-11 03:46:22,896][25689] Fps is (10 sec: 5574.7, 60 sec: 5505.2, 300 sec: 5538.6). Total num frames: 1045963776. Throughput: 0: 5706.6. Samples: 1045973022. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:22,897][25689] Avg episode reward: [(0, '-2.023')] [2022-07-11 03:46:23,923][26022] Updated weights on worker 0-0, policy_version 1021456 (0.00096) [2022-07-11 03:46:26,009][26022] Updated weights on worker 0-0, policy_version 1021466 (0.00095) [2022-07-11 03:46:27,683][26022] Updated weights on worker 0-0, policy_version 1021476 (0.00090) [2022-07-11 03:46:28,051][25689] Fps is (10 sec: 5441.9, 60 sec: 5520.2, 300 sec: 5536.1). Total num frames: 1045992448. Throughput: 0: 4936.5. Samples: 1045989484. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:28,051][25689] Avg episode reward: [(0, '-1.544')] [2022-07-11 03:46:29,459][26022] Updated weights on worker 0-0, policy_version 1021486 (0.00088) [2022-07-11 03:46:31,584][26022] Updated weights on worker 0-0, policy_version 1021496 (0.00088) [2022-07-11 03:46:33,053][25689] Fps is (10 sec: 5746.9, 60 sec: 5559.8, 300 sec: 5543.1). Total num frames: 1046022144. Throughput: 0: 5791.3. Samples: 1046023104. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:33,053][25689] Avg episode reward: [(0, '-0.434')] [2022-07-11 03:46:33,054][26022] Updated weights on worker 0-0, policy_version 1021506 (0.00082) [2022-07-11 03:46:35,179][26022] Updated weights on worker 0-0, policy_version 1021516 (0.00095) [2022-07-11 03:46:36,826][26022] Updated weights on worker 0-0, policy_version 1021526 (0.00435) [2022-07-11 03:46:38,055][25689] Fps is (10 sec: 5527.2, 60 sec: 5529.2, 300 sec: 5532.9). Total num frames: 1046047744. Throughput: 0: 5816.7. Samples: 1046056650. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:38,055][25689] Avg episode reward: [(0, '-0.042')] [2022-07-11 03:46:38,634][26022] Updated weights on worker 0-0, policy_version 1021536 (0.00092) [2022-07-11 03:46:40,608][26022] Updated weights on worker 0-0, policy_version 1021546 (0.00089) [2022-07-11 03:46:42,392][26022] Updated weights on worker 0-0, policy_version 1021556 (0.00088) [2022-07-11 03:46:43,064][25689] Fps is (10 sec: 5318.9, 60 sec: 5495.9, 300 sec: 5535.1). Total num frames: 1046075392. Throughput: 0: 4967.0. Samples: 1046073194. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:43,064][25689] Avg episode reward: [(0, '0.497')] [2022-07-11 03:46:44,284][26022] Updated weights on worker 0-0, policy_version 1021566 (0.00093) [2022-07-11 03:46:46,223][26022] Updated weights on worker 0-0, policy_version 1021576 (0.00055) [2022-07-11 03:46:47,835][26022] Updated weights on worker 0-0, policy_version 1021586 (0.00092) [2022-07-11 03:46:48,119][25689] Fps is (10 sec: 5596.1, 60 sec: 5546.2, 300 sec: 5530.7). Total num frames: 1046104064. Throughput: 0: 5827.1. Samples: 1046106424. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:48,120][25689] Avg episode reward: [(0, '0.420')] [2022-07-11 03:46:49,933][26022] Updated weights on worker 0-0, policy_version 1021596 (0.00090) [2022-07-11 03:46:51,615][26022] Updated weights on worker 0-0, policy_version 1021606 (0.00086) [2022-07-11 03:46:53,130][25689] Fps is (10 sec: 5595.0, 60 sec: 5511.7, 300 sec: 5531.4). Total num frames: 1046131712. Throughput: 0: 5817.1. Samples: 1046139894. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:53,130][25689] Avg episode reward: [(0, '0.544')] [2022-07-11 03:46:53,672][26022] Updated weights on worker 0-0, policy_version 1021616 (0.00086) [2022-07-11 03:46:55,167][26022] Updated weights on worker 0-0, policy_version 1021626 (0.00087) [2022-07-11 03:46:57,327][26022] Updated weights on worker 0-0, policy_version 1021636 (0.00084) [2022-07-11 03:46:58,154][25689] Fps is (10 sec: 5612.6, 60 sec: 5528.3, 300 sec: 5538.5). Total num frames: 1046160384. Throughput: 0: 4984.6. Samples: 1046156834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:46:58,154][25689] Avg episode reward: [(0, '0.798')] [2022-07-11 03:46:59,084][26022] Updated weights on worker 0-0, policy_version 1021646 (0.00090) [2022-07-11 03:47:00,926][26022] Updated weights on worker 0-0, policy_version 1021656 (0.00081) [2022-07-11 03:47:02,955][26022] Updated weights on worker 0-0, policy_version 1021666 (0.00084) [2022-07-11 03:47:03,164][25689] Fps is (10 sec: 5510.8, 60 sec: 5551.3, 300 sec: 5536.9). Total num frames: 1046187008. Throughput: 0: 5824.4. Samples: 1046190266. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:03,164][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 03:47:04,813][26022] Updated weights on worker 0-0, policy_version 1021676 (0.00091) [2022-07-11 03:47:06,701][26022] Updated weights on worker 0-0, policy_version 1021686 (0.00091) [2022-07-11 03:47:08,250][25689] Fps is (10 sec: 5476.9, 60 sec: 5530.5, 300 sec: 5540.0). Total num frames: 1046215680. Throughput: 0: 5739.0. Samples: 1046221954. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:08,250][25689] Avg episode reward: [(0, '0.828')] [2022-07-11 03:47:08,469][26022] Updated weights on worker 0-0, policy_version 1021696 (0.00095) [2022-07-11 03:47:10,410][26022] Updated weights on worker 0-0, policy_version 1021706 (0.00094) [2022-07-11 03:47:12,067][26022] Updated weights on worker 0-0, policy_version 1021716 (0.00082) [2022-07-11 03:47:13,277][25689] Fps is (10 sec: 5467.6, 60 sec: 5536.2, 300 sec: 5529.3). Total num frames: 1046242304. Throughput: 0: 4906.8. Samples: 1046238754. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:13,278][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 03:47:13,979][26022] Updated weights on worker 0-0, policy_version 1021726 (0.00087) [2022-07-11 03:47:15,881][26022] Updated weights on worker 0-0, policy_version 1021736 (0.00085) [2022-07-11 03:47:17,820][26022] Updated weights on worker 0-0, policy_version 1021746 (0.00093) [2022-07-11 03:47:18,299][25689] Fps is (10 sec: 5502.7, 60 sec: 5526.0, 300 sec: 5532.6). Total num frames: 1046270976. Throughput: 0: 5731.1. Samples: 1046272288. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:18,299][25689] Avg episode reward: [(0, '1.030')] [2022-07-11 03:47:19,559][26022] Updated weights on worker 0-0, policy_version 1021756 (0.00087) [2022-07-11 03:47:21,373][26022] Updated weights on worker 0-0, policy_version 1021766 (0.00087) [2022-07-11 03:47:23,134][26022] Updated weights on worker 0-0, policy_version 1021776 (0.00088) [2022-07-11 03:47:23,308][25689] Fps is (10 sec: 5614.8, 60 sec: 5542.8, 300 sec: 5531.2). Total num frames: 1046298624. Throughput: 0: 5743.8. Samples: 1046305968. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:23,308][25689] Avg episode reward: [(0, '0.868')] [2022-07-11 03:47:24,894][26022] Updated weights on worker 0-0, policy_version 1021786 (0.00091) [2022-07-11 03:47:26,849][26022] Updated weights on worker 0-0, policy_version 1021796 (0.00083) [2022-07-11 03:47:28,383][25689] Fps is (10 sec: 5585.2, 60 sec: 5550.1, 300 sec: 5533.7). Total num frames: 1046327296. Throughput: 0: 5000.2. Samples: 1046322622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:28,383][25689] Avg episode reward: [(0, '0.669')] [2022-07-11 03:47:28,807][26022] Updated weights on worker 0-0, policy_version 1021806 (0.00076) [2022-07-11 03:47:30,536][26022] Updated weights on worker 0-0, policy_version 1021816 (0.00076) [2022-07-11 03:47:31,480][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:47:31,498][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001021821_1046344704.pth [2022-07-11 03:47:31,499][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001019872_1044348928.pth [2022-07-11 03:47:32,391][26022] Updated weights on worker 0-0, policy_version 1021826 (0.00090) [2022-07-11 03:47:33,449][25689] Fps is (10 sec: 5452.6, 60 sec: 5493.3, 300 sec: 5526.2). Total num frames: 1046353920. Throughput: 0: 5815.5. Samples: 1046356064. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:33,450][25689] Avg episode reward: [(0, '1.208')] [2022-07-11 03:47:34,203][26022] Updated weights on worker 0-0, policy_version 1021836 (0.00087) [2022-07-11 03:47:36,046][26022] Updated weights on worker 0-0, policy_version 1021846 (0.00090) [2022-07-11 03:47:37,814][26022] Updated weights on worker 0-0, policy_version 1021856 (0.00091) [2022-07-11 03:47:38,468][25689] Fps is (10 sec: 5685.9, 60 sec: 5576.6, 300 sec: 5543.6). Total num frames: 1046384640. Throughput: 0: 5818.0. Samples: 1046389632. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:38,469][25689] Avg episode reward: [(0, '1.492')] [2022-07-11 03:47:39,800][26022] Updated weights on worker 0-0, policy_version 1021866 (0.00084) [2022-07-11 03:47:41,451][26022] Updated weights on worker 0-0, policy_version 1021876 (0.00082) [2022-07-11 03:47:43,516][25689] Fps is (10 sec: 5594.5, 60 sec: 5539.0, 300 sec: 5534.7). Total num frames: 1046410240. Throughput: 0: 4965.7. Samples: 1046406316. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:43,517][25689] Avg episode reward: [(0, '1.941')] [2022-07-11 03:47:43,603][26022] Updated weights on worker 0-0, policy_version 1021886 (0.00096) [2022-07-11 03:47:45,132][26022] Updated weights on worker 0-0, policy_version 1021896 (0.00095) [2022-07-11 03:47:47,292][26022] Updated weights on worker 0-0, policy_version 1021906 (0.00090) [2022-07-11 03:47:48,596][25689] Fps is (10 sec: 5560.8, 60 sec: 5570.7, 300 sec: 5534.7). Total num frames: 1046440960. Throughput: 0: 5785.9. Samples: 1046439574. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:48,597][25689] Avg episode reward: [(0, '1.679')] [2022-07-11 03:47:48,811][26022] Updated weights on worker 0-0, policy_version 1021916 (0.00085) [2022-07-11 03:47:50,848][26022] Updated weights on worker 0-0, policy_version 1021926 (0.00083) [2022-07-11 03:47:52,655][26022] Updated weights on worker 0-0, policy_version 1021936 (0.00097) [2022-07-11 03:47:53,694][25689] Fps is (10 sec: 5634.3, 60 sec: 5545.7, 300 sec: 5533.0). Total num frames: 1046467584. Throughput: 0: 5773.1. Samples: 1046472938. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:53,694][25689] Avg episode reward: [(0, '1.300')] [2022-07-11 03:47:54,451][26022] Updated weights on worker 0-0, policy_version 1021946 (0.00109) [2022-07-11 03:47:56,326][26022] Updated weights on worker 0-0, policy_version 1021956 (0.00087) [2022-07-11 03:47:57,935][26022] Updated weights on worker 0-0, policy_version 1021966 (0.00082) [2022-07-11 03:47:58,744][25689] Fps is (10 sec: 5348.0, 60 sec: 5526.4, 300 sec: 5528.7). Total num frames: 1046495232. Throughput: 0: 4933.5. Samples: 1046489664. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:47:58,745][25689] Avg episode reward: [(0, '0.560')] [2022-07-11 03:48:00,084][26022] Updated weights on worker 0-0, policy_version 1021976 (0.00087) [2022-07-11 03:48:02,086][26022] Updated weights on worker 0-0, policy_version 1021986 (0.00094) [2022-07-11 03:48:03,797][25689] Fps is (10 sec: 5372.0, 60 sec: 5522.6, 300 sec: 5532.4). Total num frames: 1046521856. Throughput: 0: 5744.9. Samples: 1046522826. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:48:03,797][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 03:48:03,901][26022] Updated weights on worker 0-0, policy_version 1021996 (0.00084) [2022-07-11 03:48:05,908][26022] Updated weights on worker 0-0, policy_version 1022006 (0.00086) [2022-07-11 03:48:07,656][26022] Updated weights on worker 0-0, policy_version 1022016 (0.00094) [2022-07-11 03:48:08,847][25689] Fps is (10 sec: 5372.1, 60 sec: 5508.9, 300 sec: 5528.4). Total num frames: 1046549504. Throughput: 0: 5700.2. Samples: 1046555006. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:48:08,847][25689] Avg episode reward: [(0, '0.053')] [2022-07-11 03:48:09,462][26022] Updated weights on worker 0-0, policy_version 1022026 (0.00094) [2022-07-11 03:48:11,384][26022] Updated weights on worker 0-0, policy_version 1022036 (0.00090) [2022-07-11 03:48:13,015][26022] Updated weights on worker 0-0, policy_version 1022046 (0.00080) [2022-07-11 03:48:13,869][25689] Fps is (10 sec: 5591.3, 60 sec: 5543.2, 300 sec: 5531.9). Total num frames: 1046578176. Throughput: 0: 4892.5. Samples: 1046571646. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:48:13,870][25689] Avg episode reward: [(0, '-0.209')] [2022-07-11 03:48:15,267][26022] Updated weights on worker 0-0, policy_version 1022056 (0.00087) [2022-07-11 03:48:16,923][26022] Updated weights on worker 0-0, policy_version 1022066 (0.00082) [2022-07-11 03:48:18,755][26022] Updated weights on worker 0-0, policy_version 1022076 (0.00097) [2022-07-11 03:48:18,893][25689] Fps is (10 sec: 5605.9, 60 sec: 5526.1, 300 sec: 5531.7). Total num frames: 1046605824. Throughput: 0: 5718.3. Samples: 1046604884. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:48:18,894][25689] Avg episode reward: [(0, '-0.492')] [2022-07-11 03:48:20,587][26022] Updated weights on worker 0-0, policy_version 1022086 (0.00088) [2022-07-11 03:48:22,524][26022] Updated weights on worker 0-0, policy_version 1022096 (0.00084) [2022-07-11 03:48:23,901][25689] Fps is (10 sec: 5512.2, 60 sec: 5526.2, 300 sec: 5527.4). Total num frames: 1046633472. Throughput: 0: 5752.0. Samples: 1046638466. Policy #0 lag: (min: 0.0, avg: 8.5, max: 18.0) [2022-07-11 03:48:23,901][25689] Avg episode reward: [(0, '0.098')] [2022-07-11 03:48:24,279][26022] Updated weights on worker 0-0, policy_version 1022106 (0.00086) [2022-07-11 03:48:26,201][26022] Updated weights on worker 0-0, policy_version 1022116 (0.00095) [2022-07-11 03:48:27,915][26022] Updated weights on worker 0-0, policy_version 1022126 (0.00087) [2022-07-11 03:48:28,974][25689] Fps is (10 sec: 5586.7, 60 sec: 5526.3, 300 sec: 5526.4). Total num frames: 1046662144. Throughput: 0: 5797.1. Samples: 1046671688. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:28,975][25689] Avg episode reward: [(0, '0.693')] [2022-07-11 03:48:29,775][26022] Updated weights on worker 0-0, policy_version 1022136 (0.00109) [2022-07-11 03:48:31,736][26022] Updated weights on worker 0-0, policy_version 1022146 (0.00081) [2022-07-11 03:48:33,395][26022] Updated weights on worker 0-0, policy_version 1022156 (0.00091) [2022-07-11 03:48:34,054][25689] Fps is (10 sec: 5647.9, 60 sec: 5558.9, 300 sec: 5533.5). Total num frames: 1046690816. Throughput: 0: 5791.0. Samples: 1046688536. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:34,055][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 03:48:35,499][26022] Updated weights on worker 0-0, policy_version 1022166 (0.00086) [2022-07-11 03:48:36,874][26022] Updated weights on worker 0-0, policy_version 1022176 (0.00090) [2022-07-11 03:48:38,868][26022] Updated weights on worker 0-0, policy_version 1022186 (0.00089) [2022-07-11 03:48:39,056][25689] Fps is (10 sec: 5586.5, 60 sec: 5509.7, 300 sec: 5523.4). Total num frames: 1046718464. Throughput: 0: 5825.6. Samples: 1046722344. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:39,056][25689] Avg episode reward: [(0, '0.352')] [2022-07-11 03:48:40,927][26022] Updated weights on worker 0-0, policy_version 1022196 (0.00086) [2022-07-11 03:48:42,565][26022] Updated weights on worker 0-0, policy_version 1022206 (0.00084) [2022-07-11 03:48:44,095][25689] Fps is (10 sec: 5609.1, 60 sec: 5561.3, 300 sec: 5534.6). Total num frames: 1046747136. Throughput: 0: 5826.8. Samples: 1046756132. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:44,095][25689] Avg episode reward: [(0, '0.019')] [2022-07-11 03:48:44,360][26022] Updated weights on worker 0-0, policy_version 1022216 (0.00094) [2022-07-11 03:48:46,406][26022] Updated weights on worker 0-0, policy_version 1022226 (0.00084) [2022-07-11 03:48:47,979][26022] Updated weights on worker 0-0, policy_version 1022236 (0.00085) [2022-07-11 03:48:49,211][25689] Fps is (10 sec: 5445.3, 60 sec: 5490.4, 300 sec: 5526.2). Total num frames: 1046773760. Throughput: 0: 4983.3. Samples: 1046772532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:49,211][25689] Avg episode reward: [(0, '0.245')] [2022-07-11 03:48:49,976][26022] Updated weights on worker 0-0, policy_version 1022246 (0.00084) [2022-07-11 03:48:51,803][26022] Updated weights on worker 0-0, policy_version 1022256 (0.00090) [2022-07-11 03:48:53,540][26022] Updated weights on worker 0-0, policy_version 1022266 (0.00087) [2022-07-11 03:48:54,264][25689] Fps is (10 sec: 5639.1, 60 sec: 5562.1, 300 sec: 5536.2). Total num frames: 1046804480. Throughput: 0: 5806.6. Samples: 1046805886. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:54,264][25689] Avg episode reward: [(0, '-0.868')] [2022-07-11 03:48:55,532][26022] Updated weights on worker 0-0, policy_version 1022276 (0.00087) [2022-07-11 03:48:57,074][26022] Updated weights on worker 0-0, policy_version 1022286 (0.00092) [2022-07-11 03:48:59,319][25689] Fps is (10 sec: 5571.7, 60 sec: 5527.8, 300 sec: 5528.4). Total num frames: 1046830080. Throughput: 0: 5779.0. Samples: 1046839444. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:48:59,319][25689] Avg episode reward: [(0, '-2.604')] [2022-07-11 03:48:59,366][26022] Updated weights on worker 0-0, policy_version 1022296 (0.00085) [2022-07-11 03:49:00,967][26022] Updated weights on worker 0-0, policy_version 1022306 (0.00093) [2022-07-11 03:49:03,235][26022] Updated weights on worker 0-0, policy_version 1022316 (0.00086) [2022-07-11 03:49:04,341][25689] Fps is (10 sec: 5182.4, 60 sec: 5530.6, 300 sec: 5530.3). Total num frames: 1046856704. Throughput: 0: 4851.2. Samples: 1046854350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:04,342][25689] Avg episode reward: [(0, '-2.468')] [2022-07-11 03:49:05,074][26022] Updated weights on worker 0-0, policy_version 1022326 (0.00085) [2022-07-11 03:49:06,965][26022] Updated weights on worker 0-0, policy_version 1022336 (0.00090) [2022-07-11 03:49:08,745][26022] Updated weights on worker 0-0, policy_version 1022346 (0.00094) [2022-07-11 03:49:09,427][25689] Fps is (10 sec: 5571.6, 60 sec: 5561.1, 300 sec: 5535.7). Total num frames: 1046886400. Throughput: 0: 5689.4. Samples: 1046887552. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:09,428][25689] Avg episode reward: [(0, '-1.085')] [2022-07-11 03:49:10,597][26022] Updated weights on worker 0-0, policy_version 1022356 (0.00096) [2022-07-11 03:49:12,431][26022] Updated weights on worker 0-0, policy_version 1022366 (0.00087) [2022-07-11 03:49:14,255][26022] Updated weights on worker 0-0, policy_version 1022376 (0.00086) [2022-07-11 03:49:14,503][25689] Fps is (10 sec: 5643.1, 60 sec: 5539.4, 300 sec: 5531.2). Total num frames: 1046914048. Throughput: 0: 5699.9. Samples: 1046921246. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:14,503][25689] Avg episode reward: [(0, '-2.173')] [2022-07-11 03:49:16,097][26022] Updated weights on worker 0-0, policy_version 1022386 (0.00097) [2022-07-11 03:49:17,872][26022] Updated weights on worker 0-0, policy_version 1022396 (0.00091) [2022-07-11 03:49:19,563][25689] Fps is (10 sec: 5455.7, 60 sec: 5536.1, 300 sec: 5530.2). Total num frames: 1046941696. Throughput: 0: 4860.2. Samples: 1046937834. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:19,564][25689] Avg episode reward: [(0, '-2.183')] [2022-07-11 03:49:19,881][26022] Updated weights on worker 0-0, policy_version 1022406 (0.00319) [2022-07-11 03:49:21,628][26022] Updated weights on worker 0-0, policy_version 1022416 (0.00088) [2022-07-11 03:49:23,635][26022] Updated weights on worker 0-0, policy_version 1022426 (0.00083) [2022-07-11 03:49:24,571][25689] Fps is (10 sec: 5593.7, 60 sec: 5552.9, 300 sec: 5534.9). Total num frames: 1046970368. Throughput: 0: 5788.4. Samples: 1046971450. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:24,572][25689] Avg episode reward: [(0, '-0.425')] [2022-07-11 03:49:25,111][26022] Updated weights on worker 0-0, policy_version 1022436 (0.00082) [2022-07-11 03:49:27,151][26022] Updated weights on worker 0-0, policy_version 1022446 (0.00088) [2022-07-11 03:49:28,702][26022] Updated weights on worker 0-0, policy_version 1022456 (0.00083) [2022-07-11 03:49:29,647][25689] Fps is (10 sec: 5585.1, 60 sec: 5535.8, 300 sec: 5527.0). Total num frames: 1046998016. Throughput: 0: 5799.2. Samples: 1047004808. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:29,647][25689] Avg episode reward: [(0, '-0.329')] [2022-07-11 03:49:30,887][26022] Updated weights on worker 0-0, policy_version 1022466 (0.00085) [2022-07-11 03:49:31,574][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:49:31,582][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001022472_1047011328.pth [2022-07-11 03:49:31,591][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001020525_1045017600.pth [2022-07-11 03:49:32,448][26022] Updated weights on worker 0-0, policy_version 1022476 (0.00093) [2022-07-11 03:49:34,485][26022] Updated weights on worker 0-0, policy_version 1022486 (0.00085) [2022-07-11 03:49:34,685][25689] Fps is (10 sec: 5669.8, 60 sec: 5556.5, 300 sec: 5540.1). Total num frames: 1047027712. Throughput: 0: 4968.9. Samples: 1047021532. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:34,686][25689] Avg episode reward: [(0, '-0.618')] [2022-07-11 03:49:36,398][26022] Updated weights on worker 0-0, policy_version 1022496 (0.00085) [2022-07-11 03:49:38,131][26022] Updated weights on worker 0-0, policy_version 1022506 (0.00091) [2022-07-11 03:49:39,766][25689] Fps is (10 sec: 5565.9, 60 sec: 5532.4, 300 sec: 5528.5). Total num frames: 1047054336. Throughput: 0: 5807.9. Samples: 1047055170. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:39,766][25689] Avg episode reward: [(0, '-0.828')] [2022-07-11 03:49:39,905][26022] Updated weights on worker 0-0, policy_version 1022516 (0.00084) [2022-07-11 03:49:41,788][26022] Updated weights on worker 0-0, policy_version 1022526 (0.00092) [2022-07-11 03:49:43,477][26022] Updated weights on worker 0-0, policy_version 1022536 (0.00087) [2022-07-11 03:49:44,770][25689] Fps is (10 sec: 5483.3, 60 sec: 5535.6, 300 sec: 5534.1). Total num frames: 1047083008. Throughput: 0: 5809.3. Samples: 1047088790. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:44,770][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 03:49:45,665][26022] Updated weights on worker 0-0, policy_version 1022546 (0.00087) [2022-07-11 03:49:47,347][26022] Updated weights on worker 0-0, policy_version 1022556 (0.00086) [2022-07-11 03:49:49,124][26022] Updated weights on worker 0-0, policy_version 1022566 (0.00093) [2022-07-11 03:49:49,901][25689] Fps is (10 sec: 5556.8, 60 sec: 5551.1, 300 sec: 5528.4). Total num frames: 1047110656. Throughput: 0: 4966.4. Samples: 1047105398. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:49,902][25689] Avg episode reward: [(0, '-0.237')] [2022-07-11 03:49:51,120][26022] Updated weights on worker 0-0, policy_version 1022576 (0.00081) [2022-07-11 03:49:52,864][26022] Updated weights on worker 0-0, policy_version 1022586 (0.00092) [2022-07-11 03:49:54,645][26022] Updated weights on worker 0-0, policy_version 1022596 (0.00087) [2022-07-11 03:49:54,911][25689] Fps is (10 sec: 5654.4, 60 sec: 5538.1, 300 sec: 5536.2). Total num frames: 1047140352. Throughput: 0: 5804.6. Samples: 1047138936. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:54,912][25689] Avg episode reward: [(0, '-0.666')] [2022-07-11 03:49:56,608][26022] Updated weights on worker 0-0, policy_version 1022606 (0.00085) [2022-07-11 03:49:58,296][26022] Updated weights on worker 0-0, policy_version 1022616 (0.00088) [2022-07-11 03:49:59,955][25689] Fps is (10 sec: 5500.1, 60 sec: 5539.2, 300 sec: 5535.6). Total num frames: 1047165952. Throughput: 0: 5814.1. Samples: 1047172550. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:49:59,955][25689] Avg episode reward: [(0, '-0.561')] [2022-07-11 03:50:00,142][26022] Updated weights on worker 0-0, policy_version 1022626 (0.00086) [2022-07-11 03:50:02,323][26022] Updated weights on worker 0-0, policy_version 1022636 (0.00085) [2022-07-11 03:50:04,302][26022] Updated weights on worker 0-0, policy_version 1022646 (0.00094) [2022-07-11 03:50:04,971][25689] Fps is (10 sec: 5191.6, 60 sec: 5539.7, 300 sec: 5530.2). Total num frames: 1047192576. Throughput: 0: 4861.1. Samples: 1047186988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:04,971][25689] Avg episode reward: [(0, '-0.524')] [2022-07-11 03:50:05,948][26022] Updated weights on worker 0-0, policy_version 1022656 (0.00094) [2022-07-11 03:50:08,000][26022] Updated weights on worker 0-0, policy_version 1022666 (0.00087) [2022-07-11 03:50:09,794][26022] Updated weights on worker 0-0, policy_version 1022676 (0.00083) [2022-07-11 03:50:10,029][25689] Fps is (10 sec: 5387.4, 60 sec: 5508.5, 300 sec: 5532.7). Total num frames: 1047220224. Throughput: 0: 5706.0. Samples: 1047220246. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:10,030][25689] Avg episode reward: [(0, '-1.091')] [2022-07-11 03:50:11,672][26022] Updated weights on worker 0-0, policy_version 1022686 (0.00085) [2022-07-11 03:50:13,510][26022] Updated weights on worker 0-0, policy_version 1022696 (0.00098) [2022-07-11 03:50:15,038][25689] Fps is (10 sec: 5594.4, 60 sec: 5531.5, 300 sec: 5532.7). Total num frames: 1047248896. Throughput: 0: 5691.6. Samples: 1047253488. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:15,038][25689] Avg episode reward: [(0, '-1.277')] [2022-07-11 03:50:15,382][26022] Updated weights on worker 0-0, policy_version 1022706 (0.00084) [2022-07-11 03:50:17,240][26022] Updated weights on worker 0-0, policy_version 1022716 (0.00089) [2022-07-11 03:50:19,100][26022] Updated weights on worker 0-0, policy_version 1022726 (0.00085) [2022-07-11 03:50:20,074][25689] Fps is (10 sec: 5504.8, 60 sec: 5516.8, 300 sec: 5525.5). Total num frames: 1047275520. Throughput: 0: 4837.5. Samples: 1047269876. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:20,074][25689] Avg episode reward: [(0, '-1.238')] [2022-07-11 03:50:20,872][26022] Updated weights on worker 0-0, policy_version 1022736 (0.00098) [2022-07-11 03:50:22,947][26022] Updated weights on worker 0-0, policy_version 1022746 (0.00089) [2022-07-11 03:50:24,630][26022] Updated weights on worker 0-0, policy_version 1022756 (0.00107) [2022-07-11 03:50:25,108][25689] Fps is (10 sec: 5491.3, 60 sec: 5514.4, 300 sec: 5530.8). Total num frames: 1047304192. Throughput: 0: 5763.9. Samples: 1047303056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:25,108][25689] Avg episode reward: [(0, '-0.114')] [2022-07-11 03:50:26,480][26022] Updated weights on worker 0-0, policy_version 1022766 (0.00094) [2022-07-11 03:50:28,495][26022] Updated weights on worker 0-0, policy_version 1022776 (0.00493) [2022-07-11 03:50:30,163][25689] Fps is (10 sec: 5582.3, 60 sec: 5516.3, 300 sec: 5531.0). Total num frames: 1047331840. Throughput: 0: 5763.2. Samples: 1047336282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:30,163][25689] Avg episode reward: [(0, '-0.221')] [2022-07-11 03:50:30,202][26022] Updated weights on worker 0-0, policy_version 1022786 (0.00099) [2022-07-11 03:50:32,278][26022] Updated weights on worker 0-0, policy_version 1022796 (0.00089) [2022-07-11 03:50:33,822][26022] Updated weights on worker 0-0, policy_version 1022806 (0.00085) [2022-07-11 03:50:35,199][25689] Fps is (10 sec: 5479.6, 60 sec: 5482.7, 300 sec: 5531.0). Total num frames: 1047359488. Throughput: 0: 5773.8. Samples: 1047369894. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:35,199][25689] Avg episode reward: [(0, '-0.174')] [2022-07-11 03:50:35,793][26022] Updated weights on worker 0-0, policy_version 1022816 (0.00083) [2022-07-11 03:50:37,604][26022] Updated weights on worker 0-0, policy_version 1022826 (0.00083) [2022-07-11 03:50:39,529][26022] Updated weights on worker 0-0, policy_version 1022836 (0.00084) [2022-07-11 03:50:40,295][25689] Fps is (10 sec: 5558.5, 60 sec: 5515.1, 300 sec: 5526.0). Total num frames: 1047388160. Throughput: 0: 5773.1. Samples: 1047386616. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:40,296][25689] Avg episode reward: [(0, '0.323')] [2022-07-11 03:50:41,447][26022] Updated weights on worker 0-0, policy_version 1022846 (0.00088) [2022-07-11 03:50:43,099][26022] Updated weights on worker 0-0, policy_version 1022856 (0.00091) [2022-07-11 03:50:45,005][26022] Updated weights on worker 0-0, policy_version 1022866 (0.00098) [2022-07-11 03:50:45,300][25689] Fps is (10 sec: 5677.0, 60 sec: 5515.0, 300 sec: 5537.2). Total num frames: 1047416832. Throughput: 0: 5783.9. Samples: 1047419846. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:45,301][25689] Avg episode reward: [(0, '-0.001')] [2022-07-11 03:50:46,873][26022] Updated weights on worker 0-0, policy_version 1022876 (0.00091) [2022-07-11 03:50:48,771][26022] Updated weights on worker 0-0, policy_version 1022886 (0.00093) [2022-07-11 03:50:50,404][25689] Fps is (10 sec: 5571.4, 60 sec: 5517.5, 300 sec: 5528.4). Total num frames: 1047444480. Throughput: 0: 5771.4. Samples: 1047453102. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:50,404][25689] Avg episode reward: [(0, '0.432')] [2022-07-11 03:50:50,515][26022] Updated weights on worker 0-0, policy_version 1022896 (0.00096) [2022-07-11 03:50:52,548][26022] Updated weights on worker 0-0, policy_version 1022906 (0.00091) [2022-07-11 03:50:54,074][26022] Updated weights on worker 0-0, policy_version 1022916 (0.00089) [2022-07-11 03:50:55,463][25689] Fps is (10 sec: 5541.6, 60 sec: 5496.1, 300 sec: 5531.2). Total num frames: 1047473152. Throughput: 0: 4936.9. Samples: 1047469936. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:50:55,464][25689] Avg episode reward: [(0, '0.591')] [2022-07-11 03:50:56,107][26022] Updated weights on worker 0-0, policy_version 1022926 (0.00095) [2022-07-11 03:50:57,824][26022] Updated weights on worker 0-0, policy_version 1022936 (0.00090) [2022-07-11 03:50:59,687][26022] Updated weights on worker 0-0, policy_version 1022946 (0.00104) [2022-07-11 03:51:00,516][25689] Fps is (10 sec: 5569.8, 60 sec: 5529.1, 300 sec: 5538.5). Total num frames: 1047500800. Throughput: 0: 5785.3. Samples: 1047503598. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:00,516][25689] Avg episode reward: [(0, '-0.161')] [2022-07-11 03:51:01,562][26022] Updated weights on worker 0-0, policy_version 1022956 (0.00096) [2022-07-11 03:51:03,795][26022] Updated weights on worker 0-0, policy_version 1022966 (0.00087) [2022-07-11 03:51:05,344][26022] Updated weights on worker 0-0, policy_version 1022976 (0.00086) [2022-07-11 03:51:05,544][25689] Fps is (10 sec: 5383.7, 60 sec: 5528.0, 300 sec: 5528.5). Total num frames: 1047527424. Throughput: 0: 5680.4. Samples: 1047534840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:05,545][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 03:51:07,485][26022] Updated weights on worker 0-0, policy_version 1022986 (0.00090) [2022-07-11 03:51:09,224][26022] Updated weights on worker 0-0, policy_version 1022996 (0.00096) [2022-07-11 03:51:10,589][25689] Fps is (10 sec: 5286.0, 60 sec: 5512.3, 300 sec: 5529.3). Total num frames: 1047554048. Throughput: 0: 4875.8. Samples: 1047551520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:10,589][25689] Avg episode reward: [(0, '0.175')] [2022-07-11 03:51:10,947][26022] Updated weights on worker 0-0, policy_version 1023006 (0.00083) [2022-07-11 03:51:13,047][26022] Updated weights on worker 0-0, policy_version 1023016 (0.00089) [2022-07-11 03:51:14,508][26022] Updated weights on worker 0-0, policy_version 1023026 (0.00082) [2022-07-11 03:51:15,595][25689] Fps is (10 sec: 5501.5, 60 sec: 5512.6, 300 sec: 5527.5). Total num frames: 1047582720. Throughput: 0: 5733.2. Samples: 1047585356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:15,596][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 03:51:16,618][26022] Updated weights on worker 0-0, policy_version 1023036 (0.00080) [2022-07-11 03:51:18,480][26022] Updated weights on worker 0-0, policy_version 1023046 (0.00085) [2022-07-11 03:51:20,153][26022] Updated weights on worker 0-0, policy_version 1023056 (0.00086) [2022-07-11 03:51:20,614][25689] Fps is (10 sec: 5618.0, 60 sec: 5531.0, 300 sec: 5530.7). Total num frames: 1047610368. Throughput: 0: 5734.3. Samples: 1047618848. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:20,614][25689] Avg episode reward: [(0, '-0.125')] [2022-07-11 03:51:22,129][26022] Updated weights on worker 0-0, policy_version 1023066 (0.00084) [2022-07-11 03:51:23,967][26022] Updated weights on worker 0-0, policy_version 1023076 (0.00093) [2022-07-11 03:51:25,569][26022] Updated weights on worker 0-0, policy_version 1023086 (0.00091) [2022-07-11 03:51:25,660][25689] Fps is (10 sec: 5697.4, 60 sec: 5546.8, 300 sec: 5536.2). Total num frames: 1047640064. Throughput: 0: 5000.9. Samples: 1047635438. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:25,662][25689] Avg episode reward: [(0, '-0.773')] [2022-07-11 03:51:27,798][26022] Updated weights on worker 0-0, policy_version 1023096 (0.00099) [2022-07-11 03:51:29,169][26022] Updated weights on worker 0-0, policy_version 1023106 (0.00087) [2022-07-11 03:51:30,735][25689] Fps is (10 sec: 5463.6, 60 sec: 5511.2, 300 sec: 5521.1). Total num frames: 1047665664. Throughput: 0: 5809.7. Samples: 1047668560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:30,735][25689] Avg episode reward: [(0, '0.030')] [2022-07-11 03:51:31,603][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:51:31,614][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001023116_1047670784.pth [2022-07-11 03:51:31,614][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001021172_1045680128.pth [2022-07-11 03:51:31,620][26022] Updated weights on worker 0-0, policy_version 1023116 (0.00089) [2022-07-11 03:51:32,929][26022] Updated weights on worker 0-0, policy_version 1023126 (0.00085) [2022-07-11 03:51:35,031][26022] Updated weights on worker 0-0, policy_version 1023136 (0.00090) [2022-07-11 03:51:35,742][25689] Fps is (10 sec: 5484.7, 60 sec: 5547.7, 300 sec: 5534.7). Total num frames: 1047695360. Throughput: 0: 5799.7. Samples: 1047702200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:35,742][25689] Avg episode reward: [(0, '-0.064')] [2022-07-11 03:51:36,753][26022] Updated weights on worker 0-0, policy_version 1023146 (0.00086) [2022-07-11 03:51:38,734][26022] Updated weights on worker 0-0, policy_version 1023156 (0.00093) [2022-07-11 03:51:40,579][26022] Updated weights on worker 0-0, policy_version 1023166 (0.00090) [2022-07-11 03:51:40,788][25689] Fps is (10 sec: 5703.7, 60 sec: 5535.3, 300 sec: 5534.0). Total num frames: 1047723008. Throughput: 0: 4960.8. Samples: 1047718930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:40,789][25689] Avg episode reward: [(0, '-0.514')] [2022-07-11 03:51:42,402][26022] Updated weights on worker 0-0, policy_version 1023176 (0.00084) [2022-07-11 03:51:44,117][26022] Updated weights on worker 0-0, policy_version 1023186 (0.00088) [2022-07-11 03:51:45,807][25689] Fps is (10 sec: 5391.8, 60 sec: 5500.2, 300 sec: 5527.8). Total num frames: 1047749632. Throughput: 0: 5793.9. Samples: 1047752170. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:45,808][25689] Avg episode reward: [(0, '-0.380')] [2022-07-11 03:51:46,167][26022] Updated weights on worker 0-0, policy_version 1023196 (0.00084) [2022-07-11 03:51:47,849][26022] Updated weights on worker 0-0, policy_version 1023206 (0.00088) [2022-07-11 03:51:49,778][26022] Updated weights on worker 0-0, policy_version 1023216 (0.00091) [2022-07-11 03:51:50,874][25689] Fps is (10 sec: 5482.7, 60 sec: 5520.5, 300 sec: 5530.2). Total num frames: 1047778304. Throughput: 0: 5802.8. Samples: 1047785424. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:50,874][25689] Avg episode reward: [(0, '0.507')] [2022-07-11 03:51:51,478][26022] Updated weights on worker 0-0, policy_version 1023226 (0.00086) [2022-07-11 03:51:53,584][26022] Updated weights on worker 0-0, policy_version 1023236 (0.00087) [2022-07-11 03:51:55,192][26022] Updated weights on worker 0-0, policy_version 1023246 (0.00087) [2022-07-11 03:51:55,882][25689] Fps is (10 sec: 5692.0, 60 sec: 5525.2, 300 sec: 5530.5). Total num frames: 1047806976. Throughput: 0: 4964.0. Samples: 1047802176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:51:55,882][25689] Avg episode reward: [(0, '1.020')] [2022-07-11 03:51:57,230][26022] Updated weights on worker 0-0, policy_version 1023256 (0.00090) [2022-07-11 03:51:58,854][26022] Updated weights on worker 0-0, policy_version 1023266 (0.00088) [2022-07-11 03:52:00,846][26022] Updated weights on worker 0-0, policy_version 1023276 (0.00094) [2022-07-11 03:52:00,895][25689] Fps is (10 sec: 5620.2, 60 sec: 5528.8, 300 sec: 5533.9). Total num frames: 1047834624. Throughput: 0: 5808.0. Samples: 1047835708. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:52:00,895][25689] Avg episode reward: [(0, '0.659')] [2022-07-11 03:52:02,891][26022] Updated weights on worker 0-0, policy_version 1023286 (0.00068) [2022-07-11 03:52:04,810][26022] Updated weights on worker 0-0, policy_version 1023296 (0.00097) [2022-07-11 03:52:05,919][25689] Fps is (10 sec: 5305.1, 60 sec: 5512.2, 300 sec: 5524.7). Total num frames: 1047860224. Throughput: 0: 5743.0. Samples: 1047867672. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:52:05,920][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 03:52:06,608][26022] Updated weights on worker 0-0, policy_version 1023306 (0.00085) [2022-07-11 03:52:08,522][26022] Updated weights on worker 0-0, policy_version 1023316 (0.00091) [2022-07-11 03:52:10,317][26022] Updated weights on worker 0-0, policy_version 1023326 (0.00085) [2022-07-11 03:52:11,062][25689] Fps is (10 sec: 5438.7, 60 sec: 5554.1, 300 sec: 5532.9). Total num frames: 1047889920. Throughput: 0: 4895.0. Samples: 1047884246. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:52:11,063][25689] Avg episode reward: [(0, '0.782')] [2022-07-11 03:52:12,351][26022] Updated weights on worker 0-0, policy_version 1023336 (0.00089) [2022-07-11 03:52:13,866][26022] Updated weights on worker 0-0, policy_version 1023346 (0.00092) [2022-07-11 03:52:15,955][26022] Updated weights on worker 0-0, policy_version 1023356 (0.00092) [2022-07-11 03:52:16,072][25689] Fps is (10 sec: 5648.0, 60 sec: 5536.8, 300 sec: 5529.7). Total num frames: 1047917568. Throughput: 0: 5731.1. Samples: 1047917888. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:52:16,072][25689] Avg episode reward: [(0, '0.885')] [2022-07-11 03:52:17,544][26022] Updated weights on worker 0-0, policy_version 1023366 (0.00086) [2022-07-11 03:52:19,420][26022] Updated weights on worker 0-0, policy_version 1023376 (0.00093) [2022-07-11 03:52:21,085][25689] Fps is (10 sec: 5619.1, 60 sec: 5554.3, 300 sec: 5533.0). Total num frames: 1047946240. Throughput: 0: 5752.9. Samples: 1047951860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:52:21,086][25689] Avg episode reward: [(0, '1.136')] [2022-07-11 03:52:21,233][26022] Updated weights on worker 0-0, policy_version 1023386 (0.00090) [2022-07-11 03:52:23,043][26022] Updated weights on worker 0-0, policy_version 1023396 (0.00081) [2022-07-11 03:52:24,868][26022] Updated weights on worker 0-0, policy_version 1023406 (0.00090) [2022-07-11 03:52:26,116][25689] Fps is (10 sec: 5607.2, 60 sec: 5521.7, 300 sec: 5530.4). Total num frames: 1047973888. Throughput: 0: 5018.8. Samples: 1047969038. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 03:52:26,117][25689] Avg episode reward: [(0, '0.870')] [2022-07-11 03:52:26,656][26022] Updated weights on worker 0-0, policy_version 1023416 (0.00093) [2022-07-11 03:52:28,593][26022] Updated weights on worker 0-0, policy_version 1023426 (0.00233) [2022-07-11 03:52:30,475][26022] Updated weights on worker 0-0, policy_version 1023436 (0.00091) [2022-07-11 03:52:31,175][25689] Fps is (10 sec: 5581.6, 60 sec: 5574.0, 300 sec: 5537.4). Total num frames: 1048002560. Throughput: 0: 5873.7. Samples: 1048002384. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:52:31,176][25689] Avg episode reward: [(0, '1.493')] [2022-07-11 03:52:32,261][26022] Updated weights on worker 0-0, policy_version 1023446 (0.00086) [2022-07-11 03:52:34,016][26022] Updated weights on worker 0-0, policy_version 1023456 (0.00097) [2022-07-11 03:52:35,927][26022] Updated weights on worker 0-0, policy_version 1023466 (0.00088) [2022-07-11 03:52:36,259][25689] Fps is (10 sec: 5653.8, 60 sec: 5550.0, 300 sec: 5529.4). Total num frames: 1048031232. Throughput: 0: 5858.8. Samples: 1048036158. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:52:36,259][25689] Avg episode reward: [(0, '0.899')] [2022-07-11 03:52:37,542][26022] Updated weights on worker 0-0, policy_version 1023476 (0.00084) [2022-07-11 03:52:39,404][26022] Updated weights on worker 0-0, policy_version 1023486 (0.00100) [2022-07-11 03:52:41,264][25689] Fps is (10 sec: 5582.4, 60 sec: 5553.8, 300 sec: 5537.0). Total num frames: 1048058880. Throughput: 0: 5015.8. Samples: 1048053074. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:52:41,265][25689] Avg episode reward: [(0, '0.781')] [2022-07-11 03:52:41,413][26022] Updated weights on worker 0-0, policy_version 1023496 (0.00086) [2022-07-11 03:52:43,082][26022] Updated weights on worker 0-0, policy_version 1023506 (0.00087) [2022-07-11 03:52:44,988][26022] Updated weights on worker 0-0, policy_version 1023516 (0.00091) [2022-07-11 03:52:46,306][25689] Fps is (10 sec: 5707.4, 60 sec: 5602.5, 300 sec: 5534.3). Total num frames: 1048088576. Throughput: 0: 5837.1. Samples: 1048086888. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:52:46,310][25689] Avg episode reward: [(0, '0.458')] [2022-07-11 03:52:46,670][26022] Updated weights on worker 0-0, policy_version 1023526 (0.00094) [2022-07-11 03:52:48,480][26022] Updated weights on worker 0-0, policy_version 1023536 (0.00085) [2022-07-11 03:52:50,492][26022] Updated weights on worker 0-0, policy_version 1023546 (0.00088) [2022-07-11 03:52:51,395][25689] Fps is (10 sec: 5559.6, 60 sec: 5566.6, 300 sec: 5534.5). Total num frames: 1048115200. Throughput: 0: 5833.0. Samples: 1048120322. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:52:51,395][25689] Avg episode reward: [(0, '0.369')] [2022-07-11 03:52:52,253][26022] Updated weights on worker 0-0, policy_version 1023556 (0.00088) [2022-07-11 03:52:54,158][26022] Updated weights on worker 0-0, policy_version 1023566 (0.00095) [2022-07-11 03:52:56,084][26022] Updated weights on worker 0-0, policy_version 1023576 (0.00092) [2022-07-11 03:52:56,408][25689] Fps is (10 sec: 5473.7, 60 sec: 5566.1, 300 sec: 5538.6). Total num frames: 1048143872. Throughput: 0: 5005.4. Samples: 1048137014. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:52:56,409][25689] Avg episode reward: [(0, '0.522')] [2022-07-11 03:52:57,861][26022] Updated weights on worker 0-0, policy_version 1023586 (0.00089) [2022-07-11 03:52:59,588][26022] Updated weights on worker 0-0, policy_version 1023596 (0.00086) [2022-07-11 03:53:01,381][26022] Updated weights on worker 0-0, policy_version 1023606 (0.00094) [2022-07-11 03:53:01,422][25689] Fps is (10 sec: 5718.9, 60 sec: 5583.0, 300 sec: 5546.2). Total num frames: 1048172544. Throughput: 0: 5837.0. Samples: 1048170732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:01,422][25689] Avg episode reward: [(0, '0.303')] [2022-07-11 03:53:03,801][26022] Updated weights on worker 0-0, policy_version 1023616 (0.00089) [2022-07-11 03:53:05,439][26022] Updated weights on worker 0-0, policy_version 1023626 (0.00087) [2022-07-11 03:53:06,444][25689] Fps is (10 sec: 5305.9, 60 sec: 5566.3, 300 sec: 5536.4). Total num frames: 1048197120. Throughput: 0: 5736.9. Samples: 1048202416. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:06,445][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 03:53:07,291][26022] Updated weights on worker 0-0, policy_version 1023636 (0.00089) [2022-07-11 03:53:09,152][26022] Updated weights on worker 0-0, policy_version 1023646 (0.00092) [2022-07-11 03:53:11,052][26022] Updated weights on worker 0-0, policy_version 1023656 (0.00094) [2022-07-11 03:53:11,506][25689] Fps is (10 sec: 5280.0, 60 sec: 5556.7, 300 sec: 5535.7). Total num frames: 1048225792. Throughput: 0: 4918.1. Samples: 1048219234. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:11,507][25689] Avg episode reward: [(0, '1.362')] [2022-07-11 03:53:12,618][26022] Updated weights on worker 0-0, policy_version 1023666 (0.01209) [2022-07-11 03:53:14,605][26022] Updated weights on worker 0-0, policy_version 1023676 (0.00082) [2022-07-11 03:53:16,514][25689] Fps is (10 sec: 5592.4, 60 sec: 5556.9, 300 sec: 5536.0). Total num frames: 1048253440. Throughput: 0: 5775.8. Samples: 1048253144. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:16,515][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 03:53:16,539][26022] Updated weights on worker 0-0, policy_version 1023686 (0.00099) [2022-07-11 03:53:18,263][26022] Updated weights on worker 0-0, policy_version 1023696 (0.00093) [2022-07-11 03:53:20,207][26022] Updated weights on worker 0-0, policy_version 1023706 (0.00596) [2022-07-11 03:53:21,529][25689] Fps is (10 sec: 5619.0, 60 sec: 5556.7, 300 sec: 5539.3). Total num frames: 1048282112. Throughput: 0: 5770.3. Samples: 1048286762. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:21,530][25689] Avg episode reward: [(0, '1.315')] [2022-07-11 03:53:21,744][26022] Updated weights on worker 0-0, policy_version 1023716 (0.00102) [2022-07-11 03:53:23,781][26022] Updated weights on worker 0-0, policy_version 1023726 (0.00088) [2022-07-11 03:53:25,602][26022] Updated weights on worker 0-0, policy_version 1023736 (0.00075) [2022-07-11 03:53:26,539][25689] Fps is (10 sec: 5618.4, 60 sec: 5558.7, 300 sec: 5537.0). Total num frames: 1048309760. Throughput: 0: 5022.0. Samples: 1048303332. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:26,539][25689] Avg episode reward: [(0, '1.467')] [2022-07-11 03:53:27,493][26022] Updated weights on worker 0-0, policy_version 1023746 (0.00086) [2022-07-11 03:53:29,324][26022] Updated weights on worker 0-0, policy_version 1023756 (0.00089) [2022-07-11 03:53:31,179][26022] Updated weights on worker 0-0, policy_version 1023766 (0.00087) [2022-07-11 03:53:31,638][25689] Fps is (10 sec: 5470.3, 60 sec: 5538.1, 300 sec: 5533.2). Total num frames: 1048337408. Throughput: 0: 5827.4. Samples: 1048336548. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:31,639][25689] Avg episode reward: [(0, '1.011')] [2022-07-11 03:53:31,658][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:53:31,670][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001023768_1048338432.pth [2022-07-11 03:53:31,671][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001021821_1046344704.pth [2022-07-11 03:53:32,853][26022] Updated weights on worker 0-0, policy_version 1023776 (0.00090) [2022-07-11 03:53:34,927][26022] Updated weights on worker 0-0, policy_version 1023786 (0.00086) [2022-07-11 03:53:36,598][26022] Updated weights on worker 0-0, policy_version 1023796 (0.00092) [2022-07-11 03:53:36,646][25689] Fps is (10 sec: 5673.6, 60 sec: 5562.0, 300 sec: 5540.0). Total num frames: 1048367104. Throughput: 0: 5809.2. Samples: 1048370092. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:36,646][25689] Avg episode reward: [(0, '0.542')] [2022-07-11 03:53:38,532][26022] Updated weights on worker 0-0, policy_version 1023806 (0.00096) [2022-07-11 03:53:40,367][26022] Updated weights on worker 0-0, policy_version 1023816 (0.00097) [2022-07-11 03:53:41,656][25689] Fps is (10 sec: 5723.8, 60 sec: 5561.5, 300 sec: 5537.1). Total num frames: 1048394752. Throughput: 0: 5806.2. Samples: 1048403622. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:41,658][25689] Avg episode reward: [(0, '0.650')] [2022-07-11 03:53:42,172][26022] Updated weights on worker 0-0, policy_version 1023826 (0.00086) [2022-07-11 03:53:44,122][26022] Updated weights on worker 0-0, policy_version 1023836 (0.00093) [2022-07-11 03:53:45,955][26022] Updated weights on worker 0-0, policy_version 1023846 (0.00083) [2022-07-11 03:53:46,663][25689] Fps is (10 sec: 5520.3, 60 sec: 5530.9, 300 sec: 5542.5). Total num frames: 1048422400. Throughput: 0: 5810.1. Samples: 1048420254. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:46,663][25689] Avg episode reward: [(0, '0.508')] [2022-07-11 03:53:47,643][26022] Updated weights on worker 0-0, policy_version 1023856 (0.00080) [2022-07-11 03:53:49,576][26022] Updated weights on worker 0-0, policy_version 1023866 (0.00096) [2022-07-11 03:53:51,343][26022] Updated weights on worker 0-0, policy_version 1023876 (0.00095) [2022-07-11 03:53:51,770][25689] Fps is (10 sec: 5467.3, 60 sec: 5546.1, 300 sec: 5531.2). Total num frames: 1048450048. Throughput: 0: 5819.2. Samples: 1048453702. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:51,771][25689] Avg episode reward: [(0, '0.078')] [2022-07-11 03:53:53,591][26022] Updated weights on worker 0-0, policy_version 1023886 (0.00084) [2022-07-11 03:53:55,107][26022] Updated weights on worker 0-0, policy_version 1023896 (0.00092) [2022-07-11 03:53:56,783][25689] Fps is (10 sec: 5362.8, 60 sec: 5512.2, 300 sec: 5535.4). Total num frames: 1048476672. Throughput: 0: 5783.9. Samples: 1048486562. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:53:56,783][25689] Avg episode reward: [(0, '-0.816')] [2022-07-11 03:53:56,959][26022] Updated weights on worker 0-0, policy_version 1023906 (0.00096) [2022-07-11 03:53:58,911][26022] Updated weights on worker 0-0, policy_version 1023916 (0.00083) [2022-07-11 03:54:00,569][26022] Updated weights on worker 0-0, policy_version 1023926 (0.00085) [2022-07-11 03:54:01,809][25689] Fps is (10 sec: 5508.2, 60 sec: 5511.1, 300 sec: 5542.2). Total num frames: 1048505344. Throughput: 0: 4943.7. Samples: 1048503252. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:01,810][25689] Avg episode reward: [(0, '0.252')] [2022-07-11 03:54:02,866][26022] Updated weights on worker 0-0, policy_version 1023936 (0.00084) [2022-07-11 03:54:04,723][26022] Updated weights on worker 0-0, policy_version 1023946 (0.00088) [2022-07-11 03:54:06,615][26022] Updated weights on worker 0-0, policy_version 1023956 (0.00083) [2022-07-11 03:54:06,820][25689] Fps is (10 sec: 5509.1, 60 sec: 5546.0, 300 sec: 5533.3). Total num frames: 1048531968. Throughput: 0: 5681.1. Samples: 1048534770. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:06,820][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 03:54:08,515][26022] Updated weights on worker 0-0, policy_version 1023966 (0.00093) [2022-07-11 03:54:10,300][26022] Updated weights on worker 0-0, policy_version 1023976 (0.00089) [2022-07-11 03:54:11,899][25689] Fps is (10 sec: 5480.6, 60 sec: 5544.6, 300 sec: 5536.7). Total num frames: 1048560640. Throughput: 0: 5679.9. Samples: 1048568028. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:11,899][25689] Avg episode reward: [(0, '0.529')] [2022-07-11 03:54:12,087][26022] Updated weights on worker 0-0, policy_version 1023986 (0.00086) [2022-07-11 03:54:13,813][26022] Updated weights on worker 0-0, policy_version 1023996 (0.00089) [2022-07-11 03:54:15,679][26022] Updated weights on worker 0-0, policy_version 1024006 (0.00091) [2022-07-11 03:54:16,919][25689] Fps is (10 sec: 5576.9, 60 sec: 5543.4, 300 sec: 5537.4). Total num frames: 1048588288. Throughput: 0: 4874.2. Samples: 1048584708. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:16,919][25689] Avg episode reward: [(0, '0.596')] [2022-07-11 03:54:17,766][26022] Updated weights on worker 0-0, policy_version 1024016 (0.00085) [2022-07-11 03:54:19,404][26022] Updated weights on worker 0-0, policy_version 1024026 (0.00091) [2022-07-11 03:54:21,392][26022] Updated weights on worker 0-0, policy_version 1024036 (0.00089) [2022-07-11 03:54:21,927][25689] Fps is (10 sec: 5514.2, 60 sec: 5527.2, 300 sec: 5534.0). Total num frames: 1048615936. Throughput: 0: 5722.3. Samples: 1048618370. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:21,927][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 03:54:23,135][26022] Updated weights on worker 0-0, policy_version 1024046 (0.00088) [2022-07-11 03:54:24,906][26022] Updated weights on worker 0-0, policy_version 1024056 (0.00097) [2022-07-11 03:54:26,702][26022] Updated weights on worker 0-0, policy_version 1024066 (0.00097) [2022-07-11 03:54:26,946][25689] Fps is (10 sec: 5514.7, 60 sec: 5526.2, 300 sec: 5535.1). Total num frames: 1048643584. Throughput: 0: 5815.5. Samples: 1048651812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:26,946][25689] Avg episode reward: [(0, '0.601')] [2022-07-11 03:54:28,724][26022] Updated weights on worker 0-0, policy_version 1024076 (0.00092) [2022-07-11 03:54:30,537][26022] Updated weights on worker 0-0, policy_version 1024086 (0.00091) [2022-07-11 03:54:32,009][25689] Fps is (10 sec: 5585.9, 60 sec: 5546.5, 300 sec: 5531.2). Total num frames: 1048672256. Throughput: 0: 4990.8. Samples: 1048668396. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:32,011][25689] Avg episode reward: [(0, '0.885')] [2022-07-11 03:54:32,361][26022] Updated weights on worker 0-0, policy_version 1024096 (0.00086) [2022-07-11 03:54:34,193][26022] Updated weights on worker 0-0, policy_version 1024106 (0.00088) [2022-07-11 03:54:36,074][26022] Updated weights on worker 0-0, policy_version 1024116 (0.00082) [2022-07-11 03:54:37,020][25689] Fps is (10 sec: 5692.0, 60 sec: 5529.2, 300 sec: 5539.3). Total num frames: 1048700928. Throughput: 0: 5823.1. Samples: 1048701760. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:37,022][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 03:54:37,995][26022] Updated weights on worker 0-0, policy_version 1024126 (0.00085) [2022-07-11 03:54:39,683][26022] Updated weights on worker 0-0, policy_version 1024136 (0.00084) [2022-07-11 03:54:41,691][26022] Updated weights on worker 0-0, policy_version 1024146 (0.00084) [2022-07-11 03:54:42,046][25689] Fps is (10 sec: 5407.1, 60 sec: 5493.9, 300 sec: 5528.6). Total num frames: 1048726528. Throughput: 0: 5812.5. Samples: 1048735316. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:42,047][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 03:54:43,419][26022] Updated weights on worker 0-0, policy_version 1024156 (0.00085) [2022-07-11 03:54:45,473][26022] Updated weights on worker 0-0, policy_version 1024166 (0.00079) [2022-07-11 03:54:47,053][25689] Fps is (10 sec: 5409.3, 60 sec: 5510.8, 300 sec: 5534.3). Total num frames: 1048755200. Throughput: 0: 4983.5. Samples: 1048752018. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:47,055][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 03:54:47,113][26022] Updated weights on worker 0-0, policy_version 1024176 (0.00078) [2022-07-11 03:54:49,000][26022] Updated weights on worker 0-0, policy_version 1024186 (0.00085) [2022-07-11 03:54:50,999][26022] Updated weights on worker 0-0, policy_version 1024196 (0.00112) [2022-07-11 03:54:52,177][25689] Fps is (10 sec: 5660.2, 60 sec: 5526.2, 300 sec: 5528.8). Total num frames: 1048783872. Throughput: 0: 5802.2. Samples: 1048785416. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:52,178][25689] Avg episode reward: [(0, '1.128')] [2022-07-11 03:54:52,547][26022] Updated weights on worker 0-0, policy_version 1024206 (0.00091) [2022-07-11 03:54:54,617][26022] Updated weights on worker 0-0, policy_version 1024216 (0.00085) [2022-07-11 03:54:56,222][26022] Updated weights on worker 0-0, policy_version 1024226 (0.00089) [2022-07-11 03:54:57,244][25689] Fps is (10 sec: 5627.0, 60 sec: 5555.1, 300 sec: 5538.7). Total num frames: 1048812544. Throughput: 0: 5791.9. Samples: 1048818896. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:54:57,245][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 03:54:58,139][26022] Updated weights on worker 0-0, policy_version 1024236 (0.00082) [2022-07-11 03:54:59,874][26022] Updated weights on worker 0-0, policy_version 1024246 (0.00095) [2022-07-11 03:55:02,112][26022] Updated weights on worker 0-0, policy_version 1024256 (0.00088) [2022-07-11 03:55:02,337][25689] Fps is (10 sec: 5442.7, 60 sec: 5515.2, 300 sec: 5537.2). Total num frames: 1048839168. Throughput: 0: 4945.7. Samples: 1048835668. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:02,338][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 03:55:03,946][26022] Updated weights on worker 0-0, policy_version 1024266 (0.00085) [2022-07-11 03:55:05,835][26022] Updated weights on worker 0-0, policy_version 1024276 (0.00093) [2022-07-11 03:55:07,436][25689] Fps is (10 sec: 5425.7, 60 sec: 5541.0, 300 sec: 5539.9). Total num frames: 1048867840. Throughput: 0: 5652.7. Samples: 1048867234. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:07,436][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 03:55:07,571][26022] Updated weights on worker 0-0, policy_version 1024286 (0.00084) [2022-07-11 03:55:09,527][26022] Updated weights on worker 0-0, policy_version 1024296 (0.00085) [2022-07-11 03:55:11,207][26022] Updated weights on worker 0-0, policy_version 1024306 (0.00089) [2022-07-11 03:55:12,573][25689] Fps is (10 sec: 5402.2, 60 sec: 5501.9, 300 sec: 5530.7). Total num frames: 1048894464. Throughput: 0: 5658.8. Samples: 1048900830. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:12,573][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 03:55:13,155][26022] Updated weights on worker 0-0, policy_version 1024316 (0.00087) [2022-07-11 03:55:14,943][26022] Updated weights on worker 0-0, policy_version 1024326 (0.00092) [2022-07-11 03:55:16,952][26022] Updated weights on worker 0-0, policy_version 1024336 (0.00090) [2022-07-11 03:55:17,621][25689] Fps is (10 sec: 5529.6, 60 sec: 5533.1, 300 sec: 5540.8). Total num frames: 1048924160. Throughput: 0: 4842.9. Samples: 1048917578. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:17,621][25689] Avg episode reward: [(0, '0.428')] [2022-07-11 03:55:18,592][26022] Updated weights on worker 0-0, policy_version 1024346 (0.00094) [2022-07-11 03:55:20,557][26022] Updated weights on worker 0-0, policy_version 1024356 (0.00091) [2022-07-11 03:55:22,291][26022] Updated weights on worker 0-0, policy_version 1024366 (0.00093) [2022-07-11 03:55:22,698][25689] Fps is (10 sec: 5764.7, 60 sec: 5543.7, 300 sec: 5540.0). Total num frames: 1048952832. Throughput: 0: 5664.3. Samples: 1048950998. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:22,699][25689] Avg episode reward: [(0, '0.363')] [2022-07-11 03:55:24,244][26022] Updated weights on worker 0-0, policy_version 1024376 (0.00087) [2022-07-11 03:55:26,014][26022] Updated weights on worker 0-0, policy_version 1024386 (0.00091) [2022-07-11 03:55:27,769][25689] Fps is (10 sec: 5549.6, 60 sec: 5538.9, 300 sec: 5539.7). Total num frames: 1048980480. Throughput: 0: 5751.5. Samples: 1048984182. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:27,770][25689] Avg episode reward: [(0, '-0.321')] [2022-07-11 03:55:28,010][26022] Updated weights on worker 0-0, policy_version 1024396 (0.00085) [2022-07-11 03:55:29,875][26022] Updated weights on worker 0-0, policy_version 1024406 (0.00093) [2022-07-11 03:55:31,622][26022] Updated weights on worker 0-0, policy_version 1024416 (0.00091) [2022-07-11 03:55:31,987][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:55:32,010][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001024417_1049003008.pth [2022-07-11 03:55:32,011][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001022472_1047011328.pth [2022-07-11 03:55:32,873][25689] Fps is (10 sec: 5534.9, 60 sec: 5535.2, 300 sec: 5541.8). Total num frames: 1049009152. Throughput: 0: 5738.7. Samples: 1049017328. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:32,874][25689] Avg episode reward: [(0, '-1.262')] [2022-07-11 03:55:33,551][26022] Updated weights on worker 0-0, policy_version 1024426 (0.00090) [2022-07-11 03:55:35,399][26022] Updated weights on worker 0-0, policy_version 1024436 (0.00091) [2022-07-11 03:55:37,247][26022] Updated weights on worker 0-0, policy_version 1024446 (0.00093) [2022-07-11 03:55:37,898][25689] Fps is (10 sec: 5560.3, 60 sec: 5517.1, 300 sec: 5539.7). Total num frames: 1049036800. Throughput: 0: 5745.5. Samples: 1049034082. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:37,899][25689] Avg episode reward: [(0, '-1.021')] [2022-07-11 03:55:38,969][26022] Updated weights on worker 0-0, policy_version 1024456 (0.00093) [2022-07-11 03:55:40,860][26022] Updated weights on worker 0-0, policy_version 1024466 (0.00098) [2022-07-11 03:55:42,597][26022] Updated weights on worker 0-0, policy_version 1024476 (0.00084) [2022-07-11 03:55:42,949][25689] Fps is (10 sec: 5589.6, 60 sec: 5565.3, 300 sec: 5538.9). Total num frames: 1049065472. Throughput: 0: 5768.0. Samples: 1049067808. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:42,950][25689] Avg episode reward: [(0, '-1.916')] [2022-07-11 03:55:44,470][26022] Updated weights on worker 0-0, policy_version 1024486 (0.00091) [2022-07-11 03:55:46,257][26022] Updated weights on worker 0-0, policy_version 1024496 (0.00086) [2022-07-11 03:55:47,964][25689] Fps is (10 sec: 5595.4, 60 sec: 5547.8, 300 sec: 5540.5). Total num frames: 1049093120. Throughput: 0: 5787.1. Samples: 1049101048. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:47,964][25689] Avg episode reward: [(0, '-2.025')] [2022-07-11 03:55:48,225][26022] Updated weights on worker 0-0, policy_version 1024506 (0.00096) [2022-07-11 03:55:50,155][26022] Updated weights on worker 0-0, policy_version 1024516 (0.00093) [2022-07-11 03:55:51,899][26022] Updated weights on worker 0-0, policy_version 1024526 (0.00090) [2022-07-11 03:55:53,016][25689] Fps is (10 sec: 5391.2, 60 sec: 5520.7, 300 sec: 5533.8). Total num frames: 1049119744. Throughput: 0: 4974.5. Samples: 1049117528. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:53,017][25689] Avg episode reward: [(0, '-2.534')] [2022-07-11 03:55:53,965][26022] Updated weights on worker 0-0, policy_version 1024536 (0.00086) [2022-07-11 03:55:55,798][26022] Updated weights on worker 0-0, policy_version 1024546 (0.00091) [2022-07-11 03:55:57,495][26022] Updated weights on worker 0-0, policy_version 1024556 (0.00086) [2022-07-11 03:55:58,038][25689] Fps is (10 sec: 5488.8, 60 sec: 5524.8, 300 sec: 5537.8). Total num frames: 1049148416. Throughput: 0: 5800.3. Samples: 1049150898. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:55:58,038][25689] Avg episode reward: [(0, '-1.148')] [2022-07-11 03:55:59,360][26022] Updated weights on worker 0-0, policy_version 1024566 (0.00092) [2022-07-11 03:56:01,207][26022] Updated weights on worker 0-0, policy_version 1024576 (0.00083) [2022-07-11 03:56:03,082][25689] Fps is (10 sec: 5391.6, 60 sec: 5512.4, 300 sec: 5534.1). Total num frames: 1049174016. Throughput: 0: 5682.6. Samples: 1049182214. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:56:03,083][25689] Avg episode reward: [(0, '-0.833')] [2022-07-11 03:56:03,383][26022] Updated weights on worker 0-0, policy_version 1024586 (0.00092) [2022-07-11 03:56:05,272][26022] Updated weights on worker 0-0, policy_version 1024596 (0.00106) [2022-07-11 03:56:07,087][26022] Updated weights on worker 0-0, policy_version 1024606 (0.00094) [2022-07-11 03:56:08,097][25689] Fps is (10 sec: 5293.6, 60 sec: 5503.1, 300 sec: 5538.1). Total num frames: 1049201664. Throughput: 0: 4857.2. Samples: 1049198838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:56:08,097][25689] Avg episode reward: [(0, '-0.350')] [2022-07-11 03:56:09,067][26022] Updated weights on worker 0-0, policy_version 1024616 (0.00108) [2022-07-11 03:56:10,928][26022] Updated weights on worker 0-0, policy_version 1024626 (0.00097) [2022-07-11 03:56:12,723][26022] Updated weights on worker 0-0, policy_version 1024636 (0.00088) [2022-07-11 03:56:13,164][25689] Fps is (10 sec: 5586.3, 60 sec: 5543.3, 300 sec: 5536.9). Total num frames: 1049230336. Throughput: 0: 5686.3. Samples: 1049232094. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:56:13,164][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 03:56:14,374][26022] Updated weights on worker 0-0, policy_version 1024646 (0.00056) [2022-07-11 03:56:16,391][26022] Updated weights on worker 0-0, policy_version 1024656 (0.00094) [2022-07-11 03:56:17,962][26022] Updated weights on worker 0-0, policy_version 1024666 (0.00087) [2022-07-11 03:56:18,255][25689] Fps is (10 sec: 5645.2, 60 sec: 5522.4, 300 sec: 5539.0). Total num frames: 1049259008. Throughput: 0: 5663.8. Samples: 1049265402. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:56:18,256][25689] Avg episode reward: [(0, '-0.087')] [2022-07-11 03:56:20,251][26022] Updated weights on worker 0-0, policy_version 1024676 (0.00085) [2022-07-11 03:56:21,780][26022] Updated weights on worker 0-0, policy_version 1024686 (0.00090) [2022-07-11 03:56:23,309][25689] Fps is (10 sec: 5450.7, 60 sec: 5490.8, 300 sec: 5528.6). Total num frames: 1049285632. Throughput: 0: 4935.6. Samples: 1049282044. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 03:56:23,309][25689] Avg episode reward: [(0, '-0.754')] [2022-07-11 03:56:23,774][26022] Updated weights on worker 0-0, policy_version 1024696 (0.00093) [2022-07-11 03:56:25,657][26022] Updated weights on worker 0-0, policy_version 1024706 (0.00092) [2022-07-11 03:56:27,318][26022] Updated weights on worker 0-0, policy_version 1024716 (0.00094) [2022-07-11 03:56:28,361][25689] Fps is (10 sec: 5471.7, 60 sec: 5509.4, 300 sec: 5539.3). Total num frames: 1049314304. Throughput: 0: 5756.5. Samples: 1049315488. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:28,362][25689] Avg episode reward: [(0, '-0.889')] [2022-07-11 03:56:29,342][26022] Updated weights on worker 0-0, policy_version 1024726 (0.00086) [2022-07-11 03:56:31,026][26022] Updated weights on worker 0-0, policy_version 1024736 (0.00086) [2022-07-11 03:56:32,940][26022] Updated weights on worker 0-0, policy_version 1024746 (0.00083) [2022-07-11 03:56:33,498][25689] Fps is (10 sec: 5627.9, 60 sec: 5506.4, 300 sec: 5533.4). Total num frames: 1049342976. Throughput: 0: 5749.3. Samples: 1049349000. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:33,498][25689] Avg episode reward: [(0, '-0.762')] [2022-07-11 03:56:34,879][26022] Updated weights on worker 0-0, policy_version 1024756 (0.00093) [2022-07-11 03:56:36,509][26022] Updated weights on worker 0-0, policy_version 1024766 (0.00086) [2022-07-11 03:56:38,527][25689] Fps is (10 sec: 5439.3, 60 sec: 5489.2, 300 sec: 5530.3). Total num frames: 1049369600. Throughput: 0: 4949.8. Samples: 1049365744. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:38,527][25689] Avg episode reward: [(0, '0.008')] [2022-07-11 03:56:38,551][26022] Updated weights on worker 0-0, policy_version 1024776 (0.00096) [2022-07-11 03:56:40,238][26022] Updated weights on worker 0-0, policy_version 1024786 (0.00088) [2022-07-11 03:56:42,086][26022] Updated weights on worker 0-0, policy_version 1024796 (0.00094) [2022-07-11 03:56:43,535][25689] Fps is (10 sec: 5509.5, 60 sec: 5493.1, 300 sec: 5537.4). Total num frames: 1049398272. Throughput: 0: 5789.6. Samples: 1049399144. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:43,535][25689] Avg episode reward: [(0, '-0.484')] [2022-07-11 03:56:44,020][26022] Updated weights on worker 0-0, policy_version 1024806 (0.00091) [2022-07-11 03:56:45,848][26022] Updated weights on worker 0-0, policy_version 1024816 (0.00095) [2022-07-11 03:56:47,855][26022] Updated weights on worker 0-0, policy_version 1024826 (0.00340) [2022-07-11 03:56:48,544][25689] Fps is (10 sec: 5724.8, 60 sec: 5510.5, 300 sec: 5538.5). Total num frames: 1049426944. Throughput: 0: 5807.2. Samples: 1049432694. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:48,544][25689] Avg episode reward: [(0, '-0.262')] [2022-07-11 03:56:49,700][26022] Updated weights on worker 0-0, policy_version 1024836 (0.00091) [2022-07-11 03:56:51,335][26022] Updated weights on worker 0-0, policy_version 1024846 (0.00710) [2022-07-11 03:56:53,372][26022] Updated weights on worker 0-0, policy_version 1024856 (0.00090) [2022-07-11 03:56:53,599][25689] Fps is (10 sec: 5494.4, 60 sec: 5510.2, 300 sec: 5530.7). Total num frames: 1049453568. Throughput: 0: 4996.0. Samples: 1049449424. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:53,600][25689] Avg episode reward: [(0, '1.363')] [2022-07-11 03:56:55,129][26022] Updated weights on worker 0-0, policy_version 1024866 (0.00106) [2022-07-11 03:56:57,036][26022] Updated weights on worker 0-0, policy_version 1024876 (0.00081) [2022-07-11 03:56:58,618][25689] Fps is (10 sec: 5488.9, 60 sec: 5510.5, 300 sec: 5534.1). Total num frames: 1049482240. Throughput: 0: 5798.5. Samples: 1049482242. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:56:58,618][25689] Avg episode reward: [(0, '1.153')] [2022-07-11 03:56:58,768][26022] Updated weights on worker 0-0, policy_version 1024886 (0.00082) [2022-07-11 03:57:00,593][26022] Updated weights on worker 0-0, policy_version 1024896 (0.00088) [2022-07-11 03:57:02,578][26022] Updated weights on worker 0-0, policy_version 1024906 (0.00085) [2022-07-11 03:57:03,645][25689] Fps is (10 sec: 5402.2, 60 sec: 5512.0, 300 sec: 5534.0). Total num frames: 1049507840. Throughput: 0: 5712.2. Samples: 1049514018. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:03,647][25689] Avg episode reward: [(0, '1.199')] [2022-07-11 03:57:04,596][26022] Updated weights on worker 0-0, policy_version 1024916 (0.00078) [2022-07-11 03:57:06,412][26022] Updated weights on worker 0-0, policy_version 1024926 (0.00087) [2022-07-11 03:57:08,360][26022] Updated weights on worker 0-0, policy_version 1024936 (0.00092) [2022-07-11 03:57:08,660][25689] Fps is (10 sec: 5302.5, 60 sec: 5512.0, 300 sec: 5529.5). Total num frames: 1049535488. Throughput: 0: 4866.2. Samples: 1049530584. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:08,660][25689] Avg episode reward: [(0, '1.360')] [2022-07-11 03:57:10,126][26022] Updated weights on worker 0-0, policy_version 1024946 (0.00090) [2022-07-11 03:57:12,147][26022] Updated weights on worker 0-0, policy_version 1024956 (0.00079) [2022-07-11 03:57:13,746][25689] Fps is (10 sec: 5575.6, 60 sec: 5510.3, 300 sec: 5531.5). Total num frames: 1049564160. Throughput: 0: 5695.1. Samples: 1049564166. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:13,748][25689] Avg episode reward: [(0, '1.308')] [2022-07-11 03:57:13,970][26022] Updated weights on worker 0-0, policy_version 1024966 (0.00093) [2022-07-11 03:57:15,645][26022] Updated weights on worker 0-0, policy_version 1024976 (0.00087) [2022-07-11 03:57:17,667][26022] Updated weights on worker 0-0, policy_version 1024986 (0.00086) [2022-07-11 03:57:18,802][25689] Fps is (10 sec: 5654.3, 60 sec: 5513.5, 300 sec: 5530.7). Total num frames: 1049592832. Throughput: 0: 5715.3. Samples: 1049597598. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:18,802][25689] Avg episode reward: [(0, '1.732')] [2022-07-11 03:57:19,436][26022] Updated weights on worker 0-0, policy_version 1024996 (0.01092) [2022-07-11 03:57:21,364][26022] Updated weights on worker 0-0, policy_version 1025006 (0.00084) [2022-07-11 03:57:23,008][26022] Updated weights on worker 0-0, policy_version 1025016 (0.00089) [2022-07-11 03:57:23,820][25689] Fps is (10 sec: 5590.5, 60 sec: 5533.6, 300 sec: 5531.0). Total num frames: 1049620480. Throughput: 0: 4961.4. Samples: 1049614116. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:23,822][25689] Avg episode reward: [(0, '1.578')] [2022-07-11 03:57:24,818][26022] Updated weights on worker 0-0, policy_version 1025026 (0.00092) [2022-07-11 03:57:26,863][26022] Updated weights on worker 0-0, policy_version 1025036 (0.00107) [2022-07-11 03:57:28,447][26022] Updated weights on worker 0-0, policy_version 1025046 (0.00090) [2022-07-11 03:57:28,864][25689] Fps is (10 sec: 5495.0, 60 sec: 5517.4, 300 sec: 5527.8). Total num frames: 1049648128. Throughput: 0: 5777.4. Samples: 1049647314. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:28,866][25689] Avg episode reward: [(0, '1.541')] [2022-07-11 03:57:30,523][26022] Updated weights on worker 0-0, policy_version 1025056 (0.00088) [2022-07-11 03:57:32,055][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:57:32,066][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001025065_1049666560.pth [2022-07-11 03:57:32,067][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001023116_1047670784.pth [2022-07-11 03:57:32,368][26022] Updated weights on worker 0-0, policy_version 1025066 (0.00086) [2022-07-11 03:57:33,952][25689] Fps is (10 sec: 5558.6, 60 sec: 5521.9, 300 sec: 5527.7). Total num frames: 1049676800. Throughput: 0: 5777.7. Samples: 1049680912. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:33,953][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 03:57:34,097][26022] Updated weights on worker 0-0, policy_version 1025076 (0.00085) [2022-07-11 03:57:36,010][26022] Updated weights on worker 0-0, policy_version 1025086 (0.00090) [2022-07-11 03:57:37,624][26022] Updated weights on worker 0-0, policy_version 1025096 (0.00090) [2022-07-11 03:57:38,954][25689] Fps is (10 sec: 5582.2, 60 sec: 5541.4, 300 sec: 5527.8). Total num frames: 1049704448. Throughput: 0: 4972.3. Samples: 1049697800. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:38,954][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 03:57:39,794][26022] Updated weights on worker 0-0, policy_version 1025106 (0.00090) [2022-07-11 03:57:41,559][26022] Updated weights on worker 0-0, policy_version 1025116 (0.00085) [2022-07-11 03:57:43,247][26022] Updated weights on worker 0-0, policy_version 1025126 (0.00087) [2022-07-11 03:57:43,984][25689] Fps is (10 sec: 5614.3, 60 sec: 5539.3, 300 sec: 5524.6). Total num frames: 1049733120. Throughput: 0: 5828.0. Samples: 1049731630. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:43,985][25689] Avg episode reward: [(0, '0.220')] [2022-07-11 03:57:45,167][26022] Updated weights on worker 0-0, policy_version 1025136 (0.00093) [2022-07-11 03:57:46,935][26022] Updated weights on worker 0-0, policy_version 1025146 (0.00090) [2022-07-11 03:57:48,834][26022] Updated weights on worker 0-0, policy_version 1025156 (0.00090) [2022-07-11 03:57:49,020][25689] Fps is (10 sec: 5696.3, 60 sec: 5536.8, 300 sec: 5532.4). Total num frames: 1049761792. Throughput: 0: 5845.4. Samples: 1049765134. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:49,021][25689] Avg episode reward: [(0, '0.167')] [2022-07-11 03:57:50,787][26022] Updated weights on worker 0-0, policy_version 1025166 (0.00101) [2022-07-11 03:57:52,477][26022] Updated weights on worker 0-0, policy_version 1025176 (0.00088) [2022-07-11 03:57:54,084][25689] Fps is (10 sec: 5474.8, 60 sec: 5536.0, 300 sec: 5524.6). Total num frames: 1049788416. Throughput: 0: 5007.9. Samples: 1049781730. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:54,085][25689] Avg episode reward: [(0, '-0.425')] [2022-07-11 03:57:54,367][26022] Updated weights on worker 0-0, policy_version 1025186 (0.00103) [2022-07-11 03:57:56,008][26022] Updated weights on worker 0-0, policy_version 1025196 (0.00091) [2022-07-11 03:57:57,962][26022] Updated weights on worker 0-0, policy_version 1025206 (0.00086) [2022-07-11 03:57:59,087][25689] Fps is (10 sec: 5391.3, 60 sec: 5520.5, 300 sec: 5521.4). Total num frames: 1049816064. Throughput: 0: 5841.2. Samples: 1049815404. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:57:59,088][25689] Avg episode reward: [(0, '-0.647')] [2022-07-11 03:57:59,675][26022] Updated weights on worker 0-0, policy_version 1025216 (0.00088) [2022-07-11 03:58:02,247][26022] Updated weights on worker 0-0, policy_version 1025226 (0.00551) [2022-07-11 03:58:03,744][26022] Updated weights on worker 0-0, policy_version 1025236 (0.00074) [2022-07-11 03:58:04,112][25689] Fps is (10 sec: 5514.4, 60 sec: 5554.7, 300 sec: 5531.6). Total num frames: 1049843712. Throughput: 0: 5726.6. Samples: 1049846892. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:04,112][25689] Avg episode reward: [(0, '0.199')] [2022-07-11 03:58:05,799][26022] Updated weights on worker 0-0, policy_version 1025246 (0.01321) [2022-07-11 03:58:07,258][26022] Updated weights on worker 0-0, policy_version 1025256 (0.00087) [2022-07-11 03:58:09,127][25689] Fps is (10 sec: 5405.8, 60 sec: 5537.7, 300 sec: 5525.6). Total num frames: 1049870336. Throughput: 0: 5739.9. Samples: 1049880542. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:09,127][25689] Avg episode reward: [(0, '0.023')] [2022-07-11 03:58:09,416][26022] Updated weights on worker 0-0, policy_version 1025266 (0.00089) [2022-07-11 03:58:11,252][26022] Updated weights on worker 0-0, policy_version 1025276 (0.00087) [2022-07-11 03:58:12,975][26022] Updated weights on worker 0-0, policy_version 1025286 (0.00084) [2022-07-11 03:58:14,172][25689] Fps is (10 sec: 5394.8, 60 sec: 5524.6, 300 sec: 5525.0). Total num frames: 1049897984. Throughput: 0: 5756.4. Samples: 1049897362. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:14,172][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 03:58:14,926][26022] Updated weights on worker 0-0, policy_version 1025296 (0.00084) [2022-07-11 03:58:16,478][26022] Updated weights on worker 0-0, policy_version 1025306 (0.00110) [2022-07-11 03:58:18,415][26022] Updated weights on worker 0-0, policy_version 1025316 (0.00083) [2022-07-11 03:58:19,190][25689] Fps is (10 sec: 5800.2, 60 sec: 5561.9, 300 sec: 5531.8). Total num frames: 1049928704. Throughput: 0: 5763.8. Samples: 1049931272. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:19,190][25689] Avg episode reward: [(0, '1.752')] [2022-07-11 03:58:20,135][26022] Updated weights on worker 0-0, policy_version 1025326 (0.00086) [2022-07-11 03:58:22,022][26022] Updated weights on worker 0-0, policy_version 1025336 (0.00088) [2022-07-11 03:58:23,968][26022] Updated weights on worker 0-0, policy_version 1025346 (0.00090) [2022-07-11 03:58:24,198][25689] Fps is (10 sec: 5617.2, 60 sec: 5528.9, 300 sec: 5524.9). Total num frames: 1049954304. Throughput: 0: 5869.7. Samples: 1049964794. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:24,200][25689] Avg episode reward: [(0, '1.855')] [2022-07-11 03:58:25,719][26022] Updated weights on worker 0-0, policy_version 1025356 (0.00085) [2022-07-11 03:58:27,575][26022] Updated weights on worker 0-0, policy_version 1025366 (0.00088) [2022-07-11 03:58:29,223][25689] Fps is (10 sec: 5409.0, 60 sec: 5547.6, 300 sec: 5529.7). Total num frames: 1049982976. Throughput: 0: 5024.9. Samples: 1049981526. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:29,224][25689] Avg episode reward: [(0, '1.895')] [2022-07-11 03:58:29,523][26022] Updated weights on worker 0-0, policy_version 1025376 (0.00088) [2022-07-11 03:58:31,227][26022] Updated weights on worker 0-0, policy_version 1025386 (0.00087) [2022-07-11 03:58:33,358][26022] Updated weights on worker 0-0, policy_version 1025396 (0.00093) [2022-07-11 03:58:34,319][25689] Fps is (10 sec: 5766.8, 60 sec: 5563.8, 300 sec: 5528.1). Total num frames: 1050012672. Throughput: 0: 5811.9. Samples: 1050014458. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:34,321][25689] Avg episode reward: [(0, '1.736')] [2022-07-11 03:58:34,907][26022] Updated weights on worker 0-0, policy_version 1025406 (0.00081) [2022-07-11 03:58:36,976][26022] Updated weights on worker 0-0, policy_version 1025416 (0.00087) [2022-07-11 03:58:38,687][26022] Updated weights on worker 0-0, policy_version 1025426 (0.00084) [2022-07-11 03:58:39,348][25689] Fps is (10 sec: 5461.8, 60 sec: 5527.4, 300 sec: 5520.9). Total num frames: 1050038272. Throughput: 0: 5799.0. Samples: 1050048168. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:39,348][25689] Avg episode reward: [(0, '2.209')] [2022-07-11 03:58:40,372][26022] Updated weights on worker 0-0, policy_version 1025436 (0.00089) [2022-07-11 03:58:42,366][26022] Updated weights on worker 0-0, policy_version 1025446 (0.00086) [2022-07-11 03:58:43,955][26022] Updated weights on worker 0-0, policy_version 1025456 (0.00086) [2022-07-11 03:58:44,372][25689] Fps is (10 sec: 5602.3, 60 sec: 5561.9, 300 sec: 5530.9). Total num frames: 1050068992. Throughput: 0: 4974.1. Samples: 1050065144. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:44,374][25689] Avg episode reward: [(0, '2.258')] [2022-07-11 03:58:46,081][26022] Updated weights on worker 0-0, policy_version 1025466 (0.00095) [2022-07-11 03:58:47,653][26022] Updated weights on worker 0-0, policy_version 1025476 (0.00086) [2022-07-11 03:58:49,377][25689] Fps is (10 sec: 5717.4, 60 sec: 5530.8, 300 sec: 5529.3). Total num frames: 1050095616. Throughput: 0: 5816.0. Samples: 1050098742. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:49,379][25689] Avg episode reward: [(0, '2.075')] [2022-07-11 03:58:49,759][26022] Updated weights on worker 0-0, policy_version 1025486 (0.00081) [2022-07-11 03:58:51,563][26022] Updated weights on worker 0-0, policy_version 1025496 (0.00083) [2022-07-11 03:58:53,340][26022] Updated weights on worker 0-0, policy_version 1025506 (0.00055) [2022-07-11 03:58:54,428][25689] Fps is (10 sec: 5498.8, 60 sec: 5565.9, 300 sec: 5535.5). Total num frames: 1050124288. Throughput: 0: 5848.7. Samples: 1050132070. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:54,429][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 03:58:55,296][26022] Updated weights on worker 0-0, policy_version 1025516 (0.00088) [2022-07-11 03:58:56,872][26022] Updated weights on worker 0-0, policy_version 1025526 (0.00095) [2022-07-11 03:58:58,999][26022] Updated weights on worker 0-0, policy_version 1025536 (0.00084) [2022-07-11 03:58:59,521][25689] Fps is (10 sec: 5552.3, 60 sec: 5557.7, 300 sec: 5530.8). Total num frames: 1050151936. Throughput: 0: 4996.9. Samples: 1050148972. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:58:59,521][25689] Avg episode reward: [(0, '0.493')] [2022-07-11 03:59:00,501][26022] Updated weights on worker 0-0, policy_version 1025546 (0.00092) [2022-07-11 03:59:03,027][26022] Updated weights on worker 0-0, policy_version 1025556 (0.00088) [2022-07-11 03:59:04,602][25689] Fps is (10 sec: 5334.8, 60 sec: 5535.6, 300 sec: 5529.5). Total num frames: 1050178560. Throughput: 0: 5698.5. Samples: 1050180420. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:04,602][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 03:59:04,736][26022] Updated weights on worker 0-0, policy_version 1025566 (0.00089) [2022-07-11 03:59:06,487][26022] Updated weights on worker 0-0, policy_version 1025576 (0.00088) [2022-07-11 03:59:08,296][26022] Updated weights on worker 0-0, policy_version 1025586 (0.00092) [2022-07-11 03:59:09,607][25689] Fps is (10 sec: 5381.0, 60 sec: 5553.4, 300 sec: 5527.5). Total num frames: 1050206208. Throughput: 0: 5713.2. Samples: 1050214316. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:09,607][25689] Avg episode reward: [(0, '-0.316')] [2022-07-11 03:59:10,203][26022] Updated weights on worker 0-0, policy_version 1025596 (0.01128) [2022-07-11 03:59:11,986][26022] Updated weights on worker 0-0, policy_version 1025606 (0.00086) [2022-07-11 03:59:13,783][26022] Updated weights on worker 0-0, policy_version 1025616 (0.00105) [2022-07-11 03:59:14,735][25689] Fps is (10 sec: 5658.8, 60 sec: 5579.6, 300 sec: 5532.3). Total num frames: 1050235904. Throughput: 0: 4880.9. Samples: 1050231186. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:14,736][25689] Avg episode reward: [(0, '-0.178')] [2022-07-11 03:59:15,571][26022] Updated weights on worker 0-0, policy_version 1025626 (0.00090) [2022-07-11 03:59:17,459][26022] Updated weights on worker 0-0, policy_version 1025636 (0.00084) [2022-07-11 03:59:19,243][26022] Updated weights on worker 0-0, policy_version 1025646 (0.00093) [2022-07-11 03:59:19,767][25689] Fps is (10 sec: 5644.1, 60 sec: 5527.6, 300 sec: 5531.9). Total num frames: 1050263552. Throughput: 0: 5728.8. Samples: 1050264956. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:19,767][25689] Avg episode reward: [(0, '0.218')] [2022-07-11 03:59:21,228][26022] Updated weights on worker 0-0, policy_version 1025656 (0.00089) [2022-07-11 03:59:22,957][26022] Updated weights on worker 0-0, policy_version 1025666 (0.00085) [2022-07-11 03:59:24,799][25689] Fps is (10 sec: 5596.1, 60 sec: 5576.1, 300 sec: 5535.1). Total num frames: 1050292224. Throughput: 0: 5857.0. Samples: 1050298718. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:24,800][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 03:59:24,801][26022] Updated weights on worker 0-0, policy_version 1025676 (0.00086) [2022-07-11 03:59:26,553][26022] Updated weights on worker 0-0, policy_version 1025686 (0.00089) [2022-07-11 03:59:28,473][26022] Updated weights on worker 0-0, policy_version 1025696 (0.00102) [2022-07-11 03:59:29,809][25689] Fps is (10 sec: 5608.4, 60 sec: 5560.7, 300 sec: 5532.6). Total num frames: 1050319872. Throughput: 0: 5010.7. Samples: 1050315542. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:29,809][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 03:59:30,299][26022] Updated weights on worker 0-0, policy_version 1025706 (0.00087) [2022-07-11 03:59:32,092][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 03:59:32,106][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001025716_1050333184.pth [2022-07-11 03:59:32,107][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001023768_1048338432.pth [2022-07-11 03:59:32,110][26022] Updated weights on worker 0-0, policy_version 1025716 (0.00091) [2022-07-11 03:59:33,879][26022] Updated weights on worker 0-0, policy_version 1025726 (0.00090) [2022-07-11 03:59:34,863][25689] Fps is (10 sec: 5596.5, 60 sec: 5547.6, 300 sec: 5531.8). Total num frames: 1050348544. Throughput: 0: 5850.8. Samples: 1050348948. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:34,863][25689] Avg episode reward: [(0, '1.129')] [2022-07-11 03:59:35,823][26022] Updated weights on worker 0-0, policy_version 1025736 (0.00098) [2022-07-11 03:59:37,679][26022] Updated weights on worker 0-0, policy_version 1025746 (0.00088) [2022-07-11 03:59:39,369][26022] Updated weights on worker 0-0, policy_version 1025756 (0.00092) [2022-07-11 03:59:39,886][25689] Fps is (10 sec: 5487.1, 60 sec: 5565.0, 300 sec: 5535.3). Total num frames: 1050375168. Throughput: 0: 5833.7. Samples: 1050382328. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:39,888][25689] Avg episode reward: [(0, '1.206')] [2022-07-11 03:59:41,271][26022] Updated weights on worker 0-0, policy_version 1025766 (0.00089) [2022-07-11 03:59:43,178][26022] Updated weights on worker 0-0, policy_version 1025776 (0.00089) [2022-07-11 03:59:44,919][25689] Fps is (10 sec: 5498.7, 60 sec: 5530.4, 300 sec: 5534.8). Total num frames: 1050403840. Throughput: 0: 4989.4. Samples: 1050399102. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:44,920][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 03:59:45,028][26022] Updated weights on worker 0-0, policy_version 1025786 (0.00091) [2022-07-11 03:59:46,859][26022] Updated weights on worker 0-0, policy_version 1025796 (0.00088) [2022-07-11 03:59:48,623][26022] Updated weights on worker 0-0, policy_version 1025806 (0.00085) [2022-07-11 03:59:49,923][25689] Fps is (10 sec: 5611.7, 60 sec: 5547.5, 300 sec: 5533.6). Total num frames: 1050431488. Throughput: 0: 5820.6. Samples: 1050432616. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:49,924][25689] Avg episode reward: [(0, '0.581')] [2022-07-11 03:59:50,365][26022] Updated weights on worker 0-0, policy_version 1025816 (0.00089) [2022-07-11 03:59:52,607][26022] Updated weights on worker 0-0, policy_version 1025826 (0.00090) [2022-07-11 03:59:54,021][26022] Updated weights on worker 0-0, policy_version 1025836 (0.00084) [2022-07-11 03:59:54,971][25689] Fps is (10 sec: 5704.7, 60 sec: 5564.6, 300 sec: 5537.4). Total num frames: 1050461184. Throughput: 0: 5831.8. Samples: 1050466214. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:54,971][25689] Avg episode reward: [(0, '0.732')] [2022-07-11 03:59:56,186][26022] Updated weights on worker 0-0, policy_version 1025846 (0.00090) [2022-07-11 03:59:57,680][26022] Updated weights on worker 0-0, policy_version 1025856 (0.00083) [2022-07-11 03:59:59,670][26022] Updated weights on worker 0-0, policy_version 1025866 (0.00089) [2022-07-11 03:59:59,994][25689] Fps is (10 sec: 5693.6, 60 sec: 5571.0, 300 sec: 5542.2). Total num frames: 1050488832. Throughput: 0: 5013.9. Samples: 1050483146. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 03:59:59,995][25689] Avg episode reward: [(0, '0.648')] [2022-07-11 04:00:01,477][26022] Updated weights on worker 0-0, policy_version 1025876 (0.00087) [2022-07-11 04:00:03,772][26022] Updated weights on worker 0-0, policy_version 1025886 (0.00086) [2022-07-11 04:00:05,007][25689] Fps is (10 sec: 5305.9, 60 sec: 5560.3, 300 sec: 5533.4). Total num frames: 1050514432. Throughput: 0: 5733.6. Samples: 1050514276. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 04:00:05,009][25689] Avg episode reward: [(0, '0.662')] [2022-07-11 04:00:05,543][26022] Updated weights on worker 0-0, policy_version 1025896 (0.00088) [2022-07-11 04:00:07,550][26022] Updated weights on worker 0-0, policy_version 1025906 (0.00097) [2022-07-11 04:00:09,222][26022] Updated weights on worker 0-0, policy_version 1025916 (0.00085) [2022-07-11 04:00:10,031][25689] Fps is (10 sec: 5305.5, 60 sec: 5558.6, 300 sec: 5539.0). Total num frames: 1050542080. Throughput: 0: 5722.1. Samples: 1050547676. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 04:00:10,031][25689] Avg episode reward: [(0, '-0.059')] [2022-07-11 04:00:11,220][26022] Updated weights on worker 0-0, policy_version 1025926 (0.00085) [2022-07-11 04:00:12,791][26022] Updated weights on worker 0-0, policy_version 1025936 (0.00091) [2022-07-11 04:00:14,780][26022] Updated weights on worker 0-0, policy_version 1025946 (0.00087) [2022-07-11 04:00:15,078][25689] Fps is (10 sec: 5490.3, 60 sec: 5532.1, 300 sec: 5532.1). Total num frames: 1050569728. Throughput: 0: 4887.8. Samples: 1050564494. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 04:00:15,079][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 04:00:16,638][26022] Updated weights on worker 0-0, policy_version 1025956 (0.00081) [2022-07-11 04:00:18,424][26022] Updated weights on worker 0-0, policy_version 1025966 (0.00092) [2022-07-11 04:00:20,087][25689] Fps is (10 sec: 5600.6, 60 sec: 5551.2, 300 sec: 5533.4). Total num frames: 1050598400. Throughput: 0: 5741.4. Samples: 1050598504. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 04:00:20,087][25689] Avg episode reward: [(0, '-0.638')] [2022-07-11 04:00:20,150][26022] Updated weights on worker 0-0, policy_version 1025976 (0.00093) [2022-07-11 04:00:22,001][26022] Updated weights on worker 0-0, policy_version 1025986 (0.01158) [2022-07-11 04:00:23,836][26022] Updated weights on worker 0-0, policy_version 1025996 (0.00088) [2022-07-11 04:00:25,108][25689] Fps is (10 sec: 5615.3, 60 sec: 5535.3, 300 sec: 5534.3). Total num frames: 1050626048. Throughput: 0: 5842.3. Samples: 1050631714. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 04:00:25,108][25689] Avg episode reward: [(0, '-1.040')] [2022-07-11 04:00:25,753][26022] Updated weights on worker 0-0, policy_version 1026006 (0.00095) [2022-07-11 04:00:27,618][26022] Updated weights on worker 0-0, policy_version 1026016 (0.00090) [2022-07-11 04:00:29,375][26022] Updated weights on worker 0-0, policy_version 1026026 (0.00088) [2022-07-11 04:00:30,132][25689] Fps is (10 sec: 5504.6, 60 sec: 5533.9, 300 sec: 5532.4). Total num frames: 1050653696. Throughput: 0: 5004.4. Samples: 1050648272. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:00:30,133][25689] Avg episode reward: [(0, '-1.249')] [2022-07-11 04:00:31,548][26022] Updated weights on worker 0-0, policy_version 1026036 (0.00090) [2022-07-11 04:00:33,186][26022] Updated weights on worker 0-0, policy_version 1026046 (0.00086) [2022-07-11 04:00:35,071][26022] Updated weights on worker 0-0, policy_version 1026056 (0.00096) [2022-07-11 04:00:35,182][25689] Fps is (10 sec: 5590.8, 60 sec: 5534.3, 300 sec: 5535.4). Total num frames: 1050682368. Throughput: 0: 5798.8. Samples: 1050681070. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:00:35,182][25689] Avg episode reward: [(0, '-1.396')] [2022-07-11 04:00:37,030][26022] Updated weights on worker 0-0, policy_version 1026066 (0.00093) [2022-07-11 04:00:38,537][26022] Updated weights on worker 0-0, policy_version 1026076 (0.00083) [2022-07-11 04:00:40,207][25689] Fps is (10 sec: 5488.6, 60 sec: 5534.2, 300 sec: 5528.9). Total num frames: 1050708992. Throughput: 0: 5775.8. Samples: 1050714714. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:00:40,207][25689] Avg episode reward: [(0, '-0.659')] [2022-07-11 04:00:40,776][26022] Updated weights on worker 0-0, policy_version 1026086 (0.00095) [2022-07-11 04:00:42,252][26022] Updated weights on worker 0-0, policy_version 1026096 (0.00092) [2022-07-11 04:00:44,318][26022] Updated weights on worker 0-0, policy_version 1026106 (0.00614) [2022-07-11 04:00:45,225][25689] Fps is (10 sec: 5505.9, 60 sec: 5535.5, 300 sec: 5532.3). Total num frames: 1050737664. Throughput: 0: 4957.6. Samples: 1050731446. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:00:45,225][25689] Avg episode reward: [(0, '-0.726')] [2022-07-11 04:00:45,880][26022] Updated weights on worker 0-0, policy_version 1026116 (0.00055) [2022-07-11 04:00:48,027][26022] Updated weights on worker 0-0, policy_version 1026126 (0.00082) [2022-07-11 04:00:49,635][26022] Updated weights on worker 0-0, policy_version 1026136 (0.00085) [2022-07-11 04:00:50,236][25689] Fps is (10 sec: 5615.4, 60 sec: 5534.8, 300 sec: 5536.5). Total num frames: 1050765312. Throughput: 0: 5796.9. Samples: 1050764814. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:00:50,237][25689] Avg episode reward: [(0, '0.029')] [2022-07-11 04:00:51,813][26022] Updated weights on worker 0-0, policy_version 1026146 (0.00088) [2022-07-11 04:00:53,356][26022] Updated weights on worker 0-0, policy_version 1026156 (0.01368) [2022-07-11 04:00:55,340][25689] Fps is (10 sec: 5568.0, 60 sec: 5512.8, 300 sec: 5535.0). Total num frames: 1050793984. Throughput: 0: 5815.7. Samples: 1050798304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:00:55,340][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 04:00:55,341][26022] Updated weights on worker 0-0, policy_version 1026166 (0.00100) [2022-07-11 04:00:56,920][26022] Updated weights on worker 0-0, policy_version 1026176 (0.00085) [2022-07-11 04:00:59,015][26022] Updated weights on worker 0-0, policy_version 1026186 (0.00083) [2022-07-11 04:01:00,353][25689] Fps is (10 sec: 5870.7, 60 sec: 5564.6, 300 sec: 5552.8). Total num frames: 1050824704. Throughput: 0: 4971.5. Samples: 1050814870. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:00,353][25689] Avg episode reward: [(0, '-0.461')] [2022-07-11 04:01:00,365][26022] Updated weights on worker 0-0, policy_version 1026196 (0.00078) [2022-07-11 04:01:03,139][26022] Updated weights on worker 0-0, policy_version 1026206 (0.00096) [2022-07-11 04:01:04,941][26022] Updated weights on worker 0-0, policy_version 1026216 (0.00696) [2022-07-11 04:01:05,372][25689] Fps is (10 sec: 5307.6, 60 sec: 5513.1, 300 sec: 5535.5). Total num frames: 1050847232. Throughput: 0: 5680.8. Samples: 1050845900. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:05,373][25689] Avg episode reward: [(0, '-0.201')] [2022-07-11 04:01:06,705][26022] Updated weights on worker 0-0, policy_version 1026226 (0.00091) [2022-07-11 04:01:08,479][26022] Updated weights on worker 0-0, policy_version 1026236 (0.00083) [2022-07-11 04:01:10,403][25689] Fps is (10 sec: 4992.8, 60 sec: 5512.5, 300 sec: 5532.7). Total num frames: 1050874880. Throughput: 0: 5679.8. Samples: 1050879356. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:10,404][25689] Avg episode reward: [(0, '-0.224')] [2022-07-11 04:01:10,628][26022] Updated weights on worker 0-0, policy_version 1026246 (0.00085) [2022-07-11 04:01:12,282][26022] Updated weights on worker 0-0, policy_version 1026256 (0.00093) [2022-07-11 04:01:14,289][26022] Updated weights on worker 0-0, policy_version 1026266 (0.00086) [2022-07-11 04:01:15,470][25689] Fps is (10 sec: 5678.8, 60 sec: 5544.6, 300 sec: 5536.6). Total num frames: 1050904576. Throughput: 0: 5685.6. Samples: 1050912756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:15,471][25689] Avg episode reward: [(0, '-0.256')] [2022-07-11 04:01:15,926][26022] Updated weights on worker 0-0, policy_version 1026276 (0.00084) [2022-07-11 04:01:17,967][26022] Updated weights on worker 0-0, policy_version 1026286 (0.00082) [2022-07-11 04:01:19,595][26022] Updated weights on worker 0-0, policy_version 1026296 (0.00088) [2022-07-11 04:01:20,542][25689] Fps is (10 sec: 5453.6, 60 sec: 5488.0, 300 sec: 5532.8). Total num frames: 1050930176. Throughput: 0: 5681.1. Samples: 1050929566. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:20,543][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 04:01:21,543][26022] Updated weights on worker 0-0, policy_version 1026306 (0.00095) [2022-07-11 04:01:23,471][26022] Updated weights on worker 0-0, policy_version 1026316 (0.00057) [2022-07-11 04:01:25,347][26022] Updated weights on worker 0-0, policy_version 1026326 (0.00083) [2022-07-11 04:01:25,579][25689] Fps is (10 sec: 5368.9, 60 sec: 5503.5, 300 sec: 5533.1). Total num frames: 1050958848. Throughput: 0: 5794.1. Samples: 1050962978. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:25,579][25689] Avg episode reward: [(0, '-0.674')] [2022-07-11 04:01:27,171][26022] Updated weights on worker 0-0, policy_version 1026336 (0.00087) [2022-07-11 04:01:29,147][26022] Updated weights on worker 0-0, policy_version 1026346 (0.00090) [2022-07-11 04:01:30,578][26022] Updated weights on worker 0-0, policy_version 1026356 (0.00087) [2022-07-11 04:01:30,601][25689] Fps is (10 sec: 5802.4, 60 sec: 5537.5, 300 sec: 5538.7). Total num frames: 1050988544. Throughput: 0: 5789.1. Samples: 1050996288. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:30,602][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 04:01:32,252][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:01:32,266][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001026363_1050995712.pth [2022-07-11 04:01:32,266][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001024417_1049003008.pth [2022-07-11 04:01:32,730][26022] Updated weights on worker 0-0, policy_version 1026366 (0.00094) [2022-07-11 04:01:34,262][26022] Updated weights on worker 0-0, policy_version 1026376 (0.00094) [2022-07-11 04:01:35,716][25689] Fps is (10 sec: 5555.7, 60 sec: 5497.7, 300 sec: 5537.1). Total num frames: 1051015168. Throughput: 0: 4941.0. Samples: 1051012792. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:35,717][25689] Avg episode reward: [(0, '1.021')] [2022-07-11 04:01:36,656][26022] Updated weights on worker 0-0, policy_version 1026386 (0.00097) [2022-07-11 04:01:38,120][26022] Updated weights on worker 0-0, policy_version 1026396 (0.00083) [2022-07-11 04:01:39,911][26022] Updated weights on worker 0-0, policy_version 1026406 (0.00102) [2022-07-11 04:01:40,722][25689] Fps is (10 sec: 5463.4, 60 sec: 5533.2, 300 sec: 5537.1). Total num frames: 1051043840. Throughput: 0: 5785.6. Samples: 1051046322. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:40,723][25689] Avg episode reward: [(0, '0.945')] [2022-07-11 04:01:41,747][26022] Updated weights on worker 0-0, policy_version 1026416 (0.00082) [2022-07-11 04:01:43,607][26022] Updated weights on worker 0-0, policy_version 1026426 (0.00086) [2022-07-11 04:01:45,543][26022] Updated weights on worker 0-0, policy_version 1026436 (0.00506) [2022-07-11 04:01:45,740][25689] Fps is (10 sec: 5618.7, 60 sec: 5516.4, 300 sec: 5533.5). Total num frames: 1051071488. Throughput: 0: 5802.5. Samples: 1051079962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:45,740][25689] Avg episode reward: [(0, '1.132')] [2022-07-11 04:01:47,336][26022] Updated weights on worker 0-0, policy_version 1026446 (0.00090) [2022-07-11 04:01:49,163][26022] Updated weights on worker 0-0, policy_version 1026456 (0.00086) [2022-07-11 04:01:50,747][25689] Fps is (10 sec: 5618.4, 60 sec: 5533.7, 300 sec: 5541.3). Total num frames: 1051100160. Throughput: 0: 4994.4. Samples: 1051096902. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:50,748][25689] Avg episode reward: [(0, '1.338')] [2022-07-11 04:01:51,082][26022] Updated weights on worker 0-0, policy_version 1026466 (0.00102) [2022-07-11 04:01:52,751][26022] Updated weights on worker 0-0, policy_version 1026476 (0.00084) [2022-07-11 04:01:54,559][26022] Updated weights on worker 0-0, policy_version 1026486 (0.00083) [2022-07-11 04:01:55,835][25689] Fps is (10 sec: 5579.1, 60 sec: 5518.2, 300 sec: 5536.6). Total num frames: 1051127808. Throughput: 0: 5846.6. Samples: 1051130416. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:01:55,835][25689] Avg episode reward: [(0, '2.104')] [2022-07-11 04:01:56,518][26022] Updated weights on worker 0-0, policy_version 1026496 (0.00091) [2022-07-11 04:01:58,315][26022] Updated weights on worker 0-0, policy_version 1026506 (0.00089) [2022-07-11 04:02:00,264][26022] Updated weights on worker 0-0, policy_version 1026516 (0.00088) [2022-07-11 04:02:00,859][25689] Fps is (10 sec: 5468.3, 60 sec: 5466.4, 300 sec: 5543.5). Total num frames: 1051155456. Throughput: 0: 5842.2. Samples: 1051163962. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:00,859][25689] Avg episode reward: [(0, '1.944')] [2022-07-11 04:02:02,239][26022] Updated weights on worker 0-0, policy_version 1026526 (0.00090) [2022-07-11 04:02:04,206][26022] Updated weights on worker 0-0, policy_version 1026536 (0.00091) [2022-07-11 04:02:05,883][25689] Fps is (10 sec: 5401.2, 60 sec: 5533.7, 300 sec: 5539.9). Total num frames: 1051182080. Throughput: 0: 4913.0. Samples: 1051178924. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:05,883][25689] Avg episode reward: [(0, '1.577')] [2022-07-11 04:02:06,000][26022] Updated weights on worker 0-0, policy_version 1026546 (0.00084) [2022-07-11 04:02:07,943][26022] Updated weights on worker 0-0, policy_version 1026556 (0.00088) [2022-07-11 04:02:09,691][26022] Updated weights on worker 0-0, policy_version 1026566 (0.00095) [2022-07-11 04:02:10,899][25689] Fps is (10 sec: 5405.6, 60 sec: 5535.0, 300 sec: 5537.8). Total num frames: 1051209728. Throughput: 0: 5730.1. Samples: 1051212376. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:10,899][25689] Avg episode reward: [(0, '1.593')] [2022-07-11 04:02:11,511][26022] Updated weights on worker 0-0, policy_version 1026576 (0.00093) [2022-07-11 04:02:13,494][26022] Updated weights on worker 0-0, policy_version 1026586 (0.00085) [2022-07-11 04:02:15,081][26022] Updated weights on worker 0-0, policy_version 1026596 (0.00090) [2022-07-11 04:02:15,965][25689] Fps is (10 sec: 5585.8, 60 sec: 5518.2, 300 sec: 5537.6). Total num frames: 1051238400. Throughput: 0: 5719.4. Samples: 1051245552. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:15,966][25689] Avg episode reward: [(0, '1.617')] [2022-07-11 04:02:17,424][26022] Updated weights on worker 0-0, policy_version 1026606 (0.00101) [2022-07-11 04:02:18,928][26022] Updated weights on worker 0-0, policy_version 1026616 (0.00098) [2022-07-11 04:02:20,818][26022] Updated weights on worker 0-0, policy_version 1026626 (0.00090) [2022-07-11 04:02:21,051][25689] Fps is (10 sec: 5648.5, 60 sec: 5567.7, 300 sec: 5539.8). Total num frames: 1051267072. Throughput: 0: 4868.1. Samples: 1051262260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:21,052][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 04:02:22,654][26022] Updated weights on worker 0-0, policy_version 1026636 (0.00081) [2022-07-11 04:02:24,237][26022] Updated weights on worker 0-0, policy_version 1026646 (0.00054) [2022-07-11 04:02:26,078][25689] Fps is (10 sec: 5468.0, 60 sec: 5534.7, 300 sec: 5536.7). Total num frames: 1051293696. Throughput: 0: 5805.6. Samples: 1051296170. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:26,089][25689] Avg episode reward: [(0, '0.463')] [2022-07-11 04:02:26,314][26022] Updated weights on worker 0-0, policy_version 1026656 (0.00086) [2022-07-11 04:02:28,003][26022] Updated weights on worker 0-0, policy_version 1026666 (0.00087) [2022-07-11 04:02:29,850][26022] Updated weights on worker 0-0, policy_version 1026676 (0.00092) [2022-07-11 04:02:31,101][25689] Fps is (10 sec: 5501.9, 60 sec: 5517.7, 300 sec: 5537.9). Total num frames: 1051322368. Throughput: 0: 5814.0. Samples: 1051329834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:31,102][25689] Avg episode reward: [(0, '0.689')] [2022-07-11 04:02:31,640][26022] Updated weights on worker 0-0, policy_version 1026686 (0.00087) [2022-07-11 04:02:33,598][26022] Updated weights on worker 0-0, policy_version 1026696 (0.00085) [2022-07-11 04:02:35,302][26022] Updated weights on worker 0-0, policy_version 1026706 (0.00085) [2022-07-11 04:02:36,212][25689] Fps is (10 sec: 5658.7, 60 sec: 5552.0, 300 sec: 5539.3). Total num frames: 1051351040. Throughput: 0: 4984.6. Samples: 1051346472. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:36,213][25689] Avg episode reward: [(0, '0.811')] [2022-07-11 04:02:37,288][26022] Updated weights on worker 0-0, policy_version 1026716 (0.00094) [2022-07-11 04:02:38,962][26022] Updated weights on worker 0-0, policy_version 1026726 (0.00084) [2022-07-11 04:02:40,909][26022] Updated weights on worker 0-0, policy_version 1026736 (0.00085) [2022-07-11 04:02:41,239][25689] Fps is (10 sec: 5454.6, 60 sec: 5516.3, 300 sec: 5532.4). Total num frames: 1051377664. Throughput: 0: 5828.6. Samples: 1051379928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:41,239][25689] Avg episode reward: [(0, '0.640')] [2022-07-11 04:02:42,589][26022] Updated weights on worker 0-0, policy_version 1026746 (0.00088) [2022-07-11 04:02:44,772][26022] Updated weights on worker 0-0, policy_version 1026756 (0.00092) [2022-07-11 04:02:46,111][26022] Updated weights on worker 0-0, policy_version 1026766 (0.00085) [2022-07-11 04:02:46,261][25689] Fps is (10 sec: 5706.3, 60 sec: 5566.6, 300 sec: 5539.6). Total num frames: 1051408384. Throughput: 0: 5825.2. Samples: 1051413740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:46,262][25689] Avg episode reward: [(0, '0.512')] [2022-07-11 04:02:48,384][26022] Updated weights on worker 0-0, policy_version 1026776 (0.00089) [2022-07-11 04:02:49,766][26022] Updated weights on worker 0-0, policy_version 1026786 (0.00084) [2022-07-11 04:02:51,331][25689] Fps is (10 sec: 5682.3, 60 sec: 5527.0, 300 sec: 5539.5). Total num frames: 1051435008. Throughput: 0: 4978.5. Samples: 1051430546. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:51,331][25689] Avg episode reward: [(0, '1.172')] [2022-07-11 04:02:51,999][26022] Updated weights on worker 0-0, policy_version 1026796 (0.00082) [2022-07-11 04:02:53,669][26022] Updated weights on worker 0-0, policy_version 1026806 (0.00084) [2022-07-11 04:02:55,599][26022] Updated weights on worker 0-0, policy_version 1026816 (0.00087) [2022-07-11 04:02:56,450][25689] Fps is (10 sec: 5427.1, 60 sec: 5541.0, 300 sec: 5540.8). Total num frames: 1051463680. Throughput: 0: 5802.9. Samples: 1051463912. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:02:56,451][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 04:02:57,358][26022] Updated weights on worker 0-0, policy_version 1026826 (0.00090) [2022-07-11 04:02:59,390][26022] Updated weights on worker 0-0, policy_version 1026836 (0.00087) [2022-07-11 04:03:01,073][26022] Updated weights on worker 0-0, policy_version 1026846 (0.00093) [2022-07-11 04:03:01,463][25689] Fps is (10 sec: 5558.5, 60 sec: 5542.1, 300 sec: 5541.0). Total num frames: 1051491328. Throughput: 0: 5815.2. Samples: 1051497534. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:01,463][25689] Avg episode reward: [(0, '1.717')] [2022-07-11 04:03:03,339][26022] Updated weights on worker 0-0, policy_version 1026856 (0.00095) [2022-07-11 04:03:05,223][26022] Updated weights on worker 0-0, policy_version 1026866 (0.00099) [2022-07-11 04:03:06,551][25689] Fps is (10 sec: 5373.1, 60 sec: 5536.2, 300 sec: 5539.6). Total num frames: 1051517952. Throughput: 0: 4855.2. Samples: 1051512250. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:06,551][25689] Avg episode reward: [(0, '1.451')] [2022-07-11 04:03:06,990][26022] Updated weights on worker 0-0, policy_version 1026876 (0.00085) [2022-07-11 04:03:08,910][26022] Updated weights on worker 0-0, policy_version 1026886 (0.00087) [2022-07-11 04:03:11,047][26022] Updated weights on worker 0-0, policy_version 1026896 (0.00094) [2022-07-11 04:03:11,558][25689] Fps is (10 sec: 5274.4, 60 sec: 5520.1, 300 sec: 5536.9). Total num frames: 1051544576. Throughput: 0: 5655.8. Samples: 1051544950. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:11,559][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 04:03:12,548][26022] Updated weights on worker 0-0, policy_version 1026906 (0.00093) [2022-07-11 04:03:14,691][26022] Updated weights on worker 0-0, policy_version 1026916 (0.00098) [2022-07-11 04:03:16,564][26022] Updated weights on worker 0-0, policy_version 1026926 (0.00089) [2022-07-11 04:03:16,652][25689] Fps is (10 sec: 5473.9, 60 sec: 5517.6, 300 sec: 5528.6). Total num frames: 1051573248. Throughput: 0: 5624.4. Samples: 1051577538. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:16,653][25689] Avg episode reward: [(0, '0.521')] [2022-07-11 04:03:18,334][26022] Updated weights on worker 0-0, policy_version 1026936 (0.00090) [2022-07-11 04:03:20,255][26022] Updated weights on worker 0-0, policy_version 1026946 (0.00095) [2022-07-11 04:03:21,668][25689] Fps is (10 sec: 5571.0, 60 sec: 5507.1, 300 sec: 5535.3). Total num frames: 1051600896. Throughput: 0: 4777.2. Samples: 1051594058. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:21,669][25689] Avg episode reward: [(0, '-0.944')] [2022-07-11 04:03:21,894][26022] Updated weights on worker 0-0, policy_version 1026956 (0.00092) [2022-07-11 04:03:23,953][26022] Updated weights on worker 0-0, policy_version 1026966 (0.00087) [2022-07-11 04:03:25,816][26022] Updated weights on worker 0-0, policy_version 1026976 (0.00088) [2022-07-11 04:03:26,674][25689] Fps is (10 sec: 5415.8, 60 sec: 5509.1, 300 sec: 5528.8). Total num frames: 1051627520. Throughput: 0: 5711.1. Samples: 1051627174. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:26,674][25689] Avg episode reward: [(0, '-1.018')] [2022-07-11 04:03:27,483][26022] Updated weights on worker 0-0, policy_version 1026986 (0.00082) [2022-07-11 04:03:29,509][26022] Updated weights on worker 0-0, policy_version 1026996 (0.00086) [2022-07-11 04:03:30,989][26022] Updated weights on worker 0-0, policy_version 1027006 (0.00437) [2022-07-11 04:03:31,708][25689] Fps is (10 sec: 5405.6, 60 sec: 5491.2, 300 sec: 5523.1). Total num frames: 1051655168. Throughput: 0: 5733.9. Samples: 1051660486. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:31,709][25689] Avg episode reward: [(0, '-0.792')] [2022-07-11 04:03:32,363][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:03:32,373][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001027011_1051659264.pth [2022-07-11 04:03:32,377][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001025065_1049666560.pth [2022-07-11 04:03:33,319][26022] Updated weights on worker 0-0, policy_version 1027016 (0.00082) [2022-07-11 04:03:35,219][26022] Updated weights on worker 0-0, policy_version 1027026 (0.00092) [2022-07-11 04:03:36,786][25689] Fps is (10 sec: 5569.3, 60 sec: 5494.1, 300 sec: 5532.5). Total num frames: 1051683840. Throughput: 0: 5769.5. Samples: 1051693700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:36,787][25689] Avg episode reward: [(0, '-1.812')] [2022-07-11 04:03:36,841][26022] Updated weights on worker 0-0, policy_version 1027036 (0.00091) [2022-07-11 04:03:38,756][26022] Updated weights on worker 0-0, policy_version 1027046 (0.00083) [2022-07-11 04:03:40,396][26022] Updated weights on worker 0-0, policy_version 1027056 (0.00091) [2022-07-11 04:03:41,880][25689] Fps is (10 sec: 5637.2, 60 sec: 5521.8, 300 sec: 5524.3). Total num frames: 1051712512. Throughput: 0: 5766.9. Samples: 1051710620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:41,881][25689] Avg episode reward: [(0, '-0.962')] [2022-07-11 04:03:42,385][26022] Updated weights on worker 0-0, policy_version 1027066 (0.00093) [2022-07-11 04:03:44,099][26022] Updated weights on worker 0-0, policy_version 1027076 (0.00084) [2022-07-11 04:03:45,886][26022] Updated weights on worker 0-0, policy_version 1027086 (0.00092) [2022-07-11 04:03:46,909][25689] Fps is (10 sec: 5664.8, 60 sec: 5487.4, 300 sec: 5530.7). Total num frames: 1051741184. Throughput: 0: 5788.2. Samples: 1051744300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:46,909][25689] Avg episode reward: [(0, '-0.254')] [2022-07-11 04:03:47,858][26022] Updated weights on worker 0-0, policy_version 1027096 (0.00093) [2022-07-11 04:03:49,765][26022] Updated weights on worker 0-0, policy_version 1027106 (0.00082) [2022-07-11 04:03:51,493][26022] Updated weights on worker 0-0, policy_version 1027116 (0.00090) [2022-07-11 04:03:51,968][25689] Fps is (10 sec: 5481.6, 60 sec: 5488.4, 300 sec: 5523.7). Total num frames: 1051767808. Throughput: 0: 5783.4. Samples: 1051777658. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:51,968][25689] Avg episode reward: [(0, '0.609')] [2022-07-11 04:03:53,583][26022] Updated weights on worker 0-0, policy_version 1027126 (0.00629) [2022-07-11 04:03:55,231][26022] Updated weights on worker 0-0, policy_version 1027136 (0.00086) [2022-07-11 04:03:57,059][25689] Fps is (10 sec: 5448.0, 60 sec: 5491.0, 300 sec: 5527.2). Total num frames: 1051796480. Throughput: 0: 4946.3. Samples: 1051793974. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:03:57,059][25689] Avg episode reward: [(0, '0.618')] [2022-07-11 04:03:57,088][26022] Updated weights on worker 0-0, policy_version 1027146 (0.00093) [2022-07-11 04:03:59,147][26022] Updated weights on worker 0-0, policy_version 1027156 (0.00096) [2022-07-11 04:04:00,683][26022] Updated weights on worker 0-0, policy_version 1027166 (0.00090) [2022-07-11 04:04:02,122][25689] Fps is (10 sec: 5445.5, 60 sec: 5469.5, 300 sec: 5527.5). Total num frames: 1051823104. Throughput: 0: 5778.8. Samples: 1051827594. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:04:02,123][25689] Avg episode reward: [(0, '-0.352')] [2022-07-11 04:04:03,058][26022] Updated weights on worker 0-0, policy_version 1027176 (0.00087) [2022-07-11 04:04:04,769][26022] Updated weights on worker 0-0, policy_version 1027186 (0.00090) [2022-07-11 04:04:06,735][26022] Updated weights on worker 0-0, policy_version 1027196 (0.00089) [2022-07-11 04:04:07,132][25689] Fps is (10 sec: 5387.6, 60 sec: 5493.4, 300 sec: 5527.4). Total num frames: 1051850752. Throughput: 0: 5661.8. Samples: 1051858800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:04:07,133][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 04:04:08,427][26022] Updated weights on worker 0-0, policy_version 1027206 (0.00107) [2022-07-11 04:04:10,425][26022] Updated weights on worker 0-0, policy_version 1027216 (0.00078) [2022-07-11 04:04:12,159][25689] Fps is (10 sec: 5509.5, 60 sec: 5508.6, 300 sec: 5522.4). Total num frames: 1051878400. Throughput: 0: 4855.0. Samples: 1051875686. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:04:12,159][25689] Avg episode reward: [(0, '0.070')] [2022-07-11 04:04:12,276][26022] Updated weights on worker 0-0, policy_version 1027226 (0.00086) [2022-07-11 04:04:14,155][26022] Updated weights on worker 0-0, policy_version 1027236 (0.00087) [2022-07-11 04:04:15,972][26022] Updated weights on worker 0-0, policy_version 1027246 (0.00087) [2022-07-11 04:04:17,232][25689] Fps is (10 sec: 5474.8, 60 sec: 5493.6, 300 sec: 5521.6). Total num frames: 1051906048. Throughput: 0: 5709.0. Samples: 1051909144. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:04:17,233][25689] Avg episode reward: [(0, '0.197')] [2022-07-11 04:04:17,824][26022] Updated weights on worker 0-0, policy_version 1027256 (0.00095) [2022-07-11 04:04:19,453][26022] Updated weights on worker 0-0, policy_version 1027266 (0.00082) [2022-07-11 04:04:21,332][26022] Updated weights on worker 0-0, policy_version 1027276 (0.00091) [2022-07-11 04:04:22,267][25689] Fps is (10 sec: 5673.0, 60 sec: 5525.6, 300 sec: 5525.0). Total num frames: 1051935744. Throughput: 0: 5709.0. Samples: 1051942600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:04:22,267][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 04:04:23,283][26022] Updated weights on worker 0-0, policy_version 1027286 (0.00088) [2022-07-11 04:04:25,100][26022] Updated weights on worker 0-0, policy_version 1027296 (0.00101) [2022-07-11 04:04:27,068][26022] Updated weights on worker 0-0, policy_version 1027306 (0.00092) [2022-07-11 04:04:27,282][25689] Fps is (10 sec: 5604.0, 60 sec: 5524.8, 300 sec: 5521.5). Total num frames: 1051962368. Throughput: 0: 4976.3. Samples: 1051959072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:27,283][25689] Avg episode reward: [(0, '-0.005')] [2022-07-11 04:04:28,835][26022] Updated weights on worker 0-0, policy_version 1027316 (0.00083) [2022-07-11 04:04:30,738][26022] Updated weights on worker 0-0, policy_version 1027326 (0.00083) [2022-07-11 04:04:32,311][25689] Fps is (10 sec: 5403.4, 60 sec: 5525.2, 300 sec: 5518.5). Total num frames: 1051990016. Throughput: 0: 5789.1. Samples: 1051992350. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:32,311][25689] Avg episode reward: [(0, '-0.232')] [2022-07-11 04:04:32,526][26022] Updated weights on worker 0-0, policy_version 1027336 (0.00085) [2022-07-11 04:04:34,342][26022] Updated weights on worker 0-0, policy_version 1027346 (0.00081) [2022-07-11 04:04:36,236][26022] Updated weights on worker 0-0, policy_version 1027356 (0.00094) [2022-07-11 04:04:37,368][25689] Fps is (10 sec: 5584.1, 60 sec: 5527.2, 300 sec: 5524.8). Total num frames: 1052018688. Throughput: 0: 5783.9. Samples: 1052025608. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:37,369][25689] Avg episode reward: [(0, '-0.205')] [2022-07-11 04:04:38,074][26022] Updated weights on worker 0-0, policy_version 1027366 (0.00080) [2022-07-11 04:04:39,840][26022] Updated weights on worker 0-0, policy_version 1027376 (0.00098) [2022-07-11 04:04:41,731][26022] Updated weights on worker 0-0, policy_version 1027386 (0.00094) [2022-07-11 04:04:42,396][25689] Fps is (10 sec: 5584.3, 60 sec: 5516.3, 300 sec: 5521.4). Total num frames: 1052046336. Throughput: 0: 4956.3. Samples: 1052042368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:42,397][25689] Avg episode reward: [(0, '-1.700')] [2022-07-11 04:04:43,507][26022] Updated weights on worker 0-0, policy_version 1027396 (0.00088) [2022-07-11 04:04:45,478][26022] Updated weights on worker 0-0, policy_version 1027406 (0.00084) [2022-07-11 04:04:47,248][26022] Updated weights on worker 0-0, policy_version 1027416 (0.00087) [2022-07-11 04:04:47,408][25689] Fps is (10 sec: 5507.6, 60 sec: 5500.9, 300 sec: 5521.3). Total num frames: 1052073984. Throughput: 0: 5799.5. Samples: 1052075792. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:47,409][25689] Avg episode reward: [(0, '-2.448')] [2022-07-11 04:04:49,082][26022] Updated weights on worker 0-0, policy_version 1027426 (0.00091) [2022-07-11 04:04:51,035][26022] Updated weights on worker 0-0, policy_version 1027436 (0.00096) [2022-07-11 04:04:52,505][25689] Fps is (10 sec: 5571.4, 60 sec: 5531.3, 300 sec: 5516.9). Total num frames: 1052102656. Throughput: 0: 5797.7. Samples: 1052109430. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:52,506][25689] Avg episode reward: [(0, '-2.530')] [2022-07-11 04:04:52,583][26022] Updated weights on worker 0-0, policy_version 1027446 (0.00084) [2022-07-11 04:04:54,696][26022] Updated weights on worker 0-0, policy_version 1027456 (0.00087) [2022-07-11 04:04:56,343][26022] Updated weights on worker 0-0, policy_version 1027466 (0.00086) [2022-07-11 04:04:57,619][25689] Fps is (10 sec: 5415.6, 60 sec: 5495.4, 300 sec: 5511.8). Total num frames: 1052129280. Throughput: 0: 4953.9. Samples: 1052125926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:04:57,620][25689] Avg episode reward: [(0, '-1.260')] [2022-07-11 04:04:58,299][26022] Updated weights on worker 0-0, policy_version 1027476 (0.00085) [2022-07-11 04:05:00,323][26022] Updated weights on worker 0-0, policy_version 1027486 (0.00092) [2022-07-11 04:05:02,083][26022] Updated weights on worker 0-0, policy_version 1027496 (0.00085) [2022-07-11 04:05:02,714][25689] Fps is (10 sec: 5416.6, 60 sec: 5526.3, 300 sec: 5520.6). Total num frames: 1052157952. Throughput: 0: 5762.4. Samples: 1052159444. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:02,715][25689] Avg episode reward: [(0, '-1.352')] [2022-07-11 04:05:04,374][26022] Updated weights on worker 0-0, policy_version 1027506 (0.00088) [2022-07-11 04:05:06,033][26022] Updated weights on worker 0-0, policy_version 1027516 (0.00088) [2022-07-11 04:05:07,752][25689] Fps is (10 sec: 5557.9, 60 sec: 5523.7, 300 sec: 5520.3). Total num frames: 1052185600. Throughput: 0: 5653.2. Samples: 1052190800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:07,752][25689] Avg episode reward: [(0, '-0.462')] [2022-07-11 04:05:07,979][26022] Updated weights on worker 0-0, policy_version 1027526 (0.00091) [2022-07-11 04:05:09,795][26022] Updated weights on worker 0-0, policy_version 1027536 (0.00084) [2022-07-11 04:05:11,591][26022] Updated weights on worker 0-0, policy_version 1027546 (0.00084) [2022-07-11 04:05:12,815][25689] Fps is (10 sec: 5372.6, 60 sec: 5503.5, 300 sec: 5516.6). Total num frames: 1052212224. Throughput: 0: 4819.7. Samples: 1052207320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:12,816][25689] Avg episode reward: [(0, '0.841')] [2022-07-11 04:05:13,557][26022] Updated weights on worker 0-0, policy_version 1027556 (0.00087) [2022-07-11 04:05:15,253][26022] Updated weights on worker 0-0, policy_version 1027566 (0.00089) [2022-07-11 04:05:17,302][26022] Updated weights on worker 0-0, policy_version 1027576 (0.00091) [2022-07-11 04:05:17,891][25689] Fps is (10 sec: 5453.9, 60 sec: 5520.2, 300 sec: 5515.4). Total num frames: 1052240896. Throughput: 0: 5653.5. Samples: 1052240534. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:17,891][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 04:05:19,002][26022] Updated weights on worker 0-0, policy_version 1027586 (0.00087) [2022-07-11 04:05:21,025][26022] Updated weights on worker 0-0, policy_version 1027596 (0.00089) [2022-07-11 04:05:22,722][26022] Updated weights on worker 0-0, policy_version 1027606 (0.00096) [2022-07-11 04:05:22,906][25689] Fps is (10 sec: 5682.8, 60 sec: 5505.1, 300 sec: 5518.9). Total num frames: 1052269568. Throughput: 0: 5664.9. Samples: 1052273832. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:22,907][25689] Avg episode reward: [(0, '0.867')] [2022-07-11 04:05:24,820][26022] Updated weights on worker 0-0, policy_version 1027616 (0.00088) [2022-07-11 04:05:26,399][26022] Updated weights on worker 0-0, policy_version 1027626 (0.00083) [2022-07-11 04:05:27,954][25689] Fps is (10 sec: 5494.8, 60 sec: 5502.1, 300 sec: 5515.0). Total num frames: 1052296192. Throughput: 0: 4942.1. Samples: 1052290640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:27,955][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 04:05:28,377][26022] Updated weights on worker 0-0, policy_version 1027636 (0.00086) [2022-07-11 04:05:30,384][26022] Updated weights on worker 0-0, policy_version 1027646 (0.00094) [2022-07-11 04:05:32,123][26022] Updated weights on worker 0-0, policy_version 1027656 (0.00084) [2022-07-11 04:05:32,389][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:05:32,412][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001027658_1052321792.pth [2022-07-11 04:05:32,413][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001025716_1050333184.pth [2022-07-11 04:05:32,959][25689] Fps is (10 sec: 5398.7, 60 sec: 5504.3, 300 sec: 5512.4). Total num frames: 1052323840. Throughput: 0: 5770.5. Samples: 1052323558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:32,959][25689] Avg episode reward: [(0, '0.677')] [2022-07-11 04:05:33,923][26022] Updated weights on worker 0-0, policy_version 1027666 (0.00086) [2022-07-11 04:05:35,895][26022] Updated weights on worker 0-0, policy_version 1027676 (0.00101) [2022-07-11 04:05:37,683][26022] Updated weights on worker 0-0, policy_version 1027686 (0.00088) [2022-07-11 04:05:38,040][25689] Fps is (10 sec: 5584.0, 60 sec: 5502.1, 300 sec: 5518.3). Total num frames: 1052352512. Throughput: 0: 5763.7. Samples: 1052356670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:38,041][25689] Avg episode reward: [(0, '0.752')] [2022-07-11 04:05:39,615][26022] Updated weights on worker 0-0, policy_version 1027696 (0.00093) [2022-07-11 04:05:41,402][26022] Updated weights on worker 0-0, policy_version 1027706 (0.00087) [2022-07-11 04:05:43,046][25689] Fps is (10 sec: 5583.4, 60 sec: 5504.1, 300 sec: 5515.0). Total num frames: 1052380160. Throughput: 0: 4929.4. Samples: 1052373110. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:43,046][25689] Avg episode reward: [(0, '1.148')] [2022-07-11 04:05:43,194][26022] Updated weights on worker 0-0, policy_version 1027716 (0.00087) [2022-07-11 04:05:45,070][26022] Updated weights on worker 0-0, policy_version 1027726 (0.00099) [2022-07-11 04:05:47,043][26022] Updated weights on worker 0-0, policy_version 1027736 (0.00079) [2022-07-11 04:05:48,117][25689] Fps is (10 sec: 5386.0, 60 sec: 5481.9, 300 sec: 5510.5). Total num frames: 1052406784. Throughput: 0: 5728.0. Samples: 1052406132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:48,117][25689] Avg episode reward: [(0, '1.096')] [2022-07-11 04:05:48,819][26022] Updated weights on worker 0-0, policy_version 1027746 (0.00089) [2022-07-11 04:05:50,637][26022] Updated weights on worker 0-0, policy_version 1027756 (0.00089) [2022-07-11 04:05:52,522][26022] Updated weights on worker 0-0, policy_version 1027766 (0.00081) [2022-07-11 04:05:53,138][25689] Fps is (10 sec: 5377.7, 60 sec: 5471.9, 300 sec: 5508.6). Total num frames: 1052434432. Throughput: 0: 5736.2. Samples: 1052439308. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:53,140][25689] Avg episode reward: [(0, '1.305')] [2022-07-11 04:05:54,443][26022] Updated weights on worker 0-0, policy_version 1027776 (0.00092) [2022-07-11 04:05:56,342][26022] Updated weights on worker 0-0, policy_version 1027786 (0.00093) [2022-07-11 04:05:58,035][26022] Updated weights on worker 0-0, policy_version 1027796 (0.00082) [2022-07-11 04:05:58,190][25689] Fps is (10 sec: 5692.7, 60 sec: 5528.2, 300 sec: 5504.4). Total num frames: 1052464128. Throughput: 0: 5753.6. Samples: 1052472604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:05:58,191][25689] Avg episode reward: [(0, '1.374')] [2022-07-11 04:05:59,840][26022] Updated weights on worker 0-0, policy_version 1027806 (0.00090) [2022-07-11 04:06:02,087][26022] Updated weights on worker 0-0, policy_version 1027816 (0.00074) [2022-07-11 04:06:03,212][25689] Fps is (10 sec: 5489.0, 60 sec: 5484.1, 300 sec: 5514.7). Total num frames: 1052489728. Throughput: 0: 5767.9. Samples: 1052489424. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:03,214][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 04:06:04,011][26022] Updated weights on worker 0-0, policy_version 1027826 (0.00092) [2022-07-11 04:06:05,910][26022] Updated weights on worker 0-0, policy_version 1027836 (0.00089) [2022-07-11 04:06:07,470][26022] Updated weights on worker 0-0, policy_version 1027846 (0.00086) [2022-07-11 04:06:08,272][25689] Fps is (10 sec: 5281.7, 60 sec: 5482.1, 300 sec: 5514.2). Total num frames: 1052517376. Throughput: 0: 5685.8. Samples: 1052520728. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:08,273][25689] Avg episode reward: [(0, '0.150')] [2022-07-11 04:06:09,655][26022] Updated weights on worker 0-0, policy_version 1027856 (0.00086) [2022-07-11 04:06:11,199][26022] Updated weights on worker 0-0, policy_version 1027866 (0.00091) [2022-07-11 04:06:13,099][26022] Updated weights on worker 0-0, policy_version 1027876 (0.00079) [2022-07-11 04:06:13,374][25689] Fps is (10 sec: 5441.6, 60 sec: 5495.5, 300 sec: 5506.6). Total num frames: 1052545024. Throughput: 0: 5672.2. Samples: 1052554088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:13,375][25689] Avg episode reward: [(0, '0.056')] [2022-07-11 04:06:14,972][26022] Updated weights on worker 0-0, policy_version 1027886 (0.00107) [2022-07-11 04:06:16,689][26022] Updated weights on worker 0-0, policy_version 1027896 (0.00092) [2022-07-11 04:06:18,489][25689] Fps is (10 sec: 5512.1, 60 sec: 5491.9, 300 sec: 5516.1). Total num frames: 1052573696. Throughput: 0: 4842.0. Samples: 1052570892. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:18,491][25689] Avg episode reward: [(0, '-0.249')] [2022-07-11 04:06:18,872][26022] Updated weights on worker 0-0, policy_version 1027906 (0.00096) [2022-07-11 04:06:20,574][26022] Updated weights on worker 0-0, policy_version 1027916 (0.00098) [2022-07-11 04:06:22,235][26022] Updated weights on worker 0-0, policy_version 1027926 (0.00081) [2022-07-11 04:06:23,515][25689] Fps is (10 sec: 5553.3, 60 sec: 5474.0, 300 sec: 5512.9). Total num frames: 1052601344. Throughput: 0: 5655.7. Samples: 1052604254. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:23,517][25689] Avg episode reward: [(0, '-0.349')] [2022-07-11 04:06:24,192][26022] Updated weights on worker 0-0, policy_version 1027936 (0.00086) [2022-07-11 04:06:26,149][26022] Updated weights on worker 0-0, policy_version 1027946 (0.00093) [2022-07-11 04:06:28,034][26022] Updated weights on worker 0-0, policy_version 1027956 (0.00095) [2022-07-11 04:06:28,526][25689] Fps is (10 sec: 5713.4, 60 sec: 5528.1, 300 sec: 5513.1). Total num frames: 1052631040. Throughput: 0: 5763.7. Samples: 1052637468. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:28,528][25689] Avg episode reward: [(0, '-1.483')] [2022-07-11 04:06:29,806][26022] Updated weights on worker 0-0, policy_version 1027966 (0.00087) [2022-07-11 04:06:31,583][26022] Updated weights on worker 0-0, policy_version 1027976 (0.00093) [2022-07-11 04:06:33,529][26022] Updated weights on worker 0-0, policy_version 1027986 (0.00088) [2022-07-11 04:06:33,544][25689] Fps is (10 sec: 5615.7, 60 sec: 5510.0, 300 sec: 5514.9). Total num frames: 1052657664. Throughput: 0: 4960.2. Samples: 1052654138. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:33,545][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 04:06:35,339][26022] Updated weights on worker 0-0, policy_version 1027996 (0.00081) [2022-07-11 04:06:37,120][26022] Updated weights on worker 0-0, policy_version 1028006 (0.00085) [2022-07-11 04:06:38,614][25689] Fps is (10 sec: 5481.4, 60 sec: 5511.0, 300 sec: 5513.7). Total num frames: 1052686336. Throughput: 0: 5803.3. Samples: 1052687682. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:38,614][25689] Avg episode reward: [(0, '-0.501')] [2022-07-11 04:06:39,187][26022] Updated weights on worker 0-0, policy_version 1028016 (0.00093) [2022-07-11 04:06:40,762][26022] Updated weights on worker 0-0, policy_version 1028026 (0.00094) [2022-07-11 04:06:42,763][26022] Updated weights on worker 0-0, policy_version 1028036 (0.00087) [2022-07-11 04:06:43,647][25689] Fps is (10 sec: 5676.0, 60 sec: 5525.4, 300 sec: 5516.9). Total num frames: 1052715008. Throughput: 0: 5802.2. Samples: 1052721062. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:43,648][25689] Avg episode reward: [(0, '-0.485')] [2022-07-11 04:06:44,568][26022] Updated weights on worker 0-0, policy_version 1028046 (0.00085) [2022-07-11 04:06:46,428][26022] Updated weights on worker 0-0, policy_version 1028056 (0.00083) [2022-07-11 04:06:48,324][26022] Updated weights on worker 0-0, policy_version 1028066 (0.00083) [2022-07-11 04:06:48,705][25689] Fps is (10 sec: 5378.3, 60 sec: 5509.7, 300 sec: 5505.6). Total num frames: 1052740608. Throughput: 0: 4959.3. Samples: 1052737540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:48,705][25689] Avg episode reward: [(0, '0.125')] [2022-07-11 04:06:50,214][26022] Updated weights on worker 0-0, policy_version 1028076 (0.00084) [2022-07-11 04:06:52,001][26022] Updated weights on worker 0-0, policy_version 1028086 (0.00090) [2022-07-11 04:06:53,674][26022] Updated weights on worker 0-0, policy_version 1028096 (0.00088) [2022-07-11 04:06:53,769][25689] Fps is (10 sec: 5462.8, 60 sec: 5539.6, 300 sec: 5512.9). Total num frames: 1052770304. Throughput: 0: 5787.6. Samples: 1052771192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:53,770][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 04:06:55,667][26022] Updated weights on worker 0-0, policy_version 1028106 (0.00087) [2022-07-11 04:06:57,394][26022] Updated weights on worker 0-0, policy_version 1028116 (0.00087) [2022-07-11 04:06:58,834][25689] Fps is (10 sec: 5661.3, 60 sec: 5504.7, 300 sec: 5512.2). Total num frames: 1052797952. Throughput: 0: 5792.0. Samples: 1052804796. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:06:58,834][25689] Avg episode reward: [(0, '1.579')] [2022-07-11 04:06:59,341][26022] Updated weights on worker 0-0, policy_version 1028126 (0.00083) [2022-07-11 04:07:01,059][26022] Updated weights on worker 0-0, policy_version 1028136 (0.00089) [2022-07-11 04:07:03,220][26022] Updated weights on worker 0-0, policy_version 1028146 (0.00080) [2022-07-11 04:07:03,880][25689] Fps is (10 sec: 5266.6, 60 sec: 5502.5, 300 sec: 5508.3). Total num frames: 1052823552. Throughput: 0: 4968.1. Samples: 1052821580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:03,880][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 04:07:05,144][26022] Updated weights on worker 0-0, policy_version 1028156 (0.00085) [2022-07-11 04:07:07,100][26022] Updated weights on worker 0-0, policy_version 1028166 (0.00091) [2022-07-11 04:07:08,769][26022] Updated weights on worker 0-0, policy_version 1028176 (0.00087) [2022-07-11 04:07:08,963][25689] Fps is (10 sec: 5357.9, 60 sec: 5517.2, 300 sec: 5510.5). Total num frames: 1052852224. Throughput: 0: 5700.4. Samples: 1052853022. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:08,964][25689] Avg episode reward: [(0, '1.937')] [2022-07-11 04:07:10,809][26022] Updated weights on worker 0-0, policy_version 1028186 (0.00090) [2022-07-11 04:07:12,447][26022] Updated weights on worker 0-0, policy_version 1028196 (0.00088) [2022-07-11 04:07:14,027][25689] Fps is (10 sec: 5550.4, 60 sec: 5520.7, 300 sec: 5507.2). Total num frames: 1052879872. Throughput: 0: 5679.5. Samples: 1052886244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:14,027][25689] Avg episode reward: [(0, '2.130')] [2022-07-11 04:07:14,357][26022] Updated weights on worker 0-0, policy_version 1028206 (0.00074) [2022-07-11 04:07:16,330][26022] Updated weights on worker 0-0, policy_version 1028216 (0.00089) [2022-07-11 04:07:17,921][26022] Updated weights on worker 0-0, policy_version 1028226 (0.00082) [2022-07-11 04:07:19,187][25689] Fps is (10 sec: 5608.8, 60 sec: 5533.5, 300 sec: 5509.2). Total num frames: 1052909568. Throughput: 0: 4831.5. Samples: 1052903132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:19,189][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 04:07:19,996][26022] Updated weights on worker 0-0, policy_version 1028236 (0.00085) [2022-07-11 04:07:21,496][26022] Updated weights on worker 0-0, policy_version 1028246 (0.00088) [2022-07-11 04:07:23,629][26022] Updated weights on worker 0-0, policy_version 1028256 (0.00091) [2022-07-11 04:07:24,207][25689] Fps is (10 sec: 5632.9, 60 sec: 5534.1, 300 sec: 5512.8). Total num frames: 1052937216. Throughput: 0: 5670.0. Samples: 1052936834. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:24,207][25689] Avg episode reward: [(0, '0.927')] [2022-07-11 04:07:25,183][26022] Updated weights on worker 0-0, policy_version 1028266 (0.00089) [2022-07-11 04:07:27,372][26022] Updated weights on worker 0-0, policy_version 1028276 (0.00084) [2022-07-11 04:07:28,755][26022] Updated weights on worker 0-0, policy_version 1028286 (0.00093) [2022-07-11 04:07:29,259][25689] Fps is (10 sec: 5693.5, 60 sec: 5530.3, 300 sec: 5515.7). Total num frames: 1052966912. Throughput: 0: 5782.4. Samples: 1052970380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:29,259][25689] Avg episode reward: [(0, '-0.762')] [2022-07-11 04:07:30,982][26022] Updated weights on worker 0-0, policy_version 1028296 (0.00088) [2022-07-11 04:07:32,444][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:07:32,457][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001028306_1052985344.pth [2022-07-11 04:07:32,457][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001026363_1050995712.pth [2022-07-11 04:07:32,467][26022] Updated weights on worker 0-0, policy_version 1028306 (0.00086) [2022-07-11 04:07:34,308][25689] Fps is (10 sec: 5677.0, 60 sec: 5544.4, 300 sec: 5513.4). Total num frames: 1052994560. Throughput: 0: 4980.0. Samples: 1052987244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:34,308][25689] Avg episode reward: [(0, '-1.812')] [2022-07-11 04:07:34,589][26022] Updated weights on worker 0-0, policy_version 1028316 (0.00075) [2022-07-11 04:07:36,244][26022] Updated weights on worker 0-0, policy_version 1028326 (0.00085) [2022-07-11 04:07:38,077][26022] Updated weights on worker 0-0, policy_version 1028336 (0.00085) [2022-07-11 04:07:39,379][25689] Fps is (10 sec: 5464.1, 60 sec: 5527.4, 300 sec: 5516.0). Total num frames: 1053022208. Throughput: 0: 5819.0. Samples: 1053020628. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:39,379][25689] Avg episode reward: [(0, '-1.825')] [2022-07-11 04:07:40,185][26022] Updated weights on worker 0-0, policy_version 1028346 (0.00752) [2022-07-11 04:07:41,699][26022] Updated weights on worker 0-0, policy_version 1028356 (0.00101) [2022-07-11 04:07:43,707][26022] Updated weights on worker 0-0, policy_version 1028366 (0.00087) [2022-07-11 04:07:44,391][25689] Fps is (10 sec: 5484.2, 60 sec: 5512.5, 300 sec: 5505.9). Total num frames: 1053049856. Throughput: 0: 5806.6. Samples: 1053054034. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:44,391][25689] Avg episode reward: [(0, '-1.082')] [2022-07-11 04:07:45,625][26022] Updated weights on worker 0-0, policy_version 1028376 (0.00089) [2022-07-11 04:07:47,327][26022] Updated weights on worker 0-0, policy_version 1028386 (0.00086) [2022-07-11 04:07:49,328][26022] Updated weights on worker 0-0, policy_version 1028396 (0.00091) [2022-07-11 04:07:49,433][25689] Fps is (10 sec: 5499.5, 60 sec: 5547.5, 300 sec: 5509.8). Total num frames: 1053077504. Throughput: 0: 5811.6. Samples: 1053087626. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:49,434][25689] Avg episode reward: [(0, '-0.887')] [2022-07-11 04:07:50,933][26022] Updated weights on worker 0-0, policy_version 1028406 (0.00087) [2022-07-11 04:07:53,024][26022] Updated weights on worker 0-0, policy_version 1028416 (0.00088) [2022-07-11 04:07:54,466][25689] Fps is (10 sec: 5691.7, 60 sec: 5550.5, 300 sec: 5514.9). Total num frames: 1053107200. Throughput: 0: 5814.9. Samples: 1053104460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:54,466][25689] Avg episode reward: [(0, '-0.041')] [2022-07-11 04:07:54,634][26022] Updated weights on worker 0-0, policy_version 1028426 (0.00085) [2022-07-11 04:07:56,570][26022] Updated weights on worker 0-0, policy_version 1028436 (0.00086) [2022-07-11 04:07:58,322][26022] Updated weights on worker 0-0, policy_version 1028446 (0.00084) [2022-07-11 04:07:59,555][25689] Fps is (10 sec: 5665.2, 60 sec: 5548.2, 300 sec: 5513.5). Total num frames: 1053134848. Throughput: 0: 5812.4. Samples: 1053137904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:07:59,556][25689] Avg episode reward: [(0, '1.799')] [2022-07-11 04:08:00,187][26022] Updated weights on worker 0-0, policy_version 1028456 (0.00083) [2022-07-11 04:08:02,454][26022] Updated weights on worker 0-0, policy_version 1028466 (0.00091) [2022-07-11 04:08:04,167][26022] Updated weights on worker 0-0, policy_version 1028476 (0.00088) [2022-07-11 04:08:04,571][25689] Fps is (10 sec: 5269.4, 60 sec: 5551.0, 300 sec: 5511.4). Total num frames: 1053160448. Throughput: 0: 5727.6. Samples: 1053169618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:08:04,572][25689] Avg episode reward: [(0, '1.864')] [2022-07-11 04:08:06,014][26022] Updated weights on worker 0-0, policy_version 1028486 (0.00109) [2022-07-11 04:08:07,735][26022] Updated weights on worker 0-0, policy_version 1028496 (0.00088) [2022-07-11 04:08:09,608][25689] Fps is (10 sec: 5296.7, 60 sec: 5538.3, 300 sec: 5514.2). Total num frames: 1053188096. Throughput: 0: 4897.0. Samples: 1053186424. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:08:09,609][25689] Avg episode reward: [(0, '1.269')] [2022-07-11 04:08:09,772][26022] Updated weights on worker 0-0, policy_version 1028506 (0.00086) [2022-07-11 04:08:11,451][26022] Updated weights on worker 0-0, policy_version 1028516 (0.00096) [2022-07-11 04:08:13,435][26022] Updated weights on worker 0-0, policy_version 1028526 (0.00087) [2022-07-11 04:08:14,629][25689] Fps is (10 sec: 5599.5, 60 sec: 5559.1, 300 sec: 5515.6). Total num frames: 1053216768. Throughput: 0: 5732.7. Samples: 1053220050. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:08:14,630][25689] Avg episode reward: [(0, '1.363')] [2022-07-11 04:08:15,315][26022] Updated weights on worker 0-0, policy_version 1028536 (0.00089) [2022-07-11 04:08:16,998][26022] Updated weights on worker 0-0, policy_version 1028546 (0.01606) [2022-07-11 04:08:18,958][26022] Updated weights on worker 0-0, policy_version 1028556 (0.00087) [2022-07-11 04:08:19,738][25689] Fps is (10 sec: 5661.4, 60 sec: 5547.0, 300 sec: 5517.3). Total num frames: 1053245440. Throughput: 0: 5731.5. Samples: 1053253578. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:08:19,739][25689] Avg episode reward: [(0, '0.483')] [2022-07-11 04:08:20,627][26022] Updated weights on worker 0-0, policy_version 1028566 (0.00088) [2022-07-11 04:08:22,522][26022] Updated weights on worker 0-0, policy_version 1028576 (0.00081) [2022-07-11 04:08:24,628][26022] Updated weights on worker 0-0, policy_version 1028586 (0.00097) [2022-07-11 04:08:24,799][25689] Fps is (10 sec: 5537.9, 60 sec: 5543.2, 300 sec: 5519.7). Total num frames: 1053273088. Throughput: 0: 4993.1. Samples: 1053270618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 04:08:24,799][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 04:08:26,196][26022] Updated weights on worker 0-0, policy_version 1028596 (0.00090) [2022-07-11 04:08:28,096][26022] Updated weights on worker 0-0, policy_version 1028606 (0.00091) [2022-07-11 04:08:29,667][26022] Updated weights on worker 0-0, policy_version 1028616 (0.00090) [2022-07-11 04:08:29,859][25689] Fps is (10 sec: 5665.5, 60 sec: 5542.4, 300 sec: 5526.1). Total num frames: 1053302784. Throughput: 0: 5803.6. Samples: 1053303950. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:08:29,860][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 04:08:31,731][26022] Updated weights on worker 0-0, policy_version 1028626 (0.00083) [2022-07-11 04:08:33,635][26022] Updated weights on worker 0-0, policy_version 1028636 (0.00086) [2022-07-11 04:08:34,960][25689] Fps is (10 sec: 5643.6, 60 sec: 5537.7, 300 sec: 5522.3). Total num frames: 1053330432. Throughput: 0: 5773.4. Samples: 1053337428. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:08:34,960][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 04:08:35,507][26022] Updated weights on worker 0-0, policy_version 1028646 (0.00082) [2022-07-11 04:08:37,059][26022] Updated weights on worker 0-0, policy_version 1028656 (0.00083) [2022-07-11 04:08:39,282][26022] Updated weights on worker 0-0, policy_version 1028666 (0.00082) [2022-07-11 04:08:40,035][25689] Fps is (10 sec: 5534.9, 60 sec: 5554.2, 300 sec: 5522.6). Total num frames: 1053359104. Throughput: 0: 4959.7. Samples: 1053354246. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:08:40,035][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 04:08:41,051][26022] Updated weights on worker 0-0, policy_version 1028676 (0.00087) [2022-07-11 04:08:42,868][26022] Updated weights on worker 0-0, policy_version 1028686 (0.00120) [2022-07-11 04:08:44,659][26022] Updated weights on worker 0-0, policy_version 1028696 (0.00088) [2022-07-11 04:08:45,062][25689] Fps is (10 sec: 5473.9, 60 sec: 5536.0, 300 sec: 5515.8). Total num frames: 1053385728. Throughput: 0: 5783.0. Samples: 1053387796. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:08:45,062][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 04:08:46,415][26022] Updated weights on worker 0-0, policy_version 1028706 (0.00087) [2022-07-11 04:08:48,553][26022] Updated weights on worker 0-0, policy_version 1028716 (0.00058) [2022-07-11 04:08:50,075][25689] Fps is (10 sec: 5507.4, 60 sec: 5555.5, 300 sec: 5523.5). Total num frames: 1053414400. Throughput: 0: 5804.7. Samples: 1053421296. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:08:50,075][26022] Updated weights on worker 0-0, policy_version 1028726 (0.00093) [2022-07-11 04:08:50,075][25689] Avg episode reward: [(0, '0.100')] [2022-07-11 04:08:52,068][26022] Updated weights on worker 0-0, policy_version 1028736 (0.00087) [2022-07-11 04:08:53,915][26022] Updated weights on worker 0-0, policy_version 1028746 (0.00087) [2022-07-11 04:08:55,106][25689] Fps is (10 sec: 5708.8, 60 sec: 5538.7, 300 sec: 5524.6). Total num frames: 1053443072. Throughput: 0: 4974.2. Samples: 1053437642. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:08:55,108][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 04:08:55,705][26022] Updated weights on worker 0-0, policy_version 1028756 (0.00087) [2022-07-11 04:08:57,609][26022] Updated weights on worker 0-0, policy_version 1028766 (0.00083) [2022-07-11 04:08:59,633][26022] Updated weights on worker 0-0, policy_version 1028776 (0.00086) [2022-07-11 04:09:00,157][25689] Fps is (10 sec: 5281.6, 60 sec: 5491.6, 300 sec: 5518.0). Total num frames: 1053467648. Throughput: 0: 5792.9. Samples: 1053470812. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:00,157][25689] Avg episode reward: [(0, '0.551')] [2022-07-11 04:09:01,111][26022] Updated weights on worker 0-0, policy_version 1028786 (0.00090) [2022-07-11 04:09:03,988][26022] Updated weights on worker 0-0, policy_version 1028796 (0.00081) [2022-07-11 04:09:05,069][26022] Updated weights on worker 0-0, policy_version 1028806 (0.00095) [2022-07-11 04:09:05,225][25689] Fps is (10 sec: 5363.7, 60 sec: 5554.4, 300 sec: 5523.8). Total num frames: 1053497344. Throughput: 0: 5661.3. Samples: 1053501946. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:05,225][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 04:09:07,586][26022] Updated weights on worker 0-0, policy_version 1028816 (0.00090) [2022-07-11 04:09:08,889][26022] Updated weights on worker 0-0, policy_version 1028826 (0.00096) [2022-07-11 04:09:10,245][25689] Fps is (10 sec: 5582.4, 60 sec: 5539.1, 300 sec: 5520.4). Total num frames: 1053523968. Throughput: 0: 4827.7. Samples: 1053518674. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:10,246][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 04:09:10,995][26022] Updated weights on worker 0-0, policy_version 1028836 (0.00094) [2022-07-11 04:09:12,668][26022] Updated weights on worker 0-0, policy_version 1028846 (0.00094) [2022-07-11 04:09:14,731][26022] Updated weights on worker 0-0, policy_version 1028856 (0.00088) [2022-07-11 04:09:15,254][25689] Fps is (10 sec: 5411.2, 60 sec: 5523.2, 300 sec: 5521.6). Total num frames: 1053551616. Throughput: 0: 5691.2. Samples: 1053552308. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:15,255][25689] Avg episode reward: [(0, '0.214')] [2022-07-11 04:09:16,368][26022] Updated weights on worker 0-0, policy_version 1028866 (0.00094) [2022-07-11 04:09:18,340][26022] Updated weights on worker 0-0, policy_version 1028876 (0.00086) [2022-07-11 04:09:20,012][26022] Updated weights on worker 0-0, policy_version 1028886 (0.00086) [2022-07-11 04:09:20,302][25689] Fps is (10 sec: 5702.2, 60 sec: 5545.7, 300 sec: 5521.4). Total num frames: 1053581312. Throughput: 0: 5706.2. Samples: 1053585766. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:20,302][25689] Avg episode reward: [(0, '0.891')] [2022-07-11 04:09:22,020][26022] Updated weights on worker 0-0, policy_version 1028896 (0.00095) [2022-07-11 04:09:23,616][26022] Updated weights on worker 0-0, policy_version 1028906 (0.00088) [2022-07-11 04:09:25,305][25689] Fps is (10 sec: 5603.6, 60 sec: 5534.1, 300 sec: 5521.6). Total num frames: 1053607936. Throughput: 0: 5015.3. Samples: 1053602654. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:25,305][25689] Avg episode reward: [(0, '0.479')] [2022-07-11 04:09:25,530][26022] Updated weights on worker 0-0, policy_version 1028916 (0.00085) [2022-07-11 04:09:27,411][26022] Updated weights on worker 0-0, policy_version 1028926 (0.00083) [2022-07-11 04:09:29,459][26022] Updated weights on worker 0-0, policy_version 1028936 (0.00083) [2022-07-11 04:09:30,323][25689] Fps is (10 sec: 5518.0, 60 sec: 5521.1, 300 sec: 5525.3). Total num frames: 1053636608. Throughput: 0: 5848.8. Samples: 1053636104. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:30,323][25689] Avg episode reward: [(0, '0.705')] [2022-07-11 04:09:31,104][26022] Updated weights on worker 0-0, policy_version 1028946 (0.00084) [2022-07-11 04:09:32,674][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:09:32,685][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001028954_1053648896.pth [2022-07-11 04:09:32,686][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001027011_1051659264.pth [2022-07-11 04:09:33,148][26022] Updated weights on worker 0-0, policy_version 1028956 (0.00086) [2022-07-11 04:09:34,686][26022] Updated weights on worker 0-0, policy_version 1028966 (0.00095) [2022-07-11 04:09:35,363][25689] Fps is (10 sec: 5497.7, 60 sec: 5509.6, 300 sec: 5518.7). Total num frames: 1053663232. Throughput: 0: 5828.3. Samples: 1053669508. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:35,363][25689] Avg episode reward: [(0, '0.598')] [2022-07-11 04:09:36,839][26022] Updated weights on worker 0-0, policy_version 1028976 (0.00084) [2022-07-11 04:09:38,498][26022] Updated weights on worker 0-0, policy_version 1028986 (0.00092) [2022-07-11 04:09:40,440][25689] Fps is (10 sec: 5465.7, 60 sec: 5509.4, 300 sec: 5521.2). Total num frames: 1053691904. Throughput: 0: 4983.6. Samples: 1053686124. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:40,440][25689] Avg episode reward: [(0, '0.140')] [2022-07-11 04:09:40,442][26022] Updated weights on worker 0-0, policy_version 1028996 (0.00087) [2022-07-11 04:09:42,015][26022] Updated weights on worker 0-0, policy_version 1029006 (0.00087) [2022-07-11 04:09:44,243][26022] Updated weights on worker 0-0, policy_version 1029016 (0.00087) [2022-07-11 04:09:45,457][25689] Fps is (10 sec: 5681.0, 60 sec: 5544.3, 300 sec: 5524.6). Total num frames: 1053720576. Throughput: 0: 5794.8. Samples: 1053719430. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:45,457][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 04:09:45,987][26022] Updated weights on worker 0-0, policy_version 1029026 (0.00094) [2022-07-11 04:09:47,877][26022] Updated weights on worker 0-0, policy_version 1029036 (0.00093) [2022-07-11 04:09:49,506][26022] Updated weights on worker 0-0, policy_version 1029046 (0.00083) [2022-07-11 04:09:50,519][25689] Fps is (10 sec: 5587.6, 60 sec: 5522.8, 300 sec: 5521.8). Total num frames: 1053748224. Throughput: 0: 5781.6. Samples: 1053752870. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:50,528][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 04:09:51,554][26022] Updated weights on worker 0-0, policy_version 1029056 (0.00052) [2022-07-11 04:09:53,083][26022] Updated weights on worker 0-0, policy_version 1029066 (0.00088) [2022-07-11 04:09:55,110][26022] Updated weights on worker 0-0, policy_version 1029076 (0.00102) [2022-07-11 04:09:55,537][25689] Fps is (10 sec: 5485.4, 60 sec: 5507.1, 300 sec: 5527.0). Total num frames: 1053775872. Throughput: 0: 4970.6. Samples: 1053769788. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:09:55,538][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 04:09:56,822][26022] Updated weights on worker 0-0, policy_version 1029086 (0.00083) [2022-07-11 04:09:58,701][26022] Updated weights on worker 0-0, policy_version 1029096 (0.00087) [2022-07-11 04:10:00,584][25689] Fps is (10 sec: 5595.9, 60 sec: 5575.2, 300 sec: 5527.9). Total num frames: 1053804544. Throughput: 0: 5828.6. Samples: 1053803534. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:00,585][25689] Avg episode reward: [(0, '0.226')] [2022-07-11 04:10:00,590][26022] Updated weights on worker 0-0, policy_version 1029106 (0.00088) [2022-07-11 04:10:02,652][26022] Updated weights on worker 0-0, policy_version 1029116 (0.00086) [2022-07-11 04:10:04,793][26022] Updated weights on worker 0-0, policy_version 1029126 (0.00090) [2022-07-11 04:10:05,668][25689] Fps is (10 sec: 5357.2, 60 sec: 5506.0, 300 sec: 5520.2). Total num frames: 1053830144. Throughput: 0: 5725.1. Samples: 1053835140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:05,670][25689] Avg episode reward: [(0, '0.007')] [2022-07-11 04:10:06,518][26022] Updated weights on worker 0-0, policy_version 1029136 (0.00090) [2022-07-11 04:10:08,161][26022] Updated weights on worker 0-0, policy_version 1029146 (0.00080) [2022-07-11 04:10:10,370][26022] Updated weights on worker 0-0, policy_version 1029156 (0.00091) [2022-07-11 04:10:10,678][25689] Fps is (10 sec: 5275.1, 60 sec: 5523.9, 300 sec: 5524.6). Total num frames: 1053857792. Throughput: 0: 4905.7. Samples: 1053851762. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:10,680][25689] Avg episode reward: [(0, '0.044')] [2022-07-11 04:10:11,963][26022] Updated weights on worker 0-0, policy_version 1029166 (0.00091) [2022-07-11 04:10:13,925][26022] Updated weights on worker 0-0, policy_version 1029176 (0.00085) [2022-07-11 04:10:15,693][25689] Fps is (10 sec: 5515.8, 60 sec: 5523.3, 300 sec: 5522.3). Total num frames: 1053885440. Throughput: 0: 5727.9. Samples: 1053885238. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:15,694][25689] Avg episode reward: [(0, '0.548')] [2022-07-11 04:10:15,836][26022] Updated weights on worker 0-0, policy_version 1029186 (0.00087) [2022-07-11 04:10:17,381][26022] Updated weights on worker 0-0, policy_version 1029196 (0.00085) [2022-07-11 04:10:19,327][26022] Updated weights on worker 0-0, policy_version 1029206 (0.00080) [2022-07-11 04:10:20,817][25689] Fps is (10 sec: 5655.9, 60 sec: 5516.4, 300 sec: 5523.7). Total num frames: 1053915136. Throughput: 0: 5708.4. Samples: 1053919032. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:20,817][25689] Avg episode reward: [(0, '0.746')] [2022-07-11 04:10:21,004][26022] Updated weights on worker 0-0, policy_version 1029216 (0.00084) [2022-07-11 04:10:22,958][26022] Updated weights on worker 0-0, policy_version 1029226 (0.00094) [2022-07-11 04:10:24,964][26022] Updated weights on worker 0-0, policy_version 1029236 (0.00085) [2022-07-11 04:10:25,910][25689] Fps is (10 sec: 5612.4, 60 sec: 5525.0, 300 sec: 5526.3). Total num frames: 1053942784. Throughput: 0: 5796.0. Samples: 1053952464. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:25,911][25689] Avg episode reward: [(0, '0.992')] [2022-07-11 04:10:26,522][26022] Updated weights on worker 0-0, policy_version 1029246 (0.00082) [2022-07-11 04:10:28,415][26022] Updated weights on worker 0-0, policy_version 1029256 (0.00088) [2022-07-11 04:10:30,210][26022] Updated weights on worker 0-0, policy_version 1029266 (0.00097) [2022-07-11 04:10:30,951][25689] Fps is (10 sec: 5456.5, 60 sec: 5506.1, 300 sec: 5525.6). Total num frames: 1053970432. Throughput: 0: 5786.3. Samples: 1053969066. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:30,951][25689] Avg episode reward: [(0, '1.177')] [2022-07-11 04:10:32,073][26022] Updated weights on worker 0-0, policy_version 1029276 (0.00087) [2022-07-11 04:10:34,142][26022] Updated weights on worker 0-0, policy_version 1029286 (0.00091) [2022-07-11 04:10:35,655][26022] Updated weights on worker 0-0, policy_version 1029296 (0.00095) [2022-07-11 04:10:36,026][25689] Fps is (10 sec: 5770.1, 60 sec: 5570.4, 300 sec: 5532.6). Total num frames: 1054001152. Throughput: 0: 5779.0. Samples: 1054002742. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:36,027][25689] Avg episode reward: [(0, '1.106')] [2022-07-11 04:10:37,935][26022] Updated weights on worker 0-0, policy_version 1029306 (0.00093) [2022-07-11 04:10:39,439][26022] Updated weights on worker 0-0, policy_version 1029316 (0.00051) [2022-07-11 04:10:41,151][25689] Fps is (10 sec: 5621.7, 60 sec: 5532.3, 300 sec: 5527.0). Total num frames: 1054027776. Throughput: 0: 5741.1. Samples: 1054035774. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:41,152][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 04:10:41,475][26022] Updated weights on worker 0-0, policy_version 1029326 (0.00089) [2022-07-11 04:10:43,478][26022] Updated weights on worker 0-0, policy_version 1029336 (0.00087) [2022-07-11 04:10:45,114][26022] Updated weights on worker 0-0, policy_version 1029346 (0.00080) [2022-07-11 04:10:46,158][25689] Fps is (10 sec: 5356.4, 60 sec: 5516.3, 300 sec: 5531.6). Total num frames: 1054055424. Throughput: 0: 4931.7. Samples: 1054052320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:46,159][25689] Avg episode reward: [(0, '1.138')] [2022-07-11 04:10:47,116][26022] Updated weights on worker 0-0, policy_version 1029356 (0.00083) [2022-07-11 04:10:48,940][26022] Updated weights on worker 0-0, policy_version 1029366 (0.00088) [2022-07-11 04:10:50,692][26022] Updated weights on worker 0-0, policy_version 1029376 (0.00091) [2022-07-11 04:10:51,231][25689] Fps is (10 sec: 5485.8, 60 sec: 5515.4, 300 sec: 5530.6). Total num frames: 1054083072. Throughput: 0: 5730.7. Samples: 1054085286. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:51,231][25689] Avg episode reward: [(0, '0.688')] [2022-07-11 04:10:52,627][26022] Updated weights on worker 0-0, policy_version 1029386 (0.00077) [2022-07-11 04:10:54,504][26022] Updated weights on worker 0-0, policy_version 1029396 (0.00088) [2022-07-11 04:10:56,196][26022] Updated weights on worker 0-0, policy_version 1029406 (0.00091) [2022-07-11 04:10:56,235][25689] Fps is (10 sec: 5589.2, 60 sec: 5533.6, 300 sec: 5528.1). Total num frames: 1054111744. Throughput: 0: 5734.0. Samples: 1054118620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:10:56,235][25689] Avg episode reward: [(0, '-0.873')] [2022-07-11 04:10:58,277][26022] Updated weights on worker 0-0, policy_version 1029416 (0.00082) [2022-07-11 04:10:59,862][26022] Updated weights on worker 0-0, policy_version 1029426 (0.00093) [2022-07-11 04:11:01,335][25689] Fps is (10 sec: 5574.1, 60 sec: 5511.8, 300 sec: 5533.5). Total num frames: 1054139392. Throughput: 0: 4927.1. Samples: 1054135220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:01,335][25689] Avg episode reward: [(0, '-0.891')] [2022-07-11 04:11:02,222][26022] Updated weights on worker 0-0, policy_version 1029436 (0.00087) [2022-07-11 04:11:04,132][26022] Updated weights on worker 0-0, policy_version 1029446 (0.00086) [2022-07-11 04:11:05,946][26022] Updated weights on worker 0-0, policy_version 1029456 (0.00085) [2022-07-11 04:11:06,395][25689] Fps is (10 sec: 5240.6, 60 sec: 5514.0, 300 sec: 5526.6). Total num frames: 1054164992. Throughput: 0: 5659.8. Samples: 1054166858. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:06,398][25689] Avg episode reward: [(0, '-0.016')] [2022-07-11 04:11:07,747][26022] Updated weights on worker 0-0, policy_version 1029466 (0.00088) [2022-07-11 04:11:09,658][26022] Updated weights on worker 0-0, policy_version 1029476 (0.00086) [2022-07-11 04:11:11,210][26022] Updated weights on worker 0-0, policy_version 1029486 (0.00089) [2022-07-11 04:11:11,401][25689] Fps is (10 sec: 5391.6, 60 sec: 5531.2, 300 sec: 5531.8). Total num frames: 1054193664. Throughput: 0: 5712.5. Samples: 1054200506. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:11,402][25689] Avg episode reward: [(0, '-0.753')] [2022-07-11 04:11:13,301][26022] Updated weights on worker 0-0, policy_version 1029496 (0.00086) [2022-07-11 04:11:14,863][26022] Updated weights on worker 0-0, policy_version 1029506 (0.00085) [2022-07-11 04:11:16,464][25689] Fps is (10 sec: 5593.5, 60 sec: 5526.8, 300 sec: 5529.4). Total num frames: 1054221312. Throughput: 0: 4895.7. Samples: 1054217658. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:16,465][25689] Avg episode reward: [(0, '-0.342')] [2022-07-11 04:11:16,767][26022] Updated weights on worker 0-0, policy_version 1029516 (0.00090) [2022-07-11 04:11:18,562][26022] Updated weights on worker 0-0, policy_version 1029526 (0.00091) [2022-07-11 04:11:20,455][26022] Updated weights on worker 0-0, policy_version 1029536 (0.00092) [2022-07-11 04:11:21,509][25689] Fps is (10 sec: 5571.8, 60 sec: 5517.2, 300 sec: 5532.5). Total num frames: 1054249984. Throughput: 0: 5750.2. Samples: 1054251224. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:21,511][25689] Avg episode reward: [(0, '-0.372')] [2022-07-11 04:11:22,317][26022] Updated weights on worker 0-0, policy_version 1029546 (0.00091) [2022-07-11 04:11:24,104][26022] Updated weights on worker 0-0, policy_version 1029556 (0.00079) [2022-07-11 04:11:25,840][26022] Updated weights on worker 0-0, policy_version 1029566 (0.00090) [2022-07-11 04:11:26,520][25689] Fps is (10 sec: 5600.7, 60 sec: 5524.7, 300 sec: 5525.6). Total num frames: 1054277632. Throughput: 0: 5861.2. Samples: 1054284814. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:26,521][25689] Avg episode reward: [(0, '0.204')] [2022-07-11 04:11:27,863][26022] Updated weights on worker 0-0, policy_version 1029576 (0.00085) [2022-07-11 04:11:29,670][26022] Updated weights on worker 0-0, policy_version 1029586 (0.00090) [2022-07-11 04:11:31,553][25689] Fps is (10 sec: 5505.0, 60 sec: 5525.3, 300 sec: 5528.7). Total num frames: 1054305280. Throughput: 0: 5000.1. Samples: 1054301272. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:31,554][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 04:11:31,585][26022] Updated weights on worker 0-0, policy_version 1029596 (0.00090) [2022-07-11 04:11:32,690][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:11:32,702][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001029603_1054313472.pth [2022-07-11 04:11:32,702][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001027658_1052321792.pth [2022-07-11 04:11:33,299][26022] Updated weights on worker 0-0, policy_version 1029606 (0.00089) [2022-07-11 04:11:35,412][26022] Updated weights on worker 0-0, policy_version 1029616 (0.00085) [2022-07-11 04:11:36,575][25689] Fps is (10 sec: 5703.3, 60 sec: 5513.3, 300 sec: 5533.1). Total num frames: 1054334976. Throughput: 0: 5818.4. Samples: 1054334670. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:36,575][25689] Avg episode reward: [(0, '0.804')] [2022-07-11 04:11:37,087][26022] Updated weights on worker 0-0, policy_version 1029626 (0.00085) [2022-07-11 04:11:38,868][26022] Updated weights on worker 0-0, policy_version 1029636 (0.00100) [2022-07-11 04:11:40,611][26022] Updated weights on worker 0-0, policy_version 1029646 (0.00087) [2022-07-11 04:11:41,691][25689] Fps is (10 sec: 5555.7, 60 sec: 5514.1, 300 sec: 5524.7). Total num frames: 1054361600. Throughput: 0: 5800.3. Samples: 1054368286. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:41,692][25689] Avg episode reward: [(0, '-0.464')] [2022-07-11 04:11:42,339][26022] Updated weights on worker 0-0, policy_version 1029656 (0.00089) [2022-07-11 04:11:44,223][26022] Updated weights on worker 0-0, policy_version 1029666 (0.00088) [2022-07-11 04:11:46,073][26022] Updated weights on worker 0-0, policy_version 1029676 (0.00093) [2022-07-11 04:11:46,697][25689] Fps is (10 sec: 5463.0, 60 sec: 5531.2, 300 sec: 5535.9). Total num frames: 1054390272. Throughput: 0: 4979.3. Samples: 1054385278. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:46,697][25689] Avg episode reward: [(0, '-1.289')] [2022-07-11 04:11:47,984][26022] Updated weights on worker 0-0, policy_version 1029686 (0.00087) [2022-07-11 04:11:49,832][26022] Updated weights on worker 0-0, policy_version 1029696 (0.00086) [2022-07-11 04:11:51,647][26022] Updated weights on worker 0-0, policy_version 1029706 (0.00086) [2022-07-11 04:11:51,729][25689] Fps is (10 sec: 5814.7, 60 sec: 5568.7, 300 sec: 5536.5). Total num frames: 1054419968. Throughput: 0: 5824.0. Samples: 1054418772. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:51,729][25689] Avg episode reward: [(0, '-1.815')] [2022-07-11 04:11:53,570][26022] Updated weights on worker 0-0, policy_version 1029716 (0.00095) [2022-07-11 04:11:55,183][26022] Updated weights on worker 0-0, policy_version 1029726 (0.00084) [2022-07-11 04:11:56,742][25689] Fps is (10 sec: 5606.4, 60 sec: 5534.0, 300 sec: 5534.0). Total num frames: 1054446592. Throughput: 0: 5820.6. Samples: 1054452058. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:11:56,743][25689] Avg episode reward: [(0, '-1.921')] [2022-07-11 04:11:57,181][26022] Updated weights on worker 0-0, policy_version 1029736 (0.00096) [2022-07-11 04:11:59,104][26022] Updated weights on worker 0-0, policy_version 1029746 (0.00081) [2022-07-11 04:12:00,930][26022] Updated weights on worker 0-0, policy_version 1029756 (0.00090) [2022-07-11 04:12:01,815][25689] Fps is (10 sec: 5380.7, 60 sec: 5536.5, 300 sec: 5540.4). Total num frames: 1054474240. Throughput: 0: 4970.8. Samples: 1054468320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:12:01,816][25689] Avg episode reward: [(0, '-0.975')] [2022-07-11 04:12:03,187][26022] Updated weights on worker 0-0, policy_version 1029766 (0.00085) [2022-07-11 04:12:04,810][26022] Updated weights on worker 0-0, policy_version 1029776 (0.00089) [2022-07-11 04:12:06,745][26022] Updated weights on worker 0-0, policy_version 1029786 (0.00087) [2022-07-11 04:12:06,843][25689] Fps is (10 sec: 5372.9, 60 sec: 5556.4, 300 sec: 5534.6). Total num frames: 1054500864. Throughput: 0: 5702.0. Samples: 1054500154. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:12:06,844][25689] Avg episode reward: [(0, '-0.923')] [2022-07-11 04:12:08,656][26022] Updated weights on worker 0-0, policy_version 1029796 (0.00085) [2022-07-11 04:12:10,513][26022] Updated weights on worker 0-0, policy_version 1029806 (0.00804) [2022-07-11 04:12:11,896][25689] Fps is (10 sec: 5383.9, 60 sec: 5535.1, 300 sec: 5534.8). Total num frames: 1054528512. Throughput: 0: 5703.1. Samples: 1054533786. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:12:11,896][25689] Avg episode reward: [(0, '-0.504')] [2022-07-11 04:12:12,450][26022] Updated weights on worker 0-0, policy_version 1029816 (0.00098) [2022-07-11 04:12:14,124][26022] Updated weights on worker 0-0, policy_version 1029826 (0.00083) [2022-07-11 04:12:16,127][26022] Updated weights on worker 0-0, policy_version 1029836 (0.00083) [2022-07-11 04:12:16,907][25689] Fps is (10 sec: 5596.4, 60 sec: 5556.9, 300 sec: 5534.1). Total num frames: 1054557184. Throughput: 0: 4877.5. Samples: 1054550412. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:12:16,909][25689] Avg episode reward: [(0, '0.554')] [2022-07-11 04:12:17,764][26022] Updated weights on worker 0-0, policy_version 1029846 (0.00083) [2022-07-11 04:12:19,678][26022] Updated weights on worker 0-0, policy_version 1029856 (0.00080) [2022-07-11 04:12:21,507][26022] Updated weights on worker 0-0, policy_version 1029866 (0.00086) [2022-07-11 04:12:21,999][25689] Fps is (10 sec: 5574.6, 60 sec: 5535.6, 300 sec: 5532.8). Total num frames: 1054584832. Throughput: 0: 5709.9. Samples: 1054583566. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2022-07-11 04:12:22,000][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 04:12:23,600][26022] Updated weights on worker 0-0, policy_version 1029876 (0.00093) [2022-07-11 04:12:25,102][26022] Updated weights on worker 0-0, policy_version 1029886 (0.00087) [2022-07-11 04:12:27,065][25689] Fps is (10 sec: 5443.4, 60 sec: 5530.5, 300 sec: 5525.6). Total num frames: 1054612480. Throughput: 0: 5788.2. Samples: 1054617202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:27,066][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 04:12:27,104][26022] Updated weights on worker 0-0, policy_version 1029896 (0.00093) [2022-07-11 04:12:28,818][26022] Updated weights on worker 0-0, policy_version 1029906 (0.00079) [2022-07-11 04:12:30,810][26022] Updated weights on worker 0-0, policy_version 1029916 (0.00086) [2022-07-11 04:12:32,092][25689] Fps is (10 sec: 5580.0, 60 sec: 5548.1, 300 sec: 5529.5). Total num frames: 1054641152. Throughput: 0: 5769.5. Samples: 1054650308. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:32,094][25689] Avg episode reward: [(0, '-0.286')] [2022-07-11 04:12:32,563][26022] Updated weights on worker 0-0, policy_version 1029926 (0.00085) [2022-07-11 04:12:34,336][26022] Updated weights on worker 0-0, policy_version 1029936 (0.00091) [2022-07-11 04:12:36,400][26022] Updated weights on worker 0-0, policy_version 1029947 (0.00088) [2022-07-11 04:12:37,131][25689] Fps is (10 sec: 5696.9, 60 sec: 5529.5, 300 sec: 5533.5). Total num frames: 1054669824. Throughput: 0: 5769.9. Samples: 1054667104. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:37,132][25689] Avg episode reward: [(0, '-1.375')] [2022-07-11 04:12:38,318][26022] Updated weights on worker 0-0, policy_version 1029957 (0.00083) [2022-07-11 04:12:39,924][26022] Updated weights on worker 0-0, policy_version 1029967 (0.00086) [2022-07-11 04:12:42,047][26022] Updated weights on worker 0-0, policy_version 1029977 (0.00081) [2022-07-11 04:12:42,215][25689] Fps is (10 sec: 5462.0, 60 sec: 5532.4, 300 sec: 5528.7). Total num frames: 1054696448. Throughput: 0: 5798.0. Samples: 1054700784. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:42,216][25689] Avg episode reward: [(0, '-2.148')] [2022-07-11 04:12:43,654][26022] Updated weights on worker 0-0, policy_version 1029987 (0.00090) [2022-07-11 04:12:45,604][26022] Updated weights on worker 0-0, policy_version 1029997 (0.00084) [2022-07-11 04:12:47,231][25689] Fps is (10 sec: 5576.1, 60 sec: 5548.4, 300 sec: 5536.1). Total num frames: 1054726144. Throughput: 0: 5817.8. Samples: 1054734524. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:47,232][25689] Avg episode reward: [(0, '-1.811')] [2022-07-11 04:12:47,612][26022] Updated weights on worker 0-0, policy_version 1030007 (0.00081) [2022-07-11 04:12:49,091][26022] Updated weights on worker 0-0, policy_version 1030017 (0.00099) [2022-07-11 04:12:51,228][26022] Updated weights on worker 0-0, policy_version 1030027 (0.00083) [2022-07-11 04:12:52,238][25689] Fps is (10 sec: 5823.7, 60 sec: 5533.8, 300 sec: 5533.1). Total num frames: 1054754816. Throughput: 0: 5007.6. Samples: 1054751192. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:52,238][25689] Avg episode reward: [(0, '-1.444')] [2022-07-11 04:12:52,699][26022] Updated weights on worker 0-0, policy_version 1030037 (0.00086) [2022-07-11 04:12:54,811][26022] Updated weights on worker 0-0, policy_version 1030047 (0.00090) [2022-07-11 04:12:56,932][26022] Updated weights on worker 0-0, policy_version 1030057 (0.00086) [2022-07-11 04:12:57,248][25689] Fps is (10 sec: 5316.0, 60 sec: 5500.3, 300 sec: 5524.3). Total num frames: 1054779392. Throughput: 0: 5833.1. Samples: 1054784448. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:12:57,249][25689] Avg episode reward: [(0, '-2.462')] [2022-07-11 04:12:58,381][26022] Updated weights on worker 0-0, policy_version 1030067 (0.00081) [2022-07-11 04:13:00,496][26022] Updated weights on worker 0-0, policy_version 1030077 (0.00089) [2022-07-11 04:13:02,160][26022] Updated weights on worker 0-0, policy_version 1030087 (0.00095) [2022-07-11 04:13:02,350][25689] Fps is (10 sec: 5367.0, 60 sec: 5531.5, 300 sec: 5536.5). Total num frames: 1054809088. Throughput: 0: 5783.4. Samples: 1054817230. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:02,351][25689] Avg episode reward: [(0, '-1.689')] [2022-07-11 04:13:04,379][26022] Updated weights on worker 0-0, policy_version 1030097 (0.00087) [2022-07-11 04:13:06,096][26022] Updated weights on worker 0-0, policy_version 1030107 (0.00090) [2022-07-11 04:13:07,360][25689] Fps is (10 sec: 5569.5, 60 sec: 5533.1, 300 sec: 5533.5). Total num frames: 1054835712. Throughput: 0: 4894.8. Samples: 1054833050. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:07,361][25689] Avg episode reward: [(0, '-0.238')] [2022-07-11 04:13:08,016][26022] Updated weights on worker 0-0, policy_version 1030117 (0.00085) [2022-07-11 04:13:09,706][26022] Updated weights on worker 0-0, policy_version 1030127 (0.00086) [2022-07-11 04:13:11,674][26022] Updated weights on worker 0-0, policy_version 1030137 (0.00091) [2022-07-11 04:13:12,387][25689] Fps is (10 sec: 5509.1, 60 sec: 5552.4, 300 sec: 5533.4). Total num frames: 1054864384. Throughput: 0: 5733.1. Samples: 1054866710. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:12,388][25689] Avg episode reward: [(0, '0.171')] [2022-07-11 04:13:13,392][26022] Updated weights on worker 0-0, policy_version 1030147 (0.00094) [2022-07-11 04:13:15,203][26022] Updated weights on worker 0-0, policy_version 1030157 (0.00089) [2022-07-11 04:13:17,027][26022] Updated weights on worker 0-0, policy_version 1030167 (0.00084) [2022-07-11 04:13:17,403][25689] Fps is (10 sec: 5710.2, 60 sec: 5552.0, 300 sec: 5535.1). Total num frames: 1054893056. Throughput: 0: 5769.0. Samples: 1054900720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:17,403][25689] Avg episode reward: [(0, '0.514')] [2022-07-11 04:13:18,843][26022] Updated weights on worker 0-0, policy_version 1030177 (0.00093) [2022-07-11 04:13:20,674][26022] Updated weights on worker 0-0, policy_version 1030187 (0.00087) [2022-07-11 04:13:22,455][25689] Fps is (10 sec: 5492.5, 60 sec: 5538.7, 300 sec: 5531.9). Total num frames: 1054919680. Throughput: 0: 4978.8. Samples: 1054917328. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:22,455][25689] Avg episode reward: [(0, '0.331')] [2022-07-11 04:13:22,667][26022] Updated weights on worker 0-0, policy_version 1030197 (0.00083) [2022-07-11 04:13:24,436][26022] Updated weights on worker 0-0, policy_version 1030207 (0.00085) [2022-07-11 04:13:26,399][26022] Updated weights on worker 0-0, policy_version 1030217 (0.00100) [2022-07-11 04:13:27,535][25689] Fps is (10 sec: 5558.5, 60 sec: 5571.3, 300 sec: 5531.5). Total num frames: 1054949376. Throughput: 0: 5835.0. Samples: 1054950768. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:27,535][25689] Avg episode reward: [(0, '1.038')] [2022-07-11 04:13:27,955][26022] Updated weights on worker 0-0, policy_version 1030227 (0.00094) [2022-07-11 04:13:30,175][26022] Updated weights on worker 0-0, policy_version 1030237 (0.00086) [2022-07-11 04:13:31,591][26022] Updated weights on worker 0-0, policy_version 1030247 (0.00108) [2022-07-11 04:13:32,575][25689] Fps is (10 sec: 5565.2, 60 sec: 5536.2, 300 sec: 5529.2). Total num frames: 1054976000. Throughput: 0: 5814.2. Samples: 1054984084. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:32,575][25689] Avg episode reward: [(0, '0.609')] [2022-07-11 04:13:32,808][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:13:32,824][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001030252_1054978048.pth [2022-07-11 04:13:32,824][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001028306_1052985344.pth [2022-07-11 04:13:33,655][26022] Updated weights on worker 0-0, policy_version 1030257 (0.00087) [2022-07-11 04:13:35,537][26022] Updated weights on worker 0-0, policy_version 1030267 (0.00092) [2022-07-11 04:13:37,328][26022] Updated weights on worker 0-0, policy_version 1030277 (0.00090) [2022-07-11 04:13:37,579][25689] Fps is (10 sec: 5607.4, 60 sec: 5556.4, 300 sec: 5533.9). Total num frames: 1055005696. Throughput: 0: 4965.6. Samples: 1055000906. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:37,579][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 04:13:39,244][26022] Updated weights on worker 0-0, policy_version 1030287 (0.00090) [2022-07-11 04:13:41,041][26022] Updated weights on worker 0-0, policy_version 1030297 (0.00082) [2022-07-11 04:13:42,629][25689] Fps is (10 sec: 5703.3, 60 sec: 5576.5, 300 sec: 5536.9). Total num frames: 1055033344. Throughput: 0: 5812.8. Samples: 1055034598. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:42,633][25689] Avg episode reward: [(0, '0.778')] [2022-07-11 04:13:42,674][26022] Updated weights on worker 0-0, policy_version 1030307 (0.00092) [2022-07-11 04:13:44,790][26022] Updated weights on worker 0-0, policy_version 1030317 (0.00101) [2022-07-11 04:13:46,462][26022] Updated weights on worker 0-0, policy_version 1030327 (0.00087) [2022-07-11 04:13:47,700][25689] Fps is (10 sec: 5463.5, 60 sec: 5537.6, 300 sec: 5532.4). Total num frames: 1055060992. Throughput: 0: 5814.9. Samples: 1055068024. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:47,700][25689] Avg episode reward: [(0, '1.328')] [2022-07-11 04:13:48,395][26022] Updated weights on worker 0-0, policy_version 1030337 (0.00101) [2022-07-11 04:13:50,084][26022] Updated weights on worker 0-0, policy_version 1030347 (0.00085) [2022-07-11 04:13:52,215][26022] Updated weights on worker 0-0, policy_version 1030357 (0.00087) [2022-07-11 04:13:52,714][25689] Fps is (10 sec: 5483.4, 60 sec: 5520.0, 300 sec: 5529.3). Total num frames: 1055088640. Throughput: 0: 4992.4. Samples: 1055084624. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:52,714][25689] Avg episode reward: [(0, '1.351')] [2022-07-11 04:13:54,012][26022] Updated weights on worker 0-0, policy_version 1030367 (0.00089) [2022-07-11 04:13:55,919][26022] Updated weights on worker 0-0, policy_version 1030377 (0.00472) [2022-07-11 04:13:57,571][26022] Updated weights on worker 0-0, policy_version 1030387 (0.00088) [2022-07-11 04:13:57,769][25689] Fps is (10 sec: 5491.6, 60 sec: 5566.6, 300 sec: 5539.6). Total num frames: 1055116288. Throughput: 0: 5775.8. Samples: 1055117520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:13:57,769][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 04:13:59,568][26022] Updated weights on worker 0-0, policy_version 1030397 (0.00091) [2022-07-11 04:14:01,458][26022] Updated weights on worker 0-0, policy_version 1030407 (0.00081) [2022-07-11 04:14:02,849][25689] Fps is (10 sec: 5253.4, 60 sec: 5500.9, 300 sec: 5525.6). Total num frames: 1055141888. Throughput: 0: 5665.8. Samples: 1055149160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:02,850][25689] Avg episode reward: [(0, '1.219')] [2022-07-11 04:14:03,591][26022] Updated weights on worker 0-0, policy_version 1030417 (0.00090) [2022-07-11 04:14:05,302][26022] Updated weights on worker 0-0, policy_version 1030427 (0.00096) [2022-07-11 04:14:07,220][26022] Updated weights on worker 0-0, policy_version 1030437 (0.00088) [2022-07-11 04:14:07,895][25689] Fps is (10 sec: 5359.7, 60 sec: 5531.5, 300 sec: 5532.0). Total num frames: 1055170560. Throughput: 0: 4828.9. Samples: 1055165544. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:07,895][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 04:14:09,256][26022] Updated weights on worker 0-0, policy_version 1030447 (0.00085) [2022-07-11 04:14:11,006][26022] Updated weights on worker 0-0, policy_version 1030457 (0.00087) [2022-07-11 04:14:12,704][26022] Updated weights on worker 0-0, policy_version 1030467 (0.00084) [2022-07-11 04:14:12,897][25689] Fps is (10 sec: 5605.1, 60 sec: 5516.8, 300 sec: 5532.1). Total num frames: 1055198208. Throughput: 0: 5661.9. Samples: 1055198904. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:12,898][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 04:14:14,640][26022] Updated weights on worker 0-0, policy_version 1030477 (0.00091) [2022-07-11 04:14:16,349][26022] Updated weights on worker 0-0, policy_version 1030487 (0.00085) [2022-07-11 04:14:17,935][25689] Fps is (10 sec: 5405.7, 60 sec: 5481.0, 300 sec: 5522.0). Total num frames: 1055224832. Throughput: 0: 5700.9. Samples: 1055232482. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:17,935][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 04:14:18,444][26022] Updated weights on worker 0-0, policy_version 1030497 (0.00095) [2022-07-11 04:14:20,093][26022] Updated weights on worker 0-0, policy_version 1030507 (0.00086) [2022-07-11 04:14:22,035][26022] Updated weights on worker 0-0, policy_version 1030517 (0.00082) [2022-07-11 04:14:22,980][25689] Fps is (10 sec: 5789.0, 60 sec: 5566.2, 300 sec: 5538.4). Total num frames: 1055256576. Throughput: 0: 4960.3. Samples: 1055249014. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:22,980][25689] Avg episode reward: [(0, '0.503')] [2022-07-11 04:14:23,823][26022] Updated weights on worker 0-0, policy_version 1030527 (0.00091) [2022-07-11 04:14:25,736][26022] Updated weights on worker 0-0, policy_version 1030537 (0.00079) [2022-07-11 04:14:27,507][26022] Updated weights on worker 0-0, policy_version 1030547 (0.00082) [2022-07-11 04:14:27,987][25689] Fps is (10 sec: 5806.5, 60 sec: 5522.1, 300 sec: 5531.7). Total num frames: 1055283200. Throughput: 0: 5821.6. Samples: 1055282512. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:27,987][25689] Avg episode reward: [(0, '1.270')] [2022-07-11 04:14:29,492][26022] Updated weights on worker 0-0, policy_version 1030557 (0.00091) [2022-07-11 04:14:30,894][26022] Updated weights on worker 0-0, policy_version 1030567 (0.00051) [2022-07-11 04:14:33,015][25689] Fps is (10 sec: 5306.1, 60 sec: 5523.2, 300 sec: 5531.9). Total num frames: 1055309824. Throughput: 0: 5829.1. Samples: 1055316172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:33,016][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 04:14:33,259][26022] Updated weights on worker 0-0, policy_version 1030577 (0.00083) [2022-07-11 04:14:34,750][26022] Updated weights on worker 0-0, policy_version 1030587 (0.00084) [2022-07-11 04:14:36,792][26022] Updated weights on worker 0-0, policy_version 1030597 (0.00089) [2022-07-11 04:14:38,034][25689] Fps is (10 sec: 5707.4, 60 sec: 5538.8, 300 sec: 5539.9). Total num frames: 1055340544. Throughput: 0: 4998.1. Samples: 1055332940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:38,034][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 04:14:38,355][26022] Updated weights on worker 0-0, policy_version 1030607 (0.00098) [2022-07-11 04:14:40,479][26022] Updated weights on worker 0-0, policy_version 1030617 (0.00094) [2022-07-11 04:14:42,127][26022] Updated weights on worker 0-0, policy_version 1030627 (0.00092) [2022-07-11 04:14:43,100][25689] Fps is (10 sec: 5584.8, 60 sec: 5503.5, 300 sec: 5528.7). Total num frames: 1055366144. Throughput: 0: 5842.2. Samples: 1055366558. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:43,100][25689] Avg episode reward: [(0, '0.633')] [2022-07-11 04:14:43,927][26022] Updated weights on worker 0-0, policy_version 1030637 (0.00087) [2022-07-11 04:14:45,838][26022] Updated weights on worker 0-0, policy_version 1030647 (0.00090) [2022-07-11 04:14:47,650][26022] Updated weights on worker 0-0, policy_version 1030657 (0.00092) [2022-07-11 04:14:48,176][25689] Fps is (10 sec: 5351.3, 60 sec: 5519.9, 300 sec: 5531.9). Total num frames: 1055394816. Throughput: 0: 5817.6. Samples: 1055399964. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:48,176][25689] Avg episode reward: [(0, '0.102')] [2022-07-11 04:14:49,376][26022] Updated weights on worker 0-0, policy_version 1030667 (0.00101) [2022-07-11 04:14:51,411][26022] Updated weights on worker 0-0, policy_version 1030677 (0.00096) [2022-07-11 04:14:52,975][26022] Updated weights on worker 0-0, policy_version 1030687 (0.00088) [2022-07-11 04:14:53,185][25689] Fps is (10 sec: 5685.9, 60 sec: 5537.3, 300 sec: 5535.5). Total num frames: 1055423488. Throughput: 0: 4977.1. Samples: 1055416560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:53,185][25689] Avg episode reward: [(0, '-0.730')] [2022-07-11 04:14:55,033][26022] Updated weights on worker 0-0, policy_version 1030697 (0.00091) [2022-07-11 04:14:56,865][26022] Updated weights on worker 0-0, policy_version 1030707 (0.00085) [2022-07-11 04:14:58,271][25689] Fps is (10 sec: 5579.1, 60 sec: 5534.5, 300 sec: 5531.3). Total num frames: 1055451136. Throughput: 0: 5790.7. Samples: 1055450124. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:14:58,271][25689] Avg episode reward: [(0, '-0.679')] [2022-07-11 04:14:58,818][26022] Updated weights on worker 0-0, policy_version 1030717 (0.00086) [2022-07-11 04:15:00,776][26022] Updated weights on worker 0-0, policy_version 1030727 (0.00118) [2022-07-11 04:15:02,621][26022] Updated weights on worker 0-0, policy_version 1030737 (0.00087) [2022-07-11 04:15:03,332][25689] Fps is (10 sec: 5449.4, 60 sec: 5570.1, 300 sec: 5538.6). Total num frames: 1055478784. Throughput: 0: 5670.7. Samples: 1055481290. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:03,333][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 04:15:04,845][26022] Updated weights on worker 0-0, policy_version 1030747 (0.00089) [2022-07-11 04:15:06,351][26022] Updated weights on worker 0-0, policy_version 1030757 (0.00085) [2022-07-11 04:15:08,343][25689] Fps is (10 sec: 5286.6, 60 sec: 5522.5, 300 sec: 5531.7). Total num frames: 1055504384. Throughput: 0: 5696.8. Samples: 1055514852. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:08,343][25689] Avg episode reward: [(0, '-0.728')] [2022-07-11 04:15:08,400][26022] Updated weights on worker 0-0, policy_version 1030767 (0.00091) [2022-07-11 04:15:10,048][26022] Updated weights on worker 0-0, policy_version 1030777 (0.00090) [2022-07-11 04:15:11,978][26022] Updated weights on worker 0-0, policy_version 1030787 (0.00088) [2022-07-11 04:15:13,374][25689] Fps is (10 sec: 5506.4, 60 sec: 5553.7, 300 sec: 5538.3). Total num frames: 1055534080. Throughput: 0: 5694.7. Samples: 1055531532. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:13,375][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 04:15:13,804][26022] Updated weights on worker 0-0, policy_version 1030797 (0.00089) [2022-07-11 04:15:15,611][26022] Updated weights on worker 0-0, policy_version 1030807 (0.00081) [2022-07-11 04:15:17,325][26022] Updated weights on worker 0-0, policy_version 1030817 (0.00086) [2022-07-11 04:15:18,387][25689] Fps is (10 sec: 5607.4, 60 sec: 5556.0, 300 sec: 5530.0). Total num frames: 1055560704. Throughput: 0: 5728.1. Samples: 1055565352. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:18,387][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 04:15:19,303][26022] Updated weights on worker 0-0, policy_version 1030827 (0.00064) [2022-07-11 04:15:21,162][26022] Updated weights on worker 0-0, policy_version 1030837 (0.00086) [2022-07-11 04:15:22,948][26022] Updated weights on worker 0-0, policy_version 1030847 (0.00085) [2022-07-11 04:15:23,449][25689] Fps is (10 sec: 5590.0, 60 sec: 5520.5, 300 sec: 5537.5). Total num frames: 1055590400. Throughput: 0: 5832.6. Samples: 1055598626. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:23,450][25689] Avg episode reward: [(0, '0.503')] [2022-07-11 04:15:24,797][26022] Updated weights on worker 0-0, policy_version 1030857 (0.00089) [2022-07-11 04:15:26,642][26022] Updated weights on worker 0-0, policy_version 1030867 (0.00078) [2022-07-11 04:15:28,477][25689] Fps is (10 sec: 5683.3, 60 sec: 5535.6, 300 sec: 5537.7). Total num frames: 1055618048. Throughput: 0: 4990.0. Samples: 1055615320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:28,477][25689] Avg episode reward: [(0, '-0.129')] [2022-07-11 04:15:28,479][26022] Updated weights on worker 0-0, policy_version 1030877 (0.00084) [2022-07-11 04:15:30,601][26022] Updated weights on worker 0-0, policy_version 1030887 (0.00359) [2022-07-11 04:15:32,079][26022] Updated weights on worker 0-0, policy_version 1030897 (0.00088) [2022-07-11 04:15:33,229][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:15:33,249][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001030902_1055643648.pth [2022-07-11 04:15:33,250][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001028954_1053648896.pth [2022-07-11 04:15:33,532][25689] Fps is (10 sec: 5382.6, 60 sec: 5533.1, 300 sec: 5524.3). Total num frames: 1055644672. Throughput: 0: 5810.4. Samples: 1055648658. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:33,533][25689] Avg episode reward: [(0, '-1.782')] [2022-07-11 04:15:34,102][26022] Updated weights on worker 0-0, policy_version 1030907 (0.00077) [2022-07-11 04:15:35,748][26022] Updated weights on worker 0-0, policy_version 1030917 (0.00490) [2022-07-11 04:15:37,874][26022] Updated weights on worker 0-0, policy_version 1030927 (0.00095) [2022-07-11 04:15:38,582][25689] Fps is (10 sec: 5471.7, 60 sec: 5496.4, 300 sec: 5532.6). Total num frames: 1055673344. Throughput: 0: 5783.5. Samples: 1055682154. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:38,583][25689] Avg episode reward: [(0, '-2.926')] [2022-07-11 04:15:39,600][26022] Updated weights on worker 0-0, policy_version 1030937 (0.00085) [2022-07-11 04:15:41,545][26022] Updated weights on worker 0-0, policy_version 1030947 (0.00088) [2022-07-11 04:15:43,265][26022] Updated weights on worker 0-0, policy_version 1030957 (0.00086) [2022-07-11 04:15:43,690][25689] Fps is (10 sec: 5746.2, 60 sec: 5560.2, 300 sec: 5537.6). Total num frames: 1055703040. Throughput: 0: 4933.8. Samples: 1055698490. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:43,691][25689] Avg episode reward: [(0, '-2.898')] [2022-07-11 04:15:45,319][26022] Updated weights on worker 0-0, policy_version 1030967 (0.00095) [2022-07-11 04:15:46,894][26022] Updated weights on worker 0-0, policy_version 1030977 (0.00081) [2022-07-11 04:15:48,768][25689] Fps is (10 sec: 5428.9, 60 sec: 5509.4, 300 sec: 5530.6). Total num frames: 1055728640. Throughput: 0: 5750.4. Samples: 1055732004. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:48,768][25689] Avg episode reward: [(0, '-2.937')] [2022-07-11 04:15:49,046][26022] Updated weights on worker 0-0, policy_version 1030987 (0.00085) [2022-07-11 04:15:50,663][26022] Updated weights on worker 0-0, policy_version 1030997 (0.00084) [2022-07-11 04:15:52,593][26022] Updated weights on worker 0-0, policy_version 1031007 (0.00082) [2022-07-11 04:15:53,778][25689] Fps is (10 sec: 5380.0, 60 sec: 5509.3, 300 sec: 5530.5). Total num frames: 1055757312. Throughput: 0: 5756.5. Samples: 1055765200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:53,779][25689] Avg episode reward: [(0, '-3.079')] [2022-07-11 04:15:54,305][26022] Updated weights on worker 0-0, policy_version 1031017 (0.00086) [2022-07-11 04:15:56,340][26022] Updated weights on worker 0-0, policy_version 1031027 (0.00087) [2022-07-11 04:15:58,155][26022] Updated weights on worker 0-0, policy_version 1031037 (0.00101) [2022-07-11 04:15:58,822][25689] Fps is (10 sec: 5601.5, 60 sec: 5513.0, 300 sec: 5531.6). Total num frames: 1055784960. Throughput: 0: 4922.5. Samples: 1055781784. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:15:58,823][25689] Avg episode reward: [(0, '-3.316')] [2022-07-11 04:16:00,073][26022] Updated weights on worker 0-0, policy_version 1031047 (0.00084) [2022-07-11 04:16:02,263][26022] Updated weights on worker 0-0, policy_version 1031057 (0.00083) [2022-07-11 04:16:03,896][25689] Fps is (10 sec: 5262.5, 60 sec: 5478.1, 300 sec: 5531.3). Total num frames: 1055810560. Throughput: 0: 5649.6. Samples: 1055812646. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:16:03,897][25689] Avg episode reward: [(0, '-1.737')] [2022-07-11 04:16:04,062][26022] Updated weights on worker 0-0, policy_version 1031067 (0.00087) [2022-07-11 04:16:05,965][26022] Updated weights on worker 0-0, policy_version 1031077 (0.00086) [2022-07-11 04:16:07,754][26022] Updated weights on worker 0-0, policy_version 1031087 (0.00066) [2022-07-11 04:16:08,915][25689] Fps is (10 sec: 5377.5, 60 sec: 5528.1, 300 sec: 5531.1). Total num frames: 1055839232. Throughput: 0: 5658.9. Samples: 1055846012. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:16:08,915][25689] Avg episode reward: [(0, '-0.847')] [2022-07-11 04:16:09,749][26022] Updated weights on worker 0-0, policy_version 1031097 (0.00088) [2022-07-11 04:16:11,434][26022] Updated weights on worker 0-0, policy_version 1031107 (0.00081) [2022-07-11 04:16:13,343][26022] Updated weights on worker 0-0, policy_version 1031117 (0.00089) [2022-07-11 04:16:13,917][25689] Fps is (10 sec: 5620.3, 60 sec: 5497.0, 300 sec: 5532.2). Total num frames: 1055866880. Throughput: 0: 4843.6. Samples: 1055862746. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:16:13,917][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 04:16:15,278][26022] Updated weights on worker 0-0, policy_version 1031127 (0.00085) [2022-07-11 04:16:16,972][26022] Updated weights on worker 0-0, policy_version 1031137 (0.00086) [2022-07-11 04:16:18,964][25689] Fps is (10 sec: 5400.6, 60 sec: 5493.8, 300 sec: 5525.3). Total num frames: 1055893504. Throughput: 0: 5684.1. Samples: 1055896272. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:16:18,966][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 04:16:18,976][26022] Updated weights on worker 0-0, policy_version 1031147 (0.00091) [2022-07-11 04:16:20,603][26022] Updated weights on worker 0-0, policy_version 1031157 (0.00101) [2022-07-11 04:16:22,665][26022] Updated weights on worker 0-0, policy_version 1031167 (0.00087) [2022-07-11 04:16:24,090][25689] Fps is (10 sec: 5536.1, 60 sec: 5488.1, 300 sec: 5530.0). Total num frames: 1055923200. Throughput: 0: 5801.6. Samples: 1055929804. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 04:16:24,093][25689] Avg episode reward: [(0, '1.720')] [2022-07-11 04:16:24,301][26022] Updated weights on worker 0-0, policy_version 1031177 (0.00085) [2022-07-11 04:16:26,271][26022] Updated weights on worker 0-0, policy_version 1031187 (0.00085) [2022-07-11 04:16:28,115][26022] Updated weights on worker 0-0, policy_version 1031197 (0.00096) [2022-07-11 04:16:29,101][25689] Fps is (10 sec: 5657.2, 60 sec: 5489.6, 300 sec: 5530.4). Total num frames: 1055950848. Throughput: 0: 4982.4. Samples: 1055946586. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:29,101][25689] Avg episode reward: [(0, '1.489')] [2022-07-11 04:16:29,875][26022] Updated weights on worker 0-0, policy_version 1031207 (0.00084) [2022-07-11 04:16:31,615][26022] Updated weights on worker 0-0, policy_version 1031217 (0.00087) [2022-07-11 04:16:33,375][26022] Updated weights on worker 0-0, policy_version 1031227 (0.00085) [2022-07-11 04:16:34,108][25689] Fps is (10 sec: 5417.4, 60 sec: 5493.9, 300 sec: 5520.4). Total num frames: 1055977472. Throughput: 0: 5814.6. Samples: 1055980152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:34,109][25689] Avg episode reward: [(0, '-0.146')] [2022-07-11 04:16:35,242][26022] Updated weights on worker 0-0, policy_version 1031237 (0.00091) [2022-07-11 04:16:37,340][26022] Updated weights on worker 0-0, policy_version 1031247 (0.00086) [2022-07-11 04:16:38,889][26022] Updated weights on worker 0-0, policy_version 1031257 (0.00089) [2022-07-11 04:16:39,119][25689] Fps is (10 sec: 5723.7, 60 sec: 5531.3, 300 sec: 5536.1). Total num frames: 1056008192. Throughput: 0: 5834.0. Samples: 1056013858. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:39,120][25689] Avg episode reward: [(0, '-1.925')] [2022-07-11 04:16:40,864][26022] Updated weights on worker 0-0, policy_version 1031267 (0.00087) [2022-07-11 04:16:42,575][26022] Updated weights on worker 0-0, policy_version 1031277 (0.00091) [2022-07-11 04:16:44,254][25689] Fps is (10 sec: 5853.7, 60 sec: 5511.9, 300 sec: 5533.7). Total num frames: 1056036864. Throughput: 0: 5005.2. Samples: 1056030730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:44,255][25689] Avg episode reward: [(0, '-2.406')] [2022-07-11 04:16:44,311][26022] Updated weights on worker 0-0, policy_version 1031287 (0.00088) [2022-07-11 04:16:46,345][26022] Updated weights on worker 0-0, policy_version 1031297 (0.00083) [2022-07-11 04:16:48,118][26022] Updated weights on worker 0-0, policy_version 1031307 (0.00086) [2022-07-11 04:16:49,324][25689] Fps is (10 sec: 5418.7, 60 sec: 5529.6, 300 sec: 5522.7). Total num frames: 1056063488. Throughput: 0: 5828.5. Samples: 1056064458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:49,324][25689] Avg episode reward: [(0, '-2.472')] [2022-07-11 04:16:49,931][26022] Updated weights on worker 0-0, policy_version 1031317 (0.00923) [2022-07-11 04:16:51,991][26022] Updated weights on worker 0-0, policy_version 1031327 (0.00089) [2022-07-11 04:16:53,429][26022] Updated weights on worker 0-0, policy_version 1031337 (0.00085) [2022-07-11 04:16:54,347][25689] Fps is (10 sec: 5580.2, 60 sec: 5545.2, 300 sec: 5532.8). Total num frames: 1056093184. Throughput: 0: 5820.4. Samples: 1056097952. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:54,348][25689] Avg episode reward: [(0, '-3.633')] [2022-07-11 04:16:55,611][26022] Updated weights on worker 0-0, policy_version 1031347 (0.00085) [2022-07-11 04:16:57,154][26022] Updated weights on worker 0-0, policy_version 1031357 (0.00086) [2022-07-11 04:16:59,222][26022] Updated weights on worker 0-0, policy_version 1031367 (0.00081) [2022-07-11 04:16:59,355][25689] Fps is (10 sec: 5614.5, 60 sec: 5531.7, 300 sec: 5530.6). Total num frames: 1056119808. Throughput: 0: 4986.8. Samples: 1056114770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:16:59,356][25689] Avg episode reward: [(0, '-3.617')] [2022-07-11 04:17:00,776][26022] Updated weights on worker 0-0, policy_version 1031377 (0.00085) [2022-07-11 04:17:03,227][26022] Updated weights on worker 0-0, policy_version 1031387 (0.00082) [2022-07-11 04:17:04,461][25689] Fps is (10 sec: 5366.3, 60 sec: 5562.6, 300 sec: 5532.6). Total num frames: 1056147456. Throughput: 0: 5711.4. Samples: 1056146136. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:04,463][25689] Avg episode reward: [(0, '-2.734')] [2022-07-11 04:17:04,950][26022] Updated weights on worker 0-0, policy_version 1031397 (0.00148) [2022-07-11 04:17:06,759][26022] Updated weights on worker 0-0, policy_version 1031407 (0.00087) [2022-07-11 04:17:08,705][26022] Updated weights on worker 0-0, policy_version 1031417 (0.00085) [2022-07-11 04:17:09,502][25689] Fps is (10 sec: 5449.5, 60 sec: 5543.6, 300 sec: 5532.8). Total num frames: 1056175104. Throughput: 0: 5729.8. Samples: 1056180074. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:09,502][25689] Avg episode reward: [(0, '-1.418')] [2022-07-11 04:17:10,248][26022] Updated weights on worker 0-0, policy_version 1031427 (0.00084) [2022-07-11 04:17:12,395][26022] Updated weights on worker 0-0, policy_version 1031437 (0.00082) [2022-07-11 04:17:13,976][26022] Updated weights on worker 0-0, policy_version 1031447 (0.00092) [2022-07-11 04:17:14,519][25689] Fps is (10 sec: 5599.6, 60 sec: 5559.2, 300 sec: 5532.7). Total num frames: 1056203776. Throughput: 0: 4906.2. Samples: 1056196916. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:14,519][25689] Avg episode reward: [(0, '-1.277')] [2022-07-11 04:17:15,923][26022] Updated weights on worker 0-0, policy_version 1031457 (0.00093) [2022-07-11 04:17:17,684][26022] Updated weights on worker 0-0, policy_version 1031467 (0.00087) [2022-07-11 04:17:19,535][25689] Fps is (10 sec: 5613.6, 60 sec: 5578.9, 300 sec: 5534.1). Total num frames: 1056231424. Throughput: 0: 5743.4. Samples: 1056230670. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:19,535][25689] Avg episode reward: [(0, '0.040')] [2022-07-11 04:17:19,603][26022] Updated weights on worker 0-0, policy_version 1031477 (0.00081) [2022-07-11 04:17:21,595][26022] Updated weights on worker 0-0, policy_version 1031487 (0.00076) [2022-07-11 04:17:23,248][26022] Updated weights on worker 0-0, policy_version 1031497 (0.00088) [2022-07-11 04:17:24,635][25689] Fps is (10 sec: 5466.2, 60 sec: 5547.5, 300 sec: 5533.5). Total num frames: 1056259072. Throughput: 0: 5836.1. Samples: 1056263874. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:24,635][25689] Avg episode reward: [(0, '-0.561')] [2022-07-11 04:17:25,135][26022] Updated weights on worker 0-0, policy_version 1031507 (0.00097) [2022-07-11 04:17:26,963][26022] Updated weights on worker 0-0, policy_version 1031517 (0.00093) [2022-07-11 04:17:28,824][26022] Updated weights on worker 0-0, policy_version 1031527 (0.00091) [2022-07-11 04:17:29,655][25689] Fps is (10 sec: 5565.1, 60 sec: 5563.5, 300 sec: 5533.6). Total num frames: 1056287744. Throughput: 0: 4989.0. Samples: 1056280618. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:29,656][25689] Avg episode reward: [(0, '-0.406')] [2022-07-11 04:17:30,698][26022] Updated weights on worker 0-0, policy_version 1031537 (0.00092) [2022-07-11 04:17:32,542][26022] Updated weights on worker 0-0, policy_version 1031547 (0.00085) [2022-07-11 04:17:33,309][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:17:33,318][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001031552_1056309248.pth [2022-07-11 04:17:33,326][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001029603_1054313472.pth [2022-07-11 04:17:34,223][26022] Updated weights on worker 0-0, policy_version 1031557 (0.00086) [2022-07-11 04:17:34,681][25689] Fps is (10 sec: 5707.8, 60 sec: 5595.6, 300 sec: 5533.8). Total num frames: 1056316416. Throughput: 0: 5815.6. Samples: 1056314174. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:34,682][25689] Avg episode reward: [(0, '-0.552')] [2022-07-11 04:17:36,387][26022] Updated weights on worker 0-0, policy_version 1031567 (0.00087) [2022-07-11 04:17:37,884][26022] Updated weights on worker 0-0, policy_version 1031577 (0.00083) [2022-07-11 04:17:39,699][25689] Fps is (10 sec: 5505.3, 60 sec: 5527.4, 300 sec: 5535.1). Total num frames: 1056343040. Throughput: 0: 5807.0. Samples: 1056347764. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:39,701][25689] Avg episode reward: [(0, '0.154')] [2022-07-11 04:17:39,962][26022] Updated weights on worker 0-0, policy_version 1031587 (0.00087) [2022-07-11 04:17:41,727][26022] Updated weights on worker 0-0, policy_version 1031597 (0.00093) [2022-07-11 04:17:43,430][26022] Updated weights on worker 0-0, policy_version 1031607 (0.00087) [2022-07-11 04:17:44,763][25689] Fps is (10 sec: 5586.6, 60 sec: 5550.9, 300 sec: 5534.2). Total num frames: 1056372736. Throughput: 0: 5842.1. Samples: 1056381464. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:44,763][25689] Avg episode reward: [(0, '0.194')] [2022-07-11 04:17:45,311][26022] Updated weights on worker 0-0, policy_version 1031617 (0.00086) [2022-07-11 04:17:47,083][26022] Updated weights on worker 0-0, policy_version 1031627 (0.00087) [2022-07-11 04:17:49,138][26022] Updated weights on worker 0-0, policy_version 1031637 (0.00092) [2022-07-11 04:17:49,769][25689] Fps is (10 sec: 5593.1, 60 sec: 5556.7, 300 sec: 5527.3). Total num frames: 1056399360. Throughput: 0: 5836.6. Samples: 1056398014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:49,769][25689] Avg episode reward: [(0, '-1.220')] [2022-07-11 04:17:51,006][26022] Updated weights on worker 0-0, policy_version 1031647 (0.00085) [2022-07-11 04:17:52,627][26022] Updated weights on worker 0-0, policy_version 1031657 (0.00084) [2022-07-11 04:17:54,594][26022] Updated weights on worker 0-0, policy_version 1031667 (0.00850) [2022-07-11 04:17:54,815][25689] Fps is (10 sec: 5501.0, 60 sec: 5537.7, 300 sec: 5540.4). Total num frames: 1056428032. Throughput: 0: 5815.3. Samples: 1056431256. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:54,815][25689] Avg episode reward: [(0, '-0.735')] [2022-07-11 04:17:56,431][26022] Updated weights on worker 0-0, policy_version 1031677 (0.00082) [2022-07-11 04:17:58,327][26022] Updated weights on worker 0-0, policy_version 1031687 (0.00081) [2022-07-11 04:17:59,835][25689] Fps is (10 sec: 5696.8, 60 sec: 5570.4, 300 sec: 5538.5). Total num frames: 1056456704. Throughput: 0: 5825.5. Samples: 1056465064. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:17:59,835][25689] Avg episode reward: [(0, '-0.852')] [2022-07-11 04:18:00,160][26022] Updated weights on worker 0-0, policy_version 1031697 (0.00134) [2022-07-11 04:18:02,024][26022] Updated weights on worker 0-0, policy_version 1031707 (0.00094) [2022-07-11 04:18:04,017][26022] Updated weights on worker 0-0, policy_version 1031717 (0.00092) [2022-07-11 04:18:04,953][25689] Fps is (10 sec: 5353.2, 60 sec: 5535.4, 300 sec: 5533.0). Total num frames: 1056482304. Throughput: 0: 4863.4. Samples: 1056479658. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:04,953][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 04:18:06,015][26022] Updated weights on worker 0-0, policy_version 1031727 (0.00095) [2022-07-11 04:18:07,630][26022] Updated weights on worker 0-0, policy_version 1031737 (0.00094) [2022-07-11 04:18:09,710][26022] Updated weights on worker 0-0, policy_version 1031747 (0.00097) [2022-07-11 04:18:09,984][25689] Fps is (10 sec: 5246.6, 60 sec: 5536.3, 300 sec: 5529.5). Total num frames: 1056509952. Throughput: 0: 5690.8. Samples: 1056513054. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:09,984][25689] Avg episode reward: [(0, '-0.135')] [2022-07-11 04:18:11,590][26022] Updated weights on worker 0-0, policy_version 1031757 (0.00083) [2022-07-11 04:18:13,439][26022] Updated weights on worker 0-0, policy_version 1031767 (0.00087) [2022-07-11 04:18:15,033][25689] Fps is (10 sec: 5485.6, 60 sec: 5516.4, 300 sec: 5525.5). Total num frames: 1056537600. Throughput: 0: 5705.3. Samples: 1056546610. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:15,034][25689] Avg episode reward: [(0, '0.422')] [2022-07-11 04:18:15,275][26022] Updated weights on worker 0-0, policy_version 1031777 (0.00085) [2022-07-11 04:18:17,005][26022] Updated weights on worker 0-0, policy_version 1031787 (0.00087) [2022-07-11 04:18:18,875][26022] Updated weights on worker 0-0, policy_version 1031797 (0.00086) [2022-07-11 04:18:20,110][25689] Fps is (10 sec: 5561.6, 60 sec: 5527.8, 300 sec: 5531.9). Total num frames: 1056566272. Throughput: 0: 4849.9. Samples: 1056563398. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:20,111][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 04:18:20,662][26022] Updated weights on worker 0-0, policy_version 1031807 (0.01189) [2022-07-11 04:18:22,525][26022] Updated weights on worker 0-0, policy_version 1031817 (0.00132) [2022-07-11 04:18:24,486][26022] Updated weights on worker 0-0, policy_version 1031827 (0.00091) [2022-07-11 04:18:25,262][25689] Fps is (10 sec: 5605.9, 60 sec: 5539.9, 300 sec: 5527.1). Total num frames: 1056594944. Throughput: 0: 5765.4. Samples: 1056596750. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:25,263][25689] Avg episode reward: [(0, '0.846')] [2022-07-11 04:18:26,214][26022] Updated weights on worker 0-0, policy_version 1031837 (0.00088) [2022-07-11 04:18:28,223][26022] Updated weights on worker 0-0, policy_version 1031847 (0.00089) [2022-07-11 04:18:29,937][26022] Updated weights on worker 0-0, policy_version 1031857 (0.00092) [2022-07-11 04:18:30,284][25689] Fps is (10 sec: 5636.4, 60 sec: 5539.8, 300 sec: 5534.3). Total num frames: 1056623616. Throughput: 0: 5766.3. Samples: 1056630112. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:30,285][25689] Avg episode reward: [(0, '1.047')] [2022-07-11 04:18:31,771][26022] Updated weights on worker 0-0, policy_version 1031867 (0.00097) [2022-07-11 04:18:33,597][26022] Updated weights on worker 0-0, policy_version 1031877 (0.00085) [2022-07-11 04:18:35,299][25689] Fps is (10 sec: 5611.6, 60 sec: 5524.0, 300 sec: 5527.2). Total num frames: 1056651264. Throughput: 0: 4949.5. Samples: 1056646914. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:35,299][25689] Avg episode reward: [(0, '0.073')] [2022-07-11 04:18:35,391][26022] Updated weights on worker 0-0, policy_version 1031887 (0.00087) [2022-07-11 04:18:37,263][26022] Updated weights on worker 0-0, policy_version 1031897 (0.00090) [2022-07-11 04:18:38,912][26022] Updated weights on worker 0-0, policy_version 1031907 (0.00090) [2022-07-11 04:18:40,326][25689] Fps is (10 sec: 5506.5, 60 sec: 5540.0, 300 sec: 5527.7). Total num frames: 1056678912. Throughput: 0: 5808.9. Samples: 1056680830. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:40,327][25689] Avg episode reward: [(0, '0.306')] [2022-07-11 04:18:40,892][26022] Updated weights on worker 0-0, policy_version 1031917 (0.00091) [2022-07-11 04:18:42,532][26022] Updated weights on worker 0-0, policy_version 1031927 (0.00087) [2022-07-11 04:18:44,465][26022] Updated weights on worker 0-0, policy_version 1031937 (0.00092) [2022-07-11 04:18:45,442][25689] Fps is (10 sec: 5552.5, 60 sec: 5518.3, 300 sec: 5530.3). Total num frames: 1056707584. Throughput: 0: 5837.9. Samples: 1056714556. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:45,442][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 04:18:46,193][26022] Updated weights on worker 0-0, policy_version 1031947 (0.00086) [2022-07-11 04:18:48,085][26022] Updated weights on worker 0-0, policy_version 1031957 (0.00086) [2022-07-11 04:18:49,960][26022] Updated weights on worker 0-0, policy_version 1031967 (0.00084) [2022-07-11 04:18:50,491][25689] Fps is (10 sec: 5742.2, 60 sec: 5565.0, 300 sec: 5536.5). Total num frames: 1056737280. Throughput: 0: 5006.6. Samples: 1056731280. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:50,491][25689] Avg episode reward: [(0, '0.197')] [2022-07-11 04:18:51,971][26022] Updated weights on worker 0-0, policy_version 1031977 (0.00085) [2022-07-11 04:18:53,554][26022] Updated weights on worker 0-0, policy_version 1031987 (0.00084) [2022-07-11 04:18:55,503][25689] Fps is (10 sec: 5597.6, 60 sec: 5534.3, 300 sec: 5533.8). Total num frames: 1056763904. Throughput: 0: 5817.7. Samples: 1056764458. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:18:55,504][25689] Avg episode reward: [(0, '-2.003')] [2022-07-11 04:18:55,847][26022] Updated weights on worker 0-0, policy_version 1031997 (0.00088) [2022-07-11 04:18:57,310][26022] Updated weights on worker 0-0, policy_version 1032007 (0.00097) [2022-07-11 04:18:59,219][26022] Updated weights on worker 0-0, policy_version 1032017 (0.00086) [2022-07-11 04:19:00,506][25689] Fps is (10 sec: 5419.1, 60 sec: 5519.1, 300 sec: 5542.2). Total num frames: 1056791552. Throughput: 0: 5809.9. Samples: 1056798072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:00,506][25689] Avg episode reward: [(0, '-1.418')] [2022-07-11 04:19:01,087][26022] Updated weights on worker 0-0, policy_version 1032027 (0.00079) [2022-07-11 04:19:03,234][26022] Updated weights on worker 0-0, policy_version 1032037 (0.00084) [2022-07-11 04:19:05,267][26022] Updated weights on worker 0-0, policy_version 1032047 (0.00097) [2022-07-11 04:19:05,569][25689] Fps is (10 sec: 5290.3, 60 sec: 5524.1, 300 sec: 5531.5). Total num frames: 1056817152. Throughput: 0: 4871.4. Samples: 1056812602. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:05,569][25689] Avg episode reward: [(0, '-1.537')] [2022-07-11 04:19:06,899][26022] Updated weights on worker 0-0, policy_version 1032057 (0.00075) [2022-07-11 04:19:08,796][26022] Updated weights on worker 0-0, policy_version 1032067 (0.00108) [2022-07-11 04:19:10,668][25689] Fps is (10 sec: 5340.8, 60 sec: 5534.8, 300 sec: 5533.2). Total num frames: 1056845824. Throughput: 0: 5688.3. Samples: 1056846052. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:10,668][25689] Avg episode reward: [(0, '-2.027')] [2022-07-11 04:19:10,726][26022] Updated weights on worker 0-0, policy_version 1032077 (0.00612) [2022-07-11 04:19:12,363][26022] Updated weights on worker 0-0, policy_version 1032087 (0.00054) [2022-07-11 04:19:14,463][26022] Updated weights on worker 0-0, policy_version 1032097 (0.00098) [2022-07-11 04:19:15,731][25689] Fps is (10 sec: 5743.4, 60 sec: 5567.2, 300 sec: 5543.0). Total num frames: 1056875520. Throughput: 0: 5704.9. Samples: 1056879856. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:15,732][25689] Avg episode reward: [(0, '-1.942')] [2022-07-11 04:19:16,043][26022] Updated weights on worker 0-0, policy_version 1032107 (0.00084) [2022-07-11 04:19:17,954][26022] Updated weights on worker 0-0, policy_version 1032117 (0.00082) [2022-07-11 04:19:19,787][26022] Updated weights on worker 0-0, policy_version 1032127 (0.00096) [2022-07-11 04:19:20,752][25689] Fps is (10 sec: 5483.1, 60 sec: 5521.7, 300 sec: 5522.8). Total num frames: 1056901120. Throughput: 0: 4863.2. Samples: 1056896536. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:20,753][25689] Avg episode reward: [(0, '-2.586')] [2022-07-11 04:19:21,510][26022] Updated weights on worker 0-0, policy_version 1032137 (0.00090) [2022-07-11 04:19:23,728][26022] Updated weights on worker 0-0, policy_version 1032147 (0.00088) [2022-07-11 04:19:25,306][26022] Updated weights on worker 0-0, policy_version 1032157 (0.00089) [2022-07-11 04:19:25,822][25689] Fps is (10 sec: 5581.5, 60 sec: 5563.1, 300 sec: 5535.4). Total num frames: 1056931840. Throughput: 0: 5807.2. Samples: 1056930216. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:25,822][25689] Avg episode reward: [(0, '-2.561')] [2022-07-11 04:19:27,266][26022] Updated weights on worker 0-0, policy_version 1032167 (0.00089) [2022-07-11 04:19:28,980][26022] Updated weights on worker 0-0, policy_version 1032177 (0.00089) [2022-07-11 04:19:30,823][25689] Fps is (10 sec: 5694.0, 60 sec: 5531.1, 300 sec: 5535.9). Total num frames: 1056958464. Throughput: 0: 5834.5. Samples: 1056963650. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:30,824][25689] Avg episode reward: [(0, '-2.227')] [2022-07-11 04:19:30,845][26022] Updated weights on worker 0-0, policy_version 1032187 (0.00081) [2022-07-11 04:19:32,687][26022] Updated weights on worker 0-0, policy_version 1032197 (0.00092) [2022-07-11 04:19:33,430][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:19:33,443][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001032202_1056974848.pth [2022-07-11 04:19:33,444][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001030252_1054978048.pth [2022-07-11 04:19:34,642][26022] Updated weights on worker 0-0, policy_version 1032207 (0.00087) [2022-07-11 04:19:35,891][25689] Fps is (10 sec: 5491.9, 60 sec: 5543.2, 300 sec: 5528.1). Total num frames: 1056987136. Throughput: 0: 4991.9. Samples: 1056980488. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:35,892][25689] Avg episode reward: [(0, '-2.732')] [2022-07-11 04:19:36,350][26022] Updated weights on worker 0-0, policy_version 1032217 (0.00094) [2022-07-11 04:19:38,256][26022] Updated weights on worker 0-0, policy_version 1032227 (0.00092) [2022-07-11 04:19:40,122][26022] Updated weights on worker 0-0, policy_version 1032237 (0.00086) [2022-07-11 04:19:40,906][25689] Fps is (10 sec: 5687.5, 60 sec: 5561.2, 300 sec: 5539.4). Total num frames: 1057015808. Throughput: 0: 5806.0. Samples: 1057013546. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:40,906][25689] Avg episode reward: [(0, '-2.232')] [2022-07-11 04:19:42,084][26022] Updated weights on worker 0-0, policy_version 1032247 (0.00087) [2022-07-11 04:19:43,597][26022] Updated weights on worker 0-0, policy_version 1032257 (0.00092) [2022-07-11 04:19:45,682][26022] Updated weights on worker 0-0, policy_version 1032267 (0.00387) [2022-07-11 04:19:45,968][25689] Fps is (10 sec: 5486.9, 60 sec: 5532.3, 300 sec: 5532.8). Total num frames: 1057042432. Throughput: 0: 5806.1. Samples: 1057047188. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:45,969][25689] Avg episode reward: [(0, '-2.991')] [2022-07-11 04:19:47,263][26022] Updated weights on worker 0-0, policy_version 1032277 (0.00090) [2022-07-11 04:19:49,287][26022] Updated weights on worker 0-0, policy_version 1032287 (0.00091) [2022-07-11 04:19:50,997][25689] Fps is (10 sec: 5479.4, 60 sec: 5517.2, 300 sec: 5532.4). Total num frames: 1057071104. Throughput: 0: 5804.4. Samples: 1057080748. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:50,998][25689] Avg episode reward: [(0, '-3.405')] [2022-07-11 04:19:51,231][26022] Updated weights on worker 0-0, policy_version 1032297 (0.00084) [2022-07-11 04:19:52,892][26022] Updated weights on worker 0-0, policy_version 1032307 (0.00089) [2022-07-11 04:19:54,752][26022] Updated weights on worker 0-0, policy_version 1032317 (0.00101) [2022-07-11 04:19:56,009][25689] Fps is (10 sec: 5609.2, 60 sec: 5534.2, 300 sec: 5533.8). Total num frames: 1057098752. Throughput: 0: 5822.6. Samples: 1057097630. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:19:56,010][25689] Avg episode reward: [(0, '-2.792')] [2022-07-11 04:19:56,639][26022] Updated weights on worker 0-0, policy_version 1032327 (0.00092) [2022-07-11 04:19:58,367][26022] Updated weights on worker 0-0, policy_version 1032337 (0.00085) [2022-07-11 04:20:00,325][26022] Updated weights on worker 0-0, policy_version 1032347 (0.00089) [2022-07-11 04:20:01,015][25689] Fps is (10 sec: 5519.7, 60 sec: 5533.8, 300 sec: 5534.8). Total num frames: 1057126400. Throughput: 0: 5840.3. Samples: 1057130992. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:20:01,016][25689] Avg episode reward: [(0, '-2.855')] [2022-07-11 04:20:02,323][26022] Updated weights on worker 0-0, policy_version 1032357 (0.00083) [2022-07-11 04:20:04,437][26022] Updated weights on worker 0-0, policy_version 1032367 (0.00092) [2022-07-11 04:20:06,058][25689] Fps is (10 sec: 5502.6, 60 sec: 5569.5, 300 sec: 5541.1). Total num frames: 1057154048. Throughput: 0: 5727.3. Samples: 1057162248. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:20:06,059][25689] Avg episode reward: [(0, '-2.946')] [2022-07-11 04:20:06,068][26022] Updated weights on worker 0-0, policy_version 1032377 (0.00087) [2022-07-11 04:20:08,187][26022] Updated weights on worker 0-0, policy_version 1032387 (0.00091) [2022-07-11 04:20:09,731][26022] Updated weights on worker 0-0, policy_version 1032397 (0.00096) [2022-07-11 04:20:11,069][25689] Fps is (10 sec: 5296.2, 60 sec: 5526.7, 300 sec: 5527.7). Total num frames: 1057179648. Throughput: 0: 4890.1. Samples: 1057178902. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:20:11,070][25689] Avg episode reward: [(0, '-1.222')] [2022-07-11 04:20:11,780][26022] Updated weights on worker 0-0, policy_version 1032407 (0.00089) [2022-07-11 04:20:13,552][26022] Updated weights on worker 0-0, policy_version 1032417 (0.00087) [2022-07-11 04:20:15,356][26022] Updated weights on worker 0-0, policy_version 1032427 (0.00090) [2022-07-11 04:20:16,110][25689] Fps is (10 sec: 5399.1, 60 sec: 5511.9, 300 sec: 5534.0). Total num frames: 1057208320. Throughput: 0: 5707.7. Samples: 1057212360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:20:16,111][25689] Avg episode reward: [(0, '0.369')] [2022-07-11 04:20:17,199][26022] Updated weights on worker 0-0, policy_version 1032437 (0.00084) [2022-07-11 04:20:19,064][26022] Updated weights on worker 0-0, policy_version 1032447 (0.00093) [2022-07-11 04:20:21,044][26022] Updated weights on worker 0-0, policy_version 1032457 (0.00102) [2022-07-11 04:20:21,112][25689] Fps is (10 sec: 5608.0, 60 sec: 5547.5, 300 sec: 5528.3). Total num frames: 1057235968. Throughput: 0: 5722.5. Samples: 1057245996. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:20:21,114][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 04:20:22,606][26022] Updated weights on worker 0-0, policy_version 1032467 (0.00078) [2022-07-11 04:20:24,519][26022] Updated weights on worker 0-0, policy_version 1032477 (0.00089) [2022-07-11 04:20:26,209][25689] Fps is (10 sec: 5577.0, 60 sec: 5511.1, 300 sec: 5530.4). Total num frames: 1057264640. Throughput: 0: 4985.9. Samples: 1057262718. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 04:20:26,209][25689] Avg episode reward: [(0, '0.888')] [2022-07-11 04:20:26,490][26022] Updated weights on worker 0-0, policy_version 1032487 (0.00093) [2022-07-11 04:20:28,081][26022] Updated weights on worker 0-0, policy_version 1032497 (0.00072) [2022-07-11 04:20:30,213][26022] Updated weights on worker 0-0, policy_version 1032507 (0.00091) [2022-07-11 04:20:31,299][25689] Fps is (10 sec: 5729.8, 60 sec: 5553.8, 300 sec: 5540.1). Total num frames: 1057294336. Throughput: 0: 5783.6. Samples: 1057295902. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:20:31,299][25689] Avg episode reward: [(0, '0.785')] [2022-07-11 04:20:31,888][26022] Updated weights on worker 0-0, policy_version 1032517 (0.00088) [2022-07-11 04:20:33,741][26022] Updated weights on worker 0-0, policy_version 1032527 (0.00093) [2022-07-11 04:20:35,876][26022] Updated weights on worker 0-0, policy_version 1032537 (0.00083) [2022-07-11 04:20:36,379][25689] Fps is (10 sec: 5537.5, 60 sec: 5518.8, 300 sec: 5532.7). Total num frames: 1057320960. Throughput: 0: 5796.2. Samples: 1057329846. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:20:36,380][25689] Avg episode reward: [(0, '1.017')] [2022-07-11 04:20:37,110][26022] Updated weights on worker 0-0, policy_version 1032547 (0.00090) [2022-07-11 04:20:39,522][26022] Updated weights on worker 0-0, policy_version 1032557 (0.00087) [2022-07-11 04:20:40,932][26022] Updated weights on worker 0-0, policy_version 1032567 (0.00093) [2022-07-11 04:20:41,381][25689] Fps is (10 sec: 5484.4, 60 sec: 5520.0, 300 sec: 5531.2). Total num frames: 1057349632. Throughput: 0: 4962.2. Samples: 1057346576. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:20:41,382][25689] Avg episode reward: [(0, '-0.171')] [2022-07-11 04:20:43,085][26022] Updated weights on worker 0-0, policy_version 1032577 (0.00087) [2022-07-11 04:20:44,766][26022] Updated weights on worker 0-0, policy_version 1032587 (0.00091) [2022-07-11 04:20:46,516][25689] Fps is (10 sec: 5555.9, 60 sec: 5530.3, 300 sec: 5537.0). Total num frames: 1057377280. Throughput: 0: 5770.8. Samples: 1057379910. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:20:46,517][25689] Avg episode reward: [(0, '-0.402')] [2022-07-11 04:20:46,697][26022] Updated weights on worker 0-0, policy_version 1032597 (0.00082) [2022-07-11 04:20:48,463][26022] Updated weights on worker 0-0, policy_version 1032607 (0.00099) [2022-07-11 04:20:50,403][26022] Updated weights on worker 0-0, policy_version 1032617 (0.00086) [2022-07-11 04:20:51,567][25689] Fps is (10 sec: 5529.1, 60 sec: 5528.3, 300 sec: 5536.3). Total num frames: 1057405952. Throughput: 0: 5794.5. Samples: 1057413348. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:20:51,568][25689] Avg episode reward: [(0, '-0.972')] [2022-07-11 04:20:52,051][26022] Updated weights on worker 0-0, policy_version 1032627 (0.00085) [2022-07-11 04:20:53,990][26022] Updated weights on worker 0-0, policy_version 1032637 (0.00092) [2022-07-11 04:20:55,784][26022] Updated weights on worker 0-0, policy_version 1032647 (0.00088) [2022-07-11 04:20:56,638][25689] Fps is (10 sec: 5766.5, 60 sec: 5556.7, 300 sec: 5542.6). Total num frames: 1057435648. Throughput: 0: 4945.1. Samples: 1057430028. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:20:56,639][25689] Avg episode reward: [(0, '-1.099')] [2022-07-11 04:20:57,934][26022] Updated weights on worker 0-0, policy_version 1032657 (0.00084) [2022-07-11 04:20:59,506][26022] Updated weights on worker 0-0, policy_version 1032667 (0.00088) [2022-07-11 04:21:01,666][26022] Updated weights on worker 0-0, policy_version 1032677 (0.00106) [2022-07-11 04:21:01,724][25689] Fps is (10 sec: 5444.4, 60 sec: 5515.7, 300 sec: 5542.4). Total num frames: 1057461248. Throughput: 0: 5728.4. Samples: 1057463106. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:01,724][25689] Avg episode reward: [(0, '-1.112')] [2022-07-11 04:21:03,444][26022] Updated weights on worker 0-0, policy_version 1032687 (0.00085) [2022-07-11 04:21:05,674][26022] Updated weights on worker 0-0, policy_version 1032697 (0.00091) [2022-07-11 04:21:06,819][25689] Fps is (10 sec: 5230.3, 60 sec: 5510.9, 300 sec: 5537.6). Total num frames: 1057488896. Throughput: 0: 5629.4. Samples: 1057494202. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:06,819][25689] Avg episode reward: [(0, '-1.366')] [2022-07-11 04:21:07,343][26022] Updated weights on worker 0-0, policy_version 1032707 (0.00097) [2022-07-11 04:21:09,406][26022] Updated weights on worker 0-0, policy_version 1032717 (0.00083) [2022-07-11 04:21:10,948][26022] Updated weights on worker 0-0, policy_version 1032727 (0.00089) [2022-07-11 04:21:11,910][25689] Fps is (10 sec: 5428.6, 60 sec: 5537.4, 300 sec: 5535.9). Total num frames: 1057516544. Throughput: 0: 4778.6. Samples: 1057510558. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:11,910][25689] Avg episode reward: [(0, '-2.067')] [2022-07-11 04:21:13,187][26022] Updated weights on worker 0-0, policy_version 1032737 (0.00088) [2022-07-11 04:21:14,587][26022] Updated weights on worker 0-0, policy_version 1032747 (0.00088) [2022-07-11 04:21:16,742][26022] Updated weights on worker 0-0, policy_version 1032757 (0.00093) [2022-07-11 04:21:16,922][25689] Fps is (10 sec: 5473.0, 60 sec: 5523.1, 300 sec: 5540.0). Total num frames: 1057544192. Throughput: 0: 5615.6. Samples: 1057543936. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:16,923][25689] Avg episode reward: [(0, '-1.082')] [2022-07-11 04:21:18,335][26022] Updated weights on worker 0-0, policy_version 1032767 (0.00083) [2022-07-11 04:21:20,331][26022] Updated weights on worker 0-0, policy_version 1032777 (0.00084) [2022-07-11 04:21:22,001][25689] Fps is (10 sec: 5479.5, 60 sec: 5516.1, 300 sec: 5534.0). Total num frames: 1057571840. Throughput: 0: 5643.4. Samples: 1057577540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:22,002][25689] Avg episode reward: [(0, '-0.486')] [2022-07-11 04:21:22,238][26022] Updated weights on worker 0-0, policy_version 1032787 (0.00084) [2022-07-11 04:21:23,951][26022] Updated weights on worker 0-0, policy_version 1032797 (0.00096) [2022-07-11 04:21:26,003][26022] Updated weights on worker 0-0, policy_version 1032807 (0.00092) [2022-07-11 04:21:27,103][25689] Fps is (10 sec: 5632.9, 60 sec: 5532.5, 300 sec: 5539.2). Total num frames: 1057601536. Throughput: 0: 4922.1. Samples: 1057594040. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:27,103][25689] Avg episode reward: [(0, '-1.482')] [2022-07-11 04:21:27,811][26022] Updated weights on worker 0-0, policy_version 1032817 (0.00091) [2022-07-11 04:21:29,469][26022] Updated weights on worker 0-0, policy_version 1032827 (0.00086) [2022-07-11 04:21:31,508][26022] Updated weights on worker 0-0, policy_version 1032837 (0.00090) [2022-07-11 04:21:32,127][25689] Fps is (10 sec: 5562.3, 60 sec: 5488.0, 300 sec: 5538.9). Total num frames: 1057628160. Throughput: 0: 5771.5. Samples: 1057627240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:32,127][25689] Avg episode reward: [(0, '-1.495')] [2022-07-11 04:21:33,335][26022] Updated weights on worker 0-0, policy_version 1032847 (0.00084) [2022-07-11 04:21:33,465][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:21:33,478][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001032848_1057636352.pth [2022-07-11 04:21:33,478][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001030902_1055643648.pth [2022-07-11 04:21:35,067][26022] Updated weights on worker 0-0, policy_version 1032857 (0.00087) [2022-07-11 04:21:37,090][26022] Updated weights on worker 0-0, policy_version 1032867 (0.00094) [2022-07-11 04:21:37,164][25689] Fps is (10 sec: 5394.2, 60 sec: 5508.8, 300 sec: 5528.1). Total num frames: 1057655808. Throughput: 0: 5778.1. Samples: 1057660894. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:37,164][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 04:21:38,725][26022] Updated weights on worker 0-0, policy_version 1032877 (0.00085) [2022-07-11 04:21:40,569][26022] Updated weights on worker 0-0, policy_version 1032887 (0.00092) [2022-07-11 04:21:42,168][25689] Fps is (10 sec: 5507.0, 60 sec: 5491.7, 300 sec: 5527.1). Total num frames: 1057683456. Throughput: 0: 4957.2. Samples: 1057677510. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:42,168][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 04:21:42,663][26022] Updated weights on worker 0-0, policy_version 1032897 (0.00087) [2022-07-11 04:21:44,143][26022] Updated weights on worker 0-0, policy_version 1032907 (0.00086) [2022-07-11 04:21:46,188][26022] Updated weights on worker 0-0, policy_version 1032917 (0.00092) [2022-07-11 04:21:47,270][25689] Fps is (10 sec: 5775.5, 60 sec: 5545.3, 300 sec: 5540.2). Total num frames: 1057714176. Throughput: 0: 5804.7. Samples: 1057711106. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:47,271][25689] Avg episode reward: [(0, '-1.179')] [2022-07-11 04:21:47,775][26022] Updated weights on worker 0-0, policy_version 1032927 (0.00085) [2022-07-11 04:21:49,785][26022] Updated weights on worker 0-0, policy_version 1032937 (0.00084) [2022-07-11 04:21:51,671][26022] Updated weights on worker 0-0, policy_version 1032947 (0.00088) [2022-07-11 04:21:52,272][25689] Fps is (10 sec: 5776.7, 60 sec: 5532.9, 300 sec: 5533.7). Total num frames: 1057741824. Throughput: 0: 5848.1. Samples: 1057745050. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:52,272][25689] Avg episode reward: [(0, '-0.902')] [2022-07-11 04:21:53,352][26022] Updated weights on worker 0-0, policy_version 1032957 (0.00088) [2022-07-11 04:21:55,311][26022] Updated weights on worker 0-0, policy_version 1032967 (0.00094) [2022-07-11 04:21:57,251][26022] Updated weights on worker 0-0, policy_version 1032977 (0.00086) [2022-07-11 04:21:57,348][25689] Fps is (10 sec: 5385.1, 60 sec: 5481.8, 300 sec: 5532.5). Total num frames: 1057768448. Throughput: 0: 5813.4. Samples: 1057778234. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:21:57,349][25689] Avg episode reward: [(0, '0.120')] [2022-07-11 04:21:58,930][26022] Updated weights on worker 0-0, policy_version 1032987 (0.00082) [2022-07-11 04:22:00,937][26022] Updated weights on worker 0-0, policy_version 1032997 (0.00087) [2022-07-11 04:22:02,423][25689] Fps is (10 sec: 5346.3, 60 sec: 5516.5, 300 sec: 5533.0). Total num frames: 1057796096. Throughput: 0: 5794.9. Samples: 1057794888. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:02,424][25689] Avg episode reward: [(0, '0.176')] [2022-07-11 04:22:02,779][26022] Updated weights on worker 0-0, policy_version 1033007 (0.00096) [2022-07-11 04:22:04,778][26022] Updated weights on worker 0-0, policy_version 1033017 (0.00085) [2022-07-11 04:22:06,780][26022] Updated weights on worker 0-0, policy_version 1033027 (0.00089) [2022-07-11 04:22:07,490][25689] Fps is (10 sec: 5452.3, 60 sec: 5519.1, 300 sec: 5532.6). Total num frames: 1057823744. Throughput: 0: 5705.7. Samples: 1057826474. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:07,490][25689] Avg episode reward: [(0, '0.288')] [2022-07-11 04:22:08,471][26022] Updated weights on worker 0-0, policy_version 1033037 (0.00085) [2022-07-11 04:22:10,283][26022] Updated weights on worker 0-0, policy_version 1033047 (0.00083) [2022-07-11 04:22:12,279][26022] Updated weights on worker 0-0, policy_version 1033057 (0.00051) [2022-07-11 04:22:12,509][25689] Fps is (10 sec: 5583.9, 60 sec: 5542.5, 300 sec: 5532.5). Total num frames: 1057852416. Throughput: 0: 5686.0. Samples: 1057860120. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:12,510][25689] Avg episode reward: [(0, '1.043')] [2022-07-11 04:22:13,959][26022] Updated weights on worker 0-0, policy_version 1033067 (0.00082) [2022-07-11 04:22:15,826][26022] Updated weights on worker 0-0, policy_version 1033077 (0.00084) [2022-07-11 04:22:17,530][25689] Fps is (10 sec: 5609.4, 60 sec: 5541.7, 300 sec: 5532.4). Total num frames: 1057880064. Throughput: 0: 4898.5. Samples: 1057877096. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:17,531][25689] Avg episode reward: [(0, '1.710')] [2022-07-11 04:22:17,717][26022] Updated weights on worker 0-0, policy_version 1033087 (0.00091) [2022-07-11 04:22:19,461][26022] Updated weights on worker 0-0, policy_version 1033097 (0.00088) [2022-07-11 04:22:21,280][26022] Updated weights on worker 0-0, policy_version 1033107 (0.00081) [2022-07-11 04:22:22,603][25689] Fps is (10 sec: 5478.5, 60 sec: 5542.3, 300 sec: 5532.9). Total num frames: 1057907712. Throughput: 0: 5736.9. Samples: 1057910656. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:22,604][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 04:22:23,005][26022] Updated weights on worker 0-0, policy_version 1033117 (0.00094) [2022-07-11 04:22:24,878][26022] Updated weights on worker 0-0, policy_version 1033127 (0.00082) [2022-07-11 04:22:26,680][26022] Updated weights on worker 0-0, policy_version 1033137 (0.00080) [2022-07-11 04:22:27,738][25689] Fps is (10 sec: 5617.6, 60 sec: 5539.2, 300 sec: 5534.2). Total num frames: 1057937408. Throughput: 0: 5830.1. Samples: 1057944522. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:27,740][25689] Avg episode reward: [(0, '0.165')] [2022-07-11 04:22:28,499][26022] Updated weights on worker 0-0, policy_version 1033147 (0.00090) [2022-07-11 04:22:30,410][26022] Updated weights on worker 0-0, policy_version 1033157 (0.00095) [2022-07-11 04:22:32,384][26022] Updated weights on worker 0-0, policy_version 1033167 (0.00086) [2022-07-11 04:22:32,766][25689] Fps is (10 sec: 5743.2, 60 sec: 5572.7, 300 sec: 5534.2). Total num frames: 1057966080. Throughput: 0: 4990.8. Samples: 1057961216. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:32,766][25689] Avg episode reward: [(0, '-1.223')] [2022-07-11 04:22:34,124][26022] Updated weights on worker 0-0, policy_version 1033177 (0.00084) [2022-07-11 04:22:35,826][26022] Updated weights on worker 0-0, policy_version 1033187 (0.00091) [2022-07-11 04:22:37,794][25689] Fps is (10 sec: 5499.0, 60 sec: 5556.6, 300 sec: 5534.0). Total num frames: 1057992704. Throughput: 0: 5792.2. Samples: 1057994466. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:37,795][25689] Avg episode reward: [(0, '-2.553')] [2022-07-11 04:22:37,980][26022] Updated weights on worker 0-0, policy_version 1033197 (0.00081) [2022-07-11 04:22:39,396][26022] Updated weights on worker 0-0, policy_version 1033207 (0.00085) [2022-07-11 04:22:41,465][26022] Updated weights on worker 0-0, policy_version 1033217 (0.00085) [2022-07-11 04:22:42,800][25689] Fps is (10 sec: 5510.8, 60 sec: 5573.3, 300 sec: 5531.7). Total num frames: 1058021376. Throughput: 0: 5815.1. Samples: 1058028104. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:42,800][25689] Avg episode reward: [(0, '-3.359')] [2022-07-11 04:22:43,291][26022] Updated weights on worker 0-0, policy_version 1033227 (0.00084) [2022-07-11 04:22:45,338][26022] Updated weights on worker 0-0, policy_version 1033237 (0.00084) [2022-07-11 04:22:46,959][26022] Updated weights on worker 0-0, policy_version 1033247 (0.00087) [2022-07-11 04:22:47,927][25689] Fps is (10 sec: 5659.0, 60 sec: 5537.2, 300 sec: 5536.3). Total num frames: 1058050048. Throughput: 0: 4967.1. Samples: 1058044802. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:47,928][25689] Avg episode reward: [(0, '-2.620')] [2022-07-11 04:22:48,893][26022] Updated weights on worker 0-0, policy_version 1033257 (0.00087) [2022-07-11 04:22:50,524][26022] Updated weights on worker 0-0, policy_version 1033267 (0.00081) [2022-07-11 04:22:52,616][26022] Updated weights on worker 0-0, policy_version 1033277 (0.00088) [2022-07-11 04:22:52,937][25689] Fps is (10 sec: 5555.9, 60 sec: 5536.5, 300 sec: 5533.5). Total num frames: 1058077696. Throughput: 0: 5800.7. Samples: 1058078222. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:52,937][25689] Avg episode reward: [(0, '-1.742')] [2022-07-11 04:22:54,206][26022] Updated weights on worker 0-0, policy_version 1033287 (0.00091) [2022-07-11 04:22:56,241][26022] Updated weights on worker 0-0, policy_version 1033297 (0.00085) [2022-07-11 04:22:57,963][25689] Fps is (10 sec: 5509.9, 60 sec: 5558.0, 300 sec: 5530.0). Total num frames: 1058105344. Throughput: 0: 5809.1. Samples: 1058111630. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:22:57,963][25689] Avg episode reward: [(0, '-0.228')] [2022-07-11 04:22:58,063][26022] Updated weights on worker 0-0, policy_version 1033307 (0.00082) [2022-07-11 04:22:59,827][26022] Updated weights on worker 0-0, policy_version 1033317 (0.00086) [2022-07-11 04:23:01,662][26022] Updated weights on worker 0-0, policy_version 1033327 (0.00094) [2022-07-11 04:23:02,987][25689] Fps is (10 sec: 5400.3, 60 sec: 5545.8, 300 sec: 5535.2). Total num frames: 1058131968. Throughput: 0: 4964.5. Samples: 1058128322. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:02,987][25689] Avg episode reward: [(0, '-0.617')] [2022-07-11 04:23:03,883][26022] Updated weights on worker 0-0, policy_version 1033337 (0.00088) [2022-07-11 04:23:05,613][26022] Updated weights on worker 0-0, policy_version 1033347 (0.00089) [2022-07-11 04:23:07,563][26022] Updated weights on worker 0-0, policy_version 1033357 (0.00090) [2022-07-11 04:23:08,112][25689] Fps is (10 sec: 5347.5, 60 sec: 5540.4, 300 sec: 5533.4). Total num frames: 1058159616. Throughput: 0: 5699.1. Samples: 1058159838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:08,112][25689] Avg episode reward: [(0, '0.515')] [2022-07-11 04:23:09,391][26022] Updated weights on worker 0-0, policy_version 1033367 (0.00088) [2022-07-11 04:23:11,088][26022] Updated weights on worker 0-0, policy_version 1033377 (0.00081) [2022-07-11 04:23:13,132][25689] Fps is (10 sec: 5450.5, 60 sec: 5523.5, 300 sec: 5534.0). Total num frames: 1058187264. Throughput: 0: 5704.7. Samples: 1058193428. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:13,132][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 04:23:13,176][26022] Updated weights on worker 0-0, policy_version 1033387 (0.00089) [2022-07-11 04:23:14,794][26022] Updated weights on worker 0-0, policy_version 1033397 (0.00081) [2022-07-11 04:23:16,768][26022] Updated weights on worker 0-0, policy_version 1033407 (0.00090) [2022-07-11 04:23:18,227][25689] Fps is (10 sec: 5668.9, 60 sec: 5550.4, 300 sec: 5537.0). Total num frames: 1058216960. Throughput: 0: 4872.2. Samples: 1058210364. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:18,228][25689] Avg episode reward: [(0, '0.919')] [2022-07-11 04:23:18,487][26022] Updated weights on worker 0-0, policy_version 1033417 (0.00091) [2022-07-11 04:23:20,489][26022] Updated weights on worker 0-0, policy_version 1033427 (0.00089) [2022-07-11 04:23:22,154][26022] Updated weights on worker 0-0, policy_version 1033437 (0.00083) [2022-07-11 04:23:23,304][25689] Fps is (10 sec: 5637.2, 60 sec: 5550.0, 300 sec: 5535.0). Total num frames: 1058244608. Throughput: 0: 5691.5. Samples: 1058243960. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:23,305][25689] Avg episode reward: [(0, '0.764')] [2022-07-11 04:23:24,064][26022] Updated weights on worker 0-0, policy_version 1033447 (0.00091) [2022-07-11 04:23:25,876][26022] Updated weights on worker 0-0, policy_version 1033457 (0.00090) [2022-07-11 04:23:27,768][26022] Updated weights on worker 0-0, policy_version 1033467 (0.00087) [2022-07-11 04:23:28,400][25689] Fps is (10 sec: 5435.9, 60 sec: 5519.9, 300 sec: 5530.2). Total num frames: 1058272256. Throughput: 0: 5784.7. Samples: 1058277198. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:28,400][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 04:23:29,657][26022] Updated weights on worker 0-0, policy_version 1033477 (0.00090) [2022-07-11 04:23:31,487][26022] Updated weights on worker 0-0, policy_version 1033487 (0.00086) [2022-07-11 04:23:33,314][26022] Updated weights on worker 0-0, policy_version 1033497 (0.00086) [2022-07-11 04:23:33,412][25689] Fps is (10 sec: 5673.0, 60 sec: 5538.2, 300 sec: 5537.1). Total num frames: 1058301952. Throughput: 0: 5781.1. Samples: 1058310674. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:33,413][25689] Avg episode reward: [(0, '-0.009')] [2022-07-11 04:23:33,554][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:23:33,570][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001033499_1058302976.pth [2022-07-11 04:23:33,571][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001031552_1056309248.pth [2022-07-11 04:23:35,029][26022] Updated weights on worker 0-0, policy_version 1033507 (0.00093) [2022-07-11 04:23:36,903][26022] Updated weights on worker 0-0, policy_version 1033517 (0.00086) [2022-07-11 04:23:38,426][25689] Fps is (10 sec: 5617.2, 60 sec: 5539.5, 300 sec: 5533.9). Total num frames: 1058328576. Throughput: 0: 5803.6. Samples: 1058327588. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:38,427][25689] Avg episode reward: [(0, '0.732')] [2022-07-11 04:23:38,867][26022] Updated weights on worker 0-0, policy_version 1033527 (0.00088) [2022-07-11 04:23:40,457][26022] Updated weights on worker 0-0, policy_version 1033537 (0.00093) [2022-07-11 04:23:42,653][26022] Updated weights on worker 0-0, policy_version 1033547 (0.00086) [2022-07-11 04:23:43,442][25689] Fps is (10 sec: 5513.3, 60 sec: 5538.6, 300 sec: 5535.8). Total num frames: 1058357248. Throughput: 0: 5829.0. Samples: 1058361344. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:43,443][25689] Avg episode reward: [(0, '0.706')] [2022-07-11 04:23:44,108][26022] Updated weights on worker 0-0, policy_version 1033557 (0.00091) [2022-07-11 04:23:46,300][26022] Updated weights on worker 0-0, policy_version 1033567 (0.00094) [2022-07-11 04:23:47,844][26022] Updated weights on worker 0-0, policy_version 1033577 (0.00091) [2022-07-11 04:23:48,509][25689] Fps is (10 sec: 5687.0, 60 sec: 5544.1, 300 sec: 5532.0). Total num frames: 1058385920. Throughput: 0: 5845.6. Samples: 1058394750. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:48,510][25689] Avg episode reward: [(0, '0.315')] [2022-07-11 04:23:49,799][26022] Updated weights on worker 0-0, policy_version 1033587 (0.00084) [2022-07-11 04:23:51,594][26022] Updated weights on worker 0-0, policy_version 1033597 (0.00088) [2022-07-11 04:23:53,550][25689] Fps is (10 sec: 5470.6, 60 sec: 5524.4, 300 sec: 5531.5). Total num frames: 1058412544. Throughput: 0: 4998.1. Samples: 1058411322. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:53,550][25689] Avg episode reward: [(0, '-0.888')] [2022-07-11 04:23:53,603][26022] Updated weights on worker 0-0, policy_version 1033607 (0.00087) [2022-07-11 04:23:55,304][26022] Updated weights on worker 0-0, policy_version 1033617 (0.00087) [2022-07-11 04:23:57,339][26022] Updated weights on worker 0-0, policy_version 1033627 (0.00086) [2022-07-11 04:23:58,570][25689] Fps is (10 sec: 5598.3, 60 sec: 5558.7, 300 sec: 5538.0). Total num frames: 1058442240. Throughput: 0: 5828.7. Samples: 1058444998. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:23:58,570][25689] Avg episode reward: [(0, '-1.347')] [2022-07-11 04:23:58,947][26022] Updated weights on worker 0-0, policy_version 1033637 (0.00092) [2022-07-11 04:24:00,952][26022] Updated weights on worker 0-0, policy_version 1033647 (0.00091) [2022-07-11 04:24:02,968][26022] Updated weights on worker 0-0, policy_version 1033657 (0.00085) [2022-07-11 04:24:03,578][25689] Fps is (10 sec: 5514.0, 60 sec: 5543.2, 300 sec: 5539.0). Total num frames: 1058467840. Throughput: 0: 5701.1. Samples: 1058476142. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:24:03,580][25689] Avg episode reward: [(0, '-0.475')] [2022-07-11 04:24:05,016][26022] Updated weights on worker 0-0, policy_version 1033667 (0.00083) [2022-07-11 04:24:06,640][26022] Updated weights on worker 0-0, policy_version 1033677 (0.00090) [2022-07-11 04:24:08,711][25689] Fps is (10 sec: 5149.5, 60 sec: 5525.6, 300 sec: 5531.5). Total num frames: 1058494464. Throughput: 0: 4849.4. Samples: 1058492716. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:24:08,712][25689] Avg episode reward: [(0, '-0.435')] [2022-07-11 04:24:08,759][26022] Updated weights on worker 0-0, policy_version 1033687 (0.00085) [2022-07-11 04:24:10,343][26022] Updated weights on worker 0-0, policy_version 1033697 (0.00087) [2022-07-11 04:24:12,535][26022] Updated weights on worker 0-0, policy_version 1033707 (0.00412) [2022-07-11 04:24:13,717][25689] Fps is (10 sec: 5554.8, 60 sec: 5560.7, 300 sec: 5532.6). Total num frames: 1058524160. Throughput: 0: 5703.2. Samples: 1058526340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:24:13,718][25689] Avg episode reward: [(0, '-1.343')] [2022-07-11 04:24:13,973][26022] Updated weights on worker 0-0, policy_version 1033717 (0.00088) [2022-07-11 04:24:16,141][26022] Updated weights on worker 0-0, policy_version 1033727 (0.00091) [2022-07-11 04:24:17,675][26022] Updated weights on worker 0-0, policy_version 1033737 (0.00088) [2022-07-11 04:24:18,767][25689] Fps is (10 sec: 5702.6, 60 sec: 5531.1, 300 sec: 5539.0). Total num frames: 1058551808. Throughput: 0: 5684.6. Samples: 1058559812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:24:18,767][25689] Avg episode reward: [(0, '-0.017')] [2022-07-11 04:24:19,770][26022] Updated weights on worker 0-0, policy_version 1033747 (0.00098) [2022-07-11 04:24:21,471][26022] Updated weights on worker 0-0, policy_version 1033757 (0.00100) [2022-07-11 04:24:23,418][26022] Updated weights on worker 0-0, policy_version 1033767 (0.00096) [2022-07-11 04:24:23,803][25689] Fps is (10 sec: 5583.8, 60 sec: 5551.7, 300 sec: 5532.7). Total num frames: 1058580480. Throughput: 0: 4960.0. Samples: 1058576460. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2022-07-11 04:24:23,804][25689] Avg episode reward: [(0, '0.574')] [2022-07-11 04:24:25,198][26022] Updated weights on worker 0-0, policy_version 1033777 (0.00094) [2022-07-11 04:24:27,110][26022] Updated weights on worker 0-0, policy_version 1033787 (0.00478) [2022-07-11 04:24:28,923][25689] Fps is (10 sec: 5444.8, 60 sec: 5532.6, 300 sec: 5530.5). Total num frames: 1058607104. Throughput: 0: 5778.1. Samples: 1058609500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:28,923][25689] Avg episode reward: [(0, '-0.281')] [2022-07-11 04:24:28,930][26022] Updated weights on worker 0-0, policy_version 1033797 (0.00090) [2022-07-11 04:24:30,851][26022] Updated weights on worker 0-0, policy_version 1033807 (0.00080) [2022-07-11 04:24:32,626][26022] Updated weights on worker 0-0, policy_version 1033817 (0.00080) [2022-07-11 04:24:33,942][25689] Fps is (10 sec: 5454.0, 60 sec: 5515.1, 300 sec: 5531.4). Total num frames: 1058635776. Throughput: 0: 5759.3. Samples: 1058642822. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:33,943][25689] Avg episode reward: [(0, '-1.450')] [2022-07-11 04:24:34,529][26022] Updated weights on worker 0-0, policy_version 1033827 (0.00088) [2022-07-11 04:24:36,436][26022] Updated weights on worker 0-0, policy_version 1033837 (0.00085) [2022-07-11 04:24:38,097][26022] Updated weights on worker 0-0, policy_version 1033847 (0.00059) [2022-07-11 04:24:38,966][25689] Fps is (10 sec: 5709.8, 60 sec: 5548.0, 300 sec: 5531.2). Total num frames: 1058664448. Throughput: 0: 4939.1. Samples: 1058659576. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:38,966][25689] Avg episode reward: [(0, '-1.888')] [2022-07-11 04:24:39,967][26022] Updated weights on worker 0-0, policy_version 1033857 (0.00082) [2022-07-11 04:24:41,700][26022] Updated weights on worker 0-0, policy_version 1033867 (0.00086) [2022-07-11 04:24:43,663][26022] Updated weights on worker 0-0, policy_version 1033877 (0.00096) [2022-07-11 04:24:43,980][25689] Fps is (10 sec: 5508.6, 60 sec: 5514.3, 300 sec: 5532.1). Total num frames: 1058691072. Throughput: 0: 5779.5. Samples: 1058693072. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:43,981][25689] Avg episode reward: [(0, '-0.708')] [2022-07-11 04:24:45,424][26022] Updated weights on worker 0-0, policy_version 1033887 (0.00091) [2022-07-11 04:24:47,265][26022] Updated weights on worker 0-0, policy_version 1033897 (0.00113) [2022-07-11 04:24:49,099][25689] Fps is (10 sec: 5456.9, 60 sec: 5509.6, 300 sec: 5530.5). Total num frames: 1058719744. Throughput: 0: 5799.3. Samples: 1058726508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:49,099][25689] Avg episode reward: [(0, '-0.376')] [2022-07-11 04:24:49,198][26022] Updated weights on worker 0-0, policy_version 1033907 (0.00091) [2022-07-11 04:24:50,846][26022] Updated weights on worker 0-0, policy_version 1033917 (0.00090) [2022-07-11 04:24:52,774][26022] Updated weights on worker 0-0, policy_version 1033927 (0.00091) [2022-07-11 04:24:54,179][25689] Fps is (10 sec: 5722.9, 60 sec: 5556.7, 300 sec: 5536.1). Total num frames: 1058749440. Throughput: 0: 4962.5. Samples: 1058743248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:54,180][25689] Avg episode reward: [(0, '0.202')] [2022-07-11 04:24:55,014][26022] Updated weights on worker 0-0, policy_version 1033937 (0.00093) [2022-07-11 04:24:56,411][26022] Updated weights on worker 0-0, policy_version 1033947 (0.00086) [2022-07-11 04:24:58,445][26022] Updated weights on worker 0-0, policy_version 1033957 (0.00093) [2022-07-11 04:24:59,208][25689] Fps is (10 sec: 5571.1, 60 sec: 5505.1, 300 sec: 5532.2). Total num frames: 1058776064. Throughput: 0: 5786.0. Samples: 1058776700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:24:59,209][25689] Avg episode reward: [(0, '0.769')] [2022-07-11 04:25:00,061][26022] Updated weights on worker 0-0, policy_version 1033967 (0.00087) [2022-07-11 04:25:02,446][26022] Updated weights on worker 0-0, policy_version 1033977 (0.00073) [2022-07-11 04:25:04,228][25689] Fps is (10 sec: 5095.3, 60 sec: 5487.3, 300 sec: 5522.3). Total num frames: 1058800640. Throughput: 0: 5672.8. Samples: 1058807932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:04,228][25689] Avg episode reward: [(0, '1.723')] [2022-07-11 04:25:04,257][26022] Updated weights on worker 0-0, policy_version 1033987 (0.00078) [2022-07-11 04:25:06,109][26022] Updated weights on worker 0-0, policy_version 1033997 (0.00083) [2022-07-11 04:25:08,155][26022] Updated weights on worker 0-0, policy_version 1034007 (0.00083) [2022-07-11 04:25:09,370][25689] Fps is (10 sec: 5340.7, 60 sec: 5537.1, 300 sec: 5533.6). Total num frames: 1058830336. Throughput: 0: 4838.2. Samples: 1058824584. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:09,371][25689] Avg episode reward: [(0, '0.622')] [2022-07-11 04:25:09,887][26022] Updated weights on worker 0-0, policy_version 1034017 (0.00085) [2022-07-11 04:25:11,661][26022] Updated weights on worker 0-0, policy_version 1034027 (0.00085) [2022-07-11 04:25:13,459][26022] Updated weights on worker 0-0, policy_version 1034037 (0.00092) [2022-07-11 04:25:14,401][25689] Fps is (10 sec: 5737.1, 60 sec: 5517.9, 300 sec: 5533.8). Total num frames: 1058859008. Throughput: 0: 5671.3. Samples: 1058857934. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:14,402][25689] Avg episode reward: [(0, '0.248')] [2022-07-11 04:25:15,391][26022] Updated weights on worker 0-0, policy_version 1034047 (0.00086) [2022-07-11 04:25:17,075][26022] Updated weights on worker 0-0, policy_version 1034057 (0.00080) [2022-07-11 04:25:18,985][26022] Updated weights on worker 0-0, policy_version 1034067 (0.00087) [2022-07-11 04:25:19,471][25689] Fps is (10 sec: 5576.0, 60 sec: 5516.1, 300 sec: 5532.6). Total num frames: 1058886656. Throughput: 0: 5676.1. Samples: 1058891712. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:19,472][25689] Avg episode reward: [(0, '0.004')] [2022-07-11 04:25:20,739][26022] Updated weights on worker 0-0, policy_version 1034077 (0.00089) [2022-07-11 04:25:22,885][26022] Updated weights on worker 0-0, policy_version 1034087 (0.00091) [2022-07-11 04:25:24,471][26022] Updated weights on worker 0-0, policy_version 1034097 (0.00087) [2022-07-11 04:25:24,495][25689] Fps is (10 sec: 5579.8, 60 sec: 5517.2, 300 sec: 5533.9). Total num frames: 1058915328. Throughput: 0: 4955.1. Samples: 1058908358. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:24,495][25689] Avg episode reward: [(0, '-0.231')] [2022-07-11 04:25:26,342][26022] Updated weights on worker 0-0, policy_version 1034107 (0.00086) [2022-07-11 04:25:28,323][26022] Updated weights on worker 0-0, policy_version 1034117 (0.00114) [2022-07-11 04:25:29,575][25689] Fps is (10 sec: 5573.6, 60 sec: 5537.6, 300 sec: 5527.2). Total num frames: 1058942976. Throughput: 0: 5777.2. Samples: 1058941314. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:29,576][25689] Avg episode reward: [(0, '-1.066')] [2022-07-11 04:25:30,094][26022] Updated weights on worker 0-0, policy_version 1034127 (0.00088) [2022-07-11 04:25:32,043][26022] Updated weights on worker 0-0, policy_version 1034137 (0.00089) [2022-07-11 04:25:33,629][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:25:33,642][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001034146_1058965504.pth [2022-07-11 04:25:33,642][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001032202_1056974848.pth [2022-07-11 04:25:33,815][26022] Updated weights on worker 0-0, policy_version 1034147 (0.00090) [2022-07-11 04:25:34,615][25689] Fps is (10 sec: 5463.9, 60 sec: 5518.9, 300 sec: 5531.4). Total num frames: 1058970624. Throughput: 0: 5761.3. Samples: 1058974392. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:34,616][25689] Avg episode reward: [(0, '-0.369')] [2022-07-11 04:25:35,551][26022] Updated weights on worker 0-0, policy_version 1034157 (0.00097) [2022-07-11 04:25:37,558][26022] Updated weights on worker 0-0, policy_version 1034167 (0.00087) [2022-07-11 04:25:39,315][26022] Updated weights on worker 0-0, policy_version 1034177 (0.00089) [2022-07-11 04:25:39,669][25689] Fps is (10 sec: 5478.5, 60 sec: 5499.3, 300 sec: 5527.0). Total num frames: 1058998272. Throughput: 0: 5752.6. Samples: 1059007904. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:39,669][25689] Avg episode reward: [(0, '-0.301')] [2022-07-11 04:25:41,212][26022] Updated weights on worker 0-0, policy_version 1034187 (0.00089) [2022-07-11 04:25:43,056][26022] Updated weights on worker 0-0, policy_version 1034197 (0.00086) [2022-07-11 04:25:44,709][25689] Fps is (10 sec: 5478.0, 60 sec: 5513.8, 300 sec: 5528.8). Total num frames: 1059025920. Throughput: 0: 5742.2. Samples: 1059024434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:44,710][25689] Avg episode reward: [(0, '-1.723')] [2022-07-11 04:25:45,028][26022] Updated weights on worker 0-0, policy_version 1034207 (0.00092) [2022-07-11 04:25:46,796][26022] Updated weights on worker 0-0, policy_version 1034217 (0.00082) [2022-07-11 04:25:48,717][26022] Updated weights on worker 0-0, policy_version 1034227 (0.00088) [2022-07-11 04:25:49,750][25689] Fps is (10 sec: 5586.4, 60 sec: 5520.8, 300 sec: 5529.0). Total num frames: 1059054592. Throughput: 0: 5765.3. Samples: 1059057628. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:49,751][25689] Avg episode reward: [(0, '-2.470')] [2022-07-11 04:25:50,534][26022] Updated weights on worker 0-0, policy_version 1034237 (0.00094) [2022-07-11 04:25:52,387][26022] Updated weights on worker 0-0, policy_version 1034247 (0.00088) [2022-07-11 04:25:54,190][26022] Updated weights on worker 0-0, policy_version 1034257 (0.00095) [2022-07-11 04:25:54,805][25689] Fps is (10 sec: 5679.9, 60 sec: 5506.3, 300 sec: 5525.8). Total num frames: 1059083264. Throughput: 0: 5775.1. Samples: 1059090992. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:54,806][25689] Avg episode reward: [(0, '-3.304')] [2022-07-11 04:25:56,273][26022] Updated weights on worker 0-0, policy_version 1034267 (0.00088) [2022-07-11 04:25:57,720][26022] Updated weights on worker 0-0, policy_version 1034277 (0.00089) [2022-07-11 04:25:59,842][25689] Fps is (10 sec: 5377.8, 60 sec: 5488.7, 300 sec: 5526.7). Total num frames: 1059108864. Throughput: 0: 4944.8. Samples: 1059107658. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:25:59,843][25689] Avg episode reward: [(0, '-2.908')] [2022-07-11 04:25:59,875][26022] Updated weights on worker 0-0, policy_version 1034287 (0.00077) [2022-07-11 04:26:01,413][26022] Updated weights on worker 0-0, policy_version 1034297 (0.00096) [2022-07-11 04:26:03,799][26022] Updated weights on worker 0-0, policy_version 1034307 (0.00089) [2022-07-11 04:26:04,851][25689] Fps is (10 sec: 5300.7, 60 sec: 5540.3, 300 sec: 5528.3). Total num frames: 1059136512. Throughput: 0: 5695.7. Samples: 1059139154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:04,851][25689] Avg episode reward: [(0, '-2.377')] [2022-07-11 04:26:05,607][26022] Updated weights on worker 0-0, policy_version 1034317 (0.00092) [2022-07-11 04:26:07,407][26022] Updated weights on worker 0-0, policy_version 1034327 (0.00084) [2022-07-11 04:26:09,278][26022] Updated weights on worker 0-0, policy_version 1034337 (0.00087) [2022-07-11 04:26:09,891][25689] Fps is (10 sec: 5502.8, 60 sec: 5515.8, 300 sec: 5529.3). Total num frames: 1059164160. Throughput: 0: 5703.0. Samples: 1059172490. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:09,892][25689] Avg episode reward: [(0, '-2.721')] [2022-07-11 04:26:11,202][26022] Updated weights on worker 0-0, policy_version 1034347 (0.00090) [2022-07-11 04:26:12,991][26022] Updated weights on worker 0-0, policy_version 1034357 (0.00092) [2022-07-11 04:26:14,927][25689] Fps is (10 sec: 5487.8, 60 sec: 5498.5, 300 sec: 5528.8). Total num frames: 1059191808. Throughput: 0: 4874.6. Samples: 1059189076. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:14,928][25689] Avg episode reward: [(0, '-0.674')] [2022-07-11 04:26:14,930][26022] Updated weights on worker 0-0, policy_version 1034367 (0.00088) [2022-07-11 04:26:16,475][26022] Updated weights on worker 0-0, policy_version 1034377 (0.00082) [2022-07-11 04:26:18,631][26022] Updated weights on worker 0-0, policy_version 1034387 (0.00096) [2022-07-11 04:26:19,993][25689] Fps is (10 sec: 5575.4, 60 sec: 5515.7, 300 sec: 5532.5). Total num frames: 1059220480. Throughput: 0: 5703.3. Samples: 1059222582. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:19,993][25689] Avg episode reward: [(0, '0.176')] [2022-07-11 04:26:20,326][26022] Updated weights on worker 0-0, policy_version 1034397 (0.00078) [2022-07-11 04:26:22,430][26022] Updated weights on worker 0-0, policy_version 1034407 (0.00102) [2022-07-11 04:26:23,942][26022] Updated weights on worker 0-0, policy_version 1034417 (0.00088) [2022-07-11 04:26:25,036][25689] Fps is (10 sec: 5571.3, 60 sec: 5497.1, 300 sec: 5526.7). Total num frames: 1059248128. Throughput: 0: 5786.6. Samples: 1059255956. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:25,037][25689] Avg episode reward: [(0, '1.364')] [2022-07-11 04:26:26,165][26022] Updated weights on worker 0-0, policy_version 1034427 (0.00098) [2022-07-11 04:26:27,668][26022] Updated weights on worker 0-0, policy_version 1034437 (0.00088) [2022-07-11 04:26:29,735][26022] Updated weights on worker 0-0, policy_version 1034447 (0.00084) [2022-07-11 04:26:30,106][25689] Fps is (10 sec: 5366.6, 60 sec: 5481.2, 300 sec: 5525.9). Total num frames: 1059274752. Throughput: 0: 4947.2. Samples: 1059272496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:30,107][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 04:26:31,312][26022] Updated weights on worker 0-0, policy_version 1034457 (0.00092) [2022-07-11 04:26:33,459][26022] Updated weights on worker 0-0, policy_version 1034467 (0.00083) [2022-07-11 04:26:35,069][26022] Updated weights on worker 0-0, policy_version 1034477 (0.00086) [2022-07-11 04:26:35,160][25689] Fps is (10 sec: 5563.1, 60 sec: 5513.7, 300 sec: 5532.4). Total num frames: 1059304448. Throughput: 0: 5765.8. Samples: 1059305734. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:35,160][25689] Avg episode reward: [(0, '0.535')] [2022-07-11 04:26:37,175][26022] Updated weights on worker 0-0, policy_version 1034487 (0.00091) [2022-07-11 04:26:38,784][26022] Updated weights on worker 0-0, policy_version 1034497 (0.00090) [2022-07-11 04:26:40,205][25689] Fps is (10 sec: 5678.0, 60 sec: 5514.5, 300 sec: 5531.7). Total num frames: 1059332096. Throughput: 0: 5767.1. Samples: 1059339148. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:40,205][25689] Avg episode reward: [(0, '0.200')] [2022-07-11 04:26:40,892][26022] Updated weights on worker 0-0, policy_version 1034507 (0.00083) [2022-07-11 04:26:42,415][26022] Updated weights on worker 0-0, policy_version 1034517 (0.00085) [2022-07-11 04:26:44,486][26022] Updated weights on worker 0-0, policy_version 1034527 (0.00082) [2022-07-11 04:26:45,217][25689] Fps is (10 sec: 5396.0, 60 sec: 5500.1, 300 sec: 5519.6). Total num frames: 1059358720. Throughput: 0: 4958.1. Samples: 1059356018. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:45,219][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 04:26:45,997][26022] Updated weights on worker 0-0, policy_version 1034537 (0.00095) [2022-07-11 04:26:48,168][26022] Updated weights on worker 0-0, policy_version 1034547 (0.00086) [2022-07-11 04:26:49,682][26022] Updated weights on worker 0-0, policy_version 1034557 (0.00087) [2022-07-11 04:26:50,251][25689] Fps is (10 sec: 5708.2, 60 sec: 5534.6, 300 sec: 5529.3). Total num frames: 1059389440. Throughput: 0: 5807.8. Samples: 1059389496. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:50,251][25689] Avg episode reward: [(0, '-0.338')] [2022-07-11 04:26:51,698][26022] Updated weights on worker 0-0, policy_version 1034567 (0.00086) [2022-07-11 04:26:53,394][26022] Updated weights on worker 0-0, policy_version 1034577 (0.00095) [2022-07-11 04:26:55,126][26022] Updated weights on worker 0-0, policy_version 1034587 (0.00084) [2022-07-11 04:26:55,266][25689] Fps is (10 sec: 5808.8, 60 sec: 5521.4, 300 sec: 5533.9). Total num frames: 1059417088. Throughput: 0: 5834.7. Samples: 1059423046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:26:55,266][25689] Avg episode reward: [(0, '0.238')] [2022-07-11 04:26:57,263][26022] Updated weights on worker 0-0, policy_version 1034597 (0.00083) [2022-07-11 04:26:59,074][26022] Updated weights on worker 0-0, policy_version 1034607 (0.00083) [2022-07-11 04:27:00,292][25689] Fps is (10 sec: 5303.1, 60 sec: 5522.4, 300 sec: 5527.9). Total num frames: 1059442688. Throughput: 0: 4999.4. Samples: 1059439570. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:00,293][25689] Avg episode reward: [(0, '0.267')] [2022-07-11 04:27:00,812][26022] Updated weights on worker 0-0, policy_version 1034617 (0.00085) [2022-07-11 04:27:03,238][26022] Updated weights on worker 0-0, policy_version 1034627 (0.00084) [2022-07-11 04:27:04,755][26022] Updated weights on worker 0-0, policy_version 1034637 (0.00088) [2022-07-11 04:27:05,304][25689] Fps is (10 sec: 5304.4, 60 sec: 5522.0, 300 sec: 5528.9). Total num frames: 1059470336. Throughput: 0: 5729.3. Samples: 1059471100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:05,305][25689] Avg episode reward: [(0, '0.300')] [2022-07-11 04:27:06,989][26022] Updated weights on worker 0-0, policy_version 1034647 (0.00085) [2022-07-11 04:27:08,611][26022] Updated weights on worker 0-0, policy_version 1034657 (0.00084) [2022-07-11 04:27:10,454][25689] Fps is (10 sec: 5441.4, 60 sec: 5512.0, 300 sec: 5523.1). Total num frames: 1059497984. Throughput: 0: 5679.8. Samples: 1059504244. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:10,454][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 04:27:10,560][26022] Updated weights on worker 0-0, policy_version 1034667 (0.00085) [2022-07-11 04:27:12,399][26022] Updated weights on worker 0-0, policy_version 1034677 (0.00089) [2022-07-11 04:27:14,314][26022] Updated weights on worker 0-0, policy_version 1034687 (0.00082) [2022-07-11 04:27:15,533][25689] Fps is (10 sec: 5606.2, 60 sec: 5541.9, 300 sec: 5528.9). Total num frames: 1059527680. Throughput: 0: 4832.3. Samples: 1059520984. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:15,535][25689] Avg episode reward: [(0, '1.019')] [2022-07-11 04:27:16,066][26022] Updated weights on worker 0-0, policy_version 1034697 (0.00089) [2022-07-11 04:27:17,928][26022] Updated weights on worker 0-0, policy_version 1034707 (0.00092) [2022-07-11 04:27:19,595][26022] Updated weights on worker 0-0, policy_version 1034717 (0.00086) [2022-07-11 04:27:20,558][25689] Fps is (10 sec: 5675.4, 60 sec: 5528.7, 300 sec: 5529.7). Total num frames: 1059555328. Throughput: 0: 5688.6. Samples: 1059554856. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:20,559][25689] Avg episode reward: [(0, '0.323')] [2022-07-11 04:27:21,657][26022] Updated weights on worker 0-0, policy_version 1034727 (0.00084) [2022-07-11 04:27:23,031][26022] Updated weights on worker 0-0, policy_version 1034737 (0.00088) [2022-07-11 04:27:25,320][26022] Updated weights on worker 0-0, policy_version 1034747 (0.00100) [2022-07-11 04:27:25,570][25689] Fps is (10 sec: 5509.1, 60 sec: 5531.5, 300 sec: 5525.2). Total num frames: 1059582976. Throughput: 0: 5791.6. Samples: 1059588472. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:25,571][25689] Avg episode reward: [(0, '0.738')] [2022-07-11 04:27:26,805][26022] Updated weights on worker 0-0, policy_version 1034757 (0.00089) [2022-07-11 04:27:28,893][26022] Updated weights on worker 0-0, policy_version 1034767 (0.00081) [2022-07-11 04:27:30,402][26022] Updated weights on worker 0-0, policy_version 1034777 (0.00101) [2022-07-11 04:27:30,641][25689] Fps is (10 sec: 5687.1, 60 sec: 5582.2, 300 sec: 5527.8). Total num frames: 1059612672. Throughput: 0: 5001.2. Samples: 1059605204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:30,643][25689] Avg episode reward: [(0, '0.265')] [2022-07-11 04:27:32,642][26022] Updated weights on worker 0-0, policy_version 1034787 (0.00089) [2022-07-11 04:27:33,646][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:27:33,660][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001034794_1059629056.pth [2022-07-11 04:27:33,660][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001032848_1057636352.pth [2022-07-11 04:27:34,086][26022] Updated weights on worker 0-0, policy_version 1034797 (0.00083) [2022-07-11 04:27:35,720][25689] Fps is (10 sec: 5447.9, 60 sec: 5512.3, 300 sec: 5523.4). Total num frames: 1059638272. Throughput: 0: 5834.2. Samples: 1059638760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:35,721][25689] Avg episode reward: [(0, '1.064')] [2022-07-11 04:27:36,238][26022] Updated weights on worker 0-0, policy_version 1034807 (0.00091) [2022-07-11 04:27:37,742][26022] Updated weights on worker 0-0, policy_version 1034817 (0.00092) [2022-07-11 04:27:39,825][26022] Updated weights on worker 0-0, policy_version 1034827 (0.00090) [2022-07-11 04:27:40,739][25689] Fps is (10 sec: 5475.8, 60 sec: 5548.5, 300 sec: 5526.6). Total num frames: 1059667968. Throughput: 0: 5829.6. Samples: 1059672506. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:40,741][25689] Avg episode reward: [(0, '1.071')] [2022-07-11 04:27:41,523][26022] Updated weights on worker 0-0, policy_version 1034837 (0.00086) [2022-07-11 04:27:43,334][26022] Updated weights on worker 0-0, policy_version 1034847 (0.00087) [2022-07-11 04:27:45,235][26022] Updated weights on worker 0-0, policy_version 1034857 (0.00093) [2022-07-11 04:27:45,751][25689] Fps is (10 sec: 5716.8, 60 sec: 5565.5, 300 sec: 5525.3). Total num frames: 1059695616. Throughput: 0: 4990.8. Samples: 1059689190. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:45,751][25689] Avg episode reward: [(0, '1.212')] [2022-07-11 04:27:47,205][26022] Updated weights on worker 0-0, policy_version 1034867 (0.00102) [2022-07-11 04:27:48,909][26022] Updated weights on worker 0-0, policy_version 1034877 (0.00089) [2022-07-11 04:27:50,758][26022] Updated weights on worker 0-0, policy_version 1034887 (0.00083) [2022-07-11 04:27:50,843][25689] Fps is (10 sec: 5574.2, 60 sec: 5526.3, 300 sec: 5527.2). Total num frames: 1059724288. Throughput: 0: 5803.5. Samples: 1059722446. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:50,843][25689] Avg episode reward: [(0, '0.820')] [2022-07-11 04:27:52,575][26022] Updated weights on worker 0-0, policy_version 1034897 (0.00088) [2022-07-11 04:27:54,519][26022] Updated weights on worker 0-0, policy_version 1034907 (0.00089) [2022-07-11 04:27:55,856][25689] Fps is (10 sec: 5573.3, 60 sec: 5526.4, 300 sec: 5527.5). Total num frames: 1059751936. Throughput: 0: 5813.0. Samples: 1059755812. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:27:55,857][25689] Avg episode reward: [(0, '0.076')] [2022-07-11 04:27:56,433][26022] Updated weights on worker 0-0, policy_version 1034917 (0.00084) [2022-07-11 04:27:58,132][26022] Updated weights on worker 0-0, policy_version 1034927 (0.00093) [2022-07-11 04:28:00,140][26022] Updated weights on worker 0-0, policy_version 1034937 (0.00093) [2022-07-11 04:28:00,861][25689] Fps is (10 sec: 5622.0, 60 sec: 5579.1, 300 sec: 5534.7). Total num frames: 1059780608. Throughput: 0: 4972.8. Samples: 1059772566. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:28:00,861][25689] Avg episode reward: [(0, '-0.126')] [2022-07-11 04:28:02,032][26022] Updated weights on worker 0-0, policy_version 1034947 (0.00097) [2022-07-11 04:28:04,091][26022] Updated weights on worker 0-0, policy_version 1034957 (0.00091) [2022-07-11 04:28:05,891][25689] Fps is (10 sec: 5306.5, 60 sec: 5526.8, 300 sec: 5526.1). Total num frames: 1059805184. Throughput: 0: 5700.9. Samples: 1059804006. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:28:05,891][25689] Avg episode reward: [(0, '-0.379')] [2022-07-11 04:28:05,910][26022] Updated weights on worker 0-0, policy_version 1034967 (0.00088) [2022-07-11 04:28:07,821][26022] Updated weights on worker 0-0, policy_version 1034977 (0.00085) [2022-07-11 04:28:09,708][26022] Updated weights on worker 0-0, policy_version 1034987 (0.00083) [2022-07-11 04:28:10,936][25689] Fps is (10 sec: 5081.6, 60 sec: 5519.4, 300 sec: 5522.2). Total num frames: 1059831808. Throughput: 0: 5715.2. Samples: 1059837284. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:28:10,936][25689] Avg episode reward: [(0, '-0.478')] [2022-07-11 04:28:11,575][26022] Updated weights on worker 0-0, policy_version 1034997 (0.00091) [2022-07-11 04:28:13,275][26022] Updated weights on worker 0-0, policy_version 1035007 (0.00087) [2022-07-11 04:28:15,183][26022] Updated weights on worker 0-0, policy_version 1035017 (0.00097) [2022-07-11 04:28:15,940][25689] Fps is (10 sec: 5502.7, 60 sec: 5509.3, 300 sec: 5520.5). Total num frames: 1059860480. Throughput: 0: 5723.3. Samples: 1059870756. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:28:15,940][25689] Avg episode reward: [(0, '0.376')] [2022-07-11 04:28:17,057][26022] Updated weights on worker 0-0, policy_version 1035027 (0.00091) [2022-07-11 04:28:18,820][26022] Updated weights on worker 0-0, policy_version 1035037 (0.00086) [2022-07-11 04:28:20,831][26022] Updated weights on worker 0-0, policy_version 1035047 (0.00098) [2022-07-11 04:28:20,953][25689] Fps is (10 sec: 5622.6, 60 sec: 5510.4, 300 sec: 5521.7). Total num frames: 1059888128. Throughput: 0: 5726.4. Samples: 1059887622. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:28:20,953][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 04:28:22,580][26022] Updated weights on worker 0-0, policy_version 1035057 (0.00086) [2022-07-11 04:28:24,372][26022] Updated weights on worker 0-0, policy_version 1035067 (0.00090) [2022-07-11 04:28:25,963][25689] Fps is (10 sec: 5618.8, 60 sec: 5527.5, 300 sec: 5526.7). Total num frames: 1059916800. Throughput: 0: 5829.8. Samples: 1059921024. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:28:25,964][25689] Avg episode reward: [(0, '1.178')] [2022-07-11 04:28:26,363][26022] Updated weights on worker 0-0, policy_version 1035077 (0.00094) [2022-07-11 04:28:28,094][26022] Updated weights on worker 0-0, policy_version 1035087 (0.00088) [2022-07-11 04:28:30,043][26022] Updated weights on worker 0-0, policy_version 1035097 (0.00091) [2022-07-11 04:28:31,022][25689] Fps is (10 sec: 5694.7, 60 sec: 5511.6, 300 sec: 5522.4). Total num frames: 1059945472. Throughput: 0: 5808.1. Samples: 1059953948. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:28:31,023][25689] Avg episode reward: [(0, '2.156')] [2022-07-11 04:28:31,756][26022] Updated weights on worker 0-0, policy_version 1035107 (0.00086) [2022-07-11 04:28:33,695][26022] Updated weights on worker 0-0, policy_version 1035117 (0.00086) [2022-07-11 04:28:35,274][26022] Updated weights on worker 0-0, policy_version 1035127 (0.00092) [2022-07-11 04:28:36,082][25689] Fps is (10 sec: 5565.9, 60 sec: 5547.3, 300 sec: 5525.0). Total num frames: 1059973120. Throughput: 0: 4969.7. Samples: 1059970858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:28:36,083][25689] Avg episode reward: [(0, '1.917')] [2022-07-11 04:28:37,368][26022] Updated weights on worker 0-0, policy_version 1035137 (0.00069) [2022-07-11 04:28:39,063][26022] Updated weights on worker 0-0, policy_version 1035147 (0.00093) [2022-07-11 04:28:40,985][26022] Updated weights on worker 0-0, policy_version 1035157 (0.00090) [2022-07-11 04:28:41,088][25689] Fps is (10 sec: 5493.7, 60 sec: 5514.6, 300 sec: 5521.7). Total num frames: 1060000768. Throughput: 0: 5794.9. Samples: 1060004302. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:28:41,088][25689] Avg episode reward: [(0, '1.783')] [2022-07-11 04:28:42,900][26022] Updated weights on worker 0-0, policy_version 1035167 (0.00085) [2022-07-11 04:28:44,570][26022] Updated weights on worker 0-0, policy_version 1035177 (0.00096) [2022-07-11 04:28:46,128][25689] Fps is (10 sec: 5606.1, 60 sec: 5529.0, 300 sec: 5522.2). Total num frames: 1060029440. Throughput: 0: 5799.3. Samples: 1060037968. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:28:46,130][25689] Avg episode reward: [(0, '1.510')] [2022-07-11 04:28:46,583][26022] Updated weights on worker 0-0, policy_version 1035187 (0.00091) [2022-07-11 04:28:48,268][26022] Updated weights on worker 0-0, policy_version 1035197 (0.00087) [2022-07-11 04:28:50,216][26022] Updated weights on worker 0-0, policy_version 1035207 (0.00094) [2022-07-11 04:28:51,186][25689] Fps is (10 sec: 5678.6, 60 sec: 5532.1, 300 sec: 5528.8). Total num frames: 1060058112. Throughput: 0: 5000.3. Samples: 1060054772. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:28:51,188][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 04:28:52,034][26022] Updated weights on worker 0-0, policy_version 1035217 (0.00082) [2022-07-11 04:28:54,003][26022] Updated weights on worker 0-0, policy_version 1035227 (0.00086) [2022-07-11 04:28:55,584][26022] Updated weights on worker 0-0, policy_version 1035237 (0.00090) [2022-07-11 04:28:56,209][25689] Fps is (10 sec: 5587.0, 60 sec: 5531.2, 300 sec: 5521.9). Total num frames: 1060085760. Throughput: 0: 5818.2. Samples: 1060087960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:28:56,210][25689] Avg episode reward: [(0, '-0.023')] [2022-07-11 04:28:57,630][26022] Updated weights on worker 0-0, policy_version 1035247 (0.00093) [2022-07-11 04:28:59,154][26022] Updated weights on worker 0-0, policy_version 1035257 (0.00079) [2022-07-11 04:29:01,223][25689] Fps is (10 sec: 5407.1, 60 sec: 5496.4, 300 sec: 5525.2). Total num frames: 1060112384. Throughput: 0: 5832.4. Samples: 1060121740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:01,225][25689] Avg episode reward: [(0, '-1.588')] [2022-07-11 04:29:01,340][26022] Updated weights on worker 0-0, policy_version 1035267 (0.00090) [2022-07-11 04:29:03,207][26022] Updated weights on worker 0-0, policy_version 1035277 (0.00095) [2022-07-11 04:29:05,410][26022] Updated weights on worker 0-0, policy_version 1035287 (0.00102) [2022-07-11 04:29:06,239][25689] Fps is (10 sec: 5410.8, 60 sec: 5548.6, 300 sec: 5530.8). Total num frames: 1060140032. Throughput: 0: 4888.0. Samples: 1060136270. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:06,240][25689] Avg episode reward: [(0, '-2.180')] [2022-07-11 04:29:07,007][26022] Updated weights on worker 0-0, policy_version 1035297 (0.00424) [2022-07-11 04:29:09,017][26022] Updated weights on worker 0-0, policy_version 1035307 (0.00086) [2022-07-11 04:29:10,465][26022] Updated weights on worker 0-0, policy_version 1035317 (0.00087) [2022-07-11 04:29:11,362][25689] Fps is (10 sec: 5453.8, 60 sec: 5558.4, 300 sec: 5521.7). Total num frames: 1060167680. Throughput: 0: 5692.7. Samples: 1060169628. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:11,363][25689] Avg episode reward: [(0, '-2.051')] [2022-07-11 04:29:12,647][26022] Updated weights on worker 0-0, policy_version 1035327 (0.00082) [2022-07-11 04:29:14,398][26022] Updated weights on worker 0-0, policy_version 1035337 (0.00088) [2022-07-11 04:29:16,311][26022] Updated weights on worker 0-0, policy_version 1035347 (0.00089) [2022-07-11 04:29:16,428][25689] Fps is (10 sec: 5427.0, 60 sec: 5535.7, 300 sec: 5521.4). Total num frames: 1060195328. Throughput: 0: 5701.8. Samples: 1060203246. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:16,429][25689] Avg episode reward: [(0, '-1.539')] [2022-07-11 04:29:17,938][26022] Updated weights on worker 0-0, policy_version 1035357 (0.00091) [2022-07-11 04:29:20,009][26022] Updated weights on worker 0-0, policy_version 1035367 (0.00095) [2022-07-11 04:29:21,438][25689] Fps is (10 sec: 5589.5, 60 sec: 5552.9, 300 sec: 5521.9). Total num frames: 1060224000. Throughput: 0: 4854.6. Samples: 1060219876. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:21,439][25689] Avg episode reward: [(0, '-1.152')] [2022-07-11 04:29:21,715][26022] Updated weights on worker 0-0, policy_version 1035377 (0.00491) [2022-07-11 04:29:23,607][26022] Updated weights on worker 0-0, policy_version 1035387 (0.00088) [2022-07-11 04:29:25,324][26022] Updated weights on worker 0-0, policy_version 1035397 (0.00086) [2022-07-11 04:29:26,467][25689] Fps is (10 sec: 5610.4, 60 sec: 5534.4, 300 sec: 5527.1). Total num frames: 1060251648. Throughput: 0: 5804.7. Samples: 1060253686. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:26,467][25689] Avg episode reward: [(0, '-0.829')] [2022-07-11 04:29:27,361][26022] Updated weights on worker 0-0, policy_version 1035407 (0.00087) [2022-07-11 04:29:29,134][26022] Updated weights on worker 0-0, policy_version 1035417 (0.00087) [2022-07-11 04:29:30,854][26022] Updated weights on worker 0-0, policy_version 1035427 (0.00085) [2022-07-11 04:29:31,511][25689] Fps is (10 sec: 5489.5, 60 sec: 5518.8, 300 sec: 5523.2). Total num frames: 1060279296. Throughput: 0: 5820.6. Samples: 1060286908. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:31,512][25689] Avg episode reward: [(0, '0.912')] [2022-07-11 04:29:32,563][26022] Updated weights on worker 0-0, policy_version 1035437 (0.00087) [2022-07-11 04:29:33,746][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:29:33,758][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001035443_1060293632.pth [2022-07-11 04:29:33,758][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001033499_1058302976.pth [2022-07-11 04:29:34,792][26022] Updated weights on worker 0-0, policy_version 1035447 (0.00082) [2022-07-11 04:29:36,374][26022] Updated weights on worker 0-0, policy_version 1035457 (0.00086) [2022-07-11 04:29:36,516][25689] Fps is (10 sec: 5706.1, 60 sec: 5557.6, 300 sec: 5526.9). Total num frames: 1060308992. Throughput: 0: 4994.1. Samples: 1060303568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:36,517][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 04:29:38,596][26022] Updated weights on worker 0-0, policy_version 1035467 (0.00081) [2022-07-11 04:29:40,132][26022] Updated weights on worker 0-0, policy_version 1035477 (0.00085) [2022-07-11 04:29:41,547][25689] Fps is (10 sec: 5408.0, 60 sec: 5504.6, 300 sec: 5519.8). Total num frames: 1060333568. Throughput: 0: 5808.5. Samples: 1060336676. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:41,547][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 04:29:42,165][26022] Updated weights on worker 0-0, policy_version 1035487 (0.00088) [2022-07-11 04:29:43,688][26022] Updated weights on worker 0-0, policy_version 1035497 (0.00085) [2022-07-11 04:29:45,797][26022] Updated weights on worker 0-0, policy_version 1035507 (0.00086) [2022-07-11 04:29:46,555][25689] Fps is (10 sec: 5508.2, 60 sec: 5541.4, 300 sec: 5528.7). Total num frames: 1060364288. Throughput: 0: 5800.4. Samples: 1060370208. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:46,556][25689] Avg episode reward: [(0, '0.289')] [2022-07-11 04:29:47,748][26022] Updated weights on worker 0-0, policy_version 1035517 (0.00353) [2022-07-11 04:29:49,520][26022] Updated weights on worker 0-0, policy_version 1035527 (0.00094) [2022-07-11 04:29:51,287][26022] Updated weights on worker 0-0, policy_version 1035537 (0.00085) [2022-07-11 04:29:51,637][25689] Fps is (10 sec: 5784.3, 60 sec: 5522.2, 300 sec: 5521.8). Total num frames: 1060391936. Throughput: 0: 4961.8. Samples: 1060386770. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:51,638][25689] Avg episode reward: [(0, '-0.306')] [2022-07-11 04:29:53,243][26022] Updated weights on worker 0-0, policy_version 1035547 (0.00089) [2022-07-11 04:29:55,079][26022] Updated weights on worker 0-0, policy_version 1035557 (0.00084) [2022-07-11 04:29:56,642][25689] Fps is (10 sec: 5380.2, 60 sec: 5506.9, 300 sec: 5522.2). Total num frames: 1060418560. Throughput: 0: 5783.4. Samples: 1060419966. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:29:56,643][25689] Avg episode reward: [(0, '0.144')] [2022-07-11 04:29:56,796][26022] Updated weights on worker 0-0, policy_version 1035567 (0.00080) [2022-07-11 04:29:58,520][26022] Updated weights on worker 0-0, policy_version 1035577 (0.00079) [2022-07-11 04:30:00,470][26022] Updated weights on worker 0-0, policy_version 1035587 (0.00083) [2022-07-11 04:30:01,704][25689] Fps is (10 sec: 5493.2, 60 sec: 5536.5, 300 sec: 5535.2). Total num frames: 1060447232. Throughput: 0: 5823.7. Samples: 1060454064. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:01,704][25689] Avg episode reward: [(0, '-0.638')] [2022-07-11 04:30:02,537][26022] Updated weights on worker 0-0, policy_version 1035597 (0.00083) [2022-07-11 04:30:04,470][26022] Updated weights on worker 0-0, policy_version 1035607 (0.00088) [2022-07-11 04:30:06,112][26022] Updated weights on worker 0-0, policy_version 1035617 (0.00089) [2022-07-11 04:30:06,723][25689] Fps is (10 sec: 5485.5, 60 sec: 5519.2, 300 sec: 5527.2). Total num frames: 1060473856. Throughput: 0: 4898.8. Samples: 1060469006. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:06,723][25689] Avg episode reward: [(0, '0.240')] [2022-07-11 04:30:08,151][26022] Updated weights on worker 0-0, policy_version 1035627 (0.00087) [2022-07-11 04:30:10,017][26022] Updated weights on worker 0-0, policy_version 1035637 (0.00088) [2022-07-11 04:30:11,716][26022] Updated weights on worker 0-0, policy_version 1035647 (0.00081) [2022-07-11 04:30:11,831][25689] Fps is (10 sec: 5561.2, 60 sec: 5554.5, 300 sec: 5529.2). Total num frames: 1060503552. Throughput: 0: 5715.8. Samples: 1060502192. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:11,832][25689] Avg episode reward: [(0, '0.801')] [2022-07-11 04:30:13,759][26022] Updated weights on worker 0-0, policy_version 1035657 (0.00089) [2022-07-11 04:30:15,343][26022] Updated weights on worker 0-0, policy_version 1035667 (0.00086) [2022-07-11 04:30:16,853][25689] Fps is (10 sec: 5458.6, 60 sec: 5524.6, 300 sec: 5523.2). Total num frames: 1060529152. Throughput: 0: 5729.8. Samples: 1060535768. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:16,853][25689] Avg episode reward: [(0, '0.224')] [2022-07-11 04:30:17,471][26022] Updated weights on worker 0-0, policy_version 1035677 (0.00088) [2022-07-11 04:30:18,874][26022] Updated weights on worker 0-0, policy_version 1035687 (0.00087) [2022-07-11 04:30:21,058][26022] Updated weights on worker 0-0, policy_version 1035697 (0.00086) [2022-07-11 04:30:21,859][25689] Fps is (10 sec: 5514.3, 60 sec: 5542.0, 300 sec: 5527.0). Total num frames: 1060558848. Throughput: 0: 4894.3. Samples: 1060552710. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:21,860][25689] Avg episode reward: [(0, '0.412')] [2022-07-11 04:30:22,600][26022] Updated weights on worker 0-0, policy_version 1035707 (0.00092) [2022-07-11 04:30:24,626][26022] Updated weights on worker 0-0, policy_version 1035717 (0.00094) [2022-07-11 04:30:26,317][26022] Updated weights on worker 0-0, policy_version 1035727 (0.00088) [2022-07-11 04:30:26,889][25689] Fps is (10 sec: 5815.7, 60 sec: 5558.7, 300 sec: 5531.3). Total num frames: 1060587520. Throughput: 0: 5817.3. Samples: 1060586320. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:26,891][25689] Avg episode reward: [(0, '1.053')] [2022-07-11 04:30:28,578][26022] Updated weights on worker 0-0, policy_version 1035737 (0.00087) [2022-07-11 04:30:30,014][26022] Updated weights on worker 0-0, policy_version 1035747 (0.00094) [2022-07-11 04:30:32,015][25689] Fps is (10 sec: 5343.8, 60 sec: 5517.4, 300 sec: 5522.9). Total num frames: 1060613120. Throughput: 0: 5815.4. Samples: 1060619570. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:32,015][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 04:30:32,143][26022] Updated weights on worker 0-0, policy_version 1035757 (0.00084) [2022-07-11 04:30:33,605][26022] Updated weights on worker 0-0, policy_version 1035767 (0.00086) [2022-07-11 04:30:35,848][26022] Updated weights on worker 0-0, policy_version 1035777 (0.00085) [2022-07-11 04:30:37,046][25689] Fps is (10 sec: 5545.1, 60 sec: 5532.0, 300 sec: 5533.6). Total num frames: 1060643840. Throughput: 0: 4991.3. Samples: 1060636558. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:37,050][25689] Avg episode reward: [(0, '0.536')] [2022-07-11 04:30:37,321][26022] Updated weights on worker 0-0, policy_version 1035787 (0.00087) [2022-07-11 04:30:39,463][26022] Updated weights on worker 0-0, policy_version 1035797 (0.00096) [2022-07-11 04:30:40,847][26022] Updated weights on worker 0-0, policy_version 1035807 (0.00081) [2022-07-11 04:30:42,091][25689] Fps is (10 sec: 5793.0, 60 sec: 5581.4, 300 sec: 5533.5). Total num frames: 1060671488. Throughput: 0: 5822.8. Samples: 1060670516. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:42,091][25689] Avg episode reward: [(0, '0.525')] [2022-07-11 04:30:43,204][26022] Updated weights on worker 0-0, policy_version 1035817 (0.00088) [2022-07-11 04:30:44,463][26022] Updated weights on worker 0-0, policy_version 1035827 (0.00088) [2022-07-11 04:30:46,692][26022] Updated weights on worker 0-0, policy_version 1035837 (0.00084) [2022-07-11 04:30:47,121][25689] Fps is (10 sec: 5386.8, 60 sec: 5511.8, 300 sec: 5526.8). Total num frames: 1060698112. Throughput: 0: 5823.6. Samples: 1060704142. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:47,121][25689] Avg episode reward: [(0, '0.917')] [2022-07-11 04:30:48,250][26022] Updated weights on worker 0-0, policy_version 1035847 (0.00084) [2022-07-11 04:30:50,332][26022] Updated weights on worker 0-0, policy_version 1035857 (0.00088) [2022-07-11 04:30:51,874][26022] Updated weights on worker 0-0, policy_version 1035867 (0.00096) [2022-07-11 04:30:52,183][25689] Fps is (10 sec: 5682.0, 60 sec: 5564.4, 300 sec: 5533.6). Total num frames: 1060728832. Throughput: 0: 5017.1. Samples: 1060720754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:52,183][25689] Avg episode reward: [(0, '-0.009')] [2022-07-11 04:30:54,078][26022] Updated weights on worker 0-0, policy_version 1035877 (0.00085) [2022-07-11 04:30:55,623][26022] Updated weights on worker 0-0, policy_version 1035887 (0.00085) [2022-07-11 04:30:57,209][25689] Fps is (10 sec: 5684.2, 60 sec: 5562.4, 300 sec: 5537.2). Total num frames: 1060755456. Throughput: 0: 5852.1. Samples: 1060754556. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:30:57,210][25689] Avg episode reward: [(0, '0.435')] [2022-07-11 04:30:57,819][26022] Updated weights on worker 0-0, policy_version 1035897 (0.00085) [2022-07-11 04:30:59,164][26022] Updated weights on worker 0-0, policy_version 1035907 (0.00085) [2022-07-11 04:31:01,577][26022] Updated weights on worker 0-0, policy_version 1035917 (0.00087) [2022-07-11 04:31:02,215][25689] Fps is (10 sec: 5205.6, 60 sec: 5516.7, 300 sec: 5530.4). Total num frames: 1060781056. Throughput: 0: 5862.3. Samples: 1060788492. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:02,216][25689] Avg episode reward: [(0, '0.844')] [2022-07-11 04:31:03,224][26022] Updated weights on worker 0-0, policy_version 1035927 (0.00097) [2022-07-11 04:31:05,362][26022] Updated weights on worker 0-0, policy_version 1035937 (0.00083) [2022-07-11 04:31:06,936][26022] Updated weights on worker 0-0, policy_version 1035947 (0.00086) [2022-07-11 04:31:07,219][25689] Fps is (10 sec: 5524.2, 60 sec: 5568.9, 300 sec: 5538.0). Total num frames: 1060810752. Throughput: 0: 4930.3. Samples: 1060803232. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:07,219][25689] Avg episode reward: [(0, '0.591')] [2022-07-11 04:31:09,012][26022] Updated weights on worker 0-0, policy_version 1035957 (0.00089) [2022-07-11 04:31:10,788][26022] Updated weights on worker 0-0, policy_version 1035967 (0.00096) [2022-07-11 04:31:12,256][25689] Fps is (10 sec: 5710.8, 60 sec: 5541.5, 300 sec: 5537.9). Total num frames: 1060838400. Throughput: 0: 5773.6. Samples: 1060836652. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:12,257][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 04:31:12,602][26022] Updated weights on worker 0-0, policy_version 1035977 (0.00084) [2022-07-11 04:31:14,391][26022] Updated weights on worker 0-0, policy_version 1035987 (0.00085) [2022-07-11 04:31:16,244][26022] Updated weights on worker 0-0, policy_version 1035997 (0.00080) [2022-07-11 04:31:17,281][25689] Fps is (10 sec: 5495.6, 60 sec: 5575.2, 300 sec: 5535.3). Total num frames: 1060866048. Throughput: 0: 5769.4. Samples: 1060870358. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:17,281][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 04:31:18,089][26022] Updated weights on worker 0-0, policy_version 1036007 (0.00386) [2022-07-11 04:31:19,983][26022] Updated weights on worker 0-0, policy_version 1036017 (0.00080) [2022-07-11 04:31:21,671][26022] Updated weights on worker 0-0, policy_version 1036027 (0.00090) [2022-07-11 04:31:22,301][25689] Fps is (10 sec: 5606.8, 60 sec: 5556.9, 300 sec: 5539.1). Total num frames: 1060894720. Throughput: 0: 5743.8. Samples: 1060903864. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:22,302][25689] Avg episode reward: [(0, '1.974')] [2022-07-11 04:31:23,782][26022] Updated weights on worker 0-0, policy_version 1036037 (0.00088) [2022-07-11 04:31:25,242][26022] Updated weights on worker 0-0, policy_version 1036047 (0.00085) [2022-07-11 04:31:27,314][25689] Fps is (10 sec: 5409.1, 60 sec: 5507.6, 300 sec: 5536.7). Total num frames: 1060920320. Throughput: 0: 5833.4. Samples: 1060920456. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:27,315][25689] Avg episode reward: [(0, '0.358')] [2022-07-11 04:31:27,536][26022] Updated weights on worker 0-0, policy_version 1036057 (0.00092) [2022-07-11 04:31:28,845][26022] Updated weights on worker 0-0, policy_version 1036067 (0.00088) [2022-07-11 04:31:31,048][26022] Updated weights on worker 0-0, policy_version 1036077 (0.00083) [2022-07-11 04:31:32,384][25689] Fps is (10 sec: 5687.3, 60 sec: 5614.5, 300 sec: 5543.3). Total num frames: 1060952064. Throughput: 0: 5828.2. Samples: 1060953960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:32,386][25689] Avg episode reward: [(0, '0.477')] [2022-07-11 04:31:32,445][26022] Updated weights on worker 0-0, policy_version 1036087 (0.00085) [2022-07-11 04:31:33,880][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:31:33,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001036092_1060958208.pth [2022-07-11 04:31:33,898][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001034146_1058965504.pth [2022-07-11 04:31:34,727][26022] Updated weights on worker 0-0, policy_version 1036097 (0.00082) [2022-07-11 04:31:36,468][26022] Updated weights on worker 0-0, policy_version 1036107 (0.00086) [2022-07-11 04:31:37,391][25689] Fps is (10 sec: 5690.6, 60 sec: 5531.9, 300 sec: 5537.2). Total num frames: 1060977664. Throughput: 0: 5819.8. Samples: 1060987398. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:37,392][25689] Avg episode reward: [(0, '0.020')] [2022-07-11 04:31:38,305][26022] Updated weights on worker 0-0, policy_version 1036117 (0.00615) [2022-07-11 04:31:40,041][26022] Updated weights on worker 0-0, policy_version 1036127 (0.00086) [2022-07-11 04:31:42,250][26022] Updated weights on worker 0-0, policy_version 1036137 (0.00088) [2022-07-11 04:31:42,398][25689] Fps is (10 sec: 5215.3, 60 sec: 5518.4, 300 sec: 5537.3). Total num frames: 1061004288. Throughput: 0: 4988.7. Samples: 1061004118. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:42,398][25689] Avg episode reward: [(0, '0.083')] [2022-07-11 04:31:43,608][26022] Updated weights on worker 0-0, policy_version 1036147 (0.00092) [2022-07-11 04:31:45,948][26022] Updated weights on worker 0-0, policy_version 1036157 (0.00089) [2022-07-11 04:31:47,128][26022] Updated weights on worker 0-0, policy_version 1036167 (0.00088) [2022-07-11 04:31:47,428][25689] Fps is (10 sec: 5815.7, 60 sec: 5603.3, 300 sec: 5540.8). Total num frames: 1061036032. Throughput: 0: 5816.0. Samples: 1061037436. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:47,428][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 04:31:49,531][26022] Updated weights on worker 0-0, policy_version 1036177 (0.00085) [2022-07-11 04:31:50,947][26022] Updated weights on worker 0-0, policy_version 1036187 (0.00093) [2022-07-11 04:31:52,481][25689] Fps is (10 sec: 5585.4, 60 sec: 5502.2, 300 sec: 5529.7). Total num frames: 1061060608. Throughput: 0: 5825.9. Samples: 1061071044. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:52,483][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 04:31:53,072][26022] Updated weights on worker 0-0, policy_version 1036197 (0.00088) [2022-07-11 04:31:54,623][26022] Updated weights on worker 0-0, policy_version 1036207 (0.00090) [2022-07-11 04:31:56,730][26022] Updated weights on worker 0-0, policy_version 1036217 (0.00093) [2022-07-11 04:31:57,508][25689] Fps is (10 sec: 5485.7, 60 sec: 5570.1, 300 sec: 5546.9). Total num frames: 1061091328. Throughput: 0: 5006.5. Samples: 1061088110. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:31:57,510][25689] Avg episode reward: [(0, '1.561')] [2022-07-11 04:31:58,459][26022] Updated weights on worker 0-0, policy_version 1036227 (0.00086) [2022-07-11 04:32:00,408][26022] Updated weights on worker 0-0, policy_version 1036237 (0.00101) [2022-07-11 04:32:02,176][26022] Updated weights on worker 0-0, policy_version 1036247 (0.00096) [2022-07-11 04:32:02,542][25689] Fps is (10 sec: 5598.0, 60 sec: 5567.5, 300 sec: 5539.6). Total num frames: 1061116928. Throughput: 0: 5831.1. Samples: 1061121582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:32:02,547][25689] Avg episode reward: [(0, '0.944')] [2022-07-11 04:32:04,367][26022] Updated weights on worker 0-0, policy_version 1036257 (0.00086) [2022-07-11 04:32:06,185][26022] Updated weights on worker 0-0, policy_version 1036267 (0.00091) [2022-07-11 04:32:07,562][25689] Fps is (10 sec: 5397.8, 60 sec: 5549.1, 300 sec: 5545.5). Total num frames: 1061145600. Throughput: 0: 5738.1. Samples: 1061152970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:32:07,563][25689] Avg episode reward: [(0, '0.407')] [2022-07-11 04:32:07,976][26022] Updated weights on worker 0-0, policy_version 1036277 (0.00092) [2022-07-11 04:32:09,936][26022] Updated weights on worker 0-0, policy_version 1036287 (0.00084) [2022-07-11 04:32:12,029][26022] Updated weights on worker 0-0, policy_version 1036297 (0.00094) [2022-07-11 04:32:12,616][25689] Fps is (10 sec: 5387.6, 60 sec: 5513.7, 300 sec: 5532.2). Total num frames: 1061171200. Throughput: 0: 4884.2. Samples: 1061169384. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:32:12,616][25689] Avg episode reward: [(0, '-0.361')] [2022-07-11 04:32:13,437][26022] Updated weights on worker 0-0, policy_version 1036307 (0.00087) [2022-07-11 04:32:15,627][26022] Updated weights on worker 0-0, policy_version 1036317 (0.00086) [2022-07-11 04:32:16,975][26022] Updated weights on worker 0-0, policy_version 1036327 (0.00087) [2022-07-11 04:32:17,680][25689] Fps is (10 sec: 5364.1, 60 sec: 5527.0, 300 sec: 5534.9). Total num frames: 1061199872. Throughput: 0: 5687.7. Samples: 1061202844. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:32:17,680][25689] Avg episode reward: [(0, '-0.529')] [2022-07-11 04:32:19,269][26022] Updated weights on worker 0-0, policy_version 1036337 (0.00086) [2022-07-11 04:32:20,963][26022] Updated weights on worker 0-0, policy_version 1036347 (0.00093) [2022-07-11 04:32:22,696][25689] Fps is (10 sec: 5688.5, 60 sec: 5527.4, 300 sec: 5538.3). Total num frames: 1061228544. Throughput: 0: 5686.8. Samples: 1061236194. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2022-07-11 04:32:22,698][25689] Avg episode reward: [(0, '-1.845')] [2022-07-11 04:32:22,783][26022] Updated weights on worker 0-0, policy_version 1036357 (0.00084) [2022-07-11 04:32:24,584][26022] Updated weights on worker 0-0, policy_version 1036367 (0.00090) [2022-07-11 04:32:26,388][26022] Updated weights on worker 0-0, policy_version 1036377 (0.00085) [2022-07-11 04:32:27,716][25689] Fps is (10 sec: 5611.7, 60 sec: 5560.6, 300 sec: 5532.3). Total num frames: 1061256192. Throughput: 0: 4967.2. Samples: 1061253078. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:27,718][25689] Avg episode reward: [(0, '-1.754')] [2022-07-11 04:32:28,363][26022] Updated weights on worker 0-0, policy_version 1036387 (0.00090) [2022-07-11 04:32:30,246][26022] Updated weights on worker 0-0, policy_version 1036397 (0.00085) [2022-07-11 04:32:32,083][26022] Updated weights on worker 0-0, policy_version 1036407 (0.00099) [2022-07-11 04:32:32,806][25689] Fps is (10 sec: 5469.4, 60 sec: 5491.0, 300 sec: 5539.0). Total num frames: 1061283840. Throughput: 0: 5789.0. Samples: 1061286268. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:32,807][25689] Avg episode reward: [(0, '-0.185')] [2022-07-11 04:32:33,731][26022] Updated weights on worker 0-0, policy_version 1036417 (0.00083) [2022-07-11 04:32:35,608][26022] Updated weights on worker 0-0, policy_version 1036427 (0.00089) [2022-07-11 04:32:37,444][26022] Updated weights on worker 0-0, policy_version 1036437 (0.00085) [2022-07-11 04:32:37,823][25689] Fps is (10 sec: 5572.5, 60 sec: 5540.9, 300 sec: 5535.6). Total num frames: 1061312512. Throughput: 0: 5810.6. Samples: 1061319888. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:37,823][25689] Avg episode reward: [(0, '-0.239')] [2022-07-11 04:32:39,264][26022] Updated weights on worker 0-0, policy_version 1036447 (0.00088) [2022-07-11 04:32:41,238][26022] Updated weights on worker 0-0, policy_version 1036457 (0.00087) [2022-07-11 04:32:42,843][25689] Fps is (10 sec: 5713.6, 60 sec: 5573.6, 300 sec: 5538.9). Total num frames: 1061341184. Throughput: 0: 4973.6. Samples: 1061336394. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:42,844][25689] Avg episode reward: [(0, '0.521')] [2022-07-11 04:32:43,060][26022] Updated weights on worker 0-0, policy_version 1036467 (0.00091) [2022-07-11 04:32:44,953][26022] Updated weights on worker 0-0, policy_version 1036477 (0.00096) [2022-07-11 04:32:46,936][26022] Updated weights on worker 0-0, policy_version 1036487 (0.00090) [2022-07-11 04:32:47,879][25689] Fps is (10 sec: 5397.1, 60 sec: 5471.4, 300 sec: 5529.6). Total num frames: 1061366784. Throughput: 0: 5777.5. Samples: 1061369568. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:47,880][25689] Avg episode reward: [(0, '0.741')] [2022-07-11 04:32:48,591][26022] Updated weights on worker 0-0, policy_version 1036497 (0.00080) [2022-07-11 04:32:50,687][26022] Updated weights on worker 0-0, policy_version 1036507 (0.00087) [2022-07-11 04:32:52,251][26022] Updated weights on worker 0-0, policy_version 1036517 (0.00087) [2022-07-11 04:32:52,939][25689] Fps is (10 sec: 5375.6, 60 sec: 5538.6, 300 sec: 5532.2). Total num frames: 1061395456. Throughput: 0: 5806.9. Samples: 1061403174. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:52,939][25689] Avg episode reward: [(0, '0.532')] [2022-07-11 04:32:54,218][26022] Updated weights on worker 0-0, policy_version 1036527 (0.00085) [2022-07-11 04:32:55,948][26022] Updated weights on worker 0-0, policy_version 1036537 (0.00089) [2022-07-11 04:32:57,683][26022] Updated weights on worker 0-0, policy_version 1036547 (0.00088) [2022-07-11 04:32:58,020][25689] Fps is (10 sec: 5755.4, 60 sec: 5516.6, 300 sec: 5534.2). Total num frames: 1061425152. Throughput: 0: 4958.8. Samples: 1061420042. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:32:58,021][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 04:32:59,814][26022] Updated weights on worker 0-0, policy_version 1036557 (0.00090) [2022-07-11 04:33:01,329][26022] Updated weights on worker 0-0, policy_version 1036567 (0.00270) [2022-07-11 04:33:03,049][25689] Fps is (10 sec: 5368.1, 60 sec: 5500.2, 300 sec: 5534.2). Total num frames: 1061449728. Throughput: 0: 5796.6. Samples: 1061453520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:03,049][25689] Avg episode reward: [(0, '-0.000')] [2022-07-11 04:33:03,688][26022] Updated weights on worker 0-0, policy_version 1036577 (0.00088) [2022-07-11 04:33:05,491][26022] Updated weights on worker 0-0, policy_version 1036587 (0.00082) [2022-07-11 04:33:07,359][26022] Updated weights on worker 0-0, policy_version 1036597 (0.00086) [2022-07-11 04:33:08,073][25689] Fps is (10 sec: 5296.8, 60 sec: 5499.8, 300 sec: 5541.5). Total num frames: 1061478400. Throughput: 0: 5702.3. Samples: 1061484722. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:08,074][25689] Avg episode reward: [(0, '-0.528')] [2022-07-11 04:33:09,269][26022] Updated weights on worker 0-0, policy_version 1036607 (0.00090) [2022-07-11 04:33:10,964][26022] Updated weights on worker 0-0, policy_version 1036617 (0.00091) [2022-07-11 04:33:12,923][26022] Updated weights on worker 0-0, policy_version 1036627 (0.00090) [2022-07-11 04:33:13,149][25689] Fps is (10 sec: 5677.7, 60 sec: 5548.6, 300 sec: 5540.2). Total num frames: 1061507072. Throughput: 0: 4863.0. Samples: 1061501456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:13,149][25689] Avg episode reward: [(0, '-1.095')] [2022-07-11 04:33:14,717][26022] Updated weights on worker 0-0, policy_version 1036637 (0.00091) [2022-07-11 04:33:16,595][26022] Updated weights on worker 0-0, policy_version 1036647 (0.00086) [2022-07-11 04:33:18,209][25689] Fps is (10 sec: 5556.6, 60 sec: 5532.0, 300 sec: 5539.3). Total num frames: 1061534720. Throughput: 0: 5690.4. Samples: 1061534924. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:18,209][25689] Avg episode reward: [(0, '-1.090')] [2022-07-11 04:33:18,401][26022] Updated weights on worker 0-0, policy_version 1036657 (0.00068) [2022-07-11 04:33:20,228][26022] Updated weights on worker 0-0, policy_version 1036667 (0.00085) [2022-07-11 04:33:22,103][26022] Updated weights on worker 0-0, policy_version 1036677 (0.00085) [2022-07-11 04:33:23,309][25689] Fps is (10 sec: 5543.1, 60 sec: 5524.4, 300 sec: 5537.6). Total num frames: 1061563392. Throughput: 0: 5678.4. Samples: 1061568566. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:23,310][25689] Avg episode reward: [(0, '-1.008')] [2022-07-11 04:33:23,746][26022] Updated weights on worker 0-0, policy_version 1036687 (0.00086) [2022-07-11 04:33:25,681][26022] Updated weights on worker 0-0, policy_version 1036697 (0.00091) [2022-07-11 04:33:27,534][26022] Updated weights on worker 0-0, policy_version 1036707 (0.00086) [2022-07-11 04:33:28,321][25689] Fps is (10 sec: 5670.7, 60 sec: 5542.0, 300 sec: 5538.5). Total num frames: 1061592064. Throughput: 0: 4961.8. Samples: 1061585188. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:28,322][25689] Avg episode reward: [(0, '-1.411')] [2022-07-11 04:33:29,537][26022] Updated weights on worker 0-0, policy_version 1036717 (0.00088) [2022-07-11 04:33:31,254][26022] Updated weights on worker 0-0, policy_version 1036727 (0.00084) [2022-07-11 04:33:33,175][26022] Updated weights on worker 0-0, policy_version 1036737 (0.00084) [2022-07-11 04:33:33,385][25689] Fps is (10 sec: 5589.5, 60 sec: 5544.4, 300 sec: 5538.4). Total num frames: 1061619712. Throughput: 0: 5771.3. Samples: 1061618246. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:33,386][25689] Avg episode reward: [(0, '-0.468')] [2022-07-11 04:33:33,948][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:33:33,962][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001036741_1061622784.pth [2022-07-11 04:33:33,962][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001034794_1059629056.pth [2022-07-11 04:33:35,034][26022] Updated weights on worker 0-0, policy_version 1036747 (0.00088) [2022-07-11 04:33:36,791][26022] Updated weights on worker 0-0, policy_version 1036757 (0.00100) [2022-07-11 04:33:38,448][25689] Fps is (10 sec: 5460.2, 60 sec: 5523.2, 300 sec: 5537.4). Total num frames: 1061647360. Throughput: 0: 5784.0. Samples: 1061651988. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:38,449][25689] Avg episode reward: [(0, '-0.223')] [2022-07-11 04:33:38,784][26022] Updated weights on worker 0-0, policy_version 1036767 (0.00084) [2022-07-11 04:33:40,411][26022] Updated weights on worker 0-0, policy_version 1036777 (0.00095) [2022-07-11 04:33:42,447][26022] Updated weights on worker 0-0, policy_version 1036787 (0.00084) [2022-07-11 04:33:43,451][25689] Fps is (10 sec: 5493.3, 60 sec: 5507.9, 300 sec: 5534.6). Total num frames: 1061675008. Throughput: 0: 4979.5. Samples: 1061668864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:43,455][25689] Avg episode reward: [(0, '-0.073')] [2022-07-11 04:33:44,103][26022] Updated weights on worker 0-0, policy_version 1036797 (0.00082) [2022-07-11 04:33:45,962][26022] Updated weights on worker 0-0, policy_version 1036807 (0.00091) [2022-07-11 04:33:47,898][26022] Updated weights on worker 0-0, policy_version 1036817 (0.00089) [2022-07-11 04:33:48,463][25689] Fps is (10 sec: 5623.5, 60 sec: 5560.7, 300 sec: 5535.5). Total num frames: 1061703680. Throughput: 0: 5808.4. Samples: 1061702182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:48,464][25689] Avg episode reward: [(0, '-0.050')] [2022-07-11 04:33:49,762][26022] Updated weights on worker 0-0, policy_version 1036827 (0.00611) [2022-07-11 04:33:51,680][26022] Updated weights on worker 0-0, policy_version 1036837 (0.00091) [2022-07-11 04:33:53,350][26022] Updated weights on worker 0-0, policy_version 1036847 (0.00088) [2022-07-11 04:33:53,525][25689] Fps is (10 sec: 5692.5, 60 sec: 5560.6, 300 sec: 5538.2). Total num frames: 1061732352. Throughput: 0: 5806.7. Samples: 1061735192. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:53,525][25689] Avg episode reward: [(0, '0.589')] [2022-07-11 04:33:55,443][26022] Updated weights on worker 0-0, policy_version 1036857 (0.00079) [2022-07-11 04:33:56,975][26022] Updated weights on worker 0-0, policy_version 1036867 (0.00086) [2022-07-11 04:33:58,545][25689] Fps is (10 sec: 5484.9, 60 sec: 5515.5, 300 sec: 5538.1). Total num frames: 1061758976. Throughput: 0: 5817.5. Samples: 1061768900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:33:58,545][25689] Avg episode reward: [(0, '1.160')] [2022-07-11 04:33:58,902][26022] Updated weights on worker 0-0, policy_version 1036877 (0.00093) [2022-07-11 04:34:00,607][26022] Updated weights on worker 0-0, policy_version 1036887 (0.00091) [2022-07-11 04:34:02,763][26022] Updated weights on worker 0-0, policy_version 1036897 (0.00082) [2022-07-11 04:34:03,561][25689] Fps is (10 sec: 5305.6, 60 sec: 5550.5, 300 sec: 5534.6). Total num frames: 1061785600. Throughput: 0: 5811.2. Samples: 1061785726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:03,563][25689] Avg episode reward: [(0, '1.615')] [2022-07-11 04:34:04,836][26022] Updated weights on worker 0-0, policy_version 1036907 (0.00085) [2022-07-11 04:34:06,649][26022] Updated weights on worker 0-0, policy_version 1036917 (0.00088) [2022-07-11 04:34:08,334][26022] Updated weights on worker 0-0, policy_version 1036927 (0.00080) [2022-07-11 04:34:08,593][25689] Fps is (10 sec: 5605.1, 60 sec: 5566.7, 300 sec: 5543.2). Total num frames: 1061815296. Throughput: 0: 5718.0. Samples: 1061817282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:08,593][25689] Avg episode reward: [(0, '1.613')] [2022-07-11 04:34:10,362][26022] Updated weights on worker 0-0, policy_version 1036937 (0.00089) [2022-07-11 04:34:11,798][26022] Updated weights on worker 0-0, policy_version 1036947 (0.00086) [2022-07-11 04:34:13,642][25689] Fps is (10 sec: 5485.4, 60 sec: 5518.4, 300 sec: 5536.7). Total num frames: 1061840896. Throughput: 0: 5754.9. Samples: 1061850962. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:13,642][25689] Avg episode reward: [(0, '1.478')] [2022-07-11 04:34:14,159][26022] Updated weights on worker 0-0, policy_version 1036957 (0.00061) [2022-07-11 04:34:15,596][26022] Updated weights on worker 0-0, policy_version 1036967 (0.00089) [2022-07-11 04:34:17,680][26022] Updated weights on worker 0-0, policy_version 1036977 (0.00091) [2022-07-11 04:34:18,651][25689] Fps is (10 sec: 5497.6, 60 sec: 5556.9, 300 sec: 5540.1). Total num frames: 1061870592. Throughput: 0: 4902.6. Samples: 1061867472. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:18,652][25689] Avg episode reward: [(0, '1.831')] [2022-07-11 04:34:19,429][26022] Updated weights on worker 0-0, policy_version 1036987 (0.00112) [2022-07-11 04:34:21,177][26022] Updated weights on worker 0-0, policy_version 1036997 (0.00088) [2022-07-11 04:34:23,207][26022] Updated weights on worker 0-0, policy_version 1037007 (0.00084) [2022-07-11 04:34:23,685][25689] Fps is (10 sec: 5607.4, 60 sec: 5529.0, 300 sec: 5536.6). Total num frames: 1061897216. Throughput: 0: 5732.3. Samples: 1061901086. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:23,686][25689] Avg episode reward: [(0, '2.140')] [2022-07-11 04:34:24,835][26022] Updated weights on worker 0-0, policy_version 1037017 (0.00087) [2022-07-11 04:34:26,814][26022] Updated weights on worker 0-0, policy_version 1037027 (0.00100) [2022-07-11 04:34:28,763][25689] Fps is (10 sec: 5367.3, 60 sec: 5506.1, 300 sec: 5535.9). Total num frames: 1061924864. Throughput: 0: 5800.1. Samples: 1061934270. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:28,763][25689] Avg episode reward: [(0, '2.163')] [2022-07-11 04:34:28,767][26022] Updated weights on worker 0-0, policy_version 1037037 (0.00086) [2022-07-11 04:34:30,449][26022] Updated weights on worker 0-0, policy_version 1037047 (0.00092) [2022-07-11 04:34:32,452][26022] Updated weights on worker 0-0, policy_version 1037057 (0.00091) [2022-07-11 04:34:33,825][25689] Fps is (10 sec: 5655.5, 60 sec: 5540.2, 300 sec: 5534.9). Total num frames: 1061954560. Throughput: 0: 4954.7. Samples: 1061950964. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:33,825][25689] Avg episode reward: [(0, '0.902')] [2022-07-11 04:34:34,165][26022] Updated weights on worker 0-0, policy_version 1037067 (0.00087) [2022-07-11 04:34:36,039][26022] Updated weights on worker 0-0, policy_version 1037077 (0.00089) [2022-07-11 04:34:38,014][26022] Updated weights on worker 0-0, policy_version 1037087 (0.00086) [2022-07-11 04:34:38,844][25689] Fps is (10 sec: 5789.6, 60 sec: 5561.2, 300 sec: 5548.9). Total num frames: 1061983232. Throughput: 0: 5814.5. Samples: 1061984884. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:38,845][25689] Avg episode reward: [(0, '0.821')] [2022-07-11 04:34:39,530][26022] Updated weights on worker 0-0, policy_version 1037097 (0.00093) [2022-07-11 04:34:41,452][26022] Updated weights on worker 0-0, policy_version 1037107 (0.00089) [2022-07-11 04:34:43,279][26022] Updated weights on worker 0-0, policy_version 1037117 (0.00449) [2022-07-11 04:34:43,851][25689] Fps is (10 sec: 5515.3, 60 sec: 5543.9, 300 sec: 5535.1). Total num frames: 1062009856. Throughput: 0: 5838.2. Samples: 1062018816. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:43,851][25689] Avg episode reward: [(0, '0.761')] [2022-07-11 04:34:45,224][26022] Updated weights on worker 0-0, policy_version 1037127 (0.00089) [2022-07-11 04:34:47,046][26022] Updated weights on worker 0-0, policy_version 1037137 (0.00092) [2022-07-11 04:34:48,855][25689] Fps is (10 sec: 5421.2, 60 sec: 5527.6, 300 sec: 5536.6). Total num frames: 1062037504. Throughput: 0: 5022.9. Samples: 1062035194. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:48,856][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 04:34:48,961][26022] Updated weights on worker 0-0, policy_version 1037147 (0.00084) [2022-07-11 04:34:50,782][26022] Updated weights on worker 0-0, policy_version 1037157 (0.00098) [2022-07-11 04:34:52,511][26022] Updated weights on worker 0-0, policy_version 1037167 (0.00084) [2022-07-11 04:34:53,927][25689] Fps is (10 sec: 5589.5, 60 sec: 5526.7, 300 sec: 5542.2). Total num frames: 1062066176. Throughput: 0: 5853.2. Samples: 1062068624. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:53,927][25689] Avg episode reward: [(0, '0.994')] [2022-07-11 04:34:54,254][26022] Updated weights on worker 0-0, policy_version 1037177 (0.00085) [2022-07-11 04:34:56,134][26022] Updated weights on worker 0-0, policy_version 1037187 (0.00097) [2022-07-11 04:34:57,835][26022] Updated weights on worker 0-0, policy_version 1037197 (0.00086) [2022-07-11 04:34:58,930][25689] Fps is (10 sec: 5590.3, 60 sec: 5545.2, 300 sec: 5539.9). Total num frames: 1062093824. Throughput: 0: 5844.0. Samples: 1062102264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:34:58,930][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 04:34:59,998][26022] Updated weights on worker 0-0, policy_version 1037207 (0.00087) [2022-07-11 04:35:02,240][26022] Updated weights on worker 0-0, policy_version 1037217 (0.00095) [2022-07-11 04:35:03,860][26022] Updated weights on worker 0-0, policy_version 1037227 (0.00084) [2022-07-11 04:35:03,947][25689] Fps is (10 sec: 5416.5, 60 sec: 5545.2, 300 sec: 5539.9). Total num frames: 1062120448. Throughput: 0: 4956.8. Samples: 1062118428. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:03,947][25689] Avg episode reward: [(0, '1.948')] [2022-07-11 04:35:05,667][26022] Updated weights on worker 0-0, policy_version 1037237 (0.00090) [2022-07-11 04:35:07,474][26022] Updated weights on worker 0-0, policy_version 1037247 (0.00083) [2022-07-11 04:35:08,972][25689] Fps is (10 sec: 5404.3, 60 sec: 5511.8, 300 sec: 5534.6). Total num frames: 1062148096. Throughput: 0: 5747.6. Samples: 1062150818. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:08,975][25689] Avg episode reward: [(0, '1.502')] [2022-07-11 04:35:09,424][26022] Updated weights on worker 0-0, policy_version 1037257 (0.00088) [2022-07-11 04:35:11,252][26022] Updated weights on worker 0-0, policy_version 1037267 (0.00092) [2022-07-11 04:35:13,178][26022] Updated weights on worker 0-0, policy_version 1037277 (0.00095) [2022-07-11 04:35:14,103][25689] Fps is (10 sec: 5545.4, 60 sec: 5555.2, 300 sec: 5542.9). Total num frames: 1062176768. Throughput: 0: 5736.0. Samples: 1062184352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:14,104][25689] Avg episode reward: [(0, '0.522')] [2022-07-11 04:35:15,007][26022] Updated weights on worker 0-0, policy_version 1037287 (0.00093) [2022-07-11 04:35:16,729][26022] Updated weights on worker 0-0, policy_version 1037297 (0.00087) [2022-07-11 04:35:18,503][26022] Updated weights on worker 0-0, policy_version 1037307 (0.00091) [2022-07-11 04:35:19,118][25689] Fps is (10 sec: 5752.8, 60 sec: 5554.6, 300 sec: 5542.7). Total num frames: 1062206464. Throughput: 0: 4886.4. Samples: 1062200912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:19,119][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 04:35:20,576][26022] Updated weights on worker 0-0, policy_version 1037317 (0.00088) [2022-07-11 04:35:21,962][26022] Updated weights on worker 0-0, policy_version 1037327 (0.00092) [2022-07-11 04:35:24,140][25689] Fps is (10 sec: 5611.2, 60 sec: 5555.8, 300 sec: 5536.0). Total num frames: 1062233088. Throughput: 0: 5762.5. Samples: 1062234792. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:24,140][25689] Avg episode reward: [(0, '-0.081')] [2022-07-11 04:35:24,141][26022] Updated weights on worker 0-0, policy_version 1037337 (0.00086) [2022-07-11 04:35:25,778][26022] Updated weights on worker 0-0, policy_version 1037347 (0.00066) [2022-07-11 04:35:27,636][26022] Updated weights on worker 0-0, policy_version 1037357 (0.00087) [2022-07-11 04:35:29,175][25689] Fps is (10 sec: 5396.3, 60 sec: 5559.6, 300 sec: 5544.5). Total num frames: 1062260736. Throughput: 0: 5827.9. Samples: 1062268560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:29,176][25689] Avg episode reward: [(0, '0.025')] [2022-07-11 04:35:29,424][26022] Updated weights on worker 0-0, policy_version 1037367 (0.00091) [2022-07-11 04:35:31,245][26022] Updated weights on worker 0-0, policy_version 1037377 (0.00088) [2022-07-11 04:35:33,142][26022] Updated weights on worker 0-0, policy_version 1037387 (0.00086) [2022-07-11 04:35:34,071][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:35:34,081][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001037393_1062290432.pth [2022-07-11 04:35:34,081][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001035443_1060293632.pth [2022-07-11 04:35:34,249][25689] Fps is (10 sec: 5672.3, 60 sec: 5558.5, 300 sec: 5540.3). Total num frames: 1062290432. Throughput: 0: 5010.0. Samples: 1062285286. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:34,250][25689] Avg episode reward: [(0, '-0.833')] [2022-07-11 04:35:35,160][26022] Updated weights on worker 0-0, policy_version 1037397 (0.00092) [2022-07-11 04:35:36,687][26022] Updated weights on worker 0-0, policy_version 1037407 (0.00089) [2022-07-11 04:35:38,743][26022] Updated weights on worker 0-0, policy_version 1037417 (0.00082) [2022-07-11 04:35:39,279][25689] Fps is (10 sec: 5675.7, 60 sec: 5540.7, 300 sec: 5540.6). Total num frames: 1062318080. Throughput: 0: 5865.4. Samples: 1062319162. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:39,279][25689] Avg episode reward: [(0, '-0.533')] [2022-07-11 04:35:40,290][26022] Updated weights on worker 0-0, policy_version 1037427 (0.00086) [2022-07-11 04:35:42,287][26022] Updated weights on worker 0-0, policy_version 1037437 (0.00095) [2022-07-11 04:35:44,063][26022] Updated weights on worker 0-0, policy_version 1037447 (0.00082) [2022-07-11 04:35:44,285][25689] Fps is (10 sec: 5612.1, 60 sec: 5574.6, 300 sec: 5547.9). Total num frames: 1062346752. Throughput: 0: 5861.8. Samples: 1062352878. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:44,285][25689] Avg episode reward: [(0, '0.346')] [2022-07-11 04:35:45,988][26022] Updated weights on worker 0-0, policy_version 1037457 (0.00091) [2022-07-11 04:35:47,687][26022] Updated weights on worker 0-0, policy_version 1037467 (0.00087) [2022-07-11 04:35:49,298][25689] Fps is (10 sec: 5723.1, 60 sec: 5590.7, 300 sec: 5541.9). Total num frames: 1062375424. Throughput: 0: 5014.7. Samples: 1062369472. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:49,299][25689] Avg episode reward: [(0, '1.050')] [2022-07-11 04:35:49,467][26022] Updated weights on worker 0-0, policy_version 1037477 (0.00085) [2022-07-11 04:35:51,418][26022] Updated weights on worker 0-0, policy_version 1037487 (0.00092) [2022-07-11 04:35:53,319][26022] Updated weights on worker 0-0, policy_version 1037497 (0.00086) [2022-07-11 04:35:54,436][25689] Fps is (10 sec: 5548.0, 60 sec: 5567.7, 300 sec: 5543.3). Total num frames: 1062403072. Throughput: 0: 5836.9. Samples: 1062403114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:54,437][25689] Avg episode reward: [(0, '1.105')] [2022-07-11 04:35:55,129][26022] Updated weights on worker 0-0, policy_version 1037507 (0.00091) [2022-07-11 04:35:56,912][26022] Updated weights on worker 0-0, policy_version 1037517 (0.00090) [2022-07-11 04:35:58,672][26022] Updated weights on worker 0-0, policy_version 1037527 (0.00087) [2022-07-11 04:35:59,443][25689] Fps is (10 sec: 5450.9, 60 sec: 5567.3, 300 sec: 5550.2). Total num frames: 1062430720. Throughput: 0: 5838.6. Samples: 1062436892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:35:59,443][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 04:36:00,496][26022] Updated weights on worker 0-0, policy_version 1037537 (0.00080) [2022-07-11 04:36:02,739][26022] Updated weights on worker 0-0, policy_version 1037547 (0.00082) [2022-07-11 04:36:04,452][25689] Fps is (10 sec: 5418.8, 60 sec: 5568.1, 300 sec: 5539.7). Total num frames: 1062457344. Throughput: 0: 4932.5. Samples: 1062452352. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:36:04,452][25689] Avg episode reward: [(0, '2.098')] [2022-07-11 04:36:04,493][26022] Updated weights on worker 0-0, policy_version 1037557 (0.00092) [2022-07-11 04:36:06,479][26022] Updated weights on worker 0-0, policy_version 1037567 (0.00093) [2022-07-11 04:36:08,244][26022] Updated weights on worker 0-0, policy_version 1037577 (0.00089) [2022-07-11 04:36:09,472][25689] Fps is (10 sec: 5513.3, 60 sec: 5585.4, 300 sec: 5543.5). Total num frames: 1062486016. Throughput: 0: 5759.6. Samples: 1062485666. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:36:09,473][25689] Avg episode reward: [(0, '2.110')] [2022-07-11 04:36:10,025][26022] Updated weights on worker 0-0, policy_version 1037587 (0.00086) [2022-07-11 04:36:11,873][26022] Updated weights on worker 0-0, policy_version 1037597 (0.00087) [2022-07-11 04:36:13,728][26022] Updated weights on worker 0-0, policy_version 1037607 (0.00086) [2022-07-11 04:36:14,605][25689] Fps is (10 sec: 5546.7, 60 sec: 5568.3, 300 sec: 5541.5). Total num frames: 1062513664. Throughput: 0: 5750.5. Samples: 1062519096. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:36:14,606][25689] Avg episode reward: [(0, '1.345')] [2022-07-11 04:36:15,485][26022] Updated weights on worker 0-0, policy_version 1037617 (0.00086) [2022-07-11 04:36:17,305][26022] Updated weights on worker 0-0, policy_version 1037627 (0.00086) [2022-07-11 04:36:19,406][26022] Updated weights on worker 0-0, policy_version 1037637 (0.00086) [2022-07-11 04:36:19,678][25689] Fps is (10 sec: 5518.5, 60 sec: 5546.1, 300 sec: 5540.5). Total num frames: 1062542336. Throughput: 0: 4883.7. Samples: 1062535714. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:36:19,679][25689] Avg episode reward: [(0, '1.210')] [2022-07-11 04:36:20,827][26022] Updated weights on worker 0-0, policy_version 1037647 (0.00091) [2022-07-11 04:36:23,071][26022] Updated weights on worker 0-0, policy_version 1037657 (0.00087) [2022-07-11 04:36:24,563][26022] Updated weights on worker 0-0, policy_version 1037667 (0.00095) [2022-07-11 04:36:24,762][25689] Fps is (10 sec: 5645.8, 60 sec: 5574.2, 300 sec: 5549.5). Total num frames: 1062571008. Throughput: 0: 5746.8. Samples: 1062569074. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 04:36:24,763][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 04:36:26,636][26022] Updated weights on worker 0-0, policy_version 1037677 (0.00081) [2022-07-11 04:36:28,472][26022] Updated weights on worker 0-0, policy_version 1037687 (0.00087) [2022-07-11 04:36:29,837][25689] Fps is (10 sec: 5644.6, 60 sec: 5587.4, 300 sec: 5539.1). Total num frames: 1062599680. Throughput: 0: 5737.8. Samples: 1062602514. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:36:29,838][25689] Avg episode reward: [(0, '0.846')] [2022-07-11 04:36:30,071][26022] Updated weights on worker 0-0, policy_version 1037697 (0.00097) [2022-07-11 04:36:32,172][26022] Updated weights on worker 0-0, policy_version 1037707 (0.00084) [2022-07-11 04:36:33,968][26022] Updated weights on worker 0-0, policy_version 1037717 (0.00092) [2022-07-11 04:36:34,952][25689] Fps is (10 sec: 5527.3, 60 sec: 5550.0, 300 sec: 5544.0). Total num frames: 1062627328. Throughput: 0: 5747.6. Samples: 1062636040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:36:34,953][25689] Avg episode reward: [(0, '-0.187')] [2022-07-11 04:36:35,809][26022] Updated weights on worker 0-0, policy_version 1037727 (0.00084) [2022-07-11 04:36:37,638][26022] Updated weights on worker 0-0, policy_version 1037737 (0.00081) [2022-07-11 04:36:39,297][26022] Updated weights on worker 0-0, policy_version 1037747 (0.00082) [2022-07-11 04:36:39,965][25689] Fps is (10 sec: 5459.7, 60 sec: 5551.4, 300 sec: 5547.3). Total num frames: 1062654976. Throughput: 0: 5769.2. Samples: 1062652754. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:36:39,966][25689] Avg episode reward: [(0, '-1.296')] [2022-07-11 04:36:41,418][26022] Updated weights on worker 0-0, policy_version 1037757 (0.00093) [2022-07-11 04:36:43,219][26022] Updated weights on worker 0-0, policy_version 1037767 (0.00093) [2022-07-11 04:36:44,967][25689] Fps is (10 sec: 5623.2, 60 sec: 5551.8, 300 sec: 5537.5). Total num frames: 1062683648. Throughput: 0: 5801.2. Samples: 1062686288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:36:44,968][25689] Avg episode reward: [(0, '-1.200')] [2022-07-11 04:36:44,972][26022] Updated weights on worker 0-0, policy_version 1037777 (0.00089) [2022-07-11 04:36:46,881][26022] Updated weights on worker 0-0, policy_version 1037787 (0.00084) [2022-07-11 04:36:48,866][26022] Updated weights on worker 0-0, policy_version 1037797 (0.00090) [2022-07-11 04:36:50,021][25689] Fps is (10 sec: 5498.7, 60 sec: 5514.4, 300 sec: 5544.3). Total num frames: 1062710272. Throughput: 0: 5799.1. Samples: 1062719564. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:36:50,022][25689] Avg episode reward: [(0, '-2.579')] [2022-07-11 04:36:50,496][26022] Updated weights on worker 0-0, policy_version 1037807 (0.00090) [2022-07-11 04:36:52,319][26022] Updated weights on worker 0-0, policy_version 1037817 (0.00087) [2022-07-11 04:36:54,242][26022] Updated weights on worker 0-0, policy_version 1037827 (0.00081) [2022-07-11 04:36:55,081][25689] Fps is (10 sec: 5568.9, 60 sec: 5555.3, 300 sec: 5540.3). Total num frames: 1062739968. Throughput: 0: 4987.2. Samples: 1062736426. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:36:55,081][25689] Avg episode reward: [(0, '-2.822')] [2022-07-11 04:36:56,039][26022] Updated weights on worker 0-0, policy_version 1037837 (0.00095) [2022-07-11 04:36:58,031][26022] Updated weights on worker 0-0, policy_version 1037847 (0.00087) [2022-07-11 04:36:59,436][26022] Updated weights on worker 0-0, policy_version 1037857 (0.00092) [2022-07-11 04:37:00,175][25689] Fps is (10 sec: 5748.5, 60 sec: 5564.1, 300 sec: 5549.5). Total num frames: 1062768640. Throughput: 0: 5819.6. Samples: 1062770366. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:00,175][25689] Avg episode reward: [(0, '-2.134')] [2022-07-11 04:37:01,919][26022] Updated weights on worker 0-0, policy_version 1037867 (0.00060) [2022-07-11 04:37:03,527][26022] Updated weights on worker 0-0, policy_version 1037877 (0.00093) [2022-07-11 04:37:05,257][25689] Fps is (10 sec: 5433.9, 60 sec: 5557.4, 300 sec: 5541.5). Total num frames: 1062795264. Throughput: 0: 5708.5. Samples: 1062802110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:05,258][25689] Avg episode reward: [(0, '-1.330')] [2022-07-11 04:37:05,555][26022] Updated weights on worker 0-0, policy_version 1037887 (0.00081) [2022-07-11 04:37:07,248][26022] Updated weights on worker 0-0, policy_version 1037897 (0.00093) [2022-07-11 04:37:09,031][26022] Updated weights on worker 0-0, policy_version 1037907 (0.00086) [2022-07-11 04:37:10,346][25689] Fps is (10 sec: 5336.1, 60 sec: 5534.4, 300 sec: 5547.7). Total num frames: 1062822912. Throughput: 0: 4890.1. Samples: 1062818952. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:10,346][25689] Avg episode reward: [(0, '-0.578')] [2022-07-11 04:37:10,850][26022] Updated weights on worker 0-0, policy_version 1037917 (0.00082) [2022-07-11 04:37:12,793][26022] Updated weights on worker 0-0, policy_version 1037927 (0.00085) [2022-07-11 04:37:14,529][26022] Updated weights on worker 0-0, policy_version 1037937 (0.00089) [2022-07-11 04:37:15,422][25689] Fps is (10 sec: 5540.7, 60 sec: 5556.4, 300 sec: 5547.5). Total num frames: 1062851584. Throughput: 0: 5725.0. Samples: 1062852878. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:15,422][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 04:37:16,380][26022] Updated weights on worker 0-0, policy_version 1037947 (0.00089) [2022-07-11 04:37:18,256][26022] Updated weights on worker 0-0, policy_version 1037957 (0.00092) [2022-07-11 04:37:20,071][26022] Updated weights on worker 0-0, policy_version 1037967 (0.00086) [2022-07-11 04:37:20,459][25689] Fps is (10 sec: 5568.7, 60 sec: 5542.8, 300 sec: 5543.6). Total num frames: 1062879232. Throughput: 0: 5723.6. Samples: 1062886466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:20,461][25689] Avg episode reward: [(0, '0.554')] [2022-07-11 04:37:21,939][26022] Updated weights on worker 0-0, policy_version 1037977 (0.00090) [2022-07-11 04:37:23,744][26022] Updated weights on worker 0-0, policy_version 1037987 (0.00086) [2022-07-11 04:37:25,509][25689] Fps is (10 sec: 5685.0, 60 sec: 5562.8, 300 sec: 5550.0). Total num frames: 1062908928. Throughput: 0: 5002.3. Samples: 1062903414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:25,510][25689] Avg episode reward: [(0, '0.347')] [2022-07-11 04:37:25,511][26022] Updated weights on worker 0-0, policy_version 1037997 (0.00108) [2022-07-11 04:37:27,258][26022] Updated weights on worker 0-0, policy_version 1038007 (0.00273) [2022-07-11 04:37:29,258][26022] Updated weights on worker 0-0, policy_version 1038017 (0.00087) [2022-07-11 04:37:30,553][25689] Fps is (10 sec: 5579.6, 60 sec: 5531.9, 300 sec: 5547.4). Total num frames: 1062935552. Throughput: 0: 5843.3. Samples: 1062937030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:30,555][25689] Avg episode reward: [(0, '-0.156')] [2022-07-11 04:37:31,014][26022] Updated weights on worker 0-0, policy_version 1038027 (0.00095) [2022-07-11 04:37:33,230][26022] Updated weights on worker 0-0, policy_version 1038037 (0.00086) [2022-07-11 04:37:34,296][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:37:34,310][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001038044_1062957056.pth [2022-07-11 04:37:34,310][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001036092_1060958208.pth [2022-07-11 04:37:34,574][26022] Updated weights on worker 0-0, policy_version 1038047 (0.00085) [2022-07-11 04:37:35,630][25689] Fps is (10 sec: 5463.1, 60 sec: 5552.2, 300 sec: 5546.2). Total num frames: 1062964224. Throughput: 0: 5819.9. Samples: 1062970490. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:35,632][25689] Avg episode reward: [(0, '0.572')] [2022-07-11 04:37:36,707][26022] Updated weights on worker 0-0, policy_version 1038057 (0.00094) [2022-07-11 04:37:38,356][26022] Updated weights on worker 0-0, policy_version 1038067 (0.00093) [2022-07-11 04:37:40,097][26022] Updated weights on worker 0-0, policy_version 1038077 (0.00094) [2022-07-11 04:37:40,639][25689] Fps is (10 sec: 5787.2, 60 sec: 5586.4, 300 sec: 5549.9). Total num frames: 1062993920. Throughput: 0: 5004.9. Samples: 1062987462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:40,639][25689] Avg episode reward: [(0, '0.233')] [2022-07-11 04:37:42,130][26022] Updated weights on worker 0-0, policy_version 1038087 (0.00100) [2022-07-11 04:37:43,485][26022] Updated weights on worker 0-0, policy_version 1038097 (0.00086) [2022-07-11 04:37:45,641][25689] Fps is (10 sec: 5626.2, 60 sec: 5552.6, 300 sec: 5554.0). Total num frames: 1063020544. Throughput: 0: 5859.9. Samples: 1063021386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:45,641][25689] Avg episode reward: [(0, '0.061')] [2022-07-11 04:37:45,716][26022] Updated weights on worker 0-0, policy_version 1038107 (0.00094) [2022-07-11 04:37:47,843][26022] Updated weights on worker 0-0, policy_version 1038117 (0.00094) [2022-07-11 04:37:49,276][26022] Updated weights on worker 0-0, policy_version 1038127 (0.00102) [2022-07-11 04:37:50,657][25689] Fps is (10 sec: 5519.7, 60 sec: 5589.9, 300 sec: 5554.8). Total num frames: 1063049216. Throughput: 0: 5836.7. Samples: 1063054368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:50,663][25689] Avg episode reward: [(0, '1.059')] [2022-07-11 04:37:51,584][26022] Updated weights on worker 0-0, policy_version 1038137 (0.00089) [2022-07-11 04:37:52,762][26022] Updated weights on worker 0-0, policy_version 1038147 (0.00089) [2022-07-11 04:37:55,064][26022] Updated weights on worker 0-0, policy_version 1038157 (0.00080) [2022-07-11 04:37:55,727][25689] Fps is (10 sec: 5685.3, 60 sec: 5572.0, 300 sec: 5551.6). Total num frames: 1063077888. Throughput: 0: 5005.3. Samples: 1063071080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:37:55,727][25689] Avg episode reward: [(0, '2.073')] [2022-07-11 04:37:56,829][26022] Updated weights on worker 0-0, policy_version 1038167 (0.00086) [2022-07-11 04:37:58,481][26022] Updated weights on worker 0-0, policy_version 1038177 (0.00085) [2022-07-11 04:38:00,477][26022] Updated weights on worker 0-0, policy_version 1038187 (0.00091) [2022-07-11 04:38:00,806][25689] Fps is (10 sec: 5549.1, 60 sec: 5556.5, 300 sec: 5560.9). Total num frames: 1063105536. Throughput: 0: 5830.6. Samples: 1063105050. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:00,806][25689] Avg episode reward: [(0, '1.825')] [2022-07-11 04:38:02,452][26022] Updated weights on worker 0-0, policy_version 1038197 (0.00087) [2022-07-11 04:38:04,312][26022] Updated weights on worker 0-0, policy_version 1038207 (0.00086) [2022-07-11 04:38:05,847][25689] Fps is (10 sec: 5363.0, 60 sec: 5560.3, 300 sec: 5553.7). Total num frames: 1063132160. Throughput: 0: 5718.8. Samples: 1063136940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:05,847][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 04:38:06,218][26022] Updated weights on worker 0-0, policy_version 1038217 (0.00091) [2022-07-11 04:38:08,053][26022] Updated weights on worker 0-0, policy_version 1038227 (0.00086) [2022-07-11 04:38:09,805][26022] Updated weights on worker 0-0, policy_version 1038237 (0.00093) [2022-07-11 04:38:10,867][25689] Fps is (10 sec: 5292.6, 60 sec: 5549.6, 300 sec: 5547.9). Total num frames: 1063158784. Throughput: 0: 4916.1. Samples: 1063153724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:10,867][25689] Avg episode reward: [(0, '0.148')] [2022-07-11 04:38:11,672][26022] Updated weights on worker 0-0, policy_version 1038247 (0.00087) [2022-07-11 04:38:13,533][26022] Updated weights on worker 0-0, policy_version 1038257 (0.00103) [2022-07-11 04:38:15,311][26022] Updated weights on worker 0-0, policy_version 1038267 (0.00095) [2022-07-11 04:38:15,937][25689] Fps is (10 sec: 5683.3, 60 sec: 5584.1, 300 sec: 5558.0). Total num frames: 1063189504. Throughput: 0: 5752.5. Samples: 1063187336. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:15,937][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 04:38:17,127][26022] Updated weights on worker 0-0, policy_version 1038277 (0.00087) [2022-07-11 04:38:18,983][26022] Updated weights on worker 0-0, policy_version 1038287 (0.00083) [2022-07-11 04:38:20,748][26022] Updated weights on worker 0-0, policy_version 1038297 (0.00080) [2022-07-11 04:38:20,967][25689] Fps is (10 sec: 5779.0, 60 sec: 5584.7, 300 sec: 5555.9). Total num frames: 1063217152. Throughput: 0: 5745.3. Samples: 1063220882. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:20,967][25689] Avg episode reward: [(0, '0.047')] [2022-07-11 04:38:22,959][26022] Updated weights on worker 0-0, policy_version 1038307 (0.00094) [2022-07-11 04:38:24,327][26022] Updated weights on worker 0-0, policy_version 1038317 (0.00088) [2022-07-11 04:38:25,971][25689] Fps is (10 sec: 5408.8, 60 sec: 5538.2, 300 sec: 5549.2). Total num frames: 1063243776. Throughput: 0: 5013.4. Samples: 1063237830. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:25,972][25689] Avg episode reward: [(0, '-0.217')] [2022-07-11 04:38:26,370][26022] Updated weights on worker 0-0, policy_version 1038327 (0.00382) [2022-07-11 04:38:27,866][26022] Updated weights on worker 0-0, policy_version 1038337 (0.00082) [2022-07-11 04:38:30,090][26022] Updated weights on worker 0-0, policy_version 1038347 (0.00100) [2022-07-11 04:38:31,011][25689] Fps is (10 sec: 5505.2, 60 sec: 5572.4, 300 sec: 5553.1). Total num frames: 1063272448. Throughput: 0: 5847.0. Samples: 1063271508. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:31,012][25689] Avg episode reward: [(0, '0.483')] [2022-07-11 04:38:31,632][26022] Updated weights on worker 0-0, policy_version 1038357 (0.00086) [2022-07-11 04:38:33,612][26022] Updated weights on worker 0-0, policy_version 1038367 (0.00092) [2022-07-11 04:38:35,252][26022] Updated weights on worker 0-0, policy_version 1038377 (0.00095) [2022-07-11 04:38:36,067][25689] Fps is (10 sec: 5679.6, 60 sec: 5574.4, 300 sec: 5556.6). Total num frames: 1063301120. Throughput: 0: 5856.5. Samples: 1063305230. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:36,068][25689] Avg episode reward: [(0, '0.502')] [2022-07-11 04:38:37,388][26022] Updated weights on worker 0-0, policy_version 1038387 (0.00086) [2022-07-11 04:38:39,078][26022] Updated weights on worker 0-0, policy_version 1038397 (0.00093) [2022-07-11 04:38:40,898][26022] Updated weights on worker 0-0, policy_version 1038407 (0.00086) [2022-07-11 04:38:41,070][25689] Fps is (10 sec: 5599.0, 60 sec: 5541.0, 300 sec: 5556.6). Total num frames: 1063328768. Throughput: 0: 5878.0. Samples: 1063339050. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:41,071][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 04:38:42,595][26022] Updated weights on worker 0-0, policy_version 1038417 (0.00082) [2022-07-11 04:38:44,623][26022] Updated weights on worker 0-0, policy_version 1038427 (0.00081) [2022-07-11 04:38:46,164][25689] Fps is (10 sec: 5577.8, 60 sec: 5566.4, 300 sec: 5555.1). Total num frames: 1063357440. Throughput: 0: 5851.0. Samples: 1063355984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:46,165][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 04:38:46,599][26022] Updated weights on worker 0-0, policy_version 1038437 (0.00088) [2022-07-11 04:38:48,325][26022] Updated weights on worker 0-0, policy_version 1038447 (0.00084) [2022-07-11 04:38:50,052][26022] Updated weights on worker 0-0, policy_version 1038457 (0.00085) [2022-07-11 04:38:51,180][25689] Fps is (10 sec: 5672.1, 60 sec: 5566.4, 300 sec: 5556.0). Total num frames: 1063386112. Throughput: 0: 5841.1. Samples: 1063389316. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:51,181][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 04:38:52,043][26022] Updated weights on worker 0-0, policy_version 1038467 (0.00087) [2022-07-11 04:38:53,753][26022] Updated weights on worker 0-0, policy_version 1038477 (0.00087) [2022-07-11 04:38:55,721][26022] Updated weights on worker 0-0, policy_version 1038487 (0.00085) [2022-07-11 04:38:56,284][25689] Fps is (10 sec: 5565.2, 60 sec: 5546.3, 300 sec: 5557.8). Total num frames: 1063413760. Throughput: 0: 5801.7. Samples: 1063422524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:38:56,285][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 04:38:57,669][26022] Updated weights on worker 0-0, policy_version 1038497 (0.00082) [2022-07-11 04:38:59,329][26022] Updated weights on worker 0-0, policy_version 1038507 (0.00077) [2022-07-11 04:39:01,104][26022] Updated weights on worker 0-0, policy_version 1038517 (0.00089) [2022-07-11 04:39:01,374][25689] Fps is (10 sec: 5524.8, 60 sec: 5562.3, 300 sec: 5563.3). Total num frames: 1063442432. Throughput: 0: 4950.7. Samples: 1063439568. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:01,375][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 04:39:03,285][26022] Updated weights on worker 0-0, policy_version 1038527 (0.00085) [2022-07-11 04:39:05,062][26022] Updated weights on worker 0-0, policy_version 1038537 (0.00093) [2022-07-11 04:39:06,473][25689] Fps is (10 sec: 5427.4, 60 sec: 5557.0, 300 sec: 5551.8). Total num frames: 1063469056. Throughput: 0: 5687.0. Samples: 1063471478. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:06,474][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 04:39:07,074][26022] Updated weights on worker 0-0, policy_version 1038547 (0.00083) [2022-07-11 04:39:08,817][26022] Updated weights on worker 0-0, policy_version 1038557 (0.00087) [2022-07-11 04:39:10,598][26022] Updated weights on worker 0-0, policy_version 1038567 (0.00090) [2022-07-11 04:39:11,486][25689] Fps is (10 sec: 5468.4, 60 sec: 5591.4, 300 sec: 5562.7). Total num frames: 1063497728. Throughput: 0: 5707.1. Samples: 1063505204. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:11,486][25689] Avg episode reward: [(0, '-0.508')] [2022-07-11 04:39:12,364][26022] Updated weights on worker 0-0, policy_version 1038577 (0.00089) [2022-07-11 04:39:14,168][26022] Updated weights on worker 0-0, policy_version 1038587 (0.00091) [2022-07-11 04:39:15,888][26022] Updated weights on worker 0-0, policy_version 1038597 (0.00086) [2022-07-11 04:39:16,532][25689] Fps is (10 sec: 5700.8, 60 sec: 5559.8, 300 sec: 5558.6). Total num frames: 1063526400. Throughput: 0: 4916.6. Samples: 1063522072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:16,532][25689] Avg episode reward: [(0, '-0.178')] [2022-07-11 04:39:18,053][26022] Updated weights on worker 0-0, policy_version 1038607 (0.00085) [2022-07-11 04:39:19,507][26022] Updated weights on worker 0-0, policy_version 1038617 (0.00096) [2022-07-11 04:39:21,561][25689] Fps is (10 sec: 5386.9, 60 sec: 5526.1, 300 sec: 5555.3). Total num frames: 1063552000. Throughput: 0: 5744.6. Samples: 1063555532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:21,562][25689] Avg episode reward: [(0, '-0.166')] [2022-07-11 04:39:21,788][26022] Updated weights on worker 0-0, policy_version 1038627 (0.00102) [2022-07-11 04:39:23,141][26022] Updated weights on worker 0-0, policy_version 1038637 (0.00092) [2022-07-11 04:39:25,307][26022] Updated weights on worker 0-0, policy_version 1038647 (0.00088) [2022-07-11 04:39:26,623][25689] Fps is (10 sec: 5682.3, 60 sec: 5605.2, 300 sec: 5569.3). Total num frames: 1063583744. Throughput: 0: 5842.9. Samples: 1063589216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:26,624][25689] Avg episode reward: [(0, '-0.938')] [2022-07-11 04:39:26,966][26022] Updated weights on worker 0-0, policy_version 1038657 (0.00094) [2022-07-11 04:39:28,915][26022] Updated weights on worker 0-0, policy_version 1038667 (0.00088) [2022-07-11 04:39:30,763][26022] Updated weights on worker 0-0, policy_version 1038677 (0.00093) [2022-07-11 04:39:31,646][25689] Fps is (10 sec: 5685.8, 60 sec: 5556.1, 300 sec: 5556.3). Total num frames: 1063609344. Throughput: 0: 5000.8. Samples: 1063606020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:31,647][25689] Avg episode reward: [(0, '-1.485')] [2022-07-11 04:39:32,433][26022] Updated weights on worker 0-0, policy_version 1038687 (0.00093) [2022-07-11 04:39:34,310][26022] Updated weights on worker 0-0, policy_version 1038697 (0.00092) [2022-07-11 04:39:34,485][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:39:34,502][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001038698_1063626752.pth [2022-07-11 04:39:34,503][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001036741_1061622784.pth [2022-07-11 04:39:36,243][26022] Updated weights on worker 0-0, policy_version 1038707 (0.00079) [2022-07-11 04:39:36,763][25689] Fps is (10 sec: 5453.3, 60 sec: 5567.4, 300 sec: 5557.9). Total num frames: 1063639040. Throughput: 0: 5812.9. Samples: 1063639676. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:36,764][25689] Avg episode reward: [(0, '-0.326')] [2022-07-11 04:39:37,888][26022] Updated weights on worker 0-0, policy_version 1038717 (0.00087) [2022-07-11 04:39:39,843][26022] Updated weights on worker 0-0, policy_version 1038727 (0.00086) [2022-07-11 04:39:41,380][26022] Updated weights on worker 0-0, policy_version 1038737 (0.00086) [2022-07-11 04:39:41,798][25689] Fps is (10 sec: 5749.6, 60 sec: 5581.4, 300 sec: 5564.3). Total num frames: 1063667712. Throughput: 0: 5827.0. Samples: 1063673454. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:41,798][25689] Avg episode reward: [(0, '0.069')] [2022-07-11 04:39:43,534][26022] Updated weights on worker 0-0, policy_version 1038747 (0.00090) [2022-07-11 04:39:45,266][26022] Updated weights on worker 0-0, policy_version 1038757 (0.00096) [2022-07-11 04:39:46,820][25689] Fps is (10 sec: 5702.1, 60 sec: 5588.0, 300 sec: 5567.4). Total num frames: 1063696384. Throughput: 0: 5010.6. Samples: 1063690414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:46,820][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 04:39:46,947][26022] Updated weights on worker 0-0, policy_version 1038767 (0.00083) [2022-07-11 04:39:49,037][26022] Updated weights on worker 0-0, policy_version 1038777 (0.00092) [2022-07-11 04:39:50,639][26022] Updated weights on worker 0-0, policy_version 1038787 (0.00093) [2022-07-11 04:39:51,838][25689] Fps is (10 sec: 5507.3, 60 sec: 5554.0, 300 sec: 5561.5). Total num frames: 1063723008. Throughput: 0: 5843.7. Samples: 1063724018. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:51,839][25689] Avg episode reward: [(0, '1.515')] [2022-07-11 04:39:52,653][26022] Updated weights on worker 0-0, policy_version 1038797 (0.00087) [2022-07-11 04:39:54,404][26022] Updated weights on worker 0-0, policy_version 1038807 (0.00094) [2022-07-11 04:39:56,325][26022] Updated weights on worker 0-0, policy_version 1038817 (0.00051) [2022-07-11 04:39:56,898][25689] Fps is (10 sec: 5588.2, 60 sec: 5591.8, 300 sec: 5567.3). Total num frames: 1063752704. Throughput: 0: 5869.4. Samples: 1063757856. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:39:56,899][25689] Avg episode reward: [(0, '1.583')] [2022-07-11 04:39:58,067][26022] Updated weights on worker 0-0, policy_version 1038827 (0.00084) [2022-07-11 04:39:59,823][26022] Updated weights on worker 0-0, policy_version 1038837 (0.00086) [2022-07-11 04:40:01,900][26022] Updated weights on worker 0-0, policy_version 1038847 (0.00115) [2022-07-11 04:40:01,965][25689] Fps is (10 sec: 5561.3, 60 sec: 5560.1, 300 sec: 5566.4). Total num frames: 1063779328. Throughput: 0: 5019.3. Samples: 1063774682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:40:01,966][25689] Avg episode reward: [(0, '1.383')] [2022-07-11 04:40:03,955][26022] Updated weights on worker 0-0, policy_version 1038857 (0.00083) [2022-07-11 04:40:05,781][26022] Updated weights on worker 0-0, policy_version 1038867 (0.00079) [2022-07-11 04:40:07,011][25689] Fps is (10 sec: 5265.7, 60 sec: 5565.0, 300 sec: 5562.6). Total num frames: 1063805952. Throughput: 0: 5724.7. Samples: 1063806000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:40:07,011][25689] Avg episode reward: [(0, '1.466')] [2022-07-11 04:40:07,606][26022] Updated weights on worker 0-0, policy_version 1038877 (0.00090) [2022-07-11 04:40:09,349][26022] Updated weights on worker 0-0, policy_version 1038887 (0.00083) [2022-07-11 04:40:11,286][26022] Updated weights on worker 0-0, policy_version 1038897 (0.00079) [2022-07-11 04:40:12,059][25689] Fps is (10 sec: 5376.9, 60 sec: 5544.9, 300 sec: 5560.7). Total num frames: 1063833600. Throughput: 0: 5709.0. Samples: 1063839458. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:40:12,059][25689] Avg episode reward: [(0, '0.858')] [2022-07-11 04:40:13,087][26022] Updated weights on worker 0-0, policy_version 1038907 (0.00085) [2022-07-11 04:40:15,105][26022] Updated weights on worker 0-0, policy_version 1038917 (0.00094) [2022-07-11 04:40:16,704][26022] Updated weights on worker 0-0, policy_version 1038927 (0.00094) [2022-07-11 04:40:17,101][25689] Fps is (10 sec: 5581.6, 60 sec: 5545.3, 300 sec: 5556.7). Total num frames: 1063862272. Throughput: 0: 4861.4. Samples: 1063856068. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:40:17,101][25689] Avg episode reward: [(0, '-0.101')] [2022-07-11 04:40:18,740][26022] Updated weights on worker 0-0, policy_version 1038937 (0.00077) [2022-07-11 04:40:20,394][26022] Updated weights on worker 0-0, policy_version 1038947 (0.00090) [2022-07-11 04:40:22,136][25689] Fps is (10 sec: 5588.7, 60 sec: 5578.5, 300 sec: 5559.9). Total num frames: 1063889920. Throughput: 0: 5715.1. Samples: 1063889960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:40:22,138][25689] Avg episode reward: [(0, '-1.372')] [2022-07-11 04:40:22,408][26022] Updated weights on worker 0-0, policy_version 1038957 (0.00088) [2022-07-11 04:40:23,908][26022] Updated weights on worker 0-0, policy_version 1038967 (0.00080) [2022-07-11 04:40:26,056][26022] Updated weights on worker 0-0, policy_version 1038977 (0.00089) [2022-07-11 04:40:27,160][25689] Fps is (10 sec: 5700.6, 60 sec: 5548.2, 300 sec: 5567.0). Total num frames: 1063919616. Throughput: 0: 5830.4. Samples: 1063923480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 04:40:27,162][25689] Avg episode reward: [(0, '-0.800')] [2022-07-11 04:40:27,718][26022] Updated weights on worker 0-0, policy_version 1038987 (0.00090) [2022-07-11 04:40:29,640][26022] Updated weights on worker 0-0, policy_version 1038997 (0.00080) [2022-07-11 04:40:31,384][26022] Updated weights on worker 0-0, policy_version 1039007 (0.00089) [2022-07-11 04:40:32,194][25689] Fps is (10 sec: 5599.7, 60 sec: 5564.1, 300 sec: 5557.4). Total num frames: 1063946240. Throughput: 0: 5003.0. Samples: 1063940194. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:40:32,195][25689] Avg episode reward: [(0, '-0.665')] [2022-07-11 04:40:33,203][26022] Updated weights on worker 0-0, policy_version 1039017 (0.00084) [2022-07-11 04:40:35,089][26022] Updated weights on worker 0-0, policy_version 1039027 (0.00092) [2022-07-11 04:40:36,869][26022] Updated weights on worker 0-0, policy_version 1039037 (0.00091) [2022-07-11 04:40:37,273][25689] Fps is (10 sec: 5670.2, 60 sec: 5584.5, 300 sec: 5566.8). Total num frames: 1063976960. Throughput: 0: 5842.0. Samples: 1063973916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:40:37,274][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 04:40:38,881][26022] Updated weights on worker 0-0, policy_version 1039047 (0.00088) [2022-07-11 04:40:40,643][26022] Updated weights on worker 0-0, policy_version 1039057 (0.00088) [2022-07-11 04:40:42,312][25689] Fps is (10 sec: 5667.7, 60 sec: 5550.3, 300 sec: 5559.3). Total num frames: 1064003584. Throughput: 0: 5826.5. Samples: 1064007512. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:40:42,322][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 04:40:42,411][26022] Updated weights on worker 0-0, policy_version 1039067 (0.00091) [2022-07-11 04:40:44,277][26022] Updated weights on worker 0-0, policy_version 1039077 (0.00090) [2022-07-11 04:40:46,077][26022] Updated weights on worker 0-0, policy_version 1039087 (0.00090) [2022-07-11 04:40:47,416][25689] Fps is (10 sec: 5351.0, 60 sec: 5525.9, 300 sec: 5554.2). Total num frames: 1064031232. Throughput: 0: 5830.0. Samples: 1064041570. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:40:47,416][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 04:40:47,768][26022] Updated weights on worker 0-0, policy_version 1039097 (0.00094) [2022-07-11 04:40:49,690][26022] Updated weights on worker 0-0, policy_version 1039107 (0.00089) [2022-07-11 04:40:51,593][26022] Updated weights on worker 0-0, policy_version 1039117 (0.00085) [2022-07-11 04:40:52,436][25689] Fps is (10 sec: 5562.9, 60 sec: 5559.5, 300 sec: 5559.8). Total num frames: 1064059904. Throughput: 0: 5837.8. Samples: 1064058362. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:40:52,437][25689] Avg episode reward: [(0, '1.616')] [2022-07-11 04:40:53,523][26022] Updated weights on worker 0-0, policy_version 1039127 (0.00090) [2022-07-11 04:40:55,283][26022] Updated weights on worker 0-0, policy_version 1039137 (0.00086) [2022-07-11 04:40:57,111][26022] Updated weights on worker 0-0, policy_version 1039147 (0.00087) [2022-07-11 04:40:57,568][25689] Fps is (10 sec: 5648.5, 60 sec: 5536.1, 300 sec: 5560.9). Total num frames: 1064088576. Throughput: 0: 5785.6. Samples: 1064091332. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:40:57,568][25689] Avg episode reward: [(0, '1.462')] [2022-07-11 04:40:58,832][26022] Updated weights on worker 0-0, policy_version 1039157 (0.00084) [2022-07-11 04:41:00,661][26022] Updated weights on worker 0-0, policy_version 1039167 (0.00084) [2022-07-11 04:41:02,588][25689] Fps is (10 sec: 5446.8, 60 sec: 5540.4, 300 sec: 5560.7). Total num frames: 1064115200. Throughput: 0: 5701.7. Samples: 1064123120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:02,589][25689] Avg episode reward: [(0, '1.574')] [2022-07-11 04:41:02,870][26022] Updated weights on worker 0-0, policy_version 1039177 (0.00091) [2022-07-11 04:41:04,816][26022] Updated weights on worker 0-0, policy_version 1039187 (0.00095) [2022-07-11 04:41:06,483][26022] Updated weights on worker 0-0, policy_version 1039197 (0.00094) [2022-07-11 04:41:07,603][25689] Fps is (10 sec: 5408.5, 60 sec: 5560.1, 300 sec: 5557.4). Total num frames: 1064142848. Throughput: 0: 4870.3. Samples: 1064139884. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:07,603][25689] Avg episode reward: [(0, '1.658')] [2022-07-11 04:41:08,562][26022] Updated weights on worker 0-0, policy_version 1039207 (0.00087) [2022-07-11 04:41:10,420][26022] Updated weights on worker 0-0, policy_version 1039217 (0.00087) [2022-07-11 04:41:12,277][26022] Updated weights on worker 0-0, policy_version 1039228 (0.00086) [2022-07-11 04:41:12,610][25689] Fps is (10 sec: 5517.2, 60 sec: 5563.8, 300 sec: 5559.7). Total num frames: 1064170496. Throughput: 0: 5680.6. Samples: 1064172964. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:12,611][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 04:41:14,025][26022] Updated weights on worker 0-0, policy_version 1039238 (0.00093) [2022-07-11 04:41:16,167][26022] Updated weights on worker 0-0, policy_version 1039248 (0.00086) [2022-07-11 04:41:17,621][26022] Updated weights on worker 0-0, policy_version 1039258 (0.00085) [2022-07-11 04:41:17,667][25689] Fps is (10 sec: 5697.7, 60 sec: 5579.4, 300 sec: 5563.5). Total num frames: 1064200192. Throughput: 0: 5753.4. Samples: 1064206970. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:17,667][25689] Avg episode reward: [(0, '1.256')] [2022-07-11 04:41:19,651][26022] Updated weights on worker 0-0, policy_version 1039268 (0.00084) [2022-07-11 04:41:21,433][26022] Updated weights on worker 0-0, policy_version 1039278 (0.00086) [2022-07-11 04:41:22,699][25689] Fps is (10 sec: 5582.6, 60 sec: 5562.8, 300 sec: 5557.6). Total num frames: 1064226816. Throughput: 0: 5013.6. Samples: 1064223948. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:22,701][25689] Avg episode reward: [(0, '1.115')] [2022-07-11 04:41:23,184][26022] Updated weights on worker 0-0, policy_version 1039288 (0.00087) [2022-07-11 04:41:25,198][26022] Updated weights on worker 0-0, policy_version 1039298 (0.00086) [2022-07-11 04:41:26,856][26022] Updated weights on worker 0-0, policy_version 1039308 (0.00106) [2022-07-11 04:41:27,718][25689] Fps is (10 sec: 5399.6, 60 sec: 5529.4, 300 sec: 5555.2). Total num frames: 1064254464. Throughput: 0: 5837.1. Samples: 1064257302. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:27,719][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 04:41:28,811][26022] Updated weights on worker 0-0, policy_version 1039318 (0.00092) [2022-07-11 04:41:30,486][26022] Updated weights on worker 0-0, policy_version 1039328 (0.00096) [2022-07-11 04:41:32,403][26022] Updated weights on worker 0-0, policy_version 1039338 (0.00081) [2022-07-11 04:41:32,726][25689] Fps is (10 sec: 5718.7, 60 sec: 5582.5, 300 sec: 5564.1). Total num frames: 1064284160. Throughput: 0: 5872.1. Samples: 1064291086. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:32,728][25689] Avg episode reward: [(0, '1.334')] [2022-07-11 04:41:34,479][26022] Updated weights on worker 0-0, policy_version 1039348 (0.00087) [2022-07-11 04:41:34,590][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:41:34,599][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001039349_1064293376.pth [2022-07-11 04:41:34,600][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001037393_1062290432.pth [2022-07-11 04:41:36,015][26022] Updated weights on worker 0-0, policy_version 1039358 (0.00338) [2022-07-11 04:41:37,823][25689] Fps is (10 sec: 5674.7, 60 sec: 5530.1, 300 sec: 5562.5). Total num frames: 1064311808. Throughput: 0: 5007.1. Samples: 1064307896. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:37,824][25689] Avg episode reward: [(0, '2.232')] [2022-07-11 04:41:37,918][26022] Updated weights on worker 0-0, policy_version 1039368 (0.00083) [2022-07-11 04:41:39,629][26022] Updated weights on worker 0-0, policy_version 1039378 (0.00083) [2022-07-11 04:41:41,538][26022] Updated weights on worker 0-0, policy_version 1039388 (0.00086) [2022-07-11 04:41:42,876][25689] Fps is (10 sec: 5548.7, 60 sec: 5562.6, 300 sec: 5561.5). Total num frames: 1064340480. Throughput: 0: 5830.2. Samples: 1064341586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:42,877][25689] Avg episode reward: [(0, '2.014')] [2022-07-11 04:41:43,291][26022] Updated weights on worker 0-0, policy_version 1039398 (0.00098) [2022-07-11 04:41:45,498][26022] Updated weights on worker 0-0, policy_version 1039408 (0.00090) [2022-07-11 04:41:47,175][26022] Updated weights on worker 0-0, policy_version 1039418 (0.00093) [2022-07-11 04:41:47,903][25689] Fps is (10 sec: 5689.1, 60 sec: 5586.6, 300 sec: 5568.9). Total num frames: 1064369152. Throughput: 0: 5830.5. Samples: 1064374990. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:47,904][25689] Avg episode reward: [(0, '1.932')] [2022-07-11 04:41:49,061][26022] Updated weights on worker 0-0, policy_version 1039428 (0.00084) [2022-07-11 04:41:50,628][26022] Updated weights on worker 0-0, policy_version 1039438 (0.00097) [2022-07-11 04:41:52,608][26022] Updated weights on worker 0-0, policy_version 1039448 (0.00087) [2022-07-11 04:41:52,923][25689] Fps is (10 sec: 5503.8, 60 sec: 5552.8, 300 sec: 5559.3). Total num frames: 1064395776. Throughput: 0: 4990.5. Samples: 1064391878. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:52,924][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 04:41:54,315][26022] Updated weights on worker 0-0, policy_version 1039458 (0.00094) [2022-07-11 04:41:56,343][26022] Updated weights on worker 0-0, policy_version 1039468 (0.00093) [2022-07-11 04:41:57,991][25689] Fps is (10 sec: 5481.3, 60 sec: 5558.7, 300 sec: 5559.8). Total num frames: 1064424448. Throughput: 0: 5809.2. Samples: 1064425054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:41:57,992][25689] Avg episode reward: [(0, '1.435')] [2022-07-11 04:41:58,199][26022] Updated weights on worker 0-0, policy_version 1039478 (0.00084) [2022-07-11 04:41:59,949][26022] Updated weights on worker 0-0, policy_version 1039488 (0.00097) [2022-07-11 04:42:01,788][26022] Updated weights on worker 0-0, policy_version 1039498 (0.00086) [2022-07-11 04:42:03,039][25689] Fps is (10 sec: 5365.3, 60 sec: 5539.2, 300 sec: 5557.0). Total num frames: 1064450048. Throughput: 0: 5743.9. Samples: 1064457396. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:03,039][25689] Avg episode reward: [(0, '0.845')] [2022-07-11 04:42:04,037][26022] Updated weights on worker 0-0, policy_version 1039508 (0.00094) [2022-07-11 04:42:05,746][26022] Updated weights on worker 0-0, policy_version 1039518 (0.00091) [2022-07-11 04:42:07,577][26022] Updated weights on worker 0-0, policy_version 1039528 (0.00091) [2022-07-11 04:42:08,043][25689] Fps is (10 sec: 5399.4, 60 sec: 5557.1, 300 sec: 5562.1). Total num frames: 1064478720. Throughput: 0: 4901.0. Samples: 1064473694. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:08,043][25689] Avg episode reward: [(0, '0.652')] [2022-07-11 04:42:09,391][26022] Updated weights on worker 0-0, policy_version 1039538 (0.00087) [2022-07-11 04:42:11,312][26022] Updated weights on worker 0-0, policy_version 1039548 (0.00091) [2022-07-11 04:42:13,046][25689] Fps is (10 sec: 5627.8, 60 sec: 5557.5, 300 sec: 5560.0). Total num frames: 1064506368. Throughput: 0: 5726.8. Samples: 1064507118. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:13,047][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 04:42:13,175][26022] Updated weights on worker 0-0, policy_version 1039558 (0.00081) [2022-07-11 04:42:14,858][26022] Updated weights on worker 0-0, policy_version 1039568 (0.00085) [2022-07-11 04:42:16,725][26022] Updated weights on worker 0-0, policy_version 1039578 (0.00086) [2022-07-11 04:42:18,099][25689] Fps is (10 sec: 5702.3, 60 sec: 5557.9, 300 sec: 5566.6). Total num frames: 1064536064. Throughput: 0: 5758.8. Samples: 1064540850. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:18,101][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 04:42:18,463][26022] Updated weights on worker 0-0, policy_version 1039588 (0.00088) [2022-07-11 04:42:20,445][26022] Updated weights on worker 0-0, policy_version 1039598 (0.00085) [2022-07-11 04:42:22,336][26022] Updated weights on worker 0-0, policy_version 1039608 (0.00087) [2022-07-11 04:42:23,104][25689] Fps is (10 sec: 5497.7, 60 sec: 5543.4, 300 sec: 5553.6). Total num frames: 1064561664. Throughput: 0: 5000.4. Samples: 1064557732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:23,105][25689] Avg episode reward: [(0, '-0.287')] [2022-07-11 04:42:23,878][26022] Updated weights on worker 0-0, policy_version 1039618 (0.00085) [2022-07-11 04:42:26,100][26022] Updated weights on worker 0-0, policy_version 1039628 (0.00092) [2022-07-11 04:42:27,513][26022] Updated weights on worker 0-0, policy_version 1039638 (0.00087) [2022-07-11 04:42:28,157][25689] Fps is (10 sec: 5497.7, 60 sec: 5574.2, 300 sec: 5563.8). Total num frames: 1064591360. Throughput: 0: 5838.9. Samples: 1064591140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:28,157][25689] Avg episode reward: [(0, '0.008')] [2022-07-11 04:42:29,779][26022] Updated weights on worker 0-0, policy_version 1039648 (0.00082) [2022-07-11 04:42:31,399][26022] Updated weights on worker 0-0, policy_version 1039658 (0.00094) [2022-07-11 04:42:33,183][25689] Fps is (10 sec: 5689.7, 60 sec: 5538.7, 300 sec: 5561.3). Total num frames: 1064619008. Throughput: 0: 5834.9. Samples: 1064624612. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:33,183][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 04:42:33,186][26022] Updated weights on worker 0-0, policy_version 1039668 (0.00146) [2022-07-11 04:42:35,262][26022] Updated weights on worker 0-0, policy_version 1039678 (0.00082) [2022-07-11 04:42:37,083][26022] Updated weights on worker 0-0, policy_version 1039688 (0.00085) [2022-07-11 04:42:38,231][25689] Fps is (10 sec: 5489.1, 60 sec: 5543.2, 300 sec: 5553.7). Total num frames: 1064646656. Throughput: 0: 4994.6. Samples: 1064641400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:38,232][25689] Avg episode reward: [(0, '0.132')] [2022-07-11 04:42:38,818][26022] Updated weights on worker 0-0, policy_version 1039698 (0.00086) [2022-07-11 04:42:40,723][26022] Updated weights on worker 0-0, policy_version 1039708 (0.00089) [2022-07-11 04:42:42,447][26022] Updated weights on worker 0-0, policy_version 1039718 (0.00092) [2022-07-11 04:42:43,245][25689] Fps is (10 sec: 5597.0, 60 sec: 5546.7, 300 sec: 5560.3). Total num frames: 1064675328. Throughput: 0: 5837.4. Samples: 1064675304. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:43,246][25689] Avg episode reward: [(0, '-0.053')] [2022-07-11 04:42:44,415][26022] Updated weights on worker 0-0, policy_version 1039728 (0.00087) [2022-07-11 04:42:46,046][26022] Updated weights on worker 0-0, policy_version 1039738 (0.00090) [2022-07-11 04:42:47,943][26022] Updated weights on worker 0-0, policy_version 1039748 (0.00082) [2022-07-11 04:42:48,258][25689] Fps is (10 sec: 5718.5, 60 sec: 5547.9, 300 sec: 5560.4). Total num frames: 1064704000. Throughput: 0: 5867.4. Samples: 1064709084. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:48,260][25689] Avg episode reward: [(0, '0.028')] [2022-07-11 04:42:49,541][26022] Updated weights on worker 0-0, policy_version 1039758 (0.00081) [2022-07-11 04:42:51,574][26022] Updated weights on worker 0-0, policy_version 1039768 (0.00083) [2022-07-11 04:42:53,311][25689] Fps is (10 sec: 5595.3, 60 sec: 5561.9, 300 sec: 5557.3). Total num frames: 1064731648. Throughput: 0: 5041.6. Samples: 1064726090. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:53,312][25689] Avg episode reward: [(0, '0.824')] [2022-07-11 04:42:53,516][26022] Updated weights on worker 0-0, policy_version 1039778 (0.00090) [2022-07-11 04:42:55,363][26022] Updated weights on worker 0-0, policy_version 1039788 (0.00093) [2022-07-11 04:42:57,075][26022] Updated weights on worker 0-0, policy_version 1039798 (0.00090) [2022-07-11 04:42:58,377][25689] Fps is (10 sec: 5464.6, 60 sec: 5545.1, 300 sec: 5557.5). Total num frames: 1064759296. Throughput: 0: 5851.2. Samples: 1064759282. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:42:58,378][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 04:42:58,713][26022] Updated weights on worker 0-0, policy_version 1039808 (0.00086) [2022-07-11 04:43:00,724][26022] Updated weights on worker 0-0, policy_version 1039818 (0.00087) [2022-07-11 04:43:02,926][26022] Updated weights on worker 0-0, policy_version 1039828 (0.00085) [2022-07-11 04:43:03,438][25689] Fps is (10 sec: 5359.1, 60 sec: 5560.9, 300 sec: 5557.2). Total num frames: 1064785920. Throughput: 0: 5728.2. Samples: 1064790972. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:03,438][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 04:43:04,811][26022] Updated weights on worker 0-0, policy_version 1039838 (0.00090) [2022-07-11 04:43:06,683][26022] Updated weights on worker 0-0, policy_version 1039848 (0.00086) [2022-07-11 04:43:08,416][26022] Updated weights on worker 0-0, policy_version 1039859 (0.00094) [2022-07-11 04:43:08,514][25689] Fps is (10 sec: 5556.2, 60 sec: 5571.2, 300 sec: 5566.4). Total num frames: 1064815616. Throughput: 0: 4872.5. Samples: 1064807778. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:08,514][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 04:43:10,499][26022] Updated weights on worker 0-0, policy_version 1039869 (0.00082) [2022-07-11 04:43:12,227][26022] Updated weights on worker 0-0, policy_version 1039879 (0.00088) [2022-07-11 04:43:13,527][25689] Fps is (10 sec: 5582.4, 60 sec: 5553.4, 300 sec: 5553.7). Total num frames: 1064842240. Throughput: 0: 5695.2. Samples: 1064841224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:13,527][25689] Avg episode reward: [(0, '-0.366')] [2022-07-11 04:43:14,201][26022] Updated weights on worker 0-0, policy_version 1039889 (0.00083) [2022-07-11 04:43:16,047][26022] Updated weights on worker 0-0, policy_version 1039899 (0.00080) [2022-07-11 04:43:17,786][26022] Updated weights on worker 0-0, policy_version 1039909 (0.00079) [2022-07-11 04:43:18,587][25689] Fps is (10 sec: 5591.0, 60 sec: 5552.7, 300 sec: 5560.1). Total num frames: 1064871936. Throughput: 0: 5715.1. Samples: 1064874784. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:18,588][25689] Avg episode reward: [(0, '-0.622')] [2022-07-11 04:43:19,502][26022] Updated weights on worker 0-0, policy_version 1039919 (0.00082) [2022-07-11 04:43:21,440][26022] Updated weights on worker 0-0, policy_version 1039929 (0.00085) [2022-07-11 04:43:23,351][26022] Updated weights on worker 0-0, policy_version 1039939 (0.00086) [2022-07-11 04:43:23,667][25689] Fps is (10 sec: 5554.3, 60 sec: 5562.8, 300 sec: 5558.6). Total num frames: 1064898560. Throughput: 0: 5796.3. Samples: 1064908224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:23,667][25689] Avg episode reward: [(0, '-0.770')] [2022-07-11 04:43:25,143][26022] Updated weights on worker 0-0, policy_version 1039949 (0.00088) [2022-07-11 04:43:26,926][26022] Updated weights on worker 0-0, policy_version 1039959 (0.00085) [2022-07-11 04:43:28,690][25689] Fps is (10 sec: 5473.4, 60 sec: 5548.6, 300 sec: 5559.0). Total num frames: 1064927232. Throughput: 0: 5798.4. Samples: 1064924768. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:28,690][25689] Avg episode reward: [(0, '-0.452')] [2022-07-11 04:43:28,935][26022] Updated weights on worker 0-0, policy_version 1039969 (0.00090) [2022-07-11 04:43:30,559][26022] Updated weights on worker 0-0, policy_version 1039979 (0.00089) [2022-07-11 04:43:32,549][26022] Updated weights on worker 0-0, policy_version 1039989 (0.00094) [2022-07-11 04:43:33,716][25689] Fps is (10 sec: 5706.4, 60 sec: 5565.5, 300 sec: 5559.5). Total num frames: 1064955904. Throughput: 0: 5805.6. Samples: 1064958434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:33,716][25689] Avg episode reward: [(0, '-0.545')] [2022-07-11 04:43:34,416][26022] Updated weights on worker 0-0, policy_version 1039999 (0.00091) [2022-07-11 04:43:34,810][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:43:34,824][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001040001_1064961024.pth [2022-07-11 04:43:34,824][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001038044_1062957056.pth [2022-07-11 04:43:36,273][26022] Updated weights on worker 0-0, policy_version 1040009 (0.00086) [2022-07-11 04:43:37,922][26022] Updated weights on worker 0-0, policy_version 1040019 (0.00089) [2022-07-11 04:43:38,824][25689] Fps is (10 sec: 5557.7, 60 sec: 5560.0, 300 sec: 5557.6). Total num frames: 1064983552. Throughput: 0: 5808.2. Samples: 1064992322. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:38,824][25689] Avg episode reward: [(0, '0.234')] [2022-07-11 04:43:39,702][26022] Updated weights on worker 0-0, policy_version 1040029 (0.00089) [2022-07-11 04:43:41,614][26022] Updated weights on worker 0-0, policy_version 1040039 (0.00092) [2022-07-11 04:43:43,472][26022] Updated weights on worker 0-0, policy_version 1040049 (0.00092) [2022-07-11 04:43:43,852][25689] Fps is (10 sec: 5455.6, 60 sec: 5541.9, 300 sec: 5555.4). Total num frames: 1065011200. Throughput: 0: 5005.0. Samples: 1065009248. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:43,853][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 04:43:45,120][26022] Updated weights on worker 0-0, policy_version 1040059 (0.00085) [2022-07-11 04:43:47,243][26022] Updated weights on worker 0-0, policy_version 1040069 (0.00088) [2022-07-11 04:43:48,724][26022] Updated weights on worker 0-0, policy_version 1040079 (0.00082) [2022-07-11 04:43:48,876][25689] Fps is (10 sec: 5704.3, 60 sec: 5557.7, 300 sec: 5558.6). Total num frames: 1065040896. Throughput: 0: 5857.4. Samples: 1065043008. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:48,877][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 04:43:50,896][26022] Updated weights on worker 0-0, policy_version 1040089 (0.00091) [2022-07-11 04:43:52,624][26022] Updated weights on worker 0-0, policy_version 1040099 (0.00091) [2022-07-11 04:43:53,885][25689] Fps is (10 sec: 5613.1, 60 sec: 5544.8, 300 sec: 5557.0). Total num frames: 1065067520. Throughput: 0: 5841.6. Samples: 1065076256. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:53,886][25689] Avg episode reward: [(0, '0.679')] [2022-07-11 04:43:54,529][26022] Updated weights on worker 0-0, policy_version 1040109 (0.00086) [2022-07-11 04:43:56,314][26022] Updated weights on worker 0-0, policy_version 1040119 (0.00094) [2022-07-11 04:43:58,315][26022] Updated weights on worker 0-0, policy_version 1040129 (0.00088) [2022-07-11 04:43:59,026][25689] Fps is (10 sec: 5548.8, 60 sec: 5571.7, 300 sec: 5559.5). Total num frames: 1065097216. Throughput: 0: 4974.5. Samples: 1065092820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:43:59,027][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 04:44:00,055][26022] Updated weights on worker 0-0, policy_version 1040139 (0.00084) [2022-07-11 04:44:02,110][26022] Updated weights on worker 0-0, policy_version 1040149 (0.00083) [2022-07-11 04:44:04,069][25689] Fps is (10 sec: 5329.1, 60 sec: 5539.6, 300 sec: 5553.6). Total num frames: 1065121792. Throughput: 0: 5698.9. Samples: 1065124466. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:44:04,071][25689] Avg episode reward: [(0, '0.786')] [2022-07-11 04:44:04,217][26022] Updated weights on worker 0-0, policy_version 1040159 (0.00092) [2022-07-11 04:44:05,766][26022] Updated weights on worker 0-0, policy_version 1040169 (0.00088) [2022-07-11 04:44:07,712][26022] Updated weights on worker 0-0, policy_version 1040179 (0.00084) [2022-07-11 04:44:09,080][25689] Fps is (10 sec: 5398.1, 60 sec: 5545.5, 300 sec: 5557.1). Total num frames: 1065151488. Throughput: 0: 5687.1. Samples: 1065157908. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:44:09,081][25689] Avg episode reward: [(0, '0.797')] [2022-07-11 04:44:09,403][26022] Updated weights on worker 0-0, policy_version 1040189 (0.00088) [2022-07-11 04:44:11,531][26022] Updated weights on worker 0-0, policy_version 1040199 (0.00090) [2022-07-11 04:44:13,191][26022] Updated weights on worker 0-0, policy_version 1040209 (0.00109) [2022-07-11 04:44:14,107][25689] Fps is (10 sec: 5713.0, 60 sec: 5561.2, 300 sec: 5554.0). Total num frames: 1065179136. Throughput: 0: 4857.7. Samples: 1065174486. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:44:14,108][25689] Avg episode reward: [(0, '0.724')] [2022-07-11 04:44:15,233][26022] Updated weights on worker 0-0, policy_version 1040219 (0.00086) [2022-07-11 04:44:16,957][26022] Updated weights on worker 0-0, policy_version 1040229 (0.00086) [2022-07-11 04:44:18,921][26022] Updated weights on worker 0-0, policy_version 1040239 (0.00086) [2022-07-11 04:44:19,242][25689] Fps is (10 sec: 5441.6, 60 sec: 5520.6, 300 sec: 5558.9). Total num frames: 1065206784. Throughput: 0: 5695.9. Samples: 1065207966. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:44:19,242][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 04:44:20,507][26022] Updated weights on worker 0-0, policy_version 1040249 (0.00082) [2022-07-11 04:44:22,419][26022] Updated weights on worker 0-0, policy_version 1040259 (0.00089) [2022-07-11 04:44:24,248][25689] Fps is (10 sec: 5452.3, 60 sec: 5544.1, 300 sec: 5546.2). Total num frames: 1065234432. Throughput: 0: 5825.5. Samples: 1065242020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 04:44:24,249][25689] Avg episode reward: [(0, '0.275')] [2022-07-11 04:44:24,324][26022] Updated weights on worker 0-0, policy_version 1040269 (0.00084) [2022-07-11 04:44:26,017][26022] Updated weights on worker 0-0, policy_version 1040279 (0.00097) [2022-07-11 04:44:27,858][26022] Updated weights on worker 0-0, policy_version 1040289 (0.00082) [2022-07-11 04:44:29,269][25689] Fps is (10 sec: 5616.6, 60 sec: 5544.3, 300 sec: 5556.6). Total num frames: 1065263104. Throughput: 0: 4997.7. Samples: 1065258808. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:29,270][25689] Avg episode reward: [(0, '0.119')] [2022-07-11 04:44:29,811][26022] Updated weights on worker 0-0, policy_version 1040299 (0.00099) [2022-07-11 04:44:31,512][26022] Updated weights on worker 0-0, policy_version 1040309 (0.00089) [2022-07-11 04:44:33,514][26022] Updated weights on worker 0-0, policy_version 1040319 (0.00082) [2022-07-11 04:44:34,301][25689] Fps is (10 sec: 5704.2, 60 sec: 5543.8, 300 sec: 5554.7). Total num frames: 1065291776. Throughput: 0: 5838.2. Samples: 1065292386. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:34,302][25689] Avg episode reward: [(0, '-0.542')] [2022-07-11 04:44:35,219][26022] Updated weights on worker 0-0, policy_version 1040329 (0.00086) [2022-07-11 04:44:36,903][26022] Updated weights on worker 0-0, policy_version 1040339 (0.00089) [2022-07-11 04:44:39,149][26022] Updated weights on worker 0-0, policy_version 1040349 (0.00088) [2022-07-11 04:44:39,373][25689] Fps is (10 sec: 5472.9, 60 sec: 5530.2, 300 sec: 5547.1). Total num frames: 1065318400. Throughput: 0: 5859.1. Samples: 1065325916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:39,374][25689] Avg episode reward: [(0, '-0.712')] [2022-07-11 04:44:40,495][26022] Updated weights on worker 0-0, policy_version 1040359 (0.00093) [2022-07-11 04:44:42,743][26022] Updated weights on worker 0-0, policy_version 1040369 (0.00092) [2022-07-11 04:44:44,189][26022] Updated weights on worker 0-0, policy_version 1040379 (0.00081) [2022-07-11 04:44:44,375][25689] Fps is (10 sec: 5692.6, 60 sec: 5583.3, 300 sec: 5554.4). Total num frames: 1065349120. Throughput: 0: 5005.3. Samples: 1065342758. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:44,375][25689] Avg episode reward: [(0, '0.250')] [2022-07-11 04:44:46,282][26022] Updated weights on worker 0-0, policy_version 1040389 (0.00083) [2022-07-11 04:44:47,940][26022] Updated weights on worker 0-0, policy_version 1040399 (0.00080) [2022-07-11 04:44:49,400][25689] Fps is (10 sec: 5718.8, 60 sec: 5532.5, 300 sec: 5554.3). Total num frames: 1065375744. Throughput: 0: 5855.5. Samples: 1065376684. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:49,401][25689] Avg episode reward: [(0, '0.795')] [2022-07-11 04:44:49,819][26022] Updated weights on worker 0-0, policy_version 1040409 (0.00082) [2022-07-11 04:44:51,593][26022] Updated weights on worker 0-0, policy_version 1040419 (0.00085) [2022-07-11 04:44:53,427][26022] Updated weights on worker 0-0, policy_version 1040429 (0.00089) [2022-07-11 04:44:54,411][25689] Fps is (10 sec: 5509.5, 60 sec: 5566.1, 300 sec: 5551.7). Total num frames: 1065404416. Throughput: 0: 5865.9. Samples: 1065410350. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:54,412][25689] Avg episode reward: [(0, '0.998')] [2022-07-11 04:44:55,331][26022] Updated weights on worker 0-0, policy_version 1040439 (0.00085) [2022-07-11 04:44:57,032][26022] Updated weights on worker 0-0, policy_version 1040449 (0.00100) [2022-07-11 04:44:58,920][26022] Updated weights on worker 0-0, policy_version 1040459 (0.00090) [2022-07-11 04:44:59,478][25689] Fps is (10 sec: 5588.5, 60 sec: 5539.1, 300 sec: 5555.2). Total num frames: 1065432064. Throughput: 0: 5034.9. Samples: 1065427144. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:44:59,479][25689] Avg episode reward: [(0, '1.337')] [2022-07-11 04:45:00,628][26022] Updated weights on worker 0-0, policy_version 1040469 (0.00084) [2022-07-11 04:45:03,050][26022] Updated weights on worker 0-0, policy_version 1040479 (0.00092) [2022-07-11 04:45:04,558][25689] Fps is (10 sec: 5348.9, 60 sec: 5569.6, 300 sec: 5554.5). Total num frames: 1065458688. Throughput: 0: 5750.3. Samples: 1065458816. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:04,558][25689] Avg episode reward: [(0, '1.141')] [2022-07-11 04:45:04,803][26022] Updated weights on worker 0-0, policy_version 1040489 (0.00093) [2022-07-11 04:45:06,631][26022] Updated weights on worker 0-0, policy_version 1040499 (0.00082) [2022-07-11 04:45:08,367][26022] Updated weights on worker 0-0, policy_version 1040509 (0.00097) [2022-07-11 04:45:09,608][25689] Fps is (10 sec: 5458.6, 60 sec: 5549.0, 300 sec: 5557.9). Total num frames: 1065487360. Throughput: 0: 5742.6. Samples: 1065492730. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:09,610][25689] Avg episode reward: [(0, '1.362')] [2022-07-11 04:45:10,235][26022] Updated weights on worker 0-0, policy_version 1040519 (0.00618) [2022-07-11 04:45:12,255][26022] Updated weights on worker 0-0, policy_version 1040529 (0.00087) [2022-07-11 04:45:13,783][26022] Updated weights on worker 0-0, policy_version 1040539 (0.00084) [2022-07-11 04:45:14,611][25689] Fps is (10 sec: 5704.1, 60 sec: 5568.1, 300 sec: 5558.7). Total num frames: 1065516032. Throughput: 0: 4913.4. Samples: 1065509598. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:14,612][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 04:45:15,765][26022] Updated weights on worker 0-0, policy_version 1040549 (0.00365) [2022-07-11 04:45:17,477][26022] Updated weights on worker 0-0, policy_version 1040559 (0.00085) [2022-07-11 04:45:19,228][26022] Updated weights on worker 0-0, policy_version 1040569 (0.00089) [2022-07-11 04:45:19,712][25689] Fps is (10 sec: 5675.6, 60 sec: 5588.2, 300 sec: 5560.9). Total num frames: 1065544704. Throughput: 0: 5748.4. Samples: 1065543456. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:19,714][25689] Avg episode reward: [(0, '0.820')] [2022-07-11 04:45:21,234][26022] Updated weights on worker 0-0, policy_version 1040579 (0.00085) [2022-07-11 04:45:22,874][26022] Updated weights on worker 0-0, policy_version 1040589 (0.00081) [2022-07-11 04:45:24,774][25689] Fps is (10 sec: 5541.3, 60 sec: 5583.1, 300 sec: 5553.3). Total num frames: 1065572352. Throughput: 0: 5853.2. Samples: 1065577148. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:24,775][25689] Avg episode reward: [(0, '0.295')] [2022-07-11 04:45:24,894][26022] Updated weights on worker 0-0, policy_version 1040599 (0.00088) [2022-07-11 04:45:26,736][26022] Updated weights on worker 0-0, policy_version 1040609 (0.00086) [2022-07-11 04:45:28,470][26022] Updated weights on worker 0-0, policy_version 1040619 (0.00088) [2022-07-11 04:45:29,806][25689] Fps is (10 sec: 5478.2, 60 sec: 5565.2, 300 sec: 5556.8). Total num frames: 1065600000. Throughput: 0: 5843.0. Samples: 1065610744. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:29,807][25689] Avg episode reward: [(0, '0.552')] [2022-07-11 04:45:30,351][26022] Updated weights on worker 0-0, policy_version 1040629 (0.00094) [2022-07-11 04:45:32,068][26022] Updated weights on worker 0-0, policy_version 1040639 (0.00090) [2022-07-11 04:45:33,885][26022] Updated weights on worker 0-0, policy_version 1040649 (0.00084) [2022-07-11 04:45:34,813][25689] Fps is (10 sec: 5610.6, 60 sec: 5567.5, 300 sec: 5551.2). Total num frames: 1065628672. Throughput: 0: 5840.6. Samples: 1065627588. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:34,813][25689] Avg episode reward: [(0, '0.737')] [2022-07-11 04:45:34,874][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:45:34,884][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001040654_1065629696.pth [2022-07-11 04:45:34,884][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001038698_1063626752.pth [2022-07-11 04:45:35,811][26022] Updated weights on worker 0-0, policy_version 1040659 (0.00107) [2022-07-11 04:45:37,597][26022] Updated weights on worker 0-0, policy_version 1040669 (0.00089) [2022-07-11 04:45:39,535][26022] Updated weights on worker 0-0, policy_version 1040679 (0.00082) [2022-07-11 04:45:39,883][25689] Fps is (10 sec: 5690.7, 60 sec: 5601.5, 300 sec: 5557.5). Total num frames: 1065657344. Throughput: 0: 5837.0. Samples: 1065661192. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:39,883][25689] Avg episode reward: [(0, '1.073')] [2022-07-11 04:45:41,220][26022] Updated weights on worker 0-0, policy_version 1040689 (0.00087) [2022-07-11 04:45:43,154][26022] Updated weights on worker 0-0, policy_version 1040699 (0.00087) [2022-07-11 04:45:44,679][26022] Updated weights on worker 0-0, policy_version 1040709 (0.00087) [2022-07-11 04:45:44,886][25689] Fps is (10 sec: 5692.4, 60 sec: 5567.5, 300 sec: 5562.9). Total num frames: 1065686016. Throughput: 0: 5862.9. Samples: 1065695062. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:44,887][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 04:45:46,790][26022] Updated weights on worker 0-0, policy_version 1040719 (0.00094) [2022-07-11 04:45:48,399][26022] Updated weights on worker 0-0, policy_version 1040729 (0.00085) [2022-07-11 04:45:49,903][25689] Fps is (10 sec: 5620.7, 60 sec: 5585.2, 300 sec: 5559.5). Total num frames: 1065713664. Throughput: 0: 5034.2. Samples: 1065711916. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:49,903][25689] Avg episode reward: [(0, '0.139')] [2022-07-11 04:45:50,438][26022] Updated weights on worker 0-0, policy_version 1040739 (0.00086) [2022-07-11 04:45:52,120][26022] Updated weights on worker 0-0, policy_version 1040749 (0.00087) [2022-07-11 04:45:54,090][26022] Updated weights on worker 0-0, policy_version 1040759 (0.00100) [2022-07-11 04:45:54,963][25689] Fps is (10 sec: 5589.3, 60 sec: 5580.7, 300 sec: 5560.8). Total num frames: 1065742336. Throughput: 0: 5835.3. Samples: 1065745170. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:45:54,963][25689] Avg episode reward: [(0, '-0.784')] [2022-07-11 04:45:56,118][26022] Updated weights on worker 0-0, policy_version 1040769 (0.00092) [2022-07-11 04:45:57,584][26022] Updated weights on worker 0-0, policy_version 1040779 (0.00099) [2022-07-11 04:45:59,795][26022] Updated weights on worker 0-0, policy_version 1040789 (0.00083) [2022-07-11 04:46:00,035][25689] Fps is (10 sec: 5457.4, 60 sec: 5563.3, 300 sec: 5559.8). Total num frames: 1065768960. Throughput: 0: 5811.1. Samples: 1065778300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:00,036][25689] Avg episode reward: [(0, '-2.822')] [2022-07-11 04:46:01,960][26022] Updated weights on worker 0-0, policy_version 1040799 (0.00101) [2022-07-11 04:46:03,621][26022] Updated weights on worker 0-0, policy_version 1040809 (0.00053) [2022-07-11 04:46:05,079][25689] Fps is (10 sec: 5263.4, 60 sec: 5566.5, 300 sec: 5555.8). Total num frames: 1065795584. Throughput: 0: 4848.6. Samples: 1065792970. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:05,082][25689] Avg episode reward: [(0, '-2.977')] [2022-07-11 04:46:05,551][26022] Updated weights on worker 0-0, policy_version 1040819 (0.00084) [2022-07-11 04:46:07,305][26022] Updated weights on worker 0-0, policy_version 1040829 (0.00087) [2022-07-11 04:46:09,383][26022] Updated weights on worker 0-0, policy_version 1040839 (0.00085) [2022-07-11 04:46:10,096][25689] Fps is (10 sec: 5496.2, 60 sec: 5569.7, 300 sec: 5559.1). Total num frames: 1065824256. Throughput: 0: 5672.4. Samples: 1065826460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:10,096][25689] Avg episode reward: [(0, '-2.837')] [2022-07-11 04:46:11,211][26022] Updated weights on worker 0-0, policy_version 1040849 (0.00084) [2022-07-11 04:46:12,814][26022] Updated weights on worker 0-0, policy_version 1040859 (0.00086) [2022-07-11 04:46:14,823][26022] Updated weights on worker 0-0, policy_version 1040869 (0.00087) [2022-07-11 04:46:15,114][25689] Fps is (10 sec: 5510.6, 60 sec: 5534.4, 300 sec: 5549.5). Total num frames: 1065850880. Throughput: 0: 5716.6. Samples: 1065860366. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:15,115][25689] Avg episode reward: [(0, '-1.880')] [2022-07-11 04:46:16,392][26022] Updated weights on worker 0-0, policy_version 1040879 (0.00081) [2022-07-11 04:46:18,400][26022] Updated weights on worker 0-0, policy_version 1040889 (0.00092) [2022-07-11 04:46:20,164][25689] Fps is (10 sec: 5593.8, 60 sec: 5556.0, 300 sec: 5559.5). Total num frames: 1065880576. Throughput: 0: 4914.5. Samples: 1065877224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:20,165][25689] Avg episode reward: [(0, '-0.950')] [2022-07-11 04:46:20,171][26022] Updated weights on worker 0-0, policy_version 1040899 (0.00083) [2022-07-11 04:46:22,096][26022] Updated weights on worker 0-0, policy_version 1040909 (0.00095) [2022-07-11 04:46:23,826][26022] Updated weights on worker 0-0, policy_version 1040919 (0.00090) [2022-07-11 04:46:25,171][25689] Fps is (10 sec: 5600.1, 60 sec: 5544.2, 300 sec: 5556.3). Total num frames: 1065907200. Throughput: 0: 5860.4. Samples: 1065910714. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:25,171][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 04:46:25,857][26022] Updated weights on worker 0-0, policy_version 1040929 (0.00082) [2022-07-11 04:46:27,632][26022] Updated weights on worker 0-0, policy_version 1040939 (0.00568) [2022-07-11 04:46:29,493][26022] Updated weights on worker 0-0, policy_version 1040949 (0.00092) [2022-07-11 04:46:30,191][25689] Fps is (10 sec: 5412.8, 60 sec: 5545.2, 300 sec: 5549.2). Total num frames: 1065934848. Throughput: 0: 5848.1. Samples: 1065943978. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:30,192][25689] Avg episode reward: [(0, '1.291')] [2022-07-11 04:46:31,223][26022] Updated weights on worker 0-0, policy_version 1040959 (0.00086) [2022-07-11 04:46:33,142][26022] Updated weights on worker 0-0, policy_version 1040969 (0.00589) [2022-07-11 04:46:34,979][26022] Updated weights on worker 0-0, policy_version 1040979 (0.00100) [2022-07-11 04:46:35,201][25689] Fps is (10 sec: 5615.3, 60 sec: 5544.9, 300 sec: 5554.3). Total num frames: 1065963520. Throughput: 0: 4998.6. Samples: 1065960772. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:35,201][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 04:46:36,862][26022] Updated weights on worker 0-0, policy_version 1040989 (0.00090) [2022-07-11 04:46:38,624][26022] Updated weights on worker 0-0, policy_version 1040999 (0.00083) [2022-07-11 04:46:40,334][25689] Fps is (10 sec: 5653.1, 60 sec: 5539.1, 300 sec: 5552.8). Total num frames: 1065992192. Throughput: 0: 5810.7. Samples: 1065994428. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:40,335][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 04:46:40,344][26022] Updated weights on worker 0-0, policy_version 1041009 (0.00505) [2022-07-11 04:46:42,331][26022] Updated weights on worker 0-0, policy_version 1041019 (0.00082) [2022-07-11 04:46:44,125][26022] Updated weights on worker 0-0, policy_version 1041029 (0.00085) [2022-07-11 04:46:45,391][25689] Fps is (10 sec: 5526.8, 60 sec: 5517.3, 300 sec: 5548.8). Total num frames: 1066019840. Throughput: 0: 5803.2. Samples: 1066028054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:45,391][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 04:46:46,026][26022] Updated weights on worker 0-0, policy_version 1041039 (0.00093) [2022-07-11 04:46:47,626][26022] Updated weights on worker 0-0, policy_version 1041049 (0.00090) [2022-07-11 04:46:49,486][26022] Updated weights on worker 0-0, policy_version 1041059 (0.00086) [2022-07-11 04:46:50,423][25689] Fps is (10 sec: 5582.8, 60 sec: 5532.8, 300 sec: 5555.4). Total num frames: 1066048512. Throughput: 0: 4994.4. Samples: 1066045024. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:50,423][25689] Avg episode reward: [(0, '1.000')] [2022-07-11 04:46:51,468][26022] Updated weights on worker 0-0, policy_version 1041069 (0.00086) [2022-07-11 04:46:53,164][26022] Updated weights on worker 0-0, policy_version 1041079 (0.00082) [2022-07-11 04:46:55,163][26022] Updated weights on worker 0-0, policy_version 1041089 (0.00090) [2022-07-11 04:46:55,439][25689] Fps is (10 sec: 5605.2, 60 sec: 5519.9, 300 sec: 5552.9). Total num frames: 1066076160. Throughput: 0: 5821.4. Samples: 1066078586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:46:55,439][25689] Avg episode reward: [(0, '1.200')] [2022-07-11 04:46:56,822][26022] Updated weights on worker 0-0, policy_version 1041099 (0.00103) [2022-07-11 04:46:58,908][26022] Updated weights on worker 0-0, policy_version 1041109 (0.00090) [2022-07-11 04:47:00,559][25689] Fps is (10 sec: 5556.4, 60 sec: 5549.4, 300 sec: 5561.9). Total num frames: 1066104832. Throughput: 0: 5778.1. Samples: 1066111286. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:00,559][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 04:47:00,638][26022] Updated weights on worker 0-0, policy_version 1041119 (0.00093) [2022-07-11 04:47:02,881][26022] Updated weights on worker 0-0, policy_version 1041129 (0.00091) [2022-07-11 04:47:04,803][26022] Updated weights on worker 0-0, policy_version 1041139 (0.00095) [2022-07-11 04:47:05,575][25689] Fps is (10 sec: 5354.4, 60 sec: 5535.1, 300 sec: 5551.4). Total num frames: 1066130432. Throughput: 0: 4854.3. Samples: 1066126032. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:05,575][25689] Avg episode reward: [(0, '0.207')] [2022-07-11 04:47:06,619][26022] Updated weights on worker 0-0, policy_version 1041149 (0.00091) [2022-07-11 04:47:08,257][26022] Updated weights on worker 0-0, policy_version 1041159 (0.00095) [2022-07-11 04:47:10,477][26022] Updated weights on worker 0-0, policy_version 1041169 (0.00088) [2022-07-11 04:47:10,628][25689] Fps is (10 sec: 5288.0, 60 sec: 5514.8, 300 sec: 5550.4). Total num frames: 1066158080. Throughput: 0: 5653.4. Samples: 1066159254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:10,629][25689] Avg episode reward: [(0, '1.283')] [2022-07-11 04:47:11,925][26022] Updated weights on worker 0-0, policy_version 1041179 (0.00086) [2022-07-11 04:47:14,034][26022] Updated weights on worker 0-0, policy_version 1041189 (0.00085) [2022-07-11 04:47:15,641][25689] Fps is (10 sec: 5595.1, 60 sec: 5549.1, 300 sec: 5547.7). Total num frames: 1066186752. Throughput: 0: 5645.7. Samples: 1066192640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:15,641][25689] Avg episode reward: [(0, '0.319')] [2022-07-11 04:47:15,761][26022] Updated weights on worker 0-0, policy_version 1041199 (0.00092) [2022-07-11 04:47:17,768][26022] Updated weights on worker 0-0, policy_version 1041209 (0.00089) [2022-07-11 04:47:19,480][26022] Updated weights on worker 0-0, policy_version 1041219 (0.00086) [2022-07-11 04:47:20,685][25689] Fps is (10 sec: 5600.0, 60 sec: 5515.8, 300 sec: 5553.9). Total num frames: 1066214400. Throughput: 0: 4868.7. Samples: 1066209276. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:20,686][25689] Avg episode reward: [(0, '0.134')] [2022-07-11 04:47:21,393][26022] Updated weights on worker 0-0, policy_version 1041229 (0.00091) [2022-07-11 04:47:23,056][26022] Updated weights on worker 0-0, policy_version 1041239 (0.00085) [2022-07-11 04:47:25,246][26022] Updated weights on worker 0-0, policy_version 1041249 (0.00087) [2022-07-11 04:47:25,700][25689] Fps is (10 sec: 5497.0, 60 sec: 5532.0, 300 sec: 5547.7). Total num frames: 1066242048. Throughput: 0: 5809.4. Samples: 1066242948. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:25,701][25689] Avg episode reward: [(0, '0.027')] [2022-07-11 04:47:26,715][26022] Updated weights on worker 0-0, policy_version 1041259 (0.00092) [2022-07-11 04:47:28,807][26022] Updated weights on worker 0-0, policy_version 1041269 (0.00085) [2022-07-11 04:47:30,328][26022] Updated weights on worker 0-0, policy_version 1041279 (0.00091) [2022-07-11 04:47:30,732][25689] Fps is (10 sec: 5605.7, 60 sec: 5547.8, 300 sec: 5551.0). Total num frames: 1066270720. Throughput: 0: 5836.3. Samples: 1066276586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:30,733][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 04:47:32,268][26022] Updated weights on worker 0-0, policy_version 1041289 (0.00087) [2022-07-11 04:47:34,229][26022] Updated weights on worker 0-0, policy_version 1041299 (0.00092) [2022-07-11 04:47:34,941][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:47:34,949][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001041303_1066294272.pth [2022-07-11 04:47:34,950][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001039349_1064293376.pth [2022-07-11 04:47:35,751][25689] Fps is (10 sec: 5705.4, 60 sec: 5547.0, 300 sec: 5555.0). Total num frames: 1066299392. Throughput: 0: 4999.2. Samples: 1066293174. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:35,753][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 04:47:35,975][26022] Updated weights on worker 0-0, policy_version 1041309 (0.00086) [2022-07-11 04:47:37,875][26022] Updated weights on worker 0-0, policy_version 1041319 (0.00106) [2022-07-11 04:47:39,880][26022] Updated weights on worker 0-0, policy_version 1041329 (0.00091) [2022-07-11 04:47:40,816][25689] Fps is (10 sec: 5585.0, 60 sec: 5536.3, 300 sec: 5550.6). Total num frames: 1066327040. Throughput: 0: 5840.1. Samples: 1066326842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:40,817][25689] Avg episode reward: [(0, '0.728')] [2022-07-11 04:47:41,402][26022] Updated weights on worker 0-0, policy_version 1041339 (0.00096) [2022-07-11 04:47:43,536][26022] Updated weights on worker 0-0, policy_version 1041349 (0.00083) [2022-07-11 04:47:45,107][26022] Updated weights on worker 0-0, policy_version 1041359 (0.00091) [2022-07-11 04:47:45,823][25689] Fps is (10 sec: 5388.6, 60 sec: 5524.0, 300 sec: 5543.9). Total num frames: 1066353664. Throughput: 0: 5841.0. Samples: 1066360484. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:45,824][25689] Avg episode reward: [(0, '0.674')] [2022-07-11 04:47:47,000][26022] Updated weights on worker 0-0, policy_version 1041369 (0.00085) [2022-07-11 04:47:48,864][26022] Updated weights on worker 0-0, policy_version 1041379 (0.00093) [2022-07-11 04:47:50,545][26022] Updated weights on worker 0-0, policy_version 1041389 (0.00081) [2022-07-11 04:47:50,828][25689] Fps is (10 sec: 5625.6, 60 sec: 5543.4, 300 sec: 5551.7). Total num frames: 1066383360. Throughput: 0: 5013.8. Samples: 1066377340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:50,828][25689] Avg episode reward: [(0, '0.891')] [2022-07-11 04:47:52,477][26022] Updated weights on worker 0-0, policy_version 1041399 (0.00083) [2022-07-11 04:47:54,351][26022] Updated weights on worker 0-0, policy_version 1041409 (0.00088) [2022-07-11 04:47:55,862][25689] Fps is (10 sec: 5711.8, 60 sec: 5541.7, 300 sec: 5552.3). Total num frames: 1066411008. Throughput: 0: 5860.5. Samples: 1066411036. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:47:55,864][25689] Avg episode reward: [(0, '1.299')] [2022-07-11 04:47:56,108][26022] Updated weights on worker 0-0, policy_version 1041419 (0.00088) [2022-07-11 04:47:58,113][26022] Updated weights on worker 0-0, policy_version 1041429 (0.00079) [2022-07-11 04:47:59,690][26022] Updated weights on worker 0-0, policy_version 1041439 (0.00112) [2022-07-11 04:48:00,912][25689] Fps is (10 sec: 5483.2, 60 sec: 5531.1, 300 sec: 5555.9). Total num frames: 1066438656. Throughput: 0: 5862.8. Samples: 1066444660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:48:00,913][25689] Avg episode reward: [(0, '0.888')] [2022-07-11 04:48:01,850][26022] Updated weights on worker 0-0, policy_version 1041449 (0.00085) [2022-07-11 04:48:03,773][26022] Updated weights on worker 0-0, policy_version 1041459 (0.00091) [2022-07-11 04:48:05,771][26022] Updated weights on worker 0-0, policy_version 1041469 (0.00083) [2022-07-11 04:48:05,913][25689] Fps is (10 sec: 5399.6, 60 sec: 5549.5, 300 sec: 5547.0). Total num frames: 1066465280. Throughput: 0: 4914.5. Samples: 1066459220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:48:05,914][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 04:48:07,470][26022] Updated weights on worker 0-0, policy_version 1041479 (0.00090) [2022-07-11 04:48:09,176][26022] Updated weights on worker 0-0, policy_version 1041489 (0.00096) [2022-07-11 04:48:10,915][25689] Fps is (10 sec: 5323.6, 60 sec: 5537.3, 300 sec: 5547.2). Total num frames: 1066491904. Throughput: 0: 5772.7. Samples: 1066493296. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:48:10,915][25689] Avg episode reward: [(0, '1.760')] [2022-07-11 04:48:11,167][26022] Updated weights on worker 0-0, policy_version 1041499 (0.00083) [2022-07-11 04:48:12,942][26022] Updated weights on worker 0-0, policy_version 1041509 (0.00085) [2022-07-11 04:48:14,755][26022] Updated weights on worker 0-0, policy_version 1041519 (0.00087) [2022-07-11 04:48:15,936][25689] Fps is (10 sec: 5721.6, 60 sec: 5570.5, 300 sec: 5551.4). Total num frames: 1066522624. Throughput: 0: 5771.3. Samples: 1066526886. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:48:15,936][25689] Avg episode reward: [(0, '2.048')] [2022-07-11 04:48:16,742][26022] Updated weights on worker 0-0, policy_version 1041529 (0.00086) [2022-07-11 04:48:18,364][26022] Updated weights on worker 0-0, policy_version 1041539 (0.00084) [2022-07-11 04:48:20,303][26022] Updated weights on worker 0-0, policy_version 1041549 (0.00082) [2022-07-11 04:48:20,990][25689] Fps is (10 sec: 5691.3, 60 sec: 5552.6, 300 sec: 5551.8). Total num frames: 1066549248. Throughput: 0: 4919.9. Samples: 1066543442. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:48:20,991][25689] Avg episode reward: [(0, '1.964')] [2022-07-11 04:48:22,177][26022] Updated weights on worker 0-0, policy_version 1041559 (0.00082) [2022-07-11 04:48:24,030][26022] Updated weights on worker 0-0, policy_version 1041569 (0.00084) [2022-07-11 04:48:25,807][26022] Updated weights on worker 0-0, policy_version 1041579 (0.00088) [2022-07-11 04:48:25,998][25689] Fps is (10 sec: 5495.4, 60 sec: 5570.2, 300 sec: 5552.1). Total num frames: 1066577920. Throughput: 0: 5877.2. Samples: 1066577260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 04:48:25,998][25689] Avg episode reward: [(0, '2.128')] [2022-07-11 04:48:27,617][26022] Updated weights on worker 0-0, policy_version 1041589 (0.00090) [2022-07-11 04:48:29,391][26022] Updated weights on worker 0-0, policy_version 1041599 (0.00095) [2022-07-11 04:48:31,007][25689] Fps is (10 sec: 5623.0, 60 sec: 5555.4, 300 sec: 5549.0). Total num frames: 1066605568. Throughput: 0: 5842.9. Samples: 1066610690. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:48:31,007][25689] Avg episode reward: [(0, '2.339')] [2022-07-11 04:48:31,334][26022] Updated weights on worker 0-0, policy_version 1041609 (0.00095) [2022-07-11 04:48:33,119][26022] Updated weights on worker 0-0, policy_version 1041619 (0.00081) [2022-07-11 04:48:34,971][26022] Updated weights on worker 0-0, policy_version 1041629 (0.00093) [2022-07-11 04:48:36,019][25689] Fps is (10 sec: 5415.9, 60 sec: 5522.0, 300 sec: 5547.3). Total num frames: 1066632192. Throughput: 0: 4999.0. Samples: 1066627282. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:48:36,019][25689] Avg episode reward: [(0, '2.290')] [2022-07-11 04:48:36,696][26022] Updated weights on worker 0-0, policy_version 1041639 (0.00082) [2022-07-11 04:48:38,981][26022] Updated weights on worker 0-0, policy_version 1041649 (0.00084) [2022-07-11 04:48:40,392][26022] Updated weights on worker 0-0, policy_version 1041659 (0.00498) [2022-07-11 04:48:41,148][25689] Fps is (10 sec: 5553.4, 60 sec: 5550.1, 300 sec: 5552.3). Total num frames: 1066661888. Throughput: 0: 5823.3. Samples: 1066660826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:48:41,148][25689] Avg episode reward: [(0, '0.480')] [2022-07-11 04:48:42,379][26022] Updated weights on worker 0-0, policy_version 1041669 (0.00081) [2022-07-11 04:48:44,131][26022] Updated weights on worker 0-0, policy_version 1041679 (0.00087) [2022-07-11 04:48:46,027][26022] Updated weights on worker 0-0, policy_version 1041689 (0.00081) [2022-07-11 04:48:46,150][25689] Fps is (10 sec: 5660.3, 60 sec: 5567.5, 300 sec: 5545.9). Total num frames: 1066689536. Throughput: 0: 5837.0. Samples: 1066694886. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:48:46,150][25689] Avg episode reward: [(0, '-0.078')] [2022-07-11 04:48:47,792][26022] Updated weights on worker 0-0, policy_version 1041699 (0.00084) [2022-07-11 04:48:49,666][26022] Updated weights on worker 0-0, policy_version 1041709 (0.00087) [2022-07-11 04:48:51,170][25689] Fps is (10 sec: 5619.7, 60 sec: 5549.2, 300 sec: 5552.5). Total num frames: 1066718208. Throughput: 0: 4997.6. Samples: 1066711458. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:48:51,170][25689] Avg episode reward: [(0, '-0.749')] [2022-07-11 04:48:51,555][26022] Updated weights on worker 0-0, policy_version 1041719 (0.00086) [2022-07-11 04:48:53,439][26022] Updated weights on worker 0-0, policy_version 1041729 (0.00095) [2022-07-11 04:48:55,128][26022] Updated weights on worker 0-0, policy_version 1041739 (0.00091) [2022-07-11 04:48:56,186][25689] Fps is (10 sec: 5611.6, 60 sec: 5550.8, 300 sec: 5548.0). Total num frames: 1066745856. Throughput: 0: 5835.3. Samples: 1066744964. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:48:56,186][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 04:48:56,797][26022] Updated weights on worker 0-0, policy_version 1041749 (0.00090) [2022-07-11 04:48:59,020][26022] Updated weights on worker 0-0, policy_version 1041759 (0.00085) [2022-07-11 04:49:00,366][26022] Updated weights on worker 0-0, policy_version 1041769 (0.00086) [2022-07-11 04:49:01,245][25689] Fps is (10 sec: 5691.7, 60 sec: 5584.0, 300 sec: 5564.9). Total num frames: 1066775552. Throughput: 0: 5860.2. Samples: 1066778598. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:01,245][25689] Avg episode reward: [(0, '-0.991')] [2022-07-11 04:49:03,164][26022] Updated weights on worker 0-0, policy_version 1041779 (0.00089) [2022-07-11 04:49:04,390][26022] Updated weights on worker 0-0, policy_version 1041789 (0.00096) [2022-07-11 04:49:06,263][25689] Fps is (10 sec: 5284.0, 60 sec: 5531.5, 300 sec: 5544.1). Total num frames: 1066799104. Throughput: 0: 4884.6. Samples: 1066793132. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:06,265][25689] Avg episode reward: [(0, '-1.200')] [2022-07-11 04:49:06,712][26022] Updated weights on worker 0-0, policy_version 1041799 (0.00088) [2022-07-11 04:49:08,255][26022] Updated weights on worker 0-0, policy_version 1041809 (0.00086) [2022-07-11 04:49:10,186][26022] Updated weights on worker 0-0, policy_version 1041819 (0.00081) [2022-07-11 04:49:11,339][25689] Fps is (10 sec: 5376.7, 60 sec: 5592.5, 300 sec: 5553.5). Total num frames: 1066829824. Throughput: 0: 5732.3. Samples: 1066827072. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:11,340][25689] Avg episode reward: [(0, '-1.252')] [2022-07-11 04:49:11,997][26022] Updated weights on worker 0-0, policy_version 1041829 (0.00086) [2022-07-11 04:49:13,811][26022] Updated weights on worker 0-0, policy_version 1041839 (0.00082) [2022-07-11 04:49:15,704][26022] Updated weights on worker 0-0, policy_version 1041849 (0.00089) [2022-07-11 04:49:16,355][25689] Fps is (10 sec: 5682.4, 60 sec: 5525.1, 300 sec: 5552.3). Total num frames: 1066856448. Throughput: 0: 5747.2. Samples: 1066860878. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:16,356][25689] Avg episode reward: [(0, '0.175')] [2022-07-11 04:49:17,457][26022] Updated weights on worker 0-0, policy_version 1041859 (0.00098) [2022-07-11 04:49:19,183][26022] Updated weights on worker 0-0, policy_version 1041869 (0.00088) [2022-07-11 04:49:21,138][26022] Updated weights on worker 0-0, policy_version 1041879 (0.00084) [2022-07-11 04:49:21,401][25689] Fps is (10 sec: 5495.3, 60 sec: 5559.8, 300 sec: 5555.0). Total num frames: 1066885120. Throughput: 0: 5756.1. Samples: 1066894620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:21,402][25689] Avg episode reward: [(0, '-0.244')] [2022-07-11 04:49:22,904][26022] Updated weights on worker 0-0, policy_version 1041889 (0.00083) [2022-07-11 04:49:24,983][26022] Updated weights on worker 0-0, policy_version 1041899 (0.00092) [2022-07-11 04:49:26,487][25689] Fps is (10 sec: 5558.5, 60 sec: 5535.7, 300 sec: 5550.4). Total num frames: 1066912768. Throughput: 0: 5849.1. Samples: 1066911422. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:26,487][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 04:49:26,633][26022] Updated weights on worker 0-0, policy_version 1041909 (0.00089) [2022-07-11 04:49:28,477][26022] Updated weights on worker 0-0, policy_version 1041919 (0.00090) [2022-07-11 04:49:30,431][26022] Updated weights on worker 0-0, policy_version 1041929 (0.00086) [2022-07-11 04:49:31,562][25689] Fps is (10 sec: 5543.2, 60 sec: 5546.6, 300 sec: 5549.6). Total num frames: 1066941440. Throughput: 0: 5796.6. Samples: 1066944296. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:31,562][25689] Avg episode reward: [(0, '-0.283')] [2022-07-11 04:49:32,148][26022] Updated weights on worker 0-0, policy_version 1041939 (0.00082) [2022-07-11 04:49:34,098][26022] Updated weights on worker 0-0, policy_version 1041949 (0.00087) [2022-07-11 04:49:35,100][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:49:35,122][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001041954_1066960896.pth [2022-07-11 04:49:35,123][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001040001_1064961024.pth [2022-07-11 04:49:35,959][26022] Updated weights on worker 0-0, policy_version 1041959 (0.00089) [2022-07-11 04:49:36,614][25689] Fps is (10 sec: 5561.1, 60 sec: 5559.7, 300 sec: 5553.4). Total num frames: 1066969088. Throughput: 0: 5755.8. Samples: 1066977488. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:36,615][25689] Avg episode reward: [(0, '-0.747')] [2022-07-11 04:49:37,804][26022] Updated weights on worker 0-0, policy_version 1041969 (0.00089) [2022-07-11 04:49:39,731][26022] Updated weights on worker 0-0, policy_version 1041979 (0.00087) [2022-07-11 04:49:41,511][26022] Updated weights on worker 0-0, policy_version 1041989 (0.00085) [2022-07-11 04:49:41,722][25689] Fps is (10 sec: 5543.1, 60 sec: 5544.8, 300 sec: 5544.5). Total num frames: 1066997760. Throughput: 0: 4887.2. Samples: 1066993932. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:41,722][25689] Avg episode reward: [(0, '-0.966')] [2022-07-11 04:49:43,423][26022] Updated weights on worker 0-0, policy_version 1041999 (0.00082) [2022-07-11 04:49:44,970][26022] Updated weights on worker 0-0, policy_version 1042009 (0.00086) [2022-07-11 04:49:46,728][25689] Fps is (10 sec: 5467.5, 60 sec: 5527.5, 300 sec: 5544.9). Total num frames: 1067024384. Throughput: 0: 5722.1. Samples: 1067027242. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:46,728][25689] Avg episode reward: [(0, '-1.124')] [2022-07-11 04:49:47,246][26022] Updated weights on worker 0-0, policy_version 1042019 (0.00420) [2022-07-11 04:49:48,732][26022] Updated weights on worker 0-0, policy_version 1042029 (0.00092) [2022-07-11 04:49:50,855][26022] Updated weights on worker 0-0, policy_version 1042039 (0.00093) [2022-07-11 04:49:51,815][25689] Fps is (10 sec: 5681.4, 60 sec: 5555.2, 300 sec: 5550.3). Total num frames: 1067055104. Throughput: 0: 5755.0. Samples: 1067060856. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:51,816][25689] Avg episode reward: [(0, '-0.250')] [2022-07-11 04:49:52,353][26022] Updated weights on worker 0-0, policy_version 1042049 (0.00084) [2022-07-11 04:49:54,437][26022] Updated weights on worker 0-0, policy_version 1042059 (0.00087) [2022-07-11 04:49:56,187][26022] Updated weights on worker 0-0, policy_version 1042069 (0.00086) [2022-07-11 04:49:56,818][25689] Fps is (10 sec: 5581.7, 60 sec: 5522.6, 300 sec: 5544.6). Total num frames: 1067080704. Throughput: 0: 4957.0. Samples: 1067077638. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:49:56,819][25689] Avg episode reward: [(0, '-0.431')] [2022-07-11 04:49:57,995][26022] Updated weights on worker 0-0, policy_version 1042079 (0.00084) [2022-07-11 04:50:00,045][26022] Updated weights on worker 0-0, policy_version 1042089 (0.00095) [2022-07-11 04:50:01,728][26022] Updated weights on worker 0-0, policy_version 1042099 (0.00089) [2022-07-11 04:50:01,926][25689] Fps is (10 sec: 5367.6, 60 sec: 5501.3, 300 sec: 5551.0). Total num frames: 1067109376. Throughput: 0: 5796.7. Samples: 1067111050. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:01,927][25689] Avg episode reward: [(0, '-0.084')] [2022-07-11 04:50:04,037][26022] Updated weights on worker 0-0, policy_version 1042109 (0.00083) [2022-07-11 04:50:05,782][26022] Updated weights on worker 0-0, policy_version 1042119 (0.00087) [2022-07-11 04:50:06,961][25689] Fps is (10 sec: 5451.5, 60 sec: 5550.4, 300 sec: 5544.4). Total num frames: 1067136000. Throughput: 0: 5689.7. Samples: 1067142364. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:06,962][25689] Avg episode reward: [(0, '1.589')] [2022-07-11 04:50:07,561][26022] Updated weights on worker 0-0, policy_version 1042129 (0.00078) [2022-07-11 04:50:09,614][26022] Updated weights on worker 0-0, policy_version 1042139 (0.00093) [2022-07-11 04:50:11,245][26022] Updated weights on worker 0-0, policy_version 1042149 (0.00084) [2022-07-11 04:50:11,984][25689] Fps is (10 sec: 5294.2, 60 sec: 5487.6, 300 sec: 5537.1). Total num frames: 1067162624. Throughput: 0: 4872.8. Samples: 1067159134. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:11,985][25689] Avg episode reward: [(0, '0.926')] [2022-07-11 04:50:13,178][26022] Updated weights on worker 0-0, policy_version 1042159 (0.00080) [2022-07-11 04:50:14,916][26022] Updated weights on worker 0-0, policy_version 1042169 (0.00091) [2022-07-11 04:50:16,690][26022] Updated weights on worker 0-0, policy_version 1042179 (0.00105) [2022-07-11 04:50:16,989][25689] Fps is (10 sec: 5616.4, 60 sec: 5539.3, 300 sec: 5542.4). Total num frames: 1067192320. Throughput: 0: 5721.7. Samples: 1067193050. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:16,989][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 04:50:18,571][26022] Updated weights on worker 0-0, policy_version 1042189 (0.00079) [2022-07-11 04:50:20,309][26022] Updated weights on worker 0-0, policy_version 1042199 (0.00081) [2022-07-11 04:50:22,047][25689] Fps is (10 sec: 5800.5, 60 sec: 5538.3, 300 sec: 5545.9). Total num frames: 1067220992. Throughput: 0: 5757.6. Samples: 1067226896. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:22,047][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 04:50:22,246][26022] Updated weights on worker 0-0, policy_version 1042209 (0.00092) [2022-07-11 04:50:24,150][26022] Updated weights on worker 0-0, policy_version 1042219 (0.00086) [2022-07-11 04:50:25,826][26022] Updated weights on worker 0-0, policy_version 1042229 (0.00092) [2022-07-11 04:50:27,068][25689] Fps is (10 sec: 5587.7, 60 sec: 5544.2, 300 sec: 5546.1). Total num frames: 1067248640. Throughput: 0: 5041.1. Samples: 1067243726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:27,069][25689] Avg episode reward: [(0, '-0.066')] [2022-07-11 04:50:27,706][26022] Updated weights on worker 0-0, policy_version 1042239 (0.00082) [2022-07-11 04:50:29,543][26022] Updated weights on worker 0-0, policy_version 1042249 (0.00086) [2022-07-11 04:50:31,336][26022] Updated weights on worker 0-0, policy_version 1042259 (0.00084) [2022-07-11 04:50:32,082][25689] Fps is (10 sec: 5509.9, 60 sec: 5532.8, 300 sec: 5542.5). Total num frames: 1067276288. Throughput: 0: 5871.0. Samples: 1067277130. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:32,083][25689] Avg episode reward: [(0, '-0.003')] [2022-07-11 04:50:33,163][26022] Updated weights on worker 0-0, policy_version 1042269 (0.00088) [2022-07-11 04:50:35,237][26022] Updated weights on worker 0-0, policy_version 1042279 (0.00089) [2022-07-11 04:50:36,909][26022] Updated weights on worker 0-0, policy_version 1042289 (0.00086) [2022-07-11 04:50:37,098][25689] Fps is (10 sec: 5614.9, 60 sec: 5553.1, 300 sec: 5543.5). Total num frames: 1067304960. Throughput: 0: 5857.0. Samples: 1067310832. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:37,099][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 04:50:38,733][26022] Updated weights on worker 0-0, policy_version 1042299 (0.00084) [2022-07-11 04:50:40,686][26022] Updated weights on worker 0-0, policy_version 1042309 (0.00087) [2022-07-11 04:50:42,139][25689] Fps is (10 sec: 5803.8, 60 sec: 5576.1, 300 sec: 5546.3). Total num frames: 1067334656. Throughput: 0: 5014.7. Samples: 1067327654. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:42,140][25689] Avg episode reward: [(0, '0.826')] [2022-07-11 04:50:42,141][26022] Updated weights on worker 0-0, policy_version 1042319 (0.00089) [2022-07-11 04:50:44,400][26022] Updated weights on worker 0-0, policy_version 1042329 (0.00086) [2022-07-11 04:50:45,782][26022] Updated weights on worker 0-0, policy_version 1042339 (0.00089) [2022-07-11 04:50:47,147][25689] Fps is (10 sec: 5502.6, 60 sec: 5559.0, 300 sec: 5539.5). Total num frames: 1067360256. Throughput: 0: 5864.9. Samples: 1067361490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:47,149][25689] Avg episode reward: [(0, '0.726')] [2022-07-11 04:50:47,817][26022] Updated weights on worker 0-0, policy_version 1042349 (0.00089) [2022-07-11 04:50:49,626][26022] Updated weights on worker 0-0, policy_version 1042359 (0.00087) [2022-07-11 04:50:51,174][26022] Updated weights on worker 0-0, policy_version 1042369 (0.00083) [2022-07-11 04:50:52,167][25689] Fps is (10 sec: 5412.4, 60 sec: 5531.3, 300 sec: 5540.3). Total num frames: 1067388928. Throughput: 0: 5890.3. Samples: 1067395432. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:52,167][25689] Avg episode reward: [(0, '1.254')] [2022-07-11 04:50:53,319][26022] Updated weights on worker 0-0, policy_version 1042379 (0.00082) [2022-07-11 04:50:54,935][26022] Updated weights on worker 0-0, policy_version 1042389 (0.00086) [2022-07-11 04:50:56,870][26022] Updated weights on worker 0-0, policy_version 1042399 (0.00082) [2022-07-11 04:50:57,182][25689] Fps is (10 sec: 5816.5, 60 sec: 5598.0, 300 sec: 5551.7). Total num frames: 1067418624. Throughput: 0: 5055.9. Samples: 1067412374. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:50:57,184][25689] Avg episode reward: [(0, '1.570')] [2022-07-11 04:50:58,738][26022] Updated weights on worker 0-0, policy_version 1042409 (0.00080) [2022-07-11 04:51:00,495][26022] Updated weights on worker 0-0, policy_version 1042419 (0.00096) [2022-07-11 04:51:02,244][25689] Fps is (10 sec: 5385.5, 60 sec: 5534.4, 300 sec: 5544.5). Total num frames: 1067443200. Throughput: 0: 5895.5. Samples: 1067446182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:02,245][25689] Avg episode reward: [(0, '1.415')] [2022-07-11 04:51:02,828][26022] Updated weights on worker 0-0, policy_version 1042429 (0.00087) [2022-07-11 04:51:04,451][26022] Updated weights on worker 0-0, policy_version 1042439 (0.00084) [2022-07-11 04:51:06,472][26022] Updated weights on worker 0-0, policy_version 1042449 (0.00103) [2022-07-11 04:51:07,259][25689] Fps is (10 sec: 5284.5, 60 sec: 5570.3, 300 sec: 5544.5). Total num frames: 1067471872. Throughput: 0: 5802.3. Samples: 1067478180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:07,260][25689] Avg episode reward: [(0, '1.604')] [2022-07-11 04:51:07,975][26022] Updated weights on worker 0-0, policy_version 1042459 (0.00087) [2022-07-11 04:51:10,176][26022] Updated weights on worker 0-0, policy_version 1042469 (0.00086) [2022-07-11 04:51:11,850][26022] Updated weights on worker 0-0, policy_version 1042479 (0.00098) [2022-07-11 04:51:12,278][25689] Fps is (10 sec: 5613.2, 60 sec: 5587.6, 300 sec: 5547.9). Total num frames: 1067499520. Throughput: 0: 4935.9. Samples: 1067494698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:12,278][25689] Avg episode reward: [(0, '1.368')] [2022-07-11 04:51:13,644][26022] Updated weights on worker 0-0, policy_version 1042489 (0.00093) [2022-07-11 04:51:15,638][26022] Updated weights on worker 0-0, policy_version 1042499 (0.00087) [2022-07-11 04:51:17,289][25689] Fps is (10 sec: 5614.7, 60 sec: 5570.0, 300 sec: 5545.2). Total num frames: 1067528192. Throughput: 0: 5766.9. Samples: 1067528330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:17,290][25689] Avg episode reward: [(0, '0.541')] [2022-07-11 04:51:17,389][26022] Updated weights on worker 0-0, policy_version 1042509 (0.00085) [2022-07-11 04:51:19,112][26022] Updated weights on worker 0-0, policy_version 1042519 (0.00090) [2022-07-11 04:51:21,096][26022] Updated weights on worker 0-0, policy_version 1042529 (0.00092) [2022-07-11 04:51:22,418][25689] Fps is (10 sec: 5655.2, 60 sec: 5563.5, 300 sec: 5549.8). Total num frames: 1067556864. Throughput: 0: 5737.0. Samples: 1067561918. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:22,418][25689] Avg episode reward: [(0, '-0.788')] [2022-07-11 04:51:22,700][26022] Updated weights on worker 0-0, policy_version 1042539 (0.00086) [2022-07-11 04:51:24,704][26022] Updated weights on worker 0-0, policy_version 1042549 (0.00619) [2022-07-11 04:51:26,289][26022] Updated weights on worker 0-0, policy_version 1042559 (0.00090) [2022-07-11 04:51:27,428][25689] Fps is (10 sec: 5555.0, 60 sec: 5564.5, 300 sec: 5550.0). Total num frames: 1067584512. Throughput: 0: 4997.9. Samples: 1067578984. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:27,428][25689] Avg episode reward: [(0, '-0.838')] [2022-07-11 04:51:28,350][26022] Updated weights on worker 0-0, policy_version 1042569 (0.00086) [2022-07-11 04:51:30,085][26022] Updated weights on worker 0-0, policy_version 1042579 (0.00083) [2022-07-11 04:51:31,927][26022] Updated weights on worker 0-0, policy_version 1042589 (0.00090) [2022-07-11 04:51:32,459][25689] Fps is (10 sec: 5608.9, 60 sec: 5579.9, 300 sec: 5549.6). Total num frames: 1067613184. Throughput: 0: 5842.3. Samples: 1067612602. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:32,459][25689] Avg episode reward: [(0, '-0.578')] [2022-07-11 04:51:33,860][26022] Updated weights on worker 0-0, policy_version 1042599 (0.00093) [2022-07-11 04:51:35,310][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:51:35,321][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001042607_1067629568.pth [2022-07-11 04:51:35,325][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001040654_1065629696.pth [2022-07-11 04:51:35,596][26022] Updated weights on worker 0-0, policy_version 1042609 (0.00094) [2022-07-11 04:51:37,422][26022] Updated weights on worker 0-0, policy_version 1042619 (0.00097) [2022-07-11 04:51:37,517][25689] Fps is (10 sec: 5684.0, 60 sec: 5576.1, 300 sec: 5551.0). Total num frames: 1067641856. Throughput: 0: 5832.3. Samples: 1067646302. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:37,517][25689] Avg episode reward: [(0, '-0.338')] [2022-07-11 04:51:39,247][26022] Updated weights on worker 0-0, policy_version 1042629 (0.00089) [2022-07-11 04:51:41,094][26022] Updated weights on worker 0-0, policy_version 1042639 (0.00094) [2022-07-11 04:51:42,579][25689] Fps is (10 sec: 5565.1, 60 sec: 5540.2, 300 sec: 5550.9). Total num frames: 1067669504. Throughput: 0: 5841.4. Samples: 1067679688. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:42,580][25689] Avg episode reward: [(0, '-0.190')] [2022-07-11 04:51:43,084][26022] Updated weights on worker 0-0, policy_version 1042649 (0.00092) [2022-07-11 04:51:44,688][26022] Updated weights on worker 0-0, policy_version 1042659 (0.00089) [2022-07-11 04:51:46,563][26022] Updated weights on worker 0-0, policy_version 1042669 (0.00088) [2022-07-11 04:51:47,617][25689] Fps is (10 sec: 5575.8, 60 sec: 5588.2, 300 sec: 5550.8). Total num frames: 1067698176. Throughput: 0: 5819.7. Samples: 1067696480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:47,618][25689] Avg episode reward: [(0, '-0.160')] [2022-07-11 04:51:48,486][26022] Updated weights on worker 0-0, policy_version 1042679 (0.00084) [2022-07-11 04:51:50,398][26022] Updated weights on worker 0-0, policy_version 1042689 (0.00086) [2022-07-11 04:51:51,971][26022] Updated weights on worker 0-0, policy_version 1042699 (0.00088) [2022-07-11 04:51:52,718][25689] Fps is (10 sec: 5655.6, 60 sec: 5580.7, 300 sec: 5552.6). Total num frames: 1067726848. Throughput: 0: 5806.2. Samples: 1067730234. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:52,719][25689] Avg episode reward: [(0, '0.735')] [2022-07-11 04:51:53,912][26022] Updated weights on worker 0-0, policy_version 1042709 (0.00085) [2022-07-11 04:51:55,723][26022] Updated weights on worker 0-0, policy_version 1042719 (0.00089) [2022-07-11 04:51:57,727][25689] Fps is (10 sec: 5570.7, 60 sec: 5547.5, 300 sec: 5551.3). Total num frames: 1067754496. Throughput: 0: 5798.5. Samples: 1067763494. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:51:57,728][25689] Avg episode reward: [(0, '0.135')] [2022-07-11 04:51:57,735][26022] Updated weights on worker 0-0, policy_version 1042729 (0.00054) [2022-07-11 04:51:59,550][26022] Updated weights on worker 0-0, policy_version 1042739 (0.00095) [2022-07-11 04:52:01,527][26022] Updated weights on worker 0-0, policy_version 1042749 (0.00087) [2022-07-11 04:52:02,807][25689] Fps is (10 sec: 5379.6, 60 sec: 5579.7, 300 sec: 5553.5). Total num frames: 1067781120. Throughput: 0: 4963.8. Samples: 1067780094. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:52:02,807][25689] Avg episode reward: [(0, '-0.126')] [2022-07-11 04:52:03,613][26022] Updated weights on worker 0-0, policy_version 1042759 (0.00086) [2022-07-11 04:52:05,410][26022] Updated weights on worker 0-0, policy_version 1042769 (0.00095) [2022-07-11 04:52:07,185][26022] Updated weights on worker 0-0, policy_version 1042779 (0.00086) [2022-07-11 04:52:07,866][25689] Fps is (10 sec: 5353.1, 60 sec: 5558.7, 300 sec: 5553.4). Total num frames: 1067808768. Throughput: 0: 5673.7. Samples: 1067811364. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:52:07,866][25689] Avg episode reward: [(0, '0.223')] [2022-07-11 04:52:09,135][26022] Updated weights on worker 0-0, policy_version 1042789 (0.00085) [2022-07-11 04:52:11,122][26022] Updated weights on worker 0-0, policy_version 1042799 (0.00090) [2022-07-11 04:52:12,823][26022] Updated weights on worker 0-0, policy_version 1042809 (0.00079) [2022-07-11 04:52:12,916][25689] Fps is (10 sec: 5470.1, 60 sec: 5555.9, 300 sec: 5549.3). Total num frames: 1067836416. Throughput: 0: 5683.6. Samples: 1067845028. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:52:12,916][25689] Avg episode reward: [(0, '0.805')] [2022-07-11 04:52:14,581][26022] Updated weights on worker 0-0, policy_version 1042819 (0.00086) [2022-07-11 04:52:16,472][26022] Updated weights on worker 0-0, policy_version 1042829 (0.00090) [2022-07-11 04:52:17,988][25689] Fps is (10 sec: 5463.1, 60 sec: 5533.5, 300 sec: 5548.8). Total num frames: 1067864064. Throughput: 0: 4847.1. Samples: 1067861698. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:52:17,988][25689] Avg episode reward: [(0, '0.592')] [2022-07-11 04:52:18,339][26022] Updated weights on worker 0-0, policy_version 1042839 (0.00088) [2022-07-11 04:52:20,111][26022] Updated weights on worker 0-0, policy_version 1042849 (0.00081) [2022-07-11 04:52:21,955][26022] Updated weights on worker 0-0, policy_version 1042859 (0.00090) [2022-07-11 04:52:23,031][25689] Fps is (10 sec: 5466.7, 60 sec: 5524.4, 300 sec: 5548.2). Total num frames: 1067891712. Throughput: 0: 5686.0. Samples: 1067895088. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:52:23,031][25689] Avg episode reward: [(0, '0.788')] [2022-07-11 04:52:23,734][26022] Updated weights on worker 0-0, policy_version 1042869 (0.00092) [2022-07-11 04:52:25,587][26022] Updated weights on worker 0-0, policy_version 1042879 (0.00092) [2022-07-11 04:52:27,501][26022] Updated weights on worker 0-0, policy_version 1042889 (0.00084) [2022-07-11 04:52:28,037][25689] Fps is (10 sec: 5604.3, 60 sec: 5541.6, 300 sec: 5548.7). Total num frames: 1067920384. Throughput: 0: 5815.2. Samples: 1067928664. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 04:52:28,037][25689] Avg episode reward: [(0, '1.568')] [2022-07-11 04:52:29,250][26022] Updated weights on worker 0-0, policy_version 1042899 (0.00096) [2022-07-11 04:52:31,213][26022] Updated weights on worker 0-0, policy_version 1042909 (0.00086) [2022-07-11 04:52:33,045][25689] Fps is (10 sec: 5623.9, 60 sec: 5526.8, 300 sec: 5545.5). Total num frames: 1067948032. Throughput: 0: 4982.0. Samples: 1067945314. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:52:33,046][25689] Avg episode reward: [(0, '1.622')] [2022-07-11 04:52:33,103][26022] Updated weights on worker 0-0, policy_version 1042919 (0.00081) [2022-07-11 04:52:34,866][26022] Updated weights on worker 0-0, policy_version 1042929 (0.00085) [2022-07-11 04:52:36,557][26022] Updated weights on worker 0-0, policy_version 1042939 (0.00084) [2022-07-11 04:52:38,053][25689] Fps is (10 sec: 5725.2, 60 sec: 5548.3, 300 sec: 5553.4). Total num frames: 1067977728. Throughput: 0: 5855.6. Samples: 1067979192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:52:38,054][25689] Avg episode reward: [(0, '1.106')] [2022-07-11 04:52:38,599][26022] Updated weights on worker 0-0, policy_version 1042949 (0.00098) [2022-07-11 04:52:40,194][26022] Updated weights on worker 0-0, policy_version 1042959 (0.00083) [2022-07-11 04:52:42,391][26022] Updated weights on worker 0-0, policy_version 1042969 (0.00089) [2022-07-11 04:52:43,159][25689] Fps is (10 sec: 5568.7, 60 sec: 5527.4, 300 sec: 5551.6). Total num frames: 1068004352. Throughput: 0: 5842.1. Samples: 1068012678. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:52:43,159][25689] Avg episode reward: [(0, '0.511')] [2022-07-11 04:52:43,823][26022] Updated weights on worker 0-0, policy_version 1042979 (0.00080) [2022-07-11 04:52:45,878][26022] Updated weights on worker 0-0, policy_version 1042989 (0.00061) [2022-07-11 04:52:47,443][26022] Updated weights on worker 0-0, policy_version 1042999 (0.00100) [2022-07-11 04:52:48,215][25689] Fps is (10 sec: 5441.7, 60 sec: 5525.8, 300 sec: 5547.2). Total num frames: 1068033024. Throughput: 0: 5013.1. Samples: 1068029816. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:52:48,215][25689] Avg episode reward: [(0, '0.536')] [2022-07-11 04:52:49,212][26022] Updated weights on worker 0-0, policy_version 1043009 (0.00092) [2022-07-11 04:52:51,300][26022] Updated weights on worker 0-0, policy_version 1043019 (0.00083) [2022-07-11 04:52:52,921][26022] Updated weights on worker 0-0, policy_version 1043029 (0.00092) [2022-07-11 04:52:53,224][25689] Fps is (10 sec: 5697.3, 60 sec: 5534.2, 300 sec: 5551.1). Total num frames: 1068061696. Throughput: 0: 5871.4. Samples: 1068063792. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:52:53,225][25689] Avg episode reward: [(0, '0.156')] [2022-07-11 04:52:54,946][26022] Updated weights on worker 0-0, policy_version 1043039 (0.00090) [2022-07-11 04:52:56,826][26022] Updated weights on worker 0-0, policy_version 1043049 (0.00087) [2022-07-11 04:52:58,251][25689] Fps is (10 sec: 5815.8, 60 sec: 5566.4, 300 sec: 5558.4). Total num frames: 1068091392. Throughput: 0: 5856.9. Samples: 1068097488. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:52:58,251][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 04:52:58,295][26022] Updated weights on worker 0-0, policy_version 1043059 (0.00088) [2022-07-11 04:53:00,584][26022] Updated weights on worker 0-0, policy_version 1043069 (0.00086) [2022-07-11 04:53:02,295][26022] Updated weights on worker 0-0, policy_version 1043079 (0.00104) [2022-07-11 04:53:03,374][25689] Fps is (10 sec: 5447.6, 60 sec: 5545.4, 300 sec: 5552.7). Total num frames: 1068116992. Throughput: 0: 5030.5. Samples: 1068114372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:03,375][25689] Avg episode reward: [(0, '-0.243')] [2022-07-11 04:53:04,474][26022] Updated weights on worker 0-0, policy_version 1043089 (0.00083) [2022-07-11 04:53:06,063][26022] Updated weights on worker 0-0, policy_version 1043099 (0.00087) [2022-07-11 04:53:08,006][26022] Updated weights on worker 0-0, policy_version 1043109 (0.00126) [2022-07-11 04:53:08,389][25689] Fps is (10 sec: 5353.2, 60 sec: 5566.4, 300 sec: 5559.3). Total num frames: 1068145664. Throughput: 0: 5761.3. Samples: 1068146046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:08,389][25689] Avg episode reward: [(0, '-0.459')] [2022-07-11 04:53:09,885][26022] Updated weights on worker 0-0, policy_version 1043119 (0.00048) [2022-07-11 04:53:11,512][26022] Updated weights on worker 0-0, policy_version 1043129 (0.00078) [2022-07-11 04:53:13,345][26022] Updated weights on worker 0-0, policy_version 1043139 (0.00084) [2022-07-11 04:53:13,441][25689] Fps is (10 sec: 5696.5, 60 sec: 5583.1, 300 sec: 5551.9). Total num frames: 1068174336. Throughput: 0: 5718.0. Samples: 1068179394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:13,442][25689] Avg episode reward: [(0, '0.121')] [2022-07-11 04:53:15,513][26022] Updated weights on worker 0-0, policy_version 1043149 (0.00091) [2022-07-11 04:53:16,900][26022] Updated weights on worker 0-0, policy_version 1043159 (0.00091) [2022-07-11 04:53:18,494][25689] Fps is (10 sec: 5471.9, 60 sec: 5567.9, 300 sec: 5551.9). Total num frames: 1068200960. Throughput: 0: 4879.3. Samples: 1068196262. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:18,495][25689] Avg episode reward: [(0, '-1.262')] [2022-07-11 04:53:19,088][26022] Updated weights on worker 0-0, policy_version 1043169 (0.00101) [2022-07-11 04:53:20,763][26022] Updated weights on worker 0-0, policy_version 1043179 (0.00090) [2022-07-11 04:53:22,490][26022] Updated weights on worker 0-0, policy_version 1043189 (0.00088) [2022-07-11 04:53:23,573][25689] Fps is (10 sec: 5558.6, 60 sec: 5598.5, 300 sec: 5554.0). Total num frames: 1068230656. Throughput: 0: 5728.9. Samples: 1068230088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:23,574][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 04:53:24,536][26022] Updated weights on worker 0-0, policy_version 1043199 (0.00091) [2022-07-11 04:53:26,156][26022] Updated weights on worker 0-0, policy_version 1043209 (0.00095) [2022-07-11 04:53:28,127][26022] Updated weights on worker 0-0, policy_version 1043219 (0.00084) [2022-07-11 04:53:28,611][25689] Fps is (10 sec: 5871.0, 60 sec: 5612.5, 300 sec: 5560.4). Total num frames: 1068260352. Throughput: 0: 5829.0. Samples: 1068263918. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:28,611][25689] Avg episode reward: [(0, '-0.005')] [2022-07-11 04:53:30,102][26022] Updated weights on worker 0-0, policy_version 1043229 (0.00096) [2022-07-11 04:53:31,702][26022] Updated weights on worker 0-0, policy_version 1043239 (0.00090) [2022-07-11 04:53:33,643][25689] Fps is (10 sec: 5491.2, 60 sec: 5576.4, 300 sec: 5556.5). Total num frames: 1068285952. Throughput: 0: 5008.3. Samples: 1068280572. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:33,643][25689] Avg episode reward: [(0, '0.422')] [2022-07-11 04:53:33,716][26022] Updated weights on worker 0-0, policy_version 1043249 (0.00089) [2022-07-11 04:53:35,472][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:53:35,489][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001043259_1068297216.pth [2022-07-11 04:53:35,489][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001041303_1066294272.pth [2022-07-11 04:53:35,499][26022] Updated weights on worker 0-0, policy_version 1043259 (0.00092) [2022-07-11 04:53:37,234][26022] Updated weights on worker 0-0, policy_version 1043269 (0.00084) [2022-07-11 04:53:38,669][25689] Fps is (10 sec: 5395.4, 60 sec: 5557.8, 300 sec: 5555.0). Total num frames: 1068314624. Throughput: 0: 5852.0. Samples: 1068314326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:38,670][25689] Avg episode reward: [(0, '0.304')] [2022-07-11 04:53:39,154][26022] Updated weights on worker 0-0, policy_version 1043279 (0.00084) [2022-07-11 04:53:41,019][26022] Updated weights on worker 0-0, policy_version 1043289 (0.00093) [2022-07-11 04:53:42,703][26022] Updated weights on worker 0-0, policy_version 1043299 (0.00073) [2022-07-11 04:53:43,814][25689] Fps is (10 sec: 5738.9, 60 sec: 5605.0, 300 sec: 5559.2). Total num frames: 1068344320. Throughput: 0: 5813.9. Samples: 1068347764. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:43,814][25689] Avg episode reward: [(0, '0.285')] [2022-07-11 04:53:44,867][26022] Updated weights on worker 0-0, policy_version 1043309 (0.00092) [2022-07-11 04:53:46,279][26022] Updated weights on worker 0-0, policy_version 1043319 (0.00088) [2022-07-11 04:53:48,520][26022] Updated weights on worker 0-0, policy_version 1043329 (0.00089) [2022-07-11 04:53:48,846][25689] Fps is (10 sec: 5534.0, 60 sec: 5573.3, 300 sec: 5552.1). Total num frames: 1068370944. Throughput: 0: 5815.3. Samples: 1068381596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:48,847][25689] Avg episode reward: [(0, '1.802')] [2022-07-11 04:53:49,698][26022] Updated weights on worker 0-0, policy_version 1043339 (0.00086) [2022-07-11 04:53:52,173][26022] Updated weights on worker 0-0, policy_version 1043349 (0.00079) [2022-07-11 04:53:53,678][26022] Updated weights on worker 0-0, policy_version 1043359 (0.00087) [2022-07-11 04:53:53,862][25689] Fps is (10 sec: 5503.2, 60 sec: 5572.8, 300 sec: 5555.6). Total num frames: 1068399616. Throughput: 0: 5824.1. Samples: 1068398326. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:53,862][25689] Avg episode reward: [(0, '1.954')] [2022-07-11 04:53:55,682][26022] Updated weights on worker 0-0, policy_version 1043369 (0.00086) [2022-07-11 04:53:57,391][26022] Updated weights on worker 0-0, policy_version 1043379 (0.00091) [2022-07-11 04:53:58,869][25689] Fps is (10 sec: 5619.6, 60 sec: 5540.8, 300 sec: 5549.7). Total num frames: 1068427264. Throughput: 0: 5816.3. Samples: 1068431812. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:53:58,869][25689] Avg episode reward: [(0, '1.367')] [2022-07-11 04:53:59,319][26022] Updated weights on worker 0-0, policy_version 1043389 (0.00080) [2022-07-11 04:54:01,051][26022] Updated weights on worker 0-0, policy_version 1043399 (0.00087) [2022-07-11 04:54:03,439][26022] Updated weights on worker 0-0, policy_version 1043409 (0.00084) [2022-07-11 04:54:03,962][25689] Fps is (10 sec: 5373.3, 60 sec: 5560.5, 300 sec: 5558.6). Total num frames: 1068453888. Throughput: 0: 5738.9. Samples: 1068463394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:03,962][25689] Avg episode reward: [(0, '1.561')] [2022-07-11 04:54:04,949][26022] Updated weights on worker 0-0, policy_version 1043419 (0.00093) [2022-07-11 04:54:07,018][26022] Updated weights on worker 0-0, policy_version 1043429 (0.00090) [2022-07-11 04:54:08,617][26022] Updated weights on worker 0-0, policy_version 1043439 (0.00091) [2022-07-11 04:54:08,986][25689] Fps is (10 sec: 5465.4, 60 sec: 5559.6, 300 sec: 5552.7). Total num frames: 1068482560. Throughput: 0: 4904.9. Samples: 1068480378. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:08,987][25689] Avg episode reward: [(0, '1.430')] [2022-07-11 04:54:10,667][26022] Updated weights on worker 0-0, policy_version 1043449 (0.00085) [2022-07-11 04:54:12,407][26022] Updated weights on worker 0-0, policy_version 1043459 (0.00084) [2022-07-11 04:54:14,005][25689] Fps is (10 sec: 5607.8, 60 sec: 5545.7, 300 sec: 5556.1). Total num frames: 1068510208. Throughput: 0: 5736.5. Samples: 1068513878. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:14,005][25689] Avg episode reward: [(0, '1.516')] [2022-07-11 04:54:14,216][26022] Updated weights on worker 0-0, policy_version 1043469 (0.00090) [2022-07-11 04:54:16,016][26022] Updated weights on worker 0-0, policy_version 1043479 (0.00086) [2022-07-11 04:54:17,925][26022] Updated weights on worker 0-0, policy_version 1043489 (0.00092) [2022-07-11 04:54:19,008][25689] Fps is (10 sec: 5619.8, 60 sec: 5584.2, 300 sec: 5556.9). Total num frames: 1068538880. Throughput: 0: 5751.7. Samples: 1068547646. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:19,008][25689] Avg episode reward: [(0, '1.568')] [2022-07-11 04:54:19,819][26022] Updated weights on worker 0-0, policy_version 1043499 (0.00084) [2022-07-11 04:54:21,685][26022] Updated weights on worker 0-0, policy_version 1043509 (0.00082) [2022-07-11 04:54:23,305][26022] Updated weights on worker 0-0, policy_version 1043519 (0.00084) [2022-07-11 04:54:24,142][25689] Fps is (10 sec: 5656.9, 60 sec: 5562.2, 300 sec: 5559.4). Total num frames: 1068567552. Throughput: 0: 5006.0. Samples: 1068564416. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:24,142][25689] Avg episode reward: [(0, '0.824')] [2022-07-11 04:54:25,173][26022] Updated weights on worker 0-0, policy_version 1043529 (0.00086) [2022-07-11 04:54:26,938][26022] Updated weights on worker 0-0, policy_version 1043539 (0.00091) [2022-07-11 04:54:28,775][26022] Updated weights on worker 0-0, policy_version 1043549 (0.00087) [2022-07-11 04:54:29,155][25689] Fps is (10 sec: 5651.4, 60 sec: 5547.5, 300 sec: 5560.6). Total num frames: 1068596224. Throughput: 0: 5853.7. Samples: 1068598440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:29,155][25689] Avg episode reward: [(0, '1.606')] [2022-07-11 04:54:30,755][26022] Updated weights on worker 0-0, policy_version 1043559 (0.00085) [2022-07-11 04:54:32,438][26022] Updated weights on worker 0-0, policy_version 1043569 (0.00092) [2022-07-11 04:54:34,241][25689] Fps is (10 sec: 5475.6, 60 sec: 5559.6, 300 sec: 5556.5). Total num frames: 1068622848. Throughput: 0: 5838.6. Samples: 1068632026. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:34,241][25689] Avg episode reward: [(0, '1.680')] [2022-07-11 04:54:34,444][26022] Updated weights on worker 0-0, policy_version 1043579 (0.00090) [2022-07-11 04:54:36,131][26022] Updated weights on worker 0-0, policy_version 1043589 (0.00089) [2022-07-11 04:54:37,939][26022] Updated weights on worker 0-0, policy_version 1043599 (0.00089) [2022-07-11 04:54:39,267][25689] Fps is (10 sec: 5569.7, 60 sec: 5576.5, 300 sec: 5561.5). Total num frames: 1068652544. Throughput: 0: 4998.4. Samples: 1068648908. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:39,267][25689] Avg episode reward: [(0, '1.763')] [2022-07-11 04:54:39,749][26022] Updated weights on worker 0-0, policy_version 1043609 (0.00092) [2022-07-11 04:54:41,755][26022] Updated weights on worker 0-0, policy_version 1043619 (0.00080) [2022-07-11 04:54:43,376][26022] Updated weights on worker 0-0, policy_version 1043629 (0.00345) [2022-07-11 04:54:44,383][25689] Fps is (10 sec: 5754.8, 60 sec: 5562.1, 300 sec: 5566.3). Total num frames: 1068681216. Throughput: 0: 5834.9. Samples: 1068682522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:44,384][25689] Avg episode reward: [(0, '1.432')] [2022-07-11 04:54:45,443][26022] Updated weights on worker 0-0, policy_version 1043639 (0.00090) [2022-07-11 04:54:47,233][26022] Updated weights on worker 0-0, policy_version 1043649 (0.00090) [2022-07-11 04:54:48,934][26022] Updated weights on worker 0-0, policy_version 1043659 (0.00090) [2022-07-11 04:54:49,401][25689] Fps is (10 sec: 5557.5, 60 sec: 5580.5, 300 sec: 5557.3). Total num frames: 1068708864. Throughput: 0: 5814.0. Samples: 1068716150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:49,401][25689] Avg episode reward: [(0, '1.938')] [2022-07-11 04:54:50,835][26022] Updated weights on worker 0-0, policy_version 1043669 (0.00088) [2022-07-11 04:54:52,545][26022] Updated weights on worker 0-0, policy_version 1043679 (0.00085) [2022-07-11 04:54:54,405][25689] Fps is (10 sec: 5517.5, 60 sec: 5564.5, 300 sec: 5564.1). Total num frames: 1068736512. Throughput: 0: 5005.3. Samples: 1068732954. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:54,405][25689] Avg episode reward: [(0, '1.556')] [2022-07-11 04:54:54,516][26022] Updated weights on worker 0-0, policy_version 1043689 (0.00088) [2022-07-11 04:54:56,056][26022] Updated weights on worker 0-0, policy_version 1043699 (0.00092) [2022-07-11 04:54:58,307][26022] Updated weights on worker 0-0, policy_version 1043709 (0.00089) [2022-07-11 04:54:59,419][25689] Fps is (10 sec: 5622.0, 60 sec: 5580.8, 300 sec: 5565.9). Total num frames: 1068765184. Throughput: 0: 5843.6. Samples: 1068766668. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:54:59,420][25689] Avg episode reward: [(0, '0.857')] [2022-07-11 04:54:59,766][26022] Updated weights on worker 0-0, policy_version 1043719 (0.00083) [2022-07-11 04:55:02,279][26022] Updated weights on worker 0-0, policy_version 1043729 (0.00083) [2022-07-11 04:55:03,800][26022] Updated weights on worker 0-0, policy_version 1043739 (0.00107) [2022-07-11 04:55:04,535][25689] Fps is (10 sec: 5458.7, 60 sec: 5578.7, 300 sec: 5564.4). Total num frames: 1068791808. Throughput: 0: 5731.8. Samples: 1068798028. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:04,535][25689] Avg episode reward: [(0, '0.979')] [2022-07-11 04:55:05,885][26022] Updated weights on worker 0-0, policy_version 1043749 (0.00083) [2022-07-11 04:55:07,370][26022] Updated weights on worker 0-0, policy_version 1043759 (0.00095) [2022-07-11 04:55:09,535][26022] Updated weights on worker 0-0, policy_version 1043769 (0.00081) [2022-07-11 04:55:09,539][25689] Fps is (10 sec: 5362.6, 60 sec: 5563.6, 300 sec: 5568.2). Total num frames: 1068819456. Throughput: 0: 4906.4. Samples: 1068814958. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:09,540][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 04:55:11,136][26022] Updated weights on worker 0-0, policy_version 1043779 (0.00081) [2022-07-11 04:55:13,104][26022] Updated weights on worker 0-0, policy_version 1043789 (0.00079) [2022-07-11 04:55:14,622][25689] Fps is (10 sec: 5685.0, 60 sec: 5591.6, 300 sec: 5566.7). Total num frames: 1068849152. Throughput: 0: 5729.0. Samples: 1068848776. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:14,622][25689] Avg episode reward: [(0, '-0.646')] [2022-07-11 04:55:14,751][26022] Updated weights on worker 0-0, policy_version 1043799 (0.00096) [2022-07-11 04:55:16,528][26022] Updated weights on worker 0-0, policy_version 1043809 (0.00086) [2022-07-11 04:55:18,628][26022] Updated weights on worker 0-0, policy_version 1043819 (0.00085) [2022-07-11 04:55:19,660][25689] Fps is (10 sec: 5665.7, 60 sec: 5571.4, 300 sec: 5563.6). Total num frames: 1068876800. Throughput: 0: 5731.0. Samples: 1068882674. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:19,661][25689] Avg episode reward: [(0, '-1.052')] [2022-07-11 04:55:20,168][26022] Updated weights on worker 0-0, policy_version 1043829 (0.00092) [2022-07-11 04:55:22,441][26022] Updated weights on worker 0-0, policy_version 1043839 (0.00085) [2022-07-11 04:55:24,076][26022] Updated weights on worker 0-0, policy_version 1043849 (0.00084) [2022-07-11 04:55:24,783][25689] Fps is (10 sec: 5542.7, 60 sec: 5572.5, 300 sec: 5565.2). Total num frames: 1068905472. Throughput: 0: 5008.2. Samples: 1068899434. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:24,783][25689] Avg episode reward: [(0, '-0.662')] [2022-07-11 04:55:25,886][26022] Updated weights on worker 0-0, policy_version 1043859 (0.00092) [2022-07-11 04:55:27,649][26022] Updated weights on worker 0-0, policy_version 1043869 (0.00092) [2022-07-11 04:55:29,392][26022] Updated weights on worker 0-0, policy_version 1043879 (0.00083) [2022-07-11 04:55:29,845][25689] Fps is (10 sec: 5530.0, 60 sec: 5551.1, 300 sec: 5564.3). Total num frames: 1068933120. Throughput: 0: 5819.8. Samples: 1068933132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:29,845][25689] Avg episode reward: [(0, '0.105')] [2022-07-11 04:55:31,550][26022] Updated weights on worker 0-0, policy_version 1043889 (0.00089) [2022-07-11 04:55:33,110][26022] Updated weights on worker 0-0, policy_version 1043899 (0.00092) [2022-07-11 04:55:34,849][25689] Fps is (10 sec: 5696.5, 60 sec: 5609.2, 300 sec: 5567.9). Total num frames: 1068962816. Throughput: 0: 5837.6. Samples: 1068966856. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:34,850][25689] Avg episode reward: [(0, '0.122')] [2022-07-11 04:55:34,851][26022] Updated weights on worker 0-0, policy_version 1043909 (0.00080) [2022-07-11 04:55:35,600][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:55:35,616][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001043912_1068965888.pth [2022-07-11 04:55:35,616][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001041954_1066960896.pth [2022-07-11 04:55:36,577][26022] Updated weights on worker 0-0, policy_version 1043919 (0.00088) [2022-07-11 04:55:38,611][26022] Updated weights on worker 0-0, policy_version 1043929 (0.00085) [2022-07-11 04:55:39,855][25689] Fps is (10 sec: 5830.8, 60 sec: 5594.2, 300 sec: 5565.1). Total num frames: 1068991488. Throughput: 0: 5015.2. Samples: 1068983954. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:39,856][25689] Avg episode reward: [(0, '-1.682')] [2022-07-11 04:55:40,368][26022] Updated weights on worker 0-0, policy_version 1043939 (0.00091) [2022-07-11 04:55:42,268][26022] Updated weights on worker 0-0, policy_version 1043949 (0.00078) [2022-07-11 04:55:43,948][26022] Updated weights on worker 0-0, policy_version 1043959 (0.00093) [2022-07-11 04:55:44,928][25689] Fps is (10 sec: 5486.2, 60 sec: 5564.4, 300 sec: 5567.4). Total num frames: 1069018112. Throughput: 0: 5869.1. Samples: 1069017670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:44,929][25689] Avg episode reward: [(0, '-1.211')] [2022-07-11 04:55:46,037][26022] Updated weights on worker 0-0, policy_version 1043969 (0.00088) [2022-07-11 04:55:47,692][26022] Updated weights on worker 0-0, policy_version 1043979 (0.00089) [2022-07-11 04:55:49,607][26022] Updated weights on worker 0-0, policy_version 1043989 (0.00085) [2022-07-11 04:55:49,945][25689] Fps is (10 sec: 5480.3, 60 sec: 5581.4, 300 sec: 5567.4). Total num frames: 1069046784. Throughput: 0: 5884.5. Samples: 1069051412. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:49,945][25689] Avg episode reward: [(0, '-0.705')] [2022-07-11 04:55:51,286][26022] Updated weights on worker 0-0, policy_version 1043999 (0.00090) [2022-07-11 04:55:53,208][26022] Updated weights on worker 0-0, policy_version 1044009 (0.00090) [2022-07-11 04:55:54,991][25689] Fps is (10 sec: 5698.5, 60 sec: 5594.4, 300 sec: 5563.4). Total num frames: 1069075456. Throughput: 0: 5017.8. Samples: 1069067924. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:55:54,992][25689] Avg episode reward: [(0, '-0.864')] [2022-07-11 04:55:54,997][26022] Updated weights on worker 0-0, policy_version 1044019 (0.00086) [2022-07-11 04:55:56,810][26022] Updated weights on worker 0-0, policy_version 1044029 (0.00097) [2022-07-11 04:55:58,774][26022] Updated weights on worker 0-0, policy_version 1044039 (0.00088) [2022-07-11 04:56:00,017][25689] Fps is (10 sec: 5591.9, 60 sec: 5576.4, 300 sec: 5574.4). Total num frames: 1069103104. Throughput: 0: 5844.8. Samples: 1069101794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:56:00,017][25689] Avg episode reward: [(0, '-0.946')] [2022-07-11 04:56:00,479][26022] Updated weights on worker 0-0, policy_version 1044049 (0.00088) [2022-07-11 04:56:02,948][26022] Updated weights on worker 0-0, policy_version 1044059 (0.00089) [2022-07-11 04:56:04,387][26022] Updated weights on worker 0-0, policy_version 1044069 (0.00087) [2022-07-11 04:56:05,061][25689] Fps is (10 sec: 5389.5, 60 sec: 5583.0, 300 sec: 5567.0). Total num frames: 1069129728. Throughput: 0: 5732.5. Samples: 1069133082. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:56:05,062][25689] Avg episode reward: [(0, '-0.664')] [2022-07-11 04:56:06,620][26022] Updated weights on worker 0-0, policy_version 1044079 (0.00090) [2022-07-11 04:56:08,177][26022] Updated weights on worker 0-0, policy_version 1044089 (0.00090) [2022-07-11 04:56:10,103][25689] Fps is (10 sec: 5279.3, 60 sec: 5562.7, 300 sec: 5563.1). Total num frames: 1069156352. Throughput: 0: 5718.3. Samples: 1069166680. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:56:10,103][25689] Avg episode reward: [(0, '0.347')] [2022-07-11 04:56:10,184][26022] Updated weights on worker 0-0, policy_version 1044099 (0.00088) [2022-07-11 04:56:11,969][26022] Updated weights on worker 0-0, policy_version 1044109 (0.00093) [2022-07-11 04:56:13,721][26022] Updated weights on worker 0-0, policy_version 1044119 (0.00095) [2022-07-11 04:56:15,169][25689] Fps is (10 sec: 5470.8, 60 sec: 5547.3, 300 sec: 5562.1). Total num frames: 1069185024. Throughput: 0: 5715.9. Samples: 1069183256. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:56:15,169][25689] Avg episode reward: [(0, '0.113')] [2022-07-11 04:56:15,714][26022] Updated weights on worker 0-0, policy_version 1044129 (0.00094) [2022-07-11 04:56:17,607][26022] Updated weights on worker 0-0, policy_version 1044139 (0.00093) [2022-07-11 04:56:19,049][26022] Updated weights on worker 0-0, policy_version 1044149 (0.00740) [2022-07-11 04:56:20,227][25689] Fps is (10 sec: 5563.1, 60 sec: 5545.5, 300 sec: 5560.0). Total num frames: 1069212672. Throughput: 0: 5697.9. Samples: 1069216948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:56:20,228][25689] Avg episode reward: [(0, '-0.138')] [2022-07-11 04:56:21,322][26022] Updated weights on worker 0-0, policy_version 1044159 (0.00083) [2022-07-11 04:56:22,891][26022] Updated weights on worker 0-0, policy_version 1044169 (0.00088) [2022-07-11 04:56:24,802][26022] Updated weights on worker 0-0, policy_version 1044179 (0.00091) [2022-07-11 04:56:25,281][25689] Fps is (10 sec: 5670.6, 60 sec: 5568.7, 300 sec: 5566.0). Total num frames: 1069242368. Throughput: 0: 5806.8. Samples: 1069250496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 04:56:25,282][25689] Avg episode reward: [(0, '-0.343')] [2022-07-11 04:56:26,806][26022] Updated weights on worker 0-0, policy_version 1044189 (0.00094) [2022-07-11 04:56:28,461][26022] Updated weights on worker 0-0, policy_version 1044199 (0.00087) [2022-07-11 04:56:30,316][25689] Fps is (10 sec: 5582.2, 60 sec: 5554.3, 300 sec: 5559.1). Total num frames: 1069268992. Throughput: 0: 4974.2. Samples: 1069267224. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:56:30,316][25689] Avg episode reward: [(0, '-0.110')] [2022-07-11 04:56:30,398][26022] Updated weights on worker 0-0, policy_version 1044209 (0.00091) [2022-07-11 04:56:32,207][26022] Updated weights on worker 0-0, policy_version 1044219 (0.00088) [2022-07-11 04:56:34,062][26022] Updated weights on worker 0-0, policy_version 1044229 (0.00084) [2022-07-11 04:56:35,331][25689] Fps is (10 sec: 5400.2, 60 sec: 5519.4, 300 sec: 5556.4). Total num frames: 1069296640. Throughput: 0: 5807.1. Samples: 1069300342. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:56:35,332][25689] Avg episode reward: [(0, '1.070')] [2022-07-11 04:56:35,939][26022] Updated weights on worker 0-0, policy_version 1044239 (0.00091) [2022-07-11 04:56:37,665][26022] Updated weights on worker 0-0, policy_version 1044249 (0.00834) [2022-07-11 04:56:39,599][26022] Updated weights on worker 0-0, policy_version 1044259 (0.00096) [2022-07-11 04:56:40,365][25689] Fps is (10 sec: 5706.0, 60 sec: 5533.7, 300 sec: 5563.8). Total num frames: 1069326336. Throughput: 0: 5801.6. Samples: 1069333786. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:56:40,366][25689] Avg episode reward: [(0, '1.624')] [2022-07-11 04:56:41,560][26022] Updated weights on worker 0-0, policy_version 1044269 (0.00092) [2022-07-11 04:56:43,327][26022] Updated weights on worker 0-0, policy_version 1044279 (0.00084) [2022-07-11 04:56:45,232][26022] Updated weights on worker 0-0, policy_version 1044289 (0.00094) [2022-07-11 04:56:45,451][25689] Fps is (10 sec: 5565.5, 60 sec: 5532.6, 300 sec: 5556.1). Total num frames: 1069352960. Throughput: 0: 4944.4. Samples: 1069350222. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:56:45,451][25689] Avg episode reward: [(0, '1.425')] [2022-07-11 04:56:46,823][26022] Updated weights on worker 0-0, policy_version 1044299 (0.00091) [2022-07-11 04:56:48,730][26022] Updated weights on worker 0-0, policy_version 1044309 (0.00088) [2022-07-11 04:56:50,486][25689] Fps is (10 sec: 5463.6, 60 sec: 5530.9, 300 sec: 5557.3). Total num frames: 1069381632. Throughput: 0: 5790.5. Samples: 1069384022. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:56:50,487][25689] Avg episode reward: [(0, '1.637')] [2022-07-11 04:56:50,593][26022] Updated weights on worker 0-0, policy_version 1044319 (0.00094) [2022-07-11 04:56:52,320][26022] Updated weights on worker 0-0, policy_version 1044329 (0.00093) [2022-07-11 04:56:54,227][26022] Updated weights on worker 0-0, policy_version 1044339 (0.00082) [2022-07-11 04:56:55,547][25689] Fps is (10 sec: 5679.7, 60 sec: 5529.6, 300 sec: 5559.8). Total num frames: 1069410304. Throughput: 0: 5809.8. Samples: 1069417792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:56:55,548][25689] Avg episode reward: [(0, '1.544')] [2022-07-11 04:56:56,011][26022] Updated weights on worker 0-0, policy_version 1044349 (0.00094) [2022-07-11 04:56:57,770][26022] Updated weights on worker 0-0, policy_version 1044359 (0.00086) [2022-07-11 04:56:59,787][26022] Updated weights on worker 0-0, policy_version 1044369 (0.00083) [2022-07-11 04:57:00,560][25689] Fps is (10 sec: 5590.6, 60 sec: 5530.7, 300 sec: 5564.4). Total num frames: 1069437952. Throughput: 0: 4995.4. Samples: 1069434668. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:00,561][25689] Avg episode reward: [(0, '1.354')] [2022-07-11 04:57:01,544][26022] Updated weights on worker 0-0, policy_version 1044379 (0.00090) [2022-07-11 04:57:03,740][26022] Updated weights on worker 0-0, policy_version 1044389 (0.00086) [2022-07-11 04:57:05,613][25689] Fps is (10 sec: 5391.4, 60 sec: 5529.9, 300 sec: 5561.1). Total num frames: 1069464576. Throughput: 0: 5747.1. Samples: 1069466098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:05,614][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 04:57:05,616][26022] Updated weights on worker 0-0, policy_version 1044399 (0.00090) [2022-07-11 04:57:07,254][26022] Updated weights on worker 0-0, policy_version 1044409 (0.00079) [2022-07-11 04:57:09,372][26022] Updated weights on worker 0-0, policy_version 1044419 (0.00095) [2022-07-11 04:57:10,622][25689] Fps is (10 sec: 5496.0, 60 sec: 5566.8, 300 sec: 5565.3). Total num frames: 1069493248. Throughput: 0: 5746.8. Samples: 1069499736. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:10,622][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 04:57:11,100][26022] Updated weights on worker 0-0, policy_version 1044429 (0.00091) [2022-07-11 04:57:12,884][26022] Updated weights on worker 0-0, policy_version 1044439 (0.00091) [2022-07-11 04:57:14,967][26022] Updated weights on worker 0-0, policy_version 1044449 (0.00089) [2022-07-11 04:57:15,702][25689] Fps is (10 sec: 5481.2, 60 sec: 5531.6, 300 sec: 5561.7). Total num frames: 1069519872. Throughput: 0: 4899.4. Samples: 1069516538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:15,702][25689] Avg episode reward: [(0, '1.289')] [2022-07-11 04:57:16,500][26022] Updated weights on worker 0-0, policy_version 1044459 (0.00093) [2022-07-11 04:57:18,492][26022] Updated weights on worker 0-0, policy_version 1044469 (0.00056) [2022-07-11 04:57:20,419][26022] Updated weights on worker 0-0, policy_version 1044479 (0.00083) [2022-07-11 04:57:20,714][25689] Fps is (10 sec: 5478.9, 60 sec: 5552.7, 300 sec: 5565.8). Total num frames: 1069548544. Throughput: 0: 5716.8. Samples: 1069549884. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:20,715][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 04:57:21,923][26022] Updated weights on worker 0-0, policy_version 1044489 (0.00094) [2022-07-11 04:57:24,147][26022] Updated weights on worker 0-0, policy_version 1044499 (0.00058) [2022-07-11 04:57:25,636][26022] Updated weights on worker 0-0, policy_version 1044509 (0.00080) [2022-07-11 04:57:25,835][25689] Fps is (10 sec: 5659.2, 60 sec: 5529.8, 300 sec: 5563.6). Total num frames: 1069577216. Throughput: 0: 5801.7. Samples: 1069583416. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:25,835][25689] Avg episode reward: [(0, '1.429')] [2022-07-11 04:57:27,669][26022] Updated weights on worker 0-0, policy_version 1044519 (0.00085) [2022-07-11 04:57:29,404][26022] Updated weights on worker 0-0, policy_version 1044529 (0.00088) [2022-07-11 04:57:30,874][25689] Fps is (10 sec: 5543.3, 60 sec: 5546.3, 300 sec: 5563.0). Total num frames: 1069604864. Throughput: 0: 4954.4. Samples: 1069600076. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:30,875][25689] Avg episode reward: [(0, '2.327')] [2022-07-11 04:57:31,181][26022] Updated weights on worker 0-0, policy_version 1044539 (0.00090) [2022-07-11 04:57:33,267][26022] Updated weights on worker 0-0, policy_version 1044549 (0.00086) [2022-07-11 04:57:34,795][26022] Updated weights on worker 0-0, policy_version 1044559 (0.00086) [2022-07-11 04:57:35,764][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:57:35,786][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001044563_1069632512.pth [2022-07-11 04:57:35,787][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001042607_1067629568.pth [2022-07-11 04:57:35,888][25689] Fps is (10 sec: 5500.2, 60 sec: 5546.4, 300 sec: 5556.0). Total num frames: 1069632512. Throughput: 0: 5808.4. Samples: 1069633788. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:35,889][25689] Avg episode reward: [(0, '1.889')] [2022-07-11 04:57:36,762][26022] Updated weights on worker 0-0, policy_version 1044569 (0.00093) [2022-07-11 04:57:38,654][26022] Updated weights on worker 0-0, policy_version 1044579 (0.00089) [2022-07-11 04:57:40,427][26022] Updated weights on worker 0-0, policy_version 1044589 (0.00092) [2022-07-11 04:57:40,895][25689] Fps is (10 sec: 5620.2, 60 sec: 5532.0, 300 sec: 5564.8). Total num frames: 1069661184. Throughput: 0: 5821.3. Samples: 1069667362. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:40,896][25689] Avg episode reward: [(0, '1.048')] [2022-07-11 04:57:42,353][26022] Updated weights on worker 0-0, policy_version 1044599 (0.00092) [2022-07-11 04:57:44,276][26022] Updated weights on worker 0-0, policy_version 1044609 (0.00093) [2022-07-11 04:57:46,022][25689] Fps is (10 sec: 5557.4, 60 sec: 5545.1, 300 sec: 5560.0). Total num frames: 1069688832. Throughput: 0: 4978.2. Samples: 1069683914. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:46,023][25689] Avg episode reward: [(0, '0.979')] [2022-07-11 04:57:46,031][26022] Updated weights on worker 0-0, policy_version 1044619 (0.00094) [2022-07-11 04:57:47,814][26022] Updated weights on worker 0-0, policy_version 1044629 (0.00048) [2022-07-11 04:57:49,572][26022] Updated weights on worker 0-0, policy_version 1044639 (0.00086) [2022-07-11 04:57:51,035][25689] Fps is (10 sec: 5655.2, 60 sec: 5564.0, 300 sec: 5563.4). Total num frames: 1069718528. Throughput: 0: 5831.4. Samples: 1069717642. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:51,036][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 04:57:51,564][26022] Updated weights on worker 0-0, policy_version 1044649 (0.00088) [2022-07-11 04:57:53,323][26022] Updated weights on worker 0-0, policy_version 1044659 (0.00086) [2022-07-11 04:57:55,372][26022] Updated weights on worker 0-0, policy_version 1044669 (0.00093) [2022-07-11 04:57:56,051][25689] Fps is (10 sec: 5615.8, 60 sec: 5534.3, 300 sec: 5553.3). Total num frames: 1069745152. Throughput: 0: 5820.0. Samples: 1069751136. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:57:56,051][25689] Avg episode reward: [(0, '-0.344')] [2022-07-11 04:57:56,939][26022] Updated weights on worker 0-0, policy_version 1044679 (0.00083) [2022-07-11 04:57:58,923][26022] Updated weights on worker 0-0, policy_version 1044689 (0.00083) [2022-07-11 04:58:00,479][26022] Updated weights on worker 0-0, policy_version 1044699 (0.00084) [2022-07-11 04:58:01,071][25689] Fps is (10 sec: 5713.9, 60 sec: 5584.5, 300 sec: 5572.4). Total num frames: 1069775872. Throughput: 0: 4984.2. Samples: 1069767922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:01,073][25689] Avg episode reward: [(0, '0.050')] [2022-07-11 04:58:02,821][26022] Updated weights on worker 0-0, policy_version 1044709 (0.00083) [2022-07-11 04:58:04,552][26022] Updated weights on worker 0-0, policy_version 1044719 (0.00091) [2022-07-11 04:58:06,188][25689] Fps is (10 sec: 5353.6, 60 sec: 5527.8, 300 sec: 5553.3). Total num frames: 1069799424. Throughput: 0: 5727.4. Samples: 1069799414. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:06,189][25689] Avg episode reward: [(0, '0.777')] [2022-07-11 04:58:06,536][26022] Updated weights on worker 0-0, policy_version 1044729 (0.00090) [2022-07-11 04:58:08,175][26022] Updated weights on worker 0-0, policy_version 1044739 (0.00086) [2022-07-11 04:58:10,131][26022] Updated weights on worker 0-0, policy_version 1044749 (0.00089) [2022-07-11 04:58:11,231][25689] Fps is (10 sec: 5140.1, 60 sec: 5524.7, 300 sec: 5553.4). Total num frames: 1069828096. Throughput: 0: 5726.9. Samples: 1069833302. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:11,232][25689] Avg episode reward: [(0, '-0.325')] [2022-07-11 04:58:11,779][26022] Updated weights on worker 0-0, policy_version 1044759 (0.00085) [2022-07-11 04:58:13,752][26022] Updated weights on worker 0-0, policy_version 1044769 (0.00083) [2022-07-11 04:58:15,435][26022] Updated weights on worker 0-0, policy_version 1044779 (0.00086) [2022-07-11 04:58:16,255][25689] Fps is (10 sec: 5798.2, 60 sec: 5580.5, 300 sec: 5564.3). Total num frames: 1069857792. Throughput: 0: 4913.4. Samples: 1069850406. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:16,255][25689] Avg episode reward: [(0, '-0.107')] [2022-07-11 04:58:17,571][26022] Updated weights on worker 0-0, policy_version 1044789 (0.00085) [2022-07-11 04:58:19,182][26022] Updated weights on worker 0-0, policy_version 1044799 (0.00090) [2022-07-11 04:58:21,150][26022] Updated weights on worker 0-0, policy_version 1044809 (0.00083) [2022-07-11 04:58:21,261][25689] Fps is (10 sec: 5615.0, 60 sec: 5547.3, 300 sec: 5555.3). Total num frames: 1069884416. Throughput: 0: 5740.3. Samples: 1069883820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:21,262][25689] Avg episode reward: [(0, '0.075')] [2022-07-11 04:58:22,755][26022] Updated weights on worker 0-0, policy_version 1044819 (0.00086) [2022-07-11 04:58:24,894][26022] Updated weights on worker 0-0, policy_version 1044829 (0.00085) [2022-07-11 04:58:26,315][25689] Fps is (10 sec: 5598.1, 60 sec: 5570.3, 300 sec: 5555.0). Total num frames: 1069914112. Throughput: 0: 5885.6. Samples: 1069917874. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:26,316][25689] Avg episode reward: [(0, '0.852')] [2022-07-11 04:58:26,456][26022] Updated weights on worker 0-0, policy_version 1044839 (0.00096) [2022-07-11 04:58:28,462][26022] Updated weights on worker 0-0, policy_version 1044849 (0.00092) [2022-07-11 04:58:30,129][26022] Updated weights on worker 0-0, policy_version 1044859 (0.00093) [2022-07-11 04:58:31,331][25689] Fps is (10 sec: 5593.3, 60 sec: 5555.6, 300 sec: 5558.8). Total num frames: 1069940736. Throughput: 0: 5031.2. Samples: 1069934426. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:31,331][25689] Avg episode reward: [(0, '0.331')] [2022-07-11 04:58:32,099][26022] Updated weights on worker 0-0, policy_version 1044869 (0.00089) [2022-07-11 04:58:33,842][26022] Updated weights on worker 0-0, policy_version 1044879 (0.00096) [2022-07-11 04:58:35,791][26022] Updated weights on worker 0-0, policy_version 1044889 (0.00086) [2022-07-11 04:58:36,356][25689] Fps is (10 sec: 5405.4, 60 sec: 5554.6, 300 sec: 5555.4). Total num frames: 1069968384. Throughput: 0: 5838.6. Samples: 1069967766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:36,356][25689] Avg episode reward: [(0, '1.706')] [2022-07-11 04:58:37,608][26022] Updated weights on worker 0-0, policy_version 1044899 (0.00084) [2022-07-11 04:58:39,516][26022] Updated weights on worker 0-0, policy_version 1044909 (0.00084) [2022-07-11 04:58:41,137][26022] Updated weights on worker 0-0, policy_version 1044919 (0.00083) [2022-07-11 04:58:41,359][25689] Fps is (10 sec: 5718.3, 60 sec: 5571.9, 300 sec: 5558.0). Total num frames: 1069998080. Throughput: 0: 5854.8. Samples: 1070001486. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:41,359][25689] Avg episode reward: [(0, '1.550')] [2022-07-11 04:58:43,017][26022] Updated weights on worker 0-0, policy_version 1044929 (0.00085) [2022-07-11 04:58:44,953][26022] Updated weights on worker 0-0, policy_version 1044939 (0.00092) [2022-07-11 04:58:46,417][25689] Fps is (10 sec: 5699.7, 60 sec: 5578.3, 300 sec: 5561.0). Total num frames: 1070025728. Throughput: 0: 5834.1. Samples: 1070035146. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:46,418][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 04:58:46,794][26022] Updated weights on worker 0-0, policy_version 1044949 (0.00085) [2022-07-11 04:58:48,462][26022] Updated weights on worker 0-0, policy_version 1044959 (0.00080) [2022-07-11 04:58:50,329][26022] Updated weights on worker 0-0, policy_version 1044969 (0.00093) [2022-07-11 04:58:51,463][25689] Fps is (10 sec: 5472.7, 60 sec: 5541.3, 300 sec: 5557.0). Total num frames: 1070053376. Throughput: 0: 5848.0. Samples: 1070052158. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:51,463][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 04:58:52,295][26022] Updated weights on worker 0-0, policy_version 1044979 (0.00086) [2022-07-11 04:58:54,004][26022] Updated weights on worker 0-0, policy_version 1044989 (0.00088) [2022-07-11 04:58:55,799][26022] Updated weights on worker 0-0, policy_version 1044999 (0.00102) [2022-07-11 04:58:56,500][25689] Fps is (10 sec: 5484.1, 60 sec: 5556.3, 300 sec: 5556.4). Total num frames: 1070081024. Throughput: 0: 5851.6. Samples: 1070085640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:58:56,501][25689] Avg episode reward: [(0, '1.216')] [2022-07-11 04:58:57,527][26022] Updated weights on worker 0-0, policy_version 1045009 (0.00084) [2022-07-11 04:58:59,511][26022] Updated weights on worker 0-0, policy_version 1045019 (0.00083) [2022-07-11 04:59:01,214][26022] Updated weights on worker 0-0, policy_version 1045029 (0.00088) [2022-07-11 04:59:01,515][25689] Fps is (10 sec: 5704.4, 60 sec: 5539.8, 300 sec: 5568.2). Total num frames: 1070110720. Throughput: 0: 5849.0. Samples: 1070119382. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:01,516][25689] Avg episode reward: [(0, '1.582')] [2022-07-11 04:59:03,566][26022] Updated weights on worker 0-0, policy_version 1045039 (0.00095) [2022-07-11 04:59:05,264][26022] Updated weights on worker 0-0, policy_version 1045049 (0.00090) [2022-07-11 04:59:06,595][25689] Fps is (10 sec: 5375.8, 60 sec: 5560.2, 300 sec: 5553.4). Total num frames: 1070135296. Throughput: 0: 4903.1. Samples: 1070134084. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:06,596][25689] Avg episode reward: [(0, '1.688')] [2022-07-11 04:59:07,239][26022] Updated weights on worker 0-0, policy_version 1045059 (0.00081) [2022-07-11 04:59:09,193][26022] Updated weights on worker 0-0, policy_version 1045069 (0.00087) [2022-07-11 04:59:10,859][26022] Updated weights on worker 0-0, policy_version 1045079 (0.00089) [2022-07-11 04:59:11,603][25689] Fps is (10 sec: 5481.5, 60 sec: 5597.3, 300 sec: 5563.9). Total num frames: 1070166016. Throughput: 0: 5754.7. Samples: 1070168060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:11,603][25689] Avg episode reward: [(0, '0.804')] [2022-07-11 04:59:12,858][26022] Updated weights on worker 0-0, policy_version 1045089 (0.00083) [2022-07-11 04:59:14,431][26022] Updated weights on worker 0-0, policy_version 1045099 (0.00094) [2022-07-11 04:59:16,473][26022] Updated weights on worker 0-0, policy_version 1045109 (0.00048) [2022-07-11 04:59:16,621][25689] Fps is (10 sec: 5617.6, 60 sec: 5530.0, 300 sec: 5553.3). Total num frames: 1070191616. Throughput: 0: 5745.0. Samples: 1070201236. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:16,623][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 04:59:18,236][26022] Updated weights on worker 0-0, policy_version 1045119 (0.00084) [2022-07-11 04:59:20,043][26022] Updated weights on worker 0-0, policy_version 1045129 (0.00095) [2022-07-11 04:59:21,640][25689] Fps is (10 sec: 5407.1, 60 sec: 5562.8, 300 sec: 5555.5). Total num frames: 1070220288. Throughput: 0: 4910.0. Samples: 1070218194. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:21,641][25689] Avg episode reward: [(0, '0.976')] [2022-07-11 04:59:22,023][26022] Updated weights on worker 0-0, policy_version 1045139 (0.00087) [2022-07-11 04:59:23,551][26022] Updated weights on worker 0-0, policy_version 1045149 (0.00100) [2022-07-11 04:59:25,533][26022] Updated weights on worker 0-0, policy_version 1045159 (0.01134) [2022-07-11 04:59:26,697][25689] Fps is (10 sec: 5792.5, 60 sec: 5562.5, 300 sec: 5558.1). Total num frames: 1070249984. Throughput: 0: 5849.2. Samples: 1070251666. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:26,698][25689] Avg episode reward: [(0, '0.764')] [2022-07-11 04:59:27,420][26022] Updated weights on worker 0-0, policy_version 1045169 (0.00082) [2022-07-11 04:59:29,125][26022] Updated weights on worker 0-0, policy_version 1045179 (0.00080) [2022-07-11 04:59:31,171][26022] Updated weights on worker 0-0, policy_version 1045189 (0.00086) [2022-07-11 04:59:31,713][25689] Fps is (10 sec: 5591.2, 60 sec: 5562.4, 300 sec: 5559.4). Total num frames: 1070276608. Throughput: 0: 5838.5. Samples: 1070285472. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:31,715][25689] Avg episode reward: [(0, '0.275')] [2022-07-11 04:59:32,741][26022] Updated weights on worker 0-0, policy_version 1045199 (0.00079) [2022-07-11 04:59:34,813][26022] Updated weights on worker 0-0, policy_version 1045209 (0.00086) [2022-07-11 04:59:35,925][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 04:59:35,938][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001045216_1070301184.pth [2022-07-11 04:59:35,939][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001043259_1068297216.pth [2022-07-11 04:59:36,622][26022] Updated weights on worker 0-0, policy_version 1045219 (0.00088) [2022-07-11 04:59:36,764][25689] Fps is (10 sec: 5493.0, 60 sec: 5577.0, 300 sec: 5555.5). Total num frames: 1070305280. Throughput: 0: 5023.5. Samples: 1070302424. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:36,764][25689] Avg episode reward: [(0, '0.186')] [2022-07-11 04:59:38,279][26022] Updated weights on worker 0-0, policy_version 1045229 (0.00085) [2022-07-11 04:59:40,357][26022] Updated weights on worker 0-0, policy_version 1045239 (0.00086) [2022-07-11 04:59:41,858][25689] Fps is (10 sec: 5652.3, 60 sec: 5551.7, 300 sec: 5555.9). Total num frames: 1070333952. Throughput: 0: 5823.7. Samples: 1070335938. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:41,859][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 04:59:42,028][26022] Updated weights on worker 0-0, policy_version 1045249 (0.00088) [2022-07-11 04:59:43,872][26022] Updated weights on worker 0-0, policy_version 1045259 (0.00085) [2022-07-11 04:59:45,708][26022] Updated weights on worker 0-0, policy_version 1045269 (0.00094) [2022-07-11 04:59:46,907][25689] Fps is (10 sec: 5653.7, 60 sec: 5569.4, 300 sec: 5558.7). Total num frames: 1070362624. Throughput: 0: 5829.2. Samples: 1070369470. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:46,907][25689] Avg episode reward: [(0, '-0.856')] [2022-07-11 04:59:47,576][26022] Updated weights on worker 0-0, policy_version 1045279 (0.00088) [2022-07-11 04:59:49,352][26022] Updated weights on worker 0-0, policy_version 1045289 (0.00090) [2022-07-11 04:59:51,291][26022] Updated weights on worker 0-0, policy_version 1045299 (0.00082) [2022-07-11 04:59:51,936][25689] Fps is (10 sec: 5588.6, 60 sec: 5571.0, 300 sec: 5558.3). Total num frames: 1070390272. Throughput: 0: 4995.1. Samples: 1070386488. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:51,936][25689] Avg episode reward: [(0, '-1.752')] [2022-07-11 04:59:52,850][26022] Updated weights on worker 0-0, policy_version 1045309 (0.00090) [2022-07-11 04:59:54,871][26022] Updated weights on worker 0-0, policy_version 1045319 (0.00093) [2022-07-11 04:59:56,593][26022] Updated weights on worker 0-0, policy_version 1045329 (0.00065) [2022-07-11 04:59:57,037][25689] Fps is (10 sec: 5458.5, 60 sec: 5565.1, 300 sec: 5553.2). Total num frames: 1070417920. Throughput: 0: 5822.3. Samples: 1070420458. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 04:59:57,037][25689] Avg episode reward: [(0, '-1.873')] [2022-07-11 04:59:58,264][26022] Updated weights on worker 0-0, policy_version 1045339 (0.00090) [2022-07-11 05:00:00,459][26022] Updated weights on worker 0-0, policy_version 1045349 (0.00082) [2022-07-11 05:00:02,090][25689] Fps is (10 sec: 5445.7, 60 sec: 5527.8, 300 sec: 5557.8). Total num frames: 1070445568. Throughput: 0: 5791.4. Samples: 1070453108. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 05:00:02,092][25689] Avg episode reward: [(0, '-0.979')] [2022-07-11 05:00:02,449][26022] Updated weights on worker 0-0, policy_version 1045359 (0.00083) [2022-07-11 05:00:04,252][26022] Updated weights on worker 0-0, policy_version 1045369 (0.00093) [2022-07-11 05:00:06,168][26022] Updated weights on worker 0-0, policy_version 1045379 (0.00088) [2022-07-11 05:00:07,147][25689] Fps is (10 sec: 5469.4, 60 sec: 5580.7, 300 sec: 5556.8). Total num frames: 1070473216. Throughput: 0: 4917.5. Samples: 1070469002. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 05:00:07,147][25689] Avg episode reward: [(0, '-0.781')] [2022-07-11 05:00:07,832][26022] Updated weights on worker 0-0, policy_version 1045389 (0.00091) [2022-07-11 05:00:09,760][26022] Updated weights on worker 0-0, policy_version 1045399 (0.00084) [2022-07-11 05:00:11,549][26022] Updated weights on worker 0-0, policy_version 1045409 (0.00088) [2022-07-11 05:00:12,152][25689] Fps is (10 sec: 5597.0, 60 sec: 5547.1, 300 sec: 5554.8). Total num frames: 1070501888. Throughput: 0: 5741.2. Samples: 1070502556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 05:00:12,153][25689] Avg episode reward: [(0, '-0.804')] [2022-07-11 05:00:13,489][26022] Updated weights on worker 0-0, policy_version 1045419 (0.00082) [2022-07-11 05:00:15,171][26022] Updated weights on worker 0-0, policy_version 1045429 (0.00094) [2022-07-11 05:00:17,167][25689] Fps is (10 sec: 5518.5, 60 sec: 5564.3, 300 sec: 5551.8). Total num frames: 1070528512. Throughput: 0: 5751.0. Samples: 1070536226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 05:00:17,168][25689] Avg episode reward: [(0, '0.900')] [2022-07-11 05:00:17,289][26022] Updated weights on worker 0-0, policy_version 1045439 (0.00091) [2022-07-11 05:00:18,911][26022] Updated weights on worker 0-0, policy_version 1045449 (0.00085) [2022-07-11 05:00:20,825][26022] Updated weights on worker 0-0, policy_version 1045459 (0.00085) [2022-07-11 05:00:22,214][25689] Fps is (10 sec: 5597.3, 60 sec: 5578.6, 300 sec: 5556.7). Total num frames: 1070558208. Throughput: 0: 4959.4. Samples: 1070552914. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 05:00:22,215][25689] Avg episode reward: [(0, '1.032')] [2022-07-11 05:00:22,612][26022] Updated weights on worker 0-0, policy_version 1045469 (0.00078) [2022-07-11 05:00:24,427][26022] Updated weights on worker 0-0, policy_version 1045479 (0.00095) [2022-07-11 05:00:26,294][26022] Updated weights on worker 0-0, policy_version 1045489 (0.00081) [2022-07-11 05:00:27,271][25689] Fps is (10 sec: 5776.9, 60 sec: 5561.8, 300 sec: 5560.2). Total num frames: 1070586880. Throughput: 0: 5834.4. Samples: 1070586412. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 05:00:27,272][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 05:00:28,182][26022] Updated weights on worker 0-0, policy_version 1045499 (0.00090) [2022-07-11 05:00:29,891][26022] Updated weights on worker 0-0, policy_version 1045509 (0.00094) [2022-07-11 05:00:31,904][26022] Updated weights on worker 0-0, policy_version 1045519 (0.00085) [2022-07-11 05:00:32,287][25689] Fps is (10 sec: 5286.6, 60 sec: 5527.9, 300 sec: 5542.8). Total num frames: 1070611456. Throughput: 0: 5830.3. Samples: 1070619944. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:00:32,287][25689] Avg episode reward: [(0, '0.801')] [2022-07-11 05:00:33,463][26022] Updated weights on worker 0-0, policy_version 1045529 (0.00086) [2022-07-11 05:00:35,687][26022] Updated weights on worker 0-0, policy_version 1045539 (0.00090) [2022-07-11 05:00:37,007][26022] Updated weights on worker 0-0, policy_version 1045549 (0.00084) [2022-07-11 05:00:37,289][25689] Fps is (10 sec: 5519.2, 60 sec: 5566.2, 300 sec: 5549.8). Total num frames: 1070642176. Throughput: 0: 4996.2. Samples: 1070636762. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:00:37,290][25689] Avg episode reward: [(0, '-1.103')] [2022-07-11 05:00:39,222][26022] Updated weights on worker 0-0, policy_version 1045559 (0.00089) [2022-07-11 05:00:40,834][26022] Updated weights on worker 0-0, policy_version 1045569 (0.00086) [2022-07-11 05:00:42,303][25689] Fps is (10 sec: 5725.3, 60 sec: 5539.8, 300 sec: 5550.9). Total num frames: 1070668800. Throughput: 0: 5847.0. Samples: 1070670370. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:00:42,303][25689] Avg episode reward: [(0, '-1.215')] [2022-07-11 05:00:42,929][26022] Updated weights on worker 0-0, policy_version 1045579 (0.00095) [2022-07-11 05:00:44,501][26022] Updated weights on worker 0-0, policy_version 1045589 (0.00091) [2022-07-11 05:00:46,307][26022] Updated weights on worker 0-0, policy_version 1045599 (0.00093) [2022-07-11 05:00:47,355][25689] Fps is (10 sec: 5493.6, 60 sec: 5539.4, 300 sec: 5550.2). Total num frames: 1070697472. Throughput: 0: 5873.4. Samples: 1070704376. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:00:47,355][25689] Avg episode reward: [(0, '-1.037')] [2022-07-11 05:00:48,159][26022] Updated weights on worker 0-0, policy_version 1045609 (0.00088) [2022-07-11 05:00:49,965][26022] Updated weights on worker 0-0, policy_version 1045619 (0.00085) [2022-07-11 05:00:52,015][26022] Updated weights on worker 0-0, policy_version 1045629 (0.00085) [2022-07-11 05:00:52,395][25689] Fps is (10 sec: 5783.5, 60 sec: 5572.3, 300 sec: 5553.8). Total num frames: 1070727168. Throughput: 0: 5018.2. Samples: 1070720850. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:00:52,395][25689] Avg episode reward: [(0, '-1.691')] [2022-07-11 05:00:53,947][26022] Updated weights on worker 0-0, policy_version 1045639 (0.00085) [2022-07-11 05:00:55,438][26022] Updated weights on worker 0-0, policy_version 1045649 (0.00108) [2022-07-11 05:00:57,400][25689] Fps is (10 sec: 5606.8, 60 sec: 5564.2, 300 sec: 5550.7). Total num frames: 1070753792. Throughput: 0: 5856.5. Samples: 1070754540. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:00:57,402][25689] Avg episode reward: [(0, '-1.888')] [2022-07-11 05:00:57,479][26022] Updated weights on worker 0-0, policy_version 1045659 (0.00085) [2022-07-11 05:00:59,125][26022] Updated weights on worker 0-0, policy_version 1045669 (0.00085) [2022-07-11 05:01:01,089][26022] Updated weights on worker 0-0, policy_version 1045679 (0.00090) [2022-07-11 05:01:02,439][25689] Fps is (10 sec: 5403.4, 60 sec: 5565.5, 300 sec: 5554.3). Total num frames: 1070781440. Throughput: 0: 5835.6. Samples: 1070787876. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:02,441][25689] Avg episode reward: [(0, '-1.378')] [2022-07-11 05:01:03,440][26022] Updated weights on worker 0-0, policy_version 1045689 (0.00093) [2022-07-11 05:01:05,042][26022] Updated weights on worker 0-0, policy_version 1045699 (0.00093) [2022-07-11 05:01:07,131][26022] Updated weights on worker 0-0, policy_version 1045709 (0.00084) [2022-07-11 05:01:07,500][25689] Fps is (10 sec: 5373.5, 60 sec: 5548.1, 300 sec: 5553.9). Total num frames: 1070808064. Throughput: 0: 4862.3. Samples: 1070802326. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:07,502][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 05:01:08,834][26022] Updated weights on worker 0-0, policy_version 1045719 (0.00083) [2022-07-11 05:01:10,646][26022] Updated weights on worker 0-0, policy_version 1045729 (0.00086) [2022-07-11 05:01:12,536][25689] Fps is (10 sec: 5476.4, 60 sec: 5545.4, 300 sec: 5554.5). Total num frames: 1070836736. Throughput: 0: 5708.5. Samples: 1070835826. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:12,536][25689] Avg episode reward: [(0, '0.588')] [2022-07-11 05:01:12,546][26022] Updated weights on worker 0-0, policy_version 1045739 (0.00088) [2022-07-11 05:01:14,431][26022] Updated weights on worker 0-0, policy_version 1045749 (0.00090) [2022-07-11 05:01:16,331][26022] Updated weights on worker 0-0, policy_version 1045759 (0.00087) [2022-07-11 05:01:17,555][25689] Fps is (10 sec: 5499.4, 60 sec: 5544.9, 300 sec: 5551.7). Total num frames: 1070863360. Throughput: 0: 5700.0. Samples: 1070869424. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:17,556][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 05:01:18,148][26022] Updated weights on worker 0-0, policy_version 1045769 (0.00089) [2022-07-11 05:01:19,788][26022] Updated weights on worker 0-0, policy_version 1045779 (0.00089) [2022-07-11 05:01:21,808][26022] Updated weights on worker 0-0, policy_version 1045789 (0.00090) [2022-07-11 05:01:22,583][25689] Fps is (10 sec: 5605.5, 60 sec: 5546.7, 300 sec: 5552.2). Total num frames: 1070893056. Throughput: 0: 4875.6. Samples: 1070886092. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:22,584][25689] Avg episode reward: [(0, '0.972')] [2022-07-11 05:01:23,646][26022] Updated weights on worker 0-0, policy_version 1045799 (0.00095) [2022-07-11 05:01:25,373][26022] Updated weights on worker 0-0, policy_version 1045809 (0.00091) [2022-07-11 05:01:27,412][26022] Updated weights on worker 0-0, policy_version 1045819 (0.00087) [2022-07-11 05:01:27,728][25689] Fps is (10 sec: 5637.1, 60 sec: 5521.7, 300 sec: 5553.6). Total num frames: 1070920704. Throughput: 0: 5800.1. Samples: 1070919648. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:27,728][25689] Avg episode reward: [(0, '0.839')] [2022-07-11 05:01:28,970][26022] Updated weights on worker 0-0, policy_version 1045829 (0.00083) [2022-07-11 05:01:31,063][26022] Updated weights on worker 0-0, policy_version 1045839 (0.00091) [2022-07-11 05:01:32,813][25689] Fps is (10 sec: 5405.5, 60 sec: 5566.1, 300 sec: 5552.3). Total num frames: 1070948352. Throughput: 0: 5780.7. Samples: 1070953040. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:32,813][25689] Avg episode reward: [(0, '1.123')] [2022-07-11 05:01:32,820][26022] Updated weights on worker 0-0, policy_version 1045849 (0.00091) [2022-07-11 05:01:34,477][26022] Updated weights on worker 0-0, policy_version 1045859 (0.00086) [2022-07-11 05:01:35,990][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:01:36,006][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001045866_1070966784.pth [2022-07-11 05:01:36,007][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001043912_1068965888.pth [2022-07-11 05:01:36,436][26022] Updated weights on worker 0-0, policy_version 1045869 (0.00097) [2022-07-11 05:01:37,828][25689] Fps is (10 sec: 5778.7, 60 sec: 5565.0, 300 sec: 5556.1). Total num frames: 1070979072. Throughput: 0: 5759.0. Samples: 1070986178. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:37,830][25689] Avg episode reward: [(0, '0.198')] [2022-07-11 05:01:38,197][26022] Updated weights on worker 0-0, policy_version 1045879 (0.00088) [2022-07-11 05:01:40,130][26022] Updated weights on worker 0-0, policy_version 1045889 (0.00083) [2022-07-11 05:01:42,044][26022] Updated weights on worker 0-0, policy_version 1045899 (0.00089) [2022-07-11 05:01:42,833][25689] Fps is (10 sec: 5620.7, 60 sec: 5548.8, 300 sec: 5554.2). Total num frames: 1071004672. Throughput: 0: 5769.7. Samples: 1071002926. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:42,835][25689] Avg episode reward: [(0, '0.912')] [2022-07-11 05:01:43,635][26022] Updated weights on worker 0-0, policy_version 1045909 (0.00084) [2022-07-11 05:01:45,834][26022] Updated weights on worker 0-0, policy_version 1045919 (0.00090) [2022-07-11 05:01:47,353][26022] Updated weights on worker 0-0, policy_version 1045929 (0.00085) [2022-07-11 05:01:47,880][25689] Fps is (10 sec: 5297.1, 60 sec: 5532.4, 300 sec: 5550.5). Total num frames: 1071032320. Throughput: 0: 5784.3. Samples: 1071036218. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:47,881][25689] Avg episode reward: [(0, '0.866')] [2022-07-11 05:01:49,371][26022] Updated weights on worker 0-0, policy_version 1045939 (0.00087) [2022-07-11 05:01:51,291][26022] Updated weights on worker 0-0, policy_version 1045949 (0.00090) [2022-07-11 05:01:52,920][25689] Fps is (10 sec: 5685.1, 60 sec: 5532.4, 300 sec: 5554.4). Total num frames: 1071062016. Throughput: 0: 5795.9. Samples: 1071069578. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:52,920][25689] Avg episode reward: [(0, '1.154')] [2022-07-11 05:01:52,923][26022] Updated weights on worker 0-0, policy_version 1045959 (0.00089) [2022-07-11 05:01:55,032][26022] Updated weights on worker 0-0, policy_version 1045969 (0.00082) [2022-07-11 05:01:56,783][26022] Updated weights on worker 0-0, policy_version 1045979 (0.00083) [2022-07-11 05:01:57,945][25689] Fps is (10 sec: 5596.1, 60 sec: 5530.6, 300 sec: 5550.7). Total num frames: 1071088640. Throughput: 0: 4972.5. Samples: 1071086208. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:01:57,945][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 05:01:58,431][26022] Updated weights on worker 0-0, policy_version 1045989 (0.00097) [2022-07-11 05:02:00,435][26022] Updated weights on worker 0-0, policy_version 1045999 (0.00087) [2022-07-11 05:02:02,448][26022] Updated weights on worker 0-0, policy_version 1046009 (0.00093) [2022-07-11 05:02:03,015][25689] Fps is (10 sec: 5173.2, 60 sec: 5493.9, 300 sec: 5546.9). Total num frames: 1071114240. Throughput: 0: 5796.5. Samples: 1071119910. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:03,015][25689] Avg episode reward: [(0, '2.115')] [2022-07-11 05:02:04,443][26022] Updated weights on worker 0-0, policy_version 1046019 (0.00069) [2022-07-11 05:02:06,316][26022] Updated weights on worker 0-0, policy_version 1046029 (0.00092) [2022-07-11 05:02:08,113][25689] Fps is (10 sec: 5437.9, 60 sec: 5541.2, 300 sec: 5548.7). Total num frames: 1071143936. Throughput: 0: 5689.9. Samples: 1071151342. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:08,114][25689] Avg episode reward: [(0, '2.254')] [2022-07-11 05:02:08,127][26022] Updated weights on worker 0-0, policy_version 1046039 (0.01224) [2022-07-11 05:02:10,231][26022] Updated weights on worker 0-0, policy_version 1046049 (0.00084) [2022-07-11 05:02:11,876][26022] Updated weights on worker 0-0, policy_version 1046059 (0.00084) [2022-07-11 05:02:13,120][25689] Fps is (10 sec: 5674.5, 60 sec: 5527.0, 300 sec: 5553.5). Total num frames: 1071171584. Throughput: 0: 4865.8. Samples: 1071167870. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:13,121][25689] Avg episode reward: [(0, '1.937')] [2022-07-11 05:02:13,723][26022] Updated weights on worker 0-0, policy_version 1046069 (0.00083) [2022-07-11 05:02:15,671][26022] Updated weights on worker 0-0, policy_version 1046079 (0.00084) [2022-07-11 05:02:17,374][26022] Updated weights on worker 0-0, policy_version 1046089 (0.00082) [2022-07-11 05:02:18,127][25689] Fps is (10 sec: 5521.8, 60 sec: 5545.0, 300 sec: 5550.1). Total num frames: 1071199232. Throughput: 0: 5708.7. Samples: 1071201426. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:18,128][25689] Avg episode reward: [(0, '2.080')] [2022-07-11 05:02:19,153][26022] Updated weights on worker 0-0, policy_version 1046099 (0.00088) [2022-07-11 05:02:21,026][26022] Updated weights on worker 0-0, policy_version 1046109 (0.00088) [2022-07-11 05:02:22,913][26022] Updated weights on worker 0-0, policy_version 1046119 (0.00086) [2022-07-11 05:02:23,152][25689] Fps is (10 sec: 5511.9, 60 sec: 5511.4, 300 sec: 5548.5). Total num frames: 1071226880. Throughput: 0: 5715.9. Samples: 1071235014. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:23,153][25689] Avg episode reward: [(0, '1.951')] [2022-07-11 05:02:24,844][26022] Updated weights on worker 0-0, policy_version 1046129 (0.00085) [2022-07-11 05:02:26,533][26022] Updated weights on worker 0-0, policy_version 1046139 (0.00086) [2022-07-11 05:02:28,248][25689] Fps is (10 sec: 5666.0, 60 sec: 5549.7, 300 sec: 5554.3). Total num frames: 1071256576. Throughput: 0: 4983.2. Samples: 1071251678. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:28,249][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 05:02:28,253][26022] Updated weights on worker 0-0, policy_version 1046149 (0.00083) [2022-07-11 05:02:30,391][26022] Updated weights on worker 0-0, policy_version 1046159 (0.00089) [2022-07-11 05:02:31,893][26022] Updated weights on worker 0-0, policy_version 1046169 (0.00082) [2022-07-11 05:02:33,265][25689] Fps is (10 sec: 5468.1, 60 sec: 5522.1, 300 sec: 5547.4). Total num frames: 1071282176. Throughput: 0: 5809.8. Samples: 1071284906. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:33,265][25689] Avg episode reward: [(0, '1.540')] [2022-07-11 05:02:34,261][26022] Updated weights on worker 0-0, policy_version 1046179 (0.00090) [2022-07-11 05:02:35,647][26022] Updated weights on worker 0-0, policy_version 1046189 (0.00095) [2022-07-11 05:02:37,817][26022] Updated weights on worker 0-0, policy_version 1046199 (0.00090) [2022-07-11 05:02:38,272][25689] Fps is (10 sec: 5414.3, 60 sec: 5489.0, 300 sec: 5547.4). Total num frames: 1071310848. Throughput: 0: 5793.0. Samples: 1071318124. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:38,274][25689] Avg episode reward: [(0, '1.896')] [2022-07-11 05:02:39,517][26022] Updated weights on worker 0-0, policy_version 1046209 (0.00082) [2022-07-11 05:02:41,496][26022] Updated weights on worker 0-0, policy_version 1046219 (0.00086) [2022-07-11 05:02:43,115][26022] Updated weights on worker 0-0, policy_version 1046229 (0.00088) [2022-07-11 05:02:43,274][25689] Fps is (10 sec: 5626.6, 60 sec: 5523.1, 300 sec: 5549.7). Total num frames: 1071338496. Throughput: 0: 4951.6. Samples: 1071334650. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:43,275][25689] Avg episode reward: [(0, '2.168')] [2022-07-11 05:02:44,902][26022] Updated weights on worker 0-0, policy_version 1046239 (0.00094) [2022-07-11 05:02:46,795][26022] Updated weights on worker 0-0, policy_version 1046249 (0.00086) [2022-07-11 05:02:48,352][25689] Fps is (10 sec: 5485.7, 60 sec: 5520.3, 300 sec: 5541.6). Total num frames: 1071366144. Throughput: 0: 5811.3. Samples: 1071368508. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:48,353][25689] Avg episode reward: [(0, '1.920')] [2022-07-11 05:02:48,772][26022] Updated weights on worker 0-0, policy_version 1046259 (0.00089) [2022-07-11 05:02:50,363][26022] Updated weights on worker 0-0, policy_version 1046269 (0.00091) [2022-07-11 05:02:52,563][26022] Updated weights on worker 0-0, policy_version 1046279 (0.00085) [2022-07-11 05:02:53,417][25689] Fps is (10 sec: 5653.7, 60 sec: 5518.0, 300 sec: 5551.0). Total num frames: 1071395840. Throughput: 0: 5814.5. Samples: 1071402082. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:53,419][25689] Avg episode reward: [(0, '2.241')] [2022-07-11 05:02:54,056][26022] Updated weights on worker 0-0, policy_version 1046289 (0.00088) [2022-07-11 05:02:56,141][26022] Updated weights on worker 0-0, policy_version 1046299 (0.00090) [2022-07-11 05:02:57,741][26022] Updated weights on worker 0-0, policy_version 1046309 (0.00087) [2022-07-11 05:02:58,431][25689] Fps is (10 sec: 5588.1, 60 sec: 5519.0, 300 sec: 5537.4). Total num frames: 1071422464. Throughput: 0: 4996.0. Samples: 1071418836. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:02:58,433][25689] Avg episode reward: [(0, '1.804')] [2022-07-11 05:02:59,782][26022] Updated weights on worker 0-0, policy_version 1046319 (0.00090) [2022-07-11 05:03:01,719][26022] Updated weights on worker 0-0, policy_version 1046329 (0.00087) [2022-07-11 05:03:03,449][25689] Fps is (10 sec: 5205.8, 60 sec: 5523.7, 300 sec: 5546.1). Total num frames: 1071448064. Throughput: 0: 5779.1. Samples: 1071451242. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:03,450][25689] Avg episode reward: [(0, '1.799')] [2022-07-11 05:03:03,786][26022] Updated weights on worker 0-0, policy_version 1046339 (0.00086) [2022-07-11 05:03:05,612][26022] Updated weights on worker 0-0, policy_version 1046349 (0.00100) [2022-07-11 05:03:07,459][26022] Updated weights on worker 0-0, policy_version 1046359 (0.00087) [2022-07-11 05:03:08,552][25689] Fps is (10 sec: 5463.4, 60 sec: 5523.4, 300 sec: 5548.4). Total num frames: 1071477760. Throughput: 0: 5702.1. Samples: 1071483688. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:08,552][25689] Avg episode reward: [(0, '0.273')] [2022-07-11 05:03:09,220][26022] Updated weights on worker 0-0, policy_version 1046369 (0.00090) [2022-07-11 05:03:11,080][26022] Updated weights on worker 0-0, policy_version 1046379 (0.00093) [2022-07-11 05:03:13,035][26022] Updated weights on worker 0-0, policy_version 1046389 (0.00090) [2022-07-11 05:03:13,615][25689] Fps is (10 sec: 5539.9, 60 sec: 5501.3, 300 sec: 5537.4). Total num frames: 1071504384. Throughput: 0: 4858.7. Samples: 1071500218. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:13,616][25689] Avg episode reward: [(0, '-0.202')] [2022-07-11 05:03:14,855][26022] Updated weights on worker 0-0, policy_version 1046399 (0.00099) [2022-07-11 05:03:16,851][26022] Updated weights on worker 0-0, policy_version 1046409 (0.00087) [2022-07-11 05:03:18,440][26022] Updated weights on worker 0-0, policy_version 1046419 (0.00088) [2022-07-11 05:03:18,620][25689] Fps is (10 sec: 5492.1, 60 sec: 5518.4, 300 sec: 5544.3). Total num frames: 1071533056. Throughput: 0: 5687.1. Samples: 1071533656. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:18,621][25689] Avg episode reward: [(0, '-0.149')] [2022-07-11 05:03:20,371][26022] Updated weights on worker 0-0, policy_version 1046429 (0.00092) [2022-07-11 05:03:22,147][26022] Updated weights on worker 0-0, policy_version 1046439 (0.00090) [2022-07-11 05:03:23,656][25689] Fps is (10 sec: 5711.1, 60 sec: 5534.3, 300 sec: 5541.2). Total num frames: 1071561728. Throughput: 0: 5737.2. Samples: 1071567174. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:23,657][25689] Avg episode reward: [(0, '-0.356')] [2022-07-11 05:03:23,973][26022] Updated weights on worker 0-0, policy_version 1046449 (0.00083) [2022-07-11 05:03:25,943][26022] Updated weights on worker 0-0, policy_version 1046459 (0.00094) [2022-07-11 05:03:27,693][26022] Updated weights on worker 0-0, policy_version 1046469 (0.00616) [2022-07-11 05:03:28,714][25689] Fps is (10 sec: 5579.8, 60 sec: 5504.0, 300 sec: 5543.8). Total num frames: 1071589376. Throughput: 0: 4954.1. Samples: 1071583572. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:28,714][25689] Avg episode reward: [(0, '-0.330')] [2022-07-11 05:03:29,650][26022] Updated weights on worker 0-0, policy_version 1046479 (0.00086) [2022-07-11 05:03:31,318][26022] Updated weights on worker 0-0, policy_version 1046489 (0.00092) [2022-07-11 05:03:33,440][26022] Updated weights on worker 0-0, policy_version 1046499 (0.00091) [2022-07-11 05:03:33,751][25689] Fps is (10 sec: 5477.6, 60 sec: 5535.9, 300 sec: 5543.6). Total num frames: 1071617024. Throughput: 0: 5800.3. Samples: 1071617010. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:33,752][25689] Avg episode reward: [(0, '0.213')] [2022-07-11 05:03:35,040][26022] Updated weights on worker 0-0, policy_version 1046509 (0.00085) [2022-07-11 05:03:36,125][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:03:36,142][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001046515_1071631360.pth [2022-07-11 05:03:36,142][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001044563_1069632512.pth [2022-07-11 05:03:37,192][26022] Updated weights on worker 0-0, policy_version 1046519 (0.00085) [2022-07-11 05:03:38,698][26022] Updated weights on worker 0-0, policy_version 1046529 (0.00079) [2022-07-11 05:03:38,766][25689] Fps is (10 sec: 5704.7, 60 sec: 5552.2, 300 sec: 5543.4). Total num frames: 1071646720. Throughput: 0: 5803.6. Samples: 1071650572. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:38,766][25689] Avg episode reward: [(0, '1.017')] [2022-07-11 05:03:40,596][26022] Updated weights on worker 0-0, policy_version 1046539 (0.00084) [2022-07-11 05:03:42,249][26022] Updated weights on worker 0-0, policy_version 1046549 (0.00081) [2022-07-11 05:03:43,771][25689] Fps is (10 sec: 5621.0, 60 sec: 5535.0, 300 sec: 5540.9). Total num frames: 1071673344. Throughput: 0: 4983.5. Samples: 1071667414. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:43,771][25689] Avg episode reward: [(0, '0.132')] [2022-07-11 05:03:44,312][26022] Updated weights on worker 0-0, policy_version 1046559 (0.00086) [2022-07-11 05:03:45,926][26022] Updated weights on worker 0-0, policy_version 1046569 (0.00095) [2022-07-11 05:03:47,877][26022] Updated weights on worker 0-0, policy_version 1046579 (0.00089) [2022-07-11 05:03:48,808][25689] Fps is (10 sec: 5506.2, 60 sec: 5555.6, 300 sec: 5544.5). Total num frames: 1071702016. Throughput: 0: 5864.5. Samples: 1071701416. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:48,809][25689] Avg episode reward: [(0, '0.269')] [2022-07-11 05:03:49,702][26022] Updated weights on worker 0-0, policy_version 1046589 (0.00080) [2022-07-11 05:03:51,422][26022] Updated weights on worker 0-0, policy_version 1046599 (0.00084) [2022-07-11 05:03:53,546][26022] Updated weights on worker 0-0, policy_version 1046609 (0.00085) [2022-07-11 05:03:53,813][25689] Fps is (10 sec: 5506.4, 60 sec: 5510.4, 300 sec: 5541.7). Total num frames: 1071728640. Throughput: 0: 5870.5. Samples: 1071734782. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:53,813][25689] Avg episode reward: [(0, '-0.444')] [2022-07-11 05:03:55,077][26022] Updated weights on worker 0-0, policy_version 1046619 (0.00092) [2022-07-11 05:03:57,001][26022] Updated weights on worker 0-0, policy_version 1046629 (0.00084) [2022-07-11 05:03:58,838][25689] Fps is (10 sec: 5411.0, 60 sec: 5526.2, 300 sec: 5534.6). Total num frames: 1071756288. Throughput: 0: 5041.4. Samples: 1071751764. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:03:58,839][25689] Avg episode reward: [(0, '-0.068')] [2022-07-11 05:03:59,005][26022] Updated weights on worker 0-0, policy_version 1046639 (0.00085) [2022-07-11 05:04:00,603][26022] Updated weights on worker 0-0, policy_version 1046649 (0.00083) [2022-07-11 05:04:02,959][26022] Updated weights on worker 0-0, policy_version 1046659 (0.00092) [2022-07-11 05:04:03,852][25689] Fps is (10 sec: 5406.1, 60 sec: 5543.6, 300 sec: 5542.7). Total num frames: 1071782912. Throughput: 0: 5787.1. Samples: 1071783626. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:04:03,854][25689] Avg episode reward: [(0, '0.043')] [2022-07-11 05:04:04,779][26022] Updated weights on worker 0-0, policy_version 1046669 (0.00085) [2022-07-11 05:04:06,355][26022] Updated weights on worker 0-0, policy_version 1046679 (0.00087) [2022-07-11 05:04:08,558][26022] Updated weights on worker 0-0, policy_version 1046689 (0.00084) [2022-07-11 05:04:08,913][25689] Fps is (10 sec: 5488.3, 60 sec: 5530.4, 300 sec: 5534.8). Total num frames: 1071811584. Throughput: 0: 5767.8. Samples: 1071817378. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:04:08,914][25689] Avg episode reward: [(0, '0.818')] [2022-07-11 05:04:10,127][26022] Updated weights on worker 0-0, policy_version 1046699 (0.00089) [2022-07-11 05:04:12,051][26022] Updated weights on worker 0-0, policy_version 1046709 (0.00086) [2022-07-11 05:04:13,808][26022] Updated weights on worker 0-0, policy_version 1046719 (0.00090) [2022-07-11 05:04:13,919][25689] Fps is (10 sec: 5696.3, 60 sec: 5569.7, 300 sec: 5545.4). Total num frames: 1071840256. Throughput: 0: 4941.3. Samples: 1071834132. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:04:13,919][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 05:04:15,599][26022] Updated weights on worker 0-0, policy_version 1046729 (0.00085) [2022-07-11 05:04:17,583][26022] Updated weights on worker 0-0, policy_version 1046739 (0.00088) [2022-07-11 05:04:18,922][25689] Fps is (10 sec: 5627.1, 60 sec: 5552.9, 300 sec: 5542.3). Total num frames: 1071867904. Throughput: 0: 5787.5. Samples: 1071868000. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:04:18,923][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 05:04:19,392][26022] Updated weights on worker 0-0, policy_version 1046749 (0.00087) [2022-07-11 05:04:21,204][26022] Updated weights on worker 0-0, policy_version 1046759 (0.00089) [2022-07-11 05:04:22,875][26022] Updated weights on worker 0-0, policy_version 1046769 (0.00080) [2022-07-11 05:04:23,931][25689] Fps is (10 sec: 5625.2, 60 sec: 5555.4, 300 sec: 5539.7). Total num frames: 1071896576. Throughput: 0: 5888.1. Samples: 1071901854. Policy #0 lag: (min: 0.0, avg: 7.7, max: 19.0) [2022-07-11 05:04:23,932][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 05:04:24,739][26022] Updated weights on worker 0-0, policy_version 1046779 (0.00086) [2022-07-11 05:04:26,484][26022] Updated weights on worker 0-0, policy_version 1046789 (0.00097) [2022-07-11 05:04:28,586][26022] Updated weights on worker 0-0, policy_version 1046799 (0.00089) [2022-07-11 05:04:29,051][25689] Fps is (10 sec: 5560.2, 60 sec: 5549.6, 300 sec: 5541.2). Total num frames: 1071924224. Throughput: 0: 5018.1. Samples: 1071918432. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:29,052][25689] Avg episode reward: [(0, '1.012')] [2022-07-11 05:04:30,283][26022] Updated weights on worker 0-0, policy_version 1046809 (0.00394) [2022-07-11 05:04:32,216][26022] Updated weights on worker 0-0, policy_version 1046819 (0.00098) [2022-07-11 05:04:34,035][26022] Updated weights on worker 0-0, policy_version 1046829 (0.00086) [2022-07-11 05:04:34,134][25689] Fps is (10 sec: 5519.7, 60 sec: 5562.3, 300 sec: 5540.6). Total num frames: 1071952896. Throughput: 0: 5824.9. Samples: 1071951886. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:34,135][25689] Avg episode reward: [(0, '0.882')] [2022-07-11 05:04:35,988][26022] Updated weights on worker 0-0, policy_version 1046839 (0.00084) [2022-07-11 05:04:37,648][26022] Updated weights on worker 0-0, policy_version 1046849 (0.00095) [2022-07-11 05:04:39,137][25689] Fps is (10 sec: 5584.0, 60 sec: 5529.5, 300 sec: 5538.9). Total num frames: 1071980544. Throughput: 0: 5807.1. Samples: 1071985392. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:39,138][25689] Avg episode reward: [(0, '-0.057')] [2022-07-11 05:04:39,505][26022] Updated weights on worker 0-0, policy_version 1046859 (0.00082) [2022-07-11 05:04:41,283][26022] Updated weights on worker 0-0, policy_version 1046869 (0.00093) [2022-07-11 05:04:43,083][26022] Updated weights on worker 0-0, policy_version 1046879 (0.00096) [2022-07-11 05:04:44,162][25689] Fps is (10 sec: 5616.6, 60 sec: 5561.6, 300 sec: 5539.3). Total num frames: 1072009216. Throughput: 0: 4955.6. Samples: 1072002110. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:44,162][25689] Avg episode reward: [(0, '-0.380')] [2022-07-11 05:04:44,909][26022] Updated weights on worker 0-0, policy_version 1046889 (0.00084) [2022-07-11 05:04:46,851][26022] Updated weights on worker 0-0, policy_version 1046899 (0.00087) [2022-07-11 05:04:48,563][26022] Updated weights on worker 0-0, policy_version 1046909 (0.00089) [2022-07-11 05:04:49,291][25689] Fps is (10 sec: 5546.6, 60 sec: 5536.2, 300 sec: 5537.5). Total num frames: 1072036864. Throughput: 0: 5796.4. Samples: 1072035750. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:49,292][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 05:04:50,514][26022] Updated weights on worker 0-0, policy_version 1046919 (0.00086) [2022-07-11 05:04:52,373][26022] Updated weights on worker 0-0, policy_version 1046929 (0.00085) [2022-07-11 05:04:54,303][25689] Fps is (10 sec: 5553.4, 60 sec: 5569.4, 300 sec: 5542.6). Total num frames: 1072065536. Throughput: 0: 5824.9. Samples: 1072069368. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:54,304][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 05:04:54,307][26022] Updated weights on worker 0-0, policy_version 1046939 (0.00086) [2022-07-11 05:04:56,093][26022] Updated weights on worker 0-0, policy_version 1046949 (0.00087) [2022-07-11 05:04:57,853][26022] Updated weights on worker 0-0, policy_version 1046959 (0.00083) [2022-07-11 05:04:59,364][25689] Fps is (10 sec: 5591.0, 60 sec: 5566.1, 300 sec: 5542.4). Total num frames: 1072093184. Throughput: 0: 5795.7. Samples: 1072102624. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:04:59,365][25689] Avg episode reward: [(0, '0.076')] [2022-07-11 05:04:59,743][26022] Updated weights on worker 0-0, policy_version 1046969 (0.00085) [2022-07-11 05:05:02,050][26022] Updated weights on worker 0-0, policy_version 1046979 (0.00088) [2022-07-11 05:05:03,629][26022] Updated weights on worker 0-0, policy_version 1046989 (0.00085) [2022-07-11 05:05:04,453][25689] Fps is (10 sec: 5448.3, 60 sec: 5576.2, 300 sec: 5541.8). Total num frames: 1072120832. Throughput: 0: 5663.8. Samples: 1072117034. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:04,455][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 05:05:05,686][26022] Updated weights on worker 0-0, policy_version 1046999 (0.00096) [2022-07-11 05:05:07,318][26022] Updated weights on worker 0-0, policy_version 1047009 (0.00086) [2022-07-11 05:05:09,442][26022] Updated weights on worker 0-0, policy_version 1047019 (0.00089) [2022-07-11 05:05:09,531][25689] Fps is (10 sec: 5338.3, 60 sec: 5540.8, 300 sec: 5533.6). Total num frames: 1072147456. Throughput: 0: 5670.5. Samples: 1072150520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:09,531][25689] Avg episode reward: [(0, '0.762')] [2022-07-11 05:05:11,072][26022] Updated weights on worker 0-0, policy_version 1047029 (0.00094) [2022-07-11 05:05:12,863][26022] Updated weights on worker 0-0, policy_version 1047039 (0.00092) [2022-07-11 05:05:14,575][25689] Fps is (10 sec: 5563.8, 60 sec: 5554.2, 300 sec: 5543.4). Total num frames: 1072177152. Throughput: 0: 5676.3. Samples: 1072184436. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:14,577][25689] Avg episode reward: [(0, '1.420')] [2022-07-11 05:05:14,950][26022] Updated weights on worker 0-0, policy_version 1047049 (0.00087) [2022-07-11 05:05:16,618][26022] Updated weights on worker 0-0, policy_version 1047059 (0.00087) [2022-07-11 05:05:18,565][26022] Updated weights on worker 0-0, policy_version 1047069 (0.00090) [2022-07-11 05:05:19,603][25689] Fps is (10 sec: 5693.2, 60 sec: 5551.9, 300 sec: 5536.8). Total num frames: 1072204800. Throughput: 0: 4883.1. Samples: 1072201452. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:19,605][25689] Avg episode reward: [(0, '0.349')] [2022-07-11 05:05:20,209][26022] Updated weights on worker 0-0, policy_version 1047079 (0.00092) [2022-07-11 05:05:22,148][26022] Updated weights on worker 0-0, policy_version 1047089 (0.00085) [2022-07-11 05:05:23,797][26022] Updated weights on worker 0-0, policy_version 1047099 (0.00086) [2022-07-11 05:05:24,609][25689] Fps is (10 sec: 5613.0, 60 sec: 5552.2, 300 sec: 5537.8). Total num frames: 1072233472. Throughput: 0: 5865.2. Samples: 1072235256. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:24,610][25689] Avg episode reward: [(0, '0.214')] [2022-07-11 05:05:25,688][26022] Updated weights on worker 0-0, policy_version 1047109 (0.00089) [2022-07-11 05:05:27,374][26022] Updated weights on worker 0-0, policy_version 1047119 (0.00090) [2022-07-11 05:05:29,407][26022] Updated weights on worker 0-0, policy_version 1047129 (0.00832) [2022-07-11 05:05:29,737][25689] Fps is (10 sec: 5557.9, 60 sec: 5551.5, 300 sec: 5546.0). Total num frames: 1072261120. Throughput: 0: 5846.3. Samples: 1072268650. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:29,737][25689] Avg episode reward: [(0, '-0.261')] [2022-07-11 05:05:31,131][26022] Updated weights on worker 0-0, policy_version 1047139 (0.00086) [2022-07-11 05:05:32,944][26022] Updated weights on worker 0-0, policy_version 1047149 (0.00091) [2022-07-11 05:05:34,798][25689] Fps is (10 sec: 5527.4, 60 sec: 5553.5, 300 sec: 5538.0). Total num frames: 1072289792. Throughput: 0: 4996.7. Samples: 1072285484. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:34,799][25689] Avg episode reward: [(0, '-0.673')] [2022-07-11 05:05:34,823][26022] Updated weights on worker 0-0, policy_version 1047159 (0.00086) [2022-07-11 05:05:36,262][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:05:36,283][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001047167_1072299008.pth [2022-07-11 05:05:36,284][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001045216_1070301184.pth [2022-07-11 05:05:36,730][26022] Updated weights on worker 0-0, policy_version 1047169 (0.00090) [2022-07-11 05:05:38,504][26022] Updated weights on worker 0-0, policy_version 1047179 (0.00105) [2022-07-11 05:05:39,826][25689] Fps is (10 sec: 5582.0, 60 sec: 5551.2, 300 sec: 5541.2). Total num frames: 1072317440. Throughput: 0: 5808.4. Samples: 1072318916. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:39,827][25689] Avg episode reward: [(0, '-0.657')] [2022-07-11 05:05:40,473][26022] Updated weights on worker 0-0, policy_version 1047189 (0.00083) [2022-07-11 05:05:42,138][26022] Updated weights on worker 0-0, policy_version 1047199 (0.00090) [2022-07-11 05:05:44,012][26022] Updated weights on worker 0-0, policy_version 1047209 (0.00085) [2022-07-11 05:05:44,862][25689] Fps is (10 sec: 5596.4, 60 sec: 5550.2, 300 sec: 5541.5). Total num frames: 1072346112. Throughput: 0: 5812.4. Samples: 1072352974. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:44,862][25689] Avg episode reward: [(0, '-0.710')] [2022-07-11 05:05:45,857][26022] Updated weights on worker 0-0, policy_version 1047219 (0.00087) [2022-07-11 05:05:47,723][26022] Updated weights on worker 0-0, policy_version 1047229 (0.00087) [2022-07-11 05:05:49,499][26022] Updated weights on worker 0-0, policy_version 1047239 (0.00084) [2022-07-11 05:05:49,952][25689] Fps is (10 sec: 5764.3, 60 sec: 5587.5, 300 sec: 5540.6). Total num frames: 1072375808. Throughput: 0: 4999.8. Samples: 1072369724. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:49,953][25689] Avg episode reward: [(0, '0.109')] [2022-07-11 05:05:51,262][26022] Updated weights on worker 0-0, policy_version 1047249 (0.00089) [2022-07-11 05:05:53,242][26022] Updated weights on worker 0-0, policy_version 1047259 (0.00092) [2022-07-11 05:05:55,014][25689] Fps is (10 sec: 5648.6, 60 sec: 5566.1, 300 sec: 5543.0). Total num frames: 1072403456. Throughput: 0: 5852.0. Samples: 1072403786. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:05:55,014][25689] Avg episode reward: [(0, '-0.658')] [2022-07-11 05:05:55,019][26022] Updated weights on worker 0-0, policy_version 1047269 (0.00091) [2022-07-11 05:05:56,803][26022] Updated weights on worker 0-0, policy_version 1047279 (0.00088) [2022-07-11 05:05:58,739][26022] Updated weights on worker 0-0, policy_version 1047289 (0.00085) [2022-07-11 05:06:00,048][25689] Fps is (10 sec: 5477.3, 60 sec: 5568.6, 300 sec: 5543.1). Total num frames: 1072431104. Throughput: 0: 5828.6. Samples: 1072436778. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:00,051][25689] Avg episode reward: [(0, '-1.505')] [2022-07-11 05:06:00,490][26022] Updated weights on worker 0-0, policy_version 1047299 (0.00092) [2022-07-11 05:06:02,847][26022] Updated weights on worker 0-0, policy_version 1047309 (0.00096) [2022-07-11 05:06:04,489][26022] Updated weights on worker 0-0, policy_version 1047319 (0.00570) [2022-07-11 05:06:05,062][25689] Fps is (10 sec: 5299.2, 60 sec: 5541.6, 300 sec: 5540.5). Total num frames: 1072456704. Throughput: 0: 4915.5. Samples: 1072452268. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:05,063][25689] Avg episode reward: [(0, '-1.328')] [2022-07-11 05:06:06,383][26022] Updated weights on worker 0-0, policy_version 1047329 (0.00084) [2022-07-11 05:06:08,497][26022] Updated weights on worker 0-0, policy_version 1047339 (0.00096) [2022-07-11 05:06:10,026][26022] Updated weights on worker 0-0, policy_version 1047349 (0.00093) [2022-07-11 05:06:10,161][25689] Fps is (10 sec: 5366.1, 60 sec: 5573.4, 300 sec: 5539.3). Total num frames: 1072485376. Throughput: 0: 5699.3. Samples: 1072484904. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:10,162][25689] Avg episode reward: [(0, '-1.234')] [2022-07-11 05:06:12,100][26022] Updated weights on worker 0-0, policy_version 1047359 (0.00092) [2022-07-11 05:06:13,634][26022] Updated weights on worker 0-0, policy_version 1047369 (0.00086) [2022-07-11 05:06:15,186][25689] Fps is (10 sec: 5563.1, 60 sec: 5541.4, 300 sec: 5542.6). Total num frames: 1072513024. Throughput: 0: 5677.1. Samples: 1072518306. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:15,187][25689] Avg episode reward: [(0, '-0.987')] [2022-07-11 05:06:15,747][26022] Updated weights on worker 0-0, policy_version 1047379 (0.00097) [2022-07-11 05:06:17,631][26022] Updated weights on worker 0-0, policy_version 1047389 (0.00092) [2022-07-11 05:06:19,291][26022] Updated weights on worker 0-0, policy_version 1047399 (0.00086) [2022-07-11 05:06:20,190][25689] Fps is (10 sec: 5514.0, 60 sec: 5543.7, 300 sec: 5536.2). Total num frames: 1072540672. Throughput: 0: 4869.0. Samples: 1072534850. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:20,192][25689] Avg episode reward: [(0, '-0.991')] [2022-07-11 05:06:21,234][26022] Updated weights on worker 0-0, policy_version 1047409 (0.00519) [2022-07-11 05:06:23,015][26022] Updated weights on worker 0-0, policy_version 1047419 (0.00078) [2022-07-11 05:06:24,917][26022] Updated weights on worker 0-0, policy_version 1047429 (0.00097) [2022-07-11 05:06:25,224][25689] Fps is (10 sec: 5508.8, 60 sec: 5524.2, 300 sec: 5538.3). Total num frames: 1072568320. Throughput: 0: 5763.3. Samples: 1072568464. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:25,224][25689] Avg episode reward: [(0, '-0.486')] [2022-07-11 05:06:26,686][26022] Updated weights on worker 0-0, policy_version 1047439 (0.00384) [2022-07-11 05:06:28,855][26022] Updated weights on worker 0-0, policy_version 1047449 (0.00093) [2022-07-11 05:06:30,139][26022] Updated weights on worker 0-0, policy_version 1047459 (0.00087) [2022-07-11 05:06:30,299][25689] Fps is (10 sec: 5672.4, 60 sec: 5562.8, 300 sec: 5545.3). Total num frames: 1072598016. Throughput: 0: 5803.1. Samples: 1072601764. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:30,300][25689] Avg episode reward: [(0, '1.918')] [2022-07-11 05:06:32,519][26022] Updated weights on worker 0-0, policy_version 1047469 (0.00093) [2022-07-11 05:06:33,905][26022] Updated weights on worker 0-0, policy_version 1047479 (0.00085) [2022-07-11 05:06:35,318][25689] Fps is (10 sec: 5478.0, 60 sec: 5516.0, 300 sec: 5528.1). Total num frames: 1072623616. Throughput: 0: 4973.1. Samples: 1072618424. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:35,318][25689] Avg episode reward: [(0, '2.045')] [2022-07-11 05:06:36,053][26022] Updated weights on worker 0-0, policy_version 1047489 (0.00085) [2022-07-11 05:06:37,474][26022] Updated weights on worker 0-0, policy_version 1047499 (0.00095) [2022-07-11 05:06:39,554][26022] Updated weights on worker 0-0, policy_version 1047509 (0.00087) [2022-07-11 05:06:40,369][25689] Fps is (10 sec: 5592.9, 60 sec: 5564.6, 300 sec: 5544.4). Total num frames: 1072654336. Throughput: 0: 5820.4. Samples: 1072652300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:40,371][25689] Avg episode reward: [(0, '2.047')] [2022-07-11 05:06:41,156][26022] Updated weights on worker 0-0, policy_version 1047519 (0.00085) [2022-07-11 05:06:43,287][26022] Updated weights on worker 0-0, policy_version 1047529 (0.00082) [2022-07-11 05:06:45,020][26022] Updated weights on worker 0-0, policy_version 1047539 (0.00085) [2022-07-11 05:06:45,453][25689] Fps is (10 sec: 5758.9, 60 sec: 5543.2, 300 sec: 5543.7). Total num frames: 1072681984. Throughput: 0: 5813.6. Samples: 1072686070. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:45,455][25689] Avg episode reward: [(0, '2.139')] [2022-07-11 05:06:46,888][26022] Updated weights on worker 0-0, policy_version 1047549 (0.00085) [2022-07-11 05:06:48,685][26022] Updated weights on worker 0-0, policy_version 1047559 (0.00080) [2022-07-11 05:06:50,451][26022] Updated weights on worker 0-0, policy_version 1047569 (0.00094) [2022-07-11 05:06:50,511][25689] Fps is (10 sec: 5553.0, 60 sec: 5529.3, 300 sec: 5539.9). Total num frames: 1072710656. Throughput: 0: 5830.0. Samples: 1072719602. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:50,512][25689] Avg episode reward: [(0, '2.133')] [2022-07-11 05:06:52,451][26022] Updated weights on worker 0-0, policy_version 1047579 (0.00086) [2022-07-11 05:06:54,169][26022] Updated weights on worker 0-0, policy_version 1047589 (0.00087) [2022-07-11 05:06:55,547][25689] Fps is (10 sec: 5478.3, 60 sec: 5514.8, 300 sec: 5539.7). Total num frames: 1072737280. Throughput: 0: 5837.7. Samples: 1072736514. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:06:55,547][25689] Avg episode reward: [(0, '2.288')] [2022-07-11 05:06:55,955][26022] Updated weights on worker 0-0, policy_version 1047599 (0.00086) [2022-07-11 05:06:57,923][26022] Updated weights on worker 0-0, policy_version 1047609 (0.00092) [2022-07-11 05:06:59,755][26022] Updated weights on worker 0-0, policy_version 1047619 (0.01132) [2022-07-11 05:07:00,550][25689] Fps is (10 sec: 5508.2, 60 sec: 5534.5, 300 sec: 5551.3). Total num frames: 1072765952. Throughput: 0: 5823.4. Samples: 1072769822. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:00,551][25689] Avg episode reward: [(0, '2.023')] [2022-07-11 05:07:01,636][26022] Updated weights on worker 0-0, policy_version 1047629 (0.00383) [2022-07-11 05:07:03,821][26022] Updated weights on worker 0-0, policy_version 1047639 (0.00085) [2022-07-11 05:07:05,411][26022] Updated weights on worker 0-0, policy_version 1047649 (0.00087) [2022-07-11 05:07:05,571][25689] Fps is (10 sec: 5516.3, 60 sec: 5550.8, 300 sec: 5542.4). Total num frames: 1072792576. Throughput: 0: 5731.4. Samples: 1072801372. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:05,571][25689] Avg episode reward: [(0, '1.843')] [2022-07-11 05:07:07,670][26022] Updated weights on worker 0-0, policy_version 1047659 (0.00086) [2022-07-11 05:07:09,049][26022] Updated weights on worker 0-0, policy_version 1047669 (0.00090) [2022-07-11 05:07:10,625][25689] Fps is (10 sec: 5285.0, 60 sec: 5521.0, 300 sec: 5538.1). Total num frames: 1072819200. Throughput: 0: 4889.1. Samples: 1072817942. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:10,626][25689] Avg episode reward: [(0, '0.804')] [2022-07-11 05:07:11,357][26022] Updated weights on worker 0-0, policy_version 1047679 (0.00072) [2022-07-11 05:07:12,767][26022] Updated weights on worker 0-0, policy_version 1047689 (0.00083) [2022-07-11 05:07:14,888][26022] Updated weights on worker 0-0, policy_version 1047699 (0.00092) [2022-07-11 05:07:15,648][25689] Fps is (10 sec: 5487.4, 60 sec: 5538.2, 300 sec: 5541.3). Total num frames: 1072847872. Throughput: 0: 5696.3. Samples: 1072851014. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:15,648][25689] Avg episode reward: [(0, '0.104')] [2022-07-11 05:07:16,818][26022] Updated weights on worker 0-0, policy_version 1047709 (0.00086) [2022-07-11 05:07:18,607][26022] Updated weights on worker 0-0, policy_version 1047719 (0.00100) [2022-07-11 05:07:20,605][26022] Updated weights on worker 0-0, policy_version 1047729 (0.00091) [2022-07-11 05:07:20,671][25689] Fps is (10 sec: 5606.6, 60 sec: 5536.4, 300 sec: 5541.3). Total num frames: 1072875520. Throughput: 0: 5709.1. Samples: 1072884692. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:20,671][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 05:07:22,067][26022] Updated weights on worker 0-0, policy_version 1047739 (0.00101) [2022-07-11 05:07:24,003][26022] Updated weights on worker 0-0, policy_version 1047749 (0.00089) [2022-07-11 05:07:25,698][25689] Fps is (10 sec: 5603.8, 60 sec: 5554.0, 300 sec: 5539.1). Total num frames: 1072904192. Throughput: 0: 4969.2. Samples: 1072901388. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:25,698][25689] Avg episode reward: [(0, '0.254')] [2022-07-11 05:07:25,784][26022] Updated weights on worker 0-0, policy_version 1047759 (0.00094) [2022-07-11 05:07:27,833][26022] Updated weights on worker 0-0, policy_version 1047769 (0.00085) [2022-07-11 05:07:29,669][26022] Updated weights on worker 0-0, policy_version 1047779 (0.00098) [2022-07-11 05:07:30,750][25689] Fps is (10 sec: 5587.6, 60 sec: 5522.2, 300 sec: 5545.4). Total num frames: 1072931840. Throughput: 0: 5792.1. Samples: 1072934508. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:30,751][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 05:07:31,473][26022] Updated weights on worker 0-0, policy_version 1047789 (0.00086) [2022-07-11 05:07:33,335][26022] Updated weights on worker 0-0, policy_version 1047799 (0.00086) [2022-07-11 05:07:35,211][26022] Updated weights on worker 0-0, policy_version 1047809 (0.00091) [2022-07-11 05:07:35,757][25689] Fps is (10 sec: 5599.1, 60 sec: 5574.2, 300 sec: 5545.4). Total num frames: 1072960512. Throughput: 0: 5817.4. Samples: 1072967996. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:35,757][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 05:07:36,348][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:07:36,361][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001047816_1072963584.pth [2022-07-11 05:07:36,361][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001045866_1070966784.pth [2022-07-11 05:07:36,850][26022] Updated weights on worker 0-0, policy_version 1047819 (0.00057) [2022-07-11 05:07:38,777][26022] Updated weights on worker 0-0, policy_version 1047829 (0.00083) [2022-07-11 05:07:40,545][26022] Updated weights on worker 0-0, policy_version 1047839 (0.00087) [2022-07-11 05:07:40,832][25689] Fps is (10 sec: 5484.9, 60 sec: 5504.2, 300 sec: 5540.6). Total num frames: 1072987136. Throughput: 0: 4951.1. Samples: 1072984512. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:40,834][25689] Avg episode reward: [(0, '1.397')] [2022-07-11 05:07:42,574][26022] Updated weights on worker 0-0, policy_version 1047849 (0.00080) [2022-07-11 05:07:44,559][26022] Updated weights on worker 0-0, policy_version 1047859 (0.00090) [2022-07-11 05:07:45,892][25689] Fps is (10 sec: 5455.5, 60 sec: 5523.3, 300 sec: 5544.3). Total num frames: 1073015808. Throughput: 0: 5763.4. Samples: 1073017776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:45,893][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 05:07:46,259][26022] Updated weights on worker 0-0, policy_version 1047869 (0.00081) [2022-07-11 05:07:48,099][26022] Updated weights on worker 0-0, policy_version 1047879 (0.00087) [2022-07-11 05:07:50,143][26022] Updated weights on worker 0-0, policy_version 1047889 (0.00085) [2022-07-11 05:07:50,952][25689] Fps is (10 sec: 5666.2, 60 sec: 5523.2, 300 sec: 5541.0). Total num frames: 1073044480. Throughput: 0: 5773.5. Samples: 1073051144. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:50,953][25689] Avg episode reward: [(0, '0.898')] [2022-07-11 05:07:51,871][26022] Updated weights on worker 0-0, policy_version 1047899 (0.00094) [2022-07-11 05:07:53,842][26022] Updated weights on worker 0-0, policy_version 1047909 (0.00090) [2022-07-11 05:07:55,209][26022] Updated weights on worker 0-0, policy_version 1047919 (0.00085) [2022-07-11 05:07:55,968][25689] Fps is (10 sec: 5691.3, 60 sec: 5558.8, 300 sec: 5547.8). Total num frames: 1073073152. Throughput: 0: 4936.5. Samples: 1073067770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:07:55,970][25689] Avg episode reward: [(0, '0.013')] [2022-07-11 05:07:57,597][26022] Updated weights on worker 0-0, policy_version 1047929 (0.00089) [2022-07-11 05:07:58,835][26022] Updated weights on worker 0-0, policy_version 1047939 (0.00087) [2022-07-11 05:08:00,976][25689] Fps is (10 sec: 5312.0, 60 sec: 5490.6, 300 sec: 5544.6). Total num frames: 1073097728. Throughput: 0: 5787.2. Samples: 1073101092. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:08:00,977][25689] Avg episode reward: [(0, '-0.616')] [2022-07-11 05:08:01,283][26022] Updated weights on worker 0-0, policy_version 1047949 (0.00100) [2022-07-11 05:08:02,925][26022] Updated weights on worker 0-0, policy_version 1047959 (0.00115) [2022-07-11 05:08:05,295][26022] Updated weights on worker 0-0, policy_version 1047969 (0.00088) [2022-07-11 05:08:05,992][25689] Fps is (10 sec: 5312.1, 60 sec: 5524.9, 300 sec: 5542.8). Total num frames: 1073126400. Throughput: 0: 5704.5. Samples: 1073132434. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:08:05,993][25689] Avg episode reward: [(0, '-1.034')] [2022-07-11 05:08:06,657][26022] Updated weights on worker 0-0, policy_version 1047979 (0.00095) [2022-07-11 05:08:08,950][26022] Updated weights on worker 0-0, policy_version 1047989 (0.00065) [2022-07-11 05:08:10,587][26022] Updated weights on worker 0-0, policy_version 1047999 (0.00084) [2022-07-11 05:08:11,104][25689] Fps is (10 sec: 5459.7, 60 sec: 5519.7, 300 sec: 5541.9). Total num frames: 1073153024. Throughput: 0: 4853.7. Samples: 1073148956. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:08:11,105][25689] Avg episode reward: [(0, '-0.151')] [2022-07-11 05:08:12,561][26022] Updated weights on worker 0-0, policy_version 1048009 (0.00091) [2022-07-11 05:08:14,233][26022] Updated weights on worker 0-0, policy_version 1048019 (0.00096) [2022-07-11 05:08:16,125][25689] Fps is (10 sec: 5355.9, 60 sec: 5502.9, 300 sec: 5538.1). Total num frames: 1073180672. Throughput: 0: 5697.4. Samples: 1073182612. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:08:16,126][25689] Avg episode reward: [(0, '-0.159')] [2022-07-11 05:08:16,188][26022] Updated weights on worker 0-0, policy_version 1048029 (0.00091) [2022-07-11 05:08:17,872][26022] Updated weights on worker 0-0, policy_version 1048039 (0.00087) [2022-07-11 05:08:19,971][26022] Updated weights on worker 0-0, policy_version 1048049 (0.00095) [2022-07-11 05:08:21,131][25689] Fps is (10 sec: 5617.1, 60 sec: 5521.4, 300 sec: 5538.7). Total num frames: 1073209344. Throughput: 0: 5704.1. Samples: 1073216056. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:08:21,131][25689] Avg episode reward: [(0, '0.119')] [2022-07-11 05:08:21,488][26022] Updated weights on worker 0-0, policy_version 1048059 (0.00093) [2022-07-11 05:08:23,618][26022] Updated weights on worker 0-0, policy_version 1048069 (0.00084) [2022-07-11 05:08:25,203][26022] Updated weights on worker 0-0, policy_version 1048079 (0.00090) [2022-07-11 05:08:26,146][25689] Fps is (10 sec: 5620.1, 60 sec: 5505.5, 300 sec: 5539.5). Total num frames: 1073236992. Throughput: 0: 4987.0. Samples: 1073232942. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 05:08:26,147][25689] Avg episode reward: [(0, '0.055')] [2022-07-11 05:08:27,231][26022] Updated weights on worker 0-0, policy_version 1048089 (0.00091) [2022-07-11 05:08:29,263][26022] Updated weights on worker 0-0, policy_version 1048099 (0.00087) [2022-07-11 05:08:30,957][26022] Updated weights on worker 0-0, policy_version 1048109 (0.00086) [2022-07-11 05:08:31,213][25689] Fps is (10 sec: 5484.2, 60 sec: 5504.2, 300 sec: 5538.9). Total num frames: 1073264640. Throughput: 0: 5809.5. Samples: 1073265780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:08:31,214][25689] Avg episode reward: [(0, '1.852')] [2022-07-11 05:08:32,751][26022] Updated weights on worker 0-0, policy_version 1048119 (0.00094) [2022-07-11 05:08:34,608][26022] Updated weights on worker 0-0, policy_version 1048129 (0.00090) [2022-07-11 05:08:36,243][25689] Fps is (10 sec: 5476.5, 60 sec: 5485.1, 300 sec: 5531.7). Total num frames: 1073292288. Throughput: 0: 5801.1. Samples: 1073299318. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:08:36,244][25689] Avg episode reward: [(0, '2.165')] [2022-07-11 05:08:36,411][26022] Updated weights on worker 0-0, policy_version 1048139 (0.00087) [2022-07-11 05:08:38,298][26022] Updated weights on worker 0-0, policy_version 1048149 (0.00087) [2022-07-11 05:08:40,087][26022] Updated weights on worker 0-0, policy_version 1048159 (0.00086) [2022-07-11 05:08:41,251][25689] Fps is (10 sec: 5610.9, 60 sec: 5525.1, 300 sec: 5538.6). Total num frames: 1073320960. Throughput: 0: 4966.3. Samples: 1073315980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:08:41,252][25689] Avg episode reward: [(0, '1.991')] [2022-07-11 05:08:42,109][26022] Updated weights on worker 0-0, policy_version 1048169 (0.00092) [2022-07-11 05:08:43,879][26022] Updated weights on worker 0-0, policy_version 1048179 (0.00096) [2022-07-11 05:08:45,700][26022] Updated weights on worker 0-0, policy_version 1048189 (0.00093) [2022-07-11 05:08:46,263][25689] Fps is (10 sec: 5518.6, 60 sec: 5495.6, 300 sec: 5532.2). Total num frames: 1073347584. Throughput: 0: 5776.8. Samples: 1073349154. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:08:46,264][25689] Avg episode reward: [(0, '2.065')] [2022-07-11 05:08:47,646][26022] Updated weights on worker 0-0, policy_version 1048199 (0.00056) [2022-07-11 05:08:49,367][26022] Updated weights on worker 0-0, policy_version 1048209 (0.00084) [2022-07-11 05:08:51,291][26022] Updated weights on worker 0-0, policy_version 1048219 (0.00085) [2022-07-11 05:08:51,388][25689] Fps is (10 sec: 5454.9, 60 sec: 5489.7, 300 sec: 5536.8). Total num frames: 1073376256. Throughput: 0: 5776.1. Samples: 1073382310. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:08:51,389][25689] Avg episode reward: [(0, '1.705')] [2022-07-11 05:08:53,051][26022] Updated weights on worker 0-0, policy_version 1048229 (0.00085) [2022-07-11 05:08:55,050][26022] Updated weights on worker 0-0, policy_version 1048239 (0.00082) [2022-07-11 05:08:56,396][25689] Fps is (10 sec: 5659.1, 60 sec: 5490.4, 300 sec: 5540.6). Total num frames: 1073404928. Throughput: 0: 4934.4. Samples: 1073398762. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:08:56,397][25689] Avg episode reward: [(0, '1.593')] [2022-07-11 05:08:57,006][26022] Updated weights on worker 0-0, policy_version 1048249 (0.00083) [2022-07-11 05:08:58,713][26022] Updated weights on worker 0-0, policy_version 1048259 (0.00099) [2022-07-11 05:09:00,659][26022] Updated weights on worker 0-0, policy_version 1048269 (0.00084) [2022-07-11 05:09:01,433][25689] Fps is (10 sec: 5504.9, 60 sec: 5521.7, 300 sec: 5540.1). Total num frames: 1073431552. Throughput: 0: 5746.7. Samples: 1073431958. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:01,433][25689] Avg episode reward: [(0, '1.565')] [2022-07-11 05:09:02,623][26022] Updated weights on worker 0-0, policy_version 1048279 (0.00080) [2022-07-11 05:09:04,546][26022] Updated weights on worker 0-0, policy_version 1048289 (0.00085) [2022-07-11 05:09:06,426][26022] Updated weights on worker 0-0, policy_version 1048299 (0.00096) [2022-07-11 05:09:06,482][25689] Fps is (10 sec: 5279.4, 60 sec: 5484.8, 300 sec: 5533.5). Total num frames: 1073458176. Throughput: 0: 5657.7. Samples: 1073463546. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:06,483][25689] Avg episode reward: [(0, '1.776')] [2022-07-11 05:09:08,362][26022] Updated weights on worker 0-0, policy_version 1048309 (0.00096) [2022-07-11 05:09:09,929][26022] Updated weights on worker 0-0, policy_version 1048319 (0.00087) [2022-07-11 05:09:11,580][25689] Fps is (10 sec: 5348.3, 60 sec: 5503.0, 300 sec: 5528.3). Total num frames: 1073485824. Throughput: 0: 4855.5. Samples: 1073480354. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:11,581][25689] Avg episode reward: [(0, '1.501')] [2022-07-11 05:09:12,051][26022] Updated weights on worker 0-0, policy_version 1048329 (0.00084) [2022-07-11 05:09:13,661][26022] Updated weights on worker 0-0, policy_version 1048339 (0.00086) [2022-07-11 05:09:15,566][26022] Updated weights on worker 0-0, policy_version 1048349 (0.00084) [2022-07-11 05:09:16,599][25689] Fps is (10 sec: 5668.3, 60 sec: 5537.1, 300 sec: 5534.9). Total num frames: 1073515520. Throughput: 0: 5692.6. Samples: 1073513766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:16,599][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 05:09:17,496][26022] Updated weights on worker 0-0, policy_version 1048359 (0.00086) [2022-07-11 05:09:19,101][26022] Updated weights on worker 0-0, policy_version 1048369 (0.00093) [2022-07-11 05:09:21,231][26022] Updated weights on worker 0-0, policy_version 1048379 (0.00085) [2022-07-11 05:09:21,625][25689] Fps is (10 sec: 5708.8, 60 sec: 5518.2, 300 sec: 5531.1). Total num frames: 1073543168. Throughput: 0: 5721.9. Samples: 1073547498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:21,627][25689] Avg episode reward: [(0, '0.848')] [2022-07-11 05:09:22,648][26022] Updated weights on worker 0-0, policy_version 1048389 (0.00091) [2022-07-11 05:09:24,789][26022] Updated weights on worker 0-0, policy_version 1048399 (0.00091) [2022-07-11 05:09:26,340][26022] Updated weights on worker 0-0, policy_version 1048409 (0.00086) [2022-07-11 05:09:26,661][25689] Fps is (10 sec: 5597.1, 60 sec: 5533.3, 300 sec: 5536.2). Total num frames: 1073571840. Throughput: 0: 5003.2. Samples: 1073564506. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:26,662][25689] Avg episode reward: [(0, '0.233')] [2022-07-11 05:09:28,475][26022] Updated weights on worker 0-0, policy_version 1048419 (0.00092) [2022-07-11 05:09:30,263][26022] Updated weights on worker 0-0, policy_version 1048429 (0.00091) [2022-07-11 05:09:31,734][25689] Fps is (10 sec: 5571.4, 60 sec: 5532.8, 300 sec: 5532.9). Total num frames: 1073599488. Throughput: 0: 5830.5. Samples: 1073597860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:31,735][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 05:09:32,083][26022] Updated weights on worker 0-0, policy_version 1048439 (0.00087) [2022-07-11 05:09:33,971][26022] Updated weights on worker 0-0, policy_version 1048449 (0.00081) [2022-07-11 05:09:35,601][26022] Updated weights on worker 0-0, policy_version 1048459 (0.00090) [2022-07-11 05:09:36,533][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:09:36,543][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001048463_1073626112.pth [2022-07-11 05:09:36,544][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001046515_1071631360.pth [2022-07-11 05:09:36,806][25689] Fps is (10 sec: 5450.5, 60 sec: 5528.9, 300 sec: 5531.6). Total num frames: 1073627136. Throughput: 0: 5822.2. Samples: 1073631418. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:36,807][25689] Avg episode reward: [(0, '0.421')] [2022-07-11 05:09:37,615][26022] Updated weights on worker 0-0, policy_version 1048469 (0.00088) [2022-07-11 05:09:39,298][26022] Updated weights on worker 0-0, policy_version 1048479 (0.00092) [2022-07-11 05:09:41,226][26022] Updated weights on worker 0-0, policy_version 1048489 (0.00092) [2022-07-11 05:09:41,851][25689] Fps is (10 sec: 5567.2, 60 sec: 5525.6, 300 sec: 5531.3). Total num frames: 1073655808. Throughput: 0: 5816.3. Samples: 1073665134. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:41,851][25689] Avg episode reward: [(0, '0.940')] [2022-07-11 05:09:43,023][26022] Updated weights on worker 0-0, policy_version 1048499 (0.00092) [2022-07-11 05:09:44,928][26022] Updated weights on worker 0-0, policy_version 1048509 (0.00086) [2022-07-11 05:09:46,728][26022] Updated weights on worker 0-0, policy_version 1048519 (0.00089) [2022-07-11 05:09:46,855][25689] Fps is (10 sec: 5706.8, 60 sec: 5560.1, 300 sec: 5537.0). Total num frames: 1073684480. Throughput: 0: 5819.0. Samples: 1073682012. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:46,855][25689] Avg episode reward: [(0, '1.151')] [2022-07-11 05:09:48,490][26022] Updated weights on worker 0-0, policy_version 1048529 (0.00085) [2022-07-11 05:09:50,389][26022] Updated weights on worker 0-0, policy_version 1048539 (0.00090) [2022-07-11 05:09:51,967][25689] Fps is (10 sec: 5567.5, 60 sec: 5544.4, 300 sec: 5531.7). Total num frames: 1073712128. Throughput: 0: 5813.0. Samples: 1073715470. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:51,967][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 05:09:52,240][26022] Updated weights on worker 0-0, policy_version 1048549 (0.00086) [2022-07-11 05:09:54,027][26022] Updated weights on worker 0-0, policy_version 1048559 (0.00090) [2022-07-11 05:09:55,724][26022] Updated weights on worker 0-0, policy_version 1048569 (0.00090) [2022-07-11 05:09:56,995][25689] Fps is (10 sec: 5553.8, 60 sec: 5542.5, 300 sec: 5535.8). Total num frames: 1073740800. Throughput: 0: 5828.2. Samples: 1073749084. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:09:56,996][25689] Avg episode reward: [(0, '0.966')] [2022-07-11 05:09:57,747][26022] Updated weights on worker 0-0, policy_version 1048579 (0.00088) [2022-07-11 05:09:59,511][26022] Updated weights on worker 0-0, policy_version 1048589 (0.00093) [2022-07-11 05:10:01,716][26022] Updated weights on worker 0-0, policy_version 1048599 (0.00100) [2022-07-11 05:10:02,094][25689] Fps is (10 sec: 5258.0, 60 sec: 5503.1, 300 sec: 5525.3). Total num frames: 1073765376. Throughput: 0: 4957.8. Samples: 1073765494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:02,094][25689] Avg episode reward: [(0, '0.527')] [2022-07-11 05:10:03,401][26022] Updated weights on worker 0-0, policy_version 1048609 (0.00089) [2022-07-11 05:10:05,512][26022] Updated weights on worker 0-0, policy_version 1048619 (0.00079) [2022-07-11 05:10:07,156][25689] Fps is (10 sec: 5341.1, 60 sec: 5552.5, 300 sec: 5535.9). Total num frames: 1073795072. Throughput: 0: 5673.0. Samples: 1073797184. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:07,157][25689] Avg episode reward: [(0, '0.253')] [2022-07-11 05:10:07,376][26022] Updated weights on worker 0-0, policy_version 1048629 (0.00089) [2022-07-11 05:10:09,148][26022] Updated weights on worker 0-0, policy_version 1048639 (0.00091) [2022-07-11 05:10:11,055][26022] Updated weights on worker 0-0, policy_version 1048649 (0.00088) [2022-07-11 05:10:12,257][25689] Fps is (10 sec: 5642.1, 60 sec: 5552.3, 300 sec: 5527.9). Total num frames: 1073822720. Throughput: 0: 5658.6. Samples: 1073830286. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:12,258][25689] Avg episode reward: [(0, '0.556')] [2022-07-11 05:10:12,751][26022] Updated weights on worker 0-0, policy_version 1048659 (0.00083) [2022-07-11 05:10:14,865][26022] Updated weights on worker 0-0, policy_version 1048669 (0.00083) [2022-07-11 05:10:16,583][26022] Updated weights on worker 0-0, policy_version 1048679 (0.00082) [2022-07-11 05:10:17,267][25689] Fps is (10 sec: 5570.5, 60 sec: 5536.2, 300 sec: 5531.7). Total num frames: 1073851392. Throughput: 0: 4826.1. Samples: 1073846918. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:17,267][25689] Avg episode reward: [(0, '-0.032')] [2022-07-11 05:10:18,346][26022] Updated weights on worker 0-0, policy_version 1048689 (0.00083) [2022-07-11 05:10:20,232][26022] Updated weights on worker 0-0, policy_version 1048699 (0.00084) [2022-07-11 05:10:22,066][26022] Updated weights on worker 0-0, policy_version 1048709 (0.00085) [2022-07-11 05:10:22,368][25689] Fps is (10 sec: 5469.0, 60 sec: 5512.6, 300 sec: 5523.1). Total num frames: 1073878016. Throughput: 0: 5678.1. Samples: 1073880614. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:22,369][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 05:10:23,816][26022] Updated weights on worker 0-0, policy_version 1048719 (0.00085) [2022-07-11 05:10:25,918][26022] Updated weights on worker 0-0, policy_version 1048729 (0.00084) [2022-07-11 05:10:27,312][26022] Updated weights on worker 0-0, policy_version 1048739 (0.00081) [2022-07-11 05:10:27,407][25689] Fps is (10 sec: 5654.9, 60 sec: 5545.9, 300 sec: 5535.0). Total num frames: 1073908736. Throughput: 0: 5777.7. Samples: 1073914188. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:27,408][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 05:10:29,392][26022] Updated weights on worker 0-0, policy_version 1048749 (0.00087) [2022-07-11 05:10:31,180][26022] Updated weights on worker 0-0, policy_version 1048759 (0.00090) [2022-07-11 05:10:32,508][25689] Fps is (10 sec: 5655.4, 60 sec: 5526.6, 300 sec: 5527.4). Total num frames: 1073935360. Throughput: 0: 4959.0. Samples: 1073930708. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:32,508][25689] Avg episode reward: [(0, '-0.118')] [2022-07-11 05:10:33,184][26022] Updated weights on worker 0-0, policy_version 1048769 (0.00086) [2022-07-11 05:10:34,930][26022] Updated weights on worker 0-0, policy_version 1048779 (0.00090) [2022-07-11 05:10:36,663][26022] Updated weights on worker 0-0, policy_version 1048789 (0.00093) [2022-07-11 05:10:37,602][25689] Fps is (10 sec: 5524.7, 60 sec: 5558.3, 300 sec: 5533.1). Total num frames: 1073965056. Throughput: 0: 5789.2. Samples: 1073964640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:37,602][25689] Avg episode reward: [(0, '0.188')] [2022-07-11 05:10:38,559][26022] Updated weights on worker 0-0, policy_version 1048799 (0.00090) [2022-07-11 05:10:40,331][26022] Updated weights on worker 0-0, policy_version 1048809 (0.00096) [2022-07-11 05:10:42,260][26022] Updated weights on worker 0-0, policy_version 1048819 (0.00090) [2022-07-11 05:10:42,661][25689] Fps is (10 sec: 5647.7, 60 sec: 5540.1, 300 sec: 5529.2). Total num frames: 1073992704. Throughput: 0: 5788.2. Samples: 1073998076. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:42,662][25689] Avg episode reward: [(0, '-0.099')] [2022-07-11 05:10:44,005][26022] Updated weights on worker 0-0, policy_version 1048829 (0.00078) [2022-07-11 05:10:45,933][26022] Updated weights on worker 0-0, policy_version 1048839 (0.00085) [2022-07-11 05:10:47,657][26022] Updated weights on worker 0-0, policy_version 1048849 (0.00092) [2022-07-11 05:10:47,678][25689] Fps is (10 sec: 5589.3, 60 sec: 5538.9, 300 sec: 5527.1). Total num frames: 1074021376. Throughput: 0: 4973.5. Samples: 1074015004. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:47,679][25689] Avg episode reward: [(0, '0.482')] [2022-07-11 05:10:49,430][26022] Updated weights on worker 0-0, policy_version 1048859 (0.00095) [2022-07-11 05:10:51,431][26022] Updated weights on worker 0-0, policy_version 1048869 (0.00088) [2022-07-11 05:10:52,748][25689] Fps is (10 sec: 5583.4, 60 sec: 5542.7, 300 sec: 5527.0). Total num frames: 1074049024. Throughput: 0: 5822.6. Samples: 1074048560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:52,749][25689] Avg episode reward: [(0, '1.012')] [2022-07-11 05:10:53,236][26022] Updated weights on worker 0-0, policy_version 1048879 (0.00089) [2022-07-11 05:10:55,114][26022] Updated weights on worker 0-0, policy_version 1048889 (0.00087) [2022-07-11 05:10:56,975][26022] Updated weights on worker 0-0, policy_version 1048899 (0.00100) [2022-07-11 05:10:57,757][25689] Fps is (10 sec: 5486.2, 60 sec: 5527.7, 300 sec: 5527.4). Total num frames: 1074076672. Throughput: 0: 5811.2. Samples: 1074081768. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:10:57,758][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 05:10:58,704][26022] Updated weights on worker 0-0, policy_version 1048909 (0.00089) [2022-07-11 05:11:00,690][26022] Updated weights on worker 0-0, policy_version 1048919 (0.00084) [2022-07-11 05:11:02,839][25689] Fps is (10 sec: 5277.1, 60 sec: 5546.1, 300 sec: 5526.2). Total num frames: 1074102272. Throughput: 0: 4978.5. Samples: 1074098530. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:02,839][25689] Avg episode reward: [(0, '0.482')] [2022-07-11 05:11:02,872][26022] Updated weights on worker 0-0, policy_version 1048929 (0.00090) [2022-07-11 05:11:04,662][26022] Updated weights on worker 0-0, policy_version 1048939 (0.00088) [2022-07-11 05:11:06,643][26022] Updated weights on worker 0-0, policy_version 1048949 (0.00097) [2022-07-11 05:11:07,859][25689] Fps is (10 sec: 5271.0, 60 sec: 5516.2, 300 sec: 5524.2). Total num frames: 1074129920. Throughput: 0: 5692.9. Samples: 1074129892. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:07,860][25689] Avg episode reward: [(0, '0.097')] [2022-07-11 05:11:08,284][26022] Updated weights on worker 0-0, policy_version 1048959 (0.00089) [2022-07-11 05:11:10,262][26022] Updated weights on worker 0-0, policy_version 1048969 (0.00082) [2022-07-11 05:11:11,928][26022] Updated weights on worker 0-0, policy_version 1048979 (0.00087) [2022-07-11 05:11:12,953][25689] Fps is (10 sec: 5568.4, 60 sec: 5533.7, 300 sec: 5526.4). Total num frames: 1074158592. Throughput: 0: 5680.8. Samples: 1074163338. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:12,954][25689] Avg episode reward: [(0, '0.071')] [2022-07-11 05:11:13,781][26022] Updated weights on worker 0-0, policy_version 1048989 (0.00081) [2022-07-11 05:11:15,488][26022] Updated weights on worker 0-0, policy_version 1048999 (0.00088) [2022-07-11 05:11:17,470][26022] Updated weights on worker 0-0, policy_version 1049009 (0.00087) [2022-07-11 05:11:17,961][25689] Fps is (10 sec: 5777.7, 60 sec: 5550.7, 300 sec: 5533.2). Total num frames: 1074188288. Throughput: 0: 5715.3. Samples: 1074197242. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:17,962][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 05:11:19,141][26022] Updated weights on worker 0-0, policy_version 1049019 (0.00088) [2022-07-11 05:11:20,946][26022] Updated weights on worker 0-0, policy_version 1049029 (0.00087) [2022-07-11 05:11:22,757][26022] Updated weights on worker 0-0, policy_version 1049039 (0.00090) [2022-07-11 05:11:22,972][25689] Fps is (10 sec: 5723.4, 60 sec: 5575.9, 300 sec: 5533.6). Total num frames: 1074215936. Throughput: 0: 5732.0. Samples: 1074213934. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:22,973][25689] Avg episode reward: [(0, '0.526')] [2022-07-11 05:11:24,740][26022] Updated weights on worker 0-0, policy_version 1049049 (0.00083) [2022-07-11 05:11:26,863][26022] Updated weights on worker 0-0, policy_version 1049059 (0.00099) [2022-07-11 05:11:27,975][25689] Fps is (10 sec: 5522.0, 60 sec: 5528.5, 300 sec: 5528.0). Total num frames: 1074243584. Throughput: 0: 5840.1. Samples: 1074247372. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:27,976][25689] Avg episode reward: [(0, '1.578')] [2022-07-11 05:11:28,341][26022] Updated weights on worker 0-0, policy_version 1049069 (0.00084) [2022-07-11 05:11:30,527][26022] Updated weights on worker 0-0, policy_version 1049079 (0.00092) [2022-07-11 05:11:31,945][26022] Updated weights on worker 0-0, policy_version 1049089 (0.00090) [2022-07-11 05:11:33,043][25689] Fps is (10 sec: 5490.9, 60 sec: 5548.4, 300 sec: 5534.0). Total num frames: 1074271232. Throughput: 0: 5847.0. Samples: 1074280802. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:33,043][25689] Avg episode reward: [(0, '1.631')] [2022-07-11 05:11:34,188][26022] Updated weights on worker 0-0, policy_version 1049099 (0.00092) [2022-07-11 05:11:35,755][26022] Updated weights on worker 0-0, policy_version 1049109 (0.00086) [2022-07-11 05:11:36,650][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:11:36,658][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001049113_1074291712.pth [2022-07-11 05:11:36,659][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001047167_1072299008.pth [2022-07-11 05:11:37,667][26022] Updated weights on worker 0-0, policy_version 1049119 (0.00098) [2022-07-11 05:11:38,057][25689] Fps is (10 sec: 5586.5, 60 sec: 5538.8, 300 sec: 5527.8). Total num frames: 1074299904. Throughput: 0: 4992.8. Samples: 1074297572. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:38,057][25689] Avg episode reward: [(0, '1.809')] [2022-07-11 05:11:39,488][26022] Updated weights on worker 0-0, policy_version 1049129 (0.00090) [2022-07-11 05:11:41,462][26022] Updated weights on worker 0-0, policy_version 1049139 (0.00083) [2022-07-11 05:11:43,147][25689] Fps is (10 sec: 5675.1, 60 sec: 5552.9, 300 sec: 5531.2). Total num frames: 1074328576. Throughput: 0: 5820.9. Samples: 1074331372. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:43,148][25689] Avg episode reward: [(0, '1.669')] [2022-07-11 05:11:43,160][26022] Updated weights on worker 0-0, policy_version 1049149 (0.00086) [2022-07-11 05:11:45,171][26022] Updated weights on worker 0-0, policy_version 1049159 (0.00091) [2022-07-11 05:11:46,563][26022] Updated weights on worker 0-0, policy_version 1049169 (0.00089) [2022-07-11 05:11:48,155][25689] Fps is (10 sec: 5475.8, 60 sec: 5519.9, 300 sec: 5525.2). Total num frames: 1074355200. Throughput: 0: 5824.4. Samples: 1074364906. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:48,156][25689] Avg episode reward: [(0, '1.547')] [2022-07-11 05:11:48,842][26022] Updated weights on worker 0-0, policy_version 1049179 (0.00087) [2022-07-11 05:11:50,351][26022] Updated weights on worker 0-0, policy_version 1049189 (0.00080) [2022-07-11 05:11:52,414][26022] Updated weights on worker 0-0, policy_version 1049199 (0.00089) [2022-07-11 05:11:53,207][25689] Fps is (10 sec: 5496.8, 60 sec: 5538.5, 300 sec: 5531.8). Total num frames: 1074383872. Throughput: 0: 5001.7. Samples: 1074381658. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:53,207][25689] Avg episode reward: [(0, '1.614')] [2022-07-11 05:11:54,125][26022] Updated weights on worker 0-0, policy_version 1049209 (0.00086) [2022-07-11 05:11:56,075][26022] Updated weights on worker 0-0, policy_version 1049219 (0.00081) [2022-07-11 05:11:57,703][26022] Updated weights on worker 0-0, policy_version 1049229 (0.00090) [2022-07-11 05:11:58,254][25689] Fps is (10 sec: 5678.0, 60 sec: 5551.9, 300 sec: 5531.0). Total num frames: 1074412544. Throughput: 0: 5829.4. Samples: 1074415312. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:11:58,255][25689] Avg episode reward: [(0, '1.853')] [2022-07-11 05:11:59,766][26022] Updated weights on worker 0-0, policy_version 1049239 (0.00086) [2022-07-11 05:12:01,262][26022] Updated weights on worker 0-0, policy_version 1049249 (0.00089) [2022-07-11 05:12:03,279][25689] Fps is (10 sec: 5286.9, 60 sec: 5540.2, 300 sec: 5524.0). Total num frames: 1074437120. Throughput: 0: 5749.5. Samples: 1074447118. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:12:03,279][25689] Avg episode reward: [(0, '1.753')] [2022-07-11 05:12:03,743][26022] Updated weights on worker 0-0, policy_version 1049259 (0.00093) [2022-07-11 05:12:05,271][26022] Updated weights on worker 0-0, policy_version 1049269 (0.00088) [2022-07-11 05:12:07,563][26022] Updated weights on worker 0-0, policy_version 1049279 (0.00894) [2022-07-11 05:12:08,298][25689] Fps is (10 sec: 5505.7, 60 sec: 5591.1, 300 sec: 5538.4). Total num frames: 1074467840. Throughput: 0: 4902.1. Samples: 1074463648. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:12:08,298][25689] Avg episode reward: [(0, '1.586')] [2022-07-11 05:12:09,124][26022] Updated weights on worker 0-0, policy_version 1049289 (0.00081) [2022-07-11 05:12:11,086][26022] Updated weights on worker 0-0, policy_version 1049299 (0.00099) [2022-07-11 05:12:12,996][26022] Updated weights on worker 0-0, policy_version 1049309 (0.00085) [2022-07-11 05:12:13,371][25689] Fps is (10 sec: 5681.9, 60 sec: 5559.1, 300 sec: 5530.6). Total num frames: 1074494464. Throughput: 0: 5717.7. Samples: 1074496950. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:12:13,372][25689] Avg episode reward: [(0, '1.234')] [2022-07-11 05:12:14,657][26022] Updated weights on worker 0-0, policy_version 1049319 (0.00086) [2022-07-11 05:12:16,842][26022] Updated weights on worker 0-0, policy_version 1049329 (0.00084) [2022-07-11 05:12:18,392][25689] Fps is (10 sec: 5477.9, 60 sec: 5541.0, 300 sec: 5534.1). Total num frames: 1074523136. Throughput: 0: 5705.2. Samples: 1074530202. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:12:18,392][25689] Avg episode reward: [(0, '0.200')] [2022-07-11 05:12:18,394][26022] Updated weights on worker 0-0, policy_version 1049339 (0.00085) [2022-07-11 05:12:20,262][26022] Updated weights on worker 0-0, policy_version 1049349 (0.00089) [2022-07-11 05:12:22,119][26022] Updated weights on worker 0-0, policy_version 1049359 (0.00091) [2022-07-11 05:12:23,486][25689] Fps is (10 sec: 5568.0, 60 sec: 5533.4, 300 sec: 5529.4). Total num frames: 1074550784. Throughput: 0: 4943.1. Samples: 1074547006. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:12:23,487][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 05:12:23,923][26022] Updated weights on worker 0-0, policy_version 1049369 (0.00092) [2022-07-11 05:12:25,896][26022] Updated weights on worker 0-0, policy_version 1049379 (0.00084) [2022-07-11 05:12:27,567][26022] Updated weights on worker 0-0, policy_version 1049389 (0.00086) [2022-07-11 05:12:28,496][25689] Fps is (10 sec: 5472.6, 60 sec: 5532.8, 300 sec: 5530.2). Total num frames: 1074578432. Throughput: 0: 5794.4. Samples: 1074580686. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 05:12:28,498][25689] Avg episode reward: [(0, '0.747')] [2022-07-11 05:12:29,451][26022] Updated weights on worker 0-0, policy_version 1049399 (0.00081) [2022-07-11 05:12:31,274][26022] Updated weights on worker 0-0, policy_version 1049409 (0.00089) [2022-07-11 05:12:33,192][26022] Updated weights on worker 0-0, policy_version 1049419 (0.00085) [2022-07-11 05:12:33,616][25689] Fps is (10 sec: 5559.6, 60 sec: 5544.8, 300 sec: 5528.1). Total num frames: 1074607104. Throughput: 0: 5778.9. Samples: 1074613946. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:12:33,617][25689] Avg episode reward: [(0, '-0.963')] [2022-07-11 05:12:34,936][26022] Updated weights on worker 0-0, policy_version 1049429 (0.00087) [2022-07-11 05:12:36,767][26022] Updated weights on worker 0-0, policy_version 1049439 (0.00090) [2022-07-11 05:12:38,666][25689] Fps is (10 sec: 5538.0, 60 sec: 5524.7, 300 sec: 5532.0). Total num frames: 1074634752. Throughput: 0: 4956.9. Samples: 1074630696. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:12:38,666][25689] Avg episode reward: [(0, '-0.360')] [2022-07-11 05:12:38,726][26022] Updated weights on worker 0-0, policy_version 1049449 (0.00091) [2022-07-11 05:12:40,582][26022] Updated weights on worker 0-0, policy_version 1049459 (0.00085) [2022-07-11 05:12:42,307][26022] Updated weights on worker 0-0, policy_version 1049469 (0.00438) [2022-07-11 05:12:43,677][25689] Fps is (10 sec: 5394.5, 60 sec: 5498.1, 300 sec: 5526.0). Total num frames: 1074661376. Throughput: 0: 5795.5. Samples: 1074664022. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:12:43,678][25689] Avg episode reward: [(0, '-0.436')] [2022-07-11 05:12:44,201][26022] Updated weights on worker 0-0, policy_version 1049479 (0.00089) [2022-07-11 05:12:46,203][26022] Updated weights on worker 0-0, policy_version 1049489 (0.00093) [2022-07-11 05:12:47,927][26022] Updated weights on worker 0-0, policy_version 1049499 (0.00084) [2022-07-11 05:12:48,680][25689] Fps is (10 sec: 5624.2, 60 sec: 5549.3, 300 sec: 5530.5). Total num frames: 1074691072. Throughput: 0: 5785.9. Samples: 1074697466. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:12:48,680][25689] Avg episode reward: [(0, '-0.305')] [2022-07-11 05:12:49,879][26022] Updated weights on worker 0-0, policy_version 1049509 (0.00087) [2022-07-11 05:12:51,652][26022] Updated weights on worker 0-0, policy_version 1049519 (0.00093) [2022-07-11 05:12:53,531][26022] Updated weights on worker 0-0, policy_version 1049529 (0.00100) [2022-07-11 05:12:53,719][25689] Fps is (10 sec: 5710.6, 60 sec: 5533.6, 300 sec: 5526.7). Total num frames: 1074718720. Throughput: 0: 4978.0. Samples: 1074714016. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:12:53,719][25689] Avg episode reward: [(0, '-0.602')] [2022-07-11 05:12:55,400][26022] Updated weights on worker 0-0, policy_version 1049539 (0.00085) [2022-07-11 05:12:57,168][26022] Updated weights on worker 0-0, policy_version 1049549 (0.00088) [2022-07-11 05:12:58,727][25689] Fps is (10 sec: 5503.7, 60 sec: 5520.3, 300 sec: 5537.0). Total num frames: 1074746368. Throughput: 0: 5803.2. Samples: 1074747114. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:12:58,727][25689] Avg episode reward: [(0, '-0.883')] [2022-07-11 05:12:59,061][26022] Updated weights on worker 0-0, policy_version 1049559 (0.00103) [2022-07-11 05:13:00,831][26022] Updated weights on worker 0-0, policy_version 1049569 (0.00086) [2022-07-11 05:13:03,002][26022] Updated weights on worker 0-0, policy_version 1049579 (0.00088) [2022-07-11 05:13:03,732][25689] Fps is (10 sec: 5420.2, 60 sec: 5555.9, 300 sec: 5530.3). Total num frames: 1074772992. Throughput: 0: 5696.5. Samples: 1074778262. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:03,732][25689] Avg episode reward: [(0, '0.033')] [2022-07-11 05:13:05,104][26022] Updated weights on worker 0-0, policy_version 1049589 (0.00095) [2022-07-11 05:13:06,644][26022] Updated weights on worker 0-0, policy_version 1049599 (0.00092) [2022-07-11 05:13:08,671][26022] Updated weights on worker 0-0, policy_version 1049609 (0.00083) [2022-07-11 05:13:08,756][25689] Fps is (10 sec: 5309.5, 60 sec: 5487.7, 300 sec: 5532.0). Total num frames: 1074799616. Throughput: 0: 4847.9. Samples: 1074794792. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:08,756][25689] Avg episode reward: [(0, '1.404')] [2022-07-11 05:13:10,323][26022] Updated weights on worker 0-0, policy_version 1049619 (0.00087) [2022-07-11 05:13:12,439][26022] Updated weights on worker 0-0, policy_version 1049629 (0.00086) [2022-07-11 05:13:13,857][25689] Fps is (10 sec: 5562.2, 60 sec: 5536.0, 300 sec: 5537.3). Total num frames: 1074829312. Throughput: 0: 5679.3. Samples: 1074828388. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:13,858][25689] Avg episode reward: [(0, '1.339')] [2022-07-11 05:13:14,099][26022] Updated weights on worker 0-0, policy_version 1049639 (0.00081) [2022-07-11 05:13:15,998][26022] Updated weights on worker 0-0, policy_version 1049649 (0.00080) [2022-07-11 05:13:17,769][26022] Updated weights on worker 0-0, policy_version 1049659 (0.00085) [2022-07-11 05:13:18,872][25689] Fps is (10 sec: 5668.2, 60 sec: 5519.5, 300 sec: 5533.7). Total num frames: 1074856960. Throughput: 0: 5702.1. Samples: 1074861986. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:18,873][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 05:13:19,616][26022] Updated weights on worker 0-0, policy_version 1049669 (0.00087) [2022-07-11 05:13:21,411][26022] Updated weights on worker 0-0, policy_version 1049679 (0.00087) [2022-07-11 05:13:23,420][26022] Updated weights on worker 0-0, policy_version 1049689 (0.00088) [2022-07-11 05:13:23,954][25689] Fps is (10 sec: 5476.4, 60 sec: 5520.7, 300 sec: 5532.5). Total num frames: 1074884608. Throughput: 0: 4975.6. Samples: 1074878882. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:23,955][25689] Avg episode reward: [(0, '0.573')] [2022-07-11 05:13:25,179][26022] Updated weights on worker 0-0, policy_version 1049699 (0.00090) [2022-07-11 05:13:26,916][26022] Updated weights on worker 0-0, policy_version 1049709 (0.00089) [2022-07-11 05:13:28,804][26022] Updated weights on worker 0-0, policy_version 1049719 (0.00083) [2022-07-11 05:13:28,971][25689] Fps is (10 sec: 5475.5, 60 sec: 5520.0, 300 sec: 5533.4). Total num frames: 1074912256. Throughput: 0: 5808.5. Samples: 1074912214. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:28,972][25689] Avg episode reward: [(0, '-0.279')] [2022-07-11 05:13:30,757][26022] Updated weights on worker 0-0, policy_version 1049729 (0.00086) [2022-07-11 05:13:32,525][26022] Updated weights on worker 0-0, policy_version 1049739 (0.00085) [2022-07-11 05:13:34,062][25689] Fps is (10 sec: 5673.4, 60 sec: 5539.7, 300 sec: 5539.2). Total num frames: 1074941952. Throughput: 0: 5811.3. Samples: 1074945804. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:34,062][25689] Avg episode reward: [(0, '-0.331')] [2022-07-11 05:13:34,516][26022] Updated weights on worker 0-0, policy_version 1049749 (0.00092) [2022-07-11 05:13:36,278][26022] Updated weights on worker 0-0, policy_version 1049759 (0.00082) [2022-07-11 05:13:36,681][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:13:36,696][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001049761_1074955264.pth [2022-07-11 05:13:36,697][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001047816_1072963584.pth [2022-07-11 05:13:38,230][26022] Updated weights on worker 0-0, policy_version 1049769 (0.00096) [2022-07-11 05:13:39,121][25689] Fps is (10 sec: 5548.9, 60 sec: 5521.8, 300 sec: 5531.3). Total num frames: 1074968576. Throughput: 0: 4947.8. Samples: 1074962172. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:39,122][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 05:13:39,932][26022] Updated weights on worker 0-0, policy_version 1049779 (0.00079) [2022-07-11 05:13:41,628][26022] Updated weights on worker 0-0, policy_version 1049789 (0.00085) [2022-07-11 05:13:43,554][26022] Updated weights on worker 0-0, policy_version 1049799 (0.00084) [2022-07-11 05:13:44,196][25689] Fps is (10 sec: 5456.4, 60 sec: 5549.9, 300 sec: 5537.0). Total num frames: 1074997248. Throughput: 0: 5788.4. Samples: 1074996048. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:44,197][25689] Avg episode reward: [(0, '0.055')] [2022-07-11 05:13:45,384][26022] Updated weights on worker 0-0, policy_version 1049809 (0.00085) [2022-07-11 05:13:47,193][26022] Updated weights on worker 0-0, policy_version 1049819 (0.00082) [2022-07-11 05:13:48,978][26022] Updated weights on worker 0-0, policy_version 1049829 (0.00085) [2022-07-11 05:13:49,225][25689] Fps is (10 sec: 5675.3, 60 sec: 5530.5, 300 sec: 5538.8). Total num frames: 1075025920. Throughput: 0: 5812.3. Samples: 1075029936. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:49,226][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 05:13:50,727][26022] Updated weights on worker 0-0, policy_version 1049839 (0.00075) [2022-07-11 05:13:52,876][26022] Updated weights on worker 0-0, policy_version 1049849 (0.00091) [2022-07-11 05:13:54,273][25689] Fps is (10 sec: 5588.9, 60 sec: 5529.7, 300 sec: 5534.6). Total num frames: 1075053568. Throughput: 0: 5796.9. Samples: 1075062966. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:54,274][25689] Avg episode reward: [(0, '0.116')] [2022-07-11 05:13:54,701][26022] Updated weights on worker 0-0, policy_version 1049859 (0.00088) [2022-07-11 05:13:56,300][26022] Updated weights on worker 0-0, policy_version 1049869 (0.00103) [2022-07-11 05:13:58,467][26022] Updated weights on worker 0-0, policy_version 1049879 (0.00095) [2022-07-11 05:13:59,303][25689] Fps is (10 sec: 5588.7, 60 sec: 5544.6, 300 sec: 5541.6). Total num frames: 1075082240. Throughput: 0: 5823.6. Samples: 1075079702. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:13:59,303][25689] Avg episode reward: [(0, '0.734')] [2022-07-11 05:14:00,135][26022] Updated weights on worker 0-0, policy_version 1049889 (0.00088) [2022-07-11 05:14:02,482][26022] Updated weights on worker 0-0, policy_version 1049899 (0.00085) [2022-07-11 05:14:04,203][26022] Updated weights on worker 0-0, policy_version 1049909 (0.00097) [2022-07-11 05:14:04,344][25689] Fps is (10 sec: 5287.4, 60 sec: 5507.5, 300 sec: 5534.9). Total num frames: 1075106816. Throughput: 0: 5715.5. Samples: 1075111202. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:04,344][25689] Avg episode reward: [(0, '0.783')] [2022-07-11 05:14:06,158][26022] Updated weights on worker 0-0, policy_version 1049919 (0.00085) [2022-07-11 05:14:07,894][26022] Updated weights on worker 0-0, policy_version 1049929 (0.00091) [2022-07-11 05:14:09,367][25689] Fps is (10 sec: 5290.9, 60 sec: 5541.4, 300 sec: 5539.8). Total num frames: 1075135488. Throughput: 0: 5693.0. Samples: 1075144602. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:09,368][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 05:14:09,902][26022] Updated weights on worker 0-0, policy_version 1049939 (0.00092) [2022-07-11 05:14:11,520][26022] Updated weights on worker 0-0, policy_version 1049949 (0.00084) [2022-07-11 05:14:13,557][26022] Updated weights on worker 0-0, policy_version 1049959 (0.00084) [2022-07-11 05:14:14,461][25689] Fps is (10 sec: 5667.9, 60 sec: 5525.2, 300 sec: 5534.9). Total num frames: 1075164160. Throughput: 0: 4873.3. Samples: 1075161348. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:14,462][25689] Avg episode reward: [(0, '0.802')] [2022-07-11 05:14:15,051][26022] Updated weights on worker 0-0, policy_version 1049969 (0.00089) [2022-07-11 05:14:17,176][26022] Updated weights on worker 0-0, policy_version 1049979 (0.00092) [2022-07-11 05:14:18,717][26022] Updated weights on worker 0-0, policy_version 1049989 (0.00084) [2022-07-11 05:14:19,535][25689] Fps is (10 sec: 5437.9, 60 sec: 5502.9, 300 sec: 5530.6). Total num frames: 1075190784. Throughput: 0: 5702.4. Samples: 1075195076. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:19,536][25689] Avg episode reward: [(0, '1.412')] [2022-07-11 05:14:20,591][26022] Updated weights on worker 0-0, policy_version 1049999 (0.00084) [2022-07-11 05:14:22,465][26022] Updated weights on worker 0-0, policy_version 1050009 (0.00092) [2022-07-11 05:14:24,362][26022] Updated weights on worker 0-0, policy_version 1050019 (0.00107) [2022-07-11 05:14:24,575][25689] Fps is (10 sec: 5568.4, 60 sec: 5540.5, 300 sec: 5533.9). Total num frames: 1075220480. Throughput: 0: 5827.2. Samples: 1075229096. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:24,576][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 05:14:26,160][26022] Updated weights on worker 0-0, policy_version 1050029 (0.00087) [2022-07-11 05:14:27,995][26022] Updated weights on worker 0-0, policy_version 1050039 (0.00085) [2022-07-11 05:14:29,659][25689] Fps is (10 sec: 5765.5, 60 sec: 5551.3, 300 sec: 5537.2). Total num frames: 1075249152. Throughput: 0: 4992.0. Samples: 1075245900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:29,659][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 05:14:29,840][26022] Updated weights on worker 0-0, policy_version 1050049 (0.00091) [2022-07-11 05:14:31,719][26022] Updated weights on worker 0-0, policy_version 1050059 (0.00083) [2022-07-11 05:14:33,529][26022] Updated weights on worker 0-0, policy_version 1050069 (0.00086) [2022-07-11 05:14:34,721][25689] Fps is (10 sec: 5450.1, 60 sec: 5503.2, 300 sec: 5533.9). Total num frames: 1075275776. Throughput: 0: 5816.7. Samples: 1075279196. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:34,722][25689] Avg episode reward: [(0, '0.459')] [2022-07-11 05:14:35,174][26022] Updated weights on worker 0-0, policy_version 1050079 (0.00088) [2022-07-11 05:14:37,288][26022] Updated weights on worker 0-0, policy_version 1050089 (0.00091) [2022-07-11 05:14:39,067][26022] Updated weights on worker 0-0, policy_version 1050099 (0.00090) [2022-07-11 05:14:39,722][25689] Fps is (10 sec: 5596.4, 60 sec: 5559.2, 300 sec: 5538.2). Total num frames: 1075305472. Throughput: 0: 5817.4. Samples: 1075312514. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:39,723][25689] Avg episode reward: [(0, '0.957')] [2022-07-11 05:14:40,935][26022] Updated weights on worker 0-0, policy_version 1050109 (0.00087) [2022-07-11 05:14:42,583][26022] Updated weights on worker 0-0, policy_version 1050119 (0.00092) [2022-07-11 05:14:44,642][26022] Updated weights on worker 0-0, policy_version 1050129 (0.00482) [2022-07-11 05:14:44,736][25689] Fps is (10 sec: 5623.7, 60 sec: 5531.0, 300 sec: 5531.1). Total num frames: 1075332096. Throughput: 0: 4968.1. Samples: 1075329256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:44,736][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 05:14:46,299][26022] Updated weights on worker 0-0, policy_version 1050139 (0.00096) [2022-07-11 05:14:48,237][26022] Updated weights on worker 0-0, policy_version 1050149 (0.00086) [2022-07-11 05:14:49,790][25689] Fps is (10 sec: 5492.7, 60 sec: 5528.8, 300 sec: 5535.6). Total num frames: 1075360768. Throughput: 0: 5808.9. Samples: 1075362838. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:49,790][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 05:14:49,892][26022] Updated weights on worker 0-0, policy_version 1050159 (0.00091) [2022-07-11 05:14:52,035][26022] Updated weights on worker 0-0, policy_version 1050169 (0.00094) [2022-07-11 05:14:53,558][26022] Updated weights on worker 0-0, policy_version 1050179 (0.00084) [2022-07-11 05:14:54,831][25689] Fps is (10 sec: 5579.3, 60 sec: 5529.4, 300 sec: 5532.0). Total num frames: 1075388416. Throughput: 0: 5833.2. Samples: 1075396498. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:54,831][25689] Avg episode reward: [(0, '0.154')] [2022-07-11 05:14:55,655][26022] Updated weights on worker 0-0, policy_version 1050189 (0.00088) [2022-07-11 05:14:57,359][26022] Updated weights on worker 0-0, policy_version 1050199 (0.00096) [2022-07-11 05:14:59,355][26022] Updated weights on worker 0-0, policy_version 1050209 (0.00086) [2022-07-11 05:14:59,845][25689] Fps is (10 sec: 5601.1, 60 sec: 5530.8, 300 sec: 5547.3). Total num frames: 1075417088. Throughput: 0: 4995.4. Samples: 1075413034. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:14:59,845][25689] Avg episode reward: [(0, '0.165')] [2022-07-11 05:15:01,039][26022] Updated weights on worker 0-0, policy_version 1050219 (0.00090) [2022-07-11 05:15:03,427][26022] Updated weights on worker 0-0, policy_version 1050229 (0.00091) [2022-07-11 05:15:04,904][25689] Fps is (10 sec: 5591.0, 60 sec: 5579.9, 300 sec: 5540.5). Total num frames: 1075444736. Throughput: 0: 5717.7. Samples: 1075444570. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:04,906][25689] Avg episode reward: [(0, '-0.000')] [2022-07-11 05:15:04,912][26022] Updated weights on worker 0-0, policy_version 1050239 (0.00090) [2022-07-11 05:15:07,041][26022] Updated weights on worker 0-0, policy_version 1050249 (0.00089) [2022-07-11 05:15:08,829][26022] Updated weights on worker 0-0, policy_version 1050259 (0.00089) [2022-07-11 05:15:09,911][25689] Fps is (10 sec: 5391.3, 60 sec: 5547.5, 300 sec: 5538.8). Total num frames: 1075471360. Throughput: 0: 5724.8. Samples: 1075478030. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:09,913][25689] Avg episode reward: [(0, '0.420')] [2022-07-11 05:15:10,719][26022] Updated weights on worker 0-0, policy_version 1050269 (0.00090) [2022-07-11 05:15:12,533][26022] Updated weights on worker 0-0, policy_version 1050279 (0.00085) [2022-07-11 05:15:14,344][26022] Updated weights on worker 0-0, policy_version 1050289 (0.00087) [2022-07-11 05:15:15,011][25689] Fps is (10 sec: 5268.3, 60 sec: 5513.2, 300 sec: 5530.2). Total num frames: 1075497984. Throughput: 0: 4870.3. Samples: 1075494784. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:15,012][25689] Avg episode reward: [(0, '-0.304')] [2022-07-11 05:15:16,119][26022] Updated weights on worker 0-0, policy_version 1050299 (0.00088) [2022-07-11 05:15:18,298][26022] Updated weights on worker 0-0, policy_version 1050309 (0.00088) [2022-07-11 05:15:19,652][26022] Updated weights on worker 0-0, policy_version 1050319 (0.00086) [2022-07-11 05:15:20,048][25689] Fps is (10 sec: 5657.1, 60 sec: 5584.3, 300 sec: 5545.2). Total num frames: 1075528704. Throughput: 0: 5716.0. Samples: 1075528516. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:20,049][25689] Avg episode reward: [(0, '0.582')] [2022-07-11 05:15:21,755][26022] Updated weights on worker 0-0, policy_version 1050329 (0.00089) [2022-07-11 05:15:23,264][26022] Updated weights on worker 0-0, policy_version 1050339 (0.00087) [2022-07-11 05:15:25,082][25689] Fps is (10 sec: 5694.1, 60 sec: 5534.1, 300 sec: 5531.5). Total num frames: 1075555328. Throughput: 0: 5840.4. Samples: 1075562418. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:25,083][25689] Avg episode reward: [(0, '1.383')] [2022-07-11 05:15:25,231][26022] Updated weights on worker 0-0, policy_version 1050349 (0.00098) [2022-07-11 05:15:26,951][26022] Updated weights on worker 0-0, policy_version 1050359 (0.00101) [2022-07-11 05:15:28,955][26022] Updated weights on worker 0-0, policy_version 1050369 (0.00796) [2022-07-11 05:15:30,170][25689] Fps is (10 sec: 5463.1, 60 sec: 5533.7, 300 sec: 5538.6). Total num frames: 1075584000. Throughput: 0: 4989.8. Samples: 1075579116. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:30,170][25689] Avg episode reward: [(0, '1.584')] [2022-07-11 05:15:30,771][26022] Updated weights on worker 0-0, policy_version 1050379 (0.00096) [2022-07-11 05:15:32,762][26022] Updated weights on worker 0-0, policy_version 1050389 (0.00091) [2022-07-11 05:15:34,236][26022] Updated weights on worker 0-0, policy_version 1050399 (0.00090) [2022-07-11 05:15:35,286][25689] Fps is (10 sec: 5519.5, 60 sec: 5545.7, 300 sec: 5531.3). Total num frames: 1075611648. Throughput: 0: 5802.1. Samples: 1075612420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:35,286][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 05:15:36,395][26022] Updated weights on worker 0-0, policy_version 1050409 (0.00092) [2022-07-11 05:15:36,806][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:15:36,818][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001050411_1075620864.pth [2022-07-11 05:15:36,818][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001048463_1073626112.pth [2022-07-11 05:15:38,060][26022] Updated weights on worker 0-0, policy_version 1050419 (0.00090) [2022-07-11 05:15:39,968][26022] Updated weights on worker 0-0, policy_version 1050429 (0.00090) [2022-07-11 05:15:40,363][25689] Fps is (10 sec: 5625.8, 60 sec: 5538.7, 300 sec: 5537.9). Total num frames: 1075641344. Throughput: 0: 5782.5. Samples: 1075645986. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:40,363][25689] Avg episode reward: [(0, '1.281')] [2022-07-11 05:15:41,774][26022] Updated weights on worker 0-0, policy_version 1050439 (0.00085) [2022-07-11 05:15:43,495][26022] Updated weights on worker 0-0, policy_version 1050449 (0.00086) [2022-07-11 05:15:45,375][25689] Fps is (10 sec: 5582.2, 60 sec: 5538.9, 300 sec: 5531.1). Total num frames: 1075667968. Throughput: 0: 4951.1. Samples: 1075662892. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:45,375][25689] Avg episode reward: [(0, '0.559')] [2022-07-11 05:15:45,580][26022] Updated weights on worker 0-0, policy_version 1050459 (0.00091) [2022-07-11 05:15:47,209][26022] Updated weights on worker 0-0, policy_version 1050469 (0.00088) [2022-07-11 05:15:49,054][26022] Updated weights on worker 0-0, policy_version 1050479 (0.00091) [2022-07-11 05:15:50,418][25689] Fps is (10 sec: 5499.1, 60 sec: 5539.8, 300 sec: 5535.0). Total num frames: 1075696640. Throughput: 0: 5789.6. Samples: 1075696346. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:50,419][25689] Avg episode reward: [(0, '0.695')] [2022-07-11 05:15:51,144][26022] Updated weights on worker 0-0, policy_version 1050489 (0.00089) [2022-07-11 05:15:52,635][26022] Updated weights on worker 0-0, policy_version 1050499 (0.00089) [2022-07-11 05:15:54,854][26022] Updated weights on worker 0-0, policy_version 1050509 (0.00091) [2022-07-11 05:15:55,495][25689] Fps is (10 sec: 5767.5, 60 sec: 5570.3, 300 sec: 5540.7). Total num frames: 1075726336. Throughput: 0: 5815.6. Samples: 1075729950. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:15:55,495][25689] Avg episode reward: [(0, '0.628')] [2022-07-11 05:15:56,306][26022] Updated weights on worker 0-0, policy_version 1050519 (0.00079) [2022-07-11 05:15:58,392][26022] Updated weights on worker 0-0, policy_version 1050529 (0.00086) [2022-07-11 05:15:59,962][26022] Updated weights on worker 0-0, policy_version 1050539 (0.00102) [2022-07-11 05:16:00,549][25689] Fps is (10 sec: 5660.7, 60 sec: 5549.9, 300 sec: 5548.1). Total num frames: 1075753984. Throughput: 0: 4991.2. Samples: 1075746736. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:16:00,549][25689] Avg episode reward: [(0, '0.809')] [2022-07-11 05:16:02,122][26022] Updated weights on worker 0-0, policy_version 1050549 (0.00091) [2022-07-11 05:16:04,039][26022] Updated weights on worker 0-0, policy_version 1050559 (0.00085) [2022-07-11 05:16:05,557][25689] Fps is (10 sec: 5291.9, 60 sec: 5520.7, 300 sec: 5541.4). Total num frames: 1075779584. Throughput: 0: 5717.4. Samples: 1075778282. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:16:05,558][25689] Avg episode reward: [(0, '-0.069')] [2022-07-11 05:16:06,204][26022] Updated weights on worker 0-0, policy_version 1050569 (0.00094) [2022-07-11 05:16:07,586][26022] Updated weights on worker 0-0, policy_version 1050579 (0.00091) [2022-07-11 05:16:10,021][26022] Updated weights on worker 0-0, policy_version 1050589 (0.00085) [2022-07-11 05:16:10,594][25689] Fps is (10 sec: 5402.7, 60 sec: 5551.8, 300 sec: 5542.5). Total num frames: 1075808256. Throughput: 0: 5723.4. Samples: 1075811818. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:16:10,594][25689] Avg episode reward: [(0, '0.684')] [2022-07-11 05:16:11,308][26022] Updated weights on worker 0-0, policy_version 1050599 (0.00092) [2022-07-11 05:16:13,507][26022] Updated weights on worker 0-0, policy_version 1050609 (0.00093) [2022-07-11 05:16:15,075][26022] Updated weights on worker 0-0, policy_version 1050619 (0.00083) [2022-07-11 05:16:15,725][25689] Fps is (10 sec: 5539.1, 60 sec: 5565.8, 300 sec: 5533.3). Total num frames: 1075835904. Throughput: 0: 5704.5. Samples: 1075845352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:16:15,725][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 05:16:17,196][26022] Updated weights on worker 0-0, policy_version 1050629 (0.00083) [2022-07-11 05:16:18,656][26022] Updated weights on worker 0-0, policy_version 1050639 (0.00088) [2022-07-11 05:16:20,739][25689] Fps is (10 sec: 5450.7, 60 sec: 5517.3, 300 sec: 5533.2). Total num frames: 1075863552. Throughput: 0: 5718.1. Samples: 1075862186. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:16:20,739][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 05:16:20,983][26022] Updated weights on worker 0-0, policy_version 1050649 (0.00096) [2022-07-11 05:16:22,379][26022] Updated weights on worker 0-0, policy_version 1050659 (0.00087) [2022-07-11 05:16:24,504][26022] Updated weights on worker 0-0, policy_version 1050669 (0.00088) [2022-07-11 05:16:25,747][25689] Fps is (10 sec: 5619.6, 60 sec: 5553.4, 300 sec: 5536.6). Total num frames: 1075892224. Throughput: 0: 5819.8. Samples: 1075895782. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 05:16:25,749][25689] Avg episode reward: [(0, '-0.197')] [2022-07-11 05:16:26,239][26022] Updated weights on worker 0-0, policy_version 1050679 (0.00088) [2022-07-11 05:16:28,062][26022] Updated weights on worker 0-0, policy_version 1050689 (0.00089) [2022-07-11 05:16:29,977][26022] Updated weights on worker 0-0, policy_version 1050699 (0.00086) [2022-07-11 05:16:30,778][25689] Fps is (10 sec: 5609.7, 60 sec: 5541.7, 300 sec: 5537.3). Total num frames: 1075919872. Throughput: 0: 5808.0. Samples: 1075929050. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:16:30,779][25689] Avg episode reward: [(0, '0.619')] [2022-07-11 05:16:31,628][26022] Updated weights on worker 0-0, policy_version 1050709 (0.00090) [2022-07-11 05:16:33,694][26022] Updated weights on worker 0-0, policy_version 1050719 (0.00082) [2022-07-11 05:16:35,695][26022] Updated weights on worker 0-0, policy_version 1050729 (0.00083) [2022-07-11 05:16:35,836][25689] Fps is (10 sec: 5480.5, 60 sec: 5546.9, 300 sec: 5533.0). Total num frames: 1075947520. Throughput: 0: 4989.2. Samples: 1075945692. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:16:35,837][25689] Avg episode reward: [(0, '0.646')] [2022-07-11 05:16:37,287][26022] Updated weights on worker 0-0, policy_version 1050739 (0.00086) [2022-07-11 05:16:39,319][26022] Updated weights on worker 0-0, policy_version 1050749 (0.00089) [2022-07-11 05:16:40,942][25689] Fps is (10 sec: 5541.2, 60 sec: 5527.4, 300 sec: 5532.7). Total num frames: 1075976192. Throughput: 0: 5787.0. Samples: 1075979104. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:16:40,943][25689] Avg episode reward: [(0, '1.335')] [2022-07-11 05:16:41,045][26022] Updated weights on worker 0-0, policy_version 1050759 (0.00094) [2022-07-11 05:16:42,885][26022] Updated weights on worker 0-0, policy_version 1050769 (0.00086) [2022-07-11 05:16:44,761][26022] Updated weights on worker 0-0, policy_version 1050779 (0.00086) [2022-07-11 05:16:45,998][25689] Fps is (10 sec: 5643.2, 60 sec: 5557.2, 300 sec: 5538.7). Total num frames: 1076004864. Throughput: 0: 5773.3. Samples: 1076012698. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:16:45,999][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 05:16:46,586][26022] Updated weights on worker 0-0, policy_version 1050789 (0.00083) [2022-07-11 05:16:48,306][26022] Updated weights on worker 0-0, policy_version 1050799 (0.00089) [2022-07-11 05:16:50,111][26022] Updated weights on worker 0-0, policy_version 1050809 (0.00091) [2022-07-11 05:16:51,050][25689] Fps is (10 sec: 5572.0, 60 sec: 5539.6, 300 sec: 5535.3). Total num frames: 1076032512. Throughput: 0: 4955.2. Samples: 1076029500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:16:51,052][25689] Avg episode reward: [(0, '1.216')] [2022-07-11 05:16:51,906][26022] Updated weights on worker 0-0, policy_version 1050819 (0.00580) [2022-07-11 05:16:53,906][26022] Updated weights on worker 0-0, policy_version 1050829 (0.00081) [2022-07-11 05:16:55,602][26022] Updated weights on worker 0-0, policy_version 1050839 (0.00088) [2022-07-11 05:16:56,156][25689] Fps is (10 sec: 5544.2, 60 sec: 5519.9, 300 sec: 5534.2). Total num frames: 1076061184. Throughput: 0: 5762.4. Samples: 1076062784. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:16:56,157][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 05:16:57,482][26022] Updated weights on worker 0-0, policy_version 1050849 (0.00090) [2022-07-11 05:16:59,379][26022] Updated weights on worker 0-0, policy_version 1050859 (0.00083) [2022-07-11 05:17:01,127][26022] Updated weights on worker 0-0, policy_version 1050869 (0.00084) [2022-07-11 05:17:01,220][25689] Fps is (10 sec: 5638.5, 60 sec: 5535.9, 300 sec: 5547.2). Total num frames: 1076089856. Throughput: 0: 5770.4. Samples: 1076096114. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:01,220][25689] Avg episode reward: [(0, '0.752')] [2022-07-11 05:17:03,493][26022] Updated weights on worker 0-0, policy_version 1050879 (0.00090) [2022-07-11 05:17:05,284][26022] Updated weights on worker 0-0, policy_version 1050889 (0.00096) [2022-07-11 05:17:06,292][25689] Fps is (10 sec: 5354.8, 60 sec: 5530.2, 300 sec: 5529.0). Total num frames: 1076115456. Throughput: 0: 4833.6. Samples: 1076110788. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:06,292][25689] Avg episode reward: [(0, '0.726')] [2022-07-11 05:17:07,052][26022] Updated weights on worker 0-0, policy_version 1050899 (0.00091) [2022-07-11 05:17:08,934][26022] Updated weights on worker 0-0, policy_version 1050909 (0.00091) [2022-07-11 05:17:10,722][26022] Updated weights on worker 0-0, policy_version 1050919 (0.00089) [2022-07-11 05:17:11,356][25689] Fps is (10 sec: 5354.2, 60 sec: 5527.6, 300 sec: 5536.1). Total num frames: 1076144128. Throughput: 0: 5651.6. Samples: 1076144264. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:11,357][25689] Avg episode reward: [(0, '0.848')] [2022-07-11 05:17:12,600][26022] Updated weights on worker 0-0, policy_version 1050929 (0.00085) [2022-07-11 05:17:14,441][26022] Updated weights on worker 0-0, policy_version 1050939 (0.00087) [2022-07-11 05:17:16,332][26022] Updated weights on worker 0-0, policy_version 1050949 (0.00094) [2022-07-11 05:17:16,415][25689] Fps is (10 sec: 5563.3, 60 sec: 5534.2, 300 sec: 5531.9). Total num frames: 1076171776. Throughput: 0: 5694.8. Samples: 1076178154. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:16,416][25689] Avg episode reward: [(0, '-0.104')] [2022-07-11 05:17:18,053][26022] Updated weights on worker 0-0, policy_version 1050959 (0.00093) [2022-07-11 05:17:19,898][26022] Updated weights on worker 0-0, policy_version 1050969 (0.00087) [2022-07-11 05:17:21,423][25689] Fps is (10 sec: 5594.8, 60 sec: 5551.6, 300 sec: 5537.0). Total num frames: 1076200448. Throughput: 0: 4892.6. Samples: 1076194958. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:21,423][25689] Avg episode reward: [(0, '0.847')] [2022-07-11 05:17:21,733][26022] Updated weights on worker 0-0, policy_version 1050979 (0.00088) [2022-07-11 05:17:23,435][26022] Updated weights on worker 0-0, policy_version 1050989 (0.00090) [2022-07-11 05:17:25,465][26022] Updated weights on worker 0-0, policy_version 1050999 (0.00091) [2022-07-11 05:17:26,437][25689] Fps is (10 sec: 5722.1, 60 sec: 5551.1, 300 sec: 5540.3). Total num frames: 1076229120. Throughput: 0: 5853.8. Samples: 1076228712. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:26,437][25689] Avg episode reward: [(0, '0.440')] [2022-07-11 05:17:27,265][26022] Updated weights on worker 0-0, policy_version 1051009 (0.01468) [2022-07-11 05:17:29,102][26022] Updated weights on worker 0-0, policy_version 1051019 (0.00084) [2022-07-11 05:17:31,012][26022] Updated weights on worker 0-0, policy_version 1051029 (0.00091) [2022-07-11 05:17:31,480][25689] Fps is (10 sec: 5498.3, 60 sec: 5533.2, 300 sec: 5534.9). Total num frames: 1076255744. Throughput: 0: 5860.6. Samples: 1076262200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:31,480][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 05:17:32,678][26022] Updated weights on worker 0-0, policy_version 1051039 (0.00087) [2022-07-11 05:17:34,772][26022] Updated weights on worker 0-0, policy_version 1051049 (0.00089) [2022-07-11 05:17:36,335][26022] Updated weights on worker 0-0, policy_version 1051059 (0.00082) [2022-07-11 05:17:36,618][25689] Fps is (10 sec: 5632.0, 60 sec: 5576.4, 300 sec: 5543.5). Total num frames: 1076286464. Throughput: 0: 4984.4. Samples: 1076278856. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:36,619][25689] Avg episode reward: [(0, '-0.008')] [2022-07-11 05:17:36,909][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:17:36,920][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001051062_1076287488.pth [2022-07-11 05:17:36,921][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001049113_1074291712.pth [2022-07-11 05:17:36,926][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_001051062_1076287488.pth.milestone [2022-07-11 05:17:38,387][26022] Updated weights on worker 0-0, policy_version 1051069 (0.00093) [2022-07-11 05:17:40,177][26022] Updated weights on worker 0-0, policy_version 1051079 (0.00090) [2022-07-11 05:17:41,643][25689] Fps is (10 sec: 5541.7, 60 sec: 5533.3, 300 sec: 5539.9). Total num frames: 1076312064. Throughput: 0: 5795.1. Samples: 1076312134. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:41,643][25689] Avg episode reward: [(0, '-0.014')] [2022-07-11 05:17:41,981][26022] Updated weights on worker 0-0, policy_version 1051089 (0.00089) [2022-07-11 05:17:43,888][26022] Updated weights on worker 0-0, policy_version 1051099 (0.00090) [2022-07-11 05:17:45,411][26022] Updated weights on worker 0-0, policy_version 1051109 (0.00084) [2022-07-11 05:17:46,671][25689] Fps is (10 sec: 5500.8, 60 sec: 5552.7, 300 sec: 5539.4). Total num frames: 1076341760. Throughput: 0: 5806.0. Samples: 1076346192. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:46,671][25689] Avg episode reward: [(0, '1.310')] [2022-07-11 05:17:47,461][26022] Updated weights on worker 0-0, policy_version 1051119 (0.00083) [2022-07-11 05:17:49,174][26022] Updated weights on worker 0-0, policy_version 1051129 (0.00090) [2022-07-11 05:17:51,141][26022] Updated weights on worker 0-0, policy_version 1051139 (0.00093) [2022-07-11 05:17:51,676][25689] Fps is (10 sec: 5715.2, 60 sec: 5556.9, 300 sec: 5540.0). Total num frames: 1076369408. Throughput: 0: 4994.6. Samples: 1076363074. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:51,677][25689] Avg episode reward: [(0, '0.414')] [2022-07-11 05:17:52,868][26022] Updated weights on worker 0-0, policy_version 1051149 (0.00089) [2022-07-11 05:17:55,004][26022] Updated weights on worker 0-0, policy_version 1051159 (0.00089) [2022-07-11 05:17:56,553][26022] Updated weights on worker 0-0, policy_version 1051169 (0.00086) [2022-07-11 05:17:56,741][25689] Fps is (10 sec: 5592.5, 60 sec: 5560.7, 300 sec: 5542.4). Total num frames: 1076398080. Throughput: 0: 5842.5. Samples: 1076396424. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:17:56,742][25689] Avg episode reward: [(0, '-0.056')] [2022-07-11 05:17:58,783][26022] Updated weights on worker 0-0, policy_version 1051179 (0.00093) [2022-07-11 05:18:00,246][26022] Updated weights on worker 0-0, policy_version 1051189 (0.00096) [2022-07-11 05:18:01,784][25689] Fps is (10 sec: 5369.3, 60 sec: 5511.9, 300 sec: 5538.2). Total num frames: 1076423680. Throughput: 0: 5812.9. Samples: 1076429214. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:01,785][25689] Avg episode reward: [(0, '0.744')] [2022-07-11 05:18:02,791][26022] Updated weights on worker 0-0, policy_version 1051199 (0.00085) [2022-07-11 05:18:04,456][26022] Updated weights on worker 0-0, policy_version 1051209 (0.00093) [2022-07-11 05:18:06,322][26022] Updated weights on worker 0-0, policy_version 1051219 (0.00081) [2022-07-11 05:18:06,797][25689] Fps is (10 sec: 5193.4, 60 sec: 5534.2, 300 sec: 5538.4). Total num frames: 1076450304. Throughput: 0: 4846.8. Samples: 1076443742. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:06,798][25689] Avg episode reward: [(0, '0.767')] [2022-07-11 05:18:08,252][26022] Updated weights on worker 0-0, policy_version 1051229 (0.00084) [2022-07-11 05:18:10,104][26022] Updated weights on worker 0-0, policy_version 1051239 (0.00086) [2022-07-11 05:18:11,812][25689] Fps is (10 sec: 5412.1, 60 sec: 5521.8, 300 sec: 5533.2). Total num frames: 1076477952. Throughput: 0: 5662.0. Samples: 1076477082. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:11,813][25689] Avg episode reward: [(0, '0.578')] [2022-07-11 05:18:11,919][26022] Updated weights on worker 0-0, policy_version 1051249 (0.00090) [2022-07-11 05:18:13,753][26022] Updated weights on worker 0-0, policy_version 1051259 (0.00084) [2022-07-11 05:18:15,624][26022] Updated weights on worker 0-0, policy_version 1051269 (0.00091) [2022-07-11 05:18:16,866][25689] Fps is (10 sec: 5491.8, 60 sec: 5522.2, 300 sec: 5532.5). Total num frames: 1076505600. Throughput: 0: 5663.9. Samples: 1076510408. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:16,867][25689] Avg episode reward: [(0, '0.710')] [2022-07-11 05:18:17,359][26022] Updated weights on worker 0-0, policy_version 1051279 (0.00091) [2022-07-11 05:18:19,361][26022] Updated weights on worker 0-0, policy_version 1051289 (0.00102) [2022-07-11 05:18:20,875][26022] Updated weights on worker 0-0, policy_version 1051299 (0.00094) [2022-07-11 05:18:21,875][25689] Fps is (10 sec: 5495.0, 60 sec: 5505.2, 300 sec: 5533.8). Total num frames: 1076533248. Throughput: 0: 5721.7. Samples: 1076544168. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:21,876][25689] Avg episode reward: [(0, '1.630')] [2022-07-11 05:18:22,816][26022] Updated weights on worker 0-0, policy_version 1051309 (0.00089) [2022-07-11 05:18:24,488][26022] Updated weights on worker 0-0, policy_version 1051319 (0.00084) [2022-07-11 05:18:26,445][26022] Updated weights on worker 0-0, policy_version 1051329 (0.00084) [2022-07-11 05:18:26,890][25689] Fps is (10 sec: 5720.8, 60 sec: 5522.0, 300 sec: 5540.7). Total num frames: 1076562944. Throughput: 0: 5834.7. Samples: 1076560976. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:26,891][25689] Avg episode reward: [(0, '1.338')] [2022-07-11 05:18:28,336][26022] Updated weights on worker 0-0, policy_version 1051339 (0.00093) [2022-07-11 05:18:30,237][26022] Updated weights on worker 0-0, policy_version 1051349 (0.00106) [2022-07-11 05:18:31,894][25689] Fps is (10 sec: 5723.5, 60 sec: 5542.5, 300 sec: 5535.5). Total num frames: 1076590592. Throughput: 0: 5831.4. Samples: 1076594188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:31,895][25689] Avg episode reward: [(0, '2.308')] [2022-07-11 05:18:32,096][26022] Updated weights on worker 0-0, policy_version 1051359 (0.00089) [2022-07-11 05:18:33,882][26022] Updated weights on worker 0-0, policy_version 1051369 (0.00087) [2022-07-11 05:18:35,605][26022] Updated weights on worker 0-0, policy_version 1051379 (0.00087) [2022-07-11 05:18:36,982][25689] Fps is (10 sec: 5478.9, 60 sec: 5496.3, 300 sec: 5538.4). Total num frames: 1076618240. Throughput: 0: 5830.7. Samples: 1076627700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:36,983][25689] Avg episode reward: [(0, '1.888')] [2022-07-11 05:18:37,578][26022] Updated weights on worker 0-0, policy_version 1051389 (0.00096) [2022-07-11 05:18:39,337][26022] Updated weights on worker 0-0, policy_version 1051399 (0.00081) [2022-07-11 05:18:41,453][26022] Updated weights on worker 0-0, policy_version 1051409 (0.00087) [2022-07-11 05:18:42,002][25689] Fps is (10 sec: 5470.6, 60 sec: 5530.6, 300 sec: 5536.0). Total num frames: 1076645888. Throughput: 0: 4982.2. Samples: 1076644444. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:42,003][25689] Avg episode reward: [(0, '2.082')] [2022-07-11 05:18:42,927][26022] Updated weights on worker 0-0, policy_version 1051419 (0.00605) [2022-07-11 05:18:45,061][26022] Updated weights on worker 0-0, policy_version 1051429 (0.00085) [2022-07-11 05:18:46,531][26022] Updated weights on worker 0-0, policy_version 1051439 (0.00089) [2022-07-11 05:18:47,051][25689] Fps is (10 sec: 5695.4, 60 sec: 5528.7, 300 sec: 5539.0). Total num frames: 1076675584. Throughput: 0: 5807.6. Samples: 1076678062. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:47,051][25689] Avg episode reward: [(0, '1.769')] [2022-07-11 05:18:48,694][26022] Updated weights on worker 0-0, policy_version 1051449 (0.00090) [2022-07-11 05:18:50,522][26022] Updated weights on worker 0-0, policy_version 1051459 (0.00538) [2022-07-11 05:18:52,105][25689] Fps is (10 sec: 5676.1, 60 sec: 5524.3, 300 sec: 5538.9). Total num frames: 1076703232. Throughput: 0: 5789.0. Samples: 1076711188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:52,105][25689] Avg episode reward: [(0, '1.353')] [2022-07-11 05:18:52,250][26022] Updated weights on worker 0-0, policy_version 1051469 (0.00093) [2022-07-11 05:18:54,222][26022] Updated weights on worker 0-0, policy_version 1051479 (0.00086) [2022-07-11 05:18:55,925][26022] Updated weights on worker 0-0, policy_version 1051489 (0.00093) [2022-07-11 05:18:57,280][25689] Fps is (10 sec: 5405.6, 60 sec: 5497.3, 300 sec: 5532.8). Total num frames: 1076730880. Throughput: 0: 4943.9. Samples: 1076728050. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:18:57,281][25689] Avg episode reward: [(0, '1.394')] [2022-07-11 05:18:57,815][26022] Updated weights on worker 0-0, policy_version 1051499 (0.00846) [2022-07-11 05:18:59,628][26022] Updated weights on worker 0-0, policy_version 1051509 (0.00088) [2022-07-11 05:19:01,248][26022] Updated weights on worker 0-0, policy_version 1051519 (0.00086) [2022-07-11 05:19:02,371][25689] Fps is (10 sec: 5386.4, 60 sec: 5526.8, 300 sec: 5542.2). Total num frames: 1076758528. Throughput: 0: 5764.1. Samples: 1076761852. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:02,371][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 05:19:03,495][26022] Updated weights on worker 0-0, policy_version 1051529 (0.00085) [2022-07-11 05:19:05,581][26022] Updated weights on worker 0-0, policy_version 1051539 (0.00085) [2022-07-11 05:19:07,223][26022] Updated weights on worker 0-0, policy_version 1051549 (0.00087) [2022-07-11 05:19:07,426][25689] Fps is (10 sec: 5450.0, 60 sec: 5539.8, 300 sec: 5538.1). Total num frames: 1076786176. Throughput: 0: 5657.8. Samples: 1076793342. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:07,427][25689] Avg episode reward: [(0, '1.542')] [2022-07-11 05:19:09,370][26022] Updated weights on worker 0-0, policy_version 1051559 (0.00087) [2022-07-11 05:19:10,895][26022] Updated weights on worker 0-0, policy_version 1051569 (0.00093) [2022-07-11 05:19:12,452][25689] Fps is (10 sec: 5383.3, 60 sec: 5521.9, 300 sec: 5532.5). Total num frames: 1076812800. Throughput: 0: 4851.1. Samples: 1076809896. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:12,453][25689] Avg episode reward: [(0, '1.745')] [2022-07-11 05:19:12,991][26022] Updated weights on worker 0-0, policy_version 1051579 (0.00084) [2022-07-11 05:19:14,817][26022] Updated weights on worker 0-0, policy_version 1051589 (0.00084) [2022-07-11 05:19:16,553][26022] Updated weights on worker 0-0, policy_version 1051599 (0.00080) [2022-07-11 05:19:17,512][25689] Fps is (10 sec: 5584.0, 60 sec: 5555.1, 300 sec: 5543.1). Total num frames: 1076842496. Throughput: 0: 5702.2. Samples: 1076843416. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:17,514][25689] Avg episode reward: [(0, '1.755')] [2022-07-11 05:19:18,333][26022] Updated weights on worker 0-0, policy_version 1051609 (0.00059) [2022-07-11 05:19:20,128][26022] Updated weights on worker 0-0, policy_version 1051619 (0.00081) [2022-07-11 05:19:22,168][26022] Updated weights on worker 0-0, policy_version 1051629 (0.00083) [2022-07-11 05:19:22,531][25689] Fps is (10 sec: 5689.4, 60 sec: 5554.2, 300 sec: 5536.6). Total num frames: 1076870144. Throughput: 0: 5720.2. Samples: 1076877174. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:22,532][25689] Avg episode reward: [(0, '2.234')] [2022-07-11 05:19:23,680][26022] Updated weights on worker 0-0, policy_version 1051639 (0.00082) [2022-07-11 05:19:25,652][26022] Updated weights on worker 0-0, policy_version 1051649 (0.00088) [2022-07-11 05:19:27,428][26022] Updated weights on worker 0-0, policy_version 1051659 (0.00090) [2022-07-11 05:19:27,558][25689] Fps is (10 sec: 5606.0, 60 sec: 5536.2, 300 sec: 5537.7). Total num frames: 1076898816. Throughput: 0: 4999.4. Samples: 1076893992. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:27,559][25689] Avg episode reward: [(0, '2.014')] [2022-07-11 05:19:29,341][26022] Updated weights on worker 0-0, policy_version 1051669 (0.00086) [2022-07-11 05:19:31,220][26022] Updated weights on worker 0-0, policy_version 1051679 (0.00079) [2022-07-11 05:19:32,576][25689] Fps is (10 sec: 5606.7, 60 sec: 5534.9, 300 sec: 5541.9). Total num frames: 1076926464. Throughput: 0: 5833.3. Samples: 1076927286. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:32,577][25689] Avg episode reward: [(0, '1.754')] [2022-07-11 05:19:33,103][26022] Updated weights on worker 0-0, policy_version 1051689 (0.00082) [2022-07-11 05:19:34,948][26022] Updated weights on worker 0-0, policy_version 1051699 (0.00094) [2022-07-11 05:19:36,870][26022] Updated weights on worker 0-0, policy_version 1051709 (0.00093) [2022-07-11 05:19:37,005][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:19:37,023][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001051710_1076951040.pth [2022-07-11 05:19:37,023][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001049761_1074955264.pth [2022-07-11 05:19:37,722][25689] Fps is (10 sec: 5440.5, 60 sec: 5529.7, 300 sec: 5532.4). Total num frames: 1076954112. Throughput: 0: 5785.0. Samples: 1076960332. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:37,723][25689] Avg episode reward: [(0, '1.661')] [2022-07-11 05:19:38,753][26022] Updated weights on worker 0-0, policy_version 1051719 (0.00092) [2022-07-11 05:19:40,534][26022] Updated weights on worker 0-0, policy_version 1051729 (0.00114) [2022-07-11 05:19:42,467][26022] Updated weights on worker 0-0, policy_version 1051739 (0.00084) [2022-07-11 05:19:42,744][25689] Fps is (10 sec: 5538.9, 60 sec: 5546.3, 300 sec: 5539.1). Total num frames: 1076982784. Throughput: 0: 4935.3. Samples: 1076976932. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:42,745][25689] Avg episode reward: [(0, '1.032')] [2022-07-11 05:19:44,191][26022] Updated weights on worker 0-0, policy_version 1051749 (0.00049) [2022-07-11 05:19:46,185][26022] Updated weights on worker 0-0, policy_version 1051759 (0.00677) [2022-07-11 05:19:47,755][25689] Fps is (10 sec: 5511.7, 60 sec: 5499.2, 300 sec: 5533.0). Total num frames: 1077009408. Throughput: 0: 5734.2. Samples: 1077009802. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:47,755][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 05:19:47,914][26022] Updated weights on worker 0-0, policy_version 1051769 (0.00111) [2022-07-11 05:19:49,995][26022] Updated weights on worker 0-0, policy_version 1051779 (0.00086) [2022-07-11 05:19:51,588][26022] Updated weights on worker 0-0, policy_version 1051789 (0.00110) [2022-07-11 05:19:52,790][25689] Fps is (10 sec: 5402.9, 60 sec: 5500.9, 300 sec: 5533.1). Total num frames: 1077037056. Throughput: 0: 5744.2. Samples: 1077043394. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:52,790][25689] Avg episode reward: [(0, '0.586')] [2022-07-11 05:19:53,557][26022] Updated weights on worker 0-0, policy_version 1051799 (0.00082) [2022-07-11 05:19:55,285][26022] Updated weights on worker 0-0, policy_version 1051809 (0.00086) [2022-07-11 05:19:57,123][26022] Updated weights on worker 0-0, policy_version 1051819 (0.00090) [2022-07-11 05:19:57,863][25689] Fps is (10 sec: 5571.4, 60 sec: 5527.1, 300 sec: 5532.0). Total num frames: 1077065728. Throughput: 0: 4945.5. Samples: 1077059940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:19:57,864][25689] Avg episode reward: [(0, '0.639')] [2022-07-11 05:19:59,083][26022] Updated weights on worker 0-0, policy_version 1051829 (0.00090) [2022-07-11 05:20:00,828][26022] Updated weights on worker 0-0, policy_version 1051839 (0.00085) [2022-07-11 05:20:02,914][25689] Fps is (10 sec: 5360.7, 60 sec: 5496.9, 300 sec: 5525.3). Total num frames: 1077091328. Throughput: 0: 5772.5. Samples: 1077093358. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:02,914][25689] Avg episode reward: [(0, '0.976')] [2022-07-11 05:20:03,106][26022] Updated weights on worker 0-0, policy_version 1051849 (0.00083) [2022-07-11 05:20:04,943][26022] Updated weights on worker 0-0, policy_version 1051859 (0.00082) [2022-07-11 05:20:06,811][26022] Updated weights on worker 0-0, policy_version 1051869 (0.00082) [2022-07-11 05:20:07,944][25689] Fps is (10 sec: 5282.0, 60 sec: 5499.2, 300 sec: 5528.3). Total num frames: 1077118976. Throughput: 0: 5702.3. Samples: 1077124930. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:07,945][25689] Avg episode reward: [(0, '-0.739')] [2022-07-11 05:20:08,632][26022] Updated weights on worker 0-0, policy_version 1051879 (0.00087) [2022-07-11 05:20:10,598][26022] Updated weights on worker 0-0, policy_version 1051889 (0.00089) [2022-07-11 05:20:12,498][26022] Updated weights on worker 0-0, policy_version 1051899 (0.00119) [2022-07-11 05:20:12,953][25689] Fps is (10 sec: 5609.6, 60 sec: 5534.6, 300 sec: 5536.9). Total num frames: 1077147648. Throughput: 0: 4863.0. Samples: 1077141452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:12,954][25689] Avg episode reward: [(0, '-1.130')] [2022-07-11 05:20:14,207][26022] Updated weights on worker 0-0, policy_version 1051909 (0.00088) [2022-07-11 05:20:16,137][26022] Updated weights on worker 0-0, policy_version 1051919 (0.00084) [2022-07-11 05:20:17,910][26022] Updated weights on worker 0-0, policy_version 1051929 (0.00095) [2022-07-11 05:20:18,067][25689] Fps is (10 sec: 5564.0, 60 sec: 5495.9, 300 sec: 5525.1). Total num frames: 1077175296. Throughput: 0: 5663.5. Samples: 1077174360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:18,067][25689] Avg episode reward: [(0, '-1.642')] [2022-07-11 05:20:19,839][26022] Updated weights on worker 0-0, policy_version 1051939 (0.00096) [2022-07-11 05:20:21,439][26022] Updated weights on worker 0-0, policy_version 1051949 (0.00084) [2022-07-11 05:20:23,085][25689] Fps is (10 sec: 5458.0, 60 sec: 5496.0, 300 sec: 5528.9). Total num frames: 1077202944. Throughput: 0: 5681.7. Samples: 1077207964. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:23,086][25689] Avg episode reward: [(0, '-1.550')] [2022-07-11 05:20:23,539][26022] Updated weights on worker 0-0, policy_version 1051959 (0.00089) [2022-07-11 05:20:25,094][26022] Updated weights on worker 0-0, policy_version 1051969 (0.00093) [2022-07-11 05:20:27,304][26022] Updated weights on worker 0-0, policy_version 1051979 (0.00091) [2022-07-11 05:20:28,123][25689] Fps is (10 sec: 5702.2, 60 sec: 5511.9, 300 sec: 5533.2). Total num frames: 1077232640. Throughput: 0: 5768.7. Samples: 1077241334. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:28,125][25689] Avg episode reward: [(0, '-1.853')] [2022-07-11 05:20:28,954][26022] Updated weights on worker 0-0, policy_version 1051989 (0.00082) [2022-07-11 05:20:30,790][26022] Updated weights on worker 0-0, policy_version 1051999 (0.00089) [2022-07-11 05:20:32,884][26022] Updated weights on worker 0-0, policy_version 1052009 (0.00091) [2022-07-11 05:20:33,163][25689] Fps is (10 sec: 5588.3, 60 sec: 5493.0, 300 sec: 5531.2). Total num frames: 1077259264. Throughput: 0: 5761.2. Samples: 1077257882. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:33,163][25689] Avg episode reward: [(0, '-2.755')] [2022-07-11 05:20:34,575][26022] Updated weights on worker 0-0, policy_version 1052019 (0.00096) [2022-07-11 05:20:36,280][26022] Updated weights on worker 0-0, policy_version 1052029 (0.00093) [2022-07-11 05:20:38,241][25689] Fps is (10 sec: 5364.2, 60 sec: 5499.2, 300 sec: 5524.3). Total num frames: 1077286912. Throughput: 0: 5777.5. Samples: 1077290916. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:38,241][25689] Avg episode reward: [(0, '0.300')] [2022-07-11 05:20:38,409][26022] Updated weights on worker 0-0, policy_version 1052039 (0.00091) [2022-07-11 05:20:39,943][26022] Updated weights on worker 0-0, policy_version 1052049 (0.00086) [2022-07-11 05:20:42,085][26022] Updated weights on worker 0-0, policy_version 1052059 (0.00622) [2022-07-11 05:20:43,311][25689] Fps is (10 sec: 5550.0, 60 sec: 5494.8, 300 sec: 5530.1). Total num frames: 1077315584. Throughput: 0: 5761.8. Samples: 1077324502. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:43,311][25689] Avg episode reward: [(0, '0.844')] [2022-07-11 05:20:43,623][26022] Updated weights on worker 0-0, policy_version 1052069 (0.00096) [2022-07-11 05:20:45,684][26022] Updated weights on worker 0-0, policy_version 1052079 (0.00085) [2022-07-11 05:20:47,484][26022] Updated weights on worker 0-0, policy_version 1052089 (0.00088) [2022-07-11 05:20:48,320][25689] Fps is (10 sec: 5688.9, 60 sec: 5528.7, 300 sec: 5530.7). Total num frames: 1077344256. Throughput: 0: 4933.8. Samples: 1077340984. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:48,321][25689] Avg episode reward: [(0, '-0.111')] [2022-07-11 05:20:49,413][26022] Updated weights on worker 0-0, policy_version 1052099 (0.00092) [2022-07-11 05:20:51,169][26022] Updated weights on worker 0-0, policy_version 1052109 (0.00088) [2022-07-11 05:20:53,099][26022] Updated weights on worker 0-0, policy_version 1052119 (0.00083) [2022-07-11 05:20:53,357][25689] Fps is (10 sec: 5606.3, 60 sec: 5528.6, 300 sec: 5524.6). Total num frames: 1077371904. Throughput: 0: 5774.2. Samples: 1077374484. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:53,357][25689] Avg episode reward: [(0, '-1.013')] [2022-07-11 05:20:54,820][26022] Updated weights on worker 0-0, policy_version 1052129 (0.00092) [2022-07-11 05:20:56,763][26022] Updated weights on worker 0-0, policy_version 1052139 (0.00084) [2022-07-11 05:20:58,407][25689] Fps is (10 sec: 5482.3, 60 sec: 5513.8, 300 sec: 5524.7). Total num frames: 1077399552. Throughput: 0: 5819.3. Samples: 1077408268. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:20:58,407][25689] Avg episode reward: [(0, '-1.033')] [2022-07-11 05:20:58,467][26022] Updated weights on worker 0-0, policy_version 1052149 (0.00083) [2022-07-11 05:21:00,267][26022] Updated weights on worker 0-0, policy_version 1052159 (0.00091) [2022-07-11 05:21:02,485][26022] Updated weights on worker 0-0, policy_version 1052169 (0.00086) [2022-07-11 05:21:03,409][25689] Fps is (10 sec: 5398.7, 60 sec: 5535.1, 300 sec: 5528.2). Total num frames: 1077426176. Throughput: 0: 5007.1. Samples: 1077425138. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:03,410][25689] Avg episode reward: [(0, '-1.230')] [2022-07-11 05:21:04,226][26022] Updated weights on worker 0-0, policy_version 1052179 (0.00085) [2022-07-11 05:21:06,006][26022] Updated weights on worker 0-0, policy_version 1052189 (0.00085) [2022-07-11 05:21:08,136][26022] Updated weights on worker 0-0, policy_version 1052199 (0.00095) [2022-07-11 05:21:08,439][25689] Fps is (10 sec: 5409.4, 60 sec: 5535.1, 300 sec: 5524.9). Total num frames: 1077453824. Throughput: 0: 5756.1. Samples: 1077456792. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:08,442][25689] Avg episode reward: [(0, '-0.710')] [2022-07-11 05:21:09,640][26022] Updated weights on worker 0-0, policy_version 1052209 (0.00085) [2022-07-11 05:21:11,777][26022] Updated weights on worker 0-0, policy_version 1052219 (0.00086) [2022-07-11 05:21:13,294][26022] Updated weights on worker 0-0, policy_version 1052229 (0.00081) [2022-07-11 05:21:13,457][25689] Fps is (10 sec: 5707.4, 60 sec: 5551.3, 300 sec: 5533.9). Total num frames: 1077483520. Throughput: 0: 5746.6. Samples: 1077489992. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:13,457][25689] Avg episode reward: [(0, '-0.404')] [2022-07-11 05:21:15,462][26022] Updated weights on worker 0-0, policy_version 1052239 (0.00095) [2022-07-11 05:21:17,118][26022] Updated weights on worker 0-0, policy_version 1052249 (0.00093) [2022-07-11 05:21:18,559][25689] Fps is (10 sec: 5363.0, 60 sec: 5501.5, 300 sec: 5521.9). Total num frames: 1077508096. Throughput: 0: 4873.6. Samples: 1077506484. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:18,561][25689] Avg episode reward: [(0, '1.562')] [2022-07-11 05:21:19,119][26022] Updated weights on worker 0-0, policy_version 1052259 (0.00090) [2022-07-11 05:21:20,975][26022] Updated weights on worker 0-0, policy_version 1052269 (0.00083) [2022-07-11 05:21:22,610][26022] Updated weights on worker 0-0, policy_version 1052279 (0.00091) [2022-07-11 05:21:23,598][25689] Fps is (10 sec: 5351.8, 60 sec: 5533.5, 300 sec: 5524.8). Total num frames: 1077537792. Throughput: 0: 5699.5. Samples: 1077540200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:23,598][25689] Avg episode reward: [(0, '1.478')] [2022-07-11 05:21:24,401][26022] Updated weights on worker 0-0, policy_version 1052289 (0.00085) [2022-07-11 05:21:26,468][26022] Updated weights on worker 0-0, policy_version 1052299 (0.00086) [2022-07-11 05:21:28,105][26022] Updated weights on worker 0-0, policy_version 1052309 (0.00088) [2022-07-11 05:21:28,679][25689] Fps is (10 sec: 5868.6, 60 sec: 5529.5, 300 sec: 5530.8). Total num frames: 1077567488. Throughput: 0: 5784.0. Samples: 1077573860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:28,680][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 05:21:30,240][26022] Updated weights on worker 0-0, policy_version 1052319 (0.00082) [2022-07-11 05:21:31,763][26022] Updated weights on worker 0-0, policy_version 1052329 (0.00085) [2022-07-11 05:21:33,696][25689] Fps is (10 sec: 5576.9, 60 sec: 5531.6, 300 sec: 5528.1). Total num frames: 1077594112. Throughput: 0: 4972.2. Samples: 1077590628. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:33,698][25689] Avg episode reward: [(0, '1.262')] [2022-07-11 05:21:33,760][26022] Updated weights on worker 0-0, policy_version 1052339 (0.00089) [2022-07-11 05:21:35,529][26022] Updated weights on worker 0-0, policy_version 1052349 (0.00083) [2022-07-11 05:21:37,104][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:21:37,124][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001052357_1077613568.pth [2022-07-11 05:21:37,124][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001050411_1075620864.pth [2022-07-11 05:21:37,334][26022] Updated weights on worker 0-0, policy_version 1052359 (0.00084) [2022-07-11 05:21:38,737][25689] Fps is (10 sec: 5498.1, 60 sec: 5552.0, 300 sec: 5529.3). Total num frames: 1077622784. Throughput: 0: 5837.7. Samples: 1077624274. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:38,738][25689] Avg episode reward: [(0, '0.053')] [2022-07-11 05:21:39,085][26022] Updated weights on worker 0-0, policy_version 1052369 (0.00085) [2022-07-11 05:21:40,994][26022] Updated weights on worker 0-0, policy_version 1052379 (0.00627) [2022-07-11 05:21:42,875][26022] Updated weights on worker 0-0, policy_version 1052389 (0.00087) [2022-07-11 05:21:43,806][25689] Fps is (10 sec: 5671.9, 60 sec: 5552.0, 300 sec: 5529.0). Total num frames: 1077651456. Throughput: 0: 5823.5. Samples: 1077657886. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:43,808][25689] Avg episode reward: [(0, '-0.612')] [2022-07-11 05:21:44,704][26022] Updated weights on worker 0-0, policy_version 1052399 (0.00613) [2022-07-11 05:21:46,400][26022] Updated weights on worker 0-0, policy_version 1052409 (0.00080) [2022-07-11 05:21:48,627][26022] Updated weights on worker 0-0, policy_version 1052419 (0.00086) [2022-07-11 05:21:48,814][25689] Fps is (10 sec: 5487.2, 60 sec: 5518.3, 300 sec: 5526.4). Total num frames: 1077678080. Throughput: 0: 5005.7. Samples: 1077674648. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:48,816][25689] Avg episode reward: [(0, '-0.991')] [2022-07-11 05:21:50,075][26022] Updated weights on worker 0-0, policy_version 1052429 (0.00102) [2022-07-11 05:21:52,197][26022] Updated weights on worker 0-0, policy_version 1052439 (0.00086) [2022-07-11 05:21:53,763][26022] Updated weights on worker 0-0, policy_version 1052449 (0.00087) [2022-07-11 05:21:53,823][25689] Fps is (10 sec: 5623.0, 60 sec: 5554.7, 300 sec: 5531.7). Total num frames: 1077707776. Throughput: 0: 5837.0. Samples: 1077708104. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:53,823][25689] Avg episode reward: [(0, '-1.198')] [2022-07-11 05:21:55,961][26022] Updated weights on worker 0-0, policy_version 1052459 (0.00084) [2022-07-11 05:21:57,550][26022] Updated weights on worker 0-0, policy_version 1052469 (0.00095) [2022-07-11 05:21:58,903][25689] Fps is (10 sec: 5582.2, 60 sec: 5535.0, 300 sec: 5524.5). Total num frames: 1077734400. Throughput: 0: 5807.1. Samples: 1077741384. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:21:58,904][25689] Avg episode reward: [(0, '-1.003')] [2022-07-11 05:21:59,463][26022] Updated weights on worker 0-0, policy_version 1052479 (0.00092) [2022-07-11 05:22:01,252][26022] Updated weights on worker 0-0, policy_version 1052489 (0.00093) [2022-07-11 05:22:03,460][26022] Updated weights on worker 0-0, policy_version 1052499 (0.00088) [2022-07-11 05:22:03,974][25689] Fps is (10 sec: 5346.5, 60 sec: 5545.7, 300 sec: 5531.4). Total num frames: 1077762048. Throughput: 0: 4965.1. Samples: 1077758018. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:03,974][25689] Avg episode reward: [(0, '-0.628')] [2022-07-11 05:22:05,356][26022] Updated weights on worker 0-0, policy_version 1052509 (0.00082) [2022-07-11 05:22:07,135][26022] Updated weights on worker 0-0, policy_version 1052519 (0.00085) [2022-07-11 05:22:08,979][26022] Updated weights on worker 0-0, policy_version 1052529 (0.00086) [2022-07-11 05:22:09,072][25689] Fps is (10 sec: 5437.8, 60 sec: 5539.5, 300 sec: 5527.3). Total num frames: 1077789696. Throughput: 0: 5682.1. Samples: 1077789756. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:09,073][25689] Avg episode reward: [(0, '0.301')] [2022-07-11 05:22:10,883][26022] Updated weights on worker 0-0, policy_version 1052539 (0.00089) [2022-07-11 05:22:12,580][26022] Updated weights on worker 0-0, policy_version 1052549 (0.00085) [2022-07-11 05:22:14,088][25689] Fps is (10 sec: 5568.0, 60 sec: 5522.6, 300 sec: 5531.5). Total num frames: 1077818368. Throughput: 0: 5689.5. Samples: 1077823406. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:14,089][25689] Avg episode reward: [(0, '-0.618')] [2022-07-11 05:22:14,431][26022] Updated weights on worker 0-0, policy_version 1052559 (0.00102) [2022-07-11 05:22:16,353][26022] Updated weights on worker 0-0, policy_version 1052569 (0.00089) [2022-07-11 05:22:18,038][26022] Updated weights on worker 0-0, policy_version 1052579 (0.00097) [2022-07-11 05:22:19,122][25689] Fps is (10 sec: 5502.4, 60 sec: 5562.8, 300 sec: 5524.2). Total num frames: 1077844992. Throughput: 0: 4888.5. Samples: 1077840222. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:19,122][25689] Avg episode reward: [(0, '-0.667')] [2022-07-11 05:22:20,007][26022] Updated weights on worker 0-0, policy_version 1052589 (0.00084) [2022-07-11 05:22:21,645][26022] Updated weights on worker 0-0, policy_version 1052599 (0.00829) [2022-07-11 05:22:23,673][26022] Updated weights on worker 0-0, policy_version 1052609 (0.00087) [2022-07-11 05:22:24,135][25689] Fps is (10 sec: 5605.7, 60 sec: 5565.0, 300 sec: 5527.6). Total num frames: 1077874688. Throughput: 0: 5745.0. Samples: 1077873848. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:24,136][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 05:22:25,298][26022] Updated weights on worker 0-0, policy_version 1052619 (0.00115) [2022-07-11 05:22:27,242][26022] Updated weights on worker 0-0, policy_version 1052629 (0.00107) [2022-07-11 05:22:29,045][26022] Updated weights on worker 0-0, policy_version 1052639 (0.00093) [2022-07-11 05:22:29,139][25689] Fps is (10 sec: 5724.5, 60 sec: 5538.4, 300 sec: 5531.8). Total num frames: 1077902336. Throughput: 0: 5856.7. Samples: 1077907282. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:29,139][25689] Avg episode reward: [(0, '-0.575')] [2022-07-11 05:22:30,898][26022] Updated weights on worker 0-0, policy_version 1052649 (0.00088) [2022-07-11 05:22:32,911][26022] Updated weights on worker 0-0, policy_version 1052659 (0.00087) [2022-07-11 05:22:34,149][25689] Fps is (10 sec: 5419.6, 60 sec: 5539.0, 300 sec: 5520.4). Total num frames: 1077928960. Throughput: 0: 5020.9. Samples: 1077924130. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:34,150][25689] Avg episode reward: [(0, '-0.230')] [2022-07-11 05:22:34,450][26022] Updated weights on worker 0-0, policy_version 1052669 (0.00085) [2022-07-11 05:22:36,525][26022] Updated weights on worker 0-0, policy_version 1052679 (0.00089) [2022-07-11 05:22:38,017][26022] Updated weights on worker 0-0, policy_version 1052689 (0.00092) [2022-07-11 05:22:39,247][25689] Fps is (10 sec: 5571.7, 60 sec: 5550.6, 300 sec: 5532.8). Total num frames: 1077958656. Throughput: 0: 5853.2. Samples: 1077958022. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:39,249][25689] Avg episode reward: [(0, '0.539')] [2022-07-11 05:22:40,082][26022] Updated weights on worker 0-0, policy_version 1052699 (0.00093) [2022-07-11 05:22:41,949][26022] Updated weights on worker 0-0, policy_version 1052709 (0.00080) [2022-07-11 05:22:43,591][26022] Updated weights on worker 0-0, policy_version 1052719 (0.00084) [2022-07-11 05:22:44,277][25689] Fps is (10 sec: 5763.1, 60 sec: 5554.3, 300 sec: 5529.3). Total num frames: 1077987328. Throughput: 0: 5856.6. Samples: 1077991812. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:44,279][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 05:22:45,495][26022] Updated weights on worker 0-0, policy_version 1052729 (0.00066) [2022-07-11 05:22:47,319][26022] Updated weights on worker 0-0, policy_version 1052739 (0.00086) [2022-07-11 05:22:49,235][26022] Updated weights on worker 0-0, policy_version 1052749 (0.00085) [2022-07-11 05:22:49,280][25689] Fps is (10 sec: 5613.8, 60 sec: 5571.7, 300 sec: 5529.4). Total num frames: 1078014976. Throughput: 0: 5031.8. Samples: 1078008626. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:49,281][25689] Avg episode reward: [(0, '0.817')] [2022-07-11 05:22:50,977][26022] Updated weights on worker 0-0, policy_version 1052759 (0.00100) [2022-07-11 05:22:52,866][26022] Updated weights on worker 0-0, policy_version 1052769 (0.00090) [2022-07-11 05:22:54,293][25689] Fps is (10 sec: 5623.2, 60 sec: 5554.3, 300 sec: 5530.4). Total num frames: 1078043648. Throughput: 0: 5873.1. Samples: 1078042436. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:54,294][25689] Avg episode reward: [(0, '1.452')] [2022-07-11 05:22:54,726][26022] Updated weights on worker 0-0, policy_version 1052779 (0.00086) [2022-07-11 05:22:56,440][26022] Updated weights on worker 0-0, policy_version 1052789 (0.00087) [2022-07-11 05:22:58,280][26022] Updated weights on worker 0-0, policy_version 1052799 (0.00087) [2022-07-11 05:22:59,350][25689] Fps is (10 sec: 5592.7, 60 sec: 5573.4, 300 sec: 5537.0). Total num frames: 1078071296. Throughput: 0: 5875.4. Samples: 1078076134. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:22:59,350][25689] Avg episode reward: [(0, '1.769')] [2022-07-11 05:23:00,034][26022] Updated weights on worker 0-0, policy_version 1052809 (0.00090) [2022-07-11 05:23:02,218][26022] Updated weights on worker 0-0, policy_version 1052819 (0.00089) [2022-07-11 05:23:04,261][26022] Updated weights on worker 0-0, policy_version 1052829 (0.00090) [2022-07-11 05:23:04,358][25689] Fps is (10 sec: 5290.4, 60 sec: 5545.3, 300 sec: 5533.6). Total num frames: 1078096896. Throughput: 0: 5028.4. Samples: 1078092786. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:04,361][25689] Avg episode reward: [(0, '1.737')] [2022-07-11 05:23:05,900][26022] Updated weights on worker 0-0, policy_version 1052839 (0.00082) [2022-07-11 05:23:07,929][26022] Updated weights on worker 0-0, policy_version 1052849 (0.00087) [2022-07-11 05:23:09,364][25689] Fps is (10 sec: 5521.7, 60 sec: 5587.7, 300 sec: 5540.7). Total num frames: 1078126592. Throughput: 0: 5753.6. Samples: 1078124184. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:09,365][25689] Avg episode reward: [(0, '2.021')] [2022-07-11 05:23:09,636][26022] Updated weights on worker 0-0, policy_version 1052859 (0.00090) [2022-07-11 05:23:11,620][26022] Updated weights on worker 0-0, policy_version 1052869 (0.00091) [2022-07-11 05:23:13,439][26022] Updated weights on worker 0-0, policy_version 1052879 (0.00079) [2022-07-11 05:23:14,377][25689] Fps is (10 sec: 5519.2, 60 sec: 5537.1, 300 sec: 5534.6). Total num frames: 1078152192. Throughput: 0: 5723.5. Samples: 1078157386. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:14,377][25689] Avg episode reward: [(0, '1.537')] [2022-07-11 05:23:15,298][26022] Updated weights on worker 0-0, policy_version 1052889 (0.00095) [2022-07-11 05:23:17,107][26022] Updated weights on worker 0-0, policy_version 1052899 (0.00083) [2022-07-11 05:23:18,901][26022] Updated weights on worker 0-0, policy_version 1052909 (0.00090) [2022-07-11 05:23:19,431][25689] Fps is (10 sec: 5391.3, 60 sec: 5569.2, 300 sec: 5537.2). Total num frames: 1078180864. Throughput: 0: 5695.9. Samples: 1078190512. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:19,431][25689] Avg episode reward: [(0, '1.872')] [2022-07-11 05:23:20,838][26022] Updated weights on worker 0-0, policy_version 1052919 (0.00089) [2022-07-11 05:23:22,662][26022] Updated weights on worker 0-0, policy_version 1052929 (0.00091) [2022-07-11 05:23:24,520][25689] Fps is (10 sec: 5552.7, 60 sec: 5528.3, 300 sec: 5528.9). Total num frames: 1078208512. Throughput: 0: 5682.3. Samples: 1078207350. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:24,520][25689] Avg episode reward: [(0, '1.461')] [2022-07-11 05:23:24,565][26022] Updated weights on worker 0-0, policy_version 1052939 (0.00084) [2022-07-11 05:23:26,424][26022] Updated weights on worker 0-0, policy_version 1052949 (0.00094) [2022-07-11 05:23:28,276][26022] Updated weights on worker 0-0, policy_version 1052959 (0.00641) [2022-07-11 05:23:29,526][25689] Fps is (10 sec: 5477.5, 60 sec: 5528.1, 300 sec: 5528.9). Total num frames: 1078236160. Throughput: 0: 5790.0. Samples: 1078240920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:29,526][25689] Avg episode reward: [(0, '1.139')] [2022-07-11 05:23:29,955][26022] Updated weights on worker 0-0, policy_version 1052969 (0.00086) [2022-07-11 05:23:31,937][26022] Updated weights on worker 0-0, policy_version 1052979 (0.00091) [2022-07-11 05:23:33,647][26022] Updated weights on worker 0-0, policy_version 1052989 (0.00083) [2022-07-11 05:23:34,533][25689] Fps is (10 sec: 5726.9, 60 sec: 5579.3, 300 sec: 5537.3). Total num frames: 1078265856. Throughput: 0: 5799.7. Samples: 1078274286. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:34,533][25689] Avg episode reward: [(0, '1.100')] [2022-07-11 05:23:35,804][26022] Updated weights on worker 0-0, policy_version 1052999 (0.00094) [2022-07-11 05:23:37,236][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:23:37,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001053008_1078280192.pth [2022-07-11 05:23:37,254][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001051062_1076287488.pth [2022-07-11 05:23:37,405][26022] Updated weights on worker 0-0, policy_version 1053009 (0.00082) [2022-07-11 05:23:39,390][26022] Updated weights on worker 0-0, policy_version 1053019 (0.00093) [2022-07-11 05:23:39,638][25689] Fps is (10 sec: 5569.2, 60 sec: 5527.7, 300 sec: 5532.2). Total num frames: 1078292480. Throughput: 0: 4965.9. Samples: 1078290862. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:39,639][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 05:23:40,975][26022] Updated weights on worker 0-0, policy_version 1053029 (0.00090) [2022-07-11 05:23:43,039][26022] Updated weights on worker 0-0, policy_version 1053039 (0.00084) [2022-07-11 05:23:44,649][25689] Fps is (10 sec: 5567.2, 60 sec: 5546.4, 300 sec: 5532.9). Total num frames: 1078322176. Throughput: 0: 5818.8. Samples: 1078324480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:44,650][25689] Avg episode reward: [(0, '0.918')] [2022-07-11 05:23:44,652][26022] Updated weights on worker 0-0, policy_version 1053049 (0.00088) [2022-07-11 05:23:46,688][26022] Updated weights on worker 0-0, policy_version 1053059 (0.00086) [2022-07-11 05:23:48,315][26022] Updated weights on worker 0-0, policy_version 1053069 (0.00088) [2022-07-11 05:23:49,654][25689] Fps is (10 sec: 5623.3, 60 sec: 5529.2, 300 sec: 5530.4). Total num frames: 1078348800. Throughput: 0: 5812.7. Samples: 1078357918. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:49,654][25689] Avg episode reward: [(0, '0.447')] [2022-07-11 05:23:50,327][26022] Updated weights on worker 0-0, policy_version 1053079 (0.00094) [2022-07-11 05:23:52,107][26022] Updated weights on worker 0-0, policy_version 1053089 (0.00094) [2022-07-11 05:23:54,079][26022] Updated weights on worker 0-0, policy_version 1053099 (0.00090) [2022-07-11 05:23:54,704][25689] Fps is (10 sec: 5295.9, 60 sec: 5492.0, 300 sec: 5529.3). Total num frames: 1078375424. Throughput: 0: 4962.3. Samples: 1078374382. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:54,705][25689] Avg episode reward: [(0, '0.942')] [2022-07-11 05:23:55,672][26022] Updated weights on worker 0-0, policy_version 1053109 (0.00086) [2022-07-11 05:23:57,782][26022] Updated weights on worker 0-0, policy_version 1053119 (0.00083) [2022-07-11 05:23:59,423][26022] Updated weights on worker 0-0, policy_version 1053129 (0.00087) [2022-07-11 05:23:59,752][25689] Fps is (10 sec: 5577.2, 60 sec: 5526.7, 300 sec: 5537.0). Total num frames: 1078405120. Throughput: 0: 5813.9. Samples: 1078407800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:23:59,753][25689] Avg episode reward: [(0, '-0.220')] [2022-07-11 05:24:01,488][26022] Updated weights on worker 0-0, policy_version 1053139 (0.00087) [2022-07-11 05:24:03,584][26022] Updated weights on worker 0-0, policy_version 1053149 (0.00089) [2022-07-11 05:24:04,767][25689] Fps is (10 sec: 5495.3, 60 sec: 5526.1, 300 sec: 5530.8). Total num frames: 1078430720. Throughput: 0: 5712.1. Samples: 1078439392. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:24:04,769][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 05:24:05,326][26022] Updated weights on worker 0-0, policy_version 1053159 (0.00646) [2022-07-11 05:24:07,320][26022] Updated weights on worker 0-0, policy_version 1053169 (0.00091) [2022-07-11 05:24:08,963][26022] Updated weights on worker 0-0, policy_version 1053179 (0.00083) [2022-07-11 05:24:09,779][25689] Fps is (10 sec: 5310.3, 60 sec: 5491.6, 300 sec: 5534.5). Total num frames: 1078458368. Throughput: 0: 4869.7. Samples: 1078455924. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:24:09,781][25689] Avg episode reward: [(0, '-0.015')] [2022-07-11 05:24:11,166][26022] Updated weights on worker 0-0, policy_version 1053189 (0.00087) [2022-07-11 05:24:12,654][26022] Updated weights on worker 0-0, policy_version 1053199 (0.00091) [2022-07-11 05:24:14,604][26022] Updated weights on worker 0-0, policy_version 1053209 (0.00088) [2022-07-11 05:24:14,791][25689] Fps is (10 sec: 5618.4, 60 sec: 5542.6, 300 sec: 5532.0). Total num frames: 1078487040. Throughput: 0: 5721.1. Samples: 1078489300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:24:14,793][25689] Avg episode reward: [(0, '-0.627')] [2022-07-11 05:24:16,381][26022] Updated weights on worker 0-0, policy_version 1053219 (0.00072) [2022-07-11 05:24:18,078][26022] Updated weights on worker 0-0, policy_version 1053229 (0.00087) [2022-07-11 05:24:19,892][25689] Fps is (10 sec: 5569.2, 60 sec: 5521.3, 300 sec: 5530.5). Total num frames: 1078514688. Throughput: 0: 5708.4. Samples: 1078522768. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:24:19,894][25689] Avg episode reward: [(0, '-0.488')] [2022-07-11 05:24:20,028][26022] Updated weights on worker 0-0, policy_version 1053239 (0.00078) [2022-07-11 05:24:22,068][26022] Updated weights on worker 0-0, policy_version 1053249 (0.00091) [2022-07-11 05:24:23,844][26022] Updated weights on worker 0-0, policy_version 1053259 (0.00086) [2022-07-11 05:24:24,933][25689] Fps is (10 sec: 5452.0, 60 sec: 5525.7, 300 sec: 5526.8). Total num frames: 1078542336. Throughput: 0: 4961.7. Samples: 1078539452. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2022-07-11 05:24:24,934][25689] Avg episode reward: [(0, '-0.615')] [2022-07-11 05:24:25,734][26022] Updated weights on worker 0-0, policy_version 1053269 (0.00083) [2022-07-11 05:24:27,532][26022] Updated weights on worker 0-0, policy_version 1053279 (0.00090) [2022-07-11 05:24:29,368][26022] Updated weights on worker 0-0, policy_version 1053289 (0.00091) [2022-07-11 05:24:29,960][25689] Fps is (10 sec: 5594.1, 60 sec: 5540.7, 300 sec: 5530.0). Total num frames: 1078571008. Throughput: 0: 5800.6. Samples: 1078572982. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:24:29,960][25689] Avg episode reward: [(0, '0.720')] [2022-07-11 05:24:31,121][26022] Updated weights on worker 0-0, policy_version 1053299 (0.00091) [2022-07-11 05:24:33,033][26022] Updated weights on worker 0-0, policy_version 1053309 (0.00092) [2022-07-11 05:24:34,937][26022] Updated weights on worker 0-0, policy_version 1053319 (0.00089) [2022-07-11 05:24:35,035][25689] Fps is (10 sec: 5575.5, 60 sec: 5500.7, 300 sec: 5531.4). Total num frames: 1078598656. Throughput: 0: 5786.6. Samples: 1078606442. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:24:35,035][25689] Avg episode reward: [(0, '0.604')] [2022-07-11 05:24:36,557][26022] Updated weights on worker 0-0, policy_version 1053329 (0.00055) [2022-07-11 05:24:38,707][26022] Updated weights on worker 0-0, policy_version 1053339 (0.00105) [2022-07-11 05:24:40,089][25689] Fps is (10 sec: 5661.3, 60 sec: 5556.2, 300 sec: 5534.2). Total num frames: 1078628352. Throughput: 0: 4966.0. Samples: 1078623066. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:24:40,089][25689] Avg episode reward: [(0, '0.639')] [2022-07-11 05:24:40,229][26022] Updated weights on worker 0-0, policy_version 1053349 (0.00085) [2022-07-11 05:24:42,279][26022] Updated weights on worker 0-0, policy_version 1053359 (0.00087) [2022-07-11 05:24:43,864][26022] Updated weights on worker 0-0, policy_version 1053369 (0.00090) [2022-07-11 05:24:45,132][25689] Fps is (10 sec: 5678.9, 60 sec: 5519.3, 300 sec: 5537.0). Total num frames: 1078656000. Throughput: 0: 5805.5. Samples: 1078656718. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:24:45,135][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 05:24:45,818][26022] Updated weights on worker 0-0, policy_version 1053379 (0.00090) [2022-07-11 05:24:47,534][26022] Updated weights on worker 0-0, policy_version 1053389 (0.00096) [2022-07-11 05:24:49,543][26022] Updated weights on worker 0-0, policy_version 1053399 (0.00092) [2022-07-11 05:24:50,151][25689] Fps is (10 sec: 5495.7, 60 sec: 5535.0, 300 sec: 5537.3). Total num frames: 1078683648. Throughput: 0: 5822.0. Samples: 1078690532. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:24:50,151][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 05:24:51,399][26022] Updated weights on worker 0-0, policy_version 1053409 (0.00085) [2022-07-11 05:24:53,066][26022] Updated weights on worker 0-0, policy_version 1053419 (0.00092) [2022-07-11 05:24:54,995][26022] Updated weights on worker 0-0, policy_version 1053429 (0.00080) [2022-07-11 05:24:55,178][25689] Fps is (10 sec: 5606.6, 60 sec: 5571.0, 300 sec: 5538.2). Total num frames: 1078712320. Throughput: 0: 5005.2. Samples: 1078707262. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:24:55,178][25689] Avg episode reward: [(0, '-0.113')] [2022-07-11 05:24:56,738][26022] Updated weights on worker 0-0, policy_version 1053439 (0.00088) [2022-07-11 05:24:58,559][26022] Updated weights on worker 0-0, policy_version 1053449 (0.00083) [2022-07-11 05:25:00,262][25689] Fps is (10 sec: 5569.8, 60 sec: 5533.7, 300 sec: 5544.5). Total num frames: 1078739968. Throughput: 0: 5833.3. Samples: 1078740744. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:00,263][25689] Avg episode reward: [(0, '-0.549')] [2022-07-11 05:25:00,533][26022] Updated weights on worker 0-0, policy_version 1053459 (0.00092) [2022-07-11 05:25:02,587][26022] Updated weights on worker 0-0, policy_version 1053469 (0.00087) [2022-07-11 05:25:04,630][26022] Updated weights on worker 0-0, policy_version 1053479 (0.00097) [2022-07-11 05:25:05,279][25689] Fps is (10 sec: 5271.4, 60 sec: 5533.5, 300 sec: 5537.8). Total num frames: 1078765568. Throughput: 0: 5732.3. Samples: 1078772204. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:05,280][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 05:25:06,325][26022] Updated weights on worker 0-0, policy_version 1053489 (0.00093) [2022-07-11 05:25:08,147][26022] Updated weights on worker 0-0, policy_version 1053499 (0.00080) [2022-07-11 05:25:10,158][26022] Updated weights on worker 0-0, policy_version 1053509 (0.00080) [2022-07-11 05:25:10,304][25689] Fps is (10 sec: 5506.4, 60 sec: 5566.2, 300 sec: 5541.0). Total num frames: 1078795264. Throughput: 0: 4893.8. Samples: 1078789160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:10,305][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 05:25:11,694][26022] Updated weights on worker 0-0, policy_version 1053519 (0.00087) [2022-07-11 05:25:13,755][26022] Updated weights on worker 0-0, policy_version 1053529 (0.00098) [2022-07-11 05:25:15,315][25689] Fps is (10 sec: 5713.6, 60 sec: 5549.3, 300 sec: 5542.9). Total num frames: 1078822912. Throughput: 0: 5744.2. Samples: 1078822938. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:15,316][25689] Avg episode reward: [(0, '1.109')] [2022-07-11 05:25:15,550][26022] Updated weights on worker 0-0, policy_version 1053539 (0.00095) [2022-07-11 05:25:17,314][26022] Updated weights on worker 0-0, policy_version 1053549 (0.00090) [2022-07-11 05:25:19,187][26022] Updated weights on worker 0-0, policy_version 1053559 (0.00103) [2022-07-11 05:25:20,431][25689] Fps is (10 sec: 5460.5, 60 sec: 5548.0, 300 sec: 5541.1). Total num frames: 1078850560. Throughput: 0: 5747.2. Samples: 1078856658. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:20,433][25689] Avg episode reward: [(0, '1.105')] [2022-07-11 05:25:20,934][26022] Updated weights on worker 0-0, policy_version 1053569 (0.00087) [2022-07-11 05:25:22,946][26022] Updated weights on worker 0-0, policy_version 1053579 (0.00080) [2022-07-11 05:25:24,660][26022] Updated weights on worker 0-0, policy_version 1053589 (0.00083) [2022-07-11 05:25:25,442][25689] Fps is (10 sec: 5460.2, 60 sec: 5550.7, 300 sec: 5534.7). Total num frames: 1078878208. Throughput: 0: 5008.0. Samples: 1078873182. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:25,443][25689] Avg episode reward: [(0, '0.224')] [2022-07-11 05:25:26,662][26022] Updated weights on worker 0-0, policy_version 1053599 (0.00083) [2022-07-11 05:25:28,270][26022] Updated weights on worker 0-0, policy_version 1053609 (0.00094) [2022-07-11 05:25:30,196][26022] Updated weights on worker 0-0, policy_version 1053619 (0.00085) [2022-07-11 05:25:30,450][25689] Fps is (10 sec: 5519.3, 60 sec: 5535.6, 300 sec: 5538.7). Total num frames: 1078905856. Throughput: 0: 5841.9. Samples: 1078906846. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:30,450][25689] Avg episode reward: [(0, '1.066')] [2022-07-11 05:25:32,129][26022] Updated weights on worker 0-0, policy_version 1053629 (0.00092) [2022-07-11 05:25:33,835][26022] Updated weights on worker 0-0, policy_version 1053639 (0.00087) [2022-07-11 05:25:35,474][25689] Fps is (10 sec: 5614.4, 60 sec: 5557.2, 300 sec: 5543.2). Total num frames: 1078934528. Throughput: 0: 5820.5. Samples: 1078940270. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:35,474][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 05:25:35,911][26022] Updated weights on worker 0-0, policy_version 1053649 (0.00085) [2022-07-11 05:25:37,278][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:25:37,287][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001053658_1078945792.pth [2022-07-11 05:25:37,287][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001051710_1076951040.pth [2022-07-11 05:25:37,393][26022] Updated weights on worker 0-0, policy_version 1053659 (0.00081) [2022-07-11 05:25:39,555][26022] Updated weights on worker 0-0, policy_version 1053669 (0.00083) [2022-07-11 05:25:40,533][25689] Fps is (10 sec: 5686.9, 60 sec: 5539.8, 300 sec: 5543.4). Total num frames: 1078963200. Throughput: 0: 4981.6. Samples: 1078956798. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:40,534][25689] Avg episode reward: [(0, '-0.111')] [2022-07-11 05:25:41,207][26022] Updated weights on worker 0-0, policy_version 1053679 (0.00092) [2022-07-11 05:25:43,151][26022] Updated weights on worker 0-0, policy_version 1053689 (0.00060) [2022-07-11 05:25:44,990][26022] Updated weights on worker 0-0, policy_version 1053699 (0.00089) [2022-07-11 05:25:45,583][25689] Fps is (10 sec: 5571.4, 60 sec: 5539.2, 300 sec: 5539.2). Total num frames: 1078990848. Throughput: 0: 5815.4. Samples: 1078990304. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:45,583][25689] Avg episode reward: [(0, '-0.915')] [2022-07-11 05:25:46,717][26022] Updated weights on worker 0-0, policy_version 1053709 (0.00090) [2022-07-11 05:25:48,732][26022] Updated weights on worker 0-0, policy_version 1053719 (0.00087) [2022-07-11 05:25:50,208][26022] Updated weights on worker 0-0, policy_version 1053729 (0.00090) [2022-07-11 05:25:50,607][25689] Fps is (10 sec: 5590.9, 60 sec: 5555.6, 300 sec: 5542.9). Total num frames: 1079019520. Throughput: 0: 5808.5. Samples: 1079023928. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:50,607][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 05:25:52,377][26022] Updated weights on worker 0-0, policy_version 1053739 (0.00085) [2022-07-11 05:25:53,992][26022] Updated weights on worker 0-0, policy_version 1053749 (0.00084) [2022-07-11 05:25:55,608][25689] Fps is (10 sec: 5515.3, 60 sec: 5524.1, 300 sec: 5540.3). Total num frames: 1079046144. Throughput: 0: 4979.3. Samples: 1079040528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:25:55,609][25689] Avg episode reward: [(0, '0.160')] [2022-07-11 05:25:56,037][26022] Updated weights on worker 0-0, policy_version 1053759 (0.00090) [2022-07-11 05:25:57,793][26022] Updated weights on worker 0-0, policy_version 1053769 (0.00085) [2022-07-11 05:25:59,770][26022] Updated weights on worker 0-0, policy_version 1053779 (0.00095) [2022-07-11 05:26:00,751][25689] Fps is (10 sec: 5451.0, 60 sec: 5535.7, 300 sec: 5544.6). Total num frames: 1079074816. Throughput: 0: 5776.1. Samples: 1079073578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:00,751][25689] Avg episode reward: [(0, '-0.817')] [2022-07-11 05:26:01,648][26022] Updated weights on worker 0-0, policy_version 1053789 (0.00094) [2022-07-11 05:26:03,851][26022] Updated weights on worker 0-0, policy_version 1053799 (0.00092) [2022-07-11 05:26:05,755][26022] Updated weights on worker 0-0, policy_version 1053809 (0.00082) [2022-07-11 05:26:05,789][25689] Fps is (10 sec: 5431.6, 60 sec: 5550.7, 300 sec: 5541.0). Total num frames: 1079101440. Throughput: 0: 5673.7. Samples: 1079104948. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:05,789][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 05:26:07,575][26022] Updated weights on worker 0-0, policy_version 1053819 (0.00096) [2022-07-11 05:26:09,320][26022] Updated weights on worker 0-0, policy_version 1053829 (0.00092) [2022-07-11 05:26:10,840][25689] Fps is (10 sec: 5379.4, 60 sec: 5514.5, 300 sec: 5533.5). Total num frames: 1079129088. Throughput: 0: 5648.3. Samples: 1079138210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:10,840][25689] Avg episode reward: [(0, '-0.847')] [2022-07-11 05:26:11,177][26022] Updated weights on worker 0-0, policy_version 1053839 (0.00086) [2022-07-11 05:26:12,923][26022] Updated weights on worker 0-0, policy_version 1053849 (0.00080) [2022-07-11 05:26:14,824][26022] Updated weights on worker 0-0, policy_version 1053859 (0.00080) [2022-07-11 05:26:15,931][25689] Fps is (10 sec: 5452.1, 60 sec: 5507.2, 300 sec: 5544.0). Total num frames: 1079156736. Throughput: 0: 5630.1. Samples: 1079154946. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:15,932][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 05:26:16,663][26022] Updated weights on worker 0-0, policy_version 1053869 (0.00096) [2022-07-11 05:26:18,588][26022] Updated weights on worker 0-0, policy_version 1053879 (0.00084) [2022-07-11 05:26:20,425][26022] Updated weights on worker 0-0, policy_version 1053889 (0.00086) [2022-07-11 05:26:21,045][25689] Fps is (10 sec: 5518.8, 60 sec: 5524.3, 300 sec: 5539.2). Total num frames: 1079185408. Throughput: 0: 5647.1. Samples: 1079188180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:21,045][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 05:26:22,179][26022] Updated weights on worker 0-0, policy_version 1053899 (0.00090) [2022-07-11 05:26:24,151][26022] Updated weights on worker 0-0, policy_version 1053909 (0.00084) [2022-07-11 05:26:25,815][26022] Updated weights on worker 0-0, policy_version 1053919 (0.00096) [2022-07-11 05:26:26,063][25689] Fps is (10 sec: 5558.7, 60 sec: 5523.7, 300 sec: 5533.5). Total num frames: 1079213056. Throughput: 0: 5753.6. Samples: 1079221596. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:26,063][25689] Avg episode reward: [(0, '0.757')] [2022-07-11 05:26:27,930][26022] Updated weights on worker 0-0, policy_version 1053929 (0.00109) [2022-07-11 05:26:29,548][26022] Updated weights on worker 0-0, policy_version 1053939 (0.00087) [2022-07-11 05:26:31,074][25689] Fps is (10 sec: 5513.7, 60 sec: 5523.4, 300 sec: 5537.0). Total num frames: 1079240704. Throughput: 0: 4954.5. Samples: 1079238460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:31,074][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 05:26:31,539][26022] Updated weights on worker 0-0, policy_version 1053949 (0.00086) [2022-07-11 05:26:33,207][26022] Updated weights on worker 0-0, policy_version 1053959 (0.00082) [2022-07-11 05:26:34,979][26022] Updated weights on worker 0-0, policy_version 1053969 (0.00088) [2022-07-11 05:26:36,092][25689] Fps is (10 sec: 5615.4, 60 sec: 5523.8, 300 sec: 5537.4). Total num frames: 1079269376. Throughput: 0: 5814.5. Samples: 1079272174. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:36,093][25689] Avg episode reward: [(0, '1.201')] [2022-07-11 05:26:36,851][26022] Updated weights on worker 0-0, policy_version 1053979 (0.00896) [2022-07-11 05:26:38,753][26022] Updated weights on worker 0-0, policy_version 1053989 (0.00092) [2022-07-11 05:26:40,617][26022] Updated weights on worker 0-0, policy_version 1053999 (0.00096) [2022-07-11 05:26:41,169][25689] Fps is (10 sec: 5680.4, 60 sec: 5522.3, 300 sec: 5537.3). Total num frames: 1079298048. Throughput: 0: 5836.5. Samples: 1079305632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:41,169][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 05:26:42,428][26022] Updated weights on worker 0-0, policy_version 1054009 (0.00093) [2022-07-11 05:26:44,101][26022] Updated weights on worker 0-0, policy_version 1054019 (0.00093) [2022-07-11 05:26:46,194][25689] Fps is (10 sec: 5575.2, 60 sec: 5524.5, 300 sec: 5540.4). Total num frames: 1079325696. Throughput: 0: 5006.3. Samples: 1079322380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:46,195][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 05:26:46,197][26022] Updated weights on worker 0-0, policy_version 1054029 (0.00090) [2022-07-11 05:26:47,896][26022] Updated weights on worker 0-0, policy_version 1054039 (0.00090) [2022-07-11 05:26:49,816][26022] Updated weights on worker 0-0, policy_version 1054049 (0.00085) [2022-07-11 05:26:51,240][25689] Fps is (10 sec: 5591.9, 60 sec: 5522.5, 300 sec: 5536.3). Total num frames: 1079354368. Throughput: 0: 5842.7. Samples: 1079356288. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:51,241][25689] Avg episode reward: [(0, '1.261')] [2022-07-11 05:26:51,519][26022] Updated weights on worker 0-0, policy_version 1054059 (0.00093) [2022-07-11 05:26:53,366][26022] Updated weights on worker 0-0, policy_version 1054069 (0.00087) [2022-07-11 05:26:55,213][26022] Updated weights on worker 0-0, policy_version 1054079 (0.00084) [2022-07-11 05:26:56,274][25689] Fps is (10 sec: 5587.4, 60 sec: 5536.4, 300 sec: 5540.6). Total num frames: 1079382016. Throughput: 0: 5820.7. Samples: 1079389646. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:26:56,275][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 05:26:56,998][26022] Updated weights on worker 0-0, policy_version 1054089 (0.00094) [2022-07-11 05:26:59,035][26022] Updated weights on worker 0-0, policy_version 1054099 (0.00086) [2022-07-11 05:27:00,754][26022] Updated weights on worker 0-0, policy_version 1054109 (0.00085) [2022-07-11 05:27:01,356][25689] Fps is (10 sec: 5567.8, 60 sec: 5542.0, 300 sec: 5543.8). Total num frames: 1079410688. Throughput: 0: 4982.3. Samples: 1079406210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:01,356][25689] Avg episode reward: [(0, '1.690')] [2022-07-11 05:27:03,020][26022] Updated weights on worker 0-0, policy_version 1054119 (0.00092) [2022-07-11 05:27:04,761][26022] Updated weights on worker 0-0, policy_version 1054129 (0.00086) [2022-07-11 05:27:06,442][25689] Fps is (10 sec: 5337.5, 60 sec: 5520.7, 300 sec: 5537.2). Total num frames: 1079436288. Throughput: 0: 5673.3. Samples: 1079437252. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:06,443][25689] Avg episode reward: [(0, '0.573')] [2022-07-11 05:27:06,836][26022] Updated weights on worker 0-0, policy_version 1054139 (0.00094) [2022-07-11 05:27:08,652][26022] Updated weights on worker 0-0, policy_version 1054149 (0.00081) [2022-07-11 05:27:10,586][26022] Updated weights on worker 0-0, policy_version 1054159 (0.00108) [2022-07-11 05:27:11,471][25689] Fps is (10 sec: 5365.1, 60 sec: 5539.5, 300 sec: 5536.9). Total num frames: 1079464960. Throughput: 0: 5651.9. Samples: 1079470632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:11,472][25689] Avg episode reward: [(0, '0.580')] [2022-07-11 05:27:12,346][26022] Updated weights on worker 0-0, policy_version 1054169 (0.00086) [2022-07-11 05:27:14,221][26022] Updated weights on worker 0-0, policy_version 1054179 (0.00082) [2022-07-11 05:27:15,942][26022] Updated weights on worker 0-0, policy_version 1054189 (0.00088) [2022-07-11 05:27:16,472][25689] Fps is (10 sec: 5615.0, 60 sec: 5547.8, 300 sec: 5541.0). Total num frames: 1079492608. Throughput: 0: 4836.3. Samples: 1079487330. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:16,473][25689] Avg episode reward: [(0, '0.166')] [2022-07-11 05:27:17,799][26022] Updated weights on worker 0-0, policy_version 1054199 (0.00084) [2022-07-11 05:27:19,584][26022] Updated weights on worker 0-0, policy_version 1054209 (0.00091) [2022-07-11 05:27:21,565][25689] Fps is (10 sec: 5478.4, 60 sec: 5532.9, 300 sec: 5532.6). Total num frames: 1079520256. Throughput: 0: 5655.4. Samples: 1079520502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:21,565][25689] Avg episode reward: [(0, '-0.356')] [2022-07-11 05:27:21,568][26022] Updated weights on worker 0-0, policy_version 1054219 (0.00091) [2022-07-11 05:27:23,276][26022] Updated weights on worker 0-0, policy_version 1054229 (0.00084) [2022-07-11 05:27:25,065][26022] Updated weights on worker 0-0, policy_version 1054239 (0.00089) [2022-07-11 05:27:26,637][25689] Fps is (10 sec: 5541.0, 60 sec: 5544.8, 300 sec: 5534.8). Total num frames: 1079548928. Throughput: 0: 5789.2. Samples: 1079554162. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:26,637][25689] Avg episode reward: [(0, '-0.406')] [2022-07-11 05:27:26,914][26022] Updated weights on worker 0-0, policy_version 1054249 (0.00089) [2022-07-11 05:27:28,917][26022] Updated weights on worker 0-0, policy_version 1054259 (0.00091) [2022-07-11 05:27:30,778][26022] Updated weights on worker 0-0, policy_version 1054269 (0.00090) [2022-07-11 05:27:31,650][25689] Fps is (10 sec: 5584.7, 60 sec: 5544.6, 300 sec: 5538.2). Total num frames: 1079576576. Throughput: 0: 4963.5. Samples: 1079570786. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:31,650][25689] Avg episode reward: [(0, '-0.707')] [2022-07-11 05:27:32,749][26022] Updated weights on worker 0-0, policy_version 1054279 (0.00089) [2022-07-11 05:27:34,373][26022] Updated weights on worker 0-0, policy_version 1054289 (0.00082) [2022-07-11 05:27:36,380][26022] Updated weights on worker 0-0, policy_version 1054299 (0.00094) [2022-07-11 05:27:36,741][25689] Fps is (10 sec: 5370.9, 60 sec: 5504.2, 300 sec: 5528.0). Total num frames: 1079603200. Throughput: 0: 5737.5. Samples: 1079603624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:36,742][25689] Avg episode reward: [(0, '0.527')] [2022-07-11 05:27:37,316][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:27:37,338][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001054304_1079607296.pth [2022-07-11 05:27:37,338][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001052357_1077613568.pth [2022-07-11 05:27:38,152][26022] Updated weights on worker 0-0, policy_version 1054309 (0.00090) [2022-07-11 05:27:40,099][26022] Updated weights on worker 0-0, policy_version 1054319 (0.00093) [2022-07-11 05:27:41,767][26022] Updated weights on worker 0-0, policy_version 1054329 (0.00085) [2022-07-11 05:27:41,861][25689] Fps is (10 sec: 5515.4, 60 sec: 5517.2, 300 sec: 5529.8). Total num frames: 1079632896. Throughput: 0: 5745.9. Samples: 1079637122. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:41,861][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 05:27:43,817][26022] Updated weights on worker 0-0, policy_version 1054339 (0.00083) [2022-07-11 05:27:45,219][26022] Updated weights on worker 0-0, policy_version 1054349 (0.00094) [2022-07-11 05:27:46,870][25689] Fps is (10 sec: 5560.5, 60 sec: 5501.8, 300 sec: 5526.2). Total num frames: 1079659520. Throughput: 0: 4925.0. Samples: 1079653812. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:46,870][25689] Avg episode reward: [(0, '1.320')] [2022-07-11 05:27:47,609][26022] Updated weights on worker 0-0, policy_version 1054359 (0.00087) [2022-07-11 05:27:48,980][26022] Updated weights on worker 0-0, policy_version 1054369 (0.00086) [2022-07-11 05:27:51,077][26022] Updated weights on worker 0-0, policy_version 1054379 (0.00084) [2022-07-11 05:27:51,901][25689] Fps is (10 sec: 5609.6, 60 sec: 5520.0, 300 sec: 5529.3). Total num frames: 1079689216. Throughput: 0: 5750.1. Samples: 1079687232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:51,901][25689] Avg episode reward: [(0, '1.617')] [2022-07-11 05:27:52,791][26022] Updated weights on worker 0-0, policy_version 1054389 (0.00083) [2022-07-11 05:27:54,763][26022] Updated weights on worker 0-0, policy_version 1054399 (0.00087) [2022-07-11 05:27:56,758][26022] Updated weights on worker 0-0, policy_version 1054409 (0.00094) [2022-07-11 05:27:56,915][25689] Fps is (10 sec: 5606.9, 60 sec: 5505.0, 300 sec: 5526.7). Total num frames: 1079715840. Throughput: 0: 5790.2. Samples: 1079720432. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:27:56,915][25689] Avg episode reward: [(0, '1.677')] [2022-07-11 05:27:58,348][26022] Updated weights on worker 0-0, policy_version 1054419 (0.00077) [2022-07-11 05:28:00,376][26022] Updated weights on worker 0-0, policy_version 1054429 (0.00085) [2022-07-11 05:28:01,979][25689] Fps is (10 sec: 5384.8, 60 sec: 5489.6, 300 sec: 5532.5). Total num frames: 1079743488. Throughput: 0: 4966.0. Samples: 1079737030. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:28:01,987][25689] Avg episode reward: [(0, '1.702')] [2022-07-11 05:28:02,543][26022] Updated weights on worker 0-0, policy_version 1054439 (0.00083) [2022-07-11 05:28:04,289][26022] Updated weights on worker 0-0, policy_version 1054449 (0.00089) [2022-07-11 05:28:06,170][26022] Updated weights on worker 0-0, policy_version 1054459 (0.00093) [2022-07-11 05:28:06,994][25689] Fps is (10 sec: 5384.5, 60 sec: 5513.0, 300 sec: 5522.0). Total num frames: 1079770112. Throughput: 0: 5710.2. Samples: 1079768724. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:28:06,994][25689] Avg episode reward: [(0, '2.120')] [2022-07-11 05:28:07,860][26022] Updated weights on worker 0-0, policy_version 1054469 (0.00090) [2022-07-11 05:28:09,681][26022] Updated weights on worker 0-0, policy_version 1054479 (0.01138) [2022-07-11 05:28:11,647][26022] Updated weights on worker 0-0, policy_version 1054489 (0.00079) [2022-07-11 05:28:11,998][25689] Fps is (10 sec: 5519.6, 60 sec: 5515.4, 300 sec: 5532.5). Total num frames: 1079798784. Throughput: 0: 5728.1. Samples: 1079802350. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:28:11,998][25689] Avg episode reward: [(0, '2.220')] [2022-07-11 05:28:13,498][26022] Updated weights on worker 0-0, policy_version 1054499 (0.00086) [2022-07-11 05:28:15,296][26022] Updated weights on worker 0-0, policy_version 1054509 (0.00089) [2022-07-11 05:28:17,011][25689] Fps is (10 sec: 5520.4, 60 sec: 5497.4, 300 sec: 5526.4). Total num frames: 1079825408. Throughput: 0: 5753.4. Samples: 1079836052. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:28:17,011][25689] Avg episode reward: [(0, '1.870')] [2022-07-11 05:28:17,144][26022] Updated weights on worker 0-0, policy_version 1054519 (0.00091) [2022-07-11 05:28:19,071][26022] Updated weights on worker 0-0, policy_version 1054529 (0.00084) [2022-07-11 05:28:20,803][26022] Updated weights on worker 0-0, policy_version 1054539 (0.00092) [2022-07-11 05:28:22,079][25689] Fps is (10 sec: 5485.1, 60 sec: 5516.5, 300 sec: 5530.2). Total num frames: 1079854080. Throughput: 0: 5751.2. Samples: 1079852624. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:28:22,079][25689] Avg episode reward: [(0, '1.703')] [2022-07-11 05:28:22,764][26022] Updated weights on worker 0-0, policy_version 1054549 (0.00086) [2022-07-11 05:28:24,581][26022] Updated weights on worker 0-0, policy_version 1054559 (0.00093) [2022-07-11 05:28:26,474][26022] Updated weights on worker 0-0, policy_version 1054569 (0.00426) [2022-07-11 05:28:27,096][25689] Fps is (10 sec: 5685.7, 60 sec: 5521.4, 300 sec: 5533.5). Total num frames: 1079882752. Throughput: 0: 5810.2. Samples: 1079885524. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 05:28:27,097][25689] Avg episode reward: [(0, '1.782')] [2022-07-11 05:28:28,395][26022] Updated weights on worker 0-0, policy_version 1054579 (0.00089) [2022-07-11 05:28:30,051][26022] Updated weights on worker 0-0, policy_version 1054589 (0.00093) [2022-07-11 05:28:32,095][26022] Updated weights on worker 0-0, policy_version 1054599 (0.00085) [2022-07-11 05:28:32,135][25689] Fps is (10 sec: 5498.8, 60 sec: 5502.2, 300 sec: 5522.5). Total num frames: 1079909376. Throughput: 0: 5773.3. Samples: 1079918608. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:28:32,135][25689] Avg episode reward: [(0, '1.790')] [2022-07-11 05:28:33,763][26022] Updated weights on worker 0-0, policy_version 1054609 (0.00089) [2022-07-11 05:28:35,715][26022] Updated weights on worker 0-0, policy_version 1054619 (0.00089) [2022-07-11 05:28:37,138][25689] Fps is (10 sec: 5506.8, 60 sec: 5544.2, 300 sec: 5531.3). Total num frames: 1079938048. Throughput: 0: 4945.7. Samples: 1079935596. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:28:37,138][25689] Avg episode reward: [(0, '1.322')] [2022-07-11 05:28:37,350][26022] Updated weights on worker 0-0, policy_version 1054629 (0.00099) [2022-07-11 05:28:39,530][26022] Updated weights on worker 0-0, policy_version 1054639 (0.00086) [2022-07-11 05:28:41,155][26022] Updated weights on worker 0-0, policy_version 1054649 (0.00088) [2022-07-11 05:28:42,205][25689] Fps is (10 sec: 5491.1, 60 sec: 5498.1, 300 sec: 5520.0). Total num frames: 1079964672. Throughput: 0: 5779.1. Samples: 1079968936. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:28:42,205][25689] Avg episode reward: [(0, '0.950')] [2022-07-11 05:28:42,984][26022] Updated weights on worker 0-0, policy_version 1054659 (0.00086) [2022-07-11 05:28:44,832][26022] Updated weights on worker 0-0, policy_version 1054669 (0.00084) [2022-07-11 05:28:46,743][26022] Updated weights on worker 0-0, policy_version 1054679 (0.00086) [2022-07-11 05:28:47,239][25689] Fps is (10 sec: 5575.4, 60 sec: 5546.6, 300 sec: 5529.7). Total num frames: 1079994368. Throughput: 0: 5828.1. Samples: 1080002920. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:28:47,240][25689] Avg episode reward: [(0, '0.692')] [2022-07-11 05:28:48,406][26022] Updated weights on worker 0-0, policy_version 1054689 (0.00089) [2022-07-11 05:28:50,355][26022] Updated weights on worker 0-0, policy_version 1054699 (0.00095) [2022-07-11 05:28:52,096][26022] Updated weights on worker 0-0, policy_version 1054709 (0.00090) [2022-07-11 05:28:52,281][25689] Fps is (10 sec: 5792.4, 60 sec: 5528.6, 300 sec: 5536.8). Total num frames: 1080023040. Throughput: 0: 5015.4. Samples: 1080019652. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:28:52,282][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 05:28:54,228][26022] Updated weights on worker 0-0, policy_version 1054719 (0.00086) [2022-07-11 05:28:55,802][26022] Updated weights on worker 0-0, policy_version 1054729 (0.00084) [2022-07-11 05:28:57,323][25689] Fps is (10 sec: 5382.3, 60 sec: 5509.2, 300 sec: 5523.1). Total num frames: 1080048640. Throughput: 0: 5787.2. Samples: 1080052410. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:28:57,323][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 05:28:57,896][26022] Updated weights on worker 0-0, policy_version 1054739 (0.00094) [2022-07-11 05:28:59,505][26022] Updated weights on worker 0-0, policy_version 1054749 (0.00083) [2022-07-11 05:29:01,744][26022] Updated weights on worker 0-0, policy_version 1054759 (0.00570) [2022-07-11 05:29:02,361][25689] Fps is (10 sec: 5181.0, 60 sec: 5494.6, 300 sec: 5526.1). Total num frames: 1080075264. Throughput: 0: 5706.4. Samples: 1080083958. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:02,362][25689] Avg episode reward: [(0, '0.575')] [2022-07-11 05:29:03,562][26022] Updated weights on worker 0-0, policy_version 1054769 (0.00091) [2022-07-11 05:29:05,471][26022] Updated weights on worker 0-0, policy_version 1054779 (0.00081) [2022-07-11 05:29:07,362][25689] Fps is (10 sec: 5507.7, 60 sec: 5529.8, 300 sec: 5529.8). Total num frames: 1080103936. Throughput: 0: 4861.2. Samples: 1080100742. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:07,363][25689] Avg episode reward: [(0, '1.109')] [2022-07-11 05:29:07,367][26022] Updated weights on worker 0-0, policy_version 1054789 (0.00094) [2022-07-11 05:29:09,084][26022] Updated weights on worker 0-0, policy_version 1054799 (0.00082) [2022-07-11 05:29:10,958][26022] Updated weights on worker 0-0, policy_version 1054809 (0.00089) [2022-07-11 05:29:12,377][25689] Fps is (10 sec: 5520.9, 60 sec: 5494.8, 300 sec: 5522.8). Total num frames: 1080130560. Throughput: 0: 5701.8. Samples: 1080134234. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:12,378][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 05:29:12,859][26022] Updated weights on worker 0-0, policy_version 1054819 (0.00085) [2022-07-11 05:29:14,805][26022] Updated weights on worker 0-0, policy_version 1054829 (0.00095) [2022-07-11 05:29:16,428][26022] Updated weights on worker 0-0, policy_version 1054839 (0.00086) [2022-07-11 05:29:17,383][25689] Fps is (10 sec: 5416.1, 60 sec: 5512.5, 300 sec: 5524.6). Total num frames: 1080158208. Throughput: 0: 5756.9. Samples: 1080167894. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:17,385][25689] Avg episode reward: [(0, '1.065')] [2022-07-11 05:29:18,286][26022] Updated weights on worker 0-0, policy_version 1054849 (0.00090) [2022-07-11 05:29:20,136][26022] Updated weights on worker 0-0, policy_version 1054859 (0.00087) [2022-07-11 05:29:22,018][26022] Updated weights on worker 0-0, policy_version 1054869 (0.00082) [2022-07-11 05:29:22,430][25689] Fps is (10 sec: 5806.3, 60 sec: 5548.3, 300 sec: 5534.8). Total num frames: 1080188928. Throughput: 0: 5010.5. Samples: 1080184510. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:22,430][25689] Avg episode reward: [(0, '1.088')] [2022-07-11 05:29:23,901][26022] Updated weights on worker 0-0, policy_version 1054879 (0.00096) [2022-07-11 05:29:25,725][26022] Updated weights on worker 0-0, policy_version 1054889 (0.00101) [2022-07-11 05:29:27,439][25689] Fps is (10 sec: 5702.4, 60 sec: 5515.2, 300 sec: 5528.3). Total num frames: 1080215552. Throughput: 0: 5829.9. Samples: 1080217786. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:27,440][25689] Avg episode reward: [(0, '1.263')] [2022-07-11 05:29:27,507][26022] Updated weights on worker 0-0, policy_version 1054899 (0.00087) [2022-07-11 05:29:29,651][26022] Updated weights on worker 0-0, policy_version 1054909 (0.00084) [2022-07-11 05:29:31,091][26022] Updated weights on worker 0-0, policy_version 1054919 (0.01145) [2022-07-11 05:29:32,460][25689] Fps is (10 sec: 5308.9, 60 sec: 5516.8, 300 sec: 5525.8). Total num frames: 1080242176. Throughput: 0: 5835.4. Samples: 1080251424. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:32,460][25689] Avg episode reward: [(0, '-0.172')] [2022-07-11 05:29:33,080][26022] Updated weights on worker 0-0, policy_version 1054929 (0.00089) [2022-07-11 05:29:34,690][26022] Updated weights on worker 0-0, policy_version 1054939 (0.00091) [2022-07-11 05:29:36,654][26022] Updated weights on worker 0-0, policy_version 1054949 (0.00084) [2022-07-11 05:29:37,462][25689] Fps is (10 sec: 5517.1, 60 sec: 5516.9, 300 sec: 5523.4). Total num frames: 1080270848. Throughput: 0: 4998.0. Samples: 1080268248. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:37,462][25689] Avg episode reward: [(0, '-0.337')] [2022-07-11 05:29:37,554][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:29:37,580][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001054953_1080271872.pth [2022-07-11 05:29:37,581][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001053008_1078280192.pth [2022-07-11 05:29:38,695][26022] Updated weights on worker 0-0, policy_version 1054959 (0.00088) [2022-07-11 05:29:40,306][26022] Updated weights on worker 0-0, policy_version 1054969 (0.00090) [2022-07-11 05:29:42,297][26022] Updated weights on worker 0-0, policy_version 1054979 (0.00089) [2022-07-11 05:29:42,510][25689] Fps is (10 sec: 5705.7, 60 sec: 5552.6, 300 sec: 5526.7). Total num frames: 1080299520. Throughput: 0: 5836.8. Samples: 1080301714. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:42,511][25689] Avg episode reward: [(0, '0.363')] [2022-07-11 05:29:44,206][26022] Updated weights on worker 0-0, policy_version 1054989 (0.00080) [2022-07-11 05:29:45,883][26022] Updated weights on worker 0-0, policy_version 1054999 (0.00083) [2022-07-11 05:29:47,531][25689] Fps is (10 sec: 5593.4, 60 sec: 5519.9, 300 sec: 5526.7). Total num frames: 1080327168. Throughput: 0: 5822.6. Samples: 1080334772. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:47,531][25689] Avg episode reward: [(0, '-0.269')] [2022-07-11 05:29:47,951][26022] Updated weights on worker 0-0, policy_version 1055009 (0.00090) [2022-07-11 05:29:49,536][26022] Updated weights on worker 0-0, policy_version 1055019 (0.00088) [2022-07-11 05:29:51,687][26022] Updated weights on worker 0-0, policy_version 1055029 (0.00085) [2022-07-11 05:29:52,559][25689] Fps is (10 sec: 5604.6, 60 sec: 5521.2, 300 sec: 5526.6). Total num frames: 1080355840. Throughput: 0: 4984.2. Samples: 1080351602. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:52,559][25689] Avg episode reward: [(0, '-0.240')] [2022-07-11 05:29:53,393][26022] Updated weights on worker 0-0, policy_version 1055039 (0.00093) [2022-07-11 05:29:55,234][26022] Updated weights on worker 0-0, policy_version 1055049 (0.00113) [2022-07-11 05:29:57,003][26022] Updated weights on worker 0-0, policy_version 1055059 (0.00109) [2022-07-11 05:29:57,599][25689] Fps is (10 sec: 5491.9, 60 sec: 5538.2, 300 sec: 5524.0). Total num frames: 1080382464. Throughput: 0: 5804.7. Samples: 1080385142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:29:57,600][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 05:29:58,859][26022] Updated weights on worker 0-0, policy_version 1055069 (0.00095) [2022-07-11 05:30:00,720][26022] Updated weights on worker 0-0, policy_version 1055079 (0.00093) [2022-07-11 05:30:02,742][25689] Fps is (10 sec: 5329.3, 60 sec: 5545.6, 300 sec: 5528.6). Total num frames: 1080410112. Throughput: 0: 5665.5. Samples: 1080416342. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:02,743][25689] Avg episode reward: [(0, '1.303')] [2022-07-11 05:30:02,916][26022] Updated weights on worker 0-0, policy_version 1055089 (0.00089) [2022-07-11 05:30:04,771][26022] Updated weights on worker 0-0, policy_version 1055099 (0.00086) [2022-07-11 05:30:06,607][26022] Updated weights on worker 0-0, policy_version 1055109 (0.00091) [2022-07-11 05:30:07,766][25689] Fps is (10 sec: 5439.2, 60 sec: 5526.6, 300 sec: 5521.7). Total num frames: 1080437760. Throughput: 0: 4861.6. Samples: 1080433146. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:07,767][25689] Avg episode reward: [(0, '0.562')] [2022-07-11 05:30:08,503][26022] Updated weights on worker 0-0, policy_version 1055119 (0.00090) [2022-07-11 05:30:10,286][26022] Updated weights on worker 0-0, policy_version 1055129 (0.00096) [2022-07-11 05:30:12,139][26022] Updated weights on worker 0-0, policy_version 1055139 (0.00090) [2022-07-11 05:30:12,793][25689] Fps is (10 sec: 5502.0, 60 sec: 5542.4, 300 sec: 5521.4). Total num frames: 1080465408. Throughput: 0: 5683.7. Samples: 1080466606. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:12,793][25689] Avg episode reward: [(0, '0.582')] [2022-07-11 05:30:14,110][26022] Updated weights on worker 0-0, policy_version 1055149 (0.00096) [2022-07-11 05:30:15,831][26022] Updated weights on worker 0-0, policy_version 1055159 (0.00098) [2022-07-11 05:30:17,796][25689] Fps is (10 sec: 5411.0, 60 sec: 5525.7, 300 sec: 5520.1). Total num frames: 1080492032. Throughput: 0: 5680.3. Samples: 1080499862. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:17,797][25689] Avg episode reward: [(0, '1.320')] [2022-07-11 05:30:17,904][26022] Updated weights on worker 0-0, policy_version 1055169 (0.00086) [2022-07-11 05:30:19,503][26022] Updated weights on worker 0-0, policy_version 1055179 (0.00090) [2022-07-11 05:30:21,533][26022] Updated weights on worker 0-0, policy_version 1055189 (0.00094) [2022-07-11 05:30:22,869][25689] Fps is (10 sec: 5589.1, 60 sec: 5506.3, 300 sec: 5525.8). Total num frames: 1080521728. Throughput: 0: 4972.4. Samples: 1080516422. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:22,871][25689] Avg episode reward: [(0, '1.497')] [2022-07-11 05:30:23,197][26022] Updated weights on worker 0-0, policy_version 1055199 (0.00085) [2022-07-11 05:30:25,175][26022] Updated weights on worker 0-0, policy_version 1055209 (0.00082) [2022-07-11 05:30:26,875][26022] Updated weights on worker 0-0, policy_version 1055219 (0.00084) [2022-07-11 05:30:27,872][25689] Fps is (10 sec: 5589.3, 60 sec: 5506.9, 300 sec: 5522.4). Total num frames: 1080548352. Throughput: 0: 5781.5. Samples: 1080549390. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:27,872][25689] Avg episode reward: [(0, '-0.085')] [2022-07-11 05:30:28,937][26022] Updated weights on worker 0-0, policy_version 1055229 (0.00090) [2022-07-11 05:30:30,617][26022] Updated weights on worker 0-0, policy_version 1055239 (0.00086) [2022-07-11 05:30:32,569][26022] Updated weights on worker 0-0, policy_version 1055249 (0.00086) [2022-07-11 05:30:32,906][25689] Fps is (10 sec: 5407.2, 60 sec: 5522.6, 300 sec: 5518.8). Total num frames: 1080576000. Throughput: 0: 5774.3. Samples: 1080582748. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:32,907][25689] Avg episode reward: [(0, '-0.158')] [2022-07-11 05:30:34,280][26022] Updated weights on worker 0-0, policy_version 1055259 (0.00089) [2022-07-11 05:30:36,239][26022] Updated weights on worker 0-0, policy_version 1055269 (0.00087) [2022-07-11 05:30:37,931][25689] Fps is (10 sec: 5599.2, 60 sec: 5520.6, 300 sec: 5519.5). Total num frames: 1080604672. Throughput: 0: 4945.6. Samples: 1080599442. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:37,931][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 05:30:37,984][26022] Updated weights on worker 0-0, policy_version 1055279 (0.00081) [2022-07-11 05:30:39,776][26022] Updated weights on worker 0-0, policy_version 1055289 (0.00507) [2022-07-11 05:30:41,772][26022] Updated weights on worker 0-0, policy_version 1055299 (0.00091) [2022-07-11 05:30:43,035][25689] Fps is (10 sec: 5661.4, 60 sec: 5515.5, 300 sec: 5521.9). Total num frames: 1080633344. Throughput: 0: 5782.6. Samples: 1080633032. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:43,036][25689] Avg episode reward: [(0, '-0.219')] [2022-07-11 05:30:43,532][26022] Updated weights on worker 0-0, policy_version 1055309 (0.00087) [2022-07-11 05:30:45,436][26022] Updated weights on worker 0-0, policy_version 1055319 (0.00100) [2022-07-11 05:30:47,207][26022] Updated weights on worker 0-0, policy_version 1055329 (0.00087) [2022-07-11 05:30:48,076][25689] Fps is (10 sec: 5551.3, 60 sec: 5513.6, 300 sec: 5518.1). Total num frames: 1080660992. Throughput: 0: 5802.5. Samples: 1080666624. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:48,076][25689] Avg episode reward: [(0, '-0.288')] [2022-07-11 05:30:48,891][26022] Updated weights on worker 0-0, policy_version 1055339 (0.00474) [2022-07-11 05:30:50,994][26022] Updated weights on worker 0-0, policy_version 1055349 (0.00085) [2022-07-11 05:30:52,636][26022] Updated weights on worker 0-0, policy_version 1055359 (0.00092) [2022-07-11 05:30:53,083][25689] Fps is (10 sec: 5604.9, 60 sec: 5515.5, 300 sec: 5524.9). Total num frames: 1080689664. Throughput: 0: 4991.7. Samples: 1080683466. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:53,084][25689] Avg episode reward: [(0, '-0.538')] [2022-07-11 05:30:54,525][26022] Updated weights on worker 0-0, policy_version 1055369 (0.00089) [2022-07-11 05:30:56,330][26022] Updated weights on worker 0-0, policy_version 1055379 (0.00056) [2022-07-11 05:30:58,027][26022] Updated weights on worker 0-0, policy_version 1055389 (0.00092) [2022-07-11 05:30:58,138][25689] Fps is (10 sec: 5699.0, 60 sec: 5548.1, 300 sec: 5526.5). Total num frames: 1080718336. Throughput: 0: 5822.8. Samples: 1080717106. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:30:58,139][25689] Avg episode reward: [(0, '-1.301')] [2022-07-11 05:31:00,198][26022] Updated weights on worker 0-0, policy_version 1055399 (0.00085) [2022-07-11 05:31:02,168][26022] Updated weights on worker 0-0, policy_version 1055409 (0.00093) [2022-07-11 05:31:03,278][25689] Fps is (10 sec: 5122.5, 60 sec: 5480.6, 300 sec: 5514.3). Total num frames: 1080741888. Throughput: 0: 5710.4. Samples: 1080748630. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:03,279][25689] Avg episode reward: [(0, '-0.142')] [2022-07-11 05:31:04,187][26022] Updated weights on worker 0-0, policy_version 1055419 (0.00092) [2022-07-11 05:31:06,127][26022] Updated weights on worker 0-0, policy_version 1055429 (0.00090) [2022-07-11 05:31:07,875][26022] Updated weights on worker 0-0, policy_version 1055439 (0.00085) [2022-07-11 05:31:08,329][25689] Fps is (10 sec: 5224.8, 60 sec: 5511.9, 300 sec: 5521.2). Total num frames: 1080771584. Throughput: 0: 4852.5. Samples: 1080764904. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:08,330][25689] Avg episode reward: [(0, '-0.412')] [2022-07-11 05:31:09,724][26022] Updated weights on worker 0-0, policy_version 1055449 (0.00080) [2022-07-11 05:31:11,700][26022] Updated weights on worker 0-0, policy_version 1055459 (0.00093) [2022-07-11 05:31:13,255][26022] Updated weights on worker 0-0, policy_version 1055469 (0.00092) [2022-07-11 05:31:13,351][25689] Fps is (10 sec: 5795.0, 60 sec: 5529.4, 300 sec: 5525.9). Total num frames: 1080800256. Throughput: 0: 5666.2. Samples: 1080798304. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:13,351][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 05:31:15,364][26022] Updated weights on worker 0-0, policy_version 1055479 (0.00094) [2022-07-11 05:31:16,960][26022] Updated weights on worker 0-0, policy_version 1055489 (0.00095) [2022-07-11 05:31:18,373][25689] Fps is (10 sec: 5505.5, 60 sec: 5527.6, 300 sec: 5520.7). Total num frames: 1080826880. Throughput: 0: 5661.1. Samples: 1080831660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:18,374][25689] Avg episode reward: [(0, '-0.075')] [2022-07-11 05:31:18,958][26022] Updated weights on worker 0-0, policy_version 1055499 (0.00092) [2022-07-11 05:31:20,649][26022] Updated weights on worker 0-0, policy_version 1055509 (0.00093) [2022-07-11 05:31:22,765][26022] Updated weights on worker 0-0, policy_version 1055519 (0.00085) [2022-07-11 05:31:23,461][25689] Fps is (10 sec: 5469.1, 60 sec: 5509.4, 300 sec: 5522.9). Total num frames: 1080855552. Throughput: 0: 5758.5. Samples: 1080864852. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:23,463][25689] Avg episode reward: [(0, '0.820')] [2022-07-11 05:31:24,331][26022] Updated weights on worker 0-0, policy_version 1055529 (0.00091) [2022-07-11 05:31:26,375][26022] Updated weights on worker 0-0, policy_version 1055539 (0.00087) [2022-07-11 05:31:28,169][26022] Updated weights on worker 0-0, policy_version 1055549 (0.00091) [2022-07-11 05:31:28,500][25689] Fps is (10 sec: 5561.3, 60 sec: 5523.0, 300 sec: 5522.4). Total num frames: 1080883200. Throughput: 0: 5775.0. Samples: 1080881392. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:28,502][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 05:31:30,163][26022] Updated weights on worker 0-0, policy_version 1055559 (0.00083) [2022-07-11 05:31:31,813][26022] Updated weights on worker 0-0, policy_version 1055569 (0.00082) [2022-07-11 05:31:33,509][25689] Fps is (10 sec: 5503.2, 60 sec: 5525.3, 300 sec: 5519.1). Total num frames: 1080910848. Throughput: 0: 5769.3. Samples: 1080914604. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:33,511][25689] Avg episode reward: [(0, '1.193')] [2022-07-11 05:31:33,774][26022] Updated weights on worker 0-0, policy_version 1055579 (0.00085) [2022-07-11 05:31:35,543][26022] Updated weights on worker 0-0, policy_version 1055589 (0.00082) [2022-07-11 05:31:37,559][26022] Updated weights on worker 0-0, policy_version 1055599 (0.00088) [2022-07-11 05:31:37,761][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:31:37,774][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001055600_1080934400.pth [2022-07-11 05:31:37,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001053658_1078945792.pth [2022-07-11 05:31:38,547][25689] Fps is (10 sec: 5503.8, 60 sec: 5507.1, 300 sec: 5516.4). Total num frames: 1080938496. Throughput: 0: 5776.1. Samples: 1080948186. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:38,549][25689] Avg episode reward: [(0, '1.263')] [2022-07-11 05:31:39,172][26022] Updated weights on worker 0-0, policy_version 1055609 (0.00086) [2022-07-11 05:31:41,282][26022] Updated weights on worker 0-0, policy_version 1055619 (0.00093) [2022-07-11 05:31:42,960][26022] Updated weights on worker 0-0, policy_version 1055629 (0.00094) [2022-07-11 05:31:43,637][25689] Fps is (10 sec: 5661.8, 60 sec: 5525.3, 300 sec: 5522.0). Total num frames: 1080968192. Throughput: 0: 4951.9. Samples: 1080964762. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:43,639][25689] Avg episode reward: [(0, '2.101')] [2022-07-11 05:31:44,843][26022] Updated weights on worker 0-0, policy_version 1055639 (0.00095) [2022-07-11 05:31:46,583][26022] Updated weights on worker 0-0, policy_version 1055649 (0.00086) [2022-07-11 05:31:48,498][26022] Updated weights on worker 0-0, policy_version 1055659 (0.00094) [2022-07-11 05:31:48,655][25689] Fps is (10 sec: 5572.0, 60 sec: 5510.6, 300 sec: 5515.7). Total num frames: 1080994816. Throughput: 0: 5779.7. Samples: 1080997880. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:48,655][25689] Avg episode reward: [(0, '1.678')] [2022-07-11 05:31:50,356][26022] Updated weights on worker 0-0, policy_version 1055669 (0.00095) [2022-07-11 05:31:52,310][26022] Updated weights on worker 0-0, policy_version 1055679 (0.00099) [2022-07-11 05:31:53,657][25689] Fps is (10 sec: 5416.8, 60 sec: 5494.2, 300 sec: 5516.3). Total num frames: 1081022464. Throughput: 0: 5791.0. Samples: 1081031278. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:53,657][25689] Avg episode reward: [(0, '1.458')] [2022-07-11 05:31:53,992][26022] Updated weights on worker 0-0, policy_version 1055689 (0.00084) [2022-07-11 05:31:55,941][26022] Updated weights on worker 0-0, policy_version 1055699 (0.00092) [2022-07-11 05:31:57,447][26022] Updated weights on worker 0-0, policy_version 1055709 (0.00091) [2022-07-11 05:31:58,664][25689] Fps is (10 sec: 5626.6, 60 sec: 5498.4, 300 sec: 5517.7). Total num frames: 1081051136. Throughput: 0: 4971.9. Samples: 1081048206. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:31:58,666][25689] Avg episode reward: [(0, '1.357')] [2022-07-11 05:31:59,882][26022] Updated weights on worker 0-0, policy_version 1055719 (0.00093) [2022-07-11 05:32:01,377][26022] Updated weights on worker 0-0, policy_version 1055729 (0.00083) [2022-07-11 05:32:03,623][26022] Updated weights on worker 0-0, policy_version 1055739 (0.00088) [2022-07-11 05:32:03,794][25689] Fps is (10 sec: 5454.7, 60 sec: 5550.2, 300 sec: 5520.3). Total num frames: 1081077760. Throughput: 0: 5692.8. Samples: 1081079510. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:32:03,794][25689] Avg episode reward: [(0, '1.453')] [2022-07-11 05:32:05,284][26022] Updated weights on worker 0-0, policy_version 1055749 (0.00086) [2022-07-11 05:32:07,221][26022] Updated weights on worker 0-0, policy_version 1055759 (0.00086) [2022-07-11 05:32:08,807][25689] Fps is (10 sec: 5350.7, 60 sec: 5519.8, 300 sec: 5517.2). Total num frames: 1081105408. Throughput: 0: 5715.5. Samples: 1081113062. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:32:08,808][25689] Avg episode reward: [(0, '0.833')] [2022-07-11 05:32:09,137][26022] Updated weights on worker 0-0, policy_version 1055769 (0.00086) [2022-07-11 05:32:11,044][26022] Updated weights on worker 0-0, policy_version 1055779 (0.00094) [2022-07-11 05:32:12,634][26022] Updated weights on worker 0-0, policy_version 1055789 (0.00088) [2022-07-11 05:32:13,880][25689] Fps is (10 sec: 5482.6, 60 sec: 5498.2, 300 sec: 5515.8). Total num frames: 1081133056. Throughput: 0: 4862.7. Samples: 1081129616. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:32:13,881][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 05:32:14,746][26022] Updated weights on worker 0-0, policy_version 1055799 (0.00091) [2022-07-11 05:32:16,498][26022] Updated weights on worker 0-0, policy_version 1055809 (0.00091) [2022-07-11 05:32:18,351][26022] Updated weights on worker 0-0, policy_version 1055819 (0.00096) [2022-07-11 05:32:18,892][25689] Fps is (10 sec: 5483.2, 60 sec: 5516.0, 300 sec: 5517.3). Total num frames: 1081160704. Throughput: 0: 5679.9. Samples: 1081163096. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:32:18,893][25689] Avg episode reward: [(0, '0.433')] [2022-07-11 05:32:20,164][26022] Updated weights on worker 0-0, policy_version 1055829 (0.00092) [2022-07-11 05:32:22,070][26022] Updated weights on worker 0-0, policy_version 1055839 (0.00087) [2022-07-11 05:32:23,874][26022] Updated weights on worker 0-0, policy_version 1055849 (0.00083) [2022-07-11 05:32:23,958][25689] Fps is (10 sec: 5689.7, 60 sec: 5534.9, 300 sec: 5520.9). Total num frames: 1081190400. Throughput: 0: 5803.2. Samples: 1081196528. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:32:23,959][25689] Avg episode reward: [(0, '0.986')] [2022-07-11 05:32:25,904][26022] Updated weights on worker 0-0, policy_version 1055859 (0.00093) [2022-07-11 05:32:27,474][26022] Updated weights on worker 0-0, policy_version 1055869 (0.00091) [2022-07-11 05:32:28,977][25689] Fps is (10 sec: 5686.2, 60 sec: 5536.8, 300 sec: 5520.8). Total num frames: 1081218048. Throughput: 0: 4962.3. Samples: 1081213148. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 05:32:28,977][25689] Avg episode reward: [(0, '0.767')] [2022-07-11 05:32:29,578][26022] Updated weights on worker 0-0, policy_version 1055879 (0.00099) [2022-07-11 05:32:31,333][26022] Updated weights on worker 0-0, policy_version 1055889 (0.00086) [2022-07-11 05:32:33,312][26022] Updated weights on worker 0-0, policy_version 1055899 (0.00092) [2022-07-11 05:32:33,985][25689] Fps is (10 sec: 5412.8, 60 sec: 5520.0, 300 sec: 5522.3). Total num frames: 1081244672. Throughput: 0: 5819.8. Samples: 1081246624. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:32:33,985][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 05:32:34,942][26022] Updated weights on worker 0-0, policy_version 1055909 (0.00086) [2022-07-11 05:32:36,712][26022] Updated weights on worker 0-0, policy_version 1055919 (0.00086) [2022-07-11 05:32:38,624][26022] Updated weights on worker 0-0, policy_version 1055929 (0.00091) [2022-07-11 05:32:38,997][25689] Fps is (10 sec: 5518.2, 60 sec: 5539.2, 300 sec: 5520.9). Total num frames: 1081273344. Throughput: 0: 5829.2. Samples: 1081280294. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:32:38,998][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 05:32:40,619][26022] Updated weights on worker 0-0, policy_version 1055939 (0.00099) [2022-07-11 05:32:42,411][26022] Updated weights on worker 0-0, policy_version 1055949 (0.00080) [2022-07-11 05:32:44,082][25689] Fps is (10 sec: 5577.7, 60 sec: 5505.9, 300 sec: 5522.9). Total num frames: 1081300992. Throughput: 0: 4981.0. Samples: 1081296766. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:32:44,083][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 05:32:44,339][26022] Updated weights on worker 0-0, policy_version 1055959 (0.00086) [2022-07-11 05:32:45,977][26022] Updated weights on worker 0-0, policy_version 1055969 (0.00084) [2022-07-11 05:32:48,077][26022] Updated weights on worker 0-0, policy_version 1055979 (0.00080) [2022-07-11 05:32:49,112][25689] Fps is (10 sec: 5568.3, 60 sec: 5538.6, 300 sec: 5519.5). Total num frames: 1081329664. Throughput: 0: 5810.4. Samples: 1081330142. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:32:49,112][25689] Avg episode reward: [(0, '-0.063')] [2022-07-11 05:32:49,451][26022] Updated weights on worker 0-0, policy_version 1055989 (0.00087) [2022-07-11 05:32:51,621][26022] Updated weights on worker 0-0, policy_version 1055999 (0.00102) [2022-07-11 05:32:53,027][26022] Updated weights on worker 0-0, policy_version 1056009 (0.00079) [2022-07-11 05:32:54,135][25689] Fps is (10 sec: 5602.2, 60 sec: 5536.6, 300 sec: 5522.7). Total num frames: 1081357312. Throughput: 0: 5833.3. Samples: 1081364168. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:32:54,136][25689] Avg episode reward: [(0, '-0.670')] [2022-07-11 05:32:55,137][26022] Updated weights on worker 0-0, policy_version 1056019 (0.01471) [2022-07-11 05:32:56,954][26022] Updated weights on worker 0-0, policy_version 1056029 (0.00085) [2022-07-11 05:32:58,834][26022] Updated weights on worker 0-0, policy_version 1056039 (0.00091) [2022-07-11 05:32:59,194][25689] Fps is (10 sec: 5586.1, 60 sec: 5532.0, 300 sec: 5526.3). Total num frames: 1081385984. Throughput: 0: 4983.5. Samples: 1081380944. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:32:59,195][25689] Avg episode reward: [(0, '-0.365')] [2022-07-11 05:33:00,699][26022] Updated weights on worker 0-0, policy_version 1056049 (0.00089) [2022-07-11 05:33:02,935][26022] Updated weights on worker 0-0, policy_version 1056059 (0.00088) [2022-07-11 05:33:04,233][25689] Fps is (10 sec: 5374.7, 60 sec: 5523.3, 300 sec: 5522.4). Total num frames: 1081411584. Throughput: 0: 5719.5. Samples: 1081412020. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:04,234][25689] Avg episode reward: [(0, '-0.366')] [2022-07-11 05:33:04,635][26022] Updated weights on worker 0-0, policy_version 1056069 (0.00082) [2022-07-11 05:33:06,543][26022] Updated weights on worker 0-0, policy_version 1056079 (0.00088) [2022-07-11 05:33:08,462][26022] Updated weights on worker 0-0, policy_version 1056089 (0.00092) [2022-07-11 05:33:09,251][25689] Fps is (10 sec: 5396.4, 60 sec: 5539.8, 300 sec: 5522.1). Total num frames: 1081440256. Throughput: 0: 5744.5. Samples: 1081445832. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:09,252][25689] Avg episode reward: [(0, '-1.179')] [2022-07-11 05:33:10,174][26022] Updated weights on worker 0-0, policy_version 1056099 (0.00087) [2022-07-11 05:33:12,051][26022] Updated weights on worker 0-0, policy_version 1056109 (0.00086) [2022-07-11 05:33:14,051][26022] Updated weights on worker 0-0, policy_version 1056119 (0.00096) [2022-07-11 05:33:14,288][25689] Fps is (10 sec: 5499.3, 60 sec: 5526.1, 300 sec: 5521.7). Total num frames: 1081466880. Throughput: 0: 4888.1. Samples: 1081462676. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:14,289][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 05:33:15,679][26022] Updated weights on worker 0-0, policy_version 1056129 (0.00091) [2022-07-11 05:33:17,713][26022] Updated weights on worker 0-0, policy_version 1056139 (0.00086) [2022-07-11 05:33:19,306][25689] Fps is (10 sec: 5601.4, 60 sec: 5559.5, 300 sec: 5526.1). Total num frames: 1081496576. Throughput: 0: 5712.7. Samples: 1081495836. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:19,306][25689] Avg episode reward: [(0, '-0.076')] [2022-07-11 05:33:19,315][26022] Updated weights on worker 0-0, policy_version 1056149 (0.00093) [2022-07-11 05:33:21,127][26022] Updated weights on worker 0-0, policy_version 1056159 (0.00091) [2022-07-11 05:33:23,055][26022] Updated weights on worker 0-0, policy_version 1056169 (0.00408) [2022-07-11 05:33:24,455][25689] Fps is (10 sec: 5740.8, 60 sec: 5534.9, 300 sec: 5523.6). Total num frames: 1081525248. Throughput: 0: 5816.3. Samples: 1081529640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:24,456][25689] Avg episode reward: [(0, '-0.192')] [2022-07-11 05:33:24,820][26022] Updated weights on worker 0-0, policy_version 1056179 (0.00086) [2022-07-11 05:33:26,818][26022] Updated weights on worker 0-0, policy_version 1056189 (0.00053) [2022-07-11 05:33:28,468][26022] Updated weights on worker 0-0, policy_version 1056199 (0.00090) [2022-07-11 05:33:29,469][25689] Fps is (10 sec: 5440.8, 60 sec: 5518.5, 300 sec: 5524.1). Total num frames: 1081551872. Throughput: 0: 4968.9. Samples: 1081546292. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:29,469][25689] Avg episode reward: [(0, '-1.256')] [2022-07-11 05:33:30,409][26022] Updated weights on worker 0-0, policy_version 1056209 (0.00091) [2022-07-11 05:33:32,391][26022] Updated weights on worker 0-0, policy_version 1056219 (0.00090) [2022-07-11 05:33:33,960][26022] Updated weights on worker 0-0, policy_version 1056229 (0.00089) [2022-07-11 05:33:34,528][25689] Fps is (10 sec: 5489.7, 60 sec: 5547.6, 300 sec: 5523.0). Total num frames: 1081580544. Throughput: 0: 5767.4. Samples: 1081579408. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:34,529][25689] Avg episode reward: [(0, '-0.995')] [2022-07-11 05:33:36,100][26022] Updated weights on worker 0-0, policy_version 1056239 (0.00092) [2022-07-11 05:33:37,756][26022] Updated weights on worker 0-0, policy_version 1056249 (0.00087) [2022-07-11 05:33:38,065][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:33:38,080][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001056250_1081600000.pth [2022-07-11 05:33:38,081][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001054304_1079607296.pth [2022-07-11 05:33:39,559][25689] Fps is (10 sec: 5581.7, 60 sec: 5529.1, 300 sec: 5527.2). Total num frames: 1081608192. Throughput: 0: 5776.9. Samples: 1081612836. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:39,559][25689] Avg episode reward: [(0, '-0.409')] [2022-07-11 05:33:39,596][26022] Updated weights on worker 0-0, policy_version 1056259 (0.00080) [2022-07-11 05:33:41,732][26022] Updated weights on worker 0-0, policy_version 1056269 (0.00082) [2022-07-11 05:33:43,125][26022] Updated weights on worker 0-0, policy_version 1056279 (0.00094) [2022-07-11 05:33:44,672][25689] Fps is (10 sec: 5551.9, 60 sec: 5543.3, 300 sec: 5522.2). Total num frames: 1081636864. Throughput: 0: 5774.2. Samples: 1081646376. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:44,674][25689] Avg episode reward: [(0, '-0.633')] [2022-07-11 05:33:45,411][26022] Updated weights on worker 0-0, policy_version 1056289 (0.00091) [2022-07-11 05:33:46,849][26022] Updated weights on worker 0-0, policy_version 1056299 (0.00086) [2022-07-11 05:33:48,867][26022] Updated weights on worker 0-0, policy_version 1056309 (0.00086) [2022-07-11 05:33:49,717][25689] Fps is (10 sec: 5544.2, 60 sec: 5525.1, 300 sec: 5518.7). Total num frames: 1081664512. Throughput: 0: 5780.0. Samples: 1081663328. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:49,719][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 05:33:50,532][26022] Updated weights on worker 0-0, policy_version 1056319 (0.00093) [2022-07-11 05:33:52,500][26022] Updated weights on worker 0-0, policy_version 1056329 (0.00083) [2022-07-11 05:33:54,479][26022] Updated weights on worker 0-0, policy_version 1056339 (0.00083) [2022-07-11 05:33:54,747][25689] Fps is (10 sec: 5488.9, 60 sec: 5524.5, 300 sec: 5525.9). Total num frames: 1081692160. Throughput: 0: 5806.6. Samples: 1081696808. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:54,748][25689] Avg episode reward: [(0, '0.142')] [2022-07-11 05:33:56,195][26022] Updated weights on worker 0-0, policy_version 1056349 (0.00516) [2022-07-11 05:33:58,151][26022] Updated weights on worker 0-0, policy_version 1056359 (0.00086) [2022-07-11 05:33:59,682][26022] Updated weights on worker 0-0, policy_version 1056369 (0.00083) [2022-07-11 05:33:59,760][25689] Fps is (10 sec: 5710.1, 60 sec: 5545.6, 300 sec: 5536.7). Total num frames: 1081721856. Throughput: 0: 5831.0. Samples: 1081730628. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:33:59,761][25689] Avg episode reward: [(0, '1.177')] [2022-07-11 05:34:02,141][26022] Updated weights on worker 0-0, policy_version 1056379 (0.00086) [2022-07-11 05:34:03,774][26022] Updated weights on worker 0-0, policy_version 1056389 (0.00085) [2022-07-11 05:34:04,835][25689] Fps is (10 sec: 5481.2, 60 sec: 5542.3, 300 sec: 5524.9). Total num frames: 1081747456. Throughput: 0: 4890.9. Samples: 1081744990. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:04,837][25689] Avg episode reward: [(0, '1.886')] [2022-07-11 05:34:05,658][26022] Updated weights on worker 0-0, policy_version 1056399 (0.00091) [2022-07-11 05:34:07,592][26022] Updated weights on worker 0-0, policy_version 1056409 (0.00093) [2022-07-11 05:34:09,473][26022] Updated weights on worker 0-0, policy_version 1056419 (0.00110) [2022-07-11 05:34:09,928][25689] Fps is (10 sec: 5337.3, 60 sec: 5535.4, 300 sec: 5530.4). Total num frames: 1081776128. Throughput: 0: 5704.1. Samples: 1081778614. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:09,930][25689] Avg episode reward: [(0, '0.611')] [2022-07-11 05:34:11,193][26022] Updated weights on worker 0-0, policy_version 1056429 (0.00090) [2022-07-11 05:34:13,166][26022] Updated weights on worker 0-0, policy_version 1056439 (0.00091) [2022-07-11 05:34:14,808][26022] Updated weights on worker 0-0, policy_version 1056449 (0.00087) [2022-07-11 05:34:14,944][25689] Fps is (10 sec: 5672.5, 60 sec: 5571.1, 300 sec: 5533.6). Total num frames: 1081804800. Throughput: 0: 5707.2. Samples: 1081812080. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:14,945][25689] Avg episode reward: [(0, '-0.320')] [2022-07-11 05:34:16,839][26022] Updated weights on worker 0-0, policy_version 1056459 (0.00049) [2022-07-11 05:34:18,543][26022] Updated weights on worker 0-0, policy_version 1056469 (0.00077) [2022-07-11 05:34:19,973][25689] Fps is (10 sec: 5403.0, 60 sec: 5502.5, 300 sec: 5516.8). Total num frames: 1081830400. Throughput: 0: 4852.2. Samples: 1081828706. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:19,974][25689] Avg episode reward: [(0, '-0.236')] [2022-07-11 05:34:20,532][26022] Updated weights on worker 0-0, policy_version 1056479 (0.00078) [2022-07-11 05:34:22,259][26022] Updated weights on worker 0-0, policy_version 1056489 (0.00087) [2022-07-11 05:34:24,205][26022] Updated weights on worker 0-0, policy_version 1056499 (0.00085) [2022-07-11 05:34:25,089][25689] Fps is (10 sec: 5450.5, 60 sec: 5522.5, 300 sec: 5525.1). Total num frames: 1081860096. Throughput: 0: 5782.0. Samples: 1081862100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:25,090][25689] Avg episode reward: [(0, '-1.243')] [2022-07-11 05:34:26,022][26022] Updated weights on worker 0-0, policy_version 1056509 (0.00089) [2022-07-11 05:34:27,839][26022] Updated weights on worker 0-0, policy_version 1056519 (0.00088) [2022-07-11 05:34:29,583][26022] Updated weights on worker 0-0, policy_version 1056529 (0.00087) [2022-07-11 05:34:30,092][25689] Fps is (10 sec: 5565.7, 60 sec: 5523.4, 300 sec: 5525.4). Total num frames: 1081886720. Throughput: 0: 5815.6. Samples: 1081895878. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:30,093][25689] Avg episode reward: [(0, '-2.033')] [2022-07-11 05:34:31,408][26022] Updated weights on worker 0-0, policy_version 1056539 (0.00094) [2022-07-11 05:34:33,511][26022] Updated weights on worker 0-0, policy_version 1056549 (0.00094) [2022-07-11 05:34:34,963][26022] Updated weights on worker 0-0, policy_version 1056559 (0.00086) [2022-07-11 05:34:35,096][25689] Fps is (10 sec: 5628.2, 60 sec: 5545.4, 300 sec: 5528.8). Total num frames: 1081916416. Throughput: 0: 4984.4. Samples: 1081912524. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:35,096][25689] Avg episode reward: [(0, '-2.758')] [2022-07-11 05:34:37,024][26022] Updated weights on worker 0-0, policy_version 1056569 (0.00099) [2022-07-11 05:34:38,711][26022] Updated weights on worker 0-0, policy_version 1056579 (0.00087) [2022-07-11 05:34:40,104][25689] Fps is (10 sec: 5829.9, 60 sec: 5564.4, 300 sec: 5529.6). Total num frames: 1081945088. Throughput: 0: 5851.0. Samples: 1081946492. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:40,104][25689] Avg episode reward: [(0, '-0.991')] [2022-07-11 05:34:40,486][26022] Updated weights on worker 0-0, policy_version 1056589 (0.00089) [2022-07-11 05:34:42,489][26022] Updated weights on worker 0-0, policy_version 1056599 (0.00084) [2022-07-11 05:34:44,066][26022] Updated weights on worker 0-0, policy_version 1056609 (0.00084) [2022-07-11 05:34:45,158][25689] Fps is (10 sec: 5495.5, 60 sec: 5536.0, 300 sec: 5525.5). Total num frames: 1081971712. Throughput: 0: 5882.3. Samples: 1081980150. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:45,158][25689] Avg episode reward: [(0, '-0.714')] [2022-07-11 05:34:46,210][26022] Updated weights on worker 0-0, policy_version 1056619 (0.00092) [2022-07-11 05:34:47,892][26022] Updated weights on worker 0-0, policy_version 1056629 (0.00092) [2022-07-11 05:34:49,773][26022] Updated weights on worker 0-0, policy_version 1056639 (0.00084) [2022-07-11 05:34:50,167][25689] Fps is (10 sec: 5494.7, 60 sec: 5556.2, 300 sec: 5525.9). Total num frames: 1082000384. Throughput: 0: 5022.9. Samples: 1081996714. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:50,168][25689] Avg episode reward: [(0, '-1.562')] [2022-07-11 05:34:51,615][26022] Updated weights on worker 0-0, policy_version 1056649 (0.00084) [2022-07-11 05:34:53,596][26022] Updated weights on worker 0-0, policy_version 1056659 (0.00089) [2022-07-11 05:34:55,171][25689] Fps is (10 sec: 5624.9, 60 sec: 5558.6, 300 sec: 5530.0). Total num frames: 1082028032. Throughput: 0: 5868.1. Samples: 1082030324. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:34:55,171][25689] Avg episode reward: [(0, '0.132')] [2022-07-11 05:34:55,390][26022] Updated weights on worker 0-0, policy_version 1056669 (0.00096) [2022-07-11 05:34:57,300][26022] Updated weights on worker 0-0, policy_version 1056679 (0.00105) [2022-07-11 05:34:58,955][26022] Updated weights on worker 0-0, policy_version 1056689 (0.00096) [2022-07-11 05:35:00,179][25689] Fps is (10 sec: 5420.6, 60 sec: 5508.1, 300 sec: 5529.0). Total num frames: 1082054656. Throughput: 0: 5830.6. Samples: 1082063544. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:00,180][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 05:35:00,911][26022] Updated weights on worker 0-0, policy_version 1056699 (0.00087) [2022-07-11 05:35:02,950][26022] Updated weights on worker 0-0, policy_version 1056709 (0.00093) [2022-07-11 05:35:04,962][26022] Updated weights on worker 0-0, policy_version 1056719 (0.00086) [2022-07-11 05:35:05,221][25689] Fps is (10 sec: 5298.1, 60 sec: 5528.2, 300 sec: 5525.3). Total num frames: 1082081280. Throughput: 0: 4870.4. Samples: 1082077866. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:05,222][25689] Avg episode reward: [(0, '-0.403')] [2022-07-11 05:35:06,758][26022] Updated weights on worker 0-0, policy_version 1056729 (0.00094) [2022-07-11 05:35:08,722][26022] Updated weights on worker 0-0, policy_version 1056739 (0.00087) [2022-07-11 05:35:10,241][25689] Fps is (10 sec: 5394.2, 60 sec: 5517.9, 300 sec: 5525.4). Total num frames: 1082108928. Throughput: 0: 5700.7. Samples: 1082111146. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:10,241][25689] Avg episode reward: [(0, '-0.774')] [2022-07-11 05:35:10,571][26022] Updated weights on worker 0-0, policy_version 1056749 (0.00106) [2022-07-11 05:35:12,250][26022] Updated weights on worker 0-0, policy_version 1056759 (0.00099) [2022-07-11 05:35:14,167][26022] Updated weights on worker 0-0, policy_version 1056769 (0.00089) [2022-07-11 05:35:15,265][25689] Fps is (10 sec: 5505.5, 60 sec: 5500.2, 300 sec: 5528.5). Total num frames: 1082136576. Throughput: 0: 5689.7. Samples: 1082144654. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:15,265][25689] Avg episode reward: [(0, '-0.893')] [2022-07-11 05:35:15,912][26022] Updated weights on worker 0-0, policy_version 1056779 (0.00083) [2022-07-11 05:35:17,930][26022] Updated weights on worker 0-0, policy_version 1056789 (0.00085) [2022-07-11 05:35:19,588][26022] Updated weights on worker 0-0, policy_version 1056799 (0.00090) [2022-07-11 05:35:20,272][25689] Fps is (10 sec: 5614.2, 60 sec: 5553.1, 300 sec: 5526.2). Total num frames: 1082165248. Throughput: 0: 4878.6. Samples: 1082161570. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:20,273][25689] Avg episode reward: [(0, '0.291')] [2022-07-11 05:35:21,602][26022] Updated weights on worker 0-0, policy_version 1056809 (0.00081) [2022-07-11 05:35:23,352][26022] Updated weights on worker 0-0, policy_version 1056819 (0.00088) [2022-07-11 05:35:25,301][26022] Updated weights on worker 0-0, policy_version 1056829 (0.00087) [2022-07-11 05:35:25,328][25689] Fps is (10 sec: 5698.2, 60 sec: 5541.6, 300 sec: 5532.1). Total num frames: 1082193920. Throughput: 0: 5830.9. Samples: 1082195110. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:25,329][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 05:35:27,129][26022] Updated weights on worker 0-0, policy_version 1056839 (0.00090) [2022-07-11 05:35:28,753][26022] Updated weights on worker 0-0, policy_version 1056849 (0.00089) [2022-07-11 05:35:30,348][25689] Fps is (10 sec: 5488.0, 60 sec: 5540.1, 300 sec: 5529.0). Total num frames: 1082220544. Throughput: 0: 5837.3. Samples: 1082228520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:30,349][25689] Avg episode reward: [(0, '0.311')] [2022-07-11 05:35:30,879][26022] Updated weights on worker 0-0, policy_version 1056859 (0.00096) [2022-07-11 05:35:32,322][26022] Updated weights on worker 0-0, policy_version 1056869 (0.00084) [2022-07-11 05:35:34,608][26022] Updated weights on worker 0-0, policy_version 1056879 (0.00523) [2022-07-11 05:35:35,351][25689] Fps is (10 sec: 5415.2, 60 sec: 5506.2, 300 sec: 5525.9). Total num frames: 1082248192. Throughput: 0: 4999.5. Samples: 1082245072. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:35,351][25689] Avg episode reward: [(0, '1.048')] [2022-07-11 05:35:36,064][26022] Updated weights on worker 0-0, policy_version 1056889 (0.00086) [2022-07-11 05:35:38,244][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:35:38,261][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001056899_1082264576.pth [2022-07-11 05:35:38,261][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001054953_1080271872.pth [2022-07-11 05:35:38,268][26022] Updated weights on worker 0-0, policy_version 1056899 (0.00089) [2022-07-11 05:35:39,971][26022] Updated weights on worker 0-0, policy_version 1056909 (0.00094) [2022-07-11 05:35:40,363][25689] Fps is (10 sec: 5623.8, 60 sec: 5505.8, 300 sec: 5527.6). Total num frames: 1082276864. Throughput: 0: 5804.1. Samples: 1082278178. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:40,363][25689] Avg episode reward: [(0, '0.516')] [2022-07-11 05:35:41,930][26022] Updated weights on worker 0-0, policy_version 1056919 (0.00099) [2022-07-11 05:35:43,614][26022] Updated weights on worker 0-0, policy_version 1056929 (0.00090) [2022-07-11 05:35:45,460][25689] Fps is (10 sec: 5571.1, 60 sec: 5518.9, 300 sec: 5526.6). Total num frames: 1082304512. Throughput: 0: 5778.5. Samples: 1082311440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:45,460][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 05:35:45,651][26022] Updated weights on worker 0-0, policy_version 1056939 (0.00091) [2022-07-11 05:35:47,487][26022] Updated weights on worker 0-0, policy_version 1056949 (0.00085) [2022-07-11 05:35:49,305][26022] Updated weights on worker 0-0, policy_version 1056959 (0.00090) [2022-07-11 05:35:50,523][25689] Fps is (10 sec: 5442.7, 60 sec: 5497.1, 300 sec: 5522.1). Total num frames: 1082332160. Throughput: 0: 4936.0. Samples: 1082328100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:50,523][25689] Avg episode reward: [(0, '-0.467')] [2022-07-11 05:35:51,013][26022] Updated weights on worker 0-0, policy_version 1056969 (0.00084) [2022-07-11 05:35:52,975][26022] Updated weights on worker 0-0, policy_version 1056979 (0.00086) [2022-07-11 05:35:54,741][26022] Updated weights on worker 0-0, policy_version 1056989 (0.00085) [2022-07-11 05:35:55,524][25689] Fps is (10 sec: 5494.4, 60 sec: 5497.2, 300 sec: 5519.7). Total num frames: 1082359808. Throughput: 0: 5762.8. Samples: 1082361328. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:35:55,524][25689] Avg episode reward: [(0, '-0.343')] [2022-07-11 05:35:56,595][26022] Updated weights on worker 0-0, policy_version 1056999 (0.00087) [2022-07-11 05:35:58,612][26022] Updated weights on worker 0-0, policy_version 1057009 (0.00089) [2022-07-11 05:36:00,138][26022] Updated weights on worker 0-0, policy_version 1057019 (0.00087) [2022-07-11 05:36:00,603][25689] Fps is (10 sec: 5587.1, 60 sec: 5524.7, 300 sec: 5538.0). Total num frames: 1082388480. Throughput: 0: 5766.3. Samples: 1082394888. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:36:00,604][25689] Avg episode reward: [(0, '-0.005')] [2022-07-11 05:36:02,393][26022] Updated weights on worker 0-0, policy_version 1057029 (0.00098) [2022-07-11 05:36:04,476][26022] Updated weights on worker 0-0, policy_version 1057039 (0.00092) [2022-07-11 05:36:05,737][25689] Fps is (10 sec: 5414.3, 60 sec: 5516.3, 300 sec: 5526.2). Total num frames: 1082415104. Throughput: 0: 4844.5. Samples: 1082409672. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:36:05,737][25689] Avg episode reward: [(0, '0.261')] [2022-07-11 05:36:06,111][26022] Updated weights on worker 0-0, policy_version 1057049 (0.00083) [2022-07-11 05:36:08,191][26022] Updated weights on worker 0-0, policy_version 1057059 (0.00089) [2022-07-11 05:36:09,870][26022] Updated weights on worker 0-0, policy_version 1057069 (0.00095) [2022-07-11 05:36:10,803][25689] Fps is (10 sec: 5320.8, 60 sec: 5512.1, 300 sec: 5521.9). Total num frames: 1082442752. Throughput: 0: 5655.8. Samples: 1082442802. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:36:10,804][25689] Avg episode reward: [(0, '0.514')] [2022-07-11 05:36:11,739][26022] Updated weights on worker 0-0, policy_version 1057079 (0.00078) [2022-07-11 05:36:13,659][26022] Updated weights on worker 0-0, policy_version 1057090 (0.00094) [2022-07-11 05:36:15,504][26022] Updated weights on worker 0-0, policy_version 1057100 (0.00092) [2022-07-11 05:36:15,806][25689] Fps is (10 sec: 5695.3, 60 sec: 5547.9, 300 sec: 5532.6). Total num frames: 1082472448. Throughput: 0: 5687.6. Samples: 1082476682. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:36:15,806][25689] Avg episode reward: [(0, '0.924')] [2022-07-11 05:36:17,381][26022] Updated weights on worker 0-0, policy_version 1057110 (0.00092) [2022-07-11 05:36:19,223][26022] Updated weights on worker 0-0, policy_version 1057120 (0.00092) [2022-07-11 05:36:20,883][25689] Fps is (10 sec: 5688.7, 60 sec: 5524.6, 300 sec: 5529.3). Total num frames: 1082500096. Throughput: 0: 4863.0. Samples: 1082493512. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:36:20,884][25689] Avg episode reward: [(0, '1.163')] [2022-07-11 05:36:21,039][26022] Updated weights on worker 0-0, policy_version 1057130 (0.00087) [2022-07-11 05:36:22,882][26022] Updated weights on worker 0-0, policy_version 1057140 (0.00081) [2022-07-11 05:36:24,699][26022] Updated weights on worker 0-0, policy_version 1057150 (0.00086) [2022-07-11 05:36:25,975][25689] Fps is (10 sec: 5336.9, 60 sec: 5487.6, 300 sec: 5524.9). Total num frames: 1082526720. Throughput: 0: 5798.1. Samples: 1082527012. Policy #0 lag: (min: 0.0, avg: 8.2, max: 18.0) [2022-07-11 05:36:25,975][25689] Avg episode reward: [(0, '1.190')] [2022-07-11 05:36:26,504][26022] Updated weights on worker 0-0, policy_version 1057160 (0.00084) [2022-07-11 05:36:28,402][26022] Updated weights on worker 0-0, policy_version 1057170 (0.00087) [2022-07-11 05:36:30,138][26022] Updated weights on worker 0-0, policy_version 1057180 (0.00092) [2022-07-11 05:36:31,001][25689] Fps is (10 sec: 5566.5, 60 sec: 5537.6, 300 sec: 5531.5). Total num frames: 1082556416. Throughput: 0: 5822.4. Samples: 1082560402. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:36:31,003][25689] Avg episode reward: [(0, '0.751')] [2022-07-11 05:36:31,978][26022] Updated weights on worker 0-0, policy_version 1057190 (0.00086) [2022-07-11 05:36:34,007][26022] Updated weights on worker 0-0, policy_version 1057200 (0.00087) [2022-07-11 05:36:35,784][26022] Updated weights on worker 0-0, policy_version 1057210 (0.00090) [2022-07-11 05:36:36,011][25689] Fps is (10 sec: 5713.4, 60 sec: 5536.9, 300 sec: 5532.0). Total num frames: 1082584064. Throughput: 0: 4973.2. Samples: 1082577170. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:36:36,012][25689] Avg episode reward: [(0, '-0.167')] [2022-07-11 05:36:37,615][26022] Updated weights on worker 0-0, policy_version 1057220 (0.00084) [2022-07-11 05:36:39,439][26022] Updated weights on worker 0-0, policy_version 1057230 (0.00084) [2022-07-11 05:36:41,031][25689] Fps is (10 sec: 5411.2, 60 sec: 5502.5, 300 sec: 5523.0). Total num frames: 1082610688. Throughput: 0: 5808.5. Samples: 1082610538. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:36:41,031][25689] Avg episode reward: [(0, '-0.085')] [2022-07-11 05:36:41,303][26022] Updated weights on worker 0-0, policy_version 1057240 (0.00091) [2022-07-11 05:36:43,142][26022] Updated weights on worker 0-0, policy_version 1057250 (0.00095) [2022-07-11 05:36:45,048][26022] Updated weights on worker 0-0, policy_version 1057260 (0.00089) [2022-07-11 05:36:46,072][25689] Fps is (10 sec: 5496.4, 60 sec: 5524.4, 300 sec: 5529.4). Total num frames: 1082639360. Throughput: 0: 5814.3. Samples: 1082643864. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:36:46,072][25689] Avg episode reward: [(0, '-0.244')] [2022-07-11 05:36:46,746][26022] Updated weights on worker 0-0, policy_version 1057270 (0.00083) [2022-07-11 05:36:48,684][26022] Updated weights on worker 0-0, policy_version 1057280 (0.00089) [2022-07-11 05:36:50,398][26022] Updated weights on worker 0-0, policy_version 1057290 (0.00086) [2022-07-11 05:36:51,081][25689] Fps is (10 sec: 5705.6, 60 sec: 5546.3, 300 sec: 5532.7). Total num frames: 1082668032. Throughput: 0: 5839.9. Samples: 1082677666. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:36:51,083][25689] Avg episode reward: [(0, '-0.357')] [2022-07-11 05:36:52,478][26022] Updated weights on worker 0-0, policy_version 1057300 (0.00093) [2022-07-11 05:36:54,067][26022] Updated weights on worker 0-0, policy_version 1057310 (0.00093) [2022-07-11 05:36:56,027][26022] Updated weights on worker 0-0, policy_version 1057320 (0.00092) [2022-07-11 05:36:56,113][25689] Fps is (10 sec: 5608.7, 60 sec: 5543.4, 300 sec: 5528.8). Total num frames: 1082695680. Throughput: 0: 5846.2. Samples: 1082694690. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:36:56,115][25689] Avg episode reward: [(0, '-0.382')] [2022-07-11 05:36:57,515][26022] Updated weights on worker 0-0, policy_version 1057330 (0.00083) [2022-07-11 05:36:59,748][26022] Updated weights on worker 0-0, policy_version 1057340 (0.00090) [2022-07-11 05:37:01,127][25689] Fps is (10 sec: 5606.3, 60 sec: 5549.4, 300 sec: 5537.9). Total num frames: 1082724352. Throughput: 0: 5820.4. Samples: 1082727506. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:01,129][25689] Avg episode reward: [(0, '-0.885')] [2022-07-11 05:37:01,492][26022] Updated weights on worker 0-0, policy_version 1057350 (0.00092) [2022-07-11 05:37:04,041][26022] Updated weights on worker 0-0, policy_version 1057360 (0.00088) [2022-07-11 05:37:05,735][26022] Updated weights on worker 0-0, policy_version 1057370 (0.00088) [2022-07-11 05:37:06,256][25689] Fps is (10 sec: 5250.2, 60 sec: 5516.1, 300 sec: 5525.4). Total num frames: 1082748928. Throughput: 0: 5651.8. Samples: 1082757938. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:06,256][25689] Avg episode reward: [(0, '0.290')] [2022-07-11 05:37:07,671][26022] Updated weights on worker 0-0, policy_version 1057380 (0.00086) [2022-07-11 05:37:09,457][26022] Updated weights on worker 0-0, policy_version 1057390 (0.00092) [2022-07-11 05:37:11,284][25689] Fps is (10 sec: 5141.6, 60 sec: 5519.5, 300 sec: 5526.2). Total num frames: 1082776576. Throughput: 0: 4788.2. Samples: 1082774402. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:11,284][25689] Avg episode reward: [(0, '0.776')] [2022-07-11 05:37:11,419][26022] Updated weights on worker 0-0, policy_version 1057400 (0.00218) [2022-07-11 05:37:13,258][26022] Updated weights on worker 0-0, policy_version 1057410 (0.00087) [2022-07-11 05:37:15,267][26022] Updated weights on worker 0-0, policy_version 1057420 (0.00085) [2022-07-11 05:37:16,333][25689] Fps is (10 sec: 5486.8, 60 sec: 5481.4, 300 sec: 5525.5). Total num frames: 1082804224. Throughput: 0: 5581.4. Samples: 1082807546. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:16,334][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 05:37:16,906][26022] Updated weights on worker 0-0, policy_version 1057430 (0.00090) [2022-07-11 05:37:18,971][26022] Updated weights on worker 0-0, policy_version 1057440 (0.00082) [2022-07-11 05:37:20,516][26022] Updated weights on worker 0-0, policy_version 1057450 (0.00095) [2022-07-11 05:37:21,347][25689] Fps is (10 sec: 5393.3, 60 sec: 5470.3, 300 sec: 5516.2). Total num frames: 1082830848. Throughput: 0: 5597.7. Samples: 1082840692. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:21,347][25689] Avg episode reward: [(0, '-0.533')] [2022-07-11 05:37:22,595][26022] Updated weights on worker 0-0, policy_version 1057460 (0.00080) [2022-07-11 05:37:24,380][26022] Updated weights on worker 0-0, policy_version 1057470 (0.00091) [2022-07-11 05:37:26,179][26022] Updated weights on worker 0-0, policy_version 1057480 (0.00088) [2022-07-11 05:37:26,414][25689] Fps is (10 sec: 5587.2, 60 sec: 5523.3, 300 sec: 5522.2). Total num frames: 1082860544. Throughput: 0: 4917.8. Samples: 1082857072. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:26,414][25689] Avg episode reward: [(0, '-0.229')] [2022-07-11 05:37:28,132][26022] Updated weights on worker 0-0, policy_version 1057490 (0.00085) [2022-07-11 05:37:30,072][26022] Updated weights on worker 0-0, policy_version 1057500 (0.00091) [2022-07-11 05:37:31,435][25689] Fps is (10 sec: 5582.9, 60 sec: 5473.0, 300 sec: 5521.9). Total num frames: 1082887168. Throughput: 0: 5740.4. Samples: 1082890076. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:31,435][25689] Avg episode reward: [(0, '-0.807')] [2022-07-11 05:37:31,656][26022] Updated weights on worker 0-0, policy_version 1057510 (0.00090) [2022-07-11 05:37:33,852][26022] Updated weights on worker 0-0, policy_version 1057520 (0.00078) [2022-07-11 05:37:35,365][26022] Updated weights on worker 0-0, policy_version 1057530 (0.00086) [2022-07-11 05:37:36,445][25689] Fps is (10 sec: 5410.3, 60 sec: 5473.0, 300 sec: 5518.5). Total num frames: 1082914816. Throughput: 0: 5758.6. Samples: 1082923360. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:36,445][25689] Avg episode reward: [(0, '-0.827')] [2022-07-11 05:37:37,558][26022] Updated weights on worker 0-0, policy_version 1057540 (0.00081) [2022-07-11 05:37:38,335][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:37:38,345][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001057546_1082927104.pth [2022-07-11 05:37:38,346][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001055600_1080934400.pth [2022-07-11 05:37:39,213][26022] Updated weights on worker 0-0, policy_version 1057550 (0.00165) [2022-07-11 05:37:41,169][26022] Updated weights on worker 0-0, policy_version 1057560 (0.00092) [2022-07-11 05:37:41,454][25689] Fps is (10 sec: 5621.0, 60 sec: 5507.8, 300 sec: 5523.4). Total num frames: 1082943488. Throughput: 0: 4949.5. Samples: 1082940214. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:41,455][25689] Avg episode reward: [(0, '-1.050')] [2022-07-11 05:37:42,816][26022] Updated weights on worker 0-0, policy_version 1057570 (0.00091) [2022-07-11 05:37:44,814][26022] Updated weights on worker 0-0, policy_version 1057580 (0.00086) [2022-07-11 05:37:46,470][26022] Updated weights on worker 0-0, policy_version 1057590 (0.00095) [2022-07-11 05:37:46,512][25689] Fps is (10 sec: 5696.1, 60 sec: 5506.3, 300 sec: 5522.8). Total num frames: 1082972160. Throughput: 0: 5812.2. Samples: 1082973888. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:46,512][25689] Avg episode reward: [(0, '-0.295')] [2022-07-11 05:37:48,413][26022] Updated weights on worker 0-0, policy_version 1057600 (0.00086) [2022-07-11 05:37:50,051][26022] Updated weights on worker 0-0, policy_version 1057610 (0.00095) [2022-07-11 05:37:51,514][25689] Fps is (10 sec: 5496.9, 60 sec: 5473.1, 300 sec: 5519.8). Total num frames: 1082998784. Throughput: 0: 5848.4. Samples: 1083007506. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:51,516][25689] Avg episode reward: [(0, '-0.224')] [2022-07-11 05:37:51,987][26022] Updated weights on worker 0-0, policy_version 1057620 (0.00091) [2022-07-11 05:37:53,939][26022] Updated weights on worker 0-0, policy_version 1057630 (0.00088) [2022-07-11 05:37:55,835][26022] Updated weights on worker 0-0, policy_version 1057640 (0.00109) [2022-07-11 05:37:56,519][25689] Fps is (10 sec: 5525.6, 60 sec: 5492.4, 300 sec: 5520.8). Total num frames: 1083027456. Throughput: 0: 5017.9. Samples: 1083024092. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:37:56,521][25689] Avg episode reward: [(0, '-0.443')] [2022-07-11 05:37:57,655][26022] Updated weights on worker 0-0, policy_version 1057650 (0.00084) [2022-07-11 05:37:59,432][26022] Updated weights on worker 0-0, policy_version 1057660 (0.00089) [2022-07-11 05:38:01,315][26022] Updated weights on worker 0-0, policy_version 1057670 (0.00088) [2022-07-11 05:38:01,523][25689] Fps is (10 sec: 5626.9, 60 sec: 5476.4, 300 sec: 5528.3). Total num frames: 1083055104. Throughput: 0: 5838.2. Samples: 1083057378. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:01,523][25689] Avg episode reward: [(0, '0.483')] [2022-07-11 05:38:03,610][26022] Updated weights on worker 0-0, policy_version 1057680 (0.00080) [2022-07-11 05:38:05,397][26022] Updated weights on worker 0-0, policy_version 1057690 (0.00082) [2022-07-11 05:38:06,639][25689] Fps is (10 sec: 5261.6, 60 sec: 5494.4, 300 sec: 5516.2). Total num frames: 1083080704. Throughput: 0: 5698.0. Samples: 1083088574. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:06,640][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 05:38:07,214][26022] Updated weights on worker 0-0, policy_version 1057700 (0.00082) [2022-07-11 05:38:09,042][26022] Updated weights on worker 0-0, policy_version 1057710 (0.00085) [2022-07-11 05:38:11,077][26022] Updated weights on worker 0-0, policy_version 1057720 (0.00081) [2022-07-11 05:38:11,643][25689] Fps is (10 sec: 5362.7, 60 sec: 5513.7, 300 sec: 5523.7). Total num frames: 1083109376. Throughput: 0: 4853.8. Samples: 1083105210. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:11,643][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 05:38:12,711][26022] Updated weights on worker 0-0, policy_version 1057730 (0.00087) [2022-07-11 05:38:14,592][26022] Updated weights on worker 0-0, policy_version 1057740 (0.00087) [2022-07-11 05:38:16,152][26022] Updated weights on worker 0-0, policy_version 1057750 (0.00082) [2022-07-11 05:38:16,650][25689] Fps is (10 sec: 5728.6, 60 sec: 5534.6, 300 sec: 5520.4). Total num frames: 1083138048. Throughput: 0: 5710.9. Samples: 1083139054. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:16,651][25689] Avg episode reward: [(0, '0.875')] [2022-07-11 05:38:18,199][26022] Updated weights on worker 0-0, policy_version 1057760 (0.00095) [2022-07-11 05:38:19,925][26022] Updated weights on worker 0-0, policy_version 1057770 (0.00101) [2022-07-11 05:38:21,676][25689] Fps is (10 sec: 5511.3, 60 sec: 5533.4, 300 sec: 5515.8). Total num frames: 1083164672. Throughput: 0: 5720.4. Samples: 1083172664. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:21,678][25689] Avg episode reward: [(0, '1.780')] [2022-07-11 05:38:21,819][26022] Updated weights on worker 0-0, policy_version 1057780 (0.00094) [2022-07-11 05:38:23,651][26022] Updated weights on worker 0-0, policy_version 1057790 (0.00090) [2022-07-11 05:38:25,644][26022] Updated weights on worker 0-0, policy_version 1057800 (0.00086) [2022-07-11 05:38:26,762][25689] Fps is (10 sec: 5467.9, 60 sec: 5514.6, 300 sec: 5521.4). Total num frames: 1083193344. Throughput: 0: 5015.6. Samples: 1083189502. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:26,764][25689] Avg episode reward: [(0, '1.763')] [2022-07-11 05:38:27,363][26022] Updated weights on worker 0-0, policy_version 1057810 (0.00088) [2022-07-11 05:38:29,364][26022] Updated weights on worker 0-0, policy_version 1057820 (0.00095) [2022-07-11 05:38:30,980][26022] Updated weights on worker 0-0, policy_version 1057830 (0.00078) [2022-07-11 05:38:31,775][25689] Fps is (10 sec: 5577.1, 60 sec: 5532.4, 300 sec: 5518.8). Total num frames: 1083220992. Throughput: 0: 5832.5. Samples: 1083222628. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:31,776][25689] Avg episode reward: [(0, '1.706')] [2022-07-11 05:38:32,863][26022] Updated weights on worker 0-0, policy_version 1057840 (0.00084) [2022-07-11 05:38:34,603][26022] Updated weights on worker 0-0, policy_version 1057850 (0.00084) [2022-07-11 05:38:36,636][26022] Updated weights on worker 0-0, policy_version 1057860 (0.00087) [2022-07-11 05:38:36,787][25689] Fps is (10 sec: 5618.3, 60 sec: 5549.2, 300 sec: 5522.6). Total num frames: 1083249664. Throughput: 0: 5811.4. Samples: 1083256080. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:36,789][25689] Avg episode reward: [(0, '1.611')] [2022-07-11 05:38:38,473][26022] Updated weights on worker 0-0, policy_version 1057870 (0.00087) [2022-07-11 05:38:40,279][26022] Updated weights on worker 0-0, policy_version 1057880 (0.00095) [2022-07-11 05:38:41,797][25689] Fps is (10 sec: 5619.4, 60 sec: 5532.1, 300 sec: 5521.1). Total num frames: 1083277312. Throughput: 0: 4969.3. Samples: 1083272650. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:41,799][25689] Avg episode reward: [(0, '1.492')] [2022-07-11 05:38:42,164][26022] Updated weights on worker 0-0, policy_version 1057890 (0.00092) [2022-07-11 05:38:43,920][26022] Updated weights on worker 0-0, policy_version 1057900 (0.00089) [2022-07-11 05:38:45,932][26022] Updated weights on worker 0-0, policy_version 1057910 (0.00085) [2022-07-11 05:38:46,894][25689] Fps is (10 sec: 5572.0, 60 sec: 5528.5, 300 sec: 5523.5). Total num frames: 1083305984. Throughput: 0: 5792.6. Samples: 1083306120. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:46,896][25689] Avg episode reward: [(0, '1.249')] [2022-07-11 05:38:47,811][26022] Updated weights on worker 0-0, policy_version 1057920 (0.00086) [2022-07-11 05:38:49,482][26022] Updated weights on worker 0-0, policy_version 1057930 (0.00087) [2022-07-11 05:38:51,615][26022] Updated weights on worker 0-0, policy_version 1057940 (0.00079) [2022-07-11 05:38:51,991][25689] Fps is (10 sec: 5424.5, 60 sec: 5519.8, 300 sec: 5518.9). Total num frames: 1083332608. Throughput: 0: 5760.9. Samples: 1083339090. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:51,991][25689] Avg episode reward: [(0, '1.189')] [2022-07-11 05:38:53,071][26022] Updated weights on worker 0-0, policy_version 1057950 (0.00088) [2022-07-11 05:38:55,159][26022] Updated weights on worker 0-0, policy_version 1057960 (0.00087) [2022-07-11 05:38:56,686][26022] Updated weights on worker 0-0, policy_version 1057970 (0.00092) [2022-07-11 05:38:57,038][25689] Fps is (10 sec: 5451.1, 60 sec: 5516.0, 300 sec: 5514.8). Total num frames: 1083361280. Throughput: 0: 4932.8. Samples: 1083355978. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:38:57,040][25689] Avg episode reward: [(0, '0.344')] [2022-07-11 05:38:58,768][26022] Updated weights on worker 0-0, policy_version 1057980 (0.00099) [2022-07-11 05:39:00,426][26022] Updated weights on worker 0-0, policy_version 1057990 (0.00095) [2022-07-11 05:39:02,067][25689] Fps is (10 sec: 5487.9, 60 sec: 5496.8, 300 sec: 5519.1). Total num frames: 1083387904. Throughput: 0: 5769.5. Samples: 1083389596. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:02,067][25689] Avg episode reward: [(0, '0.417')] [2022-07-11 05:39:02,768][26022] Updated weights on worker 0-0, policy_version 1058000 (0.00087) [2022-07-11 05:39:04,532][26022] Updated weights on worker 0-0, policy_version 1058010 (0.00084) [2022-07-11 05:39:06,483][26022] Updated weights on worker 0-0, policy_version 1058020 (0.00085) [2022-07-11 05:39:07,163][25689] Fps is (10 sec: 5461.8, 60 sec: 5549.5, 300 sec: 5519.0). Total num frames: 1083416576. Throughput: 0: 5671.5. Samples: 1083421070. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:07,163][25689] Avg episode reward: [(0, '0.619')] [2022-07-11 05:39:08,218][26022] Updated weights on worker 0-0, policy_version 1058030 (0.00091) [2022-07-11 05:39:10,326][26022] Updated weights on worker 0-0, policy_version 1058040 (0.00085) [2022-07-11 05:39:11,828][26022] Updated weights on worker 0-0, policy_version 1058050 (0.00083) [2022-07-11 05:39:12,207][25689] Fps is (10 sec: 5655.1, 60 sec: 5545.7, 300 sec: 5518.5). Total num frames: 1083445248. Throughput: 0: 4875.0. Samples: 1083437644. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:12,207][25689] Avg episode reward: [(0, '-0.003')] [2022-07-11 05:39:14,066][26022] Updated weights on worker 0-0, policy_version 1058060 (0.00091) [2022-07-11 05:39:15,267][26022] Updated weights on worker 0-0, policy_version 1058070 (0.00091) [2022-07-11 05:39:17,249][25689] Fps is (10 sec: 5380.5, 60 sec: 5491.8, 300 sec: 5518.3). Total num frames: 1083470848. Throughput: 0: 5698.2. Samples: 1083471144. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:17,251][25689] Avg episode reward: [(0, '-0.326')] [2022-07-11 05:39:17,614][26022] Updated weights on worker 0-0, policy_version 1058080 (0.00088) [2022-07-11 05:39:19,210][26022] Updated weights on worker 0-0, policy_version 1058090 (0.00095) [2022-07-11 05:39:21,259][26022] Updated weights on worker 0-0, policy_version 1058100 (0.00087) [2022-07-11 05:39:22,273][25689] Fps is (10 sec: 5594.8, 60 sec: 5559.6, 300 sec: 5523.4). Total num frames: 1083501568. Throughput: 0: 5697.6. Samples: 1083504726. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:22,275][25689] Avg episode reward: [(0, '0.649')] [2022-07-11 05:39:22,909][26022] Updated weights on worker 0-0, policy_version 1058110 (0.00088) [2022-07-11 05:39:24,871][26022] Updated weights on worker 0-0, policy_version 1058120 (0.00087) [2022-07-11 05:39:26,488][26022] Updated weights on worker 0-0, policy_version 1058130 (0.00087) [2022-07-11 05:39:27,326][25689] Fps is (10 sec: 5792.1, 60 sec: 5545.7, 300 sec: 5525.9). Total num frames: 1083529216. Throughput: 0: 4996.4. Samples: 1083521818. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:27,327][25689] Avg episode reward: [(0, '0.447')] [2022-07-11 05:39:28,652][26022] Updated weights on worker 0-0, policy_version 1058140 (0.00081) [2022-07-11 05:39:30,331][26022] Updated weights on worker 0-0, policy_version 1058150 (0.00091) [2022-07-11 05:39:32,148][26022] Updated weights on worker 0-0, policy_version 1058160 (0.00085) [2022-07-11 05:39:32,335][25689] Fps is (10 sec: 5496.0, 60 sec: 5546.1, 300 sec: 5519.0). Total num frames: 1083556864. Throughput: 0: 5844.8. Samples: 1083555286. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:32,335][25689] Avg episode reward: [(0, '0.608')] [2022-07-11 05:39:33,719][26022] Updated weights on worker 0-0, policy_version 1058170 (0.00082) [2022-07-11 05:39:35,876][26022] Updated weights on worker 0-0, policy_version 1058180 (0.00086) [2022-07-11 05:39:37,347][25689] Fps is (10 sec: 5518.3, 60 sec: 5529.2, 300 sec: 5515.4). Total num frames: 1083584512. Throughput: 0: 5879.5. Samples: 1083589308. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:37,348][25689] Avg episode reward: [(0, '1.489')] [2022-07-11 05:39:37,597][26022] Updated weights on worker 0-0, policy_version 1058190 (0.00088) [2022-07-11 05:39:38,366][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:39:38,377][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001058195_1083591680.pth [2022-07-11 05:39:38,377][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001056250_1081600000.pth [2022-07-11 05:39:39,451][26022] Updated weights on worker 0-0, policy_version 1058200 (0.00096) [2022-07-11 05:39:41,144][26022] Updated weights on worker 0-0, policy_version 1058210 (0.00081) [2022-07-11 05:39:42,361][25689] Fps is (10 sec: 5515.2, 60 sec: 5528.9, 300 sec: 5519.6). Total num frames: 1083612160. Throughput: 0: 5881.8. Samples: 1083622876. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:42,361][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 05:39:43,246][26022] Updated weights on worker 0-0, policy_version 1058220 (0.00081) [2022-07-11 05:39:44,819][26022] Updated weights on worker 0-0, policy_version 1058230 (0.00090) [2022-07-11 05:39:46,681][26022] Updated weights on worker 0-0, policy_version 1058240 (0.00313) [2022-07-11 05:39:47,454][25689] Fps is (10 sec: 5673.7, 60 sec: 5546.1, 300 sec: 5521.5). Total num frames: 1083641856. Throughput: 0: 5857.1. Samples: 1083639706. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:47,454][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 05:39:48,462][26022] Updated weights on worker 0-0, policy_version 1058250 (0.00091) [2022-07-11 05:39:50,352][26022] Updated weights on worker 0-0, policy_version 1058260 (0.00093) [2022-07-11 05:39:52,059][26022] Updated weights on worker 0-0, policy_version 1058270 (0.00083) [2022-07-11 05:39:52,458][25689] Fps is (10 sec: 5780.7, 60 sec: 5588.5, 300 sec: 5524.9). Total num frames: 1083670528. Throughput: 0: 5888.0. Samples: 1083673772. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:52,458][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 05:39:54,121][26022] Updated weights on worker 0-0, policy_version 1058280 (0.00081) [2022-07-11 05:39:55,502][26022] Updated weights on worker 0-0, policy_version 1058290 (0.00091) [2022-07-11 05:39:57,516][25689] Fps is (10 sec: 5495.6, 60 sec: 5553.7, 300 sec: 5524.0). Total num frames: 1083697152. Throughput: 0: 5881.4. Samples: 1083707930. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:39:57,516][25689] Avg episode reward: [(0, '0.712')] [2022-07-11 05:39:57,663][26022] Updated weights on worker 0-0, policy_version 1058300 (0.00086) [2022-07-11 05:39:59,103][26022] Updated weights on worker 0-0, policy_version 1058310 (0.00079) [2022-07-11 05:40:01,195][26022] Updated weights on worker 0-0, policy_version 1058320 (0.00086) [2022-07-11 05:40:02,547][25689] Fps is (10 sec: 5480.9, 60 sec: 5587.3, 300 sec: 5531.1). Total num frames: 1083725824. Throughput: 0: 5066.6. Samples: 1083725150. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:40:02,548][25689] Avg episode reward: [(0, '0.547')] [2022-07-11 05:40:03,259][26022] Updated weights on worker 0-0, policy_version 1058330 (0.00087) [2022-07-11 05:40:05,023][26022] Updated weights on worker 0-0, policy_version 1058340 (0.00089) [2022-07-11 05:40:06,831][26022] Updated weights on worker 0-0, policy_version 1058350 (0.00086) [2022-07-11 05:40:07,670][25689] Fps is (10 sec: 5546.8, 60 sec: 5567.9, 300 sec: 5529.2). Total num frames: 1083753472. Throughput: 0: 5806.8. Samples: 1083757094. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:40:07,670][25689] Avg episode reward: [(0, '1.420')] [2022-07-11 05:40:08,628][26022] Updated weights on worker 0-0, policy_version 1058360 (0.00092) [2022-07-11 05:40:10,575][26022] Updated weights on worker 0-0, policy_version 1058370 (0.00089) [2022-07-11 05:40:12,266][26022] Updated weights on worker 0-0, policy_version 1058380 (0.00089) [2022-07-11 05:40:12,701][25689] Fps is (10 sec: 5647.3, 60 sec: 5586.0, 300 sec: 5536.0). Total num frames: 1083783168. Throughput: 0: 5796.9. Samples: 1083791118. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:40:12,702][25689] Avg episode reward: [(0, '1.222')] [2022-07-11 05:40:14,082][26022] Updated weights on worker 0-0, policy_version 1058390 (0.00080) [2022-07-11 05:40:15,806][26022] Updated weights on worker 0-0, policy_version 1058400 (0.00094) [2022-07-11 05:40:17,658][26022] Updated weights on worker 0-0, policy_version 1058410 (0.00100) [2022-07-11 05:40:17,715][25689] Fps is (10 sec: 5810.3, 60 sec: 5639.4, 300 sec: 5535.8). Total num frames: 1083811840. Throughput: 0: 4971.4. Samples: 1083808348. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:40:17,717][25689] Avg episode reward: [(0, '1.784')] [2022-07-11 05:40:19,487][26022] Updated weights on worker 0-0, policy_version 1058420 (0.00092) [2022-07-11 05:40:21,205][26022] Updated weights on worker 0-0, policy_version 1058430 (0.00101) [2022-07-11 05:40:22,750][25689] Fps is (10 sec: 5604.4, 60 sec: 5587.6, 300 sec: 5532.8). Total num frames: 1083839488. Throughput: 0: 5812.9. Samples: 1083842590. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:40:22,751][25689] Avg episode reward: [(0, '0.601')] [2022-07-11 05:40:23,115][26022] Updated weights on worker 0-0, policy_version 1058440 (0.00089) [2022-07-11 05:40:24,994][26022] Updated weights on worker 0-0, policy_version 1058450 (0.00091) [2022-07-11 05:40:26,777][26022] Updated weights on worker 0-0, policy_version 1058460 (0.00087) [2022-07-11 05:40:27,809][25689] Fps is (10 sec: 5680.8, 60 sec: 5620.9, 300 sec: 5542.4). Total num frames: 1083869184. Throughput: 0: 5939.0. Samples: 1083876704. Policy #0 lag: (min: 0.0, avg: 7.4, max: 17.0) [2022-07-11 05:40:27,811][25689] Avg episode reward: [(0, '-0.388')] [2022-07-11 05:40:28,563][26022] Updated weights on worker 0-0, policy_version 1058470 (0.00088) [2022-07-11 05:40:30,200][26022] Updated weights on worker 0-0, policy_version 1058480 (0.00081) [2022-07-11 05:40:32,110][26022] Updated weights on worker 0-0, policy_version 1058490 (0.00094) [2022-07-11 05:40:32,819][25689] Fps is (10 sec: 5797.1, 60 sec: 5637.7, 300 sec: 5545.7). Total num frames: 1083897856. Throughput: 0: 5098.4. Samples: 1083893686. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:40:32,819][25689] Avg episode reward: [(0, '-0.376')] [2022-07-11 05:40:33,990][26022] Updated weights on worker 0-0, policy_version 1058500 (0.00087) [2022-07-11 05:40:35,677][26022] Updated weights on worker 0-0, policy_version 1058510 (0.00085) [2022-07-11 05:40:37,694][26022] Updated weights on worker 0-0, policy_version 1058520 (0.00081) [2022-07-11 05:40:37,832][25689] Fps is (10 sec: 5619.0, 60 sec: 5637.6, 300 sec: 5542.2). Total num frames: 1083925504. Throughput: 0: 5927.2. Samples: 1083927588. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:40:37,833][25689] Avg episode reward: [(0, '-1.589')] [2022-07-11 05:40:39,270][26022] Updated weights on worker 0-0, policy_version 1058530 (0.00088) [2022-07-11 05:40:41,355][26022] Updated weights on worker 0-0, policy_version 1058540 (0.00093) [2022-07-11 05:40:42,850][25689] Fps is (10 sec: 5512.5, 60 sec: 5637.3, 300 sec: 5543.7). Total num frames: 1083953152. Throughput: 0: 5918.4. Samples: 1083961548. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:40:42,851][25689] Avg episode reward: [(0, '-1.156')] [2022-07-11 05:40:42,946][26022] Updated weights on worker 0-0, policy_version 1058550 (0.00086) [2022-07-11 05:40:44,798][26022] Updated weights on worker 0-0, policy_version 1058560 (0.00086) [2022-07-11 05:40:46,763][26022] Updated weights on worker 0-0, policy_version 1058570 (0.00087) [2022-07-11 05:40:47,892][25689] Fps is (10 sec: 5598.8, 60 sec: 5625.1, 300 sec: 5547.5). Total num frames: 1083981824. Throughput: 0: 5064.3. Samples: 1083978408. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:40:47,893][25689] Avg episode reward: [(0, '-1.098')] [2022-07-11 05:40:48,535][26022] Updated weights on worker 0-0, policy_version 1058580 (0.00092) [2022-07-11 05:40:50,349][26022] Updated weights on worker 0-0, policy_version 1058590 (0.00093) [2022-07-11 05:40:52,077][26022] Updated weights on worker 0-0, policy_version 1058600 (0.00094) [2022-07-11 05:40:52,895][25689] Fps is (10 sec: 5708.7, 60 sec: 5625.2, 300 sec: 5550.9). Total num frames: 1084010496. Throughput: 0: 5898.4. Samples: 1084012102. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:40:52,895][25689] Avg episode reward: [(0, '-0.151')] [2022-07-11 05:40:53,895][26022] Updated weights on worker 0-0, policy_version 1058610 (0.00091) [2022-07-11 05:40:55,888][26022] Updated weights on worker 0-0, policy_version 1058620 (0.00094) [2022-07-11 05:40:57,688][26022] Updated weights on worker 0-0, policy_version 1058630 (0.00083) [2022-07-11 05:40:57,923][25689] Fps is (10 sec: 5614.6, 60 sec: 5644.9, 300 sec: 5548.4). Total num frames: 1084038144. Throughput: 0: 5881.8. Samples: 1084045756. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:40:57,925][25689] Avg episode reward: [(0, '0.612')] [2022-07-11 05:40:59,525][26022] Updated weights on worker 0-0, policy_version 1058640 (0.00082) [2022-07-11 05:41:01,195][26022] Updated weights on worker 0-0, policy_version 1058650 (0.00090) [2022-07-11 05:41:02,966][25689] Fps is (10 sec: 5287.0, 60 sec: 5592.9, 300 sec: 5546.7). Total num frames: 1084063744. Throughput: 0: 5033.6. Samples: 1084062808. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:02,968][25689] Avg episode reward: [(0, '0.708')] [2022-07-11 05:41:03,463][26022] Updated weights on worker 0-0, policy_version 1058660 (0.00084) [2022-07-11 05:41:05,198][26022] Updated weights on worker 0-0, policy_version 1058670 (0.00087) [2022-07-11 05:41:07,069][26022] Updated weights on worker 0-0, policy_version 1058680 (0.00091) [2022-07-11 05:41:08,081][25689] Fps is (10 sec: 5544.5, 60 sec: 5644.5, 300 sec: 5556.1). Total num frames: 1084094464. Throughput: 0: 5759.0. Samples: 1084094676. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:08,081][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 05:41:08,850][26022] Updated weights on worker 0-0, policy_version 1058690 (0.00109) [2022-07-11 05:41:10,664][26022] Updated weights on worker 0-0, policy_version 1058700 (0.00083) [2022-07-11 05:41:12,534][26022] Updated weights on worker 0-0, policy_version 1058710 (0.00083) [2022-07-11 05:41:13,098][25689] Fps is (10 sec: 5660.0, 60 sec: 5595.0, 300 sec: 5545.5). Total num frames: 1084121088. Throughput: 0: 5749.3. Samples: 1084128254. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:13,099][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 05:41:14,387][26022] Updated weights on worker 0-0, policy_version 1058720 (0.00095) [2022-07-11 05:41:16,115][26022] Updated weights on worker 0-0, policy_version 1058730 (0.00084) [2022-07-11 05:41:17,848][26022] Updated weights on worker 0-0, policy_version 1058740 (0.00086) [2022-07-11 05:41:18,134][25689] Fps is (10 sec: 5601.9, 60 sec: 5609.9, 300 sec: 5553.2). Total num frames: 1084150784. Throughput: 0: 4928.6. Samples: 1084145370. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:18,137][25689] Avg episode reward: [(0, '0.716')] [2022-07-11 05:41:19,957][26022] Updated weights on worker 0-0, policy_version 1058750 (0.00087) [2022-07-11 05:41:21,400][26022] Updated weights on worker 0-0, policy_version 1058760 (0.00093) [2022-07-11 05:41:23,140][25689] Fps is (10 sec: 5608.4, 60 sec: 5595.7, 300 sec: 5554.8). Total num frames: 1084177408. Throughput: 0: 5779.6. Samples: 1084179402. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:23,141][25689] Avg episode reward: [(0, '0.902')] [2022-07-11 05:41:23,562][26022] Updated weights on worker 0-0, policy_version 1058770 (0.00081) [2022-07-11 05:41:25,097][26022] Updated weights on worker 0-0, policy_version 1058780 (0.00086) [2022-07-11 05:41:27,035][26022] Updated weights on worker 0-0, policy_version 1058790 (0.00095) [2022-07-11 05:41:28,188][25689] Fps is (10 sec: 5601.7, 60 sec: 5596.6, 300 sec: 5554.4). Total num frames: 1084207104. Throughput: 0: 5903.7. Samples: 1084213386. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:28,190][25689] Avg episode reward: [(0, '0.541')] [2022-07-11 05:41:28,819][26022] Updated weights on worker 0-0, policy_version 1058800 (0.00090) [2022-07-11 05:41:30,616][26022] Updated weights on worker 0-0, policy_version 1058810 (0.00085) [2022-07-11 05:41:32,392][26022] Updated weights on worker 0-0, policy_version 1058820 (0.00091) [2022-07-11 05:41:33,213][25689] Fps is (10 sec: 5794.1, 60 sec: 5595.2, 300 sec: 5557.5). Total num frames: 1084235776. Throughput: 0: 5066.4. Samples: 1084230168. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:33,214][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 05:41:34,410][26022] Updated weights on worker 0-0, policy_version 1058830 (0.00098) [2022-07-11 05:41:35,940][26022] Updated weights on worker 0-0, policy_version 1058840 (0.00088) [2022-07-11 05:41:37,977][26022] Updated weights on worker 0-0, policy_version 1058850 (0.00096) [2022-07-11 05:41:38,228][25689] Fps is (10 sec: 5609.6, 60 sec: 5595.1, 300 sec: 5561.1). Total num frames: 1084263424. Throughput: 0: 5928.3. Samples: 1084264494. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:38,230][25689] Avg episode reward: [(0, '0.808')] [2022-07-11 05:41:38,513][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:41:38,534][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001058853_1084265472.pth [2022-07-11 05:41:38,534][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001056899_1082264576.pth [2022-07-11 05:41:39,595][26022] Updated weights on worker 0-0, policy_version 1058860 (0.00083) [2022-07-11 05:41:41,508][26022] Updated weights on worker 0-0, policy_version 1058870 (0.00674) [2022-07-11 05:41:43,266][26022] Updated weights on worker 0-0, policy_version 1058880 (0.00082) [2022-07-11 05:41:43,266][25689] Fps is (10 sec: 5602.5, 60 sec: 5610.1, 300 sec: 5561.1). Total num frames: 1084292096. Throughput: 0: 5913.9. Samples: 1084298428. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:43,268][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 05:41:45,225][26022] Updated weights on worker 0-0, policy_version 1058890 (0.00083) [2022-07-11 05:41:46,999][26022] Updated weights on worker 0-0, policy_version 1058900 (0.00085) [2022-07-11 05:41:48,344][25689] Fps is (10 sec: 5770.0, 60 sec: 5623.7, 300 sec: 5563.3). Total num frames: 1084321792. Throughput: 0: 5067.2. Samples: 1084315518. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:48,345][25689] Avg episode reward: [(0, '0.611')] [2022-07-11 05:41:48,682][26022] Updated weights on worker 0-0, policy_version 1058910 (0.00093) [2022-07-11 05:41:50,471][26022] Updated weights on worker 0-0, policy_version 1058920 (0.00090) [2022-07-11 05:41:52,637][26022] Updated weights on worker 0-0, policy_version 1058930 (0.00082) [2022-07-11 05:41:53,350][25689] Fps is (10 sec: 5686.6, 60 sec: 5606.5, 300 sec: 5563.8). Total num frames: 1084349440. Throughput: 0: 5939.0. Samples: 1084349760. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:53,351][25689] Avg episode reward: [(0, '0.619')] [2022-07-11 05:41:53,853][26022] Updated weights on worker 0-0, policy_version 1058940 (0.00092) [2022-07-11 05:41:56,139][26022] Updated weights on worker 0-0, policy_version 1058950 (0.00095) [2022-07-11 05:41:57,619][26022] Updated weights on worker 0-0, policy_version 1058960 (0.00091) [2022-07-11 05:41:58,372][25689] Fps is (10 sec: 5514.1, 60 sec: 5607.1, 300 sec: 5560.2). Total num frames: 1084377088. Throughput: 0: 5914.7. Samples: 1084383640. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:41:58,373][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 05:41:59,673][26022] Updated weights on worker 0-0, policy_version 1058970 (0.00092) [2022-07-11 05:42:01,380][26022] Updated weights on worker 0-0, policy_version 1058980 (0.00085) [2022-07-11 05:42:03,386][25689] Fps is (10 sec: 5408.0, 60 sec: 5626.8, 300 sec: 5569.2). Total num frames: 1084403712. Throughput: 0: 5078.0. Samples: 1084400596. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:03,386][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 05:42:03,683][26022] Updated weights on worker 0-0, policy_version 1058990 (0.00093) [2022-07-11 05:42:05,349][26022] Updated weights on worker 0-0, policy_version 1059000 (0.00083) [2022-07-11 05:42:07,348][26022] Updated weights on worker 0-0, policy_version 1059010 (0.00078) [2022-07-11 05:42:08,491][25689] Fps is (10 sec: 5464.8, 60 sec: 5593.7, 300 sec: 5571.2). Total num frames: 1084432384. Throughput: 0: 5786.9. Samples: 1084432106. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:08,492][25689] Avg episode reward: [(0, '0.671')] [2022-07-11 05:42:08,958][26022] Updated weights on worker 0-0, policy_version 1059020 (0.00093) [2022-07-11 05:42:11,070][26022] Updated weights on worker 0-0, policy_version 1059030 (0.00089) [2022-07-11 05:42:12,557][26022] Updated weights on worker 0-0, policy_version 1059040 (0.00091) [2022-07-11 05:42:13,557][25689] Fps is (10 sec: 5537.0, 60 sec: 5606.1, 300 sec: 5570.9). Total num frames: 1084460032. Throughput: 0: 5747.3. Samples: 1084465896. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:13,558][25689] Avg episode reward: [(0, '0.833')] [2022-07-11 05:42:14,712][26022] Updated weights on worker 0-0, policy_version 1059050 (0.00083) [2022-07-11 05:42:16,158][26022] Updated weights on worker 0-0, policy_version 1059060 (0.00069) [2022-07-11 05:42:18,202][26022] Updated weights on worker 0-0, policy_version 1059070 (0.00084) [2022-07-11 05:42:18,560][25689] Fps is (10 sec: 5694.8, 60 sec: 5609.2, 300 sec: 5581.4). Total num frames: 1084489728. Throughput: 0: 4918.1. Samples: 1084482926. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:18,562][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 05:42:19,868][26022] Updated weights on worker 0-0, policy_version 1059080 (0.00082) [2022-07-11 05:42:21,774][26022] Updated weights on worker 0-0, policy_version 1059090 (0.00089) [2022-07-11 05:42:23,524][26022] Updated weights on worker 0-0, policy_version 1059100 (0.00090) [2022-07-11 05:42:23,613][25689] Fps is (10 sec: 5804.4, 60 sec: 5638.7, 300 sec: 5578.3). Total num frames: 1084518400. Throughput: 0: 5761.4. Samples: 1084517134. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:23,614][25689] Avg episode reward: [(0, '1.188')] [2022-07-11 05:42:25,459][26022] Updated weights on worker 0-0, policy_version 1059110 (0.00085) [2022-07-11 05:42:27,224][26022] Updated weights on worker 0-0, policy_version 1059120 (0.00085) [2022-07-11 05:42:28,685][25689] Fps is (10 sec: 5562.6, 60 sec: 5602.7, 300 sec: 5580.8). Total num frames: 1084546048. Throughput: 0: 5877.0. Samples: 1084550786. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:28,686][25689] Avg episode reward: [(0, '0.852')] [2022-07-11 05:42:29,013][26022] Updated weights on worker 0-0, policy_version 1059130 (0.00089) [2022-07-11 05:42:30,668][26022] Updated weights on worker 0-0, policy_version 1059140 (0.00090) [2022-07-11 05:42:32,793][26022] Updated weights on worker 0-0, policy_version 1059150 (0.00087) [2022-07-11 05:42:33,695][25689] Fps is (10 sec: 5586.4, 60 sec: 5604.1, 300 sec: 5584.2). Total num frames: 1084574720. Throughput: 0: 5051.7. Samples: 1084567624. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:33,696][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 05:42:34,249][26022] Updated weights on worker 0-0, policy_version 1059160 (0.00084) [2022-07-11 05:42:36,338][26022] Updated weights on worker 0-0, policy_version 1059170 (0.00081) [2022-07-11 05:42:37,916][26022] Updated weights on worker 0-0, policy_version 1059180 (0.00087) [2022-07-11 05:42:38,699][25689] Fps is (10 sec: 5624.0, 60 sec: 5605.1, 300 sec: 5580.9). Total num frames: 1084602368. Throughput: 0: 5897.4. Samples: 1084601692. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:38,701][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 05:42:39,939][26022] Updated weights on worker 0-0, policy_version 1059190 (0.00083) [2022-07-11 05:42:41,704][26022] Updated weights on worker 0-0, policy_version 1059200 (0.00082) [2022-07-11 05:42:43,521][26022] Updated weights on worker 0-0, policy_version 1059210 (0.00085) [2022-07-11 05:42:43,706][25689] Fps is (10 sec: 5625.4, 60 sec: 5607.9, 300 sec: 5581.8). Total num frames: 1084631040. Throughput: 0: 5909.5. Samples: 1084635876. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:43,709][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 05:42:45,322][26022] Updated weights on worker 0-0, policy_version 1059220 (0.00093) [2022-07-11 05:42:47,172][26022] Updated weights on worker 0-0, policy_version 1059230 (0.00081) [2022-07-11 05:42:48,775][25689] Fps is (10 sec: 5793.0, 60 sec: 5608.8, 300 sec: 5590.9). Total num frames: 1084660736. Throughput: 0: 5919.2. Samples: 1084669702. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:48,777][25689] Avg episode reward: [(0, '1.583')] [2022-07-11 05:42:48,955][26022] Updated weights on worker 0-0, policy_version 1059240 (0.00081) [2022-07-11 05:42:50,692][26022] Updated weights on worker 0-0, policy_version 1059250 (0.00087) [2022-07-11 05:42:52,627][26022] Updated weights on worker 0-0, policy_version 1059260 (0.00084) [2022-07-11 05:42:53,795][25689] Fps is (10 sec: 5684.1, 60 sec: 5607.5, 300 sec: 5587.2). Total num frames: 1084688384. Throughput: 0: 5933.8. Samples: 1084686894. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:53,795][25689] Avg episode reward: [(0, '1.596')] [2022-07-11 05:42:54,468][26022] Updated weights on worker 0-0, policy_version 1059270 (0.00093) [2022-07-11 05:42:56,057][26022] Updated weights on worker 0-0, policy_version 1059280 (0.00092) [2022-07-11 05:42:58,129][26022] Updated weights on worker 0-0, policy_version 1059290 (0.00086) [2022-07-11 05:42:58,817][25689] Fps is (10 sec: 5608.5, 60 sec: 5624.5, 300 sec: 5590.3). Total num frames: 1084717056. Throughput: 0: 5917.0. Samples: 1084720728. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:42:58,819][25689] Avg episode reward: [(0, '2.142')] [2022-07-11 05:42:59,945][26022] Updated weights on worker 0-0, policy_version 1059300 (0.00096) [2022-07-11 05:43:01,774][26022] Updated weights on worker 0-0, policy_version 1059310 (0.00089) [2022-07-11 05:43:03,903][25689] Fps is (10 sec: 5369.3, 60 sec: 5600.8, 300 sec: 5590.8). Total num frames: 1084742656. Throughput: 0: 5781.9. Samples: 1084752648. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:03,903][25689] Avg episode reward: [(0, '2.057')] [2022-07-11 05:43:04,055][26022] Updated weights on worker 0-0, policy_version 1059320 (0.00087) [2022-07-11 05:43:05,663][26022] Updated weights on worker 0-0, policy_version 1059330 (0.00405) [2022-07-11 05:43:07,500][26022] Updated weights on worker 0-0, policy_version 1059340 (0.00081) [2022-07-11 05:43:09,015][25689] Fps is (10 sec: 5421.9, 60 sec: 5617.0, 300 sec: 5592.2). Total num frames: 1084772352. Throughput: 0: 4934.6. Samples: 1084769582. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:09,019][25689] Avg episode reward: [(0, '2.393')] [2022-07-11 05:43:09,397][26022] Updated weights on worker 0-0, policy_version 1059350 (0.00095) [2022-07-11 05:43:10,984][26022] Updated weights on worker 0-0, policy_version 1059360 (0.00084) [2022-07-11 05:43:12,903][26022] Updated weights on worker 0-0, policy_version 1059370 (0.00096) [2022-07-11 05:43:14,031][25689] Fps is (10 sec: 5763.0, 60 sec: 5638.7, 300 sec: 5592.1). Total num frames: 1084801024. Throughput: 0: 5802.6. Samples: 1084804314. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:14,031][25689] Avg episode reward: [(0, '2.336')] [2022-07-11 05:43:14,628][26022] Updated weights on worker 0-0, policy_version 1059380 (0.00087) [2022-07-11 05:43:16,525][26022] Updated weights on worker 0-0, policy_version 1059390 (0.00086) [2022-07-11 05:43:18,100][26022] Updated weights on worker 0-0, policy_version 1059400 (0.00090) [2022-07-11 05:43:19,034][25689] Fps is (10 sec: 5723.6, 60 sec: 5621.7, 300 sec: 5599.4). Total num frames: 1084829696. Throughput: 0: 5822.6. Samples: 1084838446. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:19,036][25689] Avg episode reward: [(0, '2.288')] [2022-07-11 05:43:20,201][26022] Updated weights on worker 0-0, policy_version 1059410 (0.00089) [2022-07-11 05:43:21,653][26022] Updated weights on worker 0-0, policy_version 1059420 (0.00088) [2022-07-11 05:43:23,685][26022] Updated weights on worker 0-0, policy_version 1059430 (0.00085) [2022-07-11 05:43:24,044][25689] Fps is (10 sec: 5726.9, 60 sec: 5625.8, 300 sec: 5600.8). Total num frames: 1084858368. Throughput: 0: 5117.4. Samples: 1084855718. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:24,046][25689] Avg episode reward: [(0, '2.168')] [2022-07-11 05:43:25,222][26022] Updated weights on worker 0-0, policy_version 1059440 (0.00087) [2022-07-11 05:43:27,293][26022] Updated weights on worker 0-0, policy_version 1059450 (0.01383) [2022-07-11 05:43:29,105][25689] Fps is (10 sec: 5694.1, 60 sec: 5643.7, 300 sec: 5603.4). Total num frames: 1084887040. Throughput: 0: 5977.4. Samples: 1084889666. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:29,107][25689] Avg episode reward: [(0, '1.985')] [2022-07-11 05:43:29,112][26022] Updated weights on worker 0-0, policy_version 1059460 (0.00097) [2022-07-11 05:43:30,905][26022] Updated weights on worker 0-0, policy_version 1059470 (0.00080) [2022-07-11 05:43:32,492][26022] Updated weights on worker 0-0, policy_version 1059480 (0.00089) [2022-07-11 05:43:34,143][25689] Fps is (10 sec: 5576.4, 60 sec: 5624.1, 300 sec: 5599.4). Total num frames: 1084914688. Throughput: 0: 5944.6. Samples: 1084923876. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:34,144][25689] Avg episode reward: [(0, '1.740')] [2022-07-11 05:43:34,547][26022] Updated weights on worker 0-0, policy_version 1059490 (0.00087) [2022-07-11 05:43:36,108][26022] Updated weights on worker 0-0, policy_version 1059500 (0.00084) [2022-07-11 05:43:38,235][26022] Updated weights on worker 0-0, policy_version 1059510 (0.00088) [2022-07-11 05:43:38,603][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:43:38,613][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001059513_1084941312.pth [2022-07-11 05:43:38,614][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001057546_1082927104.pth [2022-07-11 05:43:39,162][25689] Fps is (10 sec: 5702.2, 60 sec: 5656.7, 300 sec: 5606.2). Total num frames: 1084944384. Throughput: 0: 5099.1. Samples: 1084941078. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:39,162][25689] Avg episode reward: [(0, '1.446')] [2022-07-11 05:43:39,615][26022] Updated weights on worker 0-0, policy_version 1059520 (0.00113) [2022-07-11 05:43:41,837][26022] Updated weights on worker 0-0, policy_version 1059530 (0.00603) [2022-07-11 05:43:43,254][26022] Updated weights on worker 0-0, policy_version 1059540 (0.00085) [2022-07-11 05:43:44,181][25689] Fps is (10 sec: 5712.8, 60 sec: 5638.6, 300 sec: 5604.2). Total num frames: 1084972032. Throughput: 0: 5923.0. Samples: 1084974992. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:44,182][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 05:43:45,386][26022] Updated weights on worker 0-0, policy_version 1059550 (0.00098) [2022-07-11 05:43:47,213][26022] Updated weights on worker 0-0, policy_version 1059560 (0.00086) [2022-07-11 05:43:49,035][26022] Updated weights on worker 0-0, policy_version 1059570 (0.00080) [2022-07-11 05:43:49,256][25689] Fps is (10 sec: 5579.2, 60 sec: 5621.0, 300 sec: 5611.5). Total num frames: 1085000704. Throughput: 0: 5893.3. Samples: 1085008422. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:49,257][25689] Avg episode reward: [(0, '1.368')] [2022-07-11 05:43:50,910][26022] Updated weights on worker 0-0, policy_version 1059580 (0.00098) [2022-07-11 05:43:52,625][26022] Updated weights on worker 0-0, policy_version 1059590 (0.00089) [2022-07-11 05:43:54,288][25689] Fps is (10 sec: 5572.6, 60 sec: 5620.0, 300 sec: 5608.3). Total num frames: 1085028352. Throughput: 0: 5025.0. Samples: 1085025102. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:54,289][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 05:43:54,461][26022] Updated weights on worker 0-0, policy_version 1059600 (0.00081) [2022-07-11 05:43:56,438][26022] Updated weights on worker 0-0, policy_version 1059610 (0.00085) [2022-07-11 05:43:58,052][26022] Updated weights on worker 0-0, policy_version 1059620 (0.00087) [2022-07-11 05:43:59,378][25689] Fps is (10 sec: 5564.5, 60 sec: 5613.7, 300 sec: 5614.1). Total num frames: 1085057024. Throughput: 0: 5812.1. Samples: 1085058576. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:43:59,379][25689] Avg episode reward: [(0, '1.398')] [2022-07-11 05:44:00,089][26022] Updated weights on worker 0-0, policy_version 1059630 (0.00061) [2022-07-11 05:44:02,178][26022] Updated weights on worker 0-0, policy_version 1059640 (0.00090) [2022-07-11 05:44:04,104][26022] Updated weights on worker 0-0, policy_version 1059650 (0.00090) [2022-07-11 05:44:04,394][25689] Fps is (10 sec: 5471.6, 60 sec: 5637.1, 300 sec: 5608.7). Total num frames: 1085083648. Throughput: 0: 5698.9. Samples: 1085090182. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:44:04,396][25689] Avg episode reward: [(0, '1.249')] [2022-07-11 05:44:05,839][26022] Updated weights on worker 0-0, policy_version 1059660 (0.00095) [2022-07-11 05:44:07,858][26022] Updated weights on worker 0-0, policy_version 1059670 (0.00090) [2022-07-11 05:44:09,448][25689] Fps is (10 sec: 5389.2, 60 sec: 5608.6, 300 sec: 5605.0). Total num frames: 1085111296. Throughput: 0: 4875.4. Samples: 1085106866. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:44:09,449][25689] Avg episode reward: [(0, '1.221')] [2022-07-11 05:44:09,567][26022] Updated weights on worker 0-0, policy_version 1059680 (0.00088) [2022-07-11 05:44:11,320][26022] Updated weights on worker 0-0, policy_version 1059690 (0.00089) [2022-07-11 05:44:13,414][26022] Updated weights on worker 0-0, policy_version 1059700 (0.00093) [2022-07-11 05:44:14,487][25689] Fps is (10 sec: 5478.7, 60 sec: 5589.5, 300 sec: 5612.0). Total num frames: 1085138944. Throughput: 0: 5720.8. Samples: 1085140656. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:44:14,487][25689] Avg episode reward: [(0, '1.002')] [2022-07-11 05:44:14,907][26022] Updated weights on worker 0-0, policy_version 1059710 (0.00087) [2022-07-11 05:44:17,056][26022] Updated weights on worker 0-0, policy_version 1059720 (0.00092) [2022-07-11 05:44:18,482][26022] Updated weights on worker 0-0, policy_version 1059730 (0.00089) [2022-07-11 05:44:19,517][25689] Fps is (10 sec: 5491.6, 60 sec: 5570.1, 300 sec: 5601.5). Total num frames: 1085166592. Throughput: 0: 5735.2. Samples: 1085174080. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:44:19,518][25689] Avg episode reward: [(0, '0.023')] [2022-07-11 05:44:20,596][26022] Updated weights on worker 0-0, policy_version 1059740 (0.00086) [2022-07-11 05:44:22,483][26022] Updated weights on worker 0-0, policy_version 1059750 (0.00084) [2022-07-11 05:44:24,110][26022] Updated weights on worker 0-0, policy_version 1059760 (0.00088) [2022-07-11 05:44:24,535][25689] Fps is (10 sec: 5707.0, 60 sec: 5586.3, 300 sec: 5609.1). Total num frames: 1085196288. Throughput: 0: 5009.0. Samples: 1085191068. Policy #0 lag: (min: 0.0, avg: 8.0, max: 19.0) [2022-07-11 05:44:24,535][25689] Avg episode reward: [(0, '-0.007')] [2022-07-11 05:44:25,965][26022] Updated weights on worker 0-0, policy_version 1059770 (0.00090) [2022-07-11 05:44:27,719][26022] Updated weights on worker 0-0, policy_version 1059780 (0.00086) [2022-07-11 05:44:29,586][25689] Fps is (10 sec: 5695.0, 60 sec: 5570.3, 300 sec: 5608.3). Total num frames: 1085223936. Throughput: 0: 5857.1. Samples: 1085224818. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:29,588][25689] Avg episode reward: [(0, '0.098')] [2022-07-11 05:44:29,670][26022] Updated weights on worker 0-0, policy_version 1059790 (0.00089) [2022-07-11 05:44:31,525][26022] Updated weights on worker 0-0, policy_version 1059800 (0.00088) [2022-07-11 05:44:33,185][26022] Updated weights on worker 0-0, policy_version 1059810 (0.00085) [2022-07-11 05:44:34,589][25689] Fps is (10 sec: 5601.5, 60 sec: 5590.5, 300 sec: 5611.9). Total num frames: 1085252608. Throughput: 0: 5865.3. Samples: 1085258562. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:34,590][25689] Avg episode reward: [(0, '0.132')] [2022-07-11 05:44:35,225][26022] Updated weights on worker 0-0, policy_version 1059820 (0.00080) [2022-07-11 05:44:36,842][26022] Updated weights on worker 0-0, policy_version 1059830 (0.00082) [2022-07-11 05:44:38,771][26022] Updated weights on worker 0-0, policy_version 1059840 (0.00082) [2022-07-11 05:44:39,684][25689] Fps is (10 sec: 5577.8, 60 sec: 5549.6, 300 sec: 5610.4). Total num frames: 1085280256. Throughput: 0: 5034.3. Samples: 1085275604. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:39,684][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 05:44:40,617][26022] Updated weights on worker 0-0, policy_version 1059850 (0.00084) [2022-07-11 05:44:42,496][26022] Updated weights on worker 0-0, policy_version 1059860 (0.00098) [2022-07-11 05:44:44,020][26022] Updated weights on worker 0-0, policy_version 1059870 (0.00090) [2022-07-11 05:44:44,719][25689] Fps is (10 sec: 5559.9, 60 sec: 5565.1, 300 sec: 5608.0). Total num frames: 1085308928. Throughput: 0: 5867.0. Samples: 1085309488. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:44,719][25689] Avg episode reward: [(0, '-0.068')] [2022-07-11 05:44:46,131][26022] Updated weights on worker 0-0, policy_version 1059880 (0.00084) [2022-07-11 05:44:47,813][26022] Updated weights on worker 0-0, policy_version 1059890 (0.00087) [2022-07-11 05:44:49,758][25689] Fps is (10 sec: 5590.2, 60 sec: 5551.4, 300 sec: 5603.9). Total num frames: 1085336576. Throughput: 0: 5848.7. Samples: 1085342798. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:49,759][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 05:44:49,785][26022] Updated weights on worker 0-0, policy_version 1059900 (0.00082) [2022-07-11 05:44:51,599][26022] Updated weights on worker 0-0, policy_version 1059910 (0.00083) [2022-07-11 05:44:53,455][26022] Updated weights on worker 0-0, policy_version 1059920 (0.00088) [2022-07-11 05:44:54,841][25689] Fps is (10 sec: 5563.9, 60 sec: 5563.6, 300 sec: 5610.3). Total num frames: 1085365248. Throughput: 0: 4989.9. Samples: 1085359618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:54,842][25689] Avg episode reward: [(0, '0.048')] [2022-07-11 05:44:55,381][26022] Updated weights on worker 0-0, policy_version 1059930 (0.00094) [2022-07-11 05:44:57,240][26022] Updated weights on worker 0-0, policy_version 1059940 (0.00090) [2022-07-11 05:44:59,043][26022] Updated weights on worker 0-0, policy_version 1059950 (0.00081) [2022-07-11 05:44:59,920][25689] Fps is (10 sec: 5643.0, 60 sec: 5564.6, 300 sec: 5609.4). Total num frames: 1085393920. Throughput: 0: 5803.9. Samples: 1085393056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:44:59,921][25689] Avg episode reward: [(0, '0.121')] [2022-07-11 05:45:00,801][26022] Updated weights on worker 0-0, policy_version 1059960 (0.00099) [2022-07-11 05:45:02,891][26022] Updated weights on worker 0-0, policy_version 1059970 (0.00092) [2022-07-11 05:45:04,963][25689] Fps is (10 sec: 5260.9, 60 sec: 5528.4, 300 sec: 5600.6). Total num frames: 1085418496. Throughput: 0: 5688.4. Samples: 1085424644. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:04,964][25689] Avg episode reward: [(0, '-0.076')] [2022-07-11 05:45:04,967][26022] Updated weights on worker 0-0, policy_version 1059980 (0.00084) [2022-07-11 05:45:06,672][26022] Updated weights on worker 0-0, policy_version 1059990 (0.00089) [2022-07-11 05:45:08,708][26022] Updated weights on worker 0-0, policy_version 1060000 (0.00081) [2022-07-11 05:45:10,042][25689] Fps is (10 sec: 5463.3, 60 sec: 5576.8, 300 sec: 5603.2). Total num frames: 1085449216. Throughput: 0: 4856.5. Samples: 1085441304. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:10,043][25689] Avg episode reward: [(0, '-0.617')] [2022-07-11 05:45:10,275][26022] Updated weights on worker 0-0, policy_version 1060010 (0.00088) [2022-07-11 05:45:12,337][26022] Updated weights on worker 0-0, policy_version 1060020 (0.00087) [2022-07-11 05:45:13,960][26022] Updated weights on worker 0-0, policy_version 1060030 (0.00089) [2022-07-11 05:45:15,062][25689] Fps is (10 sec: 5576.5, 60 sec: 5544.7, 300 sec: 5592.7). Total num frames: 1085474816. Throughput: 0: 5695.0. Samples: 1085474776. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:15,064][25689] Avg episode reward: [(0, '0.203')] [2022-07-11 05:45:15,908][26022] Updated weights on worker 0-0, policy_version 1060040 (0.00095) [2022-07-11 05:45:17,723][26022] Updated weights on worker 0-0, policy_version 1060050 (0.00108) [2022-07-11 05:45:19,628][26022] Updated weights on worker 0-0, policy_version 1060060 (0.00087) [2022-07-11 05:45:20,117][25689] Fps is (10 sec: 5590.1, 60 sec: 5593.2, 300 sec: 5602.7). Total num frames: 1085505536. Throughput: 0: 5706.5. Samples: 1085508306. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:20,118][25689] Avg episode reward: [(0, '-0.816')] [2022-07-11 05:45:21,390][26022] Updated weights on worker 0-0, policy_version 1060070 (0.00094) [2022-07-11 05:45:23,059][26022] Updated weights on worker 0-0, policy_version 1060080 (0.00093) [2022-07-11 05:45:25,040][26022] Updated weights on worker 0-0, policy_version 1060090 (0.00100) [2022-07-11 05:45:25,131][25689] Fps is (10 sec: 5695.4, 60 sec: 5542.8, 300 sec: 5593.2). Total num frames: 1085532160. Throughput: 0: 4989.4. Samples: 1085525270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:25,131][25689] Avg episode reward: [(0, '0.001')] [2022-07-11 05:45:26,873][26022] Updated weights on worker 0-0, policy_version 1060100 (0.00082) [2022-07-11 05:45:28,582][26022] Updated weights on worker 0-0, policy_version 1060110 (0.00088) [2022-07-11 05:45:30,210][25689] Fps is (10 sec: 5478.5, 60 sec: 5557.2, 300 sec: 5591.9). Total num frames: 1085560832. Throughput: 0: 5851.0. Samples: 1085559308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:30,210][25689] Avg episode reward: [(0, '0.159')] [2022-07-11 05:45:30,422][26022] Updated weights on worker 0-0, policy_version 1060120 (0.00098) [2022-07-11 05:45:32,116][26022] Updated weights on worker 0-0, policy_version 1060130 (0.00086) [2022-07-11 05:45:34,306][26022] Updated weights on worker 0-0, policy_version 1060140 (0.00091) [2022-07-11 05:45:35,240][25689] Fps is (10 sec: 5672.6, 60 sec: 5554.7, 300 sec: 5595.0). Total num frames: 1085589504. Throughput: 0: 5849.6. Samples: 1085592806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:35,240][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 05:45:35,751][26022] Updated weights on worker 0-0, policy_version 1060150 (0.00095) [2022-07-11 05:45:37,747][26022] Updated weights on worker 0-0, policy_version 1060160 (0.00083) [2022-07-11 05:45:38,704][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:45:38,715][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001060165_1085608960.pth [2022-07-11 05:45:38,716][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001058195_1083591680.pth [2022-07-11 05:45:39,612][26022] Updated weights on worker 0-0, policy_version 1060170 (0.00089) [2022-07-11 05:45:40,309][25689] Fps is (10 sec: 5576.8, 60 sec: 5557.0, 300 sec: 5594.0). Total num frames: 1085617152. Throughput: 0: 5847.6. Samples: 1085626382. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:40,309][25689] Avg episode reward: [(0, '1.200')] [2022-07-11 05:45:41,475][26022] Updated weights on worker 0-0, policy_version 1060180 (0.00086) [2022-07-11 05:45:43,021][26022] Updated weights on worker 0-0, policy_version 1060190 (0.00625) [2022-07-11 05:45:44,983][26022] Updated weights on worker 0-0, policy_version 1060200 (0.00089) [2022-07-11 05:45:45,336][25689] Fps is (10 sec: 5578.2, 60 sec: 5557.8, 300 sec: 5594.3). Total num frames: 1085645824. Throughput: 0: 5848.8. Samples: 1085643448. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:45,337][25689] Avg episode reward: [(0, '2.083')] [2022-07-11 05:45:46,843][26022] Updated weights on worker 0-0, policy_version 1060210 (0.00093) [2022-07-11 05:45:48,803][26022] Updated weights on worker 0-0, policy_version 1060220 (0.00088) [2022-07-11 05:45:50,470][25689] Fps is (10 sec: 5643.7, 60 sec: 5566.0, 300 sec: 5591.9). Total num frames: 1085674496. Throughput: 0: 5818.6. Samples: 1085677190. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:50,470][25689] Avg episode reward: [(0, '1.946')] [2022-07-11 05:45:50,504][26022] Updated weights on worker 0-0, policy_version 1060230 (0.00089) [2022-07-11 05:45:52,207][26022] Updated weights on worker 0-0, policy_version 1060240 (0.00095) [2022-07-11 05:45:54,214][26022] Updated weights on worker 0-0, policy_version 1060250 (0.00083) [2022-07-11 05:45:55,512][25689] Fps is (10 sec: 5635.1, 60 sec: 5569.7, 300 sec: 5595.0). Total num frames: 1085703168. Throughput: 0: 5836.0. Samples: 1085711116. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:45:55,513][25689] Avg episode reward: [(0, '1.778')] [2022-07-11 05:45:55,984][26022] Updated weights on worker 0-0, policy_version 1060260 (0.00060) [2022-07-11 05:45:57,838][26022] Updated weights on worker 0-0, policy_version 1060270 (0.00084) [2022-07-11 05:45:59,606][26022] Updated weights on worker 0-0, policy_version 1060280 (0.00089) [2022-07-11 05:46:00,524][25689] Fps is (10 sec: 5601.3, 60 sec: 5559.0, 300 sec: 5602.5). Total num frames: 1085730816. Throughput: 0: 5026.1. Samples: 1085727988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:00,525][25689] Avg episode reward: [(0, '1.884')] [2022-07-11 05:46:01,521][26022] Updated weights on worker 0-0, policy_version 1060290 (0.00085) [2022-07-11 05:46:03,570][26022] Updated weights on worker 0-0, policy_version 1060300 (0.00088) [2022-07-11 05:46:05,438][26022] Updated weights on worker 0-0, policy_version 1060310 (0.00082) [2022-07-11 05:46:05,537][25689] Fps is (10 sec: 5414.0, 60 sec: 5595.6, 300 sec: 5590.7). Total num frames: 1085757440. Throughput: 0: 5750.4. Samples: 1085759608. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:05,537][25689] Avg episode reward: [(0, '1.916')] [2022-07-11 05:46:07,358][26022] Updated weights on worker 0-0, policy_version 1060320 (0.00085) [2022-07-11 05:46:09,184][26022] Updated weights on worker 0-0, policy_version 1060330 (0.00090) [2022-07-11 05:46:10,594][25689] Fps is (10 sec: 5491.6, 60 sec: 5563.8, 300 sec: 5596.8). Total num frames: 1085786112. Throughput: 0: 5768.3. Samples: 1085793270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:10,594][25689] Avg episode reward: [(0, '2.201')] [2022-07-11 05:46:10,963][26022] Updated weights on worker 0-0, policy_version 1060340 (0.00085) [2022-07-11 05:46:12,811][26022] Updated weights on worker 0-0, policy_version 1060350 (0.00083) [2022-07-11 05:46:14,633][26022] Updated weights on worker 0-0, policy_version 1060360 (0.00085) [2022-07-11 05:46:15,638][25689] Fps is (10 sec: 5676.6, 60 sec: 5612.2, 300 sec: 5593.2). Total num frames: 1085814784. Throughput: 0: 4919.7. Samples: 1085810130. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:15,639][25689] Avg episode reward: [(0, '2.203')] [2022-07-11 05:46:16,617][26022] Updated weights on worker 0-0, policy_version 1060370 (0.00504) [2022-07-11 05:46:18,092][26022] Updated weights on worker 0-0, policy_version 1060380 (0.00091) [2022-07-11 05:46:20,307][26022] Updated weights on worker 0-0, policy_version 1060390 (0.00621) [2022-07-11 05:46:20,648][25689] Fps is (10 sec: 5397.9, 60 sec: 5531.8, 300 sec: 5589.7). Total num frames: 1085840384. Throughput: 0: 5745.7. Samples: 1085843610. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:20,648][25689] Avg episode reward: [(0, '2.198')] [2022-07-11 05:46:22,031][26022] Updated weights on worker 0-0, policy_version 1060400 (0.00087) [2022-07-11 05:46:23,738][26022] Updated weights on worker 0-0, policy_version 1060410 (0.00087) [2022-07-11 05:46:25,663][25689] Fps is (10 sec: 5311.9, 60 sec: 5548.7, 300 sec: 5583.4). Total num frames: 1085868032. Throughput: 0: 5839.0. Samples: 1085877124. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:25,663][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 05:46:25,907][26022] Updated weights on worker 0-0, policy_version 1060420 (0.00084) [2022-07-11 05:46:27,293][26022] Updated weights on worker 0-0, policy_version 1060430 (0.00084) [2022-07-11 05:46:29,541][26022] Updated weights on worker 0-0, policy_version 1060440 (0.00080) [2022-07-11 05:46:30,786][25689] Fps is (10 sec: 5757.4, 60 sec: 5578.5, 300 sec: 5588.5). Total num frames: 1085898752. Throughput: 0: 4971.7. Samples: 1085893660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:30,786][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 05:46:31,240][26022] Updated weights on worker 0-0, policy_version 1060450 (0.00089) [2022-07-11 05:46:32,987][26022] Updated weights on worker 0-0, policy_version 1060460 (0.00091) [2022-07-11 05:46:35,182][26022] Updated weights on worker 0-0, policy_version 1060470 (0.00381) [2022-07-11 05:46:35,805][25689] Fps is (10 sec: 5654.1, 60 sec: 5545.6, 300 sec: 5585.0). Total num frames: 1085925376. Throughput: 0: 5807.0. Samples: 1085927236. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:35,805][25689] Avg episode reward: [(0, '0.802')] [2022-07-11 05:46:36,523][26022] Updated weights on worker 0-0, policy_version 1060480 (0.00094) [2022-07-11 05:46:38,691][26022] Updated weights on worker 0-0, policy_version 1060490 (0.00093) [2022-07-11 05:46:40,319][26022] Updated weights on worker 0-0, policy_version 1060500 (0.00085) [2022-07-11 05:46:40,815][25689] Fps is (10 sec: 5411.5, 60 sec: 5551.1, 300 sec: 5582.0). Total num frames: 1085953024. Throughput: 0: 5812.3. Samples: 1085960826. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:40,815][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 05:46:42,276][26022] Updated weights on worker 0-0, policy_version 1060510 (0.00087) [2022-07-11 05:46:44,000][26022] Updated weights on worker 0-0, policy_version 1060520 (0.00086) [2022-07-11 05:46:45,828][26022] Updated weights on worker 0-0, policy_version 1060530 (0.00092) [2022-07-11 05:46:45,911][25689] Fps is (10 sec: 5674.3, 60 sec: 5561.7, 300 sec: 5581.7). Total num frames: 1085982720. Throughput: 0: 4965.2. Samples: 1085977660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:45,911][25689] Avg episode reward: [(0, '1.018')] [2022-07-11 05:46:47,518][26022] Updated weights on worker 0-0, policy_version 1060540 (0.00094) [2022-07-11 05:46:49,437][26022] Updated weights on worker 0-0, policy_version 1060550 (0.00097) [2022-07-11 05:46:51,024][25689] Fps is (10 sec: 5817.5, 60 sec: 5580.5, 300 sec: 5586.6). Total num frames: 1086012416. Throughput: 0: 5821.2. Samples: 1086011470. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:51,026][25689] Avg episode reward: [(0, '1.153')] [2022-07-11 05:46:51,271][26022] Updated weights on worker 0-0, policy_version 1060560 (0.00095) [2022-07-11 05:46:53,034][26022] Updated weights on worker 0-0, policy_version 1060570 (0.00104) [2022-07-11 05:46:55,060][26022] Updated weights on worker 0-0, policy_version 1060580 (0.00078) [2022-07-11 05:46:56,030][25689] Fps is (10 sec: 5667.0, 60 sec: 5566.9, 300 sec: 5586.9). Total num frames: 1086040064. Throughput: 0: 5861.7. Samples: 1086045788. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:46:56,031][25689] Avg episode reward: [(0, '2.271')] [2022-07-11 05:46:56,632][26022] Updated weights on worker 0-0, policy_version 1060590 (0.00090) [2022-07-11 05:46:58,504][26022] Updated weights on worker 0-0, policy_version 1060600 (0.00087) [2022-07-11 05:47:00,291][26022] Updated weights on worker 0-0, policy_version 1060610 (0.00087) [2022-07-11 05:47:01,034][25689] Fps is (10 sec: 5421.8, 60 sec: 5550.8, 300 sec: 5587.1). Total num frames: 1086066688. Throughput: 0: 5036.7. Samples: 1086062664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:01,034][25689] Avg episode reward: [(0, '1.946')] [2022-07-11 05:47:02,279][26022] Updated weights on worker 0-0, policy_version 1060620 (0.00100) [2022-07-11 05:47:04,512][26022] Updated weights on worker 0-0, policy_version 1060630 (0.00083) [2022-07-11 05:47:06,036][25689] Fps is (10 sec: 5424.0, 60 sec: 5568.6, 300 sec: 5585.6). Total num frames: 1086094336. Throughput: 0: 5800.1. Samples: 1086094386. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:06,036][25689] Avg episode reward: [(0, '1.728')] [2022-07-11 05:47:06,107][26022] Updated weights on worker 0-0, policy_version 1060640 (0.00086) [2022-07-11 05:47:07,974][26022] Updated weights on worker 0-0, policy_version 1060650 (0.00083) [2022-07-11 05:47:09,826][26022] Updated weights on worker 0-0, policy_version 1060660 (0.00084) [2022-07-11 05:47:11,158][25689] Fps is (10 sec: 5562.9, 60 sec: 5562.6, 300 sec: 5587.9). Total num frames: 1086123008. Throughput: 0: 5820.5. Samples: 1086128660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:11,158][25689] Avg episode reward: [(0, '1.828')] [2022-07-11 05:47:11,432][26022] Updated weights on worker 0-0, policy_version 1060670 (0.00089) [2022-07-11 05:47:13,389][26022] Updated weights on worker 0-0, policy_version 1060680 (0.00086) [2022-07-11 05:47:15,079][26022] Updated weights on worker 0-0, policy_version 1060690 (0.00085) [2022-07-11 05:47:16,181][25689] Fps is (10 sec: 5652.2, 60 sec: 5564.6, 300 sec: 5584.1). Total num frames: 1086151680. Throughput: 0: 4963.4. Samples: 1086145808. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:16,182][25689] Avg episode reward: [(0, '1.882')] [2022-07-11 05:47:16,859][26022] Updated weights on worker 0-0, policy_version 1060700 (0.00087) [2022-07-11 05:47:18,823][26022] Updated weights on worker 0-0, policy_version 1060710 (0.00090) [2022-07-11 05:47:20,519][26022] Updated weights on worker 0-0, policy_version 1060720 (0.00090) [2022-07-11 05:47:21,220][25689] Fps is (10 sec: 5800.5, 60 sec: 5629.5, 300 sec: 5587.8). Total num frames: 1086181376. Throughput: 0: 5801.1. Samples: 1086179770. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:21,221][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 05:47:22,307][26022] Updated weights on worker 0-0, policy_version 1060730 (0.00085) [2022-07-11 05:47:24,159][26022] Updated weights on worker 0-0, policy_version 1060740 (0.00084) [2022-07-11 05:47:25,859][26022] Updated weights on worker 0-0, policy_version 1060750 (0.00560) [2022-07-11 05:47:26,244][25689] Fps is (10 sec: 5698.5, 60 sec: 5628.7, 300 sec: 5588.7). Total num frames: 1086209024. Throughput: 0: 5921.5. Samples: 1086214050. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:26,245][25689] Avg episode reward: [(0, '0.123')] [2022-07-11 05:47:27,779][26022] Updated weights on worker 0-0, policy_version 1060760 (0.00087) [2022-07-11 05:47:29,757][26022] Updated weights on worker 0-0, policy_version 1060770 (0.00095) [2022-07-11 05:47:31,321][25689] Fps is (10 sec: 5677.4, 60 sec: 5616.1, 300 sec: 5590.9). Total num frames: 1086238720. Throughput: 0: 5074.9. Samples: 1086230984. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:31,321][25689] Avg episode reward: [(0, '0.434')] [2022-07-11 05:47:31,329][26022] Updated weights on worker 0-0, policy_version 1060780 (0.00823) [2022-07-11 05:47:33,108][26022] Updated weights on worker 0-0, policy_version 1060790 (0.00084) [2022-07-11 05:47:34,856][26022] Updated weights on worker 0-0, policy_version 1060800 (0.00089) [2022-07-11 05:47:36,386][25689] Fps is (10 sec: 5654.0, 60 sec: 5628.6, 300 sec: 5589.8). Total num frames: 1086266368. Throughput: 0: 5897.8. Samples: 1086264974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:36,387][25689] Avg episode reward: [(0, '-1.201')] [2022-07-11 05:47:36,720][26022] Updated weights on worker 0-0, policy_version 1060810 (0.00086) [2022-07-11 05:47:38,517][26022] Updated weights on worker 0-0, policy_version 1060820 (0.00108) [2022-07-11 05:47:38,918][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:47:38,927][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001060822_1086281728.pth [2022-07-11 05:47:38,927][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001058853_1084265472.pth [2022-07-11 05:47:40,348][26022] Updated weights on worker 0-0, policy_version 1060830 (0.00091) [2022-07-11 05:47:41,392][25689] Fps is (10 sec: 5592.3, 60 sec: 5646.0, 300 sec: 5589.8). Total num frames: 1086295040. Throughput: 0: 5940.3. Samples: 1086299594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:41,392][25689] Avg episode reward: [(0, '-1.992')] [2022-07-11 05:47:42,127][26022] Updated weights on worker 0-0, policy_version 1060840 (0.00088) [2022-07-11 05:47:43,788][26022] Updated weights on worker 0-0, policy_version 1060850 (0.00081) [2022-07-11 05:47:45,846][26022] Updated weights on worker 0-0, policy_version 1060860 (0.00110) [2022-07-11 05:47:46,402][25689] Fps is (10 sec: 5827.8, 60 sec: 5654.0, 300 sec: 5590.9). Total num frames: 1086324736. Throughput: 0: 5096.6. Samples: 1086316784. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:46,402][25689] Avg episode reward: [(0, '-1.752')] [2022-07-11 05:47:47,570][26022] Updated weights on worker 0-0, policy_version 1060870 (0.00099) [2022-07-11 05:47:49,283][26022] Updated weights on worker 0-0, policy_version 1060880 (0.00094) [2022-07-11 05:47:51,267][26022] Updated weights on worker 0-0, policy_version 1060890 (0.00579) [2022-07-11 05:47:51,443][25689] Fps is (10 sec: 5705.2, 60 sec: 5626.8, 300 sec: 5590.5). Total num frames: 1086352384. Throughput: 0: 5945.9. Samples: 1086350628. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:51,443][25689] Avg episode reward: [(0, '-1.050')] [2022-07-11 05:47:53,161][26022] Updated weights on worker 0-0, policy_version 1060900 (0.00091) [2022-07-11 05:47:54,679][26022] Updated weights on worker 0-0, policy_version 1060910 (0.00089) [2022-07-11 05:47:56,516][25689] Fps is (10 sec: 5467.3, 60 sec: 5620.6, 300 sec: 5586.1). Total num frames: 1086380032. Throughput: 0: 5946.5. Samples: 1086384674. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:47:56,516][25689] Avg episode reward: [(0, '-1.718')] [2022-07-11 05:47:56,790][26022] Updated weights on worker 0-0, policy_version 1060920 (0.00096) [2022-07-11 05:47:58,236][26022] Updated weights on worker 0-0, policy_version 1060930 (0.00085) [2022-07-11 05:48:00,361][26022] Updated weights on worker 0-0, policy_version 1060940 (0.00086) [2022-07-11 05:48:01,582][25689] Fps is (10 sec: 5655.6, 60 sec: 5665.6, 300 sec: 5600.2). Total num frames: 1086409728. Throughput: 0: 5904.4. Samples: 1086418806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:48:01,583][25689] Avg episode reward: [(0, '-0.828')] [2022-07-11 05:48:02,220][26022] Updated weights on worker 0-0, policy_version 1060950 (0.00095) [2022-07-11 05:48:04,103][26022] Updated weights on worker 0-0, policy_version 1060960 (0.00084) [2022-07-11 05:48:05,949][26022] Updated weights on worker 0-0, policy_version 1060970 (0.00089) [2022-07-11 05:48:06,635][25689] Fps is (10 sec: 5565.8, 60 sec: 5643.9, 300 sec: 5591.0). Total num frames: 1086436352. Throughput: 0: 5784.6. Samples: 1086433824. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:48:06,635][25689] Avg episode reward: [(0, '-0.970')] [2022-07-11 05:48:07,801][26022] Updated weights on worker 0-0, policy_version 1060980 (0.00089) [2022-07-11 05:48:09,519][26022] Updated weights on worker 0-0, policy_version 1060990 (0.00089) [2022-07-11 05:48:11,506][26022] Updated weights on worker 0-0, policy_version 1061000 (0.00084) [2022-07-11 05:48:11,685][25689] Fps is (10 sec: 5473.4, 60 sec: 5650.6, 300 sec: 5590.4). Total num frames: 1086465024. Throughput: 0: 5792.2. Samples: 1086467874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:48:11,685][25689] Avg episode reward: [(0, '-0.321')] [2022-07-11 05:48:13,070][26022] Updated weights on worker 0-0, policy_version 1061010 (0.00084) [2022-07-11 05:48:15,139][26022] Updated weights on worker 0-0, policy_version 1061020 (0.00083) [2022-07-11 05:48:16,730][25689] Fps is (10 sec: 5680.1, 60 sec: 5648.6, 300 sec: 5589.6). Total num frames: 1086493696. Throughput: 0: 5796.3. Samples: 1086501844. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:48:16,731][25689] Avg episode reward: [(0, '-0.726')] [2022-07-11 05:48:16,920][26022] Updated weights on worker 0-0, policy_version 1061030 (0.00082) [2022-07-11 05:48:18,568][26022] Updated weights on worker 0-0, policy_version 1061040 (0.00086) [2022-07-11 05:48:20,699][26022] Updated weights on worker 0-0, policy_version 1061050 (0.00085) [2022-07-11 05:48:21,780][25689] Fps is (10 sec: 5578.8, 60 sec: 5613.8, 300 sec: 5585.4). Total num frames: 1086521344. Throughput: 0: 4939.7. Samples: 1086518578. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:48:21,781][25689] Avg episode reward: [(0, '-0.666')] [2022-07-11 05:48:22,303][26022] Updated weights on worker 0-0, policy_version 1061060 (0.00095) [2022-07-11 05:48:24,251][26022] Updated weights on worker 0-0, policy_version 1061070 (0.00091) [2022-07-11 05:48:26,008][26022] Updated weights on worker 0-0, policy_version 1061080 (0.00095) [2022-07-11 05:48:26,798][25689] Fps is (10 sec: 5594.1, 60 sec: 5631.2, 300 sec: 5586.2). Total num frames: 1086550016. Throughput: 0: 5895.0. Samples: 1086552686. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 05:48:26,798][25689] Avg episode reward: [(0, '-1.055')] [2022-07-11 05:48:28,002][26022] Updated weights on worker 0-0, policy_version 1061090 (0.00090) [2022-07-11 05:48:29,464][26022] Updated weights on worker 0-0, policy_version 1061100 (0.00078) [2022-07-11 05:48:31,623][26022] Updated weights on worker 0-0, policy_version 1061110 (0.00094) [2022-07-11 05:48:31,838][25689] Fps is (10 sec: 5599.8, 60 sec: 5600.8, 300 sec: 5586.2). Total num frames: 1086577664. Throughput: 0: 5876.9. Samples: 1086586310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:48:31,838][25689] Avg episode reward: [(0, '-0.928')] [2022-07-11 05:48:33,044][26022] Updated weights on worker 0-0, policy_version 1061120 (0.00089) [2022-07-11 05:48:35,202][26022] Updated weights on worker 0-0, policy_version 1061130 (0.00095) [2022-07-11 05:48:36,697][26022] Updated weights on worker 0-0, policy_version 1061140 (0.00088) [2022-07-11 05:48:36,842][25689] Fps is (10 sec: 5709.0, 60 sec: 5640.3, 300 sec: 5586.4). Total num frames: 1086607360. Throughput: 0: 5051.3. Samples: 1086603436. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:48:36,843][25689] Avg episode reward: [(0, '-0.553')] [2022-07-11 05:48:38,653][26022] Updated weights on worker 0-0, policy_version 1061150 (0.00085) [2022-07-11 05:48:40,314][26022] Updated weights on worker 0-0, policy_version 1061160 (0.00086) [2022-07-11 05:48:41,848][25689] Fps is (10 sec: 5728.4, 60 sec: 5623.4, 300 sec: 5586.7). Total num frames: 1086635008. Throughput: 0: 5926.5. Samples: 1086637510. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:48:41,848][25689] Avg episode reward: [(0, '-0.636')] [2022-07-11 05:48:42,402][26022] Updated weights on worker 0-0, policy_version 1061170 (0.00086) [2022-07-11 05:48:43,931][26022] Updated weights on worker 0-0, policy_version 1061180 (0.00082) [2022-07-11 05:48:45,769][26022] Updated weights on worker 0-0, policy_version 1061190 (0.00110) [2022-07-11 05:48:46,855][25689] Fps is (10 sec: 5625.2, 60 sec: 5606.7, 300 sec: 5588.0). Total num frames: 1086663680. Throughput: 0: 5925.7. Samples: 1086671536. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:48:46,855][25689] Avg episode reward: [(0, '-0.666')] [2022-07-11 05:48:47,758][26022] Updated weights on worker 0-0, policy_version 1061200 (0.00083) [2022-07-11 05:48:49,443][26022] Updated weights on worker 0-0, policy_version 1061210 (0.00092) [2022-07-11 05:48:51,498][26022] Updated weights on worker 0-0, policy_version 1061220 (0.00099) [2022-07-11 05:48:51,923][25689] Fps is (10 sec: 5691.9, 60 sec: 5621.2, 300 sec: 5590.7). Total num frames: 1086692352. Throughput: 0: 5082.5. Samples: 1086688392. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:48:51,923][25689] Avg episode reward: [(0, '-0.958')] [2022-07-11 05:48:53,309][26022] Updated weights on worker 0-0, policy_version 1061230 (0.00089) [2022-07-11 05:48:55,099][26022] Updated weights on worker 0-0, policy_version 1061240 (0.00083) [2022-07-11 05:48:56,967][25689] Fps is (10 sec: 5467.9, 60 sec: 5606.9, 300 sec: 5584.7). Total num frames: 1086718976. Throughput: 0: 5879.1. Samples: 1086721752. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:48:56,968][25689] Avg episode reward: [(0, '-0.167')] [2022-07-11 05:48:57,105][26022] Updated weights on worker 0-0, policy_version 1061250 (0.00084) [2022-07-11 05:48:58,635][26022] Updated weights on worker 0-0, policy_version 1061260 (0.00084) [2022-07-11 05:49:00,589][26022] Updated weights on worker 0-0, policy_version 1061270 (0.00092) [2022-07-11 05:49:01,988][25689] Fps is (10 sec: 5290.5, 60 sec: 5560.3, 300 sec: 5584.6). Total num frames: 1086745600. Throughput: 0: 5819.4. Samples: 1086754710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:01,988][25689] Avg episode reward: [(0, '-0.202')] [2022-07-11 05:49:02,656][26022] Updated weights on worker 0-0, policy_version 1061280 (0.00081) [2022-07-11 05:49:04,603][26022] Updated weights on worker 0-0, policy_version 1061290 (0.00089) [2022-07-11 05:49:06,430][26022] Updated weights on worker 0-0, policy_version 1061300 (0.00095) [2022-07-11 05:49:07,011][25689] Fps is (10 sec: 5505.7, 60 sec: 5596.9, 300 sec: 5588.7). Total num frames: 1086774272. Throughput: 0: 4898.9. Samples: 1086770278. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:07,012][25689] Avg episode reward: [(0, '-0.152')] [2022-07-11 05:49:08,486][26022] Updated weights on worker 0-0, policy_version 1061310 (0.00079) [2022-07-11 05:49:09,967][26022] Updated weights on worker 0-0, policy_version 1061320 (0.00092) [2022-07-11 05:49:11,889][26022] Updated weights on worker 0-0, policy_version 1061330 (0.00086) [2022-07-11 05:49:12,151][25689] Fps is (10 sec: 5541.5, 60 sec: 5571.7, 300 sec: 5586.8). Total num frames: 1086801920. Throughput: 0: 5691.6. Samples: 1086803522. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:12,151][25689] Avg episode reward: [(0, '-0.321')] [2022-07-11 05:49:13,546][26022] Updated weights on worker 0-0, policy_version 1061340 (0.00086) [2022-07-11 05:49:15,766][26022] Updated weights on worker 0-0, policy_version 1061350 (0.00091) [2022-07-11 05:49:17,155][25689] Fps is (10 sec: 5653.1, 60 sec: 5592.5, 300 sec: 5594.1). Total num frames: 1086831616. Throughput: 0: 5730.8. Samples: 1086837440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:17,155][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 05:49:17,188][26022] Updated weights on worker 0-0, policy_version 1061360 (0.00093) [2022-07-11 05:49:19,250][26022] Updated weights on worker 0-0, policy_version 1061370 (0.00092) [2022-07-11 05:49:20,854][26022] Updated weights on worker 0-0, policy_version 1061380 (0.00094) [2022-07-11 05:49:22,179][25689] Fps is (10 sec: 5718.4, 60 sec: 5594.8, 300 sec: 5587.1). Total num frames: 1086859264. Throughput: 0: 4937.8. Samples: 1086854410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:22,179][25689] Avg episode reward: [(0, '-0.360')] [2022-07-11 05:49:23,051][26022] Updated weights on worker 0-0, policy_version 1061390 (0.00087) [2022-07-11 05:49:24,615][26022] Updated weights on worker 0-0, policy_version 1061400 (0.00084) [2022-07-11 05:49:26,507][26022] Updated weights on worker 0-0, policy_version 1061410 (0.00089) [2022-07-11 05:49:27,218][25689] Fps is (10 sec: 5596.4, 60 sec: 5592.8, 300 sec: 5590.8). Total num frames: 1086887936. Throughput: 0: 5834.2. Samples: 1086888172. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:27,219][25689] Avg episode reward: [(0, '-0.343')] [2022-07-11 05:49:28,252][26022] Updated weights on worker 0-0, policy_version 1061420 (0.00098) [2022-07-11 05:49:30,091][26022] Updated weights on worker 0-0, policy_version 1061430 (0.00087) [2022-07-11 05:49:32,073][26022] Updated weights on worker 0-0, policy_version 1061440 (0.00082) [2022-07-11 05:49:32,267][25689] Fps is (10 sec: 5582.5, 60 sec: 5591.9, 300 sec: 5586.5). Total num frames: 1086915584. Throughput: 0: 5889.9. Samples: 1086922006. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:32,268][25689] Avg episode reward: [(0, '-0.239')] [2022-07-11 05:49:33,634][26022] Updated weights on worker 0-0, policy_version 1061450 (0.00085) [2022-07-11 05:49:35,628][26022] Updated weights on worker 0-0, policy_version 1061460 (0.00089) [2022-07-11 05:49:37,141][26022] Updated weights on worker 0-0, policy_version 1061470 (0.00072) [2022-07-11 05:49:37,274][25689] Fps is (10 sec: 5804.1, 60 sec: 5608.7, 300 sec: 5598.5). Total num frames: 1086946304. Throughput: 0: 5046.9. Samples: 1086938982. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:37,275][25689] Avg episode reward: [(0, '-0.009')] [2022-07-11 05:49:39,007][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:49:39,022][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001061479_1086954496.pth [2022-07-11 05:49:39,022][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001059513_1084941312.pth [2022-07-11 05:49:39,156][26022] Updated weights on worker 0-0, policy_version 1061480 (0.00070) [2022-07-11 05:49:40,844][26022] Updated weights on worker 0-0, policy_version 1061490 (0.00090) [2022-07-11 05:49:42,299][25689] Fps is (10 sec: 5818.1, 60 sec: 5606.9, 300 sec: 5595.2). Total num frames: 1086973952. Throughput: 0: 5925.7. Samples: 1086973640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:42,300][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 05:49:42,883][26022] Updated weights on worker 0-0, policy_version 1061500 (0.00091) [2022-07-11 05:49:44,577][26022] Updated weights on worker 0-0, policy_version 1061510 (0.00084) [2022-07-11 05:49:46,136][26022] Updated weights on worker 0-0, policy_version 1061520 (0.00089) [2022-07-11 05:49:47,326][25689] Fps is (10 sec: 5501.4, 60 sec: 5588.1, 300 sec: 5595.5). Total num frames: 1087001600. Throughput: 0: 5961.6. Samples: 1087008046. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:47,326][25689] Avg episode reward: [(0, '0.578')] [2022-07-11 05:49:48,252][26022] Updated weights on worker 0-0, policy_version 1061530 (0.00082) [2022-07-11 05:49:49,929][26022] Updated weights on worker 0-0, policy_version 1061540 (0.00081) [2022-07-11 05:49:51,737][26022] Updated weights on worker 0-0, policy_version 1061550 (0.00406) [2022-07-11 05:49:52,395][25689] Fps is (10 sec: 5680.3, 60 sec: 5605.0, 300 sec: 5599.2). Total num frames: 1087031296. Throughput: 0: 5120.9. Samples: 1087025076. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:52,395][25689] Avg episode reward: [(0, '0.334')] [2022-07-11 05:49:53,407][26022] Updated weights on worker 0-0, policy_version 1061560 (0.00064) [2022-07-11 05:49:55,246][26022] Updated weights on worker 0-0, policy_version 1061570 (0.00088) [2022-07-11 05:49:56,953][26022] Updated weights on worker 0-0, policy_version 1061580 (0.00087) [2022-07-11 05:49:57,410][25689] Fps is (10 sec: 5788.2, 60 sec: 5641.6, 300 sec: 5600.4). Total num frames: 1087059968. Throughput: 0: 5987.0. Samples: 1087059532. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:49:57,410][25689] Avg episode reward: [(0, '1.536')] [2022-07-11 05:49:58,903][26022] Updated weights on worker 0-0, policy_version 1061590 (0.00082) [2022-07-11 05:50:00,584][26022] Updated weights on worker 0-0, policy_version 1061600 (0.00086) [2022-07-11 05:50:02,456][25689] Fps is (10 sec: 5394.4, 60 sec: 5622.3, 300 sec: 5603.7). Total num frames: 1087085568. Throughput: 0: 5863.1. Samples: 1087091818. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:02,456][25689] Avg episode reward: [(0, '0.252')] [2022-07-11 05:50:02,933][26022] Updated weights on worker 0-0, policy_version 1061610 (0.00091) [2022-07-11 05:50:04,403][26022] Updated weights on worker 0-0, policy_version 1061620 (0.00080) [2022-07-11 05:50:06,556][26022] Updated weights on worker 0-0, policy_version 1061630 (0.00086) [2022-07-11 05:50:07,492][25689] Fps is (10 sec: 5484.2, 60 sec: 5637.9, 300 sec: 5601.1). Total num frames: 1087115264. Throughput: 0: 4987.7. Samples: 1087108632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:07,493][25689] Avg episode reward: [(0, '-1.051')] [2022-07-11 05:50:08,091][26022] Updated weights on worker 0-0, policy_version 1061640 (0.00090) [2022-07-11 05:50:10,174][26022] Updated weights on worker 0-0, policy_version 1061650 (0.00087) [2022-07-11 05:50:11,911][26022] Updated weights on worker 0-0, policy_version 1061660 (0.00088) [2022-07-11 05:50:12,602][25689] Fps is (10 sec: 5651.9, 60 sec: 5640.8, 300 sec: 5606.3). Total num frames: 1087142912. Throughput: 0: 5795.1. Samples: 1087142176. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:12,602][25689] Avg episode reward: [(0, '-1.098')] [2022-07-11 05:50:13,619][26022] Updated weights on worker 0-0, policy_version 1061670 (0.00081) [2022-07-11 05:50:15,619][26022] Updated weights on worker 0-0, policy_version 1061680 (0.00087) [2022-07-11 05:50:17,240][26022] Updated weights on worker 0-0, policy_version 1061690 (0.00811) [2022-07-11 05:50:17,608][25689] Fps is (10 sec: 5668.5, 60 sec: 5640.5, 300 sec: 5603.8). Total num frames: 1087172608. Throughput: 0: 5795.5. Samples: 1087176594. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:17,609][25689] Avg episode reward: [(0, '-0.750')] [2022-07-11 05:50:19,178][26022] Updated weights on worker 0-0, policy_version 1061700 (0.00089) [2022-07-11 05:50:20,816][26022] Updated weights on worker 0-0, policy_version 1061710 (0.00094) [2022-07-11 05:50:22,611][25689] Fps is (10 sec: 5729.0, 60 sec: 5642.5, 300 sec: 5607.4). Total num frames: 1087200256. Throughput: 0: 5046.5. Samples: 1087193532. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:22,612][25689] Avg episode reward: [(0, '-0.356')] [2022-07-11 05:50:22,631][26022] Updated weights on worker 0-0, policy_version 1061720 (0.00085) [2022-07-11 05:50:24,641][26022] Updated weights on worker 0-0, policy_version 1061730 (0.00090) [2022-07-11 05:50:26,366][26022] Updated weights on worker 0-0, policy_version 1061740 (0.00099) [2022-07-11 05:50:27,618][25689] Fps is (10 sec: 5626.6, 60 sec: 5645.6, 300 sec: 5608.8). Total num frames: 1087228928. Throughput: 0: 5909.0. Samples: 1087227556. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:27,620][25689] Avg episode reward: [(0, '-0.378')] [2022-07-11 05:50:28,075][26022] Updated weights on worker 0-0, policy_version 1061750 (0.00086) [2022-07-11 05:50:30,015][26022] Updated weights on worker 0-0, policy_version 1061760 (0.00090) [2022-07-11 05:50:31,875][26022] Updated weights on worker 0-0, policy_version 1061770 (0.00087) [2022-07-11 05:50:32,719][25689] Fps is (10 sec: 5571.9, 60 sec: 5640.8, 300 sec: 5604.0). Total num frames: 1087256576. Throughput: 0: 5910.4. Samples: 1087261078. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:32,719][25689] Avg episode reward: [(0, '0.950')] [2022-07-11 05:50:33,647][26022] Updated weights on worker 0-0, policy_version 1061780 (0.00081) [2022-07-11 05:50:35,340][26022] Updated weights on worker 0-0, policy_version 1061790 (0.00089) [2022-07-11 05:50:37,200][26022] Updated weights on worker 0-0, policy_version 1061800 (0.00087) [2022-07-11 05:50:37,739][25689] Fps is (10 sec: 5665.8, 60 sec: 5622.6, 300 sec: 5611.8). Total num frames: 1087286272. Throughput: 0: 5045.3. Samples: 1087278160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:37,742][25689] Avg episode reward: [(0, '0.459')] [2022-07-11 05:50:39,018][26022] Updated weights on worker 0-0, policy_version 1061810 (0.00088) [2022-07-11 05:50:40,819][26022] Updated weights on worker 0-0, policy_version 1061820 (0.00088) [2022-07-11 05:50:42,625][26022] Updated weights on worker 0-0, policy_version 1061830 (0.00082) [2022-07-11 05:50:42,761][25689] Fps is (10 sec: 5812.1, 60 sec: 5639.8, 300 sec: 5611.9). Total num frames: 1087314944. Throughput: 0: 5894.8. Samples: 1087312314. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:42,762][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 05:50:44,540][26022] Updated weights on worker 0-0, policy_version 1061840 (0.00087) [2022-07-11 05:50:46,455][26022] Updated weights on worker 0-0, policy_version 1061850 (0.00397) [2022-07-11 05:50:47,827][25689] Fps is (10 sec: 5583.0, 60 sec: 5636.1, 300 sec: 5609.7). Total num frames: 1087342592. Throughput: 0: 5897.8. Samples: 1087346744. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:47,827][25689] Avg episode reward: [(0, '0.119')] [2022-07-11 05:50:48,045][26022] Updated weights on worker 0-0, policy_version 1061860 (0.00085) [2022-07-11 05:50:49,953][26022] Updated weights on worker 0-0, policy_version 1061870 (0.00086) [2022-07-11 05:50:51,611][26022] Updated weights on worker 0-0, policy_version 1061880 (0.00086) [2022-07-11 05:50:52,878][25689] Fps is (10 sec: 5567.2, 60 sec: 5620.9, 300 sec: 5609.6). Total num frames: 1087371264. Throughput: 0: 5934.5. Samples: 1087380710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:52,878][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 05:50:53,332][26022] Updated weights on worker 0-0, policy_version 1061890 (0.00092) [2022-07-11 05:50:55,494][26022] Updated weights on worker 0-0, policy_version 1061900 (0.00084) [2022-07-11 05:50:57,120][26022] Updated weights on worker 0-0, policy_version 1061910 (0.00086) [2022-07-11 05:50:57,949][25689] Fps is (10 sec: 5665.2, 60 sec: 5615.6, 300 sec: 5611.9). Total num frames: 1087399936. Throughput: 0: 5911.5. Samples: 1087397630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:50:57,950][25689] Avg episode reward: [(0, '-0.862')] [2022-07-11 05:50:59,035][26022] Updated weights on worker 0-0, policy_version 1061920 (0.00087) [2022-07-11 05:51:00,542][26022] Updated weights on worker 0-0, policy_version 1061930 (0.00093) [2022-07-11 05:51:02,881][26022] Updated weights on worker 0-0, policy_version 1061940 (0.00082) [2022-07-11 05:51:02,995][25689] Fps is (10 sec: 5465.3, 60 sec: 5632.5, 300 sec: 5611.3). Total num frames: 1087426560. Throughput: 0: 5891.4. Samples: 1087431520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:02,996][25689] Avg episode reward: [(0, '-0.106')] [2022-07-11 05:51:04,679][26022] Updated weights on worker 0-0, policy_version 1061950 (0.00084) [2022-07-11 05:51:06,469][26022] Updated weights on worker 0-0, policy_version 1061960 (0.00083) [2022-07-11 05:51:08,018][25689] Fps is (10 sec: 5491.9, 60 sec: 5616.9, 300 sec: 5611.9). Total num frames: 1087455232. Throughput: 0: 5797.7. Samples: 1087463804. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:08,018][25689] Avg episode reward: [(0, '0.037')] [2022-07-11 05:51:08,468][26022] Updated weights on worker 0-0, policy_version 1061970 (0.00095) [2022-07-11 05:51:10,084][26022] Updated weights on worker 0-0, policy_version 1061980 (0.00088) [2022-07-11 05:51:11,988][26022] Updated weights on worker 0-0, policy_version 1061990 (0.00085) [2022-07-11 05:51:13,121][25689] Fps is (10 sec: 5663.1, 60 sec: 5634.4, 300 sec: 5610.8). Total num frames: 1087483904. Throughput: 0: 4940.1. Samples: 1087480714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:13,122][25689] Avg episode reward: [(0, '-0.710')] [2022-07-11 05:51:13,726][26022] Updated weights on worker 0-0, policy_version 1062000 (0.00083) [2022-07-11 05:51:15,531][26022] Updated weights on worker 0-0, policy_version 1062010 (0.00094) [2022-07-11 05:51:17,359][26022] Updated weights on worker 0-0, policy_version 1062020 (0.00087) [2022-07-11 05:51:18,142][25689] Fps is (10 sec: 5663.9, 60 sec: 5616.2, 300 sec: 5620.9). Total num frames: 1087512576. Throughput: 0: 5818.9. Samples: 1087515132. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:18,143][25689] Avg episode reward: [(0, '-0.567')] [2022-07-11 05:51:19,069][26022] Updated weights on worker 0-0, policy_version 1062030 (0.00081) [2022-07-11 05:51:21,044][26022] Updated weights on worker 0-0, policy_version 1062040 (0.00090) [2022-07-11 05:51:22,745][26022] Updated weights on worker 0-0, policy_version 1062050 (0.00084) [2022-07-11 05:51:23,191][25689] Fps is (10 sec: 5694.9, 60 sec: 5628.8, 300 sec: 5623.7). Total num frames: 1087541248. Throughput: 0: 5839.8. Samples: 1087549456. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:23,191][25689] Avg episode reward: [(0, '-0.540')] [2022-07-11 05:51:24,531][26022] Updated weights on worker 0-0, policy_version 1062060 (0.00112) [2022-07-11 05:51:26,379][26022] Updated weights on worker 0-0, policy_version 1062070 (0.00085) [2022-07-11 05:51:28,116][26022] Updated weights on worker 0-0, policy_version 1062080 (0.00084) [2022-07-11 05:51:28,249][25689] Fps is (10 sec: 5775.5, 60 sec: 5641.0, 300 sec: 5621.5). Total num frames: 1087570944. Throughput: 0: 5078.0. Samples: 1087566536. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:28,249][25689] Avg episode reward: [(0, '0.594')] [2022-07-11 05:51:29,981][26022] Updated weights on worker 0-0, policy_version 1062090 (0.01020) [2022-07-11 05:51:31,792][26022] Updated weights on worker 0-0, policy_version 1062100 (0.00091) [2022-07-11 05:51:33,307][25689] Fps is (10 sec: 5668.2, 60 sec: 5644.9, 300 sec: 5624.2). Total num frames: 1087598592. Throughput: 0: 5923.6. Samples: 1087600288. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:33,308][25689] Avg episode reward: [(0, '-0.642')] [2022-07-11 05:51:33,538][26022] Updated weights on worker 0-0, policy_version 1062110 (0.00087) [2022-07-11 05:51:35,479][26022] Updated weights on worker 0-0, policy_version 1062120 (0.00085) [2022-07-11 05:51:37,240][26022] Updated weights on worker 0-0, policy_version 1062130 (0.00090) [2022-07-11 05:51:38,327][25689] Fps is (10 sec: 5588.1, 60 sec: 5628.1, 300 sec: 5627.5). Total num frames: 1087627264. Throughput: 0: 5899.7. Samples: 1087634216. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:38,327][25689] Avg episode reward: [(0, '-0.306')] [2022-07-11 05:51:38,987][26022] Updated weights on worker 0-0, policy_version 1062140 (0.00090) [2022-07-11 05:51:39,294][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:51:39,306][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001062141_1087632384.pth [2022-07-11 05:51:39,306][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001060165_1085608960.pth [2022-07-11 05:51:40,741][26022] Updated weights on worker 0-0, policy_version 1062150 (0.00094) [2022-07-11 05:51:42,749][26022] Updated weights on worker 0-0, policy_version 1062160 (0.00080) [2022-07-11 05:51:43,400][25689] Fps is (10 sec: 5479.0, 60 sec: 5589.6, 300 sec: 5617.6). Total num frames: 1087653888. Throughput: 0: 5032.9. Samples: 1087651166. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:43,400][25689] Avg episode reward: [(0, '0.847')] [2022-07-11 05:51:44,361][26022] Updated weights on worker 0-0, policy_version 1062170 (0.00081) [2022-07-11 05:51:46,408][26022] Updated weights on worker 0-0, policy_version 1062180 (0.00088) [2022-07-11 05:51:47,795][26022] Updated weights on worker 0-0, policy_version 1062190 (0.00079) [2022-07-11 05:51:48,439][25689] Fps is (10 sec: 5772.1, 60 sec: 5659.6, 300 sec: 5625.9). Total num frames: 1087685632. Throughput: 0: 5889.7. Samples: 1087685452. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:48,440][25689] Avg episode reward: [(0, '1.591')] [2022-07-11 05:51:50,055][26022] Updated weights on worker 0-0, policy_version 1062200 (0.00083) [2022-07-11 05:51:51,385][26022] Updated weights on worker 0-0, policy_version 1062210 (0.00088) [2022-07-11 05:51:53,482][25689] Fps is (10 sec: 5789.2, 60 sec: 5626.5, 300 sec: 5621.7). Total num frames: 1087712256. Throughput: 0: 5904.5. Samples: 1087719410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:53,483][25689] Avg episode reward: [(0, '2.651')] [2022-07-11 05:51:53,542][26022] Updated weights on worker 0-0, policy_version 1062220 (0.00090) [2022-07-11 05:51:55,426][26022] Updated weights on worker 0-0, policy_version 1062230 (0.00089) [2022-07-11 05:51:57,118][26022] Updated weights on worker 0-0, policy_version 1062240 (0.00086) [2022-07-11 05:51:58,503][25689] Fps is (10 sec: 5596.4, 60 sec: 5648.2, 300 sec: 5631.7). Total num frames: 1087741952. Throughput: 0: 5053.5. Samples: 1087736174. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:51:58,503][25689] Avg episode reward: [(0, '2.844')] [2022-07-11 05:51:59,102][26022] Updated weights on worker 0-0, policy_version 1062250 (0.00086) [2022-07-11 05:52:00,877][26022] Updated weights on worker 0-0, policy_version 1062260 (0.00090) [2022-07-11 05:52:02,925][26022] Updated weights on worker 0-0, policy_version 1062270 (0.00093) [2022-07-11 05:52:03,513][25689] Fps is (10 sec: 5614.7, 60 sec: 5651.5, 300 sec: 5628.1). Total num frames: 1087768576. Throughput: 0: 5864.8. Samples: 1087769126. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:52:03,513][25689] Avg episode reward: [(0, '1.948')] [2022-07-11 05:52:04,802][26022] Updated weights on worker 0-0, policy_version 1062280 (0.00080) [2022-07-11 05:52:06,342][26022] Updated weights on worker 0-0, policy_version 1062290 (0.00080) [2022-07-11 05:52:08,335][26022] Updated weights on worker 0-0, policy_version 1062300 (0.00115) [2022-07-11 05:52:08,534][25689] Fps is (10 sec: 5308.1, 60 sec: 5617.8, 300 sec: 5623.2). Total num frames: 1087795200. Throughput: 0: 5859.7. Samples: 1087803202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:52:08,535][25689] Avg episode reward: [(0, '1.693')] [2022-07-11 05:52:09,895][26022] Updated weights on worker 0-0, policy_version 1062310 (0.00087) [2022-07-11 05:52:11,873][26022] Updated weights on worker 0-0, policy_version 1062320 (0.00085) [2022-07-11 05:52:13,546][26022] Updated weights on worker 0-0, policy_version 1062330 (0.00079) [2022-07-11 05:52:13,659][25689] Fps is (10 sec: 5651.9, 60 sec: 5649.6, 300 sec: 5628.1). Total num frames: 1087825920. Throughput: 0: 5001.7. Samples: 1087820328. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:52:13,659][25689] Avg episode reward: [(0, '1.001')] [2022-07-11 05:52:15,401][26022] Updated weights on worker 0-0, policy_version 1062340 (0.00090) [2022-07-11 05:52:17,320][26022] Updated weights on worker 0-0, policy_version 1062350 (0.00091) [2022-07-11 05:52:18,695][25689] Fps is (10 sec: 5845.2, 60 sec: 5648.3, 300 sec: 5624.7). Total num frames: 1087854592. Throughput: 0: 5870.8. Samples: 1087854718. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:52:18,695][25689] Avg episode reward: [(0, '-0.424')] [2022-07-11 05:52:18,908][26022] Updated weights on worker 0-0, policy_version 1062360 (0.00084) [2022-07-11 05:52:20,740][26022] Updated weights on worker 0-0, policy_version 1062370 (0.00084) [2022-07-11 05:52:22,623][26022] Updated weights on worker 0-0, policy_version 1062380 (0.00078) [2022-07-11 05:52:23,769][25689] Fps is (10 sec: 5772.9, 60 sec: 5662.7, 300 sec: 5630.7). Total num frames: 1087884288. Throughput: 0: 5930.5. Samples: 1087889258. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:52:23,770][25689] Avg episode reward: [(0, '-0.878')] [2022-07-11 05:52:24,187][26022] Updated weights on worker 0-0, policy_version 1062390 (0.00085) [2022-07-11 05:52:26,295][26022] Updated weights on worker 0-0, policy_version 1062400 (0.00084) [2022-07-11 05:52:27,959][26022] Updated weights on worker 0-0, policy_version 1062410 (0.00095) [2022-07-11 05:52:28,783][25689] Fps is (10 sec: 5785.6, 60 sec: 5649.9, 300 sec: 5628.4). Total num frames: 1087912960. Throughput: 0: 5083.8. Samples: 1087906146. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 05:52:28,785][25689] Avg episode reward: [(0, '-0.472')] [2022-07-11 05:52:30,030][26022] Updated weights on worker 0-0, policy_version 1062420 (0.00092) [2022-07-11 05:52:31,503][26022] Updated weights on worker 0-0, policy_version 1062430 (0.00084) [2022-07-11 05:52:33,537][26022] Updated weights on worker 0-0, policy_version 1062440 (0.00087) [2022-07-11 05:52:33,851][25689] Fps is (10 sec: 5383.3, 60 sec: 5615.3, 300 sec: 5621.5). Total num frames: 1087938560. Throughput: 0: 5930.0. Samples: 1087940068. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:52:33,853][25689] Avg episode reward: [(0, '-1.242')] [2022-07-11 05:52:34,883][26022] Updated weights on worker 0-0, policy_version 1062450 (0.00087) [2022-07-11 05:52:37,185][26022] Updated weights on worker 0-0, policy_version 1062460 (0.00094) [2022-07-11 05:52:38,596][26022] Updated weights on worker 0-0, policy_version 1062470 (0.00081) [2022-07-11 05:52:38,892][25689] Fps is (10 sec: 5672.7, 60 sec: 5664.0, 300 sec: 5631.1). Total num frames: 1087970304. Throughput: 0: 5928.1. Samples: 1087974450. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:52:38,893][25689] Avg episode reward: [(0, '-1.281')] [2022-07-11 05:52:40,575][26022] Updated weights on worker 0-0, policy_version 1062480 (0.00097) [2022-07-11 05:52:42,182][26022] Updated weights on worker 0-0, policy_version 1062490 (0.00083) [2022-07-11 05:52:43,980][25689] Fps is (10 sec: 5863.7, 60 sec: 5679.5, 300 sec: 5622.8). Total num frames: 1087997952. Throughput: 0: 5069.9. Samples: 1087991722. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:52:43,980][25689] Avg episode reward: [(0, '-1.893')] [2022-07-11 05:52:44,258][26022] Updated weights on worker 0-0, policy_version 1062500 (0.00742) [2022-07-11 05:52:45,876][26022] Updated weights on worker 0-0, policy_version 1062510 (0.00080) [2022-07-11 05:52:47,869][26022] Updated weights on worker 0-0, policy_version 1062520 (0.00091) [2022-07-11 05:52:49,017][25689] Fps is (10 sec: 5663.4, 60 sec: 5645.8, 300 sec: 5629.7). Total num frames: 1088027648. Throughput: 0: 5932.6. Samples: 1088026188. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:52:49,018][25689] Avg episode reward: [(0, '-1.056')] [2022-07-11 05:52:49,458][26022] Updated weights on worker 0-0, policy_version 1062530 (0.00085) [2022-07-11 05:52:51,395][26022] Updated weights on worker 0-0, policy_version 1062540 (0.00079) [2022-07-11 05:52:52,955][26022] Updated weights on worker 0-0, policy_version 1062550 (0.00086) [2022-07-11 05:52:54,135][25689] Fps is (10 sec: 5747.7, 60 sec: 5672.7, 300 sec: 5632.3). Total num frames: 1088056320. Throughput: 0: 5940.1. Samples: 1088060556. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:52:54,135][25689] Avg episode reward: [(0, '-0.701')] [2022-07-11 05:52:54,983][26022] Updated weights on worker 0-0, policy_version 1062560 (0.00083) [2022-07-11 05:52:56,659][26022] Updated weights on worker 0-0, policy_version 1062570 (0.00095) [2022-07-11 05:52:58,496][26022] Updated weights on worker 0-0, policy_version 1062580 (0.00081) [2022-07-11 05:52:59,143][25689] Fps is (10 sec: 5663.3, 60 sec: 5657.0, 300 sec: 5630.0). Total num frames: 1088084992. Throughput: 0: 5943.2. Samples: 1088094804. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:52:59,144][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 05:53:00,150][26022] Updated weights on worker 0-0, policy_version 1062590 (0.00087) [2022-07-11 05:53:02,348][26022] Updated weights on worker 0-0, policy_version 1062600 (0.00090) [2022-07-11 05:53:04,034][26022] Updated weights on worker 0-0, policy_version 1062610 (0.00083) [2022-07-11 05:53:04,182][25689] Fps is (10 sec: 5605.5, 60 sec: 5671.2, 300 sec: 5633.7). Total num frames: 1088112640. Throughput: 0: 5851.2. Samples: 1088109930. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:04,182][25689] Avg episode reward: [(0, '1.070')] [2022-07-11 05:53:05,957][26022] Updated weights on worker 0-0, policy_version 1062620 (0.00087) [2022-07-11 05:53:07,792][26022] Updated weights on worker 0-0, policy_version 1062630 (0.00083) [2022-07-11 05:53:09,227][25689] Fps is (10 sec: 5382.2, 60 sec: 5669.0, 300 sec: 5626.9). Total num frames: 1088139264. Throughput: 0: 5828.4. Samples: 1088143976. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:09,227][25689] Avg episode reward: [(0, '1.823')] [2022-07-11 05:53:09,564][26022] Updated weights on worker 0-0, policy_version 1062640 (0.00085) [2022-07-11 05:53:11,379][26022] Updated weights on worker 0-0, policy_version 1062650 (0.00087) [2022-07-11 05:53:13,204][26022] Updated weights on worker 0-0, policy_version 1062660 (0.00097) [2022-07-11 05:53:14,270][25689] Fps is (10 sec: 5583.1, 60 sec: 5659.7, 300 sec: 5630.4). Total num frames: 1088168960. Throughput: 0: 5835.2. Samples: 1088178048. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:14,270][25689] Avg episode reward: [(0, '1.552')] [2022-07-11 05:53:14,965][26022] Updated weights on worker 0-0, policy_version 1062670 (0.00085) [2022-07-11 05:53:16,862][26022] Updated weights on worker 0-0, policy_version 1062680 (0.00084) [2022-07-11 05:53:18,730][26022] Updated weights on worker 0-0, policy_version 1062690 (0.00084) [2022-07-11 05:53:19,290][25689] Fps is (10 sec: 5799.9, 60 sec: 5661.2, 300 sec: 5634.4). Total num frames: 1088197632. Throughput: 0: 4980.7. Samples: 1088195152. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:19,291][25689] Avg episode reward: [(0, '0.988')] [2022-07-11 05:53:20,335][26022] Updated weights on worker 0-0, policy_version 1062700 (0.00089) [2022-07-11 05:53:22,403][26022] Updated weights on worker 0-0, policy_version 1062710 (0.00087) [2022-07-11 05:53:24,111][26022] Updated weights on worker 0-0, policy_version 1062720 (0.00089) [2022-07-11 05:53:24,303][25689] Fps is (10 sec: 5715.5, 60 sec: 5650.1, 300 sec: 5634.5). Total num frames: 1088226304. Throughput: 0: 5912.7. Samples: 1088228898. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:24,303][25689] Avg episode reward: [(0, '0.445')] [2022-07-11 05:53:25,930][26022] Updated weights on worker 0-0, policy_version 1062730 (0.00115) [2022-07-11 05:53:28,032][26022] Updated weights on worker 0-0, policy_version 1062740 (0.00093) [2022-07-11 05:53:29,318][25689] Fps is (10 sec: 5718.6, 60 sec: 5649.9, 300 sec: 5638.4). Total num frames: 1088254976. Throughput: 0: 5892.6. Samples: 1088262364. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:29,318][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 05:53:29,684][26022] Updated weights on worker 0-0, policy_version 1062750 (0.00092) [2022-07-11 05:53:31,575][26022] Updated weights on worker 0-0, policy_version 1062760 (0.00093) [2022-07-11 05:53:33,337][26022] Updated weights on worker 0-0, policy_version 1062770 (0.00084) [2022-07-11 05:53:34,449][25689] Fps is (10 sec: 5550.8, 60 sec: 5677.8, 300 sec: 5629.1). Total num frames: 1088282624. Throughput: 0: 5015.4. Samples: 1088279256. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:34,449][25689] Avg episode reward: [(0, '-0.110')] [2022-07-11 05:53:35,143][26022] Updated weights on worker 0-0, policy_version 1062780 (0.00092) [2022-07-11 05:53:36,898][26022] Updated weights on worker 0-0, policy_version 1062790 (0.00087) [2022-07-11 05:53:38,813][26022] Updated weights on worker 0-0, policy_version 1062800 (0.00088) [2022-07-11 05:53:39,380][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:53:39,390][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001062804_1088311296.pth [2022-07-11 05:53:39,390][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001060822_1086281728.pth [2022-07-11 05:53:39,492][25689] Fps is (10 sec: 5535.6, 60 sec: 5626.9, 300 sec: 5631.9). Total num frames: 1088311296. Throughput: 0: 5862.0. Samples: 1088313574. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:39,492][25689] Avg episode reward: [(0, '-0.625')] [2022-07-11 05:53:40,671][26022] Updated weights on worker 0-0, policy_version 1062810 (0.00089) [2022-07-11 05:53:42,368][26022] Updated weights on worker 0-0, policy_version 1062820 (0.00087) [2022-07-11 05:53:44,151][26022] Updated weights on worker 0-0, policy_version 1062830 (0.00089) [2022-07-11 05:53:44,505][25689] Fps is (10 sec: 5600.7, 60 sec: 5633.9, 300 sec: 5628.3). Total num frames: 1088338944. Throughput: 0: 5873.1. Samples: 1088347546. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:44,505][25689] Avg episode reward: [(0, '-1.146')] [2022-07-11 05:53:45,831][26022] Updated weights on worker 0-0, policy_version 1062840 (0.00084) [2022-07-11 05:53:48,015][26022] Updated weights on worker 0-0, policy_version 1062850 (0.00082) [2022-07-11 05:53:49,543][25689] Fps is (10 sec: 5603.4, 60 sec: 5616.9, 300 sec: 5628.9). Total num frames: 1088367616. Throughput: 0: 5053.5. Samples: 1088364572. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:49,544][25689] Avg episode reward: [(0, '-0.176')] [2022-07-11 05:53:49,573][26022] Updated weights on worker 0-0, policy_version 1062860 (0.00085) [2022-07-11 05:53:51,337][26022] Updated weights on worker 0-0, policy_version 1062870 (0.00084) [2022-07-11 05:53:53,334][26022] Updated weights on worker 0-0, policy_version 1062880 (0.00090) [2022-07-11 05:53:54,593][25689] Fps is (10 sec: 5684.3, 60 sec: 5623.2, 300 sec: 5635.6). Total num frames: 1088396288. Throughput: 0: 5921.7. Samples: 1088398544. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:54,593][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 05:53:54,866][26022] Updated weights on worker 0-0, policy_version 1062890 (0.00085) [2022-07-11 05:53:56,975][26022] Updated weights on worker 0-0, policy_version 1062900 (0.00099) [2022-07-11 05:53:58,494][26022] Updated weights on worker 0-0, policy_version 1062910 (0.00081) [2022-07-11 05:53:59,643][25689] Fps is (10 sec: 5576.3, 60 sec: 5602.4, 300 sec: 5638.5). Total num frames: 1088423936. Throughput: 0: 5885.1. Samples: 1088432164. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:53:59,645][25689] Avg episode reward: [(0, '-0.512')] [2022-07-11 05:54:00,598][26022] Updated weights on worker 0-0, policy_version 1062920 (0.00085) [2022-07-11 05:54:02,184][26022] Updated weights on worker 0-0, policy_version 1062930 (0.00089) [2022-07-11 05:54:04,676][25689] Fps is (10 sec: 5281.0, 60 sec: 5569.1, 300 sec: 5628.0). Total num frames: 1088449536. Throughput: 0: 5007.6. Samples: 1088448554. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:04,676][25689] Avg episode reward: [(0, '-0.690')] [2022-07-11 05:54:04,691][26022] Updated weights on worker 0-0, policy_version 1062940 (0.00083) [2022-07-11 05:54:06,390][26022] Updated weights on worker 0-0, policy_version 1062950 (0.00091) [2022-07-11 05:54:08,238][26022] Updated weights on worker 0-0, policy_version 1062960 (0.00090) [2022-07-11 05:54:09,686][25689] Fps is (10 sec: 5608.0, 60 sec: 5640.0, 300 sec: 5640.8). Total num frames: 1088480256. Throughput: 0: 5763.8. Samples: 1088480670. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:09,686][25689] Avg episode reward: [(0, '-0.046')] [2022-07-11 05:54:09,973][26022] Updated weights on worker 0-0, policy_version 1062970 (0.00081) [2022-07-11 05:54:11,950][26022] Updated weights on worker 0-0, policy_version 1062980 (0.00092) [2022-07-11 05:54:13,798][26022] Updated weights on worker 0-0, policy_version 1062990 (0.00090) [2022-07-11 05:54:14,768][25689] Fps is (10 sec: 5682.0, 60 sec: 5585.6, 300 sec: 5629.0). Total num frames: 1088506880. Throughput: 0: 5716.9. Samples: 1088513882. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:14,769][25689] Avg episode reward: [(0, '0.492')] [2022-07-11 05:54:15,557][26022] Updated weights on worker 0-0, policy_version 1063000 (0.00090) [2022-07-11 05:54:17,195][26022] Updated weights on worker 0-0, policy_version 1063010 (0.00090) [2022-07-11 05:54:19,327][26022] Updated weights on worker 0-0, policy_version 1063020 (0.00054) [2022-07-11 05:54:19,794][25689] Fps is (10 sec: 5369.1, 60 sec: 5568.2, 300 sec: 5629.0). Total num frames: 1088534528. Throughput: 0: 4887.9. Samples: 1088530660. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:19,795][25689] Avg episode reward: [(0, '0.469')] [2022-07-11 05:54:20,954][26022] Updated weights on worker 0-0, policy_version 1063030 (0.00089) [2022-07-11 05:54:23,030][26022] Updated weights on worker 0-0, policy_version 1063040 (0.00087) [2022-07-11 05:54:24,433][26022] Updated weights on worker 0-0, policy_version 1063050 (0.00090) [2022-07-11 05:54:24,796][25689] Fps is (10 sec: 5820.3, 60 sec: 5603.0, 300 sec: 5636.5). Total num frames: 1088565248. Throughput: 0: 5772.5. Samples: 1088564700. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:24,797][25689] Avg episode reward: [(0, '0.345')] [2022-07-11 05:54:26,718][26022] Updated weights on worker 0-0, policy_version 1063060 (0.00082) [2022-07-11 05:54:28,068][26022] Updated weights on worker 0-0, policy_version 1063070 (0.00084) [2022-07-11 05:54:29,821][25689] Fps is (10 sec: 5718.7, 60 sec: 5568.2, 300 sec: 5633.6). Total num frames: 1088591872. Throughput: 0: 5849.5. Samples: 1088598454. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:29,822][25689] Avg episode reward: [(0, '-0.132')] [2022-07-11 05:54:30,163][26022] Updated weights on worker 0-0, policy_version 1063080 (0.00088) [2022-07-11 05:54:31,963][26022] Updated weights on worker 0-0, policy_version 1063090 (0.00086) [2022-07-11 05:54:33,842][26022] Updated weights on worker 0-0, policy_version 1063100 (0.00083) [2022-07-11 05:54:34,894][25689] Fps is (10 sec: 5475.9, 60 sec: 5590.5, 300 sec: 5625.4). Total num frames: 1088620544. Throughput: 0: 5026.6. Samples: 1088615048. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:34,895][25689] Avg episode reward: [(0, '-0.541')] [2022-07-11 05:54:35,651][26022] Updated weights on worker 0-0, policy_version 1063110 (0.00094) [2022-07-11 05:54:37,397][26022] Updated weights on worker 0-0, policy_version 1063120 (0.00096) [2022-07-11 05:54:39,254][26022] Updated weights on worker 0-0, policy_version 1063130 (0.00091) [2022-07-11 05:54:39,955][25689] Fps is (10 sec: 5557.9, 60 sec: 5572.0, 300 sec: 5624.8). Total num frames: 1088648192. Throughput: 0: 5864.9. Samples: 1088648900. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:39,955][25689] Avg episode reward: [(0, '-1.142')] [2022-07-11 05:54:41,101][26022] Updated weights on worker 0-0, policy_version 1063140 (0.00087) [2022-07-11 05:54:42,722][26022] Updated weights on worker 0-0, policy_version 1063150 (0.00086) [2022-07-11 05:54:44,832][26022] Updated weights on worker 0-0, policy_version 1063160 (0.00085) [2022-07-11 05:54:45,052][25689] Fps is (10 sec: 5544.4, 60 sec: 5581.1, 300 sec: 5626.9). Total num frames: 1088676864. Throughput: 0: 5841.5. Samples: 1088683026. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:45,053][25689] Avg episode reward: [(0, '-1.016')] [2022-07-11 05:54:46,292][26022] Updated weights on worker 0-0, policy_version 1063170 (0.00083) [2022-07-11 05:54:48,448][26022] Updated weights on worker 0-0, policy_version 1063180 (0.00086) [2022-07-11 05:54:50,042][26022] Updated weights on worker 0-0, policy_version 1063190 (0.00085) [2022-07-11 05:54:50,079][25689] Fps is (10 sec: 5765.3, 60 sec: 5599.1, 300 sec: 5627.7). Total num frames: 1088706560. Throughput: 0: 5010.7. Samples: 1088699958. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:50,079][25689] Avg episode reward: [(0, '-0.828')] [2022-07-11 05:54:52,019][26022] Updated weights on worker 0-0, policy_version 1063200 (0.00089) [2022-07-11 05:54:53,779][26022] Updated weights on worker 0-0, policy_version 1063210 (0.00082) [2022-07-11 05:54:55,122][25689] Fps is (10 sec: 5694.8, 60 sec: 5582.8, 300 sec: 5623.7). Total num frames: 1088734208. Throughput: 0: 5869.2. Samples: 1088733768. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:54:55,122][25689] Avg episode reward: [(0, '-0.494')] [2022-07-11 05:54:55,620][26022] Updated weights on worker 0-0, policy_version 1063220 (0.00086) [2022-07-11 05:54:57,419][26022] Updated weights on worker 0-0, policy_version 1063230 (0.00099) [2022-07-11 05:54:59,322][26022] Updated weights on worker 0-0, policy_version 1063240 (0.00087) [2022-07-11 05:55:00,183][25689] Fps is (10 sec: 5675.5, 60 sec: 5615.6, 300 sec: 5637.2). Total num frames: 1088763904. Throughput: 0: 5878.2. Samples: 1088767804. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:00,183][25689] Avg episode reward: [(0, '0.271')] [2022-07-11 05:55:00,986][26022] Updated weights on worker 0-0, policy_version 1063250 (0.00089) [2022-07-11 05:55:03,183][26022] Updated weights on worker 0-0, policy_version 1063260 (0.00079) [2022-07-11 05:55:04,907][26022] Updated weights on worker 0-0, policy_version 1063270 (0.00082) [2022-07-11 05:55:05,237][25689] Fps is (10 sec: 5567.7, 60 sec: 5630.5, 300 sec: 5626.5). Total num frames: 1088790528. Throughput: 0: 4983.5. Samples: 1088783618. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:05,238][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 05:55:06,875][26022] Updated weights on worker 0-0, policy_version 1063280 (0.00076) [2022-07-11 05:55:08,595][26022] Updated weights on worker 0-0, policy_version 1063290 (0.00085) [2022-07-11 05:55:10,278][25689] Fps is (10 sec: 5376.0, 60 sec: 5577.0, 300 sec: 5627.8). Total num frames: 1088818176. Throughput: 0: 5818.3. Samples: 1088817482. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:10,278][25689] Avg episode reward: [(0, '0.974')] [2022-07-11 05:55:10,419][26022] Updated weights on worker 0-0, policy_version 1063300 (0.00086) [2022-07-11 05:55:12,116][26022] Updated weights on worker 0-0, policy_version 1063310 (0.00087) [2022-07-11 05:55:13,951][26022] Updated weights on worker 0-0, policy_version 1063320 (0.00088) [2022-07-11 05:55:15,359][25689] Fps is (10 sec: 5564.4, 60 sec: 5610.9, 300 sec: 5623.0). Total num frames: 1088846848. Throughput: 0: 5824.2. Samples: 1088851632. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:15,359][25689] Avg episode reward: [(0, '-0.179')] [2022-07-11 05:55:15,722][26022] Updated weights on worker 0-0, policy_version 1063330 (0.00094) [2022-07-11 05:55:17,542][26022] Updated weights on worker 0-0, policy_version 1063340 (0.00100) [2022-07-11 05:55:19,228][26022] Updated weights on worker 0-0, policy_version 1063350 (0.00085) [2022-07-11 05:55:20,363][25689] Fps is (10 sec: 5685.9, 60 sec: 5629.8, 300 sec: 5626.4). Total num frames: 1088875520. Throughput: 0: 5846.7. Samples: 1088885794. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:20,364][25689] Avg episode reward: [(0, '-0.218')] [2022-07-11 05:55:21,134][26022] Updated weights on worker 0-0, policy_version 1063360 (0.00088) [2022-07-11 05:55:22,784][26022] Updated weights on worker 0-0, policy_version 1063370 (0.00097) [2022-07-11 05:55:24,827][26022] Updated weights on worker 0-0, policy_version 1063380 (0.00086) [2022-07-11 05:55:25,441][25689] Fps is (10 sec: 5789.0, 60 sec: 5605.9, 300 sec: 5628.5). Total num frames: 1088905216. Throughput: 0: 5910.0. Samples: 1088903024. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:25,442][25689] Avg episode reward: [(0, '-0.408')] [2022-07-11 05:55:26,440][26022] Updated weights on worker 0-0, policy_version 1063390 (0.00096) [2022-07-11 05:55:28,319][26022] Updated weights on worker 0-0, policy_version 1063400 (0.00082) [2022-07-11 05:55:29,949][26022] Updated weights on worker 0-0, policy_version 1063410 (0.00080) [2022-07-11 05:55:30,514][25689] Fps is (10 sec: 5850.7, 60 sec: 5652.1, 300 sec: 5635.9). Total num frames: 1088934912. Throughput: 0: 5918.8. Samples: 1088937258. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:30,515][25689] Avg episode reward: [(0, '0.220')] [2022-07-11 05:55:31,907][26022] Updated weights on worker 0-0, policy_version 1063420 (0.00085) [2022-07-11 05:55:33,507][26022] Updated weights on worker 0-0, policy_version 1063430 (0.00084) [2022-07-11 05:55:35,607][25689] Fps is (10 sec: 5540.5, 60 sec: 5616.5, 300 sec: 5624.2). Total num frames: 1088961536. Throughput: 0: 5918.6. Samples: 1088971470. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:35,607][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 05:55:35,617][26022] Updated weights on worker 0-0, policy_version 1063440 (0.00084) [2022-07-11 05:55:37,127][26022] Updated weights on worker 0-0, policy_version 1063450 (0.00090) [2022-07-11 05:55:39,135][26022] Updated weights on worker 0-0, policy_version 1063460 (0.00086) [2022-07-11 05:55:39,404][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:55:39,416][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001063462_1088985088.pth [2022-07-11 05:55:39,417][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001061479_1086954496.pth [2022-07-11 05:55:40,608][25689] Fps is (10 sec: 5681.2, 60 sec: 5672.6, 300 sec: 5631.5). Total num frames: 1088992256. Throughput: 0: 5074.5. Samples: 1088988526. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:40,610][25689] Avg episode reward: [(0, '1.600')] [2022-07-11 05:55:40,740][26022] Updated weights on worker 0-0, policy_version 1063470 (0.00084) [2022-07-11 05:55:42,703][26022] Updated weights on worker 0-0, policy_version 1063480 (0.00091) [2022-07-11 05:55:44,377][26022] Updated weights on worker 0-0, policy_version 1063490 (0.00083) [2022-07-11 05:55:45,627][25689] Fps is (10 sec: 5825.0, 60 sec: 5663.1, 300 sec: 5632.3). Total num frames: 1089019904. Throughput: 0: 5952.0. Samples: 1089023168. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:45,627][25689] Avg episode reward: [(0, '1.821')] [2022-07-11 05:55:46,186][26022] Updated weights on worker 0-0, policy_version 1063500 (0.00081) [2022-07-11 05:55:47,824][26022] Updated weights on worker 0-0, policy_version 1063510 (0.00093) [2022-07-11 05:55:49,915][26022] Updated weights on worker 0-0, policy_version 1063520 (0.00087) [2022-07-11 05:55:50,663][25689] Fps is (10 sec: 5703.3, 60 sec: 5662.2, 300 sec: 5636.1). Total num frames: 1089049600. Throughput: 0: 5972.1. Samples: 1089057586. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:50,665][25689] Avg episode reward: [(0, '1.008')] [2022-07-11 05:55:51,559][26022] Updated weights on worker 0-0, policy_version 1063530 (0.00093) [2022-07-11 05:55:53,446][26022] Updated weights on worker 0-0, policy_version 1063540 (0.00086) [2022-07-11 05:55:55,109][26022] Updated weights on worker 0-0, policy_version 1063550 (0.00092) [2022-07-11 05:55:55,787][25689] Fps is (10 sec: 5644.2, 60 sec: 5654.7, 300 sec: 5631.6). Total num frames: 1089077248. Throughput: 0: 5096.4. Samples: 1089074316. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:55:55,787][25689] Avg episode reward: [(0, '1.907')] [2022-07-11 05:55:57,030][26022] Updated weights on worker 0-0, policy_version 1063560 (0.00082) [2022-07-11 05:55:58,897][26022] Updated weights on worker 0-0, policy_version 1063570 (0.00689) [2022-07-11 05:56:00,618][26022] Updated weights on worker 0-0, policy_version 1063580 (0.00051) [2022-07-11 05:56:00,799][25689] Fps is (10 sec: 5556.3, 60 sec: 5642.3, 300 sec: 5639.2). Total num frames: 1089105920. Throughput: 0: 5932.3. Samples: 1089108304. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:56:00,799][25689] Avg episode reward: [(0, '1.649')] [2022-07-11 05:56:02,791][26022] Updated weights on worker 0-0, policy_version 1063590 (0.00079) [2022-07-11 05:56:04,463][26022] Updated weights on worker 0-0, policy_version 1063600 (0.00081) [2022-07-11 05:56:05,821][25689] Fps is (10 sec: 5510.7, 60 sec: 5645.4, 300 sec: 5632.3). Total num frames: 1089132544. Throughput: 0: 5826.6. Samples: 1089140832. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:56:05,821][25689] Avg episode reward: [(0, '0.909')] [2022-07-11 05:56:06,309][26022] Updated weights on worker 0-0, policy_version 1063610 (0.00085) [2022-07-11 05:56:08,136][26022] Updated weights on worker 0-0, policy_version 1063620 (0.00049) [2022-07-11 05:56:10,051][26022] Updated weights on worker 0-0, policy_version 1063630 (0.00090) [2022-07-11 05:56:10,858][25689] Fps is (10 sec: 5497.3, 60 sec: 5662.6, 300 sec: 5633.5). Total num frames: 1089161216. Throughput: 0: 4946.7. Samples: 1089157484. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:56:10,858][25689] Avg episode reward: [(0, '-1.159')] [2022-07-11 05:56:11,966][26022] Updated weights on worker 0-0, policy_version 1063640 (0.00092) [2022-07-11 05:56:13,795][26022] Updated weights on worker 0-0, policy_version 1063650 (0.00087) [2022-07-11 05:56:15,194][26022] Updated weights on worker 0-0, policy_version 1063660 (0.00092) [2022-07-11 05:56:15,937][25689] Fps is (10 sec: 5668.5, 60 sec: 5662.7, 300 sec: 5632.4). Total num frames: 1089189888. Throughput: 0: 5828.9. Samples: 1089191772. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:56:15,938][25689] Avg episode reward: [(0, '-0.696')] [2022-07-11 05:56:17,221][26022] Updated weights on worker 0-0, policy_version 1063670 (0.00095) [2022-07-11 05:56:18,896][26022] Updated weights on worker 0-0, policy_version 1063680 (0.00091) [2022-07-11 05:56:20,880][26022] Updated weights on worker 0-0, policy_version 1063690 (0.00089) [2022-07-11 05:56:20,944][25689] Fps is (10 sec: 5685.2, 60 sec: 5662.5, 300 sec: 5633.2). Total num frames: 1089218560. Throughput: 0: 5846.8. Samples: 1089226090. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:56:20,945][25689] Avg episode reward: [(0, '-0.815')] [2022-07-11 05:56:22,648][26022] Updated weights on worker 0-0, policy_version 1063700 (0.00083) [2022-07-11 05:56:24,477][26022] Updated weights on worker 0-0, policy_version 1063710 (0.00080) [2022-07-11 05:56:25,950][25689] Fps is (10 sec: 5829.6, 60 sec: 5669.3, 300 sec: 5634.2). Total num frames: 1089248256. Throughput: 0: 5078.4. Samples: 1089243054. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 05:56:25,950][25689] Avg episode reward: [(0, '-0.634')] [2022-07-11 05:56:26,123][26022] Updated weights on worker 0-0, policy_version 1063720 (0.00084) [2022-07-11 05:56:28,058][26022] Updated weights on worker 0-0, policy_version 1063730 (0.00100) [2022-07-11 05:56:29,724][26022] Updated weights on worker 0-0, policy_version 1063740 (0.00290) [2022-07-11 05:56:30,973][25689] Fps is (10 sec: 5616.1, 60 sec: 5623.2, 300 sec: 5631.5). Total num frames: 1089274880. Throughput: 0: 5961.9. Samples: 1089277406. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:56:30,973][25689] Avg episode reward: [(0, '-0.833')] [2022-07-11 05:56:31,600][26022] Updated weights on worker 0-0, policy_version 1063750 (0.00081) [2022-07-11 05:56:33,547][26022] Updated weights on worker 0-0, policy_version 1063760 (0.00087) [2022-07-11 05:56:35,199][26022] Updated weights on worker 0-0, policy_version 1063770 (0.00090) [2022-07-11 05:56:36,030][25689] Fps is (10 sec: 5587.3, 60 sec: 5677.3, 300 sec: 5634.2). Total num frames: 1089304576. Throughput: 0: 5956.4. Samples: 1089311450. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:56:36,030][25689] Avg episode reward: [(0, '-1.435')] [2022-07-11 05:56:37,106][26022] Updated weights on worker 0-0, policy_version 1063780 (0.00083) [2022-07-11 05:56:38,760][26022] Updated weights on worker 0-0, policy_version 1063790 (0.00085) [2022-07-11 05:56:40,607][26022] Updated weights on worker 0-0, policy_version 1063800 (0.00078) [2022-07-11 05:56:41,053][25689] Fps is (10 sec: 5790.2, 60 sec: 5641.4, 300 sec: 5642.0). Total num frames: 1089333248. Throughput: 0: 5107.2. Samples: 1089328790. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:56:41,054][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 05:56:42,342][26022] Updated weights on worker 0-0, policy_version 1063810 (0.00085) [2022-07-11 05:56:44,175][26022] Updated weights on worker 0-0, policy_version 1063820 (0.00083) [2022-07-11 05:56:45,934][26022] Updated weights on worker 0-0, policy_version 1063830 (0.00085) [2022-07-11 05:56:46,063][25689] Fps is (10 sec: 5715.0, 60 sec: 5659.1, 300 sec: 5632.2). Total num frames: 1089361920. Throughput: 0: 5970.6. Samples: 1089363146. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:56:46,064][25689] Avg episode reward: [(0, '0.652')] [2022-07-11 05:56:47,674][26022] Updated weights on worker 0-0, policy_version 1063840 (0.00086) [2022-07-11 05:56:49,649][26022] Updated weights on worker 0-0, policy_version 1063850 (0.00084) [2022-07-11 05:56:51,084][25689] Fps is (10 sec: 5716.5, 60 sec: 5643.5, 300 sec: 5639.5). Total num frames: 1089390592. Throughput: 0: 5968.5. Samples: 1089397444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:56:51,085][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 05:56:51,390][26022] Updated weights on worker 0-0, policy_version 1063860 (0.00088) [2022-07-11 05:56:53,334][26022] Updated weights on worker 0-0, policy_version 1063870 (0.00083) [2022-07-11 05:56:54,888][26022] Updated weights on worker 0-0, policy_version 1063880 (0.00087) [2022-07-11 05:56:56,135][25689] Fps is (10 sec: 5592.2, 60 sec: 5650.4, 300 sec: 5632.1). Total num frames: 1089418240. Throughput: 0: 5128.0. Samples: 1089414550. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:56:56,137][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 05:56:56,928][26022] Updated weights on worker 0-0, policy_version 1063890 (0.00091) [2022-07-11 05:56:58,455][26022] Updated weights on worker 0-0, policy_version 1063900 (0.00082) [2022-07-11 05:57:00,591][26022] Updated weights on worker 0-0, policy_version 1063910 (0.00086) [2022-07-11 05:57:01,149][25689] Fps is (10 sec: 5697.5, 60 sec: 5667.2, 300 sec: 5642.3). Total num frames: 1089447936. Throughput: 0: 5957.5. Samples: 1089448512. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:01,150][25689] Avg episode reward: [(0, '0.914')] [2022-07-11 05:57:02,651][26022] Updated weights on worker 0-0, policy_version 1063920 (0.00088) [2022-07-11 05:57:04,369][26022] Updated weights on worker 0-0, policy_version 1063930 (0.00092) [2022-07-11 05:57:06,229][25689] Fps is (10 sec: 5579.4, 60 sec: 5661.8, 300 sec: 5641.2). Total num frames: 1089474560. Throughput: 0: 5810.2. Samples: 1089480312. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:06,236][26022] Updated weights on worker 0-0, policy_version 1063940 (0.00088) [2022-07-11 05:57:06,230][25689] Avg episode reward: [(0, '0.690')] [2022-07-11 05:57:08,199][26022] Updated weights on worker 0-0, policy_version 1063950 (0.00089) [2022-07-11 05:57:09,727][26022] Updated weights on worker 0-0, policy_version 1063960 (0.00094) [2022-07-11 05:57:11,263][25689] Fps is (10 sec: 5265.1, 60 sec: 5628.1, 300 sec: 5629.2). Total num frames: 1089501184. Throughput: 0: 4952.7. Samples: 1089497382. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:11,263][25689] Avg episode reward: [(0, '-0.230')] [2022-07-11 05:57:11,717][26022] Updated weights on worker 0-0, policy_version 1063970 (0.00082) [2022-07-11 05:57:13,304][26022] Updated weights on worker 0-0, policy_version 1063980 (0.00085) [2022-07-11 05:57:15,152][26022] Updated weights on worker 0-0, policy_version 1063990 (0.00093) [2022-07-11 05:57:16,359][25689] Fps is (10 sec: 5660.9, 60 sec: 5660.5, 300 sec: 5634.9). Total num frames: 1089531904. Throughput: 0: 5786.7. Samples: 1089531582. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:16,359][25689] Avg episode reward: [(0, '-0.392')] [2022-07-11 05:57:17,022][26022] Updated weights on worker 0-0, policy_version 1064000 (0.00088) [2022-07-11 05:57:18,767][26022] Updated weights on worker 0-0, policy_version 1064010 (0.00086) [2022-07-11 05:57:20,540][26022] Updated weights on worker 0-0, policy_version 1064020 (0.00107) [2022-07-11 05:57:21,446][25689] Fps is (10 sec: 5832.4, 60 sec: 5653.0, 300 sec: 5631.2). Total num frames: 1089560576. Throughput: 0: 5777.7. Samples: 1089565780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:21,446][25689] Avg episode reward: [(0, '0.126')] [2022-07-11 05:57:22,322][26022] Updated weights on worker 0-0, policy_version 1064030 (0.00086) [2022-07-11 05:57:24,303][26022] Updated weights on worker 0-0, policy_version 1064040 (0.00084) [2022-07-11 05:57:26,176][26022] Updated weights on worker 0-0, policy_version 1064050 (0.00096) [2022-07-11 05:57:26,453][25689] Fps is (10 sec: 5681.0, 60 sec: 5635.9, 300 sec: 5631.4). Total num frames: 1089589248. Throughput: 0: 5069.9. Samples: 1089582844. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:26,453][25689] Avg episode reward: [(0, '0.792')] [2022-07-11 05:57:27,736][26022] Updated weights on worker 0-0, policy_version 1064060 (0.00050) [2022-07-11 05:57:29,579][26022] Updated weights on worker 0-0, policy_version 1064070 (0.00083) [2022-07-11 05:57:31,500][25689] Fps is (10 sec: 5703.3, 60 sec: 5667.5, 300 sec: 5642.1). Total num frames: 1089617920. Throughput: 0: 5915.5. Samples: 1089617098. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:31,501][25689] Avg episode reward: [(0, '1.100')] [2022-07-11 05:57:31,506][26022] Updated weights on worker 0-0, policy_version 1064080 (0.00087) [2022-07-11 05:57:33,370][26022] Updated weights on worker 0-0, policy_version 1064090 (0.00090) [2022-07-11 05:57:34,968][26022] Updated weights on worker 0-0, policy_version 1064100 (0.00099) [2022-07-11 05:57:36,567][25689] Fps is (10 sec: 5568.5, 60 sec: 5632.8, 300 sec: 5627.8). Total num frames: 1089645568. Throughput: 0: 5892.3. Samples: 1089650654. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:36,567][25689] Avg episode reward: [(0, '1.360')] [2022-07-11 05:57:36,986][26022] Updated weights on worker 0-0, policy_version 1064110 (0.00086) [2022-07-11 05:57:38,470][26022] Updated weights on worker 0-0, policy_version 1064120 (0.00095) [2022-07-11 05:57:39,557][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:57:39,565][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001064124_1089662976.pth [2022-07-11 05:57:39,566][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001062141_1087632384.pth [2022-07-11 05:57:40,770][26022] Updated weights on worker 0-0, policy_version 1064130 (0.00085) [2022-07-11 05:57:41,601][25689] Fps is (10 sec: 5677.4, 60 sec: 5648.7, 300 sec: 5635.7). Total num frames: 1089675264. Throughput: 0: 5054.0. Samples: 1089667642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:41,601][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 05:57:42,239][26022] Updated weights on worker 0-0, policy_version 1064140 (0.00078) [2022-07-11 05:57:44,335][26022] Updated weights on worker 0-0, policy_version 1064150 (0.00096) [2022-07-11 05:57:45,712][26022] Updated weights on worker 0-0, policy_version 1064160 (0.00099) [2022-07-11 05:57:46,687][25689] Fps is (10 sec: 5565.2, 60 sec: 5607.9, 300 sec: 5624.5). Total num frames: 1089701888. Throughput: 0: 5871.7. Samples: 1089701652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:46,687][25689] Avg episode reward: [(0, '0.655')] [2022-07-11 05:57:47,910][26022] Updated weights on worker 0-0, policy_version 1064170 (0.00095) [2022-07-11 05:57:49,725][26022] Updated weights on worker 0-0, policy_version 1064180 (0.00084) [2022-07-11 05:57:51,447][26022] Updated weights on worker 0-0, policy_version 1064190 (0.00096) [2022-07-11 05:57:51,691][25689] Fps is (10 sec: 5581.5, 60 sec: 5626.3, 300 sec: 5630.1). Total num frames: 1089731584. Throughput: 0: 5856.7. Samples: 1089735350. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:51,692][25689] Avg episode reward: [(0, '0.116')] [2022-07-11 05:57:53,340][26022] Updated weights on worker 0-0, policy_version 1064200 (0.00054) [2022-07-11 05:57:55,286][26022] Updated weights on worker 0-0, policy_version 1064210 (0.00102) [2022-07-11 05:57:56,843][25689] Fps is (10 sec: 5746.8, 60 sec: 5633.7, 300 sec: 5627.3). Total num frames: 1089760256. Throughput: 0: 5813.8. Samples: 1089768538. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:57:56,844][25689] Avg episode reward: [(0, '-0.564')] [2022-07-11 05:57:57,020][26022] Updated weights on worker 0-0, policy_version 1064220 (0.00097) [2022-07-11 05:57:58,940][26022] Updated weights on worker 0-0, policy_version 1064230 (0.00084) [2022-07-11 05:58:00,754][26022] Updated weights on worker 0-0, policy_version 1064240 (0.00091) [2022-07-11 05:58:01,917][25689] Fps is (10 sec: 5207.4, 60 sec: 5544.0, 300 sec: 5616.4). Total num frames: 1089784832. Throughput: 0: 5786.6. Samples: 1089785202. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:01,917][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 05:58:03,057][26022] Updated weights on worker 0-0, policy_version 1064250 (0.00095) [2022-07-11 05:58:04,928][26022] Updated weights on worker 0-0, policy_version 1064260 (0.00091) [2022-07-11 05:58:06,780][26022] Updated weights on worker 0-0, policy_version 1064270 (0.00090) [2022-07-11 05:58:06,940][25689] Fps is (10 sec: 5172.4, 60 sec: 5566.0, 300 sec: 5620.2). Total num frames: 1089812480. Throughput: 0: 5675.5. Samples: 1089816600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:06,941][25689] Avg episode reward: [(0, '0.433')] [2022-07-11 05:58:08,347][26022] Updated weights on worker 0-0, policy_version 1064280 (0.00091) [2022-07-11 05:58:10,561][26022] Updated weights on worker 0-0, policy_version 1064290 (0.00083) [2022-07-11 05:58:11,970][25689] Fps is (10 sec: 5806.0, 60 sec: 5633.9, 300 sec: 5623.9). Total num frames: 1089843200. Throughput: 0: 5650.2. Samples: 1089849928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:11,970][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 05:58:11,971][26022] Updated weights on worker 0-0, policy_version 1064300 (0.00129) [2022-07-11 05:58:14,133][26022] Updated weights on worker 0-0, policy_version 1064310 (0.00089) [2022-07-11 05:58:15,717][26022] Updated weights on worker 0-0, policy_version 1064320 (0.00086) [2022-07-11 05:58:17,011][25689] Fps is (10 sec: 5592.5, 60 sec: 5554.6, 300 sec: 5613.2). Total num frames: 1089868800. Throughput: 0: 4862.4. Samples: 1089866600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:17,011][25689] Avg episode reward: [(0, '0.058')] [2022-07-11 05:58:17,975][26022] Updated weights on worker 0-0, policy_version 1064330 (0.00092) [2022-07-11 05:58:19,520][26022] Updated weights on worker 0-0, policy_version 1064340 (0.00089) [2022-07-11 05:58:21,539][26022] Updated weights on worker 0-0, policy_version 1064350 (0.00078) [2022-07-11 05:58:22,016][25689] Fps is (10 sec: 5300.3, 60 sec: 5545.2, 300 sec: 5609.9). Total num frames: 1089896448. Throughput: 0: 5722.2. Samples: 1089900214. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:22,017][25689] Avg episode reward: [(0, '-0.970')] [2022-07-11 05:58:23,083][26022] Updated weights on worker 0-0, policy_version 1064360 (0.00094) [2022-07-11 05:58:25,195][26022] Updated weights on worker 0-0, policy_version 1064370 (0.00088) [2022-07-11 05:58:26,761][26022] Updated weights on worker 0-0, policy_version 1064380 (0.00087) [2022-07-11 05:58:27,023][25689] Fps is (10 sec: 5727.3, 60 sec: 5562.0, 300 sec: 5613.5). Total num frames: 1089926144. Throughput: 0: 5825.3. Samples: 1089933590. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:27,024][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 05:58:28,793][26022] Updated weights on worker 0-0, policy_version 1064390 (0.00083) [2022-07-11 05:58:30,469][26022] Updated weights on worker 0-0, policy_version 1064400 (0.00093) [2022-07-11 05:58:32,056][25689] Fps is (10 sec: 5609.6, 60 sec: 5529.6, 300 sec: 5611.9). Total num frames: 1089952768. Throughput: 0: 4999.6. Samples: 1089950350. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:32,056][25689] Avg episode reward: [(0, '-1.155')] [2022-07-11 05:58:32,532][26022] Updated weights on worker 0-0, policy_version 1064410 (0.00094) [2022-07-11 05:58:34,252][26022] Updated weights on worker 0-0, policy_version 1064420 (0.00089) [2022-07-11 05:58:36,106][26022] Updated weights on worker 0-0, policy_version 1064430 (0.00079) [2022-07-11 05:58:37,134][25689] Fps is (10 sec: 5570.5, 60 sec: 5562.4, 300 sec: 5614.7). Total num frames: 1089982464. Throughput: 0: 5821.4. Samples: 1089983744. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:37,134][25689] Avg episode reward: [(0, '-0.960')] [2022-07-11 05:58:37,883][26022] Updated weights on worker 0-0, policy_version 1064440 (0.00087) [2022-07-11 05:58:39,939][26022] Updated weights on worker 0-0, policy_version 1064450 (0.00095) [2022-07-11 05:58:41,583][26022] Updated weights on worker 0-0, policy_version 1064460 (0.00095) [2022-07-11 05:58:42,159][25689] Fps is (10 sec: 5777.3, 60 sec: 5546.3, 300 sec: 5617.9). Total num frames: 1090011136. Throughput: 0: 5819.6. Samples: 1090017438. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:42,159][25689] Avg episode reward: [(0, '-0.073')] [2022-07-11 05:58:43,545][26022] Updated weights on worker 0-0, policy_version 1064470 (0.00092) [2022-07-11 05:58:45,131][26022] Updated weights on worker 0-0, policy_version 1064480 (0.00092) [2022-07-11 05:58:47,082][26022] Updated weights on worker 0-0, policy_version 1064490 (0.00082) [2022-07-11 05:58:47,186][25689] Fps is (10 sec: 5602.7, 60 sec: 5568.6, 300 sec: 5614.7). Total num frames: 1090038784. Throughput: 0: 5003.9. Samples: 1090034482. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:47,187][25689] Avg episode reward: [(0, '1.115')] [2022-07-11 05:58:48,967][26022] Updated weights on worker 0-0, policy_version 1064500 (0.00084) [2022-07-11 05:58:50,596][26022] Updated weights on worker 0-0, policy_version 1064510 (0.00099) [2022-07-11 05:58:52,196][25689] Fps is (10 sec: 5509.2, 60 sec: 5534.2, 300 sec: 5612.0). Total num frames: 1090066432. Throughput: 0: 5852.4. Samples: 1090068218. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:52,196][25689] Avg episode reward: [(0, '1.135')] [2022-07-11 05:58:52,560][26022] Updated weights on worker 0-0, policy_version 1064520 (0.00088) [2022-07-11 05:58:54,383][26022] Updated weights on worker 0-0, policy_version 1064530 (0.00061) [2022-07-11 05:58:56,189][26022] Updated weights on worker 0-0, policy_version 1064540 (0.00086) [2022-07-11 05:58:57,323][25689] Fps is (10 sec: 5455.0, 60 sec: 5519.6, 300 sec: 5610.5). Total num frames: 1090094080. Throughput: 0: 5848.4. Samples: 1090101818. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:58:57,323][25689] Avg episode reward: [(0, '0.998')] [2022-07-11 05:58:58,163][26022] Updated weights on worker 0-0, policy_version 1064550 (0.00092) [2022-07-11 05:58:59,653][26022] Updated weights on worker 0-0, policy_version 1064560 (0.00085) [2022-07-11 05:59:01,694][26022] Updated weights on worker 0-0, policy_version 1064570 (0.00086) [2022-07-11 05:59:02,356][25689] Fps is (10 sec: 5543.1, 60 sec: 5591.0, 300 sec: 5620.8). Total num frames: 1090122752. Throughput: 0: 5013.3. Samples: 1090118694. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:02,357][25689] Avg episode reward: [(0, '1.153')] [2022-07-11 05:59:03,847][26022] Updated weights on worker 0-0, policy_version 1064580 (0.00087) [2022-07-11 05:59:05,587][26022] Updated weights on worker 0-0, policy_version 1064590 (0.00182) [2022-07-11 05:59:07,372][25689] Fps is (10 sec: 5400.4, 60 sec: 5557.8, 300 sec: 5603.5). Total num frames: 1090148352. Throughput: 0: 5732.1. Samples: 1090150192. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:07,373][25689] Avg episode reward: [(0, '1.088')] [2022-07-11 05:59:07,472][26022] Updated weights on worker 0-0, policy_version 1064600 (0.00090) [2022-07-11 05:59:09,363][26022] Updated weights on worker 0-0, policy_version 1064610 (0.00083) [2022-07-11 05:59:11,127][26022] Updated weights on worker 0-0, policy_version 1064620 (0.00095) [2022-07-11 05:59:12,478][25689] Fps is (10 sec: 5362.1, 60 sec: 5517.0, 300 sec: 5610.0). Total num frames: 1090177024. Throughput: 0: 5713.0. Samples: 1090184088. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:12,479][25689] Avg episode reward: [(0, '0.986')] [2022-07-11 05:59:13,065][26022] Updated weights on worker 0-0, policy_version 1064630 (0.00084) [2022-07-11 05:59:14,722][26022] Updated weights on worker 0-0, policy_version 1064640 (0.00085) [2022-07-11 05:59:16,713][26022] Updated weights on worker 0-0, policy_version 1064650 (0.00087) [2022-07-11 05:59:17,581][25689] Fps is (10 sec: 5717.8, 60 sec: 5579.0, 300 sec: 5615.4). Total num frames: 1090206720. Throughput: 0: 4895.3. Samples: 1090200986. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:17,581][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 05:59:18,312][26022] Updated weights on worker 0-0, policy_version 1064660 (0.00089) [2022-07-11 05:59:20,327][26022] Updated weights on worker 0-0, policy_version 1064670 (0.00089) [2022-07-11 05:59:22,003][26022] Updated weights on worker 0-0, policy_version 1064680 (0.00095) [2022-07-11 05:59:22,587][25689] Fps is (10 sec: 5672.4, 60 sec: 5578.9, 300 sec: 5605.0). Total num frames: 1090234368. Throughput: 0: 5722.1. Samples: 1090234456. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:22,587][25689] Avg episode reward: [(0, '0.554')] [2022-07-11 05:59:24,144][26022] Updated weights on worker 0-0, policy_version 1064690 (0.00079) [2022-07-11 05:59:25,777][26022] Updated weights on worker 0-0, policy_version 1064700 (0.00091) [2022-07-11 05:59:27,626][25689] Fps is (10 sec: 5504.7, 60 sec: 5542.2, 300 sec: 5608.2). Total num frames: 1090262016. Throughput: 0: 5808.5. Samples: 1090267834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:27,626][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 05:59:27,819][26022] Updated weights on worker 0-0, policy_version 1064710 (0.00083) [2022-07-11 05:59:29,308][26022] Updated weights on worker 0-0, policy_version 1064720 (0.00093) [2022-07-11 05:59:31,460][26022] Updated weights on worker 0-0, policy_version 1064730 (0.00094) [2022-07-11 05:59:32,664][25689] Fps is (10 sec: 5588.7, 60 sec: 5575.5, 300 sec: 5608.8). Total num frames: 1090290688. Throughput: 0: 4968.5. Samples: 1090284386. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:32,666][25689] Avg episode reward: [(0, '-1.768')] [2022-07-11 05:59:33,327][26022] Updated weights on worker 0-0, policy_version 1064740 (0.00080) [2022-07-11 05:59:35,074][26022] Updated weights on worker 0-0, policy_version 1064750 (0.00086) [2022-07-11 05:59:37,072][26022] Updated weights on worker 0-0, policy_version 1064760 (0.00085) [2022-07-11 05:59:37,764][25689] Fps is (10 sec: 5454.5, 60 sec: 5522.8, 300 sec: 5604.7). Total num frames: 1090317312. Throughput: 0: 5776.2. Samples: 1090317566. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:37,764][25689] Avg episode reward: [(0, '-2.277')] [2022-07-11 05:59:38,629][26022] Updated weights on worker 0-0, policy_version 1064770 (0.00080) [2022-07-11 05:59:39,615][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 05:59:39,629][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001064776_1090330624.pth [2022-07-11 05:59:39,629][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001062804_1088311296.pth [2022-07-11 05:59:40,759][26022] Updated weights on worker 0-0, policy_version 1064780 (0.00098) [2022-07-11 05:59:42,235][26022] Updated weights on worker 0-0, policy_version 1064790 (0.00080) [2022-07-11 05:59:42,819][25689] Fps is (10 sec: 5546.5, 60 sec: 5537.0, 300 sec: 5608.9). Total num frames: 1090347008. Throughput: 0: 5780.0. Samples: 1090351394. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:42,819][25689] Avg episode reward: [(0, '-2.175')] [2022-07-11 05:59:44,347][26022] Updated weights on worker 0-0, policy_version 1064800 (0.00087) [2022-07-11 05:59:45,987][26022] Updated weights on worker 0-0, policy_version 1064810 (0.00091) [2022-07-11 05:59:47,775][26022] Updated weights on worker 0-0, policy_version 1064820 (0.00096) [2022-07-11 05:59:47,851][25689] Fps is (10 sec: 5786.4, 60 sec: 5553.4, 300 sec: 5605.4). Total num frames: 1090375680. Throughput: 0: 4967.9. Samples: 1090368308. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:47,851][25689] Avg episode reward: [(0, '-1.954')] [2022-07-11 05:59:49,873][26022] Updated weights on worker 0-0, policy_version 1064830 (0.00086) [2022-07-11 05:59:51,424][26022] Updated weights on worker 0-0, policy_version 1064840 (0.00087) [2022-07-11 05:59:52,864][25689] Fps is (10 sec: 5402.6, 60 sec: 5519.3, 300 sec: 5599.0). Total num frames: 1090401280. Throughput: 0: 5820.2. Samples: 1090401952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:52,865][25689] Avg episode reward: [(0, '-2.093')] [2022-07-11 05:59:53,411][26022] Updated weights on worker 0-0, policy_version 1064850 (0.00100) [2022-07-11 05:59:55,296][26022] Updated weights on worker 0-0, policy_version 1064860 (0.00107) [2022-07-11 05:59:57,074][26022] Updated weights on worker 0-0, policy_version 1064870 (0.00078) [2022-07-11 05:59:57,975][25689] Fps is (10 sec: 5563.0, 60 sec: 5571.5, 300 sec: 5601.5). Total num frames: 1090432000. Throughput: 0: 5823.5. Samples: 1090435266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 05:59:57,975][25689] Avg episode reward: [(0, '-0.253')] [2022-07-11 05:59:59,133][26022] Updated weights on worker 0-0, policy_version 1064880 (0.00082) [2022-07-11 06:00:00,532][26022] Updated weights on worker 0-0, policy_version 1064890 (0.00081) [2022-07-11 06:00:02,986][25689] Fps is (10 sec: 5564.2, 60 sec: 5522.9, 300 sec: 5598.9). Total num frames: 1090457600. Throughput: 0: 5731.2. Samples: 1090466978. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 06:00:02,986][25689] Avg episode reward: [(0, '-0.477')] [2022-07-11 06:00:02,988][26022] Updated weights on worker 0-0, policy_version 1064900 (0.00089) [2022-07-11 06:00:04,944][26022] Updated weights on worker 0-0, policy_version 1064910 (0.00085) [2022-07-11 06:00:06,504][26022] Updated weights on worker 0-0, policy_version 1064920 (0.00089) [2022-07-11 06:00:08,057][25689] Fps is (10 sec: 5281.2, 60 sec: 5551.6, 300 sec: 5598.3). Total num frames: 1090485248. Throughput: 0: 5706.4. Samples: 1090483616. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 06:00:08,058][25689] Avg episode reward: [(0, '-0.400')] [2022-07-11 06:00:08,605][26022] Updated weights on worker 0-0, policy_version 1064930 (0.00090) [2022-07-11 06:00:10,080][26022] Updated weights on worker 0-0, policy_version 1064940 (0.00088) [2022-07-11 06:00:12,207][26022] Updated weights on worker 0-0, policy_version 1064950 (0.00085) [2022-07-11 06:00:13,075][25689] Fps is (10 sec: 5684.0, 60 sec: 5576.5, 300 sec: 5603.0). Total num frames: 1090514944. Throughput: 0: 5716.4. Samples: 1090517484. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 06:00:13,075][25689] Avg episode reward: [(0, '-0.500')] [2022-07-11 06:00:13,853][26022] Updated weights on worker 0-0, policy_version 1064960 (0.00086) [2022-07-11 06:00:15,650][26022] Updated weights on worker 0-0, policy_version 1064970 (0.00087) [2022-07-11 06:00:17,709][26022] Updated weights on worker 0-0, policy_version 1064980 (0.00085) [2022-07-11 06:00:18,135][25689] Fps is (10 sec: 5690.2, 60 sec: 5546.6, 300 sec: 5598.5). Total num frames: 1090542592. Throughput: 0: 5764.3. Samples: 1090551476. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 06:00:18,136][25689] Avg episode reward: [(0, '0.065')] [2022-07-11 06:00:19,278][26022] Updated weights on worker 0-0, policy_version 1064990 (0.00087) [2022-07-11 06:00:21,248][26022] Updated weights on worker 0-0, policy_version 1065000 (0.00086) [2022-07-11 06:00:22,920][26022] Updated weights on worker 0-0, policy_version 1065010 (0.00085) [2022-07-11 06:00:23,152][25689] Fps is (10 sec: 5588.6, 60 sec: 5562.5, 300 sec: 5596.2). Total num frames: 1090571264. Throughput: 0: 5027.5. Samples: 1090568366. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 06:00:23,153][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 06:00:24,875][26022] Updated weights on worker 0-0, policy_version 1065020 (0.00079) [2022-07-11 06:00:26,696][26022] Updated weights on worker 0-0, policy_version 1065030 (0.00086) [2022-07-11 06:00:28,201][25689] Fps is (10 sec: 5595.4, 60 sec: 5561.7, 300 sec: 5589.8). Total num frames: 1090598912. Throughput: 0: 5874.1. Samples: 1090601940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 18.0) [2022-07-11 06:00:28,201][25689] Avg episode reward: [(0, '-0.348')] [2022-07-11 06:00:28,511][26022] Updated weights on worker 0-0, policy_version 1065040 (0.00094) [2022-07-11 06:00:30,194][26022] Updated weights on worker 0-0, policy_version 1065050 (0.00086) [2022-07-11 06:00:32,142][26022] Updated weights on worker 0-0, policy_version 1065060 (0.00083) [2022-07-11 06:00:33,271][25689] Fps is (10 sec: 5565.9, 60 sec: 5558.7, 300 sec: 5597.1). Total num frames: 1090627584. Throughput: 0: 5863.9. Samples: 1090635914. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:00:33,272][25689] Avg episode reward: [(0, '-0.542')] [2022-07-11 06:00:34,047][26022] Updated weights on worker 0-0, policy_version 1065070 (0.00088) [2022-07-11 06:00:35,815][26022] Updated weights on worker 0-0, policy_version 1065080 (0.00084) [2022-07-11 06:00:37,486][26022] Updated weights on worker 0-0, policy_version 1065090 (0.00085) [2022-07-11 06:00:38,330][25689] Fps is (10 sec: 5660.9, 60 sec: 5596.2, 300 sec: 5589.1). Total num frames: 1090656256. Throughput: 0: 5010.8. Samples: 1090652670. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:00:38,331][25689] Avg episode reward: [(0, '0.169')] [2022-07-11 06:00:39,346][26022] Updated weights on worker 0-0, policy_version 1065100 (0.00087) [2022-07-11 06:00:41,118][26022] Updated weights on worker 0-0, policy_version 1065110 (0.00081) [2022-07-11 06:00:43,171][26022] Updated weights on worker 0-0, policy_version 1065120 (0.00078) [2022-07-11 06:00:43,343][25689] Fps is (10 sec: 5693.6, 60 sec: 5583.2, 300 sec: 5592.7). Total num frames: 1090684928. Throughput: 0: 5849.4. Samples: 1090686468. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:00:43,345][25689] Avg episode reward: [(0, '0.286')] [2022-07-11 06:00:44,880][26022] Updated weights on worker 0-0, policy_version 1065130 (0.00089) [2022-07-11 06:00:46,630][26022] Updated weights on worker 0-0, policy_version 1065140 (0.00086) [2022-07-11 06:00:48,284][26022] Updated weights on worker 0-0, policy_version 1065150 (0.00100) [2022-07-11 06:00:48,378][25689] Fps is (10 sec: 5707.3, 60 sec: 5582.9, 300 sec: 5589.2). Total num frames: 1090713600. Throughput: 0: 5877.0. Samples: 1090720522. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:00:48,378][25689] Avg episode reward: [(0, '1.173')] [2022-07-11 06:00:50,494][26022] Updated weights on worker 0-0, policy_version 1065160 (0.00091) [2022-07-11 06:00:51,827][26022] Updated weights on worker 0-0, policy_version 1065170 (0.00085) [2022-07-11 06:00:53,384][25689] Fps is (10 sec: 5405.1, 60 sec: 5583.6, 300 sec: 5584.6). Total num frames: 1090739200. Throughput: 0: 5049.8. Samples: 1090737480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:00:53,384][25689] Avg episode reward: [(0, '0.956')] [2022-07-11 06:00:53,921][26022] Updated weights on worker 0-0, policy_version 1065180 (0.00088) [2022-07-11 06:00:55,595][26022] Updated weights on worker 0-0, policy_version 1065190 (0.00084) [2022-07-11 06:00:57,531][26022] Updated weights on worker 0-0, policy_version 1065200 (0.00091) [2022-07-11 06:00:58,485][25689] Fps is (10 sec: 5572.4, 60 sec: 5584.5, 300 sec: 5589.8). Total num frames: 1090769920. Throughput: 0: 5893.0. Samples: 1090771440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:00:58,485][25689] Avg episode reward: [(0, '-0.118')] [2022-07-11 06:00:59,504][26022] Updated weights on worker 0-0, policy_version 1065210 (0.00083) [2022-07-11 06:01:01,087][26022] Updated weights on worker 0-0, policy_version 1065220 (0.00085) [2022-07-11 06:01:03,361][26022] Updated weights on worker 0-0, policy_version 1065230 (0.00090) [2022-07-11 06:01:03,545][25689] Fps is (10 sec: 5542.5, 60 sec: 5580.0, 300 sec: 5585.6). Total num frames: 1090795520. Throughput: 0: 5756.5. Samples: 1090802764. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:03,546][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 06:01:05,240][26022] Updated weights on worker 0-0, policy_version 1065240 (0.00080) [2022-07-11 06:01:06,843][26022] Updated weights on worker 0-0, policy_version 1065250 (0.00088) [2022-07-11 06:01:08,601][25689] Fps is (10 sec: 5263.7, 60 sec: 5581.4, 300 sec: 5581.8). Total num frames: 1090823168. Throughput: 0: 4916.1. Samples: 1090819940. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:08,603][25689] Avg episode reward: [(0, '-0.692')] [2022-07-11 06:01:08,979][26022] Updated weights on worker 0-0, policy_version 1065260 (0.00089) [2022-07-11 06:01:10,647][26022] Updated weights on worker 0-0, policy_version 1065270 (0.00092) [2022-07-11 06:01:12,380][26022] Updated weights on worker 0-0, policy_version 1065280 (0.00083) [2022-07-11 06:01:13,659][25689] Fps is (10 sec: 5670.2, 60 sec: 5577.7, 300 sec: 5585.7). Total num frames: 1090852864. Throughput: 0: 5748.3. Samples: 1090854024. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:13,659][25689] Avg episode reward: [(0, '-0.802')] [2022-07-11 06:01:14,163][26022] Updated weights on worker 0-0, policy_version 1065290 (0.00084) [2022-07-11 06:01:15,993][26022] Updated weights on worker 0-0, policy_version 1065300 (0.00616) [2022-07-11 06:01:17,747][26022] Updated weights on worker 0-0, policy_version 1065310 (0.00089) [2022-07-11 06:01:18,757][25689] Fps is (10 sec: 5848.1, 60 sec: 5608.0, 300 sec: 5587.4). Total num frames: 1090882560. Throughput: 0: 5776.5. Samples: 1090888540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:18,757][25689] Avg episode reward: [(0, '-0.687')] [2022-07-11 06:01:19,847][26022] Updated weights on worker 0-0, policy_version 1065320 (0.00081) [2022-07-11 06:01:21,258][26022] Updated weights on worker 0-0, policy_version 1065330 (0.00079) [2022-07-11 06:01:23,220][26022] Updated weights on worker 0-0, policy_version 1065340 (0.00098) [2022-07-11 06:01:23,770][25689] Fps is (10 sec: 5772.7, 60 sec: 5608.4, 300 sec: 5583.8). Total num frames: 1090911232. Throughput: 0: 5100.2. Samples: 1090905912. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:23,771][25689] Avg episode reward: [(0, '-0.208')] [2022-07-11 06:01:24,860][26022] Updated weights on worker 0-0, policy_version 1065350 (0.00085) [2022-07-11 06:01:26,848][26022] Updated weights on worker 0-0, policy_version 1065360 (0.00082) [2022-07-11 06:01:28,446][26022] Updated weights on worker 0-0, policy_version 1065370 (0.00091) [2022-07-11 06:01:28,786][25689] Fps is (10 sec: 5717.5, 60 sec: 5628.3, 300 sec: 5590.8). Total num frames: 1090939904. Throughput: 0: 5958.3. Samples: 1090940210. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:28,788][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 06:01:30,240][26022] Updated weights on worker 0-0, policy_version 1065380 (0.00090) [2022-07-11 06:01:32,080][26022] Updated weights on worker 0-0, policy_version 1065390 (0.00079) [2022-07-11 06:01:33,799][25689] Fps is (10 sec: 5615.2, 60 sec: 5616.7, 300 sec: 5584.8). Total num frames: 1090967552. Throughput: 0: 5977.6. Samples: 1090974418. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:33,800][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 06:01:33,895][26022] Updated weights on worker 0-0, policy_version 1065400 (0.00084) [2022-07-11 06:01:35,684][26022] Updated weights on worker 0-0, policy_version 1065410 (0.00083) [2022-07-11 06:01:37,531][26022] Updated weights on worker 0-0, policy_version 1065420 (0.00093) [2022-07-11 06:01:38,886][25689] Fps is (10 sec: 5779.3, 60 sec: 5648.0, 300 sec: 5590.5). Total num frames: 1090998272. Throughput: 0: 5119.9. Samples: 1090991598. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:38,886][25689] Avg episode reward: [(0, '1.044')] [2022-07-11 06:01:39,359][26022] Updated weights on worker 0-0, policy_version 1065430 (0.00088) [2022-07-11 06:01:39,733][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:01:39,749][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001065432_1091002368.pth [2022-07-11 06:01:39,750][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001063462_1088985088.pth [2022-07-11 06:01:41,147][26022] Updated weights on worker 0-0, policy_version 1065440 (0.00086) [2022-07-11 06:01:43,038][26022] Updated weights on worker 0-0, policy_version 1065450 (0.00086) [2022-07-11 06:01:43,921][25689] Fps is (10 sec: 5766.7, 60 sec: 5629.0, 300 sec: 5586.6). Total num frames: 1091025920. Throughput: 0: 5918.5. Samples: 1091025178. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:43,922][25689] Avg episode reward: [(0, '0.760')] [2022-07-11 06:01:44,826][26022] Updated weights on worker 0-0, policy_version 1065460 (0.00801) [2022-07-11 06:01:46,596][26022] Updated weights on worker 0-0, policy_version 1065470 (0.00090) [2022-07-11 06:01:48,364][26022] Updated weights on worker 0-0, policy_version 1065480 (0.00085) [2022-07-11 06:01:48,998][25689] Fps is (10 sec: 5569.3, 60 sec: 5625.0, 300 sec: 5585.5). Total num frames: 1091054592. Throughput: 0: 5898.2. Samples: 1091059426. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:48,999][25689] Avg episode reward: [(0, '0.485')] [2022-07-11 06:01:50,278][26022] Updated weights on worker 0-0, policy_version 1065490 (0.00086) [2022-07-11 06:01:51,966][26022] Updated weights on worker 0-0, policy_version 1065500 (0.00091) [2022-07-11 06:01:53,753][26022] Updated weights on worker 0-0, policy_version 1065510 (0.00083) [2022-07-11 06:01:54,003][25689] Fps is (10 sec: 5687.9, 60 sec: 5675.9, 300 sec: 5589.8). Total num frames: 1091083264. Throughput: 0: 5054.7. Samples: 1091076540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:54,003][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 06:01:55,635][26022] Updated weights on worker 0-0, policy_version 1065520 (0.00092) [2022-07-11 06:01:57,344][26022] Updated weights on worker 0-0, policy_version 1065530 (0.00087) [2022-07-11 06:01:59,045][25689] Fps is (10 sec: 5605.9, 60 sec: 5630.6, 300 sec: 5582.4). Total num frames: 1091110912. Throughput: 0: 5896.9. Samples: 1091110474. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:01:59,045][25689] Avg episode reward: [(0, '1.192')] [2022-07-11 06:01:59,303][26022] Updated weights on worker 0-0, policy_version 1065540 (0.00423) [2022-07-11 06:02:01,106][26022] Updated weights on worker 0-0, policy_version 1065550 (0.00240) [2022-07-11 06:02:03,390][26022] Updated weights on worker 0-0, policy_version 1065560 (0.00092) [2022-07-11 06:02:04,078][25689] Fps is (10 sec: 5386.8, 60 sec: 5650.1, 300 sec: 5583.3). Total num frames: 1091137536. Throughput: 0: 5798.1. Samples: 1091142048. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:04,078][25689] Avg episode reward: [(0, '0.181')] [2022-07-11 06:02:05,235][26022] Updated weights on worker 0-0, policy_version 1065570 (0.00091) [2022-07-11 06:02:06,883][26022] Updated weights on worker 0-0, policy_version 1065580 (0.00089) [2022-07-11 06:02:08,734][26022] Updated weights on worker 0-0, policy_version 1065590 (0.00091) [2022-07-11 06:02:09,079][25689] Fps is (10 sec: 5510.7, 60 sec: 5672.1, 300 sec: 5590.8). Total num frames: 1091166208. Throughput: 0: 4955.6. Samples: 1091158938. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:09,080][25689] Avg episode reward: [(0, '0.146')] [2022-07-11 06:02:10,756][26022] Updated weights on worker 0-0, policy_version 1065600 (0.00080) [2022-07-11 06:02:12,368][26022] Updated weights on worker 0-0, policy_version 1065610 (0.00088) [2022-07-11 06:02:14,087][25689] Fps is (10 sec: 5422.4, 60 sec: 5609.0, 300 sec: 5575.2). Total num frames: 1091191808. Throughput: 0: 5764.4. Samples: 1091192312. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:14,087][25689] Avg episode reward: [(0, '0.394')] [2022-07-11 06:02:14,272][26022] Updated weights on worker 0-0, policy_version 1065620 (0.00086) [2022-07-11 06:02:15,962][26022] Updated weights on worker 0-0, policy_version 1065630 (0.00085) [2022-07-11 06:02:18,062][26022] Updated weights on worker 0-0, policy_version 1065640 (0.00097) [2022-07-11 06:02:19,178][25689] Fps is (10 sec: 5475.7, 60 sec: 5609.7, 300 sec: 5578.6). Total num frames: 1091221504. Throughput: 0: 5730.9. Samples: 1091225852. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:19,178][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 06:02:19,980][26022] Updated weights on worker 0-0, policy_version 1065650 (0.00083) [2022-07-11 06:02:21,559][26022] Updated weights on worker 0-0, policy_version 1065660 (0.00089) [2022-07-11 06:02:23,496][26022] Updated weights on worker 0-0, policy_version 1065670 (0.00090) [2022-07-11 06:02:24,207][25689] Fps is (10 sec: 5666.3, 60 sec: 5591.2, 300 sec: 5574.7). Total num frames: 1091249152. Throughput: 0: 5004.2. Samples: 1091242778. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:24,207][25689] Avg episode reward: [(0, '1.123')] [2022-07-11 06:02:25,324][26022] Updated weights on worker 0-0, policy_version 1065680 (0.00083) [2022-07-11 06:02:27,111][26022] Updated weights on worker 0-0, policy_version 1065690 (0.00098) [2022-07-11 06:02:29,175][26022] Updated weights on worker 0-0, policy_version 1065700 (0.00085) [2022-07-11 06:02:29,227][25689] Fps is (10 sec: 5502.6, 60 sec: 5574.0, 300 sec: 5571.8). Total num frames: 1091276800. Throughput: 0: 5826.6. Samples: 1091276330. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:29,227][25689] Avg episode reward: [(0, '2.353')] [2022-07-11 06:02:30,738][26022] Updated weights on worker 0-0, policy_version 1065710 (0.00093) [2022-07-11 06:02:32,618][26022] Updated weights on worker 0-0, policy_version 1065720 (0.00090) [2022-07-11 06:02:34,229][25689] Fps is (10 sec: 5619.7, 60 sec: 5592.0, 300 sec: 5576.5). Total num frames: 1091305472. Throughput: 0: 5833.3. Samples: 1091309806. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:34,231][25689] Avg episode reward: [(0, '2.112')] [2022-07-11 06:02:34,547][26022] Updated weights on worker 0-0, policy_version 1065730 (0.00102) [2022-07-11 06:02:36,336][26022] Updated weights on worker 0-0, policy_version 1065740 (0.00096) [2022-07-11 06:02:38,384][26022] Updated weights on worker 0-0, policy_version 1065750 (0.00078) [2022-07-11 06:02:39,303][25689] Fps is (10 sec: 5589.3, 60 sec: 5542.3, 300 sec: 5568.8). Total num frames: 1091333120. Throughput: 0: 5820.5. Samples: 1091342992. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:39,304][25689] Avg episode reward: [(0, '2.285')] [2022-07-11 06:02:39,987][26022] Updated weights on worker 0-0, policy_version 1065760 (0.00085) [2022-07-11 06:02:42,047][26022] Updated weights on worker 0-0, policy_version 1065770 (0.00093) [2022-07-11 06:02:43,803][26022] Updated weights on worker 0-0, policy_version 1065780 (0.00090) [2022-07-11 06:02:44,313][25689] Fps is (10 sec: 5585.0, 60 sec: 5561.5, 300 sec: 5577.1). Total num frames: 1091361792. Throughput: 0: 5819.7. Samples: 1091359790. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:44,315][25689] Avg episode reward: [(0, '2.651')] [2022-07-11 06:02:45,519][26022] Updated weights on worker 0-0, policy_version 1065790 (0.00096) [2022-07-11 06:02:47,387][26022] Updated weights on worker 0-0, policy_version 1065800 (0.00086) [2022-07-11 06:02:49,188][26022] Updated weights on worker 0-0, policy_version 1065810 (0.00092) [2022-07-11 06:02:49,342][25689] Fps is (10 sec: 5711.9, 60 sec: 5566.0, 300 sec: 5573.2). Total num frames: 1091390464. Throughput: 0: 5835.7. Samples: 1091393718. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:49,344][25689] Avg episode reward: [(0, '1.406')] [2022-07-11 06:02:50,941][26022] Updated weights on worker 0-0, policy_version 1065820 (0.00092) [2022-07-11 06:02:52,758][26022] Updated weights on worker 0-0, policy_version 1065830 (0.00087) [2022-07-11 06:02:54,363][25689] Fps is (10 sec: 5604.2, 60 sec: 5547.5, 300 sec: 5572.3). Total num frames: 1091418112. Throughput: 0: 5846.6. Samples: 1091427520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:54,365][25689] Avg episode reward: [(0, '0.910')] [2022-07-11 06:02:54,501][26022] Updated weights on worker 0-0, policy_version 1065840 (0.00090) [2022-07-11 06:02:56,490][26022] Updated weights on worker 0-0, policy_version 1065850 (0.00508) [2022-07-11 06:02:58,260][26022] Updated weights on worker 0-0, policy_version 1065860 (0.00089) [2022-07-11 06:02:59,523][25689] Fps is (10 sec: 5632.7, 60 sec: 5570.6, 300 sec: 5587.8). Total num frames: 1091447808. Throughput: 0: 5010.7. Samples: 1091444308. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:02:59,523][25689] Avg episode reward: [(0, '0.697')] [2022-07-11 06:03:00,280][26022] Updated weights on worker 0-0, policy_version 1065870 (0.00085) [2022-07-11 06:03:02,200][26022] Updated weights on worker 0-0, policy_version 1065880 (0.00090) [2022-07-11 06:03:04,281][26022] Updated weights on worker 0-0, policy_version 1065890 (0.00089) [2022-07-11 06:03:04,553][25689] Fps is (10 sec: 5325.6, 60 sec: 5536.9, 300 sec: 5577.4). Total num frames: 1091472384. Throughput: 0: 5736.8. Samples: 1091475904. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:04,555][25689] Avg episode reward: [(0, '0.785')] [2022-07-11 06:03:06,052][26022] Updated weights on worker 0-0, policy_version 1065900 (0.00090) [2022-07-11 06:03:07,984][26022] Updated weights on worker 0-0, policy_version 1065910 (0.00087) [2022-07-11 06:03:09,589][25689] Fps is (10 sec: 5188.1, 60 sec: 5516.9, 300 sec: 5567.0). Total num frames: 1091500032. Throughput: 0: 5719.7. Samples: 1091509522. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:09,591][25689] Avg episode reward: [(0, '0.363')] [2022-07-11 06:03:09,709][26022] Updated weights on worker 0-0, policy_version 1065920 (0.00092) [2022-07-11 06:03:11,547][26022] Updated weights on worker 0-0, policy_version 1065930 (0.00093) [2022-07-11 06:03:13,184][26022] Updated weights on worker 0-0, policy_version 1065940 (0.00088) [2022-07-11 06:03:14,607][25689] Fps is (10 sec: 5704.0, 60 sec: 5583.6, 300 sec: 5581.2). Total num frames: 1091529728. Throughput: 0: 4876.2. Samples: 1091526234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:14,608][25689] Avg episode reward: [(0, '1.201')] [2022-07-11 06:03:15,312][26022] Updated weights on worker 0-0, policy_version 1065950 (0.00085) [2022-07-11 06:03:16,935][26022] Updated weights on worker 0-0, policy_version 1065960 (0.00086) [2022-07-11 06:03:18,717][26022] Updated weights on worker 0-0, policy_version 1065970 (0.00083) [2022-07-11 06:03:19,666][25689] Fps is (10 sec: 5690.7, 60 sec: 5552.7, 300 sec: 5580.1). Total num frames: 1091557376. Throughput: 0: 5746.8. Samples: 1091560064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:19,666][25689] Avg episode reward: [(0, '0.432')] [2022-07-11 06:03:20,485][26022] Updated weights on worker 0-0, policy_version 1065980 (0.00089) [2022-07-11 06:03:22,237][26022] Updated weights on worker 0-0, policy_version 1065990 (0.00080) [2022-07-11 06:03:24,124][26022] Updated weights on worker 0-0, policy_version 1066000 (0.00094) [2022-07-11 06:03:24,678][25689] Fps is (10 sec: 5592.3, 60 sec: 5571.2, 300 sec: 5576.6). Total num frames: 1091586048. Throughput: 0: 5877.9. Samples: 1091594192. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:24,678][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 06:03:26,051][26022] Updated weights on worker 0-0, policy_version 1066010 (0.00098) [2022-07-11 06:03:27,894][26022] Updated weights on worker 0-0, policy_version 1066020 (0.00094) [2022-07-11 06:03:29,709][25689] Fps is (10 sec: 5607.7, 60 sec: 5570.2, 300 sec: 5580.1). Total num frames: 1091613696. Throughput: 0: 5033.8. Samples: 1091610802. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:29,709][25689] Avg episode reward: [(0, '-0.531')] [2022-07-11 06:03:29,728][26022] Updated weights on worker 0-0, policy_version 1066030 (0.00089) [2022-07-11 06:03:31,408][26022] Updated weights on worker 0-0, policy_version 1066040 (0.00087) [2022-07-11 06:03:33,357][26022] Updated weights on worker 0-0, policy_version 1066050 (0.00087) [2022-07-11 06:03:34,718][25689] Fps is (10 sec: 5609.3, 60 sec: 5569.5, 300 sec: 5577.9). Total num frames: 1091642368. Throughput: 0: 5873.9. Samples: 1091644366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:34,719][25689] Avg episode reward: [(0, '-1.909')] [2022-07-11 06:03:35,272][26022] Updated weights on worker 0-0, policy_version 1066060 (0.00093) [2022-07-11 06:03:36,881][26022] Updated weights on worker 0-0, policy_version 1066070 (0.00089) [2022-07-11 06:03:39,055][26022] Updated weights on worker 0-0, policy_version 1066080 (0.00097) [2022-07-11 06:03:39,759][25689] Fps is (10 sec: 5604.0, 60 sec: 5572.6, 300 sec: 5574.2). Total num frames: 1091670016. Throughput: 0: 5871.7. Samples: 1091678046. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:39,759][25689] Avg episode reward: [(0, '-1.908')] [2022-07-11 06:03:39,849][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:03:39,862][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001066085_1091671040.pth [2022-07-11 06:03:39,863][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001064124_1089662976.pth [2022-07-11 06:03:40,599][26022] Updated weights on worker 0-0, policy_version 1066090 (0.00086) [2022-07-11 06:03:42,534][26022] Updated weights on worker 0-0, policy_version 1066100 (0.00089) [2022-07-11 06:03:44,258][26022] Updated weights on worker 0-0, policy_version 1066110 (0.00089) [2022-07-11 06:03:44,793][25689] Fps is (10 sec: 5590.0, 60 sec: 5570.4, 300 sec: 5577.5). Total num frames: 1091698688. Throughput: 0: 5003.3. Samples: 1091694836. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:44,794][25689] Avg episode reward: [(0, '-0.861')] [2022-07-11 06:03:46,237][26022] Updated weights on worker 0-0, policy_version 1066120 (0.00090) [2022-07-11 06:03:47,977][26022] Updated weights on worker 0-0, policy_version 1066130 (0.00092) [2022-07-11 06:03:49,765][26022] Updated weights on worker 0-0, policy_version 1066140 (0.00088) [2022-07-11 06:03:49,801][25689] Fps is (10 sec: 5710.6, 60 sec: 5572.4, 300 sec: 5581.0). Total num frames: 1091727360. Throughput: 0: 5861.7. Samples: 1091728574. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:49,801][25689] Avg episode reward: [(0, '-0.591')] [2022-07-11 06:03:51,647][26022] Updated weights on worker 0-0, policy_version 1066150 (0.00086) [2022-07-11 06:03:53,662][26022] Updated weights on worker 0-0, policy_version 1066160 (0.00092) [2022-07-11 06:03:54,829][25689] Fps is (10 sec: 5714.2, 60 sec: 5588.6, 300 sec: 5586.3). Total num frames: 1091756032. Throughput: 0: 5848.6. Samples: 1091761984. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:54,829][25689] Avg episode reward: [(0, '-2.531')] [2022-07-11 06:03:55,203][26022] Updated weights on worker 0-0, policy_version 1066170 (0.00084) [2022-07-11 06:03:57,320][26022] Updated weights on worker 0-0, policy_version 1066180 (0.00093) [2022-07-11 06:03:59,038][26022] Updated weights on worker 0-0, policy_version 1066190 (0.00083) [2022-07-11 06:03:59,865][25689] Fps is (10 sec: 5392.5, 60 sec: 5532.2, 300 sec: 5575.9). Total num frames: 1091781632. Throughput: 0: 5006.7. Samples: 1091778712. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:03:59,866][25689] Avg episode reward: [(0, '-1.952')] [2022-07-11 06:04:00,998][26022] Updated weights on worker 0-0, policy_version 1066200 (0.00080) [2022-07-11 06:04:03,106][26022] Updated weights on worker 0-0, policy_version 1066210 (0.00090) [2022-07-11 06:04:04,721][26022] Updated weights on worker 0-0, policy_version 1066220 (0.00084) [2022-07-11 06:04:04,868][25689] Fps is (10 sec: 5304.0, 60 sec: 5585.7, 300 sec: 5583.0). Total num frames: 1091809280. Throughput: 0: 5757.7. Samples: 1091810420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:04:04,868][25689] Avg episode reward: [(0, '-1.324')] [2022-07-11 06:04:06,635][26022] Updated weights on worker 0-0, policy_version 1066230 (0.00082) [2022-07-11 06:04:08,427][26022] Updated weights on worker 0-0, policy_version 1066240 (0.00094) [2022-07-11 06:04:09,876][25689] Fps is (10 sec: 5625.9, 60 sec: 5605.2, 300 sec: 5584.9). Total num frames: 1091837952. Throughput: 0: 5767.4. Samples: 1091844356. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:04:09,877][25689] Avg episode reward: [(0, '-2.845')] [2022-07-11 06:04:10,396][26022] Updated weights on worker 0-0, policy_version 1066250 (0.00095) [2022-07-11 06:04:12,178][26022] Updated weights on worker 0-0, policy_version 1066260 (0.00087) [2022-07-11 06:04:13,993][26022] Updated weights on worker 0-0, policy_version 1066270 (0.00086) [2022-07-11 06:04:14,886][25689] Fps is (10 sec: 5621.7, 60 sec: 5571.9, 300 sec: 5579.7). Total num frames: 1091865600. Throughput: 0: 4943.4. Samples: 1091861138. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:04:14,887][25689] Avg episode reward: [(0, '-4.066')] [2022-07-11 06:04:15,740][26022] Updated weights on worker 0-0, policy_version 1066280 (0.00086) [2022-07-11 06:04:17,654][26022] Updated weights on worker 0-0, policy_version 1066290 (0.00095) [2022-07-11 06:04:19,335][26022] Updated weights on worker 0-0, policy_version 1066300 (0.00088) [2022-07-11 06:04:19,940][25689] Fps is (10 sec: 5595.9, 60 sec: 5589.3, 300 sec: 5582.3). Total num frames: 1091894272. Throughput: 0: 5794.0. Samples: 1091895028. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:04:19,941][25689] Avg episode reward: [(0, '-3.363')] [2022-07-11 06:04:21,332][26022] Updated weights on worker 0-0, policy_version 1066310 (0.00089) [2022-07-11 06:04:22,818][26022] Updated weights on worker 0-0, policy_version 1066320 (0.00092) [2022-07-11 06:04:24,943][25689] Fps is (10 sec: 5498.6, 60 sec: 5556.2, 300 sec: 5579.5). Total num frames: 1091920896. Throughput: 0: 5895.7. Samples: 1091928776. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:04:24,949][25689] Avg episode reward: [(0, '-2.337')] [2022-07-11 06:04:24,979][26022] Updated weights on worker 0-0, policy_version 1066330 (0.00094) [2022-07-11 06:04:26,537][26022] Updated weights on worker 0-0, policy_version 1066340 (0.00082) [2022-07-11 06:04:28,700][26022] Updated weights on worker 0-0, policy_version 1066350 (0.00091) [2022-07-11 06:04:29,979][25689] Fps is (10 sec: 5610.6, 60 sec: 5589.8, 300 sec: 5583.0). Total num frames: 1091950592. Throughput: 0: 5035.3. Samples: 1091945580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2022-07-11 06:04:29,980][25689] Avg episode reward: [(0, '-2.936')] [2022-07-11 06:04:30,234][26022] Updated weights on worker 0-0, policy_version 1066360 (0.00087) [2022-07-11 06:04:32,210][26022] Updated weights on worker 0-0, policy_version 1066370 (0.00092) [2022-07-11 06:04:34,165][26022] Updated weights on worker 0-0, policy_version 1066380 (0.00099) [2022-07-11 06:04:35,012][25689] Fps is (10 sec: 5695.0, 60 sec: 5570.6, 300 sec: 5587.7). Total num frames: 1091978240. Throughput: 0: 5857.3. Samples: 1091979020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:04:35,013][25689] Avg episode reward: [(0, '-1.819')] [2022-07-11 06:04:35,861][26022] Updated weights on worker 0-0, policy_version 1066390 (0.00089) [2022-07-11 06:04:37,665][26022] Updated weights on worker 0-0, policy_version 1066400 (0.00095) [2022-07-11 06:04:39,621][26022] Updated weights on worker 0-0, policy_version 1066410 (0.00093) [2022-07-11 06:04:40,144][25689] Fps is (10 sec: 5540.3, 60 sec: 5579.1, 300 sec: 5582.8). Total num frames: 1092006912. Throughput: 0: 5825.0. Samples: 1092012714. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:04:40,145][25689] Avg episode reward: [(0, '-1.238')] [2022-07-11 06:04:41,310][26022] Updated weights on worker 0-0, policy_version 1066420 (0.00089) [2022-07-11 06:04:43,334][26022] Updated weights on worker 0-0, policy_version 1066430 (0.00083) [2022-07-11 06:04:45,125][26022] Updated weights on worker 0-0, policy_version 1066440 (0.00084) [2022-07-11 06:04:45,160][25689] Fps is (10 sec: 5650.7, 60 sec: 5580.8, 300 sec: 5583.1). Total num frames: 1092035584. Throughput: 0: 4977.0. Samples: 1092029398. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:04:45,161][25689] Avg episode reward: [(0, '-0.588')] [2022-07-11 06:04:46,799][26022] Updated weights on worker 0-0, policy_version 1066450 (0.00087) [2022-07-11 06:04:48,669][26022] Updated weights on worker 0-0, policy_version 1066460 (0.00079) [2022-07-11 06:04:50,208][25689] Fps is (10 sec: 5698.1, 60 sec: 5577.1, 300 sec: 5592.8). Total num frames: 1092064256. Throughput: 0: 5818.3. Samples: 1092063280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:04:50,208][25689] Avg episode reward: [(0, '-0.493')] [2022-07-11 06:04:50,295][26022] Updated weights on worker 0-0, policy_version 1066470 (0.00092) [2022-07-11 06:04:52,409][26022] Updated weights on worker 0-0, policy_version 1066480 (0.00078) [2022-07-11 06:04:53,958][26022] Updated weights on worker 0-0, policy_version 1066490 (0.00084) [2022-07-11 06:04:55,269][25689] Fps is (10 sec: 5571.2, 60 sec: 5557.0, 300 sec: 5583.4). Total num frames: 1092091904. Throughput: 0: 5839.6. Samples: 1092097314. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:04:55,270][25689] Avg episode reward: [(0, '-0.084')] [2022-07-11 06:04:55,968][26022] Updated weights on worker 0-0, policy_version 1066500 (0.00083) [2022-07-11 06:04:57,974][26022] Updated weights on worker 0-0, policy_version 1066510 (0.00091) [2022-07-11 06:04:59,653][26022] Updated weights on worker 0-0, policy_version 1066520 (0.00086) [2022-07-11 06:05:00,367][25689] Fps is (10 sec: 5543.9, 60 sec: 5602.2, 300 sec: 5592.1). Total num frames: 1092120576. Throughput: 0: 5005.3. Samples: 1092113934. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:00,368][25689] Avg episode reward: [(0, '-0.162')] [2022-07-11 06:05:01,499][26022] Updated weights on worker 0-0, policy_version 1066530 (0.00085) [2022-07-11 06:05:03,516][26022] Updated weights on worker 0-0, policy_version 1066540 (0.00091) [2022-07-11 06:05:05,299][26022] Updated weights on worker 0-0, policy_version 1066550 (0.00080) [2022-07-11 06:05:05,429][25689] Fps is (10 sec: 5442.8, 60 sec: 5579.8, 300 sec: 5588.8). Total num frames: 1092147200. Throughput: 0: 5732.5. Samples: 1092145588. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:05,429][25689] Avg episode reward: [(0, '-0.596')] [2022-07-11 06:05:07,232][26022] Updated weights on worker 0-0, policy_version 1066560 (0.00093) [2022-07-11 06:05:08,964][26022] Updated weights on worker 0-0, policy_version 1066570 (0.00086) [2022-07-11 06:05:10,436][25689] Fps is (10 sec: 5389.9, 60 sec: 5563.0, 300 sec: 5582.1). Total num frames: 1092174848. Throughput: 0: 5760.0. Samples: 1092179794. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:10,437][25689] Avg episode reward: [(0, '-0.452')] [2022-07-11 06:05:10,884][26022] Updated weights on worker 0-0, policy_version 1066580 (0.00088) [2022-07-11 06:05:12,659][26022] Updated weights on worker 0-0, policy_version 1066590 (0.00083) [2022-07-11 06:05:14,489][26022] Updated weights on worker 0-0, policy_version 1066600 (0.00088) [2022-07-11 06:05:15,462][25689] Fps is (10 sec: 5613.6, 60 sec: 5578.5, 300 sec: 5586.2). Total num frames: 1092203520. Throughput: 0: 4912.5. Samples: 1092196508. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:15,463][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 06:05:16,391][26022] Updated weights on worker 0-0, policy_version 1066610 (0.00095) [2022-07-11 06:05:18,024][26022] Updated weights on worker 0-0, policy_version 1066620 (0.00086) [2022-07-11 06:05:20,197][26022] Updated weights on worker 0-0, policy_version 1066630 (0.00087) [2022-07-11 06:05:20,597][25689] Fps is (10 sec: 5643.7, 60 sec: 5571.1, 300 sec: 5584.0). Total num frames: 1092232192. Throughput: 0: 5751.2. Samples: 1092230280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:20,597][25689] Avg episode reward: [(0, '0.195')] [2022-07-11 06:05:21,794][26022] Updated weights on worker 0-0, policy_version 1066640 (0.00090) [2022-07-11 06:05:23,655][26022] Updated weights on worker 0-0, policy_version 1066650 (0.00076) [2022-07-11 06:05:25,287][26022] Updated weights on worker 0-0, policy_version 1066660 (0.00087) [2022-07-11 06:05:25,599][25689] Fps is (10 sec: 5656.8, 60 sec: 5604.9, 300 sec: 5588.3). Total num frames: 1092260864. Throughput: 0: 5886.9. Samples: 1092264326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:25,601][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 06:05:27,354][26022] Updated weights on worker 0-0, policy_version 1066670 (0.00094) [2022-07-11 06:05:28,955][26022] Updated weights on worker 0-0, policy_version 1066680 (0.00082) [2022-07-11 06:05:30,642][25689] Fps is (10 sec: 5606.8, 60 sec: 5570.5, 300 sec: 5585.4). Total num frames: 1092288512. Throughput: 0: 5871.1. Samples: 1092298422. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:30,642][25689] Avg episode reward: [(0, '1.470')] [2022-07-11 06:05:30,888][26022] Updated weights on worker 0-0, policy_version 1066690 (0.00087) [2022-07-11 06:05:32,749][26022] Updated weights on worker 0-0, policy_version 1066700 (0.00084) [2022-07-11 06:05:34,427][26022] Updated weights on worker 0-0, policy_version 1066710 (0.00084) [2022-07-11 06:05:35,684][25689] Fps is (10 sec: 5685.8, 60 sec: 5603.4, 300 sec: 5589.1). Total num frames: 1092318208. Throughput: 0: 5886.6. Samples: 1092315550. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:35,685][25689] Avg episode reward: [(0, '1.460')] [2022-07-11 06:05:36,359][26022] Updated weights on worker 0-0, policy_version 1066720 (0.00079) [2022-07-11 06:05:37,941][26022] Updated weights on worker 0-0, policy_version 1066730 (0.00084) [2022-07-11 06:05:39,935][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:05:39,944][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001066740_1092341760.pth [2022-07-11 06:05:39,951][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001064776_1090330624.pth [2022-07-11 06:05:39,955][26022] Updated weights on worker 0-0, policy_version 1066740 (0.00107) [2022-07-11 06:05:40,802][25689] Fps is (10 sec: 5744.6, 60 sec: 5604.7, 300 sec: 5587.2). Total num frames: 1092346880. Throughput: 0: 5906.5. Samples: 1092349624. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:40,803][25689] Avg episode reward: [(0, '1.540')] [2022-07-11 06:05:41,590][26022] Updated weights on worker 0-0, policy_version 1066750 (0.00084) [2022-07-11 06:05:43,447][26022] Updated weights on worker 0-0, policy_version 1066760 (0.00079) [2022-07-11 06:05:45,158][26022] Updated weights on worker 0-0, policy_version 1066770 (0.00085) [2022-07-11 06:05:45,814][25689] Fps is (10 sec: 5660.9, 60 sec: 5605.0, 300 sec: 5587.6). Total num frames: 1092375552. Throughput: 0: 5920.3. Samples: 1092384006. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:45,816][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 06:05:47,179][26022] Updated weights on worker 0-0, policy_version 1066780 (0.00087) [2022-07-11 06:05:48,719][26022] Updated weights on worker 0-0, policy_version 1066790 (0.00093) [2022-07-11 06:05:50,663][26022] Updated weights on worker 0-0, policy_version 1066800 (0.00089) [2022-07-11 06:05:50,854][25689] Fps is (10 sec: 5602.9, 60 sec: 5588.9, 300 sec: 5593.8). Total num frames: 1092403200. Throughput: 0: 5083.8. Samples: 1092401178. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:50,855][25689] Avg episode reward: [(0, '1.128')] [2022-07-11 06:05:52,370][26022] Updated weights on worker 0-0, policy_version 1066810 (0.00081) [2022-07-11 06:05:54,276][26022] Updated weights on worker 0-0, policy_version 1066820 (0.00098) [2022-07-11 06:05:55,934][25689] Fps is (10 sec: 5666.5, 60 sec: 5620.9, 300 sec: 5590.8). Total num frames: 1092432896. Throughput: 0: 5922.3. Samples: 1092435474. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:05:55,936][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 06:05:56,124][26022] Updated weights on worker 0-0, policy_version 1066830 (0.00094) [2022-07-11 06:05:57,934][26022] Updated weights on worker 0-0, policy_version 1066840 (0.00084) [2022-07-11 06:05:59,626][26022] Updated weights on worker 0-0, policy_version 1066850 (0.00077) [2022-07-11 06:06:01,001][25689] Fps is (10 sec: 5752.5, 60 sec: 5623.8, 300 sec: 5601.0). Total num frames: 1092461568. Throughput: 0: 5918.7. Samples: 1092469172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:01,003][25689] Avg episode reward: [(0, '0.183')] [2022-07-11 06:06:01,591][26022] Updated weights on worker 0-0, policy_version 1066860 (0.00082) [2022-07-11 06:06:03,725][26022] Updated weights on worker 0-0, policy_version 1066870 (0.00089) [2022-07-11 06:06:05,440][26022] Updated weights on worker 0-0, policy_version 1066880 (0.00077) [2022-07-11 06:06:06,021][25689] Fps is (10 sec: 5481.7, 60 sec: 5627.6, 300 sec: 5598.2). Total num frames: 1092488192. Throughput: 0: 4959.6. Samples: 1092484226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:06,022][25689] Avg episode reward: [(0, '0.087')] [2022-07-11 06:06:07,452][26022] Updated weights on worker 0-0, policy_version 1066890 (0.00086) [2022-07-11 06:06:08,975][26022] Updated weights on worker 0-0, policy_version 1066900 (0.00086) [2022-07-11 06:06:11,067][25689] Fps is (10 sec: 5289.8, 60 sec: 5607.2, 300 sec: 5588.1). Total num frames: 1092514816. Throughput: 0: 5784.7. Samples: 1092518102. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:11,067][25689] Avg episode reward: [(0, '0.565')] [2022-07-11 06:06:11,117][26022] Updated weights on worker 0-0, policy_version 1066910 (0.00095) [2022-07-11 06:06:12,530][26022] Updated weights on worker 0-0, policy_version 1066920 (0.00086) [2022-07-11 06:06:14,650][26022] Updated weights on worker 0-0, policy_version 1066930 (0.00089) [2022-07-11 06:06:16,145][25689] Fps is (10 sec: 5664.3, 60 sec: 5636.1, 300 sec: 5591.9). Total num frames: 1092545536. Throughput: 0: 5762.6. Samples: 1092551942. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:16,147][25689] Avg episode reward: [(0, '0.869')] [2022-07-11 06:06:16,313][26022] Updated weights on worker 0-0, policy_version 1066940 (0.00090) [2022-07-11 06:06:18,353][26022] Updated weights on worker 0-0, policy_version 1066950 (0.00093) [2022-07-11 06:06:19,844][26022] Updated weights on worker 0-0, policy_version 1066960 (0.00082) [2022-07-11 06:06:21,209][25689] Fps is (10 sec: 5755.2, 60 sec: 5625.8, 300 sec: 5587.5). Total num frames: 1092573184. Throughput: 0: 4934.2. Samples: 1092568882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:21,210][25689] Avg episode reward: [(0, '1.804')] [2022-07-11 06:06:21,924][26022] Updated weights on worker 0-0, policy_version 1066970 (0.00088) [2022-07-11 06:06:23,603][26022] Updated weights on worker 0-0, policy_version 1066980 (0.00083) [2022-07-11 06:06:25,424][26022] Updated weights on worker 0-0, policy_version 1066990 (0.00090) [2022-07-11 06:06:26,298][25689] Fps is (10 sec: 5446.3, 60 sec: 5600.9, 300 sec: 5582.7). Total num frames: 1092600832. Throughput: 0: 5857.5. Samples: 1092602996. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:26,299][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 06:06:27,149][26022] Updated weights on worker 0-0, policy_version 1067000 (0.00098) [2022-07-11 06:06:29,097][26022] Updated weights on worker 0-0, policy_version 1067010 (0.00053) [2022-07-11 06:06:30,957][26022] Updated weights on worker 0-0, policy_version 1067020 (0.00083) [2022-07-11 06:06:31,305][25689] Fps is (10 sec: 5679.6, 60 sec: 5637.9, 300 sec: 5589.7). Total num frames: 1092630528. Throughput: 0: 5847.7. Samples: 1092636450. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:31,307][25689] Avg episode reward: [(0, '1.069')] [2022-07-11 06:06:32,867][26022] Updated weights on worker 0-0, policy_version 1067030 (0.00087) [2022-07-11 06:06:34,518][26022] Updated weights on worker 0-0, policy_version 1067040 (0.00091) [2022-07-11 06:06:36,332][25689] Fps is (10 sec: 5714.8, 60 sec: 5605.6, 300 sec: 5580.5). Total num frames: 1092658176. Throughput: 0: 5023.3. Samples: 1092653348. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:36,334][25689] Avg episode reward: [(0, '1.401')] [2022-07-11 06:06:36,448][26022] Updated weights on worker 0-0, policy_version 1067050 (0.00093) [2022-07-11 06:06:38,275][26022] Updated weights on worker 0-0, policy_version 1067060 (0.00092) [2022-07-11 06:06:40,171][26022] Updated weights on worker 0-0, policy_version 1067070 (0.00083) [2022-07-11 06:06:41,432][25689] Fps is (10 sec: 5561.7, 60 sec: 5607.3, 300 sec: 5582.8). Total num frames: 1092686848. Throughput: 0: 5847.2. Samples: 1092687130. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:41,432][25689] Avg episode reward: [(0, '0.950')] [2022-07-11 06:06:41,807][26022] Updated weights on worker 0-0, policy_version 1067080 (0.00104) [2022-07-11 06:06:43,935][26022] Updated weights on worker 0-0, policy_version 1067090 (0.00084) [2022-07-11 06:06:45,483][26022] Updated weights on worker 0-0, policy_version 1067100 (0.00084) [2022-07-11 06:06:46,469][25689] Fps is (10 sec: 5555.8, 60 sec: 5588.0, 300 sec: 5580.1). Total num frames: 1092714496. Throughput: 0: 5831.6. Samples: 1092720628. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:46,471][25689] Avg episode reward: [(0, '-0.534')] [2022-07-11 06:06:47,539][26022] Updated weights on worker 0-0, policy_version 1067110 (0.00083) [2022-07-11 06:06:49,015][26022] Updated weights on worker 0-0, policy_version 1067120 (0.00093) [2022-07-11 06:06:51,106][26022] Updated weights on worker 0-0, policy_version 1067130 (0.00093) [2022-07-11 06:06:51,481][25689] Fps is (10 sec: 5604.5, 60 sec: 5607.6, 300 sec: 5579.9). Total num frames: 1092743168. Throughput: 0: 5010.3. Samples: 1092737534. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:51,482][25689] Avg episode reward: [(0, '-0.226')] [2022-07-11 06:06:52,756][26022] Updated weights on worker 0-0, policy_version 1067140 (0.00099) [2022-07-11 06:06:54,601][26022] Updated weights on worker 0-0, policy_version 1067150 (0.00085) [2022-07-11 06:06:56,484][25689] Fps is (10 sec: 5726.1, 60 sec: 5597.8, 300 sec: 5584.1). Total num frames: 1092771840. Throughput: 0: 5852.5. Samples: 1092771284. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:06:56,485][25689] Avg episode reward: [(0, '-0.315')] [2022-07-11 06:06:56,489][26022] Updated weights on worker 0-0, policy_version 1067160 (0.00087) [2022-07-11 06:06:58,396][26022] Updated weights on worker 0-0, policy_version 1067170 (0.00080) [2022-07-11 06:07:00,108][26022] Updated weights on worker 0-0, policy_version 1067180 (0.00087) [2022-07-11 06:07:01,563][25689] Fps is (10 sec: 5586.1, 60 sec: 5579.7, 300 sec: 5586.7). Total num frames: 1092799488. Throughput: 0: 5862.5. Samples: 1092805150. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:01,563][25689] Avg episode reward: [(0, '-0.883')] [2022-07-11 06:07:02,592][26022] Updated weights on worker 0-0, policy_version 1067190 (0.00093) [2022-07-11 06:07:03,749][26022] Updated weights on worker 0-0, policy_version 1067200 (0.00091) [2022-07-11 06:07:06,156][26022] Updated weights on worker 0-0, policy_version 1067210 (0.00090) [2022-07-11 06:07:06,564][25689] Fps is (10 sec: 5384.0, 60 sec: 5581.5, 300 sec: 5579.8). Total num frames: 1092826112. Throughput: 0: 4951.6. Samples: 1092820128. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:06,564][25689] Avg episode reward: [(0, '-1.597')] [2022-07-11 06:07:07,382][26022] Updated weights on worker 0-0, policy_version 1067220 (0.00099) [2022-07-11 06:07:09,629][26022] Updated weights on worker 0-0, policy_version 1067230 (0.00080) [2022-07-11 06:07:11,134][26022] Updated weights on worker 0-0, policy_version 1067240 (0.00092) [2022-07-11 06:07:11,608][25689] Fps is (10 sec: 5504.7, 60 sec: 5615.5, 300 sec: 5589.4). Total num frames: 1092854784. Throughput: 0: 5804.3. Samples: 1092854360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:11,609][25689] Avg episode reward: [(0, '-1.072')] [2022-07-11 06:07:13,142][26022] Updated weights on worker 0-0, policy_version 1067250 (0.00092) [2022-07-11 06:07:15,026][26022] Updated weights on worker 0-0, policy_version 1067260 (0.00086) [2022-07-11 06:07:16,614][25689] Fps is (10 sec: 5604.0, 60 sec: 5571.4, 300 sec: 5584.1). Total num frames: 1092882432. Throughput: 0: 5807.0. Samples: 1092888180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:16,614][25689] Avg episode reward: [(0, '-2.122')] [2022-07-11 06:07:16,761][26022] Updated weights on worker 0-0, policy_version 1067270 (0.00674) [2022-07-11 06:07:18,653][26022] Updated weights on worker 0-0, policy_version 1067280 (0.00091) [2022-07-11 06:07:20,431][26022] Updated weights on worker 0-0, policy_version 1067290 (0.00083) [2022-07-11 06:07:21,721][25689] Fps is (10 sec: 5670.3, 60 sec: 5601.2, 300 sec: 5589.6). Total num frames: 1092912128. Throughput: 0: 4957.7. Samples: 1092905088. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:21,723][25689] Avg episode reward: [(0, '-1.992')] [2022-07-11 06:07:22,125][26022] Updated weights on worker 0-0, policy_version 1067300 (0.00084) [2022-07-11 06:07:24,346][26022] Updated weights on worker 0-0, policy_version 1067310 (0.00080) [2022-07-11 06:07:25,592][26022] Updated weights on worker 0-0, policy_version 1067320 (0.00091) [2022-07-11 06:07:26,772][25689] Fps is (10 sec: 5644.9, 60 sec: 5604.7, 300 sec: 5589.0). Total num frames: 1092939776. Throughput: 0: 5900.3. Samples: 1092939364. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:26,773][25689] Avg episode reward: [(0, '-1.304')] [2022-07-11 06:07:27,745][26022] Updated weights on worker 0-0, policy_version 1067330 (0.00087) [2022-07-11 06:07:29,481][26022] Updated weights on worker 0-0, policy_version 1067340 (0.00081) [2022-07-11 06:07:31,203][26022] Updated weights on worker 0-0, policy_version 1067350 (0.00080) [2022-07-11 06:07:31,807][25689] Fps is (10 sec: 5787.1, 60 sec: 5619.1, 300 sec: 5595.3). Total num frames: 1092970496. Throughput: 0: 5887.1. Samples: 1092973272. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:31,807][25689] Avg episode reward: [(0, '-0.990')] [2022-07-11 06:07:33,230][26022] Updated weights on worker 0-0, policy_version 1067360 (0.00097) [2022-07-11 06:07:34,775][26022] Updated weights on worker 0-0, policy_version 1067370 (0.00090) [2022-07-11 06:07:36,850][25689] Fps is (10 sec: 5588.7, 60 sec: 5583.8, 300 sec: 5589.0). Total num frames: 1092996096. Throughput: 0: 5872.2. Samples: 1093007010. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:36,850][25689] Avg episode reward: [(0, '-0.621')] [2022-07-11 06:07:36,974][26022] Updated weights on worker 0-0, policy_version 1067380 (0.00086) [2022-07-11 06:07:38,282][26022] Updated weights on worker 0-0, policy_version 1067390 (0.00082) [2022-07-11 06:07:40,060][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:07:40,075][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001067397_1093014528.pth [2022-07-11 06:07:40,076][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001065432_1091002368.pth [2022-07-11 06:07:40,415][26022] Updated weights on worker 0-0, policy_version 1067400 (0.00087) [2022-07-11 06:07:42,008][25689] Fps is (10 sec: 5420.7, 60 sec: 5595.3, 300 sec: 5589.6). Total num frames: 1093025792. Throughput: 0: 5857.8. Samples: 1093023924. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:42,008][25689] Avg episode reward: [(0, '-1.489')] [2022-07-11 06:07:42,184][26022] Updated weights on worker 0-0, policy_version 1067410 (0.00080) [2022-07-11 06:07:44,028][26022] Updated weights on worker 0-0, policy_version 1067420 (0.00093) [2022-07-11 06:07:45,881][26022] Updated weights on worker 0-0, policy_version 1067430 (0.00088) [2022-07-11 06:07:47,032][25689] Fps is (10 sec: 5732.3, 60 sec: 5613.5, 300 sec: 5589.7). Total num frames: 1093054464. Throughput: 0: 5828.4. Samples: 1093057446. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:47,033][25689] Avg episode reward: [(0, '-1.760')] [2022-07-11 06:07:47,742][26022] Updated weights on worker 0-0, policy_version 1067440 (0.00084) [2022-07-11 06:07:49,596][26022] Updated weights on worker 0-0, policy_version 1067450 (0.00091) [2022-07-11 06:07:51,313][26022] Updated weights on worker 0-0, policy_version 1067460 (0.00097) [2022-07-11 06:07:52,053][25689] Fps is (10 sec: 5708.3, 60 sec: 5612.5, 300 sec: 5593.1). Total num frames: 1093083136. Throughput: 0: 5845.0. Samples: 1093091614. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:52,054][25689] Avg episode reward: [(0, '-1.947')] [2022-07-11 06:07:53,107][26022] Updated weights on worker 0-0, policy_version 1067470 (0.00085) [2022-07-11 06:07:55,040][26022] Updated weights on worker 0-0, policy_version 1067480 (0.00233) [2022-07-11 06:07:56,633][26022] Updated weights on worker 0-0, policy_version 1067490 (0.00084) [2022-07-11 06:07:57,081][25689] Fps is (10 sec: 5706.6, 60 sec: 5610.3, 300 sec: 5592.2). Total num frames: 1093111808. Throughput: 0: 5032.5. Samples: 1093108824. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:07:57,081][25689] Avg episode reward: [(0, '-1.732')] [2022-07-11 06:07:58,607][26022] Updated weights on worker 0-0, policy_version 1067500 (0.00085) [2022-07-11 06:08:00,300][26022] Updated weights on worker 0-0, policy_version 1067510 (0.00086) [2022-07-11 06:08:02,115][25689] Fps is (10 sec: 5292.3, 60 sec: 5563.7, 300 sec: 5592.1). Total num frames: 1093136384. Throughput: 0: 5923.6. Samples: 1093143030. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:08:02,115][25689] Avg episode reward: [(0, '-1.623')] [2022-07-11 06:08:02,588][26022] Updated weights on worker 0-0, policy_version 1067520 (0.00089) [2022-07-11 06:08:04,262][26022] Updated weights on worker 0-0, policy_version 1067530 (0.00084) [2022-07-11 06:08:06,059][26022] Updated weights on worker 0-0, policy_version 1067540 (0.00076) [2022-07-11 06:08:07,134][25689] Fps is (10 sec: 5398.8, 60 sec: 5612.9, 300 sec: 5599.3). Total num frames: 1093166080. Throughput: 0: 5845.5. Samples: 1093174948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:08:07,134][25689] Avg episode reward: [(0, '-2.462')] [2022-07-11 06:08:07,960][26022] Updated weights on worker 0-0, policy_version 1067550 (0.00089) [2022-07-11 06:08:09,677][26022] Updated weights on worker 0-0, policy_version 1067560 (0.00102) [2022-07-11 06:08:11,556][26022] Updated weights on worker 0-0, policy_version 1067570 (0.00080) [2022-07-11 06:08:12,163][25689] Fps is (10 sec: 5910.5, 60 sec: 5631.1, 300 sec: 5599.1). Total num frames: 1093195776. Throughput: 0: 4988.0. Samples: 1093191922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:08:12,164][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 06:08:13,341][26022] Updated weights on worker 0-0, policy_version 1067580 (0.00088) [2022-07-11 06:08:15,108][26022] Updated weights on worker 0-0, policy_version 1067590 (0.00087) [2022-07-11 06:08:17,085][26022] Updated weights on worker 0-0, policy_version 1067600 (0.00079) [2022-07-11 06:08:17,170][25689] Fps is (10 sec: 5611.8, 60 sec: 5614.1, 300 sec: 5596.6). Total num frames: 1093222400. Throughput: 0: 5826.3. Samples: 1093225866. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:08:17,170][25689] Avg episode reward: [(0, '-0.195')] [2022-07-11 06:08:18,669][26022] Updated weights on worker 0-0, policy_version 1067610 (0.00084) [2022-07-11 06:08:20,696][26022] Updated weights on worker 0-0, policy_version 1067620 (0.00082) [2022-07-11 06:08:22,230][25689] Fps is (10 sec: 5594.5, 60 sec: 5618.4, 300 sec: 5599.2). Total num frames: 1093252096. Throughput: 0: 5805.1. Samples: 1093259804. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:08:22,231][25689] Avg episode reward: [(0, '-0.186')] [2022-07-11 06:08:22,251][26022] Updated weights on worker 0-0, policy_version 1067630 (0.00084) [2022-07-11 06:08:24,343][26022] Updated weights on worker 0-0, policy_version 1067640 (0.00082) [2022-07-11 06:08:25,990][26022] Updated weights on worker 0-0, policy_version 1067650 (0.00101) [2022-07-11 06:08:27,254][25689] Fps is (10 sec: 5686.4, 60 sec: 5621.0, 300 sec: 5599.3). Total num frames: 1093279744. Throughput: 0: 5069.5. Samples: 1093276948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 06:08:27,255][25689] Avg episode reward: [(0, '-0.410')] [2022-07-11 06:08:27,827][26022] Updated weights on worker 0-0, policy_version 1067660 (0.00088) [2022-07-11 06:08:29,792][26022] Updated weights on worker 0-0, policy_version 1067670 (0.00086) [2022-07-11 06:08:31,325][26022] Updated weights on worker 0-0, policy_version 1067680 (0.00080) [2022-07-11 06:08:32,347][25689] Fps is (10 sec: 5668.5, 60 sec: 5598.7, 300 sec: 5601.2). Total num frames: 1093309440. Throughput: 0: 5890.4. Samples: 1093310810. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:08:32,347][25689] Avg episode reward: [(0, '1.048')] [2022-07-11 06:08:33,527][26022] Updated weights on worker 0-0, policy_version 1067690 (0.00108) [2022-07-11 06:08:34,860][26022] Updated weights on worker 0-0, policy_version 1067700 (0.00092) [2022-07-11 06:08:36,993][26022] Updated weights on worker 0-0, policy_version 1067710 (0.00080) [2022-07-11 06:08:37,382][25689] Fps is (10 sec: 5762.8, 60 sec: 5650.1, 300 sec: 5604.7). Total num frames: 1093338112. Throughput: 0: 5895.0. Samples: 1093345022. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:08:37,383][25689] Avg episode reward: [(0, '1.108')] [2022-07-11 06:08:38,542][26022] Updated weights on worker 0-0, policy_version 1067720 (0.00081) [2022-07-11 06:08:40,421][26022] Updated weights on worker 0-0, policy_version 1067730 (0.00084) [2022-07-11 06:08:42,322][26022] Updated weights on worker 0-0, policy_version 1067740 (0.00090) [2022-07-11 06:08:42,442][25689] Fps is (10 sec: 5680.4, 60 sec: 5642.4, 300 sec: 5604.3). Total num frames: 1093366784. Throughput: 0: 5069.1. Samples: 1093362256. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:08:42,442][25689] Avg episode reward: [(0, '1.186')] [2022-07-11 06:08:43,967][26022] Updated weights on worker 0-0, policy_version 1067750 (0.00084) [2022-07-11 06:08:45,988][26022] Updated weights on worker 0-0, policy_version 1067760 (0.00090) [2022-07-11 06:08:47,519][25689] Fps is (10 sec: 5556.1, 60 sec: 5620.5, 300 sec: 5599.5). Total num frames: 1093394432. Throughput: 0: 5876.9. Samples: 1093396044. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:08:47,519][25689] Avg episode reward: [(0, '0.712')] [2022-07-11 06:08:47,649][26022] Updated weights on worker 0-0, policy_version 1067770 (0.00085) [2022-07-11 06:08:49,384][26022] Updated weights on worker 0-0, policy_version 1067780 (0.00087) [2022-07-11 06:08:51,438][26022] Updated weights on worker 0-0, policy_version 1067790 (0.00083) [2022-07-11 06:08:52,554][25689] Fps is (10 sec: 5670.9, 60 sec: 5636.2, 300 sec: 5602.8). Total num frames: 1093424128. Throughput: 0: 5905.3. Samples: 1093430138. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:08:52,554][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 06:08:52,997][26022] Updated weights on worker 0-0, policy_version 1067800 (0.00087) [2022-07-11 06:08:55,107][26022] Updated weights on worker 0-0, policy_version 1067810 (0.00078) [2022-07-11 06:08:56,443][26022] Updated weights on worker 0-0, policy_version 1067820 (0.00090) [2022-07-11 06:08:57,579][25689] Fps is (10 sec: 5700.3, 60 sec: 5619.5, 300 sec: 5609.9). Total num frames: 1093451776. Throughput: 0: 5073.6. Samples: 1093447490. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:08:57,580][25689] Avg episode reward: [(0, '0.908')] [2022-07-11 06:08:58,564][26022] Updated weights on worker 0-0, policy_version 1067830 (0.00086) [2022-07-11 06:09:00,313][26022] Updated weights on worker 0-0, policy_version 1067840 (0.00087) [2022-07-11 06:09:02,343][26022] Updated weights on worker 0-0, policy_version 1067850 (0.00075) [2022-07-11 06:09:02,674][25689] Fps is (10 sec: 5463.8, 60 sec: 5664.5, 300 sec: 5608.2). Total num frames: 1093479424. Throughput: 0: 5909.6. Samples: 1093481820. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:02,674][25689] Avg episode reward: [(0, '1.291')] [2022-07-11 06:09:04,251][26022] Updated weights on worker 0-0, policy_version 1067860 (0.00086) [2022-07-11 06:09:05,907][26022] Updated weights on worker 0-0, policy_version 1067870 (0.00082) [2022-07-11 06:09:07,709][25689] Fps is (10 sec: 5458.3, 60 sec: 5629.2, 300 sec: 5604.2). Total num frames: 1093507072. Throughput: 0: 5842.8. Samples: 1093514012. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:07,710][25689] Avg episode reward: [(0, '1.503')] [2022-07-11 06:09:07,916][26022] Updated weights on worker 0-0, policy_version 1067880 (0.00083) [2022-07-11 06:09:09,591][26022] Updated weights on worker 0-0, policy_version 1067890 (0.00088) [2022-07-11 06:09:11,372][26022] Updated weights on worker 0-0, policy_version 1067900 (0.00087) [2022-07-11 06:09:12,768][25689] Fps is (10 sec: 5782.3, 60 sec: 5643.4, 300 sec: 5613.6). Total num frames: 1093537792. Throughput: 0: 5002.8. Samples: 1093531268. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:12,769][25689] Avg episode reward: [(0, '1.396')] [2022-07-11 06:09:13,104][26022] Updated weights on worker 0-0, policy_version 1067910 (0.00099) [2022-07-11 06:09:14,835][26022] Updated weights on worker 0-0, policy_version 1067920 (0.00078) [2022-07-11 06:09:16,687][26022] Updated weights on worker 0-0, policy_version 1067930 (0.00089) [2022-07-11 06:09:17,799][25689] Fps is (10 sec: 5785.1, 60 sec: 5658.0, 300 sec: 5610.6). Total num frames: 1093565440. Throughput: 0: 5840.2. Samples: 1093565578. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:17,799][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 06:09:18,552][26022] Updated weights on worker 0-0, policy_version 1067940 (0.00092) [2022-07-11 06:09:20,618][26022] Updated weights on worker 0-0, policy_version 1067950 (0.00084) [2022-07-11 06:09:22,259][26022] Updated weights on worker 0-0, policy_version 1067960 (0.00095) [2022-07-11 06:09:22,871][25689] Fps is (10 sec: 5574.5, 60 sec: 5640.0, 300 sec: 5616.2). Total num frames: 1093594112. Throughput: 0: 5824.4. Samples: 1093599458. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:22,873][25689] Avg episode reward: [(0, '0.782')] [2022-07-11 06:09:24,035][26022] Updated weights on worker 0-0, policy_version 1067970 (0.00087) [2022-07-11 06:09:26,107][26022] Updated weights on worker 0-0, policy_version 1067980 (0.00080) [2022-07-11 06:09:27,596][26022] Updated weights on worker 0-0, policy_version 1067990 (0.00089) [2022-07-11 06:09:27,903][25689] Fps is (10 sec: 5776.7, 60 sec: 5673.1, 300 sec: 5616.3). Total num frames: 1093623808. Throughput: 0: 5066.0. Samples: 1093616312. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:27,903][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 06:09:29,595][26022] Updated weights on worker 0-0, policy_version 1068000 (0.00092) [2022-07-11 06:09:31,105][26022] Updated weights on worker 0-0, policy_version 1068010 (0.00097) [2022-07-11 06:09:32,907][25689] Fps is (10 sec: 5611.8, 60 sec: 5630.6, 300 sec: 5613.4). Total num frames: 1093650432. Throughput: 0: 5923.0. Samples: 1093650552. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:32,908][25689] Avg episode reward: [(0, '0.367')] [2022-07-11 06:09:33,088][26022] Updated weights on worker 0-0, policy_version 1068020 (0.00086) [2022-07-11 06:09:34,893][26022] Updated weights on worker 0-0, policy_version 1068030 (0.00086) [2022-07-11 06:09:36,626][26022] Updated weights on worker 0-0, policy_version 1068040 (0.00096) [2022-07-11 06:09:37,914][25689] Fps is (10 sec: 5625.7, 60 sec: 5650.2, 300 sec: 5619.2). Total num frames: 1093680128. Throughput: 0: 5936.8. Samples: 1093684998. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:37,916][25689] Avg episode reward: [(0, '0.338')] [2022-07-11 06:09:38,484][26022] Updated weights on worker 0-0, policy_version 1068050 (0.00086) [2022-07-11 06:09:39,988][26022] Updated weights on worker 0-0, policy_version 1068060 (0.00090) [2022-07-11 06:09:40,256][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:09:40,270][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001068061_1093694464.pth [2022-07-11 06:09:40,270][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001066085_1091671040.pth [2022-07-11 06:09:42,023][26022] Updated weights on worker 0-0, policy_version 1068070 (0.00086) [2022-07-11 06:09:42,984][25689] Fps is (10 sec: 5893.8, 60 sec: 5666.1, 300 sec: 5621.6). Total num frames: 1093709824. Throughput: 0: 5098.0. Samples: 1093701994. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:42,985][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 06:09:43,943][26022] Updated weights on worker 0-0, policy_version 1068080 (0.00084) [2022-07-11 06:09:45,590][26022] Updated weights on worker 0-0, policy_version 1068090 (0.00092) [2022-07-11 06:09:47,646][26022] Updated weights on worker 0-0, policy_version 1068100 (0.00092) [2022-07-11 06:09:48,033][25689] Fps is (10 sec: 5565.5, 60 sec: 5651.8, 300 sec: 5614.7). Total num frames: 1093736448. Throughput: 0: 5947.0. Samples: 1093736030. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:48,034][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 06:09:49,391][26022] Updated weights on worker 0-0, policy_version 1068110 (0.00083) [2022-07-11 06:09:51,228][26022] Updated weights on worker 0-0, policy_version 1068120 (0.00084) [2022-07-11 06:09:52,902][26022] Updated weights on worker 0-0, policy_version 1068130 (0.00078) [2022-07-11 06:09:53,059][25689] Fps is (10 sec: 5488.8, 60 sec: 5635.8, 300 sec: 5618.8). Total num frames: 1093765120. Throughput: 0: 5907.3. Samples: 1093769592. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:53,059][25689] Avg episode reward: [(0, '1.331')] [2022-07-11 06:09:54,785][26022] Updated weights on worker 0-0, policy_version 1068140 (0.00086) [2022-07-11 06:09:56,823][26022] Updated weights on worker 0-0, policy_version 1068150 (0.00082) [2022-07-11 06:09:58,087][25689] Fps is (10 sec: 5805.8, 60 sec: 5669.3, 300 sec: 5623.5). Total num frames: 1093794816. Throughput: 0: 5039.3. Samples: 1093786656. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:09:58,087][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 06:09:58,347][26022] Updated weights on worker 0-0, policy_version 1068160 (0.00086) [2022-07-11 06:10:00,290][26022] Updated weights on worker 0-0, policy_version 1068170 (0.00089) [2022-07-11 06:10:02,108][26022] Updated weights on worker 0-0, policy_version 1068180 (0.00089) [2022-07-11 06:10:03,216][25689] Fps is (10 sec: 5444.0, 60 sec: 5632.3, 300 sec: 5618.8). Total num frames: 1093820416. Throughput: 0: 5877.6. Samples: 1093820906. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:03,217][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 06:10:04,090][26022] Updated weights on worker 0-0, policy_version 1068190 (0.00095) [2022-07-11 06:10:06,006][26022] Updated weights on worker 0-0, policy_version 1068200 (0.00111) [2022-07-11 06:10:07,645][26022] Updated weights on worker 0-0, policy_version 1068210 (0.00100) [2022-07-11 06:10:08,262][25689] Fps is (10 sec: 5434.3, 60 sec: 5665.1, 300 sec: 5625.0). Total num frames: 1093850112. Throughput: 0: 5792.2. Samples: 1093853200. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:08,263][25689] Avg episode reward: [(0, '0.031')] [2022-07-11 06:10:09,536][26022] Updated weights on worker 0-0, policy_version 1068220 (0.00089) [2022-07-11 06:10:11,220][26022] Updated weights on worker 0-0, policy_version 1068230 (0.00093) [2022-07-11 06:10:13,211][26022] Updated weights on worker 0-0, policy_version 1068240 (0.00053) [2022-07-11 06:10:13,290][25689] Fps is (10 sec: 5793.8, 60 sec: 5634.2, 300 sec: 5624.9). Total num frames: 1093878784. Throughput: 0: 5841.5. Samples: 1093887774. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:13,291][25689] Avg episode reward: [(0, '0.643')] [2022-07-11 06:10:14,751][26022] Updated weights on worker 0-0, policy_version 1068250 (0.00083) [2022-07-11 06:10:16,665][26022] Updated weights on worker 0-0, policy_version 1068260 (0.00084) [2022-07-11 06:10:18,308][25689] Fps is (10 sec: 5708.1, 60 sec: 5652.3, 300 sec: 5627.1). Total num frames: 1093907456. Throughput: 0: 5841.6. Samples: 1093904782. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:18,310][25689] Avg episode reward: [(0, '-0.600')] [2022-07-11 06:10:18,561][26022] Updated weights on worker 0-0, policy_version 1068270 (0.00085) [2022-07-11 06:10:20,303][26022] Updated weights on worker 0-0, policy_version 1068280 (0.00095) [2022-07-11 06:10:22,238][26022] Updated weights on worker 0-0, policy_version 1068290 (0.00084) [2022-07-11 06:10:23,387][25689] Fps is (10 sec: 5577.8, 60 sec: 5634.8, 300 sec: 5622.2). Total num frames: 1093935104. Throughput: 0: 5834.1. Samples: 1093938588. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:23,388][25689] Avg episode reward: [(0, '-0.447')] [2022-07-11 06:10:23,867][26022] Updated weights on worker 0-0, policy_version 1068300 (0.00088) [2022-07-11 06:10:25,860][26022] Updated weights on worker 0-0, policy_version 1068310 (0.00085) [2022-07-11 06:10:27,594][26022] Updated weights on worker 0-0, policy_version 1068320 (0.00088) [2022-07-11 06:10:28,412][25689] Fps is (10 sec: 5574.4, 60 sec: 5618.5, 300 sec: 5626.0). Total num frames: 1093963776. Throughput: 0: 5919.0. Samples: 1093972464. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:28,413][25689] Avg episode reward: [(0, '0.390')] [2022-07-11 06:10:29,482][26022] Updated weights on worker 0-0, policy_version 1068330 (0.00090) [2022-07-11 06:10:31,308][26022] Updated weights on worker 0-0, policy_version 1068340 (0.00096) [2022-07-11 06:10:32,972][26022] Updated weights on worker 0-0, policy_version 1068350 (0.00094) [2022-07-11 06:10:33,504][25689] Fps is (10 sec: 5769.5, 60 sec: 5661.0, 300 sec: 5625.1). Total num frames: 1093993472. Throughput: 0: 5028.3. Samples: 1093989414. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:33,505][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 06:10:34,876][26022] Updated weights on worker 0-0, policy_version 1068360 (0.00090) [2022-07-11 06:10:36,513][26022] Updated weights on worker 0-0, policy_version 1068370 (0.00091) [2022-07-11 06:10:38,447][26022] Updated weights on worker 0-0, policy_version 1068380 (0.00094) [2022-07-11 06:10:38,517][25689] Fps is (10 sec: 5674.3, 60 sec: 5626.6, 300 sec: 5623.6). Total num frames: 1094021120. Throughput: 0: 5881.6. Samples: 1094023644. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:38,519][25689] Avg episode reward: [(0, '-0.215')] [2022-07-11 06:10:40,218][26022] Updated weights on worker 0-0, policy_version 1068390 (0.00086) [2022-07-11 06:10:41,984][26022] Updated weights on worker 0-0, policy_version 1068400 (0.00080) [2022-07-11 06:10:43,598][25689] Fps is (10 sec: 5478.1, 60 sec: 5591.9, 300 sec: 5618.9). Total num frames: 1094048768. Throughput: 0: 5884.0. Samples: 1094057508. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:43,599][25689] Avg episode reward: [(0, '-1.860')] [2022-07-11 06:10:43,858][26022] Updated weights on worker 0-0, policy_version 1068410 (0.00089) [2022-07-11 06:10:45,784][26022] Updated weights on worker 0-0, policy_version 1068420 (0.00083) [2022-07-11 06:10:47,420][26022] Updated weights on worker 0-0, policy_version 1068430 (0.00081) [2022-07-11 06:10:48,629][25689] Fps is (10 sec: 5772.4, 60 sec: 5661.1, 300 sec: 5629.4). Total num frames: 1094079488. Throughput: 0: 5055.6. Samples: 1094074676. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:48,631][25689] Avg episode reward: [(0, '-0.739')] [2022-07-11 06:10:49,586][26022] Updated weights on worker 0-0, policy_version 1068440 (0.00084) [2022-07-11 06:10:51,121][26022] Updated weights on worker 0-0, policy_version 1068450 (0.00088) [2022-07-11 06:10:53,183][26022] Updated weights on worker 0-0, policy_version 1068460 (0.00090) [2022-07-11 06:10:53,667][25689] Fps is (10 sec: 5695.3, 60 sec: 5626.2, 300 sec: 5619.9). Total num frames: 1094106112. Throughput: 0: 5900.4. Samples: 1094108382. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:53,667][25689] Avg episode reward: [(0, '-1.327')] [2022-07-11 06:10:54,670][26022] Updated weights on worker 0-0, policy_version 1068470 (0.00084) [2022-07-11 06:10:56,603][26022] Updated weights on worker 0-0, policy_version 1068480 (0.00088) [2022-07-11 06:10:58,371][26022] Updated weights on worker 0-0, policy_version 1068490 (0.00476) [2022-07-11 06:10:58,669][25689] Fps is (10 sec: 5507.7, 60 sec: 5611.7, 300 sec: 5621.1). Total num frames: 1094134784. Throughput: 0: 5901.8. Samples: 1094142574. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:10:58,669][25689] Avg episode reward: [(0, '-1.667')] [2022-07-11 06:11:00,099][26022] Updated weights on worker 0-0, policy_version 1068500 (0.00084) [2022-07-11 06:11:02,405][26022] Updated weights on worker 0-0, policy_version 1068510 (0.00084) [2022-07-11 06:11:03,728][25689] Fps is (10 sec: 5597.7, 60 sec: 5652.0, 300 sec: 5623.8). Total num frames: 1094162432. Throughput: 0: 5080.0. Samples: 1094159768. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:03,729][25689] Avg episode reward: [(0, '-1.579')] [2022-07-11 06:11:04,011][26022] Updated weights on worker 0-0, policy_version 1068520 (0.00084) [2022-07-11 06:11:05,903][26022] Updated weights on worker 0-0, policy_version 1068530 (0.00086) [2022-07-11 06:11:07,833][26022] Updated weights on worker 0-0, policy_version 1068540 (0.00089) [2022-07-11 06:11:08,771][25689] Fps is (10 sec: 5473.8, 60 sec: 5618.5, 300 sec: 5627.3). Total num frames: 1094190080. Throughput: 0: 5825.0. Samples: 1094192002. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:08,771][25689] Avg episode reward: [(0, '-0.673')] [2022-07-11 06:11:09,359][26022] Updated weights on worker 0-0, policy_version 1068550 (0.00092) [2022-07-11 06:11:11,334][26022] Updated weights on worker 0-0, policy_version 1068560 (0.00098) [2022-07-11 06:11:13,027][26022] Updated weights on worker 0-0, policy_version 1068570 (0.00088) [2022-07-11 06:11:13,799][25689] Fps is (10 sec: 5694.1, 60 sec: 5635.4, 300 sec: 5624.8). Total num frames: 1094219776. Throughput: 0: 5874.2. Samples: 1094226642. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:13,801][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 06:11:14,854][26022] Updated weights on worker 0-0, policy_version 1068580 (0.00090) [2022-07-11 06:11:16,582][26022] Updated weights on worker 0-0, policy_version 1068590 (0.00085) [2022-07-11 06:11:18,419][26022] Updated weights on worker 0-0, policy_version 1068600 (0.00090) [2022-07-11 06:11:18,826][25689] Fps is (10 sec: 5804.7, 60 sec: 5634.6, 300 sec: 5628.9). Total num frames: 1094248448. Throughput: 0: 5017.0. Samples: 1094243702. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:18,828][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 06:11:20,035][26022] Updated weights on worker 0-0, policy_version 1068610 (0.00093) [2022-07-11 06:11:22,120][26022] Updated weights on worker 0-0, policy_version 1068620 (0.00087) [2022-07-11 06:11:23,712][26022] Updated weights on worker 0-0, policy_version 1068630 (0.00080) [2022-07-11 06:11:23,949][25689] Fps is (10 sec: 5649.4, 60 sec: 5647.4, 300 sec: 5631.7). Total num frames: 1094277120. Throughput: 0: 5822.8. Samples: 1094277512. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:23,951][25689] Avg episode reward: [(0, '1.727')] [2022-07-11 06:11:25,708][26022] Updated weights on worker 0-0, policy_version 1068640 (0.00094) [2022-07-11 06:11:27,364][26022] Updated weights on worker 0-0, policy_version 1068650 (0.00086) [2022-07-11 06:11:28,985][25689] Fps is (10 sec: 5644.5, 60 sec: 5646.3, 300 sec: 5627.7). Total num frames: 1094305792. Throughput: 0: 5926.1. Samples: 1094311796. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:28,987][25689] Avg episode reward: [(0, '2.158')] [2022-07-11 06:11:29,274][26022] Updated weights on worker 0-0, policy_version 1068660 (0.00081) [2022-07-11 06:11:31,083][26022] Updated weights on worker 0-0, policy_version 1068670 (0.00087) [2022-07-11 06:11:32,935][26022] Updated weights on worker 0-0, policy_version 1068680 (0.00096) [2022-07-11 06:11:34,078][25689] Fps is (10 sec: 5560.5, 60 sec: 5612.4, 300 sec: 5626.5). Total num frames: 1094333440. Throughput: 0: 5037.5. Samples: 1094328798. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:34,078][25689] Avg episode reward: [(0, '1.962')] [2022-07-11 06:11:34,647][26022] Updated weights on worker 0-0, policy_version 1068690 (0.00087) [2022-07-11 06:11:36,523][26022] Updated weights on worker 0-0, policy_version 1068700 (0.00083) [2022-07-11 06:11:38,131][26022] Updated weights on worker 0-0, policy_version 1068710 (0.00079) [2022-07-11 06:11:39,098][25689] Fps is (10 sec: 5569.3, 60 sec: 5628.7, 300 sec: 5628.0). Total num frames: 1094362112. Throughput: 0: 5887.7. Samples: 1094363056. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:39,100][25689] Avg episode reward: [(0, '1.498')] [2022-07-11 06:11:40,080][26022] Updated weights on worker 0-0, policy_version 1068720 (0.00096) [2022-07-11 06:11:40,375][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:11:40,390][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001068721_1094370304.pth [2022-07-11 06:11:40,391][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001066740_1092341760.pth [2022-07-11 06:11:41,678][26022] Updated weights on worker 0-0, policy_version 1068730 (0.00088) [2022-07-11 06:11:43,864][26022] Updated weights on worker 0-0, policy_version 1068740 (0.00079) [2022-07-11 06:11:44,205][25689] Fps is (10 sec: 5864.4, 60 sec: 5676.9, 300 sec: 5637.0). Total num frames: 1094392832. Throughput: 0: 5919.8. Samples: 1094397424. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:44,207][25689] Avg episode reward: [(0, '1.527')] [2022-07-11 06:11:45,212][26022] Updated weights on worker 0-0, policy_version 1068750 (0.00084) [2022-07-11 06:11:47,304][26022] Updated weights on worker 0-0, policy_version 1068760 (0.00081) [2022-07-11 06:11:48,747][26022] Updated weights on worker 0-0, policy_version 1068770 (0.00081) [2022-07-11 06:11:49,266][25689] Fps is (10 sec: 5841.1, 60 sec: 5640.4, 300 sec: 5636.1). Total num frames: 1094421504. Throughput: 0: 5074.6. Samples: 1094414712. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:49,268][25689] Avg episode reward: [(0, '0.777')] [2022-07-11 06:11:50,926][26022] Updated weights on worker 0-0, policy_version 1068780 (0.00084) [2022-07-11 06:11:52,513][26022] Updated weights on worker 0-0, policy_version 1068790 (0.00087) [2022-07-11 06:11:54,253][26022] Updated weights on worker 0-0, policy_version 1068800 (0.00099) [2022-07-11 06:11:54,291][25689] Fps is (10 sec: 5787.5, 60 sec: 5692.3, 300 sec: 5639.1). Total num frames: 1094451200. Throughput: 0: 5954.9. Samples: 1094449164. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:54,292][25689] Avg episode reward: [(0, '-0.056')] [2022-07-11 06:11:55,964][26022] Updated weights on worker 0-0, policy_version 1068810 (0.00081) [2022-07-11 06:11:57,849][26022] Updated weights on worker 0-0, policy_version 1068820 (0.00082) [2022-07-11 06:11:59,293][25689] Fps is (10 sec: 5718.7, 60 sec: 5675.3, 300 sec: 5640.5). Total num frames: 1094478848. Throughput: 0: 5984.8. Samples: 1094483922. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:11:59,294][25689] Avg episode reward: [(0, '-0.005')] [2022-07-11 06:11:59,624][26022] Updated weights on worker 0-0, policy_version 1068830 (0.00085) [2022-07-11 06:12:02,001][26022] Updated weights on worker 0-0, policy_version 1068840 (0.00083) [2022-07-11 06:12:03,434][26022] Updated weights on worker 0-0, policy_version 1068850 (0.00080) [2022-07-11 06:12:04,346][25689] Fps is (10 sec: 5499.2, 60 sec: 5675.9, 300 sec: 5643.0). Total num frames: 1094506496. Throughput: 0: 5122.7. Samples: 1094500598. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:12:04,346][25689] Avg episode reward: [(0, '0.197')] [2022-07-11 06:12:05,537][26022] Updated weights on worker 0-0, policy_version 1068860 (0.00081) [2022-07-11 06:12:07,110][26022] Updated weights on worker 0-0, policy_version 1068870 (0.00091) [2022-07-11 06:12:09,072][26022] Updated weights on worker 0-0, policy_version 1068880 (0.00098) [2022-07-11 06:12:09,353][25689] Fps is (10 sec: 5496.9, 60 sec: 5679.3, 300 sec: 5640.3). Total num frames: 1094534144. Throughput: 0: 5896.8. Samples: 1094533160. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:12:09,353][25689] Avg episode reward: [(0, '-0.268')] [2022-07-11 06:12:10,921][26022] Updated weights on worker 0-0, policy_version 1068890 (0.00096) [2022-07-11 06:12:12,819][26022] Updated weights on worker 0-0, policy_version 1068900 (0.00080) [2022-07-11 06:12:14,380][25689] Fps is (10 sec: 5613.0, 60 sec: 5662.5, 300 sec: 5643.3). Total num frames: 1094562816. Throughput: 0: 5860.5. Samples: 1094566896. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:12:14,380][25689] Avg episode reward: [(0, '0.195')] [2022-07-11 06:12:14,475][26022] Updated weights on worker 0-0, policy_version 1068910 (0.00363) [2022-07-11 06:12:16,384][26022] Updated weights on worker 0-0, policy_version 1068920 (0.00081) [2022-07-11 06:12:17,994][26022] Updated weights on worker 0-0, policy_version 1068930 (0.00056) [2022-07-11 06:12:19,403][25689] Fps is (10 sec: 5807.9, 60 sec: 5679.8, 300 sec: 5644.9). Total num frames: 1094592512. Throughput: 0: 4981.9. Samples: 1094584104. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:12:19,404][25689] Avg episode reward: [(0, '1.092')] [2022-07-11 06:12:19,933][26022] Updated weights on worker 0-0, policy_version 1068940 (0.00080) [2022-07-11 06:12:21,575][26022] Updated weights on worker 0-0, policy_version 1068950 (0.00090) [2022-07-11 06:12:23,612][26022] Updated weights on worker 0-0, policy_version 1068960 (0.00086) [2022-07-11 06:12:24,562][25689] Fps is (10 sec: 5732.5, 60 sec: 5676.5, 300 sec: 5646.3). Total num frames: 1094621184. Throughput: 0: 5823.2. Samples: 1094618318. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:12:24,562][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 06:12:25,323][26022] Updated weights on worker 0-0, policy_version 1068970 (0.00084) [2022-07-11 06:12:27,094][26022] Updated weights on worker 0-0, policy_version 1068980 (0.00086) [2022-07-11 06:12:28,878][26022] Updated weights on worker 0-0, policy_version 1068990 (0.00081) [2022-07-11 06:12:29,566][25689] Fps is (10 sec: 5541.7, 60 sec: 5662.6, 300 sec: 5636.6). Total num frames: 1094648832. Throughput: 0: 5919.0. Samples: 1094652798. Policy #0 lag: (min: 0.0, avg: 7.3, max: 18.0) [2022-07-11 06:12:29,567][25689] Avg episode reward: [(0, '-0.227')] [2022-07-11 06:12:30,631][26022] Updated weights on worker 0-0, policy_version 1069000 (0.00084) [2022-07-11 06:12:32,414][26022] Updated weights on worker 0-0, policy_version 1069010 (0.00086) [2022-07-11 06:12:34,211][26022] Updated weights on worker 0-0, policy_version 1069020 (0.00095) [2022-07-11 06:12:34,598][25689] Fps is (10 sec: 5611.8, 60 sec: 5685.1, 300 sec: 5647.1). Total num frames: 1094677504. Throughput: 0: 5951.7. Samples: 1094687226. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:12:34,598][25689] Avg episode reward: [(0, '-0.854')] [2022-07-11 06:12:35,834][26022] Updated weights on worker 0-0, policy_version 1069030 (0.00082) [2022-07-11 06:12:38,017][26022] Updated weights on worker 0-0, policy_version 1069040 (0.00093) [2022-07-11 06:12:39,515][26022] Updated weights on worker 0-0, policy_version 1069050 (0.00087) [2022-07-11 06:12:39,641][25689] Fps is (10 sec: 5792.9, 60 sec: 5699.9, 300 sec: 5649.3). Total num frames: 1094707200. Throughput: 0: 5926.5. Samples: 1094704048. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:12:39,643][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 06:12:41,584][26022] Updated weights on worker 0-0, policy_version 1069060 (0.00090) [2022-07-11 06:12:43,140][26022] Updated weights on worker 0-0, policy_version 1069070 (0.00081) [2022-07-11 06:12:44,694][25689] Fps is (10 sec: 5679.8, 60 sec: 5654.2, 300 sec: 5645.3). Total num frames: 1094734848. Throughput: 0: 5958.4. Samples: 1094738274. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:12:44,696][25689] Avg episode reward: [(0, '0.125')] [2022-07-11 06:12:45,027][26022] Updated weights on worker 0-0, policy_version 1069080 (0.00078) [2022-07-11 06:12:46,802][26022] Updated weights on worker 0-0, policy_version 1069090 (0.00085) [2022-07-11 06:12:48,533][26022] Updated weights on worker 0-0, policy_version 1069100 (0.00082) [2022-07-11 06:12:49,705][25689] Fps is (10 sec: 5697.9, 60 sec: 5675.8, 300 sec: 5648.9). Total num frames: 1094764544. Throughput: 0: 5947.3. Samples: 1094772576. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:12:49,706][25689] Avg episode reward: [(0, '-0.181')] [2022-07-11 06:12:50,454][26022] Updated weights on worker 0-0, policy_version 1069110 (0.00369) [2022-07-11 06:12:52,165][26022] Updated weights on worker 0-0, policy_version 1069120 (0.00094) [2022-07-11 06:12:53,891][26022] Updated weights on worker 0-0, policy_version 1069130 (0.00087) [2022-07-11 06:12:54,740][25689] Fps is (10 sec: 5708.0, 60 sec: 5640.9, 300 sec: 5645.3). Total num frames: 1094792192. Throughput: 0: 5068.2. Samples: 1094789312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:12:54,742][25689] Avg episode reward: [(0, '0.380')] [2022-07-11 06:12:56,168][26022] Updated weights on worker 0-0, policy_version 1069140 (0.00089) [2022-07-11 06:12:57,479][26022] Updated weights on worker 0-0, policy_version 1069150 (0.00084) [2022-07-11 06:12:59,594][26022] Updated weights on worker 0-0, policy_version 1069160 (0.00082) [2022-07-11 06:12:59,795][25689] Fps is (10 sec: 5480.6, 60 sec: 5636.1, 300 sec: 5655.3). Total num frames: 1094819840. Throughput: 0: 5905.4. Samples: 1094823062. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:12:59,795][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 06:13:01,171][26022] Updated weights on worker 0-0, policy_version 1069170 (0.00084) [2022-07-11 06:13:03,634][26022] Updated weights on worker 0-0, policy_version 1069180 (0.00094) [2022-07-11 06:13:04,885][25689] Fps is (10 sec: 5450.5, 60 sec: 5632.6, 300 sec: 5647.0). Total num frames: 1094847488. Throughput: 0: 5773.2. Samples: 1094854842. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:04,887][25689] Avg episode reward: [(0, '0.736')] [2022-07-11 06:13:05,389][26022] Updated weights on worker 0-0, policy_version 1069190 (0.00088) [2022-07-11 06:13:07,231][26022] Updated weights on worker 0-0, policy_version 1069200 (0.00085) [2022-07-11 06:13:08,831][26022] Updated weights on worker 0-0, policy_version 1069210 (0.00084) [2022-07-11 06:13:09,922][25689] Fps is (10 sec: 5460.1, 60 sec: 5629.8, 300 sec: 5640.0). Total num frames: 1094875136. Throughput: 0: 4918.1. Samples: 1094872004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:09,923][25689] Avg episode reward: [(0, '0.085')] [2022-07-11 06:13:10,908][26022] Updated weights on worker 0-0, policy_version 1069220 (0.00084) [2022-07-11 06:13:12,370][26022] Updated weights on worker 0-0, policy_version 1069230 (0.00101) [2022-07-11 06:13:14,469][26022] Updated weights on worker 0-0, policy_version 1069240 (0.00080) [2022-07-11 06:13:14,930][25689] Fps is (10 sec: 5606.9, 60 sec: 5631.6, 300 sec: 5646.8). Total num frames: 1094903808. Throughput: 0: 5793.8. Samples: 1094906288. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:14,932][25689] Avg episode reward: [(0, '0.104')] [2022-07-11 06:13:16,032][26022] Updated weights on worker 0-0, policy_version 1069250 (0.00087) [2022-07-11 06:13:18,043][26022] Updated weights on worker 0-0, policy_version 1069260 (0.00082) [2022-07-11 06:13:19,554][26022] Updated weights on worker 0-0, policy_version 1069270 (0.00095) [2022-07-11 06:13:19,937][25689] Fps is (10 sec: 5828.1, 60 sec: 5633.0, 300 sec: 5647.9). Total num frames: 1094933504. Throughput: 0: 5826.0. Samples: 1094940412. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:19,939][25689] Avg episode reward: [(0, '0.098')] [2022-07-11 06:13:21,635][26022] Updated weights on worker 0-0, policy_version 1069280 (0.00085) [2022-07-11 06:13:23,222][26022] Updated weights on worker 0-0, policy_version 1069290 (0.00083) [2022-07-11 06:13:25,048][25689] Fps is (10 sec: 5667.6, 60 sec: 5620.6, 300 sec: 5646.2). Total num frames: 1094961152. Throughput: 0: 5095.2. Samples: 1094957574. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:25,048][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 06:13:25,285][26022] Updated weights on worker 0-0, policy_version 1069300 (0.00090) [2022-07-11 06:13:26,996][26022] Updated weights on worker 0-0, policy_version 1069310 (0.00083) [2022-07-11 06:13:28,882][26022] Updated weights on worker 0-0, policy_version 1069320 (0.00085) [2022-07-11 06:13:30,093][25689] Fps is (10 sec: 5646.6, 60 sec: 5650.6, 300 sec: 5647.1). Total num frames: 1094990848. Throughput: 0: 5933.2. Samples: 1094991678. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:30,093][25689] Avg episode reward: [(0, '-0.197')] [2022-07-11 06:13:30,595][26022] Updated weights on worker 0-0, policy_version 1069330 (0.00085) [2022-07-11 06:13:32,400][26022] Updated weights on worker 0-0, policy_version 1069340 (0.00084) [2022-07-11 06:13:34,283][26022] Updated weights on worker 0-0, policy_version 1069350 (0.00083) [2022-07-11 06:13:35,163][25689] Fps is (10 sec: 5669.4, 60 sec: 5630.2, 300 sec: 5643.0). Total num frames: 1095018496. Throughput: 0: 5903.1. Samples: 1095025722. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:35,163][25689] Avg episode reward: [(0, '-0.526')] [2022-07-11 06:13:36,040][26022] Updated weights on worker 0-0, policy_version 1069360 (0.00086) [2022-07-11 06:13:37,975][26022] Updated weights on worker 0-0, policy_version 1069370 (0.00090) [2022-07-11 06:13:39,614][26022] Updated weights on worker 0-0, policy_version 1069380 (0.00085) [2022-07-11 06:13:40,175][25689] Fps is (10 sec: 5687.7, 60 sec: 5633.1, 300 sec: 5647.4). Total num frames: 1095048192. Throughput: 0: 5057.6. Samples: 1095042766. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:40,175][25689] Avg episode reward: [(0, '-0.081')] [2022-07-11 06:13:40,597][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:13:40,606][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001069385_1095050240.pth [2022-07-11 06:13:40,607][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001067397_1093014528.pth [2022-07-11 06:13:41,340][26022] Updated weights on worker 0-0, policy_version 1069390 (0.00082) [2022-07-11 06:13:43,263][26022] Updated weights on worker 0-0, policy_version 1069400 (0.00086) [2022-07-11 06:13:44,966][26022] Updated weights on worker 0-0, policy_version 1069410 (0.00088) [2022-07-11 06:13:45,251][25689] Fps is (10 sec: 5887.3, 60 sec: 5664.8, 300 sec: 5654.3). Total num frames: 1095077888. Throughput: 0: 5917.1. Samples: 1095077116. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:45,251][25689] Avg episode reward: [(0, '-0.583')] [2022-07-11 06:13:46,800][26022] Updated weights on worker 0-0, policy_version 1069420 (0.00083) [2022-07-11 06:13:48,418][26022] Updated weights on worker 0-0, policy_version 1069430 (0.00086) [2022-07-11 06:13:50,252][25689] Fps is (10 sec: 5588.9, 60 sec: 5615.0, 300 sec: 5644.6). Total num frames: 1095104512. Throughput: 0: 5966.4. Samples: 1095111956. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:50,252][25689] Avg episode reward: [(0, '-0.655')] [2022-07-11 06:13:50,402][26022] Updated weights on worker 0-0, policy_version 1069440 (0.00093) [2022-07-11 06:13:52,186][26022] Updated weights on worker 0-0, policy_version 1069450 (0.00085) [2022-07-11 06:13:54,039][26022] Updated weights on worker 0-0, policy_version 1069460 (0.00091) [2022-07-11 06:13:55,265][25689] Fps is (10 sec: 5623.8, 60 sec: 5650.8, 300 sec: 5651.7). Total num frames: 1095134208. Throughput: 0: 5122.0. Samples: 1095128688. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:13:55,266][25689] Avg episode reward: [(0, '0.457')] [2022-07-11 06:13:55,820][26022] Updated weights on worker 0-0, policy_version 1069470 (0.00091) [2022-07-11 06:13:57,676][26022] Updated weights on worker 0-0, policy_version 1069480 (0.00090) [2022-07-11 06:13:59,494][26022] Updated weights on worker 0-0, policy_version 1069490 (0.00081) [2022-07-11 06:14:00,303][25689] Fps is (10 sec: 5807.1, 60 sec: 5669.3, 300 sec: 5656.2). Total num frames: 1095162880. Throughput: 0: 5947.4. Samples: 1095162478. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:00,303][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 06:14:01,402][26022] Updated weights on worker 0-0, policy_version 1069500 (0.00084) [2022-07-11 06:14:03,210][26022] Updated weights on worker 0-0, policy_version 1069510 (0.00090) [2022-07-11 06:14:05,248][26022] Updated weights on worker 0-0, policy_version 1069520 (0.00087) [2022-07-11 06:14:05,383][25689] Fps is (10 sec: 5364.2, 60 sec: 5636.5, 300 sec: 5648.5). Total num frames: 1095188480. Throughput: 0: 5841.0. Samples: 1095194706. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:05,383][25689] Avg episode reward: [(0, '-0.232')] [2022-07-11 06:14:06,651][26022] Updated weights on worker 0-0, policy_version 1069530 (0.00088) [2022-07-11 06:14:08,671][26022] Updated weights on worker 0-0, policy_version 1069540 (0.00083) [2022-07-11 06:14:10,373][26022] Updated weights on worker 0-0, policy_version 1069550 (0.00080) [2022-07-11 06:14:10,427][25689] Fps is (10 sec: 5562.9, 60 sec: 5686.5, 300 sec: 5648.8). Total num frames: 1095219200. Throughput: 0: 4968.1. Samples: 1095212190. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:10,428][25689] Avg episode reward: [(0, '0.628')] [2022-07-11 06:14:12,141][26022] Updated weights on worker 0-0, policy_version 1069560 (0.00086) [2022-07-11 06:14:14,100][26022] Updated weights on worker 0-0, policy_version 1069570 (0.00085) [2022-07-11 06:14:15,445][25689] Fps is (10 sec: 5800.6, 60 sec: 5668.7, 300 sec: 5649.0). Total num frames: 1095246848. Throughput: 0: 5842.0. Samples: 1095246578. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:15,446][25689] Avg episode reward: [(0, '0.492')] [2022-07-11 06:14:15,787][26022] Updated weights on worker 0-0, policy_version 1069580 (0.00084) [2022-07-11 06:14:17,736][26022] Updated weights on worker 0-0, policy_version 1069590 (0.00088) [2022-07-11 06:14:19,503][26022] Updated weights on worker 0-0, policy_version 1069600 (0.00086) [2022-07-11 06:14:20,456][25689] Fps is (10 sec: 5615.9, 60 sec: 5651.4, 300 sec: 5650.2). Total num frames: 1095275520. Throughput: 0: 5874.6. Samples: 1095280868. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:20,456][25689] Avg episode reward: [(0, '0.812')] [2022-07-11 06:14:21,232][26022] Updated weights on worker 0-0, policy_version 1069610 (0.00086) [2022-07-11 06:14:23,059][26022] Updated weights on worker 0-0, policy_version 1069620 (0.00058) [2022-07-11 06:14:24,947][26022] Updated weights on worker 0-0, policy_version 1069630 (0.00085) [2022-07-11 06:14:25,539][25689] Fps is (10 sec: 5579.3, 60 sec: 5653.9, 300 sec: 5642.3). Total num frames: 1095303168. Throughput: 0: 5109.2. Samples: 1095297692. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:25,540][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 06:14:26,687][26022] Updated weights on worker 0-0, policy_version 1069640 (0.00087) [2022-07-11 06:14:28,661][26022] Updated weights on worker 0-0, policy_version 1069650 (0.00086) [2022-07-11 06:14:30,284][26022] Updated weights on worker 0-0, policy_version 1069660 (0.00089) [2022-07-11 06:14:30,544][25689] Fps is (10 sec: 5684.4, 60 sec: 5657.7, 300 sec: 5652.6). Total num frames: 1095332864. Throughput: 0: 5920.6. Samples: 1095331294. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:30,544][25689] Avg episode reward: [(0, '-0.771')] [2022-07-11 06:14:32,303][26022] Updated weights on worker 0-0, policy_version 1069670 (0.00084) [2022-07-11 06:14:34,094][26022] Updated weights on worker 0-0, policy_version 1069680 (0.00090) [2022-07-11 06:14:35,569][25689] Fps is (10 sec: 5717.5, 60 sec: 5661.9, 300 sec: 5645.4). Total num frames: 1095360512. Throughput: 0: 5895.5. Samples: 1095365220. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:35,570][25689] Avg episode reward: [(0, '-1.353')] [2022-07-11 06:14:35,838][26022] Updated weights on worker 0-0, policy_version 1069690 (0.00084) [2022-07-11 06:14:37,713][26022] Updated weights on worker 0-0, policy_version 1069700 (0.00087) [2022-07-11 06:14:39,575][26022] Updated weights on worker 0-0, policy_version 1069710 (0.00117) [2022-07-11 06:14:40,576][25689] Fps is (10 sec: 5614.1, 60 sec: 5645.4, 300 sec: 5643.2). Total num frames: 1095389184. Throughput: 0: 5033.9. Samples: 1095382150. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:40,576][25689] Avg episode reward: [(0, '-1.275')] [2022-07-11 06:14:41,262][26022] Updated weights on worker 0-0, policy_version 1069720 (0.00094) [2022-07-11 06:14:43,210][26022] Updated weights on worker 0-0, policy_version 1069730 (0.00095) [2022-07-11 06:14:44,881][26022] Updated weights on worker 0-0, policy_version 1069740 (0.00079) [2022-07-11 06:14:45,635][25689] Fps is (10 sec: 5595.2, 60 sec: 5613.1, 300 sec: 5646.4). Total num frames: 1095416832. Throughput: 0: 5894.5. Samples: 1095416144. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:45,635][25689] Avg episode reward: [(0, '-1.487')] [2022-07-11 06:14:46,731][26022] Updated weights on worker 0-0, policy_version 1069750 (0.00085) [2022-07-11 06:14:48,607][26022] Updated weights on worker 0-0, policy_version 1069760 (0.00092) [2022-07-11 06:14:50,183][26022] Updated weights on worker 0-0, policy_version 1069770 (0.00079) [2022-07-11 06:14:50,663][25689] Fps is (10 sec: 5583.2, 60 sec: 5644.5, 300 sec: 5646.4). Total num frames: 1095445504. Throughput: 0: 5927.5. Samples: 1095450552. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:50,664][25689] Avg episode reward: [(0, '-2.184')] [2022-07-11 06:14:52,230][26022] Updated weights on worker 0-0, policy_version 1069780 (0.00098) [2022-07-11 06:14:53,864][26022] Updated weights on worker 0-0, policy_version 1069790 (0.00086) [2022-07-11 06:14:55,674][25689] Fps is (10 sec: 5712.2, 60 sec: 5627.8, 300 sec: 5643.3). Total num frames: 1095474176. Throughput: 0: 5094.9. Samples: 1095467650. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:14:55,674][25689] Avg episode reward: [(0, '-1.254')] [2022-07-11 06:14:55,828][26022] Updated weights on worker 0-0, policy_version 1069800 (0.00082) [2022-07-11 06:14:57,467][26022] Updated weights on worker 0-0, policy_version 1069810 (0.00084) [2022-07-11 06:14:59,351][26022] Updated weights on worker 0-0, policy_version 1069820 (0.00099) [2022-07-11 06:15:00,714][25689] Fps is (10 sec: 5705.7, 60 sec: 5627.6, 300 sec: 5655.3). Total num frames: 1095502848. Throughput: 0: 5917.4. Samples: 1095501314. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:00,714][25689] Avg episode reward: [(0, '-0.938')] [2022-07-11 06:15:01,288][26022] Updated weights on worker 0-0, policy_version 1069830 (0.00093) [2022-07-11 06:15:03,579][26022] Updated weights on worker 0-0, policy_version 1069840 (0.00086) [2022-07-11 06:15:05,132][26022] Updated weights on worker 0-0, policy_version 1069850 (0.00084) [2022-07-11 06:15:05,835][25689] Fps is (10 sec: 5441.9, 60 sec: 5640.7, 300 sec: 5643.5). Total num frames: 1095529472. Throughput: 0: 5782.0. Samples: 1095532940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:05,835][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 06:15:07,081][26022] Updated weights on worker 0-0, policy_version 1069860 (0.00850) [2022-07-11 06:15:08,644][26022] Updated weights on worker 0-0, policy_version 1069870 (0.00089) [2022-07-11 06:15:10,848][26022] Updated weights on worker 0-0, policy_version 1069880 (0.00080) [2022-07-11 06:15:10,851][25689] Fps is (10 sec: 5353.8, 60 sec: 5592.5, 300 sec: 5640.3). Total num frames: 1095557120. Throughput: 0: 5783.2. Samples: 1095567300. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:10,851][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 06:15:12,368][26022] Updated weights on worker 0-0, policy_version 1069890 (0.00083) [2022-07-11 06:15:14,260][26022] Updated weights on worker 0-0, policy_version 1069900 (0.00087) [2022-07-11 06:15:15,881][25689] Fps is (10 sec: 5707.8, 60 sec: 5625.2, 300 sec: 5643.5). Total num frames: 1095586816. Throughput: 0: 5773.2. Samples: 1095584314. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:15,882][25689] Avg episode reward: [(0, '0.839')] [2022-07-11 06:15:16,219][26022] Updated weights on worker 0-0, policy_version 1069910 (0.00094) [2022-07-11 06:15:17,756][26022] Updated weights on worker 0-0, policy_version 1069920 (0.00089) [2022-07-11 06:15:19,779][26022] Updated weights on worker 0-0, policy_version 1069930 (0.00089) [2022-07-11 06:15:20,917][25689] Fps is (10 sec: 5798.4, 60 sec: 5622.9, 300 sec: 5647.8). Total num frames: 1095615488. Throughput: 0: 5813.4. Samples: 1095618764. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:20,917][25689] Avg episode reward: [(0, '1.560')] [2022-07-11 06:15:21,319][26022] Updated weights on worker 0-0, policy_version 1069940 (0.00092) [2022-07-11 06:15:23,327][26022] Updated weights on worker 0-0, policy_version 1069950 (0.00095) [2022-07-11 06:15:25,076][26022] Updated weights on worker 0-0, policy_version 1069960 (0.00089) [2022-07-11 06:15:26,039][25689] Fps is (10 sec: 5544.6, 60 sec: 5619.4, 300 sec: 5642.5). Total num frames: 1095643136. Throughput: 0: 5919.7. Samples: 1095652542. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:26,039][25689] Avg episode reward: [(0, '2.071')] [2022-07-11 06:15:26,835][26022] Updated weights on worker 0-0, policy_version 1069970 (0.00088) [2022-07-11 06:15:28,863][26022] Updated weights on worker 0-0, policy_version 1069980 (0.00081) [2022-07-11 06:15:30,401][26022] Updated weights on worker 0-0, policy_version 1069990 (0.00085) [2022-07-11 06:15:31,082][25689] Fps is (10 sec: 5641.2, 60 sec: 5615.8, 300 sec: 5643.4). Total num frames: 1095672832. Throughput: 0: 5049.4. Samples: 1095669456. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:31,083][25689] Avg episode reward: [(0, '2.055')] [2022-07-11 06:15:32,485][26022] Updated weights on worker 0-0, policy_version 1070000 (0.00084) [2022-07-11 06:15:33,892][26022] Updated weights on worker 0-0, policy_version 1070010 (0.00089) [2022-07-11 06:15:35,754][26022] Updated weights on worker 0-0, policy_version 1070020 (0.00088) [2022-07-11 06:15:36,114][25689] Fps is (10 sec: 5793.1, 60 sec: 5632.0, 300 sec: 5646.5). Total num frames: 1095701504. Throughput: 0: 5919.7. Samples: 1095704086. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:36,115][25689] Avg episode reward: [(0, '2.018')] [2022-07-11 06:15:37,582][26022] Updated weights on worker 0-0, policy_version 1070030 (0.00091) [2022-07-11 06:15:39,428][26022] Updated weights on worker 0-0, policy_version 1070040 (0.00090) [2022-07-11 06:15:40,829][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:15:40,838][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001070047_1095728128.pth [2022-07-11 06:15:40,838][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001068061_1093694464.pth [2022-07-11 06:15:41,190][25689] Fps is (10 sec: 5673.3, 60 sec: 5625.6, 300 sec: 5650.0). Total num frames: 1095730176. Throughput: 0: 5884.7. Samples: 1095738064. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:41,190][25689] Avg episode reward: [(0, '2.010')] [2022-07-11 06:15:41,226][26022] Updated weights on worker 0-0, policy_version 1070050 (0.00083) [2022-07-11 06:15:43,052][26022] Updated weights on worker 0-0, policy_version 1070060 (0.00083) [2022-07-11 06:15:44,803][26022] Updated weights on worker 0-0, policy_version 1070070 (0.00094) [2022-07-11 06:15:46,274][25689] Fps is (10 sec: 5644.2, 60 sec: 5640.2, 300 sec: 5642.2). Total num frames: 1095758848. Throughput: 0: 5064.4. Samples: 1095755020. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:46,274][25689] Avg episode reward: [(0, '2.397')] [2022-07-11 06:15:46,670][26022] Updated weights on worker 0-0, policy_version 1070080 (0.00083) [2022-07-11 06:15:48,516][26022] Updated weights on worker 0-0, policy_version 1070090 (0.00088) [2022-07-11 06:15:50,350][26022] Updated weights on worker 0-0, policy_version 1070100 (0.00084) [2022-07-11 06:15:51,293][25689] Fps is (10 sec: 5675.7, 60 sec: 5641.0, 300 sec: 5649.4). Total num frames: 1095787520. Throughput: 0: 5911.0. Samples: 1095788924. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:51,294][25689] Avg episode reward: [(0, '1.564')] [2022-07-11 06:15:52,180][26022] Updated weights on worker 0-0, policy_version 1070110 (0.00081) [2022-07-11 06:15:54,019][26022] Updated weights on worker 0-0, policy_version 1070120 (0.00080) [2022-07-11 06:15:55,869][26022] Updated weights on worker 0-0, policy_version 1070130 (0.00079) [2022-07-11 06:15:56,301][25689] Fps is (10 sec: 5514.5, 60 sec: 5607.5, 300 sec: 5642.4). Total num frames: 1095814144. Throughput: 0: 5869.0. Samples: 1095822564. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:15:56,302][25689] Avg episode reward: [(0, '1.121')] [2022-07-11 06:15:57,644][26022] Updated weights on worker 0-0, policy_version 1070140 (0.00761) [2022-07-11 06:15:59,502][26022] Updated weights on worker 0-0, policy_version 1070150 (0.00086) [2022-07-11 06:16:01,346][25689] Fps is (10 sec: 5500.7, 60 sec: 5607.1, 300 sec: 5646.1). Total num frames: 1095842816. Throughput: 0: 5036.2. Samples: 1095839574. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:16:01,346][25689] Avg episode reward: [(0, '1.312')] [2022-07-11 06:16:01,405][26022] Updated weights on worker 0-0, policy_version 1070160 (0.00086) [2022-07-11 06:16:03,458][26022] Updated weights on worker 0-0, policy_version 1070170 (0.00086) [2022-07-11 06:16:05,272][26022] Updated weights on worker 0-0, policy_version 1070180 (0.00088) [2022-07-11 06:16:06,459][25689] Fps is (10 sec: 5544.7, 60 sec: 5624.7, 300 sec: 5644.8). Total num frames: 1095870464. Throughput: 0: 5791.9. Samples: 1095871928. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:16:06,459][25689] Avg episode reward: [(0, '1.278')] [2022-07-11 06:16:06,899][26022] Updated weights on worker 0-0, policy_version 1070190 (0.00087) [2022-07-11 06:16:08,694][26022] Updated weights on worker 0-0, policy_version 1070200 (0.00085) [2022-07-11 06:16:10,639][26022] Updated weights on worker 0-0, policy_version 1070210 (0.00094) [2022-07-11 06:16:11,496][25689] Fps is (10 sec: 5750.7, 60 sec: 5673.4, 300 sec: 5648.1). Total num frames: 1095901184. Throughput: 0: 5820.4. Samples: 1095906508. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:16:11,496][25689] Avg episode reward: [(0, '1.357')] [2022-07-11 06:16:12,408][26022] Updated weights on worker 0-0, policy_version 1070220 (0.00080) [2022-07-11 06:16:14,151][26022] Updated weights on worker 0-0, policy_version 1070230 (0.00076) [2022-07-11 06:16:15,963][26022] Updated weights on worker 0-0, policy_version 1070240 (0.00084) [2022-07-11 06:16:16,504][25689] Fps is (10 sec: 5810.5, 60 sec: 5641.7, 300 sec: 5645.0). Total num frames: 1095928832. Throughput: 0: 5012.7. Samples: 1095923834. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:16:16,505][25689] Avg episode reward: [(0, '0.702')] [2022-07-11 06:16:17,547][26022] Updated weights on worker 0-0, policy_version 1070250 (0.00082) [2022-07-11 06:16:19,518][26022] Updated weights on worker 0-0, policy_version 1070260 (0.00088) [2022-07-11 06:16:20,982][26022] Updated weights on worker 0-0, policy_version 1070270 (0.00095) [2022-07-11 06:16:21,522][25689] Fps is (10 sec: 5719.0, 60 sec: 5660.2, 300 sec: 5650.4). Total num frames: 1095958528. Throughput: 0: 5898.3. Samples: 1095958582. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:16:21,523][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 06:16:23,171][26022] Updated weights on worker 0-0, policy_version 1070280 (0.00091) [2022-07-11 06:16:24,692][26022] Updated weights on worker 0-0, policy_version 1070290 (0.00089) [2022-07-11 06:16:26,564][25689] Fps is (10 sec: 5801.8, 60 sec: 5684.6, 300 sec: 5650.3). Total num frames: 1095987200. Throughput: 0: 6025.6. Samples: 1095993076. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 06:16:26,565][25689] Avg episode reward: [(0, '0.642')] [2022-07-11 06:16:26,567][26022] Updated weights on worker 0-0, policy_version 1070300 (0.00094) [2022-07-11 06:16:28,298][26022] Updated weights on worker 0-0, policy_version 1070310 (0.00087) [2022-07-11 06:16:30,187][26022] Updated weights on worker 0-0, policy_version 1070320 (0.00085) [2022-07-11 06:16:31,633][25689] Fps is (10 sec: 5671.6, 60 sec: 5665.3, 300 sec: 5654.2). Total num frames: 1096015872. Throughput: 0: 5144.1. Samples: 1096010098. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:16:31,634][25689] Avg episode reward: [(0, '0.805')] [2022-07-11 06:16:31,990][26022] Updated weights on worker 0-0, policy_version 1070330 (0.00114) [2022-07-11 06:16:33,759][26022] Updated weights on worker 0-0, policy_version 1070340 (0.00092) [2022-07-11 06:16:35,530][26022] Updated weights on worker 0-0, policy_version 1070350 (0.00083) [2022-07-11 06:16:36,643][25689] Fps is (10 sec: 5689.8, 60 sec: 5667.4, 300 sec: 5654.4). Total num frames: 1096044544. Throughput: 0: 5980.7. Samples: 1096044276. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:16:36,643][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 06:16:37,368][26022] Updated weights on worker 0-0, policy_version 1070360 (0.00089) [2022-07-11 06:16:39,098][26022] Updated weights on worker 0-0, policy_version 1070370 (0.00081) [2022-07-11 06:16:40,880][26022] Updated weights on worker 0-0, policy_version 1070380 (0.00083) [2022-07-11 06:16:41,662][25689] Fps is (10 sec: 5615.6, 60 sec: 5655.7, 300 sec: 5645.7). Total num frames: 1096072192. Throughput: 0: 5956.3. Samples: 1096078542. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:16:41,663][25689] Avg episode reward: [(0, '1.307')] [2022-07-11 06:16:42,758][26022] Updated weights on worker 0-0, policy_version 1070390 (0.00085) [2022-07-11 06:16:44,584][26022] Updated weights on worker 0-0, policy_version 1070400 (0.00088) [2022-07-11 06:16:46,397][26022] Updated weights on worker 0-0, policy_version 1070410 (0.00084) [2022-07-11 06:16:46,738][25689] Fps is (10 sec: 5680.0, 60 sec: 5673.4, 300 sec: 5648.9). Total num frames: 1096101888. Throughput: 0: 5070.9. Samples: 1096095376. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:16:46,739][25689] Avg episode reward: [(0, '0.681')] [2022-07-11 06:16:48,110][26022] Updated weights on worker 0-0, policy_version 1070420 (0.00085) [2022-07-11 06:16:50,010][26022] Updated weights on worker 0-0, policy_version 1070430 (0.00084) [2022-07-11 06:16:51,749][25689] Fps is (10 sec: 5786.6, 60 sec: 5674.2, 300 sec: 5645.7). Total num frames: 1096130560. Throughput: 0: 5941.2. Samples: 1096129612. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:16:51,751][25689] Avg episode reward: [(0, '0.886')] [2022-07-11 06:16:51,755][26022] Updated weights on worker 0-0, policy_version 1070440 (0.00085) [2022-07-11 06:16:53,646][26022] Updated weights on worker 0-0, policy_version 1070450 (0.00094) [2022-07-11 06:16:55,387][26022] Updated weights on worker 0-0, policy_version 1070460 (0.00090) [2022-07-11 06:16:56,842][25689] Fps is (10 sec: 5472.7, 60 sec: 5666.2, 300 sec: 5640.5). Total num frames: 1096157184. Throughput: 0: 5905.8. Samples: 1096163572. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:16:56,843][25689] Avg episode reward: [(0, '0.796')] [2022-07-11 06:16:57,236][26022] Updated weights on worker 0-0, policy_version 1070470 (0.00079) [2022-07-11 06:16:59,151][26022] Updated weights on worker 0-0, policy_version 1070480 (0.00081) [2022-07-11 06:17:00,797][26022] Updated weights on worker 0-0, policy_version 1070490 (0.00080) [2022-07-11 06:17:01,883][25689] Fps is (10 sec: 5557.6, 60 sec: 5683.5, 300 sec: 5647.6). Total num frames: 1096186880. Throughput: 0: 5039.0. Samples: 1096180434. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:01,885][25689] Avg episode reward: [(0, '0.663')] [2022-07-11 06:17:03,215][26022] Updated weights on worker 0-0, policy_version 1070500 (0.00084) [2022-07-11 06:17:04,754][26022] Updated weights on worker 0-0, policy_version 1070510 (0.00091) [2022-07-11 06:17:06,817][26022] Updated weights on worker 0-0, policy_version 1070520 (0.00083) [2022-07-11 06:17:06,931][25689] Fps is (10 sec: 5582.5, 60 sec: 5672.7, 300 sec: 5643.4). Total num frames: 1096213504. Throughput: 0: 5792.3. Samples: 1096212338. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:06,932][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 06:17:08,465][26022] Updated weights on worker 0-0, policy_version 1070530 (0.00089) [2022-07-11 06:17:10,361][26022] Updated weights on worker 0-0, policy_version 1070540 (0.00082) [2022-07-11 06:17:11,951][25689] Fps is (10 sec: 5492.3, 60 sec: 5640.4, 300 sec: 5643.6). Total num frames: 1096242176. Throughput: 0: 5777.3. Samples: 1096246324. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:11,952][25689] Avg episode reward: [(0, '0.904')] [2022-07-11 06:17:12,053][26022] Updated weights on worker 0-0, policy_version 1070550 (0.00083) [2022-07-11 06:17:13,882][26022] Updated weights on worker 0-0, policy_version 1070560 (0.00078) [2022-07-11 06:17:15,613][26022] Updated weights on worker 0-0, policy_version 1070570 (0.00084) [2022-07-11 06:17:16,990][25689] Fps is (10 sec: 5599.1, 60 sec: 5637.6, 300 sec: 5636.4). Total num frames: 1096269824. Throughput: 0: 4966.0. Samples: 1096263624. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:16,990][25689] Avg episode reward: [(0, '0.692')] [2022-07-11 06:17:17,506][26022] Updated weights on worker 0-0, policy_version 1070580 (0.00084) [2022-07-11 06:17:19,184][26022] Updated weights on worker 0-0, policy_version 1070590 (0.00083) [2022-07-11 06:17:21,073][26022] Updated weights on worker 0-0, policy_version 1070600 (0.00080) [2022-07-11 06:17:22,015][25689] Fps is (10 sec: 5698.1, 60 sec: 5637.0, 300 sec: 5642.4). Total num frames: 1096299520. Throughput: 0: 5842.2. Samples: 1096298048. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:22,015][25689] Avg episode reward: [(0, '0.724')] [2022-07-11 06:17:22,925][26022] Updated weights on worker 0-0, policy_version 1070610 (0.00091) [2022-07-11 06:17:24,782][26022] Updated weights on worker 0-0, policy_version 1070620 (0.00082) [2022-07-11 06:17:26,510][26022] Updated weights on worker 0-0, policy_version 1070630 (0.00105) [2022-07-11 06:17:27,056][25689] Fps is (10 sec: 5798.4, 60 sec: 5637.0, 300 sec: 5645.1). Total num frames: 1096328192. Throughput: 0: 5949.7. Samples: 1096332076. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:27,057][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 06:17:28,397][26022] Updated weights on worker 0-0, policy_version 1070640 (0.00082) [2022-07-11 06:17:30,058][26022] Updated weights on worker 0-0, policy_version 1070650 (0.00084) [2022-07-11 06:17:31,828][26022] Updated weights on worker 0-0, policy_version 1070660 (0.00090) [2022-07-11 06:17:32,080][25689] Fps is (10 sec: 5697.0, 60 sec: 5641.2, 300 sec: 5645.2). Total num frames: 1096356864. Throughput: 0: 5969.7. Samples: 1096366490. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:32,081][25689] Avg episode reward: [(0, '-0.194')] [2022-07-11 06:17:33,691][26022] Updated weights on worker 0-0, policy_version 1070670 (0.00083) [2022-07-11 06:17:35,324][26022] Updated weights on worker 0-0, policy_version 1070680 (0.00086) [2022-07-11 06:17:37,095][25689] Fps is (10 sec: 5610.2, 60 sec: 5623.7, 300 sec: 5638.9). Total num frames: 1096384512. Throughput: 0: 5981.8. Samples: 1096383890. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:37,096][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 06:17:37,359][26022] Updated weights on worker 0-0, policy_version 1070690 (0.00091) [2022-07-11 06:17:38,812][26022] Updated weights on worker 0-0, policy_version 1070700 (0.00079) [2022-07-11 06:17:40,771][26022] Updated weights on worker 0-0, policy_version 1070710 (0.00080) [2022-07-11 06:17:40,884][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:17:40,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001070711_1096408064.pth [2022-07-11 06:17:40,899][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001068721_1094370304.pth [2022-07-11 06:17:42,101][25689] Fps is (10 sec: 5722.7, 60 sec: 5658.9, 300 sec: 5646.7). Total num frames: 1096414208. Throughput: 0: 5984.6. Samples: 1096418256. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:42,101][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 06:17:42,735][26022] Updated weights on worker 0-0, policy_version 1070720 (0.00086) [2022-07-11 06:17:44,351][26022] Updated weights on worker 0-0, policy_version 1070730 (0.00080) [2022-07-11 06:17:46,288][26022] Updated weights on worker 0-0, policy_version 1070740 (0.00653) [2022-07-11 06:17:47,227][25689] Fps is (10 sec: 5962.9, 60 sec: 5671.2, 300 sec: 5647.9). Total num frames: 1096444928. Throughput: 0: 5983.1. Samples: 1096452760. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:47,228][25689] Avg episode reward: [(0, '0.578')] [2022-07-11 06:17:47,826][26022] Updated weights on worker 0-0, policy_version 1070750 (0.00088) [2022-07-11 06:17:49,745][26022] Updated weights on worker 0-0, policy_version 1070760 (0.00086) [2022-07-11 06:17:51,725][26022] Updated weights on worker 0-0, policy_version 1070770 (0.01080) [2022-07-11 06:17:52,263][25689] Fps is (10 sec: 5642.7, 60 sec: 5634.9, 300 sec: 5644.5). Total num frames: 1096471552. Throughput: 0: 5129.3. Samples: 1096470014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:52,264][25689] Avg episode reward: [(0, '0.650')] [2022-07-11 06:17:53,168][26022] Updated weights on worker 0-0, policy_version 1070780 (0.00091) [2022-07-11 06:17:55,251][26022] Updated weights on worker 0-0, policy_version 1070790 (0.00091) [2022-07-11 06:17:56,949][26022] Updated weights on worker 0-0, policy_version 1070800 (0.00088) [2022-07-11 06:17:57,285][25689] Fps is (10 sec: 5497.7, 60 sec: 5675.5, 300 sec: 5648.5). Total num frames: 1096500224. Throughput: 0: 5956.4. Samples: 1096504150. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:17:57,286][25689] Avg episode reward: [(0, '0.910')] [2022-07-11 06:17:58,735][26022] Updated weights on worker 0-0, policy_version 1070810 (0.00098) [2022-07-11 06:18:00,855][26022] Updated weights on worker 0-0, policy_version 1070820 (0.00097) [2022-07-11 06:18:02,299][25689] Fps is (10 sec: 5714.2, 60 sec: 5661.0, 300 sec: 5653.4). Total num frames: 1096528896. Throughput: 0: 5892.9. Samples: 1096537280. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:02,299][25689] Avg episode reward: [(0, '1.906')] [2022-07-11 06:18:02,640][26022] Updated weights on worker 0-0, policy_version 1070830 (0.00087) [2022-07-11 06:18:04,516][26022] Updated weights on worker 0-0, policy_version 1070840 (0.00089) [2022-07-11 06:18:06,230][26022] Updated weights on worker 0-0, policy_version 1070850 (0.00085) [2022-07-11 06:18:07,405][25689] Fps is (10 sec: 5565.1, 60 sec: 5672.5, 300 sec: 5652.1). Total num frames: 1096556544. Throughput: 0: 4986.3. Samples: 1096553376. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:07,406][25689] Avg episode reward: [(0, '2.000')] [2022-07-11 06:18:08,198][26022] Updated weights on worker 0-0, policy_version 1070860 (0.00085) [2022-07-11 06:18:09,986][26022] Updated weights on worker 0-0, policy_version 1070870 (0.00092) [2022-07-11 06:18:11,639][26022] Updated weights on worker 0-0, policy_version 1070880 (0.00088) [2022-07-11 06:18:12,435][25689] Fps is (10 sec: 5455.2, 60 sec: 5654.6, 300 sec: 5648.3). Total num frames: 1096584192. Throughput: 0: 5817.9. Samples: 1096587370. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:12,437][25689] Avg episode reward: [(0, '1.536')] [2022-07-11 06:18:13,516][26022] Updated weights on worker 0-0, policy_version 1070890 (0.00077) [2022-07-11 06:18:15,319][26022] Updated weights on worker 0-0, policy_version 1070900 (0.00084) [2022-07-11 06:18:17,127][26022] Updated weights on worker 0-0, policy_version 1070910 (0.00082) [2022-07-11 06:18:17,504][25689] Fps is (10 sec: 5678.5, 60 sec: 5685.7, 300 sec: 5647.1). Total num frames: 1096613888. Throughput: 0: 5818.6. Samples: 1096621794. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:17,504][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 06:18:18,990][26022] Updated weights on worker 0-0, policy_version 1070920 (0.00092) [2022-07-11 06:18:20,796][26022] Updated weights on worker 0-0, policy_version 1070930 (0.00087) [2022-07-11 06:18:22,552][25689] Fps is (10 sec: 5668.1, 60 sec: 5649.7, 300 sec: 5648.3). Total num frames: 1096641536. Throughput: 0: 5018.9. Samples: 1096638930. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:22,553][25689] Avg episode reward: [(0, '0.242')] [2022-07-11 06:18:22,619][26022] Updated weights on worker 0-0, policy_version 1070940 (0.00089) [2022-07-11 06:18:24,215][26022] Updated weights on worker 0-0, policy_version 1070950 (0.00102) [2022-07-11 06:18:26,279][26022] Updated weights on worker 0-0, policy_version 1070960 (0.00090) [2022-07-11 06:18:27,614][25689] Fps is (10 sec: 5772.9, 60 sec: 5681.5, 300 sec: 5651.4). Total num frames: 1096672256. Throughput: 0: 5930.6. Samples: 1096673228. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:27,615][25689] Avg episode reward: [(0, '-0.685')] [2022-07-11 06:18:27,660][26022] Updated weights on worker 0-0, policy_version 1070970 (0.00087) [2022-07-11 06:18:29,900][26022] Updated weights on worker 0-0, policy_version 1070980 (0.00081) [2022-07-11 06:18:31,361][26022] Updated weights on worker 0-0, policy_version 1070990 (0.00091) [2022-07-11 06:18:32,629][25689] Fps is (10 sec: 5690.7, 60 sec: 5648.6, 300 sec: 5649.0). Total num frames: 1096698880. Throughput: 0: 5918.9. Samples: 1096706894. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:32,629][25689] Avg episode reward: [(0, '-1.336')] [2022-07-11 06:18:33,385][26022] Updated weights on worker 0-0, policy_version 1071000 (0.00083) [2022-07-11 06:18:35,070][26022] Updated weights on worker 0-0, policy_version 1071010 (0.00089) [2022-07-11 06:18:36,883][26022] Updated weights on worker 0-0, policy_version 1071020 (0.00080) [2022-07-11 06:18:37,707][25689] Fps is (10 sec: 5580.1, 60 sec: 5676.5, 300 sec: 5647.7). Total num frames: 1096728576. Throughput: 0: 5070.0. Samples: 1096724226. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:37,708][25689] Avg episode reward: [(0, '-1.573')] [2022-07-11 06:18:38,747][26022] Updated weights on worker 0-0, policy_version 1071030 (0.00046) [2022-07-11 06:18:40,548][26022] Updated weights on worker 0-0, policy_version 1071040 (0.00083) [2022-07-11 06:18:42,392][26022] Updated weights on worker 0-0, policy_version 1071050 (0.00088) [2022-07-11 06:18:42,770][25689] Fps is (10 sec: 5856.3, 60 sec: 5671.1, 300 sec: 5648.0). Total num frames: 1096758272. Throughput: 0: 5902.8. Samples: 1096758274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:42,771][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 06:18:44,283][26022] Updated weights on worker 0-0, policy_version 1071060 (0.00089) [2022-07-11 06:18:45,903][26022] Updated weights on worker 0-0, policy_version 1071070 (0.00087) [2022-07-11 06:18:47,846][25689] Fps is (10 sec: 5655.9, 60 sec: 5625.2, 300 sec: 5650.0). Total num frames: 1096785920. Throughput: 0: 5909.3. Samples: 1096792784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:47,847][25689] Avg episode reward: [(0, '0.034')] [2022-07-11 06:18:47,850][26022] Updated weights on worker 0-0, policy_version 1071080 (0.00089) [2022-07-11 06:18:49,459][26022] Updated weights on worker 0-0, policy_version 1071090 (0.00051) [2022-07-11 06:18:51,319][26022] Updated weights on worker 0-0, policy_version 1071100 (0.00079) [2022-07-11 06:18:52,869][25689] Fps is (10 sec: 5678.6, 60 sec: 5677.1, 300 sec: 5649.8). Total num frames: 1096815616. Throughput: 0: 5096.6. Samples: 1096810048. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:52,869][25689] Avg episode reward: [(0, '-0.781')] [2022-07-11 06:18:53,133][26022] Updated weights on worker 0-0, policy_version 1071110 (0.00090) [2022-07-11 06:18:54,916][26022] Updated weights on worker 0-0, policy_version 1071120 (0.00087) [2022-07-11 06:18:56,659][26022] Updated weights on worker 0-0, policy_version 1071130 (0.00094) [2022-07-11 06:18:57,924][25689] Fps is (10 sec: 5690.2, 60 sec: 5657.1, 300 sec: 5646.1). Total num frames: 1096843264. Throughput: 0: 5928.0. Samples: 1096844070. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:18:57,924][25689] Avg episode reward: [(0, '-1.334')] [2022-07-11 06:18:58,307][26022] Updated weights on worker 0-0, policy_version 1071140 (0.00090) [2022-07-11 06:19:00,259][26022] Updated weights on worker 0-0, policy_version 1071150 (0.00081) [2022-07-11 06:19:02,512][26022] Updated weights on worker 0-0, policy_version 1071160 (0.00081) [2022-07-11 06:19:02,939][25689] Fps is (10 sec: 5389.2, 60 sec: 5623.2, 300 sec: 5650.7). Total num frames: 1096869888. Throughput: 0: 5837.6. Samples: 1096876012. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:02,940][25689] Avg episode reward: [(0, '-1.208')] [2022-07-11 06:19:04,299][26022] Updated weights on worker 0-0, policy_version 1071170 (0.00080) [2022-07-11 06:19:06,018][26022] Updated weights on worker 0-0, policy_version 1071180 (0.00085) [2022-07-11 06:19:07,767][26022] Updated weights on worker 0-0, policy_version 1071190 (0.00085) [2022-07-11 06:19:07,983][25689] Fps is (10 sec: 5598.8, 60 sec: 5662.8, 300 sec: 5647.3). Total num frames: 1096899584. Throughput: 0: 4973.5. Samples: 1096892936. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:07,991][25689] Avg episode reward: [(0, '-1.478')] [2022-07-11 06:19:09,855][26022] Updated weights on worker 0-0, policy_version 1071200 (0.00092) [2022-07-11 06:19:11,480][26022] Updated weights on worker 0-0, policy_version 1071210 (0.00086) [2022-07-11 06:19:13,035][25689] Fps is (10 sec: 5578.7, 60 sec: 5643.9, 300 sec: 5643.2). Total num frames: 1096926208. Throughput: 0: 5804.4. Samples: 1096927100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:13,035][25689] Avg episode reward: [(0, '-1.612')] [2022-07-11 06:19:13,466][26022] Updated weights on worker 0-0, policy_version 1071220 (0.00085) [2022-07-11 06:19:15,039][26022] Updated weights on worker 0-0, policy_version 1071230 (0.00085) [2022-07-11 06:19:17,028][26022] Updated weights on worker 0-0, policy_version 1071240 (0.00103) [2022-07-11 06:19:18,043][25689] Fps is (10 sec: 5598.7, 60 sec: 5649.5, 300 sec: 5646.7). Total num frames: 1096955904. Throughput: 0: 5824.4. Samples: 1096961250. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:18,045][25689] Avg episode reward: [(0, '-1.120')] [2022-07-11 06:19:18,797][26022] Updated weights on worker 0-0, policy_version 1071250 (0.00097) [2022-07-11 06:19:20,514][26022] Updated weights on worker 0-0, policy_version 1071260 (0.00089) [2022-07-11 06:19:22,316][26022] Updated weights on worker 0-0, policy_version 1071270 (0.00084) [2022-07-11 06:19:23,049][25689] Fps is (10 sec: 5930.8, 60 sec: 5687.3, 300 sec: 5655.1). Total num frames: 1096985600. Throughput: 0: 5094.5. Samples: 1096978460. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:23,050][25689] Avg episode reward: [(0, '-0.507')] [2022-07-11 06:19:24,301][26022] Updated weights on worker 0-0, policy_version 1071280 (0.00088) [2022-07-11 06:19:25,871][26022] Updated weights on worker 0-0, policy_version 1071290 (0.00081) [2022-07-11 06:19:27,732][26022] Updated weights on worker 0-0, policy_version 1071300 (0.00082) [2022-07-11 06:19:28,091][25689] Fps is (10 sec: 5605.0, 60 sec: 5621.5, 300 sec: 5644.0). Total num frames: 1097012224. Throughput: 0: 5957.3. Samples: 1097012724. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:28,092][25689] Avg episode reward: [(0, '0.041')] [2022-07-11 06:19:29,361][26022] Updated weights on worker 0-0, policy_version 1071310 (0.00092) [2022-07-11 06:19:31,355][26022] Updated weights on worker 0-0, policy_version 1071320 (0.00086) [2022-07-11 06:19:33,096][25689] Fps is (10 sec: 5503.6, 60 sec: 5656.2, 300 sec: 5647.8). Total num frames: 1097040896. Throughput: 0: 5949.5. Samples: 1097046454. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:33,096][25689] Avg episode reward: [(0, '0.999')] [2022-07-11 06:19:33,166][26022] Updated weights on worker 0-0, policy_version 1071330 (0.00084) [2022-07-11 06:19:34,896][26022] Updated weights on worker 0-0, policy_version 1071340 (0.00083) [2022-07-11 06:19:36,700][26022] Updated weights on worker 0-0, policy_version 1071350 (0.00086) [2022-07-11 06:19:38,098][25689] Fps is (10 sec: 5832.4, 60 sec: 5663.4, 300 sec: 5651.4). Total num frames: 1097070592. Throughput: 0: 5110.4. Samples: 1097063744. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:38,100][25689] Avg episode reward: [(0, '1.062')] [2022-07-11 06:19:38,500][26022] Updated weights on worker 0-0, policy_version 1071360 (0.00091) [2022-07-11 06:19:40,241][26022] Updated weights on worker 0-0, policy_version 1071370 (0.00086) [2022-07-11 06:19:41,126][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:19:41,138][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001071375_1097088000.pth [2022-07-11 06:19:41,138][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001069385_1095050240.pth [2022-07-11 06:19:42,055][26022] Updated weights on worker 0-0, policy_version 1071380 (0.00081) [2022-07-11 06:19:43,112][25689] Fps is (10 sec: 5725.0, 60 sec: 5634.0, 300 sec: 5652.2). Total num frames: 1097098240. Throughput: 0: 5962.6. Samples: 1097098092. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:43,114][25689] Avg episode reward: [(0, '0.937')] [2022-07-11 06:19:43,827][26022] Updated weights on worker 0-0, policy_version 1071390 (0.00085) [2022-07-11 06:19:45,683][26022] Updated weights on worker 0-0, policy_version 1071400 (0.00095) [2022-07-11 06:19:47,548][26022] Updated weights on worker 0-0, policy_version 1071410 (0.00083) [2022-07-11 06:19:48,163][25689] Fps is (10 sec: 5595.7, 60 sec: 5653.3, 300 sec: 5651.8). Total num frames: 1097126912. Throughput: 0: 5966.3. Samples: 1097132482. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:48,164][25689] Avg episode reward: [(0, '1.447')] [2022-07-11 06:19:49,241][26022] Updated weights on worker 0-0, policy_version 1071420 (0.00091) [2022-07-11 06:19:51,093][26022] Updated weights on worker 0-0, policy_version 1071430 (0.00084) [2022-07-11 06:19:52,824][26022] Updated weights on worker 0-0, policy_version 1071440 (0.00092) [2022-07-11 06:19:53,165][25689] Fps is (10 sec: 5704.4, 60 sec: 5638.3, 300 sec: 5652.0). Total num frames: 1097155584. Throughput: 0: 5145.6. Samples: 1097149722. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:53,165][25689] Avg episode reward: [(0, '1.164')] [2022-07-11 06:19:54,699][26022] Updated weights on worker 0-0, policy_version 1071450 (0.00090) [2022-07-11 06:19:56,494][26022] Updated weights on worker 0-0, policy_version 1071460 (0.00086) [2022-07-11 06:19:58,187][25689] Fps is (10 sec: 5721.0, 60 sec: 5658.4, 300 sec: 5652.3). Total num frames: 1097184256. Throughput: 0: 5979.0. Samples: 1097183852. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:19:58,188][25689] Avg episode reward: [(0, '1.292')] [2022-07-11 06:19:58,231][26022] Updated weights on worker 0-0, policy_version 1071470 (0.00089) [2022-07-11 06:19:59,998][26022] Updated weights on worker 0-0, policy_version 1071480 (0.00089) [2022-07-11 06:20:02,433][26022] Updated weights on worker 0-0, policy_version 1071490 (0.00079) [2022-07-11 06:20:03,196][25689] Fps is (10 sec: 5410.3, 60 sec: 5642.0, 300 sec: 5651.0). Total num frames: 1097209856. Throughput: 0: 5841.1. Samples: 1097215404. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:20:03,197][25689] Avg episode reward: [(0, '1.216')] [2022-07-11 06:20:04,226][26022] Updated weights on worker 0-0, policy_version 1071500 (0.00086) [2022-07-11 06:20:06,031][26022] Updated weights on worker 0-0, policy_version 1071510 (0.00098) [2022-07-11 06:20:07,842][26022] Updated weights on worker 0-0, policy_version 1071520 (0.00081) [2022-07-11 06:20:08,334][25689] Fps is (10 sec: 5348.3, 60 sec: 5616.2, 300 sec: 5652.1). Total num frames: 1097238528. Throughput: 0: 4946.9. Samples: 1097232266. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:20:08,335][25689] Avg episode reward: [(0, '0.632')] [2022-07-11 06:20:09,481][26022] Updated weights on worker 0-0, policy_version 1071530 (0.00095) [2022-07-11 06:20:11,607][26022] Updated weights on worker 0-0, policy_version 1071540 (0.00096) [2022-07-11 06:20:13,132][26022] Updated weights on worker 0-0, policy_version 1071550 (0.00087) [2022-07-11 06:20:13,379][25689] Fps is (10 sec: 5732.2, 60 sec: 5667.8, 300 sec: 5651.8). Total num frames: 1097268224. Throughput: 0: 5776.0. Samples: 1097266476. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:20:13,379][25689] Avg episode reward: [(0, '0.355')] [2022-07-11 06:20:15,083][26022] Updated weights on worker 0-0, policy_version 1071560 (0.00090) [2022-07-11 06:20:16,688][26022] Updated weights on worker 0-0, policy_version 1071570 (0.00102) [2022-07-11 06:20:18,395][25689] Fps is (10 sec: 5699.8, 60 sec: 5633.1, 300 sec: 5648.8). Total num frames: 1097295872. Throughput: 0: 5784.3. Samples: 1097300742. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:20:18,395][25689] Avg episode reward: [(0, '0.185')] [2022-07-11 06:20:18,619][26022] Updated weights on worker 0-0, policy_version 1071580 (0.00088) [2022-07-11 06:20:20,318][26022] Updated weights on worker 0-0, policy_version 1071590 (0.00082) [2022-07-11 06:20:22,058][26022] Updated weights on worker 0-0, policy_version 1071600 (0.00081) [2022-07-11 06:20:23,445][25689] Fps is (10 sec: 5696.9, 60 sec: 5629.0, 300 sec: 5657.0). Total num frames: 1097325568. Throughput: 0: 5069.0. Samples: 1097318042. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:20:23,445][25689] Avg episode reward: [(0, '0.329')] [2022-07-11 06:20:23,887][26022] Updated weights on worker 0-0, policy_version 1071610 (0.00087) [2022-07-11 06:20:25,769][26022] Updated weights on worker 0-0, policy_version 1071620 (0.00084) [2022-07-11 06:20:27,395][26022] Updated weights on worker 0-0, policy_version 1071630 (0.00102) [2022-07-11 06:20:28,523][25689] Fps is (10 sec: 5763.1, 60 sec: 5659.5, 300 sec: 5652.9). Total num frames: 1097354240. Throughput: 0: 5952.7. Samples: 1097352442. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 06:20:28,523][25689] Avg episode reward: [(0, '0.025')] [2022-07-11 06:20:29,484][26022] Updated weights on worker 0-0, policy_version 1071640 (0.00086) [2022-07-11 06:20:30,982][26022] Updated weights on worker 0-0, policy_version 1071650 (0.00087) [2022-07-11 06:20:32,967][26022] Updated weights on worker 0-0, policy_version 1071660 (0.00095) [2022-07-11 06:20:33,537][25689] Fps is (10 sec: 5580.4, 60 sec: 5641.7, 300 sec: 5649.8). Total num frames: 1097381888. Throughput: 0: 5960.7. Samples: 1097386634. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:20:33,538][25689] Avg episode reward: [(0, '-0.141')] [2022-07-11 06:20:34,583][26022] Updated weights on worker 0-0, policy_version 1071670 (0.00086) [2022-07-11 06:20:36,480][26022] Updated weights on worker 0-0, policy_version 1071680 (0.00080) [2022-07-11 06:20:38,379][26022] Updated weights on worker 0-0, policy_version 1071690 (0.00091) [2022-07-11 06:20:38,552][25689] Fps is (10 sec: 5717.6, 60 sec: 5640.6, 300 sec: 5654.4). Total num frames: 1097411584. Throughput: 0: 5970.8. Samples: 1097421096. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:20:38,553][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 06:20:39,998][26022] Updated weights on worker 0-0, policy_version 1071700 (0.00085) [2022-07-11 06:20:41,862][26022] Updated weights on worker 0-0, policy_version 1071710 (0.00092) [2022-07-11 06:20:43,590][25689] Fps is (10 sec: 5806.2, 60 sec: 5655.3, 300 sec: 5655.3). Total num frames: 1097440256. Throughput: 0: 5970.0. Samples: 1097438310. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:20:43,590][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 06:20:43,826][26022] Updated weights on worker 0-0, policy_version 1071720 (0.00085) [2022-07-11 06:20:45,392][26022] Updated weights on worker 0-0, policy_version 1071730 (0.00081) [2022-07-11 06:20:47,458][26022] Updated weights on worker 0-0, policy_version 1071740 (0.00081) [2022-07-11 06:20:48,664][25689] Fps is (10 sec: 5772.0, 60 sec: 5670.0, 300 sec: 5657.7). Total num frames: 1097469952. Throughput: 0: 5967.3. Samples: 1097472634. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:20:48,665][25689] Avg episode reward: [(0, '0.033')] [2022-07-11 06:20:49,114][26022] Updated weights on worker 0-0, policy_version 1071750 (0.00086) [2022-07-11 06:20:50,794][26022] Updated weights on worker 0-0, policy_version 1071760 (0.00085) [2022-07-11 06:20:52,653][26022] Updated weights on worker 0-0, policy_version 1071770 (0.00085) [2022-07-11 06:20:53,691][25689] Fps is (10 sec: 5778.5, 60 sec: 5667.7, 300 sec: 5664.2). Total num frames: 1097498624. Throughput: 0: 5966.5. Samples: 1097506880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:20:53,691][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 06:20:54,331][26022] Updated weights on worker 0-0, policy_version 1071780 (0.00080) [2022-07-11 06:20:56,203][26022] Updated weights on worker 0-0, policy_version 1071790 (0.00086) [2022-07-11 06:20:58,096][26022] Updated weights on worker 0-0, policy_version 1071800 (0.00086) [2022-07-11 06:20:58,709][25689] Fps is (10 sec: 5504.8, 60 sec: 5634.1, 300 sec: 5657.8). Total num frames: 1097525248. Throughput: 0: 5099.5. Samples: 1097523888. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:20:58,710][25689] Avg episode reward: [(0, '0.788')] [2022-07-11 06:20:59,730][26022] Updated weights on worker 0-0, policy_version 1071810 (0.00111) [2022-07-11 06:21:02,124][26022] Updated weights on worker 0-0, policy_version 1071820 (0.00103) [2022-07-11 06:21:03,655][26022] Updated weights on worker 0-0, policy_version 1071830 (0.00091) [2022-07-11 06:21:03,744][25689] Fps is (10 sec: 5500.1, 60 sec: 5682.5, 300 sec: 5662.8). Total num frames: 1097553920. Throughput: 0: 5827.4. Samples: 1097555758. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:03,745][25689] Avg episode reward: [(0, '1.007')] [2022-07-11 06:21:05,662][26022] Updated weights on worker 0-0, policy_version 1071840 (0.00095) [2022-07-11 06:21:07,514][26022] Updated weights on worker 0-0, policy_version 1071850 (0.00087) [2022-07-11 06:21:08,816][25689] Fps is (10 sec: 5572.7, 60 sec: 5671.8, 300 sec: 5651.8). Total num frames: 1097581568. Throughput: 0: 5809.1. Samples: 1097589696. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:08,817][25689] Avg episode reward: [(0, '1.456')] [2022-07-11 06:21:09,300][26022] Updated weights on worker 0-0, policy_version 1071860 (0.00084) [2022-07-11 06:21:11,102][26022] Updated weights on worker 0-0, policy_version 1071870 (0.00084) [2022-07-11 06:21:13,027][26022] Updated weights on worker 0-0, policy_version 1071880 (0.00082) [2022-07-11 06:21:13,819][25689] Fps is (10 sec: 5488.7, 60 sec: 5641.8, 300 sec: 5651.9). Total num frames: 1097609216. Throughput: 0: 4954.2. Samples: 1097606598. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:13,819][25689] Avg episode reward: [(0, '1.432')] [2022-07-11 06:21:14,770][26022] Updated weights on worker 0-0, policy_version 1071890 (0.00080) [2022-07-11 06:21:16,700][26022] Updated weights on worker 0-0, policy_version 1071900 (0.00084) [2022-07-11 06:21:18,416][26022] Updated weights on worker 0-0, policy_version 1071910 (0.00090) [2022-07-11 06:21:18,831][25689] Fps is (10 sec: 5726.0, 60 sec: 5676.1, 300 sec: 5652.0). Total num frames: 1097638912. Throughput: 0: 5804.9. Samples: 1097640690. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:18,831][25689] Avg episode reward: [(0, '2.004')] [2022-07-11 06:21:20,292][26022] Updated weights on worker 0-0, policy_version 1071920 (0.00093) [2022-07-11 06:21:21,947][26022] Updated weights on worker 0-0, policy_version 1071930 (0.00095) [2022-07-11 06:21:23,841][25689] Fps is (10 sec: 5619.2, 60 sec: 5628.9, 300 sec: 5645.7). Total num frames: 1097665536. Throughput: 0: 5914.9. Samples: 1097674632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:23,842][25689] Avg episode reward: [(0, '2.026')] [2022-07-11 06:21:24,036][26022] Updated weights on worker 0-0, policy_version 1071940 (0.00088) [2022-07-11 06:21:25,441][26022] Updated weights on worker 0-0, policy_version 1071950 (0.00085) [2022-07-11 06:21:27,579][26022] Updated weights on worker 0-0, policy_version 1071960 (0.00091) [2022-07-11 06:21:28,883][25689] Fps is (10 sec: 5602.7, 60 sec: 5649.3, 300 sec: 5649.7). Total num frames: 1097695232. Throughput: 0: 5066.3. Samples: 1097691362. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:28,884][25689] Avg episode reward: [(0, '1.845')] [2022-07-11 06:21:29,186][26022] Updated weights on worker 0-0, policy_version 1071970 (0.00084) [2022-07-11 06:21:31,412][26022] Updated weights on worker 0-0, policy_version 1071980 (0.00087) [2022-07-11 06:21:32,865][26022] Updated weights on worker 0-0, policy_version 1071990 (0.00083) [2022-07-11 06:21:33,897][25689] Fps is (10 sec: 5601.2, 60 sec: 5632.4, 300 sec: 5642.7). Total num frames: 1097721856. Throughput: 0: 5899.6. Samples: 1097725048. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:33,897][25689] Avg episode reward: [(0, '1.996')] [2022-07-11 06:21:34,958][26022] Updated weights on worker 0-0, policy_version 1072000 (0.00090) [2022-07-11 06:21:36,648][26022] Updated weights on worker 0-0, policy_version 1072010 (0.00095) [2022-07-11 06:21:38,413][26022] Updated weights on worker 0-0, policy_version 1072020 (0.00085) [2022-07-11 06:21:38,925][25689] Fps is (10 sec: 5506.4, 60 sec: 5614.1, 300 sec: 5646.0). Total num frames: 1097750528. Throughput: 0: 5869.0. Samples: 1097758624. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:38,926][25689] Avg episode reward: [(0, '2.036')] [2022-07-11 06:21:40,131][26022] Updated weights on worker 0-0, policy_version 1072030 (0.00087) [2022-07-11 06:21:41,259][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:21:41,267][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001072035_1097763840.pth [2022-07-11 06:21:41,278][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001070047_1095728128.pth [2022-07-11 06:21:42,096][26022] Updated weights on worker 0-0, policy_version 1072040 (0.00089) [2022-07-11 06:21:43,911][26022] Updated weights on worker 0-0, policy_version 1072050 (0.00081) [2022-07-11 06:21:43,950][25689] Fps is (10 sec: 5704.1, 60 sec: 5615.4, 300 sec: 5643.5). Total num frames: 1097779200. Throughput: 0: 5025.2. Samples: 1097775680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:43,950][25689] Avg episode reward: [(0, '2.059')] [2022-07-11 06:21:45,751][26022] Updated weights on worker 0-0, policy_version 1072060 (0.00083) [2022-07-11 06:21:47,446][26022] Updated weights on worker 0-0, policy_version 1072070 (0.00122) [2022-07-11 06:21:49,041][25689] Fps is (10 sec: 5668.6, 60 sec: 5596.8, 300 sec: 5642.0). Total num frames: 1097807872. Throughput: 0: 5873.0. Samples: 1097809752. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:49,042][25689] Avg episode reward: [(0, '1.997')] [2022-07-11 06:21:49,438][26022] Updated weights on worker 0-0, policy_version 1072080 (0.00091) [2022-07-11 06:21:51,196][26022] Updated weights on worker 0-0, policy_version 1072090 (0.00111) [2022-07-11 06:21:53,063][26022] Updated weights on worker 0-0, policy_version 1072100 (0.00080) [2022-07-11 06:21:54,057][25689] Fps is (10 sec: 5572.2, 60 sec: 5580.9, 300 sec: 5646.9). Total num frames: 1097835520. Throughput: 0: 5895.1. Samples: 1097843896. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:54,058][25689] Avg episode reward: [(0, '1.844')] [2022-07-11 06:21:54,790][26022] Updated weights on worker 0-0, policy_version 1072110 (0.00089) [2022-07-11 06:21:56,700][26022] Updated weights on worker 0-0, policy_version 1072120 (0.00096) [2022-07-11 06:21:58,402][26022] Updated weights on worker 0-0, policy_version 1072130 (0.00079) [2022-07-11 06:21:59,063][25689] Fps is (10 sec: 5722.2, 60 sec: 5633.0, 300 sec: 5647.5). Total num frames: 1097865216. Throughput: 0: 5075.0. Samples: 1097860820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:21:59,063][25689] Avg episode reward: [(0, '1.570')] [2022-07-11 06:22:00,280][26022] Updated weights on worker 0-0, policy_version 1072140 (0.00083) [2022-07-11 06:22:01,952][26022] Updated weights on worker 0-0, policy_version 1072150 (0.00082) [2022-07-11 06:22:04,078][25689] Fps is (10 sec: 5517.8, 60 sec: 5583.8, 300 sec: 5644.7). Total num frames: 1097890816. Throughput: 0: 5827.1. Samples: 1097892972. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:04,080][25689] Avg episode reward: [(0, '1.519')] [2022-07-11 06:22:04,298][26022] Updated weights on worker 0-0, policy_version 1072160 (0.00091) [2022-07-11 06:22:05,925][26022] Updated weights on worker 0-0, policy_version 1072170 (0.00087) [2022-07-11 06:22:07,883][26022] Updated weights on worker 0-0, policy_version 1072180 (0.00085) [2022-07-11 06:22:09,119][25689] Fps is (10 sec: 5396.9, 60 sec: 5603.7, 300 sec: 5644.3). Total num frames: 1097919488. Throughput: 0: 5824.5. Samples: 1097926692. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:09,119][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 06:22:09,580][26022] Updated weights on worker 0-0, policy_version 1072190 (0.00086) [2022-07-11 06:22:11,361][26022] Updated weights on worker 0-0, policy_version 1072200 (0.00090) [2022-07-11 06:22:13,333][26022] Updated weights on worker 0-0, policy_version 1072210 (0.00086) [2022-07-11 06:22:14,151][25689] Fps is (10 sec: 5693.2, 60 sec: 5618.0, 300 sec: 5647.9). Total num frames: 1097948160. Throughput: 0: 4963.0. Samples: 1097943622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:14,151][25689] Avg episode reward: [(0, '1.364')] [2022-07-11 06:22:15,110][26022] Updated weights on worker 0-0, policy_version 1072220 (0.00091) [2022-07-11 06:22:16,535][26022] Updated weights on worker 0-0, policy_version 1072230 (0.00052) [2022-07-11 06:22:18,739][26022] Updated weights on worker 0-0, policy_version 1072240 (0.00086) [2022-07-11 06:22:19,178][25689] Fps is (10 sec: 5700.3, 60 sec: 5599.5, 300 sec: 5644.4). Total num frames: 1097976832. Throughput: 0: 5819.9. Samples: 1097977892. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:19,179][25689] Avg episode reward: [(0, '1.393')] [2022-07-11 06:22:20,255][26022] Updated weights on worker 0-0, policy_version 1072250 (0.00086) [2022-07-11 06:22:22,225][26022] Updated weights on worker 0-0, policy_version 1072260 (0.00090) [2022-07-11 06:22:23,948][26022] Updated weights on worker 0-0, policy_version 1072270 (0.00094) [2022-07-11 06:22:24,208][25689] Fps is (10 sec: 5701.4, 60 sec: 5631.7, 300 sec: 5644.6). Total num frames: 1098005504. Throughput: 0: 5927.5. Samples: 1098012294. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:24,209][25689] Avg episode reward: [(0, '1.047')] [2022-07-11 06:22:25,898][26022] Updated weights on worker 0-0, policy_version 1072280 (0.00084) [2022-07-11 06:22:27,518][26022] Updated weights on worker 0-0, policy_version 1072290 (0.00083) [2022-07-11 06:22:29,265][25689] Fps is (10 sec: 5583.5, 60 sec: 5596.3, 300 sec: 5640.6). Total num frames: 1098033152. Throughput: 0: 5091.1. Samples: 1098029262. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:29,266][25689] Avg episode reward: [(0, '1.696')] [2022-07-11 06:22:29,581][26022] Updated weights on worker 0-0, policy_version 1072300 (0.00090) [2022-07-11 06:22:31,023][26022] Updated weights on worker 0-0, policy_version 1072310 (0.00085) [2022-07-11 06:22:33,192][26022] Updated weights on worker 0-0, policy_version 1072320 (0.00091) [2022-07-11 06:22:34,354][25689] Fps is (10 sec: 5753.2, 60 sec: 5657.2, 300 sec: 5649.5). Total num frames: 1098063872. Throughput: 0: 5909.3. Samples: 1098063008. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:34,354][25689] Avg episode reward: [(0, '0.434')] [2022-07-11 06:22:34,656][26022] Updated weights on worker 0-0, policy_version 1072330 (0.00085) [2022-07-11 06:22:36,651][26022] Updated weights on worker 0-0, policy_version 1072340 (0.00084) [2022-07-11 06:22:38,492][26022] Updated weights on worker 0-0, policy_version 1072350 (0.00091) [2022-07-11 06:22:39,418][25689] Fps is (10 sec: 5648.1, 60 sec: 5620.0, 300 sec: 5638.0). Total num frames: 1098090496. Throughput: 0: 5887.2. Samples: 1098097048. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:39,419][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 06:22:40,246][26022] Updated weights on worker 0-0, policy_version 1072360 (0.00093) [2022-07-11 06:22:42,085][26022] Updated weights on worker 0-0, policy_version 1072370 (0.00078) [2022-07-11 06:22:43,839][26022] Updated weights on worker 0-0, policy_version 1072380 (0.00080) [2022-07-11 06:22:44,439][25689] Fps is (10 sec: 5482.6, 60 sec: 5620.3, 300 sec: 5633.1). Total num frames: 1098119168. Throughput: 0: 5027.2. Samples: 1098114000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:44,440][25689] Avg episode reward: [(0, '0.525')] [2022-07-11 06:22:45,671][26022] Updated weights on worker 0-0, policy_version 1072390 (0.00072) [2022-07-11 06:22:47,684][26022] Updated weights on worker 0-0, policy_version 1072400 (0.00081) [2022-07-11 06:22:49,176][26022] Updated weights on worker 0-0, policy_version 1072410 (0.00091) [2022-07-11 06:22:49,526][25689] Fps is (10 sec: 5875.8, 60 sec: 5654.6, 300 sec: 5646.0). Total num frames: 1098149888. Throughput: 0: 5874.5. Samples: 1098148282. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:49,527][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 06:22:51,341][26022] Updated weights on worker 0-0, policy_version 1072420 (0.00086) [2022-07-11 06:22:52,799][26022] Updated weights on worker 0-0, policy_version 1072430 (0.00084) [2022-07-11 06:22:54,530][25689] Fps is (10 sec: 5784.3, 60 sec: 5655.7, 300 sec: 5642.9). Total num frames: 1098177536. Throughput: 0: 5933.5. Samples: 1098182726. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:54,531][25689] Avg episode reward: [(0, '-0.082')] [2022-07-11 06:22:54,837][26022] Updated weights on worker 0-0, policy_version 1072440 (0.00084) [2022-07-11 06:22:56,431][26022] Updated weights on worker 0-0, policy_version 1072450 (0.00083) [2022-07-11 06:22:58,358][26022] Updated weights on worker 0-0, policy_version 1072460 (0.00086) [2022-07-11 06:22:59,537][25689] Fps is (10 sec: 5523.3, 60 sec: 5621.6, 300 sec: 5639.5). Total num frames: 1098205184. Throughput: 0: 5118.4. Samples: 1098200026. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:22:59,538][25689] Avg episode reward: [(0, '-0.275')] [2022-07-11 06:22:59,940][26022] Updated weights on worker 0-0, policy_version 1072470 (0.00080) [2022-07-11 06:23:02,172][26022] Updated weights on worker 0-0, policy_version 1072480 (0.00089) [2022-07-11 06:23:03,843][26022] Updated weights on worker 0-0, policy_version 1072490 (0.00085) [2022-07-11 06:23:04,566][25689] Fps is (10 sec: 5510.0, 60 sec: 5654.3, 300 sec: 5641.0). Total num frames: 1098232832. Throughput: 0: 5861.3. Samples: 1098231968. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:04,566][25689] Avg episode reward: [(0, '-0.719')] [2022-07-11 06:23:05,987][26022] Updated weights on worker 0-0, policy_version 1072500 (0.00088) [2022-07-11 06:23:07,451][26022] Updated weights on worker 0-0, policy_version 1072510 (0.00082) [2022-07-11 06:23:09,496][26022] Updated weights on worker 0-0, policy_version 1072520 (0.00085) [2022-07-11 06:23:09,648][25689] Fps is (10 sec: 5570.6, 60 sec: 5650.5, 300 sec: 5643.5). Total num frames: 1098261504. Throughput: 0: 5850.1. Samples: 1098265996. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:09,648][25689] Avg episode reward: [(0, '-1.316')] [2022-07-11 06:23:11,463][26022] Updated weights on worker 0-0, policy_version 1072530 (0.00080) [2022-07-11 06:23:12,984][26022] Updated weights on worker 0-0, policy_version 1072540 (0.00082) [2022-07-11 06:23:14,702][25689] Fps is (10 sec: 5556.3, 60 sec: 5631.4, 300 sec: 5636.9). Total num frames: 1098289152. Throughput: 0: 4985.5. Samples: 1098283292. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:14,703][25689] Avg episode reward: [(0, '-1.186')] [2022-07-11 06:23:15,019][26022] Updated weights on worker 0-0, policy_version 1072550 (0.00083) [2022-07-11 06:23:16,519][26022] Updated weights on worker 0-0, policy_version 1072560 (0.00093) [2022-07-11 06:23:18,611][26022] Updated weights on worker 0-0, policy_version 1072570 (0.00091) [2022-07-11 06:23:19,715][25689] Fps is (10 sec: 5695.8, 60 sec: 5649.7, 300 sec: 5644.4). Total num frames: 1098318848. Throughput: 0: 5811.3. Samples: 1098317286. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:19,717][25689] Avg episode reward: [(0, '-1.635')] [2022-07-11 06:23:20,243][26022] Updated weights on worker 0-0, policy_version 1072580 (0.00082) [2022-07-11 06:23:22,097][26022] Updated weights on worker 0-0, policy_version 1072590 (0.00093) [2022-07-11 06:23:23,776][26022] Updated weights on worker 0-0, policy_version 1072600 (0.00085) [2022-07-11 06:23:24,724][25689] Fps is (10 sec: 5823.9, 60 sec: 5651.7, 300 sec: 5638.5). Total num frames: 1098347520. Throughput: 0: 5943.3. Samples: 1098351776. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:24,725][25689] Avg episode reward: [(0, '-1.042')] [2022-07-11 06:23:25,773][26022] Updated weights on worker 0-0, policy_version 1072610 (0.00056) [2022-07-11 06:23:27,411][26022] Updated weights on worker 0-0, policy_version 1072620 (0.00085) [2022-07-11 06:23:29,261][26022] Updated weights on worker 0-0, policy_version 1072630 (0.00089) [2022-07-11 06:23:29,814][25689] Fps is (10 sec: 5576.9, 60 sec: 5648.6, 300 sec: 5640.5). Total num frames: 1098375168. Throughput: 0: 5089.2. Samples: 1098368628. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:29,815][25689] Avg episode reward: [(0, '0.170')] [2022-07-11 06:23:30,913][26022] Updated weights on worker 0-0, policy_version 1072640 (0.00082) [2022-07-11 06:23:32,893][26022] Updated weights on worker 0-0, policy_version 1072650 (0.00086) [2022-07-11 06:23:34,749][26022] Updated weights on worker 0-0, policy_version 1072660 (0.00080) [2022-07-11 06:23:34,868][25689] Fps is (10 sec: 5653.3, 60 sec: 5634.9, 300 sec: 5641.0). Total num frames: 1098404864. Throughput: 0: 5930.8. Samples: 1098402890. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:34,868][25689] Avg episode reward: [(0, '-0.045')] [2022-07-11 06:23:36,558][26022] Updated weights on worker 0-0, policy_version 1072670 (0.00083) [2022-07-11 06:23:38,239][26022] Updated weights on worker 0-0, policy_version 1072680 (0.00081) [2022-07-11 06:23:39,894][25689] Fps is (10 sec: 5790.4, 60 sec: 5672.3, 300 sec: 5638.3). Total num frames: 1098433536. Throughput: 0: 5928.5. Samples: 1098436918. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:39,895][25689] Avg episode reward: [(0, '0.323')] [2022-07-11 06:23:39,916][26022] Updated weights on worker 0-0, policy_version 1072690 (0.00088) [2022-07-11 06:23:41,285][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:23:41,297][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001072697_1098441728.pth [2022-07-11 06:23:41,298][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001070711_1096408064.pth [2022-07-11 06:23:41,921][26022] Updated weights on worker 0-0, policy_version 1072700 (0.00083) [2022-07-11 06:23:43,622][26022] Updated weights on worker 0-0, policy_version 1072710 (0.00092) [2022-07-11 06:23:44,904][25689] Fps is (10 sec: 5611.5, 60 sec: 5656.4, 300 sec: 5639.5). Total num frames: 1098461184. Throughput: 0: 5934.9. Samples: 1098471542. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:44,905][25689] Avg episode reward: [(0, '0.780')] [2022-07-11 06:23:45,439][26022] Updated weights on worker 0-0, policy_version 1072720 (0.00085) [2022-07-11 06:23:47,311][26022] Updated weights on worker 0-0, policy_version 1072730 (0.00082) [2022-07-11 06:23:49,066][26022] Updated weights on worker 0-0, policy_version 1072740 (0.00085) [2022-07-11 06:23:50,025][25689] Fps is (10 sec: 5660.3, 60 sec: 5636.3, 300 sec: 5637.6). Total num frames: 1098490880. Throughput: 0: 5938.2. Samples: 1098488646. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:50,026][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 06:23:50,787][26022] Updated weights on worker 0-0, policy_version 1072750 (0.00084) [2022-07-11 06:23:52,787][26022] Updated weights on worker 0-0, policy_version 1072760 (0.00086) [2022-07-11 06:23:54,589][26022] Updated weights on worker 0-0, policy_version 1072770 (0.00088) [2022-07-11 06:23:55,064][25689] Fps is (10 sec: 5644.1, 60 sec: 5633.1, 300 sec: 5638.0). Total num frames: 1098518528. Throughput: 0: 5908.5. Samples: 1098522222. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:23:55,065][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 06:23:56,411][26022] Updated weights on worker 0-0, policy_version 1072780 (0.00080) [2022-07-11 06:23:58,185][26022] Updated weights on worker 0-0, policy_version 1072790 (0.00081) [2022-07-11 06:23:59,873][26022] Updated weights on worker 0-0, policy_version 1072800 (0.00086) [2022-07-11 06:24:00,079][25689] Fps is (10 sec: 5703.9, 60 sec: 5666.2, 300 sec: 5648.3). Total num frames: 1098548224. Throughput: 0: 5907.0. Samples: 1098556146. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:00,079][25689] Avg episode reward: [(0, '0.972')] [2022-07-11 06:24:02,455][26022] Updated weights on worker 0-0, policy_version 1072810 (0.00492) [2022-07-11 06:24:03,808][26022] Updated weights on worker 0-0, policy_version 1072820 (0.00084) [2022-07-11 06:24:05,085][25689] Fps is (10 sec: 5518.2, 60 sec: 5634.5, 300 sec: 5635.2). Total num frames: 1098573824. Throughput: 0: 4934.6. Samples: 1098571126. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:05,085][25689] Avg episode reward: [(0, '1.124')] [2022-07-11 06:24:05,823][26022] Updated weights on worker 0-0, policy_version 1072830 (0.00094) [2022-07-11 06:24:07,320][26022] Updated weights on worker 0-0, policy_version 1072840 (0.00091) [2022-07-11 06:24:09,299][26022] Updated weights on worker 0-0, policy_version 1072850 (0.00093) [2022-07-11 06:24:10,155][25689] Fps is (10 sec: 5386.3, 60 sec: 5635.6, 300 sec: 5641.8). Total num frames: 1098602496. Throughput: 0: 5803.1. Samples: 1098605460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:10,155][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 06:24:11,079][26022] Updated weights on worker 0-0, policy_version 1072860 (0.00094) [2022-07-11 06:24:12,948][26022] Updated weights on worker 0-0, policy_version 1072870 (0.00088) [2022-07-11 06:24:14,820][26022] Updated weights on worker 0-0, policy_version 1072880 (0.00091) [2022-07-11 06:24:15,162][25689] Fps is (10 sec: 5690.6, 60 sec: 5656.9, 300 sec: 5638.4). Total num frames: 1098631168. Throughput: 0: 5825.0. Samples: 1098639290. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:15,162][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 06:24:16,536][26022] Updated weights on worker 0-0, policy_version 1072890 (0.00105) [2022-07-11 06:24:18,183][26022] Updated weights on worker 0-0, policy_version 1072900 (0.00083) [2022-07-11 06:24:20,159][26022] Updated weights on worker 0-0, policy_version 1072910 (0.00081) [2022-07-11 06:24:20,179][25689] Fps is (10 sec: 5720.7, 60 sec: 5639.6, 300 sec: 5634.7). Total num frames: 1098659840. Throughput: 0: 4996.2. Samples: 1098656570. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:20,179][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 06:24:21,687][26022] Updated weights on worker 0-0, policy_version 1072920 (0.00092) [2022-07-11 06:24:23,687][26022] Updated weights on worker 0-0, policy_version 1072930 (0.00089) [2022-07-11 06:24:25,184][25689] Fps is (10 sec: 5824.1, 60 sec: 5656.9, 300 sec: 5645.7). Total num frames: 1098689536. Throughput: 0: 5980.4. Samples: 1098691324. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:25,186][25689] Avg episode reward: [(0, '0.844')] [2022-07-11 06:24:25,326][26022] Updated weights on worker 0-0, policy_version 1072940 (0.00081) [2022-07-11 06:24:27,316][26022] Updated weights on worker 0-0, policy_version 1072950 (0.00240) [2022-07-11 06:24:28,937][26022] Updated weights on worker 0-0, policy_version 1072960 (0.00082) [2022-07-11 06:24:30,273][25689] Fps is (10 sec: 5680.8, 60 sec: 5657.0, 300 sec: 5640.7). Total num frames: 1098717184. Throughput: 0: 5966.7. Samples: 1098725500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 06:24:30,274][25689] Avg episode reward: [(0, '1.129')] [2022-07-11 06:24:30,932][26022] Updated weights on worker 0-0, policy_version 1072970 (0.00086) [2022-07-11 06:24:32,446][26022] Updated weights on worker 0-0, policy_version 1072980 (0.00080) [2022-07-11 06:24:34,541][26022] Updated weights on worker 0-0, policy_version 1072990 (0.00082) [2022-07-11 06:24:35,290][25689] Fps is (10 sec: 5572.7, 60 sec: 5643.5, 300 sec: 5637.0). Total num frames: 1098745856. Throughput: 0: 5133.1. Samples: 1098742610. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:24:35,291][25689] Avg episode reward: [(0, '0.517')] [2022-07-11 06:24:36,042][26022] Updated weights on worker 0-0, policy_version 1073000 (0.00088) [2022-07-11 06:24:38,031][26022] Updated weights on worker 0-0, policy_version 1073010 (0.00083) [2022-07-11 06:24:39,843][26022] Updated weights on worker 0-0, policy_version 1073020 (0.00087) [2022-07-11 06:24:40,312][25689] Fps is (10 sec: 5814.2, 60 sec: 5660.9, 300 sec: 5643.7). Total num frames: 1098775552. Throughput: 0: 5988.5. Samples: 1098777138. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:24:40,312][25689] Avg episode reward: [(0, '0.650')] [2022-07-11 06:24:41,717][26022] Updated weights on worker 0-0, policy_version 1073030 (0.00088) [2022-07-11 06:24:43,273][26022] Updated weights on worker 0-0, policy_version 1073040 (0.00359) [2022-07-11 06:24:45,160][26022] Updated weights on worker 0-0, policy_version 1073050 (0.00082) [2022-07-11 06:24:45,336][25689] Fps is (10 sec: 5708.3, 60 sec: 5659.6, 300 sec: 5640.8). Total num frames: 1098803200. Throughput: 0: 5974.5. Samples: 1098811722. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:24:45,336][25689] Avg episode reward: [(0, '0.715')] [2022-07-11 06:24:47,019][26022] Updated weights on worker 0-0, policy_version 1073060 (0.00084) [2022-07-11 06:24:48,551][26022] Updated weights on worker 0-0, policy_version 1073070 (0.00084) [2022-07-11 06:24:50,390][26022] Updated weights on worker 0-0, policy_version 1073080 (0.00081) [2022-07-11 06:24:50,485][25689] Fps is (10 sec: 5737.4, 60 sec: 5673.8, 300 sec: 5644.9). Total num frames: 1098833920. Throughput: 0: 5117.1. Samples: 1098828928. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:24:50,486][25689] Avg episode reward: [(0, '-0.063')] [2022-07-11 06:24:52,191][26022] Updated weights on worker 0-0, policy_version 1073090 (0.00085) [2022-07-11 06:24:54,177][26022] Updated weights on worker 0-0, policy_version 1073100 (0.00086) [2022-07-11 06:24:55,499][25689] Fps is (10 sec: 5944.8, 60 sec: 5710.1, 300 sec: 5648.5). Total num frames: 1098863616. Throughput: 0: 5982.2. Samples: 1098863502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:24:55,499][25689] Avg episode reward: [(0, '-0.193')] [2022-07-11 06:24:55,747][26022] Updated weights on worker 0-0, policy_version 1073110 (0.00094) [2022-07-11 06:24:57,889][26022] Updated weights on worker 0-0, policy_version 1073120 (0.00095) [2022-07-11 06:24:59,363][26022] Updated weights on worker 0-0, policy_version 1073130 (0.00082) [2022-07-11 06:25:00,581][25689] Fps is (10 sec: 5476.9, 60 sec: 5636.0, 300 sec: 5647.1). Total num frames: 1098889216. Throughput: 0: 5940.2. Samples: 1098897544. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:00,582][25689] Avg episode reward: [(0, '-0.169')] [2022-07-11 06:25:01,240][26022] Updated weights on worker 0-0, policy_version 1073140 (0.00099) [2022-07-11 06:25:03,525][26022] Updated weights on worker 0-0, policy_version 1073150 (0.00090) [2022-07-11 06:25:05,179][26022] Updated weights on worker 0-0, policy_version 1073160 (0.00084) [2022-07-11 06:25:05,662][25689] Fps is (10 sec: 5440.7, 60 sec: 5696.7, 300 sec: 5651.6). Total num frames: 1098918912. Throughput: 0: 4963.6. Samples: 1098912612. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:05,662][25689] Avg episode reward: [(0, '-0.058')] [2022-07-11 06:25:07,075][26022] Updated weights on worker 0-0, policy_version 1073170 (0.00089) [2022-07-11 06:25:08,855][26022] Updated weights on worker 0-0, policy_version 1073180 (0.00096) [2022-07-11 06:25:10,647][26022] Updated weights on worker 0-0, policy_version 1073190 (0.00912) [2022-07-11 06:25:10,718][25689] Fps is (10 sec: 5758.3, 60 sec: 5698.0, 300 sec: 5648.0). Total num frames: 1098947584. Throughput: 0: 5835.9. Samples: 1098947004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:10,718][25689] Avg episode reward: [(0, '-0.835')] [2022-07-11 06:25:12,587][26022] Updated weights on worker 0-0, policy_version 1073200 (0.00083) [2022-07-11 06:25:14,333][26022] Updated weights on worker 0-0, policy_version 1073210 (0.00080) [2022-07-11 06:25:15,752][25689] Fps is (10 sec: 5581.6, 60 sec: 5678.5, 300 sec: 5647.6). Total num frames: 1098975232. Throughput: 0: 5811.5. Samples: 1098981206. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:15,753][25689] Avg episode reward: [(0, '-0.480')] [2022-07-11 06:25:16,083][26022] Updated weights on worker 0-0, policy_version 1073220 (0.00079) [2022-07-11 06:25:17,772][26022] Updated weights on worker 0-0, policy_version 1073230 (0.00092) [2022-07-11 06:25:19,671][26022] Updated weights on worker 0-0, policy_version 1073240 (0.00087) [2022-07-11 06:25:20,775][25689] Fps is (10 sec: 5599.7, 60 sec: 5677.9, 300 sec: 5644.7). Total num frames: 1099003904. Throughput: 0: 4983.9. Samples: 1098998188. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:20,776][25689] Avg episode reward: [(0, '-0.507')] [2022-07-11 06:25:21,504][26022] Updated weights on worker 0-0, policy_version 1073250 (0.00086) [2022-07-11 06:25:23,146][26022] Updated weights on worker 0-0, policy_version 1073260 (0.00088) [2022-07-11 06:25:24,955][26022] Updated weights on worker 0-0, policy_version 1073270 (0.00093) [2022-07-11 06:25:25,790][25689] Fps is (10 sec: 5610.6, 60 sec: 5643.2, 300 sec: 5642.4). Total num frames: 1099031552. Throughput: 0: 5944.2. Samples: 1099032260. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:25,791][25689] Avg episode reward: [(0, '-1.033')] [2022-07-11 06:25:26,767][26022] Updated weights on worker 0-0, policy_version 1073280 (0.00088) [2022-07-11 06:25:28,715][26022] Updated weights on worker 0-0, policy_version 1073290 (0.00088) [2022-07-11 06:25:30,381][26022] Updated weights on worker 0-0, policy_version 1073300 (0.00085) [2022-07-11 06:25:30,834][25689] Fps is (10 sec: 5701.1, 60 sec: 5681.3, 300 sec: 5648.8). Total num frames: 1099061248. Throughput: 0: 5928.1. Samples: 1099066254. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:30,838][25689] Avg episode reward: [(0, '-0.942')] [2022-07-11 06:25:32,252][26022] Updated weights on worker 0-0, policy_version 1073310 (0.00084) [2022-07-11 06:25:34,096][26022] Updated weights on worker 0-0, policy_version 1073320 (0.00085) [2022-07-11 06:25:35,778][26022] Updated weights on worker 0-0, policy_version 1073330 (0.00085) [2022-07-11 06:25:35,843][25689] Fps is (10 sec: 5806.2, 60 sec: 5682.0, 300 sec: 5645.4). Total num frames: 1099089920. Throughput: 0: 5080.9. Samples: 1099083284. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:35,845][25689] Avg episode reward: [(0, '-0.485')] [2022-07-11 06:25:37,711][26022] Updated weights on worker 0-0, policy_version 1073340 (0.00084) [2022-07-11 06:25:39,325][26022] Updated weights on worker 0-0, policy_version 1073350 (0.00082) [2022-07-11 06:25:40,852][25689] Fps is (10 sec: 5519.6, 60 sec: 5632.5, 300 sec: 5639.1). Total num frames: 1099116544. Throughput: 0: 5953.8. Samples: 1099117720. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:40,855][25689] Avg episode reward: [(0, '0.034')] [2022-07-11 06:25:41,353][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:25:41,366][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001073360_1099120640.pth [2022-07-11 06:25:41,370][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001071375_1097088000.pth [2022-07-11 06:25:41,375][26022] Updated weights on worker 0-0, policy_version 1073360 (0.00092) [2022-07-11 06:25:42,868][26022] Updated weights on worker 0-0, policy_version 1073370 (0.00085) [2022-07-11 06:25:44,894][26022] Updated weights on worker 0-0, policy_version 1073380 (0.00630) [2022-07-11 06:25:45,911][25689] Fps is (10 sec: 5797.2, 60 sec: 5696.8, 300 sec: 5646.3). Total num frames: 1099148288. Throughput: 0: 5959.0. Samples: 1099152162. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:45,912][25689] Avg episode reward: [(0, '0.165')] [2022-07-11 06:25:46,309][26022] Updated weights on worker 0-0, policy_version 1073390 (0.00083) [2022-07-11 06:25:48,473][26022] Updated weights on worker 0-0, policy_version 1073400 (0.00084) [2022-07-11 06:25:50,259][26022] Updated weights on worker 0-0, policy_version 1073410 (0.00084) [2022-07-11 06:25:50,968][25689] Fps is (10 sec: 5769.8, 60 sec: 5637.8, 300 sec: 5638.8). Total num frames: 1099174912. Throughput: 0: 5106.1. Samples: 1099169062. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:50,969][25689] Avg episode reward: [(0, '0.846')] [2022-07-11 06:25:51,942][26022] Updated weights on worker 0-0, policy_version 1073420 (0.00704) [2022-07-11 06:25:53,880][26022] Updated weights on worker 0-0, policy_version 1073430 (0.00083) [2022-07-11 06:25:55,773][26022] Updated weights on worker 0-0, policy_version 1073440 (0.00085) [2022-07-11 06:25:56,035][25689] Fps is (10 sec: 5462.2, 60 sec: 5616.0, 300 sec: 5644.8). Total num frames: 1099203584. Throughput: 0: 5937.4. Samples: 1099203170. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:25:56,035][25689] Avg episode reward: [(0, '0.746')] [2022-07-11 06:25:57,355][26022] Updated weights on worker 0-0, policy_version 1073450 (0.00093) [2022-07-11 06:25:59,359][26022] Updated weights on worker 0-0, policy_version 1073460 (0.00085) [2022-07-11 06:26:00,722][26022] Updated weights on worker 0-0, policy_version 1073470 (0.00090) [2022-07-11 06:26:01,043][25689] Fps is (10 sec: 5793.6, 60 sec: 5690.7, 300 sec: 5648.7). Total num frames: 1099233280. Throughput: 0: 5922.6. Samples: 1099237302. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:01,043][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 06:26:03,400][26022] Updated weights on worker 0-0, policy_version 1073480 (0.00089) [2022-07-11 06:26:05,019][26022] Updated weights on worker 0-0, policy_version 1073490 (0.00088) [2022-07-11 06:26:06,054][25689] Fps is (10 sec: 5416.9, 60 sec: 5612.4, 300 sec: 5639.6). Total num frames: 1099257856. Throughput: 0: 4968.6. Samples: 1099252242. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:06,054][25689] Avg episode reward: [(0, '0.765')] [2022-07-11 06:26:06,850][26022] Updated weights on worker 0-0, policy_version 1073500 (0.00088) [2022-07-11 06:26:08,686][26022] Updated weights on worker 0-0, policy_version 1073510 (0.00086) [2022-07-11 06:26:10,578][26022] Updated weights on worker 0-0, policy_version 1073520 (0.00081) [2022-07-11 06:26:11,137][25689] Fps is (10 sec: 5478.1, 60 sec: 5643.8, 300 sec: 5648.4). Total num frames: 1099288576. Throughput: 0: 5804.9. Samples: 1099286138. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:11,137][25689] Avg episode reward: [(0, '-0.251')] [2022-07-11 06:26:12,271][26022] Updated weights on worker 0-0, policy_version 1073530 (0.00093) [2022-07-11 06:26:14,212][26022] Updated weights on worker 0-0, policy_version 1073540 (0.00083) [2022-07-11 06:26:15,787][26022] Updated weights on worker 0-0, policy_version 1073550 (0.00087) [2022-07-11 06:26:16,166][25689] Fps is (10 sec: 5873.1, 60 sec: 5661.2, 300 sec: 5644.6). Total num frames: 1099317248. Throughput: 0: 5824.3. Samples: 1099320422. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:16,167][25689] Avg episode reward: [(0, '0.718')] [2022-07-11 06:26:17,788][26022] Updated weights on worker 0-0, policy_version 1073560 (0.00116) [2022-07-11 06:26:19,340][26022] Updated weights on worker 0-0, policy_version 1073570 (0.00090) [2022-07-11 06:26:21,229][25689] Fps is (10 sec: 5580.6, 60 sec: 5640.6, 300 sec: 5647.0). Total num frames: 1099344896. Throughput: 0: 5815.6. Samples: 1099354696. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:21,229][25689] Avg episode reward: [(0, '0.567')] [2022-07-11 06:26:21,359][26022] Updated weights on worker 0-0, policy_version 1073580 (0.00086) [2022-07-11 06:26:23,057][26022] Updated weights on worker 0-0, policy_version 1073590 (0.00105) [2022-07-11 06:26:24,928][26022] Updated weights on worker 0-0, policy_version 1073600 (0.00090) [2022-07-11 06:26:26,259][25689] Fps is (10 sec: 5681.9, 60 sec: 5673.0, 300 sec: 5647.3). Total num frames: 1099374592. Throughput: 0: 5931.9. Samples: 1099372096. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:26,259][25689] Avg episode reward: [(0, '-0.185')] [2022-07-11 06:26:26,606][26022] Updated weights on worker 0-0, policy_version 1073610 (0.00086) [2022-07-11 06:26:28,564][26022] Updated weights on worker 0-0, policy_version 1073620 (0.00085) [2022-07-11 06:26:30,107][26022] Updated weights on worker 0-0, policy_version 1073630 (0.00079) [2022-07-11 06:26:31,354][25689] Fps is (10 sec: 5764.7, 60 sec: 5651.3, 300 sec: 5652.6). Total num frames: 1099403264. Throughput: 0: 5933.6. Samples: 1099406100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:31,354][25689] Avg episode reward: [(0, '-1.104')] [2022-07-11 06:26:32,061][26022] Updated weights on worker 0-0, policy_version 1073640 (0.00091) [2022-07-11 06:26:33,770][26022] Updated weights on worker 0-0, policy_version 1073650 (0.00084) [2022-07-11 06:26:35,602][26022] Updated weights on worker 0-0, policy_version 1073660 (0.00092) [2022-07-11 06:26:36,390][25689] Fps is (10 sec: 5558.9, 60 sec: 5631.8, 300 sec: 5649.0). Total num frames: 1099430912. Throughput: 0: 5938.3. Samples: 1099440518. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:36,392][25689] Avg episode reward: [(0, '0.059')] [2022-07-11 06:26:37,321][26022] Updated weights on worker 0-0, policy_version 1073670 (0.00083) [2022-07-11 06:26:39,348][26022] Updated weights on worker 0-0, policy_version 1073680 (0.00097) [2022-07-11 06:26:40,851][26022] Updated weights on worker 0-0, policy_version 1073690 (0.00090) [2022-07-11 06:26:41,419][25689] Fps is (10 sec: 5697.4, 60 sec: 5680.7, 300 sec: 5652.4). Total num frames: 1099460608. Throughput: 0: 5092.9. Samples: 1099457522. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:41,419][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 06:26:43,002][26022] Updated weights on worker 0-0, policy_version 1073700 (0.01031) [2022-07-11 06:26:44,507][26022] Updated weights on worker 0-0, policy_version 1073710 (0.00079) [2022-07-11 06:26:46,427][25689] Fps is (10 sec: 5611.2, 60 sec: 5600.9, 300 sec: 5647.1). Total num frames: 1099487232. Throughput: 0: 5931.8. Samples: 1099491732. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:46,429][25689] Avg episode reward: [(0, '0.179')] [2022-07-11 06:26:46,580][26022] Updated weights on worker 0-0, policy_version 1073720 (0.00088) [2022-07-11 06:26:48,285][26022] Updated weights on worker 0-0, policy_version 1073730 (0.00083) [2022-07-11 06:26:50,012][26022] Updated weights on worker 0-0, policy_version 1073740 (0.00083) [2022-07-11 06:26:51,456][25689] Fps is (10 sec: 5611.0, 60 sec: 5654.3, 300 sec: 5653.7). Total num frames: 1099516928. Throughput: 0: 5964.7. Samples: 1099526004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:51,458][25689] Avg episode reward: [(0, '-0.506')] [2022-07-11 06:26:51,725][26022] Updated weights on worker 0-0, policy_version 1073750 (0.00087) [2022-07-11 06:26:53,786][26022] Updated weights on worker 0-0, policy_version 1073760 (0.00088) [2022-07-11 06:26:55,279][26022] Updated weights on worker 0-0, policy_version 1073770 (0.00081) [2022-07-11 06:26:56,465][25689] Fps is (10 sec: 5814.8, 60 sec: 5659.7, 300 sec: 5650.2). Total num frames: 1099545600. Throughput: 0: 5101.5. Samples: 1099542930. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:26:56,467][25689] Avg episode reward: [(0, '1.113')] [2022-07-11 06:26:57,478][26022] Updated weights on worker 0-0, policy_version 1073780 (0.00082) [2022-07-11 06:26:58,735][26022] Updated weights on worker 0-0, policy_version 1073790 (0.00087) [2022-07-11 06:27:01,098][26022] Updated weights on worker 0-0, policy_version 1073800 (0.00090) [2022-07-11 06:27:01,521][25689] Fps is (10 sec: 5900.8, 60 sec: 5672.1, 300 sec: 5666.6). Total num frames: 1099576320. Throughput: 0: 5955.0. Samples: 1099577234. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:01,523][25689] Avg episode reward: [(0, '0.596')] [2022-07-11 06:27:02,847][26022] Updated weights on worker 0-0, policy_version 1073810 (0.00082) [2022-07-11 06:27:04,789][26022] Updated weights on worker 0-0, policy_version 1073820 (0.00082) [2022-07-11 06:27:06,534][25689] Fps is (10 sec: 5491.5, 60 sec: 5671.9, 300 sec: 5653.4). Total num frames: 1099600896. Throughput: 0: 5872.9. Samples: 1099609820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:06,536][25689] Avg episode reward: [(0, '0.018')] [2022-07-11 06:27:06,571][26022] Updated weights on worker 0-0, policy_version 1073830 (0.00056) [2022-07-11 06:27:08,410][26022] Updated weights on worker 0-0, policy_version 1073840 (0.00084) [2022-07-11 06:27:09,982][26022] Updated weights on worker 0-0, policy_version 1073850 (0.00091) [2022-07-11 06:27:11,638][25689] Fps is (10 sec: 5263.3, 60 sec: 5636.1, 300 sec: 5652.0). Total num frames: 1099629568. Throughput: 0: 4996.4. Samples: 1099626840. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:11,639][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 06:27:12,020][26022] Updated weights on worker 0-0, policy_version 1073860 (0.00079) [2022-07-11 06:27:13,524][26022] Updated weights on worker 0-0, policy_version 1073870 (0.00086) [2022-07-11 06:27:15,476][26022] Updated weights on worker 0-0, policy_version 1073880 (0.00087) [2022-07-11 06:27:16,640][25689] Fps is (10 sec: 5775.9, 60 sec: 5655.6, 300 sec: 5656.0). Total num frames: 1099659264. Throughput: 0: 5851.6. Samples: 1099660984. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:16,640][25689] Avg episode reward: [(0, '-1.029')] [2022-07-11 06:27:17,451][26022] Updated weights on worker 0-0, policy_version 1073890 (0.00084) [2022-07-11 06:27:19,105][26022] Updated weights on worker 0-0, policy_version 1073900 (0.00083) [2022-07-11 06:27:20,869][26022] Updated weights on worker 0-0, policy_version 1073910 (0.00081) [2022-07-11 06:27:21,648][25689] Fps is (10 sec: 5831.0, 60 sec: 5677.7, 300 sec: 5656.4). Total num frames: 1099687936. Throughput: 0: 5867.8. Samples: 1099695334. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:21,649][25689] Avg episode reward: [(0, '-1.128')] [2022-07-11 06:27:22,667][26022] Updated weights on worker 0-0, policy_version 1073920 (0.00632) [2022-07-11 06:27:24,226][26022] Updated weights on worker 0-0, policy_version 1073930 (0.00085) [2022-07-11 06:27:26,352][26022] Updated weights on worker 0-0, policy_version 1073940 (0.00090) [2022-07-11 06:27:26,662][25689] Fps is (10 sec: 5619.7, 60 sec: 5645.3, 300 sec: 5657.2). Total num frames: 1099715584. Throughput: 0: 5112.5. Samples: 1099712720. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:26,662][25689] Avg episode reward: [(0, '-0.823')] [2022-07-11 06:27:28,032][26022] Updated weights on worker 0-0, policy_version 1073950 (0.00078) [2022-07-11 06:27:29,936][26022] Updated weights on worker 0-0, policy_version 1073960 (0.00083) [2022-07-11 06:27:31,665][26022] Updated weights on worker 0-0, policy_version 1073970 (0.00092) [2022-07-11 06:27:31,789][25689] Fps is (10 sec: 5654.5, 60 sec: 5659.2, 300 sec: 5653.0). Total num frames: 1099745280. Throughput: 0: 5945.2. Samples: 1099746642. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:31,790][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 06:27:33,542][26022] Updated weights on worker 0-0, policy_version 1073980 (0.00088) [2022-07-11 06:27:35,344][26022] Updated weights on worker 0-0, policy_version 1073990 (0.00087) [2022-07-11 06:27:36,853][25689] Fps is (10 sec: 5727.3, 60 sec: 5673.6, 300 sec: 5659.9). Total num frames: 1099773952. Throughput: 0: 5926.4. Samples: 1099780774. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:36,853][25689] Avg episode reward: [(0, '-0.918')] [2022-07-11 06:27:37,126][26022] Updated weights on worker 0-0, policy_version 1074000 (0.00082) [2022-07-11 06:27:38,896][26022] Updated weights on worker 0-0, policy_version 1074010 (0.00105) [2022-07-11 06:27:40,766][26022] Updated weights on worker 0-0, policy_version 1074020 (0.00080) [2022-07-11 06:27:41,386][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:27:41,397][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001074023_1099799552.pth [2022-07-11 06:27:41,398][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001072035_1097763840.pth [2022-07-11 06:27:41,886][25689] Fps is (10 sec: 5780.9, 60 sec: 5673.1, 300 sec: 5663.1). Total num frames: 1099803648. Throughput: 0: 5073.3. Samples: 1099798008. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:41,887][25689] Avg episode reward: [(0, '0.128')] [2022-07-11 06:27:42,406][26022] Updated weights on worker 0-0, policy_version 1074030 (0.00080) [2022-07-11 06:27:44,301][26022] Updated weights on worker 0-0, policy_version 1074040 (0.00078) [2022-07-11 06:27:45,971][26022] Updated weights on worker 0-0, policy_version 1074050 (0.00093) [2022-07-11 06:27:46,902][25689] Fps is (10 sec: 5706.7, 60 sec: 5689.4, 300 sec: 5654.2). Total num frames: 1099831296. Throughput: 0: 5903.0. Samples: 1099832196. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:46,902][25689] Avg episode reward: [(0, '0.837')] [2022-07-11 06:27:47,860][26022] Updated weights on worker 0-0, policy_version 1074060 (0.00093) [2022-07-11 06:27:49,534][26022] Updated weights on worker 0-0, policy_version 1074070 (0.00085) [2022-07-11 06:27:51,637][26022] Updated weights on worker 0-0, policy_version 1074080 (0.00094) [2022-07-11 06:27:52,027][25689] Fps is (10 sec: 5654.9, 60 sec: 5680.4, 300 sec: 5658.7). Total num frames: 1099860992. Throughput: 0: 5935.6. Samples: 1099866764. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:52,027][25689] Avg episode reward: [(0, '0.826')] [2022-07-11 06:27:53,041][26022] Updated weights on worker 0-0, policy_version 1074090 (0.00093) [2022-07-11 06:27:55,103][26022] Updated weights on worker 0-0, policy_version 1074100 (0.00086) [2022-07-11 06:27:56,659][26022] Updated weights on worker 0-0, policy_version 1074110 (0.00094) [2022-07-11 06:27:57,041][25689] Fps is (10 sec: 5756.6, 60 sec: 5679.9, 300 sec: 5662.1). Total num frames: 1099889664. Throughput: 0: 5102.5. Samples: 1099883782. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:27:57,041][25689] Avg episode reward: [(0, '0.821')] [2022-07-11 06:27:58,703][26022] Updated weights on worker 0-0, policy_version 1074120 (0.00081) [2022-07-11 06:28:00,390][26022] Updated weights on worker 0-0, policy_version 1074130 (0.00744) [2022-07-11 06:28:02,075][25689] Fps is (10 sec: 5401.3, 60 sec: 5597.5, 300 sec: 5655.1). Total num frames: 1099915264. Throughput: 0: 5932.4. Samples: 1099917774. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:28:02,075][25689] Avg episode reward: [(0, '0.125')] [2022-07-11 06:28:02,660][26022] Updated weights on worker 0-0, policy_version 1074140 (0.00128) [2022-07-11 06:28:04,338][26022] Updated weights on worker 0-0, policy_version 1074150 (0.00081) [2022-07-11 06:28:06,363][26022] Updated weights on worker 0-0, policy_version 1074160 (0.00093) [2022-07-11 06:28:07,094][25689] Fps is (10 sec: 5500.2, 60 sec: 5681.4, 300 sec: 5659.7). Total num frames: 1099944960. Throughput: 0: 5825.1. Samples: 1099949822. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:28:07,095][25689] Avg episode reward: [(0, '0.114')] [2022-07-11 06:28:08,001][26022] Updated weights on worker 0-0, policy_version 1074170 (0.00093) [2022-07-11 06:28:09,995][26022] Updated weights on worker 0-0, policy_version 1074180 (0.00082) [2022-07-11 06:28:11,807][26022] Updated weights on worker 0-0, policy_version 1074190 (0.00081) [2022-07-11 06:28:12,162][25689] Fps is (10 sec: 5684.5, 60 sec: 5667.8, 300 sec: 5659.4). Total num frames: 1099972608. Throughput: 0: 4971.1. Samples: 1099966860. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:28:12,163][25689] Avg episode reward: [(0, '-0.405')] [2022-07-11 06:28:13,574][26022] Updated weights on worker 0-0, policy_version 1074200 (0.00083) [2022-07-11 06:28:15,418][26022] Updated weights on worker 0-0, policy_version 1074210 (0.00081) [2022-07-11 06:28:17,155][26022] Updated weights on worker 0-0, policy_version 1074220 (0.00088) [2022-07-11 06:28:17,248][25689] Fps is (10 sec: 5546.6, 60 sec: 5643.0, 300 sec: 5654.6). Total num frames: 1100001280. Throughput: 0: 5798.7. Samples: 1100000958. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:28:17,249][25689] Avg episode reward: [(0, '-0.844')] [2022-07-11 06:28:18,858][26022] Updated weights on worker 0-0, policy_version 1074230 (0.00085) [2022-07-11 06:28:20,813][26022] Updated weights on worker 0-0, policy_version 1074240 (0.00083) [2022-07-11 06:28:22,286][25689] Fps is (10 sec: 5765.5, 60 sec: 5657.2, 300 sec: 5657.5). Total num frames: 1100030976. Throughput: 0: 5812.4. Samples: 1100035250. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:28:22,288][25689] Avg episode reward: [(0, '-1.067')] [2022-07-11 06:28:22,352][26022] Updated weights on worker 0-0, policy_version 1074250 (0.00091) [2022-07-11 06:28:24,364][26022] Updated weights on worker 0-0, policy_version 1074260 (0.00081) [2022-07-11 06:28:25,965][26022] Updated weights on worker 0-0, policy_version 1074270 (0.00086) [2022-07-11 06:28:27,302][25689] Fps is (10 sec: 5703.7, 60 sec: 5657.0, 300 sec: 5658.9). Total num frames: 1100058624. Throughput: 0: 5082.4. Samples: 1100052526. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 06:28:27,304][25689] Avg episode reward: [(0, '-0.858')] [2022-07-11 06:28:27,998][26022] Updated weights on worker 0-0, policy_version 1074280 (0.00085) [2022-07-11 06:28:29,575][26022] Updated weights on worker 0-0, policy_version 1074290 (0.00082) [2022-07-11 06:28:31,727][26022] Updated weights on worker 0-0, policy_version 1074300 (0.00081) [2022-07-11 06:28:32,375][25689] Fps is (10 sec: 5683.9, 60 sec: 5662.1, 300 sec: 5658.5). Total num frames: 1100088320. Throughput: 0: 5912.9. Samples: 1100086376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:28:32,376][25689] Avg episode reward: [(0, '-1.318')] [2022-07-11 06:28:33,248][26022] Updated weights on worker 0-0, policy_version 1074310 (0.00609) [2022-07-11 06:28:35,199][26022] Updated weights on worker 0-0, policy_version 1074320 (0.00086) [2022-07-11 06:28:36,939][26022] Updated weights on worker 0-0, policy_version 1074330 (0.00084) [2022-07-11 06:28:37,391][25689] Fps is (10 sec: 5683.8, 60 sec: 5649.6, 300 sec: 5655.3). Total num frames: 1100115968. Throughput: 0: 5941.8. Samples: 1100120644. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:28:37,392][25689] Avg episode reward: [(0, '-1.472')] [2022-07-11 06:28:38,818][26022] Updated weights on worker 0-0, policy_version 1074340 (0.00084) [2022-07-11 06:28:40,618][26022] Updated weights on worker 0-0, policy_version 1074350 (0.00085) [2022-07-11 06:28:42,281][26022] Updated weights on worker 0-0, policy_version 1074360 (0.00082) [2022-07-11 06:28:42,410][25689] Fps is (10 sec: 5612.3, 60 sec: 5634.0, 300 sec: 5658.6). Total num frames: 1100144640. Throughput: 0: 5086.5. Samples: 1100137610. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:28:42,411][25689] Avg episode reward: [(0, '0.762')] [2022-07-11 06:28:44,459][26022] Updated weights on worker 0-0, policy_version 1074370 (0.00085) [2022-07-11 06:28:45,907][26022] Updated weights on worker 0-0, policy_version 1074380 (0.00086) [2022-07-11 06:28:47,438][25689] Fps is (10 sec: 5707.7, 60 sec: 5649.8, 300 sec: 5656.9). Total num frames: 1100173312. Throughput: 0: 5906.5. Samples: 1100171458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:28:47,439][25689] Avg episode reward: [(0, '0.909')] [2022-07-11 06:28:48,077][26022] Updated weights on worker 0-0, policy_version 1074390 (0.00092) [2022-07-11 06:28:49,438][26022] Updated weights on worker 0-0, policy_version 1074400 (0.01018) [2022-07-11 06:28:51,761][26022] Updated weights on worker 0-0, policy_version 1074410 (0.00094) [2022-07-11 06:28:52,512][25689] Fps is (10 sec: 5575.4, 60 sec: 5620.7, 300 sec: 5656.2). Total num frames: 1100200960. Throughput: 0: 5901.3. Samples: 1100205208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:28:52,512][25689] Avg episode reward: [(0, '1.724')] [2022-07-11 06:28:53,129][26022] Updated weights on worker 0-0, policy_version 1074420 (0.00081) [2022-07-11 06:28:55,166][26022] Updated weights on worker 0-0, policy_version 1074430 (0.00094) [2022-07-11 06:28:56,750][26022] Updated weights on worker 0-0, policy_version 1074440 (0.00085) [2022-07-11 06:28:57,530][25689] Fps is (10 sec: 5682.1, 60 sec: 5637.3, 300 sec: 5656.2). Total num frames: 1100230656. Throughput: 0: 5903.9. Samples: 1100239542. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:28:57,530][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 06:28:58,656][26022] Updated weights on worker 0-0, policy_version 1074450 (0.00095) [2022-07-11 06:29:00,421][26022] Updated weights on worker 0-0, policy_version 1074460 (0.00084) [2022-07-11 06:29:02,555][25689] Fps is (10 sec: 5403.7, 60 sec: 5621.1, 300 sec: 5652.4). Total num frames: 1100255232. Throughput: 0: 5909.3. Samples: 1100256654. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:02,556][25689] Avg episode reward: [(0, '-0.549')] [2022-07-11 06:29:02,694][26022] Updated weights on worker 0-0, policy_version 1074470 (0.00095) [2022-07-11 06:29:04,407][26022] Updated weights on worker 0-0, policy_version 1074480 (0.00085) [2022-07-11 06:29:06,092][26022] Updated weights on worker 0-0, policy_version 1074490 (0.00088) [2022-07-11 06:29:07,557][25689] Fps is (10 sec: 5412.4, 60 sec: 5622.8, 300 sec: 5657.1). Total num frames: 1100284928. Throughput: 0: 5831.8. Samples: 1100288790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:07,558][25689] Avg episode reward: [(0, '-1.063')] [2022-07-11 06:29:07,997][26022] Updated weights on worker 0-0, policy_version 1074500 (0.00092) [2022-07-11 06:29:09,777][26022] Updated weights on worker 0-0, policy_version 1074510 (0.00093) [2022-07-11 06:29:11,603][26022] Updated weights on worker 0-0, policy_version 1074520 (0.00095) [2022-07-11 06:29:12,699][25689] Fps is (10 sec: 5855.2, 60 sec: 5649.8, 300 sec: 5658.0). Total num frames: 1100314624. Throughput: 0: 5836.3. Samples: 1100323026. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:12,699][25689] Avg episode reward: [(0, '-1.326')] [2022-07-11 06:29:13,344][26022] Updated weights on worker 0-0, policy_version 1074530 (0.00094) [2022-07-11 06:29:15,125][26022] Updated weights on worker 0-0, policy_version 1074540 (0.00080) [2022-07-11 06:29:16,904][26022] Updated weights on worker 0-0, policy_version 1074550 (0.00084) [2022-07-11 06:29:17,790][25689] Fps is (10 sec: 5603.9, 60 sec: 5632.3, 300 sec: 5653.1). Total num frames: 1100342272. Throughput: 0: 4951.3. Samples: 1100339856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:17,792][25689] Avg episode reward: [(0, '-1.378')] [2022-07-11 06:29:18,681][26022] Updated weights on worker 0-0, policy_version 1074560 (0.00088) [2022-07-11 06:29:20,455][26022] Updated weights on worker 0-0, policy_version 1074570 (0.00078) [2022-07-11 06:29:22,392][26022] Updated weights on worker 0-0, policy_version 1074580 (0.00085) [2022-07-11 06:29:22,803][25689] Fps is (10 sec: 5675.3, 60 sec: 5634.7, 300 sec: 5653.0). Total num frames: 1100371968. Throughput: 0: 5807.6. Samples: 1100374244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:22,804][25689] Avg episode reward: [(0, '-0.425')] [2022-07-11 06:29:24,375][26022] Updated weights on worker 0-0, policy_version 1074590 (0.00086) [2022-07-11 06:29:25,979][26022] Updated weights on worker 0-0, policy_version 1074600 (0.00091) [2022-07-11 06:29:27,861][25689] Fps is (10 sec: 5693.8, 60 sec: 5630.7, 300 sec: 5653.6). Total num frames: 1100399616. Throughput: 0: 5873.9. Samples: 1100408054. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:27,864][25689] Avg episode reward: [(0, '0.560')] [2022-07-11 06:29:27,975][26022] Updated weights on worker 0-0, policy_version 1074610 (0.00092) [2022-07-11 06:29:29,451][26022] Updated weights on worker 0-0, policy_version 1074620 (0.00096) [2022-07-11 06:29:31,723][26022] Updated weights on worker 0-0, policy_version 1074630 (0.00078) [2022-07-11 06:29:32,927][25689] Fps is (10 sec: 5562.9, 60 sec: 5614.5, 300 sec: 5652.6). Total num frames: 1100428288. Throughput: 0: 5042.1. Samples: 1100425018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:32,929][25689] Avg episode reward: [(0, '0.882')] [2022-07-11 06:29:33,189][26022] Updated weights on worker 0-0, policy_version 1074640 (0.00089) [2022-07-11 06:29:35,203][26022] Updated weights on worker 0-0, policy_version 1074650 (0.00083) [2022-07-11 06:29:36,696][26022] Updated weights on worker 0-0, policy_version 1074660 (0.00092) [2022-07-11 06:29:37,939][25689] Fps is (10 sec: 5588.4, 60 sec: 5614.8, 300 sec: 5645.9). Total num frames: 1100455936. Throughput: 0: 5926.1. Samples: 1100459262. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:37,940][25689] Avg episode reward: [(0, '0.925')] [2022-07-11 06:29:38,817][26022] Updated weights on worker 0-0, policy_version 1074670 (0.00082) [2022-07-11 06:29:40,221][26022] Updated weights on worker 0-0, policy_version 1074680 (0.00090) [2022-07-11 06:29:41,460][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:29:41,470][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001074685_1100477440.pth [2022-07-11 06:29:41,472][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001072697_1098441728.pth [2022-07-11 06:29:42,412][26022] Updated weights on worker 0-0, policy_version 1074690 (0.00095) [2022-07-11 06:29:43,019][25689] Fps is (10 sec: 5783.3, 60 sec: 5643.0, 300 sec: 5655.2). Total num frames: 1100486656. Throughput: 0: 5881.5. Samples: 1100493148. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:43,020][25689] Avg episode reward: [(0, '0.959')] [2022-07-11 06:29:44,079][26022] Updated weights on worker 0-0, policy_version 1074700 (0.00086) [2022-07-11 06:29:45,827][26022] Updated weights on worker 0-0, policy_version 1074710 (0.00089) [2022-07-11 06:29:47,731][26022] Updated weights on worker 0-0, policy_version 1074720 (0.00087) [2022-07-11 06:29:48,079][25689] Fps is (10 sec: 5857.6, 60 sec: 5640.0, 300 sec: 5650.0). Total num frames: 1100515328. Throughput: 0: 5059.0. Samples: 1100510332. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:48,079][25689] Avg episode reward: [(0, '1.158')] [2022-07-11 06:29:49,523][26022] Updated weights on worker 0-0, policy_version 1074730 (0.00083) [2022-07-11 06:29:51,239][26022] Updated weights on worker 0-0, policy_version 1074740 (0.00083) [2022-07-11 06:29:53,155][25689] Fps is (10 sec: 5657.9, 60 sec: 5656.7, 300 sec: 5645.4). Total num frames: 1100544000. Throughput: 0: 5896.6. Samples: 1100544294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:53,155][25689] Avg episode reward: [(0, '0.903')] [2022-07-11 06:29:53,161][26022] Updated weights on worker 0-0, policy_version 1074750 (0.00087) [2022-07-11 06:29:54,856][26022] Updated weights on worker 0-0, policy_version 1074760 (0.00088) [2022-07-11 06:29:56,721][26022] Updated weights on worker 0-0, policy_version 1074770 (0.00092) [2022-07-11 06:29:58,163][25689] Fps is (10 sec: 5686.5, 60 sec: 5640.8, 300 sec: 5657.1). Total num frames: 1100572672. Throughput: 0: 5913.8. Samples: 1100578860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:29:58,164][25689] Avg episode reward: [(0, '1.684')] [2022-07-11 06:29:58,316][26022] Updated weights on worker 0-0, policy_version 1074780 (0.00082) [2022-07-11 06:30:00,455][26022] Updated weights on worker 0-0, policy_version 1074790 (0.00083) [2022-07-11 06:30:02,338][26022] Updated weights on worker 0-0, policy_version 1074800 (0.00087) [2022-07-11 06:30:03,232][25689] Fps is (10 sec: 5487.4, 60 sec: 5670.5, 300 sec: 5647.0). Total num frames: 1100599296. Throughput: 0: 5085.7. Samples: 1100595942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:03,234][25689] Avg episode reward: [(0, '1.760')] [2022-07-11 06:30:04,366][26022] Updated weights on worker 0-0, policy_version 1074810 (0.00097) [2022-07-11 06:30:06,079][26022] Updated weights on worker 0-0, policy_version 1074820 (0.00085) [2022-07-11 06:30:07,867][26022] Updated weights on worker 0-0, policy_version 1074830 (0.00088) [2022-07-11 06:30:08,315][25689] Fps is (10 sec: 5547.3, 60 sec: 5662.8, 300 sec: 5649.9). Total num frames: 1100628992. Throughput: 0: 5822.2. Samples: 1100628154. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:08,317][25689] Avg episode reward: [(0, '1.412')] [2022-07-11 06:30:09,808][26022] Updated weights on worker 0-0, policy_version 1074840 (0.00556) [2022-07-11 06:30:11,432][26022] Updated weights on worker 0-0, policy_version 1074850 (0.00089) [2022-07-11 06:30:13,259][26022] Updated weights on worker 0-0, policy_version 1074860 (0.00090) [2022-07-11 06:30:13,354][25689] Fps is (10 sec: 5665.0, 60 sec: 5638.7, 300 sec: 5649.9). Total num frames: 1100656640. Throughput: 0: 5838.3. Samples: 1100662224. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:13,355][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 06:30:15,078][26022] Updated weights on worker 0-0, policy_version 1074870 (0.00904) [2022-07-11 06:30:16,987][26022] Updated weights on worker 0-0, policy_version 1074880 (0.00090) [2022-07-11 06:30:18,360][25689] Fps is (10 sec: 5606.9, 60 sec: 5663.5, 300 sec: 5650.2). Total num frames: 1100685312. Throughput: 0: 4950.9. Samples: 1100678856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:18,361][25689] Avg episode reward: [(0, '0.671')] [2022-07-11 06:30:18,824][26022] Updated weights on worker 0-0, policy_version 1074890 (0.00083) [2022-07-11 06:30:20,386][26022] Updated weights on worker 0-0, policy_version 1074900 (0.00085) [2022-07-11 06:30:22,430][26022] Updated weights on worker 0-0, policy_version 1074910 (0.00094) [2022-07-11 06:30:23,407][25689] Fps is (10 sec: 5704.4, 60 sec: 5643.4, 300 sec: 5653.0). Total num frames: 1100713984. Throughput: 0: 5814.7. Samples: 1100713254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:23,408][25689] Avg episode reward: [(0, '1.081')] [2022-07-11 06:30:24,031][26022] Updated weights on worker 0-0, policy_version 1074920 (0.00083) [2022-07-11 06:30:25,952][26022] Updated weights on worker 0-0, policy_version 1074930 (0.00087) [2022-07-11 06:30:27,615][26022] Updated weights on worker 0-0, policy_version 1074940 (0.00088) [2022-07-11 06:30:28,443][25689] Fps is (10 sec: 5585.6, 60 sec: 5645.5, 300 sec: 5646.3). Total num frames: 1100741632. Throughput: 0: 5925.4. Samples: 1100747420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:28,445][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 06:30:29,365][26022] Updated weights on worker 0-0, policy_version 1074950 (0.00086) [2022-07-11 06:30:31,424][26022] Updated weights on worker 0-0, policy_version 1074960 (0.00115) [2022-07-11 06:30:33,175][26022] Updated weights on worker 0-0, policy_version 1074970 (0.00086) [2022-07-11 06:30:33,501][25689] Fps is (10 sec: 5680.7, 60 sec: 5663.1, 300 sec: 5648.8). Total num frames: 1100771328. Throughput: 0: 5063.0. Samples: 1100764230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:33,502][25689] Avg episode reward: [(0, '1.135')] [2022-07-11 06:30:35,134][26022] Updated weights on worker 0-0, policy_version 1074980 (0.00083) [2022-07-11 06:30:36,809][26022] Updated weights on worker 0-0, policy_version 1074990 (0.00081) [2022-07-11 06:30:38,559][25689] Fps is (10 sec: 5668.5, 60 sec: 5658.8, 300 sec: 5651.3). Total num frames: 1100798976. Throughput: 0: 5901.4. Samples: 1100798060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:38,561][25689] Avg episode reward: [(0, '1.564')] [2022-07-11 06:30:38,697][26022] Updated weights on worker 0-0, policy_version 1075000 (0.00087) [2022-07-11 06:30:40,473][26022] Updated weights on worker 0-0, policy_version 1075010 (0.00083) [2022-07-11 06:30:42,265][26022] Updated weights on worker 0-0, policy_version 1075020 (0.00082) [2022-07-11 06:30:43,639][25689] Fps is (10 sec: 5555.6, 60 sec: 5625.1, 300 sec: 5640.6). Total num frames: 1100827648. Throughput: 0: 5881.5. Samples: 1100832250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:43,639][25689] Avg episode reward: [(0, '1.579')] [2022-07-11 06:30:44,079][26022] Updated weights on worker 0-0, policy_version 1075030 (0.00087) [2022-07-11 06:30:45,891][26022] Updated weights on worker 0-0, policy_version 1075040 (0.00093) [2022-07-11 06:30:47,639][26022] Updated weights on worker 0-0, policy_version 1075050 (0.00080) [2022-07-11 06:30:48,656][25689] Fps is (10 sec: 5780.9, 60 sec: 5645.9, 300 sec: 5651.7). Total num frames: 1100857344. Throughput: 0: 5038.3. Samples: 1100849258. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:48,658][25689] Avg episode reward: [(0, '1.481')] [2022-07-11 06:30:49,483][26022] Updated weights on worker 0-0, policy_version 1075060 (0.00085) [2022-07-11 06:30:51,227][26022] Updated weights on worker 0-0, policy_version 1075070 (0.00086) [2022-07-11 06:30:53,123][26022] Updated weights on worker 0-0, policy_version 1075080 (0.00092) [2022-07-11 06:30:53,709][25689] Fps is (10 sec: 5694.5, 60 sec: 5631.2, 300 sec: 5648.5). Total num frames: 1100884992. Throughput: 0: 5889.5. Samples: 1100883244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:53,710][25689] Avg episode reward: [(0, '0.072')] [2022-07-11 06:30:55,022][26022] Updated weights on worker 0-0, policy_version 1075090 (0.00082) [2022-07-11 06:30:56,754][26022] Updated weights on worker 0-0, policy_version 1075100 (0.00092) [2022-07-11 06:30:58,394][26022] Updated weights on worker 0-0, policy_version 1075110 (0.00080) [2022-07-11 06:30:58,723][25689] Fps is (10 sec: 5696.3, 60 sec: 5647.5, 300 sec: 5648.4). Total num frames: 1100914688. Throughput: 0: 5941.2. Samples: 1100917856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:30:58,723][25689] Avg episode reward: [(0, '0.022')] [2022-07-11 06:31:00,219][26022] Updated weights on worker 0-0, policy_version 1075120 (0.00087) [2022-07-11 06:31:02,015][26022] Updated weights on worker 0-0, policy_version 1075130 (0.00084) [2022-07-11 06:31:03,799][25689] Fps is (10 sec: 5480.4, 60 sec: 5630.0, 300 sec: 5650.6). Total num frames: 1100940288. Throughput: 0: 5852.9. Samples: 1100950244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:03,799][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 06:31:04,222][26022] Updated weights on worker 0-0, policy_version 1075140 (0.00086) [2022-07-11 06:31:05,938][26022] Updated weights on worker 0-0, policy_version 1075150 (0.00088) [2022-07-11 06:31:07,571][26022] Updated weights on worker 0-0, policy_version 1075160 (0.00084) [2022-07-11 06:31:08,886][25689] Fps is (10 sec: 5440.6, 60 sec: 5629.6, 300 sec: 5647.0). Total num frames: 1100969984. Throughput: 0: 5843.2. Samples: 1100967468. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:08,887][25689] Avg episode reward: [(0, '-0.464')] [2022-07-11 06:31:09,619][26022] Updated weights on worker 0-0, policy_version 1075170 (0.01559) [2022-07-11 06:31:11,242][26022] Updated weights on worker 0-0, policy_version 1075180 (0.00084) [2022-07-11 06:31:13,070][26022] Updated weights on worker 0-0, policy_version 1075190 (0.00084) [2022-07-11 06:31:13,935][25689] Fps is (10 sec: 5960.4, 60 sec: 5679.4, 300 sec: 5653.6). Total num frames: 1101000704. Throughput: 0: 5874.7. Samples: 1101002066. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:13,936][25689] Avg episode reward: [(0, '-0.665')] [2022-07-11 06:31:14,912][26022] Updated weights on worker 0-0, policy_version 1075200 (0.00079) [2022-07-11 06:31:16,527][26022] Updated weights on worker 0-0, policy_version 1075210 (0.00099) [2022-07-11 06:31:18,447][26022] Updated weights on worker 0-0, policy_version 1075220 (0.00422) [2022-07-11 06:31:18,950][25689] Fps is (10 sec: 5698.0, 60 sec: 5644.7, 300 sec: 5651.0). Total num frames: 1101027328. Throughput: 0: 5855.1. Samples: 1101036290. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:18,951][25689] Avg episode reward: [(0, '0.009')] [2022-07-11 06:31:20,123][26022] Updated weights on worker 0-0, policy_version 1075230 (0.00081) [2022-07-11 06:31:21,904][26022] Updated weights on worker 0-0, policy_version 1075240 (0.00087) [2022-07-11 06:31:23,861][26022] Updated weights on worker 0-0, policy_version 1075250 (0.00084) [2022-07-11 06:31:23,959][25689] Fps is (10 sec: 5516.2, 60 sec: 5648.3, 300 sec: 5648.0). Total num frames: 1101056000. Throughput: 0: 5115.7. Samples: 1101053378. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:23,960][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 06:31:25,739][26022] Updated weights on worker 0-0, policy_version 1075260 (0.00090) [2022-07-11 06:31:27,306][26022] Updated weights on worker 0-0, policy_version 1075270 (0.00086) [2022-07-11 06:31:28,993][25689] Fps is (10 sec: 5812.1, 60 sec: 5682.3, 300 sec: 5652.6). Total num frames: 1101085696. Throughput: 0: 5994.0. Samples: 1101087986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:28,994][25689] Avg episode reward: [(0, '0.334')] [2022-07-11 06:31:29,254][26022] Updated weights on worker 0-0, policy_version 1075280 (0.00082) [2022-07-11 06:31:30,911][26022] Updated weights on worker 0-0, policy_version 1075290 (0.00084) [2022-07-11 06:31:32,809][26022] Updated weights on worker 0-0, policy_version 1075300 (0.00084) [2022-07-11 06:31:34,054][25689] Fps is (10 sec: 5782.1, 60 sec: 5665.2, 300 sec: 5655.6). Total num frames: 1101114368. Throughput: 0: 5977.3. Samples: 1101122322. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:34,055][25689] Avg episode reward: [(0, '0.156')] [2022-07-11 06:31:34,578][26022] Updated weights on worker 0-0, policy_version 1075310 (0.00091) [2022-07-11 06:31:36,128][26022] Updated weights on worker 0-0, policy_version 1075320 (0.00085) [2022-07-11 06:31:38,170][26022] Updated weights on worker 0-0, policy_version 1075330 (0.00083) [2022-07-11 06:31:39,057][25689] Fps is (10 sec: 5799.9, 60 sec: 5704.2, 300 sec: 5656.1). Total num frames: 1101144064. Throughput: 0: 5148.5. Samples: 1101139806. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:39,057][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 06:31:39,759][26022] Updated weights on worker 0-0, policy_version 1075340 (0.00091) [2022-07-11 06:31:41,628][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:31:41,637][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001075350_1101158400.pth [2022-07-11 06:31:41,637][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001073360_1099120640.pth [2022-07-11 06:31:41,642][26022] Updated weights on worker 0-0, policy_version 1075350 (0.00091) [2022-07-11 06:31:43,400][26022] Updated weights on worker 0-0, policy_version 1075360 (0.00086) [2022-07-11 06:31:44,070][25689] Fps is (10 sec: 5623.1, 60 sec: 5676.6, 300 sec: 5656.0). Total num frames: 1101170688. Throughput: 0: 5995.5. Samples: 1101173950. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:44,070][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 06:31:45,203][26022] Updated weights on worker 0-0, policy_version 1075370 (0.00089) [2022-07-11 06:31:47,295][26022] Updated weights on worker 0-0, policy_version 1075380 (0.00087) [2022-07-11 06:31:48,750][26022] Updated weights on worker 0-0, policy_version 1075390 (0.00091) [2022-07-11 06:31:49,105][25689] Fps is (10 sec: 5706.9, 60 sec: 5691.8, 300 sec: 5659.3). Total num frames: 1101201408. Throughput: 0: 5968.8. Samples: 1101208030. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:49,105][25689] Avg episode reward: [(0, '-1.322')] [2022-07-11 06:31:50,691][26022] Updated weights on worker 0-0, policy_version 1075400 (0.00091) [2022-07-11 06:31:52,370][26022] Updated weights on worker 0-0, policy_version 1075410 (0.00086) [2022-07-11 06:31:54,169][25689] Fps is (10 sec: 5678.0, 60 sec: 5673.9, 300 sec: 5651.4). Total num frames: 1101228032. Throughput: 0: 5112.1. Samples: 1101225152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:54,170][25689] Avg episode reward: [(0, '-2.298')] [2022-07-11 06:31:54,241][26022] Updated weights on worker 0-0, policy_version 1075420 (0.00088) [2022-07-11 06:31:56,142][26022] Updated weights on worker 0-0, policy_version 1075430 (0.00090) [2022-07-11 06:31:57,810][26022] Updated weights on worker 0-0, policy_version 1075440 (0.00087) [2022-07-11 06:31:59,250][25689] Fps is (10 sec: 5551.3, 60 sec: 5667.5, 300 sec: 5647.5). Total num frames: 1101257728. Throughput: 0: 5930.5. Samples: 1101259564. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:31:59,252][25689] Avg episode reward: [(0, '-2.098')] [2022-07-11 06:31:59,710][26022] Updated weights on worker 0-0, policy_version 1075450 (0.00086) [2022-07-11 06:32:01,335][26022] Updated weights on worker 0-0, policy_version 1075460 (0.00085) [2022-07-11 06:32:03,562][26022] Updated weights on worker 0-0, policy_version 1075470 (0.00088) [2022-07-11 06:32:04,297][25689] Fps is (10 sec: 5662.0, 60 sec: 5704.1, 300 sec: 5657.1). Total num frames: 1101285376. Throughput: 0: 5828.5. Samples: 1101291844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:32:04,299][25689] Avg episode reward: [(0, '-1.967')] [2022-07-11 06:32:05,314][26022] Updated weights on worker 0-0, policy_version 1075480 (0.00087) [2022-07-11 06:32:07,048][26022] Updated weights on worker 0-0, policy_version 1075490 (0.00083) [2022-07-11 06:32:08,903][26022] Updated weights on worker 0-0, policy_version 1075500 (0.00085) [2022-07-11 06:32:09,305][25689] Fps is (10 sec: 5499.2, 60 sec: 5677.7, 300 sec: 5655.5). Total num frames: 1101313024. Throughput: 0: 5008.8. Samples: 1101309210. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:32:09,307][25689] Avg episode reward: [(0, '-1.665')] [2022-07-11 06:32:10,821][26022] Updated weights on worker 0-0, policy_version 1075510 (0.00086) [2022-07-11 06:32:12,374][26022] Updated weights on worker 0-0, policy_version 1075520 (0.00079) [2022-07-11 06:32:14,363][25689] Fps is (10 sec: 5594.9, 60 sec: 5642.9, 300 sec: 5651.0). Total num frames: 1101341696. Throughput: 0: 5870.1. Samples: 1101343694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:32:14,364][25689] Avg episode reward: [(0, '-0.774')] [2022-07-11 06:32:14,445][26022] Updated weights on worker 0-0, policy_version 1075530 (0.00085) [2022-07-11 06:32:15,894][26022] Updated weights on worker 0-0, policy_version 1075540 (0.00087) [2022-07-11 06:32:17,954][26022] Updated weights on worker 0-0, policy_version 1075550 (0.00084) [2022-07-11 06:32:19,375][25689] Fps is (10 sec: 5694.8, 60 sec: 5677.2, 300 sec: 5650.9). Total num frames: 1101370368. Throughput: 0: 5872.8. Samples: 1101377752. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:32:19,375][25689] Avg episode reward: [(0, '-0.754')] [2022-07-11 06:32:19,646][26022] Updated weights on worker 0-0, policy_version 1075560 (0.00083) [2022-07-11 06:32:21,443][26022] Updated weights on worker 0-0, policy_version 1075570 (0.00086) [2022-07-11 06:32:23,214][26022] Updated weights on worker 0-0, policy_version 1075580 (0.00100) [2022-07-11 06:32:24,387][25689] Fps is (10 sec: 5823.0, 60 sec: 5693.8, 300 sec: 5657.8). Total num frames: 1101400064. Throughput: 0: 5140.2. Samples: 1101395110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:32:24,387][25689] Avg episode reward: [(0, '-1.825')] [2022-07-11 06:32:25,096][26022] Updated weights on worker 0-0, policy_version 1075590 (0.00087) [2022-07-11 06:32:26,831][26022] Updated weights on worker 0-0, policy_version 1075600 (0.00082) [2022-07-11 06:32:28,730][26022] Updated weights on worker 0-0, policy_version 1075610 (0.00080) [2022-07-11 06:32:29,391][25689] Fps is (10 sec: 5725.2, 60 sec: 5662.7, 300 sec: 5653.3). Total num frames: 1101427712. Throughput: 0: 5986.7. Samples: 1101429456. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 06:32:29,391][25689] Avg episode reward: [(0, '-1.934')] [2022-07-11 06:32:30,378][26022] Updated weights on worker 0-0, policy_version 1075620 (0.00076) [2022-07-11 06:32:32,231][26022] Updated weights on worker 0-0, policy_version 1075630 (0.00083) [2022-07-11 06:32:33,820][26022] Updated weights on worker 0-0, policy_version 1075640 (0.00086) [2022-07-11 06:32:34,479][25689] Fps is (10 sec: 5681.7, 60 sec: 5677.1, 300 sec: 5656.3). Total num frames: 1101457408. Throughput: 0: 5988.1. Samples: 1101464154. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:32:34,480][25689] Avg episode reward: [(0, '-2.106')] [2022-07-11 06:32:35,704][26022] Updated weights on worker 0-0, policy_version 1075650 (0.00092) [2022-07-11 06:32:37,615][26022] Updated weights on worker 0-0, policy_version 1075660 (0.00079) [2022-07-11 06:32:39,282][26022] Updated weights on worker 0-0, policy_version 1075670 (0.00081) [2022-07-11 06:32:39,483][25689] Fps is (10 sec: 5885.0, 60 sec: 5677.0, 300 sec: 5656.8). Total num frames: 1101487104. Throughput: 0: 5148.1. Samples: 1101481270. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:32:39,483][25689] Avg episode reward: [(0, '-1.844')] [2022-07-11 06:32:41,164][26022] Updated weights on worker 0-0, policy_version 1075680 (0.00089) [2022-07-11 06:32:43,143][26022] Updated weights on worker 0-0, policy_version 1075690 (0.00095) [2022-07-11 06:32:44,525][25689] Fps is (10 sec: 5810.1, 60 sec: 5708.1, 300 sec: 5659.8). Total num frames: 1101515776. Throughput: 0: 5975.6. Samples: 1101515452. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:32:44,526][25689] Avg episode reward: [(0, '-1.289')] [2022-07-11 06:32:44,727][26022] Updated weights on worker 0-0, policy_version 1075700 (0.00079) [2022-07-11 06:32:46,570][26022] Updated weights on worker 0-0, policy_version 1075710 (0.00090) [2022-07-11 06:32:48,451][26022] Updated weights on worker 0-0, policy_version 1075720 (0.00100) [2022-07-11 06:32:49,538][25689] Fps is (10 sec: 5601.1, 60 sec: 5659.4, 300 sec: 5655.0). Total num frames: 1101543424. Throughput: 0: 5962.5. Samples: 1101549584. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:32:49,539][25689] Avg episode reward: [(0, '-1.321')] [2022-07-11 06:32:50,114][26022] Updated weights on worker 0-0, policy_version 1075730 (0.00087) [2022-07-11 06:32:52,084][26022] Updated weights on worker 0-0, policy_version 1075740 (0.00093) [2022-07-11 06:32:53,633][26022] Updated weights on worker 0-0, policy_version 1075750 (0.00081) [2022-07-11 06:32:54,585][25689] Fps is (10 sec: 5598.5, 60 sec: 5694.9, 300 sec: 5654.4). Total num frames: 1101572096. Throughput: 0: 5099.1. Samples: 1101566676. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:32:54,587][25689] Avg episode reward: [(0, '-0.132')] [2022-07-11 06:32:55,558][26022] Updated weights on worker 0-0, policy_version 1075760 (0.00096) [2022-07-11 06:32:57,182][26022] Updated weights on worker 0-0, policy_version 1075770 (0.00078) [2022-07-11 06:32:59,116][26022] Updated weights on worker 0-0, policy_version 1075780 (0.00080) [2022-07-11 06:32:59,595][25689] Fps is (10 sec: 5804.0, 60 sec: 5701.7, 300 sec: 5668.6). Total num frames: 1101601792. Throughput: 0: 5968.5. Samples: 1101601306. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:32:59,595][25689] Avg episode reward: [(0, '0.909')] [2022-07-11 06:33:00,984][26022] Updated weights on worker 0-0, policy_version 1075790 (0.00089) [2022-07-11 06:33:02,901][26022] Updated weights on worker 0-0, policy_version 1075800 (0.01071) [2022-07-11 06:33:04,615][25689] Fps is (10 sec: 5513.2, 60 sec: 5670.2, 300 sec: 5654.8). Total num frames: 1101627392. Throughput: 0: 5871.4. Samples: 1101633406. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:04,615][25689] Avg episode reward: [(0, '1.116')] [2022-07-11 06:33:04,931][26022] Updated weights on worker 0-0, policy_version 1075810 (0.00080) [2022-07-11 06:33:06,748][26022] Updated weights on worker 0-0, policy_version 1075820 (0.00088) [2022-07-11 06:33:08,434][26022] Updated weights on worker 0-0, policy_version 1075830 (0.00092) [2022-07-11 06:33:09,626][25689] Fps is (10 sec: 5410.1, 60 sec: 5686.9, 300 sec: 5659.4). Total num frames: 1101656064. Throughput: 0: 5035.9. Samples: 1101650746. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:09,627][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 06:33:10,269][26022] Updated weights on worker 0-0, policy_version 1075840 (0.00081) [2022-07-11 06:33:12,080][26022] Updated weights on worker 0-0, policy_version 1075850 (0.00086) [2022-07-11 06:33:13,923][26022] Updated weights on worker 0-0, policy_version 1075860 (0.00085) [2022-07-11 06:33:14,684][25689] Fps is (10 sec: 5796.6, 60 sec: 5703.8, 300 sec: 5663.3). Total num frames: 1101685760. Throughput: 0: 5885.9. Samples: 1101684978. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:14,685][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 06:33:15,689][26022] Updated weights on worker 0-0, policy_version 1075870 (0.00087) [2022-07-11 06:33:17,318][26022] Updated weights on worker 0-0, policy_version 1075880 (0.00089) [2022-07-11 06:33:19,124][26022] Updated weights on worker 0-0, policy_version 1075890 (0.00083) [2022-07-11 06:33:19,698][25689] Fps is (10 sec: 5693.7, 60 sec: 5686.7, 300 sec: 5656.9). Total num frames: 1101713408. Throughput: 0: 5854.8. Samples: 1101719006. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:19,699][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 06:33:21,212][26022] Updated weights on worker 0-0, policy_version 1075900 (0.00081) [2022-07-11 06:33:22,923][26022] Updated weights on worker 0-0, policy_version 1075910 (0.00098) [2022-07-11 06:33:24,538][26022] Updated weights on worker 0-0, policy_version 1075920 (0.00090) [2022-07-11 06:33:24,711][25689] Fps is (10 sec: 5616.8, 60 sec: 5669.6, 300 sec: 5660.4). Total num frames: 1101742080. Throughput: 0: 5109.6. Samples: 1101736092. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:24,714][25689] Avg episode reward: [(0, '0.480')] [2022-07-11 06:33:26,578][26022] Updated weights on worker 0-0, policy_version 1075930 (0.00094) [2022-07-11 06:33:28,050][26022] Updated weights on worker 0-0, policy_version 1075940 (0.00111) [2022-07-11 06:33:29,728][25689] Fps is (10 sec: 5615.2, 60 sec: 5668.4, 300 sec: 5654.6). Total num frames: 1101769728. Throughput: 0: 5943.6. Samples: 1101770220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:29,728][25689] Avg episode reward: [(0, '0.511')] [2022-07-11 06:33:30,153][26022] Updated weights on worker 0-0, policy_version 1075950 (0.00082) [2022-07-11 06:33:31,795][26022] Updated weights on worker 0-0, policy_version 1075960 (0.00093) [2022-07-11 06:33:33,854][26022] Updated weights on worker 0-0, policy_version 1075970 (0.00080) [2022-07-11 06:33:34,775][25689] Fps is (10 sec: 5698.3, 60 sec: 5672.3, 300 sec: 5660.9). Total num frames: 1101799424. Throughput: 0: 5940.2. Samples: 1101804318. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:34,775][25689] Avg episode reward: [(0, '0.141')] [2022-07-11 06:33:35,440][26022] Updated weights on worker 0-0, policy_version 1075980 (0.00088) [2022-07-11 06:33:37,569][26022] Updated weights on worker 0-0, policy_version 1075990 (0.00087) [2022-07-11 06:33:39,015][26022] Updated weights on worker 0-0, policy_version 1076000 (0.00082) [2022-07-11 06:33:39,792][25689] Fps is (10 sec: 5799.5, 60 sec: 5654.1, 300 sec: 5660.9). Total num frames: 1101828096. Throughput: 0: 5095.5. Samples: 1101821396. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:39,795][25689] Avg episode reward: [(0, '-0.434')] [2022-07-11 06:33:40,929][26022] Updated weights on worker 0-0, policy_version 1076010 (0.00091) [2022-07-11 06:33:41,663][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:33:41,677][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001076015_1101839360.pth [2022-07-11 06:33:41,678][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001074023_1099799552.pth [2022-07-11 06:33:42,572][26022] Updated weights on worker 0-0, policy_version 1076020 (0.00083) [2022-07-11 06:33:44,411][26022] Updated weights on worker 0-0, policy_version 1076030 (0.00082) [2022-07-11 06:33:44,802][25689] Fps is (10 sec: 5616.8, 60 sec: 5640.1, 300 sec: 5657.8). Total num frames: 1101855744. Throughput: 0: 5961.5. Samples: 1101855860. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:44,802][25689] Avg episode reward: [(0, '0.535')] [2022-07-11 06:33:46,222][26022] Updated weights on worker 0-0, policy_version 1076040 (0.00083) [2022-07-11 06:33:47,943][26022] Updated weights on worker 0-0, policy_version 1076050 (0.00085) [2022-07-11 06:33:49,819][25689] Fps is (10 sec: 5617.0, 60 sec: 5656.8, 300 sec: 5662.3). Total num frames: 1101884416. Throughput: 0: 5968.8. Samples: 1101890140. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:49,819][25689] Avg episode reward: [(0, '0.337')] [2022-07-11 06:33:49,838][26022] Updated weights on worker 0-0, policy_version 1076060 (0.01031) [2022-07-11 06:33:51,639][26022] Updated weights on worker 0-0, policy_version 1076070 (0.00082) [2022-07-11 06:33:53,480][26022] Updated weights on worker 0-0, policy_version 1076080 (0.00083) [2022-07-11 06:33:54,873][25689] Fps is (10 sec: 5795.4, 60 sec: 5673.0, 300 sec: 5661.7). Total num frames: 1101914112. Throughput: 0: 5112.9. Samples: 1101907080. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:54,874][25689] Avg episode reward: [(0, '-0.760')] [2022-07-11 06:33:55,376][26022] Updated weights on worker 0-0, policy_version 1076090 (0.00053) [2022-07-11 06:33:56,998][26022] Updated weights on worker 0-0, policy_version 1076100 (0.00086) [2022-07-11 06:33:58,765][26022] Updated weights on worker 0-0, policy_version 1076110 (0.00083) [2022-07-11 06:33:59,880][25689] Fps is (10 sec: 5801.2, 60 sec: 5656.3, 300 sec: 5675.8). Total num frames: 1101942784. Throughput: 0: 5978.8. Samples: 1101941498. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:33:59,880][25689] Avg episode reward: [(0, '-1.583')] [2022-07-11 06:34:00,543][26022] Updated weights on worker 0-0, policy_version 1076120 (0.00085) [2022-07-11 06:34:02,764][26022] Updated weights on worker 0-0, policy_version 1076130 (0.00091) [2022-07-11 06:34:04,494][26022] Updated weights on worker 0-0, policy_version 1076140 (0.00088) [2022-07-11 06:34:04,903][25689] Fps is (10 sec: 5513.4, 60 sec: 5673.1, 300 sec: 5665.1). Total num frames: 1101969408. Throughput: 0: 5864.2. Samples: 1101973734. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:04,904][25689] Avg episode reward: [(0, '-0.166')] [2022-07-11 06:34:06,414][26022] Updated weights on worker 0-0, policy_version 1076150 (0.00084) [2022-07-11 06:34:08,075][26022] Updated weights on worker 0-0, policy_version 1076160 (0.00089) [2022-07-11 06:34:09,890][26022] Updated weights on worker 0-0, policy_version 1076170 (0.00095) [2022-07-11 06:34:09,915][25689] Fps is (10 sec: 5510.5, 60 sec: 5673.0, 300 sec: 5664.1). Total num frames: 1101998080. Throughput: 0: 5014.1. Samples: 1101990904. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:09,915][25689] Avg episode reward: [(0, '-0.138')] [2022-07-11 06:34:11,729][26022] Updated weights on worker 0-0, policy_version 1076180 (0.00091) [2022-07-11 06:34:13,483][26022] Updated weights on worker 0-0, policy_version 1076190 (0.00079) [2022-07-11 06:34:14,984][25689] Fps is (10 sec: 5688.1, 60 sec: 5655.0, 300 sec: 5667.9). Total num frames: 1102026752. Throughput: 0: 5886.1. Samples: 1102025454. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:14,986][25689] Avg episode reward: [(0, '-0.646')] [2022-07-11 06:34:15,232][26022] Updated weights on worker 0-0, policy_version 1076200 (0.00090) [2022-07-11 06:34:17,098][26022] Updated weights on worker 0-0, policy_version 1076210 (0.00090) [2022-07-11 06:34:18,803][26022] Updated weights on worker 0-0, policy_version 1076220 (0.00088) [2022-07-11 06:34:20,038][25689] Fps is (10 sec: 5664.7, 60 sec: 5668.2, 300 sec: 5663.7). Total num frames: 1102055424. Throughput: 0: 5882.6. Samples: 1102060078. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:20,038][25689] Avg episode reward: [(0, '-0.877')] [2022-07-11 06:34:20,691][26022] Updated weights on worker 0-0, policy_version 1076230 (0.00083) [2022-07-11 06:34:22,224][26022] Updated weights on worker 0-0, policy_version 1076240 (0.00081) [2022-07-11 06:34:24,322][26022] Updated weights on worker 0-0, policy_version 1076250 (0.00087) [2022-07-11 06:34:25,078][25689] Fps is (10 sec: 5782.4, 60 sec: 5682.6, 300 sec: 5671.0). Total num frames: 1102085120. Throughput: 0: 6000.0. Samples: 1102094786. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:25,079][25689] Avg episode reward: [(0, '-0.744')] [2022-07-11 06:34:25,775][26022] Updated weights on worker 0-0, policy_version 1076260 (0.00094) [2022-07-11 06:34:27,675][26022] Updated weights on worker 0-0, policy_version 1076270 (0.00085) [2022-07-11 06:34:29,361][26022] Updated weights on worker 0-0, policy_version 1076280 (0.00069) [2022-07-11 06:34:30,102][25689] Fps is (10 sec: 5799.2, 60 sec: 5698.8, 300 sec: 5671.7). Total num frames: 1102113792. Throughput: 0: 6005.9. Samples: 1102112150. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:30,104][25689] Avg episode reward: [(0, '0.011')] [2022-07-11 06:34:31,202][26022] Updated weights on worker 0-0, policy_version 1076290 (0.00089) [2022-07-11 06:34:33,073][26022] Updated weights on worker 0-0, policy_version 1076300 (0.00084) [2022-07-11 06:34:34,947][26022] Updated weights on worker 0-0, policy_version 1076310 (0.00084) [2022-07-11 06:34:35,171][25689] Fps is (10 sec: 5783.1, 60 sec: 5696.8, 300 sec: 5677.6). Total num frames: 1102143488. Throughput: 0: 5993.0. Samples: 1102146434. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:35,172][25689] Avg episode reward: [(0, '-0.323')] [2022-07-11 06:34:36,659][26022] Updated weights on worker 0-0, policy_version 1076320 (0.00091) [2022-07-11 06:34:38,475][26022] Updated weights on worker 0-0, policy_version 1076330 (0.00093) [2022-07-11 06:34:40,169][26022] Updated weights on worker 0-0, policy_version 1076340 (0.00089) [2022-07-11 06:34:40,194][25689] Fps is (10 sec: 5783.7, 60 sec: 5696.2, 300 sec: 5671.8). Total num frames: 1102172160. Throughput: 0: 5983.0. Samples: 1102180676. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:40,195][25689] Avg episode reward: [(0, '0.141')] [2022-07-11 06:34:42,100][26022] Updated weights on worker 0-0, policy_version 1076350 (0.00087) [2022-07-11 06:34:43,876][26022] Updated weights on worker 0-0, policy_version 1076360 (0.00087) [2022-07-11 06:34:45,219][25689] Fps is (10 sec: 5605.3, 60 sec: 5694.8, 300 sec: 5669.0). Total num frames: 1102199808. Throughput: 0: 5113.8. Samples: 1102197780. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:45,219][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 06:34:45,559][26022] Updated weights on worker 0-0, policy_version 1076370 (0.00093) [2022-07-11 06:34:47,289][26022] Updated weights on worker 0-0, policy_version 1076380 (0.00079) [2022-07-11 06:34:49,097][26022] Updated weights on worker 0-0, policy_version 1076390 (0.00087) [2022-07-11 06:34:50,248][25689] Fps is (10 sec: 5602.0, 60 sec: 5693.7, 300 sec: 5669.9). Total num frames: 1102228480. Throughput: 0: 5964.8. Samples: 1102232312. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:50,248][25689] Avg episode reward: [(0, '1.305')] [2022-07-11 06:34:50,879][26022] Updated weights on worker 0-0, policy_version 1076400 (0.00081) [2022-07-11 06:34:52,753][26022] Updated weights on worker 0-0, policy_version 1076410 (0.00087) [2022-07-11 06:34:54,334][26022] Updated weights on worker 0-0, policy_version 1076420 (0.00612) [2022-07-11 06:34:55,364][25689] Fps is (10 sec: 5652.5, 60 sec: 5671.0, 300 sec: 5667.8). Total num frames: 1102257152. Throughput: 0: 5952.9. Samples: 1102266638. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:34:55,364][25689] Avg episode reward: [(0, '1.403')] [2022-07-11 06:34:56,504][26022] Updated weights on worker 0-0, policy_version 1076430 (0.00081) [2022-07-11 06:34:58,317][26022] Updated weights on worker 0-0, policy_version 1076440 (0.00093) [2022-07-11 06:35:00,026][26022] Updated weights on worker 0-0, policy_version 1076450 (0.00092) [2022-07-11 06:35:00,439][25689] Fps is (10 sec: 5827.8, 60 sec: 5698.4, 300 sec: 5681.5). Total num frames: 1102287872. Throughput: 0: 5083.7. Samples: 1102283594. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:00,439][25689] Avg episode reward: [(0, '0.072')] [2022-07-11 06:35:02,076][26022] Updated weights on worker 0-0, policy_version 1076460 (0.00089) [2022-07-11 06:35:03,878][26022] Updated weights on worker 0-0, policy_version 1076470 (0.00094) [2022-07-11 06:35:05,493][25689] Fps is (10 sec: 5660.9, 60 sec: 5695.4, 300 sec: 5671.7). Total num frames: 1102314496. Throughput: 0: 5813.0. Samples: 1102315638. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:05,494][25689] Avg episode reward: [(0, '0.041')] [2022-07-11 06:35:05,706][26022] Updated weights on worker 0-0, policy_version 1076480 (0.00089) [2022-07-11 06:35:07,546][26022] Updated weights on worker 0-0, policy_version 1076490 (0.00086) [2022-07-11 06:35:09,234][26022] Updated weights on worker 0-0, policy_version 1076500 (0.00087) [2022-07-11 06:35:10,501][25689] Fps is (10 sec: 5393.8, 60 sec: 5678.9, 300 sec: 5672.3). Total num frames: 1102342144. Throughput: 0: 5827.8. Samples: 1102350344. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:10,501][25689] Avg episode reward: [(0, '0.158')] [2022-07-11 06:35:11,135][26022] Updated weights on worker 0-0, policy_version 1076510 (0.00079) [2022-07-11 06:35:12,873][26022] Updated weights on worker 0-0, policy_version 1076520 (0.00079) [2022-07-11 06:35:14,655][26022] Updated weights on worker 0-0, policy_version 1076530 (0.00083) [2022-07-11 06:35:15,567][25689] Fps is (10 sec: 5692.7, 60 sec: 5696.2, 300 sec: 5674.6). Total num frames: 1102371840. Throughput: 0: 4995.0. Samples: 1102367552. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:15,567][25689] Avg episode reward: [(0, '-0.774')] [2022-07-11 06:35:16,372][26022] Updated weights on worker 0-0, policy_version 1076540 (0.00086) [2022-07-11 06:35:18,217][26022] Updated weights on worker 0-0, policy_version 1076550 (0.00083) [2022-07-11 06:35:20,166][26022] Updated weights on worker 0-0, policy_version 1076560 (0.00089) [2022-07-11 06:35:20,575][25689] Fps is (10 sec: 5692.4, 60 sec: 5683.5, 300 sec: 5671.9). Total num frames: 1102399488. Throughput: 0: 5880.0. Samples: 1102401992. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:20,575][25689] Avg episode reward: [(0, '-1.468')] [2022-07-11 06:35:21,742][26022] Updated weights on worker 0-0, policy_version 1076570 (0.00089) [2022-07-11 06:35:23,679][26022] Updated weights on worker 0-0, policy_version 1076580 (0.00097) [2022-07-11 06:35:25,359][26022] Updated weights on worker 0-0, policy_version 1076590 (0.00094) [2022-07-11 06:35:25,596][25689] Fps is (10 sec: 5717.9, 60 sec: 5685.4, 300 sec: 5679.1). Total num frames: 1102429184. Throughput: 0: 5990.5. Samples: 1102436060. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:25,596][25689] Avg episode reward: [(0, '-2.194')] [2022-07-11 06:35:27,250][26022] Updated weights on worker 0-0, policy_version 1076600 (0.00083) [2022-07-11 06:35:29,098][26022] Updated weights on worker 0-0, policy_version 1076610 (0.00093) [2022-07-11 06:35:30,629][25689] Fps is (10 sec: 5703.2, 60 sec: 5667.6, 300 sec: 5672.7). Total num frames: 1102456832. Throughput: 0: 5107.1. Samples: 1102453142. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:30,631][25689] Avg episode reward: [(0, '-0.035')] [2022-07-11 06:35:30,778][26022] Updated weights on worker 0-0, policy_version 1076620 (0.00080) [2022-07-11 06:35:32,750][26022] Updated weights on worker 0-0, policy_version 1076630 (0.00095) [2022-07-11 06:35:34,498][26022] Updated weights on worker 0-0, policy_version 1076640 (0.00087) [2022-07-11 06:35:35,724][25689] Fps is (10 sec: 5560.9, 60 sec: 5648.2, 300 sec: 5675.4). Total num frames: 1102485504. Throughput: 0: 5938.3. Samples: 1102487250. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:35,724][25689] Avg episode reward: [(0, '-0.966')] [2022-07-11 06:35:36,185][26022] Updated weights on worker 0-0, policy_version 1076650 (0.00090) [2022-07-11 06:35:38,099][26022] Updated weights on worker 0-0, policy_version 1076660 (0.00097) [2022-07-11 06:35:39,823][26022] Updated weights on worker 0-0, policy_version 1076670 (0.00076) [2022-07-11 06:35:40,738][25689] Fps is (10 sec: 5773.9, 60 sec: 5666.0, 300 sec: 5680.1). Total num frames: 1102515200. Throughput: 0: 5943.0. Samples: 1102521824. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:40,739][25689] Avg episode reward: [(0, '-0.197')] [2022-07-11 06:35:41,471][26022] Updated weights on worker 0-0, policy_version 1076680 (0.00090) [2022-07-11 06:35:41,820][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:35:41,835][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001076682_1102522368.pth [2022-07-11 06:35:41,835][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001074685_1100477440.pth [2022-07-11 06:35:43,489][26022] Updated weights on worker 0-0, policy_version 1076690 (0.00095) [2022-07-11 06:35:45,003][26022] Updated weights on worker 0-0, policy_version 1076700 (0.00083) [2022-07-11 06:35:45,754][25689] Fps is (10 sec: 5921.5, 60 sec: 5700.6, 300 sec: 5680.1). Total num frames: 1102544896. Throughput: 0: 5103.7. Samples: 1102538940. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:45,755][25689] Avg episode reward: [(0, '0.405')] [2022-07-11 06:35:47,026][26022] Updated weights on worker 0-0, policy_version 1076710 (0.00087) [2022-07-11 06:35:48,647][26022] Updated weights on worker 0-0, policy_version 1076720 (0.00089) [2022-07-11 06:35:50,515][26022] Updated weights on worker 0-0, policy_version 1076730 (0.00090) [2022-07-11 06:35:50,783][25689] Fps is (10 sec: 5810.9, 60 sec: 5700.6, 300 sec: 5684.0). Total num frames: 1102573568. Throughput: 0: 5975.9. Samples: 1102573578. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:50,783][25689] Avg episode reward: [(0, '1.297')] [2022-07-11 06:35:52,308][26022] Updated weights on worker 0-0, policy_version 1076740 (0.00089) [2022-07-11 06:35:54,103][26022] Updated weights on worker 0-0, policy_version 1076750 (0.00084) [2022-07-11 06:35:55,927][25689] Fps is (10 sec: 5536.1, 60 sec: 5681.1, 300 sec: 5674.7). Total num frames: 1102601216. Throughput: 0: 5983.0. Samples: 1102608126. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:35:55,927][25689] Avg episode reward: [(0, '-0.697')] [2022-07-11 06:35:55,987][26022] Updated weights on worker 0-0, policy_version 1076760 (0.00091) [2022-07-11 06:35:57,732][26022] Updated weights on worker 0-0, policy_version 1076770 (0.00853) [2022-07-11 06:35:59,258][26022] Updated weights on worker 0-0, policy_version 1076780 (0.00086) [2022-07-11 06:36:00,946][25689] Fps is (10 sec: 5642.5, 60 sec: 5669.4, 300 sec: 5689.5). Total num frames: 1102630912. Throughput: 0: 5103.7. Samples: 1102624956. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:00,946][25689] Avg episode reward: [(0, '-0.821')] [2022-07-11 06:36:01,366][26022] Updated weights on worker 0-0, policy_version 1076790 (0.00090) [2022-07-11 06:36:03,351][26022] Updated weights on worker 0-0, policy_version 1076800 (0.00085) [2022-07-11 06:36:05,307][26022] Updated weights on worker 0-0, policy_version 1076810 (0.00081) [2022-07-11 06:36:05,948][25689] Fps is (10 sec: 5620.0, 60 sec: 5674.3, 300 sec: 5680.8). Total num frames: 1102657536. Throughput: 0: 5858.1. Samples: 1102657242. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:05,949][25689] Avg episode reward: [(0, '-1.015')] [2022-07-11 06:36:06,951][26022] Updated weights on worker 0-0, policy_version 1076820 (0.00082) [2022-07-11 06:36:08,764][26022] Updated weights on worker 0-0, policy_version 1076830 (0.00083) [2022-07-11 06:36:10,683][26022] Updated weights on worker 0-0, policy_version 1076840 (0.00098) [2022-07-11 06:36:10,964][25689] Fps is (10 sec: 5417.3, 60 sec: 5673.5, 300 sec: 5671.1). Total num frames: 1102685184. Throughput: 0: 5846.9. Samples: 1102691576. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:10,965][25689] Avg episode reward: [(0, '-1.150')] [2022-07-11 06:36:12,389][26022] Updated weights on worker 0-0, policy_version 1076850 (0.00082) [2022-07-11 06:36:14,272][26022] Updated weights on worker 0-0, policy_version 1076860 (0.00088) [2022-07-11 06:36:15,804][26022] Updated weights on worker 0-0, policy_version 1076870 (0.00093) [2022-07-11 06:36:16,020][25689] Fps is (10 sec: 5693.3, 60 sec: 5674.4, 300 sec: 5680.7). Total num frames: 1102714880. Throughput: 0: 5007.1. Samples: 1102708736. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:16,021][25689] Avg episode reward: [(0, '-1.476')] [2022-07-11 06:36:17,836][26022] Updated weights on worker 0-0, policy_version 1076880 (0.00085) [2022-07-11 06:36:19,508][26022] Updated weights on worker 0-0, policy_version 1076890 (0.00098) [2022-07-11 06:36:21,028][25689] Fps is (10 sec: 5901.5, 60 sec: 5708.3, 300 sec: 5684.1). Total num frames: 1102744576. Throughput: 0: 5890.7. Samples: 1102743254. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:21,031][25689] Avg episode reward: [(0, '0.120')] [2022-07-11 06:36:21,234][26022] Updated weights on worker 0-0, policy_version 1076900 (0.00088) [2022-07-11 06:36:23,180][26022] Updated weights on worker 0-0, policy_version 1076910 (0.00090) [2022-07-11 06:36:24,828][26022] Updated weights on worker 0-0, policy_version 1076920 (0.00087) [2022-07-11 06:36:26,131][25689] Fps is (10 sec: 5671.7, 60 sec: 5666.8, 300 sec: 5675.9). Total num frames: 1102772224. Throughput: 0: 5958.8. Samples: 1102777508. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:26,131][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 06:36:26,579][26022] Updated weights on worker 0-0, policy_version 1076930 (0.00092) [2022-07-11 06:36:28,723][26022] Updated weights on worker 0-0, policy_version 1076940 (0.00871) [2022-07-11 06:36:30,378][26022] Updated weights on worker 0-0, policy_version 1076950 (0.00090) [2022-07-11 06:36:31,136][25689] Fps is (10 sec: 5470.4, 60 sec: 5669.5, 300 sec: 5673.5). Total num frames: 1102799872. Throughput: 0: 5102.7. Samples: 1102794508. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 06:36:31,137][25689] Avg episode reward: [(0, '-0.002')] [2022-07-11 06:36:32,073][26022] Updated weights on worker 0-0, policy_version 1076960 (0.00082) [2022-07-11 06:36:34,007][26022] Updated weights on worker 0-0, policy_version 1076970 (0.00083) [2022-07-11 06:36:35,541][26022] Updated weights on worker 0-0, policy_version 1076980 (0.00089) [2022-07-11 06:36:36,210][25689] Fps is (10 sec: 5790.9, 60 sec: 5705.2, 300 sec: 5675.6). Total num frames: 1102830592. Throughput: 0: 5970.0. Samples: 1102829270. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:36:36,211][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 06:36:37,604][26022] Updated weights on worker 0-0, policy_version 1076990 (0.00086) [2022-07-11 06:36:39,163][26022] Updated weights on worker 0-0, policy_version 1077000 (0.00082) [2022-07-11 06:36:40,927][26022] Updated weights on worker 0-0, policy_version 1077010 (0.00083) [2022-07-11 06:36:41,240][25689] Fps is (10 sec: 5979.4, 60 sec: 5703.8, 300 sec: 5685.6). Total num frames: 1102860288. Throughput: 0: 5964.6. Samples: 1102863812. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:36:41,242][25689] Avg episode reward: [(0, '1.269')] [2022-07-11 06:36:42,742][26022] Updated weights on worker 0-0, policy_version 1077020 (0.00088) [2022-07-11 06:36:44,595][26022] Updated weights on worker 0-0, policy_version 1077030 (0.00084) [2022-07-11 06:36:46,295][25689] Fps is (10 sec: 5686.0, 60 sec: 5666.2, 300 sec: 5674.9). Total num frames: 1102887936. Throughput: 0: 5134.8. Samples: 1102881048. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:36:46,296][25689] Avg episode reward: [(0, '-0.956')] [2022-07-11 06:36:46,397][26022] Updated weights on worker 0-0, policy_version 1077040 (0.00082) [2022-07-11 06:36:48,306][26022] Updated weights on worker 0-0, policy_version 1077050 (0.00091) [2022-07-11 06:36:50,010][26022] Updated weights on worker 0-0, policy_version 1077060 (0.00087) [2022-07-11 06:36:51,354][25689] Fps is (10 sec: 5467.1, 60 sec: 5646.5, 300 sec: 5678.4). Total num frames: 1102915584. Throughput: 0: 5962.4. Samples: 1102915058. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:36:51,355][25689] Avg episode reward: [(0, '-1.176')] [2022-07-11 06:36:51,832][26022] Updated weights on worker 0-0, policy_version 1077070 (0.00094) [2022-07-11 06:36:53,713][26022] Updated weights on worker 0-0, policy_version 1077080 (0.00085) [2022-07-11 06:36:55,275][26022] Updated weights on worker 0-0, policy_version 1077090 (0.00084) [2022-07-11 06:36:56,433][25689] Fps is (10 sec: 5555.5, 60 sec: 5669.5, 300 sec: 5675.1). Total num frames: 1102944256. Throughput: 0: 5922.5. Samples: 1102949040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:36:56,435][25689] Avg episode reward: [(0, '-1.436')] [2022-07-11 06:36:57,391][26022] Updated weights on worker 0-0, policy_version 1077100 (0.00083) [2022-07-11 06:36:58,872][26022] Updated weights on worker 0-0, policy_version 1077110 (0.00084) [2022-07-11 06:37:00,866][26022] Updated weights on worker 0-0, policy_version 1077120 (0.00079) [2022-07-11 06:37:01,489][25689] Fps is (10 sec: 5860.1, 60 sec: 5682.9, 300 sec: 5685.2). Total num frames: 1102974976. Throughput: 0: 5910.3. Samples: 1102983492. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:01,490][25689] Avg episode reward: [(0, '-1.678')] [2022-07-11 06:37:02,955][26022] Updated weights on worker 0-0, policy_version 1077130 (0.00080) [2022-07-11 06:37:04,745][26022] Updated weights on worker 0-0, policy_version 1077140 (0.00091) [2022-07-11 06:37:06,495][26022] Updated weights on worker 0-0, policy_version 1077150 (0.00089) [2022-07-11 06:37:06,588][25689] Fps is (10 sec: 5646.7, 60 sec: 5673.9, 300 sec: 5680.0). Total num frames: 1103001600. Throughput: 0: 5794.1. Samples: 1102998628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:06,589][25689] Avg episode reward: [(0, '-1.586')] [2022-07-11 06:37:08,310][26022] Updated weights on worker 0-0, policy_version 1077160 (0.00088) [2022-07-11 06:37:10,067][26022] Updated weights on worker 0-0, policy_version 1077170 (0.00093) [2022-07-11 06:37:11,601][25689] Fps is (10 sec: 5367.1, 60 sec: 5674.1, 300 sec: 5677.4). Total num frames: 1103029248. Throughput: 0: 5835.4. Samples: 1103033208. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:11,604][25689] Avg episode reward: [(0, '-0.402')] [2022-07-11 06:37:11,880][26022] Updated weights on worker 0-0, policy_version 1077180 (0.00092) [2022-07-11 06:37:13,386][26022] Updated weights on worker 0-0, policy_version 1077190 (0.00091) [2022-07-11 06:37:15,379][26022] Updated weights on worker 0-0, policy_version 1077200 (0.00085) [2022-07-11 06:37:16,695][25689] Fps is (10 sec: 5775.4, 60 sec: 5687.5, 300 sec: 5682.8). Total num frames: 1103059968. Throughput: 0: 5856.7. Samples: 1103067706. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:16,696][25689] Avg episode reward: [(0, '1.369')] [2022-07-11 06:37:17,288][26022] Updated weights on worker 0-0, policy_version 1077210 (0.00091) [2022-07-11 06:37:18,984][26022] Updated weights on worker 0-0, policy_version 1077220 (0.00081) [2022-07-11 06:37:20,854][26022] Updated weights on worker 0-0, policy_version 1077230 (0.00087) [2022-07-11 06:37:21,702][25689] Fps is (10 sec: 5880.2, 60 sec: 5670.7, 300 sec: 5679.4). Total num frames: 1103088640. Throughput: 0: 5022.5. Samples: 1103085008. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:21,702][25689] Avg episode reward: [(0, '0.201')] [2022-07-11 06:37:22,429][26022] Updated weights on worker 0-0, policy_version 1077240 (0.00091) [2022-07-11 06:37:24,493][26022] Updated weights on worker 0-0, policy_version 1077250 (0.00086) [2022-07-11 06:37:26,242][26022] Updated weights on worker 0-0, policy_version 1077260 (0.00084) [2022-07-11 06:37:26,715][25689] Fps is (10 sec: 5620.7, 60 sec: 5679.1, 300 sec: 5679.2). Total num frames: 1103116288. Throughput: 0: 5995.4. Samples: 1103119294. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:26,715][25689] Avg episode reward: [(0, '-0.329')] [2022-07-11 06:37:27,884][26022] Updated weights on worker 0-0, policy_version 1077270 (0.00085) [2022-07-11 06:37:29,907][26022] Updated weights on worker 0-0, policy_version 1077280 (0.00085) [2022-07-11 06:37:31,477][26022] Updated weights on worker 0-0, policy_version 1077290 (0.00082) [2022-07-11 06:37:31,762][25689] Fps is (10 sec: 5700.0, 60 sec: 5709.0, 300 sec: 5680.0). Total num frames: 1103145984. Throughput: 0: 5973.1. Samples: 1103153630. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:31,763][25689] Avg episode reward: [(0, '-0.342')] [2022-07-11 06:37:33,389][26022] Updated weights on worker 0-0, policy_version 1077300 (0.00089) [2022-07-11 06:37:35,138][26022] Updated weights on worker 0-0, policy_version 1077310 (0.00082) [2022-07-11 06:37:36,860][25689] Fps is (10 sec: 5753.4, 60 sec: 5673.0, 300 sec: 5674.8). Total num frames: 1103174656. Throughput: 0: 5109.2. Samples: 1103170734. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:36,860][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 06:37:37,154][26022] Updated weights on worker 0-0, policy_version 1077320 (0.00084) [2022-07-11 06:37:38,691][26022] Updated weights on worker 0-0, policy_version 1077330 (0.00085) [2022-07-11 06:37:40,640][26022] Updated weights on worker 0-0, policy_version 1077340 (0.00087) [2022-07-11 06:37:41,920][25689] Fps is (10 sec: 5645.0, 60 sec: 5653.2, 300 sec: 5674.4). Total num frames: 1103203328. Throughput: 0: 5931.5. Samples: 1103204936. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:41,921][25689] Avg episode reward: [(0, '-0.058')] [2022-07-11 06:37:42,000][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:37:42,016][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001077348_1103204352.pth [2022-07-11 06:37:42,016][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001075350_1101158400.pth [2022-07-11 06:37:42,375][26022] Updated weights on worker 0-0, policy_version 1077350 (0.00091) [2022-07-11 06:37:44,191][26022] Updated weights on worker 0-0, policy_version 1077360 (0.00082) [2022-07-11 06:37:45,894][26022] Updated weights on worker 0-0, policy_version 1077370 (0.00081) [2022-07-11 06:37:46,962][25689] Fps is (10 sec: 5676.3, 60 sec: 5671.4, 300 sec: 5677.3). Total num frames: 1103232000. Throughput: 0: 5939.2. Samples: 1103239546. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:46,963][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 06:37:47,837][26022] Updated weights on worker 0-0, policy_version 1077380 (0.00085) [2022-07-11 06:37:49,552][26022] Updated weights on worker 0-0, policy_version 1077390 (0.00090) [2022-07-11 06:37:51,304][26022] Updated weights on worker 0-0, policy_version 1077400 (0.00090) [2022-07-11 06:37:51,972][25689] Fps is (10 sec: 5807.1, 60 sec: 5709.8, 300 sec: 5681.5). Total num frames: 1103261696. Throughput: 0: 5103.0. Samples: 1103256760. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:51,972][25689] Avg episode reward: [(0, '1.891')] [2022-07-11 06:37:53,186][26022] Updated weights on worker 0-0, policy_version 1077410 (0.00093) [2022-07-11 06:37:54,991][26022] Updated weights on worker 0-0, policy_version 1077420 (0.00090) [2022-07-11 06:37:56,739][26022] Updated weights on worker 0-0, policy_version 1077430 (0.00081) [2022-07-11 06:37:57,017][25689] Fps is (10 sec: 5703.2, 60 sec: 5696.0, 300 sec: 5673.9). Total num frames: 1103289344. Throughput: 0: 5952.0. Samples: 1103290708. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:37:57,017][25689] Avg episode reward: [(0, '1.625')] [2022-07-11 06:37:58,443][26022] Updated weights on worker 0-0, policy_version 1077440 (0.00086) [2022-07-11 06:38:00,288][26022] Updated weights on worker 0-0, policy_version 1077450 (0.00083) [2022-07-11 06:38:02,048][25689] Fps is (10 sec: 5487.8, 60 sec: 5647.7, 300 sec: 5680.6). Total num frames: 1103316992. Throughput: 0: 5963.8. Samples: 1103324972. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:02,048][25689] Avg episode reward: [(0, '1.598')] [2022-07-11 06:38:02,403][26022] Updated weights on worker 0-0, policy_version 1077460 (0.00092) [2022-07-11 06:38:04,343][26022] Updated weights on worker 0-0, policy_version 1077470 (0.00093) [2022-07-11 06:38:06,033][26022] Updated weights on worker 0-0, policy_version 1077480 (0.00088) [2022-07-11 06:38:07,110][25689] Fps is (10 sec: 5579.9, 60 sec: 5685.0, 300 sec: 5679.6). Total num frames: 1103345664. Throughput: 0: 5004.3. Samples: 1103340372. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:07,112][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 06:38:07,752][26022] Updated weights on worker 0-0, policy_version 1077490 (0.00093) [2022-07-11 06:38:09,691][26022] Updated weights on worker 0-0, policy_version 1077500 (0.00090) [2022-07-11 06:38:11,327][26022] Updated weights on worker 0-0, policy_version 1077510 (0.00087) [2022-07-11 06:38:12,135][25689] Fps is (10 sec: 5684.8, 60 sec: 5700.8, 300 sec: 5676.8). Total num frames: 1103374336. Throughput: 0: 5853.7. Samples: 1103374790. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:12,137][25689] Avg episode reward: [(0, '1.679')] [2022-07-11 06:38:13,052][26022] Updated weights on worker 0-0, policy_version 1077520 (0.00084) [2022-07-11 06:38:14,959][26022] Updated weights on worker 0-0, policy_version 1077530 (0.00092) [2022-07-11 06:38:16,923][26022] Updated weights on worker 0-0, policy_version 1077540 (0.00913) [2022-07-11 06:38:17,240][25689] Fps is (10 sec: 5661.0, 60 sec: 5665.9, 300 sec: 5678.5). Total num frames: 1103403008. Throughput: 0: 5861.7. Samples: 1103409248. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:17,240][25689] Avg episode reward: [(0, '1.360')] [2022-07-11 06:38:18,395][26022] Updated weights on worker 0-0, policy_version 1077550 (0.00091) [2022-07-11 06:38:20,382][26022] Updated weights on worker 0-0, policy_version 1077560 (0.00085) [2022-07-11 06:38:21,983][26022] Updated weights on worker 0-0, policy_version 1077570 (0.00087) [2022-07-11 06:38:22,250][25689] Fps is (10 sec: 5669.0, 60 sec: 5665.5, 300 sec: 5678.6). Total num frames: 1103431680. Throughput: 0: 5030.5. Samples: 1103426602. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:22,251][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 06:38:23,994][26022] Updated weights on worker 0-0, policy_version 1077580 (0.00091) [2022-07-11 06:38:25,719][26022] Updated weights on worker 0-0, policy_version 1077590 (0.00083) [2022-07-11 06:38:27,261][25689] Fps is (10 sec: 5722.3, 60 sec: 5682.7, 300 sec: 5682.1). Total num frames: 1103460352. Throughput: 0: 5967.3. Samples: 1103460618. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:27,261][25689] Avg episode reward: [(0, '1.781')] [2022-07-11 06:38:27,694][26022] Updated weights on worker 0-0, policy_version 1077600 (0.00084) [2022-07-11 06:38:29,377][26022] Updated weights on worker 0-0, policy_version 1077610 (0.00083) [2022-07-11 06:38:31,157][26022] Updated weights on worker 0-0, policy_version 1077620 (0.00085) [2022-07-11 06:38:32,279][25689] Fps is (10 sec: 5615.9, 60 sec: 5651.6, 300 sec: 5675.8). Total num frames: 1103488000. Throughput: 0: 5956.1. Samples: 1103494770. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:32,279][25689] Avg episode reward: [(0, '1.860')] [2022-07-11 06:38:33,019][26022] Updated weights on worker 0-0, policy_version 1077630 (0.00086) [2022-07-11 06:38:34,636][26022] Updated weights on worker 0-0, policy_version 1077640 (0.00091) [2022-07-11 06:38:36,327][26022] Updated weights on worker 0-0, policy_version 1077650 (0.00081) [2022-07-11 06:38:37,393][25689] Fps is (10 sec: 5861.9, 60 sec: 5700.8, 300 sec: 5684.3). Total num frames: 1103519744. Throughput: 0: 5106.1. Samples: 1103512152. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:37,393][25689] Avg episode reward: [(0, '2.045')] [2022-07-11 06:38:38,346][26022] Updated weights on worker 0-0, policy_version 1077660 (0.00092) [2022-07-11 06:38:39,969][26022] Updated weights on worker 0-0, policy_version 1077670 (0.00085) [2022-07-11 06:38:41,875][26022] Updated weights on worker 0-0, policy_version 1077680 (0.00083) [2022-07-11 06:38:42,431][25689] Fps is (10 sec: 5850.1, 60 sec: 5686.0, 300 sec: 5683.7). Total num frames: 1103547392. Throughput: 0: 5947.5. Samples: 1103546628. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:42,432][25689] Avg episode reward: [(0, '1.824')] [2022-07-11 06:38:43,571][26022] Updated weights on worker 0-0, policy_version 1077690 (0.00089) [2022-07-11 06:38:45,370][26022] Updated weights on worker 0-0, policy_version 1077700 (0.00084) [2022-07-11 06:38:47,422][26022] Updated weights on worker 0-0, policy_version 1077710 (0.00080) [2022-07-11 06:38:47,510][25689] Fps is (10 sec: 5465.5, 60 sec: 5665.6, 300 sec: 5679.1). Total num frames: 1103575040. Throughput: 0: 5960.0. Samples: 1103581302. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:47,511][25689] Avg episode reward: [(0, '1.904')] [2022-07-11 06:38:48,785][26022] Updated weights on worker 0-0, policy_version 1077720 (0.00084) [2022-07-11 06:38:50,694][26022] Updated weights on worker 0-0, policy_version 1077730 (0.00095) [2022-07-11 06:38:52,341][26022] Updated weights on worker 0-0, policy_version 1077740 (0.00081) [2022-07-11 06:38:52,517][25689] Fps is (10 sec: 5787.0, 60 sec: 5682.7, 300 sec: 5683.5). Total num frames: 1103605760. Throughput: 0: 5127.1. Samples: 1103598534. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:52,518][25689] Avg episode reward: [(0, '1.982')] [2022-07-11 06:38:54,511][26022] Updated weights on worker 0-0, policy_version 1077750 (0.00098) [2022-07-11 06:38:56,375][26022] Updated weights on worker 0-0, policy_version 1077760 (0.00084) [2022-07-11 06:38:57,639][25689] Fps is (10 sec: 5661.5, 60 sec: 5658.7, 300 sec: 5674.4). Total num frames: 1103632384. Throughput: 0: 5904.2. Samples: 1103631690. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:38:57,639][25689] Avg episode reward: [(0, '1.878')] [2022-07-11 06:38:58,073][26022] Updated weights on worker 0-0, policy_version 1077770 (0.00084) [2022-07-11 06:38:59,794][26022] Updated weights on worker 0-0, policy_version 1077780 (0.00078) [2022-07-11 06:39:02,150][26022] Updated weights on worker 0-0, policy_version 1077790 (0.00095) [2022-07-11 06:39:02,651][25689] Fps is (10 sec: 5355.8, 60 sec: 5660.5, 300 sec: 5678.0). Total num frames: 1103660032. Throughput: 0: 5819.7. Samples: 1103664298. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:02,651][25689] Avg episode reward: [(0, '0.962')] [2022-07-11 06:39:03,838][26022] Updated weights on worker 0-0, policy_version 1077800 (0.00102) [2022-07-11 06:39:05,902][26022] Updated weights on worker 0-0, policy_version 1077810 (0.00090) [2022-07-11 06:39:07,450][26022] Updated weights on worker 0-0, policy_version 1077820 (0.00085) [2022-07-11 06:39:07,659][25689] Fps is (10 sec: 5620.5, 60 sec: 5665.5, 300 sec: 5678.1). Total num frames: 1103688704. Throughput: 0: 4933.7. Samples: 1103680712. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:07,661][25689] Avg episode reward: [(0, '0.701')] [2022-07-11 06:39:09,232][26022] Updated weights on worker 0-0, policy_version 1077830 (0.00082) [2022-07-11 06:39:11,133][26022] Updated weights on worker 0-0, policy_version 1077840 (0.00086) [2022-07-11 06:39:12,716][25689] Fps is (10 sec: 5595.3, 60 sec: 5645.6, 300 sec: 5674.9). Total num frames: 1103716352. Throughput: 0: 5766.6. Samples: 1103715012. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:12,718][25689] Avg episode reward: [(0, '0.854')] [2022-07-11 06:39:12,859][26022] Updated weights on worker 0-0, policy_version 1077850 (0.00080) [2022-07-11 06:39:14,780][26022] Updated weights on worker 0-0, policy_version 1077860 (0.00109) [2022-07-11 06:39:16,585][26022] Updated weights on worker 0-0, policy_version 1077870 (0.00082) [2022-07-11 06:39:17,840][25689] Fps is (10 sec: 5632.8, 60 sec: 5660.7, 300 sec: 5677.0). Total num frames: 1103746048. Throughput: 0: 5806.8. Samples: 1103748992. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:17,842][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 06:39:18,094][26022] Updated weights on worker 0-0, policy_version 1077880 (0.00078) [2022-07-11 06:39:20,246][26022] Updated weights on worker 0-0, policy_version 1077890 (0.00086) [2022-07-11 06:39:21,895][26022] Updated weights on worker 0-0, policy_version 1077900 (0.00087) [2022-07-11 06:39:22,867][25689] Fps is (10 sec: 5649.3, 60 sec: 5642.3, 300 sec: 5670.4). Total num frames: 1103773696. Throughput: 0: 5887.1. Samples: 1103783314. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:22,867][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 06:39:23,623][26022] Updated weights on worker 0-0, policy_version 1077910 (0.00086) [2022-07-11 06:39:25,663][26022] Updated weights on worker 0-0, policy_version 1077920 (0.00085) [2022-07-11 06:39:27,271][26022] Updated weights on worker 0-0, policy_version 1077930 (0.00087) [2022-07-11 06:39:27,887][25689] Fps is (10 sec: 5605.8, 60 sec: 5641.4, 300 sec: 5670.5). Total num frames: 1103802368. Throughput: 0: 5908.7. Samples: 1103800228. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:27,888][25689] Avg episode reward: [(0, '1.922')] [2022-07-11 06:39:29,195][26022] Updated weights on worker 0-0, policy_version 1077940 (0.00094) [2022-07-11 06:39:31,156][26022] Updated weights on worker 0-0, policy_version 1077950 (0.00085) [2022-07-11 06:39:32,847][26022] Updated weights on worker 0-0, policy_version 1077960 (0.00089) [2022-07-11 06:39:32,927][25689] Fps is (10 sec: 5700.2, 60 sec: 5656.3, 300 sec: 5667.6). Total num frames: 1103831040. Throughput: 0: 5877.8. Samples: 1103833806. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:32,927][25689] Avg episode reward: [(0, '1.969')] [2022-07-11 06:39:34,647][26022] Updated weights on worker 0-0, policy_version 1077970 (0.00084) [2022-07-11 06:39:36,631][26022] Updated weights on worker 0-0, policy_version 1077980 (0.00110) [2022-07-11 06:39:38,048][25689] Fps is (10 sec: 5743.9, 60 sec: 5621.8, 300 sec: 5669.1). Total num frames: 1103860736. Throughput: 0: 5890.6. Samples: 1103868032. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:38,049][25689] Avg episode reward: [(0, '2.073')] [2022-07-11 06:39:38,073][26022] Updated weights on worker 0-0, policy_version 1077990 (0.00090) [2022-07-11 06:39:40,276][26022] Updated weights on worker 0-0, policy_version 1078000 (0.00084) [2022-07-11 06:39:41,610][26022] Updated weights on worker 0-0, policy_version 1078010 (0.00103) [2022-07-11 06:39:42,287][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:39:42,303][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001078012_1103884288.pth [2022-07-11 06:39:42,303][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001076015_1101839360.pth [2022-07-11 06:39:43,073][25689] Fps is (10 sec: 5651.7, 60 sec: 5623.0, 300 sec: 5669.1). Total num frames: 1103888384. Throughput: 0: 5041.9. Samples: 1103885192. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:43,074][25689] Avg episode reward: [(0, '1.998')] [2022-07-11 06:39:43,501][26022] Updated weights on worker 0-0, policy_version 1078020 (0.00089) [2022-07-11 06:39:45,598][26022] Updated weights on worker 0-0, policy_version 1078030 (0.00084) [2022-07-11 06:39:46,991][26022] Updated weights on worker 0-0, policy_version 1078040 (0.00085) [2022-07-11 06:39:48,108][25689] Fps is (10 sec: 5598.8, 60 sec: 5644.1, 300 sec: 5669.0). Total num frames: 1103917056. Throughput: 0: 5913.1. Samples: 1103919796. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:48,108][25689] Avg episode reward: [(0, '1.706')] [2022-07-11 06:39:48,918][26022] Updated weights on worker 0-0, policy_version 1078050 (0.00084) [2022-07-11 06:39:50,866][26022] Updated weights on worker 0-0, policy_version 1078060 (0.00080) [2022-07-11 06:39:52,508][26022] Updated weights on worker 0-0, policy_version 1078070 (0.00089) [2022-07-11 06:39:53,117][25689] Fps is (10 sec: 5913.3, 60 sec: 5643.9, 300 sec: 5677.9). Total num frames: 1103947776. Throughput: 0: 5974.5. Samples: 1103954432. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:53,118][25689] Avg episode reward: [(0, '1.639')] [2022-07-11 06:39:54,358][26022] Updated weights on worker 0-0, policy_version 1078080 (0.00471) [2022-07-11 06:39:56,135][26022] Updated weights on worker 0-0, policy_version 1078090 (0.00087) [2022-07-11 06:39:57,909][26022] Updated weights on worker 0-0, policy_version 1078100 (0.00084) [2022-07-11 06:39:58,180][25689] Fps is (10 sec: 5794.6, 60 sec: 5666.2, 300 sec: 5667.8). Total num frames: 1103975424. Throughput: 0: 5128.0. Samples: 1103971270. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:39:58,181][25689] Avg episode reward: [(0, '1.442')] [2022-07-11 06:40:00,042][26022] Updated weights on worker 0-0, policy_version 1078110 (0.00092) [2022-07-11 06:40:01,438][26022] Updated weights on worker 0-0, policy_version 1078120 (0.00086) [2022-07-11 06:40:03,210][25689] Fps is (10 sec: 5174.5, 60 sec: 5613.8, 300 sec: 5661.4). Total num frames: 1104000000. Throughput: 0: 5962.2. Samples: 1104005250. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:40:03,210][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 06:40:03,925][26022] Updated weights on worker 0-0, policy_version 1078130 (0.00097) [2022-07-11 06:40:05,347][26022] Updated weights on worker 0-0, policy_version 1078140 (0.00090) [2022-07-11 06:40:07,395][26022] Updated weights on worker 0-0, policy_version 1078150 (0.00083) [2022-07-11 06:40:08,223][25689] Fps is (10 sec: 5506.3, 60 sec: 5647.3, 300 sec: 5671.7). Total num frames: 1104030720. Throughput: 0: 5846.4. Samples: 1104037396. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:40:08,223][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 06:40:09,335][26022] Updated weights on worker 0-0, policy_version 1078160 (0.00081) [2022-07-11 06:40:10,985][26022] Updated weights on worker 0-0, policy_version 1078170 (0.00083) [2022-07-11 06:40:12,797][26022] Updated weights on worker 0-0, policy_version 1078180 (0.00084) [2022-07-11 06:40:13,231][25689] Fps is (10 sec: 5926.8, 60 sec: 5668.7, 300 sec: 5669.3). Total num frames: 1104059392. Throughput: 0: 4979.1. Samples: 1104054580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:40:13,231][25689] Avg episode reward: [(0, '0.141')] [2022-07-11 06:40:14,481][26022] Updated weights on worker 0-0, policy_version 1078190 (0.00082) [2022-07-11 06:40:16,158][26022] Updated weights on worker 0-0, policy_version 1078200 (0.00082) [2022-07-11 06:40:18,274][26022] Updated weights on worker 0-0, policy_version 1078210 (0.00086) [2022-07-11 06:40:18,323][25689] Fps is (10 sec: 5576.2, 60 sec: 5637.8, 300 sec: 5667.7). Total num frames: 1104087040. Throughput: 0: 5842.8. Samples: 1104088958. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:40:18,323][25689] Avg episode reward: [(0, '-0.111')] [2022-07-11 06:40:19,803][26022] Updated weights on worker 0-0, policy_version 1078220 (0.00090) [2022-07-11 06:40:21,827][26022] Updated weights on worker 0-0, policy_version 1078230 (0.00083) [2022-07-11 06:40:23,369][25689] Fps is (10 sec: 5656.3, 60 sec: 5669.9, 300 sec: 5667.2). Total num frames: 1104116736. Throughput: 0: 5836.4. Samples: 1104122906. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:40:23,369][25689] Avg episode reward: [(0, '-0.129')] [2022-07-11 06:40:23,521][26022] Updated weights on worker 0-0, policy_version 1078240 (0.00085) [2022-07-11 06:40:25,315][26022] Updated weights on worker 0-0, policy_version 1078250 (0.00087) [2022-07-11 06:40:27,273][26022] Updated weights on worker 0-0, policy_version 1078260 (0.00082) [2022-07-11 06:40:28,405][25689] Fps is (10 sec: 5789.3, 60 sec: 5668.4, 300 sec: 5670.6). Total num frames: 1104145408. Throughput: 0: 5077.6. Samples: 1104139872. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 06:40:28,405][25689] Avg episode reward: [(0, '0.052')] [2022-07-11 06:40:28,777][26022] Updated weights on worker 0-0, policy_version 1078270 (0.00088) [2022-07-11 06:40:30,879][26022] Updated weights on worker 0-0, policy_version 1078280 (0.00079) [2022-07-11 06:40:32,702][26022] Updated weights on worker 0-0, policy_version 1078290 (0.00085) [2022-07-11 06:40:33,427][25689] Fps is (10 sec: 5701.3, 60 sec: 5670.1, 300 sec: 5672.0). Total num frames: 1104174080. Throughput: 0: 5910.8. Samples: 1104173954. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:40:33,428][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 06:40:34,369][26022] Updated weights on worker 0-0, policy_version 1078300 (0.00084) [2022-07-11 06:40:36,154][26022] Updated weights on worker 0-0, policy_version 1078310 (0.00085) [2022-07-11 06:40:38,140][26022] Updated weights on worker 0-0, policy_version 1078320 (0.00083) [2022-07-11 06:40:38,486][25689] Fps is (10 sec: 5687.9, 60 sec: 5658.9, 300 sec: 5667.7). Total num frames: 1104202752. Throughput: 0: 5934.2. Samples: 1104208614. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:40:38,488][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 06:40:39,747][26022] Updated weights on worker 0-0, policy_version 1078330 (0.00086) [2022-07-11 06:40:41,648][26022] Updated weights on worker 0-0, policy_version 1078340 (0.00086) [2022-07-11 06:40:43,128][26022] Updated weights on worker 0-0, policy_version 1078350 (0.00086) [2022-07-11 06:40:43,515][25689] Fps is (10 sec: 5684.2, 60 sec: 5675.6, 300 sec: 5664.0). Total num frames: 1104231424. Throughput: 0: 5103.7. Samples: 1104225726. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:40:43,515][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 06:40:45,091][26022] Updated weights on worker 0-0, policy_version 1078360 (0.00094) [2022-07-11 06:40:46,759][26022] Updated weights on worker 0-0, policy_version 1078370 (0.00092) [2022-07-11 06:40:48,469][26022] Updated weights on worker 0-0, policy_version 1078380 (0.00089) [2022-07-11 06:40:48,540][25689] Fps is (10 sec: 5805.7, 60 sec: 5693.4, 300 sec: 5667.5). Total num frames: 1104261120. Throughput: 0: 5982.7. Samples: 1104260336. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:40:48,540][25689] Avg episode reward: [(0, '1.173')] [2022-07-11 06:40:50,552][26022] Updated weights on worker 0-0, policy_version 1078390 (0.00085) [2022-07-11 06:40:52,140][26022] Updated weights on worker 0-0, policy_version 1078400 (0.00084) [2022-07-11 06:40:53,578][25689] Fps is (10 sec: 5494.8, 60 sec: 5606.0, 300 sec: 5662.7). Total num frames: 1104286720. Throughput: 0: 5978.0. Samples: 1104294420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:40:53,579][25689] Avg episode reward: [(0, '1.105')] [2022-07-11 06:40:54,065][26022] Updated weights on worker 0-0, policy_version 1078410 (0.00099) [2022-07-11 06:40:55,836][26022] Updated weights on worker 0-0, policy_version 1078420 (0.00083) [2022-07-11 06:40:57,518][26022] Updated weights on worker 0-0, policy_version 1078430 (0.00088) [2022-07-11 06:40:58,718][25689] Fps is (10 sec: 5533.6, 60 sec: 5649.7, 300 sec: 5663.8). Total num frames: 1104317440. Throughput: 0: 5078.2. Samples: 1104311352. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:40:58,718][25689] Avg episode reward: [(0, '1.166')] [2022-07-11 06:40:59,394][26022] Updated weights on worker 0-0, policy_version 1078440 (0.00084) [2022-07-11 06:41:01,317][26022] Updated weights on worker 0-0, policy_version 1078450 (0.00090) [2022-07-11 06:41:03,282][26022] Updated weights on worker 0-0, policy_version 1078460 (0.00082) [2022-07-11 06:41:03,779][25689] Fps is (10 sec: 5722.0, 60 sec: 5697.4, 300 sec: 5666.1). Total num frames: 1104345088. Throughput: 0: 5909.6. Samples: 1104345478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:03,779][25689] Avg episode reward: [(0, '0.778')] [2022-07-11 06:41:05,215][26022] Updated weights on worker 0-0, policy_version 1078470 (0.00080) [2022-07-11 06:41:06,909][26022] Updated weights on worker 0-0, policy_version 1078480 (0.00091) [2022-07-11 06:41:08,823][25689] Fps is (10 sec: 5573.4, 60 sec: 5660.7, 300 sec: 5669.0). Total num frames: 1104373760. Throughput: 0: 5831.0. Samples: 1104378606. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:08,823][25689] Avg episode reward: [(0, '0.930')] [2022-07-11 06:41:08,832][26022] Updated weights on worker 0-0, policy_version 1078490 (0.00095) [2022-07-11 06:41:10,527][26022] Updated weights on worker 0-0, policy_version 1078500 (0.00078) [2022-07-11 06:41:12,348][26022] Updated weights on worker 0-0, policy_version 1078510 (0.00092) [2022-07-11 06:41:13,860][25689] Fps is (10 sec: 5789.6, 60 sec: 5674.8, 300 sec: 5669.4). Total num frames: 1104403456. Throughput: 0: 4996.9. Samples: 1104395768. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:13,861][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 06:41:14,096][26022] Updated weights on worker 0-0, policy_version 1078520 (0.00090) [2022-07-11 06:41:15,753][26022] Updated weights on worker 0-0, policy_version 1078530 (0.00087) [2022-07-11 06:41:17,548][26022] Updated weights on worker 0-0, policy_version 1078540 (0.00082) [2022-07-11 06:41:18,918][25689] Fps is (10 sec: 5680.6, 60 sec: 5678.1, 300 sec: 5661.6). Total num frames: 1104431104. Throughput: 0: 5891.2. Samples: 1104430354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:18,918][25689] Avg episode reward: [(0, '-0.252')] [2022-07-11 06:41:19,360][26022] Updated weights on worker 0-0, policy_version 1078550 (0.00085) [2022-07-11 06:41:21,277][26022] Updated weights on worker 0-0, policy_version 1078560 (0.00093) [2022-07-11 06:41:22,852][26022] Updated weights on worker 0-0, policy_version 1078570 (0.00089) [2022-07-11 06:41:23,967][25689] Fps is (10 sec: 5673.9, 60 sec: 5677.8, 300 sec: 5669.5). Total num frames: 1104460800. Throughput: 0: 5915.4. Samples: 1104464900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:23,968][25689] Avg episode reward: [(0, '-0.092')] [2022-07-11 06:41:24,944][26022] Updated weights on worker 0-0, policy_version 1078580 (0.00090) [2022-07-11 06:41:26,420][26022] Updated weights on worker 0-0, policy_version 1078590 (0.00099) [2022-07-11 06:41:28,440][26022] Updated weights on worker 0-0, policy_version 1078600 (0.00081) [2022-07-11 06:41:28,995][25689] Fps is (10 sec: 5690.2, 60 sec: 5661.6, 300 sec: 5669.0). Total num frames: 1104488448. Throughput: 0: 5982.7. Samples: 1104499292. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:28,999][25689] Avg episode reward: [(0, '-0.127')] [2022-07-11 06:41:30,086][26022] Updated weights on worker 0-0, policy_version 1078610 (0.00083) [2022-07-11 06:41:31,971][26022] Updated weights on worker 0-0, policy_version 1078620 (0.00112) [2022-07-11 06:41:33,652][26022] Updated weights on worker 0-0, policy_version 1078630 (0.00079) [2022-07-11 06:41:34,009][25689] Fps is (10 sec: 5812.3, 60 sec: 5696.2, 300 sec: 5670.2). Total num frames: 1104519168. Throughput: 0: 5984.5. Samples: 1104516348. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:34,010][25689] Avg episode reward: [(0, '-0.526')] [2022-07-11 06:41:35,581][26022] Updated weights on worker 0-0, policy_version 1078640 (0.00089) [2022-07-11 06:41:37,291][26022] Updated weights on worker 0-0, policy_version 1078650 (0.00097) [2022-07-11 06:41:39,058][25689] Fps is (10 sec: 5698.7, 60 sec: 5663.4, 300 sec: 5659.5). Total num frames: 1104545792. Throughput: 0: 5976.8. Samples: 1104550730. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:39,058][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 06:41:39,176][26022] Updated weights on worker 0-0, policy_version 1078660 (0.00091) [2022-07-11 06:41:40,788][26022] Updated weights on worker 0-0, policy_version 1078670 (0.00089) [2022-07-11 06:41:42,307][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:41:42,330][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001078677_1104565248.pth [2022-07-11 06:41:42,331][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001076682_1102522368.pth [2022-07-11 06:41:42,680][26022] Updated weights on worker 0-0, policy_version 1078680 (0.00085) [2022-07-11 06:41:44,061][25689] Fps is (10 sec: 5602.8, 60 sec: 5682.6, 300 sec: 5667.4). Total num frames: 1104575488. Throughput: 0: 6005.0. Samples: 1104585566. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:44,062][25689] Avg episode reward: [(0, '0.952')] [2022-07-11 06:41:44,377][26022] Updated weights on worker 0-0, policy_version 1078690 (0.00085) [2022-07-11 06:41:46,042][26022] Updated weights on worker 0-0, policy_version 1078700 (0.00083) [2022-07-11 06:41:47,939][26022] Updated weights on worker 0-0, policy_version 1078710 (0.00087) [2022-07-11 06:41:49,067][25689] Fps is (10 sec: 5934.0, 60 sec: 5684.5, 300 sec: 5675.3). Total num frames: 1104605184. Throughput: 0: 5168.1. Samples: 1104603022. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:49,067][25689] Avg episode reward: [(0, '0.827')] [2022-07-11 06:41:49,651][26022] Updated weights on worker 0-0, policy_version 1078720 (0.00089) [2022-07-11 06:41:51,404][26022] Updated weights on worker 0-0, policy_version 1078730 (0.00085) [2022-07-11 06:41:53,317][26022] Updated weights on worker 0-0, policy_version 1078740 (0.00080) [2022-07-11 06:41:54,090][25689] Fps is (10 sec: 5819.9, 60 sec: 5736.6, 300 sec: 5676.3). Total num frames: 1104633856. Throughput: 0: 6050.9. Samples: 1104637858. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:54,091][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 06:41:54,859][26022] Updated weights on worker 0-0, policy_version 1078750 (0.00088) [2022-07-11 06:41:56,848][26022] Updated weights on worker 0-0, policy_version 1078760 (0.00081) [2022-07-11 06:41:58,536][26022] Updated weights on worker 0-0, policy_version 1078770 (0.00080) [2022-07-11 06:41:59,189][25689] Fps is (10 sec: 5665.1, 60 sec: 5706.7, 300 sec: 5668.6). Total num frames: 1104662528. Throughput: 0: 6048.2. Samples: 1104672486. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:41:59,191][25689] Avg episode reward: [(0, '0.564')] [2022-07-11 06:42:00,325][26022] Updated weights on worker 0-0, policy_version 1078780 (0.00086) [2022-07-11 06:42:02,663][26022] Updated weights on worker 0-0, policy_version 1078790 (0.00081) [2022-07-11 06:42:04,199][26022] Updated weights on worker 0-0, policy_version 1078800 (0.00088) [2022-07-11 06:42:04,291][25689] Fps is (10 sec: 5621.7, 60 sec: 5719.7, 300 sec: 5675.5). Total num frames: 1104691200. Throughput: 0: 5063.1. Samples: 1104687990. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:04,291][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 06:42:06,160][26022] Updated weights on worker 0-0, policy_version 1078810 (0.00083) [2022-07-11 06:42:08,061][26022] Updated weights on worker 0-0, policy_version 1078820 (0.00077) [2022-07-11 06:42:09,311][25689] Fps is (10 sec: 5564.3, 60 sec: 5705.1, 300 sec: 5675.3). Total num frames: 1104718848. Throughput: 0: 5869.2. Samples: 1104721838. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:09,312][25689] Avg episode reward: [(0, '1.588')] [2022-07-11 06:42:09,788][26022] Updated weights on worker 0-0, policy_version 1078830 (0.00089) [2022-07-11 06:42:11,421][26022] Updated weights on worker 0-0, policy_version 1078840 (0.00082) [2022-07-11 06:42:13,338][26022] Updated weights on worker 0-0, policy_version 1078850 (0.00083) [2022-07-11 06:42:14,333][25689] Fps is (10 sec: 5506.3, 60 sec: 5672.6, 300 sec: 5666.4). Total num frames: 1104746496. Throughput: 0: 5842.6. Samples: 1104756128. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:14,334][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 06:42:14,994][26022] Updated weights on worker 0-0, policy_version 1078860 (0.00089) [2022-07-11 06:42:17,157][26022] Updated weights on worker 0-0, policy_version 1078870 (0.00090) [2022-07-11 06:42:18,535][26022] Updated weights on worker 0-0, policy_version 1078880 (0.00089) [2022-07-11 06:42:19,426][25689] Fps is (10 sec: 5669.1, 60 sec: 5703.2, 300 sec: 5668.2). Total num frames: 1104776192. Throughput: 0: 4980.8. Samples: 1104773282. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:19,426][25689] Avg episode reward: [(0, '-0.583')] [2022-07-11 06:42:20,483][26022] Updated weights on worker 0-0, policy_version 1078890 (0.00085) [2022-07-11 06:42:22,266][26022] Updated weights on worker 0-0, policy_version 1078900 (0.00083) [2022-07-11 06:42:23,970][26022] Updated weights on worker 0-0, policy_version 1078910 (0.00086) [2022-07-11 06:42:24,464][25689] Fps is (10 sec: 5963.6, 60 sec: 5721.1, 300 sec: 5678.0). Total num frames: 1104806912. Throughput: 0: 5950.5. Samples: 1104808030. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:24,464][25689] Avg episode reward: [(0, '-0.555')] [2022-07-11 06:42:25,919][26022] Updated weights on worker 0-0, policy_version 1078920 (0.00083) [2022-07-11 06:42:27,497][26022] Updated weights on worker 0-0, policy_version 1078930 (0.00079) [2022-07-11 06:42:29,415][26022] Updated weights on worker 0-0, policy_version 1078940 (0.00083) [2022-07-11 06:42:29,530][25689] Fps is (10 sec: 5776.6, 60 sec: 5717.6, 300 sec: 5670.8). Total num frames: 1104834560. Throughput: 0: 5964.0. Samples: 1104842426. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:29,531][25689] Avg episode reward: [(0, '-0.461')] [2022-07-11 06:42:31,421][26022] Updated weights on worker 0-0, policy_version 1078950 (0.00057) [2022-07-11 06:42:32,868][26022] Updated weights on worker 0-0, policy_version 1078960 (0.00088) [2022-07-11 06:42:34,585][25689] Fps is (10 sec: 5564.5, 60 sec: 5679.8, 300 sec: 5671.6). Total num frames: 1104863232. Throughput: 0: 5102.7. Samples: 1104859470. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:34,586][25689] Avg episode reward: [(0, '-0.701')] [2022-07-11 06:42:35,051][26022] Updated weights on worker 0-0, policy_version 1078970 (0.00089) [2022-07-11 06:42:36,491][26022] Updated weights on worker 0-0, policy_version 1078980 (0.00093) [2022-07-11 06:42:38,551][26022] Updated weights on worker 0-0, policy_version 1078990 (0.00360) [2022-07-11 06:42:39,641][25689] Fps is (10 sec: 5671.7, 60 sec: 5713.1, 300 sec: 5671.7). Total num frames: 1104891904. Throughput: 0: 5976.2. Samples: 1104894090. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:39,641][25689] Avg episode reward: [(0, '-0.332')] [2022-07-11 06:42:40,142][26022] Updated weights on worker 0-0, policy_version 1079000 (0.00093) [2022-07-11 06:42:42,095][26022] Updated weights on worker 0-0, policy_version 1079010 (0.00088) [2022-07-11 06:42:43,834][26022] Updated weights on worker 0-0, policy_version 1079020 (0.00088) [2022-07-11 06:42:44,642][25689] Fps is (10 sec: 5702.3, 60 sec: 5696.4, 300 sec: 5672.5). Total num frames: 1104920576. Throughput: 0: 5971.3. Samples: 1104928516. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:44,642][25689] Avg episode reward: [(0, '-0.748')] [2022-07-11 06:42:45,439][26022] Updated weights on worker 0-0, policy_version 1079030 (0.00083) [2022-07-11 06:42:47,348][26022] Updated weights on worker 0-0, policy_version 1079040 (0.00081) [2022-07-11 06:42:49,030][26022] Updated weights on worker 0-0, policy_version 1079050 (0.00088) [2022-07-11 06:42:49,645][25689] Fps is (10 sec: 5834.5, 60 sec: 5696.6, 300 sec: 5672.6). Total num frames: 1104950272. Throughput: 0: 5124.2. Samples: 1104945496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:49,645][25689] Avg episode reward: [(0, '0.800')] [2022-07-11 06:42:50,892][26022] Updated weights on worker 0-0, policy_version 1079060 (0.00083) [2022-07-11 06:42:52,590][26022] Updated weights on worker 0-0, policy_version 1079070 (0.00096) [2022-07-11 06:42:54,501][26022] Updated weights on worker 0-0, policy_version 1079080 (0.00085) [2022-07-11 06:42:54,653][25689] Fps is (10 sec: 5727.9, 60 sec: 5681.1, 300 sec: 5673.3). Total num frames: 1104977920. Throughput: 0: 6000.7. Samples: 1104979890. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:54,654][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 06:42:56,207][26022] Updated weights on worker 0-0, policy_version 1079090 (0.00083) [2022-07-11 06:42:58,239][26022] Updated weights on worker 0-0, policy_version 1079100 (0.00079) [2022-07-11 06:42:59,694][25689] Fps is (10 sec: 5706.2, 60 sec: 5703.5, 300 sec: 5680.0). Total num frames: 1105007616. Throughput: 0: 5986.7. Samples: 1105014144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:42:59,694][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 06:42:59,871][26022] Updated weights on worker 0-0, policy_version 1079110 (0.00083) [2022-07-11 06:43:01,983][26022] Updated weights on worker 0-0, policy_version 1079120 (0.00088) [2022-07-11 06:43:03,944][26022] Updated weights on worker 0-0, policy_version 1079130 (0.00080) [2022-07-11 06:43:04,712][25689] Fps is (10 sec: 5599.1, 60 sec: 5677.5, 300 sec: 5673.9). Total num frames: 1105034240. Throughput: 0: 4997.7. Samples: 1105028820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:04,712][25689] Avg episode reward: [(0, '-0.035')] [2022-07-11 06:43:05,663][26022] Updated weights on worker 0-0, policy_version 1079140 (0.00086) [2022-07-11 06:43:07,527][26022] Updated weights on worker 0-0, policy_version 1079150 (0.00088) [2022-07-11 06:43:09,287][26022] Updated weights on worker 0-0, policy_version 1079160 (0.00086) [2022-07-11 06:43:09,721][25689] Fps is (10 sec: 5412.3, 60 sec: 5678.5, 300 sec: 5670.8). Total num frames: 1105061888. Throughput: 0: 5877.1. Samples: 1105063490. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:09,722][25689] Avg episode reward: [(0, '0.468')] [2022-07-11 06:43:11,052][26022] Updated weights on worker 0-0, policy_version 1079170 (0.00086) [2022-07-11 06:43:12,770][26022] Updated weights on worker 0-0, policy_version 1079180 (0.00083) [2022-07-11 06:43:14,566][26022] Updated weights on worker 0-0, policy_version 1079190 (0.00081) [2022-07-11 06:43:14,744][25689] Fps is (10 sec: 5716.0, 60 sec: 5712.3, 300 sec: 5675.8). Total num frames: 1105091584. Throughput: 0: 5883.6. Samples: 1105098096. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:14,745][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 06:43:16,540][26022] Updated weights on worker 0-0, policy_version 1079200 (0.00082) [2022-07-11 06:43:17,927][26022] Updated weights on worker 0-0, policy_version 1079210 (0.00084) [2022-07-11 06:43:19,775][26022] Updated weights on worker 0-0, policy_version 1079220 (0.00093) [2022-07-11 06:43:19,803][25689] Fps is (10 sec: 5891.4, 60 sec: 5715.6, 300 sec: 5678.3). Total num frames: 1105121280. Throughput: 0: 5034.1. Samples: 1105115374. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:19,803][25689] Avg episode reward: [(0, '1.417')] [2022-07-11 06:43:21,561][26022] Updated weights on worker 0-0, policy_version 1079230 (0.00087) [2022-07-11 06:43:23,377][26022] Updated weights on worker 0-0, policy_version 1079240 (0.00079) [2022-07-11 06:43:24,804][25689] Fps is (10 sec: 5801.9, 60 sec: 5685.1, 300 sec: 5678.5). Total num frames: 1105149952. Throughput: 0: 6030.2. Samples: 1105149980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:24,806][25689] Avg episode reward: [(0, '1.238')] [2022-07-11 06:43:25,220][26022] Updated weights on worker 0-0, policy_version 1079250 (0.00083) [2022-07-11 06:43:26,980][26022] Updated weights on worker 0-0, policy_version 1079260 (0.00091) [2022-07-11 06:43:28,831][26022] Updated weights on worker 0-0, policy_version 1079270 (0.00082) [2022-07-11 06:43:29,815][25689] Fps is (10 sec: 5625.2, 60 sec: 5690.3, 300 sec: 5678.6). Total num frames: 1105177600. Throughput: 0: 6003.9. Samples: 1105184128. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:29,815][25689] Avg episode reward: [(0, '1.668')] [2022-07-11 06:43:30,590][26022] Updated weights on worker 0-0, policy_version 1079280 (0.00383) [2022-07-11 06:43:32,399][26022] Updated weights on worker 0-0, policy_version 1079290 (0.00083) [2022-07-11 06:43:34,266][26022] Updated weights on worker 0-0, policy_version 1079300 (0.00083) [2022-07-11 06:43:34,894][25689] Fps is (10 sec: 5683.4, 60 sec: 5705.1, 300 sec: 5672.4). Total num frames: 1105207296. Throughput: 0: 5113.9. Samples: 1105201140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:34,895][25689] Avg episode reward: [(0, '1.598')] [2022-07-11 06:43:36,014][26022] Updated weights on worker 0-0, policy_version 1079310 (0.00085) [2022-07-11 06:43:37,723][26022] Updated weights on worker 0-0, policy_version 1079320 (0.00085) [2022-07-11 06:43:39,723][26022] Updated weights on worker 0-0, policy_version 1079330 (0.00088) [2022-07-11 06:43:39,986][25689] Fps is (10 sec: 5637.9, 60 sec: 5684.6, 300 sec: 5671.4). Total num frames: 1105234944. Throughput: 0: 5953.2. Samples: 1105235528. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:39,986][25689] Avg episode reward: [(0, '1.361')] [2022-07-11 06:43:41,471][26022] Updated weights on worker 0-0, policy_version 1079340 (0.00874) [2022-07-11 06:43:42,572][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:43:42,593][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001079346_1105250304.pth [2022-07-11 06:43:42,593][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001077348_1103204352.pth [2022-07-11 06:43:43,355][26022] Updated weights on worker 0-0, policy_version 1079350 (0.00090) [2022-07-11 06:43:45,022][25689] Fps is (10 sec: 5560.8, 60 sec: 5681.3, 300 sec: 5675.7). Total num frames: 1105263616. Throughput: 0: 5911.7. Samples: 1105269502. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:45,023][25689] Avg episode reward: [(0, '1.277')] [2022-07-11 06:43:45,143][26022] Updated weights on worker 0-0, policy_version 1079360 (0.00093) [2022-07-11 06:43:46,737][26022] Updated weights on worker 0-0, policy_version 1079370 (0.00086) [2022-07-11 06:43:48,823][26022] Updated weights on worker 0-0, policy_version 1079380 (0.00086) [2022-07-11 06:43:50,047][25689] Fps is (10 sec: 5801.4, 60 sec: 5679.3, 300 sec: 5671.9). Total num frames: 1105293312. Throughput: 0: 5060.3. Samples: 1105286508. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:50,047][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 06:43:50,385][26022] Updated weights on worker 0-0, policy_version 1079390 (0.00080) [2022-07-11 06:43:52,363][26022] Updated weights on worker 0-0, policy_version 1079400 (0.00092) [2022-07-11 06:43:54,135][26022] Updated weights on worker 0-0, policy_version 1079410 (0.00086) [2022-07-11 06:43:55,078][25689] Fps is (10 sec: 5702.5, 60 sec: 5677.1, 300 sec: 5677.0). Total num frames: 1105320960. Throughput: 0: 5915.1. Samples: 1105320530. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:43:55,078][25689] Avg episode reward: [(0, '0.573')] [2022-07-11 06:43:55,949][26022] Updated weights on worker 0-0, policy_version 1079420 (0.00080) [2022-07-11 06:43:57,748][26022] Updated weights on worker 0-0, policy_version 1079430 (0.00087) [2022-07-11 06:43:59,536][26022] Updated weights on worker 0-0, policy_version 1079440 (0.00090) [2022-07-11 06:44:00,183][25689] Fps is (10 sec: 5556.3, 60 sec: 5654.2, 300 sec: 5678.7). Total num frames: 1105349632. Throughput: 0: 5908.8. Samples: 1105354870. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:00,183][25689] Avg episode reward: [(0, '0.579')] [2022-07-11 06:44:01,206][26022] Updated weights on worker 0-0, policy_version 1079450 (0.00108) [2022-07-11 06:44:03,465][26022] Updated weights on worker 0-0, policy_version 1079460 (0.00084) [2022-07-11 06:44:05,175][26022] Updated weights on worker 0-0, policy_version 1079470 (0.00096) [2022-07-11 06:44:05,212][25689] Fps is (10 sec: 5557.7, 60 sec: 5670.1, 300 sec: 5674.9). Total num frames: 1105377280. Throughput: 0: 4960.3. Samples: 1105369648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:05,212][25689] Avg episode reward: [(0, '0.163')] [2022-07-11 06:44:07,186][26022] Updated weights on worker 0-0, policy_version 1079480 (0.00084) [2022-07-11 06:44:08,890][26022] Updated weights on worker 0-0, policy_version 1079490 (0.00090) [2022-07-11 06:44:10,237][25689] Fps is (10 sec: 5499.9, 60 sec: 5668.6, 300 sec: 5675.5). Total num frames: 1105404928. Throughput: 0: 5816.9. Samples: 1105403952. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:10,239][25689] Avg episode reward: [(0, '0.282')] [2022-07-11 06:44:10,599][26022] Updated weights on worker 0-0, policy_version 1079500 (0.00090) [2022-07-11 06:44:12,584][26022] Updated weights on worker 0-0, policy_version 1079510 (0.00093) [2022-07-11 06:44:14,131][26022] Updated weights on worker 0-0, policy_version 1079520 (0.00093) [2022-07-11 06:44:15,252][25689] Fps is (10 sec: 5711.4, 60 sec: 5669.4, 300 sec: 5677.6). Total num frames: 1105434624. Throughput: 0: 5826.1. Samples: 1105438066. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:15,254][25689] Avg episode reward: [(0, '0.619')] [2022-07-11 06:44:16,338][26022] Updated weights on worker 0-0, policy_version 1079530 (0.00081) [2022-07-11 06:44:17,889][26022] Updated weights on worker 0-0, policy_version 1079540 (0.00096) [2022-07-11 06:44:19,759][26022] Updated weights on worker 0-0, policy_version 1079550 (0.00459) [2022-07-11 06:44:20,328][25689] Fps is (10 sec: 5682.8, 60 sec: 5633.9, 300 sec: 5676.6). Total num frames: 1105462272. Throughput: 0: 5819.6. Samples: 1105472104. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:20,328][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 06:44:21,553][26022] Updated weights on worker 0-0, policy_version 1079560 (0.00082) [2022-07-11 06:44:23,264][26022] Updated weights on worker 0-0, policy_version 1079570 (0.00083) [2022-07-11 06:44:25,087][26022] Updated weights on worker 0-0, policy_version 1079580 (0.00086) [2022-07-11 06:44:25,382][25689] Fps is (10 sec: 5559.5, 60 sec: 5629.0, 300 sec: 5676.0). Total num frames: 1105490944. Throughput: 0: 5926.1. Samples: 1105489182. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:25,383][25689] Avg episode reward: [(0, '1.035')] [2022-07-11 06:44:26,906][26022] Updated weights on worker 0-0, policy_version 1079590 (0.00086) [2022-07-11 06:44:28,614][26022] Updated weights on worker 0-0, policy_version 1079600 (0.00084) [2022-07-11 06:44:30,414][25689] Fps is (10 sec: 5685.3, 60 sec: 5643.9, 300 sec: 5676.1). Total num frames: 1105519616. Throughput: 0: 5916.5. Samples: 1105523330. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 06:44:30,415][25689] Avg episode reward: [(0, '0.538')] [2022-07-11 06:44:30,550][26022] Updated weights on worker 0-0, policy_version 1079610 (0.00081) [2022-07-11 06:44:32,372][26022] Updated weights on worker 0-0, policy_version 1079620 (0.00093) [2022-07-11 06:44:34,151][26022] Updated weights on worker 0-0, policy_version 1079630 (0.00084) [2022-07-11 06:44:35,443][25689] Fps is (10 sec: 5700.0, 60 sec: 5631.7, 300 sec: 5674.5). Total num frames: 1105548288. Throughput: 0: 5883.5. Samples: 1105556858. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:44:35,443][25689] Avg episode reward: [(0, '1.101')] [2022-07-11 06:44:36,100][26022] Updated weights on worker 0-0, policy_version 1079640 (0.00079) [2022-07-11 06:44:37,750][26022] Updated weights on worker 0-0, policy_version 1079650 (0.00092) [2022-07-11 06:44:39,698][26022] Updated weights on worker 0-0, policy_version 1079660 (0.00083) [2022-07-11 06:44:40,583][25689] Fps is (10 sec: 5538.8, 60 sec: 5627.2, 300 sec: 5672.3). Total num frames: 1105575936. Throughput: 0: 5028.5. Samples: 1105573956. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:44:40,583][25689] Avg episode reward: [(0, '0.627')] [2022-07-11 06:44:41,231][26022] Updated weights on worker 0-0, policy_version 1079670 (0.00091) [2022-07-11 06:44:43,253][26022] Updated weights on worker 0-0, policy_version 1079680 (0.00087) [2022-07-11 06:44:44,956][26022] Updated weights on worker 0-0, policy_version 1079690 (0.00097) [2022-07-11 06:44:45,606][25689] Fps is (10 sec: 5541.7, 60 sec: 5628.5, 300 sec: 5672.5). Total num frames: 1105604608. Throughput: 0: 5889.7. Samples: 1105608292. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:44:45,607][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 06:44:46,871][26022] Updated weights on worker 0-0, policy_version 1079700 (0.00081) [2022-07-11 06:44:48,615][26022] Updated weights on worker 0-0, policy_version 1079710 (0.00088) [2022-07-11 06:44:50,176][26022] Updated weights on worker 0-0, policy_version 1079720 (0.00089) [2022-07-11 06:44:50,632][25689] Fps is (10 sec: 5807.9, 60 sec: 5628.3, 300 sec: 5668.7). Total num frames: 1105634304. Throughput: 0: 5908.3. Samples: 1105642784. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:44:50,639][25689] Avg episode reward: [(0, '0.694')] [2022-07-11 06:44:52,205][26022] Updated weights on worker 0-0, policy_version 1079730 (0.00092) [2022-07-11 06:44:54,014][26022] Updated weights on worker 0-0, policy_version 1079740 (0.00081) [2022-07-11 06:44:55,659][25689] Fps is (10 sec: 5805.8, 60 sec: 5645.6, 300 sec: 5672.9). Total num frames: 1105662976. Throughput: 0: 5095.6. Samples: 1105659872. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:44:55,660][25689] Avg episode reward: [(0, '1.658')] [2022-07-11 06:44:55,776][26022] Updated weights on worker 0-0, policy_version 1079750 (0.00093) [2022-07-11 06:44:57,677][26022] Updated weights on worker 0-0, policy_version 1079760 (0.00081) [2022-07-11 06:44:59,320][26022] Updated weights on worker 0-0, policy_version 1079770 (0.00128) [2022-07-11 06:45:00,771][25689] Fps is (10 sec: 5757.2, 60 sec: 5661.9, 300 sec: 5688.5). Total num frames: 1105692672. Throughput: 0: 5952.0. Samples: 1105694114. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:00,772][25689] Avg episode reward: [(0, '0.716')] [2022-07-11 06:45:01,311][26022] Updated weights on worker 0-0, policy_version 1079780 (0.00081) [2022-07-11 06:45:03,323][26022] Updated weights on worker 0-0, policy_version 1079790 (0.00081) [2022-07-11 06:45:05,177][26022] Updated weights on worker 0-0, policy_version 1079800 (0.00600) [2022-07-11 06:45:05,774][25689] Fps is (10 sec: 5466.7, 60 sec: 5630.4, 300 sec: 5671.5). Total num frames: 1105718272. Throughput: 0: 5843.0. Samples: 1105726136. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:05,775][25689] Avg episode reward: [(0, '0.095')] [2022-07-11 06:45:06,830][26022] Updated weights on worker 0-0, policy_version 1079810 (0.00093) [2022-07-11 06:45:08,860][26022] Updated weights on worker 0-0, policy_version 1079820 (0.00091) [2022-07-11 06:45:10,607][26022] Updated weights on worker 0-0, policy_version 1079830 (0.00087) [2022-07-11 06:45:10,814][25689] Fps is (10 sec: 5403.6, 60 sec: 5646.0, 300 sec: 5670.9). Total num frames: 1105746944. Throughput: 0: 4976.7. Samples: 1105743224. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:10,816][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 06:45:12,359][26022] Updated weights on worker 0-0, policy_version 1079840 (0.00095) [2022-07-11 06:45:14,223][26022] Updated weights on worker 0-0, policy_version 1079850 (0.00092) [2022-07-11 06:45:15,828][25689] Fps is (10 sec: 5703.4, 60 sec: 5629.1, 300 sec: 5675.8). Total num frames: 1105775616. Throughput: 0: 5835.5. Samples: 1105777572. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:15,830][25689] Avg episode reward: [(0, '-0.129')] [2022-07-11 06:45:15,852][26022] Updated weights on worker 0-0, policy_version 1079860 (0.01003) [2022-07-11 06:45:17,765][26022] Updated weights on worker 0-0, policy_version 1079870 (0.00085) [2022-07-11 06:45:19,493][26022] Updated weights on worker 0-0, policy_version 1079880 (0.00090) [2022-07-11 06:45:20,974][25689] Fps is (10 sec: 5744.8, 60 sec: 5656.4, 300 sec: 5673.9). Total num frames: 1105805312. Throughput: 0: 5832.0. Samples: 1105811944. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:20,975][25689] Avg episode reward: [(0, '-0.456')] [2022-07-11 06:45:21,214][26022] Updated weights on worker 0-0, policy_version 1079890 (0.00082) [2022-07-11 06:45:23,183][26022] Updated weights on worker 0-0, policy_version 1079900 (0.00088) [2022-07-11 06:45:25,005][26022] Updated weights on worker 0-0, policy_version 1079910 (0.00088) [2022-07-11 06:45:25,983][25689] Fps is (10 sec: 5647.3, 60 sec: 5643.8, 300 sec: 5671.0). Total num frames: 1105832960. Throughput: 0: 5088.7. Samples: 1105828972. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:25,983][25689] Avg episode reward: [(0, '0.308')] [2022-07-11 06:45:26,820][26022] Updated weights on worker 0-0, policy_version 1079920 (0.00090) [2022-07-11 06:45:28,486][26022] Updated weights on worker 0-0, policy_version 1079930 (0.00080) [2022-07-11 06:45:30,277][26022] Updated weights on worker 0-0, policy_version 1079940 (0.00086) [2022-07-11 06:45:31,002][25689] Fps is (10 sec: 5718.7, 60 sec: 5661.9, 300 sec: 5674.5). Total num frames: 1105862656. Throughput: 0: 5945.3. Samples: 1105863246. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:31,004][25689] Avg episode reward: [(0, '0.881')] [2022-07-11 06:45:32,089][26022] Updated weights on worker 0-0, policy_version 1079950 (0.00090) [2022-07-11 06:45:33,935][26022] Updated weights on worker 0-0, policy_version 1079960 (0.00086) [2022-07-11 06:45:35,845][26022] Updated weights on worker 0-0, policy_version 1079970 (0.00084) [2022-07-11 06:45:36,017][25689] Fps is (10 sec: 5714.5, 60 sec: 5646.2, 300 sec: 5671.9). Total num frames: 1105890304. Throughput: 0: 5936.3. Samples: 1105897420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:36,018][25689] Avg episode reward: [(0, '0.948')] [2022-07-11 06:45:37,305][26022] Updated weights on worker 0-0, policy_version 1079980 (0.00081) [2022-07-11 06:45:39,342][26022] Updated weights on worker 0-0, policy_version 1079990 (0.00086) [2022-07-11 06:45:40,871][26022] Updated weights on worker 0-0, policy_version 1080000 (0.00093) [2022-07-11 06:45:41,057][25689] Fps is (10 sec: 5703.1, 60 sec: 5689.4, 300 sec: 5675.1). Total num frames: 1105920000. Throughput: 0: 5108.2. Samples: 1105914528. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:41,057][25689] Avg episode reward: [(0, '1.411')] [2022-07-11 06:45:42,699][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:45:42,707][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001080009_1105929216.pth [2022-07-11 06:45:42,708][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001078012_1103884288.pth [2022-07-11 06:45:42,915][26022] Updated weights on worker 0-0, policy_version 1080010 (0.00089) [2022-07-11 06:45:44,574][26022] Updated weights on worker 0-0, policy_version 1080020 (0.00084) [2022-07-11 06:45:46,075][25689] Fps is (10 sec: 5803.5, 60 sec: 5689.9, 300 sec: 5671.8). Total num frames: 1105948672. Throughput: 0: 5964.0. Samples: 1105948802. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:46,075][25689] Avg episode reward: [(0, '1.814')] [2022-07-11 06:45:46,403][26022] Updated weights on worker 0-0, policy_version 1080030 (0.00079) [2022-07-11 06:45:48,195][26022] Updated weights on worker 0-0, policy_version 1080040 (0.00080) [2022-07-11 06:45:50,043][26022] Updated weights on worker 0-0, policy_version 1080050 (0.00092) [2022-07-11 06:45:51,081][25689] Fps is (10 sec: 5516.1, 60 sec: 5641.0, 300 sec: 5675.8). Total num frames: 1105975296. Throughput: 0: 5958.8. Samples: 1105982894. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:51,082][25689] Avg episode reward: [(0, '1.906')] [2022-07-11 06:45:51,891][26022] Updated weights on worker 0-0, policy_version 1080060 (0.00055) [2022-07-11 06:45:53,745][26022] Updated weights on worker 0-0, policy_version 1080070 (0.00086) [2022-07-11 06:45:55,419][26022] Updated weights on worker 0-0, policy_version 1080080 (0.00082) [2022-07-11 06:45:56,096][25689] Fps is (10 sec: 5620.0, 60 sec: 5659.0, 300 sec: 5674.8). Total num frames: 1106004992. Throughput: 0: 5092.1. Samples: 1105999664. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:45:56,097][25689] Avg episode reward: [(0, '2.190')] [2022-07-11 06:45:57,320][26022] Updated weights on worker 0-0, policy_version 1080090 (0.00079) [2022-07-11 06:45:59,076][26022] Updated weights on worker 0-0, policy_version 1080100 (0.00102) [2022-07-11 06:46:00,951][26022] Updated weights on worker 0-0, policy_version 1080110 (0.00084) [2022-07-11 06:46:01,213][25689] Fps is (10 sec: 5760.9, 60 sec: 5641.6, 300 sec: 5677.1). Total num frames: 1106033664. Throughput: 0: 5914.2. Samples: 1106033736. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:01,213][25689] Avg episode reward: [(0, '2.485')] [2022-07-11 06:46:03,164][26022] Updated weights on worker 0-0, policy_version 1080120 (0.00081) [2022-07-11 06:46:05,027][26022] Updated weights on worker 0-0, policy_version 1080130 (0.00087) [2022-07-11 06:46:06,234][25689] Fps is (10 sec: 5353.4, 60 sec: 5640.0, 300 sec: 5667.3). Total num frames: 1106059264. Throughput: 0: 5767.8. Samples: 1106065076. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:06,235][25689] Avg episode reward: [(0, '1.637')] [2022-07-11 06:46:06,959][26022] Updated weights on worker 0-0, policy_version 1080140 (0.00084) [2022-07-11 06:46:08,611][26022] Updated weights on worker 0-0, policy_version 1080150 (0.00084) [2022-07-11 06:46:10,283][26022] Updated weights on worker 0-0, policy_version 1080160 (0.00087) [2022-07-11 06:46:11,260][25689] Fps is (10 sec: 5503.4, 60 sec: 5658.2, 300 sec: 5667.5). Total num frames: 1106088960. Throughput: 0: 4927.3. Samples: 1106082324. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:11,261][25689] Avg episode reward: [(0, '0.788')] [2022-07-11 06:46:12,290][26022] Updated weights on worker 0-0, policy_version 1080170 (0.00087) [2022-07-11 06:46:14,149][26022] Updated weights on worker 0-0, policy_version 1080180 (0.00086) [2022-07-11 06:46:15,743][26022] Updated weights on worker 0-0, policy_version 1080190 (0.00091) [2022-07-11 06:46:16,280][25689] Fps is (10 sec: 5708.1, 60 sec: 5640.7, 300 sec: 5668.2). Total num frames: 1106116608. Throughput: 0: 5799.8. Samples: 1106116726. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:16,281][25689] Avg episode reward: [(0, '0.734')] [2022-07-11 06:46:17,636][26022] Updated weights on worker 0-0, policy_version 1080200 (0.00089) [2022-07-11 06:46:19,603][26022] Updated weights on worker 0-0, policy_version 1080210 (0.00093) [2022-07-11 06:46:21,357][25689] Fps is (10 sec: 5375.5, 60 sec: 5596.3, 300 sec: 5657.3). Total num frames: 1106143232. Throughput: 0: 5734.8. Samples: 1106149256. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:21,357][25689] Avg episode reward: [(0, '-0.011')] [2022-07-11 06:46:21,448][26022] Updated weights on worker 0-0, policy_version 1080220 (0.00084) [2022-07-11 06:46:23,233][26022] Updated weights on worker 0-0, policy_version 1080230 (0.00313) [2022-07-11 06:46:25,069][26022] Updated weights on worker 0-0, policy_version 1080240 (0.00089) [2022-07-11 06:46:26,374][25689] Fps is (10 sec: 5579.8, 60 sec: 5629.5, 300 sec: 5664.4). Total num frames: 1106172928. Throughput: 0: 5030.2. Samples: 1106166382. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:26,374][25689] Avg episode reward: [(0, '-0.269')] [2022-07-11 06:46:26,842][26022] Updated weights on worker 0-0, policy_version 1080250 (0.00081) [2022-07-11 06:46:28,724][26022] Updated weights on worker 0-0, policy_version 1080260 (0.00057) [2022-07-11 06:46:30,602][26022] Updated weights on worker 0-0, policy_version 1080270 (0.00089) [2022-07-11 06:46:31,376][25689] Fps is (10 sec: 5825.4, 60 sec: 5614.0, 300 sec: 5657.8). Total num frames: 1106201600. Throughput: 0: 5876.9. Samples: 1106200542. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:31,377][25689] Avg episode reward: [(0, '-0.413')] [2022-07-11 06:46:32,241][26022] Updated weights on worker 0-0, policy_version 1080280 (0.00085) [2022-07-11 06:46:33,959][26022] Updated weights on worker 0-0, policy_version 1080290 (0.00085) [2022-07-11 06:46:35,900][26022] Updated weights on worker 0-0, policy_version 1080300 (0.00055) [2022-07-11 06:46:36,403][25689] Fps is (10 sec: 5615.7, 60 sec: 5613.0, 300 sec: 5661.6). Total num frames: 1106229248. Throughput: 0: 5847.3. Samples: 1106234388. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:36,403][25689] Avg episode reward: [(0, '0.048')] [2022-07-11 06:46:37,767][26022] Updated weights on worker 0-0, policy_version 1080310 (0.00090) [2022-07-11 06:46:39,437][26022] Updated weights on worker 0-0, policy_version 1080320 (0.00082) [2022-07-11 06:46:41,355][26022] Updated weights on worker 0-0, policy_version 1080330 (0.00083) [2022-07-11 06:46:41,505][25689] Fps is (10 sec: 5560.4, 60 sec: 5590.2, 300 sec: 5656.3). Total num frames: 1106257920. Throughput: 0: 5067.4. Samples: 1106251356. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:41,505][25689] Avg episode reward: [(0, '1.050')] [2022-07-11 06:46:43,201][26022] Updated weights on worker 0-0, policy_version 1080340 (0.00082) [2022-07-11 06:46:44,973][26022] Updated weights on worker 0-0, policy_version 1080350 (0.00091) [2022-07-11 06:46:46,545][25689] Fps is (10 sec: 5754.6, 60 sec: 5605.1, 300 sec: 5655.6). Total num frames: 1106287616. Throughput: 0: 5907.3. Samples: 1106285542. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:46,546][25689] Avg episode reward: [(0, '1.738')] [2022-07-11 06:46:46,879][26022] Updated weights on worker 0-0, policy_version 1080360 (0.00092) [2022-07-11 06:46:48,515][26022] Updated weights on worker 0-0, policy_version 1080370 (0.00087) [2022-07-11 06:46:50,198][26022] Updated weights on worker 0-0, policy_version 1080380 (0.00617) [2022-07-11 06:46:51,607][25689] Fps is (10 sec: 5777.6, 60 sec: 5633.8, 300 sec: 5654.9). Total num frames: 1106316288. Throughput: 0: 5913.3. Samples: 1106320176. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:51,608][25689] Avg episode reward: [(0, '0.022')] [2022-07-11 06:46:51,970][26022] Updated weights on worker 0-0, policy_version 1080390 (0.00086) [2022-07-11 06:46:53,795][26022] Updated weights on worker 0-0, policy_version 1080400 (0.00093) [2022-07-11 06:46:55,469][26022] Updated weights on worker 0-0, policy_version 1080410 (0.00084) [2022-07-11 06:46:56,614][25689] Fps is (10 sec: 5695.2, 60 sec: 5617.6, 300 sec: 5656.7). Total num frames: 1106344960. Throughput: 0: 5948.5. Samples: 1106354618. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:46:56,615][25689] Avg episode reward: [(0, '0.185')] [2022-07-11 06:46:57,521][26022] Updated weights on worker 0-0, policy_version 1080420 (0.00086) [2022-07-11 06:46:59,343][26022] Updated weights on worker 0-0, policy_version 1080430 (0.00071) [2022-07-11 06:47:01,043][26022] Updated weights on worker 0-0, policy_version 1080440 (0.00090) [2022-07-11 06:47:01,666][25689] Fps is (10 sec: 5599.0, 60 sec: 5606.7, 300 sec: 5654.2). Total num frames: 1106372608. Throughput: 0: 5963.7. Samples: 1106371594. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:01,675][25689] Avg episode reward: [(0, '0.400')] [2022-07-11 06:47:03,392][26022] Updated weights on worker 0-0, policy_version 1080450 (0.00086) [2022-07-11 06:47:04,883][26022] Updated weights on worker 0-0, policy_version 1080460 (0.00085) [2022-07-11 06:47:06,686][25689] Fps is (10 sec: 5388.5, 60 sec: 5623.8, 300 sec: 5650.7). Total num frames: 1106399232. Throughput: 0: 5837.1. Samples: 1106403106. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:06,687][25689] Avg episode reward: [(0, '0.568')] [2022-07-11 06:47:06,871][26022] Updated weights on worker 0-0, policy_version 1080470 (0.00105) [2022-07-11 06:47:08,900][26022] Updated weights on worker 0-0, policy_version 1080480 (0.00608) [2022-07-11 06:47:10,275][26022] Updated weights on worker 0-0, policy_version 1080490 (0.00079) [2022-07-11 06:47:11,730][25689] Fps is (10 sec: 5494.6, 60 sec: 5605.2, 300 sec: 5653.7). Total num frames: 1106427904. Throughput: 0: 5840.1. Samples: 1106437694. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:11,731][25689] Avg episode reward: [(0, '-0.754')] [2022-07-11 06:47:12,468][26022] Updated weights on worker 0-0, policy_version 1080500 (0.00080) [2022-07-11 06:47:13,795][26022] Updated weights on worker 0-0, policy_version 1080510 (0.00085) [2022-07-11 06:47:15,975][26022] Updated weights on worker 0-0, policy_version 1080520 (0.00086) [2022-07-11 06:47:16,758][25689] Fps is (10 sec: 5794.9, 60 sec: 5638.2, 300 sec: 5655.0). Total num frames: 1106457600. Throughput: 0: 4977.0. Samples: 1106454876. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:16,759][25689] Avg episode reward: [(0, '-1.528')] [2022-07-11 06:47:17,464][26022] Updated weights on worker 0-0, policy_version 1080530 (0.00085) [2022-07-11 06:47:19,338][26022] Updated weights on worker 0-0, policy_version 1080540 (0.00083) [2022-07-11 06:47:21,384][26022] Updated weights on worker 0-0, policy_version 1080550 (0.00088) [2022-07-11 06:47:21,812][25689] Fps is (10 sec: 5687.9, 60 sec: 5657.3, 300 sec: 5644.4). Total num frames: 1106485248. Throughput: 0: 5827.2. Samples: 1106488986. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:21,813][25689] Avg episode reward: [(0, '-0.514')] [2022-07-11 06:47:22,939][26022] Updated weights on worker 0-0, policy_version 1080560 (0.00088) [2022-07-11 06:47:25,022][26022] Updated weights on worker 0-0, policy_version 1080570 (0.00087) [2022-07-11 06:47:26,452][26022] Updated weights on worker 0-0, policy_version 1080580 (0.00085) [2022-07-11 06:47:26,865][25689] Fps is (10 sec: 5674.0, 60 sec: 5654.0, 300 sec: 5651.5). Total num frames: 1106514944. Throughput: 0: 5952.6. Samples: 1106523222. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:26,865][25689] Avg episode reward: [(0, '-0.528')] [2022-07-11 06:47:28,527][26022] Updated weights on worker 0-0, policy_version 1080590 (0.00084) [2022-07-11 06:47:30,166][26022] Updated weights on worker 0-0, policy_version 1080600 (0.00082) [2022-07-11 06:47:31,867][25689] Fps is (10 sec: 5702.9, 60 sec: 5637.1, 300 sec: 5649.1). Total num frames: 1106542592. Throughput: 0: 5091.9. Samples: 1106540230. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:31,868][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 06:47:32,239][26022] Updated weights on worker 0-0, policy_version 1080610 (0.00084) [2022-07-11 06:47:33,935][26022] Updated weights on worker 0-0, policy_version 1080620 (0.00085) [2022-07-11 06:47:35,715][26022] Updated weights on worker 0-0, policy_version 1080630 (0.00080) [2022-07-11 06:47:36,887][25689] Fps is (10 sec: 5619.7, 60 sec: 5654.6, 300 sec: 5649.7). Total num frames: 1106571264. Throughput: 0: 5937.3. Samples: 1106574384. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:36,887][25689] Avg episode reward: [(0, '-0.720')] [2022-07-11 06:47:37,444][26022] Updated weights on worker 0-0, policy_version 1080640 (0.00083) [2022-07-11 06:47:39,443][26022] Updated weights on worker 0-0, policy_version 1080650 (0.00084) [2022-07-11 06:47:41,183][26022] Updated weights on worker 0-0, policy_version 1080660 (0.00077) [2022-07-11 06:47:42,002][25689] Fps is (10 sec: 5658.0, 60 sec: 5653.4, 300 sec: 5647.6). Total num frames: 1106599936. Throughput: 0: 5902.3. Samples: 1106608154. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:42,004][25689] Avg episode reward: [(0, '-0.326')] [2022-07-11 06:47:42,840][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:47:42,849][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001080669_1106605056.pth [2022-07-11 06:47:42,850][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001078677_1104565248.pth [2022-07-11 06:47:42,927][26022] Updated weights on worker 0-0, policy_version 1080670 (0.00080) [2022-07-11 06:47:44,627][26022] Updated weights on worker 0-0, policy_version 1080680 (0.00090) [2022-07-11 06:47:46,547][26022] Updated weights on worker 0-0, policy_version 1080690 (0.00085) [2022-07-11 06:47:47,019][25689] Fps is (10 sec: 5558.7, 60 sec: 5621.8, 300 sec: 5640.4). Total num frames: 1106627584. Throughput: 0: 5066.3. Samples: 1106625328. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:47,019][25689] Avg episode reward: [(0, '-0.223')] [2022-07-11 06:47:48,257][26022] Updated weights on worker 0-0, policy_version 1080700 (0.00084) [2022-07-11 06:47:50,191][26022] Updated weights on worker 0-0, policy_version 1080710 (0.00087) [2022-07-11 06:47:51,719][26022] Updated weights on worker 0-0, policy_version 1080720 (0.00084) [2022-07-11 06:47:52,027][25689] Fps is (10 sec: 5822.6, 60 sec: 5660.7, 300 sec: 5650.8). Total num frames: 1106658304. Throughput: 0: 5938.5. Samples: 1106659948. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:52,027][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 06:47:53,788][26022] Updated weights on worker 0-0, policy_version 1080730 (0.00051) [2022-07-11 06:47:55,586][26022] Updated weights on worker 0-0, policy_version 1080740 (0.00087) [2022-07-11 06:47:57,051][25689] Fps is (10 sec: 5818.1, 60 sec: 5642.1, 300 sec: 5644.2). Total num frames: 1106685952. Throughput: 0: 5953.3. Samples: 1106694428. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:47:57,052][25689] Avg episode reward: [(0, '0.241')] [2022-07-11 06:47:57,299][26022] Updated weights on worker 0-0, policy_version 1080750 (0.00086) [2022-07-11 06:47:59,187][26022] Updated weights on worker 0-0, policy_version 1080760 (0.00087) [2022-07-11 06:48:00,728][26022] Updated weights on worker 0-0, policy_version 1080770 (0.00089) [2022-07-11 06:48:02,115][25689] Fps is (10 sec: 5481.3, 60 sec: 5641.0, 300 sec: 5646.8). Total num frames: 1106713600. Throughput: 0: 5141.3. Samples: 1106711560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:48:02,117][25689] Avg episode reward: [(0, '-0.872')] [2022-07-11 06:48:03,192][26022] Updated weights on worker 0-0, policy_version 1080780 (0.00095) [2022-07-11 06:48:04,795][26022] Updated weights on worker 0-0, policy_version 1080790 (0.00086) [2022-07-11 06:48:06,592][26022] Updated weights on worker 0-0, policy_version 1080800 (0.00079) [2022-07-11 06:48:07,166][25689] Fps is (10 sec: 5466.6, 60 sec: 5655.0, 300 sec: 5646.0). Total num frames: 1106741248. Throughput: 0: 5856.4. Samples: 1106743320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:48:07,168][25689] Avg episode reward: [(0, '-0.832')] [2022-07-11 06:48:08,670][26022] Updated weights on worker 0-0, policy_version 1080810 (0.00088) [2022-07-11 06:48:10,067][26022] Updated weights on worker 0-0, policy_version 1080820 (0.00082) [2022-07-11 06:48:12,159][26022] Updated weights on worker 0-0, policy_version 1080830 (0.00086) [2022-07-11 06:48:12,201][25689] Fps is (10 sec: 5583.8, 60 sec: 5655.9, 300 sec: 5642.3). Total num frames: 1106769920. Throughput: 0: 5818.8. Samples: 1106777340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:48:12,202][25689] Avg episode reward: [(0, '-1.284')] [2022-07-11 06:48:13,739][26022] Updated weights on worker 0-0, policy_version 1080840 (0.00097) [2022-07-11 06:48:15,701][26022] Updated weights on worker 0-0, policy_version 1080850 (0.00085) [2022-07-11 06:48:17,203][25689] Fps is (10 sec: 5815.4, 60 sec: 5658.3, 300 sec: 5643.4). Total num frames: 1106799616. Throughput: 0: 4967.7. Samples: 1106794538. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:48:17,203][25689] Avg episode reward: [(0, '-2.768')] [2022-07-11 06:48:17,456][26022] Updated weights on worker 0-0, policy_version 1080860 (0.00084) [2022-07-11 06:48:19,224][26022] Updated weights on worker 0-0, policy_version 1080870 (0.00083) [2022-07-11 06:48:21,173][26022] Updated weights on worker 0-0, policy_version 1080880 (0.00087) [2022-07-11 06:48:22,341][25689] Fps is (10 sec: 5756.5, 60 sec: 5667.4, 300 sec: 5640.8). Total num frames: 1106828288. Throughput: 0: 5791.7. Samples: 1106828702. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:48:22,341][25689] Avg episode reward: [(0, '-3.098')] [2022-07-11 06:48:22,915][26022] Updated weights on worker 0-0, policy_version 1080890 (0.00088) [2022-07-11 06:48:24,575][26022] Updated weights on worker 0-0, policy_version 1080900 (0.00083) [2022-07-11 06:48:26,358][26022] Updated weights on worker 0-0, policy_version 1080910 (0.00088) [2022-07-11 06:48:27,356][25689] Fps is (10 sec: 5647.8, 60 sec: 5654.0, 300 sec: 5644.1). Total num frames: 1106856960. Throughput: 0: 5937.0. Samples: 1106863190. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2022-07-11 06:48:27,357][25689] Avg episode reward: [(0, '-3.905')] [2022-07-11 06:48:28,217][26022] Updated weights on worker 0-0, policy_version 1080920 (0.00093) [2022-07-11 06:48:30,190][26022] Updated weights on worker 0-0, policy_version 1080930 (0.00087) [2022-07-11 06:48:31,703][26022] Updated weights on worker 0-0, policy_version 1080940 (0.00908) [2022-07-11 06:48:32,370][25689] Fps is (10 sec: 5615.3, 60 sec: 5652.9, 300 sec: 5638.5). Total num frames: 1106884608. Throughput: 0: 5110.2. Samples: 1106880408. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:48:32,371][25689] Avg episode reward: [(0, '-3.298')] [2022-07-11 06:48:33,584][26022] Updated weights on worker 0-0, policy_version 1080950 (0.00085) [2022-07-11 06:48:35,345][26022] Updated weights on worker 0-0, policy_version 1080960 (0.00084) [2022-07-11 06:48:37,193][26022] Updated weights on worker 0-0, policy_version 1080970 (0.00101) [2022-07-11 06:48:37,431][25689] Fps is (10 sec: 5692.0, 60 sec: 5665.9, 300 sec: 5646.0). Total num frames: 1106914304. Throughput: 0: 5933.1. Samples: 1106914552. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:48:37,431][25689] Avg episode reward: [(0, '-2.573')] [2022-07-11 06:48:39,170][26022] Updated weights on worker 0-0, policy_version 1080980 (0.00087) [2022-07-11 06:48:40,747][26022] Updated weights on worker 0-0, policy_version 1080990 (0.00090) [2022-07-11 06:48:42,542][25689] Fps is (10 sec: 5738.1, 60 sec: 5666.3, 300 sec: 5644.5). Total num frames: 1106942976. Throughput: 0: 5936.7. Samples: 1106948634. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:48:42,543][25689] Avg episode reward: [(0, '-0.960')] [2022-07-11 06:48:42,551][26022] Updated weights on worker 0-0, policy_version 1081000 (0.00087) [2022-07-11 06:48:44,449][26022] Updated weights on worker 0-0, policy_version 1081010 (0.00086) [2022-07-11 06:48:46,287][26022] Updated weights on worker 0-0, policy_version 1081020 (0.00095) [2022-07-11 06:48:47,556][25689] Fps is (10 sec: 5562.4, 60 sec: 5666.6, 300 sec: 5637.9). Total num frames: 1106970624. Throughput: 0: 5067.9. Samples: 1106965560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:48:47,557][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 06:48:47,891][26022] Updated weights on worker 0-0, policy_version 1081030 (0.00092) [2022-07-11 06:48:49,902][26022] Updated weights on worker 0-0, policy_version 1081040 (0.00093) [2022-07-11 06:48:51,603][26022] Updated weights on worker 0-0, policy_version 1081050 (0.00086) [2022-07-11 06:48:52,632][25689] Fps is (10 sec: 5683.7, 60 sec: 5643.4, 300 sec: 5643.9). Total num frames: 1107000320. Throughput: 0: 5908.6. Samples: 1107000124. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:48:52,632][25689] Avg episode reward: [(0, '-0.412')] [2022-07-11 06:48:53,413][26022] Updated weights on worker 0-0, policy_version 1081060 (0.00097) [2022-07-11 06:48:55,195][26022] Updated weights on worker 0-0, policy_version 1081070 (0.00086) [2022-07-11 06:48:57,038][26022] Updated weights on worker 0-0, policy_version 1081080 (0.00086) [2022-07-11 06:48:57,659][25689] Fps is (10 sec: 5878.5, 60 sec: 5676.8, 300 sec: 5648.8). Total num frames: 1107030016. Throughput: 0: 5917.2. Samples: 1107034248. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:48:57,660][25689] Avg episode reward: [(0, '0.460')] [2022-07-11 06:48:58,807][26022] Updated weights on worker 0-0, policy_version 1081090 (0.00091) [2022-07-11 06:49:00,485][26022] Updated weights on worker 0-0, policy_version 1081100 (0.00083) [2022-07-11 06:49:02,600][26022] Updated weights on worker 0-0, policy_version 1081110 (0.00087) [2022-07-11 06:49:02,806][25689] Fps is (10 sec: 5636.4, 60 sec: 5669.1, 300 sec: 5646.6). Total num frames: 1107057664. Throughput: 0: 5080.7. Samples: 1107051584. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:02,808][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 06:49:04,360][26022] Updated weights on worker 0-0, policy_version 1081120 (0.00090) [2022-07-11 06:49:06,166][26022] Updated weights on worker 0-0, policy_version 1081130 (0.00090) [2022-07-11 06:49:07,872][25689] Fps is (10 sec: 5514.5, 60 sec: 5684.6, 300 sec: 5649.2). Total num frames: 1107086336. Throughput: 0: 5826.9. Samples: 1107083942. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:07,874][25689] Avg episode reward: [(0, '0.776')] [2022-07-11 06:49:08,084][26022] Updated weights on worker 0-0, policy_version 1081140 (0.00093) [2022-07-11 06:49:09,979][26022] Updated weights on worker 0-0, policy_version 1081150 (0.00090) [2022-07-11 06:49:11,546][26022] Updated weights on worker 0-0, policy_version 1081160 (0.00089) [2022-07-11 06:49:12,950][25689] Fps is (10 sec: 5451.1, 60 sec: 5646.9, 300 sec: 5637.7). Total num frames: 1107112960. Throughput: 0: 5792.3. Samples: 1107117812. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:12,950][25689] Avg episode reward: [(0, '0.627')] [2022-07-11 06:49:13,597][26022] Updated weights on worker 0-0, policy_version 1081170 (0.00088) [2022-07-11 06:49:15,270][26022] Updated weights on worker 0-0, policy_version 1081180 (0.00083) [2022-07-11 06:49:17,076][26022] Updated weights on worker 0-0, policy_version 1081190 (0.00082) [2022-07-11 06:49:17,972][25689] Fps is (10 sec: 5677.7, 60 sec: 5661.8, 300 sec: 5649.1). Total num frames: 1107143680. Throughput: 0: 5815.2. Samples: 1107152372. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:17,973][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 06:49:18,814][26022] Updated weights on worker 0-0, policy_version 1081200 (0.00086) [2022-07-11 06:49:20,657][26022] Updated weights on worker 0-0, policy_version 1081210 (0.00085) [2022-07-11 06:49:22,462][26022] Updated weights on worker 0-0, policy_version 1081220 (0.00090) [2022-07-11 06:49:23,076][25689] Fps is (10 sec: 5966.2, 60 sec: 5681.8, 300 sec: 5651.6). Total num frames: 1107173376. Throughput: 0: 5816.2. Samples: 1107169480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:23,076][25689] Avg episode reward: [(0, '1.363')] [2022-07-11 06:49:24,291][26022] Updated weights on worker 0-0, policy_version 1081230 (0.00086) [2022-07-11 06:49:26,108][26022] Updated weights on worker 0-0, policy_version 1081240 (0.00091) [2022-07-11 06:49:27,897][26022] Updated weights on worker 0-0, policy_version 1081250 (0.00059) [2022-07-11 06:49:28,097][25689] Fps is (10 sec: 5562.5, 60 sec: 5647.6, 300 sec: 5644.9). Total num frames: 1107200000. Throughput: 0: 5910.7. Samples: 1107203484. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:28,097][25689] Avg episode reward: [(0, '0.757')] [2022-07-11 06:49:29,637][26022] Updated weights on worker 0-0, policy_version 1081260 (0.00081) [2022-07-11 06:49:31,590][26022] Updated weights on worker 0-0, policy_version 1081270 (0.00089) [2022-07-11 06:49:33,115][25689] Fps is (10 sec: 5610.2, 60 sec: 5681.0, 300 sec: 5648.5). Total num frames: 1107229696. Throughput: 0: 5943.2. Samples: 1107237658. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:33,115][25689] Avg episode reward: [(0, '-0.023')] [2022-07-11 06:49:33,255][26022] Updated weights on worker 0-0, policy_version 1081280 (0.00082) [2022-07-11 06:49:35,193][26022] Updated weights on worker 0-0, policy_version 1081290 (0.00086) [2022-07-11 06:49:36,867][26022] Updated weights on worker 0-0, policy_version 1081300 (0.00087) [2022-07-11 06:49:38,131][25689] Fps is (10 sec: 5817.1, 60 sec: 5668.3, 300 sec: 5654.3). Total num frames: 1107258368. Throughput: 0: 5079.3. Samples: 1107254764. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:38,131][25689] Avg episode reward: [(0, '-0.481')] [2022-07-11 06:49:38,848][26022] Updated weights on worker 0-0, policy_version 1081310 (0.00092) [2022-07-11 06:49:40,383][26022] Updated weights on worker 0-0, policy_version 1081320 (0.00081) [2022-07-11 06:49:42,391][26022] Updated weights on worker 0-0, policy_version 1081330 (0.00085) [2022-07-11 06:49:42,979][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:49:43,002][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001081333_1107284992.pth [2022-07-11 06:49:43,003][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001079346_1105250304.pth [2022-07-11 06:49:43,265][25689] Fps is (10 sec: 5548.6, 60 sec: 5649.3, 300 sec: 5648.8). Total num frames: 1107286016. Throughput: 0: 5900.1. Samples: 1107288598. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:43,265][25689] Avg episode reward: [(0, '-0.522')] [2022-07-11 06:49:44,015][26022] Updated weights on worker 0-0, policy_version 1081340 (0.00083) [2022-07-11 06:49:46,116][26022] Updated weights on worker 0-0, policy_version 1081350 (0.00093) [2022-07-11 06:49:47,779][26022] Updated weights on worker 0-0, policy_version 1081360 (0.00079) [2022-07-11 06:49:48,347][25689] Fps is (10 sec: 5612.8, 60 sec: 5676.6, 300 sec: 5647.7). Total num frames: 1107315712. Throughput: 0: 5877.5. Samples: 1107322506. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:48,348][25689] Avg episode reward: [(0, '0.166')] [2022-07-11 06:49:49,383][26022] Updated weights on worker 0-0, policy_version 1081370 (0.00089) [2022-07-11 06:49:51,359][26022] Updated weights on worker 0-0, policy_version 1081380 (0.00086) [2022-07-11 06:49:53,131][26022] Updated weights on worker 0-0, policy_version 1081390 (0.00078) [2022-07-11 06:49:53,417][25689] Fps is (10 sec: 5648.3, 60 sec: 5643.4, 300 sec: 5643.4). Total num frames: 1107343360. Throughput: 0: 5026.0. Samples: 1107339700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:53,418][25689] Avg episode reward: [(0, '-0.222')] [2022-07-11 06:49:54,899][26022] Updated weights on worker 0-0, policy_version 1081400 (0.00089) [2022-07-11 06:49:57,007][26022] Updated weights on worker 0-0, policy_version 1081410 (0.00093) [2022-07-11 06:49:58,385][26022] Updated weights on worker 0-0, policy_version 1081420 (0.00090) [2022-07-11 06:49:58,471][25689] Fps is (10 sec: 5765.4, 60 sec: 5657.8, 300 sec: 5648.0). Total num frames: 1107374080. Throughput: 0: 5848.2. Samples: 1107373718. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:49:58,472][25689] Avg episode reward: [(0, '-0.713')] [2022-07-11 06:50:00,690][26022] Updated weights on worker 0-0, policy_version 1081430 (0.00088) [2022-07-11 06:50:02,347][26022] Updated weights on worker 0-0, policy_version 1081440 (0.00084) [2022-07-11 06:50:03,563][25689] Fps is (10 sec: 5450.3, 60 sec: 5612.4, 300 sec: 5642.9). Total num frames: 1107398656. Throughput: 0: 5764.8. Samples: 1107405612. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:03,563][25689] Avg episode reward: [(0, '0.517')] [2022-07-11 06:50:04,598][26022] Updated weights on worker 0-0, policy_version 1081450 (0.00084) [2022-07-11 06:50:06,091][26022] Updated weights on worker 0-0, policy_version 1081460 (0.00080) [2022-07-11 06:50:08,043][26022] Updated weights on worker 0-0, policy_version 1081470 (0.00734) [2022-07-11 06:50:08,630][25689] Fps is (10 sec: 5342.3, 60 sec: 5629.2, 300 sec: 5645.8). Total num frames: 1107428352. Throughput: 0: 4945.1. Samples: 1107422806. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:08,631][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 06:50:09,863][26022] Updated weights on worker 0-0, policy_version 1081480 (0.00084) [2022-07-11 06:50:11,718][26022] Updated weights on worker 0-0, policy_version 1081490 (0.00537) [2022-07-11 06:50:13,409][26022] Updated weights on worker 0-0, policy_version 1081500 (0.00088) [2022-07-11 06:50:13,652][25689] Fps is (10 sec: 5785.5, 60 sec: 5668.1, 300 sec: 5645.7). Total num frames: 1107457024. Throughput: 0: 5786.4. Samples: 1107456782. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:13,652][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 06:50:15,287][26022] Updated weights on worker 0-0, policy_version 1081510 (0.00091) [2022-07-11 06:50:16,991][26022] Updated weights on worker 0-0, policy_version 1081520 (0.00086) [2022-07-11 06:50:18,672][25689] Fps is (10 sec: 5608.4, 60 sec: 5617.6, 300 sec: 5641.2). Total num frames: 1107484672. Throughput: 0: 5802.6. Samples: 1107490936. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:18,673][25689] Avg episode reward: [(0, '-0.185')] [2022-07-11 06:50:18,917][26022] Updated weights on worker 0-0, policy_version 1081530 (0.00085) [2022-07-11 06:50:20,723][26022] Updated weights on worker 0-0, policy_version 1081540 (0.00111) [2022-07-11 06:50:22,603][26022] Updated weights on worker 0-0, policy_version 1081550 (0.00086) [2022-07-11 06:50:23,804][25689] Fps is (10 sec: 5547.3, 60 sec: 5598.2, 300 sec: 5642.2). Total num frames: 1107513344. Throughput: 0: 5055.9. Samples: 1107507950. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:23,805][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 06:50:24,303][26022] Updated weights on worker 0-0, policy_version 1081560 (0.00086) [2022-07-11 06:50:26,157][26022] Updated weights on worker 0-0, policy_version 1081570 (0.00087) [2022-07-11 06:50:27,907][26022] Updated weights on worker 0-0, policy_version 1081580 (0.00081) [2022-07-11 06:50:28,844][25689] Fps is (10 sec: 5738.3, 60 sec: 5647.0, 300 sec: 5641.9). Total num frames: 1107543040. Throughput: 0: 5897.7. Samples: 1107542020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:28,845][25689] Avg episode reward: [(0, '1.400')] [2022-07-11 06:50:29,612][26022] Updated weights on worker 0-0, policy_version 1081590 (0.00082) [2022-07-11 06:50:31,385][26022] Updated weights on worker 0-0, policy_version 1081600 (0.00088) [2022-07-11 06:50:33,321][26022] Updated weights on worker 0-0, policy_version 1081610 (0.00080) [2022-07-11 06:50:33,857][25689] Fps is (10 sec: 5806.3, 60 sec: 5630.6, 300 sec: 5645.4). Total num frames: 1107571712. Throughput: 0: 5908.0. Samples: 1107576154. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:33,857][25689] Avg episode reward: [(0, '1.474')] [2022-07-11 06:50:35,283][26022] Updated weights on worker 0-0, policy_version 1081620 (0.00087) [2022-07-11 06:50:36,871][26022] Updated weights on worker 0-0, policy_version 1081630 (0.00083) [2022-07-11 06:50:38,626][26022] Updated weights on worker 0-0, policy_version 1081640 (0.00091) [2022-07-11 06:50:38,938][25689] Fps is (10 sec: 5681.0, 60 sec: 5624.6, 300 sec: 5641.1). Total num frames: 1107600384. Throughput: 0: 5048.8. Samples: 1107593252. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:38,938][25689] Avg episode reward: [(0, '1.567')] [2022-07-11 06:50:40,302][26022] Updated weights on worker 0-0, policy_version 1081650 (0.00089) [2022-07-11 06:50:42,270][26022] Updated weights on worker 0-0, policy_version 1081660 (0.00089) [2022-07-11 06:50:44,009][25689] Fps is (10 sec: 5648.4, 60 sec: 5647.3, 300 sec: 5640.1). Total num frames: 1107629056. Throughput: 0: 5885.5. Samples: 1107626864. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:44,009][25689] Avg episode reward: [(0, '-0.280')] [2022-07-11 06:50:44,237][26022] Updated weights on worker 0-0, policy_version 1081670 (0.00085) [2022-07-11 06:50:46,018][26022] Updated weights on worker 0-0, policy_version 1081680 (0.00092) [2022-07-11 06:50:47,721][26022] Updated weights on worker 0-0, policy_version 1081690 (0.00050) [2022-07-11 06:50:49,025][25689] Fps is (10 sec: 5583.3, 60 sec: 5619.7, 300 sec: 5643.4). Total num frames: 1107656704. Throughput: 0: 5894.0. Samples: 1107660968. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:49,026][25689] Avg episode reward: [(0, '0.087')] [2022-07-11 06:50:49,605][26022] Updated weights on worker 0-0, policy_version 1081700 (0.00086) [2022-07-11 06:50:51,221][26022] Updated weights on worker 0-0, policy_version 1081710 (0.00085) [2022-07-11 06:50:53,335][26022] Updated weights on worker 0-0, policy_version 1081720 (0.00089) [2022-07-11 06:50:54,051][25689] Fps is (10 sec: 5608.4, 60 sec: 5640.7, 300 sec: 5639.7). Total num frames: 1107685376. Throughput: 0: 5891.3. Samples: 1107695124. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:54,053][25689] Avg episode reward: [(0, '-0.202')] [2022-07-11 06:50:54,991][26022] Updated weights on worker 0-0, policy_version 1081730 (0.00090) [2022-07-11 06:50:56,919][26022] Updated weights on worker 0-0, policy_version 1081740 (0.00082) [2022-07-11 06:50:58,514][26022] Updated weights on worker 0-0, policy_version 1081750 (0.00080) [2022-07-11 06:50:59,063][25689] Fps is (10 sec: 5712.7, 60 sec: 5610.8, 300 sec: 5641.7). Total num frames: 1107714048. Throughput: 0: 5902.0. Samples: 1107712032. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:50:59,064][25689] Avg episode reward: [(0, '-0.152')] [2022-07-11 06:51:00,370][26022] Updated weights on worker 0-0, policy_version 1081760 (0.00095) [2022-07-11 06:51:02,595][26022] Updated weights on worker 0-0, policy_version 1081770 (0.00083) [2022-07-11 06:51:04,143][25689] Fps is (10 sec: 5479.4, 60 sec: 5645.7, 300 sec: 5644.0). Total num frames: 1107740672. Throughput: 0: 5813.3. Samples: 1107743908. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:04,143][25689] Avg episode reward: [(0, '-0.244')] [2022-07-11 06:51:04,364][26022] Updated weights on worker 0-0, policy_version 1081780 (0.00081) [2022-07-11 06:51:06,336][26022] Updated weights on worker 0-0, policy_version 1081790 (0.00055) [2022-07-11 06:51:08,105][26022] Updated weights on worker 0-0, policy_version 1081800 (0.00085) [2022-07-11 06:51:09,170][25689] Fps is (10 sec: 5369.6, 60 sec: 5615.6, 300 sec: 5637.1). Total num frames: 1107768320. Throughput: 0: 5798.4. Samples: 1107777778. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:09,171][25689] Avg episode reward: [(0, '-0.147')] [2022-07-11 06:51:09,902][26022] Updated weights on worker 0-0, policy_version 1081810 (0.00084) [2022-07-11 06:51:11,592][26022] Updated weights on worker 0-0, policy_version 1081820 (0.00091) [2022-07-11 06:51:13,379][26022] Updated weights on worker 0-0, policy_version 1081830 (0.00091) [2022-07-11 06:51:14,193][25689] Fps is (10 sec: 5501.7, 60 sec: 5598.5, 300 sec: 5637.1). Total num frames: 1107795968. Throughput: 0: 4944.6. Samples: 1107794720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:14,194][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 06:51:15,440][26022] Updated weights on worker 0-0, policy_version 1081840 (0.00089) [2022-07-11 06:51:17,196][26022] Updated weights on worker 0-0, policy_version 1081850 (0.00082) [2022-07-11 06:51:18,966][26022] Updated weights on worker 0-0, policy_version 1081860 (0.00778) [2022-07-11 06:51:19,236][25689] Fps is (10 sec: 5696.9, 60 sec: 5630.3, 300 sec: 5648.0). Total num frames: 1107825664. Throughput: 0: 5783.3. Samples: 1107828700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:19,237][25689] Avg episode reward: [(0, '1.272')] [2022-07-11 06:51:20,599][26022] Updated weights on worker 0-0, policy_version 1081870 (0.00077) [2022-07-11 06:51:22,398][26022] Updated weights on worker 0-0, policy_version 1081880 (0.00084) [2022-07-11 06:51:24,324][25689] Fps is (10 sec: 5761.4, 60 sec: 5634.4, 300 sec: 5643.2). Total num frames: 1107854336. Throughput: 0: 5884.7. Samples: 1107862670. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:24,325][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 06:51:24,570][26022] Updated weights on worker 0-0, policy_version 1081890 (0.00084) [2022-07-11 06:51:26,099][26022] Updated weights on worker 0-0, policy_version 1081900 (0.00088) [2022-07-11 06:51:28,080][26022] Updated weights on worker 0-0, policy_version 1081910 (0.00088) [2022-07-11 06:51:29,372][25689] Fps is (10 sec: 5758.9, 60 sec: 5633.7, 300 sec: 5645.8). Total num frames: 1107884032. Throughput: 0: 5045.9. Samples: 1107879712. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:29,372][25689] Avg episode reward: [(0, '-0.497')] [2022-07-11 06:51:29,714][26022] Updated weights on worker 0-0, policy_version 1081920 (0.00088) [2022-07-11 06:51:31,686][26022] Updated weights on worker 0-0, policy_version 1081930 (0.00085) [2022-07-11 06:51:33,505][26022] Updated weights on worker 0-0, policy_version 1081940 (0.00080) [2022-07-11 06:51:34,408][25689] Fps is (10 sec: 5686.6, 60 sec: 5614.5, 300 sec: 5645.6). Total num frames: 1107911680. Throughput: 0: 5888.0. Samples: 1107913746. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:34,409][25689] Avg episode reward: [(0, '-0.149')] [2022-07-11 06:51:35,142][26022] Updated weights on worker 0-0, policy_version 1081950 (0.00084) [2022-07-11 06:51:37,208][26022] Updated weights on worker 0-0, policy_version 1081960 (0.00085) [2022-07-11 06:51:38,827][26022] Updated weights on worker 0-0, policy_version 1081970 (0.00087) [2022-07-11 06:51:39,429][25689] Fps is (10 sec: 5497.8, 60 sec: 5603.2, 300 sec: 5643.7). Total num frames: 1107939328. Throughput: 0: 5908.5. Samples: 1107948012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:39,430][25689] Avg episode reward: [(0, '-0.008')] [2022-07-11 06:51:40,695][26022] Updated weights on worker 0-0, policy_version 1081980 (0.00088) [2022-07-11 06:51:42,379][26022] Updated weights on worker 0-0, policy_version 1081990 (0.00082) [2022-07-11 06:51:43,177][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:51:43,190][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001081993_1107960832.pth [2022-07-11 06:51:43,190][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001080009_1105929216.pth [2022-07-11 06:51:44,442][26022] Updated weights on worker 0-0, policy_version 1082000 (0.00090) [2022-07-11 06:51:44,540][25689] Fps is (10 sec: 5660.1, 60 sec: 5616.5, 300 sec: 5642.4). Total num frames: 1107969024. Throughput: 0: 5047.4. Samples: 1107964704. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:44,540][25689] Avg episode reward: [(0, '-0.025')] [2022-07-11 06:51:46,192][26022] Updated weights on worker 0-0, policy_version 1082010 (0.00114) [2022-07-11 06:51:47,941][26022] Updated weights on worker 0-0, policy_version 1082020 (0.00079) [2022-07-11 06:51:49,548][25689] Fps is (10 sec: 5768.7, 60 sec: 5634.1, 300 sec: 5643.4). Total num frames: 1107997696. Throughput: 0: 5902.8. Samples: 1107998806. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:49,549][25689] Avg episode reward: [(0, '-0.238')] [2022-07-11 06:51:49,723][26022] Updated weights on worker 0-0, policy_version 1082030 (0.00085) [2022-07-11 06:51:51,402][26022] Updated weights on worker 0-0, policy_version 1082040 (0.00086) [2022-07-11 06:51:53,211][26022] Updated weights on worker 0-0, policy_version 1082050 (0.00078) [2022-07-11 06:51:54,647][25689] Fps is (10 sec: 5673.6, 60 sec: 5627.3, 300 sec: 5641.7). Total num frames: 1108026368. Throughput: 0: 5898.5. Samples: 1108033122. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:54,649][25689] Avg episode reward: [(0, '0.590')] [2022-07-11 06:51:54,972][26022] Updated weights on worker 0-0, policy_version 1082060 (0.00082) [2022-07-11 06:51:56,899][26022] Updated weights on worker 0-0, policy_version 1082070 (0.00079) [2022-07-11 06:51:58,871][26022] Updated weights on worker 0-0, policy_version 1082080 (0.00616) [2022-07-11 06:51:59,687][25689] Fps is (10 sec: 5655.4, 60 sec: 5624.7, 300 sec: 5645.4). Total num frames: 1108055040. Throughput: 0: 5041.9. Samples: 1108050150. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:51:59,689][25689] Avg episode reward: [(0, '1.460')] [2022-07-11 06:52:00,523][26022] Updated weights on worker 0-0, policy_version 1082090 (0.00089) [2022-07-11 06:52:02,742][26022] Updated weights on worker 0-0, policy_version 1082100 (0.00085) [2022-07-11 06:52:04,615][26022] Updated weights on worker 0-0, policy_version 1082110 (0.00084) [2022-07-11 06:52:04,743][25689] Fps is (10 sec: 5375.6, 60 sec: 5610.0, 300 sec: 5641.2). Total num frames: 1108080640. Throughput: 0: 5816.3. Samples: 1108082212. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:52:04,743][25689] Avg episode reward: [(0, '2.080')] [2022-07-11 06:52:06,266][26022] Updated weights on worker 0-0, policy_version 1082120 (0.00080) [2022-07-11 06:52:08,206][26022] Updated weights on worker 0-0, policy_version 1082130 (0.00088) [2022-07-11 06:52:09,816][25689] Fps is (10 sec: 5358.4, 60 sec: 5622.7, 300 sec: 5640.7). Total num frames: 1108109312. Throughput: 0: 5796.3. Samples: 1108116286. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:52:09,816][25689] Avg episode reward: [(0, '1.834')] [2022-07-11 06:52:10,011][26022] Updated weights on worker 0-0, policy_version 1082140 (0.00088) [2022-07-11 06:52:11,824][26022] Updated weights on worker 0-0, policy_version 1082150 (0.00088) [2022-07-11 06:52:13,473][26022] Updated weights on worker 0-0, policy_version 1082160 (0.00088) [2022-07-11 06:52:14,840][25689] Fps is (10 sec: 5679.1, 60 sec: 5639.5, 300 sec: 5637.3). Total num frames: 1108137984. Throughput: 0: 4973.1. Samples: 1108133546. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:52:14,842][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 06:52:15,276][26022] Updated weights on worker 0-0, policy_version 1082170 (0.00094) [2022-07-11 06:52:17,157][26022] Updated weights on worker 0-0, policy_version 1082180 (0.00085) [2022-07-11 06:52:19,149][26022] Updated weights on worker 0-0, policy_version 1082190 (0.00085) [2022-07-11 06:52:19,882][25689] Fps is (10 sec: 5798.4, 60 sec: 5639.6, 300 sec: 5644.4). Total num frames: 1108167680. Throughput: 0: 5814.7. Samples: 1108167576. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:52:19,884][25689] Avg episode reward: [(0, '1.619')] [2022-07-11 06:52:20,822][26022] Updated weights on worker 0-0, policy_version 1082200 (0.00087) [2022-07-11 06:52:22,612][26022] Updated weights on worker 0-0, policy_version 1082210 (0.00092) [2022-07-11 06:52:24,320][26022] Updated weights on worker 0-0, policy_version 1082220 (0.00083) [2022-07-11 06:52:24,991][25689] Fps is (10 sec: 5749.9, 60 sec: 5637.6, 300 sec: 5639.9). Total num frames: 1108196352. Throughput: 0: 5909.5. Samples: 1108201870. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:52:24,992][25689] Avg episode reward: [(0, '0.638')] [2022-07-11 06:52:26,135][26022] Updated weights on worker 0-0, policy_version 1082230 (0.00095) [2022-07-11 06:52:27,819][26022] Updated weights on worker 0-0, policy_version 1082240 (0.00083) [2022-07-11 06:52:29,703][26022] Updated weights on worker 0-0, policy_version 1082250 (0.00095) [2022-07-11 06:52:30,018][25689] Fps is (10 sec: 5657.4, 60 sec: 5622.6, 300 sec: 5642.9). Total num frames: 1108225024. Throughput: 0: 5090.4. Samples: 1108219124. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 06:52:30,018][25689] Avg episode reward: [(0, '0.161')] [2022-07-11 06:52:31,411][26022] Updated weights on worker 0-0, policy_version 1082260 (0.00083) [2022-07-11 06:52:33,422][26022] Updated weights on worker 0-0, policy_version 1082270 (0.00087) [2022-07-11 06:52:34,908][26022] Updated weights on worker 0-0, policy_version 1082280 (0.00090) [2022-07-11 06:52:35,031][25689] Fps is (10 sec: 5814.0, 60 sec: 5658.7, 300 sec: 5646.5). Total num frames: 1108254720. Throughput: 0: 5947.9. Samples: 1108253636. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:52:35,031][25689] Avg episode reward: [(0, '0.033')] [2022-07-11 06:52:36,866][26022] Updated weights on worker 0-0, policy_version 1082290 (0.00093) [2022-07-11 06:52:38,459][26022] Updated weights on worker 0-0, policy_version 1082300 (0.00093) [2022-07-11 06:52:40,041][25689] Fps is (10 sec: 5721.5, 60 sec: 5659.7, 300 sec: 5645.0). Total num frames: 1108282368. Throughput: 0: 5962.7. Samples: 1108287776. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:52:40,041][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 06:52:40,582][26022] Updated weights on worker 0-0, policy_version 1082310 (0.00086) [2022-07-11 06:52:42,368][26022] Updated weights on worker 0-0, policy_version 1082320 (0.00091) [2022-07-11 06:52:44,005][26022] Updated weights on worker 0-0, policy_version 1082330 (0.00096) [2022-07-11 06:52:45,112][25689] Fps is (10 sec: 5485.2, 60 sec: 5629.5, 300 sec: 5644.0). Total num frames: 1108310016. Throughput: 0: 5119.8. Samples: 1108304882. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:52:45,112][25689] Avg episode reward: [(0, '0.334')] [2022-07-11 06:52:45,999][26022] Updated weights on worker 0-0, policy_version 1082340 (0.00084) [2022-07-11 06:52:47,763][26022] Updated weights on worker 0-0, policy_version 1082350 (0.00091) [2022-07-11 06:52:49,624][26022] Updated weights on worker 0-0, policy_version 1082360 (0.00085) [2022-07-11 06:52:50,204][25689] Fps is (10 sec: 5742.7, 60 sec: 5655.4, 300 sec: 5642.4). Total num frames: 1108340736. Throughput: 0: 5934.8. Samples: 1108338926. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:52:50,205][25689] Avg episode reward: [(0, '0.634')] [2022-07-11 06:52:51,405][26022] Updated weights on worker 0-0, policy_version 1082370 (0.00081) [2022-07-11 06:52:53,191][26022] Updated weights on worker 0-0, policy_version 1082380 (0.00057) [2022-07-11 06:52:55,023][26022] Updated weights on worker 0-0, policy_version 1082390 (0.00079) [2022-07-11 06:52:55,228][25689] Fps is (10 sec: 5769.8, 60 sec: 5645.6, 300 sec: 5642.4). Total num frames: 1108368384. Throughput: 0: 5911.7. Samples: 1108373036. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:52:55,228][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 06:52:56,593][26022] Updated weights on worker 0-0, policy_version 1082400 (0.00083) [2022-07-11 06:52:58,580][26022] Updated weights on worker 0-0, policy_version 1082410 (0.00077) [2022-07-11 06:53:00,242][25689] Fps is (10 sec: 5713.1, 60 sec: 5665.0, 300 sec: 5650.2). Total num frames: 1108398080. Throughput: 0: 5084.3. Samples: 1108390484. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:00,242][25689] Avg episode reward: [(0, '1.150')] [2022-07-11 06:53:00,245][26022] Updated weights on worker 0-0, policy_version 1082420 (0.00210) [2022-07-11 06:53:02,139][26022] Updated weights on worker 0-0, policy_version 1082430 (0.00087) [2022-07-11 06:53:04,254][26022] Updated weights on worker 0-0, policy_version 1082440 (0.00083) [2022-07-11 06:53:05,288][25689] Fps is (10 sec: 5598.3, 60 sec: 5682.8, 300 sec: 5646.9). Total num frames: 1108424704. Throughput: 0: 5831.6. Samples: 1108422542. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:05,290][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 06:53:06,091][26022] Updated weights on worker 0-0, policy_version 1082450 (0.00084) [2022-07-11 06:53:07,771][26022] Updated weights on worker 0-0, policy_version 1082460 (0.00077) [2022-07-11 06:53:09,608][26022] Updated weights on worker 0-0, policy_version 1082470 (0.00082) [2022-07-11 06:53:10,318][25689] Fps is (10 sec: 5487.8, 60 sec: 5686.8, 300 sec: 5647.0). Total num frames: 1108453376. Throughput: 0: 5861.5. Samples: 1108456820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:10,319][25689] Avg episode reward: [(0, '0.150')] [2022-07-11 06:53:11,341][26022] Updated weights on worker 0-0, policy_version 1082480 (0.00086) [2022-07-11 06:53:13,049][26022] Updated weights on worker 0-0, policy_version 1082490 (0.00079) [2022-07-11 06:53:15,135][26022] Updated weights on worker 0-0, policy_version 1082500 (0.00085) [2022-07-11 06:53:15,324][25689] Fps is (10 sec: 5611.9, 60 sec: 5671.6, 300 sec: 5640.0). Total num frames: 1108481024. Throughput: 0: 5034.1. Samples: 1108474202. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:15,324][25689] Avg episode reward: [(0, '0.088')] [2022-07-11 06:53:16,717][26022] Updated weights on worker 0-0, policy_version 1082510 (0.00084) [2022-07-11 06:53:18,547][26022] Updated weights on worker 0-0, policy_version 1082520 (0.00088) [2022-07-11 06:53:20,369][25689] Fps is (10 sec: 5603.6, 60 sec: 5654.4, 300 sec: 5641.8). Total num frames: 1108509696. Throughput: 0: 5858.5. Samples: 1108508398. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:20,370][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 06:53:20,543][26022] Updated weights on worker 0-0, policy_version 1082530 (0.00082) [2022-07-11 06:53:22,366][26022] Updated weights on worker 0-0, policy_version 1082540 (0.00083) [2022-07-11 06:53:24,063][26022] Updated weights on worker 0-0, policy_version 1082550 (0.00088) [2022-07-11 06:53:25,497][25689] Fps is (10 sec: 5737.2, 60 sec: 5669.5, 300 sec: 5643.1). Total num frames: 1108539392. Throughput: 0: 5931.3. Samples: 1108542410. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:25,498][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 06:53:25,753][26022] Updated weights on worker 0-0, policy_version 1082560 (0.00087) [2022-07-11 06:53:27,615][26022] Updated weights on worker 0-0, policy_version 1082570 (0.00087) [2022-07-11 06:53:29,587][26022] Updated weights on worker 0-0, policy_version 1082580 (0.00087) [2022-07-11 06:53:30,527][25689] Fps is (10 sec: 5645.1, 60 sec: 5652.3, 300 sec: 5642.8). Total num frames: 1108567040. Throughput: 0: 5910.9. Samples: 1108576274. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:30,527][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 06:53:31,210][26022] Updated weights on worker 0-0, policy_version 1082590 (0.00093) [2022-07-11 06:53:33,232][26022] Updated weights on worker 0-0, policy_version 1082600 (0.00085) [2022-07-11 06:53:34,835][26022] Updated weights on worker 0-0, policy_version 1082610 (0.00105) [2022-07-11 06:53:35,537][25689] Fps is (10 sec: 5609.7, 60 sec: 5635.6, 300 sec: 5640.3). Total num frames: 1108595712. Throughput: 0: 5889.6. Samples: 1108593252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:35,538][25689] Avg episode reward: [(0, '0.243')] [2022-07-11 06:53:36,804][26022] Updated weights on worker 0-0, policy_version 1082620 (0.00086) [2022-07-11 06:53:38,670][26022] Updated weights on worker 0-0, policy_version 1082630 (0.00095) [2022-07-11 06:53:40,387][26022] Updated weights on worker 0-0, policy_version 1082640 (0.00084) [2022-07-11 06:53:40,549][25689] Fps is (10 sec: 5721.9, 60 sec: 5652.4, 300 sec: 5642.2). Total num frames: 1108624384. Throughput: 0: 5877.6. Samples: 1108627008. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:40,549][25689] Avg episode reward: [(0, '0.902')] [2022-07-11 06:53:42,235][26022] Updated weights on worker 0-0, policy_version 1082650 (0.00089) [2022-07-11 06:53:43,360][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:53:43,370][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001082657_1108640768.pth [2022-07-11 06:53:43,371][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001080669_1106605056.pth [2022-07-11 06:53:43,720][26022] Updated weights on worker 0-0, policy_version 1082660 (0.00049) [2022-07-11 06:53:45,593][25689] Fps is (10 sec: 5601.0, 60 sec: 5654.9, 300 sec: 5641.6). Total num frames: 1108652032. Throughput: 0: 5909.4. Samples: 1108661160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:45,593][25689] Avg episode reward: [(0, '-0.185')] [2022-07-11 06:53:45,806][26022] Updated weights on worker 0-0, policy_version 1082670 (0.00097) [2022-07-11 06:53:47,671][26022] Updated weights on worker 0-0, policy_version 1082680 (0.00516) [2022-07-11 06:53:49,263][26022] Updated weights on worker 0-0, policy_version 1082690 (0.00089) [2022-07-11 06:53:50,604][25689] Fps is (10 sec: 5601.3, 60 sec: 5628.7, 300 sec: 5639.4). Total num frames: 1108680704. Throughput: 0: 5079.9. Samples: 1108678262. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:50,604][25689] Avg episode reward: [(0, '0.058')] [2022-07-11 06:53:51,178][26022] Updated weights on worker 0-0, policy_version 1082700 (0.01541) [2022-07-11 06:53:52,814][26022] Updated weights on worker 0-0, policy_version 1082710 (0.00090) [2022-07-11 06:53:54,839][26022] Updated weights on worker 0-0, policy_version 1082720 (0.00089) [2022-07-11 06:53:55,610][25689] Fps is (10 sec: 5724.4, 60 sec: 5647.2, 300 sec: 5636.4). Total num frames: 1108709376. Throughput: 0: 5941.2. Samples: 1108712508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:53:55,611][25689] Avg episode reward: [(0, '0.224')] [2022-07-11 06:53:56,441][26022] Updated weights on worker 0-0, policy_version 1082730 (0.00094) [2022-07-11 06:53:58,236][26022] Updated weights on worker 0-0, policy_version 1082740 (0.00081) [2022-07-11 06:54:00,063][26022] Updated weights on worker 0-0, policy_version 1082750 (0.00080) [2022-07-11 06:54:00,619][25689] Fps is (10 sec: 5623.1, 60 sec: 5613.7, 300 sec: 5639.0). Total num frames: 1108737024. Throughput: 0: 5981.8. Samples: 1108747066. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:00,620][25689] Avg episode reward: [(0, '0.380')] [2022-07-11 06:54:02,311][26022] Updated weights on worker 0-0, policy_version 1082760 (0.00097) [2022-07-11 06:54:03,980][26022] Updated weights on worker 0-0, policy_version 1082770 (0.00082) [2022-07-11 06:54:05,682][25689] Fps is (10 sec: 5490.1, 60 sec: 5629.2, 300 sec: 5635.6). Total num frames: 1108764672. Throughput: 0: 5016.4. Samples: 1108761936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:05,683][25689] Avg episode reward: [(0, '1.345')] [2022-07-11 06:54:05,862][26022] Updated weights on worker 0-0, policy_version 1082780 (0.00059) [2022-07-11 06:54:07,784][26022] Updated weights on worker 0-0, policy_version 1082790 (0.00090) [2022-07-11 06:54:09,514][26022] Updated weights on worker 0-0, policy_version 1082800 (0.00082) [2022-07-11 06:54:10,697][25689] Fps is (10 sec: 5689.9, 60 sec: 5647.5, 300 sec: 5647.1). Total num frames: 1108794368. Throughput: 0: 5870.8. Samples: 1108796230. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:10,698][25689] Avg episode reward: [(0, '0.192')] [2022-07-11 06:54:11,355][26022] Updated weights on worker 0-0, policy_version 1082810 (0.00084) [2022-07-11 06:54:12,921][26022] Updated weights on worker 0-0, policy_version 1082820 (0.00092) [2022-07-11 06:54:15,256][26022] Updated weights on worker 0-0, policy_version 1082830 (0.00090) [2022-07-11 06:54:15,716][25689] Fps is (10 sec: 5715.2, 60 sec: 5646.3, 300 sec: 5636.8). Total num frames: 1108822016. Throughput: 0: 5852.8. Samples: 1108830182. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:15,716][25689] Avg episode reward: [(0, '0.023')] [2022-07-11 06:54:16,616][26022] Updated weights on worker 0-0, policy_version 1082840 (0.00086) [2022-07-11 06:54:18,647][26022] Updated weights on worker 0-0, policy_version 1082850 (0.00083) [2022-07-11 06:54:20,323][26022] Updated weights on worker 0-0, policy_version 1082860 (0.00086) [2022-07-11 06:54:20,777][25689] Fps is (10 sec: 5486.2, 60 sec: 5627.9, 300 sec: 5630.8). Total num frames: 1108849664. Throughput: 0: 4968.8. Samples: 1108847224. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:20,777][25689] Avg episode reward: [(0, '-0.420')] [2022-07-11 06:54:22,087][26022] Updated weights on worker 0-0, policy_version 1082870 (0.00092) [2022-07-11 06:54:23,973][26022] Updated weights on worker 0-0, policy_version 1082880 (0.00082) [2022-07-11 06:54:25,581][26022] Updated weights on worker 0-0, policy_version 1082890 (0.00086) [2022-07-11 06:54:25,866][25689] Fps is (10 sec: 5750.3, 60 sec: 5648.5, 300 sec: 5643.2). Total num frames: 1108880384. Throughput: 0: 5937.3. Samples: 1108881774. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:25,867][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 06:54:27,496][26022] Updated weights on worker 0-0, policy_version 1082900 (0.00092) [2022-07-11 06:54:29,269][26022] Updated weights on worker 0-0, policy_version 1082910 (0.00081) [2022-07-11 06:54:30,871][25689] Fps is (10 sec: 5782.5, 60 sec: 5650.8, 300 sec: 5636.6). Total num frames: 1108908032. Throughput: 0: 5918.2. Samples: 1108915618. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:30,871][25689] Avg episode reward: [(0, '-0.704')] [2022-07-11 06:54:31,071][26022] Updated weights on worker 0-0, policy_version 1082920 (0.00084) [2022-07-11 06:54:33,059][26022] Updated weights on worker 0-0, policy_version 1082930 (0.00087) [2022-07-11 06:54:34,753][26022] Updated weights on worker 0-0, policy_version 1082940 (0.00100) [2022-07-11 06:54:35,935][25689] Fps is (10 sec: 5593.2, 60 sec: 5645.7, 300 sec: 5635.7). Total num frames: 1108936704. Throughput: 0: 5067.7. Samples: 1108932650. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:35,936][25689] Avg episode reward: [(0, '-0.601')] [2022-07-11 06:54:36,551][26022] Updated weights on worker 0-0, policy_version 1082950 (0.00103) [2022-07-11 06:54:38,483][26022] Updated weights on worker 0-0, policy_version 1082960 (0.00084) [2022-07-11 06:54:40,058][26022] Updated weights on worker 0-0, policy_version 1082970 (0.00087) [2022-07-11 06:54:41,035][25689] Fps is (10 sec: 5742.4, 60 sec: 5654.4, 300 sec: 5643.2). Total num frames: 1108966400. Throughput: 0: 5915.9. Samples: 1108967070. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:41,037][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 06:54:41,901][26022] Updated weights on worker 0-0, policy_version 1082980 (0.00086) [2022-07-11 06:54:43,575][26022] Updated weights on worker 0-0, policy_version 1082990 (0.00085) [2022-07-11 06:54:45,600][26022] Updated weights on worker 0-0, policy_version 1083000 (0.00086) [2022-07-11 06:54:46,145][25689] Fps is (10 sec: 5717.0, 60 sec: 5665.2, 300 sec: 5639.3). Total num frames: 1108995072. Throughput: 0: 5907.5. Samples: 1109001572. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:46,146][25689] Avg episode reward: [(0, '1.321')] [2022-07-11 06:54:47,432][26022] Updated weights on worker 0-0, policy_version 1083010 (0.00085) [2022-07-11 06:54:49,158][26022] Updated weights on worker 0-0, policy_version 1083020 (0.00088) [2022-07-11 06:54:50,891][26022] Updated weights on worker 0-0, policy_version 1083030 (0.00092) [2022-07-11 06:54:51,152][25689] Fps is (10 sec: 5567.1, 60 sec: 5648.7, 300 sec: 5640.5). Total num frames: 1109022720. Throughput: 0: 5060.6. Samples: 1109018252. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:51,152][25689] Avg episode reward: [(0, '1.664')] [2022-07-11 06:54:52,640][26022] Updated weights on worker 0-0, policy_version 1083040 (0.00085) [2022-07-11 06:54:54,610][26022] Updated weights on worker 0-0, policy_version 1083050 (0.00086) [2022-07-11 06:54:56,231][25689] Fps is (10 sec: 5584.0, 60 sec: 5641.9, 300 sec: 5633.1). Total num frames: 1109051392. Throughput: 0: 5908.4. Samples: 1109052564. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:54:56,231][25689] Avg episode reward: [(0, '1.673')] [2022-07-11 06:54:56,397][26022] Updated weights on worker 0-0, policy_version 1083060 (0.00086) [2022-07-11 06:54:58,043][26022] Updated weights on worker 0-0, policy_version 1083070 (0.00080) [2022-07-11 06:54:59,975][26022] Updated weights on worker 0-0, policy_version 1083080 (0.00096) [2022-07-11 06:55:01,237][25689] Fps is (10 sec: 5787.6, 60 sec: 5676.0, 300 sec: 5651.9). Total num frames: 1109081088. Throughput: 0: 5927.7. Samples: 1109086820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:01,237][25689] Avg episode reward: [(0, '1.750')] [2022-07-11 06:55:01,935][26022] Updated weights on worker 0-0, policy_version 1083090 (0.00084) [2022-07-11 06:55:04,002][26022] Updated weights on worker 0-0, policy_version 1083100 (0.00083) [2022-07-11 06:55:05,543][26022] Updated weights on worker 0-0, policy_version 1083110 (0.00082) [2022-07-11 06:55:06,297][25689] Fps is (10 sec: 5595.2, 60 sec: 5659.4, 300 sec: 5641.8). Total num frames: 1109107712. Throughput: 0: 4976.6. Samples: 1109101860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:06,297][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 06:55:07,616][26022] Updated weights on worker 0-0, policy_version 1083120 (0.00101) [2022-07-11 06:55:09,133][26022] Updated weights on worker 0-0, policy_version 1083130 (0.00080) [2022-07-11 06:55:11,122][26022] Updated weights on worker 0-0, policy_version 1083140 (0.00088) [2022-07-11 06:55:11,384][25689] Fps is (10 sec: 5449.5, 60 sec: 5635.8, 300 sec: 5640.5). Total num frames: 1109136384. Throughput: 0: 5819.7. Samples: 1109135996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:11,384][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 06:55:12,806][26022] Updated weights on worker 0-0, policy_version 1083150 (0.00097) [2022-07-11 06:55:14,687][26022] Updated weights on worker 0-0, policy_version 1083160 (0.00082) [2022-07-11 06:55:16,400][25689] Fps is (10 sec: 5675.8, 60 sec: 5652.9, 300 sec: 5644.0). Total num frames: 1109165056. Throughput: 0: 5846.0. Samples: 1109170472. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:16,400][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 06:55:16,656][26022] Updated weights on worker 0-0, policy_version 1083170 (0.00442) [2022-07-11 06:55:18,244][26022] Updated weights on worker 0-0, policy_version 1083180 (0.00081) [2022-07-11 06:55:20,144][26022] Updated weights on worker 0-0, policy_version 1083190 (0.00086) [2022-07-11 06:55:21,444][25689] Fps is (10 sec: 5598.2, 60 sec: 5654.4, 300 sec: 5642.3). Total num frames: 1109192704. Throughput: 0: 4974.5. Samples: 1109187342. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:21,444][25689] Avg episode reward: [(0, '1.787')] [2022-07-11 06:55:21,851][26022] Updated weights on worker 0-0, policy_version 1083200 (0.00093) [2022-07-11 06:55:23,623][26022] Updated weights on worker 0-0, policy_version 1083210 (0.00856) [2022-07-11 06:55:25,519][26022] Updated weights on worker 0-0, policy_version 1083220 (0.00084) [2022-07-11 06:55:26,541][25689] Fps is (10 sec: 5755.4, 60 sec: 5653.7, 300 sec: 5644.6). Total num frames: 1109223424. Throughput: 0: 5895.3. Samples: 1109221208. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:26,542][25689] Avg episode reward: [(0, '1.362')] [2022-07-11 06:55:27,345][26022] Updated weights on worker 0-0, policy_version 1083230 (0.00087) [2022-07-11 06:55:29,246][26022] Updated weights on worker 0-0, policy_version 1083240 (0.00082) [2022-07-11 06:55:31,068][26022] Updated weights on worker 0-0, policy_version 1083250 (0.00085) [2022-07-11 06:55:31,605][25689] Fps is (10 sec: 5643.4, 60 sec: 5631.3, 300 sec: 5636.8). Total num frames: 1109250048. Throughput: 0: 5894.0. Samples: 1109255182. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:31,606][25689] Avg episode reward: [(0, '1.378')] [2022-07-11 06:55:32,650][26022] Updated weights on worker 0-0, policy_version 1083260 (0.00080) [2022-07-11 06:55:34,677][26022] Updated weights on worker 0-0, policy_version 1083270 (0.00086) [2022-07-11 06:55:36,120][26022] Updated weights on worker 0-0, policy_version 1083280 (0.00088) [2022-07-11 06:55:36,608][25689] Fps is (10 sec: 5696.2, 60 sec: 5670.8, 300 sec: 5645.1). Total num frames: 1109280768. Throughput: 0: 5887.1. Samples: 1109289442. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:36,609][25689] Avg episode reward: [(0, '0.992')] [2022-07-11 06:55:38,288][26022] Updated weights on worker 0-0, policy_version 1083290 (0.00088) [2022-07-11 06:55:39,949][26022] Updated weights on worker 0-0, policy_version 1083300 (0.00084) [2022-07-11 06:55:41,627][25689] Fps is (10 sec: 5823.9, 60 sec: 5644.5, 300 sec: 5642.7). Total num frames: 1109308416. Throughput: 0: 5907.9. Samples: 1109306584. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:41,628][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 06:55:41,729][26022] Updated weights on worker 0-0, policy_version 1083310 (0.00089) [2022-07-11 06:55:43,481][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:55:43,496][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001083320_1109319680.pth [2022-07-11 06:55:43,497][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001081333_1107284992.pth [2022-07-11 06:55:43,502][26022] Updated weights on worker 0-0, policy_version 1083320 (0.00085) [2022-07-11 06:55:45,345][26022] Updated weights on worker 0-0, policy_version 1083330 (0.00087) [2022-07-11 06:55:46,702][25689] Fps is (10 sec: 5579.5, 60 sec: 5647.8, 300 sec: 5645.0). Total num frames: 1109337088. Throughput: 0: 5943.1. Samples: 1109341028. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:46,704][25689] Avg episode reward: [(0, '-0.018')] [2022-07-11 06:55:47,243][26022] Updated weights on worker 0-0, policy_version 1083340 (0.00092) [2022-07-11 06:55:48,971][26022] Updated weights on worker 0-0, policy_version 1083350 (0.00087) [2022-07-11 06:55:50,766][26022] Updated weights on worker 0-0, policy_version 1083360 (0.00091) [2022-07-11 06:55:51,715][25689] Fps is (10 sec: 5786.2, 60 sec: 5681.1, 300 sec: 5648.7). Total num frames: 1109366784. Throughput: 0: 5952.2. Samples: 1109374878. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:51,715][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 06:55:52,887][26022] Updated weights on worker 0-0, policy_version 1083370 (0.00085) [2022-07-11 06:55:54,310][26022] Updated weights on worker 0-0, policy_version 1083380 (0.00081) [2022-07-11 06:55:56,366][26022] Updated weights on worker 0-0, policy_version 1083390 (0.00083) [2022-07-11 06:55:56,803][25689] Fps is (10 sec: 5474.7, 60 sec: 5629.6, 300 sec: 5636.9). Total num frames: 1109392384. Throughput: 0: 5064.9. Samples: 1109391724. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:55:56,804][25689] Avg episode reward: [(0, '0.080')] [2022-07-11 06:55:57,879][26022] Updated weights on worker 0-0, policy_version 1083400 (0.00068) [2022-07-11 06:56:00,039][26022] Updated weights on worker 0-0, policy_version 1083410 (0.00094) [2022-07-11 06:56:01,526][26022] Updated weights on worker 0-0, policy_version 1083420 (0.00077) [2022-07-11 06:56:01,822][25689] Fps is (10 sec: 5572.3, 60 sec: 5645.2, 300 sec: 5651.8). Total num frames: 1109423104. Throughput: 0: 5909.6. Samples: 1109425926. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:56:01,824][25689] Avg episode reward: [(0, '0.689')] [2022-07-11 06:56:03,760][26022] Updated weights on worker 0-0, policy_version 1083430 (0.00083) [2022-07-11 06:56:05,363][26022] Updated weights on worker 0-0, policy_version 1083440 (0.00079) [2022-07-11 06:56:06,907][25689] Fps is (10 sec: 5776.7, 60 sec: 5659.8, 300 sec: 5650.7). Total num frames: 1109450752. Throughput: 0: 5794.0. Samples: 1109458092. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:56:06,907][25689] Avg episode reward: [(0, '-0.163')] [2022-07-11 06:56:07,589][26022] Updated weights on worker 0-0, policy_version 1083450 (0.00090) [2022-07-11 06:56:09,203][26022] Updated weights on worker 0-0, policy_version 1083460 (0.00090) [2022-07-11 06:56:10,961][26022] Updated weights on worker 0-0, policy_version 1083470 (0.00089) [2022-07-11 06:56:11,931][25689] Fps is (10 sec: 5368.8, 60 sec: 5631.8, 300 sec: 5647.3). Total num frames: 1109477376. Throughput: 0: 4951.6. Samples: 1109474980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:56:11,931][25689] Avg episode reward: [(0, '0.387')] [2022-07-11 06:56:12,738][26022] Updated weights on worker 0-0, policy_version 1083480 (0.00083) [2022-07-11 06:56:14,683][26022] Updated weights on worker 0-0, policy_version 1083490 (0.00081) [2022-07-11 06:56:16,474][26022] Updated weights on worker 0-0, policy_version 1083500 (0.00088) [2022-07-11 06:56:16,953][25689] Fps is (10 sec: 5504.3, 60 sec: 5631.3, 300 sec: 5644.2). Total num frames: 1109506048. Throughput: 0: 5818.4. Samples: 1109508964. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:56:16,953][25689] Avg episode reward: [(0, '0.306')] [2022-07-11 06:56:18,404][26022] Updated weights on worker 0-0, policy_version 1083510 (0.00087) [2022-07-11 06:56:20,058][26022] Updated weights on worker 0-0, policy_version 1083520 (0.00082) [2022-07-11 06:56:21,904][26022] Updated weights on worker 0-0, policy_version 1083530 (0.00093) [2022-07-11 06:56:21,979][25689] Fps is (10 sec: 5706.8, 60 sec: 5649.9, 300 sec: 5645.4). Total num frames: 1109534720. Throughput: 0: 5788.3. Samples: 1109542602. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:56:21,980][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 06:56:23,777][26022] Updated weights on worker 0-0, policy_version 1083540 (0.00091) [2022-07-11 06:56:25,453][26022] Updated weights on worker 0-0, policy_version 1083550 (0.00084) [2022-07-11 06:56:27,126][25689] Fps is (10 sec: 5636.9, 60 sec: 5611.5, 300 sec: 5640.1). Total num frames: 1109563392. Throughput: 0: 5015.9. Samples: 1109559508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 06:56:27,126][25689] Avg episode reward: [(0, '0.622')] [2022-07-11 06:56:27,469][26022] Updated weights on worker 0-0, policy_version 1083560 (0.00089) [2022-07-11 06:56:29,125][26022] Updated weights on worker 0-0, policy_version 1083570 (0.00085) [2022-07-11 06:56:30,988][26022] Updated weights on worker 0-0, policy_version 1083580 (0.00092) [2022-07-11 06:56:32,205][25689] Fps is (10 sec: 5607.5, 60 sec: 5643.8, 300 sec: 5642.7). Total num frames: 1109592064. Throughput: 0: 5839.3. Samples: 1109593370. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:56:32,206][25689] Avg episode reward: [(0, '0.274')] [2022-07-11 06:56:32,796][26022] Updated weights on worker 0-0, policy_version 1083590 (0.00085) [2022-07-11 06:56:34,730][26022] Updated weights on worker 0-0, policy_version 1083600 (0.00089) [2022-07-11 06:56:36,423][26022] Updated weights on worker 0-0, policy_version 1083610 (0.00089) [2022-07-11 06:56:37,207][25689] Fps is (10 sec: 5688.1, 60 sec: 5610.2, 300 sec: 5646.5). Total num frames: 1109620736. Throughput: 0: 5859.8. Samples: 1109627650. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:56:37,208][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 06:56:38,090][26022] Updated weights on worker 0-0, policy_version 1083620 (0.00096) [2022-07-11 06:56:40,047][26022] Updated weights on worker 0-0, policy_version 1083630 (0.00086) [2022-07-11 06:56:41,907][26022] Updated weights on worker 0-0, policy_version 1083640 (0.00087) [2022-07-11 06:56:42,277][25689] Fps is (10 sec: 5693.5, 60 sec: 5622.3, 300 sec: 5643.8). Total num frames: 1109649408. Throughput: 0: 5016.9. Samples: 1109644436. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:56:42,278][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 06:56:43,530][26022] Updated weights on worker 0-0, policy_version 1083650 (0.00087) [2022-07-11 06:56:45,299][26022] Updated weights on worker 0-0, policy_version 1083660 (0.00088) [2022-07-11 06:56:47,272][26022] Updated weights on worker 0-0, policy_version 1083670 (0.00086) [2022-07-11 06:56:47,369][25689] Fps is (10 sec: 5643.0, 60 sec: 5620.8, 300 sec: 5642.3). Total num frames: 1109678080. Throughput: 0: 5898.4. Samples: 1109678912. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:56:47,370][25689] Avg episode reward: [(0, '0.761')] [2022-07-11 06:56:49,049][26022] Updated weights on worker 0-0, policy_version 1083680 (0.00088) [2022-07-11 06:56:50,891][26022] Updated weights on worker 0-0, policy_version 1083690 (0.00087) [2022-07-11 06:56:52,425][25689] Fps is (10 sec: 5752.1, 60 sec: 5616.8, 300 sec: 5646.5). Total num frames: 1109707776. Throughput: 0: 5921.5. Samples: 1109713098. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:56:52,425][25689] Avg episode reward: [(0, '0.160')] [2022-07-11 06:56:52,602][26022] Updated weights on worker 0-0, policy_version 1083700 (0.00086) [2022-07-11 06:56:54,389][26022] Updated weights on worker 0-0, policy_version 1083710 (0.00069) [2022-07-11 06:56:56,406][26022] Updated weights on worker 0-0, policy_version 1083720 (0.00092) [2022-07-11 06:56:57,486][25689] Fps is (10 sec: 5769.4, 60 sec: 5669.8, 300 sec: 5646.1). Total num frames: 1109736448. Throughput: 0: 5048.0. Samples: 1109730018. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:56:57,487][25689] Avg episode reward: [(0, '0.383')] [2022-07-11 06:56:57,925][26022] Updated weights on worker 0-0, policy_version 1083730 (0.00084) [2022-07-11 06:56:59,946][26022] Updated weights on worker 0-0, policy_version 1083740 (0.00082) [2022-07-11 06:57:01,791][26022] Updated weights on worker 0-0, policy_version 1083750 (0.00082) [2022-07-11 06:57:02,521][25689] Fps is (10 sec: 5375.3, 60 sec: 5584.0, 300 sec: 5646.5). Total num frames: 1109762048. Throughput: 0: 5914.2. Samples: 1109764162. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:02,522][25689] Avg episode reward: [(0, '1.280')] [2022-07-11 06:57:03,800][26022] Updated weights on worker 0-0, policy_version 1083760 (0.00578) [2022-07-11 06:57:05,498][26022] Updated weights on worker 0-0, policy_version 1083770 (0.00079) [2022-07-11 06:57:07,158][26022] Updated weights on worker 0-0, policy_version 1083780 (0.00089) [2022-07-11 06:57:07,574][25689] Fps is (10 sec: 5481.1, 60 sec: 5620.6, 300 sec: 5650.3). Total num frames: 1109791744. Throughput: 0: 5814.8. Samples: 1109796402. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:07,575][25689] Avg episode reward: [(0, '0.900')] [2022-07-11 06:57:09,370][26022] Updated weights on worker 0-0, policy_version 1083790 (0.00089) [2022-07-11 06:57:10,744][26022] Updated weights on worker 0-0, policy_version 1083800 (0.00085) [2022-07-11 06:57:12,652][25689] Fps is (10 sec: 5761.2, 60 sec: 5649.4, 300 sec: 5649.3). Total num frames: 1109820416. Throughput: 0: 4959.9. Samples: 1109813426. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:12,653][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 06:57:12,876][26022] Updated weights on worker 0-0, policy_version 1083810 (0.00085) [2022-07-11 06:57:14,414][26022] Updated weights on worker 0-0, policy_version 1083820 (0.00084) [2022-07-11 06:57:16,379][26022] Updated weights on worker 0-0, policy_version 1083830 (0.00082) [2022-07-11 06:57:17,691][25689] Fps is (10 sec: 5668.2, 60 sec: 5647.8, 300 sec: 5645.9). Total num frames: 1109849088. Throughput: 0: 5821.3. Samples: 1109847640. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:17,692][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 06:57:18,122][26022] Updated weights on worker 0-0, policy_version 1083840 (0.00086) [2022-07-11 06:57:19,892][26022] Updated weights on worker 0-0, policy_version 1083850 (0.00096) [2022-07-11 06:57:21,826][26022] Updated weights on worker 0-0, policy_version 1083860 (0.00049) [2022-07-11 06:57:22,721][25689] Fps is (10 sec: 5695.4, 60 sec: 5647.5, 300 sec: 5647.4). Total num frames: 1109877760. Throughput: 0: 5829.7. Samples: 1109881920. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:22,721][25689] Avg episode reward: [(0, '-0.833')] [2022-07-11 06:57:23,521][26022] Updated weights on worker 0-0, policy_version 1083870 (0.00084) [2022-07-11 06:57:25,309][26022] Updated weights on worker 0-0, policy_version 1083880 (0.00082) [2022-07-11 06:57:27,279][26022] Updated weights on worker 0-0, policy_version 1083890 (0.00096) [2022-07-11 06:57:27,787][25689] Fps is (10 sec: 5679.9, 60 sec: 5655.0, 300 sec: 5646.7). Total num frames: 1109906432. Throughput: 0: 5921.2. Samples: 1109916086. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:27,788][25689] Avg episode reward: [(0, '-2.150')] [2022-07-11 06:57:28,811][26022] Updated weights on worker 0-0, policy_version 1083900 (0.00093) [2022-07-11 06:57:30,850][26022] Updated weights on worker 0-0, policy_version 1083910 (0.00097) [2022-07-11 06:57:32,417][26022] Updated weights on worker 0-0, policy_version 1083920 (0.00090) [2022-07-11 06:57:32,789][25689] Fps is (10 sec: 5593.6, 60 sec: 5645.3, 300 sec: 5640.0). Total num frames: 1109934080. Throughput: 0: 5939.8. Samples: 1109933036. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:32,790][25689] Avg episode reward: [(0, '-1.162')] [2022-07-11 06:57:34,433][26022] Updated weights on worker 0-0, policy_version 1083930 (0.00085) [2022-07-11 06:57:36,169][26022] Updated weights on worker 0-0, policy_version 1083940 (0.00087) [2022-07-11 06:57:37,815][25689] Fps is (10 sec: 5718.6, 60 sec: 5660.0, 300 sec: 5646.6). Total num frames: 1109963776. Throughput: 0: 5951.1. Samples: 1109967396. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:37,815][25689] Avg episode reward: [(0, '-1.774')] [2022-07-11 06:57:37,894][26022] Updated weights on worker 0-0, policy_version 1083950 (0.00085) [2022-07-11 06:57:39,783][26022] Updated weights on worker 0-0, policy_version 1083960 (0.00086) [2022-07-11 06:57:41,616][26022] Updated weights on worker 0-0, policy_version 1083970 (0.00088) [2022-07-11 06:57:42,828][25689] Fps is (10 sec: 5814.5, 60 sec: 5665.3, 300 sec: 5651.1). Total num frames: 1109992448. Throughput: 0: 5942.3. Samples: 1110001402. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:42,830][25689] Avg episode reward: [(0, '-1.734')] [2022-07-11 06:57:43,346][26022] Updated weights on worker 0-0, policy_version 1083980 (0.00088) [2022-07-11 06:57:43,558][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:57:43,575][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001083982_1109997568.pth [2022-07-11 06:57:43,575][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001081993_1107960832.pth [2022-07-11 06:57:45,203][26022] Updated weights on worker 0-0, policy_version 1083990 (0.00085) [2022-07-11 06:57:46,811][26022] Updated weights on worker 0-0, policy_version 1084000 (0.00099) [2022-07-11 06:57:47,958][25689] Fps is (10 sec: 5552.7, 60 sec: 5644.9, 300 sec: 5640.1). Total num frames: 1110020096. Throughput: 0: 5086.9. Samples: 1110018692. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:47,958][25689] Avg episode reward: [(0, '-1.136')] [2022-07-11 06:57:48,871][26022] Updated weights on worker 0-0, policy_version 1084010 (0.00083) [2022-07-11 06:57:50,411][26022] Updated weights on worker 0-0, policy_version 1084020 (0.00082) [2022-07-11 06:57:52,263][26022] Updated weights on worker 0-0, policy_version 1084030 (0.00082) [2022-07-11 06:57:52,970][25689] Fps is (10 sec: 5654.2, 60 sec: 5649.0, 300 sec: 5647.2). Total num frames: 1110049792. Throughput: 0: 5943.8. Samples: 1110052984. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:52,970][25689] Avg episode reward: [(0, '-0.502')] [2022-07-11 06:57:54,063][26022] Updated weights on worker 0-0, policy_version 1084040 (0.00087) [2022-07-11 06:57:55,985][26022] Updated weights on worker 0-0, policy_version 1084050 (0.00088) [2022-07-11 06:57:57,733][26022] Updated weights on worker 0-0, policy_version 1084060 (0.00094) [2022-07-11 06:57:58,036][25689] Fps is (10 sec: 5892.9, 60 sec: 5665.4, 300 sec: 5646.2). Total num frames: 1110079488. Throughput: 0: 5942.0. Samples: 1110087552. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:57:58,037][25689] Avg episode reward: [(0, '0.552')] [2022-07-11 06:57:59,332][26022] Updated weights on worker 0-0, policy_version 1084070 (0.00080) [2022-07-11 06:58:01,205][26022] Updated weights on worker 0-0, policy_version 1084080 (0.00087) [2022-07-11 06:58:03,057][25689] Fps is (10 sec: 5684.7, 60 sec: 5700.5, 300 sec: 5650.1). Total num frames: 1110107136. Throughput: 0: 5118.1. Samples: 1110104938. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:03,058][25689] Avg episode reward: [(0, '0.189')] [2022-07-11 06:58:03,266][26022] Updated weights on worker 0-0, policy_version 1084090 (0.00086) [2022-07-11 06:58:05,088][26022] Updated weights on worker 0-0, policy_version 1084100 (0.00081) [2022-07-11 06:58:06,984][26022] Updated weights on worker 0-0, policy_version 1084110 (0.00084) [2022-07-11 06:58:08,185][25689] Fps is (10 sec: 5549.4, 60 sec: 5676.6, 300 sec: 5648.3). Total num frames: 1110135808. Throughput: 0: 5881.8. Samples: 1110137666. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:08,187][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 06:58:08,680][26022] Updated weights on worker 0-0, policy_version 1084120 (0.00085) [2022-07-11 06:58:10,461][26022] Updated weights on worker 0-0, policy_version 1084130 (0.00083) [2022-07-11 06:58:12,213][26022] Updated weights on worker 0-0, policy_version 1084140 (0.00087) [2022-07-11 06:58:13,228][25689] Fps is (10 sec: 5638.0, 60 sec: 5679.9, 300 sec: 5651.0). Total num frames: 1110164480. Throughput: 0: 5877.6. Samples: 1110172054. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:13,230][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 06:58:13,955][26022] Updated weights on worker 0-0, policy_version 1084150 (0.00106) [2022-07-11 06:58:15,996][26022] Updated weights on worker 0-0, policy_version 1084160 (0.00085) [2022-07-11 06:58:17,688][26022] Updated weights on worker 0-0, policy_version 1084170 (0.00086) [2022-07-11 06:58:18,241][25689] Fps is (10 sec: 5702.3, 60 sec: 5682.3, 300 sec: 5651.6). Total num frames: 1110193152. Throughput: 0: 5034.8. Samples: 1110189280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:18,243][25689] Avg episode reward: [(0, '0.398')] [2022-07-11 06:58:19,377][26022] Updated weights on worker 0-0, policy_version 1084180 (0.00086) [2022-07-11 06:58:21,218][26022] Updated weights on worker 0-0, policy_version 1084190 (0.00086) [2022-07-11 06:58:23,052][26022] Updated weights on worker 0-0, policy_version 1084200 (0.00086) [2022-07-11 06:58:23,262][25689] Fps is (10 sec: 5715.1, 60 sec: 5683.2, 300 sec: 5650.2). Total num frames: 1110221824. Throughput: 0: 5863.4. Samples: 1110223406. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:23,264][25689] Avg episode reward: [(0, '1.255')] [2022-07-11 06:58:24,870][26022] Updated weights on worker 0-0, policy_version 1084210 (0.00085) [2022-07-11 06:58:26,591][26022] Updated weights on worker 0-0, policy_version 1084220 (0.00083) [2022-07-11 06:58:28,340][25689] Fps is (10 sec: 5678.1, 60 sec: 5682.0, 300 sec: 5652.7). Total num frames: 1110250496. Throughput: 0: 5935.5. Samples: 1110257298. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:28,341][25689] Avg episode reward: [(0, '1.406')] [2022-07-11 06:58:28,433][26022] Updated weights on worker 0-0, policy_version 1084230 (0.00084) [2022-07-11 06:58:30,431][26022] Updated weights on worker 0-0, policy_version 1084240 (0.00079) [2022-07-11 06:58:32,169][26022] Updated weights on worker 0-0, policy_version 1084250 (0.00082) [2022-07-11 06:58:33,372][25689] Fps is (10 sec: 5570.6, 60 sec: 5679.3, 300 sec: 5648.9). Total num frames: 1110278144. Throughput: 0: 5079.8. Samples: 1110274382. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:33,373][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 06:58:33,833][26022] Updated weights on worker 0-0, policy_version 1084260 (0.00090) [2022-07-11 06:58:35,727][26022] Updated weights on worker 0-0, policy_version 1084270 (0.00090) [2022-07-11 06:58:37,311][26022] Updated weights on worker 0-0, policy_version 1084280 (0.00089) [2022-07-11 06:58:38,418][25689] Fps is (10 sec: 5690.3, 60 sec: 5677.3, 300 sec: 5651.7). Total num frames: 1110307840. Throughput: 0: 5916.7. Samples: 1110308658. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:38,418][25689] Avg episode reward: [(0, '1.088')] [2022-07-11 06:58:39,384][26022] Updated weights on worker 0-0, policy_version 1084290 (0.00092) [2022-07-11 06:58:40,993][26022] Updated weights on worker 0-0, policy_version 1084300 (0.00083) [2022-07-11 06:58:42,857][26022] Updated weights on worker 0-0, policy_version 1084310 (0.00090) [2022-07-11 06:58:43,435][25689] Fps is (10 sec: 5698.6, 60 sec: 5660.1, 300 sec: 5652.2). Total num frames: 1110335488. Throughput: 0: 5938.5. Samples: 1110343204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:43,435][25689] Avg episode reward: [(0, '1.560')] [2022-07-11 06:58:44,549][26022] Updated weights on worker 0-0, policy_version 1084320 (0.00078) [2022-07-11 06:58:46,592][26022] Updated weights on worker 0-0, policy_version 1084330 (0.00080) [2022-07-11 06:58:48,223][26022] Updated weights on worker 0-0, policy_version 1084340 (0.00079) [2022-07-11 06:58:48,526][25689] Fps is (10 sec: 5774.2, 60 sec: 5714.4, 300 sec: 5657.5). Total num frames: 1110366208. Throughput: 0: 5108.0. Samples: 1110360408. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:48,527][25689] Avg episode reward: [(0, '-0.630')] [2022-07-11 06:58:49,959][26022] Updated weights on worker 0-0, policy_version 1084350 (0.00089) [2022-07-11 06:58:51,889][26022] Updated weights on worker 0-0, policy_version 1084360 (0.00089) [2022-07-11 06:58:53,607][25689] Fps is (10 sec: 5738.2, 60 sec: 5674.2, 300 sec: 5652.7). Total num frames: 1110393856. Throughput: 0: 5912.3. Samples: 1110394016. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:53,607][25689] Avg episode reward: [(0, '-0.720')] [2022-07-11 06:58:53,675][26022] Updated weights on worker 0-0, policy_version 1084370 (0.00084) [2022-07-11 06:58:55,623][26022] Updated weights on worker 0-0, policy_version 1084380 (0.00097) [2022-07-11 06:58:57,173][26022] Updated weights on worker 0-0, policy_version 1084390 (0.00085) [2022-07-11 06:58:58,639][25689] Fps is (10 sec: 5468.1, 60 sec: 5643.6, 300 sec: 5652.2). Total num frames: 1110421504. Throughput: 0: 5920.8. Samples: 1110428382. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:58:58,639][25689] Avg episode reward: [(0, '-0.766')] [2022-07-11 06:58:59,269][26022] Updated weights on worker 0-0, policy_version 1084400 (0.00087) [2022-07-11 06:59:00,721][26022] Updated weights on worker 0-0, policy_version 1084410 (0.00082) [2022-07-11 06:59:03,148][26022] Updated weights on worker 0-0, policy_version 1084420 (0.00079) [2022-07-11 06:59:03,647][25689] Fps is (10 sec: 5507.4, 60 sec: 5644.8, 300 sec: 5653.3). Total num frames: 1110449152. Throughput: 0: 5054.8. Samples: 1110445372. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:03,648][25689] Avg episode reward: [(0, '-0.489')] [2022-07-11 06:59:04,928][26022] Updated weights on worker 0-0, policy_version 1084430 (0.00086) [2022-07-11 06:59:06,708][26022] Updated weights on worker 0-0, policy_version 1084440 (0.00084) [2022-07-11 06:59:08,515][26022] Updated weights on worker 0-0, policy_version 1084450 (0.00079) [2022-07-11 06:59:08,724][25689] Fps is (10 sec: 5685.8, 60 sec: 5666.4, 300 sec: 5652.1). Total num frames: 1110478848. Throughput: 0: 5792.6. Samples: 1110477406. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:08,725][25689] Avg episode reward: [(0, '-1.100')] [2022-07-11 06:59:10,494][26022] Updated weights on worker 0-0, policy_version 1084460 (0.00085) [2022-07-11 06:59:11,888][26022] Updated weights on worker 0-0, policy_version 1084470 (0.00088) [2022-07-11 06:59:13,823][25689] Fps is (10 sec: 5534.7, 60 sec: 5627.4, 300 sec: 5647.2). Total num frames: 1110505472. Throughput: 0: 5819.1. Samples: 1110511656. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:13,823][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 06:59:14,059][26022] Updated weights on worker 0-0, policy_version 1084480 (0.00086) [2022-07-11 06:59:15,425][26022] Updated weights on worker 0-0, policy_version 1084490 (0.00086) [2022-07-11 06:59:17,519][26022] Updated weights on worker 0-0, policy_version 1084500 (0.00095) [2022-07-11 06:59:18,827][25689] Fps is (10 sec: 5676.1, 60 sec: 5662.1, 300 sec: 5658.5). Total num frames: 1110536192. Throughput: 0: 4988.4. Samples: 1110529088. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:18,827][25689] Avg episode reward: [(0, '1.339')] [2022-07-11 06:59:19,207][26022] Updated weights on worker 0-0, policy_version 1084510 (0.00085) [2022-07-11 06:59:21,058][26022] Updated weights on worker 0-0, policy_version 1084520 (0.00083) [2022-07-11 06:59:22,700][26022] Updated weights on worker 0-0, policy_version 1084530 (0.00083) [2022-07-11 06:59:23,832][25689] Fps is (10 sec: 5831.7, 60 sec: 5646.6, 300 sec: 5649.8). Total num frames: 1110563840. Throughput: 0: 5844.1. Samples: 1110563332. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:23,832][25689] Avg episode reward: [(0, '0.517')] [2022-07-11 06:59:24,686][26022] Updated weights on worker 0-0, policy_version 1084540 (0.00082) [2022-07-11 06:59:26,307][26022] Updated weights on worker 0-0, policy_version 1084550 (0.00086) [2022-07-11 06:59:28,202][26022] Updated weights on worker 0-0, policy_version 1084560 (0.00086) [2022-07-11 06:59:28,890][25689] Fps is (10 sec: 5596.9, 60 sec: 5648.5, 300 sec: 5652.3). Total num frames: 1110592512. Throughput: 0: 5949.7. Samples: 1110597384. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:28,890][25689] Avg episode reward: [(0, '0.311')] [2022-07-11 06:59:30,061][26022] Updated weights on worker 0-0, policy_version 1084570 (0.00087) [2022-07-11 06:59:31,865][26022] Updated weights on worker 0-0, policy_version 1084580 (0.00086) [2022-07-11 06:59:33,551][26022] Updated weights on worker 0-0, policy_version 1084590 (0.00091) [2022-07-11 06:59:33,945][25689] Fps is (10 sec: 5771.2, 60 sec: 5680.1, 300 sec: 5655.9). Total num frames: 1110622208. Throughput: 0: 5114.2. Samples: 1110614566. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:33,946][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 06:59:35,435][26022] Updated weights on worker 0-0, policy_version 1084600 (0.00084) [2022-07-11 06:59:37,104][26022] Updated weights on worker 0-0, policy_version 1084610 (0.00086) [2022-07-11 06:59:39,033][25689] Fps is (10 sec: 5653.2, 60 sec: 5642.4, 300 sec: 5649.2). Total num frames: 1110649856. Throughput: 0: 5937.7. Samples: 1110649068. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:39,034][25689] Avg episode reward: [(0, '1.085')] [2022-07-11 06:59:39,098][26022] Updated weights on worker 0-0, policy_version 1084620 (0.00081) [2022-07-11 06:59:40,521][26022] Updated weights on worker 0-0, policy_version 1084630 (0.00085) [2022-07-11 06:59:42,618][26022] Updated weights on worker 0-0, policy_version 1084640 (0.00081) [2022-07-11 06:59:43,743][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 06:59:43,755][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001084647_1110678528.pth [2022-07-11 06:59:43,755][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001082657_1108640768.pth [2022-07-11 06:59:44,084][25689] Fps is (10 sec: 5756.8, 60 sec: 5689.9, 300 sec: 5657.2). Total num frames: 1110680576. Throughput: 0: 5943.0. Samples: 1110683694. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:44,086][25689] Avg episode reward: [(0, '0.795')] [2022-07-11 06:59:44,115][26022] Updated weights on worker 0-0, policy_version 1084650 (0.00089) [2022-07-11 06:59:46,245][26022] Updated weights on worker 0-0, policy_version 1084660 (0.00088) [2022-07-11 06:59:47,831][26022] Updated weights on worker 0-0, policy_version 1084670 (0.00063) [2022-07-11 06:59:49,201][25689] Fps is (10 sec: 5740.3, 60 sec: 5636.8, 300 sec: 5655.1). Total num frames: 1110708224. Throughput: 0: 5936.5. Samples: 1110717964. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:49,202][25689] Avg episode reward: [(0, '0.517')] [2022-07-11 06:59:49,673][26022] Updated weights on worker 0-0, policy_version 1084680 (0.00086) [2022-07-11 06:59:51,467][26022] Updated weights on worker 0-0, policy_version 1084690 (0.00081) [2022-07-11 06:59:53,335][26022] Updated weights on worker 0-0, policy_version 1084700 (0.00085) [2022-07-11 06:59:54,210][25689] Fps is (10 sec: 5663.2, 60 sec: 5677.3, 300 sec: 5659.9). Total num frames: 1110737920. Throughput: 0: 5943.2. Samples: 1110735004. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:54,210][25689] Avg episode reward: [(0, '0.531')] [2022-07-11 06:59:55,109][26022] Updated weights on worker 0-0, policy_version 1084710 (0.00086) [2022-07-11 06:59:56,862][26022] Updated weights on worker 0-0, policy_version 1084720 (0.00097) [2022-07-11 06:59:58,474][26022] Updated weights on worker 0-0, policy_version 1084730 (0.00082) [2022-07-11 06:59:59,270][25689] Fps is (10 sec: 5695.4, 60 sec: 5674.7, 300 sec: 5652.0). Total num frames: 1110765568. Throughput: 0: 5950.0. Samples: 1110769476. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 06:59:59,270][25689] Avg episode reward: [(0, '0.469')] [2022-07-11 07:00:00,518][26022] Updated weights on worker 0-0, policy_version 1084740 (0.01105) [2022-07-11 07:00:02,622][26022] Updated weights on worker 0-0, policy_version 1084750 (0.00088) [2022-07-11 07:00:04,271][25689] Fps is (10 sec: 5394.3, 60 sec: 5658.5, 300 sec: 5653.1). Total num frames: 1110792192. Throughput: 0: 5852.2. Samples: 1110801830. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 07:00:04,271][25689] Avg episode reward: [(0, '0.407')] [2022-07-11 07:00:04,547][26022] Updated weights on worker 0-0, policy_version 1084760 (0.00086) [2022-07-11 07:00:06,207][26022] Updated weights on worker 0-0, policy_version 1084770 (0.00081) [2022-07-11 07:00:08,000][26022] Updated weights on worker 0-0, policy_version 1084780 (0.00080) [2022-07-11 07:00:09,331][25689] Fps is (10 sec: 5495.9, 60 sec: 5643.2, 300 sec: 5653.6). Total num frames: 1110820864. Throughput: 0: 5027.1. Samples: 1110819158. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 07:00:09,332][25689] Avg episode reward: [(0, '0.596')] [2022-07-11 07:00:09,784][26022] Updated weights on worker 0-0, policy_version 1084790 (0.00091) [2022-07-11 07:00:11,479][26022] Updated weights on worker 0-0, policy_version 1084800 (0.00088) [2022-07-11 07:00:13,451][26022] Updated weights on worker 0-0, policy_version 1084810 (0.00080) [2022-07-11 07:00:14,359][25689] Fps is (10 sec: 5887.5, 60 sec: 5717.5, 300 sec: 5660.3). Total num frames: 1110851584. Throughput: 0: 5885.1. Samples: 1110853582. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 07:00:14,359][25689] Avg episode reward: [(0, '0.468')] [2022-07-11 07:00:15,093][26022] Updated weights on worker 0-0, policy_version 1084820 (0.00057) [2022-07-11 07:00:17,021][26022] Updated weights on worker 0-0, policy_version 1084830 (0.00080) [2022-07-11 07:00:18,767][26022] Updated weights on worker 0-0, policy_version 1084840 (0.00096) [2022-07-11 07:00:19,373][25689] Fps is (10 sec: 5914.7, 60 sec: 5682.7, 300 sec: 5664.3). Total num frames: 1110880256. Throughput: 0: 5915.0. Samples: 1110888384. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 07:00:19,373][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 07:00:20,372][26022] Updated weights on worker 0-0, policy_version 1084850 (0.00088) [2022-07-11 07:00:22,231][26022] Updated weights on worker 0-0, policy_version 1084860 (0.00510) [2022-07-11 07:00:24,079][26022] Updated weights on worker 0-0, policy_version 1084870 (0.00081) [2022-07-11 07:00:24,411][25689] Fps is (10 sec: 5602.7, 60 sec: 5679.6, 300 sec: 5655.1). Total num frames: 1110907904. Throughput: 0: 5133.9. Samples: 1110905226. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 07:00:24,411][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 07:00:26,173][26022] Updated weights on worker 0-0, policy_version 1084880 (0.00086) [2022-07-11 07:00:27,616][26022] Updated weights on worker 0-0, policy_version 1084890 (0.00091) [2022-07-11 07:00:29,483][25689] Fps is (10 sec: 5570.3, 60 sec: 5678.2, 300 sec: 5661.8). Total num frames: 1110936576. Throughput: 0: 5942.5. Samples: 1110938912. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 07:00:29,484][25689] Avg episode reward: [(0, '1.267')] [2022-07-11 07:00:29,627][26022] Updated weights on worker 0-0, policy_version 1084900 (0.00081) [2022-07-11 07:00:31,453][26022] Updated weights on worker 0-0, policy_version 1084910 (0.00091) [2022-07-11 07:00:33,116][26022] Updated weights on worker 0-0, policy_version 1084920 (0.00084) [2022-07-11 07:00:34,503][25689] Fps is (10 sec: 5682.3, 60 sec: 5664.7, 300 sec: 5654.6). Total num frames: 1110965248. Throughput: 0: 5932.2. Samples: 1110973080. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:00:34,504][25689] Avg episode reward: [(0, '1.361')] [2022-07-11 07:00:35,060][26022] Updated weights on worker 0-0, policy_version 1084930 (0.00080) [2022-07-11 07:00:36,698][26022] Updated weights on worker 0-0, policy_version 1084940 (0.00091) [2022-07-11 07:00:38,719][26022] Updated weights on worker 0-0, policy_version 1084950 (0.00094) [2022-07-11 07:00:39,507][25689] Fps is (10 sec: 5822.9, 60 sec: 5706.4, 300 sec: 5661.8). Total num frames: 1110994944. Throughput: 0: 5058.5. Samples: 1110990234. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:00:39,508][25689] Avg episode reward: [(0, '0.738')] [2022-07-11 07:00:40,384][26022] Updated weights on worker 0-0, policy_version 1084960 (0.00091) [2022-07-11 07:00:42,337][26022] Updated weights on worker 0-0, policy_version 1084970 (0.00081) [2022-07-11 07:00:43,951][26022] Updated weights on worker 0-0, policy_version 1084980 (0.00090) [2022-07-11 07:00:44,535][25689] Fps is (10 sec: 5511.7, 60 sec: 5623.9, 300 sec: 5652.4). Total num frames: 1111020544. Throughput: 0: 5892.1. Samples: 1111023800. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:00:44,536][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 07:00:45,753][26022] Updated weights on worker 0-0, policy_version 1084990 (0.00086) [2022-07-11 07:00:47,947][26022] Updated weights on worker 0-0, policy_version 1085000 (0.00084) [2022-07-11 07:00:49,503][26022] Updated weights on worker 0-0, policy_version 1085010 (0.00085) [2022-07-11 07:00:49,605][25689] Fps is (10 sec: 5476.1, 60 sec: 5662.2, 300 sec: 5651.3). Total num frames: 1111050240. Throughput: 0: 5915.8. Samples: 1111057948. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:00:49,605][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 07:00:51,504][26022] Updated weights on worker 0-0, policy_version 1085020 (0.00090) [2022-07-11 07:00:52,990][26022] Updated weights on worker 0-0, policy_version 1085030 (0.00083) [2022-07-11 07:00:54,639][25689] Fps is (10 sec: 5776.9, 60 sec: 5642.9, 300 sec: 5662.6). Total num frames: 1111078912. Throughput: 0: 5061.8. Samples: 1111075004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:00:54,640][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 07:00:55,015][26022] Updated weights on worker 0-0, policy_version 1085040 (0.00090) [2022-07-11 07:00:56,927][26022] Updated weights on worker 0-0, policy_version 1085050 (0.00092) [2022-07-11 07:00:58,486][26022] Updated weights on worker 0-0, policy_version 1085060 (0.00093) [2022-07-11 07:00:59,668][25689] Fps is (10 sec: 5698.5, 60 sec: 5662.7, 300 sec: 5655.5). Total num frames: 1111107584. Throughput: 0: 5884.3. Samples: 1111108866. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:00:59,669][25689] Avg episode reward: [(0, '0.861')] [2022-07-11 07:01:00,258][26022] Updated weights on worker 0-0, policy_version 1085070 (0.00086) [2022-07-11 07:01:02,532][26022] Updated weights on worker 0-0, policy_version 1085080 (0.00086) [2022-07-11 07:01:04,275][26022] Updated weights on worker 0-0, policy_version 1085090 (0.00082) [2022-07-11 07:01:04,702][25689] Fps is (10 sec: 5393.0, 60 sec: 5642.6, 300 sec: 5649.6). Total num frames: 1111133184. Throughput: 0: 5813.1. Samples: 1111141034. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:04,703][25689] Avg episode reward: [(0, '0.567')] [2022-07-11 07:01:06,183][26022] Updated weights on worker 0-0, policy_version 1085100 (0.00082) [2022-07-11 07:01:07,922][26022] Updated weights on worker 0-0, policy_version 1085110 (0.00086) [2022-07-11 07:01:09,648][26022] Updated weights on worker 0-0, policy_version 1085120 (0.00084) [2022-07-11 07:01:09,760][25689] Fps is (10 sec: 5580.8, 60 sec: 5676.8, 300 sec: 5662.7). Total num frames: 1111163904. Throughput: 0: 4966.3. Samples: 1111158042. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:09,760][25689] Avg episode reward: [(0, '0.474')] [2022-07-11 07:01:11,542][26022] Updated weights on worker 0-0, policy_version 1085130 (0.00095) [2022-07-11 07:01:13,423][26022] Updated weights on worker 0-0, policy_version 1085140 (0.00083) [2022-07-11 07:01:14,769][25689] Fps is (10 sec: 5798.0, 60 sec: 5627.6, 300 sec: 5659.5). Total num frames: 1111191552. Throughput: 0: 5835.6. Samples: 1111192476. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:14,770][25689] Avg episode reward: [(0, '1.598')] [2022-07-11 07:01:15,024][26022] Updated weights on worker 0-0, policy_version 1085150 (0.00081) [2022-07-11 07:01:16,763][26022] Updated weights on worker 0-0, policy_version 1085160 (0.00090) [2022-07-11 07:01:18,585][26022] Updated weights on worker 0-0, policy_version 1085170 (0.00081) [2022-07-11 07:01:19,794][25689] Fps is (10 sec: 5612.9, 60 sec: 5626.6, 300 sec: 5659.6). Total num frames: 1111220224. Throughput: 0: 5881.3. Samples: 1111227232. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:19,795][25689] Avg episode reward: [(0, '1.017')] [2022-07-11 07:01:20,323][26022] Updated weights on worker 0-0, policy_version 1085180 (0.00092) [2022-07-11 07:01:22,450][26022] Updated weights on worker 0-0, policy_version 1085190 (0.00097) [2022-07-11 07:01:24,010][26022] Updated weights on worker 0-0, policy_version 1085200 (0.00094) [2022-07-11 07:01:24,819][25689] Fps is (10 sec: 5604.1, 60 sec: 5627.9, 300 sec: 5658.4). Total num frames: 1111247872. Throughput: 0: 5119.0. Samples: 1111244012. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:24,819][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 07:01:26,045][26022] Updated weights on worker 0-0, policy_version 1085210 (0.00091) [2022-07-11 07:01:27,629][26022] Updated weights on worker 0-0, policy_version 1085220 (0.00086) [2022-07-11 07:01:29,579][26022] Updated weights on worker 0-0, policy_version 1085230 (0.00089) [2022-07-11 07:01:29,888][25689] Fps is (10 sec: 5680.8, 60 sec: 5645.1, 300 sec: 5662.1). Total num frames: 1111277568. Throughput: 0: 5937.9. Samples: 1111277562. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:29,888][25689] Avg episode reward: [(0, '1.283')] [2022-07-11 07:01:31,443][26022] Updated weights on worker 0-0, policy_version 1085240 (0.00091) [2022-07-11 07:01:33,304][26022] Updated weights on worker 0-0, policy_version 1085250 (0.00086) [2022-07-11 07:01:34,811][26022] Updated weights on worker 0-0, policy_version 1085260 (0.00088) [2022-07-11 07:01:34,891][25689] Fps is (10 sec: 5795.2, 60 sec: 5646.7, 300 sec: 5662.1). Total num frames: 1111306240. Throughput: 0: 5909.3. Samples: 1111311380. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:34,892][25689] Avg episode reward: [(0, '1.397')] [2022-07-11 07:01:36,926][26022] Updated weights on worker 0-0, policy_version 1085270 (0.00093) [2022-07-11 07:01:38,467][26022] Updated weights on worker 0-0, policy_version 1085280 (0.00091) [2022-07-11 07:01:39,909][25689] Fps is (10 sec: 5517.9, 60 sec: 5594.5, 300 sec: 5656.2). Total num frames: 1111332864. Throughput: 0: 5034.1. Samples: 1111328496. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:39,911][25689] Avg episode reward: [(0, '1.551')] [2022-07-11 07:01:40,542][26022] Updated weights on worker 0-0, policy_version 1085290 (0.00087) [2022-07-11 07:01:42,059][26022] Updated weights on worker 0-0, policy_version 1085300 (0.00081) [2022-07-11 07:01:43,898][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:01:43,906][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001085309_1111356416.pth [2022-07-11 07:01:43,907][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001083320_1109319680.pth [2022-07-11 07:01:44,042][26022] Updated weights on worker 0-0, policy_version 1085310 (0.00088) [2022-07-11 07:01:44,950][25689] Fps is (10 sec: 5700.6, 60 sec: 5678.1, 300 sec: 5664.0). Total num frames: 1111363584. Throughput: 0: 5914.9. Samples: 1111363084. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:44,951][25689] Avg episode reward: [(0, '1.413')] [2022-07-11 07:01:45,814][26022] Updated weights on worker 0-0, policy_version 1085320 (0.00083) [2022-07-11 07:01:47,613][26022] Updated weights on worker 0-0, policy_version 1085330 (0.00080) [2022-07-11 07:01:49,463][26022] Updated weights on worker 0-0, policy_version 1085340 (0.00080) [2022-07-11 07:01:50,063][25689] Fps is (10 sec: 5849.4, 60 sec: 5657.1, 300 sec: 5659.5). Total num frames: 1111392256. Throughput: 0: 5937.9. Samples: 1111397358. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:50,063][25689] Avg episode reward: [(0, '1.189')] [2022-07-11 07:01:51,107][26022] Updated weights on worker 0-0, policy_version 1085350 (0.00087) [2022-07-11 07:01:52,846][26022] Updated weights on worker 0-0, policy_version 1085360 (0.00085) [2022-07-11 07:01:54,818][26022] Updated weights on worker 0-0, policy_version 1085370 (0.00086) [2022-07-11 07:01:55,115][25689] Fps is (10 sec: 5540.5, 60 sec: 5638.4, 300 sec: 5656.2). Total num frames: 1111419904. Throughput: 0: 5109.8. Samples: 1111414724. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:01:55,116][25689] Avg episode reward: [(0, '1.958')] [2022-07-11 07:01:56,558][26022] Updated weights on worker 0-0, policy_version 1085380 (0.00083) [2022-07-11 07:01:58,351][26022] Updated weights on worker 0-0, policy_version 1085390 (0.00081) [2022-07-11 07:02:00,132][25689] Fps is (10 sec: 5593.2, 60 sec: 5639.6, 300 sec: 5666.9). Total num frames: 1111448576. Throughput: 0: 5937.1. Samples: 1111448562. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:00,133][25689] Avg episode reward: [(0, '1.802')] [2022-07-11 07:02:00,172][26022] Updated weights on worker 0-0, policy_version 1085400 (0.00085) [2022-07-11 07:02:01,877][26022] Updated weights on worker 0-0, policy_version 1085410 (0.00086) [2022-07-11 07:02:04,201][26022] Updated weights on worker 0-0, policy_version 1085420 (0.00086) [2022-07-11 07:02:05,158][25689] Fps is (10 sec: 5506.0, 60 sec: 5657.3, 300 sec: 5657.1). Total num frames: 1111475200. Throughput: 0: 5803.2. Samples: 1111480356. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:05,158][25689] Avg episode reward: [(0, '0.156')] [2022-07-11 07:02:06,029][26022] Updated weights on worker 0-0, policy_version 1085430 (0.00086) [2022-07-11 07:02:07,722][26022] Updated weights on worker 0-0, policy_version 1085440 (0.00082) [2022-07-11 07:02:09,623][26022] Updated weights on worker 0-0, policy_version 1085450 (0.00087) [2022-07-11 07:02:10,220][25689] Fps is (10 sec: 5380.0, 60 sec: 5606.1, 300 sec: 5654.0). Total num frames: 1111502848. Throughput: 0: 4972.5. Samples: 1111497590. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:10,220][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 07:02:11,208][26022] Updated weights on worker 0-0, policy_version 1085460 (0.00084) [2022-07-11 07:02:13,253][26022] Updated weights on worker 0-0, policy_version 1085470 (0.00086) [2022-07-11 07:02:14,994][26022] Updated weights on worker 0-0, policy_version 1085480 (0.00085) [2022-07-11 07:02:15,249][25689] Fps is (10 sec: 5682.4, 60 sec: 5638.1, 300 sec: 5657.6). Total num frames: 1111532544. Throughput: 0: 5815.7. Samples: 1111531820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:15,250][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 07:02:16,748][26022] Updated weights on worker 0-0, policy_version 1085490 (0.00080) [2022-07-11 07:02:18,715][26022] Updated weights on worker 0-0, policy_version 1085500 (0.00084) [2022-07-11 07:02:20,118][26022] Updated weights on worker 0-0, policy_version 1085510 (0.00089) [2022-07-11 07:02:20,268][25689] Fps is (10 sec: 6012.7, 60 sec: 5672.5, 300 sec: 5664.7). Total num frames: 1111563264. Throughput: 0: 5838.0. Samples: 1111566116. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:20,268][25689] Avg episode reward: [(0, '-0.323')] [2022-07-11 07:02:22,207][26022] Updated weights on worker 0-0, policy_version 1085520 (0.00083) [2022-07-11 07:02:23,727][26022] Updated weights on worker 0-0, policy_version 1085530 (0.00083) [2022-07-11 07:02:25,272][25689] Fps is (10 sec: 5619.6, 60 sec: 5640.6, 300 sec: 5655.5). Total num frames: 1111588864. Throughput: 0: 5112.4. Samples: 1111583184. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:25,272][25689] Avg episode reward: [(0, '-0.613')] [2022-07-11 07:02:25,763][26022] Updated weights on worker 0-0, policy_version 1085540 (0.00085) [2022-07-11 07:02:27,735][26022] Updated weights on worker 0-0, policy_version 1085550 (0.00088) [2022-07-11 07:02:29,349][26022] Updated weights on worker 0-0, policy_version 1085560 (0.00082) [2022-07-11 07:02:30,332][25689] Fps is (10 sec: 5494.3, 60 sec: 5641.4, 300 sec: 5661.3). Total num frames: 1111618560. Throughput: 0: 5947.1. Samples: 1111617200. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:30,333][25689] Avg episode reward: [(0, '-0.638')] [2022-07-11 07:02:31,199][26022] Updated weights on worker 0-0, policy_version 1085570 (0.00090) [2022-07-11 07:02:32,969][26022] Updated weights on worker 0-0, policy_version 1085580 (0.00089) [2022-07-11 07:02:34,506][26022] Updated weights on worker 0-0, policy_version 1085590 (0.00086) [2022-07-11 07:02:35,360][25689] Fps is (10 sec: 5785.7, 60 sec: 5639.1, 300 sec: 5657.8). Total num frames: 1111647232. Throughput: 0: 5947.3. Samples: 1111651422. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:35,360][25689] Avg episode reward: [(0, '0.662')] [2022-07-11 07:02:36,617][26022] Updated weights on worker 0-0, policy_version 1085600 (0.00086) [2022-07-11 07:02:38,266][26022] Updated weights on worker 0-0, policy_version 1085610 (0.00082) [2022-07-11 07:02:40,210][26022] Updated weights on worker 0-0, policy_version 1085620 (0.00087) [2022-07-11 07:02:40,379][25689] Fps is (10 sec: 5605.9, 60 sec: 5656.0, 300 sec: 5654.3). Total num frames: 1111674880. Throughput: 0: 5094.2. Samples: 1111668564. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:40,379][25689] Avg episode reward: [(0, '0.563')] [2022-07-11 07:02:41,824][26022] Updated weights on worker 0-0, policy_version 1085630 (0.00090) [2022-07-11 07:02:43,863][26022] Updated weights on worker 0-0, policy_version 1085640 (0.00079) [2022-07-11 07:02:45,385][25689] Fps is (10 sec: 5720.2, 60 sec: 5642.3, 300 sec: 5663.5). Total num frames: 1111704576. Throughput: 0: 5926.4. Samples: 1111702382. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:45,385][25689] Avg episode reward: [(0, '0.907')] [2022-07-11 07:02:45,486][26022] Updated weights on worker 0-0, policy_version 1085650 (0.00087) [2022-07-11 07:02:47,591][26022] Updated weights on worker 0-0, policy_version 1085660 (0.00083) [2022-07-11 07:02:49,154][26022] Updated weights on worker 0-0, policy_version 1085670 (0.00088) [2022-07-11 07:02:50,467][25689] Fps is (10 sec: 5582.5, 60 sec: 5611.2, 300 sec: 5651.8). Total num frames: 1111731200. Throughput: 0: 5901.7. Samples: 1111736032. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:50,468][25689] Avg episode reward: [(0, '1.245')] [2022-07-11 07:02:51,314][26022] Updated weights on worker 0-0, policy_version 1085680 (0.00090) [2022-07-11 07:02:52,881][26022] Updated weights on worker 0-0, policy_version 1085690 (0.00094) [2022-07-11 07:02:54,756][26022] Updated weights on worker 0-0, policy_version 1085700 (0.00091) [2022-07-11 07:02:55,480][25689] Fps is (10 sec: 5578.7, 60 sec: 5648.8, 300 sec: 5652.9). Total num frames: 1111760896. Throughput: 0: 5890.7. Samples: 1111769946. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:02:55,482][25689] Avg episode reward: [(0, '1.493')] [2022-07-11 07:02:56,581][26022] Updated weights on worker 0-0, policy_version 1085710 (0.00094) [2022-07-11 07:02:58,539][26022] Updated weights on worker 0-0, policy_version 1085720 (0.00086) [2022-07-11 07:03:00,201][26022] Updated weights on worker 0-0, policy_version 1085730 (0.00091) [2022-07-11 07:03:00,487][25689] Fps is (10 sec: 5722.9, 60 sec: 5632.8, 300 sec: 5653.1). Total num frames: 1111788544. Throughput: 0: 5883.6. Samples: 1111786876. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:00,488][25689] Avg episode reward: [(0, '1.440')] [2022-07-11 07:03:02,487][26022] Updated weights on worker 0-0, policy_version 1085740 (0.00076) [2022-07-11 07:03:04,030][26022] Updated weights on worker 0-0, policy_version 1085750 (0.00100) [2022-07-11 07:03:05,511][25689] Fps is (10 sec: 5308.7, 60 sec: 5616.0, 300 sec: 5644.8). Total num frames: 1111814144. Throughput: 0: 5770.3. Samples: 1111818514. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:05,512][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 07:03:06,248][26022] Updated weights on worker 0-0, policy_version 1085760 (0.00095) [2022-07-11 07:03:07,784][26022] Updated weights on worker 0-0, policy_version 1085770 (0.00095) [2022-07-11 07:03:09,674][26022] Updated weights on worker 0-0, policy_version 1085780 (0.00083) [2022-07-11 07:03:10,598][25689] Fps is (10 sec: 5469.4, 60 sec: 5647.6, 300 sec: 5647.4). Total num frames: 1111843840. Throughput: 0: 5785.8. Samples: 1111852502. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:10,599][25689] Avg episode reward: [(0, '0.213')] [2022-07-11 07:03:11,371][26022] Updated weights on worker 0-0, policy_version 1085790 (0.00092) [2022-07-11 07:03:13,387][26022] Updated weights on worker 0-0, policy_version 1085800 (0.00083) [2022-07-11 07:03:15,055][26022] Updated weights on worker 0-0, policy_version 1085810 (0.00084) [2022-07-11 07:03:15,600][25689] Fps is (10 sec: 5784.9, 60 sec: 5633.2, 300 sec: 5647.6). Total num frames: 1111872512. Throughput: 0: 4947.0. Samples: 1111869480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:15,602][25689] Avg episode reward: [(0, '0.733')] [2022-07-11 07:03:16,976][26022] Updated weights on worker 0-0, policy_version 1085820 (0.00081) [2022-07-11 07:03:18,663][26022] Updated weights on worker 0-0, policy_version 1085830 (0.00080) [2022-07-11 07:03:20,607][25689] Fps is (10 sec: 5524.6, 60 sec: 5566.4, 300 sec: 5641.0). Total num frames: 1111899136. Throughput: 0: 5795.5. Samples: 1111903476. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:20,616][25689] Avg episode reward: [(0, '0.658')] [2022-07-11 07:03:20,652][26022] Updated weights on worker 0-0, policy_version 1085840 (0.00086) [2022-07-11 07:03:22,346][26022] Updated weights on worker 0-0, policy_version 1085850 (0.00087) [2022-07-11 07:03:24,272][26022] Updated weights on worker 0-0, policy_version 1085860 (0.00093) [2022-07-11 07:03:25,622][25689] Fps is (10 sec: 5620.0, 60 sec: 5633.3, 300 sec: 5645.6). Total num frames: 1111928832. Throughput: 0: 5924.2. Samples: 1111937656. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:25,622][25689] Avg episode reward: [(0, '0.314')] [2022-07-11 07:03:25,941][26022] Updated weights on worker 0-0, policy_version 1085870 (0.00093) [2022-07-11 07:03:27,862][26022] Updated weights on worker 0-0, policy_version 1085880 (0.00101) [2022-07-11 07:03:29,719][26022] Updated weights on worker 0-0, policy_version 1085890 (0.00095) [2022-07-11 07:03:30,747][25689] Fps is (10 sec: 5655.3, 60 sec: 5593.4, 300 sec: 5643.8). Total num frames: 1111956480. Throughput: 0: 5036.9. Samples: 1111953990. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:30,747][25689] Avg episode reward: [(0, '1.292')] [2022-07-11 07:03:31,520][26022] Updated weights on worker 0-0, policy_version 1085900 (0.00086) [2022-07-11 07:03:33,339][26022] Updated weights on worker 0-0, policy_version 1085910 (0.00089) [2022-07-11 07:03:35,097][26022] Updated weights on worker 0-0, policy_version 1085920 (0.00085) [2022-07-11 07:03:35,812][25689] Fps is (10 sec: 5526.8, 60 sec: 5589.9, 300 sec: 5640.0). Total num frames: 1111985152. Throughput: 0: 5847.4. Samples: 1111987666. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:35,813][25689] Avg episode reward: [(0, '1.063')] [2022-07-11 07:03:36,887][26022] Updated weights on worker 0-0, policy_version 1085930 (0.00083) [2022-07-11 07:03:38,762][26022] Updated weights on worker 0-0, policy_version 1085940 (0.00087) [2022-07-11 07:03:40,782][26022] Updated weights on worker 0-0, policy_version 1085950 (0.00085) [2022-07-11 07:03:40,826][25689] Fps is (10 sec: 5587.6, 60 sec: 5590.3, 300 sec: 5640.1). Total num frames: 1112012800. Throughput: 0: 5851.1. Samples: 1112021782. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:40,826][25689] Avg episode reward: [(0, '0.068')] [2022-07-11 07:03:42,370][26022] Updated weights on worker 0-0, policy_version 1085960 (0.00085) [2022-07-11 07:03:44,011][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:03:44,026][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001085968_1112031232.pth [2022-07-11 07:03:44,027][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001083982_1109997568.pth [2022-07-11 07:03:44,376][26022] Updated weights on worker 0-0, policy_version 1085970 (0.00086) [2022-07-11 07:03:45,857][25689] Fps is (10 sec: 5606.6, 60 sec: 5571.1, 300 sec: 5634.3). Total num frames: 1112041472. Throughput: 0: 4991.9. Samples: 1112038670. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:45,858][25689] Avg episode reward: [(0, '-0.076')] [2022-07-11 07:03:46,018][26022] Updated weights on worker 0-0, policy_version 1085980 (0.00091) [2022-07-11 07:03:47,790][26022] Updated weights on worker 0-0, policy_version 1085990 (0.00088) [2022-07-11 07:03:49,662][26022] Updated weights on worker 0-0, policy_version 1086000 (0.00085) [2022-07-11 07:03:50,907][25689] Fps is (10 sec: 5790.1, 60 sec: 5624.9, 300 sec: 5641.8). Total num frames: 1112071168. Throughput: 0: 5883.7. Samples: 1112072606. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:50,907][25689] Avg episode reward: [(0, '-0.141')] [2022-07-11 07:03:51,397][26022] Updated weights on worker 0-0, policy_version 1086010 (0.00084) [2022-07-11 07:03:53,347][26022] Updated weights on worker 0-0, policy_version 1086020 (0.00078) [2022-07-11 07:03:55,211][26022] Updated weights on worker 0-0, policy_version 1086030 (0.00089) [2022-07-11 07:03:55,926][25689] Fps is (10 sec: 5593.5, 60 sec: 5573.5, 300 sec: 5638.6). Total num frames: 1112097792. Throughput: 0: 5883.3. Samples: 1112106004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:03:55,927][25689] Avg episode reward: [(0, '-0.047')] [2022-07-11 07:03:56,915][26022] Updated weights on worker 0-0, policy_version 1086040 (0.00090) [2022-07-11 07:03:58,999][26022] Updated weights on worker 0-0, policy_version 1086050 (0.00083) [2022-07-11 07:04:00,652][26022] Updated weights on worker 0-0, policy_version 1086060 (0.00089) [2022-07-11 07:04:00,938][25689] Fps is (10 sec: 5410.4, 60 sec: 5573.1, 300 sec: 5638.5). Total num frames: 1112125440. Throughput: 0: 5015.5. Samples: 1112122652. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:00,938][25689] Avg episode reward: [(0, '0.197')] [2022-07-11 07:04:02,881][26022] Updated weights on worker 0-0, policy_version 1086070 (0.00086) [2022-07-11 07:04:04,708][26022] Updated weights on worker 0-0, policy_version 1086080 (0.00101) [2022-07-11 07:04:05,955][25689] Fps is (10 sec: 5411.9, 60 sec: 5590.6, 300 sec: 5629.4). Total num frames: 1112152064. Throughput: 0: 5747.5. Samples: 1112154178. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:05,955][25689] Avg episode reward: [(0, '1.058')] [2022-07-11 07:04:06,571][26022] Updated weights on worker 0-0, policy_version 1086090 (0.00094) [2022-07-11 07:04:08,502][26022] Updated weights on worker 0-0, policy_version 1086100 (0.00084) [2022-07-11 07:04:10,106][26022] Updated weights on worker 0-0, policy_version 1086110 (0.00085) [2022-07-11 07:04:11,078][25689] Fps is (10 sec: 5453.2, 60 sec: 5570.3, 300 sec: 5635.8). Total num frames: 1112180736. Throughput: 0: 5723.7. Samples: 1112188060. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:11,078][25689] Avg episode reward: [(0, '0.256')] [2022-07-11 07:04:12,135][26022] Updated weights on worker 0-0, policy_version 1086120 (0.00089) [2022-07-11 07:04:13,809][26022] Updated weights on worker 0-0, policy_version 1086130 (0.00085) [2022-07-11 07:04:15,734][26022] Updated weights on worker 0-0, policy_version 1086140 (0.00086) [2022-07-11 07:04:16,109][25689] Fps is (10 sec: 5647.0, 60 sec: 5567.7, 300 sec: 5628.4). Total num frames: 1112209408. Throughput: 0: 4899.4. Samples: 1112204890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:16,110][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 07:04:17,433][26022] Updated weights on worker 0-0, policy_version 1086150 (0.00085) [2022-07-11 07:04:19,367][26022] Updated weights on worker 0-0, policy_version 1086160 (0.00092) [2022-07-11 07:04:21,089][26022] Updated weights on worker 0-0, policy_version 1086170 (0.00085) [2022-07-11 07:04:21,161][25689] Fps is (10 sec: 5687.2, 60 sec: 5597.4, 300 sec: 5630.9). Total num frames: 1112238080. Throughput: 0: 5736.7. Samples: 1112238666. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:21,161][25689] Avg episode reward: [(0, '0.232')] [2022-07-11 07:04:23,134][26022] Updated weights on worker 0-0, policy_version 1086180 (0.00085) [2022-07-11 07:04:24,542][26022] Updated weights on worker 0-0, policy_version 1086190 (0.00086) [2022-07-11 07:04:26,239][25689] Fps is (10 sec: 5459.0, 60 sec: 5540.9, 300 sec: 5623.7). Total num frames: 1112264704. Throughput: 0: 5837.9. Samples: 1112272594. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:26,239][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 07:04:26,784][26022] Updated weights on worker 0-0, policy_version 1086200 (0.00081) [2022-07-11 07:04:28,363][26022] Updated weights on worker 0-0, policy_version 1086210 (0.00087) [2022-07-11 07:04:30,367][26022] Updated weights on worker 0-0, policy_version 1086220 (0.00080) [2022-07-11 07:04:31,351][25689] Fps is (10 sec: 5627.4, 60 sec: 5592.7, 300 sec: 5626.0). Total num frames: 1112295424. Throughput: 0: 4994.3. Samples: 1112289310. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:04:31,352][25689] Avg episode reward: [(0, '-0.160')] [2022-07-11 07:04:32,048][26022] Updated weights on worker 0-0, policy_version 1086230 (0.00126) [2022-07-11 07:04:33,990][26022] Updated weights on worker 0-0, policy_version 1086240 (0.00094) [2022-07-11 07:04:35,799][26022] Updated weights on worker 0-0, policy_version 1086250 (0.00083) [2022-07-11 07:04:36,377][25689] Fps is (10 sec: 5757.1, 60 sec: 5579.5, 300 sec: 5627.2). Total num frames: 1112323072. Throughput: 0: 5819.5. Samples: 1112322838. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:04:36,378][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 07:04:37,633][26022] Updated weights on worker 0-0, policy_version 1086260 (0.00080) [2022-07-11 07:04:39,363][26022] Updated weights on worker 0-0, policy_version 1086270 (0.00410) [2022-07-11 07:04:41,263][26022] Updated weights on worker 0-0, policy_version 1086280 (0.00079) [2022-07-11 07:04:41,404][25689] Fps is (10 sec: 5602.4, 60 sec: 5595.2, 300 sec: 5620.8). Total num frames: 1112351744. Throughput: 0: 5840.2. Samples: 1112356890. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:04:41,404][25689] Avg episode reward: [(0, '0.726')] [2022-07-11 07:04:43,118][26022] Updated weights on worker 0-0, policy_version 1086290 (0.00086) [2022-07-11 07:04:44,898][26022] Updated weights on worker 0-0, policy_version 1086300 (0.00088) [2022-07-11 07:04:46,441][25689] Fps is (10 sec: 5494.6, 60 sec: 5560.9, 300 sec: 5618.8). Total num frames: 1112378368. Throughput: 0: 5004.8. Samples: 1112373702. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:04:46,441][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 07:04:46,775][26022] Updated weights on worker 0-0, policy_version 1086310 (0.00087) [2022-07-11 07:04:48,514][26022] Updated weights on worker 0-0, policy_version 1086320 (0.00083) [2022-07-11 07:04:50,285][26022] Updated weights on worker 0-0, policy_version 1086330 (0.00080) [2022-07-11 07:04:51,543][25689] Fps is (10 sec: 5655.8, 60 sec: 5572.9, 300 sec: 5620.5). Total num frames: 1112409088. Throughput: 0: 5845.1. Samples: 1112407334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:04:51,543][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 07:04:52,258][26022] Updated weights on worker 0-0, policy_version 1086340 (0.00091) [2022-07-11 07:04:53,925][26022] Updated weights on worker 0-0, policy_version 1086350 (0.00085) [2022-07-11 07:04:55,956][26022] Updated weights on worker 0-0, policy_version 1086360 (0.00090) [2022-07-11 07:04:56,610][25689] Fps is (10 sec: 5739.5, 60 sec: 5585.4, 300 sec: 5620.4). Total num frames: 1112436736. Throughput: 0: 5852.2. Samples: 1112441248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:04:56,611][25689] Avg episode reward: [(0, '-0.489')] [2022-07-11 07:04:57,539][26022] Updated weights on worker 0-0, policy_version 1086370 (0.00087) [2022-07-11 07:04:59,638][26022] Updated weights on worker 0-0, policy_version 1086380 (0.00098) [2022-07-11 07:05:01,615][26022] Updated weights on worker 0-0, policy_version 1086390 (0.00098) [2022-07-11 07:05:01,678][25689] Fps is (10 sec: 5354.7, 60 sec: 5563.3, 300 sec: 5619.1). Total num frames: 1112463360. Throughput: 0: 4998.8. Samples: 1112458238. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:01,679][25689] Avg episode reward: [(0, '-0.567')] [2022-07-11 07:05:03,630][26022] Updated weights on worker 0-0, policy_version 1086400 (0.00099) [2022-07-11 07:05:05,379][26022] Updated weights on worker 0-0, policy_version 1086410 (0.00416) [2022-07-11 07:05:06,702][25689] Fps is (10 sec: 5378.2, 60 sec: 5579.6, 300 sec: 5616.4). Total num frames: 1112491008. Throughput: 0: 5704.4. Samples: 1112489278. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:06,702][25689] Avg episode reward: [(0, '-0.917')] [2022-07-11 07:05:07,216][26022] Updated weights on worker 0-0, policy_version 1086420 (0.00086) [2022-07-11 07:05:08,974][26022] Updated weights on worker 0-0, policy_version 1086430 (0.00085) [2022-07-11 07:05:10,782][26022] Updated weights on worker 0-0, policy_version 1086440 (0.00089) [2022-07-11 07:05:11,808][25689] Fps is (10 sec: 5559.8, 60 sec: 5581.1, 300 sec: 5608.0). Total num frames: 1112519680. Throughput: 0: 5716.2. Samples: 1112523174. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:11,809][25689] Avg episode reward: [(0, '-0.621')] [2022-07-11 07:05:12,543][26022] Updated weights on worker 0-0, policy_version 1086450 (0.00096) [2022-07-11 07:05:14,435][26022] Updated weights on worker 0-0, policy_version 1086460 (0.00091) [2022-07-11 07:05:16,250][26022] Updated weights on worker 0-0, policy_version 1086470 (0.00092) [2022-07-11 07:05:16,822][25689] Fps is (10 sec: 5565.1, 60 sec: 5565.9, 300 sec: 5604.6). Total num frames: 1112547328. Throughput: 0: 5735.4. Samples: 1112557168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:16,822][25689] Avg episode reward: [(0, '0.248')] [2022-07-11 07:05:18,044][26022] Updated weights on worker 0-0, policy_version 1086480 (0.00087) [2022-07-11 07:05:19,938][26022] Updated weights on worker 0-0, policy_version 1086490 (0.00094) [2022-07-11 07:05:21,691][26022] Updated weights on worker 0-0, policy_version 1086500 (0.00093) [2022-07-11 07:05:21,898][25689] Fps is (10 sec: 5683.7, 60 sec: 5580.5, 300 sec: 5610.7). Total num frames: 1112577024. Throughput: 0: 5721.9. Samples: 1112573930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:21,898][25689] Avg episode reward: [(0, '1.151')] [2022-07-11 07:05:23,424][26022] Updated weights on worker 0-0, policy_version 1086510 (0.00090) [2022-07-11 07:05:25,286][26022] Updated weights on worker 0-0, policy_version 1086520 (0.00085) [2022-07-11 07:05:26,983][25689] Fps is (10 sec: 5643.3, 60 sec: 5596.7, 300 sec: 5607.0). Total num frames: 1112604672. Throughput: 0: 5862.6. Samples: 1112608182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:26,984][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 07:05:27,095][26022] Updated weights on worker 0-0, policy_version 1086530 (0.00092) [2022-07-11 07:05:29,132][26022] Updated weights on worker 0-0, policy_version 1086540 (0.00093) [2022-07-11 07:05:30,905][26022] Updated weights on worker 0-0, policy_version 1086550 (0.00086) [2022-07-11 07:05:32,031][25689] Fps is (10 sec: 5558.0, 60 sec: 5568.9, 300 sec: 5606.5). Total num frames: 1112633344. Throughput: 0: 5856.0. Samples: 1112641596. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:32,031][25689] Avg episode reward: [(0, '1.201')] [2022-07-11 07:05:32,634][26022] Updated weights on worker 0-0, policy_version 1086560 (0.00088) [2022-07-11 07:05:34,607][26022] Updated weights on worker 0-0, policy_version 1086570 (0.00085) [2022-07-11 07:05:36,340][26022] Updated weights on worker 0-0, policy_version 1086580 (0.00082) [2022-07-11 07:05:37,033][25689] Fps is (10 sec: 5705.9, 60 sec: 5588.0, 300 sec: 5603.1). Total num frames: 1112662016. Throughput: 0: 5012.8. Samples: 1112658484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:37,034][25689] Avg episode reward: [(0, '1.682')] [2022-07-11 07:05:38,143][26022] Updated weights on worker 0-0, policy_version 1086590 (0.00088) [2022-07-11 07:05:40,006][26022] Updated weights on worker 0-0, policy_version 1086600 (0.00086) [2022-07-11 07:05:41,626][26022] Updated weights on worker 0-0, policy_version 1086610 (0.00092) [2022-07-11 07:05:42,054][25689] Fps is (10 sec: 5619.2, 60 sec: 5571.7, 300 sec: 5610.1). Total num frames: 1112689664. Throughput: 0: 5873.1. Samples: 1112692308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:42,054][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 07:05:43,609][26022] Updated weights on worker 0-0, policy_version 1086620 (0.00081) [2022-07-11 07:05:44,045][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:05:44,064][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001086622_1112700928.pth [2022-07-11 07:05:44,065][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001084647_1110678528.pth [2022-07-11 07:05:45,459][26022] Updated weights on worker 0-0, policy_version 1086630 (0.00085) [2022-07-11 07:05:47,064][25689] Fps is (10 sec: 5512.5, 60 sec: 5591.0, 300 sec: 5604.4). Total num frames: 1112717312. Throughput: 0: 5867.8. Samples: 1112726014. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:47,067][25689] Avg episode reward: [(0, '0.676')] [2022-07-11 07:05:47,179][26022] Updated weights on worker 0-0, policy_version 1086640 (0.00085) [2022-07-11 07:05:49,210][26022] Updated weights on worker 0-0, policy_version 1086650 (0.00091) [2022-07-11 07:05:50,988][26022] Updated weights on worker 0-0, policy_version 1086660 (0.00089) [2022-07-11 07:05:52,153][25689] Fps is (10 sec: 5576.6, 60 sec: 5558.4, 300 sec: 5603.3). Total num frames: 1112745984. Throughput: 0: 5020.5. Samples: 1112742620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:52,155][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 07:05:52,817][26022] Updated weights on worker 0-0, policy_version 1086670 (0.00084) [2022-07-11 07:05:54,729][26022] Updated weights on worker 0-0, policy_version 1086680 (0.00086) [2022-07-11 07:05:56,226][26022] Updated weights on worker 0-0, policy_version 1086690 (0.00087) [2022-07-11 07:05:57,157][25689] Fps is (10 sec: 5682.0, 60 sec: 5581.2, 300 sec: 5603.8). Total num frames: 1112774656. Throughput: 0: 5857.8. Samples: 1112776362. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:05:57,157][25689] Avg episode reward: [(0, '-0.421')] [2022-07-11 07:05:58,369][26022] Updated weights on worker 0-0, policy_version 1086700 (0.00085) [2022-07-11 07:06:00,037][26022] Updated weights on worker 0-0, policy_version 1086710 (0.00095) [2022-07-11 07:06:02,171][25689] Fps is (10 sec: 5417.6, 60 sec: 5569.2, 300 sec: 5604.2). Total num frames: 1112800256. Throughput: 0: 5756.8. Samples: 1112808118. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:02,171][25689] Avg episode reward: [(0, '-1.470')] [2022-07-11 07:06:02,267][26022] Updated weights on worker 0-0, policy_version 1086720 (0.00094) [2022-07-11 07:06:04,217][26022] Updated weights on worker 0-0, policy_version 1086730 (0.00082) [2022-07-11 07:06:05,788][26022] Updated weights on worker 0-0, policy_version 1086740 (0.00088) [2022-07-11 07:06:07,215][25689] Fps is (10 sec: 5395.9, 60 sec: 5584.2, 300 sec: 5597.6). Total num frames: 1112828928. Throughput: 0: 4902.7. Samples: 1112824804. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:07,215][25689] Avg episode reward: [(0, '-1.542')] [2022-07-11 07:06:07,826][26022] Updated weights on worker 0-0, policy_version 1086750 (0.00087) [2022-07-11 07:06:09,600][26022] Updated weights on worker 0-0, policy_version 1086760 (0.00092) [2022-07-11 07:06:11,482][26022] Updated weights on worker 0-0, policy_version 1086770 (0.00085) [2022-07-11 07:06:12,357][25689] Fps is (10 sec: 5629.6, 60 sec: 5581.0, 300 sec: 5598.5). Total num frames: 1112857600. Throughput: 0: 5732.0. Samples: 1112858428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:12,357][25689] Avg episode reward: [(0, '-0.764')] [2022-07-11 07:06:13,214][26022] Updated weights on worker 0-0, policy_version 1086780 (0.00087) [2022-07-11 07:06:14,938][26022] Updated weights on worker 0-0, policy_version 1086790 (0.00083) [2022-07-11 07:06:16,892][26022] Updated weights on worker 0-0, policy_version 1086800 (0.00096) [2022-07-11 07:06:17,454][25689] Fps is (10 sec: 5500.6, 60 sec: 5573.3, 300 sec: 5593.7). Total num frames: 1112885248. Throughput: 0: 5720.7. Samples: 1112892474. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:17,454][25689] Avg episode reward: [(0, '-0.779')] [2022-07-11 07:06:18,698][26022] Updated weights on worker 0-0, policy_version 1086810 (0.00086) [2022-07-11 07:06:20,398][26022] Updated weights on worker 0-0, policy_version 1086820 (0.00089) [2022-07-11 07:06:22,377][26022] Updated weights on worker 0-0, policy_version 1086830 (0.00079) [2022-07-11 07:06:22,475][25689] Fps is (10 sec: 5566.2, 60 sec: 5561.4, 300 sec: 5597.2). Total num frames: 1112913920. Throughput: 0: 4997.7. Samples: 1112909592. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:22,475][25689] Avg episode reward: [(0, '0.757')] [2022-07-11 07:06:23,921][26022] Updated weights on worker 0-0, policy_version 1086840 (0.00103) [2022-07-11 07:06:25,978][26022] Updated weights on worker 0-0, policy_version 1086850 (0.00097) [2022-07-11 07:06:27,419][26022] Updated weights on worker 0-0, policy_version 1086860 (0.00077) [2022-07-11 07:06:27,508][25689] Fps is (10 sec: 5907.2, 60 sec: 5617.0, 300 sec: 5601.3). Total num frames: 1112944640. Throughput: 0: 5852.5. Samples: 1112943570. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:27,508][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 07:06:29,411][26022] Updated weights on worker 0-0, policy_version 1086870 (0.00081) [2022-07-11 07:06:31,453][26022] Updated weights on worker 0-0, policy_version 1086880 (0.00086) [2022-07-11 07:06:32,578][25689] Fps is (10 sec: 5675.7, 60 sec: 5581.1, 300 sec: 5593.2). Total num frames: 1112971264. Throughput: 0: 5877.6. Samples: 1112977282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:32,580][25689] Avg episode reward: [(0, '-0.578')] [2022-07-11 07:06:33,224][26022] Updated weights on worker 0-0, policy_version 1086890 (0.00611) [2022-07-11 07:06:34,911][26022] Updated weights on worker 0-0, policy_version 1086900 (0.00091) [2022-07-11 07:06:36,841][26022] Updated weights on worker 0-0, policy_version 1086910 (0.00103) [2022-07-11 07:06:37,587][25689] Fps is (10 sec: 5384.5, 60 sec: 5563.6, 300 sec: 5596.8). Total num frames: 1112998912. Throughput: 0: 5050.2. Samples: 1112994152. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:37,589][25689] Avg episode reward: [(0, '-0.632')] [2022-07-11 07:06:38,487][26022] Updated weights on worker 0-0, policy_version 1086920 (0.00057) [2022-07-11 07:06:40,647][26022] Updated weights on worker 0-0, policy_version 1086930 (0.00084) [2022-07-11 07:06:42,283][26022] Updated weights on worker 0-0, policy_version 1086940 (0.00084) [2022-07-11 07:06:42,598][25689] Fps is (10 sec: 5620.8, 60 sec: 5581.3, 300 sec: 5590.5). Total num frames: 1113027584. Throughput: 0: 5884.3. Samples: 1113028004. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:42,600][25689] Avg episode reward: [(0, '-0.857')] [2022-07-11 07:06:44,142][26022] Updated weights on worker 0-0, policy_version 1086950 (0.00094) [2022-07-11 07:06:45,967][26022] Updated weights on worker 0-0, policy_version 1086960 (0.00074) [2022-07-11 07:06:47,610][25689] Fps is (10 sec: 5721.3, 60 sec: 5598.2, 300 sec: 5592.4). Total num frames: 1113056256. Throughput: 0: 5872.0. Samples: 1113061610. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:47,613][25689] Avg episode reward: [(0, '-1.044')] [2022-07-11 07:06:47,888][26022] Updated weights on worker 0-0, policy_version 1086970 (0.00091) [2022-07-11 07:06:49,652][26022] Updated weights on worker 0-0, policy_version 1086980 (0.00089) [2022-07-11 07:06:51,586][26022] Updated weights on worker 0-0, policy_version 1086990 (0.00080) [2022-07-11 07:06:52,666][25689] Fps is (10 sec: 5594.3, 60 sec: 5584.3, 300 sec: 5592.3). Total num frames: 1113083904. Throughput: 0: 5037.2. Samples: 1113078466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:52,668][25689] Avg episode reward: [(0, '-0.732')] [2022-07-11 07:06:53,023][26022] Updated weights on worker 0-0, policy_version 1087000 (0.00081) [2022-07-11 07:06:55,216][26022] Updated weights on worker 0-0, policy_version 1087010 (0.00094) [2022-07-11 07:06:56,597][26022] Updated weights on worker 0-0, policy_version 1087020 (0.00069) [2022-07-11 07:06:57,678][25689] Fps is (10 sec: 5695.7, 60 sec: 5600.4, 300 sec: 5595.9). Total num frames: 1113113600. Throughput: 0: 5894.9. Samples: 1113112586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:06:57,680][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 07:06:58,763][26022] Updated weights on worker 0-0, policy_version 1087030 (0.00080) [2022-07-11 07:07:00,463][26022] Updated weights on worker 0-0, policy_version 1087040 (0.00092) [2022-07-11 07:07:02,689][25689] Fps is (10 sec: 5414.4, 60 sec: 5583.7, 300 sec: 5589.2). Total num frames: 1113138176. Throughput: 0: 5891.6. Samples: 1113146372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:02,690][25689] Avg episode reward: [(0, '-0.075')] [2022-07-11 07:07:02,814][26022] Updated weights on worker 0-0, policy_version 1087050 (0.00057) [2022-07-11 07:07:04,403][26022] Updated weights on worker 0-0, policy_version 1087060 (0.00083) [2022-07-11 07:07:06,253][26022] Updated weights on worker 0-0, policy_version 1087070 (0.00084) [2022-07-11 07:07:07,693][25689] Fps is (10 sec: 5316.6, 60 sec: 5587.4, 300 sec: 5593.8). Total num frames: 1113166848. Throughput: 0: 4958.2. Samples: 1113161188. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:07,694][25689] Avg episode reward: [(0, '-0.699')] [2022-07-11 07:07:08,250][26022] Updated weights on worker 0-0, policy_version 1087080 (0.00091) [2022-07-11 07:07:10,008][26022] Updated weights on worker 0-0, policy_version 1087090 (0.00087) [2022-07-11 07:07:11,693][26022] Updated weights on worker 0-0, policy_version 1087100 (0.00085) [2022-07-11 07:07:12,828][25689] Fps is (10 sec: 5656.1, 60 sec: 5588.1, 300 sec: 5588.4). Total num frames: 1113195520. Throughput: 0: 5777.4. Samples: 1113194950. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:12,828][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 07:07:13,607][26022] Updated weights on worker 0-0, policy_version 1087110 (0.00083) [2022-07-11 07:07:15,517][26022] Updated weights on worker 0-0, policy_version 1087120 (0.00082) [2022-07-11 07:07:17,214][26022] Updated weights on worker 0-0, policy_version 1087130 (0.00089) [2022-07-11 07:07:17,923][25689] Fps is (10 sec: 5605.9, 60 sec: 5605.2, 300 sec: 5580.1). Total num frames: 1113224192. Throughput: 0: 5710.2. Samples: 1113228186. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:17,923][25689] Avg episode reward: [(0, '0.436')] [2022-07-11 07:07:19,149][26022] Updated weights on worker 0-0, policy_version 1087140 (0.00083) [2022-07-11 07:07:20,930][26022] Updated weights on worker 0-0, policy_version 1087150 (0.00090) [2022-07-11 07:07:22,908][26022] Updated weights on worker 0-0, policy_version 1087160 (0.00081) [2022-07-11 07:07:22,961][25689] Fps is (10 sec: 5558.0, 60 sec: 5586.8, 300 sec: 5586.3). Total num frames: 1113251840. Throughput: 0: 5707.4. Samples: 1113262070. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:22,961][25689] Avg episode reward: [(0, '-0.338')] [2022-07-11 07:07:24,520][26022] Updated weights on worker 0-0, policy_version 1087170 (0.00103) [2022-07-11 07:07:26,538][26022] Updated weights on worker 0-0, policy_version 1087180 (0.00085) [2022-07-11 07:07:27,993][25689] Fps is (10 sec: 5592.9, 60 sec: 5553.0, 300 sec: 5583.4). Total num frames: 1113280512. Throughput: 0: 5793.9. Samples: 1113278800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:27,993][25689] Avg episode reward: [(0, '-0.537')] [2022-07-11 07:07:28,231][26022] Updated weights on worker 0-0, policy_version 1087190 (0.00083) [2022-07-11 07:07:30,042][26022] Updated weights on worker 0-0, policy_version 1087200 (0.00093) [2022-07-11 07:07:32,103][26022] Updated weights on worker 0-0, policy_version 1087210 (0.00086) [2022-07-11 07:07:33,058][25689] Fps is (10 sec: 5578.1, 60 sec: 5570.4, 300 sec: 5579.2). Total num frames: 1113308160. Throughput: 0: 5795.5. Samples: 1113312192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:33,058][25689] Avg episode reward: [(0, '-0.301')] [2022-07-11 07:07:33,911][26022] Updated weights on worker 0-0, policy_version 1087220 (0.00087) [2022-07-11 07:07:35,594][26022] Updated weights on worker 0-0, policy_version 1087230 (0.00091) [2022-07-11 07:07:37,410][26022] Updated weights on worker 0-0, policy_version 1087240 (0.00090) [2022-07-11 07:07:38,062][25689] Fps is (10 sec: 5593.0, 60 sec: 5587.7, 300 sec: 5582.9). Total num frames: 1113336832. Throughput: 0: 5855.0. Samples: 1113346104. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:38,063][25689] Avg episode reward: [(0, '-0.287')] [2022-07-11 07:07:39,236][26022] Updated weights on worker 0-0, policy_version 1087250 (0.00089) [2022-07-11 07:07:41,042][26022] Updated weights on worker 0-0, policy_version 1087260 (0.00089) [2022-07-11 07:07:43,106][25689] Fps is (10 sec: 5502.7, 60 sec: 5550.9, 300 sec: 5571.9). Total num frames: 1113363456. Throughput: 0: 5013.9. Samples: 1113363078. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:43,108][25689] Avg episode reward: [(0, '-0.851')] [2022-07-11 07:07:43,176][26022] Updated weights on worker 0-0, policy_version 1087270 (0.00046) [2022-07-11 07:07:44,099][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:07:44,113][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001087276_1113370624.pth [2022-07-11 07:07:44,114][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001085309_1111356416.pth [2022-07-11 07:07:44,544][26022] Updated weights on worker 0-0, policy_version 1087280 (0.00089) [2022-07-11 07:07:46,719][26022] Updated weights on worker 0-0, policy_version 1087290 (0.00082) [2022-07-11 07:07:48,196][25689] Fps is (10 sec: 5759.9, 60 sec: 5594.5, 300 sec: 5589.0). Total num frames: 1113395200. Throughput: 0: 5833.9. Samples: 1113396662. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:48,198][25689] Avg episode reward: [(0, '-0.708')] [2022-07-11 07:07:48,200][26022] Updated weights on worker 0-0, policy_version 1087300 (0.00091) [2022-07-11 07:07:50,156][26022] Updated weights on worker 0-0, policy_version 1087310 (0.00081) [2022-07-11 07:07:51,869][26022] Updated weights on worker 0-0, policy_version 1087320 (0.00082) [2022-07-11 07:07:53,263][25689] Fps is (10 sec: 5847.9, 60 sec: 5593.4, 300 sec: 5581.1). Total num frames: 1113422848. Throughput: 0: 5860.7. Samples: 1113430608. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:53,263][25689] Avg episode reward: [(0, '-0.734')] [2022-07-11 07:07:53,916][26022] Updated weights on worker 0-0, policy_version 1087330 (0.00089) [2022-07-11 07:07:55,648][26022] Updated weights on worker 0-0, policy_version 1087340 (0.00088) [2022-07-11 07:07:57,610][26022] Updated weights on worker 0-0, policy_version 1087350 (0.00086) [2022-07-11 07:07:58,275][25689] Fps is (10 sec: 5486.1, 60 sec: 5559.6, 300 sec: 5581.0). Total num frames: 1113450496. Throughput: 0: 5019.3. Samples: 1113447558. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:07:58,276][25689] Avg episode reward: [(0, '-0.303')] [2022-07-11 07:07:59,207][26022] Updated weights on worker 0-0, policy_version 1087360 (0.00101) [2022-07-11 07:08:01,159][26022] Updated weights on worker 0-0, policy_version 1087370 (0.00093) [2022-07-11 07:08:03,305][25689] Fps is (10 sec: 5302.2, 60 sec: 5574.8, 300 sec: 5580.9). Total num frames: 1113476096. Throughput: 0: 5804.1. Samples: 1113480314. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:08:03,308][25689] Avg episode reward: [(0, '-0.127')] [2022-07-11 07:08:03,600][26022] Updated weights on worker 0-0, policy_version 1087380 (0.00081) [2022-07-11 07:08:05,119][26022] Updated weights on worker 0-0, policy_version 1087390 (0.00082) [2022-07-11 07:08:07,138][26022] Updated weights on worker 0-0, policy_version 1087400 (0.00086) [2022-07-11 07:08:08,323][25689] Fps is (10 sec: 5503.3, 60 sec: 5590.4, 300 sec: 5582.2). Total num frames: 1113505792. Throughput: 0: 5779.8. Samples: 1113512992. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:08:08,323][25689] Avg episode reward: [(0, '-0.287')] [2022-07-11 07:08:08,712][26022] Updated weights on worker 0-0, policy_version 1087410 (0.00080) [2022-07-11 07:08:10,711][26022] Updated weights on worker 0-0, policy_version 1087420 (0.00093) [2022-07-11 07:08:12,313][26022] Updated weights on worker 0-0, policy_version 1087430 (0.00091) [2022-07-11 07:08:13,363][25689] Fps is (10 sec: 5599.7, 60 sec: 5565.3, 300 sec: 5574.6). Total num frames: 1113532416. Throughput: 0: 4935.0. Samples: 1113529802. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:08:13,364][25689] Avg episode reward: [(0, '0.200')] [2022-07-11 07:08:14,288][26022] Updated weights on worker 0-0, policy_version 1087440 (0.00093) [2022-07-11 07:08:15,996][26022] Updated weights on worker 0-0, policy_version 1087450 (0.00090) [2022-07-11 07:08:17,851][26022] Updated weights on worker 0-0, policy_version 1087460 (0.00083) [2022-07-11 07:08:18,374][25689] Fps is (10 sec: 5501.5, 60 sec: 5573.0, 300 sec: 5581.4). Total num frames: 1113561088. Throughput: 0: 5770.6. Samples: 1113563540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:08:18,374][25689] Avg episode reward: [(0, '0.071')] [2022-07-11 07:08:19,804][26022] Updated weights on worker 0-0, policy_version 1087470 (0.00092) [2022-07-11 07:08:21,571][26022] Updated weights on worker 0-0, policy_version 1087480 (0.00080) [2022-07-11 07:08:23,384][25689] Fps is (10 sec: 5620.2, 60 sec: 5575.6, 300 sec: 5574.6). Total num frames: 1113588736. Throughput: 0: 5823.1. Samples: 1113597234. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:08:23,385][25689] Avg episode reward: [(0, '-0.246')] [2022-07-11 07:08:23,439][26022] Updated weights on worker 0-0, policy_version 1087490 (0.00081) [2022-07-11 07:08:25,319][26022] Updated weights on worker 0-0, policy_version 1087500 (0.00094) [2022-07-11 07:08:27,118][26022] Updated weights on worker 0-0, policy_version 1087510 (0.00082) [2022-07-11 07:08:28,422][25689] Fps is (10 sec: 5502.9, 60 sec: 5558.1, 300 sec: 5576.2). Total num frames: 1113616384. Throughput: 0: 5011.5. Samples: 1113613720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 07:08:28,424][25689] Avg episode reward: [(0, '-0.761')] [2022-07-11 07:08:28,793][26022] Updated weights on worker 0-0, policy_version 1087520 (0.00084) [2022-07-11 07:08:30,733][26022] Updated weights on worker 0-0, policy_version 1087530 (0.00090) [2022-07-11 07:08:32,552][26022] Updated weights on worker 0-0, policy_version 1087540 (0.00082) [2022-07-11 07:08:33,513][25689] Fps is (10 sec: 5661.0, 60 sec: 5589.5, 300 sec: 5579.2). Total num frames: 1113646080. Throughput: 0: 5835.5. Samples: 1113647390. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:08:33,514][25689] Avg episode reward: [(0, '-0.836')] [2022-07-11 07:08:34,475][26022] Updated weights on worker 0-0, policy_version 1087550 (0.00094) [2022-07-11 07:08:36,324][26022] Updated weights on worker 0-0, policy_version 1087560 (0.00085) [2022-07-11 07:08:38,103][26022] Updated weights on worker 0-0, policy_version 1087570 (0.00082) [2022-07-11 07:08:38,534][25689] Fps is (10 sec: 5671.3, 60 sec: 5571.2, 300 sec: 5579.1). Total num frames: 1113673728. Throughput: 0: 5825.4. Samples: 1113680980. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:08:38,534][25689] Avg episode reward: [(0, '-0.726')] [2022-07-11 07:08:39,853][26022] Updated weights on worker 0-0, policy_version 1087580 (0.00082) [2022-07-11 07:08:41,736][26022] Updated weights on worker 0-0, policy_version 1087590 (0.00092) [2022-07-11 07:08:43,535][25689] Fps is (10 sec: 5517.7, 60 sec: 5592.0, 300 sec: 5576.2). Total num frames: 1113701376. Throughput: 0: 4997.5. Samples: 1113697940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:08:43,536][25689] Avg episode reward: [(0, '-0.520')] [2022-07-11 07:08:43,548][26022] Updated weights on worker 0-0, policy_version 1087600 (0.00086) [2022-07-11 07:08:45,362][26022] Updated weights on worker 0-0, policy_version 1087610 (0.00084) [2022-07-11 07:08:47,185][26022] Updated weights on worker 0-0, policy_version 1087620 (0.00086) [2022-07-11 07:08:48,573][25689] Fps is (10 sec: 5609.8, 60 sec: 5545.9, 300 sec: 5572.9). Total num frames: 1113730048. Throughput: 0: 5860.9. Samples: 1113731822. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:08:48,574][25689] Avg episode reward: [(0, '-0.265')] [2022-07-11 07:08:48,929][26022] Updated weights on worker 0-0, policy_version 1087630 (0.00085) [2022-07-11 07:08:50,815][26022] Updated weights on worker 0-0, policy_version 1087640 (0.00423) [2022-07-11 07:08:52,642][26022] Updated weights on worker 0-0, policy_version 1087650 (0.00064) [2022-07-11 07:08:53,634][25689] Fps is (10 sec: 5576.8, 60 sec: 5546.5, 300 sec: 5575.6). Total num frames: 1113757696. Throughput: 0: 5878.6. Samples: 1113765672. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:08:53,635][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 07:08:54,380][26022] Updated weights on worker 0-0, policy_version 1087660 (0.00083) [2022-07-11 07:08:56,286][26022] Updated weights on worker 0-0, policy_version 1087670 (0.00085) [2022-07-11 07:08:57,920][26022] Updated weights on worker 0-0, policy_version 1087680 (0.00086) [2022-07-11 07:08:58,650][25689] Fps is (10 sec: 5690.9, 60 sec: 5580.1, 300 sec: 5582.4). Total num frames: 1113787392. Throughput: 0: 5065.3. Samples: 1113782874. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:08:58,650][25689] Avg episode reward: [(0, '0.559')] [2022-07-11 07:08:59,854][26022] Updated weights on worker 0-0, policy_version 1087690 (0.00091) [2022-07-11 07:09:01,924][26022] Updated weights on worker 0-0, policy_version 1087700 (0.00098) [2022-07-11 07:09:03,671][25689] Fps is (10 sec: 5611.4, 60 sec: 5597.9, 300 sec: 5582.3). Total num frames: 1113814016. Throughput: 0: 5816.4. Samples: 1113815056. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:03,671][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 07:09:03,731][26022] Updated weights on worker 0-0, policy_version 1087710 (0.00086) [2022-07-11 07:09:05,887][26022] Updated weights on worker 0-0, policy_version 1087720 (0.00094) [2022-07-11 07:09:07,358][26022] Updated weights on worker 0-0, policy_version 1087730 (0.00086) [2022-07-11 07:09:08,690][25689] Fps is (10 sec: 5303.5, 60 sec: 5546.9, 300 sec: 5577.4). Total num frames: 1113840640. Throughput: 0: 5795.9. Samples: 1113848416. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:08,690][25689] Avg episode reward: [(0, '0.707')] [2022-07-11 07:09:09,328][26022] Updated weights on worker 0-0, policy_version 1087740 (0.00103) [2022-07-11 07:09:11,203][26022] Updated weights on worker 0-0, policy_version 1087750 (0.00086) [2022-07-11 07:09:12,890][26022] Updated weights on worker 0-0, policy_version 1087760 (0.00097) [2022-07-11 07:09:13,813][25689] Fps is (10 sec: 5654.2, 60 sec: 5607.0, 300 sec: 5582.6). Total num frames: 1113871360. Throughput: 0: 4942.2. Samples: 1113865398. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:13,813][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 07:09:14,903][26022] Updated weights on worker 0-0, policy_version 1087770 (0.00088) [2022-07-11 07:09:16,443][26022] Updated weights on worker 0-0, policy_version 1087780 (0.00095) [2022-07-11 07:09:18,495][26022] Updated weights on worker 0-0, policy_version 1087790 (0.00088) [2022-07-11 07:09:18,860][25689] Fps is (10 sec: 5739.4, 60 sec: 5586.8, 300 sec: 5579.2). Total num frames: 1113899008. Throughput: 0: 5759.5. Samples: 1113899272. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:18,860][25689] Avg episode reward: [(0, '0.109')] [2022-07-11 07:09:20,247][26022] Updated weights on worker 0-0, policy_version 1087800 (0.00086) [2022-07-11 07:09:22,189][26022] Updated weights on worker 0-0, policy_version 1087810 (0.00087) [2022-07-11 07:09:23,733][26022] Updated weights on worker 0-0, policy_version 1087820 (0.00084) [2022-07-11 07:09:23,929][25689] Fps is (10 sec: 5668.2, 60 sec: 5615.1, 300 sec: 5589.7). Total num frames: 1113928704. Throughput: 0: 5825.9. Samples: 1113933080. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:23,930][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 07:09:25,689][26022] Updated weights on worker 0-0, policy_version 1087830 (0.00082) [2022-07-11 07:09:27,260][26022] Updated weights on worker 0-0, policy_version 1087840 (0.00084) [2022-07-11 07:09:28,967][25689] Fps is (10 sec: 5572.1, 60 sec: 5598.2, 300 sec: 5577.3). Total num frames: 1113955328. Throughput: 0: 5011.8. Samples: 1113950042. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:28,968][25689] Avg episode reward: [(0, '-0.507')] [2022-07-11 07:09:29,312][26022] Updated weights on worker 0-0, policy_version 1087850 (0.00591) [2022-07-11 07:09:31,080][26022] Updated weights on worker 0-0, policy_version 1087860 (0.00089) [2022-07-11 07:09:32,802][26022] Updated weights on worker 0-0, policy_version 1087870 (0.00084) [2022-07-11 07:09:34,066][25689] Fps is (10 sec: 5454.9, 60 sec: 5580.6, 300 sec: 5579.4). Total num frames: 1113984000. Throughput: 0: 5864.2. Samples: 1113984170. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:34,067][25689] Avg episode reward: [(0, '-0.403')] [2022-07-11 07:09:34,933][26022] Updated weights on worker 0-0, policy_version 1087880 (0.00091) [2022-07-11 07:09:36,552][26022] Updated weights on worker 0-0, policy_version 1087890 (0.00095) [2022-07-11 07:09:38,299][26022] Updated weights on worker 0-0, policy_version 1087900 (0.00610) [2022-07-11 07:09:39,115][25689] Fps is (10 sec: 5751.7, 60 sec: 5611.7, 300 sec: 5582.4). Total num frames: 1114013696. Throughput: 0: 5868.2. Samples: 1114018136. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:39,117][25689] Avg episode reward: [(0, '-1.148')] [2022-07-11 07:09:40,106][26022] Updated weights on worker 0-0, policy_version 1087910 (0.00086) [2022-07-11 07:09:41,905][26022] Updated weights on worker 0-0, policy_version 1087920 (0.00093) [2022-07-11 07:09:43,709][26022] Updated weights on worker 0-0, policy_version 1087930 (0.00079) [2022-07-11 07:09:44,129][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:09:44,142][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001087932_1114042368.pth [2022-07-11 07:09:44,142][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001085968_1112031232.pth [2022-07-11 07:09:44,143][25689] Fps is (10 sec: 5792.3, 60 sec: 5626.2, 300 sec: 5589.5). Total num frames: 1114042368. Throughput: 0: 5049.2. Samples: 1114035144. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:44,144][25689] Avg episode reward: [(0, '-1.000')] [2022-07-11 07:09:45,493][26022] Updated weights on worker 0-0, policy_version 1087940 (0.00092) [2022-07-11 07:09:47,337][26022] Updated weights on worker 0-0, policy_version 1087950 (0.00088) [2022-07-11 07:09:49,189][25689] Fps is (10 sec: 5692.3, 60 sec: 5625.5, 300 sec: 5583.7). Total num frames: 1114071040. Throughput: 0: 5901.6. Samples: 1114069384. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:49,190][25689] Avg episode reward: [(0, '-1.365')] [2022-07-11 07:09:49,192][26022] Updated weights on worker 0-0, policy_version 1087960 (0.00088) [2022-07-11 07:09:50,896][26022] Updated weights on worker 0-0, policy_version 1087970 (0.00084) [2022-07-11 07:09:52,726][26022] Updated weights on worker 0-0, policy_version 1087980 (0.00085) [2022-07-11 07:09:54,303][25689] Fps is (10 sec: 5745.1, 60 sec: 5654.3, 300 sec: 5589.7). Total num frames: 1114100736. Throughput: 0: 5900.0. Samples: 1114103564. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:54,303][25689] Avg episode reward: [(0, '0.312')] [2022-07-11 07:09:54,541][26022] Updated weights on worker 0-0, policy_version 1087990 (0.00091) [2022-07-11 07:09:56,456][26022] Updated weights on worker 0-0, policy_version 1088000 (0.00089) [2022-07-11 07:09:58,074][26022] Updated weights on worker 0-0, policy_version 1088010 (0.00079) [2022-07-11 07:09:59,324][25689] Fps is (10 sec: 5556.8, 60 sec: 5603.1, 300 sec: 5590.5). Total num frames: 1114127360. Throughput: 0: 5912.2. Samples: 1114137618. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:09:59,325][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 07:10:00,027][26022] Updated weights on worker 0-0, policy_version 1088020 (0.00090) [2022-07-11 07:10:02,236][26022] Updated weights on worker 0-0, policy_version 1088030 (0.00086) [2022-07-11 07:10:03,868][26022] Updated weights on worker 0-0, policy_version 1088040 (0.00112) [2022-07-11 07:10:04,405][25689] Fps is (10 sec: 5575.1, 60 sec: 5648.2, 300 sec: 5596.3). Total num frames: 1114157056. Throughput: 0: 5807.6. Samples: 1114152816. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:04,405][25689] Avg episode reward: [(0, '-0.333')] [2022-07-11 07:10:05,840][26022] Updated weights on worker 0-0, policy_version 1088050 (0.00085) [2022-07-11 07:10:07,619][26022] Updated weights on worker 0-0, policy_version 1088060 (0.00086) [2022-07-11 07:10:09,224][26022] Updated weights on worker 0-0, policy_version 1088070 (0.00091) [2022-07-11 07:10:09,413][25689] Fps is (10 sec: 5582.4, 60 sec: 5649.2, 300 sec: 5591.3). Total num frames: 1114183680. Throughput: 0: 5820.0. Samples: 1114187088. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:09,414][25689] Avg episode reward: [(0, '0.312')] [2022-07-11 07:10:11,143][26022] Updated weights on worker 0-0, policy_version 1088080 (0.00084) [2022-07-11 07:10:12,699][26022] Updated weights on worker 0-0, policy_version 1088090 (0.00081) [2022-07-11 07:10:14,475][25689] Fps is (10 sec: 5491.2, 60 sec: 5621.2, 300 sec: 5593.8). Total num frames: 1114212352. Throughput: 0: 5852.4. Samples: 1114221618. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:14,475][25689] Avg episode reward: [(0, '1.041')] [2022-07-11 07:10:14,751][26022] Updated weights on worker 0-0, policy_version 1088100 (0.00084) [2022-07-11 07:10:16,371][26022] Updated weights on worker 0-0, policy_version 1088110 (0.00090) [2022-07-11 07:10:18,320][26022] Updated weights on worker 0-0, policy_version 1088120 (0.00087) [2022-07-11 07:10:19,487][25689] Fps is (10 sec: 5793.8, 60 sec: 5658.1, 300 sec: 5595.0). Total num frames: 1114242048. Throughput: 0: 5009.5. Samples: 1114238624. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:19,488][25689] Avg episode reward: [(0, '0.438')] [2022-07-11 07:10:20,032][26022] Updated weights on worker 0-0, policy_version 1088130 (0.00081) [2022-07-11 07:10:21,816][26022] Updated weights on worker 0-0, policy_version 1088140 (0.00080) [2022-07-11 07:10:23,490][26022] Updated weights on worker 0-0, policy_version 1088150 (0.00083) [2022-07-11 07:10:24,555][25689] Fps is (10 sec: 5689.0, 60 sec: 5624.6, 300 sec: 5595.4). Total num frames: 1114269696. Throughput: 0: 5956.7. Samples: 1114272842. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:24,555][25689] Avg episode reward: [(0, '-0.676')] [2022-07-11 07:10:25,393][26022] Updated weights on worker 0-0, policy_version 1088160 (0.00081) [2022-07-11 07:10:27,331][26022] Updated weights on worker 0-0, policy_version 1088170 (0.00096) [2022-07-11 07:10:28,919][26022] Updated weights on worker 0-0, policy_version 1088180 (0.00093) [2022-07-11 07:10:29,580][25689] Fps is (10 sec: 5681.7, 60 sec: 5676.4, 300 sec: 5599.2). Total num frames: 1114299392. Throughput: 0: 5953.8. Samples: 1114307158. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:29,581][25689] Avg episode reward: [(0, '0.255')] [2022-07-11 07:10:30,852][26022] Updated weights on worker 0-0, policy_version 1088190 (0.00078) [2022-07-11 07:10:32,492][26022] Updated weights on worker 0-0, policy_version 1088200 (0.00100) [2022-07-11 07:10:34,453][26022] Updated weights on worker 0-0, policy_version 1088210 (0.00085) [2022-07-11 07:10:34,648][25689] Fps is (10 sec: 5884.6, 60 sec: 5696.3, 300 sec: 5601.5). Total num frames: 1114329088. Throughput: 0: 5090.1. Samples: 1114324298. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:34,648][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 07:10:36,400][26022] Updated weights on worker 0-0, policy_version 1088220 (0.00084) [2022-07-11 07:10:38,155][26022] Updated weights on worker 0-0, policy_version 1088230 (0.00102) [2022-07-11 07:10:39,653][25689] Fps is (10 sec: 5591.5, 60 sec: 5649.7, 300 sec: 5598.3). Total num frames: 1114355712. Throughput: 0: 5920.9. Samples: 1114358020. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:39,653][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 07:10:39,910][26022] Updated weights on worker 0-0, policy_version 1088240 (0.00094) [2022-07-11 07:10:41,707][26022] Updated weights on worker 0-0, policy_version 1088250 (0.00087) [2022-07-11 07:10:43,291][26022] Updated weights on worker 0-0, policy_version 1088260 (0.00088) [2022-07-11 07:10:44,723][25689] Fps is (10 sec: 5590.1, 60 sec: 5662.7, 300 sec: 5604.1). Total num frames: 1114385408. Throughput: 0: 5915.6. Samples: 1114392146. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:44,723][25689] Avg episode reward: [(0, '-0.678')] [2022-07-11 07:10:45,515][26022] Updated weights on worker 0-0, policy_version 1088270 (0.00103) [2022-07-11 07:10:47,033][26022] Updated weights on worker 0-0, policy_version 1088280 (0.00084) [2022-07-11 07:10:49,162][26022] Updated weights on worker 0-0, policy_version 1088290 (0.00086) [2022-07-11 07:10:49,755][25689] Fps is (10 sec: 5575.0, 60 sec: 5630.1, 300 sec: 5598.2). Total num frames: 1114412032. Throughput: 0: 5047.9. Samples: 1114408998. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:49,756][25689] Avg episode reward: [(0, '-0.441')] [2022-07-11 07:10:50,874][26022] Updated weights on worker 0-0, policy_version 1088300 (0.01249) [2022-07-11 07:10:52,815][26022] Updated weights on worker 0-0, policy_version 1088310 (0.00095) [2022-07-11 07:10:54,375][26022] Updated weights on worker 0-0, policy_version 1088320 (0.00086) [2022-07-11 07:10:54,799][25689] Fps is (10 sec: 5589.4, 60 sec: 5636.6, 300 sec: 5600.9). Total num frames: 1114441728. Throughput: 0: 5889.0. Samples: 1114442968. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:54,800][25689] Avg episode reward: [(0, '1.195')] [2022-07-11 07:10:56,295][26022] Updated weights on worker 0-0, policy_version 1088330 (0.00086) [2022-07-11 07:10:58,088][26022] Updated weights on worker 0-0, policy_version 1088340 (0.00087) [2022-07-11 07:10:59,809][25689] Fps is (10 sec: 5703.3, 60 sec: 5654.6, 300 sec: 5607.9). Total num frames: 1114469376. Throughput: 0: 5907.4. Samples: 1114477094. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:10:59,810][25689] Avg episode reward: [(0, '1.127')] [2022-07-11 07:10:59,855][26022] Updated weights on worker 0-0, policy_version 1088350 (0.00097) [2022-07-11 07:11:01,845][26022] Updated weights on worker 0-0, policy_version 1088360 (0.00092) [2022-07-11 07:11:03,844][26022] Updated weights on worker 0-0, policy_version 1088370 (0.00089) [2022-07-11 07:11:04,907][25689] Fps is (10 sec: 5470.5, 60 sec: 5619.1, 300 sec: 5603.4). Total num frames: 1114497024. Throughput: 0: 4959.0. Samples: 1114492240. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:04,907][25689] Avg episode reward: [(0, '1.118')] [2022-07-11 07:11:05,739][26022] Updated weights on worker 0-0, policy_version 1088380 (0.00083) [2022-07-11 07:11:07,457][26022] Updated weights on worker 0-0, policy_version 1088390 (0.00082) [2022-07-11 07:11:09,367][26022] Updated weights on worker 0-0, policy_version 1088400 (0.00085) [2022-07-11 07:11:09,978][25689] Fps is (10 sec: 5438.0, 60 sec: 5630.2, 300 sec: 5601.3). Total num frames: 1114524672. Throughput: 0: 5779.6. Samples: 1114525878. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:09,978][25689] Avg episode reward: [(0, '2.065')] [2022-07-11 07:11:10,966][26022] Updated weights on worker 0-0, policy_version 1088410 (0.00098) [2022-07-11 07:11:12,963][26022] Updated weights on worker 0-0, policy_version 1088420 (0.00090) [2022-07-11 07:11:14,751][26022] Updated weights on worker 0-0, policy_version 1088430 (0.00095) [2022-07-11 07:11:15,038][25689] Fps is (10 sec: 5559.1, 60 sec: 5630.4, 300 sec: 5605.5). Total num frames: 1114553344. Throughput: 0: 5794.8. Samples: 1114560250. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:15,038][25689] Avg episode reward: [(0, '1.931')] [2022-07-11 07:11:16,267][26022] Updated weights on worker 0-0, policy_version 1088440 (0.00083) [2022-07-11 07:11:18,353][26022] Updated weights on worker 0-0, policy_version 1088450 (0.00085) [2022-07-11 07:11:20,015][26022] Updated weights on worker 0-0, policy_version 1088460 (0.01357) [2022-07-11 07:11:20,049][25689] Fps is (10 sec: 5795.7, 60 sec: 5630.5, 300 sec: 5609.1). Total num frames: 1114583040. Throughput: 0: 4951.6. Samples: 1114577312. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:20,049][25689] Avg episode reward: [(0, '2.018')] [2022-07-11 07:11:21,836][26022] Updated weights on worker 0-0, policy_version 1088470 (0.00081) [2022-07-11 07:11:23,875][26022] Updated weights on worker 0-0, policy_version 1088480 (0.00086) [2022-07-11 07:11:25,060][25689] Fps is (10 sec: 5823.7, 60 sec: 5652.6, 300 sec: 5602.6). Total num frames: 1114611712. Throughput: 0: 5932.1. Samples: 1114611794. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:25,062][25689] Avg episode reward: [(0, '1.761')] [2022-07-11 07:11:25,294][26022] Updated weights on worker 0-0, policy_version 1088490 (0.00090) [2022-07-11 07:11:27,363][26022] Updated weights on worker 0-0, policy_version 1088500 (0.00088) [2022-07-11 07:11:28,995][26022] Updated weights on worker 0-0, policy_version 1088510 (0.00089) [2022-07-11 07:11:30,131][25689] Fps is (10 sec: 5687.5, 60 sec: 5631.5, 300 sec: 5609.5). Total num frames: 1114640384. Throughput: 0: 5953.1. Samples: 1114645854. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:30,133][25689] Avg episode reward: [(0, '1.680')] [2022-07-11 07:11:30,875][26022] Updated weights on worker 0-0, policy_version 1088520 (0.00096) [2022-07-11 07:11:32,674][26022] Updated weights on worker 0-0, policy_version 1088530 (0.00085) [2022-07-11 07:11:34,343][26022] Updated weights on worker 0-0, policy_version 1088540 (0.00089) [2022-07-11 07:11:35,175][25689] Fps is (10 sec: 5568.4, 60 sec: 5599.9, 300 sec: 5608.8). Total num frames: 1114668032. Throughput: 0: 5092.5. Samples: 1114662796. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:35,175][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 07:11:36,143][26022] Updated weights on worker 0-0, policy_version 1088550 (0.00082) [2022-07-11 07:11:37,816][26022] Updated weights on worker 0-0, policy_version 1088560 (0.00095) [2022-07-11 07:11:39,742][26022] Updated weights on worker 0-0, policy_version 1088570 (0.00081) [2022-07-11 07:11:40,223][25689] Fps is (10 sec: 5682.0, 60 sec: 5646.6, 300 sec: 5611.6). Total num frames: 1114697728. Throughput: 0: 5948.6. Samples: 1114697322. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:40,224][25689] Avg episode reward: [(0, '0.634')] [2022-07-11 07:11:41,523][26022] Updated weights on worker 0-0, policy_version 1088580 (0.00085) [2022-07-11 07:11:43,398][26022] Updated weights on worker 0-0, policy_version 1088590 (0.00085) [2022-07-11 07:11:44,280][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:11:44,297][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001088595_1114721280.pth [2022-07-11 07:11:44,297][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001086622_1112700928.pth [2022-07-11 07:11:45,150][26022] Updated weights on worker 0-0, policy_version 1088600 (0.00091) [2022-07-11 07:11:45,247][25689] Fps is (10 sec: 5794.9, 60 sec: 5634.0, 300 sec: 5611.3). Total num frames: 1114726400. Throughput: 0: 5917.8. Samples: 1114731254. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:45,249][25689] Avg episode reward: [(0, '0.133')] [2022-07-11 07:11:46,965][26022] Updated weights on worker 0-0, policy_version 1088610 (0.00093) [2022-07-11 07:11:48,781][26022] Updated weights on worker 0-0, policy_version 1088620 (0.00086) [2022-07-11 07:11:50,251][25689] Fps is (10 sec: 5514.5, 60 sec: 5636.6, 300 sec: 5608.9). Total num frames: 1114753024. Throughput: 0: 5087.6. Samples: 1114748214. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:50,252][25689] Avg episode reward: [(0, '-0.202')] [2022-07-11 07:11:50,717][26022] Updated weights on worker 0-0, policy_version 1088630 (0.00088) [2022-07-11 07:11:52,563][26022] Updated weights on worker 0-0, policy_version 1088640 (0.00085) [2022-07-11 07:11:54,299][26022] Updated weights on worker 0-0, policy_version 1088650 (0.00085) [2022-07-11 07:11:55,299][25689] Fps is (10 sec: 5501.2, 60 sec: 5619.3, 300 sec: 5604.8). Total num frames: 1114781696. Throughput: 0: 5914.8. Samples: 1114781826. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:11:55,299][25689] Avg episode reward: [(0, '-0.288')] [2022-07-11 07:11:56,205][26022] Updated weights on worker 0-0, policy_version 1088660 (0.00081) [2022-07-11 07:11:57,950][26022] Updated weights on worker 0-0, policy_version 1088670 (0.00089) [2022-07-11 07:11:59,848][26022] Updated weights on worker 0-0, policy_version 1088680 (0.00087) [2022-07-11 07:12:00,314][25689] Fps is (10 sec: 5698.5, 60 sec: 5635.8, 300 sec: 5618.5). Total num frames: 1114810368. Throughput: 0: 5909.3. Samples: 1114816042. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:00,314][25689] Avg episode reward: [(0, '-0.508')] [2022-07-11 07:12:01,435][26022] Updated weights on worker 0-0, policy_version 1088690 (0.00085) [2022-07-11 07:12:03,574][26022] Updated weights on worker 0-0, policy_version 1088700 (0.00091) [2022-07-11 07:12:05,319][25689] Fps is (10 sec: 5518.6, 60 sec: 5627.5, 300 sec: 5611.6). Total num frames: 1114836992. Throughput: 0: 5010.1. Samples: 1114831816. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:05,320][25689] Avg episode reward: [(0, '0.063')] [2022-07-11 07:12:05,545][26022] Updated weights on worker 0-0, policy_version 1088710 (0.00087) [2022-07-11 07:12:07,302][26022] Updated weights on worker 0-0, policy_version 1088720 (0.00079) [2022-07-11 07:12:09,070][26022] Updated weights on worker 0-0, policy_version 1088730 (0.00093) [2022-07-11 07:12:10,332][25689] Fps is (10 sec: 5519.6, 60 sec: 5649.9, 300 sec: 5613.9). Total num frames: 1114865664. Throughput: 0: 5852.0. Samples: 1114865728. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:10,332][25689] Avg episode reward: [(0, '-0.236')] [2022-07-11 07:12:11,078][26022] Updated weights on worker 0-0, policy_version 1088740 (0.00093) [2022-07-11 07:12:12,662][26022] Updated weights on worker 0-0, policy_version 1088750 (0.00086) [2022-07-11 07:12:14,811][26022] Updated weights on worker 0-0, policy_version 1088760 (0.00085) [2022-07-11 07:12:15,468][25689] Fps is (10 sec: 5649.9, 60 sec: 5642.7, 300 sec: 5613.1). Total num frames: 1114894336. Throughput: 0: 5834.1. Samples: 1114899498. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:15,469][25689] Avg episode reward: [(0, '-0.049')] [2022-07-11 07:12:16,284][26022] Updated weights on worker 0-0, policy_version 1088770 (0.00084) [2022-07-11 07:12:18,186][26022] Updated weights on worker 0-0, policy_version 1088780 (0.00091) [2022-07-11 07:12:19,878][26022] Updated weights on worker 0-0, policy_version 1088790 (0.00368) [2022-07-11 07:12:20,539][25689] Fps is (10 sec: 5618.0, 60 sec: 5620.2, 300 sec: 5615.9). Total num frames: 1114923008. Throughput: 0: 4953.7. Samples: 1114916232. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:20,539][25689] Avg episode reward: [(0, '-0.186')] [2022-07-11 07:12:21,640][26022] Updated weights on worker 0-0, policy_version 1088800 (0.00078) [2022-07-11 07:12:23,618][26022] Updated weights on worker 0-0, policy_version 1088810 (0.00082) [2022-07-11 07:12:25,228][26022] Updated weights on worker 0-0, policy_version 1088820 (0.00086) [2022-07-11 07:12:25,578][25689] Fps is (10 sec: 5874.7, 60 sec: 5651.5, 300 sec: 5622.6). Total num frames: 1114953728. Throughput: 0: 5871.1. Samples: 1114950760. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:25,579][25689] Avg episode reward: [(0, '-0.409')] [2022-07-11 07:12:27,076][26022] Updated weights on worker 0-0, policy_version 1088830 (0.00082) [2022-07-11 07:12:29,049][26022] Updated weights on worker 0-0, policy_version 1088840 (0.00087) [2022-07-11 07:12:30,605][25689] Fps is (10 sec: 5798.2, 60 sec: 5638.6, 300 sec: 5623.3). Total num frames: 1114981376. Throughput: 0: 5885.7. Samples: 1114985054. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:12:30,606][25689] Avg episode reward: [(0, '-0.548')] [2022-07-11 07:12:30,659][26022] Updated weights on worker 0-0, policy_version 1088850 (0.00086) [2022-07-11 07:12:32,594][26022] Updated weights on worker 0-0, policy_version 1088860 (0.00080) [2022-07-11 07:12:34,143][26022] Updated weights on worker 0-0, policy_version 1088870 (0.00081) [2022-07-11 07:12:35,707][25689] Fps is (10 sec: 5560.5, 60 sec: 5650.2, 300 sec: 5621.5). Total num frames: 1115010048. Throughput: 0: 5909.3. Samples: 1115019092. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:12:35,707][25689] Avg episode reward: [(0, '0.507')] [2022-07-11 07:12:36,319][26022] Updated weights on worker 0-0, policy_version 1088880 (0.00089) [2022-07-11 07:12:37,975][26022] Updated weights on worker 0-0, policy_version 1088890 (0.00089) [2022-07-11 07:12:39,781][26022] Updated weights on worker 0-0, policy_version 1088900 (0.00089) [2022-07-11 07:12:40,735][25689] Fps is (10 sec: 5560.1, 60 sec: 5618.2, 300 sec: 5625.3). Total num frames: 1115037696. Throughput: 0: 5940.8. Samples: 1115036212. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:12:40,735][25689] Avg episode reward: [(0, '0.620')] [2022-07-11 07:12:41,573][26022] Updated weights on worker 0-0, policy_version 1088910 (0.00242) [2022-07-11 07:12:43,370][26022] Updated weights on worker 0-0, policy_version 1088920 (0.00081) [2022-07-11 07:12:45,277][26022] Updated weights on worker 0-0, policy_version 1088930 (0.00085) [2022-07-11 07:12:45,743][25689] Fps is (10 sec: 5611.7, 60 sec: 5619.7, 300 sec: 5616.5). Total num frames: 1115066368. Throughput: 0: 5926.1. Samples: 1115070258. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:12:45,743][25689] Avg episode reward: [(0, '0.726')] [2022-07-11 07:12:46,903][26022] Updated weights on worker 0-0, policy_version 1088940 (0.00086) [2022-07-11 07:12:48,871][26022] Updated weights on worker 0-0, policy_version 1088950 (0.00085) [2022-07-11 07:12:50,393][26022] Updated weights on worker 0-0, policy_version 1088960 (0.00085) [2022-07-11 07:12:50,798][25689] Fps is (10 sec: 5698.5, 60 sec: 5648.7, 300 sec: 5620.1). Total num frames: 1115095040. Throughput: 0: 5907.9. Samples: 1115104348. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:12:50,798][25689] Avg episode reward: [(0, '0.181')] [2022-07-11 07:12:52,288][26022] Updated weights on worker 0-0, policy_version 1088970 (0.00089) [2022-07-11 07:12:54,424][26022] Updated weights on worker 0-0, policy_version 1088980 (0.00081) [2022-07-11 07:12:55,820][26022] Updated weights on worker 0-0, policy_version 1088990 (0.00079) [2022-07-11 07:12:55,920][25689] Fps is (10 sec: 5835.8, 60 sec: 5675.6, 300 sec: 5628.4). Total num frames: 1115125760. Throughput: 0: 5058.2. Samples: 1115121336. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:12:55,921][25689] Avg episode reward: [(0, '0.040')] [2022-07-11 07:12:57,956][26022] Updated weights on worker 0-0, policy_version 1089000 (0.00047) [2022-07-11 07:12:59,453][26022] Updated weights on worker 0-0, policy_version 1089010 (0.00088) [2022-07-11 07:13:00,932][25689] Fps is (10 sec: 5759.6, 60 sec: 5659.0, 300 sec: 5635.6). Total num frames: 1115153408. Throughput: 0: 5935.2. Samples: 1115156086. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:00,933][25689] Avg episode reward: [(0, '0.566')] [2022-07-11 07:13:01,324][26022] Updated weights on worker 0-0, policy_version 1089020 (0.00091) [2022-07-11 07:13:03,535][26022] Updated weights on worker 0-0, policy_version 1089030 (0.00087) [2022-07-11 07:13:05,187][26022] Updated weights on worker 0-0, policy_version 1089040 (0.00079) [2022-07-11 07:13:05,971][25689] Fps is (10 sec: 5400.0, 60 sec: 5655.9, 300 sec: 5624.9). Total num frames: 1115180032. Throughput: 0: 5844.1. Samples: 1115188470. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:05,971][25689] Avg episode reward: [(0, '-0.278')] [2022-07-11 07:13:07,130][26022] Updated weights on worker 0-0, policy_version 1089050 (0.00091) [2022-07-11 07:13:08,900][26022] Updated weights on worker 0-0, policy_version 1089060 (0.00081) [2022-07-11 07:13:10,644][26022] Updated weights on worker 0-0, policy_version 1089070 (0.00096) [2022-07-11 07:13:11,041][25689] Fps is (10 sec: 5570.9, 60 sec: 5667.4, 300 sec: 5634.6). Total num frames: 1115209728. Throughput: 0: 5010.3. Samples: 1115205774. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:11,042][25689] Avg episode reward: [(0, '-0.508')] [2022-07-11 07:13:12,488][26022] Updated weights on worker 0-0, policy_version 1089080 (0.00087) [2022-07-11 07:13:14,070][26022] Updated weights on worker 0-0, policy_version 1089090 (0.00091) [2022-07-11 07:13:16,036][26022] Updated weights on worker 0-0, policy_version 1089100 (0.00190) [2022-07-11 07:13:16,110][25689] Fps is (10 sec: 5756.6, 60 sec: 5673.7, 300 sec: 5633.6). Total num frames: 1115238400. Throughput: 0: 5881.9. Samples: 1115240090. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:16,110][25689] Avg episode reward: [(0, '-0.580')] [2022-07-11 07:13:17,747][26022] Updated weights on worker 0-0, policy_version 1089110 (0.00079) [2022-07-11 07:13:19,619][26022] Updated weights on worker 0-0, policy_version 1089120 (0.00112) [2022-07-11 07:13:21,119][25689] Fps is (10 sec: 5690.6, 60 sec: 5679.5, 300 sec: 5637.0). Total num frames: 1115267072. Throughput: 0: 5856.0. Samples: 1115274298. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:21,119][25689] Avg episode reward: [(0, '0.307')] [2022-07-11 07:13:21,516][26022] Updated weights on worker 0-0, policy_version 1089130 (0.00082) [2022-07-11 07:13:23,189][26022] Updated weights on worker 0-0, policy_version 1089140 (0.00091) [2022-07-11 07:13:25,078][26022] Updated weights on worker 0-0, policy_version 1089150 (0.00094) [2022-07-11 07:13:26,146][25689] Fps is (10 sec: 5815.8, 60 sec: 5663.7, 300 sec: 5644.1). Total num frames: 1115296768. Throughput: 0: 5103.1. Samples: 1115291428. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:26,146][25689] Avg episode reward: [(0, '0.670')] [2022-07-11 07:13:26,661][26022] Updated weights on worker 0-0, policy_version 1089160 (0.00085) [2022-07-11 07:13:28,735][26022] Updated weights on worker 0-0, policy_version 1089170 (0.00087) [2022-07-11 07:13:30,363][26022] Updated weights on worker 0-0, policy_version 1089180 (0.00084) [2022-07-11 07:13:31,159][25689] Fps is (10 sec: 5507.5, 60 sec: 5631.3, 300 sec: 5631.8). Total num frames: 1115322368. Throughput: 0: 5953.3. Samples: 1115325538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:31,159][25689] Avg episode reward: [(0, '1.617')] [2022-07-11 07:13:32,258][26022] Updated weights on worker 0-0, policy_version 1089190 (0.00093) [2022-07-11 07:13:34,126][26022] Updated weights on worker 0-0, policy_version 1089200 (0.00089) [2022-07-11 07:13:35,874][26022] Updated weights on worker 0-0, policy_version 1089210 (0.00051) [2022-07-11 07:13:36,266][25689] Fps is (10 sec: 5564.9, 60 sec: 5664.5, 300 sec: 5640.5). Total num frames: 1115353088. Throughput: 0: 5937.1. Samples: 1115359762. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:36,267][25689] Avg episode reward: [(0, '1.304')] [2022-07-11 07:13:37,864][26022] Updated weights on worker 0-0, policy_version 1089220 (0.00096) [2022-07-11 07:13:39,540][26022] Updated weights on worker 0-0, policy_version 1089230 (0.00090) [2022-07-11 07:13:41,193][26022] Updated weights on worker 0-0, policy_version 1089240 (0.00107) [2022-07-11 07:13:41,287][25689] Fps is (10 sec: 5864.2, 60 sec: 5682.1, 300 sec: 5643.6). Total num frames: 1115381760. Throughput: 0: 5061.4. Samples: 1115376378. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:41,287][25689] Avg episode reward: [(0, '1.724')] [2022-07-11 07:13:43,152][26022] Updated weights on worker 0-0, policy_version 1089250 (0.00085) [2022-07-11 07:13:44,464][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:13:44,472][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001089258_1115400192.pth [2022-07-11 07:13:44,479][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001087276_1113370624.pth [2022-07-11 07:13:44,867][26022] Updated weights on worker 0-0, policy_version 1089260 (0.00103) [2022-07-11 07:13:46,288][25689] Fps is (10 sec: 5619.8, 60 sec: 5665.8, 300 sec: 5640.8). Total num frames: 1115409408. Throughput: 0: 5924.1. Samples: 1115410752. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:46,289][25689] Avg episode reward: [(0, '1.951')] [2022-07-11 07:13:46,793][26022] Updated weights on worker 0-0, policy_version 1089270 (0.00083) [2022-07-11 07:13:48,433][26022] Updated weights on worker 0-0, policy_version 1089280 (0.00086) [2022-07-11 07:13:50,350][26022] Updated weights on worker 0-0, policy_version 1089290 (0.00097) [2022-07-11 07:13:51,302][25689] Fps is (10 sec: 5623.5, 60 sec: 5669.7, 300 sec: 5645.2). Total num frames: 1115438080. Throughput: 0: 5936.9. Samples: 1115445126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:51,302][25689] Avg episode reward: [(0, '1.473')] [2022-07-11 07:13:51,936][26022] Updated weights on worker 0-0, policy_version 1089300 (0.00085) [2022-07-11 07:13:54,108][26022] Updated weights on worker 0-0, policy_version 1089310 (0.00091) [2022-07-11 07:13:55,533][26022] Updated weights on worker 0-0, policy_version 1089320 (0.00084) [2022-07-11 07:13:56,404][25689] Fps is (10 sec: 5668.8, 60 sec: 5637.7, 300 sec: 5640.1). Total num frames: 1115466752. Throughput: 0: 5073.2. Samples: 1115461924. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:13:56,404][25689] Avg episode reward: [(0, '1.268')] [2022-07-11 07:13:57,602][26022] Updated weights on worker 0-0, policy_version 1089330 (0.00084) [2022-07-11 07:13:59,263][26022] Updated weights on worker 0-0, policy_version 1089340 (0.00084) [2022-07-11 07:14:01,135][26022] Updated weights on worker 0-0, policy_version 1089350 (0.00079) [2022-07-11 07:14:01,478][25689] Fps is (10 sec: 5735.5, 60 sec: 5665.7, 300 sec: 5649.4). Total num frames: 1115496448. Throughput: 0: 5926.6. Samples: 1115496046. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:01,479][25689] Avg episode reward: [(0, '1.054')] [2022-07-11 07:14:03,288][26022] Updated weights on worker 0-0, policy_version 1089360 (0.00085) [2022-07-11 07:14:04,995][26022] Updated weights on worker 0-0, policy_version 1089370 (0.00085) [2022-07-11 07:14:06,518][25689] Fps is (10 sec: 5568.6, 60 sec: 5665.7, 300 sec: 5649.0). Total num frames: 1115523072. Throughput: 0: 5815.1. Samples: 1115528390. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:06,518][25689] Avg episode reward: [(0, '1.066')] [2022-07-11 07:14:07,043][26022] Updated weights on worker 0-0, policy_version 1089380 (0.00073) [2022-07-11 07:14:08,548][26022] Updated weights on worker 0-0, policy_version 1089390 (0.00085) [2022-07-11 07:14:10,503][26022] Updated weights on worker 0-0, policy_version 1089400 (0.00087) [2022-07-11 07:14:11,536][25689] Fps is (10 sec: 5396.0, 60 sec: 5636.7, 300 sec: 5640.7). Total num frames: 1115550720. Throughput: 0: 4963.0. Samples: 1115545550. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:11,537][25689] Avg episode reward: [(0, '0.377')] [2022-07-11 07:14:12,126][26022] Updated weights on worker 0-0, policy_version 1089410 (0.00087) [2022-07-11 07:14:14,184][26022] Updated weights on worker 0-0, policy_version 1089420 (0.00084) [2022-07-11 07:14:15,936][26022] Updated weights on worker 0-0, policy_version 1089430 (0.00089) [2022-07-11 07:14:16,655][25689] Fps is (10 sec: 5656.9, 60 sec: 5649.0, 300 sec: 5646.2). Total num frames: 1115580416. Throughput: 0: 5789.2. Samples: 1115579160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:16,655][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 07:14:17,856][26022] Updated weights on worker 0-0, policy_version 1089440 (0.00097) [2022-07-11 07:14:19,471][26022] Updated weights on worker 0-0, policy_version 1089450 (0.00090) [2022-07-11 07:14:21,272][26022] Updated weights on worker 0-0, policy_version 1089460 (0.00085) [2022-07-11 07:14:21,659][25689] Fps is (10 sec: 5664.9, 60 sec: 5632.5, 300 sec: 5640.6). Total num frames: 1115608064. Throughput: 0: 5821.3. Samples: 1115613522. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:21,659][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 07:14:23,131][26022] Updated weights on worker 0-0, policy_version 1089470 (0.00088) [2022-07-11 07:14:25,054][26022] Updated weights on worker 0-0, policy_version 1089480 (0.00088) [2022-07-11 07:14:26,668][25689] Fps is (10 sec: 5624.9, 60 sec: 5617.3, 300 sec: 5648.0). Total num frames: 1115636736. Throughput: 0: 5066.9. Samples: 1115630486. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:26,668][25689] Avg episode reward: [(0, '-0.533')] [2022-07-11 07:14:26,692][26022] Updated weights on worker 0-0, policy_version 1089490 (0.00094) [2022-07-11 07:14:28,345][26022] Updated weights on worker 0-0, policy_version 1089500 (0.00089) [2022-07-11 07:14:30,362][26022] Updated weights on worker 0-0, policy_version 1089510 (0.00085) [2022-07-11 07:14:31,764][25689] Fps is (10 sec: 5776.3, 60 sec: 5677.2, 300 sec: 5651.5). Total num frames: 1115666432. Throughput: 0: 5887.5. Samples: 1115664638. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:31,765][25689] Avg episode reward: [(0, '-0.611')] [2022-07-11 07:14:32,040][26022] Updated weights on worker 0-0, policy_version 1089520 (0.00086) [2022-07-11 07:14:34,139][26022] Updated weights on worker 0-0, policy_version 1089530 (0.00085) [2022-07-11 07:14:35,474][26022] Updated weights on worker 0-0, policy_version 1089540 (0.00085) [2022-07-11 07:14:36,881][25689] Fps is (10 sec: 5514.2, 60 sec: 5608.7, 300 sec: 5639.9). Total num frames: 1115693056. Throughput: 0: 5914.2. Samples: 1115698784. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:36,882][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 07:14:37,611][26022] Updated weights on worker 0-0, policy_version 1089550 (0.00085) [2022-07-11 07:14:39,096][26022] Updated weights on worker 0-0, policy_version 1089560 (0.00081) [2022-07-11 07:14:41,023][26022] Updated weights on worker 0-0, policy_version 1089570 (0.00086) [2022-07-11 07:14:41,912][25689] Fps is (10 sec: 5751.2, 60 sec: 5658.3, 300 sec: 5650.1). Total num frames: 1115724800. Throughput: 0: 5907.0. Samples: 1115733160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:41,913][25689] Avg episode reward: [(0, '0.335')] [2022-07-11 07:14:42,938][26022] Updated weights on worker 0-0, policy_version 1089580 (0.00050) [2022-07-11 07:14:44,763][26022] Updated weights on worker 0-0, policy_version 1089590 (0.00090) [2022-07-11 07:14:46,504][26022] Updated weights on worker 0-0, policy_version 1089600 (0.00087) [2022-07-11 07:14:46,926][25689] Fps is (10 sec: 5810.5, 60 sec: 5640.3, 300 sec: 5643.8). Total num frames: 1115751424. Throughput: 0: 5913.1. Samples: 1115750280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:46,927][25689] Avg episode reward: [(0, '-0.599')] [2022-07-11 07:14:48,391][26022] Updated weights on worker 0-0, policy_version 1089610 (0.00083) [2022-07-11 07:14:50,096][26022] Updated weights on worker 0-0, policy_version 1089620 (0.00096) [2022-07-11 07:14:51,843][26022] Updated weights on worker 0-0, policy_version 1089630 (0.00084) [2022-07-11 07:14:51,940][25689] Fps is (10 sec: 5616.4, 60 sec: 5657.1, 300 sec: 5645.7). Total num frames: 1115781120. Throughput: 0: 5945.8. Samples: 1115784604. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:51,941][25689] Avg episode reward: [(0, '-0.106')] [2022-07-11 07:14:53,666][26022] Updated weights on worker 0-0, policy_version 1089640 (0.00081) [2022-07-11 07:14:55,335][26022] Updated weights on worker 0-0, policy_version 1089650 (0.00077) [2022-07-11 07:14:57,064][25689] Fps is (10 sec: 5757.6, 60 sec: 5655.1, 300 sec: 5650.7). Total num frames: 1115809792. Throughput: 0: 5954.8. Samples: 1115818968. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:14:57,065][25689] Avg episode reward: [(0, '-0.566')] [2022-07-11 07:14:57,410][26022] Updated weights on worker 0-0, policy_version 1089660 (0.00089) [2022-07-11 07:14:59,154][26022] Updated weights on worker 0-0, policy_version 1089670 (0.00091) [2022-07-11 07:15:00,934][26022] Updated weights on worker 0-0, policy_version 1089680 (0.00086) [2022-07-11 07:15:02,088][25689] Fps is (10 sec: 5449.3, 60 sec: 5609.2, 300 sec: 5641.4). Total num frames: 1115836416. Throughput: 0: 5091.5. Samples: 1115835882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:02,088][25689] Avg episode reward: [(0, '-0.368')] [2022-07-11 07:15:03,087][26022] Updated weights on worker 0-0, policy_version 1089690 (0.00087) [2022-07-11 07:15:04,771][26022] Updated weights on worker 0-0, policy_version 1089700 (0.00086) [2022-07-11 07:15:06,659][26022] Updated weights on worker 0-0, policy_version 1089710 (0.00078) [2022-07-11 07:15:07,136][25689] Fps is (10 sec: 5490.1, 60 sec: 5642.1, 300 sec: 5647.6). Total num frames: 1115865088. Throughput: 0: 5825.4. Samples: 1115868008. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:07,137][25689] Avg episode reward: [(0, '-0.477')] [2022-07-11 07:15:08,455][26022] Updated weights on worker 0-0, policy_version 1089720 (0.00090) [2022-07-11 07:15:10,317][26022] Updated weights on worker 0-0, policy_version 1089730 (0.00085) [2022-07-11 07:15:12,156][25689] Fps is (10 sec: 5593.6, 60 sec: 5642.0, 300 sec: 5644.9). Total num frames: 1115892736. Throughput: 0: 5815.3. Samples: 1115902166. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:12,157][25689] Avg episode reward: [(0, '-0.604')] [2022-07-11 07:15:12,220][26022] Updated weights on worker 0-0, policy_version 1089740 (0.00095) [2022-07-11 07:15:13,948][26022] Updated weights on worker 0-0, policy_version 1089750 (0.00080) [2022-07-11 07:15:15,632][26022] Updated weights on worker 0-0, policy_version 1089760 (0.00085) [2022-07-11 07:15:17,251][25689] Fps is (10 sec: 5669.1, 60 sec: 5644.2, 300 sec: 5643.3). Total num frames: 1115922432. Throughput: 0: 4957.8. Samples: 1115919052. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:17,252][25689] Avg episode reward: [(0, '-0.295')] [2022-07-11 07:15:17,521][26022] Updated weights on worker 0-0, policy_version 1089770 (0.00088) [2022-07-11 07:15:19,315][26022] Updated weights on worker 0-0, policy_version 1089780 (0.00093) [2022-07-11 07:15:21,138][26022] Updated weights on worker 0-0, policy_version 1089790 (0.00079) [2022-07-11 07:15:22,258][25689] Fps is (10 sec: 5777.9, 60 sec: 5660.8, 300 sec: 5647.9). Total num frames: 1115951104. Throughput: 0: 5820.1. Samples: 1115953276. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:22,260][25689] Avg episode reward: [(0, '0.725')] [2022-07-11 07:15:23,050][26022] Updated weights on worker 0-0, policy_version 1089800 (0.00085) [2022-07-11 07:15:24,776][26022] Updated weights on worker 0-0, policy_version 1089810 (0.00083) [2022-07-11 07:15:26,542][26022] Updated weights on worker 0-0, policy_version 1089820 (0.00086) [2022-07-11 07:15:27,273][25689] Fps is (10 sec: 5721.7, 60 sec: 5660.2, 300 sec: 5644.7). Total num frames: 1115979776. Throughput: 0: 5939.5. Samples: 1115987614. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:27,275][25689] Avg episode reward: [(0, '0.662')] [2022-07-11 07:15:28,267][26022] Updated weights on worker 0-0, policy_version 1089830 (0.00086) [2022-07-11 07:15:29,992][26022] Updated weights on worker 0-0, policy_version 1089840 (0.00083) [2022-07-11 07:15:31,878][26022] Updated weights on worker 0-0, policy_version 1089850 (0.00081) [2022-07-11 07:15:32,279][25689] Fps is (10 sec: 5722.2, 60 sec: 5651.7, 300 sec: 5642.4). Total num frames: 1116008448. Throughput: 0: 5097.3. Samples: 1116004740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:32,280][25689] Avg episode reward: [(0, '0.718')] [2022-07-11 07:15:33,699][26022] Updated weights on worker 0-0, policy_version 1089860 (0.00085) [2022-07-11 07:15:35,482][26022] Updated weights on worker 0-0, policy_version 1089870 (0.00080) [2022-07-11 07:15:37,233][26022] Updated weights on worker 0-0, policy_version 1089880 (0.00079) [2022-07-11 07:15:37,326][25689] Fps is (10 sec: 5704.3, 60 sec: 5692.2, 300 sec: 5648.5). Total num frames: 1116037120. Throughput: 0: 5968.4. Samples: 1116038868. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:37,328][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 07:15:39,127][26022] Updated weights on worker 0-0, policy_version 1089890 (0.00086) [2022-07-11 07:15:40,804][26022] Updated weights on worker 0-0, policy_version 1089900 (0.00082) [2022-07-11 07:15:42,334][25689] Fps is (10 sec: 5601.3, 60 sec: 5626.6, 300 sec: 5642.8). Total num frames: 1116064768. Throughput: 0: 5994.2. Samples: 1116073616. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:42,336][25689] Avg episode reward: [(0, '0.894')] [2022-07-11 07:15:42,732][26022] Updated weights on worker 0-0, policy_version 1089910 (0.00094) [2022-07-11 07:15:44,345][26022] Updated weights on worker 0-0, policy_version 1089920 (0.00085) [2022-07-11 07:15:44,570][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:15:44,582][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001089921_1116079104.pth [2022-07-11 07:15:44,582][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001087932_1114042368.pth [2022-07-11 07:15:46,220][26022] Updated weights on worker 0-0, policy_version 1089930 (0.00090) [2022-07-11 07:15:47,359][25689] Fps is (10 sec: 5715.5, 60 sec: 5676.4, 300 sec: 5653.2). Total num frames: 1116094464. Throughput: 0: 5134.7. Samples: 1116090748. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:47,360][25689] Avg episode reward: [(0, '-0.508')] [2022-07-11 07:15:48,008][26022] Updated weights on worker 0-0, policy_version 1089940 (0.00105) [2022-07-11 07:15:49,785][26022] Updated weights on worker 0-0, policy_version 1089950 (0.00084) [2022-07-11 07:15:51,717][26022] Updated weights on worker 0-0, policy_version 1089960 (0.00084) [2022-07-11 07:15:52,393][25689] Fps is (10 sec: 5700.8, 60 sec: 5640.6, 300 sec: 5646.6). Total num frames: 1116122112. Throughput: 0: 5982.8. Samples: 1116125076. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:52,394][25689] Avg episode reward: [(0, '-0.069')] [2022-07-11 07:15:53,331][26022] Updated weights on worker 0-0, policy_version 1089970 (0.00087) [2022-07-11 07:15:55,320][26022] Updated weights on worker 0-0, policy_version 1089980 (0.00102) [2022-07-11 07:15:56,815][26022] Updated weights on worker 0-0, policy_version 1089990 (0.00081) [2022-07-11 07:15:57,517][25689] Fps is (10 sec: 5746.0, 60 sec: 5674.5, 300 sec: 5654.7). Total num frames: 1116152832. Throughput: 0: 5966.7. Samples: 1116159342. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:15:57,518][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 07:15:58,843][26022] Updated weights on worker 0-0, policy_version 1090000 (0.00081) [2022-07-11 07:16:00,515][26022] Updated weights on worker 0-0, policy_version 1090010 (0.00092) [2022-07-11 07:16:02,551][25689] Fps is (10 sec: 5645.2, 60 sec: 5673.5, 300 sec: 5652.5). Total num frames: 1116179456. Throughput: 0: 5094.3. Samples: 1116176608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:16:02,552][25689] Avg episode reward: [(0, '0.834')] [2022-07-11 07:16:02,779][26022] Updated weights on worker 0-0, policy_version 1090020 (0.00086) [2022-07-11 07:16:04,268][26022] Updated weights on worker 0-0, policy_version 1090030 (0.00101) [2022-07-11 07:16:06,198][26022] Updated weights on worker 0-0, policy_version 1090040 (0.00084) [2022-07-11 07:16:07,573][25689] Fps is (10 sec: 5397.3, 60 sec: 5659.1, 300 sec: 5653.4). Total num frames: 1116207104. Throughput: 0: 5835.6. Samples: 1116208706. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:16:07,573][25689] Avg episode reward: [(0, '-0.183')] [2022-07-11 07:16:08,123][26022] Updated weights on worker 0-0, policy_version 1090050 (0.00083) [2022-07-11 07:16:09,843][26022] Updated weights on worker 0-0, policy_version 1090060 (0.00082) [2022-07-11 07:16:11,723][26022] Updated weights on worker 0-0, policy_version 1090070 (0.00080) [2022-07-11 07:16:12,591][25689] Fps is (10 sec: 5711.8, 60 sec: 5693.2, 300 sec: 5657.7). Total num frames: 1116236800. Throughput: 0: 5859.4. Samples: 1116243422. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:16:12,591][25689] Avg episode reward: [(0, '0.728')] [2022-07-11 07:16:13,468][26022] Updated weights on worker 0-0, policy_version 1090080 (0.00107) [2022-07-11 07:16:15,321][26022] Updated weights on worker 0-0, policy_version 1090090 (0.00080) [2022-07-11 07:16:17,119][26022] Updated weights on worker 0-0, policy_version 1090100 (0.00093) [2022-07-11 07:16:17,687][25689] Fps is (10 sec: 5669.7, 60 sec: 5659.2, 300 sec: 5649.2). Total num frames: 1116264448. Throughput: 0: 4993.0. Samples: 1116260048. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:16:17,687][25689] Avg episode reward: [(0, '0.671')] [2022-07-11 07:16:18,820][26022] Updated weights on worker 0-0, policy_version 1090110 (0.00083) [2022-07-11 07:16:20,829][26022] Updated weights on worker 0-0, policy_version 1090120 (0.00089) [2022-07-11 07:16:22,473][26022] Updated weights on worker 0-0, policy_version 1090130 (0.00091) [2022-07-11 07:16:22,713][25689] Fps is (10 sec: 5563.8, 60 sec: 5657.3, 300 sec: 5648.9). Total num frames: 1116293120. Throughput: 0: 5833.6. Samples: 1116294226. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:16:22,714][25689] Avg episode reward: [(0, '0.670')] [2022-07-11 07:16:24,431][26022] Updated weights on worker 0-0, policy_version 1090140 (0.00080) [2022-07-11 07:16:25,963][26022] Updated weights on worker 0-0, policy_version 1090150 (0.00089) [2022-07-11 07:16:27,737][25689] Fps is (10 sec: 5706.0, 60 sec: 5656.6, 300 sec: 5649.8). Total num frames: 1116321792. Throughput: 0: 5946.0. Samples: 1116328602. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2022-07-11 07:16:27,737][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 07:16:28,078][26022] Updated weights on worker 0-0, policy_version 1090160 (0.00084) [2022-07-11 07:16:29,596][26022] Updated weights on worker 0-0, policy_version 1090170 (0.00088) [2022-07-11 07:16:31,578][26022] Updated weights on worker 0-0, policy_version 1090180 (0.00086) [2022-07-11 07:16:32,740][25689] Fps is (10 sec: 5821.2, 60 sec: 5673.8, 300 sec: 5657.4). Total num frames: 1116351488. Throughput: 0: 5081.2. Samples: 1116345806. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:16:32,741][25689] Avg episode reward: [(0, '0.400')] [2022-07-11 07:16:33,234][26022] Updated weights on worker 0-0, policy_version 1090190 (0.00083) [2022-07-11 07:16:35,196][26022] Updated weights on worker 0-0, policy_version 1090200 (0.00088) [2022-07-11 07:16:36,936][26022] Updated weights on worker 0-0, policy_version 1090210 (0.00092) [2022-07-11 07:16:37,874][25689] Fps is (10 sec: 5757.6, 60 sec: 5665.6, 300 sec: 5652.4). Total num frames: 1116380160. Throughput: 0: 5944.4. Samples: 1116380050. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:16:37,875][25689] Avg episode reward: [(0, '0.724')] [2022-07-11 07:16:38,648][26022] Updated weights on worker 0-0, policy_version 1090220 (0.00082) [2022-07-11 07:16:40,531][26022] Updated weights on worker 0-0, policy_version 1090230 (0.00092) [2022-07-11 07:16:42,416][26022] Updated weights on worker 0-0, policy_version 1090240 (0.00081) [2022-07-11 07:16:42,896][25689] Fps is (10 sec: 5545.8, 60 sec: 5664.3, 300 sec: 5649.0). Total num frames: 1116407808. Throughput: 0: 5951.3. Samples: 1116414338. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:16:42,896][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 07:16:44,009][26022] Updated weights on worker 0-0, policy_version 1090250 (0.00085) [2022-07-11 07:16:46,153][26022] Updated weights on worker 0-0, policy_version 1090260 (0.00085) [2022-07-11 07:16:47,598][26022] Updated weights on worker 0-0, policy_version 1090270 (0.00083) [2022-07-11 07:16:47,981][25689] Fps is (10 sec: 5673.6, 60 sec: 5658.7, 300 sec: 5657.7). Total num frames: 1116437504. Throughput: 0: 5085.9. Samples: 1116431564. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:16:47,982][25689] Avg episode reward: [(0, '1.244')] [2022-07-11 07:16:49,563][26022] Updated weights on worker 0-0, policy_version 1090280 (0.00093) [2022-07-11 07:16:51,458][26022] Updated weights on worker 0-0, policy_version 1090290 (0.00084) [2022-07-11 07:16:53,007][25689] Fps is (10 sec: 5873.7, 60 sec: 5693.2, 300 sec: 5661.6). Total num frames: 1116467200. Throughput: 0: 5926.1. Samples: 1116465910. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:16:53,008][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 07:16:53,018][26022] Updated weights on worker 0-0, policy_version 1090300 (0.00083) [2022-07-11 07:16:55,193][26022] Updated weights on worker 0-0, policy_version 1090310 (0.00088) [2022-07-11 07:16:56,639][26022] Updated weights on worker 0-0, policy_version 1090320 (0.00084) [2022-07-11 07:16:58,056][25689] Fps is (10 sec: 5590.3, 60 sec: 5632.6, 300 sec: 5654.1). Total num frames: 1116493824. Throughput: 0: 5932.7. Samples: 1116499782. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:16:58,056][25689] Avg episode reward: [(0, '0.951')] [2022-07-11 07:16:58,609][26022] Updated weights on worker 0-0, policy_version 1090330 (0.00085) [2022-07-11 07:17:00,238][26022] Updated weights on worker 0-0, policy_version 1090340 (0.00081) [2022-07-11 07:17:02,232][26022] Updated weights on worker 0-0, policy_version 1090350 (0.00096) [2022-07-11 07:17:03,065][25689] Fps is (10 sec: 5294.3, 60 sec: 5635.0, 300 sec: 5654.0). Total num frames: 1116520448. Throughput: 0: 5093.6. Samples: 1116517072. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:03,065][25689] Avg episode reward: [(0, '0.924')] [2022-07-11 07:17:04,133][26022] Updated weights on worker 0-0, policy_version 1090360 (0.00086) [2022-07-11 07:17:06,343][26022] Updated weights on worker 0-0, policy_version 1090370 (0.00085) [2022-07-11 07:17:07,720][26022] Updated weights on worker 0-0, policy_version 1090380 (0.00083) [2022-07-11 07:17:08,085][25689] Fps is (10 sec: 5717.9, 60 sec: 5685.9, 300 sec: 5660.7). Total num frames: 1116551168. Throughput: 0: 5850.8. Samples: 1116549186. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:08,085][25689] Avg episode reward: [(0, '1.062')] [2022-07-11 07:17:09,834][26022] Updated weights on worker 0-0, policy_version 1090390 (0.00094) [2022-07-11 07:17:11,358][26022] Updated weights on worker 0-0, policy_version 1090400 (0.00086) [2022-07-11 07:17:13,113][25689] Fps is (10 sec: 5707.1, 60 sec: 5634.2, 300 sec: 5655.9). Total num frames: 1116577792. Throughput: 0: 5836.1. Samples: 1116583248. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:13,113][25689] Avg episode reward: [(0, '0.963')] [2022-07-11 07:17:13,270][26022] Updated weights on worker 0-0, policy_version 1090410 (0.00083) [2022-07-11 07:17:15,175][26022] Updated weights on worker 0-0, policy_version 1090420 (0.00085) [2022-07-11 07:17:16,868][26022] Updated weights on worker 0-0, policy_version 1090430 (0.00084) [2022-07-11 07:17:18,236][25689] Fps is (10 sec: 5447.4, 60 sec: 5648.6, 300 sec: 5654.9). Total num frames: 1116606464. Throughput: 0: 4972.8. Samples: 1116600132. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:18,236][25689] Avg episode reward: [(0, '1.148')] [2022-07-11 07:17:18,687][26022] Updated weights on worker 0-0, policy_version 1090440 (0.00089) [2022-07-11 07:17:20,589][26022] Updated weights on worker 0-0, policy_version 1090450 (0.00091) [2022-07-11 07:17:22,083][26022] Updated weights on worker 0-0, policy_version 1090460 (0.00081) [2022-07-11 07:17:23,267][25689] Fps is (10 sec: 5647.4, 60 sec: 5648.2, 300 sec: 5648.2). Total num frames: 1116635136. Throughput: 0: 5798.3. Samples: 1116634208. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:23,267][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 07:17:24,242][26022] Updated weights on worker 0-0, policy_version 1090470 (0.00090) [2022-07-11 07:17:25,875][26022] Updated weights on worker 0-0, policy_version 1090480 (0.00084) [2022-07-11 07:17:27,659][26022] Updated weights on worker 0-0, policy_version 1090490 (0.00086) [2022-07-11 07:17:28,293][25689] Fps is (10 sec: 5803.5, 60 sec: 5664.8, 300 sec: 5655.1). Total num frames: 1116664832. Throughput: 0: 5903.2. Samples: 1116668478. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:28,294][25689] Avg episode reward: [(0, '0.890')] [2022-07-11 07:17:29,671][26022] Updated weights on worker 0-0, policy_version 1090500 (0.00084) [2022-07-11 07:17:31,120][26022] Updated weights on worker 0-0, policy_version 1090510 (0.00090) [2022-07-11 07:17:33,265][26022] Updated weights on worker 0-0, policy_version 1090520 (0.00081) [2022-07-11 07:17:33,313][25689] Fps is (10 sec: 5708.1, 60 sec: 5629.5, 300 sec: 5653.2). Total num frames: 1116692480. Throughput: 0: 5909.2. Samples: 1116702614. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:33,313][25689] Avg episode reward: [(0, '1.050')] [2022-07-11 07:17:34,834][26022] Updated weights on worker 0-0, policy_version 1090530 (0.00097) [2022-07-11 07:17:36,594][26022] Updated weights on worker 0-0, policy_version 1090540 (0.00085) [2022-07-11 07:17:38,419][25689] Fps is (10 sec: 5562.2, 60 sec: 5632.1, 300 sec: 5655.2). Total num frames: 1116721152. Throughput: 0: 5928.9. Samples: 1116719794. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:38,419][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 07:17:38,535][26022] Updated weights on worker 0-0, policy_version 1090550 (0.00083) [2022-07-11 07:17:40,246][26022] Updated weights on worker 0-0, policy_version 1090560 (0.00087) [2022-07-11 07:17:42,286][26022] Updated weights on worker 0-0, policy_version 1090571 (0.00086) [2022-07-11 07:17:43,447][25689] Fps is (10 sec: 5860.7, 60 sec: 5682.2, 300 sec: 5661.7). Total num frames: 1116751872. Throughput: 0: 5927.9. Samples: 1116753832. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:43,447][25689] Avg episode reward: [(0, '1.470')] [2022-07-11 07:17:44,202][26022] Updated weights on worker 0-0, policy_version 1090581 (0.00092) [2022-07-11 07:17:44,619][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:17:44,630][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001090583_1116756992.pth [2022-07-11 07:17:44,630][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001088595_1114721280.pth [2022-07-11 07:17:45,893][26022] Updated weights on worker 0-0, policy_version 1090591 (0.00098) [2022-07-11 07:17:47,922][26022] Updated weights on worker 0-0, policy_version 1090601 (0.00055) [2022-07-11 07:17:48,462][25689] Fps is (10 sec: 5709.6, 60 sec: 5638.0, 300 sec: 5655.6). Total num frames: 1116778496. Throughput: 0: 5912.4. Samples: 1116787726. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:48,463][25689] Avg episode reward: [(0, '1.709')] [2022-07-11 07:17:49,647][26022] Updated weights on worker 0-0, policy_version 1090611 (0.00080) [2022-07-11 07:17:51,384][26022] Updated weights on worker 0-0, policy_version 1090621 (0.00085) [2022-07-11 07:17:53,043][26022] Updated weights on worker 0-0, policy_version 1090631 (0.00086) [2022-07-11 07:17:53,492][25689] Fps is (10 sec: 5504.5, 60 sec: 5620.7, 300 sec: 5650.4). Total num frames: 1116807168. Throughput: 0: 5066.0. Samples: 1116804844. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:53,493][25689] Avg episode reward: [(0, '1.601')] [2022-07-11 07:17:54,904][26022] Updated weights on worker 0-0, policy_version 1090641 (0.00087) [2022-07-11 07:17:56,855][26022] Updated weights on worker 0-0, policy_version 1090651 (0.00092) [2022-07-11 07:17:58,539][25689] Fps is (10 sec: 5690.8, 60 sec: 5654.8, 300 sec: 5653.2). Total num frames: 1116835840. Throughput: 0: 5929.3. Samples: 1116839092. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:17:58,539][25689] Avg episode reward: [(0, '0.891')] [2022-07-11 07:17:58,554][26022] Updated weights on worker 0-0, policy_version 1090661 (0.00076) [2022-07-11 07:18:00,437][26022] Updated weights on worker 0-0, policy_version 1090671 (0.00084) [2022-07-11 07:18:02,543][26022] Updated weights on worker 0-0, policy_version 1090681 (0.00081) [2022-07-11 07:18:03,558][25689] Fps is (10 sec: 5391.5, 60 sec: 5636.8, 300 sec: 5650.1). Total num frames: 1116861440. Throughput: 0: 5834.4. Samples: 1116871172. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:03,559][25689] Avg episode reward: [(0, '0.618')] [2022-07-11 07:18:04,400][26022] Updated weights on worker 0-0, policy_version 1090691 (0.00090) [2022-07-11 07:18:06,220][26022] Updated weights on worker 0-0, policy_version 1090701 (0.00085) [2022-07-11 07:18:07,868][26022] Updated weights on worker 0-0, policy_version 1090711 (0.00093) [2022-07-11 07:18:08,575][25689] Fps is (10 sec: 5509.4, 60 sec: 5620.2, 300 sec: 5651.1). Total num frames: 1116891136. Throughput: 0: 5003.9. Samples: 1116888372. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:08,576][25689] Avg episode reward: [(0, '0.567')] [2022-07-11 07:18:09,618][26022] Updated weights on worker 0-0, policy_version 1090721 (0.00090) [2022-07-11 07:18:11,611][26022] Updated weights on worker 0-0, policy_version 1090731 (0.00085) [2022-07-11 07:18:13,358][26022] Updated weights on worker 0-0, policy_version 1090741 (0.00082) [2022-07-11 07:18:13,581][25689] Fps is (10 sec: 5721.3, 60 sec: 5639.1, 300 sec: 5648.9). Total num frames: 1116918784. Throughput: 0: 5843.4. Samples: 1116922234. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:13,582][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 07:18:15,235][26022] Updated weights on worker 0-0, policy_version 1090751 (0.00086) [2022-07-11 07:18:16,891][26022] Updated weights on worker 0-0, policy_version 1090761 (0.00085) [2022-07-11 07:18:18,659][25689] Fps is (10 sec: 5686.9, 60 sec: 5660.3, 300 sec: 5651.0). Total num frames: 1116948480. Throughput: 0: 5839.5. Samples: 1116956584. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:18,660][25689] Avg episode reward: [(0, '0.546')] [2022-07-11 07:18:18,863][26022] Updated weights on worker 0-0, policy_version 1090771 (0.00090) [2022-07-11 07:18:20,472][26022] Updated weights on worker 0-0, policy_version 1090781 (0.00049) [2022-07-11 07:18:22,449][26022] Updated weights on worker 0-0, policy_version 1090791 (0.00086) [2022-07-11 07:18:23,683][25689] Fps is (10 sec: 5778.3, 60 sec: 5661.0, 300 sec: 5647.7). Total num frames: 1116977152. Throughput: 0: 5095.0. Samples: 1116973704. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:23,684][25689] Avg episode reward: [(0, '0.673')] [2022-07-11 07:18:24,149][26022] Updated weights on worker 0-0, policy_version 1090801 (0.00091) [2022-07-11 07:18:26,142][26022] Updated weights on worker 0-0, policy_version 1090811 (0.00093) [2022-07-11 07:18:27,651][26022] Updated weights on worker 0-0, policy_version 1090821 (0.00089) [2022-07-11 07:18:28,694][25689] Fps is (10 sec: 5612.4, 60 sec: 5628.5, 300 sec: 5654.6). Total num frames: 1117004800. Throughput: 0: 5949.9. Samples: 1117008074. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:28,696][25689] Avg episode reward: [(0, '1.653')] [2022-07-11 07:18:29,668][26022] Updated weights on worker 0-0, policy_version 1090831 (0.00085) [2022-07-11 07:18:31,228][26022] Updated weights on worker 0-0, policy_version 1090841 (0.00090) [2022-07-11 07:18:33,192][26022] Updated weights on worker 0-0, policy_version 1090851 (0.00089) [2022-07-11 07:18:33,735][25689] Fps is (10 sec: 5806.6, 60 sec: 5677.4, 300 sec: 5655.8). Total num frames: 1117035520. Throughput: 0: 5961.2. Samples: 1117042372. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:33,735][25689] Avg episode reward: [(0, '1.854')] [2022-07-11 07:18:35,154][26022] Updated weights on worker 0-0, policy_version 1090861 (0.00083) [2022-07-11 07:18:36,539][26022] Updated weights on worker 0-0, policy_version 1090871 (0.00079) [2022-07-11 07:18:38,618][26022] Updated weights on worker 0-0, policy_version 1090881 (0.00092) [2022-07-11 07:18:38,804][25689] Fps is (10 sec: 5773.0, 60 sec: 5663.8, 300 sec: 5651.5). Total num frames: 1117063168. Throughput: 0: 5114.4. Samples: 1117059616. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:38,805][25689] Avg episode reward: [(0, '1.693')] [2022-07-11 07:18:40,159][26022] Updated weights on worker 0-0, policy_version 1090891 (0.00079) [2022-07-11 07:18:42,055][26022] Updated weights on worker 0-0, policy_version 1090901 (0.00082) [2022-07-11 07:18:43,744][26022] Updated weights on worker 0-0, policy_version 1090911 (0.00090) [2022-07-11 07:18:43,860][25689] Fps is (10 sec: 5663.3, 60 sec: 5644.3, 300 sec: 5657.3). Total num frames: 1117092864. Throughput: 0: 5969.1. Samples: 1117094146. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:43,861][25689] Avg episode reward: [(0, '1.650')] [2022-07-11 07:18:45,685][26022] Updated weights on worker 0-0, policy_version 1090921 (0.00091) [2022-07-11 07:18:47,296][26022] Updated weights on worker 0-0, policy_version 1090931 (0.00081) [2022-07-11 07:18:48,884][25689] Fps is (10 sec: 5790.8, 60 sec: 5677.4, 300 sec: 5657.1). Total num frames: 1117121536. Throughput: 0: 5953.6. Samples: 1117128276. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:48,884][25689] Avg episode reward: [(0, '1.689')] [2022-07-11 07:18:49,354][26022] Updated weights on worker 0-0, policy_version 1090941 (0.00048) [2022-07-11 07:18:50,862][26022] Updated weights on worker 0-0, policy_version 1090951 (0.00094) [2022-07-11 07:18:53,027][26022] Updated weights on worker 0-0, policy_version 1090961 (0.00086) [2022-07-11 07:18:53,923][25689] Fps is (10 sec: 5698.3, 60 sec: 5676.5, 300 sec: 5658.3). Total num frames: 1117150208. Throughput: 0: 5080.3. Samples: 1117144938. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:53,924][25689] Avg episode reward: [(0, '0.747')] [2022-07-11 07:18:54,697][26022] Updated weights on worker 0-0, policy_version 1090971 (0.00084) [2022-07-11 07:18:56,320][26022] Updated weights on worker 0-0, policy_version 1090981 (0.00083) [2022-07-11 07:18:58,287][26022] Updated weights on worker 0-0, policy_version 1090991 (0.00081) [2022-07-11 07:18:58,977][25689] Fps is (10 sec: 5681.5, 60 sec: 5675.9, 300 sec: 5655.3). Total num frames: 1117178880. Throughput: 0: 5950.5. Samples: 1117179652. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:18:58,977][25689] Avg episode reward: [(0, '0.854')] [2022-07-11 07:18:59,950][26022] Updated weights on worker 0-0, policy_version 1091001 (0.00095) [2022-07-11 07:19:02,327][26022] Updated weights on worker 0-0, policy_version 1091011 (0.00096) [2022-07-11 07:19:03,983][25689] Fps is (10 sec: 5394.9, 60 sec: 5677.1, 300 sec: 5652.4). Total num frames: 1117204480. Throughput: 0: 5833.1. Samples: 1117211526. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:03,984][25689] Avg episode reward: [(0, '0.735')] [2022-07-11 07:19:04,100][26022] Updated weights on worker 0-0, policy_version 1091021 (0.00084) [2022-07-11 07:19:05,924][26022] Updated weights on worker 0-0, policy_version 1091031 (0.00088) [2022-07-11 07:19:07,660][26022] Updated weights on worker 0-0, policy_version 1091041 (0.00082) [2022-07-11 07:19:09,007][25689] Fps is (10 sec: 5411.1, 60 sec: 5659.6, 300 sec: 5655.8). Total num frames: 1117233152. Throughput: 0: 4988.3. Samples: 1117228654. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:09,007][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 07:19:09,594][26022] Updated weights on worker 0-0, policy_version 1091051 (0.00084) [2022-07-11 07:19:11,111][26022] Updated weights on worker 0-0, policy_version 1091061 (0.00094) [2022-07-11 07:19:13,042][26022] Updated weights on worker 0-0, policy_version 1091071 (0.00082) [2022-07-11 07:19:14,019][25689] Fps is (10 sec: 5714.0, 60 sec: 5675.9, 300 sec: 5654.4). Total num frames: 1117261824. Throughput: 0: 5864.3. Samples: 1117262784. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:14,021][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 07:19:14,598][26022] Updated weights on worker 0-0, policy_version 1091081 (0.00089) [2022-07-11 07:19:16,591][26022] Updated weights on worker 0-0, policy_version 1091091 (0.00090) [2022-07-11 07:19:18,448][26022] Updated weights on worker 0-0, policy_version 1091101 (0.00087) [2022-07-11 07:19:19,178][25689] Fps is (10 sec: 5537.2, 60 sec: 5634.5, 300 sec: 5651.4). Total num frames: 1117289472. Throughput: 0: 5792.1. Samples: 1117296658. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:19,180][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 07:19:20,266][26022] Updated weights on worker 0-0, policy_version 1091111 (0.00087) [2022-07-11 07:19:22,188][26022] Updated weights on worker 0-0, policy_version 1091121 (0.00085) [2022-07-11 07:19:23,907][26022] Updated weights on worker 0-0, policy_version 1091131 (0.00094) [2022-07-11 07:19:24,271][25689] Fps is (10 sec: 5693.6, 60 sec: 5661.9, 300 sec: 5656.7). Total num frames: 1117320192. Throughput: 0: 5031.2. Samples: 1117313596. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:24,271][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 07:19:25,794][26022] Updated weights on worker 0-0, policy_version 1091141 (0.00078) [2022-07-11 07:19:27,504][26022] Updated weights on worker 0-0, policy_version 1091151 (0.00079) [2022-07-11 07:19:29,275][25689] Fps is (10 sec: 5679.2, 60 sec: 5645.6, 300 sec: 5648.1). Total num frames: 1117346816. Throughput: 0: 5864.4. Samples: 1117347516. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:29,276][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 07:19:29,484][26022] Updated weights on worker 0-0, policy_version 1091161 (0.00086) [2022-07-11 07:19:31,067][26022] Updated weights on worker 0-0, policy_version 1091171 (0.00086) [2022-07-11 07:19:33,068][26022] Updated weights on worker 0-0, policy_version 1091181 (0.00090) [2022-07-11 07:19:34,312][25689] Fps is (10 sec: 5710.8, 60 sec: 5646.0, 300 sec: 5663.4). Total num frames: 1117377536. Throughput: 0: 5864.8. Samples: 1117381798. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:34,312][25689] Avg episode reward: [(0, '0.749')] [2022-07-11 07:19:34,477][26022] Updated weights on worker 0-0, policy_version 1091191 (0.00089) [2022-07-11 07:19:36,871][26022] Updated weights on worker 0-0, policy_version 1091201 (0.00093) [2022-07-11 07:19:38,137][26022] Updated weights on worker 0-0, policy_version 1091211 (0.00086) [2022-07-11 07:19:39,372][25689] Fps is (10 sec: 5679.1, 60 sec: 5629.9, 300 sec: 5645.7). Total num frames: 1117404160. Throughput: 0: 5060.5. Samples: 1117398852. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:39,374][25689] Avg episode reward: [(0, '1.012')] [2022-07-11 07:19:40,291][26022] Updated weights on worker 0-0, policy_version 1091221 (0.00095) [2022-07-11 07:19:41,941][26022] Updated weights on worker 0-0, policy_version 1091231 (0.00651) [2022-07-11 07:19:43,727][26022] Updated weights on worker 0-0, policy_version 1091241 (0.00083) [2022-07-11 07:19:44,379][25689] Fps is (10 sec: 5696.0, 60 sec: 5651.4, 300 sec: 5659.6). Total num frames: 1117434880. Throughput: 0: 5951.1. Samples: 1117433266. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:44,381][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 07:19:44,790][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:19:44,804][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001091247_1117436928.pth [2022-07-11 07:19:44,805][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001089258_1115400192.pth [2022-07-11 07:19:45,530][26022] Updated weights on worker 0-0, policy_version 1091251 (0.00088) [2022-07-11 07:19:47,315][26022] Updated weights on worker 0-0, policy_version 1091261 (0.00375) [2022-07-11 07:19:49,092][26022] Updated weights on worker 0-0, policy_version 1091271 (0.00087) [2022-07-11 07:19:49,392][25689] Fps is (10 sec: 5723.2, 60 sec: 5618.5, 300 sec: 5649.3). Total num frames: 1117461504. Throughput: 0: 5966.9. Samples: 1117467552. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:49,394][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 07:19:51,007][26022] Updated weights on worker 0-0, policy_version 1091281 (0.00093) [2022-07-11 07:19:52,634][26022] Updated weights on worker 0-0, policy_version 1091291 (0.00088) [2022-07-11 07:19:54,411][25689] Fps is (10 sec: 5614.1, 60 sec: 5637.4, 300 sec: 5654.7). Total num frames: 1117491200. Throughput: 0: 5126.6. Samples: 1117484840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:54,412][25689] Avg episode reward: [(0, '1.397')] [2022-07-11 07:19:54,417][26022] Updated weights on worker 0-0, policy_version 1091301 (0.00081) [2022-07-11 07:19:56,408][26022] Updated weights on worker 0-0, policy_version 1091311 (0.00085) [2022-07-11 07:19:58,157][26022] Updated weights on worker 0-0, policy_version 1091321 (0.00091) [2022-07-11 07:19:59,491][25689] Fps is (10 sec: 5779.8, 60 sec: 5635.0, 300 sec: 5660.5). Total num frames: 1117519872. Throughput: 0: 5973.0. Samples: 1117519018. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:19:59,491][25689] Avg episode reward: [(0, '1.007')] [2022-07-11 07:19:59,920][26022] Updated weights on worker 0-0, policy_version 1091331 (0.00082) [2022-07-11 07:20:02,207][26022] Updated weights on worker 0-0, policy_version 1091341 (0.00080) [2022-07-11 07:20:03,828][26022] Updated weights on worker 0-0, policy_version 1091351 (0.00090) [2022-07-11 07:20:04,496][25689] Fps is (10 sec: 5584.5, 60 sec: 5668.9, 300 sec: 5657.9). Total num frames: 1117547520. Throughput: 0: 5864.9. Samples: 1117551250. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:20:04,497][25689] Avg episode reward: [(0, '0.897')] [2022-07-11 07:20:05,764][26022] Updated weights on worker 0-0, policy_version 1091361 (0.00089) [2022-07-11 07:20:07,416][26022] Updated weights on worker 0-0, policy_version 1091371 (0.00089) [2022-07-11 07:20:09,291][26022] Updated weights on worker 0-0, policy_version 1091381 (0.00089) [2022-07-11 07:20:09,528][25689] Fps is (10 sec: 5508.8, 60 sec: 5651.1, 300 sec: 5657.7). Total num frames: 1117575168. Throughput: 0: 5001.3. Samples: 1117568260. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:20:09,529][25689] Avg episode reward: [(0, '0.931')] [2022-07-11 07:20:11,016][26022] Updated weights on worker 0-0, policy_version 1091391 (0.00082) [2022-07-11 07:20:12,840][26022] Updated weights on worker 0-0, policy_version 1091401 (0.00094) [2022-07-11 07:20:14,557][25689] Fps is (10 sec: 5598.0, 60 sec: 5649.6, 300 sec: 5655.5). Total num frames: 1117603840. Throughput: 0: 5850.5. Samples: 1117602704. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:20:14,559][25689] Avg episode reward: [(0, '0.776')] [2022-07-11 07:20:14,663][26022] Updated weights on worker 0-0, policy_version 1091411 (0.00085) [2022-07-11 07:20:16,463][26022] Updated weights on worker 0-0, policy_version 1091421 (0.00086) [2022-07-11 07:20:18,254][26022] Updated weights on worker 0-0, policy_version 1091431 (0.00082) [2022-07-11 07:20:19,609][25689] Fps is (10 sec: 5587.2, 60 sec: 5659.6, 300 sec: 5651.2). Total num frames: 1117631488. Throughput: 0: 5861.2. Samples: 1117636934. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:20:19,609][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 07:20:19,947][26022] Updated weights on worker 0-0, policy_version 1091441 (0.00083) [2022-07-11 07:20:21,988][26022] Updated weights on worker 0-0, policy_version 1091451 (0.00093) [2022-07-11 07:20:23,469][26022] Updated weights on worker 0-0, policy_version 1091461 (0.00081) [2022-07-11 07:20:24,621][25689] Fps is (10 sec: 5596.0, 60 sec: 5633.2, 300 sec: 5651.2). Total num frames: 1117660160. Throughput: 0: 5095.3. Samples: 1117653794. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:20:24,622][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 07:20:25,575][26022] Updated weights on worker 0-0, policy_version 1091471 (0.00082) [2022-07-11 07:20:27,248][26022] Updated weights on worker 0-0, policy_version 1091481 (0.00086) [2022-07-11 07:20:29,157][26022] Updated weights on worker 0-0, policy_version 1091491 (0.00090) [2022-07-11 07:20:29,630][25689] Fps is (10 sec: 5824.4, 60 sec: 5683.7, 300 sec: 5654.6). Total num frames: 1117689856. Throughput: 0: 5942.4. Samples: 1117687712. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 07:20:29,631][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 07:20:30,915][26022] Updated weights on worker 0-0, policy_version 1091501 (0.00080) [2022-07-11 07:20:32,487][26022] Updated weights on worker 0-0, policy_version 1091511 (0.00084) [2022-07-11 07:20:34,547][26022] Updated weights on worker 0-0, policy_version 1091521 (0.00089) [2022-07-11 07:20:34,640][25689] Fps is (10 sec: 5826.2, 60 sec: 5652.3, 300 sec: 5655.3). Total num frames: 1117718528. Throughput: 0: 5951.2. Samples: 1117722220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:20:34,641][25689] Avg episode reward: [(0, '-1.371')] [2022-07-11 07:20:36,126][26022] Updated weights on worker 0-0, policy_version 1091531 (0.00080) [2022-07-11 07:20:38,125][26022] Updated weights on worker 0-0, policy_version 1091541 (0.00088) [2022-07-11 07:20:39,685][25689] Fps is (10 sec: 5601.3, 60 sec: 5670.7, 300 sec: 5654.6). Total num frames: 1117746176. Throughput: 0: 5947.3. Samples: 1117756334. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:20:39,685][25689] Avg episode reward: [(0, '-0.942')] [2022-07-11 07:20:39,851][26022] Updated weights on worker 0-0, policy_version 1091551 (0.00083) [2022-07-11 07:20:41,716][26022] Updated weights on worker 0-0, policy_version 1091561 (0.00085) [2022-07-11 07:20:43,691][26022] Updated weights on worker 0-0, policy_version 1091571 (0.00090) [2022-07-11 07:20:44,695][25689] Fps is (10 sec: 5601.0, 60 sec: 5636.5, 300 sec: 5651.5). Total num frames: 1117774848. Throughput: 0: 5956.8. Samples: 1117773368. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:20:44,697][25689] Avg episode reward: [(0, '-0.647')] [2022-07-11 07:20:45,250][26022] Updated weights on worker 0-0, policy_version 1091581 (0.00084) [2022-07-11 07:20:47,051][26022] Updated weights on worker 0-0, policy_version 1091591 (0.00077) [2022-07-11 07:20:48,880][26022] Updated weights on worker 0-0, policy_version 1091601 (0.00085) [2022-07-11 07:20:49,698][25689] Fps is (10 sec: 5726.7, 60 sec: 5671.3, 300 sec: 5655.5). Total num frames: 1117803520. Throughput: 0: 5969.8. Samples: 1117807516. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:20:49,699][25689] Avg episode reward: [(0, '-1.427')] [2022-07-11 07:20:50,829][26022] Updated weights on worker 0-0, policy_version 1091611 (0.00086) [2022-07-11 07:20:52,501][26022] Updated weights on worker 0-0, policy_version 1091621 (0.00086) [2022-07-11 07:20:54,438][26022] Updated weights on worker 0-0, policy_version 1091631 (0.00088) [2022-07-11 07:20:54,710][25689] Fps is (10 sec: 5623.4, 60 sec: 5638.0, 300 sec: 5647.3). Total num frames: 1117831168. Throughput: 0: 5091.3. Samples: 1117824406. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:20:54,712][25689] Avg episode reward: [(0, '-1.349')] [2022-07-11 07:20:56,129][26022] Updated weights on worker 0-0, policy_version 1091641 (0.00080) [2022-07-11 07:20:57,873][26022] Updated weights on worker 0-0, policy_version 1091651 (0.00083) [2022-07-11 07:20:59,769][25689] Fps is (10 sec: 5592.3, 60 sec: 5639.9, 300 sec: 5653.7). Total num frames: 1117859840. Throughput: 0: 5089.6. Samples: 1117858556. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:20:59,773][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 07:20:59,825][26022] Updated weights on worker 0-0, policy_version 1091661 (0.00084) [2022-07-11 07:21:01,679][26022] Updated weights on worker 0-0, policy_version 1091671 (0.00092) [2022-07-11 07:21:03,807][26022] Updated weights on worker 0-0, policy_version 1091681 (0.00084) [2022-07-11 07:21:04,788][25689] Fps is (10 sec: 5588.6, 60 sec: 5638.7, 300 sec: 5653.8). Total num frames: 1117887488. Throughput: 0: 5822.1. Samples: 1117890352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:04,789][25689] Avg episode reward: [(0, '-0.514')] [2022-07-11 07:21:05,633][26022] Updated weights on worker 0-0, policy_version 1091691 (0.00091) [2022-07-11 07:21:07,334][26022] Updated weights on worker 0-0, policy_version 1091701 (0.00085) [2022-07-11 07:21:09,178][26022] Updated weights on worker 0-0, policy_version 1091711 (0.00085) [2022-07-11 07:21:09,803][25689] Fps is (10 sec: 5511.2, 60 sec: 5640.4, 300 sec: 5646.9). Total num frames: 1117915136. Throughput: 0: 5827.4. Samples: 1117924672. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:09,803][25689] Avg episode reward: [(0, '-1.590')] [2022-07-11 07:21:10,959][26022] Updated weights on worker 0-0, policy_version 1091721 (0.00092) [2022-07-11 07:21:12,725][26022] Updated weights on worker 0-0, policy_version 1091731 (0.00094) [2022-07-11 07:21:14,503][26022] Updated weights on worker 0-0, policy_version 1091741 (0.00088) [2022-07-11 07:21:14,805][25689] Fps is (10 sec: 5622.2, 60 sec: 5642.8, 300 sec: 5652.2). Total num frames: 1117943808. Throughput: 0: 5834.0. Samples: 1117941642. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:14,806][25689] Avg episode reward: [(0, '-1.643')] [2022-07-11 07:21:16,459][26022] Updated weights on worker 0-0, policy_version 1091751 (0.00087) [2022-07-11 07:21:18,309][26022] Updated weights on worker 0-0, policy_version 1091761 (0.00087) [2022-07-11 07:21:19,916][25689] Fps is (10 sec: 5771.4, 60 sec: 5671.2, 300 sec: 5654.0). Total num frames: 1117973504. Throughput: 0: 5817.8. Samples: 1117975766. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:19,917][25689] Avg episode reward: [(0, '-1.371')] [2022-07-11 07:21:19,922][26022] Updated weights on worker 0-0, policy_version 1091771 (0.00088) [2022-07-11 07:21:21,913][26022] Updated weights on worker 0-0, policy_version 1091781 (0.00086) [2022-07-11 07:21:23,595][26022] Updated weights on worker 0-0, policy_version 1091791 (0.00085) [2022-07-11 07:21:24,965][25689] Fps is (10 sec: 5543.6, 60 sec: 5633.9, 300 sec: 5646.6). Total num frames: 1118000128. Throughput: 0: 5916.9. Samples: 1118009738. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:24,965][25689] Avg episode reward: [(0, '-0.724')] [2022-07-11 07:21:25,340][26022] Updated weights on worker 0-0, policy_version 1091801 (0.00083) [2022-07-11 07:21:27,580][26022] Updated weights on worker 0-0, policy_version 1091811 (0.00087) [2022-07-11 07:21:29,021][26022] Updated weights on worker 0-0, policy_version 1091821 (0.00090) [2022-07-11 07:21:29,984][25689] Fps is (10 sec: 5492.5, 60 sec: 5616.0, 300 sec: 5642.9). Total num frames: 1118028800. Throughput: 0: 5049.7. Samples: 1118026580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:29,984][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 07:21:31,025][26022] Updated weights on worker 0-0, policy_version 1091831 (0.00088) [2022-07-11 07:21:32,649][26022] Updated weights on worker 0-0, policy_version 1091841 (0.00087) [2022-07-11 07:21:34,709][26022] Updated weights on worker 0-0, policy_version 1091851 (0.00080) [2022-07-11 07:21:34,990][25689] Fps is (10 sec: 5720.3, 60 sec: 5616.3, 300 sec: 5645.3). Total num frames: 1118057472. Throughput: 0: 5888.8. Samples: 1118060502. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:34,990][25689] Avg episode reward: [(0, '-1.860')] [2022-07-11 07:21:36,463][26022] Updated weights on worker 0-0, policy_version 1091861 (0.00621) [2022-07-11 07:21:38,390][26022] Updated weights on worker 0-0, policy_version 1091871 (0.00095) [2022-07-11 07:21:40,035][25689] Fps is (10 sec: 5704.9, 60 sec: 5633.2, 300 sec: 5648.3). Total num frames: 1118086144. Throughput: 0: 5906.5. Samples: 1118094600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:40,037][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 07:21:40,038][26022] Updated weights on worker 0-0, policy_version 1091881 (0.00126) [2022-07-11 07:21:41,754][26022] Updated weights on worker 0-0, policy_version 1091891 (0.00084) [2022-07-11 07:21:43,571][26022] Updated weights on worker 0-0, policy_version 1091901 (0.00089) [2022-07-11 07:21:45,043][25689] Fps is (10 sec: 5704.0, 60 sec: 5633.5, 300 sec: 5646.3). Total num frames: 1118114816. Throughput: 0: 5073.2. Samples: 1118111598. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:45,043][25689] Avg episode reward: [(0, '-0.022')] [2022-07-11 07:21:45,208][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:21:45,219][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001091910_1118115840.pth [2022-07-11 07:21:45,219][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001089921_1116079104.pth [2022-07-11 07:21:45,513][26022] Updated weights on worker 0-0, policy_version 1091911 (0.00093) [2022-07-11 07:21:47,147][26022] Updated weights on worker 0-0, policy_version 1091921 (0.00095) [2022-07-11 07:21:49,281][26022] Updated weights on worker 0-0, policy_version 1091931 (0.00093) [2022-07-11 07:21:50,071][25689] Fps is (10 sec: 5612.1, 60 sec: 5614.2, 300 sec: 5639.4). Total num frames: 1118142464. Throughput: 0: 5912.5. Samples: 1118145344. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:50,072][25689] Avg episode reward: [(0, '-0.105')] [2022-07-11 07:21:50,702][26022] Updated weights on worker 0-0, policy_version 1091941 (0.00083) [2022-07-11 07:21:52,918][26022] Updated weights on worker 0-0, policy_version 1091951 (0.00091) [2022-07-11 07:21:54,395][26022] Updated weights on worker 0-0, policy_version 1091961 (0.00086) [2022-07-11 07:21:55,079][25689] Fps is (10 sec: 5611.7, 60 sec: 5631.5, 300 sec: 5647.1). Total num frames: 1118171136. Throughput: 0: 5904.6. Samples: 1118179122. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:21:55,080][25689] Avg episode reward: [(0, '0.070')] [2022-07-11 07:21:56,429][26022] Updated weights on worker 0-0, policy_version 1091971 (0.00084) [2022-07-11 07:21:58,028][26022] Updated weights on worker 0-0, policy_version 1091981 (0.00091) [2022-07-11 07:21:59,881][26022] Updated weights on worker 0-0, policy_version 1091991 (0.00079) [2022-07-11 07:22:00,186][25689] Fps is (10 sec: 5567.6, 60 sec: 5610.1, 300 sec: 5648.7). Total num frames: 1118198784. Throughput: 0: 5031.3. Samples: 1118195986. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:00,195][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 07:22:01,904][26022] Updated weights on worker 0-0, policy_version 1092001 (0.00111) [2022-07-11 07:22:04,115][26022] Updated weights on worker 0-0, policy_version 1092011 (0.00083) [2022-07-11 07:22:05,272][25689] Fps is (10 sec: 5425.2, 60 sec: 5603.9, 300 sec: 5637.1). Total num frames: 1118226432. Throughput: 0: 5735.5. Samples: 1118227620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:05,273][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 07:22:05,955][26022] Updated weights on worker 0-0, policy_version 1092021 (0.00091) [2022-07-11 07:22:07,661][26022] Updated weights on worker 0-0, policy_version 1092031 (0.00083) [2022-07-11 07:22:09,447][26022] Updated weights on worker 0-0, policy_version 1092041 (0.00083) [2022-07-11 07:22:10,276][25689] Fps is (10 sec: 5581.8, 60 sec: 5621.8, 300 sec: 5644.4). Total num frames: 1118255104. Throughput: 0: 5764.9. Samples: 1118261826. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:10,277][25689] Avg episode reward: [(0, '1.693')] [2022-07-11 07:22:11,109][26022] Updated weights on worker 0-0, policy_version 1092051 (0.00090) [2022-07-11 07:22:13,039][26022] Updated weights on worker 0-0, policy_version 1092061 (0.00084) [2022-07-11 07:22:14,823][26022] Updated weights on worker 0-0, policy_version 1092071 (0.00084) [2022-07-11 07:22:15,358][25689] Fps is (10 sec: 5583.7, 60 sec: 5597.5, 300 sec: 5641.7). Total num frames: 1118282752. Throughput: 0: 4907.2. Samples: 1118278636. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:15,360][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 07:22:16,711][26022] Updated weights on worker 0-0, policy_version 1092081 (0.00091) [2022-07-11 07:22:18,471][26022] Updated weights on worker 0-0, policy_version 1092091 (0.00084) [2022-07-11 07:22:20,343][26022] Updated weights on worker 0-0, policy_version 1092101 (0.00082) [2022-07-11 07:22:20,416][25689] Fps is (10 sec: 5554.4, 60 sec: 5585.5, 300 sec: 5641.2). Total num frames: 1118311424. Throughput: 0: 5773.8. Samples: 1118312788. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:20,416][25689] Avg episode reward: [(0, '1.679')] [2022-07-11 07:22:22,063][26022] Updated weights on worker 0-0, policy_version 1092111 (0.00095) [2022-07-11 07:22:23,881][26022] Updated weights on worker 0-0, policy_version 1092121 (0.00082) [2022-07-11 07:22:25,415][26022] Updated weights on worker 0-0, policy_version 1092131 (0.00087) [2022-07-11 07:22:25,439][25689] Fps is (10 sec: 5891.7, 60 sec: 5655.6, 300 sec: 5644.7). Total num frames: 1118342144. Throughput: 0: 5934.9. Samples: 1118347310. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:25,439][25689] Avg episode reward: [(0, '2.085')] [2022-07-11 07:22:27,494][26022] Updated weights on worker 0-0, policy_version 1092141 (0.00072) [2022-07-11 07:22:29,174][26022] Updated weights on worker 0-0, policy_version 1092151 (0.00087) [2022-07-11 07:22:30,444][25689] Fps is (10 sec: 5718.4, 60 sec: 5623.0, 300 sec: 5641.6). Total num frames: 1118368768. Throughput: 0: 5080.6. Samples: 1118364290. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:30,444][25689] Avg episode reward: [(0, '2.049')] [2022-07-11 07:22:31,087][26022] Updated weights on worker 0-0, policy_version 1092161 (0.00085) [2022-07-11 07:22:32,834][26022] Updated weights on worker 0-0, policy_version 1092171 (0.00083) [2022-07-11 07:22:34,485][26022] Updated weights on worker 0-0, policy_version 1092181 (0.00091) [2022-07-11 07:22:35,453][25689] Fps is (10 sec: 5521.8, 60 sec: 5622.7, 300 sec: 5643.4). Total num frames: 1118397440. Throughput: 0: 5966.2. Samples: 1118398524. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:35,453][25689] Avg episode reward: [(0, '0.769')] [2022-07-11 07:22:36,585][26022] Updated weights on worker 0-0, policy_version 1092191 (0.00089) [2022-07-11 07:22:38,094][26022] Updated weights on worker 0-0, policy_version 1092201 (0.00087) [2022-07-11 07:22:40,126][26022] Updated weights on worker 0-0, policy_version 1092211 (0.00086) [2022-07-11 07:22:40,512][25689] Fps is (10 sec: 5797.2, 60 sec: 5638.4, 300 sec: 5639.4). Total num frames: 1118427136. Throughput: 0: 5964.7. Samples: 1118432656. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:40,513][25689] Avg episode reward: [(0, '0.709')] [2022-07-11 07:22:41,741][26022] Updated weights on worker 0-0, policy_version 1092221 (0.00086) [2022-07-11 07:22:43,588][26022] Updated weights on worker 0-0, policy_version 1092231 (0.00088) [2022-07-11 07:22:45,577][25689] Fps is (10 sec: 5664.3, 60 sec: 5616.2, 300 sec: 5641.9). Total num frames: 1118454784. Throughput: 0: 5095.2. Samples: 1118449916. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:45,577][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 07:22:45,579][26022] Updated weights on worker 0-0, policy_version 1092241 (0.00095) [2022-07-11 07:22:47,163][26022] Updated weights on worker 0-0, policy_version 1092251 (0.00093) [2022-07-11 07:22:48,970][26022] Updated weights on worker 0-0, policy_version 1092261 (0.00383) [2022-07-11 07:22:50,609][25689] Fps is (10 sec: 5679.2, 60 sec: 5649.5, 300 sec: 5645.3). Total num frames: 1118484480. Throughput: 0: 5937.6. Samples: 1118484024. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:50,610][25689] Avg episode reward: [(0, '-0.531')] [2022-07-11 07:22:50,620][26022] Updated weights on worker 0-0, policy_version 1092271 (0.00089) [2022-07-11 07:22:52,603][26022] Updated weights on worker 0-0, policy_version 1092281 (0.00094) [2022-07-11 07:22:54,439][26022] Updated weights on worker 0-0, policy_version 1092291 (0.00079) [2022-07-11 07:22:55,627][25689] Fps is (10 sec: 5807.3, 60 sec: 5648.7, 300 sec: 5645.8). Total num frames: 1118513152. Throughput: 0: 5945.2. Samples: 1118518466. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:22:55,628][25689] Avg episode reward: [(0, '-0.900')] [2022-07-11 07:22:56,166][26022] Updated weights on worker 0-0, policy_version 1092301 (0.00082) [2022-07-11 07:22:58,099][26022] Updated weights on worker 0-0, policy_version 1092311 (0.00082) [2022-07-11 07:22:59,759][26022] Updated weights on worker 0-0, policy_version 1092321 (0.00085) [2022-07-11 07:23:00,722][25689] Fps is (10 sec: 5670.8, 60 sec: 5666.8, 300 sec: 5654.8). Total num frames: 1118541824. Throughput: 0: 5085.7. Samples: 1118535434. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:00,723][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 07:23:01,854][26022] Updated weights on worker 0-0, policy_version 1092331 (0.00097) [2022-07-11 07:23:03,776][26022] Updated weights on worker 0-0, policy_version 1092341 (0.00080) [2022-07-11 07:23:05,519][26022] Updated weights on worker 0-0, policy_version 1092351 (0.00095) [2022-07-11 07:23:05,748][25689] Fps is (10 sec: 5463.6, 60 sec: 5655.3, 300 sec: 5644.2). Total num frames: 1118568448. Throughput: 0: 5836.6. Samples: 1118567650. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:05,749][25689] Avg episode reward: [(0, '-0.539')] [2022-07-11 07:23:07,291][26022] Updated weights on worker 0-0, policy_version 1092361 (0.00088) [2022-07-11 07:23:09,059][26022] Updated weights on worker 0-0, policy_version 1092371 (0.00086) [2022-07-11 07:23:10,780][25689] Fps is (10 sec: 5395.5, 60 sec: 5635.9, 300 sec: 5643.7). Total num frames: 1118596096. Throughput: 0: 5846.7. Samples: 1118601958. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:10,781][25689] Avg episode reward: [(0, '-0.504')] [2022-07-11 07:23:11,056][26022] Updated weights on worker 0-0, policy_version 1092381 (0.00087) [2022-07-11 07:23:12,651][26022] Updated weights on worker 0-0, policy_version 1092391 (0.00099) [2022-07-11 07:23:14,475][26022] Updated weights on worker 0-0, policy_version 1092401 (0.00084) [2022-07-11 07:23:15,784][25689] Fps is (10 sec: 5714.0, 60 sec: 5677.0, 300 sec: 5645.1). Total num frames: 1118625792. Throughput: 0: 5002.5. Samples: 1118619300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:15,786][25689] Avg episode reward: [(0, '-0.410')] [2022-07-11 07:23:16,329][26022] Updated weights on worker 0-0, policy_version 1092411 (0.00088) [2022-07-11 07:23:18,142][26022] Updated weights on worker 0-0, policy_version 1092421 (0.00089) [2022-07-11 07:23:20,036][26022] Updated weights on worker 0-0, policy_version 1092431 (0.00096) [2022-07-11 07:23:20,870][25689] Fps is (10 sec: 5785.1, 60 sec: 5674.4, 300 sec: 5644.0). Total num frames: 1118654464. Throughput: 0: 5835.0. Samples: 1118652998. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:20,871][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 07:23:21,837][26022] Updated weights on worker 0-0, policy_version 1092441 (0.00088) [2022-07-11 07:23:23,648][26022] Updated weights on worker 0-0, policy_version 1092451 (0.00087) [2022-07-11 07:23:25,510][26022] Updated weights on worker 0-0, policy_version 1092461 (0.00092) [2022-07-11 07:23:25,917][25689] Fps is (10 sec: 5557.8, 60 sec: 5621.3, 300 sec: 5643.3). Total num frames: 1118682112. Throughput: 0: 5916.9. Samples: 1118686988. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:25,919][25689] Avg episode reward: [(0, '0.788')] [2022-07-11 07:23:27,174][26022] Updated weights on worker 0-0, policy_version 1092471 (0.00083) [2022-07-11 07:23:29,047][26022] Updated weights on worker 0-0, policy_version 1092481 (0.00049) [2022-07-11 07:23:30,822][26022] Updated weights on worker 0-0, policy_version 1092491 (0.00084) [2022-07-11 07:23:30,955][25689] Fps is (10 sec: 5584.6, 60 sec: 5652.2, 300 sec: 5636.4). Total num frames: 1118710784. Throughput: 0: 5054.9. Samples: 1118703938. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:30,955][25689] Avg episode reward: [(0, '0.887')] [2022-07-11 07:23:32,672][26022] Updated weights on worker 0-0, policy_version 1092501 (0.00089) [2022-07-11 07:23:34,413][26022] Updated weights on worker 0-0, policy_version 1092511 (0.00086) [2022-07-11 07:23:35,990][25689] Fps is (10 sec: 5692.8, 60 sec: 5649.6, 300 sec: 5640.5). Total num frames: 1118739456. Throughput: 0: 5866.2. Samples: 1118737836. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:35,992][25689] Avg episode reward: [(0, '0.518')] [2022-07-11 07:23:36,590][26022] Updated weights on worker 0-0, policy_version 1092521 (0.00082) [2022-07-11 07:23:37,902][26022] Updated weights on worker 0-0, policy_version 1092531 (0.00085) [2022-07-11 07:23:40,167][26022] Updated weights on worker 0-0, policy_version 1092541 (0.00431) [2022-07-11 07:23:41,102][25689] Fps is (10 sec: 5651.0, 60 sec: 5627.9, 300 sec: 5636.0). Total num frames: 1118768128. Throughput: 0: 5874.4. Samples: 1118771852. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:41,103][25689] Avg episode reward: [(0, '0.299')] [2022-07-11 07:23:41,514][26022] Updated weights on worker 0-0, policy_version 1092551 (0.00082) [2022-07-11 07:23:43,657][26022] Updated weights on worker 0-0, policy_version 1092561 (0.00086) [2022-07-11 07:23:45,179][26022] Updated weights on worker 0-0, policy_version 1092571 (0.00094) [2022-07-11 07:23:45,321][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:23:45,330][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001092572_1118793728.pth [2022-07-11 07:23:45,339][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001090583_1116756992.pth [2022-07-11 07:23:46,152][25689] Fps is (10 sec: 5643.1, 60 sec: 5646.1, 300 sec: 5635.5). Total num frames: 1118796800. Throughput: 0: 5875.0. Samples: 1118805868. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:46,153][25689] Avg episode reward: [(0, '0.625')] [2022-07-11 07:23:47,051][26022] Updated weights on worker 0-0, policy_version 1092581 (0.00085) [2022-07-11 07:23:48,852][26022] Updated weights on worker 0-0, policy_version 1092591 (0.00084) [2022-07-11 07:23:50,775][26022] Updated weights on worker 0-0, policy_version 1092601 (0.00081) [2022-07-11 07:23:51,159][25689] Fps is (10 sec: 5701.7, 60 sec: 5631.6, 300 sec: 5636.1). Total num frames: 1118825472. Throughput: 0: 5893.3. Samples: 1118823010. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:51,160][25689] Avg episode reward: [(0, '0.468')] [2022-07-11 07:23:52,627][26022] Updated weights on worker 0-0, policy_version 1092611 (0.00087) [2022-07-11 07:23:54,465][26022] Updated weights on worker 0-0, policy_version 1092621 (0.00088) [2022-07-11 07:23:56,170][25689] Fps is (10 sec: 5622.3, 60 sec: 5615.4, 300 sec: 5633.5). Total num frames: 1118853120. Throughput: 0: 5873.7. Samples: 1118856362. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:23:56,170][25689] Avg episode reward: [(0, '0.859')] [2022-07-11 07:23:56,225][26022] Updated weights on worker 0-0, policy_version 1092631 (0.00092) [2022-07-11 07:23:57,999][26022] Updated weights on worker 0-0, policy_version 1092641 (0.00097) [2022-07-11 07:23:59,949][26022] Updated weights on worker 0-0, policy_version 1092651 (0.00090) [2022-07-11 07:24:01,276][25689] Fps is (10 sec: 5668.4, 60 sec: 5631.2, 300 sec: 5645.4). Total num frames: 1118882816. Throughput: 0: 5860.2. Samples: 1118890074. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:01,276][25689] Avg episode reward: [(0, '1.232')] [2022-07-11 07:24:01,963][26022] Updated weights on worker 0-0, policy_version 1092661 (0.00099) [2022-07-11 07:24:04,105][26022] Updated weights on worker 0-0, policy_version 1092671 (0.00094) [2022-07-11 07:24:05,820][26022] Updated weights on worker 0-0, policy_version 1092681 (0.00089) [2022-07-11 07:24:06,307][25689] Fps is (10 sec: 5454.6, 60 sec: 5613.9, 300 sec: 5634.9). Total num frames: 1118908416. Throughput: 0: 4915.9. Samples: 1118904946. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:06,307][25689] Avg episode reward: [(0, '1.737')] [2022-07-11 07:24:07,627][26022] Updated weights on worker 0-0, policy_version 1092691 (0.00093) [2022-07-11 07:24:09,378][26022] Updated weights on worker 0-0, policy_version 1092701 (0.00090) [2022-07-11 07:24:11,279][26022] Updated weights on worker 0-0, policy_version 1092711 (0.00088) [2022-07-11 07:24:11,357][25689] Fps is (10 sec: 5383.3, 60 sec: 5629.1, 300 sec: 5634.2). Total num frames: 1118937088. Throughput: 0: 5736.4. Samples: 1118938874. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:11,358][25689] Avg episode reward: [(0, '1.207')] [2022-07-11 07:24:13,264][26022] Updated weights on worker 0-0, policy_version 1092721 (0.00090) [2022-07-11 07:24:14,846][26022] Updated weights on worker 0-0, policy_version 1092731 (0.00084) [2022-07-11 07:24:16,363][25689] Fps is (10 sec: 5498.9, 60 sec: 5578.2, 300 sec: 5633.7). Total num frames: 1118963712. Throughput: 0: 5745.1. Samples: 1118972374. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:16,363][25689] Avg episode reward: [(0, '0.842')] [2022-07-11 07:24:16,764][26022] Updated weights on worker 0-0, policy_version 1092741 (0.00089) [2022-07-11 07:24:18,732][26022] Updated weights on worker 0-0, policy_version 1092751 (0.00092) [2022-07-11 07:24:20,418][26022] Updated weights on worker 0-0, policy_version 1092761 (0.00089) [2022-07-11 07:24:21,447][25689] Fps is (10 sec: 5480.3, 60 sec: 5578.3, 300 sec: 5626.9). Total num frames: 1118992384. Throughput: 0: 4899.8. Samples: 1118988910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:21,448][25689] Avg episode reward: [(0, '-1.446')] [2022-07-11 07:24:22,286][26022] Updated weights on worker 0-0, policy_version 1092771 (0.00081) [2022-07-11 07:24:24,017][26022] Updated weights on worker 0-0, policy_version 1092781 (0.00093) [2022-07-11 07:24:25,970][26022] Updated weights on worker 0-0, policy_version 1092791 (0.00422) [2022-07-11 07:24:26,449][25689] Fps is (10 sec: 5685.3, 60 sec: 5599.4, 300 sec: 5633.9). Total num frames: 1119021056. Throughput: 0: 5834.4. Samples: 1119022464. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:26,450][25689] Avg episode reward: [(0, '-1.515')] [2022-07-11 07:24:27,815][26022] Updated weights on worker 0-0, policy_version 1092801 (0.00088) [2022-07-11 07:24:29,545][26022] Updated weights on worker 0-0, policy_version 1092811 (0.00085) [2022-07-11 07:24:31,471][25689] Fps is (10 sec: 5516.4, 60 sec: 5567.0, 300 sec: 5620.4). Total num frames: 1119047680. Throughput: 0: 5823.9. Samples: 1119056016. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 07:24:31,472][25689] Avg episode reward: [(0, '-1.868')] [2022-07-11 07:24:31,589][26022] Updated weights on worker 0-0, policy_version 1092821 (0.00090) [2022-07-11 07:24:33,226][26022] Updated weights on worker 0-0, policy_version 1092831 (0.00080) [2022-07-11 07:24:35,262][26022] Updated weights on worker 0-0, policy_version 1092841 (0.00055) [2022-07-11 07:24:36,477][25689] Fps is (10 sec: 5514.2, 60 sec: 5569.8, 300 sec: 5628.3). Total num frames: 1119076352. Throughput: 0: 4984.8. Samples: 1119072640. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:24:36,479][25689] Avg episode reward: [(0, '-1.688')] [2022-07-11 07:24:36,847][26022] Updated weights on worker 0-0, policy_version 1092851 (0.00088) [2022-07-11 07:24:38,882][26022] Updated weights on worker 0-0, policy_version 1092861 (0.00087) [2022-07-11 07:24:40,835][26022] Updated weights on worker 0-0, policy_version 1092871 (0.00085) [2022-07-11 07:24:41,562][25689] Fps is (10 sec: 5581.6, 60 sec: 5555.3, 300 sec: 5616.5). Total num frames: 1119104000. Throughput: 0: 5817.0. Samples: 1119105914. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:24:41,563][25689] Avg episode reward: [(0, '-1.581')] [2022-07-11 07:24:42,520][26022] Updated weights on worker 0-0, policy_version 1092881 (0.00098) [2022-07-11 07:24:44,393][26022] Updated weights on worker 0-0, policy_version 1092891 (0.00086) [2022-07-11 07:24:46,177][26022] Updated weights on worker 0-0, policy_version 1092901 (0.00079) [2022-07-11 07:24:46,585][25689] Fps is (10 sec: 5470.6, 60 sec: 5540.8, 300 sec: 5619.7). Total num frames: 1119131648. Throughput: 0: 5810.3. Samples: 1119139460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:24:46,585][25689] Avg episode reward: [(0, '-1.519')] [2022-07-11 07:24:48,085][26022] Updated weights on worker 0-0, policy_version 1092911 (0.00086) [2022-07-11 07:24:49,907][26022] Updated weights on worker 0-0, policy_version 1092921 (0.00095) [2022-07-11 07:24:51,618][25689] Fps is (10 sec: 5702.5, 60 sec: 5555.4, 300 sec: 5619.5). Total num frames: 1119161344. Throughput: 0: 4987.2. Samples: 1119156490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:24:51,618][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 07:24:51,619][26022] Updated weights on worker 0-0, policy_version 1092931 (0.00091) [2022-07-11 07:24:53,584][26022] Updated weights on worker 0-0, policy_version 1092941 (0.00106) [2022-07-11 07:24:55,351][26022] Updated weights on worker 0-0, policy_version 1092951 (0.00089) [2022-07-11 07:24:56,635][25689] Fps is (10 sec: 5604.2, 60 sec: 5537.9, 300 sec: 5613.8). Total num frames: 1119187968. Throughput: 0: 5820.0. Samples: 1119189956. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:24:56,635][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 07:24:57,228][26022] Updated weights on worker 0-0, policy_version 1092961 (0.00086) [2022-07-11 07:24:59,020][26022] Updated weights on worker 0-0, policy_version 1092971 (0.00079) [2022-07-11 07:25:00,763][26022] Updated weights on worker 0-0, policy_version 1092981 (0.00087) [2022-07-11 07:25:01,755][25689] Fps is (10 sec: 5353.8, 60 sec: 5502.7, 300 sec: 5611.6). Total num frames: 1119215616. Throughput: 0: 5834.3. Samples: 1119223726. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:01,755][25689] Avg episode reward: [(0, '0.187')] [2022-07-11 07:25:03,123][26022] Updated weights on worker 0-0, policy_version 1092991 (0.00082) [2022-07-11 07:25:04,735][26022] Updated weights on worker 0-0, policy_version 1093001 (0.00095) [2022-07-11 07:25:06,682][26022] Updated weights on worker 0-0, policy_version 1093011 (0.00089) [2022-07-11 07:25:06,839][25689] Fps is (10 sec: 5418.6, 60 sec: 5531.7, 300 sec: 5610.6). Total num frames: 1119243264. Throughput: 0: 4909.7. Samples: 1119238904. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:06,840][25689] Avg episode reward: [(0, '1.193')] [2022-07-11 07:25:08,440][26022] Updated weights on worker 0-0, policy_version 1093021 (0.00098) [2022-07-11 07:25:10,331][26022] Updated weights on worker 0-0, policy_version 1093031 (0.00078) [2022-07-11 07:25:11,844][25689] Fps is (10 sec: 5581.9, 60 sec: 5535.9, 300 sec: 5611.1). Total num frames: 1119271936. Throughput: 0: 5754.0. Samples: 1119272874. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:11,845][25689] Avg episode reward: [(0, '0.205')] [2022-07-11 07:25:11,977][26022] Updated weights on worker 0-0, policy_version 1093041 (0.00087) [2022-07-11 07:25:14,015][26022] Updated weights on worker 0-0, policy_version 1093051 (0.00088) [2022-07-11 07:25:15,627][26022] Updated weights on worker 0-0, policy_version 1093061 (0.00084) [2022-07-11 07:25:16,945][25689] Fps is (10 sec: 5572.9, 60 sec: 5544.0, 300 sec: 5610.1). Total num frames: 1119299584. Throughput: 0: 5745.8. Samples: 1119306658. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:16,946][25689] Avg episode reward: [(0, '0.747')] [2022-07-11 07:25:17,535][26022] Updated weights on worker 0-0, policy_version 1093071 (0.00084) [2022-07-11 07:25:19,388][26022] Updated weights on worker 0-0, policy_version 1093081 (0.00092) [2022-07-11 07:25:21,258][26022] Updated weights on worker 0-0, policy_version 1093091 (0.00091) [2022-07-11 07:25:22,027][25689] Fps is (10 sec: 5531.2, 60 sec: 5544.3, 300 sec: 5608.8). Total num frames: 1119328256. Throughput: 0: 4922.2. Samples: 1119323508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:22,027][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 07:25:22,980][26022] Updated weights on worker 0-0, policy_version 1093101 (0.00102) [2022-07-11 07:25:24,943][26022] Updated weights on worker 0-0, policy_version 1093111 (0.00097) [2022-07-11 07:25:26,610][26022] Updated weights on worker 0-0, policy_version 1093121 (0.00087) [2022-07-11 07:25:27,063][25689] Fps is (10 sec: 5769.0, 60 sec: 5558.1, 300 sec: 5608.3). Total num frames: 1119357952. Throughput: 0: 5836.7. Samples: 1119356942. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:27,065][25689] Avg episode reward: [(0, '0.966')] [2022-07-11 07:25:28,741][26022] Updated weights on worker 0-0, policy_version 1093131 (0.00089) [2022-07-11 07:25:30,108][26022] Updated weights on worker 0-0, policy_version 1093141 (0.00092) [2022-07-11 07:25:32,081][25689] Fps is (10 sec: 5601.7, 60 sec: 5558.5, 300 sec: 5601.3). Total num frames: 1119384576. Throughput: 0: 5830.4. Samples: 1119390860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:32,081][25689] Avg episode reward: [(0, '1.078')] [2022-07-11 07:25:32,321][26022] Updated weights on worker 0-0, policy_version 1093151 (0.01118) [2022-07-11 07:25:33,839][26022] Updated weights on worker 0-0, policy_version 1093161 (0.00083) [2022-07-11 07:25:35,807][26022] Updated weights on worker 0-0, policy_version 1093171 (0.00083) [2022-07-11 07:25:37,104][25689] Fps is (10 sec: 5710.7, 60 sec: 5590.6, 300 sec: 5612.0). Total num frames: 1119415296. Throughput: 0: 5017.8. Samples: 1119407812. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:37,105][25689] Avg episode reward: [(0, '-0.105')] [2022-07-11 07:25:37,634][26022] Updated weights on worker 0-0, policy_version 1093181 (0.00088) [2022-07-11 07:25:39,485][26022] Updated weights on worker 0-0, policy_version 1093191 (0.00086) [2022-07-11 07:25:41,372][26022] Updated weights on worker 0-0, policy_version 1093201 (0.00088) [2022-07-11 07:25:42,163][25689] Fps is (10 sec: 5687.5, 60 sec: 5576.1, 300 sec: 5604.2). Total num frames: 1119441920. Throughput: 0: 5850.8. Samples: 1119441322. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:42,164][25689] Avg episode reward: [(0, '0.669')] [2022-07-11 07:25:43,087][26022] Updated weights on worker 0-0, policy_version 1093211 (0.00088) [2022-07-11 07:25:44,993][26022] Updated weights on worker 0-0, policy_version 1093221 (0.00091) [2022-07-11 07:25:45,410][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:25:45,422][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001093223_1119460352.pth [2022-07-11 07:25:45,422][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001091247_1117436928.pth [2022-07-11 07:25:46,946][26022] Updated weights on worker 0-0, policy_version 1093231 (0.00087) [2022-07-11 07:25:47,179][25689] Fps is (10 sec: 5387.2, 60 sec: 5576.8, 300 sec: 5600.5). Total num frames: 1119469568. Throughput: 0: 5864.2. Samples: 1119474906. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:47,180][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 07:25:48,607][26022] Updated weights on worker 0-0, policy_version 1093241 (0.00535) [2022-07-11 07:25:50,483][26022] Updated weights on worker 0-0, policy_version 1093251 (0.00085) [2022-07-11 07:25:52,157][26022] Updated weights on worker 0-0, policy_version 1093261 (0.00086) [2022-07-11 07:25:52,210][25689] Fps is (10 sec: 5707.3, 60 sec: 5576.9, 300 sec: 5607.0). Total num frames: 1119499264. Throughput: 0: 5019.8. Samples: 1119491908. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:52,215][25689] Avg episode reward: [(0, '0.667')] [2022-07-11 07:25:54,195][26022] Updated weights on worker 0-0, policy_version 1093271 (0.00090) [2022-07-11 07:25:55,922][26022] Updated weights on worker 0-0, policy_version 1093281 (0.00081) [2022-07-11 07:25:57,296][25689] Fps is (10 sec: 5566.7, 60 sec: 5570.6, 300 sec: 5599.7). Total num frames: 1119525888. Throughput: 0: 5830.3. Samples: 1119525534. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:25:57,297][25689] Avg episode reward: [(0, '-0.398')] [2022-07-11 07:25:57,966][26022] Updated weights on worker 0-0, policy_version 1093291 (0.00084) [2022-07-11 07:25:59,582][26022] Updated weights on worker 0-0, policy_version 1093301 (0.00071) [2022-07-11 07:26:01,515][26022] Updated weights on worker 0-0, policy_version 1093311 (0.00080) [2022-07-11 07:26:02,406][25689] Fps is (10 sec: 5423.5, 60 sec: 5588.4, 300 sec: 5601.4). Total num frames: 1119554560. Throughput: 0: 5808.6. Samples: 1119558906. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:02,406][25689] Avg episode reward: [(0, '-0.156')] [2022-07-11 07:26:03,775][26022] Updated weights on worker 0-0, policy_version 1093321 (0.00081) [2022-07-11 07:26:05,613][26022] Updated weights on worker 0-0, policy_version 1093331 (0.00086) [2022-07-11 07:26:07,319][26022] Updated weights on worker 0-0, policy_version 1093341 (0.00090) [2022-07-11 07:26:07,464][25689] Fps is (10 sec: 5539.0, 60 sec: 5590.9, 300 sec: 5600.6). Total num frames: 1119582208. Throughput: 0: 5698.4. Samples: 1119590498. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:07,464][25689] Avg episode reward: [(0, '0.981')] [2022-07-11 07:26:09,252][26022] Updated weights on worker 0-0, policy_version 1093351 (0.00091) [2022-07-11 07:26:10,845][26022] Updated weights on worker 0-0, policy_version 1093361 (0.00090) [2022-07-11 07:26:12,481][25689] Fps is (10 sec: 5386.6, 60 sec: 5556.0, 300 sec: 5593.4). Total num frames: 1119608832. Throughput: 0: 5696.4. Samples: 1119607380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:12,482][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 07:26:12,918][26022] Updated weights on worker 0-0, policy_version 1093371 (0.00090) [2022-07-11 07:26:14,369][26022] Updated weights on worker 0-0, policy_version 1093381 (0.00090) [2022-07-11 07:26:16,581][26022] Updated weights on worker 0-0, policy_version 1093391 (0.00091) [2022-07-11 07:26:17,513][25689] Fps is (10 sec: 5604.3, 60 sec: 5596.1, 300 sec: 5594.9). Total num frames: 1119638528. Throughput: 0: 5721.8. Samples: 1119641214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:17,514][25689] Avg episode reward: [(0, '0.600')] [2022-07-11 07:26:18,067][26022] Updated weights on worker 0-0, policy_version 1093401 (0.00069) [2022-07-11 07:26:20,245][26022] Updated weights on worker 0-0, policy_version 1093411 (0.00092) [2022-07-11 07:26:21,711][26022] Updated weights on worker 0-0, policy_version 1093421 (0.00086) [2022-07-11 07:26:22,605][25689] Fps is (10 sec: 5664.0, 60 sec: 5578.2, 300 sec: 5597.5). Total num frames: 1119666176. Throughput: 0: 5742.4. Samples: 1119674900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:22,606][25689] Avg episode reward: [(0, '0.564')] [2022-07-11 07:26:23,823][26022] Updated weights on worker 0-0, policy_version 1093431 (0.00081) [2022-07-11 07:26:25,215][26022] Updated weights on worker 0-0, policy_version 1093441 (0.00086) [2022-07-11 07:26:27,422][26022] Updated weights on worker 0-0, policy_version 1093451 (0.00090) [2022-07-11 07:26:27,667][25689] Fps is (10 sec: 5446.0, 60 sec: 5542.1, 300 sec: 5593.3). Total num frames: 1119693824. Throughput: 0: 5021.2. Samples: 1119691942. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:27,668][25689] Avg episode reward: [(0, '0.923')] [2022-07-11 07:26:29,084][26022] Updated weights on worker 0-0, policy_version 1093461 (0.00091) [2022-07-11 07:26:31,064][26022] Updated weights on worker 0-0, policy_version 1093471 (0.00080) [2022-07-11 07:26:32,685][25689] Fps is (10 sec: 5689.0, 60 sec: 5592.7, 300 sec: 5596.5). Total num frames: 1119723520. Throughput: 0: 5845.1. Samples: 1119725474. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:32,686][25689] Avg episode reward: [(0, '0.111')] [2022-07-11 07:26:32,720][26022] Updated weights on worker 0-0, policy_version 1093481 (0.00084) [2022-07-11 07:26:34,862][26022] Updated weights on worker 0-0, policy_version 1093491 (0.00091) [2022-07-11 07:26:36,390][26022] Updated weights on worker 0-0, policy_version 1093501 (0.00094) [2022-07-11 07:26:37,688][25689] Fps is (10 sec: 5619.9, 60 sec: 5527.0, 300 sec: 5590.4). Total num frames: 1119750144. Throughput: 0: 5847.6. Samples: 1119759190. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:37,689][25689] Avg episode reward: [(0, '0.460')] [2022-07-11 07:26:38,532][26022] Updated weights on worker 0-0, policy_version 1093511 (0.00086) [2022-07-11 07:26:40,074][26022] Updated weights on worker 0-0, policy_version 1093521 (0.00091) [2022-07-11 07:26:42,203][26022] Updated weights on worker 0-0, policy_version 1093531 (0.00089) [2022-07-11 07:26:42,736][25689] Fps is (10 sec: 5399.7, 60 sec: 5544.9, 300 sec: 5586.2). Total num frames: 1119777792. Throughput: 0: 5011.6. Samples: 1119775788. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:42,737][25689] Avg episode reward: [(0, '0.431')] [2022-07-11 07:26:43,779][26022] Updated weights on worker 0-0, policy_version 1093541 (0.00092) [2022-07-11 07:26:45,822][26022] Updated weights on worker 0-0, policy_version 1093551 (0.00088) [2022-07-11 07:26:47,512][26022] Updated weights on worker 0-0, policy_version 1093561 (0.00092) [2022-07-11 07:26:47,739][25689] Fps is (10 sec: 5705.5, 60 sec: 5579.9, 300 sec: 5593.6). Total num frames: 1119807488. Throughput: 0: 5838.4. Samples: 1119809132. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:47,739][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 07:26:49,495][26022] Updated weights on worker 0-0, policy_version 1093571 (0.00084) [2022-07-11 07:26:51,139][26022] Updated weights on worker 0-0, policy_version 1093581 (0.00083) [2022-07-11 07:26:52,831][25689] Fps is (10 sec: 5680.5, 60 sec: 5540.5, 300 sec: 5588.5). Total num frames: 1119835136. Throughput: 0: 5826.7. Samples: 1119842858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:52,833][25689] Avg episode reward: [(0, '1.369')] [2022-07-11 07:26:53,212][26022] Updated weights on worker 0-0, policy_version 1093591 (0.00079) [2022-07-11 07:26:54,774][26022] Updated weights on worker 0-0, policy_version 1093601 (0.00081) [2022-07-11 07:26:56,974][26022] Updated weights on worker 0-0, policy_version 1093611 (0.00092) [2022-07-11 07:26:57,876][25689] Fps is (10 sec: 5556.1, 60 sec: 5578.1, 300 sec: 5593.2). Total num frames: 1119863808. Throughput: 0: 4976.6. Samples: 1119859654. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:26:57,876][25689] Avg episode reward: [(0, '0.686')] [2022-07-11 07:26:58,547][26022] Updated weights on worker 0-0, policy_version 1093621 (0.00088) [2022-07-11 07:27:00,771][26022] Updated weights on worker 0-0, policy_version 1093631 (0.00087) [2022-07-11 07:27:02,488][26022] Updated weights on worker 0-0, policy_version 1093641 (0.00082) [2022-07-11 07:27:02,940][25689] Fps is (10 sec: 5470.2, 60 sec: 5548.5, 300 sec: 5590.1). Total num frames: 1119890432. Throughput: 0: 5788.7. Samples: 1119892740. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:02,942][25689] Avg episode reward: [(0, '1.492')] [2022-07-11 07:27:04,616][26022] Updated weights on worker 0-0, policy_version 1093651 (0.00096) [2022-07-11 07:27:06,117][26022] Updated weights on worker 0-0, policy_version 1093661 (0.00086) [2022-07-11 07:27:07,983][25689] Fps is (10 sec: 5268.3, 60 sec: 5532.9, 300 sec: 5582.5). Total num frames: 1119917056. Throughput: 0: 5682.4. Samples: 1119924166. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:07,984][25689] Avg episode reward: [(0, '1.455')] [2022-07-11 07:27:08,336][26022] Updated weights on worker 0-0, policy_version 1093671 (0.00092) [2022-07-11 07:27:09,977][26022] Updated weights on worker 0-0, policy_version 1093681 (0.00086) [2022-07-11 07:27:11,809][26022] Updated weights on worker 0-0, policy_version 1093691 (0.00083) [2022-07-11 07:27:13,016][25689] Fps is (10 sec: 5691.2, 60 sec: 5599.2, 300 sec: 5593.8). Total num frames: 1119947776. Throughput: 0: 4867.2. Samples: 1119941100. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:13,016][25689] Avg episode reward: [(0, '1.167')] [2022-07-11 07:27:13,624][26022] Updated weights on worker 0-0, policy_version 1093701 (0.00089) [2022-07-11 07:27:15,661][26022] Updated weights on worker 0-0, policy_version 1093711 (0.00081) [2022-07-11 07:27:17,322][26022] Updated weights on worker 0-0, policy_version 1093721 (0.00084) [2022-07-11 07:27:18,051][25689] Fps is (10 sec: 5594.0, 60 sec: 5531.3, 300 sec: 5583.9). Total num frames: 1119973376. Throughput: 0: 5688.8. Samples: 1119974426. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:18,051][25689] Avg episode reward: [(0, '0.945')] [2022-07-11 07:27:19,323][26022] Updated weights on worker 0-0, policy_version 1093731 (0.00086) [2022-07-11 07:27:21,095][26022] Updated weights on worker 0-0, policy_version 1093741 (0.00083) [2022-07-11 07:27:22,989][26022] Updated weights on worker 0-0, policy_version 1093751 (0.00084) [2022-07-11 07:27:23,162][25689] Fps is (10 sec: 5248.1, 60 sec: 5529.5, 300 sec: 5571.9). Total num frames: 1120001024. Throughput: 0: 5690.0. Samples: 1120007804. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:23,162][25689] Avg episode reward: [(0, '1.614')] [2022-07-11 07:27:24,792][26022] Updated weights on worker 0-0, policy_version 1093761 (0.00089) [2022-07-11 07:27:26,612][26022] Updated weights on worker 0-0, policy_version 1093771 (0.00088) [2022-07-11 07:27:28,195][25689] Fps is (10 sec: 5551.9, 60 sec: 5549.0, 300 sec: 5578.2). Total num frames: 1120029696. Throughput: 0: 4969.9. Samples: 1120024618. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:28,195][25689] Avg episode reward: [(0, '1.394')] [2022-07-11 07:27:28,382][26022] Updated weights on worker 0-0, policy_version 1093781 (0.00085) [2022-07-11 07:27:30,607][26022] Updated weights on worker 0-0, policy_version 1093791 (0.00096) [2022-07-11 07:27:32,035][26022] Updated weights on worker 0-0, policy_version 1093801 (0.00092) [2022-07-11 07:27:33,232][25689] Fps is (10 sec: 5592.8, 60 sec: 5513.5, 300 sec: 5574.3). Total num frames: 1120057344. Throughput: 0: 5763.8. Samples: 1120057622. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:33,232][25689] Avg episode reward: [(0, '1.469')] [2022-07-11 07:27:34,199][26022] Updated weights on worker 0-0, policy_version 1093811 (0.00088) [2022-07-11 07:27:35,573][26022] Updated weights on worker 0-0, policy_version 1093821 (0.00083) [2022-07-11 07:27:37,841][26022] Updated weights on worker 0-0, policy_version 1093831 (0.00084) [2022-07-11 07:27:38,261][25689] Fps is (10 sec: 5493.0, 60 sec: 5528.0, 300 sec: 5568.0). Total num frames: 1120084992. Throughput: 0: 5773.8. Samples: 1120091120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:38,262][25689] Avg episode reward: [(0, '1.367')] [2022-07-11 07:27:39,394][26022] Updated weights on worker 0-0, policy_version 1093841 (0.00087) [2022-07-11 07:27:41,470][26022] Updated weights on worker 0-0, policy_version 1093851 (0.00092) [2022-07-11 07:27:43,150][26022] Updated weights on worker 0-0, policy_version 1093861 (0.00086) [2022-07-11 07:27:43,355][25689] Fps is (10 sec: 5563.7, 60 sec: 5540.8, 300 sec: 5570.9). Total num frames: 1120113664. Throughput: 0: 4953.2. Samples: 1120107822. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:43,355][25689] Avg episode reward: [(0, '0.483')] [2022-07-11 07:27:44,973][26022] Updated weights on worker 0-0, policy_version 1093871 (0.00084) [2022-07-11 07:27:45,559][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:27:45,578][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001093873_1120125952.pth [2022-07-11 07:27:45,579][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001091910_1118115840.pth [2022-07-11 07:27:46,866][26022] Updated weights on worker 0-0, policy_version 1093881 (0.00089) [2022-07-11 07:27:48,386][25689] Fps is (10 sec: 5563.0, 60 sec: 5504.4, 300 sec: 5564.0). Total num frames: 1120141312. Throughput: 0: 5784.5. Samples: 1120141410. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:48,386][25689] Avg episode reward: [(0, '0.809')] [2022-07-11 07:27:48,892][26022] Updated weights on worker 0-0, policy_version 1093891 (0.00090) [2022-07-11 07:27:50,392][26022] Updated weights on worker 0-0, policy_version 1093901 (0.00089) [2022-07-11 07:27:52,597][26022] Updated weights on worker 0-0, policy_version 1093911 (0.00091) [2022-07-11 07:27:53,394][25689] Fps is (10 sec: 5609.9, 60 sec: 5528.9, 300 sec: 5564.2). Total num frames: 1120169984. Throughput: 0: 5827.8. Samples: 1120175122. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:53,396][25689] Avg episode reward: [(0, '-0.024')] [2022-07-11 07:27:54,033][26022] Updated weights on worker 0-0, policy_version 1093921 (0.00086) [2022-07-11 07:27:56,022][26022] Updated weights on worker 0-0, policy_version 1093931 (0.00112) [2022-07-11 07:27:57,689][26022] Updated weights on worker 0-0, policy_version 1093941 (0.00094) [2022-07-11 07:27:58,442][25689] Fps is (10 sec: 5600.3, 60 sec: 5511.7, 300 sec: 5561.6). Total num frames: 1120197632. Throughput: 0: 5003.6. Samples: 1120192096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:27:58,444][25689] Avg episode reward: [(0, '-0.438')] [2022-07-11 07:27:59,739][26022] Updated weights on worker 0-0, policy_version 1093951 (0.00089) [2022-07-11 07:28:01,828][26022] Updated weights on worker 0-0, policy_version 1093961 (0.00091) [2022-07-11 07:28:03,605][25689] Fps is (10 sec: 5415.2, 60 sec: 5519.6, 300 sec: 5562.5). Total num frames: 1120225280. Throughput: 0: 5801.5. Samples: 1120225306. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:28:03,606][25689] Avg episode reward: [(0, '-2.905')] [2022-07-11 07:28:03,635][26022] Updated weights on worker 0-0, policy_version 1093971 (0.00078) [2022-07-11 07:28:05,611][26022] Updated weights on worker 0-0, policy_version 1093981 (0.00064) [2022-07-11 07:28:07,207][26022] Updated weights on worker 0-0, policy_version 1093991 (0.00092) [2022-07-11 07:28:08,643][25689] Fps is (10 sec: 5320.6, 60 sec: 5520.1, 300 sec: 5559.0). Total num frames: 1120251904. Throughput: 0: 5710.6. Samples: 1120257090. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:28:08,643][25689] Avg episode reward: [(0, '-2.202')] [2022-07-11 07:28:09,312][26022] Updated weights on worker 0-0, policy_version 1094001 (0.00088) [2022-07-11 07:28:10,964][26022] Updated weights on worker 0-0, policy_version 1094011 (0.00087) [2022-07-11 07:28:12,906][26022] Updated weights on worker 0-0, policy_version 1094021 (0.00089) [2022-07-11 07:28:13,685][25689] Fps is (10 sec: 5587.6, 60 sec: 5502.4, 300 sec: 5558.3). Total num frames: 1120281600. Throughput: 0: 5690.4. Samples: 1120290584. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:28:13,686][25689] Avg episode reward: [(0, '-2.392')] [2022-07-11 07:28:14,644][26022] Updated weights on worker 0-0, policy_version 1094031 (0.00094) [2022-07-11 07:28:16,537][26022] Updated weights on worker 0-0, policy_version 1094041 (0.00087) [2022-07-11 07:28:18,280][26022] Updated weights on worker 0-0, policy_version 1094051 (0.00091) [2022-07-11 07:28:18,771][25689] Fps is (10 sec: 5661.8, 60 sec: 5531.5, 300 sec: 5554.8). Total num frames: 1120309248. Throughput: 0: 5659.6. Samples: 1120307148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:28:18,771][25689] Avg episode reward: [(0, '-2.508')] [2022-07-11 07:28:20,245][26022] Updated weights on worker 0-0, policy_version 1094061 (0.00095) [2022-07-11 07:28:22,076][26022] Updated weights on worker 0-0, policy_version 1094071 (0.00089) [2022-07-11 07:28:23,833][25689] Fps is (10 sec: 5549.7, 60 sec: 5552.8, 300 sec: 5558.0). Total num frames: 1120337920. Throughput: 0: 5710.7. Samples: 1120340820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:28:23,835][25689] Avg episode reward: [(0, '-1.822')] [2022-07-11 07:28:23,985][26022] Updated weights on worker 0-0, policy_version 1094081 (0.00087) [2022-07-11 07:28:25,615][26022] Updated weights on worker 0-0, policy_version 1094091 (0.00090) [2022-07-11 07:28:27,707][26022] Updated weights on worker 0-0, policy_version 1094101 (0.00085) [2022-07-11 07:28:28,856][25689] Fps is (10 sec: 5584.2, 60 sec: 5536.8, 300 sec: 5554.8). Total num frames: 1120365568. Throughput: 0: 5782.5. Samples: 1120373976. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2022-07-11 07:28:28,858][25689] Avg episode reward: [(0, '-1.507')] [2022-07-11 07:28:29,345][26022] Updated weights on worker 0-0, policy_version 1094111 (0.00085) [2022-07-11 07:28:31,439][26022] Updated weights on worker 0-0, policy_version 1094121 (0.00085) [2022-07-11 07:28:32,995][26022] Updated weights on worker 0-0, policy_version 1094131 (0.00097) [2022-07-11 07:28:33,872][25689] Fps is (10 sec: 5508.3, 60 sec: 5538.8, 300 sec: 5551.7). Total num frames: 1120393216. Throughput: 0: 4972.8. Samples: 1120390970. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:28:33,872][25689] Avg episode reward: [(0, '0.140')] [2022-07-11 07:28:35,027][26022] Updated weights on worker 0-0, policy_version 1094141 (0.00093) [2022-07-11 07:28:36,925][26022] Updated weights on worker 0-0, policy_version 1094151 (0.00088) [2022-07-11 07:28:38,662][26022] Updated weights on worker 0-0, policy_version 1094161 (0.00086) [2022-07-11 07:28:38,915][25689] Fps is (10 sec: 5701.2, 60 sec: 5571.4, 300 sec: 5556.5). Total num frames: 1120422912. Throughput: 0: 5826.2. Samples: 1120424510. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:28:38,916][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 07:28:40,571][26022] Updated weights on worker 0-0, policy_version 1094171 (0.00100) [2022-07-11 07:28:42,370][26022] Updated weights on worker 0-0, policy_version 1094181 (0.00094) [2022-07-11 07:28:44,039][25689] Fps is (10 sec: 5640.2, 60 sec: 5551.7, 300 sec: 5551.6). Total num frames: 1120450560. Throughput: 0: 5805.3. Samples: 1120458120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:28:44,039][25689] Avg episode reward: [(0, '1.007')] [2022-07-11 07:28:44,095][26022] Updated weights on worker 0-0, policy_version 1094191 (0.00084) [2022-07-11 07:28:46,041][26022] Updated weights on worker 0-0, policy_version 1094201 (0.00091) [2022-07-11 07:28:47,562][26022] Updated weights on worker 0-0, policy_version 1094211 (0.00090) [2022-07-11 07:28:49,093][25689] Fps is (10 sec: 5432.5, 60 sec: 5549.5, 300 sec: 5547.3). Total num frames: 1120478208. Throughput: 0: 4987.6. Samples: 1120474910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:28:49,094][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 07:28:49,623][26022] Updated weights on worker 0-0, policy_version 1094221 (0.00087) [2022-07-11 07:28:51,411][26022] Updated weights on worker 0-0, policy_version 1094231 (0.00087) [2022-07-11 07:28:53,378][26022] Updated weights on worker 0-0, policy_version 1094241 (0.00072) [2022-07-11 07:28:54,096][25689] Fps is (10 sec: 5498.1, 60 sec: 5533.1, 300 sec: 5547.5). Total num frames: 1120505856. Throughput: 0: 5803.7. Samples: 1120508346. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:28:54,096][25689] Avg episode reward: [(0, '0.969')] [2022-07-11 07:28:55,083][26022] Updated weights on worker 0-0, policy_version 1094251 (0.00081) [2022-07-11 07:28:57,063][26022] Updated weights on worker 0-0, policy_version 1094261 (0.00089) [2022-07-11 07:28:58,679][26022] Updated weights on worker 0-0, policy_version 1094271 (0.00100) [2022-07-11 07:28:59,106][25689] Fps is (10 sec: 5727.2, 60 sec: 5570.4, 300 sec: 5549.3). Total num frames: 1120535552. Throughput: 0: 5822.0. Samples: 1120542064. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:28:59,106][25689] Avg episode reward: [(0, '0.934')] [2022-07-11 07:29:00,727][26022] Updated weights on worker 0-0, policy_version 1094281 (0.00093) [2022-07-11 07:29:02,687][26022] Updated weights on worker 0-0, policy_version 1094291 (0.00087) [2022-07-11 07:29:04,160][25689] Fps is (10 sec: 5494.4, 60 sec: 5546.6, 300 sec: 5548.8). Total num frames: 1120561152. Throughput: 0: 4976.6. Samples: 1120558256. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:04,160][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 07:29:04,615][26022] Updated weights on worker 0-0, policy_version 1094301 (0.00093) [2022-07-11 07:29:06,562][26022] Updated weights on worker 0-0, policy_version 1094311 (0.00088) [2022-07-11 07:29:08,299][26022] Updated weights on worker 0-0, policy_version 1094321 (0.00088) [2022-07-11 07:29:09,221][25689] Fps is (10 sec: 5264.4, 60 sec: 5561.4, 300 sec: 5545.2). Total num frames: 1120588800. Throughput: 0: 5742.3. Samples: 1120590488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:09,222][25689] Avg episode reward: [(0, '1.917')] [2022-07-11 07:29:10,159][26022] Updated weights on worker 0-0, policy_version 1094331 (0.00078) [2022-07-11 07:29:11,830][26022] Updated weights on worker 0-0, policy_version 1094341 (0.00091) [2022-07-11 07:29:13,831][26022] Updated weights on worker 0-0, policy_version 1094351 (0.00084) [2022-07-11 07:29:14,238][25689] Fps is (10 sec: 5588.5, 60 sec: 5546.7, 300 sec: 5551.9). Total num frames: 1120617472. Throughput: 0: 5759.9. Samples: 1120624360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:14,238][25689] Avg episode reward: [(0, '1.419')] [2022-07-11 07:29:15,666][26022] Updated weights on worker 0-0, policy_version 1094361 (0.00083) [2022-07-11 07:29:17,430][26022] Updated weights on worker 0-0, policy_version 1094371 (0.00093) [2022-07-11 07:29:19,282][25689] Fps is (10 sec: 5597.6, 60 sec: 5550.6, 300 sec: 5549.2). Total num frames: 1120645120. Throughput: 0: 4921.1. Samples: 1120641354. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:19,283][25689] Avg episode reward: [(0, '1.683')] [2022-07-11 07:29:19,344][26022] Updated weights on worker 0-0, policy_version 1094381 (0.00084) [2022-07-11 07:29:21,217][26022] Updated weights on worker 0-0, policy_version 1094391 (0.00070) [2022-07-11 07:29:22,739][26022] Updated weights on worker 0-0, policy_version 1094401 (0.00085) [2022-07-11 07:29:24,346][25689] Fps is (10 sec: 5571.8, 60 sec: 5550.4, 300 sec: 5548.0). Total num frames: 1120673792. Throughput: 0: 5765.8. Samples: 1120674644. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:24,347][25689] Avg episode reward: [(0, '2.309')] [2022-07-11 07:29:25,157][26022] Updated weights on worker 0-0, policy_version 1094411 (0.00093) [2022-07-11 07:29:26,686][26022] Updated weights on worker 0-0, policy_version 1094421 (0.00086) [2022-07-11 07:29:28,599][26022] Updated weights on worker 0-0, policy_version 1094431 (0.00088) [2022-07-11 07:29:29,363][25689] Fps is (10 sec: 5688.5, 60 sec: 5567.9, 300 sec: 5555.0). Total num frames: 1120702464. Throughput: 0: 5820.8. Samples: 1120707732. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:29,364][25689] Avg episode reward: [(0, '2.340')] [2022-07-11 07:29:30,469][26022] Updated weights on worker 0-0, policy_version 1094441 (0.00092) [2022-07-11 07:29:32,234][26022] Updated weights on worker 0-0, policy_version 1094451 (0.00089) [2022-07-11 07:29:34,257][26022] Updated weights on worker 0-0, policy_version 1094461 (0.00086) [2022-07-11 07:29:34,372][25689] Fps is (10 sec: 5515.3, 60 sec: 5551.6, 300 sec: 5548.1). Total num frames: 1120729088. Throughput: 0: 4962.0. Samples: 1120724266. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:34,372][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 07:29:35,715][26022] Updated weights on worker 0-0, policy_version 1094471 (0.00082) [2022-07-11 07:29:37,772][26022] Updated weights on worker 0-0, policy_version 1094481 (0.00086) [2022-07-11 07:29:39,390][25689] Fps is (10 sec: 5514.7, 60 sec: 5536.9, 300 sec: 5552.7). Total num frames: 1120757760. Throughput: 0: 5803.7. Samples: 1120758054. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:39,391][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 07:29:39,519][26022] Updated weights on worker 0-0, policy_version 1094491 (0.00084) [2022-07-11 07:29:41,372][26022] Updated weights on worker 0-0, policy_version 1094501 (0.00088) [2022-07-11 07:29:43,363][26022] Updated weights on worker 0-0, policy_version 1094511 (0.00085) [2022-07-11 07:29:44,476][25689] Fps is (10 sec: 5573.7, 60 sec: 5540.4, 300 sec: 5551.6). Total num frames: 1120785408. Throughput: 0: 5801.9. Samples: 1120791440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:44,477][25689] Avg episode reward: [(0, '0.080')] [2022-07-11 07:29:45,071][26022] Updated weights on worker 0-0, policy_version 1094521 (0.00082) [2022-07-11 07:29:45,610][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:29:45,625][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001094524_1120792576.pth [2022-07-11 07:29:45,625][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001092572_1118793728.pth [2022-07-11 07:29:47,119][26022] Updated weights on worker 0-0, policy_version 1094531 (0.00502) [2022-07-11 07:29:48,655][26022] Updated weights on worker 0-0, policy_version 1094541 (0.00087) [2022-07-11 07:29:49,501][25689] Fps is (10 sec: 5570.4, 60 sec: 5560.1, 300 sec: 5548.3). Total num frames: 1120814080. Throughput: 0: 4973.0. Samples: 1120807876. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:49,501][25689] Avg episode reward: [(0, '-0.301')] [2022-07-11 07:29:50,735][26022] Updated weights on worker 0-0, policy_version 1094551 (0.00088) [2022-07-11 07:29:52,501][26022] Updated weights on worker 0-0, policy_version 1094561 (0.00086) [2022-07-11 07:29:54,435][26022] Updated weights on worker 0-0, policy_version 1094571 (0.00097) [2022-07-11 07:29:54,553][25689] Fps is (10 sec: 5487.3, 60 sec: 5538.6, 300 sec: 5547.6). Total num frames: 1120840704. Throughput: 0: 5806.3. Samples: 1120841446. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:54,554][25689] Avg episode reward: [(0, '-0.571')] [2022-07-11 07:29:56,114][26022] Updated weights on worker 0-0, policy_version 1094581 (0.00089) [2022-07-11 07:29:58,035][26022] Updated weights on worker 0-0, policy_version 1094591 (0.00086) [2022-07-11 07:29:59,569][25689] Fps is (10 sec: 5593.8, 60 sec: 5538.1, 300 sec: 5556.4). Total num frames: 1120870400. Throughput: 0: 5797.9. Samples: 1120875050. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:29:59,569][25689] Avg episode reward: [(0, '0.679')] [2022-07-11 07:29:59,689][26022] Updated weights on worker 0-0, policy_version 1094601 (0.00092) [2022-07-11 07:30:02,222][26022] Updated weights on worker 0-0, policy_version 1094611 (0.00086) [2022-07-11 07:30:04,015][26022] Updated weights on worker 0-0, policy_version 1094622 (0.00085) [2022-07-11 07:30:04,676][25689] Fps is (10 sec: 5462.5, 60 sec: 5533.2, 300 sec: 5549.1). Total num frames: 1120896000. Throughput: 0: 4880.3. Samples: 1120890022. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:04,677][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 07:30:05,998][26022] Updated weights on worker 0-0, policy_version 1094632 (0.00092) [2022-07-11 07:30:07,789][26022] Updated weights on worker 0-0, policy_version 1094642 (0.00096) [2022-07-11 07:30:09,654][26022] Updated weights on worker 0-0, policy_version 1094652 (0.00054) [2022-07-11 07:30:09,739][25689] Fps is (10 sec: 5235.8, 60 sec: 5533.0, 300 sec: 5544.6). Total num frames: 1120923648. Throughput: 0: 5695.5. Samples: 1120923142. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:09,739][25689] Avg episode reward: [(0, '1.697')] [2022-07-11 07:30:11,412][26022] Updated weights on worker 0-0, policy_version 1094662 (0.00097) [2022-07-11 07:30:13,172][26022] Updated weights on worker 0-0, policy_version 1094672 (0.00083) [2022-07-11 07:30:14,745][25689] Fps is (10 sec: 5593.4, 60 sec: 5534.0, 300 sec: 5549.8). Total num frames: 1120952320. Throughput: 0: 5703.9. Samples: 1120956618. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:14,746][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 07:30:15,117][26022] Updated weights on worker 0-0, policy_version 1094682 (0.00084) [2022-07-11 07:30:17,002][26022] Updated weights on worker 0-0, policy_version 1094692 (0.00094) [2022-07-11 07:30:18,699][26022] Updated weights on worker 0-0, policy_version 1094702 (0.00086) [2022-07-11 07:30:19,766][25689] Fps is (10 sec: 5616.8, 60 sec: 5536.2, 300 sec: 5547.5). Total num frames: 1120979968. Throughput: 0: 4881.8. Samples: 1120973648. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:19,766][25689] Avg episode reward: [(0, '1.529')] [2022-07-11 07:30:20,604][26022] Updated weights on worker 0-0, policy_version 1094712 (0.00091) [2022-07-11 07:30:22,367][26022] Updated weights on worker 0-0, policy_version 1094722 (0.00084) [2022-07-11 07:30:24,108][26022] Updated weights on worker 0-0, policy_version 1094732 (0.00084) [2022-07-11 07:30:24,832][25689] Fps is (10 sec: 5685.2, 60 sec: 5552.9, 300 sec: 5547.0). Total num frames: 1121009664. Throughput: 0: 5836.9. Samples: 1121007668. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:24,832][25689] Avg episode reward: [(0, '1.712')] [2022-07-11 07:30:26,110][26022] Updated weights on worker 0-0, policy_version 1094742 (0.00087) [2022-07-11 07:30:27,727][26022] Updated weights on worker 0-0, policy_version 1094752 (0.00086) [2022-07-11 07:30:29,753][26022] Updated weights on worker 0-0, policy_version 1094762 (0.00085) [2022-07-11 07:30:29,842][25689] Fps is (10 sec: 5589.4, 60 sec: 5519.7, 300 sec: 5547.1). Total num frames: 1121036288. Throughput: 0: 5884.8. Samples: 1121041446. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:29,842][25689] Avg episode reward: [(0, '1.552')] [2022-07-11 07:30:31,295][26022] Updated weights on worker 0-0, policy_version 1094772 (0.00086) [2022-07-11 07:30:33,256][26022] Updated weights on worker 0-0, policy_version 1094782 (0.00085) [2022-07-11 07:30:34,867][25689] Fps is (10 sec: 5612.5, 60 sec: 5569.0, 300 sec: 5543.6). Total num frames: 1121065984. Throughput: 0: 5050.6. Samples: 1121058242. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:34,867][25689] Avg episode reward: [(0, '0.869')] [2022-07-11 07:30:35,123][26022] Updated weights on worker 0-0, policy_version 1094792 (0.00093) [2022-07-11 07:30:36,809][26022] Updated weights on worker 0-0, policy_version 1094802 (0.00082) [2022-07-11 07:30:38,744][26022] Updated weights on worker 0-0, policy_version 1094812 (0.00089) [2022-07-11 07:30:39,876][25689] Fps is (10 sec: 5715.1, 60 sec: 5552.9, 300 sec: 5548.0). Total num frames: 1121093632. Throughput: 0: 5885.3. Samples: 1121092000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:39,877][25689] Avg episode reward: [(0, '0.269')] [2022-07-11 07:30:40,462][26022] Updated weights on worker 0-0, policy_version 1094822 (0.00092) [2022-07-11 07:30:42,475][26022] Updated weights on worker 0-0, policy_version 1094832 (0.00058) [2022-07-11 07:30:44,215][26022] Updated weights on worker 0-0, policy_version 1094842 (0.00093) [2022-07-11 07:30:44,992][25689] Fps is (10 sec: 5562.1, 60 sec: 5567.1, 300 sec: 5549.6). Total num frames: 1121122304. Throughput: 0: 5844.5. Samples: 1121125496. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:44,993][25689] Avg episode reward: [(0, '0.497')] [2022-07-11 07:30:46,162][26022] Updated weights on worker 0-0, policy_version 1094852 (0.00082) [2022-07-11 07:30:47,799][26022] Updated weights on worker 0-0, policy_version 1094862 (0.00086) [2022-07-11 07:30:49,702][26022] Updated weights on worker 0-0, policy_version 1094872 (0.00088) [2022-07-11 07:30:50,028][25689] Fps is (10 sec: 5648.3, 60 sec: 5566.0, 300 sec: 5546.0). Total num frames: 1121150976. Throughput: 0: 5835.2. Samples: 1121159236. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:50,029][25689] Avg episode reward: [(0, '-0.511')] [2022-07-11 07:30:51,486][26022] Updated weights on worker 0-0, policy_version 1094882 (0.00093) [2022-07-11 07:30:53,345][26022] Updated weights on worker 0-0, policy_version 1094892 (0.00090) [2022-07-11 07:30:55,047][25689] Fps is (10 sec: 5601.4, 60 sec: 5586.0, 300 sec: 5550.7). Total num frames: 1121178624. Throughput: 0: 5830.2. Samples: 1121175898. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:30:55,047][25689] Avg episode reward: [(0, '-0.275')] [2022-07-11 07:30:55,309][26022] Updated weights on worker 0-0, policy_version 1094902 (0.00111) [2022-07-11 07:30:57,144][26022] Updated weights on worker 0-0, policy_version 1094912 (0.00092) [2022-07-11 07:30:58,831][26022] Updated weights on worker 0-0, policy_version 1094922 (0.00076) [2022-07-11 07:31:00,071][25689] Fps is (10 sec: 5607.7, 60 sec: 5568.3, 300 sec: 5552.3). Total num frames: 1121207296. Throughput: 0: 5818.2. Samples: 1121209504. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:00,072][25689] Avg episode reward: [(0, '0.688')] [2022-07-11 07:31:00,897][26022] Updated weights on worker 0-0, policy_version 1094932 (0.00093) [2022-07-11 07:31:02,865][26022] Updated weights on worker 0-0, policy_version 1094942 (0.00089) [2022-07-11 07:31:04,970][26022] Updated weights on worker 0-0, policy_version 1094952 (0.00086) [2022-07-11 07:31:05,205][25689] Fps is (10 sec: 5242.0, 60 sec: 5549.0, 300 sec: 5540.6). Total num frames: 1121231872. Throughput: 0: 5701.8. Samples: 1121240744. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:05,205][25689] Avg episode reward: [(0, '1.224')] [2022-07-11 07:31:06,641][26022] Updated weights on worker 0-0, policy_version 1094962 (0.00090) [2022-07-11 07:31:08,507][26022] Updated weights on worker 0-0, policy_version 1094972 (0.00085) [2022-07-11 07:31:10,253][25689] Fps is (10 sec: 5229.8, 60 sec: 5567.2, 300 sec: 5546.9). Total num frames: 1121260544. Throughput: 0: 4852.3. Samples: 1121257374. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:10,254][25689] Avg episode reward: [(0, '1.207')] [2022-07-11 07:31:10,340][26022] Updated weights on worker 0-0, policy_version 1094982 (0.00089) [2022-07-11 07:31:12,217][26022] Updated weights on worker 0-0, policy_version 1094992 (0.00089) [2022-07-11 07:31:14,037][26022] Updated weights on worker 0-0, policy_version 1095002 (0.00114) [2022-07-11 07:31:15,260][25689] Fps is (10 sec: 5702.8, 60 sec: 5567.2, 300 sec: 5543.9). Total num frames: 1121289216. Throughput: 0: 5689.4. Samples: 1121290900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:15,261][25689] Avg episode reward: [(0, '2.068')] [2022-07-11 07:31:15,902][26022] Updated weights on worker 0-0, policy_version 1095012 (0.00087) [2022-07-11 07:31:17,604][26022] Updated weights on worker 0-0, policy_version 1095022 (0.00085) [2022-07-11 07:31:19,647][26022] Updated weights on worker 0-0, policy_version 1095032 (0.00088) [2022-07-11 07:31:20,282][25689] Fps is (10 sec: 5513.7, 60 sec: 5550.1, 300 sec: 5541.8). Total num frames: 1121315840. Throughput: 0: 5696.8. Samples: 1121324640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:20,282][25689] Avg episode reward: [(0, '1.871')] [2022-07-11 07:31:21,235][26022] Updated weights on worker 0-0, policy_version 1095042 (0.00086) [2022-07-11 07:31:23,216][26022] Updated weights on worker 0-0, policy_version 1095052 (0.00087) [2022-07-11 07:31:24,928][26022] Updated weights on worker 0-0, policy_version 1095062 (0.00093) [2022-07-11 07:31:25,389][25689] Fps is (10 sec: 5459.2, 60 sec: 5529.4, 300 sec: 5544.4). Total num frames: 1121344512. Throughput: 0: 4998.8. Samples: 1121341642. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:25,390][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 07:31:26,807][26022] Updated weights on worker 0-0, policy_version 1095072 (0.00091) [2022-07-11 07:31:28,589][26022] Updated weights on worker 0-0, policy_version 1095082 (0.00085) [2022-07-11 07:31:30,405][25689] Fps is (10 sec: 5664.4, 60 sec: 5562.7, 300 sec: 5541.0). Total num frames: 1121373184. Throughput: 0: 5840.4. Samples: 1121375072. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:30,406][25689] Avg episode reward: [(0, '0.571')] [2022-07-11 07:31:30,639][26022] Updated weights on worker 0-0, policy_version 1095092 (0.00093) [2022-07-11 07:31:32,219][26022] Updated weights on worker 0-0, policy_version 1095102 (0.00084) [2022-07-11 07:31:34,173][26022] Updated weights on worker 0-0, policy_version 1095112 (0.00090) [2022-07-11 07:31:35,454][25689] Fps is (10 sec: 5697.2, 60 sec: 5543.6, 300 sec: 5547.0). Total num frames: 1121401856. Throughput: 0: 5830.0. Samples: 1121408632. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:35,456][25689] Avg episode reward: [(0, '0.573')] [2022-07-11 07:31:35,947][26022] Updated weights on worker 0-0, policy_version 1095122 (0.00089) [2022-07-11 07:31:37,764][26022] Updated weights on worker 0-0, policy_version 1095132 (0.00102) [2022-07-11 07:31:39,694][26022] Updated weights on worker 0-0, policy_version 1095142 (0.00079) [2022-07-11 07:31:40,463][25689] Fps is (10 sec: 5599.8, 60 sec: 5543.6, 300 sec: 5547.7). Total num frames: 1121429504. Throughput: 0: 4999.4. Samples: 1121425534. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:40,467][25689] Avg episode reward: [(0, '0.157')] [2022-07-11 07:31:41,425][26022] Updated weights on worker 0-0, policy_version 1095152 (0.00084) [2022-07-11 07:31:43,433][26022] Updated weights on worker 0-0, policy_version 1095162 (0.00090) [2022-07-11 07:31:45,152][26022] Updated weights on worker 0-0, policy_version 1095172 (0.00086) [2022-07-11 07:31:45,567][25689] Fps is (10 sec: 5569.6, 60 sec: 5544.8, 300 sec: 5542.4). Total num frames: 1121458176. Throughput: 0: 5827.6. Samples: 1121459228. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:45,568][25689] Avg episode reward: [(0, '0.257')] [2022-07-11 07:31:45,774][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:31:45,786][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001095175_1121459200.pth [2022-07-11 07:31:45,786][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001093223_1119460352.pth [2022-07-11 07:31:46,818][26022] Updated weights on worker 0-0, policy_version 1095182 (0.00090) [2022-07-11 07:31:48,791][26022] Updated weights on worker 0-0, policy_version 1095192 (0.00089) [2022-07-11 07:31:50,614][25689] Fps is (10 sec: 5548.0, 60 sec: 5526.8, 300 sec: 5543.3). Total num frames: 1121485824. Throughput: 0: 5841.9. Samples: 1121493130. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:50,615][25689] Avg episode reward: [(0, '1.437')] [2022-07-11 07:31:50,633][26022] Updated weights on worker 0-0, policy_version 1095202 (0.00082) [2022-07-11 07:31:52,471][26022] Updated weights on worker 0-0, policy_version 1095212 (0.00092) [2022-07-11 07:31:54,155][26022] Updated weights on worker 0-0, policy_version 1095222 (0.00082) [2022-07-11 07:31:55,627][25689] Fps is (10 sec: 5598.3, 60 sec: 5544.3, 300 sec: 5543.9). Total num frames: 1121514496. Throughput: 0: 5023.8. Samples: 1121509974. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:31:55,627][25689] Avg episode reward: [(0, '1.661')] [2022-07-11 07:31:55,903][26022] Updated weights on worker 0-0, policy_version 1095232 (0.00086) [2022-07-11 07:31:57,759][26022] Updated weights on worker 0-0, policy_version 1095242 (0.00082) [2022-07-11 07:31:59,447][26022] Updated weights on worker 0-0, policy_version 1095252 (0.00084) [2022-07-11 07:32:00,647][25689] Fps is (10 sec: 5715.7, 60 sec: 5544.7, 300 sec: 5551.5). Total num frames: 1121543168. Throughput: 0: 5871.1. Samples: 1121544036. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:00,649][25689] Avg episode reward: [(0, '0.655')] [2022-07-11 07:32:01,564][26022] Updated weights on worker 0-0, policy_version 1095262 (0.00092) [2022-07-11 07:32:03,792][26022] Updated weights on worker 0-0, policy_version 1095272 (0.00084) [2022-07-11 07:32:05,535][26022] Updated weights on worker 0-0, policy_version 1095282 (0.00094) [2022-07-11 07:32:05,704][25689] Fps is (10 sec: 5487.0, 60 sec: 5585.5, 300 sec: 5551.3). Total num frames: 1121569792. Throughput: 0: 5750.8. Samples: 1121575036. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:05,707][25689] Avg episode reward: [(0, '0.107')] [2022-07-11 07:32:07,426][26022] Updated weights on worker 0-0, policy_version 1095292 (0.00078) [2022-07-11 07:32:09,213][26022] Updated weights on worker 0-0, policy_version 1095302 (0.00086) [2022-07-11 07:32:10,729][25689] Fps is (10 sec: 5383.2, 60 sec: 5570.8, 300 sec: 5541.1). Total num frames: 1121597440. Throughput: 0: 4904.9. Samples: 1121591788. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:10,729][25689] Avg episode reward: [(0, '-0.387')] [2022-07-11 07:32:10,991][26022] Updated weights on worker 0-0, policy_version 1095312 (0.00093) [2022-07-11 07:32:12,929][26022] Updated weights on worker 0-0, policy_version 1095322 (0.00085) [2022-07-11 07:32:14,650][26022] Updated weights on worker 0-0, policy_version 1095332 (0.00093) [2022-07-11 07:32:15,799][25689] Fps is (10 sec: 5477.9, 60 sec: 5548.1, 300 sec: 5547.3). Total num frames: 1121625088. Throughput: 0: 5735.6. Samples: 1121625672. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:15,799][25689] Avg episode reward: [(0, '-0.020')] [2022-07-11 07:32:16,689][26022] Updated weights on worker 0-0, policy_version 1095342 (0.00091) [2022-07-11 07:32:18,252][26022] Updated weights on worker 0-0, policy_version 1095352 (0.00092) [2022-07-11 07:32:20,258][26022] Updated weights on worker 0-0, policy_version 1095362 (0.00094) [2022-07-11 07:32:20,802][25689] Fps is (10 sec: 5590.9, 60 sec: 5583.6, 300 sec: 5552.8). Total num frames: 1121653760. Throughput: 0: 5719.1. Samples: 1121659306. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:20,811][25689] Avg episode reward: [(0, '0.055')] [2022-07-11 07:32:22,058][26022] Updated weights on worker 0-0, policy_version 1095372 (0.00086) [2022-07-11 07:32:24,056][26022] Updated weights on worker 0-0, policy_version 1095382 (0.00090) [2022-07-11 07:32:25,934][25689] Fps is (10 sec: 5455.6, 60 sec: 5547.5, 300 sec: 5544.1). Total num frames: 1121680384. Throughput: 0: 4982.9. Samples: 1121675840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:25,935][25689] Avg episode reward: [(0, '-0.994')] [2022-07-11 07:32:25,981][26022] Updated weights on worker 0-0, policy_version 1095392 (0.00092) [2022-07-11 07:32:27,703][26022] Updated weights on worker 0-0, policy_version 1095402 (0.00102) [2022-07-11 07:32:29,575][26022] Updated weights on worker 0-0, policy_version 1095412 (0.00082) [2022-07-11 07:32:30,975][25689] Fps is (10 sec: 5435.8, 60 sec: 5545.2, 300 sec: 5547.4). Total num frames: 1121709056. Throughput: 0: 5788.1. Samples: 1121708974. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2022-07-11 07:32:30,975][25689] Avg episode reward: [(0, '0.046')] [2022-07-11 07:32:31,431][26022] Updated weights on worker 0-0, policy_version 1095422 (0.00084) [2022-07-11 07:32:33,332][26022] Updated weights on worker 0-0, policy_version 1095432 (0.00093) [2022-07-11 07:32:35,119][26022] Updated weights on worker 0-0, policy_version 1095442 (0.00086) [2022-07-11 07:32:35,995][25689] Fps is (10 sec: 5496.1, 60 sec: 5514.0, 300 sec: 5544.2). Total num frames: 1121735680. Throughput: 0: 5762.3. Samples: 1121742050. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:32:35,996][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 07:32:36,996][26022] Updated weights on worker 0-0, policy_version 1095452 (0.00088) [2022-07-11 07:32:39,003][26022] Updated weights on worker 0-0, policy_version 1095462 (0.00083) [2022-07-11 07:32:40,677][26022] Updated weights on worker 0-0, policy_version 1095472 (0.00088) [2022-07-11 07:32:41,010][25689] Fps is (10 sec: 5714.5, 60 sec: 5564.2, 300 sec: 5552.5). Total num frames: 1121766400. Throughput: 0: 4907.6. Samples: 1121758476. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:32:41,010][25689] Avg episode reward: [(0, '0.797')] [2022-07-11 07:32:42,667][26022] Updated weights on worker 0-0, policy_version 1095482 (0.00093) [2022-07-11 07:32:44,363][26022] Updated weights on worker 0-0, policy_version 1095492 (0.00090) [2022-07-11 07:32:46,138][25689] Fps is (10 sec: 5552.5, 60 sec: 5511.2, 300 sec: 5543.8). Total num frames: 1121792000. Throughput: 0: 5744.9. Samples: 1121791910. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:32:46,139][25689] Avg episode reward: [(0, '0.651')] [2022-07-11 07:32:46,394][26022] Updated weights on worker 0-0, policy_version 1095502 (0.00093) [2022-07-11 07:32:48,110][26022] Updated weights on worker 0-0, policy_version 1095512 (0.00087) [2022-07-11 07:32:49,874][26022] Updated weights on worker 0-0, policy_version 1095522 (0.00086) [2022-07-11 07:32:51,148][25689] Fps is (10 sec: 5353.0, 60 sec: 5531.6, 300 sec: 5543.8). Total num frames: 1121820672. Throughput: 0: 5772.8. Samples: 1121825432. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:32:51,149][25689] Avg episode reward: [(0, '0.074')] [2022-07-11 07:32:51,869][26022] Updated weights on worker 0-0, policy_version 1095532 (0.00084) [2022-07-11 07:32:53,445][26022] Updated weights on worker 0-0, policy_version 1095542 (0.00089) [2022-07-11 07:32:55,525][26022] Updated weights on worker 0-0, policy_version 1095552 (0.00091) [2022-07-11 07:32:56,157][25689] Fps is (10 sec: 5723.6, 60 sec: 5531.9, 300 sec: 5548.0). Total num frames: 1121849344. Throughput: 0: 4980.3. Samples: 1121842462. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:32:56,158][25689] Avg episode reward: [(0, '0.883')] [2022-07-11 07:32:56,987][26022] Updated weights on worker 0-0, policy_version 1095562 (0.00086) [2022-07-11 07:32:58,963][26022] Updated weights on worker 0-0, policy_version 1095572 (0.00090) [2022-07-11 07:33:00,876][26022] Updated weights on worker 0-0, policy_version 1095582 (0.00091) [2022-07-11 07:33:01,167][25689] Fps is (10 sec: 5621.7, 60 sec: 5516.0, 300 sec: 5550.8). Total num frames: 1121876992. Throughput: 0: 5846.9. Samples: 1121876332. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:01,167][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 07:33:02,915][26022] Updated weights on worker 0-0, policy_version 1095592 (0.00086) [2022-07-11 07:33:04,819][26022] Updated weights on worker 0-0, policy_version 1095602 (0.00087) [2022-07-11 07:33:06,285][25689] Fps is (10 sec: 5459.6, 60 sec: 5527.3, 300 sec: 5552.7). Total num frames: 1121904640. Throughput: 0: 5739.2. Samples: 1121907538. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:06,286][25689] Avg episode reward: [(0, '0.970')] [2022-07-11 07:33:06,725][26022] Updated weights on worker 0-0, policy_version 1095612 (0.00086) [2022-07-11 07:33:08,632][26022] Updated weights on worker 0-0, policy_version 1095622 (0.00088) [2022-07-11 07:33:10,506][26022] Updated weights on worker 0-0, policy_version 1095632 (0.00086) [2022-07-11 07:33:11,327][25689] Fps is (10 sec: 5341.8, 60 sec: 5508.8, 300 sec: 5542.4). Total num frames: 1121931264. Throughput: 0: 5723.2. Samples: 1121940916. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:11,328][25689] Avg episode reward: [(0, '1.219')] [2022-07-11 07:33:12,208][26022] Updated weights on worker 0-0, policy_version 1095642 (0.00086) [2022-07-11 07:33:14,068][26022] Updated weights on worker 0-0, policy_version 1095652 (0.00096) [2022-07-11 07:33:15,855][26022] Updated weights on worker 0-0, policy_version 1095662 (0.00100) [2022-07-11 07:33:16,358][25689] Fps is (10 sec: 5591.5, 60 sec: 5546.2, 300 sec: 5550.3). Total num frames: 1121960960. Throughput: 0: 5719.3. Samples: 1121957994. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:16,358][25689] Avg episode reward: [(0, '1.438')] [2022-07-11 07:33:17,698][26022] Updated weights on worker 0-0, policy_version 1095672 (0.00085) [2022-07-11 07:33:19,440][26022] Updated weights on worker 0-0, policy_version 1095682 (0.00085) [2022-07-11 07:33:21,336][26022] Updated weights on worker 0-0, policy_version 1095692 (0.00087) [2022-07-11 07:33:21,375][25689] Fps is (10 sec: 5706.8, 60 sec: 5528.0, 300 sec: 5547.7). Total num frames: 1121988608. Throughput: 0: 5734.1. Samples: 1121992208. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:21,377][25689] Avg episode reward: [(0, '1.778')] [2022-07-11 07:33:22,895][26022] Updated weights on worker 0-0, policy_version 1095702 (0.00509) [2022-07-11 07:33:25,047][26022] Updated weights on worker 0-0, policy_version 1095712 (0.00086) [2022-07-11 07:33:26,498][25689] Fps is (10 sec: 5554.3, 60 sec: 5562.7, 300 sec: 5549.3). Total num frames: 1122017280. Throughput: 0: 5855.6. Samples: 1122025892. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:26,500][25689] Avg episode reward: [(0, '1.781')] [2022-07-11 07:33:26,562][26022] Updated weights on worker 0-0, policy_version 1095722 (0.00085) [2022-07-11 07:33:28,486][26022] Updated weights on worker 0-0, policy_version 1095732 (0.00085) [2022-07-11 07:33:30,264][26022] Updated weights on worker 0-0, policy_version 1095742 (0.00071) [2022-07-11 07:33:31,519][25689] Fps is (10 sec: 5653.1, 60 sec: 5564.5, 300 sec: 5552.7). Total num frames: 1122045952. Throughput: 0: 5052.6. Samples: 1122042938. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:31,519][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 07:33:32,194][26022] Updated weights on worker 0-0, policy_version 1095752 (0.00085) [2022-07-11 07:33:34,071][26022] Updated weights on worker 0-0, policy_version 1095762 (0.00089) [2022-07-11 07:33:35,888][26022] Updated weights on worker 0-0, policy_version 1095772 (0.00084) [2022-07-11 07:33:36,575][25689] Fps is (10 sec: 5487.3, 60 sec: 5561.2, 300 sec: 5542.1). Total num frames: 1122072576. Throughput: 0: 5867.8. Samples: 1122076622. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:36,576][25689] Avg episode reward: [(0, '1.941')] [2022-07-11 07:33:37,533][26022] Updated weights on worker 0-0, policy_version 1095782 (0.00089) [2022-07-11 07:33:39,696][26022] Updated weights on worker 0-0, policy_version 1095792 (0.00081) [2022-07-11 07:33:41,410][26022] Updated weights on worker 0-0, policy_version 1095802 (0.00085) [2022-07-11 07:33:41,606][25689] Fps is (10 sec: 5583.2, 60 sec: 5542.8, 300 sec: 5550.7). Total num frames: 1122102272. Throughput: 0: 5823.0. Samples: 1122110016. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:41,608][25689] Avg episode reward: [(0, '2.054')] [2022-07-11 07:33:43,212][26022] Updated weights on worker 0-0, policy_version 1095812 (0.00079) [2022-07-11 07:33:45,121][26022] Updated weights on worker 0-0, policy_version 1095822 (0.00086) [2022-07-11 07:33:45,884][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:33:45,901][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001095826_1122125824.pth [2022-07-11 07:33:45,902][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001093873_1120125952.pth [2022-07-11 07:33:46,726][25689] Fps is (10 sec: 5750.0, 60 sec: 5594.3, 300 sec: 5552.9). Total num frames: 1122130944. Throughput: 0: 4993.7. Samples: 1122126904. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:46,726][25689] Avg episode reward: [(0, '2.053')] [2022-07-11 07:33:46,854][26022] Updated weights on worker 0-0, policy_version 1095832 (0.00080) [2022-07-11 07:33:48,787][26022] Updated weights on worker 0-0, policy_version 1095842 (0.00084) [2022-07-11 07:33:50,613][26022] Updated weights on worker 0-0, policy_version 1095852 (0.00091) [2022-07-11 07:33:51,747][25689] Fps is (10 sec: 5553.9, 60 sec: 5576.4, 300 sec: 5552.6). Total num frames: 1122158592. Throughput: 0: 5806.4. Samples: 1122160390. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:51,747][25689] Avg episode reward: [(0, '1.943')] [2022-07-11 07:33:52,364][26022] Updated weights on worker 0-0, policy_version 1095862 (0.00099) [2022-07-11 07:33:54,294][26022] Updated weights on worker 0-0, policy_version 1095872 (0.00090) [2022-07-11 07:33:56,210][26022] Updated weights on worker 0-0, policy_version 1095882 (0.00094) [2022-07-11 07:33:56,795][25689] Fps is (10 sec: 5390.0, 60 sec: 5539.0, 300 sec: 5541.6). Total num frames: 1122185216. Throughput: 0: 5771.4. Samples: 1122193320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:33:56,795][25689] Avg episode reward: [(0, '1.896')] [2022-07-11 07:33:58,066][26022] Updated weights on worker 0-0, policy_version 1095892 (0.00090) [2022-07-11 07:33:59,903][26022] Updated weights on worker 0-0, policy_version 1095902 (0.00088) [2022-07-11 07:34:01,797][25689] Fps is (10 sec: 5400.1, 60 sec: 5539.7, 300 sec: 5549.4). Total num frames: 1122212864. Throughput: 0: 4956.2. Samples: 1122210086. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:01,798][25689] Avg episode reward: [(0, '1.522')] [2022-07-11 07:34:01,920][26022] Updated weights on worker 0-0, policy_version 1095912 (0.00087) [2022-07-11 07:34:03,927][26022] Updated weights on worker 0-0, policy_version 1095922 (0.00086) [2022-07-11 07:34:05,895][26022] Updated weights on worker 0-0, policy_version 1095932 (0.00114) [2022-07-11 07:34:06,950][25689] Fps is (10 sec: 5344.1, 60 sec: 5519.6, 300 sec: 5544.3). Total num frames: 1122239488. Throughput: 0: 5664.7. Samples: 1122241470. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:06,951][25689] Avg episode reward: [(0, '0.446')] [2022-07-11 07:34:07,454][26022] Updated weights on worker 0-0, policy_version 1095942 (0.00085) [2022-07-11 07:34:09,607][26022] Updated weights on worker 0-0, policy_version 1095952 (0.00087) [2022-07-11 07:34:11,286][26022] Updated weights on worker 0-0, policy_version 1095962 (0.00087) [2022-07-11 07:34:12,020][25689] Fps is (10 sec: 5609.3, 60 sec: 5584.5, 300 sec: 5550.1). Total num frames: 1122270208. Throughput: 0: 5642.3. Samples: 1122274778. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:12,021][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 07:34:13,263][26022] Updated weights on worker 0-0, policy_version 1095972 (0.00086) [2022-07-11 07:34:14,820][26022] Updated weights on worker 0-0, policy_version 1095982 (0.00085) [2022-07-11 07:34:16,635][26022] Updated weights on worker 0-0, policy_version 1095992 (0.00088) [2022-07-11 07:34:17,063][25689] Fps is (10 sec: 5670.5, 60 sec: 5532.8, 300 sec: 5546.7). Total num frames: 1122296832. Throughput: 0: 4848.8. Samples: 1122291596. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:17,063][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 07:34:18,595][26022] Updated weights on worker 0-0, policy_version 1096002 (0.00091) [2022-07-11 07:34:20,489][26022] Updated weights on worker 0-0, policy_version 1096012 (0.00085) [2022-07-11 07:34:22,105][25689] Fps is (10 sec: 5483.1, 60 sec: 5547.4, 300 sec: 5547.1). Total num frames: 1122325504. Throughput: 0: 5677.2. Samples: 1122325378. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:22,105][25689] Avg episode reward: [(0, '0.636')] [2022-07-11 07:34:22,139][26022] Updated weights on worker 0-0, policy_version 1096022 (0.00084) [2022-07-11 07:34:24,128][26022] Updated weights on worker 0-0, policy_version 1096032 (0.00083) [2022-07-11 07:34:25,978][26022] Updated weights on worker 0-0, policy_version 1096042 (0.00076) [2022-07-11 07:34:27,166][25689] Fps is (10 sec: 5574.4, 60 sec: 5536.2, 300 sec: 5542.9). Total num frames: 1122353152. Throughput: 0: 5805.9. Samples: 1122358842. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:27,167][25689] Avg episode reward: [(0, '0.609')] [2022-07-11 07:34:27,875][26022] Updated weights on worker 0-0, policy_version 1096052 (0.00086) [2022-07-11 07:34:29,626][26022] Updated weights on worker 0-0, policy_version 1096062 (0.00094) [2022-07-11 07:34:31,266][26022] Updated weights on worker 0-0, policy_version 1096072 (0.00088) [2022-07-11 07:34:32,203][25689] Fps is (10 sec: 5475.9, 60 sec: 5517.9, 300 sec: 5545.8). Total num frames: 1122380800. Throughput: 0: 5001.5. Samples: 1122375722. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:32,204][25689] Avg episode reward: [(0, '0.927')] [2022-07-11 07:34:33,333][26022] Updated weights on worker 0-0, policy_version 1096082 (0.00090) [2022-07-11 07:34:35,040][26022] Updated weights on worker 0-0, policy_version 1096092 (0.00092) [2022-07-11 07:34:36,787][26022] Updated weights on worker 0-0, policy_version 1096102 (0.00108) [2022-07-11 07:34:37,226][25689] Fps is (10 sec: 5802.0, 60 sec: 5588.4, 300 sec: 5552.6). Total num frames: 1122411520. Throughput: 0: 5851.8. Samples: 1122409588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:37,227][25689] Avg episode reward: [(0, '1.828')] [2022-07-11 07:34:38,783][26022] Updated weights on worker 0-0, policy_version 1096112 (0.00092) [2022-07-11 07:34:40,583][26022] Updated weights on worker 0-0, policy_version 1096122 (0.00086) [2022-07-11 07:34:42,255][25689] Fps is (10 sec: 5704.9, 60 sec: 5538.0, 300 sec: 5550.2). Total num frames: 1122438144. Throughput: 0: 5840.1. Samples: 1122443056. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:42,256][25689] Avg episode reward: [(0, '1.564')] [2022-07-11 07:34:42,330][26022] Updated weights on worker 0-0, policy_version 1096132 (0.00068) [2022-07-11 07:34:44,247][26022] Updated weights on worker 0-0, policy_version 1096142 (0.00084) [2022-07-11 07:34:45,747][26022] Updated weights on worker 0-0, policy_version 1096152 (0.00097) [2022-07-11 07:34:47,398][25689] Fps is (10 sec: 5335.4, 60 sec: 5518.9, 300 sec: 5544.6). Total num frames: 1122465792. Throughput: 0: 5001.4. Samples: 1122460030. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:47,399][25689] Avg episode reward: [(0, '0.972')] [2022-07-11 07:34:48,033][26022] Updated weights on worker 0-0, policy_version 1096162 (0.00086) [2022-07-11 07:34:49,410][26022] Updated weights on worker 0-0, policy_version 1096172 (0.00088) [2022-07-11 07:34:51,466][26022] Updated weights on worker 0-0, policy_version 1096182 (0.00086) [2022-07-11 07:34:52,496][25689] Fps is (10 sec: 5699.5, 60 sec: 5562.6, 300 sec: 5557.5). Total num frames: 1122496512. Throughput: 0: 5826.2. Samples: 1122493950. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:52,496][25689] Avg episode reward: [(0, '1.055')] [2022-07-11 07:34:53,460][26022] Updated weights on worker 0-0, policy_version 1096192 (0.00085) [2022-07-11 07:34:54,938][26022] Updated weights on worker 0-0, policy_version 1096202 (0.00088) [2022-07-11 07:34:57,046][26022] Updated weights on worker 0-0, policy_version 1096212 (0.00096) [2022-07-11 07:34:57,513][25689] Fps is (10 sec: 5669.4, 60 sec: 5565.4, 300 sec: 5547.1). Total num frames: 1122523136. Throughput: 0: 5813.7. Samples: 1122527528. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:34:57,514][25689] Avg episode reward: [(0, '0.914')] [2022-07-11 07:34:58,819][26022] Updated weights on worker 0-0, policy_version 1096222 (0.00083) [2022-07-11 07:35:00,641][26022] Updated weights on worker 0-0, policy_version 1096232 (0.00092) [2022-07-11 07:35:02,602][25689] Fps is (10 sec: 5268.6, 60 sec: 5540.6, 300 sec: 5550.9). Total num frames: 1122549760. Throughput: 0: 5699.1. Samples: 1122559020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:02,603][25689] Avg episode reward: [(0, '0.633')] [2022-07-11 07:35:02,811][26022] Updated weights on worker 0-0, policy_version 1096242 (0.00082) [2022-07-11 07:35:04,818][26022] Updated weights on worker 0-0, policy_version 1096252 (0.00084) [2022-07-11 07:35:06,463][26022] Updated weights on worker 0-0, policy_version 1096262 (0.00085) [2022-07-11 07:35:07,664][25689] Fps is (10 sec: 5447.3, 60 sec: 5582.6, 300 sec: 5554.4). Total num frames: 1122578432. Throughput: 0: 5708.4. Samples: 1122575714. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:07,665][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 07:35:08,486][26022] Updated weights on worker 0-0, policy_version 1096272 (0.00090) [2022-07-11 07:35:10,175][26022] Updated weights on worker 0-0, policy_version 1096282 (0.00084) [2022-07-11 07:35:12,193][26022] Updated weights on worker 0-0, policy_version 1096292 (0.00095) [2022-07-11 07:35:12,683][25689] Fps is (10 sec: 5485.3, 60 sec: 5519.8, 300 sec: 5547.3). Total num frames: 1122605056. Throughput: 0: 5693.0. Samples: 1122608876. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:12,684][25689] Avg episode reward: [(0, '-0.947')] [2022-07-11 07:35:13,791][26022] Updated weights on worker 0-0, policy_version 1096302 (0.00084) [2022-07-11 07:35:15,749][26022] Updated weights on worker 0-0, policy_version 1096312 (0.00086) [2022-07-11 07:35:17,537][26022] Updated weights on worker 0-0, policy_version 1096322 (0.00092) [2022-07-11 07:35:17,692][25689] Fps is (10 sec: 5616.1, 60 sec: 5573.5, 300 sec: 5554.3). Total num frames: 1122634752. Throughput: 0: 5701.1. Samples: 1122642572. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:17,693][25689] Avg episode reward: [(0, '-0.457')] [2022-07-11 07:35:19,394][26022] Updated weights on worker 0-0, policy_version 1096332 (0.00087) [2022-07-11 07:35:20,939][26022] Updated weights on worker 0-0, policy_version 1096342 (0.00088) [2022-07-11 07:35:22,735][25689] Fps is (10 sec: 5603.0, 60 sec: 5539.7, 300 sec: 5544.5). Total num frames: 1122661376. Throughput: 0: 4996.1. Samples: 1122659604. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:22,735][25689] Avg episode reward: [(0, '-1.629')] [2022-07-11 07:35:23,211][26022] Updated weights on worker 0-0, policy_version 1096352 (0.00087) [2022-07-11 07:35:24,609][26022] Updated weights on worker 0-0, policy_version 1096362 (0.00082) [2022-07-11 07:35:26,839][26022] Updated weights on worker 0-0, policy_version 1096372 (0.00094) [2022-07-11 07:35:27,802][25689] Fps is (10 sec: 5570.7, 60 sec: 5572.9, 300 sec: 5553.7). Total num frames: 1122691072. Throughput: 0: 5834.9. Samples: 1122693220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:27,803][25689] Avg episode reward: [(0, '-2.402')] [2022-07-11 07:35:28,372][26022] Updated weights on worker 0-0, policy_version 1096382 (0.00089) [2022-07-11 07:35:30,288][26022] Updated weights on worker 0-0, policy_version 1096392 (0.00093) [2022-07-11 07:35:32,377][26022] Updated weights on worker 0-0, policy_version 1096402 (0.00083) [2022-07-11 07:35:32,874][25689] Fps is (10 sec: 5655.5, 60 sec: 5569.7, 300 sec: 5546.0). Total num frames: 1122718720. Throughput: 0: 5838.2. Samples: 1122726756. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:32,875][25689] Avg episode reward: [(0, '-2.388')] [2022-07-11 07:35:33,924][26022] Updated weights on worker 0-0, policy_version 1096412 (0.00088) [2022-07-11 07:35:35,868][26022] Updated weights on worker 0-0, policy_version 1096422 (0.00089) [2022-07-11 07:35:37,644][26022] Updated weights on worker 0-0, policy_version 1096432 (0.00106) [2022-07-11 07:35:37,923][25689] Fps is (10 sec: 5463.5, 60 sec: 5516.7, 300 sec: 5545.2). Total num frames: 1122746368. Throughput: 0: 4994.8. Samples: 1122743622. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:37,924][25689] Avg episode reward: [(0, '-3.029')] [2022-07-11 07:35:39,462][26022] Updated weights on worker 0-0, policy_version 1096442 (0.00090) [2022-07-11 07:35:41,580][26022] Updated weights on worker 0-0, policy_version 1096452 (0.00924) [2022-07-11 07:35:42,941][25689] Fps is (10 sec: 5797.8, 60 sec: 5585.1, 300 sec: 5553.9). Total num frames: 1122777088. Throughput: 0: 5806.0. Samples: 1122776924. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:42,942][25689] Avg episode reward: [(0, '-1.017')] [2022-07-11 07:35:42,951][26022] Updated weights on worker 0-0, policy_version 1096462 (0.00096) [2022-07-11 07:35:45,121][26022] Updated weights on worker 0-0, policy_version 1096472 (0.00072) [2022-07-11 07:35:46,104][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:35:46,118][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001096478_1122793472.pth [2022-07-11 07:35:46,118][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001094524_1120792576.pth [2022-07-11 07:35:46,794][26022] Updated weights on worker 0-0, policy_version 1096482 (0.00084) [2022-07-11 07:35:48,005][25689] Fps is (10 sec: 5586.4, 60 sec: 5558.7, 300 sec: 5543.1). Total num frames: 1122802688. Throughput: 0: 5819.6. Samples: 1122810790. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:48,007][25689] Avg episode reward: [(0, '0.105')] [2022-07-11 07:35:48,481][26022] Updated weights on worker 0-0, policy_version 1096492 (0.00590) [2022-07-11 07:35:50,667][26022] Updated weights on worker 0-0, policy_version 1096502 (0.00093) [2022-07-11 07:35:52,364][26022] Updated weights on worker 0-0, policy_version 1096512 (0.00082) [2022-07-11 07:35:53,036][25689] Fps is (10 sec: 5376.5, 60 sec: 5531.0, 300 sec: 5546.3). Total num frames: 1122831360. Throughput: 0: 4994.0. Samples: 1122827444. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:53,036][25689] Avg episode reward: [(0, '0.733')] [2022-07-11 07:35:53,944][26022] Updated weights on worker 0-0, policy_version 1096522 (0.00089) [2022-07-11 07:35:56,119][26022] Updated weights on worker 0-0, policy_version 1096532 (0.00051) [2022-07-11 07:35:57,471][26022] Updated weights on worker 0-0, policy_version 1096542 (0.00087) [2022-07-11 07:35:58,062][25689] Fps is (10 sec: 5701.6, 60 sec: 5564.0, 300 sec: 5546.3). Total num frames: 1122860032. Throughput: 0: 5819.0. Samples: 1122860812. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:35:58,063][25689] Avg episode reward: [(0, '0.963')] [2022-07-11 07:35:59,710][26022] Updated weights on worker 0-0, policy_version 1096552 (0.00097) [2022-07-11 07:36:01,696][26022] Updated weights on worker 0-0, policy_version 1096562 (0.00096) [2022-07-11 07:36:03,077][25689] Fps is (10 sec: 5303.2, 60 sec: 5537.0, 300 sec: 5548.5). Total num frames: 1122884608. Throughput: 0: 5739.5. Samples: 1122892490. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:36:03,077][25689] Avg episode reward: [(0, '0.027')] [2022-07-11 07:36:03,632][26022] Updated weights on worker 0-0, policy_version 1096572 (0.00085) [2022-07-11 07:36:05,737][26022] Updated weights on worker 0-0, policy_version 1096582 (0.00083) [2022-07-11 07:36:07,316][26022] Updated weights on worker 0-0, policy_version 1096592 (0.00087) [2022-07-11 07:36:08,119][25689] Fps is (10 sec: 5295.0, 60 sec: 5538.8, 300 sec: 5548.6). Total num frames: 1122913280. Throughput: 0: 4893.0. Samples: 1122909206. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:36:08,119][25689] Avg episode reward: [(0, '0.161')] [2022-07-11 07:36:09,276][26022] Updated weights on worker 0-0, policy_version 1096602 (0.00098) [2022-07-11 07:36:11,268][26022] Updated weights on worker 0-0, policy_version 1096612 (0.00085) [2022-07-11 07:36:12,970][26022] Updated weights on worker 0-0, policy_version 1096622 (0.00586) [2022-07-11 07:36:13,126][25689] Fps is (10 sec: 5706.1, 60 sec: 5573.8, 300 sec: 5548.6). Total num frames: 1122941952. Throughput: 0: 5728.4. Samples: 1122942530. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:36:13,127][25689] Avg episode reward: [(0, '-0.458')] [2022-07-11 07:36:14,935][26022] Updated weights on worker 0-0, policy_version 1096632 (0.00083) [2022-07-11 07:36:16,669][26022] Updated weights on worker 0-0, policy_version 1096642 (0.00054) [2022-07-11 07:36:18,133][25689] Fps is (10 sec: 5624.2, 60 sec: 5540.2, 300 sec: 5552.3). Total num frames: 1122969600. Throughput: 0: 5744.6. Samples: 1122976108. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:36:18,133][25689] Avg episode reward: [(0, '-0.254')] [2022-07-11 07:36:18,440][26022] Updated weights on worker 0-0, policy_version 1096652 (0.00086) [2022-07-11 07:36:20,507][26022] Updated weights on worker 0-0, policy_version 1096662 (0.00088) [2022-07-11 07:36:21,928][26022] Updated weights on worker 0-0, policy_version 1096672 (0.00090) [2022-07-11 07:36:23,151][25689] Fps is (10 sec: 5413.9, 60 sec: 5542.4, 300 sec: 5547.1). Total num frames: 1122996224. Throughput: 0: 4992.0. Samples: 1122992700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:36:23,151][25689] Avg episode reward: [(0, '-0.751')] [2022-07-11 07:36:24,013][26022] Updated weights on worker 0-0, policy_version 1096682 (0.00093) [2022-07-11 07:36:25,909][26022] Updated weights on worker 0-0, policy_version 1096692 (0.00102) [2022-07-11 07:36:27,668][26022] Updated weights on worker 0-0, policy_version 1096702 (0.00089) [2022-07-11 07:36:28,210][25689] Fps is (10 sec: 5588.7, 60 sec: 5543.1, 300 sec: 5549.7). Total num frames: 1123025920. Throughput: 0: 5816.5. Samples: 1123026068. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 07:36:28,211][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 07:36:29,852][26022] Updated weights on worker 0-0, policy_version 1096712 (0.00090) [2022-07-11 07:36:31,171][26022] Updated weights on worker 0-0, policy_version 1096722 (0.00085) [2022-07-11 07:36:33,232][25689] Fps is (10 sec: 5586.8, 60 sec: 5530.8, 300 sec: 5543.4). Total num frames: 1123052544. Throughput: 0: 5805.8. Samples: 1123059258. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:36:33,232][25689] Avg episode reward: [(0, '0.931')] [2022-07-11 07:36:33,414][26022] Updated weights on worker 0-0, policy_version 1096732 (0.00093) [2022-07-11 07:36:35,242][26022] Updated weights on worker 0-0, policy_version 1096743 (0.00085) [2022-07-11 07:36:37,191][26022] Updated weights on worker 0-0, policy_version 1096753 (0.00085) [2022-07-11 07:36:38,247][25689] Fps is (10 sec: 5509.3, 60 sec: 5550.9, 300 sec: 5546.7). Total num frames: 1123081216. Throughput: 0: 4956.9. Samples: 1123075812. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:36:38,247][25689] Avg episode reward: [(0, '1.396')] [2022-07-11 07:36:38,775][26022] Updated weights on worker 0-0, policy_version 1096763 (0.00084) [2022-07-11 07:36:41,077][26022] Updated weights on worker 0-0, policy_version 1096773 (0.00097) [2022-07-11 07:36:42,571][26022] Updated weights on worker 0-0, policy_version 1096783 (0.00097) [2022-07-11 07:36:43,254][25689] Fps is (10 sec: 5517.4, 60 sec: 5484.0, 300 sec: 5541.6). Total num frames: 1123107840. Throughput: 0: 5777.6. Samples: 1123108846. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:36:43,254][25689] Avg episode reward: [(0, '0.243')] [2022-07-11 07:36:44,667][26022] Updated weights on worker 0-0, policy_version 1096793 (0.00084) [2022-07-11 07:36:46,326][26022] Updated weights on worker 0-0, policy_version 1096803 (0.00086) [2022-07-11 07:36:48,249][26022] Updated weights on worker 0-0, policy_version 1096813 (0.00088) [2022-07-11 07:36:48,326][25689] Fps is (10 sec: 5486.0, 60 sec: 5534.1, 300 sec: 5544.6). Total num frames: 1123136512. Throughput: 0: 5783.6. Samples: 1123142412. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:36:48,327][25689] Avg episode reward: [(0, '0.033')] [2022-07-11 07:36:50,149][26022] Updated weights on worker 0-0, policy_version 1096823 (0.00081) [2022-07-11 07:36:51,872][26022] Updated weights on worker 0-0, policy_version 1096833 (0.00091) [2022-07-11 07:36:53,353][25689] Fps is (10 sec: 5678.2, 60 sec: 5534.5, 300 sec: 5544.3). Total num frames: 1123165184. Throughput: 0: 4963.1. Samples: 1123159120. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:36:53,353][25689] Avg episode reward: [(0, '0.898')] [2022-07-11 07:36:53,840][26022] Updated weights on worker 0-0, policy_version 1096843 (0.00085) [2022-07-11 07:36:55,544][26022] Updated weights on worker 0-0, policy_version 1096853 (0.00090) [2022-07-11 07:36:57,541][26022] Updated weights on worker 0-0, policy_version 1096863 (0.00096) [2022-07-11 07:36:58,363][25689] Fps is (10 sec: 5611.6, 60 sec: 5519.0, 300 sec: 5541.1). Total num frames: 1123192832. Throughput: 0: 5817.1. Samples: 1123192828. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:36:58,363][25689] Avg episode reward: [(0, '-0.497')] [2022-07-11 07:36:59,200][26022] Updated weights on worker 0-0, policy_version 1096873 (0.00086) [2022-07-11 07:37:01,032][26022] Updated weights on worker 0-0, policy_version 1096883 (0.00089) [2022-07-11 07:37:03,238][26022] Updated weights on worker 0-0, policy_version 1096893 (0.00093) [2022-07-11 07:37:03,372][25689] Fps is (10 sec: 5314.3, 60 sec: 5536.4, 300 sec: 5538.5). Total num frames: 1123218432. Throughput: 0: 5725.5. Samples: 1123224036. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:03,374][25689] Avg episode reward: [(0, '-0.245')] [2022-07-11 07:37:05,202][26022] Updated weights on worker 0-0, policy_version 1096903 (0.00089) [2022-07-11 07:37:07,102][26022] Updated weights on worker 0-0, policy_version 1096913 (0.00084) [2022-07-11 07:37:08,459][25689] Fps is (10 sec: 5274.2, 60 sec: 5515.4, 300 sec: 5537.4). Total num frames: 1123246080. Throughput: 0: 4881.1. Samples: 1123240680. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:08,460][25689] Avg episode reward: [(0, '-0.841')] [2022-07-11 07:37:08,767][26022] Updated weights on worker 0-0, policy_version 1096923 (0.00070) [2022-07-11 07:37:10,665][26022] Updated weights on worker 0-0, policy_version 1096933 (0.00091) [2022-07-11 07:37:12,472][26022] Updated weights on worker 0-0, policy_version 1096943 (0.00087) [2022-07-11 07:37:13,462][25689] Fps is (10 sec: 5582.2, 60 sec: 5515.8, 300 sec: 5542.1). Total num frames: 1123274752. Throughput: 0: 5716.7. Samples: 1123274076. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:13,462][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 07:37:14,345][26022] Updated weights on worker 0-0, policy_version 1096953 (0.00080) [2022-07-11 07:37:16,055][26022] Updated weights on worker 0-0, policy_version 1096963 (0.00080) [2022-07-11 07:37:18,062][26022] Updated weights on worker 0-0, policy_version 1096973 (0.00096) [2022-07-11 07:37:18,485][25689] Fps is (10 sec: 5719.1, 60 sec: 5531.2, 300 sec: 5541.7). Total num frames: 1123303424. Throughput: 0: 5728.1. Samples: 1123308094. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:18,486][25689] Avg episode reward: [(0, '-0.816')] [2022-07-11 07:37:19,633][26022] Updated weights on worker 0-0, policy_version 1096983 (0.00079) [2022-07-11 07:37:21,739][26022] Updated weights on worker 0-0, policy_version 1096993 (0.00087) [2022-07-11 07:37:23,195][26022] Updated weights on worker 0-0, policy_version 1097003 (0.00092) [2022-07-11 07:37:23,505][25689] Fps is (10 sec: 5709.8, 60 sec: 5565.0, 300 sec: 5550.7). Total num frames: 1123332096. Throughput: 0: 5021.2. Samples: 1123325124. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:23,506][25689] Avg episode reward: [(0, '0.124')] [2022-07-11 07:37:25,416][26022] Updated weights on worker 0-0, policy_version 1097013 (0.00088) [2022-07-11 07:37:26,973][26022] Updated weights on worker 0-0, policy_version 1097023 (0.00092) [2022-07-11 07:37:28,590][25689] Fps is (10 sec: 5573.8, 60 sec: 5528.7, 300 sec: 5546.4). Total num frames: 1123359744. Throughput: 0: 5861.8. Samples: 1123358686. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:28,590][25689] Avg episode reward: [(0, '0.265')] [2022-07-11 07:37:28,876][26022] Updated weights on worker 0-0, policy_version 1097033 (0.00087) [2022-07-11 07:37:30,804][26022] Updated weights on worker 0-0, policy_version 1097043 (0.00092) [2022-07-11 07:37:32,587][26022] Updated weights on worker 0-0, policy_version 1097053 (0.00088) [2022-07-11 07:37:33,675][25689] Fps is (10 sec: 5437.1, 60 sec: 5539.9, 300 sec: 5548.6). Total num frames: 1123387392. Throughput: 0: 5856.6. Samples: 1123392456. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:33,675][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 07:37:34,414][26022] Updated weights on worker 0-0, policy_version 1097063 (0.00086) [2022-07-11 07:37:36,371][26022] Updated weights on worker 0-0, policy_version 1097073 (0.00085) [2022-07-11 07:37:38,016][26022] Updated weights on worker 0-0, policy_version 1097083 (0.00088) [2022-07-11 07:37:38,686][25689] Fps is (10 sec: 5477.2, 60 sec: 5523.4, 300 sec: 5538.4). Total num frames: 1123415040. Throughput: 0: 5001.9. Samples: 1123409134. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:38,686][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 07:37:40,003][26022] Updated weights on worker 0-0, policy_version 1097093 (0.00090) [2022-07-11 07:37:41,848][26022] Updated weights on worker 0-0, policy_version 1097103 (0.00081) [2022-07-11 07:37:43,646][26022] Updated weights on worker 0-0, policy_version 1097113 (0.00055) [2022-07-11 07:37:43,733][25689] Fps is (10 sec: 5599.3, 60 sec: 5553.5, 300 sec: 5550.2). Total num frames: 1123443712. Throughput: 0: 5796.3. Samples: 1123442374. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:43,734][25689] Avg episode reward: [(0, '1.775')] [2022-07-11 07:37:45,525][26022] Updated weights on worker 0-0, policy_version 1097123 (0.00083) [2022-07-11 07:37:46,259][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:37:46,277][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001097127_1123458048.pth [2022-07-11 07:37:46,278][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001095175_1121459200.pth [2022-07-11 07:37:47,343][26022] Updated weights on worker 0-0, policy_version 1097133 (0.00087) [2022-07-11 07:37:48,855][25689] Fps is (10 sec: 5538.3, 60 sec: 5532.1, 300 sec: 5544.7). Total num frames: 1123471360. Throughput: 0: 5759.5. Samples: 1123475402. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:48,855][25689] Avg episode reward: [(0, '1.708')] [2022-07-11 07:37:49,263][26022] Updated weights on worker 0-0, policy_version 1097143 (0.00094) [2022-07-11 07:37:51,112][26022] Updated weights on worker 0-0, policy_version 1097153 (0.00052) [2022-07-11 07:37:52,948][26022] Updated weights on worker 0-0, policy_version 1097163 (0.00091) [2022-07-11 07:37:53,891][25689] Fps is (10 sec: 5544.2, 60 sec: 5531.1, 300 sec: 5544.2). Total num frames: 1123500032. Throughput: 0: 5740.8. Samples: 1123508516. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:53,892][25689] Avg episode reward: [(0, '1.798')] [2022-07-11 07:37:54,796][26022] Updated weights on worker 0-0, policy_version 1097173 (0.00104) [2022-07-11 07:37:56,693][26022] Updated weights on worker 0-0, policy_version 1097183 (0.00085) [2022-07-11 07:37:58,468][26022] Updated weights on worker 0-0, policy_version 1097193 (0.00084) [2022-07-11 07:37:58,937][25689] Fps is (10 sec: 5586.2, 60 sec: 5527.9, 300 sec: 5543.5). Total num frames: 1123527680. Throughput: 0: 5739.2. Samples: 1123525360. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:37:58,937][25689] Avg episode reward: [(0, '1.947')] [2022-07-11 07:38:00,310][26022] Updated weights on worker 0-0, policy_version 1097203 (0.00086) [2022-07-11 07:38:02,647][26022] Updated weights on worker 0-0, policy_version 1097213 (0.00087) [2022-07-11 07:38:03,970][25689] Fps is (10 sec: 5283.2, 60 sec: 5525.7, 300 sec: 5538.2). Total num frames: 1123553280. Throughput: 0: 5644.0. Samples: 1123556592. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:03,971][25689] Avg episode reward: [(0, '1.893')] [2022-07-11 07:38:04,496][26022] Updated weights on worker 0-0, policy_version 1097223 (0.00212) [2022-07-11 07:38:06,291][26022] Updated weights on worker 0-0, policy_version 1097233 (0.00097) [2022-07-11 07:38:07,891][26022] Updated weights on worker 0-0, policy_version 1097243 (0.00099) [2022-07-11 07:38:09,062][25689] Fps is (10 sec: 5360.3, 60 sec: 5542.2, 300 sec: 5544.2). Total num frames: 1123581952. Throughput: 0: 5666.5. Samples: 1123589904. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:09,062][25689] Avg episode reward: [(0, '1.952')] [2022-07-11 07:38:09,967][26022] Updated weights on worker 0-0, policy_version 1097253 (0.00082) [2022-07-11 07:38:11,679][26022] Updated weights on worker 0-0, policy_version 1097263 (0.00093) [2022-07-11 07:38:13,796][26022] Updated weights on worker 0-0, policy_version 1097273 (0.00084) [2022-07-11 07:38:14,132][25689] Fps is (10 sec: 5542.0, 60 sec: 5519.1, 300 sec: 5536.5). Total num frames: 1123609600. Throughput: 0: 4831.1. Samples: 1123606302. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:14,133][25689] Avg episode reward: [(0, '1.915')] [2022-07-11 07:38:15,459][26022] Updated weights on worker 0-0, policy_version 1097283 (0.00086) [2022-07-11 07:38:17,267][26022] Updated weights on worker 0-0, policy_version 1097293 (0.00086) [2022-07-11 07:38:19,144][25689] Fps is (10 sec: 5484.7, 60 sec: 5503.4, 300 sec: 5536.6). Total num frames: 1123637248. Throughput: 0: 5660.7. Samples: 1123639744. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:19,144][25689] Avg episode reward: [(0, '0.857')] [2022-07-11 07:38:19,247][26022] Updated weights on worker 0-0, policy_version 1097303 (0.00081) [2022-07-11 07:38:20,918][26022] Updated weights on worker 0-0, policy_version 1097313 (0.00083) [2022-07-11 07:38:23,035][26022] Updated weights on worker 0-0, policy_version 1097323 (0.00084) [2022-07-11 07:38:24,151][25689] Fps is (10 sec: 5621.8, 60 sec: 5504.5, 300 sec: 5538.8). Total num frames: 1123665920. Throughput: 0: 5777.6. Samples: 1123673188. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:24,151][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 07:38:24,768][26022] Updated weights on worker 0-0, policy_version 1097333 (0.00092) [2022-07-11 07:38:26,592][26022] Updated weights on worker 0-0, policy_version 1097343 (0.00085) [2022-07-11 07:38:28,607][26022] Updated weights on worker 0-0, policy_version 1097353 (0.00095) [2022-07-11 07:38:29,205][25689] Fps is (10 sec: 5597.7, 60 sec: 5507.3, 300 sec: 5534.7). Total num frames: 1123693568. Throughput: 0: 4953.2. Samples: 1123689678. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:29,206][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 07:38:30,250][26022] Updated weights on worker 0-0, policy_version 1097363 (0.00088) [2022-07-11 07:38:32,229][26022] Updated weights on worker 0-0, policy_version 1097373 (0.00088) [2022-07-11 07:38:33,887][26022] Updated weights on worker 0-0, policy_version 1097383 (0.00086) [2022-07-11 07:38:34,226][25689] Fps is (10 sec: 5488.5, 60 sec: 5513.1, 300 sec: 5538.8). Total num frames: 1123721216. Throughput: 0: 5814.7. Samples: 1123723138. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:34,227][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 07:38:35,878][26022] Updated weights on worker 0-0, policy_version 1097393 (0.00087) [2022-07-11 07:38:37,585][26022] Updated weights on worker 0-0, policy_version 1097403 (0.00098) [2022-07-11 07:38:39,312][25689] Fps is (10 sec: 5572.4, 60 sec: 5523.2, 300 sec: 5534.4). Total num frames: 1123749888. Throughput: 0: 5794.5. Samples: 1123756608. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:39,313][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 07:38:39,402][26022] Updated weights on worker 0-0, policy_version 1097413 (0.00086) [2022-07-11 07:38:41,406][26022] Updated weights on worker 0-0, policy_version 1097423 (0.00478) [2022-07-11 07:38:43,248][26022] Updated weights on worker 0-0, policy_version 1097433 (0.00092) [2022-07-11 07:38:44,326][25689] Fps is (10 sec: 5677.5, 60 sec: 5526.2, 300 sec: 5536.3). Total num frames: 1123778560. Throughput: 0: 4957.9. Samples: 1123773214. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:44,326][25689] Avg episode reward: [(0, '0.122')] [2022-07-11 07:38:45,125][26022] Updated weights on worker 0-0, policy_version 1097443 (0.00091) [2022-07-11 07:38:46,881][26022] Updated weights on worker 0-0, policy_version 1097453 (0.00087) [2022-07-11 07:38:48,662][26022] Updated weights on worker 0-0, policy_version 1097463 (0.00084) [2022-07-11 07:38:49,388][25689] Fps is (10 sec: 5487.6, 60 sec: 5514.7, 300 sec: 5532.1). Total num frames: 1123805184. Throughput: 0: 5793.4. Samples: 1123806608. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:49,390][25689] Avg episode reward: [(0, '1.235')] [2022-07-11 07:38:50,396][26022] Updated weights on worker 0-0, policy_version 1097473 (0.00096) [2022-07-11 07:38:52,455][26022] Updated weights on worker 0-0, policy_version 1097483 (0.00086) [2022-07-11 07:38:54,253][26022] Updated weights on worker 0-0, policy_version 1097493 (0.00084) [2022-07-11 07:38:54,453][25689] Fps is (10 sec: 5358.9, 60 sec: 5495.2, 300 sec: 5535.3). Total num frames: 1123832832. Throughput: 0: 5785.0. Samples: 1123840154. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:54,455][25689] Avg episode reward: [(0, '1.069')] [2022-07-11 07:38:56,031][26022] Updated weights on worker 0-0, policy_version 1097503 (0.00087) [2022-07-11 07:38:57,860][26022] Updated weights on worker 0-0, policy_version 1097513 (0.00084) [2022-07-11 07:38:59,487][25689] Fps is (10 sec: 5678.0, 60 sec: 5530.0, 300 sec: 5541.5). Total num frames: 1123862528. Throughput: 0: 4961.3. Samples: 1123856706. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:38:59,491][25689] Avg episode reward: [(0, '1.137')] [2022-07-11 07:38:59,847][26022] Updated weights on worker 0-0, policy_version 1097523 (0.00091) [2022-07-11 07:39:01,663][26022] Updated weights on worker 0-0, policy_version 1097533 (0.00092) [2022-07-11 07:39:03,788][26022] Updated weights on worker 0-0, policy_version 1097543 (0.00090) [2022-07-11 07:39:04,506][25689] Fps is (10 sec: 5500.3, 60 sec: 5531.4, 300 sec: 5540.6). Total num frames: 1123888128. Throughput: 0: 5696.0. Samples: 1123888164. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:04,508][25689] Avg episode reward: [(0, '1.162')] [2022-07-11 07:39:05,608][26022] Updated weights on worker 0-0, policy_version 1097553 (0.00088) [2022-07-11 07:39:07,483][26022] Updated weights on worker 0-0, policy_version 1097563 (0.00086) [2022-07-11 07:39:09,267][26022] Updated weights on worker 0-0, policy_version 1097573 (0.00091) [2022-07-11 07:39:09,638][25689] Fps is (10 sec: 5245.7, 60 sec: 5510.8, 300 sec: 5529.1). Total num frames: 1123915776. Throughput: 0: 5671.6. Samples: 1123921458. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:09,639][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 07:39:11,320][26022] Updated weights on worker 0-0, policy_version 1097583 (0.00092) [2022-07-11 07:39:13,040][26022] Updated weights on worker 0-0, policy_version 1097593 (0.00111) [2022-07-11 07:39:14,645][25689] Fps is (10 sec: 5453.9, 60 sec: 5516.6, 300 sec: 5533.2). Total num frames: 1123943424. Throughput: 0: 4838.4. Samples: 1123937852. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:14,647][25689] Avg episode reward: [(0, '1.223')] [2022-07-11 07:39:15,208][26022] Updated weights on worker 0-0, policy_version 1097603 (0.00094) [2022-07-11 07:39:16,651][26022] Updated weights on worker 0-0, policy_version 1097613 (0.00087) [2022-07-11 07:39:18,954][26022] Updated weights on worker 0-0, policy_version 1097623 (0.00087) [2022-07-11 07:39:19,701][25689] Fps is (10 sec: 5495.3, 60 sec: 5512.5, 300 sec: 5529.5). Total num frames: 1123971072. Throughput: 0: 5677.2. Samples: 1123971460. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:19,701][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 07:39:20,395][26022] Updated weights on worker 0-0, policy_version 1097633 (0.00090) [2022-07-11 07:39:22,427][26022] Updated weights on worker 0-0, policy_version 1097643 (0.00076) [2022-07-11 07:39:24,038][26022] Updated weights on worker 0-0, policy_version 1097653 (0.00089) [2022-07-11 07:39:24,714][25689] Fps is (10 sec: 5593.7, 60 sec: 5512.0, 300 sec: 5533.9). Total num frames: 1123999744. Throughput: 0: 5781.3. Samples: 1124004988. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:24,714][25689] Avg episode reward: [(0, '0.496')] [2022-07-11 07:39:25,946][26022] Updated weights on worker 0-0, policy_version 1097663 (0.00089) [2022-07-11 07:39:27,844][26022] Updated weights on worker 0-0, policy_version 1097673 (0.00085) [2022-07-11 07:39:29,622][26022] Updated weights on worker 0-0, policy_version 1097683 (0.00099) [2022-07-11 07:39:29,808][25689] Fps is (10 sec: 5571.9, 60 sec: 5508.3, 300 sec: 5532.8). Total num frames: 1124027392. Throughput: 0: 4967.2. Samples: 1124021648. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:29,809][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 07:39:31,427][26022] Updated weights on worker 0-0, policy_version 1097693 (0.00095) [2022-07-11 07:39:33,377][26022] Updated weights on worker 0-0, policy_version 1097703 (0.00085) [2022-07-11 07:39:34,819][25689] Fps is (10 sec: 5573.4, 60 sec: 5526.1, 300 sec: 5526.2). Total num frames: 1124056064. Throughput: 0: 5803.5. Samples: 1124054930. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:34,821][25689] Avg episode reward: [(0, '1.344')] [2022-07-11 07:39:35,140][26022] Updated weights on worker 0-0, policy_version 1097713 (0.00087) [2022-07-11 07:39:37,010][26022] Updated weights on worker 0-0, policy_version 1097723 (0.00094) [2022-07-11 07:39:38,712][26022] Updated weights on worker 0-0, policy_version 1097733 (0.00085) [2022-07-11 07:39:39,831][25689] Fps is (10 sec: 5619.1, 60 sec: 5516.0, 300 sec: 5529.9). Total num frames: 1124083712. Throughput: 0: 5815.2. Samples: 1124088524. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:39,832][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 07:39:40,767][26022] Updated weights on worker 0-0, policy_version 1097743 (0.00094) [2022-07-11 07:39:42,374][26022] Updated weights on worker 0-0, policy_version 1097753 (0.00560) [2022-07-11 07:39:44,410][26022] Updated weights on worker 0-0, policy_version 1097763 (0.00091) [2022-07-11 07:39:44,851][25689] Fps is (10 sec: 5512.0, 60 sec: 5498.5, 300 sec: 5532.2). Total num frames: 1124111360. Throughput: 0: 4977.8. Samples: 1124105226. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:44,851][25689] Avg episode reward: [(0, '1.074')] [2022-07-11 07:39:46,334][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:39:46,347][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001097773_1124119552.pth [2022-07-11 07:39:46,348][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001095826_1122125824.pth [2022-07-11 07:39:46,350][26022] Updated weights on worker 0-0, policy_version 1097773 (0.00097) [2022-07-11 07:39:48,195][26022] Updated weights on worker 0-0, policy_version 1097783 (0.00093) [2022-07-11 07:39:49,869][26022] Updated weights on worker 0-0, policy_version 1097793 (0.00085) [2022-07-11 07:39:49,949][25689] Fps is (10 sec: 5566.4, 60 sec: 5529.1, 300 sec: 5525.3). Total num frames: 1124140032. Throughput: 0: 5787.4. Samples: 1124138210. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:49,950][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 07:39:51,890][26022] Updated weights on worker 0-0, policy_version 1097803 (0.00077) [2022-07-11 07:39:53,567][26022] Updated weights on worker 0-0, policy_version 1097813 (0.00092) [2022-07-11 07:39:54,968][25689] Fps is (10 sec: 5566.7, 60 sec: 5533.3, 300 sec: 5528.7). Total num frames: 1124167680. Throughput: 0: 5800.9. Samples: 1124171814. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:54,969][25689] Avg episode reward: [(0, '1.794')] [2022-07-11 07:39:55,456][26022] Updated weights on worker 0-0, policy_version 1097823 (0.00090) [2022-07-11 07:39:57,246][26022] Updated weights on worker 0-0, policy_version 1097833 (0.00081) [2022-07-11 07:39:59,155][26022] Updated weights on worker 0-0, policy_version 1097843 (0.00086) [2022-07-11 07:39:59,986][25689] Fps is (10 sec: 5509.2, 60 sec: 5500.9, 300 sec: 5533.5). Total num frames: 1124195328. Throughput: 0: 4956.6. Samples: 1124188424. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:39:59,987][25689] Avg episode reward: [(0, '1.749')] [2022-07-11 07:40:01,091][26022] Updated weights on worker 0-0, policy_version 1097853 (0.00094) [2022-07-11 07:40:03,143][26022] Updated weights on worker 0-0, policy_version 1097863 (0.00093) [2022-07-11 07:40:04,987][26022] Updated weights on worker 0-0, policy_version 1097873 (0.00088) [2022-07-11 07:40:05,010][25689] Fps is (10 sec: 5404.8, 60 sec: 5517.5, 300 sec: 5527.3). Total num frames: 1124221952. Throughput: 0: 5689.9. Samples: 1124219928. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:40:05,010][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 07:40:06,956][26022] Updated weights on worker 0-0, policy_version 1097883 (0.00090) [2022-07-11 07:40:08,747][26022] Updated weights on worker 0-0, policy_version 1097893 (0.00081) [2022-07-11 07:40:10,111][25689] Fps is (10 sec: 5360.4, 60 sec: 5520.3, 300 sec: 5529.2). Total num frames: 1124249600. Throughput: 0: 5726.3. Samples: 1124253662. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:40:10,111][25689] Avg episode reward: [(0, '1.757')] [2022-07-11 07:40:10,481][26022] Updated weights on worker 0-0, policy_version 1097903 (0.00090) [2022-07-11 07:40:12,339][26022] Updated weights on worker 0-0, policy_version 1097913 (0.00094) [2022-07-11 07:40:14,047][26022] Updated weights on worker 0-0, policy_version 1097923 (0.00086) [2022-07-11 07:40:15,170][25689] Fps is (10 sec: 5442.2, 60 sec: 5515.5, 300 sec: 5521.4). Total num frames: 1124277248. Throughput: 0: 5694.1. Samples: 1124286846. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:40:15,172][25689] Avg episode reward: [(0, '2.120')] [2022-07-11 07:40:16,122][26022] Updated weights on worker 0-0, policy_version 1097933 (0.00104) [2022-07-11 07:40:17,990][26022] Updated weights on worker 0-0, policy_version 1097943 (0.00052) [2022-07-11 07:40:19,637][26022] Updated weights on worker 0-0, policy_version 1097953 (0.00680) [2022-07-11 07:40:20,209][25689] Fps is (10 sec: 5577.3, 60 sec: 5533.9, 300 sec: 5528.4). Total num frames: 1124305920. Throughput: 0: 5699.0. Samples: 1124303674. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:40:20,209][25689] Avg episode reward: [(0, '1.852')] [2022-07-11 07:40:21,703][26022] Updated weights on worker 0-0, policy_version 1097963 (0.00091) [2022-07-11 07:40:23,325][26022] Updated weights on worker 0-0, policy_version 1097973 (0.00090) [2022-07-11 07:40:25,225][25689] Fps is (10 sec: 5601.3, 60 sec: 5516.7, 300 sec: 5522.4). Total num frames: 1124333568. Throughput: 0: 5803.5. Samples: 1124337248. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:40:25,226][25689] Avg episode reward: [(0, '1.314')] [2022-07-11 07:40:25,325][26022] Updated weights on worker 0-0, policy_version 1097983 (0.00088) [2022-07-11 07:40:27,171][26022] Updated weights on worker 0-0, policy_version 1097993 (0.00088) [2022-07-11 07:40:29,005][26022] Updated weights on worker 0-0, policy_version 1098003 (0.00092) [2022-07-11 07:40:30,313][25689] Fps is (10 sec: 5574.2, 60 sec: 5534.3, 300 sec: 5525.6). Total num frames: 1124362240. Throughput: 0: 5779.9. Samples: 1124370426. Policy #0 lag: (min: 0.0, avg: 7.9, max: 18.0) [2022-07-11 07:40:30,314][25689] Avg episode reward: [(0, '1.596')] [2022-07-11 07:40:30,734][26022] Updated weights on worker 0-0, policy_version 1098013 (0.00100) [2022-07-11 07:40:32,836][26022] Updated weights on worker 0-0, policy_version 1098023 (0.00086) [2022-07-11 07:40:34,582][26022] Updated weights on worker 0-0, policy_version 1098033 (0.00087) [2022-07-11 07:40:35,315][25689] Fps is (10 sec: 5480.3, 60 sec: 5501.2, 300 sec: 5523.0). Total num frames: 1124388864. Throughput: 0: 4963.4. Samples: 1124386832. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:40:35,324][25689] Avg episode reward: [(0, '1.555')] [2022-07-11 07:40:36,363][26022] Updated weights on worker 0-0, policy_version 1098043 (0.00896) [2022-07-11 07:40:38,119][26022] Updated weights on worker 0-0, policy_version 1098053 (0.00090) [2022-07-11 07:40:39,972][26022] Updated weights on worker 0-0, policy_version 1098063 (0.00082) [2022-07-11 07:40:40,327][25689] Fps is (10 sec: 5624.0, 60 sec: 5535.1, 300 sec: 5519.7). Total num frames: 1124418560. Throughput: 0: 5811.2. Samples: 1124420584. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:40:40,337][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 07:40:41,895][26022] Updated weights on worker 0-0, policy_version 1098073 (0.00085) [2022-07-11 07:40:43,721][26022] Updated weights on worker 0-0, policy_version 1098083 (0.00079) [2022-07-11 07:40:45,352][25689] Fps is (10 sec: 5713.1, 60 sec: 5534.5, 300 sec: 5527.3). Total num frames: 1124446208. Throughput: 0: 5812.1. Samples: 1124454230. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:40:45,353][25689] Avg episode reward: [(0, '1.032')] [2022-07-11 07:40:45,480][26022] Updated weights on worker 0-0, policy_version 1098093 (0.00087) [2022-07-11 07:40:47,456][26022] Updated weights on worker 0-0, policy_version 1098103 (0.00088) [2022-07-11 07:40:49,022][26022] Updated weights on worker 0-0, policy_version 1098113 (0.00089) [2022-07-11 07:40:50,491][25689] Fps is (10 sec: 5440.4, 60 sec: 5514.0, 300 sec: 5521.8). Total num frames: 1124473856. Throughput: 0: 4966.5. Samples: 1124470642. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:40:50,493][25689] Avg episode reward: [(0, '1.046')] [2022-07-11 07:40:51,234][26022] Updated weights on worker 0-0, policy_version 1098123 (0.00096) [2022-07-11 07:40:52,860][26022] Updated weights on worker 0-0, policy_version 1098133 (0.00085) [2022-07-11 07:40:54,927][26022] Updated weights on worker 0-0, policy_version 1098143 (0.00086) [2022-07-11 07:40:55,580][25689] Fps is (10 sec: 5506.6, 60 sec: 5524.5, 300 sec: 5520.7). Total num frames: 1124502528. Throughput: 0: 5784.9. Samples: 1124504064. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:40:55,581][25689] Avg episode reward: [(0, '1.336')] [2022-07-11 07:40:56,483][26022] Updated weights on worker 0-0, policy_version 1098153 (0.00081) [2022-07-11 07:40:58,445][26022] Updated weights on worker 0-0, policy_version 1098163 (0.00092) [2022-07-11 07:41:00,114][26022] Updated weights on worker 0-0, policy_version 1098173 (0.00091) [2022-07-11 07:41:00,626][25689] Fps is (10 sec: 5657.9, 60 sec: 5538.8, 300 sec: 5533.8). Total num frames: 1124531200. Throughput: 0: 5776.0. Samples: 1124537832. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:00,628][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 07:41:02,563][26022] Updated weights on worker 0-0, policy_version 1098183 (0.00086) [2022-07-11 07:41:04,325][26022] Updated weights on worker 0-0, policy_version 1098193 (0.00104) [2022-07-11 07:41:05,646][25689] Fps is (10 sec: 5391.5, 60 sec: 5522.2, 300 sec: 5523.9). Total num frames: 1124556800. Throughput: 0: 4844.4. Samples: 1124552542. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:05,648][25689] Avg episode reward: [(0, '-0.036')] [2022-07-11 07:41:06,106][26022] Updated weights on worker 0-0, policy_version 1098203 (0.00373) [2022-07-11 07:41:08,019][26022] Updated weights on worker 0-0, policy_version 1098213 (0.00090) [2022-07-11 07:41:09,669][26022] Updated weights on worker 0-0, policy_version 1098223 (0.00093) [2022-07-11 07:41:10,707][25689] Fps is (10 sec: 5383.7, 60 sec: 5542.8, 300 sec: 5522.9). Total num frames: 1124585472. Throughput: 0: 5717.9. Samples: 1124586236. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:10,707][25689] Avg episode reward: [(0, '0.182')] [2022-07-11 07:41:11,676][26022] Updated weights on worker 0-0, policy_version 1098233 (0.00082) [2022-07-11 07:41:13,187][26022] Updated weights on worker 0-0, policy_version 1098243 (0.00080) [2022-07-11 07:41:15,247][26022] Updated weights on worker 0-0, policy_version 1098253 (0.00085) [2022-07-11 07:41:15,732][25689] Fps is (10 sec: 5685.7, 60 sec: 5562.9, 300 sec: 5526.0). Total num frames: 1124614144. Throughput: 0: 5758.1. Samples: 1124620100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:15,732][25689] Avg episode reward: [(0, '0.424')] [2022-07-11 07:41:16,900][26022] Updated weights on worker 0-0, policy_version 1098263 (0.00095) [2022-07-11 07:41:18,979][26022] Updated weights on worker 0-0, policy_version 1098273 (0.00090) [2022-07-11 07:41:20,718][26022] Updated weights on worker 0-0, policy_version 1098283 (0.00087) [2022-07-11 07:41:20,818][25689] Fps is (10 sec: 5570.3, 60 sec: 5541.6, 300 sec: 5528.2). Total num frames: 1124641792. Throughput: 0: 4905.6. Samples: 1124636884. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:20,818][25689] Avg episode reward: [(0, '0.701')] [2022-07-11 07:41:22,585][26022] Updated weights on worker 0-0, policy_version 1098293 (0.00087) [2022-07-11 07:41:24,554][26022] Updated weights on worker 0-0, policy_version 1098303 (0.00085) [2022-07-11 07:41:25,845][25689] Fps is (10 sec: 5467.9, 60 sec: 5540.6, 300 sec: 5521.9). Total num frames: 1124669440. Throughput: 0: 5833.5. Samples: 1124670370. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:25,845][25689] Avg episode reward: [(0, '0.665')] [2022-07-11 07:41:26,251][26022] Updated weights on worker 0-0, policy_version 1098313 (0.00095) [2022-07-11 07:41:28,205][26022] Updated weights on worker 0-0, policy_version 1098323 (0.00092) [2022-07-11 07:41:29,934][26022] Updated weights on worker 0-0, policy_version 1098333 (0.00095) [2022-07-11 07:41:30,923][25689] Fps is (10 sec: 5573.5, 60 sec: 5541.5, 300 sec: 5527.8). Total num frames: 1124698112. Throughput: 0: 5803.5. Samples: 1124703558. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:30,923][25689] Avg episode reward: [(0, '1.112')] [2022-07-11 07:41:31,875][26022] Updated weights on worker 0-0, policy_version 1098343 (0.00086) [2022-07-11 07:41:33,840][26022] Updated weights on worker 0-0, policy_version 1098353 (0.00095) [2022-07-11 07:41:35,434][26022] Updated weights on worker 0-0, policy_version 1098363 (0.00082) [2022-07-11 07:41:35,975][25689] Fps is (10 sec: 5660.8, 60 sec: 5570.7, 300 sec: 5527.1). Total num frames: 1124726784. Throughput: 0: 4954.6. Samples: 1124720396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:35,975][25689] Avg episode reward: [(0, '2.049')] [2022-07-11 07:41:37,551][26022] Updated weights on worker 0-0, policy_version 1098373 (0.00083) [2022-07-11 07:41:38,989][26022] Updated weights on worker 0-0, policy_version 1098383 (0.00094) [2022-07-11 07:41:40,995][25689] Fps is (10 sec: 5489.9, 60 sec: 5519.3, 300 sec: 5526.8). Total num frames: 1124753408. Throughput: 0: 5796.8. Samples: 1124753848. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:40,996][25689] Avg episode reward: [(0, '0.741')] [2022-07-11 07:41:41,153][26022] Updated weights on worker 0-0, policy_version 1098393 (0.00084) [2022-07-11 07:41:42,765][26022] Updated weights on worker 0-0, policy_version 1098403 (0.00079) [2022-07-11 07:41:44,487][26022] Updated weights on worker 0-0, policy_version 1098413 (0.00092) [2022-07-11 07:41:46,018][25689] Fps is (10 sec: 5505.9, 60 sec: 5536.5, 300 sec: 5527.8). Total num frames: 1124782080. Throughput: 0: 5802.7. Samples: 1124787428. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:46,019][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 07:41:46,425][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:41:46,437][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001098421_1124783104.pth [2022-07-11 07:41:46,438][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001096478_1122793472.pth [2022-07-11 07:41:46,666][26022] Updated weights on worker 0-0, policy_version 1098423 (0.00087) [2022-07-11 07:41:48,271][26022] Updated weights on worker 0-0, policy_version 1098433 (0.00091) [2022-07-11 07:41:50,155][26022] Updated weights on worker 0-0, policy_version 1098443 (0.00090) [2022-07-11 07:41:51,127][25689] Fps is (10 sec: 5660.0, 60 sec: 5556.1, 300 sec: 5526.2). Total num frames: 1124810752. Throughput: 0: 4967.9. Samples: 1124803928. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:51,127][25689] Avg episode reward: [(0, '0.592')] [2022-07-11 07:41:52,187][26022] Updated weights on worker 0-0, policy_version 1098453 (0.00094) [2022-07-11 07:41:53,761][26022] Updated weights on worker 0-0, policy_version 1098463 (0.00087) [2022-07-11 07:41:55,718][26022] Updated weights on worker 0-0, policy_version 1098473 (0.00081) [2022-07-11 07:41:56,202][25689] Fps is (10 sec: 5530.1, 60 sec: 5540.4, 300 sec: 5525.0). Total num frames: 1124838400. Throughput: 0: 5791.0. Samples: 1124837532. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:41:56,203][25689] Avg episode reward: [(0, '-0.048')] [2022-07-11 07:41:57,370][26022] Updated weights on worker 0-0, policy_version 1098483 (0.00087) [2022-07-11 07:41:59,223][26022] Updated weights on worker 0-0, policy_version 1098493 (0.00089) [2022-07-11 07:42:01,215][25689] Fps is (10 sec: 5481.3, 60 sec: 5526.6, 300 sec: 5531.8). Total num frames: 1124866048. Throughput: 0: 5812.4. Samples: 1124871372. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:01,215][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 07:42:01,293][26022] Updated weights on worker 0-0, policy_version 1098503 (0.00093) [2022-07-11 07:42:03,313][26022] Updated weights on worker 0-0, policy_version 1098513 (0.00085) [2022-07-11 07:42:05,234][26022] Updated weights on worker 0-0, policy_version 1098523 (0.00092) [2022-07-11 07:42:06,251][25689] Fps is (10 sec: 5502.7, 60 sec: 5558.9, 300 sec: 5532.8). Total num frames: 1124893696. Throughput: 0: 4875.2. Samples: 1124886068. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:06,252][25689] Avg episode reward: [(0, '0.464')] [2022-07-11 07:42:07,085][26022] Updated weights on worker 0-0, policy_version 1098533 (0.00092) [2022-07-11 07:42:08,836][26022] Updated weights on worker 0-0, policy_version 1098543 (0.00086) [2022-07-11 07:42:10,868][26022] Updated weights on worker 0-0, policy_version 1098553 (0.00096) [2022-07-11 07:42:11,380][25689] Fps is (10 sec: 5439.8, 60 sec: 5535.8, 300 sec: 5527.0). Total num frames: 1124921344. Throughput: 0: 5710.6. Samples: 1124919586. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:11,381][25689] Avg episode reward: [(0, '1.180')] [2022-07-11 07:42:12,481][26022] Updated weights on worker 0-0, policy_version 1098563 (0.00091) [2022-07-11 07:42:14,414][26022] Updated weights on worker 0-0, policy_version 1098573 (0.00115) [2022-07-11 07:42:16,179][26022] Updated weights on worker 0-0, policy_version 1098583 (0.00094) [2022-07-11 07:42:16,410][25689] Fps is (10 sec: 5544.1, 60 sec: 5535.3, 300 sec: 5526.9). Total num frames: 1124950016. Throughput: 0: 5725.2. Samples: 1124953224. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:16,416][25689] Avg episode reward: [(0, '1.158')] [2022-07-11 07:42:17,966][26022] Updated weights on worker 0-0, policy_version 1098593 (0.00085) [2022-07-11 07:42:20,046][26022] Updated weights on worker 0-0, policy_version 1098603 (0.00097) [2022-07-11 07:42:21,479][25689] Fps is (10 sec: 5678.2, 60 sec: 5553.8, 300 sec: 5526.0). Total num frames: 1124978688. Throughput: 0: 5691.5. Samples: 1124986704. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:21,479][25689] Avg episode reward: [(0, '0.128')] [2022-07-11 07:42:21,586][26022] Updated weights on worker 0-0, policy_version 1098613 (0.00079) [2022-07-11 07:42:23,575][26022] Updated weights on worker 0-0, policy_version 1098623 (0.00087) [2022-07-11 07:42:25,242][26022] Updated weights on worker 0-0, policy_version 1098633 (0.00085) [2022-07-11 07:42:26,519][25689] Fps is (10 sec: 5571.3, 60 sec: 5552.6, 300 sec: 5526.8). Total num frames: 1125006336. Throughput: 0: 5803.2. Samples: 1125003684. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:26,520][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 07:42:27,349][26022] Updated weights on worker 0-0, policy_version 1098643 (0.00086) [2022-07-11 07:42:28,907][26022] Updated weights on worker 0-0, policy_version 1098653 (0.00090) [2022-07-11 07:42:31,093][26022] Updated weights on worker 0-0, policy_version 1098663 (0.00082) [2022-07-11 07:42:31,620][25689] Fps is (10 sec: 5452.7, 60 sec: 5533.6, 300 sec: 5526.5). Total num frames: 1125033984. Throughput: 0: 5794.2. Samples: 1125036862. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:31,620][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 07:42:32,376][26022] Updated weights on worker 0-0, policy_version 1098673 (0.00081) [2022-07-11 07:42:34,679][26022] Updated weights on worker 0-0, policy_version 1098683 (0.00085) [2022-07-11 07:42:36,367][26022] Updated weights on worker 0-0, policy_version 1098693 (0.00080) [2022-07-11 07:42:36,686][25689] Fps is (10 sec: 5539.5, 60 sec: 5532.3, 300 sec: 5528.9). Total num frames: 1125062656. Throughput: 0: 5794.2. Samples: 1125070708. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:36,686][25689] Avg episode reward: [(0, '0.787')] [2022-07-11 07:42:38,143][26022] Updated weights on worker 0-0, policy_version 1098703 (0.00091) [2022-07-11 07:42:39,921][26022] Updated weights on worker 0-0, policy_version 1098713 (0.00092) [2022-07-11 07:42:41,726][25689] Fps is (10 sec: 5674.4, 60 sec: 5564.3, 300 sec: 5529.1). Total num frames: 1125091328. Throughput: 0: 4985.3. Samples: 1125087642. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:41,726][25689] Avg episode reward: [(0, '0.314')] [2022-07-11 07:42:41,743][26022] Updated weights on worker 0-0, policy_version 1098723 (0.00085) [2022-07-11 07:42:43,789][26022] Updated weights on worker 0-0, policy_version 1098733 (0.00088) [2022-07-11 07:42:45,485][26022] Updated weights on worker 0-0, policy_version 1098743 (0.00471) [2022-07-11 07:42:46,755][25689] Fps is (10 sec: 5593.4, 60 sec: 5546.8, 300 sec: 5530.8). Total num frames: 1125118976. Throughput: 0: 5794.6. Samples: 1125120944. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:46,755][25689] Avg episode reward: [(0, '0.539')] [2022-07-11 07:42:47,415][26022] Updated weights on worker 0-0, policy_version 1098753 (0.00086) [2022-07-11 07:42:49,303][26022] Updated weights on worker 0-0, policy_version 1098763 (0.00086) [2022-07-11 07:42:51,117][26022] Updated weights on worker 0-0, policy_version 1098773 (0.00095) [2022-07-11 07:42:51,859][25689] Fps is (10 sec: 5457.0, 60 sec: 5530.4, 300 sec: 5526.1). Total num frames: 1125146624. Throughput: 0: 5804.1. Samples: 1125154330. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:51,859][25689] Avg episode reward: [(0, '1.285')] [2022-07-11 07:42:52,783][26022] Updated weights on worker 0-0, policy_version 1098783 (0.00086) [2022-07-11 07:42:54,833][26022] Updated weights on worker 0-0, policy_version 1098793 (0.00093) [2022-07-11 07:42:56,499][26022] Updated weights on worker 0-0, policy_version 1098803 (0.00081) [2022-07-11 07:42:56,878][25689] Fps is (10 sec: 5563.6, 60 sec: 5552.4, 300 sec: 5530.0). Total num frames: 1125175296. Throughput: 0: 4962.0. Samples: 1125170900. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:42:56,878][25689] Avg episode reward: [(0, '1.325')] [2022-07-11 07:42:58,627][26022] Updated weights on worker 0-0, policy_version 1098813 (0.00096) [2022-07-11 07:43:00,362][26022] Updated weights on worker 0-0, policy_version 1098823 (0.00089) [2022-07-11 07:43:01,899][25689] Fps is (10 sec: 5507.4, 60 sec: 5534.7, 300 sec: 5533.7). Total num frames: 1125201920. Throughput: 0: 5783.7. Samples: 1125204320. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:01,900][25689] Avg episode reward: [(0, '1.346')] [2022-07-11 07:43:02,416][26022] Updated weights on worker 0-0, policy_version 1098833 (0.00097) [2022-07-11 07:43:04,369][26022] Updated weights on worker 0-0, policy_version 1098843 (0.00085) [2022-07-11 07:43:06,064][26022] Updated weights on worker 0-0, policy_version 1098853 (0.00094) [2022-07-11 07:43:06,939][25689] Fps is (10 sec: 5394.2, 60 sec: 5534.4, 300 sec: 5531.2). Total num frames: 1125229568. Throughput: 0: 5684.8. Samples: 1125235686. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:06,939][25689] Avg episode reward: [(0, '0.311')] [2022-07-11 07:43:08,132][26022] Updated weights on worker 0-0, policy_version 1098863 (0.00094) [2022-07-11 07:43:09,902][26022] Updated weights on worker 0-0, policy_version 1098873 (0.00085) [2022-07-11 07:43:11,707][26022] Updated weights on worker 0-0, policy_version 1098883 (0.00088) [2022-07-11 07:43:11,995][25689] Fps is (10 sec: 5476.9, 60 sec: 5541.0, 300 sec: 5531.5). Total num frames: 1125257216. Throughput: 0: 4867.9. Samples: 1125252352. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:11,996][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 07:43:13,563][26022] Updated weights on worker 0-0, policy_version 1098893 (0.00084) [2022-07-11 07:43:15,527][26022] Updated weights on worker 0-0, policy_version 1098903 (0.00086) [2022-07-11 07:43:17,004][25689] Fps is (10 sec: 5493.9, 60 sec: 5526.1, 300 sec: 5531.5). Total num frames: 1125284864. Throughput: 0: 5698.6. Samples: 1125285590. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:17,004][25689] Avg episode reward: [(0, '-0.078')] [2022-07-11 07:43:17,212][26022] Updated weights on worker 0-0, policy_version 1098913 (0.00087) [2022-07-11 07:43:19,190][26022] Updated weights on worker 0-0, policy_version 1098923 (0.00106) [2022-07-11 07:43:20,805][26022] Updated weights on worker 0-0, policy_version 1098933 (0.00086) [2022-07-11 07:43:22,024][25689] Fps is (10 sec: 5513.5, 60 sec: 5513.6, 300 sec: 5527.8). Total num frames: 1125312512. Throughput: 0: 5705.3. Samples: 1125319142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:22,025][25689] Avg episode reward: [(0, '-0.070')] [2022-07-11 07:43:22,863][26022] Updated weights on worker 0-0, policy_version 1098943 (0.00090) [2022-07-11 07:43:24,752][26022] Updated weights on worker 0-0, policy_version 1098953 (0.00092) [2022-07-11 07:43:26,560][26022] Updated weights on worker 0-0, policy_version 1098963 (0.00086) [2022-07-11 07:43:27,063][25689] Fps is (10 sec: 5599.0, 60 sec: 5530.7, 300 sec: 5531.6). Total num frames: 1125341184. Throughput: 0: 4975.5. Samples: 1125335812. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:27,064][25689] Avg episode reward: [(0, '0.224')] [2022-07-11 07:43:28,434][26022] Updated weights on worker 0-0, policy_version 1098973 (0.00087) [2022-07-11 07:43:30,153][26022] Updated weights on worker 0-0, policy_version 1098983 (0.00090) [2022-07-11 07:43:32,165][25689] Fps is (10 sec: 5554.2, 60 sec: 5530.6, 300 sec: 5530.0). Total num frames: 1125368832. Throughput: 0: 5761.1. Samples: 1125368550. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:32,166][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 07:43:32,172][26022] Updated weights on worker 0-0, policy_version 1098993 (0.00090) [2022-07-11 07:43:33,955][26022] Updated weights on worker 0-0, policy_version 1099003 (0.00082) [2022-07-11 07:43:35,917][26022] Updated weights on worker 0-0, policy_version 1099013 (0.00097) [2022-07-11 07:43:37,263][25689] Fps is (10 sec: 5420.9, 60 sec: 5510.7, 300 sec: 5526.4). Total num frames: 1125396480. Throughput: 0: 5732.0. Samples: 1125401718. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:37,264][25689] Avg episode reward: [(0, '1.033')] [2022-07-11 07:43:37,906][26022] Updated weights on worker 0-0, policy_version 1099023 (0.00085) [2022-07-11 07:43:39,643][26022] Updated weights on worker 0-0, policy_version 1099033 (0.00083) [2022-07-11 07:43:41,267][26022] Updated weights on worker 0-0, policy_version 1099043 (0.00104) [2022-07-11 07:43:42,310][25689] Fps is (10 sec: 5450.2, 60 sec: 5493.1, 300 sec: 5522.3). Total num frames: 1125424128. Throughput: 0: 4889.7. Samples: 1125418334. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:42,311][25689] Avg episode reward: [(0, '0.262')] [2022-07-11 07:43:43,427][26022] Updated weights on worker 0-0, policy_version 1099053 (0.00085) [2022-07-11 07:43:45,070][26022] Updated weights on worker 0-0, policy_version 1099063 (0.00087) [2022-07-11 07:43:46,594][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:43:46,611][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001099069_1125446656.pth [2022-07-11 07:43:46,612][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001097127_1123458048.pth [2022-07-11 07:43:47,049][26022] Updated weights on worker 0-0, policy_version 1099073 (0.00083) [2022-07-11 07:43:47,316][25689] Fps is (10 sec: 5500.6, 60 sec: 5495.3, 300 sec: 5526.8). Total num frames: 1125451776. Throughput: 0: 5715.1. Samples: 1125451562. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:47,317][25689] Avg episode reward: [(0, '1.025')] [2022-07-11 07:43:48,744][26022] Updated weights on worker 0-0, policy_version 1099083 (0.00762) [2022-07-11 07:43:50,733][26022] Updated weights on worker 0-0, policy_version 1099093 (0.00088) [2022-07-11 07:43:52,402][25689] Fps is (10 sec: 5479.5, 60 sec: 5496.9, 300 sec: 5526.4). Total num frames: 1125479424. Throughput: 0: 5707.9. Samples: 1125484062. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:52,402][25689] Avg episode reward: [(0, '1.097')] [2022-07-11 07:43:52,837][26022] Updated weights on worker 0-0, policy_version 1099103 (0.00094) [2022-07-11 07:43:54,404][26022] Updated weights on worker 0-0, policy_version 1099113 (0.00089) [2022-07-11 07:43:56,402][26022] Updated weights on worker 0-0, policy_version 1099123 (0.00086) [2022-07-11 07:43:57,419][25689] Fps is (10 sec: 5473.2, 60 sec: 5480.2, 300 sec: 5519.9). Total num frames: 1125507072. Throughput: 0: 4900.7. Samples: 1125500496. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:43:57,419][25689] Avg episode reward: [(0, '0.408')] [2022-07-11 07:43:58,167][26022] Updated weights on worker 0-0, policy_version 1099133 (0.00095) [2022-07-11 07:43:59,930][26022] Updated weights on worker 0-0, policy_version 1099143 (0.00090) [2022-07-11 07:44:02,419][26022] Updated weights on worker 0-0, policy_version 1099153 (0.00083) [2022-07-11 07:44:02,516][25689] Fps is (10 sec: 5264.7, 60 sec: 5456.4, 300 sec: 5518.4). Total num frames: 1125532672. Throughput: 0: 5727.7. Samples: 1125534066. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:44:02,516][25689] Avg episode reward: [(0, '0.670')] [2022-07-11 07:44:04,080][26022] Updated weights on worker 0-0, policy_version 1099163 (0.00094) [2022-07-11 07:44:06,074][26022] Updated weights on worker 0-0, policy_version 1099173 (0.00086) [2022-07-11 07:44:07,523][25689] Fps is (10 sec: 5371.4, 60 sec: 5476.3, 300 sec: 5524.2). Total num frames: 1125561344. Throughput: 0: 5632.4. Samples: 1125565376. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:44:07,523][25689] Avg episode reward: [(0, '-0.418')] [2022-07-11 07:44:07,664][26022] Updated weights on worker 0-0, policy_version 1099183 (0.00092) [2022-07-11 07:44:09,487][26022] Updated weights on worker 0-0, policy_version 1099193 (0.00089) [2022-07-11 07:44:11,411][26022] Updated weights on worker 0-0, policy_version 1099203 (0.00086) [2022-07-11 07:44:12,622][25689] Fps is (10 sec: 5673.9, 60 sec: 5489.3, 300 sec: 5525.9). Total num frames: 1125590016. Throughput: 0: 5685.5. Samples: 1125599028. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:44:12,623][25689] Avg episode reward: [(0, '0.655')] [2022-07-11 07:44:13,195][26022] Updated weights on worker 0-0, policy_version 1099213 (0.00093) [2022-07-11 07:44:15,054][26022] Updated weights on worker 0-0, policy_version 1099223 (0.00102) [2022-07-11 07:44:16,992][26022] Updated weights on worker 0-0, policy_version 1099233 (0.00088) [2022-07-11 07:44:17,663][25689] Fps is (10 sec: 5553.9, 60 sec: 5486.4, 300 sec: 5526.2). Total num frames: 1125617664. Throughput: 0: 5700.9. Samples: 1125615908. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:44:17,664][25689] Avg episode reward: [(0, '0.211')] [2022-07-11 07:44:18,675][26022] Updated weights on worker 0-0, policy_version 1099243 (0.00083) [2022-07-11 07:44:20,732][26022] Updated weights on worker 0-0, policy_version 1099253 (0.00088) [2022-07-11 07:44:22,369][26022] Updated weights on worker 0-0, policy_version 1099263 (0.00086) [2022-07-11 07:44:22,683][25689] Fps is (10 sec: 5598.3, 60 sec: 5503.4, 300 sec: 5526.1). Total num frames: 1125646336. Throughput: 0: 5717.6. Samples: 1125649372. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:44:22,683][25689] Avg episode reward: [(0, '-0.547')] [2022-07-11 07:44:24,275][26022] Updated weights on worker 0-0, policy_version 1099273 (0.00088) [2022-07-11 07:44:25,916][26022] Updated weights on worker 0-0, policy_version 1099283 (0.00086) [2022-07-11 07:44:27,696][25689] Fps is (10 sec: 5613.5, 60 sec: 5488.7, 300 sec: 5527.6). Total num frames: 1125673984. Throughput: 0: 5837.2. Samples: 1125683134. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 07:44:27,697][25689] Avg episode reward: [(0, '-0.897')] [2022-07-11 07:44:28,105][26022] Updated weights on worker 0-0, policy_version 1099293 (0.00130) [2022-07-11 07:44:29,763][26022] Updated weights on worker 0-0, policy_version 1099303 (0.00081) [2022-07-11 07:44:31,767][26022] Updated weights on worker 0-0, policy_version 1099313 (0.00085) [2022-07-11 07:44:32,746][25689] Fps is (10 sec: 5596.8, 60 sec: 5510.4, 300 sec: 5526.8). Total num frames: 1125702656. Throughput: 0: 5010.5. Samples: 1125699856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:44:32,746][25689] Avg episode reward: [(0, '-0.683')] [2022-07-11 07:44:33,186][26022] Updated weights on worker 0-0, policy_version 1099323 (0.00084) [2022-07-11 07:44:35,467][26022] Updated weights on worker 0-0, policy_version 1099333 (0.00085) [2022-07-11 07:44:37,171][26022] Updated weights on worker 0-0, policy_version 1099343 (0.00092) [2022-07-11 07:44:37,759][25689] Fps is (10 sec: 5495.4, 60 sec: 5501.2, 300 sec: 5523.4). Total num frames: 1125729280. Throughput: 0: 5836.0. Samples: 1125733184. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:44:37,759][25689] Avg episode reward: [(0, '-0.051')] [2022-07-11 07:44:38,909][26022] Updated weights on worker 0-0, policy_version 1099353 (0.00088) [2022-07-11 07:44:40,975][26022] Updated weights on worker 0-0, policy_version 1099363 (0.00058) [2022-07-11 07:44:42,419][26022] Updated weights on worker 0-0, policy_version 1099373 (0.00083) [2022-07-11 07:44:42,792][25689] Fps is (10 sec: 5503.8, 60 sec: 5519.4, 300 sec: 5526.6). Total num frames: 1125757952. Throughput: 0: 5843.5. Samples: 1125766884. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:44:42,794][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 07:44:44,487][26022] Updated weights on worker 0-0, policy_version 1099383 (0.00089) [2022-07-11 07:44:46,496][26022] Updated weights on worker 0-0, policy_version 1099393 (0.00085) [2022-07-11 07:44:47,820][25689] Fps is (10 sec: 5699.2, 60 sec: 5534.3, 300 sec: 5527.9). Total num frames: 1125786624. Throughput: 0: 4995.9. Samples: 1125783672. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:44:47,821][25689] Avg episode reward: [(0, '-0.532')] [2022-07-11 07:44:48,034][26022] Updated weights on worker 0-0, policy_version 1099403 (0.00085) [2022-07-11 07:44:50,049][26022] Updated weights on worker 0-0, policy_version 1099413 (0.00086) [2022-07-11 07:44:51,603][26022] Updated weights on worker 0-0, policy_version 1099423 (0.00088) [2022-07-11 07:44:52,882][25689] Fps is (10 sec: 5480.3, 60 sec: 5519.5, 300 sec: 5523.6). Total num frames: 1125813248. Throughput: 0: 5817.1. Samples: 1125816994. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:44:52,883][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 07:44:53,643][26022] Updated weights on worker 0-0, policy_version 1099433 (0.00900) [2022-07-11 07:44:55,614][26022] Updated weights on worker 0-0, policy_version 1099443 (0.00081) [2022-07-11 07:44:57,326][26022] Updated weights on worker 0-0, policy_version 1099453 (0.00090) [2022-07-11 07:44:57,889][25689] Fps is (10 sec: 5593.6, 60 sec: 5554.3, 300 sec: 5530.7). Total num frames: 1125842944. Throughput: 0: 5837.4. Samples: 1125850694. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:44:57,889][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 07:44:59,177][26022] Updated weights on worker 0-0, policy_version 1099463 (0.00084) [2022-07-11 07:45:00,858][26022] Updated weights on worker 0-0, policy_version 1099473 (0.00091) [2022-07-11 07:45:02,907][25689] Fps is (10 sec: 5515.8, 60 sec: 5561.6, 300 sec: 5527.4). Total num frames: 1125868544. Throughput: 0: 5000.7. Samples: 1125867472. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:02,908][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 07:45:02,984][26022] Updated weights on worker 0-0, policy_version 1099483 (0.00091) [2022-07-11 07:45:05,098][26022] Updated weights on worker 0-0, policy_version 1099493 (0.00092) [2022-07-11 07:45:07,042][26022] Updated weights on worker 0-0, policy_version 1099503 (0.00083) [2022-07-11 07:45:07,924][25689] Fps is (10 sec: 5408.1, 60 sec: 5560.7, 300 sec: 5532.4). Total num frames: 1125897216. Throughput: 0: 5738.7. Samples: 1125899042. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:07,925][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 07:45:08,728][26022] Updated weights on worker 0-0, policy_version 1099513 (0.00084) [2022-07-11 07:45:10,484][26022] Updated weights on worker 0-0, policy_version 1099523 (0.00088) [2022-07-11 07:45:12,323][26022] Updated weights on worker 0-0, policy_version 1099533 (0.00084) [2022-07-11 07:45:12,981][25689] Fps is (10 sec: 5590.9, 60 sec: 5547.7, 300 sec: 5532.4). Total num frames: 1125924864. Throughput: 0: 5758.7. Samples: 1125932736. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:12,981][25689] Avg episode reward: [(0, '0.705')] [2022-07-11 07:45:14,307][26022] Updated weights on worker 0-0, policy_version 1099543 (0.00081) [2022-07-11 07:45:16,204][26022] Updated weights on worker 0-0, policy_version 1099553 (0.00085) [2022-07-11 07:45:17,743][26022] Updated weights on worker 0-0, policy_version 1099563 (0.00086) [2022-07-11 07:45:18,005][25689] Fps is (10 sec: 5587.1, 60 sec: 5566.2, 300 sec: 5532.7). Total num frames: 1125953536. Throughput: 0: 4906.0. Samples: 1125949382. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:18,005][25689] Avg episode reward: [(0, '1.589')] [2022-07-11 07:45:19,872][26022] Updated weights on worker 0-0, policy_version 1099573 (0.00088) [2022-07-11 07:45:21,450][26022] Updated weights on worker 0-0, policy_version 1099583 (0.00095) [2022-07-11 07:45:23,027][25689] Fps is (10 sec: 5606.3, 60 sec: 5549.0, 300 sec: 5532.6). Total num frames: 1125981184. Throughput: 0: 5736.0. Samples: 1125982876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:23,027][25689] Avg episode reward: [(0, '1.763')] [2022-07-11 07:45:23,382][26022] Updated weights on worker 0-0, policy_version 1099593 (0.00084) [2022-07-11 07:45:25,140][26022] Updated weights on worker 0-0, policy_version 1099603 (0.00084) [2022-07-11 07:45:27,003][26022] Updated weights on worker 0-0, policy_version 1099613 (0.00091) [2022-07-11 07:45:28,101][25689] Fps is (10 sec: 5477.1, 60 sec: 5543.5, 300 sec: 5529.4). Total num frames: 1126008832. Throughput: 0: 5819.1. Samples: 1126016450. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:28,101][25689] Avg episode reward: [(0, '1.534')] [2022-07-11 07:45:28,830][26022] Updated weights on worker 0-0, policy_version 1099623 (0.00087) [2022-07-11 07:45:30,939][26022] Updated weights on worker 0-0, policy_version 1099633 (0.00090) [2022-07-11 07:45:32,342][26022] Updated weights on worker 0-0, policy_version 1099643 (0.00088) [2022-07-11 07:45:33,178][25689] Fps is (10 sec: 5547.8, 60 sec: 5540.8, 300 sec: 5534.9). Total num frames: 1126037504. Throughput: 0: 4978.2. Samples: 1126033284. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:33,179][25689] Avg episode reward: [(0, '0.586')] [2022-07-11 07:45:34,630][26022] Updated weights on worker 0-0, policy_version 1099653 (0.00087) [2022-07-11 07:45:36,096][26022] Updated weights on worker 0-0, policy_version 1099663 (0.00092) [2022-07-11 07:45:38,219][25689] Fps is (10 sec: 5464.8, 60 sec: 5538.3, 300 sec: 5524.0). Total num frames: 1126064128. Throughput: 0: 5808.2. Samples: 1126066792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:38,220][25689] Avg episode reward: [(0, '0.943')] [2022-07-11 07:45:38,287][26022] Updated weights on worker 0-0, policy_version 1099673 (0.00087) [2022-07-11 07:45:39,635][26022] Updated weights on worker 0-0, policy_version 1099683 (0.00344) [2022-07-11 07:45:41,691][26022] Updated weights on worker 0-0, policy_version 1099693 (0.00089) [2022-07-11 07:45:43,250][25689] Fps is (10 sec: 5693.6, 60 sec: 5572.4, 300 sec: 5534.2). Total num frames: 1126094848. Throughput: 0: 5814.3. Samples: 1126100462. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:43,252][25689] Avg episode reward: [(0, '0.861')] [2022-07-11 07:45:43,387][26022] Updated weights on worker 0-0, policy_version 1099703 (0.00097) [2022-07-11 07:45:45,425][26022] Updated weights on worker 0-0, policy_version 1099713 (0.00081) [2022-07-11 07:45:46,826][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:45:46,834][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001099721_1126114304.pth [2022-07-11 07:45:46,839][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001097773_1124119552.pth [2022-07-11 07:45:47,211][26022] Updated weights on worker 0-0, policy_version 1099723 (0.00113) [2022-07-11 07:45:48,261][25689] Fps is (10 sec: 5812.7, 60 sec: 5557.1, 300 sec: 5536.6). Total num frames: 1126122496. Throughput: 0: 4997.1. Samples: 1126117192. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:48,261][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 07:45:49,014][26022] Updated weights on worker 0-0, policy_version 1099733 (0.00089) [2022-07-11 07:45:50,793][26022] Updated weights on worker 0-0, policy_version 1099743 (0.00090) [2022-07-11 07:45:52,835][26022] Updated weights on worker 0-0, policy_version 1099753 (0.00088) [2022-07-11 07:45:53,353][25689] Fps is (10 sec: 5473.3, 60 sec: 5571.2, 300 sec: 5533.1). Total num frames: 1126150144. Throughput: 0: 5817.3. Samples: 1126150646. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:53,353][25689] Avg episode reward: [(0, '0.082')] [2022-07-11 07:45:54,527][26022] Updated weights on worker 0-0, policy_version 1099763 (0.00092) [2022-07-11 07:45:56,565][26022] Updated weights on worker 0-0, policy_version 1099773 (0.00084) [2022-07-11 07:45:58,187][26022] Updated weights on worker 0-0, policy_version 1099783 (0.00083) [2022-07-11 07:45:58,380][25689] Fps is (10 sec: 5464.4, 60 sec: 5535.5, 300 sec: 5530.0). Total num frames: 1126177792. Throughput: 0: 5810.7. Samples: 1126183940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:45:58,381][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 07:46:00,124][26022] Updated weights on worker 0-0, policy_version 1099793 (0.00087) [2022-07-11 07:46:02,295][26022] Updated weights on worker 0-0, policy_version 1099803 (0.00088) [2022-07-11 07:46:03,410][25689] Fps is (10 sec: 5294.4, 60 sec: 5534.4, 300 sec: 5529.9). Total num frames: 1126203392. Throughput: 0: 4972.0. Samples: 1126200698. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:03,411][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 07:46:04,204][26022] Updated weights on worker 0-0, policy_version 1099813 (0.00086) [2022-07-11 07:46:05,747][26022] Updated weights on worker 0-0, policy_version 1099823 (0.00091) [2022-07-11 07:46:07,962][26022] Updated weights on worker 0-0, policy_version 1099833 (0.00081) [2022-07-11 07:46:08,416][25689] Fps is (10 sec: 5407.7, 60 sec: 5535.4, 300 sec: 5530.9). Total num frames: 1126232064. Throughput: 0: 5694.5. Samples: 1126231966. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:08,416][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 07:46:09,627][26022] Updated weights on worker 0-0, policy_version 1099843 (0.00088) [2022-07-11 07:46:11,527][26022] Updated weights on worker 0-0, policy_version 1099853 (0.00083) [2022-07-11 07:46:13,228][26022] Updated weights on worker 0-0, policy_version 1099863 (0.00091) [2022-07-11 07:46:13,512][25689] Fps is (10 sec: 5676.4, 60 sec: 5548.7, 300 sec: 5529.5). Total num frames: 1126260736. Throughput: 0: 5700.3. Samples: 1126265562. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:13,513][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 07:46:15,187][26022] Updated weights on worker 0-0, policy_version 1099873 (0.00087) [2022-07-11 07:46:16,954][26022] Updated weights on worker 0-0, policy_version 1099883 (0.00100) [2022-07-11 07:46:18,522][25689] Fps is (10 sec: 5471.8, 60 sec: 5516.2, 300 sec: 5527.5). Total num frames: 1126287360. Throughput: 0: 4883.6. Samples: 1126282304. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:18,522][25689] Avg episode reward: [(0, '0.261')] [2022-07-11 07:46:19,026][26022] Updated weights on worker 0-0, policy_version 1099893 (0.00085) [2022-07-11 07:46:20,596][26022] Updated weights on worker 0-0, policy_version 1099903 (0.00092) [2022-07-11 07:46:22,660][26022] Updated weights on worker 0-0, policy_version 1099913 (0.00094) [2022-07-11 07:46:23,544][25689] Fps is (10 sec: 5512.2, 60 sec: 5533.1, 300 sec: 5531.1). Total num frames: 1126316032. Throughput: 0: 5710.1. Samples: 1126315664. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:23,545][25689] Avg episode reward: [(0, '-0.202')] [2022-07-11 07:46:24,174][26022] Updated weights on worker 0-0, policy_version 1099923 (0.00084) [2022-07-11 07:46:26,281][26022] Updated weights on worker 0-0, policy_version 1099933 (0.00090) [2022-07-11 07:46:28,067][26022] Updated weights on worker 0-0, policy_version 1099943 (0.00085) [2022-07-11 07:46:28,547][25689] Fps is (10 sec: 5618.0, 60 sec: 5539.6, 300 sec: 5529.0). Total num frames: 1126343680. Throughput: 0: 5815.6. Samples: 1126349038. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:28,547][25689] Avg episode reward: [(0, '-0.201')] [2022-07-11 07:46:29,853][26022] Updated weights on worker 0-0, policy_version 1099953 (0.00085) [2022-07-11 07:46:31,899][26022] Updated weights on worker 0-0, policy_version 1099963 (0.00095) [2022-07-11 07:46:33,437][26022] Updated weights on worker 0-0, policy_version 1099973 (0.00089) [2022-07-11 07:46:33,611][25689] Fps is (10 sec: 5594.6, 60 sec: 5540.9, 300 sec: 5528.8). Total num frames: 1126372352. Throughput: 0: 4977.9. Samples: 1126365610. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:33,611][25689] Avg episode reward: [(0, '-1.335')] [2022-07-11 07:46:35,599][26022] Updated weights on worker 0-0, policy_version 1099983 (0.00092) [2022-07-11 07:46:37,228][26022] Updated weights on worker 0-0, policy_version 1099993 (0.00241) [2022-07-11 07:46:38,700][25689] Fps is (10 sec: 5446.0, 60 sec: 5536.4, 300 sec: 5527.5). Total num frames: 1126398976. Throughput: 0: 5789.1. Samples: 1126399118. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:38,701][25689] Avg episode reward: [(0, '-0.918')] [2022-07-11 07:46:39,018][26022] Updated weights on worker 0-0, policy_version 1100003 (0.00088) [2022-07-11 07:46:40,882][26022] Updated weights on worker 0-0, policy_version 1100013 (0.00088) [2022-07-11 07:46:42,714][26022] Updated weights on worker 0-0, policy_version 1100023 (0.00088) [2022-07-11 07:46:43,794][25689] Fps is (10 sec: 5530.4, 60 sec: 5513.7, 300 sec: 5529.6). Total num frames: 1126428672. Throughput: 0: 5779.7. Samples: 1126432706. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:43,795][25689] Avg episode reward: [(0, '-1.528')] [2022-07-11 07:46:44,664][26022] Updated weights on worker 0-0, policy_version 1100033 (0.00063) [2022-07-11 07:46:46,396][26022] Updated weights on worker 0-0, policy_version 1100043 (0.00093) [2022-07-11 07:46:48,278][26022] Updated weights on worker 0-0, policy_version 1100053 (0.00086) [2022-07-11 07:46:48,858][25689] Fps is (10 sec: 5745.8, 60 sec: 5525.7, 300 sec: 5530.5). Total num frames: 1126457344. Throughput: 0: 5770.8. Samples: 1126466254. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:48,859][25689] Avg episode reward: [(0, '-1.259')] [2022-07-11 07:46:50,122][26022] Updated weights on worker 0-0, policy_version 1100063 (0.00091) [2022-07-11 07:46:51,825][26022] Updated weights on worker 0-0, policy_version 1100073 (0.00087) [2022-07-11 07:46:53,819][26022] Updated weights on worker 0-0, policy_version 1100083 (0.00089) [2022-07-11 07:46:53,905][25689] Fps is (10 sec: 5570.5, 60 sec: 5529.9, 300 sec: 5531.0). Total num frames: 1126484992. Throughput: 0: 5788.5. Samples: 1126483082. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:53,905][25689] Avg episode reward: [(0, '-1.307')] [2022-07-11 07:46:55,579][26022] Updated weights on worker 0-0, policy_version 1100093 (0.00085) [2022-07-11 07:46:57,459][26022] Updated weights on worker 0-0, policy_version 1100103 (0.00095) [2022-07-11 07:46:58,928][25689] Fps is (10 sec: 5491.2, 60 sec: 5530.2, 300 sec: 5530.8). Total num frames: 1126512640. Throughput: 0: 5791.9. Samples: 1126516278. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:46:58,929][25689] Avg episode reward: [(0, '-1.464')] [2022-07-11 07:46:59,242][26022] Updated weights on worker 0-0, policy_version 1100113 (0.00089) [2022-07-11 07:47:01,086][26022] Updated weights on worker 0-0, policy_version 1100123 (0.00092) [2022-07-11 07:47:03,390][26022] Updated weights on worker 0-0, policy_version 1100133 (0.00093) [2022-07-11 07:47:03,967][25689] Fps is (10 sec: 5393.5, 60 sec: 5546.4, 300 sec: 5527.3). Total num frames: 1126539264. Throughput: 0: 5709.3. Samples: 1126547878. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:03,969][25689] Avg episode reward: [(0, '-0.437')] [2022-07-11 07:47:04,952][26022] Updated weights on worker 0-0, policy_version 1100143 (0.00063) [2022-07-11 07:47:06,928][26022] Updated weights on worker 0-0, policy_version 1100153 (0.00090) [2022-07-11 07:47:08,895][26022] Updated weights on worker 0-0, policy_version 1100163 (0.00093) [2022-07-11 07:47:08,993][25689] Fps is (10 sec: 5392.1, 60 sec: 5527.6, 300 sec: 5529.2). Total num frames: 1126566912. Throughput: 0: 4891.5. Samples: 1126564744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:08,995][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 07:47:10,563][26022] Updated weights on worker 0-0, policy_version 1100173 (0.00083) [2022-07-11 07:47:12,469][26022] Updated weights on worker 0-0, policy_version 1100183 (0.00088) [2022-07-11 07:47:14,044][25689] Fps is (10 sec: 5690.6, 60 sec: 5548.7, 300 sec: 5532.3). Total num frames: 1126596608. Throughput: 0: 5729.1. Samples: 1126598464. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:14,045][25689] Avg episode reward: [(0, '0.060')] [2022-07-11 07:47:14,131][26022] Updated weights on worker 0-0, policy_version 1100193 (0.00095) [2022-07-11 07:47:16,287][26022] Updated weights on worker 0-0, policy_version 1100203 (0.00094) [2022-07-11 07:47:17,961][26022] Updated weights on worker 0-0, policy_version 1100213 (0.00086) [2022-07-11 07:47:19,065][25689] Fps is (10 sec: 5693.8, 60 sec: 5564.6, 300 sec: 5529.7). Total num frames: 1126624256. Throughput: 0: 5752.9. Samples: 1126632120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:19,065][25689] Avg episode reward: [(0, '0.065')] [2022-07-11 07:47:19,919][26022] Updated weights on worker 0-0, policy_version 1100223 (0.00093) [2022-07-11 07:47:21,392][26022] Updated weights on worker 0-0, policy_version 1100233 (0.00083) [2022-07-11 07:47:23,720][26022] Updated weights on worker 0-0, policy_version 1100243 (0.00091) [2022-07-11 07:47:24,092][25689] Fps is (10 sec: 5401.4, 60 sec: 5530.3, 300 sec: 5526.5). Total num frames: 1126650880. Throughput: 0: 5031.1. Samples: 1126649128. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:24,098][25689] Avg episode reward: [(0, '1.079')] [2022-07-11 07:47:25,157][26022] Updated weights on worker 0-0, policy_version 1100253 (0.00083) [2022-07-11 07:47:27,216][26022] Updated weights on worker 0-0, policy_version 1100263 (0.00080) [2022-07-11 07:47:28,971][26022] Updated weights on worker 0-0, policy_version 1100273 (0.00086) [2022-07-11 07:47:29,104][25689] Fps is (10 sec: 5609.8, 60 sec: 5563.2, 300 sec: 5535.1). Total num frames: 1126680576. Throughput: 0: 5859.3. Samples: 1126682580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:29,105][25689] Avg episode reward: [(0, '0.434')] [2022-07-11 07:47:30,741][26022] Updated weights on worker 0-0, policy_version 1100283 (0.00090) [2022-07-11 07:47:32,658][26022] Updated weights on worker 0-0, policy_version 1100293 (0.00085) [2022-07-11 07:47:34,171][25689] Fps is (10 sec: 5791.3, 60 sec: 5563.0, 300 sec: 5535.0). Total num frames: 1126709248. Throughput: 0: 5864.8. Samples: 1126716502. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:34,171][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 07:47:34,384][26022] Updated weights on worker 0-0, policy_version 1100303 (0.00091) [2022-07-11 07:47:36,203][26022] Updated weights on worker 0-0, policy_version 1100313 (0.00090) [2022-07-11 07:47:38,002][26022] Updated weights on worker 0-0, policy_version 1100323 (0.00085) [2022-07-11 07:47:39,205][25689] Fps is (10 sec: 5575.9, 60 sec: 5585.0, 300 sec: 5531.7). Total num frames: 1126736896. Throughput: 0: 5037.1. Samples: 1126733568. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:39,205][25689] Avg episode reward: [(0, '0.525')] [2022-07-11 07:47:39,764][26022] Updated weights on worker 0-0, policy_version 1100333 (0.00093) [2022-07-11 07:47:41,656][26022] Updated weights on worker 0-0, policy_version 1100343 (0.00081) [2022-07-11 07:47:43,447][26022] Updated weights on worker 0-0, policy_version 1100353 (0.00085) [2022-07-11 07:47:44,222][25689] Fps is (10 sec: 5603.4, 60 sec: 5575.2, 300 sec: 5535.4). Total num frames: 1126765568. Throughput: 0: 5881.4. Samples: 1126767518. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:44,222][25689] Avg episode reward: [(0, '1.546')] [2022-07-11 07:47:45,276][26022] Updated weights on worker 0-0, policy_version 1100363 (0.00086) [2022-07-11 07:47:46,882][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:47:46,895][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001100372_1126780928.pth [2022-07-11 07:47:46,896][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001098421_1124783104.pth [2022-07-11 07:47:47,086][26022] Updated weights on worker 0-0, policy_version 1100373 (0.00081) [2022-07-11 07:47:48,788][26022] Updated weights on worker 0-0, policy_version 1100383 (0.00086) [2022-07-11 07:47:49,252][25689] Fps is (10 sec: 5605.6, 60 sec: 5561.4, 300 sec: 5536.7). Total num frames: 1126793216. Throughput: 0: 5897.2. Samples: 1126801396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:49,253][25689] Avg episode reward: [(0, '1.559')] [2022-07-11 07:47:50,704][26022] Updated weights on worker 0-0, policy_version 1100393 (0.00090) [2022-07-11 07:47:52,459][26022] Updated weights on worker 0-0, policy_version 1100403 (0.00088) [2022-07-11 07:47:54,371][25689] Fps is (10 sec: 5448.3, 60 sec: 5554.7, 300 sec: 5531.4). Total num frames: 1126820864. Throughput: 0: 5039.5. Samples: 1126818302. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:54,372][25689] Avg episode reward: [(0, '1.933')] [2022-07-11 07:47:54,513][26022] Updated weights on worker 0-0, policy_version 1100413 (0.00088) [2022-07-11 07:47:56,345][26022] Updated weights on worker 0-0, policy_version 1100423 (0.00087) [2022-07-11 07:47:58,069][26022] Updated weights on worker 0-0, policy_version 1100433 (0.00091) [2022-07-11 07:47:59,441][25689] Fps is (10 sec: 5628.2, 60 sec: 5584.3, 300 sec: 5540.9). Total num frames: 1126850560. Throughput: 0: 5825.0. Samples: 1126851442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:47:59,441][25689] Avg episode reward: [(0, '1.535')] [2022-07-11 07:48:00,119][26022] Updated weights on worker 0-0, policy_version 1100443 (0.00096) [2022-07-11 07:48:01,660][26022] Updated weights on worker 0-0, policy_version 1100453 (0.01136) [2022-07-11 07:48:04,096][26022] Updated weights on worker 0-0, policy_version 1100463 (0.00088) [2022-07-11 07:48:04,462][25689] Fps is (10 sec: 5479.7, 60 sec: 5569.0, 300 sec: 5534.3). Total num frames: 1126876160. Throughput: 0: 5696.9. Samples: 1126882824. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:48:04,463][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 07:48:05,708][26022] Updated weights on worker 0-0, policy_version 1100473 (0.00119) [2022-07-11 07:48:07,601][26022] Updated weights on worker 0-0, policy_version 1100483 (0.00089) [2022-07-11 07:48:09,511][25689] Fps is (10 sec: 5186.0, 60 sec: 5550.0, 300 sec: 5531.0). Total num frames: 1126902784. Throughput: 0: 4844.2. Samples: 1126899534. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:48:09,511][25689] Avg episode reward: [(0, '1.793')] [2022-07-11 07:48:09,672][26022] Updated weights on worker 0-0, policy_version 1100493 (0.00087) [2022-07-11 07:48:11,264][26022] Updated weights on worker 0-0, policy_version 1100503 (0.00090) [2022-07-11 07:48:13,362][26022] Updated weights on worker 0-0, policy_version 1100513 (0.00082) [2022-07-11 07:48:14,580][25689] Fps is (10 sec: 5667.9, 60 sec: 5565.3, 300 sec: 5540.2). Total num frames: 1126933504. Throughput: 0: 5671.7. Samples: 1126932918. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:48:14,580][25689] Avg episode reward: [(0, '1.832')] [2022-07-11 07:48:15,067][26022] Updated weights on worker 0-0, policy_version 1100523 (0.00089) [2022-07-11 07:48:16,923][26022] Updated weights on worker 0-0, policy_version 1100533 (0.00086) [2022-07-11 07:48:18,663][26022] Updated weights on worker 0-0, policy_version 1100543 (0.00082) [2022-07-11 07:48:19,645][25689] Fps is (10 sec: 5759.5, 60 sec: 5561.2, 300 sec: 5539.4). Total num frames: 1126961152. Throughput: 0: 5720.3. Samples: 1126967016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:48:19,647][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 07:48:20,633][26022] Updated weights on worker 0-0, policy_version 1100553 (0.00092) [2022-07-11 07:48:22,130][26022] Updated weights on worker 0-0, policy_version 1100563 (0.00093) [2022-07-11 07:48:24,257][26022] Updated weights on worker 0-0, policy_version 1100573 (0.00083) [2022-07-11 07:48:24,657][25689] Fps is (10 sec: 5487.0, 60 sec: 5579.5, 300 sec: 5536.4). Total num frames: 1126988800. Throughput: 0: 4996.7. Samples: 1126983732. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:48:24,658][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 07:48:25,996][26022] Updated weights on worker 0-0, policy_version 1100583 (0.00092) [2022-07-11 07:48:28,009][26022] Updated weights on worker 0-0, policy_version 1100593 (0.00081) [2022-07-11 07:48:29,666][25689] Fps is (10 sec: 5518.0, 60 sec: 5546.0, 300 sec: 5538.2). Total num frames: 1127016448. Throughput: 0: 5826.1. Samples: 1127016960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 07:48:29,666][25689] Avg episode reward: [(0, '0.518')] [2022-07-11 07:48:29,697][26022] Updated weights on worker 0-0, policy_version 1100603 (0.00389) [2022-07-11 07:48:31,599][26022] Updated weights on worker 0-0, policy_version 1100613 (0.00496) [2022-07-11 07:48:33,333][26022] Updated weights on worker 0-0, policy_version 1100623 (0.00086) [2022-07-11 07:48:34,734][25689] Fps is (10 sec: 5588.9, 60 sec: 5545.8, 300 sec: 5542.2). Total num frames: 1127045120. Throughput: 0: 5830.3. Samples: 1127050426. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:48:34,735][25689] Avg episode reward: [(0, '0.424')] [2022-07-11 07:48:35,303][26022] Updated weights on worker 0-0, policy_version 1100633 (0.00080) [2022-07-11 07:48:36,997][26022] Updated weights on worker 0-0, policy_version 1100643 (0.00089) [2022-07-11 07:48:39,036][26022] Updated weights on worker 0-0, policy_version 1100653 (0.00092) [2022-07-11 07:48:39,741][25689] Fps is (10 sec: 5590.3, 60 sec: 5548.3, 300 sec: 5542.9). Total num frames: 1127072768. Throughput: 0: 4984.5. Samples: 1127067182. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:48:39,741][25689] Avg episode reward: [(0, '0.646')] [2022-07-11 07:48:40,599][26022] Updated weights on worker 0-0, policy_version 1100663 (0.00087) [2022-07-11 07:48:42,614][26022] Updated weights on worker 0-0, policy_version 1100673 (0.00089) [2022-07-11 07:48:44,486][26022] Updated weights on worker 0-0, policy_version 1100683 (0.00086) [2022-07-11 07:48:44,764][25689] Fps is (10 sec: 5615.5, 60 sec: 5547.8, 300 sec: 5546.1). Total num frames: 1127101440. Throughput: 0: 5834.9. Samples: 1127101052. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:48:44,764][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 07:48:46,115][26022] Updated weights on worker 0-0, policy_version 1100693 (0.00089) [2022-07-11 07:48:48,159][26022] Updated weights on worker 0-0, policy_version 1100703 (0.00074) [2022-07-11 07:48:49,831][25689] Fps is (10 sec: 5581.7, 60 sec: 5544.4, 300 sec: 5546.4). Total num frames: 1127129088. Throughput: 0: 5832.6. Samples: 1127134574. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:48:49,831][25689] Avg episode reward: [(0, '0.506')] [2022-07-11 07:48:49,861][26022] Updated weights on worker 0-0, policy_version 1100713 (0.00086) [2022-07-11 07:48:51,746][26022] Updated weights on worker 0-0, policy_version 1100723 (0.00083) [2022-07-11 07:48:53,564][26022] Updated weights on worker 0-0, policy_version 1100733 (0.00085) [2022-07-11 07:48:54,937][25689] Fps is (10 sec: 5536.2, 60 sec: 5562.5, 300 sec: 5548.2). Total num frames: 1127157760. Throughput: 0: 4980.4. Samples: 1127151042. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:48:54,937][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 07:48:55,372][26022] Updated weights on worker 0-0, policy_version 1100743 (0.00085) [2022-07-11 07:48:57,159][26022] Updated weights on worker 0-0, policy_version 1100753 (0.00090) [2022-07-11 07:48:59,041][26022] Updated weights on worker 0-0, policy_version 1100763 (0.00089) [2022-07-11 07:48:59,960][25689] Fps is (10 sec: 5661.2, 60 sec: 5549.8, 300 sec: 5559.9). Total num frames: 1127186432. Throughput: 0: 5821.6. Samples: 1127184890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:48:59,962][25689] Avg episode reward: [(0, '1.664')] [2022-07-11 07:49:00,822][26022] Updated weights on worker 0-0, policy_version 1100773 (0.00080) [2022-07-11 07:49:02,969][26022] Updated weights on worker 0-0, policy_version 1100783 (0.00090) [2022-07-11 07:49:04,831][26022] Updated weights on worker 0-0, policy_version 1100793 (0.00085) [2022-07-11 07:49:04,996][25689] Fps is (10 sec: 5395.2, 60 sec: 5548.5, 300 sec: 5549.0). Total num frames: 1127212032. Throughput: 0: 5709.4. Samples: 1127216566. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:04,996][25689] Avg episode reward: [(0, '0.625')] [2022-07-11 07:49:06,706][26022] Updated weights on worker 0-0, policy_version 1100803 (0.00083) [2022-07-11 07:49:08,483][26022] Updated weights on worker 0-0, policy_version 1100813 (0.00085) [2022-07-11 07:49:10,020][25689] Fps is (10 sec: 5394.5, 60 sec: 5584.6, 300 sec: 5550.4). Total num frames: 1127240704. Throughput: 0: 4896.9. Samples: 1127233442. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:10,022][25689] Avg episode reward: [(0, '-0.295')] [2022-07-11 07:49:10,355][26022] Updated weights on worker 0-0, policy_version 1100823 (0.00086) [2022-07-11 07:49:12,042][26022] Updated weights on worker 0-0, policy_version 1100833 (0.00084) [2022-07-11 07:49:14,054][26022] Updated weights on worker 0-0, policy_version 1100843 (0.00092) [2022-07-11 07:49:15,099][25689] Fps is (10 sec: 5676.1, 60 sec: 5549.8, 300 sec: 5553.2). Total num frames: 1127269376. Throughput: 0: 5751.9. Samples: 1127267012. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:15,099][25689] Avg episode reward: [(0, '0.377')] [2022-07-11 07:49:16,081][26022] Updated weights on worker 0-0, policy_version 1100853 (0.00083) [2022-07-11 07:49:17,662][26022] Updated weights on worker 0-0, policy_version 1100863 (0.00090) [2022-07-11 07:49:19,522][26022] Updated weights on worker 0-0, policy_version 1100873 (0.00085) [2022-07-11 07:49:20,113][25689] Fps is (10 sec: 5580.6, 60 sec: 5554.6, 300 sec: 5549.8). Total num frames: 1127297024. Throughput: 0: 5742.0. Samples: 1127300606. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:20,113][25689] Avg episode reward: [(0, '0.423')] [2022-07-11 07:49:21,200][26022] Updated weights on worker 0-0, policy_version 1100883 (0.00088) [2022-07-11 07:49:23,397][26022] Updated weights on worker 0-0, policy_version 1100893 (0.00089) [2022-07-11 07:49:24,805][26022] Updated weights on worker 0-0, policy_version 1100903 (0.00109) [2022-07-11 07:49:25,136][25689] Fps is (10 sec: 5611.1, 60 sec: 5570.5, 300 sec: 5553.1). Total num frames: 1127325696. Throughput: 0: 5856.5. Samples: 1127334516. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:25,138][25689] Avg episode reward: [(0, '0.273')] [2022-07-11 07:49:26,996][26022] Updated weights on worker 0-0, policy_version 1100913 (0.00088) [2022-07-11 07:49:28,482][26022] Updated weights on worker 0-0, policy_version 1100923 (0.00089) [2022-07-11 07:49:30,138][25689] Fps is (10 sec: 5515.4, 60 sec: 5554.1, 300 sec: 5547.1). Total num frames: 1127352320. Throughput: 0: 5839.5. Samples: 1127350920. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:30,140][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 07:49:30,633][26022] Updated weights on worker 0-0, policy_version 1100933 (0.00085) [2022-07-11 07:49:32,320][26022] Updated weights on worker 0-0, policy_version 1100943 (0.00085) [2022-07-11 07:49:34,256][26022] Updated weights on worker 0-0, policy_version 1100953 (0.00099) [2022-07-11 07:49:35,272][25689] Fps is (10 sec: 5455.2, 60 sec: 5548.1, 300 sec: 5551.7). Total num frames: 1127380992. Throughput: 0: 5820.7. Samples: 1127384436. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:35,273][25689] Avg episode reward: [(0, '0.746')] [2022-07-11 07:49:35,882][26022] Updated weights on worker 0-0, policy_version 1100963 (0.00087) [2022-07-11 07:49:37,911][26022] Updated weights on worker 0-0, policy_version 1100973 (0.00092) [2022-07-11 07:49:39,669][26022] Updated weights on worker 0-0, policy_version 1100983 (0.00087) [2022-07-11 07:49:40,290][25689] Fps is (10 sec: 5749.5, 60 sec: 5580.9, 300 sec: 5555.5). Total num frames: 1127410688. Throughput: 0: 5831.2. Samples: 1127418266. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:40,291][25689] Avg episode reward: [(0, '0.424')] [2022-07-11 07:49:41,630][26022] Updated weights on worker 0-0, policy_version 1100993 (0.00088) [2022-07-11 07:49:43,274][26022] Updated weights on worker 0-0, policy_version 1101003 (0.00087) [2022-07-11 07:49:45,312][25689] Fps is (10 sec: 5609.6, 60 sec: 5547.1, 300 sec: 5548.7). Total num frames: 1127437312. Throughput: 0: 4989.7. Samples: 1127435190. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:45,313][25689] Avg episode reward: [(0, '1.095')] [2022-07-11 07:49:45,315][26022] Updated weights on worker 0-0, policy_version 1101013 (0.00093) [2022-07-11 07:49:46,896][26022] Updated weights on worker 0-0, policy_version 1101023 (0.01245) [2022-07-11 07:49:47,070][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:49:47,082][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001101024_1127448576.pth [2022-07-11 07:49:47,083][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001099069_1125446656.pth [2022-07-11 07:49:48,984][26022] Updated weights on worker 0-0, policy_version 1101033 (0.00081) [2022-07-11 07:49:50,325][25689] Fps is (10 sec: 5612.2, 60 sec: 5585.9, 300 sec: 5559.9). Total num frames: 1127467008. Throughput: 0: 5829.3. Samples: 1127468596. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:50,327][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 07:49:50,647][26022] Updated weights on worker 0-0, policy_version 1101043 (0.00086) [2022-07-11 07:49:52,631][26022] Updated weights on worker 0-0, policy_version 1101053 (0.00091) [2022-07-11 07:49:54,166][26022] Updated weights on worker 0-0, policy_version 1101063 (0.00086) [2022-07-11 07:49:55,446][25689] Fps is (10 sec: 5456.7, 60 sec: 5533.8, 300 sec: 5544.0). Total num frames: 1127492608. Throughput: 0: 5840.9. Samples: 1127502268. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:49:55,447][25689] Avg episode reward: [(0, '1.558')] [2022-07-11 07:49:56,248][26022] Updated weights on worker 0-0, policy_version 1101073 (0.00091) [2022-07-11 07:49:57,940][26022] Updated weights on worker 0-0, policy_version 1101083 (0.00089) [2022-07-11 07:50:00,016][26022] Updated weights on worker 0-0, policy_version 1101093 (0.00051) [2022-07-11 07:50:00,473][25689] Fps is (10 sec: 5348.3, 60 sec: 5533.5, 300 sec: 5554.2). Total num frames: 1127521280. Throughput: 0: 4965.8. Samples: 1127518490. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:00,473][25689] Avg episode reward: [(0, '1.436')] [2022-07-11 07:50:02,184][26022] Updated weights on worker 0-0, policy_version 1101103 (0.00091) [2022-07-11 07:50:04,160][26022] Updated weights on worker 0-0, policy_version 1101113 (0.00089) [2022-07-11 07:50:05,528][25689] Fps is (10 sec: 5484.6, 60 sec: 5548.7, 300 sec: 5546.6). Total num frames: 1127547904. Throughput: 0: 5673.3. Samples: 1127549878. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:05,528][25689] Avg episode reward: [(0, '1.354')] [2022-07-11 07:50:05,831][26022] Updated weights on worker 0-0, policy_version 1101123 (0.00086) [2022-07-11 07:50:07,845][26022] Updated weights on worker 0-0, policy_version 1101133 (0.00086) [2022-07-11 07:50:09,494][26022] Updated weights on worker 0-0, policy_version 1101143 (0.00086) [2022-07-11 07:50:10,547][25689] Fps is (10 sec: 5387.1, 60 sec: 5532.2, 300 sec: 5547.3). Total num frames: 1127575552. Throughput: 0: 5670.8. Samples: 1127583270. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:10,548][25689] Avg episode reward: [(0, '1.642')] [2022-07-11 07:50:11,486][26022] Updated weights on worker 0-0, policy_version 1101153 (0.00086) [2022-07-11 07:50:13,155][26022] Updated weights on worker 0-0, policy_version 1101163 (0.00093) [2022-07-11 07:50:15,012][26022] Updated weights on worker 0-0, policy_version 1101173 (0.00078) [2022-07-11 07:50:15,602][25689] Fps is (10 sec: 5590.6, 60 sec: 5534.4, 300 sec: 5546.7). Total num frames: 1127604224. Throughput: 0: 4843.1. Samples: 1127599886. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:15,602][25689] Avg episode reward: [(0, '1.178')] [2022-07-11 07:50:16,829][26022] Updated weights on worker 0-0, policy_version 1101183 (0.00093) [2022-07-11 07:50:18,839][26022] Updated weights on worker 0-0, policy_version 1101193 (0.00083) [2022-07-11 07:50:20,522][26022] Updated weights on worker 0-0, policy_version 1101203 (0.00085) [2022-07-11 07:50:20,606][25689] Fps is (10 sec: 5701.2, 60 sec: 5552.2, 300 sec: 5550.5). Total num frames: 1127632896. Throughput: 0: 5711.3. Samples: 1127633474. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:20,606][25689] Avg episode reward: [(0, '0.367')] [2022-07-11 07:50:22,484][26022] Updated weights on worker 0-0, policy_version 1101213 (0.00974) [2022-07-11 07:50:24,222][26022] Updated weights on worker 0-0, policy_version 1101223 (0.00091) [2022-07-11 07:50:25,646][25689] Fps is (10 sec: 5505.2, 60 sec: 5516.8, 300 sec: 5547.7). Total num frames: 1127659520. Throughput: 0: 5811.7. Samples: 1127666800. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:25,651][25689] Avg episode reward: [(0, '-0.591')] [2022-07-11 07:50:26,263][26022] Updated weights on worker 0-0, policy_version 1101233 (0.00080) [2022-07-11 07:50:27,806][26022] Updated weights on worker 0-0, policy_version 1101243 (0.00086) [2022-07-11 07:50:30,022][26022] Updated weights on worker 0-0, policy_version 1101253 (0.00084) [2022-07-11 07:50:30,671][25689] Fps is (10 sec: 5392.1, 60 sec: 5531.7, 300 sec: 5545.2). Total num frames: 1127687168. Throughput: 0: 4980.1. Samples: 1127683486. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:30,671][25689] Avg episode reward: [(0, '-1.406')] [2022-07-11 07:50:31,503][26022] Updated weights on worker 0-0, policy_version 1101263 (0.00087) [2022-07-11 07:50:33,726][26022] Updated weights on worker 0-0, policy_version 1101273 (0.00090) [2022-07-11 07:50:35,148][26022] Updated weights on worker 0-0, policy_version 1101283 (0.00092) [2022-07-11 07:50:35,785][25689] Fps is (10 sec: 5555.1, 60 sec: 5533.5, 300 sec: 5550.7). Total num frames: 1127715840. Throughput: 0: 5801.1. Samples: 1127716970. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:35,785][25689] Avg episode reward: [(0, '-1.570')] [2022-07-11 07:50:37,311][26022] Updated weights on worker 0-0, policy_version 1101293 (0.00090) [2022-07-11 07:50:38,810][26022] Updated weights on worker 0-0, policy_version 1101303 (0.00089) [2022-07-11 07:50:40,854][25689] Fps is (10 sec: 5530.6, 60 sec: 5495.0, 300 sec: 5539.7). Total num frames: 1127743488. Throughput: 0: 5773.5. Samples: 1127750378. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:40,855][25689] Avg episode reward: [(0, '-0.780')] [2022-07-11 07:50:40,922][26022] Updated weights on worker 0-0, policy_version 1101313 (0.00092) [2022-07-11 07:50:42,570][26022] Updated weights on worker 0-0, policy_version 1101323 (0.00089) [2022-07-11 07:50:44,677][26022] Updated weights on worker 0-0, policy_version 1101333 (0.00084) [2022-07-11 07:50:45,940][25689] Fps is (10 sec: 5646.9, 60 sec: 5539.9, 300 sec: 5545.2). Total num frames: 1127773184. Throughput: 0: 4944.2. Samples: 1127767136. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:45,942][25689] Avg episode reward: [(0, '-0.614')] [2022-07-11 07:50:46,385][26022] Updated weights on worker 0-0, policy_version 1101343 (0.00085) [2022-07-11 07:50:48,162][26022] Updated weights on worker 0-0, policy_version 1101353 (0.00086) [2022-07-11 07:50:49,939][26022] Updated weights on worker 0-0, policy_version 1101363 (0.00084) [2022-07-11 07:50:51,028][25689] Fps is (10 sec: 5636.6, 60 sec: 5499.4, 300 sec: 5545.3). Total num frames: 1127800832. Throughput: 0: 5761.7. Samples: 1127800776. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:51,029][25689] Avg episode reward: [(0, '-0.508')] [2022-07-11 07:50:52,021][26022] Updated weights on worker 0-0, policy_version 1101373 (0.00085) [2022-07-11 07:50:53,581][26022] Updated weights on worker 0-0, policy_version 1101383 (0.00060) [2022-07-11 07:50:55,739][26022] Updated weights on worker 0-0, policy_version 1101393 (0.00089) [2022-07-11 07:50:56,107][25689] Fps is (10 sec: 5539.4, 60 sec: 5553.7, 300 sec: 5547.7). Total num frames: 1127829504. Throughput: 0: 5773.2. Samples: 1127834294. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:50:56,107][25689] Avg episode reward: [(0, '-0.858')] [2022-07-11 07:50:57,271][26022] Updated weights on worker 0-0, policy_version 1101403 (0.00094) [2022-07-11 07:50:59,203][26022] Updated weights on worker 0-0, policy_version 1101413 (0.00089) [2022-07-11 07:51:01,054][26022] Updated weights on worker 0-0, policy_version 1101423 (0.00082) [2022-07-11 07:51:01,111][25689] Fps is (10 sec: 5585.6, 60 sec: 5539.0, 300 sec: 5555.1). Total num frames: 1127857152. Throughput: 0: 4969.1. Samples: 1127851036. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:01,111][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 07:51:03,176][26022] Updated weights on worker 0-0, policy_version 1101433 (0.00085) [2022-07-11 07:51:05,156][26022] Updated weights on worker 0-0, policy_version 1101443 (0.00092) [2022-07-11 07:51:06,113][25689] Fps is (10 sec: 5321.7, 60 sec: 5526.9, 300 sec: 5544.9). Total num frames: 1127882752. Throughput: 0: 5721.2. Samples: 1127882550. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:06,115][25689] Avg episode reward: [(0, '0.093')] [2022-07-11 07:51:06,782][26022] Updated weights on worker 0-0, policy_version 1101453 (0.00094) [2022-07-11 07:51:08,911][26022] Updated weights on worker 0-0, policy_version 1101463 (0.00091) [2022-07-11 07:51:10,426][26022] Updated weights on worker 0-0, policy_version 1101473 (0.00086) [2022-07-11 07:51:11,159][25689] Fps is (10 sec: 5401.3, 60 sec: 5541.4, 300 sec: 5545.8). Total num frames: 1127911424. Throughput: 0: 5720.0. Samples: 1127915926. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:11,159][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 07:51:12,479][26022] Updated weights on worker 0-0, policy_version 1101483 (0.00092) [2022-07-11 07:51:14,319][26022] Updated weights on worker 0-0, policy_version 1101493 (0.00091) [2022-07-11 07:51:15,995][26022] Updated weights on worker 0-0, policy_version 1101503 (0.00088) [2022-07-11 07:51:16,293][25689] Fps is (10 sec: 5632.9, 60 sec: 5534.1, 300 sec: 5550.4). Total num frames: 1127940096. Throughput: 0: 4863.0. Samples: 1127932458. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:16,293][25689] Avg episode reward: [(0, '-0.646')] [2022-07-11 07:51:18,258][26022] Updated weights on worker 0-0, policy_version 1101513 (0.00092) [2022-07-11 07:51:19,578][26022] Updated weights on worker 0-0, policy_version 1101523 (0.00093) [2022-07-11 07:51:21,322][25689] Fps is (10 sec: 5340.0, 60 sec: 5481.2, 300 sec: 5539.9). Total num frames: 1127965696. Throughput: 0: 5676.1. Samples: 1127965758. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:21,327][25689] Avg episode reward: [(0, '-0.111')] [2022-07-11 07:51:21,862][26022] Updated weights on worker 0-0, policy_version 1101533 (0.00084) [2022-07-11 07:51:23,280][26022] Updated weights on worker 0-0, policy_version 1101543 (0.00091) [2022-07-11 07:51:25,398][26022] Updated weights on worker 0-0, policy_version 1101553 (0.00086) [2022-07-11 07:51:26,331][25689] Fps is (10 sec: 5508.7, 60 sec: 5534.7, 300 sec: 5546.7). Total num frames: 1127995392. Throughput: 0: 5773.4. Samples: 1127999276. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:26,331][25689] Avg episode reward: [(0, '1.104')] [2022-07-11 07:51:27,239][26022] Updated weights on worker 0-0, policy_version 1101563 (0.00086) [2022-07-11 07:51:29,002][26022] Updated weights on worker 0-0, policy_version 1101573 (0.00082) [2022-07-11 07:51:30,892][26022] Updated weights on worker 0-0, policy_version 1101583 (0.00092) [2022-07-11 07:51:31,346][25689] Fps is (10 sec: 5822.7, 60 sec: 5552.4, 300 sec: 5547.6). Total num frames: 1128024064. Throughput: 0: 5784.5. Samples: 1128032702. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:31,347][25689] Avg episode reward: [(0, '1.073')] [2022-07-11 07:51:32,792][26022] Updated weights on worker 0-0, policy_version 1101593 (0.00094) [2022-07-11 07:51:34,567][26022] Updated weights on worker 0-0, policy_version 1101603 (0.00091) [2022-07-11 07:51:36,469][25689] Fps is (10 sec: 5353.2, 60 sec: 5501.0, 300 sec: 5543.5). Total num frames: 1128049664. Throughput: 0: 5795.8. Samples: 1128049394. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:36,470][25689] Avg episode reward: [(0, '0.368')] [2022-07-11 07:51:36,651][26022] Updated weights on worker 0-0, policy_version 1101613 (0.00087) [2022-07-11 07:51:38,088][26022] Updated weights on worker 0-0, policy_version 1101623 (0.00092) [2022-07-11 07:51:40,355][26022] Updated weights on worker 0-0, policy_version 1101633 (0.00089) [2022-07-11 07:51:41,491][25689] Fps is (10 sec: 5450.9, 60 sec: 5539.1, 300 sec: 5544.9). Total num frames: 1128079360. Throughput: 0: 5802.6. Samples: 1128082788. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:41,491][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 07:51:41,826][26022] Updated weights on worker 0-0, policy_version 1101643 (0.00083) [2022-07-11 07:51:44,001][26022] Updated weights on worker 0-0, policy_version 1101653 (0.00086) [2022-07-11 07:51:45,594][26022] Updated weights on worker 0-0, policy_version 1101663 (0.00088) [2022-07-11 07:51:46,498][25689] Fps is (10 sec: 5718.1, 60 sec: 5512.5, 300 sec: 5542.5). Total num frames: 1128107008. Throughput: 0: 5797.1. Samples: 1128116184. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:46,498][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 07:51:47,120][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:51:47,129][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001101670_1128110080.pth [2022-07-11 07:51:47,137][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001099721_1126114304.pth [2022-07-11 07:51:47,544][26022] Updated weights on worker 0-0, policy_version 1101673 (0.00090) [2022-07-11 07:51:49,271][26022] Updated weights on worker 0-0, policy_version 1101683 (0.00087) [2022-07-11 07:51:51,140][26022] Updated weights on worker 0-0, policy_version 1101693 (0.00089) [2022-07-11 07:51:51,514][25689] Fps is (10 sec: 5619.0, 60 sec: 5535.9, 300 sec: 5546.5). Total num frames: 1128135680. Throughput: 0: 4968.9. Samples: 1128132912. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:51,516][25689] Avg episode reward: [(0, '0.504')] [2022-07-11 07:51:53,064][26022] Updated weights on worker 0-0, policy_version 1101703 (0.00094) [2022-07-11 07:51:54,906][26022] Updated weights on worker 0-0, policy_version 1101713 (0.00087) [2022-07-11 07:51:56,564][26022] Updated weights on worker 0-0, policy_version 1101723 (0.00091) [2022-07-11 07:51:56,640][25689] Fps is (10 sec: 5654.0, 60 sec: 5531.7, 300 sec: 5548.0). Total num frames: 1128164352. Throughput: 0: 5791.1. Samples: 1128166204. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:51:56,640][25689] Avg episode reward: [(0, '0.527')] [2022-07-11 07:51:58,909][26022] Updated weights on worker 0-0, policy_version 1101733 (0.00087) [2022-07-11 07:52:00,201][26022] Updated weights on worker 0-0, policy_version 1101743 (0.00082) [2022-07-11 07:52:01,657][25689] Fps is (10 sec: 5451.4, 60 sec: 5513.5, 300 sec: 5548.4). Total num frames: 1128190976. Throughput: 0: 5801.3. Samples: 1128199780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:01,658][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 07:52:02,631][26022] Updated weights on worker 0-0, policy_version 1101753 (0.00110) [2022-07-11 07:52:04,342][26022] Updated weights on worker 0-0, policy_version 1101763 (0.00089) [2022-07-11 07:52:06,266][26022] Updated weights on worker 0-0, policy_version 1101773 (0.00090) [2022-07-11 07:52:06,676][25689] Fps is (10 sec: 5203.4, 60 sec: 5512.0, 300 sec: 5541.7). Total num frames: 1128216576. Throughput: 0: 4867.1. Samples: 1128214396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:06,677][25689] Avg episode reward: [(0, '0.398')] [2022-07-11 07:52:08,102][26022] Updated weights on worker 0-0, policy_version 1101783 (0.00086) [2022-07-11 07:52:10,059][26022] Updated weights on worker 0-0, policy_version 1101793 (0.00084) [2022-07-11 07:52:11,694][25689] Fps is (10 sec: 5407.5, 60 sec: 5514.6, 300 sec: 5538.9). Total num frames: 1128245248. Throughput: 0: 5689.5. Samples: 1128247726. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:11,694][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 07:52:11,802][26022] Updated weights on worker 0-0, policy_version 1101803 (0.00086) [2022-07-11 07:52:13,648][26022] Updated weights on worker 0-0, policy_version 1101813 (0.00087) [2022-07-11 07:52:15,620][26022] Updated weights on worker 0-0, policy_version 1101823 (0.00087) [2022-07-11 07:52:16,764][25689] Fps is (10 sec: 5481.4, 60 sec: 5486.5, 300 sec: 5534.5). Total num frames: 1128271872. Throughput: 0: 5708.9. Samples: 1128281092. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:16,766][25689] Avg episode reward: [(0, '0.115')] [2022-07-11 07:52:17,235][26022] Updated weights on worker 0-0, policy_version 1101833 (0.00087) [2022-07-11 07:52:19,339][26022] Updated weights on worker 0-0, policy_version 1101843 (0.00092) [2022-07-11 07:52:20,848][26022] Updated weights on worker 0-0, policy_version 1101853 (0.00088) [2022-07-11 07:52:21,794][25689] Fps is (10 sec: 5576.2, 60 sec: 5554.2, 300 sec: 5544.8). Total num frames: 1128301568. Throughput: 0: 4876.3. Samples: 1128297972. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:21,795][25689] Avg episode reward: [(0, '-1.100')] [2022-07-11 07:52:22,867][26022] Updated weights on worker 0-0, policy_version 1101863 (0.00613) [2022-07-11 07:52:24,662][26022] Updated weights on worker 0-0, policy_version 1101873 (0.00096) [2022-07-11 07:52:26,491][26022] Updated weights on worker 0-0, policy_version 1101883 (0.00092) [2022-07-11 07:52:26,808][25689] Fps is (10 sec: 5811.4, 60 sec: 5536.8, 300 sec: 5541.3). Total num frames: 1128330240. Throughput: 0: 5818.6. Samples: 1128331534. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:26,808][25689] Avg episode reward: [(0, '-1.707')] [2022-07-11 07:52:28,559][26022] Updated weights on worker 0-0, policy_version 1101893 (0.00093) [2022-07-11 07:52:30,242][26022] Updated weights on worker 0-0, policy_version 1101903 (0.00093) [2022-07-11 07:52:31,841][25689] Fps is (10 sec: 5401.5, 60 sec: 5484.4, 300 sec: 5531.6). Total num frames: 1128355840. Throughput: 0: 5821.0. Samples: 1128365006. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2022-07-11 07:52:31,843][25689] Avg episode reward: [(0, '-0.983')] [2022-07-11 07:52:32,171][26022] Updated weights on worker 0-0, policy_version 1101913 (0.00083) [2022-07-11 07:52:33,847][26022] Updated weights on worker 0-0, policy_version 1101923 (0.00108) [2022-07-11 07:52:35,693][26022] Updated weights on worker 0-0, policy_version 1101933 (0.00084) [2022-07-11 07:52:36,921][25689] Fps is (10 sec: 5569.2, 60 sec: 5573.0, 300 sec: 5541.1). Total num frames: 1128386560. Throughput: 0: 4970.1. Samples: 1128381272. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:52:36,921][25689] Avg episode reward: [(0, '-1.039')] [2022-07-11 07:52:37,679][26022] Updated weights on worker 0-0, policy_version 1101943 (0.00081) [2022-07-11 07:52:39,451][26022] Updated weights on worker 0-0, policy_version 1101953 (0.00097) [2022-07-11 07:52:41,173][26022] Updated weights on worker 0-0, policy_version 1101963 (0.00095) [2022-07-11 07:52:42,023][25689] Fps is (10 sec: 5632.2, 60 sec: 5514.8, 300 sec: 5532.6). Total num frames: 1128413184. Throughput: 0: 5787.4. Samples: 1128415046. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:52:42,023][25689] Avg episode reward: [(0, '-1.386')] [2022-07-11 07:52:43,075][26022] Updated weights on worker 0-0, policy_version 1101973 (0.00084) [2022-07-11 07:52:44,723][26022] Updated weights on worker 0-0, policy_version 1101983 (0.00084) [2022-07-11 07:52:46,811][26022] Updated weights on worker 0-0, policy_version 1101993 (0.00085) [2022-07-11 07:52:47,046][25689] Fps is (10 sec: 5562.0, 60 sec: 5547.1, 300 sec: 5539.6). Total num frames: 1128442880. Throughput: 0: 5795.2. Samples: 1128448820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:52:47,047][25689] Avg episode reward: [(0, '-2.302')] [2022-07-11 07:52:48,563][26022] Updated weights on worker 0-0, policy_version 1102003 (0.00119) [2022-07-11 07:52:50,162][26022] Updated weights on worker 0-0, policy_version 1102013 (0.00094) [2022-07-11 07:52:52,070][25689] Fps is (10 sec: 5707.2, 60 sec: 5529.5, 300 sec: 5541.4). Total num frames: 1128470528. Throughput: 0: 4981.3. Samples: 1128465774. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:52:52,071][25689] Avg episode reward: [(0, '-0.637')] [2022-07-11 07:52:52,209][26022] Updated weights on worker 0-0, policy_version 1102023 (0.00086) [2022-07-11 07:52:54,028][26022] Updated weights on worker 0-0, policy_version 1102033 (0.00089) [2022-07-11 07:52:55,713][26022] Updated weights on worker 0-0, policy_version 1102043 (0.00083) [2022-07-11 07:52:57,138][25689] Fps is (10 sec: 5479.4, 60 sec: 5517.9, 300 sec: 5534.5). Total num frames: 1128498176. Throughput: 0: 5848.6. Samples: 1128499516. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:52:57,139][25689] Avg episode reward: [(0, '-0.800')] [2022-07-11 07:52:57,676][26022] Updated weights on worker 0-0, policy_version 1102053 (0.00086) [2022-07-11 07:52:59,459][26022] Updated weights on worker 0-0, policy_version 1102063 (0.00087) [2022-07-11 07:53:01,313][26022] Updated weights on worker 0-0, policy_version 1102073 (0.00086) [2022-07-11 07:53:02,165][25689] Fps is (10 sec: 5680.5, 60 sec: 5567.8, 300 sec: 5548.2). Total num frames: 1128527872. Throughput: 0: 5870.3. Samples: 1128533288. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:02,171][25689] Avg episode reward: [(0, '-1.506')] [2022-07-11 07:53:03,759][26022] Updated weights on worker 0-0, policy_version 1102083 (0.00089) [2022-07-11 07:53:05,094][26022] Updated weights on worker 0-0, policy_version 1102093 (0.00086) [2022-07-11 07:53:07,180][25689] Fps is (10 sec: 5302.5, 60 sec: 5534.3, 300 sec: 5538.5). Total num frames: 1128551424. Throughput: 0: 4917.5. Samples: 1128547828. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:07,181][25689] Avg episode reward: [(0, '-2.209')] [2022-07-11 07:53:07,539][26022] Updated weights on worker 0-0, policy_version 1102103 (0.00083) [2022-07-11 07:53:08,903][26022] Updated weights on worker 0-0, policy_version 1102113 (0.00089) [2022-07-11 07:53:11,057][26022] Updated weights on worker 0-0, policy_version 1102123 (0.00391) [2022-07-11 07:53:12,188][25689] Fps is (10 sec: 5414.8, 60 sec: 5569.0, 300 sec: 5539.6). Total num frames: 1128582144. Throughput: 0: 5725.6. Samples: 1128580960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:12,189][25689] Avg episode reward: [(0, '-1.895')] [2022-07-11 07:53:12,675][26022] Updated weights on worker 0-0, policy_version 1102133 (0.00081) [2022-07-11 07:53:14,541][26022] Updated weights on worker 0-0, policy_version 1102143 (0.00069) [2022-07-11 07:53:16,332][26022] Updated weights on worker 0-0, policy_version 1102153 (0.00084) [2022-07-11 07:53:17,243][25689] Fps is (10 sec: 5698.9, 60 sec: 5570.5, 300 sec: 5536.4). Total num frames: 1128608768. Throughput: 0: 5735.8. Samples: 1128614832. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:17,243][25689] Avg episode reward: [(0, '-1.945')] [2022-07-11 07:53:18,122][26022] Updated weights on worker 0-0, policy_version 1102163 (0.00087) [2022-07-11 07:53:19,992][26022] Updated weights on worker 0-0, policy_version 1102173 (0.00087) [2022-07-11 07:53:21,974][26022] Updated weights on worker 0-0, policy_version 1102183 (0.00093) [2022-07-11 07:53:22,251][25689] Fps is (10 sec: 5495.1, 60 sec: 5555.5, 300 sec: 5539.9). Total num frames: 1128637440. Throughput: 0: 4899.7. Samples: 1128631702. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:22,251][25689] Avg episode reward: [(0, '-2.034')] [2022-07-11 07:53:23,495][26022] Updated weights on worker 0-0, policy_version 1102193 (0.00086) [2022-07-11 07:53:25,490][26022] Updated weights on worker 0-0, policy_version 1102203 (0.00088) [2022-07-11 07:53:27,206][26022] Updated weights on worker 0-0, policy_version 1102213 (0.00094) [2022-07-11 07:53:27,279][25689] Fps is (10 sec: 5713.8, 60 sec: 5554.3, 300 sec: 5543.0). Total num frames: 1128666112. Throughput: 0: 5862.3. Samples: 1128665652. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:27,279][25689] Avg episode reward: [(0, '-1.430')] [2022-07-11 07:53:29,244][26022] Updated weights on worker 0-0, policy_version 1102223 (0.00407) [2022-07-11 07:53:30,859][26022] Updated weights on worker 0-0, policy_version 1102233 (0.00083) [2022-07-11 07:53:32,287][25689] Fps is (10 sec: 5611.7, 60 sec: 5590.5, 300 sec: 5540.7). Total num frames: 1128693760. Throughput: 0: 5883.9. Samples: 1128699220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:32,287][25689] Avg episode reward: [(0, '-2.450')] [2022-07-11 07:53:32,716][26022] Updated weights on worker 0-0, policy_version 1102243 (0.00095) [2022-07-11 07:53:34,687][26022] Updated weights on worker 0-0, policy_version 1102253 (0.00086) [2022-07-11 07:53:36,545][26022] Updated weights on worker 0-0, policy_version 1102263 (0.00086) [2022-07-11 07:53:37,425][25689] Fps is (10 sec: 5449.7, 60 sec: 5534.2, 300 sec: 5538.2). Total num frames: 1128721408. Throughput: 0: 5004.7. Samples: 1128715840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:37,427][25689] Avg episode reward: [(0, '-1.448')] [2022-07-11 07:53:38,326][26022] Updated weights on worker 0-0, policy_version 1102273 (0.00090) [2022-07-11 07:53:39,914][26022] Updated weights on worker 0-0, policy_version 1102283 (0.00086) [2022-07-11 07:53:41,803][26022] Updated weights on worker 0-0, policy_version 1102293 (0.00085) [2022-07-11 07:53:42,428][25689] Fps is (10 sec: 5553.4, 60 sec: 5577.2, 300 sec: 5538.6). Total num frames: 1128750080. Throughput: 0: 5847.1. Samples: 1128749682. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:42,429][25689] Avg episode reward: [(0, '-0.763')] [2022-07-11 07:53:43,852][26022] Updated weights on worker 0-0, policy_version 1102303 (0.00091) [2022-07-11 07:53:45,551][26022] Updated weights on worker 0-0, policy_version 1102313 (0.00092) [2022-07-11 07:53:47,244][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:53:47,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001102322_1128777728.pth [2022-07-11 07:53:47,254][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001100372_1126780928.pth [2022-07-11 07:53:47,444][25689] Fps is (10 sec: 5621.4, 60 sec: 5544.1, 300 sec: 5539.5). Total num frames: 1128777728. Throughput: 0: 5837.7. Samples: 1128783372. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:47,445][25689] Avg episode reward: [(0, '-0.308')] [2022-07-11 07:53:47,548][26022] Updated weights on worker 0-0, policy_version 1102323 (0.00089) [2022-07-11 07:53:49,211][26022] Updated weights on worker 0-0, policy_version 1102333 (0.00089) [2022-07-11 07:53:51,119][26022] Updated weights on worker 0-0, policy_version 1102343 (0.00090) [2022-07-11 07:53:52,456][25689] Fps is (10 sec: 5718.7, 60 sec: 5579.1, 300 sec: 5544.7). Total num frames: 1128807424. Throughput: 0: 5001.3. Samples: 1128800090. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:52,456][25689] Avg episode reward: [(0, '-0.034')] [2022-07-11 07:53:52,887][26022] Updated weights on worker 0-0, policy_version 1102353 (0.00086) [2022-07-11 07:53:54,664][26022] Updated weights on worker 0-0, policy_version 1102363 (0.00090) [2022-07-11 07:53:56,603][26022] Updated weights on worker 0-0, policy_version 1102373 (0.00094) [2022-07-11 07:53:57,505][25689] Fps is (10 sec: 5597.8, 60 sec: 5563.8, 300 sec: 5537.4). Total num frames: 1128834048. Throughput: 0: 5854.4. Samples: 1128833396. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:53:57,509][25689] Avg episode reward: [(0, '1.615')] [2022-07-11 07:53:58,295][26022] Updated weights on worker 0-0, policy_version 1102383 (0.00082) [2022-07-11 07:54:00,370][26022] Updated weights on worker 0-0, policy_version 1102393 (0.00097) [2022-07-11 07:54:02,466][26022] Updated weights on worker 0-0, policy_version 1102403 (0.00086) [2022-07-11 07:54:02,600][25689] Fps is (10 sec: 5249.1, 60 sec: 5506.7, 300 sec: 5539.7). Total num frames: 1128860672. Throughput: 0: 5821.2. Samples: 1128867106. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:02,601][25689] Avg episode reward: [(0, '1.341')] [2022-07-11 07:54:04,248][26022] Updated weights on worker 0-0, policy_version 1102413 (0.00086) [2022-07-11 07:54:06,169][26022] Updated weights on worker 0-0, policy_version 1102423 (0.00082) [2022-07-11 07:54:07,691][25689] Fps is (10 sec: 5429.0, 60 sec: 5584.5, 300 sec: 5538.5). Total num frames: 1128889344. Throughput: 0: 5694.0. Samples: 1128898656. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:07,692][25689] Avg episode reward: [(0, '1.283')] [2022-07-11 07:54:08,107][26022] Updated weights on worker 0-0, policy_version 1102433 (0.00086) [2022-07-11 07:54:09,749][26022] Updated weights on worker 0-0, policy_version 1102443 (0.00086) [2022-07-11 07:54:11,680][26022] Updated weights on worker 0-0, policy_version 1102453 (0.00084) [2022-07-11 07:54:12,714][25689] Fps is (10 sec: 5669.8, 60 sec: 5549.2, 300 sec: 5539.5). Total num frames: 1128918016. Throughput: 0: 5689.6. Samples: 1128915352. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:12,715][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 07:54:13,434][26022] Updated weights on worker 0-0, policy_version 1102463 (0.00091) [2022-07-11 07:54:15,332][26022] Updated weights on worker 0-0, policy_version 1102473 (0.00084) [2022-07-11 07:54:17,274][26022] Updated weights on worker 0-0, policy_version 1102483 (0.00087) [2022-07-11 07:54:17,761][25689] Fps is (10 sec: 5592.8, 60 sec: 5566.9, 300 sec: 5538.9). Total num frames: 1128945664. Throughput: 0: 5701.5. Samples: 1128948882. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:17,761][25689] Avg episode reward: [(0, '1.553')] [2022-07-11 07:54:19,039][26022] Updated weights on worker 0-0, policy_version 1102493 (0.00085) [2022-07-11 07:54:20,890][26022] Updated weights on worker 0-0, policy_version 1102503 (0.00742) [2022-07-11 07:54:22,715][26022] Updated weights on worker 0-0, policy_version 1102513 (0.00119) [2022-07-11 07:54:22,816][25689] Fps is (10 sec: 5575.3, 60 sec: 5562.6, 300 sec: 5538.3). Total num frames: 1128974336. Throughput: 0: 5703.3. Samples: 1128982402. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:22,816][25689] Avg episode reward: [(0, '-0.193')] [2022-07-11 07:54:24,459][26022] Updated weights on worker 0-0, policy_version 1102523 (0.00085) [2022-07-11 07:54:26,302][26022] Updated weights on worker 0-0, policy_version 1102533 (0.00088) [2022-07-11 07:54:27,880][25689] Fps is (10 sec: 5565.5, 60 sec: 5542.3, 300 sec: 5540.6). Total num frames: 1129001984. Throughput: 0: 4974.1. Samples: 1128999080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:27,882][25689] Avg episode reward: [(0, '-0.168')] [2022-07-11 07:54:28,157][26022] Updated weights on worker 0-0, policy_version 1102543 (0.00088) [2022-07-11 07:54:29,915][26022] Updated weights on worker 0-0, policy_version 1102553 (0.00092) [2022-07-11 07:54:31,955][26022] Updated weights on worker 0-0, policy_version 1102563 (0.00094) [2022-07-11 07:54:32,892][25689] Fps is (10 sec: 5487.8, 60 sec: 5542.0, 300 sec: 5539.4). Total num frames: 1129029632. Throughput: 0: 5825.8. Samples: 1129032904. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:32,893][25689] Avg episode reward: [(0, '-0.053')] [2022-07-11 07:54:33,475][26022] Updated weights on worker 0-0, policy_version 1102573 (0.00083) [2022-07-11 07:54:35,594][26022] Updated weights on worker 0-0, policy_version 1102583 (0.00080) [2022-07-11 07:54:37,356][26022] Updated weights on worker 0-0, policy_version 1102593 (0.00087) [2022-07-11 07:54:38,005][25689] Fps is (10 sec: 5461.2, 60 sec: 5544.3, 300 sec: 5530.8). Total num frames: 1129057280. Throughput: 0: 5797.3. Samples: 1129066246. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:38,006][25689] Avg episode reward: [(0, '-0.122')] [2022-07-11 07:54:39,093][26022] Updated weights on worker 0-0, policy_version 1102603 (0.00088) [2022-07-11 07:54:41,069][26022] Updated weights on worker 0-0, policy_version 1102613 (0.00089) [2022-07-11 07:54:42,815][26022] Updated weights on worker 0-0, policy_version 1102623 (0.00094) [2022-07-11 07:54:43,043][25689] Fps is (10 sec: 5648.9, 60 sec: 5558.0, 300 sec: 5540.8). Total num frames: 1129086976. Throughput: 0: 4983.2. Samples: 1129083200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:43,045][25689] Avg episode reward: [(0, '-0.335')] [2022-07-11 07:54:44,679][26022] Updated weights on worker 0-0, policy_version 1102633 (0.00094) [2022-07-11 07:54:46,401][26022] Updated weights on worker 0-0, policy_version 1102643 (0.00093) [2022-07-11 07:54:48,063][25689] Fps is (10 sec: 5803.5, 60 sec: 5574.5, 300 sec: 5537.2). Total num frames: 1129115648. Throughput: 0: 5841.2. Samples: 1129116970. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:48,064][25689] Avg episode reward: [(0, '-0.913')] [2022-07-11 07:54:48,079][26022] Updated weights on worker 0-0, policy_version 1102653 (0.00083) [2022-07-11 07:54:50,007][26022] Updated weights on worker 0-0, policy_version 1102663 (0.00089) [2022-07-11 07:54:51,872][26022] Updated weights on worker 0-0, policy_version 1102673 (0.00088) [2022-07-11 07:54:53,067][25689] Fps is (10 sec: 5618.8, 60 sec: 5541.4, 300 sec: 5546.3). Total num frames: 1129143296. Throughput: 0: 5850.7. Samples: 1129150942. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:53,067][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 07:54:53,806][26022] Updated weights on worker 0-0, policy_version 1102683 (0.00086) [2022-07-11 07:54:55,534][26022] Updated weights on worker 0-0, policy_version 1102693 (0.00092) [2022-07-11 07:54:57,409][26022] Updated weights on worker 0-0, policy_version 1102703 (0.00081) [2022-07-11 07:54:58,197][25689] Fps is (10 sec: 5456.2, 60 sec: 5550.9, 300 sec: 5540.9). Total num frames: 1129170944. Throughput: 0: 5025.8. Samples: 1129167726. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:54:58,198][25689] Avg episode reward: [(0, '0.816')] [2022-07-11 07:54:59,153][26022] Updated weights on worker 0-0, policy_version 1102713 (0.00087) [2022-07-11 07:55:01,164][26022] Updated weights on worker 0-0, policy_version 1102723 (0.00092) [2022-07-11 07:55:03,207][25689] Fps is (10 sec: 5352.3, 60 sec: 5558.7, 300 sec: 5541.8). Total num frames: 1129197568. Throughput: 0: 5825.1. Samples: 1129200654. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:03,207][25689] Avg episode reward: [(0, '0.754')] [2022-07-11 07:55:03,283][26022] Updated weights on worker 0-0, policy_version 1102733 (0.00077) [2022-07-11 07:55:05,169][26022] Updated weights on worker 0-0, policy_version 1102743 (0.00091) [2022-07-11 07:55:06,964][26022] Updated weights on worker 0-0, policy_version 1102753 (0.00083) [2022-07-11 07:55:08,231][25689] Fps is (10 sec: 5408.9, 60 sec: 5547.9, 300 sec: 5541.7). Total num frames: 1129225216. Throughput: 0: 5751.4. Samples: 1129232966. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:08,232][25689] Avg episode reward: [(0, '0.324')] [2022-07-11 07:55:08,724][26022] Updated weights on worker 0-0, policy_version 1102763 (0.00097) [2022-07-11 07:55:10,407][26022] Updated weights on worker 0-0, policy_version 1102773 (0.00091) [2022-07-11 07:55:12,362][26022] Updated weights on worker 0-0, policy_version 1102783 (0.00085) [2022-07-11 07:55:13,256][25689] Fps is (10 sec: 5604.6, 60 sec: 5547.8, 300 sec: 5542.2). Total num frames: 1129253888. Throughput: 0: 4898.1. Samples: 1129249828. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:13,256][25689] Avg episode reward: [(0, '-0.097')] [2022-07-11 07:55:14,098][26022] Updated weights on worker 0-0, policy_version 1102793 (0.00080) [2022-07-11 07:55:16,192][26022] Updated weights on worker 0-0, policy_version 1102803 (0.00085) [2022-07-11 07:55:17,989][26022] Updated weights on worker 0-0, policy_version 1102813 (0.00090) [2022-07-11 07:55:18,389][25689] Fps is (10 sec: 5746.0, 60 sec: 5573.6, 300 sec: 5543.2). Total num frames: 1129283584. Throughput: 0: 5731.3. Samples: 1129283452. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:18,390][25689] Avg episode reward: [(0, '-0.124')] [2022-07-11 07:55:19,757][26022] Updated weights on worker 0-0, policy_version 1102823 (0.00088) [2022-07-11 07:55:21,407][26022] Updated weights on worker 0-0, policy_version 1102833 (0.00089) [2022-07-11 07:55:23,351][26022] Updated weights on worker 0-0, policy_version 1102843 (0.00085) [2022-07-11 07:55:23,450][25689] Fps is (10 sec: 5624.9, 60 sec: 5556.2, 300 sec: 5546.3). Total num frames: 1129311232. Throughput: 0: 5758.9. Samples: 1129317234. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:23,451][25689] Avg episode reward: [(0, '-0.370')] [2022-07-11 07:55:25,192][26022] Updated weights on worker 0-0, policy_version 1102853 (0.00093) [2022-07-11 07:55:26,902][26022] Updated weights on worker 0-0, policy_version 1102863 (0.00092) [2022-07-11 07:55:28,478][25689] Fps is (10 sec: 5480.6, 60 sec: 5559.5, 300 sec: 5546.2). Total num frames: 1129338880. Throughput: 0: 4992.6. Samples: 1129334052. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:28,479][25689] Avg episode reward: [(0, '-0.270')] [2022-07-11 07:55:28,930][26022] Updated weights on worker 0-0, policy_version 1102873 (0.00094) [2022-07-11 07:55:30,577][26022] Updated weights on worker 0-0, policy_version 1102883 (0.00092) [2022-07-11 07:55:32,587][26022] Updated weights on worker 0-0, policy_version 1102893 (0.00081) [2022-07-11 07:55:33,507][25689] Fps is (10 sec: 5600.4, 60 sec: 5574.9, 300 sec: 5547.8). Total num frames: 1129367552. Throughput: 0: 5820.4. Samples: 1129367696. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:33,507][25689] Avg episode reward: [(0, '-0.503')] [2022-07-11 07:55:34,413][26022] Updated weights on worker 0-0, policy_version 1102903 (0.00092) [2022-07-11 07:55:36,152][26022] Updated weights on worker 0-0, policy_version 1102913 (0.00089) [2022-07-11 07:55:38,059][26022] Updated weights on worker 0-0, policy_version 1102923 (0.00083) [2022-07-11 07:55:38,565][25689] Fps is (10 sec: 5583.9, 60 sec: 5580.0, 300 sec: 5548.0). Total num frames: 1129395200. Throughput: 0: 5838.5. Samples: 1129401246. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:38,566][25689] Avg episode reward: [(0, '0.563')] [2022-07-11 07:55:39,766][26022] Updated weights on worker 0-0, policy_version 1102933 (0.00089) [2022-07-11 07:55:41,847][26022] Updated weights on worker 0-0, policy_version 1102943 (0.00090) [2022-07-11 07:55:43,475][26022] Updated weights on worker 0-0, policy_version 1102953 (0.00088) [2022-07-11 07:55:43,597][25689] Fps is (10 sec: 5683.1, 60 sec: 5580.5, 300 sec: 5549.0). Total num frames: 1129424896. Throughput: 0: 5001.9. Samples: 1129418006. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:43,597][25689] Avg episode reward: [(0, '1.377')] [2022-07-11 07:55:45,354][26022] Updated weights on worker 0-0, policy_version 1102963 (0.00085) [2022-07-11 07:55:46,951][26022] Updated weights on worker 0-0, policy_version 1102973 (0.00092) [2022-07-11 07:55:47,286][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:55:47,302][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001102974_1129445376.pth [2022-07-11 07:55:47,303][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001101024_1127448576.pth [2022-07-11 07:55:48,623][25689] Fps is (10 sec: 5599.1, 60 sec: 5546.0, 300 sec: 5546.7). Total num frames: 1129451520. Throughput: 0: 5833.7. Samples: 1129451572. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:48,625][25689] Avg episode reward: [(0, '1.662')] [2022-07-11 07:55:49,031][26022] Updated weights on worker 0-0, policy_version 1102983 (0.00088) [2022-07-11 07:55:50,703][26022] Updated weights on worker 0-0, policy_version 1102993 (0.00091) [2022-07-11 07:55:52,929][26022] Updated weights on worker 0-0, policy_version 1103003 (0.00084) [2022-07-11 07:55:53,629][25689] Fps is (10 sec: 5511.6, 60 sec: 5562.8, 300 sec: 5548.1). Total num frames: 1129480192. Throughput: 0: 5830.8. Samples: 1129485028. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:53,631][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 07:55:54,622][26022] Updated weights on worker 0-0, policy_version 1103013 (0.00085) [2022-07-11 07:55:56,404][26022] Updated weights on worker 0-0, policy_version 1103023 (0.00085) [2022-07-11 07:55:58,350][26022] Updated weights on worker 0-0, policy_version 1103033 (0.00096) [2022-07-11 07:55:58,703][25689] Fps is (10 sec: 5587.7, 60 sec: 5568.0, 300 sec: 5546.8). Total num frames: 1129507840. Throughput: 0: 5002.2. Samples: 1129501980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:55:58,704][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 07:55:59,998][26022] Updated weights on worker 0-0, policy_version 1103043 (0.00081) [2022-07-11 07:56:02,286][26022] Updated weights on worker 0-0, policy_version 1103053 (0.00085) [2022-07-11 07:56:03,712][25689] Fps is (10 sec: 5382.8, 60 sec: 5568.1, 300 sec: 5550.1). Total num frames: 1129534464. Throughput: 0: 5761.6. Samples: 1129533900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:56:03,714][25689] Avg episode reward: [(0, '1.976')] [2022-07-11 07:56:03,905][26022] Updated weights on worker 0-0, policy_version 1103063 (0.00086) [2022-07-11 07:56:05,871][26022] Updated weights on worker 0-0, policy_version 1103073 (0.00060) [2022-07-11 07:56:07,701][26022] Updated weights on worker 0-0, policy_version 1103083 (0.00085) [2022-07-11 07:56:08,732][25689] Fps is (10 sec: 5411.5, 60 sec: 5568.5, 300 sec: 5547.2). Total num frames: 1129562112. Throughput: 0: 5775.7. Samples: 1129567712. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:56:08,733][25689] Avg episode reward: [(0, '1.928')] [2022-07-11 07:56:09,447][26022] Updated weights on worker 0-0, policy_version 1103093 (0.00080) [2022-07-11 07:56:11,259][26022] Updated weights on worker 0-0, policy_version 1103103 (0.00056) [2022-07-11 07:56:13,168][26022] Updated weights on worker 0-0, policy_version 1103113 (0.00095) [2022-07-11 07:56:13,734][25689] Fps is (10 sec: 5619.5, 60 sec: 5570.5, 300 sec: 5549.6). Total num frames: 1129590784. Throughput: 0: 4947.6. Samples: 1129584498. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:56:13,735][25689] Avg episode reward: [(0, '1.683')] [2022-07-11 07:56:15,008][26022] Updated weights on worker 0-0, policy_version 1103123 (0.00079) [2022-07-11 07:56:16,605][26022] Updated weights on worker 0-0, policy_version 1103133 (0.00090) [2022-07-11 07:56:18,527][26022] Updated weights on worker 0-0, policy_version 1103143 (0.00095) [2022-07-11 07:56:18,843][25689] Fps is (10 sec: 5570.1, 60 sec: 5538.9, 300 sec: 5555.0). Total num frames: 1129618432. Throughput: 0: 5760.1. Samples: 1129617990. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:56:18,843][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 07:56:20,338][26022] Updated weights on worker 0-0, policy_version 1103153 (0.00094) [2022-07-11 07:56:22,334][26022] Updated weights on worker 0-0, policy_version 1103163 (0.00096) [2022-07-11 07:56:23,865][25689] Fps is (10 sec: 5559.2, 60 sec: 5559.4, 300 sec: 5551.3). Total num frames: 1129647104. Throughput: 0: 5855.7. Samples: 1129651910. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:56:23,866][25689] Avg episode reward: [(0, '0.958')] [2022-07-11 07:56:24,007][26022] Updated weights on worker 0-0, policy_version 1103173 (0.00080) [2022-07-11 07:56:25,884][26022] Updated weights on worker 0-0, policy_version 1103183 (0.00084) [2022-07-11 07:56:27,683][26022] Updated weights on worker 0-0, policy_version 1103193 (0.00087) [2022-07-11 07:56:28,894][25689] Fps is (10 sec: 5705.3, 60 sec: 5576.3, 300 sec: 5551.1). Total num frames: 1129675776. Throughput: 0: 5859.4. Samples: 1129685850. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2022-07-11 07:56:28,894][25689] Avg episode reward: [(0, '0.267')] [2022-07-11 07:56:29,510][26022] Updated weights on worker 0-0, policy_version 1103203 (0.00091) [2022-07-11 07:56:31,278][26022] Updated weights on worker 0-0, policy_version 1103213 (0.00086) [2022-07-11 07:56:33,056][26022] Updated weights on worker 0-0, policy_version 1103223 (0.00089) [2022-07-11 07:56:33,909][25689] Fps is (10 sec: 5607.5, 60 sec: 5560.6, 300 sec: 5560.0). Total num frames: 1129703424. Throughput: 0: 5866.6. Samples: 1129702854. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:56:33,909][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 07:56:34,872][26022] Updated weights on worker 0-0, policy_version 1103233 (0.00082) [2022-07-11 07:56:36,688][26022] Updated weights on worker 0-0, policy_version 1103243 (0.00082) [2022-07-11 07:56:38,440][26022] Updated weights on worker 0-0, policy_version 1103253 (0.00099) [2022-07-11 07:56:38,975][25689] Fps is (10 sec: 5688.4, 60 sec: 5593.8, 300 sec: 5559.2). Total num frames: 1129733120. Throughput: 0: 5891.2. Samples: 1129736590. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:56:38,975][25689] Avg episode reward: [(0, '0.560')] [2022-07-11 07:56:40,502][26022] Updated weights on worker 0-0, policy_version 1103263 (0.00086) [2022-07-11 07:56:42,021][26022] Updated weights on worker 0-0, policy_version 1103273 (0.00084) [2022-07-11 07:56:44,003][25689] Fps is (10 sec: 5681.1, 60 sec: 5560.3, 300 sec: 5558.8). Total num frames: 1129760768. Throughput: 0: 5873.4. Samples: 1129770186. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:56:44,003][25689] Avg episode reward: [(0, '0.600')] [2022-07-11 07:56:44,231][26022] Updated weights on worker 0-0, policy_version 1103283 (0.00088) [2022-07-11 07:56:45,846][26022] Updated weights on worker 0-0, policy_version 1103293 (0.00094) [2022-07-11 07:56:47,852][26022] Updated weights on worker 0-0, policy_version 1103303 (0.00086) [2022-07-11 07:56:49,023][25689] Fps is (10 sec: 5503.2, 60 sec: 5577.8, 300 sec: 5555.2). Total num frames: 1129788416. Throughput: 0: 5032.8. Samples: 1129787154. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:56:49,024][25689] Avg episode reward: [(0, '0.128')] [2022-07-11 07:56:49,414][26022] Updated weights on worker 0-0, policy_version 1103313 (0.00087) [2022-07-11 07:56:51,156][26022] Updated weights on worker 0-0, policy_version 1103323 (0.00085) [2022-07-11 07:56:53,088][26022] Updated weights on worker 0-0, policy_version 1103333 (0.00090) [2022-07-11 07:56:54,041][25689] Fps is (10 sec: 5610.5, 60 sec: 5576.7, 300 sec: 5557.3). Total num frames: 1129817088. Throughput: 0: 5873.4. Samples: 1129821098. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:56:54,041][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 07:56:54,976][26022] Updated weights on worker 0-0, policy_version 1103343 (0.00083) [2022-07-11 07:56:56,736][26022] Updated weights on worker 0-0, policy_version 1103353 (0.00088) [2022-07-11 07:56:58,611][26022] Updated weights on worker 0-0, policy_version 1103363 (0.00093) [2022-07-11 07:56:59,092][25689] Fps is (10 sec: 5796.8, 60 sec: 5612.7, 300 sec: 5567.0). Total num frames: 1129846784. Throughput: 0: 5901.4. Samples: 1129855308. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:56:59,095][25689] Avg episode reward: [(0, '1.885')] [2022-07-11 07:57:00,218][26022] Updated weights on worker 0-0, policy_version 1103373 (0.00083) [2022-07-11 07:57:02,367][26022] Updated weights on worker 0-0, policy_version 1103383 (0.00093) [2022-07-11 07:57:04,098][25689] Fps is (10 sec: 5498.4, 60 sec: 5596.0, 300 sec: 5567.2). Total num frames: 1129872384. Throughput: 0: 5073.8. Samples: 1129872144. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:04,098][25689] Avg episode reward: [(0, '1.781')] [2022-07-11 07:57:04,323][26022] Updated weights on worker 0-0, policy_version 1103393 (0.00085) [2022-07-11 07:57:06,184][26022] Updated weights on worker 0-0, policy_version 1103403 (0.00090) [2022-07-11 07:57:08,282][26022] Updated weights on worker 0-0, policy_version 1103413 (0.00083) [2022-07-11 07:57:09,137][25689] Fps is (10 sec: 5300.6, 60 sec: 5594.2, 300 sec: 5563.4). Total num frames: 1129900032. Throughput: 0: 5794.6. Samples: 1129903710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:09,138][25689] Avg episode reward: [(0, '1.211')] [2022-07-11 07:57:09,709][26022] Updated weights on worker 0-0, policy_version 1103423 (0.00094) [2022-07-11 07:57:11,870][26022] Updated weights on worker 0-0, policy_version 1103433 (0.00093) [2022-07-11 07:57:13,411][26022] Updated weights on worker 0-0, policy_version 1103443 (0.00096) [2022-07-11 07:57:14,156][25689] Fps is (10 sec: 5599.4, 60 sec: 5592.7, 300 sec: 5571.2). Total num frames: 1129928704. Throughput: 0: 5769.7. Samples: 1129937156. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:14,156][25689] Avg episode reward: [(0, '2.100')] [2022-07-11 07:57:15,517][26022] Updated weights on worker 0-0, policy_version 1103453 (0.00080) [2022-07-11 07:57:17,039][26022] Updated weights on worker 0-0, policy_version 1103463 (0.00087) [2022-07-11 07:57:19,164][26022] Updated weights on worker 0-0, policy_version 1103473 (0.00086) [2022-07-11 07:57:19,228][25689] Fps is (10 sec: 5581.3, 60 sec: 5596.1, 300 sec: 5563.5). Total num frames: 1129956352. Throughput: 0: 4912.8. Samples: 1129954234. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:19,229][25689] Avg episode reward: [(0, '1.908')] [2022-07-11 07:57:20,689][26022] Updated weights on worker 0-0, policy_version 1103483 (0.00087) [2022-07-11 07:57:22,711][26022] Updated weights on worker 0-0, policy_version 1103493 (0.00084) [2022-07-11 07:57:24,269][25689] Fps is (10 sec: 5670.3, 60 sec: 5611.3, 300 sec: 5566.5). Total num frames: 1129986048. Throughput: 0: 5753.2. Samples: 1129988194. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:24,270][25689] Avg episode reward: [(0, '1.971')] [2022-07-11 07:57:24,412][26022] Updated weights on worker 0-0, policy_version 1103503 (0.00087) [2022-07-11 07:57:26,379][26022] Updated weights on worker 0-0, policy_version 1103513 (0.00094) [2022-07-11 07:57:28,119][26022] Updated weights on worker 0-0, policy_version 1103523 (0.00088) [2022-07-11 07:57:29,275][25689] Fps is (10 sec: 5707.6, 60 sec: 5596.4, 300 sec: 5573.9). Total num frames: 1130013696. Throughput: 0: 5876.5. Samples: 1130022050. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:29,276][25689] Avg episode reward: [(0, '1.742')] [2022-07-11 07:57:29,900][26022] Updated weights on worker 0-0, policy_version 1103533 (0.00097) [2022-07-11 07:57:31,591][26022] Updated weights on worker 0-0, policy_version 1103543 (0.00082) [2022-07-11 07:57:33,623][26022] Updated weights on worker 0-0, policy_version 1103553 (0.00059) [2022-07-11 07:57:34,325][25689] Fps is (10 sec: 5498.7, 60 sec: 5593.2, 300 sec: 5564.1). Total num frames: 1130041344. Throughput: 0: 5052.9. Samples: 1130039070. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:34,326][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 07:57:35,363][26022] Updated weights on worker 0-0, policy_version 1103563 (0.00093) [2022-07-11 07:57:37,251][26022] Updated weights on worker 0-0, policy_version 1103573 (0.00086) [2022-07-11 07:57:39,239][26022] Updated weights on worker 0-0, policy_version 1103583 (0.00087) [2022-07-11 07:57:39,440][25689] Fps is (10 sec: 5540.5, 60 sec: 5571.7, 300 sec: 5570.7). Total num frames: 1130070016. Throughput: 0: 5852.5. Samples: 1130072524. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:39,441][25689] Avg episode reward: [(0, '-0.271')] [2022-07-11 07:57:40,780][26022] Updated weights on worker 0-0, policy_version 1103593 (0.00089) [2022-07-11 07:57:42,822][26022] Updated weights on worker 0-0, policy_version 1103603 (0.00085) [2022-07-11 07:57:44,502][25689] Fps is (10 sec: 5634.7, 60 sec: 5585.5, 300 sec: 5566.6). Total num frames: 1130098688. Throughput: 0: 5829.9. Samples: 1130106150. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:44,503][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 07:57:44,561][26022] Updated weights on worker 0-0, policy_version 1103613 (0.00091) [2022-07-11 07:57:46,429][26022] Updated weights on worker 0-0, policy_version 1103623 (0.00085) [2022-07-11 07:57:47,580][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:57:47,593][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001103630_1130117120.pth [2022-07-11 07:57:47,593][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001101670_1128110080.pth [2022-07-11 07:57:48,284][26022] Updated weights on worker 0-0, policy_version 1103633 (0.00094) [2022-07-11 07:57:49,538][25689] Fps is (10 sec: 5678.7, 60 sec: 5600.9, 300 sec: 5569.8). Total num frames: 1130127360. Throughput: 0: 4986.6. Samples: 1130123094. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:49,539][25689] Avg episode reward: [(0, '-0.619')] [2022-07-11 07:57:49,882][26022] Updated weights on worker 0-0, policy_version 1103643 (0.00086) [2022-07-11 07:57:51,939][26022] Updated weights on worker 0-0, policy_version 1103653 (0.00087) [2022-07-11 07:57:53,816][26022] Updated weights on worker 0-0, policy_version 1103663 (0.00083) [2022-07-11 07:57:54,572][25689] Fps is (10 sec: 5694.9, 60 sec: 5599.5, 300 sec: 5573.9). Total num frames: 1130156032. Throughput: 0: 5812.4. Samples: 1130156750. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:54,572][25689] Avg episode reward: [(0, '-0.543')] [2022-07-11 07:57:55,484][26022] Updated weights on worker 0-0, policy_version 1103673 (0.00087) [2022-07-11 07:57:57,520][26022] Updated weights on worker 0-0, policy_version 1103683 (0.00089) [2022-07-11 07:57:59,022][26022] Updated weights on worker 0-0, policy_version 1103693 (0.00090) [2022-07-11 07:57:59,706][25689] Fps is (10 sec: 5539.3, 60 sec: 5558.1, 300 sec: 5565.0). Total num frames: 1130183680. Throughput: 0: 5814.7. Samples: 1130190362. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:57:59,706][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 07:58:01,160][26022] Updated weights on worker 0-0, policy_version 1103703 (0.00091) [2022-07-11 07:58:03,232][26022] Updated weights on worker 0-0, policy_version 1103713 (0.00096) [2022-07-11 07:58:04,719][25689] Fps is (10 sec: 5247.6, 60 sec: 5557.4, 300 sec: 5571.9). Total num frames: 1130209280. Throughput: 0: 4954.9. Samples: 1130206320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:04,719][25689] Avg episode reward: [(0, '0.710')] [2022-07-11 07:58:04,937][26022] Updated weights on worker 0-0, policy_version 1103723 (0.00108) [2022-07-11 07:58:06,818][26022] Updated weights on worker 0-0, policy_version 1103733 (0.00082) [2022-07-11 07:58:08,419][26022] Updated weights on worker 0-0, policy_version 1103743 (0.00096) [2022-07-11 07:58:09,752][25689] Fps is (10 sec: 5402.3, 60 sec: 5574.9, 300 sec: 5564.5). Total num frames: 1130237952. Throughput: 0: 5720.4. Samples: 1130238724. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:09,752][25689] Avg episode reward: [(0, '1.073')] [2022-07-11 07:58:10,558][26022] Updated weights on worker 0-0, policy_version 1103753 (0.01510) [2022-07-11 07:58:12,448][26022] Updated weights on worker 0-0, policy_version 1103763 (0.00080) [2022-07-11 07:58:14,114][26022] Updated weights on worker 0-0, policy_version 1103773 (0.00092) [2022-07-11 07:58:14,756][25689] Fps is (10 sec: 5713.1, 60 sec: 5576.2, 300 sec: 5572.4). Total num frames: 1130266624. Throughput: 0: 5724.0. Samples: 1130272286. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:14,757][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 07:58:16,104][26022] Updated weights on worker 0-0, policy_version 1103783 (0.00087) [2022-07-11 07:58:17,908][26022] Updated weights on worker 0-0, policy_version 1103793 (0.00092) [2022-07-11 07:58:19,748][26022] Updated weights on worker 0-0, policy_version 1103803 (0.00090) [2022-07-11 07:58:19,836][25689] Fps is (10 sec: 5585.2, 60 sec: 5575.6, 300 sec: 5567.6). Total num frames: 1130294272. Throughput: 0: 4894.8. Samples: 1130288894. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:19,836][25689] Avg episode reward: [(0, '-0.777')] [2022-07-11 07:58:21,652][26022] Updated weights on worker 0-0, policy_version 1103813 (0.00081) [2022-07-11 07:58:23,417][26022] Updated weights on worker 0-0, policy_version 1103823 (0.00088) [2022-07-11 07:58:24,838][25689] Fps is (10 sec: 5484.5, 60 sec: 5545.3, 300 sec: 5564.6). Total num frames: 1130321920. Throughput: 0: 5774.4. Samples: 1130322500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:24,840][25689] Avg episode reward: [(0, '-0.690')] [2022-07-11 07:58:25,279][26022] Updated weights on worker 0-0, policy_version 1103833 (0.00084) [2022-07-11 07:58:27,079][26022] Updated weights on worker 0-0, policy_version 1103843 (0.00098) [2022-07-11 07:58:28,898][26022] Updated weights on worker 0-0, policy_version 1103853 (0.00094) [2022-07-11 07:58:29,858][25689] Fps is (10 sec: 5619.5, 60 sec: 5560.9, 300 sec: 5567.9). Total num frames: 1130350592. Throughput: 0: 5820.1. Samples: 1130355744. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:29,865][25689] Avg episode reward: [(0, '-0.562')] [2022-07-11 07:58:30,921][26022] Updated weights on worker 0-0, policy_version 1103863 (0.00083) [2022-07-11 07:58:32,520][26022] Updated weights on worker 0-0, policy_version 1103873 (0.00092) [2022-07-11 07:58:34,608][26022] Updated weights on worker 0-0, policy_version 1103883 (0.00090) [2022-07-11 07:58:34,877][25689] Fps is (10 sec: 5610.4, 60 sec: 5563.8, 300 sec: 5570.1). Total num frames: 1130378240. Throughput: 0: 4987.1. Samples: 1130372630. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:34,878][25689] Avg episode reward: [(0, '-0.524')] [2022-07-11 07:58:36,273][26022] Updated weights on worker 0-0, policy_version 1103893 (0.00085) [2022-07-11 07:58:38,375][26022] Updated weights on worker 0-0, policy_version 1103903 (0.00093) [2022-07-11 07:58:39,994][25689] Fps is (10 sec: 5455.5, 60 sec: 5546.7, 300 sec: 5564.5). Total num frames: 1130405888. Throughput: 0: 5798.6. Samples: 1130405782. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:39,995][25689] Avg episode reward: [(0, '-0.384')] [2022-07-11 07:58:40,014][26022] Updated weights on worker 0-0, policy_version 1103913 (0.00079) [2022-07-11 07:58:42,048][26022] Updated weights on worker 0-0, policy_version 1103923 (0.00092) [2022-07-11 07:58:43,731][26022] Updated weights on worker 0-0, policy_version 1103933 (0.00084) [2022-07-11 07:58:45,083][25689] Fps is (10 sec: 5417.7, 60 sec: 5527.3, 300 sec: 5563.1). Total num frames: 1130433536. Throughput: 0: 5766.2. Samples: 1130439236. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:45,084][25689] Avg episode reward: [(0, '-0.540')] [2022-07-11 07:58:45,680][26022] Updated weights on worker 0-0, policy_version 1103943 (0.00091) [2022-07-11 07:58:47,387][26022] Updated weights on worker 0-0, policy_version 1103953 (0.00084) [2022-07-11 07:58:49,331][26022] Updated weights on worker 0-0, policy_version 1103963 (0.00085) [2022-07-11 07:58:50,099][25689] Fps is (10 sec: 5674.4, 60 sec: 5546.0, 300 sec: 5563.1). Total num frames: 1130463232. Throughput: 0: 4952.0. Samples: 1130455980. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:50,100][25689] Avg episode reward: [(0, '1.178')] [2022-07-11 07:58:51,025][26022] Updated weights on worker 0-0, policy_version 1103973 (0.00103) [2022-07-11 07:58:52,835][26022] Updated weights on worker 0-0, policy_version 1103983 (0.00088) [2022-07-11 07:58:54,758][26022] Updated weights on worker 0-0, policy_version 1103993 (0.00087) [2022-07-11 07:58:55,197][25689] Fps is (10 sec: 5670.2, 60 sec: 5523.3, 300 sec: 5565.6). Total num frames: 1130490880. Throughput: 0: 5752.6. Samples: 1130489522. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:58:55,197][25689] Avg episode reward: [(0, '1.366')] [2022-07-11 07:58:56,697][26022] Updated weights on worker 0-0, policy_version 1104003 (0.00088) [2022-07-11 07:58:58,339][26022] Updated weights on worker 0-0, policy_version 1104013 (0.00088) [2022-07-11 07:59:00,302][25689] Fps is (10 sec: 5419.6, 60 sec: 5525.9, 300 sec: 5568.8). Total num frames: 1130518528. Throughput: 0: 5791.6. Samples: 1130523402. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:00,303][25689] Avg episode reward: [(0, '1.367')] [2022-07-11 07:59:00,337][26022] Updated weights on worker 0-0, policy_version 1104023 (0.00085) [2022-07-11 07:59:01,852][26022] Updated weights on worker 0-0, policy_version 1104033 (0.00145) [2022-07-11 07:59:04,317][26022] Updated weights on worker 0-0, policy_version 1104043 (0.00087) [2022-07-11 07:59:05,373][25689] Fps is (10 sec: 5534.2, 60 sec: 5571.2, 300 sec: 5569.2). Total num frames: 1130547200. Throughput: 0: 5712.8. Samples: 1130555148. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:05,374][25689] Avg episode reward: [(0, '0.976')] [2022-07-11 07:59:05,824][26022] Updated weights on worker 0-0, policy_version 1104053 (0.00081) [2022-07-11 07:59:07,874][26022] Updated weights on worker 0-0, policy_version 1104063 (0.00086) [2022-07-11 07:59:09,546][26022] Updated weights on worker 0-0, policy_version 1104073 (0.00080) [2022-07-11 07:59:10,463][25689] Fps is (10 sec: 5441.9, 60 sec: 5532.3, 300 sec: 5561.1). Total num frames: 1130573824. Throughput: 0: 5697.1. Samples: 1130571996. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:10,464][25689] Avg episode reward: [(0, '1.625')] [2022-07-11 07:59:11,505][26022] Updated weights on worker 0-0, policy_version 1104083 (0.00081) [2022-07-11 07:59:13,456][26022] Updated weights on worker 0-0, policy_version 1104093 (0.00086) [2022-07-11 07:59:15,319][26022] Updated weights on worker 0-0, policy_version 1104103 (0.00092) [2022-07-11 07:59:15,499][25689] Fps is (10 sec: 5460.8, 60 sec: 5529.4, 300 sec: 5564.7). Total num frames: 1130602496. Throughput: 0: 5708.0. Samples: 1130605410. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:15,500][25689] Avg episode reward: [(0, '1.481')] [2022-07-11 07:59:16,923][26022] Updated weights on worker 0-0, policy_version 1104113 (0.00084) [2022-07-11 07:59:18,905][26022] Updated weights on worker 0-0, policy_version 1104123 (0.00087) [2022-07-11 07:59:20,562][25689] Fps is (10 sec: 5678.4, 60 sec: 5547.8, 300 sec: 5564.6). Total num frames: 1130631168. Throughput: 0: 5715.4. Samples: 1130639194. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:20,564][25689] Avg episode reward: [(0, '1.471')] [2022-07-11 07:59:20,712][26022] Updated weights on worker 0-0, policy_version 1104133 (0.00088) [2022-07-11 07:59:22,432][26022] Updated weights on worker 0-0, policy_version 1104143 (0.00086) [2022-07-11 07:59:24,218][26022] Updated weights on worker 0-0, policy_version 1104153 (0.00091) [2022-07-11 07:59:25,573][25689] Fps is (10 sec: 5692.5, 60 sec: 5563.9, 300 sec: 5569.0). Total num frames: 1130659840. Throughput: 0: 5000.7. Samples: 1130656160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:25,575][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 07:59:26,118][26022] Updated weights on worker 0-0, policy_version 1104163 (0.00090) [2022-07-11 07:59:27,826][26022] Updated weights on worker 0-0, policy_version 1104173 (0.00084) [2022-07-11 07:59:29,741][26022] Updated weights on worker 0-0, policy_version 1104183 (0.00085) [2022-07-11 07:59:30,664][25689] Fps is (10 sec: 5676.3, 60 sec: 5557.3, 300 sec: 5571.0). Total num frames: 1130688512. Throughput: 0: 5845.9. Samples: 1130690088. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:30,666][25689] Avg episode reward: [(0, '1.410')] [2022-07-11 07:59:31,635][26022] Updated weights on worker 0-0, policy_version 1104193 (0.00092) [2022-07-11 07:59:33,283][26022] Updated weights on worker 0-0, policy_version 1104203 (0.00081) [2022-07-11 07:59:35,216][26022] Updated weights on worker 0-0, policy_version 1104213 (0.00089) [2022-07-11 07:59:35,683][25689] Fps is (10 sec: 5570.6, 60 sec: 5557.3, 300 sec: 5572.7). Total num frames: 1130716160. Throughput: 0: 5878.0. Samples: 1130724052. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:35,685][25689] Avg episode reward: [(0, '1.355')] [2022-07-11 07:59:36,820][26022] Updated weights on worker 0-0, policy_version 1104223 (0.00061) [2022-07-11 07:59:38,708][26022] Updated weights on worker 0-0, policy_version 1104233 (0.00093) [2022-07-11 07:59:40,783][25689] Fps is (10 sec: 5464.7, 60 sec: 5558.9, 300 sec: 5564.7). Total num frames: 1130743808. Throughput: 0: 5031.2. Samples: 1130740930. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:40,785][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 07:59:40,839][26022] Updated weights on worker 0-0, policy_version 1104243 (0.00086) [2022-07-11 07:59:42,270][26022] Updated weights on worker 0-0, policy_version 1104253 (0.00097) [2022-07-11 07:59:44,449][26022] Updated weights on worker 0-0, policy_version 1104263 (0.00093) [2022-07-11 07:59:45,835][25689] Fps is (10 sec: 5749.6, 60 sec: 5612.9, 300 sec: 5571.0). Total num frames: 1130774528. Throughput: 0: 5837.9. Samples: 1130774448. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:45,835][25689] Avg episode reward: [(0, '1.474')] [2022-07-11 07:59:46,016][26022] Updated weights on worker 0-0, policy_version 1104273 (0.00619) [2022-07-11 07:59:47,818][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 07:59:47,833][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001104281_1130783744.pth [2022-07-11 07:59:47,834][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001102322_1128777728.pth [2022-07-11 07:59:48,039][26022] Updated weights on worker 0-0, policy_version 1104283 (0.00088) [2022-07-11 07:59:49,819][26022] Updated weights on worker 0-0, policy_version 1104293 (0.00082) [2022-07-11 07:59:50,885][25689] Fps is (10 sec: 5778.0, 60 sec: 5576.1, 300 sec: 5570.1). Total num frames: 1130802176. Throughput: 0: 5855.7. Samples: 1130808492. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:50,885][25689] Avg episode reward: [(0, '1.335')] [2022-07-11 07:59:51,568][26022] Updated weights on worker 0-0, policy_version 1104303 (0.00085) [2022-07-11 07:59:53,539][26022] Updated weights on worker 0-0, policy_version 1104313 (0.00088) [2022-07-11 07:59:54,932][26022] Updated weights on worker 0-0, policy_version 1104323 (0.00085) [2022-07-11 07:59:55,949][25689] Fps is (10 sec: 5467.0, 60 sec: 5579.1, 300 sec: 5571.3). Total num frames: 1130829824. Throughput: 0: 5011.9. Samples: 1130825624. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 07:59:55,950][25689] Avg episode reward: [(0, '1.295')] [2022-07-11 07:59:57,191][26022] Updated weights on worker 0-0, policy_version 1104333 (0.00084) [2022-07-11 07:59:58,955][26022] Updated weights on worker 0-0, policy_version 1104343 (0.00082) [2022-07-11 08:00:00,595][26022] Updated weights on worker 0-0, policy_version 1104353 (0.00088) [2022-07-11 08:00:01,017][25689] Fps is (10 sec: 5760.9, 60 sec: 5633.2, 300 sec: 5584.0). Total num frames: 1130860544. Throughput: 0: 5855.2. Samples: 1130859400. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:01,017][25689] Avg episode reward: [(0, '0.817')] [2022-07-11 08:00:02,972][26022] Updated weights on worker 0-0, policy_version 1104363 (0.00087) [2022-07-11 08:00:04,579][26022] Updated weights on worker 0-0, policy_version 1104373 (0.00090) [2022-07-11 08:00:06,027][25689] Fps is (10 sec: 5588.5, 60 sec: 5588.2, 300 sec: 5577.4). Total num frames: 1130886144. Throughput: 0: 5780.4. Samples: 1130891168. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:06,028][25689] Avg episode reward: [(0, '1.144')] [2022-07-11 08:00:06,566][26022] Updated weights on worker 0-0, policy_version 1104383 (0.00090) [2022-07-11 08:00:08,191][26022] Updated weights on worker 0-0, policy_version 1104393 (0.00096) [2022-07-11 08:00:10,202][26022] Updated weights on worker 0-0, policy_version 1104403 (0.00089) [2022-07-11 08:00:11,062][25689] Fps is (10 sec: 5198.8, 60 sec: 5593.3, 300 sec: 5570.3). Total num frames: 1130912768. Throughput: 0: 4948.0. Samples: 1130908328. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:11,063][25689] Avg episode reward: [(0, '0.413')] [2022-07-11 08:00:11,778][26022] Updated weights on worker 0-0, policy_version 1104413 (0.00086) [2022-07-11 08:00:13,777][26022] Updated weights on worker 0-0, policy_version 1104423 (0.00087) [2022-07-11 08:00:15,611][26022] Updated weights on worker 0-0, policy_version 1104433 (0.00083) [2022-07-11 08:00:16,107][25689] Fps is (10 sec: 5485.8, 60 sec: 5592.4, 300 sec: 5568.5). Total num frames: 1130941440. Throughput: 0: 5767.5. Samples: 1130941884. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:16,108][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 08:00:17,513][26022] Updated weights on worker 0-0, policy_version 1104443 (0.00084) [2022-07-11 08:00:19,371][26022] Updated weights on worker 0-0, policy_version 1104453 (0.00095) [2022-07-11 08:00:21,124][26022] Updated weights on worker 0-0, policy_version 1104463 (0.00087) [2022-07-11 08:00:21,219][25689] Fps is (10 sec: 5645.8, 60 sec: 5587.9, 300 sec: 5571.0). Total num frames: 1130970112. Throughput: 0: 5741.6. Samples: 1130975394. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:21,219][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 08:00:23,034][26022] Updated weights on worker 0-0, policy_version 1104473 (0.00081) [2022-07-11 08:00:24,619][26022] Updated weights on worker 0-0, policy_version 1104483 (0.00085) [2022-07-11 08:00:26,313][25689] Fps is (10 sec: 5619.0, 60 sec: 5580.3, 300 sec: 5573.2). Total num frames: 1130998784. Throughput: 0: 4989.0. Samples: 1130992372. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:26,313][25689] Avg episode reward: [(0, '0.218')] [2022-07-11 08:00:26,518][26022] Updated weights on worker 0-0, policy_version 1104493 (0.00093) [2022-07-11 08:00:28,440][26022] Updated weights on worker 0-0, policy_version 1104503 (0.00093) [2022-07-11 08:00:30,140][26022] Updated weights on worker 0-0, policy_version 1104513 (0.00085) [2022-07-11 08:00:31,364][25689] Fps is (10 sec: 5551.2, 60 sec: 5567.0, 300 sec: 5569.4). Total num frames: 1131026432. Throughput: 0: 5789.1. Samples: 1131025860. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2022-07-11 08:00:31,366][25689] Avg episode reward: [(0, '-0.132')] [2022-07-11 08:00:31,982][26022] Updated weights on worker 0-0, policy_version 1104523 (0.00086) [2022-07-11 08:00:34,034][26022] Updated weights on worker 0-0, policy_version 1104533 (0.00090) [2022-07-11 08:00:35,524][26022] Updated weights on worker 0-0, policy_version 1104543 (0.00085) [2022-07-11 08:00:36,413][25689] Fps is (10 sec: 5677.1, 60 sec: 5598.0, 300 sec: 5576.4). Total num frames: 1131056128. Throughput: 0: 5804.4. Samples: 1131059750. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:00:36,416][25689] Avg episode reward: [(0, '0.264')] [2022-07-11 08:00:37,718][26022] Updated weights on worker 0-0, policy_version 1104553 (0.00103) [2022-07-11 08:00:39,214][26022] Updated weights on worker 0-0, policy_version 1104563 (0.00077) [2022-07-11 08:00:41,398][26022] Updated weights on worker 0-0, policy_version 1104573 (0.00086) [2022-07-11 08:00:41,476][25689] Fps is (10 sec: 5569.6, 60 sec: 5584.5, 300 sec: 5565.5). Total num frames: 1131082752. Throughput: 0: 5812.9. Samples: 1131093150. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:00:41,477][25689] Avg episode reward: [(0, '0.265')] [2022-07-11 08:00:42,973][26022] Updated weights on worker 0-0, policy_version 1104583 (0.00092) [2022-07-11 08:00:44,822][26022] Updated weights on worker 0-0, policy_version 1104593 (0.00085) [2022-07-11 08:00:46,496][25689] Fps is (10 sec: 5585.8, 60 sec: 5570.6, 300 sec: 5576.0). Total num frames: 1131112448. Throughput: 0: 5810.1. Samples: 1131109640. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:00:46,497][25689] Avg episode reward: [(0, '-0.245')] [2022-07-11 08:00:46,672][26022] Updated weights on worker 0-0, policy_version 1104603 (0.00094) [2022-07-11 08:00:48,596][26022] Updated weights on worker 0-0, policy_version 1104613 (0.00089) [2022-07-11 08:00:50,407][26022] Updated weights on worker 0-0, policy_version 1104623 (0.00088) [2022-07-11 08:00:51,503][25689] Fps is (10 sec: 5617.0, 60 sec: 5557.7, 300 sec: 5569.1). Total num frames: 1131139072. Throughput: 0: 5830.5. Samples: 1131143278. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:00:51,505][25689] Avg episode reward: [(0, '-0.462')] [2022-07-11 08:00:52,283][26022] Updated weights on worker 0-0, policy_version 1104633 (0.00083) [2022-07-11 08:00:53,975][26022] Updated weights on worker 0-0, policy_version 1104643 (0.00085) [2022-07-11 08:00:55,940][26022] Updated weights on worker 0-0, policy_version 1104653 (0.00084) [2022-07-11 08:00:56,518][25689] Fps is (10 sec: 5415.2, 60 sec: 5562.2, 300 sec: 5570.2). Total num frames: 1131166720. Throughput: 0: 5834.4. Samples: 1131177050. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:00:56,520][25689] Avg episode reward: [(0, '-0.358')] [2022-07-11 08:00:57,530][26022] Updated weights on worker 0-0, policy_version 1104663 (0.00095) [2022-07-11 08:00:59,660][26022] Updated weights on worker 0-0, policy_version 1104673 (0.00096) [2022-07-11 08:01:01,503][26022] Updated weights on worker 0-0, policy_version 1104683 (0.00094) [2022-07-11 08:01:01,597][25689] Fps is (10 sec: 5579.4, 60 sec: 5527.3, 300 sec: 5575.7). Total num frames: 1131195392. Throughput: 0: 4997.9. Samples: 1131193712. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:01,598][25689] Avg episode reward: [(0, '0.045')] [2022-07-11 08:01:03,611][26022] Updated weights on worker 0-0, policy_version 1104693 (0.00097) [2022-07-11 08:01:05,330][26022] Updated weights on worker 0-0, policy_version 1104703 (0.00089) [2022-07-11 08:01:06,603][25689] Fps is (10 sec: 5381.5, 60 sec: 5527.7, 300 sec: 5569.1). Total num frames: 1131220992. Throughput: 0: 5759.5. Samples: 1131225446. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:06,604][25689] Avg episode reward: [(0, '-0.127')] [2022-07-11 08:01:07,258][26022] Updated weights on worker 0-0, policy_version 1104713 (0.00086) [2022-07-11 08:01:09,102][26022] Updated weights on worker 0-0, policy_version 1104723 (0.00089) [2022-07-11 08:01:10,912][26022] Updated weights on worker 0-0, policy_version 1104733 (0.00091) [2022-07-11 08:01:11,612][25689] Fps is (10 sec: 5419.2, 60 sec: 5563.9, 300 sec: 5569.0). Total num frames: 1131249664. Throughput: 0: 5750.9. Samples: 1131258924. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:11,613][25689] Avg episode reward: [(0, '0.888')] [2022-07-11 08:01:12,804][26022] Updated weights on worker 0-0, policy_version 1104743 (0.00095) [2022-07-11 08:01:14,714][26022] Updated weights on worker 0-0, policy_version 1104753 (0.00096) [2022-07-11 08:01:16,367][26022] Updated weights on worker 0-0, policy_version 1104763 (0.00088) [2022-07-11 08:01:16,617][25689] Fps is (10 sec: 5726.7, 60 sec: 5567.7, 300 sec: 5574.4). Total num frames: 1131278336. Throughput: 0: 4908.2. Samples: 1131275696. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:16,617][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 08:01:18,316][26022] Updated weights on worker 0-0, policy_version 1104773 (0.00090) [2022-07-11 08:01:20,144][26022] Updated weights on worker 0-0, policy_version 1104783 (0.00114) [2022-07-11 08:01:21,661][25689] Fps is (10 sec: 5502.8, 60 sec: 5540.0, 300 sec: 5567.1). Total num frames: 1131304960. Throughput: 0: 5756.7. Samples: 1131309210. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:21,661][25689] Avg episode reward: [(0, '0.216')] [2022-07-11 08:01:21,925][26022] Updated weights on worker 0-0, policy_version 1104793 (0.00083) [2022-07-11 08:01:23,758][26022] Updated weights on worker 0-0, policy_version 1104803 (0.00092) [2022-07-11 08:01:25,528][26022] Updated weights on worker 0-0, policy_version 1104813 (0.00088) [2022-07-11 08:01:26,668][25689] Fps is (10 sec: 5501.6, 60 sec: 5548.0, 300 sec: 5567.5). Total num frames: 1131333632. Throughput: 0: 5839.1. Samples: 1131342604. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:26,668][25689] Avg episode reward: [(0, '-1.481')] [2022-07-11 08:01:27,494][26022] Updated weights on worker 0-0, policy_version 1104823 (0.00096) [2022-07-11 08:01:29,238][26022] Updated weights on worker 0-0, policy_version 1104833 (0.00569) [2022-07-11 08:01:30,922][26022] Updated weights on worker 0-0, policy_version 1104843 (0.00090) [2022-07-11 08:01:31,675][25689] Fps is (10 sec: 5726.1, 60 sec: 5569.0, 300 sec: 5571.1). Total num frames: 1131362304. Throughput: 0: 5015.7. Samples: 1131359554. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:31,676][25689] Avg episode reward: [(0, '-2.024')] [2022-07-11 08:01:32,986][26022] Updated weights on worker 0-0, policy_version 1104853 (0.00094) [2022-07-11 08:01:34,698][26022] Updated weights on worker 0-0, policy_version 1104863 (0.00089) [2022-07-11 08:01:36,467][26022] Updated weights on worker 0-0, policy_version 1104873 (0.00085) [2022-07-11 08:01:36,706][25689] Fps is (10 sec: 5712.4, 60 sec: 5553.7, 300 sec: 5568.3). Total num frames: 1131390976. Throughput: 0: 5869.2. Samples: 1131393604. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:36,707][25689] Avg episode reward: [(0, '-2.074')] [2022-07-11 08:01:38,412][26022] Updated weights on worker 0-0, policy_version 1104883 (0.00093) [2022-07-11 08:01:40,188][26022] Updated weights on worker 0-0, policy_version 1104893 (0.00091) [2022-07-11 08:01:41,751][25689] Fps is (10 sec: 5589.8, 60 sec: 5572.4, 300 sec: 5568.0). Total num frames: 1131418624. Throughput: 0: 5850.9. Samples: 1131426754. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:41,751][25689] Avg episode reward: [(0, '-2.737')] [2022-07-11 08:01:42,157][26022] Updated weights on worker 0-0, policy_version 1104903 (0.00094) [2022-07-11 08:01:43,931][26022] Updated weights on worker 0-0, policy_version 1104913 (0.00086) [2022-07-11 08:01:45,892][26022] Updated weights on worker 0-0, policy_version 1104923 (0.00095) [2022-07-11 08:01:46,772][25689] Fps is (10 sec: 5290.0, 60 sec: 5504.3, 300 sec: 5561.1). Total num frames: 1131444224. Throughput: 0: 5017.8. Samples: 1131443484. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:46,772][25689] Avg episode reward: [(0, '-2.155')] [2022-07-11 08:01:47,547][26022] Updated weights on worker 0-0, policy_version 1104933 (0.00086) [2022-07-11 08:01:47,948][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:01:47,962][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001104935_1131453440.pth [2022-07-11 08:01:47,962][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001102974_1129445376.pth [2022-07-11 08:01:49,471][26022] Updated weights on worker 0-0, policy_version 1104943 (0.00086) [2022-07-11 08:01:51,064][26022] Updated weights on worker 0-0, policy_version 1104953 (0.00083) [2022-07-11 08:01:51,777][25689] Fps is (10 sec: 5515.3, 60 sec: 5555.5, 300 sec: 5564.8). Total num frames: 1131473920. Throughput: 0: 5852.7. Samples: 1131477200. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:51,777][25689] Avg episode reward: [(0, '-0.540')] [2022-07-11 08:01:53,135][26022] Updated weights on worker 0-0, policy_version 1104963 (0.00089) [2022-07-11 08:01:54,916][26022] Updated weights on worker 0-0, policy_version 1104973 (0.00084) [2022-07-11 08:01:56,725][26022] Updated weights on worker 0-0, policy_version 1104983 (0.00091) [2022-07-11 08:01:56,811][25689] Fps is (10 sec: 5916.2, 60 sec: 5587.7, 300 sec: 5565.1). Total num frames: 1131503616. Throughput: 0: 5840.5. Samples: 1131511026. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:01:56,811][25689] Avg episode reward: [(0, '1.178')] [2022-07-11 08:01:58,554][26022] Updated weights on worker 0-0, policy_version 1104993 (0.00087) [2022-07-11 08:02:00,154][26022] Updated weights on worker 0-0, policy_version 1105003 (0.00080) [2022-07-11 08:02:01,860][25689] Fps is (10 sec: 5484.0, 60 sec: 5539.5, 300 sec: 5564.3). Total num frames: 1131529216. Throughput: 0: 5028.6. Samples: 1131527874. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:01,860][25689] Avg episode reward: [(0, '0.310')] [2022-07-11 08:02:02,698][26022] Updated weights on worker 0-0, policy_version 1105013 (0.00092) [2022-07-11 08:02:04,183][26022] Updated weights on worker 0-0, policy_version 1105023 (0.00096) [2022-07-11 08:02:06,494][26022] Updated weights on worker 0-0, policy_version 1105033 (0.00086) [2022-07-11 08:02:06,868][25689] Fps is (10 sec: 5294.6, 60 sec: 5573.3, 300 sec: 5564.9). Total num frames: 1131556864. Throughput: 0: 5759.8. Samples: 1131559232. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:06,868][25689] Avg episode reward: [(0, '-0.042')] [2022-07-11 08:02:07,829][26022] Updated weights on worker 0-0, policy_version 1105043 (0.00087) [2022-07-11 08:02:09,943][26022] Updated weights on worker 0-0, policy_version 1105053 (0.00089) [2022-07-11 08:02:11,619][26022] Updated weights on worker 0-0, policy_version 1105063 (0.00080) [2022-07-11 08:02:11,893][25689] Fps is (10 sec: 5613.4, 60 sec: 5571.8, 300 sec: 5564.8). Total num frames: 1131585536. Throughput: 0: 5766.3. Samples: 1131593196. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:11,894][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 08:02:13,522][26022] Updated weights on worker 0-0, policy_version 1105073 (0.00097) [2022-07-11 08:02:15,571][26022] Updated weights on worker 0-0, policy_version 1105083 (0.00097) [2022-07-11 08:02:16,926][25689] Fps is (10 sec: 5599.7, 60 sec: 5552.2, 300 sec: 5565.5). Total num frames: 1131613184. Throughput: 0: 4907.8. Samples: 1131609746. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:16,926][25689] Avg episode reward: [(0, '0.555')] [2022-07-11 08:02:17,087][26022] Updated weights on worker 0-0, policy_version 1105093 (0.00084) [2022-07-11 08:02:19,120][26022] Updated weights on worker 0-0, policy_version 1105103 (0.00085) [2022-07-11 08:02:20,800][26022] Updated weights on worker 0-0, policy_version 1105113 (0.00089) [2022-07-11 08:02:21,997][25689] Fps is (10 sec: 5472.7, 60 sec: 5566.7, 300 sec: 5558.0). Total num frames: 1131640832. Throughput: 0: 5725.2. Samples: 1131643162. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:21,999][25689] Avg episode reward: [(0, '0.431')] [2022-07-11 08:02:22,891][26022] Updated weights on worker 0-0, policy_version 1105123 (0.00098) [2022-07-11 08:02:24,590][26022] Updated weights on worker 0-0, policy_version 1105133 (0.00095) [2022-07-11 08:02:26,558][26022] Updated weights on worker 0-0, policy_version 1105143 (0.00089) [2022-07-11 08:02:27,030][25689] Fps is (10 sec: 5573.9, 60 sec: 5564.3, 300 sec: 5561.0). Total num frames: 1131669504. Throughput: 0: 5840.1. Samples: 1131676980. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:27,032][25689] Avg episode reward: [(0, '0.309')] [2022-07-11 08:02:28,048][26022] Updated weights on worker 0-0, policy_version 1105153 (0.00087) [2022-07-11 08:02:30,071][26022] Updated weights on worker 0-0, policy_version 1105163 (0.00086) [2022-07-11 08:02:31,843][26022] Updated weights on worker 0-0, policy_version 1105173 (0.00093) [2022-07-11 08:02:32,042][25689] Fps is (10 sec: 5708.5, 60 sec: 5563.9, 300 sec: 5565.1). Total num frames: 1131698176. Throughput: 0: 5013.9. Samples: 1131694222. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:32,043][25689] Avg episode reward: [(0, '1.325')] [2022-07-11 08:02:33,685][26022] Updated weights on worker 0-0, policy_version 1105183 (0.00084) [2022-07-11 08:02:35,495][26022] Updated weights on worker 0-0, policy_version 1105193 (0.00524) [2022-07-11 08:02:37,055][25689] Fps is (10 sec: 5720.2, 60 sec: 5565.5, 300 sec: 5567.0). Total num frames: 1131726848. Throughput: 0: 5882.4. Samples: 1131728154. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:37,056][25689] Avg episode reward: [(0, '1.496')] [2022-07-11 08:02:37,091][26022] Updated weights on worker 0-0, policy_version 1105203 (0.00081) [2022-07-11 08:02:39,098][26022] Updated weights on worker 0-0, policy_version 1105213 (0.00095) [2022-07-11 08:02:40,856][26022] Updated weights on worker 0-0, policy_version 1105223 (0.00090) [2022-07-11 08:02:42,091][25689] Fps is (10 sec: 5502.8, 60 sec: 5549.3, 300 sec: 5560.6). Total num frames: 1131753472. Throughput: 0: 5924.3. Samples: 1131762206. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:42,092][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 08:02:42,566][26022] Updated weights on worker 0-0, policy_version 1105233 (0.00086) [2022-07-11 08:02:44,379][26022] Updated weights on worker 0-0, policy_version 1105243 (0.00105) [2022-07-11 08:02:46,343][26022] Updated weights on worker 0-0, policy_version 1105253 (0.00087) [2022-07-11 08:02:47,093][25689] Fps is (10 sec: 5406.4, 60 sec: 5585.0, 300 sec: 5557.8). Total num frames: 1131781120. Throughput: 0: 5082.0. Samples: 1131778942. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:47,094][25689] Avg episode reward: [(0, '1.516')] [2022-07-11 08:02:47,868][26022] Updated weights on worker 0-0, policy_version 1105263 (0.00092) [2022-07-11 08:02:50,109][26022] Updated weights on worker 0-0, policy_version 1105273 (0.00091) [2022-07-11 08:02:51,495][26022] Updated weights on worker 0-0, policy_version 1105283 (0.00488) [2022-07-11 08:02:52,120][25689] Fps is (10 sec: 5819.5, 60 sec: 5599.9, 300 sec: 5564.8). Total num frames: 1131811840. Throughput: 0: 5919.1. Samples: 1131813068. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:52,121][25689] Avg episode reward: [(0, '1.759')] [2022-07-11 08:02:53,704][26022] Updated weights on worker 0-0, policy_version 1105293 (0.00091) [2022-07-11 08:02:55,579][26022] Updated weights on worker 0-0, policy_version 1105303 (0.00088) [2022-07-11 08:02:57,139][25689] Fps is (10 sec: 5810.0, 60 sec: 5567.4, 300 sec: 5567.0). Total num frames: 1131839488. Throughput: 0: 5907.0. Samples: 1131846794. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:02:57,141][25689] Avg episode reward: [(0, '1.850')] [2022-07-11 08:02:57,159][26022] Updated weights on worker 0-0, policy_version 1105313 (0.00095) [2022-07-11 08:02:59,220][26022] Updated weights on worker 0-0, policy_version 1105323 (0.00093) [2022-07-11 08:03:00,827][26022] Updated weights on worker 0-0, policy_version 1105333 (0.00085) [2022-07-11 08:03:02,243][25689] Fps is (10 sec: 5564.1, 60 sec: 5613.2, 300 sec: 5575.6). Total num frames: 1131868160. Throughput: 0: 5031.4. Samples: 1131863598. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:02,243][25689] Avg episode reward: [(0, '1.299')] [2022-07-11 08:03:03,168][26022] Updated weights on worker 0-0, policy_version 1105343 (0.00086) [2022-07-11 08:03:04,793][26022] Updated weights on worker 0-0, policy_version 1105353 (0.00088) [2022-07-11 08:03:06,639][26022] Updated weights on worker 0-0, policy_version 1105363 (0.00085) [2022-07-11 08:03:07,254][25689] Fps is (10 sec: 5466.8, 60 sec: 5596.0, 300 sec: 5569.1). Total num frames: 1131894784. Throughput: 0: 5780.5. Samples: 1131895482. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:07,256][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 08:03:08,387][26022] Updated weights on worker 0-0, policy_version 1105373 (0.00097) [2022-07-11 08:03:10,370][26022] Updated weights on worker 0-0, policy_version 1105383 (0.00088) [2022-07-11 08:03:11,973][26022] Updated weights on worker 0-0, policy_version 1105393 (0.00080) [2022-07-11 08:03:12,267][25689] Fps is (10 sec: 5515.9, 60 sec: 5597.0, 300 sec: 5569.0). Total num frames: 1131923456. Throughput: 0: 5756.5. Samples: 1131929044. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:12,268][25689] Avg episode reward: [(0, '0.638')] [2022-07-11 08:03:14,186][26022] Updated weights on worker 0-0, policy_version 1105403 (0.00092) [2022-07-11 08:03:15,649][26022] Updated weights on worker 0-0, policy_version 1105413 (0.00086) [2022-07-11 08:03:17,311][25689] Fps is (10 sec: 5498.6, 60 sec: 5579.1, 300 sec: 5566.2). Total num frames: 1131950080. Throughput: 0: 4916.8. Samples: 1131945972. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:17,311][25689] Avg episode reward: [(0, '0.620')] [2022-07-11 08:03:17,759][26022] Updated weights on worker 0-0, policy_version 1105423 (0.00090) [2022-07-11 08:03:19,352][26022] Updated weights on worker 0-0, policy_version 1105433 (0.00092) [2022-07-11 08:03:21,319][26022] Updated weights on worker 0-0, policy_version 1105443 (0.00093) [2022-07-11 08:03:22,439][25689] Fps is (10 sec: 5537.1, 60 sec: 5607.7, 300 sec: 5570.7). Total num frames: 1131979776. Throughput: 0: 5730.4. Samples: 1131979330. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:22,439][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 08:03:23,193][26022] Updated weights on worker 0-0, policy_version 1105453 (0.00091) [2022-07-11 08:03:24,944][26022] Updated weights on worker 0-0, policy_version 1105463 (0.00093) [2022-07-11 08:03:26,990][26022] Updated weights on worker 0-0, policy_version 1105473 (0.00096) [2022-07-11 08:03:27,469][25689] Fps is (10 sec: 5645.1, 60 sec: 5591.1, 300 sec: 5567.1). Total num frames: 1132007424. Throughput: 0: 5791.1. Samples: 1132012546. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:27,471][25689] Avg episode reward: [(0, '0.062')] [2022-07-11 08:03:28,567][26022] Updated weights on worker 0-0, policy_version 1105483 (0.00084) [2022-07-11 08:03:30,607][26022] Updated weights on worker 0-0, policy_version 1105493 (0.00090) [2022-07-11 08:03:32,463][26022] Updated weights on worker 0-0, policy_version 1105503 (0.00092) [2022-07-11 08:03:32,550][25689] Fps is (10 sec: 5468.9, 60 sec: 5567.8, 300 sec: 5565.9). Total num frames: 1132035072. Throughput: 0: 4946.5. Samples: 1132029370. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:32,550][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 08:03:34,320][26022] Updated weights on worker 0-0, policy_version 1105513 (0.00105) [2022-07-11 08:03:36,108][26022] Updated weights on worker 0-0, policy_version 1105523 (0.00077) [2022-07-11 08:03:37,583][25689] Fps is (10 sec: 5568.5, 60 sec: 5565.9, 300 sec: 5571.0). Total num frames: 1132063744. Throughput: 0: 5764.0. Samples: 1132062820. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:37,584][25689] Avg episode reward: [(0, '0.022')] [2022-07-11 08:03:38,087][26022] Updated weights on worker 0-0, policy_version 1105533 (0.00082) [2022-07-11 08:03:40,019][26022] Updated weights on worker 0-0, policy_version 1105543 (0.00082) [2022-07-11 08:03:41,655][26022] Updated weights on worker 0-0, policy_version 1105553 (0.00087) [2022-07-11 08:03:42,643][25689] Fps is (10 sec: 5580.1, 60 sec: 5580.7, 300 sec: 5571.5). Total num frames: 1132091392. Throughput: 0: 5774.0. Samples: 1132095988. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:42,643][25689] Avg episode reward: [(0, '-0.277')] [2022-07-11 08:03:43,635][26022] Updated weights on worker 0-0, policy_version 1105563 (0.00096) [2022-07-11 08:03:45,212][26022] Updated weights on worker 0-0, policy_version 1105573 (0.00091) [2022-07-11 08:03:47,262][26022] Updated weights on worker 0-0, policy_version 1105583 (0.00084) [2022-07-11 08:03:47,672][25689] Fps is (10 sec: 5582.3, 60 sec: 5595.1, 300 sec: 5567.8). Total num frames: 1132120064. Throughput: 0: 4967.9. Samples: 1132112914. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:47,672][25689] Avg episode reward: [(0, '-0.287')] [2022-07-11 08:03:48,026][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:03:48,048][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001105589_1132123136.pth [2022-07-11 08:03:48,048][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001103630_1130117120.pth [2022-07-11 08:03:48,876][26022] Updated weights on worker 0-0, policy_version 1105593 (0.00089) [2022-07-11 08:03:50,876][26022] Updated weights on worker 0-0, policy_version 1105603 (0.00098) [2022-07-11 08:03:52,693][25689] Fps is (10 sec: 5501.8, 60 sec: 5528.0, 300 sec: 5565.8). Total num frames: 1132146688. Throughput: 0: 5817.8. Samples: 1132146562. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:52,694][25689] Avg episode reward: [(0, '-0.009')] [2022-07-11 08:03:52,714][26022] Updated weights on worker 0-0, policy_version 1105613 (0.00088) [2022-07-11 08:03:54,476][26022] Updated weights on worker 0-0, policy_version 1105623 (0.00095) [2022-07-11 08:03:56,322][26022] Updated weights on worker 0-0, policy_version 1105633 (0.00052) [2022-07-11 08:03:57,699][25689] Fps is (10 sec: 5616.6, 60 sec: 5563.0, 300 sec: 5574.6). Total num frames: 1132176384. Throughput: 0: 5830.8. Samples: 1132180116. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:03:57,700][25689] Avg episode reward: [(0, '0.303')] [2022-07-11 08:03:58,219][26022] Updated weights on worker 0-0, policy_version 1105643 (0.00085) [2022-07-11 08:04:00,085][26022] Updated weights on worker 0-0, policy_version 1105653 (0.00087) [2022-07-11 08:04:01,856][26022] Updated weights on worker 0-0, policy_version 1105663 (0.00105) [2022-07-11 08:04:02,743][25689] Fps is (10 sec: 5502.5, 60 sec: 5517.7, 300 sec: 5564.7). Total num frames: 1132201984. Throughput: 0: 5019.0. Samples: 1132196870. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:02,743][25689] Avg episode reward: [(0, '-0.126')] [2022-07-11 08:04:04,104][26022] Updated weights on worker 0-0, policy_version 1105673 (0.00087) [2022-07-11 08:04:05,759][26022] Updated weights on worker 0-0, policy_version 1105683 (0.00093) [2022-07-11 08:04:07,751][26022] Updated weights on worker 0-0, policy_version 1105693 (0.00093) [2022-07-11 08:04:07,752][25689] Fps is (10 sec: 5296.7, 60 sec: 5534.8, 300 sec: 5569.7). Total num frames: 1132229632. Throughput: 0: 5744.0. Samples: 1132228256. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:07,753][25689] Avg episode reward: [(0, '-0.774')] [2022-07-11 08:04:09,554][26022] Updated weights on worker 0-0, policy_version 1105703 (0.00086) [2022-07-11 08:04:11,390][26022] Updated weights on worker 0-0, policy_version 1105713 (0.00090) [2022-07-11 08:04:12,775][25689] Fps is (10 sec: 5410.0, 60 sec: 5500.2, 300 sec: 5563.1). Total num frames: 1132256256. Throughput: 0: 5738.2. Samples: 1132261792. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:12,775][25689] Avg episode reward: [(0, '-0.536')] [2022-07-11 08:04:13,220][26022] Updated weights on worker 0-0, policy_version 1105723 (0.00091) [2022-07-11 08:04:14,981][26022] Updated weights on worker 0-0, policy_version 1105733 (0.00085) [2022-07-11 08:04:16,769][26022] Updated weights on worker 0-0, policy_version 1105743 (0.00095) [2022-07-11 08:04:17,791][25689] Fps is (10 sec: 5610.5, 60 sec: 5553.4, 300 sec: 5567.4). Total num frames: 1132285952. Throughput: 0: 4913.3. Samples: 1132278832. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:17,793][25689] Avg episode reward: [(0, '-1.689')] [2022-07-11 08:04:18,824][26022] Updated weights on worker 0-0, policy_version 1105753 (0.00093) [2022-07-11 08:04:20,386][26022] Updated weights on worker 0-0, policy_version 1105763 (0.00082) [2022-07-11 08:04:22,503][26022] Updated weights on worker 0-0, policy_version 1105773 (0.00090) [2022-07-11 08:04:22,919][25689] Fps is (10 sec: 5551.9, 60 sec: 5502.6, 300 sec: 5558.3). Total num frames: 1132312576. Throughput: 0: 5709.9. Samples: 1132312074. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:22,920][25689] Avg episode reward: [(0, '-1.768')] [2022-07-11 08:04:24,149][26022] Updated weights on worker 0-0, policy_version 1105783 (0.00088) [2022-07-11 08:04:26,072][26022] Updated weights on worker 0-0, policy_version 1105793 (0.00083) [2022-07-11 08:04:27,864][26022] Updated weights on worker 0-0, policy_version 1105803 (0.00083) [2022-07-11 08:04:27,956][25689] Fps is (10 sec: 5540.6, 60 sec: 5535.9, 300 sec: 5562.8). Total num frames: 1132342272. Throughput: 0: 5808.7. Samples: 1132345612. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:27,956][25689] Avg episode reward: [(0, '-1.452')] [2022-07-11 08:04:29,662][26022] Updated weights on worker 0-0, policy_version 1105813 (0.00086) [2022-07-11 08:04:31,865][26022] Updated weights on worker 0-0, policy_version 1105823 (0.00088) [2022-07-11 08:04:33,020][25689] Fps is (10 sec: 5677.1, 60 sec: 5537.4, 300 sec: 5561.9). Total num frames: 1132369920. Throughput: 0: 5776.0. Samples: 1132378728. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2022-07-11 08:04:33,020][25689] Avg episode reward: [(0, '-2.246')] [2022-07-11 08:04:33,528][26022] Updated weights on worker 0-0, policy_version 1105833 (0.00093) [2022-07-11 08:04:35,326][26022] Updated weights on worker 0-0, policy_version 1105843 (0.00091) [2022-07-11 08:04:37,066][26022] Updated weights on worker 0-0, policy_version 1105853 (0.00090) [2022-07-11 08:04:38,025][25689] Fps is (10 sec: 5491.8, 60 sec: 5523.1, 300 sec: 5563.7). Total num frames: 1132397568. Throughput: 0: 5758.1. Samples: 1132395340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:04:38,026][25689] Avg episode reward: [(0, '-0.709')] [2022-07-11 08:04:39,324][26022] Updated weights on worker 0-0, policy_version 1105863 (0.00090) [2022-07-11 08:04:40,817][26022] Updated weights on worker 0-0, policy_version 1105873 (0.00085) [2022-07-11 08:04:42,912][26022] Updated weights on worker 0-0, policy_version 1105883 (0.00086) [2022-07-11 08:04:43,109][25689] Fps is (10 sec: 5683.5, 60 sec: 5554.7, 300 sec: 5559.6). Total num frames: 1132427264. Throughput: 0: 5770.1. Samples: 1132428574. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:04:43,110][25689] Avg episode reward: [(0, '-0.478')] [2022-07-11 08:04:44,480][26022] Updated weights on worker 0-0, policy_version 1105893 (0.00082) [2022-07-11 08:04:46,556][26022] Updated weights on worker 0-0, policy_version 1105903 (0.00100) [2022-07-11 08:04:48,182][25689] Fps is (10 sec: 5545.0, 60 sec: 5516.9, 300 sec: 5555.8). Total num frames: 1132453888. Throughput: 0: 5731.4. Samples: 1132461534. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:04:48,182][25689] Avg episode reward: [(0, '1.021')] [2022-07-11 08:04:48,262][26022] Updated weights on worker 0-0, policy_version 1105913 (0.00086) [2022-07-11 08:04:50,049][26022] Updated weights on worker 0-0, policy_version 1105923 (0.00091) [2022-07-11 08:04:52,157][26022] Updated weights on worker 0-0, policy_version 1105933 (0.00084) [2022-07-11 08:04:53,196][25689] Fps is (10 sec: 5380.5, 60 sec: 5534.4, 300 sec: 5556.7). Total num frames: 1132481536. Throughput: 0: 4939.4. Samples: 1132478386. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:04:53,197][25689] Avg episode reward: [(0, '0.458')] [2022-07-11 08:04:53,756][26022] Updated weights on worker 0-0, policy_version 1105943 (0.00091) [2022-07-11 08:04:55,554][26022] Updated weights on worker 0-0, policy_version 1105953 (0.00087) [2022-07-11 08:04:57,529][26022] Updated weights on worker 0-0, policy_version 1105963 (0.00091) [2022-07-11 08:04:58,236][25689] Fps is (10 sec: 5601.6, 60 sec: 5514.4, 300 sec: 5550.3). Total num frames: 1132510208. Throughput: 0: 5782.4. Samples: 1132512208. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:04:58,236][25689] Avg episode reward: [(0, '-0.042')] [2022-07-11 08:04:59,250][26022] Updated weights on worker 0-0, policy_version 1105973 (0.00095) [2022-07-11 08:05:01,199][26022] Updated weights on worker 0-0, policy_version 1105983 (0.00120) [2022-07-11 08:05:03,310][25689] Fps is (10 sec: 5467.4, 60 sec: 5528.5, 300 sec: 5552.6). Total num frames: 1132536832. Throughput: 0: 5684.5. Samples: 1132543404. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:03,310][25689] Avg episode reward: [(0, '0.750')] [2022-07-11 08:05:03,319][26022] Updated weights on worker 0-0, policy_version 1105993 (0.00084) [2022-07-11 08:05:05,376][26022] Updated weights on worker 0-0, policy_version 1106003 (0.00084) [2022-07-11 08:05:07,046][26022] Updated weights on worker 0-0, policy_version 1106013 (0.00087) [2022-07-11 08:05:08,380][25689] Fps is (10 sec: 5248.9, 60 sec: 5506.1, 300 sec: 5551.9). Total num frames: 1132563456. Throughput: 0: 4879.9. Samples: 1132560102. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:08,381][25689] Avg episode reward: [(0, '0.592')] [2022-07-11 08:05:09,125][26022] Updated weights on worker 0-0, policy_version 1106023 (0.00090) [2022-07-11 08:05:10,677][26022] Updated weights on worker 0-0, policy_version 1106033 (0.00089) [2022-07-11 08:05:12,795][26022] Updated weights on worker 0-0, policy_version 1106043 (0.00089) [2022-07-11 08:05:13,414][25689] Fps is (10 sec: 5574.0, 60 sec: 5555.7, 300 sec: 5555.6). Total num frames: 1132593152. Throughput: 0: 5695.4. Samples: 1132593534. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:13,415][25689] Avg episode reward: [(0, '0.279')] [2022-07-11 08:05:14,358][26022] Updated weights on worker 0-0, policy_version 1106053 (0.00091) [2022-07-11 08:05:16,328][26022] Updated weights on worker 0-0, policy_version 1106063 (0.00083) [2022-07-11 08:05:18,048][26022] Updated weights on worker 0-0, policy_version 1106073 (0.00083) [2022-07-11 08:05:18,424][25689] Fps is (10 sec: 5607.7, 60 sec: 5505.7, 300 sec: 5550.6). Total num frames: 1132619776. Throughput: 0: 5697.0. Samples: 1132627220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:18,424][25689] Avg episode reward: [(0, '0.506')] [2022-07-11 08:05:19,994][26022] Updated weights on worker 0-0, policy_version 1106083 (0.00168) [2022-07-11 08:05:21,663][26022] Updated weights on worker 0-0, policy_version 1106093 (0.00084) [2022-07-11 08:05:23,512][25689] Fps is (10 sec: 5476.0, 60 sec: 5543.1, 300 sec: 5550.7). Total num frames: 1132648448. Throughput: 0: 4994.4. Samples: 1132644300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:23,512][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 08:05:23,615][26022] Updated weights on worker 0-0, policy_version 1106103 (0.00086) [2022-07-11 08:05:25,297][26022] Updated weights on worker 0-0, policy_version 1106113 (0.00091) [2022-07-11 08:05:27,267][26022] Updated weights on worker 0-0, policy_version 1106123 (0.00092) [2022-07-11 08:05:28,524][25689] Fps is (10 sec: 5677.2, 60 sec: 5528.4, 300 sec: 5554.9). Total num frames: 1132677120. Throughput: 0: 5869.3. Samples: 1132678334. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:28,525][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 08:05:29,111][26022] Updated weights on worker 0-0, policy_version 1106133 (0.00051) [2022-07-11 08:05:30,727][26022] Updated weights on worker 0-0, policy_version 1106143 (0.00084) [2022-07-11 08:05:32,680][26022] Updated weights on worker 0-0, policy_version 1106153 (0.00087) [2022-07-11 08:05:33,542][25689] Fps is (10 sec: 5614.9, 60 sec: 5532.6, 300 sec: 5548.6). Total num frames: 1132704768. Throughput: 0: 5877.6. Samples: 1132711840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:33,543][25689] Avg episode reward: [(0, '0.975')] [2022-07-11 08:05:34,369][26022] Updated weights on worker 0-0, policy_version 1106163 (0.00093) [2022-07-11 08:05:36,217][26022] Updated weights on worker 0-0, policy_version 1106173 (0.00094) [2022-07-11 08:05:38,192][26022] Updated weights on worker 0-0, policy_version 1106183 (0.00085) [2022-07-11 08:05:38,578][25689] Fps is (10 sec: 5500.4, 60 sec: 5529.8, 300 sec: 5552.5). Total num frames: 1132732416. Throughput: 0: 5042.0. Samples: 1132728836. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:38,578][25689] Avg episode reward: [(0, '1.003')] [2022-07-11 08:05:39,950][26022] Updated weights on worker 0-0, policy_version 1106193 (0.00093) [2022-07-11 08:05:41,922][26022] Updated weights on worker 0-0, policy_version 1106203 (0.00084) [2022-07-11 08:05:43,342][26022] Updated weights on worker 0-0, policy_version 1106213 (0.00086) [2022-07-11 08:05:43,647][25689] Fps is (10 sec: 5674.8, 60 sec: 5531.2, 300 sec: 5551.6). Total num frames: 1132762112. Throughput: 0: 5879.7. Samples: 1132762690. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:43,648][25689] Avg episode reward: [(0, '-0.041')] [2022-07-11 08:05:45,430][26022] Updated weights on worker 0-0, policy_version 1106223 (0.00092) [2022-07-11 08:05:47,212][26022] Updated weights on worker 0-0, policy_version 1106233 (0.00090) [2022-07-11 08:05:48,193][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:05:48,205][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001106238_1132787712.pth [2022-07-11 08:05:48,205][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001104281_1130783744.pth [2022-07-11 08:05:48,206][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_001106238_1132787712.pth.milestone [2022-07-11 08:05:48,652][25689] Fps is (10 sec: 5692.1, 60 sec: 5554.3, 300 sec: 5555.1). Total num frames: 1132789760. Throughput: 0: 5863.3. Samples: 1132796346. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:48,652][25689] Avg episode reward: [(0, '-0.090')] [2022-07-11 08:05:49,168][26022] Updated weights on worker 0-0, policy_version 1106243 (0.00094) [2022-07-11 08:05:50,819][26022] Updated weights on worker 0-0, policy_version 1106253 (0.01102) [2022-07-11 08:05:52,731][26022] Updated weights on worker 0-0, policy_version 1106263 (0.00090) [2022-07-11 08:05:53,675][25689] Fps is (10 sec: 5514.4, 60 sec: 5553.6, 300 sec: 5554.9). Total num frames: 1132817408. Throughput: 0: 5030.4. Samples: 1132813114. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:53,675][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 08:05:54,549][26022] Updated weights on worker 0-0, policy_version 1106273 (0.00080) [2022-07-11 08:05:56,507][26022] Updated weights on worker 0-0, policy_version 1106283 (0.00081) [2022-07-11 08:05:58,214][26022] Updated weights on worker 0-0, policy_version 1106293 (0.00087) [2022-07-11 08:05:58,681][25689] Fps is (10 sec: 5717.5, 60 sec: 5573.5, 300 sec: 5559.7). Total num frames: 1132847104. Throughput: 0: 5863.1. Samples: 1132846708. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:05:58,682][25689] Avg episode reward: [(0, '-1.710')] [2022-07-11 08:06:00,129][26022] Updated weights on worker 0-0, policy_version 1106303 (0.00065) [2022-07-11 08:06:02,064][26022] Updated weights on worker 0-0, policy_version 1106313 (0.00086) [2022-07-11 08:06:03,816][25689] Fps is (10 sec: 5452.7, 60 sec: 5551.1, 300 sec: 5557.3). Total num frames: 1132872704. Throughput: 0: 5739.0. Samples: 1132878438. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:03,816][25689] Avg episode reward: [(0, '-1.091')] [2022-07-11 08:06:04,145][26022] Updated weights on worker 0-0, policy_version 1106323 (0.00090) [2022-07-11 08:06:05,656][26022] Updated weights on worker 0-0, policy_version 1106333 (0.00089) [2022-07-11 08:06:07,712][26022] Updated weights on worker 0-0, policy_version 1106343 (0.00092) [2022-07-11 08:06:08,827][25689] Fps is (10 sec: 5248.4, 60 sec: 5573.5, 300 sec: 5553.9). Total num frames: 1132900352. Throughput: 0: 4905.9. Samples: 1132895324. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:08,829][25689] Avg episode reward: [(0, '-0.944')] [2022-07-11 08:06:09,423][26022] Updated weights on worker 0-0, policy_version 1106353 (0.00086) [2022-07-11 08:06:11,370][26022] Updated weights on worker 0-0, policy_version 1106363 (0.00093) [2022-07-11 08:06:13,055][26022] Updated weights on worker 0-0, policy_version 1106373 (0.00094) [2022-07-11 08:06:13,858][25689] Fps is (10 sec: 5608.5, 60 sec: 5556.8, 300 sec: 5553.4). Total num frames: 1132929024. Throughput: 0: 5728.7. Samples: 1132928736. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:13,859][25689] Avg episode reward: [(0, '0.036')] [2022-07-11 08:06:15,083][26022] Updated weights on worker 0-0, policy_version 1106383 (0.00097) [2022-07-11 08:06:16,937][26022] Updated weights on worker 0-0, policy_version 1106393 (0.00086) [2022-07-11 08:06:18,607][26022] Updated weights on worker 0-0, policy_version 1106403 (0.00086) [2022-07-11 08:06:18,911][25689] Fps is (10 sec: 5585.2, 60 sec: 5569.7, 300 sec: 5556.6). Total num frames: 1132956672. Throughput: 0: 5712.6. Samples: 1132962270. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:18,911][25689] Avg episode reward: [(0, '-0.042')] [2022-07-11 08:06:20,448][26022] Updated weights on worker 0-0, policy_version 1106413 (0.00090) [2022-07-11 08:06:22,499][26022] Updated weights on worker 0-0, policy_version 1106423 (0.00085) [2022-07-11 08:06:23,993][25689] Fps is (10 sec: 5556.7, 60 sec: 5570.2, 300 sec: 5555.2). Total num frames: 1132985344. Throughput: 0: 4985.1. Samples: 1132979024. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:23,994][25689] Avg episode reward: [(0, '1.400')] [2022-07-11 08:06:24,195][26022] Updated weights on worker 0-0, policy_version 1106433 (0.00091) [2022-07-11 08:06:26,115][26022] Updated weights on worker 0-0, policy_version 1106443 (0.00079) [2022-07-11 08:06:27,744][26022] Updated weights on worker 0-0, policy_version 1106453 (0.00091) [2022-07-11 08:06:29,026][25689] Fps is (10 sec: 5669.2, 60 sec: 5568.4, 300 sec: 5554.8). Total num frames: 1133014016. Throughput: 0: 5797.9. Samples: 1133012434. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:29,026][25689] Avg episode reward: [(0, '1.335')] [2022-07-11 08:06:29,816][26022] Updated weights on worker 0-0, policy_version 1106463 (0.00093) [2022-07-11 08:06:31,652][26022] Updated weights on worker 0-0, policy_version 1106473 (0.00090) [2022-07-11 08:06:33,433][26022] Updated weights on worker 0-0, policy_version 1106483 (0.00089) [2022-07-11 08:06:34,067][25689] Fps is (10 sec: 5590.9, 60 sec: 5566.3, 300 sec: 5551.1). Total num frames: 1133041664. Throughput: 0: 5792.5. Samples: 1133045796. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:34,067][25689] Avg episode reward: [(0, '1.395')] [2022-07-11 08:06:35,403][26022] Updated weights on worker 0-0, policy_version 1106493 (0.00088) [2022-07-11 08:06:36,988][26022] Updated weights on worker 0-0, policy_version 1106503 (0.00100) [2022-07-11 08:06:38,912][26022] Updated weights on worker 0-0, policy_version 1106513 (0.00089) [2022-07-11 08:06:39,079][25689] Fps is (10 sec: 5500.3, 60 sec: 5568.4, 300 sec: 5551.7). Total num frames: 1133069312. Throughput: 0: 4979.3. Samples: 1133062694. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:39,079][25689] Avg episode reward: [(0, '1.520')] [2022-07-11 08:06:40,748][26022] Updated weights on worker 0-0, policy_version 1106523 (0.00081) [2022-07-11 08:06:42,571][26022] Updated weights on worker 0-0, policy_version 1106533 (0.00081) [2022-07-11 08:06:44,195][25689] Fps is (10 sec: 5560.7, 60 sec: 5547.2, 300 sec: 5560.3). Total num frames: 1133097984. Throughput: 0: 5812.6. Samples: 1133096448. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:44,195][25689] Avg episode reward: [(0, '1.235')] [2022-07-11 08:06:44,427][26022] Updated weights on worker 0-0, policy_version 1106543 (0.00081) [2022-07-11 08:06:46,391][26022] Updated weights on worker 0-0, policy_version 1106553 (0.00087) [2022-07-11 08:06:48,040][26022] Updated weights on worker 0-0, policy_version 1106563 (0.00086) [2022-07-11 08:06:49,224][25689] Fps is (10 sec: 5450.7, 60 sec: 5528.1, 300 sec: 5549.5). Total num frames: 1133124608. Throughput: 0: 5815.7. Samples: 1133129900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:49,224][25689] Avg episode reward: [(0, '0.834')] [2022-07-11 08:06:49,911][26022] Updated weights on worker 0-0, policy_version 1106573 (0.00094) [2022-07-11 08:06:51,845][26022] Updated weights on worker 0-0, policy_version 1106583 (0.00087) [2022-07-11 08:06:53,662][26022] Updated weights on worker 0-0, policy_version 1106593 (0.00087) [2022-07-11 08:06:54,235][25689] Fps is (10 sec: 5609.2, 60 sec: 5562.9, 300 sec: 5549.9). Total num frames: 1133154304. Throughput: 0: 5837.8. Samples: 1133163538. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:54,236][25689] Avg episode reward: [(0, '0.642')] [2022-07-11 08:06:55,409][26022] Updated weights on worker 0-0, policy_version 1106603 (0.00087) [2022-07-11 08:06:57,298][26022] Updated weights on worker 0-0, policy_version 1106613 (0.00087) [2022-07-11 08:06:58,999][26022] Updated weights on worker 0-0, policy_version 1106623 (0.00099) [2022-07-11 08:06:59,327][25689] Fps is (10 sec: 5777.3, 60 sec: 5538.3, 300 sec: 5559.5). Total num frames: 1133182976. Throughput: 0: 5807.8. Samples: 1133180290. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:06:59,327][25689] Avg episode reward: [(0, '0.761')] [2022-07-11 08:07:01,021][26022] Updated weights on worker 0-0, policy_version 1106633 (0.00088) [2022-07-11 08:07:03,252][26022] Updated weights on worker 0-0, policy_version 1106643 (0.00091) [2022-07-11 08:07:04,409][25689] Fps is (10 sec: 5234.1, 60 sec: 5526.2, 300 sec: 5547.8). Total num frames: 1133207552. Throughput: 0: 5696.8. Samples: 1133211602. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:04,411][25689] Avg episode reward: [(0, '-0.064')] [2022-07-11 08:07:05,010][26022] Updated weights on worker 0-0, policy_version 1106653 (0.00090) [2022-07-11 08:07:06,758][26022] Updated weights on worker 0-0, policy_version 1106663 (0.00090) [2022-07-11 08:07:08,669][26022] Updated weights on worker 0-0, policy_version 1106673 (0.00499) [2022-07-11 08:07:09,461][25689] Fps is (10 sec: 5456.6, 60 sec: 5573.1, 300 sec: 5554.1). Total num frames: 1133238272. Throughput: 0: 5703.6. Samples: 1133245322. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:09,462][25689] Avg episode reward: [(0, '0.096')] [2022-07-11 08:07:10,579][26022] Updated weights on worker 0-0, policy_version 1106683 (0.00083) [2022-07-11 08:07:12,159][26022] Updated weights on worker 0-0, policy_version 1106693 (0.00080) [2022-07-11 08:07:14,270][26022] Updated weights on worker 0-0, policy_version 1106703 (0.00082) [2022-07-11 08:07:14,475][25689] Fps is (10 sec: 5696.7, 60 sec: 5540.9, 300 sec: 5551.1). Total num frames: 1133264896. Throughput: 0: 4875.9. Samples: 1133262224. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:14,477][25689] Avg episode reward: [(0, '-0.653')] [2022-07-11 08:07:15,958][26022] Updated weights on worker 0-0, policy_version 1106713 (0.00088) [2022-07-11 08:07:17,929][26022] Updated weights on worker 0-0, policy_version 1106723 (0.00088) [2022-07-11 08:07:19,478][25689] Fps is (10 sec: 5417.6, 60 sec: 5545.4, 300 sec: 5552.3). Total num frames: 1133292544. Throughput: 0: 5730.0. Samples: 1133295758. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:19,482][25689] Avg episode reward: [(0, '-0.313')] [2022-07-11 08:07:19,763][26022] Updated weights on worker 0-0, policy_version 1106733 (0.00086) [2022-07-11 08:07:21,366][26022] Updated weights on worker 0-0, policy_version 1106743 (0.00091) [2022-07-11 08:07:23,422][26022] Updated weights on worker 0-0, policy_version 1106753 (0.00090) [2022-07-11 08:07:24,552][25689] Fps is (10 sec: 5690.8, 60 sec: 5563.1, 300 sec: 5555.0). Total num frames: 1133322240. Throughput: 0: 5847.0. Samples: 1133329378. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:24,552][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 08:07:25,180][26022] Updated weights on worker 0-0, policy_version 1106763 (0.00093) [2022-07-11 08:07:27,024][26022] Updated weights on worker 0-0, policy_version 1106773 (0.00097) [2022-07-11 08:07:28,993][26022] Updated weights on worker 0-0, policy_version 1106783 (0.00085) [2022-07-11 08:07:29,596][25689] Fps is (10 sec: 5566.4, 60 sec: 5528.2, 300 sec: 5547.5). Total num frames: 1133348864. Throughput: 0: 4994.7. Samples: 1133345894. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:29,597][25689] Avg episode reward: [(0, '-0.176')] [2022-07-11 08:07:30,662][26022] Updated weights on worker 0-0, policy_version 1106793 (0.00090) [2022-07-11 08:07:32,518][26022] Updated weights on worker 0-0, policy_version 1106803 (0.00091) [2022-07-11 08:07:34,562][26022] Updated weights on worker 0-0, policy_version 1106813 (0.00086) [2022-07-11 08:07:34,620][25689] Fps is (10 sec: 5491.8, 60 sec: 5546.7, 300 sec: 5547.3). Total num frames: 1133377536. Throughput: 0: 5807.6. Samples: 1133379220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:34,621][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 08:07:36,062][26022] Updated weights on worker 0-0, policy_version 1106823 (0.00087) [2022-07-11 08:07:38,101][26022] Updated weights on worker 0-0, policy_version 1106833 (0.00087) [2022-07-11 08:07:39,631][25689] Fps is (10 sec: 5714.5, 60 sec: 5563.7, 300 sec: 5554.7). Total num frames: 1133406208. Throughput: 0: 5808.8. Samples: 1133412822. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:39,632][25689] Avg episode reward: [(0, '0.898')] [2022-07-11 08:07:39,889][26022] Updated weights on worker 0-0, policy_version 1106843 (0.00447) [2022-07-11 08:07:41,706][26022] Updated weights on worker 0-0, policy_version 1106853 (0.00087) [2022-07-11 08:07:43,719][26022] Updated weights on worker 0-0, policy_version 1106863 (0.00097) [2022-07-11 08:07:44,680][25689] Fps is (10 sec: 5496.7, 60 sec: 5536.0, 300 sec: 5550.3). Total num frames: 1133432832. Throughput: 0: 4978.5. Samples: 1133429588. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:44,681][25689] Avg episode reward: [(0, '0.833')] [2022-07-11 08:07:45,201][26022] Updated weights on worker 0-0, policy_version 1106873 (0.00086) [2022-07-11 08:07:47,256][26022] Updated weights on worker 0-0, policy_version 1106883 (0.00088) [2022-07-11 08:07:48,274][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:07:48,289][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001106889_1133454336.pth [2022-07-11 08:07:48,290][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001104935_1131453440.pth [2022-07-11 08:07:48,805][26022] Updated weights on worker 0-0, policy_version 1106893 (0.00090) [2022-07-11 08:07:49,711][25689] Fps is (10 sec: 5384.2, 60 sec: 5552.8, 300 sec: 5540.0). Total num frames: 1133460480. Throughput: 0: 5845.8. Samples: 1133463482. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:49,711][25689] Avg episode reward: [(0, '-0.101')] [2022-07-11 08:07:50,783][26022] Updated weights on worker 0-0, policy_version 1106903 (0.00083) [2022-07-11 08:07:53,012][26022] Updated weights on worker 0-0, policy_version 1106913 (0.00085) [2022-07-11 08:07:54,687][26022] Updated weights on worker 0-0, policy_version 1106923 (0.00086) [2022-07-11 08:07:54,743][25689] Fps is (10 sec: 5597.1, 60 sec: 5534.0, 300 sec: 5543.2). Total num frames: 1133489152. Throughput: 0: 5850.3. Samples: 1133496942. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:54,743][25689] Avg episode reward: [(0, '0.244')] [2022-07-11 08:07:56,402][26022] Updated weights on worker 0-0, policy_version 1106933 (0.00091) [2022-07-11 08:07:58,316][26022] Updated weights on worker 0-0, policy_version 1106943 (0.00091) [2022-07-11 08:07:59,747][25689] Fps is (10 sec: 5815.9, 60 sec: 5558.9, 300 sec: 5548.4). Total num frames: 1133518848. Throughput: 0: 5012.8. Samples: 1133513658. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:07:59,747][25689] Avg episode reward: [(0, '0.320')] [2022-07-11 08:07:59,904][26022] Updated weights on worker 0-0, policy_version 1106953 (0.00089) [2022-07-11 08:08:02,042][26022] Updated weights on worker 0-0, policy_version 1106963 (0.00094) [2022-07-11 08:08:04,140][26022] Updated weights on worker 0-0, policy_version 1106973 (0.00085) [2022-07-11 08:08:04,789][25689] Fps is (10 sec: 5300.0, 60 sec: 5545.6, 300 sec: 5537.5). Total num frames: 1133542400. Throughput: 0: 5747.3. Samples: 1133545162. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:08:04,790][25689] Avg episode reward: [(0, '0.362')] [2022-07-11 08:08:05,787][26022] Updated weights on worker 0-0, policy_version 1106983 (0.00090) [2022-07-11 08:08:07,951][26022] Updated weights on worker 0-0, policy_version 1106993 (0.00080) [2022-07-11 08:08:09,434][26022] Updated weights on worker 0-0, policy_version 1107003 (0.00086) [2022-07-11 08:08:09,791][25689] Fps is (10 sec: 5301.3, 60 sec: 5533.2, 300 sec: 5541.2). Total num frames: 1133572096. Throughput: 0: 5748.7. Samples: 1133578918. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:08:09,792][25689] Avg episode reward: [(0, '-0.075')] [2022-07-11 08:08:11,471][26022] Updated weights on worker 0-0, policy_version 1107013 (0.00082) [2022-07-11 08:08:13,203][26022] Updated weights on worker 0-0, policy_version 1107023 (0.00082) [2022-07-11 08:08:14,804][25689] Fps is (10 sec: 5828.3, 60 sec: 5567.3, 300 sec: 5548.7). Total num frames: 1133600768. Throughput: 0: 4931.1. Samples: 1133595866. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:08:14,804][25689] Avg episode reward: [(0, '-0.017')] [2022-07-11 08:08:15,047][26022] Updated weights on worker 0-0, policy_version 1107033 (0.00093) [2022-07-11 08:08:17,068][26022] Updated weights on worker 0-0, policy_version 1107043 (0.00090) [2022-07-11 08:08:18,773][26022] Updated weights on worker 0-0, policy_version 1107053 (0.00090) [2022-07-11 08:08:19,806][25689] Fps is (10 sec: 5521.5, 60 sec: 5550.5, 300 sec: 5540.7). Total num frames: 1133627392. Throughput: 0: 5774.3. Samples: 1133629484. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:08:19,807][25689] Avg episode reward: [(0, '-0.052')] [2022-07-11 08:08:20,504][26022] Updated weights on worker 0-0, policy_version 1107063 (0.00090) [2022-07-11 08:08:22,468][26022] Updated weights on worker 0-0, policy_version 1107073 (0.00091) [2022-07-11 08:08:24,291][26022] Updated weights on worker 0-0, policy_version 1107083 (0.00096) [2022-07-11 08:08:24,873][25689] Fps is (10 sec: 5491.5, 60 sec: 5534.1, 300 sec: 5543.4). Total num frames: 1133656064. Throughput: 0: 5863.8. Samples: 1133662930. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:08:24,874][25689] Avg episode reward: [(0, '-0.132')] [2022-07-11 08:08:26,212][26022] Updated weights on worker 0-0, policy_version 1107093 (0.00090) [2022-07-11 08:08:27,782][26022] Updated weights on worker 0-0, policy_version 1107103 (0.00088) [2022-07-11 08:08:29,715][26022] Updated weights on worker 0-0, policy_version 1107113 (0.00082) [2022-07-11 08:08:29,909][25689] Fps is (10 sec: 5675.7, 60 sec: 5568.8, 300 sec: 5547.7). Total num frames: 1133684736. Throughput: 0: 5010.3. Samples: 1133679716. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 08:08:29,910][25689] Avg episode reward: [(0, '-0.388')] [2022-07-11 08:08:31,677][26022] Updated weights on worker 0-0, policy_version 1107123 (0.00088) [2022-07-11 08:08:33,421][26022] Updated weights on worker 0-0, policy_version 1107133 (0.00094) [2022-07-11 08:08:34,946][25689] Fps is (10 sec: 5591.4, 60 sec: 5550.7, 300 sec: 5544.2). Total num frames: 1133712384. Throughput: 0: 5824.1. Samples: 1133713176. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:08:34,946][25689] Avg episode reward: [(0, '-2.048')] [2022-07-11 08:08:35,461][26022] Updated weights on worker 0-0, policy_version 1107143 (0.00100) [2022-07-11 08:08:37,118][26022] Updated weights on worker 0-0, policy_version 1107153 (0.00087) [2022-07-11 08:08:39,053][26022] Updated weights on worker 0-0, policy_version 1107163 (0.00089) [2022-07-11 08:08:39,954][25689] Fps is (10 sec: 5403.3, 60 sec: 5517.0, 300 sec: 5541.7). Total num frames: 1133739008. Throughput: 0: 5803.6. Samples: 1133746412. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:08:39,954][25689] Avg episode reward: [(0, '-1.287')] [2022-07-11 08:08:40,819][26022] Updated weights on worker 0-0, policy_version 1107173 (0.00093) [2022-07-11 08:08:42,677][26022] Updated weights on worker 0-0, policy_version 1107183 (0.00085) [2022-07-11 08:08:44,384][26022] Updated weights on worker 0-0, policy_version 1107193 (0.00089) [2022-07-11 08:08:45,013][25689] Fps is (10 sec: 5594.7, 60 sec: 5567.0, 300 sec: 5544.6). Total num frames: 1133768704. Throughput: 0: 4974.0. Samples: 1133763102. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:08:45,013][25689] Avg episode reward: [(0, '-1.337')] [2022-07-11 08:08:46,444][26022] Updated weights on worker 0-0, policy_version 1107203 (0.00064) [2022-07-11 08:08:48,163][26022] Updated weights on worker 0-0, policy_version 1107213 (0.00094) [2022-07-11 08:08:49,937][26022] Updated weights on worker 0-0, policy_version 1107223 (0.00085) [2022-07-11 08:08:50,109][25689] Fps is (10 sec: 5646.5, 60 sec: 5561.0, 300 sec: 5546.7). Total num frames: 1133796352. Throughput: 0: 5779.3. Samples: 1133796458. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:08:50,110][25689] Avg episode reward: [(0, '-1.371')] [2022-07-11 08:08:52,000][26022] Updated weights on worker 0-0, policy_version 1107233 (0.00089) [2022-07-11 08:08:53,787][26022] Updated weights on worker 0-0, policy_version 1107243 (0.00094) [2022-07-11 08:08:55,179][25689] Fps is (10 sec: 5338.1, 60 sec: 5523.5, 300 sec: 5535.1). Total num frames: 1133822976. Throughput: 0: 5741.2. Samples: 1133829340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:08:55,180][25689] Avg episode reward: [(0, '-1.122')] [2022-07-11 08:08:55,695][26022] Updated weights on worker 0-0, policy_version 1107253 (0.00087) [2022-07-11 08:08:57,485][26022] Updated weights on worker 0-0, policy_version 1107263 (0.00430) [2022-07-11 08:08:59,334][26022] Updated weights on worker 0-0, policy_version 1107273 (0.00088) [2022-07-11 08:09:00,202][25689] Fps is (10 sec: 5580.1, 60 sec: 5521.8, 300 sec: 5549.3). Total num frames: 1133852672. Throughput: 0: 4923.3. Samples: 1133846102. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:00,203][25689] Avg episode reward: [(0, '-1.973')] [2022-07-11 08:09:01,314][26022] Updated weights on worker 0-0, policy_version 1107283 (0.00079) [2022-07-11 08:09:03,390][26022] Updated weights on worker 0-0, policy_version 1107293 (0.00099) [2022-07-11 08:09:05,172][26022] Updated weights on worker 0-0, policy_version 1107303 (0.00083) [2022-07-11 08:09:05,258][25689] Fps is (10 sec: 5587.9, 60 sec: 5571.4, 300 sec: 5545.0). Total num frames: 1133879296. Throughput: 0: 5657.1. Samples: 1133877634. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:05,263][25689] Avg episode reward: [(0, '-0.339')] [2022-07-11 08:09:07,013][26022] Updated weights on worker 0-0, policy_version 1107313 (0.00089) [2022-07-11 08:09:08,690][26022] Updated weights on worker 0-0, policy_version 1107323 (0.00085) [2022-07-11 08:09:10,315][25689] Fps is (10 sec: 5265.3, 60 sec: 5515.6, 300 sec: 5544.3). Total num frames: 1133905920. Throughput: 0: 5683.2. Samples: 1133911292. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:10,316][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 08:09:10,907][26022] Updated weights on worker 0-0, policy_version 1107333 (0.00085) [2022-07-11 08:09:12,476][26022] Updated weights on worker 0-0, policy_version 1107343 (0.00086) [2022-07-11 08:09:14,409][26022] Updated weights on worker 0-0, policy_version 1107353 (0.00083) [2022-07-11 08:09:15,342][25689] Fps is (10 sec: 5585.1, 60 sec: 5531.2, 300 sec: 5544.1). Total num frames: 1133935616. Throughput: 0: 4895.4. Samples: 1133928046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:15,343][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 08:09:16,155][26022] Updated weights on worker 0-0, policy_version 1107363 (0.00084) [2022-07-11 08:09:17,966][26022] Updated weights on worker 0-0, policy_version 1107373 (0.00086) [2022-07-11 08:09:19,891][26022] Updated weights on worker 0-0, policy_version 1107383 (0.00088) [2022-07-11 08:09:20,373][25689] Fps is (10 sec: 5701.2, 60 sec: 5545.4, 300 sec: 5549.4). Total num frames: 1133963264. Throughput: 0: 5732.2. Samples: 1133961726. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:20,374][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 08:09:21,720][26022] Updated weights on worker 0-0, policy_version 1107393 (0.00085) [2022-07-11 08:09:23,649][26022] Updated weights on worker 0-0, policy_version 1107403 (0.00085) [2022-07-11 08:09:25,298][26022] Updated weights on worker 0-0, policy_version 1107413 (0.00092) [2022-07-11 08:09:25,496][25689] Fps is (10 sec: 5445.7, 60 sec: 5523.4, 300 sec: 5540.9). Total num frames: 1133990912. Throughput: 0: 5802.0. Samples: 1133995054. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:25,497][25689] Avg episode reward: [(0, '1.869')] [2022-07-11 08:09:27,211][26022] Updated weights on worker 0-0, policy_version 1107423 (0.00085) [2022-07-11 08:09:29,164][26022] Updated weights on worker 0-0, policy_version 1107433 (0.00091) [2022-07-11 08:09:30,558][25689] Fps is (10 sec: 5429.2, 60 sec: 5504.2, 300 sec: 5540.9). Total num frames: 1134018560. Throughput: 0: 5776.3. Samples: 1134028222. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:30,559][25689] Avg episode reward: [(0, '1.891')] [2022-07-11 08:09:30,863][26022] Updated weights on worker 0-0, policy_version 1107443 (0.00088) [2022-07-11 08:09:32,907][26022] Updated weights on worker 0-0, policy_version 1107453 (0.00083) [2022-07-11 08:09:34,475][26022] Updated weights on worker 0-0, policy_version 1107463 (0.00079) [2022-07-11 08:09:35,615][25689] Fps is (10 sec: 5566.0, 60 sec: 5519.3, 300 sec: 5543.4). Total num frames: 1134047232. Throughput: 0: 5766.1. Samples: 1134044940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:35,615][25689] Avg episode reward: [(0, '1.835')] [2022-07-11 08:09:36,706][26022] Updated weights on worker 0-0, policy_version 1107473 (0.01032) [2022-07-11 08:09:38,147][26022] Updated weights on worker 0-0, policy_version 1107483 (0.00074) [2022-07-11 08:09:40,176][26022] Updated weights on worker 0-0, policy_version 1107493 (0.00083) [2022-07-11 08:09:40,669][25689] Fps is (10 sec: 5570.6, 60 sec: 5531.9, 300 sec: 5537.1). Total num frames: 1134074880. Throughput: 0: 5742.5. Samples: 1134078272. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:40,669][25689] Avg episode reward: [(0, '1.814')] [2022-07-11 08:09:42,044][26022] Updated weights on worker 0-0, policy_version 1107503 (0.00086) [2022-07-11 08:09:43,989][26022] Updated weights on worker 0-0, policy_version 1107513 (0.00203) [2022-07-11 08:09:45,711][25689] Fps is (10 sec: 5578.2, 60 sec: 5516.6, 300 sec: 5544.5). Total num frames: 1134103552. Throughput: 0: 5782.1. Samples: 1134111938. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:45,712][25689] Avg episode reward: [(0, '1.765')] [2022-07-11 08:09:45,716][26022] Updated weights on worker 0-0, policy_version 1107523 (0.00091) [2022-07-11 08:09:47,327][26022] Updated weights on worker 0-0, policy_version 1107533 (0.00085) [2022-07-11 08:09:48,483][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:09:48,496][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001107538_1134118912.pth [2022-07-11 08:09:48,497][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001105589_1132123136.pth [2022-07-11 08:09:49,314][26022] Updated weights on worker 0-0, policy_version 1107543 (0.00092) [2022-07-11 08:09:50,727][25689] Fps is (10 sec: 5599.1, 60 sec: 5523.9, 300 sec: 5544.5). Total num frames: 1134131200. Throughput: 0: 4988.4. Samples: 1134128834. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:50,730][25689] Avg episode reward: [(0, '1.344')] [2022-07-11 08:09:51,136][26022] Updated weights on worker 0-0, policy_version 1107553 (0.00094) [2022-07-11 08:09:52,941][26022] Updated weights on worker 0-0, policy_version 1107563 (0.00086) [2022-07-11 08:09:54,831][26022] Updated weights on worker 0-0, policy_version 1107573 (0.00092) [2022-07-11 08:09:55,738][25689] Fps is (10 sec: 5412.9, 60 sec: 5529.4, 300 sec: 5538.2). Total num frames: 1134157824. Throughput: 0: 5832.3. Samples: 1134162302. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:09:55,740][25689] Avg episode reward: [(0, '1.109')] [2022-07-11 08:09:56,454][26022] Updated weights on worker 0-0, policy_version 1107583 (0.00085) [2022-07-11 08:09:58,575][26022] Updated weights on worker 0-0, policy_version 1107593 (0.00087) [2022-07-11 08:10:00,147][26022] Updated weights on worker 0-0, policy_version 1107603 (0.00101) [2022-07-11 08:10:00,838][25689] Fps is (10 sec: 5570.2, 60 sec: 5522.3, 300 sec: 5548.0). Total num frames: 1134187520. Throughput: 0: 5836.5. Samples: 1134195992. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:00,839][25689] Avg episode reward: [(0, '1.068')] [2022-07-11 08:10:02,487][26022] Updated weights on worker 0-0, policy_version 1107613 (0.00087) [2022-07-11 08:10:04,536][26022] Updated weights on worker 0-0, policy_version 1107623 (0.00090) [2022-07-11 08:10:05,910][25689] Fps is (10 sec: 5637.4, 60 sec: 5537.8, 300 sec: 5551.4). Total num frames: 1134215168. Throughput: 0: 4893.6. Samples: 1134210780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:05,910][25689] Avg episode reward: [(0, '1.315')] [2022-07-11 08:10:06,149][26022] Updated weights on worker 0-0, policy_version 1107633 (0.00091) [2022-07-11 08:10:08,073][26022] Updated weights on worker 0-0, policy_version 1107643 (0.00086) [2022-07-11 08:10:09,745][26022] Updated weights on worker 0-0, policy_version 1107653 (0.00083) [2022-07-11 08:10:10,942][25689] Fps is (10 sec: 5371.3, 60 sec: 5540.0, 300 sec: 5541.1). Total num frames: 1134241792. Throughput: 0: 5711.5. Samples: 1134244292. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:10,943][25689] Avg episode reward: [(0, '1.134')] [2022-07-11 08:10:11,582][26022] Updated weights on worker 0-0, policy_version 1107663 (0.00085) [2022-07-11 08:10:13,396][26022] Updated weights on worker 0-0, policy_version 1107673 (0.00079) [2022-07-11 08:10:15,163][26022] Updated weights on worker 0-0, policy_version 1107683 (0.00087) [2022-07-11 08:10:15,999][25689] Fps is (10 sec: 5582.2, 60 sec: 5537.3, 300 sec: 5550.6). Total num frames: 1134271488. Throughput: 0: 5732.4. Samples: 1134278448. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:16,000][25689] Avg episode reward: [(0, '1.347')] [2022-07-11 08:10:17,149][26022] Updated weights on worker 0-0, policy_version 1107693 (0.00087) [2022-07-11 08:10:18,708][26022] Updated weights on worker 0-0, policy_version 1107703 (0.00091) [2022-07-11 08:10:20,859][26022] Updated weights on worker 0-0, policy_version 1107713 (0.00086) [2022-07-11 08:10:21,064][25689] Fps is (10 sec: 5665.2, 60 sec: 5534.1, 300 sec: 5547.5). Total num frames: 1134299136. Throughput: 0: 4913.8. Samples: 1134295380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:21,065][25689] Avg episode reward: [(0, '1.638')] [2022-07-11 08:10:22,405][26022] Updated weights on worker 0-0, policy_version 1107723 (0.00217) [2022-07-11 08:10:24,491][26022] Updated weights on worker 0-0, policy_version 1107733 (0.00087) [2022-07-11 08:10:26,083][26022] Updated weights on worker 0-0, policy_version 1107743 (0.00098) [2022-07-11 08:10:26,132][25689] Fps is (10 sec: 5659.1, 60 sec: 5573.0, 300 sec: 5550.0). Total num frames: 1134328832. Throughput: 0: 5842.9. Samples: 1134328936. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:26,132][25689] Avg episode reward: [(0, '1.561')] [2022-07-11 08:10:28,319][26022] Updated weights on worker 0-0, policy_version 1107753 (0.00093) [2022-07-11 08:10:29,758][26022] Updated weights on worker 0-0, policy_version 1107763 (0.00093) [2022-07-11 08:10:31,187][25689] Fps is (10 sec: 5564.0, 60 sec: 5556.7, 300 sec: 5545.8). Total num frames: 1134355456. Throughput: 0: 5820.0. Samples: 1134362114. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:31,187][25689] Avg episode reward: [(0, '-0.014')] [2022-07-11 08:10:31,794][26022] Updated weights on worker 0-0, policy_version 1107773 (0.00087) [2022-07-11 08:10:33,545][26022] Updated weights on worker 0-0, policy_version 1107783 (0.00089) [2022-07-11 08:10:35,467][26022] Updated weights on worker 0-0, policy_version 1107793 (0.00081) [2022-07-11 08:10:36,192][25689] Fps is (10 sec: 5394.8, 60 sec: 5544.5, 300 sec: 5546.4). Total num frames: 1134383104. Throughput: 0: 4972.4. Samples: 1134378856. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:36,195][25689] Avg episode reward: [(0, '-0.487')] [2022-07-11 08:10:37,262][26022] Updated weights on worker 0-0, policy_version 1107803 (0.00092) [2022-07-11 08:10:39,274][26022] Updated weights on worker 0-0, policy_version 1107813 (0.00085) [2022-07-11 08:10:40,730][26022] Updated weights on worker 0-0, policy_version 1107823 (0.00099) [2022-07-11 08:10:41,221][25689] Fps is (10 sec: 5714.6, 60 sec: 5580.6, 300 sec: 5547.1). Total num frames: 1134412800. Throughput: 0: 5821.3. Samples: 1134412720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:41,222][25689] Avg episode reward: [(0, '-0.434')] [2022-07-11 08:10:42,703][26022] Updated weights on worker 0-0, policy_version 1107833 (0.00089) [2022-07-11 08:10:44,435][26022] Updated weights on worker 0-0, policy_version 1107843 (0.00079) [2022-07-11 08:10:46,315][25689] Fps is (10 sec: 5664.9, 60 sec: 5559.0, 300 sec: 5545.5). Total num frames: 1134440448. Throughput: 0: 5824.7. Samples: 1134446494. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:46,315][25689] Avg episode reward: [(0, '-1.035')] [2022-07-11 08:10:46,366][26022] Updated weights on worker 0-0, policy_version 1107853 (0.00091) [2022-07-11 08:10:48,306][26022] Updated weights on worker 0-0, policy_version 1107863 (0.00080) [2022-07-11 08:10:49,860][26022] Updated weights on worker 0-0, policy_version 1107873 (0.00089) [2022-07-11 08:10:51,321][25689] Fps is (10 sec: 5475.1, 60 sec: 5559.9, 300 sec: 5545.8). Total num frames: 1134468096. Throughput: 0: 5017.0. Samples: 1134463128. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:51,321][25689] Avg episode reward: [(0, '-1.336')] [2022-07-11 08:10:51,964][26022] Updated weights on worker 0-0, policy_version 1107883 (0.00078) [2022-07-11 08:10:53,641][26022] Updated weights on worker 0-0, policy_version 1107893 (0.00092) [2022-07-11 08:10:55,561][26022] Updated weights on worker 0-0, policy_version 1107903 (0.00092) [2022-07-11 08:10:56,324][25689] Fps is (10 sec: 5729.0, 60 sec: 5611.3, 300 sec: 5545.9). Total num frames: 1134497792. Throughput: 0: 5865.9. Samples: 1134496948. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:10:56,325][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 08:10:57,499][26022] Updated weights on worker 0-0, policy_version 1107913 (0.00080) [2022-07-11 08:10:59,256][26022] Updated weights on worker 0-0, policy_version 1107923 (0.00273) [2022-07-11 08:11:01,022][26022] Updated weights on worker 0-0, policy_version 1107933 (0.00091) [2022-07-11 08:11:01,328][25689] Fps is (10 sec: 5627.9, 60 sec: 5569.5, 300 sec: 5551.7). Total num frames: 1134524416. Throughput: 0: 5848.4. Samples: 1134530312. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:01,329][25689] Avg episode reward: [(0, '-1.386')] [2022-07-11 08:11:03,183][26022] Updated weights on worker 0-0, policy_version 1107943 (0.00088) [2022-07-11 08:11:04,943][26022] Updated weights on worker 0-0, policy_version 1107953 (0.00099) [2022-07-11 08:11:06,403][25689] Fps is (10 sec: 5181.6, 60 sec: 5535.3, 300 sec: 5543.7). Total num frames: 1134550016. Throughput: 0: 4901.1. Samples: 1134544948. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:06,403][25689] Avg episode reward: [(0, '-1.225')] [2022-07-11 08:11:07,037][26022] Updated weights on worker 0-0, policy_version 1107963 (0.00087) [2022-07-11 08:11:08,783][26022] Updated weights on worker 0-0, policy_version 1107973 (0.00091) [2022-07-11 08:11:10,555][26022] Updated weights on worker 0-0, policy_version 1107983 (0.00089) [2022-07-11 08:11:11,412][25689] Fps is (10 sec: 5382.2, 60 sec: 5571.3, 300 sec: 5544.1). Total num frames: 1134578688. Throughput: 0: 5739.0. Samples: 1134578430. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:11,413][25689] Avg episode reward: [(0, '-1.665')] [2022-07-11 08:11:12,446][26022] Updated weights on worker 0-0, policy_version 1107993 (0.00085) [2022-07-11 08:11:14,144][26022] Updated weights on worker 0-0, policy_version 1108003 (0.00094) [2022-07-11 08:11:16,177][26022] Updated weights on worker 0-0, policy_version 1108013 (0.00097) [2022-07-11 08:11:16,459][25689] Fps is (10 sec: 5600.8, 60 sec: 5538.4, 300 sec: 5544.2). Total num frames: 1134606336. Throughput: 0: 5700.9. Samples: 1134611732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:16,460][25689] Avg episode reward: [(0, '-0.728')] [2022-07-11 08:11:17,985][26022] Updated weights on worker 0-0, policy_version 1108023 (0.00090) [2022-07-11 08:11:19,693][26022] Updated weights on worker 0-0, policy_version 1108033 (0.00084) [2022-07-11 08:11:21,484][25689] Fps is (10 sec: 5490.0, 60 sec: 5542.1, 300 sec: 5541.8). Total num frames: 1134633984. Throughput: 0: 4882.8. Samples: 1134628728. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:21,486][25689] Avg episode reward: [(0, '-1.449')] [2022-07-11 08:11:21,734][26022] Updated weights on worker 0-0, policy_version 1108043 (0.00092) [2022-07-11 08:11:23,368][26022] Updated weights on worker 0-0, policy_version 1108053 (0.00091) [2022-07-11 08:11:25,254][26022] Updated weights on worker 0-0, policy_version 1108063 (0.00093) [2022-07-11 08:11:26,557][25689] Fps is (10 sec: 5780.1, 60 sec: 5558.5, 300 sec: 5548.0). Total num frames: 1134664704. Throughput: 0: 5836.3. Samples: 1134662572. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:26,557][25689] Avg episode reward: [(0, '-0.295')] [2022-07-11 08:11:27,208][26022] Updated weights on worker 0-0, policy_version 1108073 (0.00086) [2022-07-11 08:11:28,958][26022] Updated weights on worker 0-0, policy_version 1108083 (0.00086) [2022-07-11 08:11:30,998][26022] Updated weights on worker 0-0, policy_version 1108093 (0.00084) [2022-07-11 08:11:31,571][25689] Fps is (10 sec: 5685.1, 60 sec: 5562.3, 300 sec: 5545.0). Total num frames: 1134691328. Throughput: 0: 5829.7. Samples: 1134695950. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:31,571][25689] Avg episode reward: [(0, '-0.715')] [2022-07-11 08:11:32,556][26022] Updated weights on worker 0-0, policy_version 1108103 (0.00087) [2022-07-11 08:11:34,427][26022] Updated weights on worker 0-0, policy_version 1108113 (0.00088) [2022-07-11 08:11:36,231][26022] Updated weights on worker 0-0, policy_version 1108123 (0.00086) [2022-07-11 08:11:36,578][25689] Fps is (10 sec: 5415.9, 60 sec: 5562.1, 300 sec: 5545.1). Total num frames: 1134718976. Throughput: 0: 5029.8. Samples: 1134712926. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:36,578][25689] Avg episode reward: [(0, '-1.450')] [2022-07-11 08:11:37,907][26022] Updated weights on worker 0-0, policy_version 1108133 (0.00088) [2022-07-11 08:11:39,952][26022] Updated weights on worker 0-0, policy_version 1108143 (0.00091) [2022-07-11 08:11:41,582][25689] Fps is (10 sec: 5625.8, 60 sec: 5547.5, 300 sec: 5547.2). Total num frames: 1134747648. Throughput: 0: 5864.0. Samples: 1134746580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:41,582][25689] Avg episode reward: [(0, '-1.843')] [2022-07-11 08:11:41,594][26022] Updated weights on worker 0-0, policy_version 1108153 (0.00084) [2022-07-11 08:11:43,615][26022] Updated weights on worker 0-0, policy_version 1108163 (0.00096) [2022-07-11 08:11:45,424][26022] Updated weights on worker 0-0, policy_version 1108173 (0.00094) [2022-07-11 08:11:46,731][25689] Fps is (10 sec: 5547.1, 60 sec: 5542.4, 300 sec: 5548.4). Total num frames: 1134775296. Throughput: 0: 5809.6. Samples: 1134779774. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:46,731][25689] Avg episode reward: [(0, '-1.796')] [2022-07-11 08:11:47,418][26022] Updated weights on worker 0-0, policy_version 1108183 (0.00091) [2022-07-11 08:11:48,588][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:11:48,597][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001108190_1134786560.pth [2022-07-11 08:11:48,597][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001106238_1132787712.pth [2022-07-11 08:11:49,159][26022] Updated weights on worker 0-0, policy_version 1108193 (0.00089) [2022-07-11 08:11:51,035][26022] Updated weights on worker 0-0, policy_version 1108203 (0.00089) [2022-07-11 08:11:51,748][25689] Fps is (10 sec: 5539.8, 60 sec: 5558.3, 300 sec: 5544.9). Total num frames: 1134803968. Throughput: 0: 5823.4. Samples: 1134813452. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:51,749][25689] Avg episode reward: [(0, '-1.541')] [2022-07-11 08:11:52,538][26022] Updated weights on worker 0-0, policy_version 1108213 (0.00109) [2022-07-11 08:11:54,720][26022] Updated weights on worker 0-0, policy_version 1108223 (0.00084) [2022-07-11 08:11:56,322][26022] Updated weights on worker 0-0, policy_version 1108233 (0.00094) [2022-07-11 08:11:56,776][25689] Fps is (10 sec: 5606.4, 60 sec: 5522.1, 300 sec: 5542.6). Total num frames: 1134831616. Throughput: 0: 5802.3. Samples: 1134830124. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:11:56,777][25689] Avg episode reward: [(0, '-1.810')] [2022-07-11 08:11:58,325][26022] Updated weights on worker 0-0, policy_version 1108243 (0.00086) [2022-07-11 08:12:00,043][26022] Updated weights on worker 0-0, policy_version 1108253 (0.00088) [2022-07-11 08:12:01,781][25689] Fps is (10 sec: 5205.2, 60 sec: 5488.2, 300 sec: 5544.0). Total num frames: 1134856192. Throughput: 0: 5788.6. Samples: 1134863508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:01,782][25689] Avg episode reward: [(0, '-1.383')] [2022-07-11 08:12:02,368][26022] Updated weights on worker 0-0, policy_version 1108263 (0.00089) [2022-07-11 08:12:04,226][26022] Updated weights on worker 0-0, policy_version 1108273 (0.00088) [2022-07-11 08:12:05,918][26022] Updated weights on worker 0-0, policy_version 1108283 (0.00083) [2022-07-11 08:12:06,877][25689] Fps is (10 sec: 5373.4, 60 sec: 5554.0, 300 sec: 5539.8). Total num frames: 1134885888. Throughput: 0: 5723.2. Samples: 1134895072. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:06,877][25689] Avg episode reward: [(0, '-0.486')] [2022-07-11 08:12:07,664][26022] Updated weights on worker 0-0, policy_version 1108293 (0.00085) [2022-07-11 08:12:09,751][26022] Updated weights on worker 0-0, policy_version 1108303 (0.00082) [2022-07-11 08:12:11,485][26022] Updated weights on worker 0-0, policy_version 1108313 (0.00082) [2022-07-11 08:12:11,882][25689] Fps is (10 sec: 5677.0, 60 sec: 5537.4, 300 sec: 5543.4). Total num frames: 1134913536. Throughput: 0: 4886.9. Samples: 1134911846. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:11,883][25689] Avg episode reward: [(0, '-0.467')] [2022-07-11 08:12:13,293][26022] Updated weights on worker 0-0, policy_version 1108323 (0.00087) [2022-07-11 08:12:15,221][26022] Updated weights on worker 0-0, policy_version 1108333 (0.00087) [2022-07-11 08:12:16,831][26022] Updated weights on worker 0-0, policy_version 1108343 (0.00087) [2022-07-11 08:12:16,902][25689] Fps is (10 sec: 5719.7, 60 sec: 5573.7, 300 sec: 5549.9). Total num frames: 1134943232. Throughput: 0: 5739.7. Samples: 1134945642. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:16,903][25689] Avg episode reward: [(0, '-0.499')] [2022-07-11 08:12:18,872][26022] Updated weights on worker 0-0, policy_version 1108353 (0.00086) [2022-07-11 08:12:20,554][26022] Updated weights on worker 0-0, policy_version 1108363 (0.00078) [2022-07-11 08:12:21,935][25689] Fps is (10 sec: 5602.6, 60 sec: 5556.1, 300 sec: 5540.4). Total num frames: 1134969856. Throughput: 0: 5748.1. Samples: 1134979352. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:21,936][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 08:12:22,466][26022] Updated weights on worker 0-0, policy_version 1108373 (0.00090) [2022-07-11 08:12:24,347][26022] Updated weights on worker 0-0, policy_version 1108383 (0.00088) [2022-07-11 08:12:26,108][26022] Updated weights on worker 0-0, policy_version 1108393 (0.00089) [2022-07-11 08:12:27,053][25689] Fps is (10 sec: 5548.4, 60 sec: 5535.0, 300 sec: 5549.3). Total num frames: 1134999552. Throughput: 0: 5008.6. Samples: 1134996128. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:27,053][25689] Avg episode reward: [(0, '1.245')] [2022-07-11 08:12:28,179][26022] Updated weights on worker 0-0, policy_version 1108403 (0.00088) [2022-07-11 08:12:29,748][26022] Updated weights on worker 0-0, policy_version 1108413 (0.00090) [2022-07-11 08:12:31,713][26022] Updated weights on worker 0-0, policy_version 1108423 (0.00086) [2022-07-11 08:12:32,067][25689] Fps is (10 sec: 5558.6, 60 sec: 5535.0, 300 sec: 5542.6). Total num frames: 1135026176. Throughput: 0: 5821.2. Samples: 1135029344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 08:12:32,067][25689] Avg episode reward: [(0, '0.795')] [2022-07-11 08:12:33,481][26022] Updated weights on worker 0-0, policy_version 1108433 (0.00086) [2022-07-11 08:12:35,365][26022] Updated weights on worker 0-0, policy_version 1108443 (0.00087) [2022-07-11 08:12:37,053][26022] Updated weights on worker 0-0, policy_version 1108453 (0.00081) [2022-07-11 08:12:37,153][25689] Fps is (10 sec: 5576.3, 60 sec: 5561.6, 300 sec: 5544.7). Total num frames: 1135055872. Throughput: 0: 5801.8. Samples: 1135063132. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:12:37,153][25689] Avg episode reward: [(0, '1.500')] [2022-07-11 08:12:39,060][26022] Updated weights on worker 0-0, policy_version 1108463 (0.00086) [2022-07-11 08:12:40,698][26022] Updated weights on worker 0-0, policy_version 1108473 (0.00088) [2022-07-11 08:12:42,163][25689] Fps is (10 sec: 5578.3, 60 sec: 5527.2, 300 sec: 5545.4). Total num frames: 1135082496. Throughput: 0: 4980.1. Samples: 1135080092. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:12:42,164][25689] Avg episode reward: [(0, '0.729')] [2022-07-11 08:12:42,716][26022] Updated weights on worker 0-0, policy_version 1108483 (0.00087) [2022-07-11 08:12:44,365][26022] Updated weights on worker 0-0, policy_version 1108493 (0.00089) [2022-07-11 08:12:46,392][26022] Updated weights on worker 0-0, policy_version 1108503 (0.00087) [2022-07-11 08:12:47,249][25689] Fps is (10 sec: 5578.1, 60 sec: 5566.8, 300 sec: 5551.3). Total num frames: 1135112192. Throughput: 0: 5812.9. Samples: 1135113528. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:12:47,250][25689] Avg episode reward: [(0, '0.378')] [2022-07-11 08:12:48,139][26022] Updated weights on worker 0-0, policy_version 1108513 (0.00091) [2022-07-11 08:12:50,038][26022] Updated weights on worker 0-0, policy_version 1108523 (0.00092) [2022-07-11 08:12:51,745][26022] Updated weights on worker 0-0, policy_version 1108533 (0.00086) [2022-07-11 08:12:52,259][25689] Fps is (10 sec: 5680.2, 60 sec: 5550.6, 300 sec: 5548.2). Total num frames: 1135139840. Throughput: 0: 5832.5. Samples: 1135147112. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:12:52,259][25689] Avg episode reward: [(0, '-0.212')] [2022-07-11 08:12:53,697][26022] Updated weights on worker 0-0, policy_version 1108543 (0.00087) [2022-07-11 08:12:55,444][26022] Updated weights on worker 0-0, policy_version 1108553 (0.00088) [2022-07-11 08:12:57,293][25689] Fps is (10 sec: 5403.7, 60 sec: 5533.2, 300 sec: 5537.4). Total num frames: 1135166464. Throughput: 0: 5006.5. Samples: 1135163964. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:12:57,295][25689] Avg episode reward: [(0, '-0.547')] [2022-07-11 08:12:57,504][26022] Updated weights on worker 0-0, policy_version 1108563 (0.00090) [2022-07-11 08:12:58,994][26022] Updated weights on worker 0-0, policy_version 1108573 (0.00086) [2022-07-11 08:13:01,015][26022] Updated weights on worker 0-0, policy_version 1108583 (0.00084) [2022-07-11 08:13:02,326][25689] Fps is (10 sec: 5492.6, 60 sec: 5598.2, 300 sec: 5554.7). Total num frames: 1135195136. Throughput: 0: 5817.9. Samples: 1135197398. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:02,327][25689] Avg episode reward: [(0, '-0.442')] [2022-07-11 08:13:03,340][26022] Updated weights on worker 0-0, policy_version 1108593 (0.00086) [2022-07-11 08:13:05,007][26022] Updated weights on worker 0-0, policy_version 1108603 (0.00092) [2022-07-11 08:13:07,048][26022] Updated weights on worker 0-0, policy_version 1108613 (0.00089) [2022-07-11 08:13:07,445][25689] Fps is (10 sec: 5547.6, 60 sec: 5562.2, 300 sec: 5545.7). Total num frames: 1135222784. Throughput: 0: 5707.8. Samples: 1135228800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:07,446][25689] Avg episode reward: [(0, '-0.349')] [2022-07-11 08:13:08,674][26022] Updated weights on worker 0-0, policy_version 1108623 (0.00084) [2022-07-11 08:13:10,632][26022] Updated weights on worker 0-0, policy_version 1108633 (0.00088) [2022-07-11 08:13:12,364][26022] Updated weights on worker 0-0, policy_version 1108643 (0.00085) [2022-07-11 08:13:12,507][25689] Fps is (10 sec: 5431.5, 60 sec: 5557.1, 300 sec: 5541.3). Total num frames: 1135250432. Throughput: 0: 4866.6. Samples: 1135245652. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:12,507][25689] Avg episode reward: [(0, '0.234')] [2022-07-11 08:13:14,142][26022] Updated weights on worker 0-0, policy_version 1108653 (0.00090) [2022-07-11 08:13:16,074][26022] Updated weights on worker 0-0, policy_version 1108663 (0.00086) [2022-07-11 08:13:17,544][25689] Fps is (10 sec: 5576.9, 60 sec: 5538.6, 300 sec: 5547.5). Total num frames: 1135279104. Throughput: 0: 5701.3. Samples: 1135279422. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:17,544][25689] Avg episode reward: [(0, '0.422')] [2022-07-11 08:13:17,678][26022] Updated weights on worker 0-0, policy_version 1108673 (0.00091) [2022-07-11 08:13:19,643][26022] Updated weights on worker 0-0, policy_version 1108683 (0.00361) [2022-07-11 08:13:21,455][26022] Updated weights on worker 0-0, policy_version 1108693 (0.00087) [2022-07-11 08:13:22,547][25689] Fps is (10 sec: 5711.4, 60 sec: 5575.1, 300 sec: 5548.7). Total num frames: 1135307776. Throughput: 0: 5727.0. Samples: 1135313204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:22,548][25689] Avg episode reward: [(0, '-0.490')] [2022-07-11 08:13:23,279][26022] Updated weights on worker 0-0, policy_version 1108703 (0.00088) [2022-07-11 08:13:25,006][26022] Updated weights on worker 0-0, policy_version 1108713 (0.00087) [2022-07-11 08:13:27,031][26022] Updated weights on worker 0-0, policy_version 1108723 (0.00088) [2022-07-11 08:13:27,589][25689] Fps is (10 sec: 5708.4, 60 sec: 5565.2, 300 sec: 5548.6). Total num frames: 1135336448. Throughput: 0: 5024.7. Samples: 1135330020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:27,591][25689] Avg episode reward: [(0, '-0.048')] [2022-07-11 08:13:28,780][26022] Updated weights on worker 0-0, policy_version 1108733 (0.00088) [2022-07-11 08:13:30,638][26022] Updated weights on worker 0-0, policy_version 1108743 (0.00087) [2022-07-11 08:13:32,402][26022] Updated weights on worker 0-0, policy_version 1108753 (0.00086) [2022-07-11 08:13:32,619][25689] Fps is (10 sec: 5490.3, 60 sec: 5563.8, 300 sec: 5545.3). Total num frames: 1135363072. Throughput: 0: 5865.0. Samples: 1135363608. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:32,619][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 08:13:34,340][26022] Updated weights on worker 0-0, policy_version 1108763 (0.00085) [2022-07-11 08:13:35,990][26022] Updated weights on worker 0-0, policy_version 1108773 (0.00087) [2022-07-11 08:13:37,627][25689] Fps is (10 sec: 5509.1, 60 sec: 5554.0, 300 sec: 5552.2). Total num frames: 1135391744. Throughput: 0: 5883.2. Samples: 1135397572. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:37,627][25689] Avg episode reward: [(0, '0.878')] [2022-07-11 08:13:37,898][26022] Updated weights on worker 0-0, policy_version 1108783 (0.00070) [2022-07-11 08:13:39,735][26022] Updated weights on worker 0-0, policy_version 1108793 (0.00087) [2022-07-11 08:13:41,515][26022] Updated weights on worker 0-0, policy_version 1108803 (0.00087) [2022-07-11 08:13:42,644][25689] Fps is (10 sec: 5720.1, 60 sec: 5587.3, 300 sec: 5549.5). Total num frames: 1135420416. Throughput: 0: 5037.2. Samples: 1135414434. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:42,645][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 08:13:43,537][26022] Updated weights on worker 0-0, policy_version 1108813 (0.00081) [2022-07-11 08:13:45,108][26022] Updated weights on worker 0-0, policy_version 1108823 (0.00089) [2022-07-11 08:13:46,948][26022] Updated weights on worker 0-0, policy_version 1108833 (0.00090) [2022-07-11 08:13:47,687][25689] Fps is (10 sec: 5496.6, 60 sec: 5540.4, 300 sec: 5547.1). Total num frames: 1135447040. Throughput: 0: 5877.3. Samples: 1135448138. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:47,687][25689] Avg episode reward: [(0, '2.050')] [2022-07-11 08:13:48,765][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:13:48,777][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001108842_1135454208.pth [2022-07-11 08:13:48,777][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001106889_1133454336.pth [2022-07-11 08:13:48,876][26022] Updated weights on worker 0-0, policy_version 1108843 (0.00093) [2022-07-11 08:13:50,736][26022] Updated weights on worker 0-0, policy_version 1108853 (0.00094) [2022-07-11 08:13:52,699][25689] Fps is (10 sec: 5397.5, 60 sec: 5540.2, 300 sec: 5551.6). Total num frames: 1135474688. Throughput: 0: 5850.8. Samples: 1135481092. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:52,699][25689] Avg episode reward: [(0, '1.927')] [2022-07-11 08:13:52,705][26022] Updated weights on worker 0-0, policy_version 1108863 (0.00085) [2022-07-11 08:13:54,436][26022] Updated weights on worker 0-0, policy_version 1108873 (0.00092) [2022-07-11 08:13:56,411][26022] Updated weights on worker 0-0, policy_version 1108883 (0.00502) [2022-07-11 08:13:57,707][25689] Fps is (10 sec: 5620.9, 60 sec: 5576.5, 300 sec: 5548.5). Total num frames: 1135503360. Throughput: 0: 4983.5. Samples: 1135497640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:13:57,707][25689] Avg episode reward: [(0, '1.888')] [2022-07-11 08:13:58,228][26022] Updated weights on worker 0-0, policy_version 1108893 (0.00092) [2022-07-11 08:14:00,161][26022] Updated weights on worker 0-0, policy_version 1108903 (0.00094) [2022-07-11 08:14:01,943][26022] Updated weights on worker 0-0, policy_version 1108913 (0.00095) [2022-07-11 08:14:02,709][25689] Fps is (10 sec: 5421.4, 60 sec: 5528.4, 300 sec: 5546.0). Total num frames: 1135528960. Throughput: 0: 5796.8. Samples: 1135530750. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:02,710][25689] Avg episode reward: [(0, '1.872')] [2022-07-11 08:14:04,214][26022] Updated weights on worker 0-0, policy_version 1108923 (0.00099) [2022-07-11 08:14:05,896][26022] Updated weights on worker 0-0, policy_version 1108933 (0.00091) [2022-07-11 08:14:07,784][25689] Fps is (10 sec: 5284.0, 60 sec: 5532.5, 300 sec: 5549.1). Total num frames: 1135556608. Throughput: 0: 5651.5. Samples: 1135561716. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:07,784][25689] Avg episode reward: [(0, '1.648')] [2022-07-11 08:14:07,899][26022] Updated weights on worker 0-0, policy_version 1108943 (0.00085) [2022-07-11 08:14:09,626][26022] Updated weights on worker 0-0, policy_version 1108953 (0.00092) [2022-07-11 08:14:11,689][26022] Updated weights on worker 0-0, policy_version 1108963 (0.00091) [2022-07-11 08:14:12,786][25689] Fps is (10 sec: 5487.3, 60 sec: 5537.9, 300 sec: 5542.7). Total num frames: 1135584256. Throughput: 0: 4847.2. Samples: 1135578462. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:12,787][25689] Avg episode reward: [(0, '1.852')] [2022-07-11 08:14:13,342][26022] Updated weights on worker 0-0, policy_version 1108973 (0.00085) [2022-07-11 08:14:15,376][26022] Updated weights on worker 0-0, policy_version 1108983 (0.00084) [2022-07-11 08:14:16,996][26022] Updated weights on worker 0-0, policy_version 1108993 (0.00080) [2022-07-11 08:14:17,804][25689] Fps is (10 sec: 5518.5, 60 sec: 5522.8, 300 sec: 5542.9). Total num frames: 1135611904. Throughput: 0: 5684.1. Samples: 1135611874. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:17,804][25689] Avg episode reward: [(0, '1.491')] [2022-07-11 08:14:18,939][26022] Updated weights on worker 0-0, policy_version 1109003 (0.00086) [2022-07-11 08:14:20,590][26022] Updated weights on worker 0-0, policy_version 1109013 (0.00094) [2022-07-11 08:14:22,530][26022] Updated weights on worker 0-0, policy_version 1109023 (0.00095) [2022-07-11 08:14:22,835][25689] Fps is (10 sec: 5605.0, 60 sec: 5520.2, 300 sec: 5548.1). Total num frames: 1135640576. Throughput: 0: 5708.7. Samples: 1135645640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:22,835][25689] Avg episode reward: [(0, '1.682')] [2022-07-11 08:14:24,323][26022] Updated weights on worker 0-0, policy_version 1109033 (0.00087) [2022-07-11 08:14:26,249][26022] Updated weights on worker 0-0, policy_version 1109043 (0.00088) [2022-07-11 08:14:27,914][25689] Fps is (10 sec: 5671.7, 60 sec: 5516.8, 300 sec: 5551.2). Total num frames: 1135669248. Throughput: 0: 4999.8. Samples: 1135662364. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:27,915][25689] Avg episode reward: [(0, '1.689')] [2022-07-11 08:14:28,035][26022] Updated weights on worker 0-0, policy_version 1109053 (0.00084) [2022-07-11 08:14:29,971][26022] Updated weights on worker 0-0, policy_version 1109063 (0.00098) [2022-07-11 08:14:31,762][26022] Updated weights on worker 0-0, policy_version 1109073 (0.00092) [2022-07-11 08:14:32,922][25689] Fps is (10 sec: 5481.6, 60 sec: 5518.7, 300 sec: 5545.2). Total num frames: 1135695872. Throughput: 0: 5818.3. Samples: 1135695618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:32,923][25689] Avg episode reward: [(0, '1.428')] [2022-07-11 08:14:33,604][26022] Updated weights on worker 0-0, policy_version 1109083 (0.00092) [2022-07-11 08:14:35,412][26022] Updated weights on worker 0-0, policy_version 1109093 (0.00090) [2022-07-11 08:14:37,347][26022] Updated weights on worker 0-0, policy_version 1109103 (0.00076) [2022-07-11 08:14:37,926][25689] Fps is (10 sec: 5420.9, 60 sec: 5502.2, 300 sec: 5546.2). Total num frames: 1135723520. Throughput: 0: 5832.9. Samples: 1135729244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:37,927][25689] Avg episode reward: [(0, '1.516')] [2022-07-11 08:14:39,176][26022] Updated weights on worker 0-0, policy_version 1109113 (0.00088) [2022-07-11 08:14:41,048][26022] Updated weights on worker 0-0, policy_version 1109123 (0.00091) [2022-07-11 08:14:42,651][26022] Updated weights on worker 0-0, policy_version 1109133 (0.00093) [2022-07-11 08:14:42,946][25689] Fps is (10 sec: 5720.5, 60 sec: 5518.8, 300 sec: 5550.0). Total num frames: 1135753216. Throughput: 0: 4987.6. Samples: 1135745950. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:42,947][25689] Avg episode reward: [(0, '1.312')] [2022-07-11 08:14:44,684][26022] Updated weights on worker 0-0, policy_version 1109143 (0.00088) [2022-07-11 08:14:46,433][26022] Updated weights on worker 0-0, policy_version 1109153 (0.00101) [2022-07-11 08:14:47,985][25689] Fps is (10 sec: 5598.7, 60 sec: 5519.2, 300 sec: 5546.2). Total num frames: 1135779840. Throughput: 0: 5843.8. Samples: 1135779656. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:47,986][25689] Avg episode reward: [(0, '0.988')] [2022-07-11 08:14:48,369][26022] Updated weights on worker 0-0, policy_version 1109163 (0.00087) [2022-07-11 08:14:50,141][26022] Updated weights on worker 0-0, policy_version 1109173 (0.00084) [2022-07-11 08:14:52,091][26022] Updated weights on worker 0-0, policy_version 1109183 (0.00094) [2022-07-11 08:14:53,001][25689] Fps is (10 sec: 5499.6, 60 sec: 5535.8, 300 sec: 5553.0). Total num frames: 1135808512. Throughput: 0: 5836.1. Samples: 1135812800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:53,003][25689] Avg episode reward: [(0, '0.796')] [2022-07-11 08:14:53,656][26022] Updated weights on worker 0-0, policy_version 1109193 (0.00079) [2022-07-11 08:14:55,799][26022] Updated weights on worker 0-0, policy_version 1109203 (0.00102) [2022-07-11 08:14:57,345][26022] Updated weights on worker 0-0, policy_version 1109213 (0.00085) [2022-07-11 08:14:58,028][25689] Fps is (10 sec: 5607.9, 60 sec: 5517.1, 300 sec: 5547.4). Total num frames: 1135836160. Throughput: 0: 4994.6. Samples: 1135829648. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:14:58,029][25689] Avg episode reward: [(0, '-0.069')] [2022-07-11 08:14:59,489][26022] Updated weights on worker 0-0, policy_version 1109223 (0.00096) [2022-07-11 08:15:01,232][26022] Updated weights on worker 0-0, policy_version 1109233 (0.00093) [2022-07-11 08:15:03,035][25689] Fps is (10 sec: 5204.8, 60 sec: 5499.8, 300 sec: 5538.3). Total num frames: 1135860736. Throughput: 0: 5818.6. Samples: 1135862836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:03,037][25689] Avg episode reward: [(0, '0.348')] [2022-07-11 08:15:03,556][26022] Updated weights on worker 0-0, policy_version 1109243 (0.00093) [2022-07-11 08:15:05,350][26022] Updated weights on worker 0-0, policy_version 1109253 (0.00094) [2022-07-11 08:15:07,289][26022] Updated weights on worker 0-0, policy_version 1109263 (0.00092) [2022-07-11 08:15:08,151][25689] Fps is (10 sec: 5361.4, 60 sec: 5529.9, 300 sec: 5547.1). Total num frames: 1135890432. Throughput: 0: 5668.3. Samples: 1135893962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:08,152][25689] Avg episode reward: [(0, '0.569')] [2022-07-11 08:15:09,069][26022] Updated weights on worker 0-0, policy_version 1109273 (0.00087) [2022-07-11 08:15:10,991][26022] Updated weights on worker 0-0, policy_version 1109283 (0.00092) [2022-07-11 08:15:12,765][26022] Updated weights on worker 0-0, policy_version 1109293 (0.00053) [2022-07-11 08:15:13,193][25689] Fps is (10 sec: 5544.3, 60 sec: 5509.3, 300 sec: 5537.0). Total num frames: 1135917056. Throughput: 0: 5647.3. Samples: 1135926830. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:13,193][25689] Avg episode reward: [(0, '0.725')] [2022-07-11 08:15:14,495][26022] Updated weights on worker 0-0, policy_version 1109303 (0.00091) [2022-07-11 08:15:16,551][26022] Updated weights on worker 0-0, policy_version 1109313 (0.00090) [2022-07-11 08:15:18,214][25689] Fps is (10 sec: 5597.1, 60 sec: 5542.9, 300 sec: 5544.7). Total num frames: 1135946752. Throughput: 0: 5642.6. Samples: 1135943544. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:18,214][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 08:15:18,219][26022] Updated weights on worker 0-0, policy_version 1109323 (0.00085) [2022-07-11 08:15:20,248][26022] Updated weights on worker 0-0, policy_version 1109333 (0.00086) [2022-07-11 08:15:22,099][26022] Updated weights on worker 0-0, policy_version 1109343 (0.00095) [2022-07-11 08:15:23,255][25689] Fps is (10 sec: 5597.3, 60 sec: 5508.1, 300 sec: 5534.9). Total num frames: 1135973376. Throughput: 0: 5633.8. Samples: 1135976752. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:23,255][25689] Avg episode reward: [(0, '0.695')] [2022-07-11 08:15:23,892][26022] Updated weights on worker 0-0, policy_version 1109353 (0.00080) [2022-07-11 08:15:25,790][26022] Updated weights on worker 0-0, policy_version 1109363 (0.00095) [2022-07-11 08:15:27,682][26022] Updated weights on worker 0-0, policy_version 1109373 (0.00088) [2022-07-11 08:15:28,320][25689] Fps is (10 sec: 5370.2, 60 sec: 5492.5, 300 sec: 5538.2). Total num frames: 1136001024. Throughput: 0: 5765.3. Samples: 1136010238. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:28,320][25689] Avg episode reward: [(0, '1.201')] [2022-07-11 08:15:29,420][26022] Updated weights on worker 0-0, policy_version 1109383 (0.00086) [2022-07-11 08:15:31,440][26022] Updated weights on worker 0-0, policy_version 1109393 (0.00080) [2022-07-11 08:15:33,007][26022] Updated weights on worker 0-0, policy_version 1109403 (0.00088) [2022-07-11 08:15:33,343][25689] Fps is (10 sec: 5582.7, 60 sec: 5525.0, 300 sec: 5541.3). Total num frames: 1136029696. Throughput: 0: 4961.9. Samples: 1136026814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:33,344][25689] Avg episode reward: [(0, '1.137')] [2022-07-11 08:15:35,072][26022] Updated weights on worker 0-0, policy_version 1109413 (0.00084) [2022-07-11 08:15:36,487][26022] Updated weights on worker 0-0, policy_version 1109423 (0.00086) [2022-07-11 08:15:38,367][25689] Fps is (10 sec: 5503.7, 60 sec: 5506.2, 300 sec: 5531.0). Total num frames: 1136056320. Throughput: 0: 5792.8. Samples: 1136060288. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:38,367][25689] Avg episode reward: [(0, '0.084')] [2022-07-11 08:15:38,883][26022] Updated weights on worker 0-0, policy_version 1109433 (0.00092) [2022-07-11 08:15:40,356][26022] Updated weights on worker 0-0, policy_version 1109443 (0.00090) [2022-07-11 08:15:42,347][26022] Updated weights on worker 0-0, policy_version 1109453 (0.00086) [2022-07-11 08:15:43,399][25689] Fps is (10 sec: 5499.1, 60 sec: 5488.2, 300 sec: 5535.6). Total num frames: 1136084992. Throughput: 0: 5789.7. Samples: 1136093378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:43,400][25689] Avg episode reward: [(0, '-0.963')] [2022-07-11 08:15:44,152][26022] Updated weights on worker 0-0, policy_version 1109463 (0.00088) [2022-07-11 08:15:46,097][26022] Updated weights on worker 0-0, policy_version 1109473 (0.00084) [2022-07-11 08:15:47,921][26022] Updated weights on worker 0-0, policy_version 1109483 (0.00081) [2022-07-11 08:15:48,448][25689] Fps is (10 sec: 5586.6, 60 sec: 5504.2, 300 sec: 5534.8). Total num frames: 1136112640. Throughput: 0: 4949.9. Samples: 1136109872. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:48,448][25689] Avg episode reward: [(0, '-0.936')] [2022-07-11 08:15:48,808][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:15:48,820][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001109487_1136114688.pth [2022-07-11 08:15:48,821][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001107538_1134118912.pth [2022-07-11 08:15:49,745][26022] Updated weights on worker 0-0, policy_version 1109493 (0.00094) [2022-07-11 08:15:51,720][26022] Updated weights on worker 0-0, policy_version 1109503 (0.00086) [2022-07-11 08:15:53,434][26022] Updated weights on worker 0-0, policy_version 1109513 (0.00086) [2022-07-11 08:15:53,466][25689] Fps is (10 sec: 5594.3, 60 sec: 5504.0, 300 sec: 5531.1). Total num frames: 1136141312. Throughput: 0: 5781.5. Samples: 1136143154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:53,475][25689] Avg episode reward: [(0, '-0.637')] [2022-07-11 08:15:55,345][26022] Updated weights on worker 0-0, policy_version 1109523 (0.00086) [2022-07-11 08:15:57,148][26022] Updated weights on worker 0-0, policy_version 1109533 (0.00095) [2022-07-11 08:15:58,479][25689] Fps is (10 sec: 5614.7, 60 sec: 5505.4, 300 sec: 5534.4). Total num frames: 1136168960. Throughput: 0: 5781.8. Samples: 1136176572. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:15:58,480][25689] Avg episode reward: [(0, '-0.653')] [2022-07-11 08:15:58,886][26022] Updated weights on worker 0-0, policy_version 1109543 (0.00092) [2022-07-11 08:16:01,125][26022] Updated weights on worker 0-0, policy_version 1109553 (0.00088) [2022-07-11 08:16:03,168][26022] Updated weights on worker 0-0, policy_version 1109563 (0.00092) [2022-07-11 08:16:03,501][25689] Fps is (10 sec: 5102.1, 60 sec: 5487.0, 300 sec: 5528.5). Total num frames: 1136192512. Throughput: 0: 4956.9. Samples: 1136193024. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:16:03,503][25689] Avg episode reward: [(0, '-0.431')] [2022-07-11 08:16:04,999][26022] Updated weights on worker 0-0, policy_version 1109573 (0.00084) [2022-07-11 08:16:07,031][26022] Updated weights on worker 0-0, policy_version 1109583 (0.00087) [2022-07-11 08:16:08,615][25689] Fps is (10 sec: 5253.2, 60 sec: 5487.2, 300 sec: 5530.0). Total num frames: 1136222208. Throughput: 0: 5658.1. Samples: 1136223980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:16:08,616][25689] Avg episode reward: [(0, '1.117')] [2022-07-11 08:16:08,736][26022] Updated weights on worker 0-0, policy_version 1109593 (0.00092) [2022-07-11 08:16:10,537][26022] Updated weights on worker 0-0, policy_version 1109603 (0.00081) [2022-07-11 08:16:12,415][26022] Updated weights on worker 0-0, policy_version 1109613 (0.00087) [2022-07-11 08:16:13,656][25689] Fps is (10 sec: 5747.8, 60 sec: 5521.1, 300 sec: 5533.5). Total num frames: 1136250880. Throughput: 0: 5661.7. Samples: 1136257464. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:16:13,658][25689] Avg episode reward: [(0, '0.471')] [2022-07-11 08:16:14,127][26022] Updated weights on worker 0-0, policy_version 1109623 (0.00081) [2022-07-11 08:16:16,256][26022] Updated weights on worker 0-0, policy_version 1109633 (0.00099) [2022-07-11 08:16:17,814][26022] Updated weights on worker 0-0, policy_version 1109643 (0.00088) [2022-07-11 08:16:18,670][25689] Fps is (10 sec: 5499.3, 60 sec: 5470.9, 300 sec: 5530.3). Total num frames: 1136277504. Throughput: 0: 4837.0. Samples: 1136274238. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:16:18,672][25689] Avg episode reward: [(0, '0.585')] [2022-07-11 08:16:19,848][26022] Updated weights on worker 0-0, policy_version 1109653 (0.00085) [2022-07-11 08:16:21,527][26022] Updated weights on worker 0-0, policy_version 1109663 (0.00085) [2022-07-11 08:16:23,515][26022] Updated weights on worker 0-0, policy_version 1109673 (0.00089) [2022-07-11 08:16:23,687][25689] Fps is (10 sec: 5614.7, 60 sec: 5524.0, 300 sec: 5527.9). Total num frames: 1136307200. Throughput: 0: 5688.0. Samples: 1136307840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:16:23,688][25689] Avg episode reward: [(0, '-0.731')] [2022-07-11 08:16:25,258][26022] Updated weights on worker 0-0, policy_version 1109683 (0.00089) [2022-07-11 08:16:27,123][26022] Updated weights on worker 0-0, policy_version 1109693 (0.00091) [2022-07-11 08:16:28,766][25689] Fps is (10 sec: 5679.7, 60 sec: 5522.6, 300 sec: 5530.1). Total num frames: 1136334848. Throughput: 0: 5827.3. Samples: 1136341408. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2022-07-11 08:16:28,767][25689] Avg episode reward: [(0, '-0.856')] [2022-07-11 08:16:28,854][26022] Updated weights on worker 0-0, policy_version 1109703 (0.00053) [2022-07-11 08:16:30,724][26022] Updated weights on worker 0-0, policy_version 1109713 (0.00079) [2022-07-11 08:16:32,815][26022] Updated weights on worker 0-0, policy_version 1109723 (0.00086) [2022-07-11 08:16:33,772][25689] Fps is (10 sec: 5381.3, 60 sec: 5490.4, 300 sec: 5526.7). Total num frames: 1136361472. Throughput: 0: 5007.0. Samples: 1136358186. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:16:33,772][25689] Avg episode reward: [(0, '-0.626')] [2022-07-11 08:16:34,435][26022] Updated weights on worker 0-0, policy_version 1109733 (0.00085) [2022-07-11 08:16:36,287][26022] Updated weights on worker 0-0, policy_version 1109743 (0.00086) [2022-07-11 08:16:38,044][26022] Updated weights on worker 0-0, policy_version 1109753 (0.00089) [2022-07-11 08:16:38,788][25689] Fps is (10 sec: 5620.0, 60 sec: 5541.9, 300 sec: 5529.9). Total num frames: 1136391168. Throughput: 0: 5831.1. Samples: 1136391546. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:16:38,788][25689] Avg episode reward: [(0, '-0.529')] [2022-07-11 08:16:40,048][26022] Updated weights on worker 0-0, policy_version 1109763 (0.00089) [2022-07-11 08:16:41,783][26022] Updated weights on worker 0-0, policy_version 1109773 (0.00094) [2022-07-11 08:16:43,718][26022] Updated weights on worker 0-0, policy_version 1109783 (0.00086) [2022-07-11 08:16:43,796][25689] Fps is (10 sec: 5618.6, 60 sec: 5510.2, 300 sec: 5529.1). Total num frames: 1136417792. Throughput: 0: 5835.8. Samples: 1136425194. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:16:43,796][25689] Avg episode reward: [(0, '0.118')] [2022-07-11 08:16:45,482][26022] Updated weights on worker 0-0, policy_version 1109793 (0.00088) [2022-07-11 08:16:47,363][26022] Updated weights on worker 0-0, policy_version 1109803 (0.00089) [2022-07-11 08:16:48,856][25689] Fps is (10 sec: 5594.1, 60 sec: 5543.1, 300 sec: 5531.7). Total num frames: 1136447488. Throughput: 0: 4993.8. Samples: 1136441730. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:16:48,856][25689] Avg episode reward: [(0, '1.404')] [2022-07-11 08:16:48,975][26022] Updated weights on worker 0-0, policy_version 1109813 (0.00083) [2022-07-11 08:16:51,012][26022] Updated weights on worker 0-0, policy_version 1109823 (0.00091) [2022-07-11 08:16:52,863][26022] Updated weights on worker 0-0, policy_version 1109833 (0.00089) [2022-07-11 08:16:53,893][25689] Fps is (10 sec: 5577.6, 60 sec: 5507.4, 300 sec: 5528.1). Total num frames: 1136474112. Throughput: 0: 5820.5. Samples: 1136475302. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:16:53,894][25689] Avg episode reward: [(0, '1.036')] [2022-07-11 08:16:54,719][26022] Updated weights on worker 0-0, policy_version 1109843 (0.00086) [2022-07-11 08:16:56,595][26022] Updated weights on worker 0-0, policy_version 1109853 (0.00085) [2022-07-11 08:16:58,189][26022] Updated weights on worker 0-0, policy_version 1109863 (0.00088) [2022-07-11 08:16:58,919][25689] Fps is (10 sec: 5494.9, 60 sec: 5523.2, 300 sec: 5541.5). Total num frames: 1136502784. Throughput: 0: 5833.3. Samples: 1136508976. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:16:58,920][25689] Avg episode reward: [(0, '0.971')] [2022-07-11 08:17:00,329][26022] Updated weights on worker 0-0, policy_version 1109873 (0.00084) [2022-07-11 08:17:02,407][26022] Updated weights on worker 0-0, policy_version 1109883 (0.00088) [2022-07-11 08:17:03,926][25689] Fps is (10 sec: 5409.5, 60 sec: 5558.5, 300 sec: 5529.4). Total num frames: 1136528384. Throughput: 0: 4977.8. Samples: 1136525398. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:03,927][25689] Avg episode reward: [(0, '0.474')] [2022-07-11 08:17:04,301][26022] Updated weights on worker 0-0, policy_version 1109893 (0.00092) [2022-07-11 08:17:06,215][26022] Updated weights on worker 0-0, policy_version 1109903 (0.00086) [2022-07-11 08:17:07,845][26022] Updated weights on worker 0-0, policy_version 1109913 (0.00056) [2022-07-11 08:17:09,042][25689] Fps is (10 sec: 5361.0, 60 sec: 5541.3, 300 sec: 5530.8). Total num frames: 1136557056. Throughput: 0: 5701.6. Samples: 1136556826. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:09,043][25689] Avg episode reward: [(0, '0.419')] [2022-07-11 08:17:09,834][26022] Updated weights on worker 0-0, policy_version 1109923 (0.00084) [2022-07-11 08:17:11,747][26022] Updated weights on worker 0-0, policy_version 1109933 (0.00088) [2022-07-11 08:17:13,531][26022] Updated weights on worker 0-0, policy_version 1109943 (0.00085) [2022-07-11 08:17:14,097][25689] Fps is (10 sec: 5537.5, 60 sec: 5523.1, 300 sec: 5523.2). Total num frames: 1136584704. Throughput: 0: 5692.8. Samples: 1136590316. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:14,097][25689] Avg episode reward: [(0, '0.633')] [2022-07-11 08:17:15,450][26022] Updated weights on worker 0-0, policy_version 1109953 (0.00088) [2022-07-11 08:17:17,217][26022] Updated weights on worker 0-0, policy_version 1109963 (0.00091) [2022-07-11 08:17:19,040][26022] Updated weights on worker 0-0, policy_version 1109973 (0.00089) [2022-07-11 08:17:19,102][25689] Fps is (10 sec: 5496.7, 60 sec: 5540.9, 300 sec: 5527.2). Total num frames: 1136612352. Throughput: 0: 4846.9. Samples: 1136606802. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:19,103][25689] Avg episode reward: [(0, '0.540')] [2022-07-11 08:17:20,944][26022] Updated weights on worker 0-0, policy_version 1109983 (0.00093) [2022-07-11 08:17:22,627][26022] Updated weights on worker 0-0, policy_version 1109993 (0.00087) [2022-07-11 08:17:24,143][25689] Fps is (10 sec: 5606.2, 60 sec: 5521.8, 300 sec: 5525.2). Total num frames: 1136641024. Throughput: 0: 5706.1. Samples: 1136640756. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:24,143][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 08:17:24,527][26022] Updated weights on worker 0-0, policy_version 1110003 (0.00088) [2022-07-11 08:17:26,253][26022] Updated weights on worker 0-0, policy_version 1110013 (0.00086) [2022-07-11 08:17:28,158][26022] Updated weights on worker 0-0, policy_version 1110023 (0.00088) [2022-07-11 08:17:29,236][25689] Fps is (10 sec: 5557.6, 60 sec: 5520.5, 300 sec: 5527.1). Total num frames: 1136668672. Throughput: 0: 5809.2. Samples: 1136674136. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:29,237][25689] Avg episode reward: [(0, '0.957')] [2022-07-11 08:17:30,088][26022] Updated weights on worker 0-0, policy_version 1110033 (0.00085) [2022-07-11 08:17:31,983][26022] Updated weights on worker 0-0, policy_version 1110043 (0.00085) [2022-07-11 08:17:33,606][26022] Updated weights on worker 0-0, policy_version 1110053 (0.00091) [2022-07-11 08:17:34,282][25689] Fps is (10 sec: 5453.9, 60 sec: 5533.8, 300 sec: 5521.0). Total num frames: 1136696320. Throughput: 0: 4973.3. Samples: 1136690702. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:34,282][25689] Avg episode reward: [(0, '1.079')] [2022-07-11 08:17:35,690][26022] Updated weights on worker 0-0, policy_version 1110063 (0.00088) [2022-07-11 08:17:37,475][26022] Updated weights on worker 0-0, policy_version 1110073 (0.00094) [2022-07-11 08:17:39,260][26022] Updated weights on worker 0-0, policy_version 1110083 (0.00088) [2022-07-11 08:17:39,358][25689] Fps is (10 sec: 5564.4, 60 sec: 5511.4, 300 sec: 5526.7). Total num frames: 1136724992. Throughput: 0: 5795.7. Samples: 1136724196. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:39,359][25689] Avg episode reward: [(0, '0.348')] [2022-07-11 08:17:41,178][26022] Updated weights on worker 0-0, policy_version 1110093 (0.00087) [2022-07-11 08:17:42,943][26022] Updated weights on worker 0-0, policy_version 1110103 (0.00092) [2022-07-11 08:17:44,411][25689] Fps is (10 sec: 5560.1, 60 sec: 5524.2, 300 sec: 5520.4). Total num frames: 1136752640. Throughput: 0: 5768.5. Samples: 1136757672. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:44,411][25689] Avg episode reward: [(0, '-0.441')] [2022-07-11 08:17:44,693][26022] Updated weights on worker 0-0, policy_version 1110113 (0.00092) [2022-07-11 08:17:46,591][26022] Updated weights on worker 0-0, policy_version 1110123 (0.00087) [2022-07-11 08:17:48,668][26022] Updated weights on worker 0-0, policy_version 1110133 (0.00090) [2022-07-11 08:17:48,851][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:17:48,861][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001110135_1136778240.pth [2022-07-11 08:17:48,862][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001108190_1134786560.pth [2022-07-11 08:17:49,561][25689] Fps is (10 sec: 5620.1, 60 sec: 5516.0, 300 sec: 5524.7). Total num frames: 1136782336. Throughput: 0: 5752.7. Samples: 1136791058. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:49,561][25689] Avg episode reward: [(0, '-0.864')] [2022-07-11 08:17:50,323][26022] Updated weights on worker 0-0, policy_version 1110143 (0.00084) [2022-07-11 08:17:52,056][26022] Updated weights on worker 0-0, policy_version 1110153 (0.00090) [2022-07-11 08:17:53,841][26022] Updated weights on worker 0-0, policy_version 1110163 (0.00084) [2022-07-11 08:17:54,651][25689] Fps is (10 sec: 5600.0, 60 sec: 5528.1, 300 sec: 5527.1). Total num frames: 1136809984. Throughput: 0: 5747.3. Samples: 1136807770. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:54,652][25689] Avg episode reward: [(0, '-0.783')] [2022-07-11 08:17:55,680][26022] Updated weights on worker 0-0, policy_version 1110173 (0.00088) [2022-07-11 08:17:57,795][26022] Updated weights on worker 0-0, policy_version 1110183 (0.00086) [2022-07-11 08:17:59,599][26022] Updated weights on worker 0-0, policy_version 1110193 (0.00092) [2022-07-11 08:17:59,696][25689] Fps is (10 sec: 5557.0, 60 sec: 5526.3, 300 sec: 5526.9). Total num frames: 1136838656. Throughput: 0: 5753.2. Samples: 1136841206. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:17:59,696][25689] Avg episode reward: [(0, '-0.698')] [2022-07-11 08:18:01,487][26022] Updated weights on worker 0-0, policy_version 1110203 (0.00093) [2022-07-11 08:18:03,502][26022] Updated weights on worker 0-0, policy_version 1110213 (0.00089) [2022-07-11 08:18:04,712][25689] Fps is (10 sec: 5292.3, 60 sec: 5508.6, 300 sec: 5518.4). Total num frames: 1136863232. Throughput: 0: 5639.9. Samples: 1136872170. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:04,713][25689] Avg episode reward: [(0, '-1.504')] [2022-07-11 08:18:05,565][26022] Updated weights on worker 0-0, policy_version 1110223 (0.00090) [2022-07-11 08:18:07,293][26022] Updated weights on worker 0-0, policy_version 1110233 (0.00089) [2022-07-11 08:18:09,170][26022] Updated weights on worker 0-0, policy_version 1110243 (0.00093) [2022-07-11 08:18:09,840][25689] Fps is (10 sec: 5350.2, 60 sec: 5524.4, 300 sec: 5524.1). Total num frames: 1136892928. Throughput: 0: 4822.2. Samples: 1136888846. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:09,840][25689] Avg episode reward: [(0, '-1.760')] [2022-07-11 08:18:10,928][26022] Updated weights on worker 0-0, policy_version 1110253 (0.00095) [2022-07-11 08:18:12,785][26022] Updated weights on worker 0-0, policy_version 1110263 (0.00094) [2022-07-11 08:18:14,567][26022] Updated weights on worker 0-0, policy_version 1110273 (0.00086) [2022-07-11 08:18:14,865][25689] Fps is (10 sec: 5648.4, 60 sec: 5527.1, 300 sec: 5520.9). Total num frames: 1136920576. Throughput: 0: 5671.4. Samples: 1136922410. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:14,865][25689] Avg episode reward: [(0, '-1.493')] [2022-07-11 08:18:16,511][26022] Updated weights on worker 0-0, policy_version 1110283 (0.00084) [2022-07-11 08:18:18,101][26022] Updated weights on worker 0-0, policy_version 1110293 (0.00082) [2022-07-11 08:18:19,932][25689] Fps is (10 sec: 5479.3, 60 sec: 5521.5, 300 sec: 5516.3). Total num frames: 1136948224. Throughput: 0: 5676.6. Samples: 1136956076. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:19,932][25689] Avg episode reward: [(0, '-1.089')] [2022-07-11 08:18:20,080][26022] Updated weights on worker 0-0, policy_version 1110303 (0.00091) [2022-07-11 08:18:21,869][26022] Updated weights on worker 0-0, policy_version 1110313 (0.00088) [2022-07-11 08:18:23,783][26022] Updated weights on worker 0-0, policy_version 1110323 (0.00090) [2022-07-11 08:18:24,962][25689] Fps is (10 sec: 5577.6, 60 sec: 5522.4, 300 sec: 5516.5). Total num frames: 1136976896. Throughput: 0: 4980.5. Samples: 1136973024. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:24,963][25689] Avg episode reward: [(0, '-1.098')] [2022-07-11 08:18:25,711][26022] Updated weights on worker 0-0, policy_version 1110333 (0.00094) [2022-07-11 08:18:27,466][26022] Updated weights on worker 0-0, policy_version 1110343 (0.00082) [2022-07-11 08:18:29,300][26022] Updated weights on worker 0-0, policy_version 1110353 (0.00091) [2022-07-11 08:18:30,088][25689] Fps is (10 sec: 5747.1, 60 sec: 5553.2, 300 sec: 5525.0). Total num frames: 1137006592. Throughput: 0: 5799.0. Samples: 1137006264. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:30,088][25689] Avg episode reward: [(0, '-0.965')] [2022-07-11 08:18:31,174][26022] Updated weights on worker 0-0, policy_version 1110363 (0.00085) [2022-07-11 08:18:32,967][26022] Updated weights on worker 0-0, policy_version 1110373 (0.00084) [2022-07-11 08:18:35,092][26022] Updated weights on worker 0-0, policy_version 1110383 (0.00087) [2022-07-11 08:18:35,146][25689] Fps is (10 sec: 5530.2, 60 sec: 5535.2, 300 sec: 5517.2). Total num frames: 1137033216. Throughput: 0: 5746.9. Samples: 1137038966. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:35,147][25689] Avg episode reward: [(0, '-1.590')] [2022-07-11 08:18:36,633][26022] Updated weights on worker 0-0, policy_version 1110393 (0.00084) [2022-07-11 08:18:38,713][26022] Updated weights on worker 0-0, policy_version 1110403 (0.00089) [2022-07-11 08:18:40,171][25689] Fps is (10 sec: 5382.4, 60 sec: 5523.0, 300 sec: 5513.6). Total num frames: 1137060864. Throughput: 0: 4924.1. Samples: 1137055736. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:40,171][25689] Avg episode reward: [(0, '-1.481')] [2022-07-11 08:18:40,558][26022] Updated weights on worker 0-0, policy_version 1110413 (0.00090) [2022-07-11 08:18:42,153][26022] Updated weights on worker 0-0, policy_version 1110423 (0.00084) [2022-07-11 08:18:44,390][26022] Updated weights on worker 0-0, policy_version 1110433 (0.00089) [2022-07-11 08:18:45,202][25689] Fps is (10 sec: 5702.8, 60 sec: 5558.7, 300 sec: 5524.2). Total num frames: 1137090560. Throughput: 0: 5747.2. Samples: 1137089344. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:45,202][25689] Avg episode reward: [(0, '-1.512')] [2022-07-11 08:18:45,747][26022] Updated weights on worker 0-0, policy_version 1110443 (0.00094) [2022-07-11 08:18:47,897][26022] Updated weights on worker 0-0, policy_version 1110453 (0.00089) [2022-07-11 08:18:49,562][26022] Updated weights on worker 0-0, policy_version 1110463 (0.00099) [2022-07-11 08:18:50,323][25689] Fps is (10 sec: 5446.8, 60 sec: 5494.0, 300 sec: 5515.2). Total num frames: 1137116160. Throughput: 0: 5762.2. Samples: 1137122864. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:50,323][25689] Avg episode reward: [(0, '-1.866')] [2022-07-11 08:18:51,435][26022] Updated weights on worker 0-0, policy_version 1110473 (0.00087) [2022-07-11 08:18:53,397][26022] Updated weights on worker 0-0, policy_version 1110483 (0.00105) [2022-07-11 08:18:55,247][26022] Updated weights on worker 0-0, policy_version 1110493 (0.00091) [2022-07-11 08:18:55,336][25689] Fps is (10 sec: 5355.5, 60 sec: 5517.8, 300 sec: 5515.2). Total num frames: 1137144832. Throughput: 0: 4967.6. Samples: 1137139258. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:18:55,336][25689] Avg episode reward: [(0, '-2.147')] [2022-07-11 08:18:56,947][26022] Updated weights on worker 0-0, policy_version 1110503 (0.00087) [2022-07-11 08:18:59,002][26022] Updated weights on worker 0-0, policy_version 1110513 (0.00094) [2022-07-11 08:19:00,371][25689] Fps is (10 sec: 5707.2, 60 sec: 5518.7, 300 sec: 5524.9). Total num frames: 1137173504. Throughput: 0: 5790.6. Samples: 1137172704. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:00,371][25689] Avg episode reward: [(0, '-1.440')] [2022-07-11 08:19:00,648][26022] Updated weights on worker 0-0, policy_version 1110523 (0.00089) [2022-07-11 08:19:03,010][26022] Updated weights on worker 0-0, policy_version 1110533 (0.00088) [2022-07-11 08:19:04,740][26022] Updated weights on worker 0-0, policy_version 1110543 (0.00090) [2022-07-11 08:19:05,403][25689] Fps is (10 sec: 5391.0, 60 sec: 5534.2, 300 sec: 5518.8). Total num frames: 1137199104. Throughput: 0: 5675.0. Samples: 1137203986. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:05,403][25689] Avg episode reward: [(0, '-1.121')] [2022-07-11 08:19:06,618][26022] Updated weights on worker 0-0, policy_version 1110553 (0.00086) [2022-07-11 08:19:08,515][26022] Updated weights on worker 0-0, policy_version 1110563 (0.00082) [2022-07-11 08:19:10,346][26022] Updated weights on worker 0-0, policy_version 1110573 (0.00088) [2022-07-11 08:19:10,475][25689] Fps is (10 sec: 5269.6, 60 sec: 5505.4, 300 sec: 5517.5). Total num frames: 1137226752. Throughput: 0: 4862.9. Samples: 1137220862. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:10,476][25689] Avg episode reward: [(0, '0.517')] [2022-07-11 08:19:12,125][26022] Updated weights on worker 0-0, policy_version 1110583 (0.00087) [2022-07-11 08:19:14,280][26022] Updated weights on worker 0-0, policy_version 1110593 (0.00089) [2022-07-11 08:19:15,478][25689] Fps is (10 sec: 5691.9, 60 sec: 5541.3, 300 sec: 5524.7). Total num frames: 1137256448. Throughput: 0: 5697.3. Samples: 1137254012. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:15,478][25689] Avg episode reward: [(0, '-0.430')] [2022-07-11 08:19:15,661][26022] Updated weights on worker 0-0, policy_version 1110603 (0.00089) [2022-07-11 08:19:17,778][26022] Updated weights on worker 0-0, policy_version 1110613 (0.00082) [2022-07-11 08:19:19,536][26022] Updated weights on worker 0-0, policy_version 1110623 (0.00088) [2022-07-11 08:19:20,531][25689] Fps is (10 sec: 5601.0, 60 sec: 5525.6, 300 sec: 5517.4). Total num frames: 1137283072. Throughput: 0: 5708.1. Samples: 1137287780. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:20,531][25689] Avg episode reward: [(0, '-0.480')] [2022-07-11 08:19:21,294][26022] Updated weights on worker 0-0, policy_version 1110633 (0.00088) [2022-07-11 08:19:23,099][26022] Updated weights on worker 0-0, policy_version 1110643 (0.00089) [2022-07-11 08:19:25,084][26022] Updated weights on worker 0-0, policy_version 1110653 (0.00086) [2022-07-11 08:19:25,540][25689] Fps is (10 sec: 5393.7, 60 sec: 5510.7, 300 sec: 5515.2). Total num frames: 1137310720. Throughput: 0: 4999.0. Samples: 1137304650. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:25,540][25689] Avg episode reward: [(0, '-0.709')] [2022-07-11 08:19:26,838][26022] Updated weights on worker 0-0, policy_version 1110663 (0.00117) [2022-07-11 08:19:28,855][26022] Updated weights on worker 0-0, policy_version 1110673 (0.00090) [2022-07-11 08:19:30,284][26022] Updated weights on worker 0-0, policy_version 1110683 (0.00090) [2022-07-11 08:19:30,591][25689] Fps is (10 sec: 5699.8, 60 sec: 5517.4, 300 sec: 5524.7). Total num frames: 1137340416. Throughput: 0: 5830.7. Samples: 1137338152. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:30,592][25689] Avg episode reward: [(0, '-1.061')] [2022-07-11 08:19:32,596][26022] Updated weights on worker 0-0, policy_version 1110693 (0.00081) [2022-07-11 08:19:33,875][26022] Updated weights on worker 0-0, policy_version 1110703 (0.00097) [2022-07-11 08:19:35,610][25689] Fps is (10 sec: 5389.0, 60 sec: 5487.2, 300 sec: 5514.1). Total num frames: 1137364992. Throughput: 0: 5828.0. Samples: 1137371346. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:35,611][25689] Avg episode reward: [(0, '-0.148')] [2022-07-11 08:19:36,101][26022] Updated weights on worker 0-0, policy_version 1110713 (0.00082) [2022-07-11 08:19:37,911][26022] Updated weights on worker 0-0, policy_version 1110723 (0.00085) [2022-07-11 08:19:39,774][26022] Updated weights on worker 0-0, policy_version 1110733 (0.00093) [2022-07-11 08:19:40,641][25689] Fps is (10 sec: 5400.5, 60 sec: 5520.5, 300 sec: 5513.9). Total num frames: 1137394688. Throughput: 0: 5812.4. Samples: 1137404668. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:40,641][25689] Avg episode reward: [(0, '0.365')] [2022-07-11 08:19:41,535][26022] Updated weights on worker 0-0, policy_version 1110743 (0.00093) [2022-07-11 08:19:43,486][26022] Updated weights on worker 0-0, policy_version 1110753 (0.00090) [2022-07-11 08:19:45,213][26022] Updated weights on worker 0-0, policy_version 1110763 (0.00082) [2022-07-11 08:19:45,680][25689] Fps is (10 sec: 5796.6, 60 sec: 5502.8, 300 sec: 5520.8). Total num frames: 1137423360. Throughput: 0: 5798.3. Samples: 1137421428. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:45,680][25689] Avg episode reward: [(0, '0.245')] [2022-07-11 08:19:47,189][26022] Updated weights on worker 0-0, policy_version 1110773 (0.00087) [2022-07-11 08:19:48,934][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:19:48,941][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001110783_1137441792.pth [2022-07-11 08:19:48,941][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001108842_1135454208.pth [2022-07-11 08:19:48,952][26022] Updated weights on worker 0-0, policy_version 1110783 (0.00090) [2022-07-11 08:19:50,789][25689] Fps is (10 sec: 5549.8, 60 sec: 5537.8, 300 sec: 5515.7). Total num frames: 1137451008. Throughput: 0: 5775.7. Samples: 1137454808. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:50,790][25689] Avg episode reward: [(0, '0.004')] [2022-07-11 08:19:50,908][26022] Updated weights on worker 0-0, policy_version 1110793 (0.00090) [2022-07-11 08:19:52,763][26022] Updated weights on worker 0-0, policy_version 1110803 (0.00088) [2022-07-11 08:19:54,479][26022] Updated weights on worker 0-0, policy_version 1110813 (0.00088) [2022-07-11 08:19:55,876][25689] Fps is (10 sec: 5423.4, 60 sec: 5514.1, 300 sec: 5514.6). Total num frames: 1137478656. Throughput: 0: 5773.3. Samples: 1137488344. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:19:55,877][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 08:19:56,432][26022] Updated weights on worker 0-0, policy_version 1110823 (0.00096) [2022-07-11 08:19:58,187][26022] Updated weights on worker 0-0, policy_version 1110833 (0.00087) [2022-07-11 08:20:00,035][26022] Updated weights on worker 0-0, policy_version 1110843 (0.00086) [2022-07-11 08:20:00,938][25689] Fps is (10 sec: 5650.3, 60 sec: 5528.5, 300 sec: 5530.7). Total num frames: 1137508352. Throughput: 0: 4946.4. Samples: 1137505064. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:00,939][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 08:20:02,317][26022] Updated weights on worker 0-0, policy_version 1110853 (0.00116) [2022-07-11 08:20:03,961][26022] Updated weights on worker 0-0, policy_version 1110863 (0.00089) [2022-07-11 08:20:05,911][26022] Updated weights on worker 0-0, policy_version 1110873 (0.00084) [2022-07-11 08:20:06,020][25689] Fps is (10 sec: 5451.1, 60 sec: 5524.0, 300 sec: 5517.6). Total num frames: 1137533952. Throughput: 0: 5656.8. Samples: 1137536486. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:06,020][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 08:20:07,895][26022] Updated weights on worker 0-0, policy_version 1110883 (0.00091) [2022-07-11 08:20:09,799][26022] Updated weights on worker 0-0, policy_version 1110893 (0.00096) [2022-07-11 08:20:11,083][25689] Fps is (10 sec: 5248.6, 60 sec: 5524.8, 300 sec: 5520.6). Total num frames: 1137561600. Throughput: 0: 5647.0. Samples: 1137569408. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:11,084][25689] Avg episode reward: [(0, '0.924')] [2022-07-11 08:20:11,646][26022] Updated weights on worker 0-0, policy_version 1110903 (0.00085) [2022-07-11 08:20:13,328][26022] Updated weights on worker 0-0, policy_version 1110913 (0.00088) [2022-07-11 08:20:15,157][26022] Updated weights on worker 0-0, policy_version 1110923 (0.00086) [2022-07-11 08:20:16,092][25689] Fps is (10 sec: 5591.8, 60 sec: 5507.4, 300 sec: 5517.4). Total num frames: 1137590272. Throughput: 0: 4846.1. Samples: 1137586312. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:16,092][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 08:20:17,156][26022] Updated weights on worker 0-0, policy_version 1110933 (0.00089) [2022-07-11 08:20:18,860][26022] Updated weights on worker 0-0, policy_version 1110943 (0.00091) [2022-07-11 08:20:20,855][26022] Updated weights on worker 0-0, policy_version 1110953 (0.00095) [2022-07-11 08:20:21,097][25689] Fps is (10 sec: 5624.3, 60 sec: 5528.6, 300 sec: 5521.5). Total num frames: 1137617920. Throughput: 0: 5698.3. Samples: 1137619936. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:21,097][25689] Avg episode reward: [(0, '1.335')] [2022-07-11 08:20:22,575][26022] Updated weights on worker 0-0, policy_version 1110963 (0.00090) [2022-07-11 08:20:24,347][26022] Updated weights on worker 0-0, policy_version 1110973 (0.00087) [2022-07-11 08:20:26,114][25689] Fps is (10 sec: 5517.1, 60 sec: 5527.9, 300 sec: 5522.4). Total num frames: 1137645568. Throughput: 0: 5819.6. Samples: 1137653428. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:26,115][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 08:20:26,259][26022] Updated weights on worker 0-0, policy_version 1110983 (0.00093) [2022-07-11 08:20:28,183][26022] Updated weights on worker 0-0, policy_version 1110993 (0.00085) [2022-07-11 08:20:29,852][26022] Updated weights on worker 0-0, policy_version 1111003 (0.00084) [2022-07-11 08:20:31,196][25689] Fps is (10 sec: 5475.3, 60 sec: 5491.4, 300 sec: 5517.9). Total num frames: 1137673216. Throughput: 0: 5005.7. Samples: 1137670086. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 08:20:31,196][25689] Avg episode reward: [(0, '-0.597')] [2022-07-11 08:20:31,758][26022] Updated weights on worker 0-0, policy_version 1111013 (0.00085) [2022-07-11 08:20:33,455][26022] Updated weights on worker 0-0, policy_version 1111023 (0.00086) [2022-07-11 08:20:35,472][26022] Updated weights on worker 0-0, policy_version 1111033 (0.00087) [2022-07-11 08:20:36,197][25689] Fps is (10 sec: 5585.7, 60 sec: 5560.6, 300 sec: 5525.2). Total num frames: 1137701888. Throughput: 0: 5840.7. Samples: 1137703742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:20:36,198][25689] Avg episode reward: [(0, '-0.777')] [2022-07-11 08:20:37,124][26022] Updated weights on worker 0-0, policy_version 1111043 (0.00087) [2022-07-11 08:20:39,219][26022] Updated weights on worker 0-0, policy_version 1111053 (0.00089) [2022-07-11 08:20:40,890][26022] Updated weights on worker 0-0, policy_version 1111063 (0.00104) [2022-07-11 08:20:41,218][25689] Fps is (10 sec: 5517.3, 60 sec: 5510.7, 300 sec: 5518.5). Total num frames: 1137728512. Throughput: 0: 5829.6. Samples: 1137737236. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:20:41,219][25689] Avg episode reward: [(0, '-0.390')] [2022-07-11 08:20:42,766][26022] Updated weights on worker 0-0, policy_version 1111073 (0.00082) [2022-07-11 08:20:44,587][26022] Updated weights on worker 0-0, policy_version 1111083 (0.00084) [2022-07-11 08:20:46,239][25689] Fps is (10 sec: 5608.3, 60 sec: 5529.2, 300 sec: 5525.9). Total num frames: 1137758208. Throughput: 0: 5001.0. Samples: 1137754074. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:20:46,240][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 08:20:46,365][26022] Updated weights on worker 0-0, policy_version 1111093 (0.00091) [2022-07-11 08:20:48,163][26022] Updated weights on worker 0-0, policy_version 1111103 (0.00088) [2022-07-11 08:20:50,067][26022] Updated weights on worker 0-0, policy_version 1111113 (0.00086) [2022-07-11 08:20:51,321][25689] Fps is (10 sec: 5777.3, 60 sec: 5548.7, 300 sec: 5524.7). Total num frames: 1137786880. Throughput: 0: 5851.6. Samples: 1137787850. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:20:51,322][25689] Avg episode reward: [(0, '-0.474')] [2022-07-11 08:20:51,764][26022] Updated weights on worker 0-0, policy_version 1111123 (0.00125) [2022-07-11 08:20:53,572][26022] Updated weights on worker 0-0, policy_version 1111133 (0.00086) [2022-07-11 08:20:55,471][26022] Updated weights on worker 0-0, policy_version 1111143 (0.00093) [2022-07-11 08:20:56,339][25689] Fps is (10 sec: 5576.4, 60 sec: 5555.0, 300 sec: 5524.6). Total num frames: 1137814528. Throughput: 0: 5849.3. Samples: 1137821558. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:20:56,339][25689] Avg episode reward: [(0, '0.153')] [2022-07-11 08:20:57,201][26022] Updated weights on worker 0-0, policy_version 1111153 (0.00076) [2022-07-11 08:20:59,166][26022] Updated weights on worker 0-0, policy_version 1111163 (0.00086) [2022-07-11 08:21:01,006][26022] Updated weights on worker 0-0, policy_version 1111173 (0.00092) [2022-07-11 08:21:01,350][25689] Fps is (10 sec: 5513.8, 60 sec: 5525.8, 300 sec: 5538.6). Total num frames: 1137842176. Throughput: 0: 5031.2. Samples: 1137838522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:01,350][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 08:21:03,198][26022] Updated weights on worker 0-0, policy_version 1111183 (0.00089) [2022-07-11 08:21:04,969][26022] Updated weights on worker 0-0, policy_version 1111193 (0.00093) [2022-07-11 08:21:06,419][25689] Fps is (10 sec: 5485.5, 60 sec: 5560.8, 300 sec: 5532.5). Total num frames: 1137869824. Throughput: 0: 5741.2. Samples: 1137869932. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:06,420][25689] Avg episode reward: [(0, '-0.621')] [2022-07-11 08:21:06,920][26022] Updated weights on worker 0-0, policy_version 1111203 (0.00094) [2022-07-11 08:21:08,930][26022] Updated weights on worker 0-0, policy_version 1111213 (0.00099) [2022-07-11 08:21:10,517][26022] Updated weights on worker 0-0, policy_version 1111223 (0.00088) [2022-07-11 08:21:11,532][25689] Fps is (10 sec: 5330.2, 60 sec: 5539.4, 300 sec: 5524.3). Total num frames: 1137896448. Throughput: 0: 5685.9. Samples: 1137902766. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:11,532][25689] Avg episode reward: [(0, '-0.943')] [2022-07-11 08:21:12,569][26022] Updated weights on worker 0-0, policy_version 1111233 (0.00084) [2022-07-11 08:21:14,215][26022] Updated weights on worker 0-0, policy_version 1111243 (0.00087) [2022-07-11 08:21:16,449][26022] Updated weights on worker 0-0, policy_version 1111253 (0.00082) [2022-07-11 08:21:16,544][25689] Fps is (10 sec: 5360.4, 60 sec: 5522.1, 300 sec: 5527.8). Total num frames: 1137924096. Throughput: 0: 4854.0. Samples: 1137919634. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:16,544][25689] Avg episode reward: [(0, '-0.969')] [2022-07-11 08:21:17,864][26022] Updated weights on worker 0-0, policy_version 1111263 (0.00090) [2022-07-11 08:21:19,906][26022] Updated weights on worker 0-0, policy_version 1111273 (0.00092) [2022-07-11 08:21:21,547][25689] Fps is (10 sec: 5623.3, 60 sec: 5539.2, 300 sec: 5524.6). Total num frames: 1137952768. Throughput: 0: 5668.6. Samples: 1137953012. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:21,549][25689] Avg episode reward: [(0, '-2.710')] [2022-07-11 08:21:21,579][26022] Updated weights on worker 0-0, policy_version 1111283 (0.00087) [2022-07-11 08:21:23,628][26022] Updated weights on worker 0-0, policy_version 1111293 (0.00089) [2022-07-11 08:21:25,200][26022] Updated weights on worker 0-0, policy_version 1111303 (0.00093) [2022-07-11 08:21:26,557][25689] Fps is (10 sec: 5624.6, 60 sec: 5539.9, 300 sec: 5525.9). Total num frames: 1137980416. Throughput: 0: 5795.8. Samples: 1137986646. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:26,558][25689] Avg episode reward: [(0, '-3.495')] [2022-07-11 08:21:27,227][26022] Updated weights on worker 0-0, policy_version 1111313 (0.00087) [2022-07-11 08:21:29,027][26022] Updated weights on worker 0-0, policy_version 1111323 (0.00085) [2022-07-11 08:21:30,956][26022] Updated weights on worker 0-0, policy_version 1111333 (0.00105) [2022-07-11 08:21:31,605][25689] Fps is (10 sec: 5497.8, 60 sec: 5543.0, 300 sec: 5528.6). Total num frames: 1138008064. Throughput: 0: 4999.5. Samples: 1138003122. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:31,605][25689] Avg episode reward: [(0, '-2.674')] [2022-07-11 08:21:32,731][26022] Updated weights on worker 0-0, policy_version 1111343 (0.00095) [2022-07-11 08:21:34,710][26022] Updated weights on worker 0-0, policy_version 1111353 (0.00830) [2022-07-11 08:21:36,363][26022] Updated weights on worker 0-0, policy_version 1111363 (0.00083) [2022-07-11 08:21:36,620][25689] Fps is (10 sec: 5596.7, 60 sec: 5541.7, 300 sec: 5525.2). Total num frames: 1138036736. Throughput: 0: 5817.3. Samples: 1138036424. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:36,621][25689] Avg episode reward: [(0, '-2.364')] [2022-07-11 08:21:38,326][26022] Updated weights on worker 0-0, policy_version 1111373 (0.00090) [2022-07-11 08:21:40,139][26022] Updated weights on worker 0-0, policy_version 1111383 (0.00089) [2022-07-11 08:21:41,631][25689] Fps is (10 sec: 5515.0, 60 sec: 5542.6, 300 sec: 5525.1). Total num frames: 1138063360. Throughput: 0: 5810.1. Samples: 1138069704. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:41,633][25689] Avg episode reward: [(0, '-1.367')] [2022-07-11 08:21:42,130][26022] Updated weights on worker 0-0, policy_version 1111393 (0.00086) [2022-07-11 08:21:43,707][26022] Updated weights on worker 0-0, policy_version 1111403 (0.00089) [2022-07-11 08:21:45,673][26022] Updated weights on worker 0-0, policy_version 1111413 (0.00084) [2022-07-11 08:21:46,647][25689] Fps is (10 sec: 5412.5, 60 sec: 5509.2, 300 sec: 5519.0). Total num frames: 1138091008. Throughput: 0: 4981.8. Samples: 1138086732. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:46,649][25689] Avg episode reward: [(0, '-0.503')] [2022-07-11 08:21:47,370][26022] Updated weights on worker 0-0, policy_version 1111423 (0.00094) [2022-07-11 08:21:49,228][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:21:49,242][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001111432_1138106368.pth [2022-07-11 08:21:49,242][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001109487_1136114688.pth [2022-07-11 08:21:49,348][26022] Updated weights on worker 0-0, policy_version 1111433 (0.00093) [2022-07-11 08:21:51,178][26022] Updated weights on worker 0-0, policy_version 1111443 (0.00089) [2022-07-11 08:21:51,783][25689] Fps is (10 sec: 5648.8, 60 sec: 5521.2, 300 sec: 5527.5). Total num frames: 1138120704. Throughput: 0: 5791.9. Samples: 1138119990. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:51,784][25689] Avg episode reward: [(0, '-0.383')] [2022-07-11 08:21:53,142][26022] Updated weights on worker 0-0, policy_version 1111453 (0.00093) [2022-07-11 08:21:54,703][26022] Updated weights on worker 0-0, policy_version 1111463 (0.00086) [2022-07-11 08:21:56,809][25689] Fps is (10 sec: 5643.3, 60 sec: 5520.5, 300 sec: 5524.1). Total num frames: 1138148352. Throughput: 0: 5795.7. Samples: 1138153430. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:21:56,809][25689] Avg episode reward: [(0, '-0.445')] [2022-07-11 08:21:56,815][26022] Updated weights on worker 0-0, policy_version 1111473 (0.00083) [2022-07-11 08:21:58,409][26022] Updated weights on worker 0-0, policy_version 1111483 (0.00100) [2022-07-11 08:22:00,372][26022] Updated weights on worker 0-0, policy_version 1111493 (0.00089) [2022-07-11 08:22:01,831][25689] Fps is (10 sec: 5401.2, 60 sec: 5502.5, 300 sec: 5527.2). Total num frames: 1138174976. Throughput: 0: 4979.8. Samples: 1138170294. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:01,831][25689] Avg episode reward: [(0, '-0.133')] [2022-07-11 08:22:02,397][26022] Updated weights on worker 0-0, policy_version 1111503 (0.01249) [2022-07-11 08:22:04,485][26022] Updated weights on worker 0-0, policy_version 1111513 (0.00085) [2022-07-11 08:22:06,146][26022] Updated weights on worker 0-0, policy_version 1111523 (0.00083) [2022-07-11 08:22:06,856][25689] Fps is (10 sec: 5503.3, 60 sec: 5523.5, 300 sec: 5528.9). Total num frames: 1138203648. Throughput: 0: 5710.9. Samples: 1138202144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:06,858][25689] Avg episode reward: [(0, '-0.313')] [2022-07-11 08:22:08,003][26022] Updated weights on worker 0-0, policy_version 1111533 (0.00084) [2022-07-11 08:22:09,870][26022] Updated weights on worker 0-0, policy_version 1111543 (0.00614) [2022-07-11 08:22:11,789][26022] Updated weights on worker 0-0, policy_version 1111553 (0.00089) [2022-07-11 08:22:11,988][25689] Fps is (10 sec: 5544.8, 60 sec: 5538.7, 300 sec: 5527.5). Total num frames: 1138231296. Throughput: 0: 5706.9. Samples: 1138235300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:11,989][25689] Avg episode reward: [(0, '-0.701')] [2022-07-11 08:22:13,610][26022] Updated weights on worker 0-0, policy_version 1111563 (0.00084) [2022-07-11 08:22:15,413][26022] Updated weights on worker 0-0, policy_version 1111573 (0.00093) [2022-07-11 08:22:16,999][25689] Fps is (10 sec: 5552.6, 60 sec: 5555.7, 300 sec: 5530.8). Total num frames: 1138259968. Throughput: 0: 5724.8. Samples: 1138269018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:17,000][25689] Avg episode reward: [(0, '-0.831')] [2022-07-11 08:22:17,270][26022] Updated weights on worker 0-0, policy_version 1111583 (0.00093) [2022-07-11 08:22:18,995][26022] Updated weights on worker 0-0, policy_version 1111593 (0.00089) [2022-07-11 08:22:20,919][26022] Updated weights on worker 0-0, policy_version 1111603 (0.00088) [2022-07-11 08:22:22,071][25689] Fps is (10 sec: 5585.9, 60 sec: 5532.5, 300 sec: 5526.8). Total num frames: 1138287616. Throughput: 0: 5704.9. Samples: 1138285760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:22,071][25689] Avg episode reward: [(0, '0.280')] [2022-07-11 08:22:22,745][26022] Updated weights on worker 0-0, policy_version 1111613 (0.00090) [2022-07-11 08:22:24,612][26022] Updated weights on worker 0-0, policy_version 1111623 (0.00087) [2022-07-11 08:22:26,394][26022] Updated weights on worker 0-0, policy_version 1111633 (0.00085) [2022-07-11 08:22:27,082][25689] Fps is (10 sec: 5484.1, 60 sec: 5532.4, 300 sec: 5528.3). Total num frames: 1138315264. Throughput: 0: 5801.1. Samples: 1138319476. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:27,083][25689] Avg episode reward: [(0, '0.080')] [2022-07-11 08:22:28,129][26022] Updated weights on worker 0-0, policy_version 1111643 (0.00086) [2022-07-11 08:22:30,002][26022] Updated weights on worker 0-0, policy_version 1111653 (0.00087) [2022-07-11 08:22:31,834][26022] Updated weights on worker 0-0, policy_version 1111663 (0.00083) [2022-07-11 08:22:32,144][25689] Fps is (10 sec: 5591.1, 60 sec: 5548.0, 300 sec: 5531.5). Total num frames: 1138343936. Throughput: 0: 5842.2. Samples: 1138353054. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:32,144][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 08:22:33,763][26022] Updated weights on worker 0-0, policy_version 1111673 (0.00087) [2022-07-11 08:22:35,543][26022] Updated weights on worker 0-0, policy_version 1111683 (0.00084) [2022-07-11 08:22:37,180][25689] Fps is (10 sec: 5678.7, 60 sec: 5546.1, 300 sec: 5532.2). Total num frames: 1138372608. Throughput: 0: 4996.1. Samples: 1138369844. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:37,181][25689] Avg episode reward: [(0, '0.314')] [2022-07-11 08:22:37,247][26022] Updated weights on worker 0-0, policy_version 1111693 (0.00084) [2022-07-11 08:22:39,104][26022] Updated weights on worker 0-0, policy_version 1111703 (0.00086) [2022-07-11 08:22:41,096][26022] Updated weights on worker 0-0, policy_version 1111713 (0.00086) [2022-07-11 08:22:42,219][25689] Fps is (10 sec: 5590.2, 60 sec: 5560.5, 300 sec: 5532.5). Total num frames: 1138400256. Throughput: 0: 5846.5. Samples: 1138403556. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:42,219][25689] Avg episode reward: [(0, '0.577')] [2022-07-11 08:22:42,841][26022] Updated weights on worker 0-0, policy_version 1111723 (0.00209) [2022-07-11 08:22:44,646][26022] Updated weights on worker 0-0, policy_version 1111733 (0.00083) [2022-07-11 08:22:46,608][26022] Updated weights on worker 0-0, policy_version 1111743 (0.00085) [2022-07-11 08:22:47,242][25689] Fps is (10 sec: 5495.5, 60 sec: 5559.8, 300 sec: 5527.9). Total num frames: 1138427904. Throughput: 0: 5841.0. Samples: 1138437232. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:47,243][25689] Avg episode reward: [(0, '-0.448')] [2022-07-11 08:22:48,232][26022] Updated weights on worker 0-0, policy_version 1111753 (0.00100) [2022-07-11 08:22:50,307][26022] Updated weights on worker 0-0, policy_version 1111763 (0.00089) [2022-07-11 08:22:51,952][26022] Updated weights on worker 0-0, policy_version 1111773 (0.00086) [2022-07-11 08:22:52,300][25689] Fps is (10 sec: 5687.9, 60 sec: 5566.9, 300 sec: 5535.4). Total num frames: 1138457600. Throughput: 0: 5002.5. Samples: 1138453886. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:52,301][25689] Avg episode reward: [(0, '-0.436')] [2022-07-11 08:22:53,851][26022] Updated weights on worker 0-0, policy_version 1111783 (0.00086) [2022-07-11 08:22:55,686][26022] Updated weights on worker 0-0, policy_version 1111793 (0.00092) [2022-07-11 08:22:57,306][25689] Fps is (10 sec: 5698.2, 60 sec: 5568.8, 300 sec: 5532.7). Total num frames: 1138485248. Throughput: 0: 5854.5. Samples: 1138487670. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:22:57,307][25689] Avg episode reward: [(0, '-1.033')] [2022-07-11 08:22:57,587][26022] Updated weights on worker 0-0, policy_version 1111803 (0.00089) [2022-07-11 08:22:59,304][26022] Updated weights on worker 0-0, policy_version 1111813 (0.00093) [2022-07-11 08:23:01,037][26022] Updated weights on worker 0-0, policy_version 1111823 (0.00087) [2022-07-11 08:23:02,315][25689] Fps is (10 sec: 5419.5, 60 sec: 5570.0, 300 sec: 5539.7). Total num frames: 1138511872. Throughput: 0: 5826.0. Samples: 1138520636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:02,316][25689] Avg episode reward: [(0, '-2.354')] [2022-07-11 08:23:03,407][26022] Updated weights on worker 0-0, policy_version 1111833 (0.00087) [2022-07-11 08:23:05,224][26022] Updated weights on worker 0-0, policy_version 1111843 (0.00094) [2022-07-11 08:23:07,175][26022] Updated weights on worker 0-0, policy_version 1111853 (0.00081) [2022-07-11 08:23:07,334][25689] Fps is (10 sec: 5310.1, 60 sec: 5536.7, 300 sec: 5531.4). Total num frames: 1138538496. Throughput: 0: 4904.7. Samples: 1138535774. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:07,335][25689] Avg episode reward: [(0, '-2.196')] [2022-07-11 08:23:08,961][26022] Updated weights on worker 0-0, policy_version 1111863 (0.00089) [2022-07-11 08:23:10,628][26022] Updated weights on worker 0-0, policy_version 1111873 (0.00090) [2022-07-11 08:23:12,423][25689] Fps is (10 sec: 5369.2, 60 sec: 5540.7, 300 sec: 5530.2). Total num frames: 1138566144. Throughput: 0: 5727.7. Samples: 1138569142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:12,423][25689] Avg episode reward: [(0, '-0.633')] [2022-07-11 08:23:12,587][26022] Updated weights on worker 0-0, policy_version 1111883 (0.00094) [2022-07-11 08:23:14,503][26022] Updated weights on worker 0-0, policy_version 1111893 (0.00091) [2022-07-11 08:23:16,325][26022] Updated weights on worker 0-0, policy_version 1111903 (0.00091) [2022-07-11 08:23:17,435][25689] Fps is (10 sec: 5575.5, 60 sec: 5540.6, 300 sec: 5534.7). Total num frames: 1138594816. Throughput: 0: 5718.7. Samples: 1138602782. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:17,436][25689] Avg episode reward: [(0, '-0.680')] [2022-07-11 08:23:18,139][26022] Updated weights on worker 0-0, policy_version 1111913 (0.00081) [2022-07-11 08:23:20,099][26022] Updated weights on worker 0-0, policy_version 1111923 (0.00081) [2022-07-11 08:23:21,775][26022] Updated weights on worker 0-0, policy_version 1111933 (0.00078) [2022-07-11 08:23:22,438][25689] Fps is (10 sec: 5623.7, 60 sec: 5546.9, 300 sec: 5531.8). Total num frames: 1138622464. Throughput: 0: 4906.6. Samples: 1138619370. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:22,443][25689] Avg episode reward: [(0, '-1.644')] [2022-07-11 08:23:23,838][26022] Updated weights on worker 0-0, policy_version 1111943 (0.00089) [2022-07-11 08:23:25,350][26022] Updated weights on worker 0-0, policy_version 1111953 (0.00092) [2022-07-11 08:23:27,447][25689] Fps is (10 sec: 5420.9, 60 sec: 5530.1, 300 sec: 5523.6). Total num frames: 1138649088. Throughput: 0: 5822.2. Samples: 1138652876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:27,449][25689] Avg episode reward: [(0, '-0.456')] [2022-07-11 08:23:27,452][26022] Updated weights on worker 0-0, policy_version 1111963 (0.00082) [2022-07-11 08:23:29,016][26022] Updated weights on worker 0-0, policy_version 1111973 (0.00334) [2022-07-11 08:23:31,022][26022] Updated weights on worker 0-0, policy_version 1111983 (0.00085) [2022-07-11 08:23:32,544][25689] Fps is (10 sec: 5673.8, 60 sec: 5560.8, 300 sec: 5536.7). Total num frames: 1138679808. Throughput: 0: 5828.6. Samples: 1138686422. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:32,546][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 08:23:32,555][26022] Updated weights on worker 0-0, policy_version 1111993 (0.00091) [2022-07-11 08:23:34,697][26022] Updated weights on worker 0-0, policy_version 1112003 (0.01477) [2022-07-11 08:23:36,582][26022] Updated weights on worker 0-0, policy_version 1112013 (0.00087) [2022-07-11 08:23:37,630][25689] Fps is (10 sec: 5731.7, 60 sec: 5539.3, 300 sec: 5535.5). Total num frames: 1138707456. Throughput: 0: 4970.5. Samples: 1138703162. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:37,631][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 08:23:38,408][26022] Updated weights on worker 0-0, policy_version 1112023 (0.00067) [2022-07-11 08:23:40,040][26022] Updated weights on worker 0-0, policy_version 1112033 (0.00088) [2022-07-11 08:23:42,080][26022] Updated weights on worker 0-0, policy_version 1112043 (0.00088) [2022-07-11 08:23:42,688][25689] Fps is (10 sec: 5451.3, 60 sec: 5537.5, 300 sec: 5528.1). Total num frames: 1138735104. Throughput: 0: 5790.3. Samples: 1138736626. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:42,688][25689] Avg episode reward: [(0, '0.038')] [2022-07-11 08:23:43,829][26022] Updated weights on worker 0-0, policy_version 1112053 (0.00089) [2022-07-11 08:23:45,725][26022] Updated weights on worker 0-0, policy_version 1112063 (0.00049) [2022-07-11 08:23:47,709][25689] Fps is (10 sec: 5486.5, 60 sec: 5537.8, 300 sec: 5536.9). Total num frames: 1138762752. Throughput: 0: 5771.7. Samples: 1138769822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:47,709][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 08:23:47,711][26022] Updated weights on worker 0-0, policy_version 1112073 (0.00083) [2022-07-11 08:23:49,365][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:23:49,387][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001112083_1138772992.pth [2022-07-11 08:23:49,387][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001110135_1136778240.pth [2022-07-11 08:23:49,391][26022] Updated weights on worker 0-0, policy_version 1112083 (0.00090) [2022-07-11 08:23:51,425][26022] Updated weights on worker 0-0, policy_version 1112093 (0.00086) [2022-07-11 08:23:52,803][25689] Fps is (10 sec: 5568.0, 60 sec: 5517.5, 300 sec: 5535.4). Total num frames: 1138791424. Throughput: 0: 4941.1. Samples: 1138786522. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:52,803][25689] Avg episode reward: [(0, '0.685')] [2022-07-11 08:23:53,003][26022] Updated weights on worker 0-0, policy_version 1112103 (0.00081) [2022-07-11 08:23:54,859][26022] Updated weights on worker 0-0, policy_version 1112113 (0.00088) [2022-07-11 08:23:56,794][26022] Updated weights on worker 0-0, policy_version 1112123 (0.00081) [2022-07-11 08:23:57,883][25689] Fps is (10 sec: 5635.9, 60 sec: 5527.6, 300 sec: 5534.5). Total num frames: 1138820096. Throughput: 0: 5756.1. Samples: 1138819742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:23:57,884][25689] Avg episode reward: [(0, '0.194')] [2022-07-11 08:23:58,483][26022] Updated weights on worker 0-0, policy_version 1112133 (0.00459) [2022-07-11 08:24:00,487][26022] Updated weights on worker 0-0, policy_version 1112143 (0.00085) [2022-07-11 08:24:02,534][26022] Updated weights on worker 0-0, policy_version 1112153 (0.00092) [2022-07-11 08:24:02,893][25689] Fps is (10 sec: 5378.6, 60 sec: 5510.6, 300 sec: 5534.9). Total num frames: 1138845696. Throughput: 0: 5708.5. Samples: 1138851968. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:02,894][25689] Avg episode reward: [(0, '-0.016')] [2022-07-11 08:24:04,362][26022] Updated weights on worker 0-0, policy_version 1112163 (0.00088) [2022-07-11 08:24:06,399][26022] Updated weights on worker 0-0, policy_version 1112173 (0.00085) [2022-07-11 08:24:07,948][25689] Fps is (10 sec: 5290.6, 60 sec: 5524.2, 300 sec: 5535.3). Total num frames: 1138873344. Throughput: 0: 4851.6. Samples: 1138868016. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:07,949][25689] Avg episode reward: [(0, '0.303')] [2022-07-11 08:24:08,125][26022] Updated weights on worker 0-0, policy_version 1112183 (0.00089) [2022-07-11 08:24:09,947][26022] Updated weights on worker 0-0, policy_version 1112193 (0.00094) [2022-07-11 08:24:12,017][26022] Updated weights on worker 0-0, policy_version 1112203 (0.00088) [2022-07-11 08:24:13,024][25689] Fps is (10 sec: 5559.4, 60 sec: 5542.3, 300 sec: 5530.5). Total num frames: 1138902016. Throughput: 0: 5661.9. Samples: 1138901012. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:13,024][25689] Avg episode reward: [(0, '0.078')] [2022-07-11 08:24:13,754][26022] Updated weights on worker 0-0, policy_version 1112213 (0.00094) [2022-07-11 08:24:15,576][26022] Updated weights on worker 0-0, policy_version 1112223 (0.00088) [2022-07-11 08:24:17,459][26022] Updated weights on worker 0-0, policy_version 1112233 (0.00082) [2022-07-11 08:24:18,080][25689] Fps is (10 sec: 5457.5, 60 sec: 5504.6, 300 sec: 5530.4). Total num frames: 1138928640. Throughput: 0: 5674.1. Samples: 1138934340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:18,081][25689] Avg episode reward: [(0, '-0.038')] [2022-07-11 08:24:19,291][26022] Updated weights on worker 0-0, policy_version 1112243 (0.00097) [2022-07-11 08:24:21,105][26022] Updated weights on worker 0-0, policy_version 1112253 (0.00088) [2022-07-11 08:24:22,983][26022] Updated weights on worker 0-0, policy_version 1112263 (0.00086) [2022-07-11 08:24:23,098][25689] Fps is (10 sec: 5590.7, 60 sec: 5536.9, 300 sec: 5537.1). Total num frames: 1138958336. Throughput: 0: 4905.3. Samples: 1138951076. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:23,098][25689] Avg episode reward: [(0, '-0.541')] [2022-07-11 08:24:24,923][26022] Updated weights on worker 0-0, policy_version 1112273 (0.00089) [2022-07-11 08:24:26,762][26022] Updated weights on worker 0-0, policy_version 1112283 (0.00093) [2022-07-11 08:24:28,137][25689] Fps is (10 sec: 5600.0, 60 sec: 5534.2, 300 sec: 5527.0). Total num frames: 1138984960. Throughput: 0: 5776.4. Samples: 1138984638. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:28,139][25689] Avg episode reward: [(0, '0.179')] [2022-07-11 08:24:28,545][26022] Updated weights on worker 0-0, policy_version 1112293 (0.00089) [2022-07-11 08:24:30,317][26022] Updated weights on worker 0-0, policy_version 1112303 (0.00088) [2022-07-11 08:24:32,177][26022] Updated weights on worker 0-0, policy_version 1112313 (0.00086) [2022-07-11 08:24:33,224][25689] Fps is (10 sec: 5561.9, 60 sec: 5518.3, 300 sec: 5543.0). Total num frames: 1139014656. Throughput: 0: 5799.3. Samples: 1139018158. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 08:24:33,224][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 08:24:34,084][26022] Updated weights on worker 0-0, policy_version 1112323 (0.00060) [2022-07-11 08:24:35,902][26022] Updated weights on worker 0-0, policy_version 1112333 (0.00092) [2022-07-11 08:24:37,567][26022] Updated weights on worker 0-0, policy_version 1112343 (0.00083) [2022-07-11 08:24:38,239][25689] Fps is (10 sec: 5676.4, 60 sec: 5524.7, 300 sec: 5536.4). Total num frames: 1139042304. Throughput: 0: 4991.2. Samples: 1139034960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:24:38,240][25689] Avg episode reward: [(0, '0.105')] [2022-07-11 08:24:39,504][26022] Updated weights on worker 0-0, policy_version 1112353 (0.00077) [2022-07-11 08:24:41,237][26022] Updated weights on worker 0-0, policy_version 1112363 (0.00085) [2022-07-11 08:24:43,126][26022] Updated weights on worker 0-0, policy_version 1112373 (0.00090) [2022-07-11 08:24:43,255][25689] Fps is (10 sec: 5512.5, 60 sec: 5528.6, 300 sec: 5533.4). Total num frames: 1139069952. Throughput: 0: 5842.7. Samples: 1139068848. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:24:43,255][25689] Avg episode reward: [(0, '-0.455')] [2022-07-11 08:24:44,882][26022] Updated weights on worker 0-0, policy_version 1112383 (0.00084) [2022-07-11 08:24:46,866][26022] Updated weights on worker 0-0, policy_version 1112393 (0.00085) [2022-07-11 08:24:48,282][25689] Fps is (10 sec: 5506.2, 60 sec: 5528.0, 300 sec: 5534.9). Total num frames: 1139097600. Throughput: 0: 5834.0. Samples: 1139102162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:24:48,282][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 08:24:48,621][26022] Updated weights on worker 0-0, policy_version 1112403 (0.00083) [2022-07-11 08:24:50,471][26022] Updated weights on worker 0-0, policy_version 1112413 (0.00085) [2022-07-11 08:24:52,301][26022] Updated weights on worker 0-0, policy_version 1112423 (0.00087) [2022-07-11 08:24:53,316][25689] Fps is (10 sec: 5597.5, 60 sec: 5533.4, 300 sec: 5539.3). Total num frames: 1139126272. Throughput: 0: 5033.8. Samples: 1139119300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:24:53,317][25689] Avg episode reward: [(0, '0.241')] [2022-07-11 08:24:54,098][26022] Updated weights on worker 0-0, policy_version 1112433 (0.00090) [2022-07-11 08:24:55,940][26022] Updated weights on worker 0-0, policy_version 1112443 (0.00091) [2022-07-11 08:24:57,879][26022] Updated weights on worker 0-0, policy_version 1112453 (0.00084) [2022-07-11 08:24:58,339][25689] Fps is (10 sec: 5701.7, 60 sec: 5538.7, 300 sec: 5536.6). Total num frames: 1139154944. Throughput: 0: 5872.3. Samples: 1139152992. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:24:58,340][25689] Avg episode reward: [(0, '-1.006')] [2022-07-11 08:24:59,584][26022] Updated weights on worker 0-0, policy_version 1112463 (0.00093) [2022-07-11 08:25:01,357][26022] Updated weights on worker 0-0, policy_version 1112473 (0.00084) [2022-07-11 08:25:03,343][25689] Fps is (10 sec: 5412.6, 60 sec: 5539.3, 300 sec: 5538.0). Total num frames: 1139180544. Throughput: 0: 5765.0. Samples: 1139184658. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:03,344][25689] Avg episode reward: [(0, '-0.719')] [2022-07-11 08:25:03,691][26022] Updated weights on worker 0-0, policy_version 1112483 (0.00481) [2022-07-11 08:25:05,506][26022] Updated weights on worker 0-0, policy_version 1112493 (0.00086) [2022-07-11 08:25:07,432][26022] Updated weights on worker 0-0, policy_version 1112503 (0.00088) [2022-07-11 08:25:08,379][25689] Fps is (10 sec: 5405.8, 60 sec: 5558.0, 300 sec: 5542.0). Total num frames: 1139209216. Throughput: 0: 5773.1. Samples: 1139218182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:08,379][25689] Avg episode reward: [(0, '-0.568')] [2022-07-11 08:25:09,196][26022] Updated weights on worker 0-0, policy_version 1112513 (0.00093) [2022-07-11 08:25:10,992][26022] Updated weights on worker 0-0, policy_version 1112523 (0.00088) [2022-07-11 08:25:12,928][26022] Updated weights on worker 0-0, policy_version 1112533 (0.00090) [2022-07-11 08:25:13,451][25689] Fps is (10 sec: 5470.9, 60 sec: 5524.4, 300 sec: 5533.9). Total num frames: 1139235840. Throughput: 0: 5744.1. Samples: 1139234954. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:13,451][25689] Avg episode reward: [(0, '-0.009')] [2022-07-11 08:25:14,563][26022] Updated weights on worker 0-0, policy_version 1112543 (0.00087) [2022-07-11 08:25:16,576][26022] Updated weights on worker 0-0, policy_version 1112553 (0.00092) [2022-07-11 08:25:18,226][26022] Updated weights on worker 0-0, policy_version 1112563 (0.00050) [2022-07-11 08:25:18,458][25689] Fps is (10 sec: 5587.5, 60 sec: 5579.8, 300 sec: 5540.8). Total num frames: 1139265536. Throughput: 0: 5743.0. Samples: 1139268536. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:18,459][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 08:25:20,211][26022] Updated weights on worker 0-0, policy_version 1112573 (0.00085) [2022-07-11 08:25:21,714][26022] Updated weights on worker 0-0, policy_version 1112583 (0.00085) [2022-07-11 08:25:23,463][25689] Fps is (10 sec: 5625.3, 60 sec: 5530.1, 300 sec: 5537.6). Total num frames: 1139292160. Throughput: 0: 5847.5. Samples: 1139302306. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:23,463][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 08:25:23,890][26022] Updated weights on worker 0-0, policy_version 1112593 (0.00091) [2022-07-11 08:25:25,509][26022] Updated weights on worker 0-0, policy_version 1112603 (0.00086) [2022-07-11 08:25:27,440][26022] Updated weights on worker 0-0, policy_version 1112613 (0.00087) [2022-07-11 08:25:28,468][25689] Fps is (10 sec: 5626.5, 60 sec: 5584.2, 300 sec: 5545.9). Total num frames: 1139321856. Throughput: 0: 5032.1. Samples: 1139319274. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:28,469][25689] Avg episode reward: [(0, '0.618')] [2022-07-11 08:25:29,327][26022] Updated weights on worker 0-0, policy_version 1112623 (0.00215) [2022-07-11 08:25:31,004][26022] Updated weights on worker 0-0, policy_version 1112633 (0.00090) [2022-07-11 08:25:33,006][26022] Updated weights on worker 0-0, policy_version 1112643 (0.00090) [2022-07-11 08:25:33,505][25689] Fps is (10 sec: 5608.0, 60 sec: 5537.7, 300 sec: 5538.3). Total num frames: 1139348480. Throughput: 0: 5870.2. Samples: 1139352680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:33,506][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 08:25:34,767][26022] Updated weights on worker 0-0, policy_version 1112653 (0.00453) [2022-07-11 08:25:36,618][26022] Updated weights on worker 0-0, policy_version 1112663 (0.00092) [2022-07-11 08:25:38,471][26022] Updated weights on worker 0-0, policy_version 1112673 (0.00096) [2022-07-11 08:25:38,540][25689] Fps is (10 sec: 5489.9, 60 sec: 5552.9, 300 sec: 5545.0). Total num frames: 1139377152. Throughput: 0: 5863.9. Samples: 1139386294. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:38,541][25689] Avg episode reward: [(0, '0.218')] [2022-07-11 08:25:40,112][26022] Updated weights on worker 0-0, policy_version 1112683 (0.00089) [2022-07-11 08:25:42,178][26022] Updated weights on worker 0-0, policy_version 1112693 (0.00085) [2022-07-11 08:25:43,571][25689] Fps is (10 sec: 5697.0, 60 sec: 5568.5, 300 sec: 5541.3). Total num frames: 1139405824. Throughput: 0: 5015.3. Samples: 1139403156. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:43,573][25689] Avg episode reward: [(0, '0.243')] [2022-07-11 08:25:43,907][26022] Updated weights on worker 0-0, policy_version 1112703 (0.00103) [2022-07-11 08:25:45,654][26022] Updated weights on worker 0-0, policy_version 1112713 (0.00087) [2022-07-11 08:25:47,495][26022] Updated weights on worker 0-0, policy_version 1112723 (0.00088) [2022-07-11 08:25:48,582][25689] Fps is (10 sec: 5608.4, 60 sec: 5570.0, 300 sec: 5539.2). Total num frames: 1139433472. Throughput: 0: 5836.2. Samples: 1139436666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:48,583][25689] Avg episode reward: [(0, '0.205')] [2022-07-11 08:25:49,360][26022] Updated weights on worker 0-0, policy_version 1112733 (0.00092) [2022-07-11 08:25:49,598][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:25:49,612][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001112734_1139439616.pth [2022-07-11 08:25:49,613][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001110783_1137441792.pth [2022-07-11 08:25:51,298][26022] Updated weights on worker 0-0, policy_version 1112743 (0.00087) [2022-07-11 08:25:53,275][26022] Updated weights on worker 0-0, policy_version 1112753 (0.00620) [2022-07-11 08:25:53,626][25689] Fps is (10 sec: 5499.3, 60 sec: 5552.1, 300 sec: 5538.7). Total num frames: 1139461120. Throughput: 0: 5820.7. Samples: 1139469798. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:53,627][25689] Avg episode reward: [(0, '-0.196')] [2022-07-11 08:25:54,894][26022] Updated weights on worker 0-0, policy_version 1112763 (0.00084) [2022-07-11 08:25:56,855][26022] Updated weights on worker 0-0, policy_version 1112773 (0.00092) [2022-07-11 08:25:58,630][25689] Fps is (10 sec: 5503.2, 60 sec: 5536.9, 300 sec: 5538.8). Total num frames: 1139488768. Throughput: 0: 4981.3. Samples: 1139486368. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:25:58,635][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 08:25:58,724][26022] Updated weights on worker 0-0, policy_version 1112783 (0.00103) [2022-07-11 08:26:00,662][26022] Updated weights on worker 0-0, policy_version 1112793 (0.00090) [2022-07-11 08:26:02,807][26022] Updated weights on worker 0-0, policy_version 1112803 (0.00086) [2022-07-11 08:26:03,670][25689] Fps is (10 sec: 5301.4, 60 sec: 5533.6, 300 sec: 5532.5). Total num frames: 1139514368. Throughput: 0: 5704.8. Samples: 1139517818. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:03,671][25689] Avg episode reward: [(0, '0.169')] [2022-07-11 08:26:04,703][26022] Updated weights on worker 0-0, policy_version 1112813 (0.00090) [2022-07-11 08:26:06,375][26022] Updated weights on worker 0-0, policy_version 1112823 (0.00083) [2022-07-11 08:26:08,298][26022] Updated weights on worker 0-0, policy_version 1112833 (0.00087) [2022-07-11 08:26:08,681][25689] Fps is (10 sec: 5298.2, 60 sec: 5518.9, 300 sec: 5537.9). Total num frames: 1139542016. Throughput: 0: 5712.0. Samples: 1139551466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:08,682][25689] Avg episode reward: [(0, '-0.734')] [2022-07-11 08:26:10,021][26022] Updated weights on worker 0-0, policy_version 1112843 (0.00089) [2022-07-11 08:26:12,103][26022] Updated weights on worker 0-0, policy_version 1112853 (0.00086) [2022-07-11 08:26:13,724][25689] Fps is (10 sec: 5703.6, 60 sec: 5572.4, 300 sec: 5544.2). Total num frames: 1139571712. Throughput: 0: 4904.0. Samples: 1139568360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:13,726][25689] Avg episode reward: [(0, '-1.085')] [2022-07-11 08:26:13,733][26022] Updated weights on worker 0-0, policy_version 1112863 (0.00088) [2022-07-11 08:26:15,758][26022] Updated weights on worker 0-0, policy_version 1112873 (0.00084) [2022-07-11 08:26:17,342][26022] Updated weights on worker 0-0, policy_version 1112883 (0.00089) [2022-07-11 08:26:18,808][25689] Fps is (10 sec: 5662.1, 60 sec: 5531.5, 300 sec: 5539.2). Total num frames: 1139599360. Throughput: 0: 5725.8. Samples: 1139601902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:18,809][25689] Avg episode reward: [(0, '-0.748')] [2022-07-11 08:26:19,549][26022] Updated weights on worker 0-0, policy_version 1112893 (0.00085) [2022-07-11 08:26:20,998][26022] Updated weights on worker 0-0, policy_version 1112903 (0.00085) [2022-07-11 08:26:23,008][26022] Updated weights on worker 0-0, policy_version 1112913 (0.00092) [2022-07-11 08:26:23,893][25689] Fps is (10 sec: 5538.5, 60 sec: 5558.0, 300 sec: 5541.3). Total num frames: 1139628032. Throughput: 0: 5842.2. Samples: 1139635964. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:23,894][25689] Avg episode reward: [(0, '-0.452')] [2022-07-11 08:26:24,530][26022] Updated weights on worker 0-0, policy_version 1112923 (0.00092) [2022-07-11 08:26:26,658][26022] Updated weights on worker 0-0, policy_version 1112933 (0.00087) [2022-07-11 08:26:28,191][26022] Updated weights on worker 0-0, policy_version 1112943 (0.00099) [2022-07-11 08:26:28,940][25689] Fps is (10 sec: 5659.7, 60 sec: 5537.2, 300 sec: 5544.7). Total num frames: 1139656704. Throughput: 0: 5004.0. Samples: 1139652844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:28,941][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 08:26:30,267][26022] Updated weights on worker 0-0, policy_version 1112953 (0.00086) [2022-07-11 08:26:32,181][26022] Updated weights on worker 0-0, policy_version 1112963 (0.00082) [2022-07-11 08:26:33,776][26022] Updated weights on worker 0-0, policy_version 1112973 (0.00083) [2022-07-11 08:26:33,994][25689] Fps is (10 sec: 5677.0, 60 sec: 5569.6, 300 sec: 5544.0). Total num frames: 1139685376. Throughput: 0: 5823.5. Samples: 1139686402. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:33,995][25689] Avg episode reward: [(0, '0.027')] [2022-07-11 08:26:35,700][26022] Updated weights on worker 0-0, policy_version 1112983 (0.00092) [2022-07-11 08:26:37,606][26022] Updated weights on worker 0-0, policy_version 1112993 (0.00080) [2022-07-11 08:26:39,007][25689] Fps is (10 sec: 5594.8, 60 sec: 5554.7, 300 sec: 5547.4). Total num frames: 1139713024. Throughput: 0: 5871.3. Samples: 1139720492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:39,007][25689] Avg episode reward: [(0, '-0.235')] [2022-07-11 08:26:39,248][26022] Updated weights on worker 0-0, policy_version 1113003 (0.00087) [2022-07-11 08:26:41,142][26022] Updated weights on worker 0-0, policy_version 1113013 (0.00085) [2022-07-11 08:26:42,866][26022] Updated weights on worker 0-0, policy_version 1113023 (0.00090) [2022-07-11 08:26:44,019][25689] Fps is (10 sec: 5515.9, 60 sec: 5539.4, 300 sec: 5547.5). Total num frames: 1139740672. Throughput: 0: 5047.7. Samples: 1139737554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:44,020][25689] Avg episode reward: [(0, '0.202')] [2022-07-11 08:26:44,563][26022] Updated weights on worker 0-0, policy_version 1113033 (0.00092) [2022-07-11 08:26:46,545][26022] Updated weights on worker 0-0, policy_version 1113043 (0.00086) [2022-07-11 08:26:48,294][26022] Updated weights on worker 0-0, policy_version 1113053 (0.00378) [2022-07-11 08:26:49,035][25689] Fps is (10 sec: 5616.2, 60 sec: 5556.0, 300 sec: 5546.3). Total num frames: 1139769344. Throughput: 0: 5892.6. Samples: 1139771254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:49,038][25689] Avg episode reward: [(0, '0.081')] [2022-07-11 08:26:50,278][26022] Updated weights on worker 0-0, policy_version 1113063 (0.00086) [2022-07-11 08:26:51,844][26022] Updated weights on worker 0-0, policy_version 1113073 (0.00084) [2022-07-11 08:26:53,759][26022] Updated weights on worker 0-0, policy_version 1113083 (0.00089) [2022-07-11 08:26:54,082][25689] Fps is (10 sec: 5800.2, 60 sec: 5589.5, 300 sec: 5552.8). Total num frames: 1139799040. Throughput: 0: 5904.3. Samples: 1139805006. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:54,083][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 08:26:55,583][26022] Updated weights on worker 0-0, policy_version 1113093 (0.00095) [2022-07-11 08:26:57,431][26022] Updated weights on worker 0-0, policy_version 1113103 (0.00095) [2022-07-11 08:26:59,091][25689] Fps is (10 sec: 5702.5, 60 sec: 5589.1, 300 sec: 5556.5). Total num frames: 1139826688. Throughput: 0: 5040.6. Samples: 1139821728. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:26:59,091][25689] Avg episode reward: [(0, '-0.292')] [2022-07-11 08:26:59,307][26022] Updated weights on worker 0-0, policy_version 1113113 (0.00078) [2022-07-11 08:27:01,181][26022] Updated weights on worker 0-0, policy_version 1113123 (0.00088) [2022-07-11 08:27:03,315][26022] Updated weights on worker 0-0, policy_version 1113133 (0.00097) [2022-07-11 08:27:04,120][25689] Fps is (10 sec: 5100.9, 60 sec: 5556.3, 300 sec: 5539.2). Total num frames: 1139850240. Throughput: 0: 5752.3. Samples: 1139853178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:04,122][25689] Avg episode reward: [(0, '-0.433')] [2022-07-11 08:27:05,295][26022] Updated weights on worker 0-0, policy_version 1113143 (0.00086) [2022-07-11 08:27:07,063][26022] Updated weights on worker 0-0, policy_version 1113153 (0.00357) [2022-07-11 08:27:08,811][26022] Updated weights on worker 0-0, policy_version 1113163 (0.00095) [2022-07-11 08:27:09,135][25689] Fps is (10 sec: 5403.4, 60 sec: 5606.6, 300 sec: 5551.7). Total num frames: 1139880960. Throughput: 0: 5744.9. Samples: 1139886726. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:09,135][25689] Avg episode reward: [(0, '-0.194')] [2022-07-11 08:27:10,895][26022] Updated weights on worker 0-0, policy_version 1113173 (0.00091) [2022-07-11 08:27:12,617][26022] Updated weights on worker 0-0, policy_version 1113183 (0.00092) [2022-07-11 08:27:14,193][25689] Fps is (10 sec: 5692.4, 60 sec: 5554.5, 300 sec: 5543.9). Total num frames: 1139907584. Throughput: 0: 4896.8. Samples: 1139903488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:14,194][25689] Avg episode reward: [(0, '-1.285')] [2022-07-11 08:27:14,386][26022] Updated weights on worker 0-0, policy_version 1113193 (0.00086) [2022-07-11 08:27:16,064][26022] Updated weights on worker 0-0, policy_version 1113203 (0.00087) [2022-07-11 08:27:18,095][26022] Updated weights on worker 0-0, policy_version 1113213 (0.00090) [2022-07-11 08:27:19,279][25689] Fps is (10 sec: 5451.0, 60 sec: 5571.2, 300 sec: 5547.1). Total num frames: 1139936256. Throughput: 0: 5726.5. Samples: 1139937336. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:19,279][25689] Avg episode reward: [(0, '-0.781')] [2022-07-11 08:27:19,952][26022] Updated weights on worker 0-0, policy_version 1113224 (0.00082) [2022-07-11 08:27:21,907][26022] Updated weights on worker 0-0, policy_version 1113234 (0.00096) [2022-07-11 08:27:23,674][26022] Updated weights on worker 0-0, policy_version 1113244 (0.00091) [2022-07-11 08:27:24,328][25689] Fps is (10 sec: 5658.2, 60 sec: 5574.5, 300 sec: 5549.8). Total num frames: 1139964928. Throughput: 0: 5831.6. Samples: 1139971026. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:24,329][25689] Avg episode reward: [(0, '0.412')] [2022-07-11 08:27:25,695][26022] Updated weights on worker 0-0, policy_version 1113254 (0.00084) [2022-07-11 08:27:27,546][26022] Updated weights on worker 0-0, policy_version 1113264 (0.00093) [2022-07-11 08:27:29,211][26022] Updated weights on worker 0-0, policy_version 1113274 (0.00096) [2022-07-11 08:27:29,334][25689] Fps is (10 sec: 5601.2, 60 sec: 5561.4, 300 sec: 5547.4). Total num frames: 1139992576. Throughput: 0: 5000.5. Samples: 1139987732. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:29,334][25689] Avg episode reward: [(0, '0.316')] [2022-07-11 08:27:31,221][26022] Updated weights on worker 0-0, policy_version 1113284 (0.00091) [2022-07-11 08:27:32,692][26022] Updated weights on worker 0-0, policy_version 1113294 (0.00093) [2022-07-11 08:27:34,450][25689] Fps is (10 sec: 5362.1, 60 sec: 5521.8, 300 sec: 5539.1). Total num frames: 1140019200. Throughput: 0: 5809.6. Samples: 1140021168. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:34,451][25689] Avg episode reward: [(0, '-1.092')] [2022-07-11 08:27:34,843][26022] Updated weights on worker 0-0, policy_version 1113304 (0.00084) [2022-07-11 08:27:36,339][26022] Updated weights on worker 0-0, policy_version 1113314 (0.00088) [2022-07-11 08:27:38,411][26022] Updated weights on worker 0-0, policy_version 1113324 (0.00086) [2022-07-11 08:27:39,467][25689] Fps is (10 sec: 5760.0, 60 sec: 5589.1, 300 sec: 5553.2). Total num frames: 1140050944. Throughput: 0: 5816.7. Samples: 1140054766. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:39,468][25689] Avg episode reward: [(0, '-1.310')] [2022-07-11 08:27:40,607][26022] Updated weights on worker 0-0, policy_version 1113334 (0.00087) [2022-07-11 08:27:41,968][26022] Updated weights on worker 0-0, policy_version 1113344 (0.00089) [2022-07-11 08:27:44,060][26022] Updated weights on worker 0-0, policy_version 1113354 (0.00088) [2022-07-11 08:27:44,566][25689] Fps is (10 sec: 5668.6, 60 sec: 5547.3, 300 sec: 5544.9). Total num frames: 1140076544. Throughput: 0: 4974.8. Samples: 1140071704. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:44,566][25689] Avg episode reward: [(0, '-1.133')] [2022-07-11 08:27:45,737][26022] Updated weights on worker 0-0, policy_version 1113364 (0.00084) [2022-07-11 08:27:47,790][26022] Updated weights on worker 0-0, policy_version 1113374 (0.00092) [2022-07-11 08:27:49,551][26022] Updated weights on worker 0-0, policy_version 1113384 (0.00094) [2022-07-11 08:27:49,577][25689] Fps is (10 sec: 5368.2, 60 sec: 5547.7, 300 sec: 5542.4). Total num frames: 1140105216. Throughput: 0: 5795.4. Samples: 1140105050. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:49,578][25689] Avg episode reward: [(0, '-3.447')] [2022-07-11 08:27:49,659][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:27:49,670][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001113386_1140107264.pth [2022-07-11 08:27:49,671][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001111432_1138106368.pth [2022-07-11 08:27:51,403][26022] Updated weights on worker 0-0, policy_version 1113394 (0.00090) [2022-07-11 08:27:52,925][26022] Updated weights on worker 0-0, policy_version 1113404 (0.00086) [2022-07-11 08:27:54,656][25689] Fps is (10 sec: 5682.9, 60 sec: 5527.9, 300 sec: 5544.4). Total num frames: 1140133888. Throughput: 0: 5816.9. Samples: 1140138710. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:54,657][25689] Avg episode reward: [(0, '-3.140')] [2022-07-11 08:27:55,212][26022] Updated weights on worker 0-0, policy_version 1113414 (0.00089) [2022-07-11 08:27:56,698][26022] Updated weights on worker 0-0, policy_version 1113424 (0.00085) [2022-07-11 08:27:58,585][26022] Updated weights on worker 0-0, policy_version 1113434 (0.00081) [2022-07-11 08:27:59,729][25689] Fps is (10 sec: 5648.5, 60 sec: 5538.9, 300 sec: 5550.1). Total num frames: 1140162560. Throughput: 0: 4967.3. Samples: 1140155414. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:27:59,730][25689] Avg episode reward: [(0, '-2.893')] [2022-07-11 08:28:00,555][26022] Updated weights on worker 0-0, policy_version 1113444 (0.00089) [2022-07-11 08:28:02,534][26022] Updated weights on worker 0-0, policy_version 1113454 (0.00091) [2022-07-11 08:28:04,499][26022] Updated weights on worker 0-0, policy_version 1113464 (0.00086) [2022-07-11 08:28:04,731][25689] Fps is (10 sec: 5387.1, 60 sec: 5575.2, 300 sec: 5547.0). Total num frames: 1140188160. Throughput: 0: 5713.5. Samples: 1140186916. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:28:04,731][25689] Avg episode reward: [(0, '-1.469')] [2022-07-11 08:28:06,101][26022] Updated weights on worker 0-0, policy_version 1113474 (0.00614) [2022-07-11 08:28:08,069][26022] Updated weights on worker 0-0, policy_version 1113484 (0.00091) [2022-07-11 08:28:09,815][25689] Fps is (10 sec: 5381.0, 60 sec: 5535.1, 300 sec: 5550.5). Total num frames: 1140216832. Throughput: 0: 5720.4. Samples: 1140220820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:28:09,816][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 08:28:09,949][26022] Updated weights on worker 0-0, policy_version 1113494 (0.00088) [2022-07-11 08:28:11,583][26022] Updated weights on worker 0-0, policy_version 1113504 (0.00088) [2022-07-11 08:28:13,759][26022] Updated weights on worker 0-0, policy_version 1113514 (0.00090) [2022-07-11 08:28:14,905][25689] Fps is (10 sec: 5434.7, 60 sec: 5532.2, 300 sec: 5542.2). Total num frames: 1140243456. Throughput: 0: 5722.6. Samples: 1140254586. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:28:14,907][25689] Avg episode reward: [(0, '1.443')] [2022-07-11 08:28:15,265][26022] Updated weights on worker 0-0, policy_version 1113524 (0.00089) [2022-07-11 08:28:17,352][26022] Updated weights on worker 0-0, policy_version 1113534 (0.00090) [2022-07-11 08:28:19,156][26022] Updated weights on worker 0-0, policy_version 1113544 (0.00088) [2022-07-11 08:28:19,963][25689] Fps is (10 sec: 5651.0, 60 sec: 5568.5, 300 sec: 5551.5). Total num frames: 1140274176. Throughput: 0: 5717.0. Samples: 1140271088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:28:19,963][25689] Avg episode reward: [(0, '0.193')] [2022-07-11 08:28:20,927][26022] Updated weights on worker 0-0, policy_version 1113554 (0.00088) [2022-07-11 08:28:22,640][26022] Updated weights on worker 0-0, policy_version 1113564 (0.00086) [2022-07-11 08:28:24,491][26022] Updated weights on worker 0-0, policy_version 1113574 (0.00091) [2022-07-11 08:28:24,979][25689] Fps is (10 sec: 5794.3, 60 sec: 5554.7, 300 sec: 5554.8). Total num frames: 1140301824. Throughput: 0: 5825.8. Samples: 1140304876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:28:24,980][25689] Avg episode reward: [(0, '-1.320')] [2022-07-11 08:28:26,314][26022] Updated weights on worker 0-0, policy_version 1113584 (0.00386) [2022-07-11 08:28:28,389][26022] Updated weights on worker 0-0, policy_version 1113594 (0.00096) [2022-07-11 08:28:29,959][26022] Updated weights on worker 0-0, policy_version 1113604 (0.00088) [2022-07-11 08:28:30,056][25689] Fps is (10 sec: 5579.8, 60 sec: 5565.0, 300 sec: 5548.3). Total num frames: 1140330496. Throughput: 0: 5810.5. Samples: 1140338432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 08:28:30,057][25689] Avg episode reward: [(0, '-1.995')] [2022-07-11 08:28:31,984][26022] Updated weights on worker 0-0, policy_version 1113614 (0.00087) [2022-07-11 08:28:33,539][26022] Updated weights on worker 0-0, policy_version 1113624 (0.00090) [2022-07-11 08:28:35,148][25689] Fps is (10 sec: 5538.3, 60 sec: 5584.1, 300 sec: 5548.2). Total num frames: 1140358144. Throughput: 0: 4958.3. Samples: 1140354954. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:28:35,148][25689] Avg episode reward: [(0, '-1.973')] [2022-07-11 08:28:35,787][26022] Updated weights on worker 0-0, policy_version 1113634 (0.00086) [2022-07-11 08:28:37,332][26022] Updated weights on worker 0-0, policy_version 1113644 (0.00087) [2022-07-11 08:28:39,366][26022] Updated weights on worker 0-0, policy_version 1113654 (0.00114) [2022-07-11 08:28:40,182][25689] Fps is (10 sec: 5562.1, 60 sec: 5532.0, 300 sec: 5552.1). Total num frames: 1140386816. Throughput: 0: 5812.8. Samples: 1140388618. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:28:40,183][25689] Avg episode reward: [(0, '-1.939')] [2022-07-11 08:28:41,166][26022] Updated weights on worker 0-0, policy_version 1113664 (0.00087) [2022-07-11 08:28:42,990][26022] Updated weights on worker 0-0, policy_version 1113674 (0.00088) [2022-07-11 08:28:44,796][26022] Updated weights on worker 0-0, policy_version 1113684 (0.00087) [2022-07-11 08:28:45,215][25689] Fps is (10 sec: 5594.8, 60 sec: 5571.8, 300 sec: 5551.9). Total num frames: 1140414464. Throughput: 0: 5805.7. Samples: 1140422358. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:28:45,215][25689] Avg episode reward: [(0, '-2.391')] [2022-07-11 08:28:46,686][26022] Updated weights on worker 0-0, policy_version 1113694 (0.00091) [2022-07-11 08:28:48,266][26022] Updated weights on worker 0-0, policy_version 1113704 (0.00086) [2022-07-11 08:28:50,258][25689] Fps is (10 sec: 5488.2, 60 sec: 5552.0, 300 sec: 5549.4). Total num frames: 1140442112. Throughput: 0: 4968.8. Samples: 1140438808. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:28:50,258][25689] Avg episode reward: [(0, '-1.231')] [2022-07-11 08:28:50,391][26022] Updated weights on worker 0-0, policy_version 1113714 (0.00097) [2022-07-11 08:28:52,103][26022] Updated weights on worker 0-0, policy_version 1113724 (0.00091) [2022-07-11 08:28:54,008][26022] Updated weights on worker 0-0, policy_version 1113734 (0.00086) [2022-07-11 08:28:55,317][25689] Fps is (10 sec: 5574.9, 60 sec: 5553.8, 300 sec: 5549.8). Total num frames: 1140470784. Throughput: 0: 5828.6. Samples: 1140472512. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:28:55,317][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 08:28:55,708][26022] Updated weights on worker 0-0, policy_version 1113744 (0.00086) [2022-07-11 08:28:57,504][26022] Updated weights on worker 0-0, policy_version 1113754 (0.00090) [2022-07-11 08:28:59,507][26022] Updated weights on worker 0-0, policy_version 1113764 (0.00083) [2022-07-11 08:29:00,334][25689] Fps is (10 sec: 5691.1, 60 sec: 5558.9, 300 sec: 5560.0). Total num frames: 1140499456. Throughput: 0: 5832.4. Samples: 1140506150. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:00,334][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 08:29:01,182][26022] Updated weights on worker 0-0, policy_version 1113774 (0.00082) [2022-07-11 08:29:03,484][26022] Updated weights on worker 0-0, policy_version 1113784 (0.00087) [2022-07-11 08:29:05,274][26022] Updated weights on worker 0-0, policy_version 1113794 (0.00084) [2022-07-11 08:29:05,360][25689] Fps is (10 sec: 5404.0, 60 sec: 5556.7, 300 sec: 5553.6). Total num frames: 1140525056. Throughput: 0: 4891.7. Samples: 1140520904. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:05,360][25689] Avg episode reward: [(0, '0.533')] [2022-07-11 08:29:07,142][26022] Updated weights on worker 0-0, policy_version 1113804 (0.00091) [2022-07-11 08:29:09,114][26022] Updated weights on worker 0-0, policy_version 1113814 (0.00091) [2022-07-11 08:29:10,365][25689] Fps is (10 sec: 5308.0, 60 sec: 5547.0, 300 sec: 5551.5). Total num frames: 1140552704. Throughput: 0: 5757.6. Samples: 1140554580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:10,366][25689] Avg episode reward: [(0, '-0.803')] [2022-07-11 08:29:10,731][26022] Updated weights on worker 0-0, policy_version 1113824 (0.00094) [2022-07-11 08:29:12,720][26022] Updated weights on worker 0-0, policy_version 1113834 (0.00093) [2022-07-11 08:29:14,364][26022] Updated weights on worker 0-0, policy_version 1113844 (0.00092) [2022-07-11 08:29:15,484][25689] Fps is (10 sec: 5563.1, 60 sec: 5578.2, 300 sec: 5557.2). Total num frames: 1140581376. Throughput: 0: 5723.1. Samples: 1140587928. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:15,484][25689] Avg episode reward: [(0, '-0.378')] [2022-07-11 08:29:16,412][26022] Updated weights on worker 0-0, policy_version 1113854 (0.00098) [2022-07-11 08:29:18,275][26022] Updated weights on worker 0-0, policy_version 1113864 (0.00085) [2022-07-11 08:29:20,136][26022] Updated weights on worker 0-0, policy_version 1113874 (0.00093) [2022-07-11 08:29:20,511][25689] Fps is (10 sec: 5450.3, 60 sec: 5513.4, 300 sec: 5546.7). Total num frames: 1140608000. Throughput: 0: 4868.0. Samples: 1140604372. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:20,512][25689] Avg episode reward: [(0, '-0.482')] [2022-07-11 08:29:21,894][26022] Updated weights on worker 0-0, policy_version 1113884 (0.00087) [2022-07-11 08:29:23,983][26022] Updated weights on worker 0-0, policy_version 1113894 (0.00087) [2022-07-11 08:29:25,349][26022] Updated weights on worker 0-0, policy_version 1113904 (0.00085) [2022-07-11 08:29:25,523][25689] Fps is (10 sec: 5610.1, 60 sec: 5547.6, 300 sec: 5557.6). Total num frames: 1140637696. Throughput: 0: 5801.1. Samples: 1140637870. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:25,523][25689] Avg episode reward: [(0, '-0.543')] [2022-07-11 08:29:27,641][26022] Updated weights on worker 0-0, policy_version 1113914 (0.00089) [2022-07-11 08:29:29,313][26022] Updated weights on worker 0-0, policy_version 1113924 (0.00083) [2022-07-11 08:29:30,545][25689] Fps is (10 sec: 5612.6, 60 sec: 5518.8, 300 sec: 5548.4). Total num frames: 1140664320. Throughput: 0: 5764.1. Samples: 1140670900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:30,547][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 08:29:31,186][26022] Updated weights on worker 0-0, policy_version 1113934 (0.00084) [2022-07-11 08:29:33,004][26022] Updated weights on worker 0-0, policy_version 1113944 (0.00090) [2022-07-11 08:29:34,916][26022] Updated weights on worker 0-0, policy_version 1113954 (0.00091) [2022-07-11 08:29:35,658][25689] Fps is (10 sec: 5455.7, 60 sec: 5533.7, 300 sec: 5550.0). Total num frames: 1140692992. Throughput: 0: 4935.4. Samples: 1140687498. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:35,659][25689] Avg episode reward: [(0, '-0.560')] [2022-07-11 08:29:36,587][26022] Updated weights on worker 0-0, policy_version 1113964 (0.00093) [2022-07-11 08:29:38,572][26022] Updated weights on worker 0-0, policy_version 1113974 (0.00079) [2022-07-11 08:29:40,231][26022] Updated weights on worker 0-0, policy_version 1113984 (0.00081) [2022-07-11 08:29:40,717][25689] Fps is (10 sec: 5738.1, 60 sec: 5548.4, 300 sec: 5556.1). Total num frames: 1140722688. Throughput: 0: 5781.0. Samples: 1140721186. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:40,718][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 08:29:42,437][26022] Updated weights on worker 0-0, policy_version 1113994 (0.00092) [2022-07-11 08:29:43,934][26022] Updated weights on worker 0-0, policy_version 1114004 (0.00082) [2022-07-11 08:29:45,749][25689] Fps is (10 sec: 5378.5, 60 sec: 5497.7, 300 sec: 5545.7). Total num frames: 1140747264. Throughput: 0: 5769.3. Samples: 1140754560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:45,749][25689] Avg episode reward: [(0, '0.577')] [2022-07-11 08:29:46,017][26022] Updated weights on worker 0-0, policy_version 1114014 (0.00103) [2022-07-11 08:29:47,710][26022] Updated weights on worker 0-0, policy_version 1114024 (0.00088) [2022-07-11 08:29:49,564][26022] Updated weights on worker 0-0, policy_version 1114034 (0.00086) [2022-07-11 08:29:49,878][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:29:49,891][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001114036_1140772864.pth [2022-07-11 08:29:49,892][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001112083_1138772992.pth [2022-07-11 08:29:50,828][25689] Fps is (10 sec: 5468.9, 60 sec: 5545.1, 300 sec: 5551.8). Total num frames: 1140777984. Throughput: 0: 4950.5. Samples: 1140771310. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:50,829][25689] Avg episode reward: [(0, '0.564')] [2022-07-11 08:29:51,305][26022] Updated weights on worker 0-0, policy_version 1114044 (0.00081) [2022-07-11 08:29:53,050][26022] Updated weights on worker 0-0, policy_version 1114054 (0.00078) [2022-07-11 08:29:54,955][26022] Updated weights on worker 0-0, policy_version 1114064 (0.00089) [2022-07-11 08:29:55,930][25689] Fps is (10 sec: 5833.4, 60 sec: 5541.2, 300 sec: 5550.3). Total num frames: 1140806656. Throughput: 0: 5799.6. Samples: 1140805068. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:29:55,931][25689] Avg episode reward: [(0, '1.429')] [2022-07-11 08:29:56,959][26022] Updated weights on worker 0-0, policy_version 1114074 (0.00091) [2022-07-11 08:29:58,605][26022] Updated weights on worker 0-0, policy_version 1114084 (0.00087) [2022-07-11 08:30:00,680][26022] Updated weights on worker 0-0, policy_version 1114094 (0.00090) [2022-07-11 08:30:00,941][25689] Fps is (10 sec: 5467.9, 60 sec: 5508.0, 300 sec: 5553.6). Total num frames: 1140833280. Throughput: 0: 5791.2. Samples: 1140838306. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:00,942][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 08:30:02,752][26022] Updated weights on worker 0-0, policy_version 1114104 (0.00087) [2022-07-11 08:30:04,738][26022] Updated weights on worker 0-0, policy_version 1114114 (0.00090) [2022-07-11 08:30:05,954][25689] Fps is (10 sec: 5108.0, 60 sec: 5492.3, 300 sec: 5540.3). Total num frames: 1140857856. Throughput: 0: 4858.7. Samples: 1140852728. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:05,954][25689] Avg episode reward: [(0, '0.319')] [2022-07-11 08:30:06,429][26022] Updated weights on worker 0-0, policy_version 1114124 (0.00088) [2022-07-11 08:30:08,394][26022] Updated weights on worker 0-0, policy_version 1114134 (0.00091) [2022-07-11 08:30:10,133][26022] Updated weights on worker 0-0, policy_version 1114144 (0.00087) [2022-07-11 08:30:10,987][25689] Fps is (10 sec: 5504.5, 60 sec: 5540.5, 300 sec: 5554.7). Total num frames: 1140888576. Throughput: 0: 5710.3. Samples: 1140886420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:10,987][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 08:30:12,021][26022] Updated weights on worker 0-0, policy_version 1114154 (0.00116) [2022-07-11 08:30:13,779][26022] Updated weights on worker 0-0, policy_version 1114164 (0.00084) [2022-07-11 08:30:15,618][26022] Updated weights on worker 0-0, policy_version 1114174 (0.00052) [2022-07-11 08:30:16,113][25689] Fps is (10 sec: 5745.0, 60 sec: 5522.8, 300 sec: 5545.6). Total num frames: 1140916224. Throughput: 0: 5700.8. Samples: 1140920128. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:16,114][25689] Avg episode reward: [(0, '0.408')] [2022-07-11 08:30:17,542][26022] Updated weights on worker 0-0, policy_version 1114184 (0.00091) [2022-07-11 08:30:19,464][26022] Updated weights on worker 0-0, policy_version 1114194 (0.00087) [2022-07-11 08:30:21,120][25689] Fps is (10 sec: 5557.7, 60 sec: 5558.5, 300 sec: 5552.5). Total num frames: 1140944896. Throughput: 0: 5708.7. Samples: 1140953504. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:21,121][25689] Avg episode reward: [(0, '0.253')] [2022-07-11 08:30:21,122][26022] Updated weights on worker 0-0, policy_version 1114204 (0.00096) [2022-07-11 08:30:23,134][26022] Updated weights on worker 0-0, policy_version 1114214 (0.00086) [2022-07-11 08:30:24,777][26022] Updated weights on worker 0-0, policy_version 1114224 (0.00085) [2022-07-11 08:30:26,136][25689] Fps is (10 sec: 5619.6, 60 sec: 5524.4, 300 sec: 5545.4). Total num frames: 1140972544. Throughput: 0: 5828.6. Samples: 1140970360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:26,136][25689] Avg episode reward: [(0, '0.825')] [2022-07-11 08:30:26,835][26022] Updated weights on worker 0-0, policy_version 1114234 (0.00100) [2022-07-11 08:30:28,520][26022] Updated weights on worker 0-0, policy_version 1114244 (0.00091) [2022-07-11 08:30:30,303][26022] Updated weights on worker 0-0, policy_version 1114254 (0.00093) [2022-07-11 08:30:31,178][25689] Fps is (10 sec: 5497.9, 60 sec: 5539.5, 300 sec: 5548.8). Total num frames: 1141000192. Throughput: 0: 5814.3. Samples: 1141003818. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:31,178][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 08:30:32,232][26022] Updated weights on worker 0-0, policy_version 1114264 (0.00093) [2022-07-11 08:30:34,112][26022] Updated weights on worker 0-0, policy_version 1114274 (0.00084) [2022-07-11 08:30:35,790][26022] Updated weights on worker 0-0, policy_version 1114284 (0.00098) [2022-07-11 08:30:36,309][25689] Fps is (10 sec: 5535.9, 60 sec: 5537.8, 300 sec: 5547.0). Total num frames: 1141028864. Throughput: 0: 5793.0. Samples: 1141037122. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:36,310][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 08:30:37,842][26022] Updated weights on worker 0-0, policy_version 1114294 (0.00084) [2022-07-11 08:30:39,623][26022] Updated weights on worker 0-0, policy_version 1114304 (0.00095) [2022-07-11 08:30:41,322][25689] Fps is (10 sec: 5551.9, 60 sec: 5508.2, 300 sec: 5543.9). Total num frames: 1141056512. Throughput: 0: 4966.5. Samples: 1141053836. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:41,323][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 08:30:41,497][26022] Updated weights on worker 0-0, policy_version 1114314 (0.00090) [2022-07-11 08:30:43,279][26022] Updated weights on worker 0-0, policy_version 1114324 (0.00095) [2022-07-11 08:30:45,142][26022] Updated weights on worker 0-0, policy_version 1114334 (0.00086) [2022-07-11 08:30:46,334][25689] Fps is (10 sec: 5618.0, 60 sec: 5577.6, 300 sec: 5547.3). Total num frames: 1141085184. Throughput: 0: 5787.7. Samples: 1141087262. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:46,335][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 08:30:47,035][26022] Updated weights on worker 0-0, policy_version 1114344 (0.00083) [2022-07-11 08:30:48,734][26022] Updated weights on worker 0-0, policy_version 1114354 (0.00090) [2022-07-11 08:30:50,652][26022] Updated weights on worker 0-0, policy_version 1114364 (0.00090) [2022-07-11 08:30:51,365][25689] Fps is (10 sec: 5506.1, 60 sec: 5514.4, 300 sec: 5544.1). Total num frames: 1141111808. Throughput: 0: 5795.3. Samples: 1141120808. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:51,365][25689] Avg episode reward: [(0, '1.027')] [2022-07-11 08:30:52,418][26022] Updated weights on worker 0-0, policy_version 1114374 (0.00095) [2022-07-11 08:30:54,411][26022] Updated weights on worker 0-0, policy_version 1114384 (0.00079) [2022-07-11 08:30:56,070][26022] Updated weights on worker 0-0, policy_version 1114394 (0.00085) [2022-07-11 08:30:56,423][25689] Fps is (10 sec: 5480.6, 60 sec: 5518.4, 300 sec: 5546.5). Total num frames: 1141140480. Throughput: 0: 4986.9. Samples: 1141137430. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:30:56,424][25689] Avg episode reward: [(0, '1.197')] [2022-07-11 08:30:57,938][26022] Updated weights on worker 0-0, policy_version 1114404 (0.00091) [2022-07-11 08:30:59,740][26022] Updated weights on worker 0-0, policy_version 1114414 (0.00087) [2022-07-11 08:31:01,438][25689] Fps is (10 sec: 5591.1, 60 sec: 5535.0, 300 sec: 5553.9). Total num frames: 1141168128. Throughput: 0: 5836.9. Samples: 1141171250. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:01,439][25689] Avg episode reward: [(0, '1.368')] [2022-07-11 08:31:01,905][26022] Updated weights on worker 0-0, policy_version 1114424 (0.00106) [2022-07-11 08:31:03,899][26022] Updated weights on worker 0-0, policy_version 1114434 (0.00087) [2022-07-11 08:31:05,664][26022] Updated weights on worker 0-0, policy_version 1114444 (0.00118) [2022-07-11 08:31:06,453][25689] Fps is (10 sec: 5411.1, 60 sec: 5568.6, 300 sec: 5550.3). Total num frames: 1141194752. Throughput: 0: 5740.9. Samples: 1141202764. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:06,454][25689] Avg episode reward: [(0, '1.177')] [2022-07-11 08:31:07,556][26022] Updated weights on worker 0-0, policy_version 1114454 (0.00091) [2022-07-11 08:31:09,408][26022] Updated weights on worker 0-0, policy_version 1114464 (0.00091) [2022-07-11 08:31:11,006][26022] Updated weights on worker 0-0, policy_version 1114474 (0.00091) [2022-07-11 08:31:11,461][25689] Fps is (10 sec: 5414.5, 60 sec: 5520.1, 300 sec: 5544.1). Total num frames: 1141222400. Throughput: 0: 4901.5. Samples: 1141219310. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:11,463][25689] Avg episode reward: [(0, '1.747')] [2022-07-11 08:31:13,289][26022] Updated weights on worker 0-0, policy_version 1114484 (0.00084) [2022-07-11 08:31:14,875][26022] Updated weights on worker 0-0, policy_version 1114494 (0.00091) [2022-07-11 08:31:16,537][25689] Fps is (10 sec: 5686.5, 60 sec: 5558.6, 300 sec: 5551.1). Total num frames: 1141252096. Throughput: 0: 5749.6. Samples: 1141253078. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:16,538][25689] Avg episode reward: [(0, '1.282')] [2022-07-11 08:31:16,550][26022] Updated weights on worker 0-0, policy_version 1114504 (0.00093) [2022-07-11 08:31:18,498][26022] Updated weights on worker 0-0, policy_version 1114514 (0.00091) [2022-07-11 08:31:20,253][26022] Updated weights on worker 0-0, policy_version 1114524 (0.00084) [2022-07-11 08:31:21,576][25689] Fps is (10 sec: 5669.7, 60 sec: 5538.8, 300 sec: 5548.6). Total num frames: 1141279744. Throughput: 0: 5751.0. Samples: 1141287062. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:21,576][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 08:31:22,059][26022] Updated weights on worker 0-0, policy_version 1114534 (0.00088) [2022-07-11 08:31:23,966][26022] Updated weights on worker 0-0, policy_version 1114544 (0.00098) [2022-07-11 08:31:25,587][26022] Updated weights on worker 0-0, policy_version 1114554 (0.00092) [2022-07-11 08:31:26,615][25689] Fps is (10 sec: 5486.9, 60 sec: 5536.5, 300 sec: 5545.3). Total num frames: 1141307392. Throughput: 0: 5013.7. Samples: 1141303852. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:26,617][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 08:31:27,756][26022] Updated weights on worker 0-0, policy_version 1114564 (0.00086) [2022-07-11 08:31:29,425][26022] Updated weights on worker 0-0, policy_version 1114574 (0.00088) [2022-07-11 08:31:31,287][26022] Updated weights on worker 0-0, policy_version 1114584 (0.00088) [2022-07-11 08:31:31,644][25689] Fps is (10 sec: 5492.0, 60 sec: 5537.8, 300 sec: 5542.3). Total num frames: 1141335040. Throughput: 0: 5847.3. Samples: 1141337324. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:31,647][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 08:31:33,064][26022] Updated weights on worker 0-0, policy_version 1114594 (0.00098) [2022-07-11 08:31:34,900][26022] Updated weights on worker 0-0, policy_version 1114604 (0.00089) [2022-07-11 08:31:36,708][25689] Fps is (10 sec: 5681.5, 60 sec: 5560.9, 300 sec: 5548.2). Total num frames: 1141364736. Throughput: 0: 5831.3. Samples: 1141370702. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:36,709][25689] Avg episode reward: [(0, '0.707')] [2022-07-11 08:31:36,715][26022] Updated weights on worker 0-0, policy_version 1114614 (0.00086) [2022-07-11 08:31:38,538][26022] Updated weights on worker 0-0, policy_version 1114624 (0.00085) [2022-07-11 08:31:40,744][26022] Updated weights on worker 0-0, policy_version 1114634 (0.00081) [2022-07-11 08:31:41,719][25689] Fps is (10 sec: 5590.3, 60 sec: 5544.2, 300 sec: 5544.8). Total num frames: 1141391360. Throughput: 0: 4989.2. Samples: 1141387560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:41,719][25689] Avg episode reward: [(0, '0.684')] [2022-07-11 08:31:42,379][26022] Updated weights on worker 0-0, policy_version 1114644 (0.00093) [2022-07-11 08:31:44,270][26022] Updated weights on worker 0-0, policy_version 1114654 (0.00089) [2022-07-11 08:31:45,687][26022] Updated weights on worker 0-0, policy_version 1114664 (0.00084) [2022-07-11 08:31:46,723][25689] Fps is (10 sec: 5521.3, 60 sec: 5544.8, 300 sec: 5545.0). Total num frames: 1141420032. Throughput: 0: 5824.3. Samples: 1141420968. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:46,724][25689] Avg episode reward: [(0, '1.241')] [2022-07-11 08:31:48,142][26022] Updated weights on worker 0-0, policy_version 1114674 (0.00105) [2022-07-11 08:31:49,627][26022] Updated weights on worker 0-0, policy_version 1114684 (0.00088) [2022-07-11 08:31:50,016][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:31:50,029][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001114686_1141438464.pth [2022-07-11 08:31:50,029][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001112734_1139439616.pth [2022-07-11 08:31:51,580][26022] Updated weights on worker 0-0, policy_version 1114694 (0.00092) [2022-07-11 08:31:51,731][25689] Fps is (10 sec: 5625.2, 60 sec: 5564.0, 300 sec: 5538.9). Total num frames: 1141447680. Throughput: 0: 5839.6. Samples: 1141454622. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:51,732][25689] Avg episode reward: [(0, '1.150')] [2022-07-11 08:31:53,328][26022] Updated weights on worker 0-0, policy_version 1114704 (0.00087) [2022-07-11 08:31:55,440][26022] Updated weights on worker 0-0, policy_version 1114714 (0.00094) [2022-07-11 08:31:56,820][25689] Fps is (10 sec: 5476.5, 60 sec: 5544.1, 300 sec: 5537.4). Total num frames: 1141475328. Throughput: 0: 4992.0. Samples: 1141471102. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:31:56,822][25689] Avg episode reward: [(0, '0.884')] [2022-07-11 08:31:57,174][26022] Updated weights on worker 0-0, policy_version 1114724 (0.00086) [2022-07-11 08:31:58,987][26022] Updated weights on worker 0-0, policy_version 1114734 (0.00086) [2022-07-11 08:32:00,565][26022] Updated weights on worker 0-0, policy_version 1114744 (0.00086) [2022-07-11 08:32:01,857][25689] Fps is (10 sec: 5561.6, 60 sec: 5559.0, 300 sec: 5554.4). Total num frames: 1141504000. Throughput: 0: 5817.2. Samples: 1141504710. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:01,858][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 08:32:03,051][26022] Updated weights on worker 0-0, policy_version 1114754 (0.00083) [2022-07-11 08:32:04,740][26022] Updated weights on worker 0-0, policy_version 1114764 (0.00092) [2022-07-11 08:32:06,612][26022] Updated weights on worker 0-0, policy_version 1114774 (0.00079) [2022-07-11 08:32:06,879][25689] Fps is (10 sec: 5497.6, 60 sec: 5558.4, 300 sec: 5540.5). Total num frames: 1141530624. Throughput: 0: 5718.3. Samples: 1141536222. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:06,879][25689] Avg episode reward: [(0, '1.674')] [2022-07-11 08:32:08,376][26022] Updated weights on worker 0-0, policy_version 1114784 (0.00093) [2022-07-11 08:32:10,259][26022] Updated weights on worker 0-0, policy_version 1114794 (0.00094) [2022-07-11 08:32:11,899][25689] Fps is (10 sec: 5302.8, 60 sec: 5540.4, 300 sec: 5541.3). Total num frames: 1141557248. Throughput: 0: 4874.3. Samples: 1141552928. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:11,900][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 08:32:12,125][26022] Updated weights on worker 0-0, policy_version 1114804 (0.00087) [2022-07-11 08:32:13,752][26022] Updated weights on worker 0-0, policy_version 1114814 (0.00084) [2022-07-11 08:32:15,856][26022] Updated weights on worker 0-0, policy_version 1114824 (0.00094) [2022-07-11 08:32:16,996][25689] Fps is (10 sec: 5566.9, 60 sec: 5538.5, 300 sec: 5544.5). Total num frames: 1141586944. Throughput: 0: 5715.8. Samples: 1141586418. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:16,997][25689] Avg episode reward: [(0, '1.001')] [2022-07-11 08:32:17,430][26022] Updated weights on worker 0-0, policy_version 1114834 (0.00093) [2022-07-11 08:32:19,609][26022] Updated weights on worker 0-0, policy_version 1114844 (0.00078) [2022-07-11 08:32:21,213][26022] Updated weights on worker 0-0, policy_version 1114854 (0.00084) [2022-07-11 08:32:22,012][25689] Fps is (10 sec: 5670.1, 60 sec: 5540.5, 300 sec: 5541.7). Total num frames: 1141614592. Throughput: 0: 5738.1. Samples: 1141620362. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:22,013][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 08:32:23,159][26022] Updated weights on worker 0-0, policy_version 1114864 (0.00096) [2022-07-11 08:32:24,861][26022] Updated weights on worker 0-0, policy_version 1114874 (0.00086) [2022-07-11 08:32:26,831][26022] Updated weights on worker 0-0, policy_version 1114884 (0.00095) [2022-07-11 08:32:27,024][25689] Fps is (10 sec: 5514.1, 60 sec: 5543.1, 300 sec: 5541.6). Total num frames: 1141642240. Throughput: 0: 5010.1. Samples: 1141637150. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:27,026][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 08:32:28,544][26022] Updated weights on worker 0-0, policy_version 1114894 (0.00088) [2022-07-11 08:32:30,424][26022] Updated weights on worker 0-0, policy_version 1114904 (0.00054) [2022-07-11 08:32:32,028][25689] Fps is (10 sec: 5521.1, 60 sec: 5545.3, 300 sec: 5547.1). Total num frames: 1141669888. Throughput: 0: 5853.8. Samples: 1141670758. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2022-07-11 08:32:32,028][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 08:32:32,203][26022] Updated weights on worker 0-0, policy_version 1114914 (0.00094) [2022-07-11 08:32:34,232][26022] Updated weights on worker 0-0, policy_version 1114924 (0.00092) [2022-07-11 08:32:35,910][26022] Updated weights on worker 0-0, policy_version 1114934 (0.00086) [2022-07-11 08:32:37,082][25689] Fps is (10 sec: 5497.4, 60 sec: 5512.3, 300 sec: 5532.6). Total num frames: 1141697536. Throughput: 0: 5864.6. Samples: 1141704220. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:32:37,083][25689] Avg episode reward: [(0, '0.440')] [2022-07-11 08:32:37,864][26022] Updated weights on worker 0-0, policy_version 1114944 (0.00088) [2022-07-11 08:32:39,465][26022] Updated weights on worker 0-0, policy_version 1114954 (0.00091) [2022-07-11 08:32:41,545][26022] Updated weights on worker 0-0, policy_version 1114964 (0.00092) [2022-07-11 08:32:42,087][25689] Fps is (10 sec: 5599.1, 60 sec: 5546.8, 300 sec: 5544.7). Total num frames: 1141726208. Throughput: 0: 5012.5. Samples: 1141720984. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:32:42,087][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 08:32:43,160][26022] Updated weights on worker 0-0, policy_version 1114974 (0.00093) [2022-07-11 08:32:45,088][26022] Updated weights on worker 0-0, policy_version 1114984 (0.00091) [2022-07-11 08:32:46,727][26022] Updated weights on worker 0-0, policy_version 1114994 (0.00084) [2022-07-11 08:32:47,091][25689] Fps is (10 sec: 5831.7, 60 sec: 5563.8, 300 sec: 5548.3). Total num frames: 1141755904. Throughput: 0: 5865.0. Samples: 1141754846. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:32:47,092][25689] Avg episode reward: [(0, '0.642')] [2022-07-11 08:32:48,756][26022] Updated weights on worker 0-0, policy_version 1115004 (0.00091) [2022-07-11 08:32:50,430][26022] Updated weights on worker 0-0, policy_version 1115014 (0.00579) [2022-07-11 08:32:52,107][25689] Fps is (10 sec: 5620.2, 60 sec: 5546.0, 300 sec: 5542.6). Total num frames: 1141782528. Throughput: 0: 5861.8. Samples: 1141788462. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:32:52,108][25689] Avg episode reward: [(0, '0.442')] [2022-07-11 08:32:52,412][26022] Updated weights on worker 0-0, policy_version 1115024 (0.00088) [2022-07-11 08:32:54,151][26022] Updated weights on worker 0-0, policy_version 1115034 (0.00089) [2022-07-11 08:32:56,199][26022] Updated weights on worker 0-0, policy_version 1115044 (0.00087) [2022-07-11 08:32:57,195][25689] Fps is (10 sec: 5472.9, 60 sec: 5563.2, 300 sec: 5542.3). Total num frames: 1141811200. Throughput: 0: 5004.9. Samples: 1141804880. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:32:57,196][25689] Avg episode reward: [(0, '-0.108')] [2022-07-11 08:32:57,923][26022] Updated weights on worker 0-0, policy_version 1115054 (0.00081) [2022-07-11 08:32:59,690][26022] Updated weights on worker 0-0, policy_version 1115064 (0.00090) [2022-07-11 08:33:01,592][26022] Updated weights on worker 0-0, policy_version 1115074 (0.00089) [2022-07-11 08:33:02,239][25689] Fps is (10 sec: 5458.0, 60 sec: 5528.6, 300 sec: 5544.9). Total num frames: 1141837824. Throughput: 0: 5823.5. Samples: 1141838340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:02,243][25689] Avg episode reward: [(0, '0.264')] [2022-07-11 08:33:03,904][26022] Updated weights on worker 0-0, policy_version 1115084 (0.00085) [2022-07-11 08:33:05,599][26022] Updated weights on worker 0-0, policy_version 1115094 (0.00084) [2022-07-11 08:33:07,340][25689] Fps is (10 sec: 5349.3, 60 sec: 5538.2, 300 sec: 5541.2). Total num frames: 1141865472. Throughput: 0: 5680.6. Samples: 1141869874. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:07,343][25689] Avg episode reward: [(0, '0.206')] [2022-07-11 08:33:07,360][26022] Updated weights on worker 0-0, policy_version 1115104 (0.00091) [2022-07-11 08:33:09,391][26022] Updated weights on worker 0-0, policy_version 1115114 (0.00092) [2022-07-11 08:33:10,968][26022] Updated weights on worker 0-0, policy_version 1115124 (0.00103) [2022-07-11 08:33:12,418][25689] Fps is (10 sec: 5331.4, 60 sec: 5532.9, 300 sec: 5541.4). Total num frames: 1141892096. Throughput: 0: 5661.1. Samples: 1141903444. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:12,419][25689] Avg episode reward: [(0, '0.028')] [2022-07-11 08:33:12,945][26022] Updated weights on worker 0-0, policy_version 1115134 (0.00107) [2022-07-11 08:33:14,728][26022] Updated weights on worker 0-0, policy_version 1115144 (0.00082) [2022-07-11 08:33:16,659][26022] Updated weights on worker 0-0, policy_version 1115154 (0.00094) [2022-07-11 08:33:17,463][25689] Fps is (10 sec: 5563.7, 60 sec: 5537.6, 300 sec: 5538.2). Total num frames: 1141921792. Throughput: 0: 5680.5. Samples: 1141920016. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:17,464][25689] Avg episode reward: [(0, '0.618')] [2022-07-11 08:33:18,575][26022] Updated weights on worker 0-0, policy_version 1115164 (0.00097) [2022-07-11 08:33:20,206][26022] Updated weights on worker 0-0, policy_version 1115174 (0.00082) [2022-07-11 08:33:22,098][26022] Updated weights on worker 0-0, policy_version 1115184 (0.00082) [2022-07-11 08:33:22,484][25689] Fps is (10 sec: 5696.8, 60 sec: 5537.2, 300 sec: 5538.1). Total num frames: 1141949440. Throughput: 0: 5695.5. Samples: 1141953648. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:22,485][25689] Avg episode reward: [(0, '0.526')] [2022-07-11 08:33:23,948][26022] Updated weights on worker 0-0, policy_version 1115194 (0.00089) [2022-07-11 08:33:25,867][26022] Updated weights on worker 0-0, policy_version 1115204 (0.00096) [2022-07-11 08:33:27,526][25689] Fps is (10 sec: 5495.0, 60 sec: 5534.5, 300 sec: 5535.3). Total num frames: 1141977088. Throughput: 0: 5803.8. Samples: 1141987028. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:27,527][25689] Avg episode reward: [(0, '0.220')] [2022-07-11 08:33:27,740][26022] Updated weights on worker 0-0, policy_version 1115214 (0.00085) [2022-07-11 08:33:29,415][26022] Updated weights on worker 0-0, policy_version 1115224 (0.00085) [2022-07-11 08:33:31,335][26022] Updated weights on worker 0-0, policy_version 1115234 (0.00081) [2022-07-11 08:33:32,536][25689] Fps is (10 sec: 5602.8, 60 sec: 5550.8, 300 sec: 5540.3). Total num frames: 1142005760. Throughput: 0: 4987.2. Samples: 1142003778. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:32,537][25689] Avg episode reward: [(0, '-0.789')] [2022-07-11 08:33:33,207][26022] Updated weights on worker 0-0, policy_version 1115244 (0.00086) [2022-07-11 08:33:34,971][26022] Updated weights on worker 0-0, policy_version 1115254 (0.00093) [2022-07-11 08:33:36,799][26022] Updated weights on worker 0-0, policy_version 1115264 (0.00086) [2022-07-11 08:33:37,595][25689] Fps is (10 sec: 5593.4, 60 sec: 5550.4, 300 sec: 5536.4). Total num frames: 1142033408. Throughput: 0: 5826.3. Samples: 1142037310. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:37,596][25689] Avg episode reward: [(0, '-0.565')] [2022-07-11 08:33:38,576][26022] Updated weights on worker 0-0, policy_version 1115274 (0.00453) [2022-07-11 08:33:40,582][26022] Updated weights on worker 0-0, policy_version 1115284 (0.00086) [2022-07-11 08:33:42,398][26022] Updated weights on worker 0-0, policy_version 1115294 (0.00086) [2022-07-11 08:33:42,653][25689] Fps is (10 sec: 5465.8, 60 sec: 5528.6, 300 sec: 5535.9). Total num frames: 1142061056. Throughput: 0: 5813.5. Samples: 1142070900. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:42,655][25689] Avg episode reward: [(0, '-1.001')] [2022-07-11 08:33:44,387][26022] Updated weights on worker 0-0, policy_version 1115304 (0.00079) [2022-07-11 08:33:46,149][26022] Updated weights on worker 0-0, policy_version 1115314 (0.00094) [2022-07-11 08:33:47,674][25689] Fps is (10 sec: 5588.0, 60 sec: 5510.2, 300 sec: 5539.8). Total num frames: 1142089728. Throughput: 0: 4987.0. Samples: 1142087504. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:47,674][25689] Avg episode reward: [(0, '-0.485')] [2022-07-11 08:33:47,923][26022] Updated weights on worker 0-0, policy_version 1115324 (0.00080) [2022-07-11 08:33:49,682][26022] Updated weights on worker 0-0, policy_version 1115334 (0.00428) [2022-07-11 08:33:50,064][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:33:50,077][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001115336_1142104064.pth [2022-07-11 08:33:50,078][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001113386_1140107264.pth [2022-07-11 08:33:51,609][26022] Updated weights on worker 0-0, policy_version 1115344 (0.00091) [2022-07-11 08:33:52,731][25689] Fps is (10 sec: 5690.3, 60 sec: 5540.3, 300 sec: 5539.8). Total num frames: 1142118400. Throughput: 0: 5795.4. Samples: 1142120812. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:52,731][25689] Avg episode reward: [(0, '-0.539')] [2022-07-11 08:33:53,514][26022] Updated weights on worker 0-0, policy_version 1115354 (0.00086) [2022-07-11 08:33:55,243][26022] Updated weights on worker 0-0, policy_version 1115364 (0.00085) [2022-07-11 08:33:57,265][26022] Updated weights on worker 0-0, policy_version 1115374 (0.00087) [2022-07-11 08:33:57,787][25689] Fps is (10 sec: 5467.9, 60 sec: 5509.4, 300 sec: 5532.2). Total num frames: 1142145024. Throughput: 0: 5796.3. Samples: 1142154346. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:33:57,787][25689] Avg episode reward: [(0, '-0.579')] [2022-07-11 08:33:58,725][26022] Updated weights on worker 0-0, policy_version 1115384 (0.00092) [2022-07-11 08:34:00,966][26022] Updated weights on worker 0-0, policy_version 1115394 (0.00094) [2022-07-11 08:34:02,799][25689] Fps is (10 sec: 5390.5, 60 sec: 5529.2, 300 sec: 5539.3). Total num frames: 1142172672. Throughput: 0: 4978.7. Samples: 1142171200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:02,799][25689] Avg episode reward: [(0, '0.080')] [2022-07-11 08:34:02,855][26022] Updated weights on worker 0-0, policy_version 1115404 (0.00084) [2022-07-11 08:34:04,835][26022] Updated weights on worker 0-0, policy_version 1115414 (0.00085) [2022-07-11 08:34:06,878][26022] Updated weights on worker 0-0, policy_version 1115424 (0.00086) [2022-07-11 08:34:07,811][25689] Fps is (10 sec: 5515.9, 60 sec: 5537.3, 300 sec: 5539.2). Total num frames: 1142200320. Throughput: 0: 5720.5. Samples: 1142202702. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:07,812][25689] Avg episode reward: [(0, '-0.474')] [2022-07-11 08:34:08,407][26022] Updated weights on worker 0-0, policy_version 1115434 (0.00079) [2022-07-11 08:34:10,503][26022] Updated weights on worker 0-0, policy_version 1115444 (0.00082) [2022-07-11 08:34:11,929][26022] Updated weights on worker 0-0, policy_version 1115454 (0.00088) [2022-07-11 08:34:12,820][25689] Fps is (10 sec: 5517.8, 60 sec: 5560.6, 300 sec: 5537.8). Total num frames: 1142227968. Throughput: 0: 5761.6. Samples: 1142236560. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:12,821][25689] Avg episode reward: [(0, '-0.896')] [2022-07-11 08:34:13,901][26022] Updated weights on worker 0-0, policy_version 1115464 (0.00089) [2022-07-11 08:34:15,976][26022] Updated weights on worker 0-0, policy_version 1115474 (0.00093) [2022-07-11 08:34:17,425][26022] Updated weights on worker 0-0, policy_version 1115484 (0.00086) [2022-07-11 08:34:17,942][25689] Fps is (10 sec: 5660.4, 60 sec: 5553.5, 300 sec: 5546.4). Total num frames: 1142257664. Throughput: 0: 4914.3. Samples: 1142253396. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:17,943][25689] Avg episode reward: [(0, '-1.243')] [2022-07-11 08:34:19,574][26022] Updated weights on worker 0-0, policy_version 1115494 (0.00084) [2022-07-11 08:34:21,108][26022] Updated weights on worker 0-0, policy_version 1115504 (0.00102) [2022-07-11 08:34:22,960][25689] Fps is (10 sec: 5554.3, 60 sec: 5536.9, 300 sec: 5535.9). Total num frames: 1142284288. Throughput: 0: 5735.3. Samples: 1142286832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:22,961][25689] Avg episode reward: [(0, '-0.568')] [2022-07-11 08:34:23,149][26022] Updated weights on worker 0-0, policy_version 1115514 (0.00086) [2022-07-11 08:34:24,891][26022] Updated weights on worker 0-0, policy_version 1115524 (0.00093) [2022-07-11 08:34:26,799][26022] Updated weights on worker 0-0, policy_version 1115534 (0.00096) [2022-07-11 08:34:27,975][25689] Fps is (10 sec: 5409.6, 60 sec: 5539.4, 300 sec: 5539.5). Total num frames: 1142311936. Throughput: 0: 5826.6. Samples: 1142320184. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:27,975][25689] Avg episode reward: [(0, '-1.346')] [2022-07-11 08:34:28,676][26022] Updated weights on worker 0-0, policy_version 1115544 (0.00089) [2022-07-11 08:34:30,608][26022] Updated weights on worker 0-0, policy_version 1115554 (0.00089) [2022-07-11 08:34:32,366][26022] Updated weights on worker 0-0, policy_version 1115564 (0.00086) [2022-07-11 08:34:32,979][25689] Fps is (10 sec: 5621.4, 60 sec: 5539.9, 300 sec: 5541.5). Total num frames: 1142340608. Throughput: 0: 4962.9. Samples: 1142336606. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:32,980][25689] Avg episode reward: [(0, '-2.375')] [2022-07-11 08:34:34,367][26022] Updated weights on worker 0-0, policy_version 1115574 (0.00088) [2022-07-11 08:34:35,966][26022] Updated weights on worker 0-0, policy_version 1115584 (0.00086) [2022-07-11 08:34:37,957][26022] Updated weights on worker 0-0, policy_version 1115594 (0.00085) [2022-07-11 08:34:38,100][25689] Fps is (10 sec: 5562.4, 60 sec: 5534.2, 300 sec: 5533.5). Total num frames: 1142368256. Throughput: 0: 5775.6. Samples: 1142369818. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:38,100][25689] Avg episode reward: [(0, '-1.419')] [2022-07-11 08:34:39,635][26022] Updated weights on worker 0-0, policy_version 1115604 (0.00082) [2022-07-11 08:34:41,793][26022] Updated weights on worker 0-0, policy_version 1115614 (0.00082) [2022-07-11 08:34:43,141][25689] Fps is (10 sec: 5542.0, 60 sec: 5552.7, 300 sec: 5547.1). Total num frames: 1142396928. Throughput: 0: 5782.5. Samples: 1142403530. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:43,142][25689] Avg episode reward: [(0, '-1.640')] [2022-07-11 08:34:43,588][26022] Updated weights on worker 0-0, policy_version 1115624 (0.00096) [2022-07-11 08:34:45,349][26022] Updated weights on worker 0-0, policy_version 1115634 (0.00088) [2022-07-11 08:34:46,838][26022] Updated weights on worker 0-0, policy_version 1115644 (0.00090) [2022-07-11 08:34:48,172][25689] Fps is (10 sec: 5591.5, 60 sec: 5534.8, 300 sec: 5537.7). Total num frames: 1142424576. Throughput: 0: 4958.8. Samples: 1142420340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:48,173][25689] Avg episode reward: [(0, '-1.926')] [2022-07-11 08:34:49,037][26022] Updated weights on worker 0-0, policy_version 1115654 (0.00083) [2022-07-11 08:34:50,718][26022] Updated weights on worker 0-0, policy_version 1115664 (0.00089) [2022-07-11 08:34:52,507][26022] Updated weights on worker 0-0, policy_version 1115674 (0.00085) [2022-07-11 08:34:53,216][25689] Fps is (10 sec: 5590.1, 60 sec: 5536.0, 300 sec: 5538.7). Total num frames: 1142453248. Throughput: 0: 5804.4. Samples: 1142454072. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:53,218][25689] Avg episode reward: [(0, '-1.297')] [2022-07-11 08:34:54,489][26022] Updated weights on worker 0-0, policy_version 1115684 (0.00082) [2022-07-11 08:34:56,277][26022] Updated weights on worker 0-0, policy_version 1115694 (0.00087) [2022-07-11 08:34:58,183][26022] Updated weights on worker 0-0, policy_version 1115704 (0.00080) [2022-07-11 08:34:58,301][25689] Fps is (10 sec: 5661.5, 60 sec: 5567.2, 300 sec: 5544.3). Total num frames: 1142481920. Throughput: 0: 5825.6. Samples: 1142487502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:34:58,303][25689] Avg episode reward: [(0, '-1.113')] [2022-07-11 08:35:00,136][26022] Updated weights on worker 0-0, policy_version 1115714 (0.00092) [2022-07-11 08:35:02,039][26022] Updated weights on worker 0-0, policy_version 1115724 (0.00096) [2022-07-11 08:35:03,335][25689] Fps is (10 sec: 5363.7, 60 sec: 5531.4, 300 sec: 5547.3). Total num frames: 1142507520. Throughput: 0: 4986.0. Samples: 1142504214. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:03,335][25689] Avg episode reward: [(0, '-1.190')] [2022-07-11 08:35:04,107][26022] Updated weights on worker 0-0, policy_version 1115734 (0.00090) [2022-07-11 08:35:05,777][26022] Updated weights on worker 0-0, policy_version 1115744 (0.00086) [2022-07-11 08:35:07,675][26022] Updated weights on worker 0-0, policy_version 1115754 (0.00086) [2022-07-11 08:35:08,362][25689] Fps is (10 sec: 5394.1, 60 sec: 5546.9, 300 sec: 5540.5). Total num frames: 1142536192. Throughput: 0: 5719.2. Samples: 1142535812. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:08,363][25689] Avg episode reward: [(0, '-1.724')] [2022-07-11 08:35:09,451][26022] Updated weights on worker 0-0, policy_version 1115764 (0.00080) [2022-07-11 08:35:11,468][26022] Updated weights on worker 0-0, policy_version 1115774 (0.00086) [2022-07-11 08:35:13,054][26022] Updated weights on worker 0-0, policy_version 1115784 (0.00085) [2022-07-11 08:35:13,391][25689] Fps is (10 sec: 5702.4, 60 sec: 5562.0, 300 sec: 5545.8). Total num frames: 1142564864. Throughput: 0: 5724.7. Samples: 1142569566. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:13,391][25689] Avg episode reward: [(0, '-1.042')] [2022-07-11 08:35:15,067][26022] Updated weights on worker 0-0, policy_version 1115794 (0.00088) [2022-07-11 08:35:16,724][26022] Updated weights on worker 0-0, policy_version 1115804 (0.00092) [2022-07-11 08:35:18,516][25689] Fps is (10 sec: 5546.8, 60 sec: 5527.9, 300 sec: 5540.1). Total num frames: 1142592512. Throughput: 0: 4897.0. Samples: 1142586496. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:18,517][25689] Avg episode reward: [(0, '-0.712')] [2022-07-11 08:35:18,622][26022] Updated weights on worker 0-0, policy_version 1115814 (0.00083) [2022-07-11 08:35:20,278][26022] Updated weights on worker 0-0, policy_version 1115824 (0.00085) [2022-07-11 08:35:22,210][26022] Updated weights on worker 0-0, policy_version 1115834 (0.00080) [2022-07-11 08:35:23,530][25689] Fps is (10 sec: 5554.9, 60 sec: 5562.1, 300 sec: 5543.6). Total num frames: 1142621184. Throughput: 0: 5752.3. Samples: 1142620382. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:23,530][25689] Avg episode reward: [(0, '-0.335')] [2022-07-11 08:35:24,096][26022] Updated weights on worker 0-0, policy_version 1115844 (0.00093) [2022-07-11 08:35:25,868][26022] Updated weights on worker 0-0, policy_version 1115854 (0.00871) [2022-07-11 08:35:27,667][26022] Updated weights on worker 0-0, policy_version 1115864 (0.00092) [2022-07-11 08:35:28,535][25689] Fps is (10 sec: 5621.5, 60 sec: 5563.0, 300 sec: 5544.3). Total num frames: 1142648832. Throughput: 0: 5868.7. Samples: 1142654198. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:28,535][25689] Avg episode reward: [(0, '-0.143')] [2022-07-11 08:35:29,481][26022] Updated weights on worker 0-0, policy_version 1115874 (0.00087) [2022-07-11 08:35:31,515][26022] Updated weights on worker 0-0, policy_version 1115884 (0.00093) [2022-07-11 08:35:33,168][26022] Updated weights on worker 0-0, policy_version 1115894 (0.00093) [2022-07-11 08:35:33,587][25689] Fps is (10 sec: 5498.2, 60 sec: 5541.7, 300 sec: 5542.3). Total num frames: 1142676480. Throughput: 0: 5026.7. Samples: 1142671084. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:33,587][25689] Avg episode reward: [(0, '-0.290')] [2022-07-11 08:35:34,942][26022] Updated weights on worker 0-0, policy_version 1115904 (0.00088) [2022-07-11 08:35:36,758][26022] Updated weights on worker 0-0, policy_version 1115914 (0.00087) [2022-07-11 08:35:38,615][26022] Updated weights on worker 0-0, policy_version 1115924 (0.00089) [2022-07-11 08:35:38,660][25689] Fps is (10 sec: 5663.7, 60 sec: 5579.9, 300 sec: 5548.1). Total num frames: 1142706176. Throughput: 0: 5889.7. Samples: 1142705134. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:38,660][25689] Avg episode reward: [(0, '-0.356')] [2022-07-11 08:35:40,253][26022] Updated weights on worker 0-0, policy_version 1115934 (0.00085) [2022-07-11 08:35:42,261][26022] Updated weights on worker 0-0, policy_version 1115944 (0.00086) [2022-07-11 08:35:43,699][25689] Fps is (10 sec: 5772.2, 60 sec: 5580.2, 300 sec: 5547.6). Total num frames: 1142734848. Throughput: 0: 5869.7. Samples: 1142738768. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:43,699][25689] Avg episode reward: [(0, '-1.410')] [2022-07-11 08:35:44,180][26022] Updated weights on worker 0-0, policy_version 1115954 (0.00090) [2022-07-11 08:35:45,868][26022] Updated weights on worker 0-0, policy_version 1115964 (0.00095) [2022-07-11 08:35:47,916][26022] Updated weights on worker 0-0, policy_version 1115974 (0.00090) [2022-07-11 08:35:48,716][25689] Fps is (10 sec: 5498.5, 60 sec: 5564.5, 300 sec: 5547.9). Total num frames: 1142761472. Throughput: 0: 5020.1. Samples: 1142755510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:48,717][25689] Avg episode reward: [(0, '-1.378')] [2022-07-11 08:35:49,604][26022] Updated weights on worker 0-0, policy_version 1115984 (0.00087) [2022-07-11 08:35:50,171][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:35:50,185][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001115988_1142771712.pth [2022-07-11 08:35:50,185][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001114036_1140772864.pth [2022-07-11 08:35:51,575][26022] Updated weights on worker 0-0, policy_version 1115994 (0.00083) [2022-07-11 08:35:53,244][26022] Updated weights on worker 0-0, policy_version 1116004 (0.00083) [2022-07-11 08:35:53,743][25689] Fps is (10 sec: 5709.3, 60 sec: 5599.9, 300 sec: 5555.3). Total num frames: 1142792192. Throughput: 0: 5840.7. Samples: 1142788808. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:53,743][25689] Avg episode reward: [(0, '-1.494')] [2022-07-11 08:35:55,195][26022] Updated weights on worker 0-0, policy_version 1116014 (0.00085) [2022-07-11 08:35:56,857][26022] Updated weights on worker 0-0, policy_version 1116024 (0.00079) [2022-07-11 08:35:58,784][25689] Fps is (10 sec: 5695.6, 60 sec: 5570.1, 300 sec: 5551.4). Total num frames: 1142818816. Throughput: 0: 5829.2. Samples: 1142822446. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:35:58,785][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 08:35:58,790][26022] Updated weights on worker 0-0, policy_version 1116034 (0.00098) [2022-07-11 08:36:00,562][26022] Updated weights on worker 0-0, policy_version 1116044 (0.00084) [2022-07-11 08:36:02,736][26022] Updated weights on worker 0-0, policy_version 1116054 (0.00088) [2022-07-11 08:36:03,798][25689] Fps is (10 sec: 5193.6, 60 sec: 5571.9, 300 sec: 5548.0). Total num frames: 1142844416. Throughput: 0: 5742.5. Samples: 1142854190. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:03,799][25689] Avg episode reward: [(0, '-0.167')] [2022-07-11 08:36:04,609][26022] Updated weights on worker 0-0, policy_version 1116064 (0.00089) [2022-07-11 08:36:06,336][26022] Updated weights on worker 0-0, policy_version 1116074 (0.00087) [2022-07-11 08:36:08,115][26022] Updated weights on worker 0-0, policy_version 1116084 (0.00091) [2022-07-11 08:36:08,806][25689] Fps is (10 sec: 5415.5, 60 sec: 5573.7, 300 sec: 5551.4). Total num frames: 1142873088. Throughput: 0: 5753.2. Samples: 1142871090. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:08,806][25689] Avg episode reward: [(0, '1.721')] [2022-07-11 08:36:10,098][26022] Updated weights on worker 0-0, policy_version 1116094 (0.00085) [2022-07-11 08:36:11,855][26022] Updated weights on worker 0-0, policy_version 1116104 (0.00089) [2022-07-11 08:36:13,519][26022] Updated weights on worker 0-0, policy_version 1116114 (0.00079) [2022-07-11 08:36:13,829][25689] Fps is (10 sec: 5818.9, 60 sec: 5591.2, 300 sec: 5552.4). Total num frames: 1142902784. Throughput: 0: 5790.8. Samples: 1142905124. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:13,829][25689] Avg episode reward: [(0, '1.707')] [2022-07-11 08:36:15,618][26022] Updated weights on worker 0-0, policy_version 1116124 (0.00090) [2022-07-11 08:36:17,149][26022] Updated weights on worker 0-0, policy_version 1116134 (0.00091) [2022-07-11 08:36:18,913][25689] Fps is (10 sec: 5572.3, 60 sec: 5578.0, 300 sec: 5548.1). Total num frames: 1142929408. Throughput: 0: 5770.8. Samples: 1142938606. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:18,914][25689] Avg episode reward: [(0, '1.608')] [2022-07-11 08:36:19,417][26022] Updated weights on worker 0-0, policy_version 1116144 (0.00083) [2022-07-11 08:36:20,751][26022] Updated weights on worker 0-0, policy_version 1116154 (0.00084) [2022-07-11 08:36:23,030][26022] Updated weights on worker 0-0, policy_version 1116164 (0.00109) [2022-07-11 08:36:23,956][25689] Fps is (10 sec: 5561.4, 60 sec: 5592.3, 300 sec: 5554.9). Total num frames: 1142959104. Throughput: 0: 5026.0. Samples: 1142955502. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:23,957][25689] Avg episode reward: [(0, '1.494')] [2022-07-11 08:36:24,398][26022] Updated weights on worker 0-0, policy_version 1116174 (0.00090) [2022-07-11 08:36:26,711][26022] Updated weights on worker 0-0, policy_version 1116184 (0.00086) [2022-07-11 08:36:28,024][26022] Updated weights on worker 0-0, policy_version 1116194 (0.00092) [2022-07-11 08:36:28,971][25689] Fps is (10 sec: 5599.3, 60 sec: 5574.4, 300 sec: 5551.8). Total num frames: 1142985728. Throughput: 0: 5850.6. Samples: 1142989070. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:28,973][25689] Avg episode reward: [(0, '1.487')] [2022-07-11 08:36:30,283][26022] Updated weights on worker 0-0, policy_version 1116204 (0.00100) [2022-07-11 08:36:31,884][26022] Updated weights on worker 0-0, policy_version 1116214 (0.00091) [2022-07-11 08:36:33,847][26022] Updated weights on worker 0-0, policy_version 1116224 (0.00083) [2022-07-11 08:36:33,988][25689] Fps is (10 sec: 5613.9, 60 sec: 5611.5, 300 sec: 5552.6). Total num frames: 1143015424. Throughput: 0: 5819.9. Samples: 1143022448. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 08:36:33,989][25689] Avg episode reward: [(0, '1.856')] [2022-07-11 08:36:36,011][26022] Updated weights on worker 0-0, policy_version 1116234 (0.00083) [2022-07-11 08:36:37,313][26022] Updated weights on worker 0-0, policy_version 1116244 (0.00104) [2022-07-11 08:36:39,116][25689] Fps is (10 sec: 5450.8, 60 sec: 5538.7, 300 sec: 5547.0). Total num frames: 1143041024. Throughput: 0: 4983.5. Samples: 1143039286. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:36:39,116][25689] Avg episode reward: [(0, '1.816')] [2022-07-11 08:36:39,413][26022] Updated weights on worker 0-0, policy_version 1116254 (0.00087) [2022-07-11 08:36:41,016][26022] Updated weights on worker 0-0, policy_version 1116264 (0.00091) [2022-07-11 08:36:42,829][26022] Updated weights on worker 0-0, policy_version 1116274 (0.00090) [2022-07-11 08:36:44,135][25689] Fps is (10 sec: 5449.4, 60 sec: 5557.4, 300 sec: 5550.2). Total num frames: 1143070720. Throughput: 0: 5823.2. Samples: 1143073010. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:36:44,136][25689] Avg episode reward: [(0, '0.558')] [2022-07-11 08:36:44,979][26022] Updated weights on worker 0-0, policy_version 1116284 (0.00082) [2022-07-11 08:36:46,539][26022] Updated weights on worker 0-0, policy_version 1116294 (0.00089) [2022-07-11 08:36:48,356][26022] Updated weights on worker 0-0, policy_version 1116304 (0.00087) [2022-07-11 08:36:49,178][25689] Fps is (10 sec: 5801.0, 60 sec: 5589.0, 300 sec: 5553.0). Total num frames: 1143099392. Throughput: 0: 5831.7. Samples: 1143106906. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:36:49,178][25689] Avg episode reward: [(0, '0.609')] [2022-07-11 08:36:50,322][26022] Updated weights on worker 0-0, policy_version 1116314 (0.00091) [2022-07-11 08:36:51,999][26022] Updated weights on worker 0-0, policy_version 1116324 (0.00086) [2022-07-11 08:36:54,115][26022] Updated weights on worker 0-0, policy_version 1116334 (0.00062) [2022-07-11 08:36:54,184][25689] Fps is (10 sec: 5604.7, 60 sec: 5540.0, 300 sec: 5554.5). Total num frames: 1143127040. Throughput: 0: 5017.2. Samples: 1143123776. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:36:54,185][25689] Avg episode reward: [(0, '0.581')] [2022-07-11 08:36:55,793][26022] Updated weights on worker 0-0, policy_version 1116344 (0.00087) [2022-07-11 08:36:57,509][26022] Updated weights on worker 0-0, policy_version 1116354 (0.00083) [2022-07-11 08:36:59,274][25689] Fps is (10 sec: 5476.6, 60 sec: 5552.5, 300 sec: 5550.1). Total num frames: 1143154688. Throughput: 0: 5853.9. Samples: 1143157292. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:36:59,275][25689] Avg episode reward: [(0, '0.376')] [2022-07-11 08:36:59,424][26022] Updated weights on worker 0-0, policy_version 1116364 (0.00086) [2022-07-11 08:37:01,315][26022] Updated weights on worker 0-0, policy_version 1116374 (0.00088) [2022-07-11 08:37:03,360][26022] Updated weights on worker 0-0, policy_version 1116384 (0.00088) [2022-07-11 08:37:04,277][25689] Fps is (10 sec: 5377.4, 60 sec: 5570.5, 300 sec: 5550.4). Total num frames: 1143181312. Throughput: 0: 5753.5. Samples: 1143188894. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:04,277][25689] Avg episode reward: [(0, '-1.223')] [2022-07-11 08:37:05,388][26022] Updated weights on worker 0-0, policy_version 1116394 (0.00089) [2022-07-11 08:37:07,123][26022] Updated weights on worker 0-0, policy_version 1116404 (0.00086) [2022-07-11 08:37:08,982][26022] Updated weights on worker 0-0, policy_version 1116414 (0.00089) [2022-07-11 08:37:09,367][25689] Fps is (10 sec: 5377.6, 60 sec: 5546.0, 300 sec: 5552.6). Total num frames: 1143208960. Throughput: 0: 4901.1. Samples: 1143205854. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:09,367][25689] Avg episode reward: [(0, '0.228')] [2022-07-11 08:37:10,823][26022] Updated weights on worker 0-0, policy_version 1116424 (0.00082) [2022-07-11 08:37:12,579][26022] Updated weights on worker 0-0, policy_version 1116434 (0.00098) [2022-07-11 08:37:14,440][25689] Fps is (10 sec: 5541.7, 60 sec: 5524.6, 300 sec: 5549.6). Total num frames: 1143237632. Throughput: 0: 5722.8. Samples: 1143239694. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:14,440][25689] Avg episode reward: [(0, '-0.742')] [2022-07-11 08:37:14,471][26022] Updated weights on worker 0-0, policy_version 1116444 (0.00088) [2022-07-11 08:37:16,166][26022] Updated weights on worker 0-0, policy_version 1116454 (0.00088) [2022-07-11 08:37:18,176][26022] Updated weights on worker 0-0, policy_version 1116464 (0.00084) [2022-07-11 08:37:19,558][25689] Fps is (10 sec: 5726.9, 60 sec: 5572.1, 300 sec: 5554.6). Total num frames: 1143267328. Throughput: 0: 5720.5. Samples: 1143273328. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:19,560][25689] Avg episode reward: [(0, '-1.389')] [2022-07-11 08:37:19,837][26022] Updated weights on worker 0-0, policy_version 1116474 (0.00092) [2022-07-11 08:37:21,816][26022] Updated weights on worker 0-0, policy_version 1116484 (0.00091) [2022-07-11 08:37:23,503][26022] Updated weights on worker 0-0, policy_version 1116494 (0.00090) [2022-07-11 08:37:24,605][25689] Fps is (10 sec: 5640.9, 60 sec: 5537.9, 300 sec: 5553.9). Total num frames: 1143294976. Throughput: 0: 4979.5. Samples: 1143290122. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:24,607][25689] Avg episode reward: [(0, '-1.450')] [2022-07-11 08:37:25,463][26022] Updated weights on worker 0-0, policy_version 1116504 (0.00095) [2022-07-11 08:37:27,063][26022] Updated weights on worker 0-0, policy_version 1116514 (0.00112) [2022-07-11 08:37:29,235][26022] Updated weights on worker 0-0, policy_version 1116524 (0.00091) [2022-07-11 08:37:29,624][25689] Fps is (10 sec: 5493.7, 60 sec: 5554.6, 300 sec: 5553.6). Total num frames: 1143322624. Throughput: 0: 5813.1. Samples: 1143323610. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:29,624][25689] Avg episode reward: [(0, '-1.162')] [2022-07-11 08:37:30,744][26022] Updated weights on worker 0-0, policy_version 1116534 (0.00092) [2022-07-11 08:37:33,155][26022] Updated weights on worker 0-0, policy_version 1116544 (0.00083) [2022-07-11 08:37:34,420][26022] Updated weights on worker 0-0, policy_version 1116554 (0.00090) [2022-07-11 08:37:34,639][25689] Fps is (10 sec: 5612.9, 60 sec: 5537.7, 300 sec: 5557.8). Total num frames: 1143351296. Throughput: 0: 5802.8. Samples: 1143356910. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:34,640][25689] Avg episode reward: [(0, '0.273')] [2022-07-11 08:37:36,633][26022] Updated weights on worker 0-0, policy_version 1116564 (0.00084) [2022-07-11 08:37:38,058][26022] Updated weights on worker 0-0, policy_version 1116574 (0.00093) [2022-07-11 08:37:39,707][25689] Fps is (10 sec: 5484.0, 60 sec: 5560.2, 300 sec: 5549.7). Total num frames: 1143377920. Throughput: 0: 4982.7. Samples: 1143373722. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:39,707][25689] Avg episode reward: [(0, '1.128')] [2022-07-11 08:37:40,182][26022] Updated weights on worker 0-0, policy_version 1116584 (0.00105) [2022-07-11 08:37:42,080][26022] Updated weights on worker 0-0, policy_version 1116594 (0.00089) [2022-07-11 08:37:43,576][26022] Updated weights on worker 0-0, policy_version 1116604 (0.00084) [2022-07-11 08:37:44,730][25689] Fps is (10 sec: 5480.2, 60 sec: 5543.0, 300 sec: 5545.9). Total num frames: 1143406592. Throughput: 0: 5818.1. Samples: 1143407206. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:44,731][25689] Avg episode reward: [(0, '-0.011')] [2022-07-11 08:37:45,714][26022] Updated weights on worker 0-0, policy_version 1116614 (0.00090) [2022-07-11 08:37:47,616][26022] Updated weights on worker 0-0, policy_version 1116624 (0.00083) [2022-07-11 08:37:49,367][26022] Updated weights on worker 0-0, policy_version 1116634 (0.00084) [2022-07-11 08:37:49,747][25689] Fps is (10 sec: 5813.7, 60 sec: 5562.2, 300 sec: 5556.3). Total num frames: 1143436288. Throughput: 0: 5821.9. Samples: 1143440764. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:49,747][25689] Avg episode reward: [(0, '-0.474')] [2022-07-11 08:37:50,647][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:37:50,656][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001116641_1143440384.pth [2022-07-11 08:37:50,657][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001114686_1141438464.pth [2022-07-11 08:37:51,168][26022] Updated weights on worker 0-0, policy_version 1116644 (0.00089) [2022-07-11 08:37:53,102][26022] Updated weights on worker 0-0, policy_version 1116654 (0.00083) [2022-07-11 08:37:54,753][25689] Fps is (10 sec: 5618.7, 60 sec: 5545.3, 300 sec: 5550.9). Total num frames: 1143462912. Throughput: 0: 5003.1. Samples: 1143457542. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:54,759][25689] Avg episode reward: [(0, '-0.077')] [2022-07-11 08:37:54,879][26022] Updated weights on worker 0-0, policy_version 1116664 (0.00084) [2022-07-11 08:37:56,715][26022] Updated weights on worker 0-0, policy_version 1116674 (0.00087) [2022-07-11 08:37:58,411][26022] Updated weights on worker 0-0, policy_version 1116684 (0.00092) [2022-07-11 08:37:59,806][25689] Fps is (10 sec: 5598.8, 60 sec: 5582.5, 300 sec: 5561.0). Total num frames: 1143492608. Throughput: 0: 5840.9. Samples: 1143491118. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:37:59,806][25689] Avg episode reward: [(0, '-0.398')] [2022-07-11 08:38:00,296][26022] Updated weights on worker 0-0, policy_version 1116694 (0.00091) [2022-07-11 08:38:02,751][26022] Updated weights on worker 0-0, policy_version 1116704 (0.00087) [2022-07-11 08:38:04,546][26022] Updated weights on worker 0-0, policy_version 1116714 (0.00083) [2022-07-11 08:38:04,813][25689] Fps is (10 sec: 5293.3, 60 sec: 5531.4, 300 sec: 5549.0). Total num frames: 1143516160. Throughput: 0: 5732.3. Samples: 1143522328. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:04,814][25689] Avg episode reward: [(0, '-0.407')] [2022-07-11 08:38:06,361][26022] Updated weights on worker 0-0, policy_version 1116724 (0.00092) [2022-07-11 08:38:08,196][26022] Updated weights on worker 0-0, policy_version 1116734 (0.00090) [2022-07-11 08:38:09,839][25689] Fps is (10 sec: 5205.3, 60 sec: 5554.2, 300 sec: 5556.9). Total num frames: 1143544832. Throughput: 0: 4875.1. Samples: 1143538716. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:09,840][25689] Avg episode reward: [(0, '-0.402')] [2022-07-11 08:38:09,916][26022] Updated weights on worker 0-0, policy_version 1116744 (0.00088) [2022-07-11 08:38:11,834][26022] Updated weights on worker 0-0, policy_version 1116754 (0.00087) [2022-07-11 08:38:13,535][26022] Updated weights on worker 0-0, policy_version 1116764 (0.00086) [2022-07-11 08:38:14,853][25689] Fps is (10 sec: 5711.7, 60 sec: 5559.6, 300 sec: 5554.0). Total num frames: 1143573504. Throughput: 0: 5727.1. Samples: 1143572652. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:14,853][25689] Avg episode reward: [(0, '1.563')] [2022-07-11 08:38:15,450][26022] Updated weights on worker 0-0, policy_version 1116774 (0.00557) [2022-07-11 08:38:17,266][26022] Updated weights on worker 0-0, policy_version 1116784 (0.00088) [2022-07-11 08:38:19,001][26022] Updated weights on worker 0-0, policy_version 1116794 (0.00088) [2022-07-11 08:38:19,990][25689] Fps is (10 sec: 5548.1, 60 sec: 5524.0, 300 sec: 5551.9). Total num frames: 1143601152. Throughput: 0: 5712.9. Samples: 1143606428. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:19,990][25689] Avg episode reward: [(0, '1.360')] [2022-07-11 08:38:20,929][26022] Updated weights on worker 0-0, policy_version 1116804 (0.00090) [2022-07-11 08:38:22,619][26022] Updated weights on worker 0-0, policy_version 1116814 (0.00088) [2022-07-11 08:38:24,462][26022] Updated weights on worker 0-0, policy_version 1116824 (0.00087) [2022-07-11 08:38:25,013][25689] Fps is (10 sec: 5644.0, 60 sec: 5560.1, 300 sec: 5559.1). Total num frames: 1143630848. Throughput: 0: 4991.2. Samples: 1143623150. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:25,015][25689] Avg episode reward: [(0, '1.363')] [2022-07-11 08:38:26,499][26022] Updated weights on worker 0-0, policy_version 1116834 (0.00096) [2022-07-11 08:38:28,309][26022] Updated weights on worker 0-0, policy_version 1116844 (0.00094) [2022-07-11 08:38:30,031][25689] Fps is (10 sec: 5507.2, 60 sec: 5526.3, 300 sec: 5548.6). Total num frames: 1143656448. Throughput: 0: 5826.2. Samples: 1143656358. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:30,032][25689] Avg episode reward: [(0, '1.884')] [2022-07-11 08:38:30,219][26022] Updated weights on worker 0-0, policy_version 1116854 (0.00086) [2022-07-11 08:38:31,816][26022] Updated weights on worker 0-0, policy_version 1116864 (0.00093) [2022-07-11 08:38:33,815][26022] Updated weights on worker 0-0, policy_version 1116874 (0.00084) [2022-07-11 08:38:35,094][25689] Fps is (10 sec: 5485.3, 60 sec: 5538.9, 300 sec: 5555.4). Total num frames: 1143686144. Throughput: 0: 5795.4. Samples: 1143689956. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:35,095][25689] Avg episode reward: [(0, '1.626')] [2022-07-11 08:38:35,704][26022] Updated weights on worker 0-0, policy_version 1116884 (0.00428) [2022-07-11 08:38:37,683][26022] Updated weights on worker 0-0, policy_version 1116894 (0.00089) [2022-07-11 08:38:39,237][26022] Updated weights on worker 0-0, policy_version 1116904 (0.00088) [2022-07-11 08:38:40,163][25689] Fps is (10 sec: 5760.7, 60 sec: 5572.6, 300 sec: 5558.7). Total num frames: 1143714816. Throughput: 0: 5797.0. Samples: 1143723370. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:40,165][25689] Avg episode reward: [(0, '1.494')] [2022-07-11 08:38:41,291][26022] Updated weights on worker 0-0, policy_version 1116914 (0.00379) [2022-07-11 08:38:42,799][26022] Updated weights on worker 0-0, policy_version 1116924 (0.00088) [2022-07-11 08:38:44,813][26022] Updated weights on worker 0-0, policy_version 1116934 (0.00086) [2022-07-11 08:38:45,249][25689] Fps is (10 sec: 5545.8, 60 sec: 5549.8, 300 sec: 5554.0). Total num frames: 1143742464. Throughput: 0: 5781.0. Samples: 1143740136. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:45,251][25689] Avg episode reward: [(0, '1.495')] [2022-07-11 08:38:46,655][26022] Updated weights on worker 0-0, policy_version 1116944 (0.00082) [2022-07-11 08:38:48,476][26022] Updated weights on worker 0-0, policy_version 1116954 (0.00084) [2022-07-11 08:38:50,146][26022] Updated weights on worker 0-0, policy_version 1116964 (0.00089) [2022-07-11 08:38:50,255][25689] Fps is (10 sec: 5580.5, 60 sec: 5533.9, 300 sec: 5555.0). Total num frames: 1143771136. Throughput: 0: 5807.6. Samples: 1143773814. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:50,257][25689] Avg episode reward: [(0, '1.399')] [2022-07-11 08:38:52,244][26022] Updated weights on worker 0-0, policy_version 1116974 (0.00087) [2022-07-11 08:38:54,043][26022] Updated weights on worker 0-0, policy_version 1116984 (0.00088) [2022-07-11 08:38:55,279][25689] Fps is (10 sec: 5513.3, 60 sec: 5532.4, 300 sec: 5555.6). Total num frames: 1143797760. Throughput: 0: 5803.2. Samples: 1143807094. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:38:55,280][25689] Avg episode reward: [(0, '1.061')] [2022-07-11 08:38:55,944][26022] Updated weights on worker 0-0, policy_version 1116994 (0.00086) [2022-07-11 08:38:57,694][26022] Updated weights on worker 0-0, policy_version 1117004 (0.00100) [2022-07-11 08:38:59,681][26022] Updated weights on worker 0-0, policy_version 1117014 (0.00091) [2022-07-11 08:39:00,340][25689] Fps is (10 sec: 5482.9, 60 sec: 5514.6, 300 sec: 5558.1). Total num frames: 1143826432. Throughput: 0: 4973.7. Samples: 1143823728. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:00,341][25689] Avg episode reward: [(0, '1.355')] [2022-07-11 08:39:01,384][26022] Updated weights on worker 0-0, policy_version 1117024 (0.00090) [2022-07-11 08:39:03,635][26022] Updated weights on worker 0-0, policy_version 1117034 (0.00083) [2022-07-11 08:39:05,263][26022] Updated weights on worker 0-0, policy_version 1117044 (0.00092) [2022-07-11 08:39:05,358][25689] Fps is (10 sec: 5485.9, 60 sec: 5564.4, 300 sec: 5554.5). Total num frames: 1143853056. Throughput: 0: 5721.0. Samples: 1143855180. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:05,359][25689] Avg episode reward: [(0, '1.329')] [2022-07-11 08:39:07,240][26022] Updated weights on worker 0-0, policy_version 1117054 (0.00087) [2022-07-11 08:39:09,038][26022] Updated weights on worker 0-0, policy_version 1117064 (0.00087) [2022-07-11 08:39:10,427][25689] Fps is (10 sec: 5380.8, 60 sec: 5543.6, 300 sec: 5553.4). Total num frames: 1143880704. Throughput: 0: 5706.2. Samples: 1143888916. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:10,427][25689] Avg episode reward: [(0, '1.367')] [2022-07-11 08:39:10,913][26022] Updated weights on worker 0-0, policy_version 1117074 (0.00093) [2022-07-11 08:39:12,724][26022] Updated weights on worker 0-0, policy_version 1117084 (0.00093) [2022-07-11 08:39:14,440][26022] Updated weights on worker 0-0, policy_version 1117094 (0.00094) [2022-07-11 08:39:15,435][25689] Fps is (10 sec: 5487.3, 60 sec: 5527.1, 300 sec: 5548.7). Total num frames: 1143908352. Throughput: 0: 4895.3. Samples: 1143905768. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:15,436][25689] Avg episode reward: [(0, '1.211')] [2022-07-11 08:39:16,432][26022] Updated weights on worker 0-0, policy_version 1117104 (0.00090) [2022-07-11 08:39:18,255][26022] Updated weights on worker 0-0, policy_version 1117114 (0.00088) [2022-07-11 08:39:20,054][26022] Updated weights on worker 0-0, policy_version 1117124 (0.00085) [2022-07-11 08:39:20,505][25689] Fps is (10 sec: 5791.3, 60 sec: 5584.0, 300 sec: 5561.5). Total num frames: 1143939072. Throughput: 0: 5730.4. Samples: 1143939280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:20,506][25689] Avg episode reward: [(0, '1.211')] [2022-07-11 08:39:21,885][26022] Updated weights on worker 0-0, policy_version 1117134 (0.00083) [2022-07-11 08:39:23,535][26022] Updated weights on worker 0-0, policy_version 1117144 (0.00082) [2022-07-11 08:39:25,510][25689] Fps is (10 sec: 5590.2, 60 sec: 5518.0, 300 sec: 5554.8). Total num frames: 1143964672. Throughput: 0: 5827.5. Samples: 1143972614. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:25,511][25689] Avg episode reward: [(0, '0.052')] [2022-07-11 08:39:25,843][26022] Updated weights on worker 0-0, policy_version 1117154 (0.00551) [2022-07-11 08:39:27,299][26022] Updated weights on worker 0-0, policy_version 1117164 (0.00094) [2022-07-11 08:39:29,311][26022] Updated weights on worker 0-0, policy_version 1117174 (0.00085) [2022-07-11 08:39:30,519][25689] Fps is (10 sec: 5419.7, 60 sec: 5569.6, 300 sec: 5554.7). Total num frames: 1143993344. Throughput: 0: 5005.9. Samples: 1143989496. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:30,520][25689] Avg episode reward: [(0, '-0.276')] [2022-07-11 08:39:31,160][26022] Updated weights on worker 0-0, policy_version 1117184 (0.00093) [2022-07-11 08:39:32,923][26022] Updated weights on worker 0-0, policy_version 1117194 (0.00090) [2022-07-11 08:39:34,800][26022] Updated weights on worker 0-0, policy_version 1117204 (0.00094) [2022-07-11 08:39:35,533][25689] Fps is (10 sec: 5618.8, 60 sec: 5540.2, 300 sec: 5556.7). Total num frames: 1144020992. Throughput: 0: 5822.7. Samples: 1144022794. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:35,535][25689] Avg episode reward: [(0, '-2.010')] [2022-07-11 08:39:36,433][26022] Updated weights on worker 0-0, policy_version 1117214 (0.00085) [2022-07-11 08:39:38,621][26022] Updated weights on worker 0-0, policy_version 1117224 (0.00087) [2022-07-11 08:39:40,241][26022] Updated weights on worker 0-0, policy_version 1117234 (0.00087) [2022-07-11 08:39:40,678][25689] Fps is (10 sec: 5442.8, 60 sec: 5516.3, 300 sec: 5551.3). Total num frames: 1144048640. Throughput: 0: 5796.4. Samples: 1144056212. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:40,679][25689] Avg episode reward: [(0, '-2.100')] [2022-07-11 08:39:42,160][26022] Updated weights on worker 0-0, policy_version 1117244 (0.00089) [2022-07-11 08:39:43,969][26022] Updated weights on worker 0-0, policy_version 1117254 (0.00095) [2022-07-11 08:39:45,750][25689] Fps is (10 sec: 5512.5, 60 sec: 5534.6, 300 sec: 5554.0). Total num frames: 1144077312. Throughput: 0: 4953.4. Samples: 1144072872. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:45,753][25689] Avg episode reward: [(0, '-2.281')] [2022-07-11 08:39:45,772][26022] Updated weights on worker 0-0, policy_version 1117264 (0.00085) [2022-07-11 08:39:47,683][26022] Updated weights on worker 0-0, policy_version 1117274 (0.00088) [2022-07-11 08:39:49,433][26022] Updated weights on worker 0-0, policy_version 1117284 (0.00087) [2022-07-11 08:39:50,794][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:39:50,808][25689] Fps is (10 sec: 5560.0, 60 sec: 5512.9, 300 sec: 5550.3). Total num frames: 1144104960. Throughput: 0: 5765.9. Samples: 1144106480. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:50,808][25689] Avg episode reward: [(0, '-2.414')] [2022-07-11 08:39:50,811][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001117291_1144105984.pth [2022-07-11 08:39:50,811][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001115336_1142104064.pth [2022-07-11 08:39:51,188][26022] Updated weights on worker 0-0, policy_version 1117294 (0.00090) [2022-07-11 08:39:53,058][26022] Updated weights on worker 0-0, policy_version 1117304 (0.00092) [2022-07-11 08:39:54,925][26022] Updated weights on worker 0-0, policy_version 1117314 (0.00081) [2022-07-11 08:39:55,867][25689] Fps is (10 sec: 5567.1, 60 sec: 5543.5, 300 sec: 5550.7). Total num frames: 1144133632. Throughput: 0: 5800.6. Samples: 1144140740. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:39:55,867][25689] Avg episode reward: [(0, '-1.568')] [2022-07-11 08:39:56,687][26022] Updated weights on worker 0-0, policy_version 1117324 (0.00081) [2022-07-11 08:39:58,528][26022] Updated weights on worker 0-0, policy_version 1117334 (0.00095) [2022-07-11 08:40:00,270][26022] Updated weights on worker 0-0, policy_version 1117344 (0.00102) [2022-07-11 08:40:00,923][25689] Fps is (10 sec: 5770.3, 60 sec: 5560.9, 300 sec: 5564.1). Total num frames: 1144163328. Throughput: 0: 5009.4. Samples: 1144157626. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:00,924][25689] Avg episode reward: [(0, '-1.257')] [2022-07-11 08:40:02,599][26022] Updated weights on worker 0-0, policy_version 1117354 (0.00091) [2022-07-11 08:40:04,358][26022] Updated weights on worker 0-0, policy_version 1117364 (0.00088) [2022-07-11 08:40:05,937][25689] Fps is (10 sec: 5491.2, 60 sec: 5544.4, 300 sec: 5554.0). Total num frames: 1144188928. Throughput: 0: 5740.3. Samples: 1144188750. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:05,937][25689] Avg episode reward: [(0, '-0.232')] [2022-07-11 08:40:06,328][26022] Updated weights on worker 0-0, policy_version 1117374 (0.00081) [2022-07-11 08:40:07,901][26022] Updated weights on worker 0-0, policy_version 1117384 (0.00089) [2022-07-11 08:40:10,055][26022] Updated weights on worker 0-0, policy_version 1117394 (0.00088) [2022-07-11 08:40:10,944][25689] Fps is (10 sec: 5211.3, 60 sec: 5533.0, 300 sec: 5547.5). Total num frames: 1144215552. Throughput: 0: 5759.4. Samples: 1144222454. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:10,945][25689] Avg episode reward: [(0, '0.457')] [2022-07-11 08:40:11,683][26022] Updated weights on worker 0-0, policy_version 1117404 (0.00084) [2022-07-11 08:40:13,637][26022] Updated weights on worker 0-0, policy_version 1117414 (0.00106) [2022-07-11 08:40:15,312][26022] Updated weights on worker 0-0, policy_version 1117424 (0.00090) [2022-07-11 08:40:15,965][25689] Fps is (10 sec: 5514.2, 60 sec: 5548.9, 300 sec: 5552.9). Total num frames: 1144244224. Throughput: 0: 4900.1. Samples: 1144239222. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:15,965][25689] Avg episode reward: [(0, '0.787')] [2022-07-11 08:40:17,556][26022] Updated weights on worker 0-0, policy_version 1117434 (0.00088) [2022-07-11 08:40:19,139][26022] Updated weights on worker 0-0, policy_version 1117444 (0.00087) [2022-07-11 08:40:20,967][26022] Updated weights on worker 0-0, policy_version 1117454 (0.00092) [2022-07-11 08:40:21,024][25689] Fps is (10 sec: 5688.8, 60 sec: 5516.0, 300 sec: 5552.1). Total num frames: 1144272896. Throughput: 0: 5735.8. Samples: 1144272924. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:21,025][25689] Avg episode reward: [(0, '1.465')] [2022-07-11 08:40:22,620][26022] Updated weights on worker 0-0, policy_version 1117464 (0.00090) [2022-07-11 08:40:24,464][26022] Updated weights on worker 0-0, policy_version 1117474 (0.00093) [2022-07-11 08:40:26,031][25689] Fps is (10 sec: 5594.9, 60 sec: 5549.7, 300 sec: 5552.0). Total num frames: 1144300544. Throughput: 0: 5868.1. Samples: 1144306666. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:26,032][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 08:40:26,443][26022] Updated weights on worker 0-0, policy_version 1117484 (0.00089) [2022-07-11 08:40:28,107][26022] Updated weights on worker 0-0, policy_version 1117494 (0.00088) [2022-07-11 08:40:30,225][26022] Updated weights on worker 0-0, policy_version 1117504 (0.00086) [2022-07-11 08:40:31,057][25689] Fps is (10 sec: 5613.7, 60 sec: 5548.1, 300 sec: 5556.0). Total num frames: 1144329216. Throughput: 0: 5022.5. Samples: 1144323470. Policy #0 lag: (min: 0.0, avg: 11.4, max: 26.0) [2022-07-11 08:40:31,058][25689] Avg episode reward: [(0, '0.642')] [2022-07-11 08:40:31,683][26022] Updated weights on worker 0-0, policy_version 1117514 (0.00090) [2022-07-11 08:40:33,819][26022] Updated weights on worker 0-0, policy_version 1117524 (0.00087) [2022-07-11 08:40:35,606][26022] Updated weights on worker 0-0, policy_version 1117534 (0.00084) [2022-07-11 08:40:36,079][25689] Fps is (10 sec: 5605.4, 60 sec: 5547.5, 300 sec: 5550.0). Total num frames: 1144356864. Throughput: 0: 5857.3. Samples: 1144357034. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:40:36,079][25689] Avg episode reward: [(0, '0.813')] [2022-07-11 08:40:37,512][26022] Updated weights on worker 0-0, policy_version 1117544 (0.00094) [2022-07-11 08:40:39,032][26022] Updated weights on worker 0-0, policy_version 1117554 (0.00095) [2022-07-11 08:40:41,111][26022] Updated weights on worker 0-0, policy_version 1117564 (0.00085) [2022-07-11 08:40:41,198][25689] Fps is (10 sec: 5654.5, 60 sec: 5583.6, 300 sec: 5552.0). Total num frames: 1144386560. Throughput: 0: 5859.1. Samples: 1144391124. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:40:41,199][25689] Avg episode reward: [(0, '0.015')] [2022-07-11 08:40:42,663][26022] Updated weights on worker 0-0, policy_version 1117574 (0.00094) [2022-07-11 08:40:44,687][26022] Updated weights on worker 0-0, policy_version 1117584 (0.00086) [2022-07-11 08:40:46,234][25689] Fps is (10 sec: 5747.7, 60 sec: 5587.0, 300 sec: 5558.5). Total num frames: 1144415232. Throughput: 0: 5019.3. Samples: 1144408068. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:40:46,234][25689] Avg episode reward: [(0, '-0.086')] [2022-07-11 08:40:46,381][26022] Updated weights on worker 0-0, policy_version 1117594 (0.00082) [2022-07-11 08:40:48,239][26022] Updated weights on worker 0-0, policy_version 1117604 (0.00087) [2022-07-11 08:40:49,994][26022] Updated weights on worker 0-0, policy_version 1117614 (0.00085) [2022-07-11 08:40:51,239][25689] Fps is (10 sec: 5507.3, 60 sec: 5574.9, 300 sec: 5545.2). Total num frames: 1144441856. Throughput: 0: 5869.1. Samples: 1144441918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:40:51,239][25689] Avg episode reward: [(0, '-0.699')] [2022-07-11 08:40:51,813][26022] Updated weights on worker 0-0, policy_version 1117624 (0.00085) [2022-07-11 08:40:53,719][26022] Updated weights on worker 0-0, policy_version 1117634 (0.00084) [2022-07-11 08:40:55,559][26022] Updated weights on worker 0-0, policy_version 1117644 (0.00091) [2022-07-11 08:40:56,268][25689] Fps is (10 sec: 5612.7, 60 sec: 5594.6, 300 sec: 5555.7). Total num frames: 1144471552. Throughput: 0: 5881.6. Samples: 1144475780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:40:56,268][25689] Avg episode reward: [(0, '0.247')] [2022-07-11 08:40:57,406][26022] Updated weights on worker 0-0, policy_version 1117654 (0.00089) [2022-07-11 08:40:59,278][26022] Updated weights on worker 0-0, policy_version 1117664 (0.00087) [2022-07-11 08:41:00,869][26022] Updated weights on worker 0-0, policy_version 1117674 (0.00085) [2022-07-11 08:41:01,349][25689] Fps is (10 sec: 5773.3, 60 sec: 5575.4, 300 sec: 5564.8). Total num frames: 1144500224. Throughput: 0: 5888.4. Samples: 1144509778. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:01,350][25689] Avg episode reward: [(0, '-0.719')] [2022-07-11 08:41:03,309][26022] Updated weights on worker 0-0, policy_version 1117684 (0.00085) [2022-07-11 08:41:05,084][26022] Updated weights on worker 0-0, policy_version 1117694 (0.00079) [2022-07-11 08:41:06,368][25689] Fps is (10 sec: 5272.2, 60 sec: 5557.9, 300 sec: 5550.8). Total num frames: 1144524800. Throughput: 0: 5774.0. Samples: 1144524322. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:06,368][25689] Avg episode reward: [(0, '0.401')] [2022-07-11 08:41:06,987][26022] Updated weights on worker 0-0, policy_version 1117704 (0.00090) [2022-07-11 08:41:08,525][26022] Updated weights on worker 0-0, policy_version 1117714 (0.00318) [2022-07-11 08:41:10,510][26022] Updated weights on worker 0-0, policy_version 1117724 (0.00088) [2022-07-11 08:41:11,385][25689] Fps is (10 sec: 5407.5, 60 sec: 5607.9, 300 sec: 5550.9). Total num frames: 1144554496. Throughput: 0: 5773.9. Samples: 1144558242. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:11,386][25689] Avg episode reward: [(0, '0.247')] [2022-07-11 08:41:12,318][26022] Updated weights on worker 0-0, policy_version 1117734 (0.00092) [2022-07-11 08:41:14,093][26022] Updated weights on worker 0-0, policy_version 1117744 (0.00085) [2022-07-11 08:41:15,913][26022] Updated weights on worker 0-0, policy_version 1117754 (0.00091) [2022-07-11 08:41:16,411][25689] Fps is (10 sec: 5709.7, 60 sec: 5590.4, 300 sec: 5555.5). Total num frames: 1144582144. Throughput: 0: 5772.5. Samples: 1144592056. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:16,412][25689] Avg episode reward: [(0, '0.672')] [2022-07-11 08:41:17,699][26022] Updated weights on worker 0-0, policy_version 1117764 (0.00084) [2022-07-11 08:41:19,612][26022] Updated weights on worker 0-0, policy_version 1117774 (0.00089) [2022-07-11 08:41:21,521][25689] Fps is (10 sec: 5455.3, 60 sec: 5568.8, 300 sec: 5547.3). Total num frames: 1144609792. Throughput: 0: 4904.1. Samples: 1144608706. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:21,522][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 08:41:21,545][26022] Updated weights on worker 0-0, policy_version 1117784 (0.00088) [2022-07-11 08:41:23,186][26022] Updated weights on worker 0-0, policy_version 1117794 (0.00081) [2022-07-11 08:41:25,211][26022] Updated weights on worker 0-0, policy_version 1117804 (0.00111) [2022-07-11 08:41:26,543][25689] Fps is (10 sec: 5659.8, 60 sec: 5601.3, 300 sec: 5557.5). Total num frames: 1144639488. Throughput: 0: 5855.2. Samples: 1144642450. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:26,544][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 08:41:26,806][26022] Updated weights on worker 0-0, policy_version 1117814 (0.00090) [2022-07-11 08:41:28,859][26022] Updated weights on worker 0-0, policy_version 1117824 (0.00068) [2022-07-11 08:41:30,544][26022] Updated weights on worker 0-0, policy_version 1117834 (0.00093) [2022-07-11 08:41:31,551][25689] Fps is (10 sec: 5615.4, 60 sec: 5569.1, 300 sec: 5547.4). Total num frames: 1144666112. Throughput: 0: 5847.2. Samples: 1144676154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:31,551][25689] Avg episode reward: [(0, '0.888')] [2022-07-11 08:41:32,259][26022] Updated weights on worker 0-0, policy_version 1117844 (0.00092) [2022-07-11 08:41:34,448][26022] Updated weights on worker 0-0, policy_version 1117854 (0.00087) [2022-07-11 08:41:35,855][26022] Updated weights on worker 0-0, policy_version 1117864 (0.00097) [2022-07-11 08:41:36,567][25689] Fps is (10 sec: 5516.2, 60 sec: 5586.5, 300 sec: 5559.8). Total num frames: 1144694784. Throughput: 0: 5006.6. Samples: 1144692970. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:36,569][25689] Avg episode reward: [(0, '-0.358')] [2022-07-11 08:41:38,063][26022] Updated weights on worker 0-0, policy_version 1117874 (0.00080) [2022-07-11 08:41:39,697][26022] Updated weights on worker 0-0, policy_version 1117884 (0.00085) [2022-07-11 08:41:41,530][26022] Updated weights on worker 0-0, policy_version 1117894 (0.00092) [2022-07-11 08:41:41,694][25689] Fps is (10 sec: 5754.4, 60 sec: 5585.9, 300 sec: 5557.8). Total num frames: 1144724480. Throughput: 0: 5846.8. Samples: 1144726652. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:41,696][25689] Avg episode reward: [(0, '-0.283')] [2022-07-11 08:41:43,666][26022] Updated weights on worker 0-0, policy_version 1117904 (0.00104) [2022-07-11 08:41:45,131][26022] Updated weights on worker 0-0, policy_version 1117914 (0.00090) [2022-07-11 08:41:46,778][25689] Fps is (10 sec: 5615.9, 60 sec: 5564.4, 300 sec: 5553.5). Total num frames: 1144752128. Throughput: 0: 5839.5. Samples: 1144760614. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:46,780][25689] Avg episode reward: [(0, '0.513')] [2022-07-11 08:41:47,077][26022] Updated weights on worker 0-0, policy_version 1117924 (0.00090) [2022-07-11 08:41:48,859][26022] Updated weights on worker 0-0, policy_version 1117934 (0.00087) [2022-07-11 08:41:50,476][26022] Updated weights on worker 0-0, policy_version 1117944 (0.00083) [2022-07-11 08:41:50,827][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:41:50,843][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001117945_1144775680.pth [2022-07-11 08:41:50,843][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001115988_1142771712.pth [2022-07-11 08:41:51,862][25689] Fps is (10 sec: 5538.7, 60 sec: 5591.0, 300 sec: 5555.5). Total num frames: 1144780800. Throughput: 0: 4995.8. Samples: 1144777626. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:51,863][25689] Avg episode reward: [(0, '0.029')] [2022-07-11 08:41:52,494][26022] Updated weights on worker 0-0, policy_version 1117954 (0.00096) [2022-07-11 08:41:54,441][26022] Updated weights on worker 0-0, policy_version 1117964 (0.00092) [2022-07-11 08:41:56,043][26022] Updated weights on worker 0-0, policy_version 1117974 (0.00092) [2022-07-11 08:41:56,864][25689] Fps is (10 sec: 5786.9, 60 sec: 5593.5, 300 sec: 5564.1). Total num frames: 1144810496. Throughput: 0: 5824.6. Samples: 1144811194. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:41:56,865][25689] Avg episode reward: [(0, '-0.381')] [2022-07-11 08:41:58,096][26022] Updated weights on worker 0-0, policy_version 1117984 (0.00081) [2022-07-11 08:41:59,495][26022] Updated weights on worker 0-0, policy_version 1117994 (0.00084) [2022-07-11 08:42:01,783][26022] Updated weights on worker 0-0, policy_version 1118004 (0.00095) [2022-07-11 08:42:01,939][25689] Fps is (10 sec: 5589.4, 60 sec: 5560.3, 300 sec: 5562.7). Total num frames: 1144837120. Throughput: 0: 5835.8. Samples: 1144844796. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:01,939][25689] Avg episode reward: [(0, '0.587')] [2022-07-11 08:42:03,823][26022] Updated weights on worker 0-0, policy_version 1118014 (0.00092) [2022-07-11 08:42:05,651][26022] Updated weights on worker 0-0, policy_version 1118024 (0.00087) [2022-07-11 08:42:07,009][25689] Fps is (10 sec: 5148.0, 60 sec: 5572.5, 300 sec: 5556.2). Total num frames: 1144862720. Throughput: 0: 4888.1. Samples: 1144859504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:07,009][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 08:42:07,537][26022] Updated weights on worker 0-0, policy_version 1118034 (0.00082) [2022-07-11 08:42:09,369][26022] Updated weights on worker 0-0, policy_version 1118044 (0.00086) [2022-07-11 08:42:11,266][26022] Updated weights on worker 0-0, policy_version 1118054 (0.00087) [2022-07-11 08:42:12,019][25689] Fps is (10 sec: 5485.5, 60 sec: 5573.1, 300 sec: 5560.8). Total num frames: 1144892416. Throughput: 0: 5726.8. Samples: 1144893058. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:12,020][25689] Avg episode reward: [(0, '0.411')] [2022-07-11 08:42:12,959][26022] Updated weights on worker 0-0, policy_version 1118064 (0.00089) [2022-07-11 08:42:14,851][26022] Updated weights on worker 0-0, policy_version 1118074 (0.00080) [2022-07-11 08:42:16,393][26022] Updated weights on worker 0-0, policy_version 1118084 (0.00085) [2022-07-11 08:42:17,039][25689] Fps is (10 sec: 5819.5, 60 sec: 5590.6, 300 sec: 5559.2). Total num frames: 1144921088. Throughput: 0: 5741.7. Samples: 1144927028. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:17,039][25689] Avg episode reward: [(0, '-0.643')] [2022-07-11 08:42:18,707][26022] Updated weights on worker 0-0, policy_version 1118094 (0.00073) [2022-07-11 08:42:20,144][26022] Updated weights on worker 0-0, policy_version 1118104 (0.00092) [2022-07-11 08:42:22,193][25689] Fps is (10 sec: 5434.9, 60 sec: 5569.6, 300 sec: 5553.8). Total num frames: 1144947712. Throughput: 0: 4882.2. Samples: 1144943684. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:22,194][25689] Avg episode reward: [(0, '-0.521')] [2022-07-11 08:42:22,218][26022] Updated weights on worker 0-0, policy_version 1118114 (0.00090) [2022-07-11 08:42:23,711][26022] Updated weights on worker 0-0, policy_version 1118124 (0.00084) [2022-07-11 08:42:25,901][26022] Updated weights on worker 0-0, policy_version 1118134 (0.00090) [2022-07-11 08:42:27,248][25689] Fps is (10 sec: 5516.3, 60 sec: 5566.5, 300 sec: 5560.0). Total num frames: 1144977408. Throughput: 0: 5810.8. Samples: 1144977112. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:27,249][25689] Avg episode reward: [(0, '-0.085')] [2022-07-11 08:42:27,812][26022] Updated weights on worker 0-0, policy_version 1118144 (0.00083) [2022-07-11 08:42:29,570][26022] Updated weights on worker 0-0, policy_version 1118154 (0.00095) [2022-07-11 08:42:31,447][26022] Updated weights on worker 0-0, policy_version 1118164 (0.00081) [2022-07-11 08:42:32,323][25689] Fps is (10 sec: 5661.1, 60 sec: 5577.3, 300 sec: 5555.4). Total num frames: 1145005056. Throughput: 0: 5782.1. Samples: 1145010456. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:32,323][25689] Avg episode reward: [(0, '-1.060')] [2022-07-11 08:42:33,135][26022] Updated weights on worker 0-0, policy_version 1118174 (0.00081) [2022-07-11 08:42:35,017][26022] Updated weights on worker 0-0, policy_version 1118184 (0.00086) [2022-07-11 08:42:37,015][26022] Updated weights on worker 0-0, policy_version 1118194 (0.00115) [2022-07-11 08:42:37,403][25689] Fps is (10 sec: 5445.6, 60 sec: 5554.6, 300 sec: 5558.6). Total num frames: 1145032704. Throughput: 0: 4934.2. Samples: 1145027508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:37,403][25689] Avg episode reward: [(0, '-0.966')] [2022-07-11 08:42:38,691][26022] Updated weights on worker 0-0, policy_version 1118204 (0.00083) [2022-07-11 08:42:40,583][26022] Updated weights on worker 0-0, policy_version 1118214 (0.00092) [2022-07-11 08:42:42,352][26022] Updated weights on worker 0-0, policy_version 1118224 (0.00093) [2022-07-11 08:42:42,502][25689] Fps is (10 sec: 5633.2, 60 sec: 5557.1, 300 sec: 5560.6). Total num frames: 1145062400. Throughput: 0: 5762.3. Samples: 1145060712. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:42,503][25689] Avg episode reward: [(0, '0.381')] [2022-07-11 08:42:44,060][26022] Updated weights on worker 0-0, policy_version 1118234 (0.00098) [2022-07-11 08:42:46,016][26022] Updated weights on worker 0-0, policy_version 1118244 (0.00087) [2022-07-11 08:42:47,522][25689] Fps is (10 sec: 5666.6, 60 sec: 5563.0, 300 sec: 5553.7). Total num frames: 1145090048. Throughput: 0: 5777.5. Samples: 1145094244. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:47,523][25689] Avg episode reward: [(0, '0.205')] [2022-07-11 08:42:47,811][26022] Updated weights on worker 0-0, policy_version 1118254 (0.00098) [2022-07-11 08:42:49,740][26022] Updated weights on worker 0-0, policy_version 1118264 (0.00088) [2022-07-11 08:42:51,466][26022] Updated weights on worker 0-0, policy_version 1118274 (0.00092) [2022-07-11 08:42:52,524][25689] Fps is (10 sec: 5415.3, 60 sec: 5536.8, 300 sec: 5553.8). Total num frames: 1145116672. Throughput: 0: 4983.8. Samples: 1145111136. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:52,525][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 08:42:53,456][26022] Updated weights on worker 0-0, policy_version 1118284 (0.00092) [2022-07-11 08:42:55,121][26022] Updated weights on worker 0-0, policy_version 1118294 (0.00082) [2022-07-11 08:42:56,930][26022] Updated weights on worker 0-0, policy_version 1118304 (0.00083) [2022-07-11 08:42:57,558][25689] Fps is (10 sec: 5611.8, 60 sec: 5533.9, 300 sec: 5554.1). Total num frames: 1145146368. Throughput: 0: 5829.6. Samples: 1145145008. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:42:57,559][25689] Avg episode reward: [(0, '0.351')] [2022-07-11 08:42:58,869][26022] Updated weights on worker 0-0, policy_version 1118314 (0.00089) [2022-07-11 08:43:00,567][26022] Updated weights on worker 0-0, policy_version 1118324 (0.00097) [2022-07-11 08:43:02,619][25689] Fps is (10 sec: 5477.8, 60 sec: 5518.2, 300 sec: 5560.0). Total num frames: 1145171968. Throughput: 0: 5855.7. Samples: 1145178508. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:02,619][25689] Avg episode reward: [(0, '-0.276')] [2022-07-11 08:43:02,834][26022] Updated weights on worker 0-0, policy_version 1118334 (0.00087) [2022-07-11 08:43:04,598][26022] Updated weights on worker 0-0, policy_version 1118344 (0.00087) [2022-07-11 08:43:06,419][26022] Updated weights on worker 0-0, policy_version 1118354 (0.00096) [2022-07-11 08:43:07,625][25689] Fps is (10 sec: 5391.5, 60 sec: 5574.8, 300 sec: 5560.4). Total num frames: 1145200640. Throughput: 0: 4937.8. Samples: 1145193504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:07,627][25689] Avg episode reward: [(0, '0.381')] [2022-07-11 08:43:08,349][26022] Updated weights on worker 0-0, policy_version 1118364 (0.00091) [2022-07-11 08:43:10,170][26022] Updated weights on worker 0-0, policy_version 1118374 (0.00090) [2022-07-11 08:43:12,095][26022] Updated weights on worker 0-0, policy_version 1118384 (0.00089) [2022-07-11 08:43:12,631][25689] Fps is (10 sec: 5727.7, 60 sec: 5558.3, 300 sec: 5560.5). Total num frames: 1145229312. Throughput: 0: 5766.8. Samples: 1145227086. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:12,631][25689] Avg episode reward: [(0, '0.321')] [2022-07-11 08:43:13,779][26022] Updated weights on worker 0-0, policy_version 1118394 (0.00096) [2022-07-11 08:43:15,819][26022] Updated weights on worker 0-0, policy_version 1118404 (0.00083) [2022-07-11 08:43:17,281][26022] Updated weights on worker 0-0, policy_version 1118414 (0.00093) [2022-07-11 08:43:17,634][25689] Fps is (10 sec: 5626.5, 60 sec: 5542.8, 300 sec: 5563.0). Total num frames: 1145256960. Throughput: 0: 5765.8. Samples: 1145260762. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:17,635][25689] Avg episode reward: [(0, '0.417')] [2022-07-11 08:43:19,387][26022] Updated weights on worker 0-0, policy_version 1118424 (0.00089) [2022-07-11 08:43:21,315][26022] Updated weights on worker 0-0, policy_version 1118434 (0.00090) [2022-07-11 08:43:22,694][25689] Fps is (10 sec: 5392.9, 60 sec: 5551.5, 300 sec: 5552.0). Total num frames: 1145283584. Throughput: 0: 5763.4. Samples: 1145294210. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:22,695][25689] Avg episode reward: [(0, '0.906')] [2022-07-11 08:43:22,884][26022] Updated weights on worker 0-0, policy_version 1118444 (0.00083) [2022-07-11 08:43:24,988][26022] Updated weights on worker 0-0, policy_version 1118454 (0.00093) [2022-07-11 08:43:26,592][26022] Updated weights on worker 0-0, policy_version 1118464 (0.00088) [2022-07-11 08:43:27,724][25689] Fps is (10 sec: 5480.5, 60 sec: 5536.9, 300 sec: 5562.1). Total num frames: 1145312256. Throughput: 0: 5845.9. Samples: 1145311002. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:27,724][25689] Avg episode reward: [(0, '1.273')] [2022-07-11 08:43:28,471][26022] Updated weights on worker 0-0, policy_version 1118474 (0.00094) [2022-07-11 08:43:30,362][26022] Updated weights on worker 0-0, policy_version 1118484 (0.00085) [2022-07-11 08:43:32,001][26022] Updated weights on worker 0-0, policy_version 1118494 (0.00086) [2022-07-11 08:43:32,738][25689] Fps is (10 sec: 5709.5, 60 sec: 5559.4, 300 sec: 5559.6). Total num frames: 1145340928. Throughput: 0: 5828.1. Samples: 1145344272. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:32,738][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 08:43:34,057][26022] Updated weights on worker 0-0, policy_version 1118504 (0.00086) [2022-07-11 08:43:35,646][26022] Updated weights on worker 0-0, policy_version 1118514 (0.00089) [2022-07-11 08:43:37,710][26022] Updated weights on worker 0-0, policy_version 1118524 (0.00100) [2022-07-11 08:43:37,753][25689] Fps is (10 sec: 5615.9, 60 sec: 5565.4, 300 sec: 5557.2). Total num frames: 1145368576. Throughput: 0: 5829.6. Samples: 1145378044. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:37,753][25689] Avg episode reward: [(0, '0.538')] [2022-07-11 08:43:39,420][26022] Updated weights on worker 0-0, policy_version 1118534 (0.00080) [2022-07-11 08:43:41,206][26022] Updated weights on worker 0-0, policy_version 1118544 (0.00086) [2022-07-11 08:43:42,850][25689] Fps is (10 sec: 5468.3, 60 sec: 5531.7, 300 sec: 5557.0). Total num frames: 1145396224. Throughput: 0: 4994.5. Samples: 1145394878. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:42,851][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 08:43:43,242][26022] Updated weights on worker 0-0, policy_version 1118554 (0.00085) [2022-07-11 08:43:45,013][26022] Updated weights on worker 0-0, policy_version 1118564 (0.00093) [2022-07-11 08:43:46,689][26022] Updated weights on worker 0-0, policy_version 1118574 (0.00090) [2022-07-11 08:43:47,869][25689] Fps is (10 sec: 5567.5, 60 sec: 5548.8, 300 sec: 5556.7). Total num frames: 1145424896. Throughput: 0: 5843.7. Samples: 1145428722. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:47,869][25689] Avg episode reward: [(0, '-0.084')] [2022-07-11 08:43:48,673][26022] Updated weights on worker 0-0, policy_version 1118584 (0.00083) [2022-07-11 08:43:50,228][26022] Updated weights on worker 0-0, policy_version 1118594 (0.00092) [2022-07-11 08:43:50,875][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:43:50,886][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001118597_1145443328.pth [2022-07-11 08:43:50,886][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001116641_1143440384.pth [2022-07-11 08:43:52,490][26022] Updated weights on worker 0-0, policy_version 1118604 (0.00090) [2022-07-11 08:43:52,891][25689] Fps is (10 sec: 5711.2, 60 sec: 5580.9, 300 sec: 5563.6). Total num frames: 1145453568. Throughput: 0: 5860.5. Samples: 1145462378. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:52,891][25689] Avg episode reward: [(0, '-0.376')] [2022-07-11 08:43:53,950][26022] Updated weights on worker 0-0, policy_version 1118614 (0.00096) [2022-07-11 08:43:55,754][26022] Updated weights on worker 0-0, policy_version 1118624 (0.00086) [2022-07-11 08:43:57,796][26022] Updated weights on worker 0-0, policy_version 1118634 (0.00077) [2022-07-11 08:43:57,896][25689] Fps is (10 sec: 5616.3, 60 sec: 5549.5, 300 sec: 5561.2). Total num frames: 1145481216. Throughput: 0: 5028.0. Samples: 1145479328. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:43:57,898][25689] Avg episode reward: [(0, '-0.395')] [2022-07-11 08:43:59,515][26022] Updated weights on worker 0-0, policy_version 1118644 (0.00086) [2022-07-11 08:44:01,319][26022] Updated weights on worker 0-0, policy_version 1118654 (0.00093) [2022-07-11 08:44:02,935][25689] Fps is (10 sec: 5403.3, 60 sec: 5568.5, 300 sec: 5560.9). Total num frames: 1145507840. Throughput: 0: 5806.2. Samples: 1145511494. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:02,935][25689] Avg episode reward: [(0, '0.800')] [2022-07-11 08:44:03,821][26022] Updated weights on worker 0-0, policy_version 1118664 (0.00086) [2022-07-11 08:44:05,283][26022] Updated weights on worker 0-0, policy_version 1118674 (0.00082) [2022-07-11 08:44:07,476][26022] Updated weights on worker 0-0, policy_version 1118684 (0.00089) [2022-07-11 08:44:07,938][25689] Fps is (10 sec: 5506.5, 60 sec: 5568.7, 300 sec: 5565.5). Total num frames: 1145536512. Throughput: 0: 5748.9. Samples: 1145544102. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:07,939][25689] Avg episode reward: [(0, '0.836')] [2022-07-11 08:44:09,019][26022] Updated weights on worker 0-0, policy_version 1118694 (0.00083) [2022-07-11 08:44:10,919][26022] Updated weights on worker 0-0, policy_version 1118704 (0.00082) [2022-07-11 08:44:12,940][26022] Updated weights on worker 0-0, policy_version 1118714 (0.00090) [2022-07-11 08:44:12,946][25689] Fps is (10 sec: 5523.6, 60 sec: 5534.7, 300 sec: 5562.1). Total num frames: 1145563136. Throughput: 0: 4919.5. Samples: 1145561038. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:12,946][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 08:44:14,607][26022] Updated weights on worker 0-0, policy_version 1118724 (0.00080) [2022-07-11 08:44:16,449][26022] Updated weights on worker 0-0, policy_version 1118734 (0.00093) [2022-07-11 08:44:17,963][25689] Fps is (10 sec: 5413.8, 60 sec: 5533.4, 300 sec: 5552.8). Total num frames: 1145590784. Throughput: 0: 5760.4. Samples: 1145594920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:17,964][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 08:44:18,208][26022] Updated weights on worker 0-0, policy_version 1118744 (0.00368) [2022-07-11 08:44:20,043][26022] Updated weights on worker 0-0, policy_version 1118754 (0.00091) [2022-07-11 08:44:21,916][26022] Updated weights on worker 0-0, policy_version 1118764 (0.00086) [2022-07-11 08:44:23,038][25689] Fps is (10 sec: 5681.9, 60 sec: 5582.9, 300 sec: 5565.2). Total num frames: 1145620480. Throughput: 0: 5825.7. Samples: 1145628610. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:23,039][25689] Avg episode reward: [(0, '-1.576')] [2022-07-11 08:44:23,717][26022] Updated weights on worker 0-0, policy_version 1118774 (0.00086) [2022-07-11 08:44:25,505][26022] Updated weights on worker 0-0, policy_version 1118784 (0.00089) [2022-07-11 08:44:27,463][26022] Updated weights on worker 0-0, policy_version 1118794 (0.00095) [2022-07-11 08:44:28,045][25689] Fps is (10 sec: 5688.0, 60 sec: 5568.1, 300 sec: 5561.8). Total num frames: 1145648128. Throughput: 0: 5041.5. Samples: 1145645468. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:28,045][25689] Avg episode reward: [(0, '-1.762')] [2022-07-11 08:44:29,087][26022] Updated weights on worker 0-0, policy_version 1118804 (0.00088) [2022-07-11 08:44:31,137][26022] Updated weights on worker 0-0, policy_version 1118814 (0.00085) [2022-07-11 08:44:32,806][26022] Updated weights on worker 0-0, policy_version 1118824 (0.00082) [2022-07-11 08:44:33,051][25689] Fps is (10 sec: 5522.7, 60 sec: 5551.9, 300 sec: 5562.0). Total num frames: 1145675776. Throughput: 0: 5875.0. Samples: 1145679154. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2022-07-11 08:44:33,051][25689] Avg episode reward: [(0, '-1.450')] [2022-07-11 08:44:34,686][26022] Updated weights on worker 0-0, policy_version 1118834 (0.00081) [2022-07-11 08:44:36,579][26022] Updated weights on worker 0-0, policy_version 1118844 (0.00085) [2022-07-11 08:44:38,074][25689] Fps is (10 sec: 5513.6, 60 sec: 5551.1, 300 sec: 5564.3). Total num frames: 1145703424. Throughput: 0: 5850.1. Samples: 1145712570. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:44:38,074][25689] Avg episode reward: [(0, '-2.259')] [2022-07-11 08:44:38,547][26022] Updated weights on worker 0-0, policy_version 1118854 (0.00418) [2022-07-11 08:44:40,108][26022] Updated weights on worker 0-0, policy_version 1118864 (0.00083) [2022-07-11 08:44:42,221][26022] Updated weights on worker 0-0, policy_version 1118874 (0.00313) [2022-07-11 08:44:43,207][25689] Fps is (10 sec: 5545.3, 60 sec: 5564.8, 300 sec: 5563.1). Total num frames: 1145732096. Throughput: 0: 4986.0. Samples: 1145729172. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:44:43,208][25689] Avg episode reward: [(0, '-1.444')] [2022-07-11 08:44:43,755][26022] Updated weights on worker 0-0, policy_version 1118884 (0.00088) [2022-07-11 08:44:45,775][26022] Updated weights on worker 0-0, policy_version 1118894 (0.00088) [2022-07-11 08:44:47,460][26022] Updated weights on worker 0-0, policy_version 1118904 (0.00091) [2022-07-11 08:44:48,250][25689] Fps is (10 sec: 5534.3, 60 sec: 5545.5, 300 sec: 5563.4). Total num frames: 1145759744. Throughput: 0: 5818.4. Samples: 1145763032. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:44:48,251][25689] Avg episode reward: [(0, '-2.042')] [2022-07-11 08:44:49,339][26022] Updated weights on worker 0-0, policy_version 1118914 (0.00092) [2022-07-11 08:44:51,383][26022] Updated weights on worker 0-0, policy_version 1118924 (0.00079) [2022-07-11 08:44:53,102][26022] Updated weights on worker 0-0, policy_version 1118934 (0.00091) [2022-07-11 08:44:53,271][25689] Fps is (10 sec: 5596.4, 60 sec: 5545.7, 300 sec: 5564.1). Total num frames: 1145788416. Throughput: 0: 5806.6. Samples: 1145796564. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:44:53,271][25689] Avg episode reward: [(0, '-1.402')] [2022-07-11 08:44:54,940][26022] Updated weights on worker 0-0, policy_version 1118944 (0.00082) [2022-07-11 08:44:57,063][26022] Updated weights on worker 0-0, policy_version 1118954 (0.00091) [2022-07-11 08:44:58,278][25689] Fps is (10 sec: 5718.4, 60 sec: 5562.5, 300 sec: 5561.6). Total num frames: 1145817088. Throughput: 0: 4979.8. Samples: 1145813184. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:44:58,279][25689] Avg episode reward: [(0, '-2.509')] [2022-07-11 08:44:58,517][26022] Updated weights on worker 0-0, policy_version 1118964 (0.00092) [2022-07-11 08:45:00,681][26022] Updated weights on worker 0-0, policy_version 1118974 (0.00094) [2022-07-11 08:45:02,658][26022] Updated weights on worker 0-0, policy_version 1118984 (0.00102) [2022-07-11 08:45:03,390][25689] Fps is (10 sec: 5261.8, 60 sec: 5521.8, 300 sec: 5556.3). Total num frames: 1145841664. Throughput: 0: 5808.3. Samples: 1145846404. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:03,391][25689] Avg episode reward: [(0, '-2.514')] [2022-07-11 08:45:04,449][26022] Updated weights on worker 0-0, policy_version 1118994 (0.00084) [2022-07-11 08:45:06,546][26022] Updated weights on worker 0-0, policy_version 1119004 (0.00091) [2022-07-11 08:45:08,211][26022] Updated weights on worker 0-0, policy_version 1119014 (0.00613) [2022-07-11 08:45:08,402][25689] Fps is (10 sec: 5259.8, 60 sec: 5521.1, 300 sec: 5563.1). Total num frames: 1145870336. Throughput: 0: 5691.9. Samples: 1145877732. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:08,402][25689] Avg episode reward: [(0, '-1.537')] [2022-07-11 08:45:10,204][26022] Updated weights on worker 0-0, policy_version 1119024 (0.00090) [2022-07-11 08:45:12,014][26022] Updated weights on worker 0-0, policy_version 1119034 (0.00084) [2022-07-11 08:45:13,434][25689] Fps is (10 sec: 5607.5, 60 sec: 5535.8, 300 sec: 5559.4). Total num frames: 1145897984. Throughput: 0: 4838.1. Samples: 1145894116. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:13,436][25689] Avg episode reward: [(0, '-1.946')] [2022-07-11 08:45:13,673][26022] Updated weights on worker 0-0, policy_version 1119044 (0.00095) [2022-07-11 08:45:15,740][26022] Updated weights on worker 0-0, policy_version 1119054 (0.00092) [2022-07-11 08:45:17,358][26022] Updated weights on worker 0-0, policy_version 1119064 (0.00088) [2022-07-11 08:45:18,440][25689] Fps is (10 sec: 5610.6, 60 sec: 5553.7, 300 sec: 5560.4). Total num frames: 1145926656. Throughput: 0: 5698.8. Samples: 1145928082. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:18,440][25689] Avg episode reward: [(0, '-1.880')] [2022-07-11 08:45:19,257][26022] Updated weights on worker 0-0, policy_version 1119074 (0.00084) [2022-07-11 08:45:21,012][26022] Updated weights on worker 0-0, policy_version 1119084 (0.00088) [2022-07-11 08:45:22,961][26022] Updated weights on worker 0-0, policy_version 1119094 (0.00086) [2022-07-11 08:45:23,555][25689] Fps is (10 sec: 5564.8, 60 sec: 5516.2, 300 sec: 5558.4). Total num frames: 1145954304. Throughput: 0: 5712.2. Samples: 1145961588. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:23,555][25689] Avg episode reward: [(0, '-1.510')] [2022-07-11 08:45:24,787][26022] Updated weights on worker 0-0, policy_version 1119104 (0.00087) [2022-07-11 08:45:26,683][26022] Updated weights on worker 0-0, policy_version 1119114 (0.00080) [2022-07-11 08:45:28,282][26022] Updated weights on worker 0-0, policy_version 1119124 (0.00086) [2022-07-11 08:45:28,654][25689] Fps is (10 sec: 5614.1, 60 sec: 5541.6, 300 sec: 5560.5). Total num frames: 1145984000. Throughput: 0: 4971.7. Samples: 1145978420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:28,655][25689] Avg episode reward: [(0, '-0.521')] [2022-07-11 08:45:30,314][26022] Updated weights on worker 0-0, policy_version 1119134 (0.00083) [2022-07-11 08:45:32,190][26022] Updated weights on worker 0-0, policy_version 1119144 (0.00095) [2022-07-11 08:45:33,666][25689] Fps is (10 sec: 5671.3, 60 sec: 5541.0, 300 sec: 5560.7). Total num frames: 1146011648. Throughput: 0: 5813.2. Samples: 1146011730. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:33,667][25689] Avg episode reward: [(0, '-0.357')] [2022-07-11 08:45:33,931][26022] Updated weights on worker 0-0, policy_version 1119154 (0.00084) [2022-07-11 08:45:35,872][26022] Updated weights on worker 0-0, policy_version 1119164 (0.00091) [2022-07-11 08:45:37,646][26022] Updated weights on worker 0-0, policy_version 1119174 (0.00088) [2022-07-11 08:45:38,698][25689] Fps is (10 sec: 5505.8, 60 sec: 5540.3, 300 sec: 5555.4). Total num frames: 1146039296. Throughput: 0: 5783.4. Samples: 1146045240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:38,698][25689] Avg episode reward: [(0, '-0.605')] [2022-07-11 08:45:39,615][26022] Updated weights on worker 0-0, policy_version 1119184 (0.00084) [2022-07-11 08:45:41,263][26022] Updated weights on worker 0-0, policy_version 1119194 (0.00090) [2022-07-11 08:45:43,070][26022] Updated weights on worker 0-0, policy_version 1119204 (0.00087) [2022-07-11 08:45:43,833][25689] Fps is (10 sec: 5439.1, 60 sec: 5523.2, 300 sec: 5550.1). Total num frames: 1146066944. Throughput: 0: 4952.0. Samples: 1146062004. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:43,833][25689] Avg episode reward: [(0, '-0.015')] [2022-07-11 08:45:44,952][26022] Updated weights on worker 0-0, policy_version 1119214 (0.00090) [2022-07-11 08:45:46,936][26022] Updated weights on worker 0-0, policy_version 1119224 (0.00090) [2022-07-11 08:45:48,571][26022] Updated weights on worker 0-0, policy_version 1119234 (0.00092) [2022-07-11 08:45:48,844][25689] Fps is (10 sec: 5752.4, 60 sec: 5576.8, 300 sec: 5563.8). Total num frames: 1146097664. Throughput: 0: 5811.4. Samples: 1146095752. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:48,845][25689] Avg episode reward: [(0, '0.042')] [2022-07-11 08:45:50,384][26022] Updated weights on worker 0-0, policy_version 1119244 (0.00095) [2022-07-11 08:45:51,235][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:45:51,249][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001119247_1146108928.pth [2022-07-11 08:45:51,249][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001117291_1144105984.pth [2022-07-11 08:45:52,002][26022] Updated weights on worker 0-0, policy_version 1119254 (0.00088) [2022-07-11 08:45:53,853][25689] Fps is (10 sec: 5722.5, 60 sec: 5544.0, 300 sec: 5553.8). Total num frames: 1146124288. Throughput: 0: 5836.2. Samples: 1146129546. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:53,854][25689] Avg episode reward: [(0, '-0.057')] [2022-07-11 08:45:54,208][26022] Updated weights on worker 0-0, policy_version 1119264 (0.00084) [2022-07-11 08:45:55,851][26022] Updated weights on worker 0-0, policy_version 1119274 (0.00532) [2022-07-11 08:45:57,651][26022] Updated weights on worker 0-0, policy_version 1119284 (0.00089) [2022-07-11 08:45:58,862][25689] Fps is (10 sec: 5519.9, 60 sec: 5543.9, 300 sec: 5555.2). Total num frames: 1146152960. Throughput: 0: 5015.4. Samples: 1146146372. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:45:58,864][25689] Avg episode reward: [(0, '0.131')] [2022-07-11 08:45:59,560][26022] Updated weights on worker 0-0, policy_version 1119294 (0.00089) [2022-07-11 08:46:01,407][26022] Updated weights on worker 0-0, policy_version 1119304 (0.00095) [2022-07-11 08:46:03,614][26022] Updated weights on worker 0-0, policy_version 1119314 (0.00085) [2022-07-11 08:46:03,989][25689] Fps is (10 sec: 5455.8, 60 sec: 5576.4, 300 sec: 5560.0). Total num frames: 1146179584. Throughput: 0: 5835.8. Samples: 1146179630. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:03,989][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 08:46:05,374][26022] Updated weights on worker 0-0, policy_version 1119324 (0.00087) [2022-07-11 08:46:07,196][26022] Updated weights on worker 0-0, policy_version 1119334 (0.00093) [2022-07-11 08:46:08,994][25689] Fps is (10 sec: 5356.6, 60 sec: 5560.1, 300 sec: 5553.4). Total num frames: 1146207232. Throughput: 0: 5782.9. Samples: 1146212274. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:08,995][25689] Avg episode reward: [(0, '1.019')] [2022-07-11 08:46:09,056][26022] Updated weights on worker 0-0, policy_version 1119344 (0.00083) [2022-07-11 08:46:10,938][26022] Updated weights on worker 0-0, policy_version 1119354 (0.00085) [2022-07-11 08:46:12,714][26022] Updated weights on worker 0-0, policy_version 1119364 (0.00087) [2022-07-11 08:46:14,079][25689] Fps is (10 sec: 5581.6, 60 sec: 5572.1, 300 sec: 5555.7). Total num frames: 1146235904. Throughput: 0: 4910.4. Samples: 1146228862. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:14,079][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 08:46:14,597][26022] Updated weights on worker 0-0, policy_version 1119374 (0.00086) [2022-07-11 08:46:16,280][26022] Updated weights on worker 0-0, policy_version 1119384 (0.00086) [2022-07-11 08:46:18,035][26022] Updated weights on worker 0-0, policy_version 1119394 (0.00085) [2022-07-11 08:46:19,105][25689] Fps is (10 sec: 5772.3, 60 sec: 5587.1, 300 sec: 5564.2). Total num frames: 1146265600. Throughput: 0: 5767.0. Samples: 1146263116. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:19,106][25689] Avg episode reward: [(0, '1.701')] [2022-07-11 08:46:19,850][26022] Updated weights on worker 0-0, policy_version 1119404 (0.00089) [2022-07-11 08:46:21,796][26022] Updated weights on worker 0-0, policy_version 1119414 (0.00079) [2022-07-11 08:46:23,480][26022] Updated weights on worker 0-0, policy_version 1119424 (0.00089) [2022-07-11 08:46:24,228][25689] Fps is (10 sec: 5751.2, 60 sec: 5603.3, 300 sec: 5558.8). Total num frames: 1146294272. Throughput: 0: 5806.5. Samples: 1146297148. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:24,229][25689] Avg episode reward: [(0, '1.728')] [2022-07-11 08:46:25,459][26022] Updated weights on worker 0-0, policy_version 1119434 (0.00093) [2022-07-11 08:46:27,090][26022] Updated weights on worker 0-0, policy_version 1119444 (0.00095) [2022-07-11 08:46:29,179][26022] Updated weights on worker 0-0, policy_version 1119454 (0.00089) [2022-07-11 08:46:29,277][25689] Fps is (10 sec: 5436.5, 60 sec: 5557.3, 300 sec: 5558.1). Total num frames: 1146320896. Throughput: 0: 5845.2. Samples: 1146330832. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:29,277][25689] Avg episode reward: [(0, '0.747')] [2022-07-11 08:46:31,045][26022] Updated weights on worker 0-0, policy_version 1119464 (0.00091) [2022-07-11 08:46:32,614][26022] Updated weights on worker 0-0, policy_version 1119474 (0.00086) [2022-07-11 08:46:34,307][25689] Fps is (10 sec: 5384.3, 60 sec: 5555.6, 300 sec: 5554.4). Total num frames: 1146348544. Throughput: 0: 5879.5. Samples: 1146347794. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:34,308][25689] Avg episode reward: [(0, '-0.851')] [2022-07-11 08:46:34,693][26022] Updated weights on worker 0-0, policy_version 1119484 (0.00088) [2022-07-11 08:46:36,135][26022] Updated weights on worker 0-0, policy_version 1119494 (0.00088) [2022-07-11 08:46:38,260][26022] Updated weights on worker 0-0, policy_version 1119504 (0.00087) [2022-07-11 08:46:39,406][25689] Fps is (10 sec: 5661.4, 60 sec: 5583.2, 300 sec: 5554.9). Total num frames: 1146378240. Throughput: 0: 5822.3. Samples: 1146381310. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:39,406][25689] Avg episode reward: [(0, '-0.857')] [2022-07-11 08:46:39,820][26022] Updated weights on worker 0-0, policy_version 1119514 (0.00104) [2022-07-11 08:46:41,879][26022] Updated weights on worker 0-0, policy_version 1119524 (0.00204) [2022-07-11 08:46:43,684][26022] Updated weights on worker 0-0, policy_version 1119534 (0.00090) [2022-07-11 08:46:44,455][25689] Fps is (10 sec: 5752.0, 60 sec: 5608.0, 300 sec: 5559.0). Total num frames: 1146406912. Throughput: 0: 5821.7. Samples: 1146414904. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:44,455][25689] Avg episode reward: [(0, '-0.994')] [2022-07-11 08:46:45,575][26022] Updated weights on worker 0-0, policy_version 1119544 (0.00089) [2022-07-11 08:46:47,133][26022] Updated weights on worker 0-0, policy_version 1119554 (0.01326) [2022-07-11 08:46:49,115][26022] Updated weights on worker 0-0, policy_version 1119564 (0.00067) [2022-07-11 08:46:49,499][25689] Fps is (10 sec: 5580.1, 60 sec: 5554.4, 300 sec: 5556.3). Total num frames: 1146434560. Throughput: 0: 4998.3. Samples: 1146431908. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:49,499][25689] Avg episode reward: [(0, '-0.846')] [2022-07-11 08:46:50,799][26022] Updated weights on worker 0-0, policy_version 1119574 (0.00086) [2022-07-11 08:46:52,787][26022] Updated weights on worker 0-0, policy_version 1119584 (0.00089) [2022-07-11 08:46:54,421][26022] Updated weights on worker 0-0, policy_version 1119594 (0.00092) [2022-07-11 08:46:54,520][25689] Fps is (10 sec: 5697.3, 60 sec: 5603.9, 300 sec: 5556.0). Total num frames: 1146464256. Throughput: 0: 5844.3. Samples: 1146465922. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:54,521][25689] Avg episode reward: [(0, '0.400')] [2022-07-11 08:46:56,442][26022] Updated weights on worker 0-0, policy_version 1119604 (0.00087) [2022-07-11 08:46:58,084][26022] Updated weights on worker 0-0, policy_version 1119614 (0.00087) [2022-07-11 08:46:59,555][25689] Fps is (10 sec: 5702.5, 60 sec: 5584.6, 300 sec: 5560.1). Total num frames: 1146491904. Throughput: 0: 5887.8. Samples: 1146499942. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:46:59,555][25689] Avg episode reward: [(0, '1.020')] [2022-07-11 08:47:00,173][26022] Updated weights on worker 0-0, policy_version 1119624 (0.00087) [2022-07-11 08:47:01,628][26022] Updated weights on worker 0-0, policy_version 1119634 (0.00091) [2022-07-11 08:47:04,063][26022] Updated weights on worker 0-0, policy_version 1119644 (0.00092) [2022-07-11 08:47:04,615][25689] Fps is (10 sec: 5477.4, 60 sec: 5607.6, 300 sec: 5567.2). Total num frames: 1146519552. Throughput: 0: 4972.2. Samples: 1146515146. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:04,616][25689] Avg episode reward: [(0, '1.243')] [2022-07-11 08:47:05,710][26022] Updated weights on worker 0-0, policy_version 1119654 (0.00085) [2022-07-11 08:47:07,678][26022] Updated weights on worker 0-0, policy_version 1119664 (0.00092) [2022-07-11 08:47:09,324][26022] Updated weights on worker 0-0, policy_version 1119674 (0.00091) [2022-07-11 08:47:09,620][25689] Fps is (10 sec: 5595.3, 60 sec: 5624.5, 300 sec: 5563.8). Total num frames: 1146548224. Throughput: 0: 5821.2. Samples: 1146549040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:09,621][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 08:47:11,326][26022] Updated weights on worker 0-0, policy_version 1119684 (0.00082) [2022-07-11 08:47:12,982][26022] Updated weights on worker 0-0, policy_version 1119694 (0.00086) [2022-07-11 08:47:14,647][25689] Fps is (10 sec: 5409.9, 60 sec: 5579.2, 300 sec: 5553.4). Total num frames: 1146573824. Throughput: 0: 5792.2. Samples: 1146582504. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:14,647][25689] Avg episode reward: [(0, '1.382')] [2022-07-11 08:47:14,807][26022] Updated weights on worker 0-0, policy_version 1119704 (0.00086) [2022-07-11 08:47:16,544][26022] Updated weights on worker 0-0, policy_version 1119714 (0.00088) [2022-07-11 08:47:18,563][26022] Updated weights on worker 0-0, policy_version 1119724 (0.00081) [2022-07-11 08:47:19,683][25689] Fps is (10 sec: 5495.2, 60 sec: 5578.3, 300 sec: 5565.9). Total num frames: 1146603520. Throughput: 0: 4941.7. Samples: 1146599408. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:19,683][25689] Avg episode reward: [(0, '1.652')] [2022-07-11 08:47:20,341][26022] Updated weights on worker 0-0, policy_version 1119734 (0.00083) [2022-07-11 08:47:22,155][26022] Updated weights on worker 0-0, policy_version 1119744 (0.00080) [2022-07-11 08:47:24,043][26022] Updated weights on worker 0-0, policy_version 1119754 (0.00086) [2022-07-11 08:47:24,803][25689] Fps is (10 sec: 5747.2, 60 sec: 5578.5, 300 sec: 5561.3). Total num frames: 1146632192. Throughput: 0: 5840.7. Samples: 1146633058. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:24,805][25689] Avg episode reward: [(0, '1.427')] [2022-07-11 08:47:25,858][26022] Updated weights on worker 0-0, policy_version 1119764 (0.00091) [2022-07-11 08:47:27,592][26022] Updated weights on worker 0-0, policy_version 1119774 (0.00087) [2022-07-11 08:47:29,598][26022] Updated weights on worker 0-0, policy_version 1119784 (0.00091) [2022-07-11 08:47:29,856][25689] Fps is (10 sec: 5536.1, 60 sec: 5595.1, 300 sec: 5561.7). Total num frames: 1146659840. Throughput: 0: 5819.2. Samples: 1146666796. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:29,858][25689] Avg episode reward: [(0, '1.274')] [2022-07-11 08:47:31,130][26022] Updated weights on worker 0-0, policy_version 1119794 (0.00104) [2022-07-11 08:47:33,215][26022] Updated weights on worker 0-0, policy_version 1119804 (0.00092) [2022-07-11 08:47:34,915][25689] Fps is (10 sec: 5569.5, 60 sec: 5609.3, 300 sec: 5565.5). Total num frames: 1146688512. Throughput: 0: 4986.8. Samples: 1146683578. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:34,917][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 08:47:34,939][26022] Updated weights on worker 0-0, policy_version 1119814 (0.00080) [2022-07-11 08:47:36,914][26022] Updated weights on worker 0-0, policy_version 1119824 (0.00086) [2022-07-11 08:47:38,759][26022] Updated weights on worker 0-0, policy_version 1119834 (0.00087) [2022-07-11 08:47:39,939][25689] Fps is (10 sec: 5585.6, 60 sec: 5582.4, 300 sec: 5560.0). Total num frames: 1146716160. Throughput: 0: 5818.1. Samples: 1146717260. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:39,941][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 08:47:40,611][26022] Updated weights on worker 0-0, policy_version 1119844 (0.00091) [2022-07-11 08:47:42,303][26022] Updated weights on worker 0-0, policy_version 1119854 (0.00086) [2022-07-11 08:47:44,431][26022] Updated weights on worker 0-0, policy_version 1119864 (0.00086) [2022-07-11 08:47:45,002][25689] Fps is (10 sec: 5685.2, 60 sec: 5598.0, 300 sec: 5566.1). Total num frames: 1146745856. Throughput: 0: 5809.9. Samples: 1146750410. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:45,002][25689] Avg episode reward: [(0, '1.272')] [2022-07-11 08:47:45,994][26022] Updated weights on worker 0-0, policy_version 1119874 (0.00084) [2022-07-11 08:47:47,932][26022] Updated weights on worker 0-0, policy_version 1119884 (0.00090) [2022-07-11 08:47:49,731][26022] Updated weights on worker 0-0, policy_version 1119894 (0.00090) [2022-07-11 08:47:50,044][25689] Fps is (10 sec: 5674.5, 60 sec: 5598.1, 300 sec: 5568.8). Total num frames: 1146773504. Throughput: 0: 4971.2. Samples: 1146767158. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:50,045][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 08:47:51,333][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:47:51,343][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001119903_1146780672.pth [2022-07-11 08:47:51,343][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001117945_1144775680.pth [2022-07-11 08:47:51,664][26022] Updated weights on worker 0-0, policy_version 1119904 (0.00080) [2022-07-11 08:47:53,383][26022] Updated weights on worker 0-0, policy_version 1119914 (0.00085) [2022-07-11 08:47:55,069][25689] Fps is (10 sec: 5390.6, 60 sec: 5547.0, 300 sec: 5558.7). Total num frames: 1146800128. Throughput: 0: 5816.3. Samples: 1146800802. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:47:55,070][25689] Avg episode reward: [(0, '1.047')] [2022-07-11 08:47:55,354][26022] Updated weights on worker 0-0, policy_version 1119924 (0.00086) [2022-07-11 08:47:56,984][26022] Updated weights on worker 0-0, policy_version 1119934 (0.00092) [2022-07-11 08:47:59,159][26022] Updated weights on worker 0-0, policy_version 1119944 (0.00092) [2022-07-11 08:48:00,087][25689] Fps is (10 sec: 5506.4, 60 sec: 5565.6, 300 sec: 5569.8). Total num frames: 1146828800. Throughput: 0: 5811.0. Samples: 1146834338. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:00,087][25689] Avg episode reward: [(0, '1.389')] [2022-07-11 08:48:00,522][26022] Updated weights on worker 0-0, policy_version 1119954 (0.00092) [2022-07-11 08:48:03,208][26022] Updated weights on worker 0-0, policy_version 1119964 (0.00085) [2022-07-11 08:48:04,773][26022] Updated weights on worker 0-0, policy_version 1119974 (0.00094) [2022-07-11 08:48:05,145][25689] Fps is (10 sec: 5386.5, 60 sec: 5531.9, 300 sec: 5558.5). Total num frames: 1146854400. Throughput: 0: 4886.5. Samples: 1146848840. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:05,145][25689] Avg episode reward: [(0, '0.209')] [2022-07-11 08:48:06,589][26022] Updated weights on worker 0-0, policy_version 1119984 (0.00085) [2022-07-11 08:48:08,698][26022] Updated weights on worker 0-0, policy_version 1119994 (0.00087) [2022-07-11 08:48:10,180][25689] Fps is (10 sec: 5376.8, 60 sec: 5529.2, 300 sec: 5557.9). Total num frames: 1146883072. Throughput: 0: 5716.4. Samples: 1146882264. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:10,181][25689] Avg episode reward: [(0, '-0.083')] [2022-07-11 08:48:10,322][26022] Updated weights on worker 0-0, policy_version 1120004 (0.00088) [2022-07-11 08:48:12,076][26022] Updated weights on worker 0-0, policy_version 1120014 (0.00087) [2022-07-11 08:48:13,942][26022] Updated weights on worker 0-0, policy_version 1120024 (0.00061) [2022-07-11 08:48:15,197][25689] Fps is (10 sec: 5704.7, 60 sec: 5580.8, 300 sec: 5561.1). Total num frames: 1146911744. Throughput: 0: 5731.9. Samples: 1146916170. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:15,197][25689] Avg episode reward: [(0, '0.084')] [2022-07-11 08:48:15,780][26022] Updated weights on worker 0-0, policy_version 1120034 (0.00089) [2022-07-11 08:48:17,662][26022] Updated weights on worker 0-0, policy_version 1120044 (0.00087) [2022-07-11 08:48:19,518][26022] Updated weights on worker 0-0, policy_version 1120054 (0.00053) [2022-07-11 08:48:20,200][25689] Fps is (10 sec: 5621.2, 60 sec: 5550.0, 300 sec: 5565.6). Total num frames: 1146939392. Throughput: 0: 4912.9. Samples: 1146933152. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:20,200][25689] Avg episode reward: [(0, '0.343')] [2022-07-11 08:48:21,087][26022] Updated weights on worker 0-0, policy_version 1120064 (0.00088) [2022-07-11 08:48:23,236][26022] Updated weights on worker 0-0, policy_version 1120074 (0.00085) [2022-07-11 08:48:24,881][26022] Updated weights on worker 0-0, policy_version 1120084 (0.00092) [2022-07-11 08:48:25,309][25689] Fps is (10 sec: 5569.7, 60 sec: 5551.1, 300 sec: 5564.2). Total num frames: 1146968064. Throughput: 0: 5861.0. Samples: 1146967020. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:25,309][25689] Avg episode reward: [(0, '0.373')] [2022-07-11 08:48:26,690][26022] Updated weights on worker 0-0, policy_version 1120094 (0.00091) [2022-07-11 08:48:28,685][26022] Updated weights on worker 0-0, policy_version 1120104 (0.00087) [2022-07-11 08:48:30,335][25689] Fps is (10 sec: 5556.8, 60 sec: 5553.5, 300 sec: 5560.5). Total num frames: 1146995712. Throughput: 0: 5870.5. Samples: 1147000582. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2022-07-11 08:48:30,336][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 08:48:30,367][26022] Updated weights on worker 0-0, policy_version 1120114 (0.00095) [2022-07-11 08:48:32,099][26022] Updated weights on worker 0-0, policy_version 1120124 (0.00081) [2022-07-11 08:48:34,132][26022] Updated weights on worker 0-0, policy_version 1120134 (0.00095) [2022-07-11 08:48:35,373][25689] Fps is (10 sec: 5596.3, 60 sec: 5555.5, 300 sec: 5563.5). Total num frames: 1147024384. Throughput: 0: 5025.3. Samples: 1147017558. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:48:35,373][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 08:48:35,803][26022] Updated weights on worker 0-0, policy_version 1120144 (0.00531) [2022-07-11 08:48:37,842][26022] Updated weights on worker 0-0, policy_version 1120154 (0.00082) [2022-07-11 08:48:39,326][26022] Updated weights on worker 0-0, policy_version 1120164 (0.00083) [2022-07-11 08:48:40,402][25689] Fps is (10 sec: 5594.3, 60 sec: 5554.9, 300 sec: 5564.8). Total num frames: 1147052032. Throughput: 0: 5844.0. Samples: 1147051218. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:48:40,403][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 08:48:41,382][26022] Updated weights on worker 0-0, policy_version 1120174 (0.00080) [2022-07-11 08:48:43,172][26022] Updated weights on worker 0-0, policy_version 1120184 (0.00082) [2022-07-11 08:48:44,910][26022] Updated weights on worker 0-0, policy_version 1120194 (0.00087) [2022-07-11 08:48:45,462][25689] Fps is (10 sec: 5480.7, 60 sec: 5521.4, 300 sec: 5560.6). Total num frames: 1147079680. Throughput: 0: 5838.2. Samples: 1147084678. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:48:45,462][25689] Avg episode reward: [(0, '0.556')] [2022-07-11 08:48:46,926][26022] Updated weights on worker 0-0, policy_version 1120204 (0.00091) [2022-07-11 08:48:48,698][26022] Updated weights on worker 0-0, policy_version 1120214 (0.00086) [2022-07-11 08:48:50,556][25689] Fps is (10 sec: 5547.1, 60 sec: 5533.7, 300 sec: 5559.2). Total num frames: 1147108352. Throughput: 0: 5837.3. Samples: 1147118616. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:48:50,557][25689] Avg episode reward: [(0, '0.471')] [2022-07-11 08:48:50,561][26022] Updated weights on worker 0-0, policy_version 1120224 (0.00084) [2022-07-11 08:48:52,351][26022] Updated weights on worker 0-0, policy_version 1120234 (0.00102) [2022-07-11 08:48:54,048][26022] Updated weights on worker 0-0, policy_version 1120244 (0.00088) [2022-07-11 08:48:55,601][25689] Fps is (10 sec: 5756.7, 60 sec: 5582.6, 300 sec: 5565.4). Total num frames: 1147138048. Throughput: 0: 5826.0. Samples: 1147135408. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:48:55,603][25689] Avg episode reward: [(0, '-0.122')] [2022-07-11 08:48:55,984][26022] Updated weights on worker 0-0, policy_version 1120254 (0.00088) [2022-07-11 08:48:57,787][26022] Updated weights on worker 0-0, policy_version 1120264 (0.00092) [2022-07-11 08:48:59,394][26022] Updated weights on worker 0-0, policy_version 1120274 (0.00083) [2022-07-11 08:49:00,689][25689] Fps is (10 sec: 5760.2, 60 sec: 5576.1, 300 sec: 5571.3). Total num frames: 1147166720. Throughput: 0: 5832.2. Samples: 1147169530. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:00,690][25689] Avg episode reward: [(0, '-0.379')] [2022-07-11 08:49:01,715][26022] Updated weights on worker 0-0, policy_version 1120284 (0.00095) [2022-07-11 08:49:03,375][26022] Updated weights on worker 0-0, policy_version 1120294 (0.00087) [2022-07-11 08:49:05,454][26022] Updated weights on worker 0-0, policy_version 1120304 (0.00090) [2022-07-11 08:49:05,751][25689] Fps is (10 sec: 5347.0, 60 sec: 5575.7, 300 sec: 5559.9). Total num frames: 1147192320. Throughput: 0: 5743.3. Samples: 1147201204. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:05,751][25689] Avg episode reward: [(0, '-0.917')] [2022-07-11 08:49:07,112][26022] Updated weights on worker 0-0, policy_version 1120314 (0.00092) [2022-07-11 08:49:09,068][26022] Updated weights on worker 0-0, policy_version 1120324 (0.00089) [2022-07-11 08:49:10,754][25689] Fps is (10 sec: 5391.7, 60 sec: 5578.6, 300 sec: 5566.9). Total num frames: 1147220992. Throughput: 0: 4919.4. Samples: 1147217982. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:10,755][25689] Avg episode reward: [(0, '-0.173')] [2022-07-11 08:49:10,888][26022] Updated weights on worker 0-0, policy_version 1120334 (0.00084) [2022-07-11 08:49:12,891][26022] Updated weights on worker 0-0, policy_version 1120344 (0.00085) [2022-07-11 08:49:14,428][26022] Updated weights on worker 0-0, policy_version 1120354 (0.00088) [2022-07-11 08:49:15,807][25689] Fps is (10 sec: 5600.6, 60 sec: 5558.4, 300 sec: 5566.2). Total num frames: 1147248640. Throughput: 0: 5742.6. Samples: 1147251442. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:15,807][25689] Avg episode reward: [(0, '-0.893')] [2022-07-11 08:49:16,411][26022] Updated weights on worker 0-0, policy_version 1120364 (0.00084) [2022-07-11 08:49:18,121][26022] Updated weights on worker 0-0, policy_version 1120374 (0.00080) [2022-07-11 08:49:20,132][26022] Updated weights on worker 0-0, policy_version 1120384 (0.00084) [2022-07-11 08:49:20,812][25689] Fps is (10 sec: 5599.8, 60 sec: 5575.1, 300 sec: 5564.1). Total num frames: 1147277312. Throughput: 0: 5756.1. Samples: 1147285362. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:20,812][25689] Avg episode reward: [(0, '-0.136')] [2022-07-11 08:49:21,746][26022] Updated weights on worker 0-0, policy_version 1120394 (0.00088) [2022-07-11 08:49:23,604][26022] Updated weights on worker 0-0, policy_version 1120404 (0.00089) [2022-07-11 08:49:25,453][26022] Updated weights on worker 0-0, policy_version 1120414 (0.00611) [2022-07-11 08:49:25,877][25689] Fps is (10 sec: 5592.5, 60 sec: 5562.2, 300 sec: 5563.0). Total num frames: 1147304960. Throughput: 0: 5013.0. Samples: 1147302100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:25,878][25689] Avg episode reward: [(0, '-0.701')] [2022-07-11 08:49:27,241][26022] Updated weights on worker 0-0, policy_version 1120424 (0.00082) [2022-07-11 08:49:29,266][26022] Updated weights on worker 0-0, policy_version 1120434 (0.00093) [2022-07-11 08:49:30,926][25689] Fps is (10 sec: 5568.7, 60 sec: 5577.1, 300 sec: 5565.6). Total num frames: 1147333632. Throughput: 0: 5830.6. Samples: 1147335592. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:30,926][25689] Avg episode reward: [(0, '0.442')] [2022-07-11 08:49:30,956][26022] Updated weights on worker 0-0, policy_version 1120444 (0.00090) [2022-07-11 08:49:32,799][26022] Updated weights on worker 0-0, policy_version 1120454 (0.00088) [2022-07-11 08:49:34,650][26022] Updated weights on worker 0-0, policy_version 1120464 (0.00085) [2022-07-11 08:49:35,957][25689] Fps is (10 sec: 5689.2, 60 sec: 5577.7, 300 sec: 5568.9). Total num frames: 1147362304. Throughput: 0: 5842.1. Samples: 1147369162. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:35,957][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 08:49:36,560][26022] Updated weights on worker 0-0, policy_version 1120474 (0.00090) [2022-07-11 08:49:38,262][26022] Updated weights on worker 0-0, policy_version 1120484 (0.00096) [2022-07-11 08:49:40,233][26022] Updated weights on worker 0-0, policy_version 1120494 (0.00087) [2022-07-11 08:49:40,980][25689] Fps is (10 sec: 5601.7, 60 sec: 5578.3, 300 sec: 5567.5). Total num frames: 1147389952. Throughput: 0: 4987.4. Samples: 1147385948. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:40,980][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 08:49:41,899][26022] Updated weights on worker 0-0, policy_version 1120504 (0.00080) [2022-07-11 08:49:43,963][26022] Updated weights on worker 0-0, policy_version 1120514 (0.00090) [2022-07-11 08:49:45,553][26022] Updated weights on worker 0-0, policy_version 1120524 (0.00091) [2022-07-11 08:49:46,093][25689] Fps is (10 sec: 5455.6, 60 sec: 5573.4, 300 sec: 5566.2). Total num frames: 1147417600. Throughput: 0: 5800.3. Samples: 1147419354. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:46,093][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 08:49:47,662][26022] Updated weights on worker 0-0, policy_version 1120534 (0.00095) [2022-07-11 08:49:49,400][26022] Updated weights on worker 0-0, policy_version 1120544 (0.00091) [2022-07-11 08:49:51,145][25689] Fps is (10 sec: 5540.2, 60 sec: 5577.2, 300 sec: 5565.6). Total num frames: 1147446272. Throughput: 0: 5811.9. Samples: 1147453108. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:51,146][25689] Avg episode reward: [(0, '0.908')] [2022-07-11 08:49:51,202][26022] Updated weights on worker 0-0, policy_version 1120554 (0.00088) [2022-07-11 08:49:51,364][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:49:51,373][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001120555_1147448320.pth [2022-07-11 08:49:51,374][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001118597_1145443328.pth [2022-07-11 08:49:53,052][26022] Updated weights on worker 0-0, policy_version 1120564 (0.00087) [2022-07-11 08:49:54,891][26022] Updated weights on worker 0-0, policy_version 1120574 (0.00095) [2022-07-11 08:49:56,207][25689] Fps is (10 sec: 5771.0, 60 sec: 5575.7, 300 sec: 5568.1). Total num frames: 1147475968. Throughput: 0: 4985.7. Samples: 1147470122. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:49:56,207][25689] Avg episode reward: [(0, '0.737')] [2022-07-11 08:49:56,515][26022] Updated weights on worker 0-0, policy_version 1120584 (0.00084) [2022-07-11 08:49:58,741][26022] Updated weights on worker 0-0, policy_version 1120594 (0.00081) [2022-07-11 08:50:00,077][26022] Updated weights on worker 0-0, policy_version 1120604 (0.00085) [2022-07-11 08:50:01,292][25689] Fps is (10 sec: 5550.6, 60 sec: 5542.1, 300 sec: 5575.4). Total num frames: 1147502592. Throughput: 0: 5807.4. Samples: 1147503910. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:01,294][25689] Avg episode reward: [(0, '0.587')] [2022-07-11 08:50:02,377][26022] Updated weights on worker 0-0, policy_version 1120614 (0.00087) [2022-07-11 08:50:04,378][26022] Updated weights on worker 0-0, policy_version 1120624 (0.00084) [2022-07-11 08:50:06,205][26022] Updated weights on worker 0-0, policy_version 1120634 (0.00096) [2022-07-11 08:50:06,330][25689] Fps is (10 sec: 5260.0, 60 sec: 5561.3, 300 sec: 5568.1). Total num frames: 1147529216. Throughput: 0: 5727.2. Samples: 1147535256. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:06,332][25689] Avg episode reward: [(0, '0.520')] [2022-07-11 08:50:08,131][26022] Updated weights on worker 0-0, policy_version 1120644 (0.00078) [2022-07-11 08:50:09,812][26022] Updated weights on worker 0-0, policy_version 1120654 (0.00200) [2022-07-11 08:50:11,335][25689] Fps is (10 sec: 5404.1, 60 sec: 5544.2, 300 sec: 5568.6). Total num frames: 1147556864. Throughput: 0: 4894.6. Samples: 1147551928. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:11,335][25689] Avg episode reward: [(0, '0.677')] [2022-07-11 08:50:11,643][26022] Updated weights on worker 0-0, policy_version 1120664 (0.00079) [2022-07-11 08:50:13,703][26022] Updated weights on worker 0-0, policy_version 1120674 (0.00088) [2022-07-11 08:50:15,313][26022] Updated weights on worker 0-0, policy_version 1120684 (0.00093) [2022-07-11 08:50:16,357][25689] Fps is (10 sec: 5616.6, 60 sec: 5563.9, 300 sec: 5568.3). Total num frames: 1147585536. Throughput: 0: 5724.4. Samples: 1147585472. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:16,359][25689] Avg episode reward: [(0, '0.584')] [2022-07-11 08:50:17,271][26022] Updated weights on worker 0-0, policy_version 1120694 (0.00089) [2022-07-11 08:50:18,881][26022] Updated weights on worker 0-0, policy_version 1120704 (0.00087) [2022-07-11 08:50:20,848][26022] Updated weights on worker 0-0, policy_version 1120714 (0.00088) [2022-07-11 08:50:21,367][25689] Fps is (10 sec: 5716.2, 60 sec: 5563.5, 300 sec: 5573.7). Total num frames: 1147614208. Throughput: 0: 5754.6. Samples: 1147619432. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:21,367][25689] Avg episode reward: [(0, '-1.966')] [2022-07-11 08:50:22,790][26022] Updated weights on worker 0-0, policy_version 1120724 (0.00061) [2022-07-11 08:50:24,280][26022] Updated weights on worker 0-0, policy_version 1120734 (0.00097) [2022-07-11 08:50:26,406][26022] Updated weights on worker 0-0, policy_version 1120744 (0.00091) [2022-07-11 08:50:26,450][25689] Fps is (10 sec: 5681.7, 60 sec: 5578.8, 300 sec: 5570.5). Total num frames: 1147642880. Throughput: 0: 5025.4. Samples: 1147636368. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:26,451][25689] Avg episode reward: [(0, '-2.088')] [2022-07-11 08:50:28,204][26022] Updated weights on worker 0-0, policy_version 1120754 (0.00088) [2022-07-11 08:50:30,009][26022] Updated weights on worker 0-0, policy_version 1120764 (0.00084) [2022-07-11 08:50:31,503][25689] Fps is (10 sec: 5455.6, 60 sec: 5544.5, 300 sec: 5566.3). Total num frames: 1147669504. Throughput: 0: 5854.4. Samples: 1147669998. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:31,503][25689] Avg episode reward: [(0, '-1.995')] [2022-07-11 08:50:31,865][26022] Updated weights on worker 0-0, policy_version 1120774 (0.00090) [2022-07-11 08:50:33,403][26022] Updated weights on worker 0-0, policy_version 1120784 (0.00087) [2022-07-11 08:50:35,606][26022] Updated weights on worker 0-0, policy_version 1120794 (0.00125) [2022-07-11 08:50:36,527][25689] Fps is (10 sec: 5589.0, 60 sec: 5562.1, 300 sec: 5573.3). Total num frames: 1147699200. Throughput: 0: 5868.7. Samples: 1147703842. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:36,528][25689] Avg episode reward: [(0, '-1.968')] [2022-07-11 08:50:37,198][26022] Updated weights on worker 0-0, policy_version 1120804 (0.00086) [2022-07-11 08:50:39,151][26022] Updated weights on worker 0-0, policy_version 1120814 (0.00081) [2022-07-11 08:50:40,834][26022] Updated weights on worker 0-0, policy_version 1120824 (0.00080) [2022-07-11 08:50:41,628][25689] Fps is (10 sec: 5663.5, 60 sec: 5554.9, 300 sec: 5574.0). Total num frames: 1147726848. Throughput: 0: 4997.6. Samples: 1147720688. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:41,628][25689] Avg episode reward: [(0, '-1.321')] [2022-07-11 08:50:42,659][26022] Updated weights on worker 0-0, policy_version 1120834 (0.00079) [2022-07-11 08:50:44,514][26022] Updated weights on worker 0-0, policy_version 1120844 (0.00085) [2022-07-11 08:50:46,492][26022] Updated weights on worker 0-0, policy_version 1120854 (0.00091) [2022-07-11 08:50:46,679][25689] Fps is (10 sec: 5446.8, 60 sec: 5560.6, 300 sec: 5562.9). Total num frames: 1147754496. Throughput: 0: 5834.4. Samples: 1147754392. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:46,680][25689] Avg episode reward: [(0, '-1.133')] [2022-07-11 08:50:48,302][26022] Updated weights on worker 0-0, policy_version 1120864 (0.00086) [2022-07-11 08:50:50,293][26022] Updated weights on worker 0-0, policy_version 1120874 (0.00085) [2022-07-11 08:50:51,688][25689] Fps is (10 sec: 5700.2, 60 sec: 5581.6, 300 sec: 5573.2). Total num frames: 1147784192. Throughput: 0: 5840.5. Samples: 1147787890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:51,688][25689] Avg episode reward: [(0, '0.984')] [2022-07-11 08:50:51,866][26022] Updated weights on worker 0-0, policy_version 1120884 (0.00091) [2022-07-11 08:50:53,795][26022] Updated weights on worker 0-0, policy_version 1120894 (0.00087) [2022-07-11 08:50:55,467][26022] Updated weights on worker 0-0, policy_version 1120904 (0.00088) [2022-07-11 08:50:56,705][25689] Fps is (10 sec: 5821.6, 60 sec: 5568.7, 300 sec: 5573.1). Total num frames: 1147812864. Throughput: 0: 4999.1. Samples: 1147804716. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:50:56,707][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 08:50:57,391][26022] Updated weights on worker 0-0, policy_version 1120914 (0.00081) [2022-07-11 08:50:59,094][26022] Updated weights on worker 0-0, policy_version 1120924 (0.00093) [2022-07-11 08:51:00,902][26022] Updated weights on worker 0-0, policy_version 1120934 (0.00086) [2022-07-11 08:51:01,728][25689] Fps is (10 sec: 5507.3, 60 sec: 5574.4, 300 sec: 5575.0). Total num frames: 1147839488. Throughput: 0: 5868.4. Samples: 1147838646. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:01,730][25689] Avg episode reward: [(0, '0.504')] [2022-07-11 08:51:03,132][26022] Updated weights on worker 0-0, policy_version 1120944 (0.00083) [2022-07-11 08:51:05,175][26022] Updated weights on worker 0-0, policy_version 1120954 (0.00086) [2022-07-11 08:51:06,793][25689] Fps is (10 sec: 5379.8, 60 sec: 5588.8, 300 sec: 5573.9). Total num frames: 1147867136. Throughput: 0: 5763.5. Samples: 1147870320. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:06,796][25689] Avg episode reward: [(0, '0.499')] [2022-07-11 08:51:06,799][26022] Updated weights on worker 0-0, policy_version 1120964 (0.00087) [2022-07-11 08:51:08,587][26022] Updated weights on worker 0-0, policy_version 1120974 (0.00084) [2022-07-11 08:51:10,377][26022] Updated weights on worker 0-0, policy_version 1120984 (0.00632) [2022-07-11 08:51:11,861][25689] Fps is (10 sec: 5457.1, 60 sec: 5583.0, 300 sec: 5570.8). Total num frames: 1147894784. Throughput: 0: 5756.6. Samples: 1147904020. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:11,862][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 08:51:12,466][26022] Updated weights on worker 0-0, policy_version 1120994 (0.00088) [2022-07-11 08:51:14,038][26022] Updated weights on worker 0-0, policy_version 1121004 (0.00090) [2022-07-11 08:51:16,165][26022] Updated weights on worker 0-0, policy_version 1121014 (0.00093) [2022-07-11 08:51:16,873][25689] Fps is (10 sec: 5486.0, 60 sec: 5567.1, 300 sec: 5564.2). Total num frames: 1147922432. Throughput: 0: 5762.6. Samples: 1147920934. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:16,875][25689] Avg episode reward: [(0, '0.514')] [2022-07-11 08:51:17,716][26022] Updated weights on worker 0-0, policy_version 1121024 (0.00090) [2022-07-11 08:51:19,901][26022] Updated weights on worker 0-0, policy_version 1121034 (0.00093) [2022-07-11 08:51:21,451][26022] Updated weights on worker 0-0, policy_version 1121044 (0.00417) [2022-07-11 08:51:21,895][25689] Fps is (10 sec: 5715.2, 60 sec: 5582.9, 300 sec: 5569.5). Total num frames: 1147952128. Throughput: 0: 5734.0. Samples: 1147954280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:21,895][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 08:51:23,588][26022] Updated weights on worker 0-0, policy_version 1121054 (0.00075) [2022-07-11 08:51:25,020][26022] Updated weights on worker 0-0, policy_version 1121064 (0.00084) [2022-07-11 08:51:27,014][25689] Fps is (10 sec: 5452.8, 60 sec: 5528.8, 300 sec: 5564.7). Total num frames: 1147977728. Throughput: 0: 5811.3. Samples: 1147987828. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:27,016][25689] Avg episode reward: [(0, '0.521')] [2022-07-11 08:51:27,136][26022] Updated weights on worker 0-0, policy_version 1121074 (0.00083) [2022-07-11 08:51:28,805][26022] Updated weights on worker 0-0, policy_version 1121084 (0.00085) [2022-07-11 08:51:30,667][26022] Updated weights on worker 0-0, policy_version 1121094 (0.00088) [2022-07-11 08:51:32,112][25689] Fps is (10 sec: 5512.2, 60 sec: 5592.3, 300 sec: 5573.8). Total num frames: 1148008448. Throughput: 0: 4969.3. Samples: 1148004652. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:32,112][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 08:51:32,505][26022] Updated weights on worker 0-0, policy_version 1121104 (0.00091) [2022-07-11 08:51:34,418][26022] Updated weights on worker 0-0, policy_version 1121114 (0.00086) [2022-07-11 08:51:36,211][26022] Updated weights on worker 0-0, policy_version 1121124 (0.00086) [2022-07-11 08:51:37,179][25689] Fps is (10 sec: 5842.7, 60 sec: 5571.5, 300 sec: 5571.0). Total num frames: 1148037120. Throughput: 0: 5786.6. Samples: 1148038438. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:37,180][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 08:51:37,898][26022] Updated weights on worker 0-0, policy_version 1121134 (0.00085) [2022-07-11 08:51:39,839][26022] Updated weights on worker 0-0, policy_version 1121144 (0.00084) [2022-07-11 08:51:41,560][26022] Updated weights on worker 0-0, policy_version 1121154 (0.00083) [2022-07-11 08:51:42,211][25689] Fps is (10 sec: 5475.2, 60 sec: 5560.8, 300 sec: 5564.4). Total num frames: 1148063744. Throughput: 0: 5792.4. Samples: 1148071962. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:42,212][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 08:51:43,430][26022] Updated weights on worker 0-0, policy_version 1121164 (0.00083) [2022-07-11 08:51:45,266][26022] Updated weights on worker 0-0, policy_version 1121174 (0.00084) [2022-07-11 08:51:47,068][26022] Updated weights on worker 0-0, policy_version 1121184 (0.00084) [2022-07-11 08:51:47,330][25689] Fps is (10 sec: 5548.5, 60 sec: 5588.5, 300 sec: 5569.9). Total num frames: 1148093440. Throughput: 0: 4968.7. Samples: 1148088770. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:47,330][25689] Avg episode reward: [(0, '0.912')] [2022-07-11 08:51:49,035][26022] Updated weights on worker 0-0, policy_version 1121194 (0.00082) [2022-07-11 08:51:50,710][26022] Updated weights on worker 0-0, policy_version 1121204 (0.00095) [2022-07-11 08:51:51,412][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:51:51,425][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001121208_1148116992.pth [2022-07-11 08:51:51,425][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001119247_1146108928.pth [2022-07-11 08:51:52,365][25689] Fps is (10 sec: 5546.6, 60 sec: 5535.3, 300 sec: 5559.3). Total num frames: 1148120064. Throughput: 0: 5809.8. Samples: 1148122320. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:52,366][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 08:51:52,729][26022] Updated weights on worker 0-0, policy_version 1121214 (0.00085) [2022-07-11 08:51:54,416][26022] Updated weights on worker 0-0, policy_version 1121224 (0.00090) [2022-07-11 08:51:56,206][26022] Updated weights on worker 0-0, policy_version 1121234 (0.00267) [2022-07-11 08:51:57,383][25689] Fps is (10 sec: 5602.2, 60 sec: 5552.2, 300 sec: 5566.5). Total num frames: 1148149760. Throughput: 0: 5821.1. Samples: 1148156046. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:51:57,384][25689] Avg episode reward: [(0, '0.594')] [2022-07-11 08:51:58,048][26022] Updated weights on worker 0-0, policy_version 1121244 (0.00081) [2022-07-11 08:51:59,844][26022] Updated weights on worker 0-0, policy_version 1121254 (0.00094) [2022-07-11 08:52:02,017][26022] Updated weights on worker 0-0, policy_version 1121264 (0.00318) [2022-07-11 08:52:02,397][25689] Fps is (10 sec: 5512.0, 60 sec: 5536.1, 300 sec: 5560.5). Total num frames: 1148175360. Throughput: 0: 5008.7. Samples: 1148173068. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:02,398][25689] Avg episode reward: [(0, '0.282')] [2022-07-11 08:52:04,015][26022] Updated weights on worker 0-0, policy_version 1121274 (0.00084) [2022-07-11 08:52:05,700][26022] Updated weights on worker 0-0, policy_version 1121284 (0.00081) [2022-07-11 08:52:07,516][25689] Fps is (10 sec: 5456.6, 60 sec: 5564.9, 300 sec: 5561.8). Total num frames: 1148205056. Throughput: 0: 5744.1. Samples: 1148204726. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:07,517][25689] Avg episode reward: [(0, '0.365')] [2022-07-11 08:52:07,518][26022] Updated weights on worker 0-0, policy_version 1121294 (0.00093) [2022-07-11 08:52:09,305][26022] Updated weights on worker 0-0, policy_version 1121304 (0.00081) [2022-07-11 08:52:11,226][26022] Updated weights on worker 0-0, policy_version 1121314 (0.00089) [2022-07-11 08:52:12,532][25689] Fps is (10 sec: 5759.4, 60 sec: 5586.6, 300 sec: 5572.3). Total num frames: 1148233728. Throughput: 0: 5772.9. Samples: 1148238738. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:12,532][25689] Avg episode reward: [(0, '0.454')] [2022-07-11 08:52:13,113][26022] Updated weights on worker 0-0, policy_version 1121324 (0.00087) [2022-07-11 08:52:14,883][26022] Updated weights on worker 0-0, policy_version 1121334 (0.00083) [2022-07-11 08:52:16,663][26022] Updated weights on worker 0-0, policy_version 1121344 (0.00085) [2022-07-11 08:52:17,534][25689] Fps is (10 sec: 5520.2, 60 sec: 5570.7, 300 sec: 5562.6). Total num frames: 1148260352. Throughput: 0: 4935.9. Samples: 1148255508. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:17,535][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 08:52:18,579][26022] Updated weights on worker 0-0, policy_version 1121354 (0.00559) [2022-07-11 08:52:20,363][26022] Updated weights on worker 0-0, policy_version 1121364 (0.00083) [2022-07-11 08:52:22,199][26022] Updated weights on worker 0-0, policy_version 1121374 (0.00095) [2022-07-11 08:52:22,547][25689] Fps is (10 sec: 5521.1, 60 sec: 5554.5, 300 sec: 5564.6). Total num frames: 1148289024. Throughput: 0: 5736.4. Samples: 1148288656. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:22,548][25689] Avg episode reward: [(0, '0.468')] [2022-07-11 08:52:24,119][26022] Updated weights on worker 0-0, policy_version 1121384 (0.00084) [2022-07-11 08:52:25,794][26022] Updated weights on worker 0-0, policy_version 1121394 (0.00085) [2022-07-11 08:52:27,634][25689] Fps is (10 sec: 5474.3, 60 sec: 5574.3, 300 sec: 5560.5). Total num frames: 1148315648. Throughput: 0: 5837.5. Samples: 1148322164. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:27,636][25689] Avg episode reward: [(0, '0.573')] [2022-07-11 08:52:28,040][26022] Updated weights on worker 0-0, policy_version 1121404 (0.00083) [2022-07-11 08:52:29,428][26022] Updated weights on worker 0-0, policy_version 1121414 (0.00090) [2022-07-11 08:52:31,618][26022] Updated weights on worker 0-0, policy_version 1121424 (0.00092) [2022-07-11 08:52:32,642][25689] Fps is (10 sec: 5578.8, 60 sec: 5565.7, 300 sec: 5564.9). Total num frames: 1148345344. Throughput: 0: 4976.8. Samples: 1148338826. Policy #0 lag: (min: 0.0, avg: 8.1, max: 18.0) [2022-07-11 08:52:32,643][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 08:52:33,138][26022] Updated weights on worker 0-0, policy_version 1121434 (0.00085) [2022-07-11 08:52:35,232][26022] Updated weights on worker 0-0, policy_version 1121444 (0.00087) [2022-07-11 08:52:36,928][26022] Updated weights on worker 0-0, policy_version 1121454 (0.00092) [2022-07-11 08:52:37,660][25689] Fps is (10 sec: 5617.6, 60 sec: 5536.4, 300 sec: 5561.6). Total num frames: 1148371968. Throughput: 0: 5807.1. Samples: 1148372384. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:52:37,661][25689] Avg episode reward: [(0, '-1.372')] [2022-07-11 08:52:38,711][26022] Updated weights on worker 0-0, policy_version 1121464 (0.00085) [2022-07-11 08:52:40,748][26022] Updated weights on worker 0-0, policy_version 1121474 (0.00083) [2022-07-11 08:52:42,454][26022] Updated weights on worker 0-0, policy_version 1121484 (0.00081) [2022-07-11 08:52:42,663][25689] Fps is (10 sec: 5518.4, 60 sec: 5573.0, 300 sec: 5559.3). Total num frames: 1148400640. Throughput: 0: 5836.7. Samples: 1148406064. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:52:42,663][25689] Avg episode reward: [(0, '-1.287')] [2022-07-11 08:52:44,162][26022] Updated weights on worker 0-0, policy_version 1121494 (0.00085) [2022-07-11 08:52:46,117][26022] Updated weights on worker 0-0, policy_version 1121504 (0.00086) [2022-07-11 08:52:47,684][26022] Updated weights on worker 0-0, policy_version 1121514 (0.00081) [2022-07-11 08:52:47,712][25689] Fps is (10 sec: 5806.8, 60 sec: 5579.4, 300 sec: 5566.1). Total num frames: 1148430336. Throughput: 0: 5015.9. Samples: 1148422870. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:52:47,712][25689] Avg episode reward: [(0, '-2.258')] [2022-07-11 08:52:49,816][26022] Updated weights on worker 0-0, policy_version 1121524 (0.00087) [2022-07-11 08:52:51,569][26022] Updated weights on worker 0-0, policy_version 1121534 (0.00088) [2022-07-11 08:52:52,727][25689] Fps is (10 sec: 5494.4, 60 sec: 5564.4, 300 sec: 5562.8). Total num frames: 1148455936. Throughput: 0: 5856.4. Samples: 1148456448. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:52:52,727][25689] Avg episode reward: [(0, '-3.309')] [2022-07-11 08:52:53,266][26022] Updated weights on worker 0-0, policy_version 1121544 (0.00086) [2022-07-11 08:52:55,267][26022] Updated weights on worker 0-0, policy_version 1121554 (0.00087) [2022-07-11 08:52:56,970][26022] Updated weights on worker 0-0, policy_version 1121564 (0.00089) [2022-07-11 08:52:57,747][25689] Fps is (10 sec: 5510.3, 60 sec: 5564.1, 300 sec: 5566.2). Total num frames: 1148485632. Throughput: 0: 5877.2. Samples: 1148490438. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:52:57,747][25689] Avg episode reward: [(0, '-3.396')] [2022-07-11 08:52:58,849][26022] Updated weights on worker 0-0, policy_version 1121574 (0.00099) [2022-07-11 08:53:00,689][26022] Updated weights on worker 0-0, policy_version 1121584 (0.00087) [2022-07-11 08:53:02,751][25689] Fps is (10 sec: 5516.1, 60 sec: 5565.0, 300 sec: 5567.2). Total num frames: 1148511232. Throughput: 0: 5041.8. Samples: 1148507346. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:02,752][25689] Avg episode reward: [(0, '-3.260')] [2022-07-11 08:53:02,779][26022] Updated weights on worker 0-0, policy_version 1121594 (0.00277) [2022-07-11 08:53:04,675][26022] Updated weights on worker 0-0, policy_version 1121604 (0.00091) [2022-07-11 08:53:06,514][26022] Updated weights on worker 0-0, policy_version 1121614 (0.00095) [2022-07-11 08:53:07,803][25689] Fps is (10 sec: 5397.0, 60 sec: 5554.3, 300 sec: 5566.9). Total num frames: 1148539904. Throughput: 0: 5792.1. Samples: 1148539238. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:07,803][25689] Avg episode reward: [(0, '-2.120')] [2022-07-11 08:53:08,212][26022] Updated weights on worker 0-0, policy_version 1121624 (0.00091) [2022-07-11 08:53:10,000][26022] Updated weights on worker 0-0, policy_version 1121634 (0.00091) [2022-07-11 08:53:11,875][26022] Updated weights on worker 0-0, policy_version 1121644 (0.00086) [2022-07-11 08:53:12,811][25689] Fps is (10 sec: 5496.8, 60 sec: 5521.0, 300 sec: 5560.2). Total num frames: 1148566528. Throughput: 0: 5804.0. Samples: 1148573016. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:12,811][25689] Avg episode reward: [(0, '-0.430')] [2022-07-11 08:53:13,681][26022] Updated weights on worker 0-0, policy_version 1121654 (0.00085) [2022-07-11 08:53:15,600][26022] Updated weights on worker 0-0, policy_version 1121664 (0.00090) [2022-07-11 08:53:17,383][26022] Updated weights on worker 0-0, policy_version 1121674 (0.00086) [2022-07-11 08:53:17,817][25689] Fps is (10 sec: 5726.2, 60 sec: 5588.5, 300 sec: 5570.4). Total num frames: 1148597248. Throughput: 0: 4950.5. Samples: 1148589796. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:17,817][25689] Avg episode reward: [(0, '-0.436')] [2022-07-11 08:53:19,194][26022] Updated weights on worker 0-0, policy_version 1121684 (0.00084) [2022-07-11 08:53:20,839][26022] Updated weights on worker 0-0, policy_version 1121694 (0.00086) [2022-07-11 08:53:22,820][25689] Fps is (10 sec: 5831.2, 60 sec: 5572.5, 300 sec: 5569.0). Total num frames: 1148624896. Throughput: 0: 5802.1. Samples: 1148623790. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:22,821][25689] Avg episode reward: [(0, '0.516')] [2022-07-11 08:53:22,831][26022] Updated weights on worker 0-0, policy_version 1121704 (0.00087) [2022-07-11 08:53:24,717][26022] Updated weights on worker 0-0, policy_version 1121714 (0.00087) [2022-07-11 08:53:26,443][26022] Updated weights on worker 0-0, policy_version 1121724 (0.00089) [2022-07-11 08:53:27,878][25689] Fps is (10 sec: 5496.1, 60 sec: 5592.3, 300 sec: 5568.4). Total num frames: 1148652544. Throughput: 0: 5886.4. Samples: 1148657408. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:27,878][25689] Avg episode reward: [(0, '0.449')] [2022-07-11 08:53:28,259][26022] Updated weights on worker 0-0, policy_version 1121734 (0.00088) [2022-07-11 08:53:30,224][26022] Updated weights on worker 0-0, policy_version 1121744 (0.00087) [2022-07-11 08:53:31,955][26022] Updated weights on worker 0-0, policy_version 1121754 (0.00092) [2022-07-11 08:53:32,888][25689] Fps is (10 sec: 5594.1, 60 sec: 5575.1, 300 sec: 5568.9). Total num frames: 1148681216. Throughput: 0: 5041.8. Samples: 1148674242. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:32,888][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 08:53:33,833][26022] Updated weights on worker 0-0, policy_version 1121764 (0.00094) [2022-07-11 08:53:35,496][26022] Updated weights on worker 0-0, policy_version 1121774 (0.00084) [2022-07-11 08:53:37,368][26022] Updated weights on worker 0-0, policy_version 1121784 (0.00623) [2022-07-11 08:53:37,896][25689] Fps is (10 sec: 5621.9, 60 sec: 5593.0, 300 sec: 5569.3). Total num frames: 1148708864. Throughput: 0: 5902.6. Samples: 1148708314. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:37,896][25689] Avg episode reward: [(0, '1.436')] [2022-07-11 08:53:39,203][26022] Updated weights on worker 0-0, policy_version 1121794 (0.00095) [2022-07-11 08:53:41,155][26022] Updated weights on worker 0-0, policy_version 1121804 (0.00094) [2022-07-11 08:53:42,944][26022] Updated weights on worker 0-0, policy_version 1121814 (0.00088) [2022-07-11 08:53:42,947][25689] Fps is (10 sec: 5497.3, 60 sec: 5571.5, 300 sec: 5569.5). Total num frames: 1148736512. Throughput: 0: 5880.6. Samples: 1148742146. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:42,947][25689] Avg episode reward: [(0, '1.706')] [2022-07-11 08:53:44,826][26022] Updated weights on worker 0-0, policy_version 1121824 (0.00083) [2022-07-11 08:53:46,493][26022] Updated weights on worker 0-0, policy_version 1121834 (0.00092) [2022-07-11 08:53:48,012][25689] Fps is (10 sec: 5567.1, 60 sec: 5553.0, 300 sec: 5570.0). Total num frames: 1148765184. Throughput: 0: 5044.6. Samples: 1148758978. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:48,013][25689] Avg episode reward: [(0, '1.862')] [2022-07-11 08:53:48,280][26022] Updated weights on worker 0-0, policy_version 1121844 (0.00091) [2022-07-11 08:53:50,297][26022] Updated weights on worker 0-0, policy_version 1121854 (0.00506) [2022-07-11 08:53:51,539][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:53:51,553][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001121862_1148786688.pth [2022-07-11 08:53:51,553][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001119903_1146780672.pth [2022-07-11 08:53:51,931][26022] Updated weights on worker 0-0, policy_version 1121864 (0.00078) [2022-07-11 08:53:53,043][25689] Fps is (10 sec: 5679.8, 60 sec: 5602.5, 300 sec: 5566.8). Total num frames: 1148793856. Throughput: 0: 5869.4. Samples: 1148792540. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:53,043][25689] Avg episode reward: [(0, '1.029')] [2022-07-11 08:53:54,026][26022] Updated weights on worker 0-0, policy_version 1121874 (0.00083) [2022-07-11 08:53:55,412][26022] Updated weights on worker 0-0, policy_version 1121884 (0.00089) [2022-07-11 08:53:57,563][26022] Updated weights on worker 0-0, policy_version 1121894 (0.00097) [2022-07-11 08:53:58,051][25689] Fps is (10 sec: 5712.2, 60 sec: 5586.6, 300 sec: 5568.3). Total num frames: 1148822528. Throughput: 0: 5848.8. Samples: 1148826200. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:53:58,052][25689] Avg episode reward: [(0, '1.022')] [2022-07-11 08:53:59,221][26022] Updated weights on worker 0-0, policy_version 1121904 (0.00088) [2022-07-11 08:54:01,042][26022] Updated weights on worker 0-0, policy_version 1121914 (0.00083) [2022-07-11 08:54:03,060][25689] Fps is (10 sec: 5315.5, 60 sec: 5569.2, 300 sec: 5565.9). Total num frames: 1148847104. Throughput: 0: 5760.2. Samples: 1148858004. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:03,061][25689] Avg episode reward: [(0, '1.415')] [2022-07-11 08:54:03,378][26022] Updated weights on worker 0-0, policy_version 1121924 (0.00084) [2022-07-11 08:54:05,019][26022] Updated weights on worker 0-0, policy_version 1121934 (0.00084) [2022-07-11 08:54:06,845][26022] Updated weights on worker 0-0, policy_version 1121944 (0.00091) [2022-07-11 08:54:08,201][25689] Fps is (10 sec: 5347.0, 60 sec: 5577.9, 300 sec: 5566.7). Total num frames: 1148876800. Throughput: 0: 5748.7. Samples: 1148875038. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:08,202][25689] Avg episode reward: [(0, '0.955')] [2022-07-11 08:54:08,776][26022] Updated weights on worker 0-0, policy_version 1121954 (0.00083) [2022-07-11 08:54:10,508][26022] Updated weights on worker 0-0, policy_version 1121964 (0.00092) [2022-07-11 08:54:12,423][26022] Updated weights on worker 0-0, policy_version 1121974 (0.00087) [2022-07-11 08:54:13,225][25689] Fps is (10 sec: 5842.8, 60 sec: 5627.2, 300 sec: 5574.1). Total num frames: 1148906496. Throughput: 0: 5766.8. Samples: 1148908930. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:13,226][25689] Avg episode reward: [(0, '0.848')] [2022-07-11 08:54:14,433][26022] Updated weights on worker 0-0, policy_version 1121984 (0.00086) [2022-07-11 08:54:15,892][26022] Updated weights on worker 0-0, policy_version 1121994 (0.00086) [2022-07-11 08:54:18,178][26022] Updated weights on worker 0-0, policy_version 1122004 (0.00089) [2022-07-11 08:54:18,265][25689] Fps is (10 sec: 5494.6, 60 sec: 5539.4, 300 sec: 5563.2). Total num frames: 1148932096. Throughput: 0: 5763.9. Samples: 1148942710. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:18,266][25689] Avg episode reward: [(0, '-0.812')] [2022-07-11 08:54:19,514][26022] Updated weights on worker 0-0, policy_version 1122014 (0.00086) [2022-07-11 08:54:21,749][26022] Updated weights on worker 0-0, policy_version 1122024 (0.00099) [2022-07-11 08:54:23,318][25689] Fps is (10 sec: 5478.8, 60 sec: 5568.7, 300 sec: 5570.3). Total num frames: 1148961792. Throughput: 0: 5011.4. Samples: 1148959524. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:23,319][25689] Avg episode reward: [(0, '-0.534')] [2022-07-11 08:54:23,436][26022] Updated weights on worker 0-0, policy_version 1122034 (0.00092) [2022-07-11 08:54:25,287][26022] Updated weights on worker 0-0, policy_version 1122044 (0.00089) [2022-07-11 08:54:27,075][26022] Updated weights on worker 0-0, policy_version 1122054 (0.00087) [2022-07-11 08:54:28,386][25689] Fps is (10 sec: 5767.1, 60 sec: 5584.7, 300 sec: 5569.9). Total num frames: 1148990464. Throughput: 0: 5846.1. Samples: 1148993040. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:28,386][25689] Avg episode reward: [(0, '-1.164')] [2022-07-11 08:54:28,823][26022] Updated weights on worker 0-0, policy_version 1122064 (0.00083) [2022-07-11 08:54:30,721][26022] Updated weights on worker 0-0, policy_version 1122074 (0.00088) [2022-07-11 08:54:32,451][26022] Updated weights on worker 0-0, policy_version 1122084 (0.00084) [2022-07-11 08:54:33,419][25689] Fps is (10 sec: 5677.1, 60 sec: 5582.6, 300 sec: 5569.9). Total num frames: 1149019136. Throughput: 0: 5843.7. Samples: 1149026936. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:33,419][25689] Avg episode reward: [(0, '-1.771')] [2022-07-11 08:54:34,255][26022] Updated weights on worker 0-0, policy_version 1122094 (0.00099) [2022-07-11 08:54:36,168][26022] Updated weights on worker 0-0, policy_version 1122104 (0.01430) [2022-07-11 08:54:37,883][26022] Updated weights on worker 0-0, policy_version 1122114 (0.00084) [2022-07-11 08:54:38,448][25689] Fps is (10 sec: 5699.3, 60 sec: 5597.6, 300 sec: 5573.2). Total num frames: 1149047808. Throughput: 0: 5018.1. Samples: 1149043988. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:38,448][25689] Avg episode reward: [(0, '-2.785')] [2022-07-11 08:54:39,806][26022] Updated weights on worker 0-0, policy_version 1122124 (0.00081) [2022-07-11 08:54:41,501][26022] Updated weights on worker 0-0, policy_version 1122134 (0.00087) [2022-07-11 08:54:43,380][26022] Updated weights on worker 0-0, policy_version 1122144 (0.00087) [2022-07-11 08:54:43,471][25689] Fps is (10 sec: 5603.0, 60 sec: 5600.1, 300 sec: 5574.9). Total num frames: 1149075456. Throughput: 0: 5886.9. Samples: 1149078162. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:43,471][25689] Avg episode reward: [(0, '-2.326')] [2022-07-11 08:54:45,222][26022] Updated weights on worker 0-0, policy_version 1122154 (0.00079) [2022-07-11 08:54:47,115][26022] Updated weights on worker 0-0, policy_version 1122164 (0.00084) [2022-07-11 08:54:48,545][25689] Fps is (10 sec: 5577.5, 60 sec: 5599.3, 300 sec: 5574.5). Total num frames: 1149104128. Throughput: 0: 5890.7. Samples: 1149111794. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:48,546][25689] Avg episode reward: [(0, '-1.851')] [2022-07-11 08:54:48,871][26022] Updated weights on worker 0-0, policy_version 1122174 (0.00093) [2022-07-11 08:54:50,635][26022] Updated weights on worker 0-0, policy_version 1122184 (0.00093) [2022-07-11 08:54:52,440][26022] Updated weights on worker 0-0, policy_version 1122194 (0.00088) [2022-07-11 08:54:53,562][25689] Fps is (10 sec: 5682.8, 60 sec: 5600.6, 300 sec: 5571.9). Total num frames: 1149132800. Throughput: 0: 5048.9. Samples: 1149128634. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:53,563][25689] Avg episode reward: [(0, '-1.611')] [2022-07-11 08:54:54,277][26022] Updated weights on worker 0-0, policy_version 1122204 (0.00087) [2022-07-11 08:54:55,855][26022] Updated weights on worker 0-0, policy_version 1122214 (0.00094) [2022-07-11 08:54:57,877][26022] Updated weights on worker 0-0, policy_version 1122224 (0.00090) [2022-07-11 08:54:58,595][25689] Fps is (10 sec: 5706.1, 60 sec: 5598.3, 300 sec: 5579.8). Total num frames: 1149161472. Throughput: 0: 5896.8. Samples: 1149162794. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:54:58,595][25689] Avg episode reward: [(0, '-0.684')] [2022-07-11 08:54:59,538][26022] Updated weights on worker 0-0, policy_version 1122234 (0.00086) [2022-07-11 08:55:01,615][26022] Updated weights on worker 0-0, policy_version 1122244 (0.00085) [2022-07-11 08:55:03,587][26022] Updated weights on worker 0-0, policy_version 1122254 (0.00089) [2022-07-11 08:55:03,687][25689] Fps is (10 sec: 5461.1, 60 sec: 5624.5, 300 sec: 5578.7). Total num frames: 1149188096. Throughput: 0: 5771.2. Samples: 1149194834. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:03,688][25689] Avg episode reward: [(0, '-0.340')] [2022-07-11 08:55:05,572][26022] Updated weights on worker 0-0, policy_version 1122264 (0.00090) [2022-07-11 08:55:07,252][26022] Updated weights on worker 0-0, policy_version 1122274 (0.00089) [2022-07-11 08:55:08,759][25689] Fps is (10 sec: 5339.8, 60 sec: 5597.1, 300 sec: 5577.5). Total num frames: 1149215744. Throughput: 0: 4944.5. Samples: 1149211740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:08,759][25689] Avg episode reward: [(0, '0.863')] [2022-07-11 08:55:09,240][26022] Updated weights on worker 0-0, policy_version 1122284 (0.00094) [2022-07-11 08:55:11,187][26022] Updated weights on worker 0-0, policy_version 1122294 (0.00081) [2022-07-11 08:55:12,927][26022] Updated weights on worker 0-0, policy_version 1122304 (0.00102) [2022-07-11 08:55:13,790][25689] Fps is (10 sec: 5574.8, 60 sec: 5579.5, 300 sec: 5577.3). Total num frames: 1149244416. Throughput: 0: 5755.7. Samples: 1149245060. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:13,790][25689] Avg episode reward: [(0, '0.841')] [2022-07-11 08:55:14,684][26022] Updated weights on worker 0-0, policy_version 1122314 (0.00092) [2022-07-11 08:55:16,552][26022] Updated weights on worker 0-0, policy_version 1122324 (0.00093) [2022-07-11 08:55:18,201][26022] Updated weights on worker 0-0, policy_version 1122334 (0.00083) [2022-07-11 08:55:18,810][25689] Fps is (10 sec: 5501.4, 60 sec: 5598.2, 300 sec: 5570.2). Total num frames: 1149271040. Throughput: 0: 5721.1. Samples: 1149278446. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:18,811][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 08:55:20,391][26022] Updated weights on worker 0-0, policy_version 1122344 (0.00091) [2022-07-11 08:55:22,048][26022] Updated weights on worker 0-0, policy_version 1122354 (0.00088) [2022-07-11 08:55:23,840][25689] Fps is (10 sec: 5502.1, 60 sec: 5583.5, 300 sec: 5571.3). Total num frames: 1149299712. Throughput: 0: 4980.7. Samples: 1149295208. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:23,840][25689] Avg episode reward: [(0, '-0.522')] [2022-07-11 08:55:23,965][26022] Updated weights on worker 0-0, policy_version 1122364 (0.00081) [2022-07-11 08:55:25,692][26022] Updated weights on worker 0-0, policy_version 1122374 (0.00086) [2022-07-11 08:55:27,618][26022] Updated weights on worker 0-0, policy_version 1122384 (0.00091) [2022-07-11 08:55:28,892][25689] Fps is (10 sec: 5687.8, 60 sec: 5584.9, 300 sec: 5578.1). Total num frames: 1149328384. Throughput: 0: 5810.1. Samples: 1149328716. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:28,892][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 08:55:29,347][26022] Updated weights on worker 0-0, policy_version 1122394 (0.00088) [2022-07-11 08:55:31,480][26022] Updated weights on worker 0-0, policy_version 1122404 (0.00081) [2022-07-11 08:55:32,984][26022] Updated weights on worker 0-0, policy_version 1122414 (0.00090) [2022-07-11 08:55:33,950][25689] Fps is (10 sec: 5672.0, 60 sec: 5582.6, 300 sec: 5574.1). Total num frames: 1149357056. Throughput: 0: 5800.2. Samples: 1149361992. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:33,950][25689] Avg episode reward: [(0, '-0.331')] [2022-07-11 08:55:35,072][26022] Updated weights on worker 0-0, policy_version 1122424 (0.00093) [2022-07-11 08:55:36,693][26022] Updated weights on worker 0-0, policy_version 1122434 (0.00089) [2022-07-11 08:55:38,585][26022] Updated weights on worker 0-0, policy_version 1122444 (0.00084) [2022-07-11 08:55:38,980][25689] Fps is (10 sec: 5582.9, 60 sec: 5565.6, 300 sec: 5575.4). Total num frames: 1149384704. Throughput: 0: 4984.9. Samples: 1149378992. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:38,980][25689] Avg episode reward: [(0, '-0.143')] [2022-07-11 08:55:40,360][26022] Updated weights on worker 0-0, policy_version 1122454 (0.00088) [2022-07-11 08:55:42,202][26022] Updated weights on worker 0-0, policy_version 1122464 (0.00086) [2022-07-11 08:55:43,959][26022] Updated weights on worker 0-0, policy_version 1122474 (0.00081) [2022-07-11 08:55:44,039][25689] Fps is (10 sec: 5582.3, 60 sec: 5579.2, 300 sec: 5578.7). Total num frames: 1149413376. Throughput: 0: 5823.5. Samples: 1149412838. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:44,039][25689] Avg episode reward: [(0, '-0.096')] [2022-07-11 08:55:45,906][26022] Updated weights on worker 0-0, policy_version 1122484 (0.00089) [2022-07-11 08:55:47,471][26022] Updated weights on worker 0-0, policy_version 1122494 (0.00081) [2022-07-11 08:55:49,074][25689] Fps is (10 sec: 5681.0, 60 sec: 5582.8, 300 sec: 5574.8). Total num frames: 1149442048. Throughput: 0: 5870.9. Samples: 1149447202. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:49,075][25689] Avg episode reward: [(0, '-0.072')] [2022-07-11 08:55:49,437][26022] Updated weights on worker 0-0, policy_version 1122504 (0.00084) [2022-07-11 08:55:51,203][26022] Updated weights on worker 0-0, policy_version 1122514 (0.00091) [2022-07-11 08:55:51,563][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:55:51,572][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001122516_1149456384.pth [2022-07-11 08:55:51,573][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001120555_1147448320.pth [2022-07-11 08:55:53,083][26022] Updated weights on worker 0-0, policy_version 1122524 (0.00091) [2022-07-11 08:55:54,104][25689] Fps is (10 sec: 5697.0, 60 sec: 5581.5, 300 sec: 5574.5). Total num frames: 1149470720. Throughput: 0: 5075.4. Samples: 1149464284. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:54,105][25689] Avg episode reward: [(0, '0.397')] [2022-07-11 08:55:55,104][26022] Updated weights on worker 0-0, policy_version 1122534 (0.00087) [2022-07-11 08:55:56,655][26022] Updated weights on worker 0-0, policy_version 1122544 (0.00079) [2022-07-11 08:55:58,734][26022] Updated weights on worker 0-0, policy_version 1122554 (0.00090) [2022-07-11 08:55:59,112][25689] Fps is (10 sec: 5508.3, 60 sec: 5550.0, 300 sec: 5574.8). Total num frames: 1149497344. Throughput: 0: 5901.1. Samples: 1149497796. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:55:59,113][25689] Avg episode reward: [(0, '-0.116')] [2022-07-11 08:56:00,160][26022] Updated weights on worker 0-0, policy_version 1122564 (0.00083) [2022-07-11 08:56:02,605][26022] Updated weights on worker 0-0, policy_version 1122574 (0.00095) [2022-07-11 08:56:04,129][25689] Fps is (10 sec: 5414.0, 60 sec: 5573.9, 300 sec: 5575.7). Total num frames: 1149524992. Throughput: 0: 5794.9. Samples: 1149529258. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:04,131][25689] Avg episode reward: [(0, '0.076')] [2022-07-11 08:56:04,202][26022] Updated weights on worker 0-0, policy_version 1122584 (0.00086) [2022-07-11 08:56:06,274][26022] Updated weights on worker 0-0, policy_version 1122594 (0.00087) [2022-07-11 08:56:07,903][26022] Updated weights on worker 0-0, policy_version 1122604 (0.00086) [2022-07-11 08:56:09,180][25689] Fps is (10 sec: 5594.1, 60 sec: 5592.7, 300 sec: 5579.5). Total num frames: 1149553664. Throughput: 0: 4918.0. Samples: 1149546084. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:09,181][25689] Avg episode reward: [(0, '-1.437')] [2022-07-11 08:56:09,992][26022] Updated weights on worker 0-0, policy_version 1122614 (0.00092) [2022-07-11 08:56:11,551][26022] Updated weights on worker 0-0, policy_version 1122624 (0.00095) [2022-07-11 08:56:13,492][26022] Updated weights on worker 0-0, policy_version 1122634 (0.00079) [2022-07-11 08:56:14,273][25689] Fps is (10 sec: 5551.7, 60 sec: 5570.0, 300 sec: 5577.9). Total num frames: 1149581312. Throughput: 0: 5735.9. Samples: 1149579972. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:14,274][25689] Avg episode reward: [(0, '-1.572')] [2022-07-11 08:56:15,227][26022] Updated weights on worker 0-0, policy_version 1122644 (0.00088) [2022-07-11 08:56:17,292][26022] Updated weights on worker 0-0, policy_version 1122654 (0.00097) [2022-07-11 08:56:19,160][26022] Updated weights on worker 0-0, policy_version 1122664 (0.00088) [2022-07-11 08:56:19,372][25689] Fps is (10 sec: 5425.4, 60 sec: 5579.7, 300 sec: 5569.6). Total num frames: 1149608960. Throughput: 0: 5691.0. Samples: 1149613094. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:19,373][25689] Avg episode reward: [(0, '-1.559')] [2022-07-11 08:56:20,908][26022] Updated weights on worker 0-0, policy_version 1122674 (0.00088) [2022-07-11 08:56:22,908][26022] Updated weights on worker 0-0, policy_version 1122684 (0.00112) [2022-07-11 08:56:24,413][25689] Fps is (10 sec: 5453.7, 60 sec: 5561.8, 300 sec: 5578.0). Total num frames: 1149636608. Throughput: 0: 5760.0. Samples: 1149646094. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:24,413][25689] Avg episode reward: [(0, '-1.852')] [2022-07-11 08:56:24,652][26022] Updated weights on worker 0-0, policy_version 1122694 (0.00085) [2022-07-11 08:56:26,566][26022] Updated weights on worker 0-0, policy_version 1122704 (0.00095) [2022-07-11 08:56:28,477][26022] Updated weights on worker 0-0, policy_version 1122714 (0.00095) [2022-07-11 08:56:29,452][25689] Fps is (10 sec: 5485.7, 60 sec: 5546.1, 300 sec: 5568.7). Total num frames: 1149664256. Throughput: 0: 5739.4. Samples: 1149662434. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:29,453][25689] Avg episode reward: [(0, '-2.594')] [2022-07-11 08:56:30,332][26022] Updated weights on worker 0-0, policy_version 1122724 (0.00087) [2022-07-11 08:56:32,265][26022] Updated weights on worker 0-0, policy_version 1122734 (0.00100) [2022-07-11 08:56:33,947][26022] Updated weights on worker 0-0, policy_version 1122744 (0.00099) [2022-07-11 08:56:34,455][25689] Fps is (10 sec: 5506.3, 60 sec: 5534.2, 300 sec: 5566.5). Total num frames: 1149691904. Throughput: 0: 5712.3. Samples: 1149695254. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2022-07-11 08:56:34,455][25689] Avg episode reward: [(0, '-2.382')] [2022-07-11 08:56:35,979][26022] Updated weights on worker 0-0, policy_version 1122754 (0.00088) [2022-07-11 08:56:37,619][26022] Updated weights on worker 0-0, policy_version 1122764 (0.00081) [2022-07-11 08:56:39,444][26022] Updated weights on worker 0-0, policy_version 1122774 (0.00083) [2022-07-11 08:56:39,543][25689] Fps is (10 sec: 5581.2, 60 sec: 5545.8, 300 sec: 5572.3). Total num frames: 1149720576. Throughput: 0: 5762.0. Samples: 1149729318. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:56:39,544][25689] Avg episode reward: [(0, '-0.111')] [2022-07-11 08:56:41,282][26022] Updated weights on worker 0-0, policy_version 1122784 (0.00076) [2022-07-11 08:56:42,871][26022] Updated weights on worker 0-0, policy_version 1122794 (0.00086) [2022-07-11 08:56:44,602][25689] Fps is (10 sec: 5550.4, 60 sec: 5528.9, 300 sec: 5566.6). Total num frames: 1149748224. Throughput: 0: 4968.5. Samples: 1149746404. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:56:44,602][25689] Avg episode reward: [(0, '-0.441')] [2022-07-11 08:56:44,887][26022] Updated weights on worker 0-0, policy_version 1122804 (0.00054) [2022-07-11 08:56:46,805][26022] Updated weights on worker 0-0, policy_version 1122814 (0.00091) [2022-07-11 08:56:48,528][26022] Updated weights on worker 0-0, policy_version 1122824 (0.00095) [2022-07-11 08:56:49,649][25689] Fps is (10 sec: 5573.3, 60 sec: 5527.9, 300 sec: 5573.2). Total num frames: 1149776896. Throughput: 0: 5824.3. Samples: 1149780064. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:56:49,649][25689] Avg episode reward: [(0, '0.117')] [2022-07-11 08:56:50,612][26022] Updated weights on worker 0-0, policy_version 1122834 (0.00080) [2022-07-11 08:56:52,081][26022] Updated weights on worker 0-0, policy_version 1122844 (0.00085) [2022-07-11 08:56:54,106][26022] Updated weights on worker 0-0, policy_version 1122854 (0.00093) [2022-07-11 08:56:54,658][25689] Fps is (10 sec: 5600.4, 60 sec: 5512.8, 300 sec: 5566.5). Total num frames: 1149804544. Throughput: 0: 5865.2. Samples: 1149813752. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:56:54,659][25689] Avg episode reward: [(0, '0.074')] [2022-07-11 08:56:55,891][26022] Updated weights on worker 0-0, policy_version 1122864 (0.00095) [2022-07-11 08:56:57,706][26022] Updated weights on worker 0-0, policy_version 1122874 (0.00083) [2022-07-11 08:56:59,522][26022] Updated weights on worker 0-0, policy_version 1122884 (0.00086) [2022-07-11 08:56:59,666][25689] Fps is (10 sec: 5724.6, 60 sec: 5563.6, 300 sec: 5580.4). Total num frames: 1149834240. Throughput: 0: 5030.2. Samples: 1149830538. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:56:59,666][25689] Avg episode reward: [(0, '0.157')] [2022-07-11 08:57:01,514][26022] Updated weights on worker 0-0, policy_version 1122894 (0.00093) [2022-07-11 08:57:03,533][26022] Updated weights on worker 0-0, policy_version 1122904 (0.00087) [2022-07-11 08:57:04,686][25689] Fps is (10 sec: 5514.2, 60 sec: 5529.4, 300 sec: 5568.5). Total num frames: 1149859840. Throughput: 0: 5769.4. Samples: 1149862278. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:04,687][25689] Avg episode reward: [(0, '0.083')] [2022-07-11 08:57:05,560][26022] Updated weights on worker 0-0, policy_version 1122914 (0.00087) [2022-07-11 08:57:06,992][26022] Updated weights on worker 0-0, policy_version 1122924 (0.00087) [2022-07-11 08:57:09,116][26022] Updated weights on worker 0-0, policy_version 1122934 (0.00088) [2022-07-11 08:57:09,727][25689] Fps is (10 sec: 5394.3, 60 sec: 5530.4, 300 sec: 5568.0). Total num frames: 1149888512. Throughput: 0: 5789.1. Samples: 1149896298. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:09,727][25689] Avg episode reward: [(0, '-0.550')] [2022-07-11 08:57:10,683][26022] Updated weights on worker 0-0, policy_version 1122944 (0.00545) [2022-07-11 08:57:12,633][26022] Updated weights on worker 0-0, policy_version 1122954 (0.00079) [2022-07-11 08:57:14,456][26022] Updated weights on worker 0-0, policy_version 1122964 (0.00080) [2022-07-11 08:57:14,759][25689] Fps is (10 sec: 5693.2, 60 sec: 5553.0, 300 sec: 5574.3). Total num frames: 1149917184. Throughput: 0: 4953.1. Samples: 1149913310. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:14,759][25689] Avg episode reward: [(0, '-0.206')] [2022-07-11 08:57:16,372][26022] Updated weights on worker 0-0, policy_version 1122974 (0.00082) [2022-07-11 08:57:18,079][26022] Updated weights on worker 0-0, policy_version 1122984 (0.00085) [2022-07-11 08:57:19,773][25689] Fps is (10 sec: 5504.1, 60 sec: 5543.8, 300 sec: 5567.4). Total num frames: 1149943808. Throughput: 0: 5786.5. Samples: 1149946890. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:19,775][25689] Avg episode reward: [(0, '0.201')] [2022-07-11 08:57:19,942][26022] Updated weights on worker 0-0, policy_version 1122994 (0.00087) [2022-07-11 08:57:21,796][26022] Updated weights on worker 0-0, policy_version 1123004 (0.00091) [2022-07-11 08:57:23,549][26022] Updated weights on worker 0-0, policy_version 1123014 (0.00084) [2022-07-11 08:57:24,777][25689] Fps is (10 sec: 5519.3, 60 sec: 5564.0, 300 sec: 5575.9). Total num frames: 1149972480. Throughput: 0: 5900.1. Samples: 1149980818. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:24,779][25689] Avg episode reward: [(0, '-0.592')] [2022-07-11 08:57:25,432][26022] Updated weights on worker 0-0, policy_version 1123024 (0.00090) [2022-07-11 08:57:27,195][26022] Updated weights on worker 0-0, policy_version 1123034 (0.00088) [2022-07-11 08:57:28,975][26022] Updated weights on worker 0-0, policy_version 1123044 (0.00081) [2022-07-11 08:57:29,863][25689] Fps is (10 sec: 5784.8, 60 sec: 5593.7, 300 sec: 5574.4). Total num frames: 1150002176. Throughput: 0: 5031.1. Samples: 1149997606. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:29,863][25689] Avg episode reward: [(0, '0.151')] [2022-07-11 08:57:30,949][26022] Updated weights on worker 0-0, policy_version 1123054 (0.00086) [2022-07-11 08:57:32,559][26022] Updated weights on worker 0-0, policy_version 1123064 (0.00070) [2022-07-11 08:57:34,555][26022] Updated weights on worker 0-0, policy_version 1123074 (0.00091) [2022-07-11 08:57:34,909][25689] Fps is (10 sec: 5558.6, 60 sec: 5572.7, 300 sec: 5573.9). Total num frames: 1150028800. Throughput: 0: 5849.1. Samples: 1150031174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:34,910][25689] Avg episode reward: [(0, '0.939')] [2022-07-11 08:57:36,383][26022] Updated weights on worker 0-0, policy_version 1123084 (0.00084) [2022-07-11 08:57:38,195][26022] Updated weights on worker 0-0, policy_version 1123094 (0.00081) [2022-07-11 08:57:39,943][25689] Fps is (10 sec: 5485.9, 60 sec: 5577.8, 300 sec: 5573.3). Total num frames: 1150057472. Throughput: 0: 5862.6. Samples: 1150065136. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:39,944][25689] Avg episode reward: [(0, '1.192')] [2022-07-11 08:57:39,951][26022] Updated weights on worker 0-0, policy_version 1123104 (0.00079) [2022-07-11 08:57:41,795][26022] Updated weights on worker 0-0, policy_version 1123114 (0.00084) [2022-07-11 08:57:43,475][26022] Updated weights on worker 0-0, policy_version 1123124 (0.00078) [2022-07-11 08:57:44,963][25689] Fps is (10 sec: 5704.0, 60 sec: 5598.3, 300 sec: 5570.4). Total num frames: 1150086144. Throughput: 0: 5025.1. Samples: 1150082252. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:44,963][25689] Avg episode reward: [(0, '1.285')] [2022-07-11 08:57:45,678][26022] Updated weights on worker 0-0, policy_version 1123134 (0.00087) [2022-07-11 08:57:47,265][26022] Updated weights on worker 0-0, policy_version 1123144 (0.00092) [2022-07-11 08:57:49,248][26022] Updated weights on worker 0-0, policy_version 1123154 (0.00095) [2022-07-11 08:57:50,072][25689] Fps is (10 sec: 5560.0, 60 sec: 5575.6, 300 sec: 5575.5). Total num frames: 1150113792. Throughput: 0: 5845.9. Samples: 1150115746. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:50,073][25689] Avg episode reward: [(0, '1.015')] [2022-07-11 08:57:51,043][26022] Updated weights on worker 0-0, policy_version 1123164 (0.00086) [2022-07-11 08:57:51,590][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:57:51,606][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001123168_1150124032.pth [2022-07-11 08:57:51,608][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001121208_1148116992.pth [2022-07-11 08:57:52,907][26022] Updated weights on worker 0-0, policy_version 1123174 (0.00080) [2022-07-11 08:57:54,544][26022] Updated weights on worker 0-0, policy_version 1123184 (0.00090) [2022-07-11 08:57:55,075][25689] Fps is (10 sec: 5670.9, 60 sec: 5610.1, 300 sec: 5575.9). Total num frames: 1150143488. Throughput: 0: 5865.7. Samples: 1150149456. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:57:55,075][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 08:57:56,707][26022] Updated weights on worker 0-0, policy_version 1123194 (0.00086) [2022-07-11 08:57:57,951][26022] Updated weights on worker 0-0, policy_version 1123204 (0.00088) [2022-07-11 08:58:00,114][25689] Fps is (10 sec: 5608.4, 60 sec: 5556.3, 300 sec: 5578.6). Total num frames: 1150170112. Throughput: 0: 5010.5. Samples: 1150166206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:00,115][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 08:58:00,354][26022] Updated weights on worker 0-0, policy_version 1123214 (0.00088) [2022-07-11 08:58:01,993][26022] Updated weights on worker 0-0, policy_version 1123224 (0.00081) [2022-07-11 08:58:04,241][26022] Updated weights on worker 0-0, policy_version 1123234 (0.00088) [2022-07-11 08:58:05,118][25689] Fps is (10 sec: 5505.7, 60 sec: 5608.7, 300 sec: 5579.5). Total num frames: 1150198784. Throughput: 0: 5730.9. Samples: 1150197760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:05,118][25689] Avg episode reward: [(0, '-0.705')] [2022-07-11 08:58:05,757][26022] Updated weights on worker 0-0, policy_version 1123244 (0.00089) [2022-07-11 08:58:07,890][26022] Updated weights on worker 0-0, policy_version 1123254 (0.00093) [2022-07-11 08:58:09,533][26022] Updated weights on worker 0-0, policy_version 1123264 (0.00090) [2022-07-11 08:58:10,160][25689] Fps is (10 sec: 5402.7, 60 sec: 5557.8, 300 sec: 5575.5). Total num frames: 1150224384. Throughput: 0: 5749.2. Samples: 1150231234. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:10,160][25689] Avg episode reward: [(0, '-1.300')] [2022-07-11 08:58:11,342][26022] Updated weights on worker 0-0, policy_version 1123274 (0.00811) [2022-07-11 08:58:13,349][26022] Updated weights on worker 0-0, policy_version 1123284 (0.00091) [2022-07-11 08:58:15,193][25689] Fps is (10 sec: 5284.9, 60 sec: 5540.6, 300 sec: 5564.6). Total num frames: 1150252032. Throughput: 0: 4908.2. Samples: 1150248204. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:15,194][25689] Avg episode reward: [(0, '-1.034')] [2022-07-11 08:58:15,284][26022] Updated weights on worker 0-0, policy_version 1123294 (0.00085) [2022-07-11 08:58:16,861][26022] Updated weights on worker 0-0, policy_version 1123304 (0.00091) [2022-07-11 08:58:19,038][26022] Updated weights on worker 0-0, policy_version 1123314 (0.00083) [2022-07-11 08:58:20,197][25689] Fps is (10 sec: 5713.1, 60 sec: 5592.5, 300 sec: 5571.5). Total num frames: 1150281728. Throughput: 0: 5759.1. Samples: 1150281864. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:20,198][25689] Avg episode reward: [(0, '-1.980')] [2022-07-11 08:58:20,437][26022] Updated weights on worker 0-0, policy_version 1123324 (0.00095) [2022-07-11 08:58:22,563][26022] Updated weights on worker 0-0, policy_version 1123334 (0.00086) [2022-07-11 08:58:24,061][26022] Updated weights on worker 0-0, policy_version 1123344 (0.00087) [2022-07-11 08:58:25,212][25689] Fps is (10 sec: 5723.8, 60 sec: 5574.6, 300 sec: 5572.3). Total num frames: 1150309376. Throughput: 0: 5864.4. Samples: 1150315598. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:25,212][25689] Avg episode reward: [(0, '-2.596')] [2022-07-11 08:58:26,243][26022] Updated weights on worker 0-0, policy_version 1123354 (0.00084) [2022-07-11 08:58:27,744][26022] Updated weights on worker 0-0, policy_version 1123364 (0.00095) [2022-07-11 08:58:29,776][26022] Updated weights on worker 0-0, policy_version 1123374 (0.00086) [2022-07-11 08:58:30,329][25689] Fps is (10 sec: 5457.4, 60 sec: 5537.8, 300 sec: 5566.8). Total num frames: 1150337024. Throughput: 0: 5024.7. Samples: 1150332580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:30,330][25689] Avg episode reward: [(0, '-1.887')] [2022-07-11 08:58:31,306][26022] Updated weights on worker 0-0, policy_version 1123384 (0.00089) [2022-07-11 08:58:33,421][26022] Updated weights on worker 0-0, policy_version 1123394 (0.00085) [2022-07-11 08:58:34,983][26022] Updated weights on worker 0-0, policy_version 1123404 (0.00091) [2022-07-11 08:58:35,347][25689] Fps is (10 sec: 5557.0, 60 sec: 5574.3, 300 sec: 5570.1). Total num frames: 1150365696. Throughput: 0: 5871.1. Samples: 1150366528. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:35,347][25689] Avg episode reward: [(0, '-1.719')] [2022-07-11 08:58:37,210][26022] Updated weights on worker 0-0, policy_version 1123414 (0.00087) [2022-07-11 08:58:38,719][26022] Updated weights on worker 0-0, policy_version 1123424 (0.00088) [2022-07-11 08:58:40,385][25689] Fps is (10 sec: 5601.1, 60 sec: 5556.9, 300 sec: 5570.3). Total num frames: 1150393344. Throughput: 0: 5871.1. Samples: 1150400388. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:40,386][25689] Avg episode reward: [(0, '-2.157')] [2022-07-11 08:58:40,792][26022] Updated weights on worker 0-0, policy_version 1123434 (0.00050) [2022-07-11 08:58:42,295][26022] Updated weights on worker 0-0, policy_version 1123444 (0.00093) [2022-07-11 08:58:44,427][26022] Updated weights on worker 0-0, policy_version 1123454 (0.00089) [2022-07-11 08:58:45,402][25689] Fps is (10 sec: 5702.8, 60 sec: 5574.1, 300 sec: 5574.7). Total num frames: 1150423040. Throughput: 0: 5038.8. Samples: 1150417334. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:45,404][25689] Avg episode reward: [(0, '-1.859')] [2022-07-11 08:58:46,079][26022] Updated weights on worker 0-0, policy_version 1123464 (0.00105) [2022-07-11 08:58:47,991][26022] Updated weights on worker 0-0, policy_version 1123474 (0.00096) [2022-07-11 08:58:49,728][26022] Updated weights on worker 0-0, policy_version 1123484 (0.00090) [2022-07-11 08:58:50,461][25689] Fps is (10 sec: 5690.6, 60 sec: 5578.7, 300 sec: 5570.7). Total num frames: 1150450688. Throughput: 0: 5878.2. Samples: 1150450920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:50,463][25689] Avg episode reward: [(0, '-1.049')] [2022-07-11 08:58:51,575][26022] Updated weights on worker 0-0, policy_version 1123494 (0.00084) [2022-07-11 08:58:53,518][26022] Updated weights on worker 0-0, policy_version 1123504 (0.00084) [2022-07-11 08:58:55,324][26022] Updated weights on worker 0-0, policy_version 1123514 (0.00092) [2022-07-11 08:58:55,495][25689] Fps is (10 sec: 5479.0, 60 sec: 5542.0, 300 sec: 5566.8). Total num frames: 1150478336. Throughput: 0: 5841.4. Samples: 1150484220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:58:55,495][25689] Avg episode reward: [(0, '-0.013')] [2022-07-11 08:58:57,154][26022] Updated weights on worker 0-0, policy_version 1123524 (0.00085) [2022-07-11 08:58:58,934][26022] Updated weights on worker 0-0, policy_version 1123534 (0.00101) [2022-07-11 08:59:00,511][25689] Fps is (10 sec: 5706.3, 60 sec: 5595.0, 300 sec: 5583.9). Total num frames: 1150508032. Throughput: 0: 4984.5. Samples: 1150500706. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:00,512][25689] Avg episode reward: [(0, '-0.438')] [2022-07-11 08:59:00,864][26022] Updated weights on worker 0-0, policy_version 1123544 (0.00078) [2022-07-11 08:59:03,190][26022] Updated weights on worker 0-0, policy_version 1123554 (0.00061) [2022-07-11 08:59:04,837][26022] Updated weights on worker 0-0, policy_version 1123564 (0.00081) [2022-07-11 08:59:05,519][25689] Fps is (10 sec: 5414.1, 60 sec: 5526.8, 300 sec: 5569.2). Total num frames: 1150532608. Throughput: 0: 5724.3. Samples: 1150532488. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:05,520][25689] Avg episode reward: [(0, '0.146')] [2022-07-11 08:59:06,686][26022] Updated weights on worker 0-0, policy_version 1123574 (0.00088) [2022-07-11 08:59:08,371][26022] Updated weights on worker 0-0, policy_version 1123584 (0.00093) [2022-07-11 08:59:10,502][26022] Updated weights on worker 0-0, policy_version 1123594 (0.00079) [2022-07-11 08:59:10,662][25689] Fps is (10 sec: 5245.6, 60 sec: 5568.4, 300 sec: 5563.5). Total num frames: 1150561280. Throughput: 0: 5704.1. Samples: 1150566144. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:10,662][25689] Avg episode reward: [(0, '0.047')] [2022-07-11 08:59:12,175][26022] Updated weights on worker 0-0, policy_version 1123604 (0.00095) [2022-07-11 08:59:13,980][26022] Updated weights on worker 0-0, policy_version 1123614 (0.00089) [2022-07-11 08:59:15,664][25689] Fps is (10 sec: 5652.5, 60 sec: 5588.2, 300 sec: 5574.5). Total num frames: 1150589952. Throughput: 0: 4900.2. Samples: 1150583054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:15,664][25689] Avg episode reward: [(0, '-0.121')] [2022-07-11 08:59:15,833][26022] Updated weights on worker 0-0, policy_version 1123624 (0.00082) [2022-07-11 08:59:17,494][26022] Updated weights on worker 0-0, policy_version 1123634 (0.00087) [2022-07-11 08:59:19,615][26022] Updated weights on worker 0-0, policy_version 1123644 (0.00086) [2022-07-11 08:59:20,669][25689] Fps is (10 sec: 5730.0, 60 sec: 5571.1, 300 sec: 5572.0). Total num frames: 1150618624. Throughput: 0: 5760.0. Samples: 1150616820. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:20,670][25689] Avg episode reward: [(0, '-1.693')] [2022-07-11 08:59:21,359][26022] Updated weights on worker 0-0, policy_version 1123654 (0.00087) [2022-07-11 08:59:23,024][26022] Updated weights on worker 0-0, policy_version 1123664 (0.00091) [2022-07-11 08:59:25,021][26022] Updated weights on worker 0-0, policy_version 1123674 (0.00089) [2022-07-11 08:59:25,739][25689] Fps is (10 sec: 5590.2, 60 sec: 5566.1, 300 sec: 5568.5). Total num frames: 1150646272. Throughput: 0: 5833.8. Samples: 1150650446. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:25,739][25689] Avg episode reward: [(0, '-1.996')] [2022-07-11 08:59:26,900][26022] Updated weights on worker 0-0, policy_version 1123684 (0.00098) [2022-07-11 08:59:28,541][26022] Updated weights on worker 0-0, policy_version 1123694 (0.00089) [2022-07-11 08:59:30,677][26022] Updated weights on worker 0-0, policy_version 1123704 (0.00092) [2022-07-11 08:59:30,806][25689] Fps is (10 sec: 5354.1, 60 sec: 5553.8, 300 sec: 5561.0). Total num frames: 1150672896. Throughput: 0: 5840.3. Samples: 1150683792. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:30,806][25689] Avg episode reward: [(0, '-1.802')] [2022-07-11 08:59:32,267][26022] Updated weights on worker 0-0, policy_version 1123714 (0.00116) [2022-07-11 08:59:34,324][26022] Updated weights on worker 0-0, policy_version 1123724 (0.00081) [2022-07-11 08:59:35,899][25689] Fps is (10 sec: 5543.3, 60 sec: 5563.8, 300 sec: 5563.2). Total num frames: 1150702592. Throughput: 0: 5794.3. Samples: 1150700300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:35,900][25689] Avg episode reward: [(0, '-2.524')] [2022-07-11 08:59:36,047][26022] Updated weights on worker 0-0, policy_version 1123734 (0.00093) [2022-07-11 08:59:38,041][26022] Updated weights on worker 0-0, policy_version 1123744 (0.00083) [2022-07-11 08:59:39,751][26022] Updated weights on worker 0-0, policy_version 1123754 (0.00086) [2022-07-11 08:59:40,910][25689] Fps is (10 sec: 5675.2, 60 sec: 5566.2, 300 sec: 5563.4). Total num frames: 1150730240. Throughput: 0: 5768.8. Samples: 1150733584. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:40,910][25689] Avg episode reward: [(0, '-3.124')] [2022-07-11 08:59:41,611][26022] Updated weights on worker 0-0, policy_version 1123764 (0.00082) [2022-07-11 08:59:43,371][26022] Updated weights on worker 0-0, policy_version 1123774 (0.00523) [2022-07-11 08:59:45,311][26022] Updated weights on worker 0-0, policy_version 1123784 (0.00089) [2022-07-11 08:59:45,924][25689] Fps is (10 sec: 5515.4, 60 sec: 5532.7, 300 sec: 5561.1). Total num frames: 1150757888. Throughput: 0: 5793.1. Samples: 1150767384. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:45,925][25689] Avg episode reward: [(0, '-1.969')] [2022-07-11 08:59:47,103][26022] Updated weights on worker 0-0, policy_version 1123794 (0.00086) [2022-07-11 08:59:48,975][26022] Updated weights on worker 0-0, policy_version 1123804 (0.00082) [2022-07-11 08:59:50,802][26022] Updated weights on worker 0-0, policy_version 1123814 (0.00092) [2022-07-11 08:59:50,995][25689] Fps is (10 sec: 5584.6, 60 sec: 5548.5, 300 sec: 5560.1). Total num frames: 1150786560. Throughput: 0: 4969.3. Samples: 1150784118. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:50,995][25689] Avg episode reward: [(0, '-1.364')] [2022-07-11 08:59:51,753][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 08:59:51,766][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001123819_1150790656.pth [2022-07-11 08:59:51,766][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001121862_1148786688.pth [2022-07-11 08:59:52,665][26022] Updated weights on worker 0-0, policy_version 1123824 (0.00086) [2022-07-11 08:59:54,496][26022] Updated weights on worker 0-0, policy_version 1123834 (0.00081) [2022-07-11 08:59:56,046][25689] Fps is (10 sec: 5564.5, 60 sec: 5546.9, 300 sec: 5556.3). Total num frames: 1150814208. Throughput: 0: 5824.5. Samples: 1150817646. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 08:59:56,046][25689] Avg episode reward: [(0, '-1.193')] [2022-07-11 08:59:56,274][26022] Updated weights on worker 0-0, policy_version 1123844 (0.00090) [2022-07-11 08:59:58,103][26022] Updated weights on worker 0-0, policy_version 1123854 (0.00086) [2022-07-11 09:00:00,174][26022] Updated weights on worker 0-0, policy_version 1123864 (0.00051) [2022-07-11 09:00:01,064][25689] Fps is (10 sec: 5593.2, 60 sec: 5529.8, 300 sec: 5564.6). Total num frames: 1150842880. Throughput: 0: 5824.4. Samples: 1150850970. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:01,065][25689] Avg episode reward: [(0, '-0.838')] [2022-07-11 09:00:02,066][26022] Updated weights on worker 0-0, policy_version 1123874 (0.00092) [2022-07-11 09:00:04,080][26022] Updated weights on worker 0-0, policy_version 1123884 (0.00083) [2022-07-11 09:00:05,896][26022] Updated weights on worker 0-0, policy_version 1123894 (0.00091) [2022-07-11 09:00:06,149][25689] Fps is (10 sec: 5371.7, 60 sec: 5539.7, 300 sec: 5557.5). Total num frames: 1150868480. Throughput: 0: 4865.9. Samples: 1150865794. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:06,150][25689] Avg episode reward: [(0, '0.035')] [2022-07-11 09:00:07,550][26022] Updated weights on worker 0-0, policy_version 1123904 (0.00094) [2022-07-11 09:00:09,703][26022] Updated weights on worker 0-0, policy_version 1123914 (0.00086) [2022-07-11 09:00:11,256][25689] Fps is (10 sec: 5324.9, 60 sec: 5542.9, 300 sec: 5556.0). Total num frames: 1150897152. Throughput: 0: 5675.8. Samples: 1150899118. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:11,258][25689] Avg episode reward: [(0, '0.821')] [2022-07-11 09:00:11,371][26022] Updated weights on worker 0-0, policy_version 1123924 (0.00084) [2022-07-11 09:00:13,161][26022] Updated weights on worker 0-0, policy_version 1123934 (0.00091) [2022-07-11 09:00:15,109][26022] Updated weights on worker 0-0, policy_version 1123944 (0.00088) [2022-07-11 09:00:16,284][25689] Fps is (10 sec: 5556.9, 60 sec: 5523.7, 300 sec: 5559.3). Total num frames: 1150924800. Throughput: 0: 5693.6. Samples: 1150932876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:16,285][25689] Avg episode reward: [(0, '0.829')] [2022-07-11 09:00:16,687][26022] Updated weights on worker 0-0, policy_version 1123954 (0.00085) [2022-07-11 09:00:18,803][26022] Updated weights on worker 0-0, policy_version 1123964 (0.00085) [2022-07-11 09:00:20,253][26022] Updated weights on worker 0-0, policy_version 1123974 (0.00077) [2022-07-11 09:00:21,309][25689] Fps is (10 sec: 5500.7, 60 sec: 5505.1, 300 sec: 5556.0). Total num frames: 1150952448. Throughput: 0: 4870.1. Samples: 1150949564. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:21,309][25689] Avg episode reward: [(0, '-1.639')] [2022-07-11 09:00:22,603][26022] Updated weights on worker 0-0, policy_version 1123984 (0.00085) [2022-07-11 09:00:24,109][26022] Updated weights on worker 0-0, policy_version 1123994 (0.00086) [2022-07-11 09:00:26,192][26022] Updated weights on worker 0-0, policy_version 1124004 (0.00085) [2022-07-11 09:00:26,318][25689] Fps is (10 sec: 5612.8, 60 sec: 5527.4, 300 sec: 5556.8). Total num frames: 1150981120. Throughput: 0: 5819.7. Samples: 1150983174. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:26,319][25689] Avg episode reward: [(0, '-3.181')] [2022-07-11 09:00:27,927][26022] Updated weights on worker 0-0, policy_version 1124014 (0.00093) [2022-07-11 09:00:29,766][26022] Updated weights on worker 0-0, policy_version 1124024 (0.00084) [2022-07-11 09:00:31,369][25689] Fps is (10 sec: 5598.5, 60 sec: 5545.8, 300 sec: 5553.5). Total num frames: 1151008768. Throughput: 0: 5827.1. Samples: 1151016314. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:00:31,369][25689] Avg episode reward: [(0, '-3.316')] [2022-07-11 09:00:31,915][26022] Updated weights on worker 0-0, policy_version 1124034 (0.00092) [2022-07-11 09:00:33,381][26022] Updated weights on worker 0-0, policy_version 1124044 (0.00088) [2022-07-11 09:00:35,435][26022] Updated weights on worker 0-0, policy_version 1124054 (0.00085) [2022-07-11 09:00:36,371][25689] Fps is (10 sec: 5602.7, 60 sec: 5537.2, 300 sec: 5557.5). Total num frames: 1151037440. Throughput: 0: 4979.0. Samples: 1151032886. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:00:36,371][25689] Avg episode reward: [(0, '-3.065')] [2022-07-11 09:00:37,143][26022] Updated weights on worker 0-0, policy_version 1124064 (0.00080) [2022-07-11 09:00:38,970][26022] Updated weights on worker 0-0, policy_version 1124074 (0.00084) [2022-07-11 09:00:41,010][26022] Updated weights on worker 0-0, policy_version 1124084 (0.00094) [2022-07-11 09:00:41,400][25689] Fps is (10 sec: 5614.6, 60 sec: 5535.6, 300 sec: 5554.6). Total num frames: 1151065088. Throughput: 0: 5841.3. Samples: 1151066920. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:00:41,400][25689] Avg episode reward: [(0, '-2.764')] [2022-07-11 09:00:42,345][26022] Updated weights on worker 0-0, policy_version 1124094 (0.00087) [2022-07-11 09:00:44,474][26022] Updated weights on worker 0-0, policy_version 1124104 (0.00088) [2022-07-11 09:00:46,291][26022] Updated weights on worker 0-0, policy_version 1124114 (0.00085) [2022-07-11 09:00:46,420][25689] Fps is (10 sec: 5604.1, 60 sec: 5552.0, 300 sec: 5554.8). Total num frames: 1151093760. Throughput: 0: 5862.5. Samples: 1151101022. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:00:46,421][25689] Avg episode reward: [(0, '-1.505')] [2022-07-11 09:00:47,866][26022] Updated weights on worker 0-0, policy_version 1124124 (0.00090) [2022-07-11 09:00:49,896][26022] Updated weights on worker 0-0, policy_version 1124134 (0.00090) [2022-07-11 09:00:51,491][25689] Fps is (10 sec: 5682.6, 60 sec: 5552.0, 300 sec: 5554.1). Total num frames: 1151122432. Throughput: 0: 5048.1. Samples: 1151117892. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:00:51,491][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 09:00:51,592][26022] Updated weights on worker 0-0, policy_version 1124144 (0.00089) [2022-07-11 09:00:53,455][26022] Updated weights on worker 0-0, policy_version 1124154 (0.00085) [2022-07-11 09:00:55,412][26022] Updated weights on worker 0-0, policy_version 1124164 (0.00089) [2022-07-11 09:00:56,495][25689] Fps is (10 sec: 5590.2, 60 sec: 5556.2, 300 sec: 5557.6). Total num frames: 1151150080. Throughput: 0: 5900.0. Samples: 1151151622. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:00:56,497][25689] Avg episode reward: [(0, '0.575')] [2022-07-11 09:00:56,912][26022] Updated weights on worker 0-0, policy_version 1124174 (0.00086) [2022-07-11 09:00:59,085][26022] Updated weights on worker 0-0, policy_version 1124184 (0.00087) [2022-07-11 09:01:00,786][26022] Updated weights on worker 0-0, policy_version 1124194 (0.00081) [2022-07-11 09:01:01,518][25689] Fps is (10 sec: 5616.6, 60 sec: 5555.8, 300 sec: 5560.9). Total num frames: 1151178752. Throughput: 0: 5888.3. Samples: 1151185384. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:01,519][25689] Avg episode reward: [(0, '0.778')] [2022-07-11 09:01:03,135][26022] Updated weights on worker 0-0, policy_version 1124204 (0.00082) [2022-07-11 09:01:04,681][26022] Updated weights on worker 0-0, policy_version 1124214 (0.00089) [2022-07-11 09:01:06,526][25689] Fps is (10 sec: 5308.4, 60 sec: 5545.9, 300 sec: 5548.0). Total num frames: 1151203328. Throughput: 0: 4941.5. Samples: 1151200374. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:06,527][25689] Avg episode reward: [(0, '0.768')] [2022-07-11 09:01:06,631][26022] Updated weights on worker 0-0, policy_version 1124224 (0.00097) [2022-07-11 09:01:08,189][26022] Updated weights on worker 0-0, policy_version 1124234 (0.00052) [2022-07-11 09:01:10,485][26022] Updated weights on worker 0-0, policy_version 1124244 (0.00091) [2022-07-11 09:01:11,572][25689] Fps is (10 sec: 5500.0, 60 sec: 5585.5, 300 sec: 5559.2). Total num frames: 1151234048. Throughput: 0: 5781.8. Samples: 1151233998. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:11,572][25689] Avg episode reward: [(0, '0.976')] [2022-07-11 09:01:11,912][26022] Updated weights on worker 0-0, policy_version 1124254 (0.00087) [2022-07-11 09:01:13,967][26022] Updated weights on worker 0-0, policy_version 1124264 (0.00083) [2022-07-11 09:01:15,755][26022] Updated weights on worker 0-0, policy_version 1124274 (0.00087) [2022-07-11 09:01:16,582][25689] Fps is (10 sec: 5601.0, 60 sec: 5553.3, 300 sec: 5554.0). Total num frames: 1151259648. Throughput: 0: 5759.1. Samples: 1151267300. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:16,582][25689] Avg episode reward: [(0, '2.364')] [2022-07-11 09:01:17,618][26022] Updated weights on worker 0-0, policy_version 1124284 (0.00087) [2022-07-11 09:01:19,395][26022] Updated weights on worker 0-0, policy_version 1124294 (0.00088) [2022-07-11 09:01:21,023][26022] Updated weights on worker 0-0, policy_version 1124304 (0.00085) [2022-07-11 09:01:21,602][25689] Fps is (10 sec: 5513.1, 60 sec: 5587.6, 300 sec: 5561.2). Total num frames: 1151289344. Throughput: 0: 4924.8. Samples: 1151284292. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:21,602][25689] Avg episode reward: [(0, '2.039')] [2022-07-11 09:01:23,171][26022] Updated weights on worker 0-0, policy_version 1124314 (0.00088) [2022-07-11 09:01:25,030][26022] Updated weights on worker 0-0, policy_version 1124324 (0.00090) [2022-07-11 09:01:26,633][25689] Fps is (10 sec: 5704.9, 60 sec: 5568.6, 300 sec: 5561.4). Total num frames: 1151316992. Throughput: 0: 5853.5. Samples: 1151318072. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:26,635][25689] Avg episode reward: [(0, '1.757')] [2022-07-11 09:01:26,726][26022] Updated weights on worker 0-0, policy_version 1124334 (0.00373) [2022-07-11 09:01:28,613][26022] Updated weights on worker 0-0, policy_version 1124344 (0.00091) [2022-07-11 09:01:30,507][26022] Updated weights on worker 0-0, policy_version 1124354 (0.00091) [2022-07-11 09:01:31,703][25689] Fps is (10 sec: 5575.6, 60 sec: 5583.8, 300 sec: 5563.6). Total num frames: 1151345664. Throughput: 0: 5829.1. Samples: 1151351344. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:31,704][25689] Avg episode reward: [(0, '1.697')] [2022-07-11 09:01:32,401][26022] Updated weights on worker 0-0, policy_version 1124364 (0.00083) [2022-07-11 09:01:34,030][26022] Updated weights on worker 0-0, policy_version 1124374 (0.00099) [2022-07-11 09:01:36,147][26022] Updated weights on worker 0-0, policy_version 1124384 (0.00084) [2022-07-11 09:01:36,723][25689] Fps is (10 sec: 5582.0, 60 sec: 5565.2, 300 sec: 5561.4). Total num frames: 1151373312. Throughput: 0: 5016.3. Samples: 1151368336. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:36,723][25689] Avg episode reward: [(0, '1.487')] [2022-07-11 09:01:37,568][26022] Updated weights on worker 0-0, policy_version 1124394 (0.00085) [2022-07-11 09:01:39,675][26022] Updated weights on worker 0-0, policy_version 1124404 (0.00075) [2022-07-11 09:01:41,211][26022] Updated weights on worker 0-0, policy_version 1124414 (0.00060) [2022-07-11 09:01:41,820][25689] Fps is (10 sec: 5567.0, 60 sec: 5575.9, 300 sec: 5564.1). Total num frames: 1151401984. Throughput: 0: 5832.1. Samples: 1151402204. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:41,820][25689] Avg episode reward: [(0, '1.338')] [2022-07-11 09:01:43,163][26022] Updated weights on worker 0-0, policy_version 1124424 (0.00078) [2022-07-11 09:01:44,974][26022] Updated weights on worker 0-0, policy_version 1124434 (0.00052) [2022-07-11 09:01:46,759][26022] Updated weights on worker 0-0, policy_version 1124444 (0.00086) [2022-07-11 09:01:46,868][25689] Fps is (10 sec: 5753.2, 60 sec: 5590.3, 300 sec: 5567.5). Total num frames: 1151431680. Throughput: 0: 5844.0. Samples: 1151436326. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:46,869][25689] Avg episode reward: [(0, '1.289')] [2022-07-11 09:01:48,629][26022] Updated weights on worker 0-0, policy_version 1124454 (0.00090) [2022-07-11 09:01:50,211][26022] Updated weights on worker 0-0, policy_version 1124464 (0.00088) [2022-07-11 09:01:51,834][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:01:51,844][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001124473_1151460352.pth [2022-07-11 09:01:51,845][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001122516_1149456384.pth [2022-07-11 09:01:51,914][25689] Fps is (10 sec: 5782.2, 60 sec: 5592.5, 300 sec: 5570.3). Total num frames: 1151460352. Throughput: 0: 5052.3. Samples: 1151453456. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:51,915][25689] Avg episode reward: [(0, '1.952')] [2022-07-11 09:01:52,162][26022] Updated weights on worker 0-0, policy_version 1124474 (0.00087) [2022-07-11 09:01:54,086][26022] Updated weights on worker 0-0, policy_version 1124484 (0.00090) [2022-07-11 09:01:55,925][26022] Updated weights on worker 0-0, policy_version 1124494 (0.00090) [2022-07-11 09:01:56,916][25689] Fps is (10 sec: 5503.2, 60 sec: 5575.8, 300 sec: 5560.1). Total num frames: 1151486976. Throughput: 0: 5903.8. Samples: 1151487554. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:01:56,917][25689] Avg episode reward: [(0, '1.633')] [2022-07-11 09:01:57,480][26022] Updated weights on worker 0-0, policy_version 1124504 (0.00083) [2022-07-11 09:01:59,438][26022] Updated weights on worker 0-0, policy_version 1124514 (0.00089) [2022-07-11 09:02:01,004][26022] Updated weights on worker 0-0, policy_version 1124524 (0.00089) [2022-07-11 09:02:01,983][25689] Fps is (10 sec: 5389.9, 60 sec: 5554.8, 300 sec: 5566.1). Total num frames: 1151514624. Throughput: 0: 5917.4. Samples: 1151521520. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:01,984][25689] Avg episode reward: [(0, '1.626')] [2022-07-11 09:02:03,427][26022] Updated weights on worker 0-0, policy_version 1124534 (0.00093) [2022-07-11 09:02:05,022][26022] Updated weights on worker 0-0, policy_version 1124544 (0.00085) [2022-07-11 09:02:06,944][26022] Updated weights on worker 0-0, policy_version 1124554 (0.00083) [2022-07-11 09:02:06,986][25689] Fps is (10 sec: 5592.8, 60 sec: 5623.0, 300 sec: 5566.8). Total num frames: 1151543296. Throughput: 0: 4983.8. Samples: 1151536592. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:06,995][25689] Avg episode reward: [(0, '1.438')] [2022-07-11 09:02:09,064][26022] Updated weights on worker 0-0, policy_version 1124564 (0.00080) [2022-07-11 09:02:10,650][26022] Updated weights on worker 0-0, policy_version 1124574 (0.00089) [2022-07-11 09:02:12,103][25689] Fps is (10 sec: 5565.5, 60 sec: 5565.7, 300 sec: 5561.8). Total num frames: 1151570944. Throughput: 0: 5784.4. Samples: 1151570236. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:12,103][25689] Avg episode reward: [(0, '1.612')] [2022-07-11 09:02:12,501][26022] Updated weights on worker 0-0, policy_version 1124584 (0.00082) [2022-07-11 09:02:14,222][26022] Updated weights on worker 0-0, policy_version 1124594 (0.00087) [2022-07-11 09:02:15,907][26022] Updated weights on worker 0-0, policy_version 1124604 (0.00091) [2022-07-11 09:02:17,112][25689] Fps is (10 sec: 5562.3, 60 sec: 5616.5, 300 sec: 5568.8). Total num frames: 1151599616. Throughput: 0: 5794.9. Samples: 1151604584. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:17,112][25689] Avg episode reward: [(0, '1.630')] [2022-07-11 09:02:17,999][26022] Updated weights on worker 0-0, policy_version 1124614 (0.00080) [2022-07-11 09:02:19,449][26022] Updated weights on worker 0-0, policy_version 1124624 (0.00088) [2022-07-11 09:02:21,549][26022] Updated weights on worker 0-0, policy_version 1124634 (0.00085) [2022-07-11 09:02:22,115][25689] Fps is (10 sec: 5932.1, 60 sec: 5635.0, 300 sec: 5575.7). Total num frames: 1151630336. Throughput: 0: 4976.7. Samples: 1151621708. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:22,116][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 09:02:23,422][26022] Updated weights on worker 0-0, policy_version 1124644 (0.00046) [2022-07-11 09:02:24,913][26022] Updated weights on worker 0-0, policy_version 1124654 (0.00089) [2022-07-11 09:02:27,007][26022] Updated weights on worker 0-0, policy_version 1124664 (0.00086) [2022-07-11 09:02:27,135][25689] Fps is (10 sec: 5721.2, 60 sec: 5619.2, 300 sec: 5566.6). Total num frames: 1151656960. Throughput: 0: 5917.8. Samples: 1151655826. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:27,136][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 09:02:28,442][26022] Updated weights on worker 0-0, policy_version 1124674 (0.00089) [2022-07-11 09:02:30,647][26022] Updated weights on worker 0-0, policy_version 1124684 (0.00086) [2022-07-11 09:02:32,204][25689] Fps is (10 sec: 5480.5, 60 sec: 5619.2, 300 sec: 5573.0). Total num frames: 1151685632. Throughput: 0: 5928.9. Samples: 1151689416. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:32,205][25689] Avg episode reward: [(0, '1.014')] [2022-07-11 09:02:32,475][26022] Updated weights on worker 0-0, policy_version 1124694 (0.00084) [2022-07-11 09:02:34,066][26022] Updated weights on worker 0-0, policy_version 1124704 (0.00090) [2022-07-11 09:02:36,146][26022] Updated weights on worker 0-0, policy_version 1124714 (0.00088) [2022-07-11 09:02:37,215][25689] Fps is (10 sec: 5587.1, 60 sec: 5620.0, 300 sec: 5570.0). Total num frames: 1151713280. Throughput: 0: 5896.1. Samples: 1151723116. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:37,219][25689] Avg episode reward: [(0, '0.971')] [2022-07-11 09:02:37,570][26022] Updated weights on worker 0-0, policy_version 1124724 (0.00086) [2022-07-11 09:02:39,775][26022] Updated weights on worker 0-0, policy_version 1124734 (0.00093) [2022-07-11 09:02:41,482][26022] Updated weights on worker 0-0, policy_version 1124744 (0.00086) [2022-07-11 09:02:42,303][25689] Fps is (10 sec: 5577.2, 60 sec: 5620.9, 300 sec: 5568.7). Total num frames: 1151741952. Throughput: 0: 5858.1. Samples: 1151739972. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:42,303][25689] Avg episode reward: [(0, '0.743')] [2022-07-11 09:02:43,155][26022] Updated weights on worker 0-0, policy_version 1124754 (0.00087) [2022-07-11 09:02:45,219][26022] Updated weights on worker 0-0, policy_version 1124764 (0.00100) [2022-07-11 09:02:46,918][26022] Updated weights on worker 0-0, policy_version 1124774 (0.00089) [2022-07-11 09:02:47,307][25689] Fps is (10 sec: 5580.6, 60 sec: 5591.1, 300 sec: 5570.7). Total num frames: 1151769600. Throughput: 0: 5859.3. Samples: 1151774022. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:47,309][25689] Avg episode reward: [(0, '1.252')] [2022-07-11 09:02:48,817][26022] Updated weights on worker 0-0, policy_version 1124784 (0.00085) [2022-07-11 09:02:50,549][26022] Updated weights on worker 0-0, policy_version 1124794 (0.00098) [2022-07-11 09:02:52,341][25689] Fps is (10 sec: 5610.3, 60 sec: 5592.2, 300 sec: 5566.7). Total num frames: 1151798272. Throughput: 0: 5882.2. Samples: 1151807866. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:52,342][25689] Avg episode reward: [(0, '0.242')] [2022-07-11 09:02:52,365][26022] Updated weights on worker 0-0, policy_version 1124804 (0.00080) [2022-07-11 09:02:54,013][26022] Updated weights on worker 0-0, policy_version 1124814 (0.00088) [2022-07-11 09:02:56,087][26022] Updated weights on worker 0-0, policy_version 1124824 (0.00085) [2022-07-11 09:02:57,383][25689] Fps is (10 sec: 5793.1, 60 sec: 5639.4, 300 sec: 5577.0). Total num frames: 1151827968. Throughput: 0: 5045.1. Samples: 1151824864. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:02:57,383][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 09:02:57,772][26022] Updated weights on worker 0-0, policy_version 1124834 (0.00080) [2022-07-11 09:02:59,657][26022] Updated weights on worker 0-0, policy_version 1124844 (0.00461) [2022-07-11 09:03:01,815][26022] Updated weights on worker 0-0, policy_version 1124854 (0.00082) [2022-07-11 09:03:02,427][25689] Fps is (10 sec: 5482.7, 60 sec: 5607.6, 300 sec: 5565.9). Total num frames: 1151853568. Throughput: 0: 5885.1. Samples: 1151858406. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:02,428][25689] Avg episode reward: [(0, '0.533')] [2022-07-11 09:03:03,752][26022] Updated weights on worker 0-0, policy_version 1124864 (0.00086) [2022-07-11 09:03:05,460][26022] Updated weights on worker 0-0, policy_version 1124874 (0.00081) [2022-07-11 09:03:07,403][26022] Updated weights on worker 0-0, policy_version 1124884 (0.00093) [2022-07-11 09:03:07,457][25689] Fps is (10 sec: 5285.4, 60 sec: 5588.1, 300 sec: 5573.0). Total num frames: 1151881216. Throughput: 0: 5785.8. Samples: 1151890606. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:07,458][25689] Avg episode reward: [(0, '-0.463')] [2022-07-11 09:03:09,174][26022] Updated weights on worker 0-0, policy_version 1124894 (0.00087) [2022-07-11 09:03:10,775][26022] Updated weights on worker 0-0, policy_version 1124904 (0.00084) [2022-07-11 09:03:12,554][25689] Fps is (10 sec: 5662.6, 60 sec: 5623.9, 300 sec: 5578.7). Total num frames: 1151910912. Throughput: 0: 4935.5. Samples: 1151907624. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:12,554][25689] Avg episode reward: [(0, '-0.428')] [2022-07-11 09:03:12,673][26022] Updated weights on worker 0-0, policy_version 1124914 (0.00087) [2022-07-11 09:03:14,624][26022] Updated weights on worker 0-0, policy_version 1124924 (0.00084) [2022-07-11 09:03:16,286][26022] Updated weights on worker 0-0, policy_version 1124934 (0.00084) [2022-07-11 09:03:17,601][25689] Fps is (10 sec: 5754.1, 60 sec: 5620.3, 300 sec: 5574.4). Total num frames: 1151939584. Throughput: 0: 5781.2. Samples: 1151941750. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:17,602][25689] Avg episode reward: [(0, '-0.400')] [2022-07-11 09:03:18,031][26022] Updated weights on worker 0-0, policy_version 1124944 (0.00086) [2022-07-11 09:03:19,920][26022] Updated weights on worker 0-0, policy_version 1124954 (0.00119) [2022-07-11 09:03:21,641][26022] Updated weights on worker 0-0, policy_version 1124964 (0.00089) [2022-07-11 09:03:22,649][25689] Fps is (10 sec: 5680.6, 60 sec: 5582.3, 300 sec: 5577.3). Total num frames: 1151968256. Throughput: 0: 5815.6. Samples: 1151976006. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:22,649][25689] Avg episode reward: [(0, '-0.664')] [2022-07-11 09:03:23,632][26022] Updated weights on worker 0-0, policy_version 1124974 (0.00087) [2022-07-11 09:03:25,127][26022] Updated weights on worker 0-0, policy_version 1124984 (0.00085) [2022-07-11 09:03:27,339][26022] Updated weights on worker 0-0, policy_version 1124994 (0.00087) [2022-07-11 09:03:27,721][25689] Fps is (10 sec: 5666.2, 60 sec: 5611.3, 300 sec: 5581.5). Total num frames: 1151996928. Throughput: 0: 5055.1. Samples: 1151993042. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:27,722][25689] Avg episode reward: [(0, '-0.732')] [2022-07-11 09:03:28,868][26022] Updated weights on worker 0-0, policy_version 1125004 (0.00087) [2022-07-11 09:03:30,775][26022] Updated weights on worker 0-0, policy_version 1125014 (0.00082) [2022-07-11 09:03:32,720][26022] Updated weights on worker 0-0, policy_version 1125024 (0.00098) [2022-07-11 09:03:32,816][25689] Fps is (10 sec: 5539.3, 60 sec: 5592.1, 300 sec: 5576.6). Total num frames: 1152024576. Throughput: 0: 5881.2. Samples: 1152026788. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:32,816][25689] Avg episode reward: [(0, '-1.648')] [2022-07-11 09:03:34,297][26022] Updated weights on worker 0-0, policy_version 1125034 (0.00087) [2022-07-11 09:03:36,370][26022] Updated weights on worker 0-0, policy_version 1125044 (0.00093) [2022-07-11 09:03:37,892][25689] Fps is (10 sec: 5537.5, 60 sec: 5602.9, 300 sec: 5579.4). Total num frames: 1152053248. Throughput: 0: 5844.2. Samples: 1152060334. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:37,893][25689] Avg episode reward: [(0, '-0.731')] [2022-07-11 09:03:38,284][26022] Updated weights on worker 0-0, policy_version 1125054 (0.00093) [2022-07-11 09:03:39,831][26022] Updated weights on worker 0-0, policy_version 1125064 (0.00089) [2022-07-11 09:03:41,987][26022] Updated weights on worker 0-0, policy_version 1125074 (0.00087) [2022-07-11 09:03:42,911][25689] Fps is (10 sec: 5680.4, 60 sec: 5609.3, 300 sec: 5575.9). Total num frames: 1152081920. Throughput: 0: 4981.7. Samples: 1152076946. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:42,913][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 09:03:43,534][26022] Updated weights on worker 0-0, policy_version 1125084 (0.00092) [2022-07-11 09:03:45,505][26022] Updated weights on worker 0-0, policy_version 1125094 (0.00098) [2022-07-11 09:03:47,255][26022] Updated weights on worker 0-0, policy_version 1125104 (0.00084) [2022-07-11 09:03:47,943][25689] Fps is (10 sec: 5705.0, 60 sec: 5623.6, 300 sec: 5579.8). Total num frames: 1152110592. Throughput: 0: 5835.1. Samples: 1152111040. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:47,944][25689] Avg episode reward: [(0, '-1.068')] [2022-07-11 09:03:49,082][26022] Updated weights on worker 0-0, policy_version 1125114 (0.00093) [2022-07-11 09:03:50,803][26022] Updated weights on worker 0-0, policy_version 1125124 (0.00079) [2022-07-11 09:03:51,966][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:03:51,982][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001125130_1152133120.pth [2022-07-11 09:03:51,983][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001123168_1150124032.pth [2022-07-11 09:03:52,948][26022] Updated weights on worker 0-0, policy_version 1125134 (0.00092) [2022-07-11 09:03:52,995][25689] Fps is (10 sec: 5483.2, 60 sec: 5588.1, 300 sec: 5576.1). Total num frames: 1152137216. Throughput: 0: 5825.8. Samples: 1152144350. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:52,996][25689] Avg episode reward: [(0, '-1.005')] [2022-07-11 09:03:54,534][26022] Updated weights on worker 0-0, policy_version 1125144 (0.00092) [2022-07-11 09:03:56,548][26022] Updated weights on worker 0-0, policy_version 1125154 (0.00087) [2022-07-11 09:03:58,023][25689] Fps is (10 sec: 5486.1, 60 sec: 5572.6, 300 sec: 5572.4). Total num frames: 1152165888. Throughput: 0: 5016.4. Samples: 1152161318. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:03:58,024][25689] Avg episode reward: [(0, '-0.254')] [2022-07-11 09:03:58,215][26022] Updated weights on worker 0-0, policy_version 1125164 (0.00090) [2022-07-11 09:04:00,023][26022] Updated weights on worker 0-0, policy_version 1125174 (0.00084) [2022-07-11 09:04:01,927][26022] Updated weights on worker 0-0, policy_version 1125184 (0.00080) [2022-07-11 09:04:03,034][25689] Fps is (10 sec: 5406.5, 60 sec: 5575.6, 300 sec: 5575.8). Total num frames: 1152191488. Throughput: 0: 5864.1. Samples: 1152194948. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:03,036][25689] Avg episode reward: [(0, '0.047')] [2022-07-11 09:04:04,047][26022] Updated weights on worker 0-0, policy_version 1125194 (0.00084) [2022-07-11 09:04:06,176][26022] Updated weights on worker 0-0, policy_version 1125204 (0.00086) [2022-07-11 09:04:07,687][26022] Updated weights on worker 0-0, policy_version 1125214 (0.00081) [2022-07-11 09:04:08,139][25689] Fps is (10 sec: 5466.2, 60 sec: 5602.5, 300 sec: 5579.9). Total num frames: 1152221184. Throughput: 0: 5715.9. Samples: 1152226474. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:08,140][25689] Avg episode reward: [(0, '-0.853')] [2022-07-11 09:04:09,619][26022] Updated weights on worker 0-0, policy_version 1125224 (0.00084) [2022-07-11 09:04:11,459][26022] Updated weights on worker 0-0, policy_version 1125234 (0.00085) [2022-07-11 09:04:13,173][26022] Updated weights on worker 0-0, policy_version 1125244 (0.00090) [2022-07-11 09:04:13,269][25689] Fps is (10 sec: 5702.7, 60 sec: 5582.5, 300 sec: 5577.5). Total num frames: 1152249856. Throughput: 0: 4872.6. Samples: 1152243134. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:13,269][25689] Avg episode reward: [(0, '-0.815')] [2022-07-11 09:04:15,104][26022] Updated weights on worker 0-0, policy_version 1125254 (0.00087) [2022-07-11 09:04:16,959][26022] Updated weights on worker 0-0, policy_version 1125264 (0.00096) [2022-07-11 09:04:18,309][25689] Fps is (10 sec: 5437.1, 60 sec: 5549.5, 300 sec: 5570.0). Total num frames: 1152276480. Throughput: 0: 5687.7. Samples: 1152276700. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:18,310][25689] Avg episode reward: [(0, '0.569')] [2022-07-11 09:04:18,725][26022] Updated weights on worker 0-0, policy_version 1125274 (0.00080) [2022-07-11 09:04:20,724][26022] Updated weights on worker 0-0, policy_version 1125284 (0.00088) [2022-07-11 09:04:22,371][26022] Updated weights on worker 0-0, policy_version 1125294 (0.00095) [2022-07-11 09:04:23,376][25689] Fps is (10 sec: 5572.1, 60 sec: 5564.5, 300 sec: 5576.9). Total num frames: 1152306176. Throughput: 0: 5671.4. Samples: 1152310318. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:23,377][25689] Avg episode reward: [(0, '1.014')] [2022-07-11 09:04:24,464][26022] Updated weights on worker 0-0, policy_version 1125304 (0.00093) [2022-07-11 09:04:25,932][26022] Updated weights on worker 0-0, policy_version 1125314 (0.00095) [2022-07-11 09:04:28,059][26022] Updated weights on worker 0-0, policy_version 1125324 (0.00096) [2022-07-11 09:04:28,461][25689] Fps is (10 sec: 5648.4, 60 sec: 5546.5, 300 sec: 5580.0). Total num frames: 1152333824. Throughput: 0: 4962.4. Samples: 1152327322. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:28,462][25689] Avg episode reward: [(0, '0.868')] [2022-07-11 09:04:29,792][26022] Updated weights on worker 0-0, policy_version 1125334 (0.00090) [2022-07-11 09:04:31,656][26022] Updated weights on worker 0-0, policy_version 1125344 (0.00092) [2022-07-11 09:04:33,426][26022] Updated weights on worker 0-0, policy_version 1125354 (0.00051) [2022-07-11 09:04:33,526][25689] Fps is (10 sec: 5549.0, 60 sec: 5566.1, 300 sec: 5577.1). Total num frames: 1152362496. Throughput: 0: 5812.6. Samples: 1152360880. Policy #0 lag: (min: 0.0, avg: 7.2, max: 18.0) [2022-07-11 09:04:33,526][25689] Avg episode reward: [(0, '0.463')] [2022-07-11 09:04:35,203][26022] Updated weights on worker 0-0, policy_version 1125364 (0.00082) [2022-07-11 09:04:37,190][26022] Updated weights on worker 0-0, policy_version 1125374 (0.00096) [2022-07-11 09:04:38,547][25689] Fps is (10 sec: 5685.6, 60 sec: 5571.2, 300 sec: 5580.4). Total num frames: 1152391168. Throughput: 0: 5833.2. Samples: 1152394752. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:04:38,547][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 09:04:38,718][26022] Updated weights on worker 0-0, policy_version 1125384 (0.00088) [2022-07-11 09:04:40,787][26022] Updated weights on worker 0-0, policy_version 1125394 (0.00087) [2022-07-11 09:04:42,516][26022] Updated weights on worker 0-0, policy_version 1125404 (0.00092) [2022-07-11 09:04:43,563][25689] Fps is (10 sec: 5713.4, 60 sec: 5571.5, 300 sec: 5583.8). Total num frames: 1152419840. Throughput: 0: 5860.6. Samples: 1152428622. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:04:43,564][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 09:04:44,351][26022] Updated weights on worker 0-0, policy_version 1125414 (0.00100) [2022-07-11 09:04:46,058][26022] Updated weights on worker 0-0, policy_version 1125424 (0.00085) [2022-07-11 09:04:48,036][26022] Updated weights on worker 0-0, policy_version 1125434 (0.00087) [2022-07-11 09:04:48,566][25689] Fps is (10 sec: 5621.4, 60 sec: 5557.3, 300 sec: 5581.6). Total num frames: 1152447488. Throughput: 0: 5878.7. Samples: 1152445512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:04:48,567][25689] Avg episode reward: [(0, '1.204')] [2022-07-11 09:04:49,736][26022] Updated weights on worker 0-0, policy_version 1125444 (0.00088) [2022-07-11 09:04:51,699][26022] Updated weights on worker 0-0, policy_version 1125454 (0.00094) [2022-07-11 09:04:53,251][26022] Updated weights on worker 0-0, policy_version 1125464 (0.00087) [2022-07-11 09:04:53,687][25689] Fps is (10 sec: 5562.6, 60 sec: 5584.7, 300 sec: 5583.7). Total num frames: 1152476160. Throughput: 0: 5877.0. Samples: 1152479370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:04:53,688][25689] Avg episode reward: [(0, '-0.246')] [2022-07-11 09:04:55,238][26022] Updated weights on worker 0-0, policy_version 1125474 (0.00086) [2022-07-11 09:04:57,161][26022] Updated weights on worker 0-0, policy_version 1125484 (0.00082) [2022-07-11 09:04:58,721][25689] Fps is (10 sec: 5545.7, 60 sec: 5567.2, 300 sec: 5580.0). Total num frames: 1152503808. Throughput: 0: 5868.6. Samples: 1152513148. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:04:58,722][25689] Avg episode reward: [(0, '-0.273')] [2022-07-11 09:04:58,867][26022] Updated weights on worker 0-0, policy_version 1125494 (0.00085) [2022-07-11 09:05:00,765][26022] Updated weights on worker 0-0, policy_version 1125504 (0.00082) [2022-07-11 09:05:02,884][26022] Updated weights on worker 0-0, policy_version 1125514 (0.00092) [2022-07-11 09:05:03,745][25689] Fps is (10 sec: 5395.9, 60 sec: 5582.9, 300 sec: 5584.6). Total num frames: 1152530432. Throughput: 0: 5032.8. Samples: 1152530198. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:03,746][25689] Avg episode reward: [(0, '-0.225')] [2022-07-11 09:05:04,799][26022] Updated weights on worker 0-0, policy_version 1125524 (0.00084) [2022-07-11 09:05:06,524][26022] Updated weights on worker 0-0, policy_version 1125534 (0.00091) [2022-07-11 09:05:08,381][26022] Updated weights on worker 0-0, policy_version 1125544 (0.00093) [2022-07-11 09:05:08,771][25689] Fps is (10 sec: 5400.3, 60 sec: 5556.4, 300 sec: 5582.7). Total num frames: 1152558080. Throughput: 0: 5753.0. Samples: 1152561754. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:08,772][25689] Avg episode reward: [(0, '0.200')] [2022-07-11 09:05:10,124][26022] Updated weights on worker 0-0, policy_version 1125554 (0.00088) [2022-07-11 09:05:11,919][26022] Updated weights on worker 0-0, policy_version 1125564 (0.00082) [2022-07-11 09:05:13,761][26022] Updated weights on worker 0-0, policy_version 1125574 (0.00086) [2022-07-11 09:05:13,862][25689] Fps is (10 sec: 5769.4, 60 sec: 5593.8, 300 sec: 5591.8). Total num frames: 1152588800. Throughput: 0: 5762.0. Samples: 1152595618. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:13,863][25689] Avg episode reward: [(0, '0.271')] [2022-07-11 09:05:15,736][26022] Updated weights on worker 0-0, policy_version 1125584 (0.00097) [2022-07-11 09:05:17,309][26022] Updated weights on worker 0-0, policy_version 1125594 (0.00085) [2022-07-11 09:05:18,909][25689] Fps is (10 sec: 5656.4, 60 sec: 5593.1, 300 sec: 5587.9). Total num frames: 1152615424. Throughput: 0: 4912.3. Samples: 1152612316. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:18,910][25689] Avg episode reward: [(0, '1.328')] [2022-07-11 09:05:19,382][26022] Updated weights on worker 0-0, policy_version 1125604 (0.00088) [2022-07-11 09:05:21,277][26022] Updated weights on worker 0-0, policy_version 1125614 (0.00087) [2022-07-11 09:05:22,906][26022] Updated weights on worker 0-0, policy_version 1125624 (0.00086) [2022-07-11 09:05:23,915][25689] Fps is (10 sec: 5500.6, 60 sec: 5581.9, 300 sec: 5588.0). Total num frames: 1152644096. Throughput: 0: 5754.5. Samples: 1152646264. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:23,915][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 09:05:24,785][26022] Updated weights on worker 0-0, policy_version 1125634 (0.00087) [2022-07-11 09:05:26,762][26022] Updated weights on worker 0-0, policy_version 1125644 (0.00093) [2022-07-11 09:05:28,461][26022] Updated weights on worker 0-0, policy_version 1125654 (0.00084) [2022-07-11 09:05:28,929][25689] Fps is (10 sec: 5723.0, 60 sec: 5605.4, 300 sec: 5592.1). Total num frames: 1152672768. Throughput: 0: 5849.5. Samples: 1152679668. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:28,930][25689] Avg episode reward: [(0, '1.420')] [2022-07-11 09:05:30,394][26022] Updated weights on worker 0-0, policy_version 1125664 (0.00087) [2022-07-11 09:05:32,258][26022] Updated weights on worker 0-0, policy_version 1125674 (0.00077) [2022-07-11 09:05:33,955][26022] Updated weights on worker 0-0, policy_version 1125684 (0.00088) [2022-07-11 09:05:34,035][25689] Fps is (10 sec: 5666.4, 60 sec: 5601.5, 300 sec: 5590.2). Total num frames: 1152701440. Throughput: 0: 4992.7. Samples: 1152696334. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:34,035][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 09:05:35,852][26022] Updated weights on worker 0-0, policy_version 1125694 (0.00101) [2022-07-11 09:05:37,750][26022] Updated weights on worker 0-0, policy_version 1125704 (0.00098) [2022-07-11 09:05:39,063][25689] Fps is (10 sec: 5456.5, 60 sec: 5567.0, 300 sec: 5586.8). Total num frames: 1152728064. Throughput: 0: 5832.8. Samples: 1152729870. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:39,064][25689] Avg episode reward: [(0, '0.290')] [2022-07-11 09:05:39,593][26022] Updated weights on worker 0-0, policy_version 1125714 (0.00086) [2022-07-11 09:05:41,473][26022] Updated weights on worker 0-0, policy_version 1125724 (0.00108) [2022-07-11 09:05:43,112][26022] Updated weights on worker 0-0, policy_version 1125734 (0.00117) [2022-07-11 09:05:44,077][25689] Fps is (10 sec: 5506.5, 60 sec: 5567.2, 300 sec: 5586.9). Total num frames: 1152756736. Throughput: 0: 5801.1. Samples: 1152763226. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:44,077][25689] Avg episode reward: [(0, '0.059')] [2022-07-11 09:05:45,171][26022] Updated weights on worker 0-0, policy_version 1125744 (0.00088) [2022-07-11 09:05:46,930][26022] Updated weights on worker 0-0, policy_version 1125754 (0.00095) [2022-07-11 09:05:48,557][26022] Updated weights on worker 0-0, policy_version 1125764 (0.00089) [2022-07-11 09:05:49,102][25689] Fps is (10 sec: 5610.3, 60 sec: 5565.2, 300 sec: 5584.3). Total num frames: 1152784384. Throughput: 0: 4974.0. Samples: 1152780006. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:49,102][25689] Avg episode reward: [(0, '0.191')] [2022-07-11 09:05:50,674][26022] Updated weights on worker 0-0, policy_version 1125774 (0.00086) [2022-07-11 09:05:52,196][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:05:52,211][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001125783_1152801792.pth [2022-07-11 09:05:52,212][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001123819_1150790656.pth [2022-07-11 09:05:52,382][26022] Updated weights on worker 0-0, policy_version 1125784 (0.00089) [2022-07-11 09:05:54,197][25689] Fps is (10 sec: 5463.9, 60 sec: 5550.7, 300 sec: 5582.6). Total num frames: 1152812032. Throughput: 0: 5816.5. Samples: 1152813608. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:54,198][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 09:05:54,254][26022] Updated weights on worker 0-0, policy_version 1125794 (0.00077) [2022-07-11 09:05:56,112][26022] Updated weights on worker 0-0, policy_version 1125804 (0.00092) [2022-07-11 09:05:57,733][26022] Updated weights on worker 0-0, policy_version 1125814 (0.00092) [2022-07-11 09:05:59,199][25689] Fps is (10 sec: 5476.1, 60 sec: 5553.6, 300 sec: 5579.6). Total num frames: 1152839680. Throughput: 0: 5833.1. Samples: 1152847328. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:05:59,200][25689] Avg episode reward: [(0, '-0.218')] [2022-07-11 09:05:59,787][26022] Updated weights on worker 0-0, policy_version 1125824 (0.00090) [2022-07-11 09:06:01,295][26022] Updated weights on worker 0-0, policy_version 1125834 (0.00093) [2022-07-11 09:06:03,911][26022] Updated weights on worker 0-0, policy_version 1125844 (0.00110) [2022-07-11 09:06:04,236][25689] Fps is (10 sec: 5507.9, 60 sec: 5569.4, 300 sec: 5589.3). Total num frames: 1152867328. Throughput: 0: 4984.4. Samples: 1152863710. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:04,238][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 09:06:05,634][26022] Updated weights on worker 0-0, policy_version 1125854 (0.00106) [2022-07-11 09:06:07,469][26022] Updated weights on worker 0-0, policy_version 1125864 (0.00086) [2022-07-11 09:06:09,243][25689] Fps is (10 sec: 5403.6, 60 sec: 5554.2, 300 sec: 5576.3). Total num frames: 1152893952. Throughput: 0: 5733.0. Samples: 1152895476. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:09,245][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 09:06:09,267][26022] Updated weights on worker 0-0, policy_version 1125874 (0.00093) [2022-07-11 09:06:11,213][26022] Updated weights on worker 0-0, policy_version 1125884 (0.00086) [2022-07-11 09:06:13,010][26022] Updated weights on worker 0-0, policy_version 1125894 (0.00093) [2022-07-11 09:06:14,286][25689] Fps is (10 sec: 5400.5, 60 sec: 5507.8, 300 sec: 5582.6). Total num frames: 1152921600. Throughput: 0: 5747.2. Samples: 1152929062. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:14,297][25689] Avg episode reward: [(0, '0.722')] [2022-07-11 09:06:14,866][26022] Updated weights on worker 0-0, policy_version 1125904 (0.00092) [2022-07-11 09:06:16,511][26022] Updated weights on worker 0-0, policy_version 1125914 (0.00095) [2022-07-11 09:06:18,452][26022] Updated weights on worker 0-0, policy_version 1125924 (0.00091) [2022-07-11 09:06:19,380][25689] Fps is (10 sec: 5656.8, 60 sec: 5554.3, 300 sec: 5581.2). Total num frames: 1152951296. Throughput: 0: 4880.1. Samples: 1152945812. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:19,382][25689] Avg episode reward: [(0, '0.599')] [2022-07-11 09:06:20,083][26022] Updated weights on worker 0-0, policy_version 1125934 (0.00087) [2022-07-11 09:06:22,188][26022] Updated weights on worker 0-0, policy_version 1125944 (0.00089) [2022-07-11 09:06:23,614][26022] Updated weights on worker 0-0, policy_version 1125954 (0.00083) [2022-07-11 09:06:24,398][25689] Fps is (10 sec: 5772.0, 60 sec: 5553.1, 300 sec: 5584.9). Total num frames: 1152979968. Throughput: 0: 5766.2. Samples: 1152979966. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:24,398][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 09:06:25,788][26022] Updated weights on worker 0-0, policy_version 1125964 (0.00095) [2022-07-11 09:06:27,397][26022] Updated weights on worker 0-0, policy_version 1125974 (0.00086) [2022-07-11 09:06:29,330][26022] Updated weights on worker 0-0, policy_version 1125984 (0.00084) [2022-07-11 09:06:29,411][25689] Fps is (10 sec: 5614.5, 60 sec: 5536.3, 300 sec: 5582.5). Total num frames: 1153007616. Throughput: 0: 5880.4. Samples: 1153014074. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:29,412][25689] Avg episode reward: [(0, '1.061')] [2022-07-11 09:06:30,882][26022] Updated weights on worker 0-0, policy_version 1125994 (0.00086) [2022-07-11 09:06:33,035][26022] Updated weights on worker 0-0, policy_version 1126004 (0.00089) [2022-07-11 09:06:34,465][25689] Fps is (10 sec: 5696.2, 60 sec: 5558.0, 300 sec: 5588.7). Total num frames: 1153037312. Throughput: 0: 5056.5. Samples: 1153031100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:34,466][25689] Avg episode reward: [(0, '0.914')] [2022-07-11 09:06:34,609][26022] Updated weights on worker 0-0, policy_version 1126014 (0.00086) [2022-07-11 09:06:36,598][26022] Updated weights on worker 0-0, policy_version 1126024 (0.00051) [2022-07-11 09:06:38,271][26022] Updated weights on worker 0-0, policy_version 1126034 (0.00084) [2022-07-11 09:06:39,510][25689] Fps is (10 sec: 5678.6, 60 sec: 5573.4, 300 sec: 5586.3). Total num frames: 1153064960. Throughput: 0: 5918.1. Samples: 1153064940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:39,510][25689] Avg episode reward: [(0, '1.017')] [2022-07-11 09:06:40,190][26022] Updated weights on worker 0-0, policy_version 1126044 (0.00086) [2022-07-11 09:06:41,839][26022] Updated weights on worker 0-0, policy_version 1126054 (0.00084) [2022-07-11 09:06:43,731][26022] Updated weights on worker 0-0, policy_version 1126064 (0.00082) [2022-07-11 09:06:44,529][25689] Fps is (10 sec: 5596.2, 60 sec: 5572.9, 300 sec: 5583.4). Total num frames: 1153093632. Throughput: 0: 5926.5. Samples: 1153099274. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:44,530][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 09:06:45,438][26022] Updated weights on worker 0-0, policy_version 1126074 (0.00085) [2022-07-11 09:06:47,353][26022] Updated weights on worker 0-0, policy_version 1126084 (0.00086) [2022-07-11 09:06:49,173][26022] Updated weights on worker 0-0, policy_version 1126094 (0.00086) [2022-07-11 09:06:49,546][25689] Fps is (10 sec: 5611.4, 60 sec: 5573.6, 300 sec: 5580.5). Total num frames: 1153121280. Throughput: 0: 5075.3. Samples: 1153116266. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:49,547][25689] Avg episode reward: [(0, '1.245')] [2022-07-11 09:06:51,027][26022] Updated weights on worker 0-0, policy_version 1126104 (0.00081) [2022-07-11 09:06:52,845][26022] Updated weights on worker 0-0, policy_version 1126114 (0.00083) [2022-07-11 09:06:54,622][25689] Fps is (10 sec: 5580.4, 60 sec: 5592.4, 300 sec: 5586.0). Total num frames: 1153149952. Throughput: 0: 5902.5. Samples: 1153150074. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:54,622][25689] Avg episode reward: [(0, '2.352')] [2022-07-11 09:06:54,705][26022] Updated weights on worker 0-0, policy_version 1126124 (0.00090) [2022-07-11 09:06:56,602][26022] Updated weights on worker 0-0, policy_version 1126134 (0.00085) [2022-07-11 09:06:58,224][26022] Updated weights on worker 0-0, policy_version 1126144 (0.00086) [2022-07-11 09:06:59,626][25689] Fps is (10 sec: 5689.3, 60 sec: 5609.2, 300 sec: 5590.6). Total num frames: 1153178624. Throughput: 0: 5909.4. Samples: 1153183814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:06:59,626][25689] Avg episode reward: [(0, '2.400')] [2022-07-11 09:07:00,067][26022] Updated weights on worker 0-0, policy_version 1126154 (0.00084) [2022-07-11 09:07:02,379][26022] Updated weights on worker 0-0, policy_version 1126164 (0.00071) [2022-07-11 09:07:04,039][26022] Updated weights on worker 0-0, policy_version 1126174 (0.00084) [2022-07-11 09:07:04,662][25689] Fps is (10 sec: 5507.6, 60 sec: 5592.4, 300 sec: 5583.1). Total num frames: 1153205248. Throughput: 0: 4969.2. Samples: 1153199316. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:04,662][25689] Avg episode reward: [(0, '1.158')] [2022-07-11 09:07:06,019][26022] Updated weights on worker 0-0, policy_version 1126184 (0.00081) [2022-07-11 09:07:07,739][26022] Updated weights on worker 0-0, policy_version 1126194 (0.00081) [2022-07-11 09:07:09,669][25689] Fps is (10 sec: 5403.9, 60 sec: 5609.2, 300 sec: 5585.2). Total num frames: 1153232896. Throughput: 0: 5762.2. Samples: 1153232216. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:09,669][25689] Avg episode reward: [(0, '0.992')] [2022-07-11 09:07:09,679][26022] Updated weights on worker 0-0, policy_version 1126204 (0.00088) [2022-07-11 09:07:11,516][26022] Updated weights on worker 0-0, policy_version 1126214 (0.00088) [2022-07-11 09:07:13,236][26022] Updated weights on worker 0-0, policy_version 1126224 (0.00087) [2022-07-11 09:07:14,741][25689] Fps is (10 sec: 5587.6, 60 sec: 5623.5, 300 sec: 5584.0). Total num frames: 1153261568. Throughput: 0: 5793.6. Samples: 1153266638. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:14,741][25689] Avg episode reward: [(0, '0.394')] [2022-07-11 09:07:14,943][26022] Updated weights on worker 0-0, policy_version 1126234 (0.00082) [2022-07-11 09:07:16,795][26022] Updated weights on worker 0-0, policy_version 1126244 (0.00079) [2022-07-11 09:07:18,546][26022] Updated weights on worker 0-0, policy_version 1126254 (0.00084) [2022-07-11 09:07:19,779][25689] Fps is (10 sec: 5671.9, 60 sec: 5611.8, 300 sec: 5576.4). Total num frames: 1153290240. Throughput: 0: 4940.8. Samples: 1153283388. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:19,780][25689] Avg episode reward: [(0, '0.640')] [2022-07-11 09:07:20,442][26022] Updated weights on worker 0-0, policy_version 1126264 (0.00084) [2022-07-11 09:07:22,063][26022] Updated weights on worker 0-0, policy_version 1126274 (0.00511) [2022-07-11 09:07:24,123][26022] Updated weights on worker 0-0, policy_version 1126284 (0.00087) [2022-07-11 09:07:24,811][25689] Fps is (10 sec: 5694.7, 60 sec: 5610.5, 300 sec: 5583.1). Total num frames: 1153318912. Throughput: 0: 5869.1. Samples: 1153317574. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:24,811][25689] Avg episode reward: [(0, '-0.542')] [2022-07-11 09:07:25,622][26022] Updated weights on worker 0-0, policy_version 1126294 (0.00091) [2022-07-11 09:07:27,727][26022] Updated weights on worker 0-0, policy_version 1126304 (0.00089) [2022-07-11 09:07:29,616][26022] Updated weights on worker 0-0, policy_version 1126314 (0.00099) [2022-07-11 09:07:29,829][25689] Fps is (10 sec: 5604.0, 60 sec: 5610.0, 300 sec: 5580.6). Total num frames: 1153346560. Throughput: 0: 5919.6. Samples: 1153351556. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:29,831][25689] Avg episode reward: [(0, '0.030')] [2022-07-11 09:07:31,290][26022] Updated weights on worker 0-0, policy_version 1126324 (0.00088) [2022-07-11 09:07:33,077][26022] Updated weights on worker 0-0, policy_version 1126334 (0.00092) [2022-07-11 09:07:34,857][26022] Updated weights on worker 0-0, policy_version 1126344 (0.00081) [2022-07-11 09:07:34,891][25689] Fps is (10 sec: 5688.5, 60 sec: 5609.2, 300 sec: 5586.5). Total num frames: 1153376256. Throughput: 0: 5900.9. Samples: 1153385544. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:34,892][25689] Avg episode reward: [(0, '0.571')] [2022-07-11 09:07:36,704][26022] Updated weights on worker 0-0, policy_version 1126354 (0.00093) [2022-07-11 09:07:38,523][26022] Updated weights on worker 0-0, policy_version 1126364 (0.00093) [2022-07-11 09:07:39,903][25689] Fps is (10 sec: 5793.7, 60 sec: 5629.2, 300 sec: 5587.9). Total num frames: 1153404928. Throughput: 0: 5913.0. Samples: 1153402384. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:39,906][25689] Avg episode reward: [(0, '0.559')] [2022-07-11 09:07:40,315][26022] Updated weights on worker 0-0, policy_version 1126374 (0.00088) [2022-07-11 09:07:41,959][26022] Updated weights on worker 0-0, policy_version 1126384 (0.00079) [2022-07-11 09:07:43,931][26022] Updated weights on worker 0-0, policy_version 1126394 (0.00085) [2022-07-11 09:07:44,935][25689] Fps is (10 sec: 5607.6, 60 sec: 5611.1, 300 sec: 5587.4). Total num frames: 1153432576. Throughput: 0: 5934.1. Samples: 1153436992. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:44,937][25689] Avg episode reward: [(0, '-0.546')] [2022-07-11 09:07:45,664][26022] Updated weights on worker 0-0, policy_version 1126404 (0.00093) [2022-07-11 09:07:47,573][26022] Updated weights on worker 0-0, policy_version 1126414 (0.00095) [2022-07-11 09:07:49,361][26022] Updated weights on worker 0-0, policy_version 1126424 (0.00090) [2022-07-11 09:07:50,036][25689] Fps is (10 sec: 5558.2, 60 sec: 5620.2, 300 sec: 5586.2). Total num frames: 1153461248. Throughput: 0: 5893.1. Samples: 1153470640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:50,037][25689] Avg episode reward: [(0, '0.436')] [2022-07-11 09:07:51,271][26022] Updated weights on worker 0-0, policy_version 1126434 (0.00087) [2022-07-11 09:07:52,440][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:07:52,467][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001126441_1153475584.pth [2022-07-11 09:07:52,467][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001124473_1151460352.pth [2022-07-11 09:07:52,915][26022] Updated weights on worker 0-0, policy_version 1126444 (0.00076) [2022-07-11 09:07:55,000][26022] Updated weights on worker 0-0, policy_version 1126454 (0.00095) [2022-07-11 09:07:55,083][25689] Fps is (10 sec: 5549.6, 60 sec: 5605.9, 300 sec: 5579.2). Total num frames: 1153488896. Throughput: 0: 5053.9. Samples: 1153487592. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:07:55,084][25689] Avg episode reward: [(0, '0.275')] [2022-07-11 09:07:56,640][26022] Updated weights on worker 0-0, policy_version 1126464 (0.00090) [2022-07-11 09:07:58,744][26022] Updated weights on worker 0-0, policy_version 1126474 (0.00089) [2022-07-11 09:08:00,104][25689] Fps is (10 sec: 5695.6, 60 sec: 5621.3, 300 sec: 5593.4). Total num frames: 1153518592. Throughput: 0: 5874.9. Samples: 1153521062. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:00,105][25689] Avg episode reward: [(0, '-0.454')] [2022-07-11 09:08:00,166][26022] Updated weights on worker 0-0, policy_version 1126484 (0.00090) [2022-07-11 09:08:02,717][26022] Updated weights on worker 0-0, policy_version 1126494 (0.00081) [2022-07-11 09:08:04,156][26022] Updated weights on worker 0-0, policy_version 1126504 (0.00086) [2022-07-11 09:08:05,129][25689] Fps is (10 sec: 5402.8, 60 sec: 5588.5, 300 sec: 5583.2). Total num frames: 1153543168. Throughput: 0: 5725.3. Samples: 1153552606. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:05,129][25689] Avg episode reward: [(0, '-0.430')] [2022-07-11 09:08:06,392][26022] Updated weights on worker 0-0, policy_version 1126514 (0.00108) [2022-07-11 09:08:07,758][26022] Updated weights on worker 0-0, policy_version 1126524 (0.00095) [2022-07-11 09:08:09,975][26022] Updated weights on worker 0-0, policy_version 1126534 (0.00084) [2022-07-11 09:08:10,139][25689] Fps is (10 sec: 5408.7, 60 sec: 5622.1, 300 sec: 5584.8). Total num frames: 1153572864. Throughput: 0: 4902.0. Samples: 1153569182. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:10,139][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 09:08:11,688][26022] Updated weights on worker 0-0, policy_version 1126544 (0.00086) [2022-07-11 09:08:13,555][26022] Updated weights on worker 0-0, policy_version 1126554 (0.00086) [2022-07-11 09:08:15,273][25689] Fps is (10 sec: 5551.9, 60 sec: 5582.5, 300 sec: 5576.3). Total num frames: 1153599488. Throughput: 0: 5709.9. Samples: 1153602872. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:15,273][25689] Avg episode reward: [(0, '-0.282')] [2022-07-11 09:08:15,529][26022] Updated weights on worker 0-0, policy_version 1126564 (0.00084) [2022-07-11 09:08:17,133][26022] Updated weights on worker 0-0, policy_version 1126574 (0.00089) [2022-07-11 09:08:19,174][26022] Updated weights on worker 0-0, policy_version 1126584 (0.00090) [2022-07-11 09:08:20,288][25689] Fps is (10 sec: 5549.4, 60 sec: 5601.6, 300 sec: 5580.3). Total num frames: 1153629184. Throughput: 0: 5720.2. Samples: 1153636512. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:20,288][25689] Avg episode reward: [(0, '-0.863')] [2022-07-11 09:08:20,998][26022] Updated weights on worker 0-0, policy_version 1126594 (0.00082) [2022-07-11 09:08:22,579][26022] Updated weights on worker 0-0, policy_version 1126604 (0.00090) [2022-07-11 09:08:24,407][26022] Updated weights on worker 0-0, policy_version 1126614 (0.00094) [2022-07-11 09:08:25,302][25689] Fps is (10 sec: 5717.7, 60 sec: 5586.3, 300 sec: 5578.0). Total num frames: 1153656832. Throughput: 0: 5010.7. Samples: 1153653688. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:25,302][25689] Avg episode reward: [(0, '-0.680')] [2022-07-11 09:08:26,217][26022] Updated weights on worker 0-0, policy_version 1126624 (0.00087) [2022-07-11 09:08:28,159][26022] Updated weights on worker 0-0, policy_version 1126634 (0.00087) [2022-07-11 09:08:29,911][26022] Updated weights on worker 0-0, policy_version 1126644 (0.00086) [2022-07-11 09:08:30,335][25689] Fps is (10 sec: 5605.4, 60 sec: 5601.8, 300 sec: 5582.6). Total num frames: 1153685504. Throughput: 0: 5861.9. Samples: 1153687570. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:08:30,336][25689] Avg episode reward: [(0, '-1.058')] [2022-07-11 09:08:31,775][26022] Updated weights on worker 0-0, policy_version 1126654 (0.00064) [2022-07-11 09:08:33,635][26022] Updated weights on worker 0-0, policy_version 1126664 (0.00089) [2022-07-11 09:08:35,294][26022] Updated weights on worker 0-0, policy_version 1126674 (0.00097) [2022-07-11 09:08:35,468][25689] Fps is (10 sec: 5640.5, 60 sec: 5578.3, 300 sec: 5581.5). Total num frames: 1153714176. Throughput: 0: 5875.0. Samples: 1153721522. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:08:35,469][25689] Avg episode reward: [(0, '-0.483')] [2022-07-11 09:08:37,247][26022] Updated weights on worker 0-0, policy_version 1126684 (0.00081) [2022-07-11 09:08:38,966][26022] Updated weights on worker 0-0, policy_version 1126694 (0.00086) [2022-07-11 09:08:40,497][25689] Fps is (10 sec: 5441.5, 60 sec: 5543.1, 300 sec: 5574.5). Total num frames: 1153740800. Throughput: 0: 5022.8. Samples: 1153738020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:08:40,497][25689] Avg episode reward: [(0, '-0.110')] [2022-07-11 09:08:40,833][26022] Updated weights on worker 0-0, policy_version 1126704 (0.00084) [2022-07-11 09:08:42,680][26022] Updated weights on worker 0-0, policy_version 1126714 (0.00089) [2022-07-11 09:08:44,673][26022] Updated weights on worker 0-0, policy_version 1126724 (0.00085) [2022-07-11 09:08:45,520][25689] Fps is (10 sec: 5602.9, 60 sec: 5577.6, 300 sec: 5578.1). Total num frames: 1153770496. Throughput: 0: 5844.7. Samples: 1153771860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:08:45,521][25689] Avg episode reward: [(0, '-0.247')] [2022-07-11 09:08:46,266][26022] Updated weights on worker 0-0, policy_version 1126734 (0.00088) [2022-07-11 09:08:48,199][26022] Updated weights on worker 0-0, policy_version 1126744 (0.00087) [2022-07-11 09:08:49,947][26022] Updated weights on worker 0-0, policy_version 1126754 (0.00090) [2022-07-11 09:08:50,540][25689] Fps is (10 sec: 5811.5, 60 sec: 5585.1, 300 sec: 5585.6). Total num frames: 1153799168. Throughput: 0: 5875.6. Samples: 1153806290. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:08:50,541][25689] Avg episode reward: [(0, '-0.234')] [2022-07-11 09:08:51,832][26022] Updated weights on worker 0-0, policy_version 1126764 (0.00082) [2022-07-11 09:08:53,542][26022] Updated weights on worker 0-0, policy_version 1126774 (0.00061) [2022-07-11 09:08:55,377][26022] Updated weights on worker 0-0, policy_version 1126784 (0.00084) [2022-07-11 09:08:55,599][25689] Fps is (10 sec: 5689.7, 60 sec: 5600.9, 300 sec: 5585.0). Total num frames: 1153827840. Throughput: 0: 5033.2. Samples: 1153822844. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:08:55,602][25689] Avg episode reward: [(0, '0.040')] [2022-07-11 09:08:57,120][26022] Updated weights on worker 0-0, policy_version 1126794 (0.00091) [2022-07-11 09:08:59,111][26022] Updated weights on worker 0-0, policy_version 1126804 (0.00098) [2022-07-11 09:09:00,618][25689] Fps is (10 sec: 5690.3, 60 sec: 5584.2, 300 sec: 5595.1). Total num frames: 1153856512. Throughput: 0: 5899.9. Samples: 1153856732. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:00,618][25689] Avg episode reward: [(0, '0.021')] [2022-07-11 09:09:00,686][26022] Updated weights on worker 0-0, policy_version 1126814 (0.00090) [2022-07-11 09:09:03,174][26022] Updated weights on worker 0-0, policy_version 1126824 (0.00085) [2022-07-11 09:09:04,838][26022] Updated weights on worker 0-0, policy_version 1126834 (0.00088) [2022-07-11 09:09:05,646][25689] Fps is (10 sec: 5401.6, 60 sec: 5600.7, 300 sec: 5582.8). Total num frames: 1153882112. Throughput: 0: 5798.4. Samples: 1153888558. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:05,648][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 09:09:06,855][26022] Updated weights on worker 0-0, policy_version 1126844 (0.00100) [2022-07-11 09:09:08,583][26022] Updated weights on worker 0-0, policy_version 1126854 (0.00088) [2022-07-11 09:09:10,350][26022] Updated weights on worker 0-0, policy_version 1126864 (0.00082) [2022-07-11 09:09:10,659][25689] Fps is (10 sec: 5302.9, 60 sec: 5566.7, 300 sec: 5581.6). Total num frames: 1153909760. Throughput: 0: 4931.8. Samples: 1153905512. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:10,660][25689] Avg episode reward: [(0, '0.005')] [2022-07-11 09:09:12,242][26022] Updated weights on worker 0-0, policy_version 1126874 (0.00082) [2022-07-11 09:09:13,928][26022] Updated weights on worker 0-0, policy_version 1126884 (0.00084) [2022-07-11 09:09:15,748][25689] Fps is (10 sec: 5676.6, 60 sec: 5621.6, 300 sec: 5591.0). Total num frames: 1153939456. Throughput: 0: 5779.6. Samples: 1153939298. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:15,749][25689] Avg episode reward: [(0, '0.109')] [2022-07-11 09:09:15,755][26022] Updated weights on worker 0-0, policy_version 1126894 (0.00085) [2022-07-11 09:09:17,584][26022] Updated weights on worker 0-0, policy_version 1126904 (0.00098) [2022-07-11 09:09:19,477][26022] Updated weights on worker 0-0, policy_version 1126914 (0.00080) [2022-07-11 09:09:20,825][25689] Fps is (10 sec: 5641.0, 60 sec: 5582.0, 300 sec: 5583.9). Total num frames: 1153967104. Throughput: 0: 5750.5. Samples: 1153972932. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:20,825][25689] Avg episode reward: [(0, '0.296')] [2022-07-11 09:09:21,407][26022] Updated weights on worker 0-0, policy_version 1126924 (0.00087) [2022-07-11 09:09:23,095][26022] Updated weights on worker 0-0, policy_version 1126934 (0.00087) [2022-07-11 09:09:25,064][26022] Updated weights on worker 0-0, policy_version 1126944 (0.00084) [2022-07-11 09:09:25,850][25689] Fps is (10 sec: 5474.1, 60 sec: 5581.0, 300 sec: 5585.0). Total num frames: 1153994752. Throughput: 0: 5013.1. Samples: 1153989838. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:25,850][25689] Avg episode reward: [(0, '0.262')] [2022-07-11 09:09:26,896][26022] Updated weights on worker 0-0, policy_version 1126954 (0.00094) [2022-07-11 09:09:28,770][26022] Updated weights on worker 0-0, policy_version 1126964 (0.00090) [2022-07-11 09:09:30,471][26022] Updated weights on worker 0-0, policy_version 1126974 (0.00095) [2022-07-11 09:09:30,883][25689] Fps is (10 sec: 5497.2, 60 sec: 5564.1, 300 sec: 5582.2). Total num frames: 1154022400. Throughput: 0: 5821.0. Samples: 1154023238. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:30,884][25689] Avg episode reward: [(0, '-0.296')] [2022-07-11 09:09:32,495][26022] Updated weights on worker 0-0, policy_version 1126984 (0.00089) [2022-07-11 09:09:34,036][26022] Updated weights on worker 0-0, policy_version 1126994 (0.00088) [2022-07-11 09:09:35,940][25689] Fps is (10 sec: 5479.9, 60 sec: 5554.2, 300 sec: 5578.1). Total num frames: 1154050048. Throughput: 0: 5807.1. Samples: 1154056554. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:35,941][25689] Avg episode reward: [(0, '-0.088')] [2022-07-11 09:09:36,312][26022] Updated weights on worker 0-0, policy_version 1127004 (0.00088) [2022-07-11 09:09:37,879][26022] Updated weights on worker 0-0, policy_version 1127014 (0.00085) [2022-07-11 09:09:39,832][26022] Updated weights on worker 0-0, policy_version 1127024 (0.00084) [2022-07-11 09:09:40,946][25689] Fps is (10 sec: 5698.6, 60 sec: 5607.1, 300 sec: 5581.7). Total num frames: 1154079744. Throughput: 0: 4993.7. Samples: 1154073414. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:40,947][25689] Avg episode reward: [(0, '0.710')] [2022-07-11 09:09:41,523][26022] Updated weights on worker 0-0, policy_version 1127034 (0.00088) [2022-07-11 09:09:43,329][26022] Updated weights on worker 0-0, policy_version 1127044 (0.00103) [2022-07-11 09:09:45,102][26022] Updated weights on worker 0-0, policy_version 1127054 (0.00083) [2022-07-11 09:09:45,965][25689] Fps is (10 sec: 5618.4, 60 sec: 5556.7, 300 sec: 5578.0). Total num frames: 1154106368. Throughput: 0: 5828.0. Samples: 1154107068. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:45,965][25689] Avg episode reward: [(0, '0.286')] [2022-07-11 09:09:47,100][26022] Updated weights on worker 0-0, policy_version 1127064 (0.00098) [2022-07-11 09:09:48,743][26022] Updated weights on worker 0-0, policy_version 1127074 (0.00532) [2022-07-11 09:09:50,848][26022] Updated weights on worker 0-0, policy_version 1127084 (0.00077) [2022-07-11 09:09:50,970][25689] Fps is (10 sec: 5516.7, 60 sec: 5558.1, 300 sec: 5580.1). Total num frames: 1154135040. Throughput: 0: 5858.8. Samples: 1154140920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:50,970][25689] Avg episode reward: [(0, '0.886')] [2022-07-11 09:09:52,324][26022] Updated weights on worker 0-0, policy_version 1127094 (0.00097) [2022-07-11 09:09:52,694][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:09:52,706][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001127095_1154145280.pth [2022-07-11 09:09:52,707][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001125130_1152133120.pth [2022-07-11 09:09:54,488][26022] Updated weights on worker 0-0, policy_version 1127104 (0.00091) [2022-07-11 09:09:56,028][25689] Fps is (10 sec: 5698.0, 60 sec: 5558.1, 300 sec: 5583.1). Total num frames: 1154163712. Throughput: 0: 5044.7. Samples: 1154157892. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:09:56,029][25689] Avg episode reward: [(0, '1.058')] [2022-07-11 09:09:56,057][26022] Updated weights on worker 0-0, policy_version 1127114 (0.00094) [2022-07-11 09:09:57,981][26022] Updated weights on worker 0-0, policy_version 1127124 (0.00091) [2022-07-11 09:09:59,803][26022] Updated weights on worker 0-0, policy_version 1127134 (0.00078) [2022-07-11 09:10:01,058][25689] Fps is (10 sec: 5684.0, 60 sec: 5557.1, 300 sec: 5589.9). Total num frames: 1154192384. Throughput: 0: 5884.2. Samples: 1154191758. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:01,059][25689] Avg episode reward: [(0, '1.730')] [2022-07-11 09:10:01,652][26022] Updated weights on worker 0-0, policy_version 1127144 (0.00605) [2022-07-11 09:10:03,881][26022] Updated weights on worker 0-0, policy_version 1127154 (0.00106) [2022-07-11 09:10:05,670][26022] Updated weights on worker 0-0, policy_version 1127164 (0.00094) [2022-07-11 09:10:06,071][25689] Fps is (10 sec: 5506.4, 60 sec: 5575.5, 300 sec: 5586.7). Total num frames: 1154219008. Throughput: 0: 5812.0. Samples: 1154223924. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:06,071][25689] Avg episode reward: [(0, '1.840')] [2022-07-11 09:10:07,268][26022] Updated weights on worker 0-0, policy_version 1127174 (0.00090) [2022-07-11 09:10:09,356][26022] Updated weights on worker 0-0, policy_version 1127184 (0.00092) [2022-07-11 09:10:10,899][26022] Updated weights on worker 0-0, policy_version 1127194 (0.00090) [2022-07-11 09:10:11,095][25689] Fps is (10 sec: 5407.6, 60 sec: 5574.5, 300 sec: 5577.6). Total num frames: 1154246656. Throughput: 0: 4966.8. Samples: 1154240876. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:11,097][25689] Avg episode reward: [(0, '2.085')] [2022-07-11 09:10:12,924][26022] Updated weights on worker 0-0, policy_version 1127204 (0.00084) [2022-07-11 09:10:14,550][26022] Updated weights on worker 0-0, policy_version 1127214 (0.00087) [2022-07-11 09:10:16,198][25689] Fps is (10 sec: 5460.1, 60 sec: 5539.3, 300 sec: 5580.0). Total num frames: 1154274304. Throughput: 0: 5764.3. Samples: 1154274154. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:16,200][25689] Avg episode reward: [(0, '2.097')] [2022-07-11 09:10:16,620][26022] Updated weights on worker 0-0, policy_version 1127224 (0.00090) [2022-07-11 09:10:18,123][26022] Updated weights on worker 0-0, policy_version 1127234 (0.00091) [2022-07-11 09:10:20,367][26022] Updated weights on worker 0-0, policy_version 1127244 (0.00091) [2022-07-11 09:10:21,203][25689] Fps is (10 sec: 5774.3, 60 sec: 5596.7, 300 sec: 5586.9). Total num frames: 1154305024. Throughput: 0: 5771.1. Samples: 1154308014. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:21,203][25689] Avg episode reward: [(0, '1.097')] [2022-07-11 09:10:22,004][26022] Updated weights on worker 0-0, policy_version 1127254 (0.00091) [2022-07-11 09:10:23,824][26022] Updated weights on worker 0-0, policy_version 1127264 (0.00082) [2022-07-11 09:10:25,466][26022] Updated weights on worker 0-0, policy_version 1127274 (0.00093) [2022-07-11 09:10:26,229][25689] Fps is (10 sec: 5614.4, 60 sec: 5562.7, 300 sec: 5576.4). Total num frames: 1154330624. Throughput: 0: 5018.0. Samples: 1154325080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:26,230][25689] Avg episode reward: [(0, '1.054')] [2022-07-11 09:10:27,526][26022] Updated weights on worker 0-0, policy_version 1127284 (0.00094) [2022-07-11 09:10:29,297][26022] Updated weights on worker 0-0, policy_version 1127294 (0.00087) [2022-07-11 09:10:31,201][26022] Updated weights on worker 0-0, policy_version 1127304 (0.00079) [2022-07-11 09:10:31,259][25689] Fps is (10 sec: 5397.2, 60 sec: 5580.1, 300 sec: 5577.8). Total num frames: 1154359296. Throughput: 0: 5846.6. Samples: 1154358766. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:31,259][25689] Avg episode reward: [(0, '-0.480')] [2022-07-11 09:10:32,923][26022] Updated weights on worker 0-0, policy_version 1127314 (0.00084) [2022-07-11 09:10:34,924][26022] Updated weights on worker 0-0, policy_version 1127324 (0.00086) [2022-07-11 09:10:36,311][25689] Fps is (10 sec: 5687.7, 60 sec: 5597.4, 300 sec: 5584.2). Total num frames: 1154387968. Throughput: 0: 5883.6. Samples: 1154392492. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:36,312][25689] Avg episode reward: [(0, '-0.484')] [2022-07-11 09:10:36,492][26022] Updated weights on worker 0-0, policy_version 1127334 (0.00096) [2022-07-11 09:10:38,629][26022] Updated weights on worker 0-0, policy_version 1127344 (0.00089) [2022-07-11 09:10:40,063][26022] Updated weights on worker 0-0, policy_version 1127354 (0.00087) [2022-07-11 09:10:41,340][25689] Fps is (10 sec: 5586.7, 60 sec: 5561.4, 300 sec: 5580.5). Total num frames: 1154415616. Throughput: 0: 5018.3. Samples: 1154409066. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:41,340][25689] Avg episode reward: [(0, '-0.661')] [2022-07-11 09:10:42,291][26022] Updated weights on worker 0-0, policy_version 1127364 (0.00098) [2022-07-11 09:10:44,021][26022] Updated weights on worker 0-0, policy_version 1127374 (0.00089) [2022-07-11 09:10:45,826][26022] Updated weights on worker 0-0, policy_version 1127384 (0.00084) [2022-07-11 09:10:46,348][25689] Fps is (10 sec: 5611.5, 60 sec: 5596.3, 300 sec: 5584.3). Total num frames: 1154444288. Throughput: 0: 5848.3. Samples: 1154442740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:46,352][25689] Avg episode reward: [(0, '-0.255')] [2022-07-11 09:10:47,684][26022] Updated weights on worker 0-0, policy_version 1127394 (0.00095) [2022-07-11 09:10:49,497][26022] Updated weights on worker 0-0, policy_version 1127404 (0.00083) [2022-07-11 09:10:51,208][26022] Updated weights on worker 0-0, policy_version 1127414 (0.00085) [2022-07-11 09:10:51,395][25689] Fps is (10 sec: 5702.9, 60 sec: 5592.4, 300 sec: 5588.6). Total num frames: 1154472960. Throughput: 0: 5847.4. Samples: 1154476512. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:51,395][25689] Avg episode reward: [(0, '-0.965')] [2022-07-11 09:10:53,329][26022] Updated weights on worker 0-0, policy_version 1127424 (0.00087) [2022-07-11 09:10:54,713][26022] Updated weights on worker 0-0, policy_version 1127434 (0.00091) [2022-07-11 09:10:56,455][25689] Fps is (10 sec: 5471.0, 60 sec: 5558.4, 300 sec: 5584.1). Total num frames: 1154499584. Throughput: 0: 5856.3. Samples: 1154510458. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:10:56,455][25689] Avg episode reward: [(0, '-0.438')] [2022-07-11 09:10:56,912][26022] Updated weights on worker 0-0, policy_version 1127444 (0.00074) [2022-07-11 09:10:58,271][26022] Updated weights on worker 0-0, policy_version 1127454 (0.00101) [2022-07-11 09:11:00,367][26022] Updated weights on worker 0-0, policy_version 1127464 (0.00088) [2022-07-11 09:11:01,509][25689] Fps is (10 sec: 5669.6, 60 sec: 5590.0, 300 sec: 5594.1). Total num frames: 1154530304. Throughput: 0: 5869.2. Samples: 1154527444. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:01,510][25689] Avg episode reward: [(0, '-0.431')] [2022-07-11 09:11:02,340][26022] Updated weights on worker 0-0, policy_version 1127474 (0.00096) [2022-07-11 09:11:04,262][26022] Updated weights on worker 0-0, policy_version 1127484 (0.00087) [2022-07-11 09:11:06,096][26022] Updated weights on worker 0-0, policy_version 1127494 (0.00086) [2022-07-11 09:11:06,582][25689] Fps is (10 sec: 5460.4, 60 sec: 5550.6, 300 sec: 5586.0). Total num frames: 1154554880. Throughput: 0: 5760.9. Samples: 1154559306. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:06,582][25689] Avg episode reward: [(0, '-0.306')] [2022-07-11 09:11:07,987][26022] Updated weights on worker 0-0, policy_version 1127504 (0.00088) [2022-07-11 09:11:09,837][26022] Updated weights on worker 0-0, policy_version 1127514 (0.00083) [2022-07-11 09:11:11,611][25689] Fps is (10 sec: 5372.7, 60 sec: 5584.0, 300 sec: 5593.1). Total num frames: 1154584576. Throughput: 0: 5764.6. Samples: 1154593048. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:11,612][25689] Avg episode reward: [(0, '-1.608')] [2022-07-11 09:11:11,613][26022] Updated weights on worker 0-0, policy_version 1127524 (0.00086) [2022-07-11 09:11:13,309][26022] Updated weights on worker 0-0, policy_version 1127534 (0.00115) [2022-07-11 09:11:15,349][26022] Updated weights on worker 0-0, policy_version 1127544 (0.00084) [2022-07-11 09:11:16,659][25689] Fps is (10 sec: 5792.0, 60 sec: 5606.0, 300 sec: 5590.6). Total num frames: 1154613248. Throughput: 0: 4903.1. Samples: 1154609520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:16,659][25689] Avg episode reward: [(0, '-1.429')] [2022-07-11 09:11:17,155][26022] Updated weights on worker 0-0, policy_version 1127554 (0.00085) [2022-07-11 09:11:19,095][26022] Updated weights on worker 0-0, policy_version 1127564 (0.00087) [2022-07-11 09:11:20,705][26022] Updated weights on worker 0-0, policy_version 1127574 (0.00087) [2022-07-11 09:11:21,695][25689] Fps is (10 sec: 5483.5, 60 sec: 5535.4, 300 sec: 5583.3). Total num frames: 1154639872. Throughput: 0: 5736.4. Samples: 1154643236. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:21,695][25689] Avg episode reward: [(0, '-1.144')] [2022-07-11 09:11:22,567][26022] Updated weights on worker 0-0, policy_version 1127584 (0.00093) [2022-07-11 09:11:24,317][26022] Updated weights on worker 0-0, policy_version 1127594 (0.00088) [2022-07-11 09:11:26,475][26022] Updated weights on worker 0-0, policy_version 1127604 (0.00085) [2022-07-11 09:11:26,703][25689] Fps is (10 sec: 5403.6, 60 sec: 5571.0, 300 sec: 5583.4). Total num frames: 1154667520. Throughput: 0: 5845.2. Samples: 1154676918. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:26,704][25689] Avg episode reward: [(0, '-0.975')] [2022-07-11 09:11:28,023][26022] Updated weights on worker 0-0, policy_version 1127614 (0.00088) [2022-07-11 09:11:29,894][26022] Updated weights on worker 0-0, policy_version 1127624 (0.00080) [2022-07-11 09:11:31,709][25689] Fps is (10 sec: 5726.3, 60 sec: 5590.0, 300 sec: 5584.3). Total num frames: 1154697216. Throughput: 0: 5008.0. Samples: 1154693700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:31,711][25689] Avg episode reward: [(0, '-1.841')] [2022-07-11 09:11:31,718][26022] Updated weights on worker 0-0, policy_version 1127634 (0.00481) [2022-07-11 09:11:33,573][26022] Updated weights on worker 0-0, policy_version 1127644 (0.00089) [2022-07-11 09:11:35,449][26022] Updated weights on worker 0-0, policy_version 1127654 (0.00087) [2022-07-11 09:11:36,755][25689] Fps is (10 sec: 5602.6, 60 sec: 5556.8, 300 sec: 5580.8). Total num frames: 1154723840. Throughput: 0: 5865.4. Samples: 1154727392. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:36,755][25689] Avg episode reward: [(0, '-0.954')] [2022-07-11 09:11:37,114][26022] Updated weights on worker 0-0, policy_version 1127664 (0.00091) [2022-07-11 09:11:39,246][26022] Updated weights on worker 0-0, policy_version 1127674 (0.00078) [2022-07-11 09:11:40,933][26022] Updated weights on worker 0-0, policy_version 1127684 (0.00082) [2022-07-11 09:11:41,778][25689] Fps is (10 sec: 5491.4, 60 sec: 5574.2, 300 sec: 5580.8). Total num frames: 1154752512. Throughput: 0: 5876.3. Samples: 1154761254. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:41,779][25689] Avg episode reward: [(0, '-0.260')] [2022-07-11 09:11:42,935][26022] Updated weights on worker 0-0, policy_version 1127694 (0.00093) [2022-07-11 09:11:44,647][26022] Updated weights on worker 0-0, policy_version 1127704 (0.00094) [2022-07-11 09:11:46,401][26022] Updated weights on worker 0-0, policy_version 1127714 (0.00091) [2022-07-11 09:11:46,809][25689] Fps is (10 sec: 5703.3, 60 sec: 5572.0, 300 sec: 5584.0). Total num frames: 1154781184. Throughput: 0: 5025.6. Samples: 1154777968. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:46,810][25689] Avg episode reward: [(0, '-0.203')] [2022-07-11 09:11:48,347][26022] Updated weights on worker 0-0, policy_version 1127724 (0.00086) [2022-07-11 09:11:49,911][26022] Updated weights on worker 0-0, policy_version 1127734 (0.00088) [2022-07-11 09:11:51,855][25689] Fps is (10 sec: 5589.2, 60 sec: 5555.3, 300 sec: 5581.1). Total num frames: 1154808832. Throughput: 0: 5836.1. Samples: 1154811274. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:51,855][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 09:11:51,984][26022] Updated weights on worker 0-0, policy_version 1127744 (0.00088) [2022-07-11 09:11:52,718][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:11:52,730][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001127748_1154813952.pth [2022-07-11 09:11:52,731][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001125783_1152801792.pth [2022-07-11 09:11:53,978][26022] Updated weights on worker 0-0, policy_version 1127754 (0.00088) [2022-07-11 09:11:55,638][26022] Updated weights on worker 0-0, policy_version 1127764 (0.00093) [2022-07-11 09:11:56,910][25689] Fps is (10 sec: 5576.2, 60 sec: 5589.6, 300 sec: 5580.1). Total num frames: 1154837504. Throughput: 0: 5810.6. Samples: 1154844502. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:11:56,910][25689] Avg episode reward: [(0, '-0.525')] [2022-07-11 09:11:57,652][26022] Updated weights on worker 0-0, policy_version 1127774 (0.00053) [2022-07-11 09:11:59,265][26022] Updated weights on worker 0-0, policy_version 1127784 (0.00086) [2022-07-11 09:12:01,105][26022] Updated weights on worker 0-0, policy_version 1127794 (0.00089) [2022-07-11 09:12:01,995][25689] Fps is (10 sec: 5453.4, 60 sec: 5519.1, 300 sec: 5579.2). Total num frames: 1154864128. Throughput: 0: 4961.5. Samples: 1154861558. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:01,995][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 09:12:03,711][26022] Updated weights on worker 0-0, policy_version 1127804 (0.00084) [2022-07-11 09:12:05,096][26022] Updated weights on worker 0-0, policy_version 1127814 (0.00094) [2022-07-11 09:12:07,003][25689] Fps is (10 sec: 5173.8, 60 sec: 5541.8, 300 sec: 5572.3). Total num frames: 1154889728. Throughput: 0: 5694.5. Samples: 1154892962. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:07,004][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 09:12:07,232][26022] Updated weights on worker 0-0, policy_version 1127824 (0.00079) [2022-07-11 09:12:08,801][26022] Updated weights on worker 0-0, policy_version 1127834 (0.00089) [2022-07-11 09:12:10,799][26022] Updated weights on worker 0-0, policy_version 1127844 (0.00100) [2022-07-11 09:12:12,019][25689] Fps is (10 sec: 5618.5, 60 sec: 5560.1, 300 sec: 5580.2). Total num frames: 1154920448. Throughput: 0: 5728.1. Samples: 1154926774. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:12,019][25689] Avg episode reward: [(0, '-0.663')] [2022-07-11 09:12:12,674][26022] Updated weights on worker 0-0, policy_version 1127854 (0.00087) [2022-07-11 09:12:14,341][26022] Updated weights on worker 0-0, policy_version 1127864 (0.00086) [2022-07-11 09:12:16,345][26022] Updated weights on worker 0-0, policy_version 1127874 (0.00086) [2022-07-11 09:12:17,078][25689] Fps is (10 sec: 5793.7, 60 sec: 5542.1, 300 sec: 5576.4). Total num frames: 1154948096. Throughput: 0: 4898.1. Samples: 1154943288. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:17,078][25689] Avg episode reward: [(0, '-0.291')] [2022-07-11 09:12:18,147][26022] Updated weights on worker 0-0, policy_version 1127884 (0.00088) [2022-07-11 09:12:19,863][26022] Updated weights on worker 0-0, policy_version 1127894 (0.00089) [2022-07-11 09:12:21,902][26022] Updated weights on worker 0-0, policy_version 1127904 (0.00089) [2022-07-11 09:12:22,120][25689] Fps is (10 sec: 5271.5, 60 sec: 5524.6, 300 sec: 5565.9). Total num frames: 1154973696. Throughput: 0: 5728.1. Samples: 1154976836. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:22,122][25689] Avg episode reward: [(0, '-0.828')] [2022-07-11 09:12:23,478][26022] Updated weights on worker 0-0, policy_version 1127914 (0.00064) [2022-07-11 09:12:25,455][26022] Updated weights on worker 0-0, policy_version 1127924 (0.00084) [2022-07-11 09:12:26,954][26022] Updated weights on worker 0-0, policy_version 1127934 (0.00087) [2022-07-11 09:12:27,147][25689] Fps is (10 sec: 5593.1, 60 sec: 5573.6, 300 sec: 5576.0). Total num frames: 1155004416. Throughput: 0: 5838.5. Samples: 1155010572. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:27,148][25689] Avg episode reward: [(0, '-0.551')] [2022-07-11 09:12:29,094][26022] Updated weights on worker 0-0, policy_version 1127944 (0.00085) [2022-07-11 09:12:30,831][26022] Updated weights on worker 0-0, policy_version 1127954 (0.00088) [2022-07-11 09:12:32,174][25689] Fps is (10 sec: 5805.1, 60 sec: 5537.8, 300 sec: 5569.8). Total num frames: 1155032064. Throughput: 0: 4996.8. Samples: 1155027488. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2022-07-11 09:12:32,175][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 09:12:32,662][26022] Updated weights on worker 0-0, policy_version 1127964 (0.00091) [2022-07-11 09:12:34,448][26022] Updated weights on worker 0-0, policy_version 1127974 (0.00086) [2022-07-11 09:12:36,254][26022] Updated weights on worker 0-0, policy_version 1127984 (0.00082) [2022-07-11 09:12:37,254][25689] Fps is (10 sec: 5471.3, 60 sec: 5551.7, 300 sec: 5565.1). Total num frames: 1155059712. Throughput: 0: 5840.8. Samples: 1155061132. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:12:37,254][25689] Avg episode reward: [(0, '0.274')] [2022-07-11 09:12:38,217][26022] Updated weights on worker 0-0, policy_version 1127994 (0.00085) [2022-07-11 09:12:39,915][26022] Updated weights on worker 0-0, policy_version 1128004 (0.00087) [2022-07-11 09:12:41,742][26022] Updated weights on worker 0-0, policy_version 1128014 (0.00085) [2022-07-11 09:12:42,268][25689] Fps is (10 sec: 5681.3, 60 sec: 5569.5, 300 sec: 5572.3). Total num frames: 1155089408. Throughput: 0: 5864.5. Samples: 1155094994. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:12:42,268][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 09:12:43,461][26022] Updated weights on worker 0-0, policy_version 1128024 (0.00085) [2022-07-11 09:12:45,414][26022] Updated weights on worker 0-0, policy_version 1128034 (0.00088) [2022-07-11 09:12:47,332][25689] Fps is (10 sec: 5588.6, 60 sec: 5532.7, 300 sec: 5566.1). Total num frames: 1155116032. Throughput: 0: 5013.7. Samples: 1155111770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:12:47,332][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 09:12:47,460][26022] Updated weights on worker 0-0, policy_version 1128044 (0.00092) [2022-07-11 09:12:48,965][26022] Updated weights on worker 0-0, policy_version 1128054 (0.00087) [2022-07-11 09:12:51,029][26022] Updated weights on worker 0-0, policy_version 1128064 (0.00085) [2022-07-11 09:12:52,387][25689] Fps is (10 sec: 5565.8, 60 sec: 5565.6, 300 sec: 5572.9). Total num frames: 1155145728. Throughput: 0: 5850.9. Samples: 1155145748. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:12:52,388][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 09:12:52,624][26022] Updated weights on worker 0-0, policy_version 1128074 (0.00085) [2022-07-11 09:12:54,560][26022] Updated weights on worker 0-0, policy_version 1128084 (0.00087) [2022-07-11 09:12:56,367][26022] Updated weights on worker 0-0, policy_version 1128094 (0.00089) [2022-07-11 09:12:57,476][25689] Fps is (10 sec: 5753.8, 60 sec: 5562.4, 300 sec: 5568.2). Total num frames: 1155174400. Throughput: 0: 5852.6. Samples: 1155179484. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:12:57,478][25689] Avg episode reward: [(0, '-0.260')] [2022-07-11 09:12:58,008][26022] Updated weights on worker 0-0, policy_version 1128104 (0.00083) [2022-07-11 09:13:00,012][26022] Updated weights on worker 0-0, policy_version 1128114 (0.00091) [2022-07-11 09:13:02,295][26022] Updated weights on worker 0-0, policy_version 1128124 (0.00089) [2022-07-11 09:13:02,511][25689] Fps is (10 sec: 5360.7, 60 sec: 5550.1, 300 sec: 5571.4). Total num frames: 1155200000. Throughput: 0: 5007.7. Samples: 1155196364. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:02,512][25689] Avg episode reward: [(0, '-0.261')] [2022-07-11 09:13:03,997][26022] Updated weights on worker 0-0, policy_version 1128134 (0.00082) [2022-07-11 09:13:05,967][26022] Updated weights on worker 0-0, policy_version 1128144 (0.00080) [2022-07-11 09:13:07,336][26022] Updated weights on worker 0-0, policy_version 1128154 (0.00082) [2022-07-11 09:13:07,604][25689] Fps is (10 sec: 5560.8, 60 sec: 5626.9, 300 sec: 5573.3). Total num frames: 1155230720. Throughput: 0: 5746.5. Samples: 1155228266. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:07,604][25689] Avg episode reward: [(0, '-0.031')] [2022-07-11 09:13:09,521][26022] Updated weights on worker 0-0, policy_version 1128164 (0.00081) [2022-07-11 09:13:10,935][26022] Updated weights on worker 0-0, policy_version 1128174 (0.00099) [2022-07-11 09:13:12,663][25689] Fps is (10 sec: 5547.3, 60 sec: 5538.4, 300 sec: 5571.2). Total num frames: 1155256320. Throughput: 0: 5746.2. Samples: 1155262262. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:12,664][25689] Avg episode reward: [(0, '-0.897')] [2022-07-11 09:13:13,094][26022] Updated weights on worker 0-0, policy_version 1128184 (0.00086) [2022-07-11 09:13:14,985][26022] Updated weights on worker 0-0, policy_version 1128194 (0.00092) [2022-07-11 09:13:16,835][26022] Updated weights on worker 0-0, policy_version 1128204 (0.00086) [2022-07-11 09:13:17,726][25689] Fps is (10 sec: 5463.1, 60 sec: 5571.8, 300 sec: 5570.3). Total num frames: 1155286016. Throughput: 0: 5730.1. Samples: 1155295518. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:17,726][25689] Avg episode reward: [(0, '-0.109')] [2022-07-11 09:13:18,716][26022] Updated weights on worker 0-0, policy_version 1128214 (0.00106) [2022-07-11 09:13:20,591][26022] Updated weights on worker 0-0, policy_version 1128224 (0.00083) [2022-07-11 09:13:22,354][26022] Updated weights on worker 0-0, policy_version 1128234 (0.00087) [2022-07-11 09:13:22,741][25689] Fps is (10 sec: 5893.5, 60 sec: 5641.9, 300 sec: 5577.2). Total num frames: 1155315712. Throughput: 0: 5734.2. Samples: 1155312368. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:22,741][25689] Avg episode reward: [(0, '0.084')] [2022-07-11 09:13:24,310][26022] Updated weights on worker 0-0, policy_version 1128244 (0.00088) [2022-07-11 09:13:25,670][26022] Updated weights on worker 0-0, policy_version 1128254 (0.00089) [2022-07-11 09:13:27,808][25689] Fps is (10 sec: 5484.4, 60 sec: 5553.8, 300 sec: 5566.2). Total num frames: 1155341312. Throughput: 0: 5832.0. Samples: 1155346098. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:27,809][25689] Avg episode reward: [(0, '1.062')] [2022-07-11 09:13:27,939][26022] Updated weights on worker 0-0, policy_version 1128264 (0.00090) [2022-07-11 09:13:29,569][26022] Updated weights on worker 0-0, policy_version 1128274 (0.00081) [2022-07-11 09:13:31,511][26022] Updated weights on worker 0-0, policy_version 1128284 (0.00369) [2022-07-11 09:13:32,839][25689] Fps is (10 sec: 5476.0, 60 sec: 5587.2, 300 sec: 5571.6). Total num frames: 1155371008. Throughput: 0: 5831.4. Samples: 1155379912. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:32,839][25689] Avg episode reward: [(0, '0.206')] [2022-07-11 09:13:33,213][26022] Updated weights on worker 0-0, policy_version 1128294 (0.00087) [2022-07-11 09:13:34,977][26022] Updated weights on worker 0-0, policy_version 1128304 (0.00090) [2022-07-11 09:13:37,011][26022] Updated weights on worker 0-0, policy_version 1128314 (0.00089) [2022-07-11 09:13:37,926][25689] Fps is (10 sec: 5566.3, 60 sec: 5569.6, 300 sec: 5570.5). Total num frames: 1155397632. Throughput: 0: 5006.4. Samples: 1155396650. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:37,927][25689] Avg episode reward: [(0, '0.077')] [2022-07-11 09:13:38,684][26022] Updated weights on worker 0-0, policy_version 1128324 (0.00093) [2022-07-11 09:13:40,761][26022] Updated weights on worker 0-0, policy_version 1128334 (0.00088) [2022-07-11 09:13:42,287][26022] Updated weights on worker 0-0, policy_version 1128344 (0.00098) [2022-07-11 09:13:42,957][25689] Fps is (10 sec: 5566.4, 60 sec: 5568.1, 300 sec: 5570.4). Total num frames: 1155427328. Throughput: 0: 5828.4. Samples: 1155430194. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:42,957][25689] Avg episode reward: [(0, '1.111')] [2022-07-11 09:13:44,501][26022] Updated weights on worker 0-0, policy_version 1128354 (0.00094) [2022-07-11 09:13:45,935][26022] Updated weights on worker 0-0, policy_version 1128364 (0.00086) [2022-07-11 09:13:47,990][25689] Fps is (10 sec: 5698.2, 60 sec: 5587.8, 300 sec: 5566.7). Total num frames: 1155454976. Throughput: 0: 5840.5. Samples: 1155463970. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:47,990][25689] Avg episode reward: [(0, '1.269')] [2022-07-11 09:13:47,995][26022] Updated weights on worker 0-0, policy_version 1128374 (0.00091) [2022-07-11 09:13:49,692][26022] Updated weights on worker 0-0, policy_version 1128384 (0.00083) [2022-07-11 09:13:51,663][26022] Updated weights on worker 0-0, policy_version 1128394 (0.00091) [2022-07-11 09:13:52,792][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:13:52,813][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001128400_1155481600.pth [2022-07-11 09:13:52,819][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001126441_1153475584.pth [2022-07-11 09:13:53,011][25689] Fps is (10 sec: 5601.6, 60 sec: 5574.0, 300 sec: 5567.4). Total num frames: 1155483648. Throughput: 0: 4983.3. Samples: 1155480436. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:53,011][25689] Avg episode reward: [(0, '1.463')] [2022-07-11 09:13:53,451][26022] Updated weights on worker 0-0, policy_version 1128404 (0.00085) [2022-07-11 09:13:55,192][26022] Updated weights on worker 0-0, policy_version 1128414 (0.00080) [2022-07-11 09:13:57,118][26022] Updated weights on worker 0-0, policy_version 1128424 (0.00086) [2022-07-11 09:13:58,065][25689] Fps is (10 sec: 5691.6, 60 sec: 5577.2, 300 sec: 5566.7). Total num frames: 1155512320. Throughput: 0: 5829.8. Samples: 1155514054. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:13:58,065][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 09:13:58,895][26022] Updated weights on worker 0-0, policy_version 1128434 (0.00091) [2022-07-11 09:14:00,767][26022] Updated weights on worker 0-0, policy_version 1128444 (0.00084) [2022-07-11 09:14:03,003][26022] Updated weights on worker 0-0, policy_version 1128454 (0.00080) [2022-07-11 09:14:03,080][25689] Fps is (10 sec: 5288.4, 60 sec: 5562.2, 300 sec: 5563.5). Total num frames: 1155536896. Throughput: 0: 5730.6. Samples: 1155545512. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:03,082][25689] Avg episode reward: [(0, '1.160')] [2022-07-11 09:14:04,856][26022] Updated weights on worker 0-0, policy_version 1128464 (0.00087) [2022-07-11 09:14:06,447][26022] Updated weights on worker 0-0, policy_version 1128474 (0.00097) [2022-07-11 09:14:08,088][25689] Fps is (10 sec: 5210.3, 60 sec: 5519.2, 300 sec: 5563.6). Total num frames: 1155564544. Throughput: 0: 4904.4. Samples: 1155562540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:08,092][25689] Avg episode reward: [(0, '1.236')] [2022-07-11 09:14:08,367][26022] Updated weights on worker 0-0, policy_version 1128484 (0.00096) [2022-07-11 09:14:10,344][26022] Updated weights on worker 0-0, policy_version 1128494 (0.00084) [2022-07-11 09:14:12,171][26022] Updated weights on worker 0-0, policy_version 1128504 (0.00087) [2022-07-11 09:14:13,099][25689] Fps is (10 sec: 5723.5, 60 sec: 5591.4, 300 sec: 5565.1). Total num frames: 1155594240. Throughput: 0: 5759.7. Samples: 1155596136. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:13,099][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 09:14:13,909][26022] Updated weights on worker 0-0, policy_version 1128514 (0.00086) [2022-07-11 09:14:15,704][26022] Updated weights on worker 0-0, policy_version 1128524 (0.00115) [2022-07-11 09:14:17,633][26022] Updated weights on worker 0-0, policy_version 1128534 (0.00085) [2022-07-11 09:14:18,236][25689] Fps is (10 sec: 5549.8, 60 sec: 5533.7, 300 sec: 5560.5). Total num frames: 1155620864. Throughput: 0: 5732.5. Samples: 1155629686. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:18,237][25689] Avg episode reward: [(0, '0.617')] [2022-07-11 09:14:19,550][26022] Updated weights on worker 0-0, policy_version 1128544 (0.00087) [2022-07-11 09:14:21,212][26022] Updated weights on worker 0-0, policy_version 1128554 (0.00085) [2022-07-11 09:14:22,967][26022] Updated weights on worker 0-0, policy_version 1128564 (0.00098) [2022-07-11 09:14:23,239][25689] Fps is (10 sec: 5554.2, 60 sec: 5534.9, 300 sec: 5567.8). Total num frames: 1155650560. Throughput: 0: 5009.1. Samples: 1155646490. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:23,239][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 09:14:24,910][26022] Updated weights on worker 0-0, policy_version 1128574 (0.00090) [2022-07-11 09:14:26,775][26022] Updated weights on worker 0-0, policy_version 1128584 (0.00092) [2022-07-11 09:14:28,251][25689] Fps is (10 sec: 5624.0, 60 sec: 5556.9, 300 sec: 5564.8). Total num frames: 1155677184. Throughput: 0: 5825.8. Samples: 1155680002. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:28,251][25689] Avg episode reward: [(0, '0.564')] [2022-07-11 09:14:28,656][26022] Updated weights on worker 0-0, policy_version 1128594 (0.00064) [2022-07-11 09:14:30,352][26022] Updated weights on worker 0-0, policy_version 1128604 (0.00071) [2022-07-11 09:14:32,234][26022] Updated weights on worker 0-0, policy_version 1128614 (0.00094) [2022-07-11 09:14:33,269][25689] Fps is (10 sec: 5513.1, 60 sec: 5541.1, 300 sec: 5569.0). Total num frames: 1155705856. Throughput: 0: 5810.8. Samples: 1155713340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:33,269][25689] Avg episode reward: [(0, '0.826')] [2022-07-11 09:14:34,136][26022] Updated weights on worker 0-0, policy_version 1128624 (0.00082) [2022-07-11 09:14:35,986][26022] Updated weights on worker 0-0, policy_version 1128634 (0.00086) [2022-07-11 09:14:37,839][26022] Updated weights on worker 0-0, policy_version 1128644 (0.00088) [2022-07-11 09:14:38,311][25689] Fps is (10 sec: 5700.1, 60 sec: 5579.2, 300 sec: 5564.8). Total num frames: 1155734528. Throughput: 0: 4988.5. Samples: 1155729826. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:38,311][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 09:14:39,399][26022] Updated weights on worker 0-0, policy_version 1128654 (0.00085) [2022-07-11 09:14:41,593][26022] Updated weights on worker 0-0, policy_version 1128664 (0.00082) [2022-07-11 09:14:43,255][26022] Updated weights on worker 0-0, policy_version 1128674 (0.00085) [2022-07-11 09:14:43,353][25689] Fps is (10 sec: 5585.0, 60 sec: 5544.2, 300 sec: 5567.8). Total num frames: 1155762176. Throughput: 0: 5809.9. Samples: 1155763350. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:43,353][25689] Avg episode reward: [(0, '0.606')] [2022-07-11 09:14:45,275][26022] Updated weights on worker 0-0, policy_version 1128684 (0.00085) [2022-07-11 09:14:47,199][26022] Updated weights on worker 0-0, policy_version 1128694 (0.00078) [2022-07-11 09:14:48,372][25689] Fps is (10 sec: 5496.1, 60 sec: 5545.5, 300 sec: 5564.1). Total num frames: 1155789824. Throughput: 0: 5797.0. Samples: 1155796644. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:48,372][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 09:14:48,911][26022] Updated weights on worker 0-0, policy_version 1128704 (0.00085) [2022-07-11 09:14:50,823][26022] Updated weights on worker 0-0, policy_version 1128714 (0.00076) [2022-07-11 09:14:52,564][26022] Updated weights on worker 0-0, policy_version 1128724 (0.00083) [2022-07-11 09:14:53,377][25689] Fps is (10 sec: 5720.8, 60 sec: 5563.9, 300 sec: 5568.6). Total num frames: 1155819520. Throughput: 0: 4989.3. Samples: 1155813664. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:53,377][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 09:14:54,428][26022] Updated weights on worker 0-0, policy_version 1128734 (0.00085) [2022-07-11 09:14:56,172][26022] Updated weights on worker 0-0, policy_version 1128744 (0.00086) [2022-07-11 09:14:58,031][26022] Updated weights on worker 0-0, policy_version 1128754 (0.00082) [2022-07-11 09:14:58,431][25689] Fps is (10 sec: 5497.0, 60 sec: 5513.0, 300 sec: 5557.8). Total num frames: 1155845120. Throughput: 0: 5841.0. Samples: 1155847348. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:14:58,431][25689] Avg episode reward: [(0, '1.250')] [2022-07-11 09:14:59,883][26022] Updated weights on worker 0-0, policy_version 1128764 (0.00082) [2022-07-11 09:15:01,771][26022] Updated weights on worker 0-0, policy_version 1128774 (0.00115) [2022-07-11 09:15:03,446][25689] Fps is (10 sec: 5084.9, 60 sec: 5530.0, 300 sec: 5554.3). Total num frames: 1155870720. Throughput: 0: 5737.6. Samples: 1155878634. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:03,447][25689] Avg episode reward: [(0, '1.447')] [2022-07-11 09:15:03,762][26022] Updated weights on worker 0-0, policy_version 1128784 (0.00088) [2022-07-11 09:15:05,675][26022] Updated weights on worker 0-0, policy_version 1128794 (0.00084) [2022-07-11 09:15:07,571][26022] Updated weights on worker 0-0, policy_version 1128804 (0.00089) [2022-07-11 09:15:08,456][25689] Fps is (10 sec: 5515.5, 60 sec: 5563.7, 300 sec: 5561.4). Total num frames: 1155900416. Throughput: 0: 4923.1. Samples: 1155895524. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:08,457][25689] Avg episode reward: [(0, '1.684')] [2022-07-11 09:15:09,320][26022] Updated weights on worker 0-0, policy_version 1128814 (0.00212) [2022-07-11 09:15:11,056][26022] Updated weights on worker 0-0, policy_version 1128824 (0.00091) [2022-07-11 09:15:12,955][26022] Updated weights on worker 0-0, policy_version 1128834 (0.00081) [2022-07-11 09:15:13,497][25689] Fps is (10 sec: 5705.0, 60 sec: 5527.0, 300 sec: 5562.6). Total num frames: 1155928064. Throughput: 0: 5746.3. Samples: 1155929282. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:13,498][25689] Avg episode reward: [(0, '1.445')] [2022-07-11 09:15:14,699][26022] Updated weights on worker 0-0, policy_version 1128844 (0.00088) [2022-07-11 09:15:16,543][26022] Updated weights on worker 0-0, policy_version 1128854 (0.00086) [2022-07-11 09:15:18,544][25689] Fps is (10 sec: 5481.6, 60 sec: 5552.3, 300 sec: 5551.5). Total num frames: 1155955712. Throughput: 0: 5734.2. Samples: 1155962680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:18,545][25689] Avg episode reward: [(0, '0.749')] [2022-07-11 09:15:18,555][26022] Updated weights on worker 0-0, policy_version 1128864 (0.00087) [2022-07-11 09:15:20,299][26022] Updated weights on worker 0-0, policy_version 1128874 (0.00095) [2022-07-11 09:15:22,134][26022] Updated weights on worker 0-0, policy_version 1128884 (0.00096) [2022-07-11 09:15:23,550][25689] Fps is (10 sec: 5704.5, 60 sec: 5552.0, 300 sec: 5565.7). Total num frames: 1155985408. Throughput: 0: 5021.6. Samples: 1155979588. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:23,550][25689] Avg episode reward: [(0, '-0.360')] [2022-07-11 09:15:24,011][26022] Updated weights on worker 0-0, policy_version 1128894 (0.00085) [2022-07-11 09:15:25,743][26022] Updated weights on worker 0-0, policy_version 1128904 (0.00092) [2022-07-11 09:15:27,747][26022] Updated weights on worker 0-0, policy_version 1128914 (0.00090) [2022-07-11 09:15:28,626][25689] Fps is (10 sec: 5687.8, 60 sec: 5563.1, 300 sec: 5561.3). Total num frames: 1156013056. Throughput: 0: 5835.1. Samples: 1156013216. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:28,627][25689] Avg episode reward: [(0, '-1.004')] [2022-07-11 09:15:29,423][26022] Updated weights on worker 0-0, policy_version 1128924 (0.00092) [2022-07-11 09:15:31,268][26022] Updated weights on worker 0-0, policy_version 1128934 (0.00088) [2022-07-11 09:15:33,123][26022] Updated weights on worker 0-0, policy_version 1128944 (0.00093) [2022-07-11 09:15:33,684][25689] Fps is (10 sec: 5456.3, 60 sec: 5542.4, 300 sec: 5557.8). Total num frames: 1156040704. Throughput: 0: 5838.1. Samples: 1156047134. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:33,684][25689] Avg episode reward: [(0, '-4.050')] [2022-07-11 09:15:34,875][26022] Updated weights on worker 0-0, policy_version 1128954 (0.00087) [2022-07-11 09:15:36,843][26022] Updated weights on worker 0-0, policy_version 1128964 (0.00082) [2022-07-11 09:15:38,455][26022] Updated weights on worker 0-0, policy_version 1128974 (0.00096) [2022-07-11 09:15:38,815][25689] Fps is (10 sec: 5628.2, 60 sec: 5551.3, 300 sec: 5562.8). Total num frames: 1156070400. Throughput: 0: 4995.7. Samples: 1156063946. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:38,815][25689] Avg episode reward: [(0, '-4.108')] [2022-07-11 09:15:40,362][26022] Updated weights on worker 0-0, policy_version 1128984 (0.00092) [2022-07-11 09:15:42,218][26022] Updated weights on worker 0-0, policy_version 1128994 (0.00086) [2022-07-11 09:15:43,864][25689] Fps is (10 sec: 5633.1, 60 sec: 5550.6, 300 sec: 5558.6). Total num frames: 1156098048. Throughput: 0: 5807.2. Samples: 1156097556. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:43,864][25689] Avg episode reward: [(0, '-3.999')] [2022-07-11 09:15:44,097][26022] Updated weights on worker 0-0, policy_version 1129004 (0.00086) [2022-07-11 09:15:45,964][26022] Updated weights on worker 0-0, policy_version 1129014 (0.00086) [2022-07-11 09:15:47,453][26022] Updated weights on worker 0-0, policy_version 1129024 (0.00093) [2022-07-11 09:15:48,870][25689] Fps is (10 sec: 5601.1, 60 sec: 5568.7, 300 sec: 5559.3). Total num frames: 1156126720. Throughput: 0: 5842.3. Samples: 1156131486. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:48,870][25689] Avg episode reward: [(0, '-4.518')] [2022-07-11 09:15:49,455][26022] Updated weights on worker 0-0, policy_version 1129034 (0.00084) [2022-07-11 09:15:51,473][26022] Updated weights on worker 0-0, policy_version 1129044 (0.00085) [2022-07-11 09:15:53,003][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:15:53,018][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001129053_1156150272.pth [2022-07-11 09:15:53,018][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001127095_1154145280.pth [2022-07-11 09:15:53,162][26022] Updated weights on worker 0-0, policy_version 1129054 (0.00086) [2022-07-11 09:15:53,894][25689] Fps is (10 sec: 5717.3, 60 sec: 5550.1, 300 sec: 5566.9). Total num frames: 1156155392. Throughput: 0: 5009.6. Samples: 1156148376. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:53,894][25689] Avg episode reward: [(0, '-4.024')] [2022-07-11 09:15:55,072][26022] Updated weights on worker 0-0, policy_version 1129064 (0.00090) [2022-07-11 09:15:56,812][26022] Updated weights on worker 0-0, policy_version 1129074 (0.00088) [2022-07-11 09:15:58,764][26022] Updated weights on worker 0-0, policy_version 1129084 (0.00081) [2022-07-11 09:15:58,963][25689] Fps is (10 sec: 5478.4, 60 sec: 5565.6, 300 sec: 5552.8). Total num frames: 1156182016. Throughput: 0: 5859.1. Samples: 1156181998. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:15:58,965][25689] Avg episode reward: [(0, '-2.211')] [2022-07-11 09:16:00,358][26022] Updated weights on worker 0-0, policy_version 1129094 (0.00096) [2022-07-11 09:16:02,777][26022] Updated weights on worker 0-0, policy_version 1129104 (0.00088) [2022-07-11 09:16:03,976][25689] Fps is (10 sec: 5382.9, 60 sec: 5599.6, 300 sec: 5564.3). Total num frames: 1156209664. Throughput: 0: 5761.4. Samples: 1156213430. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:03,977][25689] Avg episode reward: [(0, '-1.956')] [2022-07-11 09:16:04,395][26022] Updated weights on worker 0-0, policy_version 1129114 (0.00087) [2022-07-11 09:16:06,466][26022] Updated weights on worker 0-0, policy_version 1129124 (0.00091) [2022-07-11 09:16:07,991][26022] Updated weights on worker 0-0, policy_version 1129134 (0.00087) [2022-07-11 09:16:09,011][25689] Fps is (10 sec: 5503.1, 60 sec: 5563.5, 300 sec: 5557.3). Total num frames: 1156237312. Throughput: 0: 4915.4. Samples: 1156230490. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:09,012][25689] Avg episode reward: [(0, '-2.032')] [2022-07-11 09:16:10,031][26022] Updated weights on worker 0-0, policy_version 1129144 (0.00086) [2022-07-11 09:16:11,651][26022] Updated weights on worker 0-0, policy_version 1129154 (0.00086) [2022-07-11 09:16:13,720][26022] Updated weights on worker 0-0, policy_version 1129164 (0.00093) [2022-07-11 09:16:14,027][25689] Fps is (10 sec: 5501.6, 60 sec: 5565.8, 300 sec: 5554.4). Total num frames: 1156264960. Throughput: 0: 5757.7. Samples: 1156264296. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:14,027][25689] Avg episode reward: [(0, '-1.040')] [2022-07-11 09:16:15,537][26022] Updated weights on worker 0-0, policy_version 1129174 (0.00082) [2022-07-11 09:16:17,463][26022] Updated weights on worker 0-0, policy_version 1129184 (0.00080) [2022-07-11 09:16:19,075][25689] Fps is (10 sec: 5596.5, 60 sec: 5582.6, 300 sec: 5561.1). Total num frames: 1156293632. Throughput: 0: 5739.4. Samples: 1156297426. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:19,075][25689] Avg episode reward: [(0, '0.245')] [2022-07-11 09:16:19,264][26022] Updated weights on worker 0-0, policy_version 1129194 (0.00088) [2022-07-11 09:16:21,084][26022] Updated weights on worker 0-0, policy_version 1129204 (0.00083) [2022-07-11 09:16:22,828][26022] Updated weights on worker 0-0, policy_version 1129214 (0.00091) [2022-07-11 09:16:24,091][25689] Fps is (10 sec: 5596.1, 60 sec: 5547.8, 300 sec: 5560.9). Total num frames: 1156321280. Throughput: 0: 5840.9. Samples: 1156330920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:24,091][25689] Avg episode reward: [(0, '0.263')] [2022-07-11 09:16:24,726][26022] Updated weights on worker 0-0, policy_version 1129224 (0.00078) [2022-07-11 09:16:26,635][26022] Updated weights on worker 0-0, policy_version 1129234 (0.00088) [2022-07-11 09:16:28,465][26022] Updated weights on worker 0-0, policy_version 1129244 (0.00085) [2022-07-11 09:16:29,116][25689] Fps is (10 sec: 5609.0, 60 sec: 5569.5, 300 sec: 5557.2). Total num frames: 1156349952. Throughput: 0: 5832.1. Samples: 1156347740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:29,117][25689] Avg episode reward: [(0, '1.206')] [2022-07-11 09:16:30,191][26022] Updated weights on worker 0-0, policy_version 1129254 (0.00092) [2022-07-11 09:16:31,947][26022] Updated weights on worker 0-0, policy_version 1129264 (0.00086) [2022-07-11 09:16:33,713][26022] Updated weights on worker 0-0, policy_version 1129274 (0.00083) [2022-07-11 09:16:34,133][25689] Fps is (10 sec: 5608.1, 60 sec: 5573.2, 300 sec: 5561.1). Total num frames: 1156377600. Throughput: 0: 5832.1. Samples: 1156381560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 09:16:34,134][25689] Avg episode reward: [(0, '1.319')] [2022-07-11 09:16:35,568][26022] Updated weights on worker 0-0, policy_version 1129284 (0.00077) [2022-07-11 09:16:37,471][26022] Updated weights on worker 0-0, policy_version 1129294 (0.00857) [2022-07-11 09:16:39,170][25689] Fps is (10 sec: 5499.6, 60 sec: 5547.9, 300 sec: 5557.4). Total num frames: 1156405248. Throughput: 0: 5861.7. Samples: 1156415220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:16:39,171][25689] Avg episode reward: [(0, '1.330')] [2022-07-11 09:16:39,344][26022] Updated weights on worker 0-0, policy_version 1129304 (0.00092) [2022-07-11 09:16:41,162][26022] Updated weights on worker 0-0, policy_version 1129314 (0.00087) [2022-07-11 09:16:43,023][26022] Updated weights on worker 0-0, policy_version 1129324 (0.00089) [2022-07-11 09:16:44,201][25689] Fps is (10 sec: 5492.6, 60 sec: 5549.6, 300 sec: 5554.0). Total num frames: 1156432896. Throughput: 0: 5022.5. Samples: 1156431922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:16:44,201][25689] Avg episode reward: [(0, '1.184')] [2022-07-11 09:16:44,644][26022] Updated weights on worker 0-0, policy_version 1129334 (0.00088) [2022-07-11 09:16:46,781][26022] Updated weights on worker 0-0, policy_version 1129344 (0.00087) [2022-07-11 09:16:48,424][26022] Updated weights on worker 0-0, policy_version 1129354 (0.00092) [2022-07-11 09:16:49,205][25689] Fps is (10 sec: 5612.5, 60 sec: 5549.8, 300 sec: 5558.2). Total num frames: 1156461568. Throughput: 0: 5866.2. Samples: 1156465588. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:16:49,205][25689] Avg episode reward: [(0, '1.642')] [2022-07-11 09:16:50,299][26022] Updated weights on worker 0-0, policy_version 1129364 (0.00333) [2022-07-11 09:16:52,187][26022] Updated weights on worker 0-0, policy_version 1129374 (0.00087) [2022-07-11 09:16:53,991][26022] Updated weights on worker 0-0, policy_version 1129384 (0.00095) [2022-07-11 09:16:54,223][25689] Fps is (10 sec: 5721.7, 60 sec: 5550.3, 300 sec: 5558.9). Total num frames: 1156490240. Throughput: 0: 5841.7. Samples: 1156498918. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:16:54,224][25689] Avg episode reward: [(0, '1.715')] [2022-07-11 09:16:55,933][26022] Updated weights on worker 0-0, policy_version 1129394 (0.01007) [2022-07-11 09:16:57,598][26022] Updated weights on worker 0-0, policy_version 1129404 (0.00095) [2022-07-11 09:16:59,300][25689] Fps is (10 sec: 5579.0, 60 sec: 5566.6, 300 sec: 5562.5). Total num frames: 1156517888. Throughput: 0: 4980.2. Samples: 1156515470. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:16:59,301][25689] Avg episode reward: [(0, '0.894')] [2022-07-11 09:16:59,702][26022] Updated weights on worker 0-0, policy_version 1129414 (0.00088) [2022-07-11 09:17:01,540][26022] Updated weights on worker 0-0, policy_version 1129424 (0.00108) [2022-07-11 09:17:03,646][26022] Updated weights on worker 0-0, policy_version 1129434 (0.00086) [2022-07-11 09:17:04,327][25689] Fps is (10 sec: 5270.2, 60 sec: 5531.4, 300 sec: 5562.2). Total num frames: 1156543488. Throughput: 0: 5712.2. Samples: 1156546884. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:04,327][25689] Avg episode reward: [(0, '0.659')] [2022-07-11 09:17:05,267][26022] Updated weights on worker 0-0, policy_version 1129444 (0.00087) [2022-07-11 09:17:07,315][26022] Updated weights on worker 0-0, policy_version 1129454 (0.00079) [2022-07-11 09:17:08,986][26022] Updated weights on worker 0-0, policy_version 1129464 (0.00088) [2022-07-11 09:17:09,343][25689] Fps is (10 sec: 5404.2, 60 sec: 5550.1, 300 sec: 5555.3). Total num frames: 1156572160. Throughput: 0: 5711.6. Samples: 1156580606. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:09,343][25689] Avg episode reward: [(0, '-0.191')] [2022-07-11 09:17:11,017][26022] Updated weights on worker 0-0, policy_version 1129474 (0.00089) [2022-07-11 09:17:12,723][26022] Updated weights on worker 0-0, policy_version 1129484 (0.00090) [2022-07-11 09:17:14,352][25689] Fps is (10 sec: 5617.5, 60 sec: 5550.7, 300 sec: 5556.2). Total num frames: 1156599808. Throughput: 0: 4900.7. Samples: 1156597566. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:14,353][25689] Avg episode reward: [(0, '-0.297')] [2022-07-11 09:17:14,601][26022] Updated weights on worker 0-0, policy_version 1129494 (0.00089) [2022-07-11 09:17:16,473][26022] Updated weights on worker 0-0, policy_version 1129504 (0.00072) [2022-07-11 09:17:18,473][26022] Updated weights on worker 0-0, policy_version 1129514 (0.00092) [2022-07-11 09:17:19,401][25689] Fps is (10 sec: 5497.6, 60 sec: 5533.6, 300 sec: 5563.0). Total num frames: 1156627456. Throughput: 0: 5736.6. Samples: 1156630780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:19,401][25689] Avg episode reward: [(0, '-0.219')] [2022-07-11 09:17:20,040][26022] Updated weights on worker 0-0, policy_version 1129524 (0.00096) [2022-07-11 09:17:22,033][26022] Updated weights on worker 0-0, policy_version 1129534 (0.00081) [2022-07-11 09:17:23,787][26022] Updated weights on worker 0-0, policy_version 1129544 (0.00086) [2022-07-11 09:17:24,434][25689] Fps is (10 sec: 5586.5, 60 sec: 5549.1, 300 sec: 5556.0). Total num frames: 1156656128. Throughput: 0: 5841.3. Samples: 1156664336. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:24,434][25689] Avg episode reward: [(0, '-0.633')] [2022-07-11 09:17:25,830][26022] Updated weights on worker 0-0, policy_version 1129554 (0.01205) [2022-07-11 09:17:27,507][26022] Updated weights on worker 0-0, policy_version 1129564 (0.00079) [2022-07-11 09:17:29,389][26022] Updated weights on worker 0-0, policy_version 1129574 (0.00083) [2022-07-11 09:17:29,454][25689] Fps is (10 sec: 5602.5, 60 sec: 5532.6, 300 sec: 5556.1). Total num frames: 1156683776. Throughput: 0: 4998.3. Samples: 1156681126. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:29,454][25689] Avg episode reward: [(0, '0.347')] [2022-07-11 09:17:31,024][26022] Updated weights on worker 0-0, policy_version 1129584 (0.00090) [2022-07-11 09:17:33,059][26022] Updated weights on worker 0-0, policy_version 1129594 (0.00077) [2022-07-11 09:17:34,478][25689] Fps is (10 sec: 5607.3, 60 sec: 5548.9, 300 sec: 5560.6). Total num frames: 1156712448. Throughput: 0: 5828.7. Samples: 1156714872. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:34,478][25689] Avg episode reward: [(0, '-0.029')] [2022-07-11 09:17:34,675][26022] Updated weights on worker 0-0, policy_version 1129604 (0.00088) [2022-07-11 09:17:36,625][26022] Updated weights on worker 0-0, policy_version 1129614 (0.00088) [2022-07-11 09:17:38,292][26022] Updated weights on worker 0-0, policy_version 1129624 (0.00081) [2022-07-11 09:17:39,514][25689] Fps is (10 sec: 5598.2, 60 sec: 5549.0, 300 sec: 5553.3). Total num frames: 1156740096. Throughput: 0: 5850.3. Samples: 1156748448. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:39,516][25689] Avg episode reward: [(0, '-0.115')] [2022-07-11 09:17:40,369][26022] Updated weights on worker 0-0, policy_version 1129634 (0.00094) [2022-07-11 09:17:42,051][26022] Updated weights on worker 0-0, policy_version 1129644 (0.00083) [2022-07-11 09:17:43,894][26022] Updated weights on worker 0-0, policy_version 1129654 (0.00084) [2022-07-11 09:17:44,610][25689] Fps is (10 sec: 5558.4, 60 sec: 5559.9, 300 sec: 5559.5). Total num frames: 1156768768. Throughput: 0: 5003.4. Samples: 1156765288. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:44,611][25689] Avg episode reward: [(0, '-0.380')] [2022-07-11 09:17:45,837][26022] Updated weights on worker 0-0, policy_version 1129664 (0.00091) [2022-07-11 09:17:47,566][26022] Updated weights on worker 0-0, policy_version 1129674 (0.00092) [2022-07-11 09:17:49,457][26022] Updated weights on worker 0-0, policy_version 1129684 (0.00084) [2022-07-11 09:17:49,627][25689] Fps is (10 sec: 5569.1, 60 sec: 5541.8, 300 sec: 5553.4). Total num frames: 1156796416. Throughput: 0: 5824.8. Samples: 1156798632. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:49,627][25689] Avg episode reward: [(0, '-0.705')] [2022-07-11 09:17:51,180][26022] Updated weights on worker 0-0, policy_version 1129694 (0.00096) [2022-07-11 09:17:53,128][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:17:53,139][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001129703_1156815872.pth [2022-07-11 09:17:53,139][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001127748_1154813952.pth [2022-07-11 09:17:53,300][26022] Updated weights on worker 0-0, policy_version 1129704 (0.00089) [2022-07-11 09:17:54,703][25689] Fps is (10 sec: 5580.5, 60 sec: 5536.5, 300 sec: 5553.6). Total num frames: 1156825088. Throughput: 0: 5793.1. Samples: 1156832036. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:54,703][25689] Avg episode reward: [(0, '-0.602')] [2022-07-11 09:17:54,907][26022] Updated weights on worker 0-0, policy_version 1129714 (0.00094) [2022-07-11 09:17:56,793][26022] Updated weights on worker 0-0, policy_version 1129724 (0.00093) [2022-07-11 09:17:58,624][26022] Updated weights on worker 0-0, policy_version 1129734 (0.00098) [2022-07-11 09:17:59,767][25689] Fps is (10 sec: 5554.2, 60 sec: 5537.7, 300 sec: 5560.0). Total num frames: 1156852736. Throughput: 0: 4948.8. Samples: 1156848676. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:17:59,767][25689] Avg episode reward: [(0, '-0.515')] [2022-07-11 09:18:00,516][26022] Updated weights on worker 0-0, policy_version 1129744 (0.00084) [2022-07-11 09:18:02,666][26022] Updated weights on worker 0-0, policy_version 1129754 (0.00092) [2022-07-11 09:18:04,623][26022] Updated weights on worker 0-0, policy_version 1129764 (0.00083) [2022-07-11 09:18:04,799][25689] Fps is (10 sec: 5274.2, 60 sec: 5537.2, 300 sec: 5543.9). Total num frames: 1156878336. Throughput: 0: 5690.7. Samples: 1156880174. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:04,799][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 09:18:06,447][26022] Updated weights on worker 0-0, policy_version 1129774 (0.00091) [2022-07-11 09:18:08,171][26022] Updated weights on worker 0-0, policy_version 1129784 (0.00965) [2022-07-11 09:18:09,837][25689] Fps is (10 sec: 5389.4, 60 sec: 5535.2, 300 sec: 5554.6). Total num frames: 1156907008. Throughput: 0: 5704.8. Samples: 1156913926. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:09,837][25689] Avg episode reward: [(0, '0.285')] [2022-07-11 09:18:10,075][26022] Updated weights on worker 0-0, policy_version 1129794 (0.00087) [2022-07-11 09:18:11,892][26022] Updated weights on worker 0-0, policy_version 1129804 (0.00085) [2022-07-11 09:18:13,549][26022] Updated weights on worker 0-0, policy_version 1129814 (0.00086) [2022-07-11 09:18:14,850][25689] Fps is (10 sec: 5705.3, 60 sec: 5551.8, 300 sec: 5552.1). Total num frames: 1156935680. Throughput: 0: 4906.7. Samples: 1156930892. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:14,850][25689] Avg episode reward: [(0, '0.407')] [2022-07-11 09:18:15,570][26022] Updated weights on worker 0-0, policy_version 1129824 (0.00087) [2022-07-11 09:18:17,381][26022] Updated weights on worker 0-0, policy_version 1129834 (0.00085) [2022-07-11 09:18:19,107][26022] Updated weights on worker 0-0, policy_version 1129844 (0.00092) [2022-07-11 09:18:19,896][25689] Fps is (10 sec: 5700.8, 60 sec: 5569.0, 300 sec: 5548.1). Total num frames: 1156964352. Throughput: 0: 5755.2. Samples: 1156964522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:19,896][25689] Avg episode reward: [(0, '-0.438')] [2022-07-11 09:18:20,952][26022] Updated weights on worker 0-0, policy_version 1129854 (0.00088) [2022-07-11 09:18:22,903][26022] Updated weights on worker 0-0, policy_version 1129864 (0.00093) [2022-07-11 09:18:24,719][26022] Updated weights on worker 0-0, policy_version 1129874 (0.00087) [2022-07-11 09:18:24,910][25689] Fps is (10 sec: 5598.5, 60 sec: 5553.8, 300 sec: 5556.0). Total num frames: 1156992000. Throughput: 0: 5854.6. Samples: 1156997916. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:24,910][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 09:18:26,560][26022] Updated weights on worker 0-0, policy_version 1129884 (0.00091) [2022-07-11 09:18:28,249][26022] Updated weights on worker 0-0, policy_version 1129894 (0.00091) [2022-07-11 09:18:29,918][25689] Fps is (10 sec: 5517.5, 60 sec: 5554.9, 300 sec: 5549.5). Total num frames: 1157019648. Throughput: 0: 5022.1. Samples: 1157014774. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:29,918][25689] Avg episode reward: [(0, '0.070')] [2022-07-11 09:18:30,119][26022] Updated weights on worker 0-0, policy_version 1129904 (0.00085) [2022-07-11 09:18:32,233][26022] Updated weights on worker 0-0, policy_version 1129914 (0.00096) [2022-07-11 09:18:33,804][26022] Updated weights on worker 0-0, policy_version 1129924 (0.00092) [2022-07-11 09:18:34,946][25689] Fps is (10 sec: 5611.5, 60 sec: 5554.5, 300 sec: 5557.5). Total num frames: 1157048320. Throughput: 0: 5846.2. Samples: 1157048378. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:34,946][25689] Avg episode reward: [(0, '-0.074')] [2022-07-11 09:18:35,852][26022] Updated weights on worker 0-0, policy_version 1129934 (0.00090) [2022-07-11 09:18:37,447][26022] Updated weights on worker 0-0, policy_version 1129944 (0.00082) [2022-07-11 09:18:39,396][26022] Updated weights on worker 0-0, policy_version 1129954 (0.00087) [2022-07-11 09:18:40,073][25689] Fps is (10 sec: 5545.7, 60 sec: 5546.2, 300 sec: 5548.8). Total num frames: 1157075968. Throughput: 0: 5827.1. Samples: 1157082098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:40,075][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 09:18:41,199][26022] Updated weights on worker 0-0, policy_version 1129964 (0.00084) [2022-07-11 09:18:42,960][26022] Updated weights on worker 0-0, policy_version 1129974 (0.00091) [2022-07-11 09:18:44,770][26022] Updated weights on worker 0-0, policy_version 1129984 (0.00091) [2022-07-11 09:18:45,132][25689] Fps is (10 sec: 5529.3, 60 sec: 5549.7, 300 sec: 5551.8). Total num frames: 1157104640. Throughput: 0: 4990.1. Samples: 1157098826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:45,132][25689] Avg episode reward: [(0, '0.394')] [2022-07-11 09:18:46,756][26022] Updated weights on worker 0-0, policy_version 1129994 (0.00084) [2022-07-11 09:18:48,541][26022] Updated weights on worker 0-0, policy_version 1130004 (0.00084) [2022-07-11 09:18:50,147][25689] Fps is (10 sec: 5692.3, 60 sec: 5566.6, 300 sec: 5551.9). Total num frames: 1157133312. Throughput: 0: 5827.2. Samples: 1157132656. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:50,148][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 09:18:50,332][26022] Updated weights on worker 0-0, policy_version 1130014 (0.00092) [2022-07-11 09:18:52,203][26022] Updated weights on worker 0-0, policy_version 1130024 (0.00085) [2022-07-11 09:18:54,271][26022] Updated weights on worker 0-0, policy_version 1130034 (0.00098) [2022-07-11 09:18:55,180][25689] Fps is (10 sec: 5605.1, 60 sec: 5553.7, 300 sec: 5548.8). Total num frames: 1157160960. Throughput: 0: 5810.8. Samples: 1157165952. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:18:55,180][25689] Avg episode reward: [(0, '0.436')] [2022-07-11 09:18:55,782][26022] Updated weights on worker 0-0, policy_version 1130044 (0.00088) [2022-07-11 09:18:57,902][26022] Updated weights on worker 0-0, policy_version 1130054 (0.00090) [2022-07-11 09:18:59,334][26022] Updated weights on worker 0-0, policy_version 1130064 (0.00080) [2022-07-11 09:19:00,329][25689] Fps is (10 sec: 5431.1, 60 sec: 5545.9, 300 sec: 5556.7). Total num frames: 1157188608. Throughput: 0: 4967.3. Samples: 1157182712. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:00,329][25689] Avg episode reward: [(0, '-0.188')] [2022-07-11 09:19:01,712][26022] Updated weights on worker 0-0, policy_version 1130074 (0.00104) [2022-07-11 09:19:03,457][26022] Updated weights on worker 0-0, policy_version 1130084 (0.00083) [2022-07-11 09:19:05,245][26022] Updated weights on worker 0-0, policy_version 1130094 (0.00117) [2022-07-11 09:19:05,414][25689] Fps is (10 sec: 5503.0, 60 sec: 5591.7, 300 sec: 5558.7). Total num frames: 1157217280. Throughput: 0: 5696.1. Samples: 1157214356. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:05,415][25689] Avg episode reward: [(0, '-0.179')] [2022-07-11 09:19:07,173][26022] Updated weights on worker 0-0, policy_version 1130104 (0.00091) [2022-07-11 09:19:08,999][26022] Updated weights on worker 0-0, policy_version 1130114 (0.00086) [2022-07-11 09:19:10,492][25689] Fps is (10 sec: 5440.5, 60 sec: 5554.3, 300 sec: 5547.1). Total num frames: 1157243904. Throughput: 0: 5670.7. Samples: 1157248026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:10,493][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 09:19:10,864][26022] Updated weights on worker 0-0, policy_version 1130124 (0.00085) [2022-07-11 09:19:12,934][26022] Updated weights on worker 0-0, policy_version 1130134 (0.00089) [2022-07-11 09:19:14,430][26022] Updated weights on worker 0-0, policy_version 1130144 (0.00085) [2022-07-11 09:19:15,514][25689] Fps is (10 sec: 5474.7, 60 sec: 5553.4, 300 sec: 5556.1). Total num frames: 1157272576. Throughput: 0: 5707.6. Samples: 1157282014. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:15,515][25689] Avg episode reward: [(0, '-0.192')] [2022-07-11 09:19:16,509][26022] Updated weights on worker 0-0, policy_version 1130154 (0.00048) [2022-07-11 09:19:18,068][26022] Updated weights on worker 0-0, policy_version 1130164 (0.00084) [2022-07-11 09:19:19,987][26022] Updated weights on worker 0-0, policy_version 1130174 (0.00084) [2022-07-11 09:19:20,579][25689] Fps is (10 sec: 5583.7, 60 sec: 5534.9, 300 sec: 5548.1). Total num frames: 1157300224. Throughput: 0: 5732.7. Samples: 1157298800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:20,579][25689] Avg episode reward: [(0, '-0.248')] [2022-07-11 09:19:21,795][26022] Updated weights on worker 0-0, policy_version 1130184 (0.00095) [2022-07-11 09:19:23,888][26022] Updated weights on worker 0-0, policy_version 1130194 (0.00087) [2022-07-11 09:19:25,423][26022] Updated weights on worker 0-0, policy_version 1130204 (0.00087) [2022-07-11 09:19:25,595][25689] Fps is (10 sec: 5688.7, 60 sec: 5568.4, 300 sec: 5558.3). Total num frames: 1157329920. Throughput: 0: 5851.1. Samples: 1157332434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:25,595][25689] Avg episode reward: [(0, '-0.947')] [2022-07-11 09:19:27,527][26022] Updated weights on worker 0-0, policy_version 1130214 (0.00092) [2022-07-11 09:19:28,883][26022] Updated weights on worker 0-0, policy_version 1130224 (0.00084) [2022-07-11 09:19:30,627][25689] Fps is (10 sec: 5604.8, 60 sec: 5549.3, 300 sec: 5551.2). Total num frames: 1157356544. Throughput: 0: 5861.0. Samples: 1157366038. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:30,628][25689] Avg episode reward: [(0, '-1.554')] [2022-07-11 09:19:31,039][26022] Updated weights on worker 0-0, policy_version 1130234 (0.00088) [2022-07-11 09:19:32,853][26022] Updated weights on worker 0-0, policy_version 1130244 (0.00087) [2022-07-11 09:19:34,616][26022] Updated weights on worker 0-0, policy_version 1130254 (0.00090) [2022-07-11 09:19:35,635][25689] Fps is (10 sec: 5507.2, 60 sec: 5551.2, 300 sec: 5551.8). Total num frames: 1157385216. Throughput: 0: 5016.9. Samples: 1157382958. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:35,636][25689] Avg episode reward: [(0, '-1.608')] [2022-07-11 09:19:36,437][26022] Updated weights on worker 0-0, policy_version 1130264 (0.00089) [2022-07-11 09:19:38,200][26022] Updated weights on worker 0-0, policy_version 1130274 (0.00085) [2022-07-11 09:19:40,128][26022] Updated weights on worker 0-0, policy_version 1130284 (0.00086) [2022-07-11 09:19:40,723][25689] Fps is (10 sec: 5781.8, 60 sec: 5588.6, 300 sec: 5557.8). Total num frames: 1157414912. Throughput: 0: 5848.8. Samples: 1157416618. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:40,723][25689] Avg episode reward: [(0, '-1.976')] [2022-07-11 09:19:42,129][26022] Updated weights on worker 0-0, policy_version 1130294 (0.00413) [2022-07-11 09:19:43,464][26022] Updated weights on worker 0-0, policy_version 1130304 (0.00086) [2022-07-11 09:19:45,725][25689] Fps is (10 sec: 5480.3, 60 sec: 5543.0, 300 sec: 5551.3). Total num frames: 1157440512. Throughput: 0: 5850.5. Samples: 1157450208. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:45,726][25689] Avg episode reward: [(0, '-1.890')] [2022-07-11 09:19:45,784][26022] Updated weights on worker 0-0, policy_version 1130314 (0.00960) [2022-07-11 09:19:47,379][26022] Updated weights on worker 0-0, policy_version 1130324 (0.00083) [2022-07-11 09:19:49,299][26022] Updated weights on worker 0-0, policy_version 1130334 (0.00085) [2022-07-11 09:19:50,731][25689] Fps is (10 sec: 5627.1, 60 sec: 5577.7, 300 sec: 5554.7). Total num frames: 1157471232. Throughput: 0: 5024.9. Samples: 1157467058. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:50,732][25689] Avg episode reward: [(0, '-1.952')] [2022-07-11 09:19:51,087][26022] Updated weights on worker 0-0, policy_version 1130344 (0.00087) [2022-07-11 09:19:52,895][26022] Updated weights on worker 0-0, policy_version 1130354 (0.00088) [2022-07-11 09:19:53,267][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:19:53,278][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001130356_1157484544.pth [2022-07-11 09:19:53,278][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001128400_1155481600.pth [2022-07-11 09:19:54,782][26022] Updated weights on worker 0-0, policy_version 1130364 (0.00090) [2022-07-11 09:19:55,758][25689] Fps is (10 sec: 5817.8, 60 sec: 5578.2, 300 sec: 5562.1). Total num frames: 1157498880. Throughput: 0: 5857.2. Samples: 1157500820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:19:55,759][25689] Avg episode reward: [(0, '-0.417')] [2022-07-11 09:19:56,557][26022] Updated weights on worker 0-0, policy_version 1130374 (0.00092) [2022-07-11 09:19:58,274][26022] Updated weights on worker 0-0, policy_version 1130384 (0.00087) [2022-07-11 09:20:00,271][26022] Updated weights on worker 0-0, policy_version 1130394 (0.00081) [2022-07-11 09:20:00,811][25689] Fps is (10 sec: 5384.5, 60 sec: 5570.2, 300 sec: 5564.8). Total num frames: 1157525504. Throughput: 0: 5865.9. Samples: 1157534452. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:00,811][25689] Avg episode reward: [(0, '0.057')] [2022-07-11 09:20:01,908][26022] Updated weights on worker 0-0, policy_version 1130404 (0.00086) [2022-07-11 09:20:04,426][26022] Updated weights on worker 0-0, policy_version 1130414 (0.00080) [2022-07-11 09:20:05,831][25689] Fps is (10 sec: 5388.0, 60 sec: 5559.3, 300 sec: 5557.8). Total num frames: 1157553152. Throughput: 0: 4921.6. Samples: 1157549158. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:05,831][25689] Avg episode reward: [(0, '1.175')] [2022-07-11 09:20:05,896][26022] Updated weights on worker 0-0, policy_version 1130424 (0.00084) [2022-07-11 09:20:07,820][26022] Updated weights on worker 0-0, policy_version 1130434 (0.00105) [2022-07-11 09:20:09,545][26022] Updated weights on worker 0-0, policy_version 1130444 (0.00085) [2022-07-11 09:20:10,842][25689] Fps is (10 sec: 5512.5, 60 sec: 5582.4, 300 sec: 5558.3). Total num frames: 1157580800. Throughput: 0: 5767.9. Samples: 1157583052. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:10,842][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 09:20:11,526][26022] Updated weights on worker 0-0, policy_version 1130454 (0.00615) [2022-07-11 09:20:13,280][26022] Updated weights on worker 0-0, policy_version 1130464 (0.00088) [2022-07-11 09:20:15,123][26022] Updated weights on worker 0-0, policy_version 1130474 (0.00089) [2022-07-11 09:20:15,865][25689] Fps is (10 sec: 5510.7, 60 sec: 5565.3, 300 sec: 5558.8). Total num frames: 1157608448. Throughput: 0: 5773.7. Samples: 1157616912. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:15,867][25689] Avg episode reward: [(0, '1.140')] [2022-07-11 09:20:16,763][26022] Updated weights on worker 0-0, policy_version 1130484 (0.00084) [2022-07-11 09:20:18,898][26022] Updated weights on worker 0-0, policy_version 1130494 (0.00083) [2022-07-11 09:20:20,608][26022] Updated weights on worker 0-0, policy_version 1130504 (0.00090) [2022-07-11 09:20:20,996][25689] Fps is (10 sec: 5546.5, 60 sec: 5576.2, 300 sec: 5553.0). Total num frames: 1157637120. Throughput: 0: 4915.7. Samples: 1157633676. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:20,996][25689] Avg episode reward: [(0, '1.299')] [2022-07-11 09:20:22,470][26022] Updated weights on worker 0-0, policy_version 1130514 (0.00098) [2022-07-11 09:20:24,284][26022] Updated weights on worker 0-0, policy_version 1130524 (0.00091) [2022-07-11 09:20:26,023][25689] Fps is (10 sec: 5544.5, 60 sec: 5541.3, 300 sec: 5553.9). Total num frames: 1157664768. Throughput: 0: 5834.8. Samples: 1157666976. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:26,023][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 09:20:26,414][26022] Updated weights on worker 0-0, policy_version 1130534 (0.00091) [2022-07-11 09:20:28,059][26022] Updated weights on worker 0-0, policy_version 1130544 (0.00087) [2022-07-11 09:20:29,935][26022] Updated weights on worker 0-0, policy_version 1130554 (0.00090) [2022-07-11 09:20:31,039][25689] Fps is (10 sec: 5505.9, 60 sec: 5559.7, 300 sec: 5554.7). Total num frames: 1157692416. Throughput: 0: 5787.6. Samples: 1157699946. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2022-07-11 09:20:31,039][25689] Avg episode reward: [(0, '0.031')] [2022-07-11 09:20:31,749][26022] Updated weights on worker 0-0, policy_version 1130564 (0.00085) [2022-07-11 09:20:33,700][26022] Updated weights on worker 0-0, policy_version 1130574 (0.00087) [2022-07-11 09:20:35,403][26022] Updated weights on worker 0-0, policy_version 1130584 (0.00082) [2022-07-11 09:20:36,058][25689] Fps is (10 sec: 5612.3, 60 sec: 5558.7, 300 sec: 5553.3). Total num frames: 1157721088. Throughput: 0: 4932.8. Samples: 1157716520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:20:36,058][25689] Avg episode reward: [(0, '0.019')] [2022-07-11 09:20:37,410][26022] Updated weights on worker 0-0, policy_version 1130594 (0.00086) [2022-07-11 09:20:39,051][26022] Updated weights on worker 0-0, policy_version 1130604 (0.00085) [2022-07-11 09:20:41,029][26022] Updated weights on worker 0-0, policy_version 1130614 (0.00086) [2022-07-11 09:20:41,156][25689] Fps is (10 sec: 5567.0, 60 sec: 5523.9, 300 sec: 5552.4). Total num frames: 1157748736. Throughput: 0: 5779.9. Samples: 1157750198. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:20:41,156][25689] Avg episode reward: [(0, '0.351')] [2022-07-11 09:20:42,722][26022] Updated weights on worker 0-0, policy_version 1130624 (0.00091) [2022-07-11 09:20:44,564][26022] Updated weights on worker 0-0, policy_version 1130634 (0.00097) [2022-07-11 09:20:46,176][25689] Fps is (10 sec: 5667.3, 60 sec: 5590.0, 300 sec: 5555.6). Total num frames: 1157778432. Throughput: 0: 5807.2. Samples: 1157784012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:20:46,177][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 09:20:46,289][26022] Updated weights on worker 0-0, policy_version 1130644 (0.00086) [2022-07-11 09:20:48,519][26022] Updated weights on worker 0-0, policy_version 1130654 (0.00086) [2022-07-11 09:20:49,924][26022] Updated weights on worker 0-0, policy_version 1130664 (0.00079) [2022-07-11 09:20:51,205][25689] Fps is (10 sec: 5706.2, 60 sec: 5537.1, 300 sec: 5552.0). Total num frames: 1157806080. Throughput: 0: 4999.3. Samples: 1157800764. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:20:51,206][25689] Avg episode reward: [(0, '1.524')] [2022-07-11 09:20:52,024][26022] Updated weights on worker 0-0, policy_version 1130674 (0.00093) [2022-07-11 09:20:53,775][26022] Updated weights on worker 0-0, policy_version 1130684 (0.00082) [2022-07-11 09:20:55,699][26022] Updated weights on worker 0-0, policy_version 1130694 (0.00093) [2022-07-11 09:20:56,278][25689] Fps is (10 sec: 5372.4, 60 sec: 5515.9, 300 sec: 5552.0). Total num frames: 1157832704. Throughput: 0: 5816.5. Samples: 1157834132. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:20:56,279][25689] Avg episode reward: [(0, '1.233')] [2022-07-11 09:20:57,472][26022] Updated weights on worker 0-0, policy_version 1130704 (0.00082) [2022-07-11 09:20:59,585][26022] Updated weights on worker 0-0, policy_version 1130714 (0.00081) [2022-07-11 09:21:01,294][26022] Updated weights on worker 0-0, policy_version 1130724 (0.00086) [2022-07-11 09:21:01,321][25689] Fps is (10 sec: 5466.6, 60 sec: 5550.7, 300 sec: 5554.9). Total num frames: 1157861376. Throughput: 0: 5794.7. Samples: 1157867046. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:01,321][25689] Avg episode reward: [(0, '1.865')] [2022-07-11 09:21:03,480][26022] Updated weights on worker 0-0, policy_version 1130734 (0.00092) [2022-07-11 09:21:05,335][26022] Updated weights on worker 0-0, policy_version 1130744 (0.00086) [2022-07-11 09:21:06,332][25689] Fps is (10 sec: 5398.6, 60 sec: 5517.7, 300 sec: 5548.4). Total num frames: 1157886976. Throughput: 0: 4847.5. Samples: 1157881714. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:06,333][25689] Avg episode reward: [(0, '2.073')] [2022-07-11 09:21:07,126][26022] Updated weights on worker 0-0, policy_version 1130754 (0.00082) [2022-07-11 09:21:09,227][26022] Updated weights on worker 0-0, policy_version 1130764 (0.00087) [2022-07-11 09:21:10,731][26022] Updated weights on worker 0-0, policy_version 1130774 (0.00085) [2022-07-11 09:21:11,362][25689] Fps is (10 sec: 5302.8, 60 sec: 5515.9, 300 sec: 5548.2). Total num frames: 1157914624. Throughput: 0: 5666.1. Samples: 1157914974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:11,362][25689] Avg episode reward: [(0, '2.129')] [2022-07-11 09:21:12,770][26022] Updated weights on worker 0-0, policy_version 1130784 (0.01190) [2022-07-11 09:21:14,424][26022] Updated weights on worker 0-0, policy_version 1130794 (0.00089) [2022-07-11 09:21:16,342][26022] Updated weights on worker 0-0, policy_version 1130804 (0.00083) [2022-07-11 09:21:16,400][25689] Fps is (10 sec: 5593.7, 60 sec: 5531.5, 300 sec: 5548.3). Total num frames: 1157943296. Throughput: 0: 5673.4. Samples: 1157948290. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:16,401][25689] Avg episode reward: [(0, '2.004')] [2022-07-11 09:21:18,511][26022] Updated weights on worker 0-0, policy_version 1130814 (0.00092) [2022-07-11 09:21:19,995][26022] Updated weights on worker 0-0, policy_version 1130824 (0.00101) [2022-07-11 09:21:21,515][25689] Fps is (10 sec: 5547.2, 60 sec: 5516.0, 300 sec: 5546.5). Total num frames: 1157970944. Throughput: 0: 4843.3. Samples: 1157964852. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:21,515][25689] Avg episode reward: [(0, '2.137')] [2022-07-11 09:21:21,967][26022] Updated weights on worker 0-0, policy_version 1130834 (0.00087) [2022-07-11 09:21:23,678][26022] Updated weights on worker 0-0, policy_version 1130844 (0.00089) [2022-07-11 09:21:25,688][26022] Updated weights on worker 0-0, policy_version 1130854 (0.00088) [2022-07-11 09:21:26,595][25689] Fps is (10 sec: 5424.1, 60 sec: 5511.2, 300 sec: 5542.0). Total num frames: 1157998592. Throughput: 0: 5775.7. Samples: 1157998748. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:26,595][25689] Avg episode reward: [(0, '2.596')] [2022-07-11 09:21:27,440][26022] Updated weights on worker 0-0, policy_version 1130864 (0.00087) [2022-07-11 09:21:29,254][26022] Updated weights on worker 0-0, policy_version 1130874 (0.00086) [2022-07-11 09:21:31,136][26022] Updated weights on worker 0-0, policy_version 1130884 (0.00089) [2022-07-11 09:21:31,612][25689] Fps is (10 sec: 5578.0, 60 sec: 5528.0, 300 sec: 5545.5). Total num frames: 1158027264. Throughput: 0: 5759.2. Samples: 1158031596. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:31,612][25689] Avg episode reward: [(0, '0.969')] [2022-07-11 09:21:33,273][26022] Updated weights on worker 0-0, policy_version 1130894 (0.00089) [2022-07-11 09:21:34,631][26022] Updated weights on worker 0-0, policy_version 1130904 (0.00087) [2022-07-11 09:21:36,622][25689] Fps is (10 sec: 5514.3, 60 sec: 5495.0, 300 sec: 5542.5). Total num frames: 1158053888. Throughput: 0: 4947.3. Samples: 1158048336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:36,623][25689] Avg episode reward: [(0, '0.743')] [2022-07-11 09:21:36,805][26022] Updated weights on worker 0-0, policy_version 1130914 (0.00089) [2022-07-11 09:21:38,515][26022] Updated weights on worker 0-0, policy_version 1130924 (0.00088) [2022-07-11 09:21:40,414][26022] Updated weights on worker 0-0, policy_version 1130934 (0.00078) [2022-07-11 09:21:41,747][25689] Fps is (10 sec: 5557.1, 60 sec: 5526.4, 300 sec: 5547.7). Total num frames: 1158083584. Throughput: 0: 5776.4. Samples: 1158081718. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:41,747][25689] Avg episode reward: [(0, '-0.503')] [2022-07-11 09:21:42,364][26022] Updated weights on worker 0-0, policy_version 1130944 (0.00090) [2022-07-11 09:21:43,944][26022] Updated weights on worker 0-0, policy_version 1130954 (0.00093) [2022-07-11 09:21:45,914][26022] Updated weights on worker 0-0, policy_version 1130964 (0.00097) [2022-07-11 09:21:46,782][25689] Fps is (10 sec: 5845.9, 60 sec: 5525.0, 300 sec: 5550.5). Total num frames: 1158113280. Throughput: 0: 5779.2. Samples: 1158115414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:46,783][25689] Avg episode reward: [(0, '-0.202')] [2022-07-11 09:21:47,752][26022] Updated weights on worker 0-0, policy_version 1130974 (0.00085) [2022-07-11 09:21:49,511][26022] Updated weights on worker 0-0, policy_version 1130984 (0.00084) [2022-07-11 09:21:51,523][26022] Updated weights on worker 0-0, policy_version 1130994 (0.00097) [2022-07-11 09:21:51,829][25689] Fps is (10 sec: 5586.0, 60 sec: 5506.5, 300 sec: 5543.1). Total num frames: 1158139904. Throughput: 0: 5799.8. Samples: 1158148852. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:51,829][25689] Avg episode reward: [(0, '-0.433')] [2022-07-11 09:21:53,185][26022] Updated weights on worker 0-0, policy_version 1131004 (0.00085) [2022-07-11 09:21:53,409][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:21:53,424][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001131005_1158149120.pth [2022-07-11 09:21:53,425][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001129053_1156150272.pth [2022-07-11 09:21:55,009][26022] Updated weights on worker 0-0, policy_version 1131014 (0.00090) [2022-07-11 09:21:56,882][25689] Fps is (10 sec: 5373.7, 60 sec: 5525.3, 300 sec: 5543.6). Total num frames: 1158167552. Throughput: 0: 5802.3. Samples: 1158165886. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:21:56,882][25689] Avg episode reward: [(0, '-1.698')] [2022-07-11 09:21:57,020][26022] Updated weights on worker 0-0, policy_version 1131024 (0.00084) [2022-07-11 09:21:58,609][26022] Updated weights on worker 0-0, policy_version 1131034 (0.00081) [2022-07-11 09:22:00,507][26022] Updated weights on worker 0-0, policy_version 1131044 (0.00093) [2022-07-11 09:22:01,955][25689] Fps is (10 sec: 5562.0, 60 sec: 5522.4, 300 sec: 5553.0). Total num frames: 1158196224. Throughput: 0: 5808.7. Samples: 1158199100. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:01,956][25689] Avg episode reward: [(0, '-0.270')] [2022-07-11 09:22:02,846][26022] Updated weights on worker 0-0, policy_version 1131054 (0.00085) [2022-07-11 09:22:04,533][26022] Updated weights on worker 0-0, policy_version 1131064 (0.00088) [2022-07-11 09:22:06,486][26022] Updated weights on worker 0-0, policy_version 1131074 (0.00091) [2022-07-11 09:22:06,966][25689] Fps is (10 sec: 5280.4, 60 sec: 5505.6, 300 sec: 5539.4). Total num frames: 1158220800. Throughput: 0: 5691.4. Samples: 1158230288. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:06,966][25689] Avg episode reward: [(0, '-0.194')] [2022-07-11 09:22:08,345][26022] Updated weights on worker 0-0, policy_version 1131084 (0.00092) [2022-07-11 09:22:10,389][26022] Updated weights on worker 0-0, policy_version 1131094 (0.00092) [2022-07-11 09:22:11,974][25689] Fps is (10 sec: 5212.5, 60 sec: 5507.6, 300 sec: 5539.4). Total num frames: 1158248448. Throughput: 0: 4865.1. Samples: 1158246858. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:11,974][25689] Avg episode reward: [(0, '0.810')] [2022-07-11 09:22:12,115][26022] Updated weights on worker 0-0, policy_version 1131104 (0.00082) [2022-07-11 09:22:13,888][26022] Updated weights on worker 0-0, policy_version 1131114 (0.00086) [2022-07-11 09:22:15,664][26022] Updated weights on worker 0-0, policy_version 1131124 (0.00085) [2022-07-11 09:22:16,986][25689] Fps is (10 sec: 5722.9, 60 sec: 5526.9, 300 sec: 5546.9). Total num frames: 1158278144. Throughput: 0: 5689.9. Samples: 1158280276. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:16,986][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 09:22:17,677][26022] Updated weights on worker 0-0, policy_version 1131134 (0.00096) [2022-07-11 09:22:19,496][26022] Updated weights on worker 0-0, policy_version 1131144 (0.00086) [2022-07-11 09:22:21,231][26022] Updated weights on worker 0-0, policy_version 1131154 (0.00093) [2022-07-11 09:22:22,127][25689] Fps is (10 sec: 5748.8, 60 sec: 5541.3, 300 sec: 5544.9). Total num frames: 1158306816. Throughput: 0: 5686.0. Samples: 1158313798. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:22,127][25689] Avg episode reward: [(0, '-0.626')] [2022-07-11 09:22:23,050][26022] Updated weights on worker 0-0, policy_version 1131164 (0.00087) [2022-07-11 09:22:24,918][26022] Updated weights on worker 0-0, policy_version 1131174 (0.00092) [2022-07-11 09:22:26,953][26022] Updated weights on worker 0-0, policy_version 1131184 (0.00089) [2022-07-11 09:22:27,170][25689] Fps is (10 sec: 5429.5, 60 sec: 5527.8, 300 sec: 5541.1). Total num frames: 1158333440. Throughput: 0: 4956.0. Samples: 1158330422. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:27,171][25689] Avg episode reward: [(0, '0.655')] [2022-07-11 09:22:28,610][26022] Updated weights on worker 0-0, policy_version 1131194 (0.00084) [2022-07-11 09:22:30,684][26022] Updated weights on worker 0-0, policy_version 1131204 (0.00092) [2022-07-11 09:22:32,177][25689] Fps is (10 sec: 5400.1, 60 sec: 5511.8, 300 sec: 5537.9). Total num frames: 1158361088. Throughput: 0: 5773.7. Samples: 1158363506. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:32,177][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 09:22:32,313][26022] Updated weights on worker 0-0, policy_version 1131214 (0.00083) [2022-07-11 09:22:34,380][26022] Updated weights on worker 0-0, policy_version 1131224 (0.00092) [2022-07-11 09:22:35,976][26022] Updated weights on worker 0-0, policy_version 1131234 (0.00074) [2022-07-11 09:22:37,203][25689] Fps is (10 sec: 5613.7, 60 sec: 5544.2, 300 sec: 5541.6). Total num frames: 1158389760. Throughput: 0: 5767.7. Samples: 1158396882. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:37,203][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 09:22:38,072][26022] Updated weights on worker 0-0, policy_version 1131244 (0.00088) [2022-07-11 09:22:39,735][26022] Updated weights on worker 0-0, policy_version 1131254 (0.00082) [2022-07-11 09:22:41,524][26022] Updated weights on worker 0-0, policy_version 1131264 (0.00088) [2022-07-11 09:22:42,257][25689] Fps is (10 sec: 5688.6, 60 sec: 5533.7, 300 sec: 5542.4). Total num frames: 1158418432. Throughput: 0: 4965.7. Samples: 1158413760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:42,259][25689] Avg episode reward: [(0, '0.345')] [2022-07-11 09:22:43,458][26022] Updated weights on worker 0-0, policy_version 1131274 (0.00084) [2022-07-11 09:22:45,199][26022] Updated weights on worker 0-0, policy_version 1131284 (0.00085) [2022-07-11 09:22:47,179][26022] Updated weights on worker 0-0, policy_version 1131294 (0.00088) [2022-07-11 09:22:47,278][25689] Fps is (10 sec: 5488.3, 60 sec: 5484.3, 300 sec: 5538.8). Total num frames: 1158445056. Throughput: 0: 5800.2. Samples: 1158447052. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:47,279][25689] Avg episode reward: [(0, '0.034')] [2022-07-11 09:22:48,823][26022] Updated weights on worker 0-0, policy_version 1131304 (0.00106) [2022-07-11 09:22:50,748][26022] Updated weights on worker 0-0, policy_version 1131314 (0.00100) [2022-07-11 09:22:52,315][25689] Fps is (10 sec: 5498.1, 60 sec: 5519.0, 300 sec: 5539.6). Total num frames: 1158473728. Throughput: 0: 5814.0. Samples: 1158480586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:52,315][25689] Avg episode reward: [(0, '0.972')] [2022-07-11 09:22:52,597][26022] Updated weights on worker 0-0, policy_version 1131324 (0.00084) [2022-07-11 09:22:54,279][26022] Updated weights on worker 0-0, policy_version 1131334 (0.00090) [2022-07-11 09:22:56,298][26022] Updated weights on worker 0-0, policy_version 1131344 (0.00066) [2022-07-11 09:22:57,329][25689] Fps is (10 sec: 5705.2, 60 sec: 5539.5, 300 sec: 5543.9). Total num frames: 1158502400. Throughput: 0: 4994.2. Samples: 1158497398. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:22:57,330][25689] Avg episode reward: [(0, '-0.291')] [2022-07-11 09:22:58,231][26022] Updated weights on worker 0-0, policy_version 1131354 (0.00093) [2022-07-11 09:22:59,862][26022] Updated weights on worker 0-0, policy_version 1131364 (0.00083) [2022-07-11 09:23:02,292][26022] Updated weights on worker 0-0, policy_version 1131374 (0.00090) [2022-07-11 09:23:02,451][25689] Fps is (10 sec: 5354.3, 60 sec: 5484.3, 300 sec: 5542.3). Total num frames: 1158528000. Throughput: 0: 5781.0. Samples: 1158530498. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:02,451][25689] Avg episode reward: [(0, '-0.354')] [2022-07-11 09:23:03,850][26022] Updated weights on worker 0-0, policy_version 1131384 (0.00098) [2022-07-11 09:23:05,916][26022] Updated weights on worker 0-0, policy_version 1131394 (0.00086) [2022-07-11 09:23:07,503][25689] Fps is (10 sec: 5233.9, 60 sec: 5531.3, 300 sec: 5538.6). Total num frames: 1158555648. Throughput: 0: 5684.1. Samples: 1158562012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:07,503][25689] Avg episode reward: [(0, '-1.341')] [2022-07-11 09:23:07,972][26022] Updated weights on worker 0-0, policy_version 1131404 (0.00085) [2022-07-11 09:23:09,332][26022] Updated weights on worker 0-0, policy_version 1131414 (0.00084) [2022-07-11 09:23:11,568][26022] Updated weights on worker 0-0, policy_version 1131424 (0.00089) [2022-07-11 09:23:12,595][25689] Fps is (10 sec: 5653.0, 60 sec: 5557.4, 300 sec: 5540.5). Total num frames: 1158585344. Throughput: 0: 4836.2. Samples: 1158578664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:12,595][25689] Avg episode reward: [(0, '-0.932')] [2022-07-11 09:23:13,058][26022] Updated weights on worker 0-0, policy_version 1131434 (0.00089) [2022-07-11 09:23:15,024][26022] Updated weights on worker 0-0, policy_version 1131444 (0.00094) [2022-07-11 09:23:16,771][26022] Updated weights on worker 0-0, policy_version 1131454 (0.00094) [2022-07-11 09:23:17,623][25689] Fps is (10 sec: 5565.0, 60 sec: 5505.2, 300 sec: 5534.0). Total num frames: 1158611968. Throughput: 0: 5657.2. Samples: 1158612204. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:17,624][25689] Avg episode reward: [(0, '-1.366')] [2022-07-11 09:23:19,081][26022] Updated weights on worker 0-0, policy_version 1131464 (0.00086) [2022-07-11 09:23:20,542][26022] Updated weights on worker 0-0, policy_version 1131474 (0.00095) [2022-07-11 09:23:22,627][26022] Updated weights on worker 0-0, policy_version 1131484 (0.00087) [2022-07-11 09:23:22,726][25689] Fps is (10 sec: 5356.6, 60 sec: 5491.8, 300 sec: 5532.3). Total num frames: 1158639616. Throughput: 0: 5685.0. Samples: 1158645766. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:22,727][25689] Avg episode reward: [(0, '-0.529')] [2022-07-11 09:23:24,154][26022] Updated weights on worker 0-0, policy_version 1131494 (0.00091) [2022-07-11 09:23:26,130][26022] Updated weights on worker 0-0, policy_version 1131504 (0.00097) [2022-07-11 09:23:27,731][25689] Fps is (10 sec: 5673.3, 60 sec: 5546.0, 300 sec: 5539.3). Total num frames: 1158669312. Throughput: 0: 4976.2. Samples: 1158662670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:27,731][25689] Avg episode reward: [(0, '-0.344')] [2022-07-11 09:23:27,777][26022] Updated weights on worker 0-0, policy_version 1131514 (0.00091) [2022-07-11 09:23:29,709][26022] Updated weights on worker 0-0, policy_version 1131524 (0.00087) [2022-07-11 09:23:31,731][26022] Updated weights on worker 0-0, policy_version 1131534 (0.00078) [2022-07-11 09:23:32,823][25689] Fps is (10 sec: 5780.8, 60 sec: 5555.1, 300 sec: 5538.1). Total num frames: 1158697984. Throughput: 0: 5799.6. Samples: 1158695982. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:32,825][25689] Avg episode reward: [(0, '-1.326')] [2022-07-11 09:23:33,316][26022] Updated weights on worker 0-0, policy_version 1131544 (0.00090) [2022-07-11 09:23:35,315][26022] Updated weights on worker 0-0, policy_version 1131554 (0.00081) [2022-07-11 09:23:36,830][26022] Updated weights on worker 0-0, policy_version 1131564 (0.00082) [2022-07-11 09:23:37,923][25689] Fps is (10 sec: 5425.2, 60 sec: 5514.6, 300 sec: 5535.1). Total num frames: 1158724608. Throughput: 0: 5790.9. Samples: 1158729760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:37,924][25689] Avg episode reward: [(0, '0.218')] [2022-07-11 09:23:38,874][26022] Updated weights on worker 0-0, policy_version 1131574 (0.00110) [2022-07-11 09:23:41,041][26022] Updated weights on worker 0-0, policy_version 1131584 (0.00089) [2022-07-11 09:23:42,554][26022] Updated weights on worker 0-0, policy_version 1131594 (0.00085) [2022-07-11 09:23:43,017][25689] Fps is (10 sec: 5525.0, 60 sec: 5527.9, 300 sec: 5537.9). Total num frames: 1158754304. Throughput: 0: 5783.4. Samples: 1158763114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:43,017][25689] Avg episode reward: [(0, '0.131')] [2022-07-11 09:23:44,537][26022] Updated weights on worker 0-0, policy_version 1131604 (0.00077) [2022-07-11 09:23:46,270][26022] Updated weights on worker 0-0, policy_version 1131614 (0.00089) [2022-07-11 09:23:48,058][25689] Fps is (10 sec: 5658.3, 60 sec: 5542.9, 300 sec: 5534.0). Total num frames: 1158781952. Throughput: 0: 5765.9. Samples: 1158779872. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:48,059][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 09:23:48,126][26022] Updated weights on worker 0-0, policy_version 1131624 (0.00085) [2022-07-11 09:23:49,867][26022] Updated weights on worker 0-0, policy_version 1131634 (0.00088) [2022-07-11 09:23:51,755][26022] Updated weights on worker 0-0, policy_version 1131644 (0.00093) [2022-07-11 09:23:53,112][25689] Fps is (10 sec: 5477.7, 60 sec: 5524.5, 300 sec: 5533.6). Total num frames: 1158809600. Throughput: 0: 5779.4. Samples: 1158813236. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:53,112][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 09:23:53,583][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:23:53,597][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001131654_1158813696.pth [2022-07-11 09:23:53,598][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001129703_1156815872.pth [2022-07-11 09:23:53,601][26022] Updated weights on worker 0-0, policy_version 1131654 (0.00080) [2022-07-11 09:23:55,644][26022] Updated weights on worker 0-0, policy_version 1131664 (0.00092) [2022-07-11 09:23:57,314][26022] Updated weights on worker 0-0, policy_version 1131674 (0.00084) [2022-07-11 09:23:58,128][25689] Fps is (10 sec: 5592.8, 60 sec: 5524.3, 300 sec: 5539.5). Total num frames: 1158838272. Throughput: 0: 5793.3. Samples: 1158846812. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:23:58,128][25689] Avg episode reward: [(0, '0.531')] [2022-07-11 09:23:59,374][26022] Updated weights on worker 0-0, policy_version 1131684 (0.00093) [2022-07-11 09:24:00,789][26022] Updated weights on worker 0-0, policy_version 1131694 (0.00093) [2022-07-11 09:24:03,206][25689] Fps is (10 sec: 5376.3, 60 sec: 5528.3, 300 sec: 5529.3). Total num frames: 1158863872. Throughput: 0: 4962.4. Samples: 1158863302. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:03,207][25689] Avg episode reward: [(0, '1.011')] [2022-07-11 09:24:03,300][26022] Updated weights on worker 0-0, policy_version 1131704 (0.00098) [2022-07-11 09:24:05,147][26022] Updated weights on worker 0-0, policy_version 1131714 (0.00084) [2022-07-11 09:24:06,869][26022] Updated weights on worker 0-0, policy_version 1131724 (0.00084) [2022-07-11 09:24:08,218][25689] Fps is (10 sec: 5378.5, 60 sec: 5548.8, 300 sec: 5537.4). Total num frames: 1158892544. Throughput: 0: 5697.3. Samples: 1158894734. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:08,220][25689] Avg episode reward: [(0, '1.271')] [2022-07-11 09:24:08,854][26022] Updated weights on worker 0-0, policy_version 1131734 (0.00086) [2022-07-11 09:24:10,553][26022] Updated weights on worker 0-0, policy_version 1131744 (0.00415) [2022-07-11 09:24:12,375][26022] Updated weights on worker 0-0, policy_version 1131754 (0.00084) [2022-07-11 09:24:13,227][25689] Fps is (10 sec: 5722.7, 60 sec: 5539.5, 300 sec: 5537.7). Total num frames: 1158921216. Throughput: 0: 5722.3. Samples: 1158928340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:13,228][25689] Avg episode reward: [(0, '1.280')] [2022-07-11 09:24:14,262][26022] Updated weights on worker 0-0, policy_version 1131764 (0.00088) [2022-07-11 09:24:15,972][26022] Updated weights on worker 0-0, policy_version 1131774 (0.00082) [2022-07-11 09:24:18,115][26022] Updated weights on worker 0-0, policy_version 1131784 (0.00082) [2022-07-11 09:24:18,243][25689] Fps is (10 sec: 5413.9, 60 sec: 5523.8, 300 sec: 5531.7). Total num frames: 1158946816. Throughput: 0: 4893.4. Samples: 1158945242. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:18,243][25689] Avg episode reward: [(0, '0.507')] [2022-07-11 09:24:19,699][26022] Updated weights on worker 0-0, policy_version 1131794 (0.00100) [2022-07-11 09:24:21,763][26022] Updated weights on worker 0-0, policy_version 1131804 (0.00083) [2022-07-11 09:24:23,327][25689] Fps is (10 sec: 5474.8, 60 sec: 5559.3, 300 sec: 5530.5). Total num frames: 1158976512. Throughput: 0: 5731.7. Samples: 1158978626. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:23,327][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 09:24:23,471][26022] Updated weights on worker 0-0, policy_version 1131814 (0.00088) [2022-07-11 09:24:25,194][26022] Updated weights on worker 0-0, policy_version 1131824 (0.00083) [2022-07-11 09:24:27,207][26022] Updated weights on worker 0-0, policy_version 1131834 (0.00089) [2022-07-11 09:24:28,354][25689] Fps is (10 sec: 5772.9, 60 sec: 5540.4, 300 sec: 5537.4). Total num frames: 1159005184. Throughput: 0: 5831.2. Samples: 1159012146. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:28,354][25689] Avg episode reward: [(0, '-1.969')] [2022-07-11 09:24:28,894][26022] Updated weights on worker 0-0, policy_version 1131844 (0.00087) [2022-07-11 09:24:30,834][26022] Updated weights on worker 0-0, policy_version 1131854 (0.00081) [2022-07-11 09:24:32,719][26022] Updated weights on worker 0-0, policy_version 1131864 (0.00089) [2022-07-11 09:24:33,392][25689] Fps is (10 sec: 5493.9, 60 sec: 5511.5, 300 sec: 5530.0). Total num frames: 1159031808. Throughput: 0: 4980.9. Samples: 1159028780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:33,392][25689] Avg episode reward: [(0, '-1.634')] [2022-07-11 09:24:34,373][26022] Updated weights on worker 0-0, policy_version 1131874 (0.00091) [2022-07-11 09:24:36,304][26022] Updated weights on worker 0-0, policy_version 1131884 (0.00084) [2022-07-11 09:24:38,055][26022] Updated weights on worker 0-0, policy_version 1131894 (0.00082) [2022-07-11 09:24:38,413][25689] Fps is (10 sec: 5598.9, 60 sec: 5569.5, 300 sec: 5531.2). Total num frames: 1159061504. Throughput: 0: 5810.1. Samples: 1159062430. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:38,413][25689] Avg episode reward: [(0, '-1.642')] [2022-07-11 09:24:39,787][26022] Updated weights on worker 0-0, policy_version 1131904 (0.00083) [2022-07-11 09:24:41,713][26022] Updated weights on worker 0-0, policy_version 1131914 (0.00085) [2022-07-11 09:24:43,518][25689] Fps is (10 sec: 5764.3, 60 sec: 5551.5, 300 sec: 5539.6). Total num frames: 1159090176. Throughput: 0: 5818.7. Samples: 1159096110. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:43,518][25689] Avg episode reward: [(0, '-2.044')] [2022-07-11 09:24:43,523][26022] Updated weights on worker 0-0, policy_version 1131924 (0.00084) [2022-07-11 09:24:45,359][26022] Updated weights on worker 0-0, policy_version 1131934 (0.00945) [2022-07-11 09:24:47,539][26022] Updated weights on worker 0-0, policy_version 1131944 (0.00085) [2022-07-11 09:24:48,591][25689] Fps is (10 sec: 5533.6, 60 sec: 5548.6, 300 sec: 5528.1). Total num frames: 1159117824. Throughput: 0: 4983.6. Samples: 1159112998. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:48,591][25689] Avg episode reward: [(0, '-1.105')] [2022-07-11 09:24:48,906][26022] Updated weights on worker 0-0, policy_version 1131954 (0.00087) [2022-07-11 09:24:51,027][26022] Updated weights on worker 0-0, policy_version 1131964 (0.00094) [2022-07-11 09:24:52,475][26022] Updated weights on worker 0-0, policy_version 1131974 (0.00085) [2022-07-11 09:24:53,622][25689] Fps is (10 sec: 5472.6, 60 sec: 5550.7, 300 sec: 5528.0). Total num frames: 1159145472. Throughput: 0: 5837.6. Samples: 1159146874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:53,622][25689] Avg episode reward: [(0, '0.315')] [2022-07-11 09:24:54,520][26022] Updated weights on worker 0-0, policy_version 1131984 (0.00111) [2022-07-11 09:24:56,357][26022] Updated weights on worker 0-0, policy_version 1131994 (0.00085) [2022-07-11 09:24:58,224][26022] Updated weights on worker 0-0, policy_version 1132004 (0.00106) [2022-07-11 09:24:58,639][25689] Fps is (10 sec: 5604.7, 60 sec: 5550.6, 300 sec: 5535.5). Total num frames: 1159174144. Throughput: 0: 5839.5. Samples: 1159180542. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:24:58,641][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 09:24:59,990][26022] Updated weights on worker 0-0, policy_version 1132014 (0.00052) [2022-07-11 09:25:02,228][26022] Updated weights on worker 0-0, policy_version 1132024 (0.00088) [2022-07-11 09:25:03,775][25689] Fps is (10 sec: 5446.1, 60 sec: 5562.2, 300 sec: 5529.9). Total num frames: 1159200768. Throughput: 0: 4985.8. Samples: 1159197110. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:03,777][25689] Avg episode reward: [(0, '0.403')] [2022-07-11 09:25:03,969][26022] Updated weights on worker 0-0, policy_version 1132034 (0.00090) [2022-07-11 09:25:06,059][26022] Updated weights on worker 0-0, policy_version 1132044 (0.00096) [2022-07-11 09:25:07,709][26022] Updated weights on worker 0-0, policy_version 1132054 (0.00092) [2022-07-11 09:25:08,818][25689] Fps is (10 sec: 5331.9, 60 sec: 5542.5, 300 sec: 5529.4). Total num frames: 1159228416. Throughput: 0: 5714.2. Samples: 1159228582. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:08,819][25689] Avg episode reward: [(0, '0.331')] [2022-07-11 09:25:09,491][26022] Updated weights on worker 0-0, policy_version 1132064 (0.00089) [2022-07-11 09:25:11,562][26022] Updated weights on worker 0-0, policy_version 1132074 (0.00090) [2022-07-11 09:25:12,909][26022] Updated weights on worker 0-0, policy_version 1132084 (0.00091) [2022-07-11 09:25:13,842][25689] Fps is (10 sec: 5696.1, 60 sec: 5557.9, 300 sec: 5536.2). Total num frames: 1159258112. Throughput: 0: 5713.2. Samples: 1159262398. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:13,843][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 09:25:15,304][26022] Updated weights on worker 0-0, policy_version 1132094 (0.00087) [2022-07-11 09:25:16,645][26022] Updated weights on worker 0-0, policy_version 1132104 (0.00088) [2022-07-11 09:25:18,802][26022] Updated weights on worker 0-0, policy_version 1132114 (0.00093) [2022-07-11 09:25:18,848][25689] Fps is (10 sec: 5717.4, 60 sec: 5592.7, 300 sec: 5535.1). Total num frames: 1159285760. Throughput: 0: 4889.0. Samples: 1159279342. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:18,850][25689] Avg episode reward: [(0, '0.760')] [2022-07-11 09:25:20,566][26022] Updated weights on worker 0-0, policy_version 1132124 (0.00083) [2022-07-11 09:25:22,267][26022] Updated weights on worker 0-0, policy_version 1132134 (0.00088) [2022-07-11 09:25:23,906][25689] Fps is (10 sec: 5494.6, 60 sec: 5561.2, 300 sec: 5534.5). Total num frames: 1159313408. Throughput: 0: 5747.7. Samples: 1159312816. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:23,906][25689] Avg episode reward: [(0, '1.086')] [2022-07-11 09:25:24,227][26022] Updated weights on worker 0-0, policy_version 1132144 (0.00090) [2022-07-11 09:25:25,858][26022] Updated weights on worker 0-0, policy_version 1132154 (0.00093) [2022-07-11 09:25:27,854][26022] Updated weights on worker 0-0, policy_version 1132164 (0.00086) [2022-07-11 09:25:28,917][25689] Fps is (10 sec: 5593.2, 60 sec: 5562.7, 300 sec: 5538.1). Total num frames: 1159342080. Throughput: 0: 5862.5. Samples: 1159346412. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:28,925][25689] Avg episode reward: [(0, '1.015')] [2022-07-11 09:25:29,672][26022] Updated weights on worker 0-0, policy_version 1132174 (0.00083) [2022-07-11 09:25:31,644][26022] Updated weights on worker 0-0, policy_version 1132184 (0.00098) [2022-07-11 09:25:33,236][26022] Updated weights on worker 0-0, policy_version 1132194 (0.00086) [2022-07-11 09:25:33,927][25689] Fps is (10 sec: 5620.1, 60 sec: 5582.2, 300 sec: 5534.8). Total num frames: 1159369728. Throughput: 0: 5011.6. Samples: 1159363056. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:33,928][25689] Avg episode reward: [(0, '0.486')] [2022-07-11 09:25:35,149][26022] Updated weights on worker 0-0, policy_version 1132204 (0.00084) [2022-07-11 09:25:36,887][26022] Updated weights on worker 0-0, policy_version 1132214 (0.00095) [2022-07-11 09:25:38,928][26022] Updated weights on worker 0-0, policy_version 1132224 (0.00091) [2022-07-11 09:25:38,942][25689] Fps is (10 sec: 5516.1, 60 sec: 5549.0, 300 sec: 5536.3). Total num frames: 1159397376. Throughput: 0: 5844.6. Samples: 1159396782. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:38,942][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 09:25:40,611][26022] Updated weights on worker 0-0, policy_version 1132234 (0.00088) [2022-07-11 09:25:42,635][26022] Updated weights on worker 0-0, policy_version 1132244 (0.00084) [2022-07-11 09:25:44,090][25689] Fps is (10 sec: 5642.8, 60 sec: 5561.9, 300 sec: 5534.0). Total num frames: 1159427072. Throughput: 0: 5824.3. Samples: 1159430372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:44,090][25689] Avg episode reward: [(0, '-0.141')] [2022-07-11 09:25:44,213][26022] Updated weights on worker 0-0, policy_version 1132254 (0.00086) [2022-07-11 09:25:46,217][26022] Updated weights on worker 0-0, policy_version 1132264 (0.00092) [2022-07-11 09:25:47,929][26022] Updated weights on worker 0-0, policy_version 1132274 (0.00063) [2022-07-11 09:25:49,106][25689] Fps is (10 sec: 5540.8, 60 sec: 5550.2, 300 sec: 5530.8). Total num frames: 1159453696. Throughput: 0: 5809.2. Samples: 1159463694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:49,107][25689] Avg episode reward: [(0, '-0.641')] [2022-07-11 09:25:49,977][26022] Updated weights on worker 0-0, policy_version 1132284 (0.00095) [2022-07-11 09:25:51,695][26022] Updated weights on worker 0-0, policy_version 1132294 (0.00099) [2022-07-11 09:25:53,575][26022] Updated weights on worker 0-0, policy_version 1132304 (0.00081) [2022-07-11 09:25:53,710][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:25:53,720][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001132305_1159480320.pth [2022-07-11 09:25:53,721][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001130356_1157484544.pth [2022-07-11 09:25:54,144][25689] Fps is (10 sec: 5499.4, 60 sec: 5566.4, 300 sec: 5538.3). Total num frames: 1159482368. Throughput: 0: 5805.3. Samples: 1159480424. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:54,145][25689] Avg episode reward: [(0, '-0.478')] [2022-07-11 09:25:55,412][26022] Updated weights on worker 0-0, policy_version 1132314 (0.00085) [2022-07-11 09:25:57,277][26022] Updated weights on worker 0-0, policy_version 1132324 (0.00086) [2022-07-11 09:25:59,083][26022] Updated weights on worker 0-0, policy_version 1132334 (0.00089) [2022-07-11 09:25:59,155][25689] Fps is (10 sec: 5604.4, 60 sec: 5550.1, 300 sec: 5535.4). Total num frames: 1159510016. Throughput: 0: 5795.5. Samples: 1159513930. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:25:59,156][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 09:26:00,951][26022] Updated weights on worker 0-0, policy_version 1132344 (0.00086) [2022-07-11 09:26:03,116][26022] Updated weights on worker 0-0, policy_version 1132354 (0.00086) [2022-07-11 09:26:04,254][25689] Fps is (10 sec: 5165.7, 60 sec: 5519.6, 300 sec: 5530.3). Total num frames: 1159534592. Throughput: 0: 5680.9. Samples: 1159544926. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:04,255][25689] Avg episode reward: [(0, '0.311')] [2022-07-11 09:26:05,174][26022] Updated weights on worker 0-0, policy_version 1132364 (0.00090) [2022-07-11 09:26:06,914][26022] Updated weights on worker 0-0, policy_version 1132374 (0.00094) [2022-07-11 09:26:08,855][26022] Updated weights on worker 0-0, policy_version 1132384 (0.00101) [2022-07-11 09:26:09,275][25689] Fps is (10 sec: 5363.0, 60 sec: 5555.6, 300 sec: 5537.4). Total num frames: 1159564288. Throughput: 0: 4852.3. Samples: 1159561560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:09,275][25689] Avg episode reward: [(0, '-0.417')] [2022-07-11 09:26:10,690][26022] Updated weights on worker 0-0, policy_version 1132394 (0.00093) [2022-07-11 09:26:12,281][26022] Updated weights on worker 0-0, policy_version 1132404 (0.00083) [2022-07-11 09:26:14,252][26022] Updated weights on worker 0-0, policy_version 1132414 (0.00094) [2022-07-11 09:26:14,291][25689] Fps is (10 sec: 5713.6, 60 sec: 5522.5, 300 sec: 5534.4). Total num frames: 1159591936. Throughput: 0: 5696.8. Samples: 1159595192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:14,291][25689] Avg episode reward: [(0, '0.301')] [2022-07-11 09:26:16,174][26022] Updated weights on worker 0-0, policy_version 1132424 (0.00085) [2022-07-11 09:26:17,928][26022] Updated weights on worker 0-0, policy_version 1132434 (0.00095) [2022-07-11 09:26:19,329][25689] Fps is (10 sec: 5499.9, 60 sec: 5519.5, 300 sec: 5535.8). Total num frames: 1159619584. Throughput: 0: 5688.8. Samples: 1159628694. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:19,329][25689] Avg episode reward: [(0, '-0.175')] [2022-07-11 09:26:19,743][26022] Updated weights on worker 0-0, policy_version 1132444 (0.00088) [2022-07-11 09:26:21,686][26022] Updated weights on worker 0-0, policy_version 1132454 (0.00084) [2022-07-11 09:26:23,594][26022] Updated weights on worker 0-0, policy_version 1132464 (0.00092) [2022-07-11 09:26:24,440][25689] Fps is (10 sec: 5448.3, 60 sec: 5514.7, 300 sec: 5535.2). Total num frames: 1159647232. Throughput: 0: 4973.8. Samples: 1159645324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:24,440][25689] Avg episode reward: [(0, '-0.611')] [2022-07-11 09:26:25,209][26022] Updated weights on worker 0-0, policy_version 1132474 (0.00077) [2022-07-11 09:26:27,110][26022] Updated weights on worker 0-0, policy_version 1132484 (0.00096) [2022-07-11 09:26:28,961][26022] Updated weights on worker 0-0, policy_version 1132494 (0.00087) [2022-07-11 09:26:29,455][25689] Fps is (10 sec: 5562.0, 60 sec: 5514.3, 300 sec: 5535.3). Total num frames: 1159675904. Throughput: 0: 5815.2. Samples: 1159678910. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:29,455][25689] Avg episode reward: [(0, '0.343')] [2022-07-11 09:26:30,931][26022] Updated weights on worker 0-0, policy_version 1132504 (0.00098) [2022-07-11 09:26:32,671][26022] Updated weights on worker 0-0, policy_version 1132514 (0.00089) [2022-07-11 09:26:34,464][25689] Fps is (10 sec: 5618.4, 60 sec: 5514.4, 300 sec: 5538.7). Total num frames: 1159703552. Throughput: 0: 5809.7. Samples: 1159712394. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:34,466][25689] Avg episode reward: [(0, '0.093')] [2022-07-11 09:26:34,511][26022] Updated weights on worker 0-0, policy_version 1132524 (0.00086) [2022-07-11 09:26:36,315][26022] Updated weights on worker 0-0, policy_version 1132534 (0.00102) [2022-07-11 09:26:38,305][26022] Updated weights on worker 0-0, policy_version 1132544 (0.00083) [2022-07-11 09:26:39,498][25689] Fps is (10 sec: 5607.5, 60 sec: 5529.5, 300 sec: 5537.0). Total num frames: 1159732224. Throughput: 0: 4987.6. Samples: 1159729292. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:39,499][25689] Avg episode reward: [(0, '-1.154')] [2022-07-11 09:26:40,088][26022] Updated weights on worker 0-0, policy_version 1132554 (0.00086) [2022-07-11 09:26:41,767][26022] Updated weights on worker 0-0, policy_version 1132564 (0.00086) [2022-07-11 09:26:43,554][26022] Updated weights on worker 0-0, policy_version 1132574 (0.00085) [2022-07-11 09:26:44,632][25689] Fps is (10 sec: 5639.7, 60 sec: 5513.9, 300 sec: 5531.7). Total num frames: 1159760896. Throughput: 0: 5835.3. Samples: 1159763150. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:44,632][25689] Avg episode reward: [(0, '-1.058')] [2022-07-11 09:26:45,414][26022] Updated weights on worker 0-0, policy_version 1132584 (0.00088) [2022-07-11 09:26:47,134][26022] Updated weights on worker 0-0, policy_version 1132594 (0.00090) [2022-07-11 09:26:49,007][26022] Updated weights on worker 0-0, policy_version 1132604 (0.00081) [2022-07-11 09:26:49,663][25689] Fps is (10 sec: 5742.5, 60 sec: 5563.3, 300 sec: 5542.3). Total num frames: 1159790592. Throughput: 0: 5857.8. Samples: 1159797284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:49,663][25689] Avg episode reward: [(0, '-2.625')] [2022-07-11 09:26:50,759][26022] Updated weights on worker 0-0, policy_version 1132614 (0.00092) [2022-07-11 09:26:52,688][26022] Updated weights on worker 0-0, policy_version 1132624 (0.00088) [2022-07-11 09:26:54,534][26022] Updated weights on worker 0-0, policy_version 1132634 (0.00091) [2022-07-11 09:26:54,673][25689] Fps is (10 sec: 5711.1, 60 sec: 5549.0, 300 sec: 5543.1). Total num frames: 1159818240. Throughput: 0: 5040.3. Samples: 1159814252. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:54,673][25689] Avg episode reward: [(0, '-2.640')] [2022-07-11 09:26:56,149][26022] Updated weights on worker 0-0, policy_version 1132644 (0.00088) [2022-07-11 09:26:58,262][26022] Updated weights on worker 0-0, policy_version 1132654 (0.00084) [2022-07-11 09:26:59,697][25689] Fps is (10 sec: 5613.0, 60 sec: 5564.7, 300 sec: 5544.0). Total num frames: 1159846912. Throughput: 0: 5888.0. Samples: 1159848218. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:26:59,697][25689] Avg episode reward: [(0, '-3.337')] [2022-07-11 09:26:59,715][26022] Updated weights on worker 0-0, policy_version 1132664 (0.00085) [2022-07-11 09:27:02,271][26022] Updated weights on worker 0-0, policy_version 1132674 (0.00080) [2022-07-11 09:27:03,730][26022] Updated weights on worker 0-0, policy_version 1132684 (0.00094) [2022-07-11 09:27:04,741][25689] Fps is (10 sec: 5288.8, 60 sec: 5569.8, 300 sec: 5543.4). Total num frames: 1159871488. Throughput: 0: 5773.8. Samples: 1159879254. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:04,741][25689] Avg episode reward: [(0, '-3.067')] [2022-07-11 09:27:05,875][26022] Updated weights on worker 0-0, policy_version 1132694 (0.00096) [2022-07-11 09:27:07,597][26022] Updated weights on worker 0-0, policy_version 1132704 (0.00083) [2022-07-11 09:27:09,530][26022] Updated weights on worker 0-0, policy_version 1132714 (0.00089) [2022-07-11 09:27:09,768][25689] Fps is (10 sec: 5286.9, 60 sec: 5552.2, 300 sec: 5546.5). Total num frames: 1159900160. Throughput: 0: 4908.9. Samples: 1159895980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:09,769][25689] Avg episode reward: [(0, '-4.076')] [2022-07-11 09:27:11,442][26022] Updated weights on worker 0-0, policy_version 1132724 (0.00082) [2022-07-11 09:27:13,255][26022] Updated weights on worker 0-0, policy_version 1132734 (0.00086) [2022-07-11 09:27:14,788][25689] Fps is (10 sec: 5605.4, 60 sec: 5551.8, 300 sec: 5539.5). Total num frames: 1159927808. Throughput: 0: 5736.0. Samples: 1159929636. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:14,790][25689] Avg episode reward: [(0, '-4.265')] [2022-07-11 09:27:14,939][26022] Updated weights on worker 0-0, policy_version 1132744 (0.00084) [2022-07-11 09:27:17,007][26022] Updated weights on worker 0-0, policy_version 1132754 (0.00092) [2022-07-11 09:27:18,519][26022] Updated weights on worker 0-0, policy_version 1132764 (0.00087) [2022-07-11 09:27:19,796][25689] Fps is (10 sec: 5514.4, 60 sec: 5554.6, 300 sec: 5538.5). Total num frames: 1159955456. Throughput: 0: 5727.5. Samples: 1159963336. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:19,798][25689] Avg episode reward: [(0, '-1.850')] [2022-07-11 09:27:20,528][26022] Updated weights on worker 0-0, policy_version 1132774 (0.00080) [2022-07-11 09:27:22,216][26022] Updated weights on worker 0-0, policy_version 1132784 (0.00086) [2022-07-11 09:27:24,108][26022] Updated weights on worker 0-0, policy_version 1132794 (0.00085) [2022-07-11 09:27:24,839][25689] Fps is (10 sec: 5705.5, 60 sec: 5594.8, 300 sec: 5548.8). Total num frames: 1159985152. Throughput: 0: 5017.6. Samples: 1159980098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:24,841][25689] Avg episode reward: [(0, '-1.706')] [2022-07-11 09:27:25,911][26022] Updated weights on worker 0-0, policy_version 1132804 (0.00087) [2022-07-11 09:27:27,692][26022] Updated weights on worker 0-0, policy_version 1132814 (0.00087) [2022-07-11 09:27:29,480][26022] Updated weights on worker 0-0, policy_version 1132824 (0.00086) [2022-07-11 09:27:29,858][25689] Fps is (10 sec: 5698.7, 60 sec: 5577.4, 300 sec: 5548.6). Total num frames: 1160012800. Throughput: 0: 5881.5. Samples: 1160014140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:29,860][25689] Avg episode reward: [(0, '-1.906')] [2022-07-11 09:27:31,335][26022] Updated weights on worker 0-0, policy_version 1132834 (0.00088) [2022-07-11 09:27:33,303][26022] Updated weights on worker 0-0, policy_version 1132844 (0.00082) [2022-07-11 09:27:34,891][25689] Fps is (10 sec: 5501.1, 60 sec: 5575.3, 300 sec: 5545.0). Total num frames: 1160040448. Throughput: 0: 5867.3. Samples: 1160047582. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:34,893][25689] Avg episode reward: [(0, '-1.924')] [2022-07-11 09:27:34,986][26022] Updated weights on worker 0-0, policy_version 1132854 (0.00083) [2022-07-11 09:27:36,880][26022] Updated weights on worker 0-0, policy_version 1132864 (0.00084) [2022-07-11 09:27:38,814][26022] Updated weights on worker 0-0, policy_version 1132874 (0.00088) [2022-07-11 09:27:39,895][25689] Fps is (10 sec: 5509.5, 60 sec: 5561.1, 300 sec: 5542.5). Total num frames: 1160068096. Throughput: 0: 5029.9. Samples: 1160064432. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:39,895][25689] Avg episode reward: [(0, '-0.461')] [2022-07-11 09:27:40,648][26022] Updated weights on worker 0-0, policy_version 1132884 (0.00096) [2022-07-11 09:27:42,581][26022] Updated weights on worker 0-0, policy_version 1132894 (0.00079) [2022-07-11 09:27:44,038][26022] Updated weights on worker 0-0, policy_version 1132904 (0.00085) [2022-07-11 09:27:45,027][25689] Fps is (10 sec: 5556.4, 60 sec: 5561.3, 300 sec: 5547.3). Total num frames: 1160096768. Throughput: 0: 5836.0. Samples: 1160097914. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:45,029][25689] Avg episode reward: [(0, '0.188')] [2022-07-11 09:27:46,363][26022] Updated weights on worker 0-0, policy_version 1132914 (0.00088) [2022-07-11 09:27:47,860][26022] Updated weights on worker 0-0, policy_version 1132924 (0.00091) [2022-07-11 09:27:49,824][26022] Updated weights on worker 0-0, policy_version 1132934 (0.00086) [2022-07-11 09:27:50,093][25689] Fps is (10 sec: 5823.8, 60 sec: 5574.9, 300 sec: 5553.7). Total num frames: 1160127488. Throughput: 0: 5795.7. Samples: 1160131414. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:50,094][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 09:27:51,785][26022] Updated weights on worker 0-0, policy_version 1132944 (0.00094) [2022-07-11 09:27:53,294][26022] Updated weights on worker 0-0, policy_version 1132954 (0.00094) [2022-07-11 09:27:53,810][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:27:53,820][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001132956_1160146944.pth [2022-07-11 09:27:53,820][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001131005_1158149120.pth [2022-07-11 09:27:55,131][25689] Fps is (10 sec: 5574.1, 60 sec: 5538.5, 300 sec: 5542.9). Total num frames: 1160153088. Throughput: 0: 4966.6. Samples: 1160148106. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:27:55,131][25689] Avg episode reward: [(0, '0.640')] [2022-07-11 09:27:55,324][26022] Updated weights on worker 0-0, policy_version 1132964 (0.00092) [2022-07-11 09:27:57,244][26022] Updated weights on worker 0-0, policy_version 1132974 (0.00092) [2022-07-11 09:27:58,883][26022] Updated weights on worker 0-0, policy_version 1132984 (0.00088) [2022-07-11 09:28:00,141][25689] Fps is (10 sec: 5299.2, 60 sec: 5522.8, 300 sec: 5551.9). Total num frames: 1160180736. Throughput: 0: 5782.1. Samples: 1160181500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:00,148][25689] Avg episode reward: [(0, '0.137')] [2022-07-11 09:28:00,936][26022] Updated weights on worker 0-0, policy_version 1132994 (0.01363) [2022-07-11 09:28:03,127][26022] Updated weights on worker 0-0, policy_version 1133004 (0.00080) [2022-07-11 09:28:04,823][26022] Updated weights on worker 0-0, policy_version 1133014 (0.00087) [2022-07-11 09:28:05,224][25689] Fps is (10 sec: 5478.4, 60 sec: 5570.1, 300 sec: 5551.3). Total num frames: 1160208384. Throughput: 0: 5689.1. Samples: 1160212818. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:05,225][25689] Avg episode reward: [(0, '0.195')] [2022-07-11 09:28:06,911][26022] Updated weights on worker 0-0, policy_version 1133024 (0.00079) [2022-07-11 09:28:08,468][26022] Updated weights on worker 0-0, policy_version 1133034 (0.00093) [2022-07-11 09:28:10,240][25689] Fps is (10 sec: 5374.3, 60 sec: 5537.3, 300 sec: 5542.4). Total num frames: 1160235008. Throughput: 0: 4872.2. Samples: 1160229572. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:10,241][25689] Avg episode reward: [(0, '0.320')] [2022-07-11 09:28:10,559][26022] Updated weights on worker 0-0, policy_version 1133044 (0.00085) [2022-07-11 09:28:12,360][26022] Updated weights on worker 0-0, policy_version 1133054 (0.00084) [2022-07-11 09:28:14,019][26022] Updated weights on worker 0-0, policy_version 1133064 (0.00092) [2022-07-11 09:28:15,251][25689] Fps is (10 sec: 5412.4, 60 sec: 5538.0, 300 sec: 5546.1). Total num frames: 1160262656. Throughput: 0: 5718.1. Samples: 1160263158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:15,252][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 09:28:15,982][26022] Updated weights on worker 0-0, policy_version 1133074 (0.00083) [2022-07-11 09:28:17,730][26022] Updated weights on worker 0-0, policy_version 1133084 (0.00070) [2022-07-11 09:28:19,531][26022] Updated weights on worker 0-0, policy_version 1133094 (0.00095) [2022-07-11 09:28:20,267][25689] Fps is (10 sec: 5821.2, 60 sec: 5588.1, 300 sec: 5558.1). Total num frames: 1160293376. Throughput: 0: 5728.5. Samples: 1160296788. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:20,267][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 09:28:21,515][26022] Updated weights on worker 0-0, policy_version 1133104 (0.00082) [2022-07-11 09:28:23,195][26022] Updated weights on worker 0-0, policy_version 1133114 (0.00092) [2022-07-11 09:28:25,190][26022] Updated weights on worker 0-0, policy_version 1133124 (0.00093) [2022-07-11 09:28:25,344][25689] Fps is (10 sec: 5580.5, 60 sec: 5517.3, 300 sec: 5543.0). Total num frames: 1160318976. Throughput: 0: 5008.5. Samples: 1160313588. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:25,344][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 09:28:26,964][26022] Updated weights on worker 0-0, policy_version 1133134 (0.00085) [2022-07-11 09:28:28,646][26022] Updated weights on worker 0-0, policy_version 1133144 (0.00107) [2022-07-11 09:28:30,355][25689] Fps is (10 sec: 5379.7, 60 sec: 5535.0, 300 sec: 5544.5). Total num frames: 1160347648. Throughput: 0: 5841.1. Samples: 1160347066. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:30,355][25689] Avg episode reward: [(0, '1.241')] [2022-07-11 09:28:30,827][26022] Updated weights on worker 0-0, policy_version 1133154 (0.00087) [2022-07-11 09:28:32,251][26022] Updated weights on worker 0-0, policy_version 1133164 (0.00091) [2022-07-11 09:28:34,402][26022] Updated weights on worker 0-0, policy_version 1133174 (0.00095) [2022-07-11 09:28:35,372][25689] Fps is (10 sec: 5820.2, 60 sec: 5570.2, 300 sec: 5556.4). Total num frames: 1160377344. Throughput: 0: 5830.8. Samples: 1160380478. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 09:28:35,373][25689] Avg episode reward: [(0, '1.782')] [2022-07-11 09:28:36,383][26022] Updated weights on worker 0-0, policy_version 1133184 (0.00090) [2022-07-11 09:28:37,914][26022] Updated weights on worker 0-0, policy_version 1133194 (0.00082) [2022-07-11 09:28:40,043][26022] Updated weights on worker 0-0, policy_version 1133204 (0.00099) [2022-07-11 09:28:40,387][25689] Fps is (10 sec: 5511.8, 60 sec: 5535.4, 300 sec: 5544.1). Total num frames: 1160402944. Throughput: 0: 4995.1. Samples: 1160397292. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:28:40,389][25689] Avg episode reward: [(0, '1.583')] [2022-07-11 09:28:41,456][26022] Updated weights on worker 0-0, policy_version 1133214 (0.00086) [2022-07-11 09:28:43,530][26022] Updated weights on worker 0-0, policy_version 1133224 (0.00087) [2022-07-11 09:28:45,131][26022] Updated weights on worker 0-0, policy_version 1133234 (0.00085) [2022-07-11 09:28:45,494][25689] Fps is (10 sec: 5362.0, 60 sec: 5537.7, 300 sec: 5546.3). Total num frames: 1160431616. Throughput: 0: 5805.9. Samples: 1160430578. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:28:45,494][25689] Avg episode reward: [(0, '1.521')] [2022-07-11 09:28:47,231][26022] Updated weights on worker 0-0, policy_version 1133244 (0.00087) [2022-07-11 09:28:49,195][26022] Updated weights on worker 0-0, policy_version 1133254 (0.00091) [2022-07-11 09:28:50,511][25689] Fps is (10 sec: 5765.6, 60 sec: 5525.3, 300 sec: 5553.9). Total num frames: 1160461312. Throughput: 0: 5797.7. Samples: 1160463922. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:28:50,512][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 09:28:50,739][26022] Updated weights on worker 0-0, policy_version 1133264 (0.00093) [2022-07-11 09:28:52,891][26022] Updated weights on worker 0-0, policy_version 1133274 (0.00084) [2022-07-11 09:28:54,422][26022] Updated weights on worker 0-0, policy_version 1133284 (0.00082) [2022-07-11 09:28:55,521][25689] Fps is (10 sec: 5514.5, 60 sec: 5527.7, 300 sec: 5543.6). Total num frames: 1160486912. Throughput: 0: 5799.0. Samples: 1160497322. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:28:55,523][25689] Avg episode reward: [(0, '0.820')] [2022-07-11 09:28:56,470][26022] Updated weights on worker 0-0, policy_version 1133294 (0.00120) [2022-07-11 09:28:58,326][26022] Updated weights on worker 0-0, policy_version 1133304 (0.00087) [2022-07-11 09:28:59,926][26022] Updated weights on worker 0-0, policy_version 1133314 (0.00094) [2022-07-11 09:29:00,524][25689] Fps is (10 sec: 5420.3, 60 sec: 5545.5, 300 sec: 5555.4). Total num frames: 1160515584. Throughput: 0: 5813.6. Samples: 1160514356. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:00,525][25689] Avg episode reward: [(0, '0.766')] [2022-07-11 09:29:02,289][26022] Updated weights on worker 0-0, policy_version 1133324 (0.00086) [2022-07-11 09:29:04,205][26022] Updated weights on worker 0-0, policy_version 1133334 (0.00597) [2022-07-11 09:29:05,589][25689] Fps is (10 sec: 5492.6, 60 sec: 5530.1, 300 sec: 5547.5). Total num frames: 1160542208. Throughput: 0: 5726.2. Samples: 1160545644. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:05,589][25689] Avg episode reward: [(0, '0.798')] [2022-07-11 09:29:05,932][26022] Updated weights on worker 0-0, policy_version 1133344 (0.00079) [2022-07-11 09:29:07,814][26022] Updated weights on worker 0-0, policy_version 1133354 (0.00088) [2022-07-11 09:29:09,763][26022] Updated weights on worker 0-0, policy_version 1133364 (0.00091) [2022-07-11 09:29:10,664][25689] Fps is (10 sec: 5452.9, 60 sec: 5558.6, 300 sec: 5546.3). Total num frames: 1160570880. Throughput: 0: 5726.5. Samples: 1160579330. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:10,665][25689] Avg episode reward: [(0, '-0.388')] [2022-07-11 09:29:11,422][26022] Updated weights on worker 0-0, policy_version 1133374 (0.00085) [2022-07-11 09:29:13,214][26022] Updated weights on worker 0-0, policy_version 1133384 (0.00085) [2022-07-11 09:29:15,019][26022] Updated weights on worker 0-0, policy_version 1133394 (0.00094) [2022-07-11 09:29:15,761][25689] Fps is (10 sec: 5536.6, 60 sec: 5550.7, 300 sec: 5551.6). Total num frames: 1160598528. Throughput: 0: 4885.7. Samples: 1160596208. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:15,761][25689] Avg episode reward: [(0, '-1.100')] [2022-07-11 09:29:16,721][26022] Updated weights on worker 0-0, policy_version 1133404 (0.00091) [2022-07-11 09:29:18,799][26022] Updated weights on worker 0-0, policy_version 1133414 (0.00082) [2022-07-11 09:29:20,506][26022] Updated weights on worker 0-0, policy_version 1133424 (0.00089) [2022-07-11 09:29:20,776][25689] Fps is (10 sec: 5569.9, 60 sec: 5516.9, 300 sec: 5549.5). Total num frames: 1160627200. Throughput: 0: 5692.4. Samples: 1160629640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:20,776][25689] Avg episode reward: [(0, '-0.126')] [2022-07-11 09:29:22,415][26022] Updated weights on worker 0-0, policy_version 1133434 (0.00092) [2022-07-11 09:29:24,205][26022] Updated weights on worker 0-0, policy_version 1133444 (0.00082) [2022-07-11 09:29:25,843][25689] Fps is (10 sec: 5586.1, 60 sec: 5551.7, 300 sec: 5545.3). Total num frames: 1160654848. Throughput: 0: 5825.2. Samples: 1160663630. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:25,845][25689] Avg episode reward: [(0, '-0.269')] [2022-07-11 09:29:25,992][26022] Updated weights on worker 0-0, policy_version 1133454 (0.00090) [2022-07-11 09:29:28,046][26022] Updated weights on worker 0-0, policy_version 1133464 (0.00086) [2022-07-11 09:29:29,818][26022] Updated weights on worker 0-0, policy_version 1133474 (0.00078) [2022-07-11 09:29:30,851][25689] Fps is (10 sec: 5488.3, 60 sec: 5535.0, 300 sec: 5549.3). Total num frames: 1160682496. Throughput: 0: 5005.8. Samples: 1160680382. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:30,852][25689] Avg episode reward: [(0, '-0.111')] [2022-07-11 09:29:31,573][26022] Updated weights on worker 0-0, policy_version 1133484 (0.00084) [2022-07-11 09:29:33,520][26022] Updated weights on worker 0-0, policy_version 1133494 (0.00087) [2022-07-11 09:29:35,066][26022] Updated weights on worker 0-0, policy_version 1133504 (0.00079) [2022-07-11 09:29:35,867][25689] Fps is (10 sec: 5720.9, 60 sec: 5535.2, 300 sec: 5549.4). Total num frames: 1160712192. Throughput: 0: 5853.2. Samples: 1160713890. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:35,867][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 09:29:37,271][26022] Updated weights on worker 0-0, policy_version 1133514 (0.00088) [2022-07-11 09:29:38,835][26022] Updated weights on worker 0-0, policy_version 1133524 (0.00094) [2022-07-11 09:29:40,825][26022] Updated weights on worker 0-0, policy_version 1133534 (0.00084) [2022-07-11 09:29:40,887][25689] Fps is (10 sec: 5612.0, 60 sec: 5551.7, 300 sec: 5544.1). Total num frames: 1160738816. Throughput: 0: 5841.7. Samples: 1160747122. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:40,888][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 09:29:42,675][26022] Updated weights on worker 0-0, policy_version 1133544 (0.00087) [2022-07-11 09:29:44,544][26022] Updated weights on worker 0-0, policy_version 1133554 (0.00097) [2022-07-11 09:29:45,961][25689] Fps is (10 sec: 5478.0, 60 sec: 5554.6, 300 sec: 5547.5). Total num frames: 1160767488. Throughput: 0: 4977.0. Samples: 1160763756. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:45,963][25689] Avg episode reward: [(0, '1.186')] [2022-07-11 09:29:46,379][26022] Updated weights on worker 0-0, policy_version 1133564 (0.00083) [2022-07-11 09:29:48,213][26022] Updated weights on worker 0-0, policy_version 1133574 (0.00089) [2022-07-11 09:29:50,051][26022] Updated weights on worker 0-0, policy_version 1133584 (0.00097) [2022-07-11 09:29:51,063][25689] Fps is (10 sec: 5534.6, 60 sec: 5513.0, 300 sec: 5546.2). Total num frames: 1160795136. Throughput: 0: 5787.0. Samples: 1160797346. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:51,063][25689] Avg episode reward: [(0, '1.078')] [2022-07-11 09:29:52,033][26022] Updated weights on worker 0-0, policy_version 1133594 (0.00116) [2022-07-11 09:29:53,665][26022] Updated weights on worker 0-0, policy_version 1133604 (0.00086) [2022-07-11 09:29:53,996][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:29:54,010][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001133606_1160812544.pth [2022-07-11 09:29:54,011][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001131654_1158813696.pth [2022-07-11 09:29:55,374][26022] Updated weights on worker 0-0, policy_version 1133614 (0.00086) [2022-07-11 09:29:56,078][25689] Fps is (10 sec: 5465.3, 60 sec: 5546.4, 300 sec: 5542.8). Total num frames: 1160822784. Throughput: 0: 5795.9. Samples: 1160831036. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:29:56,079][25689] Avg episode reward: [(0, '1.634')] [2022-07-11 09:29:57,179][26022] Updated weights on worker 0-0, policy_version 1133624 (0.00091) [2022-07-11 09:29:59,126][26022] Updated weights on worker 0-0, policy_version 1133634 (0.00076) [2022-07-11 09:30:00,912][26022] Updated weights on worker 0-0, policy_version 1133644 (0.00094) [2022-07-11 09:30:01,097][25689] Fps is (10 sec: 5612.7, 60 sec: 5544.9, 300 sec: 5551.9). Total num frames: 1160851456. Throughput: 0: 4977.1. Samples: 1160847710. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:01,098][25689] Avg episode reward: [(0, '1.414')] [2022-07-11 09:30:03,417][26022] Updated weights on worker 0-0, policy_version 1133654 (0.00090) [2022-07-11 09:30:04,844][26022] Updated weights on worker 0-0, policy_version 1133664 (0.00093) [2022-07-11 09:30:06,182][25689] Fps is (10 sec: 5371.5, 60 sec: 5526.1, 300 sec: 5544.2). Total num frames: 1160877056. Throughput: 0: 5698.4. Samples: 1160878984. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:06,183][25689] Avg episode reward: [(0, '1.751')] [2022-07-11 09:30:07,018][26022] Updated weights on worker 0-0, policy_version 1133674 (0.00761) [2022-07-11 09:30:08,692][26022] Updated weights on worker 0-0, policy_version 1133684 (0.00089) [2022-07-11 09:30:10,589][26022] Updated weights on worker 0-0, policy_version 1133694 (0.00087) [2022-07-11 09:30:11,192][25689] Fps is (10 sec: 5477.3, 60 sec: 5549.0, 300 sec: 5544.4). Total num frames: 1160906752. Throughput: 0: 5725.8. Samples: 1160912606. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:11,193][25689] Avg episode reward: [(0, '0.809')] [2022-07-11 09:30:12,446][26022] Updated weights on worker 0-0, policy_version 1133704 (0.00085) [2022-07-11 09:30:14,286][26022] Updated weights on worker 0-0, policy_version 1133714 (0.00079) [2022-07-11 09:30:16,161][26022] Updated weights on worker 0-0, policy_version 1133724 (0.00089) [2022-07-11 09:30:16,257][25689] Fps is (10 sec: 5590.1, 60 sec: 5535.0, 300 sec: 5539.9). Total num frames: 1160933376. Throughput: 0: 4874.7. Samples: 1160929400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:16,258][25689] Avg episode reward: [(0, '-0.365')] [2022-07-11 09:30:17,883][26022] Updated weights on worker 0-0, policy_version 1133734 (0.00085) [2022-07-11 09:30:19,596][26022] Updated weights on worker 0-0, policy_version 1133744 (0.00088) [2022-07-11 09:30:21,316][25689] Fps is (10 sec: 5462.1, 60 sec: 5531.0, 300 sec: 5543.3). Total num frames: 1160962048. Throughput: 0: 5707.4. Samples: 1160963108. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:21,317][25689] Avg episode reward: [(0, '-0.533')] [2022-07-11 09:30:21,547][26022] Updated weights on worker 0-0, policy_version 1133754 (0.00090) [2022-07-11 09:30:23,268][26022] Updated weights on worker 0-0, policy_version 1133764 (0.00090) [2022-07-11 09:30:25,101][26022] Updated weights on worker 0-0, policy_version 1133774 (0.00084) [2022-07-11 09:30:26,372][25689] Fps is (10 sec: 5770.5, 60 sec: 5565.9, 300 sec: 5545.9). Total num frames: 1160991744. Throughput: 0: 5850.6. Samples: 1160997106. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:26,372][25689] Avg episode reward: [(0, '-0.585')] [2022-07-11 09:30:27,175][26022] Updated weights on worker 0-0, policy_version 1133784 (0.00084) [2022-07-11 09:30:28,732][26022] Updated weights on worker 0-0, policy_version 1133794 (0.00092) [2022-07-11 09:30:30,763][26022] Updated weights on worker 0-0, policy_version 1133804 (0.00085) [2022-07-11 09:30:31,459][25689] Fps is (10 sec: 5552.7, 60 sec: 5541.7, 300 sec: 5541.1). Total num frames: 1161018368. Throughput: 0: 4996.2. Samples: 1161013860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:31,459][25689] Avg episode reward: [(0, '-0.633')] [2022-07-11 09:30:32,279][26022] Updated weights on worker 0-0, policy_version 1133814 (0.00080) [2022-07-11 09:30:34,232][26022] Updated weights on worker 0-0, policy_version 1133824 (0.00088) [2022-07-11 09:30:36,118][26022] Updated weights on worker 0-0, policy_version 1133834 (0.00093) [2022-07-11 09:30:36,518][25689] Fps is (10 sec: 5551.2, 60 sec: 5537.8, 300 sec: 5547.1). Total num frames: 1161048064. Throughput: 0: 5838.9. Samples: 1161047698. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:36,518][25689] Avg episode reward: [(0, '-0.269')] [2022-07-11 09:30:38,101][26022] Updated weights on worker 0-0, policy_version 1133844 (0.00117) [2022-07-11 09:30:39,806][26022] Updated weights on worker 0-0, policy_version 1133854 (0.00090) [2022-07-11 09:30:41,574][25689] Fps is (10 sec: 5668.9, 60 sec: 5551.3, 300 sec: 5541.9). Total num frames: 1161075712. Throughput: 0: 5808.8. Samples: 1161080784. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:41,575][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 09:30:41,700][26022] Updated weights on worker 0-0, policy_version 1133864 (0.00086) [2022-07-11 09:30:43,418][26022] Updated weights on worker 0-0, policy_version 1133874 (0.00086) [2022-07-11 09:30:45,365][26022] Updated weights on worker 0-0, policy_version 1133884 (0.00060) [2022-07-11 09:30:46,612][25689] Fps is (10 sec: 5477.8, 60 sec: 5537.8, 300 sec: 5545.0). Total num frames: 1161103360. Throughput: 0: 4967.3. Samples: 1161097644. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:46,613][25689] Avg episode reward: [(0, '1.088')] [2022-07-11 09:30:47,231][26022] Updated weights on worker 0-0, policy_version 1133894 (0.00080) [2022-07-11 09:30:48,920][26022] Updated weights on worker 0-0, policy_version 1133904 (0.00081) [2022-07-11 09:30:51,020][26022] Updated weights on worker 0-0, policy_version 1133914 (0.00090) [2022-07-11 09:30:51,709][25689] Fps is (10 sec: 5557.1, 60 sec: 5555.1, 300 sec: 5543.9). Total num frames: 1161132032. Throughput: 0: 5791.8. Samples: 1161131146. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:51,710][25689] Avg episode reward: [(0, '0.761')] [2022-07-11 09:30:52,541][26022] Updated weights on worker 0-0, policy_version 1133924 (0.00090) [2022-07-11 09:30:54,607][26022] Updated weights on worker 0-0, policy_version 1133934 (0.00086) [2022-07-11 09:30:56,215][26022] Updated weights on worker 0-0, policy_version 1133944 (0.00080) [2022-07-11 09:30:56,807][25689] Fps is (10 sec: 5624.7, 60 sec: 5564.4, 300 sec: 5545.7). Total num frames: 1161160704. Throughput: 0: 5778.3. Samples: 1161164938. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:30:56,808][25689] Avg episode reward: [(0, '-0.102')] [2022-07-11 09:30:58,048][26022] Updated weights on worker 0-0, policy_version 1133954 (0.00085) [2022-07-11 09:31:00,016][26022] Updated weights on worker 0-0, policy_version 1133964 (0.00082) [2022-07-11 09:31:01,851][25689] Fps is (10 sec: 5553.4, 60 sec: 5545.3, 300 sec: 5557.1). Total num frames: 1161188352. Throughput: 0: 5795.6. Samples: 1161198296. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:01,851][25689] Avg episode reward: [(0, '-0.972')] [2022-07-11 09:31:02,232][26022] Updated weights on worker 0-0, policy_version 1133974 (0.00096) [2022-07-11 09:31:03,966][26022] Updated weights on worker 0-0, policy_version 1133984 (0.00084) [2022-07-11 09:31:05,980][26022] Updated weights on worker 0-0, policy_version 1133994 (0.00085) [2022-07-11 09:31:07,019][25689] Fps is (10 sec: 5515.2, 60 sec: 5588.2, 300 sec: 5550.9). Total num frames: 1161217024. Throughput: 0: 5674.1. Samples: 1161213434. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:07,019][25689] Avg episode reward: [(0, '-0.674')] [2022-07-11 09:31:07,530][26022] Updated weights on worker 0-0, policy_version 1134004 (0.00090) [2022-07-11 09:31:09,714][26022] Updated weights on worker 0-0, policy_version 1134014 (0.00086) [2022-07-11 09:31:11,316][26022] Updated weights on worker 0-0, policy_version 1134024 (0.00084) [2022-07-11 09:31:12,084][25689] Fps is (10 sec: 5403.6, 60 sec: 5532.7, 300 sec: 5546.5). Total num frames: 1161243648. Throughput: 0: 5672.4. Samples: 1161246720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:12,084][25689] Avg episode reward: [(0, '-0.655')] [2022-07-11 09:31:13,131][26022] Updated weights on worker 0-0, policy_version 1134034 (0.00088) [2022-07-11 09:31:14,883][26022] Updated weights on worker 0-0, policy_version 1134044 (0.00095) [2022-07-11 09:31:17,094][26022] Updated weights on worker 0-0, policy_version 1134054 (0.00620) [2022-07-11 09:31:17,153][25689] Fps is (10 sec: 5355.4, 60 sec: 5549.1, 300 sec: 5545.9). Total num frames: 1161271296. Throughput: 0: 5675.6. Samples: 1161280414. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:17,159][25689] Avg episode reward: [(0, '-0.889')] [2022-07-11 09:31:18,727][26022] Updated weights on worker 0-0, policy_version 1134064 (0.00088) [2022-07-11 09:31:20,577][26022] Updated weights on worker 0-0, policy_version 1134074 (0.00088) [2022-07-11 09:31:22,259][25689] Fps is (10 sec: 5635.9, 60 sec: 5561.7, 300 sec: 5552.9). Total num frames: 1161300992. Throughput: 0: 4841.5. Samples: 1161297100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:22,259][25689] Avg episode reward: [(0, '-1.874')] [2022-07-11 09:31:22,360][26022] Updated weights on worker 0-0, policy_version 1134084 (0.00086) [2022-07-11 09:31:24,058][26022] Updated weights on worker 0-0, policy_version 1134094 (0.00085) [2022-07-11 09:31:26,233][26022] Updated weights on worker 0-0, policy_version 1134104 (0.00086) [2022-07-11 09:31:27,385][25689] Fps is (10 sec: 5704.6, 60 sec: 5538.5, 300 sec: 5550.8). Total num frames: 1161329664. Throughput: 0: 5759.1. Samples: 1161330722. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:27,386][25689] Avg episode reward: [(0, '0.088')] [2022-07-11 09:31:27,697][26022] Updated weights on worker 0-0, policy_version 1134114 (0.00088) [2022-07-11 09:31:29,941][26022] Updated weights on worker 0-0, policy_version 1134124 (0.00088) [2022-07-11 09:31:31,348][26022] Updated weights on worker 0-0, policy_version 1134134 (0.00090) [2022-07-11 09:31:32,463][25689] Fps is (10 sec: 5519.2, 60 sec: 5556.1, 300 sec: 5549.6). Total num frames: 1161357312. Throughput: 0: 5761.0. Samples: 1161364124. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:32,465][25689] Avg episode reward: [(0, '0.329')] [2022-07-11 09:31:33,530][26022] Updated weights on worker 0-0, policy_version 1134144 (0.00089) [2022-07-11 09:31:35,275][26022] Updated weights on worker 0-0, policy_version 1134154 (0.00084) [2022-07-11 09:31:36,938][26022] Updated weights on worker 0-0, policy_version 1134164 (0.00090) [2022-07-11 09:31:37,531][25689] Fps is (10 sec: 5651.8, 60 sec: 5555.2, 300 sec: 5552.4). Total num frames: 1161387008. Throughput: 0: 4930.7. Samples: 1161380890. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:37,533][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 09:31:38,863][26022] Updated weights on worker 0-0, policy_version 1134174 (0.00090) [2022-07-11 09:31:40,533][26022] Updated weights on worker 0-0, policy_version 1134184 (0.00092) [2022-07-11 09:31:42,593][25689] Fps is (10 sec: 5559.5, 60 sec: 5537.9, 300 sec: 5546.8). Total num frames: 1161413632. Throughput: 0: 5771.3. Samples: 1161414458. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:42,594][25689] Avg episode reward: [(0, '0.462')] [2022-07-11 09:31:42,626][26022] Updated weights on worker 0-0, policy_version 1134194 (0.00089) [2022-07-11 09:31:44,597][26022] Updated weights on worker 0-0, policy_version 1134204 (0.00083) [2022-07-11 09:31:46,120][26022] Updated weights on worker 0-0, policy_version 1134214 (0.00086) [2022-07-11 09:31:47,699][25689] Fps is (10 sec: 5437.9, 60 sec: 5548.5, 300 sec: 5542.0). Total num frames: 1161442304. Throughput: 0: 5771.9. Samples: 1161447976. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:47,700][25689] Avg episode reward: [(0, '2.246')] [2022-07-11 09:31:48,387][26022] Updated weights on worker 0-0, policy_version 1134224 (0.00088) [2022-07-11 09:31:49,731][26022] Updated weights on worker 0-0, policy_version 1134234 (0.00090) [2022-07-11 09:31:51,864][26022] Updated weights on worker 0-0, policy_version 1134244 (0.00088) [2022-07-11 09:31:52,779][25689] Fps is (10 sec: 5730.2, 60 sec: 5566.8, 300 sec: 5547.5). Total num frames: 1161472000. Throughput: 0: 4954.1. Samples: 1161464774. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:52,780][25689] Avg episode reward: [(0, '2.217')] [2022-07-11 09:31:53,475][26022] Updated weights on worker 0-0, policy_version 1134254 (0.00084) [2022-07-11 09:31:54,048][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:31:54,058][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001134257_1161479168.pth [2022-07-11 09:31:54,059][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001132305_1159480320.pth [2022-07-11 09:31:55,474][26022] Updated weights on worker 0-0, policy_version 1134264 (0.00090) [2022-07-11 09:31:57,252][26022] Updated weights on worker 0-0, policy_version 1134274 (0.00088) [2022-07-11 09:31:57,815][25689] Fps is (10 sec: 5669.0, 60 sec: 5555.8, 300 sec: 5543.9). Total num frames: 1161499648. Throughput: 0: 5801.7. Samples: 1161498572. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:31:57,815][25689] Avg episode reward: [(0, '2.226')] [2022-07-11 09:31:59,078][26022] Updated weights on worker 0-0, policy_version 1134284 (0.00086) [2022-07-11 09:32:00,743][26022] Updated weights on worker 0-0, policy_version 1134294 (0.00052) [2022-07-11 09:32:02,818][25689] Fps is (10 sec: 5304.5, 60 sec: 5525.9, 300 sec: 5548.1). Total num frames: 1161525248. Throughput: 0: 5763.1. Samples: 1161531012. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:02,818][25689] Avg episode reward: [(0, '2.110')] [2022-07-11 09:32:03,086][26022] Updated weights on worker 0-0, policy_version 1134304 (0.00079) [2022-07-11 09:32:04,701][26022] Updated weights on worker 0-0, policy_version 1134314 (0.00087) [2022-07-11 09:32:06,758][26022] Updated weights on worker 0-0, policy_version 1134324 (0.00081) [2022-07-11 09:32:07,883][25689] Fps is (10 sec: 5492.0, 60 sec: 5552.0, 300 sec: 5550.8). Total num frames: 1161554944. Throughput: 0: 4904.3. Samples: 1161546964. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:07,884][25689] Avg episode reward: [(0, '2.096')] [2022-07-11 09:32:08,587][26022] Updated weights on worker 0-0, policy_version 1134334 (0.00083) [2022-07-11 09:32:10,419][26022] Updated weights on worker 0-0, policy_version 1134344 (0.00085) [2022-07-11 09:32:12,347][26022] Updated weights on worker 0-0, policy_version 1134354 (0.00085) [2022-07-11 09:32:12,893][25689] Fps is (10 sec: 5589.9, 60 sec: 5557.1, 300 sec: 5547.6). Total num frames: 1161581568. Throughput: 0: 5737.0. Samples: 1161580166. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:12,893][25689] Avg episode reward: [(0, '0.845')] [2022-07-11 09:32:14,096][26022] Updated weights on worker 0-0, policy_version 1134364 (0.00086) [2022-07-11 09:32:15,962][26022] Updated weights on worker 0-0, policy_version 1134374 (0.00087) [2022-07-11 09:32:17,822][26022] Updated weights on worker 0-0, policy_version 1134384 (0.00087) [2022-07-11 09:32:17,908][25689] Fps is (10 sec: 5516.0, 60 sec: 5578.9, 300 sec: 5550.9). Total num frames: 1161610240. Throughput: 0: 5752.3. Samples: 1161614152. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:17,908][25689] Avg episode reward: [(0, '-0.161')] [2022-07-11 09:32:19,427][26022] Updated weights on worker 0-0, policy_version 1134394 (0.00077) [2022-07-11 09:32:21,307][26022] Updated weights on worker 0-0, policy_version 1134404 (0.00086) [2022-07-11 09:32:22,922][25689] Fps is (10 sec: 5717.5, 60 sec: 5570.4, 300 sec: 5548.0). Total num frames: 1161638912. Throughput: 0: 4972.6. Samples: 1161630986. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:22,923][25689] Avg episode reward: [(0, '-0.343')] [2022-07-11 09:32:23,287][26022] Updated weights on worker 0-0, policy_version 1134414 (0.00083) [2022-07-11 09:32:24,827][26022] Updated weights on worker 0-0, policy_version 1134424 (0.00086) [2022-07-11 09:32:26,902][26022] Updated weights on worker 0-0, policy_version 1134434 (0.00090) [2022-07-11 09:32:27,991][25689] Fps is (10 sec: 5585.5, 60 sec: 5558.8, 300 sec: 5547.1). Total num frames: 1161666560. Throughput: 0: 5847.4. Samples: 1161664542. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:27,992][25689] Avg episode reward: [(0, '-0.889')] [2022-07-11 09:32:28,610][26022] Updated weights on worker 0-0, policy_version 1134444 (0.00092) [2022-07-11 09:32:30,600][26022] Updated weights on worker 0-0, policy_version 1134454 (0.00090) [2022-07-11 09:32:32,277][26022] Updated weights on worker 0-0, policy_version 1134464 (0.00089) [2022-07-11 09:32:32,999][25689] Fps is (10 sec: 5487.5, 60 sec: 5565.2, 300 sec: 5547.5). Total num frames: 1161694208. Throughput: 0: 5857.6. Samples: 1161697940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2022-07-11 09:32:33,000][25689] Avg episode reward: [(0, '-0.887')] [2022-07-11 09:32:34,095][26022] Updated weights on worker 0-0, policy_version 1134474 (0.00087) [2022-07-11 09:32:36,066][26022] Updated weights on worker 0-0, policy_version 1134484 (0.00097) [2022-07-11 09:32:38,025][25689] Fps is (10 sec: 5510.6, 60 sec: 5535.2, 300 sec: 5547.1). Total num frames: 1161721856. Throughput: 0: 5841.2. Samples: 1161731664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:32:38,026][25689] Avg episode reward: [(0, '-0.483')] [2022-07-11 09:32:38,027][26022] Updated weights on worker 0-0, policy_version 1134494 (0.00079) [2022-07-11 09:32:39,688][26022] Updated weights on worker 0-0, policy_version 1134504 (0.00084) [2022-07-11 09:32:41,547][26022] Updated weights on worker 0-0, policy_version 1134514 (0.00089) [2022-07-11 09:32:43,045][25689] Fps is (10 sec: 5606.1, 60 sec: 5573.0, 300 sec: 5549.2). Total num frames: 1161750528. Throughput: 0: 5837.9. Samples: 1161748462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:32:43,046][25689] Avg episode reward: [(0, '-0.140')] [2022-07-11 09:32:43,469][26022] Updated weights on worker 0-0, policy_version 1134524 (0.00081) [2022-07-11 09:32:45,303][26022] Updated weights on worker 0-0, policy_version 1134534 (0.00101) [2022-07-11 09:32:46,967][26022] Updated weights on worker 0-0, policy_version 1134544 (0.00088) [2022-07-11 09:32:48,154][25689] Fps is (10 sec: 5560.7, 60 sec: 5555.8, 300 sec: 5538.1). Total num frames: 1161778176. Throughput: 0: 5845.1. Samples: 1161782394. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:32:48,154][25689] Avg episode reward: [(0, '0.923')] [2022-07-11 09:32:48,771][26022] Updated weights on worker 0-0, policy_version 1134554 (0.00078) [2022-07-11 09:32:50,721][26022] Updated weights on worker 0-0, policy_version 1134564 (0.00088) [2022-07-11 09:32:52,368][26022] Updated weights on worker 0-0, policy_version 1134574 (0.00086) [2022-07-11 09:32:53,170][25689] Fps is (10 sec: 5663.9, 60 sec: 5561.7, 300 sec: 5552.2). Total num frames: 1161807872. Throughput: 0: 5859.8. Samples: 1161816136. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:32:53,170][25689] Avg episode reward: [(0, '0.986')] [2022-07-11 09:32:54,440][26022] Updated weights on worker 0-0, policy_version 1134584 (0.00080) [2022-07-11 09:32:56,183][26022] Updated weights on worker 0-0, policy_version 1134594 (0.00087) [2022-07-11 09:32:57,813][26022] Updated weights on worker 0-0, policy_version 1134604 (0.00091) [2022-07-11 09:32:58,245][25689] Fps is (10 sec: 5784.0, 60 sec: 5575.0, 300 sec: 5554.5). Total num frames: 1161836544. Throughput: 0: 5020.9. Samples: 1161833184. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:32:58,246][25689] Avg episode reward: [(0, '1.720')] [2022-07-11 09:32:59,765][26022] Updated weights on worker 0-0, policy_version 1134614 (0.00087) [2022-07-11 09:33:01,522][26022] Updated weights on worker 0-0, policy_version 1134624 (0.00089) [2022-07-11 09:33:03,329][25689] Fps is (10 sec: 5342.1, 60 sec: 5567.5, 300 sec: 5547.6). Total num frames: 1161862144. Throughput: 0: 5735.5. Samples: 1161864800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:03,330][25689] Avg episode reward: [(0, '1.462')] [2022-07-11 09:33:03,911][26022] Updated weights on worker 0-0, policy_version 1134634 (0.00089) [2022-07-11 09:33:05,422][26022] Updated weights on worker 0-0, policy_version 1134644 (0.00078) [2022-07-11 09:33:07,545][26022] Updated weights on worker 0-0, policy_version 1134654 (0.00088) [2022-07-11 09:33:08,392][25689] Fps is (10 sec: 5449.6, 60 sec: 5567.7, 300 sec: 5557.0). Total num frames: 1161891840. Throughput: 0: 5743.7. Samples: 1161898636. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:08,392][25689] Avg episode reward: [(0, '1.289')] [2022-07-11 09:33:09,079][26022] Updated weights on worker 0-0, policy_version 1134664 (0.00090) [2022-07-11 09:33:11,030][26022] Updated weights on worker 0-0, policy_version 1134674 (0.00145) [2022-07-11 09:33:12,836][26022] Updated weights on worker 0-0, policy_version 1134684 (0.00091) [2022-07-11 09:33:13,425][25689] Fps is (10 sec: 5578.5, 60 sec: 5565.6, 300 sec: 5553.2). Total num frames: 1161918464. Throughput: 0: 4889.9. Samples: 1161915182. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:13,425][25689] Avg episode reward: [(0, '1.076')] [2022-07-11 09:33:14,843][26022] Updated weights on worker 0-0, policy_version 1134694 (0.00075) [2022-07-11 09:33:16,576][26022] Updated weights on worker 0-0, policy_version 1134705 (0.00092) [2022-07-11 09:33:18,436][25689] Fps is (10 sec: 5301.5, 60 sec: 5532.1, 300 sec: 5539.5). Total num frames: 1161945088. Throughput: 0: 5726.0. Samples: 1161948796. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:18,436][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 09:33:18,678][26022] Updated weights on worker 0-0, policy_version 1134715 (0.00089) [2022-07-11 09:33:20,385][26022] Updated weights on worker 0-0, policy_version 1134725 (0.00086) [2022-07-11 09:33:22,409][26022] Updated weights on worker 0-0, policy_version 1134735 (0.00086) [2022-07-11 09:33:23,438][25689] Fps is (10 sec: 5726.7, 60 sec: 5567.1, 300 sec: 5558.1). Total num frames: 1161975808. Throughput: 0: 5841.5. Samples: 1161982270. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:23,439][25689] Avg episode reward: [(0, '1.491')] [2022-07-11 09:33:24,252][26022] Updated weights on worker 0-0, policy_version 1134745 (0.00086) [2022-07-11 09:33:25,901][26022] Updated weights on worker 0-0, policy_version 1134755 (0.00084) [2022-07-11 09:33:27,873][26022] Updated weights on worker 0-0, policy_version 1134765 (0.00090) [2022-07-11 09:33:28,531][25689] Fps is (10 sec: 5579.1, 60 sec: 5531.1, 300 sec: 5546.3). Total num frames: 1162001408. Throughput: 0: 4980.0. Samples: 1161998928. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:28,531][25689] Avg episode reward: [(0, '1.146')] [2022-07-11 09:33:29,547][26022] Updated weights on worker 0-0, policy_version 1134775 (0.00096) [2022-07-11 09:33:31,638][26022] Updated weights on worker 0-0, policy_version 1134785 (0.00088) [2022-07-11 09:33:33,358][26022] Updated weights on worker 0-0, policy_version 1134795 (0.00090) [2022-07-11 09:33:33,541][25689] Fps is (10 sec: 5473.5, 60 sec: 5564.7, 300 sec: 5546.4). Total num frames: 1162031104. Throughput: 0: 5824.1. Samples: 1162032340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:33,541][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 09:33:35,228][26022] Updated weights on worker 0-0, policy_version 1134805 (0.00073) [2022-07-11 09:33:36,973][26022] Updated weights on worker 0-0, policy_version 1134815 (0.00091) [2022-07-11 09:33:38,560][25689] Fps is (10 sec: 5615.5, 60 sec: 5548.4, 300 sec: 5549.7). Total num frames: 1162057728. Throughput: 0: 5822.4. Samples: 1162065966. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:38,561][25689] Avg episode reward: [(0, '1.422')] [2022-07-11 09:33:38,893][26022] Updated weights on worker 0-0, policy_version 1134825 (0.00084) [2022-07-11 09:33:40,490][26022] Updated weights on worker 0-0, policy_version 1134835 (0.00095) [2022-07-11 09:33:42,491][26022] Updated weights on worker 0-0, policy_version 1134845 (0.00098) [2022-07-11 09:33:43,563][25689] Fps is (10 sec: 5619.7, 60 sec: 5566.9, 300 sec: 5555.1). Total num frames: 1162087424. Throughput: 0: 4992.6. Samples: 1162082742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:43,564][25689] Avg episode reward: [(0, '1.495')] [2022-07-11 09:33:44,140][26022] Updated weights on worker 0-0, policy_version 1134855 (0.00087) [2022-07-11 09:33:46,228][26022] Updated weights on worker 0-0, policy_version 1134865 (0.00090) [2022-07-11 09:33:48,093][26022] Updated weights on worker 0-0, policy_version 1134875 (0.00086) [2022-07-11 09:33:48,620][25689] Fps is (10 sec: 5700.0, 60 sec: 5571.6, 300 sec: 5547.5). Total num frames: 1162115072. Throughput: 0: 5813.3. Samples: 1162115714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:48,621][25689] Avg episode reward: [(0, '1.377')] [2022-07-11 09:33:49,895][26022] Updated weights on worker 0-0, policy_version 1134885 (0.00099) [2022-07-11 09:33:51,870][26022] Updated weights on worker 0-0, policy_version 1134895 (0.00102) [2022-07-11 09:33:53,646][25689] Fps is (10 sec: 5382.4, 60 sec: 5519.9, 300 sec: 5550.6). Total num frames: 1162141696. Throughput: 0: 5802.1. Samples: 1162148992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:53,647][25689] Avg episode reward: [(0, '1.374')] [2022-07-11 09:33:53,675][26022] Updated weights on worker 0-0, policy_version 1134905 (0.00082) [2022-07-11 09:33:54,300][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:33:54,308][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001134907_1162144768.pth [2022-07-11 09:33:54,309][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001132956_1160146944.pth [2022-07-11 09:33:55,351][26022] Updated weights on worker 0-0, policy_version 1134915 (0.00089) [2022-07-11 09:33:57,415][26022] Updated weights on worker 0-0, policy_version 1134925 (0.00086) [2022-07-11 09:33:58,662][25689] Fps is (10 sec: 5506.7, 60 sec: 5525.4, 300 sec: 5550.4). Total num frames: 1162170368. Throughput: 0: 4968.6. Samples: 1162165844. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:33:58,662][25689] Avg episode reward: [(0, '0.981')] [2022-07-11 09:33:59,185][26022] Updated weights on worker 0-0, policy_version 1134935 (0.00085) [2022-07-11 09:34:00,857][26022] Updated weights on worker 0-0, policy_version 1134945 (0.00086) [2022-07-11 09:34:03,164][26022] Updated weights on worker 0-0, policy_version 1134955 (0.00087) [2022-07-11 09:34:03,663][25689] Fps is (10 sec: 5315.7, 60 sec: 5515.9, 300 sec: 5544.7). Total num frames: 1162194944. Throughput: 0: 5791.7. Samples: 1162199158. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:03,664][25689] Avg episode reward: [(0, '1.182')] [2022-07-11 09:34:04,770][26022] Updated weights on worker 0-0, policy_version 1134965 (0.00091) [2022-07-11 09:34:06,915][26022] Updated weights on worker 0-0, policy_version 1134975 (0.00086) [2022-07-11 09:34:08,742][25689] Fps is (10 sec: 5282.7, 60 sec: 5497.6, 300 sec: 5544.7). Total num frames: 1162223616. Throughput: 0: 5710.7. Samples: 1162230620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:08,742][25689] Avg episode reward: [(0, '0.443')] [2022-07-11 09:34:09,050][26022] Updated weights on worker 0-0, policy_version 1134985 (0.00086) [2022-07-11 09:34:10,451][26022] Updated weights on worker 0-0, policy_version 1134995 (0.00085) [2022-07-11 09:34:12,600][26022] Updated weights on worker 0-0, policy_version 1135005 (0.00093) [2022-07-11 09:34:13,764][25689] Fps is (10 sec: 5778.8, 60 sec: 5549.5, 300 sec: 5552.9). Total num frames: 1162253312. Throughput: 0: 4898.0. Samples: 1162247528. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:13,764][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 09:34:14,048][26022] Updated weights on worker 0-0, policy_version 1135015 (0.00101) [2022-07-11 09:34:16,152][26022] Updated weights on worker 0-0, policy_version 1135025 (0.00093) [2022-07-11 09:34:17,895][26022] Updated weights on worker 0-0, policy_version 1135035 (0.00085) [2022-07-11 09:34:18,771][25689] Fps is (10 sec: 5717.4, 60 sec: 5566.7, 300 sec: 5549.6). Total num frames: 1162280960. Throughput: 0: 5723.8. Samples: 1162280948. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:18,772][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 09:34:19,629][26022] Updated weights on worker 0-0, policy_version 1135045 (0.00088) [2022-07-11 09:34:21,415][26022] Updated weights on worker 0-0, policy_version 1135055 (0.00083) [2022-07-11 09:34:23,518][26022] Updated weights on worker 0-0, policy_version 1135065 (0.00084) [2022-07-11 09:34:23,791][25689] Fps is (10 sec: 5412.6, 60 sec: 5497.3, 300 sec: 5547.1). Total num frames: 1162307584. Throughput: 0: 5745.9. Samples: 1162314808. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:23,791][25689] Avg episode reward: [(0, '0.245')] [2022-07-11 09:34:25,007][26022] Updated weights on worker 0-0, policy_version 1135075 (0.00085) [2022-07-11 09:34:27,107][26022] Updated weights on worker 0-0, policy_version 1135085 (0.00082) [2022-07-11 09:34:28,665][26022] Updated weights on worker 0-0, policy_version 1135095 (0.00082) [2022-07-11 09:34:28,875][25689] Fps is (10 sec: 5574.1, 60 sec: 5565.9, 300 sec: 5552.5). Total num frames: 1162337280. Throughput: 0: 5023.5. Samples: 1162331762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:28,876][25689] Avg episode reward: [(0, '0.307')] [2022-07-11 09:34:30,671][26022] Updated weights on worker 0-0, policy_version 1135105 (0.00084) [2022-07-11 09:34:32,395][26022] Updated weights on worker 0-0, policy_version 1135115 (0.00089) [2022-07-11 09:34:33,880][25689] Fps is (10 sec: 5785.4, 60 sec: 5549.5, 300 sec: 5549.3). Total num frames: 1162365952. Throughput: 0: 5872.9. Samples: 1162365666. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:33,880][25689] Avg episode reward: [(0, '1.162')] [2022-07-11 09:34:34,320][26022] Updated weights on worker 0-0, policy_version 1135125 (0.00085) [2022-07-11 09:34:35,892][26022] Updated weights on worker 0-0, policy_version 1135135 (0.00085) [2022-07-11 09:34:38,080][26022] Updated weights on worker 0-0, policy_version 1135145 (0.00090) [2022-07-11 09:34:38,893][25689] Fps is (10 sec: 5621.9, 60 sec: 5566.9, 300 sec: 5552.8). Total num frames: 1162393600. Throughput: 0: 5892.3. Samples: 1162399512. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:38,894][25689] Avg episode reward: [(0, '1.195')] [2022-07-11 09:34:39,613][26022] Updated weights on worker 0-0, policy_version 1135155 (0.00088) [2022-07-11 09:34:41,579][26022] Updated weights on worker 0-0, policy_version 1135165 (0.00091) [2022-07-11 09:34:43,454][26022] Updated weights on worker 0-0, policy_version 1135175 (0.00104) [2022-07-11 09:34:43,918][25689] Fps is (10 sec: 5406.6, 60 sec: 5514.1, 300 sec: 5546.9). Total num frames: 1162420224. Throughput: 0: 5032.9. Samples: 1162416104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:43,918][25689] Avg episode reward: [(0, '1.244')] [2022-07-11 09:34:45,318][26022] Updated weights on worker 0-0, policy_version 1135185 (0.00091) [2022-07-11 09:34:47,260][26022] Updated weights on worker 0-0, policy_version 1135195 (0.00093) [2022-07-11 09:34:49,023][25689] Fps is (10 sec: 5559.7, 60 sec: 5543.6, 300 sec: 5553.7). Total num frames: 1162449920. Throughput: 0: 5820.2. Samples: 1162449028. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:49,024][25689] Avg episode reward: [(0, '1.279')] [2022-07-11 09:34:49,034][26022] Updated weights on worker 0-0, policy_version 1135205 (0.00932) [2022-07-11 09:34:50,750][26022] Updated weights on worker 0-0, policy_version 1135215 (0.00090) [2022-07-11 09:34:52,875][26022] Updated weights on worker 0-0, policy_version 1135225 (0.00087) [2022-07-11 09:34:54,039][25689] Fps is (10 sec: 5665.5, 60 sec: 5561.4, 300 sec: 5553.7). Total num frames: 1162477568. Throughput: 0: 5810.0. Samples: 1162482794. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:54,040][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 09:34:54,324][26022] Updated weights on worker 0-0, policy_version 1135235 (0.00091) [2022-07-11 09:34:56,604][26022] Updated weights on worker 0-0, policy_version 1135245 (0.00088) [2022-07-11 09:34:57,927][26022] Updated weights on worker 0-0, policy_version 1135255 (0.00088) [2022-07-11 09:34:59,095][25689] Fps is (10 sec: 5592.1, 60 sec: 5557.8, 300 sec: 5553.0). Total num frames: 1162506240. Throughput: 0: 4967.9. Samples: 1162499870. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:34:59,096][25689] Avg episode reward: [(0, '1.994')] [2022-07-11 09:35:00,095][26022] Updated weights on worker 0-0, policy_version 1135265 (0.00097) [2022-07-11 09:35:01,563][26022] Updated weights on worker 0-0, policy_version 1135275 (0.00084) [2022-07-11 09:35:03,782][26022] Updated weights on worker 0-0, policy_version 1135285 (0.00082) [2022-07-11 09:35:04,111][25689] Fps is (10 sec: 5388.6, 60 sec: 5573.4, 300 sec: 5554.3). Total num frames: 1162531840. Throughput: 0: 5779.9. Samples: 1162532818. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:04,111][25689] Avg episode reward: [(0, '1.795')] [2022-07-11 09:35:05,638][26022] Updated weights on worker 0-0, policy_version 1135295 (0.00095) [2022-07-11 09:35:07,639][26022] Updated weights on worker 0-0, policy_version 1135305 (0.00087) [2022-07-11 09:35:09,211][25689] Fps is (10 sec: 5466.2, 60 sec: 5588.3, 300 sec: 5552.6). Total num frames: 1162561536. Throughput: 0: 5776.2. Samples: 1162565634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:09,211][25689] Avg episode reward: [(0, '1.822')] [2022-07-11 09:35:09,454][26022] Updated weights on worker 0-0, policy_version 1135315 (0.00081) [2022-07-11 09:35:11,306][26022] Updated weights on worker 0-0, policy_version 1135325 (0.00091) [2022-07-11 09:35:12,888][26022] Updated weights on worker 0-0, policy_version 1135335 (0.00088) [2022-07-11 09:35:14,240][25689] Fps is (10 sec: 5661.0, 60 sec: 5553.7, 300 sec: 5556.7). Total num frames: 1162589184. Throughput: 0: 4931.3. Samples: 1162582412. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:14,242][25689] Avg episode reward: [(0, '1.823')] [2022-07-11 09:35:15,053][26022] Updated weights on worker 0-0, policy_version 1135345 (0.00081) [2022-07-11 09:35:16,567][26022] Updated weights on worker 0-0, policy_version 1135355 (0.00090) [2022-07-11 09:35:18,624][26022] Updated weights on worker 0-0, policy_version 1135365 (0.00085) [2022-07-11 09:35:19,249][25689] Fps is (10 sec: 5610.6, 60 sec: 5570.6, 300 sec: 5557.7). Total num frames: 1162617856. Throughput: 0: 5774.0. Samples: 1162616240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:19,249][25689] Avg episode reward: [(0, '1.833')] [2022-07-11 09:35:20,267][26022] Updated weights on worker 0-0, policy_version 1135375 (0.00098) [2022-07-11 09:35:22,295][26022] Updated weights on worker 0-0, policy_version 1135385 (0.00086) [2022-07-11 09:35:23,899][26022] Updated weights on worker 0-0, policy_version 1135395 (0.00088) [2022-07-11 09:35:24,259][25689] Fps is (10 sec: 5723.7, 60 sec: 5605.3, 300 sec: 5555.1). Total num frames: 1162646528. Throughput: 0: 5813.9. Samples: 1162649960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:24,261][25689] Avg episode reward: [(0, '1.932')] [2022-07-11 09:35:25,950][26022] Updated weights on worker 0-0, policy_version 1135405 (0.00088) [2022-07-11 09:35:27,543][26022] Updated weights on worker 0-0, policy_version 1135415 (0.00089) [2022-07-11 09:35:29,307][25689] Fps is (10 sec: 5599.5, 60 sec: 5574.8, 300 sec: 5559.2). Total num frames: 1162674176. Throughput: 0: 5879.6. Samples: 1162683792. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:29,308][25689] Avg episode reward: [(0, '1.361')] [2022-07-11 09:35:29,334][26022] Updated weights on worker 0-0, policy_version 1135425 (0.00084) [2022-07-11 09:35:31,340][26022] Updated weights on worker 0-0, policy_version 1135435 (0.00083) [2022-07-11 09:35:33,116][26022] Updated weights on worker 0-0, policy_version 1135445 (0.00083) [2022-07-11 09:35:34,322][25689] Fps is (10 sec: 5494.9, 60 sec: 5556.9, 300 sec: 5553.2). Total num frames: 1162701824. Throughput: 0: 5885.0. Samples: 1162700594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:34,323][25689] Avg episode reward: [(0, '0.412')] [2022-07-11 09:35:34,976][26022] Updated weights on worker 0-0, policy_version 1135455 (0.00085) [2022-07-11 09:35:36,719][26022] Updated weights on worker 0-0, policy_version 1135465 (0.00086) [2022-07-11 09:35:38,487][26022] Updated weights on worker 0-0, policy_version 1135475 (0.00090) [2022-07-11 09:35:39,412][25689] Fps is (10 sec: 5675.0, 60 sec: 5583.8, 300 sec: 5559.4). Total num frames: 1162731520. Throughput: 0: 5854.8. Samples: 1162734288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:39,412][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 09:35:40,640][26022] Updated weights on worker 0-0, policy_version 1135485 (0.00092) [2022-07-11 09:35:42,071][26022] Updated weights on worker 0-0, policy_version 1135495 (0.00087) [2022-07-11 09:35:44,362][26022] Updated weights on worker 0-0, policy_version 1135505 (0.00090) [2022-07-11 09:35:44,422][25689] Fps is (10 sec: 5576.3, 60 sec: 5585.0, 300 sec: 5556.5). Total num frames: 1162758144. Throughput: 0: 5848.6. Samples: 1162767886. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:44,423][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 09:35:45,679][26022] Updated weights on worker 0-0, policy_version 1135515 (0.00093) [2022-07-11 09:35:47,764][26022] Updated weights on worker 0-0, policy_version 1135525 (0.00088) [2022-07-11 09:35:49,519][25689] Fps is (10 sec: 5470.9, 60 sec: 5568.9, 300 sec: 5556.5). Total num frames: 1162786816. Throughput: 0: 4984.3. Samples: 1162784534. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:49,519][25689] Avg episode reward: [(0, '0.586')] [2022-07-11 09:35:49,737][26022] Updated weights on worker 0-0, policy_version 1135535 (0.00086) [2022-07-11 09:35:51,504][26022] Updated weights on worker 0-0, policy_version 1135545 (0.00085) [2022-07-11 09:35:53,275][26022] Updated weights on worker 0-0, policy_version 1135555 (0.00087) [2022-07-11 09:35:54,369][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:35:54,385][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001135561_1162814464.pth [2022-07-11 09:35:54,385][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001133606_1160812544.pth [2022-07-11 09:35:54,573][25689] Fps is (10 sec: 5649.0, 60 sec: 5582.3, 300 sec: 5557.3). Total num frames: 1162815488. Throughput: 0: 5805.4. Samples: 1162818158. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:54,574][25689] Avg episode reward: [(0, '-0.044')] [2022-07-11 09:35:55,134][26022] Updated weights on worker 0-0, policy_version 1135565 (0.00095) [2022-07-11 09:35:56,751][26022] Updated weights on worker 0-0, policy_version 1135575 (0.00084) [2022-07-11 09:35:58,931][26022] Updated weights on worker 0-0, policy_version 1135585 (0.00083) [2022-07-11 09:35:59,584][25689] Fps is (10 sec: 5595.4, 60 sec: 5569.5, 300 sec: 5557.9). Total num frames: 1162843136. Throughput: 0: 5843.7. Samples: 1162852172. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:35:59,585][25689] Avg episode reward: [(0, '0.026')] [2022-07-11 09:36:00,320][26022] Updated weights on worker 0-0, policy_version 1135595 (0.00090) [2022-07-11 09:36:02,741][26022] Updated weights on worker 0-0, policy_version 1135605 (0.00084) [2022-07-11 09:36:04,408][26022] Updated weights on worker 0-0, policy_version 1135615 (0.00087) [2022-07-11 09:36:04,609][25689] Fps is (10 sec: 5408.0, 60 sec: 5585.6, 300 sec: 5553.7). Total num frames: 1162869760. Throughput: 0: 4909.7. Samples: 1162866996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:04,609][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 09:36:06,509][26022] Updated weights on worker 0-0, policy_version 1135625 (0.00615) [2022-07-11 09:36:08,142][26022] Updated weights on worker 0-0, policy_version 1135635 (0.00966) [2022-07-11 09:36:09,706][25689] Fps is (10 sec: 5361.9, 60 sec: 5552.0, 300 sec: 5556.6). Total num frames: 1162897408. Throughput: 0: 5751.2. Samples: 1162900634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:09,707][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 09:36:10,242][26022] Updated weights on worker 0-0, policy_version 1135645 (0.00088) [2022-07-11 09:36:11,696][26022] Updated weights on worker 0-0, policy_version 1135655 (0.00083) [2022-07-11 09:36:13,883][26022] Updated weights on worker 0-0, policy_version 1135665 (0.00086) [2022-07-11 09:36:14,722][25689] Fps is (10 sec: 5670.2, 60 sec: 5587.1, 300 sec: 5564.4). Total num frames: 1162927104. Throughput: 0: 5758.2. Samples: 1162934178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:14,723][25689] Avg episode reward: [(0, '-0.678')] [2022-07-11 09:36:15,346][26022] Updated weights on worker 0-0, policy_version 1135675 (0.00090) [2022-07-11 09:36:17,432][26022] Updated weights on worker 0-0, policy_version 1135685 (0.00093) [2022-07-11 09:36:19,176][26022] Updated weights on worker 0-0, policy_version 1135695 (0.00080) [2022-07-11 09:36:19,735][25689] Fps is (10 sec: 5616.0, 60 sec: 5552.9, 300 sec: 5555.9). Total num frames: 1162953728. Throughput: 0: 4908.9. Samples: 1162951088. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:19,735][25689] Avg episode reward: [(0, '-0.899')] [2022-07-11 09:36:21,148][26022] Updated weights on worker 0-0, policy_version 1135705 (0.00080) [2022-07-11 09:36:22,874][26022] Updated weights on worker 0-0, policy_version 1135715 (0.00084) [2022-07-11 09:36:24,740][25689] Fps is (10 sec: 5417.7, 60 sec: 5536.5, 300 sec: 5554.7). Total num frames: 1162981376. Throughput: 0: 5850.9. Samples: 1162984780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:24,740][25689] Avg episode reward: [(0, '-1.067')] [2022-07-11 09:36:24,755][26022] Updated weights on worker 0-0, policy_version 1135725 (0.00085) [2022-07-11 09:36:26,547][26022] Updated weights on worker 0-0, policy_version 1135735 (0.00089) [2022-07-11 09:36:28,588][26022] Updated weights on worker 0-0, policy_version 1135745 (0.00090) [2022-07-11 09:36:29,858][25689] Fps is (10 sec: 5563.5, 60 sec: 5546.9, 300 sec: 5557.4). Total num frames: 1163010048. Throughput: 0: 5828.1. Samples: 1163018078. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:29,858][25689] Avg episode reward: [(0, '-1.849')] [2022-07-11 09:36:30,118][26022] Updated weights on worker 0-0, policy_version 1135755 (0.00086) [2022-07-11 09:36:32,105][26022] Updated weights on worker 0-0, policy_version 1135765 (0.00087) [2022-07-11 09:36:34,036][26022] Updated weights on worker 0-0, policy_version 1135775 (0.00086) [2022-07-11 09:36:34,868][25689] Fps is (10 sec: 5763.0, 60 sec: 5581.3, 300 sec: 5558.5). Total num frames: 1163039744. Throughput: 0: 4991.9. Samples: 1163034742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 09:36:34,868][25689] Avg episode reward: [(0, '-1.719')] [2022-07-11 09:36:35,778][26022] Updated weights on worker 0-0, policy_version 1135785 (0.00079) [2022-07-11 09:36:37,553][26022] Updated weights on worker 0-0, policy_version 1135795 (0.00088) [2022-07-11 09:36:39,612][26022] Updated weights on worker 0-0, policy_version 1135805 (0.00088) [2022-07-11 09:36:39,876][25689] Fps is (10 sec: 5621.5, 60 sec: 5537.9, 300 sec: 5559.5). Total num frames: 1163066368. Throughput: 0: 5812.4. Samples: 1163068160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:36:39,877][25689] Avg episode reward: [(0, '-0.203')] [2022-07-11 09:36:41,419][26022] Updated weights on worker 0-0, policy_version 1135815 (0.00080) [2022-07-11 09:36:43,232][26022] Updated weights on worker 0-0, policy_version 1135825 (0.00086) [2022-07-11 09:36:44,893][25689] Fps is (10 sec: 5413.5, 60 sec: 5554.3, 300 sec: 5557.7). Total num frames: 1163094016. Throughput: 0: 5799.4. Samples: 1163101658. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:36:44,894][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 09:36:45,090][26022] Updated weights on worker 0-0, policy_version 1135835 (0.00098) [2022-07-11 09:36:46,707][26022] Updated weights on worker 0-0, policy_version 1135845 (0.00082) [2022-07-11 09:36:48,684][26022] Updated weights on worker 0-0, policy_version 1135855 (0.00084) [2022-07-11 09:36:49,999][25689] Fps is (10 sec: 5563.7, 60 sec: 5553.4, 300 sec: 5553.8). Total num frames: 1163122688. Throughput: 0: 4974.3. Samples: 1163118268. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:36:50,000][25689] Avg episode reward: [(0, '0.305')] [2022-07-11 09:36:50,483][26022] Updated weights on worker 0-0, policy_version 1135865 (0.00090) [2022-07-11 09:36:52,215][26022] Updated weights on worker 0-0, policy_version 1135875 (0.00090) [2022-07-11 09:36:54,155][26022] Updated weights on worker 0-0, policy_version 1135885 (0.00089) [2022-07-11 09:36:55,022][25689] Fps is (10 sec: 5560.3, 60 sec: 5539.4, 300 sec: 5554.0). Total num frames: 1163150336. Throughput: 0: 5822.0. Samples: 1163152080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:36:55,023][25689] Avg episode reward: [(0, '0.402')] [2022-07-11 09:36:55,852][26022] Updated weights on worker 0-0, policy_version 1135895 (0.00087) [2022-07-11 09:36:57,980][26022] Updated weights on worker 0-0, policy_version 1135905 (0.00084) [2022-07-11 09:36:59,526][26022] Updated weights on worker 0-0, policy_version 1135915 (0.00088) [2022-07-11 09:37:00,046][25689] Fps is (10 sec: 5809.4, 60 sec: 5589.0, 300 sec: 5570.8). Total num frames: 1163181056. Throughput: 0: 5835.3. Samples: 1163185858. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:00,047][25689] Avg episode reward: [(0, '1.372')] [2022-07-11 09:37:01,784][26022] Updated weights on worker 0-0, policy_version 1135925 (0.00084) [2022-07-11 09:37:03,431][26022] Updated weights on worker 0-0, policy_version 1135935 (0.00085) [2022-07-11 09:37:05,127][25689] Fps is (10 sec: 5371.0, 60 sec: 5533.0, 300 sec: 5549.9). Total num frames: 1163204608. Throughput: 0: 4898.4. Samples: 1163200770. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:05,127][25689] Avg episode reward: [(0, '0.823')] [2022-07-11 09:37:05,528][26022] Updated weights on worker 0-0, policy_version 1135945 (0.00092) [2022-07-11 09:37:06,987][26022] Updated weights on worker 0-0, policy_version 1135955 (0.00089) [2022-07-11 09:37:09,231][26022] Updated weights on worker 0-0, policy_version 1135965 (0.00083) [2022-07-11 09:37:10,187][25689] Fps is (10 sec: 5352.0, 60 sec: 5587.2, 300 sec: 5562.7). Total num frames: 1163235328. Throughput: 0: 5762.6. Samples: 1163234602. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:10,187][25689] Avg episode reward: [(0, '1.278')] [2022-07-11 09:37:10,711][26022] Updated weights on worker 0-0, policy_version 1135975 (0.00083) [2022-07-11 09:37:12,744][26022] Updated weights on worker 0-0, policy_version 1135985 (0.00097) [2022-07-11 09:37:14,340][26022] Updated weights on worker 0-0, policy_version 1135995 (0.00080) [2022-07-11 09:37:15,239][25689] Fps is (10 sec: 5670.9, 60 sec: 5533.2, 300 sec: 5555.1). Total num frames: 1163261952. Throughput: 0: 5764.8. Samples: 1163268624. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:15,239][25689] Avg episode reward: [(0, '1.244')] [2022-07-11 09:37:16,207][26022] Updated weights on worker 0-0, policy_version 1136005 (0.00091) [2022-07-11 09:37:18,177][26022] Updated weights on worker 0-0, policy_version 1136015 (0.00088) [2022-07-11 09:37:20,010][26022] Updated weights on worker 0-0, policy_version 1136025 (0.00090) [2022-07-11 09:37:20,240][25689] Fps is (10 sec: 5602.1, 60 sec: 5584.9, 300 sec: 5558.8). Total num frames: 1163291648. Throughput: 0: 4926.8. Samples: 1163285350. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:20,241][25689] Avg episode reward: [(0, '1.471')] [2022-07-11 09:37:21,949][26022] Updated weights on worker 0-0, policy_version 1136035 (0.00082) [2022-07-11 09:37:23,607][26022] Updated weights on worker 0-0, policy_version 1136045 (0.00091) [2022-07-11 09:37:25,280][25689] Fps is (10 sec: 5711.0, 60 sec: 5581.8, 300 sec: 5559.4). Total num frames: 1163319296. Throughput: 0: 5889.4. Samples: 1163319458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:25,280][25689] Avg episode reward: [(0, '1.560')] [2022-07-11 09:37:25,384][26022] Updated weights on worker 0-0, policy_version 1136055 (0.00083) [2022-07-11 09:37:27,258][26022] Updated weights on worker 0-0, policy_version 1136065 (0.00094) [2022-07-11 09:37:28,971][26022] Updated weights on worker 0-0, policy_version 1136075 (0.00083) [2022-07-11 09:37:30,386][25689] Fps is (10 sec: 5550.9, 60 sec: 5582.8, 300 sec: 5561.0). Total num frames: 1163347968. Throughput: 0: 5875.1. Samples: 1163353276. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:30,387][25689] Avg episode reward: [(0, '1.512')] [2022-07-11 09:37:30,959][26022] Updated weights on worker 0-0, policy_version 1136085 (0.00080) [2022-07-11 09:37:32,719][26022] Updated weights on worker 0-0, policy_version 1136095 (0.00086) [2022-07-11 09:37:34,606][26022] Updated weights on worker 0-0, policy_version 1136105 (0.00091) [2022-07-11 09:37:35,410][25689] Fps is (10 sec: 5660.8, 60 sec: 5564.7, 300 sec: 5564.5). Total num frames: 1163376640. Throughput: 0: 5041.2. Samples: 1163370310. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:35,410][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 09:37:36,422][26022] Updated weights on worker 0-0, policy_version 1136115 (0.00087) [2022-07-11 09:37:38,147][26022] Updated weights on worker 0-0, policy_version 1136125 (0.00053) [2022-07-11 09:37:40,022][26022] Updated weights on worker 0-0, policy_version 1136135 (0.00119) [2022-07-11 09:37:40,415][25689] Fps is (10 sec: 5718.3, 60 sec: 5598.8, 300 sec: 5564.7). Total num frames: 1163405312. Throughput: 0: 5872.5. Samples: 1163403824. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:40,415][25689] Avg episode reward: [(0, '0.845')] [2022-07-11 09:37:41,796][26022] Updated weights on worker 0-0, policy_version 1136145 (0.00096) [2022-07-11 09:37:43,575][26022] Updated weights on worker 0-0, policy_version 1136155 (0.00088) [2022-07-11 09:37:45,442][25689] Fps is (10 sec: 5511.7, 60 sec: 5580.9, 300 sec: 5562.8). Total num frames: 1163431936. Throughput: 0: 5857.8. Samples: 1163437566. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:45,443][25689] Avg episode reward: [(0, '0.151')] [2022-07-11 09:37:45,719][26022] Updated weights on worker 0-0, policy_version 1136165 (0.00098) [2022-07-11 09:37:47,202][26022] Updated weights on worker 0-0, policy_version 1136175 (0.00092) [2022-07-11 09:37:49,179][26022] Updated weights on worker 0-0, policy_version 1136185 (0.00096) [2022-07-11 09:37:50,491][25689] Fps is (10 sec: 5589.3, 60 sec: 5603.1, 300 sec: 5562.2). Total num frames: 1163461632. Throughput: 0: 5002.5. Samples: 1163453850. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:50,491][25689] Avg episode reward: [(0, '-0.618')] [2022-07-11 09:37:51,019][26022] Updated weights on worker 0-0, policy_version 1136195 (0.00082) [2022-07-11 09:37:52,757][26022] Updated weights on worker 0-0, policy_version 1136205 (0.00090) [2022-07-11 09:37:54,416][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:37:54,427][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001136214_1163483136.pth [2022-07-11 09:37:54,428][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001134257_1161479168.pth [2022-07-11 09:37:54,650][26022] Updated weights on worker 0-0, policy_version 1136215 (0.00092) [2022-07-11 09:37:55,587][25689] Fps is (10 sec: 5450.7, 60 sec: 5562.6, 300 sec: 5551.5). Total num frames: 1163487232. Throughput: 0: 5827.7. Samples: 1163487898. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:37:55,588][25689] Avg episode reward: [(0, '-0.619')] [2022-07-11 09:37:56,339][26022] Updated weights on worker 0-0, policy_version 1136225 (0.00089) [2022-07-11 09:37:58,358][26022] Updated weights on worker 0-0, policy_version 1136235 (0.00084) [2022-07-11 09:38:00,130][26022] Updated weights on worker 0-0, policy_version 1136245 (0.00645) [2022-07-11 09:38:00,632][25689] Fps is (10 sec: 5553.5, 60 sec: 5560.6, 300 sec: 5569.4). Total num frames: 1163517952. Throughput: 0: 5826.9. Samples: 1163521630. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:00,633][25689] Avg episode reward: [(0, '-0.352')] [2022-07-11 09:38:02,364][26022] Updated weights on worker 0-0, policy_version 1136255 (0.00080) [2022-07-11 09:38:03,987][26022] Updated weights on worker 0-0, policy_version 1136265 (0.00084) [2022-07-11 09:38:05,636][25689] Fps is (10 sec: 5706.2, 60 sec: 5618.4, 300 sec: 5560.2). Total num frames: 1163544576. Throughput: 0: 4900.2. Samples: 1163536522. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:05,637][25689] Avg episode reward: [(0, '-0.803')] [2022-07-11 09:38:05,875][26022] Updated weights on worker 0-0, policy_version 1136275 (0.00081) [2022-07-11 09:38:07,635][26022] Updated weights on worker 0-0, policy_version 1136285 (0.00089) [2022-07-11 09:38:09,761][26022] Updated weights on worker 0-0, policy_version 1136295 (0.00099) [2022-07-11 09:38:10,728][25689] Fps is (10 sec: 5274.1, 60 sec: 5547.8, 300 sec: 5559.1). Total num frames: 1163571200. Throughput: 0: 5750.2. Samples: 1163570220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:10,729][25689] Avg episode reward: [(0, '-0.956')] [2022-07-11 09:38:11,349][26022] Updated weights on worker 0-0, policy_version 1136305 (0.00092) [2022-07-11 09:38:13,233][26022] Updated weights on worker 0-0, policy_version 1136315 (0.00090) [2022-07-11 09:38:14,908][26022] Updated weights on worker 0-0, policy_version 1136325 (0.00084) [2022-07-11 09:38:15,773][25689] Fps is (10 sec: 5454.9, 60 sec: 5582.2, 300 sec: 5565.3). Total num frames: 1163599872. Throughput: 0: 5761.8. Samples: 1163604208. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:15,774][25689] Avg episode reward: [(0, '-0.146')] [2022-07-11 09:38:16,986][26022] Updated weights on worker 0-0, policy_version 1136335 (0.00092) [2022-07-11 09:38:18,597][26022] Updated weights on worker 0-0, policy_version 1136345 (0.00085) [2022-07-11 09:38:20,594][26022] Updated weights on worker 0-0, policy_version 1136355 (0.00082) [2022-07-11 09:38:20,863][25689] Fps is (10 sec: 5658.2, 60 sec: 5557.3, 300 sec: 5556.8). Total num frames: 1163628544. Throughput: 0: 5748.4. Samples: 1163637924. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:20,864][25689] Avg episode reward: [(0, '-1.508')] [2022-07-11 09:38:22,291][26022] Updated weights on worker 0-0, policy_version 1136365 (0.00083) [2022-07-11 09:38:24,102][26022] Updated weights on worker 0-0, policy_version 1136375 (0.00094) [2022-07-11 09:38:25,864][25689] Fps is (10 sec: 5682.6, 60 sec: 5577.6, 300 sec: 5568.9). Total num frames: 1163657216. Throughput: 0: 5851.4. Samples: 1163654884. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:25,865][25689] Avg episode reward: [(0, '-0.917')] [2022-07-11 09:38:26,043][26022] Updated weights on worker 0-0, policy_version 1136385 (0.00126) [2022-07-11 09:38:27,758][26022] Updated weights on worker 0-0, policy_version 1136395 (0.00087) [2022-07-11 09:38:29,776][26022] Updated weights on worker 0-0, policy_version 1136405 (0.00086) [2022-07-11 09:38:30,940][25689] Fps is (10 sec: 5690.9, 60 sec: 5580.6, 300 sec: 5564.2). Total num frames: 1163685888. Throughput: 0: 5837.4. Samples: 1163688200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:30,940][25689] Avg episode reward: [(0, '-0.443')] [2022-07-11 09:38:31,510][26022] Updated weights on worker 0-0, policy_version 1136415 (0.00089) [2022-07-11 09:38:33,410][26022] Updated weights on worker 0-0, policy_version 1136425 (0.00098) [2022-07-11 09:38:35,391][26022] Updated weights on worker 0-0, policy_version 1136435 (0.00090) [2022-07-11 09:38:36,003][25689] Fps is (10 sec: 5555.2, 60 sec: 5560.0, 300 sec: 5566.8). Total num frames: 1163713536. Throughput: 0: 5808.7. Samples: 1163721714. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:36,003][25689] Avg episode reward: [(0, '0.345')] [2022-07-11 09:38:37,015][26022] Updated weights on worker 0-0, policy_version 1136445 (0.00085) [2022-07-11 09:38:38,907][26022] Updated weights on worker 0-0, policy_version 1136455 (0.00082) [2022-07-11 09:38:40,801][26022] Updated weights on worker 0-0, policy_version 1136465 (0.00084) [2022-07-11 09:38:41,099][25689] Fps is (10 sec: 5442.6, 60 sec: 5534.7, 300 sec: 5558.2). Total num frames: 1163741184. Throughput: 0: 4963.0. Samples: 1163738358. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:41,100][25689] Avg episode reward: [(0, '0.260')] [2022-07-11 09:38:42,518][26022] Updated weights on worker 0-0, policy_version 1136475 (0.00084) [2022-07-11 09:38:44,623][26022] Updated weights on worker 0-0, policy_version 1136485 (0.00099) [2022-07-11 09:38:46,102][25689] Fps is (10 sec: 5677.8, 60 sec: 5587.6, 300 sec: 5566.1). Total num frames: 1163770880. Throughput: 0: 5788.2. Samples: 1163772024. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:46,103][25689] Avg episode reward: [(0, '1.614')] [2022-07-11 09:38:46,104][26022] Updated weights on worker 0-0, policy_version 1136495 (0.00082) [2022-07-11 09:38:48,168][26022] Updated weights on worker 0-0, policy_version 1136505 (0.00084) [2022-07-11 09:38:49,876][26022] Updated weights on worker 0-0, policy_version 1136515 (0.00075) [2022-07-11 09:38:51,209][25689] Fps is (10 sec: 5570.9, 60 sec: 5531.7, 300 sec: 5564.6). Total num frames: 1163797504. Throughput: 0: 5798.2. Samples: 1163805728. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:51,210][25689] Avg episode reward: [(0, '1.952')] [2022-07-11 09:38:51,812][26022] Updated weights on worker 0-0, policy_version 1136525 (0.00080) [2022-07-11 09:38:53,474][26022] Updated weights on worker 0-0, policy_version 1136535 (0.00082) [2022-07-11 09:38:55,530][26022] Updated weights on worker 0-0, policy_version 1136545 (0.00086) [2022-07-11 09:38:56,236][25689] Fps is (10 sec: 5456.8, 60 sec: 5588.6, 300 sec: 5564.4). Total num frames: 1163826176. Throughput: 0: 4970.3. Samples: 1163822278. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:38:56,236][25689] Avg episode reward: [(0, '1.169')] [2022-07-11 09:38:57,218][26022] Updated weights on worker 0-0, policy_version 1136555 (0.00087) [2022-07-11 09:38:59,156][26022] Updated weights on worker 0-0, policy_version 1136565 (0.00093) [2022-07-11 09:39:00,960][26022] Updated weights on worker 0-0, policy_version 1136575 (0.00087) [2022-07-11 09:39:01,244][25689] Fps is (10 sec: 5612.4, 60 sec: 5541.4, 300 sec: 5574.6). Total num frames: 1163853824. Throughput: 0: 5841.4. Samples: 1163856032. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:01,245][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 09:39:03,192][26022] Updated weights on worker 0-0, policy_version 1136585 (0.00086) [2022-07-11 09:39:04,775][26022] Updated weights on worker 0-0, policy_version 1136595 (0.00090) [2022-07-11 09:39:06,304][25689] Fps is (10 sec: 5390.3, 60 sec: 5536.3, 300 sec: 5568.0). Total num frames: 1163880448. Throughput: 0: 5721.4. Samples: 1163887606. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:06,305][25689] Avg episode reward: [(0, '0.329')] [2022-07-11 09:39:06,752][26022] Updated weights on worker 0-0, policy_version 1136605 (0.00089) [2022-07-11 09:39:08,390][26022] Updated weights on worker 0-0, policy_version 1136615 (0.00531) [2022-07-11 09:39:10,527][26022] Updated weights on worker 0-0, policy_version 1136625 (0.00085) [2022-07-11 09:39:11,396][25689] Fps is (10 sec: 5547.6, 60 sec: 5586.9, 300 sec: 5566.7). Total num frames: 1163910144. Throughput: 0: 4887.3. Samples: 1163904388. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:11,397][25689] Avg episode reward: [(0, '0.232')] [2022-07-11 09:39:12,231][26022] Updated weights on worker 0-0, policy_version 1136635 (0.00083) [2022-07-11 09:39:14,070][26022] Updated weights on worker 0-0, policy_version 1136645 (0.00096) [2022-07-11 09:39:15,909][26022] Updated weights on worker 0-0, policy_version 1136655 (0.00086) [2022-07-11 09:39:16,420][25689] Fps is (10 sec: 5567.4, 60 sec: 5555.0, 300 sec: 5563.0). Total num frames: 1163936768. Throughput: 0: 5739.4. Samples: 1163938126. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:16,421][25689] Avg episode reward: [(0, '0.097')] [2022-07-11 09:39:17,645][26022] Updated weights on worker 0-0, policy_version 1136665 (0.00089) [2022-07-11 09:39:19,680][26022] Updated weights on worker 0-0, policy_version 1136675 (0.00082) [2022-07-11 09:39:21,398][26022] Updated weights on worker 0-0, policy_version 1136685 (0.00085) [2022-07-11 09:39:21,426][25689] Fps is (10 sec: 5512.9, 60 sec: 5562.7, 300 sec: 5570.1). Total num frames: 1163965440. Throughput: 0: 5729.7. Samples: 1163971670. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:21,427][25689] Avg episode reward: [(0, '-0.532')] [2022-07-11 09:39:23,140][26022] Updated weights on worker 0-0, policy_version 1136695 (0.00089) [2022-07-11 09:39:25,109][26022] Updated weights on worker 0-0, policy_version 1136705 (0.00088) [2022-07-11 09:39:26,449][25689] Fps is (10 sec: 5615.6, 60 sec: 5543.8, 300 sec: 5564.4). Total num frames: 1163993088. Throughput: 0: 5021.2. Samples: 1163988760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:26,450][25689] Avg episode reward: [(0, '1.223')] [2022-07-11 09:39:26,797][26022] Updated weights on worker 0-0, policy_version 1136715 (0.00089) [2022-07-11 09:39:28,696][26022] Updated weights on worker 0-0, policy_version 1136725 (0.00081) [2022-07-11 09:39:30,384][26022] Updated weights on worker 0-0, policy_version 1136735 (0.00084) [2022-07-11 09:39:31,579][25689] Fps is (10 sec: 5446.5, 60 sec: 5521.9, 300 sec: 5558.6). Total num frames: 1164020736. Throughput: 0: 5843.0. Samples: 1164022316. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:31,579][25689] Avg episode reward: [(0, '1.312')] [2022-07-11 09:39:32,289][26022] Updated weights on worker 0-0, policy_version 1136745 (0.00086) [2022-07-11 09:39:34,049][26022] Updated weights on worker 0-0, policy_version 1136755 (0.00089) [2022-07-11 09:39:36,081][26022] Updated weights on worker 0-0, policy_version 1136765 (0.00093) [2022-07-11 09:39:36,644][25689] Fps is (10 sec: 5725.3, 60 sec: 5572.4, 300 sec: 5567.9). Total num frames: 1164051456. Throughput: 0: 5836.5. Samples: 1164056164. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:36,645][25689] Avg episode reward: [(0, '1.382')] [2022-07-11 09:39:37,747][26022] Updated weights on worker 0-0, policy_version 1136775 (0.00087) [2022-07-11 09:39:39,618][26022] Updated weights on worker 0-0, policy_version 1136785 (0.00085) [2022-07-11 09:39:41,390][26022] Updated weights on worker 0-0, policy_version 1136795 (0.00087) [2022-07-11 09:39:41,649][25689] Fps is (10 sec: 5694.3, 60 sec: 5563.9, 300 sec: 5568.3). Total num frames: 1164078080. Throughput: 0: 5018.1. Samples: 1164073152. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:41,650][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 09:39:43,307][26022] Updated weights on worker 0-0, policy_version 1136805 (0.00092) [2022-07-11 09:39:45,174][26022] Updated weights on worker 0-0, policy_version 1136815 (0.00084) [2022-07-11 09:39:46,688][25689] Fps is (10 sec: 5505.8, 60 sec: 5543.8, 300 sec: 5566.1). Total num frames: 1164106752. Throughput: 0: 5839.8. Samples: 1164106946. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:46,688][25689] Avg episode reward: [(0, '-0.078')] [2022-07-11 09:39:46,947][26022] Updated weights on worker 0-0, policy_version 1136825 (0.00081) [2022-07-11 09:39:48,951][26022] Updated weights on worker 0-0, policy_version 1136835 (0.00106) [2022-07-11 09:39:50,582][26022] Updated weights on worker 0-0, policy_version 1136845 (0.00083) [2022-07-11 09:39:51,763][25689] Fps is (10 sec: 5568.9, 60 sec: 5563.6, 300 sec: 5565.0). Total num frames: 1164134400. Throughput: 0: 5837.0. Samples: 1164140128. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:51,763][25689] Avg episode reward: [(0, '-1.313')] [2022-07-11 09:39:52,516][26022] Updated weights on worker 0-0, policy_version 1136855 (0.00087) [2022-07-11 09:39:54,226][26022] Updated weights on worker 0-0, policy_version 1136865 (0.00084) [2022-07-11 09:39:54,480][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:39:54,502][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001136866_1164150784.pth [2022-07-11 09:39:54,503][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001134907_1162144768.pth [2022-07-11 09:39:56,128][26022] Updated weights on worker 0-0, policy_version 1136875 (0.00094) [2022-07-11 09:39:56,814][25689] Fps is (10 sec: 5662.6, 60 sec: 5578.2, 300 sec: 5568.5). Total num frames: 1164164096. Throughput: 0: 5006.9. Samples: 1164157154. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:39:56,815][25689] Avg episode reward: [(0, '-2.111')] [2022-07-11 09:39:57,974][26022] Updated weights on worker 0-0, policy_version 1136885 (0.00082) [2022-07-11 09:39:59,670][26022] Updated weights on worker 0-0, policy_version 1136895 (0.00094) [2022-07-11 09:40:01,532][26022] Updated weights on worker 0-0, policy_version 1136905 (0.00092) [2022-07-11 09:40:01,832][25689] Fps is (10 sec: 5694.9, 60 sec: 5577.4, 300 sec: 5575.4). Total num frames: 1164191744. Throughput: 0: 5833.9. Samples: 1164190894. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:01,832][25689] Avg episode reward: [(0, '-2.058')] [2022-07-11 09:40:03,955][26022] Updated weights on worker 0-0, policy_version 1136915 (0.00089) [2022-07-11 09:40:05,540][26022] Updated weights on worker 0-0, policy_version 1136925 (0.00088) [2022-07-11 09:40:06,851][25689] Fps is (10 sec: 5305.1, 60 sec: 5564.2, 300 sec: 5563.1). Total num frames: 1164217344. Throughput: 0: 5727.1. Samples: 1164222426. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:06,852][25689] Avg episode reward: [(0, '-2.815')] [2022-07-11 09:40:07,325][26022] Updated weights on worker 0-0, policy_version 1136935 (0.00085) [2022-07-11 09:40:09,212][26022] Updated weights on worker 0-0, policy_version 1136945 (0.00082) [2022-07-11 09:40:11,031][26022] Updated weights on worker 0-0, policy_version 1136955 (0.01367) [2022-07-11 09:40:11,937][25689] Fps is (10 sec: 5269.3, 60 sec: 5530.9, 300 sec: 5562.1). Total num frames: 1164244992. Throughput: 0: 4917.7. Samples: 1164239342. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:11,943][25689] Avg episode reward: [(0, '-2.669')] [2022-07-11 09:40:12,930][26022] Updated weights on worker 0-0, policy_version 1136965 (0.00088) [2022-07-11 09:40:15,016][26022] Updated weights on worker 0-0, policy_version 1136975 (0.00090) [2022-07-11 09:40:16,314][26022] Updated weights on worker 0-0, policy_version 1136985 (0.00087) [2022-07-11 09:40:16,981][25689] Fps is (10 sec: 5661.2, 60 sec: 5579.9, 300 sec: 5564.8). Total num frames: 1164274688. Throughput: 0: 5739.8. Samples: 1164272904. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:16,981][25689] Avg episode reward: [(0, '-2.334')] [2022-07-11 09:40:18,738][26022] Updated weights on worker 0-0, policy_version 1136995 (0.00085) [2022-07-11 09:40:20,165][26022] Updated weights on worker 0-0, policy_version 1137005 (0.00085) [2022-07-11 09:40:22,046][25689] Fps is (10 sec: 5571.5, 60 sec: 5540.7, 300 sec: 5556.9). Total num frames: 1164301312. Throughput: 0: 5711.2. Samples: 1164306340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:22,046][25689] Avg episode reward: [(0, '-0.869')] [2022-07-11 09:40:22,090][26022] Updated weights on worker 0-0, policy_version 1137015 (0.00098) [2022-07-11 09:40:23,783][26022] Updated weights on worker 0-0, policy_version 1137025 (0.00088) [2022-07-11 09:40:25,762][26022] Updated weights on worker 0-0, policy_version 1137035 (0.00229) [2022-07-11 09:40:27,096][25689] Fps is (10 sec: 5567.6, 60 sec: 5571.9, 300 sec: 5563.8). Total num frames: 1164331008. Throughput: 0: 5811.3. Samples: 1164340074. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:27,098][25689] Avg episode reward: [(0, '-0.308')] [2022-07-11 09:40:27,780][26022] Updated weights on worker 0-0, policy_version 1137045 (0.00090) [2022-07-11 09:40:29,355][26022] Updated weights on worker 0-0, policy_version 1137055 (0.00091) [2022-07-11 09:40:31,126][26022] Updated weights on worker 0-0, policy_version 1137065 (0.00096) [2022-07-11 09:40:32,209][25689] Fps is (10 sec: 5743.4, 60 sec: 5590.4, 300 sec: 5565.4). Total num frames: 1164359680. Throughput: 0: 5803.3. Samples: 1164356982. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 09:40:32,209][25689] Avg episode reward: [(0, '-0.538')] [2022-07-11 09:40:33,078][26022] Updated weights on worker 0-0, policy_version 1137075 (0.00087) [2022-07-11 09:40:34,748][26022] Updated weights on worker 0-0, policy_version 1137085 (0.00089) [2022-07-11 09:40:36,960][26022] Updated weights on worker 0-0, policy_version 1137095 (0.00092) [2022-07-11 09:40:37,219][25689] Fps is (10 sec: 5664.8, 60 sec: 5561.6, 300 sec: 5563.4). Total num frames: 1164388352. Throughput: 0: 5821.5. Samples: 1164390722. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:40:37,220][25689] Avg episode reward: [(0, '1.111')] [2022-07-11 09:40:38,575][26022] Updated weights on worker 0-0, policy_version 1137105 (0.00096) [2022-07-11 09:40:40,299][26022] Updated weights on worker 0-0, policy_version 1137115 (0.00084) [2022-07-11 09:40:42,278][25689] Fps is (10 sec: 5491.3, 60 sec: 5556.7, 300 sec: 5562.5). Total num frames: 1164414976. Throughput: 0: 5843.1. Samples: 1164424560. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:40:42,279][25689] Avg episode reward: [(0, '1.078')] [2022-07-11 09:40:42,298][26022] Updated weights on worker 0-0, policy_version 1137125 (0.00085) [2022-07-11 09:40:43,946][26022] Updated weights on worker 0-0, policy_version 1137135 (0.00089) [2022-07-11 09:40:45,881][26022] Updated weights on worker 0-0, policy_version 1137145 (0.00090) [2022-07-11 09:40:47,356][25689] Fps is (10 sec: 5555.8, 60 sec: 5569.9, 300 sec: 5566.3). Total num frames: 1164444672. Throughput: 0: 5006.6. Samples: 1164441508. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:40:47,357][25689] Avg episode reward: [(0, '1.434')] [2022-07-11 09:40:47,707][26022] Updated weights on worker 0-0, policy_version 1137155 (0.00083) [2022-07-11 09:40:49,501][26022] Updated weights on worker 0-0, policy_version 1137165 (0.00089) [2022-07-11 09:40:51,407][26022] Updated weights on worker 0-0, policy_version 1137175 (0.00089) [2022-07-11 09:40:52,457][25689] Fps is (10 sec: 5834.9, 60 sec: 5601.3, 300 sec: 5568.9). Total num frames: 1164474368. Throughput: 0: 5818.4. Samples: 1164474796. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:40:52,458][25689] Avg episode reward: [(0, '0.458')] [2022-07-11 09:40:53,291][26022] Updated weights on worker 0-0, policy_version 1137185 (0.00101) [2022-07-11 09:40:54,923][26022] Updated weights on worker 0-0, policy_version 1137195 (0.00087) [2022-07-11 09:40:56,991][26022] Updated weights on worker 0-0, policy_version 1137205 (0.00092) [2022-07-11 09:40:57,476][25689] Fps is (10 sec: 5565.3, 60 sec: 5553.7, 300 sec: 5565.3). Total num frames: 1164500992. Throughput: 0: 5793.2. Samples: 1164508074. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:40:57,477][25689] Avg episode reward: [(0, '0.602')] [2022-07-11 09:40:58,668][26022] Updated weights on worker 0-0, policy_version 1137215 (0.00088) [2022-07-11 09:41:00,486][26022] Updated weights on worker 0-0, policy_version 1137225 (0.00103) [2022-07-11 09:41:02,510][25689] Fps is (10 sec: 5296.6, 60 sec: 5535.3, 300 sec: 5565.1). Total num frames: 1164527616. Throughput: 0: 4973.4. Samples: 1164525182. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:02,511][25689] Avg episode reward: [(0, '-0.797')] [2022-07-11 09:41:02,850][26022] Updated weights on worker 0-0, policy_version 1137235 (0.00087) [2022-07-11 09:41:04,525][26022] Updated weights on worker 0-0, policy_version 1137245 (0.00090) [2022-07-11 09:41:06,520][26022] Updated weights on worker 0-0, policy_version 1137255 (0.00081) [2022-07-11 09:41:07,547][25689] Fps is (10 sec: 5491.0, 60 sec: 5584.4, 300 sec: 5569.7). Total num frames: 1164556288. Throughput: 0: 5714.1. Samples: 1164556876. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:07,547][25689] Avg episode reward: [(0, '-1.424')] [2022-07-11 09:41:08,254][26022] Updated weights on worker 0-0, policy_version 1137265 (0.00085) [2022-07-11 09:41:10,126][26022] Updated weights on worker 0-0, policy_version 1137275 (0.00089) [2022-07-11 09:41:11,931][26022] Updated weights on worker 0-0, policy_version 1137285 (0.00086) [2022-07-11 09:41:12,606][25689] Fps is (10 sec: 5578.8, 60 sec: 5586.8, 300 sec: 5562.0). Total num frames: 1164583936. Throughput: 0: 5743.9. Samples: 1164590528. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:12,606][25689] Avg episode reward: [(0, '-1.523')] [2022-07-11 09:41:13,650][26022] Updated weights on worker 0-0, policy_version 1137295 (0.00097) [2022-07-11 09:41:15,617][26022] Updated weights on worker 0-0, policy_version 1137305 (0.00875) [2022-07-11 09:41:17,250][26022] Updated weights on worker 0-0, policy_version 1137315 (0.00087) [2022-07-11 09:41:17,685][25689] Fps is (10 sec: 5555.1, 60 sec: 5566.7, 300 sec: 5567.6). Total num frames: 1164612608. Throughput: 0: 4919.0. Samples: 1164607482. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:17,685][25689] Avg episode reward: [(0, '-0.451')] [2022-07-11 09:41:19,191][26022] Updated weights on worker 0-0, policy_version 1137325 (0.00089) [2022-07-11 09:41:20,943][26022] Updated weights on worker 0-0, policy_version 1137335 (0.00092) [2022-07-11 09:41:22,714][25689] Fps is (10 sec: 5571.8, 60 sec: 5586.9, 300 sec: 5567.2). Total num frames: 1164640256. Throughput: 0: 5736.2. Samples: 1164641074. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:22,715][25689] Avg episode reward: [(0, '-0.571')] [2022-07-11 09:41:22,819][26022] Updated weights on worker 0-0, policy_version 1137345 (0.00084) [2022-07-11 09:41:24,552][26022] Updated weights on worker 0-0, policy_version 1137355 (0.00092) [2022-07-11 09:41:26,398][26022] Updated weights on worker 0-0, policy_version 1137365 (0.00098) [2022-07-11 09:41:27,730][25689] Fps is (10 sec: 5504.6, 60 sec: 5556.2, 300 sec: 5565.7). Total num frames: 1164667904. Throughput: 0: 5835.8. Samples: 1164674666. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:27,731][25689] Avg episode reward: [(0, '0.256')] [2022-07-11 09:41:28,363][26022] Updated weights on worker 0-0, policy_version 1137375 (0.00089) [2022-07-11 09:41:30,191][26022] Updated weights on worker 0-0, policy_version 1137385 (0.00081) [2022-07-11 09:41:31,902][26022] Updated weights on worker 0-0, policy_version 1137395 (0.00635) [2022-07-11 09:41:32,854][25689] Fps is (10 sec: 5554.3, 60 sec: 5555.2, 300 sec: 5560.1). Total num frames: 1164696576. Throughput: 0: 4986.5. Samples: 1164691494. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:32,855][25689] Avg episode reward: [(0, '-0.292')] [2022-07-11 09:41:33,789][26022] Updated weights on worker 0-0, policy_version 1137405 (0.01522) [2022-07-11 09:41:35,439][26022] Updated weights on worker 0-0, policy_version 1137415 (0.00089) [2022-07-11 09:41:37,597][26022] Updated weights on worker 0-0, policy_version 1137425 (0.00083) [2022-07-11 09:41:37,862][25689] Fps is (10 sec: 5659.7, 60 sec: 5555.4, 300 sec: 5567.0). Total num frames: 1164725248. Throughput: 0: 5823.0. Samples: 1164724974. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:37,863][25689] Avg episode reward: [(0, '-0.444')] [2022-07-11 09:41:39,399][26022] Updated weights on worker 0-0, policy_version 1137435 (0.00097) [2022-07-11 09:41:41,103][26022] Updated weights on worker 0-0, policy_version 1137445 (0.00082) [2022-07-11 09:41:42,902][25689] Fps is (10 sec: 5502.8, 60 sec: 5557.1, 300 sec: 5563.1). Total num frames: 1164751872. Throughput: 0: 5807.9. Samples: 1164758326. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:42,904][25689] Avg episode reward: [(0, '-0.478')] [2022-07-11 09:41:43,188][26022] Updated weights on worker 0-0, policy_version 1137455 (0.00093) [2022-07-11 09:41:44,682][26022] Updated weights on worker 0-0, policy_version 1137465 (0.00083) [2022-07-11 09:41:46,840][26022] Updated weights on worker 0-0, policy_version 1137475 (0.00086) [2022-07-11 09:41:47,923][25689] Fps is (10 sec: 5597.9, 60 sec: 5562.4, 300 sec: 5568.1). Total num frames: 1164781568. Throughput: 0: 4973.9. Samples: 1164775104. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:47,925][25689] Avg episode reward: [(0, '-0.315')] [2022-07-11 09:41:48,522][26022] Updated weights on worker 0-0, policy_version 1137485 (0.00088) [2022-07-11 09:41:50,299][26022] Updated weights on worker 0-0, policy_version 1137495 (0.00087) [2022-07-11 09:41:52,429][26022] Updated weights on worker 0-0, policy_version 1137505 (0.00090) [2022-07-11 09:41:53,008][25689] Fps is (10 sec: 5573.2, 60 sec: 5513.1, 300 sec: 5563.5). Total num frames: 1164808192. Throughput: 0: 5802.4. Samples: 1164808438. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:53,008][25689] Avg episode reward: [(0, '-1.547')] [2022-07-11 09:41:54,038][26022] Updated weights on worker 0-0, policy_version 1137515 (0.00087) [2022-07-11 09:41:54,659][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:41:54,676][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001137519_1164819456.pth [2022-07-11 09:41:54,677][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001135561_1162814464.pth [2022-07-11 09:41:56,040][26022] Updated weights on worker 0-0, policy_version 1137525 (0.00090) [2022-07-11 09:41:57,680][26022] Updated weights on worker 0-0, policy_version 1137535 (0.00081) [2022-07-11 09:41:58,039][25689] Fps is (10 sec: 5567.6, 60 sec: 5562.8, 300 sec: 5560.0). Total num frames: 1164837888. Throughput: 0: 5791.7. Samples: 1164841830. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:41:58,039][25689] Avg episode reward: [(0, '-0.543')] [2022-07-11 09:41:59,699][26022] Updated weights on worker 0-0, policy_version 1137545 (0.00089) [2022-07-11 09:42:01,271][26022] Updated weights on worker 0-0, policy_version 1137555 (0.00094) [2022-07-11 09:42:03,050][25689] Fps is (10 sec: 5404.4, 60 sec: 5531.0, 300 sec: 5564.7). Total num frames: 1164862464. Throughput: 0: 4984.7. Samples: 1164858756. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:03,051][25689] Avg episode reward: [(0, '-0.062')] [2022-07-11 09:42:03,599][26022] Updated weights on worker 0-0, policy_version 1137565 (0.00084) [2022-07-11 09:42:05,290][26022] Updated weights on worker 0-0, policy_version 1137575 (0.00096) [2022-07-11 09:42:07,352][26022] Updated weights on worker 0-0, policy_version 1137585 (0.00084) [2022-07-11 09:42:08,081][25689] Fps is (10 sec: 5404.4, 60 sec: 5548.4, 300 sec: 5561.8). Total num frames: 1164892160. Throughput: 0: 5727.1. Samples: 1164890550. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:08,082][25689] Avg episode reward: [(0, '-0.379')] [2022-07-11 09:42:09,013][26022] Updated weights on worker 0-0, policy_version 1137595 (0.00087) [2022-07-11 09:42:10,910][26022] Updated weights on worker 0-0, policy_version 1137605 (0.00088) [2022-07-11 09:42:12,783][26022] Updated weights on worker 0-0, policy_version 1137615 (0.00498) [2022-07-11 09:42:13,162][25689] Fps is (10 sec: 5772.4, 60 sec: 5563.4, 300 sec: 5568.1). Total num frames: 1164920832. Throughput: 0: 5756.5. Samples: 1164924454. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:13,162][25689] Avg episode reward: [(0, '0.010')] [2022-07-11 09:42:14,497][26022] Updated weights on worker 0-0, policy_version 1137625 (0.00084) [2022-07-11 09:42:16,324][26022] Updated weights on worker 0-0, policy_version 1137635 (0.00095) [2022-07-11 09:42:18,045][26022] Updated weights on worker 0-0, policy_version 1137645 (0.00085) [2022-07-11 09:42:18,191][25689] Fps is (10 sec: 5570.9, 60 sec: 5551.1, 300 sec: 5560.8). Total num frames: 1164948480. Throughput: 0: 4941.5. Samples: 1164941410. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:18,191][25689] Avg episode reward: [(0, '-0.148')] [2022-07-11 09:42:19,839][26022] Updated weights on worker 0-0, policy_version 1137655 (0.00092) [2022-07-11 09:42:21,740][26022] Updated weights on worker 0-0, policy_version 1137665 (0.00098) [2022-07-11 09:42:23,196][25689] Fps is (10 sec: 5612.5, 60 sec: 5570.1, 300 sec: 5564.8). Total num frames: 1164977152. Throughput: 0: 5785.0. Samples: 1164975302. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:23,197][25689] Avg episode reward: [(0, '0.874')] [2022-07-11 09:42:23,565][26022] Updated weights on worker 0-0, policy_version 1137675 (0.00079) [2022-07-11 09:42:25,335][26022] Updated weights on worker 0-0, policy_version 1137685 (0.00087) [2022-07-11 09:42:27,430][26022] Updated weights on worker 0-0, policy_version 1137695 (0.00089) [2022-07-11 09:42:28,227][25689] Fps is (10 sec: 5509.4, 60 sec: 5551.9, 300 sec: 5559.4). Total num frames: 1165003776. Throughput: 0: 5869.9. Samples: 1165008808. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:28,228][25689] Avg episode reward: [(0, '0.787')] [2022-07-11 09:42:29,075][26022] Updated weights on worker 0-0, policy_version 1137705 (0.00091) [2022-07-11 09:42:30,939][26022] Updated weights on worker 0-0, policy_version 1137715 (0.00090) [2022-07-11 09:42:32,603][26022] Updated weights on worker 0-0, policy_version 1137725 (0.00050) [2022-07-11 09:42:33,362][25689] Fps is (10 sec: 5540.0, 60 sec: 5567.7, 300 sec: 5560.7). Total num frames: 1165033472. Throughput: 0: 5849.7. Samples: 1165042622. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:33,363][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 09:42:34,534][26022] Updated weights on worker 0-0, policy_version 1137735 (0.00075) [2022-07-11 09:42:36,451][26022] Updated weights on worker 0-0, policy_version 1137745 (0.00087) [2022-07-11 09:42:38,191][26022] Updated weights on worker 0-0, policy_version 1137755 (0.00090) [2022-07-11 09:42:38,383][25689] Fps is (10 sec: 5848.1, 60 sec: 5583.5, 300 sec: 5563.9). Total num frames: 1165063168. Throughput: 0: 5847.2. Samples: 1165059480. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:38,384][25689] Avg episode reward: [(0, '0.989')] [2022-07-11 09:42:39,987][26022] Updated weights on worker 0-0, policy_version 1137765 (0.00086) [2022-07-11 09:42:41,788][26022] Updated weights on worker 0-0, policy_version 1137775 (0.00083) [2022-07-11 09:42:43,387][25689] Fps is (10 sec: 5618.5, 60 sec: 5586.9, 300 sec: 5564.3). Total num frames: 1165089792. Throughput: 0: 5850.3. Samples: 1165093420. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:43,388][25689] Avg episode reward: [(0, '0.431')] [2022-07-11 09:42:43,709][26022] Updated weights on worker 0-0, policy_version 1137785 (0.00089) [2022-07-11 09:42:45,672][26022] Updated weights on worker 0-0, policy_version 1137795 (0.00090) [2022-07-11 09:42:47,212][26022] Updated weights on worker 0-0, policy_version 1137805 (0.00082) [2022-07-11 09:42:48,400][25689] Fps is (10 sec: 5520.4, 60 sec: 5570.7, 300 sec: 5561.5). Total num frames: 1165118464. Throughput: 0: 5867.2. Samples: 1165127164. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:48,400][25689] Avg episode reward: [(0, '0.754')] [2022-07-11 09:42:49,199][26022] Updated weights on worker 0-0, policy_version 1137815 (0.00089) [2022-07-11 09:42:50,635][26022] Updated weights on worker 0-0, policy_version 1137825 (0.00090) [2022-07-11 09:42:52,796][26022] Updated weights on worker 0-0, policy_version 1137835 (0.00095) [2022-07-11 09:42:53,443][25689] Fps is (10 sec: 5702.1, 60 sec: 5608.4, 300 sec: 5572.8). Total num frames: 1165147136. Throughput: 0: 5060.8. Samples: 1165144246. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:53,445][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 09:42:54,606][26022] Updated weights on worker 0-0, policy_version 1137845 (0.00104) [2022-07-11 09:42:56,355][26022] Updated weights on worker 0-0, policy_version 1137855 (0.00092) [2022-07-11 09:42:58,296][26022] Updated weights on worker 0-0, policy_version 1137865 (0.00088) [2022-07-11 09:42:58,519][25689] Fps is (10 sec: 5565.7, 60 sec: 5570.4, 300 sec: 5562.0). Total num frames: 1165174784. Throughput: 0: 5857.8. Samples: 1165177430. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:42:58,520][25689] Avg episode reward: [(0, '1.111')] [2022-07-11 09:43:00,227][26022] Updated weights on worker 0-0, policy_version 1137875 (0.00094) [2022-07-11 09:43:02,072][26022] Updated weights on worker 0-0, policy_version 1137885 (0.00091) [2022-07-11 09:43:03,578][25689] Fps is (10 sec: 5253.9, 60 sec: 5582.9, 300 sec: 5557.5). Total num frames: 1165200384. Throughput: 0: 5729.7. Samples: 1165209112. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:03,579][25689] Avg episode reward: [(0, '0.239')] [2022-07-11 09:43:04,207][26022] Updated weights on worker 0-0, policy_version 1137895 (0.00098) [2022-07-11 09:43:05,945][26022] Updated weights on worker 0-0, policy_version 1137905 (0.00093) [2022-07-11 09:43:07,784][26022] Updated weights on worker 0-0, policy_version 1137915 (0.00085) [2022-07-11 09:43:08,655][25689] Fps is (10 sec: 5455.5, 60 sec: 5578.7, 300 sec: 5568.1). Total num frames: 1165230080. Throughput: 0: 4870.7. Samples: 1165225820. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:08,655][25689] Avg episode reward: [(0, '-0.225')] [2022-07-11 09:43:09,789][26022] Updated weights on worker 0-0, policy_version 1137925 (0.00098) [2022-07-11 09:43:11,496][26022] Updated weights on worker 0-0, policy_version 1137935 (0.00084) [2022-07-11 09:43:13,246][26022] Updated weights on worker 0-0, policy_version 1137945 (0.00089) [2022-07-11 09:43:13,725][25689] Fps is (10 sec: 5651.4, 60 sec: 5562.7, 300 sec: 5564.2). Total num frames: 1165257728. Throughput: 0: 5682.7. Samples: 1165259502. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:13,726][25689] Avg episode reward: [(0, '-0.278')] [2022-07-11 09:43:15,118][26022] Updated weights on worker 0-0, policy_version 1137955 (0.00091) [2022-07-11 09:43:16,956][26022] Updated weights on worker 0-0, policy_version 1137965 (0.00087) [2022-07-11 09:43:18,759][25689] Fps is (10 sec: 5472.6, 60 sec: 5562.3, 300 sec: 5561.8). Total num frames: 1165285376. Throughput: 0: 5729.5. Samples: 1165293394. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:18,759][25689] Avg episode reward: [(0, '-2.077')] [2022-07-11 09:43:18,771][26022] Updated weights on worker 0-0, policy_version 1137975 (0.00086) [2022-07-11 09:43:20,546][26022] Updated weights on worker 0-0, policy_version 1137985 (0.00086) [2022-07-11 09:43:22,297][26022] Updated weights on worker 0-0, policy_version 1137995 (0.00087) [2022-07-11 09:43:23,785][25689] Fps is (10 sec: 5598.2, 60 sec: 5560.3, 300 sec: 5561.3). Total num frames: 1165314048. Throughput: 0: 5008.7. Samples: 1165310324. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:23,786][25689] Avg episode reward: [(0, '-2.118')] [2022-07-11 09:43:24,214][26022] Updated weights on worker 0-0, policy_version 1138005 (0.00082) [2022-07-11 09:43:26,003][26022] Updated weights on worker 0-0, policy_version 1138015 (0.00077) [2022-07-11 09:43:27,775][26022] Updated weights on worker 0-0, policy_version 1138025 (0.00491) [2022-07-11 09:43:28,792][25689] Fps is (10 sec: 5715.2, 60 sec: 5596.4, 300 sec: 5562.6). Total num frames: 1165342720. Throughput: 0: 5865.8. Samples: 1165343944. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:28,794][25689] Avg episode reward: [(0, '-2.277')] [2022-07-11 09:43:29,719][26022] Updated weights on worker 0-0, policy_version 1138035 (0.00096) [2022-07-11 09:43:31,527][26022] Updated weights on worker 0-0, policy_version 1138045 (0.00093) [2022-07-11 09:43:33,362][26022] Updated weights on worker 0-0, policy_version 1138055 (0.00085) [2022-07-11 09:43:33,925][25689] Fps is (10 sec: 5554.6, 60 sec: 5562.8, 300 sec: 5561.3). Total num frames: 1165370368. Throughput: 0: 5853.0. Samples: 1165377730. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:33,927][25689] Avg episode reward: [(0, '-1.391')] [2022-07-11 09:43:35,227][26022] Updated weights on worker 0-0, policy_version 1138065 (0.00085) [2022-07-11 09:43:36,985][26022] Updated weights on worker 0-0, policy_version 1138075 (0.00087) [2022-07-11 09:43:38,724][26022] Updated weights on worker 0-0, policy_version 1138085 (0.00084) [2022-07-11 09:43:38,952][25689] Fps is (10 sec: 5644.2, 60 sec: 5562.2, 300 sec: 5569.5). Total num frames: 1165400064. Throughput: 0: 5009.9. Samples: 1165394560. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:38,953][25689] Avg episode reward: [(0, '-0.887')] [2022-07-11 09:43:40,662][26022] Updated weights on worker 0-0, policy_version 1138095 (0.00086) [2022-07-11 09:43:42,427][26022] Updated weights on worker 0-0, policy_version 1138105 (0.00088) [2022-07-11 09:43:43,963][25689] Fps is (10 sec: 5712.6, 60 sec: 5578.4, 300 sec: 5562.4). Total num frames: 1165427712. Throughput: 0: 5859.3. Samples: 1165428550. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:43,965][25689] Avg episode reward: [(0, '1.132')] [2022-07-11 09:43:44,119][26022] Updated weights on worker 0-0, policy_version 1138115 (0.00084) [2022-07-11 09:43:45,949][26022] Updated weights on worker 0-0, policy_version 1138125 (0.00096) [2022-07-11 09:43:47,729][26022] Updated weights on worker 0-0, policy_version 1138135 (0.00091) [2022-07-11 09:43:48,985][25689] Fps is (10 sec: 5511.7, 60 sec: 5560.7, 300 sec: 5567.5). Total num frames: 1165455360. Throughput: 0: 5856.1. Samples: 1165462192. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:48,987][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 09:43:49,607][26022] Updated weights on worker 0-0, policy_version 1138145 (0.00091) [2022-07-11 09:43:51,608][26022] Updated weights on worker 0-0, policy_version 1138155 (0.00092) [2022-07-11 09:43:53,503][26022] Updated weights on worker 0-0, policy_version 1138165 (0.00086) [2022-07-11 09:43:54,052][25689] Fps is (10 sec: 5582.0, 60 sec: 5558.5, 300 sec: 5566.7). Total num frames: 1165484032. Throughput: 0: 5009.6. Samples: 1165478562. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:54,054][25689] Avg episode reward: [(0, '0.909')] [2022-07-11 09:43:54,805][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:43:54,821][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001138172_1165488128.pth [2022-07-11 09:43:54,822][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001136214_1163483136.pth [2022-07-11 09:43:55,246][26022] Updated weights on worker 0-0, policy_version 1138175 (0.00089) [2022-07-11 09:43:57,231][26022] Updated weights on worker 0-0, policy_version 1138185 (0.00088) [2022-07-11 09:43:58,886][26022] Updated weights on worker 0-0, policy_version 1138195 (0.00083) [2022-07-11 09:43:59,065][25689] Fps is (10 sec: 5688.6, 60 sec: 5581.2, 300 sec: 5570.1). Total num frames: 1165512704. Throughput: 0: 5826.5. Samples: 1165511748. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:43:59,067][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 09:44:00,889][26022] Updated weights on worker 0-0, policy_version 1138205 (0.00082) [2022-07-11 09:44:02,856][26022] Updated weights on worker 0-0, policy_version 1138215 (0.00091) [2022-07-11 09:44:04,068][25689] Fps is (10 sec: 5214.2, 60 sec: 5552.5, 300 sec: 5560.8). Total num frames: 1165536256. Throughput: 0: 5712.6. Samples: 1165543402. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:04,070][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 09:44:04,855][26022] Updated weights on worker 0-0, policy_version 1138225 (0.00093) [2022-07-11 09:44:06,775][26022] Updated weights on worker 0-0, policy_version 1138235 (0.00074) [2022-07-11 09:44:08,532][26022] Updated weights on worker 0-0, policy_version 1138245 (0.00089) [2022-07-11 09:44:09,099][25689] Fps is (10 sec: 5306.8, 60 sec: 5556.7, 300 sec: 5561.9). Total num frames: 1165565952. Throughput: 0: 4871.3. Samples: 1165560176. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:09,101][25689] Avg episode reward: [(0, '-0.830')] [2022-07-11 09:44:10,392][26022] Updated weights on worker 0-0, policy_version 1138255 (0.00088) [2022-07-11 09:44:12,260][26022] Updated weights on worker 0-0, policy_version 1138265 (0.00089) [2022-07-11 09:44:13,976][26022] Updated weights on worker 0-0, policy_version 1138275 (0.00089) [2022-07-11 09:44:14,145][25689] Fps is (10 sec: 5792.4, 60 sec: 5575.9, 300 sec: 5568.4). Total num frames: 1165594624. Throughput: 0: 5724.4. Samples: 1165593578. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:14,145][25689] Avg episode reward: [(0, '-0.755')] [2022-07-11 09:44:16,088][26022] Updated weights on worker 0-0, policy_version 1138285 (0.00078) [2022-07-11 09:44:17,607][26022] Updated weights on worker 0-0, policy_version 1138295 (0.00089) [2022-07-11 09:44:19,166][25689] Fps is (10 sec: 5492.5, 60 sec: 5560.1, 300 sec: 5561.3). Total num frames: 1165621248. Throughput: 0: 5749.4. Samples: 1165627320. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:19,167][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 09:44:19,497][26022] Updated weights on worker 0-0, policy_version 1138305 (0.00083) [2022-07-11 09:44:21,223][26022] Updated weights on worker 0-0, policy_version 1138315 (0.00088) [2022-07-11 09:44:23,075][26022] Updated weights on worker 0-0, policy_version 1138325 (0.00087) [2022-07-11 09:44:24,191][25689] Fps is (10 sec: 5504.0, 60 sec: 5560.3, 300 sec: 5564.7). Total num frames: 1165649920. Throughput: 0: 5017.3. Samples: 1165644366. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:24,192][25689] Avg episode reward: [(0, '-0.596')] [2022-07-11 09:44:25,168][26022] Updated weights on worker 0-0, policy_version 1138335 (0.00090) [2022-07-11 09:44:26,850][26022] Updated weights on worker 0-0, policy_version 1138345 (0.00088) [2022-07-11 09:44:28,612][26022] Updated weights on worker 0-0, policy_version 1138355 (0.00077) [2022-07-11 09:44:29,203][25689] Fps is (10 sec: 5713.5, 60 sec: 5559.8, 300 sec: 5570.3). Total num frames: 1165678592. Throughput: 0: 5853.8. Samples: 1165677862. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:29,204][25689] Avg episode reward: [(0, '-0.574')] [2022-07-11 09:44:30,487][26022] Updated weights on worker 0-0, policy_version 1138365 (0.00084) [2022-07-11 09:44:32,343][26022] Updated weights on worker 0-0, policy_version 1138375 (0.00083) [2022-07-11 09:44:34,146][26022] Updated weights on worker 0-0, policy_version 1138385 (0.00080) [2022-07-11 09:44:34,295][25689] Fps is (10 sec: 5574.2, 60 sec: 5563.5, 300 sec: 5559.5). Total num frames: 1165706240. Throughput: 0: 5842.4. Samples: 1165711304. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2022-07-11 09:44:34,295][25689] Avg episode reward: [(0, '-1.422')] [2022-07-11 09:44:35,846][26022] Updated weights on worker 0-0, policy_version 1138395 (0.00088) [2022-07-11 09:44:37,947][26022] Updated weights on worker 0-0, policy_version 1138405 (0.00093) [2022-07-11 09:44:39,299][25689] Fps is (10 sec: 5680.2, 60 sec: 5565.7, 300 sec: 5569.8). Total num frames: 1165735936. Throughput: 0: 5008.8. Samples: 1165728158. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:44:39,299][25689] Avg episode reward: [(0, '-0.441')] [2022-07-11 09:44:39,691][26022] Updated weights on worker 0-0, policy_version 1138415 (0.00088) [2022-07-11 09:44:41,523][26022] Updated weights on worker 0-0, policy_version 1138425 (0.00084) [2022-07-11 09:44:43,356][26022] Updated weights on worker 0-0, policy_version 1138435 (0.00087) [2022-07-11 09:44:44,336][25689] Fps is (10 sec: 5711.1, 60 sec: 5563.3, 300 sec: 5566.4). Total num frames: 1165763584. Throughput: 0: 5832.3. Samples: 1165761856. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:44:44,336][25689] Avg episode reward: [(0, '-0.399')] [2022-07-11 09:44:45,094][26022] Updated weights on worker 0-0, policy_version 1138445 (0.00086) [2022-07-11 09:44:46,863][26022] Updated weights on worker 0-0, policy_version 1138455 (0.00096) [2022-07-11 09:44:48,721][26022] Updated weights on worker 0-0, policy_version 1138465 (0.00084) [2022-07-11 09:44:49,340][25689] Fps is (10 sec: 5506.8, 60 sec: 5564.9, 300 sec: 5567.7). Total num frames: 1165791232. Throughput: 0: 5850.0. Samples: 1165795662. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:44:49,341][25689] Avg episode reward: [(0, '-1.160')] [2022-07-11 09:44:50,543][26022] Updated weights on worker 0-0, policy_version 1138475 (0.00090) [2022-07-11 09:44:52,266][26022] Updated weights on worker 0-0, policy_version 1138485 (0.00093) [2022-07-11 09:44:54,181][26022] Updated weights on worker 0-0, policy_version 1138495 (0.00088) [2022-07-11 09:44:54,434][25689] Fps is (10 sec: 5475.8, 60 sec: 5545.5, 300 sec: 5560.1). Total num frames: 1165818880. Throughput: 0: 5871.8. Samples: 1165829556. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:44:54,434][25689] Avg episode reward: [(0, '-0.643')] [2022-07-11 09:44:55,960][26022] Updated weights on worker 0-0, policy_version 1138505 (0.00083) [2022-07-11 09:44:58,005][26022] Updated weights on worker 0-0, policy_version 1138515 (0.00090) [2022-07-11 09:44:59,470][25689] Fps is (10 sec: 5660.9, 60 sec: 5560.3, 300 sec: 5566.6). Total num frames: 1165848576. Throughput: 0: 5841.6. Samples: 1165845990. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:44:59,470][25689] Avg episode reward: [(0, '0.467')] [2022-07-11 09:44:59,634][26022] Updated weights on worker 0-0, policy_version 1138525 (0.00092) [2022-07-11 09:45:01,616][26022] Updated weights on worker 0-0, policy_version 1138535 (0.00086) [2022-07-11 09:45:03,913][26022] Updated weights on worker 0-0, policy_version 1138545 (0.00087) [2022-07-11 09:45:04,499][25689] Fps is (10 sec: 5290.4, 60 sec: 5557.9, 300 sec: 5559.5). Total num frames: 1165872128. Throughput: 0: 5737.4. Samples: 1165877540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:04,500][25689] Avg episode reward: [(0, '0.569')] [2022-07-11 09:45:05,420][26022] Updated weights on worker 0-0, policy_version 1138555 (0.00078) [2022-07-11 09:45:07,478][26022] Updated weights on worker 0-0, policy_version 1138565 (0.00090) [2022-07-11 09:45:09,080][26022] Updated weights on worker 0-0, policy_version 1138575 (0.00087) [2022-07-11 09:45:09,552][25689] Fps is (10 sec: 5281.5, 60 sec: 5555.9, 300 sec: 5567.0). Total num frames: 1165901824. Throughput: 0: 5722.8. Samples: 1165911330. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:09,553][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 09:45:11,090][26022] Updated weights on worker 0-0, policy_version 1138585 (0.00097) [2022-07-11 09:45:12,955][26022] Updated weights on worker 0-0, policy_version 1138595 (0.00090) [2022-07-11 09:45:14,511][26022] Updated weights on worker 0-0, policy_version 1138605 (0.00090) [2022-07-11 09:45:14,592][25689] Fps is (10 sec: 5884.7, 60 sec: 5573.4, 300 sec: 5567.1). Total num frames: 1165931520. Throughput: 0: 4903.6. Samples: 1165928402. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:14,594][25689] Avg episode reward: [(0, '-0.295')] [2022-07-11 09:45:16,434][26022] Updated weights on worker 0-0, policy_version 1138615 (0.00095) [2022-07-11 09:45:18,211][26022] Updated weights on worker 0-0, policy_version 1138625 (0.00085) [2022-07-11 09:45:19,607][25689] Fps is (10 sec: 5703.2, 60 sec: 5591.0, 300 sec: 5571.5). Total num frames: 1165959168. Throughput: 0: 5778.4. Samples: 1165962348. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:19,607][25689] Avg episode reward: [(0, '-0.181')] [2022-07-11 09:45:20,049][26022] Updated weights on worker 0-0, policy_version 1138635 (0.00083) [2022-07-11 09:45:21,911][26022] Updated weights on worker 0-0, policy_version 1138645 (0.00086) [2022-07-11 09:45:23,785][26022] Updated weights on worker 0-0, policy_version 1138655 (0.00094) [2022-07-11 09:45:24,610][25689] Fps is (10 sec: 5519.7, 60 sec: 5576.1, 300 sec: 5565.5). Total num frames: 1165986816. Throughput: 0: 5899.2. Samples: 1165996176. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:24,610][25689] Avg episode reward: [(0, '1.212')] [2022-07-11 09:45:25,526][26022] Updated weights on worker 0-0, policy_version 1138665 (0.00083) [2022-07-11 09:45:27,570][26022] Updated weights on worker 0-0, policy_version 1138675 (0.00086) [2022-07-11 09:45:29,048][26022] Updated weights on worker 0-0, policy_version 1138685 (0.00081) [2022-07-11 09:45:29,613][25689] Fps is (10 sec: 5628.4, 60 sec: 5576.8, 300 sec: 5567.5). Total num frames: 1166015488. Throughput: 0: 5065.6. Samples: 1166012952. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:29,614][25689] Avg episode reward: [(0, '1.032')] [2022-07-11 09:45:31,259][26022] Updated weights on worker 0-0, policy_version 1138695 (0.00090) [2022-07-11 09:45:32,840][26022] Updated weights on worker 0-0, policy_version 1138705 (0.00084) [2022-07-11 09:45:34,671][25689] Fps is (10 sec: 5597.5, 60 sec: 5579.9, 300 sec: 5563.2). Total num frames: 1166043136. Throughput: 0: 5881.6. Samples: 1166046502. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:34,672][25689] Avg episode reward: [(0, '1.178')] [2022-07-11 09:45:34,772][26022] Updated weights on worker 0-0, policy_version 1138715 (0.00099) [2022-07-11 09:45:36,855][26022] Updated weights on worker 0-0, policy_version 1138725 (0.00087) [2022-07-11 09:45:38,367][26022] Updated weights on worker 0-0, policy_version 1138735 (0.00094) [2022-07-11 09:45:39,701][25689] Fps is (10 sec: 5481.6, 60 sec: 5543.7, 300 sec: 5567.2). Total num frames: 1166070784. Throughput: 0: 5839.9. Samples: 1166079696. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:39,701][25689] Avg episode reward: [(0, '1.584')] [2022-07-11 09:45:40,496][26022] Updated weights on worker 0-0, policy_version 1138745 (0.00094) [2022-07-11 09:45:42,131][26022] Updated weights on worker 0-0, policy_version 1138755 (0.00087) [2022-07-11 09:45:44,126][26022] Updated weights on worker 0-0, policy_version 1138765 (0.00088) [2022-07-11 09:45:44,706][25689] Fps is (10 sec: 5714.7, 60 sec: 5580.5, 300 sec: 5568.6). Total num frames: 1166100480. Throughput: 0: 4993.7. Samples: 1166096530. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:44,706][25689] Avg episode reward: [(0, '1.182')] [2022-07-11 09:45:45,762][26022] Updated weights on worker 0-0, policy_version 1138775 (0.00087) [2022-07-11 09:45:47,550][26022] Updated weights on worker 0-0, policy_version 1138785 (0.00090) [2022-07-11 09:45:49,451][26022] Updated weights on worker 0-0, policy_version 1138795 (0.00091) [2022-07-11 09:45:49,721][25689] Fps is (10 sec: 5620.4, 60 sec: 5562.5, 300 sec: 5559.8). Total num frames: 1166127104. Throughput: 0: 5822.8. Samples: 1166130038. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:49,722][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 09:45:51,402][26022] Updated weights on worker 0-0, policy_version 1138805 (0.00086) [2022-07-11 09:45:53,045][26022] Updated weights on worker 0-0, policy_version 1138815 (0.00104) [2022-07-11 09:45:54,801][25689] Fps is (10 sec: 5477.5, 60 sec: 5580.8, 300 sec: 5565.6). Total num frames: 1166155776. Throughput: 0: 5846.5. Samples: 1166164190. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:54,801][25689] Avg episode reward: [(0, '-0.404')] [2022-07-11 09:45:54,987][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:45:54,995][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001138825_1166156800.pth [2022-07-11 09:45:54,996][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001136866_1164150784.pth [2022-07-11 09:45:54,998][26022] Updated weights on worker 0-0, policy_version 1138825 (0.00106) [2022-07-11 09:45:56,517][26022] Updated weights on worker 0-0, policy_version 1138835 (0.00085) [2022-07-11 09:45:58,744][26022] Updated weights on worker 0-0, policy_version 1138845 (0.00089) [2022-07-11 09:45:59,809][25689] Fps is (10 sec: 5684.7, 60 sec: 5566.4, 300 sec: 5573.0). Total num frames: 1166184448. Throughput: 0: 5033.2. Samples: 1166180904. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:45:59,810][25689] Avg episode reward: [(0, '-0.456')] [2022-07-11 09:46:00,451][26022] Updated weights on worker 0-0, policy_version 1138855 (0.00083) [2022-07-11 09:46:02,679][26022] Updated weights on worker 0-0, policy_version 1138865 (0.00088) [2022-07-11 09:46:04,329][26022] Updated weights on worker 0-0, policy_version 1138875 (0.00090) [2022-07-11 09:46:04,827][25689] Fps is (10 sec: 5413.2, 60 sec: 5601.4, 300 sec: 5563.0). Total num frames: 1166210048. Throughput: 0: 5773.4. Samples: 1166212696. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:04,827][25689] Avg episode reward: [(0, '-0.651')] [2022-07-11 09:46:06,251][26022] Updated weights on worker 0-0, policy_version 1138885 (0.00085) [2022-07-11 09:46:08,167][26022] Updated weights on worker 0-0, policy_version 1138895 (0.00089) [2022-07-11 09:46:09,846][25689] Fps is (10 sec: 5407.3, 60 sec: 5587.6, 300 sec: 5567.2). Total num frames: 1166238720. Throughput: 0: 5791.7. Samples: 1166246592. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:09,846][25689] Avg episode reward: [(0, '-0.774')] [2022-07-11 09:46:10,107][26022] Updated weights on worker 0-0, policy_version 1138906 (0.00079) [2022-07-11 09:46:11,890][26022] Updated weights on worker 0-0, policy_version 1138916 (0.00092) [2022-07-11 09:46:13,504][26022] Updated weights on worker 0-0, policy_version 1138926 (0.00108) [2022-07-11 09:46:15,009][25689] Fps is (10 sec: 5732.0, 60 sec: 5576.1, 300 sec: 5569.0). Total num frames: 1166268416. Throughput: 0: 4923.1. Samples: 1166263680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:15,010][25689] Avg episode reward: [(0, '0.119')] [2022-07-11 09:46:15,475][26022] Updated weights on worker 0-0, policy_version 1138936 (0.00083) [2022-07-11 09:46:17,267][26022] Updated weights on worker 0-0, policy_version 1138946 (0.00087) [2022-07-11 09:46:18,983][26022] Updated weights on worker 0-0, policy_version 1138956 (0.00087) [2022-07-11 09:46:20,027][25689] Fps is (10 sec: 5632.1, 60 sec: 5575.8, 300 sec: 5569.2). Total num frames: 1166296064. Throughput: 0: 5779.3. Samples: 1166297752. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:20,028][25689] Avg episode reward: [(0, '-0.150')] [2022-07-11 09:46:20,956][26022] Updated weights on worker 0-0, policy_version 1138966 (0.00091) [2022-07-11 09:46:22,669][26022] Updated weights on worker 0-0, policy_version 1138976 (0.00084) [2022-07-11 09:46:24,535][26022] Updated weights on worker 0-0, policy_version 1138986 (0.00086) [2022-07-11 09:46:25,039][25689] Fps is (10 sec: 5513.2, 60 sec: 5575.0, 300 sec: 5569.3). Total num frames: 1166323712. Throughput: 0: 5886.3. Samples: 1166331674. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:25,040][25689] Avg episode reward: [(0, '-0.005')] [2022-07-11 09:46:26,241][26022] Updated weights on worker 0-0, policy_version 1138996 (0.00086) [2022-07-11 09:46:28,257][26022] Updated weights on worker 0-0, policy_version 1139006 (0.00086) [2022-07-11 09:46:30,009][26022] Updated weights on worker 0-0, policy_version 1139016 (0.00406) [2022-07-11 09:46:30,067][25689] Fps is (10 sec: 5609.9, 60 sec: 5572.8, 300 sec: 5571.1). Total num frames: 1166352384. Throughput: 0: 5035.0. Samples: 1166348404. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:30,067][25689] Avg episode reward: [(0, '-1.510')] [2022-07-11 09:46:32,002][26022] Updated weights on worker 0-0, policy_version 1139026 (0.00134) [2022-07-11 09:46:33,662][26022] Updated weights on worker 0-0, policy_version 1139036 (0.00086) [2022-07-11 09:46:35,163][25689] Fps is (10 sec: 5664.5, 60 sec: 5586.2, 300 sec: 5569.5). Total num frames: 1166381056. Throughput: 0: 5875.4. Samples: 1166382088. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:35,164][25689] Avg episode reward: [(0, '-1.384')] [2022-07-11 09:46:35,490][26022] Updated weights on worker 0-0, policy_version 1139046 (0.00093) [2022-07-11 09:46:37,315][26022] Updated weights on worker 0-0, policy_version 1139056 (0.00086) [2022-07-11 09:46:39,209][26022] Updated weights on worker 0-0, policy_version 1139066 (0.00086) [2022-07-11 09:46:40,206][25689] Fps is (10 sec: 5655.5, 60 sec: 5601.9, 300 sec: 5576.3). Total num frames: 1166409728. Throughput: 0: 5854.0. Samples: 1166415880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:40,207][25689] Avg episode reward: [(0, '-1.896')] [2022-07-11 09:46:40,906][26022] Updated weights on worker 0-0, policy_version 1139076 (0.00093) [2022-07-11 09:46:42,785][26022] Updated weights on worker 0-0, policy_version 1139086 (0.00061) [2022-07-11 09:46:44,465][26022] Updated weights on worker 0-0, policy_version 1139096 (0.00089) [2022-07-11 09:46:45,208][25689] Fps is (10 sec: 5504.7, 60 sec: 5551.4, 300 sec: 5566.3). Total num frames: 1166436352. Throughput: 0: 5011.7. Samples: 1166432754. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:45,208][25689] Avg episode reward: [(0, '-1.963')] [2022-07-11 09:46:46,587][26022] Updated weights on worker 0-0, policy_version 1139106 (0.00094) [2022-07-11 09:46:48,272][26022] Updated weights on worker 0-0, policy_version 1139116 (0.00092) [2022-07-11 09:46:49,971][26022] Updated weights on worker 0-0, policy_version 1139126 (0.00086) [2022-07-11 09:46:50,231][25689] Fps is (10 sec: 5618.3, 60 sec: 5601.5, 300 sec: 5577.8). Total num frames: 1166466048. Throughput: 0: 5851.7. Samples: 1166466396. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:50,232][25689] Avg episode reward: [(0, '-2.117')] [2022-07-11 09:46:52,083][26022] Updated weights on worker 0-0, policy_version 1139136 (0.00433) [2022-07-11 09:46:53,725][26022] Updated weights on worker 0-0, policy_version 1139146 (0.00091) [2022-07-11 09:46:55,288][25689] Fps is (10 sec: 5689.0, 60 sec: 5586.7, 300 sec: 5570.4). Total num frames: 1166493696. Throughput: 0: 5868.9. Samples: 1166500200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:46:55,288][25689] Avg episode reward: [(0, '-1.140')] [2022-07-11 09:46:55,535][26022] Updated weights on worker 0-0, policy_version 1139156 (0.00090) [2022-07-11 09:46:57,557][26022] Updated weights on worker 0-0, policy_version 1139166 (0.00093) [2022-07-11 09:46:59,394][26022] Updated weights on worker 0-0, policy_version 1139176 (0.00091) [2022-07-11 09:47:00,350][25689] Fps is (10 sec: 5362.9, 60 sec: 5547.8, 300 sec: 5576.4). Total num frames: 1166520320. Throughput: 0: 5002.7. Samples: 1166516656. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:00,351][25689] Avg episode reward: [(0, '-0.435')] [2022-07-11 09:47:01,120][26022] Updated weights on worker 0-0, policy_version 1139186 (0.00094) [2022-07-11 09:47:03,266][26022] Updated weights on worker 0-0, policy_version 1139196 (0.00097) [2022-07-11 09:47:05,079][26022] Updated weights on worker 0-0, policy_version 1139206 (0.00100) [2022-07-11 09:47:05,360][25689] Fps is (10 sec: 5286.5, 60 sec: 5565.4, 300 sec: 5566.4). Total num frames: 1166546944. Throughput: 0: 5716.8. Samples: 1166547960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:05,361][25689] Avg episode reward: [(0, '-0.228')] [2022-07-11 09:47:07,027][26022] Updated weights on worker 0-0, policy_version 1139216 (0.00088) [2022-07-11 09:47:08,943][26022] Updated weights on worker 0-0, policy_version 1139226 (0.00085) [2022-07-11 09:47:10,378][25689] Fps is (10 sec: 5616.7, 60 sec: 5582.5, 300 sec: 5571.1). Total num frames: 1166576640. Throughput: 0: 5727.7. Samples: 1166581794. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:10,378][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 09:47:10,401][26022] Updated weights on worker 0-0, policy_version 1139236 (0.00085) [2022-07-11 09:47:12,492][26022] Updated weights on worker 0-0, policy_version 1139246 (0.00096) [2022-07-11 09:47:14,127][26022] Updated weights on worker 0-0, policy_version 1139256 (0.00085) [2022-07-11 09:47:15,423][25689] Fps is (10 sec: 5495.0, 60 sec: 5525.6, 300 sec: 5563.9). Total num frames: 1166602240. Throughput: 0: 4885.3. Samples: 1166598568. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:15,424][25689] Avg episode reward: [(0, '0.809')] [2022-07-11 09:47:16,190][26022] Updated weights on worker 0-0, policy_version 1139266 (0.00089) [2022-07-11 09:47:18,059][26022] Updated weights on worker 0-0, policy_version 1139276 (0.00103) [2022-07-11 09:47:19,631][26022] Updated weights on worker 0-0, policy_version 1139286 (0.00080) [2022-07-11 09:47:20,442][25689] Fps is (10 sec: 5494.2, 60 sec: 5559.4, 300 sec: 5567.1). Total num frames: 1166631936. Throughput: 0: 5750.1. Samples: 1166632186. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:20,443][25689] Avg episode reward: [(0, '0.585')] [2022-07-11 09:47:21,701][26022] Updated weights on worker 0-0, policy_version 1139296 (0.00088) [2022-07-11 09:47:23,377][26022] Updated weights on worker 0-0, policy_version 1139306 (0.00084) [2022-07-11 09:47:25,243][26022] Updated weights on worker 0-0, policy_version 1139316 (0.00085) [2022-07-11 09:47:25,450][25689] Fps is (10 sec: 5923.5, 60 sec: 5593.7, 300 sec: 5577.8). Total num frames: 1166661632. Throughput: 0: 5876.7. Samples: 1166666020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:25,450][25689] Avg episode reward: [(0, '0.942')] [2022-07-11 09:47:27,062][26022] Updated weights on worker 0-0, policy_version 1139326 (0.00088) [2022-07-11 09:47:28,815][26022] Updated weights on worker 0-0, policy_version 1139336 (0.00091) [2022-07-11 09:47:30,513][25689] Fps is (10 sec: 5592.1, 60 sec: 5556.5, 300 sec: 5568.8). Total num frames: 1166688256. Throughput: 0: 5026.2. Samples: 1166683000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:30,514][25689] Avg episode reward: [(0, '0.110')] [2022-07-11 09:47:30,709][26022] Updated weights on worker 0-0, policy_version 1139346 (0.00088) [2022-07-11 09:47:32,647][26022] Updated weights on worker 0-0, policy_version 1139356 (0.00090) [2022-07-11 09:47:34,433][26022] Updated weights on worker 0-0, policy_version 1139366 (0.00086) [2022-07-11 09:47:35,551][25689] Fps is (10 sec: 5474.1, 60 sec: 5561.9, 300 sec: 5565.1). Total num frames: 1166716928. Throughput: 0: 5852.6. Samples: 1166716368. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:35,552][25689] Avg episode reward: [(0, '0.252')] [2022-07-11 09:47:36,225][26022] Updated weights on worker 0-0, policy_version 1139376 (0.00088) [2022-07-11 09:47:38,135][26022] Updated weights on worker 0-0, policy_version 1139386 (0.00082) [2022-07-11 09:47:39,860][26022] Updated weights on worker 0-0, policy_version 1139396 (0.00088) [2022-07-11 09:47:40,598][25689] Fps is (10 sec: 5584.6, 60 sec: 5544.6, 300 sec: 5567.7). Total num frames: 1166744576. Throughput: 0: 5856.1. Samples: 1166750222. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:40,599][25689] Avg episode reward: [(0, '-0.282')] [2022-07-11 09:47:41,853][26022] Updated weights on worker 0-0, policy_version 1139406 (0.00086) [2022-07-11 09:47:43,634][26022] Updated weights on worker 0-0, policy_version 1139416 (0.00091) [2022-07-11 09:47:45,306][26022] Updated weights on worker 0-0, policy_version 1139426 (0.00088) [2022-07-11 09:47:45,604][25689] Fps is (10 sec: 5704.1, 60 sec: 5595.1, 300 sec: 5571.3). Total num frames: 1166774272. Throughput: 0: 5017.8. Samples: 1166767152. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:45,606][25689] Avg episode reward: [(0, '-0.861')] [2022-07-11 09:47:47,221][26022] Updated weights on worker 0-0, policy_version 1139436 (0.00087) [2022-07-11 09:47:48,887][26022] Updated weights on worker 0-0, policy_version 1139446 (0.00085) [2022-07-11 09:47:50,622][25689] Fps is (10 sec: 5720.9, 60 sec: 5561.6, 300 sec: 5568.3). Total num frames: 1166801920. Throughput: 0: 5883.9. Samples: 1166801316. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:50,623][25689] Avg episode reward: [(0, '-0.564')] [2022-07-11 09:47:50,764][26022] Updated weights on worker 0-0, policy_version 1139456 (0.00085) [2022-07-11 09:47:52,595][26022] Updated weights on worker 0-0, policy_version 1139466 (0.00089) [2022-07-11 09:47:54,389][26022] Updated weights on worker 0-0, policy_version 1139476 (0.00088) [2022-07-11 09:47:55,027][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:47:55,038][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001139479_1166826496.pth [2022-07-11 09:47:55,051][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001137519_1164819456.pth [2022-07-11 09:47:55,719][25689] Fps is (10 sec: 5568.2, 60 sec: 5574.9, 300 sec: 5571.4). Total num frames: 1166830592. Throughput: 0: 5891.1. Samples: 1166835178. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:47:55,722][25689] Avg episode reward: [(0, '-0.109')] [2022-07-11 09:47:56,306][26022] Updated weights on worker 0-0, policy_version 1139486 (0.00087) [2022-07-11 09:47:58,019][26022] Updated weights on worker 0-0, policy_version 1139496 (0.00095) [2022-07-11 09:47:59,803][26022] Updated weights on worker 0-0, policy_version 1139506 (0.00086) [2022-07-11 09:48:00,760][25689] Fps is (10 sec: 5757.1, 60 sec: 5627.7, 300 sec: 5585.5). Total num frames: 1166860288. Throughput: 0: 5059.4. Samples: 1166852230. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:00,761][25689] Avg episode reward: [(0, '-0.352')] [2022-07-11 09:48:02,003][26022] Updated weights on worker 0-0, policy_version 1139516 (0.00083) [2022-07-11 09:48:03,859][26022] Updated weights on worker 0-0, policy_version 1139526 (0.00087) [2022-07-11 09:48:05,532][26022] Updated weights on worker 0-0, policy_version 1139536 (0.00085) [2022-07-11 09:48:05,817][25689] Fps is (10 sec: 5475.6, 60 sec: 5606.3, 300 sec: 5572.1). Total num frames: 1166885888. Throughput: 0: 5771.4. Samples: 1166883810. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:05,818][25689] Avg episode reward: [(0, '0.098')] [2022-07-11 09:48:07,491][26022] Updated weights on worker 0-0, policy_version 1139546 (0.00079) [2022-07-11 09:48:09,207][26022] Updated weights on worker 0-0, policy_version 1139556 (0.00084) [2022-07-11 09:48:10,830][25689] Fps is (10 sec: 5288.2, 60 sec: 5573.0, 300 sec: 5573.1). Total num frames: 1166913536. Throughput: 0: 5768.7. Samples: 1166917888. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:10,830][25689] Avg episode reward: [(0, '0.069')] [2022-07-11 09:48:11,081][26022] Updated weights on worker 0-0, policy_version 1139566 (0.00080) [2022-07-11 09:48:12,789][26022] Updated weights on worker 0-0, policy_version 1139576 (0.00086) [2022-07-11 09:48:14,652][26022] Updated weights on worker 0-0, policy_version 1139586 (0.00089) [2022-07-11 09:48:15,889][25689] Fps is (10 sec: 5693.7, 60 sec: 5639.4, 300 sec: 5579.6). Total num frames: 1166943232. Throughput: 0: 5769.5. Samples: 1166951550. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:15,890][25689] Avg episode reward: [(0, '1.039')] [2022-07-11 09:48:16,422][26022] Updated weights on worker 0-0, policy_version 1139596 (0.00082) [2022-07-11 09:48:18,296][26022] Updated weights on worker 0-0, policy_version 1139606 (0.00091) [2022-07-11 09:48:20,064][26022] Updated weights on worker 0-0, policy_version 1139616 (0.00091) [2022-07-11 09:48:20,910][25689] Fps is (10 sec: 5688.6, 60 sec: 5605.3, 300 sec: 5576.2). Total num frames: 1166970880. Throughput: 0: 5775.4. Samples: 1166968604. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:20,911][25689] Avg episode reward: [(0, '0.059')] [2022-07-11 09:48:22,154][26022] Updated weights on worker 0-0, policy_version 1139626 (0.00080) [2022-07-11 09:48:23,784][26022] Updated weights on worker 0-0, policy_version 1139636 (0.00089) [2022-07-11 09:48:25,641][26022] Updated weights on worker 0-0, policy_version 1139646 (0.00084) [2022-07-11 09:48:25,937][25689] Fps is (10 sec: 5503.3, 60 sec: 5569.7, 300 sec: 5572.4). Total num frames: 1166998528. Throughput: 0: 5898.5. Samples: 1167002484. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:25,939][25689] Avg episode reward: [(0, '-0.031')] [2022-07-11 09:48:27,476][26022] Updated weights on worker 0-0, policy_version 1139656 (0.00085) [2022-07-11 09:48:29,402][26022] Updated weights on worker 0-0, policy_version 1139666 (0.00074) [2022-07-11 09:48:30,995][25689] Fps is (10 sec: 5584.6, 60 sec: 5604.1, 300 sec: 5577.2). Total num frames: 1167027200. Throughput: 0: 5863.8. Samples: 1167036134. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:30,996][25689] Avg episode reward: [(0, '-1.412')] [2022-07-11 09:48:31,051][26022] Updated weights on worker 0-0, policy_version 1139676 (0.00174) [2022-07-11 09:48:32,939][26022] Updated weights on worker 0-0, policy_version 1139686 (0.00627) [2022-07-11 09:48:34,698][26022] Updated weights on worker 0-0, policy_version 1139696 (0.00082) [2022-07-11 09:48:36,080][25689] Fps is (10 sec: 5653.4, 60 sec: 5599.7, 300 sec: 5572.7). Total num frames: 1167055872. Throughput: 0: 5028.1. Samples: 1167053072. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2022-07-11 09:48:36,081][25689] Avg episode reward: [(0, '-0.936')] [2022-07-11 09:48:36,569][26022] Updated weights on worker 0-0, policy_version 1139706 (0.00081) [2022-07-11 09:48:38,266][26022] Updated weights on worker 0-0, policy_version 1139716 (0.00090) [2022-07-11 09:48:40,182][26022] Updated weights on worker 0-0, policy_version 1139726 (0.00090) [2022-07-11 09:48:41,096][25689] Fps is (10 sec: 5677.2, 60 sec: 5619.5, 300 sec: 5576.1). Total num frames: 1167084544. Throughput: 0: 5866.0. Samples: 1167087014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:48:41,098][25689] Avg episode reward: [(0, '-1.424')] [2022-07-11 09:48:41,992][26022] Updated weights on worker 0-0, policy_version 1139736 (0.00087) [2022-07-11 09:48:43,839][26022] Updated weights on worker 0-0, policy_version 1139746 (0.00047) [2022-07-11 09:48:45,569][26022] Updated weights on worker 0-0, policy_version 1139756 (0.00084) [2022-07-11 09:48:46,124][25689] Fps is (10 sec: 5709.1, 60 sec: 5600.5, 300 sec: 5579.4). Total num frames: 1167113216. Throughput: 0: 5861.0. Samples: 1167120802. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:48:46,126][25689] Avg episode reward: [(0, '-0.642')] [2022-07-11 09:48:47,371][26022] Updated weights on worker 0-0, policy_version 1139766 (0.00085) [2022-07-11 09:48:49,097][26022] Updated weights on worker 0-0, policy_version 1139776 (0.00096) [2022-07-11 09:48:51,050][26022] Updated weights on worker 0-0, policy_version 1139786 (0.00093) [2022-07-11 09:48:51,146][25689] Fps is (10 sec: 5604.2, 60 sec: 5600.2, 300 sec: 5576.8). Total num frames: 1167140864. Throughput: 0: 5045.8. Samples: 1167137808. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:48:51,146][25689] Avg episode reward: [(0, '-0.866')] [2022-07-11 09:48:52,779][26022] Updated weights on worker 0-0, policy_version 1139796 (0.00082) [2022-07-11 09:48:54,771][26022] Updated weights on worker 0-0, policy_version 1139806 (0.00090) [2022-07-11 09:48:56,228][25689] Fps is (10 sec: 5675.6, 60 sec: 5618.5, 300 sec: 5578.9). Total num frames: 1167170560. Throughput: 0: 5882.7. Samples: 1167171596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:48:56,229][25689] Avg episode reward: [(0, '0.261')] [2022-07-11 09:48:56,331][26022] Updated weights on worker 0-0, policy_version 1139816 (0.00085) [2022-07-11 09:48:58,399][26022] Updated weights on worker 0-0, policy_version 1139826 (0.00085) [2022-07-11 09:49:00,096][26022] Updated weights on worker 0-0, policy_version 1139836 (0.00088) [2022-07-11 09:49:01,273][25689] Fps is (10 sec: 5561.1, 60 sec: 5567.4, 300 sec: 5588.5). Total num frames: 1167197184. Throughput: 0: 5868.9. Samples: 1167205432. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:01,275][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 09:49:02,295][26022] Updated weights on worker 0-0, policy_version 1139846 (0.00091) [2022-07-11 09:49:04,291][26022] Updated weights on worker 0-0, policy_version 1139856 (0.00091) [2022-07-11 09:49:06,045][26022] Updated weights on worker 0-0, policy_version 1139866 (0.00093) [2022-07-11 09:49:06,283][25689] Fps is (10 sec: 5295.7, 60 sec: 5588.6, 300 sec: 5578.6). Total num frames: 1167223808. Throughput: 0: 4928.1. Samples: 1167220146. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:06,284][25689] Avg episode reward: [(0, '1.380')] [2022-07-11 09:49:07,839][26022] Updated weights on worker 0-0, policy_version 1139876 (0.00088) [2022-07-11 09:49:09,803][26022] Updated weights on worker 0-0, policy_version 1139886 (0.00087) [2022-07-11 09:49:11,334][25689] Fps is (10 sec: 5394.1, 60 sec: 5585.0, 300 sec: 5575.0). Total num frames: 1167251456. Throughput: 0: 5748.6. Samples: 1167253866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:11,336][25689] Avg episode reward: [(0, '0.891')] [2022-07-11 09:49:11,485][26022] Updated weights on worker 0-0, policy_version 1139896 (0.00085) [2022-07-11 09:49:13,486][26022] Updated weights on worker 0-0, policy_version 1139906 (0.00067) [2022-07-11 09:49:15,215][26022] Updated weights on worker 0-0, policy_version 1139916 (0.00086) [2022-07-11 09:49:16,441][25689] Fps is (10 sec: 5544.4, 60 sec: 5563.7, 300 sec: 5580.3). Total num frames: 1167280128. Throughput: 0: 5719.7. Samples: 1167287208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:16,442][25689] Avg episode reward: [(0, '0.179')] [2022-07-11 09:49:16,931][26022] Updated weights on worker 0-0, policy_version 1139926 (0.00085) [2022-07-11 09:49:18,856][26022] Updated weights on worker 0-0, policy_version 1139936 (0.00092) [2022-07-11 09:49:20,559][26022] Updated weights on worker 0-0, policy_version 1139946 (0.00085) [2022-07-11 09:49:21,515][25689] Fps is (10 sec: 5532.2, 60 sec: 5558.9, 300 sec: 5575.9). Total num frames: 1167307776. Throughput: 0: 4874.8. Samples: 1167304106. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:21,515][25689] Avg episode reward: [(0, '0.583')] [2022-07-11 09:49:22,322][26022] Updated weights on worker 0-0, policy_version 1139956 (0.00090) [2022-07-11 09:49:24,358][26022] Updated weights on worker 0-0, policy_version 1139966 (0.00091) [2022-07-11 09:49:26,120][26022] Updated weights on worker 0-0, policy_version 1139976 (0.00094) [2022-07-11 09:49:26,531][25689] Fps is (10 sec: 5683.4, 60 sec: 5593.7, 300 sec: 5579.3). Total num frames: 1167337472. Throughput: 0: 5824.0. Samples: 1167338068. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:26,531][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 09:49:27,935][26022] Updated weights on worker 0-0, policy_version 1139986 (0.00093) [2022-07-11 09:49:29,755][26022] Updated weights on worker 0-0, policy_version 1139996 (0.00085) [2022-07-11 09:49:31,539][25689] Fps is (10 sec: 5720.7, 60 sec: 5581.4, 300 sec: 5580.9). Total num frames: 1167365120. Throughput: 0: 5830.2. Samples: 1167371662. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:31,540][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 09:49:31,580][26022] Updated weights on worker 0-0, policy_version 1140006 (0.00087) [2022-07-11 09:49:33,564][26022] Updated weights on worker 0-0, policy_version 1140016 (0.00084) [2022-07-11 09:49:35,161][26022] Updated weights on worker 0-0, policy_version 1140026 (0.00087) [2022-07-11 09:49:36,575][25689] Fps is (10 sec: 5505.0, 60 sec: 5569.0, 300 sec: 5573.4). Total num frames: 1167392768. Throughput: 0: 5037.2. Samples: 1167388628. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:36,576][25689] Avg episode reward: [(0, '0.358')] [2022-07-11 09:49:37,050][26022] Updated weights on worker 0-0, policy_version 1140036 (0.00087) [2022-07-11 09:49:38,946][26022] Updated weights on worker 0-0, policy_version 1140046 (0.00086) [2022-07-11 09:49:40,575][26022] Updated weights on worker 0-0, policy_version 1140056 (0.00076) [2022-07-11 09:49:41,587][25689] Fps is (10 sec: 5808.6, 60 sec: 5603.2, 300 sec: 5584.2). Total num frames: 1167423488. Throughput: 0: 5899.4. Samples: 1167422522. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:41,588][25689] Avg episode reward: [(0, '-0.636')] [2022-07-11 09:49:42,806][26022] Updated weights on worker 0-0, policy_version 1140066 (0.00083) [2022-07-11 09:49:44,132][26022] Updated weights on worker 0-0, policy_version 1140076 (0.00092) [2022-07-11 09:49:46,473][26022] Updated weights on worker 0-0, policy_version 1140086 (0.00082) [2022-07-11 09:49:46,604][25689] Fps is (10 sec: 5616.0, 60 sec: 5553.5, 300 sec: 5577.1). Total num frames: 1167449088. Throughput: 0: 5887.8. Samples: 1167456254. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:46,604][25689] Avg episode reward: [(0, '-0.098')] [2022-07-11 09:49:47,799][26022] Updated weights on worker 0-0, policy_version 1140096 (0.00085) [2022-07-11 09:49:49,963][26022] Updated weights on worker 0-0, policy_version 1140106 (0.00095) [2022-07-11 09:49:51,591][26022] Updated weights on worker 0-0, policy_version 1140116 (0.00097) [2022-07-11 09:49:51,633][25689] Fps is (10 sec: 5504.7, 60 sec: 5586.7, 300 sec: 5585.2). Total num frames: 1167478784. Throughput: 0: 5049.6. Samples: 1167473128. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:51,633][25689] Avg episode reward: [(0, '-0.168')] [2022-07-11 09:49:53,471][26022] Updated weights on worker 0-0, policy_version 1140126 (0.00085) [2022-07-11 09:49:55,226][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:49:55,244][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001140136_1167499264.pth [2022-07-11 09:49:55,245][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001138172_1165488128.pth [2022-07-11 09:49:55,258][26022] Updated weights on worker 0-0, policy_version 1140136 (0.00085) [2022-07-11 09:49:56,680][25689] Fps is (10 sec: 5691.2, 60 sec: 5556.0, 300 sec: 5578.1). Total num frames: 1167506432. Throughput: 0: 5882.6. Samples: 1167506894. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:49:56,680][25689] Avg episode reward: [(0, '-0.538')] [2022-07-11 09:49:56,956][26022] Updated weights on worker 0-0, policy_version 1140146 (0.00085) [2022-07-11 09:49:58,928][26022] Updated weights on worker 0-0, policy_version 1140156 (0.00087) [2022-07-11 09:50:00,799][26022] Updated weights on worker 0-0, policy_version 1140166 (0.00081) [2022-07-11 09:50:01,720][25689] Fps is (10 sec: 5481.9, 60 sec: 5573.5, 300 sec: 5591.6). Total num frames: 1167534080. Throughput: 0: 5858.5. Samples: 1167540466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:01,721][25689] Avg episode reward: [(0, '-0.610')] [2022-07-11 09:50:02,939][26022] Updated weights on worker 0-0, policy_version 1140176 (0.00089) [2022-07-11 09:50:04,939][26022] Updated weights on worker 0-0, policy_version 1140186 (0.00083) [2022-07-11 09:50:06,691][26022] Updated weights on worker 0-0, policy_version 1140196 (0.00086) [2022-07-11 09:50:06,726][25689] Fps is (10 sec: 5402.0, 60 sec: 5573.7, 300 sec: 5582.2). Total num frames: 1167560704. Throughput: 0: 4916.6. Samples: 1167555188. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:06,729][25689] Avg episode reward: [(0, '0.875')] [2022-07-11 09:50:08,654][26022] Updated weights on worker 0-0, policy_version 1140206 (0.00090) [2022-07-11 09:50:10,325][26022] Updated weights on worker 0-0, policy_version 1140216 (0.00082) [2022-07-11 09:50:11,761][25689] Fps is (10 sec: 5404.7, 60 sec: 5575.3, 300 sec: 5575.4). Total num frames: 1167588352. Throughput: 0: 5752.1. Samples: 1167588910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:11,763][25689] Avg episode reward: [(0, '1.231')] [2022-07-11 09:50:12,292][26022] Updated weights on worker 0-0, policy_version 1140226 (0.00089) [2022-07-11 09:50:13,956][26022] Updated weights on worker 0-0, policy_version 1140236 (0.00083) [2022-07-11 09:50:15,882][26022] Updated weights on worker 0-0, policy_version 1140246 (0.00086) [2022-07-11 09:50:16,842][25689] Fps is (10 sec: 5668.7, 60 sec: 5594.6, 300 sec: 5581.0). Total num frames: 1167618048. Throughput: 0: 5740.8. Samples: 1167622642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:16,843][25689] Avg episode reward: [(0, '1.167')] [2022-07-11 09:50:17,648][26022] Updated weights on worker 0-0, policy_version 1140256 (0.00080) [2022-07-11 09:50:19,624][26022] Updated weights on worker 0-0, policy_version 1140266 (0.00092) [2022-07-11 09:50:21,258][26022] Updated weights on worker 0-0, policy_version 1140276 (0.00086) [2022-07-11 09:50:21,908][25689] Fps is (10 sec: 5651.3, 60 sec: 5595.3, 300 sec: 5579.8). Total num frames: 1167645696. Throughput: 0: 4912.1. Samples: 1167639634. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:21,909][25689] Avg episode reward: [(0, '-0.134')] [2022-07-11 09:50:23,143][26022] Updated weights on worker 0-0, policy_version 1140286 (0.00561) [2022-07-11 09:50:25,167][26022] Updated weights on worker 0-0, policy_version 1140296 (0.00085) [2022-07-11 09:50:26,688][26022] Updated weights on worker 0-0, policy_version 1140306 (0.00112) [2022-07-11 09:50:26,927][25689] Fps is (10 sec: 5483.2, 60 sec: 5561.2, 300 sec: 5576.1). Total num frames: 1167673344. Throughput: 0: 5844.5. Samples: 1167673250. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:26,929][25689] Avg episode reward: [(0, '0.008')] [2022-07-11 09:50:28,639][26022] Updated weights on worker 0-0, policy_version 1140316 (0.00087) [2022-07-11 09:50:30,392][26022] Updated weights on worker 0-0, policy_version 1140326 (0.00085) [2022-07-11 09:50:31,951][25689] Fps is (10 sec: 5506.2, 60 sec: 5559.7, 300 sec: 5576.8). Total num frames: 1167700992. Throughput: 0: 5847.4. Samples: 1167706966. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:31,951][25689] Avg episode reward: [(0, '0.264')] [2022-07-11 09:50:32,219][26022] Updated weights on worker 0-0, policy_version 1140336 (0.00089) [2022-07-11 09:50:34,175][26022] Updated weights on worker 0-0, policy_version 1140346 (0.00090) [2022-07-11 09:50:35,938][26022] Updated weights on worker 0-0, policy_version 1140356 (0.00083) [2022-07-11 09:50:37,007][25689] Fps is (10 sec: 5587.2, 60 sec: 5574.8, 300 sec: 5579.7). Total num frames: 1167729664. Throughput: 0: 5003.1. Samples: 1167723528. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:37,019][25689] Avg episode reward: [(0, '-0.492')] [2022-07-11 09:50:37,783][26022] Updated weights on worker 0-0, policy_version 1140366 (0.00094) [2022-07-11 09:50:39,551][26022] Updated weights on worker 0-0, policy_version 1140376 (0.00086) [2022-07-11 09:50:41,283][26022] Updated weights on worker 0-0, policy_version 1140386 (0.00091) [2022-07-11 09:50:42,083][25689] Fps is (10 sec: 5659.7, 60 sec: 5535.1, 300 sec: 5574.9). Total num frames: 1167758336. Throughput: 0: 5829.0. Samples: 1167757230. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:42,083][25689] Avg episode reward: [(0, '-1.455')] [2022-07-11 09:50:43,225][26022] Updated weights on worker 0-0, policy_version 1140396 (0.00086) [2022-07-11 09:50:45,028][26022] Updated weights on worker 0-0, policy_version 1140406 (0.00086) [2022-07-11 09:50:46,884][26022] Updated weights on worker 0-0, policy_version 1140416 (0.00088) [2022-07-11 09:50:47,130][25689] Fps is (10 sec: 5664.7, 60 sec: 5583.0, 300 sec: 5581.2). Total num frames: 1167787008. Throughput: 0: 5831.2. Samples: 1167791058. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:47,131][25689] Avg episode reward: [(0, '0.359')] [2022-07-11 09:50:48,769][26022] Updated weights on worker 0-0, policy_version 1140426 (0.00094) [2022-07-11 09:50:50,495][26022] Updated weights on worker 0-0, policy_version 1140436 (0.00087) [2022-07-11 09:50:52,222][25689] Fps is (10 sec: 5554.7, 60 sec: 5543.4, 300 sec: 5577.5). Total num frames: 1167814656. Throughput: 0: 5809.7. Samples: 1167824736. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:52,222][25689] Avg episode reward: [(0, '-0.493')] [2022-07-11 09:50:52,473][26022] Updated weights on worker 0-0, policy_version 1140446 (0.00084) [2022-07-11 09:50:54,107][26022] Updated weights on worker 0-0, policy_version 1140456 (0.00094) [2022-07-11 09:50:56,127][26022] Updated weights on worker 0-0, policy_version 1140466 (0.00084) [2022-07-11 09:50:57,258][25689] Fps is (10 sec: 5662.0, 60 sec: 5578.2, 300 sec: 5580.5). Total num frames: 1167844352. Throughput: 0: 5825.7. Samples: 1167841504. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:50:57,258][25689] Avg episode reward: [(0, '-0.348')] [2022-07-11 09:50:57,753][26022] Updated weights on worker 0-0, policy_version 1140476 (0.00083) [2022-07-11 09:50:59,805][26022] Updated weights on worker 0-0, policy_version 1140486 (0.00087) [2022-07-11 09:51:01,516][26022] Updated weights on worker 0-0, policy_version 1140496 (0.00084) [2022-07-11 09:51:02,303][25689] Fps is (10 sec: 5383.5, 60 sec: 5527.1, 300 sec: 5576.5). Total num frames: 1167868928. Throughput: 0: 5824.7. Samples: 1167875008. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:02,303][25689] Avg episode reward: [(0, '-1.500')] [2022-07-11 09:51:03,712][26022] Updated weights on worker 0-0, policy_version 1140506 (0.00087) [2022-07-11 09:51:05,618][26022] Updated weights on worker 0-0, policy_version 1140516 (0.00091) [2022-07-11 09:51:07,305][25689] Fps is (10 sec: 5300.0, 60 sec: 5561.3, 300 sec: 5576.8). Total num frames: 1167897600. Throughput: 0: 5701.8. Samples: 1167906090. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:07,305][25689] Avg episode reward: [(0, '-1.536')] [2022-07-11 09:51:07,435][26022] Updated weights on worker 0-0, policy_version 1140526 (0.00086) [2022-07-11 09:51:09,325][26022] Updated weights on worker 0-0, policy_version 1140536 (0.00592) [2022-07-11 09:51:11,338][26022] Updated weights on worker 0-0, policy_version 1140546 (0.00087) [2022-07-11 09:51:12,330][25689] Fps is (10 sec: 5718.9, 60 sec: 5579.1, 300 sec: 5576.0). Total num frames: 1167926272. Throughput: 0: 4877.8. Samples: 1167922820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:12,331][25689] Avg episode reward: [(0, '-0.709')] [2022-07-11 09:51:13,002][26022] Updated weights on worker 0-0, policy_version 1140556 (0.00083) [2022-07-11 09:51:14,900][26022] Updated weights on worker 0-0, policy_version 1140566 (0.00091) [2022-07-11 09:51:16,704][26022] Updated weights on worker 0-0, policy_version 1140576 (0.00086) [2022-07-11 09:51:17,446][25689] Fps is (10 sec: 5553.4, 60 sec: 5542.1, 300 sec: 5574.1). Total num frames: 1167953920. Throughput: 0: 5687.3. Samples: 1167956320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:17,447][25689] Avg episode reward: [(0, '-0.847')] [2022-07-11 09:51:18,486][26022] Updated weights on worker 0-0, policy_version 1140586 (0.00083) [2022-07-11 09:51:20,315][26022] Updated weights on worker 0-0, policy_version 1140596 (0.00096) [2022-07-11 09:51:22,315][26022] Updated weights on worker 0-0, policy_version 1140606 (0.00088) [2022-07-11 09:51:22,482][25689] Fps is (10 sec: 5547.5, 60 sec: 5561.7, 300 sec: 5577.1). Total num frames: 1167982592. Throughput: 0: 5691.0. Samples: 1167989848. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:22,483][25689] Avg episode reward: [(0, '-0.679')] [2022-07-11 09:51:24,078][26022] Updated weights on worker 0-0, policy_version 1140616 (0.00087) [2022-07-11 09:51:25,827][26022] Updated weights on worker 0-0, policy_version 1140626 (0.00085) [2022-07-11 09:51:27,527][25689] Fps is (10 sec: 5586.9, 60 sec: 5559.3, 300 sec: 5573.4). Total num frames: 1168010240. Throughput: 0: 4970.1. Samples: 1168006596. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:27,527][25689] Avg episode reward: [(0, '-0.993')] [2022-07-11 09:51:27,741][26022] Updated weights on worker 0-0, policy_version 1140636 (0.00088) [2022-07-11 09:51:29,586][26022] Updated weights on worker 0-0, policy_version 1140646 (0.00090) [2022-07-11 09:51:31,432][26022] Updated weights on worker 0-0, policy_version 1140656 (0.00086) [2022-07-11 09:51:32,557][25689] Fps is (10 sec: 5387.2, 60 sec: 5541.9, 300 sec: 5567.7). Total num frames: 1168036864. Throughput: 0: 5786.5. Samples: 1168039860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:32,557][25689] Avg episode reward: [(0, '-0.781')] [2022-07-11 09:51:33,172][26022] Updated weights on worker 0-0, policy_version 1140666 (0.00085) [2022-07-11 09:51:34,967][26022] Updated weights on worker 0-0, policy_version 1140676 (0.00102) [2022-07-11 09:51:36,971][26022] Updated weights on worker 0-0, policy_version 1140686 (0.00097) [2022-07-11 09:51:37,628][25689] Fps is (10 sec: 5474.3, 60 sec: 5540.5, 300 sec: 5567.2). Total num frames: 1168065536. Throughput: 0: 5785.3. Samples: 1168073078. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:37,628][25689] Avg episode reward: [(0, '-0.632')] [2022-07-11 09:51:38,866][26022] Updated weights on worker 0-0, policy_version 1140696 (0.00090) [2022-07-11 09:51:40,583][26022] Updated weights on worker 0-0, policy_version 1140706 (0.00084) [2022-07-11 09:51:42,412][26022] Updated weights on worker 0-0, policy_version 1140716 (0.00083) [2022-07-11 09:51:42,639][25689] Fps is (10 sec: 5586.0, 60 sec: 5529.5, 300 sec: 5570.5). Total num frames: 1168093184. Throughput: 0: 4965.4. Samples: 1168089934. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:42,641][25689] Avg episode reward: [(0, '-0.403')] [2022-07-11 09:51:44,289][26022] Updated weights on worker 0-0, policy_version 1140726 (0.00111) [2022-07-11 09:51:46,257][26022] Updated weights on worker 0-0, policy_version 1140736 (0.00082) [2022-07-11 09:51:47,669][25689] Fps is (10 sec: 5609.0, 60 sec: 5531.1, 300 sec: 5566.9). Total num frames: 1168121856. Throughput: 0: 5813.1. Samples: 1168123684. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:47,669][25689] Avg episode reward: [(0, '-0.413')] [2022-07-11 09:51:47,719][26022] Updated weights on worker 0-0, policy_version 1140746 (0.00082) [2022-07-11 09:51:49,893][26022] Updated weights on worker 0-0, policy_version 1140756 (0.00101) [2022-07-11 09:51:51,643][26022] Updated weights on worker 0-0, policy_version 1140766 (0.00090) [2022-07-11 09:51:52,684][25689] Fps is (10 sec: 5606.8, 60 sec: 5538.2, 300 sec: 5567.7). Total num frames: 1168149504. Throughput: 0: 5837.7. Samples: 1168157356. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:52,685][25689] Avg episode reward: [(0, '-0.285')] [2022-07-11 09:51:53,407][26022] Updated weights on worker 0-0, policy_version 1140776 (0.00087) [2022-07-11 09:51:55,372][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:51:55,396][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001140786_1168164864.pth [2022-07-11 09:51:55,397][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001138825_1166156800.pth [2022-07-11 09:51:55,405][26022] Updated weights on worker 0-0, policy_version 1140786 (0.00085) [2022-07-11 09:51:56,892][26022] Updated weights on worker 0-0, policy_version 1140796 (0.00090) [2022-07-11 09:51:57,794][25689] Fps is (10 sec: 5562.7, 60 sec: 5514.5, 300 sec: 5573.7). Total num frames: 1168178176. Throughput: 0: 5012.2. Samples: 1168174152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:51:57,794][25689] Avg episode reward: [(0, '0.480')] [2022-07-11 09:51:58,959][26022] Updated weights on worker 0-0, policy_version 1140806 (0.00088) [2022-07-11 09:52:00,751][26022] Updated weights on worker 0-0, policy_version 1140816 (0.00105) [2022-07-11 09:52:02,815][25689] Fps is (10 sec: 5356.8, 60 sec: 5533.6, 300 sec: 5570.0). Total num frames: 1168203776. Throughput: 0: 5833.7. Samples: 1168207636. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:02,816][25689] Avg episode reward: [(0, '1.188')] [2022-07-11 09:52:03,018][26022] Updated weights on worker 0-0, policy_version 1140826 (0.00084) [2022-07-11 09:52:04,759][26022] Updated weights on worker 0-0, policy_version 1140836 (0.00075) [2022-07-11 09:52:06,822][26022] Updated weights on worker 0-0, policy_version 1140846 (0.00088) [2022-07-11 09:52:07,831][25689] Fps is (10 sec: 5407.1, 60 sec: 5532.3, 300 sec: 5566.6). Total num frames: 1168232448. Throughput: 0: 5703.0. Samples: 1168238666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:07,831][25689] Avg episode reward: [(0, '1.043')] [2022-07-11 09:52:08,428][26022] Updated weights on worker 0-0, policy_version 1140856 (0.00094) [2022-07-11 09:52:10,692][26022] Updated weights on worker 0-0, policy_version 1140866 (0.00092) [2022-07-11 09:52:12,035][26022] Updated weights on worker 0-0, policy_version 1140876 (0.00086) [2022-07-11 09:52:12,836][25689] Fps is (10 sec: 5518.1, 60 sec: 5500.3, 300 sec: 5570.8). Total num frames: 1168259072. Throughput: 0: 4864.5. Samples: 1168255386. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:12,837][25689] Avg episode reward: [(0, '0.026')] [2022-07-11 09:52:14,054][26022] Updated weights on worker 0-0, policy_version 1140886 (0.00085) [2022-07-11 09:52:15,899][26022] Updated weights on worker 0-0, policy_version 1140896 (0.00093) [2022-07-11 09:52:17,744][26022] Updated weights on worker 0-0, policy_version 1140906 (0.00094) [2022-07-11 09:52:17,891][25689] Fps is (10 sec: 5496.6, 60 sec: 5522.8, 300 sec: 5566.7). Total num frames: 1168287744. Throughput: 0: 5715.1. Samples: 1168289010. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:17,891][25689] Avg episode reward: [(0, '-0.924')] [2022-07-11 09:52:19,561][26022] Updated weights on worker 0-0, policy_version 1140916 (0.00084) [2022-07-11 09:52:21,452][26022] Updated weights on worker 0-0, policy_version 1140926 (0.00089) [2022-07-11 09:52:22,914][25689] Fps is (10 sec: 5690.2, 60 sec: 5524.0, 300 sec: 5563.0). Total num frames: 1168316416. Throughput: 0: 5735.4. Samples: 1168322908. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:22,914][25689] Avg episode reward: [(0, '-0.827')] [2022-07-11 09:52:23,136][26022] Updated weights on worker 0-0, policy_version 1140936 (0.00086) [2022-07-11 09:52:25,141][26022] Updated weights on worker 0-0, policy_version 1140946 (0.00083) [2022-07-11 09:52:26,858][26022] Updated weights on worker 0-0, policy_version 1140956 (0.00091) [2022-07-11 09:52:27,935][25689] Fps is (10 sec: 5505.4, 60 sec: 5509.2, 300 sec: 5563.8). Total num frames: 1168343040. Throughput: 0: 5035.2. Samples: 1168339894. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:27,935][25689] Avg episode reward: [(0, '-0.725')] [2022-07-11 09:52:28,676][26022] Updated weights on worker 0-0, policy_version 1140966 (0.00092) [2022-07-11 09:52:30,811][26022] Updated weights on worker 0-0, policy_version 1140976 (0.00085) [2022-07-11 09:52:32,163][26022] Updated weights on worker 0-0, policy_version 1140986 (0.00082) [2022-07-11 09:52:32,951][25689] Fps is (10 sec: 5611.2, 60 sec: 5561.3, 300 sec: 5567.6). Total num frames: 1168372736. Throughput: 0: 5866.8. Samples: 1168373396. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 09:52:32,952][25689] Avg episode reward: [(0, '-1.533')] [2022-07-11 09:52:34,304][26022] Updated weights on worker 0-0, policy_version 1140996 (0.00090) [2022-07-11 09:52:35,968][26022] Updated weights on worker 0-0, policy_version 1141006 (0.00088) [2022-07-11 09:52:38,043][25689] Fps is (10 sec: 5673.0, 60 sec: 5542.5, 300 sec: 5566.8). Total num frames: 1168400384. Throughput: 0: 5830.1. Samples: 1168406500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:52:38,044][25689] Avg episode reward: [(0, '-1.332')] [2022-07-11 09:52:38,048][26022] Updated weights on worker 0-0, policy_version 1141016 (0.00084) [2022-07-11 09:52:39,834][26022] Updated weights on worker 0-0, policy_version 1141026 (0.00089) [2022-07-11 09:52:41,639][26022] Updated weights on worker 0-0, policy_version 1141036 (0.00086) [2022-07-11 09:52:43,061][25689] Fps is (10 sec: 5469.2, 60 sec: 5541.8, 300 sec: 5559.7). Total num frames: 1168428032. Throughput: 0: 4987.7. Samples: 1168423398. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:52:43,067][25689] Avg episode reward: [(0, '-0.988')] [2022-07-11 09:52:43,488][26022] Updated weights on worker 0-0, policy_version 1141046 (0.00092) [2022-07-11 09:52:45,252][26022] Updated weights on worker 0-0, policy_version 1141056 (0.00085) [2022-07-11 09:52:47,024][26022] Updated weights on worker 0-0, policy_version 1141066 (0.00087) [2022-07-11 09:52:48,084][25689] Fps is (10 sec: 5609.1, 60 sec: 5542.5, 300 sec: 5563.0). Total num frames: 1168456704. Throughput: 0: 5816.8. Samples: 1168457098. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:52:48,084][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 09:52:48,947][26022] Updated weights on worker 0-0, policy_version 1141076 (0.00095) [2022-07-11 09:52:50,724][26022] Updated weights on worker 0-0, policy_version 1141086 (0.00079) [2022-07-11 09:52:52,547][26022] Updated weights on worker 0-0, policy_version 1141096 (0.00106) [2022-07-11 09:52:53,163][25689] Fps is (10 sec: 5676.9, 60 sec: 5553.5, 300 sec: 5563.3). Total num frames: 1168485376. Throughput: 0: 5826.5. Samples: 1168491160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:52:53,164][25689] Avg episode reward: [(0, '-0.112')] [2022-07-11 09:52:54,282][26022] Updated weights on worker 0-0, policy_version 1141106 (0.00101) [2022-07-11 09:52:56,049][26022] Updated weights on worker 0-0, policy_version 1141116 (0.00091) [2022-07-11 09:52:57,893][26022] Updated weights on worker 0-0, policy_version 1141126 (0.00095) [2022-07-11 09:52:58,239][25689] Fps is (10 sec: 5646.6, 60 sec: 5556.6, 300 sec: 5559.2). Total num frames: 1168514048. Throughput: 0: 5031.6. Samples: 1168508120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:52:58,240][25689] Avg episode reward: [(0, '-0.240')] [2022-07-11 09:52:59,693][26022] Updated weights on worker 0-0, policy_version 1141136 (0.00086) [2022-07-11 09:53:02,001][26022] Updated weights on worker 0-0, policy_version 1141146 (0.00085) [2022-07-11 09:53:03,274][25689] Fps is (10 sec: 5367.7, 60 sec: 5555.4, 300 sec: 5559.7). Total num frames: 1168539648. Throughput: 0: 5820.6. Samples: 1168541044. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:03,274][25689] Avg episode reward: [(0, '0.417')] [2022-07-11 09:53:03,778][26022] Updated weights on worker 0-0, policy_version 1141156 (0.00086) [2022-07-11 09:53:05,553][26022] Updated weights on worker 0-0, policy_version 1141166 (0.00087) [2022-07-11 09:53:07,533][26022] Updated weights on worker 0-0, policy_version 1141176 (0.00090) [2022-07-11 09:53:08,335][25689] Fps is (10 sec: 5375.6, 60 sec: 5551.2, 300 sec: 5562.2). Total num frames: 1168568320. Throughput: 0: 5740.9. Samples: 1168573360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:08,336][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 09:53:09,296][26022] Updated weights on worker 0-0, policy_version 1141186 (0.00084) [2022-07-11 09:53:11,172][26022] Updated weights on worker 0-0, policy_version 1141196 (0.01225) [2022-07-11 09:53:13,025][26022] Updated weights on worker 0-0, policy_version 1141206 (0.00106) [2022-07-11 09:53:13,375][25689] Fps is (10 sec: 5676.9, 60 sec: 5581.9, 300 sec: 5559.1). Total num frames: 1168596992. Throughput: 0: 4903.8. Samples: 1168590276. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:13,375][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 09:53:14,940][26022] Updated weights on worker 0-0, policy_version 1141216 (0.00094) [2022-07-11 09:53:16,707][26022] Updated weights on worker 0-0, policy_version 1141226 (0.00085) [2022-07-11 09:53:18,415][25689] Fps is (10 sec: 5587.6, 60 sec: 5566.3, 300 sec: 5558.8). Total num frames: 1168624640. Throughput: 0: 5738.7. Samples: 1168623902. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:18,415][25689] Avg episode reward: [(0, '0.910')] [2022-07-11 09:53:18,516][26022] Updated weights on worker 0-0, policy_version 1141236 (0.00093) [2022-07-11 09:53:20,153][26022] Updated weights on worker 0-0, policy_version 1141246 (0.00086) [2022-07-11 09:53:22,139][26022] Updated weights on worker 0-0, policy_version 1141256 (0.00091) [2022-07-11 09:53:23,424][25689] Fps is (10 sec: 5604.5, 60 sec: 5567.6, 300 sec: 5562.5). Total num frames: 1168653312. Throughput: 0: 5789.8. Samples: 1168657710. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:23,425][25689] Avg episode reward: [(0, '1.722')] [2022-07-11 09:53:23,756][26022] Updated weights on worker 0-0, policy_version 1141266 (0.00091) [2022-07-11 09:53:25,768][26022] Updated weights on worker 0-0, policy_version 1141276 (0.00095) [2022-07-11 09:53:27,636][26022] Updated weights on worker 0-0, policy_version 1141286 (0.00084) [2022-07-11 09:53:28,459][25689] Fps is (10 sec: 5505.2, 60 sec: 5566.3, 300 sec: 5556.1). Total num frames: 1168679936. Throughput: 0: 5032.1. Samples: 1168674626. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:28,460][25689] Avg episode reward: [(0, '1.643')] [2022-07-11 09:53:29,233][26022] Updated weights on worker 0-0, policy_version 1141296 (0.00098) [2022-07-11 09:53:31,427][26022] Updated weights on worker 0-0, policy_version 1141306 (0.00082) [2022-07-11 09:53:33,031][26022] Updated weights on worker 0-0, policy_version 1141316 (0.00086) [2022-07-11 09:53:33,461][25689] Fps is (10 sec: 5611.0, 60 sec: 5567.6, 300 sec: 5561.1). Total num frames: 1168709632. Throughput: 0: 5865.9. Samples: 1168708102. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:33,462][25689] Avg episode reward: [(0, '0.932')] [2022-07-11 09:53:34,822][26022] Updated weights on worker 0-0, policy_version 1141326 (0.00085) [2022-07-11 09:53:36,755][26022] Updated weights on worker 0-0, policy_version 1141336 (0.00086) [2022-07-11 09:53:38,560][25689] Fps is (10 sec: 5677.1, 60 sec: 5566.9, 300 sec: 5556.1). Total num frames: 1168737280. Throughput: 0: 5834.3. Samples: 1168741436. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:38,562][25689] Avg episode reward: [(0, '0.537')] [2022-07-11 09:53:38,631][26022] Updated weights on worker 0-0, policy_version 1141346 (0.00087) [2022-07-11 09:53:40,378][26022] Updated weights on worker 0-0, policy_version 1141356 (0.00087) [2022-07-11 09:53:42,463][26022] Updated weights on worker 0-0, policy_version 1141366 (0.00080) [2022-07-11 09:53:43,590][25689] Fps is (10 sec: 5560.5, 60 sec: 5582.8, 300 sec: 5556.1). Total num frames: 1168765952. Throughput: 0: 5833.2. Samples: 1168775344. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:43,590][25689] Avg episode reward: [(0, '-0.060')] [2022-07-11 09:53:43,873][26022] Updated weights on worker 0-0, policy_version 1141376 (0.00401) [2022-07-11 09:53:46,048][26022] Updated weights on worker 0-0, policy_version 1141386 (0.00089) [2022-07-11 09:53:47,709][26022] Updated weights on worker 0-0, policy_version 1141396 (0.00086) [2022-07-11 09:53:48,634][25689] Fps is (10 sec: 5590.8, 60 sec: 5563.9, 300 sec: 5555.6). Total num frames: 1168793600. Throughput: 0: 5842.3. Samples: 1168792492. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:48,635][25689] Avg episode reward: [(0, '0.180')] [2022-07-11 09:53:49,576][26022] Updated weights on worker 0-0, policy_version 1141406 (0.00092) [2022-07-11 09:53:51,386][26022] Updated weights on worker 0-0, policy_version 1141416 (0.00093) [2022-07-11 09:53:53,217][26022] Updated weights on worker 0-0, policy_version 1141426 (0.00086) [2022-07-11 09:53:53,671][25689] Fps is (10 sec: 5587.0, 60 sec: 5567.8, 300 sec: 5553.0). Total num frames: 1168822272. Throughput: 0: 5818.4. Samples: 1168825688. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:53,671][25689] Avg episode reward: [(0, '-0.668')] [2022-07-11 09:53:55,063][26022] Updated weights on worker 0-0, policy_version 1141436 (0.00097) [2022-07-11 09:53:55,454][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:53:55,468][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001141438_1168832512.pth [2022-07-11 09:53:55,469][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001139479_1166826496.pth [2022-07-11 09:53:57,084][26022] Updated weights on worker 0-0, policy_version 1141446 (0.00096) [2022-07-11 09:53:58,504][26022] Updated weights on worker 0-0, policy_version 1141456 (0.00085) [2022-07-11 09:53:58,749][25689] Fps is (10 sec: 5669.2, 60 sec: 5567.6, 300 sec: 5559.3). Total num frames: 1168850944. Throughput: 0: 5836.8. Samples: 1168859274. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:53:58,750][25689] Avg episode reward: [(0, '-0.268')] [2022-07-11 09:54:00,735][26022] Updated weights on worker 0-0, policy_version 1141466 (0.00084) [2022-07-11 09:54:02,775][26022] Updated weights on worker 0-0, policy_version 1141476 (0.00085) [2022-07-11 09:54:03,757][25689] Fps is (10 sec: 5279.4, 60 sec: 5553.1, 300 sec: 5552.5). Total num frames: 1168875520. Throughput: 0: 4992.3. Samples: 1168876022. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:03,757][25689] Avg episode reward: [(0, '0.405')] [2022-07-11 09:54:04,591][26022] Updated weights on worker 0-0, policy_version 1141486 (0.00082) [2022-07-11 09:54:06,545][26022] Updated weights on worker 0-0, policy_version 1141496 (0.00093) [2022-07-11 09:54:08,336][26022] Updated weights on worker 0-0, policy_version 1141506 (0.00090) [2022-07-11 09:54:08,767][25689] Fps is (10 sec: 5315.4, 60 sec: 5557.9, 300 sec: 5556.7). Total num frames: 1168904192. Throughput: 0: 5723.2. Samples: 1168907716. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:08,767][25689] Avg episode reward: [(0, '-0.348')] [2022-07-11 09:54:10,175][26022] Updated weights on worker 0-0, policy_version 1141516 (0.00083) [2022-07-11 09:54:11,971][26022] Updated weights on worker 0-0, policy_version 1141526 (0.00092) [2022-07-11 09:54:13,698][26022] Updated weights on worker 0-0, policy_version 1141536 (0.00082) [2022-07-11 09:54:13,784][25689] Fps is (10 sec: 5719.0, 60 sec: 5560.0, 300 sec: 5558.4). Total num frames: 1168932864. Throughput: 0: 5751.2. Samples: 1168941362. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:13,784][25689] Avg episode reward: [(0, '0.480')] [2022-07-11 09:54:15,748][26022] Updated weights on worker 0-0, policy_version 1141546 (0.00083) [2022-07-11 09:54:17,364][26022] Updated weights on worker 0-0, policy_version 1141556 (0.00087) [2022-07-11 09:54:18,850][25689] Fps is (10 sec: 5687.0, 60 sec: 5574.5, 300 sec: 5561.9). Total num frames: 1168961536. Throughput: 0: 4916.2. Samples: 1168958094. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:18,851][25689] Avg episode reward: [(0, '-0.698')] [2022-07-11 09:54:19,283][26022] Updated weights on worker 0-0, policy_version 1141566 (0.00090) [2022-07-11 09:54:20,950][26022] Updated weights on worker 0-0, policy_version 1141576 (0.00082) [2022-07-11 09:54:22,847][26022] Updated weights on worker 0-0, policy_version 1141586 (0.00091) [2022-07-11 09:54:23,943][25689] Fps is (10 sec: 5543.9, 60 sec: 5549.9, 300 sec: 5553.6). Total num frames: 1168989184. Throughput: 0: 5743.9. Samples: 1168991968. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:23,943][25689] Avg episode reward: [(0, '0.010')] [2022-07-11 09:54:24,536][26022] Updated weights on worker 0-0, policy_version 1141596 (0.00091) [2022-07-11 09:54:26,747][26022] Updated weights on worker 0-0, policy_version 1141606 (0.00085) [2022-07-11 09:54:28,285][26022] Updated weights on worker 0-0, policy_version 1141616 (0.00085) [2022-07-11 09:54:28,947][25689] Fps is (10 sec: 5578.2, 60 sec: 5586.6, 300 sec: 5557.1). Total num frames: 1169017856. Throughput: 0: 5833.8. Samples: 1169025442. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:28,947][25689] Avg episode reward: [(0, '-1.406')] [2022-07-11 09:54:30,383][26022] Updated weights on worker 0-0, policy_version 1141626 (0.00093) [2022-07-11 09:54:31,866][26022] Updated weights on worker 0-0, policy_version 1141636 (0.00096) [2022-07-11 09:54:33,911][26022] Updated weights on worker 0-0, policy_version 1141646 (0.00097) [2022-07-11 09:54:33,990][25689] Fps is (10 sec: 5605.6, 60 sec: 5549.0, 300 sec: 5557.0). Total num frames: 1169045504. Throughput: 0: 5000.3. Samples: 1169042398. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:33,990][25689] Avg episode reward: [(0, '-0.223')] [2022-07-11 09:54:35,743][26022] Updated weights on worker 0-0, policy_version 1141656 (0.00081) [2022-07-11 09:54:37,582][26022] Updated weights on worker 0-0, policy_version 1141666 (0.00080) [2022-07-11 09:54:39,112][25689] Fps is (10 sec: 5540.2, 60 sec: 5563.8, 300 sec: 5548.1). Total num frames: 1169074176. Throughput: 0: 5831.7. Samples: 1169076256. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:39,113][25689] Avg episode reward: [(0, '-1.247')] [2022-07-11 09:54:39,426][26022] Updated weights on worker 0-0, policy_version 1141676 (0.00098) [2022-07-11 09:54:41,308][26022] Updated weights on worker 0-0, policy_version 1141686 (0.00092) [2022-07-11 09:54:43,040][26022] Updated weights on worker 0-0, policy_version 1141696 (0.00095) [2022-07-11 09:54:44,126][25689] Fps is (10 sec: 5657.5, 60 sec: 5565.3, 300 sec: 5558.4). Total num frames: 1169102848. Throughput: 0: 5845.1. Samples: 1169109940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:44,126][25689] Avg episode reward: [(0, '-1.557')] [2022-07-11 09:54:44,916][26022] Updated weights on worker 0-0, policy_version 1141706 (0.00086) [2022-07-11 09:54:46,498][26022] Updated weights on worker 0-0, policy_version 1141716 (0.00086) [2022-07-11 09:54:48,519][26022] Updated weights on worker 0-0, policy_version 1141726 (0.00093) [2022-07-11 09:54:49,175][25689] Fps is (10 sec: 5495.1, 60 sec: 5547.8, 300 sec: 5547.7). Total num frames: 1169129472. Throughput: 0: 5009.3. Samples: 1169126772. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:49,176][25689] Avg episode reward: [(0, '-0.500')] [2022-07-11 09:54:50,186][26022] Updated weights on worker 0-0, policy_version 1141736 (0.00086) [2022-07-11 09:54:52,126][26022] Updated weights on worker 0-0, policy_version 1141746 (0.00085) [2022-07-11 09:54:53,674][26022] Updated weights on worker 0-0, policy_version 1141756 (0.00091) [2022-07-11 09:54:54,190][25689] Fps is (10 sec: 5595.7, 60 sec: 5566.7, 300 sec: 5555.2). Total num frames: 1169159168. Throughput: 0: 5862.5. Samples: 1169160826. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:54,191][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 09:54:55,663][26022] Updated weights on worker 0-0, policy_version 1141766 (0.00084) [2022-07-11 09:54:57,556][26022] Updated weights on worker 0-0, policy_version 1141776 (0.00091) [2022-07-11 09:54:59,255][25689] Fps is (10 sec: 5790.7, 60 sec: 5568.0, 300 sec: 5558.2). Total num frames: 1169187840. Throughput: 0: 5880.4. Samples: 1169194702. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:54:59,257][25689] Avg episode reward: [(0, '-0.566')] [2022-07-11 09:54:59,267][26022] Updated weights on worker 0-0, policy_version 1141786 (0.00096) [2022-07-11 09:55:01,299][26022] Updated weights on worker 0-0, policy_version 1141796 (0.00086) [2022-07-11 09:55:03,314][26022] Updated weights on worker 0-0, policy_version 1141806 (0.00053) [2022-07-11 09:55:04,340][25689] Fps is (10 sec: 5448.1, 60 sec: 5594.7, 300 sec: 5556.7). Total num frames: 1169214464. Throughput: 0: 5017.5. Samples: 1169211366. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:04,341][25689] Avg episode reward: [(0, '-1.580')] [2022-07-11 09:55:05,313][26022] Updated weights on worker 0-0, policy_version 1141816 (0.00086) [2022-07-11 09:55:07,168][26022] Updated weights on worker 0-0, policy_version 1141826 (0.00050) [2022-07-11 09:55:08,774][26022] Updated weights on worker 0-0, policy_version 1141836 (0.00097) [2022-07-11 09:55:09,372][25689] Fps is (10 sec: 5364.0, 60 sec: 5575.7, 300 sec: 5556.8). Total num frames: 1169242112. Throughput: 0: 5744.5. Samples: 1169242796. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:09,373][25689] Avg episode reward: [(0, '-1.876')] [2022-07-11 09:55:10,930][26022] Updated weights on worker 0-0, policy_version 1141846 (0.00094) [2022-07-11 09:55:12,609][26022] Updated weights on worker 0-0, policy_version 1141856 (0.00085) [2022-07-11 09:55:14,413][25689] Fps is (10 sec: 5489.5, 60 sec: 5556.6, 300 sec: 5550.6). Total num frames: 1169269760. Throughput: 0: 5723.8. Samples: 1169276578. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:14,415][25689] Avg episode reward: [(0, '-1.507')] [2022-07-11 09:55:14,478][26022] Updated weights on worker 0-0, policy_version 1141866 (0.00089) [2022-07-11 09:55:16,168][26022] Updated weights on worker 0-0, policy_version 1141876 (0.00088) [2022-07-11 09:55:18,056][26022] Updated weights on worker 0-0, policy_version 1141886 (0.00082) [2022-07-11 09:55:19,553][25689] Fps is (10 sec: 5632.9, 60 sec: 5566.8, 300 sec: 5556.2). Total num frames: 1169299456. Throughput: 0: 4869.4. Samples: 1169293540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:19,553][25689] Avg episode reward: [(0, '-1.932')] [2022-07-11 09:55:19,852][26022] Updated weights on worker 0-0, policy_version 1141896 (0.00088) [2022-07-11 09:55:21,642][26022] Updated weights on worker 0-0, policy_version 1141906 (0.00098) [2022-07-11 09:55:23,762][26022] Updated weights on worker 0-0, policy_version 1141916 (0.00086) [2022-07-11 09:55:24,615][25689] Fps is (10 sec: 5621.1, 60 sec: 5569.6, 300 sec: 5555.3). Total num frames: 1169327104. Throughput: 0: 5713.2. Samples: 1169327198. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:24,615][25689] Avg episode reward: [(0, '-1.846')] [2022-07-11 09:55:25,458][26022] Updated weights on worker 0-0, policy_version 1141926 (0.00064) [2022-07-11 09:55:27,293][26022] Updated weights on worker 0-0, policy_version 1141936 (0.00080) [2022-07-11 09:55:29,192][26022] Updated weights on worker 0-0, policy_version 1141946 (0.00088) [2022-07-11 09:55:29,621][25689] Fps is (10 sec: 5492.1, 60 sec: 5552.5, 300 sec: 5555.7). Total num frames: 1169354752. Throughput: 0: 5815.2. Samples: 1169360544. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:29,621][25689] Avg episode reward: [(0, '-0.670')] [2022-07-11 09:55:30,734][26022] Updated weights on worker 0-0, policy_version 1141956 (0.00081) [2022-07-11 09:55:32,784][26022] Updated weights on worker 0-0, policy_version 1141966 (0.00088) [2022-07-11 09:55:34,395][26022] Updated weights on worker 0-0, policy_version 1141976 (0.00082) [2022-07-11 09:55:34,673][25689] Fps is (10 sec: 5701.0, 60 sec: 5585.4, 300 sec: 5559.2). Total num frames: 1169384448. Throughput: 0: 4971.5. Samples: 1169377300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:34,674][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 09:55:36,562][26022] Updated weights on worker 0-0, policy_version 1141986 (0.00096) [2022-07-11 09:55:38,105][26022] Updated weights on worker 0-0, policy_version 1141996 (0.00050) [2022-07-11 09:55:39,783][25689] Fps is (10 sec: 5542.4, 60 sec: 5552.9, 300 sec: 5551.7). Total num frames: 1169411072. Throughput: 0: 5802.0. Samples: 1169410914. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:39,783][25689] Avg episode reward: [(0, '-0.189')] [2022-07-11 09:55:40,012][26022] Updated weights on worker 0-0, policy_version 1142006 (0.00087) [2022-07-11 09:55:41,935][26022] Updated weights on worker 0-0, policy_version 1142016 (0.00087) [2022-07-11 09:55:43,724][26022] Updated weights on worker 0-0, policy_version 1142026 (0.00084) [2022-07-11 09:55:44,845][25689] Fps is (10 sec: 5537.1, 60 sec: 5565.3, 300 sec: 5554.9). Total num frames: 1169440768. Throughput: 0: 5796.4. Samples: 1169444458. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:44,845][25689] Avg episode reward: [(0, '0.281')] [2022-07-11 09:55:45,615][26022] Updated weights on worker 0-0, policy_version 1142036 (0.00084) [2022-07-11 09:55:47,319][26022] Updated weights on worker 0-0, policy_version 1142046 (0.00086) [2022-07-11 09:55:49,245][26022] Updated weights on worker 0-0, policy_version 1142056 (0.00070) [2022-07-11 09:55:49,849][25689] Fps is (10 sec: 5798.4, 60 sec: 5603.2, 300 sec: 5559.9). Total num frames: 1169469440. Throughput: 0: 5828.9. Samples: 1169478450. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:49,849][25689] Avg episode reward: [(0, '0.494')] [2022-07-11 09:55:50,896][26022] Updated weights on worker 0-0, policy_version 1142066 (0.00086) [2022-07-11 09:55:52,856][26022] Updated weights on worker 0-0, policy_version 1142076 (0.00098) [2022-07-11 09:55:54,635][26022] Updated weights on worker 0-0, policy_version 1142086 (0.00087) [2022-07-11 09:55:54,857][25689] Fps is (10 sec: 5625.0, 60 sec: 5570.1, 300 sec: 5553.6). Total num frames: 1169497088. Throughput: 0: 5857.8. Samples: 1169495532. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:54,858][25689] Avg episode reward: [(0, '0.971')] [2022-07-11 09:55:55,605][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:55:55,618][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001142091_1169501184.pth [2022-07-11 09:55:55,619][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001140136_1167499264.pth [2022-07-11 09:55:56,484][26022] Updated weights on worker 0-0, policy_version 1142096 (0.00089) [2022-07-11 09:55:58,280][26022] Updated weights on worker 0-0, policy_version 1142106 (0.00085) [2022-07-11 09:55:59,971][25689] Fps is (10 sec: 5564.0, 60 sec: 5565.6, 300 sec: 5566.0). Total num frames: 1169525760. Throughput: 0: 5860.2. Samples: 1169529222. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:55:59,972][25689] Avg episode reward: [(0, '1.804')] [2022-07-11 09:56:00,015][26022] Updated weights on worker 0-0, policy_version 1142116 (0.00087) [2022-07-11 09:56:02,228][26022] Updated weights on worker 0-0, policy_version 1142126 (0.00086) [2022-07-11 09:56:04,284][26022] Updated weights on worker 0-0, policy_version 1142136 (0.00095) [2022-07-11 09:56:05,003][25689] Fps is (10 sec: 5349.2, 60 sec: 5553.6, 300 sec: 5555.2). Total num frames: 1169551360. Throughput: 0: 5789.1. Samples: 1169561156. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:05,003][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 09:56:05,811][26022] Updated weights on worker 0-0, policy_version 1142146 (0.00081) [2022-07-11 09:56:07,971][26022] Updated weights on worker 0-0, policy_version 1142156 (0.00087) [2022-07-11 09:56:09,338][26022] Updated weights on worker 0-0, policy_version 1142166 (0.00094) [2022-07-11 09:56:10,010][25689] Fps is (10 sec: 5406.1, 60 sec: 5572.8, 300 sec: 5555.5). Total num frames: 1169580032. Throughput: 0: 4926.6. Samples: 1169577776. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:10,011][25689] Avg episode reward: [(0, '2.118')] [2022-07-11 09:56:11,544][26022] Updated weights on worker 0-0, policy_version 1142176 (0.00086) [2022-07-11 09:56:13,230][26022] Updated weights on worker 0-0, policy_version 1142186 (0.01243) [2022-07-11 09:56:15,047][25689] Fps is (10 sec: 5607.0, 60 sec: 5573.1, 300 sec: 5557.0). Total num frames: 1169607680. Throughput: 0: 5742.2. Samples: 1169611470. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:15,048][25689] Avg episode reward: [(0, '1.843')] [2022-07-11 09:56:15,194][26022] Updated weights on worker 0-0, policy_version 1142196 (0.00087) [2022-07-11 09:56:16,950][26022] Updated weights on worker 0-0, policy_version 1142206 (0.00092) [2022-07-11 09:56:18,703][26022] Updated weights on worker 0-0, policy_version 1142216 (0.00088) [2022-07-11 09:56:20,150][25689] Fps is (10 sec: 5655.1, 60 sec: 5576.4, 300 sec: 5559.2). Total num frames: 1169637376. Throughput: 0: 5756.7. Samples: 1169645388. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:20,151][25689] Avg episode reward: [(0, '0.066')] [2022-07-11 09:56:20,599][26022] Updated weights on worker 0-0, policy_version 1142226 (0.00078) [2022-07-11 09:56:22,292][26022] Updated weights on worker 0-0, policy_version 1142236 (0.00086) [2022-07-11 09:56:24,147][26022] Updated weights on worker 0-0, policy_version 1142246 (0.00085) [2022-07-11 09:56:25,176][25689] Fps is (10 sec: 5762.5, 60 sec: 5596.7, 300 sec: 5562.9). Total num frames: 1169666048. Throughput: 0: 5019.4. Samples: 1169662416. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:25,178][25689] Avg episode reward: [(0, '-0.142')] [2022-07-11 09:56:26,127][26022] Updated weights on worker 0-0, policy_version 1142256 (0.00096) [2022-07-11 09:56:27,911][26022] Updated weights on worker 0-0, policy_version 1142266 (0.00094) [2022-07-11 09:56:29,744][26022] Updated weights on worker 0-0, policy_version 1142276 (0.00086) [2022-07-11 09:56:30,267][25689] Fps is (10 sec: 5567.3, 60 sec: 5588.9, 300 sec: 5565.3). Total num frames: 1169693696. Throughput: 0: 5830.8. Samples: 1169695888. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:30,267][25689] Avg episode reward: [(0, '0.072')] [2022-07-11 09:56:31,577][26022] Updated weights on worker 0-0, policy_version 1142286 (0.00087) [2022-07-11 09:56:33,452][26022] Updated weights on worker 0-0, policy_version 1142296 (0.00086) [2022-07-11 09:56:35,105][26022] Updated weights on worker 0-0, policy_version 1142306 (0.00077) [2022-07-11 09:56:35,302][25689] Fps is (10 sec: 5562.0, 60 sec: 5573.6, 300 sec: 5565.9). Total num frames: 1169722368. Throughput: 0: 5839.0. Samples: 1169729738. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 09:56:35,303][25689] Avg episode reward: [(0, '0.136')] [2022-07-11 09:56:37,065][26022] Updated weights on worker 0-0, policy_version 1142316 (0.00085) [2022-07-11 09:56:38,889][26022] Updated weights on worker 0-0, policy_version 1142326 (0.00095) [2022-07-11 09:56:40,345][25689] Fps is (10 sec: 5588.4, 60 sec: 5596.6, 300 sec: 5565.3). Total num frames: 1169750016. Throughput: 0: 5008.1. Samples: 1169746524. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:56:40,345][25689] Avg episode reward: [(0, '-0.705')] [2022-07-11 09:56:40,764][26022] Updated weights on worker 0-0, policy_version 1142336 (0.00075) [2022-07-11 09:56:42,535][26022] Updated weights on worker 0-0, policy_version 1142346 (0.00615) [2022-07-11 09:56:44,419][26022] Updated weights on worker 0-0, policy_version 1142356 (0.00089) [2022-07-11 09:56:45,357][25689] Fps is (10 sec: 5601.6, 60 sec: 5584.3, 300 sec: 5565.7). Total num frames: 1169778688. Throughput: 0: 5818.7. Samples: 1169779838. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:56:45,357][25689] Avg episode reward: [(0, '0.852')] [2022-07-11 09:56:46,026][26022] Updated weights on worker 0-0, policy_version 1142366 (0.00082) [2022-07-11 09:56:48,182][26022] Updated weights on worker 0-0, policy_version 1142376 (0.00090) [2022-07-11 09:56:49,804][26022] Updated weights on worker 0-0, policy_version 1142386 (0.00093) [2022-07-11 09:56:50,414][25689] Fps is (10 sec: 5593.4, 60 sec: 5562.6, 300 sec: 5564.9). Total num frames: 1169806336. Throughput: 0: 5843.5. Samples: 1169813618. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:56:50,414][25689] Avg episode reward: [(0, '-0.399')] [2022-07-11 09:56:51,667][26022] Updated weights on worker 0-0, policy_version 1142396 (0.00087) [2022-07-11 09:56:53,345][26022] Updated weights on worker 0-0, policy_version 1142406 (0.00089) [2022-07-11 09:56:55,302][26022] Updated weights on worker 0-0, policy_version 1142416 (0.00090) [2022-07-11 09:56:55,423][25689] Fps is (10 sec: 5594.7, 60 sec: 5579.3, 300 sec: 5566.8). Total num frames: 1169835008. Throughput: 0: 5009.2. Samples: 1169830528. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:56:55,424][25689] Avg episode reward: [(0, '-0.605')] [2022-07-11 09:56:57,027][26022] Updated weights on worker 0-0, policy_version 1142426 (0.00099) [2022-07-11 09:56:59,022][26022] Updated weights on worker 0-0, policy_version 1142436 (0.00088) [2022-07-11 09:57:00,510][25689] Fps is (10 sec: 5679.7, 60 sec: 5581.9, 300 sec: 5575.9). Total num frames: 1169863680. Throughput: 0: 5842.0. Samples: 1169864330. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:00,511][25689] Avg episode reward: [(0, '-0.306')] [2022-07-11 09:57:00,520][26022] Updated weights on worker 0-0, policy_version 1142446 (0.00083) [2022-07-11 09:57:02,945][26022] Updated weights on worker 0-0, policy_version 1142456 (0.00091) [2022-07-11 09:57:04,438][26022] Updated weights on worker 0-0, policy_version 1142466 (0.00084) [2022-07-11 09:57:05,513][25689] Fps is (10 sec: 5277.4, 60 sec: 5567.6, 300 sec: 5562.3). Total num frames: 1169888256. Throughput: 0: 5773.7. Samples: 1169896216. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:05,514][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 09:57:06,663][26022] Updated weights on worker 0-0, policy_version 1142476 (0.00086) [2022-07-11 09:57:08,277][26022] Updated weights on worker 0-0, policy_version 1142486 (0.00085) [2022-07-11 09:57:10,143][26022] Updated weights on worker 0-0, policy_version 1142496 (0.00577) [2022-07-11 09:57:10,526][25689] Fps is (10 sec: 5418.5, 60 sec: 5584.0, 300 sec: 5572.5). Total num frames: 1169917952. Throughput: 0: 4955.5. Samples: 1169913286. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:10,527][25689] Avg episode reward: [(0, '0.140')] [2022-07-11 09:57:11,759][26022] Updated weights on worker 0-0, policy_version 1142506 (0.00090) [2022-07-11 09:57:14,049][26022] Updated weights on worker 0-0, policy_version 1142516 (0.00084) [2022-07-11 09:57:15,506][26022] Updated weights on worker 0-0, policy_version 1142526 (0.00091) [2022-07-11 09:57:15,553][25689] Fps is (10 sec: 5813.9, 60 sec: 5601.9, 300 sec: 5573.0). Total num frames: 1169946624. Throughput: 0: 5776.5. Samples: 1169946804. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:15,553][25689] Avg episode reward: [(0, '0.364')] [2022-07-11 09:57:17,562][26022] Updated weights on worker 0-0, policy_version 1142536 (0.00087) [2022-07-11 09:57:19,337][26022] Updated weights on worker 0-0, policy_version 1142546 (0.00095) [2022-07-11 09:57:20,683][25689] Fps is (10 sec: 5545.1, 60 sec: 5565.5, 300 sec: 5567.6). Total num frames: 1169974272. Throughput: 0: 5754.4. Samples: 1169980412. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:20,683][25689] Avg episode reward: [(0, '1.743')] [2022-07-11 09:57:21,204][26022] Updated weights on worker 0-0, policy_version 1142556 (0.00086) [2022-07-11 09:57:23,003][26022] Updated weights on worker 0-0, policy_version 1142566 (0.00083) [2022-07-11 09:57:24,780][26022] Updated weights on worker 0-0, policy_version 1142576 (0.00084) [2022-07-11 09:57:25,688][25689] Fps is (10 sec: 5455.8, 60 sec: 5550.6, 300 sec: 5571.3). Total num frames: 1170001920. Throughput: 0: 4996.6. Samples: 1169997020. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:25,689][25689] Avg episode reward: [(0, '1.395')] [2022-07-11 09:57:26,640][26022] Updated weights on worker 0-0, policy_version 1142586 (0.00089) [2022-07-11 09:57:28,470][26022] Updated weights on worker 0-0, policy_version 1142596 (0.00106) [2022-07-11 09:57:30,369][26022] Updated weights on worker 0-0, policy_version 1142606 (0.00099) [2022-07-11 09:57:30,698][25689] Fps is (10 sec: 5521.3, 60 sec: 5557.9, 300 sec: 5564.6). Total num frames: 1170029568. Throughput: 0: 5813.0. Samples: 1170030544. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:30,700][25689] Avg episode reward: [(0, '-0.666')] [2022-07-11 09:57:32,257][26022] Updated weights on worker 0-0, policy_version 1142616 (0.00087) [2022-07-11 09:57:33,923][26022] Updated weights on worker 0-0, policy_version 1142626 (0.00094) [2022-07-11 09:57:35,710][25689] Fps is (10 sec: 5619.2, 60 sec: 5560.0, 300 sec: 5569.5). Total num frames: 1170058240. Throughput: 0: 5808.4. Samples: 1170063890. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:35,713][25689] Avg episode reward: [(0, '-0.741')] [2022-07-11 09:57:35,802][26022] Updated weights on worker 0-0, policy_version 1142636 (0.00088) [2022-07-11 09:57:37,680][26022] Updated weights on worker 0-0, policy_version 1142646 (0.00090) [2022-07-11 09:57:39,586][26022] Updated weights on worker 0-0, policy_version 1142656 (0.00086) [2022-07-11 09:57:40,777][25689] Fps is (10 sec: 5689.1, 60 sec: 5574.7, 300 sec: 5572.0). Total num frames: 1170086912. Throughput: 0: 4982.9. Samples: 1170080542. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:40,778][25689] Avg episode reward: [(0, '-0.792')] [2022-07-11 09:57:41,268][26022] Updated weights on worker 0-0, policy_version 1142666 (0.00054) [2022-07-11 09:57:43,295][26022] Updated weights on worker 0-0, policy_version 1142676 (0.00084) [2022-07-11 09:57:44,974][26022] Updated weights on worker 0-0, policy_version 1142686 (0.00083) [2022-07-11 09:57:45,779][25689] Fps is (10 sec: 5492.0, 60 sec: 5541.8, 300 sec: 5565.5). Total num frames: 1170113536. Throughput: 0: 5819.9. Samples: 1170113948. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:45,779][25689] Avg episode reward: [(0, '-1.458')] [2022-07-11 09:57:46,847][26022] Updated weights on worker 0-0, policy_version 1142696 (0.00083) [2022-07-11 09:57:48,766][26022] Updated weights on worker 0-0, policy_version 1142706 (0.00071) [2022-07-11 09:57:50,566][26022] Updated weights on worker 0-0, policy_version 1142716 (0.00086) [2022-07-11 09:57:50,815][25689] Fps is (10 sec: 5611.1, 60 sec: 5577.6, 300 sec: 5569.8). Total num frames: 1170143232. Throughput: 0: 5825.9. Samples: 1170147744. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:50,815][25689] Avg episode reward: [(0, '-1.064')] [2022-07-11 09:57:52,298][26022] Updated weights on worker 0-0, policy_version 1142726 (0.00088) [2022-07-11 09:57:54,194][26022] Updated weights on worker 0-0, policy_version 1142736 (0.00090) [2022-07-11 09:57:55,770][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:57:55,784][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001142746_1170171904.pth [2022-07-11 09:57:55,784][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001140786_1168164864.pth [2022-07-11 09:57:55,789][26022] Updated weights on worker 0-0, policy_version 1142746 (0.00085) [2022-07-11 09:57:55,818][25689] Fps is (10 sec: 5813.8, 60 sec: 5578.2, 300 sec: 5571.1). Total num frames: 1170171904. Throughput: 0: 4997.3. Samples: 1170164376. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:57:55,821][25689] Avg episode reward: [(0, '-1.048')] [2022-07-11 09:57:57,981][26022] Updated weights on worker 0-0, policy_version 1142756 (0.00087) [2022-07-11 09:57:59,759][26022] Updated weights on worker 0-0, policy_version 1142766 (0.00086) [2022-07-11 09:58:00,916][25689] Fps is (10 sec: 5373.0, 60 sec: 5526.4, 300 sec: 5570.0). Total num frames: 1170197504. Throughput: 0: 5838.6. Samples: 1170198120. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:00,916][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 09:58:01,418][26022] Updated weights on worker 0-0, policy_version 1142776 (0.00083) [2022-07-11 09:58:03,993][26022] Updated weights on worker 0-0, policy_version 1142786 (0.00084) [2022-07-11 09:58:05,292][26022] Updated weights on worker 0-0, policy_version 1142796 (0.00100) [2022-07-11 09:58:05,947][25689] Fps is (10 sec: 5257.3, 60 sec: 5574.6, 300 sec: 5567.1). Total num frames: 1170225152. Throughput: 0: 5750.2. Samples: 1170229918. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:05,949][25689] Avg episode reward: [(0, '0.908')] [2022-07-11 09:58:07,445][26022] Updated weights on worker 0-0, policy_version 1142806 (0.00083) [2022-07-11 09:58:09,251][26022] Updated weights on worker 0-0, policy_version 1142816 (0.00090) [2022-07-11 09:58:10,959][25689] Fps is (10 sec: 5607.7, 60 sec: 5557.8, 300 sec: 5567.6). Total num frames: 1170253824. Throughput: 0: 4922.4. Samples: 1170246902. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:10,960][25689] Avg episode reward: [(0, '1.283')] [2022-07-11 09:58:10,973][26022] Updated weights on worker 0-0, policy_version 1142826 (0.00091) [2022-07-11 09:58:13,005][26022] Updated weights on worker 0-0, policy_version 1142836 (0.00085) [2022-07-11 09:58:14,642][26022] Updated weights on worker 0-0, policy_version 1142846 (0.00088) [2022-07-11 09:58:15,994][25689] Fps is (10 sec: 5605.7, 60 sec: 5540.1, 300 sec: 5567.7). Total num frames: 1170281472. Throughput: 0: 5746.9. Samples: 1170280322. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:15,997][25689] Avg episode reward: [(0, '1.856')] [2022-07-11 09:58:16,555][26022] Updated weights on worker 0-0, policy_version 1142856 (0.00084) [2022-07-11 09:58:18,296][26022] Updated weights on worker 0-0, policy_version 1142866 (0.00090) [2022-07-11 09:58:20,194][26022] Updated weights on worker 0-0, policy_version 1142876 (0.00509) [2022-07-11 09:58:21,062][25689] Fps is (10 sec: 5574.4, 60 sec: 5562.7, 300 sec: 5566.6). Total num frames: 1170310144. Throughput: 0: 5758.3. Samples: 1170314130. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:21,064][25689] Avg episode reward: [(0, '0.520')] [2022-07-11 09:58:22,009][26022] Updated weights on worker 0-0, policy_version 1142886 (0.00092) [2022-07-11 09:58:23,952][26022] Updated weights on worker 0-0, policy_version 1142896 (0.00087) [2022-07-11 09:58:25,680][26022] Updated weights on worker 0-0, policy_version 1142906 (0.00091) [2022-07-11 09:58:26,077][25689] Fps is (10 sec: 5484.1, 60 sec: 5544.8, 300 sec: 5567.0). Total num frames: 1170336768. Throughput: 0: 5014.4. Samples: 1170330858. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:26,079][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 09:58:27,565][26022] Updated weights on worker 0-0, policy_version 1142916 (0.00085) [2022-07-11 09:58:29,280][26022] Updated weights on worker 0-0, policy_version 1142926 (0.00085) [2022-07-11 09:58:31,102][25689] Fps is (10 sec: 5507.6, 60 sec: 5560.4, 300 sec: 5563.1). Total num frames: 1170365440. Throughput: 0: 5818.1. Samples: 1170364098. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:31,103][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 09:58:31,103][26022] Updated weights on worker 0-0, policy_version 1142936 (0.00083) [2022-07-11 09:58:33,007][26022] Updated weights on worker 0-0, policy_version 1142946 (0.00092) [2022-07-11 09:58:34,868][26022] Updated weights on worker 0-0, policy_version 1142956 (0.00090) [2022-07-11 09:58:36,139][25689] Fps is (10 sec: 5698.7, 60 sec: 5558.1, 300 sec: 5567.7). Total num frames: 1170394112. Throughput: 0: 5811.5. Samples: 1170397398. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:36,140][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 09:58:36,755][26022] Updated weights on worker 0-0, policy_version 1142966 (0.00082) [2022-07-11 09:58:38,557][26022] Updated weights on worker 0-0, policy_version 1142976 (0.00087) [2022-07-11 09:58:40,321][26022] Updated weights on worker 0-0, policy_version 1142986 (0.00084) [2022-07-11 09:58:41,269][25689] Fps is (10 sec: 5438.6, 60 sec: 5518.5, 300 sec: 5558.9). Total num frames: 1170420736. Throughput: 0: 5779.2. Samples: 1170430912. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:41,270][25689] Avg episode reward: [(0, '-0.465')] [2022-07-11 09:58:42,172][26022] Updated weights on worker 0-0, policy_version 1142996 (0.00092) [2022-07-11 09:58:44,094][26022] Updated weights on worker 0-0, policy_version 1143006 (0.00085) [2022-07-11 09:58:45,880][26022] Updated weights on worker 0-0, policy_version 1143016 (0.00095) [2022-07-11 09:58:46,273][25689] Fps is (10 sec: 5456.8, 60 sec: 5552.2, 300 sec: 5563.1). Total num frames: 1170449408. Throughput: 0: 5779.6. Samples: 1170447584. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:46,275][25689] Avg episode reward: [(0, '-0.207')] [2022-07-11 09:58:47,863][26022] Updated weights on worker 0-0, policy_version 1143026 (0.00097) [2022-07-11 09:58:49,525][26022] Updated weights on worker 0-0, policy_version 1143036 (0.00087) [2022-07-11 09:58:51,299][25689] Fps is (10 sec: 5615.6, 60 sec: 5519.2, 300 sec: 5559.9). Total num frames: 1170477056. Throughput: 0: 5802.4. Samples: 1170481286. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:51,299][25689] Avg episode reward: [(0, '1.076')] [2022-07-11 09:58:51,447][26022] Updated weights on worker 0-0, policy_version 1143046 (0.00089) [2022-07-11 09:58:53,206][26022] Updated weights on worker 0-0, policy_version 1143056 (0.00086) [2022-07-11 09:58:55,047][26022] Updated weights on worker 0-0, policy_version 1143066 (0.00085) [2022-07-11 09:58:56,374][25689] Fps is (10 sec: 5676.8, 60 sec: 5529.6, 300 sec: 5563.4). Total num frames: 1170506752. Throughput: 0: 5799.2. Samples: 1170514744. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:58:56,376][25689] Avg episode reward: [(0, '1.509')] [2022-07-11 09:58:57,024][26022] Updated weights on worker 0-0, policy_version 1143076 (0.00081) [2022-07-11 09:58:58,815][26022] Updated weights on worker 0-0, policy_version 1143086 (0.00088) [2022-07-11 09:59:00,803][26022] Updated weights on worker 0-0, policy_version 1143096 (0.00089) [2022-07-11 09:59:01,487][25689] Fps is (10 sec: 5628.3, 60 sec: 5561.9, 300 sec: 5571.7). Total num frames: 1170534400. Throughput: 0: 4983.1. Samples: 1170531660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:01,489][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 09:59:02,683][26022] Updated weights on worker 0-0, policy_version 1143106 (0.00082) [2022-07-11 09:59:04,803][26022] Updated weights on worker 0-0, policy_version 1143116 (0.00084) [2022-07-11 09:59:06,443][26022] Updated weights on worker 0-0, policy_version 1143126 (0.00090) [2022-07-11 09:59:06,507][25689] Fps is (10 sec: 5457.5, 60 sec: 5563.0, 300 sec: 5568.1). Total num frames: 1170562048. Throughput: 0: 5721.4. Samples: 1170563348. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:06,507][25689] Avg episode reward: [(0, '0.657')] [2022-07-11 09:59:08,373][26022] Updated weights on worker 0-0, policy_version 1143136 (0.00094) [2022-07-11 09:59:10,177][26022] Updated weights on worker 0-0, policy_version 1143146 (0.00083) [2022-07-11 09:59:11,532][25689] Fps is (10 sec: 5504.9, 60 sec: 5544.9, 300 sec: 5564.5). Total num frames: 1170589696. Throughput: 0: 5718.3. Samples: 1170596986. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:11,534][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 09:59:11,939][26022] Updated weights on worker 0-0, policy_version 1143156 (0.00083) [2022-07-11 09:59:13,904][26022] Updated weights on worker 0-0, policy_version 1143166 (0.00085) [2022-07-11 09:59:15,632][26022] Updated weights on worker 0-0, policy_version 1143176 (0.00089) [2022-07-11 09:59:16,567][25689] Fps is (10 sec: 5394.7, 60 sec: 5528.0, 300 sec: 5558.2). Total num frames: 1170616320. Throughput: 0: 4890.0. Samples: 1170613486. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:16,568][25689] Avg episode reward: [(0, '0.824')] [2022-07-11 09:59:17,511][26022] Updated weights on worker 0-0, policy_version 1143186 (0.00090) [2022-07-11 09:59:19,299][26022] Updated weights on worker 0-0, policy_version 1143196 (0.00085) [2022-07-11 09:59:21,059][26022] Updated weights on worker 0-0, policy_version 1143206 (0.00083) [2022-07-11 09:59:21,688][25689] Fps is (10 sec: 5444.9, 60 sec: 5523.2, 300 sec: 5561.2). Total num frames: 1170644992. Throughput: 0: 5720.6. Samples: 1170647218. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:21,688][25689] Avg episode reward: [(0, '0.656')] [2022-07-11 09:59:23,085][26022] Updated weights on worker 0-0, policy_version 1143216 (0.00093) [2022-07-11 09:59:24,827][26022] Updated weights on worker 0-0, policy_version 1143226 (0.00092) [2022-07-11 09:59:26,592][26022] Updated weights on worker 0-0, policy_version 1143236 (0.00092) [2022-07-11 09:59:26,703][25689] Fps is (10 sec: 5657.5, 60 sec: 5557.0, 300 sec: 5561.0). Total num frames: 1170673664. Throughput: 0: 5810.2. Samples: 1170680692. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:26,703][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 09:59:28,379][26022] Updated weights on worker 0-0, policy_version 1143246 (0.00353) [2022-07-11 09:59:30,338][26022] Updated weights on worker 0-0, policy_version 1143256 (0.00091) [2022-07-11 09:59:31,771][25689] Fps is (10 sec: 5686.8, 60 sec: 5553.0, 300 sec: 5563.9). Total num frames: 1170702336. Throughput: 0: 4985.1. Samples: 1170697882. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:31,772][25689] Avg episode reward: [(0, '1.333')] [2022-07-11 09:59:31,864][26022] Updated weights on worker 0-0, policy_version 1143266 (0.00078) [2022-07-11 09:59:34,136][26022] Updated weights on worker 0-0, policy_version 1143276 (0.00081) [2022-07-11 09:59:35,422][26022] Updated weights on worker 0-0, policy_version 1143286 (0.00091) [2022-07-11 09:59:36,814][25689] Fps is (10 sec: 5468.8, 60 sec: 5518.8, 300 sec: 5558.5). Total num frames: 1170728960. Throughput: 0: 5840.3. Samples: 1170731736. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:36,814][25689] Avg episode reward: [(0, '1.924')] [2022-07-11 09:59:37,587][26022] Updated weights on worker 0-0, policy_version 1143296 (0.00895) [2022-07-11 09:59:39,311][26022] Updated weights on worker 0-0, policy_version 1143306 (0.00084) [2022-07-11 09:59:41,254][26022] Updated weights on worker 0-0, policy_version 1143316 (0.00082) [2022-07-11 09:59:41,930][25689] Fps is (10 sec: 5543.9, 60 sec: 5570.6, 300 sec: 5560.0). Total num frames: 1170758656. Throughput: 0: 5803.7. Samples: 1170764700. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:41,931][25689] Avg episode reward: [(0, '1.156')] [2022-07-11 09:59:43,159][26022] Updated weights on worker 0-0, policy_version 1143326 (0.00088) [2022-07-11 09:59:45,127][26022] Updated weights on worker 0-0, policy_version 1143336 (0.00089) [2022-07-11 09:59:46,960][25689] Fps is (10 sec: 5551.0, 60 sec: 5534.5, 300 sec: 5560.4). Total num frames: 1170785280. Throughput: 0: 4933.6. Samples: 1170780632. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:46,960][25689] Avg episode reward: [(0, '0.869')] [2022-07-11 09:59:47,032][26022] Updated weights on worker 0-0, policy_version 1143346 (0.00090) [2022-07-11 09:59:48,904][26022] Updated weights on worker 0-0, policy_version 1143356 (0.00091) [2022-07-11 09:59:50,656][26022] Updated weights on worker 0-0, policy_version 1143366 (0.00091) [2022-07-11 09:59:52,038][25689] Fps is (10 sec: 5267.9, 60 sec: 5512.8, 300 sec: 5548.9). Total num frames: 1170811904. Throughput: 0: 5706.6. Samples: 1170813536. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:52,039][25689] Avg episode reward: [(0, '0.471')] [2022-07-11 09:59:52,776][26022] Updated weights on worker 0-0, policy_version 1143376 (0.00082) [2022-07-11 09:59:54,411][26022] Updated weights on worker 0-0, policy_version 1143386 (0.00085) [2022-07-11 09:59:55,900][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 09:59:55,911][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001143393_1170834432.pth [2022-07-11 09:59:55,912][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001141438_1168832512.pth [2022-07-11 09:59:56,324][26022] Updated weights on worker 0-0, policy_version 1143396 (0.00087) [2022-07-11 09:59:57,130][25689] Fps is (10 sec: 5537.7, 60 sec: 5511.4, 300 sec: 5551.8). Total num frames: 1170841600. Throughput: 0: 5656.3. Samples: 1170846650. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 09:59:57,131][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 09:59:58,183][26022] Updated weights on worker 0-0, policy_version 1143406 (0.00093) [2022-07-11 09:59:59,973][26022] Updated weights on worker 0-0, policy_version 1143416 (0.00095) [2022-07-11 10:00:02,145][26022] Updated weights on worker 0-0, policy_version 1143426 (0.00092) [2022-07-11 10:00:02,198][25689] Fps is (10 sec: 5543.3, 60 sec: 5498.6, 300 sec: 5552.2). Total num frames: 1170868224. Throughput: 0: 4868.0. Samples: 1170863364. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:02,202][25689] Avg episode reward: [(0, '0.413')] [2022-07-11 10:00:04,294][26022] Updated weights on worker 0-0, policy_version 1143436 (0.00090) [2022-07-11 10:00:05,768][26022] Updated weights on worker 0-0, policy_version 1143446 (0.00088) [2022-07-11 10:00:07,214][25689] Fps is (10 sec: 5280.7, 60 sec: 5482.1, 300 sec: 5549.0). Total num frames: 1170894848. Throughput: 0: 5627.4. Samples: 1170894608. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:07,214][25689] Avg episode reward: [(0, '0.787')] [2022-07-11 10:00:07,790][26022] Updated weights on worker 0-0, policy_version 1143456 (0.00098) [2022-07-11 10:00:09,541][26022] Updated weights on worker 0-0, policy_version 1143466 (0.00093) [2022-07-11 10:00:11,448][26022] Updated weights on worker 0-0, policy_version 1143476 (0.00081) [2022-07-11 10:00:12,235][25689] Fps is (10 sec: 5611.3, 60 sec: 5516.2, 300 sec: 5556.3). Total num frames: 1170924544. Throughput: 0: 5681.7. Samples: 1170928288. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:12,237][25689] Avg episode reward: [(0, '0.784')] [2022-07-11 10:00:13,344][26022] Updated weights on worker 0-0, policy_version 1143486 (0.00098) [2022-07-11 10:00:15,116][26022] Updated weights on worker 0-0, policy_version 1143496 (0.00094) [2022-07-11 10:00:16,845][26022] Updated weights on worker 0-0, policy_version 1143506 (0.00088) [2022-07-11 10:00:17,251][25689] Fps is (10 sec: 5713.3, 60 sec: 5534.8, 300 sec: 5551.7). Total num frames: 1170952192. Throughput: 0: 4892.0. Samples: 1170945080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:17,251][25689] Avg episode reward: [(0, '1.019')] [2022-07-11 10:00:18,805][26022] Updated weights on worker 0-0, policy_version 1143516 (0.00087) [2022-07-11 10:00:20,698][26022] Updated weights on worker 0-0, policy_version 1143526 (0.00084) [2022-07-11 10:00:22,306][25689] Fps is (10 sec: 5592.1, 60 sec: 5540.7, 300 sec: 5555.3). Total num frames: 1170980864. Throughput: 0: 5725.4. Samples: 1170978492. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:22,307][25689] Avg episode reward: [(0, '0.451')] [2022-07-11 10:00:22,310][26022] Updated weights on worker 0-0, policy_version 1143536 (0.00089) [2022-07-11 10:00:24,376][26022] Updated weights on worker 0-0, policy_version 1143546 (0.00097) [2022-07-11 10:00:25,855][26022] Updated weights on worker 0-0, policy_version 1143556 (0.00105) [2022-07-11 10:00:27,331][25689] Fps is (10 sec: 5384.0, 60 sec: 5489.2, 300 sec: 5548.1). Total num frames: 1171006464. Throughput: 0: 5822.5. Samples: 1171011740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:27,333][25689] Avg episode reward: [(0, '0.329')] [2022-07-11 10:00:28,220][26022] Updated weights on worker 0-0, policy_version 1143566 (0.00090) [2022-07-11 10:00:29,635][26022] Updated weights on worker 0-0, policy_version 1143576 (0.00089) [2022-07-11 10:00:31,730][26022] Updated weights on worker 0-0, policy_version 1143586 (0.00091) [2022-07-11 10:00:32,339][25689] Fps is (10 sec: 5511.8, 60 sec: 5511.6, 300 sec: 5548.9). Total num frames: 1171036160. Throughput: 0: 4980.1. Samples: 1171028406. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2022-07-11 10:00:32,340][25689] Avg episode reward: [(0, '0.086')] [2022-07-11 10:00:33,404][26022] Updated weights on worker 0-0, policy_version 1143596 (0.00082) [2022-07-11 10:00:35,231][26022] Updated weights on worker 0-0, policy_version 1143606 (0.00083) [2022-07-11 10:00:37,099][26022] Updated weights on worker 0-0, policy_version 1143616 (0.00082) [2022-07-11 10:00:37,355][25689] Fps is (10 sec: 5720.9, 60 sec: 5531.0, 300 sec: 5554.1). Total num frames: 1171063808. Throughput: 0: 5822.1. Samples: 1171062126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:00:37,358][25689] Avg episode reward: [(0, '0.593')] [2022-07-11 10:00:39,079][26022] Updated weights on worker 0-0, policy_version 1143626 (0.00092) [2022-07-11 10:00:40,723][26022] Updated weights on worker 0-0, policy_version 1143636 (0.00086) [2022-07-11 10:00:42,479][25689] Fps is (10 sec: 5352.3, 60 sec: 5479.5, 300 sec: 5542.6). Total num frames: 1171090432. Throughput: 0: 5811.3. Samples: 1171095720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:00:42,479][25689] Avg episode reward: [(0, '0.733')] [2022-07-11 10:00:42,777][26022] Updated weights on worker 0-0, policy_version 1143646 (0.00094) [2022-07-11 10:00:44,535][26022] Updated weights on worker 0-0, policy_version 1143656 (0.00093) [2022-07-11 10:00:46,313][26022] Updated weights on worker 0-0, policy_version 1143666 (0.00082) [2022-07-11 10:00:47,492][25689] Fps is (10 sec: 5555.3, 60 sec: 5531.7, 300 sec: 5545.9). Total num frames: 1171120128. Throughput: 0: 5813.2. Samples: 1171128942. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:00:47,494][25689] Avg episode reward: [(0, '1.060')] [2022-07-11 10:00:48,318][26022] Updated weights on worker 0-0, policy_version 1143676 (0.00085) [2022-07-11 10:00:50,083][26022] Updated weights on worker 0-0, policy_version 1143686 (0.00086) [2022-07-11 10:00:51,903][26022] Updated weights on worker 0-0, policy_version 1143696 (0.00085) [2022-07-11 10:00:52,496][25689] Fps is (10 sec: 5826.9, 60 sec: 5572.4, 300 sec: 5549.4). Total num frames: 1171148800. Throughput: 0: 5825.8. Samples: 1171145836. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:00:52,496][25689] Avg episode reward: [(0, '1.202')] [2022-07-11 10:00:53,716][26022] Updated weights on worker 0-0, policy_version 1143706 (0.00087) [2022-07-11 10:00:55,472][26022] Updated weights on worker 0-0, policy_version 1143716 (0.00087) [2022-07-11 10:00:57,485][26022] Updated weights on worker 0-0, policy_version 1143726 (0.00097) [2022-07-11 10:00:57,514][25689] Fps is (10 sec: 5517.7, 60 sec: 5528.4, 300 sec: 5544.3). Total num frames: 1171175424. Throughput: 0: 5823.3. Samples: 1171179520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:00:57,515][25689] Avg episode reward: [(0, '0.735')] [2022-07-11 10:00:59,135][26022] Updated weights on worker 0-0, policy_version 1143736 (0.00087) [2022-07-11 10:01:01,070][26022] Updated weights on worker 0-0, policy_version 1143746 (0.00088) [2022-07-11 10:01:02,610][25689] Fps is (10 sec: 5264.8, 60 sec: 5525.8, 300 sec: 5546.5). Total num frames: 1171202048. Throughput: 0: 5732.7. Samples: 1171211126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:02,615][25689] Avg episode reward: [(0, '-0.175')] [2022-07-11 10:01:03,398][26022] Updated weights on worker 0-0, policy_version 1143756 (0.00088) [2022-07-11 10:01:05,001][26022] Updated weights on worker 0-0, policy_version 1143766 (0.00099) [2022-07-11 10:01:06,788][26022] Updated weights on worker 0-0, policy_version 1143776 (0.00090) [2022-07-11 10:01:07,651][25689] Fps is (10 sec: 5556.1, 60 sec: 5574.3, 300 sec: 5549.4). Total num frames: 1171231744. Throughput: 0: 4919.1. Samples: 1171228102. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:07,651][25689] Avg episode reward: [(0, '-1.363')] [2022-07-11 10:01:08,854][26022] Updated weights on worker 0-0, policy_version 1143786 (0.00101) [2022-07-11 10:01:10,307][26022] Updated weights on worker 0-0, policy_version 1143796 (0.00086) [2022-07-11 10:01:12,609][26022] Updated weights on worker 0-0, policy_version 1143806 (0.00089) [2022-07-11 10:01:12,655][25689] Fps is (10 sec: 5504.5, 60 sec: 5508.1, 300 sec: 5543.1). Total num frames: 1171257344. Throughput: 0: 5733.3. Samples: 1171261416. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:12,656][25689] Avg episode reward: [(0, '-1.274')] [2022-07-11 10:01:14,060][26022] Updated weights on worker 0-0, policy_version 1143816 (0.00086) [2022-07-11 10:01:16,100][26022] Updated weights on worker 0-0, policy_version 1143826 (0.00082) [2022-07-11 10:01:17,668][25689] Fps is (10 sec: 5519.9, 60 sec: 5542.2, 300 sec: 5544.8). Total num frames: 1171287040. Throughput: 0: 5728.6. Samples: 1171294976. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:17,669][25689] Avg episode reward: [(0, '-1.572')] [2022-07-11 10:01:17,969][26022] Updated weights on worker 0-0, policy_version 1143836 (0.00089) [2022-07-11 10:01:19,647][26022] Updated weights on worker 0-0, policy_version 1143846 (0.00084) [2022-07-11 10:01:21,584][26022] Updated weights on worker 0-0, policy_version 1143856 (0.00084) [2022-07-11 10:01:22,724][25689] Fps is (10 sec: 5695.6, 60 sec: 5525.3, 300 sec: 5540.8). Total num frames: 1171314688. Throughput: 0: 5000.1. Samples: 1171311698. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:22,724][25689] Avg episode reward: [(0, '-1.316')] [2022-07-11 10:01:23,371][26022] Updated weights on worker 0-0, policy_version 1143866 (0.00082) [2022-07-11 10:01:25,119][26022] Updated weights on worker 0-0, policy_version 1143876 (0.00085) [2022-07-11 10:01:26,974][26022] Updated weights on worker 0-0, policy_version 1143886 (0.00087) [2022-07-11 10:01:27,743][25689] Fps is (10 sec: 5590.1, 60 sec: 5576.6, 300 sec: 5545.5). Total num frames: 1171343360. Throughput: 0: 5836.8. Samples: 1171345378. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:27,745][25689] Avg episode reward: [(0, '-0.465')] [2022-07-11 10:01:28,869][26022] Updated weights on worker 0-0, policy_version 1143896 (0.00086) [2022-07-11 10:01:30,667][26022] Updated weights on worker 0-0, policy_version 1143906 (0.00089) [2022-07-11 10:01:32,502][26022] Updated weights on worker 0-0, policy_version 1143916 (0.00084) [2022-07-11 10:01:32,783][25689] Fps is (10 sec: 5700.7, 60 sec: 5556.7, 300 sec: 5545.5). Total num frames: 1171372032. Throughput: 0: 5859.0. Samples: 1171379342. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:32,785][25689] Avg episode reward: [(0, '0.975')] [2022-07-11 10:01:34,299][26022] Updated weights on worker 0-0, policy_version 1143926 (0.00080) [2022-07-11 10:01:36,122][26022] Updated weights on worker 0-0, policy_version 1143936 (0.00093) [2022-07-11 10:01:37,816][25689] Fps is (10 sec: 5489.7, 60 sec: 5538.2, 300 sec: 5542.2). Total num frames: 1171398656. Throughput: 0: 5019.3. Samples: 1171396104. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:37,818][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 10:01:38,077][26022] Updated weights on worker 0-0, policy_version 1143946 (0.00088) [2022-07-11 10:01:39,619][26022] Updated weights on worker 0-0, policy_version 1143956 (0.00082) [2022-07-11 10:01:41,609][26022] Updated weights on worker 0-0, policy_version 1143966 (0.00090) [2022-07-11 10:01:42,933][25689] Fps is (10 sec: 5548.6, 60 sec: 5589.7, 300 sec: 5543.7). Total num frames: 1171428352. Throughput: 0: 5834.8. Samples: 1171429616. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:42,933][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 10:01:43,529][26022] Updated weights on worker 0-0, policy_version 1143976 (0.00090) [2022-07-11 10:01:45,367][26022] Updated weights on worker 0-0, policy_version 1143986 (0.00087) [2022-07-11 10:01:47,240][26022] Updated weights on worker 0-0, policy_version 1143996 (0.00091) [2022-07-11 10:01:47,937][25689] Fps is (10 sec: 5564.7, 60 sec: 5539.8, 300 sec: 5541.2). Total num frames: 1171454976. Throughput: 0: 5823.8. Samples: 1171462982. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:47,937][25689] Avg episode reward: [(0, '-1.029')] [2022-07-11 10:01:49,042][26022] Updated weights on worker 0-0, policy_version 1144006 (0.00081) [2022-07-11 10:01:50,596][26022] Updated weights on worker 0-0, policy_version 1144016 (0.00086) [2022-07-11 10:01:52,815][26022] Updated weights on worker 0-0, policy_version 1144026 (0.00089) [2022-07-11 10:01:52,954][25689] Fps is (10 sec: 5415.7, 60 sec: 5521.5, 300 sec: 5537.6). Total num frames: 1171482624. Throughput: 0: 4977.7. Samples: 1171479750. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:52,955][25689] Avg episode reward: [(0, '-2.219')] [2022-07-11 10:01:54,386][26022] Updated weights on worker 0-0, policy_version 1144036 (0.00080) [2022-07-11 10:01:56,107][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:01:56,121][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001144044_1171501056.pth [2022-07-11 10:01:56,121][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001142091_1169501184.pth [2022-07-11 10:01:56,434][26022] Updated weights on worker 0-0, policy_version 1144046 (0.00093) [2022-07-11 10:01:57,843][26022] Updated weights on worker 0-0, policy_version 1144056 (0.00088) [2022-07-11 10:01:57,970][25689] Fps is (10 sec: 5817.7, 60 sec: 5589.6, 300 sec: 5545.8). Total num frames: 1171513344. Throughput: 0: 5822.8. Samples: 1171513456. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:01:57,970][25689] Avg episode reward: [(0, '-2.121')] [2022-07-11 10:01:59,996][26022] Updated weights on worker 0-0, policy_version 1144066 (0.00103) [2022-07-11 10:02:01,630][26022] Updated weights on worker 0-0, policy_version 1144076 (0.00088) [2022-07-11 10:02:03,099][25689] Fps is (10 sec: 5450.9, 60 sec: 5552.6, 300 sec: 5543.5). Total num frames: 1171537920. Throughput: 0: 5742.2. Samples: 1171545412. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:03,099][25689] Avg episode reward: [(0, '-2.107')] [2022-07-11 10:02:04,028][26022] Updated weights on worker 0-0, policy_version 1144086 (0.00088) [2022-07-11 10:02:05,707][26022] Updated weights on worker 0-0, policy_version 1144096 (0.00088) [2022-07-11 10:02:07,792][26022] Updated weights on worker 0-0, policy_version 1144106 (0.00087) [2022-07-11 10:02:08,158][25689] Fps is (10 sec: 5327.0, 60 sec: 5551.0, 300 sec: 5542.6). Total num frames: 1171567616. Throughput: 0: 4893.0. Samples: 1171561924. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:08,158][25689] Avg episode reward: [(0, '-1.687')] [2022-07-11 10:02:09,356][26022] Updated weights on worker 0-0, policy_version 1144116 (0.00089) [2022-07-11 10:02:11,352][26022] Updated weights on worker 0-0, policy_version 1144126 (0.00083) [2022-07-11 10:02:13,158][26022] Updated weights on worker 0-0, policy_version 1144136 (0.00077) [2022-07-11 10:02:13,255][25689] Fps is (10 sec: 5646.2, 60 sec: 5576.3, 300 sec: 5537.9). Total num frames: 1171595264. Throughput: 0: 5695.1. Samples: 1171595364. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:13,255][25689] Avg episode reward: [(0, '-2.269')] [2022-07-11 10:02:14,987][26022] Updated weights on worker 0-0, policy_version 1144146 (0.00617) [2022-07-11 10:02:16,879][26022] Updated weights on worker 0-0, policy_version 1144156 (0.00082) [2022-07-11 10:02:18,257][25689] Fps is (10 sec: 5373.6, 60 sec: 5526.5, 300 sec: 5536.8). Total num frames: 1171621888. Throughput: 0: 5695.0. Samples: 1171628996. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:18,258][25689] Avg episode reward: [(0, '-1.458')] [2022-07-11 10:02:18,859][26022] Updated weights on worker 0-0, policy_version 1144166 (0.00089) [2022-07-11 10:02:20,374][26022] Updated weights on worker 0-0, policy_version 1144176 (0.00085) [2022-07-11 10:02:22,524][26022] Updated weights on worker 0-0, policy_version 1144186 (0.00086) [2022-07-11 10:02:23,388][25689] Fps is (10 sec: 5659.3, 60 sec: 5570.4, 300 sec: 5544.8). Total num frames: 1171652608. Throughput: 0: 4935.0. Samples: 1171645538. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:23,388][25689] Avg episode reward: [(0, '-0.639')] [2022-07-11 10:02:24,192][26022] Updated weights on worker 0-0, policy_version 1144196 (0.00393) [2022-07-11 10:02:26,183][26022] Updated weights on worker 0-0, policy_version 1144206 (0.00085) [2022-07-11 10:02:28,004][26022] Updated weights on worker 0-0, policy_version 1144216 (0.00094) [2022-07-11 10:02:28,438][25689] Fps is (10 sec: 5632.5, 60 sec: 5533.7, 300 sec: 5540.6). Total num frames: 1171679232. Throughput: 0: 5760.7. Samples: 1171678756. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:28,439][25689] Avg episode reward: [(0, '-1.310')] [2022-07-11 10:02:29,771][26022] Updated weights on worker 0-0, policy_version 1144226 (0.00084) [2022-07-11 10:02:31,641][26022] Updated weights on worker 0-0, policy_version 1144236 (0.00090) [2022-07-11 10:02:33,455][25689] Fps is (10 sec: 5390.8, 60 sec: 5518.9, 300 sec: 5537.1). Total num frames: 1171706880. Throughput: 0: 5780.9. Samples: 1171712140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:33,457][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 10:02:33,615][26022] Updated weights on worker 0-0, policy_version 1144246 (0.00095) [2022-07-11 10:02:35,462][26022] Updated weights on worker 0-0, policy_version 1144256 (0.00101) [2022-07-11 10:02:37,234][26022] Updated weights on worker 0-0, policy_version 1144266 (0.00082) [2022-07-11 10:02:38,490][25689] Fps is (10 sec: 5603.0, 60 sec: 5552.5, 300 sec: 5537.7). Total num frames: 1171735552. Throughput: 0: 4942.6. Samples: 1171728998. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:38,492][25689] Avg episode reward: [(0, '-0.606')] [2022-07-11 10:02:38,866][26022] Updated weights on worker 0-0, policy_version 1144276 (0.00091) [2022-07-11 10:02:40,844][26022] Updated weights on worker 0-0, policy_version 1144286 (0.00085) [2022-07-11 10:02:42,747][26022] Updated weights on worker 0-0, policy_version 1144296 (0.00089) [2022-07-11 10:02:43,571][25689] Fps is (10 sec: 5668.8, 60 sec: 5539.0, 300 sec: 5543.1). Total num frames: 1171764224. Throughput: 0: 5806.8. Samples: 1171762738. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:43,573][25689] Avg episode reward: [(0, '-1.622')] [2022-07-11 10:02:44,567][26022] Updated weights on worker 0-0, policy_version 1144306 (0.00089) [2022-07-11 10:02:46,239][26022] Updated weights on worker 0-0, policy_version 1144316 (0.00094) [2022-07-11 10:02:48,134][26022] Updated weights on worker 0-0, policy_version 1144326 (0.00083) [2022-07-11 10:02:48,590][25689] Fps is (10 sec: 5576.4, 60 sec: 5554.5, 300 sec: 5536.5). Total num frames: 1171791872. Throughput: 0: 5848.3. Samples: 1171796608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:48,590][25689] Avg episode reward: [(0, '-1.288')] [2022-07-11 10:02:49,806][26022] Updated weights on worker 0-0, policy_version 1144336 (0.00088) [2022-07-11 10:02:51,852][26022] Updated weights on worker 0-0, policy_version 1144346 (0.00091) [2022-07-11 10:02:53,457][26022] Updated weights on worker 0-0, policy_version 1144356 (0.00085) [2022-07-11 10:02:53,592][25689] Fps is (10 sec: 5620.1, 60 sec: 5572.8, 300 sec: 5536.6). Total num frames: 1171820544. Throughput: 0: 5026.7. Samples: 1171813362. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:53,593][25689] Avg episode reward: [(0, '-2.117')] [2022-07-11 10:02:55,412][26022] Updated weights on worker 0-0, policy_version 1144366 (0.00086) [2022-07-11 10:02:57,284][26022] Updated weights on worker 0-0, policy_version 1144376 (0.00088) [2022-07-11 10:02:58,603][25689] Fps is (10 sec: 5624.8, 60 sec: 5522.5, 300 sec: 5545.0). Total num frames: 1171848192. Throughput: 0: 5866.9. Samples: 1171846996. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:02:58,604][25689] Avg episode reward: [(0, '-2.242')] [2022-07-11 10:02:59,004][26022] Updated weights on worker 0-0, policy_version 1144386 (0.00088) [2022-07-11 10:03:00,975][26022] Updated weights on worker 0-0, policy_version 1144396 (0.00084) [2022-07-11 10:03:02,999][26022] Updated weights on worker 0-0, policy_version 1144406 (0.00099) [2022-07-11 10:03:03,683][25689] Fps is (10 sec: 5378.5, 60 sec: 5560.8, 300 sec: 5540.7). Total num frames: 1171874816. Throughput: 0: 5755.8. Samples: 1171878498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:03,685][25689] Avg episode reward: [(0, '-4.256')] [2022-07-11 10:03:04,835][26022] Updated weights on worker 0-0, policy_version 1144416 (0.00089) [2022-07-11 10:03:06,619][26022] Updated weights on worker 0-0, policy_version 1144426 (0.00082) [2022-07-11 10:03:08,565][26022] Updated weights on worker 0-0, policy_version 1144436 (0.00086) [2022-07-11 10:03:08,699][25689] Fps is (10 sec: 5375.2, 60 sec: 5530.9, 300 sec: 5537.2). Total num frames: 1171902464. Throughput: 0: 5755.8. Samples: 1171912354. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:08,700][25689] Avg episode reward: [(0, '-5.436')] [2022-07-11 10:03:10,353][26022] Updated weights on worker 0-0, policy_version 1144446 (0.00084) [2022-07-11 10:03:12,238][26022] Updated weights on worker 0-0, policy_version 1144456 (0.00083) [2022-07-11 10:03:13,716][25689] Fps is (10 sec: 5613.6, 60 sec: 5555.2, 300 sec: 5541.0). Total num frames: 1171931136. Throughput: 0: 5748.7. Samples: 1171929046. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:13,716][25689] Avg episode reward: [(0, '-4.159')] [2022-07-11 10:03:14,005][26022] Updated weights on worker 0-0, policy_version 1144466 (0.00084) [2022-07-11 10:03:15,982][26022] Updated weights on worker 0-0, policy_version 1144476 (0.00084) [2022-07-11 10:03:17,743][26022] Updated weights on worker 0-0, policy_version 1144486 (0.00095) [2022-07-11 10:03:18,732][25689] Fps is (10 sec: 5715.9, 60 sec: 5587.8, 300 sec: 5541.9). Total num frames: 1171959808. Throughput: 0: 5737.1. Samples: 1171962480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:18,732][25689] Avg episode reward: [(0, '-4.533')] [2022-07-11 10:03:19,851][26022] Updated weights on worker 0-0, policy_version 1144496 (0.00053) [2022-07-11 10:03:21,392][26022] Updated weights on worker 0-0, policy_version 1144506 (0.00092) [2022-07-11 10:03:23,465][26022] Updated weights on worker 0-0, policy_version 1144516 (0.00090) [2022-07-11 10:03:23,831][25689] Fps is (10 sec: 5365.1, 60 sec: 5506.0, 300 sec: 5536.9). Total num frames: 1171985408. Throughput: 0: 5817.2. Samples: 1171995706. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:23,832][25689] Avg episode reward: [(0, '-3.305')] [2022-07-11 10:03:25,118][26022] Updated weights on worker 0-0, policy_version 1144526 (0.00089) [2022-07-11 10:03:27,035][26022] Updated weights on worker 0-0, policy_version 1144536 (0.00089) [2022-07-11 10:03:28,835][26022] Updated weights on worker 0-0, policy_version 1144546 (0.00084) [2022-07-11 10:03:28,854][25689] Fps is (10 sec: 5462.9, 60 sec: 5559.4, 300 sec: 5540.4). Total num frames: 1172015104. Throughput: 0: 4955.6. Samples: 1172012232. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:28,854][25689] Avg episode reward: [(0, '-2.681')] [2022-07-11 10:03:30,639][26022] Updated weights on worker 0-0, policy_version 1144556 (0.00083) [2022-07-11 10:03:32,553][26022] Updated weights on worker 0-0, policy_version 1144566 (0.00093) [2022-07-11 10:03:33,871][25689] Fps is (10 sec: 5711.5, 60 sec: 5559.3, 300 sec: 5537.3). Total num frames: 1172042752. Throughput: 0: 5798.4. Samples: 1172045918. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:33,872][25689] Avg episode reward: [(0, '-2.732')] [2022-07-11 10:03:34,157][26022] Updated weights on worker 0-0, policy_version 1144576 (0.00087) [2022-07-11 10:03:36,075][26022] Updated weights on worker 0-0, policy_version 1144586 (0.00086) [2022-07-11 10:03:38,074][26022] Updated weights on worker 0-0, policy_version 1144596 (0.00088) [2022-07-11 10:03:38,900][25689] Fps is (10 sec: 5504.2, 60 sec: 5542.9, 300 sec: 5542.7). Total num frames: 1172070400. Throughput: 0: 5787.3. Samples: 1172079200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:38,901][25689] Avg episode reward: [(0, '-0.558')] [2022-07-11 10:03:39,818][26022] Updated weights on worker 0-0, policy_version 1144606 (0.00088) [2022-07-11 10:03:41,773][26022] Updated weights on worker 0-0, policy_version 1144616 (0.00082) [2022-07-11 10:03:43,619][26022] Updated weights on worker 0-0, policy_version 1144626 (0.00084) [2022-07-11 10:03:43,978][25689] Fps is (10 sec: 5471.1, 60 sec: 5526.2, 300 sec: 5537.8). Total num frames: 1172098048. Throughput: 0: 4966.6. Samples: 1172095768. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:43,979][25689] Avg episode reward: [(0, '-0.449')] [2022-07-11 10:03:45,415][26022] Updated weights on worker 0-0, policy_version 1144636 (0.00090) [2022-07-11 10:03:47,350][26022] Updated weights on worker 0-0, policy_version 1144646 (0.00089) [2022-07-11 10:03:48,894][26022] Updated weights on worker 0-0, policy_version 1144656 (0.00097) [2022-07-11 10:03:48,992][25689] Fps is (10 sec: 5682.0, 60 sec: 5560.6, 300 sec: 5544.9). Total num frames: 1172127744. Throughput: 0: 5816.4. Samples: 1172129364. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:48,993][25689] Avg episode reward: [(0, '-0.044')] [2022-07-11 10:03:51,088][26022] Updated weights on worker 0-0, policy_version 1144666 (0.00093) [2022-07-11 10:03:52,906][26022] Updated weights on worker 0-0, policy_version 1144676 (0.00093) [2022-07-11 10:03:54,004][25689] Fps is (10 sec: 5413.2, 60 sec: 5491.9, 300 sec: 5528.9). Total num frames: 1172152320. Throughput: 0: 5775.2. Samples: 1172162188. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:54,006][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 10:03:54,727][26022] Updated weights on worker 0-0, policy_version 1144686 (0.00091) [2022-07-11 10:03:56,303][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:03:56,318][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001144694_1172166656.pth [2022-07-11 10:03:56,319][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001142746_1170171904.pth [2022-07-11 10:03:56,668][26022] Updated weights on worker 0-0, policy_version 1144696 (0.00087) [2022-07-11 10:03:58,300][26022] Updated weights on worker 0-0, policy_version 1144706 (0.00088) [2022-07-11 10:03:59,025][25689] Fps is (10 sec: 5511.3, 60 sec: 5541.8, 300 sec: 5540.9). Total num frames: 1172183040. Throughput: 0: 4951.3. Samples: 1172178846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:03:59,026][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 10:04:00,387][26022] Updated weights on worker 0-0, policy_version 1144716 (0.00089) [2022-07-11 10:04:02,122][26022] Updated weights on worker 0-0, policy_version 1144726 (0.00094) [2022-07-11 10:04:04,143][25689] Fps is (10 sec: 5554.9, 60 sec: 5521.4, 300 sec: 5532.2). Total num frames: 1172208640. Throughput: 0: 5689.6. Samples: 1172210498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:04,145][25689] Avg episode reward: [(0, '1.067')] [2022-07-11 10:04:04,260][26022] Updated weights on worker 0-0, policy_version 1144736 (0.00090) [2022-07-11 10:04:05,990][26022] Updated weights on worker 0-0, policy_version 1144746 (0.00084) [2022-07-11 10:04:08,024][26022] Updated weights on worker 0-0, policy_version 1144756 (0.00090) [2022-07-11 10:04:09,176][25689] Fps is (10 sec: 5346.7, 60 sec: 5536.8, 300 sec: 5535.5). Total num frames: 1172237312. Throughput: 0: 5701.5. Samples: 1172244442. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:09,177][25689] Avg episode reward: [(0, '1.154')] [2022-07-11 10:04:09,555][26022] Updated weights on worker 0-0, policy_version 1144766 (0.00088) [2022-07-11 10:04:11,643][26022] Updated weights on worker 0-0, policy_version 1144776 (0.00094) [2022-07-11 10:04:13,334][26022] Updated weights on worker 0-0, policy_version 1144786 (0.00079) [2022-07-11 10:04:14,191][25689] Fps is (10 sec: 5604.9, 60 sec: 5520.0, 300 sec: 5539.3). Total num frames: 1172264960. Throughput: 0: 4902.7. Samples: 1172261160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:14,192][25689] Avg episode reward: [(0, '1.075')] [2022-07-11 10:04:15,303][26022] Updated weights on worker 0-0, policy_version 1144796 (0.00085) [2022-07-11 10:04:16,998][26022] Updated weights on worker 0-0, policy_version 1144806 (0.00086) [2022-07-11 10:04:18,784][26022] Updated weights on worker 0-0, policy_version 1144816 (0.00089) [2022-07-11 10:04:19,210][25689] Fps is (10 sec: 5612.9, 60 sec: 5519.8, 300 sec: 5541.2). Total num frames: 1172293632. Throughput: 0: 5749.7. Samples: 1172294902. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:19,211][25689] Avg episode reward: [(0, '0.836')] [2022-07-11 10:04:20,644][26022] Updated weights on worker 0-0, policy_version 1144826 (0.00093) [2022-07-11 10:04:22,616][26022] Updated weights on worker 0-0, policy_version 1144836 (0.00085) [2022-07-11 10:04:24,257][25689] Fps is (10 sec: 5595.0, 60 sec: 5558.4, 300 sec: 5537.2). Total num frames: 1172321280. Throughput: 0: 5846.9. Samples: 1172328106. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:24,258][25689] Avg episode reward: [(0, '0.637')] [2022-07-11 10:04:24,334][26022] Updated weights on worker 0-0, policy_version 1144846 (0.00087) [2022-07-11 10:04:26,398][26022] Updated weights on worker 0-0, policy_version 1144856 (0.00087) [2022-07-11 10:04:27,954][26022] Updated weights on worker 0-0, policy_version 1144866 (0.00105) [2022-07-11 10:04:29,280][25689] Fps is (10 sec: 5389.5, 60 sec: 5507.6, 300 sec: 5531.2). Total num frames: 1172347904. Throughput: 0: 4995.7. Samples: 1172344876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:29,281][25689] Avg episode reward: [(0, '-0.922')] [2022-07-11 10:04:29,873][26022] Updated weights on worker 0-0, policy_version 1144876 (0.00080) [2022-07-11 10:04:31,666][26022] Updated weights on worker 0-0, policy_version 1144886 (0.00085) [2022-07-11 10:04:33,526][26022] Updated weights on worker 0-0, policy_version 1144896 (0.00086) [2022-07-11 10:04:34,308][25689] Fps is (10 sec: 5705.6, 60 sec: 5557.4, 300 sec: 5545.2). Total num frames: 1172378624. Throughput: 0: 5836.2. Samples: 1172378566. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2022-07-11 10:04:34,309][25689] Avg episode reward: [(0, '-1.353')] [2022-07-11 10:04:35,351][26022] Updated weights on worker 0-0, policy_version 1144906 (0.00086) [2022-07-11 10:04:37,128][26022] Updated weights on worker 0-0, policy_version 1144916 (0.00091) [2022-07-11 10:04:39,165][26022] Updated weights on worker 0-0, policy_version 1144926 (0.00088) [2022-07-11 10:04:39,360][25689] Fps is (10 sec: 5587.3, 60 sec: 5521.4, 300 sec: 5532.6). Total num frames: 1172404224. Throughput: 0: 5806.8. Samples: 1172411910. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:04:39,361][25689] Avg episode reward: [(0, '-1.817')] [2022-07-11 10:04:40,760][26022] Updated weights on worker 0-0, policy_version 1144936 (0.00086) [2022-07-11 10:04:42,655][26022] Updated weights on worker 0-0, policy_version 1144946 (0.00089) [2022-07-11 10:04:44,451][25689] Fps is (10 sec: 5552.4, 60 sec: 5571.0, 300 sec: 5545.2). Total num frames: 1172434944. Throughput: 0: 4978.6. Samples: 1172428644. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:04:44,452][25689] Avg episode reward: [(0, '-1.642')] [2022-07-11 10:04:44,459][26022] Updated weights on worker 0-0, policy_version 1144956 (0.00095) [2022-07-11 10:04:46,422][26022] Updated weights on worker 0-0, policy_version 1144966 (0.00086) [2022-07-11 10:04:48,243][26022] Updated weights on worker 0-0, policy_version 1144976 (0.00089) [2022-07-11 10:04:49,481][25689] Fps is (10 sec: 5666.1, 60 sec: 5518.8, 300 sec: 5546.2). Total num frames: 1172461568. Throughput: 0: 5810.0. Samples: 1172462242. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:04:49,481][25689] Avg episode reward: [(0, '-1.450')] [2022-07-11 10:04:50,097][26022] Updated weights on worker 0-0, policy_version 1144986 (0.00096) [2022-07-11 10:04:51,868][26022] Updated weights on worker 0-0, policy_version 1144996 (0.00092) [2022-07-11 10:04:53,867][26022] Updated weights on worker 0-0, policy_version 1145006 (0.00091) [2022-07-11 10:04:54,550][25689] Fps is (10 sec: 5374.2, 60 sec: 5564.3, 300 sec: 5539.7). Total num frames: 1172489216. Throughput: 0: 5778.0. Samples: 1172495524. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:04:54,551][25689] Avg episode reward: [(0, '-1.895')] [2022-07-11 10:04:55,514][26022] Updated weights on worker 0-0, policy_version 1145016 (0.00094) [2022-07-11 10:04:57,592][26022] Updated weights on worker 0-0, policy_version 1145026 (0.00083) [2022-07-11 10:04:59,264][26022] Updated weights on worker 0-0, policy_version 1145036 (0.00087) [2022-07-11 10:04:59,555][25689] Fps is (10 sec: 5590.2, 60 sec: 5531.9, 300 sec: 5547.8). Total num frames: 1172517888. Throughput: 0: 4966.0. Samples: 1172512200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:04:59,556][25689] Avg episode reward: [(0, '-0.511')] [2022-07-11 10:05:01,242][26022] Updated weights on worker 0-0, policy_version 1145046 (0.00087) [2022-07-11 10:05:03,562][26022] Updated weights on worker 0-0, policy_version 1145056 (0.00085) [2022-07-11 10:05:04,637][25689] Fps is (10 sec: 5482.0, 60 sec: 5552.2, 300 sec: 5546.5). Total num frames: 1172544512. Throughput: 0: 5691.4. Samples: 1172543528. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:04,637][25689] Avg episode reward: [(0, '-0.430')] [2022-07-11 10:05:05,118][26022] Updated weights on worker 0-0, policy_version 1145066 (0.00097) [2022-07-11 10:05:07,048][26022] Updated weights on worker 0-0, policy_version 1145076 (0.00092) [2022-07-11 10:05:08,994][26022] Updated weights on worker 0-0, policy_version 1145086 (0.00085) [2022-07-11 10:05:09,653][25689] Fps is (10 sec: 5273.4, 60 sec: 5519.9, 300 sec: 5536.3). Total num frames: 1172571136. Throughput: 0: 5698.4. Samples: 1172577192. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:09,653][25689] Avg episode reward: [(0, '0.484')] [2022-07-11 10:05:10,571][26022] Updated weights on worker 0-0, policy_version 1145096 (0.00091) [2022-07-11 10:05:12,611][26022] Updated weights on worker 0-0, policy_version 1145106 (0.00087) [2022-07-11 10:05:14,307][26022] Updated weights on worker 0-0, policy_version 1145116 (0.00083) [2022-07-11 10:05:14,680][25689] Fps is (10 sec: 5505.6, 60 sec: 5535.7, 300 sec: 5539.5). Total num frames: 1172599808. Throughput: 0: 4897.5. Samples: 1172594112. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:14,681][25689] Avg episode reward: [(0, '0.244')] [2022-07-11 10:05:16,100][26022] Updated weights on worker 0-0, policy_version 1145126 (0.00087) [2022-07-11 10:05:17,908][26022] Updated weights on worker 0-0, policy_version 1145136 (0.00099) [2022-07-11 10:05:19,703][25689] Fps is (10 sec: 5705.9, 60 sec: 5535.3, 300 sec: 5540.1). Total num frames: 1172628480. Throughput: 0: 5736.5. Samples: 1172627776. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:19,703][25689] Avg episode reward: [(0, '-0.780')] [2022-07-11 10:05:19,823][26022] Updated weights on worker 0-0, policy_version 1145146 (0.00083) [2022-07-11 10:05:21,530][26022] Updated weights on worker 0-0, policy_version 1145156 (0.00086) [2022-07-11 10:05:23,770][26022] Updated weights on worker 0-0, policy_version 1145166 (0.00090) [2022-07-11 10:05:24,785][25689] Fps is (10 sec: 5675.3, 60 sec: 5549.1, 300 sec: 5549.4). Total num frames: 1172657152. Throughput: 0: 5850.0. Samples: 1172661392. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:24,785][25689] Avg episode reward: [(0, '-2.147')] [2022-07-11 10:05:25,111][26022] Updated weights on worker 0-0, policy_version 1145176 (0.00084) [2022-07-11 10:05:27,400][26022] Updated weights on worker 0-0, policy_version 1145186 (0.00088) [2022-07-11 10:05:28,892][26022] Updated weights on worker 0-0, policy_version 1145196 (0.00086) [2022-07-11 10:05:29,815][25689] Fps is (10 sec: 5569.4, 60 sec: 5565.3, 300 sec: 5542.1). Total num frames: 1172684800. Throughput: 0: 5012.3. Samples: 1172678250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:29,816][25689] Avg episode reward: [(0, '-2.542')] [2022-07-11 10:05:31,035][26022] Updated weights on worker 0-0, policy_version 1145206 (0.00085) [2022-07-11 10:05:32,524][26022] Updated weights on worker 0-0, policy_version 1145216 (0.00088) [2022-07-11 10:05:34,467][26022] Updated weights on worker 0-0, policy_version 1145226 (0.00087) [2022-07-11 10:05:34,831][25689] Fps is (10 sec: 5606.3, 60 sec: 5532.6, 300 sec: 5545.5). Total num frames: 1172713472. Throughput: 0: 5845.6. Samples: 1172711902. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:34,831][25689] Avg episode reward: [(0, '-4.115')] [2022-07-11 10:05:36,213][26022] Updated weights on worker 0-0, policy_version 1145236 (0.00092) [2022-07-11 10:05:38,135][26022] Updated weights on worker 0-0, policy_version 1145246 (0.00088) [2022-07-11 10:05:39,807][26022] Updated weights on worker 0-0, policy_version 1145256 (0.00098) [2022-07-11 10:05:39,908][25689] Fps is (10 sec: 5681.6, 60 sec: 5581.0, 300 sec: 5553.3). Total num frames: 1172742144. Throughput: 0: 5826.5. Samples: 1172745502. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:39,909][25689] Avg episode reward: [(0, '-5.090')] [2022-07-11 10:05:41,986][26022] Updated weights on worker 0-0, policy_version 1145266 (0.00082) [2022-07-11 10:05:43,478][26022] Updated weights on worker 0-0, policy_version 1145276 (0.00089) [2022-07-11 10:05:44,999][25689] Fps is (10 sec: 5438.1, 60 sec: 5513.4, 300 sec: 5541.5). Total num frames: 1172768768. Throughput: 0: 5812.3. Samples: 1172778882. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:44,999][25689] Avg episode reward: [(0, '-4.861')] [2022-07-11 10:05:45,641][26022] Updated weights on worker 0-0, policy_version 1145286 (0.00092) [2022-07-11 10:05:47,380][26022] Updated weights on worker 0-0, policy_version 1145296 (0.00090) [2022-07-11 10:05:49,430][26022] Updated weights on worker 0-0, policy_version 1145306 (0.00094) [2022-07-11 10:05:50,013][25689] Fps is (10 sec: 5573.7, 60 sec: 5565.6, 300 sec: 5544.8). Total num frames: 1172798464. Throughput: 0: 5803.7. Samples: 1172795470. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:50,013][25689] Avg episode reward: [(0, '-3.577')] [2022-07-11 10:05:51,166][26022] Updated weights on worker 0-0, policy_version 1145316 (0.00091) [2022-07-11 10:05:52,921][26022] Updated weights on worker 0-0, policy_version 1145326 (0.00086) [2022-07-11 10:05:54,883][26022] Updated weights on worker 0-0, policy_version 1145336 (0.00082) [2022-07-11 10:05:55,050][25689] Fps is (10 sec: 5603.0, 60 sec: 5551.6, 300 sec: 5544.4). Total num frames: 1172825088. Throughput: 0: 5761.6. Samples: 1172828402. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:05:55,051][25689] Avg episode reward: [(0, '-2.708')] [2022-07-11 10:05:56,442][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:05:56,451][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001145345_1172833280.pth [2022-07-11 10:05:56,451][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001143393_1170834432.pth [2022-07-11 10:05:56,544][26022] Updated weights on worker 0-0, policy_version 1145346 (0.00081) [2022-07-11 10:05:58,520][26022] Updated weights on worker 0-0, policy_version 1145356 (0.00088) [2022-07-11 10:06:00,086][25689] Fps is (10 sec: 5489.4, 60 sec: 5548.8, 300 sec: 5552.4). Total num frames: 1172853760. Throughput: 0: 5770.3. Samples: 1172861934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:00,086][25689] Avg episode reward: [(0, '-2.009')] [2022-07-11 10:06:00,250][26022] Updated weights on worker 0-0, policy_version 1145366 (0.00095) [2022-07-11 10:06:02,593][26022] Updated weights on worker 0-0, policy_version 1145376 (0.00085) [2022-07-11 10:06:04,193][26022] Updated weights on worker 0-0, policy_version 1145386 (0.00082) [2022-07-11 10:06:05,200][25689] Fps is (10 sec: 5448.1, 60 sec: 5545.8, 300 sec: 5540.7). Total num frames: 1172880384. Throughput: 0: 4843.2. Samples: 1172876722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:05,201][25689] Avg episode reward: [(0, '-2.161')] [2022-07-11 10:06:06,168][26022] Updated weights on worker 0-0, policy_version 1145396 (0.00396) [2022-07-11 10:06:07,868][26022] Updated weights on worker 0-0, policy_version 1145406 (0.00082) [2022-07-11 10:06:10,037][26022] Updated weights on worker 0-0, policy_version 1145416 (0.00094) [2022-07-11 10:06:10,225][25689] Fps is (10 sec: 5150.8, 60 sec: 5528.1, 300 sec: 5540.4). Total num frames: 1172905984. Throughput: 0: 5677.5. Samples: 1172910224. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:10,226][25689] Avg episode reward: [(0, '-0.330')] [2022-07-11 10:06:11,619][26022] Updated weights on worker 0-0, policy_version 1145426 (0.00088) [2022-07-11 10:06:13,686][26022] Updated weights on worker 0-0, policy_version 1145436 (0.00087) [2022-07-11 10:06:15,309][25689] Fps is (10 sec: 5469.7, 60 sec: 5539.8, 300 sec: 5539.0). Total num frames: 1172935680. Throughput: 0: 5667.0. Samples: 1172943210. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:15,310][25689] Avg episode reward: [(0, '-0.736')] [2022-07-11 10:06:15,370][26022] Updated weights on worker 0-0, policy_version 1145446 (0.00095) [2022-07-11 10:06:17,364][26022] Updated weights on worker 0-0, policy_version 1145456 (0.00099) [2022-07-11 10:06:19,215][26022] Updated weights on worker 0-0, policy_version 1145466 (0.00088) [2022-07-11 10:06:20,339][25689] Fps is (10 sec: 5771.2, 60 sec: 5539.2, 300 sec: 5542.9). Total num frames: 1172964352. Throughput: 0: 4825.0. Samples: 1172959652. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:20,339][25689] Avg episode reward: [(0, '-0.850')] [2022-07-11 10:06:21,068][26022] Updated weights on worker 0-0, policy_version 1145476 (0.00091) [2022-07-11 10:06:22,949][26022] Updated weights on worker 0-0, policy_version 1145486 (0.00092) [2022-07-11 10:06:24,704][26022] Updated weights on worker 0-0, policy_version 1145496 (0.00103) [2022-07-11 10:06:25,430][25689] Fps is (10 sec: 5463.7, 60 sec: 5504.5, 300 sec: 5534.7). Total num frames: 1172990976. Throughput: 0: 5756.6. Samples: 1172993178. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:25,431][25689] Avg episode reward: [(0, '-0.182')] [2022-07-11 10:06:26,631][26022] Updated weights on worker 0-0, policy_version 1145506 (0.00085) [2022-07-11 10:06:28,352][26022] Updated weights on worker 0-0, policy_version 1145516 (0.00084) [2022-07-11 10:06:30,227][26022] Updated weights on worker 0-0, policy_version 1145526 (0.00094) [2022-07-11 10:06:30,486][25689] Fps is (10 sec: 5449.4, 60 sec: 5519.1, 300 sec: 5534.4). Total num frames: 1173019648. Throughput: 0: 5738.1. Samples: 1173026482. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:30,486][25689] Avg episode reward: [(0, '0.847')] [2022-07-11 10:06:32,081][26022] Updated weights on worker 0-0, policy_version 1145536 (0.00085) [2022-07-11 10:06:33,944][26022] Updated weights on worker 0-0, policy_version 1145546 (0.00091) [2022-07-11 10:06:35,505][25689] Fps is (10 sec: 5589.9, 60 sec: 5501.8, 300 sec: 5538.1). Total num frames: 1173047296. Throughput: 0: 4946.0. Samples: 1173043100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:35,506][25689] Avg episode reward: [(0, '0.009')] [2022-07-11 10:06:35,796][26022] Updated weights on worker 0-0, policy_version 1145556 (0.00091) [2022-07-11 10:06:37,628][26022] Updated weights on worker 0-0, policy_version 1145566 (0.00087) [2022-07-11 10:06:39,460][26022] Updated weights on worker 0-0, policy_version 1145576 (0.00089) [2022-07-11 10:06:40,606][25689] Fps is (10 sec: 5564.9, 60 sec: 5499.7, 300 sec: 5535.0). Total num frames: 1173075968. Throughput: 0: 5776.6. Samples: 1173076732. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:40,607][25689] Avg episode reward: [(0, '-0.029')] [2022-07-11 10:06:41,075][26022] Updated weights on worker 0-0, policy_version 1145586 (0.00091) [2022-07-11 10:06:43,315][26022] Updated weights on worker 0-0, policy_version 1145596 (0.00088) [2022-07-11 10:06:44,914][26022] Updated weights on worker 0-0, policy_version 1145606 (0.00080) [2022-07-11 10:06:45,744][25689] Fps is (10 sec: 5600.5, 60 sec: 5529.1, 300 sec: 5539.4). Total num frames: 1173104640. Throughput: 0: 5755.9. Samples: 1173110106. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:45,745][25689] Avg episode reward: [(0, '0.234')] [2022-07-11 10:06:46,869][26022] Updated weights on worker 0-0, policy_version 1145616 (0.00086) [2022-07-11 10:06:48,547][26022] Updated weights on worker 0-0, policy_version 1145626 (0.00085) [2022-07-11 10:06:50,633][26022] Updated weights on worker 0-0, policy_version 1145636 (0.00094) [2022-07-11 10:06:50,759][25689] Fps is (10 sec: 5547.5, 60 sec: 5495.4, 300 sec: 5539.4). Total num frames: 1173132288. Throughput: 0: 4955.9. Samples: 1173126952. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:50,759][25689] Avg episode reward: [(0, '-0.509')] [2022-07-11 10:06:52,115][26022] Updated weights on worker 0-0, policy_version 1145646 (0.00090) [2022-07-11 10:06:54,322][26022] Updated weights on worker 0-0, policy_version 1145656 (0.00077) [2022-07-11 10:06:55,770][25689] Fps is (10 sec: 5719.5, 60 sec: 5548.4, 300 sec: 5536.0). Total num frames: 1173161984. Throughput: 0: 5790.0. Samples: 1173160434. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:06:55,772][25689] Avg episode reward: [(0, '-0.676')] [2022-07-11 10:06:55,776][26022] Updated weights on worker 0-0, policy_version 1145666 (0.00090) [2022-07-11 10:06:58,133][26022] Updated weights on worker 0-0, policy_version 1145676 (0.00093) [2022-07-11 10:06:59,527][26022] Updated weights on worker 0-0, policy_version 1145686 (0.00086) [2022-07-11 10:07:00,793][25689] Fps is (10 sec: 5612.7, 60 sec: 5515.8, 300 sec: 5544.9). Total num frames: 1173188608. Throughput: 0: 5820.6. Samples: 1173194230. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:00,794][25689] Avg episode reward: [(0, '-0.132')] [2022-07-11 10:07:01,905][26022] Updated weights on worker 0-0, policy_version 1145696 (0.00093) [2022-07-11 10:07:03,523][26022] Updated weights on worker 0-0, policy_version 1145706 (0.00081) [2022-07-11 10:07:05,503][26022] Updated weights on worker 0-0, policy_version 1145716 (0.00085) [2022-07-11 10:07:05,916][25689] Fps is (10 sec: 5248.5, 60 sec: 5515.0, 300 sec: 5533.4). Total num frames: 1173215232. Throughput: 0: 4907.8. Samples: 1173209104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:05,916][25689] Avg episode reward: [(0, '-0.142')] [2022-07-11 10:07:07,272][26022] Updated weights on worker 0-0, policy_version 1145726 (0.00107) [2022-07-11 10:07:09,052][26022] Updated weights on worker 0-0, policy_version 1145736 (0.00096) [2022-07-11 10:07:10,946][25689] Fps is (10 sec: 5345.3, 60 sec: 5548.2, 300 sec: 5534.6). Total num frames: 1173242880. Throughput: 0: 5737.0. Samples: 1173242768. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:10,947][25689] Avg episode reward: [(0, '-0.731')] [2022-07-11 10:07:10,955][26022] Updated weights on worker 0-0, policy_version 1145746 (0.00094) [2022-07-11 10:07:12,565][26022] Updated weights on worker 0-0, policy_version 1145756 (0.00087) [2022-07-11 10:07:14,574][26022] Updated weights on worker 0-0, policy_version 1145766 (0.01175) [2022-07-11 10:07:15,991][25689] Fps is (10 sec: 5691.7, 60 sec: 5551.9, 300 sec: 5544.2). Total num frames: 1173272576. Throughput: 0: 5740.5. Samples: 1173276510. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:15,991][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 10:07:16,355][26022] Updated weights on worker 0-0, policy_version 1145776 (0.00088) [2022-07-11 10:07:18,223][26022] Updated weights on worker 0-0, policy_version 1145786 (0.00083) [2022-07-11 10:07:19,825][26022] Updated weights on worker 0-0, policy_version 1145796 (0.00089) [2022-07-11 10:07:21,003][25689] Fps is (10 sec: 5702.3, 60 sec: 5536.6, 300 sec: 5536.1). Total num frames: 1173300224. Throughput: 0: 4912.3. Samples: 1173293506. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:21,003][25689] Avg episode reward: [(0, '1.022')] [2022-07-11 10:07:21,881][26022] Updated weights on worker 0-0, policy_version 1145806 (0.00091) [2022-07-11 10:07:23,598][26022] Updated weights on worker 0-0, policy_version 1145816 (0.00090) [2022-07-11 10:07:25,491][26022] Updated weights on worker 0-0, policy_version 1145826 (0.00083) [2022-07-11 10:07:26,060][25689] Fps is (10 sec: 5491.8, 60 sec: 5556.6, 300 sec: 5539.4). Total num frames: 1173327872. Throughput: 0: 5857.1. Samples: 1173327088. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:26,060][25689] Avg episode reward: [(0, '1.107')] [2022-07-11 10:07:27,399][26022] Updated weights on worker 0-0, policy_version 1145836 (0.00087) [2022-07-11 10:07:29,179][26022] Updated weights on worker 0-0, policy_version 1145846 (0.00085) [2022-07-11 10:07:30,996][26022] Updated weights on worker 0-0, policy_version 1145856 (0.00092) [2022-07-11 10:07:31,134][25689] Fps is (10 sec: 5559.1, 60 sec: 5554.9, 300 sec: 5541.8). Total num frames: 1173356544. Throughput: 0: 5824.7. Samples: 1173360354. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:31,134][25689] Avg episode reward: [(0, '1.305')] [2022-07-11 10:07:32,811][26022] Updated weights on worker 0-0, policy_version 1145866 (0.00085) [2022-07-11 10:07:34,806][26022] Updated weights on worker 0-0, policy_version 1145876 (0.00076) [2022-07-11 10:07:36,148][25689] Fps is (10 sec: 5582.5, 60 sec: 5555.4, 300 sec: 5538.7). Total num frames: 1173384192. Throughput: 0: 5825.8. Samples: 1173393944. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:36,149][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 10:07:36,681][26022] Updated weights on worker 0-0, policy_version 1145886 (0.00079) [2022-07-11 10:07:38,154][26022] Updated weights on worker 0-0, policy_version 1145896 (0.00099) [2022-07-11 10:07:40,262][26022] Updated weights on worker 0-0, policy_version 1145906 (0.00084) [2022-07-11 10:07:41,151][25689] Fps is (10 sec: 5622.5, 60 sec: 5564.5, 300 sec: 5540.2). Total num frames: 1173412864. Throughput: 0: 5822.3. Samples: 1173410814. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:41,151][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 10:07:42,107][26022] Updated weights on worker 0-0, policy_version 1145916 (0.00085) [2022-07-11 10:07:43,870][26022] Updated weights on worker 0-0, policy_version 1145926 (0.00096) [2022-07-11 10:07:45,892][26022] Updated weights on worker 0-0, policy_version 1145936 (0.00093) [2022-07-11 10:07:46,294][25689] Fps is (10 sec: 5551.0, 60 sec: 5547.0, 300 sec: 5537.8). Total num frames: 1173440512. Throughput: 0: 5776.4. Samples: 1173443970. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:46,295][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 10:07:47,474][26022] Updated weights on worker 0-0, policy_version 1145946 (0.00083) [2022-07-11 10:07:49,419][26022] Updated weights on worker 0-0, policy_version 1145956 (0.00089) [2022-07-11 10:07:51,159][26022] Updated weights on worker 0-0, policy_version 1145966 (0.00091) [2022-07-11 10:07:51,296][25689] Fps is (10 sec: 5551.2, 60 sec: 5565.1, 300 sec: 5537.8). Total num frames: 1173469184. Throughput: 0: 5806.9. Samples: 1173477436. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:51,297][25689] Avg episode reward: [(0, '-0.373')] [2022-07-11 10:07:53,147][26022] Updated weights on worker 0-0, policy_version 1145976 (0.00084) [2022-07-11 10:07:54,992][26022] Updated weights on worker 0-0, policy_version 1145986 (0.00088) [2022-07-11 10:07:56,313][25689] Fps is (10 sec: 5519.1, 60 sec: 5513.8, 300 sec: 5534.3). Total num frames: 1173495808. Throughput: 0: 4981.5. Samples: 1173494396. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:07:56,314][25689] Avg episode reward: [(0, '-0.018')] [2022-07-11 10:07:56,514][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:07:56,524][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001145994_1173497856.pth [2022-07-11 10:07:56,524][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001144044_1171501056.pth [2022-07-11 10:07:56,728][26022] Updated weights on worker 0-0, policy_version 1145996 (0.00084) [2022-07-11 10:07:58,609][26022] Updated weights on worker 0-0, policy_version 1146006 (0.00090) [2022-07-11 10:08:00,427][26022] Updated weights on worker 0-0, policy_version 1146016 (0.00084) [2022-07-11 10:08:01,327][25689] Fps is (10 sec: 5410.6, 60 sec: 5531.6, 300 sec: 5539.0). Total num frames: 1173523456. Throughput: 0: 5806.5. Samples: 1173527968. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:01,328][25689] Avg episode reward: [(0, '-0.671')] [2022-07-11 10:08:02,596][26022] Updated weights on worker 0-0, policy_version 1146026 (0.00092) [2022-07-11 10:08:04,370][26022] Updated weights on worker 0-0, policy_version 1146036 (0.00089) [2022-07-11 10:08:06,391][25689] Fps is (10 sec: 5385.4, 60 sec: 5537.0, 300 sec: 5534.6). Total num frames: 1173550080. Throughput: 0: 5751.3. Samples: 1173559552. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:06,392][25689] Avg episode reward: [(0, '-1.320')] [2022-07-11 10:08:06,407][26022] Updated weights on worker 0-0, policy_version 1146046 (0.00086) [2022-07-11 10:08:08,171][26022] Updated weights on worker 0-0, policy_version 1146056 (0.00427) [2022-07-11 10:08:10,030][26022] Updated weights on worker 0-0, policy_version 1146066 (0.00090) [2022-07-11 10:08:11,410][25689] Fps is (10 sec: 5585.4, 60 sec: 5571.8, 300 sec: 5538.0). Total num frames: 1173579776. Throughput: 0: 4917.7. Samples: 1173576352. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:11,411][25689] Avg episode reward: [(0, '-0.546')] [2022-07-11 10:08:12,015][26022] Updated weights on worker 0-0, policy_version 1146076 (0.00091) [2022-07-11 10:08:13,588][26022] Updated weights on worker 0-0, policy_version 1146086 (0.00087) [2022-07-11 10:08:15,545][26022] Updated weights on worker 0-0, policy_version 1146096 (0.00094) [2022-07-11 10:08:16,447][25689] Fps is (10 sec: 5499.0, 60 sec: 5504.9, 300 sec: 5527.3). Total num frames: 1173605376. Throughput: 0: 5742.9. Samples: 1173610020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:16,447][25689] Avg episode reward: [(0, '-0.475')] [2022-07-11 10:08:17,181][26022] Updated weights on worker 0-0, policy_version 1146106 (0.00085) [2022-07-11 10:08:19,015][26022] Updated weights on worker 0-0, policy_version 1146116 (0.00084) [2022-07-11 10:08:20,903][26022] Updated weights on worker 0-0, policy_version 1146126 (0.00089) [2022-07-11 10:08:21,454][25689] Fps is (10 sec: 5607.4, 60 sec: 5556.0, 300 sec: 5546.2). Total num frames: 1173636096. Throughput: 0: 5759.8. Samples: 1173643896. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:21,455][25689] Avg episode reward: [(0, '-0.687')] [2022-07-11 10:08:22,899][26022] Updated weights on worker 0-0, policy_version 1146136 (0.00088) [2022-07-11 10:08:24,561][26022] Updated weights on worker 0-0, policy_version 1146146 (0.00086) [2022-07-11 10:08:26,499][25689] Fps is (10 sec: 5704.5, 60 sec: 5540.2, 300 sec: 5535.5). Total num frames: 1173662720. Throughput: 0: 5033.8. Samples: 1173660772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:26,500][25689] Avg episode reward: [(0, '-0.534')] [2022-07-11 10:08:26,659][26022] Updated weights on worker 0-0, policy_version 1146156 (0.00085) [2022-07-11 10:08:28,140][26022] Updated weights on worker 0-0, policy_version 1146166 (0.00091) [2022-07-11 10:08:30,191][26022] Updated weights on worker 0-0, policy_version 1146176 (0.00095) [2022-07-11 10:08:31,501][25689] Fps is (10 sec: 5402.0, 60 sec: 5529.9, 300 sec: 5535.8). Total num frames: 1173690368. Throughput: 0: 5849.5. Samples: 1173693872. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:31,501][25689] Avg episode reward: [(0, '-0.624')] [2022-07-11 10:08:31,951][26022] Updated weights on worker 0-0, policy_version 1146186 (0.00082) [2022-07-11 10:08:33,775][26022] Updated weights on worker 0-0, policy_version 1146196 (0.00088) [2022-07-11 10:08:35,850][26022] Updated weights on worker 0-0, policy_version 1146206 (0.00088) [2022-07-11 10:08:36,602][25689] Fps is (10 sec: 5675.8, 60 sec: 5555.8, 300 sec: 5541.3). Total num frames: 1173720064. Throughput: 0: 5838.1. Samples: 1173727692. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 10:08:36,603][25689] Avg episode reward: [(0, '-0.264')] [2022-07-11 10:08:37,374][26022] Updated weights on worker 0-0, policy_version 1146216 (0.00088) [2022-07-11 10:08:39,313][26022] Updated weights on worker 0-0, policy_version 1146226 (0.01076) [2022-07-11 10:08:41,012][26022] Updated weights on worker 0-0, policy_version 1146236 (0.00998) [2022-07-11 10:08:41,652][25689] Fps is (10 sec: 5648.8, 60 sec: 5534.5, 300 sec: 5541.8). Total num frames: 1173747712. Throughput: 0: 4979.4. Samples: 1173744470. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:08:41,653][25689] Avg episode reward: [(0, '-0.034')] [2022-07-11 10:08:42,972][26022] Updated weights on worker 0-0, policy_version 1146246 (0.00095) [2022-07-11 10:08:44,962][26022] Updated weights on worker 0-0, policy_version 1146256 (0.00095) [2022-07-11 10:08:46,678][26022] Updated weights on worker 0-0, policy_version 1146266 (0.00085) [2022-07-11 10:08:46,700][25689] Fps is (10 sec: 5577.6, 60 sec: 5560.3, 300 sec: 5537.8). Total num frames: 1173776384. Throughput: 0: 5801.9. Samples: 1173777976. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:08:46,700][25689] Avg episode reward: [(0, '-0.117')] [2022-07-11 10:08:48,515][26022] Updated weights on worker 0-0, policy_version 1146276 (0.00096) [2022-07-11 10:08:50,235][26022] Updated weights on worker 0-0, policy_version 1146286 (0.00089) [2022-07-11 10:08:51,717][25689] Fps is (10 sec: 5595.6, 60 sec: 5541.9, 300 sec: 5548.0). Total num frames: 1173804032. Throughput: 0: 5795.9. Samples: 1173811044. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:08:51,718][25689] Avg episode reward: [(0, '0.271')] [2022-07-11 10:08:52,413][26022] Updated weights on worker 0-0, policy_version 1146296 (0.00083) [2022-07-11 10:08:54,102][26022] Updated weights on worker 0-0, policy_version 1146306 (0.00091) [2022-07-11 10:08:55,996][26022] Updated weights on worker 0-0, policy_version 1146316 (0.00107) [2022-07-11 10:08:56,735][25689] Fps is (10 sec: 5510.2, 60 sec: 5558.8, 300 sec: 5537.7). Total num frames: 1173831680. Throughput: 0: 4984.4. Samples: 1173828042. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:08:56,736][25689] Avg episode reward: [(0, '1.033')] [2022-07-11 10:08:57,732][26022] Updated weights on worker 0-0, policy_version 1146326 (0.00089) [2022-07-11 10:08:59,590][26022] Updated weights on worker 0-0, policy_version 1146336 (0.00093) [2022-07-11 10:09:01,635][26022] Updated weights on worker 0-0, policy_version 1146346 (0.00080) [2022-07-11 10:09:01,747][25689] Fps is (10 sec: 5513.5, 60 sec: 5559.0, 300 sec: 5546.6). Total num frames: 1173859328. Throughput: 0: 5821.2. Samples: 1173861442. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:01,747][25689] Avg episode reward: [(0, '1.437')] [2022-07-11 10:09:03,900][26022] Updated weights on worker 0-0, policy_version 1146356 (0.00086) [2022-07-11 10:09:05,776][26022] Updated weights on worker 0-0, policy_version 1146366 (0.00089) [2022-07-11 10:09:06,915][25689] Fps is (10 sec: 5231.0, 60 sec: 5532.5, 300 sec: 5533.8). Total num frames: 1173884928. Throughput: 0: 5607.5. Samples: 1173891330. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:06,915][25689] Avg episode reward: [(0, '1.324')] [2022-07-11 10:09:07,688][26022] Updated weights on worker 0-0, policy_version 1146376 (0.00095) [2022-07-11 10:09:09,768][26022] Updated weights on worker 0-0, policy_version 1146386 (0.00092) [2022-07-11 10:09:11,877][26022] Updated weights on worker 0-0, policy_version 1146396 (0.00082) [2022-07-11 10:09:11,933][25689] Fps is (10 sec: 4925.8, 60 sec: 5448.0, 300 sec: 5523.4). Total num frames: 1173909504. Throughput: 0: 4752.6. Samples: 1173907120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:11,933][25689] Avg episode reward: [(0, '1.398')] [2022-07-11 10:09:13,672][26022] Updated weights on worker 0-0, policy_version 1146406 (0.00087) [2022-07-11 10:09:15,726][26022] Updated weights on worker 0-0, policy_version 1146416 (0.00058) [2022-07-11 10:09:16,945][25689] Fps is (10 sec: 5206.4, 60 sec: 5484.0, 300 sec: 5520.1). Total num frames: 1173937152. Throughput: 0: 5452.9. Samples: 1173938246. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:16,946][25689] Avg episode reward: [(0, '1.608')] [2022-07-11 10:09:17,439][26022] Updated weights on worker 0-0, policy_version 1146426 (0.00083) [2022-07-11 10:09:19,252][26022] Updated weights on worker 0-0, policy_version 1146436 (0.00083) [2022-07-11 10:09:20,995][26022] Updated weights on worker 0-0, policy_version 1146446 (0.00087) [2022-07-11 10:09:21,982][25689] Fps is (10 sec: 5604.8, 60 sec: 5447.6, 300 sec: 5523.7). Total num frames: 1173965824. Throughput: 0: 5464.4. Samples: 1173972014. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:21,983][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 10:09:22,868][26022] Updated weights on worker 0-0, policy_version 1146456 (0.00088) [2022-07-11 10:09:24,720][26022] Updated weights on worker 0-0, policy_version 1146466 (0.00084) [2022-07-11 10:09:26,606][26022] Updated weights on worker 0-0, policy_version 1146476 (0.00780) [2022-07-11 10:09:27,044][25689] Fps is (10 sec: 5678.2, 60 sec: 5479.8, 300 sec: 5529.9). Total num frames: 1173994496. Throughput: 0: 4836.7. Samples: 1173988690. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:27,045][25689] Avg episode reward: [(0, '0.091')] [2022-07-11 10:09:28,578][26022] Updated weights on worker 0-0, policy_version 1146486 (0.00088) [2022-07-11 10:09:30,179][26022] Updated weights on worker 0-0, policy_version 1146496 (0.00110) [2022-07-11 10:09:32,092][25689] Fps is (10 sec: 5570.3, 60 sec: 5475.6, 300 sec: 5519.2). Total num frames: 1174022144. Throughput: 0: 5706.8. Samples: 1174022166. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:32,093][25689] Avg episode reward: [(0, '-1.447')] [2022-07-11 10:09:32,095][26022] Updated weights on worker 0-0, policy_version 1146506 (0.00085) [2022-07-11 10:09:33,767][26022] Updated weights on worker 0-0, policy_version 1146516 (0.00090) [2022-07-11 10:09:35,672][26022] Updated weights on worker 0-0, policy_version 1146526 (0.00085) [2022-07-11 10:09:37,161][25689] Fps is (10 sec: 5465.5, 60 sec: 5444.7, 300 sec: 5525.7). Total num frames: 1174049792. Throughput: 0: 5815.9. Samples: 1174055822. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:37,162][25689] Avg episode reward: [(0, '-2.414')] [2022-07-11 10:09:37,595][26022] Updated weights on worker 0-0, policy_version 1146536 (0.00083) [2022-07-11 10:09:39,336][26022] Updated weights on worker 0-0, policy_version 1146546 (0.00090) [2022-07-11 10:09:41,182][26022] Updated weights on worker 0-0, policy_version 1146556 (0.00087) [2022-07-11 10:09:42,168][25689] Fps is (10 sec: 5589.6, 60 sec: 5465.5, 300 sec: 5520.4). Total num frames: 1174078464. Throughput: 0: 4985.5. Samples: 1174072654. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:42,169][25689] Avg episode reward: [(0, '-2.607')] [2022-07-11 10:09:43,177][26022] Updated weights on worker 0-0, policy_version 1146566 (0.00101) [2022-07-11 10:09:44,916][26022] Updated weights on worker 0-0, policy_version 1146576 (0.00094) [2022-07-11 10:09:46,893][26022] Updated weights on worker 0-0, policy_version 1146586 (0.00105) [2022-07-11 10:09:47,241][25689] Fps is (10 sec: 5587.7, 60 sec: 5446.4, 300 sec: 5523.1). Total num frames: 1174106112. Throughput: 0: 5807.0. Samples: 1174105972. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:47,241][25689] Avg episode reward: [(0, '-0.562')] [2022-07-11 10:09:48,547][26022] Updated weights on worker 0-0, policy_version 1146596 (0.00086) [2022-07-11 10:09:50,607][26022] Updated weights on worker 0-0, policy_version 1146606 (0.00093) [2022-07-11 10:09:52,067][26022] Updated weights on worker 0-0, policy_version 1146616 (0.00089) [2022-07-11 10:09:52,259][25689] Fps is (10 sec: 5682.7, 60 sec: 5480.1, 300 sec: 5530.9). Total num frames: 1174135808. Throughput: 0: 5815.2. Samples: 1174139440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:52,260][25689] Avg episode reward: [(0, '-0.855')] [2022-07-11 10:09:54,191][26022] Updated weights on worker 0-0, policy_version 1146626 (0.00087) [2022-07-11 10:09:55,883][26022] Updated weights on worker 0-0, policy_version 1146636 (0.00087) [2022-07-11 10:09:56,758][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:09:56,772][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001146640_1174159360.pth [2022-07-11 10:09:56,773][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001144694_1172166656.pth [2022-07-11 10:09:57,283][25689] Fps is (10 sec: 5506.5, 60 sec: 5445.8, 300 sec: 5520.2). Total num frames: 1174161408. Throughput: 0: 4996.9. Samples: 1174156366. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:09:57,283][25689] Avg episode reward: [(0, '-0.693')] [2022-07-11 10:09:57,936][26022] Updated weights on worker 0-0, policy_version 1146646 (0.00088) [2022-07-11 10:09:59,577][26022] Updated weights on worker 0-0, policy_version 1146656 (0.00091) [2022-07-11 10:10:01,291][26022] Updated weights on worker 0-0, policy_version 1146666 (0.00090) [2022-07-11 10:10:02,313][25689] Fps is (10 sec: 5194.3, 60 sec: 5427.1, 300 sec: 5521.2). Total num frames: 1174188032. Throughput: 0: 5814.9. Samples: 1174189796. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:02,314][25689] Avg episode reward: [(0, '0.215')] [2022-07-11 10:10:03,691][26022] Updated weights on worker 0-0, policy_version 1146676 (0.00066) [2022-07-11 10:10:05,404][26022] Updated weights on worker 0-0, policy_version 1146686 (0.00091) [2022-07-11 10:10:07,379][25689] Fps is (10 sec: 5477.1, 60 sec: 5487.2, 300 sec: 5527.1). Total num frames: 1174216704. Throughput: 0: 5705.7. Samples: 1174220872. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:07,379][25689] Avg episode reward: [(0, '1.477')] [2022-07-11 10:10:07,379][26022] Updated weights on worker 0-0, policy_version 1146696 (0.00086) [2022-07-11 10:10:09,412][26022] Updated weights on worker 0-0, policy_version 1146706 (0.00094) [2022-07-11 10:10:11,016][26022] Updated weights on worker 0-0, policy_version 1146716 (0.00093) [2022-07-11 10:10:12,442][25689] Fps is (10 sec: 5459.4, 60 sec: 5516.9, 300 sec: 5519.6). Total num frames: 1174243328. Throughput: 0: 4856.3. Samples: 1174237454. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:12,442][25689] Avg episode reward: [(0, '1.458')] [2022-07-11 10:10:13,012][26022] Updated weights on worker 0-0, policy_version 1146726 (0.00090) [2022-07-11 10:10:14,854][26022] Updated weights on worker 0-0, policy_version 1146736 (0.00092) [2022-07-11 10:10:16,518][26022] Updated weights on worker 0-0, policy_version 1146746 (0.00089) [2022-07-11 10:10:17,475][25689] Fps is (10 sec: 5477.0, 60 sec: 5532.0, 300 sec: 5519.4). Total num frames: 1174272000. Throughput: 0: 5676.1. Samples: 1174270978. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:17,475][25689] Avg episode reward: [(0, '1.043')] [2022-07-11 10:10:18,477][26022] Updated weights on worker 0-0, policy_version 1146756 (0.00085) [2022-07-11 10:10:20,298][26022] Updated weights on worker 0-0, policy_version 1146766 (0.00091) [2022-07-11 10:10:21,961][26022] Updated weights on worker 0-0, policy_version 1146776 (0.00089) [2022-07-11 10:10:22,497][25689] Fps is (10 sec: 5702.8, 60 sec: 5533.2, 300 sec: 5520.5). Total num frames: 1174300672. Throughput: 0: 5697.6. Samples: 1174304796. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:22,498][25689] Avg episode reward: [(0, '0.097')] [2022-07-11 10:10:24,081][26022] Updated weights on worker 0-0, policy_version 1146786 (0.00087) [2022-07-11 10:10:25,537][26022] Updated weights on worker 0-0, policy_version 1146796 (0.00086) [2022-07-11 10:10:27,592][25689] Fps is (10 sec: 5465.7, 60 sec: 5496.5, 300 sec: 5515.9). Total num frames: 1174327296. Throughput: 0: 5804.7. Samples: 1174338204. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:27,593][25689] Avg episode reward: [(0, '-0.039')] [2022-07-11 10:10:27,738][26022] Updated weights on worker 0-0, policy_version 1146806 (0.00093) [2022-07-11 10:10:29,515][26022] Updated weights on worker 0-0, policy_version 1146816 (0.00085) [2022-07-11 10:10:31,266][26022] Updated weights on worker 0-0, policy_version 1146826 (0.00086) [2022-07-11 10:10:32,636][25689] Fps is (10 sec: 5554.9, 60 sec: 5530.7, 300 sec: 5518.8). Total num frames: 1174356992. Throughput: 0: 5821.1. Samples: 1174355006. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:32,638][25689] Avg episode reward: [(0, '-0.835')] [2022-07-11 10:10:33,279][26022] Updated weights on worker 0-0, policy_version 1146836 (0.00097) [2022-07-11 10:10:34,852][26022] Updated weights on worker 0-0, policy_version 1146846 (0.00099) [2022-07-11 10:10:36,703][26022] Updated weights on worker 0-0, policy_version 1146856 (0.00085) [2022-07-11 10:10:37,650][25689] Fps is (10 sec: 5802.7, 60 sec: 5552.6, 300 sec: 5519.9). Total num frames: 1174385664. Throughput: 0: 5840.8. Samples: 1174388820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:37,651][25689] Avg episode reward: [(0, '-1.427')] [2022-07-11 10:10:38,743][26022] Updated weights on worker 0-0, policy_version 1146866 (0.00086) [2022-07-11 10:10:40,171][26022] Updated weights on worker 0-0, policy_version 1146876 (0.00086) [2022-07-11 10:10:42,130][26022] Updated weights on worker 0-0, policy_version 1146886 (0.00871) [2022-07-11 10:10:42,714][25689] Fps is (10 sec: 5792.0, 60 sec: 5564.4, 300 sec: 5530.8). Total num frames: 1174415360. Throughput: 0: 5837.3. Samples: 1174422804. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:42,714][25689] Avg episode reward: [(0, '-1.532')] [2022-07-11 10:10:44,110][26022] Updated weights on worker 0-0, policy_version 1146896 (0.00093) [2022-07-11 10:10:45,632][26022] Updated weights on worker 0-0, policy_version 1146906 (0.00089) [2022-07-11 10:10:47,769][25689] Fps is (10 sec: 5465.0, 60 sec: 5532.1, 300 sec: 5516.3). Total num frames: 1174440960. Throughput: 0: 5017.0. Samples: 1174439432. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:47,769][25689] Avg episode reward: [(0, '-1.538')] [2022-07-11 10:10:47,887][26022] Updated weights on worker 0-0, policy_version 1146916 (0.00093) [2022-07-11 10:10:49,310][26022] Updated weights on worker 0-0, policy_version 1146926 (0.00095) [2022-07-11 10:10:51,328][26022] Updated weights on worker 0-0, policy_version 1146936 (0.00113) [2022-07-11 10:10:52,787][25689] Fps is (10 sec: 5387.8, 60 sec: 5515.2, 300 sec: 5523.5). Total num frames: 1174469632. Throughput: 0: 5847.6. Samples: 1174472838. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:52,787][25689] Avg episode reward: [(0, '-2.022')] [2022-07-11 10:10:53,305][26022] Updated weights on worker 0-0, policy_version 1146946 (0.00087) [2022-07-11 10:10:54,932][26022] Updated weights on worker 0-0, policy_version 1146956 (0.00092) [2022-07-11 10:10:56,994][26022] Updated weights on worker 0-0, policy_version 1146966 (0.00091) [2022-07-11 10:10:57,798][25689] Fps is (10 sec: 5717.5, 60 sec: 5567.1, 300 sec: 5523.9). Total num frames: 1174498304. Throughput: 0: 5821.2. Samples: 1174506104. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:10:57,799][25689] Avg episode reward: [(0, '-1.362')] [2022-07-11 10:10:58,696][26022] Updated weights on worker 0-0, policy_version 1146976 (0.00108) [2022-07-11 10:11:00,680][26022] Updated weights on worker 0-0, policy_version 1146986 (0.00100) [2022-07-11 10:11:02,817][25689] Fps is (10 sec: 5308.7, 60 sec: 5534.3, 300 sec: 5518.8). Total num frames: 1174522880. Throughput: 0: 4974.1. Samples: 1174522800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:02,817][25689] Avg episode reward: [(0, '-0.474')] [2022-07-11 10:11:02,847][26022] Updated weights on worker 0-0, policy_version 1146996 (0.00084) [2022-07-11 10:11:04,729][26022] Updated weights on worker 0-0, policy_version 1147006 (0.00084) [2022-07-11 10:11:06,764][26022] Updated weights on worker 0-0, policy_version 1147016 (0.00090) [2022-07-11 10:11:07,888][25689] Fps is (10 sec: 5277.5, 60 sec: 5533.8, 300 sec: 5528.3). Total num frames: 1174551552. Throughput: 0: 5698.2. Samples: 1174554074. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:07,888][25689] Avg episode reward: [(0, '-0.442')] [2022-07-11 10:11:08,315][26022] Updated weights on worker 0-0, policy_version 1147026 (0.00090) [2022-07-11 10:11:10,448][26022] Updated weights on worker 0-0, policy_version 1147036 (0.00082) [2022-07-11 10:11:11,954][26022] Updated weights on worker 0-0, policy_version 1147046 (0.00083) [2022-07-11 10:11:12,973][25689] Fps is (10 sec: 5646.5, 60 sec: 5565.7, 300 sec: 5524.8). Total num frames: 1174580224. Throughput: 0: 5687.6. Samples: 1174587648. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:12,973][25689] Avg episode reward: [(0, '-0.511')] [2022-07-11 10:11:13,927][26022] Updated weights on worker 0-0, policy_version 1147056 (0.00089) [2022-07-11 10:11:15,606][26022] Updated weights on worker 0-0, policy_version 1147066 (0.00076) [2022-07-11 10:11:17,430][26022] Updated weights on worker 0-0, policy_version 1147076 (0.00085) [2022-07-11 10:11:17,999][25689] Fps is (10 sec: 5671.4, 60 sec: 5566.3, 300 sec: 5524.9). Total num frames: 1174608896. Throughput: 0: 4876.1. Samples: 1174604604. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:17,999][25689] Avg episode reward: [(0, '-0.639')] [2022-07-11 10:11:19,254][26022] Updated weights on worker 0-0, policy_version 1147086 (0.00089) [2022-07-11 10:11:21,102][26022] Updated weights on worker 0-0, policy_version 1147096 (0.00084) [2022-07-11 10:11:22,944][26022] Updated weights on worker 0-0, policy_version 1147106 (0.00099) [2022-07-11 10:11:23,015][25689] Fps is (10 sec: 5608.4, 60 sec: 5550.0, 300 sec: 5529.8). Total num frames: 1174636544. Throughput: 0: 5716.8. Samples: 1174638266. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:23,015][25689] Avg episode reward: [(0, '-2.157')] [2022-07-11 10:11:24,855][26022] Updated weights on worker 0-0, policy_version 1147116 (0.00098) [2022-07-11 10:11:26,631][26022] Updated weights on worker 0-0, policy_version 1147126 (0.00099) [2022-07-11 10:11:28,081][25689] Fps is (10 sec: 5484.6, 60 sec: 5569.5, 300 sec: 5526.1). Total num frames: 1174664192. Throughput: 0: 5808.2. Samples: 1174671360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:28,081][25689] Avg episode reward: [(0, '-0.286')] [2022-07-11 10:11:28,747][26022] Updated weights on worker 0-0, policy_version 1147136 (0.00093) [2022-07-11 10:11:30,237][26022] Updated weights on worker 0-0, policy_version 1147146 (0.00091) [2022-07-11 10:11:32,405][26022] Updated weights on worker 0-0, policy_version 1147156 (0.00088) [2022-07-11 10:11:33,099][25689] Fps is (10 sec: 5483.6, 60 sec: 5538.1, 300 sec: 5526.2). Total num frames: 1174691840. Throughput: 0: 4988.2. Samples: 1174688040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:33,099][25689] Avg episode reward: [(0, '-1.010')] [2022-07-11 10:11:34,180][26022] Updated weights on worker 0-0, policy_version 1147166 (0.00086) [2022-07-11 10:11:35,916][26022] Updated weights on worker 0-0, policy_version 1147176 (0.00097) [2022-07-11 10:11:37,927][26022] Updated weights on worker 0-0, policy_version 1147186 (0.00086) [2022-07-11 10:11:38,103][25689] Fps is (10 sec: 5415.4, 60 sec: 5505.2, 300 sec: 5521.1). Total num frames: 1174718464. Throughput: 0: 5802.0. Samples: 1174721246. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:38,103][25689] Avg episode reward: [(0, '-1.710')] [2022-07-11 10:11:39,803][26022] Updated weights on worker 0-0, policy_version 1147196 (0.00084) [2022-07-11 10:11:41,384][26022] Updated weights on worker 0-0, policy_version 1147206 (0.00282) [2022-07-11 10:11:43,127][25689] Fps is (10 sec: 5411.8, 60 sec: 5474.8, 300 sec: 5519.7). Total num frames: 1174746112. Throughput: 0: 5807.5. Samples: 1174755068. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:43,128][25689] Avg episode reward: [(0, '-1.026')] [2022-07-11 10:11:43,556][26022] Updated weights on worker 0-0, policy_version 1147216 (0.00087) [2022-07-11 10:11:44,915][26022] Updated weights on worker 0-0, policy_version 1147226 (0.00094) [2022-07-11 10:11:47,047][26022] Updated weights on worker 0-0, policy_version 1147236 (0.00087) [2022-07-11 10:11:48,182][25689] Fps is (10 sec: 5892.5, 60 sec: 5576.5, 300 sec: 5532.8). Total num frames: 1174777856. Throughput: 0: 4991.0. Samples: 1174771684. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:48,182][25689] Avg episode reward: [(0, '-2.978')] [2022-07-11 10:11:48,836][26022] Updated weights on worker 0-0, policy_version 1147246 (0.00090) [2022-07-11 10:11:50,615][26022] Updated weights on worker 0-0, policy_version 1147256 (0.00087) [2022-07-11 10:11:52,539][26022] Updated weights on worker 0-0, policy_version 1147266 (0.00079) [2022-07-11 10:11:53,186][25689] Fps is (10 sec: 5802.3, 60 sec: 5543.9, 300 sec: 5522.6). Total num frames: 1174804480. Throughput: 0: 5845.3. Samples: 1174805460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:53,187][25689] Avg episode reward: [(0, '-2.313')] [2022-07-11 10:11:54,414][26022] Updated weights on worker 0-0, policy_version 1147276 (0.00085) [2022-07-11 10:11:55,961][26022] Updated weights on worker 0-0, policy_version 1147286 (0.00513) [2022-07-11 10:11:56,895][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:11:56,909][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001147291_1174825984.pth [2022-07-11 10:11:56,909][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001145345_1172833280.pth [2022-07-11 10:11:58,161][26022] Updated weights on worker 0-0, policy_version 1147296 (0.00079) [2022-07-11 10:11:58,229][25689] Fps is (10 sec: 5299.6, 60 sec: 5507.1, 300 sec: 5522.2). Total num frames: 1174831104. Throughput: 0: 5851.8. Samples: 1174839024. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:11:58,230][25689] Avg episode reward: [(0, '-2.633')] [2022-07-11 10:11:59,571][26022] Updated weights on worker 0-0, policy_version 1147306 (0.00083) [2022-07-11 10:12:02,238][26022] Updated weights on worker 0-0, policy_version 1147316 (0.00088) [2022-07-11 10:12:03,233][25689] Fps is (10 sec: 5402.0, 60 sec: 5559.3, 300 sec: 5527.8). Total num frames: 1174858752. Throughput: 0: 5026.4. Samples: 1174856130. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:03,234][25689] Avg episode reward: [(0, '-2.019')] [2022-07-11 10:12:03,781][26022] Updated weights on worker 0-0, policy_version 1147326 (0.00085) [2022-07-11 10:12:05,590][26022] Updated weights on worker 0-0, policy_version 1147336 (0.00083) [2022-07-11 10:12:07,694][26022] Updated weights on worker 0-0, policy_version 1147346 (0.00087) [2022-07-11 10:12:08,319][25689] Fps is (10 sec: 5581.9, 60 sec: 5557.9, 300 sec: 5530.2). Total num frames: 1174887424. Throughput: 0: 5737.8. Samples: 1174887228. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:08,320][25689] Avg episode reward: [(0, '-1.216')] [2022-07-11 10:12:09,317][26022] Updated weights on worker 0-0, policy_version 1147356 (0.00084) [2022-07-11 10:12:11,302][26022] Updated weights on worker 0-0, policy_version 1147366 (0.00090) [2022-07-11 10:12:13,179][26022] Updated weights on worker 0-0, policy_version 1147376 (0.00086) [2022-07-11 10:12:13,375][25689] Fps is (10 sec: 5452.2, 60 sec: 5526.6, 300 sec: 5519.7). Total num frames: 1174914048. Throughput: 0: 5709.6. Samples: 1174920730. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:13,377][25689] Avg episode reward: [(0, '-1.201')] [2022-07-11 10:12:14,722][26022] Updated weights on worker 0-0, policy_version 1147386 (0.00080) [2022-07-11 10:12:16,842][26022] Updated weights on worker 0-0, policy_version 1147396 (0.00084) [2022-07-11 10:12:18,451][25689] Fps is (10 sec: 5457.6, 60 sec: 5522.1, 300 sec: 5522.0). Total num frames: 1174942720. Throughput: 0: 4867.6. Samples: 1174937454. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:18,451][25689] Avg episode reward: [(0, '0.651')] [2022-07-11 10:12:18,522][26022] Updated weights on worker 0-0, policy_version 1147406 (0.00083) [2022-07-11 10:12:20,358][26022] Updated weights on worker 0-0, policy_version 1147416 (0.00086) [2022-07-11 10:12:22,302][26022] Updated weights on worker 0-0, policy_version 1147426 (0.00082) [2022-07-11 10:12:23,500][25689] Fps is (10 sec: 5764.7, 60 sec: 5552.9, 300 sec: 5529.0). Total num frames: 1174972416. Throughput: 0: 5674.0. Samples: 1174971126. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:23,501][25689] Avg episode reward: [(0, '1.292')] [2022-07-11 10:12:24,119][26022] Updated weights on worker 0-0, policy_version 1147436 (0.00079) [2022-07-11 10:12:26,011][26022] Updated weights on worker 0-0, policy_version 1147446 (0.00098) [2022-07-11 10:12:27,918][26022] Updated weights on worker 0-0, policy_version 1147456 (0.00096) [2022-07-11 10:12:28,621][25689] Fps is (10 sec: 5537.8, 60 sec: 5531.0, 300 sec: 5521.2). Total num frames: 1174999040. Throughput: 0: 5779.6. Samples: 1175004566. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:28,622][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 10:12:29,492][26022] Updated weights on worker 0-0, policy_version 1147466 (0.00089) [2022-07-11 10:12:31,579][26022] Updated weights on worker 0-0, policy_version 1147476 (0.00097) [2022-07-11 10:12:33,175][26022] Updated weights on worker 0-0, policy_version 1147486 (0.00087) [2022-07-11 10:12:33,626][25689] Fps is (10 sec: 5460.9, 60 sec: 5549.1, 300 sec: 5524.9). Total num frames: 1175027712. Throughput: 0: 4973.2. Samples: 1175021442. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 10:12:33,627][25689] Avg episode reward: [(0, '0.048')] [2022-07-11 10:12:35,238][26022] Updated weights on worker 0-0, policy_version 1147496 (0.00089) [2022-07-11 10:12:36,853][26022] Updated weights on worker 0-0, policy_version 1147506 (0.00085) [2022-07-11 10:12:38,655][25689] Fps is (10 sec: 5613.1, 60 sec: 5563.7, 300 sec: 5520.9). Total num frames: 1175055360. Throughput: 0: 5799.5. Samples: 1175054626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:12:38,655][25689] Avg episode reward: [(0, '-1.072')] [2022-07-11 10:12:38,796][26022] Updated weights on worker 0-0, policy_version 1147516 (0.00088) [2022-07-11 10:12:40,789][26022] Updated weights on worker 0-0, policy_version 1147526 (0.00093) [2022-07-11 10:12:42,543][26022] Updated weights on worker 0-0, policy_version 1147536 (0.00104) [2022-07-11 10:12:43,668][25689] Fps is (10 sec: 5404.8, 60 sec: 5547.9, 300 sec: 5519.9). Total num frames: 1175081984. Throughput: 0: 5793.1. Samples: 1175087958. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:12:43,668][25689] Avg episode reward: [(0, '-1.471')] [2022-07-11 10:12:44,327][26022] Updated weights on worker 0-0, policy_version 1147546 (0.00088) [2022-07-11 10:12:46,471][26022] Updated weights on worker 0-0, policy_version 1147556 (0.00099) [2022-07-11 10:12:47,702][26022] Updated weights on worker 0-0, policy_version 1147566 (0.00087) [2022-07-11 10:12:48,763][25689] Fps is (10 sec: 5470.4, 60 sec: 5493.5, 300 sec: 5518.2). Total num frames: 1175110656. Throughput: 0: 4976.5. Samples: 1175104802. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:12:48,765][25689] Avg episode reward: [(0, '-2.613')] [2022-07-11 10:12:50,162][26022] Updated weights on worker 0-0, policy_version 1147576 (0.00086) [2022-07-11 10:12:51,312][26022] Updated weights on worker 0-0, policy_version 1147586 (0.00087) [2022-07-11 10:12:53,679][26022] Updated weights on worker 0-0, policy_version 1147596 (0.00083) [2022-07-11 10:12:53,775][25689] Fps is (10 sec: 5673.4, 60 sec: 5526.6, 300 sec: 5525.1). Total num frames: 1175139328. Throughput: 0: 5788.8. Samples: 1175138080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:12:53,776][25689] Avg episode reward: [(0, '-2.708')] [2022-07-11 10:12:55,411][26022] Updated weights on worker 0-0, policy_version 1147606 (0.00092) [2022-07-11 10:12:57,111][26022] Updated weights on worker 0-0, policy_version 1147616 (0.00095) [2022-07-11 10:12:58,798][25689] Fps is (10 sec: 5714.6, 60 sec: 5562.3, 300 sec: 5528.4). Total num frames: 1175168000. Throughput: 0: 5802.5. Samples: 1175171506. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:12:58,798][25689] Avg episode reward: [(0, '-1.794')] [2022-07-11 10:12:59,219][26022] Updated weights on worker 0-0, policy_version 1147626 (0.00085) [2022-07-11 10:13:00,832][26022] Updated weights on worker 0-0, policy_version 1147636 (0.00090) [2022-07-11 10:13:02,968][26022] Updated weights on worker 0-0, policy_version 1147646 (0.00096) [2022-07-11 10:13:03,808][25689] Fps is (10 sec: 5409.1, 60 sec: 5527.8, 300 sec: 5526.0). Total num frames: 1175193600. Throughput: 0: 4976.7. Samples: 1175188194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:03,811][25689] Avg episode reward: [(0, '-1.711')] [2022-07-11 10:13:05,279][26022] Updated weights on worker 0-0, policy_version 1147656 (0.00094) [2022-07-11 10:13:06,534][26022] Updated weights on worker 0-0, policy_version 1147666 (0.00088) [2022-07-11 10:13:08,862][25689] Fps is (10 sec: 5189.0, 60 sec: 5496.9, 300 sec: 5515.0). Total num frames: 1175220224. Throughput: 0: 5676.4. Samples: 1175218892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:08,862][25689] Avg episode reward: [(0, '-0.540')] [2022-07-11 10:13:08,864][26022] Updated weights on worker 0-0, policy_version 1147676 (0.00086) [2022-07-11 10:13:10,393][26022] Updated weights on worker 0-0, policy_version 1147686 (0.00091) [2022-07-11 10:13:12,198][26022] Updated weights on worker 0-0, policy_version 1147696 (0.00084) [2022-07-11 10:13:13,914][25689] Fps is (10 sec: 5471.8, 60 sec: 5531.1, 300 sec: 5525.0). Total num frames: 1175248896. Throughput: 0: 5683.1. Samples: 1175252532. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:13,915][25689] Avg episode reward: [(0, '1.299')] [2022-07-11 10:13:14,263][26022] Updated weights on worker 0-0, policy_version 1147706 (0.00089) [2022-07-11 10:13:16,226][26022] Updated weights on worker 0-0, policy_version 1147716 (0.00094) [2022-07-11 10:13:17,859][26022] Updated weights on worker 0-0, policy_version 1147726 (0.00091) [2022-07-11 10:13:18,929][25689] Fps is (10 sec: 5696.2, 60 sec: 5536.7, 300 sec: 5518.0). Total num frames: 1175277568. Throughput: 0: 5695.0. Samples: 1175286154. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:18,930][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 10:13:19,892][26022] Updated weights on worker 0-0, policy_version 1147736 (0.00086) [2022-07-11 10:13:21,308][26022] Updated weights on worker 0-0, policy_version 1147746 (0.00088) [2022-07-11 10:13:23,698][26022] Updated weights on worker 0-0, policy_version 1147756 (0.00086) [2022-07-11 10:13:23,941][25689] Fps is (10 sec: 5412.6, 60 sec: 5472.3, 300 sec: 5515.2). Total num frames: 1175303168. Throughput: 0: 5707.4. Samples: 1175303098. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:23,942][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 10:13:24,974][26022] Updated weights on worker 0-0, policy_version 1147766 (0.00098) [2022-07-11 10:13:27,170][26022] Updated weights on worker 0-0, policy_version 1147776 (0.00091) [2022-07-11 10:13:28,819][26022] Updated weights on worker 0-0, policy_version 1147786 (0.00082) [2022-07-11 10:13:29,023][25689] Fps is (10 sec: 5579.7, 60 sec: 5543.7, 300 sec: 5524.0). Total num frames: 1175333888. Throughput: 0: 5840.2. Samples: 1175336634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:29,023][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 10:13:30,822][26022] Updated weights on worker 0-0, policy_version 1147796 (0.00085) [2022-07-11 10:13:32,620][26022] Updated weights on worker 0-0, policy_version 1147806 (0.00087) [2022-07-11 10:13:34,035][25689] Fps is (10 sec: 5680.9, 60 sec: 5509.1, 300 sec: 5515.4). Total num frames: 1175360512. Throughput: 0: 5837.2. Samples: 1175369984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:34,036][25689] Avg episode reward: [(0, '0.658')] [2022-07-11 10:13:34,523][26022] Updated weights on worker 0-0, policy_version 1147816 (0.00091) [2022-07-11 10:13:36,119][26022] Updated weights on worker 0-0, policy_version 1147826 (0.00091) [2022-07-11 10:13:38,018][26022] Updated weights on worker 0-0, policy_version 1147836 (0.00085) [2022-07-11 10:13:39,039][25689] Fps is (10 sec: 5623.0, 60 sec: 5545.3, 300 sec: 5523.1). Total num frames: 1175390208. Throughput: 0: 5014.6. Samples: 1175386996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:39,039][25689] Avg episode reward: [(0, '0.601')] [2022-07-11 10:13:39,843][26022] Updated weights on worker 0-0, policy_version 1147846 (0.00083) [2022-07-11 10:13:41,716][26022] Updated weights on worker 0-0, policy_version 1147856 (0.00083) [2022-07-11 10:13:43,603][26022] Updated weights on worker 0-0, policy_version 1147866 (0.00087) [2022-07-11 10:13:44,050][25689] Fps is (10 sec: 5828.1, 60 sec: 5579.3, 300 sec: 5523.8). Total num frames: 1175418880. Throughput: 0: 5853.7. Samples: 1175420812. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:44,051][25689] Avg episode reward: [(0, '-1.205')] [2022-07-11 10:13:45,345][26022] Updated weights on worker 0-0, policy_version 1147876 (0.00090) [2022-07-11 10:13:47,192][26022] Updated weights on worker 0-0, policy_version 1147886 (0.00084) [2022-07-11 10:13:49,188][25689] Fps is (10 sec: 5347.1, 60 sec: 5524.6, 300 sec: 5514.6). Total num frames: 1175444480. Throughput: 0: 5830.8. Samples: 1175454216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:49,189][25689] Avg episode reward: [(0, '-1.073')] [2022-07-11 10:13:49,239][26022] Updated weights on worker 0-0, policy_version 1147896 (0.01086) [2022-07-11 10:13:50,631][26022] Updated weights on worker 0-0, policy_version 1147906 (0.00097) [2022-07-11 10:13:52,896][26022] Updated weights on worker 0-0, policy_version 1147916 (0.00092) [2022-07-11 10:13:54,192][25689] Fps is (10 sec: 5452.3, 60 sec: 5542.3, 300 sec: 5521.8). Total num frames: 1175474176. Throughput: 0: 4995.7. Samples: 1175470682. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:54,193][25689] Avg episode reward: [(0, '-1.668')] [2022-07-11 10:13:54,583][26022] Updated weights on worker 0-0, policy_version 1147926 (0.00092) [2022-07-11 10:13:56,471][26022] Updated weights on worker 0-0, policy_version 1147936 (0.00087) [2022-07-11 10:13:56,916][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:13:56,929][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001147939_1175489536.pth [2022-07-11 10:13:56,929][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001145994_1173497856.pth [2022-07-11 10:13:58,228][26022] Updated weights on worker 0-0, policy_version 1147946 (0.00086) [2022-07-11 10:13:59,195][25689] Fps is (10 sec: 5628.4, 60 sec: 5510.2, 300 sec: 5518.5). Total num frames: 1175500800. Throughput: 0: 5822.1. Samples: 1175504348. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:13:59,197][25689] Avg episode reward: [(0, '-2.262')] [2022-07-11 10:14:00,094][26022] Updated weights on worker 0-0, policy_version 1147956 (0.00086) [2022-07-11 10:14:02,051][26022] Updated weights on worker 0-0, policy_version 1147966 (0.00517) [2022-07-11 10:14:04,133][26022] Updated weights on worker 0-0, policy_version 1147976 (0.00094) [2022-07-11 10:14:04,260][25689] Fps is (10 sec: 5288.7, 60 sec: 5522.1, 300 sec: 5523.9). Total num frames: 1175527424. Throughput: 0: 5701.0. Samples: 1175536030. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:04,261][25689] Avg episode reward: [(0, '-2.148')] [2022-07-11 10:14:05,763][26022] Updated weights on worker 0-0, policy_version 1147986 (0.00085) [2022-07-11 10:14:07,841][26022] Updated weights on worker 0-0, policy_version 1147996 (0.00058) [2022-07-11 10:14:09,319][25689] Fps is (10 sec: 5563.1, 60 sec: 5572.5, 300 sec: 5540.3). Total num frames: 1175557120. Throughput: 0: 4901.0. Samples: 1175552874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:09,320][25689] Avg episode reward: [(0, '-2.157')] [2022-07-11 10:14:09,445][26022] Updated weights on worker 0-0, policy_version 1148006 (0.00088) [2022-07-11 10:14:11,355][26022] Updated weights on worker 0-0, policy_version 1148016 (0.00086) [2022-07-11 10:14:13,163][26022] Updated weights on worker 0-0, policy_version 1148026 (0.00090) [2022-07-11 10:14:14,331][25689] Fps is (10 sec: 5592.7, 60 sec: 5542.3, 300 sec: 5536.9). Total num frames: 1175583744. Throughput: 0: 5747.3. Samples: 1175586424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:14,332][25689] Avg episode reward: [(0, '-2.891')] [2022-07-11 10:14:15,141][26022] Updated weights on worker 0-0, policy_version 1148036 (0.00087) [2022-07-11 10:14:16,929][26022] Updated weights on worker 0-0, policy_version 1148046 (0.00093) [2022-07-11 10:14:18,967][26022] Updated weights on worker 0-0, policy_version 1148056 (0.00092) [2022-07-11 10:14:19,344][25689] Fps is (10 sec: 5413.6, 60 sec: 5525.5, 300 sec: 5533.9). Total num frames: 1175611392. Throughput: 0: 5721.6. Samples: 1175619632. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:19,344][25689] Avg episode reward: [(0, '-3.281')] [2022-07-11 10:14:20,533][26022] Updated weights on worker 0-0, policy_version 1148066 (0.00085) [2022-07-11 10:14:22,557][26022] Updated weights on worker 0-0, policy_version 1148076 (0.00096) [2022-07-11 10:14:24,301][26022] Updated weights on worker 0-0, policy_version 1148086 (0.00091) [2022-07-11 10:14:24,355][25689] Fps is (10 sec: 5618.5, 60 sec: 5576.5, 300 sec: 5534.8). Total num frames: 1175640064. Throughput: 0: 4993.4. Samples: 1175636368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:24,355][25689] Avg episode reward: [(0, '-3.511')] [2022-07-11 10:14:26,126][26022] Updated weights on worker 0-0, policy_version 1148096 (0.00505) [2022-07-11 10:14:28,169][26022] Updated weights on worker 0-0, policy_version 1148106 (0.00090) [2022-07-11 10:14:29,422][25689] Fps is (10 sec: 5690.2, 60 sec: 5543.9, 300 sec: 5537.9). Total num frames: 1175668736. Throughput: 0: 5817.4. Samples: 1175669820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:29,422][25689] Avg episode reward: [(0, '-2.576')] [2022-07-11 10:14:29,763][26022] Updated weights on worker 0-0, policy_version 1148116 (0.00096) [2022-07-11 10:14:31,651][26022] Updated weights on worker 0-0, policy_version 1148126 (0.00086) [2022-07-11 10:14:33,572][26022] Updated weights on worker 0-0, policy_version 1148136 (0.00085) [2022-07-11 10:14:34,431][25689] Fps is (10 sec: 5487.8, 60 sec: 5544.2, 300 sec: 5535.6). Total num frames: 1175695360. Throughput: 0: 5794.0. Samples: 1175702884. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:34,431][25689] Avg episode reward: [(0, '-2.184')] [2022-07-11 10:14:35,399][26022] Updated weights on worker 0-0, policy_version 1148146 (0.00105) [2022-07-11 10:14:37,363][26022] Updated weights on worker 0-0, policy_version 1148156 (0.00085) [2022-07-11 10:14:38,972][26022] Updated weights on worker 0-0, policy_version 1148166 (0.00095) [2022-07-11 10:14:39,494][25689] Fps is (10 sec: 5490.0, 60 sec: 5521.8, 300 sec: 5534.6). Total num frames: 1175724032. Throughput: 0: 4962.4. Samples: 1175719624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:39,494][25689] Avg episode reward: [(0, '-0.369')] [2022-07-11 10:14:40,966][26022] Updated weights on worker 0-0, policy_version 1148176 (0.00089) [2022-07-11 10:14:42,708][26022] Updated weights on worker 0-0, policy_version 1148186 (0.00090) [2022-07-11 10:14:44,549][25689] Fps is (10 sec: 5566.3, 60 sec: 5501.0, 300 sec: 5534.9). Total num frames: 1175751680. Throughput: 0: 5798.5. Samples: 1175753464. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:44,549][25689] Avg episode reward: [(0, '0.271')] [2022-07-11 10:14:44,579][26022] Updated weights on worker 0-0, policy_version 1148196 (0.00078) [2022-07-11 10:14:46,576][26022] Updated weights on worker 0-0, policy_version 1148206 (0.00090) [2022-07-11 10:14:48,191][26022] Updated weights on worker 0-0, policy_version 1148216 (0.00088) [2022-07-11 10:14:49,628][25689] Fps is (10 sec: 5658.6, 60 sec: 5574.1, 300 sec: 5533.8). Total num frames: 1175781376. Throughput: 0: 5801.2. Samples: 1175787040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:49,628][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 10:14:50,101][26022] Updated weights on worker 0-0, policy_version 1148226 (0.00098) [2022-07-11 10:14:51,935][26022] Updated weights on worker 0-0, policy_version 1148236 (0.00084) [2022-07-11 10:14:53,632][26022] Updated weights on worker 0-0, policy_version 1148246 (0.00085) [2022-07-11 10:14:54,713][25689] Fps is (10 sec: 5641.6, 60 sec: 5532.7, 300 sec: 5539.5). Total num frames: 1175809024. Throughput: 0: 4986.2. Samples: 1175804016. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:54,714][25689] Avg episode reward: [(0, '0.664')] [2022-07-11 10:14:55,549][26022] Updated weights on worker 0-0, policy_version 1148256 (0.00085) [2022-07-11 10:14:57,503][26022] Updated weights on worker 0-0, policy_version 1148266 (0.00084) [2022-07-11 10:14:59,011][26022] Updated weights on worker 0-0, policy_version 1148276 (0.00090) [2022-07-11 10:14:59,813][25689] Fps is (10 sec: 5630.2, 60 sec: 5574.6, 300 sec: 5548.5). Total num frames: 1175838720. Throughput: 0: 5804.1. Samples: 1175837558. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:14:59,813][25689] Avg episode reward: [(0, '-0.347')] [2022-07-11 10:15:01,134][26022] Updated weights on worker 0-0, policy_version 1148286 (0.00088) [2022-07-11 10:15:03,005][26022] Updated weights on worker 0-0, policy_version 1148296 (0.00088) [2022-07-11 10:15:04,830][25689] Fps is (10 sec: 5465.6, 60 sec: 5562.1, 300 sec: 5539.1). Total num frames: 1175864320. Throughput: 0: 5710.7. Samples: 1175869286. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:04,831][25689] Avg episode reward: [(0, '-1.682')] [2022-07-11 10:15:05,024][26022] Updated weights on worker 0-0, policy_version 1148306 (0.00093) [2022-07-11 10:15:06,992][26022] Updated weights on worker 0-0, policy_version 1148316 (0.00086) [2022-07-11 10:15:08,702][26022] Updated weights on worker 0-0, policy_version 1148326 (0.00095) [2022-07-11 10:15:09,988][25689] Fps is (10 sec: 5132.4, 60 sec: 5502.3, 300 sec: 5537.3). Total num frames: 1175890944. Throughput: 0: 4834.1. Samples: 1175885464. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:09,989][25689] Avg episode reward: [(0, '-0.960')] [2022-07-11 10:15:10,680][26022] Updated weights on worker 0-0, policy_version 1148336 (0.00084) [2022-07-11 10:15:12,353][26022] Updated weights on worker 0-0, policy_version 1148346 (0.00093) [2022-07-11 10:15:14,175][26022] Updated weights on worker 0-0, policy_version 1148356 (0.00089) [2022-07-11 10:15:15,047][25689] Fps is (10 sec: 5613.0, 60 sec: 5565.6, 300 sec: 5543.7). Total num frames: 1175921664. Throughput: 0: 5670.1. Samples: 1175919308. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:15,047][25689] Avg episode reward: [(0, '-1.646')] [2022-07-11 10:15:16,191][26022] Updated weights on worker 0-0, policy_version 1148366 (0.00087) [2022-07-11 10:15:17,879][26022] Updated weights on worker 0-0, policy_version 1148376 (0.00087) [2022-07-11 10:15:19,968][26022] Updated weights on worker 0-0, policy_version 1148386 (0.00083) [2022-07-11 10:15:20,083][25689] Fps is (10 sec: 5578.8, 60 sec: 5529.7, 300 sec: 5533.2). Total num frames: 1175947264. Throughput: 0: 5676.4. Samples: 1175952622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:20,084][25689] Avg episode reward: [(0, '-1.885')] [2022-07-11 10:15:21,639][26022] Updated weights on worker 0-0, policy_version 1148396 (0.00088) [2022-07-11 10:15:23,598][26022] Updated weights on worker 0-0, policy_version 1148406 (0.00084) [2022-07-11 10:15:25,094][25689] Fps is (10 sec: 5401.8, 60 sec: 5529.8, 300 sec: 5541.6). Total num frames: 1175975936. Throughput: 0: 5751.7. Samples: 1175985834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:25,094][25689] Avg episode reward: [(0, '-2.118')] [2022-07-11 10:15:25,298][26022] Updated weights on worker 0-0, policy_version 1148416 (0.00080) [2022-07-11 10:15:27,256][26022] Updated weights on worker 0-0, policy_version 1148426 (0.00087) [2022-07-11 10:15:29,075][26022] Updated weights on worker 0-0, policy_version 1148436 (0.00095) [2022-07-11 10:15:30,207][25689] Fps is (10 sec: 5563.4, 60 sec: 5508.7, 300 sec: 5533.4). Total num frames: 1176003584. Throughput: 0: 5783.7. Samples: 1176002404. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:30,207][25689] Avg episode reward: [(0, '-1.392')] [2022-07-11 10:15:30,864][26022] Updated weights on worker 0-0, policy_version 1148446 (0.00090) [2022-07-11 10:15:32,839][26022] Updated weights on worker 0-0, policy_version 1148456 (0.00093) [2022-07-11 10:15:34,592][26022] Updated weights on worker 0-0, policy_version 1148466 (0.00085) [2022-07-11 10:15:35,214][25689] Fps is (10 sec: 5564.9, 60 sec: 5542.6, 300 sec: 5533.6). Total num frames: 1176032256. Throughput: 0: 5789.7. Samples: 1176036072. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:35,216][25689] Avg episode reward: [(0, '-0.904')] [2022-07-11 10:15:36,406][26022] Updated weights on worker 0-0, policy_version 1148476 (0.00093) [2022-07-11 10:15:38,292][26022] Updated weights on worker 0-0, policy_version 1148486 (0.00090) [2022-07-11 10:15:39,946][26022] Updated weights on worker 0-0, policy_version 1148496 (0.00086) [2022-07-11 10:15:40,306][25689] Fps is (10 sec: 5677.9, 60 sec: 5539.9, 300 sec: 5529.6). Total num frames: 1176060928. Throughput: 0: 5761.4. Samples: 1176069134. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:40,308][25689] Avg episode reward: [(0, '-0.421')] [2022-07-11 10:15:42,047][26022] Updated weights on worker 0-0, policy_version 1148506 (0.00085) [2022-07-11 10:15:43,862][26022] Updated weights on worker 0-0, policy_version 1148516 (0.00618) [2022-07-11 10:15:45,339][25689] Fps is (10 sec: 5461.3, 60 sec: 5525.1, 300 sec: 5533.4). Total num frames: 1176087552. Throughput: 0: 4947.0. Samples: 1176085988. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:45,341][25689] Avg episode reward: [(0, '-0.772')] [2022-07-11 10:15:45,595][26022] Updated weights on worker 0-0, policy_version 1148526 (0.00093) [2022-07-11 10:15:47,483][26022] Updated weights on worker 0-0, policy_version 1148536 (0.00089) [2022-07-11 10:15:49,223][26022] Updated weights on worker 0-0, policy_version 1148546 (0.00094) [2022-07-11 10:15:50,438][25689] Fps is (10 sec: 5558.5, 60 sec: 5523.2, 300 sec: 5535.4). Total num frames: 1176117248. Throughput: 0: 5794.0. Samples: 1176119626. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:50,439][25689] Avg episode reward: [(0, '-0.359')] [2022-07-11 10:15:51,160][26022] Updated weights on worker 0-0, policy_version 1148556 (0.00084) [2022-07-11 10:15:52,967][26022] Updated weights on worker 0-0, policy_version 1148566 (0.00089) [2022-07-11 10:15:54,736][26022] Updated weights on worker 0-0, policy_version 1148576 (0.00092) [2022-07-11 10:15:55,463][25689] Fps is (10 sec: 5664.4, 60 sec: 5528.8, 300 sec: 5531.7). Total num frames: 1176144896. Throughput: 0: 5797.8. Samples: 1176153470. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:15:55,463][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 10:15:56,684][26022] Updated weights on worker 0-0, policy_version 1148586 (0.00088) [2022-07-11 10:15:57,038][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:15:57,048][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001148589_1176155136.pth [2022-07-11 10:15:57,048][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001146640_1174159360.pth [2022-07-11 10:15:58,364][26022] Updated weights on worker 0-0, policy_version 1148596 (0.00090) [2022-07-11 10:16:00,275][26022] Updated weights on worker 0-0, policy_version 1148606 (0.00088) [2022-07-11 10:16:00,468][25689] Fps is (10 sec: 5615.7, 60 sec: 5520.6, 300 sec: 5545.7). Total num frames: 1176173568. Throughput: 0: 5006.1. Samples: 1176170064. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:00,468][25689] Avg episode reward: [(0, '-0.365')] [2022-07-11 10:16:02,087][26022] Updated weights on worker 0-0, policy_version 1148616 (0.00107) [2022-07-11 10:16:04,431][26022] Updated weights on worker 0-0, policy_version 1148626 (0.00092) [2022-07-11 10:16:05,476][25689] Fps is (10 sec: 5522.5, 60 sec: 5538.3, 300 sec: 5540.0). Total num frames: 1176200192. Throughput: 0: 5755.3. Samples: 1176201880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:05,476][25689] Avg episode reward: [(0, '-0.504')] [2022-07-11 10:16:06,151][26022] Updated weights on worker 0-0, policy_version 1148636 (0.00083) [2022-07-11 10:16:08,003][26022] Updated weights on worker 0-0, policy_version 1148646 (0.00087) [2022-07-11 10:16:09,721][26022] Updated weights on worker 0-0, policy_version 1148656 (0.00090) [2022-07-11 10:16:10,612][25689] Fps is (10 sec: 5349.8, 60 sec: 5557.1, 300 sec: 5535.6). Total num frames: 1176227840. Throughput: 0: 5726.9. Samples: 1176235160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:10,613][25689] Avg episode reward: [(0, '-1.548')] [2022-07-11 10:16:11,648][26022] Updated weights on worker 0-0, policy_version 1148666 (0.00096) [2022-07-11 10:16:13,479][26022] Updated weights on worker 0-0, policy_version 1148676 (0.00085) [2022-07-11 10:16:15,316][26022] Updated weights on worker 0-0, policy_version 1148686 (0.00081) [2022-07-11 10:16:15,631][25689] Fps is (10 sec: 5445.2, 60 sec: 5510.1, 300 sec: 5532.3). Total num frames: 1176255488. Throughput: 0: 4878.6. Samples: 1176251862. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:15,631][25689] Avg episode reward: [(0, '-1.808')] [2022-07-11 10:16:17,207][26022] Updated weights on worker 0-0, policy_version 1148696 (0.00088) [2022-07-11 10:16:18,967][26022] Updated weights on worker 0-0, policy_version 1148706 (0.00091) [2022-07-11 10:16:20,672][25689] Fps is (10 sec: 5496.7, 60 sec: 5543.5, 300 sec: 5531.8). Total num frames: 1176283136. Throughput: 0: 5705.2. Samples: 1176285336. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:20,673][25689] Avg episode reward: [(0, '-1.989')] [2022-07-11 10:16:20,813][26022] Updated weights on worker 0-0, policy_version 1148716 (0.00082) [2022-07-11 10:16:22,722][26022] Updated weights on worker 0-0, policy_version 1148726 (0.00086) [2022-07-11 10:16:24,445][26022] Updated weights on worker 0-0, policy_version 1148736 (0.00055) [2022-07-11 10:16:25,697][25689] Fps is (10 sec: 5594.8, 60 sec: 5542.1, 300 sec: 5536.0). Total num frames: 1176311808. Throughput: 0: 5790.9. Samples: 1176318980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:25,698][25689] Avg episode reward: [(0, '-0.848')] [2022-07-11 10:16:26,346][26022] Updated weights on worker 0-0, policy_version 1148746 (0.00094) [2022-07-11 10:16:28,303][26022] Updated weights on worker 0-0, policy_version 1148756 (0.00089) [2022-07-11 10:16:30,060][26022] Updated weights on worker 0-0, policy_version 1148766 (0.01451) [2022-07-11 10:16:30,785][25689] Fps is (10 sec: 5670.4, 60 sec: 5561.3, 300 sec: 5538.2). Total num frames: 1176340480. Throughput: 0: 4978.4. Samples: 1176335588. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:30,787][25689] Avg episode reward: [(0, '-0.686')] [2022-07-11 10:16:32,038][26022] Updated weights on worker 0-0, policy_version 1148776 (0.00087) [2022-07-11 10:16:33,561][26022] Updated weights on worker 0-0, policy_version 1148786 (0.00092) [2022-07-11 10:16:35,791][25689] Fps is (10 sec: 5478.1, 60 sec: 5527.6, 300 sec: 5538.1). Total num frames: 1176367104. Throughput: 0: 5801.2. Samples: 1176368818. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 10:16:35,793][25689] Avg episode reward: [(0, '1.377')] [2022-07-11 10:16:35,802][26022] Updated weights on worker 0-0, policy_version 1148796 (0.01037) [2022-07-11 10:16:37,318][26022] Updated weights on worker 0-0, policy_version 1148806 (0.00085) [2022-07-11 10:16:39,578][26022] Updated weights on worker 0-0, policy_version 1148816 (0.00090) [2022-07-11 10:16:40,801][25689] Fps is (10 sec: 5520.5, 60 sec: 5535.1, 300 sec: 5541.8). Total num frames: 1176395776. Throughput: 0: 5764.0. Samples: 1176401362. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:16:40,803][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 10:16:41,108][26022] Updated weights on worker 0-0, policy_version 1148826 (0.00086) [2022-07-11 10:16:43,366][26022] Updated weights on worker 0-0, policy_version 1148836 (0.00080) [2022-07-11 10:16:44,812][26022] Updated weights on worker 0-0, policy_version 1148846 (0.00087) [2022-07-11 10:16:45,903][25689] Fps is (10 sec: 5570.0, 60 sec: 5545.8, 300 sec: 5527.2). Total num frames: 1176423424. Throughput: 0: 4899.3. Samples: 1176417972. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:16:45,903][25689] Avg episode reward: [(0, '0.778')] [2022-07-11 10:16:46,887][26022] Updated weights on worker 0-0, policy_version 1148856 (0.00096) [2022-07-11 10:16:48,647][26022] Updated weights on worker 0-0, policy_version 1148866 (0.00086) [2022-07-11 10:16:50,788][26022] Updated weights on worker 0-0, policy_version 1148876 (0.00086) [2022-07-11 10:16:51,037][25689] Fps is (10 sec: 5302.3, 60 sec: 5491.9, 300 sec: 5524.8). Total num frames: 1176450048. Throughput: 0: 5693.9. Samples: 1176450902. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:16:51,038][25689] Avg episode reward: [(0, '0.138')] [2022-07-11 10:16:52,315][26022] Updated weights on worker 0-0, policy_version 1148886 (0.00095) [2022-07-11 10:16:54,340][26022] Updated weights on worker 0-0, policy_version 1148896 (0.00090) [2022-07-11 10:16:56,057][25689] Fps is (10 sec: 5445.4, 60 sec: 5509.1, 300 sec: 5532.1). Total num frames: 1176478720. Throughput: 0: 5693.4. Samples: 1176484200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:16:56,058][25689] Avg episode reward: [(0, '-0.140')] [2022-07-11 10:16:56,131][26022] Updated weights on worker 0-0, policy_version 1148906 (0.00527) [2022-07-11 10:16:58,058][26022] Updated weights on worker 0-0, policy_version 1148916 (0.00086) [2022-07-11 10:16:59,795][26022] Updated weights on worker 0-0, policy_version 1148926 (0.00092) [2022-07-11 10:17:01,067][25689] Fps is (10 sec: 5819.5, 60 sec: 5525.6, 300 sec: 5538.9). Total num frames: 1176508416. Throughput: 0: 4910.5. Samples: 1176500874. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:01,069][25689] Avg episode reward: [(0, '-1.134')] [2022-07-11 10:17:02,099][26022] Updated weights on worker 0-0, policy_version 1148936 (0.00083) [2022-07-11 10:17:03,816][26022] Updated weights on worker 0-0, policy_version 1148946 (0.00092) [2022-07-11 10:17:05,868][26022] Updated weights on worker 0-0, policy_version 1148956 (0.00090) [2022-07-11 10:17:06,089][25689] Fps is (10 sec: 5205.8, 60 sec: 5456.7, 300 sec: 5519.4). Total num frames: 1176530944. Throughput: 0: 5652.8. Samples: 1176532084. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:06,090][25689] Avg episode reward: [(0, '-3.279')] [2022-07-11 10:17:07,487][26022] Updated weights on worker 0-0, policy_version 1148966 (0.00894) [2022-07-11 10:17:09,676][26022] Updated weights on worker 0-0, policy_version 1148976 (0.00090) [2022-07-11 10:17:11,191][25689] Fps is (10 sec: 5158.5, 60 sec: 5493.7, 300 sec: 5528.9). Total num frames: 1176560640. Throughput: 0: 5670.8. Samples: 1176565188. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:11,191][26022] Updated weights on worker 0-0, policy_version 1148986 (0.00082) [2022-07-11 10:17:11,191][25689] Avg episode reward: [(0, '-2.137')] [2022-07-11 10:17:13,307][26022] Updated weights on worker 0-0, policy_version 1148996 (0.00085) [2022-07-11 10:17:14,862][26022] Updated weights on worker 0-0, policy_version 1149006 (0.00091) [2022-07-11 10:17:16,224][25689] Fps is (10 sec: 5658.2, 60 sec: 5492.3, 300 sec: 5526.2). Total num frames: 1176588288. Throughput: 0: 4844.3. Samples: 1176581894. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:16,226][25689] Avg episode reward: [(0, '-2.033')] [2022-07-11 10:17:16,765][26022] Updated weights on worker 0-0, policy_version 1149016 (0.00098) [2022-07-11 10:17:18,725][26022] Updated weights on worker 0-0, policy_version 1149026 (0.00084) [2022-07-11 10:17:20,595][26022] Updated weights on worker 0-0, policy_version 1149036 (0.00090) [2022-07-11 10:17:21,231][25689] Fps is (10 sec: 5507.5, 60 sec: 5495.5, 300 sec: 5520.2). Total num frames: 1176615936. Throughput: 0: 5697.8. Samples: 1176615764. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:21,233][25689] Avg episode reward: [(0, '-1.762')] [2022-07-11 10:17:22,387][26022] Updated weights on worker 0-0, policy_version 1149046 (0.00105) [2022-07-11 10:17:24,270][26022] Updated weights on worker 0-0, policy_version 1149056 (0.00089) [2022-07-11 10:17:25,919][26022] Updated weights on worker 0-0, policy_version 1149066 (0.00100) [2022-07-11 10:17:26,288][25689] Fps is (10 sec: 5698.3, 60 sec: 5509.5, 300 sec: 5531.7). Total num frames: 1176645632. Throughput: 0: 5800.6. Samples: 1176649246. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:26,288][25689] Avg episode reward: [(0, '-2.376')] [2022-07-11 10:17:28,022][26022] Updated weights on worker 0-0, policy_version 1149076 (0.00095) [2022-07-11 10:17:29,591][26022] Updated weights on worker 0-0, policy_version 1149086 (0.00084) [2022-07-11 10:17:31,341][25689] Fps is (10 sec: 5470.6, 60 sec: 5462.1, 300 sec: 5520.5). Total num frames: 1176671232. Throughput: 0: 4977.8. Samples: 1176665480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:31,343][25689] Avg episode reward: [(0, '-2.090')] [2022-07-11 10:17:31,773][26022] Updated weights on worker 0-0, policy_version 1149096 (0.00623) [2022-07-11 10:17:32,984][26022] Updated weights on worker 0-0, policy_version 1149106 (0.00095) [2022-07-11 10:17:35,318][26022] Updated weights on worker 0-0, policy_version 1149116 (0.00093) [2022-07-11 10:17:36,364][25689] Fps is (10 sec: 5589.5, 60 sec: 5528.1, 300 sec: 5530.9). Total num frames: 1176701952. Throughput: 0: 5836.5. Samples: 1176699438. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:36,366][25689] Avg episode reward: [(0, '-2.290')] [2022-07-11 10:17:36,799][26022] Updated weights on worker 0-0, policy_version 1149126 (0.00098) [2022-07-11 10:17:38,811][26022] Updated weights on worker 0-0, policy_version 1149136 (0.00098) [2022-07-11 10:17:40,710][26022] Updated weights on worker 0-0, policy_version 1149146 (0.00093) [2022-07-11 10:17:41,368][25689] Fps is (10 sec: 5615.8, 60 sec: 5477.9, 300 sec: 5527.6). Total num frames: 1176727552. Throughput: 0: 5807.7. Samples: 1176732712. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:41,369][25689] Avg episode reward: [(0, '-2.343')] [2022-07-11 10:17:42,484][26022] Updated weights on worker 0-0, policy_version 1149156 (0.00082) [2022-07-11 10:17:44,415][26022] Updated weights on worker 0-0, policy_version 1149166 (0.00084) [2022-07-11 10:17:46,137][26022] Updated weights on worker 0-0, policy_version 1149176 (0.00085) [2022-07-11 10:17:46,381][25689] Fps is (10 sec: 5417.9, 60 sec: 5502.8, 300 sec: 5529.1). Total num frames: 1176756224. Throughput: 0: 4996.0. Samples: 1176749632. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:46,383][25689] Avg episode reward: [(0, '-2.415')] [2022-07-11 10:17:47,867][26022] Updated weights on worker 0-0, policy_version 1149186 (0.00096) [2022-07-11 10:17:49,975][26022] Updated weights on worker 0-0, policy_version 1149196 (0.00086) [2022-07-11 10:17:51,491][25689] Fps is (10 sec: 5765.5, 60 sec: 5555.8, 300 sec: 5530.7). Total num frames: 1176785920. Throughput: 0: 5843.3. Samples: 1176783236. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:51,493][25689] Avg episode reward: [(0, '-1.257')] [2022-07-11 10:17:51,536][26022] Updated weights on worker 0-0, policy_version 1149206 (0.00085) [2022-07-11 10:17:53,811][26022] Updated weights on worker 0-0, policy_version 1149216 (0.00081) [2022-07-11 10:17:55,119][26022] Updated weights on worker 0-0, policy_version 1149226 (0.00112) [2022-07-11 10:17:56,516][25689] Fps is (10 sec: 5556.8, 60 sec: 5521.5, 300 sec: 5523.8). Total num frames: 1176812544. Throughput: 0: 5827.7. Samples: 1176816880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:17:56,518][25689] Avg episode reward: [(0, '-1.391')] [2022-07-11 10:17:57,179][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:17:57,193][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001149235_1176816640.pth [2022-07-11 10:17:57,194][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001147291_1174825984.pth [2022-07-11 10:17:57,282][26022] Updated weights on worker 0-0, policy_version 1149236 (0.00084) [2022-07-11 10:17:58,799][26022] Updated weights on worker 0-0, policy_version 1149246 (0.00089) [2022-07-11 10:18:00,950][26022] Updated weights on worker 0-0, policy_version 1149256 (0.00093) [2022-07-11 10:18:01,530][25689] Fps is (10 sec: 5508.1, 60 sec: 5504.2, 300 sec: 5534.1). Total num frames: 1176841216. Throughput: 0: 5839.6. Samples: 1176850454. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:01,532][25689] Avg episode reward: [(0, '0.520')] [2022-07-11 10:18:02,973][26022] Updated weights on worker 0-0, policy_version 1149266 (0.00097) [2022-07-11 10:18:04,819][26022] Updated weights on worker 0-0, policy_version 1149276 (0.00079) [2022-07-11 10:18:06,557][25689] Fps is (10 sec: 5507.1, 60 sec: 5571.5, 300 sec: 5534.6). Total num frames: 1176867840. Throughput: 0: 5735.5. Samples: 1176865352. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:06,559][25689] Avg episode reward: [(0, '1.327')] [2022-07-11 10:18:06,622][26022] Updated weights on worker 0-0, policy_version 1149286 (0.00104) [2022-07-11 10:18:08,784][26022] Updated weights on worker 0-0, policy_version 1149296 (0.00084) [2022-07-11 10:18:10,232][26022] Updated weights on worker 0-0, policy_version 1149306 (0.00088) [2022-07-11 10:18:11,644][25689] Fps is (10 sec: 5365.9, 60 sec: 5538.9, 300 sec: 5530.5). Total num frames: 1176895488. Throughput: 0: 5712.0. Samples: 1176898350. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:11,645][25689] Avg episode reward: [(0, '1.525')] [2022-07-11 10:18:12,569][26022] Updated weights on worker 0-0, policy_version 1149316 (0.00092) [2022-07-11 10:18:13,955][26022] Updated weights on worker 0-0, policy_version 1149326 (0.00084) [2022-07-11 10:18:16,140][26022] Updated weights on worker 0-0, policy_version 1149336 (0.00087) [2022-07-11 10:18:16,721][25689] Fps is (10 sec: 5440.2, 60 sec: 5534.9, 300 sec: 5525.9). Total num frames: 1176923136. Throughput: 0: 5701.4. Samples: 1176932078. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:16,722][25689] Avg episode reward: [(0, '1.622')] [2022-07-11 10:18:17,699][26022] Updated weights on worker 0-0, policy_version 1149346 (0.00086) [2022-07-11 10:18:19,917][26022] Updated weights on worker 0-0, policy_version 1149356 (0.00076) [2022-07-11 10:18:21,119][26022] Updated weights on worker 0-0, policy_version 1149366 (0.00092) [2022-07-11 10:18:21,787][25689] Fps is (10 sec: 5653.9, 60 sec: 5563.4, 300 sec: 5538.7). Total num frames: 1176952832. Throughput: 0: 4850.2. Samples: 1176948702. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:21,787][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 10:18:23,362][26022] Updated weights on worker 0-0, policy_version 1149376 (0.00090) [2022-07-11 10:18:24,924][26022] Updated weights on worker 0-0, policy_version 1149386 (0.00086) [2022-07-11 10:18:26,806][25689] Fps is (10 sec: 5483.1, 60 sec: 5499.1, 300 sec: 5522.6). Total num frames: 1176978432. Throughput: 0: 5770.5. Samples: 1176982200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:26,807][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 10:18:27,101][26022] Updated weights on worker 0-0, policy_version 1149396 (0.00099) [2022-07-11 10:18:28,654][26022] Updated weights on worker 0-0, policy_version 1149406 (0.00085) [2022-07-11 10:18:30,812][26022] Updated weights on worker 0-0, policy_version 1149416 (0.00087) [2022-07-11 10:18:31,873][25689] Fps is (10 sec: 5583.9, 60 sec: 5582.3, 300 sec: 5535.4). Total num frames: 1177009152. Throughput: 0: 5801.8. Samples: 1177015712. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:31,873][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 10:18:32,441][26022] Updated weights on worker 0-0, policy_version 1149426 (0.00098) [2022-07-11 10:18:34,523][26022] Updated weights on worker 0-0, policy_version 1149436 (0.00099) [2022-07-11 10:18:36,164][26022] Updated weights on worker 0-0, policy_version 1149446 (0.00084) [2022-07-11 10:18:36,951][25689] Fps is (10 sec: 5753.4, 60 sec: 5526.7, 300 sec: 5527.1). Total num frames: 1177036800. Throughput: 0: 4961.3. Samples: 1177032444. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:36,951][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 10:18:38,114][26022] Updated weights on worker 0-0, policy_version 1149456 (0.00080) [2022-07-11 10:18:39,920][26022] Updated weights on worker 0-0, policy_version 1149466 (0.00083) [2022-07-11 10:18:41,915][26022] Updated weights on worker 0-0, policy_version 1149476 (0.00093) [2022-07-11 10:18:41,971][25689] Fps is (10 sec: 5475.6, 60 sec: 5559.0, 300 sec: 5523.5). Total num frames: 1177064448. Throughput: 0: 5798.3. Samples: 1177065740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:41,972][25689] Avg episode reward: [(0, '-0.822')] [2022-07-11 10:18:43,614][26022] Updated weights on worker 0-0, policy_version 1149486 (0.00079) [2022-07-11 10:18:45,474][26022] Updated weights on worker 0-0, policy_version 1149496 (0.00087) [2022-07-11 10:18:47,047][25689] Fps is (10 sec: 5578.7, 60 sec: 5553.3, 300 sec: 5535.0). Total num frames: 1177093120. Throughput: 0: 5785.7. Samples: 1177099306. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:47,047][25689] Avg episode reward: [(0, '-0.437')] [2022-07-11 10:18:47,337][26022] Updated weights on worker 0-0, policy_version 1149506 (0.00086) [2022-07-11 10:18:49,181][26022] Updated weights on worker 0-0, policy_version 1149516 (0.00091) [2022-07-11 10:18:51,014][26022] Updated weights on worker 0-0, policy_version 1149527 (0.00082) [2022-07-11 10:18:52,098][25689] Fps is (10 sec: 5561.7, 60 sec: 5524.9, 300 sec: 5527.2). Total num frames: 1177120768. Throughput: 0: 4966.7. Samples: 1177116162. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:52,098][25689] Avg episode reward: [(0, '-0.453')] [2022-07-11 10:18:53,132][26022] Updated weights on worker 0-0, policy_version 1149537 (0.00086) [2022-07-11 10:18:54,733][26022] Updated weights on worker 0-0, policy_version 1149547 (0.00088) [2022-07-11 10:18:56,587][26022] Updated weights on worker 0-0, policy_version 1149557 (0.00093) [2022-07-11 10:18:57,122][25689] Fps is (10 sec: 5488.1, 60 sec: 5541.9, 300 sec: 5530.3). Total num frames: 1177148416. Throughput: 0: 5809.2. Samples: 1177149622. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:18:57,123][25689] Avg episode reward: [(0, '-0.294')] [2022-07-11 10:18:58,381][26022] Updated weights on worker 0-0, policy_version 1149567 (0.00085) [2022-07-11 10:19:00,414][26022] Updated weights on worker 0-0, policy_version 1149577 (0.00085) [2022-07-11 10:19:02,130][25689] Fps is (10 sec: 5409.7, 60 sec: 5508.6, 300 sec: 5531.3). Total num frames: 1177175040. Throughput: 0: 5770.4. Samples: 1177182064. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:02,131][25689] Avg episode reward: [(0, '-0.977')] [2022-07-11 10:19:02,335][26022] Updated weights on worker 0-0, policy_version 1149587 (0.00092) [2022-07-11 10:19:04,456][26022] Updated weights on worker 0-0, policy_version 1149597 (0.00088) [2022-07-11 10:19:05,907][26022] Updated weights on worker 0-0, policy_version 1149607 (0.00080) [2022-07-11 10:19:07,147][25689] Fps is (10 sec: 5414.0, 60 sec: 5526.5, 300 sec: 5525.2). Total num frames: 1177202688. Throughput: 0: 4929.6. Samples: 1177198392. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:07,147][25689] Avg episode reward: [(0, '-2.319')] [2022-07-11 10:19:08,111][26022] Updated weights on worker 0-0, policy_version 1149617 (0.00083) [2022-07-11 10:19:09,724][26022] Updated weights on worker 0-0, policy_version 1149627 (0.00083) [2022-07-11 10:19:11,757][26022] Updated weights on worker 0-0, policy_version 1149637 (0.00081) [2022-07-11 10:19:12,280][25689] Fps is (10 sec: 5549.1, 60 sec: 5539.2, 300 sec: 5529.9). Total num frames: 1177231360. Throughput: 0: 5717.1. Samples: 1177231542. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:12,280][25689] Avg episode reward: [(0, '-0.973')] [2022-07-11 10:19:13,527][26022] Updated weights on worker 0-0, policy_version 1149647 (0.00086) [2022-07-11 10:19:15,379][26022] Updated weights on worker 0-0, policy_version 1149657 (0.00094) [2022-07-11 10:19:17,216][26022] Updated weights on worker 0-0, policy_version 1149667 (0.00091) [2022-07-11 10:19:17,343][25689] Fps is (10 sec: 5724.3, 60 sec: 5574.2, 300 sec: 5535.8). Total num frames: 1177261056. Throughput: 0: 5701.5. Samples: 1177264912. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:17,344][25689] Avg episode reward: [(0, '-0.926')] [2022-07-11 10:19:19,134][26022] Updated weights on worker 0-0, policy_version 1149677 (0.00091) [2022-07-11 10:19:20,690][26022] Updated weights on worker 0-0, policy_version 1149687 (0.00091) [2022-07-11 10:19:22,406][25689] Fps is (10 sec: 5561.5, 60 sec: 5523.8, 300 sec: 5528.0). Total num frames: 1177287680. Throughput: 0: 4904.4. Samples: 1177281506. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:22,407][25689] Avg episode reward: [(0, '-0.135')] [2022-07-11 10:19:22,908][26022] Updated weights on worker 0-0, policy_version 1149697 (0.00081) [2022-07-11 10:19:24,501][26022] Updated weights on worker 0-0, policy_version 1149707 (0.00087) [2022-07-11 10:19:26,571][26022] Updated weights on worker 0-0, policy_version 1149717 (0.01315) [2022-07-11 10:19:27,410][25689] Fps is (10 sec: 5289.5, 60 sec: 5542.1, 300 sec: 5522.3). Total num frames: 1177314304. Throughput: 0: 5752.0. Samples: 1177314946. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:27,411][25689] Avg episode reward: [(0, '0.343')] [2022-07-11 10:19:28,223][26022] Updated weights on worker 0-0, policy_version 1149727 (0.00088) [2022-07-11 10:19:30,109][26022] Updated weights on worker 0-0, policy_version 1149737 (0.00082) [2022-07-11 10:19:32,042][26022] Updated weights on worker 0-0, policy_version 1149747 (0.00092) [2022-07-11 10:19:32,462][25689] Fps is (10 sec: 5498.9, 60 sec: 5509.6, 300 sec: 5528.3). Total num frames: 1177342976. Throughput: 0: 5784.3. Samples: 1177348284. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:32,463][25689] Avg episode reward: [(0, '1.460')] [2022-07-11 10:19:33,826][26022] Updated weights on worker 0-0, policy_version 1149757 (0.00083) [2022-07-11 10:19:35,674][26022] Updated weights on worker 0-0, policy_version 1149767 (0.00085) [2022-07-11 10:19:37,541][25689] Fps is (10 sec: 5559.2, 60 sec: 5509.6, 300 sec: 5524.6). Total num frames: 1177370624. Throughput: 0: 4952.1. Samples: 1177364934. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:37,542][25689] Avg episode reward: [(0, '1.276')] [2022-07-11 10:19:37,583][26022] Updated weights on worker 0-0, policy_version 1149777 (0.00090) [2022-07-11 10:19:39,469][26022] Updated weights on worker 0-0, policy_version 1149787 (0.00090) [2022-07-11 10:19:41,435][26022] Updated weights on worker 0-0, policy_version 1149797 (0.00092) [2022-07-11 10:19:42,559][25689] Fps is (10 sec: 5577.7, 60 sec: 5526.6, 300 sec: 5528.7). Total num frames: 1177399296. Throughput: 0: 5793.7. Samples: 1177398268. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:42,560][25689] Avg episode reward: [(0, '0.852')] [2022-07-11 10:19:43,055][26022] Updated weights on worker 0-0, policy_version 1149807 (0.00091) [2022-07-11 10:19:44,846][26022] Updated weights on worker 0-0, policy_version 1149817 (0.00093) [2022-07-11 10:19:47,032][26022] Updated weights on worker 0-0, policy_version 1149827 (0.00085) [2022-07-11 10:19:47,584][25689] Fps is (10 sec: 5709.6, 60 sec: 5531.2, 300 sec: 5526.3). Total num frames: 1177427968. Throughput: 0: 5794.3. Samples: 1177431842. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:47,585][25689] Avg episode reward: [(0, '1.072')] [2022-07-11 10:19:48,582][26022] Updated weights on worker 0-0, policy_version 1149837 (0.00093) [2022-07-11 10:19:50,562][26022] Updated weights on worker 0-0, policy_version 1149847 (0.00085) [2022-07-11 10:19:52,209][26022] Updated weights on worker 0-0, policy_version 1149857 (0.00086) [2022-07-11 10:19:52,661][25689] Fps is (10 sec: 5575.4, 60 sec: 5528.9, 300 sec: 5526.4). Total num frames: 1177455616. Throughput: 0: 4962.9. Samples: 1177448530. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:52,662][25689] Avg episode reward: [(0, '0.308')] [2022-07-11 10:19:54,015][26022] Updated weights on worker 0-0, policy_version 1149867 (0.00095) [2022-07-11 10:19:55,931][26022] Updated weights on worker 0-0, policy_version 1149877 (0.00082) [2022-07-11 10:19:57,308][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:19:57,317][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001149885_1177482240.pth [2022-07-11 10:19:57,318][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001147939_1175489536.pth [2022-07-11 10:19:57,723][25689] Fps is (10 sec: 5454.2, 60 sec: 5525.5, 300 sec: 5520.3). Total num frames: 1177483264. Throughput: 0: 5799.9. Samples: 1177481986. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:19:57,723][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 10:19:57,899][26022] Updated weights on worker 0-0, policy_version 1149887 (0.00083) [2022-07-11 10:19:59,576][26022] Updated weights on worker 0-0, policy_version 1149897 (0.00080) [2022-07-11 10:20:01,438][26022] Updated weights on worker 0-0, policy_version 1149907 (0.00085) [2022-07-11 10:20:02,730][25689] Fps is (10 sec: 5389.9, 60 sec: 5525.5, 300 sec: 5523.9). Total num frames: 1177509888. Throughput: 0: 5829.3. Samples: 1177515850. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:02,731][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 10:20:03,710][26022] Updated weights on worker 0-0, policy_version 1149917 (0.00088) [2022-07-11 10:20:05,563][26022] Updated weights on worker 0-0, policy_version 1149927 (0.00088) [2022-07-11 10:20:07,395][26022] Updated weights on worker 0-0, policy_version 1149937 (0.00085) [2022-07-11 10:20:07,780][25689] Fps is (10 sec: 5396.1, 60 sec: 5522.5, 300 sec: 5529.3). Total num frames: 1177537536. Throughput: 0: 5722.4. Samples: 1177547410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:07,781][25689] Avg episode reward: [(0, '0.769')] [2022-07-11 10:20:09,031][26022] Updated weights on worker 0-0, policy_version 1149947 (0.00089) [2022-07-11 10:20:11,252][26022] Updated weights on worker 0-0, policy_version 1149957 (0.00089) [2022-07-11 10:20:12,760][26022] Updated weights on worker 0-0, policy_version 1149967 (0.00094) [2022-07-11 10:20:12,843][25689] Fps is (10 sec: 5569.4, 60 sec: 5528.9, 300 sec: 5522.4). Total num frames: 1177566208. Throughput: 0: 5712.6. Samples: 1177563820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:12,843][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 10:20:14,661][26022] Updated weights on worker 0-0, policy_version 1149977 (0.00082) [2022-07-11 10:20:16,682][26022] Updated weights on worker 0-0, policy_version 1149987 (0.00078) [2022-07-11 10:20:17,845][25689] Fps is (10 sec: 5697.7, 60 sec: 5517.6, 300 sec: 5533.3). Total num frames: 1177594880. Throughput: 0: 5741.1. Samples: 1177597508. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:17,845][25689] Avg episode reward: [(0, '1.495')] [2022-07-11 10:20:18,338][26022] Updated weights on worker 0-0, policy_version 1149997 (0.00097) [2022-07-11 10:20:20,307][26022] Updated weights on worker 0-0, policy_version 1150007 (0.00080) [2022-07-11 10:20:21,873][26022] Updated weights on worker 0-0, policy_version 1150017 (0.00086) [2022-07-11 10:20:22,863][25689] Fps is (10 sec: 5518.6, 60 sec: 5521.7, 300 sec: 5526.3). Total num frames: 1177621504. Throughput: 0: 5743.8. Samples: 1177631486. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:22,863][25689] Avg episode reward: [(0, '1.434')] [2022-07-11 10:20:23,739][26022] Updated weights on worker 0-0, policy_version 1150027 (0.00088) [2022-07-11 10:20:25,555][26022] Updated weights on worker 0-0, policy_version 1150037 (0.00090) [2022-07-11 10:20:27,326][26022] Updated weights on worker 0-0, policy_version 1150047 (0.00087) [2022-07-11 10:20:27,900][25689] Fps is (10 sec: 5499.2, 60 sec: 5552.5, 300 sec: 5531.2). Total num frames: 1177650176. Throughput: 0: 5017.2. Samples: 1177648354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:27,900][25689] Avg episode reward: [(0, '1.363')] [2022-07-11 10:20:29,261][26022] Updated weights on worker 0-0, policy_version 1150057 (0.00089) [2022-07-11 10:20:31,187][26022] Updated weights on worker 0-0, policy_version 1150067 (0.00083) [2022-07-11 10:20:32,643][26022] Updated weights on worker 0-0, policy_version 1150077 (0.00085) [2022-07-11 10:20:33,012][25689] Fps is (10 sec: 5649.9, 60 sec: 5547.0, 300 sec: 5529.2). Total num frames: 1177678848. Throughput: 0: 5854.4. Samples: 1177681900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2022-07-11 10:20:33,013][25689] Avg episode reward: [(0, '1.223')] [2022-07-11 10:20:34,820][26022] Updated weights on worker 0-0, policy_version 1150087 (0.00089) [2022-07-11 10:20:36,376][26022] Updated weights on worker 0-0, policy_version 1150097 (0.00086) [2022-07-11 10:20:38,049][25689] Fps is (10 sec: 5549.4, 60 sec: 5550.9, 300 sec: 5526.8). Total num frames: 1177706496. Throughput: 0: 5837.8. Samples: 1177715456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:20:38,049][25689] Avg episode reward: [(0, '1.338')] [2022-07-11 10:20:38,581][26022] Updated weights on worker 0-0, policy_version 1150107 (0.00091) [2022-07-11 10:20:40,080][26022] Updated weights on worker 0-0, policy_version 1150117 (0.00101) [2022-07-11 10:20:42,050][26022] Updated weights on worker 0-0, policy_version 1150127 (0.00093) [2022-07-11 10:20:43,085][25689] Fps is (10 sec: 5794.9, 60 sec: 5583.1, 300 sec: 5540.5). Total num frames: 1177737216. Throughput: 0: 4987.4. Samples: 1177732340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:20:43,085][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 10:20:44,008][26022] Updated weights on worker 0-0, policy_version 1150137 (0.00091) [2022-07-11 10:20:45,710][26022] Updated weights on worker 0-0, policy_version 1150147 (0.00086) [2022-07-11 10:20:47,500][26022] Updated weights on worker 0-0, policy_version 1150157 (0.00083) [2022-07-11 10:20:48,133][25689] Fps is (10 sec: 5585.2, 60 sec: 5530.3, 300 sec: 5527.7). Total num frames: 1177762816. Throughput: 0: 5816.3. Samples: 1177766032. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:20:48,133][25689] Avg episode reward: [(0, '1.148')] [2022-07-11 10:20:49,372][26022] Updated weights on worker 0-0, policy_version 1150167 (0.00094) [2022-07-11 10:20:51,154][26022] Updated weights on worker 0-0, policy_version 1150177 (0.00086) [2022-07-11 10:20:53,075][26022] Updated weights on worker 0-0, policy_version 1150187 (0.00088) [2022-07-11 10:20:53,270][25689] Fps is (10 sec: 5328.4, 60 sec: 5541.6, 300 sec: 5529.1). Total num frames: 1177791488. Throughput: 0: 5812.8. Samples: 1177799654. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:20:53,272][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 10:20:54,792][26022] Updated weights on worker 0-0, policy_version 1150197 (0.00090) [2022-07-11 10:20:56,742][26022] Updated weights on worker 0-0, policy_version 1150207 (0.00089) [2022-07-11 10:20:58,280][25689] Fps is (10 sec: 5752.1, 60 sec: 5580.2, 300 sec: 5532.4). Total num frames: 1177821184. Throughput: 0: 4993.5. Samples: 1177816480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:20:58,280][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 10:20:58,647][26022] Updated weights on worker 0-0, policy_version 1150217 (0.00108) [2022-07-11 10:21:00,315][26022] Updated weights on worker 0-0, policy_version 1150227 (0.00087) [2022-07-11 10:21:02,797][26022] Updated weights on worker 0-0, policy_version 1150237 (0.00096) [2022-07-11 10:21:03,333][25689] Fps is (10 sec: 5494.8, 60 sec: 5559.1, 300 sec: 5528.1). Total num frames: 1177846784. Throughput: 0: 5775.2. Samples: 1177849280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:03,334][25689] Avg episode reward: [(0, '-0.648')] [2022-07-11 10:21:04,353][26022] Updated weights on worker 0-0, policy_version 1150247 (0.00091) [2022-07-11 10:21:06,293][26022] Updated weights on worker 0-0, policy_version 1150257 (0.00091) [2022-07-11 10:21:07,995][26022] Updated weights on worker 0-0, policy_version 1150267 (0.00087) [2022-07-11 10:21:08,364][25689] Fps is (10 sec: 5179.1, 60 sec: 5544.0, 300 sec: 5526.7). Total num frames: 1177873408. Throughput: 0: 5720.3. Samples: 1177881758. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:08,365][25689] Avg episode reward: [(0, '-0.832')] [2022-07-11 10:21:09,628][26022] Updated weights on worker 0-0, policy_version 1150277 (0.00088) [2022-07-11 10:21:11,640][26022] Updated weights on worker 0-0, policy_version 1150287 (0.00083) [2022-07-11 10:21:13,310][26022] Updated weights on worker 0-0, policy_version 1150297 (0.00087) [2022-07-11 10:21:13,458][25689] Fps is (10 sec: 5664.0, 60 sec: 5574.9, 300 sec: 5535.6). Total num frames: 1177904128. Throughput: 0: 4909.5. Samples: 1177898766. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:13,459][25689] Avg episode reward: [(0, '-1.180')] [2022-07-11 10:21:15,245][26022] Updated weights on worker 0-0, policy_version 1150307 (0.00089) [2022-07-11 10:21:17,144][26022] Updated weights on worker 0-0, policy_version 1150317 (0.00089) [2022-07-11 10:21:18,472][25689] Fps is (10 sec: 5774.2, 60 sec: 5556.8, 300 sec: 5536.1). Total num frames: 1177931776. Throughput: 0: 5747.3. Samples: 1177932530. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:18,474][25689] Avg episode reward: [(0, '-1.218')] [2022-07-11 10:21:18,871][26022] Updated weights on worker 0-0, policy_version 1150327 (0.00091) [2022-07-11 10:21:20,834][26022] Updated weights on worker 0-0, policy_version 1150337 (0.00090) [2022-07-11 10:21:22,439][26022] Updated weights on worker 0-0, policy_version 1150347 (0.00084) [2022-07-11 10:21:23,506][25689] Fps is (10 sec: 5502.9, 60 sec: 5572.2, 300 sec: 5532.5). Total num frames: 1177959424. Throughput: 0: 5787.2. Samples: 1177966024. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:23,507][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 10:21:24,506][26022] Updated weights on worker 0-0, policy_version 1150357 (0.00089) [2022-07-11 10:21:26,278][26022] Updated weights on worker 0-0, policy_version 1150367 (0.00086) [2022-07-11 10:21:28,316][26022] Updated weights on worker 0-0, policy_version 1150377 (0.00083) [2022-07-11 10:21:28,523][25689] Fps is (10 sec: 5501.7, 60 sec: 5557.3, 300 sec: 5530.3). Total num frames: 1177987072. Throughput: 0: 5008.0. Samples: 1177982716. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:28,523][25689] Avg episode reward: [(0, '0.146')] [2022-07-11 10:21:30,118][26022] Updated weights on worker 0-0, policy_version 1150387 (0.00085) [2022-07-11 10:21:31,921][26022] Updated weights on worker 0-0, policy_version 1150397 (0.00083) [2022-07-11 10:21:33,654][25689] Fps is (10 sec: 5550.2, 60 sec: 5555.5, 300 sec: 5534.9). Total num frames: 1178015744. Throughput: 0: 5797.9. Samples: 1178015860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:33,654][25689] Avg episode reward: [(0, '1.186')] [2022-07-11 10:21:33,750][26022] Updated weights on worker 0-0, policy_version 1150407 (0.00087) [2022-07-11 10:21:35,699][26022] Updated weights on worker 0-0, policy_version 1150417 (0.00083) [2022-07-11 10:21:37,398][26022] Updated weights on worker 0-0, policy_version 1150427 (0.00097) [2022-07-11 10:21:38,684][25689] Fps is (10 sec: 5542.4, 60 sec: 5556.1, 300 sec: 5531.1). Total num frames: 1178043392. Throughput: 0: 5784.9. Samples: 1178049456. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:38,686][25689] Avg episode reward: [(0, '1.227')] [2022-07-11 10:21:39,265][26022] Updated weights on worker 0-0, policy_version 1150437 (0.00089) [2022-07-11 10:21:41,010][26022] Updated weights on worker 0-0, policy_version 1150447 (0.01138) [2022-07-11 10:21:42,919][26022] Updated weights on worker 0-0, policy_version 1150457 (0.00086) [2022-07-11 10:21:43,730][25689] Fps is (10 sec: 5487.5, 60 sec: 5504.5, 300 sec: 5532.1). Total num frames: 1178071040. Throughput: 0: 4948.3. Samples: 1178066100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:43,732][25689] Avg episode reward: [(0, '2.355')] [2022-07-11 10:21:44,648][26022] Updated weights on worker 0-0, policy_version 1150467 (0.00101) [2022-07-11 10:21:46,750][26022] Updated weights on worker 0-0, policy_version 1150477 (0.00088) [2022-07-11 10:21:48,468][26022] Updated weights on worker 0-0, policy_version 1150487 (0.00093) [2022-07-11 10:21:48,743][25689] Fps is (10 sec: 5701.2, 60 sec: 5575.3, 300 sec: 5544.7). Total num frames: 1178100736. Throughput: 0: 5779.7. Samples: 1178099582. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:48,743][25689] Avg episode reward: [(0, '2.088')] [2022-07-11 10:21:50,118][26022] Updated weights on worker 0-0, policy_version 1150497 (0.00088) [2022-07-11 10:21:52,144][26022] Updated weights on worker 0-0, policy_version 1150507 (0.00087) [2022-07-11 10:21:53,802][25689] Fps is (10 sec: 5693.6, 60 sec: 5565.6, 300 sec: 5540.5). Total num frames: 1178128384. Throughput: 0: 5826.2. Samples: 1178133248. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:53,811][25689] Avg episode reward: [(0, '1.475')] [2022-07-11 10:21:53,975][26022] Updated weights on worker 0-0, policy_version 1150517 (0.00087) [2022-07-11 10:21:55,565][26022] Updated weights on worker 0-0, policy_version 1150527 (0.00087) [2022-07-11 10:21:57,528][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:21:57,544][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001150536_1178148864.pth [2022-07-11 10:21:57,547][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001148589_1176155136.pth [2022-07-11 10:21:57,651][26022] Updated weights on worker 0-0, policy_version 1150537 (0.00096) [2022-07-11 10:21:58,829][25689] Fps is (10 sec: 5583.6, 60 sec: 5547.1, 300 sec: 5536.8). Total num frames: 1178157056. Throughput: 0: 4991.0. Samples: 1178150000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:21:58,830][25689] Avg episode reward: [(0, '1.645')] [2022-07-11 10:21:59,446][26022] Updated weights on worker 0-0, policy_version 1150547 (0.00087) [2022-07-11 10:22:01,492][26022] Updated weights on worker 0-0, policy_version 1150557 (0.00091) [2022-07-11 10:22:03,519][26022] Updated weights on worker 0-0, policy_version 1150567 (0.00087) [2022-07-11 10:22:03,856][25689] Fps is (10 sec: 5296.3, 60 sec: 5532.6, 300 sec: 5543.6). Total num frames: 1178181632. Throughput: 0: 5735.6. Samples: 1178181532. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:03,856][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 10:22:05,326][26022] Updated weights on worker 0-0, policy_version 1150577 (0.00086) [2022-07-11 10:22:07,160][26022] Updated weights on worker 0-0, policy_version 1150587 (0.00083) [2022-07-11 10:22:08,876][25689] Fps is (10 sec: 5198.1, 60 sec: 5550.5, 300 sec: 5538.2). Total num frames: 1178209280. Throughput: 0: 5721.8. Samples: 1178214780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:08,877][25689] Avg episode reward: [(0, '1.521')] [2022-07-11 10:22:09,080][26022] Updated weights on worker 0-0, policy_version 1150597 (0.00089) [2022-07-11 10:22:10,892][26022] Updated weights on worker 0-0, policy_version 1150607 (0.00087) [2022-07-11 10:22:12,721][26022] Updated weights on worker 0-0, policy_version 1150617 (0.00097) [2022-07-11 10:22:13,947][25689] Fps is (10 sec: 5581.2, 60 sec: 5518.7, 300 sec: 5540.9). Total num frames: 1178237952. Throughput: 0: 4874.8. Samples: 1178231450. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:13,947][25689] Avg episode reward: [(0, '1.236')] [2022-07-11 10:22:14,669][26022] Updated weights on worker 0-0, policy_version 1150627 (0.00092) [2022-07-11 10:22:16,392][26022] Updated weights on worker 0-0, policy_version 1150637 (0.00089) [2022-07-11 10:22:18,128][26022] Updated weights on worker 0-0, policy_version 1150647 (0.00084) [2022-07-11 10:22:18,960][25689] Fps is (10 sec: 5686.6, 60 sec: 5535.8, 300 sec: 5544.3). Total num frames: 1178266624. Throughput: 0: 5706.6. Samples: 1178264878. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:18,960][25689] Avg episode reward: [(0, '1.611')] [2022-07-11 10:22:20,213][26022] Updated weights on worker 0-0, policy_version 1150657 (0.00086) [2022-07-11 10:22:21,802][26022] Updated weights on worker 0-0, policy_version 1150667 (0.00092) [2022-07-11 10:22:23,870][26022] Updated weights on worker 0-0, policy_version 1150677 (0.00104) [2022-07-11 10:22:23,970][25689] Fps is (10 sec: 5517.0, 60 sec: 5521.1, 300 sec: 5534.8). Total num frames: 1178293248. Throughput: 0: 5814.2. Samples: 1178298478. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:23,970][25689] Avg episode reward: [(0, '1.898')] [2022-07-11 10:22:25,441][26022] Updated weights on worker 0-0, policy_version 1150687 (0.00087) [2022-07-11 10:22:27,475][26022] Updated weights on worker 0-0, policy_version 1150697 (0.00083) [2022-07-11 10:22:28,985][25689] Fps is (10 sec: 5516.0, 60 sec: 5538.1, 300 sec: 5545.8). Total num frames: 1178321920. Throughput: 0: 4993.0. Samples: 1178315182. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:28,985][25689] Avg episode reward: [(0, '0.659')] [2022-07-11 10:22:29,147][26022] Updated weights on worker 0-0, policy_version 1150707 (0.00091) [2022-07-11 10:22:31,098][26022] Updated weights on worker 0-0, policy_version 1150717 (0.00082) [2022-07-11 10:22:33,058][26022] Updated weights on worker 0-0, policy_version 1150727 (0.00867) [2022-07-11 10:22:34,036][25689] Fps is (10 sec: 5696.7, 60 sec: 5545.5, 300 sec: 5538.4). Total num frames: 1178350592. Throughput: 0: 5839.1. Samples: 1178348752. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:34,037][25689] Avg episode reward: [(0, '0.625')] [2022-07-11 10:22:34,783][26022] Updated weights on worker 0-0, policy_version 1150737 (0.00088) [2022-07-11 10:22:36,621][26022] Updated weights on worker 0-0, policy_version 1150747 (0.00092) [2022-07-11 10:22:38,447][26022] Updated weights on worker 0-0, policy_version 1150757 (0.00096) [2022-07-11 10:22:39,061][25689] Fps is (10 sec: 5589.2, 60 sec: 5546.0, 300 sec: 5544.9). Total num frames: 1178378240. Throughput: 0: 5845.7. Samples: 1178382384. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:39,063][25689] Avg episode reward: [(0, '0.789')] [2022-07-11 10:22:40,162][26022] Updated weights on worker 0-0, policy_version 1150767 (0.00086) [2022-07-11 10:22:42,102][26022] Updated weights on worker 0-0, policy_version 1150777 (0.00086) [2022-07-11 10:22:43,735][26022] Updated weights on worker 0-0, policy_version 1150787 (0.00087) [2022-07-11 10:22:44,079][25689] Fps is (10 sec: 5709.5, 60 sec: 5582.5, 300 sec: 5548.3). Total num frames: 1178407936. Throughput: 0: 5005.9. Samples: 1178399146. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:44,080][25689] Avg episode reward: [(0, '0.918')] [2022-07-11 10:22:45,885][26022] Updated weights on worker 0-0, policy_version 1150797 (0.00097) [2022-07-11 10:22:47,523][26022] Updated weights on worker 0-0, policy_version 1150807 (0.00081) [2022-07-11 10:22:49,082][25689] Fps is (10 sec: 5518.3, 60 sec: 5515.5, 300 sec: 5536.5). Total num frames: 1178433536. Throughput: 0: 5860.1. Samples: 1178432952. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:49,083][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 10:22:49,427][26022] Updated weights on worker 0-0, policy_version 1150817 (0.00087) [2022-07-11 10:22:51,155][26022] Updated weights on worker 0-0, policy_version 1150827 (0.00086) [2022-07-11 10:22:52,958][26022] Updated weights on worker 0-0, policy_version 1150837 (0.00084) [2022-07-11 10:22:54,145][25689] Fps is (10 sec: 5493.6, 60 sec: 5549.1, 300 sec: 5546.1). Total num frames: 1178463232. Throughput: 0: 5873.5. Samples: 1178466862. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:54,145][25689] Avg episode reward: [(0, '-0.453')] [2022-07-11 10:22:54,983][26022] Updated weights on worker 0-0, policy_version 1150847 (0.00090) [2022-07-11 10:22:56,667][26022] Updated weights on worker 0-0, policy_version 1150857 (0.00087) [2022-07-11 10:22:58,499][26022] Updated weights on worker 0-0, policy_version 1150867 (0.00087) [2022-07-11 10:22:59,147][25689] Fps is (10 sec: 5697.4, 60 sec: 5534.5, 300 sec: 5542.9). Total num frames: 1178490880. Throughput: 0: 5888.8. Samples: 1178500662. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:22:59,147][25689] Avg episode reward: [(0, '-0.481')] [2022-07-11 10:23:00,305][26022] Updated weights on worker 0-0, policy_version 1150877 (0.00093) [2022-07-11 10:23:02,507][26022] Updated weights on worker 0-0, policy_version 1150887 (0.00097) [2022-07-11 10:23:04,182][25689] Fps is (10 sec: 5304.9, 60 sec: 5550.6, 300 sec: 5539.3). Total num frames: 1178516480. Throughput: 0: 5782.4. Samples: 1178515388. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:04,183][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 10:23:04,485][26022] Updated weights on worker 0-0, policy_version 1150897 (0.00089) [2022-07-11 10:23:06,234][26022] Updated weights on worker 0-0, policy_version 1150907 (0.00087) [2022-07-11 10:23:08,053][26022] Updated weights on worker 0-0, policy_version 1150917 (0.00094) [2022-07-11 10:23:09,185][25689] Fps is (10 sec: 5304.5, 60 sec: 5552.2, 300 sec: 5540.9). Total num frames: 1178544128. Throughput: 0: 5752.4. Samples: 1178548594. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:09,188][25689] Avg episode reward: [(0, '-0.101')] [2022-07-11 10:23:09,840][26022] Updated weights on worker 0-0, policy_version 1150927 (0.00085) [2022-07-11 10:23:11,819][26022] Updated weights on worker 0-0, policy_version 1150937 (0.00086) [2022-07-11 10:23:13,644][26022] Updated weights on worker 0-0, policy_version 1150947 (0.00096) [2022-07-11 10:23:14,294][25689] Fps is (10 sec: 5468.6, 60 sec: 5531.7, 300 sec: 5540.3). Total num frames: 1178571776. Throughput: 0: 5685.6. Samples: 1178581420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:14,294][25689] Avg episode reward: [(0, '-1.066')] [2022-07-11 10:23:15,390][26022] Updated weights on worker 0-0, policy_version 1150957 (0.00090) [2022-07-11 10:23:17,376][26022] Updated weights on worker 0-0, policy_version 1150967 (0.00093) [2022-07-11 10:23:19,317][25689] Fps is (10 sec: 5457.3, 60 sec: 5513.8, 300 sec: 5534.2). Total num frames: 1178599424. Throughput: 0: 4818.8. Samples: 1178597862. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:19,318][25689] Avg episode reward: [(0, '-1.074')] [2022-07-11 10:23:19,326][26022] Updated weights on worker 0-0, policy_version 1150977 (0.00089) [2022-07-11 10:23:21,065][26022] Updated weights on worker 0-0, policy_version 1150987 (0.00095) [2022-07-11 10:23:23,018][26022] Updated weights on worker 0-0, policy_version 1150997 (0.00089) [2022-07-11 10:23:24,329][25689] Fps is (10 sec: 5714.0, 60 sec: 5564.5, 300 sec: 5548.1). Total num frames: 1178629120. Throughput: 0: 5760.5. Samples: 1178631446. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:24,330][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 10:23:24,587][26022] Updated weights on worker 0-0, policy_version 1151007 (0.00084) [2022-07-11 10:23:26,562][26022] Updated weights on worker 0-0, policy_version 1151017 (0.00086) [2022-07-11 10:23:28,569][26022] Updated weights on worker 0-0, policy_version 1151027 (0.00086) [2022-07-11 10:23:29,353][25689] Fps is (10 sec: 5611.9, 60 sec: 5529.8, 300 sec: 5535.1). Total num frames: 1178655744. Throughput: 0: 5771.7. Samples: 1178665000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:29,354][25689] Avg episode reward: [(0, '-0.133')] [2022-07-11 10:23:30,106][26022] Updated weights on worker 0-0, policy_version 1151037 (0.00092) [2022-07-11 10:23:32,300][26022] Updated weights on worker 0-0, policy_version 1151047 (0.00090) [2022-07-11 10:23:34,010][26022] Updated weights on worker 0-0, policy_version 1151057 (0.00090) [2022-07-11 10:23:34,405][25689] Fps is (10 sec: 5386.6, 60 sec: 5512.7, 300 sec: 5535.6). Total num frames: 1178683392. Throughput: 0: 4977.3. Samples: 1178681520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:34,405][25689] Avg episode reward: [(0, '-1.330')] [2022-07-11 10:23:35,803][26022] Updated weights on worker 0-0, policy_version 1151067 (0.00093) [2022-07-11 10:23:38,004][26022] Updated weights on worker 0-0, policy_version 1151077 (0.00087) [2022-07-11 10:23:39,399][26022] Updated weights on worker 0-0, policy_version 1151087 (0.00440) [2022-07-11 10:23:39,493][25689] Fps is (10 sec: 5655.3, 60 sec: 5540.9, 300 sec: 5541.2). Total num frames: 1178713088. Throughput: 0: 5819.0. Samples: 1178715264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:39,494][25689] Avg episode reward: [(0, '-1.932')] [2022-07-11 10:23:41,467][26022] Updated weights on worker 0-0, policy_version 1151097 (0.00083) [2022-07-11 10:23:43,047][26022] Updated weights on worker 0-0, policy_version 1151107 (0.00087) [2022-07-11 10:23:44,512][25689] Fps is (10 sec: 5673.9, 60 sec: 5507.0, 300 sec: 5538.9). Total num frames: 1178740736. Throughput: 0: 5820.4. Samples: 1178748914. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:44,512][25689] Avg episode reward: [(0, '-1.157')] [2022-07-11 10:23:44,806][26022] Updated weights on worker 0-0, policy_version 1151117 (0.00084) [2022-07-11 10:23:47,124][26022] Updated weights on worker 0-0, policy_version 1151127 (0.00083) [2022-07-11 10:23:48,520][26022] Updated weights on worker 0-0, policy_version 1151137 (0.00086) [2022-07-11 10:23:49,583][25689] Fps is (10 sec: 5480.5, 60 sec: 5534.6, 300 sec: 5538.5). Total num frames: 1178768384. Throughput: 0: 4960.2. Samples: 1178765342. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:49,583][25689] Avg episode reward: [(0, '-1.148')] [2022-07-11 10:23:50,624][26022] Updated weights on worker 0-0, policy_version 1151147 (0.00095) [2022-07-11 10:23:52,434][26022] Updated weights on worker 0-0, policy_version 1151157 (0.00082) [2022-07-11 10:23:54,024][26022] Updated weights on worker 0-0, policy_version 1151167 (0.00094) [2022-07-11 10:23:54,634][25689] Fps is (10 sec: 5665.3, 60 sec: 5535.7, 300 sec: 5544.9). Total num frames: 1178798080. Throughput: 0: 5784.0. Samples: 1178798522. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:54,634][25689] Avg episode reward: [(0, '-0.850')] [2022-07-11 10:23:56,332][26022] Updated weights on worker 0-0, policy_version 1151177 (0.00088) [2022-07-11 10:23:57,636][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:23:57,650][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001151187_1178815488.pth [2022-07-11 10:23:57,650][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001149235_1176816640.pth [2022-07-11 10:23:57,653][26022] Updated weights on worker 0-0, policy_version 1151187 (0.00091) [2022-07-11 10:23:59,686][25689] Fps is (10 sec: 5574.4, 60 sec: 5514.2, 300 sec: 5544.0). Total num frames: 1178824704. Throughput: 0: 5792.0. Samples: 1178832220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:23:59,687][25689] Avg episode reward: [(0, '-0.723')] [2022-07-11 10:23:59,783][26022] Updated weights on worker 0-0, policy_version 1151197 (0.00086) [2022-07-11 10:24:01,503][26022] Updated weights on worker 0-0, policy_version 1151207 (0.00108) [2022-07-11 10:24:03,691][26022] Updated weights on worker 0-0, policy_version 1151217 (0.00054) [2022-07-11 10:24:04,762][25689] Fps is (10 sec: 5257.4, 60 sec: 5527.4, 300 sec: 5539.5). Total num frames: 1178851328. Throughput: 0: 4893.3. Samples: 1178848004. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:04,762][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 10:24:05,700][26022] Updated weights on worker 0-0, policy_version 1151227 (0.00100) [2022-07-11 10:24:07,382][26022] Updated weights on worker 0-0, policy_version 1151237 (0.00089) [2022-07-11 10:24:09,489][26022] Updated weights on worker 0-0, policy_version 1151247 (0.00089) [2022-07-11 10:24:09,819][25689] Fps is (10 sec: 5456.9, 60 sec: 5539.3, 300 sec: 5540.9). Total num frames: 1178880000. Throughput: 0: 5692.3. Samples: 1178880534. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:09,830][25689] Avg episode reward: [(0, '1.187')] [2022-07-11 10:24:11,212][26022] Updated weights on worker 0-0, policy_version 1151257 (0.00090) [2022-07-11 10:24:12,817][26022] Updated weights on worker 0-0, policy_version 1151267 (0.00084) [2022-07-11 10:24:14,910][25689] Fps is (10 sec: 5448.5, 60 sec: 5524.0, 300 sec: 5530.1). Total num frames: 1178906624. Throughput: 0: 5701.2. Samples: 1178914124. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:14,912][25689] Avg episode reward: [(0, '1.034')] [2022-07-11 10:24:14,932][26022] Updated weights on worker 0-0, policy_version 1151277 (0.00086) [2022-07-11 10:24:16,448][26022] Updated weights on worker 0-0, policy_version 1151287 (0.00086) [2022-07-11 10:24:18,727][26022] Updated weights on worker 0-0, policy_version 1151297 (0.00089) [2022-07-11 10:24:19,931][25689] Fps is (10 sec: 5569.9, 60 sec: 5558.1, 300 sec: 5541.2). Total num frames: 1178936320. Throughput: 0: 4851.4. Samples: 1178930434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:19,931][25689] Avg episode reward: [(0, '1.285')] [2022-07-11 10:24:20,424][26022] Updated weights on worker 0-0, policy_version 1151307 (0.00085) [2022-07-11 10:24:22,235][26022] Updated weights on worker 0-0, policy_version 1151317 (0.00088) [2022-07-11 10:24:23,981][26022] Updated weights on worker 0-0, policy_version 1151327 (0.00085) [2022-07-11 10:24:24,958][25689] Fps is (10 sec: 5605.3, 60 sec: 5506.1, 300 sec: 5540.8). Total num frames: 1178962944. Throughput: 0: 5747.9. Samples: 1178964090. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:24,959][25689] Avg episode reward: [(0, '1.000')] [2022-07-11 10:24:25,929][26022] Updated weights on worker 0-0, policy_version 1151337 (0.01130) [2022-07-11 10:24:27,722][26022] Updated weights on worker 0-0, policy_version 1151347 (0.00088) [2022-07-11 10:24:29,447][26022] Updated weights on worker 0-0, policy_version 1151357 (0.00087) [2022-07-11 10:24:29,967][25689] Fps is (10 sec: 5509.6, 60 sec: 5541.2, 300 sec: 5541.6). Total num frames: 1178991616. Throughput: 0: 5809.7. Samples: 1178997586. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:29,969][25689] Avg episode reward: [(0, '0.025')] [2022-07-11 10:24:31,435][26022] Updated weights on worker 0-0, policy_version 1151367 (0.00083) [2022-07-11 10:24:33,207][26022] Updated weights on worker 0-0, policy_version 1151377 (0.00083) [2022-07-11 10:24:34,966][26022] Updated weights on worker 0-0, policy_version 1151387 (0.00085) [2022-07-11 10:24:35,073][25689] Fps is (10 sec: 5669.2, 60 sec: 5553.1, 300 sec: 5544.5). Total num frames: 1179020288. Throughput: 0: 4974.4. Samples: 1179014422. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2022-07-11 10:24:35,074][25689] Avg episode reward: [(0, '-0.585')] [2022-07-11 10:24:36,787][26022] Updated weights on worker 0-0, policy_version 1151397 (0.00090) [2022-07-11 10:24:38,737][26022] Updated weights on worker 0-0, policy_version 1151407 (0.00090) [2022-07-11 10:24:40,119][25689] Fps is (10 sec: 5547.8, 60 sec: 5523.2, 300 sec: 5540.5). Total num frames: 1179047936. Throughput: 0: 5839.4. Samples: 1179048320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:24:40,119][25689] Avg episode reward: [(0, '-0.916')] [2022-07-11 10:24:40,463][26022] Updated weights on worker 0-0, policy_version 1151417 (0.00079) [2022-07-11 10:24:42,417][26022] Updated weights on worker 0-0, policy_version 1151427 (0.00085) [2022-07-11 10:24:44,148][26022] Updated weights on worker 0-0, policy_version 1151437 (0.00086) [2022-07-11 10:24:45,163][25689] Fps is (10 sec: 5582.0, 60 sec: 5537.8, 300 sec: 5540.2). Total num frames: 1179076608. Throughput: 0: 5843.1. Samples: 1179082148. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:24:45,163][25689] Avg episode reward: [(0, '-0.085')] [2022-07-11 10:24:45,925][26022] Updated weights on worker 0-0, policy_version 1151447 (0.00086) [2022-07-11 10:24:47,940][26022] Updated weights on worker 0-0, policy_version 1151457 (0.00091) [2022-07-11 10:24:49,603][26022] Updated weights on worker 0-0, policy_version 1151467 (0.00095) [2022-07-11 10:24:50,165][25689] Fps is (10 sec: 5606.0, 60 sec: 5544.1, 300 sec: 5541.6). Total num frames: 1179104256. Throughput: 0: 4999.3. Samples: 1179098562. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:24:50,166][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 10:24:51,657][26022] Updated weights on worker 0-0, policy_version 1151477 (0.00083) [2022-07-11 10:24:53,477][26022] Updated weights on worker 0-0, policy_version 1151487 (0.00083) [2022-07-11 10:24:55,172][26022] Updated weights on worker 0-0, policy_version 1151497 (0.00089) [2022-07-11 10:24:55,233][25689] Fps is (10 sec: 5592.4, 60 sec: 5525.6, 300 sec: 5544.9). Total num frames: 1179132928. Throughput: 0: 5828.5. Samples: 1179131926. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:24:55,234][25689] Avg episode reward: [(0, '-0.418')] [2022-07-11 10:24:57,219][26022] Updated weights on worker 0-0, policy_version 1151507 (0.00090) [2022-07-11 10:24:58,781][26022] Updated weights on worker 0-0, policy_version 1151517 (0.00086) [2022-07-11 10:25:00,235][25689] Fps is (10 sec: 5592.6, 60 sec: 5547.1, 300 sec: 5548.4). Total num frames: 1179160576. Throughput: 0: 5834.1. Samples: 1179165684. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:00,236][25689] Avg episode reward: [(0, '-0.875')] [2022-07-11 10:25:00,759][26022] Updated weights on worker 0-0, policy_version 1151527 (0.00085) [2022-07-11 10:25:03,021][26022] Updated weights on worker 0-0, policy_version 1151537 (0.00087) [2022-07-11 10:25:04,664][26022] Updated weights on worker 0-0, policy_version 1151547 (0.00088) [2022-07-11 10:25:05,282][25689] Fps is (10 sec: 5401.0, 60 sec: 5549.7, 300 sec: 5545.1). Total num frames: 1179187200. Throughput: 0: 4888.0. Samples: 1179180492. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:05,282][25689] Avg episode reward: [(0, '-0.155')] [2022-07-11 10:25:06,516][26022] Updated weights on worker 0-0, policy_version 1151557 (0.00085) [2022-07-11 10:25:08,320][26022] Updated weights on worker 0-0, policy_version 1151567 (0.00087) [2022-07-11 10:25:10,131][26022] Updated weights on worker 0-0, policy_version 1151577 (0.00091) [2022-07-11 10:25:10,292][25689] Fps is (10 sec: 5396.6, 60 sec: 5537.2, 300 sec: 5542.6). Total num frames: 1179214848. Throughput: 0: 5745.2. Samples: 1179214196. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:10,293][25689] Avg episode reward: [(0, '0.052')] [2022-07-11 10:25:12,001][26022] Updated weights on worker 0-0, policy_version 1151587 (0.00085) [2022-07-11 10:25:13,691][26022] Updated weights on worker 0-0, policy_version 1151597 (0.00082) [2022-07-11 10:25:15,358][25689] Fps is (10 sec: 5488.0, 60 sec: 5556.4, 300 sec: 5538.0). Total num frames: 1179242496. Throughput: 0: 5763.7. Samples: 1179247916. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:15,358][25689] Avg episode reward: [(0, '-1.690')] [2022-07-11 10:25:15,748][26022] Updated weights on worker 0-0, policy_version 1151607 (0.00081) [2022-07-11 10:25:17,248][26022] Updated weights on worker 0-0, policy_version 1151617 (0.00088) [2022-07-11 10:25:19,487][26022] Updated weights on worker 0-0, policy_version 1151627 (0.00095) [2022-07-11 10:25:20,366][25689] Fps is (10 sec: 5692.2, 60 sec: 5557.5, 300 sec: 5548.5). Total num frames: 1179272192. Throughput: 0: 4925.2. Samples: 1179264832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:20,371][25689] Avg episode reward: [(0, '-2.236')] [2022-07-11 10:25:20,920][26022] Updated weights on worker 0-0, policy_version 1151637 (0.00085) [2022-07-11 10:25:23,075][26022] Updated weights on worker 0-0, policy_version 1151647 (0.00097) [2022-07-11 10:25:24,890][26022] Updated weights on worker 0-0, policy_version 1151657 (0.00093) [2022-07-11 10:25:25,376][25689] Fps is (10 sec: 5621.5, 60 sec: 5559.1, 300 sec: 5542.1). Total num frames: 1179298816. Throughput: 0: 5858.6. Samples: 1179298218. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:25,380][25689] Avg episode reward: [(0, '-2.121')] [2022-07-11 10:25:26,787][26022] Updated weights on worker 0-0, policy_version 1151667 (0.00091) [2022-07-11 10:25:28,496][26022] Updated weights on worker 0-0, policy_version 1151677 (0.00093) [2022-07-11 10:25:30,381][25689] Fps is (10 sec: 5419.1, 60 sec: 5542.5, 300 sec: 5540.6). Total num frames: 1179326464. Throughput: 0: 5836.2. Samples: 1179331440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:30,383][25689] Avg episode reward: [(0, '-2.824')] [2022-07-11 10:25:30,487][26022] Updated weights on worker 0-0, policy_version 1151687 (0.00090) [2022-07-11 10:25:32,248][26022] Updated weights on worker 0-0, policy_version 1151697 (0.00089) [2022-07-11 10:25:34,139][26022] Updated weights on worker 0-0, policy_version 1151707 (0.00104) [2022-07-11 10:25:35,450][25689] Fps is (10 sec: 5590.7, 60 sec: 5545.9, 300 sec: 5543.5). Total num frames: 1179355136. Throughput: 0: 5807.4. Samples: 1179364602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:35,451][25689] Avg episode reward: [(0, '-2.558')] [2022-07-11 10:25:36,064][26022] Updated weights on worker 0-0, policy_version 1151717 (0.00088) [2022-07-11 10:25:37,604][26022] Updated weights on worker 0-0, policy_version 1151727 (0.00087) [2022-07-11 10:25:39,638][26022] Updated weights on worker 0-0, policy_version 1151737 (0.00094) [2022-07-11 10:25:40,548][25689] Fps is (10 sec: 5640.3, 60 sec: 5558.1, 300 sec: 5535.4). Total num frames: 1179383808. Throughput: 0: 5778.1. Samples: 1179381444. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:40,549][25689] Avg episode reward: [(0, '-3.050')] [2022-07-11 10:25:41,396][26022] Updated weights on worker 0-0, policy_version 1151747 (0.00087) [2022-07-11 10:25:43,162][26022] Updated weights on worker 0-0, policy_version 1151757 (0.00060) [2022-07-11 10:25:45,182][26022] Updated weights on worker 0-0, policy_version 1151767 (0.00089) [2022-07-11 10:25:45,579][25689] Fps is (10 sec: 5560.4, 60 sec: 5542.3, 300 sec: 5542.6). Total num frames: 1179411456. Throughput: 0: 5815.2. Samples: 1179415700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:45,581][25689] Avg episode reward: [(0, '-0.944')] [2022-07-11 10:25:46,560][26022] Updated weights on worker 0-0, policy_version 1151777 (0.00052) [2022-07-11 10:25:48,763][26022] Updated weights on worker 0-0, policy_version 1151787 (0.00083) [2022-07-11 10:25:50,243][26022] Updated weights on worker 0-0, policy_version 1151797 (0.00095) [2022-07-11 10:25:50,680][25689] Fps is (10 sec: 5558.6, 60 sec: 5550.2, 300 sec: 5543.3). Total num frames: 1179440128. Throughput: 0: 5798.0. Samples: 1179449132. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:50,681][25689] Avg episode reward: [(0, '-0.828')] [2022-07-11 10:25:52,306][26022] Updated weights on worker 0-0, policy_version 1151807 (0.00083) [2022-07-11 10:25:54,389][26022] Updated weights on worker 0-0, policy_version 1151817 (0.00084) [2022-07-11 10:25:55,767][25689] Fps is (10 sec: 5729.2, 60 sec: 5565.4, 300 sec: 5541.9). Total num frames: 1179469824. Throughput: 0: 4984.7. Samples: 1179465874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:25:55,768][25689] Avg episode reward: [(0, '-0.583')] [2022-07-11 10:25:55,836][26022] Updated weights on worker 0-0, policy_version 1151827 (0.00106) [2022-07-11 10:25:57,928][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:25:57,941][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001151837_1179481088.pth [2022-07-11 10:25:57,946][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001149885_1177482240.pth [2022-07-11 10:25:57,948][26022] Updated weights on worker 0-0, policy_version 1151837 (0.00089) [2022-07-11 10:25:59,879][26022] Updated weights on worker 0-0, policy_version 1151847 (0.00091) [2022-07-11 10:26:00,858][25689] Fps is (10 sec: 5533.8, 60 sec: 5540.4, 300 sec: 5544.6). Total num frames: 1179496448. Throughput: 0: 5798.6. Samples: 1179499210. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:00,858][25689] Avg episode reward: [(0, '-1.249')] [2022-07-11 10:26:01,696][26022] Updated weights on worker 0-0, policy_version 1151857 (0.00096) [2022-07-11 10:26:03,992][26022] Updated weights on worker 0-0, policy_version 1151867 (0.00087) [2022-07-11 10:26:05,461][26022] Updated weights on worker 0-0, policy_version 1151877 (0.00087) [2022-07-11 10:26:05,887][25689] Fps is (10 sec: 5261.8, 60 sec: 5542.0, 300 sec: 5544.7). Total num frames: 1179523072. Throughput: 0: 5649.5. Samples: 1179530428. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:05,887][25689] Avg episode reward: [(0, '-0.742')] [2022-07-11 10:26:07,681][26022] Updated weights on worker 0-0, policy_version 1151887 (0.00086) [2022-07-11 10:26:09,204][26022] Updated weights on worker 0-0, policy_version 1151897 (0.00093) [2022-07-11 10:26:10,922][25689] Fps is (10 sec: 5290.8, 60 sec: 5522.8, 300 sec: 5532.0). Total num frames: 1179549696. Throughput: 0: 4848.5. Samples: 1179547274. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:10,922][25689] Avg episode reward: [(0, '-0.760')] [2022-07-11 10:26:11,313][26022] Updated weights on worker 0-0, policy_version 1151907 (0.00084) [2022-07-11 10:26:12,991][26022] Updated weights on worker 0-0, policy_version 1151917 (0.00085) [2022-07-11 10:26:14,876][26022] Updated weights on worker 0-0, policy_version 1151927 (0.00078) [2022-07-11 10:26:16,048][25689] Fps is (10 sec: 5441.9, 60 sec: 5534.2, 300 sec: 5533.4). Total num frames: 1179578368. Throughput: 0: 5661.6. Samples: 1179580696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:16,048][25689] Avg episode reward: [(0, '-0.866')] [2022-07-11 10:26:16,582][26022] Updated weights on worker 0-0, policy_version 1151937 (0.00087) [2022-07-11 10:26:18,603][26022] Updated weights on worker 0-0, policy_version 1151947 (0.00083) [2022-07-11 10:26:20,355][26022] Updated weights on worker 0-0, policy_version 1151957 (0.00081) [2022-07-11 10:26:21,087][25689] Fps is (10 sec: 5641.2, 60 sec: 5514.6, 300 sec: 5536.7). Total num frames: 1179607040. Throughput: 0: 5673.6. Samples: 1179613984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:21,088][25689] Avg episode reward: [(0, '0.093')] [2022-07-11 10:26:22,195][26022] Updated weights on worker 0-0, policy_version 1151967 (0.00079) [2022-07-11 10:26:24,046][26022] Updated weights on worker 0-0, policy_version 1151977 (0.00083) [2022-07-11 10:26:26,024][26022] Updated weights on worker 0-0, policy_version 1151987 (0.00095) [2022-07-11 10:26:26,117][25689] Fps is (10 sec: 5593.3, 60 sec: 5529.6, 300 sec: 5536.4). Total num frames: 1179634688. Throughput: 0: 4957.6. Samples: 1179630722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:26,118][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 10:26:27,820][26022] Updated weights on worker 0-0, policy_version 1151997 (0.00085) [2022-07-11 10:26:29,678][26022] Updated weights on worker 0-0, policy_version 1152007 (0.00089) [2022-07-11 10:26:31,147][25689] Fps is (10 sec: 5598.5, 60 sec: 5544.2, 300 sec: 5538.3). Total num frames: 1179663360. Throughput: 0: 5786.6. Samples: 1179664308. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:31,147][25689] Avg episode reward: [(0, '1.444')] [2022-07-11 10:26:31,456][26022] Updated weights on worker 0-0, policy_version 1152017 (0.00088) [2022-07-11 10:26:33,207][26022] Updated weights on worker 0-0, policy_version 1152027 (0.00078) [2022-07-11 10:26:35,096][26022] Updated weights on worker 0-0, policy_version 1152037 (0.00091) [2022-07-11 10:26:36,205][25689] Fps is (10 sec: 5684.4, 60 sec: 5545.2, 300 sec: 5541.3). Total num frames: 1179692032. Throughput: 0: 5819.4. Samples: 1179697998. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:36,207][25689] Avg episode reward: [(0, '1.593')] [2022-07-11 10:26:36,823][26022] Updated weights on worker 0-0, policy_version 1152047 (0.00102) [2022-07-11 10:26:38,823][26022] Updated weights on worker 0-0, policy_version 1152057 (0.00082) [2022-07-11 10:26:40,606][26022] Updated weights on worker 0-0, policy_version 1152067 (0.00095) [2022-07-11 10:26:41,223][25689] Fps is (10 sec: 5487.7, 60 sec: 5518.7, 300 sec: 5538.3). Total num frames: 1179718656. Throughput: 0: 5010.8. Samples: 1179714882. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:41,224][25689] Avg episode reward: [(0, '1.639')] [2022-07-11 10:26:42,463][26022] Updated weights on worker 0-0, policy_version 1152077 (0.00082) [2022-07-11 10:26:44,287][26022] Updated weights on worker 0-0, policy_version 1152087 (0.00083) [2022-07-11 10:26:46,052][26022] Updated weights on worker 0-0, policy_version 1152097 (0.00091) [2022-07-11 10:26:46,255][25689] Fps is (10 sec: 5706.1, 60 sec: 5569.3, 300 sec: 5541.4). Total num frames: 1179749376. Throughput: 0: 5846.6. Samples: 1179748458. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:46,255][25689] Avg episode reward: [(0, '1.582')] [2022-07-11 10:26:48,010][26022] Updated weights on worker 0-0, policy_version 1152107 (0.00097) [2022-07-11 10:26:49,735][26022] Updated weights on worker 0-0, policy_version 1152117 (0.00085) [2022-07-11 10:26:51,273][25689] Fps is (10 sec: 5706.1, 60 sec: 5543.1, 300 sec: 5538.8). Total num frames: 1179776000. Throughput: 0: 5831.1. Samples: 1179781664. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:51,274][25689] Avg episode reward: [(0, '1.385')] [2022-07-11 10:26:51,670][26022] Updated weights on worker 0-0, policy_version 1152127 (0.00086) [2022-07-11 10:26:53,642][26022] Updated weights on worker 0-0, policy_version 1152137 (0.00058) [2022-07-11 10:26:55,106][26022] Updated weights on worker 0-0, policy_version 1152147 (0.00089) [2022-07-11 10:26:56,321][25689] Fps is (10 sec: 5493.1, 60 sec: 5529.7, 300 sec: 5538.4). Total num frames: 1179804672. Throughput: 0: 4994.1. Samples: 1179798460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:26:56,323][25689] Avg episode reward: [(0, '1.758')] [2022-07-11 10:26:57,192][26022] Updated weights on worker 0-0, policy_version 1152157 (0.00083) [2022-07-11 10:26:58,831][26022] Updated weights on worker 0-0, policy_version 1152167 (0.00088) [2022-07-11 10:27:00,639][26022] Updated weights on worker 0-0, policy_version 1152177 (0.00091) [2022-07-11 10:27:01,350][25689] Fps is (10 sec: 5588.6, 60 sec: 5552.3, 300 sec: 5548.6). Total num frames: 1179832320. Throughput: 0: 5849.0. Samples: 1179832606. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:01,351][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 10:27:02,835][26022] Updated weights on worker 0-0, policy_version 1152187 (0.00087) [2022-07-11 10:27:04,735][26022] Updated weights on worker 0-0, policy_version 1152197 (0.00084) [2022-07-11 10:27:06,359][25689] Fps is (10 sec: 5304.9, 60 sec: 5537.2, 300 sec: 5542.0). Total num frames: 1179857920. Throughput: 0: 5753.2. Samples: 1179864120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:06,359][25689] Avg episode reward: [(0, '-0.717')] [2022-07-11 10:27:06,602][26022] Updated weights on worker 0-0, policy_version 1152207 (0.00086) [2022-07-11 10:27:08,587][26022] Updated weights on worker 0-0, policy_version 1152217 (0.00084) [2022-07-11 10:27:10,059][26022] Updated weights on worker 0-0, policy_version 1152227 (0.00084) [2022-07-11 10:27:11,372][25689] Fps is (10 sec: 5415.3, 60 sec: 5573.1, 300 sec: 5543.0). Total num frames: 1179886592. Throughput: 0: 4937.8. Samples: 1179880912. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:11,373][25689] Avg episode reward: [(0, '-1.377')] [2022-07-11 10:27:12,041][26022] Updated weights on worker 0-0, policy_version 1152237 (0.00090) [2022-07-11 10:27:13,900][26022] Updated weights on worker 0-0, policy_version 1152247 (0.00085) [2022-07-11 10:27:15,753][26022] Updated weights on worker 0-0, policy_version 1152257 (0.00086) [2022-07-11 10:27:16,460][25689] Fps is (10 sec: 5575.6, 60 sec: 5559.7, 300 sec: 5538.2). Total num frames: 1179914240. Throughput: 0: 5763.1. Samples: 1179914522. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:16,460][25689] Avg episode reward: [(0, '-1.848')] [2022-07-11 10:27:17,632][26022] Updated weights on worker 0-0, policy_version 1152267 (0.00082) [2022-07-11 10:27:19,482][26022] Updated weights on worker 0-0, policy_version 1152277 (0.00095) [2022-07-11 10:27:21,143][26022] Updated weights on worker 0-0, policy_version 1152287 (0.00090) [2022-07-11 10:27:21,484][25689] Fps is (10 sec: 5570.0, 60 sec: 5561.1, 300 sec: 5544.8). Total num frames: 1179942912. Throughput: 0: 5715.0. Samples: 1179947668. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:21,484][25689] Avg episode reward: [(0, '-2.864')] [2022-07-11 10:27:23,340][26022] Updated weights on worker 0-0, policy_version 1152297 (0.00092) [2022-07-11 10:27:24,789][26022] Updated weights on worker 0-0, policy_version 1152307 (0.00088) [2022-07-11 10:27:26,524][25689] Fps is (10 sec: 5494.6, 60 sec: 5543.3, 300 sec: 5537.5). Total num frames: 1179969536. Throughput: 0: 4983.0. Samples: 1179964602. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:26,524][25689] Avg episode reward: [(0, '-3.063')] [2022-07-11 10:27:26,935][26022] Updated weights on worker 0-0, policy_version 1152317 (0.00093) [2022-07-11 10:27:28,939][26022] Updated weights on worker 0-0, policy_version 1152327 (0.00094) [2022-07-11 10:27:30,301][26022] Updated weights on worker 0-0, policy_version 1152337 (0.00090) [2022-07-11 10:27:31,539][25689] Fps is (10 sec: 5499.3, 60 sec: 5544.6, 300 sec: 5538.1). Total num frames: 1179998208. Throughput: 0: 5814.8. Samples: 1179998176. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:31,539][25689] Avg episode reward: [(0, '-3.048')] [2022-07-11 10:27:32,436][26022] Updated weights on worker 0-0, policy_version 1152347 (0.00086) [2022-07-11 10:27:33,994][26022] Updated weights on worker 0-0, policy_version 1152357 (0.00086) [2022-07-11 10:27:36,106][26022] Updated weights on worker 0-0, policy_version 1152367 (0.00088) [2022-07-11 10:27:36,611][25689] Fps is (10 sec: 5785.9, 60 sec: 5560.2, 300 sec: 5544.2). Total num frames: 1180027904. Throughput: 0: 5815.5. Samples: 1180031714. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:36,612][25689] Avg episode reward: [(0, '-3.097')] [2022-07-11 10:27:37,861][26022] Updated weights on worker 0-0, policy_version 1152377 (0.00081) [2022-07-11 10:27:39,612][26022] Updated weights on worker 0-0, policy_version 1152387 (0.00093) [2022-07-11 10:27:41,625][25689] Fps is (10 sec: 5482.2, 60 sec: 5543.6, 300 sec: 5530.5). Total num frames: 1180053504. Throughput: 0: 5004.3. Samples: 1180048462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:41,626][25689] Avg episode reward: [(0, '-2.476')] [2022-07-11 10:27:41,676][26022] Updated weights on worker 0-0, policy_version 1152397 (0.00082) [2022-07-11 10:27:43,357][26022] Updated weights on worker 0-0, policy_version 1152407 (0.00087) [2022-07-11 10:27:45,233][26022] Updated weights on worker 0-0, policy_version 1152417 (0.00082) [2022-07-11 10:27:46,628][25689] Fps is (10 sec: 5520.3, 60 sec: 5529.3, 300 sec: 5544.2). Total num frames: 1180083200. Throughput: 0: 5848.3. Samples: 1180082180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:46,629][25689] Avg episode reward: [(0, '-2.446')] [2022-07-11 10:27:46,935][26022] Updated weights on worker 0-0, policy_version 1152427 (0.00087) [2022-07-11 10:27:48,736][26022] Updated weights on worker 0-0, policy_version 1152437 (0.00081) [2022-07-11 10:27:50,635][26022] Updated weights on worker 0-0, policy_version 1152447 (0.00094) [2022-07-11 10:27:51,643][25689] Fps is (10 sec: 5724.5, 60 sec: 5546.6, 300 sec: 5538.2). Total num frames: 1180110848. Throughput: 0: 5836.0. Samples: 1180115502. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:51,643][25689] Avg episode reward: [(0, '-1.583')] [2022-07-11 10:27:52,511][26022] Updated weights on worker 0-0, policy_version 1152457 (0.00089) [2022-07-11 10:27:54,220][26022] Updated weights on worker 0-0, policy_version 1152467 (0.00095) [2022-07-11 10:27:56,327][26022] Updated weights on worker 0-0, policy_version 1152477 (0.00080) [2022-07-11 10:27:56,749][25689] Fps is (10 sec: 5463.7, 60 sec: 5524.3, 300 sec: 5536.3). Total num frames: 1180138496. Throughput: 0: 4994.9. Samples: 1180132298. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:27:56,751][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 10:27:57,794][26022] Updated weights on worker 0-0, policy_version 1152487 (0.00369) [2022-07-11 10:27:58,016][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:27:58,054][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001152488_1180147712.pth [2022-07-11 10:27:58,054][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001150536_1178148864.pth [2022-07-11 10:27:59,916][26022] Updated weights on worker 0-0, policy_version 1152497 (0.00084) [2022-07-11 10:28:01,759][25689] Fps is (10 sec: 5466.0, 60 sec: 5526.1, 300 sec: 5543.7). Total num frames: 1180166144. Throughput: 0: 5830.5. Samples: 1180165852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:01,759][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 10:28:01,899][26022] Updated weights on worker 0-0, policy_version 1152507 (0.00096) [2022-07-11 10:28:03,775][26022] Updated weights on worker 0-0, policy_version 1152517 (0.00083) [2022-07-11 10:28:05,573][26022] Updated weights on worker 0-0, policy_version 1152527 (0.00083) [2022-07-11 10:28:06,766][25689] Fps is (10 sec: 5622.3, 60 sec: 5577.1, 300 sec: 5547.0). Total num frames: 1180194816. Throughput: 0: 5735.4. Samples: 1180197680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:06,768][25689] Avg episode reward: [(0, '0.176')] [2022-07-11 10:28:07,550][26022] Updated weights on worker 0-0, policy_version 1152537 (0.00089) [2022-07-11 10:28:09,163][26022] Updated weights on worker 0-0, policy_version 1152547 (0.00084) [2022-07-11 10:28:11,371][26022] Updated weights on worker 0-0, policy_version 1152557 (0.01251) [2022-07-11 10:28:11,779][25689] Fps is (10 sec: 5518.5, 60 sec: 5543.2, 300 sec: 5545.4). Total num frames: 1180221440. Throughput: 0: 4921.8. Samples: 1180214610. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:11,780][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 10:28:12,813][26022] Updated weights on worker 0-0, policy_version 1152567 (0.00573) [2022-07-11 10:28:14,809][26022] Updated weights on worker 0-0, policy_version 1152577 (0.00088) [2022-07-11 10:28:16,637][26022] Updated weights on worker 0-0, policy_version 1152587 (0.00088) [2022-07-11 10:28:16,875][25689] Fps is (10 sec: 5470.1, 60 sec: 5559.4, 300 sec: 5547.5). Total num frames: 1180250112. Throughput: 0: 5745.8. Samples: 1180247938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:16,875][25689] Avg episode reward: [(0, '1.297')] [2022-07-11 10:28:18,442][26022] Updated weights on worker 0-0, policy_version 1152597 (0.00085) [2022-07-11 10:28:20,425][26022] Updated weights on worker 0-0, policy_version 1152607 (0.00086) [2022-07-11 10:28:21,902][25689] Fps is (10 sec: 5664.9, 60 sec: 5559.1, 300 sec: 5543.7). Total num frames: 1180278784. Throughput: 0: 5738.8. Samples: 1180281448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:21,902][25689] Avg episode reward: [(0, '1.533')] [2022-07-11 10:28:22,000][26022] Updated weights on worker 0-0, policy_version 1152617 (0.00091) [2022-07-11 10:28:24,105][26022] Updated weights on worker 0-0, policy_version 1152627 (0.00094) [2022-07-11 10:28:26,115][26022] Updated weights on worker 0-0, policy_version 1152637 (0.00085) [2022-07-11 10:28:26,947][25689] Fps is (10 sec: 5490.2, 60 sec: 5558.7, 300 sec: 5543.3). Total num frames: 1180305408. Throughput: 0: 5792.8. Samples: 1180314582. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:26,947][25689] Avg episode reward: [(0, '0.412')] [2022-07-11 10:28:27,612][26022] Updated weights on worker 0-0, policy_version 1152647 (0.00093) [2022-07-11 10:28:29,680][26022] Updated weights on worker 0-0, policy_version 1152657 (0.00084) [2022-07-11 10:28:31,529][26022] Updated weights on worker 0-0, policy_version 1152667 (0.00086) [2022-07-11 10:28:31,981][25689] Fps is (10 sec: 5282.8, 60 sec: 5523.0, 300 sec: 5540.2). Total num frames: 1180332032. Throughput: 0: 5766.1. Samples: 1180331098. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:31,982][25689] Avg episode reward: [(0, '0.406')] [2022-07-11 10:28:33,242][26022] Updated weights on worker 0-0, policy_version 1152677 (0.00087) [2022-07-11 10:28:35,467][26022] Updated weights on worker 0-0, policy_version 1152687 (0.00089) [2022-07-11 10:28:36,748][26022] Updated weights on worker 0-0, policy_version 1152697 (0.00089) [2022-07-11 10:28:37,058][25689] Fps is (10 sec: 5671.1, 60 sec: 5539.5, 300 sec: 5543.9). Total num frames: 1180362752. Throughput: 0: 5786.3. Samples: 1180364724. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 10:28:37,059][25689] Avg episode reward: [(0, '0.167')] [2022-07-11 10:28:38,950][26022] Updated weights on worker 0-0, policy_version 1152707 (0.00094) [2022-07-11 10:28:40,760][26022] Updated weights on worker 0-0, policy_version 1152717 (0.00094) [2022-07-11 10:28:42,105][25689] Fps is (10 sec: 5664.4, 60 sec: 5553.5, 300 sec: 5539.9). Total num frames: 1180389376. Throughput: 0: 5774.2. Samples: 1180398104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:28:42,105][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 10:28:42,523][26022] Updated weights on worker 0-0, policy_version 1152727 (0.00095) [2022-07-11 10:28:44,399][26022] Updated weights on worker 0-0, policy_version 1152737 (0.00092) [2022-07-11 10:28:46,019][26022] Updated weights on worker 0-0, policy_version 1152747 (0.00085) [2022-07-11 10:28:47,119][25689] Fps is (10 sec: 5292.5, 60 sec: 5501.6, 300 sec: 5537.5). Total num frames: 1180416000. Throughput: 0: 4976.0. Samples: 1180414960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:28:47,120][25689] Avg episode reward: [(0, '0.364')] [2022-07-11 10:28:47,919][26022] Updated weights on worker 0-0, policy_version 1152757 (0.00082) [2022-07-11 10:28:50,034][26022] Updated weights on worker 0-0, policy_version 1152767 (0.00099) [2022-07-11 10:28:51,543][26022] Updated weights on worker 0-0, policy_version 1152777 (0.00091) [2022-07-11 10:28:52,131][25689] Fps is (10 sec: 5617.4, 60 sec: 5535.8, 300 sec: 5538.3). Total num frames: 1180445696. Throughput: 0: 5815.8. Samples: 1180448284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:28:52,131][25689] Avg episode reward: [(0, '1.319')] [2022-07-11 10:28:53,795][26022] Updated weights on worker 0-0, policy_version 1152787 (0.00084) [2022-07-11 10:28:55,209][26022] Updated weights on worker 0-0, policy_version 1152797 (0.00090) [2022-07-11 10:28:57,191][25689] Fps is (10 sec: 5693.8, 60 sec: 5540.0, 300 sec: 5541.6). Total num frames: 1180473344. Throughput: 0: 5803.0. Samples: 1180481552. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:28:57,191][25689] Avg episode reward: [(0, '1.026')] [2022-07-11 10:28:57,241][26022] Updated weights on worker 0-0, policy_version 1152807 (0.00087) [2022-07-11 10:28:59,137][26022] Updated weights on worker 0-0, policy_version 1152817 (0.00082) [2022-07-11 10:29:00,602][26022] Updated weights on worker 0-0, policy_version 1152827 (0.00087) [2022-07-11 10:29:02,210][25689] Fps is (10 sec: 5384.7, 60 sec: 5522.2, 300 sec: 5542.6). Total num frames: 1180499968. Throughput: 0: 4991.4. Samples: 1180498456. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:02,215][25689] Avg episode reward: [(0, '0.538')] [2022-07-11 10:29:03,287][26022] Updated weights on worker 0-0, policy_version 1152837 (0.00082) [2022-07-11 10:29:04,617][26022] Updated weights on worker 0-0, policy_version 1152847 (0.00091) [2022-07-11 10:29:06,832][26022] Updated weights on worker 0-0, policy_version 1152857 (0.00089) [2022-07-11 10:29:07,231][25689] Fps is (10 sec: 5507.4, 60 sec: 5521.0, 300 sec: 5543.3). Total num frames: 1180528640. Throughput: 0: 5706.8. Samples: 1180529734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:07,231][25689] Avg episode reward: [(0, '0.996')] [2022-07-11 10:29:08,720][26022] Updated weights on worker 0-0, policy_version 1152867 (0.00095) [2022-07-11 10:29:10,229][26022] Updated weights on worker 0-0, policy_version 1152877 (0.00090) [2022-07-11 10:29:12,249][25689] Fps is (10 sec: 5406.2, 60 sec: 5503.6, 300 sec: 5541.2). Total num frames: 1180554240. Throughput: 0: 5710.3. Samples: 1180563162. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:12,250][25689] Avg episode reward: [(0, '0.798')] [2022-07-11 10:29:12,407][26022] Updated weights on worker 0-0, policy_version 1152887 (0.00087) [2022-07-11 10:29:14,265][26022] Updated weights on worker 0-0, policy_version 1152897 (0.00084) [2022-07-11 10:29:15,912][26022] Updated weights on worker 0-0, policy_version 1152907 (0.00084) [2022-07-11 10:29:17,336][25689] Fps is (10 sec: 5471.9, 60 sec: 5521.3, 300 sec: 5540.0). Total num frames: 1180583936. Throughput: 0: 4886.6. Samples: 1180579996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:17,338][25689] Avg episode reward: [(0, '-0.205')] [2022-07-11 10:29:17,847][26022] Updated weights on worker 0-0, policy_version 1152917 (0.00088) [2022-07-11 10:29:19,377][26022] Updated weights on worker 0-0, policy_version 1152927 (0.00093) [2022-07-11 10:29:21,545][26022] Updated weights on worker 0-0, policy_version 1152937 (0.00085) [2022-07-11 10:29:22,434][25689] Fps is (10 sec: 5730.5, 60 sec: 5514.8, 300 sec: 5545.5). Total num frames: 1180612608. Throughput: 0: 5682.7. Samples: 1180613384. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:22,435][25689] Avg episode reward: [(0, '-0.336')] [2022-07-11 10:29:23,311][26022] Updated weights on worker 0-0, policy_version 1152947 (0.00082) [2022-07-11 10:29:25,263][26022] Updated weights on worker 0-0, policy_version 1152957 (0.00088) [2022-07-11 10:29:27,143][26022] Updated weights on worker 0-0, policy_version 1152967 (0.00088) [2022-07-11 10:29:27,459][25689] Fps is (10 sec: 5462.7, 60 sec: 5516.6, 300 sec: 5538.4). Total num frames: 1180639232. Throughput: 0: 5760.3. Samples: 1180646252. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:27,460][25689] Avg episode reward: [(0, '1.032')] [2022-07-11 10:29:28,914][26022] Updated weights on worker 0-0, policy_version 1152977 (0.00091) [2022-07-11 10:29:30,819][26022] Updated weights on worker 0-0, policy_version 1152987 (0.00085) [2022-07-11 10:29:32,547][25689] Fps is (10 sec: 5366.5, 60 sec: 5528.6, 300 sec: 5535.2). Total num frames: 1180666880. Throughput: 0: 4910.3. Samples: 1180662834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:32,549][25689] Avg episode reward: [(0, '0.103')] [2022-07-11 10:29:32,876][26022] Updated weights on worker 0-0, policy_version 1152997 (0.00084) [2022-07-11 10:29:34,511][26022] Updated weights on worker 0-0, policy_version 1153007 (0.00086) [2022-07-11 10:29:36,461][26022] Updated weights on worker 0-0, policy_version 1153017 (0.00086) [2022-07-11 10:29:37,634][25689] Fps is (10 sec: 5635.5, 60 sec: 5510.8, 300 sec: 5541.4). Total num frames: 1180696576. Throughput: 0: 5719.2. Samples: 1180696084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:37,635][25689] Avg episode reward: [(0, '-0.709')] [2022-07-11 10:29:38,292][26022] Updated weights on worker 0-0, policy_version 1153027 (0.00084) [2022-07-11 10:29:40,020][26022] Updated weights on worker 0-0, policy_version 1153037 (0.00092) [2022-07-11 10:29:42,176][26022] Updated weights on worker 0-0, policy_version 1153047 (0.00089) [2022-07-11 10:29:42,650][25689] Fps is (10 sec: 5574.7, 60 sec: 5513.6, 300 sec: 5535.0). Total num frames: 1180723200. Throughput: 0: 5745.7. Samples: 1180729540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:42,651][25689] Avg episode reward: [(0, '-0.646')] [2022-07-11 10:29:43,865][26022] Updated weights on worker 0-0, policy_version 1153057 (0.00086) [2022-07-11 10:29:45,540][26022] Updated weights on worker 0-0, policy_version 1153067 (0.00086) [2022-07-11 10:29:47,620][26022] Updated weights on worker 0-0, policy_version 1153077 (0.00087) [2022-07-11 10:29:47,667][25689] Fps is (10 sec: 5409.8, 60 sec: 5530.4, 300 sec: 5534.7). Total num frames: 1180750848. Throughput: 0: 4944.8. Samples: 1180746176. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:47,667][25689] Avg episode reward: [(0, '0.195')] [2022-07-11 10:29:49,183][26022] Updated weights on worker 0-0, policy_version 1153087 (0.00087) [2022-07-11 10:29:51,200][26022] Updated weights on worker 0-0, policy_version 1153097 (0.00087) [2022-07-11 10:29:52,670][25689] Fps is (10 sec: 5621.0, 60 sec: 5514.2, 300 sec: 5535.9). Total num frames: 1180779520. Throughput: 0: 5810.9. Samples: 1180779764. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:52,670][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 10:29:53,050][26022] Updated weights on worker 0-0, policy_version 1153107 (0.00090) [2022-07-11 10:29:54,734][26022] Updated weights on worker 0-0, policy_version 1153117 (0.00084) [2022-07-11 10:29:56,756][26022] Updated weights on worker 0-0, policy_version 1153127 (0.00090) [2022-07-11 10:29:57,803][25689] Fps is (10 sec: 5556.4, 60 sec: 5507.6, 300 sec: 5533.5). Total num frames: 1180807168. Throughput: 0: 5806.6. Samples: 1180813194. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:29:57,803][25689] Avg episode reward: [(0, '-1.329')] [2022-07-11 10:29:58,349][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:29:58,359][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001153136_1180811264.pth [2022-07-11 10:29:58,360][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001151187_1178815488.pth [2022-07-11 10:29:58,442][26022] Updated weights on worker 0-0, policy_version 1153137 (0.00082) [2022-07-11 10:30:00,430][26022] Updated weights on worker 0-0, policy_version 1153147 (0.00092) [2022-07-11 10:30:02,454][26022] Updated weights on worker 0-0, policy_version 1153157 (0.00074) [2022-07-11 10:30:02,814][25689] Fps is (10 sec: 5349.8, 60 sec: 5508.2, 300 sec: 5534.2). Total num frames: 1180833792. Throughput: 0: 4986.6. Samples: 1180830090. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:02,815][25689] Avg episode reward: [(0, '-0.397')] [2022-07-11 10:30:04,488][26022] Updated weights on worker 0-0, policy_version 1153167 (0.00088) [2022-07-11 10:30:06,294][26022] Updated weights on worker 0-0, policy_version 1153177 (0.00080) [2022-07-11 10:30:07,832][25689] Fps is (10 sec: 5513.5, 60 sec: 5508.6, 300 sec: 5537.5). Total num frames: 1180862464. Throughput: 0: 5728.7. Samples: 1180861698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:07,832][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 10:30:07,915][26022] Updated weights on worker 0-0, policy_version 1153187 (0.00092) [2022-07-11 10:30:10,047][26022] Updated weights on worker 0-0, policy_version 1153197 (0.00093) [2022-07-11 10:30:11,723][26022] Updated weights on worker 0-0, policy_version 1153207 (0.00091) [2022-07-11 10:30:12,847][25689] Fps is (10 sec: 5613.7, 60 sec: 5542.6, 300 sec: 5538.4). Total num frames: 1180890112. Throughput: 0: 5704.2. Samples: 1180894860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:12,848][25689] Avg episode reward: [(0, '-0.425')] [2022-07-11 10:30:13,712][26022] Updated weights on worker 0-0, policy_version 1153217 (0.00075) [2022-07-11 10:30:15,438][26022] Updated weights on worker 0-0, policy_version 1153227 (0.00085) [2022-07-11 10:30:17,304][26022] Updated weights on worker 0-0, policy_version 1153237 (0.00089) [2022-07-11 10:30:17,918][25689] Fps is (10 sec: 5583.8, 60 sec: 5527.2, 300 sec: 5533.8). Total num frames: 1180918784. Throughput: 0: 4887.9. Samples: 1180911516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:17,919][25689] Avg episode reward: [(0, '0.542')] [2022-07-11 10:30:19,279][26022] Updated weights on worker 0-0, policy_version 1153247 (0.00087) [2022-07-11 10:30:20,825][26022] Updated weights on worker 0-0, policy_version 1153257 (0.00087) [2022-07-11 10:30:22,921][25689] Fps is (10 sec: 5387.2, 60 sec: 5485.1, 300 sec: 5530.5). Total num frames: 1180944384. Throughput: 0: 5714.9. Samples: 1180944998. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:22,922][25689] Avg episode reward: [(0, '1.586')] [2022-07-11 10:30:23,101][26022] Updated weights on worker 0-0, policy_version 1153267 (0.00092) [2022-07-11 10:30:24,498][26022] Updated weights on worker 0-0, policy_version 1153277 (0.00082) [2022-07-11 10:30:26,526][26022] Updated weights on worker 0-0, policy_version 1153287 (0.00086) [2022-07-11 10:30:27,928][25689] Fps is (10 sec: 5524.4, 60 sec: 5537.5, 300 sec: 5537.3). Total num frames: 1180974080. Throughput: 0: 5807.4. Samples: 1180978402. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:27,930][25689] Avg episode reward: [(0, '1.564')] [2022-07-11 10:30:28,345][26022] Updated weights on worker 0-0, policy_version 1153297 (0.00087) [2022-07-11 10:30:30,168][26022] Updated weights on worker 0-0, policy_version 1153307 (0.00094) [2022-07-11 10:30:32,115][26022] Updated weights on worker 0-0, policy_version 1153317 (0.00086) [2022-07-11 10:30:32,932][25689] Fps is (10 sec: 5728.0, 60 sec: 5545.2, 300 sec: 5535.1). Total num frames: 1181001728. Throughput: 0: 4998.5. Samples: 1180995258. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:32,933][25689] Avg episode reward: [(0, '1.352')] [2022-07-11 10:30:33,904][26022] Updated weights on worker 0-0, policy_version 1153327 (0.00090) [2022-07-11 10:30:35,644][26022] Updated weights on worker 0-0, policy_version 1153337 (0.00085) [2022-07-11 10:30:37,757][26022] Updated weights on worker 0-0, policy_version 1153347 (0.00083) [2022-07-11 10:30:38,001][25689] Fps is (10 sec: 5489.3, 60 sec: 5513.0, 300 sec: 5532.2). Total num frames: 1181029376. Throughput: 0: 5808.4. Samples: 1181028164. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:38,002][25689] Avg episode reward: [(0, '1.506')] [2022-07-11 10:30:39,469][26022] Updated weights on worker 0-0, policy_version 1153357 (0.00082) [2022-07-11 10:30:41,375][26022] Updated weights on worker 0-0, policy_version 1153367 (0.00081) [2022-07-11 10:30:43,012][25689] Fps is (10 sec: 5485.8, 60 sec: 5530.4, 300 sec: 5532.6). Total num frames: 1181057024. Throughput: 0: 5799.7. Samples: 1181061520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:43,014][25689] Avg episode reward: [(0, '2.033')] [2022-07-11 10:30:43,120][26022] Updated weights on worker 0-0, policy_version 1153377 (0.00089) [2022-07-11 10:30:44,990][26022] Updated weights on worker 0-0, policy_version 1153387 (0.00087) [2022-07-11 10:30:46,920][26022] Updated weights on worker 0-0, policy_version 1153397 (0.00084) [2022-07-11 10:30:48,023][25689] Fps is (10 sec: 5415.5, 60 sec: 5513.9, 300 sec: 5527.4). Total num frames: 1181083648. Throughput: 0: 4977.8. Samples: 1181078430. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:48,024][25689] Avg episode reward: [(0, '1.945')] [2022-07-11 10:30:48,634][26022] Updated weights on worker 0-0, policy_version 1153407 (0.00092) [2022-07-11 10:30:50,535][26022] Updated weights on worker 0-0, policy_version 1153417 (0.00091) [2022-07-11 10:30:52,380][26022] Updated weights on worker 0-0, policy_version 1153427 (0.00093) [2022-07-11 10:30:53,044][25689] Fps is (10 sec: 5512.2, 60 sec: 5512.3, 300 sec: 5525.1). Total num frames: 1181112320. Throughput: 0: 5793.4. Samples: 1181111772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:53,045][25689] Avg episode reward: [(0, '1.849')] [2022-07-11 10:30:54,197][26022] Updated weights on worker 0-0, policy_version 1153437 (0.00096) [2022-07-11 10:30:56,023][26022] Updated weights on worker 0-0, policy_version 1153447 (0.00092) [2022-07-11 10:30:58,015][26022] Updated weights on worker 0-0, policy_version 1153457 (0.00094) [2022-07-11 10:30:58,151][25689] Fps is (10 sec: 5661.9, 60 sec: 5531.6, 300 sec: 5531.7). Total num frames: 1181140992. Throughput: 0: 5783.5. Samples: 1181144700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:30:58,151][25689] Avg episode reward: [(0, '1.068')] [2022-07-11 10:30:59,695][26022] Updated weights on worker 0-0, policy_version 1153467 (0.00086) [2022-07-11 10:31:01,493][26022] Updated weights on worker 0-0, policy_version 1153477 (0.00091) [2022-07-11 10:31:03,235][25689] Fps is (10 sec: 5325.4, 60 sec: 5508.1, 300 sec: 5527.3). Total num frames: 1181166592. Throughput: 0: 5664.9. Samples: 1181176080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:03,235][25689] Avg episode reward: [(0, '1.039')] [2022-07-11 10:31:03,864][26022] Updated weights on worker 0-0, policy_version 1153487 (0.00093) [2022-07-11 10:31:05,603][26022] Updated weights on worker 0-0, policy_version 1153497 (0.00089) [2022-07-11 10:31:07,460][26022] Updated weights on worker 0-0, policy_version 1153507 (0.00087) [2022-07-11 10:31:08,251][25689] Fps is (10 sec: 5373.4, 60 sec: 5508.2, 300 sec: 5534.5). Total num frames: 1181195264. Throughput: 0: 5664.6. Samples: 1181193014. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:08,251][25689] Avg episode reward: [(0, '1.228')] [2022-07-11 10:31:09,347][26022] Updated weights on worker 0-0, policy_version 1153517 (0.00093) [2022-07-11 10:31:11,347][26022] Updated weights on worker 0-0, policy_version 1153527 (0.00084) [2022-07-11 10:31:13,002][26022] Updated weights on worker 0-0, policy_version 1153537 (0.00083) [2022-07-11 10:31:13,257][25689] Fps is (10 sec: 5619.8, 60 sec: 5509.0, 300 sec: 5533.3). Total num frames: 1181222912. Throughput: 0: 5661.9. Samples: 1181226216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:13,257][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 10:31:14,863][26022] Updated weights on worker 0-0, policy_version 1153547 (0.00082) [2022-07-11 10:31:16,638][26022] Updated weights on worker 0-0, policy_version 1153557 (0.00089) [2022-07-11 10:31:18,322][25689] Fps is (10 sec: 5592.0, 60 sec: 5509.6, 300 sec: 5532.8). Total num frames: 1181251584. Throughput: 0: 5710.6. Samples: 1181259892. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:18,323][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 10:31:18,530][26022] Updated weights on worker 0-0, policy_version 1153567 (0.00079) [2022-07-11 10:31:20,491][26022] Updated weights on worker 0-0, policy_version 1153577 (0.00086) [2022-07-11 10:31:22,149][26022] Updated weights on worker 0-0, policy_version 1153587 (0.00095) [2022-07-11 10:31:23,418][25689] Fps is (10 sec: 5542.9, 60 sec: 5535.0, 300 sec: 5531.6). Total num frames: 1181279232. Throughput: 0: 4992.1. Samples: 1181276834. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:23,418][25689] Avg episode reward: [(0, '-0.367')] [2022-07-11 10:31:24,168][26022] Updated weights on worker 0-0, policy_version 1153597 (0.00092) [2022-07-11 10:31:25,629][26022] Updated weights on worker 0-0, policy_version 1153607 (0.00092) [2022-07-11 10:31:27,920][26022] Updated weights on worker 0-0, policy_version 1153617 (0.00090) [2022-07-11 10:31:28,505][25689] Fps is (10 sec: 5329.7, 60 sec: 5476.9, 300 sec: 5523.6). Total num frames: 1181305856. Throughput: 0: 5766.6. Samples: 1181309814. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:28,506][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 10:31:29,534][26022] Updated weights on worker 0-0, policy_version 1153627 (0.00081) [2022-07-11 10:31:31,578][26022] Updated weights on worker 0-0, policy_version 1153637 (0.00076) [2022-07-11 10:31:33,245][26022] Updated weights on worker 0-0, policy_version 1153647 (0.00085) [2022-07-11 10:31:33,554][25689] Fps is (10 sec: 5455.1, 60 sec: 5489.8, 300 sec: 5523.8). Total num frames: 1181334528. Throughput: 0: 5750.4. Samples: 1181342934. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:33,554][25689] Avg episode reward: [(0, '-0.974')] [2022-07-11 10:31:35,247][26022] Updated weights on worker 0-0, policy_version 1153657 (0.00094) [2022-07-11 10:31:37,027][26022] Updated weights on worker 0-0, policy_version 1153667 (0.00090) [2022-07-11 10:31:38,676][25689] Fps is (10 sec: 5638.4, 60 sec: 5501.9, 300 sec: 5528.8). Total num frames: 1181363200. Throughput: 0: 4881.2. Samples: 1181359240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:38,676][25689] Avg episode reward: [(0, '-1.032')] [2022-07-11 10:31:39,144][26022] Updated weights on worker 0-0, policy_version 1153677 (0.00092) [2022-07-11 10:31:40,692][26022] Updated weights on worker 0-0, policy_version 1153687 (0.00085) [2022-07-11 10:31:42,756][26022] Updated weights on worker 0-0, policy_version 1153697 (0.00089) [2022-07-11 10:31:43,706][25689] Fps is (10 sec: 5648.4, 60 sec: 5517.0, 300 sec: 5521.9). Total num frames: 1181391872. Throughput: 0: 5707.9. Samples: 1181392640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:43,707][25689] Avg episode reward: [(0, '-1.190')] [2022-07-11 10:31:44,490][26022] Updated weights on worker 0-0, policy_version 1153707 (0.00085) [2022-07-11 10:31:46,429][26022] Updated weights on worker 0-0, policy_version 1153718 (0.00085) [2022-07-11 10:31:48,109][26022] Updated weights on worker 0-0, policy_version 1153728 (0.00093) [2022-07-11 10:31:48,712][25689] Fps is (10 sec: 5509.6, 60 sec: 5517.4, 300 sec: 5522.1). Total num frames: 1181418496. Throughput: 0: 5759.8. Samples: 1181426200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:48,713][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 10:31:50,023][26022] Updated weights on worker 0-0, policy_version 1153738 (0.00080) [2022-07-11 10:31:51,903][26022] Updated weights on worker 0-0, policy_version 1153748 (0.00362) [2022-07-11 10:31:53,721][25689] Fps is (10 sec: 5623.7, 60 sec: 5535.4, 300 sec: 5526.3). Total num frames: 1181448192. Throughput: 0: 4965.8. Samples: 1181443080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:53,721][25689] Avg episode reward: [(0, '0.501')] [2022-07-11 10:31:53,726][26022] Updated weights on worker 0-0, policy_version 1153758 (0.00083) [2022-07-11 10:31:55,609][26022] Updated weights on worker 0-0, policy_version 1153768 (0.00091) [2022-07-11 10:31:57,263][26022] Updated weights on worker 0-0, policy_version 1153778 (0.00098) [2022-07-11 10:31:58,427][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:31:58,439][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001153782_1181472768.pth [2022-07-11 10:31:58,440][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001151837_1179481088.pth [2022-07-11 10:31:58,782][25689] Fps is (10 sec: 5592.9, 60 sec: 5505.8, 300 sec: 5522.3). Total num frames: 1181474816. Throughput: 0: 5826.3. Samples: 1181476384. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:31:58,784][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 10:31:59,353][26022] Updated weights on worker 0-0, policy_version 1153788 (0.00082) [2022-07-11 10:32:01,210][26022] Updated weights on worker 0-0, policy_version 1153798 (0.00089) [2022-07-11 10:32:03,420][26022] Updated weights on worker 0-0, policy_version 1153808 (0.00084) [2022-07-11 10:32:03,806][25689] Fps is (10 sec: 5381.4, 60 sec: 5545.1, 300 sec: 5528.9). Total num frames: 1181502464. Throughput: 0: 5731.6. Samples: 1181507844. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:03,807][25689] Avg episode reward: [(0, '1.194')] [2022-07-11 10:32:05,421][26022] Updated weights on worker 0-0, policy_version 1153818 (0.00091) [2022-07-11 10:32:06,835][26022] Updated weights on worker 0-0, policy_version 1153828 (0.00569) [2022-07-11 10:32:08,823][25689] Fps is (10 sec: 5405.1, 60 sec: 5511.2, 300 sec: 5521.9). Total num frames: 1181529088. Throughput: 0: 4901.0. Samples: 1181524762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:08,823][25689] Avg episode reward: [(0, '0.143')] [2022-07-11 10:32:09,083][26022] Updated weights on worker 0-0, policy_version 1153838 (0.00086) [2022-07-11 10:32:10,643][26022] Updated weights on worker 0-0, policy_version 1153848 (0.00087) [2022-07-11 10:32:12,685][26022] Updated weights on worker 0-0, policy_version 1153858 (0.00881) [2022-07-11 10:32:13,844][25689] Fps is (10 sec: 5508.6, 60 sec: 5526.7, 300 sec: 5526.6). Total num frames: 1181557760. Throughput: 0: 5724.8. Samples: 1181558282. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:13,845][25689] Avg episode reward: [(0, '-1.103')] [2022-07-11 10:32:14,280][26022] Updated weights on worker 0-0, policy_version 1153868 (0.00091) [2022-07-11 10:32:16,138][26022] Updated weights on worker 0-0, policy_version 1153878 (0.00088) [2022-07-11 10:32:18,223][26022] Updated weights on worker 0-0, policy_version 1153888 (0.00080) [2022-07-11 10:32:18,913][25689] Fps is (10 sec: 5581.5, 60 sec: 5509.5, 300 sec: 5522.3). Total num frames: 1181585408. Throughput: 0: 5726.9. Samples: 1181591674. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:18,914][25689] Avg episode reward: [(0, '-1.333')] [2022-07-11 10:32:19,991][26022] Updated weights on worker 0-0, policy_version 1153898 (0.00097) [2022-07-11 10:32:21,927][26022] Updated weights on worker 0-0, policy_version 1153908 (0.00097) [2022-07-11 10:32:23,741][26022] Updated weights on worker 0-0, policy_version 1153918 (0.00103) [2022-07-11 10:32:23,951][25689] Fps is (10 sec: 5471.3, 60 sec: 5514.7, 300 sec: 5525.8). Total num frames: 1181613056. Throughput: 0: 4994.3. Samples: 1181608452. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:23,951][25689] Avg episode reward: [(0, '-2.993')] [2022-07-11 10:32:25,409][26022] Updated weights on worker 0-0, policy_version 1153928 (0.00091) [2022-07-11 10:32:27,476][26022] Updated weights on worker 0-0, policy_version 1153938 (0.00087) [2022-07-11 10:32:28,974][25689] Fps is (10 sec: 5496.4, 60 sec: 5537.6, 300 sec: 5522.2). Total num frames: 1181640704. Throughput: 0: 5787.8. Samples: 1181641390. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:28,974][25689] Avg episode reward: [(0, '-3.780')] [2022-07-11 10:32:29,164][26022] Updated weights on worker 0-0, policy_version 1153948 (0.00082) [2022-07-11 10:32:31,031][26022] Updated weights on worker 0-0, policy_version 1153958 (0.00096) [2022-07-11 10:32:32,891][26022] Updated weights on worker 0-0, policy_version 1153968 (0.00082) [2022-07-11 10:32:33,979][25689] Fps is (10 sec: 5616.3, 60 sec: 5541.6, 300 sec: 5520.0). Total num frames: 1181669376. Throughput: 0: 5796.7. Samples: 1181674996. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 10:32:33,979][25689] Avg episode reward: [(0, '-2.114')] [2022-07-11 10:32:34,603][26022] Updated weights on worker 0-0, policy_version 1153978 (0.00084) [2022-07-11 10:32:36,840][26022] Updated weights on worker 0-0, policy_version 1153988 (0.00085) [2022-07-11 10:32:38,416][26022] Updated weights on worker 0-0, policy_version 1153998 (0.00092) [2022-07-11 10:32:39,112][25689] Fps is (10 sec: 5555.3, 60 sec: 5523.6, 300 sec: 5524.7). Total num frames: 1181697024. Throughput: 0: 4952.5. Samples: 1181691706. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:32:39,113][25689] Avg episode reward: [(0, '-2.454')] [2022-07-11 10:32:40,185][26022] Updated weights on worker 0-0, policy_version 1154008 (0.00085) [2022-07-11 10:32:42,186][26022] Updated weights on worker 0-0, policy_version 1154018 (0.00085) [2022-07-11 10:32:43,835][26022] Updated weights on worker 0-0, policy_version 1154028 (0.00086) [2022-07-11 10:32:44,184][25689] Fps is (10 sec: 5619.0, 60 sec: 5536.7, 300 sec: 5523.4). Total num frames: 1181726720. Throughput: 0: 5767.4. Samples: 1181725146. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:32:44,185][25689] Avg episode reward: [(0, '-1.241')] [2022-07-11 10:32:46,003][26022] Updated weights on worker 0-0, policy_version 1154038 (0.00093) [2022-07-11 10:32:47,685][26022] Updated weights on worker 0-0, policy_version 1154048 (0.00085) [2022-07-11 10:32:49,223][25689] Fps is (10 sec: 5468.5, 60 sec: 5516.7, 300 sec: 5516.1). Total num frames: 1181752320. Throughput: 0: 5780.3. Samples: 1181758440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:32:49,224][25689] Avg episode reward: [(0, '-1.440')] [2022-07-11 10:32:49,518][26022] Updated weights on worker 0-0, policy_version 1154058 (0.00089) [2022-07-11 10:32:51,436][26022] Updated weights on worker 0-0, policy_version 1154068 (0.00080) [2022-07-11 10:32:53,074][26022] Updated weights on worker 0-0, policy_version 1154078 (0.00091) [2022-07-11 10:32:54,249][25689] Fps is (10 sec: 5290.4, 60 sec: 5481.3, 300 sec: 5517.5). Total num frames: 1181779968. Throughput: 0: 4941.2. Samples: 1181775156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:32:54,250][25689] Avg episode reward: [(0, '-1.226')] [2022-07-11 10:32:55,243][26022] Updated weights on worker 0-0, policy_version 1154088 (0.00093) [2022-07-11 10:32:56,643][26022] Updated weights on worker 0-0, policy_version 1154098 (0.00088) [2022-07-11 10:32:58,743][26022] Updated weights on worker 0-0, policy_version 1154108 (0.00100) [2022-07-11 10:32:59,340][25689] Fps is (10 sec: 5668.3, 60 sec: 5529.4, 300 sec: 5522.9). Total num frames: 1181809664. Throughput: 0: 5771.3. Samples: 1181808450. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:32:59,340][25689] Avg episode reward: [(0, '-0.100')] [2022-07-11 10:33:00,684][26022] Updated weights on worker 0-0, policy_version 1154118 (0.00095) [2022-07-11 10:33:02,730][26022] Updated weights on worker 0-0, policy_version 1154128 (0.00095) [2022-07-11 10:33:04,349][25689] Fps is (10 sec: 5474.7, 60 sec: 5496.9, 300 sec: 5512.6). Total num frames: 1181835264. Throughput: 0: 5664.0. Samples: 1181839362. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:04,350][25689] Avg episode reward: [(0, '0.198')] [2022-07-11 10:33:04,583][26022] Updated weights on worker 0-0, policy_version 1154138 (0.00094) [2022-07-11 10:33:06,427][26022] Updated weights on worker 0-0, policy_version 1154148 (0.00086) [2022-07-11 10:33:08,326][26022] Updated weights on worker 0-0, policy_version 1154158 (0.00096) [2022-07-11 10:33:09,404][25689] Fps is (10 sec: 5290.6, 60 sec: 5510.3, 300 sec: 5515.2). Total num frames: 1181862912. Throughput: 0: 5650.6. Samples: 1181872476. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:09,406][25689] Avg episode reward: [(0, '-0.600')] [2022-07-11 10:33:10,500][26022] Updated weights on worker 0-0, policy_version 1154168 (0.00090) [2022-07-11 10:33:11,898][26022] Updated weights on worker 0-0, policy_version 1154178 (0.00082) [2022-07-11 10:33:14,074][26022] Updated weights on worker 0-0, policy_version 1154188 (0.00092) [2022-07-11 10:33:14,421][25689] Fps is (10 sec: 5591.8, 60 sec: 5510.7, 300 sec: 5516.7). Total num frames: 1181891584. Throughput: 0: 5654.9. Samples: 1181889228. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:14,423][25689] Avg episode reward: [(0, '-0.607')] [2022-07-11 10:33:15,665][26022] Updated weights on worker 0-0, policy_version 1154198 (0.00092) [2022-07-11 10:33:17,507][26022] Updated weights on worker 0-0, policy_version 1154208 (0.00101) [2022-07-11 10:33:19,556][25689] Fps is (10 sec: 5447.1, 60 sec: 5487.9, 300 sec: 5507.8). Total num frames: 1181918208. Throughput: 0: 5644.3. Samples: 1181922556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:19,557][25689] Avg episode reward: [(0, '0.409')] [2022-07-11 10:33:19,597][26022] Updated weights on worker 0-0, policy_version 1154218 (0.00089) [2022-07-11 10:33:20,984][26022] Updated weights on worker 0-0, policy_version 1154228 (0.00082) [2022-07-11 10:33:23,247][26022] Updated weights on worker 0-0, policy_version 1154238 (0.00084) [2022-07-11 10:33:24,570][25689] Fps is (10 sec: 5549.5, 60 sec: 5523.8, 300 sec: 5518.7). Total num frames: 1181947904. Throughput: 0: 5767.0. Samples: 1181955974. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:24,570][25689] Avg episode reward: [(0, '0.906')] [2022-07-11 10:33:24,968][26022] Updated weights on worker 0-0, policy_version 1154248 (0.00091) [2022-07-11 10:33:26,853][26022] Updated weights on worker 0-0, policy_version 1154258 (0.00099) [2022-07-11 10:33:28,897][26022] Updated weights on worker 0-0, policy_version 1154268 (0.00091) [2022-07-11 10:33:29,669][25689] Fps is (10 sec: 5569.0, 60 sec: 5500.0, 300 sec: 5517.5). Total num frames: 1181974528. Throughput: 0: 4927.7. Samples: 1181972334. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:29,670][25689] Avg episode reward: [(0, '0.801')] [2022-07-11 10:33:30,552][26022] Updated weights on worker 0-0, policy_version 1154278 (0.00098) [2022-07-11 10:33:32,557][26022] Updated weights on worker 0-0, policy_version 1154288 (0.00084) [2022-07-11 10:33:34,273][26022] Updated weights on worker 0-0, policy_version 1154298 (0.00088) [2022-07-11 10:33:34,759][25689] Fps is (10 sec: 5427.1, 60 sec: 5492.3, 300 sec: 5510.4). Total num frames: 1182003200. Throughput: 0: 5710.6. Samples: 1182005366. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:34,759][25689] Avg episode reward: [(0, '0.574')] [2022-07-11 10:33:36,255][26022] Updated weights on worker 0-0, policy_version 1154308 (0.00095) [2022-07-11 10:33:37,919][26022] Updated weights on worker 0-0, policy_version 1154318 (0.00085) [2022-07-11 10:33:39,837][25689] Fps is (10 sec: 5538.9, 60 sec: 5497.2, 300 sec: 5513.2). Total num frames: 1182030848. Throughput: 0: 5719.7. Samples: 1182038558. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:39,838][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 10:33:39,897][26022] Updated weights on worker 0-0, policy_version 1154328 (0.00091) [2022-07-11 10:33:41,643][26022] Updated weights on worker 0-0, policy_version 1154338 (0.00390) [2022-07-11 10:33:43,674][26022] Updated weights on worker 0-0, policy_version 1154348 (0.00086) [2022-07-11 10:33:44,841][25689] Fps is (10 sec: 5585.9, 60 sec: 5486.6, 300 sec: 5520.3). Total num frames: 1182059520. Throughput: 0: 4900.6. Samples: 1182055316. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:44,842][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 10:33:45,265][26022] Updated weights on worker 0-0, policy_version 1154358 (0.00079) [2022-07-11 10:33:47,419][26022] Updated weights on worker 0-0, policy_version 1154368 (0.00096) [2022-07-11 10:33:48,959][26022] Updated weights on worker 0-0, policy_version 1154378 (0.00091) [2022-07-11 10:33:49,847][25689] Fps is (10 sec: 5524.2, 60 sec: 5506.5, 300 sec: 5510.1). Total num frames: 1182086144. Throughput: 0: 5767.7. Samples: 1182088714. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:49,848][25689] Avg episode reward: [(0, '-0.166')] [2022-07-11 10:33:51,016][26022] Updated weights on worker 0-0, policy_version 1154388 (0.00081) [2022-07-11 10:33:52,799][26022] Updated weights on worker 0-0, policy_version 1154398 (0.00095) [2022-07-11 10:33:54,683][26022] Updated weights on worker 0-0, policy_version 1154408 (0.00093) [2022-07-11 10:33:54,883][25689] Fps is (10 sec: 5506.8, 60 sec: 5522.5, 300 sec: 5514.0). Total num frames: 1182114816. Throughput: 0: 5796.3. Samples: 1182122010. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:54,883][25689] Avg episode reward: [(0, '-0.477')] [2022-07-11 10:33:56,590][26022] Updated weights on worker 0-0, policy_version 1154418 (0.00095) [2022-07-11 10:33:58,491][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:33:58,506][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001154428_1182134272.pth [2022-07-11 10:33:58,507][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001152488_1180147712.pth [2022-07-11 10:33:58,510][26022] Updated weights on worker 0-0, policy_version 1154428 (0.00097) [2022-07-11 10:33:59,993][25689] Fps is (10 sec: 5550.9, 60 sec: 5486.9, 300 sec: 5515.7). Total num frames: 1182142464. Throughput: 0: 4961.7. Samples: 1182138564. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:33:59,994][25689] Avg episode reward: [(0, '-0.198')] [2022-07-11 10:34:00,227][26022] Updated weights on worker 0-0, policy_version 1154438 (0.00090) [2022-07-11 10:34:02,093][26022] Updated weights on worker 0-0, policy_version 1154448 (0.00086) [2022-07-11 10:34:04,382][26022] Updated weights on worker 0-0, policy_version 1154458 (0.00081) [2022-07-11 10:34:05,079][25689] Fps is (10 sec: 5222.4, 60 sec: 5480.1, 300 sec: 5504.2). Total num frames: 1182168064. Throughput: 0: 5649.1. Samples: 1182169640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:05,079][25689] Avg episode reward: [(0, '0.312')] [2022-07-11 10:34:06,133][26022] Updated weights on worker 0-0, policy_version 1154468 (0.00088) [2022-07-11 10:34:08,048][26022] Updated weights on worker 0-0, policy_version 1154478 (0.00085) [2022-07-11 10:34:09,745][26022] Updated weights on worker 0-0, policy_version 1154488 (0.00093) [2022-07-11 10:34:10,109][25689] Fps is (10 sec: 5364.7, 60 sec: 5499.1, 300 sec: 5514.3). Total num frames: 1182196736. Throughput: 0: 5656.9. Samples: 1182203336. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:10,110][25689] Avg episode reward: [(0, '-0.632')] [2022-07-11 10:34:11,631][26022] Updated weights on worker 0-0, policy_version 1154498 (0.00090) [2022-07-11 10:34:13,661][26022] Updated weights on worker 0-0, policy_version 1154508 (0.00092) [2022-07-11 10:34:15,166][26022] Updated weights on worker 0-0, policy_version 1154518 (0.00080) [2022-07-11 10:34:15,169][25689] Fps is (10 sec: 5683.0, 60 sec: 5495.2, 300 sec: 5511.4). Total num frames: 1182225408. Throughput: 0: 4835.0. Samples: 1182220096. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:15,170][25689] Avg episode reward: [(0, '-0.503')] [2022-07-11 10:34:17,119][26022] Updated weights on worker 0-0, policy_version 1154528 (0.00091) [2022-07-11 10:34:18,955][26022] Updated weights on worker 0-0, policy_version 1154538 (0.00088) [2022-07-11 10:34:20,249][25689] Fps is (10 sec: 5554.5, 60 sec: 5517.1, 300 sec: 5508.3). Total num frames: 1182253056. Throughput: 0: 5670.5. Samples: 1182253426. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:20,255][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 10:34:20,768][26022] Updated weights on worker 0-0, policy_version 1154548 (0.00086) [2022-07-11 10:34:22,623][26022] Updated weights on worker 0-0, policy_version 1154558 (0.00087) [2022-07-11 10:34:24,542][26022] Updated weights on worker 0-0, policy_version 1154568 (0.00091) [2022-07-11 10:34:25,276][25689] Fps is (10 sec: 5369.9, 60 sec: 5465.3, 300 sec: 5508.2). Total num frames: 1182279680. Throughput: 0: 5809.5. Samples: 1182286976. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:25,276][25689] Avg episode reward: [(0, '0.518')] [2022-07-11 10:34:26,267][26022] Updated weights on worker 0-0, policy_version 1154578 (0.00093) [2022-07-11 10:34:28,539][26022] Updated weights on worker 0-0, policy_version 1154588 (0.00085) [2022-07-11 10:34:29,845][26022] Updated weights on worker 0-0, policy_version 1154598 (0.00093) [2022-07-11 10:34:30,296][25689] Fps is (10 sec: 5707.7, 60 sec: 5540.0, 300 sec: 5519.8). Total num frames: 1182310400. Throughput: 0: 4969.1. Samples: 1182303644. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:30,296][25689] Avg episode reward: [(0, '0.586')] [2022-07-11 10:34:32,102][26022] Updated weights on worker 0-0, policy_version 1154608 (0.00086) [2022-07-11 10:34:33,519][26022] Updated weights on worker 0-0, policy_version 1154618 (0.00091) [2022-07-11 10:34:35,302][25689] Fps is (10 sec: 5719.6, 60 sec: 5513.8, 300 sec: 5511.0). Total num frames: 1182337024. Throughput: 0: 5814.5. Samples: 1182337156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:35,302][25689] Avg episode reward: [(0, '0.205')] [2022-07-11 10:34:35,591][26022] Updated weights on worker 0-0, policy_version 1154628 (0.00088) [2022-07-11 10:34:37,466][26022] Updated weights on worker 0-0, policy_version 1154638 (0.00084) [2022-07-11 10:34:39,077][26022] Updated weights on worker 0-0, policy_version 1154648 (0.00084) [2022-07-11 10:34:40,366][25689] Fps is (10 sec: 5491.3, 60 sec: 5532.1, 300 sec: 5517.0). Total num frames: 1182365696. Throughput: 0: 5840.0. Samples: 1182370908. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:40,366][25689] Avg episode reward: [(0, '0.855')] [2022-07-11 10:34:41,184][26022] Updated weights on worker 0-0, policy_version 1154658 (0.00093) [2022-07-11 10:34:42,925][26022] Updated weights on worker 0-0, policy_version 1154668 (0.00078) [2022-07-11 10:34:44,629][26022] Updated weights on worker 0-0, policy_version 1154678 (0.00107) [2022-07-11 10:34:45,426][25689] Fps is (10 sec: 5563.1, 60 sec: 5510.1, 300 sec: 5516.2). Total num frames: 1182393344. Throughput: 0: 4996.6. Samples: 1182387656. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:45,426][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 10:34:46,552][26022] Updated weights on worker 0-0, policy_version 1154688 (0.00081) [2022-07-11 10:34:48,437][26022] Updated weights on worker 0-0, policy_version 1154698 (0.00087) [2022-07-11 10:34:50,206][26022] Updated weights on worker 0-0, policy_version 1154708 (0.00086) [2022-07-11 10:34:50,450][25689] Fps is (10 sec: 5584.9, 60 sec: 5542.2, 300 sec: 5515.8). Total num frames: 1182422016. Throughput: 0: 5826.2. Samples: 1182421066. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:50,450][25689] Avg episode reward: [(0, '0.755')] [2022-07-11 10:34:52,159][26022] Updated weights on worker 0-0, policy_version 1154718 (0.00083) [2022-07-11 10:34:53,740][26022] Updated weights on worker 0-0, policy_version 1154728 (0.00094) [2022-07-11 10:34:55,510][25689] Fps is (10 sec: 5584.6, 60 sec: 5523.0, 300 sec: 5517.1). Total num frames: 1182449664. Throughput: 0: 5804.7. Samples: 1182454462. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:34:55,511][25689] Avg episode reward: [(0, '-0.229')] [2022-07-11 10:34:55,943][26022] Updated weights on worker 0-0, policy_version 1154738 (0.00081) [2022-07-11 10:34:57,589][26022] Updated weights on worker 0-0, policy_version 1154748 (0.00084) [2022-07-11 10:34:59,405][26022] Updated weights on worker 0-0, policy_version 1154758 (0.00088) [2022-07-11 10:35:00,602][25689] Fps is (10 sec: 5547.5, 60 sec: 5541.6, 300 sec: 5522.5). Total num frames: 1182478336. Throughput: 0: 4954.0. Samples: 1182471162. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:00,603][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 10:35:01,222][26022] Updated weights on worker 0-0, policy_version 1154768 (0.00083) [2022-07-11 10:35:03,472][26022] Updated weights on worker 0-0, policy_version 1154778 (0.00088) [2022-07-11 10:35:05,390][26022] Updated weights on worker 0-0, policy_version 1154788 (0.00098) [2022-07-11 10:35:05,610][25689] Fps is (10 sec: 5373.4, 60 sec: 5548.7, 300 sec: 5512.4). Total num frames: 1182503936. Throughput: 0: 5697.0. Samples: 1182502650. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:05,611][25689] Avg episode reward: [(0, '0.786')] [2022-07-11 10:35:07,175][26022] Updated weights on worker 0-0, policy_version 1154798 (0.00094) [2022-07-11 10:35:08,999][26022] Updated weights on worker 0-0, policy_version 1154808 (0.00085) [2022-07-11 10:35:10,629][25689] Fps is (10 sec: 5310.6, 60 sec: 5532.9, 300 sec: 5512.3). Total num frames: 1182531584. Throughput: 0: 5697.3. Samples: 1182536032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:10,630][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 10:35:10,909][26022] Updated weights on worker 0-0, policy_version 1154818 (0.00091) [2022-07-11 10:35:12,873][26022] Updated weights on worker 0-0, policy_version 1154828 (0.00093) [2022-07-11 10:35:14,524][26022] Updated weights on worker 0-0, policy_version 1154838 (0.00091) [2022-07-11 10:35:15,633][25689] Fps is (10 sec: 5619.1, 60 sec: 5538.0, 300 sec: 5513.6). Total num frames: 1182560256. Throughput: 0: 4887.7. Samples: 1182552816. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:15,635][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 10:35:16,403][26022] Updated weights on worker 0-0, policy_version 1154848 (0.00091) [2022-07-11 10:35:18,142][26022] Updated weights on worker 0-0, policy_version 1154858 (0.00903) [2022-07-11 10:35:20,103][26022] Updated weights on worker 0-0, policy_version 1154868 (0.00093) [2022-07-11 10:35:20,751][25689] Fps is (10 sec: 5766.6, 60 sec: 5568.4, 300 sec: 5525.2). Total num frames: 1182589952. Throughput: 0: 5735.1. Samples: 1182586714. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:20,753][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 10:35:21,935][26022] Updated weights on worker 0-0, policy_version 1154878 (0.00089) [2022-07-11 10:35:23,529][26022] Updated weights on worker 0-0, policy_version 1154888 (0.00084) [2022-07-11 10:35:25,654][26022] Updated weights on worker 0-0, policy_version 1154898 (0.00092) [2022-07-11 10:35:25,764][25689] Fps is (10 sec: 5458.5, 60 sec: 5552.7, 300 sec: 5511.3). Total num frames: 1182615552. Throughput: 0: 5837.4. Samples: 1182620292. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:25,765][25689] Avg episode reward: [(0, '1.085')] [2022-07-11 10:35:27,211][26022] Updated weights on worker 0-0, policy_version 1154908 (0.00084) [2022-07-11 10:35:29,359][26022] Updated weights on worker 0-0, policy_version 1154918 (0.00086) [2022-07-11 10:35:30,775][25689] Fps is (10 sec: 5414.2, 60 sec: 5519.7, 300 sec: 5514.7). Total num frames: 1182644224. Throughput: 0: 5847.3. Samples: 1182653830. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:30,776][25689] Avg episode reward: [(0, '0.594')] [2022-07-11 10:35:30,872][26022] Updated weights on worker 0-0, policy_version 1154928 (0.00088) [2022-07-11 10:35:33,020][26022] Updated weights on worker 0-0, policy_version 1154938 (0.00080) [2022-07-11 10:35:34,734][26022] Updated weights on worker 0-0, policy_version 1154948 (0.00083) [2022-07-11 10:35:35,819][25689] Fps is (10 sec: 5702.7, 60 sec: 5550.0, 300 sec: 5518.5). Total num frames: 1182672896. Throughput: 0: 5809.7. Samples: 1182670090. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:35,820][25689] Avg episode reward: [(0, '-1.039')] [2022-07-11 10:35:36,567][26022] Updated weights on worker 0-0, policy_version 1154958 (0.00091) [2022-07-11 10:35:38,474][26022] Updated weights on worker 0-0, policy_version 1154968 (0.00085) [2022-07-11 10:35:40,335][26022] Updated weights on worker 0-0, policy_version 1154978 (0.00084) [2022-07-11 10:35:40,900][25689] Fps is (10 sec: 5562.0, 60 sec: 5531.5, 300 sec: 5517.2). Total num frames: 1182700544. Throughput: 0: 5784.8. Samples: 1182703276. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:40,902][25689] Avg episode reward: [(0, '-1.315')] [2022-07-11 10:35:42,221][26022] Updated weights on worker 0-0, policy_version 1154988 (0.00085) [2022-07-11 10:35:44,042][26022] Updated weights on worker 0-0, policy_version 1154998 (0.00613) [2022-07-11 10:35:45,861][26022] Updated weights on worker 0-0, policy_version 1155008 (0.00089) [2022-07-11 10:35:45,944][25689] Fps is (10 sec: 5562.7, 60 sec: 5549.9, 300 sec: 5523.5). Total num frames: 1182729216. Throughput: 0: 5766.0. Samples: 1182736650. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:45,945][25689] Avg episode reward: [(0, '-1.150')] [2022-07-11 10:35:47,704][26022] Updated weights on worker 0-0, policy_version 1155018 (0.00052) [2022-07-11 10:35:49,571][26022] Updated weights on worker 0-0, policy_version 1155028 (0.00082) [2022-07-11 10:35:50,980][25689] Fps is (10 sec: 5486.0, 60 sec: 5515.0, 300 sec: 5516.3). Total num frames: 1182755840. Throughput: 0: 4926.0. Samples: 1182753362. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:50,982][25689] Avg episode reward: [(0, '-1.201')] [2022-07-11 10:35:51,409][26022] Updated weights on worker 0-0, policy_version 1155038 (0.00095) [2022-07-11 10:35:53,155][26022] Updated weights on worker 0-0, policy_version 1155048 (0.00087) [2022-07-11 10:35:55,035][26022] Updated weights on worker 0-0, policy_version 1155058 (0.00082) [2022-07-11 10:35:55,984][25689] Fps is (10 sec: 5405.5, 60 sec: 5520.2, 300 sec: 5514.8). Total num frames: 1182783488. Throughput: 0: 5790.1. Samples: 1182786846. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:35:55,984][25689] Avg episode reward: [(0, '-0.789')] [2022-07-11 10:35:57,062][26022] Updated weights on worker 0-0, policy_version 1155068 (0.00090) [2022-07-11 10:35:58,590][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:35:58,603][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001155077_1182798848.pth [2022-07-11 10:35:58,604][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001153136_1180811264.pth [2022-07-11 10:35:58,801][26022] Updated weights on worker 0-0, policy_version 1155078 (0.00091) [2022-07-11 10:36:00,640][26022] Updated weights on worker 0-0, policy_version 1155088 (0.00101) [2022-07-11 10:36:01,036][25689] Fps is (10 sec: 5600.8, 60 sec: 5523.8, 300 sec: 5525.7). Total num frames: 1182812160. Throughput: 0: 5805.5. Samples: 1182820172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:01,036][25689] Avg episode reward: [(0, '-0.017')] [2022-07-11 10:36:02,807][26022] Updated weights on worker 0-0, policy_version 1155098 (0.00079) [2022-07-11 10:36:04,674][26022] Updated weights on worker 0-0, policy_version 1155108 (0.00081) [2022-07-11 10:36:06,061][25689] Fps is (10 sec: 5385.3, 60 sec: 5522.2, 300 sec: 5515.2). Total num frames: 1182837760. Throughput: 0: 4882.3. Samples: 1182834872. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:06,070][25689] Avg episode reward: [(0, '0.160')] [2022-07-11 10:36:06,461][26022] Updated weights on worker 0-0, policy_version 1155118 (0.00090) [2022-07-11 10:36:08,137][26022] Updated weights on worker 0-0, policy_version 1155128 (0.00084) [2022-07-11 10:36:10,114][26022] Updated weights on worker 0-0, policy_version 1155138 (0.00086) [2022-07-11 10:36:11,106][25689] Fps is (10 sec: 5287.5, 60 sec: 5519.8, 300 sec: 5514.5). Total num frames: 1182865408. Throughput: 0: 5717.7. Samples: 1182868438. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:11,107][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 10:36:11,832][26022] Updated weights on worker 0-0, policy_version 1155148 (0.00088) [2022-07-11 10:36:13,794][26022] Updated weights on worker 0-0, policy_version 1155158 (0.00085) [2022-07-11 10:36:15,615][26022] Updated weights on worker 0-0, policy_version 1155168 (0.00086) [2022-07-11 10:36:16,117][25689] Fps is (10 sec: 5601.2, 60 sec: 5519.3, 300 sec: 5515.5). Total num frames: 1182894080. Throughput: 0: 5720.5. Samples: 1182902016. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:16,117][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 10:36:17,496][26022] Updated weights on worker 0-0, policy_version 1155178 (0.00086) [2022-07-11 10:36:19,469][26022] Updated weights on worker 0-0, policy_version 1155188 (0.00087) [2022-07-11 10:36:21,181][26022] Updated weights on worker 0-0, policy_version 1155198 (0.00090) [2022-07-11 10:36:21,182][25689] Fps is (10 sec: 5589.6, 60 sec: 5490.1, 300 sec: 5516.1). Total num frames: 1182921728. Throughput: 0: 4892.2. Samples: 1182918732. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:21,183][25689] Avg episode reward: [(0, '-0.605')] [2022-07-11 10:36:23,246][26022] Updated weights on worker 0-0, policy_version 1155208 (0.00081) [2022-07-11 10:36:24,875][26022] Updated weights on worker 0-0, policy_version 1155218 (0.00091) [2022-07-11 10:36:26,187][25689] Fps is (10 sec: 5490.9, 60 sec: 5524.7, 300 sec: 5521.1). Total num frames: 1182949376. Throughput: 0: 5827.6. Samples: 1182952156. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:26,188][25689] Avg episode reward: [(0, '-2.481')] [2022-07-11 10:36:26,712][26022] Updated weights on worker 0-0, policy_version 1155228 (0.00090) [2022-07-11 10:36:28,560][26022] Updated weights on worker 0-0, policy_version 1155238 (0.00088) [2022-07-11 10:36:30,443][26022] Updated weights on worker 0-0, policy_version 1155248 (0.00090) [2022-07-11 10:36:31,203][25689] Fps is (10 sec: 5620.2, 60 sec: 5524.3, 300 sec: 5521.7). Total num frames: 1182978048. Throughput: 0: 5809.9. Samples: 1182985200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:31,204][25689] Avg episode reward: [(0, '-2.809')] [2022-07-11 10:36:32,488][26022] Updated weights on worker 0-0, policy_version 1155258 (0.00091) [2022-07-11 10:36:34,176][26022] Updated weights on worker 0-0, policy_version 1155268 (0.00088) [2022-07-11 10:36:36,128][26022] Updated weights on worker 0-0, policy_version 1155278 (0.00619) [2022-07-11 10:36:36,218][25689] Fps is (10 sec: 5512.4, 60 sec: 5493.1, 300 sec: 5516.8). Total num frames: 1183004672. Throughput: 0: 4961.2. Samples: 1183001744. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 10:36:36,219][25689] Avg episode reward: [(0, '-2.543')] [2022-07-11 10:36:37,730][26022] Updated weights on worker 0-0, policy_version 1155288 (0.00089) [2022-07-11 10:36:39,827][26022] Updated weights on worker 0-0, policy_version 1155298 (0.00095) [2022-07-11 10:36:41,301][25689] Fps is (10 sec: 5476.0, 60 sec: 5509.9, 300 sec: 5515.8). Total num frames: 1183033344. Throughput: 0: 5789.7. Samples: 1183035216. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:36:41,302][25689] Avg episode reward: [(0, '-2.800')] [2022-07-11 10:36:41,516][26022] Updated weights on worker 0-0, policy_version 1155308 (0.00088) [2022-07-11 10:36:43,363][26022] Updated weights on worker 0-0, policy_version 1155318 (0.00085) [2022-07-11 10:36:45,097][26022] Updated weights on worker 0-0, policy_version 1155328 (0.00086) [2022-07-11 10:36:46,332][25689] Fps is (10 sec: 5771.3, 60 sec: 5528.0, 300 sec: 5525.7). Total num frames: 1183063040. Throughput: 0: 5789.9. Samples: 1183068792. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:36:46,332][25689] Avg episode reward: [(0, '-2.811')] [2022-07-11 10:36:47,160][26022] Updated weights on worker 0-0, policy_version 1155338 (0.00085) [2022-07-11 10:36:48,801][26022] Updated weights on worker 0-0, policy_version 1155348 (0.00115) [2022-07-11 10:36:50,887][26022] Updated weights on worker 0-0, policy_version 1155358 (0.00087) [2022-07-11 10:36:51,341][25689] Fps is (10 sec: 5609.6, 60 sec: 5530.5, 300 sec: 5515.3). Total num frames: 1183089664. Throughput: 0: 4997.2. Samples: 1183085832. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:36:51,343][25689] Avg episode reward: [(0, '-0.645')] [2022-07-11 10:36:52,319][26022] Updated weights on worker 0-0, policy_version 1155368 (0.00094) [2022-07-11 10:36:54,443][26022] Updated weights on worker 0-0, policy_version 1155378 (0.00084) [2022-07-11 10:36:55,858][26022] Updated weights on worker 0-0, policy_version 1155388 (0.00086) [2022-07-11 10:36:56,362][25689] Fps is (10 sec: 5615.3, 60 sec: 5562.8, 300 sec: 5526.4). Total num frames: 1183119360. Throughput: 0: 5861.2. Samples: 1183119808. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:36:56,363][25689] Avg episode reward: [(0, '0.046')] [2022-07-11 10:36:58,014][26022] Updated weights on worker 0-0, policy_version 1155398 (0.00094) [2022-07-11 10:36:59,720][26022] Updated weights on worker 0-0, policy_version 1155408 (0.00090) [2022-07-11 10:37:01,398][25689] Fps is (10 sec: 5702.0, 60 sec: 5547.3, 300 sec: 5526.2). Total num frames: 1183147008. Throughput: 0: 5879.5. Samples: 1183153374. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:01,398][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 10:37:01,923][26022] Updated weights on worker 0-0, policy_version 1155418 (0.00091) [2022-07-11 10:37:03,870][26022] Updated weights on worker 0-0, policy_version 1155428 (0.00090) [2022-07-11 10:37:05,512][26022] Updated weights on worker 0-0, policy_version 1155438 (0.00085) [2022-07-11 10:37:06,494][25689] Fps is (10 sec: 5255.1, 60 sec: 5540.8, 300 sec: 5521.3). Total num frames: 1183172608. Throughput: 0: 4921.4. Samples: 1183168020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:06,495][25689] Avg episode reward: [(0, '1.090')] [2022-07-11 10:37:07,338][26022] Updated weights on worker 0-0, policy_version 1155448 (0.00050) [2022-07-11 10:37:09,276][26022] Updated weights on worker 0-0, policy_version 1155458 (0.00093) [2022-07-11 10:37:11,131][26022] Updated weights on worker 0-0, policy_version 1155468 (0.00089) [2022-07-11 10:37:11,519][25689] Fps is (10 sec: 5362.4, 60 sec: 5559.6, 300 sec: 5521.2). Total num frames: 1183201280. Throughput: 0: 5736.3. Samples: 1183201578. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:11,519][25689] Avg episode reward: [(0, '0.846')] [2022-07-11 10:37:12,907][26022] Updated weights on worker 0-0, policy_version 1155478 (0.00095) [2022-07-11 10:37:14,779][26022] Updated weights on worker 0-0, policy_version 1155488 (0.00083) [2022-07-11 10:37:16,524][25689] Fps is (10 sec: 5615.2, 60 sec: 5543.2, 300 sec: 5522.4). Total num frames: 1183228928. Throughput: 0: 5740.9. Samples: 1183235560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:16,525][25689] Avg episode reward: [(0, '0.216')] [2022-07-11 10:37:16,544][26022] Updated weights on worker 0-0, policy_version 1155498 (0.00087) [2022-07-11 10:37:18,379][26022] Updated weights on worker 0-0, policy_version 1155508 (0.00087) [2022-07-11 10:37:20,470][26022] Updated weights on worker 0-0, policy_version 1155518 (0.00094) [2022-07-11 10:37:21,576][25689] Fps is (10 sec: 5601.4, 60 sec: 5561.6, 300 sec: 5525.6). Total num frames: 1183257600. Throughput: 0: 4900.5. Samples: 1183252244. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:21,577][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 10:37:21,899][26022] Updated weights on worker 0-0, policy_version 1155528 (0.00091) [2022-07-11 10:37:23,861][26022] Updated weights on worker 0-0, policy_version 1155538 (0.00079) [2022-07-11 10:37:25,635][26022] Updated weights on worker 0-0, policy_version 1155548 (0.00092) [2022-07-11 10:37:26,605][25689] Fps is (10 sec: 5688.1, 60 sec: 5576.1, 300 sec: 5528.9). Total num frames: 1183286272. Throughput: 0: 5871.3. Samples: 1183286100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:26,606][25689] Avg episode reward: [(0, '0.415')] [2022-07-11 10:37:27,601][26022] Updated weights on worker 0-0, policy_version 1155558 (0.00091) [2022-07-11 10:37:29,346][26022] Updated weights on worker 0-0, policy_version 1155568 (0.00091) [2022-07-11 10:37:31,259][26022] Updated weights on worker 0-0, policy_version 1155578 (0.00097) [2022-07-11 10:37:31,653][25689] Fps is (10 sec: 5587.5, 60 sec: 5556.2, 300 sec: 5524.6). Total num frames: 1183313920. Throughput: 0: 5862.1. Samples: 1183319606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:31,654][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 10:37:33,072][26022] Updated weights on worker 0-0, policy_version 1155588 (0.00091) [2022-07-11 10:37:34,957][26022] Updated weights on worker 0-0, policy_version 1155598 (0.00092) [2022-07-11 10:37:36,517][26022] Updated weights on worker 0-0, policy_version 1155608 (0.00092) [2022-07-11 10:37:36,660][25689] Fps is (10 sec: 5601.3, 60 sec: 5590.8, 300 sec: 5530.4). Total num frames: 1183342592. Throughput: 0: 5000.3. Samples: 1183336248. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:36,661][25689] Avg episode reward: [(0, '-0.689')] [2022-07-11 10:37:38,775][26022] Updated weights on worker 0-0, policy_version 1155618 (0.00050) [2022-07-11 10:37:40,290][26022] Updated weights on worker 0-0, policy_version 1155628 (0.00085) [2022-07-11 10:37:41,802][25689] Fps is (10 sec: 5448.6, 60 sec: 5551.5, 300 sec: 5518.8). Total num frames: 1183369216. Throughput: 0: 5811.1. Samples: 1183369794. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:41,802][25689] Avg episode reward: [(0, '-1.494')] [2022-07-11 10:37:42,413][26022] Updated weights on worker 0-0, policy_version 1155638 (0.00092) [2022-07-11 10:37:43,938][26022] Updated weights on worker 0-0, policy_version 1155648 (0.00905) [2022-07-11 10:37:45,998][26022] Updated weights on worker 0-0, policy_version 1155658 (0.00085) [2022-07-11 10:37:46,848][25689] Fps is (10 sec: 5628.5, 60 sec: 5567.0, 300 sec: 5535.9). Total num frames: 1183399936. Throughput: 0: 5803.3. Samples: 1183403576. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:46,849][25689] Avg episode reward: [(0, '-0.835')] [2022-07-11 10:37:47,623][26022] Updated weights on worker 0-0, policy_version 1155668 (0.00113) [2022-07-11 10:37:49,434][26022] Updated weights on worker 0-0, policy_version 1155678 (0.00082) [2022-07-11 10:37:51,271][26022] Updated weights on worker 0-0, policy_version 1155688 (0.00081) [2022-07-11 10:37:51,857][25689] Fps is (10 sec: 5702.7, 60 sec: 5567.0, 300 sec: 5532.7). Total num frames: 1183426560. Throughput: 0: 4992.7. Samples: 1183420480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:51,858][25689] Avg episode reward: [(0, '-2.926')] [2022-07-11 10:37:53,076][26022] Updated weights on worker 0-0, policy_version 1155698 (0.00090) [2022-07-11 10:37:54,980][26022] Updated weights on worker 0-0, policy_version 1155708 (0.00090) [2022-07-11 10:37:56,500][26022] Updated weights on worker 0-0, policy_version 1155718 (0.00087) [2022-07-11 10:37:56,934][25689] Fps is (10 sec: 5583.9, 60 sec: 5561.8, 300 sec: 5533.0). Total num frames: 1183456256. Throughput: 0: 5827.7. Samples: 1183454400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:37:56,935][25689] Avg episode reward: [(0, '-2.890')] [2022-07-11 10:37:58,497][26022] Updated weights on worker 0-0, policy_version 1155728 (0.00092) [2022-07-11 10:37:58,836][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:37:58,847][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001155729_1183466496.pth [2022-07-11 10:37:58,847][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001153782_1181472768.pth [2022-07-11 10:38:00,457][26022] Updated weights on worker 0-0, policy_version 1155738 (0.00085) [2022-07-11 10:38:01,993][25689] Fps is (10 sec: 5657.8, 60 sec: 5559.8, 300 sec: 5539.0). Total num frames: 1183483904. Throughput: 0: 5861.8. Samples: 1183488150. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:01,993][25689] Avg episode reward: [(0, '-2.915')] [2022-07-11 10:38:02,824][26022] Updated weights on worker 0-0, policy_version 1155748 (0.00087) [2022-07-11 10:38:04,378][26022] Updated weights on worker 0-0, policy_version 1155758 (0.00090) [2022-07-11 10:38:06,359][26022] Updated weights on worker 0-0, policy_version 1155768 (0.00089) [2022-07-11 10:38:07,037][25689] Fps is (10 sec: 5372.3, 60 sec: 5581.5, 300 sec: 5535.7). Total num frames: 1183510528. Throughput: 0: 5739.8. Samples: 1183519452. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:07,039][25689] Avg episode reward: [(0, '-2.761')] [2022-07-11 10:38:08,192][26022] Updated weights on worker 0-0, policy_version 1155778 (0.00090) [2022-07-11 10:38:10,014][26022] Updated weights on worker 0-0, policy_version 1155788 (0.00088) [2022-07-11 10:38:11,691][26022] Updated weights on worker 0-0, policy_version 1155798 (0.00090) [2022-07-11 10:38:12,088][25689] Fps is (10 sec: 5477.5, 60 sec: 5579.1, 300 sec: 5535.1). Total num frames: 1183539200. Throughput: 0: 5729.3. Samples: 1183536386. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:12,090][25689] Avg episode reward: [(0, '-1.544')] [2022-07-11 10:38:13,566][26022] Updated weights on worker 0-0, policy_version 1155808 (0.00085) [2022-07-11 10:38:15,363][26022] Updated weights on worker 0-0, policy_version 1155818 (0.00093) [2022-07-11 10:38:17,135][25689] Fps is (10 sec: 5577.3, 60 sec: 5575.3, 300 sec: 5540.2). Total num frames: 1183566848. Throughput: 0: 5733.7. Samples: 1183570222. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:17,135][25689] Avg episode reward: [(0, '-2.586')] [2022-07-11 10:38:17,324][26022] Updated weights on worker 0-0, policy_version 1155828 (0.00082) [2022-07-11 10:38:19,037][26022] Updated weights on worker 0-0, policy_version 1155838 (0.00086) [2022-07-11 10:38:21,052][26022] Updated weights on worker 0-0, policy_version 1155848 (0.00090) [2022-07-11 10:38:22,228][25689] Fps is (10 sec: 5654.9, 60 sec: 5588.1, 300 sec: 5538.7). Total num frames: 1183596544. Throughput: 0: 5714.1. Samples: 1183603778. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:22,229][25689] Avg episode reward: [(0, '-1.552')] [2022-07-11 10:38:22,698][26022] Updated weights on worker 0-0, policy_version 1155858 (0.00097) [2022-07-11 10:38:24,736][26022] Updated weights on worker 0-0, policy_version 1155868 (0.00091) [2022-07-11 10:38:26,466][26022] Updated weights on worker 0-0, policy_version 1155878 (0.00087) [2022-07-11 10:38:27,265][25689] Fps is (10 sec: 5559.5, 60 sec: 5553.9, 300 sec: 5539.9). Total num frames: 1183623168. Throughput: 0: 4997.4. Samples: 1183620542. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:27,266][25689] Avg episode reward: [(0, '-1.503')] [2022-07-11 10:38:28,374][26022] Updated weights on worker 0-0, policy_version 1155888 (0.00086) [2022-07-11 10:38:30,271][26022] Updated weights on worker 0-0, policy_version 1155898 (0.00087) [2022-07-11 10:38:32,050][26022] Updated weights on worker 0-0, policy_version 1155908 (0.00094) [2022-07-11 10:38:32,283][25689] Fps is (10 sec: 5397.8, 60 sec: 5556.6, 300 sec: 5537.8). Total num frames: 1183650816. Throughput: 0: 5807.7. Samples: 1183653672. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:32,283][25689] Avg episode reward: [(0, '-0.775')] [2022-07-11 10:38:33,836][26022] Updated weights on worker 0-0, policy_version 1155918 (0.00088) [2022-07-11 10:38:35,927][26022] Updated weights on worker 0-0, policy_version 1155928 (0.00369) [2022-07-11 10:38:37,292][25689] Fps is (10 sec: 5514.9, 60 sec: 5539.6, 300 sec: 5539.1). Total num frames: 1183678464. Throughput: 0: 5783.4. Samples: 1183686798. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:37,292][25689] Avg episode reward: [(0, '-1.175')] [2022-07-11 10:38:37,578][26022] Updated weights on worker 0-0, policy_version 1155938 (0.00084) [2022-07-11 10:38:39,545][26022] Updated weights on worker 0-0, policy_version 1155948 (0.00090) [2022-07-11 10:38:41,119][26022] Updated weights on worker 0-0, policy_version 1155958 (0.00093) [2022-07-11 10:38:42,357][25689] Fps is (10 sec: 5590.3, 60 sec: 5580.3, 300 sec: 5537.9). Total num frames: 1183707136. Throughput: 0: 4956.9. Samples: 1183703556. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:42,358][25689] Avg episode reward: [(0, '-0.669')] [2022-07-11 10:38:43,233][26022] Updated weights on worker 0-0, policy_version 1155968 (0.00094) [2022-07-11 10:38:45,037][26022] Updated weights on worker 0-0, policy_version 1155978 (0.00087) [2022-07-11 10:38:46,784][26022] Updated weights on worker 0-0, policy_version 1155988 (0.00090) [2022-07-11 10:38:47,426][25689] Fps is (10 sec: 5557.4, 60 sec: 5527.6, 300 sec: 5540.2). Total num frames: 1183734784. Throughput: 0: 5772.8. Samples: 1183736926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:47,427][25689] Avg episode reward: [(0, '-0.648')] [2022-07-11 10:38:48,744][26022] Updated weights on worker 0-0, policy_version 1155998 (0.00085) [2022-07-11 10:38:50,471][26022] Updated weights on worker 0-0, policy_version 1156008 (0.00090) [2022-07-11 10:38:52,323][26022] Updated weights on worker 0-0, policy_version 1156018 (0.00086) [2022-07-11 10:38:52,439][25689] Fps is (10 sec: 5484.6, 60 sec: 5544.1, 300 sec: 5537.1). Total num frames: 1183762432. Throughput: 0: 5777.5. Samples: 1183770126. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:52,440][25689] Avg episode reward: [(0, '-0.486')] [2022-07-11 10:38:54,378][26022] Updated weights on worker 0-0, policy_version 1156028 (0.00083) [2022-07-11 10:38:55,797][26022] Updated weights on worker 0-0, policy_version 1156038 (0.00096) [2022-07-11 10:38:57,485][25689] Fps is (10 sec: 5497.1, 60 sec: 5513.1, 300 sec: 5538.4). Total num frames: 1183790080. Throughput: 0: 4955.0. Samples: 1183786856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:38:57,487][25689] Avg episode reward: [(0, '-0.171')] [2022-07-11 10:38:58,080][26022] Updated weights on worker 0-0, policy_version 1156048 (0.00094) [2022-07-11 10:38:59,582][26022] Updated weights on worker 0-0, policy_version 1156058 (0.00098) [2022-07-11 10:39:01,686][26022] Updated weights on worker 0-0, policy_version 1156068 (0.00088) [2022-07-11 10:39:02,529][25689] Fps is (10 sec: 5379.0, 60 sec: 5497.5, 300 sec: 5542.6). Total num frames: 1183816704. Throughput: 0: 5788.8. Samples: 1183820326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:02,530][25689] Avg episode reward: [(0, '-1.077')] [2022-07-11 10:39:03,633][26022] Updated weights on worker 0-0, policy_version 1156078 (0.00091) [2022-07-11 10:39:05,725][26022] Updated weights on worker 0-0, policy_version 1156088 (0.00084) [2022-07-11 10:39:07,434][26022] Updated weights on worker 0-0, policy_version 1156098 (0.00096) [2022-07-11 10:39:07,628][25689] Fps is (10 sec: 5350.7, 60 sec: 5509.4, 300 sec: 5537.9). Total num frames: 1183844352. Throughput: 0: 5665.2. Samples: 1183851374. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:07,629][25689] Avg episode reward: [(0, '-1.351')] [2022-07-11 10:39:09,538][26022] Updated weights on worker 0-0, policy_version 1156108 (0.00090) [2022-07-11 10:39:11,132][26022] Updated weights on worker 0-0, policy_version 1156118 (0.00091) [2022-07-11 10:39:12,692][25689] Fps is (10 sec: 5440.7, 60 sec: 5491.3, 300 sec: 5534.3). Total num frames: 1183872000. Throughput: 0: 4839.0. Samples: 1183868130. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:12,699][25689] Avg episode reward: [(0, '-1.145')] [2022-07-11 10:39:13,241][26022] Updated weights on worker 0-0, policy_version 1156128 (0.00088) [2022-07-11 10:39:14,844][26022] Updated weights on worker 0-0, policy_version 1156138 (0.00087) [2022-07-11 10:39:16,795][26022] Updated weights on worker 0-0, policy_version 1156148 (0.00087) [2022-07-11 10:39:17,706][25689] Fps is (10 sec: 5690.1, 60 sec: 5528.2, 300 sec: 5542.4). Total num frames: 1183901696. Throughput: 0: 5680.1. Samples: 1183901712. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:17,706][25689] Avg episode reward: [(0, '-1.640')] [2022-07-11 10:39:18,585][26022] Updated weights on worker 0-0, policy_version 1156158 (0.00085) [2022-07-11 10:39:20,557][26022] Updated weights on worker 0-0, policy_version 1156168 (0.00089) [2022-07-11 10:39:22,176][26022] Updated weights on worker 0-0, policy_version 1156178 (0.00094) [2022-07-11 10:39:22,772][25689] Fps is (10 sec: 5689.2, 60 sec: 5496.9, 300 sec: 5545.2). Total num frames: 1183929344. Throughput: 0: 5685.1. Samples: 1183935408. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:22,773][25689] Avg episode reward: [(0, '-0.896')] [2022-07-11 10:39:24,067][26022] Updated weights on worker 0-0, policy_version 1156188 (0.00083) [2022-07-11 10:39:25,635][26022] Updated weights on worker 0-0, policy_version 1156198 (0.00089) [2022-07-11 10:39:27,795][25689] Fps is (10 sec: 5379.5, 60 sec: 5498.1, 300 sec: 5531.3). Total num frames: 1183955968. Throughput: 0: 4998.1. Samples: 1183952170. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:27,795][25689] Avg episode reward: [(0, '-0.832')] [2022-07-11 10:39:27,920][26022] Updated weights on worker 0-0, policy_version 1156208 (0.00092) [2022-07-11 10:39:29,557][26022] Updated weights on worker 0-0, policy_version 1156218 (0.00086) [2022-07-11 10:39:31,483][26022] Updated weights on worker 0-0, policy_version 1156228 (0.00082) [2022-07-11 10:39:32,799][25689] Fps is (10 sec: 5514.7, 60 sec: 5516.3, 300 sec: 5538.3). Total num frames: 1183984640. Throughput: 0: 5832.3. Samples: 1183985396. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:32,800][25689] Avg episode reward: [(0, '-1.101')] [2022-07-11 10:39:33,178][26022] Updated weights on worker 0-0, policy_version 1156238 (0.00095) [2022-07-11 10:39:35,131][26022] Updated weights on worker 0-0, policy_version 1156248 (0.00085) [2022-07-11 10:39:36,793][26022] Updated weights on worker 0-0, policy_version 1156258 (0.00080) [2022-07-11 10:39:37,807][25689] Fps is (10 sec: 5625.3, 60 sec: 5516.4, 300 sec: 5535.9). Total num frames: 1184012288. Throughput: 0: 5821.1. Samples: 1184018720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:37,808][25689] Avg episode reward: [(0, '-0.604')] [2022-07-11 10:39:39,003][26022] Updated weights on worker 0-0, policy_version 1156268 (0.00082) [2022-07-11 10:39:40,499][26022] Updated weights on worker 0-0, policy_version 1156278 (0.00253) [2022-07-11 10:39:42,620][26022] Updated weights on worker 0-0, policy_version 1156288 (0.00094) [2022-07-11 10:39:42,860][25689] Fps is (10 sec: 5597.5, 60 sec: 5517.5, 300 sec: 5539.4). Total num frames: 1184040960. Throughput: 0: 4982.2. Samples: 1184035494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:42,861][25689] Avg episode reward: [(0, '-1.319')] [2022-07-11 10:39:44,330][26022] Updated weights on worker 0-0, policy_version 1156298 (0.00390) [2022-07-11 10:39:46,223][26022] Updated weights on worker 0-0, policy_version 1156308 (0.00086) [2022-07-11 10:39:47,823][26022] Updated weights on worker 0-0, policy_version 1156318 (0.00081) [2022-07-11 10:39:47,920][25689] Fps is (10 sec: 5670.1, 60 sec: 5535.2, 300 sec: 5538.8). Total num frames: 1184069632. Throughput: 0: 5807.7. Samples: 1184069052. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:47,921][25689] Avg episode reward: [(0, '-0.621')] [2022-07-11 10:39:49,806][26022] Updated weights on worker 0-0, policy_version 1156328 (0.00368) [2022-07-11 10:39:51,671][26022] Updated weights on worker 0-0, policy_version 1156338 (0.00088) [2022-07-11 10:39:52,954][25689] Fps is (10 sec: 5478.3, 60 sec: 5516.4, 300 sec: 5535.8). Total num frames: 1184096256. Throughput: 0: 5815.3. Samples: 1184102606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:52,955][25689] Avg episode reward: [(0, '-1.051')] [2022-07-11 10:39:53,434][26022] Updated weights on worker 0-0, policy_version 1156348 (0.00086) [2022-07-11 10:39:55,383][26022] Updated weights on worker 0-0, policy_version 1156358 (0.00090) [2022-07-11 10:39:56,993][26022] Updated weights on worker 0-0, policy_version 1156368 (0.00092) [2022-07-11 10:39:58,023][25689] Fps is (10 sec: 5473.4, 60 sec: 5531.2, 300 sec: 5536.2). Total num frames: 1184124928. Throughput: 0: 4973.0. Samples: 1184119254. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:39:58,023][25689] Avg episode reward: [(0, '-0.305')] [2022-07-11 10:39:58,945][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:39:58,957][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001156377_1184130048.pth [2022-07-11 10:39:58,958][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001154428_1182134272.pth [2022-07-11 10:39:59,143][26022] Updated weights on worker 0-0, policy_version 1156378 (0.00089) [2022-07-11 10:40:00,831][26022] Updated weights on worker 0-0, policy_version 1156388 (0.00090) [2022-07-11 10:40:03,108][25689] Fps is (10 sec: 5344.7, 60 sec: 5510.5, 300 sec: 5534.8). Total num frames: 1184150528. Throughput: 0: 5778.4. Samples: 1184152494. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:03,109][25689] Avg episode reward: [(0, '-0.864')] [2022-07-11 10:40:03,188][26022] Updated weights on worker 0-0, policy_version 1156398 (0.00091) [2022-07-11 10:40:04,760][26022] Updated weights on worker 0-0, policy_version 1156408 (0.00090) [2022-07-11 10:40:06,882][26022] Updated weights on worker 0-0, policy_version 1156418 (0.00090) [2022-07-11 10:40:08,156][25689] Fps is (10 sec: 5457.1, 60 sec: 5549.1, 300 sec: 5541.2). Total num frames: 1184180224. Throughput: 0: 5670.5. Samples: 1184183798. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:08,156][25689] Avg episode reward: [(0, '-0.037')] [2022-07-11 10:40:08,729][26022] Updated weights on worker 0-0, policy_version 1156428 (0.00086) [2022-07-11 10:40:10,531][26022] Updated weights on worker 0-0, policy_version 1156438 (0.00091) [2022-07-11 10:40:12,468][26022] Updated weights on worker 0-0, policy_version 1156448 (0.00091) [2022-07-11 10:40:13,195][25689] Fps is (10 sec: 5583.5, 60 sec: 5534.4, 300 sec: 5533.6). Total num frames: 1184206848. Throughput: 0: 4835.0. Samples: 1184200472. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:13,196][25689] Avg episode reward: [(0, '-1.559')] [2022-07-11 10:40:14,097][26022] Updated weights on worker 0-0, policy_version 1156458 (0.00051) [2022-07-11 10:40:16,022][26022] Updated weights on worker 0-0, policy_version 1156468 (0.00090) [2022-07-11 10:40:17,842][26022] Updated weights on worker 0-0, policy_version 1156478 (0.00093) [2022-07-11 10:40:18,226][25689] Fps is (10 sec: 5389.2, 60 sec: 5499.0, 300 sec: 5528.3). Total num frames: 1184234496. Throughput: 0: 5678.6. Samples: 1184233980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:18,227][25689] Avg episode reward: [(0, '-2.086')] [2022-07-11 10:40:19,817][26022] Updated weights on worker 0-0, policy_version 1156488 (0.00087) [2022-07-11 10:40:21,510][26022] Updated weights on worker 0-0, policy_version 1156498 (0.00089) [2022-07-11 10:40:23,330][25689] Fps is (10 sec: 5557.4, 60 sec: 5512.5, 300 sec: 5537.0). Total num frames: 1184263168. Throughput: 0: 5692.3. Samples: 1184267598. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:23,330][25689] Avg episode reward: [(0, '-1.592')] [2022-07-11 10:40:23,398][26022] Updated weights on worker 0-0, policy_version 1156508 (0.00089) [2022-07-11 10:40:25,214][26022] Updated weights on worker 0-0, policy_version 1156518 (0.00080) [2022-07-11 10:40:27,050][26022] Updated weights on worker 0-0, policy_version 1156528 (0.00087) [2022-07-11 10:40:28,364][25689] Fps is (10 sec: 5757.5, 60 sec: 5562.2, 300 sec: 5540.0). Total num frames: 1184292864. Throughput: 0: 5798.8. Samples: 1184300980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:28,364][25689] Avg episode reward: [(0, '-1.877')] [2022-07-11 10:40:28,867][26022] Updated weights on worker 0-0, policy_version 1156538 (0.00087) [2022-07-11 10:40:30,647][26022] Updated weights on worker 0-0, policy_version 1156548 (0.00089) [2022-07-11 10:40:32,637][26022] Updated weights on worker 0-0, policy_version 1156558 (0.00085) [2022-07-11 10:40:33,411][25689] Fps is (10 sec: 5688.2, 60 sec: 5541.3, 300 sec: 5536.5). Total num frames: 1184320512. Throughput: 0: 5787.4. Samples: 1184317468. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2022-07-11 10:40:33,413][25689] Avg episode reward: [(0, '-1.152')] [2022-07-11 10:40:34,338][26022] Updated weights on worker 0-0, policy_version 1156568 (0.00090) [2022-07-11 10:40:36,187][26022] Updated weights on worker 0-0, policy_version 1156578 (0.00087) [2022-07-11 10:40:38,019][26022] Updated weights on worker 0-0, policy_version 1156588 (0.00087) [2022-07-11 10:40:38,426][25689] Fps is (10 sec: 5393.7, 60 sec: 5523.8, 300 sec: 5534.3). Total num frames: 1184347136. Throughput: 0: 5806.8. Samples: 1184351276. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:40:38,426][25689] Avg episode reward: [(0, '-0.926')] [2022-07-11 10:40:39,935][26022] Updated weights on worker 0-0, policy_version 1156598 (0.00089) [2022-07-11 10:40:42,104][26022] Updated weights on worker 0-0, policy_version 1156608 (0.00818) [2022-07-11 10:40:43,501][25689] Fps is (10 sec: 5581.9, 60 sec: 5538.8, 300 sec: 5537.1). Total num frames: 1184376832. Throughput: 0: 5789.1. Samples: 1184384370. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:40:43,502][25689] Avg episode reward: [(0, '-0.560')] [2022-07-11 10:40:43,505][26022] Updated weights on worker 0-0, policy_version 1156618 (0.00093) [2022-07-11 10:40:45,559][26022] Updated weights on worker 0-0, policy_version 1156628 (0.00088) [2022-07-11 10:40:47,281][26022] Updated weights on worker 0-0, policy_version 1156638 (0.00095) [2022-07-11 10:40:48,555][25689] Fps is (10 sec: 5560.2, 60 sec: 5505.5, 300 sec: 5536.8). Total num frames: 1184403456. Throughput: 0: 4956.8. Samples: 1184401064. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:40:48,556][25689] Avg episode reward: [(0, '-0.495')] [2022-07-11 10:40:49,060][26022] Updated weights on worker 0-0, policy_version 1156648 (0.00089) [2022-07-11 10:40:51,162][26022] Updated weights on worker 0-0, policy_version 1156658 (0.00090) [2022-07-11 10:40:52,838][26022] Updated weights on worker 0-0, policy_version 1156668 (0.00083) [2022-07-11 10:40:53,606][25689] Fps is (10 sec: 5370.6, 60 sec: 5520.8, 300 sec: 5535.9). Total num frames: 1184431104. Throughput: 0: 5770.8. Samples: 1184434010. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:40:53,607][25689] Avg episode reward: [(0, '-0.970')] [2022-07-11 10:40:54,916][26022] Updated weights on worker 0-0, policy_version 1156678 (0.00102) [2022-07-11 10:40:56,368][26022] Updated weights on worker 0-0, policy_version 1156688 (0.00091) [2022-07-11 10:40:58,440][26022] Updated weights on worker 0-0, policy_version 1156698 (0.00088) [2022-07-11 10:40:58,625][25689] Fps is (10 sec: 5592.9, 60 sec: 5525.3, 300 sec: 5536.5). Total num frames: 1184459776. Throughput: 0: 5752.3. Samples: 1184467468. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:40:58,625][25689] Avg episode reward: [(0, '-0.610')] [2022-07-11 10:41:00,538][26022] Updated weights on worker 0-0, policy_version 1156708 (0.00081) [2022-07-11 10:41:02,532][26022] Updated weights on worker 0-0, policy_version 1156718 (0.00120) [2022-07-11 10:41:03,687][25689] Fps is (10 sec: 5383.9, 60 sec: 5527.5, 300 sec: 5535.9). Total num frames: 1184485376. Throughput: 0: 4935.4. Samples: 1184483996. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:03,687][25689] Avg episode reward: [(0, '-0.969')] [2022-07-11 10:41:04,312][26022] Updated weights on worker 0-0, policy_version 1156728 (0.00103) [2022-07-11 10:41:06,132][26022] Updated weights on worker 0-0, policy_version 1156738 (0.00090) [2022-07-11 10:41:08,119][26022] Updated weights on worker 0-0, policy_version 1156748 (0.00951) [2022-07-11 10:41:08,711][25689] Fps is (10 sec: 5380.9, 60 sec: 5512.7, 300 sec: 5539.7). Total num frames: 1184514048. Throughput: 0: 5682.6. Samples: 1184515602. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:08,712][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 10:41:09,857][26022] Updated weights on worker 0-0, policy_version 1156758 (0.00089) [2022-07-11 10:41:11,763][26022] Updated weights on worker 0-0, policy_version 1156768 (0.00098) [2022-07-11 10:41:13,697][26022] Updated weights on worker 0-0, policy_version 1156778 (0.00093) [2022-07-11 10:41:13,757][25689] Fps is (10 sec: 5490.9, 60 sec: 5512.1, 300 sec: 5532.2). Total num frames: 1184540672. Throughput: 0: 5697.4. Samples: 1184548818. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:13,758][25689] Avg episode reward: [(0, '0.256')] [2022-07-11 10:41:15,331][26022] Updated weights on worker 0-0, policy_version 1156788 (0.00091) [2022-07-11 10:41:17,279][26022] Updated weights on worker 0-0, policy_version 1156798 (0.00081) [2022-07-11 10:41:18,763][25689] Fps is (10 sec: 5501.2, 60 sec: 5531.3, 300 sec: 5536.7). Total num frames: 1184569344. Throughput: 0: 4873.1. Samples: 1184565602. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:18,763][25689] Avg episode reward: [(0, '0.818')] [2022-07-11 10:41:18,954][26022] Updated weights on worker 0-0, policy_version 1156808 (0.00079) [2022-07-11 10:41:20,945][26022] Updated weights on worker 0-0, policy_version 1156818 (0.00087) [2022-07-11 10:41:22,835][26022] Updated weights on worker 0-0, policy_version 1156828 (0.00079) [2022-07-11 10:41:23,859][25689] Fps is (10 sec: 5575.2, 60 sec: 5515.1, 300 sec: 5535.0). Total num frames: 1184596992. Throughput: 0: 5708.1. Samples: 1184599142. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:23,859][25689] Avg episode reward: [(0, '0.853')] [2022-07-11 10:41:24,497][26022] Updated weights on worker 0-0, policy_version 1156838 (0.00086) [2022-07-11 10:41:26,499][26022] Updated weights on worker 0-0, policy_version 1156848 (0.00083) [2022-07-11 10:41:28,095][26022] Updated weights on worker 0-0, policy_version 1156858 (0.00085) [2022-07-11 10:41:28,896][25689] Fps is (10 sec: 5456.6, 60 sec: 5480.9, 300 sec: 5531.2). Total num frames: 1184624640. Throughput: 0: 5788.1. Samples: 1184632438. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:28,897][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 10:41:30,274][26022] Updated weights on worker 0-0, policy_version 1156868 (0.00621) [2022-07-11 10:41:32,019][26022] Updated weights on worker 0-0, policy_version 1156878 (0.00080) [2022-07-11 10:41:33,689][26022] Updated weights on worker 0-0, policy_version 1156888 (0.00091) [2022-07-11 10:41:33,924][25689] Fps is (10 sec: 5697.2, 60 sec: 5516.5, 300 sec: 5541.3). Total num frames: 1184654336. Throughput: 0: 4976.2. Samples: 1184649174. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:33,925][25689] Avg episode reward: [(0, '-0.359')] [2022-07-11 10:41:35,661][26022] Updated weights on worker 0-0, policy_version 1156898 (0.00086) [2022-07-11 10:41:37,337][26022] Updated weights on worker 0-0, policy_version 1156908 (0.00082) [2022-07-11 10:41:39,004][25689] Fps is (10 sec: 5673.4, 60 sec: 5527.5, 300 sec: 5537.9). Total num frames: 1184681984. Throughput: 0: 5796.1. Samples: 1184682922. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:39,004][25689] Avg episode reward: [(0, '-0.304')] [2022-07-11 10:41:39,383][26022] Updated weights on worker 0-0, policy_version 1156918 (0.00084) [2022-07-11 10:41:41,173][26022] Updated weights on worker 0-0, policy_version 1156928 (0.00091) [2022-07-11 10:41:42,906][26022] Updated weights on worker 0-0, policy_version 1156938 (0.00092) [2022-07-11 10:41:44,096][25689] Fps is (10 sec: 5436.0, 60 sec: 5492.1, 300 sec: 5529.8). Total num frames: 1184709632. Throughput: 0: 5780.3. Samples: 1184716122. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:44,097][25689] Avg episode reward: [(0, '-2.189')] [2022-07-11 10:41:44,924][26022] Updated weights on worker 0-0, policy_version 1156948 (0.00091) [2022-07-11 10:41:46,601][26022] Updated weights on worker 0-0, policy_version 1156958 (0.00089) [2022-07-11 10:41:48,611][26022] Updated weights on worker 0-0, policy_version 1156968 (0.00085) [2022-07-11 10:41:49,106][25689] Fps is (10 sec: 5473.7, 60 sec: 5513.1, 300 sec: 5533.3). Total num frames: 1184737280. Throughput: 0: 4957.0. Samples: 1184732616. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:49,108][25689] Avg episode reward: [(0, '-2.214')] [2022-07-11 10:41:50,347][26022] Updated weights on worker 0-0, policy_version 1156978 (0.00085) [2022-07-11 10:41:52,299][26022] Updated weights on worker 0-0, policy_version 1156988 (0.00081) [2022-07-11 10:41:53,961][26022] Updated weights on worker 0-0, policy_version 1156998 (0.00090) [2022-07-11 10:41:54,153][25689] Fps is (10 sec: 5600.6, 60 sec: 5530.4, 300 sec: 5529.4). Total num frames: 1184765952. Throughput: 0: 5779.9. Samples: 1184766092. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:54,153][25689] Avg episode reward: [(0, '-1.381')] [2022-07-11 10:41:55,837][26022] Updated weights on worker 0-0, policy_version 1157008 (0.00086) [2022-07-11 10:41:57,648][26022] Updated weights on worker 0-0, policy_version 1157018 (0.00088) [2022-07-11 10:41:59,036][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:41:59,046][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001157025_1184793600.pth [2022-07-11 10:41:59,046][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001155077_1182798848.pth [2022-07-11 10:41:59,163][25689] Fps is (10 sec: 5701.9, 60 sec: 5531.2, 300 sec: 5533.3). Total num frames: 1184794624. Throughput: 0: 5796.0. Samples: 1184799764. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:41:59,165][25689] Avg episode reward: [(0, '-1.366')] [2022-07-11 10:41:59,527][26022] Updated weights on worker 0-0, policy_version 1157028 (0.00090) [2022-07-11 10:42:01,666][26022] Updated weights on worker 0-0, policy_version 1157038 (0.00061) [2022-07-11 10:42:03,507][26022] Updated weights on worker 0-0, policy_version 1157048 (0.00082) [2022-07-11 10:42:04,285][25689] Fps is (10 sec: 5356.1, 60 sec: 5525.7, 300 sec: 5532.8). Total num frames: 1184820224. Throughput: 0: 4912.0. Samples: 1184815290. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:04,286][25689] Avg episode reward: [(0, '-0.231')] [2022-07-11 10:42:05,541][26022] Updated weights on worker 0-0, policy_version 1157058 (0.00094) [2022-07-11 10:42:07,256][26022] Updated weights on worker 0-0, policy_version 1157068 (0.00089) [2022-07-11 10:42:09,163][26022] Updated weights on worker 0-0, policy_version 1157078 (0.00085) [2022-07-11 10:42:09,299][25689] Fps is (10 sec: 5253.5, 60 sec: 5509.8, 300 sec: 5529.6). Total num frames: 1184847872. Throughput: 0: 5697.8. Samples: 1184847670. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:09,299][25689] Avg episode reward: [(0, '0.855')] [2022-07-11 10:42:11,000][26022] Updated weights on worker 0-0, policy_version 1157088 (0.00090) [2022-07-11 10:42:12,829][26022] Updated weights on worker 0-0, policy_version 1157098 (0.00085) [2022-07-11 10:42:14,311][25689] Fps is (10 sec: 5515.5, 60 sec: 5529.8, 300 sec: 5529.4). Total num frames: 1184875520. Throughput: 0: 5700.0. Samples: 1184880994. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:14,311][25689] Avg episode reward: [(0, '1.189')] [2022-07-11 10:42:14,793][26022] Updated weights on worker 0-0, policy_version 1157108 (0.00088) [2022-07-11 10:42:16,523][26022] Updated weights on worker 0-0, policy_version 1157118 (0.00085) [2022-07-11 10:42:18,308][26022] Updated weights on worker 0-0, policy_version 1157128 (0.00082) [2022-07-11 10:42:19,328][25689] Fps is (10 sec: 5513.3, 60 sec: 5511.8, 300 sec: 5526.6). Total num frames: 1184903168. Throughput: 0: 4851.8. Samples: 1184897602. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:19,329][25689] Avg episode reward: [(0, '1.429')] [2022-07-11 10:42:20,235][26022] Updated weights on worker 0-0, policy_version 1157138 (0.00089) [2022-07-11 10:42:22,063][26022] Updated weights on worker 0-0, policy_version 1157148 (0.00088) [2022-07-11 10:42:23,991][26022] Updated weights on worker 0-0, policy_version 1157158 (0.00109) [2022-07-11 10:42:24,458][25689] Fps is (10 sec: 5550.6, 60 sec: 5525.7, 300 sec: 5524.8). Total num frames: 1184931840. Throughput: 0: 5721.7. Samples: 1184930710. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:24,458][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 10:42:25,658][26022] Updated weights on worker 0-0, policy_version 1157168 (0.00094) [2022-07-11 10:42:27,635][26022] Updated weights on worker 0-0, policy_version 1157178 (0.00083) [2022-07-11 10:42:29,400][26022] Updated weights on worker 0-0, policy_version 1157188 (0.00095) [2022-07-11 10:42:29,495][25689] Fps is (10 sec: 5640.2, 60 sec: 5542.6, 300 sec: 5528.4). Total num frames: 1184960512. Throughput: 0: 5764.1. Samples: 1184964086. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:29,496][25689] Avg episode reward: [(0, '2.190')] [2022-07-11 10:42:31,277][26022] Updated weights on worker 0-0, policy_version 1157198 (0.00088) [2022-07-11 10:42:33,263][26022] Updated weights on worker 0-0, policy_version 1157208 (0.00093) [2022-07-11 10:42:34,504][25689] Fps is (10 sec: 5605.8, 60 sec: 5510.5, 300 sec: 5524.9). Total num frames: 1184988160. Throughput: 0: 5759.2. Samples: 1184997292. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:34,505][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 10:42:34,977][26022] Updated weights on worker 0-0, policy_version 1157218 (0.00091) [2022-07-11 10:42:37,041][26022] Updated weights on worker 0-0, policy_version 1157228 (0.00090) [2022-07-11 10:42:38,649][26022] Updated weights on worker 0-0, policy_version 1157238 (0.00088) [2022-07-11 10:42:39,509][25689] Fps is (10 sec: 5419.9, 60 sec: 5500.4, 300 sec: 5527.5). Total num frames: 1185014784. Throughput: 0: 5762.3. Samples: 1185013888. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:39,509][25689] Avg episode reward: [(0, '1.224')] [2022-07-11 10:42:40,642][26022] Updated weights on worker 0-0, policy_version 1157248 (0.00094) [2022-07-11 10:42:42,518][26022] Updated weights on worker 0-0, policy_version 1157258 (0.00085) [2022-07-11 10:42:44,309][26022] Updated weights on worker 0-0, policy_version 1157268 (0.00086) [2022-07-11 10:42:44,561][25689] Fps is (10 sec: 5498.7, 60 sec: 5521.0, 300 sec: 5520.5). Total num frames: 1185043456. Throughput: 0: 5793.6. Samples: 1185047178. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:44,561][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 10:42:46,181][26022] Updated weights on worker 0-0, policy_version 1157278 (0.00086) [2022-07-11 10:42:47,908][26022] Updated weights on worker 0-0, policy_version 1157288 (0.00088) [2022-07-11 10:42:49,625][25689] Fps is (10 sec: 5567.1, 60 sec: 5516.0, 300 sec: 5522.9). Total num frames: 1185071104. Throughput: 0: 5781.8. Samples: 1185080474. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:49,626][25689] Avg episode reward: [(0, '1.016')] [2022-07-11 10:42:49,802][26022] Updated weights on worker 0-0, policy_version 1157298 (0.00088) [2022-07-11 10:42:51,772][26022] Updated weights on worker 0-0, policy_version 1157308 (0.00087) [2022-07-11 10:42:53,494][26022] Updated weights on worker 0-0, policy_version 1157318 (0.00086) [2022-07-11 10:42:54,694][25689] Fps is (10 sec: 5456.5, 60 sec: 5497.0, 300 sec: 5516.2). Total num frames: 1185098752. Throughput: 0: 4951.8. Samples: 1185097270. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:54,695][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 10:42:55,472][26022] Updated weights on worker 0-0, policy_version 1157328 (0.00081) [2022-07-11 10:42:57,264][26022] Updated weights on worker 0-0, policy_version 1157338 (0.00086) [2022-07-11 10:42:59,108][26022] Updated weights on worker 0-0, policy_version 1157348 (0.00087) [2022-07-11 10:42:59,720][25689] Fps is (10 sec: 5680.8, 60 sec: 5512.6, 300 sec: 5523.7). Total num frames: 1185128448. Throughput: 0: 5774.4. Samples: 1185130594. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:42:59,720][25689] Avg episode reward: [(0, '-0.186')] [2022-07-11 10:43:01,031][26022] Updated weights on worker 0-0, policy_version 1157358 (0.00801) [2022-07-11 10:43:03,098][26022] Updated weights on worker 0-0, policy_version 1157368 (0.00086) [2022-07-11 10:43:04,771][25689] Fps is (10 sec: 5487.8, 60 sec: 5519.1, 300 sec: 5520.1). Total num frames: 1185154048. Throughput: 0: 5676.9. Samples: 1185161908. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:04,771][25689] Avg episode reward: [(0, '0.119')] [2022-07-11 10:43:05,062][26022] Updated weights on worker 0-0, policy_version 1157378 (0.00111) [2022-07-11 10:43:06,746][26022] Updated weights on worker 0-0, policy_version 1157388 (0.00089) [2022-07-11 10:43:08,783][26022] Updated weights on worker 0-0, policy_version 1157398 (0.00085) [2022-07-11 10:43:09,794][25689] Fps is (10 sec: 5285.6, 60 sec: 5518.2, 300 sec: 5517.2). Total num frames: 1185181696. Throughput: 0: 4872.4. Samples: 1185178742. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:09,797][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 10:43:10,530][26022] Updated weights on worker 0-0, policy_version 1157408 (0.00089) [2022-07-11 10:43:12,531][26022] Updated weights on worker 0-0, policy_version 1157418 (0.00089) [2022-07-11 10:43:14,286][26022] Updated weights on worker 0-0, policy_version 1157428 (0.00089) [2022-07-11 10:43:14,819][25689] Fps is (10 sec: 5503.2, 60 sec: 5517.0, 300 sec: 5517.6). Total num frames: 1185209344. Throughput: 0: 5688.7. Samples: 1185211750. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:14,821][25689] Avg episode reward: [(0, '0.140')] [2022-07-11 10:43:16,191][26022] Updated weights on worker 0-0, policy_version 1157438 (0.00089) [2022-07-11 10:43:17,930][26022] Updated weights on worker 0-0, policy_version 1157448 (0.00615) [2022-07-11 10:43:19,828][25689] Fps is (10 sec: 5408.4, 60 sec: 5500.8, 300 sec: 5508.8). Total num frames: 1185235968. Throughput: 0: 5691.1. Samples: 1185245036. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:19,829][25689] Avg episode reward: [(0, '0.565')] [2022-07-11 10:43:19,957][26022] Updated weights on worker 0-0, policy_version 1157458 (0.00085) [2022-07-11 10:43:21,738][26022] Updated weights on worker 0-0, policy_version 1157468 (0.00090) [2022-07-11 10:43:23,625][26022] Updated weights on worker 0-0, policy_version 1157478 (0.00084) [2022-07-11 10:43:24,915][25689] Fps is (10 sec: 5578.4, 60 sec: 5521.7, 300 sec: 5518.2). Total num frames: 1185265664. Throughput: 0: 4958.7. Samples: 1185261796. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:24,915][25689] Avg episode reward: [(0, '0.964')] [2022-07-11 10:43:25,261][26022] Updated weights on worker 0-0, policy_version 1157488 (0.00090) [2022-07-11 10:43:27,406][26022] Updated weights on worker 0-0, policy_version 1157498 (0.00089) [2022-07-11 10:43:28,894][26022] Updated weights on worker 0-0, policy_version 1157508 (0.00096) [2022-07-11 10:43:29,918][25689] Fps is (10 sec: 5582.0, 60 sec: 5490.9, 300 sec: 5515.0). Total num frames: 1185292288. Throughput: 0: 5785.0. Samples: 1185295162. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:29,918][25689] Avg episode reward: [(0, '0.896')] [2022-07-11 10:43:31,191][26022] Updated weights on worker 0-0, policy_version 1157518 (0.00096) [2022-07-11 10:43:32,678][26022] Updated weights on worker 0-0, policy_version 1157528 (0.00086) [2022-07-11 10:43:34,760][26022] Updated weights on worker 0-0, policy_version 1157538 (0.00097) [2022-07-11 10:43:34,932][25689] Fps is (10 sec: 5417.8, 60 sec: 5490.4, 300 sec: 5515.0). Total num frames: 1185319936. Throughput: 0: 5794.7. Samples: 1185328302. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:34,933][25689] Avg episode reward: [(0, '1.556')] [2022-07-11 10:43:36,307][26022] Updated weights on worker 0-0, policy_version 1157548 (0.00079) [2022-07-11 10:43:38,389][26022] Updated weights on worker 0-0, policy_version 1157558 (0.00090) [2022-07-11 10:43:39,939][25689] Fps is (10 sec: 5620.3, 60 sec: 5524.1, 300 sec: 5516.0). Total num frames: 1185348608. Throughput: 0: 4975.1. Samples: 1185345088. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:39,939][25689] Avg episode reward: [(0, '1.448')] [2022-07-11 10:43:40,081][26022] Updated weights on worker 0-0, policy_version 1157568 (0.00084) [2022-07-11 10:43:42,155][26022] Updated weights on worker 0-0, policy_version 1157578 (0.00084) [2022-07-11 10:43:43,790][26022] Updated weights on worker 0-0, policy_version 1157588 (0.00091) [2022-07-11 10:43:45,037][25689] Fps is (10 sec: 5472.0, 60 sec: 5486.0, 300 sec: 5512.0). Total num frames: 1185375232. Throughput: 0: 5782.3. Samples: 1185378152. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:45,038][25689] Avg episode reward: [(0, '1.215')] [2022-07-11 10:43:45,826][26022] Updated weights on worker 0-0, policy_version 1157598 (0.00088) [2022-07-11 10:43:47,280][26022] Updated weights on worker 0-0, policy_version 1157608 (0.00089) [2022-07-11 10:43:49,552][26022] Updated weights on worker 0-0, policy_version 1157618 (0.00088) [2022-07-11 10:43:50,046][25689] Fps is (10 sec: 5369.4, 60 sec: 5491.1, 300 sec: 5512.1). Total num frames: 1185402880. Throughput: 0: 5789.1. Samples: 1185411686. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:50,047][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 10:43:50,971][26022] Updated weights on worker 0-0, policy_version 1157628 (0.00099) [2022-07-11 10:43:53,231][26022] Updated weights on worker 0-0, policy_version 1157638 (0.00087) [2022-07-11 10:43:54,884][26022] Updated weights on worker 0-0, policy_version 1157648 (0.00086) [2022-07-11 10:43:55,087][25689] Fps is (10 sec: 5604.3, 60 sec: 5510.6, 300 sec: 5515.7). Total num frames: 1185431552. Throughput: 0: 4945.7. Samples: 1185427980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:43:55,087][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 10:43:56,809][26022] Updated weights on worker 0-0, policy_version 1157658 (0.00080) [2022-07-11 10:43:58,682][26022] Updated weights on worker 0-0, policy_version 1157668 (0.00088) [2022-07-11 10:43:59,149][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:43:59,174][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001157670_1185454080.pth [2022-07-11 10:43:59,174][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001155729_1183466496.pth [2022-07-11 10:44:00,100][25689] Fps is (10 sec: 5602.1, 60 sec: 5477.8, 300 sec: 5519.7). Total num frames: 1185459200. Throughput: 0: 5759.8. Samples: 1185461210. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:00,100][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 10:44:00,491][26022] Updated weights on worker 0-0, policy_version 1157678 (0.00098) [2022-07-11 10:44:02,681][26022] Updated weights on worker 0-0, policy_version 1157688 (0.00089) [2022-07-11 10:44:04,692][26022] Updated weights on worker 0-0, policy_version 1157698 (0.00094) [2022-07-11 10:44:05,163][25689] Fps is (10 sec: 5386.3, 60 sec: 5493.7, 300 sec: 5516.9). Total num frames: 1185485824. Throughput: 0: 5677.7. Samples: 1185492416. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:05,165][25689] Avg episode reward: [(0, '0.037')] [2022-07-11 10:44:06,500][26022] Updated weights on worker 0-0, policy_version 1157708 (0.00092) [2022-07-11 10:44:08,320][26022] Updated weights on worker 0-0, policy_version 1157718 (0.00094) [2022-07-11 10:44:10,129][26022] Updated weights on worker 0-0, policy_version 1157728 (0.00088) [2022-07-11 10:44:10,176][25689] Fps is (10 sec: 5386.2, 60 sec: 5494.6, 300 sec: 5517.9). Total num frames: 1185513472. Throughput: 0: 4836.2. Samples: 1185509036. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:10,177][25689] Avg episode reward: [(0, '-0.912')] [2022-07-11 10:44:12,120][26022] Updated weights on worker 0-0, policy_version 1157738 (0.00094) [2022-07-11 10:44:13,713][26022] Updated weights on worker 0-0, policy_version 1157748 (0.00086) [2022-07-11 10:44:15,181][25689] Fps is (10 sec: 5417.4, 60 sec: 5479.5, 300 sec: 5507.7). Total num frames: 1185540096. Throughput: 0: 5687.1. Samples: 1185542256. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:15,183][25689] Avg episode reward: [(0, '-1.679')] [2022-07-11 10:44:15,879][26022] Updated weights on worker 0-0, policy_version 1157758 (0.00085) [2022-07-11 10:44:17,398][26022] Updated weights on worker 0-0, policy_version 1157768 (0.00082) [2022-07-11 10:44:19,532][26022] Updated weights on worker 0-0, policy_version 1157778 (0.00086) [2022-07-11 10:44:20,187][25689] Fps is (10 sec: 5728.1, 60 sec: 5547.7, 300 sec: 5519.1). Total num frames: 1185570816. Throughput: 0: 5705.3. Samples: 1185575812. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:20,188][25689] Avg episode reward: [(0, '-1.589')] [2022-07-11 10:44:21,315][26022] Updated weights on worker 0-0, policy_version 1157788 (0.00093) [2022-07-11 10:44:23,044][26022] Updated weights on worker 0-0, policy_version 1157798 (0.00091) [2022-07-11 10:44:24,963][26022] Updated weights on worker 0-0, policy_version 1157808 (0.00088) [2022-07-11 10:44:25,266][25689] Fps is (10 sec: 5584.2, 60 sec: 5480.4, 300 sec: 5514.6). Total num frames: 1185596416. Throughput: 0: 4981.3. Samples: 1185592558. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:25,267][25689] Avg episode reward: [(0, '-1.635')] [2022-07-11 10:44:26,834][26022] Updated weights on worker 0-0, policy_version 1157818 (0.00092) [2022-07-11 10:44:28,550][26022] Updated weights on worker 0-0, policy_version 1157828 (0.00094) [2022-07-11 10:44:30,269][25689] Fps is (10 sec: 5281.2, 60 sec: 5497.4, 300 sec: 5511.2). Total num frames: 1185624064. Throughput: 0: 5817.5. Samples: 1185625930. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:30,270][25689] Avg episode reward: [(0, '-2.502')] [2022-07-11 10:44:30,559][26022] Updated weights on worker 0-0, policy_version 1157838 (0.00085) [2022-07-11 10:44:32,160][26022] Updated weights on worker 0-0, policy_version 1157848 (0.00098) [2022-07-11 10:44:34,399][26022] Updated weights on worker 0-0, policy_version 1157858 (0.00082) [2022-07-11 10:44:35,275][25689] Fps is (10 sec: 5729.6, 60 sec: 5532.1, 300 sec: 5518.1). Total num frames: 1185653760. Throughput: 0: 5830.3. Samples: 1185659410. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 10:44:35,275][25689] Avg episode reward: [(0, '-2.028')] [2022-07-11 10:44:35,635][26022] Updated weights on worker 0-0, policy_version 1157868 (0.00094) [2022-07-11 10:44:37,950][26022] Updated weights on worker 0-0, policy_version 1157878 (0.00094) [2022-07-11 10:44:39,504][26022] Updated weights on worker 0-0, policy_version 1157888 (0.00090) [2022-07-11 10:44:40,278][25689] Fps is (10 sec: 5524.7, 60 sec: 5481.5, 300 sec: 5508.8). Total num frames: 1185679360. Throughput: 0: 4987.0. Samples: 1185676008. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:44:40,279][25689] Avg episode reward: [(0, '-1.138')] [2022-07-11 10:44:41,676][26022] Updated weights on worker 0-0, policy_version 1157898 (0.00085) [2022-07-11 10:44:43,357][26022] Updated weights on worker 0-0, policy_version 1157908 (0.00092) [2022-07-11 10:44:45,269][26022] Updated weights on worker 0-0, policy_version 1157918 (0.00085) [2022-07-11 10:44:45,389][25689] Fps is (10 sec: 5365.9, 60 sec: 5514.4, 300 sec: 5507.8). Total num frames: 1185708032. Throughput: 0: 5788.1. Samples: 1185709030. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:44:45,389][25689] Avg episode reward: [(0, '-0.482')] [2022-07-11 10:44:47,000][26022] Updated weights on worker 0-0, policy_version 1157928 (0.00049) [2022-07-11 10:44:48,917][26022] Updated weights on worker 0-0, policy_version 1157938 (0.00092) [2022-07-11 10:44:50,401][25689] Fps is (10 sec: 5563.6, 60 sec: 5514.1, 300 sec: 5511.7). Total num frames: 1185735680. Throughput: 0: 5770.6. Samples: 1185742102. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:44:50,402][25689] Avg episode reward: [(0, '-1.443')] [2022-07-11 10:44:50,734][26022] Updated weights on worker 0-0, policy_version 1157948 (0.00089) [2022-07-11 10:44:52,676][26022] Updated weights on worker 0-0, policy_version 1157958 (0.00093) [2022-07-11 10:44:54,532][26022] Updated weights on worker 0-0, policy_version 1157968 (0.00088) [2022-07-11 10:44:55,417][25689] Fps is (10 sec: 5718.2, 60 sec: 5533.3, 300 sec: 5516.1). Total num frames: 1185765376. Throughput: 0: 4938.3. Samples: 1185758880. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:44:55,418][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 10:44:56,267][26022] Updated weights on worker 0-0, policy_version 1157978 (0.00087) [2022-07-11 10:44:58,156][26022] Updated weights on worker 0-0, policy_version 1157988 (0.00096) [2022-07-11 10:45:00,139][26022] Updated weights on worker 0-0, policy_version 1157998 (0.00084) [2022-07-11 10:45:00,466][25689] Fps is (10 sec: 5493.9, 60 sec: 5496.0, 300 sec: 5516.8). Total num frames: 1185790976. Throughput: 0: 5763.8. Samples: 1185792368. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:00,468][25689] Avg episode reward: [(0, '-0.254')] [2022-07-11 10:45:01,858][26022] Updated weights on worker 0-0, policy_version 1158008 (0.00095) [2022-07-11 10:45:04,339][26022] Updated weights on worker 0-0, policy_version 1158018 (0.00086) [2022-07-11 10:45:05,531][25689] Fps is (10 sec: 5264.7, 60 sec: 5512.8, 300 sec: 5509.5). Total num frames: 1185818624. Throughput: 0: 5677.2. Samples: 1185823384. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:05,532][25689] Avg episode reward: [(0, '-0.056')] [2022-07-11 10:45:05,743][26022] Updated weights on worker 0-0, policy_version 1158028 (0.00085) [2022-07-11 10:45:07,902][26022] Updated weights on worker 0-0, policy_version 1158038 (0.00087) [2022-07-11 10:45:09,747][26022] Updated weights on worker 0-0, policy_version 1158048 (0.00087) [2022-07-11 10:45:10,601][25689] Fps is (10 sec: 5355.1, 60 sec: 5490.7, 300 sec: 5509.0). Total num frames: 1185845248. Throughput: 0: 4847.6. Samples: 1185840022. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:10,602][25689] Avg episode reward: [(0, '-0.270')] [2022-07-11 10:45:11,431][26022] Updated weights on worker 0-0, policy_version 1158058 (0.00092) [2022-07-11 10:45:13,483][26022] Updated weights on worker 0-0, policy_version 1158068 (0.00095) [2022-07-11 10:45:15,229][26022] Updated weights on worker 0-0, policy_version 1158078 (0.00088) [2022-07-11 10:45:15,626][25689] Fps is (10 sec: 5477.5, 60 sec: 5522.7, 300 sec: 5512.5). Total num frames: 1185873920. Throughput: 0: 5663.0. Samples: 1185873324. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:15,627][25689] Avg episode reward: [(0, '1.127')] [2022-07-11 10:45:17,008][26022] Updated weights on worker 0-0, policy_version 1158088 (0.00095) [2022-07-11 10:45:18,955][26022] Updated weights on worker 0-0, policy_version 1158098 (0.00086) [2022-07-11 10:45:20,595][26022] Updated weights on worker 0-0, policy_version 1158108 (0.00085) [2022-07-11 10:45:20,649][25689] Fps is (10 sec: 5707.1, 60 sec: 5487.4, 300 sec: 5514.0). Total num frames: 1185902592. Throughput: 0: 5663.3. Samples: 1185906668. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:20,649][25689] Avg episode reward: [(0, '1.408')] [2022-07-11 10:45:22,856][26022] Updated weights on worker 0-0, policy_version 1158118 (0.00092) [2022-07-11 10:45:24,492][26022] Updated weights on worker 0-0, policy_version 1158128 (0.00090) [2022-07-11 10:45:25,722][25689] Fps is (10 sec: 5477.5, 60 sec: 5504.9, 300 sec: 5503.0). Total num frames: 1185929216. Throughput: 0: 4951.0. Samples: 1185923346. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:25,722][25689] Avg episode reward: [(0, '1.607')] [2022-07-11 10:45:26,352][26022] Updated weights on worker 0-0, policy_version 1158138 (0.00092) [2022-07-11 10:45:28,392][26022] Updated weights on worker 0-0, policy_version 1158148 (0.00086) [2022-07-11 10:45:29,918][26022] Updated weights on worker 0-0, policy_version 1158158 (0.00089) [2022-07-11 10:45:30,742][25689] Fps is (10 sec: 5478.5, 60 sec: 5520.3, 300 sec: 5506.9). Total num frames: 1185957888. Throughput: 0: 5781.9. Samples: 1185956476. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:30,742][25689] Avg episode reward: [(0, '1.196')] [2022-07-11 10:45:31,894][26022] Updated weights on worker 0-0, policy_version 1158168 (0.00088) [2022-07-11 10:45:33,644][26022] Updated weights on worker 0-0, policy_version 1158178 (0.00089) [2022-07-11 10:45:35,403][26022] Updated weights on worker 0-0, policy_version 1158188 (0.00093) [2022-07-11 10:45:35,788][25689] Fps is (10 sec: 5595.0, 60 sec: 5482.7, 300 sec: 5509.8). Total num frames: 1185985536. Throughput: 0: 5785.1. Samples: 1185989960. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:35,790][25689] Avg episode reward: [(0, '1.142')] [2022-07-11 10:45:37,426][26022] Updated weights on worker 0-0, policy_version 1158198 (0.00093) [2022-07-11 10:45:39,144][26022] Updated weights on worker 0-0, policy_version 1158208 (0.00093) [2022-07-11 10:45:40,809][25689] Fps is (10 sec: 5391.1, 60 sec: 5498.0, 300 sec: 5500.4). Total num frames: 1186012160. Throughput: 0: 4947.9. Samples: 1186006418. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:40,810][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 10:45:41,288][26022] Updated weights on worker 0-0, policy_version 1158218 (0.00087) [2022-07-11 10:45:42,901][26022] Updated weights on worker 0-0, policy_version 1158228 (0.00087) [2022-07-11 10:45:45,091][26022] Updated weights on worker 0-0, policy_version 1158238 (0.00098) [2022-07-11 10:45:45,901][25689] Fps is (10 sec: 5467.8, 60 sec: 5499.7, 300 sec: 5506.6). Total num frames: 1186040832. Throughput: 0: 5761.0. Samples: 1186039598. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:45,901][25689] Avg episode reward: [(0, '-0.518')] [2022-07-11 10:45:46,450][26022] Updated weights on worker 0-0, policy_version 1158248 (0.00090) [2022-07-11 10:45:48,663][26022] Updated weights on worker 0-0, policy_version 1158258 (0.00085) [2022-07-11 10:45:50,066][26022] Updated weights on worker 0-0, policy_version 1158268 (0.00085) [2022-07-11 10:45:50,929][25689] Fps is (10 sec: 5564.9, 60 sec: 5498.3, 300 sec: 5507.1). Total num frames: 1186068480. Throughput: 0: 5766.5. Samples: 1186072888. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:50,936][25689] Avg episode reward: [(0, '-0.311')] [2022-07-11 10:45:52,280][26022] Updated weights on worker 0-0, policy_version 1158278 (0.00506) [2022-07-11 10:45:53,859][26022] Updated weights on worker 0-0, policy_version 1158288 (0.00088) [2022-07-11 10:45:55,962][25689] Fps is (10 sec: 5495.9, 60 sec: 5462.9, 300 sec: 5503.4). Total num frames: 1186096128. Throughput: 0: 5782.6. Samples: 1186106620. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:45:55,963][25689] Avg episode reward: [(0, '-0.089')] [2022-07-11 10:45:56,060][26022] Updated weights on worker 0-0, policy_version 1158298 (0.00092) [2022-07-11 10:45:57,568][26022] Updated weights on worker 0-0, policy_version 1158308 (0.00093) [2022-07-11 10:45:59,239][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:45:59,254][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001158315_1186114560.pth [2022-07-11 10:45:59,254][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001156377_1184130048.pth [2022-07-11 10:45:59,653][26022] Updated weights on worker 0-0, policy_version 1158318 (0.00092) [2022-07-11 10:46:00,989][25689] Fps is (10 sec: 5700.2, 60 sec: 5532.6, 300 sec: 5517.8). Total num frames: 1186125824. Throughput: 0: 5796.0. Samples: 1186123384. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:00,990][25689] Avg episode reward: [(0, '-0.212')] [2022-07-11 10:46:01,271][26022] Updated weights on worker 0-0, policy_version 1158328 (0.00085) [2022-07-11 10:46:03,709][26022] Updated weights on worker 0-0, policy_version 1158338 (0.00091) [2022-07-11 10:46:05,451][26022] Updated weights on worker 0-0, policy_version 1158348 (0.00088) [2022-07-11 10:46:06,121][25689] Fps is (10 sec: 5442.9, 60 sec: 5492.7, 300 sec: 5505.4). Total num frames: 1186151424. Throughput: 0: 5691.1. Samples: 1186154676. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:06,122][25689] Avg episode reward: [(0, '-0.478')] [2022-07-11 10:46:07,183][26022] Updated weights on worker 0-0, policy_version 1158358 (0.00087) [2022-07-11 10:46:09,148][26022] Updated weights on worker 0-0, policy_version 1158368 (0.00087) [2022-07-11 10:46:10,990][26022] Updated weights on worker 0-0, policy_version 1158378 (0.00205) [2022-07-11 10:46:11,134][25689] Fps is (10 sec: 5349.7, 60 sec: 5531.6, 300 sec: 5512.9). Total num frames: 1186180096. Throughput: 0: 5712.3. Samples: 1186188304. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:11,135][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 10:46:12,671][26022] Updated weights on worker 0-0, policy_version 1158388 (0.00087) [2022-07-11 10:46:14,735][26022] Updated weights on worker 0-0, policy_version 1158398 (0.00093) [2022-07-11 10:46:16,155][25689] Fps is (10 sec: 5612.9, 60 sec: 5515.1, 300 sec: 5509.2). Total num frames: 1186207744. Throughput: 0: 4880.0. Samples: 1186205164. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:16,156][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 10:46:16,313][26022] Updated weights on worker 0-0, policy_version 1158408 (0.00086) [2022-07-11 10:46:18,507][26022] Updated weights on worker 0-0, policy_version 1158418 (0.00088) [2022-07-11 10:46:20,008][26022] Updated weights on worker 0-0, policy_version 1158428 (0.00083) [2022-07-11 10:46:21,164][25689] Fps is (10 sec: 5410.9, 60 sec: 5482.5, 300 sec: 5507.4). Total num frames: 1186234368. Throughput: 0: 5699.9. Samples: 1186238380. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:21,165][25689] Avg episode reward: [(0, '-0.282')] [2022-07-11 10:46:21,912][26022] Updated weights on worker 0-0, policy_version 1158438 (0.00093) [2022-07-11 10:46:23,918][26022] Updated weights on worker 0-0, policy_version 1158448 (0.00370) [2022-07-11 10:46:25,568][26022] Updated weights on worker 0-0, policy_version 1158458 (0.00095) [2022-07-11 10:46:26,311][25689] Fps is (10 sec: 5545.8, 60 sec: 5526.5, 300 sec: 5512.3). Total num frames: 1186264064. Throughput: 0: 5779.7. Samples: 1186271364. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:26,311][25689] Avg episode reward: [(0, '-0.392')] [2022-07-11 10:46:27,707][26022] Updated weights on worker 0-0, policy_version 1158468 (0.00101) [2022-07-11 10:46:29,354][26022] Updated weights on worker 0-0, policy_version 1158478 (0.00092) [2022-07-11 10:46:31,362][25689] Fps is (10 sec: 5522.5, 60 sec: 5489.9, 300 sec: 5501.5). Total num frames: 1186290688. Throughput: 0: 4922.2. Samples: 1186287872. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:31,363][25689] Avg episode reward: [(0, '-0.358')] [2022-07-11 10:46:31,424][26022] Updated weights on worker 0-0, policy_version 1158488 (0.00088) [2022-07-11 10:46:33,308][26022] Updated weights on worker 0-0, policy_version 1158498 (0.00103) [2022-07-11 10:46:34,866][26022] Updated weights on worker 0-0, policy_version 1158508 (0.00092) [2022-07-11 10:46:36,383][25689] Fps is (10 sec: 5388.5, 60 sec: 5492.2, 300 sec: 5502.6). Total num frames: 1186318336. Throughput: 0: 5737.5. Samples: 1186321218. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:36,386][25689] Avg episode reward: [(0, '-1.622')] [2022-07-11 10:46:36,896][26022] Updated weights on worker 0-0, policy_version 1158518 (0.00096) [2022-07-11 10:46:38,782][26022] Updated weights on worker 0-0, policy_version 1158528 (0.00087) [2022-07-11 10:46:40,566][26022] Updated weights on worker 0-0, policy_version 1158538 (0.00090) [2022-07-11 10:46:41,395][25689] Fps is (10 sec: 5715.9, 60 sec: 5543.7, 300 sec: 5511.0). Total num frames: 1186348032. Throughput: 0: 5742.4. Samples: 1186354552. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:41,395][25689] Avg episode reward: [(0, '-1.008')] [2022-07-11 10:46:42,542][26022] Updated weights on worker 0-0, policy_version 1158548 (0.00846) [2022-07-11 10:46:44,145][26022] Updated weights on worker 0-0, policy_version 1158558 (0.00090) [2022-07-11 10:46:45,942][26022] Updated weights on worker 0-0, policy_version 1158568 (0.00090) [2022-07-11 10:46:46,522][25689] Fps is (10 sec: 5655.6, 60 sec: 5523.6, 300 sec: 5508.8). Total num frames: 1186375680. Throughput: 0: 4948.6. Samples: 1186371380. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:46,522][25689] Avg episode reward: [(0, '-1.019')] [2022-07-11 10:46:48,022][26022] Updated weights on worker 0-0, policy_version 1158578 (0.00093) [2022-07-11 10:46:49,623][26022] Updated weights on worker 0-0, policy_version 1158588 (0.00084) [2022-07-11 10:46:51,527][25689] Fps is (10 sec: 5457.3, 60 sec: 5525.7, 300 sec: 5506.1). Total num frames: 1186403328. Throughput: 0: 5793.3. Samples: 1186404694. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:51,529][25689] Avg episode reward: [(0, '-1.472')] [2022-07-11 10:46:51,636][26022] Updated weights on worker 0-0, policy_version 1158598 (0.00085) [2022-07-11 10:46:53,543][26022] Updated weights on worker 0-0, policy_version 1158608 (0.00248) [2022-07-11 10:46:54,976][26022] Updated weights on worker 0-0, policy_version 1158618 (0.00091) [2022-07-11 10:46:56,567][25689] Fps is (10 sec: 5504.8, 60 sec: 5525.1, 300 sec: 5502.1). Total num frames: 1186430976. Throughput: 0: 5811.2. Samples: 1186438514. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:46:56,568][25689] Avg episode reward: [(0, '-1.643')] [2022-07-11 10:46:57,176][26022] Updated weights on worker 0-0, policy_version 1158628 (0.00091) [2022-07-11 10:46:58,631][26022] Updated weights on worker 0-0, policy_version 1158638 (0.00089) [2022-07-11 10:47:00,705][26022] Updated weights on worker 0-0, policy_version 1158648 (0.00085) [2022-07-11 10:47:01,615][25689] Fps is (10 sec: 5684.8, 60 sec: 5523.2, 300 sec: 5517.3). Total num frames: 1186460672. Throughput: 0: 4991.0. Samples: 1186455470. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:01,615][25689] Avg episode reward: [(0, '-1.981')] [2022-07-11 10:47:02,791][26022] Updated weights on worker 0-0, policy_version 1158658 (0.00091) [2022-07-11 10:47:04,613][26022] Updated weights on worker 0-0, policy_version 1158668 (0.00086) [2022-07-11 10:47:06,717][25689] Fps is (10 sec: 5347.0, 60 sec: 5509.0, 300 sec: 5505.3). Total num frames: 1186485248. Throughput: 0: 5728.8. Samples: 1186487074. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:06,718][25689] Avg episode reward: [(0, '-0.464')] [2022-07-11 10:47:06,791][26022] Updated weights on worker 0-0, policy_version 1158678 (0.00083) [2022-07-11 10:47:08,222][26022] Updated weights on worker 0-0, policy_version 1158688 (0.00084) [2022-07-11 10:47:10,188][26022] Updated weights on worker 0-0, policy_version 1158698 (0.00090) [2022-07-11 10:47:11,737][25689] Fps is (10 sec: 5361.7, 60 sec: 5525.3, 300 sec: 5512.0). Total num frames: 1186514944. Throughput: 0: 5742.1. Samples: 1186520738. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:11,737][25689] Avg episode reward: [(0, '-0.967')] [2022-07-11 10:47:12,004][26022] Updated weights on worker 0-0, policy_version 1158708 (0.00082) [2022-07-11 10:47:13,740][26022] Updated weights on worker 0-0, policy_version 1158718 (0.00089) [2022-07-11 10:47:15,685][26022] Updated weights on worker 0-0, policy_version 1158728 (0.00090) [2022-07-11 10:47:16,746][25689] Fps is (10 sec: 5717.8, 60 sec: 5526.4, 300 sec: 5512.2). Total num frames: 1186542592. Throughput: 0: 4908.7. Samples: 1186537566. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:16,746][25689] Avg episode reward: [(0, '-0.370')] [2022-07-11 10:47:17,600][26022] Updated weights on worker 0-0, policy_version 1158738 (0.00090) [2022-07-11 10:47:19,330][26022] Updated weights on worker 0-0, policy_version 1158748 (0.00087) [2022-07-11 10:47:21,369][26022] Updated weights on worker 0-0, policy_version 1158758 (0.00089) [2022-07-11 10:47:21,775][25689] Fps is (10 sec: 5508.6, 60 sec: 5541.5, 300 sec: 5510.6). Total num frames: 1186570240. Throughput: 0: 5739.1. Samples: 1186571172. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:21,775][25689] Avg episode reward: [(0, '-1.482')] [2022-07-11 10:47:22,822][26022] Updated weights on worker 0-0, policy_version 1158768 (0.00092) [2022-07-11 10:47:25,052][26022] Updated weights on worker 0-0, policy_version 1158778 (0.00087) [2022-07-11 10:47:26,546][26022] Updated weights on worker 0-0, policy_version 1158788 (0.00084) [2022-07-11 10:47:26,858][25689] Fps is (10 sec: 5671.1, 60 sec: 5547.3, 300 sec: 5513.2). Total num frames: 1186599936. Throughput: 0: 5841.3. Samples: 1186604720. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:26,858][25689] Avg episode reward: [(0, '-1.535')] [2022-07-11 10:47:28,679][26022] Updated weights on worker 0-0, policy_version 1158798 (0.00090) [2022-07-11 10:47:30,359][26022] Updated weights on worker 0-0, policy_version 1158808 (0.00091) [2022-07-11 10:47:31,897][25689] Fps is (10 sec: 5665.3, 60 sec: 5565.4, 300 sec: 5512.7). Total num frames: 1186627584. Throughput: 0: 4998.6. Samples: 1186621510. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:31,897][25689] Avg episode reward: [(0, '-1.269')] [2022-07-11 10:47:32,392][26022] Updated weights on worker 0-0, policy_version 1158818 (0.00084) [2022-07-11 10:47:33,910][26022] Updated weights on worker 0-0, policy_version 1158828 (0.00898) [2022-07-11 10:47:35,945][26022] Updated weights on worker 0-0, policy_version 1158838 (0.00094) [2022-07-11 10:47:36,949][25689] Fps is (10 sec: 5479.5, 60 sec: 5562.4, 300 sec: 5515.2). Total num frames: 1186655232. Throughput: 0: 5821.5. Samples: 1186655178. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:36,950][25689] Avg episode reward: [(0, '-1.232')] [2022-07-11 10:47:37,492][26022] Updated weights on worker 0-0, policy_version 1158848 (0.00101) [2022-07-11 10:47:39,583][26022] Updated weights on worker 0-0, policy_version 1158858 (0.00088) [2022-07-11 10:47:41,243][26022] Updated weights on worker 0-0, policy_version 1158868 (0.00066) [2022-07-11 10:47:42,036][25689] Fps is (10 sec: 5554.7, 60 sec: 5538.7, 300 sec: 5514.6). Total num frames: 1186683904. Throughput: 0: 5816.8. Samples: 1186689026. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:42,036][25689] Avg episode reward: [(0, '-1.643')] [2022-07-11 10:47:43,159][26022] Updated weights on worker 0-0, policy_version 1158878 (0.00098) [2022-07-11 10:47:44,990][26022] Updated weights on worker 0-0, policy_version 1158888 (0.00093) [2022-07-11 10:47:46,765][26022] Updated weights on worker 0-0, policy_version 1158898 (0.00092) [2022-07-11 10:47:47,120][25689] Fps is (10 sec: 5638.3, 60 sec: 5559.6, 300 sec: 5517.7). Total num frames: 1186712576. Throughput: 0: 5805.2. Samples: 1186722346. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:47,120][25689] Avg episode reward: [(0, '-0.641')] [2022-07-11 10:47:48,631][26022] Updated weights on worker 0-0, policy_version 1158908 (0.00875) [2022-07-11 10:47:50,484][26022] Updated weights on worker 0-0, policy_version 1158918 (0.00094) [2022-07-11 10:47:52,137][25689] Fps is (10 sec: 5778.3, 60 sec: 5592.2, 300 sec: 5525.5). Total num frames: 1186742272. Throughput: 0: 5810.9. Samples: 1186739126. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:52,139][25689] Avg episode reward: [(0, '-1.039')] [2022-07-11 10:47:52,142][26022] Updated weights on worker 0-0, policy_version 1158928 (0.00088) [2022-07-11 10:47:54,462][26022] Updated weights on worker 0-0, policy_version 1158938 (0.00098) [2022-07-11 10:47:55,678][26022] Updated weights on worker 0-0, policy_version 1158948 (0.00092) [2022-07-11 10:47:57,158][25689] Fps is (10 sec: 5508.5, 60 sec: 5560.2, 300 sec: 5511.8). Total num frames: 1186767872. Throughput: 0: 5819.4. Samples: 1186772782. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:47:57,160][25689] Avg episode reward: [(0, '-2.239')] [2022-07-11 10:47:58,040][26022] Updated weights on worker 0-0, policy_version 1158958 (0.00085) [2022-07-11 10:47:59,429][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:47:59,444][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001158968_1186783232.pth [2022-07-11 10:47:59,446][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001157025_1184793600.pth [2022-07-11 10:47:59,449][26022] Updated weights on worker 0-0, policy_version 1158968 (0.00087) [2022-07-11 10:48:01,586][26022] Updated weights on worker 0-0, policy_version 1158978 (0.00082) [2022-07-11 10:48:02,185][25689] Fps is (10 sec: 5503.2, 60 sec: 5562.1, 300 sec: 5526.0). Total num frames: 1186797568. Throughput: 0: 5797.3. Samples: 1186805836. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:02,193][25689] Avg episode reward: [(0, '-1.452')] [2022-07-11 10:48:03,846][26022] Updated weights on worker 0-0, policy_version 1158988 (0.00085) [2022-07-11 10:48:05,434][26022] Updated weights on worker 0-0, policy_version 1158998 (0.00086) [2022-07-11 10:48:07,261][25689] Fps is (10 sec: 5473.2, 60 sec: 5581.4, 300 sec: 5518.2). Total num frames: 1186823168. Throughput: 0: 4891.6. Samples: 1186820870. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:07,262][25689] Avg episode reward: [(0, '-0.675')] [2022-07-11 10:48:07,461][26022] Updated weights on worker 0-0, policy_version 1159008 (0.00081) [2022-07-11 10:48:09,243][26022] Updated weights on worker 0-0, policy_version 1159018 (0.00087) [2022-07-11 10:48:11,116][26022] Updated weights on worker 0-0, policy_version 1159028 (0.00094) [2022-07-11 10:48:12,317][25689] Fps is (10 sec: 5255.7, 60 sec: 5544.3, 300 sec: 5517.6). Total num frames: 1186850816. Throughput: 0: 5705.3. Samples: 1186854256. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:12,317][25689] Avg episode reward: [(0, '-0.563')] [2022-07-11 10:48:12,970][26022] Updated weights on worker 0-0, policy_version 1159038 (0.00074) [2022-07-11 10:48:14,719][26022] Updated weights on worker 0-0, policy_version 1159048 (0.00087) [2022-07-11 10:48:16,700][26022] Updated weights on worker 0-0, policy_version 1159058 (0.00087) [2022-07-11 10:48:17,329][25689] Fps is (10 sec: 5594.2, 60 sec: 5560.9, 300 sec: 5524.4). Total num frames: 1186879488. Throughput: 0: 5698.1. Samples: 1186887716. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:17,329][25689] Avg episode reward: [(0, '-0.928')] [2022-07-11 10:48:18,449][26022] Updated weights on worker 0-0, policy_version 1159068 (0.00089) [2022-07-11 10:48:20,504][26022] Updated weights on worker 0-0, policy_version 1159078 (0.00095) [2022-07-11 10:48:22,130][26022] Updated weights on worker 0-0, policy_version 1159088 (0.00089) [2022-07-11 10:48:22,368][25689] Fps is (10 sec: 5603.5, 60 sec: 5560.0, 300 sec: 5518.4). Total num frames: 1186907136. Throughput: 0: 4876.5. Samples: 1186904256. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:22,368][25689] Avg episode reward: [(0, '-0.462')] [2022-07-11 10:48:24,021][26022] Updated weights on worker 0-0, policy_version 1159098 (0.00093) [2022-07-11 10:48:26,074][26022] Updated weights on worker 0-0, policy_version 1159108 (0.00084) [2022-07-11 10:48:27,409][25689] Fps is (10 sec: 5383.6, 60 sec: 5513.0, 300 sec: 5517.7). Total num frames: 1186933760. Throughput: 0: 5773.3. Samples: 1186937194. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:27,410][25689] Avg episode reward: [(0, '0.831')] [2022-07-11 10:48:27,762][26022] Updated weights on worker 0-0, policy_version 1159118 (0.00087) [2022-07-11 10:48:29,810][26022] Updated weights on worker 0-0, policy_version 1159128 (0.00095) [2022-07-11 10:48:31,550][26022] Updated weights on worker 0-0, policy_version 1159138 (0.00096) [2022-07-11 10:48:32,413][25689] Fps is (10 sec: 5300.9, 60 sec: 5499.4, 300 sec: 5514.5). Total num frames: 1186960384. Throughput: 0: 5771.1. Samples: 1186970232. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:32,413][25689] Avg episode reward: [(0, '0.473')] [2022-07-11 10:48:33,371][26022] Updated weights on worker 0-0, policy_version 1159148 (0.00083) [2022-07-11 10:48:35,265][26022] Updated weights on worker 0-0, policy_version 1159158 (0.00093) [2022-07-11 10:48:36,838][26022] Updated weights on worker 0-0, policy_version 1159168 (0.00085) [2022-07-11 10:48:37,416][25689] Fps is (10 sec: 5628.5, 60 sec: 5537.7, 300 sec: 5518.0). Total num frames: 1186990080. Throughput: 0: 4942.0. Samples: 1186986986. Policy #0 lag: (min: 0.0, avg: 7.3, max: 19.0) [2022-07-11 10:48:37,416][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 10:48:39,045][26022] Updated weights on worker 0-0, policy_version 1159178 (0.00093) [2022-07-11 10:48:40,643][26022] Updated weights on worker 0-0, policy_version 1159188 (0.00090) [2022-07-11 10:48:42,455][25689] Fps is (10 sec: 5608.3, 60 sec: 5508.2, 300 sec: 5519.1). Total num frames: 1187016704. Throughput: 0: 5777.7. Samples: 1187020314. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:48:42,459][25689] Avg episode reward: [(0, '0.320')] [2022-07-11 10:48:42,676][26022] Updated weights on worker 0-0, policy_version 1159198 (0.00087) [2022-07-11 10:48:44,471][26022] Updated weights on worker 0-0, policy_version 1159208 (0.00092) [2022-07-11 10:48:46,140][26022] Updated weights on worker 0-0, policy_version 1159218 (0.00088) [2022-07-11 10:48:47,600][25689] Fps is (10 sec: 5429.3, 60 sec: 5502.6, 300 sec: 5520.0). Total num frames: 1187045376. Throughput: 0: 5765.2. Samples: 1187053598. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:48:47,601][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 10:48:48,222][26022] Updated weights on worker 0-0, policy_version 1159228 (0.00086) [2022-07-11 10:48:49,883][26022] Updated weights on worker 0-0, policy_version 1159238 (0.00089) [2022-07-11 10:48:52,045][26022] Updated weights on worker 0-0, policy_version 1159248 (0.00085) [2022-07-11 10:48:52,609][25689] Fps is (10 sec: 5647.4, 60 sec: 5486.5, 300 sec: 5520.6). Total num frames: 1187074048. Throughput: 0: 4963.0. Samples: 1187070466. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:48:52,610][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 10:48:53,771][26022] Updated weights on worker 0-0, policy_version 1159258 (0.00073) [2022-07-11 10:48:55,729][26022] Updated weights on worker 0-0, policy_version 1159268 (0.00098) [2022-07-11 10:48:57,375][26022] Updated weights on worker 0-0, policy_version 1159278 (0.00089) [2022-07-11 10:48:57,644][25689] Fps is (10 sec: 5607.5, 60 sec: 5519.0, 300 sec: 5520.2). Total num frames: 1187101696. Throughput: 0: 5757.6. Samples: 1187103452. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:48:57,645][25689] Avg episode reward: [(0, '-0.383')] [2022-07-11 10:48:59,338][26022] Updated weights on worker 0-0, policy_version 1159288 (0.00090) [2022-07-11 10:49:00,935][26022] Updated weights on worker 0-0, policy_version 1159298 (0.00087) [2022-07-11 10:49:02,719][25689] Fps is (10 sec: 5267.0, 60 sec: 5447.0, 300 sec: 5516.5). Total num frames: 1187127296. Throughput: 0: 5679.0. Samples: 1187135392. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:02,719][25689] Avg episode reward: [(0, '-0.011')] [2022-07-11 10:49:03,422][26022] Updated weights on worker 0-0, policy_version 1159308 (0.00089) [2022-07-11 10:49:05,315][26022] Updated weights on worker 0-0, policy_version 1159318 (0.00086) [2022-07-11 10:49:06,928][26022] Updated weights on worker 0-0, policy_version 1159328 (0.00088) [2022-07-11 10:49:07,808][25689] Fps is (10 sec: 5339.9, 60 sec: 5496.6, 300 sec: 5518.5). Total num frames: 1187155968. Throughput: 0: 4833.2. Samples: 1187151264. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:07,808][25689] Avg episode reward: [(0, '0.144')] [2022-07-11 10:49:08,972][26022] Updated weights on worker 0-0, policy_version 1159338 (0.00086) [2022-07-11 10:49:10,785][26022] Updated weights on worker 0-0, policy_version 1159348 (0.00086) [2022-07-11 10:49:12,563][26022] Updated weights on worker 0-0, policy_version 1159358 (0.00084) [2022-07-11 10:49:12,841][25689] Fps is (10 sec: 5664.9, 60 sec: 5515.5, 300 sec: 5524.9). Total num frames: 1187184640. Throughput: 0: 5647.5. Samples: 1187184728. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:12,842][25689] Avg episode reward: [(0, '-0.024')] [2022-07-11 10:49:14,339][26022] Updated weights on worker 0-0, policy_version 1159368 (0.00513) [2022-07-11 10:49:16,317][26022] Updated weights on worker 0-0, policy_version 1159378 (0.00084) [2022-07-11 10:49:17,881][25689] Fps is (10 sec: 5590.7, 60 sec: 5496.0, 300 sec: 5513.9). Total num frames: 1187212288. Throughput: 0: 5687.4. Samples: 1187218550. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:17,882][25689] Avg episode reward: [(0, '-0.818')] [2022-07-11 10:49:17,967][26022] Updated weights on worker 0-0, policy_version 1159388 (0.00094) [2022-07-11 10:49:20,162][26022] Updated weights on worker 0-0, policy_version 1159398 (0.00092) [2022-07-11 10:49:21,523][26022] Updated weights on worker 0-0, policy_version 1159408 (0.00087) [2022-07-11 10:49:22,892][25689] Fps is (10 sec: 5501.8, 60 sec: 5498.6, 300 sec: 5522.1). Total num frames: 1187239936. Throughput: 0: 4930.5. Samples: 1187234856. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:22,892][25689] Avg episode reward: [(0, '-0.538')] [2022-07-11 10:49:23,871][26022] Updated weights on worker 0-0, policy_version 1159418 (0.00095) [2022-07-11 10:49:25,330][26022] Updated weights on worker 0-0, policy_version 1159428 (0.00092) [2022-07-11 10:49:27,388][26022] Updated weights on worker 0-0, policy_version 1159438 (0.00091) [2022-07-11 10:49:28,028][25689] Fps is (10 sec: 5550.7, 60 sec: 5523.9, 300 sec: 5523.1). Total num frames: 1187268608. Throughput: 0: 5785.3. Samples: 1187268244. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:28,029][25689] Avg episode reward: [(0, '0.009')] [2022-07-11 10:49:29,151][26022] Updated weights on worker 0-0, policy_version 1159448 (0.00092) [2022-07-11 10:49:30,965][26022] Updated weights on worker 0-0, policy_version 1159458 (0.00089) [2022-07-11 10:49:33,019][26022] Updated weights on worker 0-0, policy_version 1159468 (0.00087) [2022-07-11 10:49:33,116][25689] Fps is (10 sec: 5408.3, 60 sec: 5516.1, 300 sec: 5511.2). Total num frames: 1187295232. Throughput: 0: 5757.1. Samples: 1187301452. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:33,117][25689] Avg episode reward: [(0, '0.031')] [2022-07-11 10:49:34,628][26022] Updated weights on worker 0-0, policy_version 1159478 (0.00093) [2022-07-11 10:49:36,691][26022] Updated weights on worker 0-0, policy_version 1159488 (0.00086) [2022-07-11 10:49:38,133][25689] Fps is (10 sec: 5573.4, 60 sec: 5514.9, 300 sec: 5524.7). Total num frames: 1187324928. Throughput: 0: 4925.2. Samples: 1187318292. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:38,133][25689] Avg episode reward: [(0, '-0.358')] [2022-07-11 10:49:38,298][26022] Updated weights on worker 0-0, policy_version 1159498 (0.00086) [2022-07-11 10:49:40,197][26022] Updated weights on worker 0-0, policy_version 1159508 (0.00094) [2022-07-11 10:49:42,216][26022] Updated weights on worker 0-0, policy_version 1159518 (0.00090) [2022-07-11 10:49:43,173][25689] Fps is (10 sec: 5701.8, 60 sec: 5531.6, 300 sec: 5522.6). Total num frames: 1187352576. Throughput: 0: 5774.3. Samples: 1187351968. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:43,174][25689] Avg episode reward: [(0, '0.091')] [2022-07-11 10:49:43,803][26022] Updated weights on worker 0-0, policy_version 1159528 (0.00087) [2022-07-11 10:49:45,706][26022] Updated weights on worker 0-0, policy_version 1159538 (0.00087) [2022-07-11 10:49:47,443][26022] Updated weights on worker 0-0, policy_version 1159548 (0.00081) [2022-07-11 10:49:48,223][25689] Fps is (10 sec: 5581.8, 60 sec: 5540.4, 300 sec: 5525.3). Total num frames: 1187381248. Throughput: 0: 5810.5. Samples: 1187385588. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:48,223][25689] Avg episode reward: [(0, '0.894')] [2022-07-11 10:49:49,478][26022] Updated weights on worker 0-0, policy_version 1159558 (0.00099) [2022-07-11 10:49:51,127][26022] Updated weights on worker 0-0, policy_version 1159568 (0.00086) [2022-07-11 10:49:53,206][26022] Updated weights on worker 0-0, policy_version 1159578 (0.00093) [2022-07-11 10:49:53,255][25689] Fps is (10 sec: 5485.1, 60 sec: 5504.5, 300 sec: 5514.7). Total num frames: 1187407872. Throughput: 0: 5000.3. Samples: 1187402150. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:53,255][25689] Avg episode reward: [(0, '-0.773')] [2022-07-11 10:49:54,828][26022] Updated weights on worker 0-0, policy_version 1159588 (0.00090) [2022-07-11 10:49:56,711][26022] Updated weights on worker 0-0, policy_version 1159598 (0.00091) [2022-07-11 10:49:58,267][25689] Fps is (10 sec: 5607.1, 60 sec: 5540.3, 300 sec: 5529.2). Total num frames: 1187437568. Throughput: 0: 5841.5. Samples: 1187435906. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:49:58,268][25689] Avg episode reward: [(0, '-1.083')] [2022-07-11 10:49:58,382][26022] Updated weights on worker 0-0, policy_version 1159608 (0.00084) [2022-07-11 10:49:59,630][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:49:59,649][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001159614_1187444736.pth [2022-07-11 10:49:59,650][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001157670_1185454080.pth [2022-07-11 10:50:00,532][26022] Updated weights on worker 0-0, policy_version 1159618 (0.00095) [2022-07-11 10:50:02,453][26022] Updated weights on worker 0-0, policy_version 1159628 (0.00083) [2022-07-11 10:50:03,271][25689] Fps is (10 sec: 5520.5, 60 sec: 5546.8, 300 sec: 5523.4). Total num frames: 1187463168. Throughput: 0: 5736.7. Samples: 1187467262. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:03,274][25689] Avg episode reward: [(0, '-1.157')] [2022-07-11 10:50:04,665][26022] Updated weights on worker 0-0, policy_version 1159638 (0.00085) [2022-07-11 10:50:06,063][26022] Updated weights on worker 0-0, policy_version 1159648 (0.00095) [2022-07-11 10:50:08,177][26022] Updated weights on worker 0-0, policy_version 1159658 (0.00103) [2022-07-11 10:50:08,314][25689] Fps is (10 sec: 5197.9, 60 sec: 5517.1, 300 sec: 5523.9). Total num frames: 1187489792. Throughput: 0: 4905.8. Samples: 1187484152. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:08,316][25689] Avg episode reward: [(0, '-1.482')] [2022-07-11 10:50:09,857][26022] Updated weights on worker 0-0, policy_version 1159668 (0.00091) [2022-07-11 10:50:11,844][26022] Updated weights on worker 0-0, policy_version 1159678 (0.00099) [2022-07-11 10:50:13,392][25689] Fps is (10 sec: 5463.5, 60 sec: 5513.1, 300 sec: 5522.9). Total num frames: 1187518464. Throughput: 0: 5730.3. Samples: 1187517544. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:13,393][25689] Avg episode reward: [(0, '-1.213')] [2022-07-11 10:50:13,656][26022] Updated weights on worker 0-0, policy_version 1159688 (0.00100) [2022-07-11 10:50:15,577][26022] Updated weights on worker 0-0, policy_version 1159698 (0.00089) [2022-07-11 10:50:17,398][26022] Updated weights on worker 0-0, policy_version 1159708 (0.00089) [2022-07-11 10:50:18,400][25689] Fps is (10 sec: 5584.4, 60 sec: 5516.1, 300 sec: 5519.8). Total num frames: 1187546112. Throughput: 0: 5705.1. Samples: 1187550764. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:18,401][25689] Avg episode reward: [(0, '-0.956')] [2022-07-11 10:50:19,192][26022] Updated weights on worker 0-0, policy_version 1159718 (0.00094) [2022-07-11 10:50:21,155][26022] Updated weights on worker 0-0, policy_version 1159728 (0.00095) [2022-07-11 10:50:22,949][26022] Updated weights on worker 0-0, policy_version 1159738 (0.00086) [2022-07-11 10:50:23,402][25689] Fps is (10 sec: 5421.7, 60 sec: 5499.9, 300 sec: 5521.1). Total num frames: 1187572736. Throughput: 0: 4965.0. Samples: 1187567214. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:23,404][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 10:50:24,898][26022] Updated weights on worker 0-0, policy_version 1159748 (0.00086) [2022-07-11 10:50:26,836][26022] Updated weights on worker 0-0, policy_version 1159758 (0.00085) [2022-07-11 10:50:28,519][25689] Fps is (10 sec: 5464.4, 60 sec: 5501.6, 300 sec: 5519.3). Total num frames: 1187601408. Throughput: 0: 5730.5. Samples: 1187599934. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:28,519][25689] Avg episode reward: [(0, '0.788')] [2022-07-11 10:50:28,612][26022] Updated weights on worker 0-0, policy_version 1159768 (0.00092) [2022-07-11 10:50:30,483][26022] Updated weights on worker 0-0, policy_version 1159778 (0.00093) [2022-07-11 10:50:32,365][26022] Updated weights on worker 0-0, policy_version 1159788 (0.00089) [2022-07-11 10:50:33,538][25689] Fps is (10 sec: 5657.8, 60 sec: 5541.9, 300 sec: 5523.2). Total num frames: 1187630080. Throughput: 0: 5744.5. Samples: 1187633270. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:33,540][25689] Avg episode reward: [(0, '0.611')] [2022-07-11 10:50:34,170][26022] Updated weights on worker 0-0, policy_version 1159798 (0.00096) [2022-07-11 10:50:35,812][26022] Updated weights on worker 0-0, policy_version 1159808 (0.00095) [2022-07-11 10:50:37,847][26022] Updated weights on worker 0-0, policy_version 1159818 (0.00085) [2022-07-11 10:50:38,551][25689] Fps is (10 sec: 5614.1, 60 sec: 5508.3, 300 sec: 5526.8). Total num frames: 1187657728. Throughput: 0: 5768.6. Samples: 1187667008. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:38,553][25689] Avg episode reward: [(0, '-0.044')] [2022-07-11 10:50:39,491][26022] Updated weights on worker 0-0, policy_version 1159828 (0.00082) [2022-07-11 10:50:41,558][26022] Updated weights on worker 0-0, policy_version 1159838 (0.00085) [2022-07-11 10:50:43,044][26022] Updated weights on worker 0-0, policy_version 1159848 (0.00090) [2022-07-11 10:50:43,564][25689] Fps is (10 sec: 5617.5, 60 sec: 5527.8, 300 sec: 5528.3). Total num frames: 1187686400. Throughput: 0: 5788.5. Samples: 1187683916. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:43,566][25689] Avg episode reward: [(0, '-0.119')] [2022-07-11 10:50:45,008][26022] Updated weights on worker 0-0, policy_version 1159858 (0.00091) [2022-07-11 10:50:46,924][26022] Updated weights on worker 0-0, policy_version 1159868 (0.00098) [2022-07-11 10:50:48,615][25689] Fps is (10 sec: 5596.3, 60 sec: 5510.7, 300 sec: 5527.9). Total num frames: 1187714048. Throughput: 0: 5860.2. Samples: 1187717698. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:48,616][25689] Avg episode reward: [(0, '-0.053')] [2022-07-11 10:50:48,768][26022] Updated weights on worker 0-0, policy_version 1159878 (0.00086) [2022-07-11 10:50:50,572][26022] Updated weights on worker 0-0, policy_version 1159888 (0.00091) [2022-07-11 10:50:52,430][26022] Updated weights on worker 0-0, policy_version 1159898 (0.00094) [2022-07-11 10:50:53,630][25689] Fps is (10 sec: 5493.3, 60 sec: 5529.2, 300 sec: 5528.2). Total num frames: 1187741696. Throughput: 0: 5854.4. Samples: 1187750896. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:53,630][25689] Avg episode reward: [(0, '-1.087')] [2022-07-11 10:50:54,302][26022] Updated weights on worker 0-0, policy_version 1159908 (0.00085) [2022-07-11 10:50:56,024][26022] Updated weights on worker 0-0, policy_version 1159918 (0.00094) [2022-07-11 10:50:57,836][26022] Updated weights on worker 0-0, policy_version 1159928 (0.00091) [2022-07-11 10:50:58,639][25689] Fps is (10 sec: 5618.5, 60 sec: 5512.5, 300 sec: 5525.1). Total num frames: 1187770368. Throughput: 0: 5015.6. Samples: 1187767760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:50:58,639][25689] Avg episode reward: [(0, '-2.224')] [2022-07-11 10:50:59,817][26022] Updated weights on worker 0-0, policy_version 1159938 (0.00088) [2022-07-11 10:51:01,425][26022] Updated weights on worker 0-0, policy_version 1159948 (0.00084) [2022-07-11 10:51:03,645][25689] Fps is (10 sec: 5316.8, 60 sec: 5495.4, 300 sec: 5524.0). Total num frames: 1187794944. Throughput: 0: 5735.7. Samples: 1187799096. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:03,645][25689] Avg episode reward: [(0, '-1.477')] [2022-07-11 10:51:03,773][26022] Updated weights on worker 0-0, policy_version 1159958 (0.00085) [2022-07-11 10:51:05,473][26022] Updated weights on worker 0-0, policy_version 1159968 (0.00091) [2022-07-11 10:51:07,529][26022] Updated weights on worker 0-0, policy_version 1159978 (0.00092) [2022-07-11 10:51:08,763][25689] Fps is (10 sec: 5360.5, 60 sec: 5539.4, 300 sec: 5525.5). Total num frames: 1187824640. Throughput: 0: 5707.8. Samples: 1187832702. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:08,764][25689] Avg episode reward: [(0, '-2.275')] [2022-07-11 10:51:09,323][26022] Updated weights on worker 0-0, policy_version 1159988 (0.00086) [2022-07-11 10:51:11,182][26022] Updated weights on worker 0-0, policy_version 1159998 (0.00092) [2022-07-11 10:51:13,072][26022] Updated weights on worker 0-0, policy_version 1160008 (0.00092) [2022-07-11 10:51:13,778][25689] Fps is (10 sec: 5558.1, 60 sec: 5511.3, 300 sec: 5522.2). Total num frames: 1187851264. Throughput: 0: 4890.4. Samples: 1187849428. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:13,778][25689] Avg episode reward: [(0, '-1.822')] [2022-07-11 10:51:14,855][26022] Updated weights on worker 0-0, policy_version 1160018 (0.00087) [2022-07-11 10:51:16,665][26022] Updated weights on worker 0-0, policy_version 1160028 (0.00082) [2022-07-11 10:51:18,302][26022] Updated weights on worker 0-0, policy_version 1160038 (0.00086) [2022-07-11 10:51:18,800][25689] Fps is (10 sec: 5713.1, 60 sec: 5560.7, 300 sec: 5535.7). Total num frames: 1187881984. Throughput: 0: 5730.6. Samples: 1187883300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:18,801][25689] Avg episode reward: [(0, '-1.750')] [2022-07-11 10:51:20,181][26022] Updated weights on worker 0-0, policy_version 1160048 (0.00091) [2022-07-11 10:51:22,031][26022] Updated weights on worker 0-0, policy_version 1160058 (0.00083) [2022-07-11 10:51:23,833][25689] Fps is (10 sec: 5702.6, 60 sec: 5557.9, 300 sec: 5527.5). Total num frames: 1187908608. Throughput: 0: 5815.9. Samples: 1187916512. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:23,834][25689] Avg episode reward: [(0, '-2.249')] [2022-07-11 10:51:23,947][26022] Updated weights on worker 0-0, policy_version 1160068 (0.00085) [2022-07-11 10:51:25,808][26022] Updated weights on worker 0-0, policy_version 1160078 (0.00092) [2022-07-11 10:51:27,506][26022] Updated weights on worker 0-0, policy_version 1160088 (0.00082) [2022-07-11 10:51:28,937][25689] Fps is (10 sec: 5354.3, 60 sec: 5542.2, 300 sec: 5529.9). Total num frames: 1187936256. Throughput: 0: 4988.9. Samples: 1187933346. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:28,937][25689] Avg episode reward: [(0, '-0.969')] [2022-07-11 10:51:29,507][26022] Updated weights on worker 0-0, policy_version 1160098 (0.00087) [2022-07-11 10:51:31,500][26022] Updated weights on worker 0-0, policy_version 1160108 (0.00091) [2022-07-11 10:51:33,042][26022] Updated weights on worker 0-0, policy_version 1160118 (0.00082) [2022-07-11 10:51:33,958][25689] Fps is (10 sec: 5562.9, 60 sec: 5542.0, 300 sec: 5533.4). Total num frames: 1187964928. Throughput: 0: 5816.7. Samples: 1187966810. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:33,958][25689] Avg episode reward: [(0, '-0.095')] [2022-07-11 10:51:35,176][26022] Updated weights on worker 0-0, policy_version 1160128 (0.00084) [2022-07-11 10:51:36,687][26022] Updated weights on worker 0-0, policy_version 1160138 (0.00097) [2022-07-11 10:51:38,758][26022] Updated weights on worker 0-0, policy_version 1160148 (0.00091) [2022-07-11 10:51:39,024][25689] Fps is (10 sec: 5583.1, 60 sec: 5537.1, 300 sec: 5525.5). Total num frames: 1187992576. Throughput: 0: 5784.6. Samples: 1188000286. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:39,026][25689] Avg episode reward: [(0, '-0.722')] [2022-07-11 10:51:40,525][26022] Updated weights on worker 0-0, policy_version 1160158 (0.00082) [2022-07-11 10:51:42,223][26022] Updated weights on worker 0-0, policy_version 1160168 (0.00084) [2022-07-11 10:51:44,043][25689] Fps is (10 sec: 5584.6, 60 sec: 5536.6, 300 sec: 5530.9). Total num frames: 1188021248. Throughput: 0: 4972.3. Samples: 1188016998. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:44,044][25689] Avg episode reward: [(0, '-0.743')] [2022-07-11 10:51:44,208][26022] Updated weights on worker 0-0, policy_version 1160178 (0.00093) [2022-07-11 10:51:46,132][26022] Updated weights on worker 0-0, policy_version 1160188 (0.00088) [2022-07-11 10:51:47,947][26022] Updated weights on worker 0-0, policy_version 1160198 (0.00091) [2022-07-11 10:51:49,092][25689] Fps is (10 sec: 5594.3, 60 sec: 5536.8, 300 sec: 5530.1). Total num frames: 1188048896. Throughput: 0: 5799.1. Samples: 1188050228. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:49,093][25689] Avg episode reward: [(0, '0.037')] [2022-07-11 10:51:49,709][26022] Updated weights on worker 0-0, policy_version 1160208 (0.00086) [2022-07-11 10:51:51,939][26022] Updated weights on worker 0-0, policy_version 1160218 (0.00088) [2022-07-11 10:51:53,495][26022] Updated weights on worker 0-0, policy_version 1160228 (0.00083) [2022-07-11 10:51:54,103][25689] Fps is (10 sec: 5597.9, 60 sec: 5554.0, 300 sec: 5534.1). Total num frames: 1188077568. Throughput: 0: 5776.7. Samples: 1188083186. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:54,104][25689] Avg episode reward: [(0, '0.062')] [2022-07-11 10:51:55,559][26022] Updated weights on worker 0-0, policy_version 1160238 (0.00085) [2022-07-11 10:51:57,080][26022] Updated weights on worker 0-0, policy_version 1160248 (0.00087) [2022-07-11 10:51:59,120][25689] Fps is (10 sec: 5411.6, 60 sec: 5502.5, 300 sec: 5520.9). Total num frames: 1188103168. Throughput: 0: 4959.1. Samples: 1188099946. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:51:59,121][25689] Avg episode reward: [(0, '0.581')] [2022-07-11 10:51:59,162][26022] Updated weights on worker 0-0, policy_version 1160258 (0.00096) [2022-07-11 10:51:59,772][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:51:59,788][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001160262_1188108288.pth [2022-07-11 10:51:59,791][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001158315_1186114560.pth [2022-07-11 10:52:00,857][26022] Updated weights on worker 0-0, policy_version 1160268 (0.00086) [2022-07-11 10:52:03,046][26022] Updated weights on worker 0-0, policy_version 1160278 (0.00086) [2022-07-11 10:52:04,213][25689] Fps is (10 sec: 5266.9, 60 sec: 5545.3, 300 sec: 5531.4). Total num frames: 1188130816. Throughput: 0: 5672.5. Samples: 1188131416. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:04,214][25689] Avg episode reward: [(0, '0.494')] [2022-07-11 10:52:05,093][26022] Updated weights on worker 0-0, policy_version 1160288 (0.00090) [2022-07-11 10:52:06,602][26022] Updated weights on worker 0-0, policy_version 1160298 (0.00090) [2022-07-11 10:52:08,648][26022] Updated weights on worker 0-0, policy_version 1160308 (0.00082) [2022-07-11 10:52:09,355][25689] Fps is (10 sec: 5502.7, 60 sec: 5526.2, 300 sec: 5525.7). Total num frames: 1188159488. Throughput: 0: 5668.6. Samples: 1188165096. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:09,356][25689] Avg episode reward: [(0, '1.354')] [2022-07-11 10:52:10,121][26022] Updated weights on worker 0-0, policy_version 1160318 (0.00087) [2022-07-11 10:52:12,231][26022] Updated weights on worker 0-0, policy_version 1160328 (0.00090) [2022-07-11 10:52:13,952][26022] Updated weights on worker 0-0, policy_version 1160338 (0.00100) [2022-07-11 10:52:14,407][25689] Fps is (10 sec: 5525.2, 60 sec: 5539.8, 300 sec: 5524.9). Total num frames: 1188187136. Throughput: 0: 4861.6. Samples: 1188181892. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:14,407][25689] Avg episode reward: [(0, '1.522')] [2022-07-11 10:52:16,033][26022] Updated weights on worker 0-0, policy_version 1160348 (0.00094) [2022-07-11 10:52:17,610][26022] Updated weights on worker 0-0, policy_version 1160358 (0.00089) [2022-07-11 10:52:19,474][25689] Fps is (10 sec: 5566.1, 60 sec: 5502.0, 300 sec: 5527.6). Total num frames: 1188215808. Throughput: 0: 5664.7. Samples: 1188215242. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:19,474][25689] Avg episode reward: [(0, '-0.129')] [2022-07-11 10:52:19,695][26022] Updated weights on worker 0-0, policy_version 1160368 (0.00082) [2022-07-11 10:52:21,314][26022] Updated weights on worker 0-0, policy_version 1160378 (0.00087) [2022-07-11 10:52:23,345][26022] Updated weights on worker 0-0, policy_version 1160388 (0.00101) [2022-07-11 10:52:24,507][25689] Fps is (10 sec: 5576.2, 60 sec: 5518.9, 300 sec: 5521.7). Total num frames: 1188243456. Throughput: 0: 5775.3. Samples: 1188248616. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:24,507][25689] Avg episode reward: [(0, '0.366')] [2022-07-11 10:52:24,991][26022] Updated weights on worker 0-0, policy_version 1160398 (0.00090) [2022-07-11 10:52:27,101][26022] Updated weights on worker 0-0, policy_version 1160408 (0.00080) [2022-07-11 10:52:28,799][26022] Updated weights on worker 0-0, policy_version 1160418 (0.00099) [2022-07-11 10:52:29,625][25689] Fps is (10 sec: 5548.0, 60 sec: 5534.4, 300 sec: 5523.6). Total num frames: 1188272128. Throughput: 0: 4939.7. Samples: 1188265222. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:29,626][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 10:52:30,781][26022] Updated weights on worker 0-0, policy_version 1160428 (0.00089) [2022-07-11 10:52:32,428][26022] Updated weights on worker 0-0, policy_version 1160438 (0.00096) [2022-07-11 10:52:34,433][26022] Updated weights on worker 0-0, policy_version 1160448 (0.00620) [2022-07-11 10:52:34,649][25689] Fps is (10 sec: 5452.2, 60 sec: 5500.4, 300 sec: 5520.7). Total num frames: 1188298752. Throughput: 0: 5760.6. Samples: 1188298498. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 10:52:34,649][25689] Avg episode reward: [(0, '0.639')] [2022-07-11 10:52:36,010][26022] Updated weights on worker 0-0, policy_version 1160458 (0.00090) [2022-07-11 10:52:37,960][26022] Updated weights on worker 0-0, policy_version 1160468 (0.00085) [2022-07-11 10:52:39,655][25689] Fps is (10 sec: 5615.4, 60 sec: 5539.7, 300 sec: 5525.7). Total num frames: 1188328448. Throughput: 0: 5792.2. Samples: 1188332134. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:52:39,655][25689] Avg episode reward: [(0, '0.340')] [2022-07-11 10:52:39,935][26022] Updated weights on worker 0-0, policy_version 1160478 (0.00091) [2022-07-11 10:52:41,697][26022] Updated weights on worker 0-0, policy_version 1160488 (0.00090) [2022-07-11 10:52:43,720][26022] Updated weights on worker 0-0, policy_version 1160498 (0.00088) [2022-07-11 10:52:44,667][25689] Fps is (10 sec: 5724.1, 60 sec: 5523.4, 300 sec: 5523.6). Total num frames: 1188356096. Throughput: 0: 4974.3. Samples: 1188348898. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:52:44,667][25689] Avg episode reward: [(0, '0.197')] [2022-07-11 10:52:45,246][26022] Updated weights on worker 0-0, policy_version 1160508 (0.00091) [2022-07-11 10:52:47,336][26022] Updated weights on worker 0-0, policy_version 1160518 (0.00108) [2022-07-11 10:52:49,086][26022] Updated weights on worker 0-0, policy_version 1160528 (0.00089) [2022-07-11 10:52:49,795][25689] Fps is (10 sec: 5452.8, 60 sec: 5516.1, 300 sec: 5514.6). Total num frames: 1188383744. Throughput: 0: 5793.6. Samples: 1188382080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:52:49,796][25689] Avg episode reward: [(0, '0.373')] [2022-07-11 10:52:50,907][26022] Updated weights on worker 0-0, policy_version 1160538 (0.00099) [2022-07-11 10:52:52,788][26022] Updated weights on worker 0-0, policy_version 1160548 (0.00093) [2022-07-11 10:52:54,602][26022] Updated weights on worker 0-0, policy_version 1160558 (0.00092) [2022-07-11 10:52:54,824][25689] Fps is (10 sec: 5544.6, 60 sec: 5514.6, 300 sec: 5524.8). Total num frames: 1188412416. Throughput: 0: 5807.1. Samples: 1188415660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:52:54,825][25689] Avg episode reward: [(0, '0.050')] [2022-07-11 10:52:56,403][26022] Updated weights on worker 0-0, policy_version 1160568 (0.00083) [2022-07-11 10:52:58,325][26022] Updated weights on worker 0-0, policy_version 1160578 (0.00088) [2022-07-11 10:52:59,841][25689] Fps is (10 sec: 5708.4, 60 sec: 5565.2, 300 sec: 5521.5). Total num frames: 1188441088. Throughput: 0: 5785.1. Samples: 1188448914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:52:59,842][25689] Avg episode reward: [(0, '-0.836')] [2022-07-11 10:52:59,913][26022] Updated weights on worker 0-0, policy_version 1160588 (0.00085) [2022-07-11 10:53:02,370][26022] Updated weights on worker 0-0, policy_version 1160598 (0.00091) [2022-07-11 10:53:04,224][26022] Updated weights on worker 0-0, policy_version 1160608 (0.00088) [2022-07-11 10:53:04,866][25689] Fps is (10 sec: 5200.6, 60 sec: 5503.9, 300 sec: 5515.6). Total num frames: 1188464640. Throughput: 0: 5678.1. Samples: 1188463592. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:04,867][25689] Avg episode reward: [(0, '-0.754')] [2022-07-11 10:53:05,981][26022] Updated weights on worker 0-0, policy_version 1160618 (0.00635) [2022-07-11 10:53:08,204][26022] Updated weights on worker 0-0, policy_version 1160628 (0.00096) [2022-07-11 10:53:09,465][26022] Updated weights on worker 0-0, policy_version 1160638 (0.00084) [2022-07-11 10:53:09,939][25689] Fps is (10 sec: 5475.8, 60 sec: 5560.8, 300 sec: 5529.0). Total num frames: 1188496384. Throughput: 0: 5716.7. Samples: 1188497236. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:09,940][25689] Avg episode reward: [(0, '-0.485')] [2022-07-11 10:53:11,879][26022] Updated weights on worker 0-0, policy_version 1160648 (0.00085) [2022-07-11 10:53:13,299][26022] Updated weights on worker 0-0, policy_version 1160658 (0.00083) [2022-07-11 10:53:14,966][25689] Fps is (10 sec: 5576.2, 60 sec: 5512.3, 300 sec: 5515.0). Total num frames: 1188520960. Throughput: 0: 5709.9. Samples: 1188530668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:14,967][25689] Avg episode reward: [(0, '-0.658')] [2022-07-11 10:53:15,306][26022] Updated weights on worker 0-0, policy_version 1160668 (0.00085) [2022-07-11 10:53:17,047][26022] Updated weights on worker 0-0, policy_version 1160678 (0.00080) [2022-07-11 10:53:18,744][26022] Updated weights on worker 0-0, policy_version 1160688 (0.00090) [2022-07-11 10:53:19,981][25689] Fps is (10 sec: 5404.7, 60 sec: 5534.0, 300 sec: 5522.3). Total num frames: 1188550656. Throughput: 0: 4901.1. Samples: 1188547622. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:19,981][25689] Avg episode reward: [(0, '-0.306')] [2022-07-11 10:53:20,737][26022] Updated weights on worker 0-0, policy_version 1160698 (0.00088) [2022-07-11 10:53:22,459][26022] Updated weights on worker 0-0, policy_version 1160708 (0.00092) [2022-07-11 10:53:24,475][26022] Updated weights on worker 0-0, policy_version 1160718 (0.00088) [2022-07-11 10:53:24,991][25689] Fps is (10 sec: 5720.5, 60 sec: 5536.2, 300 sec: 5526.4). Total num frames: 1188578304. Throughput: 0: 5859.8. Samples: 1188581518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:24,991][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 10:53:26,256][26022] Updated weights on worker 0-0, policy_version 1160728 (0.00086) [2022-07-11 10:53:28,116][26022] Updated weights on worker 0-0, policy_version 1160738 (0.00088) [2022-07-11 10:53:29,763][26022] Updated weights on worker 0-0, policy_version 1160748 (0.00089) [2022-07-11 10:53:30,108][25689] Fps is (10 sec: 5561.4, 60 sec: 5536.3, 300 sec: 5531.1). Total num frames: 1188606976. Throughput: 0: 5831.5. Samples: 1188614850. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:30,108][25689] Avg episode reward: [(0, '-0.229')] [2022-07-11 10:53:31,780][26022] Updated weights on worker 0-0, policy_version 1160758 (0.00094) [2022-07-11 10:53:33,715][26022] Updated weights on worker 0-0, policy_version 1160768 (0.00092) [2022-07-11 10:53:35,126][25689] Fps is (10 sec: 5556.6, 60 sec: 5553.7, 300 sec: 5524.0). Total num frames: 1188634624. Throughput: 0: 5002.9. Samples: 1188631526. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:35,127][25689] Avg episode reward: [(0, '-1.144')] [2022-07-11 10:53:35,428][26022] Updated weights on worker 0-0, policy_version 1160778 (0.00084) [2022-07-11 10:53:37,291][26022] Updated weights on worker 0-0, policy_version 1160788 (0.00084) [2022-07-11 10:53:39,024][26022] Updated weights on worker 0-0, policy_version 1160798 (0.00087) [2022-07-11 10:53:40,148][25689] Fps is (10 sec: 5609.5, 60 sec: 5535.3, 300 sec: 5531.2). Total num frames: 1188663296. Throughput: 0: 5830.9. Samples: 1188665214. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:40,149][25689] Avg episode reward: [(0, '-1.199')] [2022-07-11 10:53:40,891][26022] Updated weights on worker 0-0, policy_version 1160808 (0.00087) [2022-07-11 10:53:42,647][26022] Updated weights on worker 0-0, policy_version 1160818 (0.00084) [2022-07-11 10:53:44,363][26022] Updated weights on worker 0-0, policy_version 1160828 (0.00090) [2022-07-11 10:53:45,243][25689] Fps is (10 sec: 5566.9, 60 sec: 5527.7, 300 sec: 5528.7). Total num frames: 1188690944. Throughput: 0: 5805.9. Samples: 1188699104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:45,244][25689] Avg episode reward: [(0, '-0.192')] [2022-07-11 10:53:46,410][26022] Updated weights on worker 0-0, policy_version 1160838 (0.00096) [2022-07-11 10:53:48,180][26022] Updated weights on worker 0-0, policy_version 1160848 (0.00088) [2022-07-11 10:53:50,092][26022] Updated weights on worker 0-0, policy_version 1160858 (0.00086) [2022-07-11 10:53:50,316][25689] Fps is (10 sec: 5639.6, 60 sec: 5566.6, 300 sec: 5530.9). Total num frames: 1188720640. Throughput: 0: 4993.0. Samples: 1188715750. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:50,317][25689] Avg episode reward: [(0, '-0.548')] [2022-07-11 10:53:51,825][26022] Updated weights on worker 0-0, policy_version 1160868 (0.00089) [2022-07-11 10:53:53,726][26022] Updated weights on worker 0-0, policy_version 1160878 (0.00092) [2022-07-11 10:53:55,365][25689] Fps is (10 sec: 5564.1, 60 sec: 5530.9, 300 sec: 5527.2). Total num frames: 1188747264. Throughput: 0: 5800.9. Samples: 1188748930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:53:55,366][25689] Avg episode reward: [(0, '-0.505')] [2022-07-11 10:53:55,752][26022] Updated weights on worker 0-0, policy_version 1160888 (0.00088) [2022-07-11 10:53:57,499][26022] Updated weights on worker 0-0, policy_version 1160898 (0.00055) [2022-07-11 10:53:59,393][26022] Updated weights on worker 0-0, policy_version 1160908 (0.00090) [2022-07-11 10:53:59,877][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:53:59,892][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001160911_1188772864.pth [2022-07-11 10:53:59,893][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001158968_1186783232.pth [2022-07-11 10:53:59,893][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_001160911_1188772864.pth.milestone [2022-07-11 10:54:00,371][25689] Fps is (10 sec: 5397.7, 60 sec: 5515.0, 300 sec: 5535.4). Total num frames: 1188774912. Throughput: 0: 5772.6. Samples: 1188781952. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:00,372][25689] Avg episode reward: [(0, '-0.655')] [2022-07-11 10:54:01,213][26022] Updated weights on worker 0-0, policy_version 1160918 (0.00092) [2022-07-11 10:54:03,364][26022] Updated weights on worker 0-0, policy_version 1160928 (0.00090) [2022-07-11 10:54:05,359][26022] Updated weights on worker 0-0, policy_version 1160938 (0.00080) [2022-07-11 10:54:05,391][25689] Fps is (10 sec: 5311.2, 60 sec: 5549.3, 300 sec: 5526.3). Total num frames: 1188800512. Throughput: 0: 4833.6. Samples: 1188796490. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:05,391][25689] Avg episode reward: [(0, '0.170')] [2022-07-11 10:54:07,122][26022] Updated weights on worker 0-0, policy_version 1160948 (0.00084) [2022-07-11 10:54:08,998][26022] Updated weights on worker 0-0, policy_version 1160958 (0.00087) [2022-07-11 10:54:10,515][25689] Fps is (10 sec: 5350.3, 60 sec: 5494.0, 300 sec: 5524.7). Total num frames: 1188829184. Throughput: 0: 5642.8. Samples: 1188829726. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:10,515][25689] Avg episode reward: [(0, '0.671')] [2022-07-11 10:54:10,850][26022] Updated weights on worker 0-0, policy_version 1160968 (0.00090) [2022-07-11 10:54:12,754][26022] Updated weights on worker 0-0, policy_version 1160978 (0.00086) [2022-07-11 10:54:14,575][26022] Updated weights on worker 0-0, policy_version 1160988 (0.00094) [2022-07-11 10:54:15,593][25689] Fps is (10 sec: 5520.3, 60 sec: 5540.0, 300 sec: 5523.9). Total num frames: 1188856832. Throughput: 0: 5633.6. Samples: 1188862886. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:15,594][25689] Avg episode reward: [(0, '0.422')] [2022-07-11 10:54:16,411][26022] Updated weights on worker 0-0, policy_version 1160998 (0.00088) [2022-07-11 10:54:18,076][26022] Updated weights on worker 0-0, policy_version 1161008 (0.00089) [2022-07-11 10:54:20,234][26022] Updated weights on worker 0-0, policy_version 1161018 (0.00098) [2022-07-11 10:54:20,616][25689] Fps is (10 sec: 5575.5, 60 sec: 5522.3, 300 sec: 5527.2). Total num frames: 1188885504. Throughput: 0: 4826.1. Samples: 1188879656. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:20,617][25689] Avg episode reward: [(0, '0.681')] [2022-07-11 10:54:21,848][26022] Updated weights on worker 0-0, policy_version 1161028 (0.00087) [2022-07-11 10:54:23,778][26022] Updated weights on worker 0-0, policy_version 1161038 (0.00094) [2022-07-11 10:54:25,542][26022] Updated weights on worker 0-0, policy_version 1161048 (0.00080) [2022-07-11 10:54:25,630][25689] Fps is (10 sec: 5611.1, 60 sec: 5521.9, 300 sec: 5526.0). Total num frames: 1188913152. Throughput: 0: 5766.0. Samples: 1188913192. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:25,632][25689] Avg episode reward: [(0, '1.208')] [2022-07-11 10:54:27,527][26022] Updated weights on worker 0-0, policy_version 1161058 (0.00090) [2022-07-11 10:54:29,327][26022] Updated weights on worker 0-0, policy_version 1161068 (0.00289) [2022-07-11 10:54:30,752][25689] Fps is (10 sec: 5455.5, 60 sec: 5504.7, 300 sec: 5528.8). Total num frames: 1188940800. Throughput: 0: 5756.1. Samples: 1188946212. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:30,752][25689] Avg episode reward: [(0, '1.470')] [2022-07-11 10:54:31,220][26022] Updated weights on worker 0-0, policy_version 1161078 (0.00093) [2022-07-11 10:54:32,964][26022] Updated weights on worker 0-0, policy_version 1161088 (0.00088) [2022-07-11 10:54:34,816][26022] Updated weights on worker 0-0, policy_version 1161098 (0.00095) [2022-07-11 10:54:35,798][25689] Fps is (10 sec: 5539.0, 60 sec: 5519.0, 300 sec: 5524.8). Total num frames: 1188969472. Throughput: 0: 4951.8. Samples: 1188962938. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:35,799][25689] Avg episode reward: [(0, '1.488')] [2022-07-11 10:54:36,743][26022] Updated weights on worker 0-0, policy_version 1161108 (0.00085) [2022-07-11 10:54:38,686][26022] Updated weights on worker 0-0, policy_version 1161118 (0.00114) [2022-07-11 10:54:40,471][26022] Updated weights on worker 0-0, policy_version 1161128 (0.00086) [2022-07-11 10:54:40,805][25689] Fps is (10 sec: 5602.0, 60 sec: 5503.4, 300 sec: 5525.4). Total num frames: 1188997120. Throughput: 0: 5775.3. Samples: 1188996256. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:40,806][25689] Avg episode reward: [(0, '1.638')] [2022-07-11 10:54:42,339][26022] Updated weights on worker 0-0, policy_version 1161138 (0.00083) [2022-07-11 10:54:44,022][26022] Updated weights on worker 0-0, policy_version 1161148 (0.00109) [2022-07-11 10:54:45,854][25689] Fps is (10 sec: 5397.5, 60 sec: 5490.8, 300 sec: 5518.6). Total num frames: 1189023744. Throughput: 0: 5771.2. Samples: 1189029904. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:45,854][25689] Avg episode reward: [(0, '1.655')] [2022-07-11 10:54:46,098][26022] Updated weights on worker 0-0, policy_version 1161158 (0.00090) [2022-07-11 10:54:47,583][26022] Updated weights on worker 0-0, policy_version 1161168 (0.00106) [2022-07-11 10:54:49,688][26022] Updated weights on worker 0-0, policy_version 1161178 (0.00084) [2022-07-11 10:54:50,933][25689] Fps is (10 sec: 5560.9, 60 sec: 5490.2, 300 sec: 5528.0). Total num frames: 1189053440. Throughput: 0: 4961.2. Samples: 1189046336. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:50,934][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 10:54:51,321][26022] Updated weights on worker 0-0, policy_version 1161188 (0.00085) [2022-07-11 10:54:53,306][26022] Updated weights on worker 0-0, policy_version 1161198 (0.00088) [2022-07-11 10:54:54,972][26022] Updated weights on worker 0-0, policy_version 1161208 (0.00087) [2022-07-11 10:54:56,004][25689] Fps is (10 sec: 5750.2, 60 sec: 5522.0, 300 sec: 5523.5). Total num frames: 1189082112. Throughput: 0: 5795.8. Samples: 1189080046. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:54:56,005][25689] Avg episode reward: [(0, '1.008')] [2022-07-11 10:54:57,051][26022] Updated weights on worker 0-0, policy_version 1161218 (0.00089) [2022-07-11 10:54:58,631][26022] Updated weights on worker 0-0, policy_version 1161228 (0.00088) [2022-07-11 10:55:00,652][26022] Updated weights on worker 0-0, policy_version 1161238 (0.00086) [2022-07-11 10:55:01,066][25689] Fps is (10 sec: 5558.4, 60 sec: 5516.9, 300 sec: 5529.3). Total num frames: 1189109760. Throughput: 0: 5791.5. Samples: 1189113594. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:01,066][25689] Avg episode reward: [(0, '1.227')] [2022-07-11 10:55:02,711][26022] Updated weights on worker 0-0, policy_version 1161248 (0.00064) [2022-07-11 10:55:04,675][26022] Updated weights on worker 0-0, policy_version 1161258 (0.00089) [2022-07-11 10:55:06,092][25689] Fps is (10 sec: 5379.8, 60 sec: 5533.2, 300 sec: 5529.6). Total num frames: 1189136384. Throughput: 0: 5675.3. Samples: 1189144766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:06,094][25689] Avg episode reward: [(0, '0.323')] [2022-07-11 10:55:06,504][26022] Updated weights on worker 0-0, policy_version 1161268 (0.00094) [2022-07-11 10:55:08,330][26022] Updated weights on worker 0-0, policy_version 1161278 (0.00100) [2022-07-11 10:55:10,240][26022] Updated weights on worker 0-0, policy_version 1161288 (0.00094) [2022-07-11 10:55:11,143][25689] Fps is (10 sec: 5385.5, 60 sec: 5523.0, 300 sec: 5526.7). Total num frames: 1189164032. Throughput: 0: 5695.3. Samples: 1189161436. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:11,145][25689] Avg episode reward: [(0, '0.058')] [2022-07-11 10:55:12,153][26022] Updated weights on worker 0-0, policy_version 1161298 (0.00088) [2022-07-11 10:55:13,890][26022] Updated weights on worker 0-0, policy_version 1161308 (0.00092) [2022-07-11 10:55:15,855][26022] Updated weights on worker 0-0, policy_version 1161318 (0.00097) [2022-07-11 10:55:16,202][25689] Fps is (10 sec: 5571.0, 60 sec: 5541.7, 300 sec: 5529.1). Total num frames: 1189192704. Throughput: 0: 5685.5. Samples: 1189194880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:16,203][25689] Avg episode reward: [(0, '-0.605')] [2022-07-11 10:55:17,495][26022] Updated weights on worker 0-0, policy_version 1161328 (0.00086) [2022-07-11 10:55:19,461][26022] Updated weights on worker 0-0, policy_version 1161338 (0.00091) [2022-07-11 10:55:21,211][25689] Fps is (10 sec: 5492.2, 60 sec: 5509.1, 300 sec: 5529.0). Total num frames: 1189219328. Throughput: 0: 5673.5. Samples: 1189227890. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:21,213][25689] Avg episode reward: [(0, '-0.749')] [2022-07-11 10:55:21,389][26022] Updated weights on worker 0-0, policy_version 1161348 (0.00087) [2022-07-11 10:55:23,257][26022] Updated weights on worker 0-0, policy_version 1161358 (0.00093) [2022-07-11 10:55:25,073][26022] Updated weights on worker 0-0, policy_version 1161368 (0.00093) [2022-07-11 10:55:26,230][25689] Fps is (10 sec: 5411.8, 60 sec: 5508.7, 300 sec: 5527.4). Total num frames: 1189246976. Throughput: 0: 4965.3. Samples: 1189244758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:26,231][25689] Avg episode reward: [(0, '-0.460')] [2022-07-11 10:55:26,868][26022] Updated weights on worker 0-0, policy_version 1161378 (0.00089) [2022-07-11 10:55:28,710][26022] Updated weights on worker 0-0, policy_version 1161388 (0.00093) [2022-07-11 10:55:30,675][26022] Updated weights on worker 0-0, policy_version 1161398 (0.00090) [2022-07-11 10:55:31,284][25689] Fps is (10 sec: 5489.8, 60 sec: 5514.8, 300 sec: 5523.3). Total num frames: 1189274624. Throughput: 0: 5772.9. Samples: 1189277706. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:31,285][25689] Avg episode reward: [(0, '-0.543')] [2022-07-11 10:55:32,280][26022] Updated weights on worker 0-0, policy_version 1161408 (0.00088) [2022-07-11 10:55:34,391][26022] Updated weights on worker 0-0, policy_version 1161418 (0.00089) [2022-07-11 10:55:36,119][26022] Updated weights on worker 0-0, policy_version 1161428 (0.00092) [2022-07-11 10:55:36,307][25689] Fps is (10 sec: 5589.3, 60 sec: 5517.0, 300 sec: 5526.6). Total num frames: 1189303296. Throughput: 0: 5777.2. Samples: 1189311030. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:36,307][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 10:55:37,977][26022] Updated weights on worker 0-0, policy_version 1161438 (0.00084) [2022-07-11 10:55:39,819][26022] Updated weights on worker 0-0, policy_version 1161448 (0.00522) [2022-07-11 10:55:41,310][25689] Fps is (10 sec: 5514.9, 60 sec: 5500.4, 300 sec: 5519.9). Total num frames: 1189329920. Throughput: 0: 4973.0. Samples: 1189327842. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:41,312][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 10:55:41,603][26022] Updated weights on worker 0-0, policy_version 1161458 (0.00081) [2022-07-11 10:55:43,454][26022] Updated weights on worker 0-0, policy_version 1161468 (0.00086) [2022-07-11 10:55:45,327][26022] Updated weights on worker 0-0, policy_version 1161478 (0.00090) [2022-07-11 10:55:46,328][25689] Fps is (10 sec: 5415.6, 60 sec: 5520.1, 300 sec: 5520.5). Total num frames: 1189357568. Throughput: 0: 5797.2. Samples: 1189361268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:46,329][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 10:55:47,008][26022] Updated weights on worker 0-0, policy_version 1161488 (0.00092) [2022-07-11 10:55:49,000][26022] Updated weights on worker 0-0, policy_version 1161498 (0.00085) [2022-07-11 10:55:51,018][26022] Updated weights on worker 0-0, policy_version 1161508 (0.00086) [2022-07-11 10:55:51,373][25689] Fps is (10 sec: 5596.7, 60 sec: 5506.3, 300 sec: 5523.4). Total num frames: 1189386240. Throughput: 0: 5812.9. Samples: 1189394486. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:51,374][25689] Avg episode reward: [(0, '0.092')] [2022-07-11 10:55:52,841][26022] Updated weights on worker 0-0, policy_version 1161519 (0.00082) [2022-07-11 10:55:54,968][26022] Updated weights on worker 0-0, policy_version 1161529 (0.00082) [2022-07-11 10:55:56,403][25689] Fps is (10 sec: 5590.3, 60 sec: 5493.1, 300 sec: 5519.5). Total num frames: 1189413888. Throughput: 0: 4975.5. Samples: 1189411018. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:55:56,403][25689] Avg episode reward: [(0, '0.175')] [2022-07-11 10:55:56,584][26022] Updated weights on worker 0-0, policy_version 1161539 (0.00096) [2022-07-11 10:55:58,596][26022] Updated weights on worker 0-0, policy_version 1161549 (0.00086) [2022-07-11 10:56:00,019][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:56:00,028][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001161558_1189435392.pth [2022-07-11 10:56:00,031][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001159614_1187444736.pth [2022-07-11 10:56:00,170][26022] Updated weights on worker 0-0, policy_version 1161559 (0.00092) [2022-07-11 10:56:01,421][25689] Fps is (10 sec: 5503.3, 60 sec: 5497.0, 300 sec: 5529.6). Total num frames: 1189441536. Throughput: 0: 5793.7. Samples: 1189444358. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:01,422][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 10:56:02,540][26022] Updated weights on worker 0-0, policy_version 1161569 (0.00081) [2022-07-11 10:56:04,314][26022] Updated weights on worker 0-0, policy_version 1161579 (0.00093) [2022-07-11 10:56:06,180][26022] Updated weights on worker 0-0, policy_version 1161589 (0.00091) [2022-07-11 10:56:06,430][25689] Fps is (10 sec: 5412.5, 60 sec: 5498.7, 300 sec: 5521.3). Total num frames: 1189468160. Throughput: 0: 5718.5. Samples: 1189476220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:06,430][25689] Avg episode reward: [(0, '0.148')] [2022-07-11 10:56:08,007][26022] Updated weights on worker 0-0, policy_version 1161599 (0.00086) [2022-07-11 10:56:09,822][26022] Updated weights on worker 0-0, policy_version 1161609 (0.00086) [2022-07-11 10:56:11,536][25689] Fps is (10 sec: 5365.6, 60 sec: 5493.6, 300 sec: 5523.1). Total num frames: 1189495808. Throughput: 0: 4869.1. Samples: 1189492660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:11,536][25689] Avg episode reward: [(0, '0.186')] [2022-07-11 10:56:11,683][26022] Updated weights on worker 0-0, policy_version 1161619 (0.00082) [2022-07-11 10:56:13,432][26022] Updated weights on worker 0-0, policy_version 1161629 (0.00077) [2022-07-11 10:56:15,451][26022] Updated weights on worker 0-0, policy_version 1161639 (0.00084) [2022-07-11 10:56:16,560][25689] Fps is (10 sec: 5559.4, 60 sec: 5496.7, 300 sec: 5516.2). Total num frames: 1189524480. Throughput: 0: 5715.8. Samples: 1189526236. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:16,561][25689] Avg episode reward: [(0, '-0.393')] [2022-07-11 10:56:17,083][26022] Updated weights on worker 0-0, policy_version 1161649 (0.00095) [2022-07-11 10:56:18,970][26022] Updated weights on worker 0-0, policy_version 1161659 (0.00093) [2022-07-11 10:56:20,763][26022] Updated weights on worker 0-0, policy_version 1161669 (0.00090) [2022-07-11 10:56:21,609][25689] Fps is (10 sec: 5591.0, 60 sec: 5510.1, 300 sec: 5519.3). Total num frames: 1189552128. Throughput: 0: 5722.0. Samples: 1189559874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:21,610][25689] Avg episode reward: [(0, '0.650')] [2022-07-11 10:56:22,516][26022] Updated weights on worker 0-0, policy_version 1161679 (0.00096) [2022-07-11 10:56:24,696][26022] Updated weights on worker 0-0, policy_version 1161689 (0.00091) [2022-07-11 10:56:26,164][26022] Updated weights on worker 0-0, policy_version 1161699 (0.00087) [2022-07-11 10:56:26,634][25689] Fps is (10 sec: 5590.9, 60 sec: 5526.5, 300 sec: 5524.2). Total num frames: 1189580800. Throughput: 0: 4966.8. Samples: 1189576570. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:26,645][25689] Avg episode reward: [(0, '0.617')] [2022-07-11 10:56:28,440][26022] Updated weights on worker 0-0, policy_version 1161709 (0.00091) [2022-07-11 10:56:30,071][26022] Updated weights on worker 0-0, policy_version 1161719 (0.00080) [2022-07-11 10:56:31,779][25689] Fps is (10 sec: 5538.2, 60 sec: 5518.2, 300 sec: 5518.5). Total num frames: 1189608448. Throughput: 0: 5776.3. Samples: 1189609588. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:31,779][25689] Avg episode reward: [(0, '-0.311')] [2022-07-11 10:56:32,010][26022] Updated weights on worker 0-0, policy_version 1161729 (0.00091) [2022-07-11 10:56:33,840][26022] Updated weights on worker 0-0, policy_version 1161739 (0.00088) [2022-07-11 10:56:35,440][26022] Updated weights on worker 0-0, policy_version 1161749 (0.00094) [2022-07-11 10:56:36,799][25689] Fps is (10 sec: 5540.6, 60 sec: 5518.4, 300 sec: 5522.8). Total num frames: 1189637120. Throughput: 0: 5775.0. Samples: 1189643114. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2022-07-11 10:56:36,801][25689] Avg episode reward: [(0, '-0.511')] [2022-07-11 10:56:37,487][26022] Updated weights on worker 0-0, policy_version 1161759 (0.00089) [2022-07-11 10:56:39,551][26022] Updated weights on worker 0-0, policy_version 1161769 (0.00090) [2022-07-11 10:56:41,238][26022] Updated weights on worker 0-0, policy_version 1161779 (0.00085) [2022-07-11 10:56:41,836][25689] Fps is (10 sec: 5701.7, 60 sec: 5549.2, 300 sec: 5522.4). Total num frames: 1189665792. Throughput: 0: 4931.7. Samples: 1189659626. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:56:41,837][25689] Avg episode reward: [(0, '-0.299')] [2022-07-11 10:56:43,078][26022] Updated weights on worker 0-0, policy_version 1161789 (0.00086) [2022-07-11 10:56:44,835][26022] Updated weights on worker 0-0, policy_version 1161799 (0.00080) [2022-07-11 10:56:46,648][26022] Updated weights on worker 0-0, policy_version 1161809 (0.00090) [2022-07-11 10:56:46,849][25689] Fps is (10 sec: 5604.1, 60 sec: 5549.7, 300 sec: 5523.1). Total num frames: 1189693440. Throughput: 0: 5762.2. Samples: 1189693052. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:56:46,850][25689] Avg episode reward: [(0, '0.484')] [2022-07-11 10:56:48,543][26022] Updated weights on worker 0-0, policy_version 1161819 (0.00088) [2022-07-11 10:56:50,503][26022] Updated weights on worker 0-0, policy_version 1161829 (0.00086) [2022-07-11 10:56:51,897][25689] Fps is (10 sec: 5394.3, 60 sec: 5515.6, 300 sec: 5515.6). Total num frames: 1189720064. Throughput: 0: 5790.1. Samples: 1189726076. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:56:51,898][25689] Avg episode reward: [(0, '-0.047')] [2022-07-11 10:56:52,369][26022] Updated weights on worker 0-0, policy_version 1161839 (0.00089) [2022-07-11 10:56:54,067][26022] Updated weights on worker 0-0, policy_version 1161849 (0.00086) [2022-07-11 10:56:55,928][26022] Updated weights on worker 0-0, policy_version 1161859 (0.00095) [2022-07-11 10:56:56,910][25689] Fps is (10 sec: 5394.4, 60 sec: 5517.1, 300 sec: 5522.5). Total num frames: 1189747712. Throughput: 0: 4963.2. Samples: 1189742926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:56:56,915][25689] Avg episode reward: [(0, '-1.895')] [2022-07-11 10:56:57,985][26022] Updated weights on worker 0-0, policy_version 1161869 (0.00093) [2022-07-11 10:56:59,673][26022] Updated weights on worker 0-0, policy_version 1161879 (0.00091) [2022-07-11 10:57:01,880][26022] Updated weights on worker 0-0, policy_version 1161889 (0.00107) [2022-07-11 10:57:01,978][25689] Fps is (10 sec: 5383.9, 60 sec: 5495.7, 300 sec: 5519.5). Total num frames: 1189774336. Throughput: 0: 5785.6. Samples: 1189776156. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:01,978][25689] Avg episode reward: [(0, '-0.752')] [2022-07-11 10:57:03,819][26022] Updated weights on worker 0-0, policy_version 1161899 (0.00092) [2022-07-11 10:57:05,584][26022] Updated weights on worker 0-0, policy_version 1161909 (0.00090) [2022-07-11 10:57:07,079][25689] Fps is (10 sec: 5336.9, 60 sec: 5504.2, 300 sec: 5516.9). Total num frames: 1189801984. Throughput: 0: 5669.9. Samples: 1189807754. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:07,079][25689] Avg episode reward: [(0, '-0.715')] [2022-07-11 10:57:07,305][26022] Updated weights on worker 0-0, policy_version 1161919 (0.00081) [2022-07-11 10:57:09,351][26022] Updated weights on worker 0-0, policy_version 1161929 (0.00099) [2022-07-11 10:57:10,959][26022] Updated weights on worker 0-0, policy_version 1161939 (0.00085) [2022-07-11 10:57:12,169][25689] Fps is (10 sec: 5626.4, 60 sec: 5539.4, 300 sec: 5523.0). Total num frames: 1189831680. Throughput: 0: 4855.7. Samples: 1189824512. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:12,170][25689] Avg episode reward: [(0, '-0.999')] [2022-07-11 10:57:12,999][26022] Updated weights on worker 0-0, policy_version 1161949 (0.00086) [2022-07-11 10:57:14,563][26022] Updated weights on worker 0-0, policy_version 1161959 (0.00087) [2022-07-11 10:57:16,708][26022] Updated weights on worker 0-0, policy_version 1161969 (0.00087) [2022-07-11 10:57:17,178][25689] Fps is (10 sec: 5678.3, 60 sec: 5524.0, 300 sec: 5520.7). Total num frames: 1189859328. Throughput: 0: 5674.3. Samples: 1189857932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:17,178][25689] Avg episode reward: [(0, '-1.132')] [2022-07-11 10:57:18,506][26022] Updated weights on worker 0-0, policy_version 1161979 (0.00086) [2022-07-11 10:57:20,299][26022] Updated weights on worker 0-0, policy_version 1161989 (0.00095) [2022-07-11 10:57:22,203][25689] Fps is (10 sec: 5409.2, 60 sec: 5509.2, 300 sec: 5517.4). Total num frames: 1189885952. Throughput: 0: 5701.1. Samples: 1189891462. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:22,203][25689] Avg episode reward: [(0, '0.138')] [2022-07-11 10:57:22,314][26022] Updated weights on worker 0-0, policy_version 1161999 (0.00100) [2022-07-11 10:57:23,907][26022] Updated weights on worker 0-0, policy_version 1162009 (0.00084) [2022-07-11 10:57:25,673][26022] Updated weights on worker 0-0, policy_version 1162019 (0.00085) [2022-07-11 10:57:27,233][25689] Fps is (10 sec: 5499.4, 60 sec: 5508.8, 300 sec: 5519.0). Total num frames: 1189914624. Throughput: 0: 5813.1. Samples: 1189924910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:27,233][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 10:57:27,742][26022] Updated weights on worker 0-0, policy_version 1162029 (0.00093) [2022-07-11 10:57:29,451][26022] Updated weights on worker 0-0, policy_version 1162039 (0.00089) [2022-07-11 10:57:31,392][26022] Updated weights on worker 0-0, policy_version 1162049 (0.00093) [2022-07-11 10:57:32,291][25689] Fps is (10 sec: 5785.5, 60 sec: 5550.4, 300 sec: 5528.7). Total num frames: 1189944320. Throughput: 0: 5800.8. Samples: 1189941236. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:32,292][25689] Avg episode reward: [(0, '1.039')] [2022-07-11 10:57:33,155][26022] Updated weights on worker 0-0, policy_version 1162059 (0.00092) [2022-07-11 10:57:34,964][26022] Updated weights on worker 0-0, policy_version 1162069 (0.00079) [2022-07-11 10:57:36,813][26022] Updated weights on worker 0-0, policy_version 1162079 (0.00083) [2022-07-11 10:57:37,310][25689] Fps is (10 sec: 5588.5, 60 sec: 5516.7, 300 sec: 5518.1). Total num frames: 1189970944. Throughput: 0: 5823.6. Samples: 1189975178. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:37,311][25689] Avg episode reward: [(0, '-0.339')] [2022-07-11 10:57:38,551][26022] Updated weights on worker 0-0, policy_version 1162089 (0.00088) [2022-07-11 10:57:40,437][26022] Updated weights on worker 0-0, policy_version 1162099 (0.00053) [2022-07-11 10:57:42,302][26022] Updated weights on worker 0-0, policy_version 1162109 (0.00089) [2022-07-11 10:57:42,351][25689] Fps is (10 sec: 5497.1, 60 sec: 5516.4, 300 sec: 5521.0). Total num frames: 1189999616. Throughput: 0: 5841.2. Samples: 1190009150. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:42,359][25689] Avg episode reward: [(0, '-0.242')] [2022-07-11 10:57:43,989][26022] Updated weights on worker 0-0, policy_version 1162119 (0.00094) [2022-07-11 10:57:46,071][26022] Updated weights on worker 0-0, policy_version 1162129 (0.00094) [2022-07-11 10:57:47,409][25689] Fps is (10 sec: 5678.5, 60 sec: 5529.2, 300 sec: 5525.8). Total num frames: 1190028288. Throughput: 0: 5002.5. Samples: 1190025842. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:47,409][25689] Avg episode reward: [(0, '-0.129')] [2022-07-11 10:57:47,733][26022] Updated weights on worker 0-0, policy_version 1162139 (0.00084) [2022-07-11 10:57:49,651][26022] Updated weights on worker 0-0, policy_version 1162149 (0.00095) [2022-07-11 10:57:51,513][26022] Updated weights on worker 0-0, policy_version 1162159 (0.00087) [2022-07-11 10:57:52,469][25689] Fps is (10 sec: 5566.3, 60 sec: 5545.1, 300 sec: 5521.8). Total num frames: 1190055936. Throughput: 0: 5848.1. Samples: 1190059234. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:52,469][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 10:57:53,255][26022] Updated weights on worker 0-0, policy_version 1162169 (0.00084) [2022-07-11 10:57:55,206][26022] Updated weights on worker 0-0, policy_version 1162179 (0.00092) [2022-07-11 10:57:56,879][26022] Updated weights on worker 0-0, policy_version 1162189 (0.00081) [2022-07-11 10:57:57,474][25689] Fps is (10 sec: 5494.0, 60 sec: 5545.7, 300 sec: 5518.5). Total num frames: 1190083584. Throughput: 0: 5820.8. Samples: 1190092544. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:57:57,474][25689] Avg episode reward: [(0, '-0.396')] [2022-07-11 10:57:58,781][26022] Updated weights on worker 0-0, policy_version 1162199 (0.00085) [2022-07-11 10:58:00,399][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 10:58:00,410][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001162207_1190099968.pth [2022-07-11 10:58:00,410][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001160262_1188108288.pth [2022-07-11 10:58:00,829][26022] Updated weights on worker 0-0, policy_version 1162209 (0.00085) [2022-07-11 10:58:02,496][25689] Fps is (10 sec: 5412.3, 60 sec: 5549.9, 300 sec: 5528.9). Total num frames: 1190110208. Throughput: 0: 4959.5. Samples: 1190109062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:02,497][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 10:58:02,732][26022] Updated weights on worker 0-0, policy_version 1162219 (0.00090) [2022-07-11 10:58:04,795][26022] Updated weights on worker 0-0, policy_version 1162229 (0.00078) [2022-07-11 10:58:06,457][26022] Updated weights on worker 0-0, policy_version 1162239 (0.00083) [2022-07-11 10:58:07,507][25689] Fps is (10 sec: 5307.3, 60 sec: 5541.3, 300 sec: 5512.9). Total num frames: 1190136832. Throughput: 0: 5711.2. Samples: 1190140624. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:07,507][25689] Avg episode reward: [(0, '0.658')] [2022-07-11 10:58:08,497][26022] Updated weights on worker 0-0, policy_version 1162249 (0.00100) [2022-07-11 10:58:10,297][26022] Updated weights on worker 0-0, policy_version 1162259 (0.00088) [2022-07-11 10:58:11,883][26022] Updated weights on worker 0-0, policy_version 1162269 (0.00082) [2022-07-11 10:58:12,581][25689] Fps is (10 sec: 5483.2, 60 sec: 5525.9, 300 sec: 5525.8). Total num frames: 1190165504. Throughput: 0: 5709.1. Samples: 1190174056. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:12,581][25689] Avg episode reward: [(0, '-0.183')] [2022-07-11 10:58:13,877][26022] Updated weights on worker 0-0, policy_version 1162279 (0.00068) [2022-07-11 10:58:15,735][26022] Updated weights on worker 0-0, policy_version 1162289 (0.00096) [2022-07-11 10:58:17,566][26022] Updated weights on worker 0-0, policy_version 1162299 (0.00092) [2022-07-11 10:58:17,651][25689] Fps is (10 sec: 5652.8, 60 sec: 5537.1, 300 sec: 5521.3). Total num frames: 1190194176. Throughput: 0: 4876.2. Samples: 1190190932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:17,651][25689] Avg episode reward: [(0, '-0.050')] [2022-07-11 10:58:19,335][26022] Updated weights on worker 0-0, policy_version 1162309 (0.00086) [2022-07-11 10:58:21,292][26022] Updated weights on worker 0-0, policy_version 1162319 (0.00089) [2022-07-11 10:58:22,686][25689] Fps is (10 sec: 5573.5, 60 sec: 5553.2, 300 sec: 5520.8). Total num frames: 1190221824. Throughput: 0: 5700.4. Samples: 1190224150. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:22,686][25689] Avg episode reward: [(0, '-0.059')] [2022-07-11 10:58:23,108][26022] Updated weights on worker 0-0, policy_version 1162329 (0.00084) [2022-07-11 10:58:24,927][26022] Updated weights on worker 0-0, policy_version 1162339 (0.00083) [2022-07-11 10:58:26,736][26022] Updated weights on worker 0-0, policy_version 1162349 (0.00089) [2022-07-11 10:58:27,706][25689] Fps is (10 sec: 5499.1, 60 sec: 5537.1, 300 sec: 5519.2). Total num frames: 1190249472. Throughput: 0: 5777.1. Samples: 1190257320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:27,707][25689] Avg episode reward: [(0, '-0.782')] [2022-07-11 10:58:28,775][26022] Updated weights on worker 0-0, policy_version 1162359 (0.00090) [2022-07-11 10:58:30,638][26022] Updated weights on worker 0-0, policy_version 1162369 (0.00085) [2022-07-11 10:58:32,455][26022] Updated weights on worker 0-0, policy_version 1162379 (0.00087) [2022-07-11 10:58:32,745][25689] Fps is (10 sec: 5598.8, 60 sec: 5522.0, 300 sec: 5522.2). Total num frames: 1190278144. Throughput: 0: 4937.2. Samples: 1190273612. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:32,746][25689] Avg episode reward: [(0, '-1.984')] [2022-07-11 10:58:34,378][26022] Updated weights on worker 0-0, policy_version 1162389 (0.00107) [2022-07-11 10:58:36,011][26022] Updated weights on worker 0-0, policy_version 1162399 (0.00091) [2022-07-11 10:58:37,772][25689] Fps is (10 sec: 5391.6, 60 sec: 5504.3, 300 sec: 5511.8). Total num frames: 1190303744. Throughput: 0: 5761.8. Samples: 1190306868. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:37,773][25689] Avg episode reward: [(0, '-0.992')] [2022-07-11 10:58:38,091][26022] Updated weights on worker 0-0, policy_version 1162409 (0.00087) [2022-07-11 10:58:39,706][26022] Updated weights on worker 0-0, policy_version 1162419 (0.00083) [2022-07-11 10:58:41,627][26022] Updated weights on worker 0-0, policy_version 1162429 (0.00096) [2022-07-11 10:58:42,775][25689] Fps is (10 sec: 5411.2, 60 sec: 5507.7, 300 sec: 5517.0). Total num frames: 1190332416. Throughput: 0: 5796.2. Samples: 1190340590. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:42,775][25689] Avg episode reward: [(0, '-0.942')] [2022-07-11 10:58:43,552][26022] Updated weights on worker 0-0, policy_version 1162439 (0.00086) [2022-07-11 10:58:45,253][26022] Updated weights on worker 0-0, policy_version 1162449 (0.00091) [2022-07-11 10:58:47,244][26022] Updated weights on worker 0-0, policy_version 1162459 (0.00090) [2022-07-11 10:58:47,789][25689] Fps is (10 sec: 5827.3, 60 sec: 5528.7, 300 sec: 5518.1). Total num frames: 1190362112. Throughput: 0: 4983.0. Samples: 1190357390. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:47,789][25689] Avg episode reward: [(0, '-2.292')] [2022-07-11 10:58:48,863][26022] Updated weights on worker 0-0, policy_version 1162469 (0.00095) [2022-07-11 10:58:50,825][26022] Updated weights on worker 0-0, policy_version 1162479 (0.00093) [2022-07-11 10:58:52,734][26022] Updated weights on worker 0-0, policy_version 1162489 (0.00092) [2022-07-11 10:58:52,912][25689] Fps is (10 sec: 5556.0, 60 sec: 5506.0, 300 sec: 5516.7). Total num frames: 1190388736. Throughput: 0: 5795.9. Samples: 1190390496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:52,912][25689] Avg episode reward: [(0, '-1.005')] [2022-07-11 10:58:54,401][26022] Updated weights on worker 0-0, policy_version 1162499 (0.00086) [2022-07-11 10:58:56,580][26022] Updated weights on worker 0-0, policy_version 1162509 (0.00094) [2022-07-11 10:58:57,918][25689] Fps is (10 sec: 5458.7, 60 sec: 5522.8, 300 sec: 5520.1). Total num frames: 1190417408. Throughput: 0: 5820.1. Samples: 1190424120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:58:57,919][25689] Avg episode reward: [(0, '-1.040')] [2022-07-11 10:58:58,165][26022] Updated weights on worker 0-0, policy_version 1162519 (0.00101) [2022-07-11 10:58:59,966][26022] Updated weights on worker 0-0, policy_version 1162529 (0.00092) [2022-07-11 10:59:02,159][26022] Updated weights on worker 0-0, policy_version 1162539 (0.00107) [2022-07-11 10:59:02,926][25689] Fps is (10 sec: 5419.5, 60 sec: 5507.2, 300 sec: 5520.3). Total num frames: 1190443008. Throughput: 0: 4963.6. Samples: 1190440612. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:02,926][25689] Avg episode reward: [(0, '0.081')] [2022-07-11 10:59:03,982][26022] Updated weights on worker 0-0, policy_version 1162549 (0.00087) [2022-07-11 10:59:05,975][26022] Updated weights on worker 0-0, policy_version 1162559 (0.00092) [2022-07-11 10:59:07,715][26022] Updated weights on worker 0-0, policy_version 1162569 (0.00087) [2022-07-11 10:59:07,991][25689] Fps is (10 sec: 5286.3, 60 sec: 5519.2, 300 sec: 5518.0). Total num frames: 1190470656. Throughput: 0: 5683.2. Samples: 1190472206. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:07,992][25689] Avg episode reward: [(0, '-0.363')] [2022-07-11 10:59:09,643][26022] Updated weights on worker 0-0, policy_version 1162579 (0.00089) [2022-07-11 10:59:11,400][26022] Updated weights on worker 0-0, policy_version 1162589 (0.00094) [2022-07-11 10:59:13,141][25689] Fps is (10 sec: 5613.5, 60 sec: 5529.2, 300 sec: 5523.5). Total num frames: 1190500352. Throughput: 0: 5708.4. Samples: 1190505974. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:13,142][25689] Avg episode reward: [(0, '-0.928')] [2022-07-11 10:59:13,187][26022] Updated weights on worker 0-0, policy_version 1162599 (0.00096) [2022-07-11 10:59:15,214][26022] Updated weights on worker 0-0, policy_version 1162609 (0.00092) [2022-07-11 10:59:16,738][26022] Updated weights on worker 0-0, policy_version 1162619 (0.00090) [2022-07-11 10:59:18,149][25689] Fps is (10 sec: 5544.5, 60 sec: 5501.0, 300 sec: 5516.9). Total num frames: 1190526976. Throughput: 0: 4876.7. Samples: 1190522780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:18,154][25689] Avg episode reward: [(0, '0.118')] [2022-07-11 10:59:18,802][26022] Updated weights on worker 0-0, policy_version 1162629 (0.00085) [2022-07-11 10:59:20,537][26022] Updated weights on worker 0-0, policy_version 1162639 (0.00091) [2022-07-11 10:59:22,525][26022] Updated weights on worker 0-0, policy_version 1162649 (0.00086) [2022-07-11 10:59:23,159][25689] Fps is (10 sec: 5519.9, 60 sec: 5520.2, 300 sec: 5520.5). Total num frames: 1190555648. Throughput: 0: 5700.9. Samples: 1190555958. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:23,159][25689] Avg episode reward: [(0, '-0.984')] [2022-07-11 10:59:24,249][26022] Updated weights on worker 0-0, policy_version 1162659 (0.00085) [2022-07-11 10:59:26,280][26022] Updated weights on worker 0-0, policy_version 1162669 (0.00125) [2022-07-11 10:59:28,111][26022] Updated weights on worker 0-0, policy_version 1162679 (0.00091) [2022-07-11 10:59:28,171][25689] Fps is (10 sec: 5619.8, 60 sec: 5521.0, 300 sec: 5522.5). Total num frames: 1190583296. Throughput: 0: 5811.4. Samples: 1190589478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:28,171][25689] Avg episode reward: [(0, '-0.537')] [2022-07-11 10:59:29,720][26022] Updated weights on worker 0-0, policy_version 1162689 (0.00089) [2022-07-11 10:59:31,667][26022] Updated weights on worker 0-0, policy_version 1162699 (0.00088) [2022-07-11 10:59:33,315][25689] Fps is (10 sec: 5646.3, 60 sec: 5528.3, 300 sec: 5524.1). Total num frames: 1190612992. Throughput: 0: 4963.4. Samples: 1190606104. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:33,315][25689] Avg episode reward: [(0, '-0.447')] [2022-07-11 10:59:33,389][26022] Updated weights on worker 0-0, policy_version 1162709 (0.00085) [2022-07-11 10:59:35,375][26022] Updated weights on worker 0-0, policy_version 1162719 (0.00091) [2022-07-11 10:59:37,079][26022] Updated weights on worker 0-0, policy_version 1162729 (0.00083) [2022-07-11 10:59:38,322][25689] Fps is (10 sec: 5649.0, 60 sec: 5564.0, 300 sec: 5524.1). Total num frames: 1190640640. Throughput: 0: 5811.6. Samples: 1190640016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:38,322][25689] Avg episode reward: [(0, '0.686')] [2022-07-11 10:59:38,972][26022] Updated weights on worker 0-0, policy_version 1162739 (0.00084) [2022-07-11 10:59:40,935][26022] Updated weights on worker 0-0, policy_version 1162749 (0.00092) [2022-07-11 10:59:42,571][26022] Updated weights on worker 0-0, policy_version 1162759 (0.00090) [2022-07-11 10:59:43,336][25689] Fps is (10 sec: 5620.1, 60 sec: 5562.9, 300 sec: 5531.6). Total num frames: 1190669312. Throughput: 0: 5817.8. Samples: 1190673344. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:43,336][25689] Avg episode reward: [(0, '1.011')] [2022-07-11 10:59:44,582][26022] Updated weights on worker 0-0, policy_version 1162769 (0.00085) [2022-07-11 10:59:46,287][26022] Updated weights on worker 0-0, policy_version 1162779 (0.00090) [2022-07-11 10:59:48,260][26022] Updated weights on worker 0-0, policy_version 1162789 (0.00087) [2022-07-11 10:59:48,359][25689] Fps is (10 sec: 5611.4, 60 sec: 5528.3, 300 sec: 5525.8). Total num frames: 1190696960. Throughput: 0: 4984.2. Samples: 1190690098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:48,359][25689] Avg episode reward: [(0, '1.151')] [2022-07-11 10:59:50,277][26022] Updated weights on worker 0-0, policy_version 1162799 (0.00090) [2022-07-11 10:59:51,981][26022] Updated weights on worker 0-0, policy_version 1162809 (0.00087) [2022-07-11 10:59:53,501][25689] Fps is (10 sec: 5439.6, 60 sec: 5543.4, 300 sec: 5521.0). Total num frames: 1190724608. Throughput: 0: 5814.9. Samples: 1190723488. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:53,502][25689] Avg episode reward: [(0, '1.952')] [2022-07-11 10:59:53,759][26022] Updated weights on worker 0-0, policy_version 1162819 (0.00084) [2022-07-11 10:59:55,570][26022] Updated weights on worker 0-0, policy_version 1162829 (0.00099) [2022-07-11 10:59:57,358][26022] Updated weights on worker 0-0, policy_version 1162839 (0.00090) [2022-07-11 10:59:58,535][25689] Fps is (10 sec: 5534.7, 60 sec: 5541.0, 300 sec: 5525.0). Total num frames: 1190753280. Throughput: 0: 5788.3. Samples: 1190757012. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 10:59:58,535][25689] Avg episode reward: [(0, '1.972')] [2022-07-11 10:59:59,450][26022] Updated weights on worker 0-0, policy_version 1162849 (0.00085) [2022-07-11 11:00:00,433][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:00:00,442][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001162855_1190763520.pth [2022-07-11 11:00:00,442][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001160911_1188772864.pth [2022-07-11 11:00:00,880][26022] Updated weights on worker 0-0, policy_version 1162859 (0.00092) [2022-07-11 11:00:03,554][25689] Fps is (10 sec: 5297.1, 60 sec: 5523.0, 300 sec: 5518.3). Total num frames: 1190777856. Throughput: 0: 5675.4. Samples: 1190788088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:03,555][25689] Avg episode reward: [(0, '1.281')] [2022-07-11 11:00:03,556][26022] Updated weights on worker 0-0, policy_version 1162869 (0.00082) [2022-07-11 11:00:04,939][26022] Updated weights on worker 0-0, policy_version 1162879 (0.00082) [2022-07-11 11:00:07,077][26022] Updated weights on worker 0-0, policy_version 1162889 (0.00081) [2022-07-11 11:00:08,573][25689] Fps is (10 sec: 5304.4, 60 sec: 5544.1, 300 sec: 5522.3). Total num frames: 1190806528. Throughput: 0: 5675.6. Samples: 1190804826. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:08,574][25689] Avg episode reward: [(0, '0.590')] [2022-07-11 11:00:08,768][26022] Updated weights on worker 0-0, policy_version 1162899 (0.00089) [2022-07-11 11:00:10,604][26022] Updated weights on worker 0-0, policy_version 1162909 (0.00089) [2022-07-11 11:00:12,761][26022] Updated weights on worker 0-0, policy_version 1162919 (0.00081) [2022-07-11 11:00:13,675][25689] Fps is (10 sec: 5665.8, 60 sec: 5531.6, 300 sec: 5521.5). Total num frames: 1190835200. Throughput: 0: 5693.2. Samples: 1190838338. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:13,675][25689] Avg episode reward: [(0, '0.932')] [2022-07-11 11:00:14,327][26022] Updated weights on worker 0-0, policy_version 1162929 (0.00083) [2022-07-11 11:00:16,187][26022] Updated weights on worker 0-0, policy_version 1162939 (0.00086) [2022-07-11 11:00:17,991][26022] Updated weights on worker 0-0, policy_version 1162949 (0.00080) [2022-07-11 11:00:18,743][25689] Fps is (10 sec: 5537.6, 60 sec: 5543.0, 300 sec: 5523.8). Total num frames: 1190862848. Throughput: 0: 5682.4. Samples: 1190871848. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:18,744][25689] Avg episode reward: [(0, '0.792')] [2022-07-11 11:00:19,761][26022] Updated weights on worker 0-0, policy_version 1162959 (0.00078) [2022-07-11 11:00:21,766][26022] Updated weights on worker 0-0, policy_version 1162969 (0.00105) [2022-07-11 11:00:23,447][26022] Updated weights on worker 0-0, policy_version 1162979 (0.00079) [2022-07-11 11:00:23,825][25689] Fps is (10 sec: 5548.6, 60 sec: 5536.4, 300 sec: 5526.1). Total num frames: 1190891520. Throughput: 0: 4972.0. Samples: 1190888876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:23,825][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 11:00:25,405][26022] Updated weights on worker 0-0, policy_version 1162989 (0.00084) [2022-07-11 11:00:27,155][26022] Updated weights on worker 0-0, policy_version 1162999 (0.00096) [2022-07-11 11:00:28,831][25689] Fps is (10 sec: 5583.1, 60 sec: 5536.9, 300 sec: 5527.0). Total num frames: 1190919168. Throughput: 0: 5786.2. Samples: 1190922044. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:28,832][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 11:00:29,244][26022] Updated weights on worker 0-0, policy_version 1163009 (0.00092) [2022-07-11 11:00:30,807][26022] Updated weights on worker 0-0, policy_version 1163019 (0.00091) [2022-07-11 11:00:33,004][26022] Updated weights on worker 0-0, policy_version 1163029 (0.00088) [2022-07-11 11:00:33,949][25689] Fps is (10 sec: 5562.7, 60 sec: 5522.4, 300 sec: 5525.2). Total num frames: 1190947840. Throughput: 0: 5755.3. Samples: 1190955026. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 11:00:33,950][25689] Avg episode reward: [(0, '0.049')] [2022-07-11 11:00:34,689][26022] Updated weights on worker 0-0, policy_version 1163039 (0.00086) [2022-07-11 11:00:36,563][26022] Updated weights on worker 0-0, policy_version 1163049 (0.00093) [2022-07-11 11:00:38,503][26022] Updated weights on worker 0-0, policy_version 1163059 (0.00082) [2022-07-11 11:00:38,960][25689] Fps is (10 sec: 5459.4, 60 sec: 5505.2, 300 sec: 5525.1). Total num frames: 1190974464. Throughput: 0: 4941.8. Samples: 1190971752. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:00:38,962][25689] Avg episode reward: [(0, '-0.749')] [2022-07-11 11:00:40,315][26022] Updated weights on worker 0-0, policy_version 1163069 (0.00090) [2022-07-11 11:00:42,003][26022] Updated weights on worker 0-0, policy_version 1163079 (0.00088) [2022-07-11 11:00:43,935][26022] Updated weights on worker 0-0, policy_version 1163089 (0.00083) [2022-07-11 11:00:43,976][25689] Fps is (10 sec: 5515.0, 60 sec: 5505.0, 300 sec: 5528.6). Total num frames: 1191003136. Throughput: 0: 5775.0. Samples: 1191005250. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:00:43,978][25689] Avg episode reward: [(0, '-0.864')] [2022-07-11 11:00:45,729][26022] Updated weights on worker 0-0, policy_version 1163099 (0.00087) [2022-07-11 11:00:47,633][26022] Updated weights on worker 0-0, policy_version 1163109 (0.00088) [2022-07-11 11:00:48,995][25689] Fps is (10 sec: 5714.0, 60 sec: 5522.2, 300 sec: 5529.1). Total num frames: 1191031808. Throughput: 0: 5795.6. Samples: 1191038908. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:00:48,996][25689] Avg episode reward: [(0, '-0.800')] [2022-07-11 11:00:49,329][26022] Updated weights on worker 0-0, policy_version 1163119 (0.00090) [2022-07-11 11:00:51,395][26022] Updated weights on worker 0-0, policy_version 1163129 (0.00087) [2022-07-11 11:00:53,049][26022] Updated weights on worker 0-0, policy_version 1163139 (0.00096) [2022-07-11 11:00:54,117][25689] Fps is (10 sec: 5351.8, 60 sec: 5490.3, 300 sec: 5520.5). Total num frames: 1191057408. Throughput: 0: 4965.1. Samples: 1191055160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:00:54,117][25689] Avg episode reward: [(0, '-0.764')] [2022-07-11 11:00:55,083][26022] Updated weights on worker 0-0, policy_version 1163149 (0.00088) [2022-07-11 11:00:57,118][26022] Updated weights on worker 0-0, policy_version 1163159 (0.00093) [2022-07-11 11:00:58,623][26022] Updated weights on worker 0-0, policy_version 1163169 (0.00089) [2022-07-11 11:00:59,132][25689] Fps is (10 sec: 5555.9, 60 sec: 5525.7, 300 sec: 5530.8). Total num frames: 1191088128. Throughput: 0: 5786.5. Samples: 1191088482. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:00:59,133][25689] Avg episode reward: [(0, '-0.954')] [2022-07-11 11:01:00,807][26022] Updated weights on worker 0-0, policy_version 1163179 (0.00082) [2022-07-11 11:01:02,659][26022] Updated weights on worker 0-0, policy_version 1163189 (0.00097) [2022-07-11 11:01:04,184][25689] Fps is (10 sec: 5492.7, 60 sec: 5522.8, 300 sec: 5523.2). Total num frames: 1191112704. Throughput: 0: 5657.9. Samples: 1191119584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:04,185][25689] Avg episode reward: [(0, '-0.206')] [2022-07-11 11:01:04,858][26022] Updated weights on worker 0-0, policy_version 1163199 (0.00090) [2022-07-11 11:01:06,420][26022] Updated weights on worker 0-0, policy_version 1163209 (0.00091) [2022-07-11 11:01:08,350][26022] Updated weights on worker 0-0, policy_version 1163219 (0.00108) [2022-07-11 11:01:09,220][25689] Fps is (10 sec: 5075.6, 60 sec: 5487.5, 300 sec: 5521.0). Total num frames: 1191139328. Throughput: 0: 4801.7. Samples: 1191136020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:09,220][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 11:01:10,246][26022] Updated weights on worker 0-0, policy_version 1163229 (0.00092) [2022-07-11 11:01:12,286][26022] Updated weights on worker 0-0, policy_version 1163239 (0.00093) [2022-07-11 11:01:13,999][26022] Updated weights on worker 0-0, policy_version 1163249 (0.00092) [2022-07-11 11:01:14,266][25689] Fps is (10 sec: 5484.4, 60 sec: 5492.5, 300 sec: 5520.6). Total num frames: 1191168000. Throughput: 0: 5653.3. Samples: 1191169072. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:14,267][25689] Avg episode reward: [(0, '-0.287')] [2022-07-11 11:01:15,848][26022] Updated weights on worker 0-0, policy_version 1163259 (0.00050) [2022-07-11 11:01:17,695][26022] Updated weights on worker 0-0, policy_version 1163269 (0.00088) [2022-07-11 11:01:19,281][25689] Fps is (10 sec: 5598.0, 60 sec: 5497.4, 300 sec: 5521.3). Total num frames: 1191195648. Throughput: 0: 5657.9. Samples: 1191202478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:19,281][25689] Avg episode reward: [(0, '-0.486')] [2022-07-11 11:01:19,687][26022] Updated weights on worker 0-0, policy_version 1163279 (0.00082) [2022-07-11 11:01:21,238][26022] Updated weights on worker 0-0, policy_version 1163289 (0.00097) [2022-07-11 11:01:23,291][26022] Updated weights on worker 0-0, policy_version 1163299 (0.00091) [2022-07-11 11:01:24,282][25689] Fps is (10 sec: 5623.3, 60 sec: 5504.7, 300 sec: 5521.7). Total num frames: 1191224320. Throughput: 0: 4957.1. Samples: 1191219212. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:24,282][25689] Avg episode reward: [(0, '-0.424')] [2022-07-11 11:01:25,128][26022] Updated weights on worker 0-0, policy_version 1163309 (0.00089) [2022-07-11 11:01:26,938][26022] Updated weights on worker 0-0, policy_version 1163319 (0.00081) [2022-07-11 11:01:28,779][26022] Updated weights on worker 0-0, policy_version 1163329 (0.00090) [2022-07-11 11:01:29,292][25689] Fps is (10 sec: 5625.6, 60 sec: 5504.4, 300 sec: 5524.2). Total num frames: 1191251968. Throughput: 0: 5797.0. Samples: 1191252378. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:29,293][25689] Avg episode reward: [(0, '1.059')] [2022-07-11 11:01:30,702][26022] Updated weights on worker 0-0, policy_version 1163339 (0.00089) [2022-07-11 11:01:32,395][26022] Updated weights on worker 0-0, policy_version 1163349 (0.00084) [2022-07-11 11:01:34,423][25689] Fps is (10 sec: 5452.8, 60 sec: 5486.3, 300 sec: 5518.7). Total num frames: 1191279616. Throughput: 0: 5787.2. Samples: 1191285720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:34,423][25689] Avg episode reward: [(0, '0.593')] [2022-07-11 11:01:34,425][26022] Updated weights on worker 0-0, policy_version 1163359 (0.00109) [2022-07-11 11:01:36,119][26022] Updated weights on worker 0-0, policy_version 1163369 (0.00084) [2022-07-11 11:01:38,107][26022] Updated weights on worker 0-0, policy_version 1163379 (0.00093) [2022-07-11 11:01:39,478][25689] Fps is (10 sec: 5529.1, 60 sec: 5516.1, 300 sec: 5518.4). Total num frames: 1191308288. Throughput: 0: 4939.0. Samples: 1191302234. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:39,479][25689] Avg episode reward: [(0, '1.049')] [2022-07-11 11:01:39,690][26022] Updated weights on worker 0-0, policy_version 1163389 (0.00087) [2022-07-11 11:01:41,732][26022] Updated weights on worker 0-0, policy_version 1163399 (0.00085) [2022-07-11 11:01:43,464][26022] Updated weights on worker 0-0, policy_version 1163409 (0.00086) [2022-07-11 11:01:44,495][25689] Fps is (10 sec: 5591.7, 60 sec: 5499.1, 300 sec: 5518.3). Total num frames: 1191335936. Throughput: 0: 5775.3. Samples: 1191335948. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:44,495][25689] Avg episode reward: [(0, '0.784')] [2022-07-11 11:01:45,452][26022] Updated weights on worker 0-0, policy_version 1163419 (0.00088) [2022-07-11 11:01:47,149][26022] Updated weights on worker 0-0, policy_version 1163429 (0.00081) [2022-07-11 11:01:49,357][26022] Updated weights on worker 0-0, policy_version 1163439 (0.00087) [2022-07-11 11:01:49,526][25689] Fps is (10 sec: 5401.3, 60 sec: 5464.2, 300 sec: 5518.6). Total num frames: 1191362560. Throughput: 0: 5765.6. Samples: 1191369040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:49,527][25689] Avg episode reward: [(0, '0.884')] [2022-07-11 11:01:50,741][26022] Updated weights on worker 0-0, policy_version 1163449 (0.00086) [2022-07-11 11:01:52,867][26022] Updated weights on worker 0-0, policy_version 1163459 (0.00092) [2022-07-11 11:01:54,366][26022] Updated weights on worker 0-0, policy_version 1163469 (0.00048) [2022-07-11 11:01:54,606][25689] Fps is (10 sec: 5671.5, 60 sec: 5552.6, 300 sec: 5527.7). Total num frames: 1191393280. Throughput: 0: 4960.9. Samples: 1191385846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:54,606][25689] Avg episode reward: [(0, '0.782')] [2022-07-11 11:01:56,561][26022] Updated weights on worker 0-0, policy_version 1163479 (0.00080) [2022-07-11 11:01:58,037][26022] Updated weights on worker 0-0, policy_version 1163489 (0.00094) [2022-07-11 11:01:59,642][25689] Fps is (10 sec: 5567.3, 60 sec: 5466.1, 300 sec: 5524.8). Total num frames: 1191418880. Throughput: 0: 5807.6. Samples: 1191419340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:01:59,643][25689] Avg episode reward: [(0, '0.845')] [2022-07-11 11:02:00,121][26022] Updated weights on worker 0-0, policy_version 1163499 (0.00088) [2022-07-11 11:02:00,687][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:02:00,703][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001163502_1191426048.pth [2022-07-11 11:02:00,704][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001161558_1189435392.pth [2022-07-11 11:02:02,264][26022] Updated weights on worker 0-0, policy_version 1163509 (0.00101) [2022-07-11 11:02:04,303][26022] Updated weights on worker 0-0, policy_version 1163519 (0.00106) [2022-07-11 11:02:04,673][25689] Fps is (10 sec: 5289.0, 60 sec: 5518.7, 300 sec: 5526.1). Total num frames: 1191446528. Throughput: 0: 5690.8. Samples: 1191450780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:04,674][25689] Avg episode reward: [(0, '0.548')] [2022-07-11 11:02:05,967][26022] Updated weights on worker 0-0, policy_version 1163529 (0.00086) [2022-07-11 11:02:07,916][26022] Updated weights on worker 0-0, policy_version 1163539 (0.00096) [2022-07-11 11:02:09,601][26022] Updated weights on worker 0-0, policy_version 1163549 (0.00088) [2022-07-11 11:02:09,700][25689] Fps is (10 sec: 5497.9, 60 sec: 5536.5, 300 sec: 5520.4). Total num frames: 1191474176. Throughput: 0: 4859.9. Samples: 1191467084. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:09,701][25689] Avg episode reward: [(0, '-1.076')] [2022-07-11 11:02:11,727][26022] Updated weights on worker 0-0, policy_version 1163559 (0.00092) [2022-07-11 11:02:13,531][26022] Updated weights on worker 0-0, policy_version 1163569 (0.00096) [2022-07-11 11:02:14,819][25689] Fps is (10 sec: 5349.5, 60 sec: 5496.1, 300 sec: 5514.9). Total num frames: 1191500800. Throughput: 0: 5663.3. Samples: 1191500318. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:14,819][25689] Avg episode reward: [(0, '-0.841')] [2022-07-11 11:02:15,359][26022] Updated weights on worker 0-0, policy_version 1163579 (0.00085) [2022-07-11 11:02:17,123][26022] Updated weights on worker 0-0, policy_version 1163589 (0.00092) [2022-07-11 11:02:18,867][26022] Updated weights on worker 0-0, policy_version 1163599 (0.00089) [2022-07-11 11:02:19,835][25689] Fps is (10 sec: 5355.0, 60 sec: 5495.9, 300 sec: 5518.5). Total num frames: 1191528448. Throughput: 0: 5666.3. Samples: 1191533756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:19,835][25689] Avg episode reward: [(0, '-0.751')] [2022-07-11 11:02:20,901][26022] Updated weights on worker 0-0, policy_version 1163609 (0.00095) [2022-07-11 11:02:22,601][26022] Updated weights on worker 0-0, policy_version 1163619 (0.00086) [2022-07-11 11:02:24,264][26022] Updated weights on worker 0-0, policy_version 1163629 (0.00092) [2022-07-11 11:02:24,844][25689] Fps is (10 sec: 5719.9, 60 sec: 5512.1, 300 sec: 5522.4). Total num frames: 1191558144. Throughput: 0: 5781.6. Samples: 1191567398. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:24,844][25689] Avg episode reward: [(0, '-0.816')] [2022-07-11 11:02:26,521][26022] Updated weights on worker 0-0, policy_version 1163639 (0.00087) [2022-07-11 11:02:28,038][26022] Updated weights on worker 0-0, policy_version 1163649 (0.00085) [2022-07-11 11:02:29,872][25689] Fps is (10 sec: 5713.3, 60 sec: 5510.5, 300 sec: 5516.1). Total num frames: 1191585792. Throughput: 0: 5798.0. Samples: 1191584040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:29,872][25689] Avg episode reward: [(0, '-1.657')] [2022-07-11 11:02:30,148][26022] Updated weights on worker 0-0, policy_version 1163659 (0.00092) [2022-07-11 11:02:31,869][26022] Updated weights on worker 0-0, policy_version 1163669 (0.00087) [2022-07-11 11:02:33,872][26022] Updated weights on worker 0-0, policy_version 1163679 (0.00087) [2022-07-11 11:02:34,963][25689] Fps is (10 sec: 5464.8, 60 sec: 5514.1, 300 sec: 5518.2). Total num frames: 1191613440. Throughput: 0: 5799.7. Samples: 1191617146. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:34,963][25689] Avg episode reward: [(0, '-1.858')] [2022-07-11 11:02:35,684][26022] Updated weights on worker 0-0, policy_version 1163689 (0.00092) [2022-07-11 11:02:37,531][26022] Updated weights on worker 0-0, policy_version 1163699 (0.00088) [2022-07-11 11:02:39,230][26022] Updated weights on worker 0-0, policy_version 1163709 (0.00091) [2022-07-11 11:02:40,018][25689] Fps is (10 sec: 5550.7, 60 sec: 5514.1, 300 sec: 5517.9). Total num frames: 1191642112. Throughput: 0: 5791.1. Samples: 1191650640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:40,019][25689] Avg episode reward: [(0, '0.049')] [2022-07-11 11:02:41,108][26022] Updated weights on worker 0-0, policy_version 1163719 (0.00088) [2022-07-11 11:02:42,926][26022] Updated weights on worker 0-0, policy_version 1163729 (0.00091) [2022-07-11 11:02:44,879][26022] Updated weights on worker 0-0, policy_version 1163739 (0.01016) [2022-07-11 11:02:45,037][25689] Fps is (10 sec: 5488.7, 60 sec: 5497.0, 300 sec: 5511.7). Total num frames: 1191668736. Throughput: 0: 4950.0. Samples: 1191667354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:45,038][25689] Avg episode reward: [(0, '-0.439')] [2022-07-11 11:02:46,699][26022] Updated weights on worker 0-0, policy_version 1163749 (0.00084) [2022-07-11 11:02:48,557][26022] Updated weights on worker 0-0, policy_version 1163759 (0.00086) [2022-07-11 11:02:50,095][25689] Fps is (10 sec: 5386.1, 60 sec: 5511.5, 300 sec: 5511.8). Total num frames: 1191696384. Throughput: 0: 5774.9. Samples: 1191700824. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:50,095][25689] Avg episode reward: [(0, '-0.459')] [2022-07-11 11:02:50,403][26022] Updated weights on worker 0-0, policy_version 1163769 (0.00085) [2022-07-11 11:02:52,103][26022] Updated weights on worker 0-0, policy_version 1163779 (0.00097) [2022-07-11 11:02:54,008][26022] Updated weights on worker 0-0, policy_version 1163789 (0.00113) [2022-07-11 11:02:55,142][25689] Fps is (10 sec: 5674.7, 60 sec: 5497.5, 300 sec: 5517.9). Total num frames: 1191726080. Throughput: 0: 5811.5. Samples: 1191734420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:02:55,143][25689] Avg episode reward: [(0, '0.408')] [2022-07-11 11:02:55,827][26022] Updated weights on worker 0-0, policy_version 1163799 (0.00090) [2022-07-11 11:02:57,590][26022] Updated weights on worker 0-0, policy_version 1163809 (0.00088) [2022-07-11 11:02:59,588][26022] Updated weights on worker 0-0, policy_version 1163819 (0.00094) [2022-07-11 11:03:00,204][25689] Fps is (10 sec: 5672.5, 60 sec: 5529.1, 300 sec: 5520.6). Total num frames: 1191753728. Throughput: 0: 4977.5. Samples: 1191751114. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:00,204][25689] Avg episode reward: [(0, '0.729')] [2022-07-11 11:03:01,191][26022] Updated weights on worker 0-0, policy_version 1163829 (0.00083) [2022-07-11 11:03:03,826][26022] Updated weights on worker 0-0, policy_version 1163839 (0.00082) [2022-07-11 11:03:05,276][25689] Fps is (10 sec: 5355.6, 60 sec: 5508.4, 300 sec: 5519.4). Total num frames: 1191780352. Throughput: 0: 5696.4. Samples: 1191782642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:05,278][25689] Avg episode reward: [(0, '1.217')] [2022-07-11 11:03:05,293][26022] Updated weights on worker 0-0, policy_version 1163849 (0.00093) [2022-07-11 11:03:07,356][26022] Updated weights on worker 0-0, policy_version 1163859 (0.00083) [2022-07-11 11:03:09,168][26022] Updated weights on worker 0-0, policy_version 1163869 (0.00091) [2022-07-11 11:03:10,362][25689] Fps is (10 sec: 5342.5, 60 sec: 5503.0, 300 sec: 5515.8). Total num frames: 1191808000. Throughput: 0: 5681.6. Samples: 1191815978. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:10,363][25689] Avg episode reward: [(0, '0.795')] [2022-07-11 11:03:10,851][26022] Updated weights on worker 0-0, policy_version 1163879 (0.00095) [2022-07-11 11:03:12,858][26022] Updated weights on worker 0-0, policy_version 1163889 (0.00082) [2022-07-11 11:03:14,488][26022] Updated weights on worker 0-0, policy_version 1163899 (0.00083) [2022-07-11 11:03:15,421][25689] Fps is (10 sec: 5349.4, 60 sec: 5508.4, 300 sec: 5509.1). Total num frames: 1191834624. Throughput: 0: 4843.1. Samples: 1191832634. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:15,422][25689] Avg episode reward: [(0, '0.592')] [2022-07-11 11:03:16,392][26022] Updated weights on worker 0-0, policy_version 1163909 (0.00091) [2022-07-11 11:03:18,562][26022] Updated weights on worker 0-0, policy_version 1163919 (0.00087) [2022-07-11 11:03:20,056][26022] Updated weights on worker 0-0, policy_version 1163929 (0.00087) [2022-07-11 11:03:20,441][25689] Fps is (10 sec: 5588.3, 60 sec: 5541.9, 300 sec: 5516.3). Total num frames: 1191864320. Throughput: 0: 5665.4. Samples: 1191865764. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:20,443][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 11:03:22,205][26022] Updated weights on worker 0-0, policy_version 1163939 (0.00091) [2022-07-11 11:03:23,889][26022] Updated weights on worker 0-0, policy_version 1163949 (0.00087) [2022-07-11 11:03:25,446][25689] Fps is (10 sec: 5720.2, 60 sec: 5508.4, 300 sec: 5516.5). Total num frames: 1191891968. Throughput: 0: 5769.7. Samples: 1191899018. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:25,447][25689] Avg episode reward: [(0, '0.285')] [2022-07-11 11:03:25,723][26022] Updated weights on worker 0-0, policy_version 1163959 (0.00092) [2022-07-11 11:03:27,730][26022] Updated weights on worker 0-0, policy_version 1163969 (0.00097) [2022-07-11 11:03:29,341][26022] Updated weights on worker 0-0, policy_version 1163979 (0.00092) [2022-07-11 11:03:30,467][25689] Fps is (10 sec: 5413.0, 60 sec: 5492.2, 300 sec: 5510.0). Total num frames: 1191918592. Throughput: 0: 4957.5. Samples: 1191915646. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:30,468][25689] Avg episode reward: [(0, '0.087')] [2022-07-11 11:03:31,328][26022] Updated weights on worker 0-0, policy_version 1163989 (0.00092) [2022-07-11 11:03:33,491][26022] Updated weights on worker 0-0, policy_version 1163999 (0.00086) [2022-07-11 11:03:34,942][26022] Updated weights on worker 0-0, policy_version 1164009 (0.00091) [2022-07-11 11:03:35,527][25689] Fps is (10 sec: 5587.0, 60 sec: 5528.8, 300 sec: 5523.2). Total num frames: 1191948288. Throughput: 0: 5790.7. Samples: 1191949058. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:35,527][25689] Avg episode reward: [(0, '0.343')] [2022-07-11 11:03:37,036][26022] Updated weights on worker 0-0, policy_version 1164019 (0.00655) [2022-07-11 11:03:38,528][26022] Updated weights on worker 0-0, policy_version 1164029 (0.00102) [2022-07-11 11:03:40,536][25689] Fps is (10 sec: 5593.4, 60 sec: 5499.2, 300 sec: 5516.2). Total num frames: 1191974912. Throughput: 0: 5795.2. Samples: 1191982220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:40,536][25689] Avg episode reward: [(0, '0.120')] [2022-07-11 11:03:40,760][26022] Updated weights on worker 0-0, policy_version 1164039 (0.00088) [2022-07-11 11:03:42,434][26022] Updated weights on worker 0-0, policy_version 1164049 (0.00092) [2022-07-11 11:03:44,446][26022] Updated weights on worker 0-0, policy_version 1164059 (0.00086) [2022-07-11 11:03:45,559][25689] Fps is (10 sec: 5409.9, 60 sec: 5515.8, 300 sec: 5509.1). Total num frames: 1192002560. Throughput: 0: 4967.0. Samples: 1191998916. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:45,559][25689] Avg episode reward: [(0, '0.899')] [2022-07-11 11:03:46,045][26022] Updated weights on worker 0-0, policy_version 1164069 (0.00086) [2022-07-11 11:03:48,202][26022] Updated weights on worker 0-0, policy_version 1164079 (0.00086) [2022-07-11 11:03:49,725][26022] Updated weights on worker 0-0, policy_version 1164089 (0.00091) [2022-07-11 11:03:50,568][25689] Fps is (10 sec: 5511.9, 60 sec: 5520.2, 300 sec: 5514.7). Total num frames: 1192030208. Throughput: 0: 5795.4. Samples: 1192032140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:50,568][25689] Avg episode reward: [(0, '0.996')] [2022-07-11 11:03:51,766][26022] Updated weights on worker 0-0, policy_version 1164099 (0.00091) [2022-07-11 11:03:53,610][26022] Updated weights on worker 0-0, policy_version 1164109 (0.00101) [2022-07-11 11:03:55,405][26022] Updated weights on worker 0-0, policy_version 1164119 (0.00092) [2022-07-11 11:03:55,630][25689] Fps is (10 sec: 5592.1, 60 sec: 5501.9, 300 sec: 5513.6). Total num frames: 1192058880. Throughput: 0: 5764.5. Samples: 1192064944. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:03:55,631][25689] Avg episode reward: [(0, '0.292')] [2022-07-11 11:03:57,310][26022] Updated weights on worker 0-0, policy_version 1164129 (0.00087) [2022-07-11 11:03:59,002][26022] Updated weights on worker 0-0, policy_version 1164139 (0.00082) [2022-07-11 11:04:00,649][25689] Fps is (10 sec: 5587.0, 60 sec: 5505.9, 300 sec: 5520.3). Total num frames: 1192086528. Throughput: 0: 4943.3. Samples: 1192081642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:00,650][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 11:04:00,887][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:04:00,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001164148_1192087552.pth [2022-07-11 11:04:00,899][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001162207_1190099968.pth [2022-07-11 11:04:01,042][26022] Updated weights on worker 0-0, policy_version 1164149 (0.00087) [2022-07-11 11:04:03,342][26022] Updated weights on worker 0-0, policy_version 1164159 (0.00094) [2022-07-11 11:04:04,958][26022] Updated weights on worker 0-0, policy_version 1164169 (0.00086) [2022-07-11 11:04:05,681][25689] Fps is (10 sec: 5196.0, 60 sec: 5475.6, 300 sec: 5510.6). Total num frames: 1192111104. Throughput: 0: 5664.3. Samples: 1192112894. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:05,683][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 11:04:06,892][26022] Updated weights on worker 0-0, policy_version 1164179 (0.00082) [2022-07-11 11:04:08,938][26022] Updated weights on worker 0-0, policy_version 1164189 (0.00089) [2022-07-11 11:04:10,695][25689] Fps is (10 sec: 5300.0, 60 sec: 5499.1, 300 sec: 5509.7). Total num frames: 1192139776. Throughput: 0: 5675.3. Samples: 1192146370. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:10,697][25689] Avg episode reward: [(0, '-1.121')] [2022-07-11 11:04:10,699][26022] Updated weights on worker 0-0, policy_version 1164199 (0.00096) [2022-07-11 11:04:12,538][26022] Updated weights on worker 0-0, policy_version 1164209 (0.00084) [2022-07-11 11:04:14,189][26022] Updated weights on worker 0-0, policy_version 1164219 (0.00089) [2022-07-11 11:04:15,765][25689] Fps is (10 sec: 5686.4, 60 sec: 5532.0, 300 sec: 5515.4). Total num frames: 1192168448. Throughput: 0: 4876.3. Samples: 1192163130. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:15,766][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 11:04:16,154][26022] Updated weights on worker 0-0, policy_version 1164229 (0.00990) [2022-07-11 11:04:18,079][26022] Updated weights on worker 0-0, policy_version 1164239 (0.00080) [2022-07-11 11:04:19,683][26022] Updated weights on worker 0-0, policy_version 1164249 (0.00090) [2022-07-11 11:04:20,776][25689] Fps is (10 sec: 5587.3, 60 sec: 5498.9, 300 sec: 5511.9). Total num frames: 1192196096. Throughput: 0: 5719.5. Samples: 1192196758. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:20,777][25689] Avg episode reward: [(0, '-0.567')] [2022-07-11 11:04:21,709][26022] Updated weights on worker 0-0, policy_version 1164259 (0.00080) [2022-07-11 11:04:23,371][26022] Updated weights on worker 0-0, policy_version 1164269 (0.00091) [2022-07-11 11:04:25,165][26022] Updated weights on worker 0-0, policy_version 1164279 (0.00087) [2022-07-11 11:04:25,862][25689] Fps is (10 sec: 5476.8, 60 sec: 5491.6, 300 sec: 5510.6). Total num frames: 1192223744. Throughput: 0: 5825.7. Samples: 1192230460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:25,862][25689] Avg episode reward: [(0, '0.009')] [2022-07-11 11:04:27,083][26022] Updated weights on worker 0-0, policy_version 1164289 (0.00091) [2022-07-11 11:04:29,000][26022] Updated weights on worker 0-0, policy_version 1164299 (0.00086) [2022-07-11 11:04:30,786][26022] Updated weights on worker 0-0, policy_version 1164309 (0.00091) [2022-07-11 11:04:30,895][25689] Fps is (10 sec: 5565.6, 60 sec: 5524.3, 300 sec: 5509.2). Total num frames: 1192252416. Throughput: 0: 4977.0. Samples: 1192246902. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:30,895][25689] Avg episode reward: [(0, '-0.142')] [2022-07-11 11:04:32,753][26022] Updated weights on worker 0-0, policy_version 1164319 (0.00086) [2022-07-11 11:04:34,323][26022] Updated weights on worker 0-0, policy_version 1164329 (0.00099) [2022-07-11 11:04:35,966][25689] Fps is (10 sec: 5472.7, 60 sec: 5472.5, 300 sec: 5504.6). Total num frames: 1192279040. Throughput: 0: 5792.6. Samples: 1192280142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 18.0) [2022-07-11 11:04:35,968][25689] Avg episode reward: [(0, '0.105')] [2022-07-11 11:04:36,446][26022] Updated weights on worker 0-0, policy_version 1164339 (0.00088) [2022-07-11 11:04:38,415][26022] Updated weights on worker 0-0, policy_version 1164349 (0.00089) [2022-07-11 11:04:40,123][26022] Updated weights on worker 0-0, policy_version 1164359 (0.00091) [2022-07-11 11:04:40,983][25689] Fps is (10 sec: 5582.8, 60 sec: 5522.6, 300 sec: 5507.9). Total num frames: 1192308736. Throughput: 0: 5757.6. Samples: 1192313104. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:04:40,983][25689] Avg episode reward: [(0, '1.168')] [2022-07-11 11:04:42,078][26022] Updated weights on worker 0-0, policy_version 1164369 (0.00097) [2022-07-11 11:04:43,713][26022] Updated weights on worker 0-0, policy_version 1164379 (0.00083) [2022-07-11 11:04:45,590][26022] Updated weights on worker 0-0, policy_version 1164389 (0.00092) [2022-07-11 11:04:45,998][25689] Fps is (10 sec: 5715.7, 60 sec: 5523.3, 300 sec: 5508.1). Total num frames: 1192336384. Throughput: 0: 4947.7. Samples: 1192330090. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:04:45,999][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 11:04:47,427][26022] Updated weights on worker 0-0, policy_version 1164399 (0.00085) [2022-07-11 11:04:49,153][26022] Updated weights on worker 0-0, policy_version 1164409 (0.00812) [2022-07-11 11:04:51,003][25689] Fps is (10 sec: 5518.4, 60 sec: 5523.7, 300 sec: 5510.6). Total num frames: 1192364032. Throughput: 0: 5807.5. Samples: 1192363682. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:04:51,004][25689] Avg episode reward: [(0, '1.051')] [2022-07-11 11:04:51,131][26022] Updated weights on worker 0-0, policy_version 1164419 (0.00097) [2022-07-11 11:04:52,935][26022] Updated weights on worker 0-0, policy_version 1164429 (0.00087) [2022-07-11 11:04:54,660][26022] Updated weights on worker 0-0, policy_version 1164439 (0.00081) [2022-07-11 11:04:56,088][25689] Fps is (10 sec: 5582.1, 60 sec: 5521.6, 300 sec: 5509.7). Total num frames: 1192392704. Throughput: 0: 5837.3. Samples: 1192397602. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:04:56,088][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 11:04:56,683][26022] Updated weights on worker 0-0, policy_version 1164449 (0.00084) [2022-07-11 11:04:58,350][26022] Updated weights on worker 0-0, policy_version 1164459 (0.00097) [2022-07-11 11:05:00,267][26022] Updated weights on worker 0-0, policy_version 1164469 (0.00097) [2022-07-11 11:05:01,161][25689] Fps is (10 sec: 5544.3, 60 sec: 5516.6, 300 sec: 5519.0). Total num frames: 1192420352. Throughput: 0: 5015.6. Samples: 1192414310. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:01,162][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 11:05:02,478][26022] Updated weights on worker 0-0, policy_version 1164479 (0.00093) [2022-07-11 11:05:04,540][26022] Updated weights on worker 0-0, policy_version 1164489 (0.00091) [2022-07-11 11:05:06,164][25689] Fps is (10 sec: 5284.7, 60 sec: 5536.3, 300 sec: 5509.0). Total num frames: 1192445952. Throughput: 0: 5731.3. Samples: 1192445664. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:06,165][25689] Avg episode reward: [(0, '0.988')] [2022-07-11 11:05:06,183][26022] Updated weights on worker 0-0, policy_version 1164499 (0.00089) [2022-07-11 11:05:08,193][26022] Updated weights on worker 0-0, policy_version 1164509 (0.00095) [2022-07-11 11:05:09,791][26022] Updated weights on worker 0-0, policy_version 1164519 (0.00085) [2022-07-11 11:05:11,198][25689] Fps is (10 sec: 5203.4, 60 sec: 5500.6, 300 sec: 5503.4). Total num frames: 1192472576. Throughput: 0: 5701.7. Samples: 1192478826. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:11,199][25689] Avg episode reward: [(0, '0.411')] [2022-07-11 11:05:11,742][26022] Updated weights on worker 0-0, policy_version 1164529 (0.00086) [2022-07-11 11:05:13,476][26022] Updated weights on worker 0-0, policy_version 1164539 (0.00092) [2022-07-11 11:05:15,419][26022] Updated weights on worker 0-0, policy_version 1164549 (0.00086) [2022-07-11 11:05:16,271][25689] Fps is (10 sec: 5673.6, 60 sec: 5534.2, 300 sec: 5513.6). Total num frames: 1192503296. Throughput: 0: 4845.8. Samples: 1192495404. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:16,271][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 11:05:17,431][26022] Updated weights on worker 0-0, policy_version 1164559 (0.00085) [2022-07-11 11:05:19,070][26022] Updated weights on worker 0-0, policy_version 1164569 (0.00097) [2022-07-11 11:05:20,970][26022] Updated weights on worker 0-0, policy_version 1164579 (0.00093) [2022-07-11 11:05:21,354][25689] Fps is (10 sec: 5747.3, 60 sec: 5527.5, 300 sec: 5510.1). Total num frames: 1192530944. Throughput: 0: 5679.0. Samples: 1192528982. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:21,354][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 11:05:23,150][26022] Updated weights on worker 0-0, policy_version 1164589 (0.00086) [2022-07-11 11:05:24,581][26022] Updated weights on worker 0-0, policy_version 1164599 (0.00086) [2022-07-11 11:05:26,389][25689] Fps is (10 sec: 5363.7, 60 sec: 5515.2, 300 sec: 5506.1). Total num frames: 1192557568. Throughput: 0: 5766.7. Samples: 1192562298. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:26,391][25689] Avg episode reward: [(0, '0.634')] [2022-07-11 11:05:26,632][26022] Updated weights on worker 0-0, policy_version 1164609 (0.00087) [2022-07-11 11:05:28,354][26022] Updated weights on worker 0-0, policy_version 1164619 (0.00086) [2022-07-11 11:05:30,338][26022] Updated weights on worker 0-0, policy_version 1164629 (0.00091) [2022-07-11 11:05:31,393][25689] Fps is (10 sec: 5610.0, 60 sec: 5534.9, 300 sec: 5511.7). Total num frames: 1192587264. Throughput: 0: 5783.9. Samples: 1192595632. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:31,393][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 11:05:32,041][26022] Updated weights on worker 0-0, policy_version 1164639 (0.00083) [2022-07-11 11:05:33,991][26022] Updated weights on worker 0-0, policy_version 1164649 (0.00087) [2022-07-11 11:05:35,661][26022] Updated weights on worker 0-0, policy_version 1164659 (0.00089) [2022-07-11 11:05:36,451][25689] Fps is (10 sec: 5597.4, 60 sec: 5536.0, 300 sec: 5510.8). Total num frames: 1192613888. Throughput: 0: 5795.9. Samples: 1192612366. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:36,453][25689] Avg episode reward: [(0, '0.817')] [2022-07-11 11:05:37,502][26022] Updated weights on worker 0-0, policy_version 1164669 (0.00096) [2022-07-11 11:05:39,550][26022] Updated weights on worker 0-0, policy_version 1164679 (0.00088) [2022-07-11 11:05:41,196][26022] Updated weights on worker 0-0, policy_version 1164689 (0.00093) [2022-07-11 11:05:41,460][25689] Fps is (10 sec: 5594.4, 60 sec: 5536.8, 300 sec: 5514.4). Total num frames: 1192643584. Throughput: 0: 5819.6. Samples: 1192645992. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:41,462][25689] Avg episode reward: [(0, '1.273')] [2022-07-11 11:05:42,942][26022] Updated weights on worker 0-0, policy_version 1164699 (0.00087) [2022-07-11 11:05:44,839][26022] Updated weights on worker 0-0, policy_version 1164709 (0.00086) [2022-07-11 11:05:46,479][25689] Fps is (10 sec: 5718.5, 60 sec: 5536.5, 300 sec: 5511.0). Total num frames: 1192671232. Throughput: 0: 5846.1. Samples: 1192679744. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:46,481][25689] Avg episode reward: [(0, '1.176')] [2022-07-11 11:05:46,839][26022] Updated weights on worker 0-0, policy_version 1164719 (0.00081) [2022-07-11 11:05:48,592][26022] Updated weights on worker 0-0, policy_version 1164729 (0.00089) [2022-07-11 11:05:50,273][26022] Updated weights on worker 0-0, policy_version 1164739 (0.00091) [2022-07-11 11:05:51,513][25689] Fps is (10 sec: 5500.3, 60 sec: 5533.8, 300 sec: 5519.5). Total num frames: 1192698880. Throughput: 0: 5012.9. Samples: 1192696494. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:51,514][25689] Avg episode reward: [(0, '1.437')] [2022-07-11 11:05:52,282][26022] Updated weights on worker 0-0, policy_version 1164749 (0.00049) [2022-07-11 11:05:54,081][26022] Updated weights on worker 0-0, policy_version 1164759 (0.00097) [2022-07-11 11:05:55,848][26022] Updated weights on worker 0-0, policy_version 1164769 (0.00082) [2022-07-11 11:05:56,654][25689] Fps is (10 sec: 5535.2, 60 sec: 5528.7, 300 sec: 5510.3). Total num frames: 1192727552. Throughput: 0: 5820.2. Samples: 1192729950. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:05:56,654][25689] Avg episode reward: [(0, '0.601')] [2022-07-11 11:05:57,747][26022] Updated weights on worker 0-0, policy_version 1164779 (0.00087) [2022-07-11 11:05:59,423][26022] Updated weights on worker 0-0, policy_version 1164789 (0.00089) [2022-07-11 11:06:00,981][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:06:00,997][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001164797_1192752128.pth [2022-07-11 11:06:00,997][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001162855_1190763520.pth [2022-07-11 11:06:01,579][26022] Updated weights on worker 0-0, policy_version 1164799 (0.00083) [2022-07-11 11:06:01,695][25689] Fps is (10 sec: 5431.1, 60 sec: 5514.7, 300 sec: 5517.4). Total num frames: 1192754176. Throughput: 0: 5817.1. Samples: 1192763700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:01,695][25689] Avg episode reward: [(0, '0.306')] [2022-07-11 11:06:03,471][26022] Updated weights on worker 0-0, policy_version 1164809 (0.00084) [2022-07-11 11:06:05,379][26022] Updated weights on worker 0-0, policy_version 1164819 (0.00086) [2022-07-11 11:06:06,723][25689] Fps is (10 sec: 5491.7, 60 sec: 5563.1, 300 sec: 5524.4). Total num frames: 1192782848. Throughput: 0: 4886.1. Samples: 1192778658. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:06,723][25689] Avg episode reward: [(0, '0.315')] [2022-07-11 11:06:07,243][26022] Updated weights on worker 0-0, policy_version 1164829 (0.00085) [2022-07-11 11:06:09,027][26022] Updated weights on worker 0-0, policy_version 1164839 (0.00090) [2022-07-11 11:06:10,974][26022] Updated weights on worker 0-0, policy_version 1164849 (0.00088) [2022-07-11 11:06:11,772][25689] Fps is (10 sec: 5588.9, 60 sec: 5578.6, 300 sec: 5520.9). Total num frames: 1192810496. Throughput: 0: 5711.3. Samples: 1192812198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:11,772][25689] Avg episode reward: [(0, '0.417')] [2022-07-11 11:06:12,873][26022] Updated weights on worker 0-0, policy_version 1164859 (0.00095) [2022-07-11 11:06:14,431][26022] Updated weights on worker 0-0, policy_version 1164869 (0.00086) [2022-07-11 11:06:16,462][26022] Updated weights on worker 0-0, policy_version 1164879 (0.00086) [2022-07-11 11:06:16,826][25689] Fps is (10 sec: 5473.3, 60 sec: 5529.7, 300 sec: 5520.2). Total num frames: 1192838144. Throughput: 0: 5753.2. Samples: 1192846002. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:16,826][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 11:06:18,151][26022] Updated weights on worker 0-0, policy_version 1164889 (0.00092) [2022-07-11 11:06:20,011][26022] Updated weights on worker 0-0, policy_version 1164899 (0.00079) [2022-07-11 11:06:21,806][26022] Updated weights on worker 0-0, policy_version 1164909 (0.00090) [2022-07-11 11:06:21,884][25689] Fps is (10 sec: 5670.7, 60 sec: 5565.7, 300 sec: 5522.5). Total num frames: 1192867840. Throughput: 0: 4911.7. Samples: 1192862866. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:21,885][25689] Avg episode reward: [(0, '-0.032')] [2022-07-11 11:06:23,718][26022] Updated weights on worker 0-0, policy_version 1164919 (0.00095) [2022-07-11 11:06:25,427][26022] Updated weights on worker 0-0, policy_version 1164929 (0.00083) [2022-07-11 11:06:26,889][25689] Fps is (10 sec: 5596.6, 60 sec: 5568.6, 300 sec: 5519.2). Total num frames: 1192894464. Throughput: 0: 5829.4. Samples: 1192896214. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:26,889][25689] Avg episode reward: [(0, '0.459')] [2022-07-11 11:06:27,483][26022] Updated weights on worker 0-0, policy_version 1164939 (0.00087) [2022-07-11 11:06:29,094][26022] Updated weights on worker 0-0, policy_version 1164949 (0.00088) [2022-07-11 11:06:31,032][26022] Updated weights on worker 0-0, policy_version 1164959 (0.00092) [2022-07-11 11:06:31,905][25689] Fps is (10 sec: 5416.1, 60 sec: 5533.6, 300 sec: 5521.3). Total num frames: 1192922112. Throughput: 0: 5832.7. Samples: 1192929626. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:31,905][25689] Avg episode reward: [(0, '0.274')] [2022-07-11 11:06:32,782][26022] Updated weights on worker 0-0, policy_version 1164969 (0.00086) [2022-07-11 11:06:34,719][26022] Updated weights on worker 0-0, policy_version 1164979 (0.00086) [2022-07-11 11:06:36,557][26022] Updated weights on worker 0-0, policy_version 1164989 (0.00091) [2022-07-11 11:06:37,010][25689] Fps is (10 sec: 5564.4, 60 sec: 5563.1, 300 sec: 5520.4). Total num frames: 1192950784. Throughput: 0: 4972.9. Samples: 1192946378. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:37,011][25689] Avg episode reward: [(0, '-0.674')] [2022-07-11 11:06:38,284][26022] Updated weights on worker 0-0, policy_version 1164999 (0.00091) [2022-07-11 11:06:40,232][26022] Updated weights on worker 0-0, policy_version 1165009 (0.00093) [2022-07-11 11:06:42,082][25689] Fps is (10 sec: 5533.7, 60 sec: 5523.5, 300 sec: 5519.4). Total num frames: 1192978432. Throughput: 0: 5787.4. Samples: 1192979760. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:42,083][25689] Avg episode reward: [(0, '-0.792')] [2022-07-11 11:06:42,197][26022] Updated weights on worker 0-0, policy_version 1165019 (0.00086) [2022-07-11 11:06:43,741][26022] Updated weights on worker 0-0, policy_version 1165029 (0.00093) [2022-07-11 11:06:45,926][26022] Updated weights on worker 0-0, policy_version 1165039 (0.00084) [2022-07-11 11:06:47,113][25689] Fps is (10 sec: 5676.4, 60 sec: 5556.2, 300 sec: 5529.7). Total num frames: 1193008128. Throughput: 0: 5797.4. Samples: 1193013458. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:47,113][25689] Avg episode reward: [(0, '-0.405')] [2022-07-11 11:06:47,454][26022] Updated weights on worker 0-0, policy_version 1165049 (0.00082) [2022-07-11 11:06:49,471][26022] Updated weights on worker 0-0, policy_version 1165059 (0.00080) [2022-07-11 11:06:51,143][26022] Updated weights on worker 0-0, policy_version 1165069 (0.00082) [2022-07-11 11:06:52,169][25689] Fps is (10 sec: 5684.9, 60 sec: 5554.2, 300 sec: 5519.8). Total num frames: 1193035776. Throughput: 0: 4969.2. Samples: 1193030326. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:52,170][25689] Avg episode reward: [(0, '-0.137')] [2022-07-11 11:06:53,053][26022] Updated weights on worker 0-0, policy_version 1165079 (0.00091) [2022-07-11 11:06:54,821][26022] Updated weights on worker 0-0, policy_version 1165089 (0.00084) [2022-07-11 11:06:56,573][26022] Updated weights on worker 0-0, policy_version 1165099 (0.00086) [2022-07-11 11:06:57,279][25689] Fps is (10 sec: 5439.2, 60 sec: 5540.1, 300 sec: 5525.3). Total num frames: 1193063424. Throughput: 0: 5801.5. Samples: 1193063966. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:06:57,279][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 11:06:58,434][26022] Updated weights on worker 0-0, policy_version 1165109 (0.00084) [2022-07-11 11:07:00,327][26022] Updated weights on worker 0-0, policy_version 1165119 (0.00083) [2022-07-11 11:07:02,324][25689] Fps is (10 sec: 5344.8, 60 sec: 5539.8, 300 sec: 5521.7). Total num frames: 1193090048. Throughput: 0: 5770.7. Samples: 1193096566. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:02,324][25689] Avg episode reward: [(0, '0.088')] [2022-07-11 11:07:02,566][26022] Updated weights on worker 0-0, policy_version 1165129 (0.00091) [2022-07-11 11:07:04,397][26022] Updated weights on worker 0-0, policy_version 1165139 (0.00081) [2022-07-11 11:07:06,350][26022] Updated weights on worker 0-0, policy_version 1165149 (0.00093) [2022-07-11 11:07:07,352][25689] Fps is (10 sec: 5387.7, 60 sec: 5522.9, 300 sec: 5521.6). Total num frames: 1193117696. Throughput: 0: 4886.3. Samples: 1193112354. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:07,353][25689] Avg episode reward: [(0, '0.921')] [2022-07-11 11:07:08,092][26022] Updated weights on worker 0-0, policy_version 1165159 (0.00092) [2022-07-11 11:07:10,124][26022] Updated weights on worker 0-0, policy_version 1165169 (0.00089) [2022-07-11 11:07:11,876][26022] Updated weights on worker 0-0, policy_version 1165179 (0.00089) [2022-07-11 11:07:12,387][25689] Fps is (10 sec: 5494.8, 60 sec: 5524.2, 300 sec: 5526.6). Total num frames: 1193145344. Throughput: 0: 5689.6. Samples: 1193145356. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:12,387][25689] Avg episode reward: [(0, '1.052')] [2022-07-11 11:07:13,670][26022] Updated weights on worker 0-0, policy_version 1165189 (0.00930) [2022-07-11 11:07:15,677][26022] Updated weights on worker 0-0, policy_version 1165199 (0.00086) [2022-07-11 11:07:17,293][26022] Updated weights on worker 0-0, policy_version 1165209 (0.00096) [2022-07-11 11:07:17,438][25689] Fps is (10 sec: 5685.5, 60 sec: 5558.2, 300 sec: 5532.8). Total num frames: 1193175040. Throughput: 0: 5682.0. Samples: 1193178512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:17,439][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 11:07:19,439][26022] Updated weights on worker 0-0, policy_version 1165219 (0.00087) [2022-07-11 11:07:20,915][26022] Updated weights on worker 0-0, policy_version 1165229 (0.00083) [2022-07-11 11:07:22,463][25689] Fps is (10 sec: 5589.5, 60 sec: 5510.6, 300 sec: 5522.2). Total num frames: 1193201664. Throughput: 0: 4902.9. Samples: 1193195308. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:22,463][25689] Avg episode reward: [(0, '1.018')] [2022-07-11 11:07:22,922][26022] Updated weights on worker 0-0, policy_version 1165239 (0.00093) [2022-07-11 11:07:24,660][26022] Updated weights on worker 0-0, policy_version 1165249 (0.00084) [2022-07-11 11:07:26,543][26022] Updated weights on worker 0-0, policy_version 1165259 (0.00093) [2022-07-11 11:07:27,491][25689] Fps is (10 sec: 5398.9, 60 sec: 5525.4, 300 sec: 5522.2). Total num frames: 1193229312. Throughput: 0: 5792.4. Samples: 1193229004. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:27,491][25689] Avg episode reward: [(0, '1.091')] [2022-07-11 11:07:28,458][26022] Updated weights on worker 0-0, policy_version 1165269 (0.00086) [2022-07-11 11:07:30,260][26022] Updated weights on worker 0-0, policy_version 1165279 (0.00078) [2022-07-11 11:07:31,987][26022] Updated weights on worker 0-0, policy_version 1165289 (0.00091) [2022-07-11 11:07:32,516][25689] Fps is (10 sec: 5602.0, 60 sec: 5541.4, 300 sec: 5526.9). Total num frames: 1193257984. Throughput: 0: 5804.1. Samples: 1193262190. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:32,517][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 11:07:34,058][26022] Updated weights on worker 0-0, policy_version 1165299 (0.00082) [2022-07-11 11:07:35,583][26022] Updated weights on worker 0-0, policy_version 1165309 (0.00093) [2022-07-11 11:07:37,615][25689] Fps is (10 sec: 5563.0, 60 sec: 5525.1, 300 sec: 5522.6). Total num frames: 1193285632. Throughput: 0: 5803.3. Samples: 1193295602. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:37,615][25689] Avg episode reward: [(0, '0.835')] [2022-07-11 11:07:37,763][26022] Updated weights on worker 0-0, policy_version 1165319 (0.00098) [2022-07-11 11:07:39,258][26022] Updated weights on worker 0-0, policy_version 1165329 (0.00088) [2022-07-11 11:07:41,318][26022] Updated weights on worker 0-0, policy_version 1165339 (0.00090) [2022-07-11 11:07:42,650][25689] Fps is (10 sec: 5557.8, 60 sec: 5545.4, 300 sec: 5529.2). Total num frames: 1193314304. Throughput: 0: 5802.3. Samples: 1193312440. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:42,651][25689] Avg episode reward: [(0, '0.622')] [2022-07-11 11:07:42,911][26022] Updated weights on worker 0-0, policy_version 1165349 (0.00085) [2022-07-11 11:07:45,160][26022] Updated weights on worker 0-0, policy_version 1165359 (0.00089) [2022-07-11 11:07:46,811][26022] Updated weights on worker 0-0, policy_version 1165369 (0.00098) [2022-07-11 11:07:47,675][25689] Fps is (10 sec: 5699.9, 60 sec: 5529.0, 300 sec: 5533.3). Total num frames: 1193342976. Throughput: 0: 5779.3. Samples: 1193345656. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:47,676][25689] Avg episode reward: [(0, '-0.013')] [2022-07-11 11:07:48,799][26022] Updated weights on worker 0-0, policy_version 1165379 (0.00086) [2022-07-11 11:07:50,467][26022] Updated weights on worker 0-0, policy_version 1165389 (0.00089) [2022-07-11 11:07:52,344][26022] Updated weights on worker 0-0, policy_version 1165399 (0.00103) [2022-07-11 11:07:52,744][25689] Fps is (10 sec: 5478.0, 60 sec: 5510.9, 300 sec: 5522.5). Total num frames: 1193369600. Throughput: 0: 5794.5. Samples: 1193379400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:52,745][25689] Avg episode reward: [(0, '-1.169')] [2022-07-11 11:07:54,313][26022] Updated weights on worker 0-0, policy_version 1165409 (0.00080) [2022-07-11 11:07:56,011][26022] Updated weights on worker 0-0, policy_version 1165419 (0.00091) [2022-07-11 11:07:57,752][26022] Updated weights on worker 0-0, policy_version 1165429 (0.00087) [2022-07-11 11:07:57,795][25689] Fps is (10 sec: 5565.3, 60 sec: 5550.1, 300 sec: 5529.6). Total num frames: 1193399296. Throughput: 0: 4982.3. Samples: 1193396146. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:07:57,795][25689] Avg episode reward: [(0, '-0.948')] [2022-07-11 11:07:59,628][26022] Updated weights on worker 0-0, policy_version 1165439 (0.00085) [2022-07-11 11:08:01,018][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:08:01,031][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001165446_1193416704.pth [2022-07-11 11:08:01,032][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001163502_1191426048.pth [2022-07-11 11:08:01,509][26022] Updated weights on worker 0-0, policy_version 1165449 (0.00084) [2022-07-11 11:08:02,804][25689] Fps is (10 sec: 5496.8, 60 sec: 5536.5, 300 sec: 5527.4). Total num frames: 1193424896. Throughput: 0: 5825.6. Samples: 1193429848. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:02,804][25689] Avg episode reward: [(0, '-0.671')] [2022-07-11 11:08:03,905][26022] Updated weights on worker 0-0, policy_version 1165459 (0.00095) [2022-07-11 11:08:05,307][26022] Updated weights on worker 0-0, policy_version 1165469 (0.00885) [2022-07-11 11:08:07,391][26022] Updated weights on worker 0-0, policy_version 1165479 (0.00088) [2022-07-11 11:08:07,805][25689] Fps is (10 sec: 5217.2, 60 sec: 5522.1, 300 sec: 5525.5). Total num frames: 1193451520. Throughput: 0: 5769.1. Samples: 1193461786. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:07,807][25689] Avg episode reward: [(0, '-0.556')] [2022-07-11 11:08:08,988][26022] Updated weights on worker 0-0, policy_version 1165489 (0.00080) [2022-07-11 11:08:11,064][26022] Updated weights on worker 0-0, policy_version 1165499 (0.00089) [2022-07-11 11:08:12,807][25689] Fps is (10 sec: 5527.8, 60 sec: 5542.0, 300 sec: 5533.4). Total num frames: 1193480192. Throughput: 0: 4944.5. Samples: 1193478602. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:12,809][25689] Avg episode reward: [(0, '-0.678')] [2022-07-11 11:08:12,865][26022] Updated weights on worker 0-0, policy_version 1165509 (0.00086) [2022-07-11 11:08:14,699][26022] Updated weights on worker 0-0, policy_version 1165519 (0.00085) [2022-07-11 11:08:16,545][26022] Updated weights on worker 0-0, policy_version 1165529 (0.00090) [2022-07-11 11:08:17,939][25689] Fps is (10 sec: 5658.6, 60 sec: 5517.7, 300 sec: 5527.9). Total num frames: 1193508864. Throughput: 0: 5754.5. Samples: 1193512064. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:17,941][25689] Avg episode reward: [(0, '0.283')] [2022-07-11 11:08:18,399][26022] Updated weights on worker 0-0, policy_version 1165539 (0.00099) [2022-07-11 11:08:20,047][26022] Updated weights on worker 0-0, policy_version 1165549 (0.00092) [2022-07-11 11:08:22,203][26022] Updated weights on worker 0-0, policy_version 1165559 (0.00086) [2022-07-11 11:08:23,036][25689] Fps is (10 sec: 5606.0, 60 sec: 5544.9, 300 sec: 5529.7). Total num frames: 1193537536. Throughput: 0: 5718.0. Samples: 1193545536. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:23,037][25689] Avg episode reward: [(0, '-0.996')] [2022-07-11 11:08:23,664][26022] Updated weights on worker 0-0, policy_version 1165569 (0.00091) [2022-07-11 11:08:25,938][26022] Updated weights on worker 0-0, policy_version 1165579 (0.00094) [2022-07-11 11:08:27,435][26022] Updated weights on worker 0-0, policy_version 1165589 (0.00085) [2022-07-11 11:08:28,052][25689] Fps is (10 sec: 5568.7, 60 sec: 5545.9, 300 sec: 5533.2). Total num frames: 1193565184. Throughput: 0: 4959.2. Samples: 1193562198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:28,054][25689] Avg episode reward: [(0, '-0.693')] [2022-07-11 11:08:29,475][26022] Updated weights on worker 0-0, policy_version 1165599 (0.00085) [2022-07-11 11:08:31,401][26022] Updated weights on worker 0-0, policy_version 1165609 (0.00083) [2022-07-11 11:08:33,063][25689] Fps is (10 sec: 5514.9, 60 sec: 5530.4, 300 sec: 5527.2). Total num frames: 1193592832. Throughput: 0: 5769.4. Samples: 1193595464. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:33,064][25689] Avg episode reward: [(0, '-0.855')] [2022-07-11 11:08:33,194][26022] Updated weights on worker 0-0, policy_version 1165619 (0.00097) [2022-07-11 11:08:34,953][26022] Updated weights on worker 0-0, policy_version 1165629 (0.00092) [2022-07-11 11:08:36,872][26022] Updated weights on worker 0-0, policy_version 1165639 (0.00087) [2022-07-11 11:08:38,141][25689] Fps is (10 sec: 5582.8, 60 sec: 5549.2, 300 sec: 5532.8). Total num frames: 1193621504. Throughput: 0: 5777.2. Samples: 1193628774. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 11:08:38,141][25689] Avg episode reward: [(0, '-1.157')] [2022-07-11 11:08:38,471][26022] Updated weights on worker 0-0, policy_version 1165649 (0.00087) [2022-07-11 11:08:40,515][26022] Updated weights on worker 0-0, policy_version 1165659 (0.00088) [2022-07-11 11:08:42,157][26022] Updated weights on worker 0-0, policy_version 1165669 (0.00097) [2022-07-11 11:08:43,218][25689] Fps is (10 sec: 5546.1, 60 sec: 5528.5, 300 sec: 5531.8). Total num frames: 1193649152. Throughput: 0: 4946.9. Samples: 1193645372. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:08:43,218][25689] Avg episode reward: [(0, '-0.703')] [2022-07-11 11:08:44,374][26022] Updated weights on worker 0-0, policy_version 1165679 (0.00081) [2022-07-11 11:08:45,916][26022] Updated weights on worker 0-0, policy_version 1165689 (0.00084) [2022-07-11 11:08:47,965][26022] Updated weights on worker 0-0, policy_version 1165699 (0.00081) [2022-07-11 11:08:48,295][25689] Fps is (10 sec: 5445.5, 60 sec: 5506.8, 300 sec: 5530.5). Total num frames: 1193676800. Throughput: 0: 5760.1. Samples: 1193678796. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:08:48,295][25689] Avg episode reward: [(0, '-1.171')] [2022-07-11 11:08:49,700][26022] Updated weights on worker 0-0, policy_version 1165709 (0.00091) [2022-07-11 11:08:51,667][26022] Updated weights on worker 0-0, policy_version 1165719 (0.00099) [2022-07-11 11:08:53,350][25689] Fps is (10 sec: 5558.6, 60 sec: 5541.9, 300 sec: 5530.7). Total num frames: 1193705472. Throughput: 0: 5737.5. Samples: 1193711860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:08:53,350][25689] Avg episode reward: [(0, '-0.038')] [2022-07-11 11:08:53,400][26022] Updated weights on worker 0-0, policy_version 1165729 (0.00094) [2022-07-11 11:08:55,551][26022] Updated weights on worker 0-0, policy_version 1165739 (0.00090) [2022-07-11 11:08:56,992][26022] Updated weights on worker 0-0, policy_version 1165749 (0.00085) [2022-07-11 11:08:58,405][25689] Fps is (10 sec: 5570.9, 60 sec: 5507.8, 300 sec: 5530.0). Total num frames: 1193733120. Throughput: 0: 4925.4. Samples: 1193728582. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:08:58,405][25689] Avg episode reward: [(0, '-1.091')] [2022-07-11 11:08:59,070][26022] Updated weights on worker 0-0, policy_version 1165759 (0.00095) [2022-07-11 11:09:00,902][26022] Updated weights on worker 0-0, policy_version 1165769 (0.00087) [2022-07-11 11:09:03,131][26022] Updated weights on worker 0-0, policy_version 1165779 (0.00086) [2022-07-11 11:09:03,411][25689] Fps is (10 sec: 5292.5, 60 sec: 5508.0, 300 sec: 5533.9). Total num frames: 1193758720. Throughput: 0: 5772.5. Samples: 1193761936. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:03,411][25689] Avg episode reward: [(0, '-0.737')] [2022-07-11 11:09:05,025][26022] Updated weights on worker 0-0, policy_version 1165789 (0.00085) [2022-07-11 11:09:06,887][26022] Updated weights on worker 0-0, policy_version 1165799 (0.00079) [2022-07-11 11:09:08,392][26022] Updated weights on worker 0-0, policy_version 1165809 (0.00093) [2022-07-11 11:09:08,433][25689] Fps is (10 sec: 5513.8, 60 sec: 5556.7, 300 sec: 5537.2). Total num frames: 1193788416. Throughput: 0: 5695.1. Samples: 1193793486. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:08,434][25689] Avg episode reward: [(0, '-0.955')] [2022-07-11 11:09:10,602][26022] Updated weights on worker 0-0, policy_version 1165819 (0.00087) [2022-07-11 11:09:12,076][26022] Updated weights on worker 0-0, policy_version 1165829 (0.00091) [2022-07-11 11:09:13,457][25689] Fps is (10 sec: 5504.0, 60 sec: 5504.1, 300 sec: 5527.8). Total num frames: 1193814016. Throughput: 0: 4894.9. Samples: 1193810284. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:13,458][25689] Avg episode reward: [(0, '-0.216')] [2022-07-11 11:09:14,368][26022] Updated weights on worker 0-0, policy_version 1165839 (0.00092) [2022-07-11 11:09:15,788][26022] Updated weights on worker 0-0, policy_version 1165849 (0.00089) [2022-07-11 11:09:17,826][26022] Updated weights on worker 0-0, policy_version 1165859 (0.00090) [2022-07-11 11:09:18,514][25689] Fps is (10 sec: 5383.8, 60 sec: 5510.9, 300 sec: 5530.3). Total num frames: 1193842688. Throughput: 0: 5710.4. Samples: 1193843414. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:18,514][25689] Avg episode reward: [(0, '0.062')] [2022-07-11 11:09:19,795][26022] Updated weights on worker 0-0, policy_version 1165869 (0.00050) [2022-07-11 11:09:21,559][26022] Updated weights on worker 0-0, policy_version 1165879 (0.00089) [2022-07-11 11:09:23,427][26022] Updated weights on worker 0-0, policy_version 1165889 (0.00086) [2022-07-11 11:09:23,571][25689] Fps is (10 sec: 5670.0, 60 sec: 5514.6, 300 sec: 5534.3). Total num frames: 1193871360. Throughput: 0: 5694.2. Samples: 1193876732. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:23,571][25689] Avg episode reward: [(0, '0.033')] [2022-07-11 11:09:25,214][26022] Updated weights on worker 0-0, policy_version 1165899 (0.00083) [2022-07-11 11:09:26,982][26022] Updated weights on worker 0-0, policy_version 1165909 (0.00053) [2022-07-11 11:09:28,575][25689] Fps is (10 sec: 5495.9, 60 sec: 5498.8, 300 sec: 5528.0). Total num frames: 1193897984. Throughput: 0: 4966.0. Samples: 1193893512. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:28,576][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 11:09:29,053][26022] Updated weights on worker 0-0, policy_version 1165919 (0.00091) [2022-07-11 11:09:30,578][26022] Updated weights on worker 0-0, policy_version 1165929 (0.00085) [2022-07-11 11:09:32,760][26022] Updated weights on worker 0-0, policy_version 1165939 (0.00093) [2022-07-11 11:09:33,601][25689] Fps is (10 sec: 5614.9, 60 sec: 5531.2, 300 sec: 5539.1). Total num frames: 1193927680. Throughput: 0: 5794.0. Samples: 1193926998. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:33,602][25689] Avg episode reward: [(0, '-0.154')] [2022-07-11 11:09:34,251][26022] Updated weights on worker 0-0, policy_version 1165949 (0.00088) [2022-07-11 11:09:36,204][26022] Updated weights on worker 0-0, policy_version 1165959 (0.00089) [2022-07-11 11:09:38,157][26022] Updated weights on worker 0-0, policy_version 1165969 (0.00587) [2022-07-11 11:09:38,684][25689] Fps is (10 sec: 5571.2, 60 sec: 5496.8, 300 sec: 5527.6). Total num frames: 1193954304. Throughput: 0: 5799.7. Samples: 1193960396. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:38,685][25689] Avg episode reward: [(0, '-0.099')] [2022-07-11 11:09:39,715][26022] Updated weights on worker 0-0, policy_version 1165979 (0.00089) [2022-07-11 11:09:41,906][26022] Updated weights on worker 0-0, policy_version 1165989 (0.00079) [2022-07-11 11:09:43,532][26022] Updated weights on worker 0-0, policy_version 1165999 (0.00089) [2022-07-11 11:09:43,747][25689] Fps is (10 sec: 5450.3, 60 sec: 5515.1, 300 sec: 5530.1). Total num frames: 1193982976. Throughput: 0: 4972.7. Samples: 1193977060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:43,748][25689] Avg episode reward: [(0, '-0.786')] [2022-07-11 11:09:45,516][26022] Updated weights on worker 0-0, policy_version 1166009 (0.00084) [2022-07-11 11:09:47,371][26022] Updated weights on worker 0-0, policy_version 1166019 (0.00089) [2022-07-11 11:09:48,761][25689] Fps is (10 sec: 5487.7, 60 sec: 5503.9, 300 sec: 5526.5). Total num frames: 1194009600. Throughput: 0: 5779.8. Samples: 1194010180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:48,761][25689] Avg episode reward: [(0, '-0.850')] [2022-07-11 11:09:49,154][26022] Updated weights on worker 0-0, policy_version 1166029 (0.00092) [2022-07-11 11:09:50,959][26022] Updated weights on worker 0-0, policy_version 1166039 (0.00087) [2022-07-11 11:09:52,829][26022] Updated weights on worker 0-0, policy_version 1166049 (0.00085) [2022-07-11 11:09:53,799][25689] Fps is (10 sec: 5704.4, 60 sec: 5539.3, 300 sec: 5534.3). Total num frames: 1194040320. Throughput: 0: 5775.8. Samples: 1194043658. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:53,800][25689] Avg episode reward: [(0, '-0.974')] [2022-07-11 11:09:54,827][26022] Updated weights on worker 0-0, policy_version 1166059 (0.00086) [2022-07-11 11:09:56,602][26022] Updated weights on worker 0-0, policy_version 1166069 (0.00090) [2022-07-11 11:09:58,525][26022] Updated weights on worker 0-0, policy_version 1166079 (0.00090) [2022-07-11 11:09:58,878][25689] Fps is (10 sec: 5566.7, 60 sec: 5503.2, 300 sec: 5527.3). Total num frames: 1194065920. Throughput: 0: 4943.2. Samples: 1194060218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:09:58,879][25689] Avg episode reward: [(0, '-1.183')] [2022-07-11 11:10:00,130][26022] Updated weights on worker 0-0, policy_version 1166089 (0.00088) [2022-07-11 11:10:01,261][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:10:01,273][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001166094_1194080256.pth [2022-07-11 11:10:01,273][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001164148_1192087552.pth [2022-07-11 11:10:02,475][26022] Updated weights on worker 0-0, policy_version 1166099 (0.00096) [2022-07-11 11:10:03,919][25689] Fps is (10 sec: 5160.4, 60 sec: 5517.0, 300 sec: 5530.0). Total num frames: 1194092544. Throughput: 0: 5708.6. Samples: 1194092218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:03,920][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 11:10:04,456][26022] Updated weights on worker 0-0, policy_version 1166109 (0.00092) [2022-07-11 11:10:06,244][26022] Updated weights on worker 0-0, policy_version 1166119 (0.00088) [2022-07-11 11:10:08,116][26022] Updated weights on worker 0-0, policy_version 1166129 (0.00092) [2022-07-11 11:10:08,950][25689] Fps is (10 sec: 5388.4, 60 sec: 5482.4, 300 sec: 5533.5). Total num frames: 1194120192. Throughput: 0: 5674.4. Samples: 1194124742. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:08,951][25689] Avg episode reward: [(0, '0.681')] [2022-07-11 11:10:09,832][26022] Updated weights on worker 0-0, policy_version 1166139 (0.00081) [2022-07-11 11:10:11,707][26022] Updated weights on worker 0-0, policy_version 1166149 (0.00091) [2022-07-11 11:10:13,764][26022] Updated weights on worker 0-0, policy_version 1166159 (0.00091) [2022-07-11 11:10:13,965][25689] Fps is (10 sec: 5504.5, 60 sec: 5517.0, 300 sec: 5524.3). Total num frames: 1194147840. Throughput: 0: 4847.7. Samples: 1194141416. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:13,972][25689] Avg episode reward: [(0, '1.177')] [2022-07-11 11:10:15,410][26022] Updated weights on worker 0-0, policy_version 1166169 (0.00785) [2022-07-11 11:10:17,487][26022] Updated weights on worker 0-0, policy_version 1166179 (0.00094) [2022-07-11 11:10:19,028][25689] Fps is (10 sec: 5588.6, 60 sec: 5516.5, 300 sec: 5528.1). Total num frames: 1194176512. Throughput: 0: 5677.0. Samples: 1194174608. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:19,028][25689] Avg episode reward: [(0, '1.051')] [2022-07-11 11:10:19,178][26022] Updated weights on worker 0-0, policy_version 1166189 (0.00095) [2022-07-11 11:10:21,142][26022] Updated weights on worker 0-0, policy_version 1166199 (0.00081) [2022-07-11 11:10:22,795][26022] Updated weights on worker 0-0, policy_version 1166209 (0.00088) [2022-07-11 11:10:24,034][25689] Fps is (10 sec: 5593.5, 60 sec: 5504.2, 300 sec: 5532.1). Total num frames: 1194204160. Throughput: 0: 5749.1. Samples: 1194207858. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:24,034][25689] Avg episode reward: [(0, '0.872')] [2022-07-11 11:10:24,730][26022] Updated weights on worker 0-0, policy_version 1166219 (0.00088) [2022-07-11 11:10:26,507][26022] Updated weights on worker 0-0, policy_version 1166229 (0.00088) [2022-07-11 11:10:28,524][26022] Updated weights on worker 0-0, policy_version 1166239 (0.00093) [2022-07-11 11:10:29,053][25689] Fps is (10 sec: 5413.6, 60 sec: 5502.9, 300 sec: 5521.5). Total num frames: 1194230784. Throughput: 0: 5792.9. Samples: 1194241194. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:29,053][25689] Avg episode reward: [(0, '-0.125')] [2022-07-11 11:10:30,240][26022] Updated weights on worker 0-0, policy_version 1166249 (0.00084) [2022-07-11 11:10:32,132][26022] Updated weights on worker 0-0, policy_version 1166259 (0.00086) [2022-07-11 11:10:34,008][26022] Updated weights on worker 0-0, policy_version 1166269 (0.00083) [2022-07-11 11:10:34,091][25689] Fps is (10 sec: 5498.3, 60 sec: 5484.9, 300 sec: 5528.7). Total num frames: 1194259456. Throughput: 0: 5786.2. Samples: 1194257868. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:34,091][25689] Avg episode reward: [(0, '-0.245')] [2022-07-11 11:10:35,972][26022] Updated weights on worker 0-0, policy_version 1166279 (0.00088) [2022-07-11 11:10:37,719][26022] Updated weights on worker 0-0, policy_version 1166289 (0.00089) [2022-07-11 11:10:39,162][25689] Fps is (10 sec: 5672.3, 60 sec: 5519.8, 300 sec: 5524.1). Total num frames: 1194288128. Throughput: 0: 5777.9. Samples: 1194290942. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:39,163][25689] Avg episode reward: [(0, '-0.518')] [2022-07-11 11:10:39,651][26022] Updated weights on worker 0-0, policy_version 1166299 (0.00087) [2022-07-11 11:10:41,296][26022] Updated weights on worker 0-0, policy_version 1166309 (0.00093) [2022-07-11 11:10:43,303][26022] Updated weights on worker 0-0, policy_version 1166319 (0.00092) [2022-07-11 11:10:44,178][25689] Fps is (10 sec: 5481.8, 60 sec: 5490.2, 300 sec: 5520.7). Total num frames: 1194314752. Throughput: 0: 5776.4. Samples: 1194324218. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:44,180][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 11:10:45,110][26022] Updated weights on worker 0-0, policy_version 1166329 (0.00107) [2022-07-11 11:10:47,190][26022] Updated weights on worker 0-0, policy_version 1166339 (0.00071) [2022-07-11 11:10:48,882][26022] Updated weights on worker 0-0, policy_version 1166349 (0.00091) [2022-07-11 11:10:49,194][25689] Fps is (10 sec: 5511.8, 60 sec: 5523.8, 300 sec: 5524.5). Total num frames: 1194343424. Throughput: 0: 4940.0. Samples: 1194340694. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:49,196][25689] Avg episode reward: [(0, '-0.178')] [2022-07-11 11:10:50,951][26022] Updated weights on worker 0-0, policy_version 1166359 (0.00090) [2022-07-11 11:10:52,542][26022] Updated weights on worker 0-0, policy_version 1166369 (0.00086) [2022-07-11 11:10:54,205][25689] Fps is (10 sec: 5514.6, 60 sec: 5458.6, 300 sec: 5520.0). Total num frames: 1194370048. Throughput: 0: 5765.7. Samples: 1194373842. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:54,205][25689] Avg episode reward: [(0, '0.180')] [2022-07-11 11:10:54,584][26022] Updated weights on worker 0-0, policy_version 1166379 (0.00088) [2022-07-11 11:10:56,173][26022] Updated weights on worker 0-0, policy_version 1166389 (0.00097) [2022-07-11 11:10:58,111][26022] Updated weights on worker 0-0, policy_version 1166399 (0.00086) [2022-07-11 11:10:59,322][25689] Fps is (10 sec: 5460.2, 60 sec: 5506.0, 300 sec: 5525.5). Total num frames: 1194398720. Throughput: 0: 5787.8. Samples: 1194407620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:10:59,322][25689] Avg episode reward: [(0, '0.044')] [2022-07-11 11:10:59,795][26022] Updated weights on worker 0-0, policy_version 1166409 (0.00094) [2022-07-11 11:11:02,076][26022] Updated weights on worker 0-0, policy_version 1166419 (0.00084) [2022-07-11 11:11:03,841][26022] Updated weights on worker 0-0, policy_version 1166429 (0.00090) [2022-07-11 11:11:04,329][25689] Fps is (10 sec: 5461.6, 60 sec: 5509.0, 300 sec: 5519.0). Total num frames: 1194425344. Throughput: 0: 4868.5. Samples: 1194422326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:04,330][25689] Avg episode reward: [(0, '0.257')] [2022-07-11 11:11:05,993][26022] Updated weights on worker 0-0, policy_version 1166439 (0.00085) [2022-07-11 11:11:07,599][26022] Updated weights on worker 0-0, policy_version 1166449 (0.00084) [2022-07-11 11:11:09,341][25689] Fps is (10 sec: 5314.5, 60 sec: 5493.8, 300 sec: 5516.3). Total num frames: 1194451968. Throughput: 0: 5716.7. Samples: 1194455866. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:09,341][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 11:11:09,581][26022] Updated weights on worker 0-0, policy_version 1166459 (0.00085) [2022-07-11 11:11:11,356][26022] Updated weights on worker 0-0, policy_version 1166469 (0.00081) [2022-07-11 11:11:13,211][26022] Updated weights on worker 0-0, policy_version 1166479 (0.00084) [2022-07-11 11:11:14,348][25689] Fps is (10 sec: 5621.8, 60 sec: 5528.5, 300 sec: 5524.0). Total num frames: 1194481664. Throughput: 0: 5740.7. Samples: 1194489474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:14,348][25689] Avg episode reward: [(0, '0.143')] [2022-07-11 11:11:15,082][26022] Updated weights on worker 0-0, policy_version 1166489 (0.00090) [2022-07-11 11:11:16,794][26022] Updated weights on worker 0-0, policy_version 1166499 (0.00092) [2022-07-11 11:11:18,708][26022] Updated weights on worker 0-0, policy_version 1166509 (0.00093) [2022-07-11 11:11:19,377][25689] Fps is (10 sec: 5611.9, 60 sec: 5497.6, 300 sec: 5514.3). Total num frames: 1194508288. Throughput: 0: 4909.1. Samples: 1194506072. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:19,377][25689] Avg episode reward: [(0, '0.679')] [2022-07-11 11:11:20,337][26022] Updated weights on worker 0-0, policy_version 1166519 (0.00087) [2022-07-11 11:11:22,258][26022] Updated weights on worker 0-0, policy_version 1166529 (0.00084) [2022-07-11 11:11:24,180][26022] Updated weights on worker 0-0, policy_version 1166539 (0.00088) [2022-07-11 11:11:24,382][25689] Fps is (10 sec: 5408.4, 60 sec: 5497.7, 300 sec: 5517.7). Total num frames: 1194535936. Throughput: 0: 5846.7. Samples: 1194539568. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:24,383][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 11:11:25,939][26022] Updated weights on worker 0-0, policy_version 1166549 (0.00085) [2022-07-11 11:11:27,639][26022] Updated weights on worker 0-0, policy_version 1166559 (0.00087) [2022-07-11 11:11:29,402][25689] Fps is (10 sec: 5617.8, 60 sec: 5531.5, 300 sec: 5521.0). Total num frames: 1194564608. Throughput: 0: 5841.7. Samples: 1194573056. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:29,403][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 11:11:29,711][26022] Updated weights on worker 0-0, policy_version 1166569 (0.00089) [2022-07-11 11:11:31,507][26022] Updated weights on worker 0-0, policy_version 1166579 (0.00090) [2022-07-11 11:11:33,194][26022] Updated weights on worker 0-0, policy_version 1166589 (0.00094) [2022-07-11 11:11:34,416][25689] Fps is (10 sec: 5612.9, 60 sec: 5516.7, 300 sec: 5519.3). Total num frames: 1194592256. Throughput: 0: 4999.4. Samples: 1194589806. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:34,417][25689] Avg episode reward: [(0, '0.547')] [2022-07-11 11:11:35,005][26022] Updated weights on worker 0-0, policy_version 1166599 (0.00090) [2022-07-11 11:11:36,859][26022] Updated weights on worker 0-0, policy_version 1166609 (0.00090) [2022-07-11 11:11:38,961][26022] Updated weights on worker 0-0, policy_version 1166619 (0.00081) [2022-07-11 11:11:39,479][25689] Fps is (10 sec: 5487.4, 60 sec: 5500.6, 300 sec: 5519.5). Total num frames: 1194619904. Throughput: 0: 5839.2. Samples: 1194623454. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:39,479][25689] Avg episode reward: [(0, '0.153')] [2022-07-11 11:11:40,555][26022] Updated weights on worker 0-0, policy_version 1166629 (0.00095) [2022-07-11 11:11:42,663][26022] Updated weights on worker 0-0, policy_version 1166639 (0.00086) [2022-07-11 11:11:44,141][26022] Updated weights on worker 0-0, policy_version 1166649 (0.00085) [2022-07-11 11:11:44,525][25689] Fps is (10 sec: 5672.7, 60 sec: 5548.7, 300 sec: 5519.2). Total num frames: 1194649600. Throughput: 0: 5836.1. Samples: 1194657124. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:44,526][25689] Avg episode reward: [(0, '-0.052')] [2022-07-11 11:11:46,144][26022] Updated weights on worker 0-0, policy_version 1166659 (0.00090) [2022-07-11 11:11:47,709][26022] Updated weights on worker 0-0, policy_version 1166669 (0.00091) [2022-07-11 11:11:49,527][25689] Fps is (10 sec: 5707.2, 60 sec: 5533.1, 300 sec: 5520.2). Total num frames: 1194677248. Throughput: 0: 5022.3. Samples: 1194674128. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:49,527][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 11:11:49,937][26022] Updated weights on worker 0-0, policy_version 1166679 (0.00094) [2022-07-11 11:11:51,555][26022] Updated weights on worker 0-0, policy_version 1166689 (0.00052) [2022-07-11 11:11:53,415][26022] Updated weights on worker 0-0, policy_version 1166699 (0.00090) [2022-07-11 11:11:54,555][25689] Fps is (10 sec: 5411.2, 60 sec: 5531.5, 300 sec: 5518.3). Total num frames: 1194703872. Throughput: 0: 5851.7. Samples: 1194707650. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:54,556][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 11:11:55,140][26022] Updated weights on worker 0-0, policy_version 1166709 (0.00091) [2022-07-11 11:11:57,187][26022] Updated weights on worker 0-0, policy_version 1166719 (0.00507) [2022-07-11 11:11:58,807][26022] Updated weights on worker 0-0, policy_version 1166729 (0.00086) [2022-07-11 11:11:59,681][25689] Fps is (10 sec: 5647.3, 60 sec: 5564.5, 300 sec: 5530.5). Total num frames: 1194734592. Throughput: 0: 5835.5. Samples: 1194741342. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:11:59,681][25689] Avg episode reward: [(0, '1.146')] [2022-07-11 11:12:00,871][26022] Updated weights on worker 0-0, policy_version 1166739 (0.00089) [2022-07-11 11:12:01,488][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:12:01,506][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001166743_1194744832.pth [2022-07-11 11:12:01,506][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001164797_1192752128.pth [2022-07-11 11:12:02,896][26022] Updated weights on worker 0-0, policy_version 1166749 (0.00100) [2022-07-11 11:12:04,690][25689] Fps is (10 sec: 5455.8, 60 sec: 5530.5, 300 sec: 5520.6). Total num frames: 1194759168. Throughput: 0: 4978.7. Samples: 1194757518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:04,692][25689] Avg episode reward: [(0, '1.386')] [2022-07-11 11:12:04,823][26022] Updated weights on worker 0-0, policy_version 1166759 (0.00088) [2022-07-11 11:12:06,705][26022] Updated weights on worker 0-0, policy_version 1166769 (0.00090) [2022-07-11 11:12:08,368][26022] Updated weights on worker 0-0, policy_version 1166779 (0.00095) [2022-07-11 11:12:09,710][25689] Fps is (10 sec: 5309.2, 60 sec: 5563.6, 300 sec: 5524.3). Total num frames: 1194787840. Throughput: 0: 5719.0. Samples: 1194789558. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:09,712][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 11:12:10,490][26022] Updated weights on worker 0-0, policy_version 1166789 (0.00085) [2022-07-11 11:12:12,201][26022] Updated weights on worker 0-0, policy_version 1166799 (0.00093) [2022-07-11 11:12:14,028][26022] Updated weights on worker 0-0, policy_version 1166809 (0.00094) [2022-07-11 11:12:14,752][25689] Fps is (10 sec: 5698.9, 60 sec: 5543.4, 300 sec: 5521.0). Total num frames: 1194816512. Throughput: 0: 5725.1. Samples: 1194823282. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:14,753][25689] Avg episode reward: [(0, '1.317')] [2022-07-11 11:12:15,940][26022] Updated weights on worker 0-0, policy_version 1166819 (0.00081) [2022-07-11 11:12:17,789][26022] Updated weights on worker 0-0, policy_version 1166829 (0.00051) [2022-07-11 11:12:19,553][26022] Updated weights on worker 0-0, policy_version 1166839 (0.00086) [2022-07-11 11:12:19,801][25689] Fps is (10 sec: 5581.4, 60 sec: 5558.6, 300 sec: 5524.0). Total num frames: 1194844160. Throughput: 0: 4892.9. Samples: 1194839788. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:19,801][25689] Avg episode reward: [(0, '0.191')] [2022-07-11 11:12:21,580][26022] Updated weights on worker 0-0, policy_version 1166849 (0.00085) [2022-07-11 11:12:23,235][26022] Updated weights on worker 0-0, policy_version 1166859 (0.00092) [2022-07-11 11:12:24,823][25689] Fps is (10 sec: 5490.9, 60 sec: 5557.1, 300 sec: 5524.1). Total num frames: 1194871808. Throughput: 0: 5734.5. Samples: 1194872970. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:24,823][25689] Avg episode reward: [(0, '0.335')] [2022-07-11 11:12:25,311][26022] Updated weights on worker 0-0, policy_version 1166869 (0.00085) [2022-07-11 11:12:26,903][26022] Updated weights on worker 0-0, policy_version 1166879 (0.00090) [2022-07-11 11:12:28,896][26022] Updated weights on worker 0-0, policy_version 1166889 (0.00094) [2022-07-11 11:12:29,918][25689] Fps is (10 sec: 5465.5, 60 sec: 5533.2, 300 sec: 5519.4). Total num frames: 1194899456. Throughput: 0: 5797.7. Samples: 1194906718. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:29,919][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 11:12:30,495][26022] Updated weights on worker 0-0, policy_version 1166899 (0.00082) [2022-07-11 11:12:32,423][26022] Updated weights on worker 0-0, policy_version 1166909 (0.00618) [2022-07-11 11:12:34,295][26022] Updated weights on worker 0-0, policy_version 1166919 (0.00095) [2022-07-11 11:12:34,993][25689] Fps is (10 sec: 5739.3, 60 sec: 5578.4, 300 sec: 5530.1). Total num frames: 1194930176. Throughput: 0: 4957.5. Samples: 1194923624. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2022-07-11 11:12:34,993][25689] Avg episode reward: [(0, '0.085')] [2022-07-11 11:12:36,172][26022] Updated weights on worker 0-0, policy_version 1166929 (0.00090) [2022-07-11 11:12:37,792][26022] Updated weights on worker 0-0, policy_version 1166939 (0.00088) [2022-07-11 11:12:39,841][26022] Updated weights on worker 0-0, policy_version 1166949 (0.00633) [2022-07-11 11:12:40,062][25689] Fps is (10 sec: 5653.3, 60 sec: 5560.9, 300 sec: 5522.6). Total num frames: 1194956800. Throughput: 0: 5793.4. Samples: 1194957166. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:12:40,062][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 11:12:41,738][26022] Updated weights on worker 0-0, policy_version 1166959 (0.00087) [2022-07-11 11:12:43,320][26022] Updated weights on worker 0-0, policy_version 1166969 (0.00091) [2022-07-11 11:12:45,069][25689] Fps is (10 sec: 5386.3, 60 sec: 5530.7, 300 sec: 5519.5). Total num frames: 1194984448. Throughput: 0: 5822.5. Samples: 1194990852. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:12:45,069][25689] Avg episode reward: [(0, '0.392')] [2022-07-11 11:12:45,400][26022] Updated weights on worker 0-0, policy_version 1166979 (0.00092) [2022-07-11 11:12:46,925][26022] Updated weights on worker 0-0, policy_version 1166989 (0.00091) [2022-07-11 11:12:48,985][26022] Updated weights on worker 0-0, policy_version 1166999 (0.00095) [2022-07-11 11:12:50,105][25689] Fps is (10 sec: 5709.9, 60 sec: 5561.4, 300 sec: 5530.5). Total num frames: 1195014144. Throughput: 0: 4991.3. Samples: 1195007474. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:12:50,105][25689] Avg episode reward: [(0, '1.397')] [2022-07-11 11:12:50,899][26022] Updated weights on worker 0-0, policy_version 1167009 (0.00084) [2022-07-11 11:12:52,490][26022] Updated weights on worker 0-0, policy_version 1167019 (0.00089) [2022-07-11 11:12:54,652][26022] Updated weights on worker 0-0, policy_version 1167029 (0.00080) [2022-07-11 11:12:55,124][25689] Fps is (10 sec: 5499.2, 60 sec: 5545.3, 300 sec: 5517.3). Total num frames: 1195039744. Throughput: 0: 5830.7. Samples: 1195041002. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:12:55,124][25689] Avg episode reward: [(0, '1.270')] [2022-07-11 11:12:56,252][26022] Updated weights on worker 0-0, policy_version 1167039 (0.00086) [2022-07-11 11:12:58,011][26022] Updated weights on worker 0-0, policy_version 1167049 (0.00092) [2022-07-11 11:12:59,939][26022] Updated weights on worker 0-0, policy_version 1167059 (0.00084) [2022-07-11 11:13:00,171][25689] Fps is (10 sec: 5391.5, 60 sec: 5518.7, 300 sec: 5526.9). Total num frames: 1195068416. Throughput: 0: 5846.5. Samples: 1195074734. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:00,171][25689] Avg episode reward: [(0, '1.257')] [2022-07-11 11:13:01,698][26022] Updated weights on worker 0-0, policy_version 1167069 (0.00093) [2022-07-11 11:13:04,012][26022] Updated weights on worker 0-0, policy_version 1167079 (0.00061) [2022-07-11 11:13:05,181][25689] Fps is (10 sec: 5599.9, 60 sec: 5569.3, 300 sec: 5530.2). Total num frames: 1195096064. Throughput: 0: 4925.4. Samples: 1195089914. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:05,182][25689] Avg episode reward: [(0, '1.217')] [2022-07-11 11:13:05,863][26022] Updated weights on worker 0-0, policy_version 1167089 (0.00100) [2022-07-11 11:13:07,655][26022] Updated weights on worker 0-0, policy_version 1167099 (0.00085) [2022-07-11 11:13:09,588][26022] Updated weights on worker 0-0, policy_version 1167109 (0.00087) [2022-07-11 11:13:10,197][25689] Fps is (10 sec: 5515.1, 60 sec: 5552.8, 300 sec: 5526.5). Total num frames: 1195123712. Throughput: 0: 5738.1. Samples: 1195122768. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:10,198][25689] Avg episode reward: [(0, '1.419')] [2022-07-11 11:13:11,458][26022] Updated weights on worker 0-0, policy_version 1167119 (0.00097) [2022-07-11 11:13:13,312][26022] Updated weights on worker 0-0, policy_version 1167129 (0.00101) [2022-07-11 11:13:15,119][26022] Updated weights on worker 0-0, policy_version 1167139 (0.00085) [2022-07-11 11:13:15,219][25689] Fps is (10 sec: 5406.9, 60 sec: 5520.8, 300 sec: 5521.6). Total num frames: 1195150336. Throughput: 0: 5739.2. Samples: 1195156330. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:15,219][25689] Avg episode reward: [(0, '1.414')] [2022-07-11 11:13:16,751][26022] Updated weights on worker 0-0, policy_version 1167149 (0.00085) [2022-07-11 11:13:18,772][26022] Updated weights on worker 0-0, policy_version 1167159 (0.00093) [2022-07-11 11:13:20,254][25689] Fps is (10 sec: 5396.6, 60 sec: 5522.0, 300 sec: 5519.3). Total num frames: 1195177984. Throughput: 0: 4897.5. Samples: 1195173090. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:20,256][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 11:13:20,627][26022] Updated weights on worker 0-0, policy_version 1167169 (0.00102) [2022-07-11 11:13:22,406][26022] Updated weights on worker 0-0, policy_version 1167179 (0.00086) [2022-07-11 11:13:24,190][26022] Updated weights on worker 0-0, policy_version 1167189 (0.00095) [2022-07-11 11:13:25,283][25689] Fps is (10 sec: 5799.6, 60 sec: 5572.2, 300 sec: 5529.5). Total num frames: 1195208704. Throughput: 0: 5804.9. Samples: 1195206602. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:25,284][25689] Avg episode reward: [(0, '1.337')] [2022-07-11 11:13:25,956][26022] Updated weights on worker 0-0, policy_version 1167199 (0.00089) [2022-07-11 11:13:27,919][26022] Updated weights on worker 0-0, policy_version 1167209 (0.00088) [2022-07-11 11:13:29,844][26022] Updated weights on worker 0-0, policy_version 1167219 (0.00092) [2022-07-11 11:13:30,308][25689] Fps is (10 sec: 5601.7, 60 sec: 5544.8, 300 sec: 5522.3). Total num frames: 1195234304. Throughput: 0: 5829.9. Samples: 1195240012. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:30,310][25689] Avg episode reward: [(0, '1.146')] [2022-07-11 11:13:31,488][26022] Updated weights on worker 0-0, policy_version 1167229 (0.00095) [2022-07-11 11:13:33,588][26022] Updated weights on worker 0-0, policy_version 1167239 (0.00098) [2022-07-11 11:13:35,085][26022] Updated weights on worker 0-0, policy_version 1167249 (0.00096) [2022-07-11 11:13:35,356][25689] Fps is (10 sec: 5387.8, 60 sec: 5513.3, 300 sec: 5522.8). Total num frames: 1195262976. Throughput: 0: 4983.0. Samples: 1195256674. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:35,357][25689] Avg episode reward: [(0, '0.339')] [2022-07-11 11:13:36,893][26022] Updated weights on worker 0-0, policy_version 1167259 (0.00085) [2022-07-11 11:13:38,944][26022] Updated weights on worker 0-0, policy_version 1167269 (0.00094) [2022-07-11 11:13:40,467][25689] Fps is (10 sec: 5846.4, 60 sec: 5577.3, 300 sec: 5532.5). Total num frames: 1195293696. Throughput: 0: 5819.3. Samples: 1195290714. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:40,467][25689] Avg episode reward: [(0, '-0.298')] [2022-07-11 11:13:40,472][26022] Updated weights on worker 0-0, policy_version 1167279 (0.00090) [2022-07-11 11:13:42,655][26022] Updated weights on worker 0-0, policy_version 1167289 (0.00085) [2022-07-11 11:13:44,336][26022] Updated weights on worker 0-0, policy_version 1167299 (0.00091) [2022-07-11 11:13:45,494][25689] Fps is (10 sec: 5656.1, 60 sec: 5558.4, 300 sec: 5530.0). Total num frames: 1195320320. Throughput: 0: 5823.8. Samples: 1195324310. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:45,495][25689] Avg episode reward: [(0, '0.115')] [2022-07-11 11:13:46,096][26022] Updated weights on worker 0-0, policy_version 1167309 (0.00082) [2022-07-11 11:13:48,088][26022] Updated weights on worker 0-0, policy_version 1167319 (0.00085) [2022-07-11 11:13:49,825][26022] Updated weights on worker 0-0, policy_version 1167329 (0.00091) [2022-07-11 11:13:50,497][25689] Fps is (10 sec: 5411.0, 60 sec: 5527.6, 300 sec: 5527.6). Total num frames: 1195347968. Throughput: 0: 5826.3. Samples: 1195357636. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:50,497][25689] Avg episode reward: [(0, '-1.545')] [2022-07-11 11:13:51,955][26022] Updated weights on worker 0-0, policy_version 1167339 (0.00086) [2022-07-11 11:13:53,534][26022] Updated weights on worker 0-0, policy_version 1167349 (0.00088) [2022-07-11 11:13:55,435][26022] Updated weights on worker 0-0, policy_version 1167359 (0.00095) [2022-07-11 11:13:55,504][25689] Fps is (10 sec: 5524.0, 60 sec: 5562.6, 300 sec: 5528.4). Total num frames: 1195375616. Throughput: 0: 5828.8. Samples: 1195374116. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:13:55,510][25689] Avg episode reward: [(0, '-1.813')] [2022-07-11 11:13:57,320][26022] Updated weights on worker 0-0, policy_version 1167369 (0.00091) [2022-07-11 11:13:59,022][26022] Updated weights on worker 0-0, policy_version 1167379 (0.00091) [2022-07-11 11:14:00,543][25689] Fps is (10 sec: 5504.3, 60 sec: 5546.4, 300 sec: 5534.7). Total num frames: 1195403264. Throughput: 0: 5834.3. Samples: 1195407842. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:00,543][25689] Avg episode reward: [(0, '-2.708')] [2022-07-11 11:14:01,002][26022] Updated weights on worker 0-0, policy_version 1167389 (0.00089) [2022-07-11 11:14:01,555][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:14:01,570][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001167394_1195411456.pth [2022-07-11 11:14:01,570][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001165446_1193416704.pth [2022-07-11 11:14:03,176][26022] Updated weights on worker 0-0, policy_version 1167399 (0.00094) [2022-07-11 11:14:04,852][26022] Updated weights on worker 0-0, policy_version 1167409 (0.00093) [2022-07-11 11:14:05,567][25689] Fps is (10 sec: 5494.8, 60 sec: 5545.1, 300 sec: 5527.8). Total num frames: 1195430912. Throughput: 0: 5721.3. Samples: 1195439156. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:05,568][25689] Avg episode reward: [(0, '-1.830')] [2022-07-11 11:14:06,885][26022] Updated weights on worker 0-0, policy_version 1167419 (0.00367) [2022-07-11 11:14:08,514][26022] Updated weights on worker 0-0, policy_version 1167429 (0.00092) [2022-07-11 11:14:10,570][25689] Fps is (10 sec: 5310.4, 60 sec: 5512.5, 300 sec: 5528.2). Total num frames: 1195456512. Throughput: 0: 4901.1. Samples: 1195456016. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:10,570][25689] Avg episode reward: [(0, '-1.167')] [2022-07-11 11:14:10,605][26022] Updated weights on worker 0-0, policy_version 1167439 (0.00086) [2022-07-11 11:14:12,187][26022] Updated weights on worker 0-0, policy_version 1167449 (0.00088) [2022-07-11 11:14:14,009][26022] Updated weights on worker 0-0, policy_version 1167459 (0.00085) [2022-07-11 11:14:15,583][25689] Fps is (10 sec: 5316.7, 60 sec: 5530.2, 300 sec: 5525.6). Total num frames: 1195484160. Throughput: 0: 5765.1. Samples: 1195489870. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:15,585][25689] Avg episode reward: [(0, '0.013')] [2022-07-11 11:14:15,934][26022] Updated weights on worker 0-0, policy_version 1167469 (0.00082) [2022-07-11 11:14:17,526][26022] Updated weights on worker 0-0, policy_version 1167479 (0.00089) [2022-07-11 11:14:19,679][26022] Updated weights on worker 0-0, policy_version 1167489 (0.00089) [2022-07-11 11:14:20,627][25689] Fps is (10 sec: 5701.6, 60 sec: 5563.3, 300 sec: 5529.2). Total num frames: 1195513856. Throughput: 0: 5760.0. Samples: 1195523528. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:20,629][25689] Avg episode reward: [(0, '0.335')] [2022-07-11 11:14:21,174][26022] Updated weights on worker 0-0, policy_version 1167499 (0.00082) [2022-07-11 11:14:23,157][26022] Updated weights on worker 0-0, policy_version 1167509 (0.00096) [2022-07-11 11:14:24,864][26022] Updated weights on worker 0-0, policy_version 1167519 (0.00082) [2022-07-11 11:14:25,650][25689] Fps is (10 sec: 5696.3, 60 sec: 5512.9, 300 sec: 5532.4). Total num frames: 1195541504. Throughput: 0: 5045.4. Samples: 1195540478. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:25,651][25689] Avg episode reward: [(0, '0.546')] [2022-07-11 11:14:26,869][26022] Updated weights on worker 0-0, policy_version 1167529 (0.00097) [2022-07-11 11:14:28,619][26022] Updated weights on worker 0-0, policy_version 1167539 (0.00092) [2022-07-11 11:14:30,585][26022] Updated weights on worker 0-0, policy_version 1167549 (0.00087) [2022-07-11 11:14:30,677][25689] Fps is (10 sec: 5603.9, 60 sec: 5563.6, 300 sec: 5528.9). Total num frames: 1195570176. Throughput: 0: 5857.2. Samples: 1195573790. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:30,678][25689] Avg episode reward: [(0, '0.440')] [2022-07-11 11:14:32,138][26022] Updated weights on worker 0-0, policy_version 1167559 (0.00082) [2022-07-11 11:14:34,144][26022] Updated weights on worker 0-0, policy_version 1167569 (0.00084) [2022-07-11 11:14:35,683][25689] Fps is (10 sec: 5715.4, 60 sec: 5567.5, 300 sec: 5537.2). Total num frames: 1195598848. Throughput: 0: 5874.6. Samples: 1195607950. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:35,685][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 11:14:35,936][26022] Updated weights on worker 0-0, policy_version 1167579 (0.00090) [2022-07-11 11:14:37,950][26022] Updated weights on worker 0-0, policy_version 1167589 (0.00096) [2022-07-11 11:14:39,552][26022] Updated weights on worker 0-0, policy_version 1167599 (0.00095) [2022-07-11 11:14:40,798][25689] Fps is (10 sec: 5463.8, 60 sec: 5499.3, 300 sec: 5529.3). Total num frames: 1195625472. Throughput: 0: 5008.9. Samples: 1195624560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:40,798][25689] Avg episode reward: [(0, '0.339')] [2022-07-11 11:14:41,563][26022] Updated weights on worker 0-0, policy_version 1167609 (0.00086) [2022-07-11 11:14:43,275][26022] Updated weights on worker 0-0, policy_version 1167619 (0.00084) [2022-07-11 11:14:45,205][26022] Updated weights on worker 0-0, policy_version 1167629 (0.00087) [2022-07-11 11:14:45,866][25689] Fps is (10 sec: 5530.7, 60 sec: 5546.4, 300 sec: 5538.7). Total num frames: 1195655168. Throughput: 0: 5812.4. Samples: 1195657984. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:45,868][25689] Avg episode reward: [(0, '0.424')] [2022-07-11 11:14:47,039][26022] Updated weights on worker 0-0, policy_version 1167639 (0.00095) [2022-07-11 11:14:49,059][26022] Updated weights on worker 0-0, policy_version 1167649 (0.00091) [2022-07-11 11:14:50,624][26022] Updated weights on worker 0-0, policy_version 1167659 (0.00088) [2022-07-11 11:14:50,871][25689] Fps is (10 sec: 5692.9, 60 sec: 5546.2, 300 sec: 5529.0). Total num frames: 1195682816. Throughput: 0: 5813.0. Samples: 1195691174. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:50,871][25689] Avg episode reward: [(0, '0.727')] [2022-07-11 11:14:52,676][26022] Updated weights on worker 0-0, policy_version 1167669 (0.00095) [2022-07-11 11:14:54,286][26022] Updated weights on worker 0-0, policy_version 1167679 (0.00090) [2022-07-11 11:14:55,879][25689] Fps is (10 sec: 5624.8, 60 sec: 5563.1, 300 sec: 5540.6). Total num frames: 1195711488. Throughput: 0: 4941.1. Samples: 1195707740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:14:55,879][25689] Avg episode reward: [(0, '0.824')] [2022-07-11 11:14:56,355][26022] Updated weights on worker 0-0, policy_version 1167689 (0.00085) [2022-07-11 11:14:58,157][26022] Updated weights on worker 0-0, policy_version 1167699 (0.00087) [2022-07-11 11:14:59,872][26022] Updated weights on worker 0-0, policy_version 1167709 (0.00093) [2022-07-11 11:15:00,987][25689] Fps is (10 sec: 5466.1, 60 sec: 5539.8, 300 sec: 5539.4). Total num frames: 1195738112. Throughput: 0: 5776.2. Samples: 1195741176. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:00,987][25689] Avg episode reward: [(0, '1.221')] [2022-07-11 11:15:02,168][26022] Updated weights on worker 0-0, policy_version 1167719 (0.00108) [2022-07-11 11:15:04,233][26022] Updated weights on worker 0-0, policy_version 1167729 (0.00085) [2022-07-11 11:15:05,789][26022] Updated weights on worker 0-0, policy_version 1167739 (0.00087) [2022-07-11 11:15:06,054][25689] Fps is (10 sec: 5333.9, 60 sec: 5535.9, 300 sec: 5538.7). Total num frames: 1195765760. Throughput: 0: 5654.9. Samples: 1195772144. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:06,054][25689] Avg episode reward: [(0, '1.311')] [2022-07-11 11:15:07,791][26022] Updated weights on worker 0-0, policy_version 1167749 (0.00102) [2022-07-11 11:15:09,509][26022] Updated weights on worker 0-0, policy_version 1167759 (0.00086) [2022-07-11 11:15:11,120][25689] Fps is (10 sec: 5457.2, 60 sec: 5563.9, 300 sec: 5537.7). Total num frames: 1195793408. Throughput: 0: 4838.8. Samples: 1195789154. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:11,120][25689] Avg episode reward: [(0, '1.805')] [2022-07-11 11:15:11,649][26022] Updated weights on worker 0-0, policy_version 1167769 (0.00087) [2022-07-11 11:15:13,336][26022] Updated weights on worker 0-0, policy_version 1167779 (0.00093) [2022-07-11 11:15:15,041][26022] Updated weights on worker 0-0, policy_version 1167789 (0.00094) [2022-07-11 11:15:16,194][25689] Fps is (10 sec: 5453.3, 60 sec: 5558.3, 300 sec: 5534.1). Total num frames: 1195821056. Throughput: 0: 5663.5. Samples: 1195822794. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:16,194][25689] Avg episode reward: [(0, '1.963')] [2022-07-11 11:15:16,939][26022] Updated weights on worker 0-0, policy_version 1167799 (0.00080) [2022-07-11 11:15:18,873][26022] Updated weights on worker 0-0, policy_version 1167809 (0.00087) [2022-07-11 11:15:20,640][26022] Updated weights on worker 0-0, policy_version 1167819 (0.00089) [2022-07-11 11:15:21,291][25689] Fps is (10 sec: 5637.9, 60 sec: 5553.5, 300 sec: 5539.3). Total num frames: 1195850752. Throughput: 0: 5666.2. Samples: 1195856222. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:21,291][25689] Avg episode reward: [(0, '1.839')] [2022-07-11 11:15:22,703][26022] Updated weights on worker 0-0, policy_version 1167829 (0.00093) [2022-07-11 11:15:24,206][26022] Updated weights on worker 0-0, policy_version 1167839 (0.00095) [2022-07-11 11:15:26,261][26022] Updated weights on worker 0-0, policy_version 1167849 (0.00087) [2022-07-11 11:15:26,317][25689] Fps is (10 sec: 5563.4, 60 sec: 5536.3, 300 sec: 5539.2). Total num frames: 1195877376. Throughput: 0: 4975.1. Samples: 1195872956. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:26,317][25689] Avg episode reward: [(0, '1.988')] [2022-07-11 11:15:28,038][26022] Updated weights on worker 0-0, policy_version 1167859 (0.00087) [2022-07-11 11:15:30,009][26022] Updated weights on worker 0-0, policy_version 1167869 (0.00092) [2022-07-11 11:15:31,355][25689] Fps is (10 sec: 5392.5, 60 sec: 5518.5, 300 sec: 5535.7). Total num frames: 1195905024. Throughput: 0: 5764.8. Samples: 1195905808. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:31,356][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 11:15:31,735][26022] Updated weights on worker 0-0, policy_version 1167879 (0.00082) [2022-07-11 11:15:33,618][26022] Updated weights on worker 0-0, policy_version 1167889 (0.00091) [2022-07-11 11:15:35,420][26022] Updated weights on worker 0-0, policy_version 1167899 (0.00096) [2022-07-11 11:15:36,363][25689] Fps is (10 sec: 5504.2, 60 sec: 5501.4, 300 sec: 5533.4). Total num frames: 1195932672. Throughput: 0: 5779.2. Samples: 1195939358. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:36,365][25689] Avg episode reward: [(0, '1.106')] [2022-07-11 11:15:37,269][26022] Updated weights on worker 0-0, policy_version 1167909 (0.00088) [2022-07-11 11:15:39,150][26022] Updated weights on worker 0-0, policy_version 1167919 (0.00089) [2022-07-11 11:15:40,878][26022] Updated weights on worker 0-0, policy_version 1167929 (0.00088) [2022-07-11 11:15:41,481][25689] Fps is (10 sec: 5662.9, 60 sec: 5551.7, 300 sec: 5541.9). Total num frames: 1195962368. Throughput: 0: 4952.9. Samples: 1195956224. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:41,483][25689] Avg episode reward: [(0, '1.112')] [2022-07-11 11:15:42,635][26022] Updated weights on worker 0-0, policy_version 1167939 (0.00094) [2022-07-11 11:15:44,524][26022] Updated weights on worker 0-0, policy_version 1167949 (0.00091) [2022-07-11 11:15:46,348][26022] Updated weights on worker 0-0, policy_version 1167959 (0.00097) [2022-07-11 11:15:46,516][25689] Fps is (10 sec: 5647.9, 60 sec: 5521.0, 300 sec: 5538.1). Total num frames: 1195990016. Throughput: 0: 5777.8. Samples: 1195989664. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:46,518][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 11:15:48,201][26022] Updated weights on worker 0-0, policy_version 1167969 (0.00083) [2022-07-11 11:15:50,141][26022] Updated weights on worker 0-0, policy_version 1167979 (0.01511) [2022-07-11 11:15:51,535][25689] Fps is (10 sec: 5601.6, 60 sec: 5536.5, 300 sec: 5544.8). Total num frames: 1196018688. Throughput: 0: 5806.7. Samples: 1196022990. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:51,536][25689] Avg episode reward: [(0, '-0.204')] [2022-07-11 11:15:51,909][26022] Updated weights on worker 0-0, policy_version 1167989 (0.00087) [2022-07-11 11:15:53,788][26022] Updated weights on worker 0-0, policy_version 1167999 (0.00090) [2022-07-11 11:15:55,451][26022] Updated weights on worker 0-0, policy_version 1168009 (0.00087) [2022-07-11 11:15:56,612][25689] Fps is (10 sec: 5375.7, 60 sec: 5479.7, 300 sec: 5535.2). Total num frames: 1196044288. Throughput: 0: 4946.2. Samples: 1196039514. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:15:56,614][25689] Avg episode reward: [(0, '-0.611')] [2022-07-11 11:15:57,599][26022] Updated weights on worker 0-0, policy_version 1168019 (0.00085) [2022-07-11 11:15:59,431][26022] Updated weights on worker 0-0, policy_version 1168029 (0.00097) [2022-07-11 11:16:01,107][26022] Updated weights on worker 0-0, policy_version 1168039 (0.00089) [2022-07-11 11:16:01,651][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:16:01,655][25689] Fps is (10 sec: 5363.3, 60 sec: 5519.4, 300 sec: 5541.5). Total num frames: 1196072960. Throughput: 0: 5778.6. Samples: 1196072798. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:01,655][25689] Avg episode reward: [(0, '-0.518')] [2022-07-11 11:16:01,660][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001168041_1196073984.pth [2022-07-11 11:16:01,661][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001166094_1194080256.pth [2022-07-11 11:16:03,373][26022] Updated weights on worker 0-0, policy_version 1168049 (0.00091) [2022-07-11 11:16:05,247][26022] Updated weights on worker 0-0, policy_version 1168059 (0.00085) [2022-07-11 11:16:06,734][25689] Fps is (10 sec: 5564.3, 60 sec: 5518.3, 300 sec: 5543.6). Total num frames: 1196100608. Throughput: 0: 5666.0. Samples: 1196104216. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:06,734][25689] Avg episode reward: [(0, '-0.633')] [2022-07-11 11:16:06,968][26022] Updated weights on worker 0-0, policy_version 1168069 (0.00089) [2022-07-11 11:16:09,041][26022] Updated weights on worker 0-0, policy_version 1168079 (0.00089) [2022-07-11 11:16:10,675][26022] Updated weights on worker 0-0, policy_version 1168089 (0.00087) [2022-07-11 11:16:11,736][25689] Fps is (10 sec: 5484.9, 60 sec: 5524.0, 300 sec: 5536.8). Total num frames: 1196128256. Throughput: 0: 5675.6. Samples: 1196137640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:11,737][25689] Avg episode reward: [(0, '-1.165')] [2022-07-11 11:16:12,638][26022] Updated weights on worker 0-0, policy_version 1168099 (0.00081) [2022-07-11 11:16:14,546][26022] Updated weights on worker 0-0, policy_version 1168109 (0.00090) [2022-07-11 11:16:16,263][26022] Updated weights on worker 0-0, policy_version 1168119 (0.00095) [2022-07-11 11:16:16,779][25689] Fps is (10 sec: 5606.7, 60 sec: 5543.8, 300 sec: 5543.5). Total num frames: 1196156928. Throughput: 0: 5701.2. Samples: 1196154490. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:16,780][25689] Avg episode reward: [(0, '-1.030')] [2022-07-11 11:16:18,257][26022] Updated weights on worker 0-0, policy_version 1168129 (0.00086) [2022-07-11 11:16:19,887][26022] Updated weights on worker 0-0, policy_version 1168139 (0.00084) [2022-07-11 11:16:21,812][26022] Updated weights on worker 0-0, policy_version 1168149 (0.00089) [2022-07-11 11:16:21,870][25689] Fps is (10 sec: 5658.9, 60 sec: 5527.5, 300 sec: 5545.3). Total num frames: 1196185600. Throughput: 0: 5693.6. Samples: 1196187894. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:21,870][25689] Avg episode reward: [(0, '-0.436')] [2022-07-11 11:16:23,786][26022] Updated weights on worker 0-0, policy_version 1168159 (0.00083) [2022-07-11 11:16:25,409][26022] Updated weights on worker 0-0, policy_version 1168169 (0.00091) [2022-07-11 11:16:26,939][25689] Fps is (10 sec: 5341.9, 60 sec: 5506.7, 300 sec: 5534.1). Total num frames: 1196211200. Throughput: 0: 5774.5. Samples: 1196220888. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:26,939][25689] Avg episode reward: [(0, '0.044')] [2022-07-11 11:16:27,498][26022] Updated weights on worker 0-0, policy_version 1168179 (0.00099) [2022-07-11 11:16:29,225][26022] Updated weights on worker 0-0, policy_version 1168189 (0.00091) [2022-07-11 11:16:31,088][26022] Updated weights on worker 0-0, policy_version 1168199 (0.00083) [2022-07-11 11:16:32,001][25689] Fps is (10 sec: 5457.6, 60 sec: 5538.2, 300 sec: 5540.1). Total num frames: 1196240896. Throughput: 0: 4931.7. Samples: 1196237582. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:32,002][25689] Avg episode reward: [(0, '0.496')] [2022-07-11 11:16:32,815][26022] Updated weights on worker 0-0, policy_version 1168209 (0.00086) [2022-07-11 11:16:34,795][26022] Updated weights on worker 0-0, policy_version 1168219 (0.00083) [2022-07-11 11:16:36,499][26022] Updated weights on worker 0-0, policy_version 1168229 (0.00089) [2022-07-11 11:16:37,052][25689] Fps is (10 sec: 5669.9, 60 sec: 5534.3, 300 sec: 5540.3). Total num frames: 1196268544. Throughput: 0: 5750.8. Samples: 1196271078. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2022-07-11 11:16:37,053][25689] Avg episode reward: [(0, '-0.087')] [2022-07-11 11:16:38,665][26022] Updated weights on worker 0-0, policy_version 1168239 (0.00261) [2022-07-11 11:16:40,123][26022] Updated weights on worker 0-0, policy_version 1168249 (0.00080) [2022-07-11 11:16:42,116][25689] Fps is (10 sec: 5365.8, 60 sec: 5488.6, 300 sec: 5529.6). Total num frames: 1196295168. Throughput: 0: 5759.4. Samples: 1196304502. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:16:42,116][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 11:16:42,367][26022] Updated weights on worker 0-0, policy_version 1168259 (0.00088) [2022-07-11 11:16:43,795][26022] Updated weights on worker 0-0, policy_version 1168269 (0.00092) [2022-07-11 11:16:45,851][26022] Updated weights on worker 0-0, policy_version 1168279 (0.00085) [2022-07-11 11:16:47,162][25689] Fps is (10 sec: 5570.8, 60 sec: 5521.4, 300 sec: 5535.7). Total num frames: 1196324864. Throughput: 0: 4967.7. Samples: 1196321352. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:16:47,163][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 11:16:47,858][26022] Updated weights on worker 0-0, policy_version 1168289 (0.01194) [2022-07-11 11:16:49,388][26022] Updated weights on worker 0-0, policy_version 1168299 (0.00089) [2022-07-11 11:16:51,507][26022] Updated weights on worker 0-0, policy_version 1168309 (0.00083) [2022-07-11 11:16:52,197][25689] Fps is (10 sec: 5789.8, 60 sec: 5519.9, 300 sec: 5542.4). Total num frames: 1196353536. Throughput: 0: 5810.2. Samples: 1196354922. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:16:52,198][25689] Avg episode reward: [(0, '0.927')] [2022-07-11 11:16:53,080][26022] Updated weights on worker 0-0, policy_version 1168319 (0.00089) [2022-07-11 11:16:55,123][26022] Updated weights on worker 0-0, policy_version 1168329 (0.00082) [2022-07-11 11:16:56,973][26022] Updated weights on worker 0-0, policy_version 1168339 (0.00086) [2022-07-11 11:16:57,219][25689] Fps is (10 sec: 5600.2, 60 sec: 5558.7, 300 sec: 5534.1). Total num frames: 1196381184. Throughput: 0: 5814.1. Samples: 1196388328. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:16:57,219][25689] Avg episode reward: [(0, '0.930')] [2022-07-11 11:16:58,650][26022] Updated weights on worker 0-0, policy_version 1168349 (0.00088) [2022-07-11 11:17:00,511][26022] Updated weights on worker 0-0, policy_version 1168359 (0.00094) [2022-07-11 11:17:02,276][25689] Fps is (10 sec: 5283.0, 60 sec: 5506.7, 300 sec: 5536.6). Total num frames: 1196406784. Throughput: 0: 4991.9. Samples: 1196405140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:02,277][25689] Avg episode reward: [(0, '0.706')] [2022-07-11 11:17:02,679][26022] Updated weights on worker 0-0, policy_version 1168369 (0.00085) [2022-07-11 11:17:04,592][26022] Updated weights on worker 0-0, policy_version 1168379 (0.00084) [2022-07-11 11:17:06,568][26022] Updated weights on worker 0-0, policy_version 1168389 (0.00084) [2022-07-11 11:17:07,344][25689] Fps is (10 sec: 5259.0, 60 sec: 5507.7, 300 sec: 5532.3). Total num frames: 1196434432. Throughput: 0: 5692.8. Samples: 1196436244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:07,344][25689] Avg episode reward: [(0, '0.889')] [2022-07-11 11:17:08,279][26022] Updated weights on worker 0-0, policy_version 1168399 (0.00935) [2022-07-11 11:17:10,129][26022] Updated weights on worker 0-0, policy_version 1168409 (0.00091) [2022-07-11 11:17:12,074][26022] Updated weights on worker 0-0, policy_version 1168419 (0.00090) [2022-07-11 11:17:12,348][25689] Fps is (10 sec: 5591.9, 60 sec: 5524.5, 300 sec: 5533.0). Total num frames: 1196463104. Throughput: 0: 5686.2. Samples: 1196469504. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:12,348][25689] Avg episode reward: [(0, '0.917')] [2022-07-11 11:17:13,736][26022] Updated weights on worker 0-0, policy_version 1168429 (0.00085) [2022-07-11 11:17:15,683][26022] Updated weights on worker 0-0, policy_version 1168439 (0.00091) [2022-07-11 11:17:17,397][25689] Fps is (10 sec: 5500.6, 60 sec: 5490.1, 300 sec: 5529.5). Total num frames: 1196489728. Throughput: 0: 4858.7. Samples: 1196486366. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:17,397][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 11:17:17,695][26022] Updated weights on worker 0-0, policy_version 1168449 (0.00089) [2022-07-11 11:17:19,299][26022] Updated weights on worker 0-0, policy_version 1168459 (0.00088) [2022-07-11 11:17:21,248][26022] Updated weights on worker 0-0, policy_version 1168469 (0.00081) [2022-07-11 11:17:22,469][25689] Fps is (10 sec: 5463.2, 60 sec: 5491.8, 300 sec: 5532.0). Total num frames: 1196518400. Throughput: 0: 5668.0. Samples: 1196519596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:22,470][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 11:17:22,982][26022] Updated weights on worker 0-0, policy_version 1168479 (0.00085) [2022-07-11 11:17:24,995][26022] Updated weights on worker 0-0, policy_version 1168489 (0.00100) [2022-07-11 11:17:26,823][26022] Updated weights on worker 0-0, policy_version 1168499 (0.00081) [2022-07-11 11:17:27,518][25689] Fps is (10 sec: 5564.3, 60 sec: 5527.4, 300 sec: 5532.9). Total num frames: 1196546048. Throughput: 0: 5782.1. Samples: 1196552894. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:27,519][25689] Avg episode reward: [(0, '0.142')] [2022-07-11 11:17:28,519][26022] Updated weights on worker 0-0, policy_version 1168509 (0.00087) [2022-07-11 11:17:30,659][26022] Updated weights on worker 0-0, policy_version 1168519 (0.00089) [2022-07-11 11:17:32,281][26022] Updated weights on worker 0-0, policy_version 1168529 (0.00089) [2022-07-11 11:17:32,539][25689] Fps is (10 sec: 5491.4, 60 sec: 5497.4, 300 sec: 5523.6). Total num frames: 1196573696. Throughput: 0: 4946.1. Samples: 1196569372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:32,539][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 11:17:34,170][26022] Updated weights on worker 0-0, policy_version 1168539 (0.00091) [2022-07-11 11:17:36,207][26022] Updated weights on worker 0-0, policy_version 1168549 (0.00090) [2022-07-11 11:17:37,596][25689] Fps is (10 sec: 5690.4, 60 sec: 5530.7, 300 sec: 5534.1). Total num frames: 1196603392. Throughput: 0: 5780.0. Samples: 1196603116. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:37,596][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 11:17:37,715][26022] Updated weights on worker 0-0, policy_version 1168559 (0.00082) [2022-07-11 11:17:39,988][26022] Updated weights on worker 0-0, policy_version 1168569 (0.00113) [2022-07-11 11:17:41,517][26022] Updated weights on worker 0-0, policy_version 1168579 (0.00087) [2022-07-11 11:17:42,748][25689] Fps is (10 sec: 5616.9, 60 sec: 5539.5, 300 sec: 5531.4). Total num frames: 1196631040. Throughput: 0: 5765.6. Samples: 1196636516. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:42,749][25689] Avg episode reward: [(0, '0.280')] [2022-07-11 11:17:43,489][26022] Updated weights on worker 0-0, policy_version 1168589 (0.00088) [2022-07-11 11:17:44,980][26022] Updated weights on worker 0-0, policy_version 1168599 (0.00093) [2022-07-11 11:17:47,115][26022] Updated weights on worker 0-0, policy_version 1168609 (0.00092) [2022-07-11 11:17:47,795][25689] Fps is (10 sec: 5622.1, 60 sec: 5539.4, 300 sec: 5531.2). Total num frames: 1196660736. Throughput: 0: 4956.2. Samples: 1196653392. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:47,796][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 11:17:48,879][26022] Updated weights on worker 0-0, policy_version 1168619 (0.00092) [2022-07-11 11:17:50,568][26022] Updated weights on worker 0-0, policy_version 1168629 (0.00087) [2022-07-11 11:17:52,394][26022] Updated weights on worker 0-0, policy_version 1168639 (0.00082) [2022-07-11 11:17:52,852][25689] Fps is (10 sec: 5574.2, 60 sec: 5503.7, 300 sec: 5534.0). Total num frames: 1196687360. Throughput: 0: 5802.1. Samples: 1196687230. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:52,852][25689] Avg episode reward: [(0, '0.892')] [2022-07-11 11:17:54,315][26022] Updated weights on worker 0-0, policy_version 1168649 (0.00086) [2022-07-11 11:17:55,911][26022] Updated weights on worker 0-0, policy_version 1168659 (0.00089) [2022-07-11 11:17:57,858][25689] Fps is (10 sec: 5495.0, 60 sec: 5521.9, 300 sec: 5534.7). Total num frames: 1196716032. Throughput: 0: 5824.9. Samples: 1196721144. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:17:57,859][25689] Avg episode reward: [(0, '1.008')] [2022-07-11 11:17:58,016][26022] Updated weights on worker 0-0, policy_version 1168669 (0.00093) [2022-07-11 11:17:59,554][26022] Updated weights on worker 0-0, policy_version 1168679 (0.00088) [2022-07-11 11:18:01,691][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:18:01,706][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001168689_1196737536.pth [2022-07-11 11:18:01,706][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001166743_1194744832.pth [2022-07-11 11:18:01,715][26022] Updated weights on worker 0-0, policy_version 1168689 (0.00086) [2022-07-11 11:18:02,937][25689] Fps is (10 sec: 5584.3, 60 sec: 5553.7, 300 sec: 5533.4). Total num frames: 1196743680. Throughput: 0: 5014.0. Samples: 1196737744. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:02,938][25689] Avg episode reward: [(0, '0.969')] [2022-07-11 11:18:03,974][26022] Updated weights on worker 0-0, policy_version 1168699 (0.00097) [2022-07-11 11:18:05,672][26022] Updated weights on worker 0-0, policy_version 1168709 (0.00087) [2022-07-11 11:18:07,543][26022] Updated weights on worker 0-0, policy_version 1168719 (0.00091) [2022-07-11 11:18:07,968][25689] Fps is (10 sec: 5368.4, 60 sec: 5540.2, 300 sec: 5529.7). Total num frames: 1196770304. Throughput: 0: 5748.4. Samples: 1196769352. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:07,969][25689] Avg episode reward: [(0, '-0.025')] [2022-07-11 11:18:09,135][26022] Updated weights on worker 0-0, policy_version 1168729 (0.00085) [2022-07-11 11:18:11,265][26022] Updated weights on worker 0-0, policy_version 1168739 (0.00991) [2022-07-11 11:18:12,787][26022] Updated weights on worker 0-0, policy_version 1168749 (0.00089) [2022-07-11 11:18:13,047][25689] Fps is (10 sec: 5570.7, 60 sec: 5550.2, 300 sec: 5539.0). Total num frames: 1196800000. Throughput: 0: 5732.8. Samples: 1196803006. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:13,048][25689] Avg episode reward: [(0, '-0.197')] [2022-07-11 11:18:14,840][26022] Updated weights on worker 0-0, policy_version 1168759 (0.00095) [2022-07-11 11:18:16,541][26022] Updated weights on worker 0-0, policy_version 1168769 (0.00085) [2022-07-11 11:18:18,102][25689] Fps is (10 sec: 5658.5, 60 sec: 5566.5, 300 sec: 5538.6). Total num frames: 1196827648. Throughput: 0: 5699.2. Samples: 1196836518. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:18,104][25689] Avg episode reward: [(0, '0.127')] [2022-07-11 11:18:18,665][26022] Updated weights on worker 0-0, policy_version 1168779 (0.00083) [2022-07-11 11:18:20,234][26022] Updated weights on worker 0-0, policy_version 1168789 (0.00086) [2022-07-11 11:18:22,268][26022] Updated weights on worker 0-0, policy_version 1168799 (0.00091) [2022-07-11 11:18:23,192][25689] Fps is (10 sec: 5450.7, 60 sec: 5548.1, 300 sec: 5527.2). Total num frames: 1196855296. Throughput: 0: 5705.3. Samples: 1196853304. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:23,192][25689] Avg episode reward: [(0, '-0.417')] [2022-07-11 11:18:23,958][26022] Updated weights on worker 0-0, policy_version 1168809 (0.00086) [2022-07-11 11:18:25,987][26022] Updated weights on worker 0-0, policy_version 1168819 (0.00097) [2022-07-11 11:18:27,711][26022] Updated weights on worker 0-0, policy_version 1168829 (0.00087) [2022-07-11 11:18:28,215][25689] Fps is (10 sec: 5468.0, 60 sec: 5550.5, 300 sec: 5534.1). Total num frames: 1196882944. Throughput: 0: 5771.7. Samples: 1196886210. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:28,215][25689] Avg episode reward: [(0, '-1.048')] [2022-07-11 11:18:29,548][26022] Updated weights on worker 0-0, policy_version 1168839 (0.00057) [2022-07-11 11:18:31,402][26022] Updated weights on worker 0-0, policy_version 1168849 (0.00089) [2022-07-11 11:18:33,231][25689] Fps is (10 sec: 5406.3, 60 sec: 5534.1, 300 sec: 5527.8). Total num frames: 1196909568. Throughput: 0: 5769.1. Samples: 1196919446. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:33,231][25689] Avg episode reward: [(0, '-0.376')] [2022-07-11 11:18:33,423][26022] Updated weights on worker 0-0, policy_version 1168859 (0.00082) [2022-07-11 11:18:35,189][26022] Updated weights on worker 0-0, policy_version 1168869 (0.00086) [2022-07-11 11:18:37,119][26022] Updated weights on worker 0-0, policy_version 1168879 (0.00089) [2022-07-11 11:18:38,259][25689] Fps is (10 sec: 5607.4, 60 sec: 5536.7, 300 sec: 5525.9). Total num frames: 1196939264. Throughput: 0: 4935.8. Samples: 1196936006. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:38,261][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 11:18:38,931][26022] Updated weights on worker 0-0, policy_version 1168889 (0.00092) [2022-07-11 11:18:40,793][26022] Updated weights on worker 0-0, policy_version 1168899 (0.00101) [2022-07-11 11:18:42,734][26022] Updated weights on worker 0-0, policy_version 1168909 (0.00081) [2022-07-11 11:18:43,381][25689] Fps is (10 sec: 5649.7, 60 sec: 5539.5, 300 sec: 5527.6). Total num frames: 1196966912. Throughput: 0: 5748.3. Samples: 1196969354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:43,381][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 11:18:44,469][26022] Updated weights on worker 0-0, policy_version 1168919 (0.00089) [2022-07-11 11:18:46,272][26022] Updated weights on worker 0-0, policy_version 1168929 (0.00088) [2022-07-11 11:18:48,347][26022] Updated weights on worker 0-0, policy_version 1168939 (0.00091) [2022-07-11 11:18:48,396][25689] Fps is (10 sec: 5353.7, 60 sec: 5491.7, 300 sec: 5523.9). Total num frames: 1196993536. Throughput: 0: 5777.2. Samples: 1197002800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:48,396][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 11:18:49,932][26022] Updated weights on worker 0-0, policy_version 1168949 (0.00093) [2022-07-11 11:18:52,078][26022] Updated weights on worker 0-0, policy_version 1168959 (0.00084) [2022-07-11 11:18:53,427][25689] Fps is (10 sec: 5606.1, 60 sec: 5544.7, 300 sec: 5530.4). Total num frames: 1197023232. Throughput: 0: 4942.3. Samples: 1197019260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:53,427][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 11:18:53,678][26022] Updated weights on worker 0-0, policy_version 1168969 (0.00090) [2022-07-11 11:18:55,770][26022] Updated weights on worker 0-0, policy_version 1168979 (0.00094) [2022-07-11 11:18:57,533][26022] Updated weights on worker 0-0, policy_version 1168989 (0.00078) [2022-07-11 11:18:58,472][25689] Fps is (10 sec: 5488.0, 60 sec: 5490.5, 300 sec: 5523.4). Total num frames: 1197048832. Throughput: 0: 5758.3. Samples: 1197052398. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:18:58,472][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 11:18:59,234][26022] Updated weights on worker 0-0, policy_version 1168999 (0.00091) [2022-07-11 11:19:01,191][26022] Updated weights on worker 0-0, policy_version 1169009 (0.00090) [2022-07-11 11:19:03,291][26022] Updated weights on worker 0-0, policy_version 1169019 (0.00094) [2022-07-11 11:19:03,523][25689] Fps is (10 sec: 5274.0, 60 sec: 5493.0, 300 sec: 5522.9). Total num frames: 1197076480. Throughput: 0: 5679.5. Samples: 1197083752. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:03,523][25689] Avg episode reward: [(0, '-0.566')] [2022-07-11 11:19:05,382][26022] Updated weights on worker 0-0, policy_version 1169029 (0.00092) [2022-07-11 11:19:06,928][26022] Updated weights on worker 0-0, policy_version 1169039 (0.00089) [2022-07-11 11:19:08,537][25689] Fps is (10 sec: 5392.0, 60 sec: 5494.6, 300 sec: 5526.1). Total num frames: 1197103104. Throughput: 0: 4858.3. Samples: 1197100652. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:08,538][25689] Avg episode reward: [(0, '-0.722')] [2022-07-11 11:19:08,776][26022] Updated weights on worker 0-0, policy_version 1169049 (0.00082) [2022-07-11 11:19:10,687][26022] Updated weights on worker 0-0, policy_version 1169059 (0.00088) [2022-07-11 11:19:12,367][26022] Updated weights on worker 0-0, policy_version 1169069 (0.00093) [2022-07-11 11:19:13,539][25689] Fps is (10 sec: 5520.9, 60 sec: 5484.7, 300 sec: 5529.7). Total num frames: 1197131776. Throughput: 0: 5728.1. Samples: 1197134464. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:13,539][25689] Avg episode reward: [(0, '-0.827')] [2022-07-11 11:19:14,255][26022] Updated weights on worker 0-0, policy_version 1169079 (0.00086) [2022-07-11 11:19:16,056][26022] Updated weights on worker 0-0, policy_version 1169089 (0.00087) [2022-07-11 11:19:17,880][26022] Updated weights on worker 0-0, policy_version 1169099 (0.00084) [2022-07-11 11:19:18,567][25689] Fps is (10 sec: 5717.5, 60 sec: 5504.1, 300 sec: 5526.6). Total num frames: 1197160448. Throughput: 0: 5750.5. Samples: 1197167954. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:18,567][25689] Avg episode reward: [(0, '-0.348')] [2022-07-11 11:19:19,800][26022] Updated weights on worker 0-0, policy_version 1169109 (0.00092) [2022-07-11 11:19:21,686][26022] Updated weights on worker 0-0, policy_version 1169119 (0.00089) [2022-07-11 11:19:23,460][26022] Updated weights on worker 0-0, policy_version 1169129 (0.00083) [2022-07-11 11:19:23,613][25689] Fps is (10 sec: 5590.4, 60 sec: 5508.0, 300 sec: 5526.2). Total num frames: 1197188096. Throughput: 0: 5009.5. Samples: 1197184394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:23,614][25689] Avg episode reward: [(0, '-1.091')] [2022-07-11 11:19:25,318][26022] Updated weights on worker 0-0, policy_version 1169139 (0.00089) [2022-07-11 11:19:27,293][26022] Updated weights on worker 0-0, policy_version 1169149 (0.00085) [2022-07-11 11:19:28,671][25689] Fps is (10 sec: 5472.7, 60 sec: 5504.8, 300 sec: 5522.2). Total num frames: 1197215744. Throughput: 0: 5808.7. Samples: 1197217602. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:28,671][25689] Avg episode reward: [(0, '-1.329')] [2022-07-11 11:19:28,924][26022] Updated weights on worker 0-0, policy_version 1169159 (0.00053) [2022-07-11 11:19:30,984][26022] Updated weights on worker 0-0, policy_version 1169169 (0.00086) [2022-07-11 11:19:32,942][26022] Updated weights on worker 0-0, policy_version 1169179 (0.00091) [2022-07-11 11:19:33,674][25689] Fps is (10 sec: 5496.0, 60 sec: 5522.9, 300 sec: 5518.8). Total num frames: 1197243392. Throughput: 0: 5794.5. Samples: 1197251140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:33,675][25689] Avg episode reward: [(0, '-0.868')] [2022-07-11 11:19:34,551][26022] Updated weights on worker 0-0, policy_version 1169189 (0.00084) [2022-07-11 11:19:36,431][26022] Updated weights on worker 0-0, policy_version 1169199 (0.00086) [2022-07-11 11:19:38,383][26022] Updated weights on worker 0-0, policy_version 1169209 (0.00096) [2022-07-11 11:19:38,679][25689] Fps is (10 sec: 5422.9, 60 sec: 5474.2, 300 sec: 5520.8). Total num frames: 1197270016. Throughput: 0: 4971.1. Samples: 1197267932. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:38,679][25689] Avg episode reward: [(0, '-1.341')] [2022-07-11 11:19:40,074][26022] Updated weights on worker 0-0, policy_version 1169219 (0.00087) [2022-07-11 11:19:42,144][26022] Updated weights on worker 0-0, policy_version 1169229 (0.00087) [2022-07-11 11:19:43,739][25689] Fps is (10 sec: 5595.6, 60 sec: 5513.7, 300 sec: 5521.0). Total num frames: 1197299712. Throughput: 0: 5816.1. Samples: 1197301450. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:43,740][25689] Avg episode reward: [(0, '-1.291')] [2022-07-11 11:19:43,743][26022] Updated weights on worker 0-0, policy_version 1169239 (0.00097) [2022-07-11 11:19:45,499][26022] Updated weights on worker 0-0, policy_version 1169249 (0.00087) [2022-07-11 11:19:47,560][26022] Updated weights on worker 0-0, policy_version 1169259 (0.00094) [2022-07-11 11:19:48,757][25689] Fps is (10 sec: 5791.2, 60 sec: 5547.4, 300 sec: 5524.1). Total num frames: 1197328384. Throughput: 0: 5859.1. Samples: 1197335292. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:48,758][25689] Avg episode reward: [(0, '-1.189')] [2022-07-11 11:19:49,515][26022] Updated weights on worker 0-0, policy_version 1169269 (0.00090) [2022-07-11 11:19:51,011][26022] Updated weights on worker 0-0, policy_version 1169279 (0.00087) [2022-07-11 11:19:52,955][26022] Updated weights on worker 0-0, policy_version 1169289 (0.00087) [2022-07-11 11:19:53,766][25689] Fps is (10 sec: 5719.4, 60 sec: 5532.5, 300 sec: 5524.1). Total num frames: 1197357056. Throughput: 0: 5030.9. Samples: 1197352218. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:53,766][25689] Avg episode reward: [(0, '-0.501')] [2022-07-11 11:19:54,635][26022] Updated weights on worker 0-0, policy_version 1169299 (0.00085) [2022-07-11 11:19:56,786][26022] Updated weights on worker 0-0, policy_version 1169309 (0.00094) [2022-07-11 11:19:58,583][26022] Updated weights on worker 0-0, policy_version 1169319 (0.00093) [2022-07-11 11:19:58,784][25689] Fps is (10 sec: 5412.8, 60 sec: 5534.9, 300 sec: 5522.4). Total num frames: 1197382656. Throughput: 0: 5841.1. Samples: 1197385368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:19:58,784][25689] Avg episode reward: [(0, '-0.103')] [2022-07-11 11:20:00,269][26022] Updated weights on worker 0-0, policy_version 1169329 (0.00094) [2022-07-11 11:20:01,970][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:20:01,985][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001169336_1197400064.pth [2022-07-11 11:20:01,986][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001167394_1195411456.pth [2022-07-11 11:20:02,685][26022] Updated weights on worker 0-0, policy_version 1169339 (0.00087) [2022-07-11 11:20:03,835][25689] Fps is (10 sec: 5288.1, 60 sec: 5534.9, 300 sec: 5522.7). Total num frames: 1197410304. Throughput: 0: 5712.8. Samples: 1197416252. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:03,836][25689] Avg episode reward: [(0, '-0.212')] [2022-07-11 11:20:04,506][26022] Updated weights on worker 0-0, policy_version 1169349 (0.00087) [2022-07-11 11:20:06,236][26022] Updated weights on worker 0-0, policy_version 1169359 (0.00091) [2022-07-11 11:20:08,334][26022] Updated weights on worker 0-0, policy_version 1169369 (0.00095) [2022-07-11 11:20:08,854][25689] Fps is (10 sec: 5389.4, 60 sec: 5534.5, 300 sec: 5520.1). Total num frames: 1197436928. Throughput: 0: 4856.3. Samples: 1197432888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:08,856][25689] Avg episode reward: [(0, '-0.021')] [2022-07-11 11:20:09,847][26022] Updated weights on worker 0-0, policy_version 1169379 (0.00110) [2022-07-11 11:20:11,932][26022] Updated weights on worker 0-0, policy_version 1169389 (0.00081) [2022-07-11 11:20:13,568][26022] Updated weights on worker 0-0, policy_version 1169399 (0.00088) [2022-07-11 11:20:13,858][25689] Fps is (10 sec: 5517.1, 60 sec: 5534.3, 300 sec: 5524.8). Total num frames: 1197465600. Throughput: 0: 5679.0. Samples: 1197466320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:13,859][25689] Avg episode reward: [(0, '-0.835')] [2022-07-11 11:20:15,563][26022] Updated weights on worker 0-0, policy_version 1169409 (0.00088) [2022-07-11 11:20:17,387][26022] Updated weights on worker 0-0, policy_version 1169419 (0.00089) [2022-07-11 11:20:18,868][25689] Fps is (10 sec: 5726.2, 60 sec: 5535.9, 300 sec: 5523.0). Total num frames: 1197494272. Throughput: 0: 5717.1. Samples: 1197500192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:18,869][25689] Avg episode reward: [(0, '-0.986')] [2022-07-11 11:20:18,932][26022] Updated weights on worker 0-0, policy_version 1169429 (0.00085) [2022-07-11 11:20:20,996][26022] Updated weights on worker 0-0, policy_version 1169439 (0.00094) [2022-07-11 11:20:22,843][26022] Updated weights on worker 0-0, policy_version 1169449 (0.00083) [2022-07-11 11:20:24,009][25689] Fps is (10 sec: 5547.9, 60 sec: 5527.2, 300 sec: 5524.3). Total num frames: 1197521920. Throughput: 0: 4993.8. Samples: 1197516998. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:24,011][25689] Avg episode reward: [(0, '-1.383')] [2022-07-11 11:20:24,593][26022] Updated weights on worker 0-0, policy_version 1169459 (0.00089) [2022-07-11 11:20:26,471][26022] Updated weights on worker 0-0, policy_version 1169469 (0.00080) [2022-07-11 11:20:28,201][26022] Updated weights on worker 0-0, policy_version 1169479 (0.00060) [2022-07-11 11:20:29,034][25689] Fps is (10 sec: 5439.5, 60 sec: 5530.2, 300 sec: 5524.6). Total num frames: 1197549568. Throughput: 0: 5819.2. Samples: 1197550320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:29,036][25689] Avg episode reward: [(0, '-1.359')] [2022-07-11 11:20:30,260][26022] Updated weights on worker 0-0, policy_version 1169489 (0.00092) [2022-07-11 11:20:31,967][26022] Updated weights on worker 0-0, policy_version 1169499 (0.00084) [2022-07-11 11:20:33,875][26022] Updated weights on worker 0-0, policy_version 1169509 (0.00085) [2022-07-11 11:20:34,092][25689] Fps is (10 sec: 5484.3, 60 sec: 5525.3, 300 sec: 5523.6). Total num frames: 1197577216. Throughput: 0: 5798.7. Samples: 1197583652. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 11:20:34,092][25689] Avg episode reward: [(0, '-1.744')] [2022-07-11 11:20:35,477][26022] Updated weights on worker 0-0, policy_version 1169519 (0.00093) [2022-07-11 11:20:37,345][26022] Updated weights on worker 0-0, policy_version 1169529 (0.00088) [2022-07-11 11:20:39,141][25689] Fps is (10 sec: 5673.5, 60 sec: 5572.0, 300 sec: 5524.9). Total num frames: 1197606912. Throughput: 0: 4953.9. Samples: 1197600622. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:20:39,142][25689] Avg episode reward: [(0, '-1.145')] [2022-07-11 11:20:39,412][26022] Updated weights on worker 0-0, policy_version 1169539 (0.00090) [2022-07-11 11:20:41,224][26022] Updated weights on worker 0-0, policy_version 1169549 (0.00093) [2022-07-11 11:20:43,128][26022] Updated weights on worker 0-0, policy_version 1169559 (0.00411) [2022-07-11 11:20:44,176][25689] Fps is (10 sec: 5584.9, 60 sec: 5523.5, 300 sec: 5521.5). Total num frames: 1197633536. Throughput: 0: 5812.5. Samples: 1197634218. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:20:44,177][25689] Avg episode reward: [(0, '-0.962')] [2022-07-11 11:20:44,689][26022] Updated weights on worker 0-0, policy_version 1169569 (0.00086) [2022-07-11 11:20:46,805][26022] Updated weights on worker 0-0, policy_version 1169579 (0.00088) [2022-07-11 11:20:48,375][26022] Updated weights on worker 0-0, policy_version 1169589 (0.00095) [2022-07-11 11:20:49,222][25689] Fps is (10 sec: 5485.4, 60 sec: 5521.0, 300 sec: 5521.0). Total num frames: 1197662208. Throughput: 0: 5822.8. Samples: 1197667870. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:20:49,223][25689] Avg episode reward: [(0, '-1.666')] [2022-07-11 11:20:50,368][26022] Updated weights on worker 0-0, policy_version 1169599 (0.00093) [2022-07-11 11:20:52,183][26022] Updated weights on worker 0-0, policy_version 1169609 (0.00087) [2022-07-11 11:20:54,066][26022] Updated weights on worker 0-0, policy_version 1169619 (0.00086) [2022-07-11 11:20:54,226][25689] Fps is (10 sec: 5604.1, 60 sec: 5504.4, 300 sec: 5529.2). Total num frames: 1197689856. Throughput: 0: 5019.7. Samples: 1197684720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:20:54,228][25689] Avg episode reward: [(0, '-1.258')] [2022-07-11 11:20:55,862][26022] Updated weights on worker 0-0, policy_version 1169629 (0.00090) [2022-07-11 11:20:57,789][26022] Updated weights on worker 0-0, policy_version 1169639 (0.00084) [2022-07-11 11:20:59,246][25689] Fps is (10 sec: 5618.7, 60 sec: 5555.1, 300 sec: 5529.6). Total num frames: 1197718528. Throughput: 0: 5842.2. Samples: 1197718076. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:20:59,247][25689] Avg episode reward: [(0, '-0.835')] [2022-07-11 11:20:59,569][26022] Updated weights on worker 0-0, policy_version 1169649 (0.00084) [2022-07-11 11:21:01,911][26022] Updated weights on worker 0-0, policy_version 1169659 (0.00108) [2022-07-11 11:21:03,558][26022] Updated weights on worker 0-0, policy_version 1169669 (0.00089) [2022-07-11 11:21:04,340][25689] Fps is (10 sec: 5365.9, 60 sec: 5517.3, 300 sec: 5522.5). Total num frames: 1197744128. Throughput: 0: 5708.4. Samples: 1197749322. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:04,341][25689] Avg episode reward: [(0, '0.061')] [2022-07-11 11:21:05,498][26022] Updated weights on worker 0-0, policy_version 1169679 (0.00082) [2022-07-11 11:21:07,295][26022] Updated weights on worker 0-0, policy_version 1169689 (0.00085) [2022-07-11 11:21:09,175][26022] Updated weights on worker 0-0, policy_version 1169699 (0.00095) [2022-07-11 11:21:09,369][25689] Fps is (10 sec: 5361.1, 60 sec: 5550.2, 300 sec: 5525.4). Total num frames: 1197772800. Throughput: 0: 4871.5. Samples: 1197766016. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:09,369][25689] Avg episode reward: [(0, '-0.291')] [2022-07-11 11:21:10,975][26022] Updated weights on worker 0-0, policy_version 1169709 (0.00087) [2022-07-11 11:21:12,810][26022] Updated weights on worker 0-0, policy_version 1169719 (0.00096) [2022-07-11 11:21:14,391][25689] Fps is (10 sec: 5603.4, 60 sec: 5531.6, 300 sec: 5522.4). Total num frames: 1197800448. Throughput: 0: 5699.0. Samples: 1197799642. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:14,392][25689] Avg episode reward: [(0, '-0.084')] [2022-07-11 11:21:14,732][26022] Updated weights on worker 0-0, policy_version 1169729 (0.00088) [2022-07-11 11:21:16,389][26022] Updated weights on worker 0-0, policy_version 1169739 (0.00091) [2022-07-11 11:21:18,225][26022] Updated weights on worker 0-0, policy_version 1169749 (0.00090) [2022-07-11 11:21:19,427][25689] Fps is (10 sec: 5701.5, 60 sec: 5546.3, 300 sec: 5526.8). Total num frames: 1197830144. Throughput: 0: 5716.2. Samples: 1197833434. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:19,427][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 11:21:20,108][26022] Updated weights on worker 0-0, policy_version 1169759 (0.00088) [2022-07-11 11:21:21,880][26022] Updated weights on worker 0-0, policy_version 1169769 (0.00089) [2022-07-11 11:21:23,948][26022] Updated weights on worker 0-0, policy_version 1169779 (0.00081) [2022-07-11 11:21:24,519][25689] Fps is (10 sec: 5460.1, 60 sec: 5516.9, 300 sec: 5526.4). Total num frames: 1197855744. Throughput: 0: 5829.1. Samples: 1197866944. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:24,519][25689] Avg episode reward: [(0, '1.070')] [2022-07-11 11:21:25,519][26022] Updated weights on worker 0-0, policy_version 1169789 (0.00086) [2022-07-11 11:21:27,568][26022] Updated weights on worker 0-0, policy_version 1169799 (0.00088) [2022-07-11 11:21:29,347][26022] Updated weights on worker 0-0, policy_version 1169809 (0.00088) [2022-07-11 11:21:29,527][25689] Fps is (10 sec: 5474.5, 60 sec: 5552.2, 300 sec: 5527.4). Total num frames: 1197885440. Throughput: 0: 5842.5. Samples: 1197883790. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:29,528][25689] Avg episode reward: [(0, '1.202')] [2022-07-11 11:21:31,018][26022] Updated weights on worker 0-0, policy_version 1169819 (0.00097) [2022-07-11 11:21:33,178][26022] Updated weights on worker 0-0, policy_version 1169829 (0.00103) [2022-07-11 11:21:34,585][25689] Fps is (10 sec: 5798.6, 60 sec: 5569.2, 300 sec: 5530.7). Total num frames: 1197914112. Throughput: 0: 5805.6. Samples: 1197916876. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:34,585][25689] Avg episode reward: [(0, '1.495')] [2022-07-11 11:21:34,670][26022] Updated weights on worker 0-0, policy_version 1169839 (0.00095) [2022-07-11 11:21:36,863][26022] Updated weights on worker 0-0, policy_version 1169849 (0.00098) [2022-07-11 11:21:38,497][26022] Updated weights on worker 0-0, policy_version 1169859 (0.00086) [2022-07-11 11:21:39,592][25689] Fps is (10 sec: 5392.2, 60 sec: 5505.3, 300 sec: 5528.3). Total num frames: 1197939712. Throughput: 0: 5802.1. Samples: 1197950436. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:39,593][25689] Avg episode reward: [(0, '1.240')] [2022-07-11 11:21:40,260][26022] Updated weights on worker 0-0, policy_version 1169869 (0.00085) [2022-07-11 11:21:42,241][26022] Updated weights on worker 0-0, policy_version 1169879 (0.00088) [2022-07-11 11:21:44,043][26022] Updated weights on worker 0-0, policy_version 1169889 (0.00096) [2022-07-11 11:21:44,644][25689] Fps is (10 sec: 5496.8, 60 sec: 5554.5, 300 sec: 5528.2). Total num frames: 1197969408. Throughput: 0: 4972.6. Samples: 1197967020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:44,645][25689] Avg episode reward: [(0, '1.116')] [2022-07-11 11:21:46,008][26022] Updated weights on worker 0-0, policy_version 1169899 (0.00084) [2022-07-11 11:21:47,753][26022] Updated weights on worker 0-0, policy_version 1169909 (0.00092) [2022-07-11 11:21:49,540][26022] Updated weights on worker 0-0, policy_version 1169919 (0.00084) [2022-07-11 11:21:49,651][25689] Fps is (10 sec: 5701.0, 60 sec: 5541.2, 300 sec: 5525.3). Total num frames: 1197997056. Throughput: 0: 5805.1. Samples: 1198000608. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:49,651][25689] Avg episode reward: [(0, '1.089')] [2022-07-11 11:21:51,410][26022] Updated weights on worker 0-0, policy_version 1169929 (0.00091) [2022-07-11 11:21:53,154][26022] Updated weights on worker 0-0, policy_version 1169939 (0.00098) [2022-07-11 11:21:54,655][25689] Fps is (10 sec: 5523.8, 60 sec: 5541.2, 300 sec: 5525.6). Total num frames: 1198024704. Throughput: 0: 5855.9. Samples: 1198034404. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:54,656][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 11:21:55,142][26022] Updated weights on worker 0-0, policy_version 1169949 (0.01113) [2022-07-11 11:21:56,836][26022] Updated weights on worker 0-0, policy_version 1169959 (0.00091) [2022-07-11 11:21:58,635][26022] Updated weights on worker 0-0, policy_version 1169969 (0.00086) [2022-07-11 11:21:59,664][25689] Fps is (10 sec: 5624.6, 60 sec: 5542.2, 300 sec: 5536.9). Total num frames: 1198053376. Throughput: 0: 5032.2. Samples: 1198051440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:21:59,664][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 11:22:00,303][26022] Updated weights on worker 0-0, policy_version 1169979 (0.00082) [2022-07-11 11:22:02,032][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:22:02,049][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001169984_1198063616.pth [2022-07-11 11:22:02,050][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001168041_1196073984.pth [2022-07-11 11:22:02,586][26022] Updated weights on worker 0-0, policy_version 1169989 (0.00805) [2022-07-11 11:22:04,487][26022] Updated weights on worker 0-0, policy_version 1169999 (0.00089) [2022-07-11 11:22:04,713][25689] Fps is (10 sec: 5395.6, 60 sec: 5546.3, 300 sec: 5530.3). Total num frames: 1198078976. Throughput: 0: 5757.7. Samples: 1198082572. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:04,715][25689] Avg episode reward: [(0, '0.622')] [2022-07-11 11:22:06,236][26022] Updated weights on worker 0-0, policy_version 1170009 (0.00094) [2022-07-11 11:22:08,155][26022] Updated weights on worker 0-0, policy_version 1170019 (0.00087) [2022-07-11 11:22:09,736][25689] Fps is (10 sec: 5490.0, 60 sec: 5563.8, 300 sec: 5533.4). Total num frames: 1198108672. Throughput: 0: 5770.0. Samples: 1198116500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:09,738][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 11:22:09,946][26022] Updated weights on worker 0-0, policy_version 1170029 (0.00088) [2022-07-11 11:22:11,771][26022] Updated weights on worker 0-0, policy_version 1170039 (0.00085) [2022-07-11 11:22:13,912][26022] Updated weights on worker 0-0, policy_version 1170049 (0.00089) [2022-07-11 11:22:14,763][25689] Fps is (10 sec: 5604.3, 60 sec: 5546.5, 300 sec: 5533.8). Total num frames: 1198135296. Throughput: 0: 4927.0. Samples: 1198133476. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:14,764][25689] Avg episode reward: [(0, '0.993')] [2022-07-11 11:22:15,363][26022] Updated weights on worker 0-0, policy_version 1170059 (0.00090) [2022-07-11 11:22:17,494][26022] Updated weights on worker 0-0, policy_version 1170069 (0.00091) [2022-07-11 11:22:19,029][26022] Updated weights on worker 0-0, policy_version 1170079 (0.00086) [2022-07-11 11:22:19,803][25689] Fps is (10 sec: 5493.0, 60 sec: 5529.1, 300 sec: 5534.4). Total num frames: 1198163968. Throughput: 0: 5733.5. Samples: 1198166906. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:19,804][25689] Avg episode reward: [(0, '0.961')] [2022-07-11 11:22:21,248][26022] Updated weights on worker 0-0, policy_version 1170089 (0.00087) [2022-07-11 11:22:22,794][26022] Updated weights on worker 0-0, policy_version 1170099 (0.00083) [2022-07-11 11:22:24,761][26022] Updated weights on worker 0-0, policy_version 1170109 (0.00088) [2022-07-11 11:22:24,904][25689] Fps is (10 sec: 5553.5, 60 sec: 5562.1, 300 sec: 5533.4). Total num frames: 1198191616. Throughput: 0: 5834.1. Samples: 1198200366. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:24,905][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 11:22:26,375][26022] Updated weights on worker 0-0, policy_version 1170119 (0.00095) [2022-07-11 11:22:28,324][26022] Updated weights on worker 0-0, policy_version 1170129 (0.00086) [2022-07-11 11:22:29,954][25689] Fps is (10 sec: 5548.1, 60 sec: 5541.4, 300 sec: 5536.3). Total num frames: 1198220288. Throughput: 0: 4973.6. Samples: 1198217056. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:29,954][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 11:22:30,362][26022] Updated weights on worker 0-0, policy_version 1170139 (0.00085) [2022-07-11 11:22:31,943][26022] Updated weights on worker 0-0, policy_version 1170149 (0.00098) [2022-07-11 11:22:34,075][26022] Updated weights on worker 0-0, policy_version 1170159 (0.00089) [2022-07-11 11:22:34,960][25689] Fps is (10 sec: 5600.5, 60 sec: 5529.1, 300 sec: 5530.4). Total num frames: 1198247936. Throughput: 0: 5774.9. Samples: 1198250114. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:34,961][25689] Avg episode reward: [(0, '1.173')] [2022-07-11 11:22:35,484][26022] Updated weights on worker 0-0, policy_version 1170169 (0.00086) [2022-07-11 11:22:37,496][26022] Updated weights on worker 0-0, policy_version 1170179 (0.00090) [2022-07-11 11:22:39,341][26022] Updated weights on worker 0-0, policy_version 1170189 (0.00088) [2022-07-11 11:22:39,984][25689] Fps is (10 sec: 5615.0, 60 sec: 5578.5, 300 sec: 5536.2). Total num frames: 1198276608. Throughput: 0: 5811.1. Samples: 1198284184. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:39,985][25689] Avg episode reward: [(0, '1.076')] [2022-07-11 11:22:41,187][26022] Updated weights on worker 0-0, policy_version 1170199 (0.00082) [2022-07-11 11:22:43,120][26022] Updated weights on worker 0-0, policy_version 1170209 (0.00085) [2022-07-11 11:22:44,980][26022] Updated weights on worker 0-0, policy_version 1170219 (0.00084) [2022-07-11 11:22:45,079][25689] Fps is (10 sec: 5565.8, 60 sec: 5540.6, 300 sec: 5528.5). Total num frames: 1198304256. Throughput: 0: 4991.8. Samples: 1198301076. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:45,080][25689] Avg episode reward: [(0, '1.268')] [2022-07-11 11:22:46,523][26022] Updated weights on worker 0-0, policy_version 1170229 (0.00091) [2022-07-11 11:22:48,642][26022] Updated weights on worker 0-0, policy_version 1170239 (0.00081) [2022-07-11 11:22:50,064][26022] Updated weights on worker 0-0, policy_version 1170249 (0.00081) [2022-07-11 11:22:50,159][25689] Fps is (10 sec: 5736.8, 60 sec: 5584.7, 300 sec: 5541.8). Total num frames: 1198334976. Throughput: 0: 5828.2. Samples: 1198334814. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:50,159][25689] Avg episode reward: [(0, '1.283')] [2022-07-11 11:22:52,380][26022] Updated weights on worker 0-0, policy_version 1170259 (0.00082) [2022-07-11 11:22:53,826][26022] Updated weights on worker 0-0, policy_version 1170269 (0.00095) [2022-07-11 11:22:55,237][25689] Fps is (10 sec: 5645.5, 60 sec: 5561.0, 300 sec: 5533.6). Total num frames: 1198361600. Throughput: 0: 5865.5. Samples: 1198369046. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:22:55,237][25689] Avg episode reward: [(0, '1.074')] [2022-07-11 11:22:55,711][26022] Updated weights on worker 0-0, policy_version 1170279 (0.00086) [2022-07-11 11:22:57,592][26022] Updated weights on worker 0-0, policy_version 1170289 (0.00104) [2022-07-11 11:22:59,352][26022] Updated weights on worker 0-0, policy_version 1170299 (0.00096) [2022-07-11 11:23:00,289][25689] Fps is (10 sec: 5559.6, 60 sec: 5573.9, 300 sec: 5541.0). Total num frames: 1198391296. Throughput: 0: 5016.8. Samples: 1198386048. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:00,289][25689] Avg episode reward: [(0, '1.374')] [2022-07-11 11:23:01,307][26022] Updated weights on worker 0-0, policy_version 1170309 (0.00083) [2022-07-11 11:23:03,527][26022] Updated weights on worker 0-0, policy_version 1170319 (0.00081) [2022-07-11 11:23:05,184][26022] Updated weights on worker 0-0, policy_version 1170329 (0.00090) [2022-07-11 11:23:05,429][25689] Fps is (10 sec: 5626.1, 60 sec: 5599.3, 300 sec: 5542.4). Total num frames: 1198418944. Throughput: 0: 5720.2. Samples: 1198417482. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:05,430][25689] Avg episode reward: [(0, '1.081')] [2022-07-11 11:23:07,147][26022] Updated weights on worker 0-0, policy_version 1170339 (0.00094) [2022-07-11 11:23:08,780][26022] Updated weights on worker 0-0, policy_version 1170349 (0.00090) [2022-07-11 11:23:10,455][25689] Fps is (10 sec: 5237.9, 60 sec: 5531.6, 300 sec: 5529.6). Total num frames: 1198444544. Throughput: 0: 5734.6. Samples: 1198451204. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:10,456][25689] Avg episode reward: [(0, '0.800')] [2022-07-11 11:23:10,715][26022] Updated weights on worker 0-0, policy_version 1170359 (0.00084) [2022-07-11 11:23:12,276][26022] Updated weights on worker 0-0, policy_version 1170369 (0.00088) [2022-07-11 11:23:14,210][26022] Updated weights on worker 0-0, policy_version 1170379 (0.00084) [2022-07-11 11:23:15,502][25689] Fps is (10 sec: 5489.6, 60 sec: 5580.3, 300 sec: 5536.6). Total num frames: 1198474240. Throughput: 0: 4885.7. Samples: 1198468056. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:15,507][25689] Avg episode reward: [(0, '0.695')] [2022-07-11 11:23:16,324][26022] Updated weights on worker 0-0, policy_version 1170389 (0.00086) [2022-07-11 11:23:18,039][26022] Updated weights on worker 0-0, policy_version 1170399 (0.00092) [2022-07-11 11:23:19,815][26022] Updated weights on worker 0-0, policy_version 1170409 (0.00099) [2022-07-11 11:23:20,510][25689] Fps is (10 sec: 5702.8, 60 sec: 5566.4, 300 sec: 5538.1). Total num frames: 1198501888. Throughput: 0: 5715.8. Samples: 1198501628. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:20,512][25689] Avg episode reward: [(0, '0.731')] [2022-07-11 11:23:21,800][26022] Updated weights on worker 0-0, policy_version 1170419 (0.00407) [2022-07-11 11:23:23,219][26022] Updated weights on worker 0-0, policy_version 1170429 (0.00087) [2022-07-11 11:23:25,434][26022] Updated weights on worker 0-0, policy_version 1170439 (0.00107) [2022-07-11 11:23:25,573][25689] Fps is (10 sec: 5592.4, 60 sec: 5586.8, 300 sec: 5540.8). Total num frames: 1198530560. Throughput: 0: 5849.4. Samples: 1198535310. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:25,573][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 11:23:27,288][26022] Updated weights on worker 0-0, policy_version 1170449 (0.00081) [2022-07-11 11:23:28,954][26022] Updated weights on worker 0-0, policy_version 1170459 (0.00086) [2022-07-11 11:23:30,671][25689] Fps is (10 sec: 5543.0, 60 sec: 5565.5, 300 sec: 5542.8). Total num frames: 1198558208. Throughput: 0: 4984.2. Samples: 1198551964. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:30,671][25689] Avg episode reward: [(0, '1.161')] [2022-07-11 11:23:30,827][26022] Updated weights on worker 0-0, policy_version 1170469 (0.00087) [2022-07-11 11:23:32,628][26022] Updated weights on worker 0-0, policy_version 1170479 (0.00093) [2022-07-11 11:23:34,528][26022] Updated weights on worker 0-0, policy_version 1170489 (0.00093) [2022-07-11 11:23:35,697][25689] Fps is (10 sec: 5562.8, 60 sec: 5580.5, 300 sec: 5539.3). Total num frames: 1198586880. Throughput: 0: 5813.0. Samples: 1198585450. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:35,698][25689] Avg episode reward: [(0, '1.281')] [2022-07-11 11:23:36,588][26022] Updated weights on worker 0-0, policy_version 1170499 (0.00080) [2022-07-11 11:23:38,107][26022] Updated weights on worker 0-0, policy_version 1170509 (0.00083) [2022-07-11 11:23:40,296][26022] Updated weights on worker 0-0, policy_version 1170519 (0.00086) [2022-07-11 11:23:40,759][25689] Fps is (10 sec: 5785.6, 60 sec: 5593.9, 300 sec: 5547.4). Total num frames: 1198616576. Throughput: 0: 5801.6. Samples: 1198619106. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:40,760][25689] Avg episode reward: [(0, '1.599')] [2022-07-11 11:23:41,988][26022] Updated weights on worker 0-0, policy_version 1170529 (0.00095) [2022-07-11 11:23:43,662][26022] Updated weights on worker 0-0, policy_version 1170539 (0.00086) [2022-07-11 11:23:45,413][26022] Updated weights on worker 0-0, policy_version 1170549 (0.00087) [2022-07-11 11:23:45,857][25689] Fps is (10 sec: 5644.3, 60 sec: 5593.6, 300 sec: 5549.3). Total num frames: 1198644224. Throughput: 0: 5806.0. Samples: 1198653080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:45,857][25689] Avg episode reward: [(0, '1.613')] [2022-07-11 11:23:47,431][26022] Updated weights on worker 0-0, policy_version 1170559 (0.00085) [2022-07-11 11:23:48,904][26022] Updated weights on worker 0-0, policy_version 1170569 (0.00085) [2022-07-11 11:23:50,867][25689] Fps is (10 sec: 5369.3, 60 sec: 5532.5, 300 sec: 5539.3). Total num frames: 1198670848. Throughput: 0: 5846.7. Samples: 1198670046. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:50,868][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 11:23:51,179][26022] Updated weights on worker 0-0, policy_version 1170579 (0.00093) [2022-07-11 11:23:52,467][26022] Updated weights on worker 0-0, policy_version 1170589 (0.00088) [2022-07-11 11:23:54,872][26022] Updated weights on worker 0-0, policy_version 1170599 (0.00084) [2022-07-11 11:23:55,891][25689] Fps is (10 sec: 5714.8, 60 sec: 5605.0, 300 sec: 5556.9). Total num frames: 1198701568. Throughput: 0: 5858.1. Samples: 1198703748. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:23:55,892][25689] Avg episode reward: [(0, '0.006')] [2022-07-11 11:23:56,121][26022] Updated weights on worker 0-0, policy_version 1170609 (0.00079) [2022-07-11 11:23:58,317][26022] Updated weights on worker 0-0, policy_version 1170619 (0.00085) [2022-07-11 11:23:59,900][26022] Updated weights on worker 0-0, policy_version 1170629 (0.00100) [2022-07-11 11:24:00,903][25689] Fps is (10 sec: 5714.0, 60 sec: 5558.1, 300 sec: 5554.2). Total num frames: 1198728192. Throughput: 0: 5873.7. Samples: 1198737424. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:00,903][25689] Avg episode reward: [(0, '-0.038')] [2022-07-11 11:24:02,528][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:24:02,543][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001170639_1198734336.pth [2022-07-11 11:24:02,544][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001168689_1196737536.pth [2022-07-11 11:24:02,557][26022] Updated weights on worker 0-0, policy_version 1170639 (0.00090) [2022-07-11 11:24:03,987][26022] Updated weights on worker 0-0, policy_version 1170649 (0.00085) [2022-07-11 11:24:05,959][25689] Fps is (10 sec: 5085.7, 60 sec: 5515.0, 300 sec: 5546.5). Total num frames: 1198752768. Throughput: 0: 4908.5. Samples: 1198751750. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:05,959][25689] Avg episode reward: [(0, '-0.265')] [2022-07-11 11:24:06,067][26022] Updated weights on worker 0-0, policy_version 1170659 (0.00084) [2022-07-11 11:24:07,747][26022] Updated weights on worker 0-0, policy_version 1170669 (0.00083) [2022-07-11 11:24:09,615][26022] Updated weights on worker 0-0, policy_version 1170679 (0.00088) [2022-07-11 11:24:10,990][25689] Fps is (10 sec: 5380.4, 60 sec: 5582.2, 300 sec: 5549.4). Total num frames: 1198782464. Throughput: 0: 5740.4. Samples: 1198785558. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:10,990][25689] Avg episode reward: [(0, '-0.611')] [2022-07-11 11:24:11,364][26022] Updated weights on worker 0-0, policy_version 1170689 (0.00081) [2022-07-11 11:24:13,484][26022] Updated weights on worker 0-0, policy_version 1170699 (0.00084) [2022-07-11 11:24:15,167][26022] Updated weights on worker 0-0, policy_version 1170709 (0.00085) [2022-07-11 11:24:15,993][25689] Fps is (10 sec: 5816.7, 60 sec: 5569.3, 300 sec: 5549.9). Total num frames: 1198811136. Throughput: 0: 5746.5. Samples: 1198819264. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:15,994][25689] Avg episode reward: [(0, '-0.307')] [2022-07-11 11:24:17,025][26022] Updated weights on worker 0-0, policy_version 1170719 (0.00092) [2022-07-11 11:24:18,638][26022] Updated weights on worker 0-0, policy_version 1170729 (0.00074) [2022-07-11 11:24:20,527][26022] Updated weights on worker 0-0, policy_version 1170739 (0.00085) [2022-07-11 11:24:21,018][25689] Fps is (10 sec: 5615.9, 60 sec: 5567.8, 300 sec: 5550.3). Total num frames: 1198838784. Throughput: 0: 4906.9. Samples: 1198836130. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:21,020][25689] Avg episode reward: [(0, '0.428')] [2022-07-11 11:24:22,515][26022] Updated weights on worker 0-0, policy_version 1170749 (0.00884) [2022-07-11 11:24:24,312][26022] Updated weights on worker 0-0, policy_version 1170759 (0.00086) [2022-07-11 11:24:26,089][25689] Fps is (10 sec: 5578.6, 60 sec: 5567.0, 300 sec: 5553.5). Total num frames: 1198867456. Throughput: 0: 5858.4. Samples: 1198869682. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:26,091][25689] Avg episode reward: [(0, '1.315')] [2022-07-11 11:24:26,097][26022] Updated weights on worker 0-0, policy_version 1170769 (0.00104) [2022-07-11 11:24:27,811][26022] Updated weights on worker 0-0, policy_version 1170779 (0.00099) [2022-07-11 11:24:29,703][26022] Updated weights on worker 0-0, policy_version 1170789 (0.00090) [2022-07-11 11:24:31,096][25689] Fps is (10 sec: 5588.4, 60 sec: 5575.4, 300 sec: 5553.4). Total num frames: 1198895104. Throughput: 0: 5845.5. Samples: 1198903092. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:31,097][25689] Avg episode reward: [(0, '1.006')] [2022-07-11 11:24:31,503][26022] Updated weights on worker 0-0, policy_version 1170799 (0.00081) [2022-07-11 11:24:33,547][26022] Updated weights on worker 0-0, policy_version 1170809 (0.00088) [2022-07-11 11:24:35,314][26022] Updated weights on worker 0-0, policy_version 1170819 (0.00080) [2022-07-11 11:24:36,175][25689] Fps is (10 sec: 5381.2, 60 sec: 5536.8, 300 sec: 5552.0). Total num frames: 1198921728. Throughput: 0: 4974.9. Samples: 1198919664. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2022-07-11 11:24:36,176][25689] Avg episode reward: [(0, '0.108')] [2022-07-11 11:24:37,331][26022] Updated weights on worker 0-0, policy_version 1170829 (0.00082) [2022-07-11 11:24:38,891][26022] Updated weights on worker 0-0, policy_version 1170839 (0.00087) [2022-07-11 11:24:40,899][26022] Updated weights on worker 0-0, policy_version 1170849 (0.00090) [2022-07-11 11:24:41,181][25689] Fps is (10 sec: 5585.0, 60 sec: 5541.9, 300 sec: 5553.1). Total num frames: 1198951424. Throughput: 0: 5819.4. Samples: 1198953462. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:24:41,182][25689] Avg episode reward: [(0, '-0.665')] [2022-07-11 11:24:42,671][26022] Updated weights on worker 0-0, policy_version 1170859 (0.00090) [2022-07-11 11:24:44,430][26022] Updated weights on worker 0-0, policy_version 1170869 (0.00085) [2022-07-11 11:24:46,265][25689] Fps is (10 sec: 5581.7, 60 sec: 5526.2, 300 sec: 5545.0). Total num frames: 1198978048. Throughput: 0: 5817.6. Samples: 1198987056. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:24:46,266][25689] Avg episode reward: [(0, '-1.042')] [2022-07-11 11:24:46,456][26022] Updated weights on worker 0-0, policy_version 1170879 (0.00081) [2022-07-11 11:24:48,070][26022] Updated weights on worker 0-0, policy_version 1170889 (0.00087) [2022-07-11 11:24:50,073][26022] Updated weights on worker 0-0, policy_version 1170899 (0.00084) [2022-07-11 11:24:51,357][25689] Fps is (10 sec: 5534.2, 60 sec: 5569.4, 300 sec: 5546.8). Total num frames: 1199007744. Throughput: 0: 4962.7. Samples: 1199003640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:24:51,358][25689] Avg episode reward: [(0, '-1.431')] [2022-07-11 11:24:51,947][26022] Updated weights on worker 0-0, policy_version 1170909 (0.00084) [2022-07-11 11:24:53,602][26022] Updated weights on worker 0-0, policy_version 1170919 (0.00091) [2022-07-11 11:24:55,554][26022] Updated weights on worker 0-0, policy_version 1170929 (0.00092) [2022-07-11 11:24:56,369][25689] Fps is (10 sec: 5675.0, 60 sec: 5519.8, 300 sec: 5553.8). Total num frames: 1199035392. Throughput: 0: 5826.7. Samples: 1199037332. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:24:56,370][25689] Avg episode reward: [(0, '-1.074')] [2022-07-11 11:24:57,267][26022] Updated weights on worker 0-0, policy_version 1170939 (0.00088) [2022-07-11 11:24:59,283][26022] Updated weights on worker 0-0, policy_version 1170949 (0.00086) [2022-07-11 11:25:01,021][26022] Updated weights on worker 0-0, policy_version 1170959 (0.00087) [2022-07-11 11:25:01,464][25689] Fps is (10 sec: 5471.3, 60 sec: 5529.1, 300 sec: 5553.0). Total num frames: 1199063040. Throughput: 0: 5768.0. Samples: 1199070456. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:01,464][25689] Avg episode reward: [(0, '-1.014')] [2022-07-11 11:25:03,347][26022] Updated weights on worker 0-0, policy_version 1170969 (0.00086) [2022-07-11 11:25:05,198][26022] Updated weights on worker 0-0, policy_version 1170979 (0.00086) [2022-07-11 11:25:06,548][25689] Fps is (10 sec: 5331.8, 60 sec: 5560.3, 300 sec: 5551.8). Total num frames: 1199089664. Throughput: 0: 4832.3. Samples: 1199085072. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:06,549][25689] Avg episode reward: [(0, '-0.080')] [2022-07-11 11:25:07,161][26022] Updated weights on worker 0-0, policy_version 1170989 (0.00086) [2022-07-11 11:25:08,864][26022] Updated weights on worker 0-0, policy_version 1170999 (0.00106) [2022-07-11 11:25:10,907][26022] Updated weights on worker 0-0, policy_version 1171009 (0.00082) [2022-07-11 11:25:11,589][25689] Fps is (10 sec: 5360.4, 60 sec: 5525.6, 300 sec: 5547.7). Total num frames: 1199117312. Throughput: 0: 5664.0. Samples: 1199118230. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:11,589][25689] Avg episode reward: [(0, '-0.306')] [2022-07-11 11:25:12,556][26022] Updated weights on worker 0-0, policy_version 1171019 (0.00090) [2022-07-11 11:25:14,593][26022] Updated weights on worker 0-0, policy_version 1171029 (0.00081) [2022-07-11 11:25:16,350][26022] Updated weights on worker 0-0, policy_version 1171039 (0.00099) [2022-07-11 11:25:16,611][25689] Fps is (10 sec: 5393.6, 60 sec: 5490.2, 300 sec: 5540.6). Total num frames: 1199143936. Throughput: 0: 5638.8. Samples: 1199151468. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:16,611][25689] Avg episode reward: [(0, '-0.388')] [2022-07-11 11:25:18,118][26022] Updated weights on worker 0-0, policy_version 1171049 (0.00089) [2022-07-11 11:25:20,047][26022] Updated weights on worker 0-0, policy_version 1171059 (0.00088) [2022-07-11 11:25:21,624][25689] Fps is (10 sec: 5611.9, 60 sec: 5525.0, 300 sec: 5549.8). Total num frames: 1199173632. Throughput: 0: 4829.2. Samples: 1199167814. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:21,625][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 11:25:21,883][26022] Updated weights on worker 0-0, policy_version 1171069 (0.00082) [2022-07-11 11:25:23,644][26022] Updated weights on worker 0-0, policy_version 1171079 (0.00052) [2022-07-11 11:25:25,772][26022] Updated weights on worker 0-0, policy_version 1171089 (0.00096) [2022-07-11 11:25:26,676][25689] Fps is (10 sec: 5697.0, 60 sec: 5509.8, 300 sec: 5549.3). Total num frames: 1199201280. Throughput: 0: 5778.8. Samples: 1199201388. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:26,677][25689] Avg episode reward: [(0, '0.042')] [2022-07-11 11:25:27,348][26022] Updated weights on worker 0-0, policy_version 1171099 (0.00082) [2022-07-11 11:25:29,218][26022] Updated weights on worker 0-0, policy_version 1171109 (0.00084) [2022-07-11 11:25:31,038][26022] Updated weights on worker 0-0, policy_version 1171119 (0.00086) [2022-07-11 11:25:31,699][25689] Fps is (10 sec: 5488.8, 60 sec: 5508.4, 300 sec: 5550.0). Total num frames: 1199228928. Throughput: 0: 5794.0. Samples: 1199234748. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:31,699][25689] Avg episode reward: [(0, '0.068')] [2022-07-11 11:25:32,781][26022] Updated weights on worker 0-0, policy_version 1171129 (0.00086) [2022-07-11 11:25:34,931][26022] Updated weights on worker 0-0, policy_version 1171139 (0.00105) [2022-07-11 11:25:36,702][25689] Fps is (10 sec: 5515.3, 60 sec: 5532.1, 300 sec: 5544.0). Total num frames: 1199256576. Throughput: 0: 4986.8. Samples: 1199251662. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:36,703][25689] Avg episode reward: [(0, '-0.154')] [2022-07-11 11:25:36,711][26022] Updated weights on worker 0-0, policy_version 1171149 (0.00089) [2022-07-11 11:25:38,365][26022] Updated weights on worker 0-0, policy_version 1171159 (0.00091) [2022-07-11 11:25:40,335][26022] Updated weights on worker 0-0, policy_version 1171169 (0.00088) [2022-07-11 11:25:41,722][25689] Fps is (10 sec: 5619.2, 60 sec: 5514.0, 300 sec: 5551.1). Total num frames: 1199285248. Throughput: 0: 5834.1. Samples: 1199285062. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:41,723][25689] Avg episode reward: [(0, '0.810')] [2022-07-11 11:25:41,982][26022] Updated weights on worker 0-0, policy_version 1171179 (0.00091) [2022-07-11 11:25:43,993][26022] Updated weights on worker 0-0, policy_version 1171189 (0.00091) [2022-07-11 11:25:45,823][26022] Updated weights on worker 0-0, policy_version 1171199 (0.00083) [2022-07-11 11:25:46,774][25689] Fps is (10 sec: 5592.0, 60 sec: 5533.8, 300 sec: 5547.6). Total num frames: 1199312896. Throughput: 0: 5838.0. Samples: 1199318718. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:46,775][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 11:25:47,381][26022] Updated weights on worker 0-0, policy_version 1171209 (0.00091) [2022-07-11 11:25:49,356][26022] Updated weights on worker 0-0, policy_version 1171219 (0.00084) [2022-07-11 11:25:51,158][26022] Updated weights on worker 0-0, policy_version 1171229 (0.00092) [2022-07-11 11:25:51,788][25689] Fps is (10 sec: 5492.9, 60 sec: 5507.1, 300 sec: 5547.4). Total num frames: 1199340544. Throughput: 0: 5035.0. Samples: 1199335900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:51,790][25689] Avg episode reward: [(0, '1.109')] [2022-07-11 11:25:52,868][26022] Updated weights on worker 0-0, policy_version 1171239 (0.00086) [2022-07-11 11:25:55,055][26022] Updated weights on worker 0-0, policy_version 1171249 (0.00081) [2022-07-11 11:25:56,589][26022] Updated weights on worker 0-0, policy_version 1171259 (0.00093) [2022-07-11 11:25:56,807][25689] Fps is (10 sec: 5715.7, 60 sec: 5540.4, 300 sec: 5550.8). Total num frames: 1199370240. Throughput: 0: 5856.2. Samples: 1199369396. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:25:56,807][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 11:25:58,527][26022] Updated weights on worker 0-0, policy_version 1171269 (0.00091) [2022-07-11 11:26:00,284][26022] Updated weights on worker 0-0, policy_version 1171279 (0.00085) [2022-07-11 11:26:01,816][25689] Fps is (10 sec: 5514.4, 60 sec: 5514.3, 300 sec: 5552.4). Total num frames: 1199395840. Throughput: 0: 5887.2. Samples: 1199403362. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:01,816][25689] Avg episode reward: [(0, '0.899')] [2022-07-11 11:26:02,348][26022] Updated weights on worker 0-0, policy_version 1171289 (0.00087) [2022-07-11 11:26:02,581][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:26:02,597][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001171291_1199401984.pth [2022-07-11 11:26:02,601][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001169336_1197400064.pth [2022-07-11 11:26:04,478][26022] Updated weights on worker 0-0, policy_version 1171299 (0.00091) [2022-07-11 11:26:06,033][26022] Updated weights on worker 0-0, policy_version 1171309 (0.00086) [2022-07-11 11:26:06,974][25689] Fps is (10 sec: 5237.0, 60 sec: 5524.5, 300 sec: 5546.6). Total num frames: 1199423488. Throughput: 0: 4920.9. Samples: 1199418122. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:06,975][25689] Avg episode reward: [(0, '0.034')] [2022-07-11 11:26:07,979][26022] Updated weights on worker 0-0, policy_version 1171319 (0.00085) [2022-07-11 11:26:09,836][26022] Updated weights on worker 0-0, policy_version 1171329 (0.00083) [2022-07-11 11:26:11,642][26022] Updated weights on worker 0-0, policy_version 1171339 (0.00079) [2022-07-11 11:26:12,073][25689] Fps is (10 sec: 5590.8, 60 sec: 5553.1, 300 sec: 5552.0). Total num frames: 1199453184. Throughput: 0: 5698.9. Samples: 1199451498. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:12,073][25689] Avg episode reward: [(0, '-0.195')] [2022-07-11 11:26:13,592][26022] Updated weights on worker 0-0, policy_version 1171349 (0.00091) [2022-07-11 11:26:15,322][26022] Updated weights on worker 0-0, policy_version 1171359 (0.00089) [2022-07-11 11:26:17,087][25689] Fps is (10 sec: 5569.2, 60 sec: 5553.8, 300 sec: 5542.1). Total num frames: 1199479808. Throughput: 0: 5704.3. Samples: 1199485080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:17,087][25689] Avg episode reward: [(0, '-0.104')] [2022-07-11 11:26:17,201][26022] Updated weights on worker 0-0, policy_version 1171369 (0.00083) [2022-07-11 11:26:18,995][26022] Updated weights on worker 0-0, policy_version 1171379 (0.00083) [2022-07-11 11:26:20,709][26022] Updated weights on worker 0-0, policy_version 1171389 (0.00083) [2022-07-11 11:26:22,113][25689] Fps is (10 sec: 5507.3, 60 sec: 5535.7, 300 sec: 5553.6). Total num frames: 1199508480. Throughput: 0: 5686.9. Samples: 1199518792. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:22,114][25689] Avg episode reward: [(0, '-0.184')] [2022-07-11 11:26:22,646][26022] Updated weights on worker 0-0, policy_version 1171399 (0.00080) [2022-07-11 11:26:24,579][26022] Updated weights on worker 0-0, policy_version 1171409 (0.00049) [2022-07-11 11:26:26,316][26022] Updated weights on worker 0-0, policy_version 1171419 (0.00081) [2022-07-11 11:26:27,180][25689] Fps is (10 sec: 5783.2, 60 sec: 5568.2, 300 sec: 5552.6). Total num frames: 1199538176. Throughput: 0: 5819.1. Samples: 1199535700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:27,180][25689] Avg episode reward: [(0, '-0.209')] [2022-07-11 11:26:28,241][26022] Updated weights on worker 0-0, policy_version 1171429 (0.00095) [2022-07-11 11:26:29,975][26022] Updated weights on worker 0-0, policy_version 1171439 (0.00087) [2022-07-11 11:26:31,890][26022] Updated weights on worker 0-0, policy_version 1171449 (0.00097) [2022-07-11 11:26:32,216][25689] Fps is (10 sec: 5676.0, 60 sec: 5566.9, 300 sec: 5549.5). Total num frames: 1199565824. Throughput: 0: 5829.0. Samples: 1199568914. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:32,217][25689] Avg episode reward: [(0, '1.190')] [2022-07-11 11:26:33,752][26022] Updated weights on worker 0-0, policy_version 1171459 (0.00089) [2022-07-11 11:26:35,531][26022] Updated weights on worker 0-0, policy_version 1171469 (0.00088) [2022-07-11 11:26:37,227][25689] Fps is (10 sec: 5503.7, 60 sec: 5566.3, 300 sec: 5556.3). Total num frames: 1199593472. Throughput: 0: 5806.3. Samples: 1199602018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:37,227][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 11:26:37,419][26022] Updated weights on worker 0-0, policy_version 1171479 (0.00089) [2022-07-11 11:26:39,273][26022] Updated weights on worker 0-0, policy_version 1171489 (0.00092) [2022-07-11 11:26:41,166][26022] Updated weights on worker 0-0, policy_version 1171499 (0.00086) [2022-07-11 11:26:42,234][25689] Fps is (10 sec: 5519.6, 60 sec: 5550.4, 300 sec: 5550.3). Total num frames: 1199621120. Throughput: 0: 4970.2. Samples: 1199618798. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:42,235][25689] Avg episode reward: [(0, '0.677')] [2022-07-11 11:26:43,036][26022] Updated weights on worker 0-0, policy_version 1171509 (0.00091) [2022-07-11 11:26:44,689][26022] Updated weights on worker 0-0, policy_version 1171519 (0.00092) [2022-07-11 11:26:46,463][26022] Updated weights on worker 0-0, policy_version 1171529 (0.00092) [2022-07-11 11:26:47,311][25689] Fps is (10 sec: 5381.6, 60 sec: 5531.2, 300 sec: 5545.5). Total num frames: 1199647744. Throughput: 0: 5795.4. Samples: 1199652372. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:47,312][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 11:26:48,287][26022] Updated weights on worker 0-0, policy_version 1171539 (0.00088) [2022-07-11 11:26:50,106][26022] Updated weights on worker 0-0, policy_version 1171549 (0.00093) [2022-07-11 11:26:52,175][26022] Updated weights on worker 0-0, policy_version 1171559 (0.00088) [2022-07-11 11:26:52,364][25689] Fps is (10 sec: 5459.1, 60 sec: 5544.7, 300 sec: 5548.1). Total num frames: 1199676416. Throughput: 0: 5813.1. Samples: 1199686032. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:52,364][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 11:26:53,804][26022] Updated weights on worker 0-0, policy_version 1171569 (0.00084) [2022-07-11 11:26:55,887][26022] Updated weights on worker 0-0, policy_version 1171579 (0.00085) [2022-07-11 11:26:57,371][25689] Fps is (10 sec: 5802.0, 60 sec: 5545.7, 300 sec: 5551.5). Total num frames: 1199706112. Throughput: 0: 5014.8. Samples: 1199703040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:26:57,372][25689] Avg episode reward: [(0, '-0.209')] [2022-07-11 11:26:57,505][26022] Updated weights on worker 0-0, policy_version 1171589 (0.00101) [2022-07-11 11:26:59,564][26022] Updated weights on worker 0-0, policy_version 1171599 (0.00092) [2022-07-11 11:27:01,031][26022] Updated weights on worker 0-0, policy_version 1171609 (0.00086) [2022-07-11 11:27:02,378][25689] Fps is (10 sec: 5623.7, 60 sec: 5562.8, 300 sec: 5555.8). Total num frames: 1199732736. Throughput: 0: 5859.6. Samples: 1199736834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:02,379][25689] Avg episode reward: [(0, '-0.263')] [2022-07-11 11:27:03,443][26022] Updated weights on worker 0-0, policy_version 1171619 (0.00056) [2022-07-11 11:27:05,207][26022] Updated weights on worker 0-0, policy_version 1171629 (0.00085) [2022-07-11 11:27:07,296][26022] Updated weights on worker 0-0, policy_version 1171639 (0.00085) [2022-07-11 11:27:07,484][25689] Fps is (10 sec: 5265.5, 60 sec: 5550.7, 300 sec: 5543.9). Total num frames: 1199759360. Throughput: 0: 5723.8. Samples: 1199767836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:07,484][25689] Avg episode reward: [(0, '-0.052')] [2022-07-11 11:27:08,991][26022] Updated weights on worker 0-0, policy_version 1171649 (0.00089) [2022-07-11 11:27:10,872][26022] Updated weights on worker 0-0, policy_version 1171659 (0.00092) [2022-07-11 11:27:12,491][25689] Fps is (10 sec: 5366.5, 60 sec: 5525.2, 300 sec: 5547.7). Total num frames: 1199787008. Throughput: 0: 4898.7. Samples: 1199784632. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:12,492][25689] Avg episode reward: [(0, '0.059')] [2022-07-11 11:27:12,579][26022] Updated weights on worker 0-0, policy_version 1171669 (0.00089) [2022-07-11 11:27:14,519][26022] Updated weights on worker 0-0, policy_version 1171679 (0.00091) [2022-07-11 11:27:16,289][26022] Updated weights on worker 0-0, policy_version 1171689 (0.00080) [2022-07-11 11:27:17,494][25689] Fps is (10 sec: 5626.5, 60 sec: 5560.2, 300 sec: 5548.4). Total num frames: 1199815680. Throughput: 0: 5734.0. Samples: 1199818422. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:17,494][25689] Avg episode reward: [(0, '-0.005')] [2022-07-11 11:27:18,295][26022] Updated weights on worker 0-0, policy_version 1171699 (0.00093) [2022-07-11 11:27:19,832][26022] Updated weights on worker 0-0, policy_version 1171709 (0.00090) [2022-07-11 11:27:21,895][26022] Updated weights on worker 0-0, policy_version 1171719 (0.00089) [2022-07-11 11:27:22,523][25689] Fps is (10 sec: 5614.3, 60 sec: 5543.0, 300 sec: 5549.8). Total num frames: 1199843328. Throughput: 0: 5716.2. Samples: 1199851984. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:22,523][25689] Avg episode reward: [(0, '0.046')] [2022-07-11 11:27:23,559][26022] Updated weights on worker 0-0, policy_version 1171729 (0.00056) [2022-07-11 11:27:25,398][26022] Updated weights on worker 0-0, policy_version 1171739 (0.00085) [2022-07-11 11:27:27,378][26022] Updated weights on worker 0-0, policy_version 1171749 (0.00086) [2022-07-11 11:27:27,589][25689] Fps is (10 sec: 5680.1, 60 sec: 5542.9, 300 sec: 5552.9). Total num frames: 1199873024. Throughput: 0: 5017.6. Samples: 1199868718. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:27,590][25689] Avg episode reward: [(0, '1.096')] [2022-07-11 11:27:29,135][26022] Updated weights on worker 0-0, policy_version 1171759 (0.00090) [2022-07-11 11:27:30,978][26022] Updated weights on worker 0-0, policy_version 1171769 (0.00084) [2022-07-11 11:27:32,675][25689] Fps is (10 sec: 5648.7, 60 sec: 5538.5, 300 sec: 5551.4). Total num frames: 1199900672. Throughput: 0: 5832.0. Samples: 1199902340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:32,675][25689] Avg episode reward: [(0, '0.261')] [2022-07-11 11:27:32,766][26022] Updated weights on worker 0-0, policy_version 1171779 (0.00094) [2022-07-11 11:27:34,587][26022] Updated weights on worker 0-0, policy_version 1171789 (0.00087) [2022-07-11 11:27:36,448][26022] Updated weights on worker 0-0, policy_version 1171799 (0.00095) [2022-07-11 11:27:37,767][25689] Fps is (10 sec: 5433.1, 60 sec: 5531.0, 300 sec: 5546.7). Total num frames: 1199928320. Throughput: 0: 5797.1. Samples: 1199935948. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:37,768][25689] Avg episode reward: [(0, '-0.018')] [2022-07-11 11:27:38,187][26022] Updated weights on worker 0-0, policy_version 1171809 (0.00090) [2022-07-11 11:27:40,039][26022] Updated weights on worker 0-0, policy_version 1171819 (0.00083) [2022-07-11 11:27:41,785][26022] Updated weights on worker 0-0, policy_version 1171829 (0.00469) [2022-07-11 11:27:42,792][25689] Fps is (10 sec: 5566.3, 60 sec: 5546.3, 300 sec: 5551.4). Total num frames: 1199956992. Throughput: 0: 4971.6. Samples: 1199952756. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:42,793][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 11:27:43,727][26022] Updated weights on worker 0-0, policy_version 1171839 (0.00088) [2022-07-11 11:27:45,738][26022] Updated weights on worker 0-0, policy_version 1171849 (0.00081) [2022-07-11 11:27:47,372][26022] Updated weights on worker 0-0, policy_version 1171859 (0.00084) [2022-07-11 11:27:47,839][25689] Fps is (10 sec: 5794.8, 60 sec: 5599.8, 300 sec: 5548.6). Total num frames: 1199986688. Throughput: 0: 5819.0. Samples: 1199986552. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:47,840][25689] Avg episode reward: [(0, '-0.092')] [2022-07-11 11:27:49,300][26022] Updated weights on worker 0-0, policy_version 1171869 (0.00092) [2022-07-11 11:27:51,082][26022] Updated weights on worker 0-0, policy_version 1171879 (0.00084) [2022-07-11 11:27:52,815][26022] Updated weights on worker 0-0, policy_version 1171889 (0.00085) [2022-07-11 11:27:52,914][25689] Fps is (10 sec: 5766.6, 60 sec: 5597.7, 300 sec: 5555.5). Total num frames: 1200015360. Throughput: 0: 5838.7. Samples: 1200020514. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:52,915][25689] Avg episode reward: [(0, '-0.159')] [2022-07-11 11:27:54,552][26022] Updated weights on worker 0-0, policy_version 1171899 (0.00088) [2022-07-11 11:27:56,499][26022] Updated weights on worker 0-0, policy_version 1171909 (0.00078) [2022-07-11 11:27:57,944][25689] Fps is (10 sec: 5573.5, 60 sec: 5561.8, 300 sec: 5549.1). Total num frames: 1200043008. Throughput: 0: 5031.8. Samples: 1200037476. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:27:57,945][25689] Avg episode reward: [(0, '-0.313')] [2022-07-11 11:27:58,119][26022] Updated weights on worker 0-0, policy_version 1171919 (0.00081) [2022-07-11 11:28:00,122][26022] Updated weights on worker 0-0, policy_version 1171929 (0.00080) [2022-07-11 11:28:01,801][26022] Updated weights on worker 0-0, policy_version 1171939 (0.00073) [2022-07-11 11:28:02,715][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:28:02,736][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001171941_1200067584.pth [2022-07-11 11:28:02,737][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001169984_1198063616.pth [2022-07-11 11:28:02,967][25689] Fps is (10 sec: 5195.1, 60 sec: 5526.6, 300 sec: 5540.9). Total num frames: 1200067584. Throughput: 0: 5874.7. Samples: 1200071274. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:02,967][25689] Avg episode reward: [(0, '-0.287')] [2022-07-11 11:28:04,147][26022] Updated weights on worker 0-0, policy_version 1171949 (0.00086) [2022-07-11 11:28:06,158][26022] Updated weights on worker 0-0, policy_version 1171959 (0.00088) [2022-07-11 11:28:07,570][26022] Updated weights on worker 0-0, policy_version 1171969 (0.00098) [2022-07-11 11:28:08,008][25689] Fps is (10 sec: 5392.7, 60 sec: 5583.2, 300 sec: 5554.4). Total num frames: 1200097280. Throughput: 0: 5756.3. Samples: 1200102650. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:08,009][25689] Avg episode reward: [(0, '0.633')] [2022-07-11 11:28:09,761][26022] Updated weights on worker 0-0, policy_version 1171979 (0.00085) [2022-07-11 11:28:11,356][26022] Updated weights on worker 0-0, policy_version 1171989 (0.00082) [2022-07-11 11:28:13,013][25689] Fps is (10 sec: 5708.2, 60 sec: 5583.5, 300 sec: 5548.3). Total num frames: 1200124928. Throughput: 0: 4912.3. Samples: 1200119244. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:13,013][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 11:28:13,467][26022] Updated weights on worker 0-0, policy_version 1171999 (0.00085) [2022-07-11 11:28:15,263][26022] Updated weights on worker 0-0, policy_version 1172009 (0.00086) [2022-07-11 11:28:16,979][26022] Updated weights on worker 0-0, policy_version 1172019 (0.00093) [2022-07-11 11:28:18,044][25689] Fps is (10 sec: 5611.8, 60 sec: 5580.8, 300 sec: 5551.3). Total num frames: 1200153600. Throughput: 0: 5747.9. Samples: 1200153008. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:18,045][25689] Avg episode reward: [(0, '0.522')] [2022-07-11 11:28:18,829][26022] Updated weights on worker 0-0, policy_version 1172029 (0.00094) [2022-07-11 11:28:20,524][26022] Updated weights on worker 0-0, policy_version 1172039 (0.00093) [2022-07-11 11:28:22,706][26022] Updated weights on worker 0-0, policy_version 1172049 (0.00088) [2022-07-11 11:28:23,047][25689] Fps is (10 sec: 5510.5, 60 sec: 5566.3, 300 sec: 5545.6). Total num frames: 1200180224. Throughput: 0: 5744.4. Samples: 1200186624. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:23,049][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 11:28:24,142][26022] Updated weights on worker 0-0, policy_version 1172059 (0.00083) [2022-07-11 11:28:26,226][26022] Updated weights on worker 0-0, policy_version 1172069 (0.00086) [2022-07-11 11:28:27,920][26022] Updated weights on worker 0-0, policy_version 1172079 (0.00088) [2022-07-11 11:28:28,182][25689] Fps is (10 sec: 5454.6, 60 sec: 5543.1, 300 sec: 5548.3). Total num frames: 1200208896. Throughput: 0: 4993.3. Samples: 1200203382. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:28,182][25689] Avg episode reward: [(0, '1.524')] [2022-07-11 11:28:29,721][26022] Updated weights on worker 0-0, policy_version 1172089 (0.01089) [2022-07-11 11:28:31,763][26022] Updated weights on worker 0-0, policy_version 1172099 (0.00086) [2022-07-11 11:28:33,201][25689] Fps is (10 sec: 5748.1, 60 sec: 5583.0, 300 sec: 5551.9). Total num frames: 1200238592. Throughput: 0: 5834.7. Samples: 1200237040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:33,202][25689] Avg episode reward: [(0, '1.438')] [2022-07-11 11:28:33,203][26022] Updated weights on worker 0-0, policy_version 1172109 (0.00092) [2022-07-11 11:28:35,235][26022] Updated weights on worker 0-0, policy_version 1172119 (0.00085) [2022-07-11 11:28:37,267][26022] Updated weights on worker 0-0, policy_version 1172129 (0.00092) [2022-07-11 11:28:38,209][25689] Fps is (10 sec: 5617.1, 60 sec: 5573.9, 300 sec: 5542.6). Total num frames: 1200265216. Throughput: 0: 5842.9. Samples: 1200270824. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:28:38,210][25689] Avg episode reward: [(0, '1.378')] [2022-07-11 11:28:38,808][26022] Updated weights on worker 0-0, policy_version 1172139 (0.00087) [2022-07-11 11:28:41,015][26022] Updated weights on worker 0-0, policy_version 1172149 (0.00098) [2022-07-11 11:28:42,474][26022] Updated weights on worker 0-0, policy_version 1172159 (0.00084) [2022-07-11 11:28:43,219][25689] Fps is (10 sec: 5417.9, 60 sec: 5558.4, 300 sec: 5544.2). Total num frames: 1200292864. Throughput: 0: 4995.2. Samples: 1200287384. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:28:43,219][25689] Avg episode reward: [(0, '1.314')] [2022-07-11 11:28:44,441][26022] Updated weights on worker 0-0, policy_version 1172169 (0.00086) [2022-07-11 11:28:46,420][26022] Updated weights on worker 0-0, policy_version 1172179 (0.00093) [2022-07-11 11:28:47,983][26022] Updated weights on worker 0-0, policy_version 1172189 (0.00089) [2022-07-11 11:28:48,284][25689] Fps is (10 sec: 5793.3, 60 sec: 5573.7, 300 sec: 5557.0). Total num frames: 1200323584. Throughput: 0: 5861.8. Samples: 1200321214. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:28:48,284][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 11:28:50,003][26022] Updated weights on worker 0-0, policy_version 1172199 (0.00093) [2022-07-11 11:28:51,462][26022] Updated weights on worker 0-0, policy_version 1172209 (0.00087) [2022-07-11 11:28:53,359][25689] Fps is (10 sec: 5655.2, 60 sec: 5539.8, 300 sec: 5542.3). Total num frames: 1200350208. Throughput: 0: 5843.5. Samples: 1200354830. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:28:53,359][25689] Avg episode reward: [(0, '1.308')] [2022-07-11 11:28:53,541][26022] Updated weights on worker 0-0, policy_version 1172219 (0.00092) [2022-07-11 11:28:55,610][26022] Updated weights on worker 0-0, policy_version 1172229 (0.00081) [2022-07-11 11:28:57,226][26022] Updated weights on worker 0-0, policy_version 1172239 (0.00087) [2022-07-11 11:28:58,405][25689] Fps is (10 sec: 5361.9, 60 sec: 5538.2, 300 sec: 5545.0). Total num frames: 1200377856. Throughput: 0: 4990.9. Samples: 1200371624. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:28:58,406][25689] Avg episode reward: [(0, '0.901')] [2022-07-11 11:28:59,047][26022] Updated weights on worker 0-0, policy_version 1172249 (0.00086) [2022-07-11 11:29:00,928][26022] Updated weights on worker 0-0, policy_version 1172259 (0.00080) [2022-07-11 11:29:03,007][26022] Updated weights on worker 0-0, policy_version 1172269 (0.00084) [2022-07-11 11:29:03,489][25689] Fps is (10 sec: 5357.5, 60 sec: 5566.5, 300 sec: 5551.4). Total num frames: 1200404480. Throughput: 0: 5819.1. Samples: 1200405338. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:03,489][25689] Avg episode reward: [(0, '1.029')] [2022-07-11 11:29:04,959][26022] Updated weights on worker 0-0, policy_version 1172279 (0.00088) [2022-07-11 11:29:06,872][26022] Updated weights on worker 0-0, policy_version 1172289 (0.00092) [2022-07-11 11:29:08,469][26022] Updated weights on worker 0-0, policy_version 1172299 (0.00092) [2022-07-11 11:29:08,614][25689] Fps is (10 sec: 5517.2, 60 sec: 5558.9, 300 sec: 5549.7). Total num frames: 1200434176. Throughput: 0: 5696.2. Samples: 1200437018. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:08,614][25689] Avg episode reward: [(0, '0.989')] [2022-07-11 11:29:10,478][26022] Updated weights on worker 0-0, policy_version 1172309 (0.00087) [2022-07-11 11:29:12,090][26022] Updated weights on worker 0-0, policy_version 1172319 (0.00096) [2022-07-11 11:29:13,647][25689] Fps is (10 sec: 5645.1, 60 sec: 5556.2, 300 sec: 5545.7). Total num frames: 1200461824. Throughput: 0: 5714.6. Samples: 1200470770. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:13,648][25689] Avg episode reward: [(0, '1.544')] [2022-07-11 11:29:14,037][26022] Updated weights on worker 0-0, policy_version 1172329 (0.00090) [2022-07-11 11:29:15,887][26022] Updated weights on worker 0-0, policy_version 1172339 (0.00064) [2022-07-11 11:29:17,661][26022] Updated weights on worker 0-0, policy_version 1172349 (0.00093) [2022-07-11 11:29:18,704][25689] Fps is (10 sec: 5581.7, 60 sec: 5553.9, 300 sec: 5548.5). Total num frames: 1200490496. Throughput: 0: 5711.1. Samples: 1200487550. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:18,704][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 11:29:19,598][26022] Updated weights on worker 0-0, policy_version 1172359 (0.00086) [2022-07-11 11:29:21,319][26022] Updated weights on worker 0-0, policy_version 1172369 (0.00083) [2022-07-11 11:29:23,298][26022] Updated weights on worker 0-0, policy_version 1172379 (0.00086) [2022-07-11 11:29:23,770][25689] Fps is (10 sec: 5664.6, 60 sec: 5581.8, 300 sec: 5548.6). Total num frames: 1200519168. Throughput: 0: 5722.9. Samples: 1200521406. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:23,771][25689] Avg episode reward: [(0, '1.250')] [2022-07-11 11:29:25,140][26022] Updated weights on worker 0-0, policy_version 1172389 (0.00085) [2022-07-11 11:29:26,808][26022] Updated weights on worker 0-0, policy_version 1172399 (0.00086) [2022-07-11 11:29:28,819][25689] Fps is (10 sec: 5466.5, 60 sec: 5555.9, 300 sec: 5544.4). Total num frames: 1200545792. Throughput: 0: 5834.9. Samples: 1200554916. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:28,820][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 11:29:28,873][26022] Updated weights on worker 0-0, policy_version 1172409 (0.00094) [2022-07-11 11:29:30,550][26022] Updated weights on worker 0-0, policy_version 1172419 (0.00095) [2022-07-11 11:29:32,583][26022] Updated weights on worker 0-0, policy_version 1172429 (0.00093) [2022-07-11 11:29:33,853][25689] Fps is (10 sec: 5687.4, 60 sec: 5571.5, 300 sec: 5559.0). Total num frames: 1200576512. Throughput: 0: 4990.2. Samples: 1200571602. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:33,855][25689] Avg episode reward: [(0, '0.538')] [2022-07-11 11:29:34,023][26022] Updated weights on worker 0-0, policy_version 1172439 (0.00091) [2022-07-11 11:29:36,017][26022] Updated weights on worker 0-0, policy_version 1172449 (0.00090) [2022-07-11 11:29:37,950][26022] Updated weights on worker 0-0, policy_version 1172459 (0.00091) [2022-07-11 11:29:38,912][25689] Fps is (10 sec: 5681.8, 60 sec: 5566.8, 300 sec: 5547.7). Total num frames: 1200603136. Throughput: 0: 5826.3. Samples: 1200605288. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:38,912][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 11:29:39,616][26022] Updated weights on worker 0-0, policy_version 1172469 (0.00093) [2022-07-11 11:29:41,566][26022] Updated weights on worker 0-0, policy_version 1172479 (0.00087) [2022-07-11 11:29:43,456][26022] Updated weights on worker 0-0, policy_version 1172489 (0.00087) [2022-07-11 11:29:43,958][25689] Fps is (10 sec: 5370.7, 60 sec: 5563.4, 300 sec: 5551.8). Total num frames: 1200630784. Throughput: 0: 5816.0. Samples: 1200638818. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:43,959][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 11:29:45,183][26022] Updated weights on worker 0-0, policy_version 1172499 (0.00083) [2022-07-11 11:29:47,024][26022] Updated weights on worker 0-0, policy_version 1172509 (0.00089) [2022-07-11 11:29:48,654][26022] Updated weights on worker 0-0, policy_version 1172519 (0.00081) [2022-07-11 11:29:49,096][25689] Fps is (10 sec: 5630.8, 60 sec: 5540.0, 300 sec: 5551.0). Total num frames: 1200660480. Throughput: 0: 4965.1. Samples: 1200655592. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:49,096][25689] Avg episode reward: [(0, '-0.301')] [2022-07-11 11:29:50,768][26022] Updated weights on worker 0-0, policy_version 1172529 (0.00088) [2022-07-11 11:29:52,475][26022] Updated weights on worker 0-0, policy_version 1172539 (0.00092) [2022-07-11 11:29:54,124][25689] Fps is (10 sec: 5640.9, 60 sec: 5561.1, 300 sec: 5550.7). Total num frames: 1200688128. Throughput: 0: 5808.7. Samples: 1200689350. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:54,125][25689] Avg episode reward: [(0, '0.295')] [2022-07-11 11:29:54,340][26022] Updated weights on worker 0-0, policy_version 1172549 (0.00090) [2022-07-11 11:29:56,174][26022] Updated weights on worker 0-0, policy_version 1172559 (0.00092) [2022-07-11 11:29:58,131][26022] Updated weights on worker 0-0, policy_version 1172569 (0.00086) [2022-07-11 11:29:59,163][25689] Fps is (10 sec: 5492.8, 60 sec: 5561.8, 300 sec: 5551.7). Total num frames: 1200715776. Throughput: 0: 5819.0. Samples: 1200723130. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:29:59,163][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 11:29:59,715][26022] Updated weights on worker 0-0, policy_version 1172579 (0.00077) [2022-07-11 11:30:01,868][26022] Updated weights on worker 0-0, policy_version 1172589 (0.00090) [2022-07-11 11:30:02,831][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:30:02,849][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001172593_1200735232.pth [2022-07-11 11:30:02,850][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001170639_1198734336.pth [2022-07-11 11:30:03,787][26022] Updated weights on worker 0-0, policy_version 1172599 (0.00075) [2022-07-11 11:30:04,164][25689] Fps is (10 sec: 5507.4, 60 sec: 5586.2, 300 sec: 5556.7). Total num frames: 1200743424. Throughput: 0: 4985.8. Samples: 1200739562. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:04,165][25689] Avg episode reward: [(0, '0.196')] [2022-07-11 11:30:05,785][26022] Updated weights on worker 0-0, policy_version 1172609 (0.00085) [2022-07-11 11:30:07,561][26022] Updated weights on worker 0-0, policy_version 1172619 (0.00083) [2022-07-11 11:30:09,191][26022] Updated weights on worker 0-0, policy_version 1172629 (0.00084) [2022-07-11 11:30:09,210][25689] Fps is (10 sec: 5605.2, 60 sec: 5576.5, 300 sec: 5560.0). Total num frames: 1200772096. Throughput: 0: 5775.0. Samples: 1200771754. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:09,211][25689] Avg episode reward: [(0, '-0.484')] [2022-07-11 11:30:11,504][26022] Updated weights on worker 0-0, policy_version 1172639 (0.00091) [2022-07-11 11:30:12,880][26022] Updated weights on worker 0-0, policy_version 1172649 (0.00089) [2022-07-11 11:30:14,221][25689] Fps is (10 sec: 5396.4, 60 sec: 5544.8, 300 sec: 5556.8). Total num frames: 1200797696. Throughput: 0: 5752.7. Samples: 1200804964. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:14,222][25689] Avg episode reward: [(0, '-0.447')] [2022-07-11 11:30:15,031][26022] Updated weights on worker 0-0, policy_version 1172659 (0.00058) [2022-07-11 11:30:16,897][26022] Updated weights on worker 0-0, policy_version 1172669 (0.00082) [2022-07-11 11:30:18,518][26022] Updated weights on worker 0-0, policy_version 1172679 (0.00087) [2022-07-11 11:30:19,251][25689] Fps is (10 sec: 5507.4, 60 sec: 5564.2, 300 sec: 5556.5). Total num frames: 1200827392. Throughput: 0: 4903.0. Samples: 1200821622. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:19,251][25689] Avg episode reward: [(0, '0.591')] [2022-07-11 11:30:20,558][26022] Updated weights on worker 0-0, policy_version 1172689 (0.00085) [2022-07-11 11:30:22,178][26022] Updated weights on worker 0-0, policy_version 1172699 (0.00087) [2022-07-11 11:30:24,027][26022] Updated weights on worker 0-0, policy_version 1172709 (0.00082) [2022-07-11 11:30:24,266][25689] Fps is (10 sec: 5709.0, 60 sec: 5552.0, 300 sec: 5557.2). Total num frames: 1200855040. Throughput: 0: 5751.6. Samples: 1200855178. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:24,267][25689] Avg episode reward: [(0, '0.721')] [2022-07-11 11:30:25,934][26022] Updated weights on worker 0-0, policy_version 1172719 (0.00087) [2022-07-11 11:30:27,769][26022] Updated weights on worker 0-0, policy_version 1172729 (0.00082) [2022-07-11 11:30:29,313][25689] Fps is (10 sec: 5495.1, 60 sec: 5569.0, 300 sec: 5556.7). Total num frames: 1200882688. Throughput: 0: 5810.4. Samples: 1200888560. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:29,314][25689] Avg episode reward: [(0, '0.114')] [2022-07-11 11:30:29,841][26022] Updated weights on worker 0-0, policy_version 1172739 (0.00091) [2022-07-11 11:30:31,305][26022] Updated weights on worker 0-0, policy_version 1172749 (0.00081) [2022-07-11 11:30:33,515][26022] Updated weights on worker 0-0, policy_version 1172759 (0.00089) [2022-07-11 11:30:34,391][25689] Fps is (10 sec: 5663.6, 60 sec: 5548.1, 300 sec: 5562.2). Total num frames: 1200912384. Throughput: 0: 4969.3. Samples: 1200905192. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:34,399][25689] Avg episode reward: [(0, '0.115')] [2022-07-11 11:30:34,939][26022] Updated weights on worker 0-0, policy_version 1172769 (0.00093) [2022-07-11 11:30:37,209][26022] Updated weights on worker 0-0, policy_version 1172779 (0.00087) [2022-07-11 11:30:38,999][26022] Updated weights on worker 0-0, policy_version 1172789 (0.00085) [2022-07-11 11:30:39,442][25689] Fps is (10 sec: 5560.5, 60 sec: 5548.8, 300 sec: 5554.7). Total num frames: 1200939008. Throughput: 0: 5797.0. Samples: 1200938672. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:39,444][25689] Avg episode reward: [(0, '0.904')] [2022-07-11 11:30:40,519][26022] Updated weights on worker 0-0, policy_version 1172799 (0.00087) [2022-07-11 11:30:42,572][26022] Updated weights on worker 0-0, policy_version 1172809 (0.00100) [2022-07-11 11:30:44,251][26022] Updated weights on worker 0-0, policy_version 1172819 (0.00111) [2022-07-11 11:30:44,487][25689] Fps is (10 sec: 5375.8, 60 sec: 5549.0, 300 sec: 5554.9). Total num frames: 1200966656. Throughput: 0: 5773.9. Samples: 1200971930. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:44,488][25689] Avg episode reward: [(0, '1.065')] [2022-07-11 11:30:46,253][26022] Updated weights on worker 0-0, policy_version 1172829 (0.00095) [2022-07-11 11:30:47,960][26022] Updated weights on worker 0-0, policy_version 1172839 (0.00097) [2022-07-11 11:30:49,639][25689] Fps is (10 sec: 5523.2, 60 sec: 5530.7, 300 sec: 5555.7). Total num frames: 1200995328. Throughput: 0: 4924.4. Samples: 1200988652. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:49,640][25689] Avg episode reward: [(0, '0.874')] [2022-07-11 11:30:50,034][26022] Updated weights on worker 0-0, policy_version 1172849 (0.00084) [2022-07-11 11:30:51,827][26022] Updated weights on worker 0-0, policy_version 1172859 (0.00089) [2022-07-11 11:30:53,559][26022] Updated weights on worker 0-0, policy_version 1172869 (0.00085) [2022-07-11 11:30:54,661][25689] Fps is (10 sec: 5636.3, 60 sec: 5548.2, 300 sec: 5552.2). Total num frames: 1201024000. Throughput: 0: 5771.9. Samples: 1201022186. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:54,661][25689] Avg episode reward: [(0, '0.713')] [2022-07-11 11:30:55,355][26022] Updated weights on worker 0-0, policy_version 1172879 (0.00083) [2022-07-11 11:30:57,327][26022] Updated weights on worker 0-0, policy_version 1172889 (0.00095) [2022-07-11 11:30:59,037][26022] Updated weights on worker 0-0, policy_version 1172899 (0.00085) [2022-07-11 11:30:59,685][25689] Fps is (10 sec: 5606.0, 60 sec: 5549.5, 300 sec: 5558.8). Total num frames: 1201051648. Throughput: 0: 5792.7. Samples: 1201055936. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:30:59,686][25689] Avg episode reward: [(0, '0.705')] [2022-07-11 11:31:00,961][26022] Updated weights on worker 0-0, policy_version 1172909 (0.00084) [2022-07-11 11:31:02,863][26022] Updated weights on worker 0-0, policy_version 1172919 (0.00089) [2022-07-11 11:31:04,703][25689] Fps is (10 sec: 5302.6, 60 sec: 5514.3, 300 sec: 5554.6). Total num frames: 1201077248. Throughput: 0: 4898.5. Samples: 1201070958. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:04,703][25689] Avg episode reward: [(0, '0.667')] [2022-07-11 11:31:05,004][26022] Updated weights on worker 0-0, policy_version 1172929 (0.00089) [2022-07-11 11:31:06,829][26022] Updated weights on worker 0-0, policy_version 1172939 (0.00083) [2022-07-11 11:31:08,441][26022] Updated weights on worker 0-0, policy_version 1172949 (0.00084) [2022-07-11 11:31:09,810][25689] Fps is (10 sec: 5360.3, 60 sec: 5508.7, 300 sec: 5551.0). Total num frames: 1201105920. Throughput: 0: 5756.2. Samples: 1201104762. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:09,811][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 11:31:10,255][26022] Updated weights on worker 0-0, policy_version 1172959 (0.00087) [2022-07-11 11:31:11,917][26022] Updated weights on worker 0-0, policy_version 1172969 (0.00091) [2022-07-11 11:31:14,128][26022] Updated weights on worker 0-0, policy_version 1172980 (0.00094) [2022-07-11 11:31:14,843][25689] Fps is (10 sec: 5756.1, 60 sec: 5574.3, 300 sec: 5561.0). Total num frames: 1201135616. Throughput: 0: 5751.5. Samples: 1201138262. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:14,843][25689] Avg episode reward: [(0, '0.273')] [2022-07-11 11:31:16,086][26022] Updated weights on worker 0-0, policy_version 1172990 (0.00093) [2022-07-11 11:31:17,572][26022] Updated weights on worker 0-0, policy_version 1173000 (0.00082) [2022-07-11 11:31:19,799][26022] Updated weights on worker 0-0, policy_version 1173010 (0.00080) [2022-07-11 11:31:19,859][25689] Fps is (10 sec: 5604.9, 60 sec: 5524.8, 300 sec: 5554.3). Total num frames: 1201162240. Throughput: 0: 5763.8. Samples: 1201172210. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:19,859][25689] Avg episode reward: [(0, '0.438')] [2022-07-11 11:31:21,320][26022] Updated weights on worker 0-0, policy_version 1173020 (0.00087) [2022-07-11 11:31:23,339][26022] Updated weights on worker 0-0, policy_version 1173030 (0.00085) [2022-07-11 11:31:24,864][25689] Fps is (10 sec: 5620.0, 60 sec: 5559.5, 300 sec: 5555.4). Total num frames: 1201191936. Throughput: 0: 5866.6. Samples: 1201189238. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:24,865][25689] Avg episode reward: [(0, '0.540')] [2022-07-11 11:31:24,979][26022] Updated weights on worker 0-0, policy_version 1173040 (0.00096) [2022-07-11 11:31:27,182][26022] Updated weights on worker 0-0, policy_version 1173050 (0.00084) [2022-07-11 11:31:28,664][26022] Updated weights on worker 0-0, policy_version 1173060 (0.00082) [2022-07-11 11:31:29,927][25689] Fps is (10 sec: 5695.5, 60 sec: 5558.1, 300 sec: 5554.9). Total num frames: 1201219584. Throughput: 0: 5858.3. Samples: 1201222612. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:29,927][25689] Avg episode reward: [(0, '0.188')] [2022-07-11 11:31:30,744][26022] Updated weights on worker 0-0, policy_version 1173070 (0.00093) [2022-07-11 11:31:32,346][26022] Updated weights on worker 0-0, policy_version 1173080 (0.00087) [2022-07-11 11:31:34,561][26022] Updated weights on worker 0-0, policy_version 1173090 (0.00084) [2022-07-11 11:31:35,002][25689] Fps is (10 sec: 5353.2, 60 sec: 5507.6, 300 sec: 5550.3). Total num frames: 1201246208. Throughput: 0: 5834.4. Samples: 1201255882. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:35,003][25689] Avg episode reward: [(0, '0.170')] [2022-07-11 11:31:35,882][26022] Updated weights on worker 0-0, policy_version 1173100 (0.00087) [2022-07-11 11:31:38,206][26022] Updated weights on worker 0-0, policy_version 1173110 (0.00088) [2022-07-11 11:31:39,935][26022] Updated weights on worker 0-0, policy_version 1173120 (0.00085) [2022-07-11 11:31:40,088][25689] Fps is (10 sec: 5441.7, 60 sec: 5538.2, 300 sec: 5552.2). Total num frames: 1201274880. Throughput: 0: 4963.9. Samples: 1201272628. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:40,089][25689] Avg episode reward: [(0, '0.232')] [2022-07-11 11:31:41,762][26022] Updated weights on worker 0-0, policy_version 1173130 (0.00094) [2022-07-11 11:31:43,597][26022] Updated weights on worker 0-0, policy_version 1173140 (0.00095) [2022-07-11 11:31:45,107][25689] Fps is (10 sec: 5776.1, 60 sec: 5574.3, 300 sec: 5563.6). Total num frames: 1201304576. Throughput: 0: 5791.2. Samples: 1201306472. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:45,108][25689] Avg episode reward: [(0, '-0.641')] [2022-07-11 11:31:45,347][26022] Updated weights on worker 0-0, policy_version 1173150 (0.00621) [2022-07-11 11:31:47,044][26022] Updated weights on worker 0-0, policy_version 1173160 (0.00093) [2022-07-11 11:31:49,093][26022] Updated weights on worker 0-0, policy_version 1173170 (0.00234) [2022-07-11 11:31:50,222][25689] Fps is (10 sec: 5759.7, 60 sec: 5577.8, 300 sec: 5562.5). Total num frames: 1201333248. Throughput: 0: 5792.7. Samples: 1201340176. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:50,223][25689] Avg episode reward: [(0, '-0.629')] [2022-07-11 11:31:50,707][26022] Updated weights on worker 0-0, policy_version 1173180 (0.00087) [2022-07-11 11:31:52,631][26022] Updated weights on worker 0-0, policy_version 1173190 (0.00082) [2022-07-11 11:31:54,500][26022] Updated weights on worker 0-0, policy_version 1173200 (0.00098) [2022-07-11 11:31:55,244][25689] Fps is (10 sec: 5555.9, 60 sec: 5560.8, 300 sec: 5555.3). Total num frames: 1201360896. Throughput: 0: 5003.1. Samples: 1201357156. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:31:55,245][25689] Avg episode reward: [(0, '-0.849')] [2022-07-11 11:31:56,290][26022] Updated weights on worker 0-0, policy_version 1173210 (0.00093) [2022-07-11 11:31:58,059][26022] Updated weights on worker 0-0, policy_version 1173220 (0.00094) [2022-07-11 11:32:00,075][26022] Updated weights on worker 0-0, policy_version 1173230 (0.00088) [2022-07-11 11:32:00,259][25689] Fps is (10 sec: 5509.3, 60 sec: 5561.8, 300 sec: 5558.6). Total num frames: 1201388544. Throughput: 0: 5849.9. Samples: 1201390624. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:00,260][25689] Avg episode reward: [(0, '-0.148')] [2022-07-11 11:32:01,543][26022] Updated weights on worker 0-0, policy_version 1173240 (0.00095) [2022-07-11 11:32:02,868][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:32:02,885][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001173244_1201401856.pth [2022-07-11 11:32:02,888][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001171291_1199401984.pth [2022-07-11 11:32:04,032][26022] Updated weights on worker 0-0, policy_version 1173250 (0.00098) [2022-07-11 11:32:05,272][25689] Fps is (10 sec: 5412.3, 60 sec: 5579.1, 300 sec: 5560.4). Total num frames: 1201415168. Throughput: 0: 5729.7. Samples: 1201422008. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:05,273][25689] Avg episode reward: [(0, '0.236')] [2022-07-11 11:32:05,698][26022] Updated weights on worker 0-0, policy_version 1173260 (0.00090) [2022-07-11 11:32:07,900][26022] Updated weights on worker 0-0, policy_version 1173270 (0.00092) [2022-07-11 11:32:09,332][26022] Updated weights on worker 0-0, policy_version 1173280 (0.00092) [2022-07-11 11:32:10,327][25689] Fps is (10 sec: 5390.6, 60 sec: 5567.0, 300 sec: 5559.5). Total num frames: 1201442816. Throughput: 0: 4899.2. Samples: 1201438672. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:10,331][25689] Avg episode reward: [(0, '-0.047')] [2022-07-11 11:32:11,514][26022] Updated weights on worker 0-0, policy_version 1173290 (0.00081) [2022-07-11 11:32:13,074][26022] Updated weights on worker 0-0, policy_version 1173300 (0.00081) [2022-07-11 11:32:15,215][26022] Updated weights on worker 0-0, policy_version 1173310 (0.00089) [2022-07-11 11:32:15,341][25689] Fps is (10 sec: 5389.7, 60 sec: 5517.9, 300 sec: 5552.4). Total num frames: 1201469440. Throughput: 0: 5715.2. Samples: 1201472014. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:15,343][25689] Avg episode reward: [(0, '0.951')] [2022-07-11 11:32:16,704][26022] Updated weights on worker 0-0, policy_version 1173320 (0.00086) [2022-07-11 11:32:18,895][26022] Updated weights on worker 0-0, policy_version 1173330 (0.00089) [2022-07-11 11:32:20,287][26022] Updated weights on worker 0-0, policy_version 1173340 (0.00084) [2022-07-11 11:32:20,382][25689] Fps is (10 sec: 5702.8, 60 sec: 5583.3, 300 sec: 5562.5). Total num frames: 1201500160. Throughput: 0: 5731.8. Samples: 1201505966. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:20,383][25689] Avg episode reward: [(0, '1.486')] [2022-07-11 11:32:22,381][26022] Updated weights on worker 0-0, policy_version 1173350 (0.00086) [2022-07-11 11:32:23,992][26022] Updated weights on worker 0-0, policy_version 1173360 (0.00090) [2022-07-11 11:32:25,424][25689] Fps is (10 sec: 5687.5, 60 sec: 5529.2, 300 sec: 5552.6). Total num frames: 1201526784. Throughput: 0: 5011.9. Samples: 1201523008. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:25,426][25689] Avg episode reward: [(0, '1.586')] [2022-07-11 11:32:25,974][26022] Updated weights on worker 0-0, policy_version 1173370 (0.00112) [2022-07-11 11:32:27,755][26022] Updated weights on worker 0-0, policy_version 1173380 (0.00086) [2022-07-11 11:32:29,548][26022] Updated weights on worker 0-0, policy_version 1173390 (0.00086) [2022-07-11 11:32:30,521][25689] Fps is (10 sec: 5554.9, 60 sec: 5559.9, 300 sec: 5559.3). Total num frames: 1201556480. Throughput: 0: 5834.6. Samples: 1201556498. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:30,522][25689] Avg episode reward: [(0, '1.835')] [2022-07-11 11:32:31,412][26022] Updated weights on worker 0-0, policy_version 1173400 (0.00091) [2022-07-11 11:32:32,997][26022] Updated weights on worker 0-0, policy_version 1173410 (0.00084) [2022-07-11 11:32:35,025][26022] Updated weights on worker 0-0, policy_version 1173420 (0.00080) [2022-07-11 11:32:35,530][25689] Fps is (10 sec: 5674.5, 60 sec: 5582.9, 300 sec: 5560.8). Total num frames: 1201584128. Throughput: 0: 5849.8. Samples: 1201590110. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 11:32:35,530][25689] Avg episode reward: [(0, '1.633')] [2022-07-11 11:32:36,898][26022] Updated weights on worker 0-0, policy_version 1173430 (0.00088) [2022-07-11 11:32:38,747][26022] Updated weights on worker 0-0, policy_version 1173440 (0.00086) [2022-07-11 11:32:40,537][25689] Fps is (10 sec: 5418.9, 60 sec: 5556.4, 300 sec: 5554.3). Total num frames: 1201610752. Throughput: 0: 5016.1. Samples: 1201607060. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:32:40,537][25689] Avg episode reward: [(0, '1.763')] [2022-07-11 11:32:40,643][26022] Updated weights on worker 0-0, policy_version 1173450 (0.00093) [2022-07-11 11:32:42,239][26022] Updated weights on worker 0-0, policy_version 1173460 (0.00087) [2022-07-11 11:32:44,337][26022] Updated weights on worker 0-0, policy_version 1173470 (0.00085) [2022-07-11 11:32:45,632][25689] Fps is (10 sec: 5575.1, 60 sec: 5549.4, 300 sec: 5553.4). Total num frames: 1201640448. Throughput: 0: 5818.3. Samples: 1201640582. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:32:45,632][25689] Avg episode reward: [(0, '1.481')] [2022-07-11 11:32:46,067][26022] Updated weights on worker 0-0, policy_version 1173480 (0.00090) [2022-07-11 11:32:47,924][26022] Updated weights on worker 0-0, policy_version 1173490 (0.00083) [2022-07-11 11:32:49,752][26022] Updated weights on worker 0-0, policy_version 1173500 (0.00089) [2022-07-11 11:32:50,702][25689] Fps is (10 sec: 5640.8, 60 sec: 5536.5, 300 sec: 5550.0). Total num frames: 1201668096. Throughput: 0: 5827.9. Samples: 1201674112. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:32:50,703][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 11:32:51,505][26022] Updated weights on worker 0-0, policy_version 1173510 (0.00085) [2022-07-11 11:32:53,285][26022] Updated weights on worker 0-0, policy_version 1173520 (0.00084) [2022-07-11 11:32:55,255][26022] Updated weights on worker 0-0, policy_version 1173530 (0.00097) [2022-07-11 11:32:55,714][25689] Fps is (10 sec: 5585.9, 60 sec: 5554.4, 300 sec: 5553.8). Total num frames: 1201696768. Throughput: 0: 4998.2. Samples: 1201690998. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:32:55,715][25689] Avg episode reward: [(0, '0.446')] [2022-07-11 11:32:57,020][26022] Updated weights on worker 0-0, policy_version 1173540 (0.00087) [2022-07-11 11:32:58,731][26022] Updated weights on worker 0-0, policy_version 1173550 (0.00091) [2022-07-11 11:33:00,742][25689] Fps is (10 sec: 5712.1, 60 sec: 5570.1, 300 sec: 5567.5). Total num frames: 1201725440. Throughput: 0: 5825.0. Samples: 1201724754. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:00,742][25689] Avg episode reward: [(0, '-1.343')] [2022-07-11 11:33:00,751][26022] Updated weights on worker 0-0, policy_version 1173560 (0.00089) [2022-07-11 11:33:02,707][26022] Updated weights on worker 0-0, policy_version 1173570 (0.00086) [2022-07-11 11:33:04,840][26022] Updated weights on worker 0-0, policy_version 1173580 (0.00092) [2022-07-11 11:33:05,759][25689] Fps is (10 sec: 5505.0, 60 sec: 5569.7, 300 sec: 5557.6). Total num frames: 1201752064. Throughput: 0: 5772.4. Samples: 1201756764. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:05,759][25689] Avg episode reward: [(0, '-2.129')] [2022-07-11 11:33:06,355][26022] Updated weights on worker 0-0, policy_version 1173590 (0.00097) [2022-07-11 11:33:08,277][26022] Updated weights on worker 0-0, policy_version 1173600 (0.00090) [2022-07-11 11:33:10,162][26022] Updated weights on worker 0-0, policy_version 1173610 (0.00365) [2022-07-11 11:33:10,814][25689] Fps is (10 sec: 5388.0, 60 sec: 5569.7, 300 sec: 5556.7). Total num frames: 1201779712. Throughput: 0: 4947.8. Samples: 1201773622. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:10,815][25689] Avg episode reward: [(0, '-1.743')] [2022-07-11 11:33:11,851][26022] Updated weights on worker 0-0, policy_version 1173620 (0.00085) [2022-07-11 11:33:13,864][26022] Updated weights on worker 0-0, policy_version 1173630 (0.00618) [2022-07-11 11:33:15,680][26022] Updated weights on worker 0-0, policy_version 1173640 (0.00090) [2022-07-11 11:33:15,818][25689] Fps is (10 sec: 5496.9, 60 sec: 5587.6, 300 sec: 5553.7). Total num frames: 1201807360. Throughput: 0: 5787.0. Samples: 1201807340. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:15,822][25689] Avg episode reward: [(0, '-1.741')] [2022-07-11 11:33:17,427][26022] Updated weights on worker 0-0, policy_version 1173650 (0.00096) [2022-07-11 11:33:19,535][26022] Updated weights on worker 0-0, policy_version 1173660 (0.00092) [2022-07-11 11:33:20,898][25689] Fps is (10 sec: 5687.0, 60 sec: 5567.1, 300 sec: 5562.6). Total num frames: 1201837056. Throughput: 0: 5740.1. Samples: 1201840452. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:20,898][25689] Avg episode reward: [(0, '-1.548')] [2022-07-11 11:33:20,921][26022] Updated weights on worker 0-0, policy_version 1173670 (0.00086) [2022-07-11 11:33:23,163][26022] Updated weights on worker 0-0, policy_version 1173680 (0.00107) [2022-07-11 11:33:25,211][26022] Updated weights on worker 0-0, policy_version 1173690 (0.00097) [2022-07-11 11:33:25,956][25689] Fps is (10 sec: 5454.8, 60 sec: 5548.7, 300 sec: 5553.7). Total num frames: 1201862656. Throughput: 0: 4976.4. Samples: 1201857270. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:25,956][25689] Avg episode reward: [(0, '-0.625')] [2022-07-11 11:33:26,599][26022] Updated weights on worker 0-0, policy_version 1173700 (0.00098) [2022-07-11 11:33:29,058][26022] Updated weights on worker 0-0, policy_version 1173710 (0.00089) [2022-07-11 11:33:30,181][26022] Updated weights on worker 0-0, policy_version 1173720 (0.00081) [2022-07-11 11:33:31,069][25689] Fps is (10 sec: 5436.3, 60 sec: 5547.2, 300 sec: 5552.0). Total num frames: 1201892352. Throughput: 0: 5772.6. Samples: 1201890546. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:31,070][25689] Avg episode reward: [(0, '1.187')] [2022-07-11 11:33:32,425][26022] Updated weights on worker 0-0, policy_version 1173730 (0.00096) [2022-07-11 11:33:34,227][26022] Updated weights on worker 0-0, policy_version 1173740 (0.00090) [2022-07-11 11:33:36,022][26022] Updated weights on worker 0-0, policy_version 1173750 (0.00081) [2022-07-11 11:33:36,118][25689] Fps is (10 sec: 5642.6, 60 sec: 5543.5, 300 sec: 5554.6). Total num frames: 1201920000. Throughput: 0: 5735.9. Samples: 1201923778. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:36,121][25689] Avg episode reward: [(0, '1.058')] [2022-07-11 11:33:37,829][26022] Updated weights on worker 0-0, policy_version 1173760 (0.00052) [2022-07-11 11:33:39,644][26022] Updated weights on worker 0-0, policy_version 1173770 (0.00085) [2022-07-11 11:33:41,217][25689] Fps is (10 sec: 5651.0, 60 sec: 5585.7, 300 sec: 5559.9). Total num frames: 1201949696. Throughput: 0: 5751.6. Samples: 1201957320. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:41,218][25689] Avg episode reward: [(0, '1.189')] [2022-07-11 11:33:41,381][26022] Updated weights on worker 0-0, policy_version 1173780 (0.00086) [2022-07-11 11:33:43,469][26022] Updated weights on worker 0-0, policy_version 1173790 (0.00085) [2022-07-11 11:33:45,200][26022] Updated weights on worker 0-0, policy_version 1173800 (0.00087) [2022-07-11 11:33:46,247][25689] Fps is (10 sec: 5560.7, 60 sec: 5541.1, 300 sec: 5546.8). Total num frames: 1201976320. Throughput: 0: 5750.1. Samples: 1201973946. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:46,247][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 11:33:47,052][26022] Updated weights on worker 0-0, policy_version 1173810 (0.00085) [2022-07-11 11:33:48,853][26022] Updated weights on worker 0-0, policy_version 1173820 (0.00090) [2022-07-11 11:33:50,557][26022] Updated weights on worker 0-0, policy_version 1173830 (0.00083) [2022-07-11 11:33:51,301][25689] Fps is (10 sec: 5382.0, 60 sec: 5542.6, 300 sec: 5550.6). Total num frames: 1202003968. Throughput: 0: 5769.7. Samples: 1202007276. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:51,301][25689] Avg episode reward: [(0, '1.405')] [2022-07-11 11:33:52,614][26022] Updated weights on worker 0-0, policy_version 1173840 (0.00087) [2022-07-11 11:33:54,630][26022] Updated weights on worker 0-0, policy_version 1173850 (0.00091) [2022-07-11 11:33:56,302][25689] Fps is (10 sec: 5600.9, 60 sec: 5543.6, 300 sec: 5554.9). Total num frames: 1202032640. Throughput: 0: 5779.8. Samples: 1202040438. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:33:56,303][25689] Avg episode reward: [(0, '1.561')] [2022-07-11 11:33:56,316][26022] Updated weights on worker 0-0, policy_version 1173860 (0.00088) [2022-07-11 11:33:58,155][26022] Updated weights on worker 0-0, policy_version 1173870 (0.00082) [2022-07-11 11:34:00,088][26022] Updated weights on worker 0-0, policy_version 1173880 (0.00095) [2022-07-11 11:34:01,324][25689] Fps is (10 sec: 5618.7, 60 sec: 5527.1, 300 sec: 5559.5). Total num frames: 1202060288. Throughput: 0: 4970.0. Samples: 1202057254. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:01,331][25689] Avg episode reward: [(0, '1.638')] [2022-07-11 11:34:02,097][26022] Updated weights on worker 0-0, policy_version 1173890 (0.00100) [2022-07-11 11:34:02,982][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:34:02,997][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001173895_1202068480.pth [2022-07-11 11:34:02,997][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001171941_1200067584.pth [2022-07-11 11:34:04,204][26022] Updated weights on worker 0-0, policy_version 1173900 (0.00078) [2022-07-11 11:34:05,621][26022] Updated weights on worker 0-0, policy_version 1173910 (0.00087) [2022-07-11 11:34:06,355][25689] Fps is (10 sec: 5297.0, 60 sec: 5509.0, 300 sec: 5547.5). Total num frames: 1202085888. Throughput: 0: 5711.7. Samples: 1202088798. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:06,355][25689] Avg episode reward: [(0, '0.859')] [2022-07-11 11:34:07,764][26022] Updated weights on worker 0-0, policy_version 1173920 (0.00080) [2022-07-11 11:34:09,430][26022] Updated weights on worker 0-0, policy_version 1173930 (0.00094) [2022-07-11 11:34:11,287][26022] Updated weights on worker 0-0, policy_version 1173940 (0.00087) [2022-07-11 11:34:11,500][25689] Fps is (10 sec: 5434.3, 60 sec: 5534.6, 300 sec: 5552.3). Total num frames: 1202115584. Throughput: 0: 5695.1. Samples: 1202122312. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:11,500][25689] Avg episode reward: [(0, '0.636')] [2022-07-11 11:34:13,405][26022] Updated weights on worker 0-0, policy_version 1173950 (0.00087) [2022-07-11 11:34:14,723][26022] Updated weights on worker 0-0, policy_version 1173960 (0.00095) [2022-07-11 11:34:16,539][25689] Fps is (10 sec: 5429.7, 60 sec: 5497.7, 300 sec: 5542.3). Total num frames: 1202141184. Throughput: 0: 4883.9. Samples: 1202139272. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:16,540][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 11:34:16,991][26022] Updated weights on worker 0-0, policy_version 1173970 (0.00093) [2022-07-11 11:34:18,633][26022] Updated weights on worker 0-0, policy_version 1173980 (0.00083) [2022-07-11 11:34:20,423][26022] Updated weights on worker 0-0, policy_version 1173990 (0.00087) [2022-07-11 11:34:21,547][25689] Fps is (10 sec: 5605.5, 60 sec: 5521.0, 300 sec: 5550.3). Total num frames: 1202171904. Throughput: 0: 5727.1. Samples: 1202173072. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:21,548][25689] Avg episode reward: [(0, '0.477')] [2022-07-11 11:34:22,283][26022] Updated weights on worker 0-0, policy_version 1174000 (0.00086) [2022-07-11 11:34:24,043][26022] Updated weights on worker 0-0, policy_version 1174010 (0.00089) [2022-07-11 11:34:25,907][26022] Updated weights on worker 0-0, policy_version 1174020 (0.00088) [2022-07-11 11:34:26,550][25689] Fps is (10 sec: 5932.7, 60 sec: 5576.8, 300 sec: 5558.0). Total num frames: 1202200576. Throughput: 0: 5844.7. Samples: 1202206832. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:26,552][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 11:34:27,857][26022] Updated weights on worker 0-0, policy_version 1174030 (0.00097) [2022-07-11 11:34:29,601][26022] Updated weights on worker 0-0, policy_version 1174040 (0.00088) [2022-07-11 11:34:31,611][25689] Fps is (10 sec: 5494.4, 60 sec: 5530.8, 300 sec: 5543.7). Total num frames: 1202227200. Throughput: 0: 5031.2. Samples: 1202223496. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:31,612][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 11:34:31,617][26022] Updated weights on worker 0-0, policy_version 1174050 (0.00085) [2022-07-11 11:34:33,226][26022] Updated weights on worker 0-0, policy_version 1174060 (0.00092) [2022-07-11 11:34:35,161][26022] Updated weights on worker 0-0, policy_version 1174070 (0.00091) [2022-07-11 11:34:36,621][25689] Fps is (10 sec: 5490.3, 60 sec: 5551.3, 300 sec: 5551.5). Total num frames: 1202255872. Throughput: 0: 5856.7. Samples: 1202256888. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:36,622][25689] Avg episode reward: [(0, '0.856')] [2022-07-11 11:34:36,973][26022] Updated weights on worker 0-0, policy_version 1174080 (0.00085) [2022-07-11 11:34:38,886][26022] Updated weights on worker 0-0, policy_version 1174090 (0.00083) [2022-07-11 11:34:40,594][26022] Updated weights on worker 0-0, policy_version 1174100 (0.00082) [2022-07-11 11:34:41,650][25689] Fps is (10 sec: 5609.9, 60 sec: 5523.8, 300 sec: 5551.8). Total num frames: 1202283520. Throughput: 0: 5843.1. Samples: 1202290538. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:41,651][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 11:34:42,340][26022] Updated weights on worker 0-0, policy_version 1174110 (0.00077) [2022-07-11 11:34:44,211][26022] Updated weights on worker 0-0, policy_version 1174120 (0.00085) [2022-07-11 11:34:46,130][26022] Updated weights on worker 0-0, policy_version 1174130 (0.00095) [2022-07-11 11:34:46,705][25689] Fps is (10 sec: 5686.4, 60 sec: 5572.3, 300 sec: 5553.4). Total num frames: 1202313216. Throughput: 0: 4986.8. Samples: 1202307346. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:46,706][25689] Avg episode reward: [(0, '0.790')] [2022-07-11 11:34:48,162][26022] Updated weights on worker 0-0, policy_version 1174140 (0.00080) [2022-07-11 11:34:49,690][26022] Updated weights on worker 0-0, policy_version 1174150 (0.00097) [2022-07-11 11:34:51,775][25689] Fps is (10 sec: 5461.4, 60 sec: 5537.0, 300 sec: 5545.7). Total num frames: 1202338816. Throughput: 0: 5825.0. Samples: 1202340952. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:51,776][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 11:34:51,884][26022] Updated weights on worker 0-0, policy_version 1174160 (0.00054) [2022-07-11 11:34:53,465][26022] Updated weights on worker 0-0, policy_version 1174170 (0.00091) [2022-07-11 11:34:55,334][26022] Updated weights on worker 0-0, policy_version 1174180 (0.00091) [2022-07-11 11:34:56,842][25689] Fps is (10 sec: 5455.3, 60 sec: 5548.0, 300 sec: 5552.1). Total num frames: 1202368512. Throughput: 0: 5830.9. Samples: 1202374792. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:34:56,842][25689] Avg episode reward: [(0, '0.606')] [2022-07-11 11:34:57,047][26022] Updated weights on worker 0-0, policy_version 1174190 (0.00085) [2022-07-11 11:34:58,763][26022] Updated weights on worker 0-0, policy_version 1174200 (0.00090) [2022-07-11 11:35:00,826][26022] Updated weights on worker 0-0, policy_version 1174210 (0.00084) [2022-07-11 11:35:01,876][25689] Fps is (10 sec: 5879.8, 60 sec: 5580.7, 300 sec: 5558.3). Total num frames: 1202398208. Throughput: 0: 4992.8. Samples: 1202391522. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:01,877][25689] Avg episode reward: [(0, '0.521')] [2022-07-11 11:35:02,873][26022] Updated weights on worker 0-0, policy_version 1174220 (0.00079) [2022-07-11 11:35:04,779][26022] Updated weights on worker 0-0, policy_version 1174230 (0.00083) [2022-07-11 11:35:06,822][26022] Updated weights on worker 0-0, policy_version 1174240 (0.00096) [2022-07-11 11:35:06,903][25689] Fps is (10 sec: 5292.4, 60 sec: 5547.2, 300 sec: 5541.5). Total num frames: 1202421760. Throughput: 0: 5737.9. Samples: 1202423236. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:06,903][25689] Avg episode reward: [(0, '0.586')] [2022-07-11 11:35:08,263][26022] Updated weights on worker 0-0, policy_version 1174250 (0.00085) [2022-07-11 11:35:10,212][26022] Updated weights on worker 0-0, policy_version 1174260 (0.00085) [2022-07-11 11:35:11,951][25689] Fps is (10 sec: 5386.8, 60 sec: 5573.0, 300 sec: 5558.0). Total num frames: 1202452480. Throughput: 0: 5742.6. Samples: 1202456814. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:11,952][25689] Avg episode reward: [(0, '1.188')] [2022-07-11 11:35:11,954][26022] Updated weights on worker 0-0, policy_version 1174270 (0.00079) [2022-07-11 11:35:13,863][26022] Updated weights on worker 0-0, policy_version 1174280 (0.00085) [2022-07-11 11:35:15,822][26022] Updated weights on worker 0-0, policy_version 1174290 (0.00087) [2022-07-11 11:35:17,030][25689] Fps is (10 sec: 5763.4, 60 sec: 5603.2, 300 sec: 5550.2). Total num frames: 1202480128. Throughput: 0: 4906.9. Samples: 1202473856. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:17,031][25689] Avg episode reward: [(0, '0.655')] [2022-07-11 11:35:17,352][26022] Updated weights on worker 0-0, policy_version 1174300 (0.00102) [2022-07-11 11:35:19,333][26022] Updated weights on worker 0-0, policy_version 1174310 (0.00091) [2022-07-11 11:35:21,180][26022] Updated weights on worker 0-0, policy_version 1174320 (0.00091) [2022-07-11 11:35:22,044][25689] Fps is (10 sec: 5377.5, 60 sec: 5535.0, 300 sec: 5546.8). Total num frames: 1202506752. Throughput: 0: 5752.3. Samples: 1202507530. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:22,044][25689] Avg episode reward: [(0, '0.791')] [2022-07-11 11:35:23,041][26022] Updated weights on worker 0-0, policy_version 1174330 (0.00079) [2022-07-11 11:35:24,874][26022] Updated weights on worker 0-0, policy_version 1174340 (0.00092) [2022-07-11 11:35:26,539][26022] Updated weights on worker 0-0, policy_version 1174350 (0.00091) [2022-07-11 11:35:27,045][25689] Fps is (10 sec: 5725.7, 60 sec: 5568.9, 300 sec: 5558.0). Total num frames: 1202537472. Throughput: 0: 5859.4. Samples: 1202541260. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:27,046][25689] Avg episode reward: [(0, '0.096')] [2022-07-11 11:35:28,397][26022] Updated weights on worker 0-0, policy_version 1174360 (0.00090) [2022-07-11 11:35:30,296][26022] Updated weights on worker 0-0, policy_version 1174370 (0.00082) [2022-07-11 11:35:32,108][25689] Fps is (10 sec: 5595.8, 60 sec: 5551.8, 300 sec: 5544.5). Total num frames: 1202563072. Throughput: 0: 5007.3. Samples: 1202557746. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:32,109][25689] Avg episode reward: [(0, '0.097')] [2022-07-11 11:35:32,427][26022] Updated weights on worker 0-0, policy_version 1174380 (0.00095) [2022-07-11 11:35:34,055][26022] Updated weights on worker 0-0, policy_version 1174390 (0.00087) [2022-07-11 11:35:36,038][26022] Updated weights on worker 0-0, policy_version 1174400 (0.00090) [2022-07-11 11:35:37,210][25689] Fps is (10 sec: 5339.5, 60 sec: 5543.5, 300 sec: 5550.4). Total num frames: 1202591744. Throughput: 0: 5811.7. Samples: 1202591134. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:37,210][25689] Avg episode reward: [(0, '0.094')] [2022-07-11 11:35:37,604][26022] Updated weights on worker 0-0, policy_version 1174410 (0.00087) [2022-07-11 11:35:39,693][26022] Updated weights on worker 0-0, policy_version 1174420 (0.00089) [2022-07-11 11:35:41,345][26022] Updated weights on worker 0-0, policy_version 1174430 (0.00094) [2022-07-11 11:35:42,284][25689] Fps is (10 sec: 5635.3, 60 sec: 5556.2, 300 sec: 5553.3). Total num frames: 1202620416. Throughput: 0: 5777.1. Samples: 1202624462. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:42,285][25689] Avg episode reward: [(0, '-0.708')] [2022-07-11 11:35:43,407][26022] Updated weights on worker 0-0, policy_version 1174440 (0.00092) [2022-07-11 11:35:44,974][26022] Updated weights on worker 0-0, policy_version 1174450 (0.00086) [2022-07-11 11:35:47,054][26022] Updated weights on worker 0-0, policy_version 1174460 (0.00094) [2022-07-11 11:35:47,331][25689] Fps is (10 sec: 5564.3, 60 sec: 5523.2, 300 sec: 5551.8). Total num frames: 1202648064. Throughput: 0: 5759.4. Samples: 1202658094. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:47,332][25689] Avg episode reward: [(0, '-0.531')] [2022-07-11 11:35:48,770][26022] Updated weights on worker 0-0, policy_version 1174470 (0.00114) [2022-07-11 11:35:50,879][26022] Updated weights on worker 0-0, policy_version 1174480 (0.00094) [2022-07-11 11:35:52,379][25689] Fps is (10 sec: 5579.4, 60 sec: 5575.9, 300 sec: 5551.3). Total num frames: 1202676736. Throughput: 0: 5748.2. Samples: 1202674264. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:52,379][25689] Avg episode reward: [(0, '-1.571')] [2022-07-11 11:35:52,555][26022] Updated weights on worker 0-0, policy_version 1174490 (0.00091) [2022-07-11 11:35:54,462][26022] Updated weights on worker 0-0, policy_version 1174500 (0.00089) [2022-07-11 11:35:56,163][26022] Updated weights on worker 0-0, policy_version 1174510 (0.00097) [2022-07-11 11:35:57,388][25689] Fps is (10 sec: 5600.4, 60 sec: 5547.4, 300 sec: 5551.6). Total num frames: 1202704384. Throughput: 0: 5776.2. Samples: 1202707686. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:35:57,388][25689] Avg episode reward: [(0, '-1.082')] [2022-07-11 11:35:57,954][26022] Updated weights on worker 0-0, policy_version 1174520 (0.00088) [2022-07-11 11:35:59,881][26022] Updated weights on worker 0-0, policy_version 1174530 (0.00086) [2022-07-11 11:36:01,858][26022] Updated weights on worker 0-0, policy_version 1174540 (0.00087) [2022-07-11 11:36:02,403][25689] Fps is (10 sec: 5312.1, 60 sec: 5481.5, 300 sec: 5551.7). Total num frames: 1202729984. Throughput: 0: 5711.7. Samples: 1202739372. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:02,403][25689] Avg episode reward: [(0, '-0.834')] [2022-07-11 11:36:03,144][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:36:03,161][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001174545_1202734080.pth [2022-07-11 11:36:03,164][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001172593_1200735232.pth [2022-07-11 11:36:04,072][26022] Updated weights on worker 0-0, policy_version 1174550 (0.00256) [2022-07-11 11:36:05,647][26022] Updated weights on worker 0-0, policy_version 1174560 (0.00091) [2022-07-11 11:36:07,427][25689] Fps is (10 sec: 5202.2, 60 sec: 5532.5, 300 sec: 5546.4). Total num frames: 1202756608. Throughput: 0: 4860.1. Samples: 1202755760. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:07,427][25689] Avg episode reward: [(0, '-0.249')] [2022-07-11 11:36:07,703][26022] Updated weights on worker 0-0, policy_version 1174570 (0.00084) [2022-07-11 11:36:09,301][26022] Updated weights on worker 0-0, policy_version 1174580 (0.00091) [2022-07-11 11:36:11,411][26022] Updated weights on worker 0-0, policy_version 1174590 (0.00089) [2022-07-11 11:36:12,505][25689] Fps is (10 sec: 5676.1, 60 sec: 5529.7, 300 sec: 5548.9). Total num frames: 1202787328. Throughput: 0: 5712.1. Samples: 1202789230. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:12,506][25689] Avg episode reward: [(0, '-0.145')] [2022-07-11 11:36:13,040][26022] Updated weights on worker 0-0, policy_version 1174600 (0.00086) [2022-07-11 11:36:14,856][26022] Updated weights on worker 0-0, policy_version 1174610 (0.00092) [2022-07-11 11:36:16,901][26022] Updated weights on worker 0-0, policy_version 1174620 (0.00467) [2022-07-11 11:36:17,533][25689] Fps is (10 sec: 5572.5, 60 sec: 5500.5, 300 sec: 5545.3). Total num frames: 1202812928. Throughput: 0: 5721.3. Samples: 1202822946. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:17,534][25689] Avg episode reward: [(0, '1.271')] [2022-07-11 11:36:18,599][26022] Updated weights on worker 0-0, policy_version 1174630 (0.00101) [2022-07-11 11:36:20,746][26022] Updated weights on worker 0-0, policy_version 1174640 (0.00085) [2022-07-11 11:36:22,219][26022] Updated weights on worker 0-0, policy_version 1174650 (0.00084) [2022-07-11 11:36:22,540][25689] Fps is (10 sec: 5510.8, 60 sec: 5552.0, 300 sec: 5545.3). Total num frames: 1202842624. Throughput: 0: 4967.8. Samples: 1202839408. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:22,540][25689] Avg episode reward: [(0, '1.346')] [2022-07-11 11:36:24,162][26022] Updated weights on worker 0-0, policy_version 1174660 (0.00093) [2022-07-11 11:36:25,995][26022] Updated weights on worker 0-0, policy_version 1174670 (0.00113) [2022-07-11 11:36:27,567][25689] Fps is (10 sec: 5715.4, 60 sec: 5498.9, 300 sec: 5545.9). Total num frames: 1202870272. Throughput: 0: 5820.3. Samples: 1202872980. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:27,568][25689] Avg episode reward: [(0, '1.372')] [2022-07-11 11:36:27,699][26022] Updated weights on worker 0-0, policy_version 1174680 (0.00083) [2022-07-11 11:36:29,793][26022] Updated weights on worker 0-0, policy_version 1174690 (0.00092) [2022-07-11 11:36:31,276][26022] Updated weights on worker 0-0, policy_version 1174700 (0.00088) [2022-07-11 11:36:32,641][25689] Fps is (10 sec: 5474.0, 60 sec: 5531.7, 300 sec: 5549.4). Total num frames: 1202897920. Throughput: 0: 5818.0. Samples: 1202906380. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:32,642][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 11:36:33,259][26022] Updated weights on worker 0-0, policy_version 1174710 (0.00085) [2022-07-11 11:36:35,093][26022] Updated weights on worker 0-0, policy_version 1174720 (0.00095) [2022-07-11 11:36:36,816][26022] Updated weights on worker 0-0, policy_version 1174730 (0.00087) [2022-07-11 11:36:37,646][25689] Fps is (10 sec: 5486.3, 60 sec: 5523.6, 300 sec: 5547.5). Total num frames: 1202925568. Throughput: 0: 4989.2. Samples: 1202923290. Policy #0 lag: (min: 1.0, avg: 8.4, max: 18.0) [2022-07-11 11:36:37,646][25689] Avg episode reward: [(0, '0.812')] [2022-07-11 11:36:38,740][26022] Updated weights on worker 0-0, policy_version 1174740 (0.00089) [2022-07-11 11:36:40,646][26022] Updated weights on worker 0-0, policy_version 1174750 (0.00086) [2022-07-11 11:36:42,390][26022] Updated weights on worker 0-0, policy_version 1174760 (0.00091) [2022-07-11 11:36:42,652][25689] Fps is (10 sec: 5728.0, 60 sec: 5546.8, 300 sec: 5547.7). Total num frames: 1202955264. Throughput: 0: 5856.5. Samples: 1202957198. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:36:42,654][25689] Avg episode reward: [(0, '0.766')] [2022-07-11 11:36:44,443][26022] Updated weights on worker 0-0, policy_version 1174770 (0.00094) [2022-07-11 11:36:46,149][26022] Updated weights on worker 0-0, policy_version 1174780 (0.00088) [2022-07-11 11:36:47,656][25689] Fps is (10 sec: 5728.3, 60 sec: 5550.7, 300 sec: 5546.3). Total num frames: 1202982912. Throughput: 0: 5871.9. Samples: 1202990944. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:36:47,657][25689] Avg episode reward: [(0, '0.453')] [2022-07-11 11:36:47,978][26022] Updated weights on worker 0-0, policy_version 1174790 (0.00079) [2022-07-11 11:36:49,911][26022] Updated weights on worker 0-0, policy_version 1174800 (0.00101) [2022-07-11 11:36:51,555][26022] Updated weights on worker 0-0, policy_version 1174810 (0.00083) [2022-07-11 11:36:52,693][25689] Fps is (10 sec: 5609.4, 60 sec: 5551.8, 300 sec: 5549.5). Total num frames: 1203011584. Throughput: 0: 5049.9. Samples: 1203007638. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:36:52,693][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 11:36:53,446][26022] Updated weights on worker 0-0, policy_version 1174820 (0.00094) [2022-07-11 11:36:55,515][26022] Updated weights on worker 0-0, policy_version 1174830 (0.00085) [2022-07-11 11:36:56,839][26022] Updated weights on worker 0-0, policy_version 1174840 (0.00081) [2022-07-11 11:36:57,707][25689] Fps is (10 sec: 5603.8, 60 sec: 5551.3, 300 sec: 5549.5). Total num frames: 1203039232. Throughput: 0: 5878.8. Samples: 1203041224. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:36:57,707][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 11:36:59,034][26022] Updated weights on worker 0-0, policy_version 1174850 (0.00090) [2022-07-11 11:37:00,554][26022] Updated weights on worker 0-0, policy_version 1174860 (0.00343) [2022-07-11 11:37:02,725][25689] Fps is (10 sec: 5307.6, 60 sec: 5551.0, 300 sec: 5546.0). Total num frames: 1203064832. Throughput: 0: 5753.5. Samples: 1203072686. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:02,725][25689] Avg episode reward: [(0, '0.464')] [2022-07-11 11:37:03,064][26022] Updated weights on worker 0-0, policy_version 1174870 (0.00087) [2022-07-11 11:37:04,776][26022] Updated weights on worker 0-0, policy_version 1174880 (0.00087) [2022-07-11 11:37:06,507][26022] Updated weights on worker 0-0, policy_version 1174890 (0.00662) [2022-07-11 11:37:07,735][25689] Fps is (10 sec: 5411.7, 60 sec: 5586.2, 300 sec: 5550.2). Total num frames: 1203093504. Throughput: 0: 4913.2. Samples: 1203089596. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:07,736][25689] Avg episode reward: [(0, '0.552')] [2022-07-11 11:37:08,504][26022] Updated weights on worker 0-0, policy_version 1174900 (0.00092) [2022-07-11 11:37:10,282][26022] Updated weights on worker 0-0, policy_version 1174910 (0.00085) [2022-07-11 11:37:12,199][26022] Updated weights on worker 0-0, policy_version 1174920 (0.00084) [2022-07-11 11:37:12,818][25689] Fps is (10 sec: 5580.2, 60 sec: 5534.9, 300 sec: 5552.4). Total num frames: 1203121152. Throughput: 0: 5746.9. Samples: 1203123296. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:12,818][25689] Avg episode reward: [(0, '-0.393')] [2022-07-11 11:37:13,831][26022] Updated weights on worker 0-0, policy_version 1174930 (0.00087) [2022-07-11 11:37:15,699][26022] Updated weights on worker 0-0, policy_version 1174940 (0.00088) [2022-07-11 11:37:17,511][26022] Updated weights on worker 0-0, policy_version 1174950 (0.00085) [2022-07-11 11:37:17,880][25689] Fps is (10 sec: 5652.9, 60 sec: 5599.7, 300 sec: 5548.6). Total num frames: 1203150848. Throughput: 0: 5740.3. Samples: 1203157022. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:17,880][25689] Avg episode reward: [(0, '-0.184')] [2022-07-11 11:37:19,388][26022] Updated weights on worker 0-0, policy_version 1174960 (0.00098) [2022-07-11 11:37:21,243][26022] Updated weights on worker 0-0, policy_version 1174970 (0.00086) [2022-07-11 11:37:22,952][25689] Fps is (10 sec: 5658.6, 60 sec: 5559.7, 300 sec: 5551.5). Total num frames: 1203178496. Throughput: 0: 4986.5. Samples: 1203173548. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:22,953][25689] Avg episode reward: [(0, '-0.021')] [2022-07-11 11:37:22,961][26022] Updated weights on worker 0-0, policy_version 1174980 (0.00087) [2022-07-11 11:37:24,990][26022] Updated weights on worker 0-0, policy_version 1174990 (0.00100) [2022-07-11 11:37:26,702][26022] Updated weights on worker 0-0, policy_version 1175000 (0.00089) [2022-07-11 11:37:28,028][25689] Fps is (10 sec: 5448.7, 60 sec: 5555.1, 300 sec: 5545.0). Total num frames: 1203206144. Throughput: 0: 5803.9. Samples: 1203207374. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:28,029][25689] Avg episode reward: [(0, '-0.614')] [2022-07-11 11:37:28,567][26022] Updated weights on worker 0-0, policy_version 1175010 (0.00086) [2022-07-11 11:37:30,352][26022] Updated weights on worker 0-0, policy_version 1175020 (0.00085) [2022-07-11 11:37:32,169][26022] Updated weights on worker 0-0, policy_version 1175030 (0.00048) [2022-07-11 11:37:33,090][25689] Fps is (10 sec: 5757.1, 60 sec: 5607.1, 300 sec: 5554.3). Total num frames: 1203236864. Throughput: 0: 5813.8. Samples: 1203241156. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:33,091][25689] Avg episode reward: [(0, '-0.476')] [2022-07-11 11:37:34,222][26022] Updated weights on worker 0-0, policy_version 1175040 (0.00086) [2022-07-11 11:37:35,848][26022] Updated weights on worker 0-0, policy_version 1175050 (0.00092) [2022-07-11 11:37:37,735][26022] Updated weights on worker 0-0, policy_version 1175060 (0.00082) [2022-07-11 11:37:38,168][25689] Fps is (10 sec: 5655.4, 60 sec: 5583.4, 300 sec: 5553.0). Total num frames: 1203263488. Throughput: 0: 4985.6. Samples: 1203258170. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:38,168][25689] Avg episode reward: [(0, '0.376')] [2022-07-11 11:37:39,568][26022] Updated weights on worker 0-0, policy_version 1175070 (0.00091) [2022-07-11 11:37:41,164][26022] Updated weights on worker 0-0, policy_version 1175080 (0.00090) [2022-07-11 11:37:43,182][25689] Fps is (10 sec: 5378.1, 60 sec: 5548.9, 300 sec: 5547.6). Total num frames: 1203291136. Throughput: 0: 5841.8. Samples: 1203291726. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:43,182][25689] Avg episode reward: [(0, '0.929')] [2022-07-11 11:37:43,210][26022] Updated weights on worker 0-0, policy_version 1175090 (0.00090) [2022-07-11 11:37:44,800][26022] Updated weights on worker 0-0, policy_version 1175100 (0.00087) [2022-07-11 11:37:46,864][26022] Updated weights on worker 0-0, policy_version 1175110 (0.00083) [2022-07-11 11:37:48,206][25689] Fps is (10 sec: 5712.5, 60 sec: 5580.9, 300 sec: 5555.3). Total num frames: 1203320832. Throughput: 0: 5867.7. Samples: 1203325772. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:48,207][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 11:37:48,530][26022] Updated weights on worker 0-0, policy_version 1175120 (0.00089) [2022-07-11 11:37:50,523][26022] Updated weights on worker 0-0, policy_version 1175130 (0.00077) [2022-07-11 11:37:52,324][26022] Updated weights on worker 0-0, policy_version 1175140 (0.00058) [2022-07-11 11:37:53,287][25689] Fps is (10 sec: 5776.2, 60 sec: 5576.8, 300 sec: 5554.1). Total num frames: 1203349504. Throughput: 0: 5027.6. Samples: 1203342696. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:53,287][25689] Avg episode reward: [(0, '1.204')] [2022-07-11 11:37:53,919][26022] Updated weights on worker 0-0, policy_version 1175150 (0.00087) [2022-07-11 11:37:56,028][26022] Updated weights on worker 0-0, policy_version 1175160 (0.00092) [2022-07-11 11:37:57,803][26022] Updated weights on worker 0-0, policy_version 1175170 (0.00081) [2022-07-11 11:37:58,298][25689] Fps is (10 sec: 5580.6, 60 sec: 5577.0, 300 sec: 5550.9). Total num frames: 1203377152. Throughput: 0: 5875.8. Samples: 1203376452. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:37:58,299][25689] Avg episode reward: [(0, '-0.394')] [2022-07-11 11:37:59,477][26022] Updated weights on worker 0-0, policy_version 1175180 (0.00085) [2022-07-11 11:38:01,496][26022] Updated weights on worker 0-0, policy_version 1175190 (0.00093) [2022-07-11 11:38:03,217][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:38:03,232][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001175200_1203404800.pth [2022-07-11 11:38:03,232][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001173244_1201401856.pth [2022-07-11 11:38:03,235][26022] Updated weights on worker 0-0, policy_version 1175200 (0.00083) [2022-07-11 11:38:03,333][25689] Fps is (10 sec: 5504.1, 60 sec: 5609.3, 300 sec: 5554.0). Total num frames: 1203404800. Throughput: 0: 5792.4. Samples: 1203408450. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:03,333][25689] Avg episode reward: [(0, '-0.578')] [2022-07-11 11:38:05,430][26022] Updated weights on worker 0-0, policy_version 1175210 (0.00619) [2022-07-11 11:38:07,195][26022] Updated weights on worker 0-0, policy_version 1175220 (0.00084) [2022-07-11 11:38:08,339][25689] Fps is (10 sec: 5405.3, 60 sec: 5575.9, 300 sec: 5551.5). Total num frames: 1203431424. Throughput: 0: 4928.0. Samples: 1203424986. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:08,339][25689] Avg episode reward: [(0, '-0.953')] [2022-07-11 11:38:08,949][26022] Updated weights on worker 0-0, policy_version 1175230 (0.00087) [2022-07-11 11:38:10,804][26022] Updated weights on worker 0-0, policy_version 1175240 (0.00084) [2022-07-11 11:38:12,447][26022] Updated weights on worker 0-0, policy_version 1175250 (0.00087) [2022-07-11 11:38:13,420][25689] Fps is (10 sec: 5481.7, 60 sec: 5592.9, 300 sec: 5553.5). Total num frames: 1203460096. Throughput: 0: 5784.3. Samples: 1203459154. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:13,421][25689] Avg episode reward: [(0, '-0.932')] [2022-07-11 11:38:14,358][26022] Updated weights on worker 0-0, policy_version 1175260 (0.00050) [2022-07-11 11:38:16,073][26022] Updated weights on worker 0-0, policy_version 1175270 (0.00084) [2022-07-11 11:38:18,037][26022] Updated weights on worker 0-0, policy_version 1175280 (0.00099) [2022-07-11 11:38:18,435][25689] Fps is (10 sec: 5679.9, 60 sec: 5580.4, 300 sec: 5551.3). Total num frames: 1203488768. Throughput: 0: 5776.3. Samples: 1203492766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:18,436][25689] Avg episode reward: [(0, '-0.834')] [2022-07-11 11:38:19,929][26022] Updated weights on worker 0-0, policy_version 1175290 (0.00403) [2022-07-11 11:38:21,671][26022] Updated weights on worker 0-0, policy_version 1175300 (0.00055) [2022-07-11 11:38:23,446][25689] Fps is (10 sec: 5515.3, 60 sec: 5569.1, 300 sec: 5555.6). Total num frames: 1203515392. Throughput: 0: 5013.0. Samples: 1203509278. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:23,446][25689] Avg episode reward: [(0, '-0.706')] [2022-07-11 11:38:23,577][26022] Updated weights on worker 0-0, policy_version 1175310 (0.00728) [2022-07-11 11:38:25,279][26022] Updated weights on worker 0-0, policy_version 1175320 (0.00089) [2022-07-11 11:38:27,364][26022] Updated weights on worker 0-0, policy_version 1175330 (0.00085) [2022-07-11 11:38:28,469][25689] Fps is (10 sec: 5510.4, 60 sec: 5590.9, 300 sec: 5553.8). Total num frames: 1203544064. Throughput: 0: 5842.9. Samples: 1203542608. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:28,471][25689] Avg episode reward: [(0, '-0.174')] [2022-07-11 11:38:28,989][26022] Updated weights on worker 0-0, policy_version 1175340 (0.00081) [2022-07-11 11:38:31,211][26022] Updated weights on worker 0-0, policy_version 1175350 (0.00091) [2022-07-11 11:38:32,831][26022] Updated weights on worker 0-0, policy_version 1175360 (0.00089) [2022-07-11 11:38:33,568][25689] Fps is (10 sec: 5665.1, 60 sec: 5553.6, 300 sec: 5556.3). Total num frames: 1203572736. Throughput: 0: 5809.1. Samples: 1203576196. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:33,569][25689] Avg episode reward: [(0, '0.192')] [2022-07-11 11:38:34,664][26022] Updated weights on worker 0-0, policy_version 1175370 (0.00090) [2022-07-11 11:38:36,383][26022] Updated weights on worker 0-0, policy_version 1175380 (0.00095) [2022-07-11 11:38:38,314][26022] Updated weights on worker 0-0, policy_version 1175390 (0.00088) [2022-07-11 11:38:38,604][25689] Fps is (10 sec: 5557.5, 60 sec: 5574.4, 300 sec: 5550.6). Total num frames: 1203600384. Throughput: 0: 5800.3. Samples: 1203609752. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:38,605][25689] Avg episode reward: [(0, '-0.114')] [2022-07-11 11:38:40,135][26022] Updated weights on worker 0-0, policy_version 1175400 (0.00092) [2022-07-11 11:38:42,027][26022] Updated weights on worker 0-0, policy_version 1175410 (0.00087) [2022-07-11 11:38:43,612][25689] Fps is (10 sec: 5505.6, 60 sec: 5575.0, 300 sec: 5554.5). Total num frames: 1203628032. Throughput: 0: 5815.4. Samples: 1203626550. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:43,612][25689] Avg episode reward: [(0, '-0.171')] [2022-07-11 11:38:43,872][26022] Updated weights on worker 0-0, policy_version 1175420 (0.00083) [2022-07-11 11:38:45,601][26022] Updated weights on worker 0-0, policy_version 1175430 (0.00091) [2022-07-11 11:38:47,344][26022] Updated weights on worker 0-0, policy_version 1175440 (0.00087) [2022-07-11 11:38:48,619][25689] Fps is (10 sec: 5623.5, 60 sec: 5559.6, 300 sec: 5558.8). Total num frames: 1203656704. Throughput: 0: 5847.8. Samples: 1203660436. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:48,619][25689] Avg episode reward: [(0, '-0.189')] [2022-07-11 11:38:49,344][26022] Updated weights on worker 0-0, policy_version 1175451 (0.00093) [2022-07-11 11:38:51,391][26022] Updated weights on worker 0-0, policy_version 1175461 (0.00469) [2022-07-11 11:38:53,182][26022] Updated weights on worker 0-0, policy_version 1175471 (0.00097) [2022-07-11 11:38:53,726][25689] Fps is (10 sec: 5669.9, 60 sec: 5557.2, 300 sec: 5556.8). Total num frames: 1203685376. Throughput: 0: 5820.9. Samples: 1203693530. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:53,726][25689] Avg episode reward: [(0, '-0.820')] [2022-07-11 11:38:55,172][26022] Updated weights on worker 0-0, policy_version 1175481 (0.00089) [2022-07-11 11:38:56,670][26022] Updated weights on worker 0-0, policy_version 1175491 (0.00097) [2022-07-11 11:38:58,693][26022] Updated weights on worker 0-0, policy_version 1175501 (0.00081) [2022-07-11 11:38:58,788][25689] Fps is (10 sec: 5538.4, 60 sec: 5552.6, 300 sec: 5556.1). Total num frames: 1203713024. Throughput: 0: 4976.7. Samples: 1203710202. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:38:58,788][25689] Avg episode reward: [(0, '0.026')] [2022-07-11 11:39:00,178][26022] Updated weights on worker 0-0, policy_version 1175511 (0.00089) [2022-07-11 11:39:02,784][26022] Updated weights on worker 0-0, policy_version 1175521 (0.00091) [2022-07-11 11:39:03,790][25689] Fps is (10 sec: 5494.3, 60 sec: 5555.5, 300 sec: 5563.5). Total num frames: 1203740672. Throughput: 0: 5721.9. Samples: 1203742008. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:03,791][25689] Avg episode reward: [(0, '0.328')] [2022-07-11 11:39:04,418][26022] Updated weights on worker 0-0, policy_version 1175531 (0.00089) [2022-07-11 11:39:06,406][26022] Updated weights on worker 0-0, policy_version 1175541 (0.00095) [2022-07-11 11:39:08,094][26022] Updated weights on worker 0-0, policy_version 1175551 (0.00086) [2022-07-11 11:39:08,839][25689] Fps is (10 sec: 5399.8, 60 sec: 5551.6, 300 sec: 5555.0). Total num frames: 1203767296. Throughput: 0: 5707.3. Samples: 1203775836. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:08,839][25689] Avg episode reward: [(0, '-0.172')] [2022-07-11 11:39:10,048][26022] Updated weights on worker 0-0, policy_version 1175561 (0.00084) [2022-07-11 11:39:11,745][26022] Updated weights on worker 0-0, policy_version 1175571 (0.00090) [2022-07-11 11:39:13,720][26022] Updated weights on worker 0-0, policy_version 1175581 (0.00094) [2022-07-11 11:39:13,897][25689] Fps is (10 sec: 5369.8, 60 sec: 5536.8, 300 sec: 5561.5). Total num frames: 1203794944. Throughput: 0: 4916.6. Samples: 1203792704. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:13,899][25689] Avg episode reward: [(0, '0.028')] [2022-07-11 11:39:15,431][26022] Updated weights on worker 0-0, policy_version 1175591 (0.00092) [2022-07-11 11:39:17,367][26022] Updated weights on worker 0-0, policy_version 1175601 (0.00100) [2022-07-11 11:39:18,930][25689] Fps is (10 sec: 5682.1, 60 sec: 5552.0, 300 sec: 5557.6). Total num frames: 1203824640. Throughput: 0: 5777.0. Samples: 1203826568. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:18,932][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 11:39:19,153][26022] Updated weights on worker 0-0, policy_version 1175611 (0.00084) [2022-07-11 11:39:20,934][26022] Updated weights on worker 0-0, policy_version 1175621 (0.00087) [2022-07-11 11:39:22,674][26022] Updated weights on worker 0-0, policy_version 1175631 (0.00084) [2022-07-11 11:39:23,963][25689] Fps is (10 sec: 5697.0, 60 sec: 5567.0, 300 sec: 5553.6). Total num frames: 1203852288. Throughput: 0: 5855.1. Samples: 1203860120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:23,963][25689] Avg episode reward: [(0, '0.526')] [2022-07-11 11:39:24,731][26022] Updated weights on worker 0-0, policy_version 1175641 (0.00089) [2022-07-11 11:39:26,383][26022] Updated weights on worker 0-0, policy_version 1175651 (0.00085) [2022-07-11 11:39:28,506][26022] Updated weights on worker 0-0, policy_version 1175661 (0.00085) [2022-07-11 11:39:28,978][25689] Fps is (10 sec: 5605.5, 60 sec: 5567.8, 300 sec: 5561.3). Total num frames: 1203880960. Throughput: 0: 5018.9. Samples: 1203876914. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:28,978][25689] Avg episode reward: [(0, '0.233')] [2022-07-11 11:39:29,908][26022] Updated weights on worker 0-0, policy_version 1175671 (0.00091) [2022-07-11 11:39:32,054][26022] Updated weights on worker 0-0, policy_version 1175681 (0.00086) [2022-07-11 11:39:33,615][26022] Updated weights on worker 0-0, policy_version 1175691 (0.00086) [2022-07-11 11:39:34,079][25689] Fps is (10 sec: 5567.3, 60 sec: 5550.6, 300 sec: 5556.2). Total num frames: 1203908608. Throughput: 0: 5848.6. Samples: 1203910740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:34,079][25689] Avg episode reward: [(0, '-0.401')] [2022-07-11 11:39:35,659][26022] Updated weights on worker 0-0, policy_version 1175701 (0.00091) [2022-07-11 11:39:37,170][26022] Updated weights on worker 0-0, policy_version 1175711 (0.00084) [2022-07-11 11:39:39,118][25689] Fps is (10 sec: 5554.1, 60 sec: 5567.2, 300 sec: 5559.5). Total num frames: 1203937280. Throughput: 0: 5843.0. Samples: 1203944524. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:39,120][25689] Avg episode reward: [(0, '0.331')] [2022-07-11 11:39:39,143][26022] Updated weights on worker 0-0, policy_version 1175721 (0.00095) [2022-07-11 11:39:40,833][26022] Updated weights on worker 0-0, policy_version 1175731 (0.00092) [2022-07-11 11:39:42,876][26022] Updated weights on worker 0-0, policy_version 1175741 (0.00093) [2022-07-11 11:39:44,191][25689] Fps is (10 sec: 5772.1, 60 sec: 5595.1, 300 sec: 5559.1). Total num frames: 1203966976. Throughput: 0: 5013.6. Samples: 1203961536. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:44,193][25689] Avg episode reward: [(0, '0.309')] [2022-07-11 11:39:44,548][26022] Updated weights on worker 0-0, policy_version 1175751 (0.00087) [2022-07-11 11:39:46,390][26022] Updated weights on worker 0-0, policy_version 1175761 (0.00092) [2022-07-11 11:39:48,299][26022] Updated weights on worker 0-0, policy_version 1175771 (0.00086) [2022-07-11 11:39:49,286][25689] Fps is (10 sec: 5639.9, 60 sec: 5570.1, 300 sec: 5565.5). Total num frames: 1203994624. Throughput: 0: 5838.1. Samples: 1203995474. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:49,286][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 11:39:50,022][26022] Updated weights on worker 0-0, policy_version 1175781 (0.00087) [2022-07-11 11:39:51,907][26022] Updated weights on worker 0-0, policy_version 1175791 (0.00088) [2022-07-11 11:39:53,714][26022] Updated weights on worker 0-0, policy_version 1175801 (0.00087) [2022-07-11 11:39:54,358][25689] Fps is (10 sec: 5438.7, 60 sec: 5556.4, 300 sec: 5558.5). Total num frames: 1204022272. Throughput: 0: 5836.3. Samples: 1204029096. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:54,359][25689] Avg episode reward: [(0, '0.229')] [2022-07-11 11:39:55,464][26022] Updated weights on worker 0-0, policy_version 1175811 (0.00092) [2022-07-11 11:39:57,715][26022] Updated weights on worker 0-0, policy_version 1175821 (0.00086) [2022-07-11 11:39:59,006][26022] Updated weights on worker 0-0, policy_version 1175831 (0.00091) [2022-07-11 11:39:59,374][25689] Fps is (10 sec: 5684.4, 60 sec: 5594.4, 300 sec: 5558.9). Total num frames: 1204051968. Throughput: 0: 5006.3. Samples: 1204045930. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:39:59,374][25689] Avg episode reward: [(0, '0.297')] [2022-07-11 11:40:01,386][26022] Updated weights on worker 0-0, policy_version 1175841 (0.00109) [2022-07-11 11:40:03,035][26022] Updated weights on worker 0-0, policy_version 1175851 (0.00085) [2022-07-11 11:40:03,333][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:40:03,354][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001175852_1204072448.pth [2022-07-11 11:40:03,355][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001173895_1202068480.pth [2022-07-11 11:40:04,441][25689] Fps is (10 sec: 5382.5, 60 sec: 5537.8, 300 sec: 5561.6). Total num frames: 1204076544. Throughput: 0: 5705.0. Samples: 1204077064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:04,442][25689] Avg episode reward: [(0, '1.146')] [2022-07-11 11:40:05,288][26022] Updated weights on worker 0-0, policy_version 1175861 (0.00097) [2022-07-11 11:40:06,882][26022] Updated weights on worker 0-0, policy_version 1175871 (0.00090) [2022-07-11 11:40:08,904][26022] Updated weights on worker 0-0, policy_version 1175881 (0.00085) [2022-07-11 11:40:09,460][25689] Fps is (10 sec: 5279.0, 60 sec: 5574.2, 300 sec: 5555.2). Total num frames: 1204105216. Throughput: 0: 5703.3. Samples: 1204110536. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:09,461][25689] Avg episode reward: [(0, '1.137')] [2022-07-11 11:40:10,678][26022] Updated weights on worker 0-0, policy_version 1175891 (0.00094) [2022-07-11 11:40:12,450][26022] Updated weights on worker 0-0, policy_version 1175901 (0.00092) [2022-07-11 11:40:14,093][26022] Updated weights on worker 0-0, policy_version 1175911 (0.00081) [2022-07-11 11:40:14,591][25689] Fps is (10 sec: 5750.7, 60 sec: 5601.3, 300 sec: 5561.2). Total num frames: 1204134912. Throughput: 0: 4862.4. Samples: 1204127476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:14,591][25689] Avg episode reward: [(0, '1.215')] [2022-07-11 11:40:16,395][26022] Updated weights on worker 0-0, policy_version 1175921 (0.00083) [2022-07-11 11:40:17,754][26022] Updated weights on worker 0-0, policy_version 1175931 (0.00084) [2022-07-11 11:40:19,634][25689] Fps is (10 sec: 5435.4, 60 sec: 5533.0, 300 sec: 5557.2). Total num frames: 1204160512. Throughput: 0: 5683.9. Samples: 1204161086. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:19,634][25689] Avg episode reward: [(0, '1.213')] [2022-07-11 11:40:20,018][26022] Updated weights on worker 0-0, policy_version 1175941 (0.00053) [2022-07-11 11:40:21,526][26022] Updated weights on worker 0-0, policy_version 1175951 (0.00086) [2022-07-11 11:40:23,523][26022] Updated weights on worker 0-0, policy_version 1175961 (0.00086) [2022-07-11 11:40:24,672][25689] Fps is (10 sec: 5485.2, 60 sec: 5566.2, 300 sec: 5553.1). Total num frames: 1204190208. Throughput: 0: 5822.4. Samples: 1204194854. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:24,672][25689] Avg episode reward: [(0, '1.115')] [2022-07-11 11:40:25,341][26022] Updated weights on worker 0-0, policy_version 1175971 (0.00087) [2022-07-11 11:40:27,202][26022] Updated weights on worker 0-0, policy_version 1175981 (0.00092) [2022-07-11 11:40:29,015][26022] Updated weights on worker 0-0, policy_version 1175991 (0.00093) [2022-07-11 11:40:29,694][25689] Fps is (10 sec: 5802.0, 60 sec: 5565.6, 300 sec: 5564.1). Total num frames: 1204218880. Throughput: 0: 5000.3. Samples: 1204211708. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:29,694][25689] Avg episode reward: [(0, '1.147')] [2022-07-11 11:40:30,821][26022] Updated weights on worker 0-0, policy_version 1176001 (0.00087) [2022-07-11 11:40:32,647][26022] Updated weights on worker 0-0, policy_version 1176011 (0.00096) [2022-07-11 11:40:34,491][26022] Updated weights on worker 0-0, policy_version 1176021 (0.00084) [2022-07-11 11:40:34,793][25689] Fps is (10 sec: 5564.6, 60 sec: 5565.8, 300 sec: 5560.7). Total num frames: 1204246528. Throughput: 0: 5807.2. Samples: 1204244792. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 11:40:34,793][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 11:40:36,444][26022] Updated weights on worker 0-0, policy_version 1176031 (0.00090) [2022-07-11 11:40:38,100][26022] Updated weights on worker 0-0, policy_version 1176041 (0.00091) [2022-07-11 11:40:39,864][25689] Fps is (10 sec: 5437.0, 60 sec: 5546.0, 300 sec: 5557.4). Total num frames: 1204274176. Throughput: 0: 5795.4. Samples: 1204278328. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:40:39,864][25689] Avg episode reward: [(0, '0.834')] [2022-07-11 11:40:40,128][26022] Updated weights on worker 0-0, policy_version 1176051 (0.00084) [2022-07-11 11:40:41,978][26022] Updated weights on worker 0-0, policy_version 1176061 (0.00083) [2022-07-11 11:40:43,661][26022] Updated weights on worker 0-0, policy_version 1176071 (0.00082) [2022-07-11 11:40:44,936][25689] Fps is (10 sec: 5552.4, 60 sec: 5529.2, 300 sec: 5560.3). Total num frames: 1204302848. Throughput: 0: 5770.0. Samples: 1204311780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:40:44,937][25689] Avg episode reward: [(0, '-0.103')] [2022-07-11 11:40:45,764][26022] Updated weights on worker 0-0, policy_version 1176081 (0.00090) [2022-07-11 11:40:47,499][26022] Updated weights on worker 0-0, policy_version 1176091 (0.00094) [2022-07-11 11:40:49,280][26022] Updated weights on worker 0-0, policy_version 1176101 (0.00086) [2022-07-11 11:40:49,971][25689] Fps is (10 sec: 5572.6, 60 sec: 5534.7, 300 sec: 5557.1). Total num frames: 1204330496. Throughput: 0: 5735.5. Samples: 1204328006. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:40:49,972][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 11:40:51,354][26022] Updated weights on worker 0-0, policy_version 1176111 (0.00061) [2022-07-11 11:40:52,845][26022] Updated weights on worker 0-0, policy_version 1176121 (0.00087) [2022-07-11 11:40:54,868][26022] Updated weights on worker 0-0, policy_version 1176131 (0.00084) [2022-07-11 11:40:55,043][25689] Fps is (10 sec: 5471.4, 60 sec: 5534.7, 300 sec: 5556.0). Total num frames: 1204358144. Throughput: 0: 5762.7. Samples: 1204361486. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:40:55,045][25689] Avg episode reward: [(0, '0.367')] [2022-07-11 11:40:56,569][26022] Updated weights on worker 0-0, policy_version 1176141 (0.00615) [2022-07-11 11:40:58,515][26022] Updated weights on worker 0-0, policy_version 1176151 (0.00084) [2022-07-11 11:41:00,071][25689] Fps is (10 sec: 5474.8, 60 sec: 5499.8, 300 sec: 5562.6). Total num frames: 1204385792. Throughput: 0: 5792.9. Samples: 1204395384. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:00,072][25689] Avg episode reward: [(0, '0.478')] [2022-07-11 11:41:00,414][26022] Updated weights on worker 0-0, policy_version 1176161 (0.00081) [2022-07-11 11:41:02,353][26022] Updated weights on worker 0-0, policy_version 1176171 (0.00080) [2022-07-11 11:41:04,427][26022] Updated weights on worker 0-0, policy_version 1176181 (0.00083) [2022-07-11 11:41:05,100][25689] Fps is (10 sec: 5701.9, 60 sec: 5587.8, 300 sec: 5572.8). Total num frames: 1204415488. Throughput: 0: 4887.4. Samples: 1204410326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:05,100][25689] Avg episode reward: [(0, '0.640')] [2022-07-11 11:41:06,105][26022] Updated weights on worker 0-0, policy_version 1176191 (0.00086) [2022-07-11 11:41:07,884][26022] Updated weights on worker 0-0, policy_version 1176201 (0.00088) [2022-07-11 11:41:09,536][26022] Updated weights on worker 0-0, policy_version 1176211 (0.00095) [2022-07-11 11:41:10,133][25689] Fps is (10 sec: 5597.0, 60 sec: 5552.7, 300 sec: 5559.9). Total num frames: 1204442112. Throughput: 0: 5783.7. Samples: 1204444620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:10,135][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 11:41:11,491][26022] Updated weights on worker 0-0, policy_version 1176221 (0.00088) [2022-07-11 11:41:13,261][26022] Updated weights on worker 0-0, policy_version 1176231 (0.00082) [2022-07-11 11:41:15,234][25689] Fps is (10 sec: 5456.5, 60 sec: 5538.5, 300 sec: 5568.9). Total num frames: 1204470784. Throughput: 0: 5806.1. Samples: 1204478716. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:15,234][25689] Avg episode reward: [(0, '1.404')] [2022-07-11 11:41:15,247][26022] Updated weights on worker 0-0, policy_version 1176241 (0.00091) [2022-07-11 11:41:16,777][26022] Updated weights on worker 0-0, policy_version 1176251 (0.00083) [2022-07-11 11:41:18,724][26022] Updated weights on worker 0-0, policy_version 1176261 (0.00091) [2022-07-11 11:41:20,243][25689] Fps is (10 sec: 5671.9, 60 sec: 5592.2, 300 sec: 5565.4). Total num frames: 1204499456. Throughput: 0: 4965.8. Samples: 1204495558. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:20,244][25689] Avg episode reward: [(0, '1.245')] [2022-07-11 11:41:20,620][26022] Updated weights on worker 0-0, policy_version 1176271 (0.00085) [2022-07-11 11:41:22,450][26022] Updated weights on worker 0-0, policy_version 1176281 (0.00087) [2022-07-11 11:41:24,123][26022] Updated weights on worker 0-0, policy_version 1176291 (0.00079) [2022-07-11 11:41:25,273][25689] Fps is (10 sec: 5610.0, 60 sec: 5559.3, 300 sec: 5565.3). Total num frames: 1204527104. Throughput: 0: 5899.2. Samples: 1204529332. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:25,273][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 11:41:26,065][26022] Updated weights on worker 0-0, policy_version 1176301 (0.00091) [2022-07-11 11:41:27,690][26022] Updated weights on worker 0-0, policy_version 1176311 (0.00090) [2022-07-11 11:41:29,691][26022] Updated weights on worker 0-0, policy_version 1176321 (0.00085) [2022-07-11 11:41:30,323][25689] Fps is (10 sec: 5587.4, 60 sec: 5556.6, 300 sec: 5569.2). Total num frames: 1204555776. Throughput: 0: 5875.0. Samples: 1204563236. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:30,324][25689] Avg episode reward: [(0, '1.342')] [2022-07-11 11:41:31,426][26022] Updated weights on worker 0-0, policy_version 1176331 (0.00084) [2022-07-11 11:41:33,377][26022] Updated weights on worker 0-0, policy_version 1176341 (0.00078) [2022-07-11 11:41:35,115][26022] Updated weights on worker 0-0, policy_version 1176351 (0.00088) [2022-07-11 11:41:35,407][25689] Fps is (10 sec: 5759.4, 60 sec: 5591.8, 300 sec: 5574.6). Total num frames: 1204585472. Throughput: 0: 5027.2. Samples: 1204580132. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:35,408][25689] Avg episode reward: [(0, '1.353')] [2022-07-11 11:41:36,866][26022] Updated weights on worker 0-0, policy_version 1176361 (0.00082) [2022-07-11 11:41:38,640][26022] Updated weights on worker 0-0, policy_version 1176371 (0.00094) [2022-07-11 11:41:40,509][25689] Fps is (10 sec: 5629.6, 60 sec: 5589.0, 300 sec: 5565.9). Total num frames: 1204613120. Throughput: 0: 5849.2. Samples: 1204614098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:40,510][25689] Avg episode reward: [(0, '0.384')] [2022-07-11 11:41:40,562][26022] Updated weights on worker 0-0, policy_version 1176381 (0.00087) [2022-07-11 11:41:42,198][26022] Updated weights on worker 0-0, policy_version 1176391 (0.00091) [2022-07-11 11:41:44,151][26022] Updated weights on worker 0-0, policy_version 1176401 (0.00082) [2022-07-11 11:41:45,538][25689] Fps is (10 sec: 5761.1, 60 sec: 5626.7, 300 sec: 5575.8). Total num frames: 1204643840. Throughput: 0: 5866.6. Samples: 1204648222. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:45,539][25689] Avg episode reward: [(0, '0.316')] [2022-07-11 11:41:45,825][26022] Updated weights on worker 0-0, policy_version 1176411 (0.00084) [2022-07-11 11:41:47,826][26022] Updated weights on worker 0-0, policy_version 1176421 (0.00087) [2022-07-11 11:41:49,584][26022] Updated weights on worker 0-0, policy_version 1176431 (0.00087) [2022-07-11 11:41:50,563][25689] Fps is (10 sec: 5703.8, 60 sec: 5610.7, 300 sec: 5569.1). Total num frames: 1204670464. Throughput: 0: 5033.3. Samples: 1204665110. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:50,564][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 11:41:51,391][26022] Updated weights on worker 0-0, policy_version 1176441 (0.00091) [2022-07-11 11:41:53,099][26022] Updated weights on worker 0-0, policy_version 1176451 (0.00088) [2022-07-11 11:41:55,303][26022] Updated weights on worker 0-0, policy_version 1176461 (0.00084) [2022-07-11 11:41:55,611][25689] Fps is (10 sec: 5489.8, 60 sec: 5629.9, 300 sec: 5571.9). Total num frames: 1204699136. Throughput: 0: 5874.9. Samples: 1204698828. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:41:55,611][25689] Avg episode reward: [(0, '-0.568')] [2022-07-11 11:41:56,771][26022] Updated weights on worker 0-0, policy_version 1176471 (0.00090) [2022-07-11 11:41:58,944][26022] Updated weights on worker 0-0, policy_version 1176481 (0.00088) [2022-07-11 11:42:00,394][26022] Updated weights on worker 0-0, policy_version 1176491 (0.00083) [2022-07-11 11:42:00,648][25689] Fps is (10 sec: 5685.9, 60 sec: 5645.9, 300 sec: 5581.9). Total num frames: 1204727808. Throughput: 0: 5868.1. Samples: 1204732276. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:00,649][25689] Avg episode reward: [(0, '-0.331')] [2022-07-11 11:42:02,776][26022] Updated weights on worker 0-0, policy_version 1176501 (0.00088) [2022-07-11 11:42:03,451][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:42:03,460][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001176505_1204741120.pth [2022-07-11 11:42:03,460][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001174545_1202734080.pth [2022-07-11 11:42:04,492][26022] Updated weights on worker 0-0, policy_version 1176511 (0.00090) [2022-07-11 11:42:05,669][25689] Fps is (10 sec: 5294.1, 60 sec: 5562.2, 300 sec: 5568.0). Total num frames: 1204752384. Throughput: 0: 4908.6. Samples: 1204747034. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:05,671][25689] Avg episode reward: [(0, '-1.033')] [2022-07-11 11:42:06,436][26022] Updated weights on worker 0-0, policy_version 1176521 (0.00084) [2022-07-11 11:42:08,332][26022] Updated weights on worker 0-0, policy_version 1176531 (0.00092) [2022-07-11 11:42:10,055][26022] Updated weights on worker 0-0, policy_version 1176541 (0.00100) [2022-07-11 11:42:10,718][25689] Fps is (10 sec: 5288.0, 60 sec: 5594.5, 300 sec: 5572.0). Total num frames: 1204781056. Throughput: 0: 5735.5. Samples: 1204780708. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:10,719][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 11:42:12,008][26022] Updated weights on worker 0-0, policy_version 1176551 (0.00085) [2022-07-11 11:42:13,913][26022] Updated weights on worker 0-0, policy_version 1176561 (0.00085) [2022-07-11 11:42:15,510][26022] Updated weights on worker 0-0, policy_version 1176571 (0.00090) [2022-07-11 11:42:15,796][25689] Fps is (10 sec: 5662.3, 60 sec: 5596.6, 300 sec: 5568.3). Total num frames: 1204809728. Throughput: 0: 5730.4. Samples: 1204814498. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:15,798][25689] Avg episode reward: [(0, '-0.296')] [2022-07-11 11:42:17,455][26022] Updated weights on worker 0-0, policy_version 1176581 (0.00091) [2022-07-11 11:42:19,044][26022] Updated weights on worker 0-0, policy_version 1176591 (0.00092) [2022-07-11 11:42:20,803][25689] Fps is (10 sec: 5482.9, 60 sec: 5563.0, 300 sec: 5566.1). Total num frames: 1204836352. Throughput: 0: 4913.6. Samples: 1204831308. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:20,804][25689] Avg episode reward: [(0, '-0.399')] [2022-07-11 11:42:21,195][26022] Updated weights on worker 0-0, policy_version 1176601 (0.00087) [2022-07-11 11:42:22,855][26022] Updated weights on worker 0-0, policy_version 1176611 (0.00101) [2022-07-11 11:42:24,685][26022] Updated weights on worker 0-0, policy_version 1176621 (0.00085) [2022-07-11 11:42:25,813][25689] Fps is (10 sec: 5622.5, 60 sec: 5598.6, 300 sec: 5574.2). Total num frames: 1204866048. Throughput: 0: 5862.9. Samples: 1204865138. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:25,814][25689] Avg episode reward: [(0, '-0.068')] [2022-07-11 11:42:26,554][26022] Updated weights on worker 0-0, policy_version 1176631 (0.00090) [2022-07-11 11:42:28,234][26022] Updated weights on worker 0-0, policy_version 1176641 (0.00088) [2022-07-11 11:42:30,307][26022] Updated weights on worker 0-0, policy_version 1176651 (0.00083) [2022-07-11 11:42:30,875][25689] Fps is (10 sec: 5693.6, 60 sec: 5580.7, 300 sec: 5563.9). Total num frames: 1204893696. Throughput: 0: 5857.0. Samples: 1204898766. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:30,875][25689] Avg episode reward: [(0, '0.514')] [2022-07-11 11:42:31,722][26022] Updated weights on worker 0-0, policy_version 1176661 (0.00092) [2022-07-11 11:42:33,944][26022] Updated weights on worker 0-0, policy_version 1176671 (0.00097) [2022-07-11 11:42:35,750][26022] Updated weights on worker 0-0, policy_version 1176681 (0.00085) [2022-07-11 11:42:35,951][25689] Fps is (10 sec: 5555.4, 60 sec: 5564.5, 300 sec: 5570.8). Total num frames: 1204922368. Throughput: 0: 5008.3. Samples: 1204915440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:35,952][25689] Avg episode reward: [(0, '-0.147')] [2022-07-11 11:42:37,323][26022] Updated weights on worker 0-0, policy_version 1176691 (0.00092) [2022-07-11 11:42:39,539][26022] Updated weights on worker 0-0, policy_version 1176701 (0.00062) [2022-07-11 11:42:40,953][25689] Fps is (10 sec: 5689.6, 60 sec: 5590.6, 300 sec: 5574.4). Total num frames: 1204951040. Throughput: 0: 5858.5. Samples: 1204949358. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:40,954][25689] Avg episode reward: [(0, '-0.554')] [2022-07-11 11:42:40,998][26022] Updated weights on worker 0-0, policy_version 1176711 (0.00086) [2022-07-11 11:42:43,041][26022] Updated weights on worker 0-0, policy_version 1176721 (0.00081) [2022-07-11 11:42:44,513][26022] Updated weights on worker 0-0, policy_version 1176731 (0.00089) [2022-07-11 11:42:46,031][25689] Fps is (10 sec: 5587.5, 60 sec: 5535.4, 300 sec: 5566.6). Total num frames: 1204978688. Throughput: 0: 5836.3. Samples: 1204983134. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:46,031][25689] Avg episode reward: [(0, '-0.398')] [2022-07-11 11:42:46,580][26022] Updated weights on worker 0-0, policy_version 1176741 (0.00083) [2022-07-11 11:42:48,303][26022] Updated weights on worker 0-0, policy_version 1176751 (0.00086) [2022-07-11 11:42:50,226][26022] Updated weights on worker 0-0, policy_version 1176761 (0.00089) [2022-07-11 11:42:51,033][25689] Fps is (10 sec: 5587.3, 60 sec: 5571.2, 300 sec: 5568.0). Total num frames: 1205007360. Throughput: 0: 5869.6. Samples: 1205017090. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:51,034][25689] Avg episode reward: [(0, '-0.283')] [2022-07-11 11:42:51,958][26022] Updated weights on worker 0-0, policy_version 1176771 (0.00082) [2022-07-11 11:42:53,809][26022] Updated weights on worker 0-0, policy_version 1176781 (0.00090) [2022-07-11 11:42:55,697][26022] Updated weights on worker 0-0, policy_version 1176791 (0.00079) [2022-07-11 11:42:56,100][25689] Fps is (10 sec: 5796.8, 60 sec: 5586.5, 300 sec: 5573.9). Total num frames: 1205037056. Throughput: 0: 5882.8. Samples: 1205033970. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:42:56,100][25689] Avg episode reward: [(0, '-0.557')] [2022-07-11 11:42:57,397][26022] Updated weights on worker 0-0, policy_version 1176801 (0.00089) [2022-07-11 11:42:59,312][26022] Updated weights on worker 0-0, policy_version 1176811 (0.00086) [2022-07-11 11:43:01,051][26022] Updated weights on worker 0-0, policy_version 1176821 (0.00086) [2022-07-11 11:43:01,107][25689] Fps is (10 sec: 5692.7, 60 sec: 5572.4, 300 sec: 5574.4). Total num frames: 1205064704. Throughput: 0: 5867.5. Samples: 1205067608. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:01,107][25689] Avg episode reward: [(0, '-0.629')] [2022-07-11 11:43:03,339][26022] Updated weights on worker 0-0, policy_version 1176831 (0.00084) [2022-07-11 11:43:05,189][26022] Updated weights on worker 0-0, policy_version 1176841 (0.00088) [2022-07-11 11:43:06,110][25689] Fps is (10 sec: 5319.3, 60 sec: 5590.9, 300 sec: 5571.0). Total num frames: 1205090304. Throughput: 0: 5779.2. Samples: 1205099176. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:06,110][25689] Avg episode reward: [(0, '-0.095')] [2022-07-11 11:43:07,043][26022] Updated weights on worker 0-0, policy_version 1176851 (0.00076) [2022-07-11 11:43:08,935][26022] Updated weights on worker 0-0, policy_version 1176861 (0.00084) [2022-07-11 11:43:10,663][26022] Updated weights on worker 0-0, policy_version 1176871 (0.00089) [2022-07-11 11:43:11,126][25689] Fps is (10 sec: 5314.4, 60 sec: 5577.0, 300 sec: 5568.8). Total num frames: 1205117952. Throughput: 0: 4931.8. Samples: 1205116184. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:11,127][25689] Avg episode reward: [(0, '0.775')] [2022-07-11 11:43:12,459][26022] Updated weights on worker 0-0, policy_version 1176881 (0.00087) [2022-07-11 11:43:14,296][26022] Updated weights on worker 0-0, policy_version 1176891 (0.00083) [2022-07-11 11:43:16,090][26022] Updated weights on worker 0-0, policy_version 1176901 (0.00099) [2022-07-11 11:43:16,186][25689] Fps is (10 sec: 5589.2, 60 sec: 5578.7, 300 sec: 5567.9). Total num frames: 1205146624. Throughput: 0: 5769.2. Samples: 1205149856. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:16,187][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 11:43:18,213][26022] Updated weights on worker 0-0, policy_version 1176911 (0.00086) [2022-07-11 11:43:19,825][26022] Updated weights on worker 0-0, policy_version 1176921 (0.00087) [2022-07-11 11:43:21,192][25689] Fps is (10 sec: 5595.2, 60 sec: 5595.7, 300 sec: 5571.5). Total num frames: 1205174272. Throughput: 0: 5760.0. Samples: 1205183300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:21,194][25689] Avg episode reward: [(0, '0.563')] [2022-07-11 11:43:21,632][26022] Updated weights on worker 0-0, policy_version 1176931 (0.00091) [2022-07-11 11:43:23,392][26022] Updated weights on worker 0-0, policy_version 1176941 (0.00085) [2022-07-11 11:43:25,063][26022] Updated weights on worker 0-0, policy_version 1176951 (0.00090) [2022-07-11 11:43:26,199][25689] Fps is (10 sec: 5624.7, 60 sec: 5579.0, 300 sec: 5571.8). Total num frames: 1205202944. Throughput: 0: 5045.4. Samples: 1205200536. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:26,201][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 11:43:27,171][26022] Updated weights on worker 0-0, policy_version 1176961 (0.00085) [2022-07-11 11:43:28,960][26022] Updated weights on worker 0-0, policy_version 1176971 (0.00055) [2022-07-11 11:43:30,912][26022] Updated weights on worker 0-0, policy_version 1176981 (0.00089) [2022-07-11 11:43:31,251][25689] Fps is (10 sec: 5700.9, 60 sec: 5596.9, 300 sec: 5572.7). Total num frames: 1205231616. Throughput: 0: 5844.8. Samples: 1205233808. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:31,251][25689] Avg episode reward: [(0, '1.151')] [2022-07-11 11:43:32,672][26022] Updated weights on worker 0-0, policy_version 1176991 (0.00083) [2022-07-11 11:43:34,376][26022] Updated weights on worker 0-0, policy_version 1177001 (0.00080) [2022-07-11 11:43:36,198][26022] Updated weights on worker 0-0, policy_version 1177011 (0.00051) [2022-07-11 11:43:36,362][25689] Fps is (10 sec: 5541.7, 60 sec: 5576.7, 300 sec: 5571.2). Total num frames: 1205259264. Throughput: 0: 5829.0. Samples: 1205267462. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:36,363][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 11:43:38,159][26022] Updated weights on worker 0-0, policy_version 1177021 (0.00091) [2022-07-11 11:43:39,856][26022] Updated weights on worker 0-0, policy_version 1177031 (0.00090) [2022-07-11 11:43:41,376][25689] Fps is (10 sec: 5461.4, 60 sec: 5558.8, 300 sec: 5571.1). Total num frames: 1205286912. Throughput: 0: 5013.8. Samples: 1205284496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:41,377][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 11:43:41,946][26022] Updated weights on worker 0-0, policy_version 1177041 (0.00085) [2022-07-11 11:43:43,383][26022] Updated weights on worker 0-0, policy_version 1177051 (0.00090) [2022-07-11 11:43:45,477][26022] Updated weights on worker 0-0, policy_version 1177061 (0.00090) [2022-07-11 11:43:46,400][25689] Fps is (10 sec: 5814.8, 60 sec: 5614.5, 300 sec: 5577.7). Total num frames: 1205317632. Throughput: 0: 5836.2. Samples: 1205318432. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:46,401][25689] Avg episode reward: [(0, '1.101')] [2022-07-11 11:43:47,099][26022] Updated weights on worker 0-0, policy_version 1177071 (0.00087) [2022-07-11 11:43:48,938][26022] Updated weights on worker 0-0, policy_version 1177081 (0.00090) [2022-07-11 11:43:50,886][26022] Updated weights on worker 0-0, policy_version 1177091 (0.00084) [2022-07-11 11:43:51,417][25689] Fps is (10 sec: 5608.5, 60 sec: 5562.3, 300 sec: 5569.0). Total num frames: 1205343232. Throughput: 0: 5851.0. Samples: 1205351804. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:51,418][25689] Avg episode reward: [(0, '1.242')] [2022-07-11 11:43:52,647][26022] Updated weights on worker 0-0, policy_version 1177101 (0.00089) [2022-07-11 11:43:54,652][26022] Updated weights on worker 0-0, policy_version 1177111 (0.00089) [2022-07-11 11:43:56,494][25689] Fps is (10 sec: 5377.0, 60 sec: 5544.4, 300 sec: 5572.2). Total num frames: 1205371904. Throughput: 0: 5017.9. Samples: 1205368478. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:43:56,494][25689] Avg episode reward: [(0, '1.236')] [2022-07-11 11:43:56,508][26022] Updated weights on worker 0-0, policy_version 1177121 (0.00087) [2022-07-11 11:43:58,076][26022] Updated weights on worker 0-0, policy_version 1177131 (0.00080) [2022-07-11 11:43:59,988][26022] Updated weights on worker 0-0, policy_version 1177141 (0.00086) [2022-07-11 11:44:01,558][25689] Fps is (10 sec: 5655.2, 60 sec: 5556.1, 300 sec: 5574.5). Total num frames: 1205400576. Throughput: 0: 5833.6. Samples: 1205402230. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:01,558][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 11:44:01,872][26022] Updated weights on worker 0-0, policy_version 1177151 (0.00087) [2022-07-11 11:44:03,484][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:44:03,494][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001177157_1205408768.pth [2022-07-11 11:44:03,495][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001175200_1203404800.pth [2022-07-11 11:44:03,939][26022] Updated weights on worker 0-0, policy_version 1177161 (0.00085) [2022-07-11 11:44:05,831][26022] Updated weights on worker 0-0, policy_version 1177171 (0.00089) [2022-07-11 11:44:06,619][25689] Fps is (10 sec: 5360.2, 60 sec: 5550.8, 300 sec: 5570.8). Total num frames: 1205426176. Throughput: 0: 5717.7. Samples: 1205434036. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:06,619][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 11:44:07,650][26022] Updated weights on worker 0-0, policy_version 1177181 (0.00089) [2022-07-11 11:44:09,659][26022] Updated weights on worker 0-0, policy_version 1177191 (0.00086) [2022-07-11 11:44:11,337][26022] Updated weights on worker 0-0, policy_version 1177201 (0.00084) [2022-07-11 11:44:11,638][25689] Fps is (10 sec: 5485.4, 60 sec: 5584.4, 300 sec: 5578.4). Total num frames: 1205455872. Throughput: 0: 4887.4. Samples: 1205450630. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:11,639][25689] Avg episode reward: [(0, '0.989')] [2022-07-11 11:44:13,218][26022] Updated weights on worker 0-0, policy_version 1177211 (0.00077) [2022-07-11 11:44:14,870][26022] Updated weights on worker 0-0, policy_version 1177221 (0.00096) [2022-07-11 11:44:16,750][25689] Fps is (10 sec: 5558.8, 60 sec: 5545.7, 300 sec: 5566.6). Total num frames: 1205482496. Throughput: 0: 5719.0. Samples: 1205484326. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:16,751][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 11:44:17,004][26022] Updated weights on worker 0-0, policy_version 1177231 (0.00089) [2022-07-11 11:44:18,663][26022] Updated weights on worker 0-0, policy_version 1177241 (0.00094) [2022-07-11 11:44:20,491][26022] Updated weights on worker 0-0, policy_version 1177251 (0.00115) [2022-07-11 11:44:21,812][25689] Fps is (10 sec: 5535.8, 60 sec: 5574.4, 300 sec: 5573.0). Total num frames: 1205512192. Throughput: 0: 5710.2. Samples: 1205517886. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:21,812][25689] Avg episode reward: [(0, '0.902')] [2022-07-11 11:44:22,379][26022] Updated weights on worker 0-0, policy_version 1177261 (0.00079) [2022-07-11 11:44:24,111][26022] Updated weights on worker 0-0, policy_version 1177271 (0.00082) [2022-07-11 11:44:26,027][26022] Updated weights on worker 0-0, policy_version 1177281 (0.00097) [2022-07-11 11:44:26,856][25689] Fps is (10 sec: 5775.8, 60 sec: 5571.0, 300 sec: 5572.4). Total num frames: 1205540864. Throughput: 0: 4986.2. Samples: 1205534944. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:26,858][25689] Avg episode reward: [(0, '0.818')] [2022-07-11 11:44:27,708][26022] Updated weights on worker 0-0, policy_version 1177291 (0.00092) [2022-07-11 11:44:29,528][26022] Updated weights on worker 0-0, policy_version 1177301 (0.00096) [2022-07-11 11:44:31,476][26022] Updated weights on worker 0-0, policy_version 1177311 (0.00090) [2022-07-11 11:44:31,907][25689] Fps is (10 sec: 5477.5, 60 sec: 5537.3, 300 sec: 5569.9). Total num frames: 1205567488. Throughput: 0: 5806.1. Samples: 1205568312. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:31,909][25689] Avg episode reward: [(0, '1.233')] [2022-07-11 11:44:33,346][26022] Updated weights on worker 0-0, policy_version 1177321 (0.00088) [2022-07-11 11:44:35,266][26022] Updated weights on worker 0-0, policy_version 1177331 (0.00086) [2022-07-11 11:44:37,021][25689] Fps is (10 sec: 5440.1, 60 sec: 5554.0, 300 sec: 5568.5). Total num frames: 1205596160. Throughput: 0: 5804.8. Samples: 1205601990. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2022-07-11 11:44:37,022][25689] Avg episode reward: [(0, '1.198')] [2022-07-11 11:44:37,026][26022] Updated weights on worker 0-0, policy_version 1177341 (0.00087) [2022-07-11 11:44:38,827][26022] Updated weights on worker 0-0, policy_version 1177351 (0.00096) [2022-07-11 11:44:40,654][26022] Updated weights on worker 0-0, policy_version 1177361 (0.00091) [2022-07-11 11:44:42,044][25689] Fps is (10 sec: 5757.9, 60 sec: 5586.8, 300 sec: 5569.5). Total num frames: 1205625856. Throughput: 0: 4998.1. Samples: 1205619008. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:44:42,049][25689] Avg episode reward: [(0, '1.249')] [2022-07-11 11:44:42,532][26022] Updated weights on worker 0-0, policy_version 1177371 (0.00093) [2022-07-11 11:44:44,132][26022] Updated weights on worker 0-0, policy_version 1177381 (0.00089) [2022-07-11 11:44:46,054][26022] Updated weights on worker 0-0, policy_version 1177391 (0.00083) [2022-07-11 11:44:47,057][25689] Fps is (10 sec: 5713.6, 60 sec: 5537.3, 300 sec: 5571.0). Total num frames: 1205653504. Throughput: 0: 5845.0. Samples: 1205653018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:44:47,058][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 11:44:47,725][26022] Updated weights on worker 0-0, policy_version 1177401 (0.00089) [2022-07-11 11:44:49,642][26022] Updated weights on worker 0-0, policy_version 1177411 (0.00085) [2022-07-11 11:44:51,346][26022] Updated weights on worker 0-0, policy_version 1177421 (0.00082) [2022-07-11 11:44:52,078][25689] Fps is (10 sec: 5612.8, 60 sec: 5587.5, 300 sec: 5575.4). Total num frames: 1205682176. Throughput: 0: 5899.3. Samples: 1205687308. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:44:52,083][25689] Avg episode reward: [(0, '1.414')] [2022-07-11 11:44:53,251][26022] Updated weights on worker 0-0, policy_version 1177431 (0.00094) [2022-07-11 11:44:55,073][26022] Updated weights on worker 0-0, policy_version 1177441 (0.00089) [2022-07-11 11:44:56,750][26022] Updated weights on worker 0-0, policy_version 1177451 (0.00083) [2022-07-11 11:44:57,164][25689] Fps is (10 sec: 5775.1, 60 sec: 5603.5, 300 sec: 5574.1). Total num frames: 1205711872. Throughput: 0: 5081.8. Samples: 1205704354. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:44:57,164][25689] Avg episode reward: [(0, '1.539')] [2022-07-11 11:44:58,487][26022] Updated weights on worker 0-0, policy_version 1177461 (0.00094) [2022-07-11 11:45:00,440][26022] Updated weights on worker 0-0, policy_version 1177471 (0.00094) [2022-07-11 11:45:02,205][25689] Fps is (10 sec: 5460.6, 60 sec: 5555.0, 300 sec: 5578.0). Total num frames: 1205737472. Throughput: 0: 5913.6. Samples: 1205738230. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:02,205][25689] Avg episode reward: [(0, '1.519')] [2022-07-11 11:45:02,811][26022] Updated weights on worker 0-0, policy_version 1177481 (0.00081) [2022-07-11 11:45:04,423][26022] Updated weights on worker 0-0, policy_version 1177491 (0.00088) [2022-07-11 11:45:06,339][26022] Updated weights on worker 0-0, policy_version 1177501 (0.00085) [2022-07-11 11:45:07,229][25689] Fps is (10 sec: 5392.1, 60 sec: 5609.1, 300 sec: 5577.9). Total num frames: 1205766144. Throughput: 0: 5786.8. Samples: 1205769750. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:07,229][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 11:45:08,217][26022] Updated weights on worker 0-0, policy_version 1177511 (0.00089) [2022-07-11 11:45:09,967][26022] Updated weights on worker 0-0, policy_version 1177521 (0.00083) [2022-07-11 11:45:11,894][26022] Updated weights on worker 0-0, policy_version 1177531 (0.00084) [2022-07-11 11:45:12,247][25689] Fps is (10 sec: 5608.4, 60 sec: 5575.4, 300 sec: 5573.1). Total num frames: 1205793792. Throughput: 0: 5771.7. Samples: 1205803714. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:12,247][25689] Avg episode reward: [(0, '0.188')] [2022-07-11 11:45:13,573][26022] Updated weights on worker 0-0, policy_version 1177541 (0.00090) [2022-07-11 11:45:15,427][26022] Updated weights on worker 0-0, policy_version 1177551 (0.00089) [2022-07-11 11:45:16,955][26022] Updated weights on worker 0-0, policy_version 1177561 (0.00076) [2022-07-11 11:45:17,386][25689] Fps is (10 sec: 5645.8, 60 sec: 5623.6, 300 sec: 5585.1). Total num frames: 1205823488. Throughput: 0: 5760.7. Samples: 1205820848. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:17,386][25689] Avg episode reward: [(0, '-0.127')] [2022-07-11 11:45:19,201][26022] Updated weights on worker 0-0, policy_version 1177571 (0.00082) [2022-07-11 11:45:20,852][26022] Updated weights on worker 0-0, policy_version 1177581 (0.00111) [2022-07-11 11:45:22,434][25689] Fps is (10 sec: 5729.7, 60 sec: 5608.0, 300 sec: 5581.5). Total num frames: 1205852160. Throughput: 0: 5750.3. Samples: 1205854554. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:22,434][25689] Avg episode reward: [(0, '-0.475')] [2022-07-11 11:45:22,571][26022] Updated weights on worker 0-0, policy_version 1177591 (0.00092) [2022-07-11 11:45:24,453][26022] Updated weights on worker 0-0, policy_version 1177601 (0.00091) [2022-07-11 11:45:26,362][26022] Updated weights on worker 0-0, policy_version 1177611 (0.00087) [2022-07-11 11:45:27,456][25689] Fps is (10 sec: 5592.8, 60 sec: 5593.1, 300 sec: 5578.0). Total num frames: 1205879808. Throughput: 0: 5859.4. Samples: 1205888268. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:27,456][25689] Avg episode reward: [(0, '-0.483')] [2022-07-11 11:45:28,115][26022] Updated weights on worker 0-0, policy_version 1177621 (0.00085) [2022-07-11 11:45:30,185][26022] Updated weights on worker 0-0, policy_version 1177631 (0.00089) [2022-07-11 11:45:31,725][26022] Updated weights on worker 0-0, policy_version 1177641 (0.00091) [2022-07-11 11:45:32,488][25689] Fps is (10 sec: 5499.7, 60 sec: 5611.8, 300 sec: 5579.3). Total num frames: 1205907456. Throughput: 0: 5011.2. Samples: 1205905148. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:32,489][25689] Avg episode reward: [(0, '-0.249')] [2022-07-11 11:45:33,789][26022] Updated weights on worker 0-0, policy_version 1177651 (0.00411) [2022-07-11 11:45:35,540][26022] Updated weights on worker 0-0, policy_version 1177661 (0.00093) [2022-07-11 11:45:37,315][26022] Updated weights on worker 0-0, policy_version 1177671 (0.00082) [2022-07-11 11:45:37,532][25689] Fps is (10 sec: 5589.4, 60 sec: 5618.2, 300 sec: 5583.2). Total num frames: 1205936128. Throughput: 0: 5850.9. Samples: 1205938722. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:37,533][25689] Avg episode reward: [(0, '0.800')] [2022-07-11 11:45:39,130][26022] Updated weights on worker 0-0, policy_version 1177681 (0.00097) [2022-07-11 11:45:41,007][26022] Updated weights on worker 0-0, policy_version 1177691 (0.00086) [2022-07-11 11:45:42,539][25689] Fps is (10 sec: 5705.4, 60 sec: 5602.9, 300 sec: 5584.5). Total num frames: 1205964800. Throughput: 0: 5875.6. Samples: 1205972684. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:42,541][25689] Avg episode reward: [(0, '1.182')] [2022-07-11 11:45:42,606][26022] Updated weights on worker 0-0, policy_version 1177701 (0.00086) [2022-07-11 11:45:44,656][26022] Updated weights on worker 0-0, policy_version 1177711 (0.00083) [2022-07-11 11:45:46,205][26022] Updated weights on worker 0-0, policy_version 1177721 (0.00087) [2022-07-11 11:45:47,591][25689] Fps is (10 sec: 5497.4, 60 sec: 5582.3, 300 sec: 5580.7). Total num frames: 1205991424. Throughput: 0: 5040.6. Samples: 1205989760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:47,591][25689] Avg episode reward: [(0, '1.382')] [2022-07-11 11:45:48,195][26022] Updated weights on worker 0-0, policy_version 1177731 (0.00083) [2022-07-11 11:45:49,991][26022] Updated weights on worker 0-0, policy_version 1177741 (0.00086) [2022-07-11 11:45:51,793][26022] Updated weights on worker 0-0, policy_version 1177751 (0.00546) [2022-07-11 11:45:52,602][25689] Fps is (10 sec: 5596.9, 60 sec: 5600.2, 300 sec: 5588.7). Total num frames: 1206021120. Throughput: 0: 5889.1. Samples: 1206023598. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:52,602][25689] Avg episode reward: [(0, '1.551')] [2022-07-11 11:45:53,787][26022] Updated weights on worker 0-0, policy_version 1177761 (0.00084) [2022-07-11 11:45:55,497][26022] Updated weights on worker 0-0, policy_version 1177771 (0.00089) [2022-07-11 11:45:57,379][26022] Updated weights on worker 0-0, policy_version 1177781 (0.00089) [2022-07-11 11:45:57,741][25689] Fps is (10 sec: 5750.3, 60 sec: 5578.3, 300 sec: 5590.1). Total num frames: 1206049792. Throughput: 0: 5855.4. Samples: 1206057054. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:45:57,743][25689] Avg episode reward: [(0, '1.288')] [2022-07-11 11:45:59,235][26022] Updated weights on worker 0-0, policy_version 1177791 (0.00084) [2022-07-11 11:46:00,839][26022] Updated weights on worker 0-0, policy_version 1177801 (0.00085) [2022-07-11 11:46:02,771][25689] Fps is (10 sec: 5236.3, 60 sec: 5562.5, 300 sec: 5572.9). Total num frames: 1206074368. Throughput: 0: 4998.4. Samples: 1206073810. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:02,773][25689] Avg episode reward: [(0, '0.427')] [2022-07-11 11:46:03,313][26022] Updated weights on worker 0-0, policy_version 1177811 (0.00090) [2022-07-11 11:46:03,572][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:46:03,585][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001177813_1206080512.pth [2022-07-11 11:46:03,585][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001175852_1204072448.pth [2022-07-11 11:46:05,012][26022] Updated weights on worker 0-0, policy_version 1177821 (0.00094) [2022-07-11 11:46:06,884][26022] Updated weights on worker 0-0, policy_version 1177831 (0.00094) [2022-07-11 11:46:07,787][25689] Fps is (10 sec: 5300.5, 60 sec: 5563.2, 300 sec: 5580.1). Total num frames: 1206103040. Throughput: 0: 5704.2. Samples: 1206104962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:07,789][25689] Avg episode reward: [(0, '0.600')] [2022-07-11 11:46:08,779][26022] Updated weights on worker 0-0, policy_version 1177841 (0.00085) [2022-07-11 11:46:10,619][26022] Updated weights on worker 0-0, policy_version 1177851 (0.00051) [2022-07-11 11:46:12,427][26022] Updated weights on worker 0-0, policy_version 1177861 (0.00091) [2022-07-11 11:46:12,795][25689] Fps is (10 sec: 5618.1, 60 sec: 5564.1, 300 sec: 5578.4). Total num frames: 1206130688. Throughput: 0: 5682.4. Samples: 1206138348. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:12,796][25689] Avg episode reward: [(0, '0.256')] [2022-07-11 11:46:14,281][26022] Updated weights on worker 0-0, policy_version 1177871 (0.00094) [2022-07-11 11:46:16,047][26022] Updated weights on worker 0-0, policy_version 1177881 (0.00083) [2022-07-11 11:46:17,916][25689] Fps is (10 sec: 5459.3, 60 sec: 5532.0, 300 sec: 5572.8). Total num frames: 1206158336. Throughput: 0: 4858.0. Samples: 1206155062. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:17,916][25689] Avg episode reward: [(0, '0.487')] [2022-07-11 11:46:18,018][26022] Updated weights on worker 0-0, policy_version 1177891 (0.00081) [2022-07-11 11:46:19,884][26022] Updated weights on worker 0-0, policy_version 1177901 (0.00091) [2022-07-11 11:46:21,552][26022] Updated weights on worker 0-0, policy_version 1177911 (0.00090) [2022-07-11 11:46:22,965][25689] Fps is (10 sec: 5638.8, 60 sec: 5548.7, 300 sec: 5579.3). Total num frames: 1206188032. Throughput: 0: 5707.8. Samples: 1206189076. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:22,966][25689] Avg episode reward: [(0, '0.513')] [2022-07-11 11:46:23,519][26022] Updated weights on worker 0-0, policy_version 1177921 (0.00085) [2022-07-11 11:46:25,223][26022] Updated weights on worker 0-0, policy_version 1177931 (0.00084) [2022-07-11 11:46:27,167][26022] Updated weights on worker 0-0, policy_version 1177941 (0.00085) [2022-07-11 11:46:28,012][25689] Fps is (10 sec: 5781.0, 60 sec: 5563.4, 300 sec: 5579.4). Total num frames: 1206216704. Throughput: 0: 5824.6. Samples: 1206222766. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:28,013][25689] Avg episode reward: [(0, '0.381')] [2022-07-11 11:46:28,948][26022] Updated weights on worker 0-0, policy_version 1177951 (0.00091) [2022-07-11 11:46:30,810][26022] Updated weights on worker 0-0, policy_version 1177961 (0.00084) [2022-07-11 11:46:32,646][26022] Updated weights on worker 0-0, policy_version 1177971 (0.00082) [2022-07-11 11:46:33,046][25689] Fps is (10 sec: 5485.3, 60 sec: 5546.3, 300 sec: 5570.0). Total num frames: 1206243328. Throughput: 0: 5012.0. Samples: 1206239842. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:33,046][25689] Avg episode reward: [(0, '-0.030')] [2022-07-11 11:46:34,291][26022] Updated weights on worker 0-0, policy_version 1177981 (0.00092) [2022-07-11 11:46:36,360][26022] Updated weights on worker 0-0, policy_version 1177991 (0.00082) [2022-07-11 11:46:37,909][26022] Updated weights on worker 0-0, policy_version 1178001 (0.00083) [2022-07-11 11:46:38,099][25689] Fps is (10 sec: 5583.7, 60 sec: 5562.4, 300 sec: 5577.8). Total num frames: 1206273024. Throughput: 0: 5870.9. Samples: 1206273554. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:38,099][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 11:46:39,915][26022] Updated weights on worker 0-0, policy_version 1178011 (0.00071) [2022-07-11 11:46:41,618][26022] Updated weights on worker 0-0, policy_version 1178021 (0.00092) [2022-07-11 11:46:43,104][25689] Fps is (10 sec: 5803.3, 60 sec: 5562.6, 300 sec: 5571.4). Total num frames: 1206301696. Throughput: 0: 5878.2. Samples: 1206307454. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:43,104][25689] Avg episode reward: [(0, '0.208')] [2022-07-11 11:46:43,522][26022] Updated weights on worker 0-0, policy_version 1178031 (0.00087) [2022-07-11 11:46:45,142][26022] Updated weights on worker 0-0, policy_version 1178041 (0.00074) [2022-07-11 11:46:47,077][26022] Updated weights on worker 0-0, policy_version 1178051 (0.00091) [2022-07-11 11:46:48,109][25689] Fps is (10 sec: 5626.1, 60 sec: 5583.8, 300 sec: 5575.2). Total num frames: 1206329344. Throughput: 0: 5051.6. Samples: 1206324288. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:48,110][25689] Avg episode reward: [(0, '0.310')] [2022-07-11 11:46:48,779][26022] Updated weights on worker 0-0, policy_version 1178061 (0.00086) [2022-07-11 11:46:50,901][26022] Updated weights on worker 0-0, policy_version 1178071 (0.00096) [2022-07-11 11:46:52,429][26022] Updated weights on worker 0-0, policy_version 1178081 (0.00087) [2022-07-11 11:46:53,141][25689] Fps is (10 sec: 5509.1, 60 sec: 5548.0, 300 sec: 5572.1). Total num frames: 1206356992. Throughput: 0: 5879.3. Samples: 1206357986. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:53,141][25689] Avg episode reward: [(0, '0.681')] [2022-07-11 11:46:54,468][26022] Updated weights on worker 0-0, policy_version 1178091 (0.00092) [2022-07-11 11:46:56,362][26022] Updated weights on worker 0-0, policy_version 1178101 (0.00097) [2022-07-11 11:46:58,014][26022] Updated weights on worker 0-0, policy_version 1178111 (0.00083) [2022-07-11 11:46:58,191][25689] Fps is (10 sec: 5586.5, 60 sec: 5556.2, 300 sec: 5571.8). Total num frames: 1206385664. Throughput: 0: 5863.7. Samples: 1206391368. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:46:58,191][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 11:46:59,902][26022] Updated weights on worker 0-0, policy_version 1178121 (0.00092) [2022-07-11 11:47:02,130][26022] Updated weights on worker 0-0, policy_version 1178131 (0.00098) [2022-07-11 11:47:03,215][25689] Fps is (10 sec: 5387.2, 60 sec: 5573.7, 300 sec: 5575.2). Total num frames: 1206411264. Throughput: 0: 5013.2. Samples: 1206408276. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:03,215][25689] Avg episode reward: [(0, '1.191')] [2022-07-11 11:47:03,995][26022] Updated weights on worker 0-0, policy_version 1178141 (0.00096) [2022-07-11 11:47:05,794][26022] Updated weights on worker 0-0, policy_version 1178151 (0.00088) [2022-07-11 11:47:07,843][26022] Updated weights on worker 0-0, policy_version 1178161 (0.00088) [2022-07-11 11:47:08,238][25689] Fps is (10 sec: 5299.5, 60 sec: 5556.1, 300 sec: 5572.2). Total num frames: 1206438912. Throughput: 0: 5736.4. Samples: 1206439758. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:08,240][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 11:47:09,406][26022] Updated weights on worker 0-0, policy_version 1178171 (0.00082) [2022-07-11 11:47:11,411][26022] Updated weights on worker 0-0, policy_version 1178181 (0.00083) [2022-07-11 11:47:13,071][26022] Updated weights on worker 0-0, policy_version 1178191 (0.00085) [2022-07-11 11:47:13,267][25689] Fps is (10 sec: 5704.8, 60 sec: 5588.1, 300 sec: 5576.6). Total num frames: 1206468608. Throughput: 0: 5735.1. Samples: 1206473412. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:13,267][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 11:47:14,866][26022] Updated weights on worker 0-0, policy_version 1178201 (0.00077) [2022-07-11 11:47:16,747][26022] Updated weights on worker 0-0, policy_version 1178211 (0.00087) [2022-07-11 11:47:18,404][25689] Fps is (10 sec: 5640.9, 60 sec: 5586.5, 300 sec: 5577.6). Total num frames: 1206496256. Throughput: 0: 4892.7. Samples: 1206490266. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:18,406][25689] Avg episode reward: [(0, '0.925')] [2022-07-11 11:47:18,519][26022] Updated weights on worker 0-0, policy_version 1178221 (0.00094) [2022-07-11 11:47:20,446][26022] Updated weights on worker 0-0, policy_version 1178231 (0.00087) [2022-07-11 11:47:22,137][26022] Updated weights on worker 0-0, policy_version 1178241 (0.00087) [2022-07-11 11:47:23,422][25689] Fps is (10 sec: 5545.8, 60 sec: 5572.5, 300 sec: 5574.0). Total num frames: 1206524928. Throughput: 0: 5733.2. Samples: 1206524130. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:23,424][25689] Avg episode reward: [(0, '0.446')] [2022-07-11 11:47:24,097][26022] Updated weights on worker 0-0, policy_version 1178251 (0.00086) [2022-07-11 11:47:25,743][26022] Updated weights on worker 0-0, policy_version 1178261 (0.00088) [2022-07-11 11:47:27,784][26022] Updated weights on worker 0-0, policy_version 1178271 (0.00084) [2022-07-11 11:47:28,434][25689] Fps is (10 sec: 5717.1, 60 sec: 5575.7, 300 sec: 5578.4). Total num frames: 1206553600. Throughput: 0: 5847.4. Samples: 1206557852. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:28,435][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 11:47:29,575][26022] Updated weights on worker 0-0, policy_version 1178281 (0.00089) [2022-07-11 11:47:31,366][26022] Updated weights on worker 0-0, policy_version 1178291 (0.00080) [2022-07-11 11:47:33,208][26022] Updated weights on worker 0-0, policy_version 1178301 (0.00094) [2022-07-11 11:47:33,483][25689] Fps is (10 sec: 5496.5, 60 sec: 5574.3, 300 sec: 5572.0). Total num frames: 1206580224. Throughput: 0: 5001.2. Samples: 1206574516. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:33,483][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 11:47:34,950][26022] Updated weights on worker 0-0, policy_version 1178311 (0.00091) [2022-07-11 11:47:36,923][26022] Updated weights on worker 0-0, policy_version 1178321 (0.00086) [2022-07-11 11:47:38,552][25689] Fps is (10 sec: 5566.5, 60 sec: 5572.8, 300 sec: 5574.2). Total num frames: 1206609920. Throughput: 0: 5848.0. Samples: 1206608090. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:38,553][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 11:47:38,664][26022] Updated weights on worker 0-0, policy_version 1178331 (0.00083) [2022-07-11 11:47:40,609][26022] Updated weights on worker 0-0, policy_version 1178341 (0.00088) [2022-07-11 11:47:42,272][26022] Updated weights on worker 0-0, policy_version 1178351 (0.00096) [2022-07-11 11:47:43,572][25689] Fps is (10 sec: 5683.7, 60 sec: 5554.5, 300 sec: 5575.3). Total num frames: 1206637568. Throughput: 0: 5841.5. Samples: 1206641834. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:43,572][25689] Avg episode reward: [(0, '0.438')] [2022-07-11 11:47:44,137][26022] Updated weights on worker 0-0, policy_version 1178361 (0.00094) [2022-07-11 11:47:45,943][26022] Updated weights on worker 0-0, policy_version 1178371 (0.00089) [2022-07-11 11:47:47,748][26022] Updated weights on worker 0-0, policy_version 1178381 (0.00088) [2022-07-11 11:47:48,643][25689] Fps is (10 sec: 5581.2, 60 sec: 5565.4, 300 sec: 5574.0). Total num frames: 1206666240. Throughput: 0: 5830.9. Samples: 1206675686. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:48,644][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 11:47:49,669][26022] Updated weights on worker 0-0, policy_version 1178391 (0.00053) [2022-07-11 11:47:51,390][26022] Updated weights on worker 0-0, policy_version 1178401 (0.00093) [2022-07-11 11:47:53,151][26022] Updated weights on worker 0-0, policy_version 1178411 (0.00092) [2022-07-11 11:47:53,656][25689] Fps is (10 sec: 5686.4, 60 sec: 5584.0, 300 sec: 5571.6). Total num frames: 1206694912. Throughput: 0: 5856.4. Samples: 1206692660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:53,657][25689] Avg episode reward: [(0, '1.478')] [2022-07-11 11:47:55,132][26022] Updated weights on worker 0-0, policy_version 1178421 (0.00083) [2022-07-11 11:47:56,893][26022] Updated weights on worker 0-0, policy_version 1178431 (0.00086) [2022-07-11 11:47:58,731][25689] Fps is (10 sec: 5684.5, 60 sec: 5581.7, 300 sec: 5573.7). Total num frames: 1206723584. Throughput: 0: 5872.8. Samples: 1206726596. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:47:58,732][25689] Avg episode reward: [(0, '1.544')] [2022-07-11 11:47:58,736][26022] Updated weights on worker 0-0, policy_version 1178441 (0.00082) [2022-07-11 11:48:00,710][26022] Updated weights on worker 0-0, policy_version 1178451 (0.00088) [2022-07-11 11:48:02,590][26022] Updated weights on worker 0-0, policy_version 1178461 (0.00094) [2022-07-11 11:48:03,731][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:48:03,746][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001178467_1206750208.pth [2022-07-11 11:48:03,747][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001176505_1204741120.pth [2022-07-11 11:48:03,748][25689] Fps is (10 sec: 5479.6, 60 sec: 5599.4, 300 sec: 5576.9). Total num frames: 1206750208. Throughput: 0: 5762.8. Samples: 1206758102. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:03,748][25689] Avg episode reward: [(0, '1.400')] [2022-07-11 11:48:04,743][26022] Updated weights on worker 0-0, policy_version 1178471 (0.00082) [2022-07-11 11:48:06,183][26022] Updated weights on worker 0-0, policy_version 1178481 (0.00083) [2022-07-11 11:48:08,270][26022] Updated weights on worker 0-0, policy_version 1178491 (0.00089) [2022-07-11 11:48:08,777][25689] Fps is (10 sec: 5300.3, 60 sec: 5581.9, 300 sec: 5573.2). Total num frames: 1206776832. Throughput: 0: 4923.8. Samples: 1206774822. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:08,778][25689] Avg episode reward: [(0, '0.724')] [2022-07-11 11:48:09,995][26022] Updated weights on worker 0-0, policy_version 1178501 (0.00072) [2022-07-11 11:48:11,958][26022] Updated weights on worker 0-0, policy_version 1178511 (0.00058) [2022-07-11 11:48:13,779][25689] Fps is (10 sec: 5512.3, 60 sec: 5567.4, 300 sec: 5574.3). Total num frames: 1206805504. Throughput: 0: 5754.6. Samples: 1206808458. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:13,779][25689] Avg episode reward: [(0, '0.957')] [2022-07-11 11:48:13,784][26022] Updated weights on worker 0-0, policy_version 1178521 (0.00091) [2022-07-11 11:48:15,480][26022] Updated weights on worker 0-0, policy_version 1178531 (0.00085) [2022-07-11 11:48:17,363][26022] Updated weights on worker 0-0, policy_version 1178541 (0.00089) [2022-07-11 11:48:18,875][25689] Fps is (10 sec: 5678.9, 60 sec: 5588.2, 300 sec: 5576.1). Total num frames: 1206834176. Throughput: 0: 5734.9. Samples: 1206842120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:18,875][25689] Avg episode reward: [(0, '0.205')] [2022-07-11 11:48:19,228][26022] Updated weights on worker 0-0, policy_version 1178551 (0.00080) [2022-07-11 11:48:21,038][26022] Updated weights on worker 0-0, policy_version 1178561 (0.00083) [2022-07-11 11:48:23,093][26022] Updated weights on worker 0-0, policy_version 1178571 (0.00090) [2022-07-11 11:48:23,879][25689] Fps is (10 sec: 5576.4, 60 sec: 5572.6, 300 sec: 5572.7). Total num frames: 1206861824. Throughput: 0: 5012.9. Samples: 1206859018. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:23,879][25689] Avg episode reward: [(0, '0.016')] [2022-07-11 11:48:24,583][26022] Updated weights on worker 0-0, policy_version 1178581 (0.00094) [2022-07-11 11:48:26,690][26022] Updated weights on worker 0-0, policy_version 1178591 (0.00084) [2022-07-11 11:48:28,229][26022] Updated weights on worker 0-0, policy_version 1178601 (0.00082) [2022-07-11 11:48:28,899][25689] Fps is (10 sec: 5618.6, 60 sec: 5571.8, 300 sec: 5573.3). Total num frames: 1206890496. Throughput: 0: 5841.9. Samples: 1206892370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:28,899][25689] Avg episode reward: [(0, '-0.395')] [2022-07-11 11:48:30,411][26022] Updated weights on worker 0-0, policy_version 1178611 (0.00085) [2022-07-11 11:48:31,800][26022] Updated weights on worker 0-0, policy_version 1178621 (0.00092) [2022-07-11 11:48:33,937][25689] Fps is (10 sec: 5599.5, 60 sec: 5589.7, 300 sec: 5574.7). Total num frames: 1206918144. Throughput: 0: 5842.4. Samples: 1206926228. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:33,937][26022] Updated weights on worker 0-0, policy_version 1178631 (0.00080) [2022-07-11 11:48:33,937][25689] Avg episode reward: [(0, '-0.291')] [2022-07-11 11:48:35,657][26022] Updated weights on worker 0-0, policy_version 1178641 (0.00093) [2022-07-11 11:48:37,487][26022] Updated weights on worker 0-0, policy_version 1178651 (0.00087) [2022-07-11 11:48:39,006][25689] Fps is (10 sec: 5673.5, 60 sec: 5589.7, 300 sec: 5580.5). Total num frames: 1206947840. Throughput: 0: 5008.9. Samples: 1206942954. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2022-07-11 11:48:39,006][25689] Avg episode reward: [(0, '0.543')] [2022-07-11 11:48:39,170][26022] Updated weights on worker 0-0, policy_version 1178661 (0.00082) [2022-07-11 11:48:41,048][26022] Updated weights on worker 0-0, policy_version 1178671 (0.00089) [2022-07-11 11:48:43,129][26022] Updated weights on worker 0-0, policy_version 1178681 (0.00096) [2022-07-11 11:48:44,071][25689] Fps is (10 sec: 5658.6, 60 sec: 5585.6, 300 sec: 5569.4). Total num frames: 1206975488. Throughput: 0: 5829.0. Samples: 1206976718. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:48:44,071][25689] Avg episode reward: [(0, '0.494')] [2022-07-11 11:48:44,895][26022] Updated weights on worker 0-0, policy_version 1178691 (0.00087) [2022-07-11 11:48:46,312][26022] Updated weights on worker 0-0, policy_version 1178701 (0.00089) [2022-07-11 11:48:48,632][26022] Updated weights on worker 0-0, policy_version 1178711 (0.00105) [2022-07-11 11:48:49,075][25689] Fps is (10 sec: 5491.9, 60 sec: 5574.8, 300 sec: 5576.6). Total num frames: 1207003136. Throughput: 0: 5878.7. Samples: 1207010978. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:48:49,075][25689] Avg episode reward: [(0, '0.501')] [2022-07-11 11:48:49,911][26022] Updated weights on worker 0-0, policy_version 1178721 (0.00085) [2022-07-11 11:48:52,060][26022] Updated weights on worker 0-0, policy_version 1178731 (0.00089) [2022-07-11 11:48:53,666][26022] Updated weights on worker 0-0, policy_version 1178741 (0.00083) [2022-07-11 11:48:54,095][25689] Fps is (10 sec: 5516.1, 60 sec: 5557.2, 300 sec: 5574.2). Total num frames: 1207030784. Throughput: 0: 5030.6. Samples: 1207027636. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:48:54,096][25689] Avg episode reward: [(0, '-0.150')] [2022-07-11 11:48:55,625][26022] Updated weights on worker 0-0, policy_version 1178751 (0.00091) [2022-07-11 11:48:57,563][26022] Updated weights on worker 0-0, policy_version 1178761 (0.00085) [2022-07-11 11:48:59,202][25689] Fps is (10 sec: 5763.4, 60 sec: 5588.1, 300 sec: 5580.2). Total num frames: 1207061504. Throughput: 0: 5877.7. Samples: 1207061660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:48:59,203][25689] Avg episode reward: [(0, '-0.065')] [2022-07-11 11:48:59,212][26022] Updated weights on worker 0-0, policy_version 1178771 (0.00085) [2022-07-11 11:49:01,023][26022] Updated weights on worker 0-0, policy_version 1178781 (0.00083) [2022-07-11 11:49:03,318][26022] Updated weights on worker 0-0, policy_version 1178791 (0.00092) [2022-07-11 11:49:04,221][25689] Fps is (10 sec: 5663.0, 60 sec: 5587.9, 300 sec: 5584.5). Total num frames: 1207088128. Throughput: 0: 5799.1. Samples: 1207093574. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:04,227][25689] Avg episode reward: [(0, '-0.049')] [2022-07-11 11:49:05,007][26022] Updated weights on worker 0-0, policy_version 1178801 (0.00095) [2022-07-11 11:49:07,036][26022] Updated weights on worker 0-0, policy_version 1178811 (0.00088) [2022-07-11 11:49:08,844][26022] Updated weights on worker 0-0, policy_version 1178821 (0.00085) [2022-07-11 11:49:09,263][25689] Fps is (10 sec: 5191.0, 60 sec: 5569.9, 300 sec: 5570.3). Total num frames: 1207113728. Throughput: 0: 4918.8. Samples: 1207110278. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:09,263][25689] Avg episode reward: [(0, '-0.414')] [2022-07-11 11:49:10,708][26022] Updated weights on worker 0-0, policy_version 1178831 (0.00090) [2022-07-11 11:49:12,538][26022] Updated weights on worker 0-0, policy_version 1178841 (0.00091) [2022-07-11 11:49:14,251][26022] Updated weights on worker 0-0, policy_version 1178851 (0.00084) [2022-07-11 11:49:14,350][25689] Fps is (10 sec: 5459.4, 60 sec: 5578.9, 300 sec: 5581.1). Total num frames: 1207143424. Throughput: 0: 5733.1. Samples: 1207143758. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:14,350][25689] Avg episode reward: [(0, '-0.463')] [2022-07-11 11:49:16,095][26022] Updated weights on worker 0-0, policy_version 1178861 (0.00083) [2022-07-11 11:49:18,078][26022] Updated weights on worker 0-0, policy_version 1178871 (0.00085) [2022-07-11 11:49:19,397][25689] Fps is (10 sec: 5658.4, 60 sec: 5566.5, 300 sec: 5574.5). Total num frames: 1207171072. Throughput: 0: 5727.1. Samples: 1207177318. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:19,397][25689] Avg episode reward: [(0, '0.390')] [2022-07-11 11:49:19,609][26022] Updated weights on worker 0-0, policy_version 1178881 (0.00094) [2022-07-11 11:49:21,822][26022] Updated weights on worker 0-0, policy_version 1178891 (0.00086) [2022-07-11 11:49:23,278][26022] Updated weights on worker 0-0, policy_version 1178901 (0.00084) [2022-07-11 11:49:24,410][25689] Fps is (10 sec: 5496.4, 60 sec: 5565.6, 300 sec: 5571.6). Total num frames: 1207198720. Throughput: 0: 4980.3. Samples: 1207194120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:24,414][25689] Avg episode reward: [(0, '0.744')] [2022-07-11 11:49:25,353][26022] Updated weights on worker 0-0, policy_version 1178911 (0.00094) [2022-07-11 11:49:27,250][26022] Updated weights on worker 0-0, policy_version 1178921 (0.00065) [2022-07-11 11:49:29,036][26022] Updated weights on worker 0-0, policy_version 1178931 (0.00083) [2022-07-11 11:49:29,501][25689] Fps is (10 sec: 5675.4, 60 sec: 5576.0, 300 sec: 5581.2). Total num frames: 1207228416. Throughput: 0: 5781.4. Samples: 1207227284. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:29,501][25689] Avg episode reward: [(0, '0.869')] [2022-07-11 11:49:30,972][26022] Updated weights on worker 0-0, policy_version 1178941 (0.00082) [2022-07-11 11:49:32,778][26022] Updated weights on worker 0-0, policy_version 1178951 (0.00094) [2022-07-11 11:49:34,451][26022] Updated weights on worker 0-0, policy_version 1178961 (0.00083) [2022-07-11 11:49:34,523][25689] Fps is (10 sec: 5771.6, 60 sec: 5594.4, 300 sec: 5582.9). Total num frames: 1207257088. Throughput: 0: 5819.0. Samples: 1207261146. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:34,524][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 11:49:36,447][26022] Updated weights on worker 0-0, policy_version 1178971 (0.00086) [2022-07-11 11:49:38,223][26022] Updated weights on worker 0-0, policy_version 1178981 (0.00086) [2022-07-11 11:49:39,564][25689] Fps is (10 sec: 5495.1, 60 sec: 5546.3, 300 sec: 5572.3). Total num frames: 1207283712. Throughput: 0: 4978.8. Samples: 1207277724. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:39,564][25689] Avg episode reward: [(0, '0.468')] [2022-07-11 11:49:39,930][26022] Updated weights on worker 0-0, policy_version 1178991 (0.00095) [2022-07-11 11:49:41,933][26022] Updated weights on worker 0-0, policy_version 1179001 (0.00085) [2022-07-11 11:49:43,592][26022] Updated weights on worker 0-0, policy_version 1179011 (0.00093) [2022-07-11 11:49:44,597][25689] Fps is (10 sec: 5387.3, 60 sec: 5549.2, 300 sec: 5571.9). Total num frames: 1207311360. Throughput: 0: 5807.2. Samples: 1207311350. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:44,598][25689] Avg episode reward: [(0, '0.569')] [2022-07-11 11:49:45,561][26022] Updated weights on worker 0-0, policy_version 1179021 (0.00087) [2022-07-11 11:49:47,366][26022] Updated weights on worker 0-0, policy_version 1179031 (0.00088) [2022-07-11 11:49:49,236][26022] Updated weights on worker 0-0, policy_version 1179041 (0.01116) [2022-07-11 11:49:49,656][25689] Fps is (10 sec: 5580.2, 60 sec: 5561.0, 300 sec: 5571.2). Total num frames: 1207340032. Throughput: 0: 5834.5. Samples: 1207344882. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:49,657][25689] Avg episode reward: [(0, '0.657')] [2022-07-11 11:49:51,041][26022] Updated weights on worker 0-0, policy_version 1179051 (0.00084) [2022-07-11 11:49:52,828][26022] Updated weights on worker 0-0, policy_version 1179061 (0.00084) [2022-07-11 11:49:54,580][26022] Updated weights on worker 0-0, policy_version 1179071 (0.00088) [2022-07-11 11:49:54,712][25689] Fps is (10 sec: 5770.7, 60 sec: 5591.6, 300 sec: 5571.7). Total num frames: 1207369728. Throughput: 0: 4992.4. Samples: 1207361936. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:54,712][25689] Avg episode reward: [(0, '1.106')] [2022-07-11 11:49:56,478][26022] Updated weights on worker 0-0, policy_version 1179081 (0.00092) [2022-07-11 11:49:58,066][26022] Updated weights on worker 0-0, policy_version 1179091 (0.00087) [2022-07-11 11:49:59,827][25689] Fps is (10 sec: 5537.7, 60 sec: 5523.3, 300 sec: 5573.8). Total num frames: 1207396352. Throughput: 0: 5835.5. Samples: 1207395970. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:49:59,827][25689] Avg episode reward: [(0, '1.486')] [2022-07-11 11:50:00,184][26022] Updated weights on worker 0-0, policy_version 1179101 (0.00084) [2022-07-11 11:50:02,159][26022] Updated weights on worker 0-0, policy_version 1179111 (0.00112) [2022-07-11 11:50:03,814][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:50:03,823][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001179119_1207417856.pth [2022-07-11 11:50:03,823][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001177157_1205408768.pth [2022-07-11 11:50:04,082][26022] Updated weights on worker 0-0, policy_version 1179121 (0.00091) [2022-07-11 11:50:04,873][25689] Fps is (10 sec: 5340.9, 60 sec: 5537.7, 300 sec: 5569.9). Total num frames: 1207424000. Throughput: 0: 5731.3. Samples: 1207427558. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:04,874][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 11:50:05,948][26022] Updated weights on worker 0-0, policy_version 1179131 (0.00090) [2022-07-11 11:50:08,024][26022] Updated weights on worker 0-0, policy_version 1179141 (0.00100) [2022-07-11 11:50:09,676][26022] Updated weights on worker 0-0, policy_version 1179151 (0.00085) [2022-07-11 11:50:09,935][25689] Fps is (10 sec: 5571.9, 60 sec: 5586.5, 300 sec: 5572.5). Total num frames: 1207452672. Throughput: 0: 5710.0. Samples: 1207460672. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:09,935][25689] Avg episode reward: [(0, '-0.794')] [2022-07-11 11:50:11,720][26022] Updated weights on worker 0-0, policy_version 1179161 (0.00083) [2022-07-11 11:50:13,174][26022] Updated weights on worker 0-0, policy_version 1179171 (0.00091) [2022-07-11 11:50:14,951][25689] Fps is (10 sec: 5385.4, 60 sec: 5525.4, 300 sec: 5561.1). Total num frames: 1207478272. Throughput: 0: 5702.9. Samples: 1207477358. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:14,951][25689] Avg episode reward: [(0, '-0.570')] [2022-07-11 11:50:15,477][26022] Updated weights on worker 0-0, policy_version 1179181 (0.00091) [2022-07-11 11:50:16,799][26022] Updated weights on worker 0-0, policy_version 1179191 (0.00087) [2022-07-11 11:50:18,952][26022] Updated weights on worker 0-0, policy_version 1179201 (0.00106) [2022-07-11 11:50:20,010][25689] Fps is (10 sec: 5488.3, 60 sec: 5558.1, 300 sec: 5564.3). Total num frames: 1207507968. Throughput: 0: 5703.9. Samples: 1207511092. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:20,010][25689] Avg episode reward: [(0, '-1.622')] [2022-07-11 11:50:20,559][26022] Updated weights on worker 0-0, policy_version 1179211 (0.00084) [2022-07-11 11:50:22,462][26022] Updated weights on worker 0-0, policy_version 1179221 (0.00085) [2022-07-11 11:50:24,234][26022] Updated weights on worker 0-0, policy_version 1179231 (0.00096) [2022-07-11 11:50:25,030][25689] Fps is (10 sec: 5791.2, 60 sec: 5574.4, 300 sec: 5567.8). Total num frames: 1207536640. Throughput: 0: 5819.2. Samples: 1207544852. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:25,030][25689] Avg episode reward: [(0, '-1.630')] [2022-07-11 11:50:26,291][26022] Updated weights on worker 0-0, policy_version 1179241 (0.00080) [2022-07-11 11:50:27,824][26022] Updated weights on worker 0-0, policy_version 1179251 (0.00089) [2022-07-11 11:50:29,882][26022] Updated weights on worker 0-0, policy_version 1179261 (0.00086) [2022-07-11 11:50:30,039][25689] Fps is (10 sec: 5513.3, 60 sec: 5531.1, 300 sec: 5564.8). Total num frames: 1207563264. Throughput: 0: 5025.8. Samples: 1207561714. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:30,040][25689] Avg episode reward: [(0, '-1.909')] [2022-07-11 11:50:31,406][26022] Updated weights on worker 0-0, policy_version 1179271 (0.00084) [2022-07-11 11:50:33,530][26022] Updated weights on worker 0-0, policy_version 1179281 (0.00097) [2022-07-11 11:50:35,071][25689] Fps is (10 sec: 5507.0, 60 sec: 5530.3, 300 sec: 5565.0). Total num frames: 1207591936. Throughput: 0: 5859.6. Samples: 1207595252. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:35,071][25689] Avg episode reward: [(0, '-1.572')] [2022-07-11 11:50:35,409][26022] Updated weights on worker 0-0, policy_version 1179291 (0.00086) [2022-07-11 11:50:37,076][26022] Updated weights on worker 0-0, policy_version 1179301 (0.00089) [2022-07-11 11:50:39,114][26022] Updated weights on worker 0-0, policy_version 1179311 (0.00088) [2022-07-11 11:50:40,208][25689] Fps is (10 sec: 5538.7, 60 sec: 5538.4, 300 sec: 5559.1). Total num frames: 1207619584. Throughput: 0: 5825.7. Samples: 1207628760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:40,208][25689] Avg episode reward: [(0, '0.205')] [2022-07-11 11:50:40,800][26022] Updated weights on worker 0-0, policy_version 1179321 (0.00092) [2022-07-11 11:50:42,622][26022] Updated weights on worker 0-0, policy_version 1179331 (0.00090) [2022-07-11 11:50:44,658][26022] Updated weights on worker 0-0, policy_version 1179341 (0.00053) [2022-07-11 11:50:45,219][25689] Fps is (10 sec: 5549.8, 60 sec: 5557.4, 300 sec: 5566.8). Total num frames: 1207648256. Throughput: 0: 4985.8. Samples: 1207645510. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:45,219][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 11:50:46,266][26022] Updated weights on worker 0-0, policy_version 1179351 (0.00092) [2022-07-11 11:50:48,127][26022] Updated weights on worker 0-0, policy_version 1179361 (0.00085) [2022-07-11 11:50:50,114][26022] Updated weights on worker 0-0, policy_version 1179371 (0.00096) [2022-07-11 11:50:50,244][25689] Fps is (10 sec: 5611.6, 60 sec: 5543.6, 300 sec: 5559.6). Total num frames: 1207675904. Throughput: 0: 5822.9. Samples: 1207679364. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:50,245][25689] Avg episode reward: [(0, '0.273')] [2022-07-11 11:50:51,690][26022] Updated weights on worker 0-0, policy_version 1179381 (0.00083) [2022-07-11 11:50:53,722][26022] Updated weights on worker 0-0, policy_version 1179391 (0.00087) [2022-07-11 11:50:55,256][25689] Fps is (10 sec: 5713.3, 60 sec: 5547.6, 300 sec: 5565.5). Total num frames: 1207705600. Throughput: 0: 5835.8. Samples: 1207713048. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:50:55,256][25689] Avg episode reward: [(0, '0.262')] [2022-07-11 11:50:55,327][26022] Updated weights on worker 0-0, policy_version 1179401 (0.00091) [2022-07-11 11:50:57,275][26022] Updated weights on worker 0-0, policy_version 1179411 (0.00089) [2022-07-11 11:50:59,089][26022] Updated weights on worker 0-0, policy_version 1179421 (0.00090) [2022-07-11 11:51:00,355][25689] Fps is (10 sec: 5772.8, 60 sec: 5582.8, 300 sec: 5577.9). Total num frames: 1207734272. Throughput: 0: 5014.1. Samples: 1207729780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:00,357][25689] Avg episode reward: [(0, '0.310')] [2022-07-11 11:51:00,874][26022] Updated weights on worker 0-0, policy_version 1179431 (0.00086) [2022-07-11 11:51:03,206][26022] Updated weights on worker 0-0, policy_version 1179441 (0.00086) [2022-07-11 11:51:04,823][26022] Updated weights on worker 0-0, policy_version 1179451 (0.00082) [2022-07-11 11:51:05,455][25689] Fps is (10 sec: 5321.4, 60 sec: 5544.2, 300 sec: 5566.0). Total num frames: 1207759872. Throughput: 0: 5728.4. Samples: 1207761428. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:05,455][25689] Avg episode reward: [(0, '1.104')] [2022-07-11 11:51:06,968][26022] Updated weights on worker 0-0, policy_version 1179461 (0.00090) [2022-07-11 11:51:08,612][26022] Updated weights on worker 0-0, policy_version 1179471 (0.00086) [2022-07-11 11:51:10,456][25689] Fps is (10 sec: 5373.2, 60 sec: 5549.7, 300 sec: 5569.6). Total num frames: 1207788544. Throughput: 0: 5715.6. Samples: 1207794884. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:10,456][25689] Avg episode reward: [(0, '0.827')] [2022-07-11 11:51:10,456][26022] Updated weights on worker 0-0, policy_version 1179481 (0.00075) [2022-07-11 11:51:12,310][26022] Updated weights on worker 0-0, policy_version 1179491 (0.00094) [2022-07-11 11:51:14,289][26022] Updated weights on worker 0-0, policy_version 1179501 (0.00091) [2022-07-11 11:51:15,465][25689] Fps is (10 sec: 5626.0, 60 sec: 5584.1, 300 sec: 5571.7). Total num frames: 1207816192. Throughput: 0: 4872.9. Samples: 1207811526. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:15,467][25689] Avg episode reward: [(0, '-0.758')] [2022-07-11 11:51:16,080][26022] Updated weights on worker 0-0, policy_version 1179511 (0.00090) [2022-07-11 11:51:17,843][26022] Updated weights on worker 0-0, policy_version 1179521 (0.00079) [2022-07-11 11:51:19,892][26022] Updated weights on worker 0-0, policy_version 1179531 (0.00086) [2022-07-11 11:51:20,609][25689] Fps is (10 sec: 5445.9, 60 sec: 5542.5, 300 sec: 5563.0). Total num frames: 1207843840. Throughput: 0: 5696.4. Samples: 1207845156. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:20,610][25689] Avg episode reward: [(0, '-1.174')] [2022-07-11 11:51:21,384][26022] Updated weights on worker 0-0, policy_version 1179541 (0.00093) [2022-07-11 11:51:23,502][26022] Updated weights on worker 0-0, policy_version 1179551 (0.00084) [2022-07-11 11:51:25,070][26022] Updated weights on worker 0-0, policy_version 1179561 (0.00074) [2022-07-11 11:51:25,618][25689] Fps is (10 sec: 5648.4, 60 sec: 5560.5, 300 sec: 5567.2). Total num frames: 1207873536. Throughput: 0: 5831.7. Samples: 1207879012. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:25,618][25689] Avg episode reward: [(0, '-0.862')] [2022-07-11 11:51:26,922][26022] Updated weights on worker 0-0, policy_version 1179571 (0.00086) [2022-07-11 11:51:28,643][26022] Updated weights on worker 0-0, policy_version 1179581 (0.00086) [2022-07-11 11:51:30,486][26022] Updated weights on worker 0-0, policy_version 1179591 (0.00092) [2022-07-11 11:51:30,679][25689] Fps is (10 sec: 5694.8, 60 sec: 5572.6, 300 sec: 5570.1). Total num frames: 1207901184. Throughput: 0: 4998.3. Samples: 1207895966. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:30,679][25689] Avg episode reward: [(0, '-2.417')] [2022-07-11 11:51:32,439][26022] Updated weights on worker 0-0, policy_version 1179601 (0.00084) [2022-07-11 11:51:34,191][26022] Updated weights on worker 0-0, policy_version 1179611 (0.00089) [2022-07-11 11:51:35,699][25689] Fps is (10 sec: 5485.0, 60 sec: 5556.8, 300 sec: 5563.9). Total num frames: 1207928832. Throughput: 0: 5845.5. Samples: 1207929802. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:35,699][25689] Avg episode reward: [(0, '-3.037')] [2022-07-11 11:51:36,079][26022] Updated weights on worker 0-0, policy_version 1179621 (0.00087) [2022-07-11 11:51:37,858][26022] Updated weights on worker 0-0, policy_version 1179631 (0.00080) [2022-07-11 11:51:39,785][26022] Updated weights on worker 0-0, policy_version 1179641 (0.00101) [2022-07-11 11:51:40,841][25689] Fps is (10 sec: 5643.1, 60 sec: 5590.1, 300 sec: 5564.7). Total num frames: 1207958528. Throughput: 0: 5838.5. Samples: 1207963278. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:40,841][25689] Avg episode reward: [(0, '-2.885')] [2022-07-11 11:51:41,575][26022] Updated weights on worker 0-0, policy_version 1179651 (0.00083) [2022-07-11 11:51:43,348][26022] Updated weights on worker 0-0, policy_version 1179661 (0.00089) [2022-07-11 11:51:45,066][26022] Updated weights on worker 0-0, policy_version 1179671 (0.00087) [2022-07-11 11:51:45,905][25689] Fps is (10 sec: 5718.8, 60 sec: 5585.2, 300 sec: 5567.1). Total num frames: 1207987200. Throughput: 0: 4996.9. Samples: 1207980388. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:45,906][25689] Avg episode reward: [(0, '-2.400')] [2022-07-11 11:51:47,091][26022] Updated weights on worker 0-0, policy_version 1179681 (0.00092) [2022-07-11 11:51:48,660][26022] Updated weights on worker 0-0, policy_version 1179691 (0.00085) [2022-07-11 11:51:50,638][26022] Updated weights on worker 0-0, policy_version 1179701 (0.00090) [2022-07-11 11:51:50,919][25689] Fps is (10 sec: 5588.1, 60 sec: 5586.2, 300 sec: 5567.4). Total num frames: 1208014848. Throughput: 0: 5857.2. Samples: 1208014518. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:50,920][25689] Avg episode reward: [(0, '-1.435')] [2022-07-11 11:51:52,099][26022] Updated weights on worker 0-0, policy_version 1179711 (0.00090) [2022-07-11 11:51:54,418][26022] Updated weights on worker 0-0, policy_version 1179721 (0.00104) [2022-07-11 11:51:55,788][26022] Updated weights on worker 0-0, policy_version 1179731 (0.00085) [2022-07-11 11:51:55,951][25689] Fps is (10 sec: 5810.3, 60 sec: 5601.2, 300 sec: 5574.6). Total num frames: 1208045568. Throughput: 0: 5856.7. Samples: 1208048412. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:51:55,951][25689] Avg episode reward: [(0, '-0.492')] [2022-07-11 11:51:57,832][26022] Updated weights on worker 0-0, policy_version 1179741 (0.00091) [2022-07-11 11:51:59,651][26022] Updated weights on worker 0-0, policy_version 1179751 (0.00090) [2022-07-11 11:52:01,035][25689] Fps is (10 sec: 5770.0, 60 sec: 5585.8, 300 sec: 5580.4). Total num frames: 1208073216. Throughput: 0: 5903.4. Samples: 1208082494. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:01,035][25689] Avg episode reward: [(0, '-0.043')] [2022-07-11 11:52:01,328][26022] Updated weights on worker 0-0, policy_version 1179761 (0.00088) [2022-07-11 11:52:03,701][26022] Updated weights on worker 0-0, policy_version 1179771 (0.00085) [2022-07-11 11:52:03,876][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:52:03,893][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001179772_1208086528.pth [2022-07-11 11:52:03,893][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001177813_1206080512.pth [2022-07-11 11:52:05,368][26022] Updated weights on worker 0-0, policy_version 1179781 (0.00093) [2022-07-11 11:52:06,075][25689] Fps is (10 sec: 5259.6, 60 sec: 5591.3, 300 sec: 5573.2). Total num frames: 1208098816. Throughput: 0: 5797.5. Samples: 1208097322. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:06,075][25689] Avg episode reward: [(0, '0.672')] [2022-07-11 11:52:07,266][26022] Updated weights on worker 0-0, policy_version 1179791 (0.00089) [2022-07-11 11:52:09,024][26022] Updated weights on worker 0-0, policy_version 1179801 (0.00083) [2022-07-11 11:52:10,982][26022] Updated weights on worker 0-0, policy_version 1179811 (0.00085) [2022-07-11 11:52:11,081][25689] Fps is (10 sec: 5402.2, 60 sec: 5590.8, 300 sec: 5570.2). Total num frames: 1208127488. Throughput: 0: 5783.5. Samples: 1208131126. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:11,082][25689] Avg episode reward: [(0, '0.482')] [2022-07-11 11:52:12,624][26022] Updated weights on worker 0-0, policy_version 1179821 (0.00091) [2022-07-11 11:52:14,552][26022] Updated weights on worker 0-0, policy_version 1179831 (0.00090) [2022-07-11 11:52:16,169][25689] Fps is (10 sec: 5681.1, 60 sec: 5600.5, 300 sec: 5574.6). Total num frames: 1208156160. Throughput: 0: 5759.3. Samples: 1208164852. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:16,170][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 11:52:16,270][26022] Updated weights on worker 0-0, policy_version 1179841 (0.00100) [2022-07-11 11:52:18,293][26022] Updated weights on worker 0-0, policy_version 1179851 (0.00086) [2022-07-11 11:52:20,040][26022] Updated weights on worker 0-0, policy_version 1179861 (0.00087) [2022-07-11 11:52:21,238][25689] Fps is (10 sec: 5544.9, 60 sec: 5607.3, 300 sec: 5570.2). Total num frames: 1208183808. Throughput: 0: 4909.2. Samples: 1208181676. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:21,240][25689] Avg episode reward: [(0, '1.172')] [2022-07-11 11:52:21,907][26022] Updated weights on worker 0-0, policy_version 1179871 (0.00085) [2022-07-11 11:52:23,550][26022] Updated weights on worker 0-0, policy_version 1179881 (0.00089) [2022-07-11 11:52:25,473][26022] Updated weights on worker 0-0, policy_version 1179891 (0.00086) [2022-07-11 11:52:26,254][25689] Fps is (10 sec: 5584.5, 60 sec: 5589.8, 300 sec: 5570.1). Total num frames: 1208212480. Throughput: 0: 5860.8. Samples: 1208215586. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:26,255][25689] Avg episode reward: [(0, '1.267')] [2022-07-11 11:52:27,230][26022] Updated weights on worker 0-0, policy_version 1179901 (0.00093) [2022-07-11 11:52:29,052][26022] Updated weights on worker 0-0, policy_version 1179911 (0.00084) [2022-07-11 11:52:30,832][26022] Updated weights on worker 0-0, policy_version 1179921 (0.00083) [2022-07-11 11:52:31,263][25689] Fps is (10 sec: 5720.4, 60 sec: 5611.5, 300 sec: 5577.7). Total num frames: 1208241152. Throughput: 0: 5863.8. Samples: 1208249466. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:31,263][25689] Avg episode reward: [(0, '1.429')] [2022-07-11 11:52:32,659][26022] Updated weights on worker 0-0, policy_version 1179931 (0.00095) [2022-07-11 11:52:34,305][26022] Updated weights on worker 0-0, policy_version 1179941 (0.00087) [2022-07-11 11:52:36,283][25689] Fps is (10 sec: 5717.8, 60 sec: 5628.4, 300 sec: 5575.2). Total num frames: 1208269824. Throughput: 0: 5057.9. Samples: 1208266588. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2022-07-11 11:52:36,284][25689] Avg episode reward: [(0, '1.538')] [2022-07-11 11:52:36,287][26022] Updated weights on worker 0-0, policy_version 1179951 (0.00294) [2022-07-11 11:52:38,126][26022] Updated weights on worker 0-0, policy_version 1179961 (0.00080) [2022-07-11 11:52:39,885][26022] Updated weights on worker 0-0, policy_version 1179971 (0.00082) [2022-07-11 11:52:41,331][25689] Fps is (10 sec: 5492.3, 60 sec: 5586.4, 300 sec: 5571.2). Total num frames: 1208296448. Throughput: 0: 5908.3. Samples: 1208300388. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:52:41,331][25689] Avg episode reward: [(0, '1.249')] [2022-07-11 11:52:41,934][26022] Updated weights on worker 0-0, policy_version 1179981 (0.00084) [2022-07-11 11:52:43,521][26022] Updated weights on worker 0-0, policy_version 1179991 (0.00085) [2022-07-11 11:52:45,432][26022] Updated weights on worker 0-0, policy_version 1180001 (0.00088) [2022-07-11 11:52:46,363][25689] Fps is (10 sec: 5587.3, 60 sec: 5606.3, 300 sec: 5575.4). Total num frames: 1208326144. Throughput: 0: 5899.3. Samples: 1208334214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:52:46,363][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 11:52:47,294][26022] Updated weights on worker 0-0, policy_version 1180011 (0.00084) [2022-07-11 11:52:49,083][26022] Updated weights on worker 0-0, policy_version 1180021 (0.00106) [2022-07-11 11:52:50,809][26022] Updated weights on worker 0-0, policy_version 1180031 (0.00855) [2022-07-11 11:52:51,366][25689] Fps is (10 sec: 5816.0, 60 sec: 5624.2, 300 sec: 5575.6). Total num frames: 1208354816. Throughput: 0: 5064.5. Samples: 1208351280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:52:51,367][25689] Avg episode reward: [(0, '0.179')] [2022-07-11 11:52:52,747][26022] Updated weights on worker 0-0, policy_version 1180041 (0.00092) [2022-07-11 11:52:54,306][26022] Updated weights on worker 0-0, policy_version 1180051 (0.00113) [2022-07-11 11:52:56,369][25689] Fps is (10 sec: 5526.0, 60 sec: 5559.1, 300 sec: 5570.0). Total num frames: 1208381440. Throughput: 0: 5898.5. Samples: 1208385068. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:52:56,370][25689] Avg episode reward: [(0, '-0.052')] [2022-07-11 11:52:56,617][26022] Updated weights on worker 0-0, policy_version 1180061 (0.00096) [2022-07-11 11:52:58,059][26022] Updated weights on worker 0-0, policy_version 1180071 (0.00089) [2022-07-11 11:53:00,159][26022] Updated weights on worker 0-0, policy_version 1180081 (0.00092) [2022-07-11 11:53:01,415][25689] Fps is (10 sec: 5604.5, 60 sec: 5596.5, 300 sec: 5579.8). Total num frames: 1208411136. Throughput: 0: 5898.7. Samples: 1208418862. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:01,416][25689] Avg episode reward: [(0, '0.100')] [2022-07-11 11:53:01,629][26022] Updated weights on worker 0-0, policy_version 1180091 (0.00093) [2022-07-11 11:53:04,175][26022] Updated weights on worker 0-0, policy_version 1180101 (0.00085) [2022-07-11 11:53:05,746][26022] Updated weights on worker 0-0, policy_version 1180111 (0.00089) [2022-07-11 11:53:06,438][25689] Fps is (10 sec: 5390.1, 60 sec: 5581.1, 300 sec: 5573.1). Total num frames: 1208435712. Throughput: 0: 4948.0. Samples: 1208433548. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:06,439][25689] Avg episode reward: [(0, '0.503')] [2022-07-11 11:53:07,678][26022] Updated weights on worker 0-0, policy_version 1180121 (0.00096) [2022-07-11 11:53:09,481][26022] Updated weights on worker 0-0, policy_version 1180131 (0.00085) [2022-07-11 11:53:11,330][26022] Updated weights on worker 0-0, policy_version 1180141 (0.00094) [2022-07-11 11:53:11,458][25689] Fps is (10 sec: 5302.2, 60 sec: 5579.9, 300 sec: 5572.7). Total num frames: 1208464384. Throughput: 0: 5766.7. Samples: 1208467144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:11,459][25689] Avg episode reward: [(0, '0.738')] [2022-07-11 11:53:13,106][26022] Updated weights on worker 0-0, policy_version 1180151 (0.00086) [2022-07-11 11:53:15,129][26022] Updated weights on worker 0-0, policy_version 1180161 (0.00086) [2022-07-11 11:53:16,462][25689] Fps is (10 sec: 5720.9, 60 sec: 5587.7, 300 sec: 5574.4). Total num frames: 1208493056. Throughput: 0: 5743.6. Samples: 1208500470. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:16,462][25689] Avg episode reward: [(0, '0.844')] [2022-07-11 11:53:16,827][26022] Updated weights on worker 0-0, policy_version 1180171 (0.00092) [2022-07-11 11:53:18,947][26022] Updated weights on worker 0-0, policy_version 1180181 (0.00084) [2022-07-11 11:53:20,495][26022] Updated weights on worker 0-0, policy_version 1180191 (0.00092) [2022-07-11 11:53:21,524][25689] Fps is (10 sec: 5595.3, 60 sec: 5588.4, 300 sec: 5573.4). Total num frames: 1208520704. Throughput: 0: 4884.4. Samples: 1208517078. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:21,524][25689] Avg episode reward: [(0, '1.637')] [2022-07-11 11:53:22,412][26022] Updated weights on worker 0-0, policy_version 1180201 (0.00059) [2022-07-11 11:53:24,368][26022] Updated weights on worker 0-0, policy_version 1180211 (0.00085) [2022-07-11 11:53:25,921][26022] Updated weights on worker 0-0, policy_version 1180221 (0.00097) [2022-07-11 11:53:26,543][25689] Fps is (10 sec: 5383.6, 60 sec: 5554.1, 300 sec: 5566.5). Total num frames: 1208547328. Throughput: 0: 5821.2. Samples: 1208550580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:26,543][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 11:53:27,927][26022] Updated weights on worker 0-0, policy_version 1180231 (0.00086) [2022-07-11 11:53:29,767][26022] Updated weights on worker 0-0, policy_version 1180241 (0.00089) [2022-07-11 11:53:31,539][26022] Updated weights on worker 0-0, policy_version 1180251 (0.00090) [2022-07-11 11:53:31,551][25689] Fps is (10 sec: 5616.4, 60 sec: 5571.1, 300 sec: 5573.9). Total num frames: 1208577024. Throughput: 0: 5824.0. Samples: 1208584166. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:31,552][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 11:53:33,459][26022] Updated weights on worker 0-0, policy_version 1180261 (0.00090) [2022-07-11 11:53:35,183][26022] Updated weights on worker 0-0, policy_version 1180271 (0.00087) [2022-07-11 11:53:36,571][25689] Fps is (10 sec: 5718.4, 60 sec: 5554.1, 300 sec: 5568.0). Total num frames: 1208604672. Throughput: 0: 5007.4. Samples: 1208601164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:36,571][25689] Avg episode reward: [(0, '0.233')] [2022-07-11 11:53:37,029][26022] Updated weights on worker 0-0, policy_version 1180281 (0.00080) [2022-07-11 11:53:38,869][26022] Updated weights on worker 0-0, policy_version 1180291 (0.00092) [2022-07-11 11:53:40,462][26022] Updated weights on worker 0-0, policy_version 1180301 (0.00088) [2022-07-11 11:53:41,630][25689] Fps is (10 sec: 5384.8, 60 sec: 5553.1, 300 sec: 5564.6). Total num frames: 1208631296. Throughput: 0: 5866.1. Samples: 1208635024. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:41,631][25689] Avg episode reward: [(0, '-0.510')] [2022-07-11 11:53:42,372][26022] Updated weights on worker 0-0, policy_version 1180311 (0.00094) [2022-07-11 11:53:44,615][26022] Updated weights on worker 0-0, policy_version 1180321 (0.00086) [2022-07-11 11:53:46,077][26022] Updated weights on worker 0-0, policy_version 1180331 (0.00091) [2022-07-11 11:53:46,653][25689] Fps is (10 sec: 5687.5, 60 sec: 5570.9, 300 sec: 5574.6). Total num frames: 1208662016. Throughput: 0: 5860.4. Samples: 1208668434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:46,654][25689] Avg episode reward: [(0, '-1.229')] [2022-07-11 11:53:48,251][26022] Updated weights on worker 0-0, policy_version 1180341 (0.00088) [2022-07-11 11:53:49,556][26022] Updated weights on worker 0-0, policy_version 1180351 (0.00049) [2022-07-11 11:53:51,666][25689] Fps is (10 sec: 5713.7, 60 sec: 5536.1, 300 sec: 5571.3). Total num frames: 1208688640. Throughput: 0: 5033.2. Samples: 1208685408. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:51,667][25689] Avg episode reward: [(0, '-0.605')] [2022-07-11 11:53:51,926][26022] Updated weights on worker 0-0, policy_version 1180361 (0.00084) [2022-07-11 11:53:53,434][26022] Updated weights on worker 0-0, policy_version 1180371 (0.00088) [2022-07-11 11:53:55,429][26022] Updated weights on worker 0-0, policy_version 1180381 (0.00051) [2022-07-11 11:53:56,681][25689] Fps is (10 sec: 5616.6, 60 sec: 5585.9, 300 sec: 5569.6). Total num frames: 1208718336. Throughput: 0: 5858.5. Samples: 1208718978. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:53:56,681][25689] Avg episode reward: [(0, '-0.759')] [2022-07-11 11:53:57,239][26022] Updated weights on worker 0-0, policy_version 1180391 (0.00082) [2022-07-11 11:53:59,205][26022] Updated weights on worker 0-0, policy_version 1180401 (0.00085) [2022-07-11 11:54:00,729][26022] Updated weights on worker 0-0, policy_version 1180411 (0.00085) [2022-07-11 11:54:01,771][25689] Fps is (10 sec: 5573.8, 60 sec: 5531.0, 300 sec: 5568.3). Total num frames: 1208744960. Throughput: 0: 5854.1. Samples: 1208752930. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:01,771][25689] Avg episode reward: [(0, '-1.058')] [2022-07-11 11:54:02,898][26022] Updated weights on worker 0-0, policy_version 1180421 (0.00086) [2022-07-11 11:54:03,958][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:54:03,973][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001180427_1208757248.pth [2022-07-11 11:54:03,974][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001178467_1206750208.pth [2022-07-11 11:54:04,760][26022] Updated weights on worker 0-0, policy_version 1180431 (0.00099) [2022-07-11 11:54:06,600][26022] Updated weights on worker 0-0, policy_version 1180441 (0.00097) [2022-07-11 11:54:06,786][25689] Fps is (10 sec: 5370.5, 60 sec: 5582.5, 300 sec: 5575.6). Total num frames: 1208772608. Throughput: 0: 4936.8. Samples: 1208767828. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:06,787][25689] Avg episode reward: [(0, '-0.384')] [2022-07-11 11:54:08,629][26022] Updated weights on worker 0-0, policy_version 1180451 (0.00095) [2022-07-11 11:54:10,238][26022] Updated weights on worker 0-0, policy_version 1180461 (0.00085) [2022-07-11 11:54:11,799][25689] Fps is (10 sec: 5412.0, 60 sec: 5549.3, 300 sec: 5566.7). Total num frames: 1208799232. Throughput: 0: 5771.8. Samples: 1208801608. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:11,799][25689] Avg episode reward: [(0, '-0.560')] [2022-07-11 11:54:12,224][26022] Updated weights on worker 0-0, policy_version 1180471 (0.00092) [2022-07-11 11:54:13,718][26022] Updated weights on worker 0-0, policy_version 1180481 (0.00091) [2022-07-11 11:54:15,806][26022] Updated weights on worker 0-0, policy_version 1180491 (0.00087) [2022-07-11 11:54:16,813][25689] Fps is (10 sec: 5616.8, 60 sec: 5565.2, 300 sec: 5574.2). Total num frames: 1208828928. Throughput: 0: 5762.1. Samples: 1208834986. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:16,814][25689] Avg episode reward: [(0, '0.107')] [2022-07-11 11:54:17,504][26022] Updated weights on worker 0-0, policy_version 1180501 (0.00086) [2022-07-11 11:54:19,512][26022] Updated weights on worker 0-0, policy_version 1180511 (0.00084) [2022-07-11 11:54:21,215][26022] Updated weights on worker 0-0, policy_version 1180521 (0.00092) [2022-07-11 11:54:21,942][25689] Fps is (10 sec: 5552.4, 60 sec: 5542.2, 300 sec: 5568.6). Total num frames: 1208855552. Throughput: 0: 4903.1. Samples: 1208851832. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:21,944][25689] Avg episode reward: [(0, '-0.536')] [2022-07-11 11:54:23,083][26022] Updated weights on worker 0-0, policy_version 1180531 (0.00096) [2022-07-11 11:54:24,999][26022] Updated weights on worker 0-0, policy_version 1180541 (0.00080) [2022-07-11 11:54:26,769][26022] Updated weights on worker 0-0, policy_version 1180551 (0.00089) [2022-07-11 11:54:26,949][25689] Fps is (10 sec: 5455.5, 60 sec: 5577.1, 300 sec: 5566.7). Total num frames: 1208884224. Throughput: 0: 5836.5. Samples: 1208885510. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:26,951][25689] Avg episode reward: [(0, '-1.148')] [2022-07-11 11:54:28,653][26022] Updated weights on worker 0-0, policy_version 1180561 (0.00090) [2022-07-11 11:54:30,381][26022] Updated weights on worker 0-0, policy_version 1180571 (0.00086) [2022-07-11 11:54:32,015][25689] Fps is (10 sec: 5692.8, 60 sec: 5554.9, 300 sec: 5565.9). Total num frames: 1208912896. Throughput: 0: 5817.4. Samples: 1208919214. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:32,016][25689] Avg episode reward: [(0, '-0.044')] [2022-07-11 11:54:32,284][26022] Updated weights on worker 0-0, policy_version 1180581 (0.00096) [2022-07-11 11:54:34,190][26022] Updated weights on worker 0-0, policy_version 1180591 (0.00085) [2022-07-11 11:54:35,981][26022] Updated weights on worker 0-0, policy_version 1180601 (0.00084) [2022-07-11 11:54:37,028][25689] Fps is (10 sec: 5587.8, 60 sec: 5555.5, 300 sec: 5569.8). Total num frames: 1208940544. Throughput: 0: 4998.5. Samples: 1208936030. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:37,029][25689] Avg episode reward: [(0, '-0.152')] [2022-07-11 11:54:37,752][26022] Updated weights on worker 0-0, policy_version 1180611 (0.00099) [2022-07-11 11:54:39,405][26022] Updated weights on worker 0-0, policy_version 1180621 (0.00091) [2022-07-11 11:54:41,295][26022] Updated weights on worker 0-0, policy_version 1180631 (0.00087) [2022-07-11 11:54:42,070][25689] Fps is (10 sec: 5702.8, 60 sec: 5607.9, 300 sec: 5576.6). Total num frames: 1208970240. Throughput: 0: 5869.2. Samples: 1208969970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:42,072][25689] Avg episode reward: [(0, '-0.159')] [2022-07-11 11:54:43,292][26022] Updated weights on worker 0-0, policy_version 1180641 (0.00091) [2022-07-11 11:54:45,044][26022] Updated weights on worker 0-0, policy_version 1180651 (0.00093) [2022-07-11 11:54:46,714][26022] Updated weights on worker 0-0, policy_version 1180661 (0.00087) [2022-07-11 11:54:47,094][25689] Fps is (10 sec: 5595.3, 60 sec: 5540.1, 300 sec: 5570.4). Total num frames: 1208996864. Throughput: 0: 5840.3. Samples: 1209003162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:47,096][25689] Avg episode reward: [(0, '0.839')] [2022-07-11 11:54:48,723][26022] Updated weights on worker 0-0, policy_version 1180671 (0.00083) [2022-07-11 11:54:50,556][26022] Updated weights on worker 0-0, policy_version 1180681 (0.00861) [2022-07-11 11:54:52,104][25689] Fps is (10 sec: 5409.1, 60 sec: 5557.3, 300 sec: 5564.3). Total num frames: 1209024512. Throughput: 0: 5025.1. Samples: 1209020162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:52,104][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 11:54:52,419][26022] Updated weights on worker 0-0, policy_version 1180691 (0.00087) [2022-07-11 11:54:54,137][26022] Updated weights on worker 0-0, policy_version 1180701 (0.00098) [2022-07-11 11:54:56,180][26022] Updated weights on worker 0-0, policy_version 1180711 (0.00088) [2022-07-11 11:54:57,129][25689] Fps is (10 sec: 5714.4, 60 sec: 5556.3, 300 sec: 5576.3). Total num frames: 1209054208. Throughput: 0: 5861.2. Samples: 1209053844. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:54:57,129][25689] Avg episode reward: [(0, '1.497')] [2022-07-11 11:54:57,694][26022] Updated weights on worker 0-0, policy_version 1180721 (0.00084) [2022-07-11 11:54:59,828][26022] Updated weights on worker 0-0, policy_version 1180731 (0.00084) [2022-07-11 11:55:01,536][26022] Updated weights on worker 0-0, policy_version 1180741 (0.00086) [2022-07-11 11:55:02,202][25689] Fps is (10 sec: 5577.4, 60 sec: 5557.9, 300 sec: 5572.4). Total num frames: 1209080832. Throughput: 0: 5830.1. Samples: 1209087338. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:02,202][25689] Avg episode reward: [(0, '1.069')] [2022-07-11 11:55:03,660][26022] Updated weights on worker 0-0, policy_version 1180751 (0.00085) [2022-07-11 11:55:05,587][26022] Updated weights on worker 0-0, policy_version 1180761 (0.00095) [2022-07-11 11:55:07,235][25689] Fps is (10 sec: 5269.1, 60 sec: 5539.4, 300 sec: 5566.0). Total num frames: 1209107456. Throughput: 0: 4910.6. Samples: 1209102064. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:07,235][25689] Avg episode reward: [(0, '1.212')] [2022-07-11 11:55:07,477][26022] Updated weights on worker 0-0, policy_version 1180771 (0.00086) [2022-07-11 11:55:09,107][26022] Updated weights on worker 0-0, policy_version 1180781 (0.00080) [2022-07-11 11:55:11,028][26022] Updated weights on worker 0-0, policy_version 1180791 (0.00085) [2022-07-11 11:55:12,258][25689] Fps is (10 sec: 5600.4, 60 sec: 5589.2, 300 sec: 5579.7). Total num frames: 1209137152. Throughput: 0: 5748.1. Samples: 1209136012. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:12,259][25689] Avg episode reward: [(0, '1.508')] [2022-07-11 11:55:12,706][26022] Updated weights on worker 0-0, policy_version 1180801 (0.00080) [2022-07-11 11:55:14,639][26022] Updated weights on worker 0-0, policy_version 1180811 (0.00091) [2022-07-11 11:55:16,595][26022] Updated weights on worker 0-0, policy_version 1180821 (0.00084) [2022-07-11 11:55:17,262][25689] Fps is (10 sec: 5616.4, 60 sec: 5539.3, 300 sec: 5570.4). Total num frames: 1209163776. Throughput: 0: 5736.5. Samples: 1209169340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:17,263][25689] Avg episode reward: [(0, '1.333')] [2022-07-11 11:55:18,225][26022] Updated weights on worker 0-0, policy_version 1180831 (0.00086) [2022-07-11 11:55:20,250][26022] Updated weights on worker 0-0, policy_version 1180841 (0.00109) [2022-07-11 11:55:21,835][26022] Updated weights on worker 0-0, policy_version 1180851 (0.00085) [2022-07-11 11:55:22,313][25689] Fps is (10 sec: 5703.0, 60 sec: 5614.3, 300 sec: 5576.7). Total num frames: 1209194496. Throughput: 0: 4929.6. Samples: 1209186480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:22,314][25689] Avg episode reward: [(0, '0.874')] [2022-07-11 11:55:23,767][26022] Updated weights on worker 0-0, policy_version 1180861 (0.00088) [2022-07-11 11:55:25,436][26022] Updated weights on worker 0-0, policy_version 1180871 (0.00081) [2022-07-11 11:55:27,324][25689] Fps is (10 sec: 5699.2, 60 sec: 5580.1, 300 sec: 5576.7). Total num frames: 1209221120. Throughput: 0: 5882.3. Samples: 1209220236. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:27,325][25689] Avg episode reward: [(0, '0.867')] [2022-07-11 11:55:27,722][26022] Updated weights on worker 0-0, policy_version 1180881 (0.00085) [2022-07-11 11:55:29,252][26022] Updated weights on worker 0-0, policy_version 1180891 (0.00094) [2022-07-11 11:55:31,251][26022] Updated weights on worker 0-0, policy_version 1180901 (0.00088) [2022-07-11 11:55:32,331][25689] Fps is (10 sec: 5417.5, 60 sec: 5568.5, 300 sec: 5573.7). Total num frames: 1209248768. Throughput: 0: 5873.2. Samples: 1209253904. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:32,332][25689] Avg episode reward: [(0, '1.019')] [2022-07-11 11:55:32,799][26022] Updated weights on worker 0-0, policy_version 1180911 (0.00085) [2022-07-11 11:55:35,011][26022] Updated weights on worker 0-0, policy_version 1180921 (0.00095) [2022-07-11 11:55:36,354][26022] Updated weights on worker 0-0, policy_version 1180931 (0.00085) [2022-07-11 11:55:37,343][25689] Fps is (10 sec: 5621.6, 60 sec: 5585.7, 300 sec: 5579.5). Total num frames: 1209277440. Throughput: 0: 5055.6. Samples: 1209270858. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:37,343][25689] Avg episode reward: [(0, '0.906')] [2022-07-11 11:55:38,716][26022] Updated weights on worker 0-0, policy_version 1180941 (0.00752) [2022-07-11 11:55:40,067][26022] Updated weights on worker 0-0, policy_version 1180951 (0.00089) [2022-07-11 11:55:42,176][26022] Updated weights on worker 0-0, policy_version 1180961 (0.00090) [2022-07-11 11:55:42,399][25689] Fps is (10 sec: 5594.3, 60 sec: 5550.4, 300 sec: 5575.2). Total num frames: 1209305088. Throughput: 0: 5878.7. Samples: 1209304554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:42,399][25689] Avg episode reward: [(0, '0.462')] [2022-07-11 11:55:43,700][26022] Updated weights on worker 0-0, policy_version 1180971 (0.00087) [2022-07-11 11:55:46,000][26022] Updated weights on worker 0-0, policy_version 1180981 (0.00095) [2022-07-11 11:55:47,413][25689] Fps is (10 sec: 5592.4, 60 sec: 5585.2, 300 sec: 5578.8). Total num frames: 1209333760. Throughput: 0: 5859.0. Samples: 1209337938. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:47,415][25689] Avg episode reward: [(0, '-0.948')] [2022-07-11 11:55:47,498][26022] Updated weights on worker 0-0, policy_version 1180991 (0.00089) [2022-07-11 11:55:49,456][26022] Updated weights on worker 0-0, policy_version 1181001 (0.00085) [2022-07-11 11:55:51,315][26022] Updated weights on worker 0-0, policy_version 1181011 (0.00092) [2022-07-11 11:55:52,419][25689] Fps is (10 sec: 5518.4, 60 sec: 5568.6, 300 sec: 5568.6). Total num frames: 1209360384. Throughput: 0: 5007.5. Samples: 1209354492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:52,420][25689] Avg episode reward: [(0, '0.043')] [2022-07-11 11:55:53,323][26022] Updated weights on worker 0-0, policy_version 1181021 (0.00279) [2022-07-11 11:55:54,810][26022] Updated weights on worker 0-0, policy_version 1181031 (0.00104) [2022-07-11 11:55:56,799][26022] Updated weights on worker 0-0, policy_version 1181041 (0.00086) [2022-07-11 11:55:57,423][25689] Fps is (10 sec: 5421.7, 60 sec: 5536.6, 300 sec: 5566.9). Total num frames: 1209388032. Throughput: 0: 5830.6. Samples: 1209387942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:55:57,424][25689] Avg episode reward: [(0, '-0.271')] [2022-07-11 11:55:58,564][26022] Updated weights on worker 0-0, policy_version 1181051 (0.00093) [2022-07-11 11:56:00,616][26022] Updated weights on worker 0-0, policy_version 1181061 (0.00095) [2022-07-11 11:56:02,487][25689] Fps is (10 sec: 5390.5, 60 sec: 5537.4, 300 sec: 5571.1). Total num frames: 1209414656. Throughput: 0: 5817.4. Samples: 1209421416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:02,488][25689] Avg episode reward: [(0, '-0.338')] [2022-07-11 11:56:02,735][26022] Updated weights on worker 0-0, policy_version 1181071 (0.00069) [2022-07-11 11:56:04,074][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:56:04,086][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001181078_1209423872.pth [2022-07-11 11:56:04,086][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001179119_1207417856.pth [2022-07-11 11:56:04,708][26022] Updated weights on worker 0-0, policy_version 1181081 (0.00085) [2022-07-11 11:56:06,226][26022] Updated weights on worker 0-0, policy_version 1181091 (0.00095) [2022-07-11 11:56:07,498][25689] Fps is (10 sec: 5386.9, 60 sec: 5556.4, 300 sec: 5567.4). Total num frames: 1209442304. Throughput: 0: 4890.6. Samples: 1209436164. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:07,499][25689] Avg episode reward: [(0, '-0.341')] [2022-07-11 11:56:08,318][26022] Updated weights on worker 0-0, policy_version 1181101 (0.00083) [2022-07-11 11:56:10,082][26022] Updated weights on worker 0-0, policy_version 1181111 (0.00049) [2022-07-11 11:56:11,978][26022] Updated weights on worker 0-0, policy_version 1181121 (0.00083) [2022-07-11 11:56:12,507][25689] Fps is (10 sec: 5518.5, 60 sec: 5523.8, 300 sec: 5567.4). Total num frames: 1209469952. Throughput: 0: 5719.0. Samples: 1209469376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:12,508][25689] Avg episode reward: [(0, '1.496')] [2022-07-11 11:56:13,857][26022] Updated weights on worker 0-0, policy_version 1181131 (0.00088) [2022-07-11 11:56:15,693][26022] Updated weights on worker 0-0, policy_version 1181141 (0.00085) [2022-07-11 11:56:17,524][25689] Fps is (10 sec: 5515.6, 60 sec: 5539.6, 300 sec: 5569.8). Total num frames: 1209497600. Throughput: 0: 5702.7. Samples: 1209502566. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:17,524][25689] Avg episode reward: [(0, '1.509')] [2022-07-11 11:56:17,620][26022] Updated weights on worker 0-0, policy_version 1181151 (0.01442) [2022-07-11 11:56:19,474][26022] Updated weights on worker 0-0, policy_version 1181161 (0.00109) [2022-07-11 11:56:21,206][26022] Updated weights on worker 0-0, policy_version 1181171 (0.00088) [2022-07-11 11:56:22,576][25689] Fps is (10 sec: 5593.7, 60 sec: 5505.5, 300 sec: 5565.6). Total num frames: 1209526272. Throughput: 0: 5700.0. Samples: 1209535920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:22,576][25689] Avg episode reward: [(0, '1.447')] [2022-07-11 11:56:23,075][26022] Updated weights on worker 0-0, policy_version 1181181 (0.00087) [2022-07-11 11:56:24,790][26022] Updated weights on worker 0-0, policy_version 1181191 (0.00099) [2022-07-11 11:56:26,916][26022] Updated weights on worker 0-0, policy_version 1181201 (0.00092) [2022-07-11 11:56:27,610][25689] Fps is (10 sec: 5583.5, 60 sec: 5520.3, 300 sec: 5566.1). Total num frames: 1209553920. Throughput: 0: 5796.7. Samples: 1209552748. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:27,611][25689] Avg episode reward: [(0, '1.788')] [2022-07-11 11:56:28,431][26022] Updated weights on worker 0-0, policy_version 1181211 (0.00619) [2022-07-11 11:56:30,548][26022] Updated weights on worker 0-0, policy_version 1181221 (0.00088) [2022-07-11 11:56:32,127][26022] Updated weights on worker 0-0, policy_version 1181231 (0.00613) [2022-07-11 11:56:32,623][25689] Fps is (10 sec: 5503.6, 60 sec: 5519.8, 300 sec: 5566.2). Total num frames: 1209581568. Throughput: 0: 5803.9. Samples: 1209586126. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:32,624][25689] Avg episode reward: [(0, '1.973')] [2022-07-11 11:56:34,084][26022] Updated weights on worker 0-0, policy_version 1181241 (0.00086) [2022-07-11 11:56:36,069][26022] Updated weights on worker 0-0, policy_version 1181251 (0.00085) [2022-07-11 11:56:37,657][25689] Fps is (10 sec: 5605.7, 60 sec: 5517.7, 300 sec: 5564.8). Total num frames: 1209610240. Throughput: 0: 5821.9. Samples: 1209619782. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 11:56:37,658][25689] Avg episode reward: [(0, '1.847')] [2022-07-11 11:56:37,738][26022] Updated weights on worker 0-0, policy_version 1181261 (0.00085) [2022-07-11 11:56:39,754][26022] Updated weights on worker 0-0, policy_version 1181271 (0.00096) [2022-07-11 11:56:41,420][26022] Updated weights on worker 0-0, policy_version 1181281 (0.00086) [2022-07-11 11:56:42,748][25689] Fps is (10 sec: 5663.4, 60 sec: 5531.5, 300 sec: 5564.3). Total num frames: 1209638912. Throughput: 0: 4986.9. Samples: 1209636520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:56:42,749][25689] Avg episode reward: [(0, '1.923')] [2022-07-11 11:56:43,298][26022] Updated weights on worker 0-0, policy_version 1181291 (0.00510) [2022-07-11 11:56:45,126][26022] Updated weights on worker 0-0, policy_version 1181301 (0.00090) [2022-07-11 11:56:46,960][26022] Updated weights on worker 0-0, policy_version 1181311 (0.00085) [2022-07-11 11:56:47,807][25689] Fps is (10 sec: 5549.1, 60 sec: 5510.5, 300 sec: 5563.4). Total num frames: 1209666560. Throughput: 0: 5792.0. Samples: 1209669726. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:56:47,807][25689] Avg episode reward: [(0, '1.631')] [2022-07-11 11:56:48,684][26022] Updated weights on worker 0-0, policy_version 1181321 (0.00089) [2022-07-11 11:56:50,527][26022] Updated weights on worker 0-0, policy_version 1181331 (0.00090) [2022-07-11 11:56:52,524][26022] Updated weights on worker 0-0, policy_version 1181341 (0.00081) [2022-07-11 11:56:52,809][25689] Fps is (10 sec: 5598.0, 60 sec: 5544.7, 300 sec: 5557.1). Total num frames: 1209695232. Throughput: 0: 5826.4. Samples: 1209703738. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:56:52,810][25689] Avg episode reward: [(0, '1.625')] [2022-07-11 11:56:54,195][26022] Updated weights on worker 0-0, policy_version 1181351 (0.00085) [2022-07-11 11:56:55,938][26022] Updated weights on worker 0-0, policy_version 1181361 (0.00089) [2022-07-11 11:56:57,855][25689] Fps is (10 sec: 5605.1, 60 sec: 5541.0, 300 sec: 5557.8). Total num frames: 1209722880. Throughput: 0: 4995.2. Samples: 1209720672. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:56:57,856][25689] Avg episode reward: [(0, '1.275')] [2022-07-11 11:56:57,917][26022] Updated weights on worker 0-0, policy_version 1181371 (0.00091) [2022-07-11 11:56:59,590][26022] Updated weights on worker 0-0, policy_version 1181381 (0.00090) [2022-07-11 11:57:01,584][26022] Updated weights on worker 0-0, policy_version 1181391 (0.00102) [2022-07-11 11:57:03,000][25689] Fps is (10 sec: 5325.6, 60 sec: 5533.5, 300 sec: 5559.3). Total num frames: 1209749504. Throughput: 0: 5791.4. Samples: 1209753804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:03,000][25689] Avg episode reward: [(0, '0.932')] [2022-07-11 11:57:03,500][26022] Updated weights on worker 0-0, policy_version 1181401 (0.00088) [2022-07-11 11:57:05,528][26022] Updated weights on worker 0-0, policy_version 1181411 (0.00089) [2022-07-11 11:57:07,202][26022] Updated weights on worker 0-0, policy_version 1181421 (0.00086) [2022-07-11 11:57:08,053][25689] Fps is (10 sec: 5321.6, 60 sec: 5529.7, 300 sec: 5555.0). Total num frames: 1209777152. Throughput: 0: 5747.9. Samples: 1209786100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:08,055][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 11:57:09,151][26022] Updated weights on worker 0-0, policy_version 1181431 (0.00093) [2022-07-11 11:57:11,029][26022] Updated weights on worker 0-0, policy_version 1181441 (0.00098) [2022-07-11 11:57:12,707][26022] Updated weights on worker 0-0, policy_version 1181451 (0.00088) [2022-07-11 11:57:13,154][25689] Fps is (10 sec: 5748.3, 60 sec: 5572.0, 300 sec: 5561.6). Total num frames: 1209807872. Throughput: 0: 4879.3. Samples: 1209803006. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:13,156][25689] Avg episode reward: [(0, '-0.153')] [2022-07-11 11:57:14,920][26022] Updated weights on worker 0-0, policy_version 1181461 (0.00089) [2022-07-11 11:57:16,518][26022] Updated weights on worker 0-0, policy_version 1181471 (0.00102) [2022-07-11 11:57:18,177][25689] Fps is (10 sec: 5664.0, 60 sec: 5554.4, 300 sec: 5559.0). Total num frames: 1209834496. Throughput: 0: 5682.6. Samples: 1209836158. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:18,179][25689] Avg episode reward: [(0, '-0.281')] [2022-07-11 11:57:18,507][26022] Updated weights on worker 0-0, policy_version 1181481 (0.00084) [2022-07-11 11:57:20,199][26022] Updated weights on worker 0-0, policy_version 1181491 (0.00093) [2022-07-11 11:57:22,030][26022] Updated weights on worker 0-0, policy_version 1181501 (0.00095) [2022-07-11 11:57:23,283][25689] Fps is (10 sec: 5459.2, 60 sec: 5549.5, 300 sec: 5557.3). Total num frames: 1209863168. Throughput: 0: 5705.8. Samples: 1209869534. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:23,284][25689] Avg episode reward: [(0, '-0.243')] [2022-07-11 11:57:24,108][26022] Updated weights on worker 0-0, policy_version 1181511 (0.00088) [2022-07-11 11:57:25,655][26022] Updated weights on worker 0-0, policy_version 1181521 (0.00092) [2022-07-11 11:57:27,813][26022] Updated weights on worker 0-0, policy_version 1181531 (0.00087) [2022-07-11 11:57:28,288][25689] Fps is (10 sec: 5671.7, 60 sec: 5569.1, 300 sec: 5557.4). Total num frames: 1209891840. Throughput: 0: 4944.6. Samples: 1209886154. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:28,289][25689] Avg episode reward: [(0, '0.260')] [2022-07-11 11:57:29,333][26022] Updated weights on worker 0-0, policy_version 1181541 (0.00085) [2022-07-11 11:57:31,310][26022] Updated weights on worker 0-0, policy_version 1181551 (0.00089) [2022-07-11 11:57:33,312][25689] Fps is (10 sec: 5411.3, 60 sec: 5534.3, 300 sec: 5547.0). Total num frames: 1209917440. Throughput: 0: 5769.9. Samples: 1209919318. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:33,313][25689] Avg episode reward: [(0, '0.348')] [2022-07-11 11:57:33,377][26022] Updated weights on worker 0-0, policy_version 1181561 (0.00083) [2022-07-11 11:57:35,074][26022] Updated weights on worker 0-0, policy_version 1181571 (0.00091) [2022-07-11 11:57:37,001][26022] Updated weights on worker 0-0, policy_version 1181581 (0.00085) [2022-07-11 11:57:38,342][25689] Fps is (10 sec: 5398.1, 60 sec: 5534.7, 300 sec: 5554.2). Total num frames: 1209946112. Throughput: 0: 5759.0. Samples: 1209952286. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:38,343][25689] Avg episode reward: [(0, '1.135')] [2022-07-11 11:57:38,777][26022] Updated weights on worker 0-0, policy_version 1181591 (0.00085) [2022-07-11 11:57:40,694][26022] Updated weights on worker 0-0, policy_version 1181601 (0.00086) [2022-07-11 11:57:42,514][26022] Updated weights on worker 0-0, policy_version 1181611 (0.00086) [2022-07-11 11:57:43,391][25689] Fps is (10 sec: 5588.1, 60 sec: 5521.7, 300 sec: 5547.0). Total num frames: 1209973760. Throughput: 0: 4948.2. Samples: 1209969032. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:43,391][25689] Avg episode reward: [(0, '1.267')] [2022-07-11 11:57:44,266][26022] Updated weights on worker 0-0, policy_version 1181621 (0.00087) [2022-07-11 11:57:46,249][26022] Updated weights on worker 0-0, policy_version 1181631 (0.00086) [2022-07-11 11:57:47,806][26022] Updated weights on worker 0-0, policy_version 1181641 (0.00088) [2022-07-11 11:57:48,418][25689] Fps is (10 sec: 5589.3, 60 sec: 5541.4, 300 sec: 5546.6). Total num frames: 1210002432. Throughput: 0: 5778.5. Samples: 1210002478. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:48,419][25689] Avg episode reward: [(0, '1.543')] [2022-07-11 11:57:49,924][26022] Updated weights on worker 0-0, policy_version 1181651 (0.00086) [2022-07-11 11:57:51,589][26022] Updated weights on worker 0-0, policy_version 1181661 (0.00083) [2022-07-11 11:57:53,433][25689] Fps is (10 sec: 5608.0, 60 sec: 5523.3, 300 sec: 5549.8). Total num frames: 1210030080. Throughput: 0: 5803.6. Samples: 1210036094. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:53,434][25689] Avg episode reward: [(0, '1.590')] [2022-07-11 11:57:53,523][26022] Updated weights on worker 0-0, policy_version 1181671 (0.00094) [2022-07-11 11:57:55,354][26022] Updated weights on worker 0-0, policy_version 1181681 (0.00087) [2022-07-11 11:57:57,175][26022] Updated weights on worker 0-0, policy_version 1181691 (0.00096) [2022-07-11 11:57:58,443][25689] Fps is (10 sec: 5720.6, 60 sec: 5560.5, 300 sec: 5550.5). Total num frames: 1210059776. Throughput: 0: 5006.3. Samples: 1210052916. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:57:58,443][25689] Avg episode reward: [(0, '1.307')] [2022-07-11 11:57:59,069][26022] Updated weights on worker 0-0, policy_version 1181701 (0.00092) [2022-07-11 11:58:00,840][26022] Updated weights on worker 0-0, policy_version 1181711 (0.00105) [2022-07-11 11:58:03,091][26022] Updated weights on worker 0-0, policy_version 1181721 (0.00090) [2022-07-11 11:58:03,545][25689] Fps is (10 sec: 5367.2, 60 sec: 5530.5, 300 sec: 5549.0). Total num frames: 1210084352. Throughput: 0: 5744.2. Samples: 1210084804. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:03,546][25689] Avg episode reward: [(0, '1.527')] [2022-07-11 11:58:04,159][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 11:58:04,167][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001181727_1210088448.pth [2022-07-11 11:58:04,168][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001179772_1208086528.pth [2022-07-11 11:58:05,018][26022] Updated weights on worker 0-0, policy_version 1181731 (0.00086) [2022-07-11 11:58:06,911][26022] Updated weights on worker 0-0, policy_version 1181741 (0.00086) [2022-07-11 11:58:08,510][26022] Updated weights on worker 0-0, policy_version 1181751 (0.00087) [2022-07-11 11:58:08,610][25689] Fps is (10 sec: 5237.3, 60 sec: 5546.4, 300 sec: 5548.1). Total num frames: 1210113024. Throughput: 0: 5713.6. Samples: 1210117842. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:08,610][25689] Avg episode reward: [(0, '1.543')] [2022-07-11 11:58:10,465][26022] Updated weights on worker 0-0, policy_version 1181761 (0.00087) [2022-07-11 11:58:12,261][26022] Updated weights on worker 0-0, policy_version 1181771 (0.00091) [2022-07-11 11:58:13,621][25689] Fps is (10 sec: 5589.7, 60 sec: 5503.8, 300 sec: 5544.6). Total num frames: 1210140672. Throughput: 0: 4881.3. Samples: 1210134636. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:13,621][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 11:58:13,979][26022] Updated weights on worker 0-0, policy_version 1181781 (0.00093) [2022-07-11 11:58:15,870][26022] Updated weights on worker 0-0, policy_version 1181791 (0.00088) [2022-07-11 11:58:17,591][26022] Updated weights on worker 0-0, policy_version 1181801 (0.00108) [2022-07-11 11:58:18,634][25689] Fps is (10 sec: 5516.4, 60 sec: 5521.8, 300 sec: 5545.5). Total num frames: 1210168320. Throughput: 0: 5731.5. Samples: 1210168640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:18,636][25689] Avg episode reward: [(0, '1.498')] [2022-07-11 11:58:19,667][26022] Updated weights on worker 0-0, policy_version 1181811 (0.00089) [2022-07-11 11:58:21,469][26022] Updated weights on worker 0-0, policy_version 1181821 (0.00086) [2022-07-11 11:58:23,137][26022] Updated weights on worker 0-0, policy_version 1181831 (0.00092) [2022-07-11 11:58:23,756][25689] Fps is (10 sec: 5657.7, 60 sec: 5537.1, 300 sec: 5553.9). Total num frames: 1210198016. Throughput: 0: 5788.8. Samples: 1210201802. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:23,757][25689] Avg episode reward: [(0, '1.510')] [2022-07-11 11:58:25,266][26022] Updated weights on worker 0-0, policy_version 1181841 (0.00092) [2022-07-11 11:58:26,752][26022] Updated weights on worker 0-0, policy_version 1181851 (0.00092) [2022-07-11 11:58:28,756][26022] Updated weights on worker 0-0, policy_version 1181861 (0.00088) [2022-07-11 11:58:28,821][25689] Fps is (10 sec: 5628.8, 60 sec: 5514.7, 300 sec: 5545.9). Total num frames: 1210225664. Throughput: 0: 4985.6. Samples: 1210218608. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:28,822][25689] Avg episode reward: [(0, '1.634')] [2022-07-11 11:58:30,815][26022] Updated weights on worker 0-0, policy_version 1181871 (0.00089) [2022-07-11 11:58:32,260][26022] Updated weights on worker 0-0, policy_version 1181881 (0.00091) [2022-07-11 11:58:33,839][25689] Fps is (10 sec: 5382.5, 60 sec: 5532.2, 300 sec: 5542.5). Total num frames: 1210252288. Throughput: 0: 5795.9. Samples: 1210251820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:33,841][25689] Avg episode reward: [(0, '1.153')] [2022-07-11 11:58:34,522][26022] Updated weights on worker 0-0, policy_version 1181891 (0.00094) [2022-07-11 11:58:35,786][26022] Updated weights on worker 0-0, policy_version 1181901 (0.00086) [2022-07-11 11:58:37,999][26022] Updated weights on worker 0-0, policy_version 1181911 (0.00086) [2022-07-11 11:58:38,868][25689] Fps is (10 sec: 5605.8, 60 sec: 5549.2, 300 sec: 5553.4). Total num frames: 1210281984. Throughput: 0: 5776.8. Samples: 1210285530. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:38,870][25689] Avg episode reward: [(0, '1.389')] [2022-07-11 11:58:39,647][26022] Updated weights on worker 0-0, policy_version 1181921 (0.00099) [2022-07-11 11:58:41,596][26022] Updated weights on worker 0-0, policy_version 1181931 (0.00096) [2022-07-11 11:58:43,583][26022] Updated weights on worker 0-0, policy_version 1181941 (0.00091) [2022-07-11 11:58:43,948][25689] Fps is (10 sec: 5672.9, 60 sec: 5546.4, 300 sec: 5542.0). Total num frames: 1210309632. Throughput: 0: 5806.0. Samples: 1210319034. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:43,948][25689] Avg episode reward: [(0, '1.461')] [2022-07-11 11:58:45,190][26022] Updated weights on worker 0-0, policy_version 1181951 (0.00085) [2022-07-11 11:58:47,112][26022] Updated weights on worker 0-0, policy_version 1181961 (0.00083) [2022-07-11 11:58:48,952][25689] Fps is (10 sec: 5483.7, 60 sec: 5531.6, 300 sec: 5545.6). Total num frames: 1210337280. Throughput: 0: 5830.0. Samples: 1210335968. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:48,952][25689] Avg episode reward: [(0, '1.490')] [2022-07-11 11:58:48,972][26022] Updated weights on worker 0-0, policy_version 1181971 (0.00071) [2022-07-11 11:58:50,858][26022] Updated weights on worker 0-0, policy_version 1181981 (0.00092) [2022-07-11 11:58:52,838][26022] Updated weights on worker 0-0, policy_version 1181991 (0.00092) [2022-07-11 11:58:53,979][25689] Fps is (10 sec: 5512.5, 60 sec: 5530.5, 300 sec: 5538.5). Total num frames: 1210364928. Throughput: 0: 5803.1. Samples: 1210368692. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:53,979][25689] Avg episode reward: [(0, '1.274')] [2022-07-11 11:58:54,660][26022] Updated weights on worker 0-0, policy_version 1182001 (0.00058) [2022-07-11 11:58:56,511][26022] Updated weights on worker 0-0, policy_version 1182011 (0.00086) [2022-07-11 11:58:58,315][26022] Updated weights on worker 0-0, policy_version 1182021 (0.00090) [2022-07-11 11:58:59,006][25689] Fps is (10 sec: 5601.6, 60 sec: 5512.0, 300 sec: 5546.6). Total num frames: 1210393600. Throughput: 0: 5802.9. Samples: 1210402388. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:58:59,006][25689] Avg episode reward: [(0, '1.230')] [2022-07-11 11:59:00,237][26022] Updated weights on worker 0-0, policy_version 1182031 (0.00088) [2022-07-11 11:59:02,098][26022] Updated weights on worker 0-0, policy_version 1182041 (0.00084) [2022-07-11 11:59:04,104][25689] Fps is (10 sec: 5461.3, 60 sec: 5546.2, 300 sec: 5541.6). Total num frames: 1210420224. Throughput: 0: 4964.6. Samples: 1210419100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:04,104][25689] Avg episode reward: [(0, '1.538')] [2022-07-11 11:59:04,107][26022] Updated weights on worker 0-0, policy_version 1182051 (0.00089) [2022-07-11 11:59:06,114][26022] Updated weights on worker 0-0, policy_version 1182061 (0.00613) [2022-07-11 11:59:07,603][26022] Updated weights on worker 0-0, policy_version 1182071 (0.00080) [2022-07-11 11:59:09,123][25689] Fps is (10 sec: 5263.0, 60 sec: 5516.5, 300 sec: 5541.5). Total num frames: 1210446848. Throughput: 0: 5675.7. Samples: 1210450456. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:09,124][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 11:59:09,877][26022] Updated weights on worker 0-0, policy_version 1182081 (0.00090) [2022-07-11 11:59:11,432][26022] Updated weights on worker 0-0, policy_version 1182091 (0.00081) [2022-07-11 11:59:13,419][26022] Updated weights on worker 0-0, policy_version 1182101 (0.00089) [2022-07-11 11:59:14,163][25689] Fps is (10 sec: 5700.4, 60 sec: 5564.6, 300 sec: 5544.4). Total num frames: 1210477568. Throughput: 0: 5741.0. Samples: 1210484572. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:14,164][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 11:59:14,900][26022] Updated weights on worker 0-0, policy_version 1182111 (0.00092) [2022-07-11 11:59:17,158][26022] Updated weights on worker 0-0, policy_version 1182121 (0.00089) [2022-07-11 11:59:18,753][26022] Updated weights on worker 0-0, policy_version 1182131 (0.00079) [2022-07-11 11:59:19,168][25689] Fps is (10 sec: 5810.5, 60 sec: 5565.3, 300 sec: 5550.2). Total num frames: 1210505216. Throughput: 0: 4913.4. Samples: 1210501456. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:19,173][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 11:59:20,819][26022] Updated weights on worker 0-0, policy_version 1182141 (0.00085) [2022-07-11 11:59:22,291][26022] Updated weights on worker 0-0, policy_version 1182151 (0.00096) [2022-07-11 11:59:24,231][25689] Fps is (10 sec: 5289.3, 60 sec: 5503.2, 300 sec: 5538.8). Total num frames: 1210530816. Throughput: 0: 5728.1. Samples: 1210534388. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:24,231][25689] Avg episode reward: [(0, '-0.220')] [2022-07-11 11:59:24,708][26022] Updated weights on worker 0-0, policy_version 1182161 (0.00084) [2022-07-11 11:59:26,174][26022] Updated weights on worker 0-0, policy_version 1182171 (0.00087) [2022-07-11 11:59:28,107][26022] Updated weights on worker 0-0, policy_version 1182181 (0.00088) [2022-07-11 11:59:29,275][25689] Fps is (10 sec: 5471.5, 60 sec: 5538.9, 300 sec: 5542.7). Total num frames: 1210560512. Throughput: 0: 5829.9. Samples: 1210567936. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:29,276][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 11:59:29,797][26022] Updated weights on worker 0-0, policy_version 1182191 (0.00084) [2022-07-11 11:59:31,742][26022] Updated weights on worker 0-0, policy_version 1182201 (0.00091) [2022-07-11 11:59:33,500][26022] Updated weights on worker 0-0, policy_version 1182211 (0.00086) [2022-07-11 11:59:34,332][25689] Fps is (10 sec: 5676.9, 60 sec: 5552.3, 300 sec: 5541.9). Total num frames: 1210588160. Throughput: 0: 4966.4. Samples: 1210584736. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:34,332][25689] Avg episode reward: [(0, '0.102')] [2022-07-11 11:59:35,429][26022] Updated weights on worker 0-0, policy_version 1182221 (0.00095) [2022-07-11 11:59:37,094][26022] Updated weights on worker 0-0, policy_version 1182231 (0.00085) [2022-07-11 11:59:39,202][26022] Updated weights on worker 0-0, policy_version 1182241 (0.00085) [2022-07-11 11:59:39,384][25689] Fps is (10 sec: 5469.6, 60 sec: 5516.3, 300 sec: 5534.8). Total num frames: 1210615808. Throughput: 0: 5776.3. Samples: 1210618228. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:39,385][25689] Avg episode reward: [(0, '-0.999')] [2022-07-11 11:59:40,795][26022] Updated weights on worker 0-0, policy_version 1182251 (0.00508) [2022-07-11 11:59:42,896][26022] Updated weights on worker 0-0, policy_version 1182261 (0.00094) [2022-07-11 11:59:44,487][25689] Fps is (10 sec: 5546.1, 60 sec: 5531.1, 300 sec: 5540.2). Total num frames: 1210644480. Throughput: 0: 5787.6. Samples: 1210651622. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:44,487][25689] Avg episode reward: [(0, '-1.287')] [2022-07-11 11:59:44,538][26022] Updated weights on worker 0-0, policy_version 1182271 (0.00080) [2022-07-11 11:59:46,488][26022] Updated weights on worker 0-0, policy_version 1182281 (0.00094) [2022-07-11 11:59:48,242][26022] Updated weights on worker 0-0, policy_version 1182291 (0.00088) [2022-07-11 11:59:49,529][25689] Fps is (10 sec: 5551.6, 60 sec: 5527.6, 300 sec: 5539.6). Total num frames: 1210672128. Throughput: 0: 4953.8. Samples: 1210668272. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:49,529][25689] Avg episode reward: [(0, '-1.158')] [2022-07-11 11:59:50,106][26022] Updated weights on worker 0-0, policy_version 1182301 (0.00086) [2022-07-11 11:59:51,941][26022] Updated weights on worker 0-0, policy_version 1182311 (0.00077) [2022-07-11 11:59:53,945][26022] Updated weights on worker 0-0, policy_version 1182321 (0.00340) [2022-07-11 11:59:54,542][25689] Fps is (10 sec: 5600.8, 60 sec: 5545.8, 300 sec: 5536.4). Total num frames: 1210700800. Throughput: 0: 5782.8. Samples: 1210701610. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:54,544][25689] Avg episode reward: [(0, '-2.323')] [2022-07-11 11:59:55,444][26022] Updated weights on worker 0-0, policy_version 1182331 (0.00053) [2022-07-11 11:59:57,530][26022] Updated weights on worker 0-0, policy_version 1182341 (0.00095) [2022-07-11 11:59:59,134][26022] Updated weights on worker 0-0, policy_version 1182351 (0.00090) [2022-07-11 11:59:59,567][25689] Fps is (10 sec: 5610.7, 60 sec: 5529.1, 300 sec: 5540.7). Total num frames: 1210728448. Throughput: 0: 5809.8. Samples: 1210735486. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 11:59:59,569][25689] Avg episode reward: [(0, '-2.408')] [2022-07-11 12:00:01,129][26022] Updated weights on worker 0-0, policy_version 1182361 (0.00089) [2022-07-11 12:00:03,483][26022] Updated weights on worker 0-0, policy_version 1182371 (0.00095) [2022-07-11 12:00:04,200][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:00:04,217][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001182376_1210753024.pth [2022-07-11 12:00:04,217][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001180427_1208757248.pth [2022-07-11 12:00:04,636][25689] Fps is (10 sec: 5478.4, 60 sec: 5548.7, 300 sec: 5543.5). Total num frames: 1210756096. Throughput: 0: 4954.8. Samples: 1210751456. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:04,636][25689] Avg episode reward: [(0, '-2.464')] [2022-07-11 12:00:05,105][26022] Updated weights on worker 0-0, policy_version 1182381 (0.00085) [2022-07-11 12:00:07,096][26022] Updated weights on worker 0-0, policy_version 1182391 (0.00091) [2022-07-11 12:00:08,819][26022] Updated weights on worker 0-0, policy_version 1182401 (0.00083) [2022-07-11 12:00:09,642][25689] Fps is (10 sec: 5285.0, 60 sec: 5532.9, 300 sec: 5530.0). Total num frames: 1210781696. Throughput: 0: 5750.8. Samples: 1210783938. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:09,643][25689] Avg episode reward: [(0, '-1.549')] [2022-07-11 12:00:10,559][26022] Updated weights on worker 0-0, policy_version 1182411 (0.00082) [2022-07-11 12:00:12,677][26022] Updated weights on worker 0-0, policy_version 1182421 (0.00095) [2022-07-11 12:00:14,112][26022] Updated weights on worker 0-0, policy_version 1182431 (0.00089) [2022-07-11 12:00:14,667][25689] Fps is (10 sec: 5410.4, 60 sec: 5500.5, 300 sec: 5536.5). Total num frames: 1210810368. Throughput: 0: 5744.8. Samples: 1210817220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:14,667][25689] Avg episode reward: [(0, '-1.877')] [2022-07-11 12:00:16,256][26022] Updated weights on worker 0-0, policy_version 1182441 (0.00090) [2022-07-11 12:00:18,041][26022] Updated weights on worker 0-0, policy_version 1182451 (0.00096) [2022-07-11 12:00:19,694][25689] Fps is (10 sec: 5602.9, 60 sec: 5498.5, 300 sec: 5526.7). Total num frames: 1210838016. Throughput: 0: 4892.9. Samples: 1210833966. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:19,695][25689] Avg episode reward: [(0, '-1.198')] [2022-07-11 12:00:19,928][26022] Updated weights on worker 0-0, policy_version 1182461 (0.00088) [2022-07-11 12:00:21,893][26022] Updated weights on worker 0-0, policy_version 1182471 (0.00091) [2022-07-11 12:00:23,683][26022] Updated weights on worker 0-0, policy_version 1182481 (0.00511) [2022-07-11 12:00:24,759][25689] Fps is (10 sec: 5580.8, 60 sec: 5549.0, 300 sec: 5532.5). Total num frames: 1210866688. Throughput: 0: 5734.6. Samples: 1210866852. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:24,759][25689] Avg episode reward: [(0, '-0.261')] [2022-07-11 12:00:25,674][26022] Updated weights on worker 0-0, policy_version 1182491 (0.00093) [2022-07-11 12:00:27,400][26022] Updated weights on worker 0-0, policy_version 1182501 (0.00075) [2022-07-11 12:00:29,305][26022] Updated weights on worker 0-0, policy_version 1182511 (0.00088) [2022-07-11 12:00:29,795][25689] Fps is (10 sec: 5474.5, 60 sec: 5499.0, 300 sec: 5528.6). Total num frames: 1210893312. Throughput: 0: 5754.5. Samples: 1210899906. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:29,795][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 12:00:31,038][26022] Updated weights on worker 0-0, policy_version 1182521 (0.00085) [2022-07-11 12:00:32,936][26022] Updated weights on worker 0-0, policy_version 1182531 (0.00090) [2022-07-11 12:00:34,702][26022] Updated weights on worker 0-0, policy_version 1182541 (0.00087) [2022-07-11 12:00:34,809][25689] Fps is (10 sec: 5501.8, 60 sec: 5519.8, 300 sec: 5528.5). Total num frames: 1210921984. Throughput: 0: 4942.8. Samples: 1210916780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2022-07-11 12:00:34,810][25689] Avg episode reward: [(0, '-0.662')] [2022-07-11 12:00:36,598][26022] Updated weights on worker 0-0, policy_version 1182551 (0.00087) [2022-07-11 12:00:38,377][26022] Updated weights on worker 0-0, policy_version 1182561 (0.00095) [2022-07-11 12:00:39,828][25689] Fps is (10 sec: 5715.3, 60 sec: 5539.8, 300 sec: 5532.6). Total num frames: 1210950656. Throughput: 0: 5790.7. Samples: 1210950556. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:00:39,830][25689] Avg episode reward: [(0, '-0.304')] [2022-07-11 12:00:40,305][26022] Updated weights on worker 0-0, policy_version 1182571 (0.00089) [2022-07-11 12:00:42,055][26022] Updated weights on worker 0-0, policy_version 1182581 (0.00056) [2022-07-11 12:00:43,774][26022] Updated weights on worker 0-0, policy_version 1182591 (0.00086) [2022-07-11 12:00:44,914][25689] Fps is (10 sec: 5573.8, 60 sec: 5524.4, 300 sec: 5527.9). Total num frames: 1210978304. Throughput: 0: 5817.1. Samples: 1210984096. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:00:44,914][25689] Avg episode reward: [(0, '-0.089')] [2022-07-11 12:00:45,632][26022] Updated weights on worker 0-0, policy_version 1182601 (0.00405) [2022-07-11 12:00:47,393][26022] Updated weights on worker 0-0, policy_version 1182611 (0.00090) [2022-07-11 12:00:49,377][26022] Updated weights on worker 0-0, policy_version 1182621 (0.00091) [2022-07-11 12:00:49,937][25689] Fps is (10 sec: 5571.4, 60 sec: 5543.1, 300 sec: 5534.4). Total num frames: 1211006976. Throughput: 0: 5020.7. Samples: 1211001034. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:00:49,937][25689] Avg episode reward: [(0, '0.368')] [2022-07-11 12:00:51,206][26022] Updated weights on worker 0-0, policy_version 1182631 (0.00091) [2022-07-11 12:00:53,109][26022] Updated weights on worker 0-0, policy_version 1182641 (0.00091) [2022-07-11 12:00:54,721][26022] Updated weights on worker 0-0, policy_version 1182651 (0.00082) [2022-07-11 12:00:54,951][25689] Fps is (10 sec: 5712.9, 60 sec: 5543.0, 300 sec: 5537.7). Total num frames: 1211035648. Throughput: 0: 5843.7. Samples: 1211034484. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:00:54,952][25689] Avg episode reward: [(0, '-0.891')] [2022-07-11 12:00:56,751][26022] Updated weights on worker 0-0, policy_version 1182661 (0.00088) [2022-07-11 12:00:58,421][26022] Updated weights on worker 0-0, policy_version 1182671 (0.00088) [2022-07-11 12:00:59,967][25689] Fps is (10 sec: 5513.1, 60 sec: 5526.9, 300 sec: 5538.6). Total num frames: 1211062272. Throughput: 0: 5852.2. Samples: 1211068412. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:00:59,968][25689] Avg episode reward: [(0, '-1.185')] [2022-07-11 12:01:00,391][26022] Updated weights on worker 0-0, policy_version 1182681 (0.00105) [2022-07-11 12:01:02,422][26022] Updated weights on worker 0-0, policy_version 1182691 (0.00080) [2022-07-11 12:01:04,307][26022] Updated weights on worker 0-0, policy_version 1182701 (0.00494) [2022-07-11 12:01:05,036][25689] Fps is (10 sec: 5381.5, 60 sec: 5526.8, 300 sec: 5537.5). Total num frames: 1211089920. Throughput: 0: 4940.2. Samples: 1211083506. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:05,037][25689] Avg episode reward: [(0, '-0.411')] [2022-07-11 12:01:06,051][26022] Updated weights on worker 0-0, policy_version 1182711 (0.00093) [2022-07-11 12:01:08,128][26022] Updated weights on worker 0-0, policy_version 1182721 (0.00088) [2022-07-11 12:01:09,806][26022] Updated weights on worker 0-0, policy_version 1182731 (0.00069) [2022-07-11 12:01:10,062][25689] Fps is (10 sec: 5477.3, 60 sec: 5559.0, 300 sec: 5537.2). Total num frames: 1211117568. Throughput: 0: 5755.0. Samples: 1211116856. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:10,063][25689] Avg episode reward: [(0, '-1.573')] [2022-07-11 12:01:11,806][26022] Updated weights on worker 0-0, policy_version 1182741 (0.00094) [2022-07-11 12:01:13,428][26022] Updated weights on worker 0-0, policy_version 1182751 (0.00113) [2022-07-11 12:01:15,128][25689] Fps is (10 sec: 5479.6, 60 sec: 5538.3, 300 sec: 5536.3). Total num frames: 1211145216. Throughput: 0: 5745.3. Samples: 1211150402. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:15,129][25689] Avg episode reward: [(0, '-2.844')] [2022-07-11 12:01:15,380][26022] Updated weights on worker 0-0, policy_version 1182761 (0.00095) [2022-07-11 12:01:17,051][26022] Updated weights on worker 0-0, policy_version 1182771 (0.00086) [2022-07-11 12:01:18,991][26022] Updated weights on worker 0-0, policy_version 1182781 (0.00107) [2022-07-11 12:01:20,146][25689] Fps is (10 sec: 5686.5, 60 sec: 5572.9, 300 sec: 5540.3). Total num frames: 1211174912. Throughput: 0: 4907.6. Samples: 1211167446. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:20,148][25689] Avg episode reward: [(0, '-2.253')] [2022-07-11 12:01:20,967][26022] Updated weights on worker 0-0, policy_version 1182791 (0.00086) [2022-07-11 12:01:22,798][26022] Updated weights on worker 0-0, policy_version 1182801 (0.00089) [2022-07-11 12:01:24,471][26022] Updated weights on worker 0-0, policy_version 1182811 (0.00097) [2022-07-11 12:01:25,291][25689] Fps is (10 sec: 5541.3, 60 sec: 5531.7, 300 sec: 5534.8). Total num frames: 1211201536. Throughput: 0: 5772.1. Samples: 1211200420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:25,292][25689] Avg episode reward: [(0, '-1.851')] [2022-07-11 12:01:26,480][26022] Updated weights on worker 0-0, policy_version 1182821 (0.00093) [2022-07-11 12:01:28,120][26022] Updated weights on worker 0-0, policy_version 1182831 (0.01156) [2022-07-11 12:01:30,286][26022] Updated weights on worker 0-0, policy_version 1182841 (0.00091) [2022-07-11 12:01:30,307][25689] Fps is (10 sec: 5341.3, 60 sec: 5550.4, 300 sec: 5534.8). Total num frames: 1211229184. Throughput: 0: 5754.8. Samples: 1211233362. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:30,308][25689] Avg episode reward: [(0, '-1.926')] [2022-07-11 12:01:31,885][26022] Updated weights on worker 0-0, policy_version 1182851 (0.00088) [2022-07-11 12:01:33,988][26022] Updated weights on worker 0-0, policy_version 1182861 (0.00088) [2022-07-11 12:01:35,323][25689] Fps is (10 sec: 5614.4, 60 sec: 5550.4, 300 sec: 5535.1). Total num frames: 1211257856. Throughput: 0: 5758.5. Samples: 1211266696. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:35,323][25689] Avg episode reward: [(0, '-1.159')] [2022-07-11 12:01:35,615][26022] Updated weights on worker 0-0, policy_version 1182871 (0.00093) [2022-07-11 12:01:37,655][26022] Updated weights on worker 0-0, policy_version 1182881 (0.00086) [2022-07-11 12:01:39,376][26022] Updated weights on worker 0-0, policy_version 1182891 (0.00087) [2022-07-11 12:01:40,334][25689] Fps is (10 sec: 5514.6, 60 sec: 5517.2, 300 sec: 5529.7). Total num frames: 1211284480. Throughput: 0: 5755.1. Samples: 1211283630. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:40,335][25689] Avg episode reward: [(0, '-0.217')] [2022-07-11 12:01:41,195][26022] Updated weights on worker 0-0, policy_version 1182901 (0.00083) [2022-07-11 12:01:43,201][26022] Updated weights on worker 0-0, policy_version 1182911 (0.00089) [2022-07-11 12:01:44,855][26022] Updated weights on worker 0-0, policy_version 1182921 (0.00089) [2022-07-11 12:01:45,459][25689] Fps is (10 sec: 5657.4, 60 sec: 5564.4, 300 sec: 5538.8). Total num frames: 1211315200. Throughput: 0: 5795.3. Samples: 1211317296. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:45,459][25689] Avg episode reward: [(0, '1.195')] [2022-07-11 12:01:46,761][26022] Updated weights on worker 0-0, policy_version 1182931 (0.00088) [2022-07-11 12:01:48,499][26022] Updated weights on worker 0-0, policy_version 1182941 (0.00082) [2022-07-11 12:01:50,472][25689] Fps is (10 sec: 5656.5, 60 sec: 5531.4, 300 sec: 5531.7). Total num frames: 1211341824. Throughput: 0: 5825.6. Samples: 1211350834. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:50,473][25689] Avg episode reward: [(0, '1.160')] [2022-07-11 12:01:50,476][26022] Updated weights on worker 0-0, policy_version 1182951 (0.00090) [2022-07-11 12:01:52,246][26022] Updated weights on worker 0-0, policy_version 1182961 (0.00087) [2022-07-11 12:01:54,097][26022] Updated weights on worker 0-0, policy_version 1182971 (0.00088) [2022-07-11 12:01:55,502][25689] Fps is (10 sec: 5404.1, 60 sec: 5513.2, 300 sec: 5532.0). Total num frames: 1211369472. Throughput: 0: 4984.6. Samples: 1211367280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:01:55,502][25689] Avg episode reward: [(0, '1.512')] [2022-07-11 12:01:55,930][26022] Updated weights on worker 0-0, policy_version 1182981 (0.00084) [2022-07-11 12:01:57,766][26022] Updated weights on worker 0-0, policy_version 1182991 (0.00092) [2022-07-11 12:01:59,676][26022] Updated weights on worker 0-0, policy_version 1183001 (0.00094) [2022-07-11 12:02:00,526][25689] Fps is (10 sec: 5602.1, 60 sec: 5546.2, 300 sec: 5541.2). Total num frames: 1211398144. Throughput: 0: 5790.0. Samples: 1211400536. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:00,527][25689] Avg episode reward: [(0, '1.624')] [2022-07-11 12:02:01,435][26022] Updated weights on worker 0-0, policy_version 1183011 (0.00092) [2022-07-11 12:02:03,734][26022] Updated weights on worker 0-0, policy_version 1183021 (0.00089) [2022-07-11 12:02:04,253][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:02:04,266][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001183024_1211416576.pth [2022-07-11 12:02:04,266][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001181078_1209423872.pth [2022-07-11 12:02:05,450][26022] Updated weights on worker 0-0, policy_version 1183031 (0.00087) [2022-07-11 12:02:05,592][25689] Fps is (10 sec: 5378.8, 60 sec: 5512.7, 300 sec: 5534.0). Total num frames: 1211423744. Throughput: 0: 5708.2. Samples: 1211432218. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:05,592][25689] Avg episode reward: [(0, '0.897')] [2022-07-11 12:02:07,351][26022] Updated weights on worker 0-0, policy_version 1183041 (0.00080) [2022-07-11 12:02:09,251][26022] Updated weights on worker 0-0, policy_version 1183051 (0.00089) [2022-07-11 12:02:10,677][25689] Fps is (10 sec: 5346.4, 60 sec: 5524.2, 300 sec: 5527.5). Total num frames: 1211452416. Throughput: 0: 4862.5. Samples: 1211449078. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:10,677][25689] Avg episode reward: [(0, '0.328')] [2022-07-11 12:02:11,006][26022] Updated weights on worker 0-0, policy_version 1183061 (0.00089) [2022-07-11 12:02:12,902][26022] Updated weights on worker 0-0, policy_version 1183071 (0.00102) [2022-07-11 12:02:14,546][26022] Updated weights on worker 0-0, policy_version 1183081 (0.00092) [2022-07-11 12:02:15,690][25689] Fps is (10 sec: 5577.1, 60 sec: 5529.0, 300 sec: 5531.1). Total num frames: 1211480064. Throughput: 0: 5724.3. Samples: 1211482844. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:15,691][25689] Avg episode reward: [(0, '0.279')] [2022-07-11 12:02:16,607][26022] Updated weights on worker 0-0, policy_version 1183091 (0.00088) [2022-07-11 12:02:18,284][26022] Updated weights on worker 0-0, policy_version 1183101 (0.00083) [2022-07-11 12:02:19,989][26022] Updated weights on worker 0-0, policy_version 1183111 (0.00082) [2022-07-11 12:02:20,705][25689] Fps is (10 sec: 5616.3, 60 sec: 5512.4, 300 sec: 5532.8). Total num frames: 1211508736. Throughput: 0: 5751.1. Samples: 1211516588. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:20,706][25689] Avg episode reward: [(0, '-0.161')] [2022-07-11 12:02:22,121][26022] Updated weights on worker 0-0, policy_version 1183121 (0.00084) [2022-07-11 12:02:23,582][26022] Updated weights on worker 0-0, policy_version 1183131 (0.00090) [2022-07-11 12:02:25,777][25689] Fps is (10 sec: 5583.7, 60 sec: 5536.0, 300 sec: 5528.1). Total num frames: 1211536384. Throughput: 0: 5008.3. Samples: 1211533310. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:25,777][25689] Avg episode reward: [(0, '-0.857')] [2022-07-11 12:02:25,777][26022] Updated weights on worker 0-0, policy_version 1183141 (0.00082) [2022-07-11 12:02:27,492][26022] Updated weights on worker 0-0, policy_version 1183151 (0.00083) [2022-07-11 12:02:29,360][26022] Updated weights on worker 0-0, policy_version 1183161 (0.00083) [2022-07-11 12:02:30,795][25689] Fps is (10 sec: 5581.6, 60 sec: 5552.7, 300 sec: 5538.5). Total num frames: 1211565056. Throughput: 0: 5844.1. Samples: 1211566650. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:30,795][25689] Avg episode reward: [(0, '-1.129')] [2022-07-11 12:02:31,201][26022] Updated weights on worker 0-0, policy_version 1183171 (0.00081) [2022-07-11 12:02:32,939][26022] Updated weights on worker 0-0, policy_version 1183181 (0.00086) [2022-07-11 12:02:34,716][26022] Updated weights on worker 0-0, policy_version 1183191 (0.00083) [2022-07-11 12:02:35,867][25689] Fps is (10 sec: 5683.0, 60 sec: 5547.5, 300 sec: 5537.7). Total num frames: 1211593728. Throughput: 0: 5829.0. Samples: 1211600456. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:35,867][25689] Avg episode reward: [(0, '-0.952')] [2022-07-11 12:02:36,556][26022] Updated weights on worker 0-0, policy_version 1183201 (0.00087) [2022-07-11 12:02:38,362][26022] Updated weights on worker 0-0, policy_version 1183211 (0.00091) [2022-07-11 12:02:40,292][26022] Updated weights on worker 0-0, policy_version 1183221 (0.00090) [2022-07-11 12:02:40,940][25689] Fps is (10 sec: 5551.4, 60 sec: 5558.8, 300 sec: 5537.3). Total num frames: 1211621376. Throughput: 0: 4972.7. Samples: 1211617214. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:40,941][25689] Avg episode reward: [(0, '0.878')] [2022-07-11 12:02:42,013][26022] Updated weights on worker 0-0, policy_version 1183231 (0.00089) [2022-07-11 12:02:43,848][26022] Updated weights on worker 0-0, policy_version 1183241 (0.00091) [2022-07-11 12:02:45,865][26022] Updated weights on worker 0-0, policy_version 1183251 (0.00100) [2022-07-11 12:02:45,998][25689] Fps is (10 sec: 5458.3, 60 sec: 5514.2, 300 sec: 5533.3). Total num frames: 1211649024. Throughput: 0: 5819.0. Samples: 1211650976. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:45,998][25689] Avg episode reward: [(0, '1.428')] [2022-07-11 12:02:47,543][26022] Updated weights on worker 0-0, policy_version 1183261 (0.00093) [2022-07-11 12:02:49,526][26022] Updated weights on worker 0-0, policy_version 1183271 (0.00091) [2022-07-11 12:02:51,000][25689] Fps is (10 sec: 5598.7, 60 sec: 5549.1, 300 sec: 5537.0). Total num frames: 1211677696. Throughput: 0: 5825.6. Samples: 1211684354. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:51,000][25689] Avg episode reward: [(0, '1.592')] [2022-07-11 12:02:51,193][26022] Updated weights on worker 0-0, policy_version 1183281 (0.00089) [2022-07-11 12:02:53,067][26022] Updated weights on worker 0-0, policy_version 1183291 (0.00094) [2022-07-11 12:02:54,886][26022] Updated weights on worker 0-0, policy_version 1183301 (0.00085) [2022-07-11 12:02:56,024][25689] Fps is (10 sec: 5617.1, 60 sec: 5549.5, 300 sec: 5529.8). Total num frames: 1211705344. Throughput: 0: 4998.1. Samples: 1211701204. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:02:56,025][25689] Avg episode reward: [(0, '2.293')] [2022-07-11 12:02:56,687][26022] Updated weights on worker 0-0, policy_version 1183311 (0.00097) [2022-07-11 12:02:58,610][26022] Updated weights on worker 0-0, policy_version 1183321 (0.00094) [2022-07-11 12:03:00,442][26022] Updated weights on worker 0-0, policy_version 1183331 (0.00095) [2022-07-11 12:03:01,050][25689] Fps is (10 sec: 5603.7, 60 sec: 5549.3, 300 sec: 5545.0). Total num frames: 1211734016. Throughput: 0: 5847.1. Samples: 1211734800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:01,051][25689] Avg episode reward: [(0, '2.053')] [2022-07-11 12:03:02,674][26022] Updated weights on worker 0-0, policy_version 1183341 (0.00081) [2022-07-11 12:03:04,290][26022] Updated weights on worker 0-0, policy_version 1183351 (0.00093) [2022-07-11 12:03:06,195][25689] Fps is (10 sec: 5436.5, 60 sec: 5559.0, 300 sec: 5536.6). Total num frames: 1211760640. Throughput: 0: 5699.3. Samples: 1211766088. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:06,196][25689] Avg episode reward: [(0, '1.600')] [2022-07-11 12:03:06,257][26022] Updated weights on worker 0-0, policy_version 1183361 (0.00080) [2022-07-11 12:03:08,146][26022] Updated weights on worker 0-0, policy_version 1183371 (0.00085) [2022-07-11 12:03:09,957][26022] Updated weights on worker 0-0, policy_version 1183381 (0.00089) [2022-07-11 12:03:11,215][25689] Fps is (10 sec: 5339.0, 60 sec: 5548.1, 300 sec: 5536.4). Total num frames: 1211788288. Throughput: 0: 4880.6. Samples: 1211783020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:11,217][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 12:03:11,800][26022] Updated weights on worker 0-0, policy_version 1183391 (0.00091) [2022-07-11 12:03:13,464][26022] Updated weights on worker 0-0, policy_version 1183401 (0.00091) [2022-07-11 12:03:15,462][26022] Updated weights on worker 0-0, policy_version 1183411 (0.00092) [2022-07-11 12:03:16,233][25689] Fps is (10 sec: 5610.9, 60 sec: 5564.6, 300 sec: 5539.8). Total num frames: 1211816960. Throughput: 0: 5716.0. Samples: 1211816718. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:16,235][25689] Avg episode reward: [(0, '0.974')] [2022-07-11 12:03:17,280][26022] Updated weights on worker 0-0, policy_version 1183421 (0.00090) [2022-07-11 12:03:19,211][26022] Updated weights on worker 0-0, policy_version 1183431 (0.00090) [2022-07-11 12:03:21,100][26022] Updated weights on worker 0-0, policy_version 1183441 (0.00091) [2022-07-11 12:03:21,246][25689] Fps is (10 sec: 5512.5, 60 sec: 5530.9, 300 sec: 5531.5). Total num frames: 1211843584. Throughput: 0: 5705.9. Samples: 1211850038. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:21,248][25689] Avg episode reward: [(0, '0.247')] [2022-07-11 12:03:22,859][26022] Updated weights on worker 0-0, policy_version 1183451 (0.00090) [2022-07-11 12:03:24,808][26022] Updated weights on worker 0-0, policy_version 1183461 (0.00088) [2022-07-11 12:03:26,329][25689] Fps is (10 sec: 5476.8, 60 sec: 5546.8, 300 sec: 5534.6). Total num frames: 1211872256. Throughput: 0: 4986.0. Samples: 1211866476. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:26,331][25689] Avg episode reward: [(0, '0.091')] [2022-07-11 12:03:26,685][26022] Updated weights on worker 0-0, policy_version 1183471 (0.00083) [2022-07-11 12:03:28,394][26022] Updated weights on worker 0-0, policy_version 1183481 (0.00097) [2022-07-11 12:03:30,452][26022] Updated weights on worker 0-0, policy_version 1183491 (0.00091) [2022-07-11 12:03:31,344][25689] Fps is (10 sec: 5679.2, 60 sec: 5547.1, 300 sec: 5541.6). Total num frames: 1211900928. Throughput: 0: 5792.2. Samples: 1211899606. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:31,344][25689] Avg episode reward: [(0, '0.283')] [2022-07-11 12:03:31,960][26022] Updated weights on worker 0-0, policy_version 1183501 (0.00089) [2022-07-11 12:03:34,175][26022] Updated weights on worker 0-0, policy_version 1183511 (0.00083) [2022-07-11 12:03:35,769][26022] Updated weights on worker 0-0, policy_version 1183521 (0.00087) [2022-07-11 12:03:36,369][25689] Fps is (10 sec: 5609.5, 60 sec: 5534.5, 300 sec: 5534.7). Total num frames: 1211928576. Throughput: 0: 5786.3. Samples: 1211933234. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:36,371][25689] Avg episode reward: [(0, '-0.608')] [2022-07-11 12:03:37,672][26022] Updated weights on worker 0-0, policy_version 1183531 (0.00086) [2022-07-11 12:03:39,305][26022] Updated weights on worker 0-0, policy_version 1183541 (0.00085) [2022-07-11 12:03:41,388][26022] Updated weights on worker 0-0, policy_version 1183551 (0.00090) [2022-07-11 12:03:41,391][25689] Fps is (10 sec: 5401.7, 60 sec: 5522.3, 300 sec: 5532.4). Total num frames: 1211955200. Throughput: 0: 4954.6. Samples: 1211949846. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:41,391][25689] Avg episode reward: [(0, '-0.322')] [2022-07-11 12:03:42,930][26022] Updated weights on worker 0-0, policy_version 1183561 (0.00088) [2022-07-11 12:03:45,036][26022] Updated weights on worker 0-0, policy_version 1183571 (0.00088) [2022-07-11 12:03:46,496][25689] Fps is (10 sec: 5662.7, 60 sec: 5568.7, 300 sec: 5540.8). Total num frames: 1211985920. Throughput: 0: 5813.8. Samples: 1211983720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:46,496][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 12:03:46,622][26022] Updated weights on worker 0-0, policy_version 1183581 (0.00073) [2022-07-11 12:03:48,831][26022] Updated weights on worker 0-0, policy_version 1183591 (0.00084) [2022-07-11 12:03:50,414][26022] Updated weights on worker 0-0, policy_version 1183601 (0.00090) [2022-07-11 12:03:51,541][25689] Fps is (10 sec: 5649.6, 60 sec: 5530.9, 300 sec: 5537.1). Total num frames: 1212012544. Throughput: 0: 5819.0. Samples: 1212017134. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:51,541][25689] Avg episode reward: [(0, '0.070')] [2022-07-11 12:03:52,423][26022] Updated weights on worker 0-0, policy_version 1183611 (0.00100) [2022-07-11 12:03:54,325][26022] Updated weights on worker 0-0, policy_version 1183621 (0.00088) [2022-07-11 12:03:56,004][26022] Updated weights on worker 0-0, policy_version 1183631 (0.00118) [2022-07-11 12:03:56,555][25689] Fps is (10 sec: 5496.9, 60 sec: 5548.7, 300 sec: 5537.3). Total num frames: 1212041216. Throughput: 0: 5787.6. Samples: 1212050062. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:03:56,556][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 12:03:57,898][26022] Updated weights on worker 0-0, policy_version 1183641 (0.00088) [2022-07-11 12:03:59,809][26022] Updated weights on worker 0-0, policy_version 1183651 (0.00092) [2022-07-11 12:04:01,533][26022] Updated weights on worker 0-0, policy_version 1183661 (0.00088) [2022-07-11 12:04:01,590][25689] Fps is (10 sec: 5604.2, 60 sec: 5531.0, 300 sec: 5541.9). Total num frames: 1212068864. Throughput: 0: 5786.3. Samples: 1212066726. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:01,591][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 12:04:03,789][26022] Updated weights on worker 0-0, policy_version 1183671 (0.00086) [2022-07-11 12:04:04,410][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:04:04,435][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001183674_1212082176.pth [2022-07-11 12:04:04,435][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001181727_1210088448.pth [2022-07-11 12:04:05,727][26022] Updated weights on worker 0-0, policy_version 1183681 (0.00089) [2022-07-11 12:04:06,684][25689] Fps is (10 sec: 5358.3, 60 sec: 5535.7, 300 sec: 5540.5). Total num frames: 1212095488. Throughput: 0: 5679.0. Samples: 1212098368. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:06,685][25689] Avg episode reward: [(0, '1.202')] [2022-07-11 12:04:07,370][26022] Updated weights on worker 0-0, policy_version 1183691 (0.00086) [2022-07-11 12:04:09,483][26022] Updated weights on worker 0-0, policy_version 1183701 (0.00079) [2022-07-11 12:04:11,216][26022] Updated weights on worker 0-0, policy_version 1183711 (0.00088) [2022-07-11 12:04:11,719][25689] Fps is (10 sec: 5257.3, 60 sec: 5517.4, 300 sec: 5526.9). Total num frames: 1212122112. Throughput: 0: 5677.4. Samples: 1212131692. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:11,719][25689] Avg episode reward: [(0, '1.006')] [2022-07-11 12:04:13,108][26022] Updated weights on worker 0-0, policy_version 1183721 (0.00087) [2022-07-11 12:04:14,947][26022] Updated weights on worker 0-0, policy_version 1183731 (0.00078) [2022-07-11 12:04:16,498][26022] Updated weights on worker 0-0, policy_version 1183741 (0.00059) [2022-07-11 12:04:16,731][25689] Fps is (10 sec: 5503.8, 60 sec: 5517.9, 300 sec: 5530.2). Total num frames: 1212150784. Throughput: 0: 4879.5. Samples: 1212148508. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:16,731][25689] Avg episode reward: [(0, '1.302')] [2022-07-11 12:04:18,495][26022] Updated weights on worker 0-0, policy_version 1183751 (0.00092) [2022-07-11 12:04:20,237][26022] Updated weights on worker 0-0, policy_version 1183761 (0.00090) [2022-07-11 12:04:21,773][25689] Fps is (10 sec: 5703.4, 60 sec: 5549.2, 300 sec: 5540.9). Total num frames: 1212179456. Throughput: 0: 5720.2. Samples: 1212182174. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:21,773][25689] Avg episode reward: [(0, '1.387')] [2022-07-11 12:04:22,223][26022] Updated weights on worker 0-0, policy_version 1183771 (0.00091) [2022-07-11 12:04:23,925][26022] Updated weights on worker 0-0, policy_version 1183781 (0.00082) [2022-07-11 12:04:25,948][26022] Updated weights on worker 0-0, policy_version 1183791 (0.00093) [2022-07-11 12:04:26,892][25689] Fps is (10 sec: 5442.0, 60 sec: 5512.1, 300 sec: 5529.1). Total num frames: 1212206080. Throughput: 0: 5777.2. Samples: 1212215112. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:26,892][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 12:04:27,692][26022] Updated weights on worker 0-0, policy_version 1183801 (0.00088) [2022-07-11 12:04:29,663][26022] Updated weights on worker 0-0, policy_version 1183811 (0.00088) [2022-07-11 12:04:31,246][26022] Updated weights on worker 0-0, policy_version 1183821 (0.00089) [2022-07-11 12:04:31,939][25689] Fps is (10 sec: 5439.5, 60 sec: 5509.1, 300 sec: 5532.8). Total num frames: 1212234752. Throughput: 0: 4943.1. Samples: 1212231642. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:31,939][25689] Avg episode reward: [(0, '-0.499')] [2022-07-11 12:04:33,431][26022] Updated weights on worker 0-0, policy_version 1183831 (0.00084) [2022-07-11 12:04:35,215][26022] Updated weights on worker 0-0, policy_version 1183841 (0.00086) [2022-07-11 12:04:36,935][26022] Updated weights on worker 0-0, policy_version 1183851 (0.00085) [2022-07-11 12:04:37,012][25689] Fps is (10 sec: 5666.3, 60 sec: 5521.7, 300 sec: 5535.8). Total num frames: 1212263424. Throughput: 0: 5737.2. Samples: 1212264864. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2022-07-11 12:04:37,012][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 12:04:38,932][26022] Updated weights on worker 0-0, policy_version 1183861 (0.00091) [2022-07-11 12:04:40,748][26022] Updated weights on worker 0-0, policy_version 1183871 (0.00091) [2022-07-11 12:04:42,023][25689] Fps is (10 sec: 5483.1, 60 sec: 5522.6, 300 sec: 5530.6). Total num frames: 1212290048. Throughput: 0: 5718.2. Samples: 1212297970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:04:42,024][25689] Avg episode reward: [(0, '-0.197')] [2022-07-11 12:04:42,609][26022] Updated weights on worker 0-0, policy_version 1183881 (0.00091) [2022-07-11 12:04:44,468][26022] Updated weights on worker 0-0, policy_version 1183891 (0.00090) [2022-07-11 12:04:46,361][26022] Updated weights on worker 0-0, policy_version 1183901 (0.00085) [2022-07-11 12:04:47,147][25689] Fps is (10 sec: 5455.7, 60 sec: 5487.1, 300 sec: 5532.6). Total num frames: 1212318720. Throughput: 0: 4915.4. Samples: 1212314676. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:04:47,148][25689] Avg episode reward: [(0, '-1.043')] [2022-07-11 12:04:48,008][26022] Updated weights on worker 0-0, policy_version 1183911 (0.00091) [2022-07-11 12:04:49,915][26022] Updated weights on worker 0-0, policy_version 1183921 (0.00083) [2022-07-11 12:04:51,802][26022] Updated weights on worker 0-0, policy_version 1183931 (0.00084) [2022-07-11 12:04:52,176][25689] Fps is (10 sec: 5648.3, 60 sec: 5522.3, 300 sec: 5532.3). Total num frames: 1212347392. Throughput: 0: 5781.8. Samples: 1212348650. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:04:52,176][25689] Avg episode reward: [(0, '-1.063')] [2022-07-11 12:04:53,657][26022] Updated weights on worker 0-0, policy_version 1183941 (0.00081) [2022-07-11 12:04:55,421][26022] Updated weights on worker 0-0, policy_version 1183951 (0.00088) [2022-07-11 12:04:57,212][25689] Fps is (10 sec: 5595.9, 60 sec: 5503.5, 300 sec: 5532.1). Total num frames: 1212375040. Throughput: 0: 5803.3. Samples: 1212382090. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:04:57,213][25689] Avg episode reward: [(0, '-0.952')] [2022-07-11 12:04:57,229][26022] Updated weights on worker 0-0, policy_version 1183961 (0.00088) [2022-07-11 12:04:59,196][26022] Updated weights on worker 0-0, policy_version 1183971 (0.00087) [2022-07-11 12:05:00,812][26022] Updated weights on worker 0-0, policy_version 1183981 (0.00088) [2022-07-11 12:05:02,235][25689] Fps is (10 sec: 5395.5, 60 sec: 5487.7, 300 sec: 5529.5). Total num frames: 1212401664. Throughput: 0: 4991.8. Samples: 1212398860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:02,235][25689] Avg episode reward: [(0, '0.926')] [2022-07-11 12:05:03,267][26022] Updated weights on worker 0-0, policy_version 1183991 (0.00083) [2022-07-11 12:05:05,120][26022] Updated weights on worker 0-0, policy_version 1184001 (0.00090) [2022-07-11 12:05:06,772][26022] Updated weights on worker 0-0, policy_version 1184011 (0.00086) [2022-07-11 12:05:07,358][25689] Fps is (10 sec: 5349.3, 60 sec: 5502.0, 300 sec: 5534.2). Total num frames: 1212429312. Throughput: 0: 5707.0. Samples: 1212430016. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:07,358][25689] Avg episode reward: [(0, '1.164')] [2022-07-11 12:05:08,876][26022] Updated weights on worker 0-0, policy_version 1184021 (0.00088) [2022-07-11 12:05:10,664][26022] Updated weights on worker 0-0, policy_version 1184031 (0.00090) [2022-07-11 12:05:12,335][26022] Updated weights on worker 0-0, policy_version 1184041 (0.00090) [2022-07-11 12:05:12,359][25689] Fps is (10 sec: 5562.9, 60 sec: 5538.8, 300 sec: 5534.6). Total num frames: 1212457984. Throughput: 0: 5687.7. Samples: 1212463444. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:12,361][25689] Avg episode reward: [(0, '0.999')] [2022-07-11 12:05:14,439][26022] Updated weights on worker 0-0, policy_version 1184051 (0.00104) [2022-07-11 12:05:15,982][26022] Updated weights on worker 0-0, policy_version 1184061 (0.00082) [2022-07-11 12:05:17,405][25689] Fps is (10 sec: 5503.7, 60 sec: 5502.0, 300 sec: 5530.8). Total num frames: 1212484608. Throughput: 0: 4854.7. Samples: 1212480120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:17,405][25689] Avg episode reward: [(0, '0.924')] [2022-07-11 12:05:18,019][26022] Updated weights on worker 0-0, policy_version 1184071 (0.00087) [2022-07-11 12:05:19,652][26022] Updated weights on worker 0-0, policy_version 1184081 (0.00097) [2022-07-11 12:05:21,458][26022] Updated weights on worker 0-0, policy_version 1184091 (0.00085) [2022-07-11 12:05:22,407][25689] Fps is (10 sec: 5502.9, 60 sec: 5505.5, 300 sec: 5532.0). Total num frames: 1212513280. Throughput: 0: 5692.7. Samples: 1212513698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:22,408][25689] Avg episode reward: [(0, '-1.029')] [2022-07-11 12:05:23,457][26022] Updated weights on worker 0-0, policy_version 1184101 (0.00090) [2022-07-11 12:05:25,181][26022] Updated weights on worker 0-0, policy_version 1184111 (0.00092) [2022-07-11 12:05:27,307][26022] Updated weights on worker 0-0, policy_version 1184121 (0.00099) [2022-07-11 12:05:27,460][25689] Fps is (10 sec: 5601.0, 60 sec: 5528.5, 300 sec: 5535.1). Total num frames: 1212540928. Throughput: 0: 5820.4. Samples: 1212547020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:27,460][25689] Avg episode reward: [(0, '-1.208')] [2022-07-11 12:05:29,016][26022] Updated weights on worker 0-0, policy_version 1184131 (0.00095) [2022-07-11 12:05:30,970][26022] Updated weights on worker 0-0, policy_version 1184141 (0.00098) [2022-07-11 12:05:32,510][25689] Fps is (10 sec: 5575.0, 60 sec: 5528.2, 300 sec: 5534.5). Total num frames: 1212569600. Throughput: 0: 4963.1. Samples: 1212563454. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:32,510][25689] Avg episode reward: [(0, '-0.983')] [2022-07-11 12:05:32,580][26022] Updated weights on worker 0-0, policy_version 1184151 (0.00624) [2022-07-11 12:05:34,679][26022] Updated weights on worker 0-0, policy_version 1184161 (0.00088) [2022-07-11 12:05:36,308][26022] Updated weights on worker 0-0, policy_version 1184171 (0.01554) [2022-07-11 12:05:37,513][25689] Fps is (10 sec: 5500.3, 60 sec: 5500.7, 300 sec: 5527.9). Total num frames: 1212596224. Throughput: 0: 5795.8. Samples: 1212596666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:37,514][25689] Avg episode reward: [(0, '-1.299')] [2022-07-11 12:05:38,344][26022] Updated weights on worker 0-0, policy_version 1184181 (0.00090) [2022-07-11 12:05:40,012][26022] Updated weights on worker 0-0, policy_version 1184191 (0.00093) [2022-07-11 12:05:42,004][26022] Updated weights on worker 0-0, policy_version 1184201 (0.00081) [2022-07-11 12:05:42,532][25689] Fps is (10 sec: 5415.2, 60 sec: 5517.0, 300 sec: 5529.1). Total num frames: 1212623872. Throughput: 0: 5771.0. Samples: 1212629836. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:42,532][25689] Avg episode reward: [(0, '-1.371')] [2022-07-11 12:05:43,844][26022] Updated weights on worker 0-0, policy_version 1184211 (0.00092) [2022-07-11 12:05:45,616][26022] Updated weights on worker 0-0, policy_version 1184221 (0.00088) [2022-07-11 12:05:47,439][26022] Updated weights on worker 0-0, policy_version 1184231 (0.00085) [2022-07-11 12:05:47,617][25689] Fps is (10 sec: 5675.2, 60 sec: 5537.4, 300 sec: 5531.4). Total num frames: 1212653568. Throughput: 0: 4932.0. Samples: 1212646438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:47,618][25689] Avg episode reward: [(0, '-0.656')] [2022-07-11 12:05:49,336][26022] Updated weights on worker 0-0, policy_version 1184241 (0.00088) [2022-07-11 12:05:51,252][26022] Updated weights on worker 0-0, policy_version 1184251 (0.00083) [2022-07-11 12:05:52,662][25689] Fps is (10 sec: 5559.5, 60 sec: 5502.1, 300 sec: 5524.0). Total num frames: 1212680192. Throughput: 0: 5781.7. Samples: 1212679970. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:52,663][25689] Avg episode reward: [(0, '0.187')] [2022-07-11 12:05:53,135][26022] Updated weights on worker 0-0, policy_version 1184261 (0.00088) [2022-07-11 12:05:54,956][26022] Updated weights on worker 0-0, policy_version 1184271 (0.00094) [2022-07-11 12:05:56,593][26022] Updated weights on worker 0-0, policy_version 1184281 (0.00097) [2022-07-11 12:05:57,681][25689] Fps is (10 sec: 5392.9, 60 sec: 5503.6, 300 sec: 5527.3). Total num frames: 1212707840. Throughput: 0: 5779.3. Samples: 1212713222. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:05:57,683][25689] Avg episode reward: [(0, '0.158')] [2022-07-11 12:05:58,844][26022] Updated weights on worker 0-0, policy_version 1184291 (0.00085) [2022-07-11 12:06:00,287][26022] Updated weights on worker 0-0, policy_version 1184301 (0.00084) [2022-07-11 12:06:02,627][26022] Updated weights on worker 0-0, policy_version 1184311 (0.00083) [2022-07-11 12:06:02,703][25689] Fps is (10 sec: 5405.3, 60 sec: 5503.7, 300 sec: 5524.8). Total num frames: 1212734464. Throughput: 0: 4950.0. Samples: 1212729682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:02,703][25689] Avg episode reward: [(0, '-0.149')] [2022-07-11 12:06:04,612][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:06:04,633][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001184321_1212744704.pth [2022-07-11 12:06:04,634][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001182376_1210753024.pth [2022-07-11 12:06:04,635][26022] Updated weights on worker 0-0, policy_version 1184321 (0.00091) [2022-07-11 12:06:06,227][26022] Updated weights on worker 0-0, policy_version 1184331 (0.00088) [2022-07-11 12:06:07,777][25689] Fps is (10 sec: 5477.0, 60 sec: 5525.1, 300 sec: 5527.3). Total num frames: 1212763136. Throughput: 0: 5697.0. Samples: 1212761288. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:07,778][25689] Avg episode reward: [(0, '-0.631')] [2022-07-11 12:06:08,166][26022] Updated weights on worker 0-0, policy_version 1184341 (0.00083) [2022-07-11 12:06:09,980][26022] Updated weights on worker 0-0, policy_version 1184351 (0.00086) [2022-07-11 12:06:11,951][26022] Updated weights on worker 0-0, policy_version 1184361 (0.00085) [2022-07-11 12:06:12,845][25689] Fps is (10 sec: 5553.2, 60 sec: 5502.1, 300 sec: 5527.3). Total num frames: 1212790784. Throughput: 0: 5690.3. Samples: 1212794814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:12,845][25689] Avg episode reward: [(0, '-0.726')] [2022-07-11 12:06:13,772][26022] Updated weights on worker 0-0, policy_version 1184371 (0.00087) [2022-07-11 12:06:15,438][26022] Updated weights on worker 0-0, policy_version 1184381 (0.00076) [2022-07-11 12:06:17,181][26022] Updated weights on worker 0-0, policy_version 1184391 (0.00082) [2022-07-11 12:06:17,853][25689] Fps is (10 sec: 5487.9, 60 sec: 5522.5, 300 sec: 5520.6). Total num frames: 1212818432. Throughput: 0: 5711.2. Samples: 1212828428. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:17,854][25689] Avg episode reward: [(0, '-0.372')] [2022-07-11 12:06:19,231][26022] Updated weights on worker 0-0, policy_version 1184401 (0.00103) [2022-07-11 12:06:21,091][26022] Updated weights on worker 0-0, policy_version 1184411 (0.00091) [2022-07-11 12:06:22,934][25689] Fps is (10 sec: 5480.6, 60 sec: 5498.4, 300 sec: 5525.2). Total num frames: 1212846080. Throughput: 0: 5695.6. Samples: 1212844912. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:22,936][25689] Avg episode reward: [(0, '0.710')] [2022-07-11 12:06:23,105][26022] Updated weights on worker 0-0, policy_version 1184421 (0.00083) [2022-07-11 12:06:24,604][26022] Updated weights on worker 0-0, policy_version 1184431 (0.00088) [2022-07-11 12:06:26,592][26022] Updated weights on worker 0-0, policy_version 1184441 (0.00091) [2022-07-11 12:06:28,013][25689] Fps is (10 sec: 5644.0, 60 sec: 5529.8, 300 sec: 5530.9). Total num frames: 1212875776. Throughput: 0: 5800.5. Samples: 1212878668. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:28,014][25689] Avg episode reward: [(0, '0.634')] [2022-07-11 12:06:28,607][26022] Updated weights on worker 0-0, policy_version 1184451 (0.00086) [2022-07-11 12:06:30,249][26022] Updated weights on worker 0-0, policy_version 1184461 (0.00088) [2022-07-11 12:06:32,301][26022] Updated weights on worker 0-0, policy_version 1184471 (0.00080) [2022-07-11 12:06:33,023][25689] Fps is (10 sec: 5684.1, 60 sec: 5516.6, 300 sec: 5527.6). Total num frames: 1212903424. Throughput: 0: 5797.9. Samples: 1212911804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:33,023][25689] Avg episode reward: [(0, '0.685')] [2022-07-11 12:06:34,016][26022] Updated weights on worker 0-0, policy_version 1184481 (0.00080) [2022-07-11 12:06:35,776][26022] Updated weights on worker 0-0, policy_version 1184491 (0.00095) [2022-07-11 12:06:37,668][26022] Updated weights on worker 0-0, policy_version 1184501 (0.00090) [2022-07-11 12:06:38,031][25689] Fps is (10 sec: 5417.5, 60 sec: 5516.1, 300 sec: 5527.7). Total num frames: 1212930048. Throughput: 0: 4959.1. Samples: 1212928492. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:38,032][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 12:06:39,492][26022] Updated weights on worker 0-0, policy_version 1184511 (0.00087) [2022-07-11 12:06:41,207][26022] Updated weights on worker 0-0, policy_version 1184521 (0.00094) [2022-07-11 12:06:43,048][25689] Fps is (10 sec: 5413.7, 60 sec: 5516.3, 300 sec: 5519.3). Total num frames: 1212957696. Throughput: 0: 5808.4. Samples: 1212961740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:43,048][25689] Avg episode reward: [(0, '0.928')] [2022-07-11 12:06:43,319][26022] Updated weights on worker 0-0, policy_version 1184531 (0.00095) [2022-07-11 12:06:44,980][26022] Updated weights on worker 0-0, policy_version 1184541 (0.00090) [2022-07-11 12:06:47,035][26022] Updated weights on worker 0-0, policy_version 1184551 (0.00081) [2022-07-11 12:06:48,112][25689] Fps is (10 sec: 5688.2, 60 sec: 5518.2, 300 sec: 5528.7). Total num frames: 1212987392. Throughput: 0: 5800.1. Samples: 1212995244. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:48,113][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 12:06:48,740][26022] Updated weights on worker 0-0, policy_version 1184561 (0.00089) [2022-07-11 12:06:50,632][26022] Updated weights on worker 0-0, policy_version 1184571 (0.00089) [2022-07-11 12:06:52,254][26022] Updated weights on worker 0-0, policy_version 1184581 (0.00082) [2022-07-11 12:06:53,168][25689] Fps is (10 sec: 5564.8, 60 sec: 5517.2, 300 sec: 5524.8). Total num frames: 1213014016. Throughput: 0: 4979.0. Samples: 1213012110. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:53,169][25689] Avg episode reward: [(0, '1.080')] [2022-07-11 12:06:54,431][26022] Updated weights on worker 0-0, policy_version 1184592 (0.00084) [2022-07-11 12:06:56,248][26022] Updated weights on worker 0-0, policy_version 1184602 (0.00092) [2022-07-11 12:06:58,175][25689] Fps is (10 sec: 5393.3, 60 sec: 5518.3, 300 sec: 5521.7). Total num frames: 1213041664. Throughput: 0: 5821.3. Samples: 1213045756. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:06:58,176][25689] Avg episode reward: [(0, '0.767')] [2022-07-11 12:06:58,251][26022] Updated weights on worker 0-0, policy_version 1184612 (0.00085) [2022-07-11 12:06:59,725][26022] Updated weights on worker 0-0, policy_version 1184622 (0.00084) [2022-07-11 12:07:02,269][26022] Updated weights on worker 0-0, policy_version 1184632 (0.00089) [2022-07-11 12:07:03,177][25689] Fps is (10 sec: 5422.8, 60 sec: 5520.2, 300 sec: 5526.3). Total num frames: 1213068288. Throughput: 0: 5716.9. Samples: 1213076814. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:03,177][25689] Avg episode reward: [(0, '1.224')] [2022-07-11 12:07:03,753][26022] Updated weights on worker 0-0, policy_version 1184642 (0.00086) [2022-07-11 12:07:05,912][26022] Updated weights on worker 0-0, policy_version 1184652 (0.00051) [2022-07-11 12:07:07,499][26022] Updated weights on worker 0-0, policy_version 1184662 (0.00081) [2022-07-11 12:07:08,297][25689] Fps is (10 sec: 5361.9, 60 sec: 5499.0, 300 sec: 5522.2). Total num frames: 1213095936. Throughput: 0: 4870.9. Samples: 1213093560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:08,297][25689] Avg episode reward: [(0, '0.743')] [2022-07-11 12:07:09,505][26022] Updated weights on worker 0-0, policy_version 1184672 (0.00083) [2022-07-11 12:07:11,331][26022] Updated weights on worker 0-0, policy_version 1184682 (0.00094) [2022-07-11 12:07:13,242][26022] Updated weights on worker 0-0, policy_version 1184692 (0.00089) [2022-07-11 12:07:13,333][25689] Fps is (10 sec: 5545.3, 60 sec: 5518.8, 300 sec: 5525.2). Total num frames: 1213124608. Throughput: 0: 5701.1. Samples: 1213127070. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:13,335][25689] Avg episode reward: [(0, '-0.302')] [2022-07-11 12:07:15,183][26022] Updated weights on worker 0-0, policy_version 1184702 (0.00091) [2022-07-11 12:07:16,791][26022] Updated weights on worker 0-0, policy_version 1184712 (0.00086) [2022-07-11 12:07:18,419][25689] Fps is (10 sec: 5665.3, 60 sec: 5528.7, 300 sec: 5523.9). Total num frames: 1213153280. Throughput: 0: 5674.6. Samples: 1213160632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:18,419][25689] Avg episode reward: [(0, '-1.001')] [2022-07-11 12:07:18,662][26022] Updated weights on worker 0-0, policy_version 1184722 (0.00053) [2022-07-11 12:07:20,495][26022] Updated weights on worker 0-0, policy_version 1184732 (0.00078) [2022-07-11 12:07:22,391][26022] Updated weights on worker 0-0, policy_version 1184742 (0.00089) [2022-07-11 12:07:23,421][25689] Fps is (10 sec: 5582.9, 60 sec: 5535.9, 300 sec: 5525.2). Total num frames: 1213180928. Throughput: 0: 4968.6. Samples: 1213177400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:23,421][25689] Avg episode reward: [(0, '-1.053')] [2022-07-11 12:07:24,207][26022] Updated weights on worker 0-0, policy_version 1184752 (0.00084) [2022-07-11 12:07:25,939][26022] Updated weights on worker 0-0, policy_version 1184762 (0.00070) [2022-07-11 12:07:27,791][26022] Updated weights on worker 0-0, policy_version 1184772 (0.00092) [2022-07-11 12:07:28,463][25689] Fps is (10 sec: 5607.4, 60 sec: 5522.4, 300 sec: 5524.8). Total num frames: 1213209600. Throughput: 0: 5815.9. Samples: 1213210842. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:28,465][25689] Avg episode reward: [(0, '-1.401')] [2022-07-11 12:07:29,859][26022] Updated weights on worker 0-0, policy_version 1184782 (0.00076) [2022-07-11 12:07:31,624][26022] Updated weights on worker 0-0, policy_version 1184792 (0.00088) [2022-07-11 12:07:33,383][26022] Updated weights on worker 0-0, policy_version 1184802 (0.00412) [2022-07-11 12:07:33,478][25689] Fps is (10 sec: 5600.2, 60 sec: 5521.9, 300 sec: 5522.4). Total num frames: 1213237248. Throughput: 0: 5825.2. Samples: 1213244416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:33,478][25689] Avg episode reward: [(0, '-1.326')] [2022-07-11 12:07:35,083][26022] Updated weights on worker 0-0, policy_version 1184812 (0.00087) [2022-07-11 12:07:36,951][26022] Updated weights on worker 0-0, policy_version 1184822 (0.00087) [2022-07-11 12:07:38,525][25689] Fps is (10 sec: 5596.9, 60 sec: 5552.1, 300 sec: 5526.3). Total num frames: 1213265920. Throughput: 0: 5011.9. Samples: 1213261404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:38,526][25689] Avg episode reward: [(0, '-1.512')] [2022-07-11 12:07:38,905][26022] Updated weights on worker 0-0, policy_version 1184832 (0.00087) [2022-07-11 12:07:40,704][26022] Updated weights on worker 0-0, policy_version 1184842 (0.00092) [2022-07-11 12:07:42,486][26022] Updated weights on worker 0-0, policy_version 1184852 (0.00097) [2022-07-11 12:07:43,542][25689] Fps is (10 sec: 5494.0, 60 sec: 5535.2, 300 sec: 5523.6). Total num frames: 1213292544. Throughput: 0: 5827.9. Samples: 1213294666. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:43,543][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 12:07:44,239][26022] Updated weights on worker 0-0, policy_version 1184862 (0.00086) [2022-07-11 12:07:46,182][26022] Updated weights on worker 0-0, policy_version 1184872 (0.00084) [2022-07-11 12:07:48,183][26022] Updated weights on worker 0-0, policy_version 1184882 (0.00086) [2022-07-11 12:07:48,685][25689] Fps is (10 sec: 5543.4, 60 sec: 5528.0, 300 sec: 5524.4). Total num frames: 1213322240. Throughput: 0: 5806.1. Samples: 1213328256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:48,686][25689] Avg episode reward: [(0, '0.019')] [2022-07-11 12:07:49,862][26022] Updated weights on worker 0-0, policy_version 1184892 (0.00094) [2022-07-11 12:07:51,628][26022] Updated weights on worker 0-0, policy_version 1184902 (0.00091) [2022-07-11 12:07:53,407][26022] Updated weights on worker 0-0, policy_version 1184912 (0.00090) [2022-07-11 12:07:53,687][25689] Fps is (10 sec: 5652.8, 60 sec: 5549.9, 300 sec: 5524.9). Total num frames: 1213349888. Throughput: 0: 4978.9. Samples: 1213345036. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:53,687][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 12:07:55,244][26022] Updated weights on worker 0-0, policy_version 1184922 (0.00089) [2022-07-11 12:07:57,257][26022] Updated weights on worker 0-0, policy_version 1184932 (0.00087) [2022-07-11 12:07:58,713][25689] Fps is (10 sec: 5718.6, 60 sec: 5582.0, 300 sec: 5528.3). Total num frames: 1213379584. Throughput: 0: 5801.3. Samples: 1213378516. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:07:58,713][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 12:07:58,956][26022] Updated weights on worker 0-0, policy_version 1184942 (0.00085) [2022-07-11 12:08:01,103][26022] Updated weights on worker 0-0, policy_version 1184952 (0.00071) [2022-07-11 12:08:03,043][26022] Updated weights on worker 0-0, policy_version 1184962 (0.00092) [2022-07-11 12:08:03,727][25689] Fps is (10 sec: 5303.3, 60 sec: 5530.0, 300 sec: 5520.4). Total num frames: 1213403136. Throughput: 0: 5707.7. Samples: 1213409876. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:03,728][25689] Avg episode reward: [(0, '0.844')] [2022-07-11 12:08:04,658][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:08:04,683][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001184971_1213410304.pth [2022-07-11 12:08:04,683][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001183024_1211416576.pth [2022-07-11 12:08:04,992][26022] Updated weights on worker 0-0, policy_version 1184972 (0.00086) [2022-07-11 12:08:06,747][26022] Updated weights on worker 0-0, policy_version 1184982 (0.00090) [2022-07-11 12:08:08,589][26022] Updated weights on worker 0-0, policy_version 1184992 (0.00084) [2022-07-11 12:08:08,843][25689] Fps is (10 sec: 5155.2, 60 sec: 5547.4, 300 sec: 5522.1). Total num frames: 1213431808. Throughput: 0: 4877.5. Samples: 1213426576. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:08,844][25689] Avg episode reward: [(0, '1.163')] [2022-07-11 12:08:10,445][26022] Updated weights on worker 0-0, policy_version 1185002 (0.00087) [2022-07-11 12:08:12,153][26022] Updated weights on worker 0-0, policy_version 1185012 (0.00085) [2022-07-11 12:08:13,902][25689] Fps is (10 sec: 5736.8, 60 sec: 5562.2, 300 sec: 5524.7). Total num frames: 1213461504. Throughput: 0: 5700.0. Samples: 1213460260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:13,903][25689] Avg episode reward: [(0, '1.275')] [2022-07-11 12:08:14,091][26022] Updated weights on worker 0-0, policy_version 1185022 (0.00091) [2022-07-11 12:08:16,053][26022] Updated weights on worker 0-0, policy_version 1185032 (0.00087) [2022-07-11 12:08:17,662][26022] Updated weights on worker 0-0, policy_version 1185042 (0.00092) [2022-07-11 12:08:18,933][25689] Fps is (10 sec: 5581.6, 60 sec: 5533.4, 300 sec: 5524.4). Total num frames: 1213488128. Throughput: 0: 5715.2. Samples: 1213494080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:18,935][25689] Avg episode reward: [(0, '1.308')] [2022-07-11 12:08:19,523][26022] Updated weights on worker 0-0, policy_version 1185052 (0.00886) [2022-07-11 12:08:21,362][26022] Updated weights on worker 0-0, policy_version 1185062 (0.00090) [2022-07-11 12:08:23,038][26022] Updated weights on worker 0-0, policy_version 1185072 (0.00091) [2022-07-11 12:08:23,963][25689] Fps is (10 sec: 5699.7, 60 sec: 5581.6, 300 sec: 5532.3). Total num frames: 1213518848. Throughput: 0: 4998.4. Samples: 1213511020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:23,963][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 12:08:25,064][26022] Updated weights on worker 0-0, policy_version 1185082 (0.00090) [2022-07-11 12:08:26,840][26022] Updated weights on worker 0-0, policy_version 1185092 (0.00092) [2022-07-11 12:08:29,023][25689] Fps is (10 sec: 5581.7, 60 sec: 5529.1, 300 sec: 5521.1). Total num frames: 1213544448. Throughput: 0: 5817.4. Samples: 1213543974. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:29,024][25689] Avg episode reward: [(0, '-1.652')] [2022-07-11 12:08:29,026][26022] Updated weights on worker 0-0, policy_version 1185102 (0.00093) [2022-07-11 12:08:30,641][26022] Updated weights on worker 0-0, policy_version 1185112 (0.00083) [2022-07-11 12:08:32,406][26022] Updated weights on worker 0-0, policy_version 1185122 (0.00086) [2022-07-11 12:08:34,092][25689] Fps is (10 sec: 5358.0, 60 sec: 5541.2, 300 sec: 5523.8). Total num frames: 1213573120. Throughput: 0: 5805.6. Samples: 1213577476. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:34,092][25689] Avg episode reward: [(0, '-2.299')] [2022-07-11 12:08:34,271][26022] Updated weights on worker 0-0, policy_version 1185132 (0.00087) [2022-07-11 12:08:36,263][26022] Updated weights on worker 0-0, policy_version 1185142 (0.00086) [2022-07-11 12:08:38,017][26022] Updated weights on worker 0-0, policy_version 1185152 (0.00089) [2022-07-11 12:08:39,113][25689] Fps is (10 sec: 5582.2, 60 sec: 5526.7, 300 sec: 5527.2). Total num frames: 1213600768. Throughput: 0: 5806.5. Samples: 1213611252. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 12:08:39,114][25689] Avg episode reward: [(0, '-2.413')] [2022-07-11 12:08:39,969][26022] Updated weights on worker 0-0, policy_version 1185162 (0.00086) [2022-07-11 12:08:41,565][26022] Updated weights on worker 0-0, policy_version 1185172 (0.00083) [2022-07-11 12:08:43,403][26022] Updated weights on worker 0-0, policy_version 1185182 (0.00087) [2022-07-11 12:08:44,116][25689] Fps is (10 sec: 5618.4, 60 sec: 5561.8, 300 sec: 5522.2). Total num frames: 1213629440. Throughput: 0: 5805.2. Samples: 1213628014. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:08:44,116][25689] Avg episode reward: [(0, '-2.273')] [2022-07-11 12:08:45,334][26022] Updated weights on worker 0-0, policy_version 1185192 (0.00085) [2022-07-11 12:08:47,092][26022] Updated weights on worker 0-0, policy_version 1185202 (0.00089) [2022-07-11 12:08:49,063][26022] Updated weights on worker 0-0, policy_version 1185212 (0.00085) [2022-07-11 12:08:49,255][25689] Fps is (10 sec: 5553.1, 60 sec: 5528.3, 300 sec: 5523.9). Total num frames: 1213657088. Throughput: 0: 5798.4. Samples: 1213661284. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:08:49,255][25689] Avg episode reward: [(0, '-2.484')] [2022-07-11 12:08:50,653][26022] Updated weights on worker 0-0, policy_version 1185222 (0.00091) [2022-07-11 12:08:52,826][26022] Updated weights on worker 0-0, policy_version 1185232 (0.00086) [2022-07-11 12:08:54,279][25689] Fps is (10 sec: 5642.5, 60 sec: 5560.1, 300 sec: 5527.2). Total num frames: 1213686784. Throughput: 0: 5796.7. Samples: 1213694494. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:08:54,280][25689] Avg episode reward: [(0, '-1.492')] [2022-07-11 12:08:54,739][26022] Updated weights on worker 0-0, policy_version 1185242 (0.00090) [2022-07-11 12:08:56,451][26022] Updated weights on worker 0-0, policy_version 1185252 (0.00087) [2022-07-11 12:08:58,430][26022] Updated weights on worker 0-0, policy_version 1185262 (0.00088) [2022-07-11 12:08:59,363][25689] Fps is (10 sec: 5673.0, 60 sec: 5521.0, 300 sec: 5526.3). Total num frames: 1213714432. Throughput: 0: 4935.6. Samples: 1213711200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:08:59,364][25689] Avg episode reward: [(0, '0.401')] [2022-07-11 12:09:00,106][26022] Updated weights on worker 0-0, policy_version 1185272 (0.00094) [2022-07-11 12:09:02,388][26022] Updated weights on worker 0-0, policy_version 1185282 (0.00095) [2022-07-11 12:09:04,213][26022] Updated weights on worker 0-0, policy_version 1185292 (0.00087) [2022-07-11 12:09:04,370][25689] Fps is (10 sec: 5276.8, 60 sec: 5555.5, 300 sec: 5524.4). Total num frames: 1213740032. Throughput: 0: 5654.6. Samples: 1213742542. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:04,370][25689] Avg episode reward: [(0, '0.643')] [2022-07-11 12:09:06,156][26022] Updated weights on worker 0-0, policy_version 1185302 (0.00090) [2022-07-11 12:09:07,892][26022] Updated weights on worker 0-0, policy_version 1185312 (0.00081) [2022-07-11 12:09:09,487][25689] Fps is (10 sec: 5360.7, 60 sec: 5555.3, 300 sec: 5529.8). Total num frames: 1213768704. Throughput: 0: 5674.8. Samples: 1213776098. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:09,489][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 12:09:09,662][26022] Updated weights on worker 0-0, policy_version 1185322 (0.00090) [2022-07-11 12:09:11,676][26022] Updated weights on worker 0-0, policy_version 1185332 (0.00091) [2022-07-11 12:09:13,323][26022] Updated weights on worker 0-0, policy_version 1185342 (0.00092) [2022-07-11 12:09:14,533][25689] Fps is (10 sec: 5339.9, 60 sec: 5488.9, 300 sec: 5518.8). Total num frames: 1213794304. Throughput: 0: 4838.0. Samples: 1213792484. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:14,535][25689] Avg episode reward: [(0, '0.170')] [2022-07-11 12:09:15,360][26022] Updated weights on worker 0-0, policy_version 1185352 (0.00088) [2022-07-11 12:09:17,132][26022] Updated weights on worker 0-0, policy_version 1185362 (0.00094) [2022-07-11 12:09:19,060][26022] Updated weights on worker 0-0, policy_version 1185372 (0.00087) [2022-07-11 12:09:19,571][25689] Fps is (10 sec: 5585.5, 60 sec: 5555.9, 300 sec: 5525.8). Total num frames: 1213825024. Throughput: 0: 5675.4. Samples: 1213825886. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:19,571][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 12:09:20,758][26022] Updated weights on worker 0-0, policy_version 1185382 (0.00084) [2022-07-11 12:09:22,843][26022] Updated weights on worker 0-0, policy_version 1185392 (0.00091) [2022-07-11 12:09:24,606][25689] Fps is (10 sec: 5591.2, 60 sec: 5470.9, 300 sec: 5523.9). Total num frames: 1213850624. Throughput: 0: 5756.9. Samples: 1213859042. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:24,607][25689] Avg episode reward: [(0, '0.121')] [2022-07-11 12:09:24,607][26022] Updated weights on worker 0-0, policy_version 1185402 (0.00084) [2022-07-11 12:09:26,552][26022] Updated weights on worker 0-0, policy_version 1185412 (0.00053) [2022-07-11 12:09:28,551][26022] Updated weights on worker 0-0, policy_version 1185422 (0.00094) [2022-07-11 12:09:29,748][25689] Fps is (10 sec: 5332.5, 60 sec: 5514.2, 300 sec: 5522.1). Total num frames: 1213879296. Throughput: 0: 4910.3. Samples: 1213875586. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:29,749][25689] Avg episode reward: [(0, '0.240')] [2022-07-11 12:09:30,097][26022] Updated weights on worker 0-0, policy_version 1185432 (0.00090) [2022-07-11 12:09:32,062][26022] Updated weights on worker 0-0, policy_version 1185442 (0.00085) [2022-07-11 12:09:33,841][26022] Updated weights on worker 0-0, policy_version 1185452 (0.00084) [2022-07-11 12:09:34,791][25689] Fps is (10 sec: 5529.9, 60 sec: 5499.6, 300 sec: 5519.3). Total num frames: 1213906944. Throughput: 0: 5748.5. Samples: 1213908936. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:34,793][25689] Avg episode reward: [(0, '0.673')] [2022-07-11 12:09:35,632][26022] Updated weights on worker 0-0, policy_version 1185462 (0.00093) [2022-07-11 12:09:37,611][26022] Updated weights on worker 0-0, policy_version 1185472 (0.00089) [2022-07-11 12:09:39,477][26022] Updated weights on worker 0-0, policy_version 1185482 (0.00049) [2022-07-11 12:09:39,872][25689] Fps is (10 sec: 5563.0, 60 sec: 5511.0, 300 sec: 5524.8). Total num frames: 1213935616. Throughput: 0: 5727.1. Samples: 1213942156. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:39,874][25689] Avg episode reward: [(0, '0.760')] [2022-07-11 12:09:41,311][26022] Updated weights on worker 0-0, policy_version 1185492 (0.00092) [2022-07-11 12:09:43,027][26022] Updated weights on worker 0-0, policy_version 1185502 (0.00096) [2022-07-11 12:09:44,885][25689] Fps is (10 sec: 5579.6, 60 sec: 5493.3, 300 sec: 5523.5). Total num frames: 1213963264. Throughput: 0: 4926.0. Samples: 1213958934. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:44,885][25689] Avg episode reward: [(0, '0.834')] [2022-07-11 12:09:44,951][26022] Updated weights on worker 0-0, policy_version 1185512 (0.00089) [2022-07-11 12:09:46,788][26022] Updated weights on worker 0-0, policy_version 1185522 (0.00085) [2022-07-11 12:09:48,721][26022] Updated weights on worker 0-0, policy_version 1185532 (0.00081) [2022-07-11 12:09:49,991][25689] Fps is (10 sec: 5667.1, 60 sec: 5530.0, 300 sec: 5525.5). Total num frames: 1213992960. Throughput: 0: 5774.5. Samples: 1213992478. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:49,992][25689] Avg episode reward: [(0, '1.109')] [2022-07-11 12:09:50,413][26022] Updated weights on worker 0-0, policy_version 1185542 (0.00085) [2022-07-11 12:09:52,398][26022] Updated weights on worker 0-0, policy_version 1185552 (0.00085) [2022-07-11 12:09:54,164][26022] Updated weights on worker 0-0, policy_version 1185562 (0.00091) [2022-07-11 12:09:55,083][25689] Fps is (10 sec: 5522.4, 60 sec: 5473.3, 300 sec: 5521.0). Total num frames: 1214019584. Throughput: 0: 5759.4. Samples: 1214025808. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:09:55,084][25689] Avg episode reward: [(0, '0.693')] [2022-07-11 12:09:55,972][26022] Updated weights on worker 0-0, policy_version 1185572 (0.00091) [2022-07-11 12:09:57,844][26022] Updated weights on worker 0-0, policy_version 1185582 (0.00090) [2022-07-11 12:09:59,591][26022] Updated weights on worker 0-0, policy_version 1185592 (0.00081) [2022-07-11 12:10:00,169][25689] Fps is (10 sec: 5533.3, 60 sec: 5506.8, 300 sec: 5530.1). Total num frames: 1214049280. Throughput: 0: 4935.5. Samples: 1214042334. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:00,179][25689] Avg episode reward: [(0, '0.718')] [2022-07-11 12:10:01,611][26022] Updated weights on worker 0-0, policy_version 1185602 (0.00095) [2022-07-11 12:10:03,713][26022] Updated weights on worker 0-0, policy_version 1185612 (0.00051) [2022-07-11 12:10:04,912][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:10:04,925][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001185618_1214072832.pth [2022-07-11 12:10:04,925][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001183674_1212082176.pth [2022-07-11 12:10:05,248][25689] Fps is (10 sec: 5540.5, 60 sec: 5517.1, 300 sec: 5527.5). Total num frames: 1214075904. Throughput: 0: 5642.0. Samples: 1214073826. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:05,251][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 12:10:05,532][26022] Updated weights on worker 0-0, policy_version 1185622 (0.00093) [2022-07-11 12:10:07,492][26022] Updated weights on worker 0-0, policy_version 1185632 (0.00091) [2022-07-11 12:10:09,215][26022] Updated weights on worker 0-0, policy_version 1185642 (0.00084) [2022-07-11 12:10:10,331][25689] Fps is (10 sec: 5240.3, 60 sec: 5486.6, 300 sec: 5519.1). Total num frames: 1214102528. Throughput: 0: 5640.5. Samples: 1214107204. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:10,331][25689] Avg episode reward: [(0, '0.924')] [2022-07-11 12:10:11,047][26022] Updated weights on worker 0-0, policy_version 1185652 (0.00094) [2022-07-11 12:10:12,947][26022] Updated weights on worker 0-0, policy_version 1185662 (0.00094) [2022-07-11 12:10:14,764][26022] Updated weights on worker 0-0, policy_version 1185672 (0.00105) [2022-07-11 12:10:15,341][25689] Fps is (10 sec: 5478.9, 60 sec: 5540.4, 300 sec: 5526.7). Total num frames: 1214131200. Throughput: 0: 5666.7. Samples: 1214140602. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:15,341][25689] Avg episode reward: [(0, '1.317')] [2022-07-11 12:10:16,558][26022] Updated weights on worker 0-0, policy_version 1185682 (0.00087) [2022-07-11 12:10:18,486][26022] Updated weights on worker 0-0, policy_version 1185692 (0.00089) [2022-07-11 12:10:20,308][26022] Updated weights on worker 0-0, policy_version 1185702 (0.00087) [2022-07-11 12:10:20,343][25689] Fps is (10 sec: 5625.3, 60 sec: 5493.0, 300 sec: 5523.2). Total num frames: 1214158848. Throughput: 0: 5696.8. Samples: 1214157256. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:20,343][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 12:10:22,069][26022] Updated weights on worker 0-0, policy_version 1185712 (0.00884) [2022-07-11 12:10:23,926][26022] Updated weights on worker 0-0, policy_version 1185722 (0.00091) [2022-07-11 12:10:25,350][25689] Fps is (10 sec: 5626.8, 60 sec: 5546.2, 300 sec: 5527.5). Total num frames: 1214187520. Throughput: 0: 5821.4. Samples: 1214190846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:25,351][25689] Avg episode reward: [(0, '1.124')] [2022-07-11 12:10:25,745][26022] Updated weights on worker 0-0, policy_version 1185732 (0.00085) [2022-07-11 12:10:27,671][26022] Updated weights on worker 0-0, policy_version 1185742 (0.00085) [2022-07-11 12:10:29,438][26022] Updated weights on worker 0-0, policy_version 1185752 (0.00093) [2022-07-11 12:10:30,403][25689] Fps is (10 sec: 5496.6, 60 sec: 5520.6, 300 sec: 5520.6). Total num frames: 1214214144. Throughput: 0: 5832.2. Samples: 1214224268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:30,403][25689] Avg episode reward: [(0, '0.836')] [2022-07-11 12:10:31,227][26022] Updated weights on worker 0-0, policy_version 1185762 (0.00085) [2022-07-11 12:10:33,279][26022] Updated weights on worker 0-0, policy_version 1185772 (0.00884) [2022-07-11 12:10:35,020][26022] Updated weights on worker 0-0, policy_version 1185782 (0.00097) [2022-07-11 12:10:35,422][25689] Fps is (10 sec: 5490.1, 60 sec: 5539.6, 300 sec: 5527.2). Total num frames: 1214242816. Throughput: 0: 4988.7. Samples: 1214240782. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:35,423][25689] Avg episode reward: [(0, '0.923')] [2022-07-11 12:10:36,883][26022] Updated weights on worker 0-0, policy_version 1185792 (0.00086) [2022-07-11 12:10:38,733][26022] Updated weights on worker 0-0, policy_version 1185802 (0.00087) [2022-07-11 12:10:40,430][25689] Fps is (10 sec: 5616.8, 60 sec: 5529.4, 300 sec: 5527.4). Total num frames: 1214270464. Throughput: 0: 5805.0. Samples: 1214273862. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:40,430][25689] Avg episode reward: [(0, '0.125')] [2022-07-11 12:10:40,534][26022] Updated weights on worker 0-0, policy_version 1185812 (0.00087) [2022-07-11 12:10:42,585][26022] Updated weights on worker 0-0, policy_version 1185822 (0.00090) [2022-07-11 12:10:44,193][26022] Updated weights on worker 0-0, policy_version 1185832 (0.00085) [2022-07-11 12:10:45,433][25689] Fps is (10 sec: 5523.8, 60 sec: 5530.3, 300 sec: 5522.0). Total num frames: 1214298112. Throughput: 0: 5802.2. Samples: 1214307368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:45,433][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 12:10:46,239][26022] Updated weights on worker 0-0, policy_version 1185842 (0.00086) [2022-07-11 12:10:47,902][26022] Updated weights on worker 0-0, policy_version 1185852 (0.00091) [2022-07-11 12:10:49,733][26022] Updated weights on worker 0-0, policy_version 1185862 (0.00081) [2022-07-11 12:10:50,492][25689] Fps is (10 sec: 5495.7, 60 sec: 5500.8, 300 sec: 5525.2). Total num frames: 1214325760. Throughput: 0: 4977.0. Samples: 1214324250. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:50,492][25689] Avg episode reward: [(0, '0.350')] [2022-07-11 12:10:51,752][26022] Updated weights on worker 0-0, policy_version 1185872 (0.00095) [2022-07-11 12:10:53,515][26022] Updated weights on worker 0-0, policy_version 1185882 (0.00086) [2022-07-11 12:10:55,511][25689] Fps is (10 sec: 5384.9, 60 sec: 5507.4, 300 sec: 5521.8). Total num frames: 1214352384. Throughput: 0: 5799.5. Samples: 1214357288. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:10:55,512][25689] Avg episode reward: [(0, '-0.174')] [2022-07-11 12:10:55,559][26022] Updated weights on worker 0-0, policy_version 1185892 (0.00090) [2022-07-11 12:10:57,105][26022] Updated weights on worker 0-0, policy_version 1185902 (0.00087) [2022-07-11 12:10:59,046][26022] Updated weights on worker 0-0, policy_version 1185912 (0.00081) [2022-07-11 12:11:00,550][25689] Fps is (10 sec: 5599.1, 60 sec: 5511.7, 300 sec: 5531.7). Total num frames: 1214382080. Throughput: 0: 5820.2. Samples: 1214390968. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:00,551][25689] Avg episode reward: [(0, '-0.138')] [2022-07-11 12:11:00,798][26022] Updated weights on worker 0-0, policy_version 1185922 (0.00093) [2022-07-11 12:11:03,179][26022] Updated weights on worker 0-0, policy_version 1185932 (0.00086) [2022-07-11 12:11:04,892][26022] Updated weights on worker 0-0, policy_version 1185942 (0.00094) [2022-07-11 12:11:05,566][25689] Fps is (10 sec: 5499.3, 60 sec: 5500.5, 300 sec: 5522.5). Total num frames: 1214407680. Throughput: 0: 4893.9. Samples: 1214405900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:05,567][25689] Avg episode reward: [(0, '0.217')] [2022-07-11 12:11:06,890][26022] Updated weights on worker 0-0, policy_version 1185952 (0.00095) [2022-07-11 12:11:08,630][26022] Updated weights on worker 0-0, policy_version 1185962 (0.00094) [2022-07-11 12:11:10,524][26022] Updated weights on worker 0-0, policy_version 1185972 (0.00097) [2022-07-11 12:11:10,631][25689] Fps is (10 sec: 5383.9, 60 sec: 5536.1, 300 sec: 5526.0). Total num frames: 1214436352. Throughput: 0: 5707.6. Samples: 1214439196. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:10,631][25689] Avg episode reward: [(0, '0.750')] [2022-07-11 12:11:12,375][26022] Updated weights on worker 0-0, policy_version 1185982 (0.00092) [2022-07-11 12:11:14,052][26022] Updated weights on worker 0-0, policy_version 1185992 (0.00094) [2022-07-11 12:11:15,633][25689] Fps is (10 sec: 5594.8, 60 sec: 5519.8, 300 sec: 5526.1). Total num frames: 1214464000. Throughput: 0: 5735.3. Samples: 1214472692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:15,633][25689] Avg episode reward: [(0, '0.745')] [2022-07-11 12:11:16,034][26022] Updated weights on worker 0-0, policy_version 1186002 (0.00089) [2022-07-11 12:11:17,841][26022] Updated weights on worker 0-0, policy_version 1186012 (0.00121) [2022-07-11 12:11:19,642][26022] Updated weights on worker 0-0, policy_version 1186022 (0.00095) [2022-07-11 12:11:20,643][25689] Fps is (10 sec: 5420.8, 60 sec: 5502.1, 300 sec: 5524.0). Total num frames: 1214490624. Throughput: 0: 4898.6. Samples: 1214489392. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:20,643][25689] Avg episode reward: [(0, '1.086')] [2022-07-11 12:11:21,511][26022] Updated weights on worker 0-0, policy_version 1186032 (0.00090) [2022-07-11 12:11:23,212][26022] Updated weights on worker 0-0, policy_version 1186042 (0.00092) [2022-07-11 12:11:25,167][26022] Updated weights on worker 0-0, policy_version 1186052 (0.00081) [2022-07-11 12:11:25,683][25689] Fps is (10 sec: 5604.1, 60 sec: 5516.1, 300 sec: 5524.7). Total num frames: 1214520320. Throughput: 0: 5807.2. Samples: 1214522720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:25,683][25689] Avg episode reward: [(0, '1.187')] [2022-07-11 12:11:27,233][26022] Updated weights on worker 0-0, policy_version 1186062 (0.00087) [2022-07-11 12:11:28,896][26022] Updated weights on worker 0-0, policy_version 1186073 (0.00094) [2022-07-11 12:11:30,752][25689] Fps is (10 sec: 5571.2, 60 sec: 5514.6, 300 sec: 5520.2). Total num frames: 1214546944. Throughput: 0: 5790.8. Samples: 1214555712. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:30,752][25689] Avg episode reward: [(0, '1.877')] [2022-07-11 12:11:31,159][26022] Updated weights on worker 0-0, policy_version 1186083 (0.00084) [2022-07-11 12:11:32,707][26022] Updated weights on worker 0-0, policy_version 1186093 (0.00092) [2022-07-11 12:11:34,609][26022] Updated weights on worker 0-0, policy_version 1186103 (0.00094) [2022-07-11 12:11:35,763][25689] Fps is (10 sec: 5485.5, 60 sec: 5515.3, 300 sec: 5527.0). Total num frames: 1214575616. Throughput: 0: 4961.3. Samples: 1214572564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:35,764][25689] Avg episode reward: [(0, '1.595')] [2022-07-11 12:11:36,507][26022] Updated weights on worker 0-0, policy_version 1186113 (0.00089) [2022-07-11 12:11:38,396][26022] Updated weights on worker 0-0, policy_version 1186123 (0.00090) [2022-07-11 12:11:40,061][26022] Updated weights on worker 0-0, policy_version 1186133 (0.00107) [2022-07-11 12:11:40,772][25689] Fps is (10 sec: 5620.7, 60 sec: 5515.2, 300 sec: 5527.2). Total num frames: 1214603264. Throughput: 0: 5792.0. Samples: 1214605980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:40,772][25689] Avg episode reward: [(0, '1.688')] [2022-07-11 12:11:41,806][26022] Updated weights on worker 0-0, policy_version 1186143 (0.00087) [2022-07-11 12:11:43,836][26022] Updated weights on worker 0-0, policy_version 1186153 (0.00085) [2022-07-11 12:11:45,573][26022] Updated weights on worker 0-0, policy_version 1186163 (0.00089) [2022-07-11 12:11:45,788][25689] Fps is (10 sec: 5516.1, 60 sec: 5514.0, 300 sec: 5521.2). Total num frames: 1214630912. Throughput: 0: 5806.8. Samples: 1214639464. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:45,788][25689] Avg episode reward: [(0, '1.444')] [2022-07-11 12:11:47,695][26022] Updated weights on worker 0-0, policy_version 1186173 (0.00098) [2022-07-11 12:11:49,288][26022] Updated weights on worker 0-0, policy_version 1186183 (0.00081) [2022-07-11 12:11:50,913][25689] Fps is (10 sec: 5351.7, 60 sec: 5491.0, 300 sec: 5519.9). Total num frames: 1214657536. Throughput: 0: 4961.9. Samples: 1214655750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:50,913][25689] Avg episode reward: [(0, '1.554')] [2022-07-11 12:11:51,348][26022] Updated weights on worker 0-0, policy_version 1186193 (0.00089) [2022-07-11 12:11:53,227][26022] Updated weights on worker 0-0, policy_version 1186203 (0.00088) [2022-07-11 12:11:54,918][26022] Updated weights on worker 0-0, policy_version 1186213 (0.00085) [2022-07-11 12:11:55,919][25689] Fps is (10 sec: 5458.1, 60 sec: 5526.2, 300 sec: 5523.4). Total num frames: 1214686208. Throughput: 0: 5769.3. Samples: 1214688848. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:11:55,919][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 12:11:56,906][26022] Updated weights on worker 0-0, policy_version 1186223 (0.00085) [2022-07-11 12:11:58,624][26022] Updated weights on worker 0-0, policy_version 1186233 (0.00087) [2022-07-11 12:12:00,598][26022] Updated weights on worker 0-0, policy_version 1186243 (0.00089) [2022-07-11 12:12:00,994][25689] Fps is (10 sec: 5688.2, 60 sec: 5506.0, 300 sec: 5528.9). Total num frames: 1214714880. Throughput: 0: 5734.5. Samples: 1214721944. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:00,994][25689] Avg episode reward: [(0, '1.524')] [2022-07-11 12:12:02,791][26022] Updated weights on worker 0-0, policy_version 1186253 (0.00086) [2022-07-11 12:12:04,549][26022] Updated weights on worker 0-0, policy_version 1186263 (0.00087) [2022-07-11 12:12:05,118][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:12:05,127][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001186266_1214736384.pth [2022-07-11 12:12:05,127][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001184321_1212744704.pth [2022-07-11 12:12:06,018][25689] Fps is (10 sec: 5374.0, 60 sec: 5505.3, 300 sec: 5523.8). Total num frames: 1214740480. Throughput: 0: 4791.9. Samples: 1214736404. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:06,018][25689] Avg episode reward: [(0, '1.355')] [2022-07-11 12:12:06,408][26022] Updated weights on worker 0-0, policy_version 1186273 (0.00090) [2022-07-11 12:12:08,336][26022] Updated weights on worker 0-0, policy_version 1186283 (0.00090) [2022-07-11 12:12:10,376][26022] Updated weights on worker 0-0, policy_version 1186293 (0.00103) [2022-07-11 12:12:11,067][25689] Fps is (10 sec: 5184.7, 60 sec: 5472.8, 300 sec: 5516.7). Total num frames: 1214767104. Throughput: 0: 5648.2. Samples: 1214769584. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:11,068][25689] Avg episode reward: [(0, '1.326')] [2022-07-11 12:12:11,826][26022] Updated weights on worker 0-0, policy_version 1186303 (0.00099) [2022-07-11 12:12:14,319][26022] Updated weights on worker 0-0, policy_version 1186313 (0.00089) [2022-07-11 12:12:15,531][26022] Updated weights on worker 0-0, policy_version 1186323 (0.00088) [2022-07-11 12:12:16,079][25689] Fps is (10 sec: 5597.8, 60 sec: 5505.8, 300 sec: 5521.5). Total num frames: 1214796800. Throughput: 0: 5665.3. Samples: 1214803062. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:16,079][25689] Avg episode reward: [(0, '0.666')] [2022-07-11 12:12:17,712][26022] Updated weights on worker 0-0, policy_version 1186333 (0.00092) [2022-07-11 12:12:19,159][26022] Updated weights on worker 0-0, policy_version 1186343 (0.00436) [2022-07-11 12:12:21,084][25689] Fps is (10 sec: 5622.4, 60 sec: 5506.2, 300 sec: 5518.0). Total num frames: 1214823424. Throughput: 0: 4879.4. Samples: 1214819970. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:21,085][25689] Avg episode reward: [(0, '-0.623')] [2022-07-11 12:12:21,223][26022] Updated weights on worker 0-0, policy_version 1186353 (0.00092) [2022-07-11 12:12:23,183][26022] Updated weights on worker 0-0, policy_version 1186363 (0.00086) [2022-07-11 12:12:24,820][26022] Updated weights on worker 0-0, policy_version 1186373 (0.00088) [2022-07-11 12:12:26,111][25689] Fps is (10 sec: 5409.8, 60 sec: 5473.5, 300 sec: 5514.8). Total num frames: 1214851072. Throughput: 0: 5830.0. Samples: 1214853548. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:26,111][25689] Avg episode reward: [(0, '-0.874')] [2022-07-11 12:12:26,879][26022] Updated weights on worker 0-0, policy_version 1186383 (0.00099) [2022-07-11 12:12:28,463][26022] Updated weights on worker 0-0, policy_version 1186393 (0.00105) [2022-07-11 12:12:30,503][26022] Updated weights on worker 0-0, policy_version 1186403 (0.00085) [2022-07-11 12:12:31,152][25689] Fps is (10 sec: 5695.6, 60 sec: 5526.9, 300 sec: 5521.2). Total num frames: 1214880768. Throughput: 0: 5820.5. Samples: 1214886490. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:31,152][25689] Avg episode reward: [(0, '-0.612')] [2022-07-11 12:12:32,317][26022] Updated weights on worker 0-0, policy_version 1186413 (0.00086) [2022-07-11 12:12:34,185][26022] Updated weights on worker 0-0, policy_version 1186423 (0.00085) [2022-07-11 12:12:36,155][25689] Fps is (10 sec: 5607.4, 60 sec: 5493.8, 300 sec: 5515.2). Total num frames: 1214907392. Throughput: 0: 4988.3. Samples: 1214903206. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 12:12:36,155][25689] Avg episode reward: [(0, '-2.221')] [2022-07-11 12:12:36,159][26022] Updated weights on worker 0-0, policy_version 1186433 (0.00088) [2022-07-11 12:12:37,927][26022] Updated weights on worker 0-0, policy_version 1186443 (0.00094) [2022-07-11 12:12:39,776][26022] Updated weights on worker 0-0, policy_version 1186453 (0.00101) [2022-07-11 12:12:41,162][25689] Fps is (10 sec: 5421.6, 60 sec: 5493.9, 300 sec: 5518.8). Total num frames: 1214935040. Throughput: 0: 5797.0. Samples: 1214936364. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:12:41,163][25689] Avg episode reward: [(0, '-1.402')] [2022-07-11 12:12:41,502][26022] Updated weights on worker 0-0, policy_version 1186463 (0.00094) [2022-07-11 12:12:43,379][26022] Updated weights on worker 0-0, policy_version 1186473 (0.00092) [2022-07-11 12:12:45,228][26022] Updated weights on worker 0-0, policy_version 1186483 (0.00091) [2022-07-11 12:12:46,196][25689] Fps is (10 sec: 5507.0, 60 sec: 5492.3, 300 sec: 5513.9). Total num frames: 1214962688. Throughput: 0: 5782.0. Samples: 1214969678. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:12:46,196][25689] Avg episode reward: [(0, '-1.673')] [2022-07-11 12:12:47,095][26022] Updated weights on worker 0-0, policy_version 1186493 (0.00096) [2022-07-11 12:12:48,980][26022] Updated weights on worker 0-0, policy_version 1186503 (0.00093) [2022-07-11 12:12:51,043][26022] Updated weights on worker 0-0, policy_version 1186513 (0.00083) [2022-07-11 12:12:51,256][25689] Fps is (10 sec: 5477.9, 60 sec: 5515.1, 300 sec: 5512.8). Total num frames: 1214990336. Throughput: 0: 4949.2. Samples: 1214985992. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:12:51,258][25689] Avg episode reward: [(0, '-0.405')] [2022-07-11 12:12:52,533][26022] Updated weights on worker 0-0, policy_version 1186523 (0.00086) [2022-07-11 12:12:54,802][26022] Updated weights on worker 0-0, policy_version 1186533 (0.00085) [2022-07-11 12:12:56,172][26022] Updated weights on worker 0-0, policy_version 1186543 (0.00088) [2022-07-11 12:12:56,259][25689] Fps is (10 sec: 5698.4, 60 sec: 5532.4, 300 sec: 5513.3). Total num frames: 1215020032. Throughput: 0: 5793.8. Samples: 1215019686. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:12:56,264][25689] Avg episode reward: [(0, '-0.493')] [2022-07-11 12:12:58,206][26022] Updated weights on worker 0-0, policy_version 1186553 (0.00087) [2022-07-11 12:12:59,933][26022] Updated weights on worker 0-0, policy_version 1186563 (0.00086) [2022-07-11 12:13:01,271][25689] Fps is (10 sec: 5624.0, 60 sec: 5504.2, 300 sec: 5523.6). Total num frames: 1215046656. Throughput: 0: 5828.9. Samples: 1215053574. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:01,271][25689] Avg episode reward: [(0, '-1.510')] [2022-07-11 12:13:02,298][26022] Updated weights on worker 0-0, policy_version 1186573 (0.00091) [2022-07-11 12:13:03,907][26022] Updated weights on worker 0-0, policy_version 1186583 (0.00082) [2022-07-11 12:13:05,944][26022] Updated weights on worker 0-0, policy_version 1186593 (0.00090) [2022-07-11 12:13:06,276][25689] Fps is (10 sec: 5315.3, 60 sec: 5522.8, 300 sec: 5518.8). Total num frames: 1215073280. Throughput: 0: 4904.8. Samples: 1215068170. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:06,277][25689] Avg episode reward: [(0, '-1.483')] [2022-07-11 12:13:07,602][26022] Updated weights on worker 0-0, policy_version 1186603 (0.00052) [2022-07-11 12:13:09,580][26022] Updated weights on worker 0-0, policy_version 1186613 (0.00085) [2022-07-11 12:13:11,317][25689] Fps is (10 sec: 5402.0, 60 sec: 5540.6, 300 sec: 5512.2). Total num frames: 1215100928. Throughput: 0: 5765.9. Samples: 1215101662. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:11,318][25689] Avg episode reward: [(0, '-2.248')] [2022-07-11 12:13:11,568][26022] Updated weights on worker 0-0, policy_version 1186623 (0.00090) [2022-07-11 12:13:13,141][26022] Updated weights on worker 0-0, policy_version 1186633 (0.00090) [2022-07-11 12:13:15,090][26022] Updated weights on worker 0-0, policy_version 1186643 (0.00117) [2022-07-11 12:13:16,339][25689] Fps is (10 sec: 5597.2, 60 sec: 5522.7, 300 sec: 5519.3). Total num frames: 1215129600. Throughput: 0: 5745.9. Samples: 1215135064. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:16,339][25689] Avg episode reward: [(0, '-3.276')] [2022-07-11 12:13:16,992][26022] Updated weights on worker 0-0, policy_version 1186653 (0.00086) [2022-07-11 12:13:18,682][26022] Updated weights on worker 0-0, policy_version 1186663 (0.00089) [2022-07-11 12:13:20,780][26022] Updated weights on worker 0-0, policy_version 1186673 (0.00081) [2022-07-11 12:13:21,349][25689] Fps is (10 sec: 5614.3, 60 sec: 5539.3, 300 sec: 5509.3). Total num frames: 1215157248. Throughput: 0: 4883.1. Samples: 1215151618. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:21,350][25689] Avg episode reward: [(0, '-3.144')] [2022-07-11 12:13:22,379][26022] Updated weights on worker 0-0, policy_version 1186683 (0.00090) [2022-07-11 12:13:24,518][26022] Updated weights on worker 0-0, policy_version 1186693 (0.00090) [2022-07-11 12:13:26,164][26022] Updated weights on worker 0-0, policy_version 1186703 (0.00089) [2022-07-11 12:13:26,384][25689] Fps is (10 sec: 5504.7, 60 sec: 5538.5, 300 sec: 5516.7). Total num frames: 1215184896. Throughput: 0: 5809.2. Samples: 1215184978. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:26,384][25689] Avg episode reward: [(0, '-3.632')] [2022-07-11 12:13:28,059][26022] Updated weights on worker 0-0, policy_version 1186713 (0.00080) [2022-07-11 12:13:29,916][26022] Updated weights on worker 0-0, policy_version 1186723 (0.00085) [2022-07-11 12:13:31,435][25689] Fps is (10 sec: 5482.3, 60 sec: 5503.6, 300 sec: 5513.6). Total num frames: 1215212544. Throughput: 0: 5793.6. Samples: 1215218216. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:31,436][25689] Avg episode reward: [(0, '-2.872')] [2022-07-11 12:13:31,635][26022] Updated weights on worker 0-0, policy_version 1186733 (0.00886) [2022-07-11 12:13:33,470][26022] Updated weights on worker 0-0, policy_version 1186743 (0.00083) [2022-07-11 12:13:35,431][26022] Updated weights on worker 0-0, policy_version 1186753 (0.00093) [2022-07-11 12:13:36,446][25689] Fps is (10 sec: 5495.6, 60 sec: 5519.9, 300 sec: 5513.8). Total num frames: 1215240192. Throughput: 0: 4979.6. Samples: 1215235190. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:36,446][25689] Avg episode reward: [(0, '-1.383')] [2022-07-11 12:13:37,266][26022] Updated weights on worker 0-0, policy_version 1186763 (0.00082) [2022-07-11 12:13:39,107][26022] Updated weights on worker 0-0, policy_version 1186773 (0.00095) [2022-07-11 12:13:40,806][26022] Updated weights on worker 0-0, policy_version 1186783 (0.00095) [2022-07-11 12:13:41,462][25689] Fps is (10 sec: 5718.9, 60 sec: 5553.0, 300 sec: 5517.0). Total num frames: 1215269888. Throughput: 0: 5820.3. Samples: 1215268684. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:41,464][25689] Avg episode reward: [(0, '-0.559')] [2022-07-11 12:13:42,774][26022] Updated weights on worker 0-0, policy_version 1186793 (0.00108) [2022-07-11 12:13:44,472][26022] Updated weights on worker 0-0, policy_version 1186803 (0.00086) [2022-07-11 12:13:46,464][26022] Updated weights on worker 0-0, policy_version 1186813 (0.00091) [2022-07-11 12:13:46,482][25689] Fps is (10 sec: 5611.4, 60 sec: 5537.3, 300 sec: 5515.7). Total num frames: 1215296512. Throughput: 0: 5838.7. Samples: 1215302326. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:46,483][25689] Avg episode reward: [(0, '-1.505')] [2022-07-11 12:13:47,942][26022] Updated weights on worker 0-0, policy_version 1186823 (0.00082) [2022-07-11 12:13:50,270][26022] Updated weights on worker 0-0, policy_version 1186833 (0.00098) [2022-07-11 12:13:51,547][25689] Fps is (10 sec: 5483.4, 60 sec: 5553.9, 300 sec: 5511.5). Total num frames: 1215325184. Throughput: 0: 5020.0. Samples: 1215319176. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:51,547][25689] Avg episode reward: [(0, '-1.440')] [2022-07-11 12:13:51,781][26022] Updated weights on worker 0-0, policy_version 1186843 (0.00084) [2022-07-11 12:13:53,915][26022] Updated weights on worker 0-0, policy_version 1186853 (0.00085) [2022-07-11 12:13:55,512][26022] Updated weights on worker 0-0, policy_version 1186863 (0.00084) [2022-07-11 12:13:56,548][25689] Fps is (10 sec: 5595.4, 60 sec: 5520.1, 300 sec: 5513.1). Total num frames: 1215352832. Throughput: 0: 5823.8. Samples: 1215352260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:13:56,548][25689] Avg episode reward: [(0, '-1.119')] [2022-07-11 12:13:57,347][26022] Updated weights on worker 0-0, policy_version 1186873 (0.00094) [2022-07-11 12:13:59,386][26022] Updated weights on worker 0-0, policy_version 1186883 (0.00092) [2022-07-11 12:14:01,361][26022] Updated weights on worker 0-0, policy_version 1186893 (0.00090) [2022-07-11 12:14:01,579][25689] Fps is (10 sec: 5307.6, 60 sec: 5501.3, 300 sec: 5512.6). Total num frames: 1215378432. Throughput: 0: 5812.4. Samples: 1215385610. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:01,581][25689] Avg episode reward: [(0, '-1.128')] [2022-07-11 12:14:03,374][26022] Updated weights on worker 0-0, policy_version 1186903 (0.00081) [2022-07-11 12:14:05,124][26022] Updated weights on worker 0-0, policy_version 1186913 (0.00086) [2022-07-11 12:14:05,638][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:14:05,651][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001186914_1215399936.pth [2022-07-11 12:14:05,652][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001184971_1213410304.pth [2022-07-11 12:14:06,599][25689] Fps is (10 sec: 5297.4, 60 sec: 5517.0, 300 sec: 5511.0). Total num frames: 1215406080. Throughput: 0: 4872.9. Samples: 1215400354. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:06,600][25689] Avg episode reward: [(0, '-0.959')] [2022-07-11 12:14:07,007][26022] Updated weights on worker 0-0, policy_version 1186923 (0.00094) [2022-07-11 12:14:09,139][26022] Updated weights on worker 0-0, policy_version 1186933 (0.00080) [2022-07-11 12:14:10,674][26022] Updated weights on worker 0-0, policy_version 1186943 (0.00092) [2022-07-11 12:14:11,694][25689] Fps is (10 sec: 5669.1, 60 sec: 5546.0, 300 sec: 5523.8). Total num frames: 1215435776. Throughput: 0: 5683.8. Samples: 1215433690. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:11,695][25689] Avg episode reward: [(0, '-1.370')] [2022-07-11 12:14:12,608][26022] Updated weights on worker 0-0, policy_version 1186953 (0.00093) [2022-07-11 12:14:14,375][26022] Updated weights on worker 0-0, policy_version 1186963 (0.00088) [2022-07-11 12:14:16,162][26022] Updated weights on worker 0-0, policy_version 1186973 (0.00089) [2022-07-11 12:14:16,754][25689] Fps is (10 sec: 5546.5, 60 sec: 5508.6, 300 sec: 5509.7). Total num frames: 1215462400. Throughput: 0: 5703.0. Samples: 1215467494. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:16,754][25689] Avg episode reward: [(0, '-0.944')] [2022-07-11 12:14:17,933][26022] Updated weights on worker 0-0, policy_version 1186983 (0.00091) [2022-07-11 12:14:19,791][26022] Updated weights on worker 0-0, policy_version 1186993 (0.00086) [2022-07-11 12:14:21,576][26022] Updated weights on worker 0-0, policy_version 1187003 (0.00092) [2022-07-11 12:14:21,780][25689] Fps is (10 sec: 5482.5, 60 sec: 5524.0, 300 sec: 5520.2). Total num frames: 1215491072. Throughput: 0: 4889.9. Samples: 1215484392. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:21,781][25689] Avg episode reward: [(0, '-0.000')] [2022-07-11 12:14:23,643][26022] Updated weights on worker 0-0, policy_version 1187013 (0.00092) [2022-07-11 12:14:25,155][26022] Updated weights on worker 0-0, policy_version 1187023 (0.00087) [2022-07-11 12:14:26,795][25689] Fps is (10 sec: 5506.8, 60 sec: 5508.9, 300 sec: 5515.6). Total num frames: 1215517696. Throughput: 0: 5812.1. Samples: 1215517732. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:26,795][25689] Avg episode reward: [(0, '0.574')] [2022-07-11 12:14:27,211][26022] Updated weights on worker 0-0, policy_version 1187033 (0.00084) [2022-07-11 12:14:28,891][26022] Updated weights on worker 0-0, policy_version 1187043 (0.00084) [2022-07-11 12:14:30,926][26022] Updated weights on worker 0-0, policy_version 1187053 (0.00083) [2022-07-11 12:14:31,904][25689] Fps is (10 sec: 5563.3, 60 sec: 5537.6, 300 sec: 5521.3). Total num frames: 1215547392. Throughput: 0: 5808.9. Samples: 1215551086. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:31,904][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 12:14:32,645][26022] Updated weights on worker 0-0, policy_version 1187063 (0.00090) [2022-07-11 12:14:34,720][26022] Updated weights on worker 0-0, policy_version 1187073 (0.00089) [2022-07-11 12:14:36,282][26022] Updated weights on worker 0-0, policy_version 1187083 (0.00088) [2022-07-11 12:14:36,983][25689] Fps is (10 sec: 5729.2, 60 sec: 5548.2, 300 sec: 5521.3). Total num frames: 1215576064. Throughput: 0: 5791.4. Samples: 1215584650. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:36,983][25689] Avg episode reward: [(0, '1.197')] [2022-07-11 12:14:38,223][26022] Updated weights on worker 0-0, policy_version 1187093 (0.00086) [2022-07-11 12:14:40,066][26022] Updated weights on worker 0-0, policy_version 1187103 (0.00085) [2022-07-11 12:14:41,911][26022] Updated weights on worker 0-0, policy_version 1187113 (0.00086) [2022-07-11 12:14:42,004][25689] Fps is (10 sec: 5576.1, 60 sec: 5514.0, 300 sec: 5521.2). Total num frames: 1215603712. Throughput: 0: 5778.3. Samples: 1215601250. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:42,005][25689] Avg episode reward: [(0, '1.640')] [2022-07-11 12:14:43,728][26022] Updated weights on worker 0-0, policy_version 1187123 (0.00086) [2022-07-11 12:14:45,633][26022] Updated weights on worker 0-0, policy_version 1187133 (0.00091) [2022-07-11 12:14:47,006][25689] Fps is (10 sec: 5516.7, 60 sec: 5532.5, 300 sec: 5516.2). Total num frames: 1215631360. Throughput: 0: 5799.2. Samples: 1215634940. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:47,007][25689] Avg episode reward: [(0, '1.718')] [2022-07-11 12:14:47,334][26022] Updated weights on worker 0-0, policy_version 1187143 (0.00084) [2022-07-11 12:14:49,361][26022] Updated weights on worker 0-0, policy_version 1187153 (0.00085) [2022-07-11 12:14:50,917][26022] Updated weights on worker 0-0, policy_version 1187163 (0.00088) [2022-07-11 12:14:52,111][25689] Fps is (10 sec: 5572.0, 60 sec: 5528.8, 300 sec: 5522.9). Total num frames: 1215660032. Throughput: 0: 5817.0. Samples: 1215668634. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:52,114][25689] Avg episode reward: [(0, '1.850')] [2022-07-11 12:14:53,091][26022] Updated weights on worker 0-0, policy_version 1187173 (0.00079) [2022-07-11 12:14:54,503][26022] Updated weights on worker 0-0, policy_version 1187183 (0.00083) [2022-07-11 12:14:56,628][26022] Updated weights on worker 0-0, policy_version 1187193 (0.00090) [2022-07-11 12:14:57,116][25689] Fps is (10 sec: 5672.0, 60 sec: 5545.4, 300 sec: 5520.9). Total num frames: 1215688704. Throughput: 0: 5024.6. Samples: 1215685812. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:14:57,117][25689] Avg episode reward: [(0, '2.047')] [2022-07-11 12:14:58,184][26022] Updated weights on worker 0-0, policy_version 1187203 (0.00080) [2022-07-11 12:15:00,174][26022] Updated weights on worker 0-0, policy_version 1187213 (0.00091) [2022-07-11 12:15:01,910][26022] Updated weights on worker 0-0, policy_version 1187223 (0.00077) [2022-07-11 12:15:02,180][25689] Fps is (10 sec: 5593.4, 60 sec: 5576.1, 300 sec: 5524.7). Total num frames: 1215716352. Throughput: 0: 5844.8. Samples: 1215719178. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:02,187][25689] Avg episode reward: [(0, '1.843')] [2022-07-11 12:15:04,213][26022] Updated weights on worker 0-0, policy_version 1187233 (0.00082) [2022-07-11 12:15:05,890][26022] Updated weights on worker 0-0, policy_version 1187243 (0.00094) [2022-07-11 12:15:07,188][25689] Fps is (10 sec: 5286.8, 60 sec: 5543.5, 300 sec: 5522.6). Total num frames: 1215741952. Throughput: 0: 5740.1. Samples: 1215750786. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:07,188][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 12:15:07,943][26022] Updated weights on worker 0-0, policy_version 1187253 (0.00082) [2022-07-11 12:15:09,738][26022] Updated weights on worker 0-0, policy_version 1187263 (0.00100) [2022-07-11 12:15:11,573][26022] Updated weights on worker 0-0, policy_version 1187273 (0.00085) [2022-07-11 12:15:12,288][25689] Fps is (10 sec: 5369.3, 60 sec: 5526.1, 300 sec: 5520.9). Total num frames: 1215770624. Throughput: 0: 4893.8. Samples: 1215767376. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:12,289][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 12:15:13,588][26022] Updated weights on worker 0-0, policy_version 1187283 (0.00165) [2022-07-11 12:15:15,404][26022] Updated weights on worker 0-0, policy_version 1187293 (0.00085) [2022-07-11 12:15:17,175][26022] Updated weights on worker 0-0, policy_version 1187303 (0.00090) [2022-07-11 12:15:17,375][25689] Fps is (10 sec: 5628.8, 60 sec: 5557.3, 300 sec: 5522.8). Total num frames: 1215799296. Throughput: 0: 5668.3. Samples: 1215800648. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:17,376][25689] Avg episode reward: [(0, '1.627')] [2022-07-11 12:15:19,135][26022] Updated weights on worker 0-0, policy_version 1187313 (0.00056) [2022-07-11 12:15:20,633][26022] Updated weights on worker 0-0, policy_version 1187323 (0.00097) [2022-07-11 12:15:22,407][25689] Fps is (10 sec: 5566.1, 60 sec: 5540.0, 300 sec: 5518.9). Total num frames: 1215826944. Throughput: 0: 5697.0. Samples: 1215834406. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:22,407][25689] Avg episode reward: [(0, '1.326')] [2022-07-11 12:15:22,713][26022] Updated weights on worker 0-0, policy_version 1187333 (0.00093) [2022-07-11 12:15:24,526][26022] Updated weights on worker 0-0, policy_version 1187343 (0.00523) [2022-07-11 12:15:26,398][26022] Updated weights on worker 0-0, policy_version 1187353 (0.00092) [2022-07-11 12:15:27,414][25689] Fps is (10 sec: 5610.1, 60 sec: 5574.4, 300 sec: 5526.6). Total num frames: 1215855616. Throughput: 0: 4971.5. Samples: 1215851342. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:27,415][25689] Avg episode reward: [(0, '1.507')] [2022-07-11 12:15:28,154][26022] Updated weights on worker 0-0, policy_version 1187363 (0.00092) [2022-07-11 12:15:29,834][26022] Updated weights on worker 0-0, policy_version 1187373 (0.00089) [2022-07-11 12:15:31,753][26022] Updated weights on worker 0-0, policy_version 1187383 (0.00087) [2022-07-11 12:15:32,466][25689] Fps is (10 sec: 5599.0, 60 sec: 5545.9, 300 sec: 5522.6). Total num frames: 1215883264. Throughput: 0: 5819.3. Samples: 1215884794. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:32,466][25689] Avg episode reward: [(0, '1.669')] [2022-07-11 12:15:33,856][26022] Updated weights on worker 0-0, policy_version 1187393 (0.00093) [2022-07-11 12:15:35,469][26022] Updated weights on worker 0-0, policy_version 1187403 (0.00083) [2022-07-11 12:15:37,478][25689] Fps is (10 sec: 5494.8, 60 sec: 5535.1, 300 sec: 5522.5). Total num frames: 1215910912. Throughput: 0: 5854.1. Samples: 1215918330. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:37,478][25689] Avg episode reward: [(0, '1.589')] [2022-07-11 12:15:37,480][26022] Updated weights on worker 0-0, policy_version 1187413 (0.00090) [2022-07-11 12:15:38,929][26022] Updated weights on worker 0-0, policy_version 1187423 (0.00087) [2022-07-11 12:15:41,023][26022] Updated weights on worker 0-0, policy_version 1187433 (0.00095) [2022-07-11 12:15:42,498][25689] Fps is (10 sec: 5716.1, 60 sec: 5569.1, 300 sec: 5529.1). Total num frames: 1215940608. Throughput: 0: 5012.6. Samples: 1215935114. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:42,498][25689] Avg episode reward: [(0, '1.493')] [2022-07-11 12:15:42,768][26022] Updated weights on worker 0-0, policy_version 1187443 (0.00085) [2022-07-11 12:15:44,744][26022] Updated weights on worker 0-0, policy_version 1187453 (0.00090) [2022-07-11 12:15:46,507][26022] Updated weights on worker 0-0, policy_version 1187463 (0.00096) [2022-07-11 12:15:47,535][25689] Fps is (10 sec: 5600.1, 60 sec: 5549.0, 300 sec: 5526.0). Total num frames: 1215967232. Throughput: 0: 5829.1. Samples: 1215968626. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:47,535][25689] Avg episode reward: [(0, '0.351')] [2022-07-11 12:15:48,303][26022] Updated weights on worker 0-0, policy_version 1187473 (0.00080) [2022-07-11 12:15:50,098][26022] Updated weights on worker 0-0, policy_version 1187483 (0.00087) [2022-07-11 12:15:51,984][26022] Updated weights on worker 0-0, policy_version 1187493 (0.00087) [2022-07-11 12:15:52,600][25689] Fps is (10 sec: 5473.4, 60 sec: 5552.6, 300 sec: 5532.1). Total num frames: 1215995904. Throughput: 0: 5840.9. Samples: 1216002400. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:52,601][25689] Avg episode reward: [(0, '0.382')] [2022-07-11 12:15:53,722][26022] Updated weights on worker 0-0, policy_version 1187503 (0.00088) [2022-07-11 12:15:55,798][26022] Updated weights on worker 0-0, policy_version 1187513 (0.00085) [2022-07-11 12:15:57,445][26022] Updated weights on worker 0-0, policy_version 1187523 (0.00090) [2022-07-11 12:15:57,637][25689] Fps is (10 sec: 5676.2, 60 sec: 5549.7, 300 sec: 5528.7). Total num frames: 1216024576. Throughput: 0: 5000.8. Samples: 1216019144. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:15:57,638][25689] Avg episode reward: [(0, '0.069')] [2022-07-11 12:15:59,174][26022] Updated weights on worker 0-0, policy_version 1187533 (0.00092) [2022-07-11 12:16:01,088][26022] Updated weights on worker 0-0, policy_version 1187543 (0.00087) [2022-07-11 12:16:02,666][25689] Fps is (10 sec: 5493.6, 60 sec: 5536.0, 300 sec: 5531.9). Total num frames: 1216051200. Throughput: 0: 5851.0. Samples: 1216053118. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:02,666][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 12:16:03,098][26022] Updated weights on worker 0-0, policy_version 1187553 (0.00099) [2022-07-11 12:16:05,221][26022] Updated weights on worker 0-0, policy_version 1187563 (0.00051) [2022-07-11 12:16:05,736][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:16:05,746][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001187567_1216068608.pth [2022-07-11 12:16:05,747][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001185618_1214072832.pth [2022-07-11 12:16:07,010][26022] Updated weights on worker 0-0, policy_version 1187573 (0.00083) [2022-07-11 12:16:07,726][25689] Fps is (10 sec: 5379.5, 60 sec: 5565.0, 300 sec: 5528.5). Total num frames: 1216078848. Throughput: 0: 5760.3. Samples: 1216084934. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:07,726][25689] Avg episode reward: [(0, '-0.405')] [2022-07-11 12:16:08,619][26022] Updated weights on worker 0-0, policy_version 1187583 (0.00075) [2022-07-11 12:16:10,529][26022] Updated weights on worker 0-0, policy_version 1187593 (0.00087) [2022-07-11 12:16:12,263][26022] Updated weights on worker 0-0, policy_version 1187603 (0.00090) [2022-07-11 12:16:12,760][25689] Fps is (10 sec: 5477.9, 60 sec: 5554.2, 300 sec: 5527.9). Total num frames: 1216106496. Throughput: 0: 4925.8. Samples: 1216101704. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:12,761][25689] Avg episode reward: [(0, '-0.789')] [2022-07-11 12:16:14,208][26022] Updated weights on worker 0-0, policy_version 1187613 (0.00086) [2022-07-11 12:16:16,226][26022] Updated weights on worker 0-0, policy_version 1187623 (0.00089) [2022-07-11 12:16:17,772][26022] Updated weights on worker 0-0, policy_version 1187633 (0.01394) [2022-07-11 12:16:17,857][25689] Fps is (10 sec: 5660.3, 60 sec: 5570.2, 300 sec: 5536.6). Total num frames: 1216136192. Throughput: 0: 5747.3. Samples: 1216135352. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:17,857][25689] Avg episode reward: [(0, '-0.235')] [2022-07-11 12:16:19,816][26022] Updated weights on worker 0-0, policy_version 1187643 (0.00102) [2022-07-11 12:16:21,416][26022] Updated weights on worker 0-0, policy_version 1187653 (0.00084) [2022-07-11 12:16:22,904][25689] Fps is (10 sec: 5653.3, 60 sec: 5568.8, 300 sec: 5529.6). Total num frames: 1216163840. Throughput: 0: 5733.3. Samples: 1216169148. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:22,904][25689] Avg episode reward: [(0, '0.053')] [2022-07-11 12:16:23,336][26022] Updated weights on worker 0-0, policy_version 1187663 (0.00091) [2022-07-11 12:16:25,375][26022] Updated weights on worker 0-0, policy_version 1187673 (0.00084) [2022-07-11 12:16:26,938][26022] Updated weights on worker 0-0, policy_version 1187683 (0.00090) [2022-07-11 12:16:27,951][25689] Fps is (10 sec: 5478.0, 60 sec: 5548.2, 300 sec: 5533.5). Total num frames: 1216191488. Throughput: 0: 4989.0. Samples: 1216185838. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:27,951][25689] Avg episode reward: [(0, '0.353')] [2022-07-11 12:16:28,999][26022] Updated weights on worker 0-0, policy_version 1187693 (0.00074) [2022-07-11 12:16:30,707][26022] Updated weights on worker 0-0, policy_version 1187703 (0.00089) [2022-07-11 12:16:32,597][26022] Updated weights on worker 0-0, policy_version 1187713 (0.00084) [2022-07-11 12:16:32,998][25689] Fps is (10 sec: 5579.2, 60 sec: 5565.5, 300 sec: 5532.8). Total num frames: 1216220160. Throughput: 0: 5792.7. Samples: 1216218938. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:32,999][25689] Avg episode reward: [(0, '0.721')] [2022-07-11 12:16:34,558][26022] Updated weights on worker 0-0, policy_version 1187723 (0.00090) [2022-07-11 12:16:36,310][26022] Updated weights on worker 0-0, policy_version 1187733 (0.00096) [2022-07-11 12:16:38,016][25689] Fps is (10 sec: 5697.3, 60 sec: 5581.9, 300 sec: 5536.1). Total num frames: 1216248832. Throughput: 0: 5802.2. Samples: 1216252320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2022-07-11 12:16:38,017][25689] Avg episode reward: [(0, '0.737')] [2022-07-11 12:16:38,025][26022] Updated weights on worker 0-0, policy_version 1187743 (0.00083) [2022-07-11 12:16:40,173][26022] Updated weights on worker 0-0, policy_version 1187753 (0.00094) [2022-07-11 12:16:41,719][26022] Updated weights on worker 0-0, policy_version 1187763 (0.00096) [2022-07-11 12:16:43,086][25689] Fps is (10 sec: 5380.2, 60 sec: 5509.7, 300 sec: 5528.2). Total num frames: 1216274432. Throughput: 0: 4958.7. Samples: 1216269224. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:16:43,086][25689] Avg episode reward: [(0, '1.012')] [2022-07-11 12:16:43,926][26022] Updated weights on worker 0-0, policy_version 1187773 (0.00114) [2022-07-11 12:16:45,648][26022] Updated weights on worker 0-0, policy_version 1187783 (0.00092) [2022-07-11 12:16:47,368][26022] Updated weights on worker 0-0, policy_version 1187793 (0.00090) [2022-07-11 12:16:48,099][25689] Fps is (10 sec: 5382.4, 60 sec: 5545.7, 300 sec: 5537.2). Total num frames: 1216303104. Throughput: 0: 5769.4. Samples: 1216302082. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:16:48,100][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 12:16:49,512][26022] Updated weights on worker 0-0, policy_version 1187803 (0.00088) [2022-07-11 12:16:51,144][26022] Updated weights on worker 0-0, policy_version 1187813 (0.00092) [2022-07-11 12:16:53,156][26022] Updated weights on worker 0-0, policy_version 1187823 (0.00089) [2022-07-11 12:16:53,234][25689] Fps is (10 sec: 5549.8, 60 sec: 5522.5, 300 sec: 5531.3). Total num frames: 1216330752. Throughput: 0: 5743.8. Samples: 1216335166. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:16:53,234][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 12:16:54,920][26022] Updated weights on worker 0-0, policy_version 1187833 (0.00088) [2022-07-11 12:16:56,688][26022] Updated weights on worker 0-0, policy_version 1187843 (0.00065) [2022-07-11 12:16:58,295][25689] Fps is (10 sec: 5423.3, 60 sec: 5503.4, 300 sec: 5528.1). Total num frames: 1216358400. Throughput: 0: 5734.9. Samples: 1216368620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:16:58,296][25689] Avg episode reward: [(0, '0.955')] [2022-07-11 12:16:58,632][26022] Updated weights on worker 0-0, policy_version 1187853 (0.00094) [2022-07-11 12:17:00,495][26022] Updated weights on worker 0-0, policy_version 1187863 (0.00921) [2022-07-11 12:17:02,659][26022] Updated weights on worker 0-0, policy_version 1187873 (0.00084) [2022-07-11 12:17:03,311][25689] Fps is (10 sec: 5487.4, 60 sec: 5521.5, 300 sec: 5535.2). Total num frames: 1216386048. Throughput: 0: 5731.0. Samples: 1216385134. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:03,311][25689] Avg episode reward: [(0, '1.345')] [2022-07-11 12:17:04,634][26022] Updated weights on worker 0-0, policy_version 1187883 (0.00083) [2022-07-11 12:17:06,328][26022] Updated weights on worker 0-0, policy_version 1187893 (0.00085) [2022-07-11 12:17:08,188][26022] Updated weights on worker 0-0, policy_version 1187903 (0.00089) [2022-07-11 12:17:08,380][25689] Fps is (10 sec: 5381.5, 60 sec: 5503.7, 300 sec: 5534.8). Total num frames: 1216412672. Throughput: 0: 5632.8. Samples: 1216416320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:08,381][25689] Avg episode reward: [(0, '0.515')] [2022-07-11 12:17:10,115][26022] Updated weights on worker 0-0, policy_version 1187913 (0.00097) [2022-07-11 12:17:11,819][26022] Updated weights on worker 0-0, policy_version 1187923 (0.00081) [2022-07-11 12:17:13,510][25689] Fps is (10 sec: 5421.6, 60 sec: 5511.9, 300 sec: 5529.2). Total num frames: 1216441344. Throughput: 0: 5645.7. Samples: 1216449640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:13,510][25689] Avg episode reward: [(0, '0.266')] [2022-07-11 12:17:14,046][26022] Updated weights on worker 0-0, policy_version 1187933 (0.00085) [2022-07-11 12:17:15,446][26022] Updated weights on worker 0-0, policy_version 1187943 (0.00083) [2022-07-11 12:17:17,440][26022] Updated weights on worker 0-0, policy_version 1187953 (0.00087) [2022-07-11 12:17:18,561][25689] Fps is (10 sec: 5733.3, 60 sec: 5516.1, 300 sec: 5538.6). Total num frames: 1216471040. Throughput: 0: 4830.9. Samples: 1216466520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:18,561][25689] Avg episode reward: [(0, '-0.148')] [2022-07-11 12:17:19,318][26022] Updated weights on worker 0-0, policy_version 1187963 (0.00086) [2022-07-11 12:17:21,052][26022] Updated weights on worker 0-0, policy_version 1187973 (0.00083) [2022-07-11 12:17:22,819][26022] Updated weights on worker 0-0, policy_version 1187983 (0.00090) [2022-07-11 12:17:23,656][25689] Fps is (10 sec: 5651.9, 60 sec: 5511.7, 300 sec: 5537.4). Total num frames: 1216498688. Throughput: 0: 5663.5. Samples: 1216500358. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:23,656][25689] Avg episode reward: [(0, '-0.093')] [2022-07-11 12:17:24,608][26022] Updated weights on worker 0-0, policy_version 1187993 (0.00084) [2022-07-11 12:17:26,211][26022] Updated weights on worker 0-0, policy_version 1188003 (0.00082) [2022-07-11 12:17:28,336][26022] Updated weights on worker 0-0, policy_version 1188013 (0.00081) [2022-07-11 12:17:28,675][25689] Fps is (10 sec: 5568.2, 60 sec: 5531.1, 300 sec: 5534.3). Total num frames: 1216527360. Throughput: 0: 5814.8. Samples: 1216534332. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:28,676][25689] Avg episode reward: [(0, '0.241')] [2022-07-11 12:17:30,016][26022] Updated weights on worker 0-0, policy_version 1188023 (0.00088) [2022-07-11 12:17:31,995][26022] Updated weights on worker 0-0, policy_version 1188033 (0.00087) [2022-07-11 12:17:33,777][25689] Fps is (10 sec: 5564.5, 60 sec: 5509.3, 300 sec: 5535.9). Total num frames: 1216555008. Throughput: 0: 4995.2. Samples: 1216550878. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:33,778][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 12:17:33,818][26022] Updated weights on worker 0-0, policy_version 1188043 (0.00084) [2022-07-11 12:17:35,603][26022] Updated weights on worker 0-0, policy_version 1188053 (0.00084) [2022-07-11 12:17:37,584][26022] Updated weights on worker 0-0, policy_version 1188063 (0.00090) [2022-07-11 12:17:38,850][25689] Fps is (10 sec: 5535.5, 60 sec: 5504.3, 300 sec: 5538.2). Total num frames: 1216583680. Throughput: 0: 5817.7. Samples: 1216584554. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:38,850][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 12:17:39,236][26022] Updated weights on worker 0-0, policy_version 1188073 (0.00086) [2022-07-11 12:17:41,080][26022] Updated weights on worker 0-0, policy_version 1188083 (0.00369) [2022-07-11 12:17:42,997][26022] Updated weights on worker 0-0, policy_version 1188093 (0.00084) [2022-07-11 12:17:43,907][25689] Fps is (10 sec: 5660.7, 60 sec: 5555.9, 300 sec: 5541.1). Total num frames: 1216612352. Throughput: 0: 5813.9. Samples: 1216618098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:43,908][25689] Avg episode reward: [(0, '1.638')] [2022-07-11 12:17:44,736][26022] Updated weights on worker 0-0, policy_version 1188103 (0.00096) [2022-07-11 12:17:46,711][26022] Updated weights on worker 0-0, policy_version 1188113 (0.00091) [2022-07-11 12:17:48,416][26022] Updated weights on worker 0-0, policy_version 1188123 (0.00073) [2022-07-11 12:17:48,931][25689] Fps is (10 sec: 5586.3, 60 sec: 5538.1, 300 sec: 5541.8). Total num frames: 1216640000. Throughput: 0: 4965.8. Samples: 1216634920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:48,932][25689] Avg episode reward: [(0, '1.656')] [2022-07-11 12:17:50,317][26022] Updated weights on worker 0-0, policy_version 1188133 (0.00083) [2022-07-11 12:17:52,116][26022] Updated weights on worker 0-0, policy_version 1188143 (0.00093) [2022-07-11 12:17:53,932][26022] Updated weights on worker 0-0, policy_version 1188153 (0.00090) [2022-07-11 12:17:54,050][25689] Fps is (10 sec: 5552.7, 60 sec: 5556.4, 300 sec: 5536.2). Total num frames: 1216668672. Throughput: 0: 5800.5. Samples: 1216668472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:54,051][25689] Avg episode reward: [(0, '1.527')] [2022-07-11 12:17:55,774][26022] Updated weights on worker 0-0, policy_version 1188163 (0.00083) [2022-07-11 12:17:57,695][26022] Updated weights on worker 0-0, policy_version 1188173 (0.00417) [2022-07-11 12:17:59,088][25689] Fps is (10 sec: 5545.2, 60 sec: 5558.6, 300 sec: 5539.2). Total num frames: 1216696320. Throughput: 0: 5813.2. Samples: 1216702204. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:17:59,089][25689] Avg episode reward: [(0, '1.481')] [2022-07-11 12:17:59,348][26022] Updated weights on worker 0-0, policy_version 1188183 (0.00092) [2022-07-11 12:18:01,282][26022] Updated weights on worker 0-0, policy_version 1188193 (0.00091) [2022-07-11 12:18:03,619][26022] Updated weights on worker 0-0, policy_version 1188203 (0.00089) [2022-07-11 12:18:04,091][25689] Fps is (10 sec: 5303.2, 60 sec: 5526.0, 300 sec: 5535.8). Total num frames: 1216721920. Throughput: 0: 4997.7. Samples: 1216718970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:04,091][25689] Avg episode reward: [(0, '1.780')] [2022-07-11 12:18:05,262][26022] Updated weights on worker 0-0, policy_version 1188213 (0.00085) [2022-07-11 12:18:05,800][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:18:05,811][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001188216_1216733184.pth [2022-07-11 12:18:05,812][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001186266_1214736384.pth [2022-07-11 12:18:07,348][26022] Updated weights on worker 0-0, policy_version 1188223 (0.00092) [2022-07-11 12:18:08,901][26022] Updated weights on worker 0-0, policy_version 1188233 (0.00098) [2022-07-11 12:18:09,096][25689] Fps is (10 sec: 5525.1, 60 sec: 5582.5, 300 sec: 5543.3). Total num frames: 1216751616. Throughput: 0: 5731.4. Samples: 1216750492. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:09,097][25689] Avg episode reward: [(0, '1.844')] [2022-07-11 12:18:10,880][26022] Updated weights on worker 0-0, policy_version 1188243 (0.00092) [2022-07-11 12:18:12,581][26022] Updated weights on worker 0-0, policy_version 1188253 (0.00090) [2022-07-11 12:18:14,184][25689] Fps is (10 sec: 5681.4, 60 sec: 5569.4, 300 sec: 5538.7). Total num frames: 1216779264. Throughput: 0: 5730.3. Samples: 1216783846. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:14,185][25689] Avg episode reward: [(0, '1.788')] [2022-07-11 12:18:14,589][26022] Updated weights on worker 0-0, policy_version 1188263 (0.00094) [2022-07-11 12:18:16,280][26022] Updated weights on worker 0-0, policy_version 1188273 (0.00093) [2022-07-11 12:18:18,255][26022] Updated weights on worker 0-0, policy_version 1188283 (0.00083) [2022-07-11 12:18:19,192][25689] Fps is (10 sec: 5578.7, 60 sec: 5556.5, 300 sec: 5542.1). Total num frames: 1216807936. Throughput: 0: 4899.5. Samples: 1216800702. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:19,192][25689] Avg episode reward: [(0, '1.513')] [2022-07-11 12:18:19,991][26022] Updated weights on worker 0-0, policy_version 1188293 (0.00082) [2022-07-11 12:18:21,788][26022] Updated weights on worker 0-0, policy_version 1188303 (0.00093) [2022-07-11 12:18:23,601][26022] Updated weights on worker 0-0, policy_version 1188313 (0.00091) [2022-07-11 12:18:24,205][25689] Fps is (10 sec: 5620.3, 60 sec: 5564.0, 300 sec: 5542.6). Total num frames: 1216835584. Throughput: 0: 5742.0. Samples: 1216834466. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:24,205][25689] Avg episode reward: [(0, '1.734')] [2022-07-11 12:18:25,366][26022] Updated weights on worker 0-0, policy_version 1188323 (0.00082) [2022-07-11 12:18:27,319][26022] Updated weights on worker 0-0, policy_version 1188333 (0.00093) [2022-07-11 12:18:29,056][26022] Updated weights on worker 0-0, policy_version 1188343 (0.00084) [2022-07-11 12:18:29,224][25689] Fps is (10 sec: 5511.8, 60 sec: 5547.1, 300 sec: 5543.1). Total num frames: 1216863232. Throughput: 0: 5836.2. Samples: 1216867962. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:29,224][25689] Avg episode reward: [(0, '1.500')] [2022-07-11 12:18:31,059][26022] Updated weights on worker 0-0, policy_version 1188353 (0.00092) [2022-07-11 12:18:32,808][26022] Updated weights on worker 0-0, policy_version 1188363 (0.00090) [2022-07-11 12:18:34,282][25689] Fps is (10 sec: 5487.5, 60 sec: 5551.2, 300 sec: 5542.3). Total num frames: 1216890880. Throughput: 0: 5021.5. Samples: 1216884766. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:34,282][25689] Avg episode reward: [(0, '1.447')] [2022-07-11 12:18:34,613][26022] Updated weights on worker 0-0, policy_version 1188373 (0.00090) [2022-07-11 12:18:36,546][26022] Updated weights on worker 0-0, policy_version 1188383 (0.00089) [2022-07-11 12:18:38,259][26022] Updated weights on worker 0-0, policy_version 1188393 (0.00093) [2022-07-11 12:18:39,290][25689] Fps is (10 sec: 5696.6, 60 sec: 5574.0, 300 sec: 5542.4). Total num frames: 1216920576. Throughput: 0: 5862.0. Samples: 1216918520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:39,292][25689] Avg episode reward: [(0, '1.184')] [2022-07-11 12:18:40,356][26022] Updated weights on worker 0-0, policy_version 1188403 (0.00094) [2022-07-11 12:18:42,003][26022] Updated weights on worker 0-0, policy_version 1188413 (0.00085) [2022-07-11 12:18:43,819][26022] Updated weights on worker 0-0, policy_version 1188423 (0.00101) [2022-07-11 12:18:44,296][25689] Fps is (10 sec: 5726.2, 60 sec: 5561.8, 300 sec: 5546.1). Total num frames: 1216948224. Throughput: 0: 5860.7. Samples: 1216952214. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:44,296][25689] Avg episode reward: [(0, '1.117')] [2022-07-11 12:18:45,711][26022] Updated weights on worker 0-0, policy_version 1188433 (0.00088) [2022-07-11 12:18:47,295][26022] Updated weights on worker 0-0, policy_version 1188443 (0.00087) [2022-07-11 12:18:49,319][25689] Fps is (10 sec: 5411.8, 60 sec: 5545.0, 300 sec: 5540.0). Total num frames: 1216974848. Throughput: 0: 5026.7. Samples: 1216968972. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:49,319][25689] Avg episode reward: [(0, '1.564')] [2022-07-11 12:18:49,402][26022] Updated weights on worker 0-0, policy_version 1188453 (0.00095) [2022-07-11 12:18:50,992][26022] Updated weights on worker 0-0, policy_version 1188463 (0.00090) [2022-07-11 12:18:52,831][26022] Updated weights on worker 0-0, policy_version 1188473 (0.00092) [2022-07-11 12:18:54,422][25689] Fps is (10 sec: 5561.8, 60 sec: 5563.4, 300 sec: 5545.0). Total num frames: 1217004544. Throughput: 0: 5836.1. Samples: 1217002308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:54,423][25689] Avg episode reward: [(0, '1.305')] [2022-07-11 12:18:54,708][26022] Updated weights on worker 0-0, policy_version 1188483 (0.00107) [2022-07-11 12:18:56,648][26022] Updated weights on worker 0-0, policy_version 1188493 (0.00087) [2022-07-11 12:18:58,287][26022] Updated weights on worker 0-0, policy_version 1188503 (0.00082) [2022-07-11 12:18:59,448][25689] Fps is (10 sec: 5560.0, 60 sec: 5547.5, 300 sec: 5548.5). Total num frames: 1217031168. Throughput: 0: 5843.8. Samples: 1217036318. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:18:59,449][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 12:19:00,145][26022] Updated weights on worker 0-0, policy_version 1188513 (0.00092) [2022-07-11 12:19:02,418][26022] Updated weights on worker 0-0, policy_version 1188523 (0.00091) [2022-07-11 12:19:04,311][26022] Updated weights on worker 0-0, policy_version 1188533 (0.00083) [2022-07-11 12:19:04,474][25689] Fps is (10 sec: 5399.3, 60 sec: 5579.3, 300 sec: 5548.4). Total num frames: 1217058816. Throughput: 0: 5715.2. Samples: 1217067534. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:04,474][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 12:19:06,165][26022] Updated weights on worker 0-0, policy_version 1188543 (0.00100) [2022-07-11 12:19:07,983][26022] Updated weights on worker 0-0, policy_version 1188553 (0.00085) [2022-07-11 12:19:09,488][25689] Fps is (10 sec: 5507.7, 60 sec: 5544.6, 300 sec: 5543.1). Total num frames: 1217086464. Throughput: 0: 5710.6. Samples: 1217084150. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:09,489][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 12:19:09,904][26022] Updated weights on worker 0-0, policy_version 1188563 (0.00087) [2022-07-11 12:19:11,736][26022] Updated weights on worker 0-0, policy_version 1188573 (0.00083) [2022-07-11 12:19:13,502][26022] Updated weights on worker 0-0, policy_version 1188583 (0.00084) [2022-07-11 12:19:14,595][25689] Fps is (10 sec: 5463.5, 60 sec: 5542.8, 300 sec: 5545.6). Total num frames: 1217114112. Throughput: 0: 5727.5. Samples: 1217117848. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:14,596][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 12:19:15,521][26022] Updated weights on worker 0-0, policy_version 1188593 (0.00086) [2022-07-11 12:19:17,032][26022] Updated weights on worker 0-0, policy_version 1188603 (0.00096) [2022-07-11 12:19:19,229][26022] Updated weights on worker 0-0, policy_version 1188613 (0.00110) [2022-07-11 12:19:19,611][25689] Fps is (10 sec: 5462.5, 60 sec: 5525.1, 300 sec: 5542.4). Total num frames: 1217141760. Throughput: 0: 5697.2. Samples: 1217151188. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:19,612][25689] Avg episode reward: [(0, '0.413')] [2022-07-11 12:19:20,758][26022] Updated weights on worker 0-0, policy_version 1188623 (0.00088) [2022-07-11 12:19:22,795][26022] Updated weights on worker 0-0, policy_version 1188633 (0.00091) [2022-07-11 12:19:24,478][26022] Updated weights on worker 0-0, policy_version 1188643 (0.00085) [2022-07-11 12:19:24,652][25689] Fps is (10 sec: 5702.0, 60 sec: 5556.5, 300 sec: 5552.2). Total num frames: 1217171456. Throughput: 0: 4987.4. Samples: 1217168168. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:24,652][25689] Avg episode reward: [(0, '-0.356')] [2022-07-11 12:19:26,233][26022] Updated weights on worker 0-0, policy_version 1188653 (0.00087) [2022-07-11 12:19:28,160][26022] Updated weights on worker 0-0, policy_version 1188663 (0.00091) [2022-07-11 12:19:29,661][25689] Fps is (10 sec: 5705.8, 60 sec: 5557.4, 300 sec: 5547.2). Total num frames: 1217199104. Throughput: 0: 5821.0. Samples: 1217201576. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:29,661][25689] Avg episode reward: [(0, '-0.611')] [2022-07-11 12:19:30,056][26022] Updated weights on worker 0-0, policy_version 1188673 (0.00092) [2022-07-11 12:19:31,875][26022] Updated weights on worker 0-0, policy_version 1188683 (0.00086) [2022-07-11 12:19:33,809][26022] Updated weights on worker 0-0, policy_version 1188693 (0.00091) [2022-07-11 12:19:34,724][25689] Fps is (10 sec: 5388.5, 60 sec: 5540.0, 300 sec: 5540.6). Total num frames: 1217225728. Throughput: 0: 5816.1. Samples: 1217234918. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:34,724][25689] Avg episode reward: [(0, '-0.995')] [2022-07-11 12:19:35,459][26022] Updated weights on worker 0-0, policy_version 1188703 (0.00097) [2022-07-11 12:19:37,602][26022] Updated weights on worker 0-0, policy_version 1188713 (0.00085) [2022-07-11 12:19:39,202][26022] Updated weights on worker 0-0, policy_version 1188723 (0.00094) [2022-07-11 12:19:39,727][25689] Fps is (10 sec: 5595.0, 60 sec: 5540.5, 300 sec: 5547.8). Total num frames: 1217255424. Throughput: 0: 4986.9. Samples: 1217251506. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:39,728][25689] Avg episode reward: [(0, '-0.331')] [2022-07-11 12:19:41,099][26022] Updated weights on worker 0-0, policy_version 1188733 (0.00092) [2022-07-11 12:19:42,894][26022] Updated weights on worker 0-0, policy_version 1188743 (0.00058) [2022-07-11 12:19:44,702][26022] Updated weights on worker 0-0, policy_version 1188753 (0.00091) [2022-07-11 12:19:44,732][25689] Fps is (10 sec: 5729.8, 60 sec: 5540.6, 300 sec: 5547.8). Total num frames: 1217283072. Throughput: 0: 5827.0. Samples: 1217285174. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:44,732][25689] Avg episode reward: [(0, '-0.124')] [2022-07-11 12:19:46,577][26022] Updated weights on worker 0-0, policy_version 1188763 (0.00093) [2022-07-11 12:19:48,360][26022] Updated weights on worker 0-0, policy_version 1188773 (0.00089) [2022-07-11 12:19:49,736][25689] Fps is (10 sec: 5422.3, 60 sec: 5542.2, 300 sec: 5542.8). Total num frames: 1217309696. Throughput: 0: 5813.4. Samples: 1217318280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:49,737][25689] Avg episode reward: [(0, '0.682')] [2022-07-11 12:19:50,423][26022] Updated weights on worker 0-0, policy_version 1188783 (0.00085) [2022-07-11 12:19:52,038][26022] Updated weights on worker 0-0, policy_version 1188793 (0.00087) [2022-07-11 12:19:54,059][26022] Updated weights on worker 0-0, policy_version 1188803 (0.00082) [2022-07-11 12:19:54,883][25689] Fps is (10 sec: 5447.3, 60 sec: 5521.3, 300 sec: 5540.1). Total num frames: 1217338368. Throughput: 0: 4957.7. Samples: 1217334862. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:54,883][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 12:19:55,889][26022] Updated weights on worker 0-0, policy_version 1188813 (0.00098) [2022-07-11 12:19:57,610][26022] Updated weights on worker 0-0, policy_version 1188823 (0.00096) [2022-07-11 12:19:59,423][26022] Updated weights on worker 0-0, policy_version 1188833 (0.00081) [2022-07-11 12:19:59,905][25689] Fps is (10 sec: 5639.0, 60 sec: 5555.5, 300 sec: 5544.3). Total num frames: 1217367040. Throughput: 0: 5795.7. Samples: 1217368454. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:19:59,906][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 12:20:01,454][26022] Updated weights on worker 0-0, policy_version 1188843 (0.00094) [2022-07-11 12:20:03,533][26022] Updated weights on worker 0-0, policy_version 1188853 (0.00085) [2022-07-11 12:20:04,915][25689] Fps is (10 sec: 5307.5, 60 sec: 5506.2, 300 sec: 5540.8). Total num frames: 1217391616. Throughput: 0: 5688.7. Samples: 1217399994. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:04,916][25689] Avg episode reward: [(0, '0.356')] [2022-07-11 12:20:05,392][26022] Updated weights on worker 0-0, policy_version 1188863 (0.00084) [2022-07-11 12:20:06,070][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:20:06,079][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001188866_1217398784.pth [2022-07-11 12:20:06,091][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001186914_1215399936.pth [2022-07-11 12:20:07,168][26022] Updated weights on worker 0-0, policy_version 1188873 (0.00085) [2022-07-11 12:20:09,146][26022] Updated weights on worker 0-0, policy_version 1188883 (0.00091) [2022-07-11 12:20:09,962][25689] Fps is (10 sec: 5193.2, 60 sec: 5503.2, 300 sec: 5538.4). Total num frames: 1217419264. Throughput: 0: 4868.2. Samples: 1217416742. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:09,962][25689] Avg episode reward: [(0, '0.764')] [2022-07-11 12:20:10,919][26022] Updated weights on worker 0-0, policy_version 1188893 (0.00087) [2022-07-11 12:20:12,992][26022] Updated weights on worker 0-0, policy_version 1188903 (0.00087) [2022-07-11 12:20:14,463][26022] Updated weights on worker 0-0, policy_version 1188913 (0.00087) [2022-07-11 12:20:15,025][25689] Fps is (10 sec: 5672.2, 60 sec: 5541.1, 300 sec: 5542.3). Total num frames: 1217448960. Throughput: 0: 5725.8. Samples: 1217450192. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:15,026][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 12:20:16,607][26022] Updated weights on worker 0-0, policy_version 1188923 (0.00091) [2022-07-11 12:20:18,245][26022] Updated weights on worker 0-0, policy_version 1188933 (0.00083) [2022-07-11 12:20:20,109][25689] Fps is (10 sec: 5651.2, 60 sec: 5534.9, 300 sec: 5541.3). Total num frames: 1217476608. Throughput: 0: 5693.5. Samples: 1217483482. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:20,109][25689] Avg episode reward: [(0, '0.572')] [2022-07-11 12:20:20,253][26022] Updated weights on worker 0-0, policy_version 1188943 (0.00080) [2022-07-11 12:20:21,907][26022] Updated weights on worker 0-0, policy_version 1188953 (0.00080) [2022-07-11 12:20:24,064][26022] Updated weights on worker 0-0, policy_version 1188963 (0.00086) [2022-07-11 12:20:25,119][25689] Fps is (10 sec: 5681.2, 60 sec: 5537.7, 300 sec: 5544.7). Total num frames: 1217506304. Throughput: 0: 4955.3. Samples: 1217500106. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:25,119][25689] Avg episode reward: [(0, '0.628')] [2022-07-11 12:20:25,730][26022] Updated weights on worker 0-0, policy_version 1188973 (0.00084) [2022-07-11 12:20:27,557][26022] Updated weights on worker 0-0, policy_version 1188983 (0.00083) [2022-07-11 12:20:29,382][26022] Updated weights on worker 0-0, policy_version 1188993 (0.00084) [2022-07-11 12:20:30,144][25689] Fps is (10 sec: 5612.0, 60 sec: 5519.3, 300 sec: 5541.8). Total num frames: 1217532928. Throughput: 0: 5798.7. Samples: 1217533776. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:30,145][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 12:20:31,164][26022] Updated weights on worker 0-0, policy_version 1189003 (0.00087) [2022-07-11 12:20:33,140][26022] Updated weights on worker 0-0, policy_version 1189013 (0.00092) [2022-07-11 12:20:34,849][26022] Updated weights on worker 0-0, policy_version 1189023 (0.00085) [2022-07-11 12:20:35,219][25689] Fps is (10 sec: 5373.4, 60 sec: 5535.1, 300 sec: 5540.6). Total num frames: 1217560576. Throughput: 0: 5789.1. Samples: 1217567096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 12:20:35,219][25689] Avg episode reward: [(0, '0.685')] [2022-07-11 12:20:36,543][26022] Updated weights on worker 0-0, policy_version 1189033 (0.00088) [2022-07-11 12:20:38,621][26022] Updated weights on worker 0-0, policy_version 1189043 (0.00080) [2022-07-11 12:20:40,243][25689] Fps is (10 sec: 5576.9, 60 sec: 5516.3, 300 sec: 5537.1). Total num frames: 1217589248. Throughput: 0: 4983.9. Samples: 1217583830. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:20:40,244][25689] Avg episode reward: [(0, '0.709')] [2022-07-11 12:20:40,459][26022] Updated weights on worker 0-0, policy_version 1189053 (0.00094) [2022-07-11 12:20:42,468][26022] Updated weights on worker 0-0, policy_version 1189063 (0.00087) [2022-07-11 12:20:43,991][26022] Updated weights on worker 0-0, policy_version 1189073 (0.00089) [2022-07-11 12:20:45,259][25689] Fps is (10 sec: 5507.6, 60 sec: 5498.4, 300 sec: 5537.5). Total num frames: 1217615872. Throughput: 0: 5817.9. Samples: 1217617278. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:20:45,259][25689] Avg episode reward: [(0, '0.872')] [2022-07-11 12:20:45,743][26022] Updated weights on worker 0-0, policy_version 1189083 (0.00087) [2022-07-11 12:20:47,841][26022] Updated weights on worker 0-0, policy_version 1189093 (0.00097) [2022-07-11 12:20:49,538][26022] Updated weights on worker 0-0, policy_version 1189103 (0.00087) [2022-07-11 12:20:50,274][25689] Fps is (10 sec: 5410.5, 60 sec: 5514.3, 300 sec: 5535.0). Total num frames: 1217643520. Throughput: 0: 5799.9. Samples: 1217650526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:20:50,275][25689] Avg episode reward: [(0, '1.135')] [2022-07-11 12:20:51,548][26022] Updated weights on worker 0-0, policy_version 1189113 (0.00089) [2022-07-11 12:20:53,557][26022] Updated weights on worker 0-0, policy_version 1189123 (0.00086) [2022-07-11 12:20:55,115][26022] Updated weights on worker 0-0, policy_version 1189133 (0.00091) [2022-07-11 12:20:55,310][25689] Fps is (10 sec: 5705.0, 60 sec: 5541.3, 300 sec: 5538.4). Total num frames: 1217673216. Throughput: 0: 4979.9. Samples: 1217667148. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:20:55,311][25689] Avg episode reward: [(0, '1.913')] [2022-07-11 12:20:57,025][26022] Updated weights on worker 0-0, policy_version 1189143 (0.00087) [2022-07-11 12:20:58,861][26022] Updated weights on worker 0-0, policy_version 1189153 (0.00094) [2022-07-11 12:21:00,332][25689] Fps is (10 sec: 5701.4, 60 sec: 5524.4, 300 sec: 5542.0). Total num frames: 1217700864. Throughput: 0: 5813.6. Samples: 1217700618. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:00,333][25689] Avg episode reward: [(0, '1.679')] [2022-07-11 12:21:00,771][26022] Updated weights on worker 0-0, policy_version 1189163 (0.00090) [2022-07-11 12:21:02,963][26022] Updated weights on worker 0-0, policy_version 1189173 (0.00093) [2022-07-11 12:21:04,446][26022] Updated weights on worker 0-0, policy_version 1189183 (0.00085) [2022-07-11 12:21:05,367][25689] Fps is (10 sec: 5294.9, 60 sec: 5539.1, 300 sec: 5535.6). Total num frames: 1217726464. Throughput: 0: 5731.8. Samples: 1217732534. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:05,367][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 12:21:06,570][26022] Updated weights on worker 0-0, policy_version 1189193 (0.00082) [2022-07-11 12:21:08,286][26022] Updated weights on worker 0-0, policy_version 1189203 (0.00086) [2022-07-11 12:21:10,235][26022] Updated weights on worker 0-0, policy_version 1189213 (0.00087) [2022-07-11 12:21:10,396][25689] Fps is (10 sec: 5392.8, 60 sec: 5557.6, 300 sec: 5539.1). Total num frames: 1217755136. Throughput: 0: 4903.9. Samples: 1217749200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:10,396][25689] Avg episode reward: [(0, '0.988')] [2022-07-11 12:21:11,951][26022] Updated weights on worker 0-0, policy_version 1189223 (0.00088) [2022-07-11 12:21:13,812][26022] Updated weights on worker 0-0, policy_version 1189233 (0.00085) [2022-07-11 12:21:15,417][26022] Updated weights on worker 0-0, policy_version 1189243 (0.00081) [2022-07-11 12:21:15,479][25689] Fps is (10 sec: 5771.7, 60 sec: 5555.8, 300 sec: 5539.4). Total num frames: 1217784832. Throughput: 0: 5737.9. Samples: 1217782878. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:15,480][25689] Avg episode reward: [(0, '1.290')] [2022-07-11 12:21:17,539][26022] Updated weights on worker 0-0, policy_version 1189253 (0.00082) [2022-07-11 12:21:19,161][26022] Updated weights on worker 0-0, policy_version 1189263 (0.00083) [2022-07-11 12:21:20,497][25689] Fps is (10 sec: 5474.1, 60 sec: 5527.9, 300 sec: 5533.0). Total num frames: 1217810432. Throughput: 0: 5756.2. Samples: 1217816692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:20,498][25689] Avg episode reward: [(0, '1.140')] [2022-07-11 12:21:21,144][26022] Updated weights on worker 0-0, policy_version 1189273 (0.00093) [2022-07-11 12:21:22,908][26022] Updated weights on worker 0-0, policy_version 1189283 (0.00081) [2022-07-11 12:21:24,798][26022] Updated weights on worker 0-0, policy_version 1189293 (0.00087) [2022-07-11 12:21:25,505][25689] Fps is (10 sec: 5413.2, 60 sec: 5511.2, 300 sec: 5537.2). Total num frames: 1217839104. Throughput: 0: 5012.9. Samples: 1217833486. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:25,506][25689] Avg episode reward: [(0, '0.877')] [2022-07-11 12:21:26,593][26022] Updated weights on worker 0-0, policy_version 1189303 (0.00089) [2022-07-11 12:21:28,554][26022] Updated weights on worker 0-0, policy_version 1189313 (0.00084) [2022-07-11 12:21:30,367][26022] Updated weights on worker 0-0, policy_version 1189323 (0.00085) [2022-07-11 12:21:30,512][25689] Fps is (10 sec: 5623.5, 60 sec: 5529.8, 300 sec: 5534.5). Total num frames: 1217866752. Throughput: 0: 5848.8. Samples: 1217866858. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:30,513][25689] Avg episode reward: [(0, '0.900')] [2022-07-11 12:21:32,158][26022] Updated weights on worker 0-0, policy_version 1189333 (0.00085) [2022-07-11 12:21:33,887][26022] Updated weights on worker 0-0, policy_version 1189343 (0.00088) [2022-07-11 12:21:35,587][25689] Fps is (10 sec: 5687.8, 60 sec: 5563.7, 300 sec: 5536.9). Total num frames: 1217896448. Throughput: 0: 5864.6. Samples: 1217900802. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:35,588][25689] Avg episode reward: [(0, '1.735')] [2022-07-11 12:21:35,796][26022] Updated weights on worker 0-0, policy_version 1189353 (0.00086) [2022-07-11 12:21:37,577][26022] Updated weights on worker 0-0, policy_version 1189363 (0.00089) [2022-07-11 12:21:39,422][26022] Updated weights on worker 0-0, policy_version 1189373 (0.00097) [2022-07-11 12:21:40,629][25689] Fps is (10 sec: 5668.5, 60 sec: 5545.2, 300 sec: 5544.3). Total num frames: 1217924096. Throughput: 0: 5019.9. Samples: 1217917750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:40,629][25689] Avg episode reward: [(0, '1.514')] [2022-07-11 12:21:41,254][26022] Updated weights on worker 0-0, policy_version 1189383 (0.00085) [2022-07-11 12:21:42,992][26022] Updated weights on worker 0-0, policy_version 1189393 (0.00089) [2022-07-11 12:21:45,047][26022] Updated weights on worker 0-0, policy_version 1189403 (0.00085) [2022-07-11 12:21:45,660][25689] Fps is (10 sec: 5591.2, 60 sec: 5577.6, 300 sec: 5544.0). Total num frames: 1217952768. Throughput: 0: 5844.4. Samples: 1217951280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:45,661][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 12:21:46,715][26022] Updated weights on worker 0-0, policy_version 1189413 (0.00084) [2022-07-11 12:21:48,566][26022] Updated weights on worker 0-0, policy_version 1189423 (0.00087) [2022-07-11 12:21:50,517][26022] Updated weights on worker 0-0, policy_version 1189433 (0.00092) [2022-07-11 12:21:50,753][25689] Fps is (10 sec: 5562.6, 60 sec: 5570.4, 300 sec: 5544.7). Total num frames: 1217980416. Throughput: 0: 5818.9. Samples: 1217984638. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:50,754][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 12:21:52,238][26022] Updated weights on worker 0-0, policy_version 1189443 (0.00088) [2022-07-11 12:21:54,166][26022] Updated weights on worker 0-0, policy_version 1189453 (0.00083) [2022-07-11 12:21:55,833][25689] Fps is (10 sec: 5536.1, 60 sec: 5549.5, 300 sec: 5547.8). Total num frames: 1218009088. Throughput: 0: 5779.3. Samples: 1218017810. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:21:55,834][25689] Avg episode reward: [(0, '0.729')] [2022-07-11 12:21:56,076][26022] Updated weights on worker 0-0, policy_version 1189463 (0.00095) [2022-07-11 12:21:57,885][26022] Updated weights on worker 0-0, policy_version 1189473 (0.00083) [2022-07-11 12:21:59,974][26022] Updated weights on worker 0-0, policy_version 1189483 (0.00086) [2022-07-11 12:22:00,933][25689] Fps is (10 sec: 5532.3, 60 sec: 5542.3, 300 sec: 5546.2). Total num frames: 1218036736. Throughput: 0: 5736.8. Samples: 1218034236. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:00,934][25689] Avg episode reward: [(0, '1.133')] [2022-07-11 12:22:01,647][26022] Updated weights on worker 0-0, policy_version 1189493 (0.00094) [2022-07-11 12:22:03,737][26022] Updated weights on worker 0-0, policy_version 1189503 (0.00093) [2022-07-11 12:22:05,838][26022] Updated weights on worker 0-0, policy_version 1189513 (0.00084) [2022-07-11 12:22:05,936][25689] Fps is (10 sec: 5169.3, 60 sec: 5528.3, 300 sec: 5540.6). Total num frames: 1218061312. Throughput: 0: 5627.8. Samples: 1218065390. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:05,937][25689] Avg episode reward: [(0, '1.183')] [2022-07-11 12:22:06,194][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:22:06,202][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001189515_1218063360.pth [2022-07-11 12:22:06,203][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001187567_1216068608.pth [2022-07-11 12:22:07,355][26022] Updated weights on worker 0-0, policy_version 1189523 (0.00086) [2022-07-11 12:22:09,466][26022] Updated weights on worker 0-0, policy_version 1189533 (0.00085) [2022-07-11 12:22:10,965][25689] Fps is (10 sec: 5409.9, 60 sec: 5545.2, 300 sec: 5545.9). Total num frames: 1218091008. Throughput: 0: 5640.4. Samples: 1218098642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:10,966][25689] Avg episode reward: [(0, '0.801')] [2022-07-11 12:22:11,167][26022] Updated weights on worker 0-0, policy_version 1189543 (0.00085) [2022-07-11 12:22:13,159][26022] Updated weights on worker 0-0, policy_version 1189553 (0.00094) [2022-07-11 12:22:15,110][26022] Updated weights on worker 0-0, policy_version 1189563 (0.00089) [2022-07-11 12:22:16,053][25689] Fps is (10 sec: 5566.9, 60 sec: 5494.1, 300 sec: 5534.9). Total num frames: 1218117632. Throughput: 0: 4812.3. Samples: 1218115114. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:16,054][25689] Avg episode reward: [(0, '1.310')] [2022-07-11 12:22:16,851][26022] Updated weights on worker 0-0, policy_version 1189573 (0.00076) [2022-07-11 12:22:18,656][26022] Updated weights on worker 0-0, policy_version 1189583 (0.00096) [2022-07-11 12:22:20,730][26022] Updated weights on worker 0-0, policy_version 1189593 (0.00095) [2022-07-11 12:22:21,079][25689] Fps is (10 sec: 5366.1, 60 sec: 5527.2, 300 sec: 5536.2). Total num frames: 1218145280. Throughput: 0: 5658.2. Samples: 1218148224. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:21,081][25689] Avg episode reward: [(0, '1.461')] [2022-07-11 12:22:22,218][26022] Updated weights on worker 0-0, policy_version 1189603 (0.00089) [2022-07-11 12:22:24,278][26022] Updated weights on worker 0-0, policy_version 1189613 (0.00088) [2022-07-11 12:22:25,839][26022] Updated weights on worker 0-0, policy_version 1189623 (0.00108) [2022-07-11 12:22:26,094][25689] Fps is (10 sec: 5609.2, 60 sec: 5526.6, 300 sec: 5536.3). Total num frames: 1218173952. Throughput: 0: 5773.4. Samples: 1218181768. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:26,094][25689] Avg episode reward: [(0, '0.541')] [2022-07-11 12:22:27,961][26022] Updated weights on worker 0-0, policy_version 1189633 (0.00621) [2022-07-11 12:22:29,685][26022] Updated weights on worker 0-0, policy_version 1189643 (0.00088) [2022-07-11 12:22:31,103][25689] Fps is (10 sec: 5516.7, 60 sec: 5509.5, 300 sec: 5534.6). Total num frames: 1218200576. Throughput: 0: 4953.6. Samples: 1218198394. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:31,104][25689] Avg episode reward: [(0, '-0.771')] [2022-07-11 12:22:31,624][26022] Updated weights on worker 0-0, policy_version 1189653 (0.00086) [2022-07-11 12:22:33,333][26022] Updated weights on worker 0-0, policy_version 1189663 (0.00085) [2022-07-11 12:22:35,366][26022] Updated weights on worker 0-0, policy_version 1189673 (0.00085) [2022-07-11 12:22:36,211][25689] Fps is (10 sec: 5465.7, 60 sec: 5489.6, 300 sec: 5533.9). Total num frames: 1218229248. Throughput: 0: 5803.6. Samples: 1218232100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:36,211][25689] Avg episode reward: [(0, '-0.619')] [2022-07-11 12:22:37,148][26022] Updated weights on worker 0-0, policy_version 1189683 (0.00092) [2022-07-11 12:22:38,943][26022] Updated weights on worker 0-0, policy_version 1189693 (0.00089) [2022-07-11 12:22:40,709][26022] Updated weights on worker 0-0, policy_version 1189703 (0.00091) [2022-07-11 12:22:41,234][25689] Fps is (10 sec: 5862.3, 60 sec: 5542.0, 300 sec: 5541.4). Total num frames: 1218259968. Throughput: 0: 5843.8. Samples: 1218266004. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:41,234][25689] Avg episode reward: [(0, '-1.099')] [2022-07-11 12:22:42,579][26022] Updated weights on worker 0-0, policy_version 1189713 (0.00086) [2022-07-11 12:22:44,318][26022] Updated weights on worker 0-0, policy_version 1189723 (0.00086) [2022-07-11 12:22:46,243][25689] Fps is (10 sec: 5614.0, 60 sec: 5493.3, 300 sec: 5534.8). Total num frames: 1218285568. Throughput: 0: 5017.4. Samples: 1218282862. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:46,243][25689] Avg episode reward: [(0, '-1.410')] [2022-07-11 12:22:46,367][26022] Updated weights on worker 0-0, policy_version 1189733 (0.00084) [2022-07-11 12:22:47,955][26022] Updated weights on worker 0-0, policy_version 1189743 (0.00086) [2022-07-11 12:22:50,007][26022] Updated weights on worker 0-0, policy_version 1189753 (0.00094) [2022-07-11 12:22:51,257][25689] Fps is (10 sec: 5414.9, 60 sec: 5517.5, 300 sec: 5536.8). Total num frames: 1218314240. Throughput: 0: 5852.7. Samples: 1218316348. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:51,257][25689] Avg episode reward: [(0, '-1.322')] [2022-07-11 12:22:51,619][26022] Updated weights on worker 0-0, policy_version 1189763 (0.00100) [2022-07-11 12:22:53,601][26022] Updated weights on worker 0-0, policy_version 1189773 (0.00087) [2022-07-11 12:22:55,338][26022] Updated weights on worker 0-0, policy_version 1189783 (0.00084) [2022-07-11 12:22:56,301][25689] Fps is (10 sec: 5700.9, 60 sec: 5520.7, 300 sec: 5540.1). Total num frames: 1218342912. Throughput: 0: 5871.2. Samples: 1218350056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:22:56,302][25689] Avg episode reward: [(0, '-0.602')] [2022-07-11 12:22:57,092][26022] Updated weights on worker 0-0, policy_version 1189793 (0.00097) [2022-07-11 12:22:58,999][26022] Updated weights on worker 0-0, policy_version 1189803 (0.00114) [2022-07-11 12:23:00,888][26022] Updated weights on worker 0-0, policy_version 1189813 (0.00083) [2022-07-11 12:23:01,312][25689] Fps is (10 sec: 5600.8, 60 sec: 5528.8, 300 sec: 5546.8). Total num frames: 1218370560. Throughput: 0: 5022.2. Samples: 1218366842. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:01,313][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 12:23:03,027][26022] Updated weights on worker 0-0, policy_version 1189823 (0.00088) [2022-07-11 12:23:04,920][26022] Updated weights on worker 0-0, policy_version 1189833 (0.00086) [2022-07-11 12:23:06,320][25689] Fps is (10 sec: 5314.9, 60 sec: 5545.3, 300 sec: 5533.0). Total num frames: 1218396160. Throughput: 0: 5743.3. Samples: 1218398172. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:06,321][25689] Avg episode reward: [(0, '1.031')] [2022-07-11 12:23:06,681][26022] Updated weights on worker 0-0, policy_version 1189843 (0.00085) [2022-07-11 12:23:08,483][26022] Updated weights on worker 0-0, policy_version 1189853 (0.00087) [2022-07-11 12:23:10,465][26022] Updated weights on worker 0-0, policy_version 1189863 (0.00083) [2022-07-11 12:23:11,326][25689] Fps is (10 sec: 5419.7, 60 sec: 5530.5, 300 sec: 5538.0). Total num frames: 1218424832. Throughput: 0: 5747.4. Samples: 1218431696. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:11,327][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 12:23:12,253][26022] Updated weights on worker 0-0, policy_version 1189873 (0.00089) [2022-07-11 12:23:14,126][26022] Updated weights on worker 0-0, policy_version 1189883 (0.00090) [2022-07-11 12:23:16,028][26022] Updated weights on worker 0-0, policy_version 1189893 (0.00089) [2022-07-11 12:23:16,367][25689] Fps is (10 sec: 5503.8, 60 sec: 5534.8, 300 sec: 5530.5). Total num frames: 1218451456. Throughput: 0: 4894.1. Samples: 1218448258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:16,367][25689] Avg episode reward: [(0, '0.666')] [2022-07-11 12:23:17,873][26022] Updated weights on worker 0-0, policy_version 1189903 (0.00086) [2022-07-11 12:23:19,692][26022] Updated weights on worker 0-0, policy_version 1189913 (0.00053) [2022-07-11 12:23:21,382][25689] Fps is (10 sec: 5499.1, 60 sec: 5552.8, 300 sec: 5533.9). Total num frames: 1218480128. Throughput: 0: 5737.2. Samples: 1218481984. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:21,382][25689] Avg episode reward: [(0, '1.263')] [2022-07-11 12:23:21,504][26022] Updated weights on worker 0-0, policy_version 1189923 (0.00091) [2022-07-11 12:23:23,287][26022] Updated weights on worker 0-0, policy_version 1189933 (0.00090) [2022-07-11 12:23:25,259][26022] Updated weights on worker 0-0, policy_version 1189943 (0.00089) [2022-07-11 12:23:26,402][25689] Fps is (10 sec: 5714.5, 60 sec: 5552.3, 300 sec: 5537.3). Total num frames: 1218508800. Throughput: 0: 5828.3. Samples: 1218515214. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:26,402][25689] Avg episode reward: [(0, '0.979')] [2022-07-11 12:23:26,951][26022] Updated weights on worker 0-0, policy_version 1189953 (0.00080) [2022-07-11 12:23:28,756][26022] Updated weights on worker 0-0, policy_version 1189963 (0.00093) [2022-07-11 12:23:30,671][26022] Updated weights on worker 0-0, policy_version 1189973 (0.00088) [2022-07-11 12:23:31,411][25689] Fps is (10 sec: 5411.3, 60 sec: 5535.3, 300 sec: 5531.3). Total num frames: 1218534400. Throughput: 0: 5000.3. Samples: 1218532126. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:31,412][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 12:23:32,284][26022] Updated weights on worker 0-0, policy_version 1189983 (0.00087) [2022-07-11 12:23:34,527][26022] Updated weights on worker 0-0, policy_version 1189993 (0.00085) [2022-07-11 12:23:35,951][26022] Updated weights on worker 0-0, policy_version 1190003 (0.00093) [2022-07-11 12:23:36,554][25689] Fps is (10 sec: 5547.3, 60 sec: 5566.0, 300 sec: 5532.3). Total num frames: 1218565120. Throughput: 0: 5811.3. Samples: 1218565574. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:36,556][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 12:23:38,219][26022] Updated weights on worker 0-0, policy_version 1190013 (0.00092) [2022-07-11 12:23:39,663][26022] Updated weights on worker 0-0, policy_version 1190023 (0.00085) [2022-07-11 12:23:41,632][25689] Fps is (10 sec: 5710.5, 60 sec: 5510.1, 300 sec: 5530.9). Total num frames: 1218592768. Throughput: 0: 5781.3. Samples: 1218599058. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:41,634][25689] Avg episode reward: [(0, '0.265')] [2022-07-11 12:23:41,852][26022] Updated weights on worker 0-0, policy_version 1190033 (0.00090) [2022-07-11 12:23:43,482][26022] Updated weights on worker 0-0, policy_version 1190043 (0.00086) [2022-07-11 12:23:45,568][26022] Updated weights on worker 0-0, policy_version 1190053 (0.00085) [2022-07-11 12:23:46,664][25689] Fps is (10 sec: 5469.7, 60 sec: 5541.9, 300 sec: 5534.2). Total num frames: 1218620416. Throughput: 0: 4957.2. Samples: 1218615658. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:46,664][25689] Avg episode reward: [(0, '0.104')] [2022-07-11 12:23:47,216][26022] Updated weights on worker 0-0, policy_version 1190063 (0.00090) [2022-07-11 12:23:49,211][26022] Updated weights on worker 0-0, policy_version 1190073 (0.00091) [2022-07-11 12:23:50,931][26022] Updated weights on worker 0-0, policy_version 1190083 (0.00479) [2022-07-11 12:23:51,679][25689] Fps is (10 sec: 5605.9, 60 sec: 5541.8, 300 sec: 5532.4). Total num frames: 1218649088. Throughput: 0: 5756.6. Samples: 1218648800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:51,680][25689] Avg episode reward: [(0, '0.044')] [2022-07-11 12:23:52,872][26022] Updated weights on worker 0-0, policy_version 1190093 (0.00087) [2022-07-11 12:23:54,714][26022] Updated weights on worker 0-0, policy_version 1190103 (0.00089) [2022-07-11 12:23:56,719][26022] Updated weights on worker 0-0, policy_version 1190113 (0.00086) [2022-07-11 12:23:56,816][25689] Fps is (10 sec: 5446.5, 60 sec: 5499.5, 300 sec: 5530.3). Total num frames: 1218675712. Throughput: 0: 5727.1. Samples: 1218681616. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:23:56,817][25689] Avg episode reward: [(0, '-0.495')] [2022-07-11 12:23:58,288][26022] Updated weights on worker 0-0, policy_version 1190123 (0.00083) [2022-07-11 12:24:00,399][26022] Updated weights on worker 0-0, policy_version 1190133 (0.00085) [2022-07-11 12:24:01,827][25689] Fps is (10 sec: 5247.2, 60 sec: 5482.6, 300 sec: 5527.2). Total num frames: 1218702336. Throughput: 0: 4918.2. Samples: 1218698378. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:01,827][25689] Avg episode reward: [(0, '-0.629')] [2022-07-11 12:24:02,274][26022] Updated weights on worker 0-0, policy_version 1190143 (0.00087) [2022-07-11 12:24:04,116][26022] Updated weights on worker 0-0, policy_version 1190153 (0.01315) [2022-07-11 12:24:06,211][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:24:06,220][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001190163_1218726912.pth [2022-07-11 12:24:06,220][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001188216_1216733184.pth [2022-07-11 12:24:06,222][26022] Updated weights on worker 0-0, policy_version 1190163 (0.00084) [2022-07-11 12:24:06,845][25689] Fps is (10 sec: 5411.9, 60 sec: 5515.5, 300 sec: 5527.1). Total num frames: 1218729984. Throughput: 0: 5666.7. Samples: 1218730016. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:06,845][25689] Avg episode reward: [(0, '-0.513')] [2022-07-11 12:24:07,895][26022] Updated weights on worker 0-0, policy_version 1190173 (0.00090) [2022-07-11 12:24:09,830][26022] Updated weights on worker 0-0, policy_version 1190183 (0.00089) [2022-07-11 12:24:11,717][26022] Updated weights on worker 0-0, policy_version 1190193 (0.00080) [2022-07-11 12:24:11,864][25689] Fps is (10 sec: 5611.2, 60 sec: 5514.3, 300 sec: 5532.2). Total num frames: 1218758656. Throughput: 0: 5674.5. Samples: 1218763340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:11,864][25689] Avg episode reward: [(0, '-0.377')] [2022-07-11 12:24:13,545][26022] Updated weights on worker 0-0, policy_version 1190203 (0.00087) [2022-07-11 12:24:15,323][26022] Updated weights on worker 0-0, policy_version 1190213 (0.00084) [2022-07-11 12:24:16,904][25689] Fps is (10 sec: 5598.7, 60 sec: 5531.3, 300 sec: 5531.7). Total num frames: 1218786304. Throughput: 0: 4902.5. Samples: 1218780096. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:16,905][25689] Avg episode reward: [(0, '0.416')] [2022-07-11 12:24:17,048][26022] Updated weights on worker 0-0, policy_version 1190223 (0.00096) [2022-07-11 12:24:19,004][26022] Updated weights on worker 0-0, policy_version 1190233 (0.00093) [2022-07-11 12:24:20,728][26022] Updated weights on worker 0-0, policy_version 1190243 (0.00088) [2022-07-11 12:24:21,919][25689] Fps is (10 sec: 5600.8, 60 sec: 5531.2, 300 sec: 5528.8). Total num frames: 1218814976. Throughput: 0: 5744.0. Samples: 1218813790. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:21,920][25689] Avg episode reward: [(0, '0.300')] [2022-07-11 12:24:22,839][26022] Updated weights on worker 0-0, policy_version 1190253 (0.00088) [2022-07-11 12:24:24,447][26022] Updated weights on worker 0-0, policy_version 1190263 (0.00081) [2022-07-11 12:24:26,637][26022] Updated weights on worker 0-0, policy_version 1190273 (0.00084) [2022-07-11 12:24:26,932][25689] Fps is (10 sec: 5514.2, 60 sec: 5498.1, 300 sec: 5525.2). Total num frames: 1218841600. Throughput: 0: 5818.5. Samples: 1218846896. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:26,933][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 12:24:28,124][26022] Updated weights on worker 0-0, policy_version 1190283 (0.00085) [2022-07-11 12:24:30,240][26022] Updated weights on worker 0-0, policy_version 1190293 (0.00098) [2022-07-11 12:24:31,952][25689] Fps is (10 sec: 5409.6, 60 sec: 5530.9, 300 sec: 5529.5). Total num frames: 1218869248. Throughput: 0: 5000.7. Samples: 1218863794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:31,953][25689] Avg episode reward: [(0, '0.841')] [2022-07-11 12:24:31,959][26022] Updated weights on worker 0-0, policy_version 1190303 (0.00090) [2022-07-11 12:24:33,815][26022] Updated weights on worker 0-0, policy_version 1190313 (0.00085) [2022-07-11 12:24:35,561][26022] Updated weights on worker 0-0, policy_version 1190323 (0.00092) [2022-07-11 12:24:36,991][25689] Fps is (10 sec: 5497.4, 60 sec: 5489.7, 300 sec: 5521.9). Total num frames: 1218896896. Throughput: 0: 5829.3. Samples: 1218897186. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2022-07-11 12:24:36,993][25689] Avg episode reward: [(0, '1.124')] [2022-07-11 12:24:37,418][26022] Updated weights on worker 0-0, policy_version 1190333 (0.00087) [2022-07-11 12:24:39,323][26022] Updated weights on worker 0-0, policy_version 1190343 (0.00093) [2022-07-11 12:24:40,928][26022] Updated weights on worker 0-0, policy_version 1190353 (0.00087) [2022-07-11 12:24:42,008][25689] Fps is (10 sec: 5702.5, 60 sec: 5529.1, 300 sec: 5528.6). Total num frames: 1218926592. Throughput: 0: 5833.4. Samples: 1218930974. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:24:42,008][25689] Avg episode reward: [(0, '1.291')] [2022-07-11 12:24:42,895][26022] Updated weights on worker 0-0, policy_version 1190363 (0.00093) [2022-07-11 12:24:44,634][26022] Updated weights on worker 0-0, policy_version 1190373 (0.00084) [2022-07-11 12:24:46,598][26022] Updated weights on worker 0-0, policy_version 1190383 (0.00053) [2022-07-11 12:24:47,027][25689] Fps is (10 sec: 5611.8, 60 sec: 5513.3, 300 sec: 5528.3). Total num frames: 1218953216. Throughput: 0: 5020.3. Samples: 1218947776. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:24:47,027][25689] Avg episode reward: [(0, '0.696')] [2022-07-11 12:24:48,392][26022] Updated weights on worker 0-0, policy_version 1190393 (0.00087) [2022-07-11 12:24:50,172][26022] Updated weights on worker 0-0, policy_version 1190403 (0.00087) [2022-07-11 12:24:52,013][26022] Updated weights on worker 0-0, policy_version 1190413 (0.00086) [2022-07-11 12:24:52,055][25689] Fps is (10 sec: 5605.7, 60 sec: 5529.1, 300 sec: 5534.0). Total num frames: 1218982912. Throughput: 0: 5858.4. Samples: 1218981564. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:24:52,056][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 12:24:53,816][26022] Updated weights on worker 0-0, policy_version 1190423 (0.00105) [2022-07-11 12:24:55,768][26022] Updated weights on worker 0-0, policy_version 1190433 (0.00088) [2022-07-11 12:24:57,150][25689] Fps is (10 sec: 5765.4, 60 sec: 5566.9, 300 sec: 5532.6). Total num frames: 1219011584. Throughput: 0: 5831.3. Samples: 1219014744. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:24:57,152][25689] Avg episode reward: [(0, '1.083')] [2022-07-11 12:24:57,639][26022] Updated weights on worker 0-0, policy_version 1190443 (0.00095) [2022-07-11 12:24:59,587][26022] Updated weights on worker 0-0, policy_version 1190453 (0.00081) [2022-07-11 12:25:01,180][26022] Updated weights on worker 0-0, policy_version 1190463 (0.00083) [2022-07-11 12:25:02,156][25689] Fps is (10 sec: 5372.6, 60 sec: 5550.3, 300 sec: 5536.1). Total num frames: 1219037184. Throughput: 0: 5774.0. Samples: 1219047312. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:02,163][25689] Avg episode reward: [(0, '0.957')] [2022-07-11 12:25:03,659][26022] Updated weights on worker 0-0, policy_version 1190473 (0.00089) [2022-07-11 12:25:05,327][26022] Updated weights on worker 0-0, policy_version 1190483 (0.00093) [2022-07-11 12:25:07,223][25689] Fps is (10 sec: 5184.6, 60 sec: 5528.8, 300 sec: 5532.3). Total num frames: 1219063808. Throughput: 0: 5699.5. Samples: 1219062888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:07,225][25689] Avg episode reward: [(0, '0.617')] [2022-07-11 12:25:07,271][26022] Updated weights on worker 0-0, policy_version 1190493 (0.00084) [2022-07-11 12:25:08,935][26022] Updated weights on worker 0-0, policy_version 1190503 (0.00088) [2022-07-11 12:25:10,770][26022] Updated weights on worker 0-0, policy_version 1190513 (0.00091) [2022-07-11 12:25:12,265][25689] Fps is (10 sec: 5571.6, 60 sec: 5543.7, 300 sec: 5532.7). Total num frames: 1219093504. Throughput: 0: 5686.6. Samples: 1219096490. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:12,265][25689] Avg episode reward: [(0, '0.413')] [2022-07-11 12:25:12,659][26022] Updated weights on worker 0-0, policy_version 1190523 (0.00086) [2022-07-11 12:25:14,471][26022] Updated weights on worker 0-0, policy_version 1190533 (0.00092) [2022-07-11 12:25:16,225][26022] Updated weights on worker 0-0, policy_version 1190543 (0.00085) [2022-07-11 12:25:17,360][25689] Fps is (10 sec: 5657.0, 60 sec: 5538.7, 300 sec: 5532.5). Total num frames: 1219121152. Throughput: 0: 5709.1. Samples: 1219130124. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:17,361][25689] Avg episode reward: [(0, '1.036')] [2022-07-11 12:25:18,261][26022] Updated weights on worker 0-0, policy_version 1190553 (0.00082) [2022-07-11 12:25:19,978][26022] Updated weights on worker 0-0, policy_version 1190563 (0.00087) [2022-07-11 12:25:21,828][26022] Updated weights on worker 0-0, policy_version 1190573 (0.00091) [2022-07-11 12:25:22,408][25689] Fps is (10 sec: 5451.7, 60 sec: 5518.8, 300 sec: 5524.9). Total num frames: 1219148800. Throughput: 0: 4908.3. Samples: 1219146712. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:22,408][25689] Avg episode reward: [(0, '1.262')] [2022-07-11 12:25:23,480][26022] Updated weights on worker 0-0, policy_version 1190583 (0.00081) [2022-07-11 12:25:25,564][26022] Updated weights on worker 0-0, policy_version 1190593 (0.00086) [2022-07-11 12:25:27,400][26022] Updated weights on worker 0-0, policy_version 1190603 (0.00085) [2022-07-11 12:25:27,443][25689] Fps is (10 sec: 5585.5, 60 sec: 5550.6, 300 sec: 5531.6). Total num frames: 1219177472. Throughput: 0: 5807.9. Samples: 1219180324. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:27,444][25689] Avg episode reward: [(0, '1.236')] [2022-07-11 12:25:29,181][26022] Updated weights on worker 0-0, policy_version 1190613 (0.00074) [2022-07-11 12:25:31,226][26022] Updated weights on worker 0-0, policy_version 1190623 (0.00082) [2022-07-11 12:25:32,448][25689] Fps is (10 sec: 5609.6, 60 sec: 5552.0, 300 sec: 5532.9). Total num frames: 1219205120. Throughput: 0: 5799.8. Samples: 1219213546. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:32,448][25689] Avg episode reward: [(0, '0.829')] [2022-07-11 12:25:32,835][26022] Updated weights on worker 0-0, policy_version 1190633 (0.00095) [2022-07-11 12:25:34,714][26022] Updated weights on worker 0-0, policy_version 1190643 (0.00091) [2022-07-11 12:25:36,543][26022] Updated weights on worker 0-0, policy_version 1190653 (0.00087) [2022-07-11 12:25:37,539][25689] Fps is (10 sec: 5376.0, 60 sec: 5530.3, 300 sec: 5524.8). Total num frames: 1219231744. Throughput: 0: 4961.8. Samples: 1219230246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:37,539][25689] Avg episode reward: [(0, '0.496')] [2022-07-11 12:25:38,478][26022] Updated weights on worker 0-0, policy_version 1190663 (0.00093) [2022-07-11 12:25:40,529][26022] Updated weights on worker 0-0, policy_version 1190673 (0.00087) [2022-07-11 12:25:41,939][26022] Updated weights on worker 0-0, policy_version 1190683 (0.00083) [2022-07-11 12:25:42,575][25689] Fps is (10 sec: 5561.0, 60 sec: 5528.5, 300 sec: 5534.7). Total num frames: 1219261440. Throughput: 0: 5793.1. Samples: 1219263546. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:42,576][25689] Avg episode reward: [(0, '0.595')] [2022-07-11 12:25:43,976][26022] Updated weights on worker 0-0, policy_version 1190693 (0.00089) [2022-07-11 12:25:45,705][26022] Updated weights on worker 0-0, policy_version 1190703 (0.00085) [2022-07-11 12:25:47,593][25689] Fps is (10 sec: 5703.3, 60 sec: 5545.5, 300 sec: 5534.7). Total num frames: 1219289088. Throughput: 0: 5785.7. Samples: 1219296904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:47,594][25689] Avg episode reward: [(0, '-0.333')] [2022-07-11 12:25:47,848][26022] Updated weights on worker 0-0, policy_version 1190713 (0.00092) [2022-07-11 12:25:49,601][26022] Updated weights on worker 0-0, policy_version 1190723 (0.00088) [2022-07-11 12:25:51,503][26022] Updated weights on worker 0-0, policy_version 1190733 (0.00084) [2022-07-11 12:25:52,619][25689] Fps is (10 sec: 5403.6, 60 sec: 5495.0, 300 sec: 5524.5). Total num frames: 1219315712. Throughput: 0: 4946.3. Samples: 1219313320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:52,619][25689] Avg episode reward: [(0, '-0.606')] [2022-07-11 12:25:53,220][26022] Updated weights on worker 0-0, policy_version 1190743 (0.00104) [2022-07-11 12:25:55,353][26022] Updated weights on worker 0-0, policy_version 1190753 (0.00088) [2022-07-11 12:25:57,105][26022] Updated weights on worker 0-0, policy_version 1190763 (0.00090) [2022-07-11 12:25:57,747][25689] Fps is (10 sec: 5446.0, 60 sec: 5492.1, 300 sec: 5526.0). Total num frames: 1219344384. Throughput: 0: 5723.1. Samples: 1219345900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:25:57,747][25689] Avg episode reward: [(0, '-0.660')] [2022-07-11 12:25:59,000][26022] Updated weights on worker 0-0, policy_version 1190773 (0.00085) [2022-07-11 12:26:00,772][26022] Updated weights on worker 0-0, policy_version 1190783 (0.00087) [2022-07-11 12:26:02,847][25689] Fps is (10 sec: 5305.9, 60 sec: 5483.5, 300 sec: 5524.8). Total num frames: 1219369984. Throughput: 0: 5609.9. Samples: 1219377272. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:02,849][25689] Avg episode reward: [(0, '-0.008')] [2022-07-11 12:26:03,120][26022] Updated weights on worker 0-0, policy_version 1190793 (0.00093) [2022-07-11 12:26:04,815][26022] Updated weights on worker 0-0, policy_version 1190803 (0.00087) [2022-07-11 12:26:06,408][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:26:06,421][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001190811_1219390464.pth [2022-07-11 12:26:06,422][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001188866_1217398784.pth [2022-07-11 12:26:06,713][26022] Updated weights on worker 0-0, policy_version 1190813 (0.00086) [2022-07-11 12:26:07,874][25689] Fps is (10 sec: 5358.9, 60 sec: 5520.9, 300 sec: 5524.8). Total num frames: 1219398656. Throughput: 0: 4770.0. Samples: 1219393644. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:07,875][25689] Avg episode reward: [(0, '0.200')] [2022-07-11 12:26:08,390][26022] Updated weights on worker 0-0, policy_version 1190823 (0.00096) [2022-07-11 12:26:10,565][26022] Updated weights on worker 0-0, policy_version 1190833 (0.00088) [2022-07-11 12:26:12,023][26022] Updated weights on worker 0-0, policy_version 1190843 (0.00097) [2022-07-11 12:26:12,911][25689] Fps is (10 sec: 5596.4, 60 sec: 5487.6, 300 sec: 5518.8). Total num frames: 1219426304. Throughput: 0: 5605.5. Samples: 1219427068. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:12,911][25689] Avg episode reward: [(0, '1.311')] [2022-07-11 12:26:14,093][26022] Updated weights on worker 0-0, policy_version 1190853 (0.00095) [2022-07-11 12:26:15,768][26022] Updated weights on worker 0-0, policy_version 1190863 (0.00092) [2022-07-11 12:26:17,743][26022] Updated weights on worker 0-0, policy_version 1190873 (0.00090) [2022-07-11 12:26:17,991][25689] Fps is (10 sec: 5567.1, 60 sec: 5505.9, 300 sec: 5528.0). Total num frames: 1219454976. Throughput: 0: 5662.3. Samples: 1219460528. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:17,991][25689] Avg episode reward: [(0, '1.383')] [2022-07-11 12:26:19,563][26022] Updated weights on worker 0-0, policy_version 1190883 (0.00090) [2022-07-11 12:26:21,506][26022] Updated weights on worker 0-0, policy_version 1190893 (0.00085) [2022-07-11 12:26:23,055][25689] Fps is (10 sec: 5551.9, 60 sec: 5504.3, 300 sec: 5523.5). Total num frames: 1219482624. Throughput: 0: 4944.7. Samples: 1219477194. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:23,056][25689] Avg episode reward: [(0, '1.650')] [2022-07-11 12:26:23,266][26022] Updated weights on worker 0-0, policy_version 1190903 (0.00083) [2022-07-11 12:26:25,176][26022] Updated weights on worker 0-0, policy_version 1190913 (0.00089) [2022-07-11 12:26:27,010][26022] Updated weights on worker 0-0, policy_version 1190923 (0.00092) [2022-07-11 12:26:28,068][25689] Fps is (10 sec: 5487.5, 60 sec: 5489.6, 300 sec: 5523.4). Total num frames: 1219510272. Throughput: 0: 5797.5. Samples: 1219510714. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:28,068][25689] Avg episode reward: [(0, '1.376')] [2022-07-11 12:26:29,023][26022] Updated weights on worker 0-0, policy_version 1190933 (0.00087) [2022-07-11 12:26:30,609][26022] Updated weights on worker 0-0, policy_version 1190943 (0.00084) [2022-07-11 12:26:32,603][26022] Updated weights on worker 0-0, policy_version 1190953 (0.00096) [2022-07-11 12:26:33,100][25689] Fps is (10 sec: 5607.1, 60 sec: 5503.9, 300 sec: 5520.8). Total num frames: 1219538944. Throughput: 0: 5809.4. Samples: 1219544352. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:33,100][25689] Avg episode reward: [(0, '1.567')] [2022-07-11 12:26:34,331][26022] Updated weights on worker 0-0, policy_version 1190963 (0.00090) [2022-07-11 12:26:36,207][26022] Updated weights on worker 0-0, policy_version 1190973 (0.00092) [2022-07-11 12:26:38,203][25689] Fps is (10 sec: 5455.9, 60 sec: 5502.8, 300 sec: 5516.2). Total num frames: 1219565568. Throughput: 0: 4972.5. Samples: 1219561032. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:38,203][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 12:26:38,316][26022] Updated weights on worker 0-0, policy_version 1190983 (0.01364) [2022-07-11 12:26:39,801][26022] Updated weights on worker 0-0, policy_version 1190993 (0.00086) [2022-07-11 12:26:41,923][26022] Updated weights on worker 0-0, policy_version 1191003 (0.00088) [2022-07-11 12:26:43,241][25689] Fps is (10 sec: 5553.4, 60 sec: 5502.7, 300 sec: 5519.5). Total num frames: 1219595264. Throughput: 0: 5804.1. Samples: 1219594354. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:43,242][25689] Avg episode reward: [(0, '1.018')] [2022-07-11 12:26:43,476][26022] Updated weights on worker 0-0, policy_version 1191013 (0.00086) [2022-07-11 12:26:45,402][26022] Updated weights on worker 0-0, policy_version 1191023 (0.00085) [2022-07-11 12:26:47,338][26022] Updated weights on worker 0-0, policy_version 1191033 (0.00086) [2022-07-11 12:26:48,302][25689] Fps is (10 sec: 5678.1, 60 sec: 5498.8, 300 sec: 5520.1). Total num frames: 1219622912. Throughput: 0: 5784.3. Samples: 1219627756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:48,302][25689] Avg episode reward: [(0, '0.918')] [2022-07-11 12:26:48,947][26022] Updated weights on worker 0-0, policy_version 1191043 (0.00082) [2022-07-11 12:26:50,845][26022] Updated weights on worker 0-0, policy_version 1191053 (0.00085) [2022-07-11 12:26:52,715][26022] Updated weights on worker 0-0, policy_version 1191063 (0.00086) [2022-07-11 12:26:53,317][25689] Fps is (10 sec: 5487.9, 60 sec: 5516.6, 300 sec: 5517.9). Total num frames: 1219650560. Throughput: 0: 4958.3. Samples: 1219644596. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:53,318][25689] Avg episode reward: [(0, '0.994')] [2022-07-11 12:26:54,608][26022] Updated weights on worker 0-0, policy_version 1191073 (0.00090) [2022-07-11 12:26:56,515][26022] Updated weights on worker 0-0, policy_version 1191083 (0.00092) [2022-07-11 12:26:58,370][26022] Updated weights on worker 0-0, policy_version 1191093 (0.00085) [2022-07-11 12:26:58,445][25689] Fps is (10 sec: 5653.7, 60 sec: 5533.5, 300 sec: 5524.2). Total num frames: 1219680256. Throughput: 0: 5770.0. Samples: 1219677828. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:26:58,445][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 12:27:00,125][26022] Updated weights on worker 0-0, policy_version 1191103 (0.00086) [2022-07-11 12:27:02,424][26022] Updated weights on worker 0-0, policy_version 1191113 (0.00089) [2022-07-11 12:27:03,475][25689] Fps is (10 sec: 5443.9, 60 sec: 5540.0, 300 sec: 5527.2). Total num frames: 1219705856. Throughput: 0: 5698.7. Samples: 1219709658. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:03,476][25689] Avg episode reward: [(0, '1.728')] [2022-07-11 12:27:04,239][26022] Updated weights on worker 0-0, policy_version 1191123 (0.00093) [2022-07-11 12:27:05,994][26022] Updated weights on worker 0-0, policy_version 1191133 (0.00090) [2022-07-11 12:27:07,935][26022] Updated weights on worker 0-0, policy_version 1191143 (0.00087) [2022-07-11 12:27:08,518][25689] Fps is (10 sec: 5286.2, 60 sec: 5521.6, 300 sec: 5520.0). Total num frames: 1219733504. Throughput: 0: 5705.1. Samples: 1219743088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:08,519][25689] Avg episode reward: [(0, '1.960')] [2022-07-11 12:27:09,627][26022] Updated weights on worker 0-0, policy_version 1191153 (0.00084) [2022-07-11 12:27:11,655][26022] Updated weights on worker 0-0, policy_version 1191163 (0.00085) [2022-07-11 12:27:13,264][26022] Updated weights on worker 0-0, policy_version 1191173 (0.00089) [2022-07-11 12:27:13,591][25689] Fps is (10 sec: 5567.3, 60 sec: 5535.1, 300 sec: 5527.2). Total num frames: 1219762176. Throughput: 0: 5668.8. Samples: 1219759524. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:13,593][25689] Avg episode reward: [(0, '0.806')] [2022-07-11 12:27:15,305][26022] Updated weights on worker 0-0, policy_version 1191183 (0.00086) [2022-07-11 12:27:17,043][26022] Updated weights on worker 0-0, policy_version 1191193 (0.00106) [2022-07-11 12:27:18,669][25689] Fps is (10 sec: 5547.9, 60 sec: 5518.4, 300 sec: 5526.2). Total num frames: 1219789824. Throughput: 0: 5677.6. Samples: 1219792656. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:18,672][25689] Avg episode reward: [(0, '0.789')] [2022-07-11 12:27:18,899][26022] Updated weights on worker 0-0, policy_version 1191203 (0.00082) [2022-07-11 12:27:20,630][26022] Updated weights on worker 0-0, policy_version 1191213 (0.00096) [2022-07-11 12:27:22,956][26022] Updated weights on worker 0-0, policy_version 1191223 (0.00090) [2022-07-11 12:27:23,688][25689] Fps is (10 sec: 5476.4, 60 sec: 5522.6, 300 sec: 5522.7). Total num frames: 1219817472. Throughput: 0: 5748.3. Samples: 1219825850. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:23,688][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 12:27:24,274][26022] Updated weights on worker 0-0, policy_version 1191233 (0.00089) [2022-07-11 12:27:26,666][26022] Updated weights on worker 0-0, policy_version 1191243 (0.00087) [2022-07-11 12:27:28,065][26022] Updated weights on worker 0-0, policy_version 1191253 (0.00085) [2022-07-11 12:27:28,712][25689] Fps is (10 sec: 5506.1, 60 sec: 5521.5, 300 sec: 5525.9). Total num frames: 1219845120. Throughput: 0: 4901.5. Samples: 1219842070. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:28,713][25689] Avg episode reward: [(0, '0.405')] [2022-07-11 12:27:30,148][26022] Updated weights on worker 0-0, policy_version 1191263 (0.00079) [2022-07-11 12:27:32,005][26022] Updated weights on worker 0-0, policy_version 1191273 (0.00085) [2022-07-11 12:27:33,717][26022] Updated weights on worker 0-0, policy_version 1191283 (0.00095) [2022-07-11 12:27:33,809][25689] Fps is (10 sec: 5564.4, 60 sec: 5515.6, 300 sec: 5526.0). Total num frames: 1219873792. Throughput: 0: 5730.6. Samples: 1219875388. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:33,810][25689] Avg episode reward: [(0, '0.063')] [2022-07-11 12:27:35,753][26022] Updated weights on worker 0-0, policy_version 1191293 (0.00094) [2022-07-11 12:27:37,519][26022] Updated weights on worker 0-0, policy_version 1191303 (0.00082) [2022-07-11 12:27:38,913][25689] Fps is (10 sec: 5420.4, 60 sec: 5515.5, 300 sec: 5510.8). Total num frames: 1219900416. Throughput: 0: 5738.9. Samples: 1219908834. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:38,914][25689] Avg episode reward: [(0, '0.103')] [2022-07-11 12:27:39,388][26022] Updated weights on worker 0-0, policy_version 1191313 (0.00101) [2022-07-11 12:27:41,276][26022] Updated weights on worker 0-0, policy_version 1191323 (0.00087) [2022-07-11 12:27:42,897][26022] Updated weights on worker 0-0, policy_version 1191333 (0.00083) [2022-07-11 12:27:43,964][25689] Fps is (10 sec: 5546.2, 60 sec: 5514.4, 300 sec: 5523.8). Total num frames: 1219930112. Throughput: 0: 4921.9. Samples: 1219925648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:43,965][25689] Avg episode reward: [(0, '1.136')] [2022-07-11 12:27:45,060][26022] Updated weights on worker 0-0, policy_version 1191343 (0.00082) [2022-07-11 12:27:46,649][26022] Updated weights on worker 0-0, policy_version 1191353 (0.00105) [2022-07-11 12:27:48,519][26022] Updated weights on worker 0-0, policy_version 1191363 (0.00085) [2022-07-11 12:27:49,051][25689] Fps is (10 sec: 5757.3, 60 sec: 5528.8, 300 sec: 5522.4). Total num frames: 1219958784. Throughput: 0: 5765.2. Samples: 1219959332. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:49,052][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 12:27:50,416][26022] Updated weights on worker 0-0, policy_version 1191373 (0.00081) [2022-07-11 12:27:52,088][26022] Updated weights on worker 0-0, policy_version 1191383 (0.00088) [2022-07-11 12:27:53,947][26022] Updated weights on worker 0-0, policy_version 1191393 (0.00107) [2022-07-11 12:27:54,123][25689] Fps is (10 sec: 5644.7, 60 sec: 5540.5, 300 sec: 5521.9). Total num frames: 1219987456. Throughput: 0: 5793.5. Samples: 1219993074. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:54,123][25689] Avg episode reward: [(0, '1.351')] [2022-07-11 12:27:55,899][26022] Updated weights on worker 0-0, policy_version 1191403 (0.00088) [2022-07-11 12:27:57,670][26022] Updated weights on worker 0-0, policy_version 1191413 (0.00086) [2022-07-11 12:27:59,172][25689] Fps is (10 sec: 5463.6, 60 sec: 5497.1, 300 sec: 5517.8). Total num frames: 1220014080. Throughput: 0: 4976.0. Samples: 1220009642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:27:59,173][25689] Avg episode reward: [(0, '-0.402')] [2022-07-11 12:27:59,658][26022] Updated weights on worker 0-0, policy_version 1191423 (0.00081) [2022-07-11 12:28:01,398][26022] Updated weights on worker 0-0, policy_version 1191433 (0.00099) [2022-07-11 12:28:03,748][26022] Updated weights on worker 0-0, policy_version 1191443 (0.00087) [2022-07-11 12:28:04,200][25689] Fps is (10 sec: 5284.4, 60 sec: 5514.2, 300 sec: 5520.8). Total num frames: 1220040704. Throughput: 0: 5710.0. Samples: 1220041192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:04,200][25689] Avg episode reward: [(0, '-0.481')] [2022-07-11 12:28:05,392][26022] Updated weights on worker 0-0, policy_version 1191453 (0.00090) [2022-07-11 12:28:06,591][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:28:06,602][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001191459_1220054016.pth [2022-07-11 12:28:06,602][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001189515_1218063360.pth [2022-07-11 12:28:07,266][26022] Updated weights on worker 0-0, policy_version 1191463 (0.00092) [2022-07-11 12:28:09,019][26022] Updated weights on worker 0-0, policy_version 1191473 (0.00086) [2022-07-11 12:28:09,222][25689] Fps is (10 sec: 5400.3, 60 sec: 5516.1, 300 sec: 5517.1). Total num frames: 1220068352. Throughput: 0: 5727.2. Samples: 1220074852. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:09,222][25689] Avg episode reward: [(0, '-0.501')] [2022-07-11 12:28:10,786][26022] Updated weights on worker 0-0, policy_version 1191483 (0.00081) [2022-07-11 12:28:12,666][26022] Updated weights on worker 0-0, policy_version 1191493 (0.00097) [2022-07-11 12:28:14,237][25689] Fps is (10 sec: 5611.2, 60 sec: 5521.4, 300 sec: 5524.5). Total num frames: 1220097024. Throughput: 0: 4908.6. Samples: 1220091804. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:14,238][25689] Avg episode reward: [(0, '-0.715')] [2022-07-11 12:28:14,487][26022] Updated weights on worker 0-0, policy_version 1191503 (0.00081) [2022-07-11 12:28:16,361][26022] Updated weights on worker 0-0, policy_version 1191513 (0.00093) [2022-07-11 12:28:18,334][26022] Updated weights on worker 0-0, policy_version 1191523 (0.00085) [2022-07-11 12:28:19,287][25689] Fps is (10 sec: 5697.5, 60 sec: 5540.8, 300 sec: 5523.8). Total num frames: 1220125696. Throughput: 0: 5741.6. Samples: 1220125132. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:19,287][25689] Avg episode reward: [(0, '-1.002')] [2022-07-11 12:28:20,115][26022] Updated weights on worker 0-0, policy_version 1191533 (0.00092) [2022-07-11 12:28:22,045][26022] Updated weights on worker 0-0, policy_version 1191543 (0.00091) [2022-07-11 12:28:23,717][26022] Updated weights on worker 0-0, policy_version 1191553 (0.00094) [2022-07-11 12:28:24,376][25689] Fps is (10 sec: 5554.6, 60 sec: 5534.4, 300 sec: 5519.1). Total num frames: 1220153344. Throughput: 0: 5826.5. Samples: 1220158750. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:24,376][25689] Avg episode reward: [(0, '-0.835')] [2022-07-11 12:28:25,568][26022] Updated weights on worker 0-0, policy_version 1191563 (0.00090) [2022-07-11 12:28:27,507][26022] Updated weights on worker 0-0, policy_version 1191573 (0.00083) [2022-07-11 12:28:29,403][25689] Fps is (10 sec: 5364.9, 60 sec: 5517.3, 300 sec: 5522.2). Total num frames: 1220179968. Throughput: 0: 4964.1. Samples: 1220175032. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:29,403][25689] Avg episode reward: [(0, '0.951')] [2022-07-11 12:28:29,530][26022] Updated weights on worker 0-0, policy_version 1191583 (0.00095) [2022-07-11 12:28:31,050][26022] Updated weights on worker 0-0, policy_version 1191593 (0.00088) [2022-07-11 12:28:32,970][26022] Updated weights on worker 0-0, policy_version 1191603 (0.00082) [2022-07-11 12:28:34,409][25689] Fps is (10 sec: 5715.4, 60 sec: 5559.4, 300 sec: 5524.7). Total num frames: 1220210688. Throughput: 0: 5805.6. Samples: 1220208918. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:34,411][25689] Avg episode reward: [(0, '1.139')] [2022-07-11 12:28:34,612][26022] Updated weights on worker 0-0, policy_version 1191613 (0.00078) [2022-07-11 12:28:36,708][26022] Updated weights on worker 0-0, policy_version 1191623 (0.00091) [2022-07-11 12:28:38,395][26022] Updated weights on worker 0-0, policy_version 1191633 (0.00093) [2022-07-11 12:28:39,546][25689] Fps is (10 sec: 5754.6, 60 sec: 5573.3, 300 sec: 5523.6). Total num frames: 1220238336. Throughput: 0: 5795.7. Samples: 1220242546. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 12:28:39,547][25689] Avg episode reward: [(0, '1.803')] [2022-07-11 12:28:40,224][26022] Updated weights on worker 0-0, policy_version 1191643 (0.00089) [2022-07-11 12:28:42,046][26022] Updated weights on worker 0-0, policy_version 1191653 (0.00085) [2022-07-11 12:28:43,897][26022] Updated weights on worker 0-0, policy_version 1191663 (0.00085) [2022-07-11 12:28:44,562][25689] Fps is (10 sec: 5547.3, 60 sec: 5559.6, 300 sec: 5527.4). Total num frames: 1220267008. Throughput: 0: 4992.9. Samples: 1220259536. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:28:44,563][25689] Avg episode reward: [(0, '1.667')] [2022-07-11 12:28:45,726][26022] Updated weights on worker 0-0, policy_version 1191673 (0.00085) [2022-07-11 12:28:47,708][26022] Updated weights on worker 0-0, policy_version 1191683 (0.00086) [2022-07-11 12:28:49,241][26022] Updated weights on worker 0-0, policy_version 1191693 (0.00082) [2022-07-11 12:28:49,603][25689] Fps is (10 sec: 5600.1, 60 sec: 5546.9, 300 sec: 5523.5). Total num frames: 1220294656. Throughput: 0: 5862.2. Samples: 1220293448. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:28:49,603][25689] Avg episode reward: [(0, '2.411')] [2022-07-11 12:28:51,419][26022] Updated weights on worker 0-0, policy_version 1191703 (0.00085) [2022-07-11 12:28:52,975][26022] Updated weights on worker 0-0, policy_version 1191713 (0.00087) [2022-07-11 12:28:54,655][25689] Fps is (10 sec: 5478.5, 60 sec: 5531.8, 300 sec: 5528.5). Total num frames: 1220322304. Throughput: 0: 5821.2. Samples: 1220326774. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:28:54,655][25689] Avg episode reward: [(0, '2.059')] [2022-07-11 12:28:54,957][26022] Updated weights on worker 0-0, policy_version 1191723 (0.00092) [2022-07-11 12:28:56,738][26022] Updated weights on worker 0-0, policy_version 1191733 (0.00086) [2022-07-11 12:28:58,591][26022] Updated weights on worker 0-0, policy_version 1191743 (0.00085) [2022-07-11 12:28:59,751][25689] Fps is (10 sec: 5549.8, 60 sec: 5561.3, 300 sec: 5533.8). Total num frames: 1220350976. Throughput: 0: 5815.4. Samples: 1220360046. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:28:59,751][25689] Avg episode reward: [(0, '2.039')] [2022-07-11 12:29:00,460][26022] Updated weights on worker 0-0, policy_version 1191753 (0.00092) [2022-07-11 12:29:02,623][26022] Updated weights on worker 0-0, policy_version 1191763 (0.00088) [2022-07-11 12:29:04,315][26022] Updated weights on worker 0-0, policy_version 1191773 (0.00089) [2022-07-11 12:29:04,771][25689] Fps is (10 sec: 5466.2, 60 sec: 5562.0, 300 sec: 5530.3). Total num frames: 1220377600. Throughput: 0: 5712.7. Samples: 1220374986. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:04,771][25689] Avg episode reward: [(0, '1.661')] [2022-07-11 12:29:06,195][26022] Updated weights on worker 0-0, policy_version 1191783 (0.00088) [2022-07-11 12:29:07,889][26022] Updated weights on worker 0-0, policy_version 1191793 (0.00091) [2022-07-11 12:29:09,797][25689] Fps is (10 sec: 5402.2, 60 sec: 5561.6, 300 sec: 5526.7). Total num frames: 1220405248. Throughput: 0: 5699.9. Samples: 1220408554. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:09,798][25689] Avg episode reward: [(0, '1.484')] [2022-07-11 12:29:09,974][26022] Updated weights on worker 0-0, policy_version 1191803 (0.00087) [2022-07-11 12:29:11,590][26022] Updated weights on worker 0-0, policy_version 1191813 (0.00089) [2022-07-11 12:29:13,771][26022] Updated weights on worker 0-0, policy_version 1191823 (0.00092) [2022-07-11 12:29:14,816][25689] Fps is (10 sec: 5504.7, 60 sec: 5544.3, 300 sec: 5527.1). Total num frames: 1220432896. Throughput: 0: 5725.1. Samples: 1220442200. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:14,817][25689] Avg episode reward: [(0, '1.505')] [2022-07-11 12:29:15,284][26022] Updated weights on worker 0-0, policy_version 1191833 (0.00083) [2022-07-11 12:29:17,161][26022] Updated weights on worker 0-0, policy_version 1191843 (0.00087) [2022-07-11 12:29:19,086][26022] Updated weights on worker 0-0, policy_version 1191853 (0.00095) [2022-07-11 12:29:19,862][25689] Fps is (10 sec: 5595.6, 60 sec: 5544.7, 300 sec: 5526.5). Total num frames: 1220461568. Throughput: 0: 4917.9. Samples: 1220458952. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:19,863][25689] Avg episode reward: [(0, '1.223')] [2022-07-11 12:29:20,875][26022] Updated weights on worker 0-0, policy_version 1191863 (0.00090) [2022-07-11 12:29:22,644][26022] Updated weights on worker 0-0, policy_version 1191873 (0.00091) [2022-07-11 12:29:24,611][26022] Updated weights on worker 0-0, policy_version 1191883 (0.00085) [2022-07-11 12:29:24,892][25689] Fps is (10 sec: 5589.3, 60 sec: 5550.1, 300 sec: 5529.6). Total num frames: 1220489216. Throughput: 0: 5837.9. Samples: 1220492454. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:24,894][25689] Avg episode reward: [(0, '0.462')] [2022-07-11 12:29:26,429][26022] Updated weights on worker 0-0, policy_version 1191893 (0.00091) [2022-07-11 12:29:28,236][26022] Updated weights on worker 0-0, policy_version 1191903 (0.00094) [2022-07-11 12:29:29,896][25689] Fps is (10 sec: 5612.7, 60 sec: 5586.1, 300 sec: 5533.4). Total num frames: 1220517888. Throughput: 0: 5837.0. Samples: 1220525874. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:29,898][25689] Avg episode reward: [(0, '0.335')] [2022-07-11 12:29:29,960][26022] Updated weights on worker 0-0, policy_version 1191913 (0.00085) [2022-07-11 12:29:31,896][26022] Updated weights on worker 0-0, policy_version 1191923 (0.00092) [2022-07-11 12:29:33,659][26022] Updated weights on worker 0-0, policy_version 1191933 (0.00087) [2022-07-11 12:29:34,900][25689] Fps is (10 sec: 5627.5, 60 sec: 5535.5, 300 sec: 5534.0). Total num frames: 1220545536. Throughput: 0: 5004.9. Samples: 1220542720. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:34,902][25689] Avg episode reward: [(0, '-0.046')] [2022-07-11 12:29:35,543][26022] Updated weights on worker 0-0, policy_version 1191943 (0.00081) [2022-07-11 12:29:37,519][26022] Updated weights on worker 0-0, policy_version 1191953 (0.00176) [2022-07-11 12:29:39,137][26022] Updated weights on worker 0-0, policy_version 1191963 (0.00090) [2022-07-11 12:29:39,984][25689] Fps is (10 sec: 5380.1, 60 sec: 5523.4, 300 sec: 5522.5). Total num frames: 1220572160. Throughput: 0: 5829.4. Samples: 1220576250. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:39,985][25689] Avg episode reward: [(0, '-0.218')] [2022-07-11 12:29:41,096][26022] Updated weights on worker 0-0, policy_version 1191973 (0.00085) [2022-07-11 12:29:42,892][26022] Updated weights on worker 0-0, policy_version 1191983 (0.00089) [2022-07-11 12:29:44,724][26022] Updated weights on worker 0-0, policy_version 1191993 (0.00093) [2022-07-11 12:29:44,995][25689] Fps is (10 sec: 5680.5, 60 sec: 5557.7, 300 sec: 5536.4). Total num frames: 1220602880. Throughput: 0: 5838.3. Samples: 1220609820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:44,997][25689] Avg episode reward: [(0, '-0.233')] [2022-07-11 12:29:46,656][26022] Updated weights on worker 0-0, policy_version 1192003 (0.00096) [2022-07-11 12:29:48,435][26022] Updated weights on worker 0-0, policy_version 1192013 (0.00083) [2022-07-11 12:29:50,021][25689] Fps is (10 sec: 5611.3, 60 sec: 5525.2, 300 sec: 5522.7). Total num frames: 1220628480. Throughput: 0: 5004.7. Samples: 1220626590. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:50,021][25689] Avg episode reward: [(0, '0.016')] [2022-07-11 12:29:50,383][26022] Updated weights on worker 0-0, policy_version 1192023 (0.00088) [2022-07-11 12:29:52,080][26022] Updated weights on worker 0-0, policy_version 1192033 (0.00086) [2022-07-11 12:29:53,810][26022] Updated weights on worker 0-0, policy_version 1192043 (0.00093) [2022-07-11 12:29:55,097][25689] Fps is (10 sec: 5372.7, 60 sec: 5540.0, 300 sec: 5523.0). Total num frames: 1220657152. Throughput: 0: 5811.8. Samples: 1220660096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:29:55,098][25689] Avg episode reward: [(0, '-0.035')] [2022-07-11 12:29:55,826][26022] Updated weights on worker 0-0, policy_version 1192053 (0.00088) [2022-07-11 12:29:57,899][26022] Updated weights on worker 0-0, policy_version 1192063 (0.00085) [2022-07-11 12:29:59,502][26022] Updated weights on worker 0-0, policy_version 1192073 (0.00083) [2022-07-11 12:30:00,184][25689] Fps is (10 sec: 5743.3, 60 sec: 5557.8, 300 sec: 5535.3). Total num frames: 1220686848. Throughput: 0: 5796.6. Samples: 1220693340. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:00,184][25689] Avg episode reward: [(0, '-0.031')] [2022-07-11 12:30:01,393][26022] Updated weights on worker 0-0, policy_version 1192083 (0.00092) [2022-07-11 12:30:03,464][26022] Updated weights on worker 0-0, policy_version 1192093 (0.00088) [2022-07-11 12:30:05,185][25689] Fps is (10 sec: 5379.6, 60 sec: 5525.6, 300 sec: 5529.6). Total num frames: 1220711424. Throughput: 0: 4865.1. Samples: 1220708046. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:05,186][25689] Avg episode reward: [(0, '0.031')] [2022-07-11 12:30:05,634][26022] Updated weights on worker 0-0, policy_version 1192103 (0.00089) [2022-07-11 12:30:06,796][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:30:06,813][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001192111_1220721664.pth [2022-07-11 12:30:06,814][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001190163_1218726912.pth [2022-07-11 12:30:07,377][26022] Updated weights on worker 0-0, policy_version 1192113 (0.00087) [2022-07-11 12:30:09,133][26022] Updated weights on worker 0-0, policy_version 1192123 (0.00086) [2022-07-11 12:30:10,195][25689] Fps is (10 sec: 5318.9, 60 sec: 5544.1, 300 sec: 5526.8). Total num frames: 1220740096. Throughput: 0: 5696.6. Samples: 1220741512. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:10,196][25689] Avg episode reward: [(0, '0.128')] [2022-07-11 12:30:11,091][26022] Updated weights on worker 0-0, policy_version 1192133 (0.00094) [2022-07-11 12:30:12,697][26022] Updated weights on worker 0-0, policy_version 1192143 (0.00090) [2022-07-11 12:30:14,784][26022] Updated weights on worker 0-0, policy_version 1192153 (0.00091) [2022-07-11 12:30:15,223][25689] Fps is (10 sec: 5610.9, 60 sec: 5543.2, 300 sec: 5528.0). Total num frames: 1220767744. Throughput: 0: 5706.0. Samples: 1220774936. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:15,223][25689] Avg episode reward: [(0, '-0.871')] [2022-07-11 12:30:16,225][26022] Updated weights on worker 0-0, policy_version 1192163 (0.00087) [2022-07-11 12:30:18,343][26022] Updated weights on worker 0-0, policy_version 1192173 (0.00093) [2022-07-11 12:30:20,096][26022] Updated weights on worker 0-0, policy_version 1192183 (0.00098) [2022-07-11 12:30:20,351][25689] Fps is (10 sec: 5444.4, 60 sec: 5518.7, 300 sec: 5526.5). Total num frames: 1220795392. Throughput: 0: 4882.6. Samples: 1220791810. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:20,352][25689] Avg episode reward: [(0, '-0.119')] [2022-07-11 12:30:22,083][26022] Updated weights on worker 0-0, policy_version 1192193 (0.00090) [2022-07-11 12:30:23,840][26022] Updated weights on worker 0-0, policy_version 1192203 (0.00093) [2022-07-11 12:30:25,366][25689] Fps is (10 sec: 5552.6, 60 sec: 5537.1, 300 sec: 5526.9). Total num frames: 1220824064. Throughput: 0: 5801.7. Samples: 1220825128. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:25,366][25689] Avg episode reward: [(0, '0.464')] [2022-07-11 12:30:25,663][26022] Updated weights on worker 0-0, policy_version 1192213 (0.00093) [2022-07-11 12:30:27,467][26022] Updated weights on worker 0-0, policy_version 1192223 (0.00085) [2022-07-11 12:30:29,488][26022] Updated weights on worker 0-0, policy_version 1192233 (0.00174) [2022-07-11 12:30:30,387][25689] Fps is (10 sec: 5714.2, 60 sec: 5535.6, 300 sec: 5530.0). Total num frames: 1220852736. Throughput: 0: 5796.5. Samples: 1220858554. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:30,387][25689] Avg episode reward: [(0, '1.121')] [2022-07-11 12:30:31,175][26022] Updated weights on worker 0-0, policy_version 1192243 (0.00088) [2022-07-11 12:30:33,050][26022] Updated weights on worker 0-0, policy_version 1192253 (0.00088) [2022-07-11 12:30:34,836][26022] Updated weights on worker 0-0, policy_version 1192263 (0.00088) [2022-07-11 12:30:35,389][25689] Fps is (10 sec: 5516.6, 60 sec: 5518.8, 300 sec: 5531.7). Total num frames: 1220879360. Throughput: 0: 4976.7. Samples: 1220875300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:35,390][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 12:30:36,823][26022] Updated weights on worker 0-0, policy_version 1192273 (0.00095) [2022-07-11 12:30:38,550][26022] Updated weights on worker 0-0, policy_version 1192283 (0.00091) [2022-07-11 12:30:40,365][26022] Updated weights on worker 0-0, policy_version 1192293 (0.00079) [2022-07-11 12:30:40,442][25689] Fps is (10 sec: 5600.9, 60 sec: 5572.4, 300 sec: 5531.4). Total num frames: 1220909056. Throughput: 0: 5808.5. Samples: 1220908508. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:40,443][25689] Avg episode reward: [(0, '1.464')] [2022-07-11 12:30:42,252][26022] Updated weights on worker 0-0, policy_version 1192303 (0.00091) [2022-07-11 12:30:43,997][26022] Updated weights on worker 0-0, policy_version 1192313 (0.00085) [2022-07-11 12:30:45,459][25689] Fps is (10 sec: 5593.3, 60 sec: 5504.2, 300 sec: 5528.0). Total num frames: 1220935680. Throughput: 0: 5822.9. Samples: 1220942126. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:45,459][25689] Avg episode reward: [(0, '2.326')] [2022-07-11 12:30:46,077][26022] Updated weights on worker 0-0, policy_version 1192323 (0.00094) [2022-07-11 12:30:47,708][26022] Updated weights on worker 0-0, policy_version 1192333 (0.00092) [2022-07-11 12:30:49,685][26022] Updated weights on worker 0-0, policy_version 1192343 (0.00087) [2022-07-11 12:30:50,501][25689] Fps is (10 sec: 5395.2, 60 sec: 5536.4, 300 sec: 5531.1). Total num frames: 1220963328. Throughput: 0: 4982.0. Samples: 1220958766. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:50,502][25689] Avg episode reward: [(0, '2.262')] [2022-07-11 12:30:51,351][26022] Updated weights on worker 0-0, policy_version 1192353 (0.00083) [2022-07-11 12:30:53,339][26022] Updated weights on worker 0-0, policy_version 1192363 (0.00086) [2022-07-11 12:30:55,022][26022] Updated weights on worker 0-0, policy_version 1192373 (0.00108) [2022-07-11 12:30:55,538][25689] Fps is (10 sec: 5486.2, 60 sec: 5523.2, 300 sec: 5529.4). Total num frames: 1220990976. Throughput: 0: 5799.3. Samples: 1220992146. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:30:55,538][25689] Avg episode reward: [(0, '2.239')] [2022-07-11 12:30:57,001][26022] Updated weights on worker 0-0, policy_version 1192383 (0.00086) [2022-07-11 12:30:58,796][26022] Updated weights on worker 0-0, policy_version 1192393 (0.00088) [2022-07-11 12:31:00,643][25689] Fps is (10 sec: 5553.6, 60 sec: 5504.5, 300 sec: 5539.6). Total num frames: 1221019648. Throughput: 0: 5789.7. Samples: 1221025464. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:00,643][25689] Avg episode reward: [(0, '0.925')] [2022-07-11 12:31:00,694][26022] Updated weights on worker 0-0, policy_version 1192403 (0.00095) [2022-07-11 12:31:02,815][26022] Updated weights on worker 0-0, policy_version 1192413 (0.00085) [2022-07-11 12:31:04,832][26022] Updated weights on worker 0-0, policy_version 1192423 (0.00085) [2022-07-11 12:31:05,652][25689] Fps is (10 sec: 5365.6, 60 sec: 5520.7, 300 sec: 5529.6). Total num frames: 1221045248. Throughput: 0: 4842.0. Samples: 1221039904. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:05,653][25689] Avg episode reward: [(0, '0.923')] [2022-07-11 12:31:06,487][26022] Updated weights on worker 0-0, policy_version 1192433 (0.00089) [2022-07-11 12:31:08,503][26022] Updated weights on worker 0-0, policy_version 1192443 (0.00097) [2022-07-11 12:31:10,221][26022] Updated weights on worker 0-0, policy_version 1192453 (0.00088) [2022-07-11 12:31:10,676][25689] Fps is (10 sec: 5409.3, 60 sec: 5519.5, 300 sec: 5533.3). Total num frames: 1221073920. Throughput: 0: 5681.2. Samples: 1221073380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:10,676][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 12:31:12,102][26022] Updated weights on worker 0-0, policy_version 1192463 (0.00094) [2022-07-11 12:31:13,915][26022] Updated weights on worker 0-0, policy_version 1192473 (0.00084) [2022-07-11 12:31:15,679][25689] Fps is (10 sec: 5515.0, 60 sec: 5504.8, 300 sec: 5527.8). Total num frames: 1221100544. Throughput: 0: 5706.4. Samples: 1221107080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:15,679][25689] Avg episode reward: [(0, '0.471')] [2022-07-11 12:31:16,035][26022] Updated weights on worker 0-0, policy_version 1192483 (0.00085) [2022-07-11 12:31:17,516][26022] Updated weights on worker 0-0, policy_version 1192493 (0.00088) [2022-07-11 12:31:19,464][26022] Updated weights on worker 0-0, policy_version 1192503 (0.00091) [2022-07-11 12:31:20,763][25689] Fps is (10 sec: 5482.0, 60 sec: 5525.8, 300 sec: 5530.9). Total num frames: 1221129216. Throughput: 0: 4884.0. Samples: 1221123730. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:20,763][25689] Avg episode reward: [(0, '0.311')] [2022-07-11 12:31:21,116][26022] Updated weights on worker 0-0, policy_version 1192513 (0.00092) [2022-07-11 12:31:23,166][26022] Updated weights on worker 0-0, policy_version 1192523 (0.00092) [2022-07-11 12:31:24,871][26022] Updated weights on worker 0-0, policy_version 1192533 (0.00082) [2022-07-11 12:31:25,784][25689] Fps is (10 sec: 5675.0, 60 sec: 5525.3, 300 sec: 5534.2). Total num frames: 1221157888. Throughput: 0: 5824.8. Samples: 1221157164. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:25,785][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 12:31:26,896][26022] Updated weights on worker 0-0, policy_version 1192543 (0.00090) [2022-07-11 12:31:28,553][26022] Updated weights on worker 0-0, policy_version 1192553 (0.00092) [2022-07-11 12:31:30,759][26022] Updated weights on worker 0-0, policy_version 1192563 (0.00087) [2022-07-11 12:31:30,814][25689] Fps is (10 sec: 5501.2, 60 sec: 5490.5, 300 sec: 5527.3). Total num frames: 1221184512. Throughput: 0: 5827.0. Samples: 1221190726. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:30,815][25689] Avg episode reward: [(0, '-0.002')] [2022-07-11 12:31:32,110][26022] Updated weights on worker 0-0, policy_version 1192573 (0.00093) [2022-07-11 12:31:34,462][26022] Updated weights on worker 0-0, policy_version 1192583 (0.00093) [2022-07-11 12:31:35,831][25689] Fps is (10 sec: 5605.6, 60 sec: 5540.1, 300 sec: 5539.3). Total num frames: 1221214208. Throughput: 0: 4970.3. Samples: 1221207240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:35,831][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 12:31:35,899][26022] Updated weights on worker 0-0, policy_version 1192593 (0.00087) [2022-07-11 12:31:37,954][26022] Updated weights on worker 0-0, policy_version 1192603 (0.00083) [2022-07-11 12:31:39,740][26022] Updated weights on worker 0-0, policy_version 1192613 (0.00090) [2022-07-11 12:31:40,885][25689] Fps is (10 sec: 5694.3, 60 sec: 5506.1, 300 sec: 5532.1). Total num frames: 1221241856. Throughput: 0: 5820.6. Samples: 1221240852. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:40,885][25689] Avg episode reward: [(0, '1.555')] [2022-07-11 12:31:41,536][26022] Updated weights on worker 0-0, policy_version 1192623 (0.00086) [2022-07-11 12:31:43,449][26022] Updated weights on worker 0-0, policy_version 1192633 (0.00086) [2022-07-11 12:31:45,151][26022] Updated weights on worker 0-0, policy_version 1192643 (0.00088) [2022-07-11 12:31:45,978][25689] Fps is (10 sec: 5449.2, 60 sec: 5516.0, 300 sec: 5531.5). Total num frames: 1221269504. Throughput: 0: 5807.6. Samples: 1221274446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:45,979][25689] Avg episode reward: [(0, '1.210')] [2022-07-11 12:31:46,969][26022] Updated weights on worker 0-0, policy_version 1192653 (0.00088) [2022-07-11 12:31:48,918][26022] Updated weights on worker 0-0, policy_version 1192663 (0.00090) [2022-07-11 12:31:50,577][26022] Updated weights on worker 0-0, policy_version 1192673 (0.00085) [2022-07-11 12:31:50,999][25689] Fps is (10 sec: 5669.7, 60 sec: 5551.9, 300 sec: 5538.3). Total num frames: 1221299200. Throughput: 0: 4991.8. Samples: 1221291482. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:50,999][25689] Avg episode reward: [(0, '0.866')] [2022-07-11 12:31:52,479][26022] Updated weights on worker 0-0, policy_version 1192683 (0.00080) [2022-07-11 12:31:54,428][26022] Updated weights on worker 0-0, policy_version 1192693 (0.00092) [2022-07-11 12:31:56,023][25689] Fps is (10 sec: 5708.6, 60 sec: 5552.9, 300 sec: 5533.3). Total num frames: 1221326848. Throughput: 0: 5849.9. Samples: 1221325366. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:31:56,024][25689] Avg episode reward: [(0, '0.896')] [2022-07-11 12:31:56,155][26022] Updated weights on worker 0-0, policy_version 1192703 (0.00090) [2022-07-11 12:31:57,992][26022] Updated weights on worker 0-0, policy_version 1192713 (0.00079) [2022-07-11 12:31:59,902][26022] Updated weights on worker 0-0, policy_version 1192723 (0.00085) [2022-07-11 12:32:01,062][25689] Fps is (10 sec: 5495.0, 60 sec: 5542.1, 300 sec: 5540.0). Total num frames: 1221354496. Throughput: 0: 5853.5. Samples: 1221358958. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:01,062][25689] Avg episode reward: [(0, '1.010')] [2022-07-11 12:32:01,830][26022] Updated weights on worker 0-0, policy_version 1192733 (0.00824) [2022-07-11 12:32:03,961][26022] Updated weights on worker 0-0, policy_version 1192743 (0.00083) [2022-07-11 12:32:05,708][26022] Updated weights on worker 0-0, policy_version 1192753 (0.00087) [2022-07-11 12:32:06,074][25689] Fps is (10 sec: 5399.6, 60 sec: 5558.8, 300 sec: 5537.1). Total num frames: 1221381120. Throughput: 0: 5766.8. Samples: 1221390338. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:06,075][25689] Avg episode reward: [(0, '0.777')] [2022-07-11 12:32:06,856][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:32:06,867][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001192759_1221385216.pth [2022-07-11 12:32:06,868][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001190811_1219390464.pth [2022-07-11 12:32:07,462][26022] Updated weights on worker 0-0, policy_version 1192763 (0.00091) [2022-07-11 12:32:09,390][26022] Updated weights on worker 0-0, policy_version 1192773 (0.00091) [2022-07-11 12:32:11,086][25689] Fps is (10 sec: 5414.3, 60 sec: 5542.9, 300 sec: 5534.9). Total num frames: 1221408768. Throughput: 0: 5750.4. Samples: 1221406990. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:11,086][25689] Avg episode reward: [(0, '0.711')] [2022-07-11 12:32:11,129][26022] Updated weights on worker 0-0, policy_version 1192783 (0.00091) [2022-07-11 12:32:13,109][26022] Updated weights on worker 0-0, policy_version 1192793 (0.00091) [2022-07-11 12:32:14,955][26022] Updated weights on worker 0-0, policy_version 1192803 (0.00093) [2022-07-11 12:32:16,095][25689] Fps is (10 sec: 5415.9, 60 sec: 5542.3, 300 sec: 5532.7). Total num frames: 1221435392. Throughput: 0: 5729.1. Samples: 1221440360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:16,097][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 12:32:16,647][26022] Updated weights on worker 0-0, policy_version 1192813 (0.00082) [2022-07-11 12:32:18,613][26022] Updated weights on worker 0-0, policy_version 1192823 (0.00084) [2022-07-11 12:32:20,228][26022] Updated weights on worker 0-0, policy_version 1192833 (0.00086) [2022-07-11 12:32:21,215][25689] Fps is (10 sec: 5560.3, 60 sec: 5556.0, 300 sec: 5537.7). Total num frames: 1221465088. Throughput: 0: 5706.0. Samples: 1221473950. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:21,215][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 12:32:22,295][26022] Updated weights on worker 0-0, policy_version 1192843 (0.00074) [2022-07-11 12:32:24,123][26022] Updated weights on worker 0-0, policy_version 1192853 (0.00082) [2022-07-11 12:32:26,008][26022] Updated weights on worker 0-0, policy_version 1192863 (0.00095) [2022-07-11 12:32:26,226][25689] Fps is (10 sec: 5660.4, 60 sec: 5539.9, 300 sec: 5537.9). Total num frames: 1221492736. Throughput: 0: 4984.3. Samples: 1221490780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:26,227][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 12:32:27,773][26022] Updated weights on worker 0-0, policy_version 1192873 (0.00087) [2022-07-11 12:32:29,641][26022] Updated weights on worker 0-0, policy_version 1192883 (0.00094) [2022-07-11 12:32:31,279][25689] Fps is (10 sec: 5596.0, 60 sec: 5571.8, 300 sec: 5538.8). Total num frames: 1221521408. Throughput: 0: 5803.3. Samples: 1221524178. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:31,279][25689] Avg episode reward: [(0, '1.732')] [2022-07-11 12:32:31,359][26022] Updated weights on worker 0-0, policy_version 1192893 (0.00085) [2022-07-11 12:32:33,398][26022] Updated weights on worker 0-0, policy_version 1192903 (0.00088) [2022-07-11 12:32:35,027][26022] Updated weights on worker 0-0, policy_version 1192913 (0.00091) [2022-07-11 12:32:36,299][25689] Fps is (10 sec: 5591.2, 60 sec: 5537.6, 300 sec: 5543.8). Total num frames: 1221549056. Throughput: 0: 5811.9. Samples: 1221557782. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 12:32:36,300][25689] Avg episode reward: [(0, '1.691')] [2022-07-11 12:32:36,959][26022] Updated weights on worker 0-0, policy_version 1192923 (0.00096) [2022-07-11 12:32:38,755][26022] Updated weights on worker 0-0, policy_version 1192933 (0.00578) [2022-07-11 12:32:40,539][26022] Updated weights on worker 0-0, policy_version 1192943 (0.00078) [2022-07-11 12:32:41,371][25689] Fps is (10 sec: 5479.0, 60 sec: 5535.9, 300 sec: 5536.5). Total num frames: 1221576704. Throughput: 0: 4980.3. Samples: 1221574336. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:32:41,372][25689] Avg episode reward: [(0, '1.358')] [2022-07-11 12:32:42,587][26022] Updated weights on worker 0-0, policy_version 1192953 (0.00084) [2022-07-11 12:32:44,487][26022] Updated weights on worker 0-0, policy_version 1192963 (0.00084) [2022-07-11 12:32:46,248][26022] Updated weights on worker 0-0, policy_version 1192973 (0.00083) [2022-07-11 12:32:46,379][25689] Fps is (10 sec: 5587.1, 60 sec: 5560.7, 300 sec: 5538.0). Total num frames: 1221605376. Throughput: 0: 5797.0. Samples: 1221607608. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:32:46,380][25689] Avg episode reward: [(0, '1.514')] [2022-07-11 12:32:48,297][26022] Updated weights on worker 0-0, policy_version 1192983 (0.00085) [2022-07-11 12:32:49,745][26022] Updated weights on worker 0-0, policy_version 1192993 (0.00095) [2022-07-11 12:32:51,401][25689] Fps is (10 sec: 5615.0, 60 sec: 5526.6, 300 sec: 5535.4). Total num frames: 1221633024. Throughput: 0: 5821.6. Samples: 1221641324. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:32:51,402][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 12:32:51,805][26022] Updated weights on worker 0-0, policy_version 1193003 (0.00086) [2022-07-11 12:32:53,674][26022] Updated weights on worker 0-0, policy_version 1193013 (0.00090) [2022-07-11 12:32:55,215][26022] Updated weights on worker 0-0, policy_version 1193023 (0.00085) [2022-07-11 12:32:56,458][25689] Fps is (10 sec: 5486.3, 60 sec: 5523.7, 300 sec: 5538.7). Total num frames: 1221660672. Throughput: 0: 4977.9. Samples: 1221658132. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:32:56,459][25689] Avg episode reward: [(0, '1.515')] [2022-07-11 12:32:57,210][26022] Updated weights on worker 0-0, policy_version 1193033 (0.00087) [2022-07-11 12:32:59,230][26022] Updated weights on worker 0-0, policy_version 1193043 (0.00088) [2022-07-11 12:33:00,847][26022] Updated weights on worker 0-0, policy_version 1193053 (0.00092) [2022-07-11 12:33:01,510][25689] Fps is (10 sec: 5470.3, 60 sec: 5522.5, 300 sec: 5541.7). Total num frames: 1221688320. Throughput: 0: 5810.8. Samples: 1221691358. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:01,510][25689] Avg episode reward: [(0, '1.495')] [2022-07-11 12:33:03,220][26022] Updated weights on worker 0-0, policy_version 1193063 (0.00562) [2022-07-11 12:33:04,827][26022] Updated weights on worker 0-0, policy_version 1193073 (0.00090) [2022-07-11 12:33:06,523][25689] Fps is (10 sec: 5392.5, 60 sec: 5522.5, 300 sec: 5538.5). Total num frames: 1221714944. Throughput: 0: 5696.8. Samples: 1221722360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:06,523][25689] Avg episode reward: [(0, '1.703')] [2022-07-11 12:33:06,944][26022] Updated weights on worker 0-0, policy_version 1193083 (0.00091) [2022-07-11 12:33:08,687][26022] Updated weights on worker 0-0, policy_version 1193093 (0.00089) [2022-07-11 12:33:10,422][26022] Updated weights on worker 0-0, policy_version 1193103 (0.00084) [2022-07-11 12:33:11,594][25689] Fps is (10 sec: 5280.2, 60 sec: 5500.0, 300 sec: 5530.5). Total num frames: 1221741568. Throughput: 0: 4850.4. Samples: 1221739268. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:11,595][25689] Avg episode reward: [(0, '1.412')] [2022-07-11 12:33:12,319][26022] Updated weights on worker 0-0, policy_version 1193113 (0.00086) [2022-07-11 12:33:14,251][26022] Updated weights on worker 0-0, policy_version 1193123 (0.00091) [2022-07-11 12:33:15,851][26022] Updated weights on worker 0-0, policy_version 1193133 (0.00083) [2022-07-11 12:33:16,635][25689] Fps is (10 sec: 5569.6, 60 sec: 5548.0, 300 sec: 5534.1). Total num frames: 1221771264. Throughput: 0: 5698.7. Samples: 1221773110. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:16,635][25689] Avg episode reward: [(0, '0.040')] [2022-07-11 12:33:17,882][26022] Updated weights on worker 0-0, policy_version 1193143 (0.00087) [2022-07-11 12:33:19,477][26022] Updated weights on worker 0-0, policy_version 1193153 (0.00086) [2022-07-11 12:33:21,495][26022] Updated weights on worker 0-0, policy_version 1193163 (0.00088) [2022-07-11 12:33:21,748][25689] Fps is (10 sec: 5748.5, 60 sec: 5531.6, 300 sec: 5537.1). Total num frames: 1221799936. Throughput: 0: 5700.5. Samples: 1221806724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:21,748][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 12:33:23,372][26022] Updated weights on worker 0-0, policy_version 1193173 (0.00089) [2022-07-11 12:33:25,204][26022] Updated weights on worker 0-0, policy_version 1193183 (0.00085) [2022-07-11 12:33:26,778][25689] Fps is (10 sec: 5653.3, 60 sec: 5546.8, 300 sec: 5544.0). Total num frames: 1221828608. Throughput: 0: 4996.2. Samples: 1221823560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:26,779][25689] Avg episode reward: [(0, '0.117')] [2022-07-11 12:33:27,003][26022] Updated weights on worker 0-0, policy_version 1193193 (0.00089) [2022-07-11 12:33:28,864][26022] Updated weights on worker 0-0, policy_version 1193203 (0.00087) [2022-07-11 12:33:30,463][26022] Updated weights on worker 0-0, policy_version 1193213 (0.00083) [2022-07-11 12:33:31,789][25689] Fps is (10 sec: 5608.8, 60 sec: 5533.7, 300 sec: 5533.5). Total num frames: 1221856256. Throughput: 0: 5828.0. Samples: 1221856964. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:31,790][25689] Avg episode reward: [(0, '0.517')] [2022-07-11 12:33:32,676][26022] Updated weights on worker 0-0, policy_version 1193223 (0.00088) [2022-07-11 12:33:34,228][26022] Updated weights on worker 0-0, policy_version 1193233 (0.00090) [2022-07-11 12:33:36,308][26022] Updated weights on worker 0-0, policy_version 1193243 (0.00091) [2022-07-11 12:33:36,848][25689] Fps is (10 sec: 5287.6, 60 sec: 5496.3, 300 sec: 5528.1). Total num frames: 1221881856. Throughput: 0: 5780.4. Samples: 1221889954. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:36,849][25689] Avg episode reward: [(0, '0.226')] [2022-07-11 12:33:38,075][26022] Updated weights on worker 0-0, policy_version 1193253 (0.00083) [2022-07-11 12:33:40,115][26022] Updated weights on worker 0-0, policy_version 1193263 (0.00091) [2022-07-11 12:33:41,671][26022] Updated weights on worker 0-0, policy_version 1193273 (0.00098) [2022-07-11 12:33:41,955][25689] Fps is (10 sec: 5540.4, 60 sec: 5544.0, 300 sec: 5533.3). Total num frames: 1221912576. Throughput: 0: 4933.2. Samples: 1221906408. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:41,958][25689] Avg episode reward: [(0, '1.872')] [2022-07-11 12:33:43,770][26022] Updated weights on worker 0-0, policy_version 1193283 (0.00090) [2022-07-11 12:33:45,393][26022] Updated weights on worker 0-0, policy_version 1193293 (0.00084) [2022-07-11 12:33:47,021][25689] Fps is (10 sec: 5636.9, 60 sec: 5504.8, 300 sec: 5529.4). Total num frames: 1221939200. Throughput: 0: 5754.4. Samples: 1221940048. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:47,022][25689] Avg episode reward: [(0, '1.847')] [2022-07-11 12:33:47,467][26022] Updated weights on worker 0-0, policy_version 1193303 (0.00096) [2022-07-11 12:33:49,071][26022] Updated weights on worker 0-0, policy_version 1193313 (0.00093) [2022-07-11 12:33:51,361][26022] Updated weights on worker 0-0, policy_version 1193323 (0.00092) [2022-07-11 12:33:52,110][25689] Fps is (10 sec: 5444.9, 60 sec: 5515.7, 300 sec: 5532.2). Total num frames: 1221967872. Throughput: 0: 5712.3. Samples: 1221973042. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:52,112][25689] Avg episode reward: [(0, '2.073')] [2022-07-11 12:33:52,935][26022] Updated weights on worker 0-0, policy_version 1193333 (0.00084) [2022-07-11 12:33:54,815][26022] Updated weights on worker 0-0, policy_version 1193343 (0.00096) [2022-07-11 12:33:56,491][26022] Updated weights on worker 0-0, policy_version 1193353 (0.00089) [2022-07-11 12:33:57,128][25689] Fps is (10 sec: 5572.4, 60 sec: 5519.2, 300 sec: 5530.2). Total num frames: 1221995520. Throughput: 0: 4923.2. Samples: 1221989796. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:33:57,129][25689] Avg episode reward: [(0, '2.006')] [2022-07-11 12:33:58,564][26022] Updated weights on worker 0-0, policy_version 1193363 (0.00081) [2022-07-11 12:34:00,157][26022] Updated weights on worker 0-0, policy_version 1193373 (0.00086) [2022-07-11 12:34:02,283][25689] Fps is (10 sec: 5435.8, 60 sec: 5509.8, 300 sec: 5531.1). Total num frames: 1222023168. Throughput: 0: 5756.2. Samples: 1222023420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:02,283][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 12:34:02,386][26022] Updated weights on worker 0-0, policy_version 1193383 (0.00092) [2022-07-11 12:34:04,372][26022] Updated weights on worker 0-0, policy_version 1193393 (0.00095) [2022-07-11 12:34:06,184][26022] Updated weights on worker 0-0, policy_version 1193403 (0.00085) [2022-07-11 12:34:07,043][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:34:07,056][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001193408_1222049792.pth [2022-07-11 12:34:07,056][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001191459_1220054016.pth [2022-07-11 12:34:07,342][25689] Fps is (10 sec: 5413.9, 60 sec: 5522.5, 300 sec: 5530.5). Total num frames: 1222050816. Throughput: 0: 5635.7. Samples: 1222054568. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:07,343][25689] Avg episode reward: [(0, '1.200')] [2022-07-11 12:34:08,153][26022] Updated weights on worker 0-0, policy_version 1193413 (0.00090) [2022-07-11 12:34:09,907][26022] Updated weights on worker 0-0, policy_version 1193423 (0.00086) [2022-07-11 12:34:11,744][26022] Updated weights on worker 0-0, policy_version 1193433 (0.00084) [2022-07-11 12:34:12,355][25689] Fps is (10 sec: 5490.0, 60 sec: 5544.6, 300 sec: 5530.6). Total num frames: 1222078464. Throughput: 0: 5670.0. Samples: 1222087830. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:12,355][25689] Avg episode reward: [(0, '1.046')] [2022-07-11 12:34:13,532][26022] Updated weights on worker 0-0, policy_version 1193443 (0.00095) [2022-07-11 12:34:15,463][26022] Updated weights on worker 0-0, policy_version 1193453 (0.00086) [2022-07-11 12:34:17,298][26022] Updated weights on worker 0-0, policy_version 1193463 (0.00082) [2022-07-11 12:34:17,371][25689] Fps is (10 sec: 5513.7, 60 sec: 5513.2, 300 sec: 5527.7). Total num frames: 1222106112. Throughput: 0: 5676.3. Samples: 1222104700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:17,371][25689] Avg episode reward: [(0, '0.942')] [2022-07-11 12:34:18,934][26022] Updated weights on worker 0-0, policy_version 1193473 (0.00089) [2022-07-11 12:34:20,890][26022] Updated weights on worker 0-0, policy_version 1193483 (0.00095) [2022-07-11 12:34:22,418][25689] Fps is (10 sec: 5596.5, 60 sec: 5519.1, 300 sec: 5530.8). Total num frames: 1222134784. Throughput: 0: 5717.4. Samples: 1222138544. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:22,419][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 12:34:22,561][26022] Updated weights on worker 0-0, policy_version 1193493 (0.00085) [2022-07-11 12:34:24,508][26022] Updated weights on worker 0-0, policy_version 1193503 (0.00090) [2022-07-11 12:34:26,323][26022] Updated weights on worker 0-0, policy_version 1193513 (0.00085) [2022-07-11 12:34:27,426][25689] Fps is (10 sec: 5702.7, 60 sec: 5521.1, 300 sec: 5530.8). Total num frames: 1222163456. Throughput: 0: 5864.3. Samples: 1222172348. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:27,427][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 12:34:28,279][26022] Updated weights on worker 0-0, policy_version 1193523 (0.00094) [2022-07-11 12:34:29,953][26022] Updated weights on worker 0-0, policy_version 1193533 (0.00095) [2022-07-11 12:34:31,803][26022] Updated weights on worker 0-0, policy_version 1193543 (0.00094) [2022-07-11 12:34:32,513][25689] Fps is (10 sec: 5477.4, 60 sec: 5497.4, 300 sec: 5525.8). Total num frames: 1222190080. Throughput: 0: 5014.8. Samples: 1222188922. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:32,514][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 12:34:33,664][26022] Updated weights on worker 0-0, policy_version 1193553 (0.00090) [2022-07-11 12:34:35,597][26022] Updated weights on worker 0-0, policy_version 1193563 (0.00085) [2022-07-11 12:34:37,532][25689] Fps is (10 sec: 5370.4, 60 sec: 5534.8, 300 sec: 5530.4). Total num frames: 1222217728. Throughput: 0: 5821.3. Samples: 1222222066. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:37,533][25689] Avg episode reward: [(0, '1.375')] [2022-07-11 12:34:37,579][26022] Updated weights on worker 0-0, policy_version 1193573 (0.00086) [2022-07-11 12:34:39,383][26022] Updated weights on worker 0-0, policy_version 1193583 (0.00085) [2022-07-11 12:34:41,153][26022] Updated weights on worker 0-0, policy_version 1193593 (0.00096) [2022-07-11 12:34:42,627][25689] Fps is (10 sec: 5568.9, 60 sec: 5502.1, 300 sec: 5522.0). Total num frames: 1222246400. Throughput: 0: 5778.6. Samples: 1222255322. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:42,627][25689] Avg episode reward: [(0, '1.394')] [2022-07-11 12:34:43,072][26022] Updated weights on worker 0-0, policy_version 1193603 (0.00091) [2022-07-11 12:34:44,807][26022] Updated weights on worker 0-0, policy_version 1193613 (0.00092) [2022-07-11 12:34:46,664][26022] Updated weights on worker 0-0, policy_version 1193623 (0.00093) [2022-07-11 12:34:47,643][25689] Fps is (10 sec: 5671.3, 60 sec: 5540.4, 300 sec: 5532.5). Total num frames: 1222275072. Throughput: 0: 4934.4. Samples: 1222272108. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:47,645][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 12:34:48,450][26022] Updated weights on worker 0-0, policy_version 1193633 (0.00336) [2022-07-11 12:34:50,473][26022] Updated weights on worker 0-0, policy_version 1193643 (0.00086) [2022-07-11 12:34:52,156][26022] Updated weights on worker 0-0, policy_version 1193653 (0.00086) [2022-07-11 12:34:52,661][25689] Fps is (10 sec: 5612.6, 60 sec: 5530.0, 300 sec: 5530.1). Total num frames: 1222302720. Throughput: 0: 5777.1. Samples: 1222305320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:52,662][25689] Avg episode reward: [(0, '1.602')] [2022-07-11 12:34:54,003][26022] Updated weights on worker 0-0, policy_version 1193663 (0.00088) [2022-07-11 12:34:55,972][26022] Updated weights on worker 0-0, policy_version 1193673 (0.00084) [2022-07-11 12:34:57,702][25689] Fps is (10 sec: 5497.5, 60 sec: 5528.0, 300 sec: 5524.1). Total num frames: 1222330368. Throughput: 0: 5784.4. Samples: 1222338736. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:34:57,704][25689] Avg episode reward: [(0, '1.646')] [2022-07-11 12:34:57,867][26022] Updated weights on worker 0-0, policy_version 1193683 (0.00099) [2022-07-11 12:34:59,626][26022] Updated weights on worker 0-0, policy_version 1193693 (0.00086) [2022-07-11 12:35:01,424][26022] Updated weights on worker 0-0, policy_version 1193703 (0.00091) [2022-07-11 12:35:02,842][25689] Fps is (10 sec: 5330.8, 60 sec: 5512.3, 300 sec: 5528.4). Total num frames: 1222356992. Throughput: 0: 4959.0. Samples: 1222355572. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:02,843][25689] Avg episode reward: [(0, '1.741')] [2022-07-11 12:35:03,567][26022] Updated weights on worker 0-0, policy_version 1193713 (0.00102) [2022-07-11 12:35:05,514][26022] Updated weights on worker 0-0, policy_version 1193723 (0.00088) [2022-07-11 12:35:07,220][26022] Updated weights on worker 0-0, policy_version 1193733 (0.00092) [2022-07-11 12:35:07,867][25689] Fps is (10 sec: 5440.0, 60 sec: 5532.4, 300 sec: 5528.1). Total num frames: 1222385664. Throughput: 0: 5673.3. Samples: 1222386840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:07,867][25689] Avg episode reward: [(0, '2.028')] [2022-07-11 12:35:09,400][26022] Updated weights on worker 0-0, policy_version 1193743 (0.00086) [2022-07-11 12:35:10,888][26022] Updated weights on worker 0-0, policy_version 1193753 (0.00088) [2022-07-11 12:35:12,947][25689] Fps is (10 sec: 5472.4, 60 sec: 5509.4, 300 sec: 5523.7). Total num frames: 1222412288. Throughput: 0: 5668.8. Samples: 1222420314. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:12,947][25689] Avg episode reward: [(0, '2.026')] [2022-07-11 12:35:12,957][26022] Updated weights on worker 0-0, policy_version 1193763 (0.00120) [2022-07-11 12:35:14,559][26022] Updated weights on worker 0-0, policy_version 1193773 (0.00087) [2022-07-11 12:35:16,536][26022] Updated weights on worker 0-0, policy_version 1193783 (0.00106) [2022-07-11 12:35:17,951][25689] Fps is (10 sec: 5483.3, 60 sec: 5527.3, 300 sec: 5529.5). Total num frames: 1222440960. Throughput: 0: 4858.0. Samples: 1222437106. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:17,953][25689] Avg episode reward: [(0, '2.111')] [2022-07-11 12:35:18,286][26022] Updated weights on worker 0-0, policy_version 1193793 (0.00092) [2022-07-11 12:35:20,174][26022] Updated weights on worker 0-0, policy_version 1193803 (0.00083) [2022-07-11 12:35:21,979][26022] Updated weights on worker 0-0, policy_version 1193813 (0.00090) [2022-07-11 12:35:23,050][25689] Fps is (10 sec: 5777.0, 60 sec: 5539.5, 300 sec: 5531.3). Total num frames: 1222470656. Throughput: 0: 5686.5. Samples: 1222470484. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:23,051][25689] Avg episode reward: [(0, '1.052')] [2022-07-11 12:35:23,966][26022] Updated weights on worker 0-0, policy_version 1193823 (0.00108) [2022-07-11 12:35:25,508][26022] Updated weights on worker 0-0, policy_version 1193833 (0.00086) [2022-07-11 12:35:27,892][26022] Updated weights on worker 0-0, policy_version 1193843 (0.00091) [2022-07-11 12:35:28,057][25689] Fps is (10 sec: 5471.8, 60 sec: 5489.0, 300 sec: 5521.3). Total num frames: 1222496256. Throughput: 0: 5796.2. Samples: 1222503866. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:28,057][25689] Avg episode reward: [(0, '0.892')] [2022-07-11 12:35:29,190][26022] Updated weights on worker 0-0, policy_version 1193853 (0.00087) [2022-07-11 12:35:31,483][26022] Updated weights on worker 0-0, policy_version 1193863 (0.00102) [2022-07-11 12:35:32,731][26022] Updated weights on worker 0-0, policy_version 1193873 (0.00085) [2022-07-11 12:35:33,061][25689] Fps is (10 sec: 5625.7, 60 sec: 5564.1, 300 sec: 5535.0). Total num frames: 1222526976. Throughput: 0: 4983.0. Samples: 1222520546. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:33,062][25689] Avg episode reward: [(0, '0.716')] [2022-07-11 12:35:35,159][26022] Updated weights on worker 0-0, policy_version 1193883 (0.00093) [2022-07-11 12:35:36,707][26022] Updated weights on worker 0-0, policy_version 1193893 (0.00083) [2022-07-11 12:35:38,079][25689] Fps is (10 sec: 5517.5, 60 sec: 5513.5, 300 sec: 5518.5). Total num frames: 1222551552. Throughput: 0: 5802.6. Samples: 1222553898. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:38,079][25689] Avg episode reward: [(0, '0.458')] [2022-07-11 12:35:38,679][26022] Updated weights on worker 0-0, policy_version 1193903 (0.00088) [2022-07-11 12:35:40,283][26022] Updated weights on worker 0-0, policy_version 1193913 (0.00082) [2022-07-11 12:35:42,415][26022] Updated weights on worker 0-0, policy_version 1193923 (0.00089) [2022-07-11 12:35:43,164][25689] Fps is (10 sec: 5473.4, 60 sec: 5548.2, 300 sec: 5530.9). Total num frames: 1222582272. Throughput: 0: 5807.5. Samples: 1222587296. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:43,165][25689] Avg episode reward: [(0, '0.547')] [2022-07-11 12:35:44,223][26022] Updated weights on worker 0-0, policy_version 1193933 (0.00079) [2022-07-11 12:35:45,985][26022] Updated weights on worker 0-0, policy_version 1193943 (0.00081) [2022-07-11 12:35:47,926][26022] Updated weights on worker 0-0, policy_version 1193953 (0.00086) [2022-07-11 12:35:48,231][25689] Fps is (10 sec: 5648.5, 60 sec: 5509.8, 300 sec: 5527.1). Total num frames: 1222608896. Throughput: 0: 4965.3. Samples: 1222604036. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:48,231][25689] Avg episode reward: [(0, '0.720')] [2022-07-11 12:35:49,635][26022] Updated weights on worker 0-0, policy_version 1193963 (0.00090) [2022-07-11 12:35:51,671][26022] Updated weights on worker 0-0, policy_version 1193973 (0.00086) [2022-07-11 12:35:53,234][25689] Fps is (10 sec: 5389.5, 60 sec: 5511.1, 300 sec: 5527.7). Total num frames: 1222636544. Throughput: 0: 5792.5. Samples: 1222637396. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:53,235][25689] Avg episode reward: [(0, '1.758')] [2022-07-11 12:35:53,400][26022] Updated weights on worker 0-0, policy_version 1193983 (0.00088) [2022-07-11 12:35:55,175][26022] Updated weights on worker 0-0, policy_version 1193993 (0.00087) [2022-07-11 12:35:57,133][26022] Updated weights on worker 0-0, policy_version 1194003 (0.00087) [2022-07-11 12:35:58,296][25689] Fps is (10 sec: 5697.1, 60 sec: 5543.0, 300 sec: 5531.9). Total num frames: 1222666240. Throughput: 0: 5785.9. Samples: 1222670874. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:35:58,298][25689] Avg episode reward: [(0, '1.694')] [2022-07-11 12:35:58,775][26022] Updated weights on worker 0-0, policy_version 1194013 (0.00091) [2022-07-11 12:36:00,780][26022] Updated weights on worker 0-0, policy_version 1194023 (0.00095) [2022-07-11 12:36:03,056][26022] Updated weights on worker 0-0, policy_version 1194033 (0.00102) [2022-07-11 12:36:03,414][25689] Fps is (10 sec: 5331.5, 60 sec: 5511.3, 300 sec: 5526.5). Total num frames: 1222690816. Throughput: 0: 4959.3. Samples: 1222687720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:03,414][25689] Avg episode reward: [(0, '1.590')] [2022-07-11 12:36:04,563][26022] Updated weights on worker 0-0, policy_version 1194043 (0.00083) [2022-07-11 12:36:06,723][26022] Updated weights on worker 0-0, policy_version 1194053 (0.00085) [2022-07-11 12:36:07,123][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:36:07,144][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001194055_1222712320.pth [2022-07-11 12:36:07,145][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001192111_1220721664.pth [2022-07-11 12:36:08,456][25689] Fps is (10 sec: 5342.0, 60 sec: 5526.5, 300 sec: 5529.6). Total num frames: 1222720512. Throughput: 0: 5690.6. Samples: 1222719128. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:08,458][25689] Avg episode reward: [(0, '1.689')] [2022-07-11 12:36:08,457][26022] Updated weights on worker 0-0, policy_version 1194063 (0.00117) [2022-07-11 12:36:10,367][26022] Updated weights on worker 0-0, policy_version 1194073 (0.00084) [2022-07-11 12:36:12,315][26022] Updated weights on worker 0-0, policy_version 1194083 (0.00087) [2022-07-11 12:36:13,461][25689] Fps is (10 sec: 5707.2, 60 sec: 5550.3, 300 sec: 5533.0). Total num frames: 1222748160. Throughput: 0: 5688.4. Samples: 1222752456. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:13,462][25689] Avg episode reward: [(0, '0.838')] [2022-07-11 12:36:14,042][26022] Updated weights on worker 0-0, policy_version 1194093 (0.00087) [2022-07-11 12:36:15,994][26022] Updated weights on worker 0-0, policy_version 1194103 (0.00083) [2022-07-11 12:36:17,615][26022] Updated weights on worker 0-0, policy_version 1194113 (0.00100) [2022-07-11 12:36:18,488][25689] Fps is (10 sec: 5511.7, 60 sec: 5531.3, 300 sec: 5530.6). Total num frames: 1222775808. Throughput: 0: 4877.2. Samples: 1222769354. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:18,490][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 12:36:19,604][26022] Updated weights on worker 0-0, policy_version 1194123 (0.00093) [2022-07-11 12:36:21,268][26022] Updated weights on worker 0-0, policy_version 1194133 (0.00085) [2022-07-11 12:36:23,280][26022] Updated weights on worker 0-0, policy_version 1194143 (0.00088) [2022-07-11 12:36:23,605][25689] Fps is (10 sec: 5451.1, 60 sec: 5495.8, 300 sec: 5525.3). Total num frames: 1222803456. Throughput: 0: 5690.5. Samples: 1222802620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:23,607][25689] Avg episode reward: [(0, '-0.301')] [2022-07-11 12:36:24,911][26022] Updated weights on worker 0-0, policy_version 1194153 (0.00086) [2022-07-11 12:36:26,880][26022] Updated weights on worker 0-0, policy_version 1194163 (0.00079) [2022-07-11 12:36:28,655][25689] Fps is (10 sec: 5539.8, 60 sec: 5542.6, 300 sec: 5531.9). Total num frames: 1222832128. Throughput: 0: 5807.0. Samples: 1222836424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:28,655][25689] Avg episode reward: [(0, '-0.206')] [2022-07-11 12:36:28,671][26022] Updated weights on worker 0-0, policy_version 1194173 (0.00091) [2022-07-11 12:36:30,677][26022] Updated weights on worker 0-0, policy_version 1194183 (0.00086) [2022-07-11 12:36:32,236][26022] Updated weights on worker 0-0, policy_version 1194193 (0.00090) [2022-07-11 12:36:33,691][25689] Fps is (10 sec: 5584.3, 60 sec: 5489.1, 300 sec: 5524.6). Total num frames: 1222859776. Throughput: 0: 5811.3. Samples: 1222870016. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:33,691][25689] Avg episode reward: [(0, '-0.108')] [2022-07-11 12:36:34,286][26022] Updated weights on worker 0-0, policy_version 1194203 (0.00086) [2022-07-11 12:36:35,871][26022] Updated weights on worker 0-0, policy_version 1194213 (0.00059) [2022-07-11 12:36:37,982][26022] Updated weights on worker 0-0, policy_version 1194223 (0.00086) [2022-07-11 12:36:38,694][25689] Fps is (10 sec: 5610.1, 60 sec: 5558.0, 300 sec: 5529.0). Total num frames: 1222888448. Throughput: 0: 5810.7. Samples: 1222886762. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2022-07-11 12:36:38,694][25689] Avg episode reward: [(0, '-0.519')] [2022-07-11 12:36:39,558][26022] Updated weights on worker 0-0, policy_version 1194233 (0.00091) [2022-07-11 12:36:41,741][26022] Updated weights on worker 0-0, policy_version 1194243 (0.00086) [2022-07-11 12:36:43,502][26022] Updated weights on worker 0-0, policy_version 1194253 (0.00089) [2022-07-11 12:36:43,806][25689] Fps is (10 sec: 5669.1, 60 sec: 5521.8, 300 sec: 5532.1). Total num frames: 1222917120. Throughput: 0: 5803.0. Samples: 1222919844. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:36:43,806][25689] Avg episode reward: [(0, '1.373')] [2022-07-11 12:36:45,336][26022] Updated weights on worker 0-0, policy_version 1194263 (0.00084) [2022-07-11 12:36:47,062][26022] Updated weights on worker 0-0, policy_version 1194273 (0.00565) [2022-07-11 12:36:48,852][25689] Fps is (10 sec: 5544.2, 60 sec: 5540.5, 300 sec: 5524.8). Total num frames: 1222944768. Throughput: 0: 5798.8. Samples: 1222953544. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:36:48,853][25689] Avg episode reward: [(0, '1.731')] [2022-07-11 12:36:48,924][26022] Updated weights on worker 0-0, policy_version 1194283 (0.00094) [2022-07-11 12:36:50,742][26022] Updated weights on worker 0-0, policy_version 1194293 (0.00085) [2022-07-11 12:36:52,582][26022] Updated weights on worker 0-0, policy_version 1194303 (0.00092) [2022-07-11 12:36:53,887][25689] Fps is (10 sec: 5586.7, 60 sec: 5554.5, 300 sec: 5528.0). Total num frames: 1222973440. Throughput: 0: 4979.5. Samples: 1222970584. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:36:53,888][25689] Avg episode reward: [(0, '1.501')] [2022-07-11 12:36:54,341][26022] Updated weights on worker 0-0, policy_version 1194313 (0.00088) [2022-07-11 12:36:56,179][26022] Updated weights on worker 0-0, policy_version 1194323 (0.00090) [2022-07-11 12:36:57,908][26022] Updated weights on worker 0-0, policy_version 1194333 (0.00090) [2022-07-11 12:36:58,894][25689] Fps is (10 sec: 5608.3, 60 sec: 5525.7, 300 sec: 5528.6). Total num frames: 1223001088. Throughput: 0: 5806.2. Samples: 1223004050. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:36:58,895][25689] Avg episode reward: [(0, '1.492')] [2022-07-11 12:36:59,927][26022] Updated weights on worker 0-0, policy_version 1194343 (0.00086) [2022-07-11 12:37:01,952][26022] Updated weights on worker 0-0, policy_version 1194353 (0.00090) [2022-07-11 12:37:03,943][26022] Updated weights on worker 0-0, policy_version 1194363 (0.00091) [2022-07-11 12:37:04,000][25689] Fps is (10 sec: 5366.7, 60 sec: 5560.6, 300 sec: 5526.9). Total num frames: 1223027712. Throughput: 0: 5717.6. Samples: 1223035304. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:04,000][25689] Avg episode reward: [(0, '0.759')] [2022-07-11 12:37:05,676][26022] Updated weights on worker 0-0, policy_version 1194373 (0.00088) [2022-07-11 12:37:07,876][26022] Updated weights on worker 0-0, policy_version 1194383 (0.00088) [2022-07-11 12:37:09,004][25689] Fps is (10 sec: 5267.0, 60 sec: 5513.4, 300 sec: 5523.6). Total num frames: 1223054336. Throughput: 0: 4889.9. Samples: 1223052084. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:09,004][25689] Avg episode reward: [(0, '0.922')] [2022-07-11 12:37:09,409][26022] Updated weights on worker 0-0, policy_version 1194393 (0.00081) [2022-07-11 12:37:11,399][26022] Updated weights on worker 0-0, policy_version 1194403 (0.00090) [2022-07-11 12:37:12,938][26022] Updated weights on worker 0-0, policy_version 1194413 (0.00093) [2022-07-11 12:37:14,010][25689] Fps is (10 sec: 5524.0, 60 sec: 5530.2, 300 sec: 5530.5). Total num frames: 1223083008. Throughput: 0: 5710.6. Samples: 1223085496. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:14,010][25689] Avg episode reward: [(0, '-0.615')] [2022-07-11 12:37:15,243][26022] Updated weights on worker 0-0, policy_version 1194423 (0.00097) [2022-07-11 12:37:16,647][26022] Updated weights on worker 0-0, policy_version 1194433 (0.00077) [2022-07-11 12:37:18,667][26022] Updated weights on worker 0-0, policy_version 1194443 (0.00095) [2022-07-11 12:37:19,018][25689] Fps is (10 sec: 5623.9, 60 sec: 5531.9, 300 sec: 5525.7). Total num frames: 1223110656. Throughput: 0: 5711.4. Samples: 1223118986. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:19,019][25689] Avg episode reward: [(0, '-1.208')] [2022-07-11 12:37:20,427][26022] Updated weights on worker 0-0, policy_version 1194453 (0.00093) [2022-07-11 12:37:22,315][26022] Updated weights on worker 0-0, policy_version 1194463 (0.00086) [2022-07-11 12:37:24,063][26022] Updated weights on worker 0-0, policy_version 1194473 (0.00081) [2022-07-11 12:37:24,083][25689] Fps is (10 sec: 5692.8, 60 sec: 5570.6, 300 sec: 5531.6). Total num frames: 1223140352. Throughput: 0: 4998.6. Samples: 1223135692. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:24,083][25689] Avg episode reward: [(0, '-1.182')] [2022-07-11 12:37:26,077][26022] Updated weights on worker 0-0, policy_version 1194483 (0.00920) [2022-07-11 12:37:27,701][26022] Updated weights on worker 0-0, policy_version 1194493 (0.00622) [2022-07-11 12:37:29,095][25689] Fps is (10 sec: 5487.6, 60 sec: 5523.2, 300 sec: 5522.0). Total num frames: 1223165952. Throughput: 0: 5821.3. Samples: 1223169040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:29,095][25689] Avg episode reward: [(0, '-1.374')] [2022-07-11 12:37:29,781][26022] Updated weights on worker 0-0, policy_version 1194503 (0.00089) [2022-07-11 12:37:31,325][26022] Updated weights on worker 0-0, policy_version 1194513 (0.00086) [2022-07-11 12:37:33,339][26022] Updated weights on worker 0-0, policy_version 1194523 (0.00112) [2022-07-11 12:37:34,149][25689] Fps is (10 sec: 5493.4, 60 sec: 5555.5, 300 sec: 5528.3). Total num frames: 1223195648. Throughput: 0: 5813.7. Samples: 1223202578. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:34,149][25689] Avg episode reward: [(0, '-0.351')] [2022-07-11 12:37:35,266][26022] Updated weights on worker 0-0, policy_version 1194533 (0.00092) [2022-07-11 12:37:36,888][26022] Updated weights on worker 0-0, policy_version 1194543 (0.00094) [2022-07-11 12:37:39,061][26022] Updated weights on worker 0-0, policy_version 1194553 (0.00087) [2022-07-11 12:37:39,171][25689] Fps is (10 sec: 5691.3, 60 sec: 5536.8, 300 sec: 5529.2). Total num frames: 1223223296. Throughput: 0: 4980.8. Samples: 1223219360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:39,172][25689] Avg episode reward: [(0, '-0.340')] [2022-07-11 12:37:40,804][26022] Updated weights on worker 0-0, policy_version 1194563 (0.00094) [2022-07-11 12:37:42,731][26022] Updated weights on worker 0-0, policy_version 1194573 (0.00105) [2022-07-11 12:37:44,301][25689] Fps is (10 sec: 5446.8, 60 sec: 5518.2, 300 sec: 5523.5). Total num frames: 1223250944. Throughput: 0: 5765.3. Samples: 1223252254. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:44,301][25689] Avg episode reward: [(0, '0.628')] [2022-07-11 12:37:44,540][26022] Updated weights on worker 0-0, policy_version 1194583 (0.00091) [2022-07-11 12:37:46,203][26022] Updated weights on worker 0-0, policy_version 1194593 (0.00082) [2022-07-11 12:37:48,193][26022] Updated weights on worker 0-0, policy_version 1194603 (0.00092) [2022-07-11 12:37:49,326][25689] Fps is (10 sec: 5444.8, 60 sec: 5520.1, 300 sec: 5523.4). Total num frames: 1223278592. Throughput: 0: 5765.1. Samples: 1223285676. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:49,327][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 12:37:49,944][26022] Updated weights on worker 0-0, policy_version 1194613 (0.00088) [2022-07-11 12:37:51,874][26022] Updated weights on worker 0-0, policy_version 1194623 (0.00086) [2022-07-11 12:37:53,659][26022] Updated weights on worker 0-0, policy_version 1194633 (0.00082) [2022-07-11 12:37:54,343][25689] Fps is (10 sec: 5608.3, 60 sec: 5521.8, 300 sec: 5527.6). Total num frames: 1223307264. Throughput: 0: 4951.6. Samples: 1223302572. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:54,344][25689] Avg episode reward: [(0, '1.118')] [2022-07-11 12:37:55,495][26022] Updated weights on worker 0-0, policy_version 1194643 (0.00081) [2022-07-11 12:37:57,407][26022] Updated weights on worker 0-0, policy_version 1194653 (0.00092) [2022-07-11 12:37:59,140][26022] Updated weights on worker 0-0, policy_version 1194663 (0.00100) [2022-07-11 12:37:59,372][25689] Fps is (10 sec: 5606.2, 60 sec: 5519.8, 300 sec: 5528.0). Total num frames: 1223334912. Throughput: 0: 5794.2. Samples: 1223336412. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:37:59,373][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 12:38:00,993][26022] Updated weights on worker 0-0, policy_version 1194673 (0.00086) [2022-07-11 12:38:03,143][26022] Updated weights on worker 0-0, policy_version 1194683 (0.00089) [2022-07-11 12:38:04,486][25689] Fps is (10 sec: 5350.9, 60 sec: 5519.0, 300 sec: 5526.2). Total num frames: 1223361536. Throughput: 0: 5723.4. Samples: 1223367780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:04,486][25689] Avg episode reward: [(0, '-0.002')] [2022-07-11 12:38:05,198][26022] Updated weights on worker 0-0, policy_version 1194693 (0.00085) [2022-07-11 12:38:06,873][26022] Updated weights on worker 0-0, policy_version 1194703 (0.00091) [2022-07-11 12:38:07,365][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:38:07,377][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001194705_1223377920.pth [2022-07-11 12:38:07,378][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001192759_1221385216.pth [2022-07-11 12:38:08,715][26022] Updated weights on worker 0-0, policy_version 1194713 (0.00099) [2022-07-11 12:38:09,528][25689] Fps is (10 sec: 5545.7, 60 sec: 5566.3, 300 sec: 5537.0). Total num frames: 1223391232. Throughput: 0: 4901.1. Samples: 1223384686. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:09,529][25689] Avg episode reward: [(0, '0.908')] [2022-07-11 12:38:10,719][26022] Updated weights on worker 0-0, policy_version 1194723 (0.00085) [2022-07-11 12:38:12,300][26022] Updated weights on worker 0-0, policy_version 1194733 (0.00091) [2022-07-11 12:38:14,402][26022] Updated weights on worker 0-0, policy_version 1194743 (0.00094) [2022-07-11 12:38:14,555][25689] Fps is (10 sec: 5593.2, 60 sec: 5530.6, 300 sec: 5527.0). Total num frames: 1223417856. Throughput: 0: 5712.9. Samples: 1223418040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:14,555][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 12:38:15,848][26022] Updated weights on worker 0-0, policy_version 1194753 (0.00093) [2022-07-11 12:38:17,921][26022] Updated weights on worker 0-0, policy_version 1194763 (0.00089) [2022-07-11 12:38:19,624][25689] Fps is (10 sec: 5476.8, 60 sec: 5541.9, 300 sec: 5527.8). Total num frames: 1223446528. Throughput: 0: 5685.7. Samples: 1223451560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:19,625][25689] Avg episode reward: [(0, '0.721')] [2022-07-11 12:38:19,775][26022] Updated weights on worker 0-0, policy_version 1194773 (0.00085) [2022-07-11 12:38:21,512][26022] Updated weights on worker 0-0, policy_version 1194783 (0.00083) [2022-07-11 12:38:23,484][26022] Updated weights on worker 0-0, policy_version 1194793 (0.00091) [2022-07-11 12:38:24,703][25689] Fps is (10 sec: 5751.6, 60 sec: 5540.6, 300 sec: 5530.3). Total num frames: 1223476224. Throughput: 0: 4974.2. Samples: 1223468346. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:24,704][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 12:38:25,143][26022] Updated weights on worker 0-0, policy_version 1194803 (0.00094) [2022-07-11 12:38:27,214][26022] Updated weights on worker 0-0, policy_version 1194813 (0.00504) [2022-07-11 12:38:29,083][26022] Updated weights on worker 0-0, policy_version 1194823 (0.00085) [2022-07-11 12:38:29,708][25689] Fps is (10 sec: 5484.0, 60 sec: 5541.3, 300 sec: 5523.6). Total num frames: 1223501824. Throughput: 0: 5800.6. Samples: 1223501740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:29,708][25689] Avg episode reward: [(0, '0.254')] [2022-07-11 12:38:30,652][26022] Updated weights on worker 0-0, policy_version 1194833 (0.00096) [2022-07-11 12:38:32,795][26022] Updated weights on worker 0-0, policy_version 1194843 (0.00094) [2022-07-11 12:38:34,479][26022] Updated weights on worker 0-0, policy_version 1194853 (0.00093) [2022-07-11 12:38:34,727][25689] Fps is (10 sec: 5414.4, 60 sec: 5527.5, 300 sec: 5534.6). Total num frames: 1223530496. Throughput: 0: 5792.0. Samples: 1223534876. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:34,729][25689] Avg episode reward: [(0, '1.015')] [2022-07-11 12:38:36,530][26022] Updated weights on worker 0-0, policy_version 1194863 (0.00087) [2022-07-11 12:38:38,096][26022] Updated weights on worker 0-0, policy_version 1194873 (0.00054) [2022-07-11 12:38:39,770][25689] Fps is (10 sec: 5597.3, 60 sec: 5525.7, 300 sec: 5525.5). Total num frames: 1223558144. Throughput: 0: 4954.8. Samples: 1223551374. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:39,771][25689] Avg episode reward: [(0, '-0.213')] [2022-07-11 12:38:40,012][26022] Updated weights on worker 0-0, policy_version 1194883 (0.00085) [2022-07-11 12:38:42,018][26022] Updated weights on worker 0-0, policy_version 1194893 (0.00093) [2022-07-11 12:38:43,915][26022] Updated weights on worker 0-0, policy_version 1194903 (0.00098) [2022-07-11 12:38:44,883][25689] Fps is (10 sec: 5444.7, 60 sec: 5527.2, 300 sec: 5528.0). Total num frames: 1223585792. Throughput: 0: 5763.4. Samples: 1223584650. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:44,883][25689] Avg episode reward: [(0, '-0.360')] [2022-07-11 12:38:45,532][26022] Updated weights on worker 0-0, policy_version 1194913 (0.00084) [2022-07-11 12:38:47,349][26022] Updated weights on worker 0-0, policy_version 1194923 (0.00083) [2022-07-11 12:38:49,095][26022] Updated weights on worker 0-0, policy_version 1194933 (0.00087) [2022-07-11 12:38:49,948][25689] Fps is (10 sec: 5533.3, 60 sec: 5540.5, 300 sec: 5528.5). Total num frames: 1223614464. Throughput: 0: 5766.9. Samples: 1223618464. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:49,948][25689] Avg episode reward: [(0, '-0.735')] [2022-07-11 12:38:51,252][26022] Updated weights on worker 0-0, policy_version 1194943 (0.00089) [2022-07-11 12:38:52,943][26022] Updated weights on worker 0-0, policy_version 1194953 (0.00093) [2022-07-11 12:38:54,919][26022] Updated weights on worker 0-0, policy_version 1194963 (0.00055) [2022-07-11 12:38:54,963][25689] Fps is (10 sec: 5587.2, 60 sec: 5523.7, 300 sec: 5528.5). Total num frames: 1223642112. Throughput: 0: 5759.4. Samples: 1223651424. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:54,963][25689] Avg episode reward: [(0, '-0.252')] [2022-07-11 12:38:56,542][26022] Updated weights on worker 0-0, policy_version 1194973 (0.00085) [2022-07-11 12:38:58,360][26022] Updated weights on worker 0-0, policy_version 1194983 (0.00085) [2022-07-11 12:38:59,995][25689] Fps is (10 sec: 5503.8, 60 sec: 5523.5, 300 sec: 5530.8). Total num frames: 1223669760. Throughput: 0: 5780.9. Samples: 1223668294. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:38:59,995][25689] Avg episode reward: [(0, '-1.787')] [2022-07-11 12:39:00,327][26022] Updated weights on worker 0-0, policy_version 1194993 (0.00093) [2022-07-11 12:39:02,567][26022] Updated weights on worker 0-0, policy_version 1195003 (0.00091) [2022-07-11 12:39:04,447][26022] Updated weights on worker 0-0, policy_version 1195013 (0.00087) [2022-07-11 12:39:05,108][25689] Fps is (10 sec: 5349.4, 60 sec: 5523.5, 300 sec: 5526.4). Total num frames: 1223696384. Throughput: 0: 5682.9. Samples: 1223699590. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:05,109][25689] Avg episode reward: [(0, '-1.763')] [2022-07-11 12:39:06,288][26022] Updated weights on worker 0-0, policy_version 1195023 (0.00088) [2022-07-11 12:39:07,976][26022] Updated weights on worker 0-0, policy_version 1195033 (0.00090) [2022-07-11 12:39:10,115][25689] Fps is (10 sec: 5261.4, 60 sec: 5476.0, 300 sec: 5523.0). Total num frames: 1223723008. Throughput: 0: 5671.0. Samples: 1223732834. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:10,116][25689] Avg episode reward: [(0, '-1.736')] [2022-07-11 12:39:10,181][26022] Updated weights on worker 0-0, policy_version 1195043 (0.00090) [2022-07-11 12:39:11,606][26022] Updated weights on worker 0-0, policy_version 1195053 (0.00094) [2022-07-11 12:39:13,793][26022] Updated weights on worker 0-0, policy_version 1195063 (0.00090) [2022-07-11 12:39:15,129][25689] Fps is (10 sec: 5620.5, 60 sec: 5527.9, 300 sec: 5530.0). Total num frames: 1223752704. Throughput: 0: 4865.5. Samples: 1223749542. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:15,130][25689] Avg episode reward: [(0, '-0.242')] [2022-07-11 12:39:15,389][26022] Updated weights on worker 0-0, policy_version 1195073 (0.00085) [2022-07-11 12:39:17,547][26022] Updated weights on worker 0-0, policy_version 1195083 (0.00093) [2022-07-11 12:39:19,004][26022] Updated weights on worker 0-0, policy_version 1195093 (0.00096) [2022-07-11 12:39:20,135][25689] Fps is (10 sec: 5621.0, 60 sec: 5499.9, 300 sec: 5523.9). Total num frames: 1223779328. Throughput: 0: 5699.4. Samples: 1223783080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:20,135][25689] Avg episode reward: [(0, '-0.187')] [2022-07-11 12:39:21,144][26022] Updated weights on worker 0-0, policy_version 1195103 (0.00090) [2022-07-11 12:39:22,808][26022] Updated weights on worker 0-0, policy_version 1195113 (0.00095) [2022-07-11 12:39:24,836][26022] Updated weights on worker 0-0, policy_version 1195123 (0.00093) [2022-07-11 12:39:25,237][25689] Fps is (10 sec: 5470.4, 60 sec: 5480.8, 300 sec: 5522.1). Total num frames: 1223808000. Throughput: 0: 5796.7. Samples: 1223816270. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:25,238][25689] Avg episode reward: [(0, '-0.153')] [2022-07-11 12:39:26,414][26022] Updated weights on worker 0-0, policy_version 1195133 (0.00485) [2022-07-11 12:39:28,619][26022] Updated weights on worker 0-0, policy_version 1195143 (0.00090) [2022-07-11 12:39:30,099][26022] Updated weights on worker 0-0, policy_version 1195153 (0.00084) [2022-07-11 12:39:30,298][25689] Fps is (10 sec: 5642.5, 60 sec: 5526.4, 300 sec: 5529.5). Total num frames: 1223836672. Throughput: 0: 4953.1. Samples: 1223832800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:30,298][25689] Avg episode reward: [(0, '0.620')] [2022-07-11 12:39:32,361][26022] Updated weights on worker 0-0, policy_version 1195163 (0.00111) [2022-07-11 12:39:33,913][26022] Updated weights on worker 0-0, policy_version 1195173 (0.00090) [2022-07-11 12:39:35,364][25689] Fps is (10 sec: 5460.3, 60 sec: 5488.4, 300 sec: 5525.2). Total num frames: 1223863296. Throughput: 0: 5765.5. Samples: 1223866208. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:35,364][25689] Avg episode reward: [(0, '0.939')] [2022-07-11 12:39:35,899][26022] Updated weights on worker 0-0, policy_version 1195183 (0.00095) [2022-07-11 12:39:37,549][26022] Updated weights on worker 0-0, policy_version 1195193 (0.00509) [2022-07-11 12:39:39,611][26022] Updated weights on worker 0-0, policy_version 1195203 (0.00087) [2022-07-11 12:39:40,446][25689] Fps is (10 sec: 5448.7, 60 sec: 5501.7, 300 sec: 5525.4). Total num frames: 1223891968. Throughput: 0: 5738.6. Samples: 1223899640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:40,448][25689] Avg episode reward: [(0, '0.900')] [2022-07-11 12:39:41,315][26022] Updated weights on worker 0-0, policy_version 1195213 (0.00083) [2022-07-11 12:39:43,423][26022] Updated weights on worker 0-0, policy_version 1195223 (0.00089) [2022-07-11 12:39:44,792][26022] Updated weights on worker 0-0, policy_version 1195233 (0.00094) [2022-07-11 12:39:45,492][25689] Fps is (10 sec: 5661.8, 60 sec: 5524.7, 300 sec: 5524.8). Total num frames: 1223920640. Throughput: 0: 4931.6. Samples: 1223916160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:45,493][25689] Avg episode reward: [(0, '-0.093')] [2022-07-11 12:39:47,047][26022] Updated weights on worker 0-0, policy_version 1195243 (0.00086) [2022-07-11 12:39:48,615][26022] Updated weights on worker 0-0, policy_version 1195253 (0.00094) [2022-07-11 12:39:50,501][25689] Fps is (10 sec: 5703.2, 60 sec: 5529.8, 300 sec: 5528.5). Total num frames: 1223949312. Throughput: 0: 5786.6. Samples: 1223949708. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:50,503][25689] Avg episode reward: [(0, '-0.048')] [2022-07-11 12:39:50,507][26022] Updated weights on worker 0-0, policy_version 1195263 (0.00078) [2022-07-11 12:39:52,454][26022] Updated weights on worker 0-0, policy_version 1195273 (0.00686) [2022-07-11 12:39:54,392][26022] Updated weights on worker 0-0, policy_version 1195283 (0.00095) [2022-07-11 12:39:55,519][25689] Fps is (10 sec: 5616.9, 60 sec: 5529.5, 300 sec: 5528.9). Total num frames: 1223976960. Throughput: 0: 5799.5. Samples: 1223983098. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:39:55,520][25689] Avg episode reward: [(0, '0.735')] [2022-07-11 12:39:56,035][26022] Updated weights on worker 0-0, policy_version 1195293 (0.00097) [2022-07-11 12:39:58,058][26022] Updated weights on worker 0-0, policy_version 1195303 (0.00086) [2022-07-11 12:39:59,699][26022] Updated weights on worker 0-0, policy_version 1195313 (0.00095) [2022-07-11 12:40:00,547][25689] Fps is (10 sec: 5402.5, 60 sec: 5513.0, 300 sec: 5531.0). Total num frames: 1224003584. Throughput: 0: 4985.0. Samples: 1223999840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:00,548][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 12:40:01,800][26022] Updated weights on worker 0-0, policy_version 1195323 (0.00083) [2022-07-11 12:40:04,020][26022] Updated weights on worker 0-0, policy_version 1195333 (0.00087) [2022-07-11 12:40:05,659][25689] Fps is (10 sec: 5251.1, 60 sec: 5513.1, 300 sec: 5522.4). Total num frames: 1224030208. Throughput: 0: 5700.9. Samples: 1224031132. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:05,662][25689] Avg episode reward: [(0, '0.225')] [2022-07-11 12:40:05,840][26022] Updated weights on worker 0-0, policy_version 1195343 (0.00093) [2022-07-11 12:40:07,525][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:40:07,534][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001195353_1224041472.pth [2022-07-11 12:40:07,535][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001193408_1222049792.pth [2022-07-11 12:40:07,548][26022] Updated weights on worker 0-0, policy_version 1195353 (0.00097) [2022-07-11 12:40:09,488][26022] Updated weights on worker 0-0, policy_version 1195363 (0.00084) [2022-07-11 12:40:10,670][25689] Fps is (10 sec: 5462.3, 60 sec: 5546.6, 300 sec: 5530.6). Total num frames: 1224058880. Throughput: 0: 5695.6. Samples: 1224064582. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:10,672][25689] Avg episode reward: [(0, '0.445')] [2022-07-11 12:40:11,320][26022] Updated weights on worker 0-0, policy_version 1195373 (0.00092) [2022-07-11 12:40:13,327][26022] Updated weights on worker 0-0, policy_version 1195383 (0.00083) [2022-07-11 12:40:14,835][26022] Updated weights on worker 0-0, policy_version 1195393 (0.00089) [2022-07-11 12:40:15,675][25689] Fps is (10 sec: 5521.1, 60 sec: 5496.7, 300 sec: 5523.7). Total num frames: 1224085504. Throughput: 0: 4858.9. Samples: 1224081032. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:15,676][25689] Avg episode reward: [(0, '1.577')] [2022-07-11 12:40:17,023][26022] Updated weights on worker 0-0, policy_version 1195403 (0.00091) [2022-07-11 12:40:18,395][26022] Updated weights on worker 0-0, policy_version 1195413 (0.00084) [2022-07-11 12:40:20,713][25689] Fps is (10 sec: 5403.6, 60 sec: 5510.6, 300 sec: 5518.0). Total num frames: 1224113152. Throughput: 0: 5669.3. Samples: 1224114172. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:20,714][25689] Avg episode reward: [(0, '1.266')] [2022-07-11 12:40:20,813][26022] Updated weights on worker 0-0, policy_version 1195424 (0.00086) [2022-07-11 12:40:22,516][26022] Updated weights on worker 0-0, policy_version 1195434 (0.00050) [2022-07-11 12:40:24,349][26022] Updated weights on worker 0-0, policy_version 1195444 (0.00091) [2022-07-11 12:40:25,807][25689] Fps is (10 sec: 5659.5, 60 sec: 5528.3, 300 sec: 5530.1). Total num frames: 1224142848. Throughput: 0: 5790.7. Samples: 1224147800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:25,808][25689] Avg episode reward: [(0, '1.259')] [2022-07-11 12:40:26,144][26022] Updated weights on worker 0-0, policy_version 1195454 (0.00091) [2022-07-11 12:40:27,981][26022] Updated weights on worker 0-0, policy_version 1195464 (0.00094) [2022-07-11 12:40:29,777][26022] Updated weights on worker 0-0, policy_version 1195474 (0.00091) [2022-07-11 12:40:30,814][25689] Fps is (10 sec: 5576.0, 60 sec: 5499.4, 300 sec: 5516.3). Total num frames: 1224169472. Throughput: 0: 4963.4. Samples: 1224164564. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:30,814][25689] Avg episode reward: [(0, '1.320')] [2022-07-11 12:40:31,796][26022] Updated weights on worker 0-0, policy_version 1195484 (0.00083) [2022-07-11 12:40:33,412][26022] Updated weights on worker 0-0, policy_version 1195494 (0.00089) [2022-07-11 12:40:35,643][26022] Updated weights on worker 0-0, policy_version 1195504 (0.00089) [2022-07-11 12:40:35,820][25689] Fps is (10 sec: 5522.5, 60 sec: 5538.7, 300 sec: 5530.3). Total num frames: 1224198144. Throughput: 0: 5799.7. Samples: 1224197868. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2022-07-11 12:40:35,821][25689] Avg episode reward: [(0, '0.695')] [2022-07-11 12:40:37,106][26022] Updated weights on worker 0-0, policy_version 1195514 (0.00086) [2022-07-11 12:40:39,268][26022] Updated weights on worker 0-0, policy_version 1195524 (0.00094) [2022-07-11 12:40:40,760][26022] Updated weights on worker 0-0, policy_version 1195534 (0.00082) [2022-07-11 12:40:40,842][25689] Fps is (10 sec: 5718.3, 60 sec: 5544.2, 300 sec: 5524.6). Total num frames: 1224226816. Throughput: 0: 5805.8. Samples: 1224231034. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:40:40,842][25689] Avg episode reward: [(0, '-0.077')] [2022-07-11 12:40:42,913][26022] Updated weights on worker 0-0, policy_version 1195544 (0.00086) [2022-07-11 12:40:44,602][26022] Updated weights on worker 0-0, policy_version 1195554 (0.00096) [2022-07-11 12:40:45,923][25689] Fps is (10 sec: 5371.9, 60 sec: 5490.2, 300 sec: 5520.9). Total num frames: 1224252416. Throughput: 0: 4965.3. Samples: 1224247680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:40:45,923][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 12:40:46,688][26022] Updated weights on worker 0-0, policy_version 1195564 (0.00092) [2022-07-11 12:40:48,349][26022] Updated weights on worker 0-0, policy_version 1195574 (0.00093) [2022-07-11 12:40:50,383][26022] Updated weights on worker 0-0, policy_version 1195584 (0.00090) [2022-07-11 12:40:51,008][25689] Fps is (10 sec: 5539.9, 60 sec: 5517.1, 300 sec: 5529.7). Total num frames: 1224283136. Throughput: 0: 5760.6. Samples: 1224280896. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:40:51,009][25689] Avg episode reward: [(0, '-0.068')] [2022-07-11 12:40:51,859][26022] Updated weights on worker 0-0, policy_version 1195594 (0.00298) [2022-07-11 12:40:54,072][26022] Updated weights on worker 0-0, policy_version 1195604 (0.00096) [2022-07-11 12:40:55,731][26022] Updated weights on worker 0-0, policy_version 1195614 (0.00089) [2022-07-11 12:40:56,009][25689] Fps is (10 sec: 5685.2, 60 sec: 5501.7, 300 sec: 5520.5). Total num frames: 1224309760. Throughput: 0: 5778.2. Samples: 1224314526. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:40:56,010][25689] Avg episode reward: [(0, '-0.259')] [2022-07-11 12:40:57,708][26022] Updated weights on worker 0-0, policy_version 1195624 (0.00091) [2022-07-11 12:40:59,489][26022] Updated weights on worker 0-0, policy_version 1195634 (0.00082) [2022-07-11 12:41:01,031][25689] Fps is (10 sec: 5415.0, 60 sec: 5519.2, 300 sec: 5532.6). Total num frames: 1224337408. Throughput: 0: 4964.4. Samples: 1224331260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:01,031][25689] Avg episode reward: [(0, '0.278')] [2022-07-11 12:41:01,228][26022] Updated weights on worker 0-0, policy_version 1195644 (0.00089) [2022-07-11 12:41:03,656][26022] Updated weights on worker 0-0, policy_version 1195654 (0.00094) [2022-07-11 12:41:05,256][26022] Updated weights on worker 0-0, policy_version 1195664 (0.00092) [2022-07-11 12:41:06,076][25689] Fps is (10 sec: 5391.3, 60 sec: 5525.4, 300 sec: 5522.2). Total num frames: 1224364032. Throughput: 0: 5699.9. Samples: 1224362550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:06,076][25689] Avg episode reward: [(0, '0.916')] [2022-07-11 12:41:07,084][26022] Updated weights on worker 0-0, policy_version 1195674 (0.00073) [2022-07-11 12:41:09,011][26022] Updated weights on worker 0-0, policy_version 1195684 (0.00084) [2022-07-11 12:41:10,762][26022] Updated weights on worker 0-0, policy_version 1195694 (0.00089) [2022-07-11 12:41:11,142][25689] Fps is (10 sec: 5367.3, 60 sec: 5503.3, 300 sec: 5521.1). Total num frames: 1224391680. Throughput: 0: 5709.5. Samples: 1224395852. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:11,143][25689] Avg episode reward: [(0, '0.765')] [2022-07-11 12:41:12,504][26022] Updated weights on worker 0-0, policy_version 1195704 (0.00090) [2022-07-11 12:41:14,635][26022] Updated weights on worker 0-0, policy_version 1195714 (0.00085) [2022-07-11 12:41:16,151][25689] Fps is (10 sec: 5488.7, 60 sec: 5520.0, 300 sec: 5521.4). Total num frames: 1224419328. Throughput: 0: 5685.7. Samples: 1224429044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:16,151][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 12:41:16,284][26022] Updated weights on worker 0-0, policy_version 1195724 (0.00092) [2022-07-11 12:41:18,403][26022] Updated weights on worker 0-0, policy_version 1195734 (0.00097) [2022-07-11 12:41:20,033][26022] Updated weights on worker 0-0, policy_version 1195744 (0.00079) [2022-07-11 12:41:21,210][25689] Fps is (10 sec: 5492.7, 60 sec: 5518.1, 300 sec: 5522.5). Total num frames: 1224446976. Throughput: 0: 5675.9. Samples: 1224445792. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:21,210][25689] Avg episode reward: [(0, '1.227')] [2022-07-11 12:41:22,001][26022] Updated weights on worker 0-0, policy_version 1195754 (0.00087) [2022-07-11 12:41:23,707][26022] Updated weights on worker 0-0, policy_version 1195764 (0.00089) [2022-07-11 12:41:25,832][26022] Updated weights on worker 0-0, policy_version 1195774 (0.00096) [2022-07-11 12:41:26,261][25689] Fps is (10 sec: 5570.3, 60 sec: 5505.0, 300 sec: 5522.5). Total num frames: 1224475648. Throughput: 0: 5776.6. Samples: 1224479152. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:26,262][25689] Avg episode reward: [(0, '1.620')] [2022-07-11 12:41:27,321][26022] Updated weights on worker 0-0, policy_version 1195784 (0.00082) [2022-07-11 12:41:29,373][26022] Updated weights on worker 0-0, policy_version 1195794 (0.00086) [2022-07-11 12:41:31,179][26022] Updated weights on worker 0-0, policy_version 1195804 (0.00087) [2022-07-11 12:41:31,327][25689] Fps is (10 sec: 5566.7, 60 sec: 5516.6, 300 sec: 5521.9). Total num frames: 1224503296. Throughput: 0: 5781.9. Samples: 1224512556. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:31,327][25689] Avg episode reward: [(0, '1.526')] [2022-07-11 12:41:32,815][26022] Updated weights on worker 0-0, policy_version 1195814 (0.00082) [2022-07-11 12:41:35,115][26022] Updated weights on worker 0-0, policy_version 1195824 (0.00084) [2022-07-11 12:41:36,355][25689] Fps is (10 sec: 5680.9, 60 sec: 5531.4, 300 sec: 5524.9). Total num frames: 1224532992. Throughput: 0: 4962.8. Samples: 1224529316. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:36,356][25689] Avg episode reward: [(0, '0.432')] [2022-07-11 12:41:36,526][26022] Updated weights on worker 0-0, policy_version 1195834 (0.00086) [2022-07-11 12:41:38,604][26022] Updated weights on worker 0-0, policy_version 1195844 (0.00087) [2022-07-11 12:41:40,517][26022] Updated weights on worker 0-0, policy_version 1195854 (0.00085) [2022-07-11 12:41:41,408][25689] Fps is (10 sec: 5484.8, 60 sec: 5477.9, 300 sec: 5515.6). Total num frames: 1224558592. Throughput: 0: 5789.8. Samples: 1224562740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:41,409][25689] Avg episode reward: [(0, '0.412')] [2022-07-11 12:41:42,215][26022] Updated weights on worker 0-0, policy_version 1195864 (0.00088) [2022-07-11 12:41:44,166][26022] Updated weights on worker 0-0, policy_version 1195874 (0.00091) [2022-07-11 12:41:45,912][26022] Updated weights on worker 0-0, policy_version 1195884 (0.00087) [2022-07-11 12:41:46,508][25689] Fps is (10 sec: 5446.6, 60 sec: 5543.8, 300 sec: 5521.5). Total num frames: 1224588288. Throughput: 0: 5764.6. Samples: 1224595864. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:46,509][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 12:41:47,963][26022] Updated weights on worker 0-0, policy_version 1195894 (0.00089) [2022-07-11 12:41:49,642][26022] Updated weights on worker 0-0, policy_version 1195904 (0.00090) [2022-07-11 12:41:51,519][25689] Fps is (10 sec: 5570.6, 60 sec: 5482.9, 300 sec: 5515.1). Total num frames: 1224614912. Throughput: 0: 4947.1. Samples: 1224612448. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:51,519][25689] Avg episode reward: [(0, '-0.187')] [2022-07-11 12:41:51,826][26022] Updated weights on worker 0-0, policy_version 1195914 (0.00090) [2022-07-11 12:41:53,355][26022] Updated weights on worker 0-0, policy_version 1195924 (0.00085) [2022-07-11 12:41:55,302][26022] Updated weights on worker 0-0, policy_version 1195934 (0.00083) [2022-07-11 12:41:56,527][25689] Fps is (10 sec: 5519.2, 60 sec: 5516.2, 300 sec: 5518.5). Total num frames: 1224643584. Throughput: 0: 5777.0. Samples: 1224645846. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:41:56,527][25689] Avg episode reward: [(0, '-0.274')] [2022-07-11 12:41:56,865][26022] Updated weights on worker 0-0, policy_version 1195944 (0.00084) [2022-07-11 12:41:59,003][26022] Updated weights on worker 0-0, policy_version 1195954 (0.00086) [2022-07-11 12:42:00,776][26022] Updated weights on worker 0-0, policy_version 1195964 (0.00086) [2022-07-11 12:42:01,561][25689] Fps is (10 sec: 5608.5, 60 sec: 5515.0, 300 sec: 5523.3). Total num frames: 1224671232. Throughput: 0: 5789.6. Samples: 1224679414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:01,561][25689] Avg episode reward: [(0, '-0.174')] [2022-07-11 12:42:02,874][26022] Updated weights on worker 0-0, policy_version 1195974 (0.00091) [2022-07-11 12:42:04,773][26022] Updated weights on worker 0-0, policy_version 1195984 (0.00085) [2022-07-11 12:42:06,588][26022] Updated weights on worker 0-0, policy_version 1195994 (0.00086) [2022-07-11 12:42:06,722][25689] Fps is (10 sec: 5323.2, 60 sec: 5504.5, 300 sec: 5520.4). Total num frames: 1224697856. Throughput: 0: 4863.1. Samples: 1224694168. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:06,723][25689] Avg episode reward: [(0, '-0.773')] [2022-07-11 12:42:07,686][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:42:07,698][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001195999_1224702976.pth [2022-07-11 12:42:07,698][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001194055_1222712320.pth [2022-07-11 12:42:08,488][26022] Updated weights on worker 0-0, policy_version 1196004 (0.00085) [2022-07-11 12:42:10,562][26022] Updated weights on worker 0-0, policy_version 1196014 (0.00097) [2022-07-11 12:42:11,797][25689] Fps is (10 sec: 5501.8, 60 sec: 5537.4, 300 sec: 5522.5). Total num frames: 1224727552. Throughput: 0: 5675.3. Samples: 1224727536. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:11,799][25689] Avg episode reward: [(0, '-1.629')] [2022-07-11 12:42:12,079][26022] Updated weights on worker 0-0, policy_version 1196024 (0.00091) [2022-07-11 12:42:14,004][26022] Updated weights on worker 0-0, policy_version 1196034 (0.00086) [2022-07-11 12:42:15,746][26022] Updated weights on worker 0-0, policy_version 1196044 (0.00091) [2022-07-11 12:42:16,875][25689] Fps is (10 sec: 5446.6, 60 sec: 5497.4, 300 sec: 5514.4). Total num frames: 1224753152. Throughput: 0: 5646.4. Samples: 1224760738. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:16,875][25689] Avg episode reward: [(0, '-1.812')] [2022-07-11 12:42:17,640][26022] Updated weights on worker 0-0, policy_version 1196054 (0.00107) [2022-07-11 12:42:19,748][26022] Updated weights on worker 0-0, policy_version 1196064 (0.00089) [2022-07-11 12:42:21,255][26022] Updated weights on worker 0-0, policy_version 1196074 (0.00081) [2022-07-11 12:42:21,938][25689] Fps is (10 sec: 5352.1, 60 sec: 5513.9, 300 sec: 5511.0). Total num frames: 1224781824. Throughput: 0: 4807.0. Samples: 1224777366. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:21,938][25689] Avg episode reward: [(0, '-1.771')] [2022-07-11 12:42:23,224][26022] Updated weights on worker 0-0, policy_version 1196084 (0.00080) [2022-07-11 12:42:25,071][26022] Updated weights on worker 0-0, policy_version 1196094 (0.00088) [2022-07-11 12:42:26,866][26022] Updated weights on worker 0-0, policy_version 1196104 (0.00090) [2022-07-11 12:42:27,004][25689] Fps is (10 sec: 5660.9, 60 sec: 5512.5, 300 sec: 5520.3). Total num frames: 1224810496. Throughput: 0: 5766.4. Samples: 1224811124. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:27,006][25689] Avg episode reward: [(0, '-2.889')] [2022-07-11 12:42:28,800][26022] Updated weights on worker 0-0, policy_version 1196114 (0.00088) [2022-07-11 12:42:30,489][26022] Updated weights on worker 0-0, policy_version 1196124 (0.00090) [2022-07-11 12:42:32,019][25689] Fps is (10 sec: 5586.4, 60 sec: 5517.1, 300 sec: 5514.1). Total num frames: 1224838144. Throughput: 0: 5775.5. Samples: 1224844328. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:32,021][25689] Avg episode reward: [(0, '-2.871')] [2022-07-11 12:42:32,499][26022] Updated weights on worker 0-0, policy_version 1196134 (0.00075) [2022-07-11 12:42:34,273][26022] Updated weights on worker 0-0, policy_version 1196144 (0.00090) [2022-07-11 12:42:35,960][26022] Updated weights on worker 0-0, policy_version 1196154 (0.00089) [2022-07-11 12:42:37,073][25689] Fps is (10 sec: 5695.1, 60 sec: 5514.8, 300 sec: 5520.4). Total num frames: 1224867840. Throughput: 0: 4962.8. Samples: 1224860980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:37,075][25689] Avg episode reward: [(0, '-0.724')] [2022-07-11 12:42:38,023][26022] Updated weights on worker 0-0, policy_version 1196164 (0.00088) [2022-07-11 12:42:39,765][26022] Updated weights on worker 0-0, policy_version 1196174 (0.00080) [2022-07-11 12:42:41,795][26022] Updated weights on worker 0-0, policy_version 1196184 (0.00084) [2022-07-11 12:42:42,137][25689] Fps is (10 sec: 5465.3, 60 sec: 5513.9, 300 sec: 5514.8). Total num frames: 1224893440. Throughput: 0: 5797.8. Samples: 1224894476. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:42,137][25689] Avg episode reward: [(0, '-0.620')] [2022-07-11 12:42:43,248][26022] Updated weights on worker 0-0, policy_version 1196194 (0.00077) [2022-07-11 12:42:45,475][26022] Updated weights on worker 0-0, policy_version 1196204 (0.00085) [2022-07-11 12:42:47,173][26022] Updated weights on worker 0-0, policy_version 1196214 (0.00085) [2022-07-11 12:42:47,208][25689] Fps is (10 sec: 5456.1, 60 sec: 5516.4, 300 sec: 5520.8). Total num frames: 1224923136. Throughput: 0: 5762.5. Samples: 1224927548. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:47,209][25689] Avg episode reward: [(0, '-0.384')] [2022-07-11 12:42:49,154][26022] Updated weights on worker 0-0, policy_version 1196224 (0.00092) [2022-07-11 12:42:50,847][26022] Updated weights on worker 0-0, policy_version 1196234 (0.00095) [2022-07-11 12:42:52,243][25689] Fps is (10 sec: 5572.8, 60 sec: 5514.2, 300 sec: 5513.6). Total num frames: 1224949760. Throughput: 0: 4931.5. Samples: 1224944062. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:52,243][25689] Avg episode reward: [(0, '0.741')] [2022-07-11 12:42:52,779][26022] Updated weights on worker 0-0, policy_version 1196244 (0.00086) [2022-07-11 12:42:54,719][26022] Updated weights on worker 0-0, policy_version 1196254 (0.00081) [2022-07-11 12:42:56,441][26022] Updated weights on worker 0-0, policy_version 1196264 (0.00540) [2022-07-11 12:42:57,282][25689] Fps is (10 sec: 5489.1, 60 sec: 5511.4, 300 sec: 5516.8). Total num frames: 1224978432. Throughput: 0: 5760.1. Samples: 1224977382. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:42:57,282][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 12:42:58,388][26022] Updated weights on worker 0-0, policy_version 1196274 (0.00093) [2022-07-11 12:43:00,080][26022] Updated weights on worker 0-0, policy_version 1196284 (0.00108) [2022-07-11 12:43:02,018][26022] Updated weights on worker 0-0, policy_version 1196294 (0.00082) [2022-07-11 12:43:02,341][25689] Fps is (10 sec: 5476.1, 60 sec: 5492.3, 300 sec: 5517.8). Total num frames: 1225005056. Throughput: 0: 5745.1. Samples: 1225010550. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:02,341][25689] Avg episode reward: [(0, '1.667')] [2022-07-11 12:43:04,347][26022] Updated weights on worker 0-0, policy_version 1196304 (0.00090) [2022-07-11 12:43:05,990][26022] Updated weights on worker 0-0, policy_version 1196314 (0.00091) [2022-07-11 12:43:07,531][25689] Fps is (10 sec: 5295.4, 60 sec: 5506.6, 300 sec: 5508.2). Total num frames: 1225032704. Throughput: 0: 5617.3. Samples: 1225041706. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:07,531][25689] Avg episode reward: [(0, '1.716')] [2022-07-11 12:43:08,118][26022] Updated weights on worker 0-0, policy_version 1196324 (0.00095) [2022-07-11 12:43:09,749][26022] Updated weights on worker 0-0, policy_version 1196334 (0.00094) [2022-07-11 12:43:11,655][26022] Updated weights on worker 0-0, policy_version 1196344 (0.00094) [2022-07-11 12:43:12,548][25689] Fps is (10 sec: 5517.4, 60 sec: 5495.0, 300 sec: 5515.3). Total num frames: 1225061376. Throughput: 0: 5634.2. Samples: 1225058468. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:12,549][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 12:43:13,458][26022] Updated weights on worker 0-0, policy_version 1196354 (0.00086) [2022-07-11 12:43:15,191][26022] Updated weights on worker 0-0, policy_version 1196364 (0.00084) [2022-07-11 12:43:17,230][26022] Updated weights on worker 0-0, policy_version 1196374 (0.00107) [2022-07-11 12:43:17,577][25689] Fps is (10 sec: 5606.3, 60 sec: 5533.1, 300 sec: 5512.6). Total num frames: 1225089024. Throughput: 0: 5648.7. Samples: 1225092022. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:17,577][25689] Avg episode reward: [(0, '1.558')] [2022-07-11 12:43:18,883][26022] Updated weights on worker 0-0, policy_version 1196384 (0.00093) [2022-07-11 12:43:20,816][26022] Updated weights on worker 0-0, policy_version 1196394 (0.00085) [2022-07-11 12:43:22,588][25689] Fps is (10 sec: 5609.6, 60 sec: 5537.8, 300 sec: 5510.4). Total num frames: 1225117696. Throughput: 0: 5683.3. Samples: 1225125624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:22,589][25689] Avg episode reward: [(0, '1.318')] [2022-07-11 12:43:22,596][26022] Updated weights on worker 0-0, policy_version 1196404 (0.00077) [2022-07-11 12:43:24,510][26022] Updated weights on worker 0-0, policy_version 1196414 (0.00095) [2022-07-11 12:43:26,358][26022] Updated weights on worker 0-0, policy_version 1196424 (0.00092) [2022-07-11 12:43:27,698][25689] Fps is (10 sec: 5564.7, 60 sec: 5517.1, 300 sec: 5515.4). Total num frames: 1225145344. Throughput: 0: 4987.5. Samples: 1225142288. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:27,698][25689] Avg episode reward: [(0, '1.463')] [2022-07-11 12:43:28,261][26022] Updated weights on worker 0-0, policy_version 1196434 (0.00082) [2022-07-11 12:43:30,299][26022] Updated weights on worker 0-0, policy_version 1196444 (0.00085) [2022-07-11 12:43:31,947][26022] Updated weights on worker 0-0, policy_version 1196454 (0.00102) [2022-07-11 12:43:32,744][25689] Fps is (10 sec: 5444.7, 60 sec: 5514.2, 300 sec: 5511.4). Total num frames: 1225172992. Throughput: 0: 5789.3. Samples: 1225175388. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:32,745][25689] Avg episode reward: [(0, '1.547')] [2022-07-11 12:43:33,845][26022] Updated weights on worker 0-0, policy_version 1196464 (0.00093) [2022-07-11 12:43:35,479][26022] Updated weights on worker 0-0, policy_version 1196474 (0.00092) [2022-07-11 12:43:37,564][26022] Updated weights on worker 0-0, policy_version 1196484 (0.00086) [2022-07-11 12:43:37,758][25689] Fps is (10 sec: 5598.0, 60 sec: 5500.9, 300 sec: 5515.4). Total num frames: 1225201664. Throughput: 0: 5801.7. Samples: 1225209110. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:37,759][25689] Avg episode reward: [(0, '1.837')] [2022-07-11 12:43:39,123][26022] Updated weights on worker 0-0, policy_version 1196494 (0.00086) [2022-07-11 12:43:41,144][26022] Updated weights on worker 0-0, policy_version 1196504 (0.00425) [2022-07-11 12:43:42,761][25689] Fps is (10 sec: 5725.0, 60 sec: 5557.1, 300 sec: 5520.9). Total num frames: 1225230336. Throughput: 0: 4961.0. Samples: 1225225700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:42,761][25689] Avg episode reward: [(0, '1.556')] [2022-07-11 12:43:42,764][26022] Updated weights on worker 0-0, policy_version 1196514 (0.00081) [2022-07-11 12:43:44,827][26022] Updated weights on worker 0-0, policy_version 1196524 (0.00084) [2022-07-11 12:43:46,592][26022] Updated weights on worker 0-0, policy_version 1196534 (0.00083) [2022-07-11 12:43:47,818][25689] Fps is (10 sec: 5497.0, 60 sec: 5507.8, 300 sec: 5514.1). Total num frames: 1225256960. Throughput: 0: 5830.1. Samples: 1225259590. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:47,819][25689] Avg episode reward: [(0, '1.523')] [2022-07-11 12:43:48,523][26022] Updated weights on worker 0-0, policy_version 1196544 (0.00086) [2022-07-11 12:43:50,254][26022] Updated weights on worker 0-0, policy_version 1196554 (0.00087) [2022-07-11 12:43:52,175][26022] Updated weights on worker 0-0, policy_version 1196564 (0.00084) [2022-07-11 12:43:52,832][25689] Fps is (10 sec: 5490.4, 60 sec: 5543.4, 300 sec: 5517.6). Total num frames: 1225285632. Throughput: 0: 5831.0. Samples: 1225292520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:52,833][25689] Avg episode reward: [(0, '1.143')] [2022-07-11 12:43:53,960][26022] Updated weights on worker 0-0, policy_version 1196574 (0.00054) [2022-07-11 12:43:55,927][26022] Updated weights on worker 0-0, policy_version 1196584 (0.00087) [2022-07-11 12:43:57,531][26022] Updated weights on worker 0-0, policy_version 1196594 (0.00088) [2022-07-11 12:43:57,909][25689] Fps is (10 sec: 5682.6, 60 sec: 5540.0, 300 sec: 5520.2). Total num frames: 1225314304. Throughput: 0: 4973.6. Samples: 1225309330. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:43:57,909][25689] Avg episode reward: [(0, '0.705')] [2022-07-11 12:43:59,534][26022] Updated weights on worker 0-0, policy_version 1196604 (0.00611) [2022-07-11 12:44:01,153][26022] Updated weights on worker 0-0, policy_version 1196614 (0.00083) [2022-07-11 12:44:02,971][25689] Fps is (10 sec: 5251.9, 60 sec: 5505.9, 300 sec: 5514.3). Total num frames: 1225338880. Throughput: 0: 5822.3. Samples: 1225343370. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:02,971][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 12:44:03,497][26022] Updated weights on worker 0-0, policy_version 1196624 (0.00086) [2022-07-11 12:44:05,275][26022] Updated weights on worker 0-0, policy_version 1196634 (0.00086) [2022-07-11 12:44:07,083][26022] Updated weights on worker 0-0, policy_version 1196644 (0.00086) [2022-07-11 12:44:07,842][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:44:07,851][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001196648_1225367552.pth [2022-07-11 12:44:07,851][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001194705_1223377920.pth [2022-07-11 12:44:08,078][25689] Fps is (10 sec: 5336.8, 60 sec: 5547.2, 300 sec: 5522.7). Total num frames: 1225368576. Throughput: 0: 5678.5. Samples: 1225374640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:08,079][25689] Avg episode reward: [(0, '0.532')] [2022-07-11 12:44:09,023][26022] Updated weights on worker 0-0, policy_version 1196654 (0.00083) [2022-07-11 12:44:10,823][26022] Updated weights on worker 0-0, policy_version 1196664 (0.00082) [2022-07-11 12:44:12,653][26022] Updated weights on worker 0-0, policy_version 1196674 (0.00089) [2022-07-11 12:44:13,095][25689] Fps is (10 sec: 5664.2, 60 sec: 5530.4, 300 sec: 5515.8). Total num frames: 1225396224. Throughput: 0: 4885.6. Samples: 1225391516. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:13,095][25689] Avg episode reward: [(0, '0.594')] [2022-07-11 12:44:14,435][26022] Updated weights on worker 0-0, policy_version 1196684 (0.00081) [2022-07-11 12:44:16,294][26022] Updated weights on worker 0-0, policy_version 1196694 (0.00093) [2022-07-11 12:44:18,108][25689] Fps is (10 sec: 5513.2, 60 sec: 5531.8, 300 sec: 5519.1). Total num frames: 1225423872. Throughput: 0: 5745.6. Samples: 1225425386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:18,109][25689] Avg episode reward: [(0, '1.157')] [2022-07-11 12:44:18,244][26022] Updated weights on worker 0-0, policy_version 1196704 (0.00096) [2022-07-11 12:44:20,035][26022] Updated weights on worker 0-0, policy_version 1196714 (0.00097) [2022-07-11 12:44:21,760][26022] Updated weights on worker 0-0, policy_version 1196724 (0.00092) [2022-07-11 12:44:23,134][25689] Fps is (10 sec: 5610.0, 60 sec: 5530.5, 300 sec: 5520.5). Total num frames: 1225452544. Throughput: 0: 5726.4. Samples: 1225458832. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:23,135][25689] Avg episode reward: [(0, '1.300')] [2022-07-11 12:44:23,675][26022] Updated weights on worker 0-0, policy_version 1196734 (0.00087) [2022-07-11 12:44:25,494][26022] Updated weights on worker 0-0, policy_version 1196744 (0.00086) [2022-07-11 12:44:27,323][26022] Updated weights on worker 0-0, policy_version 1196754 (0.00087) [2022-07-11 12:44:28,263][25689] Fps is (10 sec: 5545.9, 60 sec: 5528.7, 300 sec: 5515.8). Total num frames: 1225480192. Throughput: 0: 5004.0. Samples: 1225475646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:28,264][25689] Avg episode reward: [(0, '0.538')] [2022-07-11 12:44:29,051][26022] Updated weights on worker 0-0, policy_version 1196764 (0.00087) [2022-07-11 12:44:31,043][26022] Updated weights on worker 0-0, policy_version 1196774 (0.00087) [2022-07-11 12:44:32,751][26022] Updated weights on worker 0-0, policy_version 1196784 (0.00085) [2022-07-11 12:44:33,278][25689] Fps is (10 sec: 5551.9, 60 sec: 5548.5, 300 sec: 5523.6). Total num frames: 1225508864. Throughput: 0: 5842.5. Samples: 1225509438. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:33,279][25689] Avg episode reward: [(0, '-0.457')] [2022-07-11 12:44:34,623][26022] Updated weights on worker 0-0, policy_version 1196794 (0.00085) [2022-07-11 12:44:36,418][26022] Updated weights on worker 0-0, policy_version 1196804 (0.00084) [2022-07-11 12:44:38,317][25689] Fps is (10 sec: 5601.7, 60 sec: 5529.3, 300 sec: 5521.0). Total num frames: 1225536512. Throughput: 0: 5822.1. Samples: 1225543048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2022-07-11 12:44:38,318][25689] Avg episode reward: [(0, '-0.460')] [2022-07-11 12:44:38,362][26022] Updated weights on worker 0-0, policy_version 1196814 (0.00089) [2022-07-11 12:44:40,163][26022] Updated weights on worker 0-0, policy_version 1196824 (0.00096) [2022-07-11 12:44:41,917][26022] Updated weights on worker 0-0, policy_version 1196834 (0.00090) [2022-07-11 12:44:43,381][25689] Fps is (10 sec: 5574.8, 60 sec: 5523.7, 300 sec: 5520.7). Total num frames: 1225565184. Throughput: 0: 4984.2. Samples: 1225559746. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:44:43,381][25689] Avg episode reward: [(0, '-0.490')] [2022-07-11 12:44:43,871][26022] Updated weights on worker 0-0, policy_version 1196844 (0.00095) [2022-07-11 12:44:45,624][26022] Updated weights on worker 0-0, policy_version 1196854 (0.00087) [2022-07-11 12:44:47,425][26022] Updated weights on worker 0-0, policy_version 1196864 (0.00097) [2022-07-11 12:44:48,441][25689] Fps is (10 sec: 5664.2, 60 sec: 5557.2, 300 sec: 5519.7). Total num frames: 1225593856. Throughput: 0: 5814.2. Samples: 1225592966. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:44:48,442][25689] Avg episode reward: [(0, '-0.549')] [2022-07-11 12:44:49,314][26022] Updated weights on worker 0-0, policy_version 1196874 (0.00079) [2022-07-11 12:44:51,035][26022] Updated weights on worker 0-0, policy_version 1196884 (0.00087) [2022-07-11 12:44:53,052][26022] Updated weights on worker 0-0, policy_version 1196894 (0.00083) [2022-07-11 12:44:53,463][25689] Fps is (10 sec: 5484.4, 60 sec: 5522.7, 300 sec: 5516.2). Total num frames: 1225620480. Throughput: 0: 5800.8. Samples: 1225626528. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:44:53,467][25689] Avg episode reward: [(0, '-2.231')] [2022-07-11 12:44:54,687][26022] Updated weights on worker 0-0, policy_version 1196904 (0.00092) [2022-07-11 12:44:56,787][26022] Updated weights on worker 0-0, policy_version 1196914 (0.00083) [2022-07-11 12:44:58,497][25689] Fps is (10 sec: 5498.9, 60 sec: 5526.6, 300 sec: 5523.0). Total num frames: 1225649152. Throughput: 0: 4964.0. Samples: 1225643220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:44:58,498][25689] Avg episode reward: [(0, '-0.826')] [2022-07-11 12:44:58,520][26022] Updated weights on worker 0-0, policy_version 1196924 (0.00093) [2022-07-11 12:45:00,219][26022] Updated weights on worker 0-0, policy_version 1196934 (0.00082) [2022-07-11 12:45:02,489][26022] Updated weights on worker 0-0, policy_version 1196944 (0.00085) [2022-07-11 12:45:03,501][25689] Fps is (10 sec: 5712.6, 60 sec: 5599.5, 300 sec: 5531.8). Total num frames: 1225677824. Throughput: 0: 5825.1. Samples: 1225676950. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:03,502][25689] Avg episode reward: [(0, '-1.149')] [2022-07-11 12:45:04,512][26022] Updated weights on worker 0-0, policy_version 1196954 (0.00088) [2022-07-11 12:45:06,122][26022] Updated weights on worker 0-0, policy_version 1196964 (0.00089) [2022-07-11 12:45:07,997][26022] Updated weights on worker 0-0, policy_version 1196974 (0.00088) [2022-07-11 12:45:08,633][25689] Fps is (10 sec: 5455.7, 60 sec: 5546.6, 300 sec: 5522.7). Total num frames: 1225704448. Throughput: 0: 5737.5. Samples: 1225708814. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:08,633][25689] Avg episode reward: [(0, '-1.354')] [2022-07-11 12:45:09,637][26022] Updated weights on worker 0-0, policy_version 1196984 (0.00090) [2022-07-11 12:45:11,845][26022] Updated weights on worker 0-0, policy_version 1196994 (0.00088) [2022-07-11 12:45:13,568][26022] Updated weights on worker 0-0, policy_version 1197004 (0.00091) [2022-07-11 12:45:13,683][25689] Fps is (10 sec: 5330.6, 60 sec: 5543.5, 300 sec: 5525.3). Total num frames: 1225732096. Throughput: 0: 5730.9. Samples: 1225742404. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:13,683][25689] Avg episode reward: [(0, '-1.993')] [2022-07-11 12:45:15,515][26022] Updated weights on worker 0-0, policy_version 1197014 (0.00092) [2022-07-11 12:45:17,042][26022] Updated weights on worker 0-0, policy_version 1197024 (0.00087) [2022-07-11 12:45:18,759][25689] Fps is (10 sec: 5561.9, 60 sec: 5554.7, 300 sec: 5528.1). Total num frames: 1225760768. Throughput: 0: 5728.8. Samples: 1225759294. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:18,759][25689] Avg episode reward: [(0, '-2.024')] [2022-07-11 12:45:19,131][26022] Updated weights on worker 0-0, policy_version 1197034 (0.00083) [2022-07-11 12:45:20,831][26022] Updated weights on worker 0-0, policy_version 1197044 (0.00094) [2022-07-11 12:45:22,673][26022] Updated weights on worker 0-0, policy_version 1197054 (0.00086) [2022-07-11 12:45:23,858][25689] Fps is (10 sec: 5635.5, 60 sec: 5548.0, 300 sec: 5524.5). Total num frames: 1225789440. Throughput: 0: 5699.0. Samples: 1225792962. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:23,859][25689] Avg episode reward: [(0, '-1.200')] [2022-07-11 12:45:24,633][26022] Updated weights on worker 0-0, policy_version 1197064 (0.00082) [2022-07-11 12:45:26,250][26022] Updated weights on worker 0-0, policy_version 1197074 (0.00085) [2022-07-11 12:45:28,300][26022] Updated weights on worker 0-0, policy_version 1197084 (0.00088) [2022-07-11 12:45:28,927][25689] Fps is (10 sec: 5538.8, 60 sec: 5553.5, 300 sec: 5526.8). Total num frames: 1225817088. Throughput: 0: 5777.9. Samples: 1225826072. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:28,928][25689] Avg episode reward: [(0, '-1.595')] [2022-07-11 12:45:29,950][26022] Updated weights on worker 0-0, policy_version 1197094 (0.00096) [2022-07-11 12:45:31,863][26022] Updated weights on worker 0-0, policy_version 1197104 (0.00089) [2022-07-11 12:45:33,832][26022] Updated weights on worker 0-0, policy_version 1197114 (0.00090) [2022-07-11 12:45:33,954][25689] Fps is (10 sec: 5477.0, 60 sec: 5535.5, 300 sec: 5523.0). Total num frames: 1225844736. Throughput: 0: 4949.4. Samples: 1225842736. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:33,955][25689] Avg episode reward: [(0, '-0.970')] [2022-07-11 12:45:35,508][26022] Updated weights on worker 0-0, policy_version 1197124 (0.00085) [2022-07-11 12:45:37,415][26022] Updated weights on worker 0-0, policy_version 1197134 (0.00084) [2022-07-11 12:45:38,956][25689] Fps is (10 sec: 5717.8, 60 sec: 5572.7, 300 sec: 5526.8). Total num frames: 1225874432. Throughput: 0: 5802.3. Samples: 1225876484. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:38,956][25689] Avg episode reward: [(0, '-0.833')] [2022-07-11 12:45:39,116][26022] Updated weights on worker 0-0, policy_version 1197144 (0.00085) [2022-07-11 12:45:41,097][26022] Updated weights on worker 0-0, policy_version 1197154 (0.00088) [2022-07-11 12:45:42,847][26022] Updated weights on worker 0-0, policy_version 1197164 (0.00093) [2022-07-11 12:45:43,990][25689] Fps is (10 sec: 5612.1, 60 sec: 5541.6, 300 sec: 5531.1). Total num frames: 1225901056. Throughput: 0: 5793.8. Samples: 1225909600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:43,990][25689] Avg episode reward: [(0, '0.005')] [2022-07-11 12:45:44,912][26022] Updated weights on worker 0-0, policy_version 1197174 (0.00082) [2022-07-11 12:45:46,644][26022] Updated weights on worker 0-0, policy_version 1197184 (0.00094) [2022-07-11 12:45:48,539][26022] Updated weights on worker 0-0, policy_version 1197194 (0.00086) [2022-07-11 12:45:49,042][25689] Fps is (10 sec: 5482.4, 60 sec: 5542.4, 300 sec: 5524.8). Total num frames: 1225929728. Throughput: 0: 4976.8. Samples: 1225926180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:49,043][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 12:45:50,227][26022] Updated weights on worker 0-0, policy_version 1197204 (0.00090) [2022-07-11 12:45:52,219][26022] Updated weights on worker 0-0, policy_version 1197214 (0.00079) [2022-07-11 12:45:53,927][26022] Updated weights on worker 0-0, policy_version 1197224 (0.00081) [2022-07-11 12:45:54,123][25689] Fps is (10 sec: 5558.0, 60 sec: 5553.9, 300 sec: 5526.8). Total num frames: 1225957376. Throughput: 0: 5793.3. Samples: 1225959578. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:54,123][25689] Avg episode reward: [(0, '-0.377')] [2022-07-11 12:45:55,691][26022] Updated weights on worker 0-0, policy_version 1197234 (0.00088) [2022-07-11 12:45:57,707][26022] Updated weights on worker 0-0, policy_version 1197244 (0.00089) [2022-07-11 12:45:59,154][25689] Fps is (10 sec: 5569.7, 60 sec: 5554.1, 300 sec: 5530.1). Total num frames: 1225986048. Throughput: 0: 5785.6. Samples: 1225993340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:45:59,155][25689] Avg episode reward: [(0, '0.275')] [2022-07-11 12:45:59,338][26022] Updated weights on worker 0-0, policy_version 1197254 (0.00093) [2022-07-11 12:46:01,432][26022] Updated weights on worker 0-0, policy_version 1197264 (0.00097) [2022-07-11 12:46:03,552][26022] Updated weights on worker 0-0, policy_version 1197274 (0.00093) [2022-07-11 12:46:04,157][25689] Fps is (10 sec: 5408.4, 60 sec: 5503.6, 300 sec: 5527.4). Total num frames: 1226011648. Throughput: 0: 4922.8. Samples: 1226008882. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:04,159][25689] Avg episode reward: [(0, '-1.460')] [2022-07-11 12:46:05,391][26022] Updated weights on worker 0-0, policy_version 1197284 (0.00095) [2022-07-11 12:46:07,173][26022] Updated weights on worker 0-0, policy_version 1197294 (0.00087) [2022-07-11 12:46:08,071][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:46:08,086][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001197299_1226034176.pth [2022-07-11 12:46:08,087][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001195353_1224041472.pth [2022-07-11 12:46:08,999][26022] Updated weights on worker 0-0, policy_version 1197304 (0.00087) [2022-07-11 12:46:09,224][25689] Fps is (10 sec: 5389.5, 60 sec: 5543.3, 300 sec: 5530.8). Total num frames: 1226040320. Throughput: 0: 5733.1. Samples: 1226041884. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:09,226][25689] Avg episode reward: [(0, '-0.987')] [2022-07-11 12:46:10,773][26022] Updated weights on worker 0-0, policy_version 1197314 (0.00085) [2022-07-11 12:46:12,629][26022] Updated weights on worker 0-0, policy_version 1197324 (0.00086) [2022-07-11 12:46:14,238][25689] Fps is (10 sec: 5688.8, 60 sec: 5563.5, 300 sec: 5534.2). Total num frames: 1226068992. Throughput: 0: 5773.2. Samples: 1226075706. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:14,240][25689] Avg episode reward: [(0, '-0.955')] [2022-07-11 12:46:14,507][26022] Updated weights on worker 0-0, policy_version 1197334 (0.00080) [2022-07-11 12:46:16,334][26022] Updated weights on worker 0-0, policy_version 1197344 (0.00085) [2022-07-11 12:46:17,975][26022] Updated weights on worker 0-0, policy_version 1197354 (0.00094) [2022-07-11 12:46:19,284][25689] Fps is (10 sec: 5496.4, 60 sec: 5532.3, 300 sec: 5531.0). Total num frames: 1226095616. Throughput: 0: 4938.6. Samples: 1226092756. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:19,285][25689] Avg episode reward: [(0, '-0.714')] [2022-07-11 12:46:19,971][26022] Updated weights on worker 0-0, policy_version 1197364 (0.00106) [2022-07-11 12:46:21,539][26022] Updated weights on worker 0-0, policy_version 1197374 (0.00088) [2022-07-11 12:46:23,890][26022] Updated weights on worker 0-0, policy_version 1197384 (0.00877) [2022-07-11 12:46:24,307][25689] Fps is (10 sec: 5491.4, 60 sec: 5539.3, 300 sec: 5531.5). Total num frames: 1226124288. Throughput: 0: 5816.5. Samples: 1226126082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:24,309][25689] Avg episode reward: [(0, '0.398')] [2022-07-11 12:46:25,286][26022] Updated weights on worker 0-0, policy_version 1197394 (0.00085) [2022-07-11 12:46:27,371][26022] Updated weights on worker 0-0, policy_version 1197404 (0.00086) [2022-07-11 12:46:29,033][26022] Updated weights on worker 0-0, policy_version 1197414 (0.00092) [2022-07-11 12:46:29,347][25689] Fps is (10 sec: 5800.6, 60 sec: 5575.9, 300 sec: 5538.9). Total num frames: 1226153984. Throughput: 0: 5865.4. Samples: 1226159912. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:29,347][25689] Avg episode reward: [(0, '2.071')] [2022-07-11 12:46:30,903][26022] Updated weights on worker 0-0, policy_version 1197424 (0.00085) [2022-07-11 12:46:32,742][26022] Updated weights on worker 0-0, policy_version 1197434 (0.00085) [2022-07-11 12:46:34,363][25689] Fps is (10 sec: 5600.9, 60 sec: 5560.0, 300 sec: 5528.8). Total num frames: 1226180608. Throughput: 0: 5030.1. Samples: 1226176938. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:34,363][25689] Avg episode reward: [(0, '2.180')] [2022-07-11 12:46:34,563][26022] Updated weights on worker 0-0, policy_version 1197444 (0.00091) [2022-07-11 12:46:36,286][26022] Updated weights on worker 0-0, policy_version 1197454 (0.00087) [2022-07-11 12:46:38,147][26022] Updated weights on worker 0-0, policy_version 1197464 (0.00102) [2022-07-11 12:46:39,371][25689] Fps is (10 sec: 5618.1, 60 sec: 5559.4, 300 sec: 5543.3). Total num frames: 1226210304. Throughput: 0: 5889.0. Samples: 1226211050. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:39,372][25689] Avg episode reward: [(0, '2.092')] [2022-07-11 12:46:39,823][26022] Updated weights on worker 0-0, policy_version 1197474 (0.00089) [2022-07-11 12:46:41,907][26022] Updated weights on worker 0-0, policy_version 1197484 (0.00081) [2022-07-11 12:46:43,376][26022] Updated weights on worker 0-0, policy_version 1197494 (0.00093) [2022-07-11 12:46:44,390][25689] Fps is (10 sec: 5616.9, 60 sec: 5560.8, 300 sec: 5534.5). Total num frames: 1226236928. Throughput: 0: 5913.9. Samples: 1226244848. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:44,390][25689] Avg episode reward: [(0, '1.255')] [2022-07-11 12:46:45,513][26022] Updated weights on worker 0-0, policy_version 1197504 (0.00085) [2022-07-11 12:46:47,313][26022] Updated weights on worker 0-0, policy_version 1197514 (0.00080) [2022-07-11 12:46:49,157][26022] Updated weights on worker 0-0, policy_version 1197524 (0.00090) [2022-07-11 12:46:49,465][25689] Fps is (10 sec: 5579.9, 60 sec: 5575.6, 300 sec: 5543.7). Total num frames: 1226266624. Throughput: 0: 5053.0. Samples: 1226261568. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:49,465][25689] Avg episode reward: [(0, '1.016')] [2022-07-11 12:46:50,869][26022] Updated weights on worker 0-0, policy_version 1197534 (0.00097) [2022-07-11 12:46:52,804][26022] Updated weights on worker 0-0, policy_version 1197544 (0.00097) [2022-07-11 12:46:54,417][26022] Updated weights on worker 0-0, policy_version 1197554 (0.00087) [2022-07-11 12:46:54,520][25689] Fps is (10 sec: 5761.6, 60 sec: 5594.9, 300 sec: 5542.8). Total num frames: 1226295296. Throughput: 0: 5864.1. Samples: 1226295142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:54,521][25689] Avg episode reward: [(0, '0.111')] [2022-07-11 12:46:56,526][26022] Updated weights on worker 0-0, policy_version 1197564 (0.00095) [2022-07-11 12:46:58,038][26022] Updated weights on worker 0-0, policy_version 1197574 (0.00094) [2022-07-11 12:46:59,550][25689] Fps is (10 sec: 5483.1, 60 sec: 5561.2, 300 sec: 5539.4). Total num frames: 1226321920. Throughput: 0: 5845.6. Samples: 1226329004. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:46:59,550][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 12:47:00,039][26022] Updated weights on worker 0-0, policy_version 1197584 (0.00089) [2022-07-11 12:47:02,287][26022] Updated weights on worker 0-0, policy_version 1197594 (0.00081) [2022-07-11 12:47:03,754][26022] Updated weights on worker 0-0, policy_version 1197604 (0.00090) [2022-07-11 12:47:04,554][25689] Fps is (10 sec: 5408.6, 60 sec: 5595.0, 300 sec: 5545.8). Total num frames: 1226349568. Throughput: 0: 4902.2. Samples: 1226343700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:04,555][25689] Avg episode reward: [(0, '0.169')] [2022-07-11 12:47:06,073][26022] Updated weights on worker 0-0, policy_version 1197614 (0.00087) [2022-07-11 12:47:07,600][26022] Updated weights on worker 0-0, policy_version 1197624 (0.00092) [2022-07-11 12:47:09,545][26022] Updated weights on worker 0-0, policy_version 1197634 (0.00090) [2022-07-11 12:47:09,671][25689] Fps is (10 sec: 5463.4, 60 sec: 5573.4, 300 sec: 5538.1). Total num frames: 1226377216. Throughput: 0: 5737.5. Samples: 1226377500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:09,671][25689] Avg episode reward: [(0, '-0.690')] [2022-07-11 12:47:11,547][26022] Updated weights on worker 0-0, policy_version 1197644 (0.00084) [2022-07-11 12:47:13,047][26022] Updated weights on worker 0-0, policy_version 1197654 (0.00085) [2022-07-11 12:47:14,683][25689] Fps is (10 sec: 5459.4, 60 sec: 5556.6, 300 sec: 5546.2). Total num frames: 1226404864. Throughput: 0: 5747.4. Samples: 1226411028. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:14,685][25689] Avg episode reward: [(0, '-0.100')] [2022-07-11 12:47:15,115][26022] Updated weights on worker 0-0, policy_version 1197664 (0.00086) [2022-07-11 12:47:16,893][26022] Updated weights on worker 0-0, policy_version 1197674 (0.00096) [2022-07-11 12:47:18,780][26022] Updated weights on worker 0-0, policy_version 1197684 (0.00098) [2022-07-11 12:47:19,701][25689] Fps is (10 sec: 5818.9, 60 sec: 5627.0, 300 sec: 5553.9). Total num frames: 1226435584. Throughput: 0: 4911.2. Samples: 1226427974. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:19,702][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 12:47:20,596][26022] Updated weights on worker 0-0, policy_version 1197694 (0.00088) [2022-07-11 12:47:22,423][26022] Updated weights on worker 0-0, policy_version 1197704 (0.00079) [2022-07-11 12:47:24,187][26022] Updated weights on worker 0-0, policy_version 1197714 (0.00090) [2022-07-11 12:47:24,704][25689] Fps is (10 sec: 5620.1, 60 sec: 5578.0, 300 sec: 5544.8). Total num frames: 1226461184. Throughput: 0: 5846.1. Samples: 1226461498. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:24,705][25689] Avg episode reward: [(0, '0.786')] [2022-07-11 12:47:26,186][26022] Updated weights on worker 0-0, policy_version 1197724 (0.00087) [2022-07-11 12:47:27,703][26022] Updated weights on worker 0-0, policy_version 1197734 (0.00108) [2022-07-11 12:47:29,763][25689] Fps is (10 sec: 5190.5, 60 sec: 5525.4, 300 sec: 5540.6). Total num frames: 1226487808. Throughput: 0: 5832.3. Samples: 1226494684. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:29,768][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 12:47:29,858][26022] Updated weights on worker 0-0, policy_version 1197744 (0.00102) [2022-07-11 12:47:31,516][26022] Updated weights on worker 0-0, policy_version 1197754 (0.00085) [2022-07-11 12:47:33,416][26022] Updated weights on worker 0-0, policy_version 1197764 (0.00089) [2022-07-11 12:47:34,776][25689] Fps is (10 sec: 5592.0, 60 sec: 5576.5, 300 sec: 5541.3). Total num frames: 1226517504. Throughput: 0: 5000.5. Samples: 1226511502. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:34,777][25689] Avg episode reward: [(0, '1.065')] [2022-07-11 12:47:35,404][26022] Updated weights on worker 0-0, policy_version 1197774 (0.00085) [2022-07-11 12:47:36,999][26022] Updated weights on worker 0-0, policy_version 1197784 (0.00087) [2022-07-11 12:47:38,931][26022] Updated weights on worker 0-0, policy_version 1197794 (0.00087) [2022-07-11 12:47:39,788][25689] Fps is (10 sec: 5719.9, 60 sec: 5542.3, 300 sec: 5549.1). Total num frames: 1226545152. Throughput: 0: 5825.2. Samples: 1226544984. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:39,791][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 12:47:40,719][26022] Updated weights on worker 0-0, policy_version 1197804 (0.00096) [2022-07-11 12:47:42,564][26022] Updated weights on worker 0-0, policy_version 1197814 (0.00088) [2022-07-11 12:47:44,569][26022] Updated weights on worker 0-0, policy_version 1197824 (0.00088) [2022-07-11 12:47:44,847][25689] Fps is (10 sec: 5592.3, 60 sec: 5572.5, 300 sec: 5545.9). Total num frames: 1226573824. Throughput: 0: 5797.1. Samples: 1226578266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:44,847][25689] Avg episode reward: [(0, '1.315')] [2022-07-11 12:47:46,377][26022] Updated weights on worker 0-0, policy_version 1197834 (0.00090) [2022-07-11 12:47:48,160][26022] Updated weights on worker 0-0, policy_version 1197844 (0.00085) [2022-07-11 12:47:49,922][25689] Fps is (10 sec: 5557.6, 60 sec: 5538.6, 300 sec: 5548.6). Total num frames: 1226601472. Throughput: 0: 4973.1. Samples: 1226594938. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:49,923][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 12:47:50,033][26022] Updated weights on worker 0-0, policy_version 1197854 (0.00089) [2022-07-11 12:47:51,817][26022] Updated weights on worker 0-0, policy_version 1197864 (0.00087) [2022-07-11 12:47:53,779][26022] Updated weights on worker 0-0, policy_version 1197874 (0.00091) [2022-07-11 12:47:54,948][25689] Fps is (10 sec: 5575.7, 60 sec: 5541.3, 300 sec: 5548.9). Total num frames: 1226630144. Throughput: 0: 5799.1. Samples: 1226628482. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:54,948][25689] Avg episode reward: [(0, '-0.404')] [2022-07-11 12:47:55,578][26022] Updated weights on worker 0-0, policy_version 1197884 (0.00090) [2022-07-11 12:47:57,271][26022] Updated weights on worker 0-0, policy_version 1197894 (0.00095) [2022-07-11 12:47:58,994][26022] Updated weights on worker 0-0, policy_version 1197904 (0.00083) [2022-07-11 12:47:59,955][25689] Fps is (10 sec: 5511.7, 60 sec: 5543.4, 300 sec: 5549.8). Total num frames: 1226656768. Throughput: 0: 5796.3. Samples: 1226661874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:47:59,955][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 12:48:01,167][26022] Updated weights on worker 0-0, policy_version 1197914 (0.00085) [2022-07-11 12:48:03,083][26022] Updated weights on worker 0-0, policy_version 1197924 (0.00084) [2022-07-11 12:48:04,961][25689] Fps is (10 sec: 5113.0, 60 sec: 5492.3, 300 sec: 5542.9). Total num frames: 1226681344. Throughput: 0: 4982.1. Samples: 1226678480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:04,962][25689] Avg episode reward: [(0, '-0.599')] [2022-07-11 12:48:05,203][26022] Updated weights on worker 0-0, policy_version 1197934 (0.00088) [2022-07-11 12:48:06,841][26022] Updated weights on worker 0-0, policy_version 1197944 (0.00089) [2022-07-11 12:48:08,108][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:48:08,120][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001197950_1226700800.pth [2022-07-11 12:48:08,127][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001195999_1224702976.pth [2022-07-11 12:48:08,891][26022] Updated weights on worker 0-0, policy_version 1197954 (0.00092) [2022-07-11 12:48:10,012][25689] Fps is (10 sec: 5498.1, 60 sec: 5549.3, 300 sec: 5549.2). Total num frames: 1226712064. Throughput: 0: 5734.2. Samples: 1226710136. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:10,012][25689] Avg episode reward: [(0, '-0.813')] [2022-07-11 12:48:10,676][26022] Updated weights on worker 0-0, policy_version 1197964 (0.00085) [2022-07-11 12:48:12,251][26022] Updated weights on worker 0-0, policy_version 1197974 (0.00083) [2022-07-11 12:48:14,261][26022] Updated weights on worker 0-0, policy_version 1197984 (0.00088) [2022-07-11 12:48:15,014][25689] Fps is (10 sec: 5806.1, 60 sec: 5550.2, 300 sec: 5549.7). Total num frames: 1226739712. Throughput: 0: 5744.3. Samples: 1226743750. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:15,015][25689] Avg episode reward: [(0, '-1.100')] [2022-07-11 12:48:16,051][26022] Updated weights on worker 0-0, policy_version 1197994 (0.00085) [2022-07-11 12:48:17,986][26022] Updated weights on worker 0-0, policy_version 1198004 (0.00083) [2022-07-11 12:48:19,788][26022] Updated weights on worker 0-0, policy_version 1198014 (0.00082) [2022-07-11 12:48:20,023][25689] Fps is (10 sec: 5523.5, 60 sec: 5500.1, 300 sec: 5546.3). Total num frames: 1226767360. Throughput: 0: 4925.1. Samples: 1226760712. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:20,024][25689] Avg episode reward: [(0, '0.684')] [2022-07-11 12:48:21,436][26022] Updated weights on worker 0-0, policy_version 1198024 (0.00089) [2022-07-11 12:48:23,365][26022] Updated weights on worker 0-0, policy_version 1198034 (0.00092) [2022-07-11 12:48:25,003][26022] Updated weights on worker 0-0, policy_version 1198044 (0.00093) [2022-07-11 12:48:25,055][25689] Fps is (10 sec: 5711.3, 60 sec: 5565.4, 300 sec: 5554.6). Total num frames: 1226797056. Throughput: 0: 5758.0. Samples: 1226794176. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:25,055][25689] Avg episode reward: [(0, '0.520')] [2022-07-11 12:48:27,151][26022] Updated weights on worker 0-0, policy_version 1198054 (0.00088) [2022-07-11 12:48:28,949][26022] Updated weights on worker 0-0, policy_version 1198064 (0.00086) [2022-07-11 12:48:30,095][25689] Fps is (10 sec: 5591.8, 60 sec: 5567.1, 300 sec: 5551.3). Total num frames: 1226823680. Throughput: 0: 5868.0. Samples: 1226827982. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:30,097][25689] Avg episode reward: [(0, '-0.119')] [2022-07-11 12:48:30,690][26022] Updated weights on worker 0-0, policy_version 1198074 (0.00096) [2022-07-11 12:48:32,527][26022] Updated weights on worker 0-0, policy_version 1198084 (0.00079) [2022-07-11 12:48:34,296][26022] Updated weights on worker 0-0, policy_version 1198094 (0.00094) [2022-07-11 12:48:35,116][25689] Fps is (10 sec: 5496.1, 60 sec: 5549.4, 300 sec: 5551.2). Total num frames: 1226852352. Throughput: 0: 5871.3. Samples: 1226861772. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:35,116][25689] Avg episode reward: [(0, '-0.839')] [2022-07-11 12:48:36,161][26022] Updated weights on worker 0-0, policy_version 1198104 (0.00087) [2022-07-11 12:48:37,961][26022] Updated weights on worker 0-0, policy_version 1198114 (0.00082) [2022-07-11 12:48:39,762][26022] Updated weights on worker 0-0, policy_version 1198124 (0.00083) [2022-07-11 12:48:40,122][25689] Fps is (10 sec: 5718.8, 60 sec: 5566.9, 300 sec: 5551.1). Total num frames: 1226881024. Throughput: 0: 5876.4. Samples: 1226878822. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 12:48:40,123][25689] Avg episode reward: [(0, '-0.808')] [2022-07-11 12:48:41,548][26022] Updated weights on worker 0-0, policy_version 1198134 (0.00085) [2022-07-11 12:48:43,475][26022] Updated weights on worker 0-0, policy_version 1198144 (0.00092) [2022-07-11 12:48:45,124][25689] Fps is (10 sec: 5627.3, 60 sec: 5555.1, 300 sec: 5555.6). Total num frames: 1226908672. Throughput: 0: 5894.8. Samples: 1226912480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:48:45,125][25689] Avg episode reward: [(0, '-0.649')] [2022-07-11 12:48:45,145][26022] Updated weights on worker 0-0, policy_version 1198154 (0.00089) [2022-07-11 12:48:47,140][26022] Updated weights on worker 0-0, policy_version 1198164 (0.00093) [2022-07-11 12:48:49,005][26022] Updated weights on worker 0-0, policy_version 1198174 (0.00769) [2022-07-11 12:48:50,174][25689] Fps is (10 sec: 5500.8, 60 sec: 5557.4, 300 sec: 5551.5). Total num frames: 1226936320. Throughput: 0: 5898.5. Samples: 1226946422. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:48:50,175][25689] Avg episode reward: [(0, '-0.692')] [2022-07-11 12:48:50,643][26022] Updated weights on worker 0-0, policy_version 1198184 (0.00101) [2022-07-11 12:48:52,649][26022] Updated weights on worker 0-0, policy_version 1198194 (0.00087) [2022-07-11 12:48:54,344][26022] Updated weights on worker 0-0, policy_version 1198204 (0.00087) [2022-07-11 12:48:55,219][25689] Fps is (10 sec: 5477.8, 60 sec: 5538.7, 300 sec: 5548.6). Total num frames: 1226963968. Throughput: 0: 5033.3. Samples: 1226962956. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:48:55,220][25689] Avg episode reward: [(0, '-0.218')] [2022-07-11 12:48:56,259][26022] Updated weights on worker 0-0, policy_version 1198214 (0.00086) [2022-07-11 12:48:58,149][26022] Updated weights on worker 0-0, policy_version 1198224 (0.00092) [2022-07-11 12:48:59,926][26022] Updated weights on worker 0-0, policy_version 1198234 (0.00091) [2022-07-11 12:49:00,235][25689] Fps is (10 sec: 5699.6, 60 sec: 5588.8, 300 sec: 5566.7). Total num frames: 1226993664. Throughput: 0: 5859.0. Samples: 1226996666. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:00,236][25689] Avg episode reward: [(0, '0.934')] [2022-07-11 12:49:01,880][26022] Updated weights on worker 0-0, policy_version 1198244 (0.00090) [2022-07-11 12:49:03,904][26022] Updated weights on worker 0-0, policy_version 1198254 (0.00084) [2022-07-11 12:49:05,251][25689] Fps is (10 sec: 5511.7, 60 sec: 5604.9, 300 sec: 5554.6). Total num frames: 1227019264. Throughput: 0: 5767.5. Samples: 1227028562. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:05,253][25689] Avg episode reward: [(0, '-0.229')] [2022-07-11 12:49:05,733][26022] Updated weights on worker 0-0, policy_version 1198264 (0.00086) [2022-07-11 12:49:07,510][26022] Updated weights on worker 0-0, policy_version 1198274 (0.00082) [2022-07-11 12:49:09,358][26022] Updated weights on worker 0-0, policy_version 1198284 (0.00350) [2022-07-11 12:49:10,325][25689] Fps is (10 sec: 5277.7, 60 sec: 5551.9, 300 sec: 5553.6). Total num frames: 1227046912. Throughput: 0: 4913.3. Samples: 1227045426. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:10,325][25689] Avg episode reward: [(0, '-0.638')] [2022-07-11 12:49:11,237][26022] Updated weights on worker 0-0, policy_version 1198294 (0.00082) [2022-07-11 12:49:12,986][26022] Updated weights on worker 0-0, policy_version 1198304 (0.00081) [2022-07-11 12:49:14,786][26022] Updated weights on worker 0-0, policy_version 1198314 (0.00085) [2022-07-11 12:49:15,339][25689] Fps is (10 sec: 5684.6, 60 sec: 5584.7, 300 sec: 5560.4). Total num frames: 1227076608. Throughput: 0: 5775.3. Samples: 1227079156. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:15,339][25689] Avg episode reward: [(0, '-0.770')] [2022-07-11 12:49:16,647][26022] Updated weights on worker 0-0, policy_version 1198324 (0.00092) [2022-07-11 12:49:18,336][26022] Updated weights on worker 0-0, policy_version 1198334 (0.00090) [2022-07-11 12:49:20,223][26022] Updated weights on worker 0-0, policy_version 1198344 (0.00096) [2022-07-11 12:49:20,353][25689] Fps is (10 sec: 5718.1, 60 sec: 5584.2, 300 sec: 5557.2). Total num frames: 1227104256. Throughput: 0: 5785.3. Samples: 1227113054. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:20,354][25689] Avg episode reward: [(0, '-0.832')] [2022-07-11 12:49:22,240][26022] Updated weights on worker 0-0, policy_version 1198354 (0.00101) [2022-07-11 12:49:23,978][26022] Updated weights on worker 0-0, policy_version 1198364 (0.00090) [2022-07-11 12:49:25,399][25689] Fps is (10 sec: 5496.6, 60 sec: 5549.0, 300 sec: 5558.8). Total num frames: 1227131904. Throughput: 0: 5009.3. Samples: 1227129488. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:25,399][25689] Avg episode reward: [(0, '-1.190')] [2022-07-11 12:49:25,813][26022] Updated weights on worker 0-0, policy_version 1198374 (0.00090) [2022-07-11 12:49:27,711][26022] Updated weights on worker 0-0, policy_version 1198384 (0.01229) [2022-07-11 12:49:29,546][26022] Updated weights on worker 0-0, policy_version 1198394 (0.00091) [2022-07-11 12:49:30,511][25689] Fps is (10 sec: 5544.4, 60 sec: 5576.2, 300 sec: 5557.0). Total num frames: 1227160576. Throughput: 0: 5817.6. Samples: 1227162862. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:30,512][25689] Avg episode reward: [(0, '-2.583')] [2022-07-11 12:49:31,423][26022] Updated weights on worker 0-0, policy_version 1198404 (0.00111) [2022-07-11 12:49:33,284][26022] Updated weights on worker 0-0, policy_version 1198414 (0.00096) [2022-07-11 12:49:35,179][26022] Updated weights on worker 0-0, policy_version 1198424 (0.00092) [2022-07-11 12:49:35,596][25689] Fps is (10 sec: 5522.8, 60 sec: 5553.4, 300 sec: 5556.1). Total num frames: 1227188224. Throughput: 0: 5768.5. Samples: 1227196012. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:35,597][25689] Avg episode reward: [(0, '-0.922')] [2022-07-11 12:49:37,003][26022] Updated weights on worker 0-0, policy_version 1198434 (0.00086) [2022-07-11 12:49:38,765][26022] Updated weights on worker 0-0, policy_version 1198444 (0.00083) [2022-07-11 12:49:40,676][25689] Fps is (10 sec: 5540.8, 60 sec: 5546.7, 300 sec: 5555.8). Total num frames: 1227216896. Throughput: 0: 4914.7. Samples: 1227212934. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:40,676][25689] Avg episode reward: [(0, '-0.227')] [2022-07-11 12:49:40,680][26022] Updated weights on worker 0-0, policy_version 1198454 (0.00084) [2022-07-11 12:49:42,467][26022] Updated weights on worker 0-0, policy_version 1198464 (0.00089) [2022-07-11 12:49:44,372][26022] Updated weights on worker 0-0, policy_version 1198474 (0.00079) [2022-07-11 12:49:45,705][25689] Fps is (10 sec: 5571.4, 60 sec: 5544.2, 300 sec: 5552.9). Total num frames: 1227244544. Throughput: 0: 5768.0. Samples: 1227246616. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:45,706][25689] Avg episode reward: [(0, '-0.177')] [2022-07-11 12:49:46,260][26022] Updated weights on worker 0-0, policy_version 1198484 (0.00091) [2022-07-11 12:49:47,933][26022] Updated weights on worker 0-0, policy_version 1198494 (0.00094) [2022-07-11 12:49:49,933][26022] Updated weights on worker 0-0, policy_version 1198504 (0.00091) [2022-07-11 12:49:50,747][25689] Fps is (10 sec: 5591.9, 60 sec: 5561.8, 300 sec: 5559.4). Total num frames: 1227273216. Throughput: 0: 5787.1. Samples: 1227279972. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:50,748][25689] Avg episode reward: [(0, '-0.674')] [2022-07-11 12:49:51,612][26022] Updated weights on worker 0-0, policy_version 1198514 (0.00092) [2022-07-11 12:49:53,558][26022] Updated weights on worker 0-0, policy_version 1198524 (0.00085) [2022-07-11 12:49:55,175][26022] Updated weights on worker 0-0, policy_version 1198534 (0.00093) [2022-07-11 12:49:55,760][25689] Fps is (10 sec: 5601.1, 60 sec: 5564.7, 300 sec: 5556.4). Total num frames: 1227300864. Throughput: 0: 4995.4. Samples: 1227296740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:49:55,761][25689] Avg episode reward: [(0, '-0.368')] [2022-07-11 12:49:57,237][26022] Updated weights on worker 0-0, policy_version 1198544 (0.00090) [2022-07-11 12:49:58,876][26022] Updated weights on worker 0-0, policy_version 1198554 (0.00092) [2022-07-11 12:50:00,781][25689] Fps is (10 sec: 5511.0, 60 sec: 5530.5, 300 sec: 5552.6). Total num frames: 1227328512. Throughput: 0: 5840.5. Samples: 1227330360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:00,782][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 12:50:01,009][26022] Updated weights on worker 0-0, policy_version 1198564 (0.00085) [2022-07-11 12:50:03,167][26022] Updated weights on worker 0-0, policy_version 1198574 (0.00091) [2022-07-11 12:50:04,894][26022] Updated weights on worker 0-0, policy_version 1198584 (0.00093) [2022-07-11 12:50:05,802][25689] Fps is (10 sec: 5404.8, 60 sec: 5547.0, 300 sec: 5554.7). Total num frames: 1227355136. Throughput: 0: 5717.2. Samples: 1227361512. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:05,802][25689] Avg episode reward: [(0, '-0.488')] [2022-07-11 12:50:06,712][26022] Updated weights on worker 0-0, policy_version 1198594 (0.00088) [2022-07-11 12:50:08,491][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:50:08,511][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001198603_1227369472.pth [2022-07-11 12:50:08,511][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001196648_1225367552.pth [2022-07-11 12:50:08,697][26022] Updated weights on worker 0-0, policy_version 1198604 (0.01092) [2022-07-11 12:50:10,465][26022] Updated weights on worker 0-0, policy_version 1198614 (0.00086) [2022-07-11 12:50:10,907][25689] Fps is (10 sec: 5460.7, 60 sec: 5560.9, 300 sec: 5557.1). Total num frames: 1227383808. Throughput: 0: 4865.5. Samples: 1227378060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:10,908][25689] Avg episode reward: [(0, '-0.594')] [2022-07-11 12:50:12,302][26022] Updated weights on worker 0-0, policy_version 1198624 (0.00089) [2022-07-11 12:50:14,047][26022] Updated weights on worker 0-0, policy_version 1198634 (0.00088) [2022-07-11 12:50:15,927][25689] Fps is (10 sec: 5461.3, 60 sec: 5509.7, 300 sec: 5551.3). Total num frames: 1227410432. Throughput: 0: 5697.4. Samples: 1227411636. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:15,927][25689] Avg episode reward: [(0, '-0.093')] [2022-07-11 12:50:16,055][26022] Updated weights on worker 0-0, policy_version 1198644 (0.00089) [2022-07-11 12:50:17,741][26022] Updated weights on worker 0-0, policy_version 1198654 (0.00053) [2022-07-11 12:50:19,792][26022] Updated weights on worker 0-0, policy_version 1198664 (0.00086) [2022-07-11 12:50:21,009][25689] Fps is (10 sec: 5575.1, 60 sec: 5537.3, 300 sec: 5555.0). Total num frames: 1227440128. Throughput: 0: 5686.1. Samples: 1227445380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:21,010][25689] Avg episode reward: [(0, '-0.085')] [2022-07-11 12:50:21,239][26022] Updated weights on worker 0-0, policy_version 1198674 (0.00064) [2022-07-11 12:50:23,372][26022] Updated weights on worker 0-0, policy_version 1198684 (0.00070) [2022-07-11 12:50:24,983][26022] Updated weights on worker 0-0, policy_version 1198694 (0.00084) [2022-07-11 12:50:26,055][25689] Fps is (10 sec: 5459.7, 60 sec: 5503.5, 300 sec: 5548.6). Total num frames: 1227465728. Throughput: 0: 4965.2. Samples: 1227462072. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:26,055][25689] Avg episode reward: [(0, '0.423')] [2022-07-11 12:50:26,900][26022] Updated weights on worker 0-0, policy_version 1198704 (0.00089) [2022-07-11 12:50:28,924][26022] Updated weights on worker 0-0, policy_version 1198714 (0.00095) [2022-07-11 12:50:30,856][26022] Updated weights on worker 0-0, policy_version 1198724 (0.00094) [2022-07-11 12:50:31,086][25689] Fps is (10 sec: 5487.6, 60 sec: 5527.8, 300 sec: 5555.4). Total num frames: 1227495424. Throughput: 0: 5791.7. Samples: 1227494928. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:31,086][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 12:50:32,634][26022] Updated weights on worker 0-0, policy_version 1198734 (0.00090) [2022-07-11 12:50:34,545][26022] Updated weights on worker 0-0, policy_version 1198744 (0.00086) [2022-07-11 12:50:36,107][25689] Fps is (10 sec: 5602.9, 60 sec: 5516.8, 300 sec: 5544.7). Total num frames: 1227522048. Throughput: 0: 5771.3. Samples: 1227528100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:36,107][25689] Avg episode reward: [(0, '0.658')] [2022-07-11 12:50:36,287][26022] Updated weights on worker 0-0, policy_version 1198754 (0.00084) [2022-07-11 12:50:38,147][26022] Updated weights on worker 0-0, policy_version 1198764 (0.00083) [2022-07-11 12:50:40,023][26022] Updated weights on worker 0-0, policy_version 1198774 (0.00080) [2022-07-11 12:50:41,193][25689] Fps is (10 sec: 5369.9, 60 sec: 5499.3, 300 sec: 5547.2). Total num frames: 1227549696. Throughput: 0: 4939.4. Samples: 1227545074. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:41,193][25689] Avg episode reward: [(0, '0.737')] [2022-07-11 12:50:41,811][26022] Updated weights on worker 0-0, policy_version 1198784 (0.00485) [2022-07-11 12:50:43,762][26022] Updated weights on worker 0-0, policy_version 1198794 (0.00090) [2022-07-11 12:50:45,411][26022] Updated weights on worker 0-0, policy_version 1198804 (0.00087) [2022-07-11 12:50:46,266][25689] Fps is (10 sec: 5644.4, 60 sec: 5529.1, 300 sec: 5550.2). Total num frames: 1227579392. Throughput: 0: 5766.0. Samples: 1227578610. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:46,267][25689] Avg episode reward: [(0, '0.915')] [2022-07-11 12:50:47,237][26022] Updated weights on worker 0-0, policy_version 1198814 (0.00088) [2022-07-11 12:50:49,163][26022] Updated weights on worker 0-0, policy_version 1198824 (0.00087) [2022-07-11 12:50:50,964][26022] Updated weights on worker 0-0, policy_version 1198834 (0.00093) [2022-07-11 12:50:51,313][25689] Fps is (10 sec: 5767.2, 60 sec: 5528.7, 300 sec: 5554.3). Total num frames: 1227608064. Throughput: 0: 5790.8. Samples: 1227612060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:51,314][25689] Avg episode reward: [(0, '0.975')] [2022-07-11 12:50:52,948][26022] Updated weights on worker 0-0, policy_version 1198844 (0.00090) [2022-07-11 12:50:54,555][26022] Updated weights on worker 0-0, policy_version 1198854 (0.00081) [2022-07-11 12:50:56,318][25689] Fps is (10 sec: 5501.2, 60 sec: 5512.5, 300 sec: 5547.9). Total num frames: 1227634688. Throughput: 0: 4980.0. Samples: 1227628752. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:50:56,318][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 12:50:56,434][26022] Updated weights on worker 0-0, policy_version 1198864 (0.00091) [2022-07-11 12:50:58,645][26022] Updated weights on worker 0-0, policy_version 1198874 (0.00091) [2022-07-11 12:51:00,266][26022] Updated weights on worker 0-0, policy_version 1198884 (0.00085) [2022-07-11 12:51:01,403][25689] Fps is (10 sec: 5379.1, 60 sec: 5506.7, 300 sec: 5553.3). Total num frames: 1227662336. Throughput: 0: 5783.5. Samples: 1227661958. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:01,403][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 12:51:02,514][26022] Updated weights on worker 0-0, policy_version 1198894 (0.00082) [2022-07-11 12:51:04,426][26022] Updated weights on worker 0-0, policy_version 1198904 (0.00083) [2022-07-11 12:51:06,216][26022] Updated weights on worker 0-0, policy_version 1198914 (0.00087) [2022-07-11 12:51:06,442][25689] Fps is (10 sec: 5461.6, 60 sec: 5521.8, 300 sec: 5550.3). Total num frames: 1227689984. Throughput: 0: 5678.5. Samples: 1227693180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:06,443][25689] Avg episode reward: [(0, '1.063')] [2022-07-11 12:51:07,982][26022] Updated weights on worker 0-0, policy_version 1198924 (0.00084) [2022-07-11 12:51:09,696][26022] Updated weights on worker 0-0, policy_version 1198934 (0.00093) [2022-07-11 12:51:11,487][25689] Fps is (10 sec: 5483.5, 60 sec: 5510.5, 300 sec: 5546.3). Total num frames: 1227717632. Throughput: 0: 5709.9. Samples: 1227727248. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:11,487][25689] Avg episode reward: [(0, '1.168')] [2022-07-11 12:51:11,643][26022] Updated weights on worker 0-0, policy_version 1198944 (0.00092) [2022-07-11 12:51:13,424][26022] Updated weights on worker 0-0, policy_version 1198954 (0.00088) [2022-07-11 12:51:15,215][26022] Updated weights on worker 0-0, policy_version 1198964 (0.00087) [2022-07-11 12:51:16,515][25689] Fps is (10 sec: 5591.6, 60 sec: 5543.6, 300 sec: 5553.6). Total num frames: 1227746304. Throughput: 0: 5715.6. Samples: 1227744188. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:16,515][25689] Avg episode reward: [(0, '0.387')] [2022-07-11 12:51:16,976][26022] Updated weights on worker 0-0, policy_version 1198974 (0.00092) [2022-07-11 12:51:18,936][26022] Updated weights on worker 0-0, policy_version 1198984 (0.00092) [2022-07-11 12:51:20,604][26022] Updated weights on worker 0-0, policy_version 1198994 (0.00102) [2022-07-11 12:51:21,535][25689] Fps is (10 sec: 5707.0, 60 sec: 5532.4, 300 sec: 5553.6). Total num frames: 1227774976. Throughput: 0: 5760.2. Samples: 1227777922. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:21,536][25689] Avg episode reward: [(0, '0.458')] [2022-07-11 12:51:22,646][26022] Updated weights on worker 0-0, policy_version 1199004 (0.00078) [2022-07-11 12:51:24,117][26022] Updated weights on worker 0-0, policy_version 1199014 (0.00088) [2022-07-11 12:51:26,241][26022] Updated weights on worker 0-0, policy_version 1199024 (0.00082) [2022-07-11 12:51:26,551][25689] Fps is (10 sec: 5611.5, 60 sec: 5568.9, 300 sec: 5547.2). Total num frames: 1227802624. Throughput: 0: 5886.3. Samples: 1227811546. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:26,552][25689] Avg episode reward: [(0, '0.833')] [2022-07-11 12:51:28,101][26022] Updated weights on worker 0-0, policy_version 1199034 (0.00084) [2022-07-11 12:51:29,829][26022] Updated weights on worker 0-0, policy_version 1199044 (0.00100) [2022-07-11 12:51:31,666][25689] Fps is (10 sec: 5458.0, 60 sec: 5527.4, 300 sec: 5548.8). Total num frames: 1227830272. Throughput: 0: 5004.1. Samples: 1227828226. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:31,666][25689] Avg episode reward: [(0, '-1.755')] [2022-07-11 12:51:31,806][26022] Updated weights on worker 0-0, policy_version 1199054 (0.00087) [2022-07-11 12:51:33,395][26022] Updated weights on worker 0-0, policy_version 1199064 (0.00087) [2022-07-11 12:51:35,533][26022] Updated weights on worker 0-0, policy_version 1199074 (0.00087) [2022-07-11 12:51:36,695][25689] Fps is (10 sec: 5552.2, 60 sec: 5560.4, 300 sec: 5544.9). Total num frames: 1227858944. Throughput: 0: 5818.4. Samples: 1227861604. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:36,695][25689] Avg episode reward: [(0, '-1.800')] [2022-07-11 12:51:37,138][26022] Updated weights on worker 0-0, policy_version 1199084 (0.00086) [2022-07-11 12:51:39,202][26022] Updated weights on worker 0-0, policy_version 1199094 (0.00105) [2022-07-11 12:51:40,726][26022] Updated weights on worker 0-0, policy_version 1199104 (0.00088) [2022-07-11 12:51:41,748][25689] Fps is (10 sec: 5586.3, 60 sec: 5563.5, 300 sec: 5547.7). Total num frames: 1227886592. Throughput: 0: 5802.9. Samples: 1227895216. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:41,749][25689] Avg episode reward: [(0, '-1.146')] [2022-07-11 12:51:42,775][26022] Updated weights on worker 0-0, policy_version 1199114 (0.00089) [2022-07-11 12:51:44,473][26022] Updated weights on worker 0-0, policy_version 1199124 (0.00049) [2022-07-11 12:51:46,481][26022] Updated weights on worker 0-0, policy_version 1199134 (0.00090) [2022-07-11 12:51:46,764][25689] Fps is (10 sec: 5593.3, 60 sec: 5551.8, 300 sec: 5545.4). Total num frames: 1227915264. Throughput: 0: 4974.6. Samples: 1227912096. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:46,765][25689] Avg episode reward: [(0, '-0.907')] [2022-07-11 12:51:48,072][26022] Updated weights on worker 0-0, policy_version 1199144 (0.00752) [2022-07-11 12:51:49,883][26022] Updated weights on worker 0-0, policy_version 1199154 (0.00059) [2022-07-11 12:51:51,644][26022] Updated weights on worker 0-0, policy_version 1199164 (0.00090) [2022-07-11 12:51:51,901][25689] Fps is (10 sec: 5647.7, 60 sec: 5543.6, 300 sec: 5543.9). Total num frames: 1227943936. Throughput: 0: 5821.2. Samples: 1227946020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:51,902][25689] Avg episode reward: [(0, '-1.165')] [2022-07-11 12:51:53,688][26022] Updated weights on worker 0-0, policy_version 1199174 (0.00087) [2022-07-11 12:51:55,371][26022] Updated weights on worker 0-0, policy_version 1199184 (0.00087) [2022-07-11 12:51:56,947][25689] Fps is (10 sec: 5631.4, 60 sec: 5573.6, 300 sec: 5550.4). Total num frames: 1227972608. Throughput: 0: 5830.2. Samples: 1227979676. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:51:56,947][25689] Avg episode reward: [(0, '-1.235')] [2022-07-11 12:51:57,245][26022] Updated weights on worker 0-0, policy_version 1199194 (0.00089) [2022-07-11 12:51:59,107][26022] Updated weights on worker 0-0, policy_version 1199204 (0.00088) [2022-07-11 12:52:00,873][26022] Updated weights on worker 0-0, policy_version 1199214 (0.00089) [2022-07-11 12:52:01,957][25689] Fps is (10 sec: 5397.1, 60 sec: 5546.6, 300 sec: 5543.5). Total num frames: 1227998208. Throughput: 0: 5002.7. Samples: 1227996318. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:01,958][25689] Avg episode reward: [(0, '1.081')] [2022-07-11 12:52:03,235][26022] Updated weights on worker 0-0, policy_version 1199224 (0.00081) [2022-07-11 12:52:04,877][26022] Updated weights on worker 0-0, policy_version 1199234 (0.00090) [2022-07-11 12:52:06,826][26022] Updated weights on worker 0-0, policy_version 1199244 (0.00081) [2022-07-11 12:52:07,049][25689] Fps is (10 sec: 5372.4, 60 sec: 5558.7, 300 sec: 5547.4). Total num frames: 1228026880. Throughput: 0: 5708.8. Samples: 1228027898. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:07,049][25689] Avg episode reward: [(0, '0.392')] [2022-07-11 12:52:08,620][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:52:08,634][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001199254_1228036096.pth [2022-07-11 12:52:08,634][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001197299_1226034176.pth [2022-07-11 12:52:08,639][26022] Updated weights on worker 0-0, policy_version 1199254 (0.00087) [2022-07-11 12:52:10,448][26022] Updated weights on worker 0-0, policy_version 1199264 (0.00093) [2022-07-11 12:52:12,097][25689] Fps is (10 sec: 5554.0, 60 sec: 5558.4, 300 sec: 5546.7). Total num frames: 1228054528. Throughput: 0: 5713.8. Samples: 1228061418. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:12,099][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 12:52:12,284][26022] Updated weights on worker 0-0, policy_version 1199274 (0.00087) [2022-07-11 12:52:14,208][26022] Updated weights on worker 0-0, policy_version 1199284 (0.00091) [2022-07-11 12:52:15,838][26022] Updated weights on worker 0-0, policy_version 1199294 (0.00082) [2022-07-11 12:52:17,148][25689] Fps is (10 sec: 5576.6, 60 sec: 5556.3, 300 sec: 5539.2). Total num frames: 1228083200. Throughput: 0: 4881.0. Samples: 1228078278. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:17,149][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 12:52:17,865][26022] Updated weights on worker 0-0, policy_version 1199304 (0.00088) [2022-07-11 12:52:19,539][26022] Updated weights on worker 0-0, policy_version 1199314 (0.00086) [2022-07-11 12:52:21,723][26022] Updated weights on worker 0-0, policy_version 1199324 (0.00095) [2022-07-11 12:52:22,209][25689] Fps is (10 sec: 5569.6, 60 sec: 5535.6, 300 sec: 5545.0). Total num frames: 1228110848. Throughput: 0: 5700.0. Samples: 1228111758. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:22,210][25689] Avg episode reward: [(0, '-0.462')] [2022-07-11 12:52:23,312][26022] Updated weights on worker 0-0, policy_version 1199334 (0.00085) [2022-07-11 12:52:25,240][26022] Updated weights on worker 0-0, policy_version 1199344 (0.00932) [2022-07-11 12:52:26,926][26022] Updated weights on worker 0-0, policy_version 1199354 (0.00093) [2022-07-11 12:52:27,249][25689] Fps is (10 sec: 5575.7, 60 sec: 5550.4, 300 sec: 5552.2). Total num frames: 1228139520. Throughput: 0: 5805.1. Samples: 1228145164. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:27,250][25689] Avg episode reward: [(0, '-0.893')] [2022-07-11 12:52:28,803][26022] Updated weights on worker 0-0, policy_version 1199364 (0.00089) [2022-07-11 12:52:31,010][26022] Updated weights on worker 0-0, policy_version 1199374 (0.00086) [2022-07-11 12:52:32,340][25689] Fps is (10 sec: 5660.2, 60 sec: 5569.4, 300 sec: 5547.3). Total num frames: 1228168192. Throughput: 0: 4947.0. Samples: 1228161560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:32,342][25689] Avg episode reward: [(0, '-0.450')] [2022-07-11 12:52:32,539][26022] Updated weights on worker 0-0, policy_version 1199384 (0.00090) [2022-07-11 12:52:34,573][26022] Updated weights on worker 0-0, policy_version 1199394 (0.00061) [2022-07-11 12:52:36,074][26022] Updated weights on worker 0-0, policy_version 1199404 (0.00086) [2022-07-11 12:52:37,427][25689] Fps is (10 sec: 5332.7, 60 sec: 5513.6, 300 sec: 5539.1). Total num frames: 1228193792. Throughput: 0: 5755.5. Samples: 1228194990. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2022-07-11 12:52:37,427][25689] Avg episode reward: [(0, '-0.622')] [2022-07-11 12:52:38,357][26022] Updated weights on worker 0-0, policy_version 1199414 (0.00228) [2022-07-11 12:52:39,978][26022] Updated weights on worker 0-0, policy_version 1199424 (0.00089) [2022-07-11 12:52:41,930][26022] Updated weights on worker 0-0, policy_version 1199434 (0.00084) [2022-07-11 12:52:42,436][25689] Fps is (10 sec: 5477.4, 60 sec: 5551.3, 300 sec: 5543.4). Total num frames: 1228223488. Throughput: 0: 5756.6. Samples: 1228228192. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:52:42,436][25689] Avg episode reward: [(0, '-0.387')] [2022-07-11 12:52:43,524][26022] Updated weights on worker 0-0, policy_version 1199444 (0.00087) [2022-07-11 12:52:45,691][26022] Updated weights on worker 0-0, policy_version 1199454 (0.00094) [2022-07-11 12:52:47,080][26022] Updated weights on worker 0-0, policy_version 1199464 (0.00080) [2022-07-11 12:52:47,449][25689] Fps is (10 sec: 5823.6, 60 sec: 5551.5, 300 sec: 5548.0). Total num frames: 1228252160. Throughput: 0: 4952.6. Samples: 1228245206. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:52:47,450][25689] Avg episode reward: [(0, '-0.227')] [2022-07-11 12:52:49,183][26022] Updated weights on worker 0-0, policy_version 1199474 (0.00093) [2022-07-11 12:52:50,709][26022] Updated weights on worker 0-0, policy_version 1199484 (0.00087) [2022-07-11 12:52:52,536][25689] Fps is (10 sec: 5576.3, 60 sec: 5539.3, 300 sec: 5543.5). Total num frames: 1228279808. Throughput: 0: 5800.4. Samples: 1228278700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:52:52,536][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 12:52:52,675][26022] Updated weights on worker 0-0, policy_version 1199494 (0.00082) [2022-07-11 12:52:54,633][26022] Updated weights on worker 0-0, policy_version 1199504 (0.00088) [2022-07-11 12:52:56,362][26022] Updated weights on worker 0-0, policy_version 1199514 (0.00082) [2022-07-11 12:52:57,568][25689] Fps is (10 sec: 5363.7, 60 sec: 5506.8, 300 sec: 5543.0). Total num frames: 1228306432. Throughput: 0: 5826.1. Samples: 1228312334. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:52:57,569][25689] Avg episode reward: [(0, '1.218')] [2022-07-11 12:52:58,331][26022] Updated weights on worker 0-0, policy_version 1199524 (0.00090) [2022-07-11 12:53:00,288][26022] Updated weights on worker 0-0, policy_version 1199534 (0.00098) [2022-07-11 12:53:02,273][26022] Updated weights on worker 0-0, policy_version 1199544 (0.00099) [2022-07-11 12:53:02,642][25689] Fps is (10 sec: 5370.2, 60 sec: 5534.7, 300 sec: 5552.0). Total num frames: 1228334080. Throughput: 0: 4978.1. Samples: 1228328780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:02,642][25689] Avg episode reward: [(0, '1.177')] [2022-07-11 12:53:04,305][26022] Updated weights on worker 0-0, policy_version 1199554 (0.00089) [2022-07-11 12:53:06,132][26022] Updated weights on worker 0-0, policy_version 1199564 (0.00088) [2022-07-11 12:53:07,673][25689] Fps is (10 sec: 5471.7, 60 sec: 5523.3, 300 sec: 5542.1). Total num frames: 1228361728. Throughput: 0: 5681.1. Samples: 1228360102. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:07,674][25689] Avg episode reward: [(0, '1.051')] [2022-07-11 12:53:08,160][26022] Updated weights on worker 0-0, policy_version 1199574 (0.00095) [2022-07-11 12:53:09,768][26022] Updated weights on worker 0-0, policy_version 1199584 (0.00095) [2022-07-11 12:53:11,823][26022] Updated weights on worker 0-0, policy_version 1199594 (0.00084) [2022-07-11 12:53:12,742][25689] Fps is (10 sec: 5373.2, 60 sec: 5504.6, 300 sec: 5537.4). Total num frames: 1228388352. Throughput: 0: 5666.8. Samples: 1228393208. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:12,743][25689] Avg episode reward: [(0, '-0.358')] [2022-07-11 12:53:13,464][26022] Updated weights on worker 0-0, policy_version 1199604 (0.00082) [2022-07-11 12:53:15,579][26022] Updated weights on worker 0-0, policy_version 1199614 (0.00086) [2022-07-11 12:53:17,294][26022] Updated weights on worker 0-0, policy_version 1199624 (0.00085) [2022-07-11 12:53:17,755][25689] Fps is (10 sec: 5383.5, 60 sec: 5491.2, 300 sec: 5537.3). Total num frames: 1228416000. Throughput: 0: 5645.8. Samples: 1228426306. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:17,755][25689] Avg episode reward: [(0, '-1.740')] [2022-07-11 12:53:19,160][26022] Updated weights on worker 0-0, policy_version 1199634 (0.00097) [2022-07-11 12:53:21,139][26022] Updated weights on worker 0-0, policy_version 1199644 (0.00093) [2022-07-11 12:53:22,757][25689] Fps is (10 sec: 5623.7, 60 sec: 5513.4, 300 sec: 5534.4). Total num frames: 1228444672. Throughput: 0: 5684.6. Samples: 1228443128. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:22,758][25689] Avg episode reward: [(0, '-1.460')] [2022-07-11 12:53:22,808][26022] Updated weights on worker 0-0, policy_version 1199654 (0.00094) [2022-07-11 12:53:24,604][26022] Updated weights on worker 0-0, policy_version 1199664 (0.00087) [2022-07-11 12:53:26,586][26022] Updated weights on worker 0-0, policy_version 1199674 (0.00089) [2022-07-11 12:53:27,777][25689] Fps is (10 sec: 5619.2, 60 sec: 5498.3, 300 sec: 5538.2). Total num frames: 1228472320. Throughput: 0: 5782.5. Samples: 1228476354. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:27,779][25689] Avg episode reward: [(0, '-1.284')] [2022-07-11 12:53:28,227][26022] Updated weights on worker 0-0, policy_version 1199684 (0.00089) [2022-07-11 12:53:30,241][26022] Updated weights on worker 0-0, policy_version 1199694 (0.00054) [2022-07-11 12:53:32,047][26022] Updated weights on worker 0-0, policy_version 1199704 (0.00094) [2022-07-11 12:53:32,836][25689] Fps is (10 sec: 5384.6, 60 sec: 5467.4, 300 sec: 5530.7). Total num frames: 1228498944. Throughput: 0: 5806.1. Samples: 1228509876. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:32,837][25689] Avg episode reward: [(0, '-0.830')] [2022-07-11 12:53:33,852][26022] Updated weights on worker 0-0, policy_version 1199714 (0.00081) [2022-07-11 12:53:35,874][26022] Updated weights on worker 0-0, policy_version 1199724 (0.00080) [2022-07-11 12:53:37,451][26022] Updated weights on worker 0-0, policy_version 1199734 (0.00109) [2022-07-11 12:53:37,846][25689] Fps is (10 sec: 5593.5, 60 sec: 5542.0, 300 sec: 5534.0). Total num frames: 1228528640. Throughput: 0: 4988.6. Samples: 1228526536. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:37,847][25689] Avg episode reward: [(0, '-0.575')] [2022-07-11 12:53:39,575][26022] Updated weights on worker 0-0, policy_version 1199744 (0.00087) [2022-07-11 12:53:41,138][26022] Updated weights on worker 0-0, policy_version 1199754 (0.00087) [2022-07-11 12:53:42,860][25689] Fps is (10 sec: 5721.1, 60 sec: 5507.8, 300 sec: 5533.8). Total num frames: 1228556288. Throughput: 0: 5819.6. Samples: 1228560116. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:42,860][25689] Avg episode reward: [(0, '1.514')] [2022-07-11 12:53:43,085][26022] Updated weights on worker 0-0, policy_version 1199764 (0.00095) [2022-07-11 12:53:44,823][26022] Updated weights on worker 0-0, policy_version 1199774 (0.00093) [2022-07-11 12:53:46,755][26022] Updated weights on worker 0-0, policy_version 1199784 (0.00097) [2022-07-11 12:53:47,864][25689] Fps is (10 sec: 5724.4, 60 sec: 5525.5, 300 sec: 5541.6). Total num frames: 1228585984. Throughput: 0: 5845.5. Samples: 1228593770. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:47,865][25689] Avg episode reward: [(0, '1.884')] [2022-07-11 12:53:48,545][26022] Updated weights on worker 0-0, policy_version 1199794 (0.00088) [2022-07-11 12:53:50,472][26022] Updated weights on worker 0-0, policy_version 1199804 (0.00089) [2022-07-11 12:53:52,358][26022] Updated weights on worker 0-0, policy_version 1199814 (0.00092) [2022-07-11 12:53:52,984][25689] Fps is (10 sec: 5563.0, 60 sec: 5505.6, 300 sec: 5536.7). Total num frames: 1228612608. Throughput: 0: 5000.7. Samples: 1228610628. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:52,984][25689] Avg episode reward: [(0, '1.849')] [2022-07-11 12:53:54,116][26022] Updated weights on worker 0-0, policy_version 1199824 (0.00090) [2022-07-11 12:53:56,255][26022] Updated weights on worker 0-0, policy_version 1199834 (0.00088) [2022-07-11 12:53:57,570][26022] Updated weights on worker 0-0, policy_version 1199844 (0.00092) [2022-07-11 12:53:57,991][25689] Fps is (10 sec: 5561.0, 60 sec: 5558.6, 300 sec: 5536.9). Total num frames: 1228642304. Throughput: 0: 5836.7. Samples: 1228644118. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:53:57,993][25689] Avg episode reward: [(0, '1.847')] [2022-07-11 12:53:59,777][26022] Updated weights on worker 0-0, policy_version 1199854 (0.00095) [2022-07-11 12:54:01,364][26022] Updated weights on worker 0-0, policy_version 1199864 (0.00089) [2022-07-11 12:54:02,996][25689] Fps is (10 sec: 5318.1, 60 sec: 5497.1, 300 sec: 5530.2). Total num frames: 1228665856. Throughput: 0: 5724.4. Samples: 1228675386. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:02,997][25689] Avg episode reward: [(0, '1.749')] [2022-07-11 12:54:03,759][26022] Updated weights on worker 0-0, policy_version 1199874 (0.00089) [2022-07-11 12:54:05,415][26022] Updated weights on worker 0-0, policy_version 1199884 (0.00080) [2022-07-11 12:54:07,455][26022] Updated weights on worker 0-0, policy_version 1199894 (0.00088) [2022-07-11 12:54:08,001][25689] Fps is (10 sec: 5217.4, 60 sec: 5516.5, 300 sec: 5534.9). Total num frames: 1228694528. Throughput: 0: 4880.6. Samples: 1228692052. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:08,003][25689] Avg episode reward: [(0, '1.680')] [2022-07-11 12:54:08,676][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:54:08,686][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001199901_1228698624.pth [2022-07-11 12:54:08,686][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001197950_1226700800.pth [2022-07-11 12:54:09,244][26022] Updated weights on worker 0-0, policy_version 1199904 (0.00089) [2022-07-11 12:54:11,124][26022] Updated weights on worker 0-0, policy_version 1199914 (0.00095) [2022-07-11 12:54:12,771][26022] Updated weights on worker 0-0, policy_version 1199924 (0.00096) [2022-07-11 12:54:13,080][25689] Fps is (10 sec: 5686.7, 60 sec: 5549.6, 300 sec: 5530.3). Total num frames: 1228723200. Throughput: 0: 5700.2. Samples: 1228725182. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:13,081][25689] Avg episode reward: [(0, '1.265')] [2022-07-11 12:54:14,767][26022] Updated weights on worker 0-0, policy_version 1199934 (0.00088) [2022-07-11 12:54:16,494][26022] Updated weights on worker 0-0, policy_version 1199944 (0.00085) [2022-07-11 12:54:18,103][25689] Fps is (10 sec: 5676.7, 60 sec: 5565.6, 300 sec: 5533.6). Total num frames: 1228751872. Throughput: 0: 5711.4. Samples: 1228758982. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:18,103][25689] Avg episode reward: [(0, '1.420')] [2022-07-11 12:54:18,454][26022] Updated weights on worker 0-0, policy_version 1199954 (0.00088) [2022-07-11 12:54:20,303][26022] Updated weights on worker 0-0, policy_version 1199964 (0.00082) [2022-07-11 12:54:22,052][26022] Updated weights on worker 0-0, policy_version 1199974 (0.00088) [2022-07-11 12:54:23,133][25689] Fps is (10 sec: 5500.4, 60 sec: 5529.1, 300 sec: 5530.4). Total num frames: 1228778496. Throughput: 0: 4990.7. Samples: 1228775884. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:23,134][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 12:54:23,721][26022] Updated weights on worker 0-0, policy_version 1199984 (0.00079) [2022-07-11 12:54:25,466][26022] Updated weights on worker 0-0, policy_version 1199994 (0.00394) [2022-07-11 12:54:27,535][26022] Updated weights on worker 0-0, policy_version 1200004 (0.00093) [2022-07-11 12:54:28,143][25689] Fps is (10 sec: 5507.5, 60 sec: 5547.0, 300 sec: 5532.3). Total num frames: 1228807168. Throughput: 0: 5851.0. Samples: 1228809900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:28,143][25689] Avg episode reward: [(0, '0.208')] [2022-07-11 12:54:29,195][26022] Updated weights on worker 0-0, policy_version 1200014 (0.00090) [2022-07-11 12:54:31,249][26022] Updated weights on worker 0-0, policy_version 1200024 (0.00086) [2022-07-11 12:54:32,827][26022] Updated weights on worker 0-0, policy_version 1200034 (0.00086) [2022-07-11 12:54:33,179][25689] Fps is (10 sec: 5810.5, 60 sec: 5600.1, 300 sec: 5540.1). Total num frames: 1228836864. Throughput: 0: 5886.1. Samples: 1228843482. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:33,179][25689] Avg episode reward: [(0, '0.259')] [2022-07-11 12:54:34,878][26022] Updated weights on worker 0-0, policy_version 1200044 (0.00094) [2022-07-11 12:54:36,649][26022] Updated weights on worker 0-0, policy_version 1200054 (0.00096) [2022-07-11 12:54:38,252][25689] Fps is (10 sec: 5672.6, 60 sec: 5560.3, 300 sec: 5536.8). Total num frames: 1228864512. Throughput: 0: 5029.3. Samples: 1228860318. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:38,253][25689] Avg episode reward: [(0, '0.815')] [2022-07-11 12:54:38,293][26022] Updated weights on worker 0-0, policy_version 1200064 (0.00083) [2022-07-11 12:54:40,323][26022] Updated weights on worker 0-0, policy_version 1200074 (0.00085) [2022-07-11 12:54:41,992][26022] Updated weights on worker 0-0, policy_version 1200084 (0.00089) [2022-07-11 12:54:43,283][25689] Fps is (10 sec: 5371.3, 60 sec: 5541.7, 300 sec: 5533.3). Total num frames: 1228891136. Throughput: 0: 5864.2. Samples: 1228894044. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:43,283][25689] Avg episode reward: [(0, '0.734')] [2022-07-11 12:54:43,844][26022] Updated weights on worker 0-0, policy_version 1200094 (0.00085) [2022-07-11 12:54:45,653][26022] Updated weights on worker 0-0, policy_version 1200104 (0.00091) [2022-07-11 12:54:47,471][26022] Updated weights on worker 0-0, policy_version 1200114 (0.00089) [2022-07-11 12:54:48,307][25689] Fps is (10 sec: 5600.9, 60 sec: 5539.9, 300 sec: 5537.1). Total num frames: 1228920832. Throughput: 0: 5864.9. Samples: 1228928162. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:48,308][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 12:54:49,344][26022] Updated weights on worker 0-0, policy_version 1200124 (0.00086) [2022-07-11 12:54:51,221][26022] Updated weights on worker 0-0, policy_version 1200134 (0.00091) [2022-07-11 12:54:52,903][26022] Updated weights on worker 0-0, policy_version 1200144 (0.00103) [2022-07-11 12:54:53,379][25689] Fps is (10 sec: 5781.3, 60 sec: 5578.2, 300 sec: 5539.4). Total num frames: 1228949504. Throughput: 0: 5026.5. Samples: 1228945018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:53,379][25689] Avg episode reward: [(0, '2.047')] [2022-07-11 12:54:54,917][26022] Updated weights on worker 0-0, policy_version 1200154 (0.00102) [2022-07-11 12:54:56,627][26022] Updated weights on worker 0-0, policy_version 1200164 (0.00092) [2022-07-11 12:54:58,385][25689] Fps is (10 sec: 5588.5, 60 sec: 5544.4, 300 sec: 5539.7). Total num frames: 1228977152. Throughput: 0: 5877.9. Samples: 1228978660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:54:58,387][25689] Avg episode reward: [(0, '1.864')] [2022-07-11 12:54:58,482][26022] Updated weights on worker 0-0, policy_version 1200174 (0.00092) [2022-07-11 12:55:00,274][26022] Updated weights on worker 0-0, policy_version 1200184 (0.00079) [2022-07-11 12:55:02,580][26022] Updated weights on worker 0-0, policy_version 1200194 (0.00085) [2022-07-11 12:55:03,395][25689] Fps is (10 sec: 5418.6, 60 sec: 5594.9, 300 sec: 5539.9). Total num frames: 1229003776. Throughput: 0: 5766.7. Samples: 1229010024. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:03,397][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 12:55:04,531][26022] Updated weights on worker 0-0, policy_version 1200204 (0.00087) [2022-07-11 12:55:06,305][26022] Updated weights on worker 0-0, policy_version 1200214 (0.00083) [2022-07-11 12:55:07,975][26022] Updated weights on worker 0-0, policy_version 1200224 (0.00094) [2022-07-11 12:55:08,400][25689] Fps is (10 sec: 5419.2, 60 sec: 5577.9, 300 sec: 5538.3). Total num frames: 1229031424. Throughput: 0: 4906.7. Samples: 1229026750. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:08,401][25689] Avg episode reward: [(0, '-0.273')] [2022-07-11 12:55:10,079][26022] Updated weights on worker 0-0, policy_version 1200234 (0.00097) [2022-07-11 12:55:11,620][26022] Updated weights on worker 0-0, policy_version 1200244 (0.00083) [2022-07-11 12:55:13,543][25689] Fps is (10 sec: 5448.7, 60 sec: 5555.0, 300 sec: 5539.5). Total num frames: 1229059072. Throughput: 0: 5713.6. Samples: 1229060230. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:13,544][25689] Avg episode reward: [(0, '-0.163')] [2022-07-11 12:55:13,598][26022] Updated weights on worker 0-0, policy_version 1200254 (0.00093) [2022-07-11 12:55:15,407][26022] Updated weights on worker 0-0, policy_version 1200264 (0.00090) [2022-07-11 12:55:17,074][26022] Updated weights on worker 0-0, policy_version 1200274 (0.00083) [2022-07-11 12:55:18,566][25689] Fps is (10 sec: 5439.3, 60 sec: 5538.0, 300 sec: 5533.7). Total num frames: 1229086720. Throughput: 0: 5715.6. Samples: 1229094006. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:18,566][25689] Avg episode reward: [(0, '-0.109')] [2022-07-11 12:55:19,138][26022] Updated weights on worker 0-0, policy_version 1200284 (0.00099) [2022-07-11 12:55:20,819][26022] Updated weights on worker 0-0, policy_version 1200294 (0.00074) [2022-07-11 12:55:22,714][26022] Updated weights on worker 0-0, policy_version 1200304 (0.00087) [2022-07-11 12:55:23,603][25689] Fps is (10 sec: 5802.0, 60 sec: 5605.2, 300 sec: 5551.1). Total num frames: 1229117440. Throughput: 0: 4981.1. Samples: 1229110684. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:23,604][25689] Avg episode reward: [(0, '-0.223')] [2022-07-11 12:55:24,785][26022] Updated weights on worker 0-0, policy_version 1200314 (0.00089) [2022-07-11 12:55:26,099][26022] Updated weights on worker 0-0, policy_version 1200324 (0.00083) [2022-07-11 12:55:28,260][26022] Updated weights on worker 0-0, policy_version 1200334 (0.00090) [2022-07-11 12:55:28,610][25689] Fps is (10 sec: 5607.5, 60 sec: 5554.7, 300 sec: 5537.8). Total num frames: 1229143040. Throughput: 0: 5832.6. Samples: 1229144626. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:28,610][25689] Avg episode reward: [(0, '0.110')] [2022-07-11 12:55:29,954][26022] Updated weights on worker 0-0, policy_version 1200344 (0.00085) [2022-07-11 12:55:31,920][26022] Updated weights on worker 0-0, policy_version 1200354 (0.00089) [2022-07-11 12:55:33,632][26022] Updated weights on worker 0-0, policy_version 1200364 (0.00087) [2022-07-11 12:55:33,727][25689] Fps is (10 sec: 5462.1, 60 sec: 5547.2, 300 sec: 5546.3). Total num frames: 1229172736. Throughput: 0: 5829.5. Samples: 1229177890. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:33,727][25689] Avg episode reward: [(0, '1.295')] [2022-07-11 12:55:35,503][26022] Updated weights on worker 0-0, policy_version 1200374 (0.00086) [2022-07-11 12:55:37,353][26022] Updated weights on worker 0-0, policy_version 1200384 (0.00081) [2022-07-11 12:55:38,747][25689] Fps is (10 sec: 5656.8, 60 sec: 5552.1, 300 sec: 5547.5). Total num frames: 1229200384. Throughput: 0: 4990.6. Samples: 1229194720. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:38,747][25689] Avg episode reward: [(0, '2.533')] [2022-07-11 12:55:39,134][26022] Updated weights on worker 0-0, policy_version 1200394 (0.00090) [2022-07-11 12:55:41,254][26022] Updated weights on worker 0-0, policy_version 1200404 (0.00088) [2022-07-11 12:55:42,768][26022] Updated weights on worker 0-0, policy_version 1200414 (0.00088) [2022-07-11 12:55:43,766][25689] Fps is (10 sec: 5508.0, 60 sec: 5570.1, 300 sec: 5541.7). Total num frames: 1229228032. Throughput: 0: 5829.0. Samples: 1229228212. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:43,766][25689] Avg episode reward: [(0, '2.427')] [2022-07-11 12:55:44,793][26022] Updated weights on worker 0-0, policy_version 1200424 (0.00091) [2022-07-11 12:55:46,454][26022] Updated weights on worker 0-0, policy_version 1200434 (0.00081) [2022-07-11 12:55:48,376][26022] Updated weights on worker 0-0, policy_version 1200444 (0.00093) [2022-07-11 12:55:48,800][25689] Fps is (10 sec: 5704.0, 60 sec: 5569.2, 300 sec: 5545.3). Total num frames: 1229257728. Throughput: 0: 5796.5. Samples: 1229261660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:48,800][25689] Avg episode reward: [(0, '1.980')] [2022-07-11 12:55:50,342][26022] Updated weights on worker 0-0, policy_version 1200454 (0.00088) [2022-07-11 12:55:51,763][26022] Updated weights on worker 0-0, policy_version 1200464 (0.00097) [2022-07-11 12:55:53,842][25689] Fps is (10 sec: 5589.3, 60 sec: 5538.0, 300 sec: 5544.6). Total num frames: 1229284352. Throughput: 0: 4995.0. Samples: 1229278366. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:53,843][25689] Avg episode reward: [(0, '1.975')] [2022-07-11 12:55:53,961][26022] Updated weights on worker 0-0, policy_version 1200474 (0.00091) [2022-07-11 12:55:55,797][26022] Updated weights on worker 0-0, policy_version 1200484 (0.00084) [2022-07-11 12:55:57,539][26022] Updated weights on worker 0-0, policy_version 1200494 (0.00084) [2022-07-11 12:55:58,854][25689] Fps is (10 sec: 5499.9, 60 sec: 5554.5, 300 sec: 5549.4). Total num frames: 1229313024. Throughput: 0: 5841.1. Samples: 1229312168. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:55:58,854][25689] Avg episode reward: [(0, '1.994')] [2022-07-11 12:55:59,472][26022] Updated weights on worker 0-0, policy_version 1200504 (0.00085) [2022-07-11 12:56:01,111][26022] Updated weights on worker 0-0, policy_version 1200514 (0.00092) [2022-07-11 12:56:03,605][26022] Updated weights on worker 0-0, policy_version 1200524 (0.00088) [2022-07-11 12:56:03,869][25689] Fps is (10 sec: 5310.7, 60 sec: 5520.1, 300 sec: 5539.6). Total num frames: 1229337600. Throughput: 0: 5729.9. Samples: 1229343400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:03,871][25689] Avg episode reward: [(0, '1.820')] [2022-07-11 12:56:05,235][26022] Updated weights on worker 0-0, policy_version 1200534 (0.00093) [2022-07-11 12:56:07,117][26022] Updated weights on worker 0-0, policy_version 1200544 (0.00084) [2022-07-11 12:56:08,717][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:56:08,726][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001200553_1229366272.pth [2022-07-11 12:56:08,731][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001198603_1227369472.pth [2022-07-11 12:56:08,895][25689] Fps is (10 sec: 5303.2, 60 sec: 5535.2, 300 sec: 5543.4). Total num frames: 1229366272. Throughput: 0: 4894.7. Samples: 1229360018. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:08,897][25689] Avg episode reward: [(0, '1.676')] [2022-07-11 12:56:08,994][26022] Updated weights on worker 0-0, policy_version 1200554 (0.00093) [2022-07-11 12:56:10,867][26022] Updated weights on worker 0-0, policy_version 1200564 (0.00096) [2022-07-11 12:56:12,763][26022] Updated weights on worker 0-0, policy_version 1200574 (0.00085) [2022-07-11 12:56:13,977][25689] Fps is (10 sec: 5774.2, 60 sec: 5574.6, 300 sec: 5545.8). Total num frames: 1229395968. Throughput: 0: 5719.1. Samples: 1229393520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:13,978][25689] Avg episode reward: [(0, '1.575')] [2022-07-11 12:56:14,483][26022] Updated weights on worker 0-0, policy_version 1200584 (0.00091) [2022-07-11 12:56:16,450][26022] Updated weights on worker 0-0, policy_version 1200594 (0.00085) [2022-07-11 12:56:18,072][26022] Updated weights on worker 0-0, policy_version 1200604 (0.00615) [2022-07-11 12:56:18,980][25689] Fps is (10 sec: 5584.6, 60 sec: 5559.5, 300 sec: 5539.2). Total num frames: 1229422592. Throughput: 0: 5714.7. Samples: 1229427180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:18,981][25689] Avg episode reward: [(0, '1.734')] [2022-07-11 12:56:20,042][26022] Updated weights on worker 0-0, policy_version 1200614 (0.00085) [2022-07-11 12:56:21,863][26022] Updated weights on worker 0-0, policy_version 1200624 (0.00089) [2022-07-11 12:56:23,722][26022] Updated weights on worker 0-0, policy_version 1200634 (0.00087) [2022-07-11 12:56:24,000][25689] Fps is (10 sec: 5516.9, 60 sec: 5527.2, 300 sec: 5542.6). Total num frames: 1229451264. Throughput: 0: 4988.5. Samples: 1229443826. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:24,001][25689] Avg episode reward: [(0, '0.595')] [2022-07-11 12:56:25,619][26022] Updated weights on worker 0-0, policy_version 1200644 (0.00086) [2022-07-11 12:56:27,277][26022] Updated weights on worker 0-0, policy_version 1200654 (0.00085) [2022-07-11 12:56:29,002][25689] Fps is (10 sec: 5619.3, 60 sec: 5561.5, 300 sec: 5544.7). Total num frames: 1229478912. Throughput: 0: 5835.3. Samples: 1229477352. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:29,004][25689] Avg episode reward: [(0, '0.410')] [2022-07-11 12:56:29,118][26022] Updated weights on worker 0-0, policy_version 1200664 (0.00090) [2022-07-11 12:56:31,160][26022] Updated weights on worker 0-0, policy_version 1200674 (0.00087) [2022-07-11 12:56:32,978][26022] Updated weights on worker 0-0, policy_version 1200684 (0.00096) [2022-07-11 12:56:34,067][25689] Fps is (10 sec: 5391.4, 60 sec: 5515.4, 300 sec: 5537.1). Total num frames: 1229505536. Throughput: 0: 5833.6. Samples: 1229510714. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:34,067][25689] Avg episode reward: [(0, '0.869')] [2022-07-11 12:56:34,731][26022] Updated weights on worker 0-0, policy_version 1200694 (0.00085) [2022-07-11 12:56:36,285][26022] Updated weights on worker 0-0, policy_version 1200704 (0.00094) [2022-07-11 12:56:38,468][26022] Updated weights on worker 0-0, policy_version 1200714 (0.00096) [2022-07-11 12:56:39,074][25689] Fps is (10 sec: 5490.3, 60 sec: 5533.6, 300 sec: 5541.4). Total num frames: 1229534208. Throughput: 0: 4994.8. Samples: 1229527546. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2022-07-11 12:56:39,074][25689] Avg episode reward: [(0, '0.702')] [2022-07-11 12:56:40,105][26022] Updated weights on worker 0-0, policy_version 1200724 (0.00756) [2022-07-11 12:56:42,066][26022] Updated weights on worker 0-0, policy_version 1200734 (0.00092) [2022-07-11 12:56:43,977][26022] Updated weights on worker 0-0, policy_version 1200744 (0.00086) [2022-07-11 12:56:44,094][25689] Fps is (10 sec: 5616.9, 60 sec: 5533.5, 300 sec: 5537.9). Total num frames: 1229561856. Throughput: 0: 5824.0. Samples: 1229560850. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:56:44,094][25689] Avg episode reward: [(0, '0.678')] [2022-07-11 12:56:45,860][26022] Updated weights on worker 0-0, policy_version 1200754 (0.00086) [2022-07-11 12:56:47,494][26022] Updated weights on worker 0-0, policy_version 1200764 (0.00092) [2022-07-11 12:56:49,117][25689] Fps is (10 sec: 5607.6, 60 sec: 5517.5, 300 sec: 5540.0). Total num frames: 1229590528. Throughput: 0: 5809.1. Samples: 1229594202. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:56:49,118][25689] Avg episode reward: [(0, '0.783')] [2022-07-11 12:56:49,684][26022] Updated weights on worker 0-0, policy_version 1200774 (0.00086) [2022-07-11 12:56:51,257][26022] Updated weights on worker 0-0, policy_version 1200784 (0.00090) [2022-07-11 12:56:53,068][26022] Updated weights on worker 0-0, policy_version 1200794 (0.00092) [2022-07-11 12:56:54,227][25689] Fps is (10 sec: 5658.7, 60 sec: 5545.2, 300 sec: 5538.8). Total num frames: 1229619200. Throughput: 0: 5807.4. Samples: 1229627796. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:56:54,228][25689] Avg episode reward: [(0, '0.514')] [2022-07-11 12:56:54,979][26022] Updated weights on worker 0-0, policy_version 1200804 (0.00097) [2022-07-11 12:56:56,767][26022] Updated weights on worker 0-0, policy_version 1200814 (0.00089) [2022-07-11 12:56:58,669][26022] Updated weights on worker 0-0, policy_version 1200824 (0.00090) [2022-07-11 12:56:59,264][25689] Fps is (10 sec: 5651.5, 60 sec: 5542.9, 300 sec: 5548.7). Total num frames: 1229647872. Throughput: 0: 5796.7. Samples: 1229644582. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:56:59,265][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 12:57:00,423][26022] Updated weights on worker 0-0, policy_version 1200834 (0.00099) [2022-07-11 12:57:02,551][26022] Updated weights on worker 0-0, policy_version 1200844 (0.00385) [2022-07-11 12:57:04,285][25689] Fps is (10 sec: 5192.6, 60 sec: 5525.4, 300 sec: 5532.8). Total num frames: 1229671424. Throughput: 0: 5693.2. Samples: 1229675802. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:04,285][25689] Avg episode reward: [(0, '0.212')] [2022-07-11 12:57:04,684][26022] Updated weights on worker 0-0, policy_version 1200854 (0.00089) [2022-07-11 12:57:06,349][26022] Updated weights on worker 0-0, policy_version 1200864 (0.00083) [2022-07-11 12:57:08,314][26022] Updated weights on worker 0-0, policy_version 1200874 (0.00090) [2022-07-11 12:57:09,296][25689] Fps is (10 sec: 5205.5, 60 sec: 5526.7, 300 sec: 5536.9). Total num frames: 1229700096. Throughput: 0: 5696.4. Samples: 1229709150. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:09,297][25689] Avg episode reward: [(0, '-0.511')] [2022-07-11 12:57:09,903][26022] Updated weights on worker 0-0, policy_version 1200884 (0.00101) [2022-07-11 12:57:11,955][26022] Updated weights on worker 0-0, policy_version 1200894 (0.00086) [2022-07-11 12:57:13,926][26022] Updated weights on worker 0-0, policy_version 1200904 (0.00094) [2022-07-11 12:57:14,408][25689] Fps is (10 sec: 5664.3, 60 sec: 5507.1, 300 sec: 5535.8). Total num frames: 1229728768. Throughput: 0: 4859.2. Samples: 1229725860. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:14,409][25689] Avg episode reward: [(0, '-0.019')] [2022-07-11 12:57:15,591][26022] Updated weights on worker 0-0, policy_version 1200914 (0.00086) [2022-07-11 12:57:17,315][26022] Updated weights on worker 0-0, policy_version 1200924 (0.00093) [2022-07-11 12:57:19,235][26022] Updated weights on worker 0-0, policy_version 1200934 (0.00087) [2022-07-11 12:57:19,423][25689] Fps is (10 sec: 5561.5, 60 sec: 5522.9, 300 sec: 5536.6). Total num frames: 1229756416. Throughput: 0: 5706.8. Samples: 1229759626. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:19,424][25689] Avg episode reward: [(0, '0.118')] [2022-07-11 12:57:20,813][26022] Updated weights on worker 0-0, policy_version 1200944 (0.00086) [2022-07-11 12:57:22,833][26022] Updated weights on worker 0-0, policy_version 1200954 (0.00087) [2022-07-11 12:57:24,470][25689] Fps is (10 sec: 5699.4, 60 sec: 5537.5, 300 sec: 5540.0). Total num frames: 1229786112. Throughput: 0: 5834.7. Samples: 1229793576. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:24,470][25689] Avg episode reward: [(0, '1.270')] [2022-07-11 12:57:24,598][26022] Updated weights on worker 0-0, policy_version 1200964 (0.00087) [2022-07-11 12:57:26,589][26022] Updated weights on worker 0-0, policy_version 1200974 (0.00086) [2022-07-11 12:57:28,259][26022] Updated weights on worker 0-0, policy_version 1200984 (0.00092) [2022-07-11 12:57:29,472][25689] Fps is (10 sec: 5604.5, 60 sec: 5520.5, 300 sec: 5534.7). Total num frames: 1229812736. Throughput: 0: 5010.0. Samples: 1229810230. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:29,474][25689] Avg episode reward: [(0, '0.953')] [2022-07-11 12:57:30,159][26022] Updated weights on worker 0-0, policy_version 1200994 (0.00091) [2022-07-11 12:57:31,901][26022] Updated weights on worker 0-0, policy_version 1201004 (0.00092) [2022-07-11 12:57:34,090][26022] Updated weights on worker 0-0, policy_version 1201014 (0.00094) [2022-07-11 12:57:34,521][25689] Fps is (10 sec: 5501.5, 60 sec: 5555.8, 300 sec: 5545.8). Total num frames: 1229841408. Throughput: 0: 5859.1. Samples: 1229843702. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:34,521][25689] Avg episode reward: [(0, '1.274')] [2022-07-11 12:57:35,696][26022] Updated weights on worker 0-0, policy_version 1201024 (0.00086) [2022-07-11 12:57:37,647][26022] Updated weights on worker 0-0, policy_version 1201034 (0.00086) [2022-07-11 12:57:39,405][26022] Updated weights on worker 0-0, policy_version 1201044 (0.00073) [2022-07-11 12:57:39,537][25689] Fps is (10 sec: 5697.3, 60 sec: 5555.0, 300 sec: 5542.2). Total num frames: 1229870080. Throughput: 0: 5849.4. Samples: 1229877282. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:39,538][25689] Avg episode reward: [(0, '2.055')] [2022-07-11 12:57:41,254][26022] Updated weights on worker 0-0, policy_version 1201054 (0.00095) [2022-07-11 12:57:43,120][26022] Updated weights on worker 0-0, policy_version 1201064 (0.00085) [2022-07-11 12:57:44,567][25689] Fps is (10 sec: 5707.9, 60 sec: 5570.9, 300 sec: 5541.9). Total num frames: 1229898752. Throughput: 0: 5011.6. Samples: 1229894296. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:44,568][25689] Avg episode reward: [(0, '2.308')] [2022-07-11 12:57:44,924][26022] Updated weights on worker 0-0, policy_version 1201074 (0.00090) [2022-07-11 12:57:46,570][26022] Updated weights on worker 0-0, policy_version 1201084 (0.00086) [2022-07-11 12:57:48,654][26022] Updated weights on worker 0-0, policy_version 1201094 (0.00107) [2022-07-11 12:57:49,598][25689] Fps is (10 sec: 5598.2, 60 sec: 5553.4, 300 sec: 5542.9). Total num frames: 1229926400. Throughput: 0: 5874.6. Samples: 1229928458. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:49,598][25689] Avg episode reward: [(0, '1.609')] [2022-07-11 12:57:50,122][26022] Updated weights on worker 0-0, policy_version 1201104 (0.00086) [2022-07-11 12:57:52,400][26022] Updated weights on worker 0-0, policy_version 1201114 (0.00085) [2022-07-11 12:57:53,964][26022] Updated weights on worker 0-0, policy_version 1201124 (0.00090) [2022-07-11 12:57:54,648][25689] Fps is (10 sec: 5485.2, 60 sec: 5541.9, 300 sec: 5546.0). Total num frames: 1229954048. Throughput: 0: 5872.7. Samples: 1229961904. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:54,649][25689] Avg episode reward: [(0, '1.668')] [2022-07-11 12:57:55,800][26022] Updated weights on worker 0-0, policy_version 1201134 (0.00086) [2022-07-11 12:57:57,651][26022] Updated weights on worker 0-0, policy_version 1201144 (0.00096) [2022-07-11 12:57:59,474][26022] Updated weights on worker 0-0, policy_version 1201154 (0.00088) [2022-07-11 12:57:59,705][25689] Fps is (10 sec: 5572.1, 60 sec: 5540.1, 300 sec: 5549.8). Total num frames: 1229982720. Throughput: 0: 5025.8. Samples: 1229978642. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:57:59,706][25689] Avg episode reward: [(0, '1.779')] [2022-07-11 12:58:01,337][26022] Updated weights on worker 0-0, policy_version 1201164 (0.00095) [2022-07-11 12:58:03,634][26022] Updated weights on worker 0-0, policy_version 1201174 (0.00087) [2022-07-11 12:58:04,731][25689] Fps is (10 sec: 5382.9, 60 sec: 5573.5, 300 sec: 5543.0). Total num frames: 1230008320. Throughput: 0: 5743.4. Samples: 1230010102. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:04,731][25689] Avg episode reward: [(0, '1.868')] [2022-07-11 12:58:05,285][26022] Updated weights on worker 0-0, policy_version 1201184 (0.00086) [2022-07-11 12:58:07,228][26022] Updated weights on worker 0-0, policy_version 1201194 (0.00106) [2022-07-11 12:58:08,833][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 12:58:08,863][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001201204_1230032896.pth [2022-07-11 12:58:08,864][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001199254_1228036096.pth [2022-07-11 12:58:08,870][26022] Updated weights on worker 0-0, policy_version 1201204 (0.00087) [2022-07-11 12:58:09,739][25689] Fps is (10 sec: 5102.7, 60 sec: 5523.0, 300 sec: 5540.7). Total num frames: 1230033920. Throughput: 0: 5712.1. Samples: 1230043508. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:09,740][25689] Avg episode reward: [(0, '1.702')] [2022-07-11 12:58:10,963][26022] Updated weights on worker 0-0, policy_version 1201214 (0.00090) [2022-07-11 12:58:13,027][26022] Updated weights on worker 0-0, policy_version 1201224 (0.00086) [2022-07-11 12:58:14,507][26022] Updated weights on worker 0-0, policy_version 1201234 (0.00086) [2022-07-11 12:58:14,859][25689] Fps is (10 sec: 5763.1, 60 sec: 5590.1, 300 sec: 5555.9). Total num frames: 1230066688. Throughput: 0: 4861.6. Samples: 1230060158. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:14,859][25689] Avg episode reward: [(0, '1.457')] [2022-07-11 12:58:16,402][26022] Updated weights on worker 0-0, policy_version 1201244 (0.00080) [2022-07-11 12:58:18,265][26022] Updated weights on worker 0-0, policy_version 1201254 (0.00082) [2022-07-11 12:58:19,889][25689] Fps is (10 sec: 5750.7, 60 sec: 5554.8, 300 sec: 5545.1). Total num frames: 1230092288. Throughput: 0: 5695.9. Samples: 1230093606. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:19,889][25689] Avg episode reward: [(0, '1.977')] [2022-07-11 12:58:19,985][26022] Updated weights on worker 0-0, policy_version 1201264 (0.00094) [2022-07-11 12:58:22,019][26022] Updated weights on worker 0-0, policy_version 1201274 (0.00083) [2022-07-11 12:58:23,751][26022] Updated weights on worker 0-0, policy_version 1201284 (0.00093) [2022-07-11 12:58:24,929][25689] Fps is (10 sec: 5287.4, 60 sec: 5521.5, 300 sec: 5544.7). Total num frames: 1230119936. Throughput: 0: 5789.3. Samples: 1230127038. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:24,930][25689] Avg episode reward: [(0, '1.845')] [2022-07-11 12:58:25,755][26022] Updated weights on worker 0-0, policy_version 1201294 (0.00095) [2022-07-11 12:58:27,464][26022] Updated weights on worker 0-0, policy_version 1201304 (0.00086) [2022-07-11 12:58:29,279][26022] Updated weights on worker 0-0, policy_version 1201314 (0.00096) [2022-07-11 12:58:29,984][25689] Fps is (10 sec: 5579.1, 60 sec: 5550.6, 300 sec: 5551.7). Total num frames: 1230148608. Throughput: 0: 4937.0. Samples: 1230143456. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:29,984][25689] Avg episode reward: [(0, '1.734')] [2022-07-11 12:58:31,238][26022] Updated weights on worker 0-0, policy_version 1201324 (0.00086) [2022-07-11 12:58:33,015][26022] Updated weights on worker 0-0, policy_version 1201334 (0.00127) [2022-07-11 12:58:35,013][26022] Updated weights on worker 0-0, policy_version 1201344 (0.00082) [2022-07-11 12:58:35,046][25689] Fps is (10 sec: 5566.9, 60 sec: 5532.4, 300 sec: 5543.8). Total num frames: 1230176256. Throughput: 0: 5793.0. Samples: 1230177104. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:35,047][25689] Avg episode reward: [(0, '1.722')] [2022-07-11 12:58:36,616][26022] Updated weights on worker 0-0, policy_version 1201354 (0.00095) [2022-07-11 12:58:38,392][26022] Updated weights on worker 0-0, policy_version 1201364 (0.00089) [2022-07-11 12:58:40,098][25689] Fps is (10 sec: 5568.5, 60 sec: 5529.2, 300 sec: 5546.5). Total num frames: 1230204928. Throughput: 0: 5797.3. Samples: 1230210762. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:40,098][25689] Avg episode reward: [(0, '1.746')] [2022-07-11 12:58:40,416][26022] Updated weights on worker 0-0, policy_version 1201374 (0.00124) [2022-07-11 12:58:41,903][26022] Updated weights on worker 0-0, policy_version 1201384 (0.00084) [2022-07-11 12:58:44,008][26022] Updated weights on worker 0-0, policy_version 1201394 (0.00081) [2022-07-11 12:58:45,115][25689] Fps is (10 sec: 5797.1, 60 sec: 5547.3, 300 sec: 5546.3). Total num frames: 1230234624. Throughput: 0: 4976.0. Samples: 1230227480. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:45,115][25689] Avg episode reward: [(0, '1.473')] [2022-07-11 12:58:45,851][26022] Updated weights on worker 0-0, policy_version 1201404 (0.00088) [2022-07-11 12:58:47,484][26022] Updated weights on worker 0-0, policy_version 1201414 (0.00086) [2022-07-11 12:58:49,531][26022] Updated weights on worker 0-0, policy_version 1201424 (0.00057) [2022-07-11 12:58:50,142][25689] Fps is (10 sec: 5708.9, 60 sec: 5547.5, 300 sec: 5551.5). Total num frames: 1230262272. Throughput: 0: 5844.9. Samples: 1230261282. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:50,155][25689] Avg episode reward: [(0, '1.216')] [2022-07-11 12:58:51,085][26022] Updated weights on worker 0-0, policy_version 1201434 (0.00089) [2022-07-11 12:58:53,269][26022] Updated weights on worker 0-0, policy_version 1201444 (0.00082) [2022-07-11 12:58:54,984][26022] Updated weights on worker 0-0, policy_version 1201454 (0.00052) [2022-07-11 12:58:55,254][25689] Fps is (10 sec: 5453.8, 60 sec: 5542.0, 300 sec: 5542.6). Total num frames: 1230289920. Throughput: 0: 5819.4. Samples: 1230294700. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:58:55,254][25689] Avg episode reward: [(0, '1.134')] [2022-07-11 12:58:56,702][26022] Updated weights on worker 0-0, policy_version 1201464 (0.00083) [2022-07-11 12:58:58,575][26022] Updated weights on worker 0-0, policy_version 1201474 (0.00082) [2022-07-11 12:59:00,261][25689] Fps is (10 sec: 5464.7, 60 sec: 5529.6, 300 sec: 5556.4). Total num frames: 1230317568. Throughput: 0: 4998.3. Samples: 1230311546. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:00,262][25689] Avg episode reward: [(0, '0.988')] [2022-07-11 12:59:00,511][26022] Updated weights on worker 0-0, policy_version 1201484 (0.00086) [2022-07-11 12:59:02,581][26022] Updated weights on worker 0-0, policy_version 1201494 (0.00086) [2022-07-11 12:59:04,623][26022] Updated weights on worker 0-0, policy_version 1201504 (0.00087) [2022-07-11 12:59:05,274][25689] Fps is (10 sec: 5416.2, 60 sec: 5547.6, 300 sec: 5549.3). Total num frames: 1230344192. Throughput: 0: 5743.1. Samples: 1230343258. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:05,276][25689] Avg episode reward: [(0, '0.405')] [2022-07-11 12:59:06,099][26022] Updated weights on worker 0-0, policy_version 1201514 (0.00096) [2022-07-11 12:59:08,257][26022] Updated weights on worker 0-0, policy_version 1201524 (0.00095) [2022-07-11 12:59:09,896][26022] Updated weights on worker 0-0, policy_version 1201534 (0.00085) [2022-07-11 12:59:10,303][25689] Fps is (10 sec: 5404.7, 60 sec: 5579.6, 300 sec: 5546.8). Total num frames: 1230371840. Throughput: 0: 5726.5. Samples: 1230376730. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:10,303][25689] Avg episode reward: [(0, '0.221')] [2022-07-11 12:59:11,995][26022] Updated weights on worker 0-0, policy_version 1201544 (0.00080) [2022-07-11 12:59:13,740][26022] Updated weights on worker 0-0, policy_version 1201554 (0.00091) [2022-07-11 12:59:15,378][25689] Fps is (10 sec: 5472.9, 60 sec: 5499.1, 300 sec: 5542.4). Total num frames: 1230399488. Throughput: 0: 5726.8. Samples: 1230409946. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:15,378][25689] Avg episode reward: [(0, '0.428')] [2022-07-11 12:59:15,677][26022] Updated weights on worker 0-0, policy_version 1201564 (0.00087) [2022-07-11 12:59:17,290][26022] Updated weights on worker 0-0, policy_version 1201574 (0.00094) [2022-07-11 12:59:19,335][26022] Updated weights on worker 0-0, policy_version 1201584 (0.00083) [2022-07-11 12:59:20,385][25689] Fps is (10 sec: 5586.2, 60 sec: 5552.0, 300 sec: 5549.7). Total num frames: 1230428160. Throughput: 0: 5729.8. Samples: 1230426850. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:20,385][25689] Avg episode reward: [(0, '-0.262')] [2022-07-11 12:59:20,951][26022] Updated weights on worker 0-0, policy_version 1201594 (0.00089) [2022-07-11 12:59:23,139][26022] Updated weights on worker 0-0, policy_version 1201604 (0.00091) [2022-07-11 12:59:24,574][26022] Updated weights on worker 0-0, policy_version 1201614 (0.00093) [2022-07-11 12:59:25,443][25689] Fps is (10 sec: 5595.5, 60 sec: 5550.4, 300 sec: 5545.4). Total num frames: 1230455808. Throughput: 0: 5800.5. Samples: 1230460246. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:25,445][25689] Avg episode reward: [(0, '-0.950')] [2022-07-11 12:59:26,789][26022] Updated weights on worker 0-0, policy_version 1201624 (0.00081) [2022-07-11 12:59:28,382][26022] Updated weights on worker 0-0, policy_version 1201634 (0.00088) [2022-07-11 12:59:30,495][25689] Fps is (10 sec: 5469.4, 60 sec: 5533.7, 300 sec: 5538.2). Total num frames: 1230483456. Throughput: 0: 5790.7. Samples: 1230493656. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:30,496][26022] Updated weights on worker 0-0, policy_version 1201644 (0.00078) [2022-07-11 12:59:30,495][25689] Avg episode reward: [(0, '-2.736')] [2022-07-11 12:59:32,055][26022] Updated weights on worker 0-0, policy_version 1201654 (0.00086) [2022-07-11 12:59:34,019][26022] Updated weights on worker 0-0, policy_version 1201664 (0.00087) [2022-07-11 12:59:35,568][25689] Fps is (10 sec: 5663.4, 60 sec: 5566.5, 300 sec: 5545.1). Total num frames: 1230513152. Throughput: 0: 4973.3. Samples: 1230510360. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:35,569][25689] Avg episode reward: [(0, '-2.563')] [2022-07-11 12:59:35,611][26022] Updated weights on worker 0-0, policy_version 1201674 (0.00093) [2022-07-11 12:59:37,810][26022] Updated weights on worker 0-0, policy_version 1201684 (0.00094) [2022-07-11 12:59:39,373][26022] Updated weights on worker 0-0, policy_version 1201694 (0.00106) [2022-07-11 12:59:40,586][25689] Fps is (10 sec: 5580.9, 60 sec: 5535.7, 300 sec: 5545.3). Total num frames: 1230539776. Throughput: 0: 5784.0. Samples: 1230543696. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:40,587][25689] Avg episode reward: [(0, '-2.705')] [2022-07-11 12:59:41,448][26022] Updated weights on worker 0-0, policy_version 1201704 (0.00091) [2022-07-11 12:59:43,135][26022] Updated weights on worker 0-0, policy_version 1201714 (0.00083) [2022-07-11 12:59:45,039][26022] Updated weights on worker 0-0, policy_version 1201724 (0.00088) [2022-07-11 12:59:45,592][25689] Fps is (10 sec: 5516.4, 60 sec: 5519.8, 300 sec: 5542.2). Total num frames: 1230568448. Throughput: 0: 5798.7. Samples: 1230577086. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:45,593][25689] Avg episode reward: [(0, '-2.659')] [2022-07-11 12:59:46,894][26022] Updated weights on worker 0-0, policy_version 1201734 (0.00092) [2022-07-11 12:59:48,858][26022] Updated weights on worker 0-0, policy_version 1201744 (0.00053) [2022-07-11 12:59:50,510][26022] Updated weights on worker 0-0, policy_version 1201754 (0.00091) [2022-07-11 12:59:50,656][25689] Fps is (10 sec: 5694.8, 60 sec: 5533.4, 300 sec: 5542.4). Total num frames: 1230597120. Throughput: 0: 4958.2. Samples: 1230593618. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:50,656][25689] Avg episode reward: [(0, '-1.026')] [2022-07-11 12:59:52,572][26022] Updated weights on worker 0-0, policy_version 1201764 (0.00089) [2022-07-11 12:59:54,292][26022] Updated weights on worker 0-0, policy_version 1201774 (0.00087) [2022-07-11 12:59:55,730][25689] Fps is (10 sec: 5454.4, 60 sec: 5519.9, 300 sec: 5537.7). Total num frames: 1230623744. Throughput: 0: 5774.4. Samples: 1230626782. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 12:59:55,730][25689] Avg episode reward: [(0, '0.036')] [2022-07-11 12:59:56,192][26022] Updated weights on worker 0-0, policy_version 1201784 (0.00090) [2022-07-11 12:59:57,915][26022] Updated weights on worker 0-0, policy_version 1201794 (0.00084) [2022-07-11 12:59:59,840][26022] Updated weights on worker 0-0, policy_version 1201804 (0.00099) [2022-07-11 13:00:00,759][25689] Fps is (10 sec: 5473.4, 60 sec: 5534.9, 300 sec: 5544.2). Total num frames: 1230652416. Throughput: 0: 5786.1. Samples: 1230660414. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:00,759][25689] Avg episode reward: [(0, '1.120')] [2022-07-11 13:00:01,577][26022] Updated weights on worker 0-0, policy_version 1201814 (0.00086) [2022-07-11 13:00:03,915][26022] Updated weights on worker 0-0, policy_version 1201824 (0.00090) [2022-07-11 13:00:05,560][26022] Updated weights on worker 0-0, policy_version 1201834 (0.00088) [2022-07-11 13:00:05,791][25689] Fps is (10 sec: 5496.2, 60 sec: 5533.2, 300 sec: 5540.2). Total num frames: 1230679040. Throughput: 0: 4853.8. Samples: 1230675126. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:05,791][25689] Avg episode reward: [(0, '1.514')] [2022-07-11 13:00:07,549][26022] Updated weights on worker 0-0, policy_version 1201844 (0.00091) [2022-07-11 13:00:08,959][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:00:08,969][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001201852_1230696448.pth [2022-07-11 13:00:08,970][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001199901_1228698624.pth [2022-07-11 13:00:09,435][26022] Updated weights on worker 0-0, policy_version 1201854 (0.00088) [2022-07-11 13:00:10,800][25689] Fps is (10 sec: 5302.7, 60 sec: 5518.0, 300 sec: 5539.3). Total num frames: 1230705664. Throughput: 0: 5703.5. Samples: 1230708510. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:10,802][25689] Avg episode reward: [(0, '2.243')] [2022-07-11 13:00:11,198][26022] Updated weights on worker 0-0, policy_version 1201864 (0.00082) [2022-07-11 13:00:13,203][26022] Updated weights on worker 0-0, policy_version 1201874 (0.00088) [2022-07-11 13:00:14,791][26022] Updated weights on worker 0-0, policy_version 1201884 (0.00084) [2022-07-11 13:00:15,893][25689] Fps is (10 sec: 5473.5, 60 sec: 5533.3, 300 sec: 5541.4). Total num frames: 1230734336. Throughput: 0: 5702.9. Samples: 1230741770. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:15,894][25689] Avg episode reward: [(0, '2.087')] [2022-07-11 13:00:16,790][26022] Updated weights on worker 0-0, policy_version 1201894 (0.00093) [2022-07-11 13:00:18,535][26022] Updated weights on worker 0-0, policy_version 1201904 (0.00624) [2022-07-11 13:00:20,359][26022] Updated weights on worker 0-0, policy_version 1201914 (0.00094) [2022-07-11 13:00:20,901][25689] Fps is (10 sec: 5677.3, 60 sec: 5533.2, 300 sec: 5535.1). Total num frames: 1230763008. Throughput: 0: 4870.9. Samples: 1230758524. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:20,901][25689] Avg episode reward: [(0, '2.121')] [2022-07-11 13:00:22,260][26022] Updated weights on worker 0-0, policy_version 1201924 (0.00085) [2022-07-11 13:00:24,130][26022] Updated weights on worker 0-0, policy_version 1201934 (0.00094) [2022-07-11 13:00:25,912][25689] Fps is (10 sec: 5519.2, 60 sec: 5520.6, 300 sec: 5538.4). Total num frames: 1230789632. Throughput: 0: 5811.5. Samples: 1230792060. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:25,912][25689] Avg episode reward: [(0, '1.963')] [2022-07-11 13:00:25,932][26022] Updated weights on worker 0-0, policy_version 1201944 (0.00089) [2022-07-11 13:00:27,811][26022] Updated weights on worker 0-0, policy_version 1201954 (0.00087) [2022-07-11 13:00:29,670][26022] Updated weights on worker 0-0, policy_version 1201964 (0.00090) [2022-07-11 13:00:30,961][25689] Fps is (10 sec: 5496.6, 60 sec: 5537.8, 300 sec: 5536.3). Total num frames: 1230818304. Throughput: 0: 5794.4. Samples: 1230825328. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:30,961][25689] Avg episode reward: [(0, '1.793')] [2022-07-11 13:00:31,673][26022] Updated weights on worker 0-0, policy_version 1201974 (0.00086) [2022-07-11 13:00:33,295][26022] Updated weights on worker 0-0, policy_version 1201984 (0.00090) [2022-07-11 13:00:35,318][26022] Updated weights on worker 0-0, policy_version 1201994 (0.00081) [2022-07-11 13:00:36,023][25689] Fps is (10 sec: 5671.3, 60 sec: 5521.9, 300 sec: 5538.9). Total num frames: 1230846976. Throughput: 0: 4973.0. Samples: 1230841876. Policy #0 lag: (min: 0.0, avg: 7.5, max: 17.0) [2022-07-11 13:00:36,024][25689] Avg episode reward: [(0, '2.007')] [2022-07-11 13:00:36,828][26022] Updated weights on worker 0-0, policy_version 1202004 (0.00087) [2022-07-11 13:00:39,030][26022] Updated weights on worker 0-0, policy_version 1202014 (0.00118) [2022-07-11 13:00:40,569][26022] Updated weights on worker 0-0, policy_version 1202024 (0.00085) [2022-07-11 13:00:41,079][25689] Fps is (10 sec: 5464.8, 60 sec: 5518.4, 300 sec: 5534.8). Total num frames: 1230873600. Throughput: 0: 5784.1. Samples: 1230875240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:00:41,080][25689] Avg episode reward: [(0, '0.857')] [2022-07-11 13:00:42,664][26022] Updated weights on worker 0-0, policy_version 1202034 (0.00085) [2022-07-11 13:00:44,390][26022] Updated weights on worker 0-0, policy_version 1202044 (0.00090) [2022-07-11 13:00:46,087][25689] Fps is (10 sec: 5392.6, 60 sec: 5501.3, 300 sec: 5528.4). Total num frames: 1230901248. Throughput: 0: 5788.1. Samples: 1230908838. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:00:46,088][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 13:00:46,356][26022] Updated weights on worker 0-0, policy_version 1202054 (0.00087) [2022-07-11 13:00:48,003][26022] Updated weights on worker 0-0, policy_version 1202064 (0.00092) [2022-07-11 13:00:49,965][26022] Updated weights on worker 0-0, policy_version 1202074 (0.00086) [2022-07-11 13:00:51,122][25689] Fps is (10 sec: 5608.0, 60 sec: 5503.9, 300 sec: 5535.4). Total num frames: 1230929920. Throughput: 0: 4978.1. Samples: 1230925694. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:00:51,123][25689] Avg episode reward: [(0, '0.942')] [2022-07-11 13:00:51,636][26022] Updated weights on worker 0-0, policy_version 1202084 (0.00084) [2022-07-11 13:00:53,719][26022] Updated weights on worker 0-0, policy_version 1202094 (0.00091) [2022-07-11 13:00:55,412][26022] Updated weights on worker 0-0, policy_version 1202104 (0.00095) [2022-07-11 13:00:56,197][25689] Fps is (10 sec: 5672.1, 60 sec: 5537.7, 300 sec: 5534.2). Total num frames: 1230958592. Throughput: 0: 5803.6. Samples: 1230958958. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:00:56,198][25689] Avg episode reward: [(0, '0.981')] [2022-07-11 13:00:57,356][26022] Updated weights on worker 0-0, policy_version 1202114 (0.00088) [2022-07-11 13:00:59,028][26022] Updated weights on worker 0-0, policy_version 1202124 (0.00092) [2022-07-11 13:01:00,926][26022] Updated weights on worker 0-0, policy_version 1202134 (0.00097) [2022-07-11 13:01:01,259][25689] Fps is (10 sec: 5656.9, 60 sec: 5534.6, 300 sec: 5547.1). Total num frames: 1230987264. Throughput: 0: 5815.4. Samples: 1230992594. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:01,260][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 13:01:03,170][26022] Updated weights on worker 0-0, policy_version 1202144 (0.00091) [2022-07-11 13:01:04,715][26022] Updated weights on worker 0-0, policy_version 1202154 (0.00084) [2022-07-11 13:01:06,346][25689] Fps is (10 sec: 5347.7, 60 sec: 5512.7, 300 sec: 5535.7). Total num frames: 1231012864. Throughput: 0: 4865.6. Samples: 1231007408. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:06,346][25689] Avg episode reward: [(0, '0.691')] [2022-07-11 13:01:06,770][26022] Updated weights on worker 0-0, policy_version 1202164 (0.00083) [2022-07-11 13:01:08,346][26022] Updated weights on worker 0-0, policy_version 1202174 (0.00094) [2022-07-11 13:01:10,391][26022] Updated weights on worker 0-0, policy_version 1202184 (0.00085) [2022-07-11 13:01:11,411][25689] Fps is (10 sec: 5346.2, 60 sec: 5541.5, 300 sec: 5532.6). Total num frames: 1231041536. Throughput: 0: 5689.7. Samples: 1231041130. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:11,412][25689] Avg episode reward: [(0, '1.027')] [2022-07-11 13:01:12,303][26022] Updated weights on worker 0-0, policy_version 1202194 (0.00369) [2022-07-11 13:01:14,023][26022] Updated weights on worker 0-0, policy_version 1202204 (0.00090) [2022-07-11 13:01:15,923][26022] Updated weights on worker 0-0, policy_version 1202214 (0.00087) [2022-07-11 13:01:16,464][25689] Fps is (10 sec: 5667.4, 60 sec: 5545.1, 300 sec: 5538.5). Total num frames: 1231070208. Throughput: 0: 5707.8. Samples: 1231074638. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:16,465][25689] Avg episode reward: [(0, '0.919')] [2022-07-11 13:01:17,933][26022] Updated weights on worker 0-0, policy_version 1202224 (0.00097) [2022-07-11 13:01:19,459][26022] Updated weights on worker 0-0, policy_version 1202234 (0.00088) [2022-07-11 13:01:21,471][25689] Fps is (10 sec: 5496.4, 60 sec: 5511.4, 300 sec: 5531.9). Total num frames: 1231096832. Throughput: 0: 4895.4. Samples: 1231091540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:21,471][25689] Avg episode reward: [(0, '0.582')] [2022-07-11 13:01:21,611][26022] Updated weights on worker 0-0, policy_version 1202244 (0.00094) [2022-07-11 13:01:23,099][26022] Updated weights on worker 0-0, policy_version 1202254 (0.00082) [2022-07-11 13:01:25,124][26022] Updated weights on worker 0-0, policy_version 1202264 (0.00087) [2022-07-11 13:01:26,551][25689] Fps is (10 sec: 5481.8, 60 sec: 5538.8, 300 sec: 5533.8). Total num frames: 1231125504. Throughput: 0: 5831.5. Samples: 1231125236. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:26,552][25689] Avg episode reward: [(0, '-0.918')] [2022-07-11 13:01:26,960][26022] Updated weights on worker 0-0, policy_version 1202274 (0.00091) [2022-07-11 13:01:28,663][26022] Updated weights on worker 0-0, policy_version 1202284 (0.00088) [2022-07-11 13:01:30,783][26022] Updated weights on worker 0-0, policy_version 1202294 (0.00090) [2022-07-11 13:01:31,644][25689] Fps is (10 sec: 5737.5, 60 sec: 5551.7, 300 sec: 5543.6). Total num frames: 1231155200. Throughput: 0: 5806.1. Samples: 1231158606. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:31,644][25689] Avg episode reward: [(0, '-0.292')] [2022-07-11 13:01:32,245][26022] Updated weights on worker 0-0, policy_version 1202304 (0.00089) [2022-07-11 13:01:34,330][26022] Updated weights on worker 0-0, policy_version 1202314 (0.00090) [2022-07-11 13:01:35,915][26022] Updated weights on worker 0-0, policy_version 1202324 (0.00082) [2022-07-11 13:01:36,736][25689] Fps is (10 sec: 5630.4, 60 sec: 5532.1, 300 sec: 5538.6). Total num frames: 1231182848. Throughput: 0: 5815.7. Samples: 1231192534. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:36,737][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 13:01:37,882][26022] Updated weights on worker 0-0, policy_version 1202334 (0.00087) [2022-07-11 13:01:39,699][26022] Updated weights on worker 0-0, policy_version 1202344 (0.00081) [2022-07-11 13:01:41,672][26022] Updated weights on worker 0-0, policy_version 1202354 (0.00084) [2022-07-11 13:01:41,753][25689] Fps is (10 sec: 5571.1, 60 sec: 5569.4, 300 sec: 5542.1). Total num frames: 1231211520. Throughput: 0: 5813.9. Samples: 1231209460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:41,754][25689] Avg episode reward: [(0, '0.452')] [2022-07-11 13:01:43,314][26022] Updated weights on worker 0-0, policy_version 1202364 (0.00086) [2022-07-11 13:01:45,171][26022] Updated weights on worker 0-0, policy_version 1202374 (0.00083) [2022-07-11 13:01:46,797][25689] Fps is (10 sec: 5597.7, 60 sec: 5566.1, 300 sec: 5538.3). Total num frames: 1231239168. Throughput: 0: 5818.6. Samples: 1231243038. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:46,798][25689] Avg episode reward: [(0, '0.508')] [2022-07-11 13:01:46,982][26022] Updated weights on worker 0-0, policy_version 1202384 (0.00089) [2022-07-11 13:01:48,688][26022] Updated weights on worker 0-0, policy_version 1202394 (0.00086) [2022-07-11 13:01:50,634][26022] Updated weights on worker 0-0, policy_version 1202404 (0.00092) [2022-07-11 13:01:51,860][25689] Fps is (10 sec: 5674.1, 60 sec: 5580.5, 300 sec: 5542.6). Total num frames: 1231268864. Throughput: 0: 5852.7. Samples: 1231276922. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:51,860][25689] Avg episode reward: [(0, '1.548')] [2022-07-11 13:01:52,346][26022] Updated weights on worker 0-0, policy_version 1202414 (0.00080) [2022-07-11 13:01:54,298][26022] Updated weights on worker 0-0, policy_version 1202424 (0.00083) [2022-07-11 13:01:56,159][26022] Updated weights on worker 0-0, policy_version 1202434 (0.00091) [2022-07-11 13:01:56,964][25689] Fps is (10 sec: 5640.4, 60 sec: 5560.9, 300 sec: 5537.9). Total num frames: 1231296512. Throughput: 0: 4995.4. Samples: 1231293580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:01:56,964][25689] Avg episode reward: [(0, '2.124')] [2022-07-11 13:01:57,931][26022] Updated weights on worker 0-0, policy_version 1202444 (0.00086) [2022-07-11 13:01:59,664][26022] Updated weights on worker 0-0, policy_version 1202454 (0.00089) [2022-07-11 13:02:02,012][25689] Fps is (10 sec: 5244.9, 60 sec: 5511.6, 300 sec: 5544.3). Total num frames: 1231322112. Throughput: 0: 5806.2. Samples: 1231327086. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:02,013][25689] Avg episode reward: [(0, '1.673')] [2022-07-11 13:02:02,077][26022] Updated weights on worker 0-0, policy_version 1202464 (0.00082) [2022-07-11 13:02:03,847][26022] Updated weights on worker 0-0, policy_version 1202474 (0.00486) [2022-07-11 13:02:05,744][26022] Updated weights on worker 0-0, policy_version 1202484 (0.00094) [2022-07-11 13:02:07,045][25689] Fps is (10 sec: 5383.3, 60 sec: 5567.1, 300 sec: 5543.9). Total num frames: 1231350784. Throughput: 0: 5701.3. Samples: 1231358480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:07,046][25689] Avg episode reward: [(0, '0.692')] [2022-07-11 13:02:07,430][26022] Updated weights on worker 0-0, policy_version 1202494 (0.00086) [2022-07-11 13:02:09,193][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:02:09,206][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001202504_1231364096.pth [2022-07-11 13:02:09,206][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001200553_1229366272.pth [2022-07-11 13:02:09,210][26022] Updated weights on worker 0-0, policy_version 1202504 (0.00093) [2022-07-11 13:02:11,273][26022] Updated weights on worker 0-0, policy_version 1202514 (0.00083) [2022-07-11 13:02:12,055][25689] Fps is (10 sec: 5608.0, 60 sec: 5555.3, 300 sec: 5542.4). Total num frames: 1231378432. Throughput: 0: 4886.8. Samples: 1231375612. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:12,055][25689] Avg episode reward: [(0, '-1.029')] [2022-07-11 13:02:12,841][26022] Updated weights on worker 0-0, policy_version 1202524 (0.00082) [2022-07-11 13:02:14,900][26022] Updated weights on worker 0-0, policy_version 1202534 (0.00095) [2022-07-11 13:02:16,513][26022] Updated weights on worker 0-0, policy_version 1202544 (0.00086) [2022-07-11 13:02:17,140][25689] Fps is (10 sec: 5579.5, 60 sec: 5552.4, 300 sec: 5544.5). Total num frames: 1231407104. Throughput: 0: 5717.1. Samples: 1231408926. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:17,140][25689] Avg episode reward: [(0, '-1.183')] [2022-07-11 13:02:18,574][26022] Updated weights on worker 0-0, policy_version 1202554 (0.00085) [2022-07-11 13:02:20,186][26022] Updated weights on worker 0-0, policy_version 1202564 (0.00089) [2022-07-11 13:02:22,179][25689] Fps is (10 sec: 5562.6, 60 sec: 5566.3, 300 sec: 5537.7). Total num frames: 1231434752. Throughput: 0: 5729.5. Samples: 1231442636. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:22,180][25689] Avg episode reward: [(0, '-1.059')] [2022-07-11 13:02:22,223][26022] Updated weights on worker 0-0, policy_version 1202574 (0.00401) [2022-07-11 13:02:23,878][26022] Updated weights on worker 0-0, policy_version 1202584 (0.00093) [2022-07-11 13:02:25,844][26022] Updated weights on worker 0-0, policy_version 1202594 (0.00091) [2022-07-11 13:02:27,231][25689] Fps is (10 sec: 5581.1, 60 sec: 5568.9, 300 sec: 5543.7). Total num frames: 1231463424. Throughput: 0: 5000.4. Samples: 1231459414. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:27,231][25689] Avg episode reward: [(0, '-0.867')] [2022-07-11 13:02:27,546][26022] Updated weights on worker 0-0, policy_version 1202604 (0.00090) [2022-07-11 13:02:29,466][26022] Updated weights on worker 0-0, policy_version 1202614 (0.00087) [2022-07-11 13:02:31,291][26022] Updated weights on worker 0-0, policy_version 1202624 (0.00086) [2022-07-11 13:02:32,237][25689] Fps is (10 sec: 5599.9, 60 sec: 5543.1, 300 sec: 5541.1). Total num frames: 1231491072. Throughput: 0: 5827.4. Samples: 1231493220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:32,237][25689] Avg episode reward: [(0, '-0.944')] [2022-07-11 13:02:33,202][26022] Updated weights on worker 0-0, policy_version 1202634 (0.00086) [2022-07-11 13:02:34,921][26022] Updated weights on worker 0-0, policy_version 1202644 (0.00078) [2022-07-11 13:02:36,788][26022] Updated weights on worker 0-0, policy_version 1202654 (0.00093) [2022-07-11 13:02:37,335][25689] Fps is (10 sec: 5675.5, 60 sec: 5576.4, 300 sec: 5543.0). Total num frames: 1231520768. Throughput: 0: 5847.7. Samples: 1231527020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:37,335][25689] Avg episode reward: [(0, '-0.947')] [2022-07-11 13:02:38,542][26022] Updated weights on worker 0-0, policy_version 1202664 (0.00089) [2022-07-11 13:02:40,615][26022] Updated weights on worker 0-0, policy_version 1202674 (0.00091) [2022-07-11 13:02:42,342][25689] Fps is (10 sec: 5573.1, 60 sec: 5543.4, 300 sec: 5536.5). Total num frames: 1231547392. Throughput: 0: 5014.7. Samples: 1231543748. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:42,343][25689] Avg episode reward: [(0, '-0.292')] [2022-07-11 13:02:42,406][26022] Updated weights on worker 0-0, policy_version 1202684 (0.00089) [2022-07-11 13:02:44,270][26022] Updated weights on worker 0-0, policy_version 1202694 (0.00091) [2022-07-11 13:02:45,909][26022] Updated weights on worker 0-0, policy_version 1202704 (0.00081) [2022-07-11 13:02:47,383][25689] Fps is (10 sec: 5401.1, 60 sec: 5543.7, 300 sec: 5536.3). Total num frames: 1231575040. Throughput: 0: 5845.4. Samples: 1231577212. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:47,384][25689] Avg episode reward: [(0, '0.001')] [2022-07-11 13:02:47,809][26022] Updated weights on worker 0-0, policy_version 1202714 (0.00676) [2022-07-11 13:02:49,493][26022] Updated weights on worker 0-0, policy_version 1202724 (0.00087) [2022-07-11 13:02:51,467][26022] Updated weights on worker 0-0, policy_version 1202734 (0.00088) [2022-07-11 13:02:52,440][25689] Fps is (10 sec: 5679.0, 60 sec: 5544.2, 300 sec: 5543.1). Total num frames: 1231604736. Throughput: 0: 5826.5. Samples: 1231610934. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:52,440][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 13:02:53,287][26022] Updated weights on worker 0-0, policy_version 1202744 (0.00088) [2022-07-11 13:02:55,176][26022] Updated weights on worker 0-0, policy_version 1202754 (0.00095) [2022-07-11 13:02:56,931][26022] Updated weights on worker 0-0, policy_version 1202764 (0.00088) [2022-07-11 13:02:57,490][25689] Fps is (10 sec: 5674.0, 60 sec: 5549.3, 300 sec: 5539.8). Total num frames: 1231632384. Throughput: 0: 4994.5. Samples: 1231627684. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:02:57,490][25689] Avg episode reward: [(0, '0.704')] [2022-07-11 13:02:58,891][26022] Updated weights on worker 0-0, policy_version 1202774 (0.00083) [2022-07-11 13:03:00,530][26022] Updated weights on worker 0-0, policy_version 1202784 (0.00092) [2022-07-11 13:03:02,544][25689] Fps is (10 sec: 5269.8, 60 sec: 5548.6, 300 sec: 5539.3). Total num frames: 1231657984. Throughput: 0: 5820.8. Samples: 1231661340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:02,545][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 13:03:02,929][26022] Updated weights on worker 0-0, policy_version 1202794 (0.00295) [2022-07-11 13:03:04,500][26022] Updated weights on worker 0-0, policy_version 1202804 (0.00087) [2022-07-11 13:03:06,589][26022] Updated weights on worker 0-0, policy_version 1202814 (0.00093) [2022-07-11 13:03:07,563][25689] Fps is (10 sec: 5591.0, 60 sec: 5583.9, 300 sec: 5556.3). Total num frames: 1231688704. Throughput: 0: 5735.2. Samples: 1231692948. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:07,563][25689] Avg episode reward: [(0, '1.240')] [2022-07-11 13:03:08,263][26022] Updated weights on worker 0-0, policy_version 1202824 (0.00088) [2022-07-11 13:03:10,115][26022] Updated weights on worker 0-0, policy_version 1202834 (0.00090) [2022-07-11 13:03:12,081][26022] Updated weights on worker 0-0, policy_version 1202844 (0.00634) [2022-07-11 13:03:12,586][25689] Fps is (10 sec: 5710.3, 60 sec: 5565.6, 300 sec: 5537.4). Total num frames: 1231715328. Throughput: 0: 4908.1. Samples: 1231709814. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:12,587][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 13:03:13,805][26022] Updated weights on worker 0-0, policy_version 1202854 (0.00092) [2022-07-11 13:03:15,692][26022] Updated weights on worker 0-0, policy_version 1202864 (0.00090) [2022-07-11 13:03:17,436][26022] Updated weights on worker 0-0, policy_version 1202874 (0.00090) [2022-07-11 13:03:17,680][25689] Fps is (10 sec: 5465.4, 60 sec: 5564.8, 300 sec: 5546.5). Total num frames: 1231744000. Throughput: 0: 5724.5. Samples: 1231743266. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:17,681][25689] Avg episode reward: [(0, '1.296')] [2022-07-11 13:03:19,225][26022] Updated weights on worker 0-0, policy_version 1202884 (0.00087) [2022-07-11 13:03:21,167][26022] Updated weights on worker 0-0, policy_version 1202894 (0.00083) [2022-07-11 13:03:22,723][25689] Fps is (10 sec: 5657.3, 60 sec: 5581.5, 300 sec: 5549.9). Total num frames: 1231772672. Throughput: 0: 5722.7. Samples: 1231776816. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:22,723][25689] Avg episode reward: [(0, '1.223')] [2022-07-11 13:03:23,035][26022] Updated weights on worker 0-0, policy_version 1202904 (0.00085) [2022-07-11 13:03:24,838][26022] Updated weights on worker 0-0, policy_version 1202914 (0.00085) [2022-07-11 13:03:26,699][26022] Updated weights on worker 0-0, policy_version 1202924 (0.00089) [2022-07-11 13:03:27,742][25689] Fps is (10 sec: 5495.5, 60 sec: 5550.5, 300 sec: 5543.7). Total num frames: 1231799296. Throughput: 0: 4981.8. Samples: 1231793478. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:27,743][25689] Avg episode reward: [(0, '1.343')] [2022-07-11 13:03:28,508][26022] Updated weights on worker 0-0, policy_version 1202934 (0.00085) [2022-07-11 13:03:30,437][26022] Updated weights on worker 0-0, policy_version 1202944 (0.00092) [2022-07-11 13:03:32,317][26022] Updated weights on worker 0-0, policy_version 1202954 (0.00082) [2022-07-11 13:03:32,750][25689] Fps is (10 sec: 5412.2, 60 sec: 5550.3, 300 sec: 5544.7). Total num frames: 1231826944. Throughput: 0: 5797.0. Samples: 1231826706. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:32,751][25689] Avg episode reward: [(0, '1.452')] [2022-07-11 13:03:34,073][26022] Updated weights on worker 0-0, policy_version 1202964 (0.00082) [2022-07-11 13:03:35,845][26022] Updated weights on worker 0-0, policy_version 1202974 (0.00081) [2022-07-11 13:03:37,807][25689] Fps is (10 sec: 5493.9, 60 sec: 5520.2, 300 sec: 5541.2). Total num frames: 1231854592. Throughput: 0: 5793.0. Samples: 1231859862. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:37,808][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 13:03:37,913][26022] Updated weights on worker 0-0, policy_version 1202984 (0.00086) [2022-07-11 13:03:39,507][26022] Updated weights on worker 0-0, policy_version 1202994 (0.00079) [2022-07-11 13:03:41,404][26022] Updated weights on worker 0-0, policy_version 1203004 (0.00050) [2022-07-11 13:03:42,838][25689] Fps is (10 sec: 5582.9, 60 sec: 5552.0, 300 sec: 5537.5). Total num frames: 1231883264. Throughput: 0: 4954.5. Samples: 1231876480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:42,839][25689] Avg episode reward: [(0, '1.220')] [2022-07-11 13:03:43,590][26022] Updated weights on worker 0-0, policy_version 1203014 (0.00085) [2022-07-11 13:03:44,952][26022] Updated weights on worker 0-0, policy_version 1203024 (0.00090) [2022-07-11 13:03:47,229][26022] Updated weights on worker 0-0, policy_version 1203034 (0.00086) [2022-07-11 13:03:47,870][25689] Fps is (10 sec: 5596.8, 60 sec: 5552.8, 300 sec: 5537.4). Total num frames: 1231910912. Throughput: 0: 5779.3. Samples: 1231909804. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:47,872][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 13:03:48,567][26022] Updated weights on worker 0-0, policy_version 1203044 (0.00092) [2022-07-11 13:03:50,761][26022] Updated weights on worker 0-0, policy_version 1203054 (0.00093) [2022-07-11 13:03:52,450][26022] Updated weights on worker 0-0, policy_version 1203064 (0.00083) [2022-07-11 13:03:52,880][25689] Fps is (10 sec: 5507.0, 60 sec: 5523.3, 300 sec: 5539.3). Total num frames: 1231938560. Throughput: 0: 5771.6. Samples: 1231942882. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:52,880][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 13:03:54,488][26022] Updated weights on worker 0-0, policy_version 1203074 (0.00086) [2022-07-11 13:03:56,159][26022] Updated weights on worker 0-0, policy_version 1203084 (0.00091) [2022-07-11 13:03:58,026][25689] Fps is (10 sec: 5444.7, 60 sec: 5514.4, 300 sec: 5536.7). Total num frames: 1231966208. Throughput: 0: 5762.5. Samples: 1231976372. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:03:58,027][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 13:03:58,352][26022] Updated weights on worker 0-0, policy_version 1203094 (0.00088) [2022-07-11 13:03:59,768][26022] Updated weights on worker 0-0, policy_version 1203104 (0.00084) [2022-07-11 13:04:02,257][26022] Updated weights on worker 0-0, policy_version 1203114 (0.00088) [2022-07-11 13:04:03,060][25689] Fps is (10 sec: 5330.8, 60 sec: 5533.2, 300 sec: 5536.3). Total num frames: 1231992832. Throughput: 0: 5779.3. Samples: 1231993350. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:03,062][25689] Avg episode reward: [(0, '-0.931')] [2022-07-11 13:04:03,590][26022] Updated weights on worker 0-0, policy_version 1203124 (0.00101) [2022-07-11 13:04:05,800][26022] Updated weights on worker 0-0, policy_version 1203134 (0.00092) [2022-07-11 13:04:07,366][26022] Updated weights on worker 0-0, policy_version 1203144 (0.00085) [2022-07-11 13:04:08,147][25689] Fps is (10 sec: 5665.7, 60 sec: 5526.9, 300 sec: 5545.5). Total num frames: 1232023552. Throughput: 0: 5702.2. Samples: 1232025428. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:08,148][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 13:04:09,263][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:04:09,286][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001203153_1232028672.pth [2022-07-11 13:04:09,287][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001201204_1230032896.pth [2022-07-11 13:04:09,600][26022] Updated weights on worker 0-0, policy_version 1203154 (0.00088) [2022-07-11 13:04:11,097][26022] Updated weights on worker 0-0, policy_version 1203164 (0.00096) [2022-07-11 13:04:13,154][25689] Fps is (10 sec: 5681.2, 60 sec: 5528.5, 300 sec: 5543.4). Total num frames: 1232050176. Throughput: 0: 5740.9. Samples: 1232059276. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:13,156][25689] Avg episode reward: [(0, '-0.317')] [2022-07-11 13:04:13,157][26022] Updated weights on worker 0-0, policy_version 1203174 (0.00095) [2022-07-11 13:04:14,676][26022] Updated weights on worker 0-0, policy_version 1203184 (0.00096) [2022-07-11 13:04:16,702][26022] Updated weights on worker 0-0, policy_version 1203194 (0.00083) [2022-07-11 13:04:18,229][25689] Fps is (10 sec: 5484.9, 60 sec: 5530.2, 300 sec: 5542.1). Total num frames: 1232078848. Throughput: 0: 4927.0. Samples: 1232075910. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:18,229][25689] Avg episode reward: [(0, '-0.130')] [2022-07-11 13:04:18,311][26022] Updated weights on worker 0-0, policy_version 1203204 (0.00087) [2022-07-11 13:04:20,301][26022] Updated weights on worker 0-0, policy_version 1203214 (0.00089) [2022-07-11 13:04:22,067][26022] Updated weights on worker 0-0, policy_version 1203224 (0.00093) [2022-07-11 13:04:23,263][25689] Fps is (10 sec: 5672.6, 60 sec: 5531.0, 300 sec: 5546.0). Total num frames: 1232107520. Throughput: 0: 5749.3. Samples: 1232109500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:23,263][25689] Avg episode reward: [(0, '0.409')] [2022-07-11 13:04:24,057][26022] Updated weights on worker 0-0, policy_version 1203234 (0.00091) [2022-07-11 13:04:25,816][26022] Updated weights on worker 0-0, policy_version 1203244 (0.00092) [2022-07-11 13:04:27,712][26022] Updated weights on worker 0-0, policy_version 1203254 (0.00108) [2022-07-11 13:04:28,281][25689] Fps is (10 sec: 5602.9, 60 sec: 5548.1, 300 sec: 5546.6). Total num frames: 1232135168. Throughput: 0: 5841.7. Samples: 1232143040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:28,281][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 13:04:29,327][26022] Updated weights on worker 0-0, policy_version 1203264 (0.00088) [2022-07-11 13:04:31,401][26022] Updated weights on worker 0-0, policy_version 1203274 (0.00099) [2022-07-11 13:04:33,183][26022] Updated weights on worker 0-0, policy_version 1203284 (0.00079) [2022-07-11 13:04:33,329][25689] Fps is (10 sec: 5595.2, 60 sec: 5561.3, 300 sec: 5543.7). Total num frames: 1232163840. Throughput: 0: 4992.9. Samples: 1232160006. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:33,329][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 13:04:35,084][26022] Updated weights on worker 0-0, policy_version 1203294 (0.00077) [2022-07-11 13:04:36,811][26022] Updated weights on worker 0-0, policy_version 1203304 (0.00090) [2022-07-11 13:04:38,404][25689] Fps is (10 sec: 5664.5, 60 sec: 5576.5, 300 sec: 5549.5). Total num frames: 1232192512. Throughput: 0: 5836.8. Samples: 1232193670. Policy #0 lag: (min: 0.0, avg: 9.4, max: 18.0) [2022-07-11 13:04:38,405][25689] Avg episode reward: [(0, '0.071')] [2022-07-11 13:04:38,555][26022] Updated weights on worker 0-0, policy_version 1203314 (0.00083) [2022-07-11 13:04:40,360][26022] Updated weights on worker 0-0, policy_version 1203324 (0.00089) [2022-07-11 13:04:42,286][26022] Updated weights on worker 0-0, policy_version 1203334 (0.00094) [2022-07-11 13:04:43,484][25689] Fps is (10 sec: 5546.2, 60 sec: 5555.2, 300 sec: 5544.7). Total num frames: 1232220160. Throughput: 0: 5825.1. Samples: 1232227286. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:04:43,484][25689] Avg episode reward: [(0, '0.040')] [2022-07-11 13:04:44,098][26022] Updated weights on worker 0-0, policy_version 1203344 (0.00086) [2022-07-11 13:04:45,988][26022] Updated weights on worker 0-0, policy_version 1203354 (0.00097) [2022-07-11 13:04:47,587][26022] Updated weights on worker 0-0, policy_version 1203364 (0.00091) [2022-07-11 13:04:48,515][25689] Fps is (10 sec: 5469.2, 60 sec: 5555.3, 300 sec: 5541.8). Total num frames: 1232247808. Throughput: 0: 4977.6. Samples: 1232243756. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:04:48,515][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 13:04:49,674][26022] Updated weights on worker 0-0, policy_version 1203374 (0.00087) [2022-07-11 13:04:51,353][26022] Updated weights on worker 0-0, policy_version 1203384 (0.00090) [2022-07-11 13:04:53,535][25689] Fps is (10 sec: 5399.6, 60 sec: 5537.4, 300 sec: 5542.9). Total num frames: 1232274432. Throughput: 0: 5796.0. Samples: 1232277120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:04:53,536][25689] Avg episode reward: [(0, '1.120')] [2022-07-11 13:04:53,607][26022] Updated weights on worker 0-0, policy_version 1203394 (0.00092) [2022-07-11 13:04:55,039][26022] Updated weights on worker 0-0, policy_version 1203404 (0.01272) [2022-07-11 13:04:57,141][26022] Updated weights on worker 0-0, policy_version 1203414 (0.00091) [2022-07-11 13:04:58,644][25689] Fps is (10 sec: 5661.2, 60 sec: 5591.5, 300 sec: 5548.2). Total num frames: 1232305152. Throughput: 0: 5783.6. Samples: 1232310728. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:04:58,645][25689] Avg episode reward: [(0, '1.050')] [2022-07-11 13:04:58,697][26022] Updated weights on worker 0-0, policy_version 1203424 (0.00086) [2022-07-11 13:05:00,811][26022] Updated weights on worker 0-0, policy_version 1203434 (0.00097) [2022-07-11 13:05:02,856][26022] Updated weights on worker 0-0, policy_version 1203444 (0.00095) [2022-07-11 13:05:03,675][25689] Fps is (10 sec: 5554.1, 60 sec: 5574.9, 300 sec: 5544.8). Total num frames: 1232330752. Throughput: 0: 4965.4. Samples: 1232327544. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:03,676][25689] Avg episode reward: [(0, '1.210')] [2022-07-11 13:05:04,807][26022] Updated weights on worker 0-0, policy_version 1203454 (0.00088) [2022-07-11 13:05:06,331][26022] Updated weights on worker 0-0, policy_version 1203464 (0.00087) [2022-07-11 13:05:08,539][26022] Updated weights on worker 0-0, policy_version 1203474 (0.00086) [2022-07-11 13:05:08,704][25689] Fps is (10 sec: 5293.1, 60 sec: 5529.5, 300 sec: 5547.9). Total num frames: 1232358400. Throughput: 0: 5708.4. Samples: 1232359004. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:08,705][25689] Avg episode reward: [(0, '1.694')] [2022-07-11 13:05:10,060][26022] Updated weights on worker 0-0, policy_version 1203484 (0.00090) [2022-07-11 13:05:12,044][26022] Updated weights on worker 0-0, policy_version 1203494 (0.00086) [2022-07-11 13:05:13,735][25689] Fps is (10 sec: 5598.2, 60 sec: 5561.1, 300 sec: 5549.0). Total num frames: 1232387072. Throughput: 0: 5722.1. Samples: 1232392710. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:13,736][25689] Avg episode reward: [(0, '1.517')] [2022-07-11 13:05:13,803][26022] Updated weights on worker 0-0, policy_version 1203504 (0.00095) [2022-07-11 13:05:15,682][26022] Updated weights on worker 0-0, policy_version 1203514 (0.00094) [2022-07-11 13:05:17,565][26022] Updated weights on worker 0-0, policy_version 1203524 (0.00085) [2022-07-11 13:05:18,855][25689] Fps is (10 sec: 5649.2, 60 sec: 5557.0, 300 sec: 5547.0). Total num frames: 1232415744. Throughput: 0: 4869.7. Samples: 1232409148. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:18,855][25689] Avg episode reward: [(0, '1.075')] [2022-07-11 13:05:19,535][26022] Updated weights on worker 0-0, policy_version 1203534 (0.00088) [2022-07-11 13:05:21,277][26022] Updated weights on worker 0-0, policy_version 1203544 (0.00088) [2022-07-11 13:05:23,169][26022] Updated weights on worker 0-0, policy_version 1203554 (0.00633) [2022-07-11 13:05:23,866][25689] Fps is (10 sec: 5559.2, 60 sec: 5542.1, 300 sec: 5550.4). Total num frames: 1232443392. Throughput: 0: 5717.8. Samples: 1232442994. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:23,867][25689] Avg episode reward: [(0, '0.585')] [2022-07-11 13:05:24,987][26022] Updated weights on worker 0-0, policy_version 1203564 (0.00090) [2022-07-11 13:05:26,879][26022] Updated weights on worker 0-0, policy_version 1203574 (0.00087) [2022-07-11 13:05:28,522][26022] Updated weights on worker 0-0, policy_version 1203584 (0.00086) [2022-07-11 13:05:28,872][25689] Fps is (10 sec: 5622.2, 60 sec: 5560.1, 300 sec: 5551.2). Total num frames: 1232472064. Throughput: 0: 5822.5. Samples: 1232476434. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:28,873][25689] Avg episode reward: [(0, '0.878')] [2022-07-11 13:05:30,252][26022] Updated weights on worker 0-0, policy_version 1203594 (0.00096) [2022-07-11 13:05:32,278][26022] Updated weights on worker 0-0, policy_version 1203604 (0.00096) [2022-07-11 13:05:33,880][25689] Fps is (10 sec: 5624.4, 60 sec: 5546.9, 300 sec: 5548.8). Total num frames: 1232499712. Throughput: 0: 4989.1. Samples: 1232493212. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:33,880][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 13:05:34,147][26022] Updated weights on worker 0-0, policy_version 1203614 (0.00082) [2022-07-11 13:05:35,946][26022] Updated weights on worker 0-0, policy_version 1203624 (0.00084) [2022-07-11 13:05:37,827][26022] Updated weights on worker 0-0, policy_version 1203634 (0.00085) [2022-07-11 13:05:38,935][25689] Fps is (10 sec: 5495.1, 60 sec: 5531.9, 300 sec: 5552.2). Total num frames: 1232527360. Throughput: 0: 5856.5. Samples: 1232526748. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:38,935][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 13:05:39,571][26022] Updated weights on worker 0-0, policy_version 1203644 (0.00095) [2022-07-11 13:05:41,367][26022] Updated weights on worker 0-0, policy_version 1203654 (0.00087) [2022-07-11 13:05:43,206][26022] Updated weights on worker 0-0, policy_version 1203664 (0.00091) [2022-07-11 13:05:43,952][25689] Fps is (10 sec: 5489.9, 60 sec: 5537.6, 300 sec: 5552.0). Total num frames: 1232555008. Throughput: 0: 5852.6. Samples: 1232560548. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:43,953][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 13:05:45,108][26022] Updated weights on worker 0-0, policy_version 1203674 (0.00092) [2022-07-11 13:05:46,981][26022] Updated weights on worker 0-0, policy_version 1203684 (0.00085) [2022-07-11 13:05:48,658][26022] Updated weights on worker 0-0, policy_version 1203694 (0.00087) [2022-07-11 13:05:48,962][25689] Fps is (10 sec: 5617.0, 60 sec: 5556.5, 300 sec: 5552.5). Total num frames: 1232583680. Throughput: 0: 5006.9. Samples: 1232577020. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:48,962][25689] Avg episode reward: [(0, '1.572')] [2022-07-11 13:05:50,378][26022] Updated weights on worker 0-0, policy_version 1203704 (0.00087) [2022-07-11 13:05:52,371][26022] Updated weights on worker 0-0, policy_version 1203714 (0.00084) [2022-07-11 13:05:53,963][25689] Fps is (10 sec: 5728.2, 60 sec: 5592.1, 300 sec: 5553.9). Total num frames: 1232612352. Throughput: 0: 5859.1. Samples: 1232610882. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:53,963][25689] Avg episode reward: [(0, '1.992')] [2022-07-11 13:05:54,092][26022] Updated weights on worker 0-0, policy_version 1203724 (0.00089) [2022-07-11 13:05:56,092][26022] Updated weights on worker 0-0, policy_version 1203734 (0.00093) [2022-07-11 13:05:57,771][26022] Updated weights on worker 0-0, policy_version 1203744 (0.00084) [2022-07-11 13:05:59,025][25689] Fps is (10 sec: 5494.8, 60 sec: 5528.6, 300 sec: 5547.0). Total num frames: 1232638976. Throughput: 0: 5842.1. Samples: 1232644116. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:05:59,025][25689] Avg episode reward: [(0, '1.739')] [2022-07-11 13:05:59,778][26022] Updated weights on worker 0-0, policy_version 1203754 (0.00084) [2022-07-11 13:06:01,655][26022] Updated weights on worker 0-0, policy_version 1203764 (0.00093) [2022-07-11 13:06:03,815][26022] Updated weights on worker 0-0, policy_version 1203774 (0.00082) [2022-07-11 13:06:04,050][25689] Fps is (10 sec: 5278.7, 60 sec: 5546.1, 300 sec: 5551.6). Total num frames: 1232665600. Throughput: 0: 4988.1. Samples: 1232660798. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:04,052][25689] Avg episode reward: [(0, '1.866')] [2022-07-11 13:06:05,725][26022] Updated weights on worker 0-0, policy_version 1203784 (0.00089) [2022-07-11 13:06:07,579][26022] Updated weights on worker 0-0, policy_version 1203794 (0.00085) [2022-07-11 13:06:09,058][25689] Fps is (10 sec: 5409.1, 60 sec: 5548.0, 300 sec: 5549.2). Total num frames: 1232693248. Throughput: 0: 5744.5. Samples: 1232692466. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:09,059][25689] Avg episode reward: [(0, '1.905')] [2022-07-11 13:06:09,353][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:06:09,362][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001203804_1232695296.pth [2022-07-11 13:06:09,362][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001201852_1230696448.pth [2022-07-11 13:06:09,367][26022] Updated weights on worker 0-0, policy_version 1203804 (0.00086) [2022-07-11 13:06:11,191][26022] Updated weights on worker 0-0, policy_version 1203814 (0.00088) [2022-07-11 13:06:12,891][26022] Updated weights on worker 0-0, policy_version 1203824 (0.00091) [2022-07-11 13:06:14,091][25689] Fps is (10 sec: 5507.3, 60 sec: 5531.0, 300 sec: 5546.2). Total num frames: 1232720896. Throughput: 0: 5721.1. Samples: 1232726034. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:14,091][25689] Avg episode reward: [(0, '1.115')] [2022-07-11 13:06:14,808][26022] Updated weights on worker 0-0, policy_version 1203834 (0.00096) [2022-07-11 13:06:16,770][26022] Updated weights on worker 0-0, policy_version 1203844 (0.00096) [2022-07-11 13:06:18,581][26022] Updated weights on worker 0-0, policy_version 1203854 (0.00090) [2022-07-11 13:06:19,166][25689] Fps is (10 sec: 5572.3, 60 sec: 5535.1, 300 sec: 5551.8). Total num frames: 1232749568. Throughput: 0: 4891.7. Samples: 1232742638. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:19,167][25689] Avg episode reward: [(0, '0.855')] [2022-07-11 13:06:20,148][26022] Updated weights on worker 0-0, policy_version 1203864 (0.00082) [2022-07-11 13:06:22,171][26022] Updated weights on worker 0-0, policy_version 1203874 (0.00097) [2022-07-11 13:06:23,940][26022] Updated weights on worker 0-0, policy_version 1203884 (0.00086) [2022-07-11 13:06:24,225][25689] Fps is (10 sec: 5557.4, 60 sec: 5530.7, 300 sec: 5548.7). Total num frames: 1232777216. Throughput: 0: 5720.3. Samples: 1232776204. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:24,225][25689] Avg episode reward: [(0, '0.889')] [2022-07-11 13:06:25,896][26022] Updated weights on worker 0-0, policy_version 1203894 (0.00088) [2022-07-11 13:06:27,775][26022] Updated weights on worker 0-0, policy_version 1203904 (0.00051) [2022-07-11 13:06:29,237][25689] Fps is (10 sec: 5592.0, 60 sec: 5530.1, 300 sec: 5546.8). Total num frames: 1232805888. Throughput: 0: 5802.7. Samples: 1232809558. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:29,239][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 13:06:29,534][26022] Updated weights on worker 0-0, policy_version 1203914 (0.00092) [2022-07-11 13:06:31,496][26022] Updated weights on worker 0-0, policy_version 1203924 (0.00096) [2022-07-11 13:06:33,352][26022] Updated weights on worker 0-0, policy_version 1203934 (0.00094) [2022-07-11 13:06:34,287][25689] Fps is (10 sec: 5699.2, 60 sec: 5543.2, 300 sec: 5551.0). Total num frames: 1232834560. Throughput: 0: 5788.5. Samples: 1232842940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:34,288][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 13:06:35,266][26022] Updated weights on worker 0-0, policy_version 1203944 (0.00090) [2022-07-11 13:06:36,777][26022] Updated weights on worker 0-0, policy_version 1203954 (0.00091) [2022-07-11 13:06:38,842][26022] Updated weights on worker 0-0, policy_version 1203964 (0.00090) [2022-07-11 13:06:39,318][25689] Fps is (10 sec: 5485.3, 60 sec: 5528.4, 300 sec: 5543.9). Total num frames: 1232861184. Throughput: 0: 5810.2. Samples: 1232859728. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:39,318][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 13:06:40,789][26022] Updated weights on worker 0-0, policy_version 1203974 (0.00053) [2022-07-11 13:06:42,412][26022] Updated weights on worker 0-0, policy_version 1203984 (0.00087) [2022-07-11 13:06:44,331][25689] Fps is (10 sec: 5301.2, 60 sec: 5511.8, 300 sec: 5541.0). Total num frames: 1232887808. Throughput: 0: 5815.0. Samples: 1232893122. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:44,332][25689] Avg episode reward: [(0, '0.133')] [2022-07-11 13:06:44,518][26022] Updated weights on worker 0-0, policy_version 1203994 (0.00091) [2022-07-11 13:06:46,202][26022] Updated weights on worker 0-0, policy_version 1204004 (0.00085) [2022-07-11 13:06:48,096][26022] Updated weights on worker 0-0, policy_version 1204014 (0.00092) [2022-07-11 13:06:49,333][25689] Fps is (10 sec: 5521.0, 60 sec: 5512.5, 300 sec: 5538.7). Total num frames: 1232916480. Throughput: 0: 5812.2. Samples: 1232926362. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:49,334][25689] Avg episode reward: [(0, '-0.573')] [2022-07-11 13:06:49,870][26022] Updated weights on worker 0-0, policy_version 1204024 (0.00094) [2022-07-11 13:06:51,635][26022] Updated weights on worker 0-0, policy_version 1204034 (0.00087) [2022-07-11 13:06:53,548][26022] Updated weights on worker 0-0, policy_version 1204044 (0.00092) [2022-07-11 13:06:54,371][25689] Fps is (10 sec: 5813.5, 60 sec: 5526.1, 300 sec: 5546.8). Total num frames: 1232946176. Throughput: 0: 4990.6. Samples: 1232943176. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:54,372][25689] Avg episode reward: [(0, '-1.678')] [2022-07-11 13:06:55,315][26022] Updated weights on worker 0-0, policy_version 1204054 (0.00098) [2022-07-11 13:06:57,190][26022] Updated weights on worker 0-0, policy_version 1204064 (0.00090) [2022-07-11 13:06:59,024][26022] Updated weights on worker 0-0, policy_version 1204074 (0.00105) [2022-07-11 13:06:59,430][25689] Fps is (10 sec: 5476.5, 60 sec: 5509.4, 300 sec: 5546.6). Total num frames: 1232971776. Throughput: 0: 5803.2. Samples: 1232976444. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:06:59,431][25689] Avg episode reward: [(0, '-1.263')] [2022-07-11 13:07:00,862][26022] Updated weights on worker 0-0, policy_version 1204084 (0.00090) [2022-07-11 13:07:03,228][26022] Updated weights on worker 0-0, policy_version 1204094 (0.00554) [2022-07-11 13:07:04,514][25689] Fps is (10 sec: 5351.1, 60 sec: 5538.0, 300 sec: 5545.7). Total num frames: 1233000448. Throughput: 0: 5678.1. Samples: 1233007720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:04,514][25689] Avg episode reward: [(0, '-1.255')] [2022-07-11 13:07:04,850][26022] Updated weights on worker 0-0, policy_version 1204104 (0.00091) [2022-07-11 13:07:06,923][26022] Updated weights on worker 0-0, policy_version 1204114 (0.00087) [2022-07-11 13:07:08,577][26022] Updated weights on worker 0-0, policy_version 1204124 (0.00089) [2022-07-11 13:07:09,533][25689] Fps is (10 sec: 5473.6, 60 sec: 5520.1, 300 sec: 5542.1). Total num frames: 1233027072. Throughput: 0: 4853.9. Samples: 1233024406. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:09,533][25689] Avg episode reward: [(0, '-0.669')] [2022-07-11 13:07:10,637][26022] Updated weights on worker 0-0, policy_version 1204134 (0.00084) [2022-07-11 13:07:12,253][26022] Updated weights on worker 0-0, policy_version 1204144 (0.00087) [2022-07-11 13:07:14,171][26022] Updated weights on worker 0-0, policy_version 1204154 (0.00093) [2022-07-11 13:07:14,544][25689] Fps is (10 sec: 5512.9, 60 sec: 5538.9, 300 sec: 5543.4). Total num frames: 1233055744. Throughput: 0: 5686.0. Samples: 1233057878. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:14,545][25689] Avg episode reward: [(0, '-1.229')] [2022-07-11 13:07:16,057][26022] Updated weights on worker 0-0, policy_version 1204164 (0.00094) [2022-07-11 13:07:18,007][26022] Updated weights on worker 0-0, policy_version 1204174 (0.00108) [2022-07-11 13:07:19,674][25689] Fps is (10 sec: 5553.4, 60 sec: 5517.0, 300 sec: 5541.7). Total num frames: 1233083392. Throughput: 0: 5655.4. Samples: 1233090930. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:19,675][25689] Avg episode reward: [(0, '-0.165')] [2022-07-11 13:07:19,744][26022] Updated weights on worker 0-0, policy_version 1204184 (0.00082) [2022-07-11 13:07:21,729][26022] Updated weights on worker 0-0, policy_version 1204194 (0.00087) [2022-07-11 13:07:23,342][26022] Updated weights on worker 0-0, policy_version 1204204 (0.00093) [2022-07-11 13:07:24,725][25689] Fps is (10 sec: 5431.3, 60 sec: 5517.8, 300 sec: 5538.3). Total num frames: 1233111040. Throughput: 0: 4951.8. Samples: 1233107802. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:24,725][25689] Avg episode reward: [(0, '0.891')] [2022-07-11 13:07:25,195][26022] Updated weights on worker 0-0, policy_version 1204214 (0.00086) [2022-07-11 13:07:27,016][26022] Updated weights on worker 0-0, policy_version 1204224 (0.00095) [2022-07-11 13:07:28,898][26022] Updated weights on worker 0-0, policy_version 1204234 (0.00087) [2022-07-11 13:07:29,728][25689] Fps is (10 sec: 5500.0, 60 sec: 5501.7, 300 sec: 5538.4). Total num frames: 1233138688. Throughput: 0: 5776.4. Samples: 1233141064. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:29,729][25689] Avg episode reward: [(0, '1.102')] [2022-07-11 13:07:30,826][26022] Updated weights on worker 0-0, policy_version 1204244 (0.00098) [2022-07-11 13:07:32,831][26022] Updated weights on worker 0-0, policy_version 1204254 (0.00087) [2022-07-11 13:07:34,543][26022] Updated weights on worker 0-0, policy_version 1204264 (0.00096) [2022-07-11 13:07:34,748][25689] Fps is (10 sec: 5619.1, 60 sec: 5504.4, 300 sec: 5536.4). Total num frames: 1233167360. Throughput: 0: 5785.1. Samples: 1233174760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:34,750][25689] Avg episode reward: [(0, '0.535')] [2022-07-11 13:07:36,477][26022] Updated weights on worker 0-0, policy_version 1204274 (0.00080) [2022-07-11 13:07:37,983][26022] Updated weights on worker 0-0, policy_version 1204284 (0.00089) [2022-07-11 13:07:39,849][25689] Fps is (10 sec: 5564.5, 60 sec: 5514.9, 300 sec: 5538.1). Total num frames: 1233195008. Throughput: 0: 4986.0. Samples: 1233191524. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:39,850][25689] Avg episode reward: [(0, '1.774')] [2022-07-11 13:07:40,168][26022] Updated weights on worker 0-0, policy_version 1204294 (0.00096) [2022-07-11 13:07:41,749][26022] Updated weights on worker 0-0, policy_version 1204304 (0.00086) [2022-07-11 13:07:43,750][26022] Updated weights on worker 0-0, policy_version 1204314 (0.00098) [2022-07-11 13:07:44,871][25689] Fps is (10 sec: 5563.7, 60 sec: 5548.0, 300 sec: 5541.9). Total num frames: 1233223680. Throughput: 0: 5814.7. Samples: 1233224944. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:44,872][25689] Avg episode reward: [(0, '1.651')] [2022-07-11 13:07:45,679][26022] Updated weights on worker 0-0, policy_version 1204324 (0.00109) [2022-07-11 13:07:47,373][26022] Updated weights on worker 0-0, policy_version 1204334 (0.00086) [2022-07-11 13:07:49,063][26022] Updated weights on worker 0-0, policy_version 1204344 (0.00083) [2022-07-11 13:07:49,900][25689] Fps is (10 sec: 5705.6, 60 sec: 5545.5, 300 sec: 5539.0). Total num frames: 1233252352. Throughput: 0: 5816.6. Samples: 1233258396. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:49,900][25689] Avg episode reward: [(0, '1.521')] [2022-07-11 13:07:50,823][26022] Updated weights on worker 0-0, policy_version 1204354 (0.00091) [2022-07-11 13:07:52,559][26022] Updated weights on worker 0-0, policy_version 1204364 (0.00086) [2022-07-11 13:07:54,723][26022] Updated weights on worker 0-0, policy_version 1204374 (0.00089) [2022-07-11 13:07:54,904][25689] Fps is (10 sec: 5511.4, 60 sec: 5497.9, 300 sec: 5536.4). Total num frames: 1233278976. Throughput: 0: 5000.0. Samples: 1233275540. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:07:54,904][25689] Avg episode reward: [(0, '1.487')] [2022-07-11 13:07:56,385][26022] Updated weights on worker 0-0, policy_version 1204384 (0.00086) [2022-07-11 13:07:58,253][26022] Updated weights on worker 0-0, policy_version 1204394 (0.00081) [2022-07-11 13:08:00,023][25689] Fps is (10 sec: 5563.3, 60 sec: 5560.0, 300 sec: 5548.9). Total num frames: 1233308672. Throughput: 0: 5851.2. Samples: 1233309566. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:00,024][25689] Avg episode reward: [(0, '0.753')] [2022-07-11 13:08:00,136][26022] Updated weights on worker 0-0, policy_version 1204404 (0.00080) [2022-07-11 13:08:02,144][26022] Updated weights on worker 0-0, policy_version 1204414 (0.00085) [2022-07-11 13:08:04,270][26022] Updated weights on worker 0-0, policy_version 1204424 (0.00089) [2022-07-11 13:08:05,061][25689] Fps is (10 sec: 5544.6, 60 sec: 5530.3, 300 sec: 5534.8). Total num frames: 1233335296. Throughput: 0: 5758.9. Samples: 1233341220. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:05,062][25689] Avg episode reward: [(0, '1.051')] [2022-07-11 13:08:05,818][26022] Updated weights on worker 0-0, policy_version 1204434 (0.00106) [2022-07-11 13:08:07,726][26022] Updated weights on worker 0-0, policy_version 1204444 (0.00088) [2022-07-11 13:08:09,612][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:08:09,624][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001204454_1233360896.pth [2022-07-11 13:08:09,625][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001202504_1231364096.pth [2022-07-11 13:08:09,627][26022] Updated weights on worker 0-0, policy_version 1204454 (0.00085) [2022-07-11 13:08:10,091][25689] Fps is (10 sec: 5391.0, 60 sec: 5546.3, 300 sec: 5538.1). Total num frames: 1233362944. Throughput: 0: 4930.8. Samples: 1233357952. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:10,091][25689] Avg episode reward: [(0, '0.762')] [2022-07-11 13:08:11,429][26022] Updated weights on worker 0-0, policy_version 1204464 (0.00294) [2022-07-11 13:08:13,160][26022] Updated weights on worker 0-0, policy_version 1204474 (0.00084) [2022-07-11 13:08:15,149][25689] Fps is (10 sec: 5481.7, 60 sec: 5525.1, 300 sec: 5535.3). Total num frames: 1233390592. Throughput: 0: 5735.9. Samples: 1233391664. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:15,150][25689] Avg episode reward: [(0, '0.781')] [2022-07-11 13:08:15,223][26022] Updated weights on worker 0-0, policy_version 1204484 (0.00091) [2022-07-11 13:08:16,755][26022] Updated weights on worker 0-0, policy_version 1204494 (0.00092) [2022-07-11 13:08:18,972][26022] Updated weights on worker 0-0, policy_version 1204504 (0.00088) [2022-07-11 13:08:20,231][25689] Fps is (10 sec: 5655.0, 60 sec: 5563.3, 300 sec: 5538.0). Total num frames: 1233420288. Throughput: 0: 5685.8. Samples: 1233424462. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:20,232][25689] Avg episode reward: [(0, '0.669')] [2022-07-11 13:08:20,805][26022] Updated weights on worker 0-0, policy_version 1204514 (0.00091) [2022-07-11 13:08:22,580][26022] Updated weights on worker 0-0, policy_version 1204524 (0.00091) [2022-07-11 13:08:24,460][26022] Updated weights on worker 0-0, policy_version 1204534 (0.00084) [2022-07-11 13:08:25,312][25689] Fps is (10 sec: 5541.8, 60 sec: 5543.7, 300 sec: 5536.9). Total num frames: 1233446912. Throughput: 0: 4940.2. Samples: 1233441260. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:25,317][25689] Avg episode reward: [(0, '-1.329')] [2022-07-11 13:08:26,249][26022] Updated weights on worker 0-0, policy_version 1204544 (0.00082) [2022-07-11 13:08:28,128][26022] Updated weights on worker 0-0, policy_version 1204554 (0.00094) [2022-07-11 13:08:30,031][26022] Updated weights on worker 0-0, policy_version 1204564 (0.00087) [2022-07-11 13:08:30,381][25689] Fps is (10 sec: 5346.9, 60 sec: 5537.6, 300 sec: 5535.8). Total num frames: 1233474560. Throughput: 0: 5743.9. Samples: 1233474500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:30,382][25689] Avg episode reward: [(0, '-1.013')] [2022-07-11 13:08:31,820][26022] Updated weights on worker 0-0, policy_version 1204574 (0.00092) [2022-07-11 13:08:33,780][26022] Updated weights on worker 0-0, policy_version 1204584 (0.00096) [2022-07-11 13:08:35,385][25689] Fps is (10 sec: 5590.9, 60 sec: 5539.0, 300 sec: 5540.2). Total num frames: 1233503232. Throughput: 0: 5720.7. Samples: 1233507430. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:35,386][25689] Avg episode reward: [(0, '-1.098')] [2022-07-11 13:08:35,468][26022] Updated weights on worker 0-0, policy_version 1204594 (0.00087) [2022-07-11 13:08:37,384][26022] Updated weights on worker 0-0, policy_version 1204604 (0.00092) [2022-07-11 13:08:39,492][26022] Updated weights on worker 0-0, policy_version 1204614 (0.00093) [2022-07-11 13:08:40,451][25689] Fps is (10 sec: 5694.5, 60 sec: 5559.2, 300 sec: 5539.5). Total num frames: 1233531904. Throughput: 0: 5740.2. Samples: 1233540530. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2022-07-11 13:08:40,453][25689] Avg episode reward: [(0, '-0.940')] [2022-07-11 13:08:40,989][26022] Updated weights on worker 0-0, policy_version 1204624 (0.00091) [2022-07-11 13:08:43,030][26022] Updated weights on worker 0-0, policy_version 1204634 (0.00546) [2022-07-11 13:08:44,898][26022] Updated weights on worker 0-0, policy_version 1204644 (0.00090) [2022-07-11 13:08:45,528][25689] Fps is (10 sec: 5451.5, 60 sec: 5520.3, 300 sec: 5535.3). Total num frames: 1233558528. Throughput: 0: 5739.0. Samples: 1233557284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:08:45,529][25689] Avg episode reward: [(0, '-1.198')] [2022-07-11 13:08:46,582][26022] Updated weights on worker 0-0, policy_version 1204654 (0.00093) [2022-07-11 13:08:48,514][26022] Updated weights on worker 0-0, policy_version 1204664 (0.00085) [2022-07-11 13:08:50,379][26022] Updated weights on worker 0-0, policy_version 1204674 (0.00094) [2022-07-11 13:08:50,535][25689] Fps is (10 sec: 5381.9, 60 sec: 5505.4, 300 sec: 5535.3). Total num frames: 1233586176. Throughput: 0: 5754.8. Samples: 1233590484. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:08:50,536][25689] Avg episode reward: [(0, '-2.016')] [2022-07-11 13:08:52,205][26022] Updated weights on worker 0-0, policy_version 1204684 (0.00084) [2022-07-11 13:08:54,118][26022] Updated weights on worker 0-0, policy_version 1204694 (0.00085) [2022-07-11 13:08:55,558][25689] Fps is (10 sec: 5717.2, 60 sec: 5554.3, 300 sec: 5544.5). Total num frames: 1233615872. Throughput: 0: 5779.0. Samples: 1233624012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:08:55,560][25689] Avg episode reward: [(0, '0.315')] [2022-07-11 13:08:55,740][26022] Updated weights on worker 0-0, policy_version 1204704 (0.00090) [2022-07-11 13:08:57,662][26022] Updated weights on worker 0-0, policy_version 1204714 (0.00100) [2022-07-11 13:08:59,540][26022] Updated weights on worker 0-0, policy_version 1204724 (0.00088) [2022-07-11 13:09:00,645][25689] Fps is (10 sec: 5672.3, 60 sec: 5523.6, 300 sec: 5547.0). Total num frames: 1233643520. Throughput: 0: 4965.4. Samples: 1233640800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:00,645][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 13:09:01,507][26022] Updated weights on worker 0-0, policy_version 1204734 (0.00081) [2022-07-11 13:09:03,587][26022] Updated weights on worker 0-0, policy_version 1204744 (0.00086) [2022-07-11 13:09:05,353][26022] Updated weights on worker 0-0, policy_version 1204754 (0.00084) [2022-07-11 13:09:05,701][25689] Fps is (10 sec: 5249.9, 60 sec: 5505.0, 300 sec: 5530.3). Total num frames: 1233669120. Throughput: 0: 5700.0. Samples: 1233672268. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:05,702][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 13:09:07,252][26022] Updated weights on worker 0-0, policy_version 1204764 (0.00079) [2022-07-11 13:09:08,989][26022] Updated weights on worker 0-0, policy_version 1204774 (0.00085) [2022-07-11 13:09:10,758][25689] Fps is (10 sec: 5366.5, 60 sec: 5519.4, 300 sec: 5536.3). Total num frames: 1233697792. Throughput: 0: 5723.5. Samples: 1233706228. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:10,759][25689] Avg episode reward: [(0, '0.559')] [2022-07-11 13:09:10,841][26022] Updated weights on worker 0-0, policy_version 1204784 (0.00088) [2022-07-11 13:09:12,693][26022] Updated weights on worker 0-0, policy_version 1204794 (0.00089) [2022-07-11 13:09:14,400][26022] Updated weights on worker 0-0, policy_version 1204804 (0.00095) [2022-07-11 13:09:15,803][25689] Fps is (10 sec: 5575.0, 60 sec: 5520.6, 300 sec: 5533.4). Total num frames: 1233725440. Throughput: 0: 4891.8. Samples: 1233723046. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:15,804][25689] Avg episode reward: [(0, '0.498')] [2022-07-11 13:09:16,340][26022] Updated weights on worker 0-0, policy_version 1204814 (0.00085) [2022-07-11 13:09:18,175][26022] Updated weights on worker 0-0, policy_version 1204824 (0.00114) [2022-07-11 13:09:19,946][26022] Updated weights on worker 0-0, policy_version 1204834 (0.00094) [2022-07-11 13:09:20,873][25689] Fps is (10 sec: 5567.8, 60 sec: 5504.8, 300 sec: 5532.7). Total num frames: 1233754112. Throughput: 0: 5723.9. Samples: 1233756586. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:20,875][25689] Avg episode reward: [(0, '-0.045')] [2022-07-11 13:09:21,807][26022] Updated weights on worker 0-0, policy_version 1204844 (0.00082) [2022-07-11 13:09:23,609][26022] Updated weights on worker 0-0, policy_version 1204854 (0.00082) [2022-07-11 13:09:25,505][26022] Updated weights on worker 0-0, policy_version 1204864 (0.00095) [2022-07-11 13:09:25,894][25689] Fps is (10 sec: 5581.4, 60 sec: 5527.1, 300 sec: 5532.7). Total num frames: 1233781760. Throughput: 0: 5832.4. Samples: 1233790042. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:25,895][25689] Avg episode reward: [(0, '-1.178')] [2022-07-11 13:09:27,468][26022] Updated weights on worker 0-0, policy_version 1204874 (0.00541) [2022-07-11 13:09:29,131][26022] Updated weights on worker 0-0, policy_version 1204884 (0.00085) [2022-07-11 13:09:30,903][25689] Fps is (10 sec: 5513.5, 60 sec: 5532.7, 300 sec: 5529.9). Total num frames: 1233809408. Throughput: 0: 4992.0. Samples: 1233806790. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:30,903][25689] Avg episode reward: [(0, '-1.270')] [2022-07-11 13:09:31,072][26022] Updated weights on worker 0-0, policy_version 1204894 (0.00099) [2022-07-11 13:09:32,861][26022] Updated weights on worker 0-0, policy_version 1204904 (0.00092) [2022-07-11 13:09:34,753][26022] Updated weights on worker 0-0, policy_version 1204914 (0.00096) [2022-07-11 13:09:35,926][25689] Fps is (10 sec: 5511.9, 60 sec: 5514.0, 300 sec: 5527.5). Total num frames: 1233837056. Throughput: 0: 5820.5. Samples: 1233840172. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:35,928][25689] Avg episode reward: [(0, '-1.403')] [2022-07-11 13:09:36,592][26022] Updated weights on worker 0-0, policy_version 1204924 (0.00092) [2022-07-11 13:09:38,428][26022] Updated weights on worker 0-0, policy_version 1204934 (0.00093) [2022-07-11 13:09:40,233][26022] Updated weights on worker 0-0, policy_version 1204944 (0.00092) [2022-07-11 13:09:40,969][25689] Fps is (10 sec: 5696.9, 60 sec: 5533.1, 300 sec: 5535.0). Total num frames: 1233866752. Throughput: 0: 5834.6. Samples: 1233873832. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:40,969][25689] Avg episode reward: [(0, '-1.489')] [2022-07-11 13:09:42,115][26022] Updated weights on worker 0-0, policy_version 1204954 (0.00087) [2022-07-11 13:09:43,839][26022] Updated weights on worker 0-0, policy_version 1204964 (0.00087) [2022-07-11 13:09:45,843][26022] Updated weights on worker 0-0, policy_version 1204974 (0.00095) [2022-07-11 13:09:46,059][25689] Fps is (10 sec: 5659.3, 60 sec: 5548.8, 300 sec: 5533.9). Total num frames: 1233894400. Throughput: 0: 4984.0. Samples: 1233890544. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:46,059][25689] Avg episode reward: [(0, '-0.463')] [2022-07-11 13:09:47,543][26022] Updated weights on worker 0-0, policy_version 1204984 (0.00087) [2022-07-11 13:09:49,535][26022] Updated weights on worker 0-0, policy_version 1204994 (0.00084) [2022-07-11 13:09:51,124][25689] Fps is (10 sec: 5545.8, 60 sec: 5560.4, 300 sec: 5540.0). Total num frames: 1233923072. Throughput: 0: 5785.7. Samples: 1233923784. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:51,125][25689] Avg episode reward: [(0, '-0.539')] [2022-07-11 13:09:51,245][26022] Updated weights on worker 0-0, policy_version 1205004 (0.00083) [2022-07-11 13:09:53,054][26022] Updated weights on worker 0-0, policy_version 1205014 (0.00724) [2022-07-11 13:09:54,905][26022] Updated weights on worker 0-0, policy_version 1205024 (0.00083) [2022-07-11 13:09:56,137][25689] Fps is (10 sec: 5689.7, 60 sec: 5544.4, 300 sec: 5534.9). Total num frames: 1233951744. Throughput: 0: 5823.6. Samples: 1233957874. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:09:56,138][25689] Avg episode reward: [(0, '0.784')] [2022-07-11 13:09:56,623][26022] Updated weights on worker 0-0, policy_version 1205034 (0.00086) [2022-07-11 13:09:58,670][26022] Updated weights on worker 0-0, policy_version 1205044 (0.00089) [2022-07-11 13:10:00,346][26022] Updated weights on worker 0-0, policy_version 1205054 (0.00082) [2022-07-11 13:10:01,230][25689] Fps is (10 sec: 5573.1, 60 sec: 5543.8, 300 sec: 5540.6). Total num frames: 1233979392. Throughput: 0: 4979.0. Samples: 1233974712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:01,230][25689] Avg episode reward: [(0, '0.897')] [2022-07-11 13:10:02,576][26022] Updated weights on worker 0-0, policy_version 1205064 (0.00086) [2022-07-11 13:10:04,367][26022] Updated weights on worker 0-0, policy_version 1205074 (0.00095) [2022-07-11 13:10:06,228][26022] Updated weights on worker 0-0, policy_version 1205084 (0.00090) [2022-07-11 13:10:06,260][25689] Fps is (10 sec: 5361.2, 60 sec: 5563.1, 300 sec: 5537.2). Total num frames: 1234006016. Throughput: 0: 5728.8. Samples: 1234006276. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:06,261][25689] Avg episode reward: [(0, '0.133')] [2022-07-11 13:10:08,055][26022] Updated weights on worker 0-0, policy_version 1205094 (0.00082) [2022-07-11 13:10:09,770][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:10:09,784][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001205103_1234025472.pth [2022-07-11 13:10:09,785][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001203153_1232028672.pth [2022-07-11 13:10:09,986][26022] Updated weights on worker 0-0, policy_version 1205104 (0.00088) [2022-07-11 13:10:11,290][25689] Fps is (10 sec: 5496.3, 60 sec: 5565.6, 300 sec: 5537.2). Total num frames: 1234034688. Throughput: 0: 5742.7. Samples: 1234039592. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:11,292][25689] Avg episode reward: [(0, '-0.055')] [2022-07-11 13:10:11,880][26022] Updated weights on worker 0-0, policy_version 1205114 (0.00084) [2022-07-11 13:10:13,566][26022] Updated weights on worker 0-0, policy_version 1205124 (0.00092) [2022-07-11 13:10:15,332][26022] Updated weights on worker 0-0, policy_version 1205134 (0.00088) [2022-07-11 13:10:16,303][25689] Fps is (10 sec: 5608.1, 60 sec: 5568.5, 300 sec: 5535.7). Total num frames: 1234062336. Throughput: 0: 4898.7. Samples: 1234056660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:16,304][25689] Avg episode reward: [(0, '-0.697')] [2022-07-11 13:10:17,376][26022] Updated weights on worker 0-0, policy_version 1205144 (0.00086) [2022-07-11 13:10:18,865][26022] Updated weights on worker 0-0, policy_version 1205154 (0.00092) [2022-07-11 13:10:21,003][26022] Updated weights on worker 0-0, policy_version 1205164 (0.00101) [2022-07-11 13:10:21,342][25689] Fps is (10 sec: 5501.0, 60 sec: 5554.5, 300 sec: 5535.2). Total num frames: 1234089984. Throughput: 0: 5726.9. Samples: 1234089894. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:21,343][25689] Avg episode reward: [(0, '-0.135')] [2022-07-11 13:10:22,642][26022] Updated weights on worker 0-0, policy_version 1205174 (0.00085) [2022-07-11 13:10:24,643][26022] Updated weights on worker 0-0, policy_version 1205184 (0.00089) [2022-07-11 13:10:26,402][25689] Fps is (10 sec: 5374.1, 60 sec: 5534.0, 300 sec: 5527.3). Total num frames: 1234116608. Throughput: 0: 5806.7. Samples: 1234123232. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:26,402][25689] Avg episode reward: [(0, '-1.074')] [2022-07-11 13:10:26,522][26022] Updated weights on worker 0-0, policy_version 1205194 (0.00084) [2022-07-11 13:10:28,315][26022] Updated weights on worker 0-0, policy_version 1205204 (0.00080) [2022-07-11 13:10:30,322][26022] Updated weights on worker 0-0, policy_version 1205214 (0.00093) [2022-07-11 13:10:31,404][25689] Fps is (10 sec: 5495.7, 60 sec: 5551.5, 300 sec: 5530.9). Total num frames: 1234145280. Throughput: 0: 4975.3. Samples: 1234139664. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:31,404][25689] Avg episode reward: [(0, '-0.792')] [2022-07-11 13:10:32,016][26022] Updated weights on worker 0-0, policy_version 1205224 (0.00088) [2022-07-11 13:10:33,907][26022] Updated weights on worker 0-0, policy_version 1205234 (0.00084) [2022-07-11 13:10:35,670][26022] Updated weights on worker 0-0, policy_version 1205244 (0.00089) [2022-07-11 13:10:36,442][25689] Fps is (10 sec: 5609.8, 60 sec: 5550.2, 300 sec: 5531.2). Total num frames: 1234172928. Throughput: 0: 5785.6. Samples: 1234173172. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:36,442][25689] Avg episode reward: [(0, '-1.288')] [2022-07-11 13:10:37,620][26022] Updated weights on worker 0-0, policy_version 1205254 (0.00077) [2022-07-11 13:10:39,293][26022] Updated weights on worker 0-0, policy_version 1205264 (0.00092) [2022-07-11 13:10:41,306][26022] Updated weights on worker 0-0, policy_version 1205274 (0.00087) [2022-07-11 13:10:41,512][25689] Fps is (10 sec: 5571.9, 60 sec: 5530.7, 300 sec: 5533.7). Total num frames: 1234201600. Throughput: 0: 5782.0. Samples: 1234206514. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:41,512][25689] Avg episode reward: [(0, '-1.803')] [2022-07-11 13:10:42,953][26022] Updated weights on worker 0-0, policy_version 1205284 (0.00091) [2022-07-11 13:10:45,085][26022] Updated weights on worker 0-0, policy_version 1205294 (0.00104) [2022-07-11 13:10:46,559][25689] Fps is (10 sec: 5566.4, 60 sec: 5534.6, 300 sec: 5529.5). Total num frames: 1234229248. Throughput: 0: 4963.2. Samples: 1234223278. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:46,560][25689] Avg episode reward: [(0, '-0.780')] [2022-07-11 13:10:46,714][26022] Updated weights on worker 0-0, policy_version 1205304 (0.00096) [2022-07-11 13:10:48,835][26022] Updated weights on worker 0-0, policy_version 1205314 (0.00086) [2022-07-11 13:10:50,371][26022] Updated weights on worker 0-0, policy_version 1205324 (0.00083) [2022-07-11 13:10:51,629][25689] Fps is (10 sec: 5566.7, 60 sec: 5534.2, 300 sec: 5528.2). Total num frames: 1234257920. Throughput: 0: 5780.0. Samples: 1234256566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:51,629][25689] Avg episode reward: [(0, '-0.688')] [2022-07-11 13:10:52,439][26022] Updated weights on worker 0-0, policy_version 1205334 (0.00086) [2022-07-11 13:10:54,054][26022] Updated weights on worker 0-0, policy_version 1205344 (0.00087) [2022-07-11 13:10:56,122][26022] Updated weights on worker 0-0, policy_version 1205354 (0.00084) [2022-07-11 13:10:56,639][25689] Fps is (10 sec: 5587.3, 60 sec: 5517.6, 300 sec: 5532.6). Total num frames: 1234285568. Throughput: 0: 5788.1. Samples: 1234290080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:10:56,640][25689] Avg episode reward: [(0, '0.142')] [2022-07-11 13:10:57,780][26022] Updated weights on worker 0-0, policy_version 1205364 (0.00091) [2022-07-11 13:10:59,693][26022] Updated weights on worker 0-0, policy_version 1205374 (0.00084) [2022-07-11 13:11:01,407][26022] Updated weights on worker 0-0, policy_version 1205384 (0.00087) [2022-07-11 13:11:01,735][25689] Fps is (10 sec: 5572.7, 60 sec: 5534.2, 300 sec: 5538.2). Total num frames: 1234314240. Throughput: 0: 5789.3. Samples: 1234323594. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:01,736][25689] Avg episode reward: [(0, '0.728')] [2022-07-11 13:11:03,952][26022] Updated weights on worker 0-0, policy_version 1205394 (0.00082) [2022-07-11 13:11:05,575][26022] Updated weights on worker 0-0, policy_version 1205404 (0.00090) [2022-07-11 13:11:06,788][25689] Fps is (10 sec: 5246.8, 60 sec: 5498.3, 300 sec: 5527.1). Total num frames: 1234338816. Throughput: 0: 5686.4. Samples: 1234338306. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:06,789][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 13:11:07,449][26022] Updated weights on worker 0-0, policy_version 1205414 (0.00087) [2022-07-11 13:11:09,027][26022] Updated weights on worker 0-0, policy_version 1205424 (0.00089) [2022-07-11 13:11:11,161][26022] Updated weights on worker 0-0, policy_version 1205434 (0.00095) [2022-07-11 13:11:11,800][25689] Fps is (10 sec: 5290.6, 60 sec: 5499.9, 300 sec: 5530.9). Total num frames: 1234367488. Throughput: 0: 5714.5. Samples: 1234371834. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:11,800][25689] Avg episode reward: [(0, '1.292')] [2022-07-11 13:11:12,986][26022] Updated weights on worker 0-0, policy_version 1205444 (0.00087) [2022-07-11 13:11:14,670][26022] Updated weights on worker 0-0, policy_version 1205454 (0.00087) [2022-07-11 13:11:16,552][26022] Updated weights on worker 0-0, policy_version 1205464 (0.00093) [2022-07-11 13:11:16,806][25689] Fps is (10 sec: 5724.3, 60 sec: 5517.5, 300 sec: 5532.2). Total num frames: 1234396160. Throughput: 0: 5719.9. Samples: 1234405428. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:16,806][25689] Avg episode reward: [(0, '1.090')] [2022-07-11 13:11:18,545][26022] Updated weights on worker 0-0, policy_version 1205474 (0.00110) [2022-07-11 13:11:20,239][26022] Updated weights on worker 0-0, policy_version 1205484 (0.00084) [2022-07-11 13:11:21,864][25689] Fps is (10 sec: 5494.3, 60 sec: 5498.8, 300 sec: 5528.7). Total num frames: 1234422784. Throughput: 0: 4887.3. Samples: 1234421970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:21,866][25689] Avg episode reward: [(0, '0.968')] [2022-07-11 13:11:22,123][26022] Updated weights on worker 0-0, policy_version 1205494 (0.00096) [2022-07-11 13:11:23,965][26022] Updated weights on worker 0-0, policy_version 1205504 (0.00085) [2022-07-11 13:11:25,811][26022] Updated weights on worker 0-0, policy_version 1205514 (0.00097) [2022-07-11 13:11:26,910][25689] Fps is (10 sec: 5472.3, 60 sec: 5533.9, 300 sec: 5528.1). Total num frames: 1234451456. Throughput: 0: 5814.5. Samples: 1234455308. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:26,915][25689] Avg episode reward: [(0, '0.225')] [2022-07-11 13:11:27,930][26022] Updated weights on worker 0-0, policy_version 1205524 (0.00096) [2022-07-11 13:11:29,602][26022] Updated weights on worker 0-0, policy_version 1205534 (0.00094) [2022-07-11 13:11:31,438][26022] Updated weights on worker 0-0, policy_version 1205544 (0.00085) [2022-07-11 13:11:31,972][25689] Fps is (10 sec: 5673.2, 60 sec: 5528.4, 300 sec: 5527.9). Total num frames: 1234480128. Throughput: 0: 5787.2. Samples: 1234488574. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:31,973][25689] Avg episode reward: [(0, '-0.081')] [2022-07-11 13:11:33,245][26022] Updated weights on worker 0-0, policy_version 1205554 (0.00090) [2022-07-11 13:11:35,104][26022] Updated weights on worker 0-0, policy_version 1205564 (0.00088) [2022-07-11 13:11:36,991][25689] Fps is (10 sec: 5587.1, 60 sec: 5530.2, 300 sec: 5531.6). Total num frames: 1234507776. Throughput: 0: 4947.8. Samples: 1234505300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:36,991][25689] Avg episode reward: [(0, '-0.325')] [2022-07-11 13:11:36,995][26022] Updated weights on worker 0-0, policy_version 1205574 (0.00089) [2022-07-11 13:11:38,644][26022] Updated weights on worker 0-0, policy_version 1205584 (0.00092) [2022-07-11 13:11:40,623][26022] Updated weights on worker 0-0, policy_version 1205594 (0.00095) [2022-07-11 13:11:42,062][25689] Fps is (10 sec: 5683.1, 60 sec: 5547.0, 300 sec: 5540.8). Total num frames: 1234537472. Throughput: 0: 5796.8. Samples: 1234539054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:42,063][25689] Avg episode reward: [(0, '-0.530')] [2022-07-11 13:11:42,367][26022] Updated weights on worker 0-0, policy_version 1205604 (0.00091) [2022-07-11 13:11:44,373][26022] Updated weights on worker 0-0, policy_version 1205614 (0.00100) [2022-07-11 13:11:46,386][26022] Updated weights on worker 0-0, policy_version 1205625 (0.00078) [2022-07-11 13:11:47,145][25689] Fps is (10 sec: 5546.6, 60 sec: 5526.9, 300 sec: 5532.4). Total num frames: 1234564096. Throughput: 0: 5794.7. Samples: 1234572560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:47,145][25689] Avg episode reward: [(0, '-0.430')] [2022-07-11 13:11:48,010][26022] Updated weights on worker 0-0, policy_version 1205635 (0.00102) [2022-07-11 13:11:50,119][26022] Updated weights on worker 0-0, policy_version 1205645 (0.00090) [2022-07-11 13:11:51,593][26022] Updated weights on worker 0-0, policy_version 1205655 (0.00081) [2022-07-11 13:11:52,179][25689] Fps is (10 sec: 5466.0, 60 sec: 5530.1, 300 sec: 5529.1). Total num frames: 1234592768. Throughput: 0: 4993.1. Samples: 1234589468. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:52,179][25689] Avg episode reward: [(0, '-0.659')] [2022-07-11 13:11:53,660][26022] Updated weights on worker 0-0, policy_version 1205665 (0.00094) [2022-07-11 13:11:55,485][26022] Updated weights on worker 0-0, policy_version 1205675 (0.00086) [2022-07-11 13:11:57,124][26022] Updated weights on worker 0-0, policy_version 1205685 (0.00056) [2022-07-11 13:11:57,193][25689] Fps is (10 sec: 5706.5, 60 sec: 5546.7, 300 sec: 5540.2). Total num frames: 1234621440. Throughput: 0: 5846.0. Samples: 1234623406. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:11:57,194][25689] Avg episode reward: [(0, '-0.813')] [2022-07-11 13:11:59,068][26022] Updated weights on worker 0-0, policy_version 1205695 (0.00087) [2022-07-11 13:12:00,619][26022] Updated weights on worker 0-0, policy_version 1205705 (0.00080) [2022-07-11 13:12:02,231][25689] Fps is (10 sec: 5602.5, 60 sec: 5535.0, 300 sec: 5537.6). Total num frames: 1234649088. Throughput: 0: 5858.2. Samples: 1234657208. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:02,232][25689] Avg episode reward: [(0, '-0.830')] [2022-07-11 13:12:02,866][26022] Updated weights on worker 0-0, policy_version 1205715 (0.00090) [2022-07-11 13:12:04,690][26022] Updated weights on worker 0-0, policy_version 1205725 (0.00087) [2022-07-11 13:12:06,634][26022] Updated weights on worker 0-0, policy_version 1205735 (0.00089) [2022-07-11 13:12:07,258][25689] Fps is (10 sec: 5392.4, 60 sec: 5571.3, 300 sec: 5537.5). Total num frames: 1234675712. Throughput: 0: 4961.4. Samples: 1234672350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:07,259][25689] Avg episode reward: [(0, '-1.208')] [2022-07-11 13:12:08,405][26022] Updated weights on worker 0-0, policy_version 1205745 (0.00098) [2022-07-11 13:12:09,880][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:12:09,898][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001205752_1234690048.pth [2022-07-11 13:12:09,898][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001203804_1232695296.pth [2022-07-11 13:12:10,309][26022] Updated weights on worker 0-0, policy_version 1205755 (0.00088) [2022-07-11 13:12:12,103][26022] Updated weights on worker 0-0, policy_version 1205765 (0.00093) [2022-07-11 13:12:12,261][25689] Fps is (10 sec: 5513.4, 60 sec: 5572.1, 300 sec: 5537.6). Total num frames: 1234704384. Throughput: 0: 5806.6. Samples: 1234706076. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:12,261][25689] Avg episode reward: [(0, '-1.019')] [2022-07-11 13:12:13,811][26022] Updated weights on worker 0-0, policy_version 1205775 (0.00078) [2022-07-11 13:12:15,686][26022] Updated weights on worker 0-0, policy_version 1205785 (0.00085) [2022-07-11 13:12:17,292][25689] Fps is (10 sec: 5715.0, 60 sec: 5569.8, 300 sec: 5542.9). Total num frames: 1234733056. Throughput: 0: 5805.6. Samples: 1234740090. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:17,292][25689] Avg episode reward: [(0, '-0.820')] [2022-07-11 13:12:17,518][26022] Updated weights on worker 0-0, policy_version 1205795 (0.00087) [2022-07-11 13:12:19,339][26022] Updated weights on worker 0-0, policy_version 1205805 (0.00089) [2022-07-11 13:12:21,174][26022] Updated weights on worker 0-0, policy_version 1205815 (0.00082) [2022-07-11 13:12:22,348][25689] Fps is (10 sec: 5583.3, 60 sec: 5587.0, 300 sec: 5542.8). Total num frames: 1234760704. Throughput: 0: 4949.2. Samples: 1234756768. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:22,350][25689] Avg episode reward: [(0, '-0.034')] [2022-07-11 13:12:22,990][26022] Updated weights on worker 0-0, policy_version 1205825 (0.00108) [2022-07-11 13:12:24,787][26022] Updated weights on worker 0-0, policy_version 1205835 (0.00085) [2022-07-11 13:12:26,592][26022] Updated weights on worker 0-0, policy_version 1205845 (0.00081) [2022-07-11 13:12:27,366][25689] Fps is (10 sec: 5590.9, 60 sec: 5589.6, 300 sec: 5546.0). Total num frames: 1234789376. Throughput: 0: 5881.5. Samples: 1234790612. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:27,366][25689] Avg episode reward: [(0, '-1.443')] [2022-07-11 13:12:28,382][26022] Updated weights on worker 0-0, policy_version 1205855 (0.00084) [2022-07-11 13:12:30,241][26022] Updated weights on worker 0-0, policy_version 1205865 (0.00085) [2022-07-11 13:12:32,144][26022] Updated weights on worker 0-0, policy_version 1205875 (0.00094) [2022-07-11 13:12:32,391][25689] Fps is (10 sec: 5607.6, 60 sec: 5576.0, 300 sec: 5542.4). Total num frames: 1234817024. Throughput: 0: 5887.7. Samples: 1234824600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:32,393][25689] Avg episode reward: [(0, '-0.927')] [2022-07-11 13:12:33,839][26022] Updated weights on worker 0-0, policy_version 1205885 (0.00088) [2022-07-11 13:12:35,730][26022] Updated weights on worker 0-0, policy_version 1205895 (0.00089) [2022-07-11 13:12:37,402][25689] Fps is (10 sec: 5611.5, 60 sec: 5593.6, 300 sec: 5547.5). Total num frames: 1234845696. Throughput: 0: 5044.4. Samples: 1234841534. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:12:37,403][25689] Avg episode reward: [(0, '-0.355')] [2022-07-11 13:12:37,429][26022] Updated weights on worker 0-0, policy_version 1205905 (0.00088) [2022-07-11 13:12:39,411][26022] Updated weights on worker 0-0, policy_version 1205915 (0.00093) [2022-07-11 13:12:41,194][26022] Updated weights on worker 0-0, policy_version 1205925 (0.00109) [2022-07-11 13:12:42,492][25689] Fps is (10 sec: 5677.4, 60 sec: 5575.0, 300 sec: 5546.3). Total num frames: 1234874368. Throughput: 0: 5879.7. Samples: 1234875208. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:12:42,492][25689] Avg episode reward: [(0, '-1.790')] [2022-07-11 13:12:43,081][26022] Updated weights on worker 0-0, policy_version 1205935 (0.00081) [2022-07-11 13:12:44,865][26022] Updated weights on worker 0-0, policy_version 1205945 (0.00091) [2022-07-11 13:12:46,808][26022] Updated weights on worker 0-0, policy_version 1205955 (0.00090) [2022-07-11 13:12:47,511][25689] Fps is (10 sec: 5571.2, 60 sec: 5597.8, 300 sec: 5543.0). Total num frames: 1234902016. Throughput: 0: 5877.7. Samples: 1234909022. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:12:47,512][25689] Avg episode reward: [(0, '-1.644')] [2022-07-11 13:12:48,419][26022] Updated weights on worker 0-0, policy_version 1205965 (0.00091) [2022-07-11 13:12:50,287][26022] Updated weights on worker 0-0, policy_version 1205975 (0.00120) [2022-07-11 13:12:52,007][26022] Updated weights on worker 0-0, policy_version 1205985 (0.00085) [2022-07-11 13:12:52,518][25689] Fps is (10 sec: 5617.0, 60 sec: 5600.3, 300 sec: 5549.8). Total num frames: 1234930688. Throughput: 0: 5042.0. Samples: 1234926080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:12:52,519][25689] Avg episode reward: [(0, '-0.541')] [2022-07-11 13:12:53,896][26022] Updated weights on worker 0-0, policy_version 1205995 (0.00092) [2022-07-11 13:12:55,655][26022] Updated weights on worker 0-0, policy_version 1206005 (0.00086) [2022-07-11 13:12:57,582][25689] Fps is (10 sec: 5592.5, 60 sec: 5578.8, 300 sec: 5544.0). Total num frames: 1234958336. Throughput: 0: 5877.4. Samples: 1234960140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:12:57,583][25689] Avg episode reward: [(0, '-0.419')] [2022-07-11 13:12:57,589][26022] Updated weights on worker 0-0, policy_version 1206015 (0.00088) [2022-07-11 13:12:59,284][26022] Updated weights on worker 0-0, policy_version 1206025 (0.00084) [2022-07-11 13:13:01,226][26022] Updated weights on worker 0-0, policy_version 1206035 (0.00083) [2022-07-11 13:13:02,613][25689] Fps is (10 sec: 5579.0, 60 sec: 5596.4, 300 sec: 5551.0). Total num frames: 1234987008. Throughput: 0: 5830.4. Samples: 1234992526. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:02,614][25689] Avg episode reward: [(0, '-0.055')] [2022-07-11 13:13:03,255][26022] Updated weights on worker 0-0, policy_version 1206045 (0.00085) [2022-07-11 13:13:05,191][26022] Updated weights on worker 0-0, policy_version 1206055 (0.00089) [2022-07-11 13:13:06,858][26022] Updated weights on worker 0-0, policy_version 1206065 (0.00063) [2022-07-11 13:13:07,626][25689] Fps is (10 sec: 5403.4, 60 sec: 5580.7, 300 sec: 5544.4). Total num frames: 1235012608. Throughput: 0: 4958.7. Samples: 1235008764. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:07,626][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 13:13:08,721][26022] Updated weights on worker 0-0, policy_version 1206075 (0.00057) [2022-07-11 13:13:10,563][26022] Updated weights on worker 0-0, policy_version 1206085 (0.00078) [2022-07-11 13:13:12,335][26022] Updated weights on worker 0-0, policy_version 1206095 (0.00085) [2022-07-11 13:13:12,629][25689] Fps is (10 sec: 5623.3, 60 sec: 5614.6, 300 sec: 5555.8). Total num frames: 1235043328. Throughput: 0: 5792.7. Samples: 1235042574. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:12,629][25689] Avg episode reward: [(0, '1.385')] [2022-07-11 13:13:14,178][26022] Updated weights on worker 0-0, policy_version 1206105 (0.00085) [2022-07-11 13:13:15,978][26022] Updated weights on worker 0-0, policy_version 1206115 (0.00085) [2022-07-11 13:13:17,635][25689] Fps is (10 sec: 5729.3, 60 sec: 5583.0, 300 sec: 5546.9). Total num frames: 1235069952. Throughput: 0: 5809.4. Samples: 1235076636. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:17,637][25689] Avg episode reward: [(0, '1.384')] [2022-07-11 13:13:17,838][26022] Updated weights on worker 0-0, policy_version 1206125 (0.00087) [2022-07-11 13:13:19,797][26022] Updated weights on worker 0-0, policy_version 1206135 (0.00085) [2022-07-11 13:13:21,490][26022] Updated weights on worker 0-0, policy_version 1206145 (0.00087) [2022-07-11 13:13:22,696][25689] Fps is (10 sec: 5391.0, 60 sec: 5582.6, 300 sec: 5550.7). Total num frames: 1235097600. Throughput: 0: 5015.9. Samples: 1235093258. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:22,698][25689] Avg episode reward: [(0, '1.321')] [2022-07-11 13:13:23,284][26022] Updated weights on worker 0-0, policy_version 1206155 (0.00087) [2022-07-11 13:13:25,463][26022] Updated weights on worker 0-0, policy_version 1206165 (0.00087) [2022-07-11 13:13:27,028][26022] Updated weights on worker 0-0, policy_version 1206175 (0.00084) [2022-07-11 13:13:27,730][25689] Fps is (10 sec: 5578.8, 60 sec: 5581.0, 300 sec: 5554.8). Total num frames: 1235126272. Throughput: 0: 5861.8. Samples: 1235126612. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:27,732][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 13:13:28,973][26022] Updated weights on worker 0-0, policy_version 1206185 (0.00088) [2022-07-11 13:13:30,563][26022] Updated weights on worker 0-0, policy_version 1206195 (0.00086) [2022-07-11 13:13:32,374][26022] Updated weights on worker 0-0, policy_version 1206205 (0.00086) [2022-07-11 13:13:32,737][25689] Fps is (10 sec: 5609.2, 60 sec: 5582.8, 300 sec: 5551.3). Total num frames: 1235153920. Throughput: 0: 5864.4. Samples: 1235160494. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:32,738][25689] Avg episode reward: [(0, '1.000')] [2022-07-11 13:13:34,309][26022] Updated weights on worker 0-0, policy_version 1206215 (0.00087) [2022-07-11 13:13:36,063][26022] Updated weights on worker 0-0, policy_version 1206225 (0.00094) [2022-07-11 13:13:37,777][25689] Fps is (10 sec: 5707.7, 60 sec: 5597.0, 300 sec: 5555.2). Total num frames: 1235183616. Throughput: 0: 5011.2. Samples: 1235177572. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:37,778][25689] Avg episode reward: [(0, '1.019')] [2022-07-11 13:13:37,919][26022] Updated weights on worker 0-0, policy_version 1206235 (0.00095) [2022-07-11 13:13:39,874][26022] Updated weights on worker 0-0, policy_version 1206245 (0.00093) [2022-07-11 13:13:41,456][26022] Updated weights on worker 0-0, policy_version 1206255 (0.00084) [2022-07-11 13:13:42,860][25689] Fps is (10 sec: 5664.3, 60 sec: 5580.7, 300 sec: 5558.6). Total num frames: 1235211264. Throughput: 0: 5856.7. Samples: 1235211354. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:42,863][25689] Avg episode reward: [(0, '0.972')] [2022-07-11 13:13:43,496][26022] Updated weights on worker 0-0, policy_version 1206265 (0.00084) [2022-07-11 13:13:45,135][26022] Updated weights on worker 0-0, policy_version 1206275 (0.00084) [2022-07-11 13:13:47,063][26022] Updated weights on worker 0-0, policy_version 1206285 (0.00099) [2022-07-11 13:13:47,864][25689] Fps is (10 sec: 5481.8, 60 sec: 5582.1, 300 sec: 5558.6). Total num frames: 1235238912. Throughput: 0: 5899.7. Samples: 1235245396. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:47,866][25689] Avg episode reward: [(0, '1.187')] [2022-07-11 13:13:48,790][26022] Updated weights on worker 0-0, policy_version 1206295 (0.00096) [2022-07-11 13:13:50,752][26022] Updated weights on worker 0-0, policy_version 1206305 (0.00093) [2022-07-11 13:13:52,646][26022] Updated weights on worker 0-0, policy_version 1206315 (0.00092) [2022-07-11 13:13:52,871][25689] Fps is (10 sec: 5626.1, 60 sec: 5582.2, 300 sec: 5555.5). Total num frames: 1235267584. Throughput: 0: 5056.0. Samples: 1235262292. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:52,872][25689] Avg episode reward: [(0, '1.487')] [2022-07-11 13:13:54,373][26022] Updated weights on worker 0-0, policy_version 1206325 (0.00079) [2022-07-11 13:13:56,199][26022] Updated weights on worker 0-0, policy_version 1206335 (0.00086) [2022-07-11 13:13:57,889][25689] Fps is (10 sec: 5720.1, 60 sec: 5603.3, 300 sec: 5560.2). Total num frames: 1235296256. Throughput: 0: 5876.6. Samples: 1235295762. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:13:57,890][25689] Avg episode reward: [(0, '2.039')] [2022-07-11 13:13:58,111][26022] Updated weights on worker 0-0, policy_version 1206345 (0.00095) [2022-07-11 13:13:59,744][26022] Updated weights on worker 0-0, policy_version 1206355 (0.00090) [2022-07-11 13:14:02,107][26022] Updated weights on worker 0-0, policy_version 1206365 (0.00086) [2022-07-11 13:14:02,939][25689] Fps is (10 sec: 5390.6, 60 sec: 5550.7, 300 sec: 5560.3). Total num frames: 1235321856. Throughput: 0: 5800.9. Samples: 1235327826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:02,941][25689] Avg episode reward: [(0, '-0.453')] [2022-07-11 13:14:03,922][26022] Updated weights on worker 0-0, policy_version 1206375 (0.00085) [2022-07-11 13:14:05,616][26022] Updated weights on worker 0-0, policy_version 1206385 (0.00094) [2022-07-11 13:14:07,364][26022] Updated weights on worker 0-0, policy_version 1206395 (0.00089) [2022-07-11 13:14:07,950][25689] Fps is (10 sec: 5496.5, 60 sec: 5618.8, 300 sec: 5564.6). Total num frames: 1235351552. Throughput: 0: 4946.9. Samples: 1235344756. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:07,951][25689] Avg episode reward: [(0, '0.057')] [2022-07-11 13:14:09,379][26022] Updated weights on worker 0-0, policy_version 1206405 (0.00085) [2022-07-11 13:14:09,954][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:14:09,963][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001206409_1235362816.pth [2022-07-11 13:14:09,964][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001204454_1233360896.pth [2022-07-11 13:14:10,947][26022] Updated weights on worker 0-0, policy_version 1206415 (0.00087) [2022-07-11 13:14:12,840][26022] Updated weights on worker 0-0, policy_version 1206425 (0.00085) [2022-07-11 13:14:12,955][25689] Fps is (10 sec: 5725.2, 60 sec: 5567.6, 300 sec: 5565.4). Total num frames: 1235379200. Throughput: 0: 5796.4. Samples: 1235378706. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:12,955][25689] Avg episode reward: [(0, '-0.024')] [2022-07-11 13:14:14,634][26022] Updated weights on worker 0-0, policy_version 1206435 (0.00084) [2022-07-11 13:14:16,475][26022] Updated weights on worker 0-0, policy_version 1206445 (0.00087) [2022-07-11 13:14:17,982][25689] Fps is (10 sec: 5613.6, 60 sec: 5599.6, 300 sec: 5566.2). Total num frames: 1235407872. Throughput: 0: 5839.3. Samples: 1235413092. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:17,983][25689] Avg episode reward: [(0, '-0.037')] [2022-07-11 13:14:18,302][26022] Updated weights on worker 0-0, policy_version 1206455 (0.00079) [2022-07-11 13:14:20,152][26022] Updated weights on worker 0-0, policy_version 1206465 (0.00081) [2022-07-11 13:14:21,824][26022] Updated weights on worker 0-0, policy_version 1206475 (0.00085) [2022-07-11 13:14:23,073][25689] Fps is (10 sec: 5667.4, 60 sec: 5613.8, 300 sec: 5568.3). Total num frames: 1235436544. Throughput: 0: 5065.2. Samples: 1235429810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:23,073][25689] Avg episode reward: [(0, '-1.082')] [2022-07-11 13:14:23,682][26022] Updated weights on worker 0-0, policy_version 1206485 (0.00093) [2022-07-11 13:14:25,669][26022] Updated weights on worker 0-0, policy_version 1206495 (0.00088) [2022-07-11 13:14:27,468][26022] Updated weights on worker 0-0, policy_version 1206505 (0.00051) [2022-07-11 13:14:28,171][25689] Fps is (10 sec: 5627.9, 60 sec: 5607.8, 300 sec: 5570.1). Total num frames: 1235465216. Throughput: 0: 5866.1. Samples: 1235463380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:28,173][25689] Avg episode reward: [(0, '0.893')] [2022-07-11 13:14:29,387][26022] Updated weights on worker 0-0, policy_version 1206515 (0.00095) [2022-07-11 13:14:31,096][26022] Updated weights on worker 0-0, policy_version 1206525 (0.00079) [2022-07-11 13:14:32,835][26022] Updated weights on worker 0-0, policy_version 1206535 (0.00083) [2022-07-11 13:14:33,219][25689] Fps is (10 sec: 5551.1, 60 sec: 5604.0, 300 sec: 5569.6). Total num frames: 1235492864. Throughput: 0: 5846.9. Samples: 1235497188. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:33,219][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 13:14:34,671][26022] Updated weights on worker 0-0, policy_version 1206545 (0.00081) [2022-07-11 13:14:36,413][26022] Updated weights on worker 0-0, policy_version 1206555 (0.00084) [2022-07-11 13:14:38,229][25689] Fps is (10 sec: 5701.5, 60 sec: 5606.8, 300 sec: 5570.2). Total num frames: 1235522560. Throughput: 0: 5838.4. Samples: 1235531302. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:38,231][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 13:14:38,239][26022] Updated weights on worker 0-0, policy_version 1206565 (0.00448) [2022-07-11 13:14:40,009][26022] Updated weights on worker 0-0, policy_version 1206575 (0.00093) [2022-07-11 13:14:41,896][26022] Updated weights on worker 0-0, policy_version 1206585 (0.00091) [2022-07-11 13:14:43,300][25689] Fps is (10 sec: 5688.4, 60 sec: 5608.0, 300 sec: 5570.6). Total num frames: 1235550208. Throughput: 0: 5856.2. Samples: 1235548262. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:43,300][25689] Avg episode reward: [(0, '0.519')] [2022-07-11 13:14:43,724][26022] Updated weights on worker 0-0, policy_version 1206595 (0.00087) [2022-07-11 13:14:45,396][26022] Updated weights on worker 0-0, policy_version 1206605 (0.00084) [2022-07-11 13:14:47,541][26022] Updated weights on worker 0-0, policy_version 1206615 (0.00101) [2022-07-11 13:14:48,361][25689] Fps is (10 sec: 5558.9, 60 sec: 5619.6, 300 sec: 5570.7). Total num frames: 1235578880. Throughput: 0: 5892.1. Samples: 1235582336. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:48,363][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 13:14:49,119][26022] Updated weights on worker 0-0, policy_version 1206625 (0.00089) [2022-07-11 13:14:50,994][26022] Updated weights on worker 0-0, policy_version 1206635 (0.00082) [2022-07-11 13:14:52,606][26022] Updated weights on worker 0-0, policy_version 1206645 (0.00081) [2022-07-11 13:14:53,376][25689] Fps is (10 sec: 5792.7, 60 sec: 5635.8, 300 sec: 5574.1). Total num frames: 1235608576. Throughput: 0: 5921.7. Samples: 1235616552. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:53,378][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 13:14:54,496][26022] Updated weights on worker 0-0, policy_version 1206655 (0.00079) [2022-07-11 13:14:56,262][26022] Updated weights on worker 0-0, policy_version 1206665 (0.00094) [2022-07-11 13:14:58,188][26022] Updated weights on worker 0-0, policy_version 1206675 (0.00084) [2022-07-11 13:14:58,391][25689] Fps is (10 sec: 5716.9, 60 sec: 5619.1, 300 sec: 5575.5). Total num frames: 1235636224. Throughput: 0: 5070.1. Samples: 1235633526. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:14:58,393][25689] Avg episode reward: [(0, '0.080')] [2022-07-11 13:15:00,070][26022] Updated weights on worker 0-0, policy_version 1206685 (0.00089) [2022-07-11 13:15:02,023][26022] Updated weights on worker 0-0, policy_version 1206695 (0.00097) [2022-07-11 13:15:03,456][25689] Fps is (10 sec: 5282.4, 60 sec: 5617.7, 300 sec: 5571.4). Total num frames: 1235661824. Throughput: 0: 5792.9. Samples: 1235665026. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:03,457][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 13:15:04,033][26022] Updated weights on worker 0-0, policy_version 1206705 (0.00085) [2022-07-11 13:15:05,619][26022] Updated weights on worker 0-0, policy_version 1206715 (0.00088) [2022-07-11 13:15:07,650][26022] Updated weights on worker 0-0, policy_version 1206725 (0.00090) [2022-07-11 13:15:08,522][25689] Fps is (10 sec: 5559.1, 60 sec: 5629.4, 300 sec: 5577.6). Total num frames: 1235692544. Throughput: 0: 5794.2. Samples: 1235699158. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:08,524][25689] Avg episode reward: [(0, '0.469')] [2022-07-11 13:15:09,435][26022] Updated weights on worker 0-0, policy_version 1206735 (0.00082) [2022-07-11 13:15:11,123][26022] Updated weights on worker 0-0, policy_version 1206745 (0.00088) [2022-07-11 13:15:13,300][26022] Updated weights on worker 0-0, policy_version 1206755 (0.00087) [2022-07-11 13:15:13,538][25689] Fps is (10 sec: 5586.0, 60 sec: 5594.6, 300 sec: 5570.7). Total num frames: 1235718144. Throughput: 0: 4940.4. Samples: 1235716164. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:13,539][25689] Avg episode reward: [(0, '-0.667')] [2022-07-11 13:15:14,717][26022] Updated weights on worker 0-0, policy_version 1206765 (0.00088) [2022-07-11 13:15:16,793][26022] Updated weights on worker 0-0, policy_version 1206775 (0.00091) [2022-07-11 13:15:18,285][26022] Updated weights on worker 0-0, policy_version 1206785 (0.00082) [2022-07-11 13:15:18,558][25689] Fps is (10 sec: 5611.8, 60 sec: 5629.2, 300 sec: 5581.4). Total num frames: 1235748864. Throughput: 0: 5801.9. Samples: 1235750534. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:18,560][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 13:15:20,269][26022] Updated weights on worker 0-0, policy_version 1206795 (0.00080) [2022-07-11 13:15:22,115][26022] Updated weights on worker 0-0, policy_version 1206805 (0.00084) [2022-07-11 13:15:23,664][25689] Fps is (10 sec: 5865.4, 60 sec: 5627.7, 300 sec: 5587.4). Total num frames: 1235777536. Throughput: 0: 5914.0. Samples: 1235784538. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:23,665][25689] Avg episode reward: [(0, '1.319')] [2022-07-11 13:15:23,763][26022] Updated weights on worker 0-0, policy_version 1206815 (0.00094) [2022-07-11 13:15:25,584][26022] Updated weights on worker 0-0, policy_version 1206825 (0.00088) [2022-07-11 13:15:27,731][26022] Updated weights on worker 0-0, policy_version 1206835 (0.00086) [2022-07-11 13:15:28,734][25689] Fps is (10 sec: 5434.3, 60 sec: 5596.6, 300 sec: 5579.3). Total num frames: 1235804160. Throughput: 0: 5067.9. Samples: 1235801586. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:28,734][25689] Avg episode reward: [(0, '1.427')] [2022-07-11 13:15:29,094][26022] Updated weights on worker 0-0, policy_version 1206845 (0.00107) [2022-07-11 13:15:31,476][26022] Updated weights on worker 0-0, policy_version 1206855 (0.00085) [2022-07-11 13:15:32,546][26022] Updated weights on worker 0-0, policy_version 1206865 (0.00083) [2022-07-11 13:15:33,805][25689] Fps is (10 sec: 5554.1, 60 sec: 5628.2, 300 sec: 5585.5). Total num frames: 1235833856. Throughput: 0: 5875.1. Samples: 1235835230. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:33,805][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 13:15:34,774][26022] Updated weights on worker 0-0, policy_version 1206875 (0.00095) [2022-07-11 13:15:36,403][26022] Updated weights on worker 0-0, policy_version 1206885 (0.00091) [2022-07-11 13:15:38,195][26022] Updated weights on worker 0-0, policy_version 1206895 (0.00092) [2022-07-11 13:15:38,830][25689] Fps is (10 sec: 5780.9, 60 sec: 5609.9, 300 sec: 5586.4). Total num frames: 1235862528. Throughput: 0: 5874.1. Samples: 1235869614. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:38,831][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 13:15:40,330][26022] Updated weights on worker 0-0, policy_version 1206905 (0.00100) [2022-07-11 13:15:41,796][26022] Updated weights on worker 0-0, policy_version 1206915 (0.00168) [2022-07-11 13:15:43,842][26022] Updated weights on worker 0-0, policy_version 1206925 (0.00086) [2022-07-11 13:15:43,923][25689] Fps is (10 sec: 5667.5, 60 sec: 5624.8, 300 sec: 5589.0). Total num frames: 1235891200. Throughput: 0: 5022.4. Samples: 1235886284. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:43,923][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 13:15:45,623][26022] Updated weights on worker 0-0, policy_version 1206935 (0.00099) [2022-07-11 13:15:47,535][26022] Updated weights on worker 0-0, policy_version 1206945 (0.00097) [2022-07-11 13:15:48,931][25689] Fps is (10 sec: 5677.5, 60 sec: 5629.7, 300 sec: 5590.1). Total num frames: 1235919872. Throughput: 0: 5873.8. Samples: 1235920218. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:48,932][25689] Avg episode reward: [(0, '0.052')] [2022-07-11 13:15:49,491][26022] Updated weights on worker 0-0, policy_version 1206955 (0.00084) [2022-07-11 13:15:51,052][26022] Updated weights on worker 0-0, policy_version 1206965 (0.00094) [2022-07-11 13:15:52,919][26022] Updated weights on worker 0-0, policy_version 1206975 (0.00101) [2022-07-11 13:15:53,951][25689] Fps is (10 sec: 5616.1, 60 sec: 5595.4, 300 sec: 5589.9). Total num frames: 1235947520. Throughput: 0: 5901.0. Samples: 1235954114. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:53,953][25689] Avg episode reward: [(0, '-0.116')] [2022-07-11 13:15:54,683][26022] Updated weights on worker 0-0, policy_version 1206985 (0.00100) [2022-07-11 13:15:56,547][26022] Updated weights on worker 0-0, policy_version 1206995 (0.00083) [2022-07-11 13:15:58,402][26022] Updated weights on worker 0-0, policy_version 1207005 (0.00084) [2022-07-11 13:15:58,967][25689] Fps is (10 sec: 5713.6, 60 sec: 5629.2, 300 sec: 5594.9). Total num frames: 1235977216. Throughput: 0: 5040.5. Samples: 1235971112. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:15:58,968][25689] Avg episode reward: [(0, '-0.019')] [2022-07-11 13:16:00,359][26022] Updated weights on worker 0-0, policy_version 1207015 (0.00085) [2022-07-11 13:16:01,909][26022] Updated weights on worker 0-0, policy_version 1207025 (0.00081) [2022-07-11 13:16:04,082][25689] Fps is (10 sec: 5458.3, 60 sec: 5624.5, 300 sec: 5597.2). Total num frames: 1236002816. Throughput: 0: 5826.1. Samples: 1236003732. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:04,082][25689] Avg episode reward: [(0, '-0.042')] [2022-07-11 13:16:04,123][26022] Updated weights on worker 0-0, policy_version 1207035 (0.00077) [2022-07-11 13:16:05,997][26022] Updated weights on worker 0-0, policy_version 1207045 (0.00088) [2022-07-11 13:16:07,788][26022] Updated weights on worker 0-0, policy_version 1207055 (0.00089) [2022-07-11 13:16:09,118][25689] Fps is (10 sec: 5245.7, 60 sec: 5576.7, 300 sec: 5593.3). Total num frames: 1236030464. Throughput: 0: 5777.6. Samples: 1236036852. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:09,120][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 13:16:09,696][26022] Updated weights on worker 0-0, policy_version 1207065 (0.00090) [2022-07-11 13:16:10,000][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:16:10,009][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001207067_1236036608.pth [2022-07-11 13:16:10,011][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001205103_1234025472.pth [2022-07-11 13:16:11,210][26022] Updated weights on worker 0-0, policy_version 1207075 (0.00086) [2022-07-11 13:16:13,281][26022] Updated weights on worker 0-0, policy_version 1207085 (0.01130) [2022-07-11 13:16:14,132][25689] Fps is (10 sec: 5807.5, 60 sec: 5661.3, 300 sec: 5600.0). Total num frames: 1236061184. Throughput: 0: 4931.7. Samples: 1236053642. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:14,134][25689] Avg episode reward: [(0, '-0.007')] [2022-07-11 13:16:14,963][26022] Updated weights on worker 0-0, policy_version 1207095 (0.00091) [2022-07-11 13:16:16,790][26022] Updated weights on worker 0-0, policy_version 1207105 (0.00086) [2022-07-11 13:16:18,664][26022] Updated weights on worker 0-0, policy_version 1207115 (0.00102) [2022-07-11 13:16:19,174][25689] Fps is (10 sec: 5600.8, 60 sec: 5574.8, 300 sec: 5596.9). Total num frames: 1236086784. Throughput: 0: 5767.6. Samples: 1236087654. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:19,174][25689] Avg episode reward: [(0, '-0.467')] [2022-07-11 13:16:20,442][26022] Updated weights on worker 0-0, policy_version 1207125 (0.00088) [2022-07-11 13:16:22,425][26022] Updated weights on worker 0-0, policy_version 1207135 (0.00086) [2022-07-11 13:16:24,099][26022] Updated weights on worker 0-0, policy_version 1207145 (0.00083) [2022-07-11 13:16:24,251][25689] Fps is (10 sec: 5566.0, 60 sec: 5611.3, 300 sec: 5603.2). Total num frames: 1236117504. Throughput: 0: 5849.2. Samples: 1236121704. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:24,251][25689] Avg episode reward: [(0, '-1.350')] [2022-07-11 13:16:25,960][26022] Updated weights on worker 0-0, policy_version 1207155 (0.00087) [2022-07-11 13:16:27,806][26022] Updated weights on worker 0-0, policy_version 1207165 (0.00094) [2022-07-11 13:16:29,261][25689] Fps is (10 sec: 5786.2, 60 sec: 5633.7, 300 sec: 5600.7). Total num frames: 1236145152. Throughput: 0: 5061.2. Samples: 1236138800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:29,262][25689] Avg episode reward: [(0, '-0.865')] [2022-07-11 13:16:29,524][26022] Updated weights on worker 0-0, policy_version 1207175 (0.00083) [2022-07-11 13:16:31,455][26022] Updated weights on worker 0-0, policy_version 1207185 (0.00080) [2022-07-11 13:16:33,143][26022] Updated weights on worker 0-0, policy_version 1207195 (0.00088) [2022-07-11 13:16:34,280][25689] Fps is (10 sec: 5615.4, 60 sec: 5621.6, 300 sec: 5604.1). Total num frames: 1236173824. Throughput: 0: 5914.2. Samples: 1236172802. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:34,281][25689] Avg episode reward: [(0, '-2.485')] [2022-07-11 13:16:34,886][26022] Updated weights on worker 0-0, policy_version 1207205 (0.00086) [2022-07-11 13:16:36,744][26022] Updated weights on worker 0-0, policy_version 1207215 (0.00094) [2022-07-11 13:16:38,467][26022] Updated weights on worker 0-0, policy_version 1207225 (0.00101) [2022-07-11 13:16:39,317][25689] Fps is (10 sec: 5702.2, 60 sec: 5620.5, 300 sec: 5601.3). Total num frames: 1236202496. Throughput: 0: 5937.9. Samples: 1236207266. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 13:16:39,318][25689] Avg episode reward: [(0, '-1.856')] [2022-07-11 13:16:40,234][26022] Updated weights on worker 0-0, policy_version 1207235 (0.00083) [2022-07-11 13:16:42,064][26022] Updated weights on worker 0-0, policy_version 1207245 (0.00082) [2022-07-11 13:16:44,001][26022] Updated weights on worker 0-0, policy_version 1207255 (0.00088) [2022-07-11 13:16:44,439][25689] Fps is (10 sec: 5645.0, 60 sec: 5617.8, 300 sec: 5607.5). Total num frames: 1236231168. Throughput: 0: 5919.9. Samples: 1236241214. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:16:44,439][25689] Avg episode reward: [(0, '-1.013')] [2022-07-11 13:16:45,749][26022] Updated weights on worker 0-0, policy_version 1207265 (0.00339) [2022-07-11 13:16:47,531][26022] Updated weights on worker 0-0, policy_version 1207275 (0.00085) [2022-07-11 13:16:49,435][26022] Updated weights on worker 0-0, policy_version 1207285 (0.00085) [2022-07-11 13:16:49,465][25689] Fps is (10 sec: 5650.9, 60 sec: 5616.1, 300 sec: 5607.6). Total num frames: 1236259840. Throughput: 0: 5920.2. Samples: 1236258414. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:16:49,466][25689] Avg episode reward: [(0, '-0.316')] [2022-07-11 13:16:51,089][26022] Updated weights on worker 0-0, policy_version 1207295 (0.00097) [2022-07-11 13:16:53,010][26022] Updated weights on worker 0-0, policy_version 1207305 (0.00086) [2022-07-11 13:16:54,493][25689] Fps is (10 sec: 5805.3, 60 sec: 5649.2, 300 sec: 5610.8). Total num frames: 1236289536. Throughput: 0: 5926.5. Samples: 1236292592. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:16:54,494][25689] Avg episode reward: [(0, '-1.081')] [2022-07-11 13:16:54,714][26022] Updated weights on worker 0-0, policy_version 1207315 (0.00082) [2022-07-11 13:16:56,560][26022] Updated weights on worker 0-0, policy_version 1207325 (0.00089) [2022-07-11 13:16:58,291][26022] Updated weights on worker 0-0, policy_version 1207335 (0.00100) [2022-07-11 13:16:59,515][25689] Fps is (10 sec: 5706.2, 60 sec: 5614.9, 300 sec: 5611.1). Total num frames: 1236317184. Throughput: 0: 5914.1. Samples: 1236326714. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:16:59,515][25689] Avg episode reward: [(0, '-0.946')] [2022-07-11 13:17:00,194][26022] Updated weights on worker 0-0, policy_version 1207345 (0.00081) [2022-07-11 13:17:02,308][26022] Updated weights on worker 0-0, policy_version 1207355 (0.00075) [2022-07-11 13:17:04,104][26022] Updated weights on worker 0-0, policy_version 1207365 (0.00088) [2022-07-11 13:17:04,555][25689] Fps is (10 sec: 5597.4, 60 sec: 5672.6, 300 sec: 5617.8). Total num frames: 1236345856. Throughput: 0: 4992.4. Samples: 1236341642. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:04,555][25689] Avg episode reward: [(0, '0.821')] [2022-07-11 13:17:05,968][26022] Updated weights on worker 0-0, policy_version 1207375 (0.00087) [2022-07-11 13:17:07,587][26022] Updated weights on worker 0-0, policy_version 1207385 (0.00083) [2022-07-11 13:17:09,559][25689] Fps is (10 sec: 5505.0, 60 sec: 5658.6, 300 sec: 5610.8). Total num frames: 1236372480. Throughput: 0: 5848.2. Samples: 1236375928. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:09,560][25689] Avg episode reward: [(0, '1.055')] [2022-07-11 13:17:09,563][26022] Updated weights on worker 0-0, policy_version 1207395 (0.00085) [2022-07-11 13:17:11,253][26022] Updated weights on worker 0-0, policy_version 1207405 (0.00088) [2022-07-11 13:17:13,116][26022] Updated weights on worker 0-0, policy_version 1207415 (0.00079) [2022-07-11 13:17:14,578][25689] Fps is (10 sec: 5516.8, 60 sec: 5624.3, 300 sec: 5611.1). Total num frames: 1236401152. Throughput: 0: 5852.5. Samples: 1236410140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:14,578][25689] Avg episode reward: [(0, '0.901')] [2022-07-11 13:17:14,974][26022] Updated weights on worker 0-0, policy_version 1207425 (0.00083) [2022-07-11 13:17:16,566][26022] Updated weights on worker 0-0, policy_version 1207435 (0.00095) [2022-07-11 13:17:18,720][26022] Updated weights on worker 0-0, policy_version 1207445 (0.00085) [2022-07-11 13:17:19,603][25689] Fps is (10 sec: 5811.6, 60 sec: 5693.7, 300 sec: 5618.6). Total num frames: 1236430848. Throughput: 0: 5003.5. Samples: 1236427224. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:19,603][25689] Avg episode reward: [(0, '1.446')] [2022-07-11 13:17:20,142][26022] Updated weights on worker 0-0, policy_version 1207455 (0.00085) [2022-07-11 13:17:22,093][26022] Updated weights on worker 0-0, policy_version 1207465 (0.00096) [2022-07-11 13:17:24,000][26022] Updated weights on worker 0-0, policy_version 1207475 (0.00090) [2022-07-11 13:17:24,665][25689] Fps is (10 sec: 5583.4, 60 sec: 5627.3, 300 sec: 5610.8). Total num frames: 1236457472. Throughput: 0: 5962.6. Samples: 1236461550. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:24,665][25689] Avg episode reward: [(0, '1.390')] [2022-07-11 13:17:25,682][26022] Updated weights on worker 0-0, policy_version 1207485 (0.00087) [2022-07-11 13:17:27,543][26022] Updated weights on worker 0-0, policy_version 1207495 (0.00093) [2022-07-11 13:17:29,344][26022] Updated weights on worker 0-0, policy_version 1207505 (0.00091) [2022-07-11 13:17:29,676][25689] Fps is (10 sec: 5489.2, 60 sec: 5644.1, 300 sec: 5614.5). Total num frames: 1236486144. Throughput: 0: 5933.8. Samples: 1236495298. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:29,677][25689] Avg episode reward: [(0, '1.100')] [2022-07-11 13:17:31,115][26022] Updated weights on worker 0-0, policy_version 1207515 (0.00084) [2022-07-11 13:17:32,987][26022] Updated weights on worker 0-0, policy_version 1207525 (0.00082) [2022-07-11 13:17:34,711][26022] Updated weights on worker 0-0, policy_version 1207535 (0.00088) [2022-07-11 13:17:34,711][25689] Fps is (10 sec: 5809.8, 60 sec: 5659.6, 300 sec: 5617.5). Total num frames: 1236515840. Throughput: 0: 5080.9. Samples: 1236512436. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:34,713][25689] Avg episode reward: [(0, '0.443')] [2022-07-11 13:17:36,614][26022] Updated weights on worker 0-0, policy_version 1207545 (0.00087) [2022-07-11 13:17:38,218][26022] Updated weights on worker 0-0, policy_version 1207555 (0.00087) [2022-07-11 13:17:39,722][25689] Fps is (10 sec: 5708.4, 60 sec: 5645.2, 300 sec: 5615.6). Total num frames: 1236543488. Throughput: 0: 5938.2. Samples: 1236546696. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:39,723][25689] Avg episode reward: [(0, '0.572')] [2022-07-11 13:17:40,099][26022] Updated weights on worker 0-0, policy_version 1207565 (0.00090) [2022-07-11 13:17:41,753][26022] Updated weights on worker 0-0, policy_version 1207575 (0.00089) [2022-07-11 13:17:43,592][26022] Updated weights on worker 0-0, policy_version 1207585 (0.00086) [2022-07-11 13:17:44,828][25689] Fps is (10 sec: 5769.4, 60 sec: 5680.5, 300 sec: 5624.3). Total num frames: 1236574208. Throughput: 0: 5936.2. Samples: 1236581242. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:44,830][25689] Avg episode reward: [(0, '-0.690')] [2022-07-11 13:17:45,300][26022] Updated weights on worker 0-0, policy_version 1207595 (0.00097) [2022-07-11 13:17:47,140][26022] Updated weights on worker 0-0, policy_version 1207605 (0.00094) [2022-07-11 13:17:49,183][26022] Updated weights on worker 0-0, policy_version 1207615 (0.00079) [2022-07-11 13:17:49,859][25689] Fps is (10 sec: 5656.9, 60 sec: 5646.2, 300 sec: 5616.9). Total num frames: 1236600832. Throughput: 0: 5115.0. Samples: 1236598532. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:49,860][25689] Avg episode reward: [(0, '0.139')] [2022-07-11 13:17:50,685][26022] Updated weights on worker 0-0, policy_version 1207625 (0.00084) [2022-07-11 13:17:52,744][26022] Updated weights on worker 0-0, policy_version 1207635 (0.00079) [2022-07-11 13:17:54,085][26022] Updated weights on worker 0-0, policy_version 1207645 (0.00096) [2022-07-11 13:17:54,890][25689] Fps is (10 sec: 5699.2, 60 sec: 5662.8, 300 sec: 5627.9). Total num frames: 1236631552. Throughput: 0: 5983.8. Samples: 1236633180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:54,890][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 13:17:56,310][26022] Updated weights on worker 0-0, policy_version 1207655 (0.00082) [2022-07-11 13:17:57,827][26022] Updated weights on worker 0-0, policy_version 1207665 (0.00096) [2022-07-11 13:17:59,873][26022] Updated weights on worker 0-0, policy_version 1207675 (0.00090) [2022-07-11 13:17:59,917][25689] Fps is (10 sec: 5803.3, 60 sec: 5662.3, 300 sec: 5624.5). Total num frames: 1236659200. Throughput: 0: 5968.2. Samples: 1236667222. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:17:59,917][25689] Avg episode reward: [(0, '0.701')] [2022-07-11 13:18:01,827][26022] Updated weights on worker 0-0, policy_version 1207685 (0.00099) [2022-07-11 13:18:03,695][26022] Updated weights on worker 0-0, policy_version 1207695 (0.00082) [2022-07-11 13:18:04,984][25689] Fps is (10 sec: 5376.5, 60 sec: 5625.9, 300 sec: 5626.9). Total num frames: 1236685824. Throughput: 0: 4997.6. Samples: 1236681974. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:04,985][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 13:18:05,524][26022] Updated weights on worker 0-0, policy_version 1207705 (0.00097) [2022-07-11 13:18:07,275][26022] Updated weights on worker 0-0, policy_version 1207715 (0.00087) [2022-07-11 13:18:09,042][26022] Updated weights on worker 0-0, policy_version 1207725 (0.00090) [2022-07-11 13:18:10,019][25689] Fps is (10 sec: 5575.0, 60 sec: 5673.9, 300 sec: 5622.9). Total num frames: 1236715520. Throughput: 0: 5831.3. Samples: 1236716092. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:10,019][25689] Avg episode reward: [(0, '0.449')] [2022-07-11 13:18:10,051][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:18:10,074][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001207731_1236716544.pth [2022-07-11 13:18:10,074][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001205752_1234690048.pth [2022-07-11 13:18:11,175][26022] Updated weights on worker 0-0, policy_version 1207735 (0.00079) [2022-07-11 13:18:12,665][26022] Updated weights on worker 0-0, policy_version 1207745 (0.00086) [2022-07-11 13:18:14,702][26022] Updated weights on worker 0-0, policy_version 1207755 (0.00083) [2022-07-11 13:18:15,096][25689] Fps is (10 sec: 5671.2, 60 sec: 5651.5, 300 sec: 5625.0). Total num frames: 1236743168. Throughput: 0: 5814.1. Samples: 1236750658. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:15,096][25689] Avg episode reward: [(0, '1.411')] [2022-07-11 13:18:16,217][26022] Updated weights on worker 0-0, policy_version 1207765 (0.00106) [2022-07-11 13:18:18,251][26022] Updated weights on worker 0-0, policy_version 1207775 (0.00085) [2022-07-11 13:18:19,772][26022] Updated weights on worker 0-0, policy_version 1207785 (0.00082) [2022-07-11 13:18:20,132][25689] Fps is (10 sec: 5771.4, 60 sec: 5667.3, 300 sec: 5635.8). Total num frames: 1236773888. Throughput: 0: 4987.7. Samples: 1236768052. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:20,133][25689] Avg episode reward: [(0, '-0.022')] [2022-07-11 13:18:21,611][26022] Updated weights on worker 0-0, policy_version 1207795 (0.00084) [2022-07-11 13:18:23,438][26022] Updated weights on worker 0-0, policy_version 1207805 (0.00406) [2022-07-11 13:18:25,249][25689] Fps is (10 sec: 5748.7, 60 sec: 5679.1, 300 sec: 5630.8). Total num frames: 1236801536. Throughput: 0: 5947.8. Samples: 1236802504. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:25,249][25689] Avg episode reward: [(0, '0.203')] [2022-07-11 13:18:25,407][26022] Updated weights on worker 0-0, policy_version 1207815 (0.00098) [2022-07-11 13:18:27,123][26022] Updated weights on worker 0-0, policy_version 1207825 (0.00541) [2022-07-11 13:18:28,940][26022] Updated weights on worker 0-0, policy_version 1207835 (0.00085) [2022-07-11 13:18:30,313][25689] Fps is (10 sec: 5431.4, 60 sec: 5657.3, 300 sec: 5629.7). Total num frames: 1236829184. Throughput: 0: 5925.4. Samples: 1236836344. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:30,314][25689] Avg episode reward: [(0, '-0.337')] [2022-07-11 13:18:30,819][26022] Updated weights on worker 0-0, policy_version 1207845 (0.00089) [2022-07-11 13:18:32,704][26022] Updated weights on worker 0-0, policy_version 1207855 (0.00081) [2022-07-11 13:18:34,246][26022] Updated weights on worker 0-0, policy_version 1207865 (0.00090) [2022-07-11 13:18:35,316][25689] Fps is (10 sec: 5696.0, 60 sec: 5660.3, 300 sec: 5630.4). Total num frames: 1236858880. Throughput: 0: 5069.1. Samples: 1236853160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:35,317][25689] Avg episode reward: [(0, '-0.569')] [2022-07-11 13:18:36,237][26022] Updated weights on worker 0-0, policy_version 1207875 (0.00083) [2022-07-11 13:18:37,900][26022] Updated weights on worker 0-0, policy_version 1207885 (0.00093) [2022-07-11 13:18:39,992][26022] Updated weights on worker 0-0, policy_version 1207895 (0.00084) [2022-07-11 13:18:40,325][25689] Fps is (10 sec: 5727.9, 60 sec: 5660.5, 300 sec: 5631.8). Total num frames: 1236886528. Throughput: 0: 5907.4. Samples: 1236887336. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:40,325][25689] Avg episode reward: [(0, '-0.319')] [2022-07-11 13:18:41,399][26022] Updated weights on worker 0-0, policy_version 1207905 (0.00097) [2022-07-11 13:18:43,494][26022] Updated weights on worker 0-0, policy_version 1207915 (0.00082) [2022-07-11 13:18:44,863][26022] Updated weights on worker 0-0, policy_version 1207925 (0.00089) [2022-07-11 13:18:45,407][25689] Fps is (10 sec: 5784.6, 60 sec: 5662.7, 300 sec: 5640.7). Total num frames: 1236917248. Throughput: 0: 5905.4. Samples: 1236921544. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:45,408][25689] Avg episode reward: [(0, '-0.295')] [2022-07-11 13:18:47,243][26022] Updated weights on worker 0-0, policy_version 1207935 (0.00092) [2022-07-11 13:18:48,562][26022] Updated weights on worker 0-0, policy_version 1207945 (0.00077) [2022-07-11 13:18:50,500][25689] Fps is (10 sec: 5635.5, 60 sec: 5656.9, 300 sec: 5632.1). Total num frames: 1236943872. Throughput: 0: 5055.1. Samples: 1236938390. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:50,502][25689] Avg episode reward: [(0, '-0.189')] [2022-07-11 13:18:50,821][26022] Updated weights on worker 0-0, policy_version 1207955 (0.00084) [2022-07-11 13:18:52,266][26022] Updated weights on worker 0-0, policy_version 1207965 (0.00086) [2022-07-11 13:18:54,458][26022] Updated weights on worker 0-0, policy_version 1207975 (0.00086) [2022-07-11 13:18:55,554][25689] Fps is (10 sec: 5651.3, 60 sec: 5654.8, 300 sec: 5638.3). Total num frames: 1236974592. Throughput: 0: 5888.0. Samples: 1236972316. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:18:55,554][25689] Avg episode reward: [(0, '-0.499')] [2022-07-11 13:18:55,928][26022] Updated weights on worker 0-0, policy_version 1207985 (0.00086) [2022-07-11 13:18:57,959][26022] Updated weights on worker 0-0, policy_version 1207995 (0.00087) [2022-07-11 13:18:59,375][26022] Updated weights on worker 0-0, policy_version 1208005 (0.00090) [2022-07-11 13:19:00,603][25689] Fps is (10 sec: 5676.1, 60 sec: 5635.8, 300 sec: 5641.8). Total num frames: 1237001216. Throughput: 0: 5881.6. Samples: 1237006604. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:00,603][25689] Avg episode reward: [(0, '-0.186')] [2022-07-11 13:19:02,110][26022] Updated weights on worker 0-0, policy_version 1208015 (0.00098) [2022-07-11 13:19:03,329][26022] Updated weights on worker 0-0, policy_version 1208025 (0.00086) [2022-07-11 13:19:05,663][25689] Fps is (10 sec: 5267.3, 60 sec: 5636.5, 300 sec: 5630.5). Total num frames: 1237027840. Throughput: 0: 4916.1. Samples: 1237021126. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:05,664][25689] Avg episode reward: [(0, '0.195')] [2022-07-11 13:19:05,678][26022] Updated weights on worker 0-0, policy_version 1208035 (0.00087) [2022-07-11 13:19:07,175][26022] Updated weights on worker 0-0, policy_version 1208045 (0.00085) [2022-07-11 13:19:09,094][26022] Updated weights on worker 0-0, policy_version 1208055 (0.00084) [2022-07-11 13:19:10,758][25689] Fps is (10 sec: 5545.8, 60 sec: 5630.9, 300 sec: 5635.7). Total num frames: 1237057536. Throughput: 0: 5769.4. Samples: 1237055268. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:10,759][25689] Avg episode reward: [(0, '-0.010')] [2022-07-11 13:19:10,967][26022] Updated weights on worker 0-0, policy_version 1208065 (0.00085) [2022-07-11 13:19:12,549][26022] Updated weights on worker 0-0, policy_version 1208075 (0.00088) [2022-07-11 13:19:14,669][26022] Updated weights on worker 0-0, policy_version 1208085 (0.00086) [2022-07-11 13:19:15,767][25689] Fps is (10 sec: 5877.9, 60 sec: 5670.9, 300 sec: 5639.5). Total num frames: 1237087232. Throughput: 0: 5803.3. Samples: 1237089622. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:15,768][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 13:19:16,267][26022] Updated weights on worker 0-0, policy_version 1208095 (0.00086) [2022-07-11 13:19:18,095][26022] Updated weights on worker 0-0, policy_version 1208105 (0.00085) [2022-07-11 13:19:19,928][26022] Updated weights on worker 0-0, policy_version 1208115 (0.00088) [2022-07-11 13:19:20,829][25689] Fps is (10 sec: 5592.5, 60 sec: 5601.1, 300 sec: 5633.2). Total num frames: 1237113856. Throughput: 0: 5782.5. Samples: 1237123562. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:20,829][25689] Avg episode reward: [(0, '0.255')] [2022-07-11 13:19:21,582][26022] Updated weights on worker 0-0, policy_version 1208125 (0.00088) [2022-07-11 13:19:23,685][26022] Updated weights on worker 0-0, policy_version 1208135 (0.00085) [2022-07-11 13:19:25,228][26022] Updated weights on worker 0-0, policy_version 1208145 (0.00093) [2022-07-11 13:19:25,909][25689] Fps is (10 sec: 5553.4, 60 sec: 5638.3, 300 sec: 5637.0). Total num frames: 1237143552. Throughput: 0: 5894.1. Samples: 1237140458. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:25,909][25689] Avg episode reward: [(0, '-0.366')] [2022-07-11 13:19:27,486][26022] Updated weights on worker 0-0, policy_version 1208155 (0.00084) [2022-07-11 13:19:28,889][26022] Updated weights on worker 0-0, policy_version 1208165 (0.00099) [2022-07-11 13:19:30,912][25689] Fps is (10 sec: 5585.8, 60 sec: 5627.1, 300 sec: 5634.4). Total num frames: 1237170176. Throughput: 0: 5900.0. Samples: 1237174172. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:30,912][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 13:19:31,095][26022] Updated weights on worker 0-0, policy_version 1208175 (0.00092) [2022-07-11 13:19:32,583][26022] Updated weights on worker 0-0, policy_version 1208185 (0.00087) [2022-07-11 13:19:34,678][26022] Updated weights on worker 0-0, policy_version 1208195 (0.00087) [2022-07-11 13:19:35,993][25689] Fps is (10 sec: 5584.8, 60 sec: 5619.8, 300 sec: 5633.0). Total num frames: 1237199872. Throughput: 0: 5844.9. Samples: 1237207842. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:35,994][25689] Avg episode reward: [(0, '-0.310')] [2022-07-11 13:19:36,282][26022] Updated weights on worker 0-0, policy_version 1208205 (0.00085) [2022-07-11 13:19:38,112][26022] Updated weights on worker 0-0, policy_version 1208215 (0.00086) [2022-07-11 13:19:39,979][26022] Updated weights on worker 0-0, policy_version 1208225 (0.00092) [2022-07-11 13:19:41,011][25689] Fps is (10 sec: 5779.5, 60 sec: 5635.8, 300 sec: 5637.5). Total num frames: 1237228544. Throughput: 0: 5029.0. Samples: 1237225056. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:41,011][25689] Avg episode reward: [(0, '0.397')] [2022-07-11 13:19:41,819][26022] Updated weights on worker 0-0, policy_version 1208235 (0.00386) [2022-07-11 13:19:43,490][26022] Updated weights on worker 0-0, policy_version 1208245 (0.00097) [2022-07-11 13:19:45,482][26022] Updated weights on worker 0-0, policy_version 1208255 (0.00069) [2022-07-11 13:19:46,091][25689] Fps is (10 sec: 5577.5, 60 sec: 5585.3, 300 sec: 5633.7). Total num frames: 1237256192. Throughput: 0: 5874.0. Samples: 1237259010. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:46,092][25689] Avg episode reward: [(0, '-0.135')] [2022-07-11 13:19:47,183][26022] Updated weights on worker 0-0, policy_version 1208265 (0.00899) [2022-07-11 13:19:49,235][26022] Updated weights on worker 0-0, policy_version 1208275 (0.00085) [2022-07-11 13:19:50,718][26022] Updated weights on worker 0-0, policy_version 1208285 (0.00088) [2022-07-11 13:19:51,101][25689] Fps is (10 sec: 5683.2, 60 sec: 5643.7, 300 sec: 5633.8). Total num frames: 1237285888. Throughput: 0: 5884.0. Samples: 1237292966. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:51,115][25689] Avg episode reward: [(0, '-0.013')] [2022-07-11 13:19:52,659][26022] Updated weights on worker 0-0, policy_version 1208295 (0.00082) [2022-07-11 13:19:54,372][26022] Updated weights on worker 0-0, policy_version 1208305 (0.00087) [2022-07-11 13:19:56,152][25689] Fps is (10 sec: 5699.4, 60 sec: 5593.2, 300 sec: 5633.1). Total num frames: 1237313536. Throughput: 0: 5078.7. Samples: 1237310224. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:19:56,153][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 13:19:56,317][26022] Updated weights on worker 0-0, policy_version 1208315 (0.00088) [2022-07-11 13:19:57,867][26022] Updated weights on worker 0-0, policy_version 1208325 (0.00092) [2022-07-11 13:19:59,891][26022] Updated weights on worker 0-0, policy_version 1208335 (0.00092) [2022-07-11 13:20:01,235][25689] Fps is (10 sec: 5557.7, 60 sec: 5624.0, 300 sec: 5643.1). Total num frames: 1237342208. Throughput: 0: 5901.3. Samples: 1237344404. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:01,237][25689] Avg episode reward: [(0, '0.998')] [2022-07-11 13:20:01,969][26022] Updated weights on worker 0-0, policy_version 1208345 (0.00092) [2022-07-11 13:20:03,939][26022] Updated weights on worker 0-0, policy_version 1208355 (0.00084) [2022-07-11 13:20:05,633][26022] Updated weights on worker 0-0, policy_version 1208365 (0.00085) [2022-07-11 13:20:06,396][25689] Fps is (10 sec: 5498.2, 60 sec: 5631.5, 300 sec: 5631.0). Total num frames: 1237369856. Throughput: 0: 5753.0. Samples: 1237375824. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:06,397][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 13:20:07,623][26022] Updated weights on worker 0-0, policy_version 1208375 (0.00082) [2022-07-11 13:20:09,168][26022] Updated weights on worker 0-0, policy_version 1208385 (0.00087) [2022-07-11 13:20:10,107][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:20:10,118][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001208391_1237392384.pth [2022-07-11 13:20:10,119][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001206409_1235362816.pth [2022-07-11 13:20:11,320][26022] Updated weights on worker 0-0, policy_version 1208395 (0.00081) [2022-07-11 13:20:11,416][25689] Fps is (10 sec: 5431.1, 60 sec: 5604.7, 300 sec: 5637.8). Total num frames: 1237397504. Throughput: 0: 4932.1. Samples: 1237393162. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:11,418][25689] Avg episode reward: [(0, '0.882')] [2022-07-11 13:20:12,695][26022] Updated weights on worker 0-0, policy_version 1208405 (0.00084) [2022-07-11 13:20:14,796][26022] Updated weights on worker 0-0, policy_version 1208415 (0.00080) [2022-07-11 13:20:16,239][26022] Updated weights on worker 0-0, policy_version 1208425 (0.00084) [2022-07-11 13:20:16,464][25689] Fps is (10 sec: 5695.5, 60 sec: 5601.1, 300 sec: 5633.8). Total num frames: 1237427200. Throughput: 0: 5771.8. Samples: 1237427460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:16,466][25689] Avg episode reward: [(0, '1.052')] [2022-07-11 13:20:18,400][26022] Updated weights on worker 0-0, policy_version 1208435 (0.00963) [2022-07-11 13:20:19,908][26022] Updated weights on worker 0-0, policy_version 1208445 (0.00077) [2022-07-11 13:20:21,474][25689] Fps is (10 sec: 5701.5, 60 sec: 5622.8, 300 sec: 5632.2). Total num frames: 1237454848. Throughput: 0: 5794.5. Samples: 1237461678. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:21,475][25689] Avg episode reward: [(0, '-0.274')] [2022-07-11 13:20:21,856][26022] Updated weights on worker 0-0, policy_version 1208455 (0.00089) [2022-07-11 13:20:23,691][26022] Updated weights on worker 0-0, policy_version 1208465 (0.00086) [2022-07-11 13:20:25,457][26022] Updated weights on worker 0-0, policy_version 1208475 (0.00078) [2022-07-11 13:20:26,528][25689] Fps is (10 sec: 5596.4, 60 sec: 5608.3, 300 sec: 5639.3). Total num frames: 1237483520. Throughput: 0: 5109.0. Samples: 1237478678. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:26,528][25689] Avg episode reward: [(0, '-0.312')] [2022-07-11 13:20:27,193][26022] Updated weights on worker 0-0, policy_version 1208485 (0.00086) [2022-07-11 13:20:28,934][26022] Updated weights on worker 0-0, policy_version 1208495 (0.00084) [2022-07-11 13:20:31,037][26022] Updated weights on worker 0-0, policy_version 1208505 (0.00085) [2022-07-11 13:20:31,536][25689] Fps is (10 sec: 5698.7, 60 sec: 5641.5, 300 sec: 5637.1). Total num frames: 1237512192. Throughput: 0: 5932.0. Samples: 1237512516. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:31,537][25689] Avg episode reward: [(0, '-0.084')] [2022-07-11 13:20:32,724][26022] Updated weights on worker 0-0, policy_version 1208515 (0.00087) [2022-07-11 13:20:34,590][26022] Updated weights on worker 0-0, policy_version 1208525 (0.01229) [2022-07-11 13:20:36,423][26022] Updated weights on worker 0-0, policy_version 1208535 (0.00087) [2022-07-11 13:20:36,546][25689] Fps is (10 sec: 5621.5, 60 sec: 5614.4, 300 sec: 5633.9). Total num frames: 1237539840. Throughput: 0: 5908.0. Samples: 1237546106. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 13:20:36,547][25689] Avg episode reward: [(0, '-0.095')] [2022-07-11 13:20:38,163][26022] Updated weights on worker 0-0, policy_version 1208545 (0.00086) [2022-07-11 13:20:40,147][26022] Updated weights on worker 0-0, policy_version 1208555 (0.00085) [2022-07-11 13:20:41,563][25689] Fps is (10 sec: 5719.4, 60 sec: 5631.4, 300 sec: 5638.8). Total num frames: 1237569536. Throughput: 0: 5050.0. Samples: 1237563126. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:20:41,563][25689] Avg episode reward: [(0, '-0.304')] [2022-07-11 13:20:41,612][26022] Updated weights on worker 0-0, policy_version 1208565 (0.00085) [2022-07-11 13:20:43,666][26022] Updated weights on worker 0-0, policy_version 1208575 (0.00083) [2022-07-11 13:20:45,222][26022] Updated weights on worker 0-0, policy_version 1208585 (0.00084) [2022-07-11 13:20:46,604][25689] Fps is (10 sec: 5701.5, 60 sec: 5635.0, 300 sec: 5634.7). Total num frames: 1237597184. Throughput: 0: 5916.3. Samples: 1237597456. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:20:46,605][25689] Avg episode reward: [(0, '0.777')] [2022-07-11 13:20:47,195][26022] Updated weights on worker 0-0, policy_version 1208595 (0.00085) [2022-07-11 13:20:48,947][26022] Updated weights on worker 0-0, policy_version 1208605 (0.00084) [2022-07-11 13:20:50,816][26022] Updated weights on worker 0-0, policy_version 1208615 (0.00074) [2022-07-11 13:20:51,611][25689] Fps is (10 sec: 5706.6, 60 sec: 5635.3, 300 sec: 5641.9). Total num frames: 1237626880. Throughput: 0: 5935.2. Samples: 1237631664. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:20:51,612][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 13:20:52,541][26022] Updated weights on worker 0-0, policy_version 1208625 (0.00092) [2022-07-11 13:20:54,540][26022] Updated weights on worker 0-0, policy_version 1208635 (0.00080) [2022-07-11 13:20:56,103][26022] Updated weights on worker 0-0, policy_version 1208645 (0.00083) [2022-07-11 13:20:56,638][25689] Fps is (10 sec: 5612.9, 60 sec: 5620.6, 300 sec: 5631.3). Total num frames: 1237653504. Throughput: 0: 5105.4. Samples: 1237648682. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:20:56,639][25689] Avg episode reward: [(0, '0.582')] [2022-07-11 13:20:58,011][26022] Updated weights on worker 0-0, policy_version 1208655 (0.00089) [2022-07-11 13:20:59,919][26022] Updated weights on worker 0-0, policy_version 1208665 (0.00084) [2022-07-11 13:21:01,451][26022] Updated weights on worker 0-0, policy_version 1208675 (0.00085) [2022-07-11 13:21:01,663][25689] Fps is (10 sec: 5602.8, 60 sec: 5642.9, 300 sec: 5646.8). Total num frames: 1237683200. Throughput: 0: 5950.0. Samples: 1237682726. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:01,664][25689] Avg episode reward: [(0, '0.247')] [2022-07-11 13:21:03,958][26022] Updated weights on worker 0-0, policy_version 1208685 (0.00083) [2022-07-11 13:21:05,459][26022] Updated weights on worker 0-0, policy_version 1208695 (0.00085) [2022-07-11 13:21:06,752][25689] Fps is (10 sec: 5568.6, 60 sec: 5632.7, 300 sec: 5642.4). Total num frames: 1237709824. Throughput: 0: 5819.8. Samples: 1237714712. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:06,753][25689] Avg episode reward: [(0, '-0.007')] [2022-07-11 13:21:07,455][26022] Updated weights on worker 0-0, policy_version 1208705 (0.00084) [2022-07-11 13:21:09,231][26022] Updated weights on worker 0-0, policy_version 1208715 (0.00096) [2022-07-11 13:21:11,057][26022] Updated weights on worker 0-0, policy_version 1208725 (0.00084) [2022-07-11 13:21:11,801][25689] Fps is (10 sec: 5353.7, 60 sec: 5630.0, 300 sec: 5631.4). Total num frames: 1237737472. Throughput: 0: 4955.5. Samples: 1237731712. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:11,801][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 13:21:12,871][26022] Updated weights on worker 0-0, policy_version 1208735 (0.00614) [2022-07-11 13:21:14,554][26022] Updated weights on worker 0-0, policy_version 1208745 (0.00094) [2022-07-11 13:21:16,534][26022] Updated weights on worker 0-0, policy_version 1208755 (0.00097) [2022-07-11 13:21:16,855][25689] Fps is (10 sec: 5676.0, 60 sec: 5629.5, 300 sec: 5644.9). Total num frames: 1237767168. Throughput: 0: 5798.1. Samples: 1237765900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:16,855][25689] Avg episode reward: [(0, '0.029')] [2022-07-11 13:21:18,266][26022] Updated weights on worker 0-0, policy_version 1208765 (0.00085) [2022-07-11 13:21:20,142][26022] Updated weights on worker 0-0, policy_version 1208775 (0.00078) [2022-07-11 13:21:21,743][26022] Updated weights on worker 0-0, policy_version 1208785 (0.00094) [2022-07-11 13:21:21,950][25689] Fps is (10 sec: 5750.8, 60 sec: 5638.4, 300 sec: 5637.7). Total num frames: 1237795840. Throughput: 0: 5776.9. Samples: 1237799920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:21,951][25689] Avg episode reward: [(0, '-0.212')] [2022-07-11 13:21:23,703][26022] Updated weights on worker 0-0, policy_version 1208795 (0.00088) [2022-07-11 13:21:25,316][26022] Updated weights on worker 0-0, policy_version 1208805 (0.00086) [2022-07-11 13:21:27,051][25689] Fps is (10 sec: 5623.9, 60 sec: 5634.0, 300 sec: 5639.4). Total num frames: 1237824512. Throughput: 0: 5869.9. Samples: 1237833868. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:27,052][25689] Avg episode reward: [(0, '0.884')] [2022-07-11 13:21:27,399][26022] Updated weights on worker 0-0, policy_version 1208815 (0.00086) [2022-07-11 13:21:28,911][26022] Updated weights on worker 0-0, policy_version 1208825 (0.00083) [2022-07-11 13:21:31,088][26022] Updated weights on worker 0-0, policy_version 1208835 (0.00094) [2022-07-11 13:21:32,067][25689] Fps is (10 sec: 5769.6, 60 sec: 5650.3, 300 sec: 5642.9). Total num frames: 1237854208. Throughput: 0: 5883.0. Samples: 1237850938. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:32,067][25689] Avg episode reward: [(0, '1.880')] [2022-07-11 13:21:32,476][26022] Updated weights on worker 0-0, policy_version 1208845 (0.00083) [2022-07-11 13:21:34,530][26022] Updated weights on worker 0-0, policy_version 1208855 (0.00084) [2022-07-11 13:21:36,273][26022] Updated weights on worker 0-0, policy_version 1208865 (0.00087) [2022-07-11 13:21:37,071][25689] Fps is (10 sec: 5621.1, 60 sec: 5634.0, 300 sec: 5636.7). Total num frames: 1237880832. Throughput: 0: 5905.1. Samples: 1237885278. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:37,073][25689] Avg episode reward: [(0, '1.770')] [2022-07-11 13:21:38,177][26022] Updated weights on worker 0-0, policy_version 1208875 (0.00086) [2022-07-11 13:21:39,720][26022] Updated weights on worker 0-0, policy_version 1208885 (0.00086) [2022-07-11 13:21:41,809][26022] Updated weights on worker 0-0, policy_version 1208895 (0.00089) [2022-07-11 13:21:42,081][25689] Fps is (10 sec: 5521.9, 60 sec: 5617.6, 300 sec: 5638.8). Total num frames: 1237909504. Throughput: 0: 5931.0. Samples: 1237919314. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:42,081][25689] Avg episode reward: [(0, '1.473')] [2022-07-11 13:21:43,246][26022] Updated weights on worker 0-0, policy_version 1208905 (0.00085) [2022-07-11 13:21:45,395][26022] Updated weights on worker 0-0, policy_version 1208915 (0.00094) [2022-07-11 13:21:46,928][26022] Updated weights on worker 0-0, policy_version 1208925 (0.00088) [2022-07-11 13:21:47,134][25689] Fps is (10 sec: 5902.0, 60 sec: 5667.3, 300 sec: 5645.2). Total num frames: 1237940224. Throughput: 0: 5108.5. Samples: 1237936458. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:47,134][25689] Avg episode reward: [(0, '1.224')] [2022-07-11 13:21:48,919][26022] Updated weights on worker 0-0, policy_version 1208935 (0.00087) [2022-07-11 13:21:50,603][26022] Updated weights on worker 0-0, policy_version 1208945 (0.00083) [2022-07-11 13:21:52,137][25689] Fps is (10 sec: 5702.1, 60 sec: 5616.9, 300 sec: 5635.3). Total num frames: 1237966848. Throughput: 0: 5963.5. Samples: 1237970630. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:52,138][25689] Avg episode reward: [(0, '0.191')] [2022-07-11 13:21:52,460][26022] Updated weights on worker 0-0, policy_version 1208955 (0.00087) [2022-07-11 13:21:54,359][26022] Updated weights on worker 0-0, policy_version 1208965 (0.00080) [2022-07-11 13:21:56,034][26022] Updated weights on worker 0-0, policy_version 1208975 (0.00091) [2022-07-11 13:21:57,139][25689] Fps is (10 sec: 5526.6, 60 sec: 5653.1, 300 sec: 5639.1). Total num frames: 1237995520. Throughput: 0: 5960.8. Samples: 1238004902. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:21:57,140][25689] Avg episode reward: [(0, '0.022')] [2022-07-11 13:21:58,021][26022] Updated weights on worker 0-0, policy_version 1208985 (0.00080) [2022-07-11 13:21:59,709][26022] Updated weights on worker 0-0, policy_version 1208995 (0.00086) [2022-07-11 13:22:01,485][26022] Updated weights on worker 0-0, policy_version 1209005 (0.00085) [2022-07-11 13:22:02,199][25689] Fps is (10 sec: 5699.4, 60 sec: 5632.9, 300 sec: 5638.7). Total num frames: 1238024192. Throughput: 0: 5107.1. Samples: 1238022058. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:02,199][25689] Avg episode reward: [(0, '-0.047')] [2022-07-11 13:22:03,744][26022] Updated weights on worker 0-0, policy_version 1209015 (0.00093) [2022-07-11 13:22:05,452][26022] Updated weights on worker 0-0, policy_version 1209025 (0.00087) [2022-07-11 13:22:07,295][25689] Fps is (10 sec: 5444.7, 60 sec: 5632.2, 300 sec: 5637.0). Total num frames: 1238050816. Throughput: 0: 5836.5. Samples: 1238054130. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:07,296][25689] Avg episode reward: [(0, '0.105')] [2022-07-11 13:22:07,382][26022] Updated weights on worker 0-0, policy_version 1209035 (0.00091) [2022-07-11 13:22:09,169][26022] Updated weights on worker 0-0, policy_version 1209045 (0.00097) [2022-07-11 13:22:10,128][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:22:10,142][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001209050_1238067200.pth [2022-07-11 13:22:10,142][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001207067_1236036608.pth [2022-07-11 13:22:10,839][26022] Updated weights on worker 0-0, policy_version 1209055 (0.00083) [2022-07-11 13:22:12,376][25689] Fps is (10 sec: 5634.8, 60 sec: 5680.0, 300 sec: 5642.7). Total num frames: 1238081536. Throughput: 0: 5811.0. Samples: 1238088234. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:12,376][25689] Avg episode reward: [(0, '0.505')] [2022-07-11 13:22:12,650][26022] Updated weights on worker 0-0, policy_version 1209065 (0.00089) [2022-07-11 13:22:14,499][26022] Updated weights on worker 0-0, policy_version 1209075 (0.00093) [2022-07-11 13:22:16,371][26022] Updated weights on worker 0-0, policy_version 1209085 (0.00089) [2022-07-11 13:22:17,396][25689] Fps is (10 sec: 5676.9, 60 sec: 5632.4, 300 sec: 5632.5). Total num frames: 1238108160. Throughput: 0: 4949.2. Samples: 1238105158. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:17,397][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 13:22:18,000][26022] Updated weights on worker 0-0, policy_version 1209095 (0.00096) [2022-07-11 13:22:19,987][26022] Updated weights on worker 0-0, policy_version 1209105 (0.00084) [2022-07-11 13:22:21,523][26022] Updated weights on worker 0-0, policy_version 1209115 (0.00084) [2022-07-11 13:22:22,470][25689] Fps is (10 sec: 5681.0, 60 sec: 5668.3, 300 sec: 5646.0). Total num frames: 1238138880. Throughput: 0: 5802.8. Samples: 1238139686. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:22,470][25689] Avg episode reward: [(0, '1.778')] [2022-07-11 13:22:23,559][26022] Updated weights on worker 0-0, policy_version 1209125 (0.00086) [2022-07-11 13:22:25,014][26022] Updated weights on worker 0-0, policy_version 1209135 (0.00092) [2022-07-11 13:22:27,095][26022] Updated weights on worker 0-0, policy_version 1209145 (0.00085) [2022-07-11 13:22:27,573][25689] Fps is (10 sec: 5635.0, 60 sec: 5634.3, 300 sec: 5637.4). Total num frames: 1238165504. Throughput: 0: 5907.6. Samples: 1238173922. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:27,573][25689] Avg episode reward: [(0, '1.858')] [2022-07-11 13:22:28,703][26022] Updated weights on worker 0-0, policy_version 1209155 (0.00091) [2022-07-11 13:22:30,684][26022] Updated weights on worker 0-0, policy_version 1209165 (0.00087) [2022-07-11 13:22:32,361][26022] Updated weights on worker 0-0, policy_version 1209175 (0.00089) [2022-07-11 13:22:32,593][25689] Fps is (10 sec: 5664.2, 60 sec: 5650.7, 300 sec: 5641.1). Total num frames: 1238196224. Throughput: 0: 5084.2. Samples: 1238191026. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:32,594][25689] Avg episode reward: [(0, '1.423')] [2022-07-11 13:22:34,447][26022] Updated weights on worker 0-0, policy_version 1209185 (0.00090) [2022-07-11 13:22:35,934][26022] Updated weights on worker 0-0, policy_version 1209195 (0.00084) [2022-07-11 13:22:37,621][25689] Fps is (10 sec: 5808.8, 60 sec: 5665.4, 300 sec: 5640.8). Total num frames: 1238223872. Throughput: 0: 5939.7. Samples: 1238225286. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:37,621][25689] Avg episode reward: [(0, '1.737')] [2022-07-11 13:22:38,025][26022] Updated weights on worker 0-0, policy_version 1209205 (0.00094) [2022-07-11 13:22:39,581][26022] Updated weights on worker 0-0, policy_version 1209215 (0.00081) [2022-07-11 13:22:41,571][26022] Updated weights on worker 0-0, policy_version 1209225 (0.00087) [2022-07-11 13:22:42,622][25689] Fps is (10 sec: 5718.2, 60 sec: 5683.1, 300 sec: 5639.3). Total num frames: 1238253568. Throughput: 0: 5927.9. Samples: 1238259146. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:42,622][25689] Avg episode reward: [(0, '1.833')] [2022-07-11 13:22:43,293][26022] Updated weights on worker 0-0, policy_version 1209235 (0.00084) [2022-07-11 13:22:45,139][26022] Updated weights on worker 0-0, policy_version 1209245 (0.00088) [2022-07-11 13:22:46,901][26022] Updated weights on worker 0-0, policy_version 1209255 (0.00090) [2022-07-11 13:22:47,745][25689] Fps is (10 sec: 5664.2, 60 sec: 5625.9, 300 sec: 5641.0). Total num frames: 1238281216. Throughput: 0: 5065.5. Samples: 1238276104. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:47,746][25689] Avg episode reward: [(0, '1.017')] [2022-07-11 13:22:48,955][26022] Updated weights on worker 0-0, policy_version 1209265 (0.00087) [2022-07-11 13:22:50,415][26022] Updated weights on worker 0-0, policy_version 1209275 (0.00085) [2022-07-11 13:22:52,498][26022] Updated weights on worker 0-0, policy_version 1209285 (0.00086) [2022-07-11 13:22:52,822][25689] Fps is (10 sec: 5521.4, 60 sec: 5652.8, 300 sec: 5633.3). Total num frames: 1238309888. Throughput: 0: 5892.4. Samples: 1238310222. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:52,823][25689] Avg episode reward: [(0, '0.828')] [2022-07-11 13:22:53,987][26022] Updated weights on worker 0-0, policy_version 1209295 (0.00088) [2022-07-11 13:22:55,944][26022] Updated weights on worker 0-0, policy_version 1209305 (0.00079) [2022-07-11 13:22:57,864][25689] Fps is (10 sec: 5565.8, 60 sec: 5632.3, 300 sec: 5633.0). Total num frames: 1238337536. Throughput: 0: 5874.2. Samples: 1238344196. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:22:57,865][25689] Avg episode reward: [(0, '0.724')] [2022-07-11 13:22:57,882][26022] Updated weights on worker 0-0, policy_version 1209315 (0.00085) [2022-07-11 13:22:59,499][26022] Updated weights on worker 0-0, policy_version 1209325 (0.00078) [2022-07-11 13:23:01,370][26022] Updated weights on worker 0-0, policy_version 1209335 (0.00084) [2022-07-11 13:23:02,896][25689] Fps is (10 sec: 5489.0, 60 sec: 5617.9, 300 sec: 5637.1). Total num frames: 1238365184. Throughput: 0: 5038.4. Samples: 1238361296. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:02,899][25689] Avg episode reward: [(0, '1.033')] [2022-07-11 13:23:03,488][26022] Updated weights on worker 0-0, policy_version 1209345 (0.00092) [2022-07-11 13:23:05,086][26022] Updated weights on worker 0-0, policy_version 1209355 (0.00085) [2022-07-11 13:23:07,344][26022] Updated weights on worker 0-0, policy_version 1209365 (0.00088) [2022-07-11 13:23:07,981][25689] Fps is (10 sec: 5566.5, 60 sec: 5652.7, 300 sec: 5632.7). Total num frames: 1238393856. Throughput: 0: 5787.8. Samples: 1238393230. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:07,982][25689] Avg episode reward: [(0, '1.036')] [2022-07-11 13:23:08,867][26022] Updated weights on worker 0-0, policy_version 1209375 (0.00089) [2022-07-11 13:23:10,736][26022] Updated weights on worker 0-0, policy_version 1209385 (0.00081) [2022-07-11 13:23:12,646][26022] Updated weights on worker 0-0, policy_version 1209395 (0.00080) [2022-07-11 13:23:12,998][25689] Fps is (10 sec: 5676.7, 60 sec: 5624.9, 300 sec: 5637.3). Total num frames: 1238422528. Throughput: 0: 5786.1. Samples: 1238426960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:12,998][25689] Avg episode reward: [(0, '1.252')] [2022-07-11 13:23:14,424][26022] Updated weights on worker 0-0, policy_version 1209405 (0.00087) [2022-07-11 13:23:16,240][26022] Updated weights on worker 0-0, policy_version 1209415 (0.00084) [2022-07-11 13:23:18,008][25689] Fps is (10 sec: 5514.7, 60 sec: 5625.8, 300 sec: 5624.0). Total num frames: 1238449152. Throughput: 0: 4950.8. Samples: 1238443928. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:18,009][25689] Avg episode reward: [(0, '2.152')] [2022-07-11 13:23:18,187][26022] Updated weights on worker 0-0, policy_version 1209425 (0.00092) [2022-07-11 13:23:19,932][26022] Updated weights on worker 0-0, policy_version 1209435 (0.00081) [2022-07-11 13:23:21,782][26022] Updated weights on worker 0-0, policy_version 1209445 (0.00092) [2022-07-11 13:23:23,026][25689] Fps is (10 sec: 5513.9, 60 sec: 5597.1, 300 sec: 5629.3). Total num frames: 1238477824. Throughput: 0: 5793.1. Samples: 1238477912. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:23,027][25689] Avg episode reward: [(0, '2.125')] [2022-07-11 13:23:23,472][26022] Updated weights on worker 0-0, policy_version 1209455 (0.00085) [2022-07-11 13:23:25,308][26022] Updated weights on worker 0-0, policy_version 1209465 (0.00081) [2022-07-11 13:23:27,021][26022] Updated weights on worker 0-0, policy_version 1209475 (0.00112) [2022-07-11 13:23:28,080][25689] Fps is (10 sec: 5795.1, 60 sec: 5652.4, 300 sec: 5636.4). Total num frames: 1238507520. Throughput: 0: 5892.4. Samples: 1238511660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:28,081][25689] Avg episode reward: [(0, '2.012')] [2022-07-11 13:23:29,115][26022] Updated weights on worker 0-0, policy_version 1209485 (0.00097) [2022-07-11 13:23:30,712][26022] Updated weights on worker 0-0, policy_version 1209495 (0.00085) [2022-07-11 13:23:32,651][26022] Updated weights on worker 0-0, policy_version 1209505 (0.00084) [2022-07-11 13:23:33,102][25689] Fps is (10 sec: 5691.4, 60 sec: 5601.6, 300 sec: 5629.2). Total num frames: 1238535168. Throughput: 0: 5052.5. Samples: 1238528536. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:33,102][25689] Avg episode reward: [(0, '1.928')] [2022-07-11 13:23:34,363][26022] Updated weights on worker 0-0, policy_version 1209515 (0.00082) [2022-07-11 13:23:36,256][26022] Updated weights on worker 0-0, policy_version 1209525 (0.00086) [2022-07-11 13:23:37,972][26022] Updated weights on worker 0-0, policy_version 1209535 (0.00084) [2022-07-11 13:23:38,127][25689] Fps is (10 sec: 5707.9, 60 sec: 5635.7, 300 sec: 5635.7). Total num frames: 1238564864. Throughput: 0: 5905.0. Samples: 1238562726. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:38,127][25689] Avg episode reward: [(0, '1.029')] [2022-07-11 13:23:39,855][26022] Updated weights on worker 0-0, policy_version 1209545 (0.00093) [2022-07-11 13:23:41,649][26022] Updated weights on worker 0-0, policy_version 1209555 (0.00078) [2022-07-11 13:23:43,141][25689] Fps is (10 sec: 5712.0, 60 sec: 5600.6, 300 sec: 5626.7). Total num frames: 1238592512. Throughput: 0: 5911.0. Samples: 1238596810. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:43,142][25689] Avg episode reward: [(0, '0.115')] [2022-07-11 13:23:43,450][26022] Updated weights on worker 0-0, policy_version 1209565 (0.00084) [2022-07-11 13:23:45,272][26022] Updated weights on worker 0-0, policy_version 1209575 (0.00077) [2022-07-11 13:23:47,023][26022] Updated weights on worker 0-0, policy_version 1209585 (0.00103) [2022-07-11 13:23:48,186][25689] Fps is (10 sec: 5598.8, 60 sec: 5624.8, 300 sec: 5634.5). Total num frames: 1238621184. Throughput: 0: 5082.7. Samples: 1238613850. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:48,195][25689] Avg episode reward: [(0, '0.119')] [2022-07-11 13:23:48,870][26022] Updated weights on worker 0-0, policy_version 1209595 (0.00094) [2022-07-11 13:23:50,787][26022] Updated weights on worker 0-0, policy_version 1209605 (0.00099) [2022-07-11 13:23:52,489][26022] Updated weights on worker 0-0, policy_version 1209615 (0.00087) [2022-07-11 13:23:53,199][25689] Fps is (10 sec: 5599.6, 60 sec: 5613.8, 300 sec: 5624.9). Total num frames: 1238648832. Throughput: 0: 5925.0. Samples: 1238647612. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:53,199][25689] Avg episode reward: [(0, '-0.621')] [2022-07-11 13:23:54,237][26022] Updated weights on worker 0-0, policy_version 1209625 (0.00097) [2022-07-11 13:23:56,352][26022] Updated weights on worker 0-0, policy_version 1209635 (0.00081) [2022-07-11 13:23:57,819][26022] Updated weights on worker 0-0, policy_version 1209645 (0.00078) [2022-07-11 13:23:58,243][25689] Fps is (10 sec: 5702.0, 60 sec: 5647.5, 300 sec: 5635.4). Total num frames: 1238678528. Throughput: 0: 5929.6. Samples: 1238682006. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:23:58,243][25689] Avg episode reward: [(0, '-0.551')] [2022-07-11 13:23:59,820][26022] Updated weights on worker 0-0, policy_version 1209655 (0.00087) [2022-07-11 13:24:01,171][26022] Updated weights on worker 0-0, policy_version 1209665 (0.00087) [2022-07-11 13:24:03,260][25689] Fps is (10 sec: 5496.1, 60 sec: 5615.0, 300 sec: 5632.7). Total num frames: 1238704128. Throughput: 0: 5830.2. Samples: 1238714106. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:03,260][25689] Avg episode reward: [(0, '-0.196')] [2022-07-11 13:24:03,443][26022] Updated weights on worker 0-0, policy_version 1209675 (0.00086) [2022-07-11 13:24:05,611][26022] Updated weights on worker 0-0, policy_version 1209685 (0.00085) [2022-07-11 13:24:07,220][26022] Updated weights on worker 0-0, policy_version 1209695 (0.00084) [2022-07-11 13:24:08,298][25689] Fps is (10 sec: 5295.8, 60 sec: 5602.5, 300 sec: 5627.0). Total num frames: 1238731776. Throughput: 0: 5829.6. Samples: 1238731092. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:08,300][25689] Avg episode reward: [(0, '0.644')] [2022-07-11 13:24:09,045][26022] Updated weights on worker 0-0, policy_version 1209705 (0.00086) [2022-07-11 13:24:10,165][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:24:10,174][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001209710_1238743040.pth [2022-07-11 13:24:10,179][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001207731_1236716544.pth [2022-07-11 13:24:10,864][26022] Updated weights on worker 0-0, policy_version 1209715 (0.00091) [2022-07-11 13:24:12,799][26022] Updated weights on worker 0-0, policy_version 1209725 (0.00087) [2022-07-11 13:24:13,306][25689] Fps is (10 sec: 5606.4, 60 sec: 5603.3, 300 sec: 5623.5). Total num frames: 1238760448. Throughput: 0: 5815.2. Samples: 1238764536. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:13,306][25689] Avg episode reward: [(0, '0.658')] [2022-07-11 13:24:14,710][26022] Updated weights on worker 0-0, policy_version 1209735 (0.00097) [2022-07-11 13:24:16,371][26022] Updated weights on worker 0-0, policy_version 1209745 (0.00093) [2022-07-11 13:24:18,006][26022] Updated weights on worker 0-0, policy_version 1209755 (0.00091) [2022-07-11 13:24:18,322][25689] Fps is (10 sec: 5822.6, 60 sec: 5653.7, 300 sec: 5634.7). Total num frames: 1238790144. Throughput: 0: 5795.4. Samples: 1238798374. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:18,322][25689] Avg episode reward: [(0, '0.533')] [2022-07-11 13:24:20,079][26022] Updated weights on worker 0-0, policy_version 1209765 (0.00095) [2022-07-11 13:24:21,669][26022] Updated weights on worker 0-0, policy_version 1209775 (0.00082) [2022-07-11 13:24:23,355][25689] Fps is (10 sec: 5604.5, 60 sec: 5618.3, 300 sec: 5625.3). Total num frames: 1238816768. Throughput: 0: 5039.4. Samples: 1238815372. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:23,356][25689] Avg episode reward: [(0, '0.469')] [2022-07-11 13:24:23,707][26022] Updated weights on worker 0-0, policy_version 1209785 (0.00089) [2022-07-11 13:24:25,238][26022] Updated weights on worker 0-0, policy_version 1209795 (0.00089) [2022-07-11 13:24:27,227][26022] Updated weights on worker 0-0, policy_version 1209805 (0.00085) [2022-07-11 13:24:28,420][25689] Fps is (10 sec: 5678.8, 60 sec: 5634.3, 300 sec: 5637.9). Total num frames: 1238847488. Throughput: 0: 5878.9. Samples: 1238849388. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:28,420][25689] Avg episode reward: [(0, '0.913')] [2022-07-11 13:24:29,043][26022] Updated weights on worker 0-0, policy_version 1209815 (0.00084) [2022-07-11 13:24:30,959][26022] Updated weights on worker 0-0, policy_version 1209825 (0.00089) [2022-07-11 13:24:32,535][26022] Updated weights on worker 0-0, policy_version 1209835 (0.00091) [2022-07-11 13:24:33,429][25689] Fps is (10 sec: 5692.1, 60 sec: 5618.4, 300 sec: 5628.9). Total num frames: 1238874112. Throughput: 0: 5899.3. Samples: 1238883250. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:33,430][25689] Avg episode reward: [(0, '1.281')] [2022-07-11 13:24:34,419][26022] Updated weights on worker 0-0, policy_version 1209845 (0.00088) [2022-07-11 13:24:36,332][26022] Updated weights on worker 0-0, policy_version 1209855 (0.00086) [2022-07-11 13:24:38,228][26022] Updated weights on worker 0-0, policy_version 1209865 (0.00091) [2022-07-11 13:24:38,446][25689] Fps is (10 sec: 5515.1, 60 sec: 5602.2, 300 sec: 5628.9). Total num frames: 1238902784. Throughput: 0: 5058.9. Samples: 1238900180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2022-07-11 13:24:38,447][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 13:24:39,942][26022] Updated weights on worker 0-0, policy_version 1209875 (0.00097) [2022-07-11 13:24:41,766][26022] Updated weights on worker 0-0, policy_version 1209885 (0.00087) [2022-07-11 13:24:43,454][25689] Fps is (10 sec: 5720.0, 60 sec: 5619.7, 300 sec: 5633.7). Total num frames: 1238931456. Throughput: 0: 5915.0. Samples: 1238934262. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:24:43,455][25689] Avg episode reward: [(0, '0.975')] [2022-07-11 13:24:43,685][26022] Updated weights on worker 0-0, policy_version 1209895 (0.00088) [2022-07-11 13:24:45,227][26022] Updated weights on worker 0-0, policy_version 1209905 (0.00085) [2022-07-11 13:24:47,319][26022] Updated weights on worker 0-0, policy_version 1209915 (0.00088) [2022-07-11 13:24:48,505][25689] Fps is (10 sec: 5802.9, 60 sec: 5636.2, 300 sec: 5633.0). Total num frames: 1238961152. Throughput: 0: 5933.9. Samples: 1238968570. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:24:48,505][25689] Avg episode reward: [(0, '0.936')] [2022-07-11 13:24:49,120][26022] Updated weights on worker 0-0, policy_version 1209925 (0.00080) [2022-07-11 13:24:50,774][26022] Updated weights on worker 0-0, policy_version 1209935 (0.00438) [2022-07-11 13:24:52,715][26022] Updated weights on worker 0-0, policy_version 1209945 (0.00085) [2022-07-11 13:24:53,552][25689] Fps is (10 sec: 5679.2, 60 sec: 5633.0, 300 sec: 5633.0). Total num frames: 1238988800. Throughput: 0: 5088.3. Samples: 1238985640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:24:53,553][25689] Avg episode reward: [(0, '0.217')] [2022-07-11 13:24:54,344][26022] Updated weights on worker 0-0, policy_version 1209955 (0.00095) [2022-07-11 13:24:56,187][26022] Updated weights on worker 0-0, policy_version 1209965 (0.00087) [2022-07-11 13:24:57,932][26022] Updated weights on worker 0-0, policy_version 1209975 (0.00086) [2022-07-11 13:24:58,560][25689] Fps is (10 sec: 5499.1, 60 sec: 5602.4, 300 sec: 5631.0). Total num frames: 1239016448. Throughput: 0: 5929.2. Samples: 1239019440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:24:58,563][25689] Avg episode reward: [(0, '0.604')] [2022-07-11 13:24:59,961][26022] Updated weights on worker 0-0, policy_version 1209985 (0.00086) [2022-07-11 13:25:02,067][26022] Updated weights on worker 0-0, policy_version 1209995 (0.00088) [2022-07-11 13:25:03,577][25689] Fps is (10 sec: 5311.3, 60 sec: 5602.4, 300 sec: 5626.8). Total num frames: 1239042048. Throughput: 0: 5829.4. Samples: 1239051564. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:03,578][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 13:25:03,785][26022] Updated weights on worker 0-0, policy_version 1210005 (0.00099) [2022-07-11 13:25:05,487][26022] Updated weights on worker 0-0, policy_version 1210015 (0.00084) [2022-07-11 13:25:07,482][26022] Updated weights on worker 0-0, policy_version 1210025 (0.00105) [2022-07-11 13:25:08,656][25689] Fps is (10 sec: 5578.2, 60 sec: 5649.4, 300 sec: 5636.1). Total num frames: 1239072768. Throughput: 0: 4962.6. Samples: 1239068576. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:08,657][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 13:25:09,187][26022] Updated weights on worker 0-0, policy_version 1210035 (0.00087) [2022-07-11 13:25:11,303][26022] Updated weights on worker 0-0, policy_version 1210045 (0.00087) [2022-07-11 13:25:12,910][26022] Updated weights on worker 0-0, policy_version 1210055 (0.00096) [2022-07-11 13:25:13,730][25689] Fps is (10 sec: 5647.9, 60 sec: 5609.4, 300 sec: 5625.2). Total num frames: 1239099392. Throughput: 0: 5790.7. Samples: 1239102488. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:13,731][25689] Avg episode reward: [(0, '0.774')] [2022-07-11 13:25:14,637][26022] Updated weights on worker 0-0, policy_version 1210065 (0.00085) [2022-07-11 13:25:16,646][26022] Updated weights on worker 0-0, policy_version 1210075 (0.00086) [2022-07-11 13:25:18,263][26022] Updated weights on worker 0-0, policy_version 1210085 (0.00083) [2022-07-11 13:25:18,811][25689] Fps is (10 sec: 5647.3, 60 sec: 5620.3, 300 sec: 5634.2). Total num frames: 1239130112. Throughput: 0: 5786.0. Samples: 1239136610. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:18,811][25689] Avg episode reward: [(0, '1.710')] [2022-07-11 13:25:20,257][26022] Updated weights on worker 0-0, policy_version 1210095 (0.00084) [2022-07-11 13:25:21,739][26022] Updated weights on worker 0-0, policy_version 1210105 (0.00099) [2022-07-11 13:25:23,705][26022] Updated weights on worker 0-0, policy_version 1210115 (0.00093) [2022-07-11 13:25:23,899][25689] Fps is (10 sec: 5739.9, 60 sec: 5632.1, 300 sec: 5630.1). Total num frames: 1239157760. Throughput: 0: 5030.7. Samples: 1239153798. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:23,907][25689] Avg episode reward: [(0, '1.461')] [2022-07-11 13:25:25,368][26022] Updated weights on worker 0-0, policy_version 1210125 (0.00088) [2022-07-11 13:25:27,321][26022] Updated weights on worker 0-0, policy_version 1210135 (0.00084) [2022-07-11 13:25:29,001][25689] Fps is (10 sec: 5727.9, 60 sec: 5628.7, 300 sec: 5635.3). Total num frames: 1239188480. Throughput: 0: 5867.2. Samples: 1239187942. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:29,003][26022] Updated weights on worker 0-0, policy_version 1210145 (0.00087) [2022-07-11 13:25:29,001][25689] Avg episode reward: [(0, '1.587')] [2022-07-11 13:25:30,935][26022] Updated weights on worker 0-0, policy_version 1210155 (0.00089) [2022-07-11 13:25:32,611][26022] Updated weights on worker 0-0, policy_version 1210165 (0.00083) [2022-07-11 13:25:34,084][25689] Fps is (10 sec: 5730.5, 60 sec: 5638.7, 300 sec: 5633.9). Total num frames: 1239216128. Throughput: 0: 5878.3. Samples: 1239222136. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:34,085][25689] Avg episode reward: [(0, '1.633')] [2022-07-11 13:25:34,534][26022] Updated weights on worker 0-0, policy_version 1210175 (0.00090) [2022-07-11 13:25:36,233][26022] Updated weights on worker 0-0, policy_version 1210185 (0.00059) [2022-07-11 13:25:37,954][26022] Updated weights on worker 0-0, policy_version 1210195 (0.00082) [2022-07-11 13:25:39,127][25689] Fps is (10 sec: 5663.2, 60 sec: 5653.2, 300 sec: 5633.4). Total num frames: 1239245824. Throughput: 0: 5067.9. Samples: 1239239570. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:39,127][25689] Avg episode reward: [(0, '0.851')] [2022-07-11 13:25:39,898][26022] Updated weights on worker 0-0, policy_version 1210205 (0.00083) [2022-07-11 13:25:41,539][26022] Updated weights on worker 0-0, policy_version 1210215 (0.00082) [2022-07-11 13:25:43,551][26022] Updated weights on worker 0-0, policy_version 1210225 (0.00087) [2022-07-11 13:25:44,144][25689] Fps is (10 sec: 5802.0, 60 sec: 5652.3, 300 sec: 5637.3). Total num frames: 1239274496. Throughput: 0: 5925.1. Samples: 1239273754. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:44,145][25689] Avg episode reward: [(0, '-0.246')] [2022-07-11 13:25:45,226][26022] Updated weights on worker 0-0, policy_version 1210235 (0.00088) [2022-07-11 13:25:46,969][26022] Updated weights on worker 0-0, policy_version 1210245 (0.00081) [2022-07-11 13:25:48,838][26022] Updated weights on worker 0-0, policy_version 1210255 (0.00080) [2022-07-11 13:25:49,187][25689] Fps is (10 sec: 5598.3, 60 sec: 5619.3, 300 sec: 5629.7). Total num frames: 1239302144. Throughput: 0: 5941.7. Samples: 1239307880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:49,187][25689] Avg episode reward: [(0, '-0.829')] [2022-07-11 13:25:50,445][26022] Updated weights on worker 0-0, policy_version 1210265 (0.00085) [2022-07-11 13:25:52,493][26022] Updated weights on worker 0-0, policy_version 1210275 (0.00084) [2022-07-11 13:25:54,083][26022] Updated weights on worker 0-0, policy_version 1210285 (0.00087) [2022-07-11 13:25:54,211][25689] Fps is (10 sec: 5696.5, 60 sec: 5655.2, 300 sec: 5640.1). Total num frames: 1239331840. Throughput: 0: 5098.4. Samples: 1239324750. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:54,212][25689] Avg episode reward: [(0, '-1.838')] [2022-07-11 13:25:56,035][26022] Updated weights on worker 0-0, policy_version 1210295 (0.00098) [2022-07-11 13:25:57,878][26022] Updated weights on worker 0-0, policy_version 1210305 (0.00096) [2022-07-11 13:25:59,229][25689] Fps is (10 sec: 5710.1, 60 sec: 5654.3, 300 sec: 5633.3). Total num frames: 1239359488. Throughput: 0: 5941.2. Samples: 1239359004. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:25:59,233][25689] Avg episode reward: [(0, '-2.086')] [2022-07-11 13:25:59,696][26022] Updated weights on worker 0-0, policy_version 1210315 (0.00088) [2022-07-11 13:26:01,451][26022] Updated weights on worker 0-0, policy_version 1210325 (0.00085) [2022-07-11 13:26:03,737][26022] Updated weights on worker 0-0, policy_version 1210335 (0.00084) [2022-07-11 13:26:04,255][25689] Fps is (10 sec: 5505.4, 60 sec: 5687.2, 300 sec: 5638.0). Total num frames: 1239387136. Throughput: 0: 5855.4. Samples: 1239391508. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:04,255][25689] Avg episode reward: [(0, '-2.306')] [2022-07-11 13:26:05,186][26022] Updated weights on worker 0-0, policy_version 1210345 (0.00088) [2022-07-11 13:26:07,392][26022] Updated weights on worker 0-0, policy_version 1210355 (0.00089) [2022-07-11 13:26:08,698][26022] Updated weights on worker 0-0, policy_version 1210365 (0.00085) [2022-07-11 13:26:09,328][25689] Fps is (10 sec: 5576.9, 60 sec: 5654.0, 300 sec: 5640.9). Total num frames: 1239415808. Throughput: 0: 5013.2. Samples: 1239408852. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:09,329][25689] Avg episode reward: [(0, '-2.823')] [2022-07-11 13:26:10,396][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:26:10,405][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001210372_1239420928.pth [2022-07-11 13:26:10,410][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001208391_1237392384.pth [2022-07-11 13:26:10,835][26022] Updated weights on worker 0-0, policy_version 1210375 (0.00086) [2022-07-11 13:26:12,295][26022] Updated weights on worker 0-0, policy_version 1210385 (0.00078) [2022-07-11 13:26:14,337][25689] Fps is (10 sec: 5586.2, 60 sec: 5677.0, 300 sec: 5634.9). Total num frames: 1239443456. Throughput: 0: 5879.6. Samples: 1239443082. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:14,340][25689] Avg episode reward: [(0, '-2.111')] [2022-07-11 13:26:14,427][26022] Updated weights on worker 0-0, policy_version 1210395 (0.00083) [2022-07-11 13:26:16,198][26022] Updated weights on worker 0-0, policy_version 1210406 (0.00084) [2022-07-11 13:26:18,096][26022] Updated weights on worker 0-0, policy_version 1210416 (0.00090) [2022-07-11 13:26:19,346][25689] Fps is (10 sec: 5724.4, 60 sec: 5666.8, 300 sec: 5640.0). Total num frames: 1239473152. Throughput: 0: 5890.4. Samples: 1239477496. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:19,347][25689] Avg episode reward: [(0, '-2.237')] [2022-07-11 13:26:19,774][26022] Updated weights on worker 0-0, policy_version 1210426 (0.00079) [2022-07-11 13:26:21,792][26022] Updated weights on worker 0-0, policy_version 1210436 (0.00085) [2022-07-11 13:26:23,355][26022] Updated weights on worker 0-0, policy_version 1210446 (0.00086) [2022-07-11 13:26:24,368][25689] Fps is (10 sec: 5818.9, 60 sec: 5690.0, 300 sec: 5641.5). Total num frames: 1239501824. Throughput: 0: 5114.7. Samples: 1239494378. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:24,368][25689] Avg episode reward: [(0, '-1.271')] [2022-07-11 13:26:25,465][26022] Updated weights on worker 0-0, policy_version 1210456 (0.00087) [2022-07-11 13:26:27,103][26022] Updated weights on worker 0-0, policy_version 1210466 (0.00087) [2022-07-11 13:26:29,061][26022] Updated weights on worker 0-0, policy_version 1210476 (0.00098) [2022-07-11 13:26:29,421][25689] Fps is (10 sec: 5590.0, 60 sec: 5643.7, 300 sec: 5633.9). Total num frames: 1239529472. Throughput: 0: 5949.8. Samples: 1239528400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:29,423][25689] Avg episode reward: [(0, '-0.453')] [2022-07-11 13:26:30,695][26022] Updated weights on worker 0-0, policy_version 1210486 (0.00082) [2022-07-11 13:26:32,556][26022] Updated weights on worker 0-0, policy_version 1210496 (0.00080) [2022-07-11 13:26:34,279][26022] Updated weights on worker 0-0, policy_version 1210506 (0.00082) [2022-07-11 13:26:34,512][25689] Fps is (10 sec: 5552.1, 60 sec: 5660.0, 300 sec: 5639.1). Total num frames: 1239558144. Throughput: 0: 5911.5. Samples: 1239562344. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:34,513][25689] Avg episode reward: [(0, '-1.134')] [2022-07-11 13:26:36,195][26022] Updated weights on worker 0-0, policy_version 1210516 (0.00088) [2022-07-11 13:26:37,779][26022] Updated weights on worker 0-0, policy_version 1210526 (0.00102) [2022-07-11 13:26:39,569][25689] Fps is (10 sec: 5650.8, 60 sec: 5641.6, 300 sec: 5638.3). Total num frames: 1239586816. Throughput: 0: 5888.4. Samples: 1239596578. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:39,575][25689] Avg episode reward: [(0, '-0.443')] [2022-07-11 13:26:39,953][26022] Updated weights on worker 0-0, policy_version 1210536 (0.00087) [2022-07-11 13:26:41,275][26022] Updated weights on worker 0-0, policy_version 1210546 (0.00087) [2022-07-11 13:26:43,429][26022] Updated weights on worker 0-0, policy_version 1210556 (0.00086) [2022-07-11 13:26:44,601][25689] Fps is (10 sec: 5886.4, 60 sec: 5674.1, 300 sec: 5638.6). Total num frames: 1239617536. Throughput: 0: 5892.2. Samples: 1239613598. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:44,603][25689] Avg episode reward: [(0, '0.326')] [2022-07-11 13:26:44,997][26022] Updated weights on worker 0-0, policy_version 1210566 (0.00084) [2022-07-11 13:26:47,001][26022] Updated weights on worker 0-0, policy_version 1210576 (0.00091) [2022-07-11 13:26:48,862][26022] Updated weights on worker 0-0, policy_version 1210586 (0.00086) [2022-07-11 13:26:49,651][25689] Fps is (10 sec: 5687.3, 60 sec: 5656.5, 300 sec: 5637.8). Total num frames: 1239644160. Throughput: 0: 5896.1. Samples: 1239647682. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:49,652][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 13:26:50,623][26022] Updated weights on worker 0-0, policy_version 1210596 (0.00079) [2022-07-11 13:26:52,218][26022] Updated weights on worker 0-0, policy_version 1210606 (0.00084) [2022-07-11 13:26:54,244][26022] Updated weights on worker 0-0, policy_version 1210616 (0.00094) [2022-07-11 13:26:54,700][25689] Fps is (10 sec: 5475.5, 60 sec: 5637.3, 300 sec: 5636.9). Total num frames: 1239672832. Throughput: 0: 5922.3. Samples: 1239681906. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:54,700][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 13:26:55,874][26022] Updated weights on worker 0-0, policy_version 1210626 (0.00087) [2022-07-11 13:26:57,949][26022] Updated weights on worker 0-0, policy_version 1210636 (0.00084) [2022-07-11 13:26:59,388][26022] Updated weights on worker 0-0, policy_version 1210646 (0.00084) [2022-07-11 13:26:59,718][25689] Fps is (10 sec: 5798.1, 60 sec: 5671.1, 300 sec: 5641.1). Total num frames: 1239702528. Throughput: 0: 5079.4. Samples: 1239698926. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:26:59,721][25689] Avg episode reward: [(0, '0.693')] [2022-07-11 13:27:01,378][26022] Updated weights on worker 0-0, policy_version 1210656 (0.00091) [2022-07-11 13:27:03,411][26022] Updated weights on worker 0-0, policy_version 1210666 (0.00088) [2022-07-11 13:27:04,789][25689] Fps is (10 sec: 5379.1, 60 sec: 5616.1, 300 sec: 5634.7). Total num frames: 1239727104. Throughput: 0: 5821.1. Samples: 1239731116. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:04,790][25689] Avg episode reward: [(0, '1.921')] [2022-07-11 13:27:05,306][26022] Updated weights on worker 0-0, policy_version 1210676 (0.00085) [2022-07-11 13:27:06,964][26022] Updated weights on worker 0-0, policy_version 1210686 (0.00096) [2022-07-11 13:27:08,990][26022] Updated weights on worker 0-0, policy_version 1210696 (0.00096) [2022-07-11 13:27:09,852][25689] Fps is (10 sec: 5557.5, 60 sec: 5667.8, 300 sec: 5638.5). Total num frames: 1239758848. Throughput: 0: 5814.4. Samples: 1239765136. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:09,853][25689] Avg episode reward: [(0, '2.099')] [2022-07-11 13:27:10,790][26022] Updated weights on worker 0-0, policy_version 1210706 (0.00065) [2022-07-11 13:27:12,756][26022] Updated weights on worker 0-0, policy_version 1210716 (0.00085) [2022-07-11 13:27:14,235][26022] Updated weights on worker 0-0, policy_version 1210726 (0.00089) [2022-07-11 13:27:14,872][25689] Fps is (10 sec: 5789.0, 60 sec: 5649.9, 300 sec: 5638.5). Total num frames: 1239785472. Throughput: 0: 4964.9. Samples: 1239782058. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:14,873][25689] Avg episode reward: [(0, '1.987')] [2022-07-11 13:27:16,287][26022] Updated weights on worker 0-0, policy_version 1210736 (0.00084) [2022-07-11 13:27:18,016][26022] Updated weights on worker 0-0, policy_version 1210746 (0.00086) [2022-07-11 13:27:19,879][25689] Fps is (10 sec: 5514.6, 60 sec: 5633.1, 300 sec: 5632.8). Total num frames: 1239814144. Throughput: 0: 5801.4. Samples: 1239815888. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:19,880][25689] Avg episode reward: [(0, '1.816')] [2022-07-11 13:27:19,885][26022] Updated weights on worker 0-0, policy_version 1210756 (0.00086) [2022-07-11 13:27:21,599][26022] Updated weights on worker 0-0, policy_version 1210766 (0.00087) [2022-07-11 13:27:23,586][26022] Updated weights on worker 0-0, policy_version 1210776 (0.00075) [2022-07-11 13:27:24,932][25689] Fps is (10 sec: 5801.8, 60 sec: 5647.1, 300 sec: 5644.1). Total num frames: 1239843840. Throughput: 0: 5932.1. Samples: 1239850606. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:24,933][25689] Avg episode reward: [(0, '1.763')] [2022-07-11 13:27:24,975][26022] Updated weights on worker 0-0, policy_version 1210786 (0.00093) [2022-07-11 13:27:27,268][26022] Updated weights on worker 0-0, policy_version 1210796 (0.00082) [2022-07-11 13:27:28,579][26022] Updated weights on worker 0-0, policy_version 1210806 (0.00085) [2022-07-11 13:27:30,083][25689] Fps is (10 sec: 5519.6, 60 sec: 5621.2, 300 sec: 5627.9). Total num frames: 1239870464. Throughput: 0: 5067.2. Samples: 1239867650. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:30,085][25689] Avg episode reward: [(0, '0.966')] [2022-07-11 13:27:30,694][26022] Updated weights on worker 0-0, policy_version 1210816 (0.00078) [2022-07-11 13:27:32,342][26022] Updated weights on worker 0-0, policy_version 1210826 (0.00087) [2022-07-11 13:27:34,175][26022] Updated weights on worker 0-0, policy_version 1210836 (0.00089) [2022-07-11 13:27:35,115][25689] Fps is (10 sec: 5631.7, 60 sec: 5660.4, 300 sec: 5638.1). Total num frames: 1239901184. Throughput: 0: 5907.2. Samples: 1239901636. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:35,115][25689] Avg episode reward: [(0, '0.854')] [2022-07-11 13:27:36,026][26022] Updated weights on worker 0-0, policy_version 1210846 (0.00091) [2022-07-11 13:27:37,696][26022] Updated weights on worker 0-0, policy_version 1210856 (0.00086) [2022-07-11 13:27:39,609][26022] Updated weights on worker 0-0, policy_version 1210866 (0.00082) [2022-07-11 13:27:40,124][25689] Fps is (10 sec: 5812.8, 60 sec: 5648.0, 300 sec: 5631.1). Total num frames: 1239928832. Throughput: 0: 5927.5. Samples: 1239935890. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:40,126][25689] Avg episode reward: [(0, '-0.272')] [2022-07-11 13:27:41,408][26022] Updated weights on worker 0-0, policy_version 1210876 (0.00090) [2022-07-11 13:27:43,084][26022] Updated weights on worker 0-0, policy_version 1210886 (0.00092) [2022-07-11 13:27:45,133][25689] Fps is (10 sec: 5519.5, 60 sec: 5599.4, 300 sec: 5633.3). Total num frames: 1239956480. Throughput: 0: 5064.6. Samples: 1239952914. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:45,134][25689] Avg episode reward: [(0, '-0.525')] [2022-07-11 13:27:45,169][26022] Updated weights on worker 0-0, policy_version 1210896 (0.00081) [2022-07-11 13:27:46,739][26022] Updated weights on worker 0-0, policy_version 1210906 (0.00086) [2022-07-11 13:27:48,654][26022] Updated weights on worker 0-0, policy_version 1210916 (0.00093) [2022-07-11 13:27:50,239][25689] Fps is (10 sec: 5770.4, 60 sec: 5661.9, 300 sec: 5639.6). Total num frames: 1239987200. Throughput: 0: 5903.3. Samples: 1239986640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:50,240][25689] Avg episode reward: [(0, '-1.150')] [2022-07-11 13:27:50,554][26022] Updated weights on worker 0-0, policy_version 1210926 (0.00084) [2022-07-11 13:27:52,055][26022] Updated weights on worker 0-0, policy_version 1210936 (0.00092) [2022-07-11 13:27:54,119][26022] Updated weights on worker 0-0, policy_version 1210946 (0.00080) [2022-07-11 13:27:55,311][25689] Fps is (10 sec: 5835.3, 60 sec: 5659.7, 300 sec: 5642.5). Total num frames: 1240015872. Throughput: 0: 5919.9. Samples: 1240021196. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:27:55,311][25689] Avg episode reward: [(0, '-1.146')] [2022-07-11 13:27:55,675][26022] Updated weights on worker 0-0, policy_version 1210956 (0.00082) [2022-07-11 13:27:57,745][26022] Updated weights on worker 0-0, policy_version 1210966 (0.00087) [2022-07-11 13:27:59,349][26022] Updated weights on worker 0-0, policy_version 1210976 (0.00084) [2022-07-11 13:28:00,336][25689] Fps is (10 sec: 5578.3, 60 sec: 5625.3, 300 sec: 5642.6). Total num frames: 1240043520. Throughput: 0: 5061.5. Samples: 1240038190. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:00,336][25689] Avg episode reward: [(0, '-0.924')] [2022-07-11 13:28:01,126][26022] Updated weights on worker 0-0, policy_version 1210986 (0.00091) [2022-07-11 13:28:03,428][26022] Updated weights on worker 0-0, policy_version 1210996 (0.00088) [2022-07-11 13:28:05,075][26022] Updated weights on worker 0-0, policy_version 1211006 (0.00087) [2022-07-11 13:28:05,352][25689] Fps is (10 sec: 5405.1, 60 sec: 5664.2, 300 sec: 5637.0). Total num frames: 1240070144. Throughput: 0: 5807.8. Samples: 1240070340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:05,352][25689] Avg episode reward: [(0, '-1.044')] [2022-07-11 13:28:06,948][26022] Updated weights on worker 0-0, policy_version 1211016 (0.00086) [2022-07-11 13:28:08,785][26022] Updated weights on worker 0-0, policy_version 1211026 (0.00082) [2022-07-11 13:28:10,415][25689] Fps is (10 sec: 5587.7, 60 sec: 5630.4, 300 sec: 5639.6). Total num frames: 1240099840. Throughput: 0: 5855.4. Samples: 1240104776. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:10,415][25689] Avg episode reward: [(0, '0.010')] [2022-07-11 13:28:10,442][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:28:10,454][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001211036_1240100864.pth [2022-07-11 13:28:10,454][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001209050_1238067200.pth [2022-07-11 13:28:10,463][26022] Updated weights on worker 0-0, policy_version 1211036 (0.00103) [2022-07-11 13:28:12,371][26022] Updated weights on worker 0-0, policy_version 1211046 (0.00084) [2022-07-11 13:28:14,227][26022] Updated weights on worker 0-0, policy_version 1211056 (0.00053) [2022-07-11 13:28:15,427][25689] Fps is (10 sec: 5793.3, 60 sec: 5664.9, 300 sec: 5646.5). Total num frames: 1240128512. Throughput: 0: 5007.7. Samples: 1240121930. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:15,427][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 13:28:16,016][26022] Updated weights on worker 0-0, policy_version 1211066 (0.00091) [2022-07-11 13:28:17,735][26022] Updated weights on worker 0-0, policy_version 1211076 (0.00090) [2022-07-11 13:28:19,588][26022] Updated weights on worker 0-0, policy_version 1211086 (0.00084) [2022-07-11 13:28:20,431][25689] Fps is (10 sec: 5622.8, 60 sec: 5648.3, 300 sec: 5643.3). Total num frames: 1240156160. Throughput: 0: 5860.7. Samples: 1240155964. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:20,432][25689] Avg episode reward: [(0, '0.595')] [2022-07-11 13:28:21,383][26022] Updated weights on worker 0-0, policy_version 1211096 (0.00088) [2022-07-11 13:28:23,159][26022] Updated weights on worker 0-0, policy_version 1211106 (0.00090) [2022-07-11 13:28:24,879][26022] Updated weights on worker 0-0, policy_version 1211116 (0.00085) [2022-07-11 13:28:25,455][25689] Fps is (10 sec: 5616.5, 60 sec: 5634.2, 300 sec: 5640.4). Total num frames: 1240184832. Throughput: 0: 5969.4. Samples: 1240190342. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:25,455][25689] Avg episode reward: [(0, '1.090')] [2022-07-11 13:28:26,798][26022] Updated weights on worker 0-0, policy_version 1211126 (0.00085) [2022-07-11 13:28:28,748][26022] Updated weights on worker 0-0, policy_version 1211136 (0.00086) [2022-07-11 13:28:30,225][26022] Updated weights on worker 0-0, policy_version 1211146 (0.00087) [2022-07-11 13:28:30,549][25689] Fps is (10 sec: 5769.1, 60 sec: 5690.2, 300 sec: 5645.9). Total num frames: 1240214528. Throughput: 0: 5083.1. Samples: 1240207118. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:30,549][25689] Avg episode reward: [(0, '0.757')] [2022-07-11 13:28:32,297][26022] Updated weights on worker 0-0, policy_version 1211156 (0.00084) [2022-07-11 13:28:34,036][26022] Updated weights on worker 0-0, policy_version 1211166 (0.00079) [2022-07-11 13:28:35,623][25689] Fps is (10 sec: 5840.6, 60 sec: 5669.3, 300 sec: 5645.0). Total num frames: 1240244224. Throughput: 0: 5913.0. Samples: 1240241352. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:35,624][25689] Avg episode reward: [(0, '0.316')] [2022-07-11 13:28:35,634][26022] Updated weights on worker 0-0, policy_version 1211176 (0.00072) [2022-07-11 13:28:37,628][26022] Updated weights on worker 0-0, policy_version 1211186 (0.00087) [2022-07-11 13:28:39,400][26022] Updated weights on worker 0-0, policy_version 1211196 (0.00092) [2022-07-11 13:28:40,667][25689] Fps is (10 sec: 5566.3, 60 sec: 5649.2, 300 sec: 5641.0). Total num frames: 1240270848. Throughput: 0: 5923.8. Samples: 1240275834. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 13:28:40,667][25689] Avg episode reward: [(0, '0.587')] [2022-07-11 13:28:41,057][26022] Updated weights on worker 0-0, policy_version 1211206 (0.00081) [2022-07-11 13:28:42,856][26022] Updated weights on worker 0-0, policy_version 1211216 (0.00093) [2022-07-11 13:28:44,749][26022] Updated weights on worker 0-0, policy_version 1211226 (0.00081) [2022-07-11 13:28:45,672][25689] Fps is (10 sec: 5604.8, 60 sec: 5683.3, 300 sec: 5645.2). Total num frames: 1240300544. Throughput: 0: 5079.2. Samples: 1240293030. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:28:45,673][25689] Avg episode reward: [(0, '0.866')] [2022-07-11 13:28:46,563][26022] Updated weights on worker 0-0, policy_version 1211236 (0.00086) [2022-07-11 13:28:48,305][26022] Updated weights on worker 0-0, policy_version 1211246 (0.00087) [2022-07-11 13:28:49,976][26022] Updated weights on worker 0-0, policy_version 1211256 (0.00089) [2022-07-11 13:28:50,795][25689] Fps is (10 sec: 5762.8, 60 sec: 5648.0, 300 sec: 5646.5). Total num frames: 1240329216. Throughput: 0: 5935.1. Samples: 1240327282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:28:50,796][25689] Avg episode reward: [(0, '1.061')] [2022-07-11 13:28:51,922][26022] Updated weights on worker 0-0, policy_version 1211266 (0.00052) [2022-07-11 13:28:53,735][26022] Updated weights on worker 0-0, policy_version 1211276 (0.00087) [2022-07-11 13:28:55,438][26022] Updated weights on worker 0-0, policy_version 1211286 (0.00080) [2022-07-11 13:28:55,828][25689] Fps is (10 sec: 5746.9, 60 sec: 5668.4, 300 sec: 5646.7). Total num frames: 1240358912. Throughput: 0: 5937.0. Samples: 1240361310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:28:55,829][25689] Avg episode reward: [(0, '1.377')] [2022-07-11 13:28:57,435][26022] Updated weights on worker 0-0, policy_version 1211296 (0.00082) [2022-07-11 13:28:59,094][26022] Updated weights on worker 0-0, policy_version 1211306 (0.00089) [2022-07-11 13:29:00,837][25689] Fps is (10 sec: 5710.3, 60 sec: 5669.9, 300 sec: 5653.8). Total num frames: 1240386560. Throughput: 0: 5938.7. Samples: 1240395622. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:00,838][25689] Avg episode reward: [(0, '1.588')] [2022-07-11 13:29:00,927][26022] Updated weights on worker 0-0, policy_version 1211316 (0.00083) [2022-07-11 13:29:03,180][26022] Updated weights on worker 0-0, policy_version 1211326 (0.00093) [2022-07-11 13:29:04,806][26022] Updated weights on worker 0-0, policy_version 1211336 (0.00089) [2022-07-11 13:29:05,854][25689] Fps is (10 sec: 5413.4, 60 sec: 5669.9, 300 sec: 5650.7). Total num frames: 1240413184. Throughput: 0: 5820.4. Samples: 1240410496. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:05,854][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 13:29:06,883][26022] Updated weights on worker 0-0, policy_version 1211346 (0.00087) [2022-07-11 13:29:08,488][26022] Updated weights on worker 0-0, policy_version 1211356 (0.00095) [2022-07-11 13:29:10,367][26022] Updated weights on worker 0-0, policy_version 1211366 (0.00079) [2022-07-11 13:29:11,005][25689] Fps is (10 sec: 5438.2, 60 sec: 5644.7, 300 sec: 5648.0). Total num frames: 1240441856. Throughput: 0: 5790.8. Samples: 1240444316. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:11,006][25689] Avg episode reward: [(0, '0.941')] [2022-07-11 13:29:12,034][26022] Updated weights on worker 0-0, policy_version 1211376 (0.00089) [2022-07-11 13:29:14,045][26022] Updated weights on worker 0-0, policy_version 1211386 (0.00089) [2022-07-11 13:29:15,866][26022] Updated weights on worker 0-0, policy_version 1211396 (0.00086) [2022-07-11 13:29:16,014][25689] Fps is (10 sec: 5644.0, 60 sec: 5645.0, 300 sec: 5644.7). Total num frames: 1240470528. Throughput: 0: 5800.3. Samples: 1240478392. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:16,015][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 13:29:17,800][26022] Updated weights on worker 0-0, policy_version 1211406 (0.00081) [2022-07-11 13:29:19,414][26022] Updated weights on worker 0-0, policy_version 1211416 (0.00083) [2022-07-11 13:29:21,043][25689] Fps is (10 sec: 5610.8, 60 sec: 5642.7, 300 sec: 5648.2). Total num frames: 1240498176. Throughput: 0: 4934.0. Samples: 1240495318. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:21,044][25689] Avg episode reward: [(0, '0.812')] [2022-07-11 13:29:21,311][26022] Updated weights on worker 0-0, policy_version 1211426 (0.00083) [2022-07-11 13:29:22,892][26022] Updated weights on worker 0-0, policy_version 1211436 (0.00087) [2022-07-11 13:29:24,851][26022] Updated weights on worker 0-0, policy_version 1211446 (0.00087) [2022-07-11 13:29:26,127][25689] Fps is (10 sec: 5670.6, 60 sec: 5654.0, 300 sec: 5644.4). Total num frames: 1240527872. Throughput: 0: 5876.8. Samples: 1240529636. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:26,127][25689] Avg episode reward: [(0, '0.390')] [2022-07-11 13:29:26,759][26022] Updated weights on worker 0-0, policy_version 1211456 (0.00091) [2022-07-11 13:29:28,412][26022] Updated weights on worker 0-0, policy_version 1211466 (0.00094) [2022-07-11 13:29:30,277][26022] Updated weights on worker 0-0, policy_version 1211476 (0.00089) [2022-07-11 13:29:31,194][25689] Fps is (10 sec: 5850.7, 60 sec: 5656.4, 300 sec: 5653.6). Total num frames: 1240557568. Throughput: 0: 5904.1. Samples: 1240563514. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:31,195][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 13:29:32,154][26022] Updated weights on worker 0-0, policy_version 1211486 (0.00092) [2022-07-11 13:29:33,794][26022] Updated weights on worker 0-0, policy_version 1211496 (0.00081) [2022-07-11 13:29:35,913][26022] Updated weights on worker 0-0, policy_version 1211506 (0.00090) [2022-07-11 13:29:36,238][25689] Fps is (10 sec: 5569.9, 60 sec: 5608.7, 300 sec: 5646.3). Total num frames: 1240584192. Throughput: 0: 5055.7. Samples: 1240580638. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:36,239][25689] Avg episode reward: [(0, '0.206')] [2022-07-11 13:29:37,265][26022] Updated weights on worker 0-0, policy_version 1211516 (0.00092) [2022-07-11 13:29:39,444][26022] Updated weights on worker 0-0, policy_version 1211526 (0.00049) [2022-07-11 13:29:40,928][26022] Updated weights on worker 0-0, policy_version 1211536 (0.00089) [2022-07-11 13:29:41,265][25689] Fps is (10 sec: 5592.4, 60 sec: 5660.8, 300 sec: 5649.3). Total num frames: 1240613888. Throughput: 0: 5901.4. Samples: 1240614658. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:41,266][25689] Avg episode reward: [(0, '0.373')] [2022-07-11 13:29:42,981][26022] Updated weights on worker 0-0, policy_version 1211546 (0.00263) [2022-07-11 13:29:44,503][26022] Updated weights on worker 0-0, policy_version 1211556 (0.00087) [2022-07-11 13:29:46,278][25689] Fps is (10 sec: 5609.4, 60 sec: 5609.4, 300 sec: 5639.7). Total num frames: 1240640512. Throughput: 0: 5909.2. Samples: 1240648716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:46,279][25689] Avg episode reward: [(0, '0.282')] [2022-07-11 13:29:46,551][26022] Updated weights on worker 0-0, policy_version 1211566 (0.00086) [2022-07-11 13:29:48,124][26022] Updated weights on worker 0-0, policy_version 1211576 (0.00086) [2022-07-11 13:29:50,152][26022] Updated weights on worker 0-0, policy_version 1211586 (0.00085) [2022-07-11 13:29:51,322][25689] Fps is (10 sec: 5702.1, 60 sec: 5650.6, 300 sec: 5650.1). Total num frames: 1240671232. Throughput: 0: 5088.3. Samples: 1240665928. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:51,322][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 13:29:51,994][26022] Updated weights on worker 0-0, policy_version 1211596 (0.00083) [2022-07-11 13:29:53,627][26022] Updated weights on worker 0-0, policy_version 1211606 (0.00085) [2022-07-11 13:29:55,368][26022] Updated weights on worker 0-0, policy_version 1211616 (0.00090) [2022-07-11 13:29:56,353][25689] Fps is (10 sec: 5793.7, 60 sec: 5617.0, 300 sec: 5649.7). Total num frames: 1240698880. Throughput: 0: 5953.5. Samples: 1240700390. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:29:56,353][25689] Avg episode reward: [(0, '0.762')] [2022-07-11 13:29:57,340][26022] Updated weights on worker 0-0, policy_version 1211626 (0.00087) [2022-07-11 13:29:59,025][26022] Updated weights on worker 0-0, policy_version 1211636 (0.00083) [2022-07-11 13:30:00,813][26022] Updated weights on worker 0-0, policy_version 1211646 (0.00087) [2022-07-11 13:30:01,367][25689] Fps is (10 sec: 5505.0, 60 sec: 5616.6, 300 sec: 5656.6). Total num frames: 1240726528. Throughput: 0: 5959.7. Samples: 1240734456. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:01,368][25689] Avg episode reward: [(0, '1.020')] [2022-07-11 13:30:02,935][26022] Updated weights on worker 0-0, policy_version 1211656 (0.00087) [2022-07-11 13:30:04,945][26022] Updated weights on worker 0-0, policy_version 1211666 (0.00088) [2022-07-11 13:30:06,386][25689] Fps is (10 sec: 5511.4, 60 sec: 5633.3, 300 sec: 5647.4). Total num frames: 1240754176. Throughput: 0: 4996.4. Samples: 1240749182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:06,388][25689] Avg episode reward: [(0, '1.082')] [2022-07-11 13:30:06,724][26022] Updated weights on worker 0-0, policy_version 1211676 (0.00088) [2022-07-11 13:30:08,484][26022] Updated weights on worker 0-0, policy_version 1211686 (0.00085) [2022-07-11 13:30:10,291][26022] Updated weights on worker 0-0, policy_version 1211696 (0.00086) [2022-07-11 13:30:10,563][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:30:10,571][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001211697_1240777728.pth [2022-07-11 13:30:10,572][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001209710_1238743040.pth [2022-07-11 13:30:11,497][25689] Fps is (10 sec: 5458.7, 60 sec: 5620.1, 300 sec: 5650.2). Total num frames: 1240781824. Throughput: 0: 5809.1. Samples: 1240783124. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:11,498][25689] Avg episode reward: [(0, '-0.366')] [2022-07-11 13:30:12,036][26022] Updated weights on worker 0-0, policy_version 1211706 (0.00490) [2022-07-11 13:30:14,010][26022] Updated weights on worker 0-0, policy_version 1211716 (0.00091) [2022-07-11 13:30:15,824][26022] Updated weights on worker 0-0, policy_version 1211726 (0.00091) [2022-07-11 13:30:16,503][25689] Fps is (10 sec: 5667.9, 60 sec: 5637.3, 300 sec: 5648.1). Total num frames: 1240811520. Throughput: 0: 5799.3. Samples: 1240817248. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:16,505][25689] Avg episode reward: [(0, '-0.560')] [2022-07-11 13:30:17,639][26022] Updated weights on worker 0-0, policy_version 1211736 (0.00087) [2022-07-11 13:30:19,509][26022] Updated weights on worker 0-0, policy_version 1211746 (0.00090) [2022-07-11 13:30:21,248][26022] Updated weights on worker 0-0, policy_version 1211756 (0.00083) [2022-07-11 13:30:21,511][25689] Fps is (10 sec: 5726.3, 60 sec: 5639.2, 300 sec: 5649.7). Total num frames: 1240839168. Throughput: 0: 4931.9. Samples: 1240833806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:21,511][25689] Avg episode reward: [(0, '-1.247')] [2022-07-11 13:30:23,042][26022] Updated weights on worker 0-0, policy_version 1211766 (0.00086) [2022-07-11 13:30:24,820][26022] Updated weights on worker 0-0, policy_version 1211776 (0.00088) [2022-07-11 13:30:26,513][25689] Fps is (10 sec: 5524.3, 60 sec: 5613.0, 300 sec: 5641.2). Total num frames: 1240866816. Throughput: 0: 5894.9. Samples: 1240867828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:26,513][25689] Avg episode reward: [(0, '-1.304')] [2022-07-11 13:30:26,669][26022] Updated weights on worker 0-0, policy_version 1211786 (0.00085) [2022-07-11 13:30:28,515][26022] Updated weights on worker 0-0, policy_version 1211796 (0.00081) [2022-07-11 13:30:30,327][26022] Updated weights on worker 0-0, policy_version 1211806 (0.00085) [2022-07-11 13:30:31,639][25689] Fps is (10 sec: 5662.1, 60 sec: 5607.6, 300 sec: 5647.3). Total num frames: 1240896512. Throughput: 0: 5890.0. Samples: 1240901760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:31,639][25689] Avg episode reward: [(0, '-1.269')] [2022-07-11 13:30:32,139][26022] Updated weights on worker 0-0, policy_version 1211816 (0.00087) [2022-07-11 13:30:33,838][26022] Updated weights on worker 0-0, policy_version 1211826 (0.00083) [2022-07-11 13:30:36,027][26022] Updated weights on worker 0-0, policy_version 1211836 (0.00083) [2022-07-11 13:30:36,679][25689] Fps is (10 sec: 5741.5, 60 sec: 5641.8, 300 sec: 5643.9). Total num frames: 1240925184. Throughput: 0: 5039.6. Samples: 1240918924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:36,679][25689] Avg episode reward: [(0, '-0.333')] [2022-07-11 13:30:37,285][26022] Updated weights on worker 0-0, policy_version 1211846 (0.00087) [2022-07-11 13:30:39,475][26022] Updated weights on worker 0-0, policy_version 1211856 (0.00078) [2022-07-11 13:30:40,975][26022] Updated weights on worker 0-0, policy_version 1211866 (0.00083) [2022-07-11 13:30:41,768][25689] Fps is (10 sec: 5661.3, 60 sec: 5619.1, 300 sec: 5642.6). Total num frames: 1240953856. Throughput: 0: 5891.3. Samples: 1240953144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:41,768][25689] Avg episode reward: [(0, '-0.386')] [2022-07-11 13:30:43,023][26022] Updated weights on worker 0-0, policy_version 1211876 (0.00090) [2022-07-11 13:30:44,631][26022] Updated weights on worker 0-0, policy_version 1211886 (0.00086) [2022-07-11 13:30:46,612][26022] Updated weights on worker 0-0, policy_version 1211896 (0.00096) [2022-07-11 13:30:46,804][25689] Fps is (10 sec: 5663.3, 60 sec: 5650.7, 300 sec: 5646.1). Total num frames: 1240982528. Throughput: 0: 5885.9. Samples: 1240987262. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:46,805][25689] Avg episode reward: [(0, '-0.025')] [2022-07-11 13:30:48,343][26022] Updated weights on worker 0-0, policy_version 1211906 (0.00085) [2022-07-11 13:30:50,270][26022] Updated weights on worker 0-0, policy_version 1211916 (0.00096) [2022-07-11 13:30:51,894][25689] Fps is (10 sec: 5662.6, 60 sec: 5612.6, 300 sec: 5641.4). Total num frames: 1241011200. Throughput: 0: 5046.2. Samples: 1241003974. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:51,895][25689] Avg episode reward: [(0, '0.235')] [2022-07-11 13:30:51,957][26022] Updated weights on worker 0-0, policy_version 1211926 (0.00092) [2022-07-11 13:30:53,793][26022] Updated weights on worker 0-0, policy_version 1211936 (0.00087) [2022-07-11 13:30:55,406][26022] Updated weights on worker 0-0, policy_version 1211946 (0.00085) [2022-07-11 13:30:56,926][25689] Fps is (10 sec: 5665.3, 60 sec: 5629.4, 300 sec: 5644.6). Total num frames: 1241039872. Throughput: 0: 5910.2. Samples: 1241038592. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:30:56,927][25689] Avg episode reward: [(0, '1.065')] [2022-07-11 13:30:57,488][26022] Updated weights on worker 0-0, policy_version 1211956 (0.00088) [2022-07-11 13:30:59,133][26022] Updated weights on worker 0-0, policy_version 1211966 (0.00093) [2022-07-11 13:31:01,028][26022] Updated weights on worker 0-0, policy_version 1211976 (0.00090) [2022-07-11 13:31:01,934][25689] Fps is (10 sec: 5712.0, 60 sec: 5646.9, 300 sec: 5648.4). Total num frames: 1241068544. Throughput: 0: 5933.6. Samples: 1241072800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:01,934][25689] Avg episode reward: [(0, '1.130')] [2022-07-11 13:31:03,070][26022] Updated weights on worker 0-0, policy_version 1211986 (0.00106) [2022-07-11 13:31:04,722][26022] Updated weights on worker 0-0, policy_version 1211996 (0.00093) [2022-07-11 13:31:06,877][26022] Updated weights on worker 0-0, policy_version 1212006 (0.00088) [2022-07-11 13:31:06,955][25689] Fps is (10 sec: 5411.5, 60 sec: 5612.9, 300 sec: 5639.1). Total num frames: 1241094144. Throughput: 0: 4985.6. Samples: 1241087726. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:06,956][25689] Avg episode reward: [(0, '1.140')] [2022-07-11 13:31:08,609][26022] Updated weights on worker 0-0, policy_version 1212016 (0.00079) [2022-07-11 13:31:10,343][26022] Updated weights on worker 0-0, policy_version 1212026 (0.00083) [2022-07-11 13:31:12,014][25689] Fps is (10 sec: 5485.4, 60 sec: 5651.5, 300 sec: 5645.0). Total num frames: 1241123840. Throughput: 0: 5871.7. Samples: 1241122112. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:12,015][25689] Avg episode reward: [(0, '1.319')] [2022-07-11 13:31:12,071][26022] Updated weights on worker 0-0, policy_version 1212036 (0.00091) [2022-07-11 13:31:13,926][26022] Updated weights on worker 0-0, policy_version 1212046 (0.00084) [2022-07-11 13:31:15,781][26022] Updated weights on worker 0-0, policy_version 1212056 (0.00084) [2022-07-11 13:31:17,017][25689] Fps is (10 sec: 5801.1, 60 sec: 5635.0, 300 sec: 5641.7). Total num frames: 1241152512. Throughput: 0: 5862.6. Samples: 1241156376. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:17,018][25689] Avg episode reward: [(0, '1.566')] [2022-07-11 13:31:17,448][26022] Updated weights on worker 0-0, policy_version 1212066 (0.00086) [2022-07-11 13:31:19,383][26022] Updated weights on worker 0-0, policy_version 1212076 (0.00090) [2022-07-11 13:31:21,175][26022] Updated weights on worker 0-0, policy_version 1212086 (0.00094) [2022-07-11 13:31:22,027][25689] Fps is (10 sec: 5727.3, 60 sec: 5651.7, 300 sec: 5641.9). Total num frames: 1241181184. Throughput: 0: 5846.0. Samples: 1241190264. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:22,027][25689] Avg episode reward: [(0, '1.946')] [2022-07-11 13:31:23,030][26022] Updated weights on worker 0-0, policy_version 1212096 (0.00090) [2022-07-11 13:31:24,760][26022] Updated weights on worker 0-0, policy_version 1212106 (0.00082) [2022-07-11 13:31:26,649][26022] Updated weights on worker 0-0, policy_version 1212116 (0.01144) [2022-07-11 13:31:27,047][25689] Fps is (10 sec: 5615.1, 60 sec: 5650.0, 300 sec: 5642.5). Total num frames: 1241208832. Throughput: 0: 5941.1. Samples: 1241207094. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:27,049][25689] Avg episode reward: [(0, '1.941')] [2022-07-11 13:31:28,566][26022] Updated weights on worker 0-0, policy_version 1212126 (0.00092) [2022-07-11 13:31:30,223][26022] Updated weights on worker 0-0, policy_version 1212136 (0.00084) [2022-07-11 13:31:32,050][26022] Updated weights on worker 0-0, policy_version 1212146 (0.00088) [2022-07-11 13:31:32,178][25689] Fps is (10 sec: 5547.8, 60 sec: 5632.5, 300 sec: 5641.7). Total num frames: 1241237504. Throughput: 0: 5899.9. Samples: 1241241078. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:32,179][25689] Avg episode reward: [(0, '1.657')] [2022-07-11 13:31:33,861][26022] Updated weights on worker 0-0, policy_version 1212156 (0.00087) [2022-07-11 13:31:35,544][26022] Updated weights on worker 0-0, policy_version 1212166 (0.00094) [2022-07-11 13:31:37,187][25689] Fps is (10 sec: 5554.3, 60 sec: 5618.5, 300 sec: 5639.2). Total num frames: 1241265152. Throughput: 0: 5900.0. Samples: 1241275380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:37,187][25689] Avg episode reward: [(0, '1.220')] [2022-07-11 13:31:37,519][26022] Updated weights on worker 0-0, policy_version 1212176 (0.00084) [2022-07-11 13:31:39,350][26022] Updated weights on worker 0-0, policy_version 1212186 (0.00087) [2022-07-11 13:31:41,040][26022] Updated weights on worker 0-0, policy_version 1212196 (0.00090) [2022-07-11 13:31:42,204][25689] Fps is (10 sec: 5719.7, 60 sec: 5642.2, 300 sec: 5636.1). Total num frames: 1241294848. Throughput: 0: 5048.6. Samples: 1241292132. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:42,205][25689] Avg episode reward: [(0, '1.581')] [2022-07-11 13:31:43,079][26022] Updated weights on worker 0-0, policy_version 1212206 (0.00085) [2022-07-11 13:31:44,557][26022] Updated weights on worker 0-0, policy_version 1212216 (0.00090) [2022-07-11 13:31:46,503][26022] Updated weights on worker 0-0, policy_version 1212226 (0.00098) [2022-07-11 13:31:47,234][25689] Fps is (10 sec: 5809.6, 60 sec: 5642.8, 300 sec: 5643.3). Total num frames: 1241323520. Throughput: 0: 5909.1. Samples: 1241326380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:47,234][25689] Avg episode reward: [(0, '1.265')] [2022-07-11 13:31:48,429][26022] Updated weights on worker 0-0, policy_version 1212236 (0.00082) [2022-07-11 13:31:49,975][26022] Updated weights on worker 0-0, policy_version 1212246 (0.00081) [2022-07-11 13:31:52,144][26022] Updated weights on worker 0-0, policy_version 1212256 (0.00089) [2022-07-11 13:31:52,315][25689] Fps is (10 sec: 5570.5, 60 sec: 5626.7, 300 sec: 5639.3). Total num frames: 1241351168. Throughput: 0: 5913.7. Samples: 1241360158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:52,315][25689] Avg episode reward: [(0, '1.053')] [2022-07-11 13:31:53,638][26022] Updated weights on worker 0-0, policy_version 1212266 (0.00087) [2022-07-11 13:31:55,654][26022] Updated weights on worker 0-0, policy_version 1212276 (0.00099) [2022-07-11 13:31:57,370][25689] Fps is (10 sec: 5556.5, 60 sec: 5624.6, 300 sec: 5635.2). Total num frames: 1241379840. Throughput: 0: 5042.0. Samples: 1241377142. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:31:57,370][25689] Avg episode reward: [(0, '0.864')] [2022-07-11 13:31:57,457][26022] Updated weights on worker 0-0, policy_version 1212286 (0.00078) [2022-07-11 13:31:59,195][26022] Updated weights on worker 0-0, policy_version 1212296 (0.00085) [2022-07-11 13:32:01,093][26022] Updated weights on worker 0-0, policy_version 1212306 (0.00086) [2022-07-11 13:32:02,445][25689] Fps is (10 sec: 5458.3, 60 sec: 5584.4, 300 sec: 5642.0). Total num frames: 1241406464. Throughput: 0: 5881.6. Samples: 1241411182. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:02,446][25689] Avg episode reward: [(0, '1.129')] [2022-07-11 13:32:03,257][26022] Updated weights on worker 0-0, policy_version 1212316 (0.00084) [2022-07-11 13:32:05,007][26022] Updated weights on worker 0-0, policy_version 1212326 (0.00081) [2022-07-11 13:32:06,819][26022] Updated weights on worker 0-0, policy_version 1212336 (0.00083) [2022-07-11 13:32:07,455][25689] Fps is (10 sec: 5381.3, 60 sec: 5619.3, 300 sec: 5629.2). Total num frames: 1241434112. Throughput: 0: 5789.6. Samples: 1241443452. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:07,455][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 13:32:08,490][26022] Updated weights on worker 0-0, policy_version 1212346 (0.00086) [2022-07-11 13:32:10,379][26022] Updated weights on worker 0-0, policy_version 1212356 (0.00086) [2022-07-11 13:32:10,748][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:32:10,764][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001212358_1241454592.pth [2022-07-11 13:32:10,764][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001210372_1239420928.pth [2022-07-11 13:32:12,223][26022] Updated weights on worker 0-0, policy_version 1212366 (0.00095) [2022-07-11 13:32:12,547][25689] Fps is (10 sec: 5676.9, 60 sec: 5616.3, 300 sec: 5638.2). Total num frames: 1241463808. Throughput: 0: 4963.9. Samples: 1241460588. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:12,547][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 13:32:13,982][26022] Updated weights on worker 0-0, policy_version 1212376 (0.00087) [2022-07-11 13:32:15,763][26022] Updated weights on worker 0-0, policy_version 1212386 (0.00086) [2022-07-11 13:32:17,486][26022] Updated weights on worker 0-0, policy_version 1212396 (0.00091) [2022-07-11 13:32:17,577][25689] Fps is (10 sec: 5867.4, 60 sec: 5630.6, 300 sec: 5641.2). Total num frames: 1241493504. Throughput: 0: 5837.1. Samples: 1241495096. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:17,578][25689] Avg episode reward: [(0, '1.198')] [2022-07-11 13:32:19,018][26022] Updated weights on worker 0-0, policy_version 1212406 (0.00087) [2022-07-11 13:32:21,042][26022] Updated weights on worker 0-0, policy_version 1212416 (0.00101) [2022-07-11 13:32:22,611][25689] Fps is (10 sec: 5799.6, 60 sec: 5628.4, 300 sec: 5638.1). Total num frames: 1241522176. Throughput: 0: 5843.2. Samples: 1241529012. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:22,611][25689] Avg episode reward: [(0, '1.425')] [2022-07-11 13:32:23,174][26022] Updated weights on worker 0-0, policy_version 1212426 (0.00084) [2022-07-11 13:32:24,450][26022] Updated weights on worker 0-0, policy_version 1212436 (0.00080) [2022-07-11 13:32:26,555][26022] Updated weights on worker 0-0, policy_version 1212446 (0.00085) [2022-07-11 13:32:27,700][25689] Fps is (10 sec: 5765.9, 60 sec: 5655.8, 300 sec: 5649.6). Total num frames: 1241551872. Throughput: 0: 5074.0. Samples: 1241546176. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:27,701][25689] Avg episode reward: [(0, '0.922')] [2022-07-11 13:32:28,148][26022] Updated weights on worker 0-0, policy_version 1212456 (0.00096) [2022-07-11 13:32:30,259][26022] Updated weights on worker 0-0, policy_version 1212466 (0.00094) [2022-07-11 13:32:31,836][26022] Updated weights on worker 0-0, policy_version 1212476 (0.00088) [2022-07-11 13:32:32,800][25689] Fps is (10 sec: 5627.6, 60 sec: 5641.8, 300 sec: 5637.9). Total num frames: 1241579520. Throughput: 0: 5896.8. Samples: 1241580020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:32,801][25689] Avg episode reward: [(0, '-0.528')] [2022-07-11 13:32:33,696][26022] Updated weights on worker 0-0, policy_version 1212486 (0.00091) [2022-07-11 13:32:35,532][26022] Updated weights on worker 0-0, policy_version 1212496 (0.00094) [2022-07-11 13:32:37,244][26022] Updated weights on worker 0-0, policy_version 1212506 (0.00076) [2022-07-11 13:32:37,856][25689] Fps is (10 sec: 5545.4, 60 sec: 5654.2, 300 sec: 5640.5). Total num frames: 1241608192. Throughput: 0: 5879.2. Samples: 1241614320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2022-07-11 13:32:37,857][25689] Avg episode reward: [(0, '-0.203')] [2022-07-11 13:32:39,167][26022] Updated weights on worker 0-0, policy_version 1212516 (0.00088) [2022-07-11 13:32:40,983][26022] Updated weights on worker 0-0, policy_version 1212526 (0.00094) [2022-07-11 13:32:42,652][26022] Updated weights on worker 0-0, policy_version 1212536 (0.00062) [2022-07-11 13:32:42,903][25689] Fps is (10 sec: 5777.3, 60 sec: 5651.5, 300 sec: 5646.7). Total num frames: 1241637888. Throughput: 0: 5050.1. Samples: 1241631492. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:32:42,904][25689] Avg episode reward: [(0, '-1.289')] [2022-07-11 13:32:44,478][26022] Updated weights on worker 0-0, policy_version 1212546 (0.00087) [2022-07-11 13:32:46,396][26022] Updated weights on worker 0-0, policy_version 1212556 (0.00082) [2022-07-11 13:32:47,962][25689] Fps is (10 sec: 5877.0, 60 sec: 5665.6, 300 sec: 5644.1). Total num frames: 1241667584. Throughput: 0: 5891.3. Samples: 1241665546. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:32:47,963][25689] Avg episode reward: [(0, '-2.032')] [2022-07-11 13:32:47,967][26022] Updated weights on worker 0-0, policy_version 1212566 (0.00084) [2022-07-11 13:32:50,076][26022] Updated weights on worker 0-0, policy_version 1212576 (0.00084) [2022-07-11 13:32:51,653][26022] Updated weights on worker 0-0, policy_version 1212586 (0.00089) [2022-07-11 13:32:53,068][25689] Fps is (10 sec: 5540.5, 60 sec: 5646.4, 300 sec: 5636.6). Total num frames: 1241694208. Throughput: 0: 5910.1. Samples: 1241699806. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:32:53,069][25689] Avg episode reward: [(0, '-1.151')] [2022-07-11 13:32:53,679][26022] Updated weights on worker 0-0, policy_version 1212596 (0.00074) [2022-07-11 13:32:55,335][26022] Updated weights on worker 0-0, policy_version 1212606 (0.00083) [2022-07-11 13:32:57,191][26022] Updated weights on worker 0-0, policy_version 1212616 (0.00095) [2022-07-11 13:32:58,110][25689] Fps is (10 sec: 5650.6, 60 sec: 5681.3, 300 sec: 5646.6). Total num frames: 1241724928. Throughput: 0: 5061.3. Samples: 1241716832. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:32:58,111][25689] Avg episode reward: [(0, '-0.666')] [2022-07-11 13:32:58,977][26022] Updated weights on worker 0-0, policy_version 1212626 (0.00093) [2022-07-11 13:33:00,739][26022] Updated weights on worker 0-0, policy_version 1212636 (0.00088) [2022-07-11 13:33:02,791][26022] Updated weights on worker 0-0, policy_version 1212646 (0.00086) [2022-07-11 13:33:03,144][25689] Fps is (10 sec: 5589.5, 60 sec: 5668.3, 300 sec: 5642.8). Total num frames: 1241750528. Throughput: 0: 5913.4. Samples: 1241751186. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:03,145][25689] Avg episode reward: [(0, '0.398')] [2022-07-11 13:33:04,811][26022] Updated weights on worker 0-0, policy_version 1212656 (0.00089) [2022-07-11 13:33:06,385][26022] Updated weights on worker 0-0, policy_version 1212666 (0.00083) [2022-07-11 13:33:08,179][25689] Fps is (10 sec: 5288.7, 60 sec: 5666.0, 300 sec: 5636.5). Total num frames: 1241778176. Throughput: 0: 5827.2. Samples: 1241783352. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:08,179][25689] Avg episode reward: [(0, '0.860')] [2022-07-11 13:33:08,431][26022] Updated weights on worker 0-0, policy_version 1212676 (0.00096) [2022-07-11 13:33:09,980][26022] Updated weights on worker 0-0, policy_version 1212686 (0.00089) [2022-07-11 13:33:12,099][26022] Updated weights on worker 0-0, policy_version 1212696 (0.00081) [2022-07-11 13:33:13,228][25689] Fps is (10 sec: 5687.0, 60 sec: 5670.0, 300 sec: 5639.2). Total num frames: 1241807872. Throughput: 0: 4971.7. Samples: 1241800032. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:13,228][25689] Avg episode reward: [(0, '1.189')] [2022-07-11 13:33:13,604][26022] Updated weights on worker 0-0, policy_version 1212706 (0.00095) [2022-07-11 13:33:15,495][26022] Updated weights on worker 0-0, policy_version 1212716 (0.00084) [2022-07-11 13:33:17,399][26022] Updated weights on worker 0-0, policy_version 1212726 (0.00083) [2022-07-11 13:33:18,235][25689] Fps is (10 sec: 5702.4, 60 sec: 5638.5, 300 sec: 5639.2). Total num frames: 1241835520. Throughput: 0: 5833.4. Samples: 1241834226. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:18,235][25689] Avg episode reward: [(0, '1.780')] [2022-07-11 13:33:19,260][26022] Updated weights on worker 0-0, policy_version 1212736 (0.00091) [2022-07-11 13:33:20,920][26022] Updated weights on worker 0-0, policy_version 1212746 (0.00094) [2022-07-11 13:33:22,949][26022] Updated weights on worker 0-0, policy_version 1212756 (0.00085) [2022-07-11 13:33:23,250][25689] Fps is (10 sec: 5517.4, 60 sec: 5623.3, 300 sec: 5635.9). Total num frames: 1241863168. Throughput: 0: 5822.4. Samples: 1241868248. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:23,251][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 13:33:24,431][26022] Updated weights on worker 0-0, policy_version 1212766 (0.00086) [2022-07-11 13:33:26,488][26022] Updated weights on worker 0-0, policy_version 1212776 (0.00840) [2022-07-11 13:33:28,115][26022] Updated weights on worker 0-0, policy_version 1212786 (0.00063) [2022-07-11 13:33:28,261][25689] Fps is (10 sec: 5719.1, 60 sec: 5630.5, 300 sec: 5637.4). Total num frames: 1241892864. Throughput: 0: 5080.0. Samples: 1241885372. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:28,262][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 13:33:30,163][26022] Updated weights on worker 0-0, policy_version 1212796 (0.00087) [2022-07-11 13:33:32,003][26022] Updated weights on worker 0-0, policy_version 1212806 (0.00089) [2022-07-11 13:33:33,330][25689] Fps is (10 sec: 5688.6, 60 sec: 5633.4, 300 sec: 5630.7). Total num frames: 1241920512. Throughput: 0: 5914.2. Samples: 1241918922. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:33,331][25689] Avg episode reward: [(0, '1.115')] [2022-07-11 13:33:33,606][26022] Updated weights on worker 0-0, policy_version 1212816 (0.00090) [2022-07-11 13:33:35,521][26022] Updated weights on worker 0-0, policy_version 1212826 (0.00082) [2022-07-11 13:33:37,107][26022] Updated weights on worker 0-0, policy_version 1212836 (0.00447) [2022-07-11 13:33:38,340][25689] Fps is (10 sec: 5588.0, 60 sec: 5637.7, 300 sec: 5638.2). Total num frames: 1241949184. Throughput: 0: 5906.4. Samples: 1241952978. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:38,341][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 13:33:39,152][26022] Updated weights on worker 0-0, policy_version 1212846 (0.00091) [2022-07-11 13:33:41,124][26022] Updated weights on worker 0-0, policy_version 1212856 (0.00087) [2022-07-11 13:33:42,671][26022] Updated weights on worker 0-0, policy_version 1212866 (0.00081) [2022-07-11 13:33:43,369][25689] Fps is (10 sec: 5610.4, 60 sec: 5605.6, 300 sec: 5630.9). Total num frames: 1241976832. Throughput: 0: 5885.5. Samples: 1241986658. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:43,369][25689] Avg episode reward: [(0, '0.127')] [2022-07-11 13:33:44,833][26022] Updated weights on worker 0-0, policy_version 1212876 (0.00094) [2022-07-11 13:33:46,489][26022] Updated weights on worker 0-0, policy_version 1212886 (0.00080) [2022-07-11 13:33:48,344][26022] Updated weights on worker 0-0, policy_version 1212896 (0.00090) [2022-07-11 13:33:48,396][25689] Fps is (10 sec: 5601.0, 60 sec: 5591.6, 300 sec: 5632.7). Total num frames: 1242005504. Throughput: 0: 5871.8. Samples: 1242003596. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:48,396][25689] Avg episode reward: [(0, '0.355')] [2022-07-11 13:33:49,959][26022] Updated weights on worker 0-0, policy_version 1212906 (0.00073) [2022-07-11 13:33:51,935][26022] Updated weights on worker 0-0, policy_version 1212916 (0.00079) [2022-07-11 13:33:53,546][25689] Fps is (10 sec: 5735.4, 60 sec: 5638.4, 300 sec: 5630.5). Total num frames: 1242035200. Throughput: 0: 5854.4. Samples: 1242037270. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:53,547][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 13:33:53,715][26022] Updated weights on worker 0-0, policy_version 1212926 (0.00086) [2022-07-11 13:33:55,708][26022] Updated weights on worker 0-0, policy_version 1212936 (0.00083) [2022-07-11 13:33:57,113][26022] Updated weights on worker 0-0, policy_version 1212946 (0.00082) [2022-07-11 13:33:58,647][25689] Fps is (10 sec: 5594.1, 60 sec: 5582.1, 300 sec: 5628.7). Total num frames: 1242062848. Throughput: 0: 5837.5. Samples: 1242071516. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:33:58,651][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 13:33:59,241][26022] Updated weights on worker 0-0, policy_version 1212956 (0.00087) [2022-07-11 13:34:00,972][26022] Updated weights on worker 0-0, policy_version 1212966 (0.00084) [2022-07-11 13:34:03,143][26022] Updated weights on worker 0-0, policy_version 1212976 (0.00096) [2022-07-11 13:34:03,674][25689] Fps is (10 sec: 5560.4, 60 sec: 5633.4, 300 sec: 5635.4). Total num frames: 1242091520. Throughput: 0: 4998.6. Samples: 1242088158. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:03,675][25689] Avg episode reward: [(0, '1.182')] [2022-07-11 13:34:04,839][26022] Updated weights on worker 0-0, policy_version 1212986 (0.00093) [2022-07-11 13:34:06,720][26022] Updated weights on worker 0-0, policy_version 1212996 (0.00084) [2022-07-11 13:34:08,436][26022] Updated weights on worker 0-0, policy_version 1213006 (0.00087) [2022-07-11 13:34:08,749][25689] Fps is (10 sec: 5574.7, 60 sec: 5629.6, 300 sec: 5633.4). Total num frames: 1242119168. Throughput: 0: 5769.6. Samples: 1242121030. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:08,751][25689] Avg episode reward: [(0, '1.743')] [2022-07-11 13:34:10,344][26022] Updated weights on worker 0-0, policy_version 1213016 (0.00083) [2022-07-11 13:34:10,862][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:34:10,874][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001213019_1242131456.pth [2022-07-11 13:34:10,874][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001211036_1240100864.pth [2022-07-11 13:34:12,075][26022] Updated weights on worker 0-0, policy_version 1213026 (0.00092) [2022-07-11 13:34:13,804][25689] Fps is (10 sec: 5560.1, 60 sec: 5612.3, 300 sec: 5632.5). Total num frames: 1242147840. Throughput: 0: 5807.2. Samples: 1242154914. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:13,804][25689] Avg episode reward: [(0, '2.236')] [2022-07-11 13:34:13,940][26022] Updated weights on worker 0-0, policy_version 1213036 (0.00086) [2022-07-11 13:34:15,619][26022] Updated weights on worker 0-0, policy_version 1213046 (0.00083) [2022-07-11 13:34:17,564][26022] Updated weights on worker 0-0, policy_version 1213056 (0.00083) [2022-07-11 13:34:18,814][25689] Fps is (10 sec: 5595.7, 60 sec: 5611.9, 300 sec: 5632.9). Total num frames: 1242175488. Throughput: 0: 4973.5. Samples: 1242171820. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:18,815][25689] Avg episode reward: [(0, '2.349')] [2022-07-11 13:34:19,296][26022] Updated weights on worker 0-0, policy_version 1213066 (0.00095) [2022-07-11 13:34:21,176][26022] Updated weights on worker 0-0, policy_version 1213076 (0.00087) [2022-07-11 13:34:23,110][26022] Updated weights on worker 0-0, policy_version 1213086 (0.00096) [2022-07-11 13:34:23,909][25689] Fps is (10 sec: 5573.2, 60 sec: 5621.4, 300 sec: 5629.2). Total num frames: 1242204160. Throughput: 0: 5802.5. Samples: 1242205570. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:23,910][25689] Avg episode reward: [(0, '2.117')] [2022-07-11 13:34:24,815][26022] Updated weights on worker 0-0, policy_version 1213096 (0.00085) [2022-07-11 13:34:26,832][26022] Updated weights on worker 0-0, policy_version 1213106 (0.00091) [2022-07-11 13:34:28,461][26022] Updated weights on worker 0-0, policy_version 1213116 (0.00092) [2022-07-11 13:34:29,003][25689] Fps is (10 sec: 5728.9, 60 sec: 5613.9, 300 sec: 5628.8). Total num frames: 1242233856. Throughput: 0: 5849.8. Samples: 1242239508. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:29,003][25689] Avg episode reward: [(0, '1.787')] [2022-07-11 13:34:30,414][26022] Updated weights on worker 0-0, policy_version 1213126 (0.00087) [2022-07-11 13:34:32,043][26022] Updated weights on worker 0-0, policy_version 1213136 (0.00084) [2022-07-11 13:34:33,743][26022] Updated weights on worker 0-0, policy_version 1213146 (0.00085) [2022-07-11 13:34:34,124][25689] Fps is (10 sec: 5613.7, 60 sec: 5609.0, 300 sec: 5630.7). Total num frames: 1242261504. Throughput: 0: 4984.8. Samples: 1242256208. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:34,125][25689] Avg episode reward: [(0, '1.373')] [2022-07-11 13:34:35,738][26022] Updated weights on worker 0-0, policy_version 1213156 (0.00085) [2022-07-11 13:34:37,706][26022] Updated weights on worker 0-0, policy_version 1213166 (0.00098) [2022-07-11 13:34:39,202][25689] Fps is (10 sec: 5522.0, 60 sec: 5602.7, 300 sec: 5626.3). Total num frames: 1242290176. Throughput: 0: 5795.1. Samples: 1242289968. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:39,202][25689] Avg episode reward: [(0, '1.427')] [2022-07-11 13:34:39,464][26022] Updated weights on worker 0-0, policy_version 1213176 (0.00082) [2022-07-11 13:34:41,344][26022] Updated weights on worker 0-0, policy_version 1213186 (0.00085) [2022-07-11 13:34:42,867][26022] Updated weights on worker 0-0, policy_version 1213196 (0.00084) [2022-07-11 13:34:44,271][25689] Fps is (10 sec: 5752.2, 60 sec: 5632.6, 300 sec: 5635.6). Total num frames: 1242319872. Throughput: 0: 5812.7. Samples: 1242323928. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:44,272][25689] Avg episode reward: [(0, '1.124')] [2022-07-11 13:34:45,067][26022] Updated weights on worker 0-0, policy_version 1213206 (0.00087) [2022-07-11 13:34:46,564][26022] Updated weights on worker 0-0, policy_version 1213216 (0.00080) [2022-07-11 13:34:48,648][26022] Updated weights on worker 0-0, policy_version 1213226 (0.00089) [2022-07-11 13:34:49,323][25689] Fps is (10 sec: 5666.0, 60 sec: 5613.5, 300 sec: 5625.1). Total num frames: 1242347520. Throughput: 0: 4986.1. Samples: 1242340826. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:49,324][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 13:34:50,296][26022] Updated weights on worker 0-0, policy_version 1213236 (0.00085) [2022-07-11 13:34:52,268][26022] Updated weights on worker 0-0, policy_version 1213246 (0.00081) [2022-07-11 13:34:53,732][26022] Updated weights on worker 0-0, policy_version 1213256 (0.00811) [2022-07-11 13:34:54,413][25689] Fps is (10 sec: 5553.4, 60 sec: 5602.2, 300 sec: 5627.4). Total num frames: 1242376192. Throughput: 0: 5850.8. Samples: 1242374914. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:54,414][25689] Avg episode reward: [(0, '0.309')] [2022-07-11 13:34:55,906][26022] Updated weights on worker 0-0, policy_version 1213266 (0.00080) [2022-07-11 13:34:57,606][26022] Updated weights on worker 0-0, policy_version 1213276 (0.00085) [2022-07-11 13:34:59,420][25689] Fps is (10 sec: 5679.1, 60 sec: 5627.7, 300 sec: 5631.0). Total num frames: 1242404864. Throughput: 0: 5894.9. Samples: 1242409152. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:34:59,421][25689] Avg episode reward: [(0, '-0.029')] [2022-07-11 13:34:59,424][26022] Updated weights on worker 0-0, policy_version 1213286 (0.00091) [2022-07-11 13:35:01,085][26022] Updated weights on worker 0-0, policy_version 1213296 (0.00085) [2022-07-11 13:35:03,404][26022] Updated weights on worker 0-0, policy_version 1213306 (0.00091) [2022-07-11 13:35:04,465][25689] Fps is (10 sec: 5501.1, 60 sec: 5592.5, 300 sec: 5627.1). Total num frames: 1242431488. Throughput: 0: 5828.4. Samples: 1242441622. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:04,467][25689] Avg episode reward: [(0, '0.020')] [2022-07-11 13:35:04,978][26022] Updated weights on worker 0-0, policy_version 1213316 (0.00089) [2022-07-11 13:35:07,021][26022] Updated weights on worker 0-0, policy_version 1213326 (0.00085) [2022-07-11 13:35:08,530][26022] Updated weights on worker 0-0, policy_version 1213336 (0.00089) [2022-07-11 13:35:09,538][25689] Fps is (10 sec: 5464.9, 60 sec: 5609.4, 300 sec: 5631.2). Total num frames: 1242460160. Throughput: 0: 5839.9. Samples: 1242458884. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:09,539][25689] Avg episode reward: [(0, '0.176')] [2022-07-11 13:35:10,522][26022] Updated weights on worker 0-0, policy_version 1213346 (0.00084) [2022-07-11 13:35:12,185][26022] Updated weights on worker 0-0, policy_version 1213356 (0.00090) [2022-07-11 13:35:14,085][26022] Updated weights on worker 0-0, policy_version 1213366 (0.00086) [2022-07-11 13:35:14,590][25689] Fps is (10 sec: 5866.0, 60 sec: 5643.4, 300 sec: 5633.8). Total num frames: 1242490880. Throughput: 0: 5855.6. Samples: 1242493060. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:14,590][25689] Avg episode reward: [(0, '0.921')] [2022-07-11 13:35:15,893][26022] Updated weights on worker 0-0, policy_version 1213376 (0.00086) [2022-07-11 13:35:17,591][26022] Updated weights on worker 0-0, policy_version 1213386 (0.00087) [2022-07-11 13:35:19,475][26022] Updated weights on worker 0-0, policy_version 1213396 (0.00085) [2022-07-11 13:35:19,664][25689] Fps is (10 sec: 5663.2, 60 sec: 5620.7, 300 sec: 5629.1). Total num frames: 1242517504. Throughput: 0: 5837.6. Samples: 1242527330. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:19,665][25689] Avg episode reward: [(0, '0.983')] [2022-07-11 13:35:21,259][26022] Updated weights on worker 0-0, policy_version 1213406 (0.00086) [2022-07-11 13:35:23,057][26022] Updated weights on worker 0-0, policy_version 1213416 (0.00093) [2022-07-11 13:35:24,710][25689] Fps is (10 sec: 5564.7, 60 sec: 5642.0, 300 sec: 5635.1). Total num frames: 1242547200. Throughput: 0: 5083.7. Samples: 1242544546. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:24,711][25689] Avg episode reward: [(0, '1.280')] [2022-07-11 13:35:24,790][26022] Updated weights on worker 0-0, policy_version 1213426 (0.00091) [2022-07-11 13:35:26,533][26022] Updated weights on worker 0-0, policy_version 1213436 (0.00086) [2022-07-11 13:35:28,519][26022] Updated weights on worker 0-0, policy_version 1213446 (0.00088) [2022-07-11 13:35:29,774][25689] Fps is (10 sec: 5874.6, 60 sec: 5644.8, 300 sec: 5636.3). Total num frames: 1242576896. Throughput: 0: 5901.6. Samples: 1242578308. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:29,775][25689] Avg episode reward: [(0, '1.208')] [2022-07-11 13:35:30,164][26022] Updated weights on worker 0-0, policy_version 1213456 (0.00090) [2022-07-11 13:35:32,177][26022] Updated weights on worker 0-0, policy_version 1213466 (0.00088) [2022-07-11 13:35:33,879][26022] Updated weights on worker 0-0, policy_version 1213476 (0.00083) [2022-07-11 13:35:34,853][25689] Fps is (10 sec: 5552.7, 60 sec: 5631.8, 300 sec: 5628.7). Total num frames: 1242603520. Throughput: 0: 5889.9. Samples: 1242612412. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:34,854][25689] Avg episode reward: [(0, '1.632')] [2022-07-11 13:35:35,523][26022] Updated weights on worker 0-0, policy_version 1213486 (0.00091) [2022-07-11 13:35:37,474][26022] Updated weights on worker 0-0, policy_version 1213496 (0.00090) [2022-07-11 13:35:39,279][26022] Updated weights on worker 0-0, policy_version 1213506 (0.00090) [2022-07-11 13:35:39,870][25689] Fps is (10 sec: 5578.4, 60 sec: 5654.4, 300 sec: 5633.5). Total num frames: 1242633216. Throughput: 0: 5066.0. Samples: 1242629694. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:39,871][25689] Avg episode reward: [(0, '0.802')] [2022-07-11 13:35:40,995][26022] Updated weights on worker 0-0, policy_version 1213516 (0.00086) [2022-07-11 13:35:42,832][26022] Updated weights on worker 0-0, policy_version 1213526 (0.00087) [2022-07-11 13:35:44,428][26022] Updated weights on worker 0-0, policy_version 1213536 (0.00087) [2022-07-11 13:35:44,918][25689] Fps is (10 sec: 5901.0, 60 sec: 5656.4, 300 sec: 5636.7). Total num frames: 1242662912. Throughput: 0: 5913.3. Samples: 1242664040. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:44,918][25689] Avg episode reward: [(0, '0.619')] [2022-07-11 13:35:46,406][26022] Updated weights on worker 0-0, policy_version 1213546 (0.00084) [2022-07-11 13:35:48,141][26022] Updated weights on worker 0-0, policy_version 1213556 (0.00051) [2022-07-11 13:35:49,940][25689] Fps is (10 sec: 5694.6, 60 sec: 5659.1, 300 sec: 5634.6). Total num frames: 1242690560. Throughput: 0: 5968.3. Samples: 1242698664. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:49,941][25689] Avg episode reward: [(0, '-0.208')] [2022-07-11 13:35:49,942][26022] Updated weights on worker 0-0, policy_version 1213566 (0.00084) [2022-07-11 13:35:51,817][26022] Updated weights on worker 0-0, policy_version 1213576 (0.00092) [2022-07-11 13:35:53,500][26022] Updated weights on worker 0-0, policy_version 1213586 (0.00091) [2022-07-11 13:35:55,015][25689] Fps is (10 sec: 5679.5, 60 sec: 5677.5, 300 sec: 5637.2). Total num frames: 1242720256. Throughput: 0: 5127.2. Samples: 1242715784. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:35:55,015][25689] Avg episode reward: [(0, '-0.864')] [2022-07-11 13:35:55,574][26022] Updated weights on worker 0-0, policy_version 1213596 (0.00078) [2022-07-11 13:35:57,046][26022] Updated weights on worker 0-0, policy_version 1213606 (0.00085) [2022-07-11 13:35:59,036][26022] Updated weights on worker 0-0, policy_version 1213616 (0.00081) [2022-07-11 13:36:00,058][25689] Fps is (10 sec: 5769.0, 60 sec: 5674.1, 300 sec: 5636.5). Total num frames: 1242748928. Throughput: 0: 5940.7. Samples: 1242749622. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:00,058][25689] Avg episode reward: [(0, '-0.900')] [2022-07-11 13:36:00,787][26022] Updated weights on worker 0-0, policy_version 1213626 (0.00080) [2022-07-11 13:36:03,117][26022] Updated weights on worker 0-0, policy_version 1213636 (0.01238) [2022-07-11 13:36:04,764][26022] Updated weights on worker 0-0, policy_version 1213646 (0.00088) [2022-07-11 13:36:05,085][25689] Fps is (10 sec: 5491.1, 60 sec: 5675.7, 300 sec: 5639.9). Total num frames: 1242775552. Throughput: 0: 5842.2. Samples: 1242781858. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:05,085][25689] Avg episode reward: [(0, '-0.795')] [2022-07-11 13:36:06,430][26022] Updated weights on worker 0-0, policy_version 1213656 (0.00077) [2022-07-11 13:36:08,425][26022] Updated weights on worker 0-0, policy_version 1213666 (0.00081) [2022-07-11 13:36:10,065][26022] Updated weights on worker 0-0, policy_version 1213676 (0.00087) [2022-07-11 13:36:10,128][25689] Fps is (10 sec: 5490.8, 60 sec: 5678.6, 300 sec: 5636.7). Total num frames: 1242804224. Throughput: 0: 4958.6. Samples: 1242798770. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:10,131][25689] Avg episode reward: [(0, '-0.778')] [2022-07-11 13:36:11,088][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:36:11,108][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001213680_1242808320.pth [2022-07-11 13:36:11,108][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001211697_1240777728.pth [2022-07-11 13:36:12,056][26022] Updated weights on worker 0-0, policy_version 1213686 (0.00091) [2022-07-11 13:36:13,892][26022] Updated weights on worker 0-0, policy_version 1213696 (0.00085) [2022-07-11 13:36:15,215][25689] Fps is (10 sec: 5559.5, 60 sec: 5624.5, 300 sec: 5631.7). Total num frames: 1242831872. Throughput: 0: 5792.5. Samples: 1242832796. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:15,216][25689] Avg episode reward: [(0, '-0.806')] [2022-07-11 13:36:15,568][26022] Updated weights on worker 0-0, policy_version 1213706 (0.00086) [2022-07-11 13:36:17,495][26022] Updated weights on worker 0-0, policy_version 1213716 (0.00100) [2022-07-11 13:36:19,147][26022] Updated weights on worker 0-0, policy_version 1213726 (0.00092) [2022-07-11 13:36:20,291][25689] Fps is (10 sec: 5541.8, 60 sec: 5658.2, 300 sec: 5630.4). Total num frames: 1242860544. Throughput: 0: 5797.9. Samples: 1242866934. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:20,293][25689] Avg episode reward: [(0, '-0.667')] [2022-07-11 13:36:21,114][26022] Updated weights on worker 0-0, policy_version 1213736 (0.00085) [2022-07-11 13:36:22,656][26022] Updated weights on worker 0-0, policy_version 1213746 (0.00087) [2022-07-11 13:36:24,739][26022] Updated weights on worker 0-0, policy_version 1213756 (0.00088) [2022-07-11 13:36:25,348][25689] Fps is (10 sec: 5760.3, 60 sec: 5657.2, 300 sec: 5636.6). Total num frames: 1242890240. Throughput: 0: 5046.8. Samples: 1242884122. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:25,351][25689] Avg episode reward: [(0, '-1.062')] [2022-07-11 13:36:26,588][26022] Updated weights on worker 0-0, policy_version 1213766 (0.00080) [2022-07-11 13:36:28,163][26022] Updated weights on worker 0-0, policy_version 1213776 (0.00082) [2022-07-11 13:36:29,930][26022] Updated weights on worker 0-0, policy_version 1213786 (0.00090) [2022-07-11 13:36:30,360][25689] Fps is (10 sec: 5694.9, 60 sec: 5628.2, 300 sec: 5635.4). Total num frames: 1242917888. Throughput: 0: 5908.2. Samples: 1242918306. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:30,362][25689] Avg episode reward: [(0, '-1.457')] [2022-07-11 13:36:31,740][26022] Updated weights on worker 0-0, policy_version 1213796 (0.01026) [2022-07-11 13:36:33,670][26022] Updated weights on worker 0-0, policy_version 1213806 (0.00089) [2022-07-11 13:36:35,481][25689] Fps is (10 sec: 5659.3, 60 sec: 5675.0, 300 sec: 5640.2). Total num frames: 1242947584. Throughput: 0: 5917.0. Samples: 1242952708. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:35,481][25689] Avg episode reward: [(0, '-0.895')] [2022-07-11 13:36:35,484][26022] Updated weights on worker 0-0, policy_version 1213816 (0.00093) [2022-07-11 13:36:37,059][26022] Updated weights on worker 0-0, policy_version 1213826 (0.00516) [2022-07-11 13:36:39,025][26022] Updated weights on worker 0-0, policy_version 1213836 (0.00088) [2022-07-11 13:36:40,504][25689] Fps is (10 sec: 5754.4, 60 sec: 5657.6, 300 sec: 5636.6). Total num frames: 1242976256. Throughput: 0: 5098.0. Samples: 1242969978. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 13:36:40,505][25689] Avg episode reward: [(0, '-0.706')] [2022-07-11 13:36:40,834][26022] Updated weights on worker 0-0, policy_version 1213846 (0.00086) [2022-07-11 13:36:42,469][26022] Updated weights on worker 0-0, policy_version 1213856 (0.00083) [2022-07-11 13:36:44,551][26022] Updated weights on worker 0-0, policy_version 1213866 (0.00095) [2022-07-11 13:36:45,515][25689] Fps is (10 sec: 5817.1, 60 sec: 5661.0, 300 sec: 5640.4). Total num frames: 1243005952. Throughput: 0: 5947.5. Samples: 1243004062. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:36:45,515][25689] Avg episode reward: [(0, '-1.272')] [2022-07-11 13:36:46,002][26022] Updated weights on worker 0-0, policy_version 1213876 (0.00087) [2022-07-11 13:36:48,057][26022] Updated weights on worker 0-0, policy_version 1213886 (0.00094) [2022-07-11 13:36:49,556][26022] Updated weights on worker 0-0, policy_version 1213896 (0.00088) [2022-07-11 13:36:50,527][25689] Fps is (10 sec: 5619.1, 60 sec: 5645.1, 300 sec: 5638.3). Total num frames: 1243032576. Throughput: 0: 5947.9. Samples: 1243038254. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:36:50,527][25689] Avg episode reward: [(0, '-1.607')] [2022-07-11 13:36:51,604][26022] Updated weights on worker 0-0, policy_version 1213906 (0.00088) [2022-07-11 13:36:53,329][26022] Updated weights on worker 0-0, policy_version 1213916 (0.00088) [2022-07-11 13:36:55,295][26022] Updated weights on worker 0-0, policy_version 1213926 (0.00081) [2022-07-11 13:36:55,594][25689] Fps is (10 sec: 5587.6, 60 sec: 5645.7, 300 sec: 5641.5). Total num frames: 1243062272. Throughput: 0: 5099.3. Samples: 1243055272. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:36:55,595][25689] Avg episode reward: [(0, '-1.254')] [2022-07-11 13:36:57,016][26022] Updated weights on worker 0-0, policy_version 1213936 (0.00080) [2022-07-11 13:36:58,749][26022] Updated weights on worker 0-0, policy_version 1213946 (0.00082) [2022-07-11 13:37:00,599][25689] Fps is (10 sec: 5795.0, 60 sec: 5649.3, 300 sec: 5649.7). Total num frames: 1243090944. Throughput: 0: 5945.7. Samples: 1243089458. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:00,599][25689] Avg episode reward: [(0, '-0.694')] [2022-07-11 13:37:00,600][26022] Updated weights on worker 0-0, policy_version 1213956 (0.00083) [2022-07-11 13:37:02,671][26022] Updated weights on worker 0-0, policy_version 1213966 (0.00087) [2022-07-11 13:37:04,712][26022] Updated weights on worker 0-0, policy_version 1213976 (0.00099) [2022-07-11 13:37:05,613][25689] Fps is (10 sec: 5519.3, 60 sec: 5650.5, 300 sec: 5646.2). Total num frames: 1243117568. Throughput: 0: 5844.1. Samples: 1243121518. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:05,614][25689] Avg episode reward: [(0, '-1.777')] [2022-07-11 13:37:06,282][26022] Updated weights on worker 0-0, policy_version 1213986 (0.00083) [2022-07-11 13:37:08,428][26022] Updated weights on worker 0-0, policy_version 1213996 (0.00087) [2022-07-11 13:37:09,778][26022] Updated weights on worker 0-0, policy_version 1214006 (0.00081) [2022-07-11 13:37:10,700][25689] Fps is (10 sec: 5474.1, 60 sec: 5646.4, 300 sec: 5642.8). Total num frames: 1243146240. Throughput: 0: 4982.5. Samples: 1243138768. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:10,701][25689] Avg episode reward: [(0, '-0.939')] [2022-07-11 13:37:11,820][26022] Updated weights on worker 0-0, policy_version 1214016 (0.00083) [2022-07-11 13:37:13,442][26022] Updated weights on worker 0-0, policy_version 1214026 (0.00079) [2022-07-11 13:37:15,391][26022] Updated weights on worker 0-0, policy_version 1214036 (0.00085) [2022-07-11 13:37:15,771][25689] Fps is (10 sec: 5544.5, 60 sec: 5648.0, 300 sec: 5635.2). Total num frames: 1243173888. Throughput: 0: 5822.6. Samples: 1243172752. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:15,771][25689] Avg episode reward: [(0, '-0.542')] [2022-07-11 13:37:17,140][26022] Updated weights on worker 0-0, policy_version 1214046 (0.00087) [2022-07-11 13:37:18,916][26022] Updated weights on worker 0-0, policy_version 1214056 (0.00093) [2022-07-11 13:37:20,755][26022] Updated weights on worker 0-0, policy_version 1214066 (0.00086) [2022-07-11 13:37:20,799][25689] Fps is (10 sec: 5678.5, 60 sec: 5669.4, 300 sec: 5638.8). Total num frames: 1243203584. Throughput: 0: 5822.6. Samples: 1243207074. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:20,799][25689] Avg episode reward: [(0, '0.369')] [2022-07-11 13:37:22,499][26022] Updated weights on worker 0-0, policy_version 1214076 (0.00077) [2022-07-11 13:37:24,339][26022] Updated weights on worker 0-0, policy_version 1214086 (0.00047) [2022-07-11 13:37:25,822][25689] Fps is (10 sec: 5705.1, 60 sec: 5638.7, 300 sec: 5633.1). Total num frames: 1243231232. Throughput: 0: 5906.5. Samples: 1243240884. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:25,823][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 13:37:26,439][26022] Updated weights on worker 0-0, policy_version 1214096 (0.00093) [2022-07-11 13:37:27,886][26022] Updated weights on worker 0-0, policy_version 1214106 (0.00084) [2022-07-11 13:37:29,961][26022] Updated weights on worker 0-0, policy_version 1214116 (0.00085) [2022-07-11 13:37:30,848][25689] Fps is (10 sec: 5604.2, 60 sec: 5654.3, 300 sec: 5638.0). Total num frames: 1243259904. Throughput: 0: 5905.6. Samples: 1243257754. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:30,849][25689] Avg episode reward: [(0, '-0.214')] [2022-07-11 13:37:31,540][26022] Updated weights on worker 0-0, policy_version 1214126 (0.00082) [2022-07-11 13:37:33,575][26022] Updated weights on worker 0-0, policy_version 1214136 (0.00084) [2022-07-11 13:37:35,115][26022] Updated weights on worker 0-0, policy_version 1214146 (0.00085) [2022-07-11 13:37:35,979][25689] Fps is (10 sec: 5645.9, 60 sec: 5636.4, 300 sec: 5636.6). Total num frames: 1243288576. Throughput: 0: 5904.3. Samples: 1243292066. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:35,981][25689] Avg episode reward: [(0, '0.560')] [2022-07-11 13:37:37,158][26022] Updated weights on worker 0-0, policy_version 1214156 (0.00087) [2022-07-11 13:37:38,729][26022] Updated weights on worker 0-0, policy_version 1214166 (0.00081) [2022-07-11 13:37:40,825][26022] Updated weights on worker 0-0, policy_version 1214176 (0.00092) [2022-07-11 13:37:40,991][25689] Fps is (10 sec: 5653.6, 60 sec: 5637.4, 300 sec: 5633.8). Total num frames: 1243317248. Throughput: 0: 5900.2. Samples: 1243326214. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:40,994][25689] Avg episode reward: [(0, '1.271')] [2022-07-11 13:37:42,410][26022] Updated weights on worker 0-0, policy_version 1214186 (0.00090) [2022-07-11 13:37:44,360][26022] Updated weights on worker 0-0, policy_version 1214196 (0.00049) [2022-07-11 13:37:45,916][26022] Updated weights on worker 0-0, policy_version 1214206 (0.00084) [2022-07-11 13:37:46,016][25689] Fps is (10 sec: 5815.2, 60 sec: 5636.1, 300 sec: 5634.4). Total num frames: 1243346944. Throughput: 0: 5080.3. Samples: 1243343474. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:46,018][25689] Avg episode reward: [(0, '1.306')] [2022-07-11 13:37:47,997][26022] Updated weights on worker 0-0, policy_version 1214216 (0.00086) [2022-07-11 13:37:49,520][26022] Updated weights on worker 0-0, policy_version 1214226 (0.00085) [2022-07-11 13:37:51,050][25689] Fps is (10 sec: 5599.1, 60 sec: 5634.1, 300 sec: 5635.8). Total num frames: 1243373568. Throughput: 0: 5919.9. Samples: 1243377346. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:51,050][25689] Avg episode reward: [(0, '0.475')] [2022-07-11 13:37:51,735][26022] Updated weights on worker 0-0, policy_version 1214236 (0.00093) [2022-07-11 13:37:53,140][26022] Updated weights on worker 0-0, policy_version 1214246 (0.00089) [2022-07-11 13:37:55,335][26022] Updated weights on worker 0-0, policy_version 1214256 (0.00078) [2022-07-11 13:37:56,132][25689] Fps is (10 sec: 5567.3, 60 sec: 5632.7, 300 sec: 5631.6). Total num frames: 1243403264. Throughput: 0: 5913.9. Samples: 1243411252. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:37:56,133][25689] Avg episode reward: [(0, '0.656')] [2022-07-11 13:37:56,768][26022] Updated weights on worker 0-0, policy_version 1214266 (0.00082) [2022-07-11 13:37:58,819][26022] Updated weights on worker 0-0, policy_version 1214276 (0.00085) [2022-07-11 13:38:00,471][26022] Updated weights on worker 0-0, policy_version 1214286 (0.00079) [2022-07-11 13:38:01,149][25689] Fps is (10 sec: 5779.3, 60 sec: 5631.5, 300 sec: 5642.2). Total num frames: 1243431936. Throughput: 0: 5063.6. Samples: 1243428288. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:01,150][25689] Avg episode reward: [(0, '1.172')] [2022-07-11 13:38:02,731][26022] Updated weights on worker 0-0, policy_version 1214296 (0.00092) [2022-07-11 13:38:04,425][26022] Updated weights on worker 0-0, policy_version 1214306 (0.00087) [2022-07-11 13:38:06,191][25689] Fps is (10 sec: 5395.7, 60 sec: 5612.1, 300 sec: 5635.2). Total num frames: 1243457536. Throughput: 0: 5805.8. Samples: 1243460604. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:06,191][25689] Avg episode reward: [(0, '0.511')] [2022-07-11 13:38:06,396][26022] Updated weights on worker 0-0, policy_version 1214316 (0.00092) [2022-07-11 13:38:07,856][26022] Updated weights on worker 0-0, policy_version 1214326 (0.00091) [2022-07-11 13:38:10,283][26022] Updated weights on worker 0-0, policy_version 1214336 (0.00081) [2022-07-11 13:38:11,158][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:38:11,179][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001214344_1243488256.pth [2022-07-11 13:38:11,180][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001212358_1241454592.pth [2022-07-11 13:38:11,233][25689] Fps is (10 sec: 5585.1, 60 sec: 5650.0, 300 sec: 5638.8). Total num frames: 1243488256. Throughput: 0: 5820.8. Samples: 1243494830. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:11,234][25689] Avg episode reward: [(0, '-0.993')] [2022-07-11 13:38:11,499][26022] Updated weights on worker 0-0, policy_version 1214346 (0.00081) [2022-07-11 13:38:13,737][26022] Updated weights on worker 0-0, policy_version 1214356 (0.00088) [2022-07-11 13:38:15,004][26022] Updated weights on worker 0-0, policy_version 1214366 (0.00081) [2022-07-11 13:38:16,335][25689] Fps is (10 sec: 5753.4, 60 sec: 5647.1, 300 sec: 5637.0). Total num frames: 1243515904. Throughput: 0: 4990.8. Samples: 1243512082. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:16,336][25689] Avg episode reward: [(0, '-0.918')] [2022-07-11 13:38:17,140][26022] Updated weights on worker 0-0, policy_version 1214376 (0.00084) [2022-07-11 13:38:18,667][26022] Updated weights on worker 0-0, policy_version 1214386 (0.00090) [2022-07-11 13:38:20,741][26022] Updated weights on worker 0-0, policy_version 1214396 (0.00090) [2022-07-11 13:38:21,352][25689] Fps is (10 sec: 5667.3, 60 sec: 5648.2, 300 sec: 5643.8). Total num frames: 1243545600. Throughput: 0: 5855.3. Samples: 1243546580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:21,352][25689] Avg episode reward: [(0, '-1.687')] [2022-07-11 13:38:22,283][26022] Updated weights on worker 0-0, policy_version 1214406 (0.00087) [2022-07-11 13:38:24,455][26022] Updated weights on worker 0-0, policy_version 1214416 (0.00089) [2022-07-11 13:38:25,943][26022] Updated weights on worker 0-0, policy_version 1214426 (0.00085) [2022-07-11 13:38:26,401][25689] Fps is (10 sec: 5697.3, 60 sec: 5645.8, 300 sec: 5636.2). Total num frames: 1243573248. Throughput: 0: 5924.3. Samples: 1243580334. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:26,401][25689] Avg episode reward: [(0, '-2.998')] [2022-07-11 13:38:28,095][26022] Updated weights on worker 0-0, policy_version 1214436 (0.00089) [2022-07-11 13:38:29,826][26022] Updated weights on worker 0-0, policy_version 1214446 (0.00084) [2022-07-11 13:38:31,433][25689] Fps is (10 sec: 5485.3, 60 sec: 5628.4, 300 sec: 5636.9). Total num frames: 1243600896. Throughput: 0: 5061.8. Samples: 1243597074. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:31,433][25689] Avg episode reward: [(0, '-3.067')] [2022-07-11 13:38:31,673][26022] Updated weights on worker 0-0, policy_version 1214456 (0.00092) [2022-07-11 13:38:33,250][26022] Updated weights on worker 0-0, policy_version 1214466 (0.00100) [2022-07-11 13:38:35,084][26022] Updated weights on worker 0-0, policy_version 1214476 (0.00084) [2022-07-11 13:38:36,508][25689] Fps is (10 sec: 5774.7, 60 sec: 5667.3, 300 sec: 5642.6). Total num frames: 1243631616. Throughput: 0: 5926.4. Samples: 1243631634. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:36,510][25689] Avg episode reward: [(0, '-2.078')] [2022-07-11 13:38:36,939][26022] Updated weights on worker 0-0, policy_version 1214486 (0.00083) [2022-07-11 13:38:38,790][26022] Updated weights on worker 0-0, policy_version 1214496 (0.00096) [2022-07-11 13:38:40,460][26022] Updated weights on worker 0-0, policy_version 1214506 (0.00087) [2022-07-11 13:38:41,538][25689] Fps is (10 sec: 5776.1, 60 sec: 5648.8, 300 sec: 5642.5). Total num frames: 1243659264. Throughput: 0: 5910.1. Samples: 1243665880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:41,538][25689] Avg episode reward: [(0, '-1.895')] [2022-07-11 13:38:42,363][26022] Updated weights on worker 0-0, policy_version 1214516 (0.00096) [2022-07-11 13:38:43,849][26022] Updated weights on worker 0-0, policy_version 1214526 (0.00506) [2022-07-11 13:38:45,839][26022] Updated weights on worker 0-0, policy_version 1214536 (0.00085) [2022-07-11 13:38:46,574][25689] Fps is (10 sec: 5595.0, 60 sec: 5630.8, 300 sec: 5642.4). Total num frames: 1243687936. Throughput: 0: 5091.4. Samples: 1243683046. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:46,575][25689] Avg episode reward: [(0, '-1.374')] [2022-07-11 13:38:47,704][26022] Updated weights on worker 0-0, policy_version 1214546 (0.00090) [2022-07-11 13:38:49,367][26022] Updated weights on worker 0-0, policy_version 1214556 (0.00089) [2022-07-11 13:38:51,377][26022] Updated weights on worker 0-0, policy_version 1214566 (0.00090) [2022-07-11 13:38:51,581][25689] Fps is (10 sec: 5709.9, 60 sec: 5667.2, 300 sec: 5641.6). Total num frames: 1243716608. Throughput: 0: 5973.3. Samples: 1243717424. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:51,581][25689] Avg episode reward: [(0, '-1.486')] [2022-07-11 13:38:53,086][26022] Updated weights on worker 0-0, policy_version 1214576 (0.00082) [2022-07-11 13:38:54,910][26022] Updated weights on worker 0-0, policy_version 1214586 (0.00089) [2022-07-11 13:38:56,645][26022] Updated weights on worker 0-0, policy_version 1214596 (0.00091) [2022-07-11 13:38:56,742][25689] Fps is (10 sec: 5740.4, 60 sec: 5659.8, 300 sec: 5647.4). Total num frames: 1243746304. Throughput: 0: 5925.1. Samples: 1243751524. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:38:56,743][25689] Avg episode reward: [(0, '0.231')] [2022-07-11 13:38:58,447][26022] Updated weights on worker 0-0, policy_version 1214606 (0.00103) [2022-07-11 13:39:00,261][26022] Updated weights on worker 0-0, policy_version 1214616 (0.00080) [2022-07-11 13:39:01,781][25689] Fps is (10 sec: 5722.1, 60 sec: 5657.8, 300 sec: 5647.2). Total num frames: 1243774976. Throughput: 0: 5070.4. Samples: 1243768522. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:01,783][25689] Avg episode reward: [(0, '0.337')] [2022-07-11 13:39:02,450][26022] Updated weights on worker 0-0, policy_version 1214626 (0.00086) [2022-07-11 13:39:04,131][26022] Updated weights on worker 0-0, policy_version 1214636 (0.00083) [2022-07-11 13:39:06,276][26022] Updated weights on worker 0-0, policy_version 1214646 (0.00095) [2022-07-11 13:39:06,822][25689] Fps is (10 sec: 5485.8, 60 sec: 5674.7, 300 sec: 5644.4). Total num frames: 1243801600. Throughput: 0: 5821.5. Samples: 1243800918. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:06,823][25689] Avg episode reward: [(0, '0.044')] [2022-07-11 13:39:07,615][26022] Updated weights on worker 0-0, policy_version 1214656 (0.00087) [2022-07-11 13:39:09,752][26022] Updated weights on worker 0-0, policy_version 1214666 (0.00091) [2022-07-11 13:39:11,258][26022] Updated weights on worker 0-0, policy_version 1214676 (0.00086) [2022-07-11 13:39:11,842][25689] Fps is (10 sec: 5496.1, 60 sec: 5643.0, 300 sec: 5645.0). Total num frames: 1243830272. Throughput: 0: 5810.9. Samples: 1243835162. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:11,843][25689] Avg episode reward: [(0, '-0.255')] [2022-07-11 13:39:13,227][26022] Updated weights on worker 0-0, policy_version 1214686 (0.00075) [2022-07-11 13:39:14,974][26022] Updated weights on worker 0-0, policy_version 1214696 (0.00082) [2022-07-11 13:39:16,856][26022] Updated weights on worker 0-0, policy_version 1214706 (0.00088) [2022-07-11 13:39:16,925][25689] Fps is (10 sec: 5777.6, 60 sec: 5678.6, 300 sec: 5650.5). Total num frames: 1243859968. Throughput: 0: 4993.1. Samples: 1243852296. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:16,925][25689] Avg episode reward: [(0, '1.094')] [2022-07-11 13:39:18,626][26022] Updated weights on worker 0-0, policy_version 1214716 (0.00086) [2022-07-11 13:39:20,320][26022] Updated weights on worker 0-0, policy_version 1214726 (0.00086) [2022-07-11 13:39:21,956][25689] Fps is (10 sec: 5669.6, 60 sec: 5643.4, 300 sec: 5648.3). Total num frames: 1243887616. Throughput: 0: 5845.1. Samples: 1243886448. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:21,957][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 13:39:22,273][26022] Updated weights on worker 0-0, policy_version 1214736 (0.00085) [2022-07-11 13:39:23,924][26022] Updated weights on worker 0-0, policy_version 1214746 (0.00049) [2022-07-11 13:39:25,820][26022] Updated weights on worker 0-0, policy_version 1214756 (0.00089) [2022-07-11 13:39:27,006][25689] Fps is (10 sec: 5586.6, 60 sec: 5660.2, 300 sec: 5645.7). Total num frames: 1243916288. Throughput: 0: 5929.9. Samples: 1243920606. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:27,006][25689] Avg episode reward: [(0, '0.171')] [2022-07-11 13:39:27,596][26022] Updated weights on worker 0-0, policy_version 1214766 (0.00077) [2022-07-11 13:39:29,470][26022] Updated weights on worker 0-0, policy_version 1214776 (0.00091) [2022-07-11 13:39:31,280][26022] Updated weights on worker 0-0, policy_version 1214786 (0.00079) [2022-07-11 13:39:32,019][25689] Fps is (10 sec: 5597.2, 60 sec: 5662.0, 300 sec: 5647.7). Total num frames: 1243943936. Throughput: 0: 5063.5. Samples: 1243937326. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:32,019][25689] Avg episode reward: [(0, '1.001')] [2022-07-11 13:39:32,998][26022] Updated weights on worker 0-0, policy_version 1214796 (0.00092) [2022-07-11 13:39:34,807][26022] Updated weights on worker 0-0, policy_version 1214806 (0.00085) [2022-07-11 13:39:36,871][26022] Updated weights on worker 0-0, policy_version 1214816 (0.00086) [2022-07-11 13:39:37,122][25689] Fps is (10 sec: 5567.2, 60 sec: 5625.6, 300 sec: 5647.3). Total num frames: 1243972608. Throughput: 0: 5880.4. Samples: 1243971066. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:37,125][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 13:39:38,653][26022] Updated weights on worker 0-0, policy_version 1214826 (0.00083) [2022-07-11 13:39:40,362][26022] Updated weights on worker 0-0, policy_version 1214836 (0.00084) [2022-07-11 13:39:42,134][25689] Fps is (10 sec: 5668.9, 60 sec: 5644.2, 300 sec: 5644.9). Total num frames: 1244001280. Throughput: 0: 5872.9. Samples: 1244004950. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:42,135][25689] Avg episode reward: [(0, '0.917')] [2022-07-11 13:39:42,303][26022] Updated weights on worker 0-0, policy_version 1214846 (0.00086) [2022-07-11 13:39:44,103][26022] Updated weights on worker 0-0, policy_version 1214856 (0.00085) [2022-07-11 13:39:45,971][26022] Updated weights on worker 0-0, policy_version 1214866 (0.00097) [2022-07-11 13:39:47,136][25689] Fps is (10 sec: 5726.6, 60 sec: 5647.4, 300 sec: 5649.3). Total num frames: 1244029952. Throughput: 0: 5864.6. Samples: 1244038660. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:47,136][25689] Avg episode reward: [(0, '1.199')] [2022-07-11 13:39:47,703][26022] Updated weights on worker 0-0, policy_version 1214876 (0.00087) [2022-07-11 13:39:49,523][26022] Updated weights on worker 0-0, policy_version 1214886 (0.00080) [2022-07-11 13:39:51,373][26022] Updated weights on worker 0-0, policy_version 1214896 (0.00089) [2022-07-11 13:39:52,139][25689] Fps is (10 sec: 5629.4, 60 sec: 5630.8, 300 sec: 5647.5). Total num frames: 1244057600. Throughput: 0: 5880.6. Samples: 1244055644. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:52,139][25689] Avg episode reward: [(0, '1.393')] [2022-07-11 13:39:53,108][26022] Updated weights on worker 0-0, policy_version 1214906 (0.00080) [2022-07-11 13:39:55,095][26022] Updated weights on worker 0-0, policy_version 1214916 (0.00085) [2022-07-11 13:39:56,682][26022] Updated weights on worker 0-0, policy_version 1214926 (0.00083) [2022-07-11 13:39:57,200][25689] Fps is (10 sec: 5697.6, 60 sec: 5640.1, 300 sec: 5649.9). Total num frames: 1244087296. Throughput: 0: 5904.3. Samples: 1244089614. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:39:57,201][25689] Avg episode reward: [(0, '1.458')] [2022-07-11 13:39:58,839][26022] Updated weights on worker 0-0, policy_version 1214936 (0.00085) [2022-07-11 13:40:00,292][26022] Updated weights on worker 0-0, policy_version 1214946 (0.00089) [2022-07-11 13:40:02,275][25689] Fps is (10 sec: 5354.4, 60 sec: 5569.1, 300 sec: 5642.5). Total num frames: 1244111872. Throughput: 0: 5818.1. Samples: 1244122130. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:02,275][25689] Avg episode reward: [(0, '1.183')] [2022-07-11 13:40:02,735][26022] Updated weights on worker 0-0, policy_version 1214956 (0.00087) [2022-07-11 13:40:04,308][26022] Updated weights on worker 0-0, policy_version 1214966 (0.00085) [2022-07-11 13:40:06,327][26022] Updated weights on worker 0-0, policy_version 1214976 (0.00092) [2022-07-11 13:40:07,311][25689] Fps is (10 sec: 5367.7, 60 sec: 5620.3, 300 sec: 5646.6). Total num frames: 1244141568. Throughput: 0: 4952.6. Samples: 1244138580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:07,312][25689] Avg episode reward: [(0, '1.258')] [2022-07-11 13:40:07,961][26022] Updated weights on worker 0-0, policy_version 1214986 (0.00089) [2022-07-11 13:40:09,903][26022] Updated weights on worker 0-0, policy_version 1214996 (0.00086) [2022-07-11 13:40:11,290][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:40:11,301][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001215004_1244164096.pth [2022-07-11 13:40:11,302][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001213019_1242131456.pth [2022-07-11 13:40:11,659][26022] Updated weights on worker 0-0, policy_version 1215006 (0.00084) [2022-07-11 13:40:12,366][25689] Fps is (10 sec: 5784.1, 60 sec: 5617.1, 300 sec: 5639.7). Total num frames: 1244170240. Throughput: 0: 5770.5. Samples: 1244172366. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:12,368][25689] Avg episode reward: [(0, '0.558')] [2022-07-11 13:40:13,619][26022] Updated weights on worker 0-0, policy_version 1215016 (0.00087) [2022-07-11 13:40:15,318][26022] Updated weights on worker 0-0, policy_version 1215026 (0.00082) [2022-07-11 13:40:17,103][26022] Updated weights on worker 0-0, policy_version 1215036 (0.00089) [2022-07-11 13:40:17,449][25689] Fps is (10 sec: 5555.3, 60 sec: 5583.2, 300 sec: 5642.9). Total num frames: 1244197888. Throughput: 0: 5768.2. Samples: 1244206414. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:17,450][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 13:40:18,696][26022] Updated weights on worker 0-0, policy_version 1215046 (0.00089) [2022-07-11 13:40:20,775][26022] Updated weights on worker 0-0, policy_version 1215056 (0.00080) [2022-07-11 13:40:22,526][25689] Fps is (10 sec: 5543.1, 60 sec: 5595.9, 300 sec: 5638.9). Total num frames: 1244226560. Throughput: 0: 4993.2. Samples: 1244223250. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:22,527][25689] Avg episode reward: [(0, '-0.296')] [2022-07-11 13:40:22,584][26022] Updated weights on worker 0-0, policy_version 1215066 (0.00087) [2022-07-11 13:40:24,303][26022] Updated weights on worker 0-0, policy_version 1215076 (0.00092) [2022-07-11 13:40:26,252][26022] Updated weights on worker 0-0, policy_version 1215086 (0.00093) [2022-07-11 13:40:27,548][25689] Fps is (10 sec: 5678.0, 60 sec: 5598.4, 300 sec: 5636.3). Total num frames: 1244255232. Throughput: 0: 5876.5. Samples: 1244257504. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:27,549][25689] Avg episode reward: [(0, '-0.477')] [2022-07-11 13:40:27,920][26022] Updated weights on worker 0-0, policy_version 1215096 (0.00086) [2022-07-11 13:40:29,615][26022] Updated weights on worker 0-0, policy_version 1215106 (0.00084) [2022-07-11 13:40:31,500][26022] Updated weights on worker 0-0, policy_version 1215116 (0.01068) [2022-07-11 13:40:32,560][25689] Fps is (10 sec: 5817.4, 60 sec: 5632.4, 300 sec: 5647.9). Total num frames: 1244284928. Throughput: 0: 5907.0. Samples: 1244291650. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:32,560][25689] Avg episode reward: [(0, '-1.165')] [2022-07-11 13:40:33,314][26022] Updated weights on worker 0-0, policy_version 1215126 (0.00091) [2022-07-11 13:40:35,161][26022] Updated weights on worker 0-0, policy_version 1215136 (0.00637) [2022-07-11 13:40:36,986][26022] Updated weights on worker 0-0, policy_version 1215146 (0.00082) [2022-07-11 13:40:37,690][25689] Fps is (10 sec: 5654.6, 60 sec: 5613.0, 300 sec: 5638.9). Total num frames: 1244312576. Throughput: 0: 5062.0. Samples: 1244308870. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2022-07-11 13:40:37,690][25689] Avg episode reward: [(0, '-1.193')] [2022-07-11 13:40:38,669][26022] Updated weights on worker 0-0, policy_version 1215156 (0.00087) [2022-07-11 13:40:40,596][26022] Updated weights on worker 0-0, policy_version 1215166 (0.00106) [2022-07-11 13:40:42,466][26022] Updated weights on worker 0-0, policy_version 1215176 (0.00094) [2022-07-11 13:40:42,775][25689] Fps is (10 sec: 5513.4, 60 sec: 5606.3, 300 sec: 5634.7). Total num frames: 1244341248. Throughput: 0: 5905.6. Samples: 1244342830. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:40:42,775][25689] Avg episode reward: [(0, '-0.199')] [2022-07-11 13:40:44,200][26022] Updated weights on worker 0-0, policy_version 1215186 (0.00091) [2022-07-11 13:40:46,165][26022] Updated weights on worker 0-0, policy_version 1215196 (0.00081) [2022-07-11 13:40:47,651][26022] Updated weights on worker 0-0, policy_version 1215206 (0.00082) [2022-07-11 13:40:47,859][25689] Fps is (10 sec: 5739.7, 60 sec: 5615.5, 300 sec: 5640.4). Total num frames: 1244370944. Throughput: 0: 5857.5. Samples: 1244376472. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:40:47,859][25689] Avg episode reward: [(0, '-1.074')] [2022-07-11 13:40:49,945][26022] Updated weights on worker 0-0, policy_version 1215216 (0.00101) [2022-07-11 13:40:51,490][26022] Updated weights on worker 0-0, policy_version 1215226 (0.00085) [2022-07-11 13:40:52,911][25689] Fps is (10 sec: 5657.5, 60 sec: 5611.0, 300 sec: 5634.0). Total num frames: 1244398592. Throughput: 0: 5007.3. Samples: 1244393554. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:40:52,911][25689] Avg episode reward: [(0, '-0.544')] [2022-07-11 13:40:53,439][26022] Updated weights on worker 0-0, policy_version 1215236 (0.00089) [2022-07-11 13:40:54,993][26022] Updated weights on worker 0-0, policy_version 1215246 (0.00086) [2022-07-11 13:40:56,871][26022] Updated weights on worker 0-0, policy_version 1215256 (0.00080) [2022-07-11 13:40:57,985][25689] Fps is (10 sec: 5764.1, 60 sec: 5626.7, 300 sec: 5640.2). Total num frames: 1244429312. Throughput: 0: 5863.8. Samples: 1244427878. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:40:57,985][25689] Avg episode reward: [(0, '0.339')] [2022-07-11 13:40:58,643][26022] Updated weights on worker 0-0, policy_version 1215266 (0.00364) [2022-07-11 13:41:00,725][26022] Updated weights on worker 0-0, policy_version 1215276 (0.00088) [2022-07-11 13:41:02,479][26022] Updated weights on worker 0-0, policy_version 1215286 (0.00082) [2022-07-11 13:41:03,001][25689] Fps is (10 sec: 5480.1, 60 sec: 5632.1, 300 sec: 5633.6). Total num frames: 1244453888. Throughput: 0: 5783.5. Samples: 1244459808. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:03,001][25689] Avg episode reward: [(0, '-0.566')] [2022-07-11 13:41:04,693][26022] Updated weights on worker 0-0, policy_version 1215296 (0.00083) [2022-07-11 13:41:06,198][26022] Updated weights on worker 0-0, policy_version 1215306 (0.00084) [2022-07-11 13:41:08,009][25689] Fps is (10 sec: 5107.8, 60 sec: 5584.1, 300 sec: 5627.4). Total num frames: 1244480512. Throughput: 0: 4964.8. Samples: 1244476512. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:08,009][25689] Avg episode reward: [(0, '-0.500')] [2022-07-11 13:41:08,232][26022] Updated weights on worker 0-0, policy_version 1215316 (0.00093) [2022-07-11 13:41:10,045][26022] Updated weights on worker 0-0, policy_version 1215326 (0.00095) [2022-07-11 13:41:11,753][26022] Updated weights on worker 0-0, policy_version 1215336 (0.00593) [2022-07-11 13:41:13,019][25689] Fps is (10 sec: 5724.1, 60 sec: 5622.0, 300 sec: 5639.2). Total num frames: 1244511232. Throughput: 0: 5812.0. Samples: 1244510424. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:13,020][25689] Avg episode reward: [(0, '0.277')] [2022-07-11 13:41:13,602][26022] Updated weights on worker 0-0, policy_version 1215346 (0.00082) [2022-07-11 13:41:15,450][26022] Updated weights on worker 0-0, policy_version 1215356 (0.00097) [2022-07-11 13:41:17,423][26022] Updated weights on worker 0-0, policy_version 1215366 (0.00081) [2022-07-11 13:41:18,086][25689] Fps is (10 sec: 5893.9, 60 sec: 5640.4, 300 sec: 5639.3). Total num frames: 1244539904. Throughput: 0: 5800.1. Samples: 1244544464. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:18,090][25689] Avg episode reward: [(0, '0.718')] [2022-07-11 13:41:19,051][26022] Updated weights on worker 0-0, policy_version 1215376 (0.00090) [2022-07-11 13:41:20,863][26022] Updated weights on worker 0-0, policy_version 1215386 (0.00093) [2022-07-11 13:41:22,506][26022] Updated weights on worker 0-0, policy_version 1215396 (0.00081) [2022-07-11 13:41:23,111][25689] Fps is (10 sec: 5479.0, 60 sec: 5611.4, 300 sec: 5629.6). Total num frames: 1244566528. Throughput: 0: 5048.8. Samples: 1244561340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:23,113][25689] Avg episode reward: [(0, '1.000')] [2022-07-11 13:41:24,654][26022] Updated weights on worker 0-0, policy_version 1215406 (0.00087) [2022-07-11 13:41:26,421][26022] Updated weights on worker 0-0, policy_version 1215416 (0.00085) [2022-07-11 13:41:28,140][25689] Fps is (10 sec: 5499.7, 60 sec: 5610.8, 300 sec: 5632.7). Total num frames: 1244595200. Throughput: 0: 5896.1. Samples: 1244595208. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:28,142][25689] Avg episode reward: [(0, '1.257')] [2022-07-11 13:41:28,187][26022] Updated weights on worker 0-0, policy_version 1215426 (0.00087) [2022-07-11 13:41:29,906][26022] Updated weights on worker 0-0, policy_version 1215436 (0.00096) [2022-07-11 13:41:31,630][26022] Updated weights on worker 0-0, policy_version 1215446 (0.00087) [2022-07-11 13:41:33,163][25689] Fps is (10 sec: 5806.8, 60 sec: 5609.7, 300 sec: 5634.6). Total num frames: 1244624896. Throughput: 0: 5903.3. Samples: 1244629340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:33,164][25689] Avg episode reward: [(0, '2.287')] [2022-07-11 13:41:33,405][26022] Updated weights on worker 0-0, policy_version 1215456 (0.00090) [2022-07-11 13:41:35,399][26022] Updated weights on worker 0-0, policy_version 1215466 (0.00083) [2022-07-11 13:41:37,009][26022] Updated weights on worker 0-0, policy_version 1215476 (0.00086) [2022-07-11 13:41:38,296][25689] Fps is (10 sec: 5746.9, 60 sec: 5626.3, 300 sec: 5632.5). Total num frames: 1244653568. Throughput: 0: 5061.4. Samples: 1244646758. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:38,298][25689] Avg episode reward: [(0, '1.607')] [2022-07-11 13:41:38,998][26022] Updated weights on worker 0-0, policy_version 1215486 (0.00087) [2022-07-11 13:41:40,605][26022] Updated weights on worker 0-0, policy_version 1215496 (0.00080) [2022-07-11 13:41:42,428][26022] Updated weights on worker 0-0, policy_version 1215506 (0.00079) [2022-07-11 13:41:43,308][25689] Fps is (10 sec: 5753.5, 60 sec: 5650.1, 300 sec: 5632.5). Total num frames: 1244683264. Throughput: 0: 5938.9. Samples: 1244681284. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:43,308][25689] Avg episode reward: [(0, '0.786')] [2022-07-11 13:41:44,259][26022] Updated weights on worker 0-0, policy_version 1215516 (0.00090) [2022-07-11 13:41:45,915][26022] Updated weights on worker 0-0, policy_version 1215526 (0.00088) [2022-07-11 13:41:47,821][26022] Updated weights on worker 0-0, policy_version 1215536 (0.00081) [2022-07-11 13:41:48,331][25689] Fps is (10 sec: 5714.8, 60 sec: 5621.9, 300 sec: 5635.7). Total num frames: 1244710912. Throughput: 0: 5964.7. Samples: 1244715640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:48,333][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 13:41:49,506][26022] Updated weights on worker 0-0, policy_version 1215546 (0.00076) [2022-07-11 13:41:51,463][26022] Updated weights on worker 0-0, policy_version 1215556 (0.00077) [2022-07-11 13:41:53,351][25689] Fps is (10 sec: 5505.7, 60 sec: 5624.8, 300 sec: 5629.7). Total num frames: 1244738560. Throughput: 0: 5132.6. Samples: 1244732958. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:53,352][25689] Avg episode reward: [(0, '0.166')] [2022-07-11 13:41:53,368][26022] Updated weights on worker 0-0, policy_version 1215566 (0.00086) [2022-07-11 13:41:54,922][26022] Updated weights on worker 0-0, policy_version 1215576 (0.00093) [2022-07-11 13:41:56,935][26022] Updated weights on worker 0-0, policy_version 1215586 (0.00609) [2022-07-11 13:41:58,390][25689] Fps is (10 sec: 5802.5, 60 sec: 5628.1, 300 sec: 5636.0). Total num frames: 1244769280. Throughput: 0: 5990.2. Samples: 1244767122. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:41:58,390][25689] Avg episode reward: [(0, '-0.152')] [2022-07-11 13:41:58,516][26022] Updated weights on worker 0-0, policy_version 1215596 (0.00086) [2022-07-11 13:42:00,472][26022] Updated weights on worker 0-0, policy_version 1215606 (0.00087) [2022-07-11 13:42:02,447][26022] Updated weights on worker 0-0, policy_version 1215616 (0.00082) [2022-07-11 13:42:03,429][25689] Fps is (10 sec: 5588.4, 60 sec: 5642.9, 300 sec: 5632.0). Total num frames: 1244794880. Throughput: 0: 5858.3. Samples: 1244799162. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:03,430][25689] Avg episode reward: [(0, '-0.310')] [2022-07-11 13:42:04,301][26022] Updated weights on worker 0-0, policy_version 1215626 (0.00088) [2022-07-11 13:42:06,262][26022] Updated weights on worker 0-0, policy_version 1215636 (0.00085) [2022-07-11 13:42:07,813][26022] Updated weights on worker 0-0, policy_version 1215646 (0.00112) [2022-07-11 13:42:08,437][25689] Fps is (10 sec: 5300.0, 60 sec: 5659.9, 300 sec: 5630.1). Total num frames: 1244822528. Throughput: 0: 5827.1. Samples: 1244832798. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:08,437][25689] Avg episode reward: [(0, '0.140')] [2022-07-11 13:42:09,944][26022] Updated weights on worker 0-0, policy_version 1215656 (0.00087) [2022-07-11 13:42:11,434][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:42:11,448][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001215664_1244839936.pth [2022-07-11 13:42:11,448][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001213680_1242808320.pth [2022-07-11 13:42:11,449][25974] Saving a milestone ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/milestones/checkpoint_001215664_1244839936.pth.milestone [2022-07-11 13:42:11,726][26022] Updated weights on worker 0-0, policy_version 1215666 (0.00083) [2022-07-11 13:42:13,280][26022] Updated weights on worker 0-0, policy_version 1215676 (0.00092) [2022-07-11 13:42:13,442][25689] Fps is (10 sec: 5727.0, 60 sec: 5643.4, 300 sec: 5638.2). Total num frames: 1244852224. Throughput: 0: 5818.2. Samples: 1244849852. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:13,443][25689] Avg episode reward: [(0, '1.213')] [2022-07-11 13:42:15,280][26022] Updated weights on worker 0-0, policy_version 1215686 (0.00086) [2022-07-11 13:42:17,023][26022] Updated weights on worker 0-0, policy_version 1215696 (0.00083) [2022-07-11 13:42:18,566][25689] Fps is (10 sec: 5560.3, 60 sec: 5604.2, 300 sec: 5626.1). Total num frames: 1244878848. Throughput: 0: 5776.3. Samples: 1244883662. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:18,566][25689] Avg episode reward: [(0, '1.670')] [2022-07-11 13:42:18,918][26022] Updated weights on worker 0-0, policy_version 1215706 (0.00086) [2022-07-11 13:42:20,745][26022] Updated weights on worker 0-0, policy_version 1215716 (0.00084) [2022-07-11 13:42:22,481][26022] Updated weights on worker 0-0, policy_version 1215726 (0.00089) [2022-07-11 13:42:23,667][25689] Fps is (10 sec: 5508.3, 60 sec: 5648.0, 300 sec: 5631.5). Total num frames: 1244908544. Throughput: 0: 5845.7. Samples: 1244917464. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:23,668][25689] Avg episode reward: [(0, '0.855')] [2022-07-11 13:42:24,652][26022] Updated weights on worker 0-0, policy_version 1215736 (0.00082) [2022-07-11 13:42:26,171][26022] Updated weights on worker 0-0, policy_version 1215746 (0.00097) [2022-07-11 13:42:28,172][26022] Updated weights on worker 0-0, policy_version 1215756 (0.00083) [2022-07-11 13:42:28,727][25689] Fps is (10 sec: 5744.1, 60 sec: 5645.0, 300 sec: 5630.9). Total num frames: 1244937216. Throughput: 0: 4997.5. Samples: 1244934200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:28,728][25689] Avg episode reward: [(0, '0.825')] [2022-07-11 13:42:29,819][26022] Updated weights on worker 0-0, policy_version 1215766 (0.00085) [2022-07-11 13:42:31,543][26022] Updated weights on worker 0-0, policy_version 1215776 (0.00086) [2022-07-11 13:42:33,650][26022] Updated weights on worker 0-0, policy_version 1215786 (0.00086) [2022-07-11 13:42:33,838][25689] Fps is (10 sec: 5637.6, 60 sec: 5619.9, 300 sec: 5631.2). Total num frames: 1244965888. Throughput: 0: 5813.3. Samples: 1244968422. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:33,839][25689] Avg episode reward: [(0, '1.234')] [2022-07-11 13:42:35,291][26022] Updated weights on worker 0-0, policy_version 1215796 (0.00097) [2022-07-11 13:42:37,015][26022] Updated weights on worker 0-0, policy_version 1215806 (0.00088) [2022-07-11 13:42:38,879][25689] Fps is (10 sec: 5648.4, 60 sec: 5628.5, 300 sec: 5630.7). Total num frames: 1244994560. Throughput: 0: 5833.7. Samples: 1245002166. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:38,880][25689] Avg episode reward: [(0, '1.191')] [2022-07-11 13:42:39,088][26022] Updated weights on worker 0-0, policy_version 1215816 (0.00542) [2022-07-11 13:42:40,722][26022] Updated weights on worker 0-0, policy_version 1215826 (0.00086) [2022-07-11 13:42:42,595][26022] Updated weights on worker 0-0, policy_version 1215836 (0.00086) [2022-07-11 13:42:43,901][25689] Fps is (10 sec: 5698.4, 60 sec: 5610.6, 300 sec: 5627.3). Total num frames: 1245023232. Throughput: 0: 5030.0. Samples: 1245019246. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:43,902][25689] Avg episode reward: [(0, '1.087')] [2022-07-11 13:42:44,274][26022] Updated weights on worker 0-0, policy_version 1215846 (0.00086) [2022-07-11 13:42:46,098][26022] Updated weights on worker 0-0, policy_version 1215856 (0.00091) [2022-07-11 13:42:47,877][26022] Updated weights on worker 0-0, policy_version 1215866 (0.00084) [2022-07-11 13:42:48,940][25689] Fps is (10 sec: 5699.8, 60 sec: 5626.1, 300 sec: 5634.1). Total num frames: 1245051904. Throughput: 0: 5900.3. Samples: 1245053462. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:48,941][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 13:42:49,843][26022] Updated weights on worker 0-0, policy_version 1215876 (0.00847) [2022-07-11 13:42:51,275][26022] Updated weights on worker 0-0, policy_version 1215886 (0.00092) [2022-07-11 13:42:53,546][26022] Updated weights on worker 0-0, policy_version 1215896 (0.00087) [2022-07-11 13:42:53,948][25689] Fps is (10 sec: 5708.0, 60 sec: 5644.1, 300 sec: 5632.1). Total num frames: 1245080576. Throughput: 0: 5931.8. Samples: 1245087706. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:53,948][25689] Avg episode reward: [(0, '1.734')] [2022-07-11 13:42:54,885][26022] Updated weights on worker 0-0, policy_version 1215906 (0.00092) [2022-07-11 13:42:57,047][26022] Updated weights on worker 0-0, policy_version 1215916 (0.00081) [2022-07-11 13:42:58,813][26022] Updated weights on worker 0-0, policy_version 1215926 (0.00086) [2022-07-11 13:42:59,034][25689] Fps is (10 sec: 5680.8, 60 sec: 5605.9, 300 sec: 5630.7). Total num frames: 1245109248. Throughput: 0: 5095.0. Samples: 1245104856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:42:59,035][25689] Avg episode reward: [(0, '1.654')] [2022-07-11 13:43:00,347][26022] Updated weights on worker 0-0, policy_version 1215936 (0.00087) [2022-07-11 13:43:02,354][26022] Updated weights on worker 0-0, policy_version 1215946 (0.00080) [2022-07-11 13:43:04,115][25689] Fps is (10 sec: 5438.5, 60 sec: 5619.0, 300 sec: 5633.4). Total num frames: 1245135872. Throughput: 0: 5855.8. Samples: 1245137614. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:04,116][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 13:43:04,589][26022] Updated weights on worker 0-0, policy_version 1215956 (0.00085) [2022-07-11 13:43:06,164][26022] Updated weights on worker 0-0, policy_version 1215966 (0.00087) [2022-07-11 13:43:08,070][26022] Updated weights on worker 0-0, policy_version 1215976 (0.00100) [2022-07-11 13:43:09,212][25689] Fps is (10 sec: 5533.7, 60 sec: 5644.4, 300 sec: 5629.0). Total num frames: 1245165568. Throughput: 0: 5809.6. Samples: 1245171234. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:09,212][25689] Avg episode reward: [(0, '1.195')] [2022-07-11 13:43:09,722][26022] Updated weights on worker 0-0, policy_version 1215986 (0.00090) [2022-07-11 13:43:11,728][26022] Updated weights on worker 0-0, policy_version 1215996 (0.00089) [2022-07-11 13:43:13,648][26022] Updated weights on worker 0-0, policy_version 1216006 (0.00084) [2022-07-11 13:43:14,252][25689] Fps is (10 sec: 5859.2, 60 sec: 5641.2, 300 sec: 5637.0). Total num frames: 1245195264. Throughput: 0: 4955.1. Samples: 1245188314. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:14,252][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 13:43:15,221][26022] Updated weights on worker 0-0, policy_version 1216016 (0.00092) [2022-07-11 13:43:17,042][26022] Updated weights on worker 0-0, policy_version 1216026 (0.00084) [2022-07-11 13:43:18,848][26022] Updated weights on worker 0-0, policy_version 1216036 (0.00089) [2022-07-11 13:43:19,324][25689] Fps is (10 sec: 5670.9, 60 sec: 5662.8, 300 sec: 5629.1). Total num frames: 1245222912. Throughput: 0: 5802.0. Samples: 1245222576. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:19,324][25689] Avg episode reward: [(0, '1.311')] [2022-07-11 13:43:20,643][26022] Updated weights on worker 0-0, policy_version 1216046 (0.00088) [2022-07-11 13:43:22,499][26022] Updated weights on worker 0-0, policy_version 1216056 (0.00088) [2022-07-11 13:43:24,227][26022] Updated weights on worker 0-0, policy_version 1216066 (0.00098) [2022-07-11 13:43:24,388][25689] Fps is (10 sec: 5657.2, 60 sec: 5666.3, 300 sec: 5635.7). Total num frames: 1245252608. Throughput: 0: 5884.7. Samples: 1245256916. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:24,389][25689] Avg episode reward: [(0, '1.231')] [2022-07-11 13:43:26,065][26022] Updated weights on worker 0-0, policy_version 1216076 (0.00082) [2022-07-11 13:43:27,787][26022] Updated weights on worker 0-0, policy_version 1216086 (0.00088) [2022-07-11 13:43:29,392][25689] Fps is (10 sec: 5695.5, 60 sec: 5654.6, 300 sec: 5636.2). Total num frames: 1245280256. Throughput: 0: 5097.8. Samples: 1245274110. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:29,397][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 13:43:29,543][26022] Updated weights on worker 0-0, policy_version 1216096 (0.00085) [2022-07-11 13:43:31,563][26022] Updated weights on worker 0-0, policy_version 1216106 (0.00091) [2022-07-11 13:43:33,175][26022] Updated weights on worker 0-0, policy_version 1216116 (0.00091) [2022-07-11 13:43:34,424][25689] Fps is (10 sec: 5612.1, 60 sec: 5662.1, 300 sec: 5630.2). Total num frames: 1245308928. Throughput: 0: 5951.9. Samples: 1245308376. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:34,424][25689] Avg episode reward: [(0, '1.529')] [2022-07-11 13:43:35,123][26022] Updated weights on worker 0-0, policy_version 1216126 (0.00087) [2022-07-11 13:43:36,711][26022] Updated weights on worker 0-0, policy_version 1216136 (0.00092) [2022-07-11 13:43:38,808][26022] Updated weights on worker 0-0, policy_version 1216146 (0.00086) [2022-07-11 13:43:39,471][25689] Fps is (10 sec: 5689.8, 60 sec: 5661.5, 300 sec: 5633.3). Total num frames: 1245337600. Throughput: 0: 5957.3. Samples: 1245342596. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:39,471][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 13:43:40,393][26022] Updated weights on worker 0-0, policy_version 1216156 (0.00084) [2022-07-11 13:43:42,247][26022] Updated weights on worker 0-0, policy_version 1216166 (0.00094) [2022-07-11 13:43:43,792][26022] Updated weights on worker 0-0, policy_version 1216176 (0.00094) [2022-07-11 13:43:44,486][25689] Fps is (10 sec: 5698.8, 60 sec: 5662.1, 300 sec: 5633.7). Total num frames: 1245366272. Throughput: 0: 5117.9. Samples: 1245359776. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:44,487][25689] Avg episode reward: [(0, '-0.235')] [2022-07-11 13:43:45,944][26022] Updated weights on worker 0-0, policy_version 1216186 (0.00086) [2022-07-11 13:43:47,663][26022] Updated weights on worker 0-0, policy_version 1216196 (0.00058) [2022-07-11 13:43:49,423][26022] Updated weights on worker 0-0, policy_version 1216206 (0.00092) [2022-07-11 13:43:49,494][25689] Fps is (10 sec: 5721.1, 60 sec: 5665.0, 300 sec: 5633.6). Total num frames: 1245394944. Throughput: 0: 5961.7. Samples: 1245393950. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:49,494][25689] Avg episode reward: [(0, '-1.696')] [2022-07-11 13:43:51,311][26022] Updated weights on worker 0-0, policy_version 1216216 (0.00555) [2022-07-11 13:43:53,239][26022] Updated weights on worker 0-0, policy_version 1216226 (0.00089) [2022-07-11 13:43:54,514][25689] Fps is (10 sec: 5616.4, 60 sec: 5647.0, 300 sec: 5629.4). Total num frames: 1245422592. Throughput: 0: 5939.1. Samples: 1245427694. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:54,516][25689] Avg episode reward: [(0, '-1.539')] [2022-07-11 13:43:54,799][26022] Updated weights on worker 0-0, policy_version 1216236 (0.00089) [2022-07-11 13:43:56,755][26022] Updated weights on worker 0-0, policy_version 1216246 (0.00080) [2022-07-11 13:43:58,356][26022] Updated weights on worker 0-0, policy_version 1216256 (0.00081) [2022-07-11 13:43:59,622][25689] Fps is (10 sec: 5560.8, 60 sec: 5645.0, 300 sec: 5628.1). Total num frames: 1245451264. Throughput: 0: 5916.6. Samples: 1245461822. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:43:59,623][25689] Avg episode reward: [(0, '-1.077')] [2022-07-11 13:44:00,347][26022] Updated weights on worker 0-0, policy_version 1216266 (0.00085) [2022-07-11 13:44:02,452][26022] Updated weights on worker 0-0, policy_version 1216276 (0.00076) [2022-07-11 13:44:04,249][26022] Updated weights on worker 0-0, policy_version 1216286 (0.00091) [2022-07-11 13:44:04,631][25689] Fps is (10 sec: 5566.9, 60 sec: 5668.6, 300 sec: 5632.2). Total num frames: 1245478912. Throughput: 0: 5809.4. Samples: 1245476804. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:04,632][25689] Avg episode reward: [(0, '-1.102')] [2022-07-11 13:44:06,060][26022] Updated weights on worker 0-0, policy_version 1216296 (0.00093) [2022-07-11 13:44:07,978][26022] Updated weights on worker 0-0, policy_version 1216306 (0.00083) [2022-07-11 13:44:09,639][26022] Updated weights on worker 0-0, policy_version 1216316 (0.00049) [2022-07-11 13:44:09,715][25689] Fps is (10 sec: 5681.4, 60 sec: 5669.8, 300 sec: 5634.4). Total num frames: 1245508608. Throughput: 0: 5768.6. Samples: 1245510598. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:09,716][25689] Avg episode reward: [(0, '1.509')] [2022-07-11 13:44:11,525][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:44:11,540][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001216325_1245516800.pth [2022-07-11 13:44:11,540][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001214344_1243488256.pth [2022-07-11 13:44:11,694][26022] Updated weights on worker 0-0, policy_version 1216326 (0.00076) [2022-07-11 13:44:13,428][26022] Updated weights on worker 0-0, policy_version 1216336 (0.00093) [2022-07-11 13:44:14,749][25689] Fps is (10 sec: 5566.3, 60 sec: 5619.5, 300 sec: 5625.0). Total num frames: 1245535232. Throughput: 0: 5779.3. Samples: 1245544638. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:14,750][25689] Avg episode reward: [(0, '2.296')] [2022-07-11 13:44:15,092][26022] Updated weights on worker 0-0, policy_version 1216346 (0.00085) [2022-07-11 13:44:16,960][26022] Updated weights on worker 0-0, policy_version 1216356 (0.00087) [2022-07-11 13:44:18,809][26022] Updated weights on worker 0-0, policy_version 1216366 (0.00086) [2022-07-11 13:44:19,826][25689] Fps is (10 sec: 5468.8, 60 sec: 5636.0, 300 sec: 5627.6). Total num frames: 1245563904. Throughput: 0: 4938.7. Samples: 1245561606. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:19,827][25689] Avg episode reward: [(0, '2.144')] [2022-07-11 13:44:20,586][26022] Updated weights on worker 0-0, policy_version 1216376 (0.00090) [2022-07-11 13:44:22,471][26022] Updated weights on worker 0-0, policy_version 1216386 (0.00085) [2022-07-11 13:44:24,050][26022] Updated weights on worker 0-0, policy_version 1216396 (0.00090) [2022-07-11 13:44:24,834][25689] Fps is (10 sec: 5686.2, 60 sec: 5624.4, 300 sec: 5628.4). Total num frames: 1245592576. Throughput: 0: 5893.6. Samples: 1245595872. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:24,834][25689] Avg episode reward: [(0, '2.012')] [2022-07-11 13:44:26,038][26022] Updated weights on worker 0-0, policy_version 1216406 (0.00110) [2022-07-11 13:44:27,579][26022] Updated weights on worker 0-0, policy_version 1216416 (0.00082) [2022-07-11 13:44:29,678][26022] Updated weights on worker 0-0, policy_version 1216426 (0.00842) [2022-07-11 13:44:29,863][25689] Fps is (10 sec: 5713.4, 60 sec: 5639.0, 300 sec: 5631.5). Total num frames: 1245621248. Throughput: 0: 5907.9. Samples: 1245629630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:29,864][25689] Avg episode reward: [(0, '1.051')] [2022-07-11 13:44:31,665][26022] Updated weights on worker 0-0, policy_version 1216436 (0.00083) [2022-07-11 13:44:33,085][26022] Updated weights on worker 0-0, policy_version 1216446 (0.00094) [2022-07-11 13:44:34,890][25689] Fps is (10 sec: 5600.3, 60 sec: 5622.4, 300 sec: 5629.5). Total num frames: 1245648896. Throughput: 0: 5068.2. Samples: 1245646718. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:34,891][25689] Avg episode reward: [(0, '-0.281')] [2022-07-11 13:44:35,226][26022] Updated weights on worker 0-0, policy_version 1216456 (0.00090) [2022-07-11 13:44:36,778][26022] Updated weights on worker 0-0, policy_version 1216466 (0.00086) [2022-07-11 13:44:38,647][26022] Updated weights on worker 0-0, policy_version 1216476 (0.00083) [2022-07-11 13:44:39,962][25689] Fps is (10 sec: 5678.5, 60 sec: 5637.1, 300 sec: 5631.8). Total num frames: 1245678592. Throughput: 0: 5920.4. Samples: 1245680814. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 13:44:39,962][25689] Avg episode reward: [(0, '-0.159')] [2022-07-11 13:44:40,327][26022] Updated weights on worker 0-0, policy_version 1216486 (0.00080) [2022-07-11 13:44:42,074][26022] Updated weights on worker 0-0, policy_version 1216496 (0.00084) [2022-07-11 13:44:44,077][26022] Updated weights on worker 0-0, policy_version 1216506 (0.00343) [2022-07-11 13:44:45,031][25689] Fps is (10 sec: 5654.8, 60 sec: 5615.1, 300 sec: 5627.1). Total num frames: 1245706240. Throughput: 0: 5890.6. Samples: 1245714846. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:44:45,032][25689] Avg episode reward: [(0, '-0.581')] [2022-07-11 13:44:46,048][26022] Updated weights on worker 0-0, policy_version 1216516 (0.00085) [2022-07-11 13:44:47,818][26022] Updated weights on worker 0-0, policy_version 1216526 (0.00087) [2022-07-11 13:44:49,508][26022] Updated weights on worker 0-0, policy_version 1216536 (0.00082) [2022-07-11 13:44:50,045][25689] Fps is (10 sec: 5687.2, 60 sec: 5631.5, 300 sec: 5633.8). Total num frames: 1245735936. Throughput: 0: 5063.8. Samples: 1245731824. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:44:50,045][25689] Avg episode reward: [(0, '-0.384')] [2022-07-11 13:44:51,435][26022] Updated weights on worker 0-0, policy_version 1216546 (0.00090) [2022-07-11 13:44:53,103][26022] Updated weights on worker 0-0, policy_version 1216556 (0.00093) [2022-07-11 13:44:55,058][25689] Fps is (10 sec: 5617.1, 60 sec: 5615.3, 300 sec: 5624.4). Total num frames: 1245762560. Throughput: 0: 5915.3. Samples: 1245766012. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:44:55,058][25689] Avg episode reward: [(0, '-0.109')] [2022-07-11 13:44:55,079][26022] Updated weights on worker 0-0, policy_version 1216566 (0.00089) [2022-07-11 13:44:56,679][26022] Updated weights on worker 0-0, policy_version 1216576 (0.00049) [2022-07-11 13:44:58,550][26022] Updated weights on worker 0-0, policy_version 1216586 (0.00093) [2022-07-11 13:45:00,144][25689] Fps is (10 sec: 5678.1, 60 sec: 5651.1, 300 sec: 5644.8). Total num frames: 1245793280. Throughput: 0: 5914.8. Samples: 1245800188. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:00,144][25689] Avg episode reward: [(0, '0.057')] [2022-07-11 13:45:00,592][26022] Updated weights on worker 0-0, policy_version 1216596 (0.00090) [2022-07-11 13:45:02,327][26022] Updated weights on worker 0-0, policy_version 1216606 (0.00089) [2022-07-11 13:45:04,392][26022] Updated weights on worker 0-0, policy_version 1216616 (0.00088) [2022-07-11 13:45:05,150][25689] Fps is (10 sec: 5681.7, 60 sec: 5634.4, 300 sec: 5635.0). Total num frames: 1245819904. Throughput: 0: 4990.2. Samples: 1245815244. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:05,151][25689] Avg episode reward: [(0, '-0.155')] [2022-07-11 13:45:06,083][26022] Updated weights on worker 0-0, policy_version 1216626 (0.00097) [2022-07-11 13:45:07,925][26022] Updated weights on worker 0-0, policy_version 1216636 (0.00084) [2022-07-11 13:45:09,602][26022] Updated weights on worker 0-0, policy_version 1216646 (0.00083) [2022-07-11 13:45:10,155][25689] Fps is (10 sec: 5421.3, 60 sec: 5608.0, 300 sec: 5632.6). Total num frames: 1245847552. Throughput: 0: 5843.1. Samples: 1245849330. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:10,155][25689] Avg episode reward: [(0, '-0.046')] [2022-07-11 13:45:11,601][26022] Updated weights on worker 0-0, policy_version 1216656 (0.00084) [2022-07-11 13:45:13,323][26022] Updated weights on worker 0-0, policy_version 1216666 (0.00087) [2022-07-11 13:45:15,111][26022] Updated weights on worker 0-0, policy_version 1216676 (0.00083) [2022-07-11 13:45:15,170][25689] Fps is (10 sec: 5620.7, 60 sec: 5643.6, 300 sec: 5637.3). Total num frames: 1245876224. Throughput: 0: 5853.2. Samples: 1245883736. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:15,171][25689] Avg episode reward: [(0, '-0.412')] [2022-07-11 13:45:16,819][26022] Updated weights on worker 0-0, policy_version 1216686 (0.00083) [2022-07-11 13:45:18,671][26022] Updated weights on worker 0-0, policy_version 1216696 (0.00097) [2022-07-11 13:45:20,251][25689] Fps is (10 sec: 5679.7, 60 sec: 5643.3, 300 sec: 5637.2). Total num frames: 1245904896. Throughput: 0: 5006.2. Samples: 1245900848. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:20,251][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 13:45:20,516][26022] Updated weights on worker 0-0, policy_version 1216706 (0.00080) [2022-07-11 13:45:22,483][26022] Updated weights on worker 0-0, policy_version 1216716 (0.00089) [2022-07-11 13:45:24,241][26022] Updated weights on worker 0-0, policy_version 1216726 (0.00087) [2022-07-11 13:45:25,263][25689] Fps is (10 sec: 5580.3, 60 sec: 5625.9, 300 sec: 5634.0). Total num frames: 1245932544. Throughput: 0: 5950.9. Samples: 1245934932. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:25,265][25689] Avg episode reward: [(0, '-0.263')] [2022-07-11 13:45:25,972][26022] Updated weights on worker 0-0, policy_version 1216736 (0.00087) [2022-07-11 13:45:27,715][26022] Updated weights on worker 0-0, policy_version 1216746 (0.00074) [2022-07-11 13:45:29,531][26022] Updated weights on worker 0-0, policy_version 1216756 (0.00082) [2022-07-11 13:45:30,283][25689] Fps is (10 sec: 5614.1, 60 sec: 5626.8, 300 sec: 5630.4). Total num frames: 1245961216. Throughput: 0: 5940.5. Samples: 1245968902. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:30,283][25689] Avg episode reward: [(0, '-0.321')] [2022-07-11 13:45:31,326][26022] Updated weights on worker 0-0, policy_version 1216766 (0.00085) [2022-07-11 13:45:33,122][26022] Updated weights on worker 0-0, policy_version 1216776 (0.00083) [2022-07-11 13:45:34,994][26022] Updated weights on worker 0-0, policy_version 1216786 (0.00084) [2022-07-11 13:45:35,318][25689] Fps is (10 sec: 5702.9, 60 sec: 5643.0, 300 sec: 5635.6). Total num frames: 1245989888. Throughput: 0: 5069.3. Samples: 1245985872. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:35,319][25689] Avg episode reward: [(0, '-1.380')] [2022-07-11 13:45:36,798][26022] Updated weights on worker 0-0, policy_version 1216796 (0.00083) [2022-07-11 13:45:38,633][26022] Updated weights on worker 0-0, policy_version 1216806 (0.00094) [2022-07-11 13:45:40,446][25689] Fps is (10 sec: 5743.3, 60 sec: 5637.7, 300 sec: 5638.2). Total num frames: 1246019584. Throughput: 0: 5878.0. Samples: 1246019552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:40,446][25689] Avg episode reward: [(0, '-0.589')] [2022-07-11 13:45:40,453][26022] Updated weights on worker 0-0, policy_version 1216816 (0.00087) [2022-07-11 13:45:42,185][26022] Updated weights on worker 0-0, policy_version 1216826 (0.00095) [2022-07-11 13:45:44,251][26022] Updated weights on worker 0-0, policy_version 1216836 (0.00086) [2022-07-11 13:45:45,486][25689] Fps is (10 sec: 5740.4, 60 sec: 5657.4, 300 sec: 5635.6). Total num frames: 1246048256. Throughput: 0: 5847.2. Samples: 1246053182. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:45,487][25689] Avg episode reward: [(0, '-0.261')] [2022-07-11 13:45:45,861][26022] Updated weights on worker 0-0, policy_version 1216846 (0.00082) [2022-07-11 13:45:47,909][26022] Updated weights on worker 0-0, policy_version 1216856 (0.00099) [2022-07-11 13:45:49,554][26022] Updated weights on worker 0-0, policy_version 1216866 (0.00085) [2022-07-11 13:45:50,496][25689] Fps is (10 sec: 5501.9, 60 sec: 5606.9, 300 sec: 5633.0). Total num frames: 1246074880. Throughput: 0: 5011.2. Samples: 1246070194. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:50,496][25689] Avg episode reward: [(0, '-0.153')] [2022-07-11 13:45:51,575][26022] Updated weights on worker 0-0, policy_version 1216876 (0.00089) [2022-07-11 13:45:53,195][26022] Updated weights on worker 0-0, policy_version 1216886 (0.00084) [2022-07-11 13:45:55,171][26022] Updated weights on worker 0-0, policy_version 1216896 (0.00092) [2022-07-11 13:45:55,504][25689] Fps is (10 sec: 5519.7, 60 sec: 5641.2, 300 sec: 5627.3). Total num frames: 1246103552. Throughput: 0: 5855.7. Samples: 1246104074. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:45:55,505][25689] Avg episode reward: [(0, '-0.031')] [2022-07-11 13:45:56,836][26022] Updated weights on worker 0-0, policy_version 1216906 (0.00080) [2022-07-11 13:45:58,747][26022] Updated weights on worker 0-0, policy_version 1216916 (0.00088) [2022-07-11 13:46:00,393][26022] Updated weights on worker 0-0, policy_version 1216926 (0.00079) [2022-07-11 13:46:00,617][25689] Fps is (10 sec: 5767.2, 60 sec: 5621.8, 300 sec: 5642.7). Total num frames: 1246133248. Throughput: 0: 5867.2. Samples: 1246137900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:00,617][25689] Avg episode reward: [(0, '1.817')] [2022-07-11 13:46:02,767][26022] Updated weights on worker 0-0, policy_version 1216936 (0.00084) [2022-07-11 13:46:04,460][26022] Updated weights on worker 0-0, policy_version 1216946 (0.00092) [2022-07-11 13:46:05,646][25689] Fps is (10 sec: 5452.0, 60 sec: 5602.7, 300 sec: 5638.9). Total num frames: 1246158848. Throughput: 0: 4935.6. Samples: 1246152684. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:05,647][25689] Avg episode reward: [(0, '1.918')] [2022-07-11 13:46:06,580][26022] Updated weights on worker 0-0, policy_version 1216956 (0.00096) [2022-07-11 13:46:07,934][26022] Updated weights on worker 0-0, policy_version 1216966 (0.00090) [2022-07-11 13:46:09,983][26022] Updated weights on worker 0-0, policy_version 1216976 (0.00101) [2022-07-11 13:46:10,732][25689] Fps is (10 sec: 5466.7, 60 sec: 5629.0, 300 sec: 5634.0). Total num frames: 1246188544. Throughput: 0: 5765.6. Samples: 1246186866. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:10,732][25689] Avg episode reward: [(0, '1.712')] [2022-07-11 13:46:11,500][26022] Updated weights on worker 0-0, policy_version 1216986 (0.00086) [2022-07-11 13:46:11,735][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:46:11,748][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001216987_1246194688.pth [2022-07-11 13:46:11,749][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001215004_1244164096.pth [2022-07-11 13:46:13,455][26022] Updated weights on worker 0-0, policy_version 1216996 (0.00082) [2022-07-11 13:46:15,282][26022] Updated weights on worker 0-0, policy_version 1217006 (0.00093) [2022-07-11 13:46:15,768][25689] Fps is (10 sec: 5766.4, 60 sec: 5627.1, 300 sec: 5634.6). Total num frames: 1246217216. Throughput: 0: 5764.5. Samples: 1246220888. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:15,769][25689] Avg episode reward: [(0, '0.694')] [2022-07-11 13:46:17,310][26022] Updated weights on worker 0-0, policy_version 1217016 (0.00058) [2022-07-11 13:46:18,975][26022] Updated weights on worker 0-0, policy_version 1217026 (0.00092) [2022-07-11 13:46:20,751][26022] Updated weights on worker 0-0, policy_version 1217036 (0.00091) [2022-07-11 13:46:20,843][25689] Fps is (10 sec: 5570.4, 60 sec: 5610.8, 300 sec: 5637.1). Total num frames: 1246244864. Throughput: 0: 4935.2. Samples: 1246237718. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:20,843][25689] Avg episode reward: [(0, '0.658')] [2022-07-11 13:46:22,487][26022] Updated weights on worker 0-0, policy_version 1217046 (0.00086) [2022-07-11 13:46:24,395][26022] Updated weights on worker 0-0, policy_version 1217056 (0.00089) [2022-07-11 13:46:25,909][25689] Fps is (10 sec: 5554.1, 60 sec: 5622.7, 300 sec: 5636.4). Total num frames: 1246273536. Throughput: 0: 5890.8. Samples: 1246272046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:25,909][25689] Avg episode reward: [(0, '0.534')] [2022-07-11 13:46:26,096][26022] Updated weights on worker 0-0, policy_version 1217066 (0.00093) [2022-07-11 13:46:28,052][26022] Updated weights on worker 0-0, policy_version 1217076 (0.00087) [2022-07-11 13:46:29,777][26022] Updated weights on worker 0-0, policy_version 1217086 (0.00081) [2022-07-11 13:46:30,918][25689] Fps is (10 sec: 5691.5, 60 sec: 5623.7, 300 sec: 5633.2). Total num frames: 1246302208. Throughput: 0: 5894.1. Samples: 1246305844. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:30,918][25689] Avg episode reward: [(0, '0.532')] [2022-07-11 13:46:31,726][26022] Updated weights on worker 0-0, policy_version 1217096 (0.00080) [2022-07-11 13:46:33,391][26022] Updated weights on worker 0-0, policy_version 1217106 (0.00081) [2022-07-11 13:46:35,262][26022] Updated weights on worker 0-0, policy_version 1217116 (0.00086) [2022-07-11 13:46:35,922][25689] Fps is (10 sec: 5624.4, 60 sec: 5609.7, 300 sec: 5632.2). Total num frames: 1246329856. Throughput: 0: 5921.1. Samples: 1246340222. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:35,923][25689] Avg episode reward: [(0, '0.398')] [2022-07-11 13:46:36,830][26022] Updated weights on worker 0-0, policy_version 1217126 (0.00097) [2022-07-11 13:46:39,057][26022] Updated weights on worker 0-0, policy_version 1217136 (0.00078) [2022-07-11 13:46:40,468][26022] Updated weights on worker 0-0, policy_version 1217146 (0.00080) [2022-07-11 13:46:40,966][25689] Fps is (10 sec: 5706.7, 60 sec: 5617.4, 300 sec: 5631.6). Total num frames: 1246359552. Throughput: 0: 5926.8. Samples: 1246356988. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:40,967][25689] Avg episode reward: [(0, '0.611')] [2022-07-11 13:46:42,773][26022] Updated weights on worker 0-0, policy_version 1217156 (0.00072) [2022-07-11 13:46:44,237][26022] Updated weights on worker 0-0, policy_version 1217166 (0.00091) [2022-07-11 13:46:46,008][25689] Fps is (10 sec: 5685.9, 60 sec: 5600.4, 300 sec: 5631.2). Total num frames: 1246387200. Throughput: 0: 5925.2. Samples: 1246391136. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:46,008][25689] Avg episode reward: [(0, '1.848')] [2022-07-11 13:46:46,109][26022] Updated weights on worker 0-0, policy_version 1217176 (0.00088) [2022-07-11 13:46:47,877][26022] Updated weights on worker 0-0, policy_version 1217186 (0.00086) [2022-07-11 13:46:49,702][26022] Updated weights on worker 0-0, policy_version 1217196 (0.00082) [2022-07-11 13:46:51,081][25689] Fps is (10 sec: 5568.1, 60 sec: 5628.3, 300 sec: 5633.7). Total num frames: 1246415872. Throughput: 0: 5928.7. Samples: 1246425386. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:51,083][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 13:46:51,486][26022] Updated weights on worker 0-0, policy_version 1217206 (0.00094) [2022-07-11 13:46:53,300][26022] Updated weights on worker 0-0, policy_version 1217216 (0.00087) [2022-07-11 13:46:54,952][26022] Updated weights on worker 0-0, policy_version 1217226 (0.00084) [2022-07-11 13:46:56,100][25689] Fps is (10 sec: 5682.1, 60 sec: 5627.3, 300 sec: 5627.2). Total num frames: 1246444544. Throughput: 0: 5071.5. Samples: 1246442552. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:46:56,100][25689] Avg episode reward: [(0, '0.445')] [2022-07-11 13:46:56,891][26022] Updated weights on worker 0-0, policy_version 1217236 (0.00088) [2022-07-11 13:46:58,448][26022] Updated weights on worker 0-0, policy_version 1217246 (0.00081) [2022-07-11 13:47:00,462][26022] Updated weights on worker 0-0, policy_version 1217256 (0.00086) [2022-07-11 13:47:01,151][25689] Fps is (10 sec: 5796.0, 60 sec: 5633.0, 300 sec: 5640.7). Total num frames: 1246474240. Throughput: 0: 5932.8. Samples: 1246476742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:01,152][25689] Avg episode reward: [(0, '0.563')] [2022-07-11 13:47:02,663][26022] Updated weights on worker 0-0, policy_version 1217266 (0.00079) [2022-07-11 13:47:04,340][26022] Updated weights on worker 0-0, policy_version 1217276 (0.00084) [2022-07-11 13:47:06,134][26022] Updated weights on worker 0-0, policy_version 1217286 (0.00089) [2022-07-11 13:47:06,218][25689] Fps is (10 sec: 5566.0, 60 sec: 5646.4, 300 sec: 5636.1). Total num frames: 1246500864. Throughput: 0: 5828.2. Samples: 1246508928. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:06,219][25689] Avg episode reward: [(0, '0.489')] [2022-07-11 13:47:07,992][26022] Updated weights on worker 0-0, policy_version 1217296 (0.00095) [2022-07-11 13:47:09,806][26022] Updated weights on worker 0-0, policy_version 1217306 (0.00088) [2022-07-11 13:47:11,270][25689] Fps is (10 sec: 5464.8, 60 sec: 5632.7, 300 sec: 5631.8). Total num frames: 1246529536. Throughput: 0: 4970.9. Samples: 1246525742. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:11,271][25689] Avg episode reward: [(0, '0.492')] [2022-07-11 13:47:11,530][26022] Updated weights on worker 0-0, policy_version 1217316 (0.00081) [2022-07-11 13:47:13,528][26022] Updated weights on worker 0-0, policy_version 1217326 (0.00090) [2022-07-11 13:47:15,351][26022] Updated weights on worker 0-0, policy_version 1217336 (0.00084) [2022-07-11 13:47:16,370][25689] Fps is (10 sec: 5648.8, 60 sec: 5626.8, 300 sec: 5639.1). Total num frames: 1246558208. Throughput: 0: 5804.0. Samples: 1246560200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:16,371][25689] Avg episode reward: [(0, '0.583')] [2022-07-11 13:47:17,091][26022] Updated weights on worker 0-0, policy_version 1217346 (0.00081) [2022-07-11 13:47:18,797][26022] Updated weights on worker 0-0, policy_version 1217356 (0.00086) [2022-07-11 13:47:20,757][26022] Updated weights on worker 0-0, policy_version 1217366 (0.00087) [2022-07-11 13:47:21,427][25689] Fps is (10 sec: 5645.9, 60 sec: 5645.3, 300 sec: 5636.5). Total num frames: 1246586880. Throughput: 0: 5806.4. Samples: 1246594468. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:21,427][25689] Avg episode reward: [(0, '1.916')] [2022-07-11 13:47:22,406][26022] Updated weights on worker 0-0, policy_version 1217376 (0.00086) [2022-07-11 13:47:24,275][26022] Updated weights on worker 0-0, policy_version 1217386 (0.00085) [2022-07-11 13:47:26,083][26022] Updated weights on worker 0-0, policy_version 1217396 (0.00100) [2022-07-11 13:47:26,433][25689] Fps is (10 sec: 5698.3, 60 sec: 5650.9, 300 sec: 5637.6). Total num frames: 1246615552. Throughput: 0: 5073.9. Samples: 1246611496. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:26,434][25689] Avg episode reward: [(0, '1.942')] [2022-07-11 13:47:27,851][26022] Updated weights on worker 0-0, policy_version 1217406 (0.00105) [2022-07-11 13:47:29,727][26022] Updated weights on worker 0-0, policy_version 1217416 (0.00082) [2022-07-11 13:47:31,470][25689] Fps is (10 sec: 5607.6, 60 sec: 5631.4, 300 sec: 5635.5). Total num frames: 1246643200. Throughput: 0: 5915.3. Samples: 1246645232. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:31,471][25689] Avg episode reward: [(0, '1.736')] [2022-07-11 13:47:31,594][26022] Updated weights on worker 0-0, policy_version 1217426 (0.00087) [2022-07-11 13:47:33,419][26022] Updated weights on worker 0-0, policy_version 1217436 (0.00085) [2022-07-11 13:47:35,267][26022] Updated weights on worker 0-0, policy_version 1217446 (0.00095) [2022-07-11 13:47:36,499][25689] Fps is (10 sec: 5696.9, 60 sec: 5662.9, 300 sec: 5639.2). Total num frames: 1246672896. Throughput: 0: 5905.0. Samples: 1246679064. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:36,500][25689] Avg episode reward: [(0, '1.556')] [2022-07-11 13:47:37,083][26022] Updated weights on worker 0-0, policy_version 1217456 (0.00087) [2022-07-11 13:47:38,973][26022] Updated weights on worker 0-0, policy_version 1217466 (0.00084) [2022-07-11 13:47:40,712][26022] Updated weights on worker 0-0, policy_version 1217476 (0.00086) [2022-07-11 13:47:41,546][25689] Fps is (10 sec: 5589.5, 60 sec: 5611.9, 300 sec: 5631.8). Total num frames: 1246699520. Throughput: 0: 5041.0. Samples: 1246695890. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:41,548][25689] Avg episode reward: [(0, '0.320')] [2022-07-11 13:47:42,530][26022] Updated weights on worker 0-0, policy_version 1217486 (0.00088) [2022-07-11 13:47:44,357][26022] Updated weights on worker 0-0, policy_version 1217496 (0.00097) [2022-07-11 13:47:46,173][26022] Updated weights on worker 0-0, policy_version 1217506 (0.00086) [2022-07-11 13:47:46,575][25689] Fps is (10 sec: 5488.2, 60 sec: 5630.0, 300 sec: 5632.0). Total num frames: 1246728192. Throughput: 0: 5863.7. Samples: 1246729598. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:46,576][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 13:47:48,020][26022] Updated weights on worker 0-0, policy_version 1217516 (0.00090) [2022-07-11 13:47:49,826][26022] Updated weights on worker 0-0, policy_version 1217526 (0.00087) [2022-07-11 13:47:51,584][25689] Fps is (10 sec: 5713.0, 60 sec: 5636.0, 300 sec: 5632.0). Total num frames: 1246756864. Throughput: 0: 5875.0. Samples: 1246763398. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:51,585][25689] Avg episode reward: [(0, '0.090')] [2022-07-11 13:47:51,597][26022] Updated weights on worker 0-0, policy_version 1217536 (0.00083) [2022-07-11 13:47:53,594][26022] Updated weights on worker 0-0, policy_version 1217546 (0.00086) [2022-07-11 13:47:55,127][26022] Updated weights on worker 0-0, policy_version 1217556 (0.00085) [2022-07-11 13:47:56,617][25689] Fps is (10 sec: 5608.0, 60 sec: 5617.7, 300 sec: 5629.6). Total num frames: 1246784512. Throughput: 0: 5039.4. Samples: 1246780448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:47:56,619][25689] Avg episode reward: [(0, '0.054')] [2022-07-11 13:47:57,114][26022] Updated weights on worker 0-0, policy_version 1217566 (0.00086) [2022-07-11 13:47:58,788][26022] Updated weights on worker 0-0, policy_version 1217576 (0.00084) [2022-07-11 13:48:00,706][26022] Updated weights on worker 0-0, policy_version 1217586 (0.00089) [2022-07-11 13:48:01,722][25689] Fps is (10 sec: 5555.0, 60 sec: 5595.8, 300 sec: 5636.0). Total num frames: 1246813184. Throughput: 0: 5888.0. Samples: 1246814686. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:01,723][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 13:48:02,685][26022] Updated weights on worker 0-0, policy_version 1217596 (0.00087) [2022-07-11 13:48:04,512][26022] Updated weights on worker 0-0, policy_version 1217606 (0.00087) [2022-07-11 13:48:06,245][26022] Updated weights on worker 0-0, policy_version 1217616 (0.00101) [2022-07-11 13:48:06,724][25689] Fps is (10 sec: 5572.9, 60 sec: 5618.8, 300 sec: 5630.9). Total num frames: 1246840832. Throughput: 0: 5804.0. Samples: 1246846542. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:06,724][25689] Avg episode reward: [(0, '0.486')] [2022-07-11 13:48:08,252][26022] Updated weights on worker 0-0, policy_version 1217626 (0.00090) [2022-07-11 13:48:09,945][26022] Updated weights on worker 0-0, policy_version 1217636 (0.00085) [2022-07-11 13:48:11,751][25689] Fps is (10 sec: 5513.5, 60 sec: 5604.1, 300 sec: 5624.3). Total num frames: 1246868480. Throughput: 0: 4951.3. Samples: 1246863254. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:11,752][25689] Avg episode reward: [(0, '1.330')] [2022-07-11 13:48:11,856][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:48:11,865][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001217646_1246869504.pth [2022-07-11 13:48:11,865][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001215664_1244839936.pth [2022-07-11 13:48:11,869][26022] Updated weights on worker 0-0, policy_version 1217646 (0.00086) [2022-07-11 13:48:13,578][26022] Updated weights on worker 0-0, policy_version 1217656 (0.00091) [2022-07-11 13:48:15,531][26022] Updated weights on worker 0-0, policy_version 1217666 (0.00092) [2022-07-11 13:48:16,828][25689] Fps is (10 sec: 5574.0, 60 sec: 5606.3, 300 sec: 5627.6). Total num frames: 1246897152. Throughput: 0: 5795.5. Samples: 1246897576. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:16,828][25689] Avg episode reward: [(0, '1.042')] [2022-07-11 13:48:17,331][26022] Updated weights on worker 0-0, policy_version 1217676 (0.00088) [2022-07-11 13:48:19,140][26022] Updated weights on worker 0-0, policy_version 1217686 (0.00084) [2022-07-11 13:48:20,792][26022] Updated weights on worker 0-0, policy_version 1217696 (0.00078) [2022-07-11 13:48:21,951][25689] Fps is (10 sec: 5622.1, 60 sec: 5600.1, 300 sec: 5623.1). Total num frames: 1246925824. Throughput: 0: 5777.5. Samples: 1246931560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:21,952][25689] Avg episode reward: [(0, '1.165')] [2022-07-11 13:48:22,609][26022] Updated weights on worker 0-0, policy_version 1217706 (0.00081) [2022-07-11 13:48:24,433][26022] Updated weights on worker 0-0, policy_version 1217716 (0.00083) [2022-07-11 13:48:26,123][26022] Updated weights on worker 0-0, policy_version 1217726 (0.00100) [2022-07-11 13:48:26,985][25689] Fps is (10 sec: 5645.4, 60 sec: 5597.6, 300 sec: 5625.9). Total num frames: 1246954496. Throughput: 0: 5052.0. Samples: 1246948904. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:26,986][25689] Avg episode reward: [(0, '1.286')] [2022-07-11 13:48:28,154][26022] Updated weights on worker 0-0, policy_version 1217736 (0.00084) [2022-07-11 13:48:29,951][26022] Updated weights on worker 0-0, policy_version 1217746 (0.00091) [2022-07-11 13:48:31,669][26022] Updated weights on worker 0-0, policy_version 1217756 (0.00086) [2022-07-11 13:48:32,068][25689] Fps is (10 sec: 5769.5, 60 sec: 5627.2, 300 sec: 5628.4). Total num frames: 1246984192. Throughput: 0: 5894.3. Samples: 1246983004. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:32,068][25689] Avg episode reward: [(0, '0.942')] [2022-07-11 13:48:33,599][26022] Updated weights on worker 0-0, policy_version 1217766 (0.00081) [2022-07-11 13:48:35,194][26022] Updated weights on worker 0-0, policy_version 1217776 (0.00086) [2022-07-11 13:48:37,091][25689] Fps is (10 sec: 5775.6, 60 sec: 5610.8, 300 sec: 5628.9). Total num frames: 1247012864. Throughput: 0: 5917.0. Samples: 1247017476. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2022-07-11 13:48:37,092][25689] Avg episode reward: [(0, '1.172')] [2022-07-11 13:48:37,098][26022] Updated weights on worker 0-0, policy_version 1217786 (0.00081) [2022-07-11 13:48:38,926][26022] Updated weights on worker 0-0, policy_version 1217796 (0.00088) [2022-07-11 13:48:40,680][26022] Updated weights on worker 0-0, policy_version 1217806 (0.00084) [2022-07-11 13:48:42,196][25689] Fps is (10 sec: 5661.5, 60 sec: 5639.1, 300 sec: 5627.2). Total num frames: 1247041536. Throughput: 0: 5906.3. Samples: 1247051134. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:48:42,197][25689] Avg episode reward: [(0, '1.148')] [2022-07-11 13:48:42,487][26022] Updated weights on worker 0-0, policy_version 1217816 (0.00085) [2022-07-11 13:48:44,382][26022] Updated weights on worker 0-0, policy_version 1217826 (0.00094) [2022-07-11 13:48:45,942][26022] Updated weights on worker 0-0, policy_version 1217836 (0.00094) [2022-07-11 13:48:47,224][25689] Fps is (10 sec: 5659.2, 60 sec: 5639.2, 300 sec: 5626.8). Total num frames: 1247070208. Throughput: 0: 5893.1. Samples: 1247068172. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:48:47,226][25689] Avg episode reward: [(0, '1.262')] [2022-07-11 13:48:47,999][26022] Updated weights on worker 0-0, policy_version 1217846 (0.00110) [2022-07-11 13:48:49,597][26022] Updated weights on worker 0-0, policy_version 1217856 (0.00088) [2022-07-11 13:48:51,544][26022] Updated weights on worker 0-0, policy_version 1217866 (0.00082) [2022-07-11 13:48:52,226][25689] Fps is (10 sec: 5819.3, 60 sec: 5656.7, 300 sec: 5634.0). Total num frames: 1247099904. Throughput: 0: 5920.3. Samples: 1247102350. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:48:52,228][25689] Avg episode reward: [(0, '1.185')] [2022-07-11 13:48:53,123][26022] Updated weights on worker 0-0, policy_version 1217876 (0.00085) [2022-07-11 13:48:54,866][26022] Updated weights on worker 0-0, policy_version 1217886 (0.00090) [2022-07-11 13:48:56,930][26022] Updated weights on worker 0-0, policy_version 1217896 (0.00077) [2022-07-11 13:48:57,239][25689] Fps is (10 sec: 5623.7, 60 sec: 5641.9, 300 sec: 5628.9). Total num frames: 1247126528. Throughput: 0: 5925.1. Samples: 1247136852. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:48:57,240][25689] Avg episode reward: [(0, '0.501')] [2022-07-11 13:48:58,741][26022] Updated weights on worker 0-0, policy_version 1217906 (0.00084) [2022-07-11 13:49:00,366][26022] Updated weights on worker 0-0, policy_version 1217916 (0.00082) [2022-07-11 13:49:02,310][25689] Fps is (10 sec: 5382.1, 60 sec: 5628.0, 300 sec: 5627.7). Total num frames: 1247154176. Throughput: 0: 5106.1. Samples: 1247153836. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:02,311][25689] Avg episode reward: [(0, '0.364')] [2022-07-11 13:49:02,915][26022] Updated weights on worker 0-0, policy_version 1217926 (0.00086) [2022-07-11 13:49:04,321][26022] Updated weights on worker 0-0, policy_version 1217936 (0.00092) [2022-07-11 13:49:06,431][26022] Updated weights on worker 0-0, policy_version 1217946 (0.00082) [2022-07-11 13:49:07,364][25689] Fps is (10 sec: 5562.4, 60 sec: 5640.1, 300 sec: 5624.9). Total num frames: 1247182848. Throughput: 0: 5839.4. Samples: 1247185776. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:07,364][25689] Avg episode reward: [(0, '0.820')] [2022-07-11 13:49:07,911][26022] Updated weights on worker 0-0, policy_version 1217956 (0.00087) [2022-07-11 13:49:09,934][26022] Updated weights on worker 0-0, policy_version 1217966 (0.00088) [2022-07-11 13:49:11,658][26022] Updated weights on worker 0-0, policy_version 1217976 (0.00087) [2022-07-11 13:49:12,381][25689] Fps is (10 sec: 5592.7, 60 sec: 5641.1, 300 sec: 5628.7). Total num frames: 1247210496. Throughput: 0: 5830.3. Samples: 1247219854. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:12,381][25689] Avg episode reward: [(0, '0.792')] [2022-07-11 13:49:13,339][26022] Updated weights on worker 0-0, policy_version 1217986 (0.00083) [2022-07-11 13:49:15,440][26022] Updated weights on worker 0-0, policy_version 1217996 (0.00090) [2022-07-11 13:49:17,111][26022] Updated weights on worker 0-0, policy_version 1218006 (0.00086) [2022-07-11 13:49:17,462][25689] Fps is (10 sec: 5678.9, 60 sec: 5657.6, 300 sec: 5632.0). Total num frames: 1247240192. Throughput: 0: 4953.2. Samples: 1247237020. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:17,462][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 13:49:18,869][26022] Updated weights on worker 0-0, policy_version 1218016 (0.00083) [2022-07-11 13:49:20,903][26022] Updated weights on worker 0-0, policy_version 1218026 (0.00085) [2022-07-11 13:49:22,467][26022] Updated weights on worker 0-0, policy_version 1218036 (0.00086) [2022-07-11 13:49:22,605][25689] Fps is (10 sec: 5808.7, 60 sec: 5672.6, 300 sec: 5632.9). Total num frames: 1247269888. Throughput: 0: 5781.6. Samples: 1247271174. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:22,606][25689] Avg episode reward: [(0, '-0.395')] [2022-07-11 13:49:24,503][26022] Updated weights on worker 0-0, policy_version 1218046 (0.00085) [2022-07-11 13:49:26,147][26022] Updated weights on worker 0-0, policy_version 1218056 (0.00081) [2022-07-11 13:49:27,642][25689] Fps is (10 sec: 5532.0, 60 sec: 5638.5, 300 sec: 5625.9). Total num frames: 1247296512. Throughput: 0: 5890.0. Samples: 1247305216. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:27,643][25689] Avg episode reward: [(0, '-0.192')] [2022-07-11 13:49:27,955][26022] Updated weights on worker 0-0, policy_version 1218066 (0.00078) [2022-07-11 13:49:29,864][26022] Updated weights on worker 0-0, policy_version 1218076 (0.00082) [2022-07-11 13:49:31,541][26022] Updated weights on worker 0-0, policy_version 1218086 (0.00395) [2022-07-11 13:49:32,652][25689] Fps is (10 sec: 5606.0, 60 sec: 5645.3, 300 sec: 5633.1). Total num frames: 1247326208. Throughput: 0: 5054.8. Samples: 1247322320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:32,652][25689] Avg episode reward: [(0, '-0.251')] [2022-07-11 13:49:33,415][26022] Updated weights on worker 0-0, policy_version 1218096 (0.00083) [2022-07-11 13:49:35,042][26022] Updated weights on worker 0-0, policy_version 1218106 (0.00619) [2022-07-11 13:49:36,988][26022] Updated weights on worker 0-0, policy_version 1218116 (0.00083) [2022-07-11 13:49:37,667][25689] Fps is (10 sec: 5720.5, 60 sec: 5629.2, 300 sec: 5627.3). Total num frames: 1247353856. Throughput: 0: 5929.4. Samples: 1247356824. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:37,667][25689] Avg episode reward: [(0, '-0.967')] [2022-07-11 13:49:38,754][26022] Updated weights on worker 0-0, policy_version 1218126 (0.00087) [2022-07-11 13:49:40,693][26022] Updated weights on worker 0-0, policy_version 1218136 (0.00094) [2022-07-11 13:49:42,360][26022] Updated weights on worker 0-0, policy_version 1218146 (0.00087) [2022-07-11 13:49:42,781][25689] Fps is (10 sec: 5762.3, 60 sec: 5662.2, 300 sec: 5636.7). Total num frames: 1247384576. Throughput: 0: 5922.1. Samples: 1247390656. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:42,781][25689] Avg episode reward: [(0, '-1.560')] [2022-07-11 13:49:44,131][26022] Updated weights on worker 0-0, policy_version 1218156 (0.00083) [2022-07-11 13:49:46,121][26022] Updated weights on worker 0-0, policy_version 1218166 (0.00090) [2022-07-11 13:49:47,719][26022] Updated weights on worker 0-0, policy_version 1218176 (0.00090) [2022-07-11 13:49:47,847][25689] Fps is (10 sec: 5733.5, 60 sec: 5641.7, 300 sec: 5628.9). Total num frames: 1247412224. Throughput: 0: 5076.5. Samples: 1247407784. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:47,847][25689] Avg episode reward: [(0, '-1.713')] [2022-07-11 13:49:49,833][26022] Updated weights on worker 0-0, policy_version 1218186 (0.00082) [2022-07-11 13:49:51,163][26022] Updated weights on worker 0-0, policy_version 1218196 (0.00068) [2022-07-11 13:49:52,933][25689] Fps is (10 sec: 5446.7, 60 sec: 5600.2, 300 sec: 5630.9). Total num frames: 1247439872. Throughput: 0: 5891.0. Samples: 1247441798. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:52,934][25689] Avg episode reward: [(0, '-1.527')] [2022-07-11 13:49:53,323][26022] Updated weights on worker 0-0, policy_version 1218206 (0.00086) [2022-07-11 13:49:54,829][26022] Updated weights on worker 0-0, policy_version 1218216 (0.00084) [2022-07-11 13:49:56,808][26022] Updated weights on worker 0-0, policy_version 1218226 (0.00611) [2022-07-11 13:49:58,002][25689] Fps is (10 sec: 5747.4, 60 sec: 5662.3, 300 sec: 5631.2). Total num frames: 1247470592. Throughput: 0: 5870.5. Samples: 1247476204. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:49:58,003][25689] Avg episode reward: [(0, '-1.004')] [2022-07-11 13:49:58,825][26022] Updated weights on worker 0-0, policy_version 1218236 (0.00100) [2022-07-11 13:50:00,275][26022] Updated weights on worker 0-0, policy_version 1218246 (0.00079) [2022-07-11 13:50:02,651][26022] Updated weights on worker 0-0, policy_version 1218256 (0.00092) [2022-07-11 13:50:03,078][25689] Fps is (10 sec: 5551.7, 60 sec: 5628.3, 300 sec: 5626.5). Total num frames: 1247496192. Throughput: 0: 5049.0. Samples: 1247493136. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:03,078][25689] Avg episode reward: [(0, '-1.292')] [2022-07-11 13:50:04,573][26022] Updated weights on worker 0-0, policy_version 1218266 (0.00090) [2022-07-11 13:50:06,130][26022] Updated weights on worker 0-0, policy_version 1218276 (0.00097) [2022-07-11 13:50:08,111][25689] Fps is (10 sec: 5267.8, 60 sec: 5613.3, 300 sec: 5626.0). Total num frames: 1247523840. Throughput: 0: 5777.8. Samples: 1247524864. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:08,111][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 13:50:08,157][26022] Updated weights on worker 0-0, policy_version 1218286 (0.00085) [2022-07-11 13:50:09,685][26022] Updated weights on worker 0-0, policy_version 1218296 (0.00084) [2022-07-11 13:50:11,828][26022] Updated weights on worker 0-0, policy_version 1218306 (0.00084) [2022-07-11 13:50:11,899][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:50:11,910][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001218307_1247546368.pth [2022-07-11 13:50:11,911][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001216325_1245516800.pth [2022-07-11 13:50:13,174][25689] Fps is (10 sec: 5679.5, 60 sec: 5642.7, 300 sec: 5628.5). Total num frames: 1247553536. Throughput: 0: 5764.0. Samples: 1247558468. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:13,175][25689] Avg episode reward: [(0, '0.723')] [2022-07-11 13:50:13,640][26022] Updated weights on worker 0-0, policy_version 1218316 (0.00086) [2022-07-11 13:50:15,237][26022] Updated weights on worker 0-0, policy_version 1218326 (0.00091) [2022-07-11 13:50:17,297][26022] Updated weights on worker 0-0, policy_version 1218336 (0.00088) [2022-07-11 13:50:18,180][25689] Fps is (10 sec: 5796.6, 60 sec: 5632.8, 300 sec: 5629.9). Total num frames: 1247582208. Throughput: 0: 4915.7. Samples: 1247575392. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:18,181][25689] Avg episode reward: [(0, '1.020')] [2022-07-11 13:50:18,730][26022] Updated weights on worker 0-0, policy_version 1218346 (0.00093) [2022-07-11 13:50:20,794][26022] Updated weights on worker 0-0, policy_version 1218356 (0.00086) [2022-07-11 13:50:22,636][26022] Updated weights on worker 0-0, policy_version 1218366 (0.00081) [2022-07-11 13:50:23,275][25689] Fps is (10 sec: 5576.1, 60 sec: 5603.6, 300 sec: 5628.3). Total num frames: 1247609856. Throughput: 0: 5762.2. Samples: 1247609514. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:23,275][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 13:50:24,298][26022] Updated weights on worker 0-0, policy_version 1218376 (0.00082) [2022-07-11 13:50:26,296][26022] Updated weights on worker 0-0, policy_version 1218386 (0.00092) [2022-07-11 13:50:27,964][26022] Updated weights on worker 0-0, policy_version 1218396 (0.00087) [2022-07-11 13:50:28,281][25689] Fps is (10 sec: 5677.3, 60 sec: 5657.1, 300 sec: 5632.0). Total num frames: 1247639552. Throughput: 0: 5884.0. Samples: 1247643544. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:28,281][25689] Avg episode reward: [(0, '1.458')] [2022-07-11 13:50:29,875][26022] Updated weights on worker 0-0, policy_version 1218406 (0.00090) [2022-07-11 13:50:31,701][26022] Updated weights on worker 0-0, policy_version 1218416 (0.00090) [2022-07-11 13:50:33,300][25689] Fps is (10 sec: 5719.8, 60 sec: 5622.4, 300 sec: 5628.9). Total num frames: 1247667200. Throughput: 0: 5076.0. Samples: 1247660626. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:33,302][25689] Avg episode reward: [(0, '0.476')] [2022-07-11 13:50:33,447][26022] Updated weights on worker 0-0, policy_version 1218426 (0.00610) [2022-07-11 13:50:35,347][26022] Updated weights on worker 0-0, policy_version 1218436 (0.00087) [2022-07-11 13:50:37,068][26022] Updated weights on worker 0-0, policy_version 1218446 (0.00088) [2022-07-11 13:50:38,316][25689] Fps is (10 sec: 5612.4, 60 sec: 5639.3, 300 sec: 5627.6). Total num frames: 1247695872. Throughput: 0: 5927.0. Samples: 1247694736. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:38,316][25689] Avg episode reward: [(0, '0.379')] [2022-07-11 13:50:38,960][26022] Updated weights on worker 0-0, policy_version 1218456 (0.00083) [2022-07-11 13:50:40,599][26022] Updated weights on worker 0-0, policy_version 1218466 (0.00084) [2022-07-11 13:50:42,557][26022] Updated weights on worker 0-0, policy_version 1218476 (0.00078) [2022-07-11 13:50:43,446][25689] Fps is (10 sec: 5550.9, 60 sec: 5587.1, 300 sec: 5622.4). Total num frames: 1247723520. Throughput: 0: 5906.8. Samples: 1247728664. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:43,447][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 13:50:44,229][26022] Updated weights on worker 0-0, policy_version 1218486 (0.00085) [2022-07-11 13:50:46,414][26022] Updated weights on worker 0-0, policy_version 1218496 (0.00082) [2022-07-11 13:50:47,804][26022] Updated weights on worker 0-0, policy_version 1218506 (0.00084) [2022-07-11 13:50:48,483][25689] Fps is (10 sec: 5640.2, 60 sec: 5623.6, 300 sec: 5632.3). Total num frames: 1247753216. Throughput: 0: 5044.0. Samples: 1247745444. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:48,483][25689] Avg episode reward: [(0, '-0.379')] [2022-07-11 13:50:49,818][26022] Updated weights on worker 0-0, policy_version 1218516 (0.00084) [2022-07-11 13:50:51,591][26022] Updated weights on worker 0-0, policy_version 1218526 (0.00083) [2022-07-11 13:50:53,257][26022] Updated weights on worker 0-0, policy_version 1218536 (0.00081) [2022-07-11 13:50:53,497][25689] Fps is (10 sec: 5807.2, 60 sec: 5647.2, 300 sec: 5632.1). Total num frames: 1247781888. Throughput: 0: 5871.6. Samples: 1247779214. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:53,498][25689] Avg episode reward: [(0, '0.330')] [2022-07-11 13:50:55,306][26022] Updated weights on worker 0-0, policy_version 1218546 (0.00091) [2022-07-11 13:50:56,833][26022] Updated weights on worker 0-0, policy_version 1218556 (0.00084) [2022-07-11 13:50:58,502][25689] Fps is (10 sec: 5416.8, 60 sec: 5568.6, 300 sec: 5620.4). Total num frames: 1247807488. Throughput: 0: 5874.0. Samples: 1247813310. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:50:58,502][25689] Avg episode reward: [(0, '1.153')] [2022-07-11 13:50:58,911][26022] Updated weights on worker 0-0, policy_version 1218566 (0.00227) [2022-07-11 13:51:00,660][26022] Updated weights on worker 0-0, policy_version 1218576 (0.00089) [2022-07-11 13:51:02,774][26022] Updated weights on worker 0-0, policy_version 1218586 (0.00083) [2022-07-11 13:51:03,596][25689] Fps is (10 sec: 5475.6, 60 sec: 5634.6, 300 sec: 5633.0). Total num frames: 1247837184. Throughput: 0: 5786.5. Samples: 1247845260. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:03,596][25689] Avg episode reward: [(0, '1.100')] [2022-07-11 13:51:04,787][26022] Updated weights on worker 0-0, policy_version 1218596 (0.00088) [2022-07-11 13:51:06,359][26022] Updated weights on worker 0-0, policy_version 1218606 (0.00081) [2022-07-11 13:51:08,236][26022] Updated weights on worker 0-0, policy_version 1218616 (0.00086) [2022-07-11 13:51:08,621][25689] Fps is (10 sec: 5667.1, 60 sec: 5635.3, 300 sec: 5627.2). Total num frames: 1247864832. Throughput: 0: 5798.1. Samples: 1247862206. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:08,621][25689] Avg episode reward: [(0, '1.314')] [2022-07-11 13:51:10,031][26022] Updated weights on worker 0-0, policy_version 1218626 (0.00089) [2022-07-11 13:51:11,707][26022] Updated weights on worker 0-0, policy_version 1218636 (0.00094) [2022-07-11 13:51:13,637][25689] Fps is (10 sec: 5506.8, 60 sec: 5605.8, 300 sec: 5624.2). Total num frames: 1247892480. Throughput: 0: 5819.1. Samples: 1247896412. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:13,638][25689] Avg episode reward: [(0, '-0.027')] [2022-07-11 13:51:13,789][26022] Updated weights on worker 0-0, policy_version 1218646 (0.00086) [2022-07-11 13:51:15,404][26022] Updated weights on worker 0-0, policy_version 1218656 (0.00076) [2022-07-11 13:51:17,329][26022] Updated weights on worker 0-0, policy_version 1218666 (0.00089) [2022-07-11 13:51:18,647][25689] Fps is (10 sec: 5617.2, 60 sec: 5605.5, 300 sec: 5628.8). Total num frames: 1247921152. Throughput: 0: 5809.4. Samples: 1247930340. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:18,647][25689] Avg episode reward: [(0, '-0.370')] [2022-07-11 13:51:19,120][26022] Updated weights on worker 0-0, policy_version 1218676 (0.00089) [2022-07-11 13:51:20,803][26022] Updated weights on worker 0-0, policy_version 1218686 (0.00082) [2022-07-11 13:51:22,716][26022] Updated weights on worker 0-0, policy_version 1218696 (0.00091) [2022-07-11 13:51:23,734][25689] Fps is (10 sec: 5679.5, 60 sec: 5623.1, 300 sec: 5628.4). Total num frames: 1247949824. Throughput: 0: 5081.2. Samples: 1247947586. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:23,734][25689] Avg episode reward: [(0, '-0.288')] [2022-07-11 13:51:24,428][26022] Updated weights on worker 0-0, policy_version 1218706 (0.00089) [2022-07-11 13:51:26,302][26022] Updated weights on worker 0-0, policy_version 1218716 (0.00086) [2022-07-11 13:51:28,016][26022] Updated weights on worker 0-0, policy_version 1218726 (0.00061) [2022-07-11 13:51:28,744][25689] Fps is (10 sec: 5679.0, 60 sec: 5605.7, 300 sec: 5628.4). Total num frames: 1247978496. Throughput: 0: 5937.2. Samples: 1247981686. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:28,745][25689] Avg episode reward: [(0, '-0.318')] [2022-07-11 13:51:29,905][26022] Updated weights on worker 0-0, policy_version 1218736 (0.00088) [2022-07-11 13:51:31,659][26022] Updated weights on worker 0-0, policy_version 1218746 (0.00094) [2022-07-11 13:51:33,593][26022] Updated weights on worker 0-0, policy_version 1218756 (0.00089) [2022-07-11 13:51:33,755][25689] Fps is (10 sec: 5722.1, 60 sec: 5623.5, 300 sec: 5631.7). Total num frames: 1248007168. Throughput: 0: 5952.6. Samples: 1248016168. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:33,756][25689] Avg episode reward: [(0, '-1.165')] [2022-07-11 13:51:35,252][26022] Updated weights on worker 0-0, policy_version 1218766 (0.00091) [2022-07-11 13:51:37,039][26022] Updated weights on worker 0-0, policy_version 1218776 (0.00084) [2022-07-11 13:51:38,781][25689] Fps is (10 sec: 5713.4, 60 sec: 5622.5, 300 sec: 5628.6). Total num frames: 1248035840. Throughput: 0: 5109.0. Samples: 1248033206. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:38,783][25689] Avg episode reward: [(0, '-0.156')] [2022-07-11 13:51:38,862][26022] Updated weights on worker 0-0, policy_version 1218786 (0.00089) [2022-07-11 13:51:40,890][26022] Updated weights on worker 0-0, policy_version 1218796 (0.00094) [2022-07-11 13:51:42,487][26022] Updated weights on worker 0-0, policy_version 1218806 (0.00086) [2022-07-11 13:51:43,903][25689] Fps is (10 sec: 5550.2, 60 sec: 5623.3, 300 sec: 5627.1). Total num frames: 1248063488. Throughput: 0: 5925.9. Samples: 1248067106. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:43,905][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 13:51:44,359][26022] Updated weights on worker 0-0, policy_version 1218816 (0.00088) [2022-07-11 13:51:46,292][26022] Updated weights on worker 0-0, policy_version 1218826 (0.00088) [2022-07-11 13:51:47,873][26022] Updated weights on worker 0-0, policy_version 1218836 (0.00085) [2022-07-11 13:51:48,942][25689] Fps is (10 sec: 5543.1, 60 sec: 5606.2, 300 sec: 5627.8). Total num frames: 1248092160. Throughput: 0: 5902.4. Samples: 1248100898. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:48,942][25689] Avg episode reward: [(0, '0.583')] [2022-07-11 13:51:49,875][26022] Updated weights on worker 0-0, policy_version 1218846 (0.00091) [2022-07-11 13:51:51,531][26022] Updated weights on worker 0-0, policy_version 1218856 (0.00092) [2022-07-11 13:51:53,376][26022] Updated weights on worker 0-0, policy_version 1218866 (0.00098) [2022-07-11 13:51:53,980][25689] Fps is (10 sec: 5792.3, 60 sec: 5620.9, 300 sec: 5630.8). Total num frames: 1248121856. Throughput: 0: 5025.6. Samples: 1248117806. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:53,981][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 13:51:55,379][26022] Updated weights on worker 0-0, policy_version 1218876 (0.00084) [2022-07-11 13:51:57,008][26022] Updated weights on worker 0-0, policy_version 1218886 (0.00078) [2022-07-11 13:51:58,928][26022] Updated weights on worker 0-0, policy_version 1218896 (0.00090) [2022-07-11 13:51:59,006][25689] Fps is (10 sec: 5697.8, 60 sec: 5652.8, 300 sec: 5624.4). Total num frames: 1248149504. Throughput: 0: 5872.5. Samples: 1248151976. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:51:59,007][25689] Avg episode reward: [(0, '0.432')] [2022-07-11 13:52:00,859][26022] Updated weights on worker 0-0, policy_version 1218906 (0.00904) [2022-07-11 13:52:02,645][26022] Updated weights on worker 0-0, policy_version 1218916 (0.00084) [2022-07-11 13:52:04,056][25689] Fps is (10 sec: 5386.2, 60 sec: 5606.1, 300 sec: 5624.8). Total num frames: 1248176128. Throughput: 0: 5796.9. Samples: 1248183930. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:04,056][25689] Avg episode reward: [(0, '1.031')] [2022-07-11 13:52:04,717][26022] Updated weights on worker 0-0, policy_version 1218926 (0.00084) [2022-07-11 13:52:06,211][26022] Updated weights on worker 0-0, policy_version 1218936 (0.00087) [2022-07-11 13:52:08,151][26022] Updated weights on worker 0-0, policy_version 1218946 (0.00098) [2022-07-11 13:52:09,073][25689] Fps is (10 sec: 5594.5, 60 sec: 5640.7, 300 sec: 5628.8). Total num frames: 1248205824. Throughput: 0: 4972.4. Samples: 1248200998. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:09,075][25689] Avg episode reward: [(0, '1.092')] [2022-07-11 13:52:10,271][26022] Updated weights on worker 0-0, policy_version 1218956 (0.00084) [2022-07-11 13:52:11,752][26022] Updated weights on worker 0-0, policy_version 1218966 (0.00085) [2022-07-11 13:52:11,950][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:52:11,961][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001218968_1248223232.pth [2022-07-11 13:52:11,962][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001216987_1246194688.pth [2022-07-11 13:52:13,731][26022] Updated weights on worker 0-0, policy_version 1218976 (0.00090) [2022-07-11 13:52:14,088][25689] Fps is (10 sec: 5716.1, 60 sec: 5640.8, 300 sec: 5627.0). Total num frames: 1248233472. Throughput: 0: 5835.5. Samples: 1248235146. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:14,090][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 13:52:15,519][26022] Updated weights on worker 0-0, policy_version 1218986 (0.00085) [2022-07-11 13:52:17,300][26022] Updated weights on worker 0-0, policy_version 1218996 (0.00088) [2022-07-11 13:52:19,116][25689] Fps is (10 sec: 5505.9, 60 sec: 5622.2, 300 sec: 5624.1). Total num frames: 1248261120. Throughput: 0: 5802.6. Samples: 1248268666. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:19,117][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 13:52:19,316][26022] Updated weights on worker 0-0, policy_version 1219006 (0.00083) [2022-07-11 13:52:20,898][26022] Updated weights on worker 0-0, policy_version 1219016 (0.00087) [2022-07-11 13:52:22,838][26022] Updated weights on worker 0-0, policy_version 1219026 (0.00087) [2022-07-11 13:52:24,235][25689] Fps is (10 sec: 5651.3, 60 sec: 5636.1, 300 sec: 5625.4). Total num frames: 1248290816. Throughput: 0: 5038.7. Samples: 1248285606. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:24,235][25689] Avg episode reward: [(0, '1.522')] [2022-07-11 13:52:24,642][26022] Updated weights on worker 0-0, policy_version 1219036 (0.00088) [2022-07-11 13:52:26,348][26022] Updated weights on worker 0-0, policy_version 1219046 (0.00543) [2022-07-11 13:52:28,216][26022] Updated weights on worker 0-0, policy_version 1219056 (0.00083) [2022-07-11 13:52:29,256][25689] Fps is (10 sec: 5756.0, 60 sec: 5635.1, 300 sec: 5629.2). Total num frames: 1248319488. Throughput: 0: 5876.1. Samples: 1248319598. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:29,258][25689] Avg episode reward: [(0, '1.869')] [2022-07-11 13:52:29,967][26022] Updated weights on worker 0-0, policy_version 1219066 (0.00088) [2022-07-11 13:52:31,930][26022] Updated weights on worker 0-0, policy_version 1219076 (0.00086) [2022-07-11 13:52:33,721][26022] Updated weights on worker 0-0, policy_version 1219086 (0.00087) [2022-07-11 13:52:34,264][25689] Fps is (10 sec: 5615.4, 60 sec: 5618.5, 300 sec: 5622.7). Total num frames: 1248347136. Throughput: 0: 5883.2. Samples: 1248353848. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:34,266][25689] Avg episode reward: [(0, '1.933')] [2022-07-11 13:52:35,437][26022] Updated weights on worker 0-0, policy_version 1219096 (0.00081) [2022-07-11 13:52:37,176][26022] Updated weights on worker 0-0, policy_version 1219106 (0.00095) [2022-07-11 13:52:39,126][26022] Updated weights on worker 0-0, policy_version 1219116 (0.00087) [2022-07-11 13:52:39,293][25689] Fps is (10 sec: 5611.4, 60 sec: 5618.2, 300 sec: 5629.9). Total num frames: 1248375808. Throughput: 0: 5063.6. Samples: 1248370834. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 13:52:39,293][25689] Avg episode reward: [(0, '1.822')] [2022-07-11 13:52:40,707][26022] Updated weights on worker 0-0, policy_version 1219126 (0.00082) [2022-07-11 13:52:42,763][26022] Updated weights on worker 0-0, policy_version 1219136 (0.00084) [2022-07-11 13:52:44,423][25689] Fps is (10 sec: 5644.8, 60 sec: 5634.3, 300 sec: 5628.0). Total num frames: 1248404480. Throughput: 0: 5915.5. Samples: 1248405030. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:52:44,424][25689] Avg episode reward: [(0, '1.891')] [2022-07-11 13:52:44,445][26022] Updated weights on worker 0-0, policy_version 1219146 (0.00092) [2022-07-11 13:52:46,170][26022] Updated weights on worker 0-0, policy_version 1219156 (0.00094) [2022-07-11 13:52:48,316][26022] Updated weights on worker 0-0, policy_version 1219166 (0.00090) [2022-07-11 13:52:49,468][25689] Fps is (10 sec: 5736.4, 60 sec: 5650.7, 300 sec: 5630.7). Total num frames: 1248434176. Throughput: 0: 5903.1. Samples: 1248438910. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:52:49,468][25689] Avg episode reward: [(0, '1.183')] [2022-07-11 13:52:49,849][26022] Updated weights on worker 0-0, policy_version 1219176 (0.00089) [2022-07-11 13:52:51,724][26022] Updated weights on worker 0-0, policy_version 1219186 (0.00087) [2022-07-11 13:52:53,679][26022] Updated weights on worker 0-0, policy_version 1219196 (0.00094) [2022-07-11 13:52:54,495][25689] Fps is (10 sec: 5693.6, 60 sec: 5617.9, 300 sec: 5630.9). Total num frames: 1248461824. Throughput: 0: 5045.5. Samples: 1248455920. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:52:54,496][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 13:52:55,163][26022] Updated weights on worker 0-0, policy_version 1219206 (0.00086) [2022-07-11 13:52:57,258][26022] Updated weights on worker 0-0, policy_version 1219216 (0.00086) [2022-07-11 13:52:58,883][26022] Updated weights on worker 0-0, policy_version 1219226 (0.00084) [2022-07-11 13:52:59,563][25689] Fps is (10 sec: 5579.1, 60 sec: 5630.9, 300 sec: 5631.6). Total num frames: 1248490496. Throughput: 0: 5877.4. Samples: 1248489968. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:52:59,563][25689] Avg episode reward: [(0, '1.079')] [2022-07-11 13:53:00,907][26022] Updated weights on worker 0-0, policy_version 1219236 (0.00090) [2022-07-11 13:53:02,888][26022] Updated weights on worker 0-0, policy_version 1219246 (0.00089) [2022-07-11 13:53:04,698][25689] Fps is (10 sec: 5419.5, 60 sec: 5623.0, 300 sec: 5625.6). Total num frames: 1248517120. Throughput: 0: 5771.1. Samples: 1248522038. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:04,698][25689] Avg episode reward: [(0, '1.389')] [2022-07-11 13:53:04,859][26022] Updated weights on worker 0-0, policy_version 1219256 (0.00087) [2022-07-11 13:53:06,410][26022] Updated weights on worker 0-0, policy_version 1219266 (0.00090) [2022-07-11 13:53:08,548][26022] Updated weights on worker 0-0, policy_version 1219276 (0.00091) [2022-07-11 13:53:09,758][25689] Fps is (10 sec: 5423.7, 60 sec: 5602.2, 300 sec: 5628.4). Total num frames: 1248545792. Throughput: 0: 4938.8. Samples: 1248539118. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:09,759][25689] Avg episode reward: [(0, '1.420')] [2022-07-11 13:53:10,063][26022] Updated weights on worker 0-0, policy_version 1219286 (0.00087) [2022-07-11 13:53:12,268][26022] Updated weights on worker 0-0, policy_version 1219296 (0.00089) [2022-07-11 13:53:13,631][26022] Updated weights on worker 0-0, policy_version 1219306 (0.00086) [2022-07-11 13:53:14,827][25689] Fps is (10 sec: 5661.3, 60 sec: 5614.0, 300 sec: 5628.6). Total num frames: 1248574464. Throughput: 0: 5751.5. Samples: 1248572860. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:14,827][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 13:53:15,824][26022] Updated weights on worker 0-0, policy_version 1219316 (0.00094) [2022-07-11 13:53:17,071][26022] Updated weights on worker 0-0, policy_version 1219326 (0.00086) [2022-07-11 13:53:19,419][26022] Updated weights on worker 0-0, policy_version 1219336 (0.00084) [2022-07-11 13:53:19,831][25689] Fps is (10 sec: 5591.1, 60 sec: 5616.2, 300 sec: 5627.4). Total num frames: 1248602112. Throughput: 0: 5768.3. Samples: 1248606882. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:19,832][25689] Avg episode reward: [(0, '1.399')] [2022-07-11 13:53:20,859][26022] Updated weights on worker 0-0, policy_version 1219346 (0.00093) [2022-07-11 13:53:22,972][26022] Updated weights on worker 0-0, policy_version 1219356 (0.00088) [2022-07-11 13:53:24,319][26022] Updated weights on worker 0-0, policy_version 1219366 (0.00087) [2022-07-11 13:53:24,952][25689] Fps is (10 sec: 5663.6, 60 sec: 5616.1, 300 sec: 5629.2). Total num frames: 1248631808. Throughput: 0: 5870.9. Samples: 1248640948. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:24,953][25689] Avg episode reward: [(0, '1.606')] [2022-07-11 13:53:26,663][26022] Updated weights on worker 0-0, policy_version 1219376 (0.00090) [2022-07-11 13:53:28,077][26022] Updated weights on worker 0-0, policy_version 1219386 (0.00088) [2022-07-11 13:53:29,962][25689] Fps is (10 sec: 5660.2, 60 sec: 5600.2, 300 sec: 5623.7). Total num frames: 1248659456. Throughput: 0: 5876.8. Samples: 1248657854. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:29,964][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 13:53:30,202][26022] Updated weights on worker 0-0, policy_version 1219396 (0.00140) [2022-07-11 13:53:31,746][26022] Updated weights on worker 0-0, policy_version 1219406 (0.00087) [2022-07-11 13:53:33,854][26022] Updated weights on worker 0-0, policy_version 1219416 (0.00084) [2022-07-11 13:53:34,972][25689] Fps is (10 sec: 5722.7, 60 sec: 5633.8, 300 sec: 5627.4). Total num frames: 1248689152. Throughput: 0: 5901.9. Samples: 1248691756. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:34,973][25689] Avg episode reward: [(0, '1.013')] [2022-07-11 13:53:35,443][26022] Updated weights on worker 0-0, policy_version 1219426 (0.00085) [2022-07-11 13:53:37,586][26022] Updated weights on worker 0-0, policy_version 1219436 (0.00080) [2022-07-11 13:53:39,008][26022] Updated weights on worker 0-0, policy_version 1219446 (0.00087) [2022-07-11 13:53:40,012][25689] Fps is (10 sec: 5604.2, 60 sec: 5599.0, 300 sec: 5621.7). Total num frames: 1248715776. Throughput: 0: 5877.7. Samples: 1248725498. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:40,012][25689] Avg episode reward: [(0, '0.886')] [2022-07-11 13:53:41,265][26022] Updated weights on worker 0-0, policy_version 1219456 (0.00615) [2022-07-11 13:53:42,712][26022] Updated weights on worker 0-0, policy_version 1219466 (0.00083) [2022-07-11 13:53:44,730][26022] Updated weights on worker 0-0, policy_version 1219476 (0.00674) [2022-07-11 13:53:45,086][25689] Fps is (10 sec: 5669.9, 60 sec: 5638.0, 300 sec: 5627.7). Total num frames: 1248746496. Throughput: 0: 5040.9. Samples: 1248742442. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:45,087][25689] Avg episode reward: [(0, '1.181')] [2022-07-11 13:53:46,340][26022] Updated weights on worker 0-0, policy_version 1219486 (0.00083) [2022-07-11 13:53:48,112][26022] Updated weights on worker 0-0, policy_version 1219496 (0.00095) [2022-07-11 13:53:50,111][25689] Fps is (10 sec: 5678.3, 60 sec: 5589.2, 300 sec: 5617.0). Total num frames: 1248773120. Throughput: 0: 5899.9. Samples: 1248776728. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:50,112][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 13:53:50,152][26022] Updated weights on worker 0-0, policy_version 1219506 (0.00092) [2022-07-11 13:53:51,923][26022] Updated weights on worker 0-0, policy_version 1219516 (0.00086) [2022-07-11 13:53:53,718][26022] Updated weights on worker 0-0, policy_version 1219526 (0.00083) [2022-07-11 13:53:55,115][25689] Fps is (10 sec: 5513.9, 60 sec: 5608.2, 300 sec: 5624.0). Total num frames: 1248801792. Throughput: 0: 5884.5. Samples: 1248810282. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:53:55,116][25689] Avg episode reward: [(0, '1.433')] [2022-07-11 13:53:55,527][26022] Updated weights on worker 0-0, policy_version 1219536 (0.00085) [2022-07-11 13:53:57,380][26022] Updated weights on worker 0-0, policy_version 1219546 (0.00084) [2022-07-11 13:53:59,373][26022] Updated weights on worker 0-0, policy_version 1219556 (0.00114) [2022-07-11 13:54:00,139][25689] Fps is (10 sec: 5718.4, 60 sec: 5612.3, 300 sec: 5628.4). Total num frames: 1248830464. Throughput: 0: 5059.6. Samples: 1248827332. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:00,140][25689] Avg episode reward: [(0, '1.516')] [2022-07-11 13:54:00,787][26022] Updated weights on worker 0-0, policy_version 1219566 (0.00079) [2022-07-11 13:54:03,251][26022] Updated weights on worker 0-0, policy_version 1219576 (0.00088) [2022-07-11 13:54:04,915][26022] Updated weights on worker 0-0, policy_version 1219586 (0.00084) [2022-07-11 13:54:05,242][25689] Fps is (10 sec: 5459.9, 60 sec: 5615.2, 300 sec: 5620.5). Total num frames: 1248857088. Throughput: 0: 5785.3. Samples: 1248859052. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:05,243][25689] Avg episode reward: [(0, '0.322')] [2022-07-11 13:54:06,795][26022] Updated weights on worker 0-0, policy_version 1219596 (0.00083) [2022-07-11 13:54:08,503][26022] Updated weights on worker 0-0, policy_version 1219606 (0.00094) [2022-07-11 13:54:10,244][25689] Fps is (10 sec: 5370.4, 60 sec: 5603.6, 300 sec: 5620.8). Total num frames: 1248884736. Throughput: 0: 5772.9. Samples: 1248892958. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:10,246][25689] Avg episode reward: [(0, '-0.456')] [2022-07-11 13:54:10,485][26022] Updated weights on worker 0-0, policy_version 1219616 (0.00085) [2022-07-11 13:54:12,460][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:54:12,470][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001219626_1248897024.pth [2022-07-11 13:54:12,470][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001217646_1246869504.pth [2022-07-11 13:54:12,475][26022] Updated weights on worker 0-0, policy_version 1219626 (0.00080) [2022-07-11 13:54:14,023][26022] Updated weights on worker 0-0, policy_version 1219636 (0.00094) [2022-07-11 13:54:15,263][25689] Fps is (10 sec: 5722.7, 60 sec: 5625.3, 300 sec: 5622.0). Total num frames: 1248914432. Throughput: 0: 4954.1. Samples: 1248910098. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:15,264][25689] Avg episode reward: [(0, '-1.206')] [2022-07-11 13:54:15,850][26022] Updated weights on worker 0-0, policy_version 1219646 (0.00080) [2022-07-11 13:54:17,471][26022] Updated weights on worker 0-0, policy_version 1219656 (0.00080) [2022-07-11 13:54:19,554][26022] Updated weights on worker 0-0, policy_version 1219666 (0.00089) [2022-07-11 13:54:20,312][25689] Fps is (10 sec: 5797.9, 60 sec: 5638.1, 300 sec: 5620.4). Total num frames: 1248943104. Throughput: 0: 5793.0. Samples: 1248944194. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:20,321][25689] Avg episode reward: [(0, '-1.484')] [2022-07-11 13:54:21,254][26022] Updated weights on worker 0-0, policy_version 1219676 (0.00099) [2022-07-11 13:54:23,039][26022] Updated weights on worker 0-0, policy_version 1219686 (0.00084) [2022-07-11 13:54:24,868][26022] Updated weights on worker 0-0, policy_version 1219696 (0.00091) [2022-07-11 13:54:25,471][25689] Fps is (10 sec: 5617.3, 60 sec: 5617.5, 300 sec: 5624.9). Total num frames: 1248971776. Throughput: 0: 5904.4. Samples: 1248978492. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:25,472][25689] Avg episode reward: [(0, '-1.431')] [2022-07-11 13:54:26,538][26022] Updated weights on worker 0-0, policy_version 1219706 (0.00073) [2022-07-11 13:54:28,471][26022] Updated weights on worker 0-0, policy_version 1219716 (0.00079) [2022-07-11 13:54:30,245][26022] Updated weights on worker 0-0, policy_version 1219726 (0.00404) [2022-07-11 13:54:30,478][25689] Fps is (10 sec: 5539.8, 60 sec: 5617.8, 300 sec: 5618.1). Total num frames: 1248999424. Throughput: 0: 5058.6. Samples: 1248995312. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:30,479][25689] Avg episode reward: [(0, '-1.324')] [2022-07-11 13:54:32,108][26022] Updated weights on worker 0-0, policy_version 1219736 (0.00086) [2022-07-11 13:54:34,045][26022] Updated weights on worker 0-0, policy_version 1219746 (0.00087) [2022-07-11 13:54:35,509][25689] Fps is (10 sec: 5712.8, 60 sec: 5615.9, 300 sec: 5624.7). Total num frames: 1249029120. Throughput: 0: 5878.6. Samples: 1249029118. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:35,509][25689] Avg episode reward: [(0, '0.241')] [2022-07-11 13:54:35,758][26022] Updated weights on worker 0-0, policy_version 1219756 (0.00089) [2022-07-11 13:54:37,591][26022] Updated weights on worker 0-0, policy_version 1219766 (0.00089) [2022-07-11 13:54:39,255][26022] Updated weights on worker 0-0, policy_version 1219776 (0.00092) [2022-07-11 13:54:40,528][25689] Fps is (10 sec: 5705.7, 60 sec: 5634.7, 300 sec: 5616.1). Total num frames: 1249056768. Throughput: 0: 5890.4. Samples: 1249063280. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:40,529][25689] Avg episode reward: [(0, '1.482')] [2022-07-11 13:54:41,255][26022] Updated weights on worker 0-0, policy_version 1219786 (0.00096) [2022-07-11 13:54:42,951][26022] Updated weights on worker 0-0, policy_version 1219796 (0.00099) [2022-07-11 13:54:44,754][26022] Updated weights on worker 0-0, policy_version 1219806 (0.00085) [2022-07-11 13:54:45,609][25689] Fps is (10 sec: 5677.8, 60 sec: 5617.2, 300 sec: 5622.7). Total num frames: 1249086464. Throughput: 0: 5051.4. Samples: 1249080214. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:45,609][25689] Avg episode reward: [(0, '1.622')] [2022-07-11 13:54:46,615][26022] Updated weights on worker 0-0, policy_version 1219816 (0.00090) [2022-07-11 13:54:48,389][26022] Updated weights on worker 0-0, policy_version 1219826 (0.00083) [2022-07-11 13:54:50,284][26022] Updated weights on worker 0-0, policy_version 1219836 (0.00097) [2022-07-11 13:54:50,639][25689] Fps is (10 sec: 5671.8, 60 sec: 5633.6, 300 sec: 5623.8). Total num frames: 1249114112. Throughput: 0: 5882.4. Samples: 1249113904. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:50,639][25689] Avg episode reward: [(0, '1.734')] [2022-07-11 13:54:52,039][26022] Updated weights on worker 0-0, policy_version 1219846 (0.00455) [2022-07-11 13:54:53,815][26022] Updated weights on worker 0-0, policy_version 1219856 (0.00411) [2022-07-11 13:54:55,611][26022] Updated weights on worker 0-0, policy_version 1219866 (0.00085) [2022-07-11 13:54:55,649][25689] Fps is (10 sec: 5609.1, 60 sec: 5633.0, 300 sec: 5618.0). Total num frames: 1249142784. Throughput: 0: 5920.5. Samples: 1249148360. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:54:55,650][25689] Avg episode reward: [(0, '0.445')] [2022-07-11 13:54:57,323][26022] Updated weights on worker 0-0, policy_version 1219876 (0.00095) [2022-07-11 13:54:59,298][26022] Updated weights on worker 0-0, policy_version 1219886 (0.00110) [2022-07-11 13:55:00,663][25689] Fps is (10 sec: 5720.7, 60 sec: 5634.0, 300 sec: 5629.5). Total num frames: 1249171456. Throughput: 0: 5072.5. Samples: 1249165412. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:00,663][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 13:55:00,901][26022] Updated weights on worker 0-0, policy_version 1219896 (0.00086) [2022-07-11 13:55:03,123][26022] Updated weights on worker 0-0, policy_version 1219906 (0.00085) [2022-07-11 13:55:05,040][26022] Updated weights on worker 0-0, policy_version 1219916 (0.00086) [2022-07-11 13:55:05,716][25689] Fps is (10 sec: 5493.2, 60 sec: 5638.7, 300 sec: 5625.7). Total num frames: 1249198080. Throughput: 0: 5849.9. Samples: 1249197838. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:05,716][25689] Avg episode reward: [(0, '-0.237')] [2022-07-11 13:55:06,731][26022] Updated weights on worker 0-0, policy_version 1219926 (0.00097) [2022-07-11 13:55:08,647][26022] Updated weights on worker 0-0, policy_version 1219936 (0.00086) [2022-07-11 13:55:10,325][26022] Updated weights on worker 0-0, policy_version 1219946 (0.00088) [2022-07-11 13:55:10,727][25689] Fps is (10 sec: 5392.3, 60 sec: 5637.8, 300 sec: 5619.8). Total num frames: 1249225728. Throughput: 0: 5862.6. Samples: 1249231676. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:10,728][25689] Avg episode reward: [(0, '-0.242')] [2022-07-11 13:55:12,040][26022] Updated weights on worker 0-0, policy_version 1219956 (0.00089) [2022-07-11 13:55:14,192][26022] Updated weights on worker 0-0, policy_version 1219966 (0.00088) [2022-07-11 13:55:15,653][26022] Updated weights on worker 0-0, policy_version 1219976 (0.00089) [2022-07-11 13:55:15,749][25689] Fps is (10 sec: 5715.6, 60 sec: 5637.5, 300 sec: 5623.0). Total num frames: 1249255424. Throughput: 0: 4994.3. Samples: 1249248742. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:15,750][25689] Avg episode reward: [(0, '-0.180')] [2022-07-11 13:55:17,851][26022] Updated weights on worker 0-0, policy_version 1219986 (0.00094) [2022-07-11 13:55:19,377][26022] Updated weights on worker 0-0, policy_version 1219996 (0.00083) [2022-07-11 13:55:20,771][25689] Fps is (10 sec: 5607.8, 60 sec: 5606.2, 300 sec: 5620.9). Total num frames: 1249282048. Throughput: 0: 5817.3. Samples: 1249282386. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:20,771][25689] Avg episode reward: [(0, '-0.721')] [2022-07-11 13:55:21,418][26022] Updated weights on worker 0-0, policy_version 1220006 (0.00084) [2022-07-11 13:55:23,155][26022] Updated weights on worker 0-0, policy_version 1220016 (0.00087) [2022-07-11 13:55:25,061][26022] Updated weights on worker 0-0, policy_version 1220026 (0.00087) [2022-07-11 13:55:25,809][25689] Fps is (10 sec: 5496.7, 60 sec: 5617.5, 300 sec: 5616.8). Total num frames: 1249310720. Throughput: 0: 5888.1. Samples: 1249316146. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:25,809][25689] Avg episode reward: [(0, '0.071')] [2022-07-11 13:55:26,803][26022] Updated weights on worker 0-0, policy_version 1220036 (0.00081) [2022-07-11 13:55:28,578][26022] Updated weights on worker 0-0, policy_version 1220046 (0.00085) [2022-07-11 13:55:30,327][26022] Updated weights on worker 0-0, policy_version 1220056 (0.00083) [2022-07-11 13:55:30,814][25689] Fps is (10 sec: 5709.8, 60 sec: 5634.6, 300 sec: 5620.6). Total num frames: 1249339392. Throughput: 0: 5059.0. Samples: 1249333292. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:30,814][25689] Avg episode reward: [(0, '0.156')] [2022-07-11 13:55:32,330][26022] Updated weights on worker 0-0, policy_version 1220066 (0.00103) [2022-07-11 13:55:33,768][26022] Updated weights on worker 0-0, policy_version 1220076 (0.00078) [2022-07-11 13:55:35,843][25689] Fps is (10 sec: 5612.8, 60 sec: 5600.8, 300 sec: 5616.9). Total num frames: 1249367040. Throughput: 0: 5906.8. Samples: 1249367434. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:35,843][25689] Avg episode reward: [(0, '1.064')] [2022-07-11 13:55:35,851][26022] Updated weights on worker 0-0, policy_version 1220086 (0.00103) [2022-07-11 13:55:37,624][26022] Updated weights on worker 0-0, policy_version 1220096 (0.00100) [2022-07-11 13:55:39,352][26022] Updated weights on worker 0-0, policy_version 1220106 (0.00088) [2022-07-11 13:55:40,863][25689] Fps is (10 sec: 5706.5, 60 sec: 5634.7, 300 sec: 5625.8). Total num frames: 1249396736. Throughput: 0: 5940.2. Samples: 1249401736. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:40,863][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 13:55:41,173][26022] Updated weights on worker 0-0, policy_version 1220116 (0.00091) [2022-07-11 13:55:42,800][26022] Updated weights on worker 0-0, policy_version 1220126 (0.00081) [2022-07-11 13:55:44,721][26022] Updated weights on worker 0-0, policy_version 1220136 (0.00097) [2022-07-11 13:55:45,907][25689] Fps is (10 sec: 5799.6, 60 sec: 5621.1, 300 sec: 5622.2). Total num frames: 1249425408. Throughput: 0: 5110.4. Samples: 1249418856. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:45,907][25689] Avg episode reward: [(0, '0.409')] [2022-07-11 13:55:46,575][26022] Updated weights on worker 0-0, policy_version 1220146 (0.00085) [2022-07-11 13:55:48,275][26022] Updated weights on worker 0-0, policy_version 1220156 (0.00079) [2022-07-11 13:55:50,227][26022] Updated weights on worker 0-0, policy_version 1220166 (0.00085) [2022-07-11 13:55:50,908][25689] Fps is (10 sec: 5708.4, 60 sec: 5640.8, 300 sec: 5622.5). Total num frames: 1249454080. Throughput: 0: 5954.9. Samples: 1249452952. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:50,909][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 13:55:51,904][26022] Updated weights on worker 0-0, policy_version 1220176 (0.00091) [2022-07-11 13:55:53,791][26022] Updated weights on worker 0-0, policy_version 1220186 (0.00085) [2022-07-11 13:55:55,637][26022] Updated weights on worker 0-0, policy_version 1220196 (0.01507) [2022-07-11 13:55:55,941][25689] Fps is (10 sec: 5714.6, 60 sec: 5638.7, 300 sec: 5632.3). Total num frames: 1249482752. Throughput: 0: 5936.1. Samples: 1249486742. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:55:55,942][25689] Avg episode reward: [(0, '-0.103')] [2022-07-11 13:55:57,291][26022] Updated weights on worker 0-0, policy_version 1220206 (0.00083) [2022-07-11 13:55:59,148][26022] Updated weights on worker 0-0, policy_version 1220216 (0.00088) [2022-07-11 13:56:00,963][25689] Fps is (10 sec: 5601.3, 60 sec: 5620.9, 300 sec: 5626.8). Total num frames: 1249510400. Throughput: 0: 5081.6. Samples: 1249503878. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:00,964][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 13:56:01,036][26022] Updated weights on worker 0-0, policy_version 1220226 (0.00086) [2022-07-11 13:56:03,107][26022] Updated weights on worker 0-0, policy_version 1220236 (0.00086) [2022-07-11 13:56:05,191][26022] Updated weights on worker 0-0, policy_version 1220246 (0.00061) [2022-07-11 13:56:06,011][25689] Fps is (10 sec: 5389.8, 60 sec: 5621.4, 300 sec: 5622.9). Total num frames: 1249537024. Throughput: 0: 5818.9. Samples: 1249535840. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:06,011][25689] Avg episode reward: [(0, '-0.894')] [2022-07-11 13:56:06,680][26022] Updated weights on worker 0-0, policy_version 1220256 (0.00082) [2022-07-11 13:56:08,804][26022] Updated weights on worker 0-0, policy_version 1220266 (0.00098) [2022-07-11 13:56:10,411][26022] Updated weights on worker 0-0, policy_version 1220276 (0.00086) [2022-07-11 13:56:11,079][25689] Fps is (10 sec: 5364.8, 60 sec: 5616.1, 300 sec: 5621.9). Total num frames: 1249564672. Throughput: 0: 5795.5. Samples: 1249569854. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:11,080][25689] Avg episode reward: [(0, '-0.051')] [2022-07-11 13:56:12,204][26022] Updated weights on worker 0-0, policy_version 1220286 (0.00092) [2022-07-11 13:56:12,510][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:56:12,521][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001220288_1249574912.pth [2022-07-11 13:56:12,522][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001218307_1247546368.pth [2022-07-11 13:56:14,142][26022] Updated weights on worker 0-0, policy_version 1220296 (0.00084) [2022-07-11 13:56:15,864][26022] Updated weights on worker 0-0, policy_version 1220306 (0.00093) [2022-07-11 13:56:16,105][25689] Fps is (10 sec: 5782.4, 60 sec: 5632.7, 300 sec: 5628.5). Total num frames: 1249595392. Throughput: 0: 5821.0. Samples: 1249604114. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:16,106][25689] Avg episode reward: [(0, '-0.050')] [2022-07-11 13:56:17,638][26022] Updated weights on worker 0-0, policy_version 1220316 (0.00087) [2022-07-11 13:56:19,379][26022] Updated weights on worker 0-0, policy_version 1220326 (0.00091) [2022-07-11 13:56:21,145][25689] Fps is (10 sec: 5697.1, 60 sec: 5631.0, 300 sec: 5622.5). Total num frames: 1249622016. Throughput: 0: 5812.9. Samples: 1249621192. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:21,145][25689] Avg episode reward: [(0, '0.072')] [2022-07-11 13:56:21,432][26022] Updated weights on worker 0-0, policy_version 1220336 (0.00087) [2022-07-11 13:56:23,227][26022] Updated weights on worker 0-0, policy_version 1220346 (0.00094) [2022-07-11 13:56:24,726][26022] Updated weights on worker 0-0, policy_version 1220356 (0.00085) [2022-07-11 13:56:26,204][25689] Fps is (10 sec: 5475.5, 60 sec: 5629.0, 300 sec: 5621.6). Total num frames: 1249650688. Throughput: 0: 5906.4. Samples: 1249655108. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:26,205][25689] Avg episode reward: [(0, '0.649')] [2022-07-11 13:56:26,771][26022] Updated weights on worker 0-0, policy_version 1220366 (0.00088) [2022-07-11 13:56:28,453][26022] Updated weights on worker 0-0, policy_version 1220376 (0.00087) [2022-07-11 13:56:30,287][26022] Updated weights on worker 0-0, policy_version 1220386 (0.00064) [2022-07-11 13:56:31,239][25689] Fps is (10 sec: 5782.6, 60 sec: 5643.2, 300 sec: 5624.6). Total num frames: 1249680384. Throughput: 0: 5928.1. Samples: 1249689360. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:31,239][25689] Avg episode reward: [(0, '1.301')] [2022-07-11 13:56:31,982][26022] Updated weights on worker 0-0, policy_version 1220396 (0.00086) [2022-07-11 13:56:33,747][26022] Updated weights on worker 0-0, policy_version 1220406 (0.00088) [2022-07-11 13:56:35,817][26022] Updated weights on worker 0-0, policy_version 1220416 (0.00085) [2022-07-11 13:56:36,243][25689] Fps is (10 sec: 5813.9, 60 sec: 5662.5, 300 sec: 5625.0). Total num frames: 1249709056. Throughput: 0: 5079.7. Samples: 1249706410. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:36,245][25689] Avg episode reward: [(0, '1.184')] [2022-07-11 13:56:37,482][26022] Updated weights on worker 0-0, policy_version 1220426 (0.00087) [2022-07-11 13:56:39,390][26022] Updated weights on worker 0-0, policy_version 1220436 (0.00089) [2022-07-11 13:56:41,163][26022] Updated weights on worker 0-0, policy_version 1220446 (0.00090) [2022-07-11 13:56:41,256][25689] Fps is (10 sec: 5622.1, 60 sec: 5629.2, 300 sec: 5627.0). Total num frames: 1249736704. Throughput: 0: 5916.6. Samples: 1249740184. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2022-07-11 13:56:41,257][25689] Avg episode reward: [(0, '0.207')] [2022-07-11 13:56:42,959][26022] Updated weights on worker 0-0, policy_version 1220456 (0.00089) [2022-07-11 13:56:44,856][26022] Updated weights on worker 0-0, policy_version 1220466 (0.00094) [2022-07-11 13:56:46,310][25689] Fps is (10 sec: 5492.8, 60 sec: 5611.3, 300 sec: 5623.3). Total num frames: 1249764352. Throughput: 0: 5914.2. Samples: 1249774020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:56:46,311][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 13:56:46,692][26022] Updated weights on worker 0-0, policy_version 1220476 (0.00093) [2022-07-11 13:56:48,478][26022] Updated weights on worker 0-0, policy_version 1220486 (0.00085) [2022-07-11 13:56:50,432][26022] Updated weights on worker 0-0, policy_version 1220496 (0.00079) [2022-07-11 13:56:51,345][25689] Fps is (10 sec: 5683.9, 60 sec: 5625.2, 300 sec: 5623.4). Total num frames: 1249794048. Throughput: 0: 5054.2. Samples: 1249790982. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:56:51,345][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 13:56:52,106][26022] Updated weights on worker 0-0, policy_version 1220506 (0.00083) [2022-07-11 13:56:53,887][26022] Updated weights on worker 0-0, policy_version 1220516 (0.00091) [2022-07-11 13:56:55,792][26022] Updated weights on worker 0-0, policy_version 1220526 (0.00091) [2022-07-11 13:56:56,348][25689] Fps is (10 sec: 5712.6, 60 sec: 5611.0, 300 sec: 5623.8). Total num frames: 1249821696. Throughput: 0: 5911.4. Samples: 1249825260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:56:56,349][25689] Avg episode reward: [(0, '-0.317')] [2022-07-11 13:56:57,414][26022] Updated weights on worker 0-0, policy_version 1220536 (0.00085) [2022-07-11 13:56:59,316][26022] Updated weights on worker 0-0, policy_version 1220546 (0.00093) [2022-07-11 13:57:01,138][26022] Updated weights on worker 0-0, policy_version 1220556 (0.00085) [2022-07-11 13:57:01,363][25689] Fps is (10 sec: 5724.3, 60 sec: 5645.6, 300 sec: 5634.8). Total num frames: 1249851392. Throughput: 0: 5945.3. Samples: 1249859722. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:01,363][25689] Avg episode reward: [(0, '-0.693')] [2022-07-11 13:57:02,993][26022] Updated weights on worker 0-0, policy_version 1220566 (0.00087) [2022-07-11 13:57:05,167][26022] Updated weights on worker 0-0, policy_version 1220576 (0.00089) [2022-07-11 13:57:06,499][25689] Fps is (10 sec: 5548.6, 60 sec: 5637.4, 300 sec: 5622.2). Total num frames: 1249878016. Throughput: 0: 4985.4. Samples: 1249874668. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:06,499][25689] Avg episode reward: [(0, '-0.518')] [2022-07-11 13:57:06,686][26022] Updated weights on worker 0-0, policy_version 1220586 (0.00089) [2022-07-11 13:57:08,751][26022] Updated weights on worker 0-0, policy_version 1220596 (0.00089) [2022-07-11 13:57:10,390][26022] Updated weights on worker 0-0, policy_version 1220606 (0.00091) [2022-07-11 13:57:11,503][25689] Fps is (10 sec: 5351.9, 60 sec: 5643.3, 300 sec: 5622.4). Total num frames: 1249905664. Throughput: 0: 5847.8. Samples: 1249908864. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:11,504][25689] Avg episode reward: [(0, '0.396')] [2022-07-11 13:57:12,266][26022] Updated weights on worker 0-0, policy_version 1220616 (0.00301) [2022-07-11 13:57:14,028][26022] Updated weights on worker 0-0, policy_version 1220626 (0.00088) [2022-07-11 13:57:15,842][26022] Updated weights on worker 0-0, policy_version 1220636 (0.00084) [2022-07-11 13:57:16,505][25689] Fps is (10 sec: 5730.8, 60 sec: 5628.6, 300 sec: 5629.8). Total num frames: 1249935360. Throughput: 0: 5844.9. Samples: 1249943074. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:16,505][25689] Avg episode reward: [(0, '0.706')] [2022-07-11 13:57:17,516][26022] Updated weights on worker 0-0, policy_version 1220646 (0.00082) [2022-07-11 13:57:19,493][26022] Updated weights on worker 0-0, policy_version 1220656 (0.00091) [2022-07-11 13:57:21,207][26022] Updated weights on worker 0-0, policy_version 1220666 (0.00085) [2022-07-11 13:57:21,570][25689] Fps is (10 sec: 5797.9, 60 sec: 5660.1, 300 sec: 5627.4). Total num frames: 1249964032. Throughput: 0: 4971.7. Samples: 1249960188. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:21,571][25689] Avg episode reward: [(0, '0.747')] [2022-07-11 13:57:23,363][26022] Updated weights on worker 0-0, policy_version 1220676 (0.00085) [2022-07-11 13:57:24,628][26022] Updated weights on worker 0-0, policy_version 1220686 (0.00088) [2022-07-11 13:57:26,628][25689] Fps is (10 sec: 5462.6, 60 sec: 5626.4, 300 sec: 5619.8). Total num frames: 1249990656. Throughput: 0: 5928.8. Samples: 1249994008. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:26,628][25689] Avg episode reward: [(0, '2.104')] [2022-07-11 13:57:26,879][26022] Updated weights on worker 0-0, policy_version 1220696 (0.00083) [2022-07-11 13:57:28,462][26022] Updated weights on worker 0-0, policy_version 1220706 (0.00090) [2022-07-11 13:57:30,319][26022] Updated weights on worker 0-0, policy_version 1220716 (0.00083) [2022-07-11 13:57:31,698][25689] Fps is (10 sec: 5662.2, 60 sec: 5640.0, 300 sec: 5629.0). Total num frames: 1250021376. Throughput: 0: 5902.3. Samples: 1250028058. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:31,698][25689] Avg episode reward: [(0, '1.791')] [2022-07-11 13:57:32,181][26022] Updated weights on worker 0-0, policy_version 1220726 (0.00093) [2022-07-11 13:57:33,815][26022] Updated weights on worker 0-0, policy_version 1220736 (0.00087) [2022-07-11 13:57:35,793][26022] Updated weights on worker 0-0, policy_version 1220746 (0.00087) [2022-07-11 13:57:36,716][25689] Fps is (10 sec: 5684.1, 60 sec: 5604.9, 300 sec: 5622.3). Total num frames: 1250048000. Throughput: 0: 5049.2. Samples: 1250045124. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:36,723][25689] Avg episode reward: [(0, '1.723')] [2022-07-11 13:57:37,495][26022] Updated weights on worker 0-0, policy_version 1220756 (0.00083) [2022-07-11 13:57:39,294][26022] Updated weights on worker 0-0, policy_version 1220766 (0.00094) [2022-07-11 13:57:41,084][26022] Updated weights on worker 0-0, policy_version 1220776 (0.00087) [2022-07-11 13:57:41,796][25689] Fps is (10 sec: 5678.7, 60 sec: 5649.5, 300 sec: 5630.1). Total num frames: 1250078720. Throughput: 0: 5894.5. Samples: 1250079408. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:41,798][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 13:57:42,924][26022] Updated weights on worker 0-0, policy_version 1220786 (0.00081) [2022-07-11 13:57:44,576][26022] Updated weights on worker 0-0, policy_version 1220796 (0.00079) [2022-07-11 13:57:46,489][26022] Updated weights on worker 0-0, policy_version 1220806 (0.00087) [2022-07-11 13:57:46,863][25689] Fps is (10 sec: 5853.1, 60 sec: 5665.1, 300 sec: 5626.2). Total num frames: 1250107392. Throughput: 0: 5899.5. Samples: 1250113388. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:46,864][25689] Avg episode reward: [(0, '1.400')] [2022-07-11 13:57:48,297][26022] Updated weights on worker 0-0, policy_version 1220816 (0.00096) [2022-07-11 13:57:50,115][26022] Updated weights on worker 0-0, policy_version 1220826 (0.00085) [2022-07-11 13:57:51,925][25689] Fps is (10 sec: 5559.9, 60 sec: 5628.7, 300 sec: 5625.6). Total num frames: 1250135040. Throughput: 0: 5066.1. Samples: 1250130534. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:51,926][25689] Avg episode reward: [(0, '1.319')] [2022-07-11 13:57:52,052][26022] Updated weights on worker 0-0, policy_version 1220836 (0.00092) [2022-07-11 13:57:53,674][26022] Updated weights on worker 0-0, policy_version 1220846 (0.00083) [2022-07-11 13:57:55,710][26022] Updated weights on worker 0-0, policy_version 1220856 (0.00093) [2022-07-11 13:57:56,944][25689] Fps is (10 sec: 5586.6, 60 sec: 5644.2, 300 sec: 5626.5). Total num frames: 1250163712. Throughput: 0: 5919.6. Samples: 1250164868. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:57:56,945][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 13:57:57,355][26022] Updated weights on worker 0-0, policy_version 1220866 (0.00086) [2022-07-11 13:57:59,037][26022] Updated weights on worker 0-0, policy_version 1220876 (0.00090) [2022-07-11 13:58:00,963][26022] Updated weights on worker 0-0, policy_version 1220886 (0.00086) [2022-07-11 13:58:01,975][25689] Fps is (10 sec: 5604.4, 60 sec: 5608.9, 300 sec: 5631.9). Total num frames: 1250191360. Throughput: 0: 5935.5. Samples: 1250199180. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:01,976][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 13:58:02,907][26022] Updated weights on worker 0-0, policy_version 1220896 (0.00089) [2022-07-11 13:58:04,831][26022] Updated weights on worker 0-0, policy_version 1220906 (0.00085) [2022-07-11 13:58:06,771][26022] Updated weights on worker 0-0, policy_version 1220916 (0.00090) [2022-07-11 13:58:07,158][25689] Fps is (10 sec: 5413.7, 60 sec: 5621.4, 300 sec: 5626.1). Total num frames: 1250219008. Throughput: 0: 4967.4. Samples: 1250214204. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:07,159][25689] Avg episode reward: [(0, '-0.252')] [2022-07-11 13:58:08,348][26022] Updated weights on worker 0-0, policy_version 1220926 (0.00095) [2022-07-11 13:58:10,513][26022] Updated weights on worker 0-0, policy_version 1220936 (0.00090) [2022-07-11 13:58:11,982][26022] Updated weights on worker 0-0, policy_version 1220946 (0.00083) [2022-07-11 13:58:12,170][25689] Fps is (10 sec: 5724.8, 60 sec: 5671.3, 300 sec: 5634.1). Total num frames: 1250249728. Throughput: 0: 5802.6. Samples: 1250248006. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:12,171][25689] Avg episode reward: [(0, '-0.829')] [2022-07-11 13:58:12,551][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 13:58:12,559][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001220949_1250251776.pth [2022-07-11 13:58:12,560][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001218968_1248223232.pth [2022-07-11 13:58:13,999][26022] Updated weights on worker 0-0, policy_version 1220956 (0.00085) [2022-07-11 13:58:15,805][26022] Updated weights on worker 0-0, policy_version 1220966 (0.00087) [2022-07-11 13:58:17,234][25689] Fps is (10 sec: 5792.8, 60 sec: 5631.8, 300 sec: 5633.0). Total num frames: 1250277376. Throughput: 0: 5764.4. Samples: 1250281828. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:17,235][25689] Avg episode reward: [(0, '-0.793')] [2022-07-11 13:58:17,522][26022] Updated weights on worker 0-0, policy_version 1220976 (0.00088) [2022-07-11 13:58:19,476][26022] Updated weights on worker 0-0, policy_version 1220986 (0.00084) [2022-07-11 13:58:21,022][26022] Updated weights on worker 0-0, policy_version 1220996 (0.00090) [2022-07-11 13:58:22,280][25689] Fps is (10 sec: 5469.4, 60 sec: 5616.7, 300 sec: 5627.5). Total num frames: 1250305024. Throughput: 0: 4909.1. Samples: 1250298864. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:22,281][25689] Avg episode reward: [(0, '-0.779')] [2022-07-11 13:58:23,127][26022] Updated weights on worker 0-0, policy_version 1221006 (0.00992) [2022-07-11 13:58:24,744][26022] Updated weights on worker 0-0, policy_version 1221016 (0.00085) [2022-07-11 13:58:26,587][26022] Updated weights on worker 0-0, policy_version 1221026 (0.00081) [2022-07-11 13:58:27,337][25689] Fps is (10 sec: 5574.5, 60 sec: 5650.5, 300 sec: 5630.1). Total num frames: 1250333696. Throughput: 0: 5862.9. Samples: 1250332512. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:27,338][25689] Avg episode reward: [(0, '-0.184')] [2022-07-11 13:58:28,476][26022] Updated weights on worker 0-0, policy_version 1221036 (0.00081) [2022-07-11 13:58:30,407][26022] Updated weights on worker 0-0, policy_version 1221046 (0.00086) [2022-07-11 13:58:32,083][26022] Updated weights on worker 0-0, policy_version 1221056 (0.00086) [2022-07-11 13:58:32,435][25689] Fps is (10 sec: 5747.8, 60 sec: 5631.1, 300 sec: 5628.4). Total num frames: 1250363392. Throughput: 0: 5844.6. Samples: 1250366446. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:32,436][25689] Avg episode reward: [(0, '0.687')] [2022-07-11 13:58:33,832][26022] Updated weights on worker 0-0, policy_version 1221066 (0.00086) [2022-07-11 13:58:35,759][26022] Updated weights on worker 0-0, policy_version 1221076 (0.00089) [2022-07-11 13:58:37,368][26022] Updated weights on worker 0-0, policy_version 1221086 (0.00294) [2022-07-11 13:58:37,473][25689] Fps is (10 sec: 5758.7, 60 sec: 5662.9, 300 sec: 5635.3). Total num frames: 1250392064. Throughput: 0: 5866.6. Samples: 1250400560. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:37,474][25689] Avg episode reward: [(0, '1.387')] [2022-07-11 13:58:39,541][26022] Updated weights on worker 0-0, policy_version 1221096 (0.00354) [2022-07-11 13:58:41,006][26022] Updated weights on worker 0-0, policy_version 1221106 (0.00091) [2022-07-11 13:58:42,531][25689] Fps is (10 sec: 5476.9, 60 sec: 5597.5, 300 sec: 5621.8). Total num frames: 1250418688. Throughput: 0: 5855.4. Samples: 1250417442. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:42,532][25689] Avg episode reward: [(0, '1.588')] [2022-07-11 13:58:43,038][26022] Updated weights on worker 0-0, policy_version 1221116 (0.00094) [2022-07-11 13:58:44,792][26022] Updated weights on worker 0-0, policy_version 1221126 (0.01014) [2022-07-11 13:58:46,684][26022] Updated weights on worker 0-0, policy_version 1221136 (0.00092) [2022-07-11 13:58:47,589][25689] Fps is (10 sec: 5567.7, 60 sec: 5615.3, 300 sec: 5631.6). Total num frames: 1250448384. Throughput: 0: 5869.0. Samples: 1250451366. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:47,589][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 13:58:48,437][26022] Updated weights on worker 0-0, policy_version 1221146 (0.00085) [2022-07-11 13:58:50,384][26022] Updated weights on worker 0-0, policy_version 1221156 (0.00084) [2022-07-11 13:58:51,915][26022] Updated weights on worker 0-0, policy_version 1221166 (0.00087) [2022-07-11 13:58:52,663][25689] Fps is (10 sec: 5861.8, 60 sec: 5647.9, 300 sec: 5633.7). Total num frames: 1250478080. Throughput: 0: 5858.1. Samples: 1250484944. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:52,667][25689] Avg episode reward: [(0, '1.604')] [2022-07-11 13:58:54,198][26022] Updated weights on worker 0-0, policy_version 1221176 (0.00095) [2022-07-11 13:58:55,439][26022] Updated weights on worker 0-0, policy_version 1221186 (0.00086) [2022-07-11 13:58:57,475][26022] Updated weights on worker 0-0, policy_version 1221196 (0.00081) [2022-07-11 13:58:57,681][25689] Fps is (10 sec: 5580.3, 60 sec: 5614.2, 300 sec: 5626.9). Total num frames: 1250504704. Throughput: 0: 5023.7. Samples: 1250502080. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:58:57,682][25689] Avg episode reward: [(0, '1.736')] [2022-07-11 13:58:59,345][26022] Updated weights on worker 0-0, policy_version 1221206 (0.00095) [2022-07-11 13:59:01,117][26022] Updated weights on worker 0-0, policy_version 1221216 (0.00092) [2022-07-11 13:59:02,683][25689] Fps is (10 sec: 5416.4, 60 sec: 5616.9, 300 sec: 5632.3). Total num frames: 1250532352. Throughput: 0: 5894.7. Samples: 1250536232. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:02,684][25689] Avg episode reward: [(0, '1.660')] [2022-07-11 13:59:03,239][26022] Updated weights on worker 0-0, policy_version 1221226 (0.00091) [2022-07-11 13:59:05,160][26022] Updated weights on worker 0-0, policy_version 1221236 (0.00081) [2022-07-11 13:59:06,563][26022] Updated weights on worker 0-0, policy_version 1221246 (0.00081) [2022-07-11 13:59:07,768][25689] Fps is (10 sec: 5583.8, 60 sec: 5643.0, 300 sec: 5634.1). Total num frames: 1250561024. Throughput: 0: 5815.1. Samples: 1250568710. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:07,768][25689] Avg episode reward: [(0, '1.439')] [2022-07-11 13:59:08,770][26022] Updated weights on worker 0-0, policy_version 1221256 (0.00087) [2022-07-11 13:59:10,154][26022] Updated weights on worker 0-0, policy_version 1221266 (0.00078) [2022-07-11 13:59:12,335][26022] Updated weights on worker 0-0, policy_version 1221276 (0.00091) [2022-07-11 13:59:12,789][25689] Fps is (10 sec: 5775.7, 60 sec: 5625.2, 300 sec: 5634.1). Total num frames: 1250590720. Throughput: 0: 5020.9. Samples: 1250585994. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:12,790][25689] Avg episode reward: [(0, '1.291')] [2022-07-11 13:59:14,014][26022] Updated weights on worker 0-0, policy_version 1221286 (0.00081) [2022-07-11 13:59:15,713][26022] Updated weights on worker 0-0, policy_version 1221296 (0.00085) [2022-07-11 13:59:17,665][26022] Updated weights on worker 0-0, policy_version 1221306 (0.00085) [2022-07-11 13:59:17,811][25689] Fps is (10 sec: 5607.7, 60 sec: 5612.2, 300 sec: 5627.7). Total num frames: 1250617344. Throughput: 0: 5866.9. Samples: 1250620178. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:17,811][25689] Avg episode reward: [(0, '1.341')] [2022-07-11 13:59:19,255][26022] Updated weights on worker 0-0, policy_version 1221316 (0.00083) [2022-07-11 13:59:21,167][26022] Updated weights on worker 0-0, policy_version 1221326 (0.00093) [2022-07-11 13:59:22,915][25689] Fps is (10 sec: 5460.9, 60 sec: 5623.7, 300 sec: 5628.8). Total num frames: 1250646016. Throughput: 0: 5830.4. Samples: 1250654190. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:22,915][25689] Avg episode reward: [(0, '1.195')] [2022-07-11 13:59:23,127][26022] Updated weights on worker 0-0, policy_version 1221336 (0.00086) [2022-07-11 13:59:24,956][26022] Updated weights on worker 0-0, policy_version 1221346 (0.00084) [2022-07-11 13:59:26,626][26022] Updated weights on worker 0-0, policy_version 1221356 (0.00086) [2022-07-11 13:59:28,064][25689] Fps is (10 sec: 5692.9, 60 sec: 5632.1, 300 sec: 5633.0). Total num frames: 1250675712. Throughput: 0: 5040.6. Samples: 1250671014. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:28,064][25689] Avg episode reward: [(0, '0.338')] [2022-07-11 13:59:28,386][26022] Updated weights on worker 0-0, policy_version 1221366 (0.00082) [2022-07-11 13:59:30,096][26022] Updated weights on worker 0-0, policy_version 1221376 (0.00087) [2022-07-11 13:59:32,146][26022] Updated weights on worker 0-0, policy_version 1221386 (0.00076) [2022-07-11 13:59:33,157][25689] Fps is (10 sec: 5698.8, 60 sec: 5615.6, 300 sec: 5628.3). Total num frames: 1250704384. Throughput: 0: 5836.9. Samples: 1250704880. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:33,157][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 13:59:33,875][26022] Updated weights on worker 0-0, policy_version 1221396 (0.00086) [2022-07-11 13:59:35,712][26022] Updated weights on worker 0-0, policy_version 1221406 (0.00090) [2022-07-11 13:59:37,535][26022] Updated weights on worker 0-0, policy_version 1221416 (0.00080) [2022-07-11 13:59:38,210][25689] Fps is (10 sec: 5752.6, 60 sec: 5631.1, 300 sec: 5634.6). Total num frames: 1250734080. Throughput: 0: 5829.2. Samples: 1250739090. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:38,211][25689] Avg episode reward: [(0, '0.478')] [2022-07-11 13:59:39,218][26022] Updated weights on worker 0-0, policy_version 1221426 (0.00083) [2022-07-11 13:59:40,912][26022] Updated weights on worker 0-0, policy_version 1221436 (0.00089) [2022-07-11 13:59:42,900][26022] Updated weights on worker 0-0, policy_version 1221446 (0.00085) [2022-07-11 13:59:43,241][25689] Fps is (10 sec: 5584.9, 60 sec: 5633.6, 300 sec: 5625.2). Total num frames: 1250760704. Throughput: 0: 5023.7. Samples: 1250756304. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:43,242][25689] Avg episode reward: [(0, '-0.273')] [2022-07-11 13:59:44,584][26022] Updated weights on worker 0-0, policy_version 1221456 (0.00080) [2022-07-11 13:59:46,555][26022] Updated weights on worker 0-0, policy_version 1221466 (0.00105) [2022-07-11 13:59:48,257][26022] Updated weights on worker 0-0, policy_version 1221476 (0.00081) [2022-07-11 13:59:48,322][25689] Fps is (10 sec: 5670.8, 60 sec: 5648.3, 300 sec: 5634.5). Total num frames: 1250791424. Throughput: 0: 5899.2. Samples: 1250790524. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:48,323][25689] Avg episode reward: [(0, '-0.162')] [2022-07-11 13:59:50,066][26022] Updated weights on worker 0-0, policy_version 1221486 (0.00083) [2022-07-11 13:59:51,731][26022] Updated weights on worker 0-0, policy_version 1221496 (0.00083) [2022-07-11 13:59:53,352][25689] Fps is (10 sec: 5873.9, 60 sec: 5635.5, 300 sec: 5634.2). Total num frames: 1250820096. Throughput: 0: 5947.0. Samples: 1250824982. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:53,353][25689] Avg episode reward: [(0, '0.690')] [2022-07-11 13:59:53,639][26022] Updated weights on worker 0-0, policy_version 1221506 (0.00096) [2022-07-11 13:59:55,550][26022] Updated weights on worker 0-0, policy_version 1221516 (0.00086) [2022-07-11 13:59:57,335][26022] Updated weights on worker 0-0, policy_version 1221526 (0.00081) [2022-07-11 13:59:58,379][25689] Fps is (10 sec: 5600.2, 60 sec: 5651.6, 300 sec: 5630.5). Total num frames: 1250847744. Throughput: 0: 5086.2. Samples: 1250841672. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 13:59:58,380][25689] Avg episode reward: [(0, '-0.366')] [2022-07-11 13:59:59,145][26022] Updated weights on worker 0-0, policy_version 1221536 (0.00089) [2022-07-11 14:00:01,027][26022] Updated weights on worker 0-0, policy_version 1221546 (0.00083) [2022-07-11 14:00:03,173][26022] Updated weights on worker 0-0, policy_version 1221556 (0.00086) [2022-07-11 14:00:03,401][25689] Fps is (10 sec: 5400.9, 60 sec: 5632.9, 300 sec: 5631.1). Total num frames: 1250874368. Throughput: 0: 5878.1. Samples: 1250874806. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:03,402][25689] Avg episode reward: [(0, '-0.230')] [2022-07-11 14:00:04,902][26022] Updated weights on worker 0-0, policy_version 1221566 (0.00079) [2022-07-11 14:00:06,793][26022] Updated weights on worker 0-0, policy_version 1221576 (0.00082) [2022-07-11 14:00:08,438][25689] Fps is (10 sec: 5497.1, 60 sec: 5637.3, 300 sec: 5634.0). Total num frames: 1250903040. Throughput: 0: 5831.2. Samples: 1250907824. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:08,439][25689] Avg episode reward: [(0, '-0.107')] [2022-07-11 14:00:08,441][26022] Updated weights on worker 0-0, policy_version 1221586 (0.00083) [2022-07-11 14:00:10,430][26022] Updated weights on worker 0-0, policy_version 1221596 (0.00102) [2022-07-11 14:00:12,011][26022] Updated weights on worker 0-0, policy_version 1221606 (0.00081) [2022-07-11 14:00:12,785][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:00:12,798][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001221609_1250927616.pth [2022-07-11 14:00:12,798][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001219626_1248897024.pth [2022-07-11 14:00:13,472][25689] Fps is (10 sec: 5694.3, 60 sec: 5619.3, 300 sec: 5630.4). Total num frames: 1250931712. Throughput: 0: 4976.2. Samples: 1250925098. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:13,472][25689] Avg episode reward: [(0, '0.341')] [2022-07-11 14:00:13,923][26022] Updated weights on worker 0-0, policy_version 1221616 (0.00088) [2022-07-11 14:00:15,790][26022] Updated weights on worker 0-0, policy_version 1221626 (0.00093) [2022-07-11 14:00:17,368][26022] Updated weights on worker 0-0, policy_version 1221636 (0.00100) [2022-07-11 14:00:18,480][25689] Fps is (10 sec: 5710.2, 60 sec: 5654.2, 300 sec: 5637.5). Total num frames: 1250960384. Throughput: 0: 5850.3. Samples: 1250959270. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:18,482][25689] Avg episode reward: [(0, '0.184')] [2022-07-11 14:00:19,457][26022] Updated weights on worker 0-0, policy_version 1221646 (0.00086) [2022-07-11 14:00:21,092][26022] Updated weights on worker 0-0, policy_version 1221656 (0.00085) [2022-07-11 14:00:22,938][26022] Updated weights on worker 0-0, policy_version 1221666 (0.00084) [2022-07-11 14:00:23,483][25689] Fps is (10 sec: 5830.0, 60 sec: 5680.6, 300 sec: 5641.6). Total num frames: 1250990080. Throughput: 0: 5901.2. Samples: 1250993314. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:23,484][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 14:00:24,780][26022] Updated weights on worker 0-0, policy_version 1221676 (0.00091) [2022-07-11 14:00:26,611][26022] Updated weights on worker 0-0, policy_version 1221686 (0.00083) [2022-07-11 14:00:28,444][26022] Updated weights on worker 0-0, policy_version 1221696 (0.00083) [2022-07-11 14:00:28,564][25689] Fps is (10 sec: 5585.1, 60 sec: 5636.2, 300 sec: 5633.3). Total num frames: 1251016704. Throughput: 0: 5897.1. Samples: 1251026512. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:28,565][25689] Avg episode reward: [(0, '1.093')] [2022-07-11 14:00:30,177][26022] Updated weights on worker 0-0, policy_version 1221706 (0.00087) [2022-07-11 14:00:32,126][26022] Updated weights on worker 0-0, policy_version 1221716 (0.00070) [2022-07-11 14:00:33,569][25689] Fps is (10 sec: 5482.5, 60 sec: 5644.4, 300 sec: 5637.2). Total num frames: 1251045376. Throughput: 0: 5893.1. Samples: 1251043536. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:33,570][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 14:00:33,822][26022] Updated weights on worker 0-0, policy_version 1221726 (0.00086) [2022-07-11 14:00:35,616][26022] Updated weights on worker 0-0, policy_version 1221736 (0.00100) [2022-07-11 14:00:37,527][26022] Updated weights on worker 0-0, policy_version 1221746 (0.00082) [2022-07-11 14:00:38,579][25689] Fps is (10 sec: 5623.7, 60 sec: 5614.5, 300 sec: 5630.5). Total num frames: 1251073024. Throughput: 0: 5898.6. Samples: 1251077826. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2022-07-11 14:00:38,584][25689] Avg episode reward: [(0, '0.661')] [2022-07-11 14:00:39,231][26022] Updated weights on worker 0-0, policy_version 1221756 (0.00078) [2022-07-11 14:00:40,938][26022] Updated weights on worker 0-0, policy_version 1221766 (0.00087) [2022-07-11 14:00:42,878][26022] Updated weights on worker 0-0, policy_version 1221776 (0.00083) [2022-07-11 14:00:43,606][25689] Fps is (10 sec: 5611.6, 60 sec: 5648.9, 300 sec: 5630.8). Total num frames: 1251101696. Throughput: 0: 5909.5. Samples: 1251112228. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:00:43,606][25689] Avg episode reward: [(0, '-0.386')] [2022-07-11 14:00:44,616][26022] Updated weights on worker 0-0, policy_version 1221786 (0.00088) [2022-07-11 14:00:46,579][26022] Updated weights on worker 0-0, policy_version 1221796 (0.00086) [2022-07-11 14:00:48,193][26022] Updated weights on worker 0-0, policy_version 1221806 (0.00081) [2022-07-11 14:00:48,655][25689] Fps is (10 sec: 5792.9, 60 sec: 5634.9, 300 sec: 5633.3). Total num frames: 1251131392. Throughput: 0: 5123.4. Samples: 1251129444. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:00:48,656][25689] Avg episode reward: [(0, '-0.573')] [2022-07-11 14:00:50,192][26022] Updated weights on worker 0-0, policy_version 1221816 (0.00091) [2022-07-11 14:00:51,783][26022] Updated weights on worker 0-0, policy_version 1221826 (0.00085) [2022-07-11 14:00:53,672][25689] Fps is (10 sec: 5798.4, 60 sec: 5636.1, 300 sec: 5633.7). Total num frames: 1251160064. Throughput: 0: 5958.4. Samples: 1251163316. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:00:53,672][25689] Avg episode reward: [(0, '-0.500')] [2022-07-11 14:00:53,676][26022] Updated weights on worker 0-0, policy_version 1221836 (0.00075) [2022-07-11 14:00:55,412][26022] Updated weights on worker 0-0, policy_version 1221846 (0.00091) [2022-07-11 14:00:57,222][26022] Updated weights on worker 0-0, policy_version 1221856 (0.00094) [2022-07-11 14:00:58,682][25689] Fps is (10 sec: 5514.7, 60 sec: 5620.7, 300 sec: 5630.4). Total num frames: 1251186688. Throughput: 0: 5920.7. Samples: 1251196848. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:00:58,683][25689] Avg episode reward: [(0, '-0.212')] [2022-07-11 14:00:59,177][26022] Updated weights on worker 0-0, policy_version 1221866 (0.00086) [2022-07-11 14:01:00,881][26022] Updated weights on worker 0-0, policy_version 1221876 (0.00085) [2022-07-11 14:01:03,127][26022] Updated weights on worker 0-0, policy_version 1221886 (0.00083) [2022-07-11 14:01:03,690][25689] Fps is (10 sec: 5417.1, 60 sec: 5638.9, 300 sec: 5634.6). Total num frames: 1251214336. Throughput: 0: 5062.7. Samples: 1251213912. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:03,692][25689] Avg episode reward: [(0, '-0.290')] [2022-07-11 14:01:04,831][26022] Updated weights on worker 0-0, policy_version 1221896 (0.00087) [2022-07-11 14:01:06,675][26022] Updated weights on worker 0-0, policy_version 1221906 (0.00080) [2022-07-11 14:01:08,577][26022] Updated weights on worker 0-0, policy_version 1221916 (0.00080) [2022-07-11 14:01:08,770][25689] Fps is (10 sec: 5582.9, 60 sec: 5634.9, 300 sec: 5637.8). Total num frames: 1251243008. Throughput: 0: 5787.3. Samples: 1251245856. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:08,771][25689] Avg episode reward: [(0, '-0.268')] [2022-07-11 14:01:10,321][26022] Updated weights on worker 0-0, policy_version 1221926 (0.00092) [2022-07-11 14:01:12,211][26022] Updated weights on worker 0-0, policy_version 1221936 (0.00088) [2022-07-11 14:01:13,842][25689] Fps is (10 sec: 5648.7, 60 sec: 5631.3, 300 sec: 5630.1). Total num frames: 1251271680. Throughput: 0: 5783.7. Samples: 1251279974. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:13,843][25689] Avg episode reward: [(0, '0.495')] [2022-07-11 14:01:13,874][26022] Updated weights on worker 0-0, policy_version 1221946 (0.00092) [2022-07-11 14:01:15,883][26022] Updated weights on worker 0-0, policy_version 1221956 (0.00084) [2022-07-11 14:01:17,500][26022] Updated weights on worker 0-0, policy_version 1221966 (0.00093) [2022-07-11 14:01:18,893][25689] Fps is (10 sec: 5664.6, 60 sec: 5627.4, 300 sec: 5636.8). Total num frames: 1251300352. Throughput: 0: 4951.7. Samples: 1251296928. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:18,896][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 14:01:19,485][26022] Updated weights on worker 0-0, policy_version 1221976 (0.00084) [2022-07-11 14:01:21,155][26022] Updated weights on worker 0-0, policy_version 1221986 (0.00086) [2022-07-11 14:01:23,094][26022] Updated weights on worker 0-0, policy_version 1221996 (0.00084) [2022-07-11 14:01:23,949][25689] Fps is (10 sec: 5674.0, 60 sec: 5605.6, 300 sec: 5636.8). Total num frames: 1251329024. Throughput: 0: 5781.7. Samples: 1251331038. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:23,950][25689] Avg episode reward: [(0, '0.803')] [2022-07-11 14:01:24,748][26022] Updated weights on worker 0-0, policy_version 1222006 (0.00083) [2022-07-11 14:01:26,751][26022] Updated weights on worker 0-0, policy_version 1222016 (0.00091) [2022-07-11 14:01:28,376][26022] Updated weights on worker 0-0, policy_version 1222026 (0.00080) [2022-07-11 14:01:29,016][25689] Fps is (10 sec: 5563.8, 60 sec: 5623.8, 300 sec: 5629.3). Total num frames: 1251356672. Throughput: 0: 5869.9. Samples: 1251364698. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:29,017][25689] Avg episode reward: [(0, '1.490')] [2022-07-11 14:01:30,513][26022] Updated weights on worker 0-0, policy_version 1222036 (0.00087) [2022-07-11 14:01:32,083][26022] Updated weights on worker 0-0, policy_version 1222046 (0.00089) [2022-07-11 14:01:33,996][26022] Updated weights on worker 0-0, policy_version 1222056 (0.00102) [2022-07-11 14:01:34,031][25689] Fps is (10 sec: 5585.8, 60 sec: 5622.8, 300 sec: 5629.1). Total num frames: 1251385344. Throughput: 0: 5039.2. Samples: 1251381712. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:34,032][25689] Avg episode reward: [(0, '1.631')] [2022-07-11 14:01:35,707][26022] Updated weights on worker 0-0, policy_version 1222066 (0.00092) [2022-07-11 14:01:37,540][26022] Updated weights on worker 0-0, policy_version 1222076 (0.00092) [2022-07-11 14:01:39,059][25689] Fps is (10 sec: 5608.0, 60 sec: 5621.2, 300 sec: 5628.9). Total num frames: 1251412992. Throughput: 0: 5905.0. Samples: 1251416004. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:39,059][25689] Avg episode reward: [(0, '0.979')] [2022-07-11 14:01:39,383][26022] Updated weights on worker 0-0, policy_version 1222086 (0.00083) [2022-07-11 14:01:41,111][26022] Updated weights on worker 0-0, policy_version 1222096 (0.00081) [2022-07-11 14:01:42,928][26022] Updated weights on worker 0-0, policy_version 1222106 (0.00086) [2022-07-11 14:01:44,069][25689] Fps is (10 sec: 5814.9, 60 sec: 5656.6, 300 sec: 5640.0). Total num frames: 1251443712. Throughput: 0: 5936.0. Samples: 1251450472. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:44,071][25689] Avg episode reward: [(0, '1.245')] [2022-07-11 14:01:44,850][26022] Updated weights on worker 0-0, policy_version 1222116 (0.00091) [2022-07-11 14:01:46,487][26022] Updated weights on worker 0-0, policy_version 1222126 (0.00082) [2022-07-11 14:01:48,307][26022] Updated weights on worker 0-0, policy_version 1222136 (0.00093) [2022-07-11 14:01:49,176][25689] Fps is (10 sec: 5769.0, 60 sec: 5617.3, 300 sec: 5631.8). Total num frames: 1251471360. Throughput: 0: 5099.8. Samples: 1251467512. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:49,177][25689] Avg episode reward: [(0, '1.083')] [2022-07-11 14:01:50,030][26022] Updated weights on worker 0-0, policy_version 1222146 (0.00106) [2022-07-11 14:01:51,887][26022] Updated weights on worker 0-0, policy_version 1222156 (0.00090) [2022-07-11 14:01:53,743][26022] Updated weights on worker 0-0, policy_version 1222166 (0.00094) [2022-07-11 14:01:54,265][25689] Fps is (10 sec: 5624.2, 60 sec: 5627.6, 300 sec: 5637.0). Total num frames: 1251501056. Throughput: 0: 5938.1. Samples: 1251501860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:54,265][25689] Avg episode reward: [(0, '1.060')] [2022-07-11 14:01:55,668][26022] Updated weights on worker 0-0, policy_version 1222176 (0.00092) [2022-07-11 14:01:57,278][26022] Updated weights on worker 0-0, policy_version 1222186 (0.00081) [2022-07-11 14:01:58,976][26022] Updated weights on worker 0-0, policy_version 1222196 (0.00085) [2022-07-11 14:01:59,288][25689] Fps is (10 sec: 5772.1, 60 sec: 5660.1, 300 sec: 5633.4). Total num frames: 1251529728. Throughput: 0: 5946.4. Samples: 1251536298. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:01:59,289][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 14:02:00,887][26022] Updated weights on worker 0-0, policy_version 1222206 (0.00084) [2022-07-11 14:02:03,132][26022] Updated weights on worker 0-0, policy_version 1222216 (0.00086) [2022-07-11 14:02:04,304][25689] Fps is (10 sec: 5406.0, 60 sec: 5625.7, 300 sec: 5632.2). Total num frames: 1251555328. Throughput: 0: 5086.4. Samples: 1251553396. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:04,305][25689] Avg episode reward: [(0, '1.375')] [2022-07-11 14:02:04,971][26022] Updated weights on worker 0-0, policy_version 1222226 (0.00086) [2022-07-11 14:02:06,500][26022] Updated weights on worker 0-0, policy_version 1222236 (0.00085) [2022-07-11 14:02:08,337][26022] Updated weights on worker 0-0, policy_version 1222246 (0.00096) [2022-07-11 14:02:09,379][25689] Fps is (10 sec: 5581.5, 60 sec: 5659.9, 300 sec: 5641.2). Total num frames: 1251586048. Throughput: 0: 5835.7. Samples: 1251585408. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:09,379][25689] Avg episode reward: [(0, '2.010')] [2022-07-11 14:02:10,106][26022] Updated weights on worker 0-0, policy_version 1222256 (0.00497) [2022-07-11 14:02:12,082][26022] Updated weights on worker 0-0, policy_version 1222266 (0.00087) [2022-07-11 14:02:12,865][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:02:12,875][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001222272_1251606528.pth [2022-07-11 14:02:12,876][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001220288_1249574912.pth [2022-07-11 14:02:13,713][26022] Updated weights on worker 0-0, policy_version 1222276 (0.00079) [2022-07-11 14:02:14,446][25689] Fps is (10 sec: 5755.0, 60 sec: 5643.4, 300 sec: 5633.1). Total num frames: 1251613696. Throughput: 0: 5851.8. Samples: 1251619958. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:14,447][25689] Avg episode reward: [(0, '1.976')] [2022-07-11 14:02:15,661][26022] Updated weights on worker 0-0, policy_version 1222286 (0.00101) [2022-07-11 14:02:17,122][26022] Updated weights on worker 0-0, policy_version 1222296 (0.00083) [2022-07-11 14:02:19,247][26022] Updated weights on worker 0-0, policy_version 1222306 (0.00097) [2022-07-11 14:02:19,461][25689] Fps is (10 sec: 5687.6, 60 sec: 5663.7, 300 sec: 5637.5). Total num frames: 1251643392. Throughput: 0: 4997.0. Samples: 1251637102. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:19,462][25689] Avg episode reward: [(0, '1.825')] [2022-07-11 14:02:20,856][26022] Updated weights on worker 0-0, policy_version 1222316 (0.00097) [2022-07-11 14:02:22,772][26022] Updated weights on worker 0-0, policy_version 1222326 (0.00093) [2022-07-11 14:02:24,544][25689] Fps is (10 sec: 5679.1, 60 sec: 5644.3, 300 sec: 5640.5). Total num frames: 1251671040. Throughput: 0: 5825.3. Samples: 1251671298. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:24,544][25689] Avg episode reward: [(0, '1.771')] [2022-07-11 14:02:24,585][26022] Updated weights on worker 0-0, policy_version 1222336 (0.00087) [2022-07-11 14:02:26,453][26022] Updated weights on worker 0-0, policy_version 1222346 (0.00088) [2022-07-11 14:02:28,117][26022] Updated weights on worker 0-0, policy_version 1222356 (0.00095) [2022-07-11 14:02:29,673][25689] Fps is (10 sec: 5414.9, 60 sec: 5638.5, 300 sec: 5629.0). Total num frames: 1251698688. Throughput: 0: 5906.1. Samples: 1251705268. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:29,674][25689] Avg episode reward: [(0, '1.469')] [2022-07-11 14:02:30,088][26022] Updated weights on worker 0-0, policy_version 1222366 (0.00085) [2022-07-11 14:02:31,701][26022] Updated weights on worker 0-0, policy_version 1222376 (0.00088) [2022-07-11 14:02:33,823][26022] Updated weights on worker 0-0, policy_version 1222386 (0.00085) [2022-07-11 14:02:34,691][25689] Fps is (10 sec: 5752.1, 60 sec: 5672.1, 300 sec: 5642.8). Total num frames: 1251729408. Throughput: 0: 5042.0. Samples: 1251722032. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:34,691][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 14:02:35,522][26022] Updated weights on worker 0-0, policy_version 1222396 (0.00080) [2022-07-11 14:02:37,197][26022] Updated weights on worker 0-0, policy_version 1222406 (0.00084) [2022-07-11 14:02:38,880][26022] Updated weights on worker 0-0, policy_version 1222416 (0.00087) [2022-07-11 14:02:39,745][25689] Fps is (10 sec: 5693.4, 60 sec: 5652.7, 300 sec: 5629.5). Total num frames: 1251756032. Throughput: 0: 5887.5. Samples: 1251756524. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:39,747][25689] Avg episode reward: [(0, '0.112')] [2022-07-11 14:02:40,937][26022] Updated weights on worker 0-0, policy_version 1222426 (0.00085) [2022-07-11 14:02:42,719][26022] Updated weights on worker 0-0, policy_version 1222436 (0.00088) [2022-07-11 14:02:44,468][26022] Updated weights on worker 0-0, policy_version 1222446 (0.00094) [2022-07-11 14:02:44,763][25689] Fps is (10 sec: 5693.4, 60 sec: 5652.0, 300 sec: 5637.3). Total num frames: 1251786752. Throughput: 0: 5915.3. Samples: 1251790900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:44,765][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 14:02:46,261][26022] Updated weights on worker 0-0, policy_version 1222456 (0.00082) [2022-07-11 14:02:48,128][26022] Updated weights on worker 0-0, policy_version 1222466 (0.00091) [2022-07-11 14:02:49,775][26022] Updated weights on worker 0-0, policy_version 1222476 (0.00078) [2022-07-11 14:02:49,875][25689] Fps is (10 sec: 5863.2, 60 sec: 5668.4, 300 sec: 5639.8). Total num frames: 1251815424. Throughput: 0: 5897.9. Samples: 1251824416. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:49,875][25689] Avg episode reward: [(0, '-0.132')] [2022-07-11 14:02:51,778][26022] Updated weights on worker 0-0, policy_version 1222486 (0.00083) [2022-07-11 14:02:53,426][26022] Updated weights on worker 0-0, policy_version 1222496 (0.00087) [2022-07-11 14:02:54,903][25689] Fps is (10 sec: 5352.4, 60 sec: 5606.5, 300 sec: 5629.4). Total num frames: 1251841024. Throughput: 0: 5904.0. Samples: 1251841362. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:54,905][25689] Avg episode reward: [(0, '-0.105')] [2022-07-11 14:02:55,403][26022] Updated weights on worker 0-0, policy_version 1222506 (0.00086) [2022-07-11 14:02:57,209][26022] Updated weights on worker 0-0, policy_version 1222516 (0.00086) [2022-07-11 14:02:59,044][26022] Updated weights on worker 0-0, policy_version 1222526 (0.00087) [2022-07-11 14:02:59,915][25689] Fps is (10 sec: 5711.5, 60 sec: 5658.3, 300 sec: 5643.4). Total num frames: 1251872768. Throughput: 0: 5888.0. Samples: 1251875284. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:02:59,917][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 14:03:00,839][26022] Updated weights on worker 0-0, policy_version 1222536 (0.00089) [2022-07-11 14:03:02,992][26022] Updated weights on worker 0-0, policy_version 1222546 (0.00085) [2022-07-11 14:03:04,840][26022] Updated weights on worker 0-0, policy_version 1222556 (0.00085) [2022-07-11 14:03:04,954][25689] Fps is (10 sec: 5603.1, 60 sec: 5639.2, 300 sec: 5635.9). Total num frames: 1251897344. Throughput: 0: 5757.1. Samples: 1251907144. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:04,955][25689] Avg episode reward: [(0, '0.402')] [2022-07-11 14:03:06,583][26022] Updated weights on worker 0-0, policy_version 1222566 (0.00081) [2022-07-11 14:03:08,416][26022] Updated weights on worker 0-0, policy_version 1222576 (0.00085) [2022-07-11 14:03:10,048][25689] Fps is (10 sec: 5355.7, 60 sec: 5620.5, 300 sec: 5630.9). Total num frames: 1251927040. Throughput: 0: 4945.3. Samples: 1251924180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:10,049][25689] Avg episode reward: [(0, '1.169')] [2022-07-11 14:03:10,207][26022] Updated weights on worker 0-0, policy_version 1222586 (0.00088) [2022-07-11 14:03:12,040][26022] Updated weights on worker 0-0, policy_version 1222596 (0.00103) [2022-07-11 14:03:13,960][26022] Updated weights on worker 0-0, policy_version 1222606 (0.00085) [2022-07-11 14:03:15,092][25689] Fps is (10 sec: 5757.2, 60 sec: 5639.5, 300 sec: 5634.7). Total num frames: 1251955712. Throughput: 0: 5789.6. Samples: 1251958252. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:15,093][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 14:03:15,593][26022] Updated weights on worker 0-0, policy_version 1222616 (0.00782) [2022-07-11 14:03:17,623][26022] Updated weights on worker 0-0, policy_version 1222626 (0.00077) [2022-07-11 14:03:19,383][26022] Updated weights on worker 0-0, policy_version 1222636 (0.00090) [2022-07-11 14:03:20,126][25689] Fps is (10 sec: 5588.5, 60 sec: 5604.0, 300 sec: 5634.9). Total num frames: 1251983360. Throughput: 0: 5774.2. Samples: 1251991988. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:20,127][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 14:03:21,118][26022] Updated weights on worker 0-0, policy_version 1222646 (0.00092) [2022-07-11 14:03:22,919][26022] Updated weights on worker 0-0, policy_version 1222656 (0.00084) [2022-07-11 14:03:24,809][26022] Updated weights on worker 0-0, policy_version 1222666 (0.00080) [2022-07-11 14:03:25,213][25689] Fps is (10 sec: 5463.9, 60 sec: 5603.7, 300 sec: 5630.9). Total num frames: 1252011008. Throughput: 0: 5022.1. Samples: 1252008884. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:25,213][25689] Avg episode reward: [(0, '-0.199')] [2022-07-11 14:03:26,543][26022] Updated weights on worker 0-0, policy_version 1222676 (0.00082) [2022-07-11 14:03:28,543][26022] Updated weights on worker 0-0, policy_version 1222686 (0.00092) [2022-07-11 14:03:30,113][26022] Updated weights on worker 0-0, policy_version 1222696 (0.00083) [2022-07-11 14:03:30,270][25689] Fps is (10 sec: 5754.0, 60 sec: 5661.0, 300 sec: 5635.1). Total num frames: 1252041728. Throughput: 0: 5857.3. Samples: 1252042626. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:30,271][25689] Avg episode reward: [(0, '-0.104')] [2022-07-11 14:03:32,132][26022] Updated weights on worker 0-0, policy_version 1222706 (0.00083) [2022-07-11 14:03:33,818][26022] Updated weights on worker 0-0, policy_version 1222716 (0.00088) [2022-07-11 14:03:35,291][25689] Fps is (10 sec: 5690.0, 60 sec: 5593.2, 300 sec: 5628.6). Total num frames: 1252068352. Throughput: 0: 5857.5. Samples: 1252076564. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:35,291][25689] Avg episode reward: [(0, '-0.014')] [2022-07-11 14:03:35,699][26022] Updated weights on worker 0-0, policy_version 1222726 (0.00090) [2022-07-11 14:03:37,432][26022] Updated weights on worker 0-0, policy_version 1222736 (0.00093) [2022-07-11 14:03:39,289][26022] Updated weights on worker 0-0, policy_version 1222746 (0.00079) [2022-07-11 14:03:40,295][25689] Fps is (10 sec: 5618.2, 60 sec: 5648.5, 300 sec: 5639.9). Total num frames: 1252098048. Throughput: 0: 5043.4. Samples: 1252093706. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:40,295][25689] Avg episode reward: [(0, '0.105')] [2022-07-11 14:03:41,197][26022] Updated weights on worker 0-0, policy_version 1222756 (0.00084) [2022-07-11 14:03:42,870][26022] Updated weights on worker 0-0, policy_version 1222766 (0.00083) [2022-07-11 14:03:44,723][26022] Updated weights on worker 0-0, policy_version 1222776 (0.00087) [2022-07-11 14:03:45,297][25689] Fps is (10 sec: 5628.6, 60 sec: 5582.3, 300 sec: 5630.6). Total num frames: 1252124672. Throughput: 0: 5914.4. Samples: 1252127670. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:45,297][25689] Avg episode reward: [(0, '-0.348')] [2022-07-11 14:03:46,443][26022] Updated weights on worker 0-0, policy_version 1222786 (0.00099) [2022-07-11 14:03:48,427][26022] Updated weights on worker 0-0, policy_version 1222796 (0.00200) [2022-07-11 14:03:50,124][26022] Updated weights on worker 0-0, policy_version 1222806 (0.00083) [2022-07-11 14:03:50,335][25689] Fps is (10 sec: 5711.2, 60 sec: 5623.0, 300 sec: 5634.8). Total num frames: 1252155392. Throughput: 0: 5939.0. Samples: 1252161792. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:50,336][25689] Avg episode reward: [(0, '0.606')] [2022-07-11 14:03:52,042][26022] Updated weights on worker 0-0, policy_version 1222816 (0.00086) [2022-07-11 14:03:53,860][26022] Updated weights on worker 0-0, policy_version 1222826 (0.00081) [2022-07-11 14:03:55,355][25689] Fps is (10 sec: 5701.0, 60 sec: 5640.6, 300 sec: 5634.7). Total num frames: 1252182016. Throughput: 0: 5068.6. Samples: 1252178264. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:03:55,360][25689] Avg episode reward: [(0, '0.685')] [2022-07-11 14:03:55,541][26022] Updated weights on worker 0-0, policy_version 1222836 (0.00086) [2022-07-11 14:03:57,551][26022] Updated weights on worker 0-0, policy_version 1222846 (0.00083) [2022-07-11 14:03:59,316][26022] Updated weights on worker 0-0, policy_version 1222856 (0.00079) [2022-07-11 14:04:00,373][25689] Fps is (10 sec: 5406.8, 60 sec: 5572.4, 300 sec: 5634.4). Total num frames: 1252209664. Throughput: 0: 5903.8. Samples: 1252212244. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:00,373][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 14:04:01,163][26022] Updated weights on worker 0-0, policy_version 1222866 (0.00101) [2022-07-11 14:04:03,362][26022] Updated weights on worker 0-0, policy_version 1222876 (0.00082) [2022-07-11 14:04:05,077][26022] Updated weights on worker 0-0, policy_version 1222886 (0.00087) [2022-07-11 14:04:05,390][25689] Fps is (10 sec: 5408.4, 60 sec: 5608.3, 300 sec: 5628.8). Total num frames: 1252236288. Throughput: 0: 5787.5. Samples: 1252243960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:05,391][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 14:04:06,908][26022] Updated weights on worker 0-0, policy_version 1222896 (0.00183) [2022-07-11 14:04:08,711][26022] Updated weights on worker 0-0, policy_version 1222906 (0.00095) [2022-07-11 14:04:10,494][25689] Fps is (10 sec: 5463.4, 60 sec: 5590.5, 300 sec: 5623.8). Total num frames: 1252264960. Throughput: 0: 4924.3. Samples: 1252261060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:10,495][25689] Avg episode reward: [(0, '0.490')] [2022-07-11 14:04:10,540][26022] Updated weights on worker 0-0, policy_version 1222916 (0.00082) [2022-07-11 14:04:12,416][26022] Updated weights on worker 0-0, policy_version 1222926 (0.00091) [2022-07-11 14:04:13,031][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:04:13,046][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001222930_1252280320.pth [2022-07-11 14:04:13,047][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001220949_1250251776.pth [2022-07-11 14:04:13,983][26022] Updated weights on worker 0-0, policy_version 1222936 (0.00076) [2022-07-11 14:04:15,498][25689] Fps is (10 sec: 5673.0, 60 sec: 5594.2, 300 sec: 5631.0). Total num frames: 1252293632. Throughput: 0: 5808.5. Samples: 1252295264. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:15,499][25689] Avg episode reward: [(0, '1.434')] [2022-07-11 14:04:16,072][26022] Updated weights on worker 0-0, policy_version 1222946 (0.00080) [2022-07-11 14:04:17,572][26022] Updated weights on worker 0-0, policy_version 1222956 (0.00081) [2022-07-11 14:04:19,588][26022] Updated weights on worker 0-0, policy_version 1222966 (0.00086) [2022-07-11 14:04:20,505][25689] Fps is (10 sec: 5830.3, 60 sec: 5630.6, 300 sec: 5636.3). Total num frames: 1252323328. Throughput: 0: 5836.0. Samples: 1252329736. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:20,506][25689] Avg episode reward: [(0, '1.414')] [2022-07-11 14:04:21,062][26022] Updated weights on worker 0-0, policy_version 1222976 (0.00082) [2022-07-11 14:04:23,130][26022] Updated weights on worker 0-0, policy_version 1222986 (0.00088) [2022-07-11 14:04:24,867][26022] Updated weights on worker 0-0, policy_version 1222996 (0.00094) [2022-07-11 14:04:25,563][25689] Fps is (10 sec: 5595.9, 60 sec: 5616.3, 300 sec: 5627.7). Total num frames: 1252349952. Throughput: 0: 5094.5. Samples: 1252346730. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:25,565][25689] Avg episode reward: [(0, '1.681')] [2022-07-11 14:04:26,742][26022] Updated weights on worker 0-0, policy_version 1223006 (0.00088) [2022-07-11 14:04:28,681][26022] Updated weights on worker 0-0, policy_version 1223016 (0.00082) [2022-07-11 14:04:30,152][26022] Updated weights on worker 0-0, policy_version 1223026 (0.00085) [2022-07-11 14:04:30,697][25689] Fps is (10 sec: 5526.2, 60 sec: 5592.2, 300 sec: 5630.4). Total num frames: 1252379648. Throughput: 0: 5937.2. Samples: 1252381006. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:30,697][25689] Avg episode reward: [(0, '1.259')] [2022-07-11 14:04:32,202][26022] Updated weights on worker 0-0, policy_version 1223036 (0.00086) [2022-07-11 14:04:33,956][26022] Updated weights on worker 0-0, policy_version 1223046 (0.00083) [2022-07-11 14:04:35,748][25689] Fps is (10 sec: 5831.0, 60 sec: 5640.2, 300 sec: 5630.4). Total num frames: 1252409344. Throughput: 0: 5896.5. Samples: 1252414668. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:35,749][25689] Avg episode reward: [(0, '1.239')] [2022-07-11 14:04:35,754][26022] Updated weights on worker 0-0, policy_version 1223056 (0.00087) [2022-07-11 14:04:37,610][26022] Updated weights on worker 0-0, policy_version 1223066 (0.00097) [2022-07-11 14:04:39,326][26022] Updated weights on worker 0-0, policy_version 1223076 (0.00080) [2022-07-11 14:04:40,778][25689] Fps is (10 sec: 5789.7, 60 sec: 5620.9, 300 sec: 5637.3). Total num frames: 1252438016. Throughput: 0: 5030.0. Samples: 1252431708. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2022-07-11 14:04:40,779][25689] Avg episode reward: [(0, '1.338')] [2022-07-11 14:04:41,368][26022] Updated weights on worker 0-0, policy_version 1223086 (0.00089) [2022-07-11 14:04:43,009][26022] Updated weights on worker 0-0, policy_version 1223096 (0.00282) [2022-07-11 14:04:44,918][26022] Updated weights on worker 0-0, policy_version 1223106 (0.00092) [2022-07-11 14:04:45,838][25689] Fps is (10 sec: 5581.8, 60 sec: 5632.4, 300 sec: 5627.4). Total num frames: 1252465664. Throughput: 0: 5868.6. Samples: 1252465718. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:04:45,839][25689] Avg episode reward: [(0, '1.217')] [2022-07-11 14:04:46,607][26022] Updated weights on worker 0-0, policy_version 1223116 (0.00085) [2022-07-11 14:04:48,527][26022] Updated weights on worker 0-0, policy_version 1223126 (0.00090) [2022-07-11 14:04:50,285][26022] Updated weights on worker 0-0, policy_version 1223136 (0.00081) [2022-07-11 14:04:50,940][25689] Fps is (10 sec: 5542.1, 60 sec: 5592.7, 300 sec: 5626.0). Total num frames: 1252494336. Throughput: 0: 5863.3. Samples: 1252499700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:04:50,941][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 14:04:51,922][26022] Updated weights on worker 0-0, policy_version 1223146 (0.00085) [2022-07-11 14:04:53,942][26022] Updated weights on worker 0-0, policy_version 1223156 (0.00086) [2022-07-11 14:04:55,506][26022] Updated weights on worker 0-0, policy_version 1223166 (0.00082) [2022-07-11 14:04:55,946][25689] Fps is (10 sec: 5774.7, 60 sec: 5644.7, 300 sec: 5633.3). Total num frames: 1252524032. Throughput: 0: 5071.6. Samples: 1252517102. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:04:55,946][25689] Avg episode reward: [(0, '1.123')] [2022-07-11 14:04:57,484][26022] Updated weights on worker 0-0, policy_version 1223176 (0.00085) [2022-07-11 14:04:59,069][26022] Updated weights on worker 0-0, policy_version 1223186 (0.00064) [2022-07-11 14:05:01,003][25689] Fps is (10 sec: 5698.6, 60 sec: 5641.0, 300 sec: 5636.1). Total num frames: 1252551680. Throughput: 0: 5917.2. Samples: 1252551382. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:01,004][25689] Avg episode reward: [(0, '1.717')] [2022-07-11 14:05:01,071][26022] Updated weights on worker 0-0, policy_version 1223196 (0.00087) [2022-07-11 14:05:03,236][26022] Updated weights on worker 0-0, policy_version 1223206 (0.00098) [2022-07-11 14:05:04,966][26022] Updated weights on worker 0-0, policy_version 1223216 (0.00093) [2022-07-11 14:05:06,023][25689] Fps is (10 sec: 5385.7, 60 sec: 5640.8, 300 sec: 5629.5). Total num frames: 1252578304. Throughput: 0: 5828.6. Samples: 1252583366. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:06,025][25689] Avg episode reward: [(0, '1.487')] [2022-07-11 14:05:06,801][26022] Updated weights on worker 0-0, policy_version 1223226 (0.00080) [2022-07-11 14:05:08,708][26022] Updated weights on worker 0-0, policy_version 1223236 (0.00088) [2022-07-11 14:05:10,308][26022] Updated weights on worker 0-0, policy_version 1223246 (0.00103) [2022-07-11 14:05:11,111][25689] Fps is (10 sec: 5571.7, 60 sec: 5659.1, 300 sec: 5631.9). Total num frames: 1252608000. Throughput: 0: 5003.1. Samples: 1252600618. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:11,112][25689] Avg episode reward: [(0, '1.429')] [2022-07-11 14:05:12,552][26022] Updated weights on worker 0-0, policy_version 1223256 (0.00089) [2022-07-11 14:05:13,963][26022] Updated weights on worker 0-0, policy_version 1223266 (0.00082) [2022-07-11 14:05:16,139][25689] Fps is (10 sec: 5567.3, 60 sec: 5623.1, 300 sec: 5624.7). Total num frames: 1252634624. Throughput: 0: 5807.0. Samples: 1252634364. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:16,140][25689] Avg episode reward: [(0, '1.536')] [2022-07-11 14:05:16,157][26022] Updated weights on worker 0-0, policy_version 1223276 (0.01084) [2022-07-11 14:05:17,692][26022] Updated weights on worker 0-0, policy_version 1223286 (0.00086) [2022-07-11 14:05:19,555][26022] Updated weights on worker 0-0, policy_version 1223296 (0.00095) [2022-07-11 14:05:21,147][25689] Fps is (10 sec: 5509.8, 60 sec: 5606.2, 300 sec: 5621.2). Total num frames: 1252663296. Throughput: 0: 5796.1. Samples: 1252668138. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:21,149][25689] Avg episode reward: [(0, '1.260')] [2022-07-11 14:05:21,287][26022] Updated weights on worker 0-0, policy_version 1223306 (0.00084) [2022-07-11 14:05:23,086][26022] Updated weights on worker 0-0, policy_version 1223316 (0.00088) [2022-07-11 14:05:24,912][26022] Updated weights on worker 0-0, policy_version 1223326 (0.00084) [2022-07-11 14:05:26,172][25689] Fps is (10 sec: 5817.4, 60 sec: 5659.8, 300 sec: 5632.5). Total num frames: 1252692992. Throughput: 0: 5063.4. Samples: 1252685392. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:26,174][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 14:05:26,783][26022] Updated weights on worker 0-0, policy_version 1223336 (0.00084) [2022-07-11 14:05:28,538][26022] Updated weights on worker 0-0, policy_version 1223346 (0.00086) [2022-07-11 14:05:30,347][26022] Updated weights on worker 0-0, policy_version 1223356 (0.00087) [2022-07-11 14:05:31,228][25689] Fps is (10 sec: 5789.9, 60 sec: 5650.2, 300 sec: 5631.6). Total num frames: 1252721664. Throughput: 0: 5912.5. Samples: 1252719558. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:31,230][25689] Avg episode reward: [(0, '1.190')] [2022-07-11 14:05:32,091][26022] Updated weights on worker 0-0, policy_version 1223366 (0.00097) [2022-07-11 14:05:34,019][26022] Updated weights on worker 0-0, policy_version 1223376 (0.00086) [2022-07-11 14:05:35,931][26022] Updated weights on worker 0-0, policy_version 1223386 (0.00094) [2022-07-11 14:05:36,288][25689] Fps is (10 sec: 5567.8, 60 sec: 5615.6, 300 sec: 5630.6). Total num frames: 1252749312. Throughput: 0: 5906.3. Samples: 1252753368. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:36,288][25689] Avg episode reward: [(0, '0.979')] [2022-07-11 14:05:37,615][26022] Updated weights on worker 0-0, policy_version 1223396 (0.00091) [2022-07-11 14:05:39,422][26022] Updated weights on worker 0-0, policy_version 1223406 (0.00084) [2022-07-11 14:05:41,334][25689] Fps is (10 sec: 5573.0, 60 sec: 5614.1, 300 sec: 5630.3). Total num frames: 1252777984. Throughput: 0: 5901.0. Samples: 1252787260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:41,334][25689] Avg episode reward: [(0, '1.038')] [2022-07-11 14:05:41,335][26022] Updated weights on worker 0-0, policy_version 1223416 (0.00080) [2022-07-11 14:05:42,890][26022] Updated weights on worker 0-0, policy_version 1223426 (0.00098) [2022-07-11 14:05:44,907][26022] Updated weights on worker 0-0, policy_version 1223436 (0.00084) [2022-07-11 14:05:46,366][25689] Fps is (10 sec: 5791.6, 60 sec: 5650.6, 300 sec: 5630.6). Total num frames: 1252807680. Throughput: 0: 5889.7. Samples: 1252804324. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:46,366][25689] Avg episode reward: [(0, '0.647')] [2022-07-11 14:05:46,430][26022] Updated weights on worker 0-0, policy_version 1223446 (0.00089) [2022-07-11 14:05:48,539][26022] Updated weights on worker 0-0, policy_version 1223456 (0.00084) [2022-07-11 14:05:50,203][26022] Updated weights on worker 0-0, policy_version 1223466 (0.00096) [2022-07-11 14:05:51,468][25689] Fps is (10 sec: 5658.3, 60 sec: 5633.6, 300 sec: 5625.5). Total num frames: 1252835328. Throughput: 0: 5861.7. Samples: 1252838202. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:51,469][25689] Avg episode reward: [(0, '0.831')] [2022-07-11 14:05:52,116][26022] Updated weights on worker 0-0, policy_version 1223476 (0.00095) [2022-07-11 14:05:53,871][26022] Updated weights on worker 0-0, policy_version 1223486 (0.00598) [2022-07-11 14:05:55,859][26022] Updated weights on worker 0-0, policy_version 1223496 (0.00085) [2022-07-11 14:05:56,491][25689] Fps is (10 sec: 5562.4, 60 sec: 5615.1, 300 sec: 5632.2). Total num frames: 1252864000. Throughput: 0: 5895.3. Samples: 1252872472. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:05:56,492][25689] Avg episode reward: [(0, '0.724')] [2022-07-11 14:05:57,531][26022] Updated weights on worker 0-0, policy_version 1223506 (0.00086) [2022-07-11 14:05:59,351][26022] Updated weights on worker 0-0, policy_version 1223516 (0.00094) [2022-07-11 14:06:01,043][26022] Updated weights on worker 0-0, policy_version 1223526 (0.00093) [2022-07-11 14:06:01,503][25689] Fps is (10 sec: 5714.8, 60 sec: 5636.3, 300 sec: 5635.6). Total num frames: 1252892672. Throughput: 0: 5080.3. Samples: 1252889724. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:01,503][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 14:06:03,016][26022] Updated weights on worker 0-0, policy_version 1223536 (0.00092) [2022-07-11 14:06:05,064][26022] Updated weights on worker 0-0, policy_version 1223546 (0.00084) [2022-07-11 14:06:06,504][25689] Fps is (10 sec: 5522.4, 60 sec: 5638.0, 300 sec: 5630.2). Total num frames: 1252919296. Throughput: 0: 5826.0. Samples: 1252921648. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:06,505][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 14:06:07,093][26022] Updated weights on worker 0-0, policy_version 1223556 (0.00082) [2022-07-11 14:06:08,663][26022] Updated weights on worker 0-0, policy_version 1223566 (0.00087) [2022-07-11 14:06:10,732][26022] Updated weights on worker 0-0, policy_version 1223576 (0.00084) [2022-07-11 14:06:11,629][25689] Fps is (10 sec: 5460.7, 60 sec: 5617.7, 300 sec: 5629.2). Total num frames: 1252947968. Throughput: 0: 5849.7. Samples: 1252956134. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:11,630][25689] Avg episode reward: [(0, '-0.167')] [2022-07-11 14:06:12,101][26022] Updated weights on worker 0-0, policy_version 1223586 (0.00085) [2022-07-11 14:06:13,058][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:06:13,068][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001223591_1252957184.pth [2022-07-11 14:06:13,069][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001221609_1250927616.pth [2022-07-11 14:06:14,074][26022] Updated weights on worker 0-0, policy_version 1223596 (0.00085) [2022-07-11 14:06:15,854][26022] Updated weights on worker 0-0, policy_version 1223606 (0.00092) [2022-07-11 14:06:16,653][25689] Fps is (10 sec: 5650.2, 60 sec: 5651.9, 300 sec: 5629.7). Total num frames: 1252976640. Throughput: 0: 5003.1. Samples: 1252973344. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:16,654][25689] Avg episode reward: [(0, '0.115')] [2022-07-11 14:06:17,523][26022] Updated weights on worker 0-0, policy_version 1223616 (0.00082) [2022-07-11 14:06:19,392][26022] Updated weights on worker 0-0, policy_version 1223626 (0.00092) [2022-07-11 14:06:21,371][26022] Updated weights on worker 0-0, policy_version 1223636 (0.00085) [2022-07-11 14:06:21,673][25689] Fps is (10 sec: 5607.1, 60 sec: 5633.8, 300 sec: 5626.9). Total num frames: 1253004288. Throughput: 0: 5824.1. Samples: 1253007200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:21,674][25689] Avg episode reward: [(0, '-0.155')] [2022-07-11 14:06:23,203][26022] Updated weights on worker 0-0, policy_version 1223646 (0.00086) [2022-07-11 14:06:24,914][26022] Updated weights on worker 0-0, policy_version 1223656 (0.00085) [2022-07-11 14:06:26,562][26022] Updated weights on worker 0-0, policy_version 1223666 (0.00088) [2022-07-11 14:06:26,711][25689] Fps is (10 sec: 5701.5, 60 sec: 5632.7, 300 sec: 5634.3). Total num frames: 1253033984. Throughput: 0: 5921.7. Samples: 1253041306. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:26,711][25689] Avg episode reward: [(0, '-0.681')] [2022-07-11 14:06:28,685][26022] Updated weights on worker 0-0, policy_version 1223676 (0.00081) [2022-07-11 14:06:30,356][26022] Updated weights on worker 0-0, policy_version 1223686 (0.00088) [2022-07-11 14:06:31,825][25689] Fps is (10 sec: 5749.3, 60 sec: 5627.2, 300 sec: 5632.5). Total num frames: 1253062656. Throughput: 0: 5062.4. Samples: 1253058376. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:31,827][25689] Avg episode reward: [(0, '0.109')] [2022-07-11 14:06:32,239][26022] Updated weights on worker 0-0, policy_version 1223696 (0.00054) [2022-07-11 14:06:33,856][26022] Updated weights on worker 0-0, policy_version 1223706 (0.00089) [2022-07-11 14:06:35,830][26022] Updated weights on worker 0-0, policy_version 1223716 (0.00108) [2022-07-11 14:06:36,839][25689] Fps is (10 sec: 5560.7, 60 sec: 5631.5, 300 sec: 5632.7). Total num frames: 1253090304. Throughput: 0: 5887.7. Samples: 1253092192. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:36,839][25689] Avg episode reward: [(0, '0.032')] [2022-07-11 14:06:37,692][26022] Updated weights on worker 0-0, policy_version 1223726 (0.00082) [2022-07-11 14:06:39,536][26022] Updated weights on worker 0-0, policy_version 1223736 (0.00085) [2022-07-11 14:06:41,110][26022] Updated weights on worker 0-0, policy_version 1223746 (0.00085) [2022-07-11 14:06:41,867][25689] Fps is (10 sec: 5608.7, 60 sec: 5633.2, 300 sec: 5625.5). Total num frames: 1253118976. Throughput: 0: 5900.9. Samples: 1253126360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:41,869][25689] Avg episode reward: [(0, '0.493')] [2022-07-11 14:06:43,102][26022] Updated weights on worker 0-0, policy_version 1223756 (0.00086) [2022-07-11 14:06:44,825][26022] Updated weights on worker 0-0, policy_version 1223766 (0.00089) [2022-07-11 14:06:46,537][26022] Updated weights on worker 0-0, policy_version 1223776 (0.00085) [2022-07-11 14:06:46,876][25689] Fps is (10 sec: 5815.0, 60 sec: 5635.3, 300 sec: 5634.2). Total num frames: 1253148672. Throughput: 0: 5065.5. Samples: 1253143454. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:46,877][25689] Avg episode reward: [(0, '0.727')] [2022-07-11 14:06:48,694][26022] Updated weights on worker 0-0, policy_version 1223786 (0.00088) [2022-07-11 14:06:49,981][26022] Updated weights on worker 0-0, policy_version 1223796 (0.00086) [2022-07-11 14:06:51,938][25689] Fps is (10 sec: 5592.3, 60 sec: 5622.2, 300 sec: 5624.4). Total num frames: 1253175296. Throughput: 0: 5907.2. Samples: 1253177184. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:51,938][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 14:06:52,233][26022] Updated weights on worker 0-0, policy_version 1223806 (0.00085) [2022-07-11 14:06:53,742][26022] Updated weights on worker 0-0, policy_version 1223816 (0.00085) [2022-07-11 14:06:55,762][26022] Updated weights on worker 0-0, policy_version 1223826 (0.00092) [2022-07-11 14:06:56,975][25689] Fps is (10 sec: 5577.1, 60 sec: 5637.8, 300 sec: 5627.6). Total num frames: 1253204992. Throughput: 0: 5906.3. Samples: 1253211120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:06:56,975][25689] Avg episode reward: [(0, '1.796')] [2022-07-11 14:06:57,562][26022] Updated weights on worker 0-0, policy_version 1223836 (0.00080) [2022-07-11 14:06:59,135][26022] Updated weights on worker 0-0, policy_version 1223846 (0.00087) [2022-07-11 14:07:01,096][26022] Updated weights on worker 0-0, policy_version 1223856 (0.00094) [2022-07-11 14:07:02,011][25689] Fps is (10 sec: 5591.2, 60 sec: 5601.6, 300 sec: 5630.7). Total num frames: 1253231616. Throughput: 0: 5064.9. Samples: 1253228386. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:02,011][25689] Avg episode reward: [(0, '2.120')] [2022-07-11 14:07:03,101][26022] Updated weights on worker 0-0, policy_version 1223866 (0.00090) [2022-07-11 14:07:04,967][26022] Updated weights on worker 0-0, policy_version 1223876 (0.00078) [2022-07-11 14:07:07,026][25689] Fps is (10 sec: 5297.6, 60 sec: 5600.3, 300 sec: 5618.0). Total num frames: 1253258240. Throughput: 0: 5788.5. Samples: 1253260090. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:07,027][25689] Avg episode reward: [(0, '1.764')] [2022-07-11 14:07:07,048][26022] Updated weights on worker 0-0, policy_version 1223886 (0.00084) [2022-07-11 14:07:08,955][26022] Updated weights on worker 0-0, policy_version 1223896 (0.00086) [2022-07-11 14:07:10,597][26022] Updated weights on worker 0-0, policy_version 1223906 (0.00086) [2022-07-11 14:07:12,135][25689] Fps is (10 sec: 5563.1, 60 sec: 5618.8, 300 sec: 5624.1). Total num frames: 1253287936. Throughput: 0: 5780.7. Samples: 1253293936. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:12,135][25689] Avg episode reward: [(0, '0.886')] [2022-07-11 14:07:12,547][26022] Updated weights on worker 0-0, policy_version 1223916 (0.00088) [2022-07-11 14:07:14,148][26022] Updated weights on worker 0-0, policy_version 1223926 (0.00086) [2022-07-11 14:07:16,042][26022] Updated weights on worker 0-0, policy_version 1223936 (0.00091) [2022-07-11 14:07:17,155][25689] Fps is (10 sec: 5762.6, 60 sec: 5619.2, 300 sec: 5620.6). Total num frames: 1253316608. Throughput: 0: 4945.7. Samples: 1253310924. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:17,155][25689] Avg episode reward: [(0, '0.368')] [2022-07-11 14:07:17,849][26022] Updated weights on worker 0-0, policy_version 1223946 (0.00087) [2022-07-11 14:07:19,570][26022] Updated weights on worker 0-0, policy_version 1223956 (0.00087) [2022-07-11 14:07:21,525][26022] Updated weights on worker 0-0, policy_version 1223966 (0.00067) [2022-07-11 14:07:22,212][25689] Fps is (10 sec: 5690.1, 60 sec: 5632.6, 300 sec: 5624.5). Total num frames: 1253345280. Throughput: 0: 5770.0. Samples: 1253344950. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:22,213][25689] Avg episode reward: [(0, '0.215')] [2022-07-11 14:07:23,331][26022] Updated weights on worker 0-0, policy_version 1223976 (0.00098) [2022-07-11 14:07:25,019][26022] Updated weights on worker 0-0, policy_version 1223986 (0.00085) [2022-07-11 14:07:26,919][26022] Updated weights on worker 0-0, policy_version 1223996 (0.00089) [2022-07-11 14:07:27,231][25689] Fps is (10 sec: 5589.5, 60 sec: 5600.5, 300 sec: 5626.6). Total num frames: 1253372928. Throughput: 0: 5874.5. Samples: 1253378782. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:27,231][25689] Avg episode reward: [(0, '0.092')] [2022-07-11 14:07:28,740][26022] Updated weights on worker 0-0, policy_version 1224006 (0.00083) [2022-07-11 14:07:30,712][26022] Updated weights on worker 0-0, policy_version 1224016 (0.00090) [2022-07-11 14:07:32,305][25689] Fps is (10 sec: 5580.2, 60 sec: 5604.2, 300 sec: 5618.7). Total num frames: 1253401600. Throughput: 0: 5048.7. Samples: 1253395772. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:32,307][25689] Avg episode reward: [(0, '0.019')] [2022-07-11 14:07:32,326][26022] Updated weights on worker 0-0, policy_version 1224026 (0.00087) [2022-07-11 14:07:34,206][26022] Updated weights on worker 0-0, policy_version 1224036 (0.00101) [2022-07-11 14:07:36,170][26022] Updated weights on worker 0-0, policy_version 1224046 (0.00091) [2022-07-11 14:07:37,309][25689] Fps is (10 sec: 5689.8, 60 sec: 5622.1, 300 sec: 5626.5). Total num frames: 1253430272. Throughput: 0: 5883.7. Samples: 1253429508. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:37,310][25689] Avg episode reward: [(0, '0.926')] [2022-07-11 14:07:37,745][26022] Updated weights on worker 0-0, policy_version 1224056 (0.00084) [2022-07-11 14:07:39,653][26022] Updated weights on worker 0-0, policy_version 1224066 (0.00080) [2022-07-11 14:07:41,245][26022] Updated weights on worker 0-0, policy_version 1224076 (0.00081) [2022-07-11 14:07:42,335][25689] Fps is (10 sec: 5513.2, 60 sec: 5588.4, 300 sec: 5612.6). Total num frames: 1253456896. Throughput: 0: 5897.1. Samples: 1253463614. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:42,335][25689] Avg episode reward: [(0, '0.870')] [2022-07-11 14:07:43,253][26022] Updated weights on worker 0-0, policy_version 1224086 (0.00089) [2022-07-11 14:07:45,101][26022] Updated weights on worker 0-0, policy_version 1224096 (0.00086) [2022-07-11 14:07:46,799][26022] Updated weights on worker 0-0, policy_version 1224106 (0.00081) [2022-07-11 14:07:47,354][25689] Fps is (10 sec: 5708.8, 60 sec: 5604.4, 300 sec: 5621.2). Total num frames: 1253487616. Throughput: 0: 5071.8. Samples: 1253480844. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:47,355][25689] Avg episode reward: [(0, '0.949')] [2022-07-11 14:07:48,718][26022] Updated weights on worker 0-0, policy_version 1224116 (0.00090) [2022-07-11 14:07:50,418][26022] Updated weights on worker 0-0, policy_version 1224126 (0.00082) [2022-07-11 14:07:52,288][26022] Updated weights on worker 0-0, policy_version 1224136 (0.00089) [2022-07-11 14:07:52,487][25689] Fps is (10 sec: 5749.5, 60 sec: 5614.7, 300 sec: 5626.1). Total num frames: 1253515264. Throughput: 0: 5892.4. Samples: 1253514690. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:52,487][25689] Avg episode reward: [(0, '-0.043')] [2022-07-11 14:07:54,093][26022] Updated weights on worker 0-0, policy_version 1224146 (0.00085) [2022-07-11 14:07:55,699][26022] Updated weights on worker 0-0, policy_version 1224156 (0.00087) [2022-07-11 14:07:57,578][25689] Fps is (10 sec: 5609.0, 60 sec: 5609.8, 300 sec: 5617.7). Total num frames: 1253544960. Throughput: 0: 5900.3. Samples: 1253549098. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:07:57,579][25689] Avg episode reward: [(0, '0.072')] [2022-07-11 14:07:57,631][26022] Updated weights on worker 0-0, policy_version 1224166 (0.00084) [2022-07-11 14:07:59,547][26022] Updated weights on worker 0-0, policy_version 1224176 (0.00081) [2022-07-11 14:08:01,344][26022] Updated weights on worker 0-0, policy_version 1224186 (0.00089) [2022-07-11 14:08:02,614][25689] Fps is (10 sec: 5662.6, 60 sec: 5626.7, 300 sec: 5628.1). Total num frames: 1253572608. Throughput: 0: 5784.3. Samples: 1253580912. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:02,615][25689] Avg episode reward: [(0, '0.664')] [2022-07-11 14:08:03,556][26022] Updated weights on worker 0-0, policy_version 1224196 (0.00093) [2022-07-11 14:08:05,272][26022] Updated weights on worker 0-0, policy_version 1224206 (0.00079) [2022-07-11 14:08:07,317][26022] Updated weights on worker 0-0, policy_version 1224216 (0.00086) [2022-07-11 14:08:07,644][25689] Fps is (10 sec: 5391.3, 60 sec: 5625.2, 300 sec: 5619.0). Total num frames: 1253599232. Throughput: 0: 5754.7. Samples: 1253597608. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:07,645][25689] Avg episode reward: [(0, '-0.033')] [2022-07-11 14:08:08,916][26022] Updated weights on worker 0-0, policy_version 1224226 (0.00089) [2022-07-11 14:08:11,003][26022] Updated weights on worker 0-0, policy_version 1224236 (0.00091) [2022-07-11 14:08:12,497][26022] Updated weights on worker 0-0, policy_version 1224246 (0.00091) [2022-07-11 14:08:12,737][25689] Fps is (10 sec: 5563.2, 60 sec: 5626.7, 300 sec: 5621.5). Total num frames: 1253628928. Throughput: 0: 5768.4. Samples: 1253631504. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:12,738][25689] Avg episode reward: [(0, '0.061')] [2022-07-11 14:08:13,244][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:08:13,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001224249_1253630976.pth [2022-07-11 14:08:13,253][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001222272_1251606528.pth [2022-07-11 14:08:14,576][26022] Updated weights on worker 0-0, policy_version 1224256 (0.00087) [2022-07-11 14:08:16,238][26022] Updated weights on worker 0-0, policy_version 1224266 (0.00077) [2022-07-11 14:08:17,741][25689] Fps is (10 sec: 5679.4, 60 sec: 5611.3, 300 sec: 5622.1). Total num frames: 1253656576. Throughput: 0: 5778.1. Samples: 1253665604. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:17,741][25689] Avg episode reward: [(0, '0.022')] [2022-07-11 14:08:18,102][26022] Updated weights on worker 0-0, policy_version 1224276 (0.00082) [2022-07-11 14:08:19,800][26022] Updated weights on worker 0-0, policy_version 1224286 (0.00083) [2022-07-11 14:08:21,694][26022] Updated weights on worker 0-0, policy_version 1224296 (0.00086) [2022-07-11 14:08:22,765][25689] Fps is (10 sec: 5616.6, 60 sec: 5614.4, 300 sec: 5626.7). Total num frames: 1253685248. Throughput: 0: 5046.9. Samples: 1253682612. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:22,767][25689] Avg episode reward: [(0, '0.920')] [2022-07-11 14:08:23,531][26022] Updated weights on worker 0-0, policy_version 1224306 (0.00087) [2022-07-11 14:08:25,342][26022] Updated weights on worker 0-0, policy_version 1224316 (0.00098) [2022-07-11 14:08:27,120][26022] Updated weights on worker 0-0, policy_version 1224326 (0.00083) [2022-07-11 14:08:27,780][25689] Fps is (10 sec: 5712.0, 60 sec: 5631.6, 300 sec: 5620.6). Total num frames: 1253713920. Throughput: 0: 5910.5. Samples: 1253716622. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:27,781][25689] Avg episode reward: [(0, '0.213')] [2022-07-11 14:08:28,724][26022] Updated weights on worker 0-0, policy_version 1224336 (0.00087) [2022-07-11 14:08:30,735][26022] Updated weights on worker 0-0, policy_version 1224346 (0.00084) [2022-07-11 14:08:32,390][26022] Updated weights on worker 0-0, policy_version 1224356 (0.00089) [2022-07-11 14:08:32,851][25689] Fps is (10 sec: 5584.0, 60 sec: 5615.1, 300 sec: 5623.1). Total num frames: 1253741568. Throughput: 0: 5932.0. Samples: 1253750816. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:32,851][25689] Avg episode reward: [(0, '-0.872')] [2022-07-11 14:08:34,374][26022] Updated weights on worker 0-0, policy_version 1224366 (0.00085) [2022-07-11 14:08:36,091][26022] Updated weights on worker 0-0, policy_version 1224376 (0.00094) [2022-07-11 14:08:37,813][26022] Updated weights on worker 0-0, policy_version 1224386 (0.00089) [2022-07-11 14:08:37,887][25689] Fps is (10 sec: 5775.3, 60 sec: 5645.9, 300 sec: 5626.0). Total num frames: 1253772288. Throughput: 0: 5073.5. Samples: 1253767814. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2022-07-11 14:08:37,887][25689] Avg episode reward: [(0, '-0.626')] [2022-07-11 14:08:39,842][26022] Updated weights on worker 0-0, policy_version 1224396 (0.00085) [2022-07-11 14:08:41,500][26022] Updated weights on worker 0-0, policy_version 1224406 (0.00083) [2022-07-11 14:08:42,909][25689] Fps is (10 sec: 5701.1, 60 sec: 5646.2, 300 sec: 5625.6). Total num frames: 1253798912. Throughput: 0: 5942.0. Samples: 1253802310. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:08:42,910][25689] Avg episode reward: [(0, '-0.171')] [2022-07-11 14:08:43,183][26022] Updated weights on worker 0-0, policy_version 1224416 (0.00086) [2022-07-11 14:08:45,011][26022] Updated weights on worker 0-0, policy_version 1224426 (0.00080) [2022-07-11 14:08:47,061][26022] Updated weights on worker 0-0, policy_version 1224436 (0.00086) [2022-07-11 14:08:47,939][25689] Fps is (10 sec: 5501.0, 60 sec: 5611.5, 300 sec: 5618.9). Total num frames: 1253827584. Throughput: 0: 5944.7. Samples: 1253836458. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:08:47,939][25689] Avg episode reward: [(0, '-0.546')] [2022-07-11 14:08:48,679][26022] Updated weights on worker 0-0, policy_version 1224446 (0.00084) [2022-07-11 14:08:50,481][26022] Updated weights on worker 0-0, policy_version 1224456 (0.00089) [2022-07-11 14:08:52,172][26022] Updated weights on worker 0-0, policy_version 1224466 (0.00089) [2022-07-11 14:08:53,005][25689] Fps is (10 sec: 5680.2, 60 sec: 5634.6, 300 sec: 5624.9). Total num frames: 1253856256. Throughput: 0: 5090.9. Samples: 1253853416. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:08:53,005][25689] Avg episode reward: [(0, '-0.603')] [2022-07-11 14:08:54,218][26022] Updated weights on worker 0-0, policy_version 1224476 (0.00085) [2022-07-11 14:08:55,971][26022] Updated weights on worker 0-0, policy_version 1224486 (0.00100) [2022-07-11 14:08:57,781][26022] Updated weights on worker 0-0, policy_version 1224496 (0.00082) [2022-07-11 14:08:58,040][25689] Fps is (10 sec: 5778.2, 60 sec: 5639.7, 300 sec: 5631.4). Total num frames: 1253885952. Throughput: 0: 5930.1. Samples: 1253887326. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:08:58,041][25689] Avg episode reward: [(0, '0.088')] [2022-07-11 14:08:59,572][26022] Updated weights on worker 0-0, policy_version 1224506 (0.00088) [2022-07-11 14:09:01,343][26022] Updated weights on worker 0-0, policy_version 1224516 (0.00507) [2022-07-11 14:09:03,057][25689] Fps is (10 sec: 5399.1, 60 sec: 5590.7, 300 sec: 5624.6). Total num frames: 1253910528. Throughput: 0: 5799.6. Samples: 1253919156. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:03,057][25689] Avg episode reward: [(0, '1.316')] [2022-07-11 14:09:03,525][26022] Updated weights on worker 0-0, policy_version 1224526 (0.00086) [2022-07-11 14:09:05,325][26022] Updated weights on worker 0-0, policy_version 1224536 (0.00085) [2022-07-11 14:09:07,150][26022] Updated weights on worker 0-0, policy_version 1224546 (0.00087) [2022-07-11 14:09:08,064][25689] Fps is (10 sec: 5311.9, 60 sec: 5626.7, 300 sec: 5626.4). Total num frames: 1253939200. Throughput: 0: 4938.9. Samples: 1253935856. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:08,066][25689] Avg episode reward: [(0, '1.277')] [2022-07-11 14:09:09,168][26022] Updated weights on worker 0-0, policy_version 1224556 (0.00094) [2022-07-11 14:09:10,797][26022] Updated weights on worker 0-0, policy_version 1224566 (0.00091) [2022-07-11 14:09:12,857][26022] Updated weights on worker 0-0, policy_version 1224576 (0.00088) [2022-07-11 14:09:13,185][25689] Fps is (10 sec: 5762.7, 60 sec: 5624.2, 300 sec: 5627.6). Total num frames: 1253968896. Throughput: 0: 5755.2. Samples: 1253969560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:13,186][25689] Avg episode reward: [(0, '0.892')] [2022-07-11 14:09:14,551][26022] Updated weights on worker 0-0, policy_version 1224586 (0.00084) [2022-07-11 14:09:16,287][26022] Updated weights on worker 0-0, policy_version 1224596 (0.00081) [2022-07-11 14:09:18,000][26022] Updated weights on worker 0-0, policy_version 1224606 (0.00086) [2022-07-11 14:09:18,207][25689] Fps is (10 sec: 5654.0, 60 sec: 5622.5, 300 sec: 5620.5). Total num frames: 1253996544. Throughput: 0: 5786.6. Samples: 1254004022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:18,208][25689] Avg episode reward: [(0, '1.400')] [2022-07-11 14:09:19,820][26022] Updated weights on worker 0-0, policy_version 1224616 (0.00092) [2022-07-11 14:09:21,651][26022] Updated weights on worker 0-0, policy_version 1224626 (0.00088) [2022-07-11 14:09:23,217][25689] Fps is (10 sec: 5716.0, 60 sec: 5640.6, 300 sec: 5631.7). Total num frames: 1254026240. Throughput: 0: 5035.4. Samples: 1254020674. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:23,219][25689] Avg episode reward: [(0, '1.386')] [2022-07-11 14:09:23,609][26022] Updated weights on worker 0-0, policy_version 1224636 (0.00081) [2022-07-11 14:09:25,099][26022] Updated weights on worker 0-0, policy_version 1224646 (0.00078) [2022-07-11 14:09:27,391][26022] Updated weights on worker 0-0, policy_version 1224656 (0.00087) [2022-07-11 14:09:28,235][25689] Fps is (10 sec: 5718.4, 60 sec: 5623.5, 300 sec: 5627.0). Total num frames: 1254053888. Throughput: 0: 5904.0. Samples: 1254054942. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:28,235][25689] Avg episode reward: [(0, '-0.175')] [2022-07-11 14:09:28,707][26022] Updated weights on worker 0-0, policy_version 1224666 (0.00086) [2022-07-11 14:09:30,977][26022] Updated weights on worker 0-0, policy_version 1224676 (0.00092) [2022-07-11 14:09:32,479][26022] Updated weights on worker 0-0, policy_version 1224686 (0.00083) [2022-07-11 14:09:33,315][25689] Fps is (10 sec: 5577.8, 60 sec: 5639.6, 300 sec: 5623.0). Total num frames: 1254082560. Throughput: 0: 5917.2. Samples: 1254088670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:33,315][25689] Avg episode reward: [(0, '-0.331')] [2022-07-11 14:09:34,603][26022] Updated weights on worker 0-0, policy_version 1224696 (0.00087) [2022-07-11 14:09:36,206][26022] Updated weights on worker 0-0, policy_version 1224706 (0.00596) [2022-07-11 14:09:38,154][26022] Updated weights on worker 0-0, policy_version 1224716 (0.00082) [2022-07-11 14:09:38,355][25689] Fps is (10 sec: 5666.4, 60 sec: 5605.3, 300 sec: 5622.8). Total num frames: 1254111232. Throughput: 0: 5045.1. Samples: 1254105670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:38,355][25689] Avg episode reward: [(0, '0.123')] [2022-07-11 14:09:39,730][26022] Updated weights on worker 0-0, policy_version 1224726 (0.00088) [2022-07-11 14:09:41,703][26022] Updated weights on worker 0-0, policy_version 1224736 (0.00089) [2022-07-11 14:09:43,342][26022] Updated weights on worker 0-0, policy_version 1224746 (0.00090) [2022-07-11 14:09:43,367][25689] Fps is (10 sec: 5704.6, 60 sec: 5640.2, 300 sec: 5627.2). Total num frames: 1254139904. Throughput: 0: 5906.2. Samples: 1254139680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:43,367][25689] Avg episode reward: [(0, '-0.155')] [2022-07-11 14:09:45,269][26022] Updated weights on worker 0-0, policy_version 1224756 (0.00093) [2022-07-11 14:09:47,139][26022] Updated weights on worker 0-0, policy_version 1224766 (0.00087) [2022-07-11 14:09:48,375][25689] Fps is (10 sec: 5723.0, 60 sec: 5642.2, 300 sec: 5628.9). Total num frames: 1254168576. Throughput: 0: 5901.6. Samples: 1254173800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:48,375][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 14:09:48,802][26022] Updated weights on worker 0-0, policy_version 1224776 (0.00083) [2022-07-11 14:09:50,855][26022] Updated weights on worker 0-0, policy_version 1224786 (0.00095) [2022-07-11 14:09:52,505][26022] Updated weights on worker 0-0, policy_version 1224796 (0.00084) [2022-07-11 14:09:53,437][25689] Fps is (10 sec: 5694.4, 60 sec: 5642.5, 300 sec: 5624.4). Total num frames: 1254197248. Throughput: 0: 5074.4. Samples: 1254190780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:53,437][25689] Avg episode reward: [(0, '-0.131')] [2022-07-11 14:09:54,318][26022] Updated weights on worker 0-0, policy_version 1224806 (0.00080) [2022-07-11 14:09:55,997][26022] Updated weights on worker 0-0, policy_version 1224816 (0.00088) [2022-07-11 14:09:57,991][26022] Updated weights on worker 0-0, policy_version 1224826 (0.00077) [2022-07-11 14:09:58,442][25689] Fps is (10 sec: 5695.9, 60 sec: 5628.4, 300 sec: 5628.8). Total num frames: 1254225920. Throughput: 0: 5943.5. Samples: 1254225060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:09:58,443][25689] Avg episode reward: [(0, '1.895')] [2022-07-11 14:09:59,528][26022] Updated weights on worker 0-0, policy_version 1224836 (0.00077) [2022-07-11 14:10:01,443][26022] Updated weights on worker 0-0, policy_version 1224846 (0.00084) [2022-07-11 14:10:03,445][25689] Fps is (10 sec: 5422.7, 60 sec: 5646.6, 300 sec: 5625.7). Total num frames: 1254251520. Throughput: 0: 5849.0. Samples: 1254257120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:03,446][25689] Avg episode reward: [(0, '1.922')] [2022-07-11 14:10:03,619][26022] Updated weights on worker 0-0, policy_version 1224856 (0.00091) [2022-07-11 14:10:05,615][26022] Updated weights on worker 0-0, policy_version 1224866 (0.00096) [2022-07-11 14:10:07,127][26022] Updated weights on worker 0-0, policy_version 1224876 (0.00086) [2022-07-11 14:10:08,508][25689] Fps is (10 sec: 5290.1, 60 sec: 5624.6, 300 sec: 5619.3). Total num frames: 1254279168. Throughput: 0: 4990.3. Samples: 1254274270. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:08,508][25689] Avg episode reward: [(0, '1.710')] [2022-07-11 14:10:09,107][26022] Updated weights on worker 0-0, policy_version 1224886 (0.00092) [2022-07-11 14:10:10,948][26022] Updated weights on worker 0-0, policy_version 1224896 (0.00085) [2022-07-11 14:10:12,659][26022] Updated weights on worker 0-0, policy_version 1224906 (0.00084) [2022-07-11 14:10:13,327][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:10:13,343][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001224910_1254307840.pth [2022-07-11 14:10:13,344][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001222930_1252280320.pth [2022-07-11 14:10:13,593][25689] Fps is (10 sec: 5650.8, 60 sec: 5627.9, 300 sec: 5628.6). Total num frames: 1254308864. Throughput: 0: 5812.9. Samples: 1254307946. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:13,594][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 14:10:14,667][26022] Updated weights on worker 0-0, policy_version 1224916 (0.00092) [2022-07-11 14:10:16,238][26022] Updated weights on worker 0-0, policy_version 1224926 (0.00081) [2022-07-11 14:10:18,154][26022] Updated weights on worker 0-0, policy_version 1224936 (0.00088) [2022-07-11 14:10:18,680][25689] Fps is (10 sec: 5637.2, 60 sec: 5621.8, 300 sec: 5623.6). Total num frames: 1254336512. Throughput: 0: 5769.0. Samples: 1254341812. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:18,681][25689] Avg episode reward: [(0, '1.683')] [2022-07-11 14:10:19,786][26022] Updated weights on worker 0-0, policy_version 1224946 (0.00092) [2022-07-11 14:10:21,902][26022] Updated weights on worker 0-0, policy_version 1224956 (0.00093) [2022-07-11 14:10:23,502][26022] Updated weights on worker 0-0, policy_version 1224966 (0.00083) [2022-07-11 14:10:23,753][25689] Fps is (10 sec: 5644.4, 60 sec: 5616.0, 300 sec: 5622.7). Total num frames: 1254366208. Throughput: 0: 5015.5. Samples: 1254358972. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:23,753][25689] Avg episode reward: [(0, '1.667')] [2022-07-11 14:10:25,512][26022] Updated weights on worker 0-0, policy_version 1224976 (0.00085) [2022-07-11 14:10:27,264][26022] Updated weights on worker 0-0, policy_version 1224986 (0.00093) [2022-07-11 14:10:28,780][25689] Fps is (10 sec: 5677.9, 60 sec: 5615.1, 300 sec: 5619.8). Total num frames: 1254393856. Throughput: 0: 5850.2. Samples: 1254392864. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:28,780][25689] Avg episode reward: [(0, '0.773')] [2022-07-11 14:10:29,111][26022] Updated weights on worker 0-0, policy_version 1224996 (0.00080) [2022-07-11 14:10:30,836][26022] Updated weights on worker 0-0, policy_version 1225006 (0.00089) [2022-07-11 14:10:32,557][26022] Updated weights on worker 0-0, policy_version 1225016 (0.00094) [2022-07-11 14:10:33,838][25689] Fps is (10 sec: 5686.1, 60 sec: 5634.1, 300 sec: 5626.8). Total num frames: 1254423552. Throughput: 0: 5883.9. Samples: 1254427062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:33,838][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 14:10:34,458][26022] Updated weights on worker 0-0, policy_version 1225026 (0.00094) [2022-07-11 14:10:36,149][26022] Updated weights on worker 0-0, policy_version 1225036 (0.00095) [2022-07-11 14:10:37,913][26022] Updated weights on worker 0-0, policy_version 1225046 (0.00092) [2022-07-11 14:10:38,853][25689] Fps is (10 sec: 5591.1, 60 sec: 5602.6, 300 sec: 5620.5). Total num frames: 1254450176. Throughput: 0: 5071.9. Samples: 1254444126. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:38,853][25689] Avg episode reward: [(0, '0.143')] [2022-07-11 14:10:40,077][26022] Updated weights on worker 0-0, policy_version 1225056 (0.00094) [2022-07-11 14:10:41,824][26022] Updated weights on worker 0-0, policy_version 1225066 (0.00087) [2022-07-11 14:10:43,517][26022] Updated weights on worker 0-0, policy_version 1225076 (0.00088) [2022-07-11 14:10:43,871][25689] Fps is (10 sec: 5613.6, 60 sec: 5618.9, 300 sec: 5620.7). Total num frames: 1254479872. Throughput: 0: 5892.2. Samples: 1254477510. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:43,871][25689] Avg episode reward: [(0, '0.386')] [2022-07-11 14:10:45,372][26022] Updated weights on worker 0-0, policy_version 1225086 (0.01005) [2022-07-11 14:10:47,083][26022] Updated weights on worker 0-0, policy_version 1225096 (0.00085) [2022-07-11 14:10:48,881][25689] Fps is (10 sec: 5616.4, 60 sec: 5584.9, 300 sec: 5619.0). Total num frames: 1254506496. Throughput: 0: 5905.6. Samples: 1254511572. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:48,881][25689] Avg episode reward: [(0, '-0.146')] [2022-07-11 14:10:49,160][26022] Updated weights on worker 0-0, policy_version 1225106 (0.00108) [2022-07-11 14:10:50,814][26022] Updated weights on worker 0-0, policy_version 1225116 (0.00081) [2022-07-11 14:10:52,634][26022] Updated weights on worker 0-0, policy_version 1225126 (0.00084) [2022-07-11 14:10:53,955][25689] Fps is (10 sec: 5584.9, 60 sec: 5600.7, 300 sec: 5621.5). Total num frames: 1254536192. Throughput: 0: 5884.5. Samples: 1254545442. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:53,955][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 14:10:54,349][26022] Updated weights on worker 0-0, policy_version 1225136 (0.00088) [2022-07-11 14:10:56,318][26022] Updated weights on worker 0-0, policy_version 1225146 (0.00086) [2022-07-11 14:10:58,122][26022] Updated weights on worker 0-0, policy_version 1225156 (0.00088) [2022-07-11 14:10:59,011][25689] Fps is (10 sec: 5761.5, 60 sec: 5596.0, 300 sec: 5620.7). Total num frames: 1254564864. Throughput: 0: 5880.6. Samples: 1254562670. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:10:59,012][25689] Avg episode reward: [(0, '0.143')] [2022-07-11 14:10:59,780][26022] Updated weights on worker 0-0, policy_version 1225166 (0.00091) [2022-07-11 14:11:01,504][26022] Updated weights on worker 0-0, policy_version 1225176 (0.00086) [2022-07-11 14:11:03,862][26022] Updated weights on worker 0-0, policy_version 1225186 (0.00107) [2022-07-11 14:11:04,018][25689] Fps is (10 sec: 5393.1, 60 sec: 5595.7, 300 sec: 5617.1). Total num frames: 1254590464. Throughput: 0: 5821.4. Samples: 1254594796. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:04,019][25689] Avg episode reward: [(0, '-0.785')] [2022-07-11 14:11:05,471][26022] Updated weights on worker 0-0, policy_version 1225196 (0.00089) [2022-07-11 14:11:07,507][26022] Updated weights on worker 0-0, policy_version 1225206 (0.00088) [2022-07-11 14:11:09,051][25689] Fps is (10 sec: 5507.8, 60 sec: 5632.2, 300 sec: 5622.3). Total num frames: 1254620160. Throughput: 0: 5804.1. Samples: 1254628642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:09,051][25689] Avg episode reward: [(0, '-0.765')] [2022-07-11 14:11:09,300][26022] Updated weights on worker 0-0, policy_version 1225216 (0.00088) [2022-07-11 14:11:11,021][26022] Updated weights on worker 0-0, policy_version 1225226 (0.00091) [2022-07-11 14:11:13,016][26022] Updated weights on worker 0-0, policy_version 1225236 (0.00090) [2022-07-11 14:11:14,119][25689] Fps is (10 sec: 5778.7, 60 sec: 5617.0, 300 sec: 5621.5). Total num frames: 1254648832. Throughput: 0: 4965.7. Samples: 1254645568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:14,120][25689] Avg episode reward: [(0, '-0.851')] [2022-07-11 14:11:14,625][26022] Updated weights on worker 0-0, policy_version 1225246 (0.00089) [2022-07-11 14:11:16,570][26022] Updated weights on worker 0-0, policy_version 1225256 (0.00088) [2022-07-11 14:11:18,199][26022] Updated weights on worker 0-0, policy_version 1225266 (0.00088) [2022-07-11 14:11:19,197][25689] Fps is (10 sec: 5449.9, 60 sec: 5600.8, 300 sec: 5616.9). Total num frames: 1254675456. Throughput: 0: 5804.5. Samples: 1254679838. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:19,198][25689] Avg episode reward: [(0, '-0.063')] [2022-07-11 14:11:20,054][26022] Updated weights on worker 0-0, policy_version 1225276 (0.00084) [2022-07-11 14:11:22,124][26022] Updated weights on worker 0-0, policy_version 1225286 (0.00092) [2022-07-11 14:11:23,760][26022] Updated weights on worker 0-0, policy_version 1225296 (0.00089) [2022-07-11 14:11:24,199][25689] Fps is (10 sec: 5587.0, 60 sec: 5607.4, 300 sec: 5617.6). Total num frames: 1254705152. Throughput: 0: 5891.3. Samples: 1254713688. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:24,200][25689] Avg episode reward: [(0, '-0.412')] [2022-07-11 14:11:25,639][26022] Updated weights on worker 0-0, policy_version 1225306 (0.00087) [2022-07-11 14:11:27,303][26022] Updated weights on worker 0-0, policy_version 1225316 (0.00053) [2022-07-11 14:11:29,121][26022] Updated weights on worker 0-0, policy_version 1225326 (0.00086) [2022-07-11 14:11:29,223][25689] Fps is (10 sec: 5821.9, 60 sec: 5624.6, 300 sec: 5619.3). Total num frames: 1254733824. Throughput: 0: 5052.6. Samples: 1254730562. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:29,223][25689] Avg episode reward: [(0, '-0.423')] [2022-07-11 14:11:31,174][26022] Updated weights on worker 0-0, policy_version 1225336 (0.00087) [2022-07-11 14:11:32,726][26022] Updated weights on worker 0-0, policy_version 1225346 (0.00082) [2022-07-11 14:11:34,294][25689] Fps is (10 sec: 5680.3, 60 sec: 5606.4, 300 sec: 5621.7). Total num frames: 1254762496. Throughput: 0: 5905.9. Samples: 1254764724. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:34,295][25689] Avg episode reward: [(0, '1.259')] [2022-07-11 14:11:34,583][26022] Updated weights on worker 0-0, policy_version 1225356 (0.00085) [2022-07-11 14:11:36,372][26022] Updated weights on worker 0-0, policy_version 1225366 (0.00093) [2022-07-11 14:11:38,179][26022] Updated weights on worker 0-0, policy_version 1225376 (0.00088) [2022-07-11 14:11:39,373][25689] Fps is (10 sec: 5548.5, 60 sec: 5617.4, 300 sec: 5617.3). Total num frames: 1254790144. Throughput: 0: 5895.4. Samples: 1254798784. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:39,376][25689] Avg episode reward: [(0, '1.120')] [2022-07-11 14:11:39,950][26022] Updated weights on worker 0-0, policy_version 1225386 (0.00084) [2022-07-11 14:11:41,842][26022] Updated weights on worker 0-0, policy_version 1225396 (0.00088) [2022-07-11 14:11:43,451][26022] Updated weights on worker 0-0, policy_version 1225406 (0.00084) [2022-07-11 14:11:44,474][25689] Fps is (10 sec: 5633.3, 60 sec: 5609.7, 300 sec: 5615.5). Total num frames: 1254819840. Throughput: 0: 5021.9. Samples: 1254815506. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:44,474][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 14:11:45,696][26022] Updated weights on worker 0-0, policy_version 1225416 (0.00082) [2022-07-11 14:11:47,159][26022] Updated weights on worker 0-0, policy_version 1225426 (0.00083) [2022-07-11 14:11:49,142][26022] Updated weights on worker 0-0, policy_version 1225436 (0.00097) [2022-07-11 14:11:49,539][25689] Fps is (10 sec: 5842.1, 60 sec: 5655.2, 300 sec: 5625.8). Total num frames: 1254849536. Throughput: 0: 5858.6. Samples: 1254849590. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:49,541][25689] Avg episode reward: [(0, '0.539')] [2022-07-11 14:11:50,938][26022] Updated weights on worker 0-0, policy_version 1225446 (0.00082) [2022-07-11 14:11:52,613][26022] Updated weights on worker 0-0, policy_version 1225456 (0.00090) [2022-07-11 14:11:54,618][25689] Fps is (10 sec: 5652.9, 60 sec: 5621.1, 300 sec: 5618.1). Total num frames: 1254877184. Throughput: 0: 5857.8. Samples: 1254883776. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:54,618][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 14:11:54,625][26022] Updated weights on worker 0-0, policy_version 1225466 (0.00086) [2022-07-11 14:11:56,215][26022] Updated weights on worker 0-0, policy_version 1225476 (0.00088) [2022-07-11 14:11:58,042][26022] Updated weights on worker 0-0, policy_version 1225486 (0.00611) [2022-07-11 14:11:59,703][25689] Fps is (10 sec: 5440.5, 60 sec: 5601.5, 300 sec: 5620.6). Total num frames: 1254904832. Throughput: 0: 5016.2. Samples: 1254900764. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:11:59,704][25689] Avg episode reward: [(0, '0.181')] [2022-07-11 14:12:00,003][26022] Updated weights on worker 0-0, policy_version 1225496 (0.00087) [2022-07-11 14:12:02,069][26022] Updated weights on worker 0-0, policy_version 1225506 (0.00085) [2022-07-11 14:12:04,013][26022] Updated weights on worker 0-0, policy_version 1225516 (0.00086) [2022-07-11 14:12:04,785][25689] Fps is (10 sec: 5438.7, 60 sec: 5628.3, 300 sec: 5622.8). Total num frames: 1254932480. Throughput: 0: 5770.4. Samples: 1254932712. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:04,786][25689] Avg episode reward: [(0, '0.085')] [2022-07-11 14:12:05,891][26022] Updated weights on worker 0-0, policy_version 1225526 (0.00088) [2022-07-11 14:12:07,601][26022] Updated weights on worker 0-0, policy_version 1225536 (0.00514) [2022-07-11 14:12:09,411][26022] Updated weights on worker 0-0, policy_version 1225546 (0.00080) [2022-07-11 14:12:09,799][25689] Fps is (10 sec: 5578.4, 60 sec: 5613.2, 300 sec: 5621.1). Total num frames: 1254961152. Throughput: 0: 5764.2. Samples: 1254966372. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:09,800][25689] Avg episode reward: [(0, '-0.174')] [2022-07-11 14:12:11,390][26022] Updated weights on worker 0-0, policy_version 1225556 (0.00097) [2022-07-11 14:12:12,934][26022] Updated weights on worker 0-0, policy_version 1225566 (0.00085) [2022-07-11 14:12:13,436][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:12:13,440][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001225569_1254982656.pth [2022-07-11 14:12:13,441][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001223591_1252957184.pth [2022-07-11 14:12:14,877][25689] Fps is (10 sec: 5580.6, 60 sec: 5595.4, 300 sec: 5616.6). Total num frames: 1254988800. Throughput: 0: 4916.5. Samples: 1254983380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:14,878][25689] Avg episode reward: [(0, '0.273')] [2022-07-11 14:12:15,072][26022] Updated weights on worker 0-0, policy_version 1225576 (0.00086) [2022-07-11 14:12:16,323][26022] Updated weights on worker 0-0, policy_version 1225586 (0.00082) [2022-07-11 14:12:18,483][26022] Updated weights on worker 0-0, policy_version 1225596 (0.00082) [2022-07-11 14:12:19,948][25689] Fps is (10 sec: 5750.9, 60 sec: 5663.5, 300 sec: 5623.2). Total num frames: 1255019520. Throughput: 0: 5769.0. Samples: 1255017560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:19,949][25689] Avg episode reward: [(0, '0.161')] [2022-07-11 14:12:20,115][26022] Updated weights on worker 0-0, policy_version 1225606 (0.00095) [2022-07-11 14:12:22,001][26022] Updated weights on worker 0-0, policy_version 1225616 (0.00087) [2022-07-11 14:12:23,887][26022] Updated weights on worker 0-0, policy_version 1225626 (0.00086) [2022-07-11 14:12:25,011][25689] Fps is (10 sec: 5760.0, 60 sec: 5624.2, 300 sec: 5622.4). Total num frames: 1255047168. Throughput: 0: 5885.5. Samples: 1255051748. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:25,011][25689] Avg episode reward: [(0, '1.371')] [2022-07-11 14:12:25,572][26022] Updated weights on worker 0-0, policy_version 1225636 (0.00088) [2022-07-11 14:12:27,597][26022] Updated weights on worker 0-0, policy_version 1225646 (0.00089) [2022-07-11 14:12:29,275][26022] Updated weights on worker 0-0, policy_version 1225656 (0.00082) [2022-07-11 14:12:30,050][25689] Fps is (10 sec: 5575.1, 60 sec: 5622.7, 300 sec: 5623.1). Total num frames: 1255075840. Throughput: 0: 5051.5. Samples: 1255068672. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:30,051][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 14:12:31,054][26022] Updated weights on worker 0-0, policy_version 1225666 (0.00084) [2022-07-11 14:12:32,964][26022] Updated weights on worker 0-0, policy_version 1225676 (0.00087) [2022-07-11 14:12:34,881][26022] Updated weights on worker 0-0, policy_version 1225686 (0.00093) [2022-07-11 14:12:35,203][25689] Fps is (10 sec: 5626.2, 60 sec: 5615.2, 300 sec: 5620.3). Total num frames: 1255104512. Throughput: 0: 5875.3. Samples: 1255102798. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:35,203][25689] Avg episode reward: [(0, '1.329')] [2022-07-11 14:12:36,551][26022] Updated weights on worker 0-0, policy_version 1225696 (0.00089) [2022-07-11 14:12:38,317][26022] Updated weights on worker 0-0, policy_version 1225706 (0.00093) [2022-07-11 14:12:40,067][26022] Updated weights on worker 0-0, policy_version 1225716 (0.00085) [2022-07-11 14:12:40,283][25689] Fps is (10 sec: 5603.8, 60 sec: 5631.9, 300 sec: 5626.1). Total num frames: 1255133184. Throughput: 0: 5860.4. Samples: 1255136732. Policy #0 lag: (min: 0.0, avg: 9.2, max: 19.0) [2022-07-11 14:12:40,284][25689] Avg episode reward: [(0, '1.273')] [2022-07-11 14:12:41,968][26022] Updated weights on worker 0-0, policy_version 1225726 (0.00087) [2022-07-11 14:12:43,958][26022] Updated weights on worker 0-0, policy_version 1225736 (0.00090) [2022-07-11 14:12:45,323][25689] Fps is (10 sec: 5666.4, 60 sec: 5620.7, 300 sec: 5618.8). Total num frames: 1255161856. Throughput: 0: 5845.0. Samples: 1255170474. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:12:45,323][25689] Avg episode reward: [(0, '1.430')] [2022-07-11 14:12:45,465][26022] Updated weights on worker 0-0, policy_version 1225746 (0.00082) [2022-07-11 14:12:47,392][26022] Updated weights on worker 0-0, policy_version 1225756 (0.00085) [2022-07-11 14:12:49,188][26022] Updated weights on worker 0-0, policy_version 1225766 (0.00093) [2022-07-11 14:12:50,342][25689] Fps is (10 sec: 5802.9, 60 sec: 5625.0, 300 sec: 5627.9). Total num frames: 1255191552. Throughput: 0: 5870.8. Samples: 1255187798. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:12:50,343][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 14:12:51,106][26022] Updated weights on worker 0-0, policy_version 1225776 (0.00078) [2022-07-11 14:12:52,693][26022] Updated weights on worker 0-0, policy_version 1225786 (0.00087) [2022-07-11 14:12:54,805][26022] Updated weights on worker 0-0, policy_version 1225796 (0.00082) [2022-07-11 14:12:55,414][25689] Fps is (10 sec: 5784.4, 60 sec: 5642.5, 300 sec: 5624.8). Total num frames: 1255220224. Throughput: 0: 5892.7. Samples: 1255221894. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:12:55,414][25689] Avg episode reward: [(0, '0.464')] [2022-07-11 14:12:56,310][26022] Updated weights on worker 0-0, policy_version 1225806 (0.00096) [2022-07-11 14:12:58,330][26022] Updated weights on worker 0-0, policy_version 1225816 (0.00095) [2022-07-11 14:13:00,076][26022] Updated weights on worker 0-0, policy_version 1225826 (0.00088) [2022-07-11 14:13:00,443][25689] Fps is (10 sec: 5575.8, 60 sec: 5647.7, 300 sec: 5624.9). Total num frames: 1255247872. Throughput: 0: 5902.7. Samples: 1255255726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:00,443][25689] Avg episode reward: [(0, '0.478')] [2022-07-11 14:13:02,175][26022] Updated weights on worker 0-0, policy_version 1225836 (0.00098) [2022-07-11 14:13:03,923][26022] Updated weights on worker 0-0, policy_version 1225846 (0.00075) [2022-07-11 14:13:05,469][25689] Fps is (10 sec: 5397.5, 60 sec: 5636.1, 300 sec: 5625.0). Total num frames: 1255274496. Throughput: 0: 4970.6. Samples: 1255270610. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:05,469][25689] Avg episode reward: [(0, '0.345')] [2022-07-11 14:13:05,889][26022] Updated weights on worker 0-0, policy_version 1225856 (0.00080) [2022-07-11 14:13:07,715][26022] Updated weights on worker 0-0, policy_version 1225866 (0.00082) [2022-07-11 14:13:09,375][26022] Updated weights on worker 0-0, policy_version 1225876 (0.00086) [2022-07-11 14:13:10,492][25689] Fps is (10 sec: 5400.4, 60 sec: 5618.3, 300 sec: 5619.4). Total num frames: 1255302144. Throughput: 0: 5797.9. Samples: 1255304628. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:10,493][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 14:13:11,245][26022] Updated weights on worker 0-0, policy_version 1225886 (0.00088) [2022-07-11 14:13:13,036][26022] Updated weights on worker 0-0, policy_version 1225896 (0.00084) [2022-07-11 14:13:15,018][26022] Updated weights on worker 0-0, policy_version 1225906 (0.00087) [2022-07-11 14:13:15,566][25689] Fps is (10 sec: 5577.9, 60 sec: 5635.6, 300 sec: 5621.6). Total num frames: 1255330816. Throughput: 0: 5766.8. Samples: 1255338106. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:15,566][25689] Avg episode reward: [(0, '0.081')] [2022-07-11 14:13:16,776][26022] Updated weights on worker 0-0, policy_version 1225916 (0.00096) [2022-07-11 14:13:18,702][26022] Updated weights on worker 0-0, policy_version 1225926 (0.00087) [2022-07-11 14:13:20,392][26022] Updated weights on worker 0-0, policy_version 1225936 (0.00066) [2022-07-11 14:13:20,575][25689] Fps is (10 sec: 5687.7, 60 sec: 5607.6, 300 sec: 5621.8). Total num frames: 1255359488. Throughput: 0: 4933.4. Samples: 1255355044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:20,575][25689] Avg episode reward: [(0, '0.357')] [2022-07-11 14:13:22,338][26022] Updated weights on worker 0-0, policy_version 1225946 (0.00083) [2022-07-11 14:13:24,223][26022] Updated weights on worker 0-0, policy_version 1225956 (0.00101) [2022-07-11 14:13:25,627][25689] Fps is (10 sec: 5496.0, 60 sec: 5591.6, 300 sec: 5614.3). Total num frames: 1255386112. Throughput: 0: 5846.6. Samples: 1255388468. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:25,627][25689] Avg episode reward: [(0, '-0.137')] [2022-07-11 14:13:26,080][26022] Updated weights on worker 0-0, policy_version 1225966 (0.00089) [2022-07-11 14:13:27,959][26022] Updated weights on worker 0-0, policy_version 1225976 (0.00083) [2022-07-11 14:13:29,674][26022] Updated weights on worker 0-0, policy_version 1225986 (0.00086) [2022-07-11 14:13:30,661][25689] Fps is (10 sec: 5482.4, 60 sec: 5592.2, 300 sec: 5618.4). Total num frames: 1255414784. Throughput: 0: 5835.6. Samples: 1255422322. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:30,661][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 14:13:31,563][26022] Updated weights on worker 0-0, policy_version 1225996 (0.00087) [2022-07-11 14:13:33,293][26022] Updated weights on worker 0-0, policy_version 1226006 (0.00087) [2022-07-11 14:13:35,051][26022] Updated weights on worker 0-0, policy_version 1226016 (0.00098) [2022-07-11 14:13:35,703][25689] Fps is (10 sec: 5691.2, 60 sec: 5602.4, 300 sec: 5611.4). Total num frames: 1255443456. Throughput: 0: 5009.2. Samples: 1255438978. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:35,703][25689] Avg episode reward: [(0, '-0.281')] [2022-07-11 14:13:37,063][26022] Updated weights on worker 0-0, policy_version 1226026 (0.00101) [2022-07-11 14:13:38,615][26022] Updated weights on worker 0-0, policy_version 1226036 (0.00084) [2022-07-11 14:13:40,705][25689] Fps is (10 sec: 5504.9, 60 sec: 5575.7, 300 sec: 5611.8). Total num frames: 1255470080. Throughput: 0: 5854.9. Samples: 1255472908. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:40,706][25689] Avg episode reward: [(0, '-0.061')] [2022-07-11 14:13:40,727][26022] Updated weights on worker 0-0, policy_version 1226046 (0.00092) [2022-07-11 14:13:42,500][26022] Updated weights on worker 0-0, policy_version 1226057 (0.00092) [2022-07-11 14:13:44,207][26022] Updated weights on worker 0-0, policy_version 1226067 (0.00090) [2022-07-11 14:13:45,710][25689] Fps is (10 sec: 5628.2, 60 sec: 5595.9, 300 sec: 5615.7). Total num frames: 1255499776. Throughput: 0: 5882.0. Samples: 1255506594. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:45,710][25689] Avg episode reward: [(0, '0.039')] [2022-07-11 14:13:46,414][26022] Updated weights on worker 0-0, policy_version 1226077 (0.00084) [2022-07-11 14:13:47,801][26022] Updated weights on worker 0-0, policy_version 1226087 (0.00085) [2022-07-11 14:13:49,787][26022] Updated weights on worker 0-0, policy_version 1226097 (0.00092) [2022-07-11 14:13:50,775][25689] Fps is (10 sec: 5898.3, 60 sec: 5591.7, 300 sec: 5619.2). Total num frames: 1255529472. Throughput: 0: 5036.0. Samples: 1255523616. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:50,775][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 14:13:51,769][26022] Updated weights on worker 0-0, policy_version 1226107 (0.00080) [2022-07-11 14:13:53,323][26022] Updated weights on worker 0-0, policy_version 1226117 (0.00085) [2022-07-11 14:13:55,378][26022] Updated weights on worker 0-0, policy_version 1226127 (0.00096) [2022-07-11 14:13:55,842][25689] Fps is (10 sec: 5659.3, 60 sec: 5575.1, 300 sec: 5611.7). Total num frames: 1255557120. Throughput: 0: 5883.5. Samples: 1255557468. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:13:55,843][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 14:13:57,016][26022] Updated weights on worker 0-0, policy_version 1226137 (0.00083) [2022-07-11 14:13:58,817][26022] Updated weights on worker 0-0, policy_version 1226147 (0.00085) [2022-07-11 14:14:00,819][26022] Updated weights on worker 0-0, policy_version 1226157 (0.00089) [2022-07-11 14:14:00,856][25689] Fps is (10 sec: 5485.1, 60 sec: 5576.5, 300 sec: 5622.0). Total num frames: 1255584768. Throughput: 0: 5900.7. Samples: 1255591810. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:00,856][25689] Avg episode reward: [(0, '1.753')] [2022-07-11 14:14:02,620][26022] Updated weights on worker 0-0, policy_version 1226167 (0.00093) [2022-07-11 14:14:04,663][26022] Updated weights on worker 0-0, policy_version 1226177 (0.00088) [2022-07-11 14:14:05,871][25689] Fps is (10 sec: 5513.9, 60 sec: 5594.5, 300 sec: 5618.5). Total num frames: 1255612416. Throughput: 0: 4976.6. Samples: 1255606928. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:05,871][25689] Avg episode reward: [(0, '1.793')] [2022-07-11 14:14:06,187][26022] Updated weights on worker 0-0, policy_version 1226187 (0.00091) [2022-07-11 14:14:08,177][26022] Updated weights on worker 0-0, policy_version 1226197 (0.00084) [2022-07-11 14:14:09,887][26022] Updated weights on worker 0-0, policy_version 1226207 (0.00085) [2022-07-11 14:14:10,874][25689] Fps is (10 sec: 5621.6, 60 sec: 5613.3, 300 sec: 5617.2). Total num frames: 1255641088. Throughput: 0: 5862.0. Samples: 1255641440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:10,876][25689] Avg episode reward: [(0, '1.084')] [2022-07-11 14:14:11,743][26022] Updated weights on worker 0-0, policy_version 1226217 (0.00084) [2022-07-11 14:14:13,442][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:14:13,450][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001226227_1255656448.pth [2022-07-11 14:14:13,455][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001224249_1253630976.pth [2022-07-11 14:14:13,461][26022] Updated weights on worker 0-0, policy_version 1226227 (0.00084) [2022-07-11 14:14:15,417][26022] Updated weights on worker 0-0, policy_version 1226237 (0.00093) [2022-07-11 14:14:15,928][25689] Fps is (10 sec: 5701.6, 60 sec: 5615.1, 300 sec: 5620.1). Total num frames: 1255669760. Throughput: 0: 5882.7. Samples: 1255675628. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:15,930][25689] Avg episode reward: [(0, '0.951')] [2022-07-11 14:14:16,998][26022] Updated weights on worker 0-0, policy_version 1226247 (0.00086) [2022-07-11 14:14:19,011][26022] Updated weights on worker 0-0, policy_version 1226257 (0.00085) [2022-07-11 14:14:20,580][26022] Updated weights on worker 0-0, policy_version 1226267 (0.00087) [2022-07-11 14:14:20,943][25689] Fps is (10 sec: 5593.3, 60 sec: 5597.6, 300 sec: 5613.1). Total num frames: 1255697408. Throughput: 0: 5025.8. Samples: 1255692766. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:20,944][25689] Avg episode reward: [(0, '1.193')] [2022-07-11 14:14:22,359][26022] Updated weights on worker 0-0, policy_version 1226277 (0.00082) [2022-07-11 14:14:24,435][26022] Updated weights on worker 0-0, policy_version 1226287 (0.00085) [2022-07-11 14:14:25,970][25689] Fps is (10 sec: 5710.3, 60 sec: 5650.8, 300 sec: 5619.8). Total num frames: 1255727104. Throughput: 0: 5978.7. Samples: 1255727098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:25,971][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 14:14:26,031][26022] Updated weights on worker 0-0, policy_version 1226297 (0.00096) [2022-07-11 14:14:27,975][26022] Updated weights on worker 0-0, policy_version 1226307 (0.00088) [2022-07-11 14:14:29,765][26022] Updated weights on worker 0-0, policy_version 1226317 (0.00089) [2022-07-11 14:14:31,000][25689] Fps is (10 sec: 5803.9, 60 sec: 5651.2, 300 sec: 5620.8). Total num frames: 1255755776. Throughput: 0: 5957.0. Samples: 1255761330. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:31,001][25689] Avg episode reward: [(0, '-0.125')] [2022-07-11 14:14:31,436][26022] Updated weights on worker 0-0, policy_version 1226327 (0.00086) [2022-07-11 14:14:33,401][26022] Updated weights on worker 0-0, policy_version 1226337 (0.00083) [2022-07-11 14:14:35,079][26022] Updated weights on worker 0-0, policy_version 1226347 (0.00090) [2022-07-11 14:14:36,120][25689] Fps is (10 sec: 5548.9, 60 sec: 5627.0, 300 sec: 5615.8). Total num frames: 1255783424. Throughput: 0: 5089.2. Samples: 1255778390. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:36,121][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 14:14:36,827][26022] Updated weights on worker 0-0, policy_version 1226357 (0.00083) [2022-07-11 14:14:38,727][26022] Updated weights on worker 0-0, policy_version 1226367 (0.00092) [2022-07-11 14:14:40,279][26022] Updated weights on worker 0-0, policy_version 1226377 (0.00086) [2022-07-11 14:14:41,139][25689] Fps is (10 sec: 5656.0, 60 sec: 5676.3, 300 sec: 5619.1). Total num frames: 1255813120. Throughput: 0: 5947.1. Samples: 1255812868. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:41,139][25689] Avg episode reward: [(0, '0.531')] [2022-07-11 14:14:42,421][26022] Updated weights on worker 0-0, policy_version 1226387 (0.00083) [2022-07-11 14:14:44,142][26022] Updated weights on worker 0-0, policy_version 1226397 (0.00083) [2022-07-11 14:14:45,920][26022] Updated weights on worker 0-0, policy_version 1226407 (0.00090) [2022-07-11 14:14:46,162][25689] Fps is (10 sec: 5812.5, 60 sec: 5657.6, 300 sec: 5618.8). Total num frames: 1255841792. Throughput: 0: 5928.6. Samples: 1255846806. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:46,163][25689] Avg episode reward: [(0, '0.632')] [2022-07-11 14:14:47,707][26022] Updated weights on worker 0-0, policy_version 1226417 (0.00086) [2022-07-11 14:14:49,409][26022] Updated weights on worker 0-0, policy_version 1226427 (0.00090) [2022-07-11 14:14:51,193][25689] Fps is (10 sec: 5703.3, 60 sec: 5643.8, 300 sec: 5619.4). Total num frames: 1255870464. Throughput: 0: 5079.9. Samples: 1255863910. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:51,194][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 14:14:51,264][26022] Updated weights on worker 0-0, policy_version 1226437 (0.00092) [2022-07-11 14:14:52,963][26022] Updated weights on worker 0-0, policy_version 1226447 (0.00087) [2022-07-11 14:14:54,787][26022] Updated weights on worker 0-0, policy_version 1226457 (0.00085) [2022-07-11 14:14:56,319][25689] Fps is (10 sec: 5746.7, 60 sec: 5672.3, 300 sec: 5620.6). Total num frames: 1255900160. Throughput: 0: 5949.3. Samples: 1255898556. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:14:56,321][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 14:14:56,530][26022] Updated weights on worker 0-0, policy_version 1226467 (0.00098) [2022-07-11 14:14:58,540][26022] Updated weights on worker 0-0, policy_version 1226477 (0.00087) [2022-07-11 14:15:00,254][26022] Updated weights on worker 0-0, policy_version 1226487 (0.00084) [2022-07-11 14:15:01,333][25689] Fps is (10 sec: 5655.6, 60 sec: 5672.3, 300 sec: 5627.3). Total num frames: 1255927808. Throughput: 0: 5935.3. Samples: 1255932724. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:01,333][25689] Avg episode reward: [(0, '1.157')] [2022-07-11 14:15:02,097][26022] Updated weights on worker 0-0, policy_version 1226497 (0.00089) [2022-07-11 14:15:04,261][26022] Updated weights on worker 0-0, policy_version 1226507 (0.00087) [2022-07-11 14:15:06,041][26022] Updated weights on worker 0-0, policy_version 1226517 (0.00087) [2022-07-11 14:15:06,347][25689] Fps is (10 sec: 5514.0, 60 sec: 5672.3, 300 sec: 5628.2). Total num frames: 1255955456. Throughput: 0: 4997.2. Samples: 1255947674. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:06,349][25689] Avg episode reward: [(0, '1.165')] [2022-07-11 14:15:07,984][26022] Updated weights on worker 0-0, policy_version 1226527 (0.00085) [2022-07-11 14:15:09,577][26022] Updated weights on worker 0-0, policy_version 1226537 (0.00088) [2022-07-11 14:15:11,423][25689] Fps is (10 sec: 5480.1, 60 sec: 5648.6, 300 sec: 5621.5). Total num frames: 1255983104. Throughput: 0: 5832.5. Samples: 1255981900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:11,425][25689] Avg episode reward: [(0, '1.199')] [2022-07-11 14:15:11,477][26022] Updated weights on worker 0-0, policy_version 1226547 (0.00056) [2022-07-11 14:15:13,192][26022] Updated weights on worker 0-0, policy_version 1226557 (0.00091) [2022-07-11 14:15:14,882][26022] Updated weights on worker 0-0, policy_version 1226567 (0.00565) [2022-07-11 14:15:16,522][25689] Fps is (10 sec: 5635.9, 60 sec: 5661.3, 300 sec: 5628.1). Total num frames: 1256012800. Throughput: 0: 5826.2. Samples: 1256016264. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:16,522][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 14:15:16,786][26022] Updated weights on worker 0-0, policy_version 1226577 (0.00089) [2022-07-11 14:15:18,452][26022] Updated weights on worker 0-0, policy_version 1226587 (0.00090) [2022-07-11 14:15:20,315][26022] Updated weights on worker 0-0, policy_version 1226597 (0.00091) [2022-07-11 14:15:21,586][25689] Fps is (10 sec: 5944.5, 60 sec: 5707.4, 300 sec: 5631.7). Total num frames: 1256043520. Throughput: 0: 5833.2. Samples: 1256050868. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:21,587][25689] Avg episode reward: [(0, '0.264')] [2022-07-11 14:15:22,040][26022] Updated weights on worker 0-0, policy_version 1226607 (0.00092) [2022-07-11 14:15:23,984][26022] Updated weights on worker 0-0, policy_version 1226617 (0.00100) [2022-07-11 14:15:25,631][26022] Updated weights on worker 0-0, policy_version 1226627 (0.00084) [2022-07-11 14:15:26,597][25689] Fps is (10 sec: 5691.9, 60 sec: 5658.3, 300 sec: 5628.6). Total num frames: 1256070144. Throughput: 0: 5945.5. Samples: 1256068068. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:26,597][25689] Avg episode reward: [(0, '0.388')] [2022-07-11 14:15:27,419][26022] Updated weights on worker 0-0, policy_version 1226637 (0.00088) [2022-07-11 14:15:29,297][26022] Updated weights on worker 0-0, policy_version 1226647 (0.00083) [2022-07-11 14:15:30,971][26022] Updated weights on worker 0-0, policy_version 1226657 (0.00087) [2022-07-11 14:15:31,654][25689] Fps is (10 sec: 5594.3, 60 sec: 5672.6, 300 sec: 5628.6). Total num frames: 1256099840. Throughput: 0: 5953.2. Samples: 1256102338. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:31,655][25689] Avg episode reward: [(0, '0.696')] [2022-07-11 14:15:32,971][26022] Updated weights on worker 0-0, policy_version 1226667 (0.00086) [2022-07-11 14:15:34,538][26022] Updated weights on worker 0-0, policy_version 1226677 (0.00086) [2022-07-11 14:15:36,217][26022] Updated weights on worker 0-0, policy_version 1226687 (0.00086) [2022-07-11 14:15:36,736][25689] Fps is (10 sec: 5857.5, 60 sec: 5709.9, 300 sec: 5637.6). Total num frames: 1256129536. Throughput: 0: 5959.3. Samples: 1256136726. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:36,736][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 14:15:38,217][26022] Updated weights on worker 0-0, policy_version 1226697 (0.00088) [2022-07-11 14:15:39,838][26022] Updated weights on worker 0-0, policy_version 1226707 (0.00080) [2022-07-11 14:15:41,777][25689] Fps is (10 sec: 5664.7, 60 sec: 5674.0, 300 sec: 5630.3). Total num frames: 1256157184. Throughput: 0: 5103.0. Samples: 1256153900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:41,777][25689] Avg episode reward: [(0, '1.273')] [2022-07-11 14:15:41,786][26022] Updated weights on worker 0-0, policy_version 1226717 (0.00085) [2022-07-11 14:15:43,575][26022] Updated weights on worker 0-0, policy_version 1226727 (0.00084) [2022-07-11 14:15:45,255][26022] Updated weights on worker 0-0, policy_version 1226737 (0.00086) [2022-07-11 14:15:46,792][25689] Fps is (10 sec: 5600.6, 60 sec: 5674.8, 300 sec: 5637.1). Total num frames: 1256185856. Throughput: 0: 5949.8. Samples: 1256188226. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:46,793][25689] Avg episode reward: [(0, '0.513')] [2022-07-11 14:15:47,332][26022] Updated weights on worker 0-0, policy_version 1226747 (0.00094) [2022-07-11 14:15:48,998][26022] Updated weights on worker 0-0, policy_version 1226757 (0.00097) [2022-07-11 14:15:50,846][26022] Updated weights on worker 0-0, policy_version 1226767 (0.00080) [2022-07-11 14:15:51,840][25689] Fps is (10 sec: 5698.4, 60 sec: 5673.2, 300 sec: 5634.2). Total num frames: 1256214528. Throughput: 0: 5934.6. Samples: 1256222136. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:51,841][25689] Avg episode reward: [(0, '0.457')] [2022-07-11 14:15:52,536][26022] Updated weights on worker 0-0, policy_version 1226777 (0.00082) [2022-07-11 14:15:54,383][26022] Updated weights on worker 0-0, policy_version 1226787 (0.00086) [2022-07-11 14:15:56,204][26022] Updated weights on worker 0-0, policy_version 1226797 (0.00080) [2022-07-11 14:15:56,936][25689] Fps is (10 sec: 5652.9, 60 sec: 5659.1, 300 sec: 5633.4). Total num frames: 1256243200. Throughput: 0: 5075.0. Samples: 1256239242. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:15:56,937][25689] Avg episode reward: [(0, '0.471')] [2022-07-11 14:15:58,111][26022] Updated weights on worker 0-0, policy_version 1226807 (0.00091) [2022-07-11 14:15:59,927][26022] Updated weights on worker 0-0, policy_version 1226817 (0.00085) [2022-07-11 14:16:01,600][26022] Updated weights on worker 0-0, policy_version 1226827 (0.00087) [2022-07-11 14:16:01,975][25689] Fps is (10 sec: 5658.2, 60 sec: 5673.7, 300 sec: 5643.1). Total num frames: 1256271872. Throughput: 0: 5904.5. Samples: 1256273158. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:01,975][25689] Avg episode reward: [(0, '0.636')] [2022-07-11 14:16:03,976][26022] Updated weights on worker 0-0, policy_version 1226837 (0.00097) [2022-07-11 14:16:05,660][26022] Updated weights on worker 0-0, policy_version 1226847 (0.00091) [2022-07-11 14:16:06,984][25689] Fps is (10 sec: 5503.5, 60 sec: 5657.3, 300 sec: 5633.3). Total num frames: 1256298496. Throughput: 0: 5787.7. Samples: 1256305088. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:06,984][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 14:16:07,453][26022] Updated weights on worker 0-0, policy_version 1226857 (0.00086) [2022-07-11 14:16:09,344][26022] Updated weights on worker 0-0, policy_version 1226867 (0.00107) [2022-07-11 14:16:11,059][26022] Updated weights on worker 0-0, policy_version 1226877 (0.00080) [2022-07-11 14:16:12,020][25689] Fps is (10 sec: 5504.7, 60 sec: 5677.9, 300 sec: 5633.9). Total num frames: 1256327168. Throughput: 0: 4946.3. Samples: 1256321952. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:12,021][25689] Avg episode reward: [(0, '-0.061')] [2022-07-11 14:16:13,035][26022] Updated weights on worker 0-0, policy_version 1226887 (0.00085) [2022-07-11 14:16:13,593][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:16:13,616][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001226890_1256335360.pth [2022-07-11 14:16:13,616][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001224910_1254307840.pth [2022-07-11 14:16:14,788][26022] Updated weights on worker 0-0, policy_version 1226897 (0.00097) [2022-07-11 14:16:16,702][26022] Updated weights on worker 0-0, policy_version 1226907 (0.00084) [2022-07-11 14:16:17,095][25689] Fps is (10 sec: 5570.0, 60 sec: 5646.4, 300 sec: 5637.4). Total num frames: 1256354816. Throughput: 0: 5779.2. Samples: 1256355742. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:17,095][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 14:16:18,326][26022] Updated weights on worker 0-0, policy_version 1226917 (0.00087) [2022-07-11 14:16:20,432][26022] Updated weights on worker 0-0, policy_version 1226927 (0.00085) [2022-07-11 14:16:22,047][26022] Updated weights on worker 0-0, policy_version 1226937 (0.01114) [2022-07-11 14:16:22,119][25689] Fps is (10 sec: 5576.9, 60 sec: 5616.3, 300 sec: 5633.5). Total num frames: 1256383488. Throughput: 0: 5789.8. Samples: 1256389788. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:22,119][25689] Avg episode reward: [(0, '0.808')] [2022-07-11 14:16:23,861][26022] Updated weights on worker 0-0, policy_version 1226947 (0.00088) [2022-07-11 14:16:25,715][26022] Updated weights on worker 0-0, policy_version 1226957 (0.00075) [2022-07-11 14:16:27,179][25689] Fps is (10 sec: 5686.6, 60 sec: 5645.5, 300 sec: 5632.8). Total num frames: 1256412160. Throughput: 0: 5028.2. Samples: 1256406634. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:27,179][25689] Avg episode reward: [(0, '0.462')] [2022-07-11 14:16:27,323][26022] Updated weights on worker 0-0, policy_version 1226967 (0.00086) [2022-07-11 14:16:29,461][26022] Updated weights on worker 0-0, policy_version 1226977 (0.00087) [2022-07-11 14:16:30,897][26022] Updated weights on worker 0-0, policy_version 1226987 (0.00081) [2022-07-11 14:16:32,195][25689] Fps is (10 sec: 5589.6, 60 sec: 5615.5, 300 sec: 5630.4). Total num frames: 1256439808. Throughput: 0: 5893.2. Samples: 1256440844. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:32,195][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 14:16:32,924][26022] Updated weights on worker 0-0, policy_version 1226997 (0.00086) [2022-07-11 14:16:34,640][26022] Updated weights on worker 0-0, policy_version 1227007 (0.00090) [2022-07-11 14:16:36,447][26022] Updated weights on worker 0-0, policy_version 1227017 (0.00092) [2022-07-11 14:16:37,235][25689] Fps is (10 sec: 5702.1, 60 sec: 5619.4, 300 sec: 5638.0). Total num frames: 1256469504. Throughput: 0: 5912.2. Samples: 1256474818. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:37,236][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 14:16:38,319][26022] Updated weights on worker 0-0, policy_version 1227027 (0.00092) [2022-07-11 14:16:40,105][26022] Updated weights on worker 0-0, policy_version 1227037 (0.00092) [2022-07-11 14:16:42,044][26022] Updated weights on worker 0-0, policy_version 1227047 (0.00086) [2022-07-11 14:16:42,249][25689] Fps is (10 sec: 5907.2, 60 sec: 5655.8, 300 sec: 5639.7). Total num frames: 1256499200. Throughput: 0: 5077.9. Samples: 1256492006. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2022-07-11 14:16:42,249][25689] Avg episode reward: [(0, '0.160')] [2022-07-11 14:16:43,731][26022] Updated weights on worker 0-0, policy_version 1227057 (0.00090) [2022-07-11 14:16:45,488][26022] Updated weights on worker 0-0, policy_version 1227067 (0.00088) [2022-07-11 14:16:47,253][25689] Fps is (10 sec: 5519.7, 60 sec: 5606.0, 300 sec: 5627.1). Total num frames: 1256524800. Throughput: 0: 5929.2. Samples: 1256525660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:16:47,254][25689] Avg episode reward: [(0, '0.553')] [2022-07-11 14:16:47,466][26022] Updated weights on worker 0-0, policy_version 1227077 (0.00087) [2022-07-11 14:16:49,189][26022] Updated weights on worker 0-0, policy_version 1227087 (0.00082) [2022-07-11 14:16:51,034][26022] Updated weights on worker 0-0, policy_version 1227097 (0.00091) [2022-07-11 14:16:52,266][25689] Fps is (10 sec: 5417.8, 60 sec: 5609.2, 300 sec: 5631.8). Total num frames: 1256553472. Throughput: 0: 5924.9. Samples: 1256559766. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:16:52,267][25689] Avg episode reward: [(0, '0.535')] [2022-07-11 14:16:52,746][26022] Updated weights on worker 0-0, policy_version 1227107 (0.00082) [2022-07-11 14:16:54,493][26022] Updated weights on worker 0-0, policy_version 1227117 (0.00085) [2022-07-11 14:16:56,290][26022] Updated weights on worker 0-0, policy_version 1227127 (0.00089) [2022-07-11 14:16:57,377][25689] Fps is (10 sec: 5866.9, 60 sec: 5641.8, 300 sec: 5641.6). Total num frames: 1256584192. Throughput: 0: 5065.2. Samples: 1256576836. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:16:57,377][25689] Avg episode reward: [(0, '0.676')] [2022-07-11 14:16:58,269][26022] Updated weights on worker 0-0, policy_version 1227137 (0.00083) [2022-07-11 14:16:59,875][26022] Updated weights on worker 0-0, policy_version 1227147 (0.00082) [2022-07-11 14:17:02,205][26022] Updated weights on worker 0-0, policy_version 1227157 (0.00086) [2022-07-11 14:17:02,380][25689] Fps is (10 sec: 5568.7, 60 sec: 5594.2, 300 sec: 5636.2). Total num frames: 1256609792. Throughput: 0: 5929.4. Samples: 1256611370. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:02,380][25689] Avg episode reward: [(0, '-0.154')] [2022-07-11 14:17:03,845][26022] Updated weights on worker 0-0, policy_version 1227167 (0.00084) [2022-07-11 14:17:05,694][26022] Updated weights on worker 0-0, policy_version 1227177 (0.00077) [2022-07-11 14:17:07,433][25689] Fps is (10 sec: 5396.7, 60 sec: 5624.0, 300 sec: 5635.5). Total num frames: 1256638464. Throughput: 0: 5838.9. Samples: 1256643488. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:07,434][25689] Avg episode reward: [(0, '-0.217')] [2022-07-11 14:17:07,627][26022] Updated weights on worker 0-0, policy_version 1227187 (0.00085) [2022-07-11 14:17:09,241][26022] Updated weights on worker 0-0, policy_version 1227197 (0.00087) [2022-07-11 14:17:11,380][26022] Updated weights on worker 0-0, policy_version 1227208 (0.00083) [2022-07-11 14:17:12,458][25689] Fps is (10 sec: 5689.9, 60 sec: 5625.1, 300 sec: 5639.9). Total num frames: 1256667136. Throughput: 0: 4996.4. Samples: 1256660652. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:12,458][25689] Avg episode reward: [(0, '0.622')] [2022-07-11 14:17:13,053][26022] Updated weights on worker 0-0, policy_version 1227218 (0.00122) [2022-07-11 14:17:14,892][26022] Updated weights on worker 0-0, policy_version 1227228 (0.00089) [2022-07-11 14:17:16,995][26022] Updated weights on worker 0-0, policy_version 1227238 (0.00086) [2022-07-11 14:17:17,531][25689] Fps is (10 sec: 5678.9, 60 sec: 5642.2, 300 sec: 5633.0). Total num frames: 1256695808. Throughput: 0: 5843.2. Samples: 1256694600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:17,531][25689] Avg episode reward: [(0, '-0.351')] [2022-07-11 14:17:18,528][26022] Updated weights on worker 0-0, policy_version 1227248 (0.00076) [2022-07-11 14:17:20,411][26022] Updated weights on worker 0-0, policy_version 1227258 (0.00092) [2022-07-11 14:17:22,423][26022] Updated weights on worker 0-0, policy_version 1227268 (0.00105) [2022-07-11 14:17:22,539][25689] Fps is (10 sec: 5485.3, 60 sec: 5609.8, 300 sec: 5630.6). Total num frames: 1256722432. Throughput: 0: 5783.4. Samples: 1256727956. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:22,539][25689] Avg episode reward: [(0, '0.059')] [2022-07-11 14:17:24,013][26022] Updated weights on worker 0-0, policy_version 1227278 (0.00106) [2022-07-11 14:17:26,042][26022] Updated weights on worker 0-0, policy_version 1227288 (0.00092) [2022-07-11 14:17:27,567][25689] Fps is (10 sec: 5611.8, 60 sec: 5629.7, 300 sec: 5634.3). Total num frames: 1256752128. Throughput: 0: 5026.7. Samples: 1256744694. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:27,568][25689] Avg episode reward: [(0, '0.551')] [2022-07-11 14:17:27,868][26022] Updated weights on worker 0-0, policy_version 1227298 (0.00091) [2022-07-11 14:17:29,537][26022] Updated weights on worker 0-0, policy_version 1227308 (0.00084) [2022-07-11 14:17:31,561][26022] Updated weights on worker 0-0, policy_version 1227318 (0.00083) [2022-07-11 14:17:32,579][25689] Fps is (10 sec: 5711.5, 60 sec: 5630.1, 300 sec: 5633.5). Total num frames: 1256779776. Throughput: 0: 5856.7. Samples: 1256778494. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:32,580][25689] Avg episode reward: [(0, '0.769')] [2022-07-11 14:17:33,043][26022] Updated weights on worker 0-0, policy_version 1227328 (0.00089) [2022-07-11 14:17:35,087][26022] Updated weights on worker 0-0, policy_version 1227338 (0.00087) [2022-07-11 14:17:36,625][26022] Updated weights on worker 0-0, policy_version 1227348 (0.00089) [2022-07-11 14:17:37,629][25689] Fps is (10 sec: 5495.6, 60 sec: 5595.3, 300 sec: 5630.6). Total num frames: 1256807424. Throughput: 0: 5867.9. Samples: 1256812532. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:37,630][25689] Avg episode reward: [(0, '-0.032')] [2022-07-11 14:17:38,592][26022] Updated weights on worker 0-0, policy_version 1227358 (0.00087) [2022-07-11 14:17:40,579][26022] Updated weights on worker 0-0, policy_version 1227368 (0.00086) [2022-07-11 14:17:42,278][26022] Updated weights on worker 0-0, policy_version 1227378 (0.00105) [2022-07-11 14:17:42,645][25689] Fps is (10 sec: 5594.7, 60 sec: 5578.1, 300 sec: 5631.1). Total num frames: 1256836096. Throughput: 0: 5059.2. Samples: 1256829680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:42,646][25689] Avg episode reward: [(0, '-0.093')] [2022-07-11 14:17:44,174][26022] Updated weights on worker 0-0, policy_version 1227388 (0.00083) [2022-07-11 14:17:46,122][26022] Updated weights on worker 0-0, policy_version 1227398 (0.00092) [2022-07-11 14:17:47,649][25689] Fps is (10 sec: 5722.9, 60 sec: 5629.0, 300 sec: 5627.9). Total num frames: 1256864768. Throughput: 0: 5901.3. Samples: 1256863204. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:47,649][25689] Avg episode reward: [(0, '0.322')] [2022-07-11 14:17:47,703][26022] Updated weights on worker 0-0, policy_version 1227408 (0.00084) [2022-07-11 14:17:49,779][26022] Updated weights on worker 0-0, policy_version 1227418 (0.00094) [2022-07-11 14:17:51,440][26022] Updated weights on worker 0-0, policy_version 1227428 (0.00078) [2022-07-11 14:17:52,659][25689] Fps is (10 sec: 5522.4, 60 sec: 5595.4, 300 sec: 5622.2). Total num frames: 1256891392. Throughput: 0: 5869.7. Samples: 1256896356. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:52,659][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 14:17:53,237][26022] Updated weights on worker 0-0, policy_version 1227438 (0.00090) [2022-07-11 14:17:55,281][26022] Updated weights on worker 0-0, policy_version 1227448 (0.00081) [2022-07-11 14:17:56,859][26022] Updated weights on worker 0-0, policy_version 1227458 (0.00079) [2022-07-11 14:17:57,732][25689] Fps is (10 sec: 5585.6, 60 sec: 5581.9, 300 sec: 5628.2). Total num frames: 1256921088. Throughput: 0: 5870.0. Samples: 1256930538. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:17:57,732][25689] Avg episode reward: [(0, '-0.128')] [2022-07-11 14:17:58,579][26022] Updated weights on worker 0-0, policy_version 1227468 (0.00080) [2022-07-11 14:18:00,605][26022] Updated weights on worker 0-0, policy_version 1227478 (0.00086) [2022-07-11 14:18:02,550][26022] Updated weights on worker 0-0, policy_version 1227488 (0.00087) [2022-07-11 14:18:02,756][25689] Fps is (10 sec: 5577.4, 60 sec: 5596.9, 300 sec: 5628.3). Total num frames: 1256947712. Throughput: 0: 5864.9. Samples: 1256947628. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:02,758][25689] Avg episode reward: [(0, '-1.237')] [2022-07-11 14:18:04,491][26022] Updated weights on worker 0-0, policy_version 1227498 (0.00085) [2022-07-11 14:18:06,489][26022] Updated weights on worker 0-0, policy_version 1227508 (0.00090) [2022-07-11 14:18:07,762][25689] Fps is (10 sec: 5614.8, 60 sec: 5618.2, 300 sec: 5635.5). Total num frames: 1256977408. Throughput: 0: 5801.9. Samples: 1256979902. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:07,763][25689] Avg episode reward: [(0, '0.152')] [2022-07-11 14:18:08,012][26022] Updated weights on worker 0-0, policy_version 1227518 (0.00088) [2022-07-11 14:18:10,003][26022] Updated weights on worker 0-0, policy_version 1227528 (0.00085) [2022-07-11 14:18:11,606][26022] Updated weights on worker 0-0, policy_version 1227538 (0.00075) [2022-07-11 14:18:12,779][25689] Fps is (10 sec: 5619.5, 60 sec: 5585.1, 300 sec: 5629.7). Total num frames: 1257004032. Throughput: 0: 5845.0. Samples: 1257013958. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:12,781][25689] Avg episode reward: [(0, '-0.658')] [2022-07-11 14:18:13,636][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:18:13,652][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001227548_1257009152.pth [2022-07-11 14:18:13,652][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001225569_1254982656.pth [2022-07-11 14:18:13,654][26022] Updated weights on worker 0-0, policy_version 1227548 (0.00079) [2022-07-11 14:18:15,274][26022] Updated weights on worker 0-0, policy_version 1227558 (0.00079) [2022-07-11 14:18:16,997][26022] Updated weights on worker 0-0, policy_version 1227568 (0.00098) [2022-07-11 14:18:17,836][25689] Fps is (10 sec: 5591.1, 60 sec: 5603.5, 300 sec: 5632.2). Total num frames: 1257033728. Throughput: 0: 5010.3. Samples: 1257031264. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:17,836][25689] Avg episode reward: [(0, '-0.816')] [2022-07-11 14:18:18,861][26022] Updated weights on worker 0-0, policy_version 1227578 (0.00087) [2022-07-11 14:18:20,583][26022] Updated weights on worker 0-0, policy_version 1227588 (0.00086) [2022-07-11 14:18:22,380][26022] Updated weights on worker 0-0, policy_version 1227598 (0.00092) [2022-07-11 14:18:22,892][25689] Fps is (10 sec: 5872.7, 60 sec: 5649.9, 300 sec: 5642.5). Total num frames: 1257063424. Throughput: 0: 5873.8. Samples: 1257065900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:22,892][25689] Avg episode reward: [(0, '-0.850')] [2022-07-11 14:18:24,252][26022] Updated weights on worker 0-0, policy_version 1227608 (0.00086) [2022-07-11 14:18:26,122][26022] Updated weights on worker 0-0, policy_version 1227618 (0.00083) [2022-07-11 14:18:27,953][25689] Fps is (10 sec: 5566.6, 60 sec: 5596.0, 300 sec: 5635.1). Total num frames: 1257090048. Throughput: 0: 5942.6. Samples: 1257099888. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:27,954][25689] Avg episode reward: [(0, '-0.963')] [2022-07-11 14:18:27,990][26022] Updated weights on worker 0-0, policy_version 1227628 (0.00063) [2022-07-11 14:18:29,596][26022] Updated weights on worker 0-0, policy_version 1227638 (0.00091) [2022-07-11 14:18:31,488][26022] Updated weights on worker 0-0, policy_version 1227648 (0.00087) [2022-07-11 14:18:32,987][25689] Fps is (10 sec: 5680.2, 60 sec: 5644.8, 300 sec: 5642.1). Total num frames: 1257120768. Throughput: 0: 5095.5. Samples: 1257116932. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:32,988][25689] Avg episode reward: [(0, '-0.314')] [2022-07-11 14:18:33,327][26022] Updated weights on worker 0-0, policy_version 1227658 (0.00084) [2022-07-11 14:18:35,129][26022] Updated weights on worker 0-0, policy_version 1227668 (0.00095) [2022-07-11 14:18:36,866][26022] Updated weights on worker 0-0, policy_version 1227678 (0.00087) [2022-07-11 14:18:38,111][25689] Fps is (10 sec: 5846.6, 60 sec: 5654.8, 300 sec: 5646.7). Total num frames: 1257149440. Throughput: 0: 5920.6. Samples: 1257151310. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:38,112][25689] Avg episode reward: [(0, '-0.426')] [2022-07-11 14:18:38,758][26022] Updated weights on worker 0-0, policy_version 1227688 (0.00084) [2022-07-11 14:18:40,364][26022] Updated weights on worker 0-0, policy_version 1227698 (0.00080) [2022-07-11 14:18:42,320][26022] Updated weights on worker 0-0, policy_version 1227708 (0.00088) [2022-07-11 14:18:43,116][25689] Fps is (10 sec: 5661.4, 60 sec: 5655.9, 300 sec: 5643.2). Total num frames: 1257178112. Throughput: 0: 5931.2. Samples: 1257185856. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:43,117][25689] Avg episode reward: [(0, '0.666')] [2022-07-11 14:18:44,064][26022] Updated weights on worker 0-0, policy_version 1227718 (0.00917) [2022-07-11 14:18:45,864][26022] Updated weights on worker 0-0, policy_version 1227728 (0.00080) [2022-07-11 14:18:47,725][26022] Updated weights on worker 0-0, policy_version 1227738 (0.00089) [2022-07-11 14:18:48,147][25689] Fps is (10 sec: 5714.2, 60 sec: 5653.3, 300 sec: 5640.5). Total num frames: 1257206784. Throughput: 0: 5097.7. Samples: 1257202828. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:48,147][25689] Avg episode reward: [(0, '0.654')] [2022-07-11 14:18:49,492][26022] Updated weights on worker 0-0, policy_version 1227748 (0.00079) [2022-07-11 14:18:51,326][26022] Updated weights on worker 0-0, policy_version 1227758 (0.00082) [2022-07-11 14:18:53,159][25689] Fps is (10 sec: 5506.0, 60 sec: 5653.1, 300 sec: 5638.0). Total num frames: 1257233408. Throughput: 0: 5937.4. Samples: 1257236700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:53,160][25689] Avg episode reward: [(0, '1.345')] [2022-07-11 14:18:53,183][26022] Updated weights on worker 0-0, policy_version 1227768 (0.00087) [2022-07-11 14:18:54,799][26022] Updated weights on worker 0-0, policy_version 1227778 (0.00084) [2022-07-11 14:18:56,796][26022] Updated weights on worker 0-0, policy_version 1227788 (0.00086) [2022-07-11 14:18:58,267][25689] Fps is (10 sec: 5565.1, 60 sec: 5649.9, 300 sec: 5643.1). Total num frames: 1257263104. Throughput: 0: 5943.2. Samples: 1257271098. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:18:58,267][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 14:18:58,428][26022] Updated weights on worker 0-0, policy_version 1227798 (0.00092) [2022-07-11 14:19:00,277][26022] Updated weights on worker 0-0, policy_version 1227808 (0.00087) [2022-07-11 14:19:02,043][26022] Updated weights on worker 0-0, policy_version 1227818 (0.00080) [2022-07-11 14:19:03,315][25689] Fps is (10 sec: 5545.5, 60 sec: 5647.7, 300 sec: 5639.1). Total num frames: 1257289728. Throughput: 0: 5074.9. Samples: 1257288364. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:03,316][25689] Avg episode reward: [(0, '0.351')] [2022-07-11 14:19:04,093][26022] Updated weights on worker 0-0, policy_version 1227828 (0.00098) [2022-07-11 14:19:06,089][26022] Updated weights on worker 0-0, policy_version 1227838 (0.00089) [2022-07-11 14:19:07,634][26022] Updated weights on worker 0-0, policy_version 1227848 (0.00088) [2022-07-11 14:19:08,326][25689] Fps is (10 sec: 5598.9, 60 sec: 5647.2, 300 sec: 5642.4). Total num frames: 1257319424. Throughput: 0: 5831.2. Samples: 1257320498. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:08,327][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 14:19:09,549][26022] Updated weights on worker 0-0, policy_version 1227858 (0.00106) [2022-07-11 14:19:11,306][26022] Updated weights on worker 0-0, policy_version 1227868 (0.00082) [2022-07-11 14:19:13,211][26022] Updated weights on worker 0-0, policy_version 1227878 (0.00085) [2022-07-11 14:19:13,337][25689] Fps is (10 sec: 5823.8, 60 sec: 5681.5, 300 sec: 5643.2). Total num frames: 1257348096. Throughput: 0: 5843.2. Samples: 1257354606. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:13,338][25689] Avg episode reward: [(0, '0.318')] [2022-07-11 14:19:15,326][26022] Updated weights on worker 0-0, policy_version 1227888 (0.00080) [2022-07-11 14:19:16,918][26022] Updated weights on worker 0-0, policy_version 1227898 (0.00086) [2022-07-11 14:19:18,455][25689] Fps is (10 sec: 5560.3, 60 sec: 5642.0, 300 sec: 5641.2). Total num frames: 1257375744. Throughput: 0: 4961.6. Samples: 1257371262. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:18,455][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 14:19:18,844][26022] Updated weights on worker 0-0, policy_version 1227908 (0.00091) [2022-07-11 14:19:20,679][26022] Updated weights on worker 0-0, policy_version 1227918 (0.00089) [2022-07-11 14:19:22,488][26022] Updated weights on worker 0-0, policy_version 1227928 (0.00079) [2022-07-11 14:19:23,475][25689] Fps is (10 sec: 5555.7, 60 sec: 5628.5, 300 sec: 5637.9). Total num frames: 1257404416. Throughput: 0: 5796.1. Samples: 1257405212. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:23,475][25689] Avg episode reward: [(0, '0.421')] [2022-07-11 14:19:24,171][26022] Updated weights on worker 0-0, policy_version 1227938 (0.00081) [2022-07-11 14:19:26,019][26022] Updated weights on worker 0-0, policy_version 1227948 (0.00097) [2022-07-11 14:19:27,644][26022] Updated weights on worker 0-0, policy_version 1227958 (0.00086) [2022-07-11 14:19:28,480][25689] Fps is (10 sec: 5720.3, 60 sec: 5667.6, 300 sec: 5638.4). Total num frames: 1257433088. Throughput: 0: 5898.7. Samples: 1257439378. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:28,480][25689] Avg episode reward: [(0, '0.880')] [2022-07-11 14:19:29,718][26022] Updated weights on worker 0-0, policy_version 1227968 (0.00086) [2022-07-11 14:19:31,180][26022] Updated weights on worker 0-0, policy_version 1227978 (0.00092) [2022-07-11 14:19:33,297][26022] Updated weights on worker 0-0, policy_version 1227988 (0.00083) [2022-07-11 14:19:33,498][25689] Fps is (10 sec: 5619.2, 60 sec: 5618.4, 300 sec: 5640.4). Total num frames: 1257460736. Throughput: 0: 5049.2. Samples: 1257456398. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:33,498][25689] Avg episode reward: [(0, '0.691')] [2022-07-11 14:19:34,804][26022] Updated weights on worker 0-0, policy_version 1227998 (0.00086) [2022-07-11 14:19:36,893][26022] Updated weights on worker 0-0, policy_version 1228008 (0.00085) [2022-07-11 14:19:38,303][26022] Updated weights on worker 0-0, policy_version 1228018 (0.00087) [2022-07-11 14:19:38,556][25689] Fps is (10 sec: 5691.2, 60 sec: 5641.4, 300 sec: 5639.6). Total num frames: 1257490432. Throughput: 0: 5931.4. Samples: 1257490486. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:38,556][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 14:19:40,519][26022] Updated weights on worker 0-0, policy_version 1228028 (0.00094) [2022-07-11 14:19:42,043][26022] Updated weights on worker 0-0, policy_version 1228038 (0.00096) [2022-07-11 14:19:43,578][25689] Fps is (10 sec: 5688.4, 60 sec: 5622.8, 300 sec: 5636.2). Total num frames: 1257518080. Throughput: 0: 5957.5. Samples: 1257524980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:43,579][25689] Avg episode reward: [(0, '0.606')] [2022-07-11 14:19:43,982][26022] Updated weights on worker 0-0, policy_version 1228048 (0.00088) [2022-07-11 14:19:45,576][26022] Updated weights on worker 0-0, policy_version 1228058 (0.00093) [2022-07-11 14:19:47,488][26022] Updated weights on worker 0-0, policy_version 1228068 (0.00076) [2022-07-11 14:19:48,597][25689] Fps is (10 sec: 5608.9, 60 sec: 5624.0, 300 sec: 5636.4). Total num frames: 1257546752. Throughput: 0: 5105.2. Samples: 1257542080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:48,597][25689] Avg episode reward: [(0, '1.028')] [2022-07-11 14:19:49,452][26022] Updated weights on worker 0-0, policy_version 1228078 (0.00091) [2022-07-11 14:19:51,175][26022] Updated weights on worker 0-0, policy_version 1228088 (0.00086) [2022-07-11 14:19:52,955][26022] Updated weights on worker 0-0, policy_version 1228098 (0.00096) [2022-07-11 14:19:53,620][25689] Fps is (10 sec: 5710.9, 60 sec: 5656.9, 300 sec: 5634.9). Total num frames: 1257575424. Throughput: 0: 5953.9. Samples: 1257576202. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:53,620][25689] Avg episode reward: [(0, '1.454')] [2022-07-11 14:19:54,932][26022] Updated weights on worker 0-0, policy_version 1228108 (0.00082) [2022-07-11 14:19:56,415][26022] Updated weights on worker 0-0, policy_version 1228118 (0.00086) [2022-07-11 14:19:58,494][26022] Updated weights on worker 0-0, policy_version 1228128 (0.00084) [2022-07-11 14:19:58,720][25689] Fps is (10 sec: 5765.8, 60 sec: 5657.6, 300 sec: 5640.2). Total num frames: 1257605120. Throughput: 0: 5947.2. Samples: 1257610406. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:19:58,722][25689] Avg episode reward: [(0, '1.602')] [2022-07-11 14:19:59,987][26022] Updated weights on worker 0-0, policy_version 1228138 (0.00083) [2022-07-11 14:20:01,976][26022] Updated weights on worker 0-0, policy_version 1228148 (0.00087) [2022-07-11 14:20:03,812][25689] Fps is (10 sec: 5525.5, 60 sec: 5653.4, 300 sec: 5635.2). Total num frames: 1257631744. Throughput: 0: 5069.6. Samples: 1257627558. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:03,813][25689] Avg episode reward: [(0, '1.643')] [2022-07-11 14:20:04,093][26022] Updated weights on worker 0-0, policy_version 1228158 (0.00081) [2022-07-11 14:20:05,852][26022] Updated weights on worker 0-0, policy_version 1228168 (0.00079) [2022-07-11 14:20:07,884][26022] Updated weights on worker 0-0, policy_version 1228178 (0.00081) [2022-07-11 14:20:08,824][25689] Fps is (10 sec: 5472.7, 60 sec: 5636.4, 300 sec: 5639.9). Total num frames: 1257660416. Throughput: 0: 5806.4. Samples: 1257659526. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:08,825][25689] Avg episode reward: [(0, '1.667')] [2022-07-11 14:20:09,320][26022] Updated weights on worker 0-0, policy_version 1228188 (0.00084) [2022-07-11 14:20:11,488][26022] Updated weights on worker 0-0, policy_version 1228198 (0.00086) [2022-07-11 14:20:12,967][26022] Updated weights on worker 0-0, policy_version 1228208 (0.00085) [2022-07-11 14:20:13,716][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:20:13,727][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001228211_1257688064.pth [2022-07-11 14:20:13,728][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001226227_1255656448.pth [2022-07-11 14:20:13,829][25689] Fps is (10 sec: 5622.8, 60 sec: 5620.1, 300 sec: 5634.8). Total num frames: 1257688064. Throughput: 0: 5827.1. Samples: 1257693962. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:13,829][25689] Avg episode reward: [(0, '1.679')] [2022-07-11 14:20:14,957][26022] Updated weights on worker 0-0, policy_version 1228218 (0.00083) [2022-07-11 14:20:16,748][26022] Updated weights on worker 0-0, policy_version 1228228 (0.00092) [2022-07-11 14:20:18,358][26022] Updated weights on worker 0-0, policy_version 1228238 (0.00083) [2022-07-11 14:20:18,892][25689] Fps is (10 sec: 5797.1, 60 sec: 5676.0, 300 sec: 5634.8). Total num frames: 1257718784. Throughput: 0: 4989.1. Samples: 1257711048. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:18,893][25689] Avg episode reward: [(0, '1.488')] [2022-07-11 14:20:20,353][26022] Updated weights on worker 0-0, policy_version 1228248 (0.00086) [2022-07-11 14:20:22,065][26022] Updated weights on worker 0-0, policy_version 1228258 (0.00952) [2022-07-11 14:20:23,914][25689] Fps is (10 sec: 5685.9, 60 sec: 5641.9, 300 sec: 5634.6). Total num frames: 1257745408. Throughput: 0: 5844.7. Samples: 1257745046. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:23,914][25689] Avg episode reward: [(0, '1.173')] [2022-07-11 14:20:23,958][26022] Updated weights on worker 0-0, policy_version 1228268 (0.00088) [2022-07-11 14:20:25,591][26022] Updated weights on worker 0-0, policy_version 1228278 (0.00082) [2022-07-11 14:20:27,491][26022] Updated weights on worker 0-0, policy_version 1228288 (0.00089) [2022-07-11 14:20:28,944][25689] Fps is (10 sec: 5501.1, 60 sec: 5639.6, 300 sec: 5631.7). Total num frames: 1257774080. Throughput: 0: 5941.5. Samples: 1257779070. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:28,951][25689] Avg episode reward: [(0, '0.502')] [2022-07-11 14:20:29,381][26022] Updated weights on worker 0-0, policy_version 1228298 (0.00092) [2022-07-11 14:20:31,126][26022] Updated weights on worker 0-0, policy_version 1228308 (0.00087) [2022-07-11 14:20:33,111][26022] Updated weights on worker 0-0, policy_version 1228318 (0.00081) [2022-07-11 14:20:33,993][25689] Fps is (10 sec: 5689.5, 60 sec: 5653.6, 300 sec: 5628.9). Total num frames: 1257802752. Throughput: 0: 5050.4. Samples: 1257795798. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:33,993][25689] Avg episode reward: [(0, '0.387')] [2022-07-11 14:20:34,887][26022] Updated weights on worker 0-0, policy_version 1228328 (0.00087) [2022-07-11 14:20:36,619][26022] Updated weights on worker 0-0, policy_version 1228338 (0.00093) [2022-07-11 14:20:38,573][26022] Updated weights on worker 0-0, policy_version 1228348 (0.00081) [2022-07-11 14:20:39,118][25689] Fps is (10 sec: 5535.8, 60 sec: 5613.5, 300 sec: 5627.3). Total num frames: 1257830400. Throughput: 0: 5865.3. Samples: 1257829676. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2022-07-11 14:20:39,118][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 14:20:40,083][26022] Updated weights on worker 0-0, policy_version 1228358 (0.00084) [2022-07-11 14:20:42,264][26022] Updated weights on worker 0-0, policy_version 1228368 (0.00081) [2022-07-11 14:20:43,863][26022] Updated weights on worker 0-0, policy_version 1228378 (0.00079) [2022-07-11 14:20:44,122][25689] Fps is (10 sec: 5661.3, 60 sec: 5649.1, 300 sec: 5630.9). Total num frames: 1257860096. Throughput: 0: 5864.8. Samples: 1257863562. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:20:44,122][25689] Avg episode reward: [(0, '-0.241')] [2022-07-11 14:20:45,713][26022] Updated weights on worker 0-0, policy_version 1228388 (0.00087) [2022-07-11 14:20:47,558][26022] Updated weights on worker 0-0, policy_version 1228398 (0.00079) [2022-07-11 14:20:49,135][25689] Fps is (10 sec: 5724.6, 60 sec: 5632.7, 300 sec: 5628.1). Total num frames: 1257887744. Throughput: 0: 5851.1. Samples: 1257897208. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:20:49,135][25689] Avg episode reward: [(0, '-0.075')] [2022-07-11 14:20:49,365][26022] Updated weights on worker 0-0, policy_version 1228408 (0.00088) [2022-07-11 14:20:51,287][26022] Updated weights on worker 0-0, policy_version 1228418 (0.00080) [2022-07-11 14:20:53,036][26022] Updated weights on worker 0-0, policy_version 1228428 (0.00083) [2022-07-11 14:20:54,161][25689] Fps is (10 sec: 5406.1, 60 sec: 5598.5, 300 sec: 5622.6). Total num frames: 1257914368. Throughput: 0: 5857.9. Samples: 1257913942. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:20:54,162][25689] Avg episode reward: [(0, '0.028')] [2022-07-11 14:20:55,083][26022] Updated weights on worker 0-0, policy_version 1228438 (0.00086) [2022-07-11 14:20:56,863][26022] Updated weights on worker 0-0, policy_version 1228448 (0.00086) [2022-07-11 14:20:58,703][26022] Updated weights on worker 0-0, policy_version 1228458 (0.00087) [2022-07-11 14:20:59,258][25689] Fps is (10 sec: 5563.6, 60 sec: 5598.9, 300 sec: 5624.9). Total num frames: 1257944064. Throughput: 0: 5836.6. Samples: 1257947226. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:20:59,258][25689] Avg episode reward: [(0, '0.107')] [2022-07-11 14:21:00,553][26022] Updated weights on worker 0-0, policy_version 1228468 (0.00086) [2022-07-11 14:21:02,791][26022] Updated weights on worker 0-0, policy_version 1228478 (0.00091) [2022-07-11 14:21:04,273][25689] Fps is (10 sec: 5468.1, 60 sec: 5589.0, 300 sec: 5621.3). Total num frames: 1257969664. Throughput: 0: 5733.7. Samples: 1257979104. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:04,274][25689] Avg episode reward: [(0, '0.219')] [2022-07-11 14:21:04,459][26022] Updated weights on worker 0-0, policy_version 1228488 (0.00096) [2022-07-11 14:21:06,524][26022] Updated weights on worker 0-0, policy_version 1228498 (0.00090) [2022-07-11 14:21:08,129][26022] Updated weights on worker 0-0, policy_version 1228508 (0.00082) [2022-07-11 14:21:09,291][25689] Fps is (10 sec: 5409.4, 60 sec: 5588.5, 300 sec: 5621.7). Total num frames: 1257998336. Throughput: 0: 4892.0. Samples: 1257995810. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:09,291][25689] Avg episode reward: [(0, '0.523')] [2022-07-11 14:21:10,064][26022] Updated weights on worker 0-0, policy_version 1228518 (0.00090) [2022-07-11 14:21:11,895][26022] Updated weights on worker 0-0, policy_version 1228528 (0.00090) [2022-07-11 14:21:13,831][26022] Updated weights on worker 0-0, policy_version 1228538 (0.00086) [2022-07-11 14:21:14,313][25689] Fps is (10 sec: 5405.6, 60 sec: 5553.0, 300 sec: 5615.8). Total num frames: 1258023936. Throughput: 0: 5714.2. Samples: 1258029096. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:14,314][25689] Avg episode reward: [(0, '-0.030')] [2022-07-11 14:21:15,494][26022] Updated weights on worker 0-0, policy_version 1228548 (0.00095) [2022-07-11 14:21:17,570][26022] Updated weights on worker 0-0, policy_version 1228558 (0.00093) [2022-07-11 14:21:19,008][26022] Updated weights on worker 0-0, policy_version 1228568 (0.00092) [2022-07-11 14:21:19,398][25689] Fps is (10 sec: 5572.1, 60 sec: 5551.1, 300 sec: 5621.5). Total num frames: 1258054656. Throughput: 0: 5733.3. Samples: 1258062696. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:19,399][25689] Avg episode reward: [(0, '0.169')] [2022-07-11 14:21:21,071][26022] Updated weights on worker 0-0, policy_version 1228578 (0.00095) [2022-07-11 14:21:22,760][26022] Updated weights on worker 0-0, policy_version 1228588 (0.00090) [2022-07-11 14:21:24,436][25689] Fps is (10 sec: 5867.2, 60 sec: 5583.4, 300 sec: 5622.0). Total num frames: 1258083328. Throughput: 0: 4985.9. Samples: 1258079634. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:24,437][25689] Avg episode reward: [(0, '0.218')] [2022-07-11 14:21:24,597][26022] Updated weights on worker 0-0, policy_version 1228598 (0.00081) [2022-07-11 14:21:26,414][26022] Updated weights on worker 0-0, policy_version 1228608 (0.00090) [2022-07-11 14:21:28,382][26022] Updated weights on worker 0-0, policy_version 1228618 (0.00085) [2022-07-11 14:21:29,444][25689] Fps is (10 sec: 5606.3, 60 sec: 5568.5, 300 sec: 5622.1). Total num frames: 1258110976. Throughput: 0: 5832.0. Samples: 1258113344. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:29,444][25689] Avg episode reward: [(0, '-0.169')] [2022-07-11 14:21:30,170][26022] Updated weights on worker 0-0, policy_version 1228628 (0.00083) [2022-07-11 14:21:32,123][26022] Updated weights on worker 0-0, policy_version 1228638 (0.00082) [2022-07-11 14:21:33,731][26022] Updated weights on worker 0-0, policy_version 1228648 (0.00087) [2022-07-11 14:21:34,469][25689] Fps is (10 sec: 5613.4, 60 sec: 5570.7, 300 sec: 5619.0). Total num frames: 1258139648. Throughput: 0: 5868.8. Samples: 1258147386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:34,470][25689] Avg episode reward: [(0, '-0.729')] [2022-07-11 14:21:35,610][26022] Updated weights on worker 0-0, policy_version 1228658 (0.00087) [2022-07-11 14:21:37,311][26022] Updated weights on worker 0-0, policy_version 1228668 (0.00086) [2022-07-11 14:21:39,220][26022] Updated weights on worker 0-0, policy_version 1228678 (0.00084) [2022-07-11 14:21:39,509][25689] Fps is (10 sec: 5595.7, 60 sec: 5578.6, 300 sec: 5611.6). Total num frames: 1258167296. Throughput: 0: 5048.6. Samples: 1258164226. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:39,509][25689] Avg episode reward: [(0, '-0.911')] [2022-07-11 14:21:40,968][26022] Updated weights on worker 0-0, policy_version 1228688 (0.00084) [2022-07-11 14:21:42,759][26022] Updated weights on worker 0-0, policy_version 1228698 (0.00089) [2022-07-11 14:21:44,530][25689] Fps is (10 sec: 5598.1, 60 sec: 5560.1, 300 sec: 5621.6). Total num frames: 1258195968. Throughput: 0: 5903.1. Samples: 1258198250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:44,531][25689] Avg episode reward: [(0, '-0.137')] [2022-07-11 14:21:44,629][26022] Updated weights on worker 0-0, policy_version 1228708 (0.00086) [2022-07-11 14:21:46,479][26022] Updated weights on worker 0-0, policy_version 1228718 (0.00091) [2022-07-11 14:21:48,385][26022] Updated weights on worker 0-0, policy_version 1228728 (0.00090) [2022-07-11 14:21:49,555][25689] Fps is (10 sec: 5606.2, 60 sec: 5559.0, 300 sec: 5617.9). Total num frames: 1258223616. Throughput: 0: 5897.0. Samples: 1258231938. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:49,556][25689] Avg episode reward: [(0, '-0.247')] [2022-07-11 14:21:50,066][26022] Updated weights on worker 0-0, policy_version 1228738 (0.00085) [2022-07-11 14:21:52,097][26022] Updated weights on worker 0-0, policy_version 1228748 (0.00623) [2022-07-11 14:21:53,722][26022] Updated weights on worker 0-0, policy_version 1228758 (0.00091) [2022-07-11 14:21:54,570][25689] Fps is (10 sec: 5609.2, 60 sec: 5593.8, 300 sec: 5612.8). Total num frames: 1258252288. Throughput: 0: 5054.3. Samples: 1258248984. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:54,571][25689] Avg episode reward: [(0, '-0.241')] [2022-07-11 14:21:55,528][26022] Updated weights on worker 0-0, policy_version 1228768 (0.00091) [2022-07-11 14:21:57,344][26022] Updated weights on worker 0-0, policy_version 1228778 (0.00100) [2022-07-11 14:21:59,208][26022] Updated weights on worker 0-0, policy_version 1228788 (0.00100) [2022-07-11 14:21:59,619][25689] Fps is (10 sec: 5596.3, 60 sec: 5564.4, 300 sec: 5618.9). Total num frames: 1258279936. Throughput: 0: 5893.6. Samples: 1258282746. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:21:59,619][25689] Avg episode reward: [(0, '0.585')] [2022-07-11 14:22:01,058][26022] Updated weights on worker 0-0, policy_version 1228798 (0.00085) [2022-07-11 14:22:03,276][26022] Updated weights on worker 0-0, policy_version 1228808 (0.00089) [2022-07-11 14:22:04,663][25689] Fps is (10 sec: 5377.6, 60 sec: 5578.7, 300 sec: 5612.1). Total num frames: 1258306560. Throughput: 0: 5765.3. Samples: 1258314322. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:04,663][25689] Avg episode reward: [(0, '1.267')] [2022-07-11 14:22:04,947][26022] Updated weights on worker 0-0, policy_version 1228818 (0.00087) [2022-07-11 14:22:06,956][26022] Updated weights on worker 0-0, policy_version 1228828 (0.00085) [2022-07-11 14:22:08,738][26022] Updated weights on worker 0-0, policy_version 1228838 (0.00086) [2022-07-11 14:22:09,664][25689] Fps is (10 sec: 5402.7, 60 sec: 5563.3, 300 sec: 5609.1). Total num frames: 1258334208. Throughput: 0: 4929.9. Samples: 1258331074. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:09,664][25689] Avg episode reward: [(0, '1.477')] [2022-07-11 14:22:10,539][26022] Updated weights on worker 0-0, policy_version 1228848 (0.00085) [2022-07-11 14:22:12,565][26022] Updated weights on worker 0-0, policy_version 1228858 (0.00087) [2022-07-11 14:22:13,797][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:22:13,805][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001228865_1258357760.pth [2022-07-11 14:22:13,806][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001226890_1256335360.pth [2022-07-11 14:22:14,179][26022] Updated weights on worker 0-0, policy_version 1228868 (0.00090) [2022-07-11 14:22:14,667][25689] Fps is (10 sec: 5629.8, 60 sec: 5616.0, 300 sec: 5610.5). Total num frames: 1258362880. Throughput: 0: 5755.3. Samples: 1258364644. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:14,667][25689] Avg episode reward: [(0, '1.427')] [2022-07-11 14:22:16,292][26022] Updated weights on worker 0-0, policy_version 1228878 (0.00086) [2022-07-11 14:22:17,936][26022] Updated weights on worker 0-0, policy_version 1228888 (0.00082) [2022-07-11 14:22:19,743][25689] Fps is (10 sec: 5689.4, 60 sec: 5582.8, 300 sec: 5616.1). Total num frames: 1258391552. Throughput: 0: 5747.7. Samples: 1258398414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:19,744][25689] Avg episode reward: [(0, '1.526')] [2022-07-11 14:22:19,752][26022] Updated weights on worker 0-0, policy_version 1228898 (0.00091) [2022-07-11 14:22:21,659][26022] Updated weights on worker 0-0, policy_version 1228908 (0.00089) [2022-07-11 14:22:23,191][26022] Updated weights on worker 0-0, policy_version 1228918 (0.00096) [2022-07-11 14:22:24,782][25689] Fps is (10 sec: 5567.5, 60 sec: 5565.7, 300 sec: 5609.0). Total num frames: 1258419200. Throughput: 0: 5018.1. Samples: 1258415284. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:24,783][25689] Avg episode reward: [(0, '1.365')] [2022-07-11 14:22:25,320][26022] Updated weights on worker 0-0, policy_version 1228928 (0.00085) [2022-07-11 14:22:26,830][26022] Updated weights on worker 0-0, policy_version 1228938 (0.00084) [2022-07-11 14:22:28,976][26022] Updated weights on worker 0-0, policy_version 1228948 (0.00079) [2022-07-11 14:22:29,799][25689] Fps is (10 sec: 5702.2, 60 sec: 5598.8, 300 sec: 5615.8). Total num frames: 1258448896. Throughput: 0: 5854.6. Samples: 1258448960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:29,800][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 14:22:30,678][26022] Updated weights on worker 0-0, policy_version 1228958 (0.00083) [2022-07-11 14:22:32,502][26022] Updated weights on worker 0-0, policy_version 1228968 (0.00083) [2022-07-11 14:22:34,368][26022] Updated weights on worker 0-0, policy_version 1228978 (0.00091) [2022-07-11 14:22:34,846][25689] Fps is (10 sec: 5494.6, 60 sec: 5546.0, 300 sec: 5608.9). Total num frames: 1258474496. Throughput: 0: 5837.1. Samples: 1258482434. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:34,847][25689] Avg episode reward: [(0, '1.274')] [2022-07-11 14:22:36,061][26022] Updated weights on worker 0-0, policy_version 1228988 (0.00085) [2022-07-11 14:22:38,096][26022] Updated weights on worker 0-0, policy_version 1228998 (0.00090) [2022-07-11 14:22:39,926][25689] Fps is (10 sec: 5460.4, 60 sec: 5576.1, 300 sec: 5611.2). Total num frames: 1258504192. Throughput: 0: 4985.9. Samples: 1258499046. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:39,927][25689] Avg episode reward: [(0, '1.125')] [2022-07-11 14:22:39,933][26022] Updated weights on worker 0-0, policy_version 1229008 (0.00080) [2022-07-11 14:22:41,650][26022] Updated weights on worker 0-0, policy_version 1229018 (0.00089) [2022-07-11 14:22:43,574][26022] Updated weights on worker 0-0, policy_version 1229028 (0.00085) [2022-07-11 14:22:45,015][25689] Fps is (10 sec: 5740.0, 60 sec: 5569.9, 300 sec: 5609.6). Total num frames: 1258532864. Throughput: 0: 5814.8. Samples: 1258532932. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:45,016][25689] Avg episode reward: [(0, '0.295')] [2022-07-11 14:22:45,471][26022] Updated weights on worker 0-0, policy_version 1229038 (0.00091) [2022-07-11 14:22:47,360][26022] Updated weights on worker 0-0, policy_version 1229048 (0.00091) [2022-07-11 14:22:49,162][26022] Updated weights on worker 0-0, policy_version 1229058 (0.00082) [2022-07-11 14:22:50,043][25689] Fps is (10 sec: 5567.2, 60 sec: 5569.6, 300 sec: 5612.7). Total num frames: 1258560512. Throughput: 0: 5784.2. Samples: 1258566050. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:50,043][25689] Avg episode reward: [(0, '0.449')] [2022-07-11 14:22:50,848][26022] Updated weights on worker 0-0, policy_version 1229068 (0.00081) [2022-07-11 14:22:52,659][26022] Updated weights on worker 0-0, policy_version 1229078 (0.00096) [2022-07-11 14:22:54,637][26022] Updated weights on worker 0-0, policy_version 1229088 (0.00085) [2022-07-11 14:22:55,049][25689] Fps is (10 sec: 5510.7, 60 sec: 5553.6, 300 sec: 5607.0). Total num frames: 1258588160. Throughput: 0: 4972.3. Samples: 1258582890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:22:55,050][25689] Avg episode reward: [(0, '0.478')] [2022-07-11 14:22:56,373][26022] Updated weights on worker 0-0, policy_version 1229098 (0.00081) [2022-07-11 14:22:58,396][26022] Updated weights on worker 0-0, policy_version 1229108 (0.00086) [2022-07-11 14:23:00,004][26022] Updated weights on worker 0-0, policy_version 1229118 (0.00091) [2022-07-11 14:23:00,153][25689] Fps is (10 sec: 5570.7, 60 sec: 5565.4, 300 sec: 5612.4). Total num frames: 1258616832. Throughput: 0: 5802.8. Samples: 1258616416. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:00,153][25689] Avg episode reward: [(0, '0.444')] [2022-07-11 14:23:02,331][26022] Updated weights on worker 0-0, policy_version 1229128 (0.00092) [2022-07-11 14:23:04,283][26022] Updated weights on worker 0-0, policy_version 1229138 (0.00829) [2022-07-11 14:23:05,197][25689] Fps is (10 sec: 5449.2, 60 sec: 5565.4, 300 sec: 5601.4). Total num frames: 1258643456. Throughput: 0: 5692.2. Samples: 1258647812. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:05,197][25689] Avg episode reward: [(0, '0.183')] [2022-07-11 14:23:05,979][26022] Updated weights on worker 0-0, policy_version 1229148 (0.00082) [2022-07-11 14:23:07,802][26022] Updated weights on worker 0-0, policy_version 1229158 (0.00085) [2022-07-11 14:23:09,636][26022] Updated weights on worker 0-0, policy_version 1229168 (0.00085) [2022-07-11 14:23:10,212][25689] Fps is (10 sec: 5293.5, 60 sec: 5547.2, 300 sec: 5601.4). Total num frames: 1258670080. Throughput: 0: 4888.5. Samples: 1258664646. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:10,212][25689] Avg episode reward: [(0, '0.145')] [2022-07-11 14:23:11,676][26022] Updated weights on worker 0-0, policy_version 1229178 (0.00090) [2022-07-11 14:23:13,417][26022] Updated weights on worker 0-0, policy_version 1229188 (0.00084) [2022-07-11 14:23:15,220][25689] Fps is (10 sec: 5516.8, 60 sec: 5546.7, 300 sec: 5598.9). Total num frames: 1258698752. Throughput: 0: 5721.4. Samples: 1258698294. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:15,220][25689] Avg episode reward: [(0, '0.220')] [2022-07-11 14:23:15,227][26022] Updated weights on worker 0-0, policy_version 1229198 (0.00294) [2022-07-11 14:23:17,010][26022] Updated weights on worker 0-0, policy_version 1229208 (0.00091) [2022-07-11 14:23:18,973][26022] Updated weights on worker 0-0, policy_version 1229218 (0.00087) [2022-07-11 14:23:20,370][25689] Fps is (10 sec: 5645.2, 60 sec: 5540.0, 300 sec: 5593.7). Total num frames: 1258727424. Throughput: 0: 5715.9. Samples: 1258731976. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:20,370][25689] Avg episode reward: [(0, '1.286')] [2022-07-11 14:23:20,494][26022] Updated weights on worker 0-0, policy_version 1229228 (0.00055) [2022-07-11 14:23:22,581][26022] Updated weights on worker 0-0, policy_version 1229238 (0.00087) [2022-07-11 14:23:24,012][26022] Updated weights on worker 0-0, policy_version 1229248 (0.00088) [2022-07-11 14:23:25,409][25689] Fps is (10 sec: 5627.9, 60 sec: 5556.9, 300 sec: 5601.0). Total num frames: 1258756096. Throughput: 0: 5846.9. Samples: 1258765992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:25,410][25689] Avg episode reward: [(0, '1.537')] [2022-07-11 14:23:26,226][26022] Updated weights on worker 0-0, policy_version 1229258 (0.00087) [2022-07-11 14:23:27,865][26022] Updated weights on worker 0-0, policy_version 1229268 (0.00082) [2022-07-11 14:23:29,726][26022] Updated weights on worker 0-0, policy_version 1229278 (0.00087) [2022-07-11 14:23:30,475][25689] Fps is (10 sec: 5674.8, 60 sec: 5535.6, 300 sec: 5593.5). Total num frames: 1258784768. Throughput: 0: 5822.2. Samples: 1258782622. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:30,475][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 14:23:31,675][26022] Updated weights on worker 0-0, policy_version 1229288 (0.00096) [2022-07-11 14:23:33,259][26022] Updated weights on worker 0-0, policy_version 1229298 (0.00082) [2022-07-11 14:23:35,270][26022] Updated weights on worker 0-0, policy_version 1229308 (0.00083) [2022-07-11 14:23:35,486][25689] Fps is (10 sec: 5690.5, 60 sec: 5589.4, 300 sec: 5595.6). Total num frames: 1258813440. Throughput: 0: 5830.9. Samples: 1258816466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:35,487][25689] Avg episode reward: [(0, '1.028')] [2022-07-11 14:23:37,076][26022] Updated weights on worker 0-0, policy_version 1229318 (0.00088) [2022-07-11 14:23:38,787][26022] Updated weights on worker 0-0, policy_version 1229328 (0.00083) [2022-07-11 14:23:40,535][25689] Fps is (10 sec: 5598.2, 60 sec: 5558.5, 300 sec: 5591.4). Total num frames: 1258841088. Throughput: 0: 5873.1. Samples: 1258850408. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:40,536][25689] Avg episode reward: [(0, '1.114')] [2022-07-11 14:23:40,723][26022] Updated weights on worker 0-0, policy_version 1229338 (0.00085) [2022-07-11 14:23:42,335][26022] Updated weights on worker 0-0, policy_version 1229348 (0.00085) [2022-07-11 14:23:44,286][26022] Updated weights on worker 0-0, policy_version 1229358 (0.00087) [2022-07-11 14:23:45,609][25689] Fps is (10 sec: 5563.7, 60 sec: 5559.9, 300 sec: 5590.5). Total num frames: 1258869760. Throughput: 0: 5028.5. Samples: 1258867568. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:45,610][25689] Avg episode reward: [(0, '0.805')] [2022-07-11 14:23:45,996][26022] Updated weights on worker 0-0, policy_version 1229368 (0.00086) [2022-07-11 14:23:47,853][26022] Updated weights on worker 0-0, policy_version 1229378 (0.00083) [2022-07-11 14:23:49,664][26022] Updated weights on worker 0-0, policy_version 1229388 (0.00088) [2022-07-11 14:23:50,657][25689] Fps is (10 sec: 5766.3, 60 sec: 5591.8, 300 sec: 5600.2). Total num frames: 1258899456. Throughput: 0: 5889.9. Samples: 1258901496. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:50,658][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 14:23:51,472][26022] Updated weights on worker 0-0, policy_version 1229398 (0.00090) [2022-07-11 14:23:53,271][26022] Updated weights on worker 0-0, policy_version 1229408 (0.00086) [2022-07-11 14:23:55,204][26022] Updated weights on worker 0-0, policy_version 1229418 (0.00091) [2022-07-11 14:23:55,667][25689] Fps is (10 sec: 5599.5, 60 sec: 5574.6, 300 sec: 5591.7). Total num frames: 1258926080. Throughput: 0: 5877.9. Samples: 1258935088. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:23:55,668][25689] Avg episode reward: [(0, '0.228')] [2022-07-11 14:23:56,860][26022] Updated weights on worker 0-0, policy_version 1229428 (0.00088) [2022-07-11 14:23:58,915][26022] Updated weights on worker 0-0, policy_version 1229438 (0.00088) [2022-07-11 14:24:00,682][26022] Updated weights on worker 0-0, policy_version 1229448 (0.00091) [2022-07-11 14:24:00,781][25689] Fps is (10 sec: 5461.9, 60 sec: 5573.6, 300 sec: 5597.4). Total num frames: 1258954752. Throughput: 0: 5008.7. Samples: 1258951814. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:00,782][25689] Avg episode reward: [(0, '1.203')] [2022-07-11 14:24:02,721][26022] Updated weights on worker 0-0, policy_version 1229458 (0.00090) [2022-07-11 14:24:04,750][26022] Updated weights on worker 0-0, policy_version 1229468 (0.00094) [2022-07-11 14:24:05,818][25689] Fps is (10 sec: 5548.5, 60 sec: 5591.2, 300 sec: 5590.0). Total num frames: 1258982400. Throughput: 0: 5734.3. Samples: 1258983450. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:05,818][25689] Avg episode reward: [(0, '1.371')] [2022-07-11 14:24:06,354][26022] Updated weights on worker 0-0, policy_version 1229478 (0.00090) [2022-07-11 14:24:08,313][26022] Updated weights on worker 0-0, policy_version 1229488 (0.00086) [2022-07-11 14:24:10,460][26022] Updated weights on worker 0-0, policy_version 1229498 (0.00094) [2022-07-11 14:24:10,852][25689] Fps is (10 sec: 5389.3, 60 sec: 5589.5, 300 sec: 5582.7). Total num frames: 1259009024. Throughput: 0: 5722.6. Samples: 1259017060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:10,852][25689] Avg episode reward: [(0, '0.670')] [2022-07-11 14:24:11,847][26022] Updated weights on worker 0-0, policy_version 1229508 (0.00086) [2022-07-11 14:24:13,840][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:24:13,850][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001229517_1259025408.pth [2022-07-11 14:24:13,851][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001227548_1257009152.pth [2022-07-11 14:24:13,930][26022] Updated weights on worker 0-0, policy_version 1229518 (0.00088) [2022-07-11 14:24:15,604][26022] Updated weights on worker 0-0, policy_version 1229528 (0.00089) [2022-07-11 14:24:15,865][25689] Fps is (10 sec: 5503.8, 60 sec: 5589.0, 300 sec: 5588.1). Total num frames: 1259037696. Throughput: 0: 4894.1. Samples: 1259033936. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:15,865][25689] Avg episode reward: [(0, '0.713')] [2022-07-11 14:24:17,406][26022] Updated weights on worker 0-0, policy_version 1229538 (0.00091) [2022-07-11 14:24:19,472][26022] Updated weights on worker 0-0, policy_version 1229548 (0.00088) [2022-07-11 14:24:20,943][25689] Fps is (10 sec: 5784.1, 60 sec: 5612.6, 300 sec: 5590.4). Total num frames: 1259067392. Throughput: 0: 5720.1. Samples: 1259067138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:20,943][25689] Avg episode reward: [(0, '-0.349')] [2022-07-11 14:24:20,947][26022] Updated weights on worker 0-0, policy_version 1229558 (0.00092) [2022-07-11 14:24:23,115][26022] Updated weights on worker 0-0, policy_version 1229568 (0.00087) [2022-07-11 14:24:24,650][26022] Updated weights on worker 0-0, policy_version 1229578 (0.00099) [2022-07-11 14:24:26,030][25689] Fps is (10 sec: 5339.1, 60 sec: 5540.6, 300 sec: 5575.1). Total num frames: 1259091968. Throughput: 0: 5810.4. Samples: 1259100888. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:26,031][25689] Avg episode reward: [(0, '-0.422')] [2022-07-11 14:24:26,552][26022] Updated weights on worker 0-0, policy_version 1229588 (0.00094) [2022-07-11 14:24:28,709][26022] Updated weights on worker 0-0, policy_version 1229598 (0.00082) [2022-07-11 14:24:30,288][26022] Updated weights on worker 0-0, policy_version 1229608 (0.00089) [2022-07-11 14:24:31,043][25689] Fps is (10 sec: 5373.2, 60 sec: 5562.2, 300 sec: 5582.1). Total num frames: 1259121664. Throughput: 0: 4973.0. Samples: 1259117472. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:31,043][25689] Avg episode reward: [(0, '-0.490')] [2022-07-11 14:24:32,243][26022] Updated weights on worker 0-0, policy_version 1229618 (0.00079) [2022-07-11 14:24:34,119][26022] Updated weights on worker 0-0, policy_version 1229628 (0.00092) [2022-07-11 14:24:35,667][26022] Updated weights on worker 0-0, policy_version 1229638 (0.00094) [2022-07-11 14:24:36,087][25689] Fps is (10 sec: 5803.2, 60 sec: 5559.3, 300 sec: 5578.9). Total num frames: 1259150336. Throughput: 0: 5794.5. Samples: 1259151116. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:36,088][25689] Avg episode reward: [(0, '-0.704')] [2022-07-11 14:24:37,860][26022] Updated weights on worker 0-0, policy_version 1229648 (0.00090) [2022-07-11 14:24:39,406][26022] Updated weights on worker 0-0, policy_version 1229658 (0.00088) [2022-07-11 14:24:41,149][25689] Fps is (10 sec: 5573.2, 60 sec: 5558.1, 300 sec: 5578.2). Total num frames: 1259177984. Throughput: 0: 5824.2. Samples: 1259184822. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2022-07-11 14:24:41,149][25689] Avg episode reward: [(0, '0.382')] [2022-07-11 14:24:41,429][26022] Updated weights on worker 0-0, policy_version 1229668 (0.00080) [2022-07-11 14:24:43,162][26022] Updated weights on worker 0-0, policy_version 1229678 (0.00086) [2022-07-11 14:24:45,172][26022] Updated weights on worker 0-0, policy_version 1229688 (0.00090) [2022-07-11 14:24:46,170][25689] Fps is (10 sec: 5687.4, 60 sec: 5579.9, 300 sec: 5581.6). Total num frames: 1259207680. Throughput: 0: 5005.0. Samples: 1259201690. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:24:46,170][25689] Avg episode reward: [(0, '0.093')] [2022-07-11 14:24:46,866][26022] Updated weights on worker 0-0, policy_version 1229698 (0.00081) [2022-07-11 14:24:48,645][26022] Updated weights on worker 0-0, policy_version 1229708 (0.00087) [2022-07-11 14:24:50,299][26022] Updated weights on worker 0-0, policy_version 1229718 (0.00083) [2022-07-11 14:24:51,259][25689] Fps is (10 sec: 5570.7, 60 sec: 5525.4, 300 sec: 5573.5). Total num frames: 1259234304. Throughput: 0: 5834.2. Samples: 1259235414. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:24:51,259][25689] Avg episode reward: [(0, '1.451')] [2022-07-11 14:24:52,296][26022] Updated weights on worker 0-0, policy_version 1229728 (0.00083) [2022-07-11 14:24:54,004][26022] Updated weights on worker 0-0, policy_version 1229738 (0.00088) [2022-07-11 14:24:56,017][26022] Updated weights on worker 0-0, policy_version 1229748 (0.00090) [2022-07-11 14:24:56,263][25689] Fps is (10 sec: 5580.3, 60 sec: 5576.7, 300 sec: 5575.3). Total num frames: 1259264000. Throughput: 0: 5865.9. Samples: 1259269462. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:24:56,263][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 14:24:57,703][26022] Updated weights on worker 0-0, policy_version 1229758 (0.00079) [2022-07-11 14:24:59,629][26022] Updated weights on worker 0-0, policy_version 1229768 (0.00087) [2022-07-11 14:25:01,379][25689] Fps is (10 sec: 5666.2, 60 sec: 5559.5, 300 sec: 5578.3). Total num frames: 1259291648. Throughput: 0: 5010.5. Samples: 1259286186. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:01,381][25689] Avg episode reward: [(0, '0.433')] [2022-07-11 14:25:01,404][26022] Updated weights on worker 0-0, policy_version 1229778 (0.00090) [2022-07-11 14:25:03,642][26022] Updated weights on worker 0-0, policy_version 1229788 (0.00085) [2022-07-11 14:25:05,410][26022] Updated weights on worker 0-0, policy_version 1229798 (0.00085) [2022-07-11 14:25:06,400][25689] Fps is (10 sec: 5353.6, 60 sec: 5544.1, 300 sec: 5571.2). Total num frames: 1259318272. Throughput: 0: 5737.0. Samples: 1259317750. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:06,401][25689] Avg episode reward: [(0, '0.389')] [2022-07-11 14:25:07,402][26022] Updated weights on worker 0-0, policy_version 1229808 (0.00084) [2022-07-11 14:25:08,991][26022] Updated weights on worker 0-0, policy_version 1229818 (0.00092) [2022-07-11 14:25:11,054][26022] Updated weights on worker 0-0, policy_version 1229828 (0.00085) [2022-07-11 14:25:11,431][25689] Fps is (10 sec: 5398.9, 60 sec: 5561.2, 300 sec: 5570.7). Total num frames: 1259345920. Throughput: 0: 5746.7. Samples: 1259351340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:11,435][25689] Avg episode reward: [(0, '0.146')] [2022-07-11 14:25:12,624][26022] Updated weights on worker 0-0, policy_version 1229838 (0.00090) [2022-07-11 14:25:14,583][26022] Updated weights on worker 0-0, policy_version 1229848 (0.00055) [2022-07-11 14:25:16,375][26022] Updated weights on worker 0-0, policy_version 1229858 (0.00084) [2022-07-11 14:25:16,470][25689] Fps is (10 sec: 5592.8, 60 sec: 5558.8, 300 sec: 5564.3). Total num frames: 1259374592. Throughput: 0: 4893.7. Samples: 1259368352. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:16,471][25689] Avg episode reward: [(0, '0.537')] [2022-07-11 14:25:18,339][26022] Updated weights on worker 0-0, policy_version 1229868 (0.00091) [2022-07-11 14:25:20,073][26022] Updated weights on worker 0-0, policy_version 1229878 (0.00091) [2022-07-11 14:25:21,530][25689] Fps is (10 sec: 5577.1, 60 sec: 5526.7, 300 sec: 5567.0). Total num frames: 1259402240. Throughput: 0: 5743.0. Samples: 1259401912. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:21,530][25689] Avg episode reward: [(0, '0.778')] [2022-07-11 14:25:21,763][26022] Updated weights on worker 0-0, policy_version 1229888 (0.00093) [2022-07-11 14:25:23,806][26022] Updated weights on worker 0-0, policy_version 1229898 (0.00100) [2022-07-11 14:25:25,368][26022] Updated weights on worker 0-0, policy_version 1229908 (0.00081) [2022-07-11 14:25:26,569][25689] Fps is (10 sec: 5576.9, 60 sec: 5598.7, 300 sec: 5566.8). Total num frames: 1259430912. Throughput: 0: 5858.4. Samples: 1259435908. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:26,570][25689] Avg episode reward: [(0, '1.554')] [2022-07-11 14:25:27,588][26022] Updated weights on worker 0-0, policy_version 1229918 (0.00090) [2022-07-11 14:25:28,984][26022] Updated weights on worker 0-0, policy_version 1229928 (0.00100) [2022-07-11 14:25:31,134][26022] Updated weights on worker 0-0, policy_version 1229938 (0.00088) [2022-07-11 14:25:31,661][25689] Fps is (10 sec: 5660.3, 60 sec: 5574.6, 300 sec: 5566.0). Total num frames: 1259459584. Throughput: 0: 5832.0. Samples: 1259469318. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:31,661][25689] Avg episode reward: [(0, '1.250')] [2022-07-11 14:25:32,750][26022] Updated weights on worker 0-0, policy_version 1229948 (0.00086) [2022-07-11 14:25:34,719][26022] Updated weights on worker 0-0, policy_version 1229958 (0.00085) [2022-07-11 14:25:36,414][26022] Updated weights on worker 0-0, policy_version 1229968 (0.00087) [2022-07-11 14:25:36,729][25689] Fps is (10 sec: 5644.1, 60 sec: 5572.4, 300 sec: 5570.5). Total num frames: 1259488256. Throughput: 0: 5827.0. Samples: 1259486400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:36,731][25689] Avg episode reward: [(0, '1.371')] [2022-07-11 14:25:38,473][26022] Updated weights on worker 0-0, policy_version 1229978 (0.00093) [2022-07-11 14:25:40,176][26022] Updated weights on worker 0-0, policy_version 1229988 (0.00092) [2022-07-11 14:25:41,782][25689] Fps is (10 sec: 5564.5, 60 sec: 5573.1, 300 sec: 5562.7). Total num frames: 1259515904. Throughput: 0: 5818.6. Samples: 1259519752. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:41,783][25689] Avg episode reward: [(0, '1.465')] [2022-07-11 14:25:42,044][26022] Updated weights on worker 0-0, policy_version 1229998 (0.00094) [2022-07-11 14:25:43,651][26022] Updated weights on worker 0-0, policy_version 1230008 (0.00085) [2022-07-11 14:25:45,643][26022] Updated weights on worker 0-0, policy_version 1230018 (0.00093) [2022-07-11 14:25:46,846][25689] Fps is (10 sec: 5566.9, 60 sec: 5552.4, 300 sec: 5565.2). Total num frames: 1259544576. Throughput: 0: 5795.8. Samples: 1259553428. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:46,846][25689] Avg episode reward: [(0, '1.383')] [2022-07-11 14:25:47,350][26022] Updated weights on worker 0-0, policy_version 1230028 (0.00099) [2022-07-11 14:25:49,477][26022] Updated weights on worker 0-0, policy_version 1230038 (0.00087) [2022-07-11 14:25:51,082][26022] Updated weights on worker 0-0, policy_version 1230048 (0.00057) [2022-07-11 14:25:51,869][25689] Fps is (10 sec: 5481.9, 60 sec: 5558.4, 300 sec: 5565.3). Total num frames: 1259571200. Throughput: 0: 4970.4. Samples: 1259569762. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:51,870][25689] Avg episode reward: [(0, '1.676')] [2022-07-11 14:25:52,965][26022] Updated weights on worker 0-0, policy_version 1230058 (0.00091) [2022-07-11 14:25:54,936][26022] Updated weights on worker 0-0, policy_version 1230068 (0.00093) [2022-07-11 14:25:56,618][26022] Updated weights on worker 0-0, policy_version 1230078 (0.00087) [2022-07-11 14:25:56,885][25689] Fps is (10 sec: 5610.3, 60 sec: 5557.3, 300 sec: 5566.8). Total num frames: 1259600896. Throughput: 0: 5809.8. Samples: 1259603500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:25:56,885][25689] Avg episode reward: [(0, '1.640')] [2022-07-11 14:25:58,680][26022] Updated weights on worker 0-0, policy_version 1230088 (0.00088) [2022-07-11 14:26:00,436][26022] Updated weights on worker 0-0, policy_version 1230098 (0.00086) [2022-07-11 14:26:01,955][25689] Fps is (10 sec: 5482.8, 60 sec: 5527.8, 300 sec: 5565.8). Total num frames: 1259626496. Throughput: 0: 5773.6. Samples: 1259636220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:01,955][25689] Avg episode reward: [(0, '1.582')] [2022-07-11 14:26:02,726][26022] Updated weights on worker 0-0, policy_version 1230108 (0.00084) [2022-07-11 14:26:04,658][26022] Updated weights on worker 0-0, policy_version 1230118 (0.00115) [2022-07-11 14:26:06,223][26022] Updated weights on worker 0-0, policy_version 1230128 (0.00085) [2022-07-11 14:26:07,002][25689] Fps is (10 sec: 5364.3, 60 sec: 5559.2, 300 sec: 5565.2). Total num frames: 1259655168. Throughput: 0: 4877.8. Samples: 1259651744. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:07,002][25689] Avg episode reward: [(0, '1.470')] [2022-07-11 14:26:08,151][26022] Updated weights on worker 0-0, policy_version 1230138 (0.00091) [2022-07-11 14:26:10,052][26022] Updated weights on worker 0-0, policy_version 1230148 (0.00518) [2022-07-11 14:26:11,910][26022] Updated weights on worker 0-0, policy_version 1230158 (0.00090) [2022-07-11 14:26:12,019][25689] Fps is (10 sec: 5494.4, 60 sec: 5543.6, 300 sec: 5568.8). Total num frames: 1259681792. Throughput: 0: 5722.9. Samples: 1259685074. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:12,019][25689] Avg episode reward: [(0, '1.307')] [2022-07-11 14:26:13,748][26022] Updated weights on worker 0-0, policy_version 1230168 (0.00091) [2022-07-11 14:26:13,907][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:26:13,918][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001230169_1259693056.pth [2022-07-11 14:26:13,918][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001228211_1257688064.pth [2022-07-11 14:26:15,426][26022] Updated weights on worker 0-0, policy_version 1230178 (0.00090) [2022-07-11 14:26:17,027][25689] Fps is (10 sec: 5413.7, 60 sec: 5529.5, 300 sec: 5559.9). Total num frames: 1259709440. Throughput: 0: 5699.8. Samples: 1259718306. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:17,027][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 14:26:17,366][26022] Updated weights on worker 0-0, policy_version 1230188 (0.00087) [2022-07-11 14:26:19,299][26022] Updated weights on worker 0-0, policy_version 1230198 (0.00092) [2022-07-11 14:26:21,149][26022] Updated weights on worker 0-0, policy_version 1230208 (0.00090) [2022-07-11 14:26:22,139][25689] Fps is (10 sec: 5666.2, 60 sec: 5558.5, 300 sec: 5561.9). Total num frames: 1259739136. Throughput: 0: 4896.2. Samples: 1259735046. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:22,140][25689] Avg episode reward: [(0, '0.866')] [2022-07-11 14:26:23,025][26022] Updated weights on worker 0-0, policy_version 1230218 (0.00085) [2022-07-11 14:26:24,849][26022] Updated weights on worker 0-0, policy_version 1230228 (0.00090) [2022-07-11 14:26:26,784][26022] Updated weights on worker 0-0, policy_version 1230238 (0.00084) [2022-07-11 14:26:27,163][25689] Fps is (10 sec: 5556.4, 60 sec: 5526.1, 300 sec: 5558.2). Total num frames: 1259765760. Throughput: 0: 5784.8. Samples: 1259768372. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:27,165][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 14:26:28,510][26022] Updated weights on worker 0-0, policy_version 1230248 (0.00094) [2022-07-11 14:26:30,548][26022] Updated weights on worker 0-0, policy_version 1230258 (0.00103) [2022-07-11 14:26:32,043][26022] Updated weights on worker 0-0, policy_version 1230268 (0.00096) [2022-07-11 14:26:32,203][25689] Fps is (10 sec: 5494.6, 60 sec: 5530.8, 300 sec: 5557.9). Total num frames: 1259794432. Throughput: 0: 5756.4. Samples: 1259801262. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:32,203][25689] Avg episode reward: [(0, '1.083')] [2022-07-11 14:26:34,175][26022] Updated weights on worker 0-0, policy_version 1230278 (0.00089) [2022-07-11 14:26:36,082][26022] Updated weights on worker 0-0, policy_version 1230288 (0.00059) [2022-07-11 14:26:37,230][25689] Fps is (10 sec: 5492.9, 60 sec: 5500.8, 300 sec: 5554.7). Total num frames: 1259821056. Throughput: 0: 4933.7. Samples: 1259817980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:37,232][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 14:26:37,583][26022] Updated weights on worker 0-0, policy_version 1230298 (0.00075) [2022-07-11 14:26:39,842][26022] Updated weights on worker 0-0, policy_version 1230308 (0.00089) [2022-07-11 14:26:41,395][26022] Updated weights on worker 0-0, policy_version 1230318 (0.00088) [2022-07-11 14:26:42,311][25689] Fps is (10 sec: 5571.9, 60 sec: 5532.1, 300 sec: 5557.1). Total num frames: 1259850752. Throughput: 0: 5783.4. Samples: 1259851706. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:42,311][25689] Avg episode reward: [(0, '1.364')] [2022-07-11 14:26:43,228][26022] Updated weights on worker 0-0, policy_version 1230328 (0.00081) [2022-07-11 14:26:45,034][26022] Updated weights on worker 0-0, policy_version 1230338 (0.00085) [2022-07-11 14:26:47,048][26022] Updated weights on worker 0-0, policy_version 1230348 (0.00088) [2022-07-11 14:26:47,327][25689] Fps is (10 sec: 5577.6, 60 sec: 5502.5, 300 sec: 5553.8). Total num frames: 1259877376. Throughput: 0: 5815.4. Samples: 1259885636. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:47,329][25689] Avg episode reward: [(0, '0.978')] [2022-07-11 14:26:48,875][26022] Updated weights on worker 0-0, policy_version 1230358 (0.00093) [2022-07-11 14:26:50,675][26022] Updated weights on worker 0-0, policy_version 1230368 (0.00088) [2022-07-11 14:26:52,344][25689] Fps is (10 sec: 5511.3, 60 sec: 5537.0, 300 sec: 5553.8). Total num frames: 1259906048. Throughput: 0: 5006.2. Samples: 1259902092. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:52,344][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 14:26:52,436][26022] Updated weights on worker 0-0, policy_version 1230378 (0.00090) [2022-07-11 14:26:54,216][26022] Updated weights on worker 0-0, policy_version 1230388 (0.00086) [2022-07-11 14:26:56,084][26022] Updated weights on worker 0-0, policy_version 1230398 (0.00102) [2022-07-11 14:26:57,399][25689] Fps is (10 sec: 5592.0, 60 sec: 5499.5, 300 sec: 5553.6). Total num frames: 1259933696. Throughput: 0: 5854.4. Samples: 1259936058. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:26:57,400][25689] Avg episode reward: [(0, '0.308')] [2022-07-11 14:26:57,815][26022] Updated weights on worker 0-0, policy_version 1230408 (0.00084) [2022-07-11 14:26:59,816][26022] Updated weights on worker 0-0, policy_version 1230418 (0.00089) [2022-07-11 14:27:01,972][26022] Updated weights on worker 0-0, policy_version 1230428 (0.00090) [2022-07-11 14:27:02,511][25689] Fps is (10 sec: 5438.8, 60 sec: 5529.5, 300 sec: 5555.8). Total num frames: 1259961344. Throughput: 0: 5723.4. Samples: 1259967320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:02,511][25689] Avg episode reward: [(0, '0.297')] [2022-07-11 14:27:03,771][26022] Updated weights on worker 0-0, policy_version 1230438 (0.00099) [2022-07-11 14:27:05,607][26022] Updated weights on worker 0-0, policy_version 1230448 (0.00086) [2022-07-11 14:27:07,306][26022] Updated weights on worker 0-0, policy_version 1230458 (0.00079) [2022-07-11 14:27:07,521][25689] Fps is (10 sec: 5462.8, 60 sec: 5516.0, 300 sec: 5555.6). Total num frames: 1259988992. Throughput: 0: 4871.4. Samples: 1259984008. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:07,521][25689] Avg episode reward: [(0, '0.327')] [2022-07-11 14:27:09,256][26022] Updated weights on worker 0-0, policy_version 1230468 (0.00086) [2022-07-11 14:27:11,055][26022] Updated weights on worker 0-0, policy_version 1230478 (0.00081) [2022-07-11 14:27:12,602][25689] Fps is (10 sec: 5581.0, 60 sec: 5544.0, 300 sec: 5554.2). Total num frames: 1260017664. Throughput: 0: 5714.5. Samples: 1260017858. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:12,603][25689] Avg episode reward: [(0, '0.246')] [2022-07-11 14:27:12,840][26022] Updated weights on worker 0-0, policy_version 1230488 (0.00082) [2022-07-11 14:27:14,683][26022] Updated weights on worker 0-0, policy_version 1230498 (0.00088) [2022-07-11 14:27:16,447][26022] Updated weights on worker 0-0, policy_version 1230508 (0.00099) [2022-07-11 14:27:17,632][25689] Fps is (10 sec: 5671.7, 60 sec: 5558.9, 300 sec: 5555.0). Total num frames: 1260046336. Throughput: 0: 5727.0. Samples: 1260051932. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:17,632][25689] Avg episode reward: [(0, '0.164')] [2022-07-11 14:27:18,210][26022] Updated weights on worker 0-0, policy_version 1230518 (0.00625) [2022-07-11 14:27:20,060][26022] Updated weights on worker 0-0, policy_version 1230528 (0.00089) [2022-07-11 14:27:21,733][26022] Updated weights on worker 0-0, policy_version 1230538 (0.00081) [2022-07-11 14:27:22,723][25689] Fps is (10 sec: 5666.1, 60 sec: 5543.9, 300 sec: 5557.5). Total num frames: 1260075008. Throughput: 0: 5017.0. Samples: 1260068722. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:22,723][25689] Avg episode reward: [(0, '0.464')] [2022-07-11 14:27:23,673][26022] Updated weights on worker 0-0, policy_version 1230548 (0.00085) [2022-07-11 14:27:25,302][26022] Updated weights on worker 0-0, policy_version 1230558 (0.00096) [2022-07-11 14:27:27,487][26022] Updated weights on worker 0-0, policy_version 1230568 (0.00096) [2022-07-11 14:27:27,727][25689] Fps is (10 sec: 5578.9, 60 sec: 5562.6, 300 sec: 5550.9). Total num frames: 1260102656. Throughput: 0: 5880.8. Samples: 1260102834. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:27,727][25689] Avg episode reward: [(0, '-0.309')] [2022-07-11 14:27:29,033][26022] Updated weights on worker 0-0, policy_version 1230578 (0.00094) [2022-07-11 14:27:31,059][26022] Updated weights on worker 0-0, policy_version 1230588 (0.00098) [2022-07-11 14:27:32,732][25689] Fps is (10 sec: 5626.6, 60 sec: 5565.8, 300 sec: 5562.0). Total num frames: 1260131328. Throughput: 0: 5884.5. Samples: 1260136314. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:32,733][25689] Avg episode reward: [(0, '-0.390')] [2022-07-11 14:27:32,957][26022] Updated weights on worker 0-0, policy_version 1230598 (0.00088) [2022-07-11 14:27:34,565][26022] Updated weights on worker 0-0, policy_version 1230608 (0.00088) [2022-07-11 14:27:36,553][26022] Updated weights on worker 0-0, policy_version 1230618 (0.00095) [2022-07-11 14:27:37,753][25689] Fps is (10 sec: 5719.6, 60 sec: 5600.2, 300 sec: 5559.6). Total num frames: 1260160000. Throughput: 0: 5036.1. Samples: 1260153264. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:37,753][25689] Avg episode reward: [(0, '-0.441')] [2022-07-11 14:27:38,190][26022] Updated weights on worker 0-0, policy_version 1230628 (0.00095) [2022-07-11 14:27:40,050][26022] Updated weights on worker 0-0, policy_version 1230638 (0.00104) [2022-07-11 14:27:42,252][26022] Updated weights on worker 0-0, policy_version 1230648 (0.00090) [2022-07-11 14:27:42,840][25689] Fps is (10 sec: 5673.3, 60 sec: 5582.8, 300 sec: 5559.7). Total num frames: 1260188672. Throughput: 0: 5878.1. Samples: 1260186972. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:42,840][25689] Avg episode reward: [(0, '0.157')] [2022-07-11 14:27:43,843][26022] Updated weights on worker 0-0, policy_version 1230658 (0.00090) [2022-07-11 14:27:45,743][26022] Updated weights on worker 0-0, policy_version 1230668 (0.00086) [2022-07-11 14:27:47,542][26022] Updated weights on worker 0-0, policy_version 1230678 (0.00086) [2022-07-11 14:27:47,891][25689] Fps is (10 sec: 5454.0, 60 sec: 5579.5, 300 sec: 5555.8). Total num frames: 1260215296. Throughput: 0: 5853.8. Samples: 1260220874. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:47,892][25689] Avg episode reward: [(0, '0.163')] [2022-07-11 14:27:49,260][26022] Updated weights on worker 0-0, policy_version 1230688 (0.00090) [2022-07-11 14:27:51,205][26022] Updated weights on worker 0-0, policy_version 1230698 (0.00422) [2022-07-11 14:27:52,712][26022] Updated weights on worker 0-0, policy_version 1230708 (0.00084) [2022-07-11 14:27:52,901][25689] Fps is (10 sec: 5597.7, 60 sec: 5597.1, 300 sec: 5562.6). Total num frames: 1260244992. Throughput: 0: 5027.7. Samples: 1260237718. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:52,902][25689] Avg episode reward: [(0, '1.440')] [2022-07-11 14:27:54,859][26022] Updated weights on worker 0-0, policy_version 1230718 (0.00089) [2022-07-11 14:27:56,552][26022] Updated weights on worker 0-0, policy_version 1230728 (0.00090) [2022-07-11 14:27:57,944][25689] Fps is (10 sec: 5704.5, 60 sec: 5598.2, 300 sec: 5560.3). Total num frames: 1260272640. Throughput: 0: 5876.2. Samples: 1260271910. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:27:57,944][25689] Avg episode reward: [(0, '1.593')] [2022-07-11 14:27:58,459][26022] Updated weights on worker 0-0, policy_version 1230738 (0.00102) [2022-07-11 14:28:00,176][26022] Updated weights on worker 0-0, policy_version 1230748 (0.00086) [2022-07-11 14:28:02,402][26022] Updated weights on worker 0-0, policy_version 1230758 (0.00096) [2022-07-11 14:28:03,092][25689] Fps is (10 sec: 5225.0, 60 sec: 5561.1, 300 sec: 5554.9). Total num frames: 1260298240. Throughput: 0: 5846.7. Samples: 1260305380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:03,093][25689] Avg episode reward: [(0, '0.500')] [2022-07-11 14:28:03,962][26022] Updated weights on worker 0-0, policy_version 1230768 (0.00093) [2022-07-11 14:28:06,255][26022] Updated weights on worker 0-0, policy_version 1230778 (0.00086) [2022-07-11 14:28:07,610][26022] Updated weights on worker 0-0, policy_version 1230788 (0.00087) [2022-07-11 14:28:08,124][25689] Fps is (10 sec: 5431.7, 60 sec: 5592.9, 300 sec: 5564.9). Total num frames: 1260327936. Throughput: 0: 5738.1. Samples: 1260336970. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:08,124][25689] Avg episode reward: [(0, '0.566')] [2022-07-11 14:28:09,789][26022] Updated weights on worker 0-0, policy_version 1230798 (0.00089) [2022-07-11 14:28:11,338][26022] Updated weights on worker 0-0, policy_version 1230808 (0.00084) [2022-07-11 14:28:13,178][25689] Fps is (10 sec: 5787.2, 60 sec: 5595.4, 300 sec: 5564.1). Total num frames: 1260356608. Throughput: 0: 5725.6. Samples: 1260353814. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:13,178][25689] Avg episode reward: [(0, '0.081')] [2022-07-11 14:28:13,179][26022] Updated weights on worker 0-0, policy_version 1230818 (0.00083) [2022-07-11 14:28:13,940][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:28:13,949][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001230821_1260360704.pth [2022-07-11 14:28:13,949][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001228865_1258357760.pth [2022-07-11 14:28:15,099][26022] Updated weights on worker 0-0, policy_version 1230828 (0.00082) [2022-07-11 14:28:16,850][26022] Updated weights on worker 0-0, policy_version 1230838 (0.00089) [2022-07-11 14:28:18,219][25689] Fps is (10 sec: 5579.1, 60 sec: 5577.4, 300 sec: 5562.7). Total num frames: 1260384256. Throughput: 0: 5701.3. Samples: 1260387504. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:18,219][25689] Avg episode reward: [(0, '-0.017')] [2022-07-11 14:28:18,954][26022] Updated weights on worker 0-0, policy_version 1230848 (0.00088) [2022-07-11 14:28:20,643][26022] Updated weights on worker 0-0, policy_version 1230858 (0.00085) [2022-07-11 14:28:22,452][26022] Updated weights on worker 0-0, policy_version 1230868 (0.00091) [2022-07-11 14:28:23,287][25689] Fps is (10 sec: 5672.5, 60 sec: 5596.4, 300 sec: 5565.6). Total num frames: 1260413952. Throughput: 0: 5736.9. Samples: 1260421236. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:23,287][25689] Avg episode reward: [(0, '-0.333')] [2022-07-11 14:28:24,203][26022] Updated weights on worker 0-0, policy_version 1230878 (0.00093) [2022-07-11 14:28:26,170][26022] Updated weights on worker 0-0, policy_version 1230888 (0.00098) [2022-07-11 14:28:27,920][26022] Updated weights on worker 0-0, policy_version 1230898 (0.00092) [2022-07-11 14:28:28,327][25689] Fps is (10 sec: 5571.8, 60 sec: 5576.3, 300 sec: 5559.2). Total num frames: 1260440576. Throughput: 0: 5003.9. Samples: 1260438062. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:28,327][25689] Avg episode reward: [(0, '-0.636')] [2022-07-11 14:28:29,715][26022] Updated weights on worker 0-0, policy_version 1230908 (0.00085) [2022-07-11 14:28:31,619][26022] Updated weights on worker 0-0, policy_version 1230918 (0.00085) [2022-07-11 14:28:33,294][26022] Updated weights on worker 0-0, policy_version 1230928 (0.00074) [2022-07-11 14:28:33,361][25689] Fps is (10 sec: 5590.4, 60 sec: 5590.4, 300 sec: 5562.2). Total num frames: 1260470272. Throughput: 0: 5835.0. Samples: 1260471584. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:33,362][25689] Avg episode reward: [(0, '0.283')] [2022-07-11 14:28:35,312][26022] Updated weights on worker 0-0, policy_version 1230938 (0.00092) [2022-07-11 14:28:37,113][26022] Updated weights on worker 0-0, policy_version 1230948 (0.00058) [2022-07-11 14:28:38,368][25689] Fps is (10 sec: 5608.9, 60 sec: 5558.0, 300 sec: 5559.5). Total num frames: 1260496896. Throughput: 0: 5845.3. Samples: 1260505282. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2022-07-11 14:28:38,368][25689] Avg episode reward: [(0, '0.143')] [2022-07-11 14:28:38,857][26022] Updated weights on worker 0-0, policy_version 1230958 (0.00086) [2022-07-11 14:28:40,650][26022] Updated weights on worker 0-0, policy_version 1230968 (0.00089) [2022-07-11 14:28:42,782][26022] Updated weights on worker 0-0, policy_version 1230978 (0.00083) [2022-07-11 14:28:43,449][25689] Fps is (10 sec: 5583.0, 60 sec: 5575.4, 300 sec: 5562.8). Total num frames: 1260526592. Throughput: 0: 4999.3. Samples: 1260522032. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:28:43,450][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 14:28:44,287][26022] Updated weights on worker 0-0, policy_version 1230988 (0.00083) [2022-07-11 14:28:46,363][26022] Updated weights on worker 0-0, policy_version 1230998 (0.00080) [2022-07-11 14:28:47,887][26022] Updated weights on worker 0-0, policy_version 1231008 (0.00088) [2022-07-11 14:28:48,463][25689] Fps is (10 sec: 5578.7, 60 sec: 5578.8, 300 sec: 5553.1). Total num frames: 1260553216. Throughput: 0: 5850.1. Samples: 1260555864. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:28:48,464][25689] Avg episode reward: [(0, '0.348')] [2022-07-11 14:28:49,914][26022] Updated weights on worker 0-0, policy_version 1231018 (0.00092) [2022-07-11 14:28:51,742][26022] Updated weights on worker 0-0, policy_version 1231028 (0.00103) [2022-07-11 14:28:53,456][26022] Updated weights on worker 0-0, policy_version 1231038 (0.00091) [2022-07-11 14:28:53,482][25689] Fps is (10 sec: 5613.4, 60 sec: 5578.0, 300 sec: 5563.3). Total num frames: 1260582912. Throughput: 0: 5851.7. Samples: 1260589326. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:28:53,483][25689] Avg episode reward: [(0, '0.656')] [2022-07-11 14:28:55,484][26022] Updated weights on worker 0-0, policy_version 1231048 (0.00100) [2022-07-11 14:28:57,223][26022] Updated weights on worker 0-0, policy_version 1231058 (0.00091) [2022-07-11 14:28:58,490][25689] Fps is (10 sec: 5719.1, 60 sec: 5581.2, 300 sec: 5561.8). Total num frames: 1260610560. Throughput: 0: 5028.2. Samples: 1260606460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:28:58,491][25689] Avg episode reward: [(0, '0.739')] [2022-07-11 14:28:58,693][26022] Updated weights on worker 0-0, policy_version 1231068 (0.00090) [2022-07-11 14:29:00,924][26022] Updated weights on worker 0-0, policy_version 1231078 (0.00093) [2022-07-11 14:29:02,828][26022] Updated weights on worker 0-0, policy_version 1231088 (0.00091) [2022-07-11 14:29:03,567][25689] Fps is (10 sec: 5280.1, 60 sec: 5587.8, 300 sec: 5554.2). Total num frames: 1260636160. Throughput: 0: 5829.3. Samples: 1260639306. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:03,569][25689] Avg episode reward: [(0, '1.010')] [2022-07-11 14:29:04,818][26022] Updated weights on worker 0-0, policy_version 1231098 (0.00096) [2022-07-11 14:29:06,656][26022] Updated weights on worker 0-0, policy_version 1231108 (0.00092) [2022-07-11 14:29:08,313][26022] Updated weights on worker 0-0, policy_version 1231118 (0.00092) [2022-07-11 14:29:08,596][25689] Fps is (10 sec: 5471.6, 60 sec: 5588.0, 300 sec: 5564.6). Total num frames: 1260665856. Throughput: 0: 5774.3. Samples: 1260672116. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:08,597][25689] Avg episode reward: [(0, '1.378')] [2022-07-11 14:29:10,337][26022] Updated weights on worker 0-0, policy_version 1231128 (0.00083) [2022-07-11 14:29:12,046][26022] Updated weights on worker 0-0, policy_version 1231138 (0.00087) [2022-07-11 14:29:13,611][25689] Fps is (10 sec: 5709.5, 60 sec: 5574.7, 300 sec: 5561.1). Total num frames: 1260693504. Throughput: 0: 4960.9. Samples: 1260689180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:13,611][25689] Avg episode reward: [(0, '1.068')] [2022-07-11 14:29:13,758][26022] Updated weights on worker 0-0, policy_version 1231148 (0.00085) [2022-07-11 14:29:15,884][26022] Updated weights on worker 0-0, policy_version 1231158 (0.00091) [2022-07-11 14:29:17,341][26022] Updated weights on worker 0-0, policy_version 1231168 (0.00091) [2022-07-11 14:29:18,629][25689] Fps is (10 sec: 5613.3, 60 sec: 5593.7, 300 sec: 5558.8). Total num frames: 1260722176. Throughput: 0: 5791.6. Samples: 1260723098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:18,630][25689] Avg episode reward: [(0, '2.088')] [2022-07-11 14:29:19,406][26022] Updated weights on worker 0-0, policy_version 1231178 (0.00084) [2022-07-11 14:29:21,078][26022] Updated weights on worker 0-0, policy_version 1231188 (0.00089) [2022-07-11 14:29:22,931][26022] Updated weights on worker 0-0, policy_version 1231198 (0.00090) [2022-07-11 14:29:23,699][25689] Fps is (10 sec: 5684.2, 60 sec: 5576.6, 300 sec: 5572.9). Total num frames: 1260750848. Throughput: 0: 5847.0. Samples: 1260757018. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:23,699][25689] Avg episode reward: [(0, '2.231')] [2022-07-11 14:29:24,792][26022] Updated weights on worker 0-0, policy_version 1231208 (0.00084) [2022-07-11 14:29:26,693][26022] Updated weights on worker 0-0, policy_version 1231218 (0.00087) [2022-07-11 14:29:28,547][26022] Updated weights on worker 0-0, policy_version 1231228 (0.00089) [2022-07-11 14:29:28,759][25689] Fps is (10 sec: 5559.9, 60 sec: 5591.7, 300 sec: 5565.1). Total num frames: 1260778496. Throughput: 0: 5046.1. Samples: 1260773860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:28,760][25689] Avg episode reward: [(0, '1.874')] [2022-07-11 14:29:30,403][26022] Updated weights on worker 0-0, policy_version 1231238 (0.00089) [2022-07-11 14:29:32,138][26022] Updated weights on worker 0-0, policy_version 1231248 (0.00082) [2022-07-11 14:29:33,790][25689] Fps is (10 sec: 5479.6, 60 sec: 5558.1, 300 sec: 5561.9). Total num frames: 1260806144. Throughput: 0: 5840.8. Samples: 1260807044. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:33,791][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 14:29:34,115][26022] Updated weights on worker 0-0, policy_version 1231258 (0.00086) [2022-07-11 14:29:35,753][26022] Updated weights on worker 0-0, policy_version 1231268 (0.00082) [2022-07-11 14:29:37,493][26022] Updated weights on worker 0-0, policy_version 1231278 (0.00070) [2022-07-11 14:29:38,792][25689] Fps is (10 sec: 5613.7, 60 sec: 5592.5, 300 sec: 5566.5). Total num frames: 1260834816. Throughput: 0: 5853.0. Samples: 1260841108. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:38,792][25689] Avg episode reward: [(0, '0.607')] [2022-07-11 14:29:39,408][26022] Updated weights on worker 0-0, policy_version 1231288 (0.00086) [2022-07-11 14:29:41,419][26022] Updated weights on worker 0-0, policy_version 1231298 (0.00083) [2022-07-11 14:29:43,017][26022] Updated weights on worker 0-0, policy_version 1231308 (0.00086) [2022-07-11 14:29:43,913][25689] Fps is (10 sec: 5563.9, 60 sec: 5555.0, 300 sec: 5557.8). Total num frames: 1260862464. Throughput: 0: 4991.9. Samples: 1260857922. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:43,913][25689] Avg episode reward: [(0, '0.715')] [2022-07-11 14:29:44,840][26022] Updated weights on worker 0-0, policy_version 1231318 (0.00082) [2022-07-11 14:29:46,751][26022] Updated weights on worker 0-0, policy_version 1231328 (0.00092) [2022-07-11 14:29:48,488][26022] Updated weights on worker 0-0, policy_version 1231338 (0.00084) [2022-07-11 14:29:48,937][25689] Fps is (10 sec: 5551.7, 60 sec: 5587.9, 300 sec: 5565.9). Total num frames: 1260891136. Throughput: 0: 5830.8. Samples: 1260891510. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:48,937][25689] Avg episode reward: [(0, '-0.208')] [2022-07-11 14:29:50,579][26022] Updated weights on worker 0-0, policy_version 1231348 (0.00090) [2022-07-11 14:29:52,113][26022] Updated weights on worker 0-0, policy_version 1231358 (0.00091) [2022-07-11 14:29:54,027][25689] Fps is (10 sec: 5669.5, 60 sec: 5564.4, 300 sec: 5560.8). Total num frames: 1260919808. Throughput: 0: 5839.3. Samples: 1260925214. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:54,028][25689] Avg episode reward: [(0, '-0.445')] [2022-07-11 14:29:54,067][26022] Updated weights on worker 0-0, policy_version 1231368 (0.00091) [2022-07-11 14:29:55,901][26022] Updated weights on worker 0-0, policy_version 1231378 (0.00087) [2022-07-11 14:29:57,687][26022] Updated weights on worker 0-0, policy_version 1231388 (0.00089) [2022-07-11 14:29:59,079][25689] Fps is (10 sec: 5654.0, 60 sec: 5577.3, 300 sec: 5565.4). Total num frames: 1260948480. Throughput: 0: 4975.1. Samples: 1260942038. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:29:59,079][25689] Avg episode reward: [(0, '0.825')] [2022-07-11 14:29:59,525][26022] Updated weights on worker 0-0, policy_version 1231398 (0.00086) [2022-07-11 14:30:01,449][26022] Updated weights on worker 0-0, policy_version 1231408 (0.00089) [2022-07-11 14:30:03,664][26022] Updated weights on worker 0-0, policy_version 1231418 (0.00087) [2022-07-11 14:30:04,209][25689] Fps is (10 sec: 5330.0, 60 sec: 5572.3, 300 sec: 5560.0). Total num frames: 1260974080. Throughput: 0: 5695.9. Samples: 1260973530. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:04,210][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 14:30:05,567][26022] Updated weights on worker 0-0, policy_version 1231428 (0.00102) [2022-07-11 14:30:07,345][26022] Updated weights on worker 0-0, policy_version 1231438 (0.00083) [2022-07-11 14:30:09,114][26022] Updated weights on worker 0-0, policy_version 1231448 (0.00087) [2022-07-11 14:30:09,247][25689] Fps is (10 sec: 5337.3, 60 sec: 5554.7, 300 sec: 5563.3). Total num frames: 1261002752. Throughput: 0: 5664.9. Samples: 1261006568. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:09,248][25689] Avg episode reward: [(0, '0.929')] [2022-07-11 14:30:11,169][26022] Updated weights on worker 0-0, policy_version 1231458 (0.00083) [2022-07-11 14:30:12,779][26022] Updated weights on worker 0-0, policy_version 1231468 (0.00094) [2022-07-11 14:30:13,977][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:30:13,987][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001231473_1261028352.pth [2022-07-11 14:30:13,988][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001229517_1259025408.pth [2022-07-11 14:30:14,313][25689] Fps is (10 sec: 5574.5, 60 sec: 5550.0, 300 sec: 5559.3). Total num frames: 1261030400. Throughput: 0: 5667.1. Samples: 1261040174. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:14,313][25689] Avg episode reward: [(0, '1.685')] [2022-07-11 14:30:14,703][26022] Updated weights on worker 0-0, policy_version 1231478 (0.00087) [2022-07-11 14:30:16,523][26022] Updated weights on worker 0-0, policy_version 1231488 (0.00055) [2022-07-11 14:30:18,289][26022] Updated weights on worker 0-0, policy_version 1231498 (0.00086) [2022-07-11 14:30:19,367][25689] Fps is (10 sec: 5666.7, 60 sec: 5563.6, 300 sec: 5566.3). Total num frames: 1261060096. Throughput: 0: 5672.0. Samples: 1261057110. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:19,367][25689] Avg episode reward: [(0, '1.929')] [2022-07-11 14:30:20,270][26022] Updated weights on worker 0-0, policy_version 1231508 (0.00091) [2022-07-11 14:30:21,871][26022] Updated weights on worker 0-0, policy_version 1231518 (0.00095) [2022-07-11 14:30:23,846][26022] Updated weights on worker 0-0, policy_version 1231528 (0.00086) [2022-07-11 14:30:24,435][25689] Fps is (10 sec: 5664.8, 60 sec: 5546.8, 300 sec: 5562.3). Total num frames: 1261087744. Throughput: 0: 5798.0. Samples: 1261090800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:24,436][25689] Avg episode reward: [(0, '1.729')] [2022-07-11 14:30:25,573][26022] Updated weights on worker 0-0, policy_version 1231538 (0.00092) [2022-07-11 14:30:27,467][26022] Updated weights on worker 0-0, policy_version 1231548 (0.00086) [2022-07-11 14:30:29,371][26022] Updated weights on worker 0-0, policy_version 1231558 (0.00092) [2022-07-11 14:30:29,469][25689] Fps is (10 sec: 5473.8, 60 sec: 5549.3, 300 sec: 5560.0). Total num frames: 1261115392. Throughput: 0: 5828.8. Samples: 1261124434. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:29,469][25689] Avg episode reward: [(0, '1.985')] [2022-07-11 14:30:31,084][26022] Updated weights on worker 0-0, policy_version 1231568 (0.00085) [2022-07-11 14:30:32,979][26022] Updated weights on worker 0-0, policy_version 1231578 (0.00091) [2022-07-11 14:30:34,515][25689] Fps is (10 sec: 5587.7, 60 sec: 5564.8, 300 sec: 5560.4). Total num frames: 1261144064. Throughput: 0: 5009.0. Samples: 1261141366. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:34,515][25689] Avg episode reward: [(0, '1.781')] [2022-07-11 14:30:34,780][26022] Updated weights on worker 0-0, policy_version 1231588 (0.00092) [2022-07-11 14:30:36,719][26022] Updated weights on worker 0-0, policy_version 1231598 (0.00083) [2022-07-11 14:30:38,351][26022] Updated weights on worker 0-0, policy_version 1231608 (0.00092) [2022-07-11 14:30:39,517][25689] Fps is (10 sec: 5604.7, 60 sec: 5547.8, 300 sec: 5561.3). Total num frames: 1261171712. Throughput: 0: 5835.9. Samples: 1261174708. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:39,518][25689] Avg episode reward: [(0, '1.574')] [2022-07-11 14:30:40,261][26022] Updated weights on worker 0-0, policy_version 1231618 (0.00095) [2022-07-11 14:30:42,102][26022] Updated weights on worker 0-0, policy_version 1231628 (0.00107) [2022-07-11 14:30:43,864][26022] Updated weights on worker 0-0, policy_version 1231638 (0.00080) [2022-07-11 14:30:44,575][25689] Fps is (10 sec: 5598.4, 60 sec: 5570.5, 300 sec: 5561.5). Total num frames: 1261200384. Throughput: 0: 5860.5. Samples: 1261208826. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:44,575][25689] Avg episode reward: [(0, '1.504')] [2022-07-11 14:30:45,692][26022] Updated weights on worker 0-0, policy_version 1231648 (0.00090) [2022-07-11 14:30:47,562][26022] Updated weights on worker 0-0, policy_version 1231658 (0.00094) [2022-07-11 14:30:49,264][26022] Updated weights on worker 0-0, policy_version 1231668 (0.00095) [2022-07-11 14:30:49,589][25689] Fps is (10 sec: 5795.5, 60 sec: 5588.3, 300 sec: 5572.0). Total num frames: 1261230080. Throughput: 0: 5036.6. Samples: 1261225770. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:49,589][25689] Avg episode reward: [(0, '1.537')] [2022-07-11 14:30:51,376][26022] Updated weights on worker 0-0, policy_version 1231678 (0.00086) [2022-07-11 14:30:52,883][26022] Updated weights on worker 0-0, policy_version 1231688 (0.00085) [2022-07-11 14:30:54,593][25689] Fps is (10 sec: 5519.4, 60 sec: 5545.5, 300 sec: 5558.4). Total num frames: 1261255680. Throughput: 0: 5890.5. Samples: 1261259638. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:54,594][25689] Avg episode reward: [(0, '1.396')] [2022-07-11 14:30:54,883][26022] Updated weights on worker 0-0, policy_version 1231698 (0.00092) [2022-07-11 14:30:56,420][26022] Updated weights on worker 0-0, policy_version 1231708 (0.00434) [2022-07-11 14:30:58,517][26022] Updated weights on worker 0-0, policy_version 1231718 (0.00091) [2022-07-11 14:30:59,611][25689] Fps is (10 sec: 5619.6, 60 sec: 5582.5, 300 sec: 5576.6). Total num frames: 1261286400. Throughput: 0: 5921.1. Samples: 1261293682. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:30:59,611][25689] Avg episode reward: [(0, '0.468')] [2022-07-11 14:30:59,986][26022] Updated weights on worker 0-0, policy_version 1231728 (0.00086) [2022-07-11 14:31:02,508][26022] Updated weights on worker 0-0, policy_version 1231738 (0.00078) [2022-07-11 14:31:04,163][26022] Updated weights on worker 0-0, policy_version 1231748 (0.00084) [2022-07-11 14:31:04,685][25689] Fps is (10 sec: 5580.6, 60 sec: 5587.7, 300 sec: 5565.8). Total num frames: 1261312000. Throughput: 0: 4950.3. Samples: 1261308380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:04,686][25689] Avg episode reward: [(0, '0.501')] [2022-07-11 14:31:06,195][26022] Updated weights on worker 0-0, policy_version 1231758 (0.00085) [2022-07-11 14:31:07,747][26022] Updated weights on worker 0-0, policy_version 1231768 (0.00085) [2022-07-11 14:31:09,706][25689] Fps is (10 sec: 5274.3, 60 sec: 5572.3, 300 sec: 5569.1). Total num frames: 1261339648. Throughput: 0: 5774.9. Samples: 1261341948. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:09,707][25689] Avg episode reward: [(0, '-0.216')] [2022-07-11 14:31:09,749][26022] Updated weights on worker 0-0, policy_version 1231778 (0.00086) [2022-07-11 14:31:11,594][26022] Updated weights on worker 0-0, policy_version 1231788 (0.00095) [2022-07-11 14:31:13,335][26022] Updated weights on worker 0-0, policy_version 1231798 (0.00083) [2022-07-11 14:31:14,715][25689] Fps is (10 sec: 5615.3, 60 sec: 5594.5, 300 sec: 5572.5). Total num frames: 1261368320. Throughput: 0: 5768.0. Samples: 1261375700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:14,715][25689] Avg episode reward: [(0, '0.192')] [2022-07-11 14:31:15,251][26022] Updated weights on worker 0-0, policy_version 1231808 (0.00087) [2022-07-11 14:31:17,045][26022] Updated weights on worker 0-0, policy_version 1231818 (0.00094) [2022-07-11 14:31:18,681][26022] Updated weights on worker 0-0, policy_version 1231828 (0.00086) [2022-07-11 14:31:19,735][25689] Fps is (10 sec: 5616.1, 60 sec: 5563.7, 300 sec: 5567.4). Total num frames: 1261395968. Throughput: 0: 4916.4. Samples: 1261392620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:19,735][25689] Avg episode reward: [(0, '0.280')] [2022-07-11 14:31:20,910][26022] Updated weights on worker 0-0, policy_version 1231838 (0.00097) [2022-07-11 14:31:22,409][26022] Updated weights on worker 0-0, policy_version 1231848 (0.00093) [2022-07-11 14:31:24,668][26022] Updated weights on worker 0-0, policy_version 1231858 (0.00086) [2022-07-11 14:31:24,782][25689] Fps is (10 sec: 5391.3, 60 sec: 5548.8, 300 sec: 5567.0). Total num frames: 1261422592. Throughput: 0: 5853.1. Samples: 1261426006. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:24,783][25689] Avg episode reward: [(0, '0.117')] [2022-07-11 14:31:26,061][26022] Updated weights on worker 0-0, policy_version 1231868 (0.00088) [2022-07-11 14:31:28,283][26022] Updated weights on worker 0-0, policy_version 1231878 (0.00083) [2022-07-11 14:31:29,852][25689] Fps is (10 sec: 5566.4, 60 sec: 5579.2, 300 sec: 5569.8). Total num frames: 1261452288. Throughput: 0: 5826.3. Samples: 1261459324. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:29,854][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 14:31:29,952][26022] Updated weights on worker 0-0, policy_version 1231888 (0.00095) [2022-07-11 14:31:31,681][26022] Updated weights on worker 0-0, policy_version 1231898 (0.00086) [2022-07-11 14:31:33,822][26022] Updated weights on worker 0-0, policy_version 1231908 (0.00093) [2022-07-11 14:31:34,940][25689] Fps is (10 sec: 5846.8, 60 sec: 5592.4, 300 sec: 5579.0). Total num frames: 1261481984. Throughput: 0: 4962.4. Samples: 1261476062. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:34,941][25689] Avg episode reward: [(0, '1.582')] [2022-07-11 14:31:35,300][26022] Updated weights on worker 0-0, policy_version 1231918 (0.00091) [2022-07-11 14:31:37,412][26022] Updated weights on worker 0-0, policy_version 1231928 (0.00091) [2022-07-11 14:31:39,219][26022] Updated weights on worker 0-0, policy_version 1231938 (0.00082) [2022-07-11 14:31:39,986][25689] Fps is (10 sec: 5557.6, 60 sec: 5571.4, 300 sec: 5569.3). Total num frames: 1261508608. Throughput: 0: 5769.4. Samples: 1261509460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:39,987][25689] Avg episode reward: [(0, '1.252')] [2022-07-11 14:31:40,986][26022] Updated weights on worker 0-0, policy_version 1231948 (0.00085) [2022-07-11 14:31:42,874][26022] Updated weights on worker 0-0, policy_version 1231958 (0.00087) [2022-07-11 14:31:44,600][26022] Updated weights on worker 0-0, policy_version 1231968 (0.00100) [2022-07-11 14:31:45,052][25689] Fps is (10 sec: 5468.0, 60 sec: 5570.6, 300 sec: 5575.3). Total num frames: 1261537280. Throughput: 0: 5794.8. Samples: 1261543470. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:45,053][25689] Avg episode reward: [(0, '1.156')] [2022-07-11 14:31:46,396][26022] Updated weights on worker 0-0, policy_version 1231978 (0.00081) [2022-07-11 14:31:48,217][26022] Updated weights on worker 0-0, policy_version 1231988 (0.00089) [2022-07-11 14:31:50,063][25689] Fps is (10 sec: 5589.1, 60 sec: 5537.0, 300 sec: 5572.0). Total num frames: 1261564928. Throughput: 0: 5008.8. Samples: 1261560554. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:50,063][25689] Avg episode reward: [(0, '1.029')] [2022-07-11 14:31:50,147][26022] Updated weights on worker 0-0, policy_version 1231998 (0.00092) [2022-07-11 14:31:51,999][26022] Updated weights on worker 0-0, policy_version 1232008 (0.00114) [2022-07-11 14:31:53,751][26022] Updated weights on worker 0-0, policy_version 1232018 (0.00086) [2022-07-11 14:31:55,070][25689] Fps is (10 sec: 5622.2, 60 sec: 5587.6, 300 sec: 5576.3). Total num frames: 1261593600. Throughput: 0: 5856.9. Samples: 1261593962. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:31:55,070][25689] Avg episode reward: [(0, '1.847')] [2022-07-11 14:31:55,638][26022] Updated weights on worker 0-0, policy_version 1232028 (0.00090) [2022-07-11 14:31:57,283][26022] Updated weights on worker 0-0, policy_version 1232038 (0.00086) [2022-07-11 14:31:59,260][26022] Updated weights on worker 0-0, policy_version 1232048 (0.00085) [2022-07-11 14:32:00,136][25689] Fps is (10 sec: 5692.9, 60 sec: 5549.3, 300 sec: 5580.6). Total num frames: 1261622272. Throughput: 0: 5887.8. Samples: 1261628098. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:00,138][25689] Avg episode reward: [(0, '1.196')] [2022-07-11 14:32:00,903][26022] Updated weights on worker 0-0, policy_version 1232058 (0.00085) [2022-07-11 14:32:03,118][26022] Updated weights on worker 0-0, policy_version 1232068 (0.00092) [2022-07-11 14:32:05,016][26022] Updated weights on worker 0-0, policy_version 1232078 (0.00080) [2022-07-11 14:32:05,200][25689] Fps is (10 sec: 5559.6, 60 sec: 5584.1, 300 sec: 5579.6). Total num frames: 1261649920. Throughput: 0: 4936.1. Samples: 1261642920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:05,201][25689] Avg episode reward: [(0, '1.014')] [2022-07-11 14:32:06,774][26022] Updated weights on worker 0-0, policy_version 1232088 (0.00090) [2022-07-11 14:32:08,787][26022] Updated weights on worker 0-0, policy_version 1232098 (0.00093) [2022-07-11 14:32:10,261][25689] Fps is (10 sec: 5360.0, 60 sec: 5563.5, 300 sec: 5573.1). Total num frames: 1261676544. Throughput: 0: 5739.9. Samples: 1261676490. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:10,262][25689] Avg episode reward: [(0, '1.037')] [2022-07-11 14:32:10,507][26022] Updated weights on worker 0-0, policy_version 1232108 (0.00083) [2022-07-11 14:32:12,347][26022] Updated weights on worker 0-0, policy_version 1232118 (0.00083) [2022-07-11 14:32:14,052][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:32:14,061][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001232127_1261698048.pth [2022-07-11 14:32:14,065][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001230169_1259693056.pth [2022-07-11 14:32:14,247][26022] Updated weights on worker 0-0, policy_version 1232128 (0.00087) [2022-07-11 14:32:15,315][25689] Fps is (10 sec: 5567.9, 60 sec: 5576.2, 300 sec: 5576.1). Total num frames: 1261706240. Throughput: 0: 5734.4. Samples: 1261710058. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:15,316][25689] Avg episode reward: [(0, '0.967')] [2022-07-11 14:32:15,885][26022] Updated weights on worker 0-0, policy_version 1232138 (0.00111) [2022-07-11 14:32:17,847][26022] Updated weights on worker 0-0, policy_version 1232148 (0.00084) [2022-07-11 14:32:19,505][26022] Updated weights on worker 0-0, policy_version 1232158 (0.00090) [2022-07-11 14:32:20,372][25689] Fps is (10 sec: 5671.4, 60 sec: 5572.8, 300 sec: 5573.3). Total num frames: 1261733888. Throughput: 0: 4889.3. Samples: 1261727040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:20,373][25689] Avg episode reward: [(0, '0.508')] [2022-07-11 14:32:21,464][26022] Updated weights on worker 0-0, policy_version 1232168 (0.00090) [2022-07-11 14:32:23,307][26022] Updated weights on worker 0-0, policy_version 1232178 (0.00089) [2022-07-11 14:32:25,098][26022] Updated weights on worker 0-0, policy_version 1232188 (0.00091) [2022-07-11 14:32:25,475][25689] Fps is (10 sec: 5644.5, 60 sec: 5618.3, 300 sec: 5578.3). Total num frames: 1261763584. Throughput: 0: 5810.8. Samples: 1261760732. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:25,475][25689] Avg episode reward: [(0, '1.448')] [2022-07-11 14:32:27,019][26022] Updated weights on worker 0-0, policy_version 1232198 (0.00089) [2022-07-11 14:32:28,598][26022] Updated weights on worker 0-0, policy_version 1232208 (0.00089) [2022-07-11 14:32:30,561][25689] Fps is (10 sec: 5527.8, 60 sec: 5566.3, 300 sec: 5569.9). Total num frames: 1261790208. Throughput: 0: 5803.0. Samples: 1261794288. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:30,561][25689] Avg episode reward: [(0, '1.272')] [2022-07-11 14:32:30,721][26022] Updated weights on worker 0-0, policy_version 1232218 (0.00092) [2022-07-11 14:32:32,332][26022] Updated weights on worker 0-0, policy_version 1232228 (0.00092) [2022-07-11 14:32:34,419][26022] Updated weights on worker 0-0, policy_version 1232238 (0.00084) [2022-07-11 14:32:35,650][25689] Fps is (10 sec: 5534.9, 60 sec: 5566.1, 300 sec: 5572.1). Total num frames: 1261819904. Throughput: 0: 5793.3. Samples: 1261827862. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:35,651][25689] Avg episode reward: [(0, '1.379')] [2022-07-11 14:32:36,046][26022] Updated weights on worker 0-0, policy_version 1232248 (0.00088) [2022-07-11 14:32:38,027][26022] Updated weights on worker 0-0, policy_version 1232258 (0.00089) [2022-07-11 14:32:39,690][26022] Updated weights on worker 0-0, policy_version 1232268 (0.00086) [2022-07-11 14:32:40,679][25689] Fps is (10 sec: 5667.1, 60 sec: 5584.5, 300 sec: 5569.7). Total num frames: 1261847552. Throughput: 0: 5791.1. Samples: 1261844640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2022-07-11 14:32:40,680][25689] Avg episode reward: [(0, '1.704')] [2022-07-11 14:32:41,663][26022] Updated weights on worker 0-0, policy_version 1232278 (0.00090) [2022-07-11 14:32:43,493][26022] Updated weights on worker 0-0, policy_version 1232288 (0.00093) [2022-07-11 14:32:45,320][26022] Updated weights on worker 0-0, policy_version 1232298 (0.00088) [2022-07-11 14:32:45,778][25689] Fps is (10 sec: 5459.7, 60 sec: 5564.7, 300 sec: 5572.3). Total num frames: 1261875200. Throughput: 0: 5794.0. Samples: 1261878370. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:32:45,779][25689] Avg episode reward: [(0, '2.245')] [2022-07-11 14:32:47,121][26022] Updated weights on worker 0-0, policy_version 1232308 (0.00086) [2022-07-11 14:32:48,825][26022] Updated weights on worker 0-0, policy_version 1232318 (0.00093) [2022-07-11 14:32:50,781][26022] Updated weights on worker 0-0, policy_version 1232328 (0.00082) [2022-07-11 14:32:50,814][25689] Fps is (10 sec: 5557.4, 60 sec: 5579.3, 300 sec: 5568.3). Total num frames: 1261903872. Throughput: 0: 5807.3. Samples: 1261911902. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:32:50,814][25689] Avg episode reward: [(0, '2.165')] [2022-07-11 14:32:52,549][26022] Updated weights on worker 0-0, policy_version 1232338 (0.00089) [2022-07-11 14:32:54,463][26022] Updated weights on worker 0-0, policy_version 1232348 (0.00057) [2022-07-11 14:32:55,835][25689] Fps is (10 sec: 5702.1, 60 sec: 5578.0, 300 sec: 5572.2). Total num frames: 1261932544. Throughput: 0: 5006.7. Samples: 1261928920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:32:55,835][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 14:32:56,343][26022] Updated weights on worker 0-0, policy_version 1232358 (0.00087) [2022-07-11 14:32:58,003][26022] Updated weights on worker 0-0, policy_version 1232368 (0.00094) [2022-07-11 14:32:59,996][26022] Updated weights on worker 0-0, policy_version 1232378 (0.00088) [2022-07-11 14:33:00,842][25689] Fps is (10 sec: 5615.9, 60 sec: 5566.5, 300 sec: 5581.7). Total num frames: 1261960192. Throughput: 0: 5871.8. Samples: 1261963032. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:00,843][25689] Avg episode reward: [(0, '1.244')] [2022-07-11 14:33:01,644][26022] Updated weights on worker 0-0, policy_version 1232388 (0.00091) [2022-07-11 14:33:03,865][26022] Updated weights on worker 0-0, policy_version 1232398 (0.00087) [2022-07-11 14:33:05,546][26022] Updated weights on worker 0-0, policy_version 1232408 (0.00082) [2022-07-11 14:33:05,946][25689] Fps is (10 sec: 5468.5, 60 sec: 5562.8, 300 sec: 5573.5). Total num frames: 1261987840. Throughput: 0: 5774.1. Samples: 1261994824. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:05,947][25689] Avg episode reward: [(0, '-0.512')] [2022-07-11 14:33:07,371][26022] Updated weights on worker 0-0, policy_version 1232418 (0.00084) [2022-07-11 14:33:09,326][26022] Updated weights on worker 0-0, policy_version 1232428 (0.00086) [2022-07-11 14:33:10,953][25689] Fps is (10 sec: 5367.7, 60 sec: 5567.8, 300 sec: 5567.5). Total num frames: 1262014464. Throughput: 0: 4940.8. Samples: 1262011408. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:10,954][25689] Avg episode reward: [(0, '-1.126')] [2022-07-11 14:33:11,160][26022] Updated weights on worker 0-0, policy_version 1232438 (0.00090) [2022-07-11 14:33:12,955][26022] Updated weights on worker 0-0, policy_version 1232448 (0.00096) [2022-07-11 14:33:14,859][26022] Updated weights on worker 0-0, policy_version 1232458 (0.00089) [2022-07-11 14:33:15,967][25689] Fps is (10 sec: 5518.5, 60 sec: 5554.6, 300 sec: 5571.4). Total num frames: 1262043136. Throughput: 0: 5787.5. Samples: 1262045434. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:15,967][25689] Avg episode reward: [(0, '-1.401')] [2022-07-11 14:33:16,456][26022] Updated weights on worker 0-0, policy_version 1232468 (0.00089) [2022-07-11 14:33:18,500][26022] Updated weights on worker 0-0, policy_version 1232478 (0.00093) [2022-07-11 14:33:20,156][26022] Updated weights on worker 0-0, policy_version 1232488 (0.00087) [2022-07-11 14:33:20,972][25689] Fps is (10 sec: 5723.8, 60 sec: 5576.3, 300 sec: 5569.2). Total num frames: 1262071808. Throughput: 0: 5779.6. Samples: 1262079372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:20,972][25689] Avg episode reward: [(0, '-1.222')] [2022-07-11 14:33:22,091][26022] Updated weights on worker 0-0, policy_version 1232498 (0.00093) [2022-07-11 14:33:23,838][26022] Updated weights on worker 0-0, policy_version 1232508 (0.00086) [2022-07-11 14:33:25,786][26022] Updated weights on worker 0-0, policy_version 1232518 (0.00090) [2022-07-11 14:33:26,025][25689] Fps is (10 sec: 5599.2, 60 sec: 5547.0, 300 sec: 5572.3). Total num frames: 1262099456. Throughput: 0: 5043.4. Samples: 1262096090. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:26,026][25689] Avg episode reward: [(0, '-1.215')] [2022-07-11 14:33:27,515][26022] Updated weights on worker 0-0, policy_version 1232528 (0.01156) [2022-07-11 14:33:29,525][26022] Updated weights on worker 0-0, policy_version 1232538 (0.00089) [2022-07-11 14:33:30,983][26022] Updated weights on worker 0-0, policy_version 1232548 (0.00084) [2022-07-11 14:33:31,090][25689] Fps is (10 sec: 5667.0, 60 sec: 5599.6, 300 sec: 5571.8). Total num frames: 1262129152. Throughput: 0: 5874.2. Samples: 1262129698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:31,091][25689] Avg episode reward: [(0, '-0.324')] [2022-07-11 14:33:33,100][26022] Updated weights on worker 0-0, policy_version 1232558 (0.00088) [2022-07-11 14:33:34,819][26022] Updated weights on worker 0-0, policy_version 1232568 (0.00088) [2022-07-11 14:33:36,092][25689] Fps is (10 sec: 5594.4, 60 sec: 5556.9, 300 sec: 5571.9). Total num frames: 1262155776. Throughput: 0: 5860.2. Samples: 1262163376. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:36,093][25689] Avg episode reward: [(0, '1.165')] [2022-07-11 14:33:36,774][26022] Updated weights on worker 0-0, policy_version 1232578 (0.00086) [2022-07-11 14:33:38,518][26022] Updated weights on worker 0-0, policy_version 1232588 (0.00090) [2022-07-11 14:33:40,259][26022] Updated weights on worker 0-0, policy_version 1232598 (0.00502) [2022-07-11 14:33:41,127][25689] Fps is (10 sec: 5407.7, 60 sec: 5556.5, 300 sec: 5565.9). Total num frames: 1262183424. Throughput: 0: 4995.8. Samples: 1262180062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:41,127][25689] Avg episode reward: [(0, '1.275')] [2022-07-11 14:33:42,047][26022] Updated weights on worker 0-0, policy_version 1232608 (0.00091) [2022-07-11 14:33:44,224][26022] Updated weights on worker 0-0, policy_version 1232618 (0.00095) [2022-07-11 14:33:45,796][26022] Updated weights on worker 0-0, policy_version 1232628 (0.00091) [2022-07-11 14:33:46,176][25689] Fps is (10 sec: 5585.1, 60 sec: 5577.9, 300 sec: 5572.1). Total num frames: 1262212096. Throughput: 0: 5844.6. Samples: 1262213866. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:46,178][25689] Avg episode reward: [(0, '1.208')] [2022-07-11 14:33:47,648][26022] Updated weights on worker 0-0, policy_version 1232638 (0.00090) [2022-07-11 14:33:49,518][26022] Updated weights on worker 0-0, policy_version 1232648 (0.00089) [2022-07-11 14:33:51,199][25689] Fps is (10 sec: 5693.3, 60 sec: 5579.1, 300 sec: 5568.6). Total num frames: 1262240768. Throughput: 0: 5851.9. Samples: 1262247370. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:51,199][25689] Avg episode reward: [(0, '1.117')] [2022-07-11 14:33:51,252][26022] Updated weights on worker 0-0, policy_version 1232658 (0.00091) [2022-07-11 14:33:53,353][26022] Updated weights on worker 0-0, policy_version 1232668 (0.00085) [2022-07-11 14:33:54,966][26022] Updated weights on worker 0-0, policy_version 1232678 (0.00088) [2022-07-11 14:33:56,215][25689] Fps is (10 sec: 5406.4, 60 sec: 5528.7, 300 sec: 5561.5). Total num frames: 1262266368. Throughput: 0: 5003.0. Samples: 1262264048. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:33:56,216][25689] Avg episode reward: [(0, '1.434')] [2022-07-11 14:33:56,835][26022] Updated weights on worker 0-0, policy_version 1232688 (0.00084) [2022-07-11 14:33:58,801][26022] Updated weights on worker 0-0, policy_version 1232698 (0.00089) [2022-07-11 14:34:00,488][26022] Updated weights on worker 0-0, policy_version 1232708 (0.00092) [2022-07-11 14:34:01,252][25689] Fps is (10 sec: 5601.9, 60 sec: 5576.8, 300 sec: 5579.5). Total num frames: 1262297088. Throughput: 0: 5844.2. Samples: 1262297682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:01,254][25689] Avg episode reward: [(0, '0.936')] [2022-07-11 14:34:02,848][26022] Updated weights on worker 0-0, policy_version 1232718 (0.00096) [2022-07-11 14:34:04,558][26022] Updated weights on worker 0-0, policy_version 1232728 (0.00092) [2022-07-11 14:34:06,327][25689] Fps is (10 sec: 5569.4, 60 sec: 5545.6, 300 sec: 5564.9). Total num frames: 1262322688. Throughput: 0: 5721.3. Samples: 1262329154. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:06,329][25689] Avg episode reward: [(0, '1.634')] [2022-07-11 14:34:06,355][26022] Updated weights on worker 0-0, policy_version 1232738 (0.00085) [2022-07-11 14:34:08,320][26022] Updated weights on worker 0-0, policy_version 1232748 (0.00094) [2022-07-11 14:34:10,084][26022] Updated weights on worker 0-0, policy_version 1232758 (0.00084) [2022-07-11 14:34:11,333][25689] Fps is (10 sec: 5282.3, 60 sec: 5562.7, 300 sec: 5565.0). Total num frames: 1262350336. Throughput: 0: 4891.8. Samples: 1262345862. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:11,333][25689] Avg episode reward: [(0, '1.610')] [2022-07-11 14:34:11,955][26022] Updated weights on worker 0-0, policy_version 1232768 (0.00088) [2022-07-11 14:34:13,724][26022] Updated weights on worker 0-0, policy_version 1232778 (0.00090) [2022-07-11 14:34:14,187][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:34:14,209][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001232780_1262366720.pth [2022-07-11 14:34:14,210][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001230821_1260360704.pth [2022-07-11 14:34:15,535][26022] Updated weights on worker 0-0, policy_version 1232788 (0.00086) [2022-07-11 14:34:16,359][25689] Fps is (10 sec: 5511.9, 60 sec: 5544.5, 300 sec: 5561.4). Total num frames: 1262377984. Throughput: 0: 5731.9. Samples: 1262379514. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:16,360][25689] Avg episode reward: [(0, '1.201')] [2022-07-11 14:34:17,476][26022] Updated weights on worker 0-0, policy_version 1232798 (0.00094) [2022-07-11 14:34:19,307][26022] Updated weights on worker 0-0, policy_version 1232808 (0.00087) [2022-07-11 14:34:20,947][26022] Updated weights on worker 0-0, policy_version 1232818 (0.00091) [2022-07-11 14:34:21,385][25689] Fps is (10 sec: 5704.5, 60 sec: 5559.5, 300 sec: 5565.7). Total num frames: 1262407680. Throughput: 0: 5753.6. Samples: 1262413518. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:21,386][25689] Avg episode reward: [(0, '1.352')] [2022-07-11 14:34:22,872][26022] Updated weights on worker 0-0, policy_version 1232828 (0.00084) [2022-07-11 14:34:24,695][26022] Updated weights on worker 0-0, policy_version 1232838 (0.00085) [2022-07-11 14:34:26,452][25689] Fps is (10 sec: 5580.1, 60 sec: 5541.4, 300 sec: 5562.1). Total num frames: 1262434304. Throughput: 0: 5024.3. Samples: 1262430268. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:26,453][25689] Avg episode reward: [(0, '1.159')] [2022-07-11 14:34:26,584][26022] Updated weights on worker 0-0, policy_version 1232848 (0.00082) [2022-07-11 14:34:28,385][26022] Updated weights on worker 0-0, policy_version 1232858 (0.00083) [2022-07-11 14:34:30,173][26022] Updated weights on worker 0-0, policy_version 1232868 (0.00085) [2022-07-11 14:34:31,486][25689] Fps is (10 sec: 5575.7, 60 sec: 5544.2, 300 sec: 5569.0). Total num frames: 1262464000. Throughput: 0: 5867.9. Samples: 1262464120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:31,487][25689] Avg episode reward: [(0, '1.532')] [2022-07-11 14:34:31,928][26022] Updated weights on worker 0-0, policy_version 1232878 (0.00088) [2022-07-11 14:34:33,833][26022] Updated weights on worker 0-0, policy_version 1232888 (0.00095) [2022-07-11 14:34:35,631][26022] Updated weights on worker 0-0, policy_version 1232898 (0.00101) [2022-07-11 14:34:36,522][25689] Fps is (10 sec: 5796.5, 60 sec: 5575.0, 300 sec: 5568.3). Total num frames: 1262492672. Throughput: 0: 5867.2. Samples: 1262497810. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:36,522][25689] Avg episode reward: [(0, '1.513')] [2022-07-11 14:34:37,435][26022] Updated weights on worker 0-0, policy_version 1232908 (0.00085) [2022-07-11 14:34:39,189][26022] Updated weights on worker 0-0, policy_version 1232918 (0.00092) [2022-07-11 14:34:41,159][26022] Updated weights on worker 0-0, policy_version 1232928 (0.00089) [2022-07-11 14:34:41,561][25689] Fps is (10 sec: 5590.3, 60 sec: 5574.6, 300 sec: 5569.9). Total num frames: 1262520320. Throughput: 0: 5003.5. Samples: 1262514466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:41,561][25689] Avg episode reward: [(0, '1.652')] [2022-07-11 14:34:43,140][26022] Updated weights on worker 0-0, policy_version 1232938 (0.00085) [2022-07-11 14:34:44,842][26022] Updated weights on worker 0-0, policy_version 1232948 (0.00088) [2022-07-11 14:34:46,635][25689] Fps is (10 sec: 5467.7, 60 sec: 5555.4, 300 sec: 5565.5). Total num frames: 1262547968. Throughput: 0: 5823.8. Samples: 1262547808. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:46,635][25689] Avg episode reward: [(0, '1.722')] [2022-07-11 14:34:46,742][26022] Updated weights on worker 0-0, policy_version 1232958 (0.00088) [2022-07-11 14:34:48,533][26022] Updated weights on worker 0-0, policy_version 1232968 (0.00095) [2022-07-11 14:34:50,419][26022] Updated weights on worker 0-0, policy_version 1232978 (0.00088) [2022-07-11 14:34:51,643][25689] Fps is (10 sec: 5586.2, 60 sec: 5556.7, 300 sec: 5567.0). Total num frames: 1262576640. Throughput: 0: 5834.2. Samples: 1262581718. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:51,643][25689] Avg episode reward: [(0, '1.683')] [2022-07-11 14:34:52,104][26022] Updated weights on worker 0-0, policy_version 1232988 (0.00086) [2022-07-11 14:34:54,056][26022] Updated weights on worker 0-0, policy_version 1232998 (0.00093) [2022-07-11 14:34:55,762][26022] Updated weights on worker 0-0, policy_version 1233008 (0.00084) [2022-07-11 14:34:56,658][25689] Fps is (10 sec: 5619.0, 60 sec: 5590.7, 300 sec: 5564.3). Total num frames: 1262604288. Throughput: 0: 5006.9. Samples: 1262598632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:34:56,659][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 14:34:57,568][26022] Updated weights on worker 0-0, policy_version 1233018 (0.00084) [2022-07-11 14:34:59,455][26022] Updated weights on worker 0-0, policy_version 1233028 (0.00096) [2022-07-11 14:35:01,162][26022] Updated weights on worker 0-0, policy_version 1233038 (0.00087) [2022-07-11 14:35:01,712][25689] Fps is (10 sec: 5491.7, 60 sec: 5538.4, 300 sec: 5572.6). Total num frames: 1262631936. Throughput: 0: 5857.8. Samples: 1262632506. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:01,714][25689] Avg episode reward: [(0, '1.278')] [2022-07-11 14:35:03,439][26022] Updated weights on worker 0-0, policy_version 1233048 (0.00096) [2022-07-11 14:35:05,250][26022] Updated weights on worker 0-0, policy_version 1233058 (0.00089) [2022-07-11 14:35:06,868][25689] Fps is (10 sec: 5415.8, 60 sec: 5564.7, 300 sec: 5566.9). Total num frames: 1262659584. Throughput: 0: 5737.2. Samples: 1262663890. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:06,869][25689] Avg episode reward: [(0, '1.213')] [2022-07-11 14:35:07,272][26022] Updated weights on worker 0-0, policy_version 1233068 (0.00091) [2022-07-11 14:35:08,849][26022] Updated weights on worker 0-0, policy_version 1233078 (0.00078) [2022-07-11 14:35:10,914][26022] Updated weights on worker 0-0, policy_version 1233088 (0.00090) [2022-07-11 14:35:11,907][25689] Fps is (10 sec: 5423.6, 60 sec: 5561.7, 300 sec: 5567.4). Total num frames: 1262687232. Throughput: 0: 5705.9. Samples: 1262697344. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:11,907][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 14:35:12,452][26022] Updated weights on worker 0-0, policy_version 1233098 (0.00086) [2022-07-11 14:35:14,627][26022] Updated weights on worker 0-0, policy_version 1233108 (0.00092) [2022-07-11 14:35:16,214][26022] Updated weights on worker 0-0, policy_version 1233118 (0.00113) [2022-07-11 14:35:16,943][25689] Fps is (10 sec: 5590.2, 60 sec: 5577.7, 300 sec: 5564.3). Total num frames: 1262715904. Throughput: 0: 5694.9. Samples: 1262714152. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:16,944][25689] Avg episode reward: [(0, '0.496')] [2022-07-11 14:35:18,183][26022] Updated weights on worker 0-0, policy_version 1233128 (0.00085) [2022-07-11 14:35:19,975][26022] Updated weights on worker 0-0, policy_version 1233138 (0.00092) [2022-07-11 14:35:21,625][26022] Updated weights on worker 0-0, policy_version 1233148 (0.00084) [2022-07-11 14:35:22,031][25689] Fps is (10 sec: 5664.3, 60 sec: 5555.2, 300 sec: 5567.4). Total num frames: 1262744576. Throughput: 0: 5680.2. Samples: 1262747922. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:22,031][25689] Avg episode reward: [(0, '0.272')] [2022-07-11 14:35:23,477][26022] Updated weights on worker 0-0, policy_version 1233158 (0.00081) [2022-07-11 14:35:25,489][26022] Updated weights on worker 0-0, policy_version 1233168 (0.00086) [2022-07-11 14:35:27,071][25689] Fps is (10 sec: 5662.1, 60 sec: 5591.4, 300 sec: 5570.7). Total num frames: 1262773248. Throughput: 0: 5818.5. Samples: 1262781438. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:27,072][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 14:35:27,095][26022] Updated weights on worker 0-0, policy_version 1233178 (0.00085) [2022-07-11 14:35:29,331][26022] Updated weights on worker 0-0, policy_version 1233188 (0.00086) [2022-07-11 14:35:30,825][26022] Updated weights on worker 0-0, policy_version 1233198 (0.00090) [2022-07-11 14:35:32,099][25689] Fps is (10 sec: 5492.3, 60 sec: 5541.3, 300 sec: 5564.2). Total num frames: 1262799872. Throughput: 0: 5004.0. Samples: 1262798384. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:32,100][25689] Avg episode reward: [(0, '0.397')] [2022-07-11 14:35:32,823][26022] Updated weights on worker 0-0, policy_version 1233208 (0.00096) [2022-07-11 14:35:34,464][26022] Updated weights on worker 0-0, policy_version 1233218 (0.00089) [2022-07-11 14:35:36,395][26022] Updated weights on worker 0-0, policy_version 1233228 (0.00089) [2022-07-11 14:35:37,106][25689] Fps is (10 sec: 5612.0, 60 sec: 5560.7, 300 sec: 5571.0). Total num frames: 1262829568. Throughput: 0: 5837.5. Samples: 1262831856. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:37,108][25689] Avg episode reward: [(0, '-0.158')] [2022-07-11 14:35:38,387][26022] Updated weights on worker 0-0, policy_version 1233238 (0.00096) [2022-07-11 14:35:40,000][26022] Updated weights on worker 0-0, policy_version 1233248 (0.00091) [2022-07-11 14:35:41,980][26022] Updated weights on worker 0-0, policy_version 1233258 (0.00089) [2022-07-11 14:35:42,120][25689] Fps is (10 sec: 5722.0, 60 sec: 5563.0, 300 sec: 5568.3). Total num frames: 1262857216. Throughput: 0: 5854.5. Samples: 1262865536. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:42,121][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 14:35:43,652][26022] Updated weights on worker 0-0, policy_version 1233268 (0.00092) [2022-07-11 14:35:45,512][26022] Updated weights on worker 0-0, policy_version 1233278 (0.00093) [2022-07-11 14:35:47,175][25689] Fps is (10 sec: 5593.5, 60 sec: 5581.7, 300 sec: 5564.1). Total num frames: 1262885888. Throughput: 0: 5022.1. Samples: 1262882404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:47,176][25689] Avg episode reward: [(0, '1.061')] [2022-07-11 14:35:47,314][26022] Updated weights on worker 0-0, policy_version 1233288 (0.00086) [2022-07-11 14:35:49,311][26022] Updated weights on worker 0-0, policy_version 1233298 (0.00086) [2022-07-11 14:35:50,902][26022] Updated weights on worker 0-0, policy_version 1233308 (0.00481) [2022-07-11 14:35:52,199][25689] Fps is (10 sec: 5486.2, 60 sec: 5546.4, 300 sec: 5567.2). Total num frames: 1262912512. Throughput: 0: 5851.0. Samples: 1262915994. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:52,200][25689] Avg episode reward: [(0, '0.961')] [2022-07-11 14:35:52,896][26022] Updated weights on worker 0-0, policy_version 1233318 (0.00089) [2022-07-11 14:35:54,650][26022] Updated weights on worker 0-0, policy_version 1233328 (0.00087) [2022-07-11 14:35:56,540][26022] Updated weights on worker 0-0, policy_version 1233338 (0.00087) [2022-07-11 14:35:57,204][25689] Fps is (10 sec: 5615.7, 60 sec: 5581.2, 300 sec: 5564.0). Total num frames: 1262942208. Throughput: 0: 5845.1. Samples: 1262949330. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:35:57,205][25689] Avg episode reward: [(0, '1.165')] [2022-07-11 14:35:58,453][26022] Updated weights on worker 0-0, policy_version 1233348 (0.00091) [2022-07-11 14:36:00,314][26022] Updated weights on worker 0-0, policy_version 1233358 (0.00088) [2022-07-11 14:36:02,226][25689] Fps is (10 sec: 5412.6, 60 sec: 5533.3, 300 sec: 5561.5). Total num frames: 1262966784. Throughput: 0: 4997.4. Samples: 1262966014. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:02,227][25689] Avg episode reward: [(0, '1.055')] [2022-07-11 14:36:02,595][26022] Updated weights on worker 0-0, policy_version 1233368 (0.00088) [2022-07-11 14:36:04,314][26022] Updated weights on worker 0-0, policy_version 1233378 (0.00080) [2022-07-11 14:36:06,063][26022] Updated weights on worker 0-0, policy_version 1233388 (0.00089) [2022-07-11 14:36:07,280][25689] Fps is (10 sec: 5386.3, 60 sec: 5576.7, 300 sec: 5567.8). Total num frames: 1262996480. Throughput: 0: 5740.3. Samples: 1262997812. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:07,281][25689] Avg episode reward: [(0, '1.356')] [2022-07-11 14:36:07,941][26022] Updated weights on worker 0-0, policy_version 1233398 (0.00092) [2022-07-11 14:36:09,681][26022] Updated weights on worker 0-0, policy_version 1233408 (0.00084) [2022-07-11 14:36:11,863][26022] Updated weights on worker 0-0, policy_version 1233418 (0.00091) [2022-07-11 14:36:12,304][25689] Fps is (10 sec: 5588.8, 60 sec: 5561.1, 300 sec: 5560.6). Total num frames: 1263023104. Throughput: 0: 5738.1. Samples: 1263031354. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:12,304][25689] Avg episode reward: [(0, '1.603')] [2022-07-11 14:36:13,318][26022] Updated weights on worker 0-0, policy_version 1233428 (0.00085) [2022-07-11 14:36:14,246][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:36:14,257][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001233433_1263035392.pth [2022-07-11 14:36:14,273][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001231473_1261028352.pth [2022-07-11 14:36:15,406][26022] Updated weights on worker 0-0, policy_version 1233438 (0.00079) [2022-07-11 14:36:16,968][26022] Updated weights on worker 0-0, policy_version 1233448 (0.00085) [2022-07-11 14:36:17,340][25689] Fps is (10 sec: 5496.9, 60 sec: 5561.1, 300 sec: 5563.8). Total num frames: 1263051776. Throughput: 0: 4915.2. Samples: 1263048300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:17,340][25689] Avg episode reward: [(0, '1.336')] [2022-07-11 14:36:18,900][26022] Updated weights on worker 0-0, policy_version 1233458 (0.00093) [2022-07-11 14:36:20,685][26022] Updated weights on worker 0-0, policy_version 1233468 (0.00086) [2022-07-11 14:36:22,349][25689] Fps is (10 sec: 5606.7, 60 sec: 5551.4, 300 sec: 5567.9). Total num frames: 1263079424. Throughput: 0: 5752.1. Samples: 1263081762. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:22,349][25689] Avg episode reward: [(0, '1.742')] [2022-07-11 14:36:22,516][26022] Updated weights on worker 0-0, policy_version 1233478 (0.00090) [2022-07-11 14:36:24,435][26022] Updated weights on worker 0-0, policy_version 1233488 (0.00086) [2022-07-11 14:36:26,306][26022] Updated weights on worker 0-0, policy_version 1233498 (0.00085) [2022-07-11 14:36:27,481][25689] Fps is (10 sec: 5452.2, 60 sec: 5525.9, 300 sec: 5559.9). Total num frames: 1263107072. Throughput: 0: 5805.3. Samples: 1263115088. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:27,482][25689] Avg episode reward: [(0, '1.761')] [2022-07-11 14:36:28,055][26022] Updated weights on worker 0-0, policy_version 1233508 (0.00343) [2022-07-11 14:36:29,935][26022] Updated weights on worker 0-0, policy_version 1233518 (0.00081) [2022-07-11 14:36:31,674][26022] Updated weights on worker 0-0, policy_version 1233528 (0.00092) [2022-07-11 14:36:32,506][25689] Fps is (10 sec: 5545.0, 60 sec: 5560.1, 300 sec: 5557.6). Total num frames: 1263135744. Throughput: 0: 4970.3. Samples: 1263131764. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:32,506][25689] Avg episode reward: [(0, '2.193')] [2022-07-11 14:36:33,775][26022] Updated weights on worker 0-0, policy_version 1233538 (0.00085) [2022-07-11 14:36:35,279][26022] Updated weights on worker 0-0, policy_version 1233548 (0.00085) [2022-07-11 14:36:37,231][26022] Updated weights on worker 0-0, policy_version 1233558 (0.00086) [2022-07-11 14:36:37,546][25689] Fps is (10 sec: 5697.3, 60 sec: 5540.2, 300 sec: 5564.6). Total num frames: 1263164416. Throughput: 0: 5791.4. Samples: 1263165326. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:37,547][25689] Avg episode reward: [(0, '1.690')] [2022-07-11 14:36:38,939][26022] Updated weights on worker 0-0, policy_version 1233568 (0.00080) [2022-07-11 14:36:41,015][26022] Updated weights on worker 0-0, policy_version 1233578 (0.00100) [2022-07-11 14:36:42,562][25689] Fps is (10 sec: 5600.5, 60 sec: 5540.0, 300 sec: 5562.1). Total num frames: 1263192064. Throughput: 0: 5783.4. Samples: 1263198664. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:36:42,563][25689] Avg episode reward: [(0, '1.671')] [2022-07-11 14:36:42,958][26022] Updated weights on worker 0-0, policy_version 1233588 (0.00084) [2022-07-11 14:36:44,663][26022] Updated weights on worker 0-0, policy_version 1233598 (0.00080) [2022-07-11 14:36:46,516][26022] Updated weights on worker 0-0, policy_version 1233608 (0.00089) [2022-07-11 14:36:47,672][25689] Fps is (10 sec: 5562.0, 60 sec: 5534.9, 300 sec: 5563.7). Total num frames: 1263220736. Throughput: 0: 4978.7. Samples: 1263215612. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:36:47,673][25689] Avg episode reward: [(0, '1.812')] [2022-07-11 14:36:48,411][26022] Updated weights on worker 0-0, policy_version 1233618 (0.00082) [2022-07-11 14:36:49,881][26022] Updated weights on worker 0-0, policy_version 1233628 (0.00090) [2022-07-11 14:36:52,153][26022] Updated weights on worker 0-0, policy_version 1233638 (0.00090) [2022-07-11 14:36:52,687][25689] Fps is (10 sec: 5461.3, 60 sec: 5535.8, 300 sec: 5556.7). Total num frames: 1263247360. Throughput: 0: 5818.3. Samples: 1263249186. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:36:52,688][25689] Avg episode reward: [(0, '1.516')] [2022-07-11 14:36:53,638][26022] Updated weights on worker 0-0, policy_version 1233648 (0.00089) [2022-07-11 14:36:55,791][26022] Updated weights on worker 0-0, policy_version 1233658 (0.00091) [2022-07-11 14:36:57,528][26022] Updated weights on worker 0-0, policy_version 1233668 (0.00088) [2022-07-11 14:36:57,734][25689] Fps is (10 sec: 5597.3, 60 sec: 5531.9, 300 sec: 5560.5). Total num frames: 1263277056. Throughput: 0: 5813.7. Samples: 1263282692. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:36:57,735][25689] Avg episode reward: [(0, '1.341')] [2022-07-11 14:36:59,231][26022] Updated weights on worker 0-0, policy_version 1233678 (0.00085) [2022-07-11 14:37:00,974][26022] Updated weights on worker 0-0, policy_version 1233688 (0.00091) [2022-07-11 14:37:02,740][25689] Fps is (10 sec: 5602.5, 60 sec: 5567.3, 300 sec: 5558.1). Total num frames: 1263303680. Throughput: 0: 5009.6. Samples: 1263299746. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:02,740][25689] Avg episode reward: [(0, '0.973')] [2022-07-11 14:37:03,483][26022] Updated weights on worker 0-0, policy_version 1233698 (0.00091) [2022-07-11 14:37:05,018][26022] Updated weights on worker 0-0, policy_version 1233708 (0.00083) [2022-07-11 14:37:06,766][26022] Updated weights on worker 0-0, policy_version 1233718 (0.00092) [2022-07-11 14:37:07,801][25689] Fps is (10 sec: 5391.3, 60 sec: 5532.8, 300 sec: 5561.6). Total num frames: 1263331328. Throughput: 0: 5780.2. Samples: 1263331958. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:07,801][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 14:37:08,716][26022] Updated weights on worker 0-0, policy_version 1233728 (0.00090) [2022-07-11 14:37:10,653][26022] Updated weights on worker 0-0, policy_version 1233738 (0.00095) [2022-07-11 14:37:12,401][26022] Updated weights on worker 0-0, policy_version 1233748 (0.00090) [2022-07-11 14:37:12,808][25689] Fps is (10 sec: 5593.9, 60 sec: 5568.1, 300 sec: 5559.0). Total num frames: 1263360000. Throughput: 0: 5762.0. Samples: 1263365122. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:12,809][25689] Avg episode reward: [(0, '1.081')] [2022-07-11 14:37:14,336][26022] Updated weights on worker 0-0, policy_version 1233758 (0.00091) [2022-07-11 14:37:15,904][26022] Updated weights on worker 0-0, policy_version 1233768 (0.00083) [2022-07-11 14:37:17,818][25689] Fps is (10 sec: 5622.6, 60 sec: 5553.6, 300 sec: 5559.9). Total num frames: 1263387648. Throughput: 0: 4951.9. Samples: 1263382142. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:17,818][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 14:37:18,073][26022] Updated weights on worker 0-0, policy_version 1233778 (0.00834) [2022-07-11 14:37:19,591][26022] Updated weights on worker 0-0, policy_version 1233788 (0.00088) [2022-07-11 14:37:21,562][26022] Updated weights on worker 0-0, policy_version 1233798 (0.00089) [2022-07-11 14:37:22,839][25689] Fps is (10 sec: 5512.5, 60 sec: 5552.5, 300 sec: 5554.5). Total num frames: 1263415296. Throughput: 0: 5773.1. Samples: 1263415780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:22,840][25689] Avg episode reward: [(0, '-0.213')] [2022-07-11 14:37:23,419][26022] Updated weights on worker 0-0, policy_version 1233808 (0.00083) [2022-07-11 14:37:25,155][26022] Updated weights on worker 0-0, policy_version 1233818 (0.00089) [2022-07-11 14:37:27,200][26022] Updated weights on worker 0-0, policy_version 1233828 (0.00091) [2022-07-11 14:37:27,938][25689] Fps is (10 sec: 5564.9, 60 sec: 5572.5, 300 sec: 5561.1). Total num frames: 1263443968. Throughput: 0: 5825.0. Samples: 1263449258. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:27,939][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 14:37:28,933][26022] Updated weights on worker 0-0, policy_version 1233838 (0.00089) [2022-07-11 14:37:30,650][26022] Updated weights on worker 0-0, policy_version 1233848 (0.00088) [2022-07-11 14:37:32,711][26022] Updated weights on worker 0-0, policy_version 1233858 (0.00091) [2022-07-11 14:37:32,961][25689] Fps is (10 sec: 5665.2, 60 sec: 5572.6, 300 sec: 5558.9). Total num frames: 1263472640. Throughput: 0: 5013.0. Samples: 1263466150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:32,963][25689] Avg episode reward: [(0, '-0.779')] [2022-07-11 14:37:34,214][26022] Updated weights on worker 0-0, policy_version 1233868 (0.00090) [2022-07-11 14:37:36,265][26022] Updated weights on worker 0-0, policy_version 1233878 (0.00088) [2022-07-11 14:37:37,806][26022] Updated weights on worker 0-0, policy_version 1233888 (0.00090) [2022-07-11 14:37:37,994][25689] Fps is (10 sec: 5702.6, 60 sec: 5573.4, 300 sec: 5562.3). Total num frames: 1263501312. Throughput: 0: 5835.9. Samples: 1263499890. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:37,995][25689] Avg episode reward: [(0, '-0.788')] [2022-07-11 14:37:39,818][26022] Updated weights on worker 0-0, policy_version 1233898 (0.00086) [2022-07-11 14:37:41,527][26022] Updated weights on worker 0-0, policy_version 1233908 (0.00088) [2022-07-11 14:37:43,050][25689] Fps is (10 sec: 5582.4, 60 sec: 5569.6, 300 sec: 5563.1). Total num frames: 1263528960. Throughput: 0: 5835.6. Samples: 1263533724. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:43,051][25689] Avg episode reward: [(0, '0.058')] [2022-07-11 14:37:43,505][26022] Updated weights on worker 0-0, policy_version 1233918 (0.00088) [2022-07-11 14:37:45,314][26022] Updated weights on worker 0-0, policy_version 1233928 (0.00080) [2022-07-11 14:37:47,006][26022] Updated weights on worker 0-0, policy_version 1233938 (0.00085) [2022-07-11 14:37:48,138][25689] Fps is (10 sec: 5451.3, 60 sec: 5554.8, 300 sec: 5558.7). Total num frames: 1263556608. Throughput: 0: 5845.4. Samples: 1263567332. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:48,138][25689] Avg episode reward: [(0, '-0.028')] [2022-07-11 14:37:49,215][26022] Updated weights on worker 0-0, policy_version 1233948 (0.00088) [2022-07-11 14:37:50,666][26022] Updated weights on worker 0-0, policy_version 1233958 (0.00084) [2022-07-11 14:37:52,713][26022] Updated weights on worker 0-0, policy_version 1233968 (0.00086) [2022-07-11 14:37:53,156][25689] Fps is (10 sec: 5674.3, 60 sec: 5605.3, 300 sec: 5562.2). Total num frames: 1263586304. Throughput: 0: 5841.7. Samples: 1263584122. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:53,156][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 14:37:54,543][26022] Updated weights on worker 0-0, policy_version 1233978 (0.00098) [2022-07-11 14:37:56,312][26022] Updated weights on worker 0-0, policy_version 1233988 (0.00091) [2022-07-11 14:37:58,111][26022] Updated weights on worker 0-0, policy_version 1233998 (0.00088) [2022-07-11 14:37:58,176][25689] Fps is (10 sec: 5712.2, 60 sec: 5573.9, 300 sec: 5562.0). Total num frames: 1263613952. Throughput: 0: 5831.3. Samples: 1263617580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:37:58,177][25689] Avg episode reward: [(0, '0.101')] [2022-07-11 14:37:59,944][26022] Updated weights on worker 0-0, policy_version 1234008 (0.00089) [2022-07-11 14:38:02,292][26022] Updated weights on worker 0-0, policy_version 1234018 (0.00110) [2022-07-11 14:38:03,187][25689] Fps is (10 sec: 5206.4, 60 sec: 5539.5, 300 sec: 5553.4). Total num frames: 1263638528. Throughput: 0: 5723.0. Samples: 1263648966. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:03,187][25689] Avg episode reward: [(0, '1.259')] [2022-07-11 14:38:04,042][26022] Updated weights on worker 0-0, policy_version 1234028 (0.00088) [2022-07-11 14:38:05,871][26022] Updated weights on worker 0-0, policy_version 1234038 (0.00092) [2022-07-11 14:38:07,524][26022] Updated weights on worker 0-0, policy_version 1234048 (0.00084) [2022-07-11 14:38:08,268][25689] Fps is (10 sec: 5377.9, 60 sec: 5571.6, 300 sec: 5562.3). Total num frames: 1263668224. Throughput: 0: 4892.3. Samples: 1263665816. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:08,268][25689] Avg episode reward: [(0, '1.458')] [2022-07-11 14:38:09,664][26022] Updated weights on worker 0-0, policy_version 1234058 (0.00094) [2022-07-11 14:38:11,346][26022] Updated weights on worker 0-0, policy_version 1234068 (0.00080) [2022-07-11 14:38:13,203][26022] Updated weights on worker 0-0, policy_version 1234078 (0.00092) [2022-07-11 14:38:13,327][25689] Fps is (10 sec: 5756.3, 60 sec: 5566.8, 300 sec: 5561.5). Total num frames: 1263696896. Throughput: 0: 5734.7. Samples: 1263699794. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:13,327][25689] Avg episode reward: [(0, '1.267')] [2022-07-11 14:38:14,351][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:38:14,364][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001234084_1263702016.pth [2022-07-11 14:38:14,364][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001232127_1261698048.pth [2022-07-11 14:38:14,900][26022] Updated weights on worker 0-0, policy_version 1234088 (0.00088) [2022-07-11 14:38:16,837][26022] Updated weights on worker 0-0, policy_version 1234098 (0.00084) [2022-07-11 14:38:18,402][25689] Fps is (10 sec: 5557.6, 60 sec: 5560.8, 300 sec: 5556.7). Total num frames: 1263724544. Throughput: 0: 5731.0. Samples: 1263733492. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:18,402][25689] Avg episode reward: [(0, '0.230')] [2022-07-11 14:38:18,680][26022] Updated weights on worker 0-0, policy_version 1234108 (0.00085) [2022-07-11 14:38:20,541][26022] Updated weights on worker 0-0, policy_version 1234118 (0.00606) [2022-07-11 14:38:22,204][26022] Updated weights on worker 0-0, policy_version 1234128 (0.00095) [2022-07-11 14:38:23,449][25689] Fps is (10 sec: 5563.8, 60 sec: 5575.3, 300 sec: 5560.3). Total num frames: 1263753216. Throughput: 0: 5014.9. Samples: 1263750580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:23,450][25689] Avg episode reward: [(0, '-0.260')] [2022-07-11 14:38:24,059][26022] Updated weights on worker 0-0, policy_version 1234138 (0.00086) [2022-07-11 14:38:25,887][26022] Updated weights on worker 0-0, policy_version 1234148 (0.00089) [2022-07-11 14:38:27,696][26022] Updated weights on worker 0-0, policy_version 1234158 (0.00087) [2022-07-11 14:38:28,500][25689] Fps is (10 sec: 5678.5, 60 sec: 5579.7, 300 sec: 5557.1). Total num frames: 1263781888. Throughput: 0: 5854.4. Samples: 1263784264. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:28,501][25689] Avg episode reward: [(0, '-0.011')] [2022-07-11 14:38:29,522][26022] Updated weights on worker 0-0, policy_version 1234168 (0.00091) [2022-07-11 14:38:31,350][26022] Updated weights on worker 0-0, policy_version 1234178 (0.00087) [2022-07-11 14:38:33,238][26022] Updated weights on worker 0-0, policy_version 1234188 (0.00085) [2022-07-11 14:38:33,527][25689] Fps is (10 sec: 5690.3, 60 sec: 5579.4, 300 sec: 5563.5). Total num frames: 1263810560. Throughput: 0: 5856.1. Samples: 1263818088. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:33,527][25689] Avg episode reward: [(0, '-0.285')] [2022-07-11 14:38:35,065][26022] Updated weights on worker 0-0, policy_version 1234198 (0.00084) [2022-07-11 14:38:36,784][26022] Updated weights on worker 0-0, policy_version 1234208 (0.00085) [2022-07-11 14:38:38,538][25689] Fps is (10 sec: 5610.9, 60 sec: 5564.5, 300 sec: 5564.0). Total num frames: 1263838208. Throughput: 0: 5046.9. Samples: 1263835116. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:38,539][25689] Avg episode reward: [(0, '-0.655')] [2022-07-11 14:38:38,629][26022] Updated weights on worker 0-0, policy_version 1234218 (0.00087) [2022-07-11 14:38:40,384][26022] Updated weights on worker 0-0, policy_version 1234228 (0.00089) [2022-07-11 14:38:42,311][26022] Updated weights on worker 0-0, policy_version 1234238 (0.00089) [2022-07-11 14:38:43,545][25689] Fps is (10 sec: 5417.4, 60 sec: 5552.1, 300 sec: 5557.9). Total num frames: 1263864832. Throughput: 0: 5889.2. Samples: 1263868926. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:43,546][25689] Avg episode reward: [(0, '-0.609')] [2022-07-11 14:38:44,120][26022] Updated weights on worker 0-0, policy_version 1234248 (0.00082) [2022-07-11 14:38:45,989][26022] Updated weights on worker 0-0, policy_version 1234258 (0.00092) [2022-07-11 14:38:47,807][26022] Updated weights on worker 0-0, policy_version 1234268 (0.00083) [2022-07-11 14:38:48,630][25689] Fps is (10 sec: 5580.5, 60 sec: 5586.1, 300 sec: 5560.1). Total num frames: 1263894528. Throughput: 0: 5858.4. Samples: 1263902192. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:48,631][25689] Avg episode reward: [(0, '0.646')] [2022-07-11 14:38:49,849][26022] Updated weights on worker 0-0, policy_version 1234278 (0.00087) [2022-07-11 14:38:51,479][26022] Updated weights on worker 0-0, policy_version 1234288 (0.00085) [2022-07-11 14:38:53,404][26022] Updated weights on worker 0-0, policy_version 1234298 (0.00086) [2022-07-11 14:38:53,642][25689] Fps is (10 sec: 5679.4, 60 sec: 5552.9, 300 sec: 5567.1). Total num frames: 1263922176. Throughput: 0: 4998.7. Samples: 1263918636. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:53,642][25689] Avg episode reward: [(0, '1.169')] [2022-07-11 14:38:55,140][26022] Updated weights on worker 0-0, policy_version 1234308 (0.00086) [2022-07-11 14:38:57,056][26022] Updated weights on worker 0-0, policy_version 1234318 (0.00077) [2022-07-11 14:38:58,666][25689] Fps is (10 sec: 5510.1, 60 sec: 5552.6, 300 sec: 5557.0). Total num frames: 1263949824. Throughput: 0: 5825.6. Samples: 1263952372. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:38:58,667][25689] Avg episode reward: [(0, '1.420')] [2022-07-11 14:38:58,856][26022] Updated weights on worker 0-0, policy_version 1234328 (0.00098) [2022-07-11 14:39:00,516][26022] Updated weights on worker 0-0, policy_version 1234338 (0.00092) [2022-07-11 14:39:02,998][26022] Updated weights on worker 0-0, policy_version 1234348 (0.00093) [2022-07-11 14:39:03,685][25689] Fps is (10 sec: 5403.8, 60 sec: 5585.6, 300 sec: 5561.5). Total num frames: 1263976448. Throughput: 0: 5715.9. Samples: 1263984046. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:03,687][25689] Avg episode reward: [(0, '0.491')] [2022-07-11 14:39:04,504][26022] Updated weights on worker 0-0, policy_version 1234358 (0.00092) [2022-07-11 14:39:06,582][26022] Updated weights on worker 0-0, policy_version 1234368 (0.00088) [2022-07-11 14:39:08,348][26022] Updated weights on worker 0-0, policy_version 1234378 (0.00092) [2022-07-11 14:39:08,752][25689] Fps is (10 sec: 5482.3, 60 sec: 5570.0, 300 sec: 5563.8). Total num frames: 1264005120. Throughput: 0: 4901.6. Samples: 1264000820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:08,752][25689] Avg episode reward: [(0, '0.711')] [2022-07-11 14:39:10,139][26022] Updated weights on worker 0-0, policy_version 1234388 (0.00056) [2022-07-11 14:39:12,097][26022] Updated weights on worker 0-0, policy_version 1234398 (0.00097) [2022-07-11 14:39:13,767][25689] Fps is (10 sec: 5484.5, 60 sec: 5540.1, 300 sec: 5560.6). Total num frames: 1264031744. Throughput: 0: 5749.1. Samples: 1264034340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:13,768][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 14:39:14,075][26022] Updated weights on worker 0-0, policy_version 1234408 (0.00084) [2022-07-11 14:39:15,764][26022] Updated weights on worker 0-0, policy_version 1234418 (0.00984) [2022-07-11 14:39:17,791][26022] Updated weights on worker 0-0, policy_version 1234428 (0.00086) [2022-07-11 14:39:18,782][25689] Fps is (10 sec: 5615.0, 60 sec: 5579.6, 300 sec: 5560.8). Total num frames: 1264061440. Throughput: 0: 5738.3. Samples: 1264067806. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:18,783][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 14:39:19,205][26022] Updated weights on worker 0-0, policy_version 1234438 (0.00085) [2022-07-11 14:39:21,402][26022] Updated weights on worker 0-0, policy_version 1234448 (0.00082) [2022-07-11 14:39:22,959][26022] Updated weights on worker 0-0, policy_version 1234458 (0.00089) [2022-07-11 14:39:23,791][25689] Fps is (10 sec: 5618.8, 60 sec: 5549.2, 300 sec: 5561.9). Total num frames: 1264088064. Throughput: 0: 5002.0. Samples: 1264084614. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:23,791][25689] Avg episode reward: [(0, '-0.127')] [2022-07-11 14:39:24,809][26022] Updated weights on worker 0-0, policy_version 1234468 (0.00088) [2022-07-11 14:39:26,837][26022] Updated weights on worker 0-0, policy_version 1234478 (0.00084) [2022-07-11 14:39:28,479][26022] Updated weights on worker 0-0, policy_version 1234488 (0.00084) [2022-07-11 14:39:28,830][25689] Fps is (10 sec: 5605.2, 60 sec: 5567.3, 300 sec: 5561.8). Total num frames: 1264117760. Throughput: 0: 5852.0. Samples: 1264118316. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:28,831][25689] Avg episode reward: [(0, '-0.456')] [2022-07-11 14:39:30,363][26022] Updated weights on worker 0-0, policy_version 1234498 (0.00084) [2022-07-11 14:39:32,143][26022] Updated weights on worker 0-0, policy_version 1234508 (0.00086) [2022-07-11 14:39:33,855][25689] Fps is (10 sec: 5595.9, 60 sec: 5533.4, 300 sec: 5555.1). Total num frames: 1264144384. Throughput: 0: 5859.5. Samples: 1264152044. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:33,856][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 14:39:34,120][26022] Updated weights on worker 0-0, policy_version 1234518 (0.00089) [2022-07-11 14:39:35,727][26022] Updated weights on worker 0-0, policy_version 1234528 (0.00091) [2022-07-11 14:39:37,759][26022] Updated weights on worker 0-0, policy_version 1234538 (0.00079) [2022-07-11 14:39:38,869][25689] Fps is (10 sec: 5508.0, 60 sec: 5550.2, 300 sec: 5559.0). Total num frames: 1264173056. Throughput: 0: 5028.9. Samples: 1264168822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:38,870][25689] Avg episode reward: [(0, '-0.007')] [2022-07-11 14:39:39,452][26022] Updated weights on worker 0-0, policy_version 1234548 (0.00084) [2022-07-11 14:39:41,233][26022] Updated weights on worker 0-0, policy_version 1234558 (0.00085) [2022-07-11 14:39:43,222][26022] Updated weights on worker 0-0, policy_version 1234568 (0.00082) [2022-07-11 14:39:43,886][25689] Fps is (10 sec: 5716.9, 60 sec: 5583.2, 300 sec: 5563.5). Total num frames: 1264201728. Throughput: 0: 5875.5. Samples: 1264202684. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:43,886][25689] Avg episode reward: [(0, '0.195')] [2022-07-11 14:39:45,064][26022] Updated weights on worker 0-0, policy_version 1234578 (0.00079) [2022-07-11 14:39:46,742][26022] Updated weights on worker 0-0, policy_version 1234588 (0.00090) [2022-07-11 14:39:48,441][26022] Updated weights on worker 0-0, policy_version 1234598 (0.00085) [2022-07-11 14:39:48,967][25689] Fps is (10 sec: 5679.1, 60 sec: 5566.6, 300 sec: 5562.1). Total num frames: 1264230400. Throughput: 0: 5875.2. Samples: 1264236622. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:48,967][25689] Avg episode reward: [(0, '-0.421')] [2022-07-11 14:39:50,287][26022] Updated weights on worker 0-0, policy_version 1234608 (0.00082) [2022-07-11 14:39:52,041][26022] Updated weights on worker 0-0, policy_version 1234618 (0.00081) [2022-07-11 14:39:53,939][26022] Updated weights on worker 0-0, policy_version 1234628 (0.00086) [2022-07-11 14:39:54,004][25689] Fps is (10 sec: 5667.5, 60 sec: 5581.2, 300 sec: 5565.2). Total num frames: 1264259072. Throughput: 0: 5043.9. Samples: 1264253672. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:54,004][25689] Avg episode reward: [(0, '0.472')] [2022-07-11 14:39:55,675][26022] Updated weights on worker 0-0, policy_version 1234638 (0.00083) [2022-07-11 14:39:57,660][26022] Updated weights on worker 0-0, policy_version 1234648 (0.00090) [2022-07-11 14:39:59,028][25689] Fps is (10 sec: 5597.5, 60 sec: 5581.2, 300 sec: 5565.7). Total num frames: 1264286720. Throughput: 0: 5895.7. Samples: 1264287674. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:39:59,029][25689] Avg episode reward: [(0, '0.132')] [2022-07-11 14:39:59,500][26022] Updated weights on worker 0-0, policy_version 1234658 (0.00095) [2022-07-11 14:40:01,296][26022] Updated weights on worker 0-0, policy_version 1234668 (0.00084) [2022-07-11 14:40:03,508][26022] Updated weights on worker 0-0, policy_version 1234678 (0.00491) [2022-07-11 14:40:04,035][25689] Fps is (10 sec: 5512.6, 60 sec: 5599.4, 300 sec: 5568.5). Total num frames: 1264314368. Throughput: 0: 5785.0. Samples: 1264319246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:04,035][25689] Avg episode reward: [(0, '0.003')] [2022-07-11 14:40:05,239][26022] Updated weights on worker 0-0, policy_version 1234688 (0.00085) [2022-07-11 14:40:06,937][26022] Updated weights on worker 0-0, policy_version 1234698 (0.00087) [2022-07-11 14:40:09,078][25689] Fps is (10 sec: 5298.2, 60 sec: 5550.6, 300 sec: 5561.6). Total num frames: 1264339968. Throughput: 0: 4942.5. Samples: 1264336024. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:09,079][25689] Avg episode reward: [(0, '0.627')] [2022-07-11 14:40:09,100][26022] Updated weights on worker 0-0, policy_version 1234708 (0.00049) [2022-07-11 14:40:10,810][26022] Updated weights on worker 0-0, policy_version 1234718 (0.00090) [2022-07-11 14:40:12,727][26022] Updated weights on worker 0-0, policy_version 1234728 (0.00095) [2022-07-11 14:40:14,081][25689] Fps is (10 sec: 5402.2, 60 sec: 5585.7, 300 sec: 5562.2). Total num frames: 1264368640. Throughput: 0: 5781.0. Samples: 1264369738. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:14,082][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 14:40:14,466][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:40:14,480][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001234737_1264370688.pth [2022-07-11 14:40:14,481][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001232780_1262366720.pth [2022-07-11 14:40:14,604][26022] Updated weights on worker 0-0, policy_version 1234738 (0.00065) [2022-07-11 14:40:16,186][26022] Updated weights on worker 0-0, policy_version 1234748 (0.00086) [2022-07-11 14:40:18,230][26022] Updated weights on worker 0-0, policy_version 1234758 (0.00087) [2022-07-11 14:40:19,115][25689] Fps is (10 sec: 5713.6, 60 sec: 5567.0, 300 sec: 5563.2). Total num frames: 1264397312. Throughput: 0: 5747.6. Samples: 1264403124. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:19,115][25689] Avg episode reward: [(0, '0.955')] [2022-07-11 14:40:19,976][26022] Updated weights on worker 0-0, policy_version 1234768 (0.00084) [2022-07-11 14:40:21,763][26022] Updated weights on worker 0-0, policy_version 1234778 (0.00084) [2022-07-11 14:40:23,525][26022] Updated weights on worker 0-0, policy_version 1234788 (0.00088) [2022-07-11 14:40:24,117][25689] Fps is (10 sec: 5611.7, 60 sec: 5584.5, 300 sec: 5560.4). Total num frames: 1264424960. Throughput: 0: 5026.1. Samples: 1264420182. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:24,118][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 14:40:25,413][26022] Updated weights on worker 0-0, policy_version 1234798 (0.00088) [2022-07-11 14:40:27,167][26022] Updated weights on worker 0-0, policy_version 1234808 (0.00083) [2022-07-11 14:40:29,111][26022] Updated weights on worker 0-0, policy_version 1234818 (0.00090) [2022-07-11 14:40:29,176][25689] Fps is (10 sec: 5597.8, 60 sec: 5565.8, 300 sec: 5566.8). Total num frames: 1264453632. Throughput: 0: 5867.8. Samples: 1264453952. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:29,176][25689] Avg episode reward: [(0, '0.497')] [2022-07-11 14:40:30,788][26022] Updated weights on worker 0-0, policy_version 1234828 (0.00084) [2022-07-11 14:40:32,671][26022] Updated weights on worker 0-0, policy_version 1234838 (0.00087) [2022-07-11 14:40:34,182][25689] Fps is (10 sec: 5595.8, 60 sec: 5584.5, 300 sec: 5559.9). Total num frames: 1264481280. Throughput: 0: 5856.0. Samples: 1264487448. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:34,184][25689] Avg episode reward: [(0, '0.342')] [2022-07-11 14:40:34,723][26022] Updated weights on worker 0-0, policy_version 1234848 (0.00090) [2022-07-11 14:40:36,326][26022] Updated weights on worker 0-0, policy_version 1234858 (0.00062) [2022-07-11 14:40:38,295][26022] Updated weights on worker 0-0, policy_version 1234868 (0.00084) [2022-07-11 14:40:39,233][25689] Fps is (10 sec: 5600.1, 60 sec: 5581.1, 300 sec: 5562.7). Total num frames: 1264509952. Throughput: 0: 5024.3. Samples: 1264504202. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:40:39,233][25689] Avg episode reward: [(0, '0.395')] [2022-07-11 14:40:40,023][26022] Updated weights on worker 0-0, policy_version 1234878 (0.00085) [2022-07-11 14:40:41,865][26022] Updated weights on worker 0-0, policy_version 1234888 (0.00088) [2022-07-11 14:40:43,882][26022] Updated weights on worker 0-0, policy_version 1234898 (0.00086) [2022-07-11 14:40:44,287][25689] Fps is (10 sec: 5573.7, 60 sec: 5560.7, 300 sec: 5559.2). Total num frames: 1264537600. Throughput: 0: 5838.8. Samples: 1264537946. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:40:44,287][25689] Avg episode reward: [(0, '-0.189')] [2022-07-11 14:40:45,595][26022] Updated weights on worker 0-0, policy_version 1234908 (0.00093) [2022-07-11 14:40:47,419][26022] Updated weights on worker 0-0, policy_version 1234918 (0.00092) [2022-07-11 14:40:49,316][26022] Updated weights on worker 0-0, policy_version 1234928 (0.00083) [2022-07-11 14:40:49,377][25689] Fps is (10 sec: 5551.9, 60 sec: 5559.8, 300 sec: 5564.9). Total num frames: 1264566272. Throughput: 0: 5806.4. Samples: 1264571248. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:40:49,378][25689] Avg episode reward: [(0, '0.434')] [2022-07-11 14:40:51,237][26022] Updated weights on worker 0-0, policy_version 1234938 (0.00087) [2022-07-11 14:40:52,838][26022] Updated weights on worker 0-0, policy_version 1234948 (0.00080) [2022-07-11 14:40:54,431][25689] Fps is (10 sec: 5551.9, 60 sec: 5541.4, 300 sec: 5557.1). Total num frames: 1264593920. Throughput: 0: 4966.4. Samples: 1264588012. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:40:54,431][25689] Avg episode reward: [(0, '0.144')] [2022-07-11 14:40:54,772][26022] Updated weights on worker 0-0, policy_version 1234958 (0.00085) [2022-07-11 14:40:56,376][26022] Updated weights on worker 0-0, policy_version 1234968 (0.00091) [2022-07-11 14:40:58,379][26022] Updated weights on worker 0-0, policy_version 1234978 (0.00091) [2022-07-11 14:40:59,460][25689] Fps is (10 sec: 5585.7, 60 sec: 5557.9, 300 sec: 5570.7). Total num frames: 1264622592. Throughput: 0: 5820.9. Samples: 1264621942. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:40:59,460][25689] Avg episode reward: [(0, '0.621')] [2022-07-11 14:41:00,023][26022] Updated weights on worker 0-0, policy_version 1234988 (0.00108) [2022-07-11 14:41:02,461][26022] Updated weights on worker 0-0, policy_version 1234998 (0.00084) [2022-07-11 14:41:04,284][26022] Updated weights on worker 0-0, policy_version 1235008 (0.00091) [2022-07-11 14:41:04,504][25689] Fps is (10 sec: 5489.5, 60 sec: 5537.5, 300 sec: 5560.6). Total num frames: 1264649216. Throughput: 0: 5725.4. Samples: 1264653698. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:04,507][25689] Avg episode reward: [(0, '0.627')] [2022-07-11 14:41:05,923][26022] Updated weights on worker 0-0, policy_version 1235018 (0.00092) [2022-07-11 14:41:07,856][26022] Updated weights on worker 0-0, policy_version 1235028 (0.00332) [2022-07-11 14:41:09,566][25689] Fps is (10 sec: 5471.4, 60 sec: 5586.6, 300 sec: 5566.7). Total num frames: 1264677888. Throughput: 0: 4920.3. Samples: 1264670584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:09,568][25689] Avg episode reward: [(0, '-0.153')] [2022-07-11 14:41:09,783][26022] Updated weights on worker 0-0, policy_version 1235038 (0.00085) [2022-07-11 14:41:11,355][26022] Updated weights on worker 0-0, policy_version 1235048 (0.00084) [2022-07-11 14:41:13,402][26022] Updated weights on worker 0-0, policy_version 1235058 (0.00092) [2022-07-11 14:41:14,576][25689] Fps is (10 sec: 5693.5, 60 sec: 5586.0, 300 sec: 5567.2). Total num frames: 1264706560. Throughput: 0: 5784.9. Samples: 1264704548. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:14,576][25689] Avg episode reward: [(0, '0.118')] [2022-07-11 14:41:15,144][26022] Updated weights on worker 0-0, policy_version 1235068 (0.00088) [2022-07-11 14:41:17,025][26022] Updated weights on worker 0-0, policy_version 1235078 (0.00086) [2022-07-11 14:41:18,934][26022] Updated weights on worker 0-0, policy_version 1235088 (0.00088) [2022-07-11 14:41:19,625][25689] Fps is (10 sec: 5497.3, 60 sec: 5550.7, 300 sec: 5563.0). Total num frames: 1264733184. Throughput: 0: 5751.7. Samples: 1264737926. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:19,625][25689] Avg episode reward: [(0, '0.125')] [2022-07-11 14:41:20,616][26022] Updated weights on worker 0-0, policy_version 1235098 (0.00091) [2022-07-11 14:41:22,554][26022] Updated weights on worker 0-0, policy_version 1235108 (0.00091) [2022-07-11 14:41:24,378][26022] Updated weights on worker 0-0, policy_version 1235118 (0.00091) [2022-07-11 14:41:24,715][25689] Fps is (10 sec: 5554.5, 60 sec: 5576.4, 300 sec: 5570.7). Total num frames: 1264762880. Throughput: 0: 5834.8. Samples: 1264771628. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:24,717][25689] Avg episode reward: [(0, '-0.205')] [2022-07-11 14:41:26,067][26022] Updated weights on worker 0-0, policy_version 1235128 (0.00089) [2022-07-11 14:41:28,106][26022] Updated weights on worker 0-0, policy_version 1235138 (0.00083) [2022-07-11 14:41:29,750][26022] Updated weights on worker 0-0, policy_version 1235148 (0.00093) [2022-07-11 14:41:29,841][25689] Fps is (10 sec: 5713.7, 60 sec: 5570.3, 300 sec: 5568.8). Total num frames: 1264791552. Throughput: 0: 5806.7. Samples: 1264788310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:29,841][25689] Avg episode reward: [(0, '-0.409')] [2022-07-11 14:41:31,652][26022] Updated weights on worker 0-0, policy_version 1235158 (0.00098) [2022-07-11 14:41:33,441][26022] Updated weights on worker 0-0, policy_version 1235168 (0.00083) [2022-07-11 14:41:34,851][25689] Fps is (10 sec: 5556.6, 60 sec: 5569.9, 300 sec: 5566.0). Total num frames: 1264819200. Throughput: 0: 5796.2. Samples: 1264822068. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:34,852][25689] Avg episode reward: [(0, '-0.461')] [2022-07-11 14:41:35,165][26022] Updated weights on worker 0-0, policy_version 1235178 (0.00093) [2022-07-11 14:41:37,047][26022] Updated weights on worker 0-0, policy_version 1235188 (0.00089) [2022-07-11 14:41:38,962][26022] Updated weights on worker 0-0, policy_version 1235198 (0.00114) [2022-07-11 14:41:39,907][25689] Fps is (10 sec: 5493.0, 60 sec: 5552.5, 300 sec: 5565.2). Total num frames: 1264846848. Throughput: 0: 5802.8. Samples: 1264855620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:39,908][25689] Avg episode reward: [(0, '0.460')] [2022-07-11 14:41:40,831][26022] Updated weights on worker 0-0, policy_version 1235208 (0.00090) [2022-07-11 14:41:42,561][26022] Updated weights on worker 0-0, policy_version 1235218 (0.00092) [2022-07-11 14:41:44,448][26022] Updated weights on worker 0-0, policy_version 1235228 (0.00091) [2022-07-11 14:41:44,937][25689] Fps is (10 sec: 5583.9, 60 sec: 5571.6, 300 sec: 5566.7). Total num frames: 1264875520. Throughput: 0: 4977.6. Samples: 1264872284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:44,938][25689] Avg episode reward: [(0, '-0.015')] [2022-07-11 14:41:46,167][26022] Updated weights on worker 0-0, policy_version 1235238 (0.00082) [2022-07-11 14:41:48,235][26022] Updated weights on worker 0-0, policy_version 1235248 (0.00092) [2022-07-11 14:41:49,978][26022] Updated weights on worker 0-0, policy_version 1235258 (0.00626) [2022-07-11 14:41:50,066][25689] Fps is (10 sec: 5644.8, 60 sec: 5568.1, 300 sec: 5571.5). Total num frames: 1264904192. Throughput: 0: 5803.9. Samples: 1264905696. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:50,066][25689] Avg episode reward: [(0, '-1.137')] [2022-07-11 14:41:51,850][26022] Updated weights on worker 0-0, policy_version 1235268 (0.00100) [2022-07-11 14:41:53,640][26022] Updated weights on worker 0-0, policy_version 1235278 (0.00094) [2022-07-11 14:41:55,075][25689] Fps is (10 sec: 5454.6, 60 sec: 5555.3, 300 sec: 5561.8). Total num frames: 1264930816. Throughput: 0: 5793.4. Samples: 1264939232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:41:55,075][25689] Avg episode reward: [(0, '-0.937')] [2022-07-11 14:41:55,484][26022] Updated weights on worker 0-0, policy_version 1235288 (0.00088) [2022-07-11 14:41:57,303][26022] Updated weights on worker 0-0, policy_version 1235298 (0.00092) [2022-07-11 14:41:59,137][26022] Updated weights on worker 0-0, policy_version 1235308 (0.00092) [2022-07-11 14:42:00,086][25689] Fps is (10 sec: 5518.5, 60 sec: 5557.0, 300 sec: 5568.6). Total num frames: 1264959488. Throughput: 0: 4978.9. Samples: 1264956088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:00,086][25689] Avg episode reward: [(0, '-0.508')] [2022-07-11 14:42:00,984][26022] Updated weights on worker 0-0, policy_version 1235318 (0.00621) [2022-07-11 14:42:03,281][26022] Updated weights on worker 0-0, policy_version 1235328 (0.00090) [2022-07-11 14:42:04,879][26022] Updated weights on worker 0-0, policy_version 1235338 (0.00089) [2022-07-11 14:42:05,135][25689] Fps is (10 sec: 5598.4, 60 sec: 5573.4, 300 sec: 5568.9). Total num frames: 1264987136. Throughput: 0: 5718.0. Samples: 1264987774. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:05,137][25689] Avg episode reward: [(0, '-0.557')] [2022-07-11 14:42:07,099][26022] Updated weights on worker 0-0, policy_version 1235348 (0.00084) [2022-07-11 14:42:08,447][26022] Updated weights on worker 0-0, policy_version 1235358 (0.00097) [2022-07-11 14:42:10,196][25689] Fps is (10 sec: 5368.2, 60 sec: 5539.7, 300 sec: 5561.0). Total num frames: 1265013760. Throughput: 0: 5743.9. Samples: 1265021320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:10,196][25689] Avg episode reward: [(0, '-0.107')] [2022-07-11 14:42:10,581][26022] Updated weights on worker 0-0, policy_version 1235368 (0.00054) [2022-07-11 14:42:12,315][26022] Updated weights on worker 0-0, policy_version 1235378 (0.00109) [2022-07-11 14:42:14,211][26022] Updated weights on worker 0-0, policy_version 1235388 (0.00097) [2022-07-11 14:42:14,761][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:42:14,773][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001235391_1265040384.pth [2022-07-11 14:42:14,774][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001233433_1263035392.pth [2022-07-11 14:42:15,216][25689] Fps is (10 sec: 5586.5, 60 sec: 5555.6, 300 sec: 5567.6). Total num frames: 1265043456. Throughput: 0: 4905.8. Samples: 1265038042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:15,217][25689] Avg episode reward: [(0, '0.855')] [2022-07-11 14:42:15,827][26022] Updated weights on worker 0-0, policy_version 1235398 (0.00094) [2022-07-11 14:42:17,876][26022] Updated weights on worker 0-0, policy_version 1235408 (0.00086) [2022-07-11 14:42:19,602][26022] Updated weights on worker 0-0, policy_version 1235418 (0.00090) [2022-07-11 14:42:20,233][25689] Fps is (10 sec: 5611.1, 60 sec: 5558.6, 300 sec: 5564.3). Total num frames: 1265070080. Throughput: 0: 5759.9. Samples: 1265072134. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:20,234][25689] Avg episode reward: [(0, '1.289')] [2022-07-11 14:42:21,499][26022] Updated weights on worker 0-0, policy_version 1235428 (0.00077) [2022-07-11 14:42:23,344][26022] Updated weights on worker 0-0, policy_version 1235438 (0.00093) [2022-07-11 14:42:24,982][26022] Updated weights on worker 0-0, policy_version 1235448 (0.00085) [2022-07-11 14:42:25,273][25689] Fps is (10 sec: 5600.0, 60 sec: 5563.2, 300 sec: 5568.8). Total num frames: 1265099776. Throughput: 0: 5862.7. Samples: 1265105840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:25,274][25689] Avg episode reward: [(0, '1.838')] [2022-07-11 14:42:27,000][26022] Updated weights on worker 0-0, policy_version 1235458 (0.00468) [2022-07-11 14:42:28,740][26022] Updated weights on worker 0-0, policy_version 1235468 (0.00091) [2022-07-11 14:42:30,398][25689] Fps is (10 sec: 5641.2, 60 sec: 5546.3, 300 sec: 5563.5). Total num frames: 1265127424. Throughput: 0: 5009.1. Samples: 1265122516. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:30,399][25689] Avg episode reward: [(0, '1.706')] [2022-07-11 14:42:30,602][26022] Updated weights on worker 0-0, policy_version 1235478 (0.00088) [2022-07-11 14:42:32,248][26022] Updated weights on worker 0-0, policy_version 1235488 (0.00071) [2022-07-11 14:42:34,251][26022] Updated weights on worker 0-0, policy_version 1235498 (0.00087) [2022-07-11 14:42:35,450][25689] Fps is (10 sec: 5634.8, 60 sec: 5576.3, 300 sec: 5566.6). Total num frames: 1265157120. Throughput: 0: 5846.1. Samples: 1265156332. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:35,451][25689] Avg episode reward: [(0, '1.716')] [2022-07-11 14:42:36,051][26022] Updated weights on worker 0-0, policy_version 1235508 (0.00083) [2022-07-11 14:42:37,919][26022] Updated weights on worker 0-0, policy_version 1235518 (0.00101) [2022-07-11 14:42:39,585][26022] Updated weights on worker 0-0, policy_version 1235528 (0.00092) [2022-07-11 14:42:40,479][25689] Fps is (10 sec: 5688.3, 60 sec: 5578.8, 300 sec: 5567.1). Total num frames: 1265184768. Throughput: 0: 5840.2. Samples: 1265190376. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:40,480][25689] Avg episode reward: [(0, '0.775')] [2022-07-11 14:42:41,477][26022] Updated weights on worker 0-0, policy_version 1235538 (0.00085) [2022-07-11 14:42:43,266][26022] Updated weights on worker 0-0, policy_version 1235548 (0.00090) [2022-07-11 14:42:45,189][26022] Updated weights on worker 0-0, policy_version 1235558 (0.00084) [2022-07-11 14:42:45,524][25689] Fps is (10 sec: 5590.4, 60 sec: 5577.4, 300 sec: 5571.3). Total num frames: 1265213440. Throughput: 0: 5005.7. Samples: 1265207212. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:45,525][25689] Avg episode reward: [(0, '0.600')] [2022-07-11 14:42:47,012][26022] Updated weights on worker 0-0, policy_version 1235568 (0.00091) [2022-07-11 14:42:48,733][26022] Updated weights on worker 0-0, policy_version 1235578 (0.00086) [2022-07-11 14:42:50,599][25689] Fps is (10 sec: 5565.4, 60 sec: 5565.5, 300 sec: 5563.4). Total num frames: 1265241088. Throughput: 0: 5852.6. Samples: 1265240742. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:50,599][25689] Avg episode reward: [(0, '0.172')] [2022-07-11 14:42:50,777][26022] Updated weights on worker 0-0, policy_version 1235588 (0.00751) [2022-07-11 14:42:52,482][26022] Updated weights on worker 0-0, policy_version 1235598 (0.00087) [2022-07-11 14:42:54,196][26022] Updated weights on worker 0-0, policy_version 1235608 (0.00081) [2022-07-11 14:42:55,611][25689] Fps is (10 sec: 5482.2, 60 sec: 5582.1, 300 sec: 5563.5). Total num frames: 1265268736. Throughput: 0: 5855.9. Samples: 1265274392. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:42:55,613][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 14:42:56,215][26022] Updated weights on worker 0-0, policy_version 1235618 (0.00096) [2022-07-11 14:42:57,791][26022] Updated weights on worker 0-0, policy_version 1235628 (0.00083) [2022-07-11 14:42:59,815][26022] Updated weights on worker 0-0, policy_version 1235638 (0.00080) [2022-07-11 14:43:00,621][25689] Fps is (10 sec: 5619.9, 60 sec: 5582.3, 300 sec: 5577.3). Total num frames: 1265297408. Throughput: 0: 5006.8. Samples: 1265291220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:00,622][25689] Avg episode reward: [(0, '0.339')] [2022-07-11 14:43:01,351][26022] Updated weights on worker 0-0, policy_version 1235648 (0.00090) [2022-07-11 14:43:03,874][26022] Updated weights on worker 0-0, policy_version 1235658 (0.00086) [2022-07-11 14:43:05,714][25689] Fps is (10 sec: 5371.8, 60 sec: 5544.3, 300 sec: 5563.3). Total num frames: 1265323008. Throughput: 0: 5716.8. Samples: 1265322632. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:05,716][25689] Avg episode reward: [(0, '-0.042')] [2022-07-11 14:43:05,739][26022] Updated weights on worker 0-0, policy_version 1235668 (0.00088) [2022-07-11 14:43:07,452][26022] Updated weights on worker 0-0, policy_version 1235678 (0.00089) [2022-07-11 14:43:09,273][26022] Updated weights on worker 0-0, policy_version 1235688 (0.00084) [2022-07-11 14:43:10,767][25689] Fps is (10 sec: 5449.9, 60 sec: 5595.8, 300 sec: 5566.9). Total num frames: 1265352704. Throughput: 0: 5727.9. Samples: 1265356260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:10,767][25689] Avg episode reward: [(0, '0.991')] [2022-07-11 14:43:11,236][26022] Updated weights on worker 0-0, policy_version 1235698 (0.00095) [2022-07-11 14:43:13,175][26022] Updated weights on worker 0-0, policy_version 1235708 (0.00087) [2022-07-11 14:43:15,019][26022] Updated weights on worker 0-0, policy_version 1235718 (0.00088) [2022-07-11 14:43:15,794][25689] Fps is (10 sec: 5688.8, 60 sec: 5561.4, 300 sec: 5567.8). Total num frames: 1265380352. Throughput: 0: 4881.3. Samples: 1265372912. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:15,795][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 14:43:16,714][26022] Updated weights on worker 0-0, policy_version 1235728 (0.00083) [2022-07-11 14:43:18,674][26022] Updated weights on worker 0-0, policy_version 1235738 (0.00100) [2022-07-11 14:43:20,522][26022] Updated weights on worker 0-0, policy_version 1235748 (0.00092) [2022-07-11 14:43:20,834][25689] Fps is (10 sec: 5390.7, 60 sec: 5559.2, 300 sec: 5561.0). Total num frames: 1265406976. Throughput: 0: 5690.8. Samples: 1265406252. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:20,835][25689] Avg episode reward: [(0, '1.113')] [2022-07-11 14:43:22,219][26022] Updated weights on worker 0-0, policy_version 1235758 (0.00084) [2022-07-11 14:43:24,205][26022] Updated weights on worker 0-0, policy_version 1235768 (0.00087) [2022-07-11 14:43:25,837][25689] Fps is (10 sec: 5403.9, 60 sec: 5528.8, 300 sec: 5558.5). Total num frames: 1265434624. Throughput: 0: 5816.0. Samples: 1265439668. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:25,838][25689] Avg episode reward: [(0, '0.843')] [2022-07-11 14:43:26,127][26022] Updated weights on worker 0-0, policy_version 1235778 (0.00091) [2022-07-11 14:43:27,922][26022] Updated weights on worker 0-0, policy_version 1235788 (0.00087) [2022-07-11 14:43:29,666][26022] Updated weights on worker 0-0, policy_version 1235798 (0.00093) [2022-07-11 14:43:30,950][25689] Fps is (10 sec: 5668.6, 60 sec: 5563.7, 300 sec: 5560.3). Total num frames: 1265464320. Throughput: 0: 5787.4. Samples: 1265473072. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:30,951][25689] Avg episode reward: [(0, '0.951')] [2022-07-11 14:43:31,410][26022] Updated weights on worker 0-0, policy_version 1235808 (0.00102) [2022-07-11 14:43:33,250][26022] Updated weights on worker 0-0, policy_version 1235818 (0.00088) [2022-07-11 14:43:35,347][26022] Updated weights on worker 0-0, policy_version 1235828 (0.00093) [2022-07-11 14:43:35,963][25689] Fps is (10 sec: 5764.5, 60 sec: 5550.4, 300 sec: 5563.7). Total num frames: 1265492992. Throughput: 0: 5801.8. Samples: 1265489926. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:35,963][25689] Avg episode reward: [(0, '1.601')] [2022-07-11 14:43:36,825][26022] Updated weights on worker 0-0, policy_version 1235838 (0.00088) [2022-07-11 14:43:38,878][26022] Updated weights on worker 0-0, policy_version 1235848 (0.00086) [2022-07-11 14:43:40,620][26022] Updated weights on worker 0-0, policy_version 1235858 (0.00094) [2022-07-11 14:43:40,964][25689] Fps is (10 sec: 5521.9, 60 sec: 5536.0, 300 sec: 5563.8). Total num frames: 1265519616. Throughput: 0: 5828.7. Samples: 1265523584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:40,965][25689] Avg episode reward: [(0, '1.735')] [2022-07-11 14:43:42,449][26022] Updated weights on worker 0-0, policy_version 1235868 (0.00091) [2022-07-11 14:43:44,374][26022] Updated weights on worker 0-0, policy_version 1235878 (0.00086) [2022-07-11 14:43:45,981][25689] Fps is (10 sec: 5417.1, 60 sec: 5521.7, 300 sec: 5558.2). Total num frames: 1265547264. Throughput: 0: 5808.1. Samples: 1265556666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:45,982][25689] Avg episode reward: [(0, '1.785')] [2022-07-11 14:43:46,116][26022] Updated weights on worker 0-0, policy_version 1235888 (0.00087) [2022-07-11 14:43:48,048][26022] Updated weights on worker 0-0, policy_version 1235898 (0.00094) [2022-07-11 14:43:49,847][26022] Updated weights on worker 0-0, policy_version 1235908 (0.00088) [2022-07-11 14:43:51,078][25689] Fps is (10 sec: 5569.0, 60 sec: 5536.6, 300 sec: 5560.1). Total num frames: 1265575936. Throughput: 0: 4979.3. Samples: 1265573292. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:51,078][25689] Avg episode reward: [(0, '0.947')] [2022-07-11 14:43:51,756][26022] Updated weights on worker 0-0, policy_version 1235918 (0.00081) [2022-07-11 14:43:53,659][26022] Updated weights on worker 0-0, policy_version 1235928 (0.00501) [2022-07-11 14:43:55,468][26022] Updated weights on worker 0-0, policy_version 1235938 (0.00086) [2022-07-11 14:43:56,126][25689] Fps is (10 sec: 5551.9, 60 sec: 5533.3, 300 sec: 5559.6). Total num frames: 1265603584. Throughput: 0: 5775.4. Samples: 1265606374. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:43:56,126][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 14:43:57,386][26022] Updated weights on worker 0-0, policy_version 1235948 (0.00088) [2022-07-11 14:43:59,325][26022] Updated weights on worker 0-0, policy_version 1235958 (0.00085) [2022-07-11 14:44:00,925][26022] Updated weights on worker 0-0, policy_version 1235968 (0.00088) [2022-07-11 14:44:01,193][25689] Fps is (10 sec: 5567.7, 60 sec: 5528.1, 300 sec: 5565.6). Total num frames: 1265632256. Throughput: 0: 5755.7. Samples: 1265640014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:01,195][25689] Avg episode reward: [(0, '-0.090')] [2022-07-11 14:44:03,254][26022] Updated weights on worker 0-0, policy_version 1235978 (0.00086) [2022-07-11 14:44:04,939][26022] Updated weights on worker 0-0, policy_version 1235988 (0.00083) [2022-07-11 14:44:06,210][25689] Fps is (10 sec: 5381.6, 60 sec: 5535.0, 300 sec: 5556.2). Total num frames: 1265657856. Throughput: 0: 4856.2. Samples: 1265654902. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:06,211][25689] Avg episode reward: [(0, '-0.194')] [2022-07-11 14:44:06,883][26022] Updated weights on worker 0-0, policy_version 1235998 (0.00090) [2022-07-11 14:44:08,674][26022] Updated weights on worker 0-0, policy_version 1236008 (0.00086) [2022-07-11 14:44:10,504][26022] Updated weights on worker 0-0, policy_version 1236018 (0.00082) [2022-07-11 14:44:11,368][25689] Fps is (10 sec: 5434.3, 60 sec: 5525.4, 300 sec: 5563.9). Total num frames: 1265687552. Throughput: 0: 5682.8. Samples: 1265688600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:11,369][25689] Avg episode reward: [(0, '-0.054')] [2022-07-11 14:44:12,370][26022] Updated weights on worker 0-0, policy_version 1236028 (0.00086) [2022-07-11 14:44:14,180][26022] Updated weights on worker 0-0, policy_version 1236038 (0.00084) [2022-07-11 14:44:14,928][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:44:14,943][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001236043_1265708032.pth [2022-07-11 14:44:14,943][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001234084_1263702016.pth [2022-07-11 14:44:16,138][26022] Updated weights on worker 0-0, policy_version 1236048 (0.00252) [2022-07-11 14:44:16,435][25689] Fps is (10 sec: 5608.5, 60 sec: 5521.8, 300 sec: 5556.0). Total num frames: 1265715200. Throughput: 0: 5708.5. Samples: 1265722310. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:16,435][25689] Avg episode reward: [(0, '-0.112')] [2022-07-11 14:44:17,571][26022] Updated weights on worker 0-0, policy_version 1236058 (0.00088) [2022-07-11 14:44:19,829][26022] Updated weights on worker 0-0, policy_version 1236068 (0.00085) [2022-07-11 14:44:21,281][26022] Updated weights on worker 0-0, policy_version 1236078 (0.00096) [2022-07-11 14:44:21,462][25689] Fps is (10 sec: 5579.6, 60 sec: 5556.8, 300 sec: 5562.5). Total num frames: 1265743872. Throughput: 0: 4886.7. Samples: 1265739064. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:21,463][25689] Avg episode reward: [(0, '0.761')] [2022-07-11 14:44:23,412][26022] Updated weights on worker 0-0, policy_version 1236088 (0.00091) [2022-07-11 14:44:25,023][26022] Updated weights on worker 0-0, policy_version 1236098 (0.00086) [2022-07-11 14:44:26,531][25689] Fps is (10 sec: 5477.0, 60 sec: 5533.9, 300 sec: 5551.7). Total num frames: 1265770496. Throughput: 0: 5782.5. Samples: 1265772406. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:26,531][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 14:44:26,939][26022] Updated weights on worker 0-0, policy_version 1236108 (0.00086) [2022-07-11 14:44:28,826][26022] Updated weights on worker 0-0, policy_version 1236118 (0.00084) [2022-07-11 14:44:30,595][26022] Updated weights on worker 0-0, policy_version 1236128 (0.00092) [2022-07-11 14:44:31,631][25689] Fps is (10 sec: 5538.6, 60 sec: 5535.1, 300 sec: 5560.6). Total num frames: 1265800192. Throughput: 0: 5797.0. Samples: 1265806062. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:31,632][25689] Avg episode reward: [(0, '0.135')] [2022-07-11 14:44:32,487][26022] Updated weights on worker 0-0, policy_version 1236138 (0.00090) [2022-07-11 14:44:34,258][26022] Updated weights on worker 0-0, policy_version 1236148 (0.00086) [2022-07-11 14:44:36,193][26022] Updated weights on worker 0-0, policy_version 1236158 (0.00087) [2022-07-11 14:44:36,632][25689] Fps is (10 sec: 5879.7, 60 sec: 5553.0, 300 sec: 5564.3). Total num frames: 1265829888. Throughput: 0: 4973.4. Samples: 1265822758. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:36,633][25689] Avg episode reward: [(0, '-0.227')] [2022-07-11 14:44:37,944][26022] Updated weights on worker 0-0, policy_version 1236168 (0.00090) [2022-07-11 14:44:39,588][26022] Updated weights on worker 0-0, policy_version 1236178 (0.00085) [2022-07-11 14:44:41,575][26022] Updated weights on worker 0-0, policy_version 1236188 (0.00088) [2022-07-11 14:44:41,634][25689] Fps is (10 sec: 5630.5, 60 sec: 5553.0, 300 sec: 5557.7). Total num frames: 1265856512. Throughput: 0: 5815.7. Samples: 1265856376. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 14:44:41,634][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 14:44:43,276][26022] Updated weights on worker 0-0, policy_version 1236198 (0.00080) [2022-07-11 14:44:45,327][26022] Updated weights on worker 0-0, policy_version 1236208 (0.00089) [2022-07-11 14:44:46,646][25689] Fps is (10 sec: 5522.1, 60 sec: 5570.3, 300 sec: 5559.0). Total num frames: 1265885184. Throughput: 0: 5846.0. Samples: 1265889996. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:44:46,646][25689] Avg episode reward: [(0, '-0.245')] [2022-07-11 14:44:47,088][26022] Updated weights on worker 0-0, policy_version 1236218 (0.00084) [2022-07-11 14:44:48,947][26022] Updated weights on worker 0-0, policy_version 1236228 (0.00050) [2022-07-11 14:44:50,900][26022] Updated weights on worker 0-0, policy_version 1236238 (0.00096) [2022-07-11 14:44:51,751][25689] Fps is (10 sec: 5465.8, 60 sec: 5535.8, 300 sec: 5550.8). Total num frames: 1265911808. Throughput: 0: 5011.0. Samples: 1265906880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:44:51,751][25689] Avg episode reward: [(0, '0.161')] [2022-07-11 14:44:52,599][26022] Updated weights on worker 0-0, policy_version 1236248 (0.00094) [2022-07-11 14:44:54,595][26022] Updated weights on worker 0-0, policy_version 1236258 (0.00088) [2022-07-11 14:44:56,297][26022] Updated weights on worker 0-0, policy_version 1236268 (0.00090) [2022-07-11 14:44:56,799][25689] Fps is (10 sec: 5446.2, 60 sec: 5552.6, 300 sec: 5553.8). Total num frames: 1265940480. Throughput: 0: 5821.2. Samples: 1265940152. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:44:56,799][25689] Avg episode reward: [(0, '0.141')] [2022-07-11 14:44:58,177][26022] Updated weights on worker 0-0, policy_version 1236278 (0.00080) [2022-07-11 14:45:00,070][26022] Updated weights on worker 0-0, policy_version 1236288 (0.00091) [2022-07-11 14:45:01,818][25689] Fps is (10 sec: 5594.2, 60 sec: 5540.2, 300 sec: 5553.6). Total num frames: 1265968128. Throughput: 0: 5818.2. Samples: 1265973812. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:01,819][25689] Avg episode reward: [(0, '0.109')] [2022-07-11 14:45:02,284][26022] Updated weights on worker 0-0, policy_version 1236298 (0.00088) [2022-07-11 14:45:04,062][26022] Updated weights on worker 0-0, policy_version 1236308 (0.00121) [2022-07-11 14:45:05,820][26022] Updated weights on worker 0-0, policy_version 1236318 (0.00087) [2022-07-11 14:45:06,839][25689] Fps is (10 sec: 5405.7, 60 sec: 5556.7, 300 sec: 5557.4). Total num frames: 1265994752. Throughput: 0: 4881.3. Samples: 1265988564. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:06,839][25689] Avg episode reward: [(0, '0.333')] [2022-07-11 14:45:07,718][26022] Updated weights on worker 0-0, policy_version 1236328 (0.00083) [2022-07-11 14:45:09,539][26022] Updated weights on worker 0-0, policy_version 1236338 (0.00484) [2022-07-11 14:45:11,377][26022] Updated weights on worker 0-0, policy_version 1236348 (0.00088) [2022-07-11 14:45:11,927][25689] Fps is (10 sec: 5571.3, 60 sec: 5563.1, 300 sec: 5559.3). Total num frames: 1266024448. Throughput: 0: 5720.8. Samples: 1266022306. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:11,928][25689] Avg episode reward: [(0, '0.163')] [2022-07-11 14:45:13,228][26022] Updated weights on worker 0-0, policy_version 1236358 (0.00084) [2022-07-11 14:45:15,021][26022] Updated weights on worker 0-0, policy_version 1236368 (0.00086) [2022-07-11 14:45:16,825][26022] Updated weights on worker 0-0, policy_version 1236378 (0.00085) [2022-07-11 14:45:16,964][25689] Fps is (10 sec: 5562.2, 60 sec: 5548.9, 300 sec: 5552.3). Total num frames: 1266051072. Throughput: 0: 5729.0. Samples: 1266055680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:16,965][25689] Avg episode reward: [(0, '0.053')] [2022-07-11 14:45:18,714][26022] Updated weights on worker 0-0, policy_version 1236388 (0.00089) [2022-07-11 14:45:20,540][26022] Updated weights on worker 0-0, policy_version 1236398 (0.00084) [2022-07-11 14:45:22,008][25689] Fps is (10 sec: 5485.2, 60 sec: 5547.4, 300 sec: 5555.0). Total num frames: 1266079744. Throughput: 0: 4896.5. Samples: 1266072672. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:22,008][25689] Avg episode reward: [(0, '0.051')] [2022-07-11 14:45:22,347][26022] Updated weights on worker 0-0, policy_version 1236408 (0.00094) [2022-07-11 14:45:24,148][26022] Updated weights on worker 0-0, policy_version 1236418 (0.00092) [2022-07-11 14:45:26,179][26022] Updated weights on worker 0-0, policy_version 1236428 (0.00088) [2022-07-11 14:45:27,027][25689] Fps is (10 sec: 5495.1, 60 sec: 5552.0, 300 sec: 5548.9). Total num frames: 1266106368. Throughput: 0: 5814.2. Samples: 1266105942. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:27,027][25689] Avg episode reward: [(0, '1.512')] [2022-07-11 14:45:27,877][26022] Updated weights on worker 0-0, policy_version 1236438 (0.00091) [2022-07-11 14:45:29,895][26022] Updated weights on worker 0-0, policy_version 1236448 (0.00090) [2022-07-11 14:45:31,546][26022] Updated weights on worker 0-0, policy_version 1236458 (0.00087) [2022-07-11 14:45:32,114][25689] Fps is (10 sec: 5674.4, 60 sec: 5570.1, 300 sec: 5557.7). Total num frames: 1266137088. Throughput: 0: 5798.5. Samples: 1266139358. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:32,114][25689] Avg episode reward: [(0, '1.460')] [2022-07-11 14:45:33,451][26022] Updated weights on worker 0-0, policy_version 1236468 (0.00092) [2022-07-11 14:45:35,162][26022] Updated weights on worker 0-0, policy_version 1236478 (0.00080) [2022-07-11 14:45:37,019][26022] Updated weights on worker 0-0, policy_version 1236488 (0.00087) [2022-07-11 14:45:37,119][25689] Fps is (10 sec: 5783.3, 60 sec: 5535.8, 300 sec: 5555.1). Total num frames: 1266164736. Throughput: 0: 5837.1. Samples: 1266173328. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:37,120][25689] Avg episode reward: [(0, '1.428')] [2022-07-11 14:45:39,098][26022] Updated weights on worker 0-0, policy_version 1236498 (0.00084) [2022-07-11 14:45:40,492][26022] Updated weights on worker 0-0, policy_version 1236508 (0.00090) [2022-07-11 14:45:42,154][25689] Fps is (10 sec: 5303.6, 60 sec: 5515.9, 300 sec: 5548.6). Total num frames: 1266190336. Throughput: 0: 5829.3. Samples: 1266190108. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:42,154][25689] Avg episode reward: [(0, '1.852')] [2022-07-11 14:45:42,532][26022] Updated weights on worker 0-0, policy_version 1236518 (0.00077) [2022-07-11 14:45:44,327][26022] Updated weights on worker 0-0, policy_version 1236528 (0.00084) [2022-07-11 14:45:46,022][26022] Updated weights on worker 0-0, policy_version 1236538 (0.00086) [2022-07-11 14:45:47,206][25689] Fps is (10 sec: 5583.9, 60 sec: 5546.1, 300 sec: 5556.2). Total num frames: 1266221056. Throughput: 0: 5843.0. Samples: 1266223846. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:47,206][25689] Avg episode reward: [(0, '1.802')] [2022-07-11 14:45:48,201][26022] Updated weights on worker 0-0, policy_version 1236548 (0.00077) [2022-07-11 14:45:49,712][26022] Updated weights on worker 0-0, policy_version 1236558 (0.00085) [2022-07-11 14:45:51,797][26022] Updated weights on worker 0-0, policy_version 1236568 (0.00088) [2022-07-11 14:45:52,293][25689] Fps is (10 sec: 5756.9, 60 sec: 5564.7, 300 sec: 5555.5). Total num frames: 1266248704. Throughput: 0: 5839.2. Samples: 1266257186. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:52,293][25689] Avg episode reward: [(0, '1.801')] [2022-07-11 14:45:53,684][26022] Updated weights on worker 0-0, policy_version 1236578 (0.00090) [2022-07-11 14:45:55,280][26022] Updated weights on worker 0-0, policy_version 1236588 (0.00080) [2022-07-11 14:45:57,068][26022] Updated weights on worker 0-0, policy_version 1236598 (0.00084) [2022-07-11 14:45:57,328][25689] Fps is (10 sec: 5462.7, 60 sec: 5548.9, 300 sec: 5552.0). Total num frames: 1266276352. Throughput: 0: 4995.2. Samples: 1266274274. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:45:57,329][25689] Avg episode reward: [(0, '2.185')] [2022-07-11 14:45:58,787][26022] Updated weights on worker 0-0, policy_version 1236608 (0.00080) [2022-07-11 14:46:00,698][26022] Updated weights on worker 0-0, policy_version 1236618 (0.00082) [2022-07-11 14:46:02,351][25689] Fps is (10 sec: 5497.6, 60 sec: 5548.6, 300 sec: 5555.8). Total num frames: 1266304000. Throughput: 0: 5857.7. Samples: 1266308416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:02,351][25689] Avg episode reward: [(0, '2.457')] [2022-07-11 14:46:02,884][26022] Updated weights on worker 0-0, policy_version 1236628 (0.00089) [2022-07-11 14:46:04,730][26022] Updated weights on worker 0-0, policy_version 1236638 (0.00084) [2022-07-11 14:46:06,503][26022] Updated weights on worker 0-0, policy_version 1236648 (0.00084) [2022-07-11 14:46:07,374][25689] Fps is (10 sec: 5504.5, 60 sec: 5565.3, 300 sec: 5553.1). Total num frames: 1266331648. Throughput: 0: 5764.7. Samples: 1266340108. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:07,374][25689] Avg episode reward: [(0, '1.965')] [2022-07-11 14:46:08,455][26022] Updated weights on worker 0-0, policy_version 1236658 (0.00086) [2022-07-11 14:46:10,147][26022] Updated weights on worker 0-0, policy_version 1236668 (0.00084) [2022-07-11 14:46:11,953][26022] Updated weights on worker 0-0, policy_version 1236678 (0.00091) [2022-07-11 14:46:12,482][25689] Fps is (10 sec: 5660.2, 60 sec: 5563.5, 300 sec: 5554.7). Total num frames: 1266361344. Throughput: 0: 4947.6. Samples: 1266357074. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:12,482][25689] Avg episode reward: [(0, '1.710')] [2022-07-11 14:46:13,778][26022] Updated weights on worker 0-0, policy_version 1236688 (0.00098) [2022-07-11 14:46:15,033][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:46:15,042][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001236694_1266374656.pth [2022-07-11 14:46:15,043][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001234737_1264370688.pth [2022-07-11 14:46:15,584][26022] Updated weights on worker 0-0, policy_version 1236698 (0.00086) [2022-07-11 14:46:17,484][25689] Fps is (10 sec: 5570.3, 60 sec: 5566.7, 300 sec: 5555.6). Total num frames: 1266387968. Throughput: 0: 5786.9. Samples: 1266390918. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:17,485][25689] Avg episode reward: [(0, '1.611')] [2022-07-11 14:46:17,551][26022] Updated weights on worker 0-0, policy_version 1236708 (0.00091) [2022-07-11 14:46:19,216][26022] Updated weights on worker 0-0, policy_version 1236718 (0.00086) [2022-07-11 14:46:21,103][26022] Updated weights on worker 0-0, policy_version 1236728 (0.00119) [2022-07-11 14:46:22,498][25689] Fps is (10 sec: 5520.8, 60 sec: 5569.5, 300 sec: 5553.6). Total num frames: 1266416640. Throughput: 0: 5773.0. Samples: 1266424724. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:22,498][25689] Avg episode reward: [(0, '1.498')] [2022-07-11 14:46:22,782][26022] Updated weights on worker 0-0, policy_version 1236738 (0.00092) [2022-07-11 14:46:24,819][26022] Updated weights on worker 0-0, policy_version 1236748 (0.00096) [2022-07-11 14:46:26,517][26022] Updated weights on worker 0-0, policy_version 1236758 (0.00081) [2022-07-11 14:46:27,510][25689] Fps is (10 sec: 5617.4, 60 sec: 5587.0, 300 sec: 5552.2). Total num frames: 1266444288. Throughput: 0: 5025.4. Samples: 1266441304. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:27,511][25689] Avg episode reward: [(0, '1.324')] [2022-07-11 14:46:28,582][26022] Updated weights on worker 0-0, policy_version 1236768 (0.00097) [2022-07-11 14:46:30,194][26022] Updated weights on worker 0-0, policy_version 1236778 (0.00083) [2022-07-11 14:46:32,121][26022] Updated weights on worker 0-0, policy_version 1236788 (0.00094) [2022-07-11 14:46:32,565][25689] Fps is (10 sec: 5492.8, 60 sec: 5539.2, 300 sec: 5551.4). Total num frames: 1266471936. Throughput: 0: 5863.1. Samples: 1266474822. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:32,565][25689] Avg episode reward: [(0, '1.026')] [2022-07-11 14:46:33,882][26022] Updated weights on worker 0-0, policy_version 1236798 (0.00084) [2022-07-11 14:46:35,824][26022] Updated weights on worker 0-0, policy_version 1236808 (0.00094) [2022-07-11 14:46:37,522][26022] Updated weights on worker 0-0, policy_version 1236818 (0.00090) [2022-07-11 14:46:37,567][25689] Fps is (10 sec: 5702.3, 60 sec: 5573.4, 300 sec: 5559.3). Total num frames: 1266501632. Throughput: 0: 5850.5. Samples: 1266508410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:37,567][25689] Avg episode reward: [(0, '1.570')] [2022-07-11 14:46:39,565][26022] Updated weights on worker 0-0, policy_version 1236828 (0.00085) [2022-07-11 14:46:41,301][26022] Updated weights on worker 0-0, policy_version 1236838 (0.00088) [2022-07-11 14:46:42,578][25689] Fps is (10 sec: 5726.6, 60 sec: 5609.4, 300 sec: 5556.2). Total num frames: 1266529280. Throughput: 0: 5002.0. Samples: 1266525166. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:42,580][25689] Avg episode reward: [(0, '1.869')] [2022-07-11 14:46:43,122][26022] Updated weights on worker 0-0, policy_version 1236848 (0.00087) [2022-07-11 14:46:45,052][26022] Updated weights on worker 0-0, policy_version 1236858 (0.00090) [2022-07-11 14:46:46,541][26022] Updated weights on worker 0-0, policy_version 1236868 (0.00091) [2022-07-11 14:46:47,601][25689] Fps is (10 sec: 5408.6, 60 sec: 5544.2, 300 sec: 5551.3). Total num frames: 1266555904. Throughput: 0: 5850.6. Samples: 1266558846. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:47,602][25689] Avg episode reward: [(0, '1.967')] [2022-07-11 14:46:48,653][26022] Updated weights on worker 0-0, policy_version 1236878 (0.00091) [2022-07-11 14:46:50,400][26022] Updated weights on worker 0-0, policy_version 1236888 (0.00094) [2022-07-11 14:46:52,339][26022] Updated weights on worker 0-0, policy_version 1236898 (0.00087) [2022-07-11 14:46:52,670][25689] Fps is (10 sec: 5580.9, 60 sec: 5579.8, 300 sec: 5560.5). Total num frames: 1266585600. Throughput: 0: 5835.2. Samples: 1266592140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:52,679][25689] Avg episode reward: [(0, '0.029')] [2022-07-11 14:46:53,995][26022] Updated weights on worker 0-0, policy_version 1236908 (0.00097) [2022-07-11 14:46:55,951][26022] Updated weights on worker 0-0, policy_version 1236918 (0.00093) [2022-07-11 14:46:57,689][25689] Fps is (10 sec: 5583.2, 60 sec: 5564.4, 300 sec: 5553.5). Total num frames: 1266612224. Throughput: 0: 4989.7. Samples: 1266608812. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:46:57,689][25689] Avg episode reward: [(0, '0.113')] [2022-07-11 14:46:57,943][26022] Updated weights on worker 0-0, policy_version 1236928 (0.00093) [2022-07-11 14:46:59,848][26022] Updated weights on worker 0-0, policy_version 1236938 (0.00095) [2022-07-11 14:47:01,988][26022] Updated weights on worker 0-0, policy_version 1236948 (0.00088) [2022-07-11 14:47:02,700][25689] Fps is (10 sec: 5206.9, 60 sec: 5531.6, 300 sec: 5547.3). Total num frames: 1266637824. Throughput: 0: 5781.9. Samples: 1266641504. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:02,700][25689] Avg episode reward: [(0, '-0.148')] [2022-07-11 14:47:03,777][26022] Updated weights on worker 0-0, policy_version 1236958 (0.00083) [2022-07-11 14:47:05,679][26022] Updated weights on worker 0-0, policy_version 1236968 (0.00091) [2022-07-11 14:47:07,447][26022] Updated weights on worker 0-0, policy_version 1236978 (0.00108) [2022-07-11 14:47:07,711][25689] Fps is (10 sec: 5415.0, 60 sec: 5549.6, 300 sec: 5555.1). Total num frames: 1266666496. Throughput: 0: 5698.3. Samples: 1266673438. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:07,712][25689] Avg episode reward: [(0, '-0.144')] [2022-07-11 14:47:09,501][26022] Updated weights on worker 0-0, policy_version 1236988 (0.00086) [2022-07-11 14:47:11,180][26022] Updated weights on worker 0-0, policy_version 1236998 (0.00086) [2022-07-11 14:47:12,747][25689] Fps is (10 sec: 5605.5, 60 sec: 5522.3, 300 sec: 5548.0). Total num frames: 1266694144. Throughput: 0: 4880.0. Samples: 1266690116. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:12,747][25689] Avg episode reward: [(0, '-0.397')] [2022-07-11 14:47:12,977][26022] Updated weights on worker 0-0, policy_version 1237008 (0.00091) [2022-07-11 14:47:14,951][26022] Updated weights on worker 0-0, policy_version 1237018 (0.00092) [2022-07-11 14:47:16,627][26022] Updated weights on worker 0-0, policy_version 1237028 (0.00089) [2022-07-11 14:47:17,750][25689] Fps is (10 sec: 5610.4, 60 sec: 5556.2, 300 sec: 5555.1). Total num frames: 1266722816. Throughput: 0: 5726.9. Samples: 1266723700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:17,750][25689] Avg episode reward: [(0, '-0.271')] [2022-07-11 14:47:18,455][26022] Updated weights on worker 0-0, policy_version 1237038 (0.00089) [2022-07-11 14:47:20,319][26022] Updated weights on worker 0-0, policy_version 1237048 (0.00089) [2022-07-11 14:47:22,119][26022] Updated weights on worker 0-0, policy_version 1237058 (0.00054) [2022-07-11 14:47:22,757][25689] Fps is (10 sec: 5626.5, 60 sec: 5539.8, 300 sec: 5548.8). Total num frames: 1266750464. Throughput: 0: 5778.8. Samples: 1266757410. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:22,757][25689] Avg episode reward: [(0, '0.956')] [2022-07-11 14:47:23,979][26022] Updated weights on worker 0-0, policy_version 1237068 (0.00452) [2022-07-11 14:47:25,909][26022] Updated weights on worker 0-0, policy_version 1237078 (0.00084) [2022-07-11 14:47:27,563][26022] Updated weights on worker 0-0, policy_version 1237088 (0.00095) [2022-07-11 14:47:27,761][25689] Fps is (10 sec: 5625.3, 60 sec: 5557.5, 300 sec: 5554.5). Total num frames: 1266779136. Throughput: 0: 5013.8. Samples: 1266773968. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:27,762][25689] Avg episode reward: [(0, '1.084')] [2022-07-11 14:47:29,745][26022] Updated weights on worker 0-0, policy_version 1237098 (0.00093) [2022-07-11 14:47:31,306][26022] Updated weights on worker 0-0, policy_version 1237108 (0.00089) [2022-07-11 14:47:32,863][25689] Fps is (10 sec: 5471.2, 60 sec: 5536.2, 300 sec: 5543.3). Total num frames: 1266805760. Throughput: 0: 5818.3. Samples: 1266807160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:32,864][25689] Avg episode reward: [(0, '1.219')] [2022-07-11 14:47:33,383][26022] Updated weights on worker 0-0, policy_version 1237118 (0.00084) [2022-07-11 14:47:34,967][26022] Updated weights on worker 0-0, policy_version 1237128 (0.00097) [2022-07-11 14:47:36,939][26022] Updated weights on worker 0-0, policy_version 1237138 (0.00089) [2022-07-11 14:47:37,939][25689] Fps is (10 sec: 5533.7, 60 sec: 5529.4, 300 sec: 5549.3). Total num frames: 1266835456. Throughput: 0: 5803.0. Samples: 1266840860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:37,940][25689] Avg episode reward: [(0, '1.576')] [2022-07-11 14:47:38,808][26022] Updated weights on worker 0-0, policy_version 1237148 (0.00090) [2022-07-11 14:47:40,503][26022] Updated weights on worker 0-0, policy_version 1237158 (0.00091) [2022-07-11 14:47:42,400][26022] Updated weights on worker 0-0, policy_version 1237168 (0.00088) [2022-07-11 14:47:42,976][25689] Fps is (10 sec: 5670.5, 60 sec: 5527.1, 300 sec: 5546.0). Total num frames: 1266863104. Throughput: 0: 4958.4. Samples: 1266857666. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:42,976][25689] Avg episode reward: [(0, '1.528')] [2022-07-11 14:47:44,178][26022] Updated weights on worker 0-0, policy_version 1237178 (0.00089) [2022-07-11 14:47:46,019][26022] Updated weights on worker 0-0, policy_version 1237188 (0.00088) [2022-07-11 14:47:47,866][26022] Updated weights on worker 0-0, policy_version 1237198 (0.00091) [2022-07-11 14:47:48,001][25689] Fps is (10 sec: 5495.4, 60 sec: 5543.8, 300 sec: 5546.9). Total num frames: 1266890752. Throughput: 0: 5795.1. Samples: 1266891260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:48,002][25689] Avg episode reward: [(0, '1.445')] [2022-07-11 14:47:49,907][26022] Updated weights on worker 0-0, policy_version 1237208 (0.00087) [2022-07-11 14:47:51,354][26022] Updated weights on worker 0-0, policy_version 1237218 (0.00372) [2022-07-11 14:47:53,127][25689] Fps is (10 sec: 5548.3, 60 sec: 5521.7, 300 sec: 5548.2). Total num frames: 1266919424. Throughput: 0: 5808.5. Samples: 1266924860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:53,127][25689] Avg episode reward: [(0, '1.672')] [2022-07-11 14:47:53,437][26022] Updated weights on worker 0-0, policy_version 1237228 (0.00082) [2022-07-11 14:47:55,168][26022] Updated weights on worker 0-0, policy_version 1237238 (0.00090) [2022-07-11 14:47:57,201][26022] Updated weights on worker 0-0, policy_version 1237248 (0.00083) [2022-07-11 14:47:58,144][25689] Fps is (10 sec: 5754.8, 60 sec: 5572.7, 300 sec: 5551.5). Total num frames: 1266949120. Throughput: 0: 4982.6. Samples: 1266941532. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:47:58,145][25689] Avg episode reward: [(0, '1.832')] [2022-07-11 14:47:58,818][26022] Updated weights on worker 0-0, policy_version 1237258 (0.00086) [2022-07-11 14:48:00,881][26022] Updated weights on worker 0-0, policy_version 1237268 (0.00088) [2022-07-11 14:48:02,855][26022] Updated weights on worker 0-0, policy_version 1237278 (0.00093) [2022-07-11 14:48:03,234][25689] Fps is (10 sec: 5369.9, 60 sec: 5548.5, 300 sec: 5548.2). Total num frames: 1266973696. Throughput: 0: 5798.6. Samples: 1266975132. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:03,234][25689] Avg episode reward: [(0, '1.580')] [2022-07-11 14:48:04,769][26022] Updated weights on worker 0-0, policy_version 1237288 (0.00087) [2022-07-11 14:48:06,506][26022] Updated weights on worker 0-0, policy_version 1237298 (0.00090) [2022-07-11 14:48:08,308][25689] Fps is (10 sec: 5239.0, 60 sec: 5542.8, 300 sec: 5544.3). Total num frames: 1267002368. Throughput: 0: 5692.6. Samples: 1267006856. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:08,308][25689] Avg episode reward: [(0, '1.164')] [2022-07-11 14:48:08,354][26022] Updated weights on worker 0-0, policy_version 1237308 (0.00088) [2022-07-11 14:48:10,377][26022] Updated weights on worker 0-0, policy_version 1237318 (0.00084) [2022-07-11 14:48:12,026][26022] Updated weights on worker 0-0, policy_version 1237328 (0.00086) [2022-07-11 14:48:13,418][25689] Fps is (10 sec: 5630.8, 60 sec: 5552.9, 300 sec: 5546.2). Total num frames: 1267031040. Throughput: 0: 5714.5. Samples: 1267040812. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:13,418][25689] Avg episode reward: [(0, '0.436')] [2022-07-11 14:48:13,690][26022] Updated weights on worker 0-0, policy_version 1237338 (0.00055) [2022-07-11 14:48:15,246][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:48:15,264][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001237345_1267041280.pth [2022-07-11 14:48:15,265][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001235391_1265040384.pth [2022-07-11 14:48:15,475][26022] Updated weights on worker 0-0, policy_version 1237348 (0.00086) [2022-07-11 14:48:17,532][26022] Updated weights on worker 0-0, policy_version 1237358 (0.00091) [2022-07-11 14:48:18,435][25689] Fps is (10 sec: 5662.4, 60 sec: 5551.5, 300 sec: 5553.5). Total num frames: 1267059712. Throughput: 0: 5729.5. Samples: 1267057788. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:18,435][25689] Avg episode reward: [(0, '0.359')] [2022-07-11 14:48:19,117][26022] Updated weights on worker 0-0, policy_version 1237368 (0.00093) [2022-07-11 14:48:21,255][26022] Updated weights on worker 0-0, policy_version 1237378 (0.00080) [2022-07-11 14:48:22,832][26022] Updated weights on worker 0-0, policy_version 1237388 (0.00091) [2022-07-11 14:48:23,494][25689] Fps is (10 sec: 5690.9, 60 sec: 5563.6, 300 sec: 5555.9). Total num frames: 1267088384. Throughput: 0: 5749.3. Samples: 1267091616. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:23,495][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 14:48:24,596][26022] Updated weights on worker 0-0, policy_version 1237398 (0.00093) [2022-07-11 14:48:26,703][26022] Updated weights on worker 0-0, policy_version 1237408 (0.00095) [2022-07-11 14:48:28,411][26022] Updated weights on worker 0-0, policy_version 1237418 (0.00092) [2022-07-11 14:48:28,511][25689] Fps is (10 sec: 5589.5, 60 sec: 5545.7, 300 sec: 5550.8). Total num frames: 1267116032. Throughput: 0: 5848.7. Samples: 1267125018. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:28,511][25689] Avg episode reward: [(0, '0.223')] [2022-07-11 14:48:30,328][26022] Updated weights on worker 0-0, policy_version 1237428 (0.00094) [2022-07-11 14:48:32,230][26022] Updated weights on worker 0-0, policy_version 1237438 (0.00088) [2022-07-11 14:48:33,613][25689] Fps is (10 sec: 5565.8, 60 sec: 5579.3, 300 sec: 5549.2). Total num frames: 1267144704. Throughput: 0: 4999.6. Samples: 1267141782. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:33,614][25689] Avg episode reward: [(0, '-0.665')] [2022-07-11 14:48:33,745][26022] Updated weights on worker 0-0, policy_version 1237448 (0.00085) [2022-07-11 14:48:35,962][26022] Updated weights on worker 0-0, policy_version 1237458 (0.00084) [2022-07-11 14:48:37,374][26022] Updated weights on worker 0-0, policy_version 1237468 (0.00083) [2022-07-11 14:48:38,672][25689] Fps is (10 sec: 5643.6, 60 sec: 5564.1, 300 sec: 5555.0). Total num frames: 1267173376. Throughput: 0: 5817.5. Samples: 1267175516. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:38,672][25689] Avg episode reward: [(0, '-0.021')] [2022-07-11 14:48:39,532][26022] Updated weights on worker 0-0, policy_version 1237478 (0.00086) [2022-07-11 14:48:41,218][26022] Updated weights on worker 0-0, policy_version 1237488 (0.00082) [2022-07-11 14:48:43,015][26022] Updated weights on worker 0-0, policy_version 1237498 (0.00089) [2022-07-11 14:48:43,699][25689] Fps is (10 sec: 5482.7, 60 sec: 5548.1, 300 sec: 5551.4). Total num frames: 1267200000. Throughput: 0: 5801.1. Samples: 1267208824. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 14:48:43,699][25689] Avg episode reward: [(0, '-0.112')] [2022-07-11 14:48:44,991][26022] Updated weights on worker 0-0, policy_version 1237508 (0.00095) [2022-07-11 14:48:46,760][26022] Updated weights on worker 0-0, policy_version 1237518 (0.00081) [2022-07-11 14:48:48,580][26022] Updated weights on worker 0-0, policy_version 1237528 (0.00089) [2022-07-11 14:48:48,711][25689] Fps is (10 sec: 5508.1, 60 sec: 5566.2, 300 sec: 5552.9). Total num frames: 1267228672. Throughput: 0: 4975.3. Samples: 1267225518. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:48:48,711][25689] Avg episode reward: [(0, '0.117')] [2022-07-11 14:48:50,415][26022] Updated weights on worker 0-0, policy_version 1237538 (0.00087) [2022-07-11 14:48:52,223][26022] Updated weights on worker 0-0, policy_version 1237548 (0.00085) [2022-07-11 14:48:53,782][25689] Fps is (10 sec: 5687.1, 60 sec: 5571.2, 300 sec: 5555.9). Total num frames: 1267257344. Throughput: 0: 5808.5. Samples: 1267258932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:48:53,782][25689] Avg episode reward: [(0, '0.116')] [2022-07-11 14:48:54,449][26022] Updated weights on worker 0-0, policy_version 1237558 (0.00085) [2022-07-11 14:48:55,726][26022] Updated weights on worker 0-0, policy_version 1237568 (0.00084) [2022-07-11 14:48:58,066][26022] Updated weights on worker 0-0, policy_version 1237578 (0.00085) [2022-07-11 14:48:58,794][25689] Fps is (10 sec: 5687.0, 60 sec: 5554.7, 300 sec: 5557.0). Total num frames: 1267286016. Throughput: 0: 5808.7. Samples: 1267292400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:48:58,796][25689] Avg episode reward: [(0, '0.285')] [2022-07-11 14:48:59,533][26022] Updated weights on worker 0-0, policy_version 1237588 (0.00100) [2022-07-11 14:49:01,513][26022] Updated weights on worker 0-0, policy_version 1237598 (0.00082) [2022-07-11 14:49:03,474][26022] Updated weights on worker 0-0, policy_version 1237608 (0.00090) [2022-07-11 14:49:03,803][25689] Fps is (10 sec: 5416.0, 60 sec: 5579.1, 300 sec: 5557.1). Total num frames: 1267311616. Throughput: 0: 4991.0. Samples: 1267309162. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:03,805][25689] Avg episode reward: [(0, '1.296')] [2022-07-11 14:49:05,445][26022] Updated weights on worker 0-0, policy_version 1237618 (0.00091) [2022-07-11 14:49:07,308][26022] Updated weights on worker 0-0, policy_version 1237628 (0.00089) [2022-07-11 14:49:08,805][25689] Fps is (10 sec: 5216.6, 60 sec: 5551.8, 300 sec: 5549.7). Total num frames: 1267338240. Throughput: 0: 5731.6. Samples: 1267340690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:08,807][25689] Avg episode reward: [(0, '1.694')] [2022-07-11 14:49:09,147][26022] Updated weights on worker 0-0, policy_version 1237638 (0.00089) [2022-07-11 14:49:10,900][26022] Updated weights on worker 0-0, policy_version 1237648 (0.00084) [2022-07-11 14:49:12,958][26022] Updated weights on worker 0-0, policy_version 1237658 (0.00088) [2022-07-11 14:49:13,912][25689] Fps is (10 sec: 5469.9, 60 sec: 5552.1, 300 sec: 5552.4). Total num frames: 1267366912. Throughput: 0: 5723.5. Samples: 1267374146. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:13,912][25689] Avg episode reward: [(0, '0.727')] [2022-07-11 14:49:14,571][26022] Updated weights on worker 0-0, policy_version 1237668 (0.00083) [2022-07-11 14:49:16,334][26022] Updated weights on worker 0-0, policy_version 1237678 (0.00078) [2022-07-11 14:49:18,472][26022] Updated weights on worker 0-0, policy_version 1237688 (0.00086) [2022-07-11 14:49:19,007][25689] Fps is (10 sec: 5520.5, 60 sec: 5528.1, 300 sec: 5547.7). Total num frames: 1267394560. Throughput: 0: 4884.8. Samples: 1267391136. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:19,007][25689] Avg episode reward: [(0, '1.103')] [2022-07-11 14:49:19,899][26022] Updated weights on worker 0-0, policy_version 1237698 (0.00091) [2022-07-11 14:49:22,121][26022] Updated weights on worker 0-0, policy_version 1237708 (0.00092) [2022-07-11 14:49:23,668][26022] Updated weights on worker 0-0, policy_version 1237718 (0.00090) [2022-07-11 14:49:24,021][25689] Fps is (10 sec: 5571.1, 60 sec: 5532.2, 300 sec: 5555.6). Total num frames: 1267423232. Throughput: 0: 5714.2. Samples: 1267424694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:24,023][25689] Avg episode reward: [(0, '0.328')] [2022-07-11 14:49:25,575][26022] Updated weights on worker 0-0, policy_version 1237728 (0.00089) [2022-07-11 14:49:27,678][26022] Updated weights on worker 0-0, policy_version 1237738 (0.00089) [2022-07-11 14:49:29,054][25689] Fps is (10 sec: 5707.7, 60 sec: 5547.7, 300 sec: 5553.4). Total num frames: 1267451904. Throughput: 0: 5800.6. Samples: 1267458144. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:29,054][25689] Avg episode reward: [(0, '-0.235')] [2022-07-11 14:49:29,199][26022] Updated weights on worker 0-0, policy_version 1237748 (0.00090) [2022-07-11 14:49:31,125][26022] Updated weights on worker 0-0, policy_version 1237758 (0.00089) [2022-07-11 14:49:33,281][26022] Updated weights on worker 0-0, policy_version 1237768 (0.00085) [2022-07-11 14:49:34,117][25689] Fps is (10 sec: 5680.0, 60 sec: 5551.3, 300 sec: 5548.8). Total num frames: 1267480576. Throughput: 0: 4978.9. Samples: 1267474744. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:34,117][25689] Avg episode reward: [(0, '-0.022')] [2022-07-11 14:49:34,664][26022] Updated weights on worker 0-0, policy_version 1237778 (0.00090) [2022-07-11 14:49:36,802][26022] Updated weights on worker 0-0, policy_version 1237788 (0.00085) [2022-07-11 14:49:38,475][26022] Updated weights on worker 0-0, policy_version 1237798 (0.00085) [2022-07-11 14:49:39,194][25689] Fps is (10 sec: 5553.9, 60 sec: 5532.6, 300 sec: 5550.8). Total num frames: 1267508224. Throughput: 0: 5817.7. Samples: 1267508580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:39,195][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 14:49:40,248][26022] Updated weights on worker 0-0, policy_version 1237808 (0.00093) [2022-07-11 14:49:42,322][26022] Updated weights on worker 0-0, policy_version 1237818 (0.00090) [2022-07-11 14:49:43,712][26022] Updated weights on worker 0-0, policy_version 1237828 (0.00092) [2022-07-11 14:49:44,211][25689] Fps is (10 sec: 5579.5, 60 sec: 5567.4, 300 sec: 5550.8). Total num frames: 1267536896. Throughput: 0: 5830.7. Samples: 1267542416. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:44,211][25689] Avg episode reward: [(0, '0.508')] [2022-07-11 14:49:45,803][26022] Updated weights on worker 0-0, policy_version 1237838 (0.00096) [2022-07-11 14:49:47,695][26022] Updated weights on worker 0-0, policy_version 1237848 (0.00092) [2022-07-11 14:49:49,233][25689] Fps is (10 sec: 5712.1, 60 sec: 5566.5, 300 sec: 5559.2). Total num frames: 1267565568. Throughput: 0: 5022.5. Samples: 1267559496. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:49,234][25689] Avg episode reward: [(0, '0.124')] [2022-07-11 14:49:49,299][26022] Updated weights on worker 0-0, policy_version 1237858 (0.00086) [2022-07-11 14:49:51,363][26022] Updated weights on worker 0-0, policy_version 1237868 (0.00086) [2022-07-11 14:49:52,992][26022] Updated weights on worker 0-0, policy_version 1237878 (0.00098) [2022-07-11 14:49:54,287][25689] Fps is (10 sec: 5589.2, 60 sec: 5551.1, 300 sec: 5555.6). Total num frames: 1267593216. Throughput: 0: 5874.1. Samples: 1267593228. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:54,288][25689] Avg episode reward: [(0, '1.209')] [2022-07-11 14:49:54,914][26022] Updated weights on worker 0-0, policy_version 1237888 (0.00083) [2022-07-11 14:49:56,641][26022] Updated weights on worker 0-0, policy_version 1237898 (0.00086) [2022-07-11 14:49:58,379][26022] Updated weights on worker 0-0, policy_version 1237908 (0.00091) [2022-07-11 14:49:59,303][25689] Fps is (10 sec: 5491.4, 60 sec: 5533.9, 300 sec: 5555.7). Total num frames: 1267620864. Throughput: 0: 5887.7. Samples: 1267626972. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:49:59,303][25689] Avg episode reward: [(0, '1.405')] [2022-07-11 14:50:00,314][26022] Updated weights on worker 0-0, policy_version 1237918 (0.00108) [2022-07-11 14:50:02,592][26022] Updated weights on worker 0-0, policy_version 1237928 (0.00084) [2022-07-11 14:50:04,193][26022] Updated weights on worker 0-0, policy_version 1237938 (0.00085) [2022-07-11 14:50:04,319][25689] Fps is (10 sec: 5512.3, 60 sec: 5567.0, 300 sec: 5559.2). Total num frames: 1267648512. Throughput: 0: 5013.5. Samples: 1267643228. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:04,319][25689] Avg episode reward: [(0, '1.251')] [2022-07-11 14:50:06,345][26022] Updated weights on worker 0-0, policy_version 1237948 (0.00090) [2022-07-11 14:50:08,003][26022] Updated weights on worker 0-0, policy_version 1237958 (0.00081) [2022-07-11 14:50:09,339][25689] Fps is (10 sec: 5407.5, 60 sec: 5565.4, 300 sec: 5550.2). Total num frames: 1267675136. Throughput: 0: 5768.0. Samples: 1267675466. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:09,340][25689] Avg episode reward: [(0, '1.620')] [2022-07-11 14:50:09,854][26022] Updated weights on worker 0-0, policy_version 1237968 (0.00086) [2022-07-11 14:50:11,838][26022] Updated weights on worker 0-0, policy_version 1237978 (0.00089) [2022-07-11 14:50:13,259][26022] Updated weights on worker 0-0, policy_version 1237988 (0.00082) [2022-07-11 14:50:14,379][25689] Fps is (10 sec: 5598.2, 60 sec: 5588.5, 300 sec: 5560.4). Total num frames: 1267704832. Throughput: 0: 5771.2. Samples: 1267709182. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:14,380][25689] Avg episode reward: [(0, '1.968')] [2022-07-11 14:50:15,452][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:50:15,466][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001237998_1267709952.pth [2022-07-11 14:50:15,467][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001236043_1265708032.pth [2022-07-11 14:50:15,470][26022] Updated weights on worker 0-0, policy_version 1237998 (0.00086) [2022-07-11 14:50:17,092][26022] Updated weights on worker 0-0, policy_version 1238008 (0.00085) [2022-07-11 14:50:18,980][26022] Updated weights on worker 0-0, policy_version 1238018 (0.00089) [2022-07-11 14:50:19,426][25689] Fps is (10 sec: 5786.8, 60 sec: 5609.9, 300 sec: 5560.4). Total num frames: 1267733504. Throughput: 0: 4916.7. Samples: 1267725910. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:19,426][25689] Avg episode reward: [(0, '1.901')] [2022-07-11 14:50:20,748][26022] Updated weights on worker 0-0, policy_version 1238028 (0.00092) [2022-07-11 14:50:22,592][26022] Updated weights on worker 0-0, policy_version 1238038 (0.00117) [2022-07-11 14:50:24,429][25689] Fps is (10 sec: 5502.4, 60 sec: 5577.1, 300 sec: 5560.7). Total num frames: 1267760128. Throughput: 0: 5784.2. Samples: 1267759546. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:24,430][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 14:50:24,453][26022] Updated weights on worker 0-0, policy_version 1238048 (0.00090) [2022-07-11 14:50:26,155][26022] Updated weights on worker 0-0, policy_version 1238058 (0.00088) [2022-07-11 14:50:28,023][26022] Updated weights on worker 0-0, policy_version 1238068 (0.00093) [2022-07-11 14:50:29,483][25689] Fps is (10 sec: 5498.0, 60 sec: 5575.0, 300 sec: 5554.4). Total num frames: 1267788800. Throughput: 0: 5853.0. Samples: 1267793368. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:29,484][25689] Avg episode reward: [(0, '0.700')] [2022-07-11 14:50:29,987][26022] Updated weights on worker 0-0, policy_version 1238078 (0.00090) [2022-07-11 14:50:31,732][26022] Updated weights on worker 0-0, policy_version 1238088 (0.00091) [2022-07-11 14:50:33,742][26022] Updated weights on worker 0-0, policy_version 1238098 (0.00089) [2022-07-11 14:50:34,527][25689] Fps is (10 sec: 5577.1, 60 sec: 5559.9, 300 sec: 5553.7). Total num frames: 1267816448. Throughput: 0: 4999.9. Samples: 1267809920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:34,527][25689] Avg episode reward: [(0, '0.855')] [2022-07-11 14:50:35,465][26022] Updated weights on worker 0-0, policy_version 1238108 (0.00065) [2022-07-11 14:50:37,336][26022] Updated weights on worker 0-0, policy_version 1238118 (0.00095) [2022-07-11 14:50:39,180][26022] Updated weights on worker 0-0, policy_version 1238128 (0.00082) [2022-07-11 14:50:39,594][25689] Fps is (10 sec: 5468.9, 60 sec: 5560.8, 300 sec: 5560.0). Total num frames: 1267844096. Throughput: 0: 5819.0. Samples: 1267843270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:39,595][25689] Avg episode reward: [(0, '0.958')] [2022-07-11 14:50:41,065][26022] Updated weights on worker 0-0, policy_version 1238138 (0.00100) [2022-07-11 14:50:42,794][26022] Updated weights on worker 0-0, policy_version 1238148 (0.00086) [2022-07-11 14:50:44,694][25689] Fps is (10 sec: 5539.3, 60 sec: 5553.1, 300 sec: 5552.2). Total num frames: 1267872768. Throughput: 0: 5796.0. Samples: 1267877006. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:44,695][25689] Avg episode reward: [(0, '0.649')] [2022-07-11 14:50:44,881][26022] Updated weights on worker 0-0, policy_version 1238158 (0.00088) [2022-07-11 14:50:46,216][26022] Updated weights on worker 0-0, policy_version 1238168 (0.00088) [2022-07-11 14:50:48,598][26022] Updated weights on worker 0-0, policy_version 1238178 (0.00084) [2022-07-11 14:50:49,728][25689] Fps is (10 sec: 5759.9, 60 sec: 5569.1, 300 sec: 5560.1). Total num frames: 1267902464. Throughput: 0: 5793.1. Samples: 1267910646. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:49,729][25689] Avg episode reward: [(0, '-0.213')] [2022-07-11 14:50:50,097][26022] Updated weights on worker 0-0, policy_version 1238188 (0.00086) [2022-07-11 14:50:52,026][26022] Updated weights on worker 0-0, policy_version 1238198 (0.00096) [2022-07-11 14:50:53,822][26022] Updated weights on worker 0-0, policy_version 1238208 (0.00087) [2022-07-11 14:50:54,839][25689] Fps is (10 sec: 5652.8, 60 sec: 5563.8, 300 sec: 5558.7). Total num frames: 1267930112. Throughput: 0: 5772.0. Samples: 1267927158. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:54,839][25689] Avg episode reward: [(0, '0.310')] [2022-07-11 14:50:55,652][26022] Updated weights on worker 0-0, policy_version 1238218 (0.00097) [2022-07-11 14:50:57,448][26022] Updated weights on worker 0-0, policy_version 1238228 (0.00083) [2022-07-11 14:50:59,309][26022] Updated weights on worker 0-0, policy_version 1238238 (0.00087) [2022-07-11 14:50:59,874][25689] Fps is (10 sec: 5348.9, 60 sec: 5545.1, 300 sec: 5555.0). Total num frames: 1267956736. Throughput: 0: 5795.8. Samples: 1267960808. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:50:59,874][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 14:51:01,203][26022] Updated weights on worker 0-0, policy_version 1238248 (0.00089) [2022-07-11 14:51:03,575][26022] Updated weights on worker 0-0, policy_version 1238258 (0.00082) [2022-07-11 14:51:04,877][25689] Fps is (10 sec: 5508.2, 60 sec: 5563.1, 300 sec: 5558.8). Total num frames: 1267985408. Throughput: 0: 5715.9. Samples: 1267992372. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:04,878][25689] Avg episode reward: [(0, '-0.748')] [2022-07-11 14:51:04,911][26022] Updated weights on worker 0-0, policy_version 1238268 (0.00088) [2022-07-11 14:51:07,206][26022] Updated weights on worker 0-0, policy_version 1238278 (0.00089) [2022-07-11 14:51:08,830][26022] Updated weights on worker 0-0, policy_version 1238288 (0.00096) [2022-07-11 14:51:09,888][25689] Fps is (10 sec: 5419.6, 60 sec: 5547.1, 300 sec: 5546.8). Total num frames: 1268011008. Throughput: 0: 4876.0. Samples: 1268008948. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:09,888][25689] Avg episode reward: [(0, '-0.808')] [2022-07-11 14:51:10,823][26022] Updated weights on worker 0-0, policy_version 1238298 (0.00087) [2022-07-11 14:51:12,701][26022] Updated weights on worker 0-0, policy_version 1238308 (0.00084) [2022-07-11 14:51:14,463][26022] Updated weights on worker 0-0, policy_version 1238318 (0.00094) [2022-07-11 14:51:14,964][25689] Fps is (10 sec: 5380.4, 60 sec: 5526.9, 300 sec: 5552.3). Total num frames: 1268039680. Throughput: 0: 5719.6. Samples: 1268042270. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:14,965][25689] Avg episode reward: [(0, '-0.493')] [2022-07-11 14:51:16,523][26022] Updated weights on worker 0-0, policy_version 1238328 (0.01207) [2022-07-11 14:51:18,155][26022] Updated weights on worker 0-0, policy_version 1238338 (0.00084) [2022-07-11 14:51:19,907][26022] Updated weights on worker 0-0, policy_version 1238348 (0.00093) [2022-07-11 14:51:19,983][25689] Fps is (10 sec: 5680.5, 60 sec: 5529.4, 300 sec: 5552.2). Total num frames: 1268068352. Throughput: 0: 5732.3. Samples: 1268076078. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:19,983][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 14:51:21,803][26022] Updated weights on worker 0-0, policy_version 1238358 (0.00087) [2022-07-11 14:51:23,624][26022] Updated weights on worker 0-0, policy_version 1238368 (0.00615) [2022-07-11 14:51:25,000][25689] Fps is (10 sec: 5510.2, 60 sec: 5528.2, 300 sec: 5548.7). Total num frames: 1268094976. Throughput: 0: 4990.7. Samples: 1268092796. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:25,000][25689] Avg episode reward: [(0, '0.162')] [2022-07-11 14:51:25,414][26022] Updated weights on worker 0-0, policy_version 1238378 (0.00083) [2022-07-11 14:51:27,300][26022] Updated weights on worker 0-0, policy_version 1238388 (0.00064) [2022-07-11 14:51:29,071][26022] Updated weights on worker 0-0, policy_version 1238398 (0.00094) [2022-07-11 14:51:30,032][25689] Fps is (10 sec: 5604.4, 60 sec: 5547.1, 300 sec: 5556.0). Total num frames: 1268124672. Throughput: 0: 5850.6. Samples: 1268126804. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:30,033][25689] Avg episode reward: [(0, '0.019')] [2022-07-11 14:51:30,927][26022] Updated weights on worker 0-0, policy_version 1238408 (0.00082) [2022-07-11 14:51:32,678][26022] Updated weights on worker 0-0, policy_version 1238418 (0.00093) [2022-07-11 14:51:34,618][26022] Updated weights on worker 0-0, policy_version 1238428 (0.00087) [2022-07-11 14:51:35,147][25689] Fps is (10 sec: 5752.2, 60 sec: 5557.5, 300 sec: 5550.5). Total num frames: 1268153344. Throughput: 0: 5863.8. Samples: 1268160614. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:35,147][25689] Avg episode reward: [(0, '0.826')] [2022-07-11 14:51:36,432][26022] Updated weights on worker 0-0, policy_version 1238438 (0.00078) [2022-07-11 14:51:38,122][26022] Updated weights on worker 0-0, policy_version 1238448 (0.00087) [2022-07-11 14:51:40,007][26022] Updated weights on worker 0-0, policy_version 1238458 (0.00087) [2022-07-11 14:51:40,155][25689] Fps is (10 sec: 5665.0, 60 sec: 5579.9, 300 sec: 5554.0). Total num frames: 1268182016. Throughput: 0: 5045.8. Samples: 1268177860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:40,155][25689] Avg episode reward: [(0, '1.188')] [2022-07-11 14:51:41,643][26022] Updated weights on worker 0-0, policy_version 1238468 (0.01237) [2022-07-11 14:51:43,540][26022] Updated weights on worker 0-0, policy_version 1238478 (0.00088) [2022-07-11 14:51:45,187][25689] Fps is (10 sec: 5711.3, 60 sec: 5586.1, 300 sec: 5560.7). Total num frames: 1268210688. Throughput: 0: 5904.6. Samples: 1268211996. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:45,188][25689] Avg episode reward: [(0, '1.336')] [2022-07-11 14:51:45,304][26022] Updated weights on worker 0-0, policy_version 1238488 (0.00094) [2022-07-11 14:51:47,160][26022] Updated weights on worker 0-0, policy_version 1238498 (0.00096) [2022-07-11 14:51:48,825][26022] Updated weights on worker 0-0, policy_version 1238508 (0.00091) [2022-07-11 14:51:50,279][25689] Fps is (10 sec: 5563.0, 60 sec: 5546.9, 300 sec: 5553.4). Total num frames: 1268238336. Throughput: 0: 5887.1. Samples: 1268245998. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:50,280][25689] Avg episode reward: [(0, '0.183')] [2022-07-11 14:51:50,748][26022] Updated weights on worker 0-0, policy_version 1238518 (0.00088) [2022-07-11 14:51:52,738][26022] Updated weights on worker 0-0, policy_version 1238528 (0.00080) [2022-07-11 14:51:54,331][26022] Updated weights on worker 0-0, policy_version 1238538 (0.00089) [2022-07-11 14:51:55,363][25689] Fps is (10 sec: 5534.7, 60 sec: 5566.3, 300 sec: 5559.0). Total num frames: 1268267008. Throughput: 0: 5043.9. Samples: 1268262584. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:51:55,363][25689] Avg episode reward: [(0, '0.343')] [2022-07-11 14:51:56,458][26022] Updated weights on worker 0-0, policy_version 1238548 (0.00093) [2022-07-11 14:51:57,995][26022] Updated weights on worker 0-0, policy_version 1238558 (0.00090) [2022-07-11 14:52:00,058][26022] Updated weights on worker 0-0, policy_version 1238568 (0.00078) [2022-07-11 14:52:00,383][25689] Fps is (10 sec: 5675.4, 60 sec: 5601.6, 300 sec: 5569.2). Total num frames: 1268295680. Throughput: 0: 5854.6. Samples: 1268296286. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:00,383][25689] Avg episode reward: [(0, '0.429')] [2022-07-11 14:52:01,759][26022] Updated weights on worker 0-0, policy_version 1238578 (0.00084) [2022-07-11 14:52:03,928][26022] Updated weights on worker 0-0, policy_version 1238588 (0.00085) [2022-07-11 14:52:05,419][25689] Fps is (10 sec: 5397.1, 60 sec: 5547.8, 300 sec: 5558.4). Total num frames: 1268321280. Throughput: 0: 5713.1. Samples: 1268327580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:05,420][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 14:52:06,159][26022] Updated weights on worker 0-0, policy_version 1238598 (0.00086) [2022-07-11 14:52:07,507][26022] Updated weights on worker 0-0, policy_version 1238608 (0.00079) [2022-07-11 14:52:09,709][26022] Updated weights on worker 0-0, policy_version 1238618 (0.00085) [2022-07-11 14:52:10,443][25689] Fps is (10 sec: 5394.9, 60 sec: 5597.3, 300 sec: 5562.1). Total num frames: 1268349952. Throughput: 0: 4862.9. Samples: 1268344054. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:10,444][25689] Avg episode reward: [(0, '0.255')] [2022-07-11 14:52:11,151][26022] Updated weights on worker 0-0, policy_version 1238628 (0.00094) [2022-07-11 14:52:13,327][26022] Updated weights on worker 0-0, policy_version 1238638 (0.00098) [2022-07-11 14:52:15,146][26022] Updated weights on worker 0-0, policy_version 1238648 (0.00086) [2022-07-11 14:52:15,475][25689] Fps is (10 sec: 5498.8, 60 sec: 5567.6, 300 sec: 5554.7). Total num frames: 1268376576. Throughput: 0: 5713.8. Samples: 1268377498. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:15,475][25689] Avg episode reward: [(0, '-0.058')] [2022-07-11 14:52:15,548][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:52:15,562][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001238650_1268377600.pth [2022-07-11 14:52:15,562][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001236694_1266374656.pth [2022-07-11 14:52:16,913][26022] Updated weights on worker 0-0, policy_version 1238658 (0.00091) [2022-07-11 14:52:18,688][26022] Updated weights on worker 0-0, policy_version 1238668 (0.00088) [2022-07-11 14:52:20,476][25689] Fps is (10 sec: 5511.3, 60 sec: 5569.2, 300 sec: 5558.2). Total num frames: 1268405248. Throughput: 0: 5731.9. Samples: 1268411458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:20,476][25689] Avg episode reward: [(0, '1.408')] [2022-07-11 14:52:20,506][26022] Updated weights on worker 0-0, policy_version 1238678 (0.00091) [2022-07-11 14:52:22,301][26022] Updated weights on worker 0-0, policy_version 1238688 (0.00081) [2022-07-11 14:52:24,372][26022] Updated weights on worker 0-0, policy_version 1238698 (0.00092) [2022-07-11 14:52:25,479][25689] Fps is (10 sec: 5732.0, 60 sec: 5604.3, 300 sec: 5558.2). Total num frames: 1268433920. Throughput: 0: 5009.0. Samples: 1268428062. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:25,479][25689] Avg episode reward: [(0, '1.444')] [2022-07-11 14:52:26,034][26022] Updated weights on worker 0-0, policy_version 1238708 (0.00064) [2022-07-11 14:52:27,880][26022] Updated weights on worker 0-0, policy_version 1238718 (0.00085) [2022-07-11 14:52:29,766][26022] Updated weights on worker 0-0, policy_version 1238728 (0.00106) [2022-07-11 14:52:30,499][25689] Fps is (10 sec: 5517.1, 60 sec: 5554.7, 300 sec: 5559.8). Total num frames: 1268460544. Throughput: 0: 5855.1. Samples: 1268461484. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:30,499][25689] Avg episode reward: [(0, '1.572')] [2022-07-11 14:52:31,664][26022] Updated weights on worker 0-0, policy_version 1238738 (0.00096) [2022-07-11 14:52:33,502][26022] Updated weights on worker 0-0, policy_version 1238748 (0.00083) [2022-07-11 14:52:35,452][26022] Updated weights on worker 0-0, policy_version 1238758 (0.00090) [2022-07-11 14:52:35,567][25689] Fps is (10 sec: 5379.8, 60 sec: 5542.0, 300 sec: 5553.0). Total num frames: 1268488192. Throughput: 0: 5834.8. Samples: 1268494732. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:35,568][25689] Avg episode reward: [(0, '1.721')] [2022-07-11 14:52:37,021][26022] Updated weights on worker 0-0, policy_version 1238768 (0.00083) [2022-07-11 14:52:39,107][26022] Updated weights on worker 0-0, policy_version 1238778 (0.00082) [2022-07-11 14:52:40,620][25689] Fps is (10 sec: 5665.7, 60 sec: 5554.8, 300 sec: 5559.6). Total num frames: 1268517888. Throughput: 0: 4965.3. Samples: 1268511478. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2022-07-11 14:52:40,620][25689] Avg episode reward: [(0, '1.812')] [2022-07-11 14:52:40,888][26022] Updated weights on worker 0-0, policy_version 1238788 (0.00082) [2022-07-11 14:52:42,624][26022] Updated weights on worker 0-0, policy_version 1238798 (0.00088) [2022-07-11 14:52:44,633][26022] Updated weights on worker 0-0, policy_version 1238808 (0.00094) [2022-07-11 14:52:45,641][25689] Fps is (10 sec: 5692.1, 60 sec: 5538.9, 300 sec: 5559.7). Total num frames: 1268545536. Throughput: 0: 5811.3. Samples: 1268545232. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:52:45,642][25689] Avg episode reward: [(0, '1.971')] [2022-07-11 14:52:46,181][26022] Updated weights on worker 0-0, policy_version 1238818 (0.00085) [2022-07-11 14:52:48,224][26022] Updated weights on worker 0-0, policy_version 1238828 (0.00089) [2022-07-11 14:52:50,157][26022] Updated weights on worker 0-0, policy_version 1238838 (0.00087) [2022-07-11 14:52:50,681][25689] Fps is (10 sec: 5393.9, 60 sec: 5526.6, 300 sec: 5554.4). Total num frames: 1268572160. Throughput: 0: 5818.6. Samples: 1268578920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:52:50,682][25689] Avg episode reward: [(0, '2.072')] [2022-07-11 14:52:51,681][26022] Updated weights on worker 0-0, policy_version 1238848 (0.00092) [2022-07-11 14:52:53,840][26022] Updated weights on worker 0-0, policy_version 1238858 (0.00092) [2022-07-11 14:52:55,299][26022] Updated weights on worker 0-0, policy_version 1238868 (0.00085) [2022-07-11 14:52:55,796][25689] Fps is (10 sec: 5646.8, 60 sec: 5557.7, 300 sec: 5556.0). Total num frames: 1268602880. Throughput: 0: 4993.6. Samples: 1268595750. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:52:55,797][25689] Avg episode reward: [(0, '1.834')] [2022-07-11 14:52:57,296][26022] Updated weights on worker 0-0, policy_version 1238878 (0.00080) [2022-07-11 14:52:59,123][26022] Updated weights on worker 0-0, policy_version 1238888 (0.00081) [2022-07-11 14:53:00,594][26022] Updated weights on worker 0-0, policy_version 1238898 (0.00087) [2022-07-11 14:53:00,821][25689] Fps is (10 sec: 5958.5, 60 sec: 5574.2, 300 sec: 5574.4). Total num frames: 1268632576. Throughput: 0: 5854.5. Samples: 1268629746. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:00,821][25689] Avg episode reward: [(0, '0.598')] [2022-07-11 14:53:03,091][26022] Updated weights on worker 0-0, policy_version 1238908 (0.00086) [2022-07-11 14:53:04,773][26022] Updated weights on worker 0-0, policy_version 1238918 (0.00086) [2022-07-11 14:53:05,913][25689] Fps is (10 sec: 5263.3, 60 sec: 5535.2, 300 sec: 5556.9). Total num frames: 1268656128. Throughput: 0: 5737.3. Samples: 1268661538. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:05,914][25689] Avg episode reward: [(0, '0.371')] [2022-07-11 14:53:06,902][26022] Updated weights on worker 0-0, policy_version 1238928 (0.00087) [2022-07-11 14:53:08,654][26022] Updated weights on worker 0-0, policy_version 1238938 (0.00095) [2022-07-11 14:53:10,409][26022] Updated weights on worker 0-0, policy_version 1238948 (0.00090) [2022-07-11 14:53:10,953][25689] Fps is (10 sec: 5356.2, 60 sec: 5567.5, 300 sec: 5565.1). Total num frames: 1268686848. Throughput: 0: 4891.4. Samples: 1268678078. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:10,954][25689] Avg episode reward: [(0, '-0.532')] [2022-07-11 14:53:12,334][26022] Updated weights on worker 0-0, policy_version 1238958 (0.00083) [2022-07-11 14:53:14,242][26022] Updated weights on worker 0-0, policy_version 1238968 (0.00093) [2022-07-11 14:53:15,823][26022] Updated weights on worker 0-0, policy_version 1238978 (0.00086) [2022-07-11 14:53:16,042][25689] Fps is (10 sec: 5762.7, 60 sec: 5579.3, 300 sec: 5560.3). Total num frames: 1268714496. Throughput: 0: 5741.7. Samples: 1268711992. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:16,042][25689] Avg episode reward: [(0, '-0.567')] [2022-07-11 14:53:17,898][26022] Updated weights on worker 0-0, policy_version 1238988 (0.00096) [2022-07-11 14:53:19,304][26022] Updated weights on worker 0-0, policy_version 1238998 (0.00094) [2022-07-11 14:53:21,071][25689] Fps is (10 sec: 5364.2, 60 sec: 5542.9, 300 sec: 5554.0). Total num frames: 1268741120. Throughput: 0: 5726.9. Samples: 1268745716. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:21,072][25689] Avg episode reward: [(0, '-0.717')] [2022-07-11 14:53:21,452][26022] Updated weights on worker 0-0, policy_version 1239008 (0.00086) [2022-07-11 14:53:23,117][26022] Updated weights on worker 0-0, policy_version 1239018 (0.00733) [2022-07-11 14:53:24,884][26022] Updated weights on worker 0-0, policy_version 1239028 (0.00086) [2022-07-11 14:53:26,103][25689] Fps is (10 sec: 5495.9, 60 sec: 5540.2, 300 sec: 5557.1). Total num frames: 1268769792. Throughput: 0: 5832.3. Samples: 1268779290. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:26,103][25689] Avg episode reward: [(0, '-0.830')] [2022-07-11 14:53:26,883][26022] Updated weights on worker 0-0, policy_version 1239038 (0.00097) [2022-07-11 14:53:28,795][26022] Updated weights on worker 0-0, policy_version 1239048 (0.00082) [2022-07-11 14:53:30,492][26022] Updated weights on worker 0-0, policy_version 1239058 (0.00093) [2022-07-11 14:53:31,120][25689] Fps is (10 sec: 5706.5, 60 sec: 5574.3, 300 sec: 5558.7). Total num frames: 1268798464. Throughput: 0: 5857.7. Samples: 1268796206. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:31,120][25689] Avg episode reward: [(0, '-0.997')] [2022-07-11 14:53:32,566][26022] Updated weights on worker 0-0, policy_version 1239068 (0.00053) [2022-07-11 14:53:34,160][26022] Updated weights on worker 0-0, policy_version 1239078 (0.00091) [2022-07-11 14:53:36,199][25689] Fps is (10 sec: 5476.7, 60 sec: 5556.4, 300 sec: 5551.5). Total num frames: 1268825088. Throughput: 0: 5833.5. Samples: 1268829582. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:36,200][25689] Avg episode reward: [(0, '1.080')] [2022-07-11 14:53:36,211][26022] Updated weights on worker 0-0, policy_version 1239088 (0.00084) [2022-07-11 14:53:37,863][26022] Updated weights on worker 0-0, policy_version 1239098 (0.00084) [2022-07-11 14:53:39,740][26022] Updated weights on worker 0-0, policy_version 1239108 (0.00088) [2022-07-11 14:53:41,229][25689] Fps is (10 sec: 5672.3, 60 sec: 5575.4, 300 sec: 5565.2). Total num frames: 1268855808. Throughput: 0: 5826.4. Samples: 1268863168. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:41,230][25689] Avg episode reward: [(0, '0.249')] [2022-07-11 14:53:41,511][26022] Updated weights on worker 0-0, policy_version 1239118 (0.00084) [2022-07-11 14:53:43,482][26022] Updated weights on worker 0-0, policy_version 1239128 (0.00087) [2022-07-11 14:53:45,124][26022] Updated weights on worker 0-0, policy_version 1239138 (0.00084) [2022-07-11 14:53:46,249][25689] Fps is (10 sec: 5705.8, 60 sec: 5558.6, 300 sec: 5558.1). Total num frames: 1268882432. Throughput: 0: 4995.6. Samples: 1268879934. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:46,250][25689] Avg episode reward: [(0, '0.212')] [2022-07-11 14:53:47,128][26022] Updated weights on worker 0-0, policy_version 1239148 (0.00089) [2022-07-11 14:53:48,893][26022] Updated weights on worker 0-0, policy_version 1239158 (0.00090) [2022-07-11 14:53:50,745][26022] Updated weights on worker 0-0, policy_version 1239168 (0.00092) [2022-07-11 14:53:51,262][25689] Fps is (10 sec: 5511.6, 60 sec: 5594.9, 300 sec: 5559.2). Total num frames: 1268911104. Throughput: 0: 5826.9. Samples: 1268913572. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:51,263][25689] Avg episode reward: [(0, '-0.496')] [2022-07-11 14:53:52,685][26022] Updated weights on worker 0-0, policy_version 1239178 (0.00100) [2022-07-11 14:53:54,368][26022] Updated weights on worker 0-0, policy_version 1239188 (0.00087) [2022-07-11 14:53:56,305][25689] Fps is (10 sec: 5498.9, 60 sec: 5533.8, 300 sec: 5551.8). Total num frames: 1268937728. Throughput: 0: 5818.6. Samples: 1268946570. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:53:56,307][25689] Avg episode reward: [(0, '0.143')] [2022-07-11 14:53:56,327][26022] Updated weights on worker 0-0, policy_version 1239198 (0.00089) [2022-07-11 14:53:58,098][26022] Updated weights on worker 0-0, policy_version 1239208 (0.00090) [2022-07-11 14:54:00,118][26022] Updated weights on worker 0-0, policy_version 1239218 (0.00084) [2022-07-11 14:54:01,308][25689] Fps is (10 sec: 5504.2, 60 sec: 5518.9, 300 sec: 5562.2). Total num frames: 1268966400. Throughput: 0: 4989.7. Samples: 1268963356. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:01,311][25689] Avg episode reward: [(0, '0.085')] [2022-07-11 14:54:02,340][26022] Updated weights on worker 0-0, policy_version 1239228 (0.00090) [2022-07-11 14:54:03,988][26022] Updated weights on worker 0-0, policy_version 1239238 (0.00091) [2022-07-11 14:54:06,050][26022] Updated weights on worker 0-0, policy_version 1239248 (0.00093) [2022-07-11 14:54:06,321][25689] Fps is (10 sec: 5419.0, 60 sec: 5560.1, 300 sec: 5558.6). Total num frames: 1268992000. Throughput: 0: 5699.3. Samples: 1268994326. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:06,321][25689] Avg episode reward: [(0, '-0.263')] [2022-07-11 14:54:07,816][26022] Updated weights on worker 0-0, policy_version 1239258 (0.00087) [2022-07-11 14:54:09,572][26022] Updated weights on worker 0-0, policy_version 1239268 (0.00088) [2022-07-11 14:54:11,324][25689] Fps is (10 sec: 5214.1, 60 sec: 5495.6, 300 sec: 5553.6). Total num frames: 1269018624. Throughput: 0: 5692.8. Samples: 1269027782. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:11,326][25689] Avg episode reward: [(0, '0.307')] [2022-07-11 14:54:11,598][26022] Updated weights on worker 0-0, policy_version 1239278 (0.00095) [2022-07-11 14:54:13,192][26022] Updated weights on worker 0-0, policy_version 1239288 (0.00090) [2022-07-11 14:54:15,227][26022] Updated weights on worker 0-0, policy_version 1239298 (0.00087) [2022-07-11 14:54:15,571][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:54:15,584][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001239300_1269043200.pth [2022-07-11 14:54:15,585][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001237345_1267041280.pth [2022-07-11 14:54:16,390][25689] Fps is (10 sec: 5491.5, 60 sec: 5514.6, 300 sec: 5557.6). Total num frames: 1269047296. Throughput: 0: 4871.1. Samples: 1269044404. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:16,391][25689] Avg episode reward: [(0, '-0.632')] [2022-07-11 14:54:16,914][26022] Updated weights on worker 0-0, policy_version 1239308 (0.00086) [2022-07-11 14:54:18,631][26022] Updated weights on worker 0-0, policy_version 1239318 (0.00088) [2022-07-11 14:54:20,687][26022] Updated weights on worker 0-0, policy_version 1239328 (0.00087) [2022-07-11 14:54:21,459][25689] Fps is (10 sec: 5658.0, 60 sec: 5544.9, 300 sec: 5556.6). Total num frames: 1269075968. Throughput: 0: 5694.7. Samples: 1269078110. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:21,460][25689] Avg episode reward: [(0, '-0.098')] [2022-07-11 14:54:22,402][26022] Updated weights on worker 0-0, policy_version 1239338 (0.00084) [2022-07-11 14:54:24,388][26022] Updated weights on worker 0-0, policy_version 1239348 (0.00087) [2022-07-11 14:54:25,972][26022] Updated weights on worker 0-0, policy_version 1239358 (0.00094) [2022-07-11 14:54:26,470][25689] Fps is (10 sec: 5790.2, 60 sec: 5563.8, 300 sec: 5560.4). Total num frames: 1269105664. Throughput: 0: 5831.0. Samples: 1269111820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:26,472][25689] Avg episode reward: [(0, '-0.077')] [2022-07-11 14:54:28,144][26022] Updated weights on worker 0-0, policy_version 1239368 (0.00073) [2022-07-11 14:54:29,747][26022] Updated weights on worker 0-0, policy_version 1239378 (0.00090) [2022-07-11 14:54:31,561][25689] Fps is (10 sec: 5575.1, 60 sec: 5523.1, 300 sec: 5553.0). Total num frames: 1269132288. Throughput: 0: 4984.8. Samples: 1269128660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:31,562][25689] Avg episode reward: [(0, '0.207')] [2022-07-11 14:54:31,673][26022] Updated weights on worker 0-0, policy_version 1239388 (0.00089) [2022-07-11 14:54:33,351][26022] Updated weights on worker 0-0, policy_version 1239398 (0.00086) [2022-07-11 14:54:35,481][26022] Updated weights on worker 0-0, policy_version 1239408 (0.00079) [2022-07-11 14:54:36,656][25689] Fps is (10 sec: 5428.9, 60 sec: 5555.6, 300 sec: 5556.1). Total num frames: 1269160960. Throughput: 0: 5798.2. Samples: 1269161910. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:36,657][25689] Avg episode reward: [(0, '0.479')] [2022-07-11 14:54:37,188][26022] Updated weights on worker 0-0, policy_version 1239418 (0.00090) [2022-07-11 14:54:39,069][26022] Updated weights on worker 0-0, policy_version 1239428 (0.00086) [2022-07-11 14:54:40,896][26022] Updated weights on worker 0-0, policy_version 1239438 (0.00092) [2022-07-11 14:54:41,701][25689] Fps is (10 sec: 5554.6, 60 sec: 5503.4, 300 sec: 5552.2). Total num frames: 1269188608. Throughput: 0: 5791.6. Samples: 1269195340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:41,701][25689] Avg episode reward: [(0, '0.671')] [2022-07-11 14:54:42,672][26022] Updated weights on worker 0-0, policy_version 1239448 (0.00085) [2022-07-11 14:54:44,484][26022] Updated weights on worker 0-0, policy_version 1239458 (0.00080) [2022-07-11 14:54:46,335][26022] Updated weights on worker 0-0, policy_version 1239468 (0.00091) [2022-07-11 14:54:46,731][25689] Fps is (10 sec: 5488.7, 60 sec: 5519.5, 300 sec: 5548.6). Total num frames: 1269216256. Throughput: 0: 4959.0. Samples: 1269212290. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:46,731][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 14:54:48,150][26022] Updated weights on worker 0-0, policy_version 1239478 (0.00090) [2022-07-11 14:54:50,064][26022] Updated weights on worker 0-0, policy_version 1239488 (0.00087) [2022-07-11 14:54:51,747][25689] Fps is (10 sec: 5504.2, 60 sec: 5502.2, 300 sec: 5549.3). Total num frames: 1269243904. Throughput: 0: 5809.1. Samples: 1269245920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:51,747][25689] Avg episode reward: [(0, '0.487')] [2022-07-11 14:54:51,864][26022] Updated weights on worker 0-0, policy_version 1239498 (0.00082) [2022-07-11 14:54:53,819][26022] Updated weights on worker 0-0, policy_version 1239508 (0.00085) [2022-07-11 14:54:55,550][26022] Updated weights on worker 0-0, policy_version 1239518 (0.00093) [2022-07-11 14:54:56,871][25689] Fps is (10 sec: 5655.1, 60 sec: 5545.6, 300 sec: 5554.1). Total num frames: 1269273600. Throughput: 0: 5807.2. Samples: 1269279302. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:54:56,871][25689] Avg episode reward: [(0, '0.819')] [2022-07-11 14:54:57,358][26022] Updated weights on worker 0-0, policy_version 1239528 (0.00087) [2022-07-11 14:54:59,221][26022] Updated weights on worker 0-0, policy_version 1239538 (0.00086) [2022-07-11 14:55:00,970][26022] Updated weights on worker 0-0, policy_version 1239548 (0.00088) [2022-07-11 14:55:01,889][25689] Fps is (10 sec: 5452.2, 60 sec: 5493.5, 300 sec: 5547.2). Total num frames: 1269299200. Throughput: 0: 4996.4. Samples: 1269296208. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:01,889][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 14:55:03,256][26022] Updated weights on worker 0-0, policy_version 1239558 (0.00095) [2022-07-11 14:55:05,124][26022] Updated weights on worker 0-0, policy_version 1239568 (0.00098) [2022-07-11 14:55:06,918][25689] Fps is (10 sec: 5299.9, 60 sec: 5525.8, 300 sec: 5550.5). Total num frames: 1269326848. Throughput: 0: 5710.7. Samples: 1269327574. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:06,918][25689] Avg episode reward: [(0, '0.635')] [2022-07-11 14:55:06,926][26022] Updated weights on worker 0-0, policy_version 1239578 (0.00091) [2022-07-11 14:55:08,978][26022] Updated weights on worker 0-0, policy_version 1239588 (0.00093) [2022-07-11 14:55:10,568][26022] Updated weights on worker 0-0, policy_version 1239598 (0.00093) [2022-07-11 14:55:11,934][25689] Fps is (10 sec: 5504.6, 60 sec: 5541.5, 300 sec: 5544.1). Total num frames: 1269354496. Throughput: 0: 5691.4. Samples: 1269360814. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:11,935][25689] Avg episode reward: [(0, '-0.091')] [2022-07-11 14:55:12,844][26022] Updated weights on worker 0-0, policy_version 1239608 (0.00094) [2022-07-11 14:55:14,186][26022] Updated weights on worker 0-0, policy_version 1239618 (0.00087) [2022-07-11 14:55:16,378][26022] Updated weights on worker 0-0, policy_version 1239628 (0.00098) [2022-07-11 14:55:16,998][25689] Fps is (10 sec: 5485.6, 60 sec: 5524.8, 300 sec: 5540.3). Total num frames: 1269382144. Throughput: 0: 4876.5. Samples: 1269377452. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:16,999][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 14:55:17,917][26022] Updated weights on worker 0-0, policy_version 1239638 (0.00086) [2022-07-11 14:55:19,887][26022] Updated weights on worker 0-0, policy_version 1239648 (0.00077) [2022-07-11 14:55:21,716][26022] Updated weights on worker 0-0, policy_version 1239658 (0.00067) [2022-07-11 14:55:22,045][25689] Fps is (10 sec: 5570.3, 60 sec: 5526.8, 300 sec: 5546.4). Total num frames: 1269410816. Throughput: 0: 5689.4. Samples: 1269410884. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:22,047][25689] Avg episode reward: [(0, '0.660')] [2022-07-11 14:55:23,484][26022] Updated weights on worker 0-0, policy_version 1239668 (0.00093) [2022-07-11 14:55:25,384][26022] Updated weights on worker 0-0, policy_version 1239678 (0.00093) [2022-07-11 14:55:27,067][25689] Fps is (10 sec: 5695.3, 60 sec: 5509.0, 300 sec: 5547.0). Total num frames: 1269439488. Throughput: 0: 5782.1. Samples: 1269444076. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:27,068][25689] Avg episode reward: [(0, '0.325')] [2022-07-11 14:55:27,120][26022] Updated weights on worker 0-0, policy_version 1239688 (0.00091) [2022-07-11 14:55:29,115][26022] Updated weights on worker 0-0, policy_version 1239698 (0.00089) [2022-07-11 14:55:30,993][26022] Updated weights on worker 0-0, policy_version 1239708 (0.00087) [2022-07-11 14:55:32,069][25689] Fps is (10 sec: 5414.1, 60 sec: 5500.1, 300 sec: 5540.9). Total num frames: 1269465088. Throughput: 0: 4968.6. Samples: 1269460856. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:32,070][25689] Avg episode reward: [(0, '0.568')] [2022-07-11 14:55:32,714][26022] Updated weights on worker 0-0, policy_version 1239718 (0.00089) [2022-07-11 14:55:34,618][26022] Updated weights on worker 0-0, policy_version 1239728 (0.00091) [2022-07-11 14:55:36,449][26022] Updated weights on worker 0-0, policy_version 1239738 (0.00085) [2022-07-11 14:55:37,133][25689] Fps is (10 sec: 5595.0, 60 sec: 5536.8, 300 sec: 5551.3). Total num frames: 1269495808. Throughput: 0: 5827.8. Samples: 1269494794. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:37,133][25689] Avg episode reward: [(0, '1.481')] [2022-07-11 14:55:38,315][26022] Updated weights on worker 0-0, policy_version 1239748 (0.00085) [2022-07-11 14:55:40,222][26022] Updated weights on worker 0-0, policy_version 1239758 (0.00086) [2022-07-11 14:55:41,937][26022] Updated weights on worker 0-0, policy_version 1239768 (0.00087) [2022-07-11 14:55:42,138][25689] Fps is (10 sec: 5796.8, 60 sec: 5540.4, 300 sec: 5549.6). Total num frames: 1269523456. Throughput: 0: 5838.8. Samples: 1269528204. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:42,140][25689] Avg episode reward: [(0, '1.760')] [2022-07-11 14:55:43,836][26022] Updated weights on worker 0-0, policy_version 1239778 (0.00091) [2022-07-11 14:55:45,559][26022] Updated weights on worker 0-0, policy_version 1239788 (0.00103) [2022-07-11 14:55:47,226][25689] Fps is (10 sec: 5478.5, 60 sec: 5535.1, 300 sec: 5541.7). Total num frames: 1269551104. Throughput: 0: 5001.0. Samples: 1269544894. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:47,227][25689] Avg episode reward: [(0, '1.733')] [2022-07-11 14:55:47,530][26022] Updated weights on worker 0-0, policy_version 1239798 (0.00084) [2022-07-11 14:55:49,283][26022] Updated weights on worker 0-0, policy_version 1239808 (0.00089) [2022-07-11 14:55:51,252][26022] Updated weights on worker 0-0, policy_version 1239818 (0.00096) [2022-07-11 14:55:52,263][25689] Fps is (10 sec: 5562.7, 60 sec: 5550.1, 300 sec: 5546.5). Total num frames: 1269579776. Throughput: 0: 5816.6. Samples: 1269578314. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:52,275][25689] Avg episode reward: [(0, '1.878')] [2022-07-11 14:55:52,891][26022] Updated weights on worker 0-0, policy_version 1239828 (0.00093) [2022-07-11 14:55:54,866][26022] Updated weights on worker 0-0, policy_version 1239838 (0.00091) [2022-07-11 14:55:56,957][26022] Updated weights on worker 0-0, policy_version 1239848 (0.00046) [2022-07-11 14:55:57,360][25689] Fps is (10 sec: 5557.7, 60 sec: 5518.8, 300 sec: 5548.8). Total num frames: 1269607424. Throughput: 0: 5772.2. Samples: 1269611548. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:55:57,360][25689] Avg episode reward: [(0, '1.947')] [2022-07-11 14:55:58,283][26022] Updated weights on worker 0-0, policy_version 1239858 (0.00087) [2022-07-11 14:56:00,567][26022] Updated weights on worker 0-0, policy_version 1239868 (0.00086) [2022-07-11 14:56:02,136][26022] Updated weights on worker 0-0, policy_version 1239878 (0.00087) [2022-07-11 14:56:02,395][25689] Fps is (10 sec: 5457.5, 60 sec: 5551.0, 300 sec: 5544.8). Total num frames: 1269635072. Throughput: 0: 5719.3. Samples: 1269644058. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:02,395][25689] Avg episode reward: [(0, '2.263')] [2022-07-11 14:56:04,648][26022] Updated weights on worker 0-0, policy_version 1239888 (0.00089) [2022-07-11 14:56:06,317][26022] Updated weights on worker 0-0, policy_version 1239898 (0.00094) [2022-07-11 14:56:07,404][25689] Fps is (10 sec: 5199.2, 60 sec: 5502.1, 300 sec: 5541.3). Total num frames: 1269659648. Throughput: 0: 5680.9. Samples: 1269659524. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:07,405][25689] Avg episode reward: [(0, '1.972')] [2022-07-11 14:56:08,156][26022] Updated weights on worker 0-0, policy_version 1239908 (0.00092) [2022-07-11 14:56:10,128][26022] Updated weights on worker 0-0, policy_version 1239918 (0.00086) [2022-07-11 14:56:11,796][26022] Updated weights on worker 0-0, policy_version 1239928 (0.00088) [2022-07-11 14:56:12,424][25689] Fps is (10 sec: 5309.2, 60 sec: 5518.6, 300 sec: 5542.4). Total num frames: 1269688320. Throughput: 0: 5685.3. Samples: 1269692938. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:12,425][25689] Avg episode reward: [(0, '1.832')] [2022-07-11 14:56:13,736][26022] Updated weights on worker 0-0, policy_version 1239938 (0.00082) [2022-07-11 14:56:15,631][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:56:15,642][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001239948_1269706752.pth [2022-07-11 14:56:15,643][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001237998_1267709952.pth [2022-07-11 14:56:15,652][26022] Updated weights on worker 0-0, policy_version 1239948 (0.00093) [2022-07-11 14:56:17,414][26022] Updated weights on worker 0-0, policy_version 1239958 (0.00084) [2022-07-11 14:56:17,498][25689] Fps is (10 sec: 5782.8, 60 sec: 5551.6, 300 sec: 5544.8). Total num frames: 1269718016. Throughput: 0: 5712.0. Samples: 1269726576. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:17,498][25689] Avg episode reward: [(0, '2.038')] [2022-07-11 14:56:19,256][26022] Updated weights on worker 0-0, policy_version 1239968 (0.00085) [2022-07-11 14:56:20,954][26022] Updated weights on worker 0-0, policy_version 1239978 (0.00083) [2022-07-11 14:56:22,593][25689] Fps is (10 sec: 5639.4, 60 sec: 5530.3, 300 sec: 5546.8). Total num frames: 1269745664. Throughput: 0: 4914.0. Samples: 1269743310. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:22,593][25689] Avg episode reward: [(0, '1.964')] [2022-07-11 14:56:22,844][26022] Updated weights on worker 0-0, policy_version 1239988 (0.00091) [2022-07-11 14:56:24,690][26022] Updated weights on worker 0-0, policy_version 1239998 (0.00092) [2022-07-11 14:56:26,532][26022] Updated weights on worker 0-0, policy_version 1240008 (0.00086) [2022-07-11 14:56:27,635][25689] Fps is (10 sec: 5454.7, 60 sec: 5511.5, 300 sec: 5539.7). Total num frames: 1269773312. Throughput: 0: 5796.7. Samples: 1269776796. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:27,635][25689] Avg episode reward: [(0, '1.580')] [2022-07-11 14:56:28,302][26022] Updated weights on worker 0-0, policy_version 1240018 (0.00084) [2022-07-11 14:56:30,212][26022] Updated weights on worker 0-0, policy_version 1240028 (0.00089) [2022-07-11 14:56:31,944][26022] Updated weights on worker 0-0, policy_version 1240038 (0.00088) [2022-07-11 14:56:32,666][25689] Fps is (10 sec: 5692.7, 60 sec: 5576.5, 300 sec: 5544.7). Total num frames: 1269803008. Throughput: 0: 5806.3. Samples: 1269810466. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:32,667][25689] Avg episode reward: [(0, '1.227')] [2022-07-11 14:56:34,116][26022] Updated weights on worker 0-0, policy_version 1240048 (0.00085) [2022-07-11 14:56:35,548][26022] Updated weights on worker 0-0, policy_version 1240058 (0.00094) [2022-07-11 14:56:37,711][25689] Fps is (10 sec: 5589.5, 60 sec: 5510.6, 300 sec: 5537.2). Total num frames: 1269829632. Throughput: 0: 4968.0. Samples: 1269826996. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:37,713][25689] Avg episode reward: [(0, '1.263')] [2022-07-11 14:56:37,719][26022] Updated weights on worker 0-0, policy_version 1240068 (0.00090) [2022-07-11 14:56:39,287][26022] Updated weights on worker 0-0, policy_version 1240078 (0.00087) [2022-07-11 14:56:41,375][26022] Updated weights on worker 0-0, policy_version 1240088 (0.00096) [2022-07-11 14:56:42,728][25689] Fps is (10 sec: 5495.8, 60 sec: 5526.5, 300 sec: 5537.5). Total num frames: 1269858304. Throughput: 0: 5807.4. Samples: 1269860240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2022-07-11 14:56:42,728][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 14:56:42,983][26022] Updated weights on worker 0-0, policy_version 1240098 (0.00088) [2022-07-11 14:56:44,987][26022] Updated weights on worker 0-0, policy_version 1240108 (0.00435) [2022-07-11 14:56:46,880][26022] Updated weights on worker 0-0, policy_version 1240118 (0.00091) [2022-07-11 14:56:47,731][25689] Fps is (10 sec: 5620.9, 60 sec: 5534.3, 300 sec: 5539.1). Total num frames: 1269885952. Throughput: 0: 5820.4. Samples: 1269893762. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:56:47,732][25689] Avg episode reward: [(0, '1.358')] [2022-07-11 14:56:48,623][26022] Updated weights on worker 0-0, policy_version 1240128 (0.00086) [2022-07-11 14:56:50,361][26022] Updated weights on worker 0-0, policy_version 1240138 (0.00062) [2022-07-11 14:56:52,299][26022] Updated weights on worker 0-0, policy_version 1240148 (0.00052) [2022-07-11 14:56:52,743][25689] Fps is (10 sec: 5521.3, 60 sec: 5519.6, 300 sec: 5537.0). Total num frames: 1269913600. Throughput: 0: 4986.9. Samples: 1269910586. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:56:52,744][25689] Avg episode reward: [(0, '1.388')] [2022-07-11 14:56:53,976][26022] Updated weights on worker 0-0, policy_version 1240158 (0.00091) [2022-07-11 14:56:56,011][26022] Updated weights on worker 0-0, policy_version 1240168 (0.00094) [2022-07-11 14:56:57,597][26022] Updated weights on worker 0-0, policy_version 1240178 (0.00088) [2022-07-11 14:56:57,830][25689] Fps is (10 sec: 5678.2, 60 sec: 5554.4, 300 sec: 5539.2). Total num frames: 1269943296. Throughput: 0: 5823.2. Samples: 1269944150. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:56:57,830][25689] Avg episode reward: [(0, '1.522')] [2022-07-11 14:56:59,646][26022] Updated weights on worker 0-0, policy_version 1240188 (0.00087) [2022-07-11 14:57:01,391][26022] Updated weights on worker 0-0, policy_version 1240198 (0.00091) [2022-07-11 14:57:02,864][25689] Fps is (10 sec: 5463.3, 60 sec: 5520.6, 300 sec: 5539.2). Total num frames: 1269968896. Throughput: 0: 5763.6. Samples: 1269976298. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:02,865][25689] Avg episode reward: [(0, '2.000')] [2022-07-11 14:57:03,627][26022] Updated weights on worker 0-0, policy_version 1240208 (0.00094) [2022-07-11 14:57:05,458][26022] Updated weights on worker 0-0, policy_version 1240218 (0.00091) [2022-07-11 14:57:07,155][26022] Updated weights on worker 0-0, policy_version 1240228 (0.00061) [2022-07-11 14:57:07,942][25689] Fps is (10 sec: 5265.7, 60 sec: 5565.1, 300 sec: 5534.8). Total num frames: 1269996544. Throughput: 0: 4915.3. Samples: 1269993106. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:07,942][25689] Avg episode reward: [(0, '1.671')] [2022-07-11 14:57:09,091][26022] Updated weights on worker 0-0, policy_version 1240238 (0.00104) [2022-07-11 14:57:10,853][26022] Updated weights on worker 0-0, policy_version 1240248 (0.00090) [2022-07-11 14:57:12,740][26022] Updated weights on worker 0-0, policy_version 1240258 (0.00090) [2022-07-11 14:57:12,958][25689] Fps is (10 sec: 5579.5, 60 sec: 5565.4, 300 sec: 5542.0). Total num frames: 1270025216. Throughput: 0: 5724.2. Samples: 1270026304. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:12,959][25689] Avg episode reward: [(0, '1.156')] [2022-07-11 14:57:14,787][26022] Updated weights on worker 0-0, policy_version 1240268 (0.00096) [2022-07-11 14:57:16,264][26022] Updated weights on worker 0-0, policy_version 1240278 (0.00080) [2022-07-11 14:57:17,995][25689] Fps is (10 sec: 5500.5, 60 sec: 5518.0, 300 sec: 5534.4). Total num frames: 1270051840. Throughput: 0: 5731.7. Samples: 1270059732. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:17,995][25689] Avg episode reward: [(0, '0.944')] [2022-07-11 14:57:18,605][26022] Updated weights on worker 0-0, policy_version 1240288 (0.00082) [2022-07-11 14:57:19,976][26022] Updated weights on worker 0-0, policy_version 1240298 (0.00089) [2022-07-11 14:57:22,086][26022] Updated weights on worker 0-0, policy_version 1240308 (0.00104) [2022-07-11 14:57:23,038][25689] Fps is (10 sec: 5689.0, 60 sec: 5573.6, 300 sec: 5540.5). Total num frames: 1270082560. Throughput: 0: 4960.0. Samples: 1270076360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:23,038][25689] Avg episode reward: [(0, '-0.061')] [2022-07-11 14:57:23,922][26022] Updated weights on worker 0-0, policy_version 1240318 (0.00092) [2022-07-11 14:57:25,869][26022] Updated weights on worker 0-0, policy_version 1240328 (0.00092) [2022-07-11 14:57:27,555][26022] Updated weights on worker 0-0, policy_version 1240338 (0.00108) [2022-07-11 14:57:28,050][25689] Fps is (10 sec: 5702.8, 60 sec: 5559.4, 300 sec: 5540.7). Total num frames: 1270109184. Throughput: 0: 5802.4. Samples: 1270109782. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:28,051][25689] Avg episode reward: [(0, '0.011')] [2022-07-11 14:57:29,443][26022] Updated weights on worker 0-0, policy_version 1240348 (0.00083) [2022-07-11 14:57:31,167][26022] Updated weights on worker 0-0, policy_version 1240358 (0.00090) [2022-07-11 14:57:33,060][26022] Updated weights on worker 0-0, policy_version 1240368 (0.00081) [2022-07-11 14:57:33,061][25689] Fps is (10 sec: 5312.4, 60 sec: 5510.4, 300 sec: 5538.3). Total num frames: 1270135808. Throughput: 0: 5847.7. Samples: 1270143862. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:33,062][25689] Avg episode reward: [(0, '-1.784')] [2022-07-11 14:57:34,707][26022] Updated weights on worker 0-0, policy_version 1240378 (0.00084) [2022-07-11 14:57:36,614][26022] Updated weights on worker 0-0, policy_version 1240388 (0.00086) [2022-07-11 14:57:38,108][25689] Fps is (10 sec: 5701.7, 60 sec: 5578.1, 300 sec: 5541.9). Total num frames: 1270166528. Throughput: 0: 5028.3. Samples: 1270160864. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:38,109][25689] Avg episode reward: [(0, '-1.584')] [2022-07-11 14:57:38,347][26022] Updated weights on worker 0-0, policy_version 1240398 (0.00091) [2022-07-11 14:57:40,271][26022] Updated weights on worker 0-0, policy_version 1240408 (0.00110) [2022-07-11 14:57:42,033][26022] Updated weights on worker 0-0, policy_version 1240418 (0.00076) [2022-07-11 14:57:43,210][25689] Fps is (10 sec: 5650.5, 60 sec: 5536.3, 300 sec: 5536.9). Total num frames: 1270193152. Throughput: 0: 5848.4. Samples: 1270194332. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:43,210][25689] Avg episode reward: [(0, '-1.117')] [2022-07-11 14:57:43,842][26022] Updated weights on worker 0-0, policy_version 1240428 (0.00092) [2022-07-11 14:57:45,735][26022] Updated weights on worker 0-0, policy_version 1240438 (0.00086) [2022-07-11 14:57:47,663][26022] Updated weights on worker 0-0, policy_version 1240448 (0.00087) [2022-07-11 14:57:48,268][25689] Fps is (10 sec: 5442.3, 60 sec: 5548.2, 300 sec: 5543.5). Total num frames: 1270221824. Throughput: 0: 5829.9. Samples: 1270227648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:48,270][25689] Avg episode reward: [(0, '-1.218')] [2022-07-11 14:57:49,356][26022] Updated weights on worker 0-0, policy_version 1240458 (0.00083) [2022-07-11 14:57:51,298][26022] Updated weights on worker 0-0, policy_version 1240468 (0.00086) [2022-07-11 14:57:53,037][26022] Updated weights on worker 0-0, policy_version 1240478 (0.00091) [2022-07-11 14:57:53,321][25689] Fps is (10 sec: 5671.4, 60 sec: 5561.4, 300 sec: 5537.8). Total num frames: 1270250496. Throughput: 0: 4970.9. Samples: 1270244570. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:53,321][25689] Avg episode reward: [(0, '-1.499')] [2022-07-11 14:57:54,951][26022] Updated weights on worker 0-0, policy_version 1240488 (0.00096) [2022-07-11 14:57:56,846][26022] Updated weights on worker 0-0, policy_version 1240498 (0.00084) [2022-07-11 14:57:58,353][26022] Updated weights on worker 0-0, policy_version 1240508 (0.00090) [2022-07-11 14:57:58,376][25689] Fps is (10 sec: 5774.3, 60 sec: 5564.2, 300 sec: 5537.2). Total num frames: 1270280192. Throughput: 0: 5786.8. Samples: 1270278154. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:57:58,377][25689] Avg episode reward: [(0, '1.107')] [2022-07-11 14:58:00,577][26022] Updated weights on worker 0-0, policy_version 1240518 (0.00078) [2022-07-11 14:58:02,681][26022] Updated weights on worker 0-0, policy_version 1240528 (0.00090) [2022-07-11 14:58:03,391][25689] Fps is (10 sec: 5389.1, 60 sec: 5549.1, 300 sec: 5542.1). Total num frames: 1270304768. Throughput: 0: 5722.5. Samples: 1270309822. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:03,392][25689] Avg episode reward: [(0, '1.117')] [2022-07-11 14:58:04,366][26022] Updated weights on worker 0-0, policy_version 1240538 (0.00088) [2022-07-11 14:58:06,425][26022] Updated weights on worker 0-0, policy_version 1240548 (0.00087) [2022-07-11 14:58:07,925][26022] Updated weights on worker 0-0, policy_version 1240558 (0.00084) [2022-07-11 14:58:08,399][25689] Fps is (10 sec: 5210.7, 60 sec: 5555.6, 300 sec: 5532.3). Total num frames: 1270332416. Throughput: 0: 4918.1. Samples: 1270326652. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:08,399][25689] Avg episode reward: [(0, '1.010')] [2022-07-11 14:58:09,966][26022] Updated weights on worker 0-0, policy_version 1240568 (0.00083) [2022-07-11 14:58:11,667][26022] Updated weights on worker 0-0, policy_version 1240578 (0.00086) [2022-07-11 14:58:13,407][25689] Fps is (10 sec: 5623.4, 60 sec: 5556.3, 300 sec: 5537.3). Total num frames: 1270361088. Throughput: 0: 5751.9. Samples: 1270360102. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:13,407][25689] Avg episode reward: [(0, '1.188')] [2022-07-11 14:58:13,539][26022] Updated weights on worker 0-0, policy_version 1240588 (0.00085) [2022-07-11 14:58:15,577][26022] Updated weights on worker 0-0, policy_version 1240598 (0.00087) [2022-07-11 14:58:15,799][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 14:58:15,815][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001240599_1270373376.pth [2022-07-11 14:58:15,816][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001238650_1268377600.pth [2022-07-11 14:58:17,160][26022] Updated weights on worker 0-0, policy_version 1240608 (0.00087) [2022-07-11 14:58:18,475][25689] Fps is (10 sec: 5487.9, 60 sec: 5553.5, 300 sec: 5536.6). Total num frames: 1270387712. Throughput: 0: 5766.3. Samples: 1270394048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:18,475][25689] Avg episode reward: [(0, '1.024')] [2022-07-11 14:58:19,196][26022] Updated weights on worker 0-0, policy_version 1240618 (0.00092) [2022-07-11 14:58:20,837][26022] Updated weights on worker 0-0, policy_version 1240628 (0.00087) [2022-07-11 14:58:22,661][26022] Updated weights on worker 0-0, policy_version 1240638 (0.00085) [2022-07-11 14:58:23,478][25689] Fps is (10 sec: 5592.0, 60 sec: 5540.1, 300 sec: 5540.5). Total num frames: 1270417408. Throughput: 0: 5033.7. Samples: 1270410934. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:23,479][25689] Avg episode reward: [(0, '1.841')] [2022-07-11 14:58:24,595][26022] Updated weights on worker 0-0, policy_version 1240648 (0.00081) [2022-07-11 14:58:26,447][26022] Updated weights on worker 0-0, policy_version 1240658 (0.00087) [2022-07-11 14:58:28,110][26022] Updated weights on worker 0-0, policy_version 1240668 (0.00093) [2022-07-11 14:58:28,503][25689] Fps is (10 sec: 5718.1, 60 sec: 5555.9, 300 sec: 5536.9). Total num frames: 1270445056. Throughput: 0: 5869.1. Samples: 1270444648. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:28,504][25689] Avg episode reward: [(0, '1.033')] [2022-07-11 14:58:29,970][26022] Updated weights on worker 0-0, policy_version 1240678 (0.00085) [2022-07-11 14:58:31,920][26022] Updated weights on worker 0-0, policy_version 1240688 (0.00086) [2022-07-11 14:58:33,505][25689] Fps is (10 sec: 5514.6, 60 sec: 5573.7, 300 sec: 5541.8). Total num frames: 1270472704. Throughput: 0: 5868.3. Samples: 1270478048. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:33,507][25689] Avg episode reward: [(0, '0.422')] [2022-07-11 14:58:33,682][26022] Updated weights on worker 0-0, policy_version 1240698 (0.00088) [2022-07-11 14:58:35,565][26022] Updated weights on worker 0-0, policy_version 1240708 (0.00090) [2022-07-11 14:58:37,469][26022] Updated weights on worker 0-0, policy_version 1240718 (0.00085) [2022-07-11 14:58:38,558][25689] Fps is (10 sec: 5601.2, 60 sec: 5539.2, 300 sec: 5534.5). Total num frames: 1270501376. Throughput: 0: 5003.3. Samples: 1270494532. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:38,560][25689] Avg episode reward: [(0, '-0.449')] [2022-07-11 14:58:39,321][26022] Updated weights on worker 0-0, policy_version 1240728 (0.00086) [2022-07-11 14:58:41,120][26022] Updated weights on worker 0-0, policy_version 1240738 (0.00092) [2022-07-11 14:58:42,995][26022] Updated weights on worker 0-0, policy_version 1240748 (0.00084) [2022-07-11 14:58:43,594][25689] Fps is (10 sec: 5481.2, 60 sec: 5545.3, 300 sec: 5534.2). Total num frames: 1270528000. Throughput: 0: 5809.4. Samples: 1270527792. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:43,597][25689] Avg episode reward: [(0, '-0.190')] [2022-07-11 14:58:44,790][26022] Updated weights on worker 0-0, policy_version 1240758 (0.00087) [2022-07-11 14:58:46,679][26022] Updated weights on worker 0-0, policy_version 1240768 (0.00088) [2022-07-11 14:58:48,407][26022] Updated weights on worker 0-0, policy_version 1240778 (0.00093) [2022-07-11 14:58:48,603][25689] Fps is (10 sec: 5505.0, 60 sec: 5549.8, 300 sec: 5534.3). Total num frames: 1270556672. Throughput: 0: 5807.1. Samples: 1270561368. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:48,605][25689] Avg episode reward: [(0, '-1.019')] [2022-07-11 14:58:50,323][26022] Updated weights on worker 0-0, policy_version 1240788 (0.00078) [2022-07-11 14:58:52,084][26022] Updated weights on worker 0-0, policy_version 1240798 (0.00088) [2022-07-11 14:58:53,617][25689] Fps is (10 sec: 5721.1, 60 sec: 5553.4, 300 sec: 5541.7). Total num frames: 1270585344. Throughput: 0: 4976.1. Samples: 1270578124. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:53,621][25689] Avg episode reward: [(0, '-1.810')] [2022-07-11 14:58:54,038][26022] Updated weights on worker 0-0, policy_version 1240808 (0.00087) [2022-07-11 14:58:55,839][26022] Updated weights on worker 0-0, policy_version 1240818 (0.00087) [2022-07-11 14:58:57,830][26022] Updated weights on worker 0-0, policy_version 1240828 (0.00087) [2022-07-11 14:58:58,672][25689] Fps is (10 sec: 5695.0, 60 sec: 5536.4, 300 sec: 5540.7). Total num frames: 1270614016. Throughput: 0: 5801.4. Samples: 1270611220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:58:58,673][25689] Avg episode reward: [(0, '-0.930')] [2022-07-11 14:58:59,767][26022] Updated weights on worker 0-0, policy_version 1240838 (0.00099) [2022-07-11 14:59:01,356][26022] Updated weights on worker 0-0, policy_version 1240848 (0.00089) [2022-07-11 14:59:03,685][25689] Fps is (10 sec: 5187.2, 60 sec: 5519.7, 300 sec: 5533.8). Total num frames: 1270637568. Throughput: 0: 5727.5. Samples: 1270642862. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:03,685][25689] Avg episode reward: [(0, '-0.474')] [2022-07-11 14:59:03,700][26022] Updated weights on worker 0-0, policy_version 1240858 (0.00088) [2022-07-11 14:59:05,248][26022] Updated weights on worker 0-0, policy_version 1240868 (0.00093) [2022-07-11 14:59:07,247][26022] Updated weights on worker 0-0, policy_version 1240878 (0.00085) [2022-07-11 14:59:08,706][25689] Fps is (10 sec: 5204.7, 60 sec: 5535.4, 300 sec: 5540.4). Total num frames: 1270666240. Throughput: 0: 5726.5. Samples: 1270676490. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:08,707][25689] Avg episode reward: [(0, '-0.932')] [2022-07-11 14:59:09,197][26022] Updated weights on worker 0-0, policy_version 1240888 (0.00097) [2022-07-11 14:59:10,997][26022] Updated weights on worker 0-0, policy_version 1240898 (0.00086) [2022-07-11 14:59:12,937][26022] Updated weights on worker 0-0, policy_version 1240908 (0.00090) [2022-07-11 14:59:13,739][25689] Fps is (10 sec: 5703.6, 60 sec: 5533.1, 300 sec: 5541.0). Total num frames: 1270694912. Throughput: 0: 5704.2. Samples: 1270692902. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:13,739][25689] Avg episode reward: [(0, '-0.390')] [2022-07-11 14:59:14,760][26022] Updated weights on worker 0-0, policy_version 1240918 (0.00086) [2022-07-11 14:59:16,628][26022] Updated weights on worker 0-0, policy_version 1240928 (0.00085) [2022-07-11 14:59:18,299][26022] Updated weights on worker 0-0, policy_version 1240938 (0.00095) [2022-07-11 14:59:18,856][25689] Fps is (10 sec: 5548.7, 60 sec: 5545.5, 300 sec: 5536.7). Total num frames: 1270722560. Throughput: 0: 5698.8. Samples: 1270726246. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:18,858][25689] Avg episode reward: [(0, '1.240')] [2022-07-11 14:59:20,307][26022] Updated weights on worker 0-0, policy_version 1240948 (0.00082) [2022-07-11 14:59:21,999][26022] Updated weights on worker 0-0, policy_version 1240958 (0.00087) [2022-07-11 14:59:23,847][26022] Updated weights on worker 0-0, policy_version 1240968 (0.00087) [2022-07-11 14:59:23,944][25689] Fps is (10 sec: 5518.7, 60 sec: 5520.9, 300 sec: 5531.8). Total num frames: 1270751232. Throughput: 0: 5765.8. Samples: 1270759674. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:23,945][25689] Avg episode reward: [(0, '1.234')] [2022-07-11 14:59:25,557][26022] Updated weights on worker 0-0, policy_version 1240978 (0.00089) [2022-07-11 14:59:27,621][26022] Updated weights on worker 0-0, policy_version 1240988 (0.00090) [2022-07-11 14:59:28,979][25689] Fps is (10 sec: 5564.0, 60 sec: 5520.0, 300 sec: 5536.3). Total num frames: 1270778880. Throughput: 0: 4922.3. Samples: 1270776278. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:28,979][25689] Avg episode reward: [(0, '1.518')] [2022-07-11 14:59:29,399][26022] Updated weights on worker 0-0, policy_version 1240998 (0.00054) [2022-07-11 14:59:31,282][26022] Updated weights on worker 0-0, policy_version 1241008 (0.00090) [2022-07-11 14:59:33,083][26022] Updated weights on worker 0-0, policy_version 1241018 (0.00086) [2022-07-11 14:59:34,004][25689] Fps is (10 sec: 5598.5, 60 sec: 5534.8, 300 sec: 5537.6). Total num frames: 1270807552. Throughput: 0: 5772.1. Samples: 1270809874. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:34,004][25689] Avg episode reward: [(0, '1.263')] [2022-07-11 14:59:34,870][26022] Updated weights on worker 0-0, policy_version 1241028 (0.00087) [2022-07-11 14:59:36,628][26022] Updated weights on worker 0-0, policy_version 1241038 (0.00089) [2022-07-11 14:59:38,436][26022] Updated weights on worker 0-0, policy_version 1241048 (0.00100) [2022-07-11 14:59:39,100][25689] Fps is (10 sec: 5665.5, 60 sec: 5530.8, 300 sec: 5540.1). Total num frames: 1270836224. Throughput: 0: 5798.6. Samples: 1270843632. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:39,101][25689] Avg episode reward: [(0, '1.930')] [2022-07-11 14:59:40,564][26022] Updated weights on worker 0-0, policy_version 1241058 (0.00089) [2022-07-11 14:59:42,170][26022] Updated weights on worker 0-0, policy_version 1241068 (0.00085) [2022-07-11 14:59:44,056][26022] Updated weights on worker 0-0, policy_version 1241078 (0.00087) [2022-07-11 14:59:44,145][25689] Fps is (10 sec: 5553.4, 60 sec: 5546.9, 300 sec: 5539.8). Total num frames: 1270863872. Throughput: 0: 4987.8. Samples: 1270860434. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:44,146][25689] Avg episode reward: [(0, '1.803')] [2022-07-11 14:59:45,846][26022] Updated weights on worker 0-0, policy_version 1241088 (0.00083) [2022-07-11 14:59:47,567][26022] Updated weights on worker 0-0, policy_version 1241098 (0.00083) [2022-07-11 14:59:49,184][25689] Fps is (10 sec: 5585.2, 60 sec: 5544.2, 300 sec: 5542.8). Total num frames: 1270892544. Throughput: 0: 5821.1. Samples: 1270893896. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:49,185][25689] Avg episode reward: [(0, '1.677')] [2022-07-11 14:59:49,569][26022] Updated weights on worker 0-0, policy_version 1241108 (0.00083) [2022-07-11 14:59:51,376][26022] Updated weights on worker 0-0, policy_version 1241118 (0.00085) [2022-07-11 14:59:53,175][26022] Updated weights on worker 0-0, policy_version 1241128 (0.00088) [2022-07-11 14:59:54,207][25689] Fps is (10 sec: 5597.7, 60 sec: 5526.5, 300 sec: 5537.8). Total num frames: 1270920192. Throughput: 0: 5846.4. Samples: 1270927988. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:54,207][25689] Avg episode reward: [(0, '1.459')] [2022-07-11 14:59:54,888][26022] Updated weights on worker 0-0, policy_version 1241138 (0.00085) [2022-07-11 14:59:56,875][26022] Updated weights on worker 0-0, policy_version 1241148 (0.00084) [2022-07-11 14:59:58,741][26022] Updated weights on worker 0-0, policy_version 1241158 (0.00083) [2022-07-11 14:59:59,256][25689] Fps is (10 sec: 5490.3, 60 sec: 5510.1, 300 sec: 5544.1). Total num frames: 1270947840. Throughput: 0: 5017.2. Samples: 1270944756. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 14:59:59,256][25689] Avg episode reward: [(0, '1.302')] [2022-07-11 15:00:00,266][26022] Updated weights on worker 0-0, policy_version 1241168 (0.00087) [2022-07-11 15:00:02,892][26022] Updated weights on worker 0-0, policy_version 1241178 (0.00089) [2022-07-11 15:00:04,155][26022] Updated weights on worker 0-0, policy_version 1241188 (0.00085) [2022-07-11 15:00:04,303][25689] Fps is (10 sec: 5578.4, 60 sec: 5591.5, 300 sec: 5547.2). Total num frames: 1270976512. Throughput: 0: 5754.4. Samples: 1270976426. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:04,303][25689] Avg episode reward: [(0, '1.454')] [2022-07-11 15:00:06,342][26022] Updated weights on worker 0-0, policy_version 1241198 (0.00086) [2022-07-11 15:00:08,068][26022] Updated weights on worker 0-0, policy_version 1241208 (0.00086) [2022-07-11 15:00:09,323][25689] Fps is (10 sec: 5391.0, 60 sec: 5540.9, 300 sec: 5540.3). Total num frames: 1271002112. Throughput: 0: 5791.4. Samples: 1271010526. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:09,323][25689] Avg episode reward: [(0, '1.320')] [2022-07-11 15:00:09,931][26022] Updated weights on worker 0-0, policy_version 1241218 (0.00998) [2022-07-11 15:00:11,663][26022] Updated weights on worker 0-0, policy_version 1241228 (0.00096) [2022-07-11 15:00:13,455][26022] Updated weights on worker 0-0, policy_version 1241238 (0.00088) [2022-07-11 15:00:14,371][25689] Fps is (10 sec: 5492.3, 60 sec: 5556.4, 300 sec: 5547.5). Total num frames: 1271031808. Throughput: 0: 4929.7. Samples: 1271027390. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:14,371][25689] Avg episode reward: [(0, '1.593')] [2022-07-11 15:00:15,358][26022] Updated weights on worker 0-0, policy_version 1241248 (0.00097) [2022-07-11 15:00:15,999][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:00:16,010][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001241252_1271042048.pth [2022-07-11 15:00:16,012][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001239300_1269043200.pth [2022-07-11 15:00:17,255][26022] Updated weights on worker 0-0, policy_version 1241258 (0.00087) [2022-07-11 15:00:18,898][26022] Updated weights on worker 0-0, policy_version 1241268 (0.00085) [2022-07-11 15:00:19,419][25689] Fps is (10 sec: 5781.4, 60 sec: 5579.7, 300 sec: 5547.4). Total num frames: 1271060480. Throughput: 0: 5765.8. Samples: 1271061012. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:19,419][25689] Avg episode reward: [(0, '1.247')] [2022-07-11 15:00:20,863][26022] Updated weights on worker 0-0, policy_version 1241278 (0.00084) [2022-07-11 15:00:22,550][26022] Updated weights on worker 0-0, policy_version 1241288 (0.00089) [2022-07-11 15:00:24,421][25689] Fps is (10 sec: 5604.1, 60 sec: 5570.7, 300 sec: 5544.4). Total num frames: 1271088128. Throughput: 0: 5899.5. Samples: 1271095112. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:24,423][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 15:00:24,435][26022] Updated weights on worker 0-0, policy_version 1241298 (0.00085) [2022-07-11 15:00:26,191][26022] Updated weights on worker 0-0, policy_version 1241308 (0.00092) [2022-07-11 15:00:28,179][26022] Updated weights on worker 0-0, policy_version 1241318 (0.00086) [2022-07-11 15:00:29,456][25689] Fps is (10 sec: 5611.3, 60 sec: 5587.6, 300 sec: 5554.1). Total num frames: 1271116800. Throughput: 0: 5034.1. Samples: 1271111876. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:29,456][25689] Avg episode reward: [(0, '0.764')] [2022-07-11 15:00:29,867][26022] Updated weights on worker 0-0, policy_version 1241328 (0.00089) [2022-07-11 15:00:31,916][26022] Updated weights on worker 0-0, policy_version 1241338 (0.00094) [2022-07-11 15:00:33,348][26022] Updated weights on worker 0-0, policy_version 1241348 (0.00083) [2022-07-11 15:00:34,465][25689] Fps is (10 sec: 5607.2, 60 sec: 5572.1, 300 sec: 5544.8). Total num frames: 1271144448. Throughput: 0: 5872.7. Samples: 1271145396. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:34,465][25689] Avg episode reward: [(0, '0.671')] [2022-07-11 15:00:35,495][26022] Updated weights on worker 0-0, policy_version 1241358 (0.00090) [2022-07-11 15:00:37,339][26022] Updated weights on worker 0-0, policy_version 1241368 (0.00082) [2022-07-11 15:00:38,975][26022] Updated weights on worker 0-0, policy_version 1241378 (0.00092) [2022-07-11 15:00:39,524][25689] Fps is (10 sec: 5695.7, 60 sec: 5592.6, 300 sec: 5550.7). Total num frames: 1271174144. Throughput: 0: 5865.2. Samples: 1271178930. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2022-07-11 15:00:39,524][25689] Avg episode reward: [(0, '0.779')] [2022-07-11 15:00:41,211][26022] Updated weights on worker 0-0, policy_version 1241388 (0.00093) [2022-07-11 15:00:42,600][26022] Updated weights on worker 0-0, policy_version 1241398 (0.00084) [2022-07-11 15:00:44,571][25689] Fps is (10 sec: 5674.4, 60 sec: 5592.4, 300 sec: 5551.4). Total num frames: 1271201792. Throughput: 0: 4998.2. Samples: 1271195826. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:00:44,571][25689] Avg episode reward: [(0, '-0.440')] [2022-07-11 15:00:44,573][26022] Updated weights on worker 0-0, policy_version 1241408 (0.00079) [2022-07-11 15:00:46,542][26022] Updated weights on worker 0-0, policy_version 1241418 (0.00086) [2022-07-11 15:00:48,125][26022] Updated weights on worker 0-0, policy_version 1241428 (0.00085) [2022-07-11 15:00:49,599][25689] Fps is (10 sec: 5386.6, 60 sec: 5559.5, 300 sec: 5544.7). Total num frames: 1271228416. Throughput: 0: 5827.4. Samples: 1271229258. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:00:49,599][25689] Avg episode reward: [(0, '0.181')] [2022-07-11 15:00:50,263][26022] Updated weights on worker 0-0, policy_version 1241438 (0.00103) [2022-07-11 15:00:51,876][26022] Updated weights on worker 0-0, policy_version 1241448 (0.00094) [2022-07-11 15:00:53,765][26022] Updated weights on worker 0-0, policy_version 1241458 (0.00087) [2022-07-11 15:00:54,647][25689] Fps is (10 sec: 5589.1, 60 sec: 5590.9, 300 sec: 5552.5). Total num frames: 1271258112. Throughput: 0: 5828.9. Samples: 1271263038. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:00:54,648][25689] Avg episode reward: [(0, '0.020')] [2022-07-11 15:00:55,512][26022] Updated weights on worker 0-0, policy_version 1241468 (0.00081) [2022-07-11 15:00:57,327][26022] Updated weights on worker 0-0, policy_version 1241478 (0.00055) [2022-07-11 15:00:59,435][26022] Updated weights on worker 0-0, policy_version 1241488 (0.00080) [2022-07-11 15:00:59,767][25689] Fps is (10 sec: 5639.4, 60 sec: 5584.4, 300 sec: 5550.9). Total num frames: 1271285760. Throughput: 0: 4984.6. Samples: 1271279836. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:00:59,768][25689] Avg episode reward: [(0, '0.570')] [2022-07-11 15:01:01,183][26022] Updated weights on worker 0-0, policy_version 1241498 (0.00080) [2022-07-11 15:01:03,242][26022] Updated weights on worker 0-0, policy_version 1241508 (0.00082) [2022-07-11 15:01:04,831][25689] Fps is (10 sec: 5228.6, 60 sec: 5532.1, 300 sec: 5553.3). Total num frames: 1271311360. Throughput: 0: 5699.3. Samples: 1271311298. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:04,832][25689] Avg episode reward: [(0, '0.919')] [2022-07-11 15:01:05,106][26022] Updated weights on worker 0-0, policy_version 1241518 (0.00089) [2022-07-11 15:01:06,937][26022] Updated weights on worker 0-0, policy_version 1241528 (0.00085) [2022-07-11 15:01:08,678][26022] Updated weights on worker 0-0, policy_version 1241538 (0.00089) [2022-07-11 15:01:09,875][25689] Fps is (10 sec: 5470.8, 60 sec: 5597.6, 300 sec: 5556.3). Total num frames: 1271341056. Throughput: 0: 5704.0. Samples: 1271344912. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:09,875][25689] Avg episode reward: [(0, '0.884')] [2022-07-11 15:01:10,691][26022] Updated weights on worker 0-0, policy_version 1241548 (0.00086) [2022-07-11 15:01:12,410][26022] Updated weights on worker 0-0, policy_version 1241558 (0.00086) [2022-07-11 15:01:14,335][26022] Updated weights on worker 0-0, policy_version 1241568 (0.00086) [2022-07-11 15:01:14,878][25689] Fps is (10 sec: 5605.8, 60 sec: 5550.9, 300 sec: 5547.3). Total num frames: 1271367680. Throughput: 0: 4883.4. Samples: 1271361832. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:14,878][25689] Avg episode reward: [(0, '1.908')] [2022-07-11 15:01:16,033][26022] Updated weights on worker 0-0, policy_version 1241578 (0.00083) [2022-07-11 15:01:18,118][26022] Updated weights on worker 0-0, policy_version 1241588 (0.00086) [2022-07-11 15:01:19,700][26022] Updated weights on worker 0-0, policy_version 1241598 (0.00091) [2022-07-11 15:01:19,958][25689] Fps is (10 sec: 5585.2, 60 sec: 5564.9, 300 sec: 5554.5). Total num frames: 1271397376. Throughput: 0: 5719.9. Samples: 1271395328. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:19,959][25689] Avg episode reward: [(0, '1.741')] [2022-07-11 15:01:21,702][26022] Updated weights on worker 0-0, policy_version 1241608 (0.00095) [2022-07-11 15:01:23,262][26022] Updated weights on worker 0-0, policy_version 1241618 (0.00090) [2022-07-11 15:01:24,998][25689] Fps is (10 sec: 5666.3, 60 sec: 5561.4, 300 sec: 5554.5). Total num frames: 1271425024. Throughput: 0: 5848.2. Samples: 1271429238. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:24,999][25689] Avg episode reward: [(0, '1.921')] [2022-07-11 15:01:25,208][26022] Updated weights on worker 0-0, policy_version 1241628 (0.00091) [2022-07-11 15:01:27,010][26022] Updated weights on worker 0-0, policy_version 1241638 (0.00088) [2022-07-11 15:01:28,831][26022] Updated weights on worker 0-0, policy_version 1241648 (0.00086) [2022-07-11 15:01:30,053][25689] Fps is (10 sec: 5579.2, 60 sec: 5559.6, 300 sec: 5550.6). Total num frames: 1271453696. Throughput: 0: 5849.3. Samples: 1271462942. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:30,054][25689] Avg episode reward: [(0, '1.689')] [2022-07-11 15:01:30,788][26022] Updated weights on worker 0-0, policy_version 1241658 (0.00085) [2022-07-11 15:01:32,367][26022] Updated weights on worker 0-0, policy_version 1241668 (0.00090) [2022-07-11 15:01:34,497][26022] Updated weights on worker 0-0, policy_version 1241678 (0.00090) [2022-07-11 15:01:35,126][25689] Fps is (10 sec: 5662.2, 60 sec: 5570.7, 300 sec: 5557.0). Total num frames: 1271482368. Throughput: 0: 5821.1. Samples: 1271479696. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:35,128][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 15:01:36,119][26022] Updated weights on worker 0-0, policy_version 1241688 (0.00082) [2022-07-11 15:01:37,964][26022] Updated weights on worker 0-0, policy_version 1241698 (0.00085) [2022-07-11 15:01:39,850][26022] Updated weights on worker 0-0, policy_version 1241708 (0.00087) [2022-07-11 15:01:40,168][25689] Fps is (10 sec: 5466.5, 60 sec: 5521.4, 300 sec: 5549.6). Total num frames: 1271508992. Throughput: 0: 5842.1. Samples: 1271513398. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:40,170][25689] Avg episode reward: [(0, '1.282')] [2022-07-11 15:01:41,471][26022] Updated weights on worker 0-0, policy_version 1241718 (0.00085) [2022-07-11 15:01:43,500][26022] Updated weights on worker 0-0, policy_version 1241728 (0.00087) [2022-07-11 15:01:45,236][25689] Fps is (10 sec: 5570.4, 60 sec: 5553.3, 300 sec: 5555.3). Total num frames: 1271538688. Throughput: 0: 5807.8. Samples: 1271546776. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:45,238][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 15:01:45,309][26022] Updated weights on worker 0-0, policy_version 1241738 (0.00630) [2022-07-11 15:01:47,291][26022] Updated weights on worker 0-0, policy_version 1241748 (0.00089) [2022-07-11 15:01:48,752][26022] Updated weights on worker 0-0, policy_version 1241758 (0.00090) [2022-07-11 15:01:50,277][25689] Fps is (10 sec: 5672.7, 60 sec: 5569.0, 300 sec: 5554.8). Total num frames: 1271566336. Throughput: 0: 4973.3. Samples: 1271563522. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:50,278][25689] Avg episode reward: [(0, '0.487')] [2022-07-11 15:01:50,983][26022] Updated weights on worker 0-0, policy_version 1241768 (0.00088) [2022-07-11 15:01:52,516][26022] Updated weights on worker 0-0, policy_version 1241778 (0.00085) [2022-07-11 15:01:54,616][26022] Updated weights on worker 0-0, policy_version 1241788 (0.00085) [2022-07-11 15:01:55,287][25689] Fps is (10 sec: 5603.5, 60 sec: 5555.7, 300 sec: 5552.8). Total num frames: 1271595008. Throughput: 0: 5841.8. Samples: 1271597476. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:01:55,287][25689] Avg episode reward: [(0, '0.364')] [2022-07-11 15:01:56,084][26022] Updated weights on worker 0-0, policy_version 1241798 (0.00085) [2022-07-11 15:01:58,213][26022] Updated weights on worker 0-0, policy_version 1241808 (0.00088) [2022-07-11 15:01:59,837][26022] Updated weights on worker 0-0, policy_version 1241818 (0.00083) [2022-07-11 15:02:00,398][25689] Fps is (10 sec: 5665.7, 60 sec: 5573.4, 300 sec: 5561.7). Total num frames: 1271623680. Throughput: 0: 5815.5. Samples: 1271631046. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:00,400][25689] Avg episode reward: [(0, '-0.189')] [2022-07-11 15:02:02,143][26022] Updated weights on worker 0-0, policy_version 1241828 (0.00087) [2022-07-11 15:02:03,778][26022] Updated weights on worker 0-0, policy_version 1241838 (0.00094) [2022-07-11 15:02:05,405][25689] Fps is (10 sec: 5364.1, 60 sec: 5578.7, 300 sec: 5556.1). Total num frames: 1271649280. Throughput: 0: 4912.6. Samples: 1271645856. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:05,405][25689] Avg episode reward: [(0, '-0.177')] [2022-07-11 15:02:05,961][26022] Updated weights on worker 0-0, policy_version 1241848 (0.00086) [2022-07-11 15:02:07,455][26022] Updated weights on worker 0-0, policy_version 1241858 (0.00067) [2022-07-11 15:02:09,669][26022] Updated weights on worker 0-0, policy_version 1241868 (0.00092) [2022-07-11 15:02:10,435][25689] Fps is (10 sec: 5509.7, 60 sec: 5579.9, 300 sec: 5559.3). Total num frames: 1271678976. Throughput: 0: 5737.4. Samples: 1271679172. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:10,435][25689] Avg episode reward: [(0, '0.204')] [2022-07-11 15:02:11,301][26022] Updated weights on worker 0-0, policy_version 1241878 (0.00100) [2022-07-11 15:02:13,177][26022] Updated weights on worker 0-0, policy_version 1241888 (0.00087) [2022-07-11 15:02:14,870][26022] Updated weights on worker 0-0, policy_version 1241898 (0.00094) [2022-07-11 15:02:15,515][25689] Fps is (10 sec: 5570.7, 60 sec: 5572.9, 300 sec: 5558.5). Total num frames: 1271705600. Throughput: 0: 5704.5. Samples: 1271712866. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:15,515][25689] Avg episode reward: [(0, '0.393')] [2022-07-11 15:02:16,093][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:02:16,110][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001241903_1271708672.pth [2022-07-11 15:02:16,111][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001239948_1269706752.pth [2022-07-11 15:02:16,771][26022] Updated weights on worker 0-0, policy_version 1241908 (0.00084) [2022-07-11 15:02:18,552][26022] Updated weights on worker 0-0, policy_version 1241918 (0.00098) [2022-07-11 15:02:20,576][25689] Fps is (10 sec: 5351.5, 60 sec: 5540.8, 300 sec: 5547.8). Total num frames: 1271733248. Throughput: 0: 4888.9. Samples: 1271729692. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:20,576][25689] Avg episode reward: [(0, '0.629')] [2022-07-11 15:02:20,660][26022] Updated weights on worker 0-0, policy_version 1241928 (0.00087) [2022-07-11 15:02:22,396][26022] Updated weights on worker 0-0, policy_version 1241938 (0.00089) [2022-07-11 15:02:24,151][26022] Updated weights on worker 0-0, policy_version 1241948 (0.00086) [2022-07-11 15:02:25,592][25689] Fps is (10 sec: 5588.6, 60 sec: 5559.9, 300 sec: 5554.6). Total num frames: 1271761920. Throughput: 0: 5824.0. Samples: 1271763430. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:25,593][25689] Avg episode reward: [(0, '1.502')] [2022-07-11 15:02:25,887][26022] Updated weights on worker 0-0, policy_version 1241958 (0.00081) [2022-07-11 15:02:27,844][26022] Updated weights on worker 0-0, policy_version 1241968 (0.00090) [2022-07-11 15:02:29,759][26022] Updated weights on worker 0-0, policy_version 1241978 (0.00085) [2022-07-11 15:02:30,616][25689] Fps is (10 sec: 5813.6, 60 sec: 5579.7, 300 sec: 5564.7). Total num frames: 1271791616. Throughput: 0: 5831.0. Samples: 1271796848. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:30,616][25689] Avg episode reward: [(0, '1.469')] [2022-07-11 15:02:31,580][26022] Updated weights on worker 0-0, policy_version 1241988 (0.00084) [2022-07-11 15:02:33,423][26022] Updated weights on worker 0-0, policy_version 1241998 (0.00083) [2022-07-11 15:02:35,211][26022] Updated weights on worker 0-0, policy_version 1242008 (0.00081) [2022-07-11 15:02:35,635][25689] Fps is (10 sec: 5608.3, 60 sec: 5550.8, 300 sec: 5551.5). Total num frames: 1271818240. Throughput: 0: 5004.8. Samples: 1271813562. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:35,635][25689] Avg episode reward: [(0, '1.522')] [2022-07-11 15:02:36,942][26022] Updated weights on worker 0-0, policy_version 1242018 (0.00087) [2022-07-11 15:02:38,913][26022] Updated weights on worker 0-0, policy_version 1242028 (0.00080) [2022-07-11 15:02:40,663][26022] Updated weights on worker 0-0, policy_version 1242038 (0.00123) [2022-07-11 15:02:40,694][25689] Fps is (10 sec: 5486.5, 60 sec: 5583.1, 300 sec: 5559.1). Total num frames: 1271846912. Throughput: 0: 5849.2. Samples: 1271847368. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:40,695][25689] Avg episode reward: [(0, '0.423')] [2022-07-11 15:02:42,573][26022] Updated weights on worker 0-0, policy_version 1242048 (0.00085) [2022-07-11 15:02:44,312][26022] Updated weights on worker 0-0, policy_version 1242058 (0.00084) [2022-07-11 15:02:45,718][25689] Fps is (10 sec: 5585.6, 60 sec: 5553.3, 300 sec: 5556.4). Total num frames: 1271874560. Throughput: 0: 5841.8. Samples: 1271880998. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:45,718][25689] Avg episode reward: [(0, '0.332')] [2022-07-11 15:02:46,382][26022] Updated weights on worker 0-0, policy_version 1242068 (0.01184) [2022-07-11 15:02:47,934][26022] Updated weights on worker 0-0, policy_version 1242078 (0.00095) [2022-07-11 15:02:49,903][26022] Updated weights on worker 0-0, policy_version 1242088 (0.00084) [2022-07-11 15:02:50,723][25689] Fps is (10 sec: 5717.9, 60 sec: 5590.4, 300 sec: 5560.7). Total num frames: 1271904256. Throughput: 0: 5016.1. Samples: 1271897708. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:50,724][25689] Avg episode reward: [(0, '-0.581')] [2022-07-11 15:02:51,626][26022] Updated weights on worker 0-0, policy_version 1242098 (0.00390) [2022-07-11 15:02:53,554][26022] Updated weights on worker 0-0, policy_version 1242108 (0.00093) [2022-07-11 15:02:55,268][26022] Updated weights on worker 0-0, policy_version 1242118 (0.00089) [2022-07-11 15:02:55,751][25689] Fps is (10 sec: 5511.4, 60 sec: 5538.0, 300 sec: 5547.4). Total num frames: 1271929856. Throughput: 0: 5861.1. Samples: 1271931464. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:02:55,751][25689] Avg episode reward: [(0, '-0.474')] [2022-07-11 15:02:57,104][26022] Updated weights on worker 0-0, policy_version 1242128 (0.00085) [2022-07-11 15:02:59,242][26022] Updated weights on worker 0-0, policy_version 1242138 (0.00082) [2022-07-11 15:03:00,702][26022] Updated weights on worker 0-0, policy_version 1242148 (0.00085) [2022-07-11 15:03:00,803][25689] Fps is (10 sec: 5486.1, 60 sec: 5560.4, 300 sec: 5564.0). Total num frames: 1271959552. Throughput: 0: 5843.7. Samples: 1271964874. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:00,803][25689] Avg episode reward: [(0, '-0.420')] [2022-07-11 15:03:03,183][26022] Updated weights on worker 0-0, policy_version 1242158 (0.00090) [2022-07-11 15:03:04,735][26022] Updated weights on worker 0-0, policy_version 1242168 (0.00105) [2022-07-11 15:03:05,813][25689] Fps is (10 sec: 5393.7, 60 sec: 5543.1, 300 sec: 5553.6). Total num frames: 1271984128. Throughput: 0: 4914.4. Samples: 1271979754. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:05,815][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 15:03:06,759][26022] Updated weights on worker 0-0, policy_version 1242178 (0.00097) [2022-07-11 15:03:08,551][26022] Updated weights on worker 0-0, policy_version 1242188 (0.00088) [2022-07-11 15:03:10,456][26022] Updated weights on worker 0-0, policy_version 1242198 (0.00093) [2022-07-11 15:03:10,820][25689] Fps is (10 sec: 5315.6, 60 sec: 5528.2, 300 sec: 5553.6). Total num frames: 1272012800. Throughput: 0: 5750.7. Samples: 1272013278. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:10,820][25689] Avg episode reward: [(0, '0.372')] [2022-07-11 15:03:12,111][26022] Updated weights on worker 0-0, policy_version 1242208 (0.00089) [2022-07-11 15:03:14,116][26022] Updated weights on worker 0-0, policy_version 1242218 (0.00084) [2022-07-11 15:03:15,770][26022] Updated weights on worker 0-0, policy_version 1242228 (0.00098) [2022-07-11 15:03:15,822][25689] Fps is (10 sec: 5728.8, 60 sec: 5569.3, 300 sec: 5561.7). Total num frames: 1272041472. Throughput: 0: 5762.8. Samples: 1272047134. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:15,823][25689] Avg episode reward: [(0, '1.432')] [2022-07-11 15:03:17,735][26022] Updated weights on worker 0-0, policy_version 1242238 (0.00087) [2022-07-11 15:03:19,421][26022] Updated weights on worker 0-0, policy_version 1242248 (0.00079) [2022-07-11 15:03:20,883][25689] Fps is (10 sec: 5495.1, 60 sec: 5552.4, 300 sec: 5550.3). Total num frames: 1272068096. Throughput: 0: 4922.2. Samples: 1272063712. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:20,883][25689] Avg episode reward: [(0, '1.716')] [2022-07-11 15:03:21,479][26022] Updated weights on worker 0-0, policy_version 1242258 (0.00092) [2022-07-11 15:03:23,195][26022] Updated weights on worker 0-0, policy_version 1242268 (0.00085) [2022-07-11 15:03:25,029][26022] Updated weights on worker 0-0, policy_version 1242278 (0.00083) [2022-07-11 15:03:25,894][25689] Fps is (10 sec: 5592.3, 60 sec: 5569.9, 300 sec: 5557.5). Total num frames: 1272097792. Throughput: 0: 5854.7. Samples: 1272097322. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:25,894][25689] Avg episode reward: [(0, '1.574')] [2022-07-11 15:03:26,879][26022] Updated weights on worker 0-0, policy_version 1242288 (0.00086) [2022-07-11 15:03:28,688][26022] Updated weights on worker 0-0, policy_version 1242298 (0.00090) [2022-07-11 15:03:30,680][26022] Updated weights on worker 0-0, policy_version 1242308 (0.00094) [2022-07-11 15:03:30,917][25689] Fps is (10 sec: 5612.9, 60 sec: 5519.0, 300 sec: 5553.6). Total num frames: 1272124416. Throughput: 0: 5825.4. Samples: 1272130350. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:30,917][25689] Avg episode reward: [(0, '0.021')] [2022-07-11 15:03:32,472][26022] Updated weights on worker 0-0, policy_version 1242318 (0.00097) [2022-07-11 15:03:34,287][26022] Updated weights on worker 0-0, policy_version 1242328 (0.00089) [2022-07-11 15:03:35,925][25689] Fps is (10 sec: 5410.0, 60 sec: 5536.9, 300 sec: 5551.0). Total num frames: 1272152064. Throughput: 0: 4969.3. Samples: 1272147030. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:35,926][25689] Avg episode reward: [(0, '0.346')] [2022-07-11 15:03:36,092][26022] Updated weights on worker 0-0, policy_version 1242338 (0.00091) [2022-07-11 15:03:38,035][26022] Updated weights on worker 0-0, policy_version 1242348 (0.00084) [2022-07-11 15:03:39,794][26022] Updated weights on worker 0-0, policy_version 1242358 (0.00084) [2022-07-11 15:03:41,067][25689] Fps is (10 sec: 5548.5, 60 sec: 5529.4, 300 sec: 5555.9). Total num frames: 1272180736. Throughput: 0: 5780.1. Samples: 1272180380. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:41,068][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 15:03:41,681][26022] Updated weights on worker 0-0, policy_version 1242368 (0.00090) [2022-07-11 15:03:43,350][26022] Updated weights on worker 0-0, policy_version 1242378 (0.00092) [2022-07-11 15:03:45,379][26022] Updated weights on worker 0-0, policy_version 1242388 (0.00086) [2022-07-11 15:03:46,090][25689] Fps is (10 sec: 5641.3, 60 sec: 5546.3, 300 sec: 5555.6). Total num frames: 1272209408. Throughput: 0: 5790.4. Samples: 1272214270. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:46,091][25689] Avg episode reward: [(0, '0.549')] [2022-07-11 15:03:47,015][26022] Updated weights on worker 0-0, policy_version 1242398 (0.00087) [2022-07-11 15:03:48,998][26022] Updated weights on worker 0-0, policy_version 1242408 (0.00080) [2022-07-11 15:03:50,851][26022] Updated weights on worker 0-0, policy_version 1242418 (0.00098) [2022-07-11 15:03:51,119][25689] Fps is (10 sec: 5603.2, 60 sec: 5510.4, 300 sec: 5551.9). Total num frames: 1272237056. Throughput: 0: 4977.6. Samples: 1272230906. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:51,119][25689] Avg episode reward: [(0, '-0.155')] [2022-07-11 15:03:52,663][26022] Updated weights on worker 0-0, policy_version 1242428 (0.00089) [2022-07-11 15:03:54,484][26022] Updated weights on worker 0-0, policy_version 1242438 (0.00087) [2022-07-11 15:03:56,174][25689] Fps is (10 sec: 5585.0, 60 sec: 5558.6, 300 sec: 5551.9). Total num frames: 1272265728. Throughput: 0: 5784.8. Samples: 1272264168. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:03:56,175][25689] Avg episode reward: [(0, '0.011')] [2022-07-11 15:03:56,451][26022] Updated weights on worker 0-0, policy_version 1242448 (0.00082) [2022-07-11 15:03:58,184][26022] Updated weights on worker 0-0, policy_version 1242458 (0.00091) [2022-07-11 15:04:00,060][26022] Updated weights on worker 0-0, policy_version 1242468 (0.00082) [2022-07-11 15:04:01,280][25689] Fps is (10 sec: 5542.4, 60 sec: 5519.8, 300 sec: 5564.0). Total num frames: 1272293376. Throughput: 0: 5820.9. Samples: 1272298038. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:01,281][25689] Avg episode reward: [(0, '1.669')] [2022-07-11 15:04:02,232][26022] Updated weights on worker 0-0, policy_version 1242478 (0.00090) [2022-07-11 15:04:04,107][26022] Updated weights on worker 0-0, policy_version 1242488 (0.00085) [2022-07-11 15:04:05,739][26022] Updated weights on worker 0-0, policy_version 1242498 (0.00083) [2022-07-11 15:04:06,341][25689] Fps is (10 sec: 5338.2, 60 sec: 5549.0, 300 sec: 5556.3). Total num frames: 1272320000. Throughput: 0: 5694.0. Samples: 1272329578. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:06,343][25689] Avg episode reward: [(0, '1.449')] [2022-07-11 15:04:07,626][26022] Updated weights on worker 0-0, policy_version 1242508 (0.00088) [2022-07-11 15:04:09,473][26022] Updated weights on worker 0-0, policy_version 1242518 (0.00099) [2022-07-11 15:04:11,277][26022] Updated weights on worker 0-0, policy_version 1242528 (0.00105) [2022-07-11 15:04:11,369][25689] Fps is (10 sec: 5582.7, 60 sec: 5564.0, 300 sec: 5559.9). Total num frames: 1272349696. Throughput: 0: 5708.7. Samples: 1272346508. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:11,370][25689] Avg episode reward: [(0, '1.528')] [2022-07-11 15:04:13,176][26022] Updated weights on worker 0-0, policy_version 1242538 (0.00089) [2022-07-11 15:04:14,856][26022] Updated weights on worker 0-0, policy_version 1242548 (0.00095) [2022-07-11 15:04:16,127][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:04:16,135][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001242554_1272375296.pth [2022-07-11 15:04:16,141][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001240599_1270373376.pth [2022-07-11 15:04:16,372][25689] Fps is (10 sec: 5512.6, 60 sec: 5513.2, 300 sec: 5555.1). Total num frames: 1272375296. Throughput: 0: 5749.0. Samples: 1272380284. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:16,374][25689] Avg episode reward: [(0, '0.898')] [2022-07-11 15:04:16,911][26022] Updated weights on worker 0-0, policy_version 1242558 (0.00094) [2022-07-11 15:04:18,522][26022] Updated weights on worker 0-0, policy_version 1242568 (0.00091) [2022-07-11 15:04:20,443][26022] Updated weights on worker 0-0, policy_version 1242578 (0.00094) [2022-07-11 15:04:21,489][25689] Fps is (10 sec: 5564.7, 60 sec: 5575.6, 300 sec: 5561.5). Total num frames: 1272406016. Throughput: 0: 5740.9. Samples: 1272414056. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:21,490][25689] Avg episode reward: [(0, '1.652')] [2022-07-11 15:04:22,112][26022] Updated weights on worker 0-0, policy_version 1242588 (0.00081) [2022-07-11 15:04:24,064][26022] Updated weights on worker 0-0, policy_version 1242598 (0.00092) [2022-07-11 15:04:25,741][26022] Updated weights on worker 0-0, policy_version 1242608 (0.00088) [2022-07-11 15:04:26,511][25689] Fps is (10 sec: 5858.0, 60 sec: 5557.8, 300 sec: 5565.1). Total num frames: 1272434688. Throughput: 0: 5033.9. Samples: 1272431108. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:26,511][25689] Avg episode reward: [(0, '1.320')] [2022-07-11 15:04:27,629][26022] Updated weights on worker 0-0, policy_version 1242618 (0.00049) [2022-07-11 15:04:29,455][26022] Updated weights on worker 0-0, policy_version 1242628 (0.00092) [2022-07-11 15:04:31,319][26022] Updated weights on worker 0-0, policy_version 1242638 (0.00085) [2022-07-11 15:04:31,607][25689] Fps is (10 sec: 5566.7, 60 sec: 5568.0, 300 sec: 5560.4). Total num frames: 1272462336. Throughput: 0: 5834.9. Samples: 1272464596. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:31,607][25689] Avg episode reward: [(0, '1.182')] [2022-07-11 15:04:33,073][26022] Updated weights on worker 0-0, policy_version 1242648 (0.00085) [2022-07-11 15:04:34,886][26022] Updated weights on worker 0-0, policy_version 1242658 (0.00083) [2022-07-11 15:04:36,637][25689] Fps is (10 sec: 5460.6, 60 sec: 5566.0, 300 sec: 5558.2). Total num frames: 1272489984. Throughput: 0: 5828.5. Samples: 1272498398. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:36,640][25689] Avg episode reward: [(0, '1.224')] [2022-07-11 15:04:36,835][26022] Updated weights on worker 0-0, policy_version 1242668 (0.00086) [2022-07-11 15:04:38,654][26022] Updated weights on worker 0-0, policy_version 1242678 (0.00086) [2022-07-11 15:04:40,451][26022] Updated weights on worker 0-0, policy_version 1242688 (0.00095) [2022-07-11 15:04:41,766][25689] Fps is (10 sec: 5543.5, 60 sec: 5567.1, 300 sec: 5560.0). Total num frames: 1272518656. Throughput: 0: 4990.0. Samples: 1272515238. Policy #0 lag: (min: 0.0, avg: 7.6, max: 18.0) [2022-07-11 15:04:41,767][25689] Avg episode reward: [(0, '1.303')] [2022-07-11 15:04:42,288][26022] Updated weights on worker 0-0, policy_version 1242698 (0.00086) [2022-07-11 15:04:44,111][26022] Updated weights on worker 0-0, policy_version 1242708 (0.00083) [2022-07-11 15:04:45,994][26022] Updated weights on worker 0-0, policy_version 1242718 (0.00085) [2022-07-11 15:04:46,785][25689] Fps is (10 sec: 5650.8, 60 sec: 5567.5, 300 sec: 5560.4). Total num frames: 1272547328. Throughput: 0: 5821.8. Samples: 1272549142. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:04:46,786][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 15:04:47,736][26022] Updated weights on worker 0-0, policy_version 1242728 (0.00095) [2022-07-11 15:04:49,565][26022] Updated weights on worker 0-0, policy_version 1242738 (0.00091) [2022-07-11 15:04:51,358][26022] Updated weights on worker 0-0, policy_version 1242748 (0.00094) [2022-07-11 15:04:51,795][25689] Fps is (10 sec: 5718.2, 60 sec: 5586.1, 300 sec: 5564.1). Total num frames: 1272576000. Throughput: 0: 5839.9. Samples: 1272582492. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:04:51,795][25689] Avg episode reward: [(0, '0.305')] [2022-07-11 15:04:53,353][26022] Updated weights on worker 0-0, policy_version 1242758 (0.00088) [2022-07-11 15:04:55,041][26022] Updated weights on worker 0-0, policy_version 1242768 (0.00090) [2022-07-11 15:04:56,800][25689] Fps is (10 sec: 5623.6, 60 sec: 5573.9, 300 sec: 5564.9). Total num frames: 1272603648. Throughput: 0: 5013.3. Samples: 1272599480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:04:56,800][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 15:04:56,890][26022] Updated weights on worker 0-0, policy_version 1242778 (0.00095) [2022-07-11 15:04:58,539][26022] Updated weights on worker 0-0, policy_version 1242788 (0.00087) [2022-07-11 15:05:00,743][26022] Updated weights on worker 0-0, policy_version 1242798 (0.00088) [2022-07-11 15:05:01,855][25689] Fps is (10 sec: 5496.6, 60 sec: 5578.6, 300 sec: 5561.3). Total num frames: 1272631296. Throughput: 0: 5881.5. Samples: 1272633388. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:01,855][25689] Avg episode reward: [(0, '0.166')] [2022-07-11 15:05:02,579][26022] Updated weights on worker 0-0, policy_version 1242808 (0.00088) [2022-07-11 15:05:04,650][26022] Updated weights on worker 0-0, policy_version 1242818 (0.00110) [2022-07-11 15:05:06,303][26022] Updated weights on worker 0-0, policy_version 1242828 (0.00087) [2022-07-11 15:05:06,863][25689] Fps is (10 sec: 5393.1, 60 sec: 5583.4, 300 sec: 5565.0). Total num frames: 1272657920. Throughput: 0: 5771.0. Samples: 1272665014. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:06,864][25689] Avg episode reward: [(0, '0.110')] [2022-07-11 15:05:08,212][26022] Updated weights on worker 0-0, policy_version 1242838 (0.00080) [2022-07-11 15:05:10,019][26022] Updated weights on worker 0-0, policy_version 1242848 (0.00085) [2022-07-11 15:05:11,898][25689] Fps is (10 sec: 5505.8, 60 sec: 5565.8, 300 sec: 5561.8). Total num frames: 1272686592. Throughput: 0: 4943.6. Samples: 1272681874. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:11,899][25689] Avg episode reward: [(0, '-0.278')] [2022-07-11 15:05:11,905][26022] Updated weights on worker 0-0, policy_version 1242858 (0.00082) [2022-07-11 15:05:13,550][26022] Updated weights on worker 0-0, policy_version 1242868 (0.00089) [2022-07-11 15:05:15,405][26022] Updated weights on worker 0-0, policy_version 1242878 (0.00085) [2022-07-11 15:05:16,923][25689] Fps is (10 sec: 5801.9, 60 sec: 5631.4, 300 sec: 5565.7). Total num frames: 1272716288. Throughput: 0: 5792.5. Samples: 1272716044. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:16,924][25689] Avg episode reward: [(0, '-1.275')] [2022-07-11 15:05:17,032][26022] Updated weights on worker 0-0, policy_version 1242888 (0.00087) [2022-07-11 15:05:19,229][26022] Updated weights on worker 0-0, policy_version 1242898 (0.00091) [2022-07-11 15:05:20,818][26022] Updated weights on worker 0-0, policy_version 1242908 (0.00083) [2022-07-11 15:05:21,996][25689] Fps is (10 sec: 5577.3, 60 sec: 5567.9, 300 sec: 5560.9). Total num frames: 1272742912. Throughput: 0: 5750.2. Samples: 1272749204. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:21,997][25689] Avg episode reward: [(0, '-0.672')] [2022-07-11 15:05:22,954][26022] Updated weights on worker 0-0, policy_version 1242918 (0.00086) [2022-07-11 15:05:24,318][26022] Updated weights on worker 0-0, policy_version 1242928 (0.00094) [2022-07-11 15:05:26,675][26022] Updated weights on worker 0-0, policy_version 1242938 (0.00085) [2022-07-11 15:05:27,059][25689] Fps is (10 sec: 5556.8, 60 sec: 5581.0, 300 sec: 5563.8). Total num frames: 1272772608. Throughput: 0: 4999.8. Samples: 1272765988. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:27,059][25689] Avg episode reward: [(0, '-0.244')] [2022-07-11 15:05:28,343][26022] Updated weights on worker 0-0, policy_version 1242948 (0.00087) [2022-07-11 15:05:30,247][26022] Updated weights on worker 0-0, policy_version 1242958 (0.00089) [2022-07-11 15:05:31,843][26022] Updated weights on worker 0-0, policy_version 1242968 (0.00085) [2022-07-11 15:05:32,093][25689] Fps is (10 sec: 5679.3, 60 sec: 5586.7, 300 sec: 5563.3). Total num frames: 1272800256. Throughput: 0: 5821.4. Samples: 1272799438. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:32,093][25689] Avg episode reward: [(0, '-1.076')] [2022-07-11 15:05:33,749][26022] Updated weights on worker 0-0, policy_version 1242978 (0.00049) [2022-07-11 15:05:35,584][26022] Updated weights on worker 0-0, policy_version 1242988 (0.00089) [2022-07-11 15:05:37,109][25689] Fps is (10 sec: 5604.0, 60 sec: 5605.0, 300 sec: 5560.7). Total num frames: 1272828928. Throughput: 0: 5825.8. Samples: 1272833640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:37,109][25689] Avg episode reward: [(0, '-1.100')] [2022-07-11 15:05:37,330][26022] Updated weights on worker 0-0, policy_version 1242998 (0.00097) [2022-07-11 15:05:39,016][26022] Updated weights on worker 0-0, policy_version 1243008 (0.00086) [2022-07-11 15:05:41,066][26022] Updated weights on worker 0-0, policy_version 1243018 (0.00084) [2022-07-11 15:05:42,167][25689] Fps is (10 sec: 5692.4, 60 sec: 5611.6, 300 sec: 5563.9). Total num frames: 1272857600. Throughput: 0: 5038.7. Samples: 1272850838. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:42,167][25689] Avg episode reward: [(0, '-0.408')] [2022-07-11 15:05:42,679][26022] Updated weights on worker 0-0, policy_version 1243028 (0.00085) [2022-07-11 15:05:44,694][26022] Updated weights on worker 0-0, policy_version 1243038 (0.00091) [2022-07-11 15:05:46,387][26022] Updated weights on worker 0-0, policy_version 1243048 (0.00083) [2022-07-11 15:05:47,267][25689] Fps is (10 sec: 5544.2, 60 sec: 5587.1, 300 sec: 5566.0). Total num frames: 1272885248. Throughput: 0: 5874.3. Samples: 1272884698. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:47,268][25689] Avg episode reward: [(0, '1.041')] [2022-07-11 15:05:48,300][26022] Updated weights on worker 0-0, policy_version 1243058 (0.00083) [2022-07-11 15:05:50,009][26022] Updated weights on worker 0-0, policy_version 1243068 (0.00084) [2022-07-11 15:05:51,908][26022] Updated weights on worker 0-0, policy_version 1243078 (0.00095) [2022-07-11 15:05:52,269][25689] Fps is (10 sec: 5473.5, 60 sec: 5570.9, 300 sec: 5560.0). Total num frames: 1272912896. Throughput: 0: 5880.1. Samples: 1272918076. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:52,270][25689] Avg episode reward: [(0, '1.293')] [2022-07-11 15:05:53,770][26022] Updated weights on worker 0-0, policy_version 1243088 (0.00078) [2022-07-11 15:05:55,537][26022] Updated weights on worker 0-0, policy_version 1243098 (0.00080) [2022-07-11 15:05:57,356][25689] Fps is (10 sec: 5582.7, 60 sec: 5580.3, 300 sec: 5564.1). Total num frames: 1272941568. Throughput: 0: 5002.7. Samples: 1272934928. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:05:57,356][25689] Avg episode reward: [(0, '0.601')] [2022-07-11 15:05:57,391][26022] Updated weights on worker 0-0, policy_version 1243108 (0.00086) [2022-07-11 15:05:59,240][26022] Updated weights on worker 0-0, policy_version 1243118 (0.00081) [2022-07-11 15:06:01,024][26022] Updated weights on worker 0-0, policy_version 1243128 (0.00084) [2022-07-11 15:06:02,449][25689] Fps is (10 sec: 5432.0, 60 sec: 5559.9, 300 sec: 5567.0). Total num frames: 1272968192. Throughput: 0: 5825.0. Samples: 1272968982. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:02,449][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 15:06:03,179][26022] Updated weights on worker 0-0, policy_version 1243138 (0.00093) [2022-07-11 15:06:04,992][26022] Updated weights on worker 0-0, policy_version 1243148 (0.00085) [2022-07-11 15:06:06,900][26022] Updated weights on worker 0-0, policy_version 1243158 (0.00077) [2022-07-11 15:06:07,479][25689] Fps is (10 sec: 5563.2, 60 sec: 5608.5, 300 sec: 5567.2). Total num frames: 1272997888. Throughput: 0: 5738.8. Samples: 1273000690. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:07,480][25689] Avg episode reward: [(0, '0.955')] [2022-07-11 15:06:08,630][26022] Updated weights on worker 0-0, policy_version 1243168 (0.00092) [2022-07-11 15:06:10,561][26022] Updated weights on worker 0-0, policy_version 1243178 (0.00084) [2022-07-11 15:06:12,136][26022] Updated weights on worker 0-0, policy_version 1243188 (0.00088) [2022-07-11 15:06:12,521][25689] Fps is (10 sec: 5693.2, 60 sec: 5591.0, 300 sec: 5569.9). Total num frames: 1273025536. Throughput: 0: 5761.7. Samples: 1273034762. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:12,522][25689] Avg episode reward: [(0, '1.099')] [2022-07-11 15:06:14,015][26022] Updated weights on worker 0-0, policy_version 1243198 (0.00088) [2022-07-11 15:06:15,967][26022] Updated weights on worker 0-0, policy_version 1243208 (0.00084) [2022-07-11 15:06:16,166][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:06:16,206][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001243209_1273046016.pth [2022-07-11 15:06:16,207][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001241252_1271042048.pth [2022-07-11 15:06:17,539][25689] Fps is (10 sec: 5598.8, 60 sec: 5574.8, 300 sec: 5567.7). Total num frames: 1273054208. Throughput: 0: 5791.3. Samples: 1273051814. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:17,539][25689] Avg episode reward: [(0, '0.762')] [2022-07-11 15:06:17,695][26022] Updated weights on worker 0-0, policy_version 1243218 (0.00092) [2022-07-11 15:06:19,755][26022] Updated weights on worker 0-0, policy_version 1243228 (0.00088) [2022-07-11 15:06:21,113][26022] Updated weights on worker 0-0, policy_version 1243238 (0.00079) [2022-07-11 15:06:22,602][25689] Fps is (10 sec: 5485.4, 60 sec: 5575.7, 300 sec: 5563.8). Total num frames: 1273080832. Throughput: 0: 5784.8. Samples: 1273085562. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:22,602][25689] Avg episode reward: [(0, '1.009')] [2022-07-11 15:06:23,254][26022] Updated weights on worker 0-0, policy_version 1243248 (0.00090) [2022-07-11 15:06:25,141][26022] Updated weights on worker 0-0, policy_version 1243258 (0.00094) [2022-07-11 15:06:26,884][26022] Updated weights on worker 0-0, policy_version 1243268 (0.00077) [2022-07-11 15:06:27,654][25689] Fps is (10 sec: 5770.1, 60 sec: 5610.5, 300 sec: 5574.2). Total num frames: 1273112576. Throughput: 0: 5880.3. Samples: 1273119324. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:27,655][25689] Avg episode reward: [(0, '1.539')] [2022-07-11 15:06:28,815][26022] Updated weights on worker 0-0, policy_version 1243278 (0.00093) [2022-07-11 15:06:30,504][26022] Updated weights on worker 0-0, policy_version 1243288 (0.00080) [2022-07-11 15:06:32,372][26022] Updated weights on worker 0-0, policy_version 1243298 (0.00090) [2022-07-11 15:06:32,668][25689] Fps is (10 sec: 5696.5, 60 sec: 5578.5, 300 sec: 5564.9). Total num frames: 1273138176. Throughput: 0: 5023.1. Samples: 1273135962. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:32,669][25689] Avg episode reward: [(0, '1.533')] [2022-07-11 15:06:34,168][26022] Updated weights on worker 0-0, policy_version 1243308 (0.00085) [2022-07-11 15:06:36,127][26022] Updated weights on worker 0-0, policy_version 1243318 (0.00095) [2022-07-11 15:06:37,711][25689] Fps is (10 sec: 5396.6, 60 sec: 5576.1, 300 sec: 5571.8). Total num frames: 1273166848. Throughput: 0: 5847.5. Samples: 1273169772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:37,712][25689] Avg episode reward: [(0, '1.844')] [2022-07-11 15:06:37,917][26022] Updated weights on worker 0-0, policy_version 1243328 (0.00072) [2022-07-11 15:06:39,778][26022] Updated weights on worker 0-0, policy_version 1243338 (0.00090) [2022-07-11 15:06:41,375][26022] Updated weights on worker 0-0, policy_version 1243348 (0.00090) [2022-07-11 15:06:42,759][25689] Fps is (10 sec: 5581.1, 60 sec: 5560.0, 300 sec: 5565.3). Total num frames: 1273194496. Throughput: 0: 5846.5. Samples: 1273203414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:42,760][25689] Avg episode reward: [(0, '0.797')] [2022-07-11 15:06:43,451][26022] Updated weights on worker 0-0, policy_version 1243358 (0.00080) [2022-07-11 15:06:45,077][26022] Updated weights on worker 0-0, policy_version 1243368 (0.00146) [2022-07-11 15:06:46,971][26022] Updated weights on worker 0-0, policy_version 1243378 (0.00083) [2022-07-11 15:06:47,785][25689] Fps is (10 sec: 5489.1, 60 sec: 5566.9, 300 sec: 5565.6). Total num frames: 1273222144. Throughput: 0: 5016.1. Samples: 1273220300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:47,785][25689] Avg episode reward: [(0, '0.687')] [2022-07-11 15:06:48,775][26022] Updated weights on worker 0-0, policy_version 1243388 (0.00089) [2022-07-11 15:06:50,510][26022] Updated weights on worker 0-0, policy_version 1243398 (0.00087) [2022-07-11 15:06:52,571][26022] Updated weights on worker 0-0, policy_version 1243408 (0.00087) [2022-07-11 15:06:52,809][25689] Fps is (10 sec: 5807.8, 60 sec: 5615.6, 300 sec: 5572.2). Total num frames: 1273252864. Throughput: 0: 5861.3. Samples: 1273254016. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:52,810][25689] Avg episode reward: [(0, '0.576')] [2022-07-11 15:06:54,320][26022] Updated weights on worker 0-0, policy_version 1243418 (0.00084) [2022-07-11 15:06:55,983][26022] Updated weights on worker 0-0, policy_version 1243428 (0.00108) [2022-07-11 15:06:57,774][26022] Updated weights on worker 0-0, policy_version 1243438 (0.00088) [2022-07-11 15:06:57,827][25689] Fps is (10 sec: 5812.3, 60 sec: 5605.1, 300 sec: 5570.5). Total num frames: 1273280512. Throughput: 0: 5880.8. Samples: 1273288070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:06:57,827][25689] Avg episode reward: [(0, '0.529')] [2022-07-11 15:06:59,705][26022] Updated weights on worker 0-0, policy_version 1243448 (0.00089) [2022-07-11 15:07:01,475][26022] Updated weights on worker 0-0, policy_version 1243458 (0.00085) [2022-07-11 15:07:02,918][25689] Fps is (10 sec: 5166.5, 60 sec: 5571.4, 300 sec: 5565.5). Total num frames: 1273305088. Throughput: 0: 5039.0. Samples: 1273304992. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:02,918][25689] Avg episode reward: [(0, '0.732')] [2022-07-11 15:07:03,789][26022] Updated weights on worker 0-0, policy_version 1243468 (0.00092) [2022-07-11 15:07:05,413][26022] Updated weights on worker 0-0, policy_version 1243478 (0.00084) [2022-07-11 15:07:07,397][26022] Updated weights on worker 0-0, policy_version 1243488 (0.00092) [2022-07-11 15:07:07,923][25689] Fps is (10 sec: 5273.8, 60 sec: 5556.7, 300 sec: 5562.5). Total num frames: 1273333760. Throughput: 0: 5781.3. Samples: 1273336728. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:07,924][25689] Avg episode reward: [(0, '1.521')] [2022-07-11 15:07:08,975][26022] Updated weights on worker 0-0, policy_version 1243498 (0.00095) [2022-07-11 15:07:10,955][26022] Updated weights on worker 0-0, policy_version 1243508 (0.00087) [2022-07-11 15:07:12,523][26022] Updated weights on worker 0-0, policy_version 1243518 (0.00079) [2022-07-11 15:07:12,927][25689] Fps is (10 sec: 5831.5, 60 sec: 5594.2, 300 sec: 5574.3). Total num frames: 1273363456. Throughput: 0: 5796.7. Samples: 1273370630. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:12,927][25689] Avg episode reward: [(0, '1.290')] [2022-07-11 15:07:14,660][26022] Updated weights on worker 0-0, policy_version 1243528 (0.00092) [2022-07-11 15:07:16,255][26022] Updated weights on worker 0-0, policy_version 1243538 (0.00084) [2022-07-11 15:07:17,948][25689] Fps is (10 sec: 5720.5, 60 sec: 5576.9, 300 sec: 5575.0). Total num frames: 1273391104. Throughput: 0: 4952.6. Samples: 1273387722. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:17,948][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 15:07:18,176][26022] Updated weights on worker 0-0, policy_version 1243548 (0.00085) [2022-07-11 15:07:20,076][26022] Updated weights on worker 0-0, policy_version 1243558 (0.00085) [2022-07-11 15:07:21,694][26022] Updated weights on worker 0-0, policy_version 1243568 (0.00095) [2022-07-11 15:07:23,026][25689] Fps is (10 sec: 5678.2, 60 sec: 5626.4, 300 sec: 5577.3). Total num frames: 1273420800. Throughput: 0: 5790.3. Samples: 1273421424. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:23,026][25689] Avg episode reward: [(0, '1.101')] [2022-07-11 15:07:23,700][26022] Updated weights on worker 0-0, policy_version 1243578 (0.00090) [2022-07-11 15:07:25,507][26022] Updated weights on worker 0-0, policy_version 1243588 (0.00082) [2022-07-11 15:07:27,417][26022] Updated weights on worker 0-0, policy_version 1243598 (0.00089) [2022-07-11 15:07:28,085][25689] Fps is (10 sec: 5757.7, 60 sec: 5574.9, 300 sec: 5573.2). Total num frames: 1273449472. Throughput: 0: 5877.3. Samples: 1273455226. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:28,087][25689] Avg episode reward: [(0, '1.149')] [2022-07-11 15:07:29,034][26022] Updated weights on worker 0-0, policy_version 1243608 (0.00085) [2022-07-11 15:07:30,910][26022] Updated weights on worker 0-0, policy_version 1243618 (0.00095) [2022-07-11 15:07:33,011][26022] Updated weights on worker 0-0, policy_version 1243628 (0.00089) [2022-07-11 15:07:33,181][25689] Fps is (10 sec: 5445.1, 60 sec: 5584.3, 300 sec: 5571.8). Total num frames: 1273476096. Throughput: 0: 5008.3. Samples: 1273472070. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:33,181][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 15:07:34,711][26022] Updated weights on worker 0-0, policy_version 1243638 (0.00090) [2022-07-11 15:07:36,412][26022] Updated weights on worker 0-0, policy_version 1243648 (0.00091) [2022-07-11 15:07:38,212][25689] Fps is (10 sec: 5561.6, 60 sec: 5602.3, 300 sec: 5575.7). Total num frames: 1273505792. Throughput: 0: 5832.5. Samples: 1273505912. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:38,214][25689] Avg episode reward: [(0, '0.862')] [2022-07-11 15:07:38,219][26022] Updated weights on worker 0-0, policy_version 1243658 (0.00093) [2022-07-11 15:07:40,072][26022] Updated weights on worker 0-0, policy_version 1243668 (0.00089) [2022-07-11 15:07:42,229][26022] Updated weights on worker 0-0, policy_version 1243678 (0.00086) [2022-07-11 15:07:43,302][25689] Fps is (10 sec: 5666.0, 60 sec: 5598.5, 300 sec: 5574.5). Total num frames: 1273533440. Throughput: 0: 5809.4. Samples: 1273539216. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:43,303][25689] Avg episode reward: [(0, '1.116')] [2022-07-11 15:07:43,790][26022] Updated weights on worker 0-0, policy_version 1243688 (0.00093) [2022-07-11 15:07:45,515][26022] Updated weights on worker 0-0, policy_version 1243698 (0.00081) [2022-07-11 15:07:47,555][26022] Updated weights on worker 0-0, policy_version 1243708 (0.00077) [2022-07-11 15:07:48,341][25689] Fps is (10 sec: 5357.8, 60 sec: 5580.2, 300 sec: 5563.5). Total num frames: 1273560064. Throughput: 0: 4983.2. Samples: 1273556166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:48,342][25689] Avg episode reward: [(0, '1.357')] [2022-07-11 15:07:49,171][26022] Updated weights on worker 0-0, policy_version 1243718 (0.00423) [2022-07-11 15:07:51,212][26022] Updated weights on worker 0-0, policy_version 1243728 (0.00085) [2022-07-11 15:07:52,824][26022] Updated weights on worker 0-0, policy_version 1243738 (0.00093) [2022-07-11 15:07:53,360][25689] Fps is (10 sec: 5599.4, 60 sec: 5563.9, 300 sec: 5577.5). Total num frames: 1273589760. Throughput: 0: 5827.1. Samples: 1273589654. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:53,361][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 15:07:54,871][26022] Updated weights on worker 0-0, policy_version 1243748 (0.00381) [2022-07-11 15:07:56,668][26022] Updated weights on worker 0-0, policy_version 1243758 (0.00088) [2022-07-11 15:07:58,373][25689] Fps is (10 sec: 5716.1, 60 sec: 5564.2, 300 sec: 5571.3). Total num frames: 1273617408. Throughput: 0: 5828.0. Samples: 1273623414. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:07:58,375][25689] Avg episode reward: [(0, '0.744')] [2022-07-11 15:07:58,402][26022] Updated weights on worker 0-0, policy_version 1243768 (0.00086) [2022-07-11 15:08:00,269][26022] Updated weights on worker 0-0, policy_version 1243778 (0.00087) [2022-07-11 15:08:02,497][26022] Updated weights on worker 0-0, policy_version 1243788 (0.00086) [2022-07-11 15:08:03,514][25689] Fps is (10 sec: 5344.6, 60 sec: 5593.4, 300 sec: 5575.7). Total num frames: 1273644032. Throughput: 0: 4994.4. Samples: 1273640166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:03,515][25689] Avg episode reward: [(0, '0.839')] [2022-07-11 15:08:04,303][26022] Updated weights on worker 0-0, policy_version 1243798 (0.00416) [2022-07-11 15:08:06,153][26022] Updated weights on worker 0-0, policy_version 1243808 (0.00087) [2022-07-11 15:08:08,231][26022] Updated weights on worker 0-0, policy_version 1243818 (0.00093) [2022-07-11 15:08:08,592][25689] Fps is (10 sec: 5411.1, 60 sec: 5586.8, 300 sec: 5574.4). Total num frames: 1273672704. Throughput: 0: 5705.4. Samples: 1273671706. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:08,593][25689] Avg episode reward: [(0, '1.094')] [2022-07-11 15:08:09,627][26022] Updated weights on worker 0-0, policy_version 1243828 (0.00085) [2022-07-11 15:08:11,827][26022] Updated weights on worker 0-0, policy_version 1243838 (0.00086) [2022-07-11 15:08:13,459][26022] Updated weights on worker 0-0, policy_version 1243848 (0.00093) [2022-07-11 15:08:13,630][25689] Fps is (10 sec: 5668.9, 60 sec: 5566.7, 300 sec: 5573.7). Total num frames: 1273701376. Throughput: 0: 5707.8. Samples: 1273705352. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:13,630][25689] Avg episode reward: [(0, '1.298')] [2022-07-11 15:08:15,359][26022] Updated weights on worker 0-0, policy_version 1243858 (0.00084) [2022-07-11 15:08:16,238][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:08:16,250][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001243865_1273717760.pth [2022-07-11 15:08:16,251][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001241903_1271708672.pth [2022-07-11 15:08:17,192][26022] Updated weights on worker 0-0, policy_version 1243868 (0.00086) [2022-07-11 15:08:18,654][25689] Fps is (10 sec: 5597.8, 60 sec: 5566.5, 300 sec: 5577.9). Total num frames: 1273729024. Throughput: 0: 4887.3. Samples: 1273722530. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:18,654][25689] Avg episode reward: [(0, '0.934')] [2022-07-11 15:08:18,753][26022] Updated weights on worker 0-0, policy_version 1243878 (0.00084) [2022-07-11 15:08:20,778][26022] Updated weights on worker 0-0, policy_version 1243888 (0.00091) [2022-07-11 15:08:22,372][26022] Updated weights on worker 0-0, policy_version 1243898 (0.00084) [2022-07-11 15:08:23,730][25689] Fps is (10 sec: 5576.1, 60 sec: 5549.8, 300 sec: 5573.2). Total num frames: 1273757696. Throughput: 0: 5751.6. Samples: 1273756440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:23,731][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 15:08:24,399][26022] Updated weights on worker 0-0, policy_version 1243908 (0.00089) [2022-07-11 15:08:26,194][26022] Updated weights on worker 0-0, policy_version 1243918 (0.00085) [2022-07-11 15:08:27,849][26022] Updated weights on worker 0-0, policy_version 1243928 (0.00092) [2022-07-11 15:08:28,743][25689] Fps is (10 sec: 5683.4, 60 sec: 5554.0, 300 sec: 5580.3). Total num frames: 1273786368. Throughput: 0: 5880.7. Samples: 1273790208. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:28,744][25689] Avg episode reward: [(0, '1.424')] [2022-07-11 15:08:29,766][26022] Updated weights on worker 0-0, policy_version 1243938 (0.00093) [2022-07-11 15:08:31,499][26022] Updated weights on worker 0-0, policy_version 1243948 (0.00085) [2022-07-11 15:08:33,388][26022] Updated weights on worker 0-0, policy_version 1243958 (0.00090) [2022-07-11 15:08:33,753][25689] Fps is (10 sec: 5721.2, 60 sec: 5595.7, 300 sec: 5583.7). Total num frames: 1273815040. Throughput: 0: 5887.4. Samples: 1273823826. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:33,754][25689] Avg episode reward: [(0, '0.582')] [2022-07-11 15:08:35,288][26022] Updated weights on worker 0-0, policy_version 1243968 (0.00589) [2022-07-11 15:08:37,099][26022] Updated weights on worker 0-0, policy_version 1243978 (0.00086) [2022-07-11 15:08:38,771][25689] Fps is (10 sec: 5514.5, 60 sec: 5546.2, 300 sec: 5579.1). Total num frames: 1273841664. Throughput: 0: 5877.6. Samples: 1273840772. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:38,771][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 15:08:38,923][26022] Updated weights on worker 0-0, policy_version 1243988 (0.00092) [2022-07-11 15:08:40,803][26022] Updated weights on worker 0-0, policy_version 1243998 (0.00087) [2022-07-11 15:08:42,465][26022] Updated weights on worker 0-0, policy_version 1244008 (0.00085) [2022-07-11 15:08:43,801][25689] Fps is (10 sec: 5401.3, 60 sec: 5551.6, 300 sec: 5575.6). Total num frames: 1273869312. Throughput: 0: 5881.8. Samples: 1273874494. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2022-07-11 15:08:43,802][25689] Avg episode reward: [(0, '0.208')] [2022-07-11 15:08:44,323][26022] Updated weights on worker 0-0, policy_version 1244018 (0.00086) [2022-07-11 15:08:46,098][26022] Updated weights on worker 0-0, policy_version 1244028 (0.00098) [2022-07-11 15:08:48,007][26022] Updated weights on worker 0-0, policy_version 1244038 (0.00319) [2022-07-11 15:08:48,828][25689] Fps is (10 sec: 5803.9, 60 sec: 5620.6, 300 sec: 5585.9). Total num frames: 1273900032. Throughput: 0: 5884.5. Samples: 1273908394. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:08:48,828][25689] Avg episode reward: [(0, '-0.529')] [2022-07-11 15:08:49,911][26022] Updated weights on worker 0-0, policy_version 1244048 (0.00097) [2022-07-11 15:08:51,787][26022] Updated weights on worker 0-0, policy_version 1244058 (0.00090) [2022-07-11 15:08:53,504][26022] Updated weights on worker 0-0, policy_version 1244068 (0.00084) [2022-07-11 15:08:53,839][25689] Fps is (10 sec: 5815.1, 60 sec: 5587.4, 300 sec: 5583.3). Total num frames: 1273927680. Throughput: 0: 5030.7. Samples: 1273924870. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:08:53,839][25689] Avg episode reward: [(0, '-0.436')] [2022-07-11 15:08:55,295][26022] Updated weights on worker 0-0, policy_version 1244078 (0.00117) [2022-07-11 15:08:57,075][26022] Updated weights on worker 0-0, policy_version 1244088 (0.00078) [2022-07-11 15:08:58,850][25689] Fps is (10 sec: 5414.8, 60 sec: 5570.7, 300 sec: 5581.6). Total num frames: 1273954304. Throughput: 0: 5858.1. Samples: 1273958400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:08:58,851][25689] Avg episode reward: [(0, '0.548')] [2022-07-11 15:08:59,092][26022] Updated weights on worker 0-0, policy_version 1244098 (0.00089) [2022-07-11 15:09:00,862][26022] Updated weights on worker 0-0, policy_version 1244108 (0.00086) [2022-07-11 15:09:02,983][26022] Updated weights on worker 0-0, policy_version 1244118 (0.00085) [2022-07-11 15:09:03,945][25689] Fps is (10 sec: 5471.3, 60 sec: 5608.8, 300 sec: 5587.9). Total num frames: 1273982976. Throughput: 0: 5747.7. Samples: 1273990276. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:03,946][25689] Avg episode reward: [(0, '0.797')] [2022-07-11 15:09:04,997][26022] Updated weights on worker 0-0, policy_version 1244128 (0.00088) [2022-07-11 15:09:06,538][26022] Updated weights on worker 0-0, policy_version 1244138 (0.00833) [2022-07-11 15:09:08,634][26022] Updated weights on worker 0-0, policy_version 1244148 (0.00082) [2022-07-11 15:09:08,990][25689] Fps is (10 sec: 5352.2, 60 sec: 5561.0, 300 sec: 5573.8). Total num frames: 1274008576. Throughput: 0: 4903.8. Samples: 1274007268. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:08,991][25689] Avg episode reward: [(0, '0.868')] [2022-07-11 15:09:10,025][26022] Updated weights on worker 0-0, policy_version 1244158 (0.00093) [2022-07-11 15:09:12,259][26022] Updated weights on worker 0-0, policy_version 1244168 (0.00088) [2022-07-11 15:09:13,956][26022] Updated weights on worker 0-0, policy_version 1244178 (0.00079) [2022-07-11 15:09:14,060][25689] Fps is (10 sec: 5467.1, 60 sec: 5575.0, 300 sec: 5586.3). Total num frames: 1274038272. Throughput: 0: 5751.8. Samples: 1274041176. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:14,060][25689] Avg episode reward: [(0, '1.240')] [2022-07-11 15:09:15,785][26022] Updated weights on worker 0-0, policy_version 1244188 (0.00082) [2022-07-11 15:09:17,309][26022] Updated weights on worker 0-0, policy_version 1244198 (0.00090) [2022-07-11 15:09:19,084][25689] Fps is (10 sec: 5782.7, 60 sec: 5591.9, 300 sec: 5581.2). Total num frames: 1274066944. Throughput: 0: 5757.3. Samples: 1274074890. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:19,084][25689] Avg episode reward: [(0, '1.255')] [2022-07-11 15:09:19,384][26022] Updated weights on worker 0-0, policy_version 1244208 (0.00090) [2022-07-11 15:09:21,156][26022] Updated weights on worker 0-0, policy_version 1244218 (0.00054) [2022-07-11 15:09:23,108][26022] Updated weights on worker 0-0, policy_version 1244228 (0.00100) [2022-07-11 15:09:24,128][25689] Fps is (10 sec: 5593.9, 60 sec: 5578.0, 300 sec: 5577.3). Total num frames: 1274094592. Throughput: 0: 5020.8. Samples: 1274091608. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:24,128][25689] Avg episode reward: [(0, '0.730')] [2022-07-11 15:09:24,732][26022] Updated weights on worker 0-0, policy_version 1244238 (0.00091) [2022-07-11 15:09:26,935][26022] Updated weights on worker 0-0, policy_version 1244248 (0.00082) [2022-07-11 15:09:28,369][26022] Updated weights on worker 0-0, policy_version 1244258 (0.00084) [2022-07-11 15:09:29,136][25689] Fps is (10 sec: 5602.7, 60 sec: 5578.4, 300 sec: 5582.4). Total num frames: 1274123264. Throughput: 0: 5854.8. Samples: 1274125218. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:29,137][25689] Avg episode reward: [(0, '0.624')] [2022-07-11 15:09:30,630][26022] Updated weights on worker 0-0, policy_version 1244268 (0.00091) [2022-07-11 15:09:32,021][26022] Updated weights on worker 0-0, policy_version 1244278 (0.00089) [2022-07-11 15:09:34,078][26022] Updated weights on worker 0-0, policy_version 1244288 (0.00109) [2022-07-11 15:09:34,159][25689] Fps is (10 sec: 5614.5, 60 sec: 5560.3, 300 sec: 5582.5). Total num frames: 1274150912. Throughput: 0: 5876.2. Samples: 1274159282. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:34,160][25689] Avg episode reward: [(0, '0.470')] [2022-07-11 15:09:35,727][26022] Updated weights on worker 0-0, policy_version 1244298 (0.00093) [2022-07-11 15:09:37,587][26022] Updated weights on worker 0-0, policy_version 1244308 (0.00085) [2022-07-11 15:09:39,181][25689] Fps is (10 sec: 5708.8, 60 sec: 5610.7, 300 sec: 5588.0). Total num frames: 1274180608. Throughput: 0: 5049.8. Samples: 1274176376. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:39,183][25689] Avg episode reward: [(0, '-0.459')] [2022-07-11 15:09:39,249][26022] Updated weights on worker 0-0, policy_version 1244318 (0.00082) [2022-07-11 15:09:41,384][26022] Updated weights on worker 0-0, policy_version 1244328 (0.00093) [2022-07-11 15:09:43,054][26022] Updated weights on worker 0-0, policy_version 1244338 (0.00088) [2022-07-11 15:09:44,281][25689] Fps is (10 sec: 5665.0, 60 sec: 5604.2, 300 sec: 5583.0). Total num frames: 1274208256. Throughput: 0: 5894.1. Samples: 1274210396. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:44,282][25689] Avg episode reward: [(0, '-1.442')] [2022-07-11 15:09:44,776][26022] Updated weights on worker 0-0, policy_version 1244348 (0.00086) [2022-07-11 15:09:46,482][26022] Updated weights on worker 0-0, policy_version 1244358 (0.00086) [2022-07-11 15:09:48,460][26022] Updated weights on worker 0-0, policy_version 1244368 (0.00087) [2022-07-11 15:09:49,289][25689] Fps is (10 sec: 5470.5, 60 sec: 5555.1, 300 sec: 5579.6). Total num frames: 1274235904. Throughput: 0: 5905.5. Samples: 1274244230. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:49,289][25689] Avg episode reward: [(0, '-0.703')] [2022-07-11 15:09:50,211][26022] Updated weights on worker 0-0, policy_version 1244378 (0.00093) [2022-07-11 15:09:52,294][26022] Updated weights on worker 0-0, policy_version 1244388 (0.00092) [2022-07-11 15:09:53,940][26022] Updated weights on worker 0-0, policy_version 1244398 (0.00087) [2022-07-11 15:09:54,300][25689] Fps is (10 sec: 5723.8, 60 sec: 5589.1, 300 sec: 5586.4). Total num frames: 1274265600. Throughput: 0: 5048.5. Samples: 1274260964. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:54,300][25689] Avg episode reward: [(0, '-0.623')] [2022-07-11 15:09:55,858][26022] Updated weights on worker 0-0, policy_version 1244408 (0.00087) [2022-07-11 15:09:57,437][26022] Updated weights on worker 0-0, policy_version 1244418 (0.00094) [2022-07-11 15:09:59,323][25689] Fps is (10 sec: 5613.0, 60 sec: 5588.0, 300 sec: 5583.5). Total num frames: 1274292224. Throughput: 0: 5877.0. Samples: 1274294750. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:09:59,323][25689] Avg episode reward: [(0, '-0.499')] [2022-07-11 15:09:59,723][26022] Updated weights on worker 0-0, policy_version 1244428 (0.00088) [2022-07-11 15:10:01,091][26022] Updated weights on worker 0-0, policy_version 1244438 (0.00086) [2022-07-11 15:10:03,431][26022] Updated weights on worker 0-0, policy_version 1244448 (0.00089) [2022-07-11 15:10:04,445][25689] Fps is (10 sec: 5349.5, 60 sec: 5568.5, 300 sec: 5584.9). Total num frames: 1274319872. Throughput: 0: 5746.3. Samples: 1274326264. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:04,446][25689] Avg episode reward: [(0, '-1.783')] [2022-07-11 15:10:05,310][26022] Updated weights on worker 0-0, policy_version 1244458 (0.00097) [2022-07-11 15:10:07,184][26022] Updated weights on worker 0-0, policy_version 1244468 (0.00087) [2022-07-11 15:10:08,823][26022] Updated weights on worker 0-0, policy_version 1244478 (0.00087) [2022-07-11 15:10:09,471][25689] Fps is (10 sec: 5550.0, 60 sec: 5621.1, 300 sec: 5585.0). Total num frames: 1274348544. Throughput: 0: 4890.5. Samples: 1274342928. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:09,471][25689] Avg episode reward: [(0, '-0.194')] [2022-07-11 15:10:11,002][26022] Updated weights on worker 0-0, policy_version 1244488 (0.00086) [2022-07-11 15:10:12,388][26022] Updated weights on worker 0-0, policy_version 1244498 (0.00093) [2022-07-11 15:10:14,488][25689] Fps is (10 sec: 5506.2, 60 sec: 5575.2, 300 sec: 5574.9). Total num frames: 1274375168. Throughput: 0: 5727.6. Samples: 1274376592. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:14,488][25689] Avg episode reward: [(0, '-0.092')] [2022-07-11 15:10:14,613][26022] Updated weights on worker 0-0, policy_version 1244508 (0.00087) [2022-07-11 15:10:16,004][26022] Updated weights on worker 0-0, policy_version 1244518 (0.00047) [2022-07-11 15:10:16,419][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:10:16,437][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001244520_1274388480.pth [2022-07-11 15:10:16,438][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001242554_1272375296.pth [2022-07-11 15:10:18,246][26022] Updated weights on worker 0-0, policy_version 1244528 (0.00087) [2022-07-11 15:10:19,496][25689] Fps is (10 sec: 5515.6, 60 sec: 5576.6, 300 sec: 5583.0). Total num frames: 1274403840. Throughput: 0: 5732.9. Samples: 1274410400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:19,497][25689] Avg episode reward: [(0, '-0.113')] [2022-07-11 15:10:19,825][26022] Updated weights on worker 0-0, policy_version 1244538 (0.00087) [2022-07-11 15:10:21,709][26022] Updated weights on worker 0-0, policy_version 1244548 (0.00094) [2022-07-11 15:10:23,463][26022] Updated weights on worker 0-0, policy_version 1244558 (0.00093) [2022-07-11 15:10:24,576][25689] Fps is (10 sec: 5785.7, 60 sec: 5607.2, 300 sec: 5582.6). Total num frames: 1274433536. Throughput: 0: 5017.8. Samples: 1274427276. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:24,576][25689] Avg episode reward: [(0, '-0.536')] [2022-07-11 15:10:25,558][26022] Updated weights on worker 0-0, policy_version 1244568 (0.00085) [2022-07-11 15:10:27,194][26022] Updated weights on worker 0-0, policy_version 1244578 (0.00086) [2022-07-11 15:10:29,260][26022] Updated weights on worker 0-0, policy_version 1244588 (0.00084) [2022-07-11 15:10:29,600][25689] Fps is (10 sec: 5573.9, 60 sec: 5571.9, 300 sec: 5579.4). Total num frames: 1274460160. Throughput: 0: 5838.1. Samples: 1274460446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:29,601][25689] Avg episode reward: [(0, '-0.351')] [2022-07-11 15:10:30,752][26022] Updated weights on worker 0-0, policy_version 1244598 (0.00084) [2022-07-11 15:10:32,895][26022] Updated weights on worker 0-0, policy_version 1244608 (0.00076) [2022-07-11 15:10:34,456][26022] Updated weights on worker 0-0, policy_version 1244618 (0.00087) [2022-07-11 15:10:34,626][25689] Fps is (10 sec: 5501.7, 60 sec: 5588.4, 300 sec: 5579.2). Total num frames: 1274488832. Throughput: 0: 5822.1. Samples: 1274493842. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:34,627][25689] Avg episode reward: [(0, '1.097')] [2022-07-11 15:10:36,395][26022] Updated weights on worker 0-0, policy_version 1244628 (0.00094) [2022-07-11 15:10:38,363][26022] Updated weights on worker 0-0, policy_version 1244638 (0.00087) [2022-07-11 15:10:39,634][25689] Fps is (10 sec: 5613.2, 60 sec: 5556.0, 300 sec: 5576.7). Total num frames: 1274516480. Throughput: 0: 4981.6. Samples: 1274510718. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:39,634][25689] Avg episode reward: [(0, '1.127')] [2022-07-11 15:10:40,107][26022] Updated weights on worker 0-0, policy_version 1244648 (0.00091) [2022-07-11 15:10:41,928][26022] Updated weights on worker 0-0, policy_version 1244658 (0.00102) [2022-07-11 15:10:43,863][26022] Updated weights on worker 0-0, policy_version 1244668 (0.00084) [2022-07-11 15:10:44,698][25689] Fps is (10 sec: 5591.8, 60 sec: 5576.2, 300 sec: 5580.8). Total num frames: 1274545152. Throughput: 0: 5821.3. Samples: 1274544414. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:44,699][25689] Avg episode reward: [(0, '-1.215')] [2022-07-11 15:10:45,439][26022] Updated weights on worker 0-0, policy_version 1244678 (0.00085) [2022-07-11 15:10:47,488][26022] Updated weights on worker 0-0, policy_version 1244688 (0.00087) [2022-07-11 15:10:49,094][26022] Updated weights on worker 0-0, policy_version 1244698 (0.00085) [2022-07-11 15:10:49,700][25689] Fps is (10 sec: 5696.7, 60 sec: 5593.7, 300 sec: 5584.3). Total num frames: 1274573824. Throughput: 0: 5873.0. Samples: 1274578492. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:49,702][25689] Avg episode reward: [(0, '-0.979')] [2022-07-11 15:10:51,120][26022] Updated weights on worker 0-0, policy_version 1244708 (0.00089) [2022-07-11 15:10:52,757][26022] Updated weights on worker 0-0, policy_version 1244718 (0.00095) [2022-07-11 15:10:54,699][26022] Updated weights on worker 0-0, policy_version 1244728 (0.00089) [2022-07-11 15:10:54,706][25689] Fps is (10 sec: 5627.5, 60 sec: 5560.2, 300 sec: 5582.3). Total num frames: 1274601472. Throughput: 0: 5048.9. Samples: 1274595224. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:54,708][25689] Avg episode reward: [(0, '-0.888')] [2022-07-11 15:10:56,489][26022] Updated weights on worker 0-0, policy_version 1244738 (0.00083) [2022-07-11 15:10:58,514][26022] Updated weights on worker 0-0, policy_version 1244748 (0.00088) [2022-07-11 15:10:59,724][25689] Fps is (10 sec: 5618.2, 60 sec: 5594.5, 300 sec: 5590.6). Total num frames: 1274630144. Throughput: 0: 5869.2. Samples: 1274628636. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:10:59,725][25689] Avg episode reward: [(0, '-0.523')] [2022-07-11 15:11:00,014][26022] Updated weights on worker 0-0, policy_version 1244758 (0.00095) [2022-07-11 15:11:02,517][26022] Updated weights on worker 0-0, policy_version 1244768 (0.00082) [2022-07-11 15:11:04,411][26022] Updated weights on worker 0-0, policy_version 1244778 (0.00085) [2022-07-11 15:11:04,770][25689] Fps is (10 sec: 5291.2, 60 sec: 5550.7, 300 sec: 5573.1). Total num frames: 1274654720. Throughput: 0: 5769.2. Samples: 1274660212. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:04,770][25689] Avg episode reward: [(0, '-0.596')] [2022-07-11 15:11:05,964][26022] Updated weights on worker 0-0, policy_version 1244788 (0.00084) [2022-07-11 15:11:07,936][26022] Updated weights on worker 0-0, policy_version 1244798 (0.00088) [2022-07-11 15:11:09,637][26022] Updated weights on worker 0-0, policy_version 1244808 (0.00085) [2022-07-11 15:11:09,787][25689] Fps is (10 sec: 5291.9, 60 sec: 5551.6, 300 sec: 5577.0). Total num frames: 1274683392. Throughput: 0: 4909.5. Samples: 1274677108. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:09,787][25689] Avg episode reward: [(0, '-0.469')] [2022-07-11 15:11:11,377][26022] Updated weights on worker 0-0, policy_version 1244818 (0.00088) [2022-07-11 15:11:13,439][26022] Updated weights on worker 0-0, policy_version 1244828 (0.00086) [2022-07-11 15:11:14,811][25689] Fps is (10 sec: 5711.1, 60 sec: 5584.9, 300 sec: 5576.9). Total num frames: 1274712064. Throughput: 0: 5744.5. Samples: 1274710714. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:14,811][25689] Avg episode reward: [(0, '1.006')] [2022-07-11 15:11:14,952][26022] Updated weights on worker 0-0, policy_version 1244838 (0.00082) [2022-07-11 15:11:17,016][26022] Updated weights on worker 0-0, policy_version 1244848 (0.00088) [2022-07-11 15:11:18,871][26022] Updated weights on worker 0-0, policy_version 1244858 (0.00106) [2022-07-11 15:11:19,828][25689] Fps is (10 sec: 5608.9, 60 sec: 5567.1, 300 sec: 5581.2). Total num frames: 1274739712. Throughput: 0: 5781.7. Samples: 1274744868. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:19,828][25689] Avg episode reward: [(0, '1.277')] [2022-07-11 15:11:20,553][26022] Updated weights on worker 0-0, policy_version 1244868 (0.00089) [2022-07-11 15:11:22,525][26022] Updated weights on worker 0-0, policy_version 1244878 (0.00090) [2022-07-11 15:11:24,265][26022] Updated weights on worker 0-0, policy_version 1244888 (0.00090) [2022-07-11 15:11:24,955][25689] Fps is (10 sec: 5552.0, 60 sec: 5545.8, 300 sec: 5569.5). Total num frames: 1274768384. Throughput: 0: 5022.7. Samples: 1274761594. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:24,955][25689] Avg episode reward: [(0, '1.262')] [2022-07-11 15:11:26,206][26022] Updated weights on worker 0-0, policy_version 1244898 (0.00080) [2022-07-11 15:11:27,942][26022] Updated weights on worker 0-0, policy_version 1244908 (0.00091) [2022-07-11 15:11:29,920][26022] Updated weights on worker 0-0, policy_version 1244918 (0.00085) [2022-07-11 15:11:29,957][25689] Fps is (10 sec: 5560.3, 60 sec: 5564.8, 300 sec: 5576.6). Total num frames: 1274796032. Throughput: 0: 5841.8. Samples: 1274794938. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:29,957][25689] Avg episode reward: [(0, '1.339')] [2022-07-11 15:11:31,491][26022] Updated weights on worker 0-0, policy_version 1244928 (0.00088) [2022-07-11 15:11:33,475][26022] Updated weights on worker 0-0, policy_version 1244938 (0.00087) [2022-07-11 15:11:34,974][25689] Fps is (10 sec: 5723.6, 60 sec: 5582.6, 300 sec: 5580.5). Total num frames: 1274825728. Throughput: 0: 5852.0. Samples: 1274828706. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:34,974][25689] Avg episode reward: [(0, '1.389')] [2022-07-11 15:11:35,297][26022] Updated weights on worker 0-0, policy_version 1244948 (0.00087) [2022-07-11 15:11:37,067][26022] Updated weights on worker 0-0, policy_version 1244958 (0.00083) [2022-07-11 15:11:39,004][26022] Updated weights on worker 0-0, policy_version 1244968 (0.00090) [2022-07-11 15:11:40,004][25689] Fps is (10 sec: 5605.5, 60 sec: 5563.5, 300 sec: 5577.4). Total num frames: 1274852352. Throughput: 0: 4996.7. Samples: 1274845682. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:40,004][25689] Avg episode reward: [(0, '0.885')] [2022-07-11 15:11:40,592][26022] Updated weights on worker 0-0, policy_version 1244978 (0.00090) [2022-07-11 15:11:42,859][26022] Updated weights on worker 0-0, policy_version 1244988 (0.00087) [2022-07-11 15:11:44,371][26022] Updated weights on worker 0-0, policy_version 1244998 (0.00084) [2022-07-11 15:11:45,049][25689] Fps is (10 sec: 5386.8, 60 sec: 5548.4, 300 sec: 5577.0). Total num frames: 1274880000. Throughput: 0: 5852.4. Samples: 1274879192. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:45,049][25689] Avg episode reward: [(0, '1.413')] [2022-07-11 15:11:46,251][26022] Updated weights on worker 0-0, policy_version 1245008 (0.00093) [2022-07-11 15:11:48,147][26022] Updated weights on worker 0-0, policy_version 1245018 (0.00081) [2022-07-11 15:11:49,699][26022] Updated weights on worker 0-0, policy_version 1245028 (0.00082) [2022-07-11 15:11:50,116][25689] Fps is (10 sec: 5772.1, 60 sec: 5576.3, 300 sec: 5576.2). Total num frames: 1274910720. Throughput: 0: 5857.4. Samples: 1274913020. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:50,117][25689] Avg episode reward: [(0, '1.131')] [2022-07-11 15:11:51,965][26022] Updated weights on worker 0-0, policy_version 1245038 (0.00053) [2022-07-11 15:11:53,402][26022] Updated weights on worker 0-0, policy_version 1245048 (0.00084) [2022-07-11 15:11:55,135][25689] Fps is (10 sec: 5685.1, 60 sec: 5558.1, 300 sec: 5572.8). Total num frames: 1274937344. Throughput: 0: 5855.6. Samples: 1274946766. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:11:55,136][25689] Avg episode reward: [(0, '0.954')] [2022-07-11 15:11:55,418][26022] Updated weights on worker 0-0, policy_version 1245058 (0.00091) [2022-07-11 15:11:57,264][26022] Updated weights on worker 0-0, policy_version 1245068 (0.00092) [2022-07-11 15:11:58,991][26022] Updated weights on worker 0-0, policy_version 1245078 (0.00087) [2022-07-11 15:12:00,163][25689] Fps is (10 sec: 5401.8, 60 sec: 5540.3, 300 sec: 5584.3). Total num frames: 1274964992. Throughput: 0: 5841.7. Samples: 1274963446. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:00,164][25689] Avg episode reward: [(0, '0.655')] [2022-07-11 15:12:00,980][26022] Updated weights on worker 0-0, policy_version 1245088 (0.00086) [2022-07-11 15:12:02,882][26022] Updated weights on worker 0-0, policy_version 1245098 (0.00087) [2022-07-11 15:12:04,904][26022] Updated weights on worker 0-0, policy_version 1245108 (0.00091) [2022-07-11 15:12:05,270][25689] Fps is (10 sec: 5355.3, 60 sec: 5568.6, 300 sec: 5575.5). Total num frames: 1274991616. Throughput: 0: 5734.2. Samples: 1274995142. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:05,272][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 15:12:06,835][26022] Updated weights on worker 0-0, policy_version 1245118 (0.00083) [2022-07-11 15:12:08,621][26022] Updated weights on worker 0-0, policy_version 1245128 (0.00087) [2022-07-11 15:12:10,283][25689] Fps is (10 sec: 5464.1, 60 sec: 5568.9, 300 sec: 5571.9). Total num frames: 1275020288. Throughput: 0: 5742.8. Samples: 1275028834. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:10,283][25689] Avg episode reward: [(0, '0.461')] [2022-07-11 15:12:10,331][26022] Updated weights on worker 0-0, policy_version 1245138 (0.00082) [2022-07-11 15:12:12,265][26022] Updated weights on worker 0-0, policy_version 1245148 (0.00087) [2022-07-11 15:12:13,949][26022] Updated weights on worker 0-0, policy_version 1245158 (0.00091) [2022-07-11 15:12:15,328][25689] Fps is (10 sec: 5599.4, 60 sec: 5550.0, 300 sec: 5571.4). Total num frames: 1275047936. Throughput: 0: 4907.8. Samples: 1275045866. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:15,328][25689] Avg episode reward: [(0, '0.632')] [2022-07-11 15:12:15,941][26022] Updated weights on worker 0-0, policy_version 1245168 (0.00086) [2022-07-11 15:12:16,651][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:12:16,678][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001245173_1275057152.pth [2022-07-11 15:12:16,679][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001243209_1273046016.pth [2022-07-11 15:12:17,671][26022] Updated weights on worker 0-0, policy_version 1245178 (0.00079) [2022-07-11 15:12:19,335][26022] Updated weights on worker 0-0, policy_version 1245188 (0.00105) [2022-07-11 15:12:20,357][25689] Fps is (10 sec: 5692.2, 60 sec: 5582.8, 300 sec: 5572.3). Total num frames: 1275077632. Throughput: 0: 5760.1. Samples: 1275079766. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:20,358][25689] Avg episode reward: [(0, '1.087')] [2022-07-11 15:12:21,455][26022] Updated weights on worker 0-0, policy_version 1245198 (0.00548) [2022-07-11 15:12:23,087][26022] Updated weights on worker 0-0, policy_version 1245208 (0.00082) [2022-07-11 15:12:24,999][26022] Updated weights on worker 0-0, policy_version 1245218 (0.00083) [2022-07-11 15:12:25,430][25689] Fps is (10 sec: 5777.6, 60 sec: 5587.7, 300 sec: 5572.1). Total num frames: 1275106304. Throughput: 0: 5870.2. Samples: 1275113490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:25,431][25689] Avg episode reward: [(0, '1.313')] [2022-07-11 15:12:26,744][26022] Updated weights on worker 0-0, policy_version 1245228 (0.00085) [2022-07-11 15:12:28,544][26022] Updated weights on worker 0-0, policy_version 1245238 (0.00086) [2022-07-11 15:12:30,481][25689] Fps is (10 sec: 5461.7, 60 sec: 5566.3, 300 sec: 5572.9). Total num frames: 1275132928. Throughput: 0: 5018.4. Samples: 1275130198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:30,482][25689] Avg episode reward: [(0, '1.985')] [2022-07-11 15:12:30,558][26022] Updated weights on worker 0-0, policy_version 1245248 (0.00092) [2022-07-11 15:12:32,275][26022] Updated weights on worker 0-0, policy_version 1245258 (0.00098) [2022-07-11 15:12:34,103][26022] Updated weights on worker 0-0, policy_version 1245268 (0.00087) [2022-07-11 15:12:35,523][25689] Fps is (10 sec: 5478.9, 60 sec: 5547.1, 300 sec: 5569.3). Total num frames: 1275161600. Throughput: 0: 5843.9. Samples: 1275163884. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:35,528][25689] Avg episode reward: [(0, '1.955')] [2022-07-11 15:12:35,872][26022] Updated weights on worker 0-0, policy_version 1245278 (0.00084) [2022-07-11 15:12:37,575][26022] Updated weights on worker 0-0, policy_version 1245288 (0.00089) [2022-07-11 15:12:39,609][26022] Updated weights on worker 0-0, policy_version 1245298 (0.00090) [2022-07-11 15:12:40,543][25689] Fps is (10 sec: 5699.4, 60 sec: 5581.9, 300 sec: 5574.0). Total num frames: 1275190272. Throughput: 0: 5838.5. Samples: 1275197620. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2022-07-11 15:12:40,543][25689] Avg episode reward: [(0, '1.529')] [2022-07-11 15:12:41,411][26022] Updated weights on worker 0-0, policy_version 1245308 (0.00098) [2022-07-11 15:12:43,103][26022] Updated weights on worker 0-0, policy_version 1245318 (0.00085) [2022-07-11 15:12:45,268][26022] Updated weights on worker 0-0, policy_version 1245328 (0.00086) [2022-07-11 15:12:45,615][25689] Fps is (10 sec: 5580.7, 60 sec: 5579.4, 300 sec: 5576.9). Total num frames: 1275217920. Throughput: 0: 5001.5. Samples: 1275214442. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:12:45,615][25689] Avg episode reward: [(0, '1.474')] [2022-07-11 15:12:46,735][26022] Updated weights on worker 0-0, policy_version 1245338 (0.00083) [2022-07-11 15:12:48,788][26022] Updated weights on worker 0-0, policy_version 1245348 (0.00353) [2022-07-11 15:12:50,571][26022] Updated weights on worker 0-0, policy_version 1245358 (0.00095) [2022-07-11 15:12:50,659][25689] Fps is (10 sec: 5567.4, 60 sec: 5547.7, 300 sec: 5572.9). Total num frames: 1275246592. Throughput: 0: 5837.4. Samples: 1275247982. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:12:50,659][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 15:12:52,391][26022] Updated weights on worker 0-0, policy_version 1245368 (0.00098) [2022-07-11 15:12:54,384][26022] Updated weights on worker 0-0, policy_version 1245378 (0.00084) [2022-07-11 15:12:55,700][25689] Fps is (10 sec: 5584.4, 60 sec: 5562.6, 300 sec: 5572.4). Total num frames: 1275274240. Throughput: 0: 5831.8. Samples: 1275281554. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:12:55,700][25689] Avg episode reward: [(0, '0.668')] [2022-07-11 15:12:56,136][26022] Updated weights on worker 0-0, policy_version 1245388 (0.00092) [2022-07-11 15:12:57,810][26022] Updated weights on worker 0-0, policy_version 1245398 (0.00089) [2022-07-11 15:12:59,835][26022] Updated weights on worker 0-0, policy_version 1245408 (0.00095) [2022-07-11 15:13:00,731][25689] Fps is (10 sec: 5693.2, 60 sec: 5596.1, 300 sec: 5584.8). Total num frames: 1275303936. Throughput: 0: 5006.0. Samples: 1275298682. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:00,732][25689] Avg episode reward: [(0, '0.612')] [2022-07-11 15:13:01,878][26022] Updated weights on worker 0-0, policy_version 1245418 (0.00088) [2022-07-11 15:13:03,819][26022] Updated weights on worker 0-0, policy_version 1245428 (0.00082) [2022-07-11 15:13:05,559][26022] Updated weights on worker 0-0, policy_version 1245438 (0.00091) [2022-07-11 15:13:05,839][25689] Fps is (10 sec: 5454.0, 60 sec: 5579.1, 300 sec: 5573.9). Total num frames: 1275329536. Throughput: 0: 5720.5. Samples: 1275330132. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:05,845][25689] Avg episode reward: [(0, '0.557')] [2022-07-11 15:13:07,478][26022] Updated weights on worker 0-0, policy_version 1245448 (0.00083) [2022-07-11 15:13:09,361][26022] Updated weights on worker 0-0, policy_version 1245458 (0.00107) [2022-07-11 15:13:10,871][25689] Fps is (10 sec: 5150.5, 60 sec: 5543.5, 300 sec: 5567.1). Total num frames: 1275356160. Throughput: 0: 5736.4. Samples: 1275363926. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:10,871][25689] Avg episode reward: [(0, '0.059')] [2022-07-11 15:13:11,166][26022] Updated weights on worker 0-0, policy_version 1245468 (0.00084) [2022-07-11 15:13:12,780][26022] Updated weights on worker 0-0, policy_version 1245478 (0.00091) [2022-07-11 15:13:14,776][26022] Updated weights on worker 0-0, policy_version 1245488 (0.00085) [2022-07-11 15:13:15,886][25689] Fps is (10 sec: 5707.2, 60 sec: 5597.0, 300 sec: 5577.6). Total num frames: 1275386880. Throughput: 0: 4916.1. Samples: 1275380792. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:15,888][25689] Avg episode reward: [(0, '0.345')] [2022-07-11 15:13:16,410][26022] Updated weights on worker 0-0, policy_version 1245498 (0.00089) [2022-07-11 15:13:18,337][26022] Updated weights on worker 0-0, policy_version 1245508 (0.00092) [2022-07-11 15:13:20,070][26022] Updated weights on worker 0-0, policy_version 1245518 (0.00085) [2022-07-11 15:13:20,921][25689] Fps is (10 sec: 5706.0, 60 sec: 5545.8, 300 sec: 5571.5). Total num frames: 1275413504. Throughput: 0: 5746.0. Samples: 1275414690. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:20,921][25689] Avg episode reward: [(0, '1.033')] [2022-07-11 15:13:22,000][26022] Updated weights on worker 0-0, policy_version 1245528 (0.00087) [2022-07-11 15:13:23,845][26022] Updated weights on worker 0-0, policy_version 1245538 (0.00094) [2022-07-11 15:13:25,717][26022] Updated weights on worker 0-0, policy_version 1245548 (0.00095) [2022-07-11 15:13:26,050][25689] Fps is (10 sec: 5541.3, 60 sec: 5557.6, 300 sec: 5572.8). Total num frames: 1275443200. Throughput: 0: 5858.2. Samples: 1275448534. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:26,050][25689] Avg episode reward: [(0, '0.905')] [2022-07-11 15:13:27,453][26022] Updated weights on worker 0-0, policy_version 1245558 (0.00092) [2022-07-11 15:13:29,455][26022] Updated weights on worker 0-0, policy_version 1245568 (0.00857) [2022-07-11 15:13:31,082][25689] Fps is (10 sec: 5643.5, 60 sec: 5576.2, 300 sec: 5569.0). Total num frames: 1275470848. Throughput: 0: 5018.5. Samples: 1275465354. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:31,082][25689] Avg episode reward: [(0, '0.950')] [2022-07-11 15:13:31,115][26022] Updated weights on worker 0-0, policy_version 1245578 (0.00089) [2022-07-11 15:13:33,065][26022] Updated weights on worker 0-0, policy_version 1245588 (0.00087) [2022-07-11 15:13:34,712][26022] Updated weights on worker 0-0, policy_version 1245598 (0.00088) [2022-07-11 15:13:36,087][25689] Fps is (10 sec: 5406.9, 60 sec: 5545.7, 300 sec: 5569.2). Total num frames: 1275497472. Throughput: 0: 5856.4. Samples: 1275499098. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:36,088][25689] Avg episode reward: [(0, '0.816')] [2022-07-11 15:13:36,494][26022] Updated weights on worker 0-0, policy_version 1245608 (0.00083) [2022-07-11 15:13:38,547][26022] Updated weights on worker 0-0, policy_version 1245618 (0.00086) [2022-07-11 15:13:40,203][26022] Updated weights on worker 0-0, policy_version 1245628 (0.00092) [2022-07-11 15:13:41,167][25689] Fps is (10 sec: 5787.7, 60 sec: 5590.9, 300 sec: 5582.0). Total num frames: 1275529216. Throughput: 0: 5845.6. Samples: 1275533040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:41,167][25689] Avg episode reward: [(0, '0.551')] [2022-07-11 15:13:42,220][26022] Updated weights on worker 0-0, policy_version 1245638 (0.00080) [2022-07-11 15:13:43,584][26022] Updated weights on worker 0-0, policy_version 1245648 (0.00084) [2022-07-11 15:13:45,704][26022] Updated weights on worker 0-0, policy_version 1245658 (0.00092) [2022-07-11 15:13:46,219][25689] Fps is (10 sec: 5862.2, 60 sec: 5592.8, 300 sec: 5571.2). Total num frames: 1275556864. Throughput: 0: 5048.3. Samples: 1275550352. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:46,219][25689] Avg episode reward: [(0, '0.079')] [2022-07-11 15:13:47,284][26022] Updated weights on worker 0-0, policy_version 1245668 (0.00084) [2022-07-11 15:13:49,130][26022] Updated weights on worker 0-0, policy_version 1245678 (0.00083) [2022-07-11 15:13:50,880][26022] Updated weights on worker 0-0, policy_version 1245688 (0.00092) [2022-07-11 15:13:51,222][25689] Fps is (10 sec: 5499.3, 60 sec: 5579.7, 300 sec: 5571.4). Total num frames: 1275584512. Throughput: 0: 5900.6. Samples: 1275584192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:51,222][25689] Avg episode reward: [(0, '0.258')] [2022-07-11 15:13:52,871][26022] Updated weights on worker 0-0, policy_version 1245698 (0.00090) [2022-07-11 15:13:54,563][26022] Updated weights on worker 0-0, policy_version 1245708 (0.00084) [2022-07-11 15:13:56,236][25689] Fps is (10 sec: 5622.2, 60 sec: 5599.0, 300 sec: 5578.2). Total num frames: 1275613184. Throughput: 0: 5880.2. Samples: 1275617576. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:13:56,237][25689] Avg episode reward: [(0, '0.004')] [2022-07-11 15:13:56,593][26022] Updated weights on worker 0-0, policy_version 1245718 (0.00080) [2022-07-11 15:13:58,348][26022] Updated weights on worker 0-0, policy_version 1245728 (0.00085) [2022-07-11 15:14:00,334][26022] Updated weights on worker 0-0, policy_version 1245738 (0.00101) [2022-07-11 15:14:01,242][25689] Fps is (10 sec: 5620.8, 60 sec: 5567.6, 300 sec: 5576.4). Total num frames: 1275640832. Throughput: 0: 5057.8. Samples: 1275634576. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:01,242][25689] Avg episode reward: [(0, '0.232')] [2022-07-11 15:14:02,299][26022] Updated weights on worker 0-0, policy_version 1245748 (0.00090) [2022-07-11 15:14:04,139][26022] Updated weights on worker 0-0, policy_version 1245758 (0.00081) [2022-07-11 15:14:06,062][26022] Updated weights on worker 0-0, policy_version 1245768 (0.01301) [2022-07-11 15:14:06,279][25689] Fps is (10 sec: 5403.9, 60 sec: 5590.9, 300 sec: 5580.0). Total num frames: 1275667456. Throughput: 0: 5774.7. Samples: 1275666196. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:06,280][25689] Avg episode reward: [(0, '0.224')] [2022-07-11 15:14:07,921][26022] Updated weights on worker 0-0, policy_version 1245778 (0.00088) [2022-07-11 15:14:09,611][26022] Updated weights on worker 0-0, policy_version 1245788 (0.00085) [2022-07-11 15:14:11,315][25689] Fps is (10 sec: 5387.9, 60 sec: 5607.6, 300 sec: 5573.8). Total num frames: 1275695104. Throughput: 0: 5751.3. Samples: 1275699752. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:11,315][25689] Avg episode reward: [(0, '0.220')] [2022-07-11 15:14:11,714][26022] Updated weights on worker 0-0, policy_version 1245798 (0.00082) [2022-07-11 15:14:13,262][26022] Updated weights on worker 0-0, policy_version 1245808 (0.00084) [2022-07-11 15:14:15,319][26022] Updated weights on worker 0-0, policy_version 1245818 (0.00083) [2022-07-11 15:14:16,320][25689] Fps is (10 sec: 5507.3, 60 sec: 5557.7, 300 sec: 5570.7). Total num frames: 1275722752. Throughput: 0: 4923.5. Samples: 1275716458. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:16,320][25689] Avg episode reward: [(0, '1.667')] [2022-07-11 15:14:16,898][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:14:16,907][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001245827_1275726848.pth [2022-07-11 15:14:16,907][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001243865_1273717760.pth [2022-07-11 15:14:17,086][26022] Updated weights on worker 0-0, policy_version 1245828 (0.00090) [2022-07-11 15:14:18,626][26022] Updated weights on worker 0-0, policy_version 1245838 (0.00089) [2022-07-11 15:14:20,652][26022] Updated weights on worker 0-0, policy_version 1245848 (0.00087) [2022-07-11 15:14:21,374][25689] Fps is (10 sec: 5802.1, 60 sec: 5623.6, 300 sec: 5580.8). Total num frames: 1275753472. Throughput: 0: 5753.9. Samples: 1275750418. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:21,375][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 15:14:22,526][26022] Updated weights on worker 0-0, policy_version 1245858 (0.00096) [2022-07-11 15:14:24,247][26022] Updated weights on worker 0-0, policy_version 1245868 (0.00090) [2022-07-11 15:14:26,390][26022] Updated weights on worker 0-0, policy_version 1245878 (0.00091) [2022-07-11 15:14:26,487][25689] Fps is (10 sec: 5539.4, 60 sec: 5557.4, 300 sec: 5568.5). Total num frames: 1275779072. Throughput: 0: 5833.1. Samples: 1275784068. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:26,487][25689] Avg episode reward: [(0, '1.831')] [2022-07-11 15:14:27,715][26022] Updated weights on worker 0-0, policy_version 1245888 (0.00093) [2022-07-11 15:14:29,987][26022] Updated weights on worker 0-0, policy_version 1245898 (0.00093) [2022-07-11 15:14:31,437][26022] Updated weights on worker 0-0, policy_version 1245908 (0.00086) [2022-07-11 15:14:31,490][25689] Fps is (10 sec: 5567.8, 60 sec: 5610.9, 300 sec: 5579.2). Total num frames: 1275809792. Throughput: 0: 5846.1. Samples: 1275817698. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:31,491][25689] Avg episode reward: [(0, '1.415')] [2022-07-11 15:14:33,456][26022] Updated weights on worker 0-0, policy_version 1245918 (0.00083) [2022-07-11 15:14:35,340][26022] Updated weights on worker 0-0, policy_version 1245928 (0.00081) [2022-07-11 15:14:36,531][25689] Fps is (10 sec: 5709.3, 60 sec: 5607.6, 300 sec: 5568.6). Total num frames: 1275836416. Throughput: 0: 5850.1. Samples: 1275834694. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:36,532][25689] Avg episode reward: [(0, '1.104')] [2022-07-11 15:14:37,038][26022] Updated weights on worker 0-0, policy_version 1245938 (0.00096) [2022-07-11 15:14:39,008][26022] Updated weights on worker 0-0, policy_version 1245948 (0.00061) [2022-07-11 15:14:40,709][26022] Updated weights on worker 0-0, policy_version 1245958 (0.00090) [2022-07-11 15:14:41,559][25689] Fps is (10 sec: 5389.8, 60 sec: 5544.5, 300 sec: 5569.9). Total num frames: 1275864064. Throughput: 0: 5851.7. Samples: 1275868534. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:41,561][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 15:14:42,571][26022] Updated weights on worker 0-0, policy_version 1245968 (0.00086) [2022-07-11 15:14:44,464][26022] Updated weights on worker 0-0, policy_version 1245978 (0.00090) [2022-07-11 15:14:46,272][26022] Updated weights on worker 0-0, policy_version 1245988 (0.00096) [2022-07-11 15:14:46,619][25689] Fps is (10 sec: 5684.0, 60 sec: 5577.7, 300 sec: 5575.8). Total num frames: 1275893760. Throughput: 0: 5854.9. Samples: 1275901942. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:46,620][25689] Avg episode reward: [(0, '-0.320')] [2022-07-11 15:14:48,032][26022] Updated weights on worker 0-0, policy_version 1245998 (0.00087) [2022-07-11 15:14:49,910][26022] Updated weights on worker 0-0, policy_version 1246008 (0.00095) [2022-07-11 15:14:51,695][25689] Fps is (10 sec: 5657.8, 60 sec: 5571.0, 300 sec: 5567.7). Total num frames: 1275921408. Throughput: 0: 5000.9. Samples: 1275918740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:51,695][25689] Avg episode reward: [(0, '-0.086')] [2022-07-11 15:14:51,762][26022] Updated weights on worker 0-0, policy_version 1246018 (0.00097) [2022-07-11 15:14:53,419][26022] Updated weights on worker 0-0, policy_version 1246028 (0.00085) [2022-07-11 15:14:55,316][26022] Updated weights on worker 0-0, policy_version 1246038 (0.00093) [2022-07-11 15:14:56,727][25689] Fps is (10 sec: 5673.5, 60 sec: 5586.3, 300 sec: 5577.9). Total num frames: 1275951104. Throughput: 0: 5841.8. Samples: 1275952676. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:14:56,727][25689] Avg episode reward: [(0, '-0.236')] [2022-07-11 15:14:56,975][26022] Updated weights on worker 0-0, policy_version 1246048 (0.00086) [2022-07-11 15:14:58,967][26022] Updated weights on worker 0-0, policy_version 1246058 (0.00084) [2022-07-11 15:15:00,823][26022] Updated weights on worker 0-0, policy_version 1246068 (0.00087) [2022-07-11 15:15:01,759][25689] Fps is (10 sec: 5392.6, 60 sec: 5533.1, 300 sec: 5569.2). Total num frames: 1275975680. Throughput: 0: 5797.1. Samples: 1275985632. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:01,759][25689] Avg episode reward: [(0, '0.147')] [2022-07-11 15:15:02,996][26022] Updated weights on worker 0-0, policy_version 1246078 (0.00083) [2022-07-11 15:15:04,906][26022] Updated weights on worker 0-0, policy_version 1246088 (0.00083) [2022-07-11 15:15:06,747][26022] Updated weights on worker 0-0, policy_version 1246098 (0.00094) [2022-07-11 15:15:06,816][25689] Fps is (10 sec: 5379.2, 60 sec: 5582.1, 300 sec: 5572.1). Total num frames: 1276005376. Throughput: 0: 4914.2. Samples: 1276001192. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:06,816][25689] Avg episode reward: [(0, '0.811')] [2022-07-11 15:15:08,313][26022] Updated weights on worker 0-0, policy_version 1246108 (0.00086) [2022-07-11 15:15:10,481][26022] Updated weights on worker 0-0, policy_version 1246118 (0.00092) [2022-07-11 15:15:11,896][25689] Fps is (10 sec: 5757.6, 60 sec: 5594.8, 300 sec: 5577.8). Total num frames: 1276034048. Throughput: 0: 5739.4. Samples: 1276034686. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:11,897][25689] Avg episode reward: [(0, '0.898')] [2022-07-11 15:15:11,974][26022] Updated weights on worker 0-0, policy_version 1246128 (0.00086) [2022-07-11 15:15:14,128][26022] Updated weights on worker 0-0, policy_version 1246138 (0.00093) [2022-07-11 15:15:15,867][26022] Updated weights on worker 0-0, policy_version 1246148 (0.00101) [2022-07-11 15:15:16,966][25689] Fps is (10 sec: 5347.1, 60 sec: 5555.1, 300 sec: 5566.3). Total num frames: 1276059648. Throughput: 0: 5703.4. Samples: 1276068108. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:16,966][25689] Avg episode reward: [(0, '1.214')] [2022-07-11 15:15:17,586][26022] Updated weights on worker 0-0, policy_version 1246158 (0.00087) [2022-07-11 15:15:19,517][26022] Updated weights on worker 0-0, policy_version 1246168 (0.00091) [2022-07-11 15:15:21,272][26022] Updated weights on worker 0-0, policy_version 1246178 (0.00099) [2022-07-11 15:15:22,039][25689] Fps is (10 sec: 5552.8, 60 sec: 5553.4, 300 sec: 5569.9). Total num frames: 1276090368. Throughput: 0: 4895.8. Samples: 1276084920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:22,040][25689] Avg episode reward: [(0, '1.207')] [2022-07-11 15:15:23,235][26022] Updated weights on worker 0-0, policy_version 1246188 (0.00088) [2022-07-11 15:15:24,927][26022] Updated weights on worker 0-0, policy_version 1246198 (0.00092) [2022-07-11 15:15:26,722][26022] Updated weights on worker 0-0, policy_version 1246208 (0.00086) [2022-07-11 15:15:27,097][25689] Fps is (10 sec: 5761.3, 60 sec: 5592.2, 300 sec: 5572.7). Total num frames: 1276118016. Throughput: 0: 5794.3. Samples: 1276118706. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:27,097][25689] Avg episode reward: [(0, '0.894')] [2022-07-11 15:15:28,804][26022] Updated weights on worker 0-0, policy_version 1246218 (0.00090) [2022-07-11 15:15:30,455][26022] Updated weights on worker 0-0, policy_version 1246228 (0.00084) [2022-07-11 15:15:32,101][25689] Fps is (10 sec: 5597.0, 60 sec: 5558.3, 300 sec: 5573.1). Total num frames: 1276146688. Throughput: 0: 5818.8. Samples: 1276152256. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:32,102][25689] Avg episode reward: [(0, '1.359')] [2022-07-11 15:15:32,319][26022] Updated weights on worker 0-0, policy_version 1246238 (0.00082) [2022-07-11 15:15:34,244][26022] Updated weights on worker 0-0, policy_version 1246248 (0.00089) [2022-07-11 15:15:35,756][26022] Updated weights on worker 0-0, policy_version 1246258 (0.00082) [2022-07-11 15:15:37,107][25689] Fps is (10 sec: 5524.1, 60 sec: 5561.5, 300 sec: 5569.7). Total num frames: 1276173312. Throughput: 0: 5019.2. Samples: 1276169200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:37,107][25689] Avg episode reward: [(0, '1.443')] [2022-07-11 15:15:37,788][26022] Updated weights on worker 0-0, policy_version 1246268 (0.00090) [2022-07-11 15:15:39,550][26022] Updated weights on worker 0-0, policy_version 1246278 (0.00088) [2022-07-11 15:15:41,326][26022] Updated weights on worker 0-0, policy_version 1246288 (0.00085) [2022-07-11 15:15:42,133][25689] Fps is (10 sec: 5614.1, 60 sec: 5595.5, 300 sec: 5573.8). Total num frames: 1276203008. Throughput: 0: 5884.7. Samples: 1276203170. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:42,135][25689] Avg episode reward: [(0, '1.438')] [2022-07-11 15:15:43,490][26022] Updated weights on worker 0-0, policy_version 1246298 (0.00089) [2022-07-11 15:15:44,939][26022] Updated weights on worker 0-0, policy_version 1246308 (0.00049) [2022-07-11 15:15:47,043][26022] Updated weights on worker 0-0, policy_version 1246318 (0.00083) [2022-07-11 15:15:47,214][25689] Fps is (10 sec: 5673.5, 60 sec: 5559.8, 300 sec: 5568.9). Total num frames: 1276230656. Throughput: 0: 5875.5. Samples: 1276236904. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:47,216][25689] Avg episode reward: [(0, '0.879')] [2022-07-11 15:15:48,583][26022] Updated weights on worker 0-0, policy_version 1246328 (0.00095) [2022-07-11 15:15:50,679][26022] Updated weights on worker 0-0, policy_version 1246338 (0.00082) [2022-07-11 15:15:52,285][25689] Fps is (10 sec: 5548.1, 60 sec: 5577.2, 300 sec: 5571.2). Total num frames: 1276259328. Throughput: 0: 5030.1. Samples: 1276253780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:52,285][25689] Avg episode reward: [(0, '0.886')] [2022-07-11 15:15:52,384][26022] Updated weights on worker 0-0, policy_version 1246348 (0.00087) [2022-07-11 15:15:54,103][26022] Updated weights on worker 0-0, policy_version 1246358 (0.00087) [2022-07-11 15:15:55,993][26022] Updated weights on worker 0-0, policy_version 1246368 (0.00095) [2022-07-11 15:15:57,295][25689] Fps is (10 sec: 5688.5, 60 sec: 5562.3, 300 sec: 5571.3). Total num frames: 1276288000. Throughput: 0: 5867.0. Samples: 1276287642. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:15:57,295][25689] Avg episode reward: [(0, '0.810')] [2022-07-11 15:15:57,807][26022] Updated weights on worker 0-0, policy_version 1246378 (0.00085) [2022-07-11 15:15:59,535][26022] Updated weights on worker 0-0, policy_version 1246388 (0.00093) [2022-07-11 15:16:01,451][26022] Updated weights on worker 0-0, policy_version 1246398 (0.00087) [2022-07-11 15:16:02,339][25689] Fps is (10 sec: 5601.7, 60 sec: 5611.9, 300 sec: 5581.7). Total num frames: 1276315648. Throughput: 0: 5870.7. Samples: 1276321790. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:02,340][25689] Avg episode reward: [(0, '0.974')] [2022-07-11 15:16:03,427][26022] Updated weights on worker 0-0, policy_version 1246408 (0.00081) [2022-07-11 15:16:05,395][26022] Updated weights on worker 0-0, policy_version 1246418 (0.00088) [2022-07-11 15:16:07,133][26022] Updated weights on worker 0-0, policy_version 1246428 (0.00099) [2022-07-11 15:16:07,427][25689] Fps is (10 sec: 5457.4, 60 sec: 5575.2, 300 sec: 5576.9). Total num frames: 1276343296. Throughput: 0: 4932.0. Samples: 1276336594. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:07,427][25689] Avg episode reward: [(0, '0.925')] [2022-07-11 15:16:09,035][26022] Updated weights on worker 0-0, policy_version 1246438 (0.00087) [2022-07-11 15:16:10,885][26022] Updated weights on worker 0-0, policy_version 1246448 (0.00088) [2022-07-11 15:16:12,458][25689] Fps is (10 sec: 5565.4, 60 sec: 5579.7, 300 sec: 5576.7). Total num frames: 1276371968. Throughput: 0: 5770.9. Samples: 1276370200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:12,459][25689] Avg episode reward: [(0, '1.103')] [2022-07-11 15:16:12,618][26022] Updated weights on worker 0-0, policy_version 1246458 (0.00091) [2022-07-11 15:16:14,641][26022] Updated weights on worker 0-0, policy_version 1246468 (0.00087) [2022-07-11 15:16:16,497][26022] Updated weights on worker 0-0, policy_version 1246478 (0.00626) [2022-07-11 15:16:17,019][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:16:17,033][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001246482_1276397568.pth [2022-07-11 15:16:17,033][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001244520_1274388480.pth [2022-07-11 15:16:17,488][25689] Fps is (10 sec: 5495.8, 60 sec: 5600.3, 300 sec: 5573.1). Total num frames: 1276398592. Throughput: 0: 5758.7. Samples: 1276403932. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:17,489][25689] Avg episode reward: [(0, '1.568')] [2022-07-11 15:16:18,287][26022] Updated weights on worker 0-0, policy_version 1246488 (0.00091) [2022-07-11 15:16:19,722][26022] Updated weights on worker 0-0, policy_version 1246498 (0.00050) [2022-07-11 15:16:21,912][26022] Updated weights on worker 0-0, policy_version 1246508 (0.00090) [2022-07-11 15:16:22,501][25689] Fps is (10 sec: 5506.2, 60 sec: 5572.0, 300 sec: 5575.2). Total num frames: 1276427264. Throughput: 0: 4909.1. Samples: 1276420768. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:22,501][25689] Avg episode reward: [(0, '1.620')] [2022-07-11 15:16:23,588][26022] Updated weights on worker 0-0, policy_version 1246518 (0.00089) [2022-07-11 15:16:25,663][26022] Updated weights on worker 0-0, policy_version 1246528 (0.00084) [2022-07-11 15:16:27,385][26022] Updated weights on worker 0-0, policy_version 1246538 (0.00085) [2022-07-11 15:16:27,565][25689] Fps is (10 sec: 5690.7, 60 sec: 5588.4, 300 sec: 5577.5). Total num frames: 1276455936. Throughput: 0: 5850.0. Samples: 1276454404. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:27,566][25689] Avg episode reward: [(0, '1.450')] [2022-07-11 15:16:29,220][26022] Updated weights on worker 0-0, policy_version 1246548 (0.00089) [2022-07-11 15:16:31,141][26022] Updated weights on worker 0-0, policy_version 1246558 (0.00087) [2022-07-11 15:16:32,634][25689] Fps is (10 sec: 5557.8, 60 sec: 5565.5, 300 sec: 5569.6). Total num frames: 1276483584. Throughput: 0: 5849.8. Samples: 1276488226. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:32,635][25689] Avg episode reward: [(0, '1.207')] [2022-07-11 15:16:32,977][26022] Updated weights on worker 0-0, policy_version 1246568 (0.00087) [2022-07-11 15:16:34,700][26022] Updated weights on worker 0-0, policy_version 1246578 (0.00084) [2022-07-11 15:16:36,683][26022] Updated weights on worker 0-0, policy_version 1246588 (0.00085) [2022-07-11 15:16:37,666][25689] Fps is (10 sec: 5677.2, 60 sec: 5613.8, 300 sec: 5579.9). Total num frames: 1276513280. Throughput: 0: 5836.6. Samples: 1276521700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:37,666][25689] Avg episode reward: [(0, '1.220')] [2022-07-11 15:16:38,289][26022] Updated weights on worker 0-0, policy_version 1246598 (0.00092) [2022-07-11 15:16:40,097][26022] Updated weights on worker 0-0, policy_version 1246608 (0.00101) [2022-07-11 15:16:41,846][26022] Updated weights on worker 0-0, policy_version 1246618 (0.00109) [2022-07-11 15:16:42,692][25689] Fps is (10 sec: 5599.8, 60 sec: 5563.1, 300 sec: 5576.8). Total num frames: 1276539904. Throughput: 0: 5837.0. Samples: 1276538622. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2022-07-11 15:16:42,692][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 15:16:43,671][26022] Updated weights on worker 0-0, policy_version 1246628 (0.00077) [2022-07-11 15:16:45,605][26022] Updated weights on worker 0-0, policy_version 1246638 (0.00088) [2022-07-11 15:16:47,409][26022] Updated weights on worker 0-0, policy_version 1246648 (0.00084) [2022-07-11 15:16:47,761][25689] Fps is (10 sec: 5578.9, 60 sec: 5598.0, 300 sec: 5573.4). Total num frames: 1276569600. Throughput: 0: 5848.5. Samples: 1276572520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:16:47,762][25689] Avg episode reward: [(0, '1.030')] [2022-07-11 15:16:49,158][26022] Updated weights on worker 0-0, policy_version 1246658 (0.00085) [2022-07-11 15:16:51,062][26022] Updated weights on worker 0-0, policy_version 1246668 (0.00085) [2022-07-11 15:16:52,760][26022] Updated weights on worker 0-0, policy_version 1246678 (0.00093) [2022-07-11 15:16:52,807][25689] Fps is (10 sec: 5770.5, 60 sec: 5600.3, 300 sec: 5579.7). Total num frames: 1276598272. Throughput: 0: 5867.6. Samples: 1276606588. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:16:52,807][25689] Avg episode reward: [(0, '0.202')] [2022-07-11 15:16:54,626][26022] Updated weights on worker 0-0, policy_version 1246688 (0.00089) [2022-07-11 15:16:56,344][26022] Updated weights on worker 0-0, policy_version 1246698 (0.00083) [2022-07-11 15:16:57,810][25689] Fps is (10 sec: 5604.8, 60 sec: 5584.1, 300 sec: 5580.2). Total num frames: 1276625920. Throughput: 0: 5061.0. Samples: 1276623646. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:16:57,810][25689] Avg episode reward: [(0, '0.430')] [2022-07-11 15:16:58,385][26022] Updated weights on worker 0-0, policy_version 1246708 (0.00085) [2022-07-11 15:17:00,054][26022] Updated weights on worker 0-0, policy_version 1246718 (0.00086) [2022-07-11 15:17:01,973][26022] Updated weights on worker 0-0, policy_version 1246728 (0.00088) [2022-07-11 15:17:02,826][25689] Fps is (10 sec: 5314.7, 60 sec: 5552.8, 300 sec: 5578.5). Total num frames: 1276651520. Throughput: 0: 5901.5. Samples: 1276657442. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:02,826][25689] Avg episode reward: [(0, '0.675')] [2022-07-11 15:17:03,964][26022] Updated weights on worker 0-0, policy_version 1246738 (0.00079) [2022-07-11 15:17:06,298][26022] Updated weights on worker 0-0, policy_version 1246748 (0.00096) [2022-07-11 15:17:07,600][26022] Updated weights on worker 0-0, policy_version 1246758 (0.00084) [2022-07-11 15:17:07,938][25689] Fps is (10 sec: 5459.4, 60 sec: 5584.4, 300 sec: 5580.0). Total num frames: 1276681216. Throughput: 0: 5761.0. Samples: 1276688760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:07,939][25689] Avg episode reward: [(0, '0.302')] [2022-07-11 15:17:09,877][26022] Updated weights on worker 0-0, policy_version 1246768 (0.00049) [2022-07-11 15:17:11,265][26022] Updated weights on worker 0-0, policy_version 1246778 (0.00085) [2022-07-11 15:17:12,972][25689] Fps is (10 sec: 5550.9, 60 sec: 5550.4, 300 sec: 5576.8). Total num frames: 1276707840. Throughput: 0: 4899.8. Samples: 1276705392. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:12,972][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 15:17:13,410][26022] Updated weights on worker 0-0, policy_version 1246788 (0.00089) [2022-07-11 15:17:15,029][26022] Updated weights on worker 0-0, policy_version 1246798 (0.00084) [2022-07-11 15:17:16,987][26022] Updated weights on worker 0-0, policy_version 1246808 (0.00085) [2022-07-11 15:17:17,992][25689] Fps is (10 sec: 5601.8, 60 sec: 5602.0, 300 sec: 5577.0). Total num frames: 1276737536. Throughput: 0: 5740.5. Samples: 1276739504. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:17,993][25689] Avg episode reward: [(0, '0.826')] [2022-07-11 15:17:18,728][26022] Updated weights on worker 0-0, policy_version 1246818 (0.00086) [2022-07-11 15:17:20,575][26022] Updated weights on worker 0-0, policy_version 1246828 (0.00083) [2022-07-11 15:17:22,307][26022] Updated weights on worker 0-0, policy_version 1246838 (0.00082) [2022-07-11 15:17:22,998][25689] Fps is (10 sec: 5821.7, 60 sec: 5602.7, 300 sec: 5578.2). Total num frames: 1276766208. Throughput: 0: 5754.4. Samples: 1276773520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:22,998][25689] Avg episode reward: [(0, '1.370')] [2022-07-11 15:17:24,184][26022] Updated weights on worker 0-0, policy_version 1246848 (0.00089) [2022-07-11 15:17:25,979][26022] Updated weights on worker 0-0, policy_version 1246858 (0.00098) [2022-07-11 15:17:27,888][26022] Updated weights on worker 0-0, policy_version 1246868 (0.00086) [2022-07-11 15:17:28,069][25689] Fps is (10 sec: 5487.6, 60 sec: 5568.2, 300 sec: 5577.9). Total num frames: 1276792832. Throughput: 0: 5040.9. Samples: 1276790236. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:28,069][25689] Avg episode reward: [(0, '1.192')] [2022-07-11 15:17:29,419][26022] Updated weights on worker 0-0, policy_version 1246878 (0.00061) [2022-07-11 15:17:31,442][26022] Updated weights on worker 0-0, policy_version 1246888 (0.00086) [2022-07-11 15:17:33,131][25689] Fps is (10 sec: 5456.6, 60 sec: 5585.7, 300 sec: 5577.5). Total num frames: 1276821504. Throughput: 0: 5877.5. Samples: 1276823882. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:33,132][25689] Avg episode reward: [(0, '1.369')] [2022-07-11 15:17:33,402][26022] Updated weights on worker 0-0, policy_version 1246898 (0.00093) [2022-07-11 15:17:35,347][26022] Updated weights on worker 0-0, policy_version 1246908 (0.00086) [2022-07-11 15:17:36,997][26022] Updated weights on worker 0-0, policy_version 1246918 (0.00079) [2022-07-11 15:17:38,194][25689] Fps is (10 sec: 5663.6, 60 sec: 5566.0, 300 sec: 5576.7). Total num frames: 1276850176. Throughput: 0: 5835.9. Samples: 1276857398. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:38,195][25689] Avg episode reward: [(0, '0.348')] [2022-07-11 15:17:38,853][26022] Updated weights on worker 0-0, policy_version 1246928 (0.00089) [2022-07-11 15:17:40,583][26022] Updated weights on worker 0-0, policy_version 1246938 (0.00096) [2022-07-11 15:17:42,513][26022] Updated weights on worker 0-0, policy_version 1246948 (0.00087) [2022-07-11 15:17:43,251][25689] Fps is (10 sec: 5768.0, 60 sec: 5613.8, 300 sec: 5583.8). Total num frames: 1276879872. Throughput: 0: 4978.0. Samples: 1276874340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:43,251][25689] Avg episode reward: [(0, '0.300')] [2022-07-11 15:17:44,371][26022] Updated weights on worker 0-0, policy_version 1246958 (0.00092) [2022-07-11 15:17:46,292][26022] Updated weights on worker 0-0, policy_version 1246968 (0.00091) [2022-07-11 15:17:48,130][26022] Updated weights on worker 0-0, policy_version 1246978 (0.00088) [2022-07-11 15:17:48,391][25689] Fps is (10 sec: 5523.3, 60 sec: 5556.6, 300 sec: 5575.2). Total num frames: 1276906496. Throughput: 0: 5776.7. Samples: 1276907628. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:48,391][25689] Avg episode reward: [(0, '-0.138')] [2022-07-11 15:17:49,857][26022] Updated weights on worker 0-0, policy_version 1246988 (0.00080) [2022-07-11 15:17:51,698][26022] Updated weights on worker 0-0, policy_version 1246998 (0.00086) [2022-07-11 15:17:53,455][25689] Fps is (10 sec: 5418.9, 60 sec: 5554.9, 300 sec: 5578.2). Total num frames: 1276935168. Throughput: 0: 5772.5. Samples: 1276941200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:53,456][25689] Avg episode reward: [(0, '0.667')] [2022-07-11 15:17:53,642][26022] Updated weights on worker 0-0, policy_version 1247008 (0.00098) [2022-07-11 15:17:55,331][26022] Updated weights on worker 0-0, policy_version 1247018 (0.00081) [2022-07-11 15:17:57,419][26022] Updated weights on worker 0-0, policy_version 1247028 (0.00082) [2022-07-11 15:17:58,487][25689] Fps is (10 sec: 5679.8, 60 sec: 5569.2, 300 sec: 5574.7). Total num frames: 1276963840. Throughput: 0: 4950.2. Samples: 1276957852. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:17:58,487][25689] Avg episode reward: [(0, '0.516')] [2022-07-11 15:17:58,956][26022] Updated weights on worker 0-0, policy_version 1247038 (0.00100) [2022-07-11 15:18:00,991][26022] Updated weights on worker 0-0, policy_version 1247048 (0.00089) [2022-07-11 15:18:02,942][26022] Updated weights on worker 0-0, policy_version 1247058 (0.00088) [2022-07-11 15:18:03,515][25689] Fps is (10 sec: 5293.2, 60 sec: 5551.2, 300 sec: 5572.8). Total num frames: 1276988416. Throughput: 0: 5738.2. Samples: 1276990620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:03,515][25689] Avg episode reward: [(0, '0.460')] [2022-07-11 15:18:04,996][26022] Updated weights on worker 0-0, policy_version 1247068 (0.00082) [2022-07-11 15:18:06,819][26022] Updated weights on worker 0-0, policy_version 1247078 (0.00094) [2022-07-11 15:18:08,573][25689] Fps is (10 sec: 5279.1, 60 sec: 5539.2, 300 sec: 5579.2). Total num frames: 1277017088. Throughput: 0: 5706.9. Samples: 1277022810. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:08,575][25689] Avg episode reward: [(0, '1.549')] [2022-07-11 15:18:08,620][26022] Updated weights on worker 0-0, policy_version 1247088 (0.00086) [2022-07-11 15:18:10,536][26022] Updated weights on worker 0-0, policy_version 1247098 (0.00091) [2022-07-11 15:18:12,169][26022] Updated weights on worker 0-0, policy_version 1247108 (0.00079) [2022-07-11 15:18:13,609][25689] Fps is (10 sec: 5681.1, 60 sec: 5572.8, 300 sec: 5571.9). Total num frames: 1277045760. Throughput: 0: 4891.8. Samples: 1277039786. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:13,610][25689] Avg episode reward: [(0, '1.697')] [2022-07-11 15:18:14,147][26022] Updated weights on worker 0-0, policy_version 1247118 (0.00083) [2022-07-11 15:18:15,845][26022] Updated weights on worker 0-0, policy_version 1247128 (0.00091) [2022-07-11 15:18:17,079][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:18:17,093][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001247135_1277066240.pth [2022-07-11 15:18:17,093][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001245173_1275057152.pth [2022-07-11 15:18:17,703][26022] Updated weights on worker 0-0, policy_version 1247138 (0.00097) [2022-07-11 15:18:18,623][25689] Fps is (10 sec: 5706.0, 60 sec: 5556.5, 300 sec: 5579.2). Total num frames: 1277074432. Throughput: 0: 5758.1. Samples: 1277073800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:18,626][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 15:18:19,487][26022] Updated weights on worker 0-0, policy_version 1247148 (0.00091) [2022-07-11 15:18:21,367][26022] Updated weights on worker 0-0, policy_version 1247158 (0.00088) [2022-07-11 15:18:23,077][26022] Updated weights on worker 0-0, policy_version 1247168 (0.00093) [2022-07-11 15:18:23,646][25689] Fps is (10 sec: 5712.8, 60 sec: 5554.9, 300 sec: 5577.7). Total num frames: 1277103104. Throughput: 0: 5825.1. Samples: 1277107888. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:23,647][25689] Avg episode reward: [(0, '1.438')] [2022-07-11 15:18:25,061][26022] Updated weights on worker 0-0, policy_version 1247178 (0.00089) [2022-07-11 15:18:26,711][26022] Updated weights on worker 0-0, policy_version 1247188 (0.00964) [2022-07-11 15:18:28,535][26022] Updated weights on worker 0-0, policy_version 1247198 (0.00095) [2022-07-11 15:18:28,770][25689] Fps is (10 sec: 5550.4, 60 sec: 5566.9, 300 sec: 5576.0). Total num frames: 1277130752. Throughput: 0: 5049.9. Samples: 1277124802. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:28,771][25689] Avg episode reward: [(0, '0.293')] [2022-07-11 15:18:30,370][26022] Updated weights on worker 0-0, policy_version 1247208 (0.00088) [2022-07-11 15:18:32,159][26022] Updated weights on worker 0-0, policy_version 1247218 (0.00087) [2022-07-11 15:18:33,858][25689] Fps is (10 sec: 5615.6, 60 sec: 5581.5, 300 sec: 5584.8). Total num frames: 1277160448. Throughput: 0: 5877.8. Samples: 1277158808. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:33,859][25689] Avg episode reward: [(0, '-0.330')] [2022-07-11 15:18:33,992][26022] Updated weights on worker 0-0, policy_version 1247228 (0.00087) [2022-07-11 15:18:35,887][26022] Updated weights on worker 0-0, policy_version 1247238 (0.00090) [2022-07-11 15:18:37,607][26022] Updated weights on worker 0-0, policy_version 1247248 (0.00088) [2022-07-11 15:18:38,936][25689] Fps is (10 sec: 5640.8, 60 sec: 5563.2, 300 sec: 5571.0). Total num frames: 1277188096. Throughput: 0: 5830.9. Samples: 1277192242. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:38,936][25689] Avg episode reward: [(0, '-0.628')] [2022-07-11 15:18:39,699][26022] Updated weights on worker 0-0, policy_version 1247258 (0.00086) [2022-07-11 15:18:41,178][26022] Updated weights on worker 0-0, policy_version 1247268 (0.00088) [2022-07-11 15:18:43,250][26022] Updated weights on worker 0-0, policy_version 1247278 (0.00090) [2022-07-11 15:18:44,019][25689] Fps is (10 sec: 5643.8, 60 sec: 5560.8, 300 sec: 5577.3). Total num frames: 1277217792. Throughput: 0: 5798.3. Samples: 1277226012. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:44,019][25689] Avg episode reward: [(0, '-1.103')] [2022-07-11 15:18:44,919][26022] Updated weights on worker 0-0, policy_version 1247288 (0.00095) [2022-07-11 15:18:46,752][26022] Updated weights on worker 0-0, policy_version 1247298 (0.00093) [2022-07-11 15:18:48,754][26022] Updated weights on worker 0-0, policy_version 1247308 (0.00085) [2022-07-11 15:18:49,078][25689] Fps is (10 sec: 5553.0, 60 sec: 5568.2, 300 sec: 5572.8). Total num frames: 1277244416. Throughput: 0: 5807.5. Samples: 1277242742. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:49,079][25689] Avg episode reward: [(0, '-1.081')] [2022-07-11 15:18:50,319][26022] Updated weights on worker 0-0, policy_version 1247318 (0.00084) [2022-07-11 15:18:52,441][26022] Updated weights on worker 0-0, policy_version 1247328 (0.00091) [2022-07-11 15:18:54,105][25689] Fps is (10 sec: 5482.1, 60 sec: 5571.6, 300 sec: 5572.6). Total num frames: 1277273088. Throughput: 0: 5804.9. Samples: 1277276342. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:54,106][25689] Avg episode reward: [(0, '-0.766')] [2022-07-11 15:18:54,331][26022] Updated weights on worker 0-0, policy_version 1247338 (0.00087) [2022-07-11 15:18:55,849][26022] Updated weights on worker 0-0, policy_version 1247348 (0.00081) [2022-07-11 15:18:57,919][26022] Updated weights on worker 0-0, policy_version 1247358 (0.00088) [2022-07-11 15:18:59,138][25689] Fps is (10 sec: 5700.6, 60 sec: 5571.6, 300 sec: 5575.5). Total num frames: 1277301760. Throughput: 0: 5820.2. Samples: 1277309820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:18:59,138][25689] Avg episode reward: [(0, '0.271')] [2022-07-11 15:18:59,577][26022] Updated weights on worker 0-0, policy_version 1247368 (0.00082) [2022-07-11 15:19:01,683][26022] Updated weights on worker 0-0, policy_version 1247378 (0.00089) [2022-07-11 15:19:03,449][26022] Updated weights on worker 0-0, policy_version 1247388 (0.00081) [2022-07-11 15:19:04,180][25689] Fps is (10 sec: 5285.4, 60 sec: 5570.3, 300 sec: 5568.6). Total num frames: 1277326336. Throughput: 0: 4908.5. Samples: 1277324974. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:04,180][25689] Avg episode reward: [(0, '0.265')] [2022-07-11 15:19:05,540][26022] Updated weights on worker 0-0, policy_version 1247398 (0.00086) [2022-07-11 15:19:07,218][26022] Updated weights on worker 0-0, policy_version 1247408 (0.00084) [2022-07-11 15:19:09,035][26022] Updated weights on worker 0-0, policy_version 1247418 (0.00094) [2022-07-11 15:19:09,260][25689] Fps is (10 sec: 5462.7, 60 sec: 5602.0, 300 sec: 5578.0). Total num frames: 1277357056. Throughput: 0: 5735.0. Samples: 1277358484. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:09,262][25689] Avg episode reward: [(0, '1.293')] [2022-07-11 15:19:11,006][26022] Updated weights on worker 0-0, policy_version 1247428 (0.00085) [2022-07-11 15:19:12,607][26022] Updated weights on worker 0-0, policy_version 1247438 (0.00086) [2022-07-11 15:19:14,303][25689] Fps is (10 sec: 5765.9, 60 sec: 5584.5, 300 sec: 5577.3). Total num frames: 1277384704. Throughput: 0: 5736.4. Samples: 1277392202. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:14,303][25689] Avg episode reward: [(0, '1.895')] [2022-07-11 15:19:14,720][26022] Updated weights on worker 0-0, policy_version 1247448 (0.00114) [2022-07-11 15:19:16,524][26022] Updated weights on worker 0-0, policy_version 1247458 (0.00085) [2022-07-11 15:19:18,345][26022] Updated weights on worker 0-0, policy_version 1247468 (0.00051) [2022-07-11 15:19:19,368][25689] Fps is (10 sec: 5470.7, 60 sec: 5563.0, 300 sec: 5566.8). Total num frames: 1277412352. Throughput: 0: 4891.0. Samples: 1277408766. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:19,369][25689] Avg episode reward: [(0, '1.917')] [2022-07-11 15:19:20,107][26022] Updated weights on worker 0-0, policy_version 1247478 (0.00084) [2022-07-11 15:19:21,882][26022] Updated weights on worker 0-0, policy_version 1247488 (0.00082) [2022-07-11 15:19:23,747][26022] Updated weights on worker 0-0, policy_version 1247498 (0.00096) [2022-07-11 15:19:24,383][25689] Fps is (10 sec: 5485.5, 60 sec: 5546.8, 300 sec: 5575.5). Total num frames: 1277440000. Throughput: 0: 5828.7. Samples: 1277442734. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:24,384][25689] Avg episode reward: [(0, '1.276')] [2022-07-11 15:19:25,402][26022] Updated weights on worker 0-0, policy_version 1247508 (0.00101) [2022-07-11 15:19:27,543][26022] Updated weights on worker 0-0, policy_version 1247518 (0.00122) [2022-07-11 15:19:29,108][26022] Updated weights on worker 0-0, policy_version 1247528 (0.00100) [2022-07-11 15:19:29,512][25689] Fps is (10 sec: 5551.8, 60 sec: 5563.2, 300 sec: 5566.3). Total num frames: 1277468672. Throughput: 0: 5821.5. Samples: 1277476382. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:29,513][25689] Avg episode reward: [(0, '1.435')] [2022-07-11 15:19:31,050][26022] Updated weights on worker 0-0, policy_version 1247538 (0.00094) [2022-07-11 15:19:32,943][26022] Updated weights on worker 0-0, policy_version 1247548 (0.00598) [2022-07-11 15:19:34,515][25689] Fps is (10 sec: 5862.1, 60 sec: 5587.9, 300 sec: 5580.8). Total num frames: 1277499392. Throughput: 0: 4996.5. Samples: 1277493192. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:34,517][25689] Avg episode reward: [(0, '0.454')] [2022-07-11 15:19:34,523][26022] Updated weights on worker 0-0, policy_version 1247558 (0.00084) [2022-07-11 15:19:36,843][26022] Updated weights on worker 0-0, policy_version 1247568 (0.00085) [2022-07-11 15:19:38,253][26022] Updated weights on worker 0-0, policy_version 1247578 (0.00085) [2022-07-11 15:19:39,552][25689] Fps is (10 sec: 5711.6, 60 sec: 5574.7, 300 sec: 5577.2). Total num frames: 1277526016. Throughput: 0: 5846.5. Samples: 1277526774. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:39,553][25689] Avg episode reward: [(0, '0.577')] [2022-07-11 15:19:40,248][26022] Updated weights on worker 0-0, policy_version 1247588 (0.00080) [2022-07-11 15:19:41,944][26022] Updated weights on worker 0-0, policy_version 1247598 (0.00086) [2022-07-11 15:19:43,645][26022] Updated weights on worker 0-0, policy_version 1247608 (0.00087) [2022-07-11 15:19:44,642][25689] Fps is (10 sec: 5460.3, 60 sec: 5557.2, 300 sec: 5573.2). Total num frames: 1277554688. Throughput: 0: 5841.2. Samples: 1277561068. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:44,642][25689] Avg episode reward: [(0, '0.772')] [2022-07-11 15:19:45,716][26022] Updated weights on worker 0-0, policy_version 1247618 (0.00087) [2022-07-11 15:19:47,386][26022] Updated weights on worker 0-0, policy_version 1247628 (0.00089) [2022-07-11 15:19:49,143][26022] Updated weights on worker 0-0, policy_version 1247638 (0.00096) [2022-07-11 15:19:49,727][25689] Fps is (10 sec: 5736.4, 60 sec: 5605.5, 300 sec: 5579.8). Total num frames: 1277584384. Throughput: 0: 5028.0. Samples: 1277578024. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:49,728][25689] Avg episode reward: [(0, '0.692')] [2022-07-11 15:19:51,243][26022] Updated weights on worker 0-0, policy_version 1247648 (0.00089) [2022-07-11 15:19:52,790][26022] Updated weights on worker 0-0, policy_version 1247658 (0.00089) [2022-07-11 15:19:54,744][25689] Fps is (10 sec: 5574.7, 60 sec: 5572.7, 300 sec: 5569.8). Total num frames: 1277611008. Throughput: 0: 5855.2. Samples: 1277611640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:54,745][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 15:19:54,859][26022] Updated weights on worker 0-0, policy_version 1247668 (0.00095) [2022-07-11 15:19:56,476][26022] Updated weights on worker 0-0, policy_version 1247678 (0.00090) [2022-07-11 15:19:58,274][26022] Updated weights on worker 0-0, policy_version 1247688 (0.00094) [2022-07-11 15:19:59,747][25689] Fps is (10 sec: 5723.2, 60 sec: 5609.2, 300 sec: 5591.0). Total num frames: 1277641728. Throughput: 0: 5880.4. Samples: 1277645526. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:19:59,747][25689] Avg episode reward: [(0, '0.811')] [2022-07-11 15:20:00,131][26022] Updated weights on worker 0-0, policy_version 1247698 (0.00088) [2022-07-11 15:20:02,387][26022] Updated weights on worker 0-0, policy_version 1247708 (0.00081) [2022-07-11 15:20:04,172][26022] Updated weights on worker 0-0, policy_version 1247718 (0.00093) [2022-07-11 15:20:04,791][25689] Fps is (10 sec: 5503.7, 60 sec: 5609.0, 300 sec: 5574.0). Total num frames: 1277666304. Throughput: 0: 4925.3. Samples: 1277660312. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:04,792][25689] Avg episode reward: [(0, '0.863')] [2022-07-11 15:20:06,132][26022] Updated weights on worker 0-0, policy_version 1247728 (0.00084) [2022-07-11 15:20:07,915][26022] Updated weights on worker 0-0, policy_version 1247738 (0.00082) [2022-07-11 15:20:09,663][26022] Updated weights on worker 0-0, policy_version 1247748 (0.00085) [2022-07-11 15:20:09,915][25689] Fps is (10 sec: 5236.9, 60 sec: 5571.2, 300 sec: 5573.2). Total num frames: 1277694976. Throughput: 0: 5746.2. Samples: 1277694026. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:09,916][25689] Avg episode reward: [(0, '0.800')] [2022-07-11 15:20:11,571][26022] Updated weights on worker 0-0, policy_version 1247758 (0.00080) [2022-07-11 15:20:13,223][26022] Updated weights on worker 0-0, policy_version 1247768 (0.00084) [2022-07-11 15:20:14,932][25689] Fps is (10 sec: 5554.1, 60 sec: 5573.6, 300 sec: 5581.1). Total num frames: 1277722624. Throughput: 0: 5749.6. Samples: 1277727710. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:14,932][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 15:20:15,213][26022] Updated weights on worker 0-0, policy_version 1247778 (0.00082) [2022-07-11 15:20:16,988][26022] Updated weights on worker 0-0, policy_version 1247788 (0.00093) [2022-07-11 15:20:17,236][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:20:17,250][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001247789_1277735936.pth [2022-07-11 15:20:17,251][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001245827_1275726848.pth [2022-07-11 15:20:18,555][26022] Updated weights on worker 0-0, policy_version 1247798 (0.00084) [2022-07-11 15:20:19,948][25689] Fps is (10 sec: 5511.2, 60 sec: 5578.0, 300 sec: 5571.8). Total num frames: 1277750272. Throughput: 0: 4895.5. Samples: 1277744422. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:19,949][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 15:20:20,787][26022] Updated weights on worker 0-0, policy_version 1247808 (0.00098) [2022-07-11 15:20:22,268][26022] Updated weights on worker 0-0, policy_version 1247818 (0.00094) [2022-07-11 15:20:24,331][26022] Updated weights on worker 0-0, policy_version 1247828 (0.00085) [2022-07-11 15:20:24,969][25689] Fps is (10 sec: 5814.9, 60 sec: 5628.2, 300 sec: 5582.8). Total num frames: 1277780992. Throughput: 0: 5847.3. Samples: 1277778298. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:24,971][25689] Avg episode reward: [(0, '1.098')] [2022-07-11 15:20:26,499][26022] Updated weights on worker 0-0, policy_version 1247839 (0.00083) [2022-07-11 15:20:27,961][26022] Updated weights on worker 0-0, policy_version 1247849 (0.00082) [2022-07-11 15:20:29,982][26022] Updated weights on worker 0-0, policy_version 1247859 (0.00086) [2022-07-11 15:20:30,098][25689] Fps is (10 sec: 5649.8, 60 sec: 5594.5, 300 sec: 5573.7). Total num frames: 1277807616. Throughput: 0: 5841.6. Samples: 1277811928. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:30,098][25689] Avg episode reward: [(0, '1.415')] [2022-07-11 15:20:31,606][26022] Updated weights on worker 0-0, policy_version 1247869 (0.00083) [2022-07-11 15:20:33,563][26022] Updated weights on worker 0-0, policy_version 1247879 (0.00084) [2022-07-11 15:20:35,121][25689] Fps is (10 sec: 5447.1, 60 sec: 5558.8, 300 sec: 5580.2). Total num frames: 1277836288. Throughput: 0: 5846.2. Samples: 1277845740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:35,123][25689] Avg episode reward: [(0, '1.747')] [2022-07-11 15:20:35,539][26022] Updated weights on worker 0-0, policy_version 1247889 (0.00084) [2022-07-11 15:20:37,208][26022] Updated weights on worker 0-0, policy_version 1247899 (0.00084) [2022-07-11 15:20:39,169][26022] Updated weights on worker 0-0, policy_version 1247909 (0.00093) [2022-07-11 15:20:40,157][25689] Fps is (10 sec: 5598.7, 60 sec: 5575.8, 300 sec: 5573.1). Total num frames: 1277863936. Throughput: 0: 5842.2. Samples: 1277862488. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2022-07-11 15:20:40,158][25689] Avg episode reward: [(0, '1.922')] [2022-07-11 15:20:40,995][26022] Updated weights on worker 0-0, policy_version 1247919 (0.00082) [2022-07-11 15:20:42,938][26022] Updated weights on worker 0-0, policy_version 1247929 (0.00084) [2022-07-11 15:20:44,652][26022] Updated weights on worker 0-0, policy_version 1247939 (0.00093) [2022-07-11 15:20:45,251][25689] Fps is (10 sec: 5661.0, 60 sec: 5592.3, 300 sec: 5579.8). Total num frames: 1277893632. Throughput: 0: 5798.5. Samples: 1277895898. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:20:45,251][25689] Avg episode reward: [(0, '1.349')] [2022-07-11 15:20:46,542][26022] Updated weights on worker 0-0, policy_version 1247949 (0.00089) [2022-07-11 15:20:48,194][26022] Updated weights on worker 0-0, policy_version 1247959 (0.00083) [2022-07-11 15:20:50,237][26022] Updated weights on worker 0-0, policy_version 1247969 (0.00089) [2022-07-11 15:20:50,324][25689] Fps is (10 sec: 5539.8, 60 sec: 5542.8, 300 sec: 5572.9). Total num frames: 1277920256. Throughput: 0: 5816.3. Samples: 1277929568. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:20:50,326][25689] Avg episode reward: [(0, '1.151')] [2022-07-11 15:20:51,994][26022] Updated weights on worker 0-0, policy_version 1247979 (0.00084) [2022-07-11 15:20:53,544][26022] Updated weights on worker 0-0, policy_version 1247989 (0.00085) [2022-07-11 15:20:55,337][25689] Fps is (10 sec: 5380.9, 60 sec: 5560.0, 300 sec: 5569.4). Total num frames: 1277947904. Throughput: 0: 4983.7. Samples: 1277946490. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:20:55,337][25689] Avg episode reward: [(0, '1.058')] [2022-07-11 15:20:55,746][26022] Updated weights on worker 0-0, policy_version 1247999 (0.00085) [2022-07-11 15:20:57,279][26022] Updated weights on worker 0-0, policy_version 1248009 (0.00507) [2022-07-11 15:20:59,306][26022] Updated weights on worker 0-0, policy_version 1248019 (0.00087) [2022-07-11 15:21:00,387][25689] Fps is (10 sec: 5902.0, 60 sec: 5572.6, 300 sec: 5583.0). Total num frames: 1277979648. Throughput: 0: 5821.3. Samples: 1277980250. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:00,388][25689] Avg episode reward: [(0, '0.860')] [2022-07-11 15:21:01,182][26022] Updated weights on worker 0-0, policy_version 1248029 (0.00088) [2022-07-11 15:21:03,085][26022] Updated weights on worker 0-0, policy_version 1248039 (0.00086) [2022-07-11 15:21:05,152][26022] Updated weights on worker 0-0, policy_version 1248049 (0.00093) [2022-07-11 15:21:05,427][25689] Fps is (10 sec: 5480.2, 60 sec: 5556.1, 300 sec: 5570.2). Total num frames: 1278003200. Throughput: 0: 5751.7. Samples: 1278011944. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:05,427][25689] Avg episode reward: [(0, '1.379')] [2022-07-11 15:21:06,830][26022] Updated weights on worker 0-0, policy_version 1248059 (0.00087) [2022-07-11 15:21:08,773][26022] Updated weights on worker 0-0, policy_version 1248069 (0.00091) [2022-07-11 15:21:10,532][25689] Fps is (10 sec: 5046.9, 60 sec: 5540.9, 300 sec: 5565.3). Total num frames: 1278030848. Throughput: 0: 4909.2. Samples: 1278028772. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:10,532][25689] Avg episode reward: [(0, '0.985')] [2022-07-11 15:21:10,726][26022] Updated weights on worker 0-0, policy_version 1248079 (0.00088) [2022-07-11 15:21:12,371][26022] Updated weights on worker 0-0, policy_version 1248089 (0.00087) [2022-07-11 15:21:14,233][26022] Updated weights on worker 0-0, policy_version 1248099 (0.00082) [2022-07-11 15:21:15,540][25689] Fps is (10 sec: 5670.1, 60 sec: 5575.5, 300 sec: 5576.1). Total num frames: 1278060544. Throughput: 0: 5726.6. Samples: 1278062188. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:15,541][25689] Avg episode reward: [(0, '1.632')] [2022-07-11 15:21:16,089][26022] Updated weights on worker 0-0, policy_version 1248109 (0.00098) [2022-07-11 15:21:18,023][26022] Updated weights on worker 0-0, policy_version 1248119 (0.00098) [2022-07-11 15:21:19,867][26022] Updated weights on worker 0-0, policy_version 1248129 (0.00086) [2022-07-11 15:21:20,611][25689] Fps is (10 sec: 5689.4, 60 sec: 5570.5, 300 sec: 5571.5). Total num frames: 1278088192. Throughput: 0: 5707.7. Samples: 1278095682. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:20,611][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 15:21:21,637][26022] Updated weights on worker 0-0, policy_version 1248139 (0.00087) [2022-07-11 15:21:23,428][26022] Updated weights on worker 0-0, policy_version 1248149 (0.00094) [2022-07-11 15:21:25,109][26022] Updated weights on worker 0-0, policy_version 1248159 (0.00089) [2022-07-11 15:21:25,620][25689] Fps is (10 sec: 5587.4, 60 sec: 5537.8, 300 sec: 5572.6). Total num frames: 1278116864. Throughput: 0: 4992.0. Samples: 1278112748. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:25,621][25689] Avg episode reward: [(0, '0.948')] [2022-07-11 15:21:27,097][26022] Updated weights on worker 0-0, policy_version 1248169 (0.00093) [2022-07-11 15:21:28,785][26022] Updated weights on worker 0-0, policy_version 1248179 (0.00087) [2022-07-11 15:21:30,671][25689] Fps is (10 sec: 5598.4, 60 sec: 5561.8, 300 sec: 5572.9). Total num frames: 1278144512. Throughput: 0: 5830.6. Samples: 1278146196. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:30,671][25689] Avg episode reward: [(0, '0.996')] [2022-07-11 15:21:30,723][26022] Updated weights on worker 0-0, policy_version 1248189 (0.00090) [2022-07-11 15:21:32,441][26022] Updated weights on worker 0-0, policy_version 1248199 (0.00080) [2022-07-11 15:21:34,274][26022] Updated weights on worker 0-0, policy_version 1248209 (0.00087) [2022-07-11 15:21:35,732][25689] Fps is (10 sec: 5569.7, 60 sec: 5558.3, 300 sec: 5568.9). Total num frames: 1278173184. Throughput: 0: 5827.6. Samples: 1278179858. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:35,733][25689] Avg episode reward: [(0, '0.953')] [2022-07-11 15:21:36,087][26022] Updated weights on worker 0-0, policy_version 1248219 (0.00089) [2022-07-11 15:21:38,042][26022] Updated weights on worker 0-0, policy_version 1248229 (0.00089) [2022-07-11 15:21:39,789][26022] Updated weights on worker 0-0, policy_version 1248239 (0.00089) [2022-07-11 15:21:40,781][25689] Fps is (10 sec: 5570.6, 60 sec: 5557.2, 300 sec: 5571.9). Total num frames: 1278200832. Throughput: 0: 5010.8. Samples: 1278196752. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:40,781][25689] Avg episode reward: [(0, '1.114')] [2022-07-11 15:21:41,635][26022] Updated weights on worker 0-0, policy_version 1248249 (0.00084) [2022-07-11 15:21:43,587][26022] Updated weights on worker 0-0, policy_version 1248259 (0.00084) [2022-07-11 15:21:45,194][26022] Updated weights on worker 0-0, policy_version 1248269 (0.00087) [2022-07-11 15:21:45,795][25689] Fps is (10 sec: 5596.7, 60 sec: 5547.5, 300 sec: 5569.5). Total num frames: 1278229504. Throughput: 0: 5829.7. Samples: 1278230362. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:45,797][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 15:21:47,286][26022] Updated weights on worker 0-0, policy_version 1248279 (0.00091) [2022-07-11 15:21:49,039][26022] Updated weights on worker 0-0, policy_version 1248289 (0.00086) [2022-07-11 15:21:50,851][25689] Fps is (10 sec: 5592.6, 60 sec: 5566.0, 300 sec: 5565.9). Total num frames: 1278257152. Throughput: 0: 5847.7. Samples: 1278264206. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:50,852][25689] Avg episode reward: [(0, '1.551')] [2022-07-11 15:21:50,864][26022] Updated weights on worker 0-0, policy_version 1248299 (0.00091) [2022-07-11 15:21:52,652][26022] Updated weights on worker 0-0, policy_version 1248309 (0.00084) [2022-07-11 15:21:54,395][26022] Updated weights on worker 0-0, policy_version 1248319 (0.00089) [2022-07-11 15:21:55,869][25689] Fps is (10 sec: 5692.1, 60 sec: 5599.4, 300 sec: 5572.5). Total num frames: 1278286848. Throughput: 0: 5023.9. Samples: 1278281026. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:21:55,870][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 15:21:56,326][26022] Updated weights on worker 0-0, policy_version 1248329 (0.00084) [2022-07-11 15:21:58,017][26022] Updated weights on worker 0-0, policy_version 1248339 (0.00087) [2022-07-11 15:21:59,840][26022] Updated weights on worker 0-0, policy_version 1248349 (0.00083) [2022-07-11 15:22:00,963][25689] Fps is (10 sec: 5671.2, 60 sec: 5527.7, 300 sec: 5577.9). Total num frames: 1278314496. Throughput: 0: 5851.0. Samples: 1278314836. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:00,963][25689] Avg episode reward: [(0, '0.276')] [2022-07-11 15:22:01,636][26022] Updated weights on worker 0-0, policy_version 1248359 (0.00093) [2022-07-11 15:22:03,897][26022] Updated weights on worker 0-0, policy_version 1248369 (0.00088) [2022-07-11 15:22:05,646][26022] Updated weights on worker 0-0, policy_version 1248379 (0.00092) [2022-07-11 15:22:05,995][25689] Fps is (10 sec: 5359.8, 60 sec: 5579.2, 300 sec: 5569.1). Total num frames: 1278341120. Throughput: 0: 5763.2. Samples: 1278346780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:05,996][25689] Avg episode reward: [(0, '0.375')] [2022-07-11 15:22:07,509][26022] Updated weights on worker 0-0, policy_version 1248389 (0.00080) [2022-07-11 15:22:09,219][26022] Updated weights on worker 0-0, policy_version 1248399 (0.00086) [2022-07-11 15:22:11,061][25689] Fps is (10 sec: 5577.4, 60 sec: 5616.6, 300 sec: 5578.8). Total num frames: 1278370816. Throughput: 0: 4918.1. Samples: 1278363598. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:11,063][26022] Updated weights on worker 0-0, policy_version 1248409 (0.00091) [2022-07-11 15:22:11,067][25689] Avg episode reward: [(0, '0.346')] [2022-07-11 15:22:13,059][26022] Updated weights on worker 0-0, policy_version 1248419 (0.00090) [2022-07-11 15:22:14,831][26022] Updated weights on worker 0-0, policy_version 1248429 (0.00087) [2022-07-11 15:22:16,071][25689] Fps is (10 sec: 5487.9, 60 sec: 5548.8, 300 sec: 5565.2). Total num frames: 1278396416. Throughput: 0: 5750.9. Samples: 1278397204. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:16,072][25689] Avg episode reward: [(0, '-0.006')] [2022-07-11 15:22:16,749][26022] Updated weights on worker 0-0, policy_version 1248439 (0.00093) [2022-07-11 15:22:17,286][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:22:17,303][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001248442_1278404608.pth [2022-07-11 15:22:17,304][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001246482_1276397568.pth [2022-07-11 15:22:18,265][26022] Updated weights on worker 0-0, policy_version 1248449 (0.00086) [2022-07-11 15:22:20,270][26022] Updated weights on worker 0-0, policy_version 1248459 (0.00089) [2022-07-11 15:22:21,084][25689] Fps is (10 sec: 5414.4, 60 sec: 5570.9, 300 sec: 5565.1). Total num frames: 1278425088. Throughput: 0: 5775.0. Samples: 1278431038. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:21,085][25689] Avg episode reward: [(0, '0.856')] [2022-07-11 15:22:22,087][26022] Updated weights on worker 0-0, policy_version 1248469 (0.00077) [2022-07-11 15:22:23,864][26022] Updated weights on worker 0-0, policy_version 1248479 (0.00084) [2022-07-11 15:22:25,691][26022] Updated weights on worker 0-0, policy_version 1248489 (0.00089) [2022-07-11 15:22:26,109][25689] Fps is (10 sec: 5916.9, 60 sec: 5603.4, 300 sec: 5579.7). Total num frames: 1278455808. Throughput: 0: 5044.5. Samples: 1278448244. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:26,109][25689] Avg episode reward: [(0, '1.107')] [2022-07-11 15:22:27,571][26022] Updated weights on worker 0-0, policy_version 1248499 (0.00085) [2022-07-11 15:22:29,148][26022] Updated weights on worker 0-0, policy_version 1248509 (0.00086) [2022-07-11 15:22:31,173][25689] Fps is (10 sec: 5582.5, 60 sec: 5568.3, 300 sec: 5569.4). Total num frames: 1278481408. Throughput: 0: 5890.5. Samples: 1278482070. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:31,173][25689] Avg episode reward: [(0, '0.106')] [2022-07-11 15:22:31,402][26022] Updated weights on worker 0-0, policy_version 1248519 (0.00090) [2022-07-11 15:22:32,758][26022] Updated weights on worker 0-0, policy_version 1248529 (0.00076) [2022-07-11 15:22:34,930][26022] Updated weights on worker 0-0, policy_version 1248539 (0.00087) [2022-07-11 15:22:36,185][25689] Fps is (10 sec: 5487.5, 60 sec: 5589.8, 300 sec: 5573.7). Total num frames: 1278511104. Throughput: 0: 5909.8. Samples: 1278516076. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:36,186][25689] Avg episode reward: [(0, '0.036')] [2022-07-11 15:22:36,518][26022] Updated weights on worker 0-0, policy_version 1248549 (0.00083) [2022-07-11 15:22:38,299][26022] Updated weights on worker 0-0, policy_version 1248559 (0.00092) [2022-07-11 15:22:40,424][26022] Updated weights on worker 0-0, policy_version 1248569 (0.00092) [2022-07-11 15:22:41,282][25689] Fps is (10 sec: 5774.1, 60 sec: 5602.3, 300 sec: 5569.6). Total num frames: 1278539776. Throughput: 0: 5041.1. Samples: 1278532854. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:41,282][25689] Avg episode reward: [(0, '-0.530')] [2022-07-11 15:22:42,194][26022] Updated weights on worker 0-0, policy_version 1248579 (0.00086) [2022-07-11 15:22:43,946][26022] Updated weights on worker 0-0, policy_version 1248589 (0.00088) [2022-07-11 15:22:45,935][26022] Updated weights on worker 0-0, policy_version 1248599 (0.00092) [2022-07-11 15:22:46,297][25689] Fps is (10 sec: 5569.7, 60 sec: 5585.2, 300 sec: 5575.3). Total num frames: 1278567424. Throughput: 0: 5853.8. Samples: 1278566424. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:46,298][25689] Avg episode reward: [(0, '-0.290')] [2022-07-11 15:22:47,670][26022] Updated weights on worker 0-0, policy_version 1248609 (0.00107) [2022-07-11 15:22:49,621][26022] Updated weights on worker 0-0, policy_version 1248619 (0.00086) [2022-07-11 15:22:51,245][26022] Updated weights on worker 0-0, policy_version 1248629 (0.00087) [2022-07-11 15:22:51,342][25689] Fps is (10 sec: 5598.0, 60 sec: 5603.2, 300 sec: 5575.7). Total num frames: 1278596096. Throughput: 0: 5831.4. Samples: 1278599686. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:51,343][25689] Avg episode reward: [(0, '-1.384')] [2022-07-11 15:22:53,243][26022] Updated weights on worker 0-0, policy_version 1248639 (0.00082) [2022-07-11 15:22:55,119][26022] Updated weights on worker 0-0, policy_version 1248649 (0.00081) [2022-07-11 15:22:56,360][25689] Fps is (10 sec: 5495.2, 60 sec: 5552.5, 300 sec: 5569.1). Total num frames: 1278622720. Throughput: 0: 5806.5. Samples: 1278633220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:22:56,362][25689] Avg episode reward: [(0, '-0.557')] [2022-07-11 15:22:56,974][26022] Updated weights on worker 0-0, policy_version 1248659 (0.00085) [2022-07-11 15:22:58,558][26022] Updated weights on worker 0-0, policy_version 1248669 (0.00083) [2022-07-11 15:23:00,407][26022] Updated weights on worker 0-0, policy_version 1248679 (0.00089) [2022-07-11 15:23:01,380][25689] Fps is (10 sec: 5508.8, 60 sec: 5576.1, 300 sec: 5583.0). Total num frames: 1278651392. Throughput: 0: 5845.4. Samples: 1278650338. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:01,381][25689] Avg episode reward: [(0, '-0.350')] [2022-07-11 15:23:02,772][26022] Updated weights on worker 0-0, policy_version 1248689 (0.00090) [2022-07-11 15:23:04,668][26022] Updated weights on worker 0-0, policy_version 1248699 (0.00086) [2022-07-11 15:23:06,397][25689] Fps is (10 sec: 5407.4, 60 sec: 5560.7, 300 sec: 5573.5). Total num frames: 1278676992. Throughput: 0: 5728.2. Samples: 1278681556. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:06,397][25689] Avg episode reward: [(0, '0.536')] [2022-07-11 15:23:06,432][26022] Updated weights on worker 0-0, policy_version 1248709 (0.00082) [2022-07-11 15:23:08,227][26022] Updated weights on worker 0-0, policy_version 1248719 (0.00091) [2022-07-11 15:23:09,994][26022] Updated weights on worker 0-0, policy_version 1248729 (0.00087) [2022-07-11 15:23:11,508][25689] Fps is (10 sec: 5358.5, 60 sec: 5539.5, 300 sec: 5572.0). Total num frames: 1278705664. Throughput: 0: 5709.3. Samples: 1278714820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:11,509][25689] Avg episode reward: [(0, '-0.059')] [2022-07-11 15:23:12,096][26022] Updated weights on worker 0-0, policy_version 1248739 (0.00086) [2022-07-11 15:23:13,615][26022] Updated weights on worker 0-0, policy_version 1248749 (0.00088) [2022-07-11 15:23:15,629][26022] Updated weights on worker 0-0, policy_version 1248759 (0.00092) [2022-07-11 15:23:16,524][25689] Fps is (10 sec: 5662.4, 60 sec: 5589.8, 300 sec: 5572.0). Total num frames: 1278734336. Throughput: 0: 4891.4. Samples: 1278731850. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:16,524][25689] Avg episode reward: [(0, '-0.206')] [2022-07-11 15:23:17,279][26022] Updated weights on worker 0-0, policy_version 1248769 (0.00086) [2022-07-11 15:23:19,076][26022] Updated weights on worker 0-0, policy_version 1248779 (0.00092) [2022-07-11 15:23:21,397][26022] Updated weights on worker 0-0, policy_version 1248789 (0.00081) [2022-07-11 15:23:21,544][25689] Fps is (10 sec: 5510.2, 60 sec: 5555.3, 300 sec: 5565.2). Total num frames: 1278760960. Throughput: 0: 5714.9. Samples: 1278765570. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:21,544][25689] Avg episode reward: [(0, '0.569')] [2022-07-11 15:23:22,765][26022] Updated weights on worker 0-0, policy_version 1248799 (0.00088) [2022-07-11 15:23:24,844][26022] Updated weights on worker 0-0, policy_version 1248809 (0.00095) [2022-07-11 15:23:26,402][26022] Updated weights on worker 0-0, policy_version 1248819 (0.00090) [2022-07-11 15:23:26,571][25689] Fps is (10 sec: 5605.8, 60 sec: 5538.2, 300 sec: 5573.9). Total num frames: 1278790656. Throughput: 0: 5834.5. Samples: 1278799260. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:26,571][25689] Avg episode reward: [(0, '0.579')] [2022-07-11 15:23:28,308][26022] Updated weights on worker 0-0, policy_version 1248829 (0.00081) [2022-07-11 15:23:30,433][26022] Updated weights on worker 0-0, policy_version 1248839 (0.00087) [2022-07-11 15:23:31,685][25689] Fps is (10 sec: 5755.4, 60 sec: 5584.3, 300 sec: 5569.9). Total num frames: 1278819328. Throughput: 0: 5017.8. Samples: 1278816062. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:31,686][25689] Avg episode reward: [(0, '0.631')] [2022-07-11 15:23:31,947][26022] Updated weights on worker 0-0, policy_version 1248849 (0.00088) [2022-07-11 15:23:34,000][26022] Updated weights on worker 0-0, policy_version 1248859 (0.00088) [2022-07-11 15:23:35,727][26022] Updated weights on worker 0-0, policy_version 1248869 (0.00091) [2022-07-11 15:23:36,722][25689] Fps is (10 sec: 5447.2, 60 sec: 5531.3, 300 sec: 5567.3). Total num frames: 1278845952. Throughput: 0: 5839.4. Samples: 1278849794. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:36,723][25689] Avg episode reward: [(0, '-0.560')] [2022-07-11 15:23:37,452][26022] Updated weights on worker 0-0, policy_version 1248879 (0.00082) [2022-07-11 15:23:39,527][26022] Updated weights on worker 0-0, policy_version 1248889 (0.00081) [2022-07-11 15:23:40,958][26022] Updated weights on worker 0-0, policy_version 1248899 (0.00078) [2022-07-11 15:23:41,743][25689] Fps is (10 sec: 5599.7, 60 sec: 5555.2, 300 sec: 5568.4). Total num frames: 1278875648. Throughput: 0: 5843.9. Samples: 1278883612. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:41,743][25689] Avg episode reward: [(0, '0.217')] [2022-07-11 15:23:43,136][26022] Updated weights on worker 0-0, policy_version 1248909 (0.00088) [2022-07-11 15:23:44,800][26022] Updated weights on worker 0-0, policy_version 1248919 (0.00087) [2022-07-11 15:23:46,655][26022] Updated weights on worker 0-0, policy_version 1248929 (0.00087) [2022-07-11 15:23:46,749][25689] Fps is (10 sec: 5718.6, 60 sec: 5556.0, 300 sec: 5572.9). Total num frames: 1278903296. Throughput: 0: 5010.9. Samples: 1278900374. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:46,750][25689] Avg episode reward: [(0, '-0.635')] [2022-07-11 15:23:48,560][26022] Updated weights on worker 0-0, policy_version 1248939 (0.00087) [2022-07-11 15:23:50,342][26022] Updated weights on worker 0-0, policy_version 1248949 (0.00083) [2022-07-11 15:23:51,860][25689] Fps is (10 sec: 5566.4, 60 sec: 5549.9, 300 sec: 5571.3). Total num frames: 1278931968. Throughput: 0: 5845.4. Samples: 1278933996. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:51,861][25689] Avg episode reward: [(0, '-1.354')] [2022-07-11 15:23:52,228][26022] Updated weights on worker 0-0, policy_version 1248959 (0.00095) [2022-07-11 15:23:53,977][26022] Updated weights on worker 0-0, policy_version 1248969 (0.00098) [2022-07-11 15:23:55,829][26022] Updated weights on worker 0-0, policy_version 1248979 (0.00080) [2022-07-11 15:23:56,886][25689] Fps is (10 sec: 5556.1, 60 sec: 5566.1, 300 sec: 5568.0). Total num frames: 1278959616. Throughput: 0: 5841.6. Samples: 1278967586. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:23:56,887][25689] Avg episode reward: [(0, '-1.541')] [2022-07-11 15:23:57,647][26022] Updated weights on worker 0-0, policy_version 1248989 (0.00086) [2022-07-11 15:23:59,556][26022] Updated weights on worker 0-0, policy_version 1248999 (0.00089) [2022-07-11 15:24:01,266][26022] Updated weights on worker 0-0, policy_version 1249009 (0.00087) [2022-07-11 15:24:01,890][25689] Fps is (10 sec: 5513.1, 60 sec: 5550.7, 300 sec: 5579.0). Total num frames: 1278987264. Throughput: 0: 5015.4. Samples: 1278984660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:01,891][25689] Avg episode reward: [(0, '-1.456')] [2022-07-11 15:24:03,569][26022] Updated weights on worker 0-0, policy_version 1249019 (0.00090) [2022-07-11 15:24:05,186][26022] Updated weights on worker 0-0, policy_version 1249029 (0.00086) [2022-07-11 15:24:06,910][25689] Fps is (10 sec: 5414.1, 60 sec: 5567.3, 300 sec: 5566.4). Total num frames: 1279013888. Throughput: 0: 5749.4. Samples: 1279016286. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:06,911][25689] Avg episode reward: [(0, '-0.620')] [2022-07-11 15:24:07,083][26022] Updated weights on worker 0-0, policy_version 1249039 (0.00083) [2022-07-11 15:24:08,927][26022] Updated weights on worker 0-0, policy_version 1249049 (0.00084) [2022-07-11 15:24:10,760][26022] Updated weights on worker 0-0, policy_version 1249059 (0.00089) [2022-07-11 15:24:11,982][25689] Fps is (10 sec: 5580.9, 60 sec: 5587.9, 300 sec: 5572.7). Total num frames: 1279043584. Throughput: 0: 5756.5. Samples: 1279049826. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:11,984][25689] Avg episode reward: [(0, '-0.438')] [2022-07-11 15:24:12,830][26022] Updated weights on worker 0-0, policy_version 1249069 (0.00088) [2022-07-11 15:24:14,352][26022] Updated weights on worker 0-0, policy_version 1249079 (0.00080) [2022-07-11 15:24:16,231][26022] Updated weights on worker 0-0, policy_version 1249089 (0.00100) [2022-07-11 15:24:16,989][25689] Fps is (10 sec: 5588.0, 60 sec: 5554.8, 300 sec: 5570.4). Total num frames: 1279070208. Throughput: 0: 4926.2. Samples: 1279066616. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:16,991][25689] Avg episode reward: [(0, '0.766')] [2022-07-11 15:24:17,374][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:24:17,386][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001249095_1279073280.pth [2022-07-11 15:24:17,386][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001247135_1277066240.pth [2022-07-11 15:24:18,011][26022] Updated weights on worker 0-0, policy_version 1249099 (0.00087) [2022-07-11 15:24:19,818][26022] Updated weights on worker 0-0, policy_version 1249109 (0.00082) [2022-07-11 15:24:21,836][26022] Updated weights on worker 0-0, policy_version 1249119 (0.00096) [2022-07-11 15:24:22,044][25689] Fps is (10 sec: 5495.7, 60 sec: 5585.4, 300 sec: 5573.1). Total num frames: 1279098880. Throughput: 0: 5748.8. Samples: 1279100518. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:22,046][25689] Avg episode reward: [(0, '1.287')] [2022-07-11 15:24:23,605][26022] Updated weights on worker 0-0, policy_version 1249129 (0.00088) [2022-07-11 15:24:25,389][26022] Updated weights on worker 0-0, policy_version 1249139 (0.00085) [2022-07-11 15:24:27,068][25689] Fps is (10 sec: 5587.6, 60 sec: 5551.8, 300 sec: 5571.6). Total num frames: 1279126528. Throughput: 0: 5845.4. Samples: 1279134118. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:27,070][25689] Avg episode reward: [(0, '1.403')] [2022-07-11 15:24:27,223][26022] Updated weights on worker 0-0, policy_version 1249149 (0.00620) [2022-07-11 15:24:29,064][26022] Updated weights on worker 0-0, policy_version 1249159 (0.00091) [2022-07-11 15:24:30,935][26022] Updated weights on worker 0-0, policy_version 1249169 (0.00086) [2022-07-11 15:24:32,112][25689] Fps is (10 sec: 5695.4, 60 sec: 5575.2, 300 sec: 5567.4). Total num frames: 1279156224. Throughput: 0: 5029.4. Samples: 1279151068. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:32,113][25689] Avg episode reward: [(0, '1.239')] [2022-07-11 15:24:32,729][26022] Updated weights on worker 0-0, policy_version 1249179 (0.00083) [2022-07-11 15:24:34,381][26022] Updated weights on worker 0-0, policy_version 1249189 (0.00095) [2022-07-11 15:24:36,227][26022] Updated weights on worker 0-0, policy_version 1249199 (0.00094) [2022-07-11 15:24:37,156][25689] Fps is (10 sec: 5786.3, 60 sec: 5608.5, 300 sec: 5574.1). Total num frames: 1279184896. Throughput: 0: 5874.2. Samples: 1279185080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:37,156][25689] Avg episode reward: [(0, '0.251')] [2022-07-11 15:24:38,164][26022] Updated weights on worker 0-0, policy_version 1249209 (0.00085) [2022-07-11 15:24:39,996][26022] Updated weights on worker 0-0, policy_version 1249219 (0.00085) [2022-07-11 15:24:41,909][26022] Updated weights on worker 0-0, policy_version 1249229 (0.00089) [2022-07-11 15:24:42,221][25689] Fps is (10 sec: 5571.5, 60 sec: 5570.5, 300 sec: 5571.1). Total num frames: 1279212544. Throughput: 0: 5862.4. Samples: 1279218804. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:24:42,222][25689] Avg episode reward: [(0, '0.064')] [2022-07-11 15:24:43,507][26022] Updated weights on worker 0-0, policy_version 1249239 (0.00085) [2022-07-11 15:24:45,475][26022] Updated weights on worker 0-0, policy_version 1249249 (0.00082) [2022-07-11 15:24:47,056][26022] Updated weights on worker 0-0, policy_version 1249259 (0.00086) [2022-07-11 15:24:47,281][25689] Fps is (10 sec: 5562.3, 60 sec: 5582.5, 300 sec: 5568.2). Total num frames: 1279241216. Throughput: 0: 5033.7. Samples: 1279235862. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:24:47,282][25689] Avg episode reward: [(0, '0.528')] [2022-07-11 15:24:48,976][26022] Updated weights on worker 0-0, policy_version 1249269 (0.00091) [2022-07-11 15:24:51,012][26022] Updated weights on worker 0-0, policy_version 1249279 (0.00094) [2022-07-11 15:24:52,399][25689] Fps is (10 sec: 5734.7, 60 sec: 5598.8, 300 sec: 5576.6). Total num frames: 1279270912. Throughput: 0: 5840.7. Samples: 1279269556. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:24:52,399][25689] Avg episode reward: [(0, '-0.183')] [2022-07-11 15:24:52,472][26022] Updated weights on worker 0-0, policy_version 1249289 (0.00570) [2022-07-11 15:24:54,747][26022] Updated weights on worker 0-0, policy_version 1249299 (0.00087) [2022-07-11 15:24:56,093][26022] Updated weights on worker 0-0, policy_version 1249309 (0.00093) [2022-07-11 15:24:57,447][25689] Fps is (10 sec: 5540.1, 60 sec: 5579.8, 300 sec: 5562.0). Total num frames: 1279297536. Throughput: 0: 5831.9. Samples: 1279303416. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:24:57,447][25689] Avg episode reward: [(0, '-0.345')] [2022-07-11 15:24:58,019][26022] Updated weights on worker 0-0, policy_version 1249319 (0.00083) [2022-07-11 15:25:00,167][26022] Updated weights on worker 0-0, policy_version 1249329 (0.00084) [2022-07-11 15:25:01,797][26022] Updated weights on worker 0-0, policy_version 1249339 (0.00087) [2022-07-11 15:25:02,481][25689] Fps is (10 sec: 5383.2, 60 sec: 5577.1, 300 sec: 5572.5). Total num frames: 1279325184. Throughput: 0: 5016.9. Samples: 1279320446. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:02,481][25689] Avg episode reward: [(0, '0.597')] [2022-07-11 15:25:03,965][26022] Updated weights on worker 0-0, policy_version 1249349 (0.00084) [2022-07-11 15:25:05,854][26022] Updated weights on worker 0-0, policy_version 1249359 (0.00082) [2022-07-11 15:25:07,481][26022] Updated weights on worker 0-0, policy_version 1249369 (0.00087) [2022-07-11 15:25:07,572][25689] Fps is (10 sec: 5562.6, 60 sec: 5604.4, 300 sec: 5573.1). Total num frames: 1279353856. Throughput: 0: 5734.1. Samples: 1279352210. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:07,572][25689] Avg episode reward: [(0, '0.560')] [2022-07-11 15:25:09,562][26022] Updated weights on worker 0-0, policy_version 1249379 (0.00087) [2022-07-11 15:25:11,177][26022] Updated weights on worker 0-0, policy_version 1249389 (0.00082) [2022-07-11 15:25:12,627][25689] Fps is (10 sec: 5450.0, 60 sec: 5555.2, 300 sec: 5569.0). Total num frames: 1279380480. Throughput: 0: 5767.1. Samples: 1279386212. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:12,627][25689] Avg episode reward: [(0, '0.530')] [2022-07-11 15:25:13,011][26022] Updated weights on worker 0-0, policy_version 1249399 (0.00099) [2022-07-11 15:25:14,706][26022] Updated weights on worker 0-0, policy_version 1249409 (0.00094) [2022-07-11 15:25:16,482][26022] Updated weights on worker 0-0, policy_version 1249419 (0.00086) [2022-07-11 15:25:17,644][25689] Fps is (10 sec: 5693.1, 60 sec: 5621.8, 300 sec: 5579.3). Total num frames: 1279411200. Throughput: 0: 4936.2. Samples: 1279403114. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:17,645][25689] Avg episode reward: [(0, '0.603')] [2022-07-11 15:25:18,621][26022] Updated weights on worker 0-0, policy_version 1249429 (0.00082) [2022-07-11 15:25:20,223][26022] Updated weights on worker 0-0, policy_version 1249439 (0.00090) [2022-07-11 15:25:21,914][26022] Updated weights on worker 0-0, policy_version 1249449 (0.00086) [2022-07-11 15:25:22,648][25689] Fps is (10 sec: 5824.3, 60 sec: 5609.6, 300 sec: 5569.3). Total num frames: 1279438848. Throughput: 0: 5793.3. Samples: 1279437282. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:22,650][25689] Avg episode reward: [(0, '1.281')] [2022-07-11 15:25:24,133][26022] Updated weights on worker 0-0, policy_version 1249459 (0.00091) [2022-07-11 15:25:25,567][26022] Updated weights on worker 0-0, policy_version 1249469 (0.00090) [2022-07-11 15:25:27,618][26022] Updated weights on worker 0-0, policy_version 1249479 (0.00089) [2022-07-11 15:25:27,659][25689] Fps is (10 sec: 5521.6, 60 sec: 5610.9, 300 sec: 5574.9). Total num frames: 1279466496. Throughput: 0: 5905.9. Samples: 1279470842. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:27,671][25689] Avg episode reward: [(0, '1.711')] [2022-07-11 15:25:29,459][26022] Updated weights on worker 0-0, policy_version 1249489 (0.00084) [2022-07-11 15:25:30,946][26022] Updated weights on worker 0-0, policy_version 1249499 (0.00084) [2022-07-11 15:25:32,784][25689] Fps is (10 sec: 5455.3, 60 sec: 5569.6, 300 sec: 5569.6). Total num frames: 1279494144. Throughput: 0: 5882.9. Samples: 1279504796. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:32,785][25689] Avg episode reward: [(0, '1.185')] [2022-07-11 15:25:33,132][26022] Updated weights on worker 0-0, policy_version 1249509 (0.00089) [2022-07-11 15:25:34,656][26022] Updated weights on worker 0-0, policy_version 1249519 (0.00085) [2022-07-11 15:25:36,458][26022] Updated weights on worker 0-0, policy_version 1249529 (0.00091) [2022-07-11 15:25:37,791][25689] Fps is (10 sec: 5760.8, 60 sec: 5606.8, 300 sec: 5580.4). Total num frames: 1279524864. Throughput: 0: 5894.6. Samples: 1279521870. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:37,792][25689] Avg episode reward: [(0, '0.381')] [2022-07-11 15:25:38,564][26022] Updated weights on worker 0-0, policy_version 1249539 (0.00087) [2022-07-11 15:25:40,184][26022] Updated weights on worker 0-0, policy_version 1249549 (0.00086) [2022-07-11 15:25:42,166][26022] Updated weights on worker 0-0, policy_version 1249559 (0.00087) [2022-07-11 15:25:42,795][25689] Fps is (10 sec: 5728.1, 60 sec: 5595.5, 300 sec: 5571.8). Total num frames: 1279551488. Throughput: 0: 5861.5. Samples: 1279555374. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:42,796][25689] Avg episode reward: [(0, '0.227')] [2022-07-11 15:25:43,800][26022] Updated weights on worker 0-0, policy_version 1249569 (0.00113) [2022-07-11 15:25:45,694][26022] Updated weights on worker 0-0, policy_version 1249579 (0.00096) [2022-07-11 15:25:47,798][25689] Fps is (10 sec: 5321.0, 60 sec: 5567.0, 300 sec: 5573.1). Total num frames: 1279578112. Throughput: 0: 5854.5. Samples: 1279588746. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:47,798][25689] Avg episode reward: [(0, '0.214')] [2022-07-11 15:25:47,909][26022] Updated weights on worker 0-0, policy_version 1249589 (0.00101) [2022-07-11 15:25:49,394][26022] Updated weights on worker 0-0, policy_version 1249599 (0.00087) [2022-07-11 15:25:51,468][26022] Updated weights on worker 0-0, policy_version 1249609 (0.00086) [2022-07-11 15:25:52,841][25689] Fps is (10 sec: 5606.0, 60 sec: 5573.8, 300 sec: 5579.4). Total num frames: 1279607808. Throughput: 0: 5020.1. Samples: 1279605486. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:52,842][25689] Avg episode reward: [(0, '-0.022')] [2022-07-11 15:25:53,016][26022] Updated weights on worker 0-0, policy_version 1249619 (0.00089) [2022-07-11 15:25:55,048][26022] Updated weights on worker 0-0, policy_version 1249629 (0.00089) [2022-07-11 15:25:56,773][26022] Updated weights on worker 0-0, policy_version 1249639 (0.00091) [2022-07-11 15:25:57,868][25689] Fps is (10 sec: 5796.1, 60 sec: 5609.7, 300 sec: 5569.5). Total num frames: 1279636480. Throughput: 0: 5840.2. Samples: 1279639126. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:25:57,868][25689] Avg episode reward: [(0, '-1.047')] [2022-07-11 15:25:58,611][26022] Updated weights on worker 0-0, policy_version 1249649 (0.00092) [2022-07-11 15:26:00,333][26022] Updated weights on worker 0-0, policy_version 1249659 (0.00089) [2022-07-11 15:26:02,729][26022] Updated weights on worker 0-0, policy_version 1249669 (0.00084) [2022-07-11 15:26:02,904][25689] Fps is (10 sec: 5393.7, 60 sec: 5575.6, 300 sec: 5576.5). Total num frames: 1279662080. Throughput: 0: 5731.7. Samples: 1279670630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:02,904][25689] Avg episode reward: [(0, '-1.212')] [2022-07-11 15:26:04,405][26022] Updated weights on worker 0-0, policy_version 1249679 (0.00086) [2022-07-11 15:26:06,400][26022] Updated weights on worker 0-0, policy_version 1249689 (0.00093) [2022-07-11 15:26:07,920][25689] Fps is (10 sec: 5399.0, 60 sec: 5582.5, 300 sec: 5581.6). Total num frames: 1279690752. Throughput: 0: 4914.1. Samples: 1279687630. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:07,921][25689] Avg episode reward: [(0, '0.376')] [2022-07-11 15:26:07,974][26022] Updated weights on worker 0-0, policy_version 1249699 (0.00077) [2022-07-11 15:26:09,955][26022] Updated weights on worker 0-0, policy_version 1249709 (0.00078) [2022-07-11 15:26:11,688][26022] Updated weights on worker 0-0, policy_version 1249719 (0.00084) [2022-07-11 15:26:13,024][25689] Fps is (10 sec: 5565.0, 60 sec: 5595.0, 300 sec: 5572.9). Total num frames: 1279718400. Throughput: 0: 5730.0. Samples: 1279721132. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:13,024][25689] Avg episode reward: [(0, '0.112')] [2022-07-11 15:26:13,773][26022] Updated weights on worker 0-0, policy_version 1249729 (0.00082) [2022-07-11 15:26:15,390][26022] Updated weights on worker 0-0, policy_version 1249739 (0.00091) [2022-07-11 15:26:17,420][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:26:17,430][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001249749_1279742976.pth [2022-07-11 15:26:17,433][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001247789_1277735936.pth [2022-07-11 15:26:17,439][26022] Updated weights on worker 0-0, policy_version 1249749 (0.00090) [2022-07-11 15:26:18,029][25689] Fps is (10 sec: 5368.7, 60 sec: 5528.3, 300 sec: 5570.7). Total num frames: 1279745024. Throughput: 0: 5732.3. Samples: 1279754696. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:18,030][25689] Avg episode reward: [(0, '-0.026')] [2022-07-11 15:26:19,091][26022] Updated weights on worker 0-0, policy_version 1249759 (0.00097) [2022-07-11 15:26:21,020][26022] Updated weights on worker 0-0, policy_version 1249769 (0.00086) [2022-07-11 15:26:22,697][26022] Updated weights on worker 0-0, policy_version 1249779 (0.00080) [2022-07-11 15:26:23,057][25689] Fps is (10 sec: 5613.4, 60 sec: 5560.0, 300 sec: 5573.8). Total num frames: 1279774720. Throughput: 0: 5013.8. Samples: 1279771676. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:23,058][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 15:26:24,527][26022] Updated weights on worker 0-0, policy_version 1249789 (0.00092) [2022-07-11 15:26:26,470][26022] Updated weights on worker 0-0, policy_version 1249799 (0.00080) [2022-07-11 15:26:28,045][26022] Updated weights on worker 0-0, policy_version 1249809 (0.00087) [2022-07-11 15:26:28,141][25689] Fps is (10 sec: 5873.5, 60 sec: 5587.1, 300 sec: 5580.1). Total num frames: 1279804416. Throughput: 0: 5839.7. Samples: 1279805714. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:28,141][25689] Avg episode reward: [(0, '0.204')] [2022-07-11 15:26:30,145][26022] Updated weights on worker 0-0, policy_version 1249819 (0.00086) [2022-07-11 15:26:31,758][26022] Updated weights on worker 0-0, policy_version 1249829 (0.00101) [2022-07-11 15:26:33,263][25689] Fps is (10 sec: 5518.6, 60 sec: 5570.5, 300 sec: 5572.0). Total num frames: 1279831040. Throughput: 0: 5862.1. Samples: 1279839774. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:33,264][25689] Avg episode reward: [(0, '0.217')] [2022-07-11 15:26:33,663][26022] Updated weights on worker 0-0, policy_version 1249839 (0.00086) [2022-07-11 15:26:35,400][26022] Updated weights on worker 0-0, policy_version 1249849 (0.00091) [2022-07-11 15:26:37,097][26022] Updated weights on worker 0-0, policy_version 1249859 (0.00081) [2022-07-11 15:26:38,273][25689] Fps is (10 sec: 5659.5, 60 sec: 5570.1, 300 sec: 5583.1). Total num frames: 1279861760. Throughput: 0: 5049.8. Samples: 1279856926. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:38,274][25689] Avg episode reward: [(0, '0.244')] [2022-07-11 15:26:39,235][26022] Updated weights on worker 0-0, policy_version 1249869 (0.00090) [2022-07-11 15:26:40,769][26022] Updated weights on worker 0-0, policy_version 1249879 (0.00079) [2022-07-11 15:26:42,749][26022] Updated weights on worker 0-0, policy_version 1249889 (0.00088) [2022-07-11 15:26:43,281][25689] Fps is (10 sec: 5928.5, 60 sec: 5603.7, 300 sec: 5583.2). Total num frames: 1279890432. Throughput: 0: 5899.4. Samples: 1279890984. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:43,282][25689] Avg episode reward: [(0, '0.554')] [2022-07-11 15:26:44,534][26022] Updated weights on worker 0-0, policy_version 1249899 (0.00055) [2022-07-11 15:26:46,187][26022] Updated weights on worker 0-0, policy_version 1249909 (0.00089) [2022-07-11 15:26:48,159][26022] Updated weights on worker 0-0, policy_version 1249919 (0.00088) [2022-07-11 15:26:48,298][25689] Fps is (10 sec: 5618.1, 60 sec: 5619.3, 300 sec: 5584.0). Total num frames: 1279918080. Throughput: 0: 5929.7. Samples: 1279925240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:48,299][25689] Avg episode reward: [(0, '-0.080')] [2022-07-11 15:26:49,727][26022] Updated weights on worker 0-0, policy_version 1249929 (0.00083) [2022-07-11 15:26:51,740][26022] Updated weights on worker 0-0, policy_version 1249939 (0.00087) [2022-07-11 15:26:53,341][25689] Fps is (10 sec: 5700.1, 60 sec: 5619.4, 300 sec: 5583.5). Total num frames: 1279947776. Throughput: 0: 5098.5. Samples: 1279942144. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:53,341][25689] Avg episode reward: [(0, '0.719')] [2022-07-11 15:26:53,347][26022] Updated weights on worker 0-0, policy_version 1249949 (0.00083) [2022-07-11 15:26:55,367][26022] Updated weights on worker 0-0, policy_version 1249959 (0.00085) [2022-07-11 15:26:57,081][26022] Updated weights on worker 0-0, policy_version 1249969 (0.00086) [2022-07-11 15:26:58,352][25689] Fps is (10 sec: 5602.0, 60 sec: 5587.0, 300 sec: 5581.6). Total num frames: 1279974400. Throughput: 0: 5936.3. Samples: 1279976116. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:26:58,352][25689] Avg episode reward: [(0, '0.703')] [2022-07-11 15:26:58,915][26022] Updated weights on worker 0-0, policy_version 1249979 (0.00088) [2022-07-11 15:27:00,820][26022] Updated weights on worker 0-0, policy_version 1249989 (0.00088) [2022-07-11 15:27:02,937][26022] Updated weights on worker 0-0, policy_version 1249999 (0.00084) [2022-07-11 15:27:03,367][25689] Fps is (10 sec: 5311.0, 60 sec: 5605.8, 300 sec: 5581.9). Total num frames: 1280001024. Throughput: 0: 5819.1. Samples: 1280007868. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:03,369][25689] Avg episode reward: [(0, '0.836')] [2022-07-11 15:27:04,812][26022] Updated weights on worker 0-0, policy_version 1250009 (0.00096) [2022-07-11 15:27:06,498][26022] Updated weights on worker 0-0, policy_version 1250019 (0.00089) [2022-07-11 15:27:08,371][25689] Fps is (10 sec: 5416.8, 60 sec: 5590.0, 300 sec: 5576.2). Total num frames: 1280028672. Throughput: 0: 4963.7. Samples: 1280024874. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:08,371][25689] Avg episode reward: [(0, '0.846')] [2022-07-11 15:27:08,535][26022] Updated weights on worker 0-0, policy_version 1250029 (0.00085) [2022-07-11 15:27:10,175][26022] Updated weights on worker 0-0, policy_version 1250039 (0.00089) [2022-07-11 15:27:12,118][26022] Updated weights on worker 0-0, policy_version 1250049 (0.00087) [2022-07-11 15:27:13,482][25689] Fps is (10 sec: 5568.0, 60 sec: 5606.3, 300 sec: 5584.6). Total num frames: 1280057344. Throughput: 0: 5781.5. Samples: 1280058590. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:13,483][25689] Avg episode reward: [(0, '0.707')] [2022-07-11 15:27:13,911][26022] Updated weights on worker 0-0, policy_version 1250059 (0.00081) [2022-07-11 15:27:15,606][26022] Updated weights on worker 0-0, policy_version 1250069 (0.00091) [2022-07-11 15:27:17,675][26022] Updated weights on worker 0-0, policy_version 1250079 (0.00092) [2022-07-11 15:27:18,506][25689] Fps is (10 sec: 5758.8, 60 sec: 5655.3, 300 sec: 5587.9). Total num frames: 1280087040. Throughput: 0: 5760.2. Samples: 1280092210. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:18,507][25689] Avg episode reward: [(0, '1.607')] [2022-07-11 15:27:19,418][26022] Updated weights on worker 0-0, policy_version 1250089 (0.00087) [2022-07-11 15:27:21,272][26022] Updated weights on worker 0-0, policy_version 1250099 (0.00086) [2022-07-11 15:27:23,175][26022] Updated weights on worker 0-0, policy_version 1250109 (0.00082) [2022-07-11 15:27:23,531][25689] Fps is (10 sec: 5604.6, 60 sec: 5604.8, 300 sec: 5574.1). Total num frames: 1280113664. Throughput: 0: 5017.7. Samples: 1280109044. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:23,531][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 15:27:24,782][26022] Updated weights on worker 0-0, policy_version 1250119 (0.00085) [2022-07-11 15:27:26,822][26022] Updated weights on worker 0-0, policy_version 1250129 (0.00092) [2022-07-11 15:27:28,417][26022] Updated weights on worker 0-0, policy_version 1250139 (0.00091) [2022-07-11 15:27:28,555][25689] Fps is (10 sec: 5502.8, 60 sec: 5593.4, 300 sec: 5585.2). Total num frames: 1280142336. Throughput: 0: 5842.4. Samples: 1280142798. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:28,555][25689] Avg episode reward: [(0, '0.864')] [2022-07-11 15:27:30,476][26022] Updated weights on worker 0-0, policy_version 1250149 (0.00087) [2022-07-11 15:27:32,241][26022] Updated weights on worker 0-0, policy_version 1250159 (0.00081) [2022-07-11 15:27:33,645][25689] Fps is (10 sec: 5670.0, 60 sec: 5630.3, 300 sec: 5580.3). Total num frames: 1280171008. Throughput: 0: 5855.9. Samples: 1280176660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:33,645][25689] Avg episode reward: [(0, '0.288')] [2022-07-11 15:27:34,011][26022] Updated weights on worker 0-0, policy_version 1250169 (0.00085) [2022-07-11 15:27:35,803][26022] Updated weights on worker 0-0, policy_version 1250179 (0.00087) [2022-07-11 15:27:37,773][26022] Updated weights on worker 0-0, policy_version 1250189 (0.00089) [2022-07-11 15:27:38,744][25689] Fps is (10 sec: 5527.2, 60 sec: 5571.2, 300 sec: 5576.8). Total num frames: 1280198656. Throughput: 0: 5007.1. Samples: 1280193546. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:38,745][25689] Avg episode reward: [(0, '-1.323')] [2022-07-11 15:27:39,363][26022] Updated weights on worker 0-0, policy_version 1250199 (0.00085) [2022-07-11 15:27:41,376][26022] Updated weights on worker 0-0, policy_version 1250209 (0.00083) [2022-07-11 15:27:43,021][26022] Updated weights on worker 0-0, policy_version 1250219 (0.00095) [2022-07-11 15:27:43,768][25689] Fps is (10 sec: 5563.1, 60 sec: 5569.7, 300 sec: 5580.1). Total num frames: 1280227328. Throughput: 0: 5843.8. Samples: 1280227308. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:43,769][25689] Avg episode reward: [(0, '-1.994')] [2022-07-11 15:27:44,883][26022] Updated weights on worker 0-0, policy_version 1250229 (0.00109) [2022-07-11 15:27:46,812][26022] Updated weights on worker 0-0, policy_version 1250239 (0.00087) [2022-07-11 15:27:48,583][26022] Updated weights on worker 0-0, policy_version 1250249 (0.00077) [2022-07-11 15:27:48,779][25689] Fps is (10 sec: 5714.4, 60 sec: 5587.2, 300 sec: 5580.7). Total num frames: 1280256000. Throughput: 0: 5847.2. Samples: 1280261056. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:48,780][25689] Avg episode reward: [(0, '-1.060')] [2022-07-11 15:27:50,601][26022] Updated weights on worker 0-0, policy_version 1250259 (0.00082) [2022-07-11 15:27:51,961][26022] Updated weights on worker 0-0, policy_version 1250269 (0.00084) [2022-07-11 15:27:53,899][25689] Fps is (10 sec: 5660.4, 60 sec: 5563.2, 300 sec: 5585.7). Total num frames: 1280284672. Throughput: 0: 5001.4. Samples: 1280277960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:53,900][25689] Avg episode reward: [(0, '-1.445')] [2022-07-11 15:27:53,910][26022] Updated weights on worker 0-0, policy_version 1250279 (0.00087) [2022-07-11 15:27:55,898][26022] Updated weights on worker 0-0, policy_version 1250289 (0.00092) [2022-07-11 15:27:57,663][26022] Updated weights on worker 0-0, policy_version 1250299 (0.00090) [2022-07-11 15:27:58,902][25689] Fps is (10 sec: 5665.2, 60 sec: 5597.7, 300 sec: 5586.0). Total num frames: 1280313344. Throughput: 0: 5869.6. Samples: 1280311860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:27:58,902][25689] Avg episode reward: [(0, '-1.239')] [2022-07-11 15:27:59,611][26022] Updated weights on worker 0-0, policy_version 1250309 (0.00091) [2022-07-11 15:28:01,351][26022] Updated weights on worker 0-0, policy_version 1250319 (0.00077) [2022-07-11 15:28:03,451][26022] Updated weights on worker 0-0, policy_version 1250329 (0.00082) [2022-07-11 15:28:03,918][25689] Fps is (10 sec: 5416.9, 60 sec: 5580.7, 300 sec: 5586.0). Total num frames: 1280338944. Throughput: 0: 5770.3. Samples: 1280343578. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:03,920][25689] Avg episode reward: [(0, '-1.844')] [2022-07-11 15:28:05,188][26022] Updated weights on worker 0-0, policy_version 1250339 (0.00095) [2022-07-11 15:28:07,276][26022] Updated weights on worker 0-0, policy_version 1250349 (0.00087) [2022-07-11 15:28:08,871][26022] Updated weights on worker 0-0, policy_version 1250359 (0.00056) [2022-07-11 15:28:08,932][25689] Fps is (10 sec: 5410.8, 60 sec: 5596.7, 300 sec: 5587.8). Total num frames: 1280367616. Throughput: 0: 4936.4. Samples: 1280360536. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:08,932][25689] Avg episode reward: [(0, '-1.553')] [2022-07-11 15:28:10,920][26022] Updated weights on worker 0-0, policy_version 1250369 (0.00083) [2022-07-11 15:28:12,580][26022] Updated weights on worker 0-0, policy_version 1250379 (0.00087) [2022-07-11 15:28:14,040][25689] Fps is (10 sec: 5564.0, 60 sec: 5580.1, 300 sec: 5582.6). Total num frames: 1280395264. Throughput: 0: 5755.9. Samples: 1280393892. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:14,041][25689] Avg episode reward: [(0, '-0.804')] [2022-07-11 15:28:14,509][26022] Updated weights on worker 0-0, policy_version 1250389 (0.00084) [2022-07-11 15:28:16,151][26022] Updated weights on worker 0-0, policy_version 1250399 (0.00073) [2022-07-11 15:28:17,496][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:28:17,508][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001250404_1280413696.pth [2022-07-11 15:28:17,508][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001248442_1278404608.pth [2022-07-11 15:28:18,232][26022] Updated weights on worker 0-0, policy_version 1250409 (0.00088) [2022-07-11 15:28:19,065][25689] Fps is (10 sec: 5456.7, 60 sec: 5546.2, 300 sec: 5586.0). Total num frames: 1280422912. Throughput: 0: 5714.4. Samples: 1280427084. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:19,067][25689] Avg episode reward: [(0, '-0.998')] [2022-07-11 15:28:19,969][26022] Updated weights on worker 0-0, policy_version 1250419 (0.00088) [2022-07-11 15:28:21,833][26022] Updated weights on worker 0-0, policy_version 1250429 (0.00087) [2022-07-11 15:28:23,708][26022] Updated weights on worker 0-0, policy_version 1250439 (0.00084) [2022-07-11 15:28:24,140][25689] Fps is (10 sec: 5475.1, 60 sec: 5558.5, 300 sec: 5578.2). Total num frames: 1280450560. Throughput: 0: 5791.6. Samples: 1280460694. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:24,142][25689] Avg episode reward: [(0, '-1.485')] [2022-07-11 15:28:25,505][26022] Updated weights on worker 0-0, policy_version 1250449 (0.00086) [2022-07-11 15:28:27,402][26022] Updated weights on worker 0-0, policy_version 1250459 (0.00086) [2022-07-11 15:28:29,202][25689] Fps is (10 sec: 5556.3, 60 sec: 5555.0, 300 sec: 5579.2). Total num frames: 1280479232. Throughput: 0: 5760.1. Samples: 1280477292. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:29,203][25689] Avg episode reward: [(0, '-1.101')] [2022-07-11 15:28:29,373][26022] Updated weights on worker 0-0, policy_version 1250469 (0.00088) [2022-07-11 15:28:30,988][26022] Updated weights on worker 0-0, policy_version 1250479 (0.00093) [2022-07-11 15:28:33,081][26022] Updated weights on worker 0-0, policy_version 1250489 (0.00087) [2022-07-11 15:28:34,337][25689] Fps is (10 sec: 5724.0, 60 sec: 5567.7, 300 sec: 5587.7). Total num frames: 1280508928. Throughput: 0: 5761.3. Samples: 1280510828. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:34,338][25689] Avg episode reward: [(0, '-0.249')] [2022-07-11 15:28:34,751][26022] Updated weights on worker 0-0, policy_version 1250499 (0.00416) [2022-07-11 15:28:36,625][26022] Updated weights on worker 0-0, policy_version 1250509 (0.00084) [2022-07-11 15:28:38,348][26022] Updated weights on worker 0-0, policy_version 1250519 (0.00094) [2022-07-11 15:28:39,343][25689] Fps is (10 sec: 5755.2, 60 sec: 5593.2, 300 sec: 5584.5). Total num frames: 1280537600. Throughput: 0: 5792.9. Samples: 1280544552. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:39,344][25689] Avg episode reward: [(0, '-0.136')] [2022-07-11 15:28:40,436][26022] Updated weights on worker 0-0, policy_version 1250529 (0.00097) [2022-07-11 15:28:41,958][26022] Updated weights on worker 0-0, policy_version 1250539 (0.00095) [2022-07-11 15:28:43,847][26022] Updated weights on worker 0-0, policy_version 1250549 (0.00087) [2022-07-11 15:28:44,383][25689] Fps is (10 sec: 5504.4, 60 sec: 5558.0, 300 sec: 5580.4). Total num frames: 1280564224. Throughput: 0: 4973.0. Samples: 1280561362. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2022-07-11 15:28:44,383][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 15:28:45,466][26022] Updated weights on worker 0-0, policy_version 1250559 (0.00091) [2022-07-11 15:28:47,637][26022] Updated weights on worker 0-0, policy_version 1250569 (0.00088) [2022-07-11 15:28:49,359][26022] Updated weights on worker 0-0, policy_version 1250579 (0.00091) [2022-07-11 15:28:49,424][25689] Fps is (10 sec: 5485.6, 60 sec: 5555.3, 300 sec: 5581.8). Total num frames: 1280592896. Throughput: 0: 5816.7. Samples: 1280594918. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:28:49,424][25689] Avg episode reward: [(0, '0.841')] [2022-07-11 15:28:51,434][26022] Updated weights on worker 0-0, policy_version 1250589 (0.00081) [2022-07-11 15:28:53,165][26022] Updated weights on worker 0-0, policy_version 1250599 (0.00084) [2022-07-11 15:28:54,475][25689] Fps is (10 sec: 5479.2, 60 sec: 5527.8, 300 sec: 5577.8). Total num frames: 1280619520. Throughput: 0: 5824.7. Samples: 1280628124. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:28:54,475][25689] Avg episode reward: [(0, '1.181')] [2022-07-11 15:28:54,879][26022] Updated weights on worker 0-0, policy_version 1250609 (0.00088) [2022-07-11 15:28:56,747][26022] Updated weights on worker 0-0, policy_version 1250619 (0.00090) [2022-07-11 15:28:58,613][26022] Updated weights on worker 0-0, policy_version 1250629 (0.00089) [2022-07-11 15:28:59,496][25689] Fps is (10 sec: 5489.7, 60 sec: 5526.0, 300 sec: 5581.0). Total num frames: 1280648192. Throughput: 0: 4983.6. Samples: 1280644986. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:28:59,497][25689] Avg episode reward: [(0, '1.798')] [2022-07-11 15:29:00,399][26022] Updated weights on worker 0-0, policy_version 1250639 (0.00091) [2022-07-11 15:29:02,541][26022] Updated weights on worker 0-0, policy_version 1250649 (0.00091) [2022-07-11 15:29:04,411][26022] Updated weights on worker 0-0, policy_version 1250659 (0.00088) [2022-07-11 15:29:04,499][25689] Fps is (10 sec: 5516.2, 60 sec: 5544.2, 300 sec: 5581.3). Total num frames: 1280674816. Throughput: 0: 5742.3. Samples: 1280676876. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:04,500][25689] Avg episode reward: [(0, '0.598')] [2022-07-11 15:29:06,300][26022] Updated weights on worker 0-0, policy_version 1250669 (0.00089) [2022-07-11 15:29:08,113][26022] Updated weights on worker 0-0, policy_version 1250679 (0.00090) [2022-07-11 15:29:09,527][25689] Fps is (10 sec: 5410.4, 60 sec: 5525.9, 300 sec: 5575.2). Total num frames: 1280702464. Throughput: 0: 5753.3. Samples: 1280710580. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:09,528][25689] Avg episode reward: [(0, '0.309')] [2022-07-11 15:29:09,828][26022] Updated weights on worker 0-0, policy_version 1250689 (0.00092) [2022-07-11 15:29:11,769][26022] Updated weights on worker 0-0, policy_version 1250699 (0.00086) [2022-07-11 15:29:13,620][26022] Updated weights on worker 0-0, policy_version 1250709 (0.00085) [2022-07-11 15:29:14,658][25689] Fps is (10 sec: 5544.1, 60 sec: 5540.9, 300 sec: 5579.8). Total num frames: 1280731136. Throughput: 0: 4927.6. Samples: 1280727576. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:14,658][25689] Avg episode reward: [(0, '-0.088')] [2022-07-11 15:29:15,210][26022] Updated weights on worker 0-0, policy_version 1250719 (0.00052) [2022-07-11 15:29:17,336][26022] Updated weights on worker 0-0, policy_version 1250729 (0.00093) [2022-07-11 15:29:18,989][26022] Updated weights on worker 0-0, policy_version 1250739 (0.00093) [2022-07-11 15:29:19,693][25689] Fps is (10 sec: 5640.9, 60 sec: 5556.8, 300 sec: 5580.1). Total num frames: 1280759808. Throughput: 0: 5760.5. Samples: 1280761330. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:19,694][25689] Avg episode reward: [(0, '-0.313')] [2022-07-11 15:29:20,907][26022] Updated weights on worker 0-0, policy_version 1250749 (0.00078) [2022-07-11 15:29:22,396][26022] Updated weights on worker 0-0, policy_version 1250759 (0.00074) [2022-07-11 15:29:24,527][26022] Updated weights on worker 0-0, policy_version 1250769 (0.00079) [2022-07-11 15:29:24,718][25689] Fps is (10 sec: 5598.1, 60 sec: 5561.3, 300 sec: 5580.1). Total num frames: 1280787456. Throughput: 0: 5855.1. Samples: 1280795260. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:24,719][25689] Avg episode reward: [(0, '-0.848')] [2022-07-11 15:29:26,384][26022] Updated weights on worker 0-0, policy_version 1250779 (0.00085) [2022-07-11 15:29:28,066][26022] Updated weights on worker 0-0, policy_version 1250789 (0.00093) [2022-07-11 15:29:29,736][25689] Fps is (10 sec: 5710.2, 60 sec: 5582.3, 300 sec: 5580.6). Total num frames: 1280817152. Throughput: 0: 5028.9. Samples: 1280812206. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:29,736][25689] Avg episode reward: [(0, '-0.689')] [2022-07-11 15:29:29,812][26022] Updated weights on worker 0-0, policy_version 1250799 (0.00087) [2022-07-11 15:29:31,806][26022] Updated weights on worker 0-0, policy_version 1250809 (0.00088) [2022-07-11 15:29:33,538][26022] Updated weights on worker 0-0, policy_version 1250819 (0.00084) [2022-07-11 15:29:34,853][25689] Fps is (10 sec: 5758.9, 60 sec: 5567.0, 300 sec: 5579.2). Total num frames: 1280845824. Throughput: 0: 5870.6. Samples: 1280846138. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:34,855][25689] Avg episode reward: [(0, '0.354')] [2022-07-11 15:29:35,361][26022] Updated weights on worker 0-0, policy_version 1250829 (0.00088) [2022-07-11 15:29:37,035][26022] Updated weights on worker 0-0, policy_version 1250839 (0.00084) [2022-07-11 15:29:39,119][26022] Updated weights on worker 0-0, policy_version 1250849 (0.00086) [2022-07-11 15:29:39,871][25689] Fps is (10 sec: 5657.6, 60 sec: 5566.0, 300 sec: 5583.6). Total num frames: 1280874496. Throughput: 0: 5888.2. Samples: 1280880142. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:39,872][25689] Avg episode reward: [(0, '0.456')] [2022-07-11 15:29:40,906][26022] Updated weights on worker 0-0, policy_version 1250859 (0.00078) [2022-07-11 15:29:42,551][26022] Updated weights on worker 0-0, policy_version 1250869 (0.00098) [2022-07-11 15:29:44,444][26022] Updated weights on worker 0-0, policy_version 1250879 (0.00087) [2022-07-11 15:29:44,878][25689] Fps is (10 sec: 5618.1, 60 sec: 5585.9, 300 sec: 5581.1). Total num frames: 1280902144. Throughput: 0: 5055.4. Samples: 1280897176. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:44,879][25689] Avg episode reward: [(0, '0.911')] [2022-07-11 15:29:46,052][26022] Updated weights on worker 0-0, policy_version 1250889 (0.00079) [2022-07-11 15:29:48,142][26022] Updated weights on worker 0-0, policy_version 1250899 (0.00098) [2022-07-11 15:29:49,897][25689] Fps is (10 sec: 5617.6, 60 sec: 5587.9, 300 sec: 5579.5). Total num frames: 1280930816. Throughput: 0: 5894.9. Samples: 1280931054. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:49,897][25689] Avg episode reward: [(0, '-0.400')] [2022-07-11 15:29:49,899][26022] Updated weights on worker 0-0, policy_version 1250909 (0.00094) [2022-07-11 15:29:51,603][26022] Updated weights on worker 0-0, policy_version 1250919 (0.00082) [2022-07-11 15:29:53,692][26022] Updated weights on worker 0-0, policy_version 1250929 (0.00087) [2022-07-11 15:29:54,999][25689] Fps is (10 sec: 5766.9, 60 sec: 5634.0, 300 sec: 5588.8). Total num frames: 1280960512. Throughput: 0: 5893.1. Samples: 1280964860. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:29:55,000][25689] Avg episode reward: [(0, '0.307')] [2022-07-11 15:29:55,366][26022] Updated weights on worker 0-0, policy_version 1250939 (0.00082) [2022-07-11 15:29:57,293][26022] Updated weights on worker 0-0, policy_version 1250949 (0.00083) [2022-07-11 15:29:59,005][26022] Updated weights on worker 0-0, policy_version 1250959 (0.00083) [2022-07-11 15:30:00,067][25689] Fps is (10 sec: 5638.7, 60 sec: 5612.8, 300 sec: 5588.2). Total num frames: 1280988160. Throughput: 0: 5034.0. Samples: 1280981806. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:00,067][25689] Avg episode reward: [(0, '0.261')] [2022-07-11 15:30:00,668][26022] Updated weights on worker 0-0, policy_version 1250969 (0.00096) [2022-07-11 15:30:02,975][26022] Updated weights on worker 0-0, policy_version 1250979 (0.00089) [2022-07-11 15:30:04,648][26022] Updated weights on worker 0-0, policy_version 1250989 (0.00101) [2022-07-11 15:30:05,099][25689] Fps is (10 sec: 5272.4, 60 sec: 5593.2, 300 sec: 5579.0). Total num frames: 1281013760. Throughput: 0: 5750.2. Samples: 1281013448. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:05,099][25689] Avg episode reward: [(0, '0.134')] [2022-07-11 15:30:06,308][26022] Updated weights on worker 0-0, policy_version 1250999 (0.00089) [2022-07-11 15:30:08,382][26022] Updated weights on worker 0-0, policy_version 1251009 (0.00091) [2022-07-11 15:30:10,108][25689] Fps is (10 sec: 5404.6, 60 sec: 5611.8, 300 sec: 5586.7). Total num frames: 1281042432. Throughput: 0: 5758.1. Samples: 1281047434. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:10,109][25689] Avg episode reward: [(0, '0.017')] [2022-07-11 15:30:10,309][26022] Updated weights on worker 0-0, policy_version 1251019 (0.00088) [2022-07-11 15:30:11,906][26022] Updated weights on worker 0-0, policy_version 1251029 (0.00093) [2022-07-11 15:30:13,862][26022] Updated weights on worker 0-0, policy_version 1251039 (0.00093) [2022-07-11 15:30:15,146][25689] Fps is (10 sec: 5707.1, 60 sec: 5620.3, 300 sec: 5579.5). Total num frames: 1281071104. Throughput: 0: 4932.5. Samples: 1281064236. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:15,147][25689] Avg episode reward: [(0, '-0.143')] [2022-07-11 15:30:15,746][26022] Updated weights on worker 0-0, policy_version 1251049 (0.00082) [2022-07-11 15:30:17,550][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:30:17,567][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001251059_1281084416.pth [2022-07-11 15:30:17,568][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001249095_1279073280.pth [2022-07-11 15:30:17,572][26022] Updated weights on worker 0-0, policy_version 1251059 (0.00085) [2022-07-11 15:30:19,403][26022] Updated weights on worker 0-0, policy_version 1251069 (0.00084) [2022-07-11 15:30:20,164][25689] Fps is (10 sec: 5600.7, 60 sec: 5605.1, 300 sec: 5579.2). Total num frames: 1281098752. Throughput: 0: 5781.0. Samples: 1281097990. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:20,165][25689] Avg episode reward: [(0, '0.388')] [2022-07-11 15:30:21,199][26022] Updated weights on worker 0-0, policy_version 1251079 (0.00084) [2022-07-11 15:30:22,986][26022] Updated weights on worker 0-0, policy_version 1251089 (0.00091) [2022-07-11 15:30:24,879][26022] Updated weights on worker 0-0, policy_version 1251099 (0.00084) [2022-07-11 15:30:25,178][25689] Fps is (10 sec: 5512.3, 60 sec: 5606.1, 300 sec: 5579.1). Total num frames: 1281126400. Throughput: 0: 5892.6. Samples: 1281131768. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:25,178][25689] Avg episode reward: [(0, '0.691')] [2022-07-11 15:30:26,724][26022] Updated weights on worker 0-0, policy_version 1251109 (0.00086) [2022-07-11 15:30:28,626][26022] Updated weights on worker 0-0, policy_version 1251119 (0.00085) [2022-07-11 15:30:30,210][25689] Fps is (10 sec: 5504.7, 60 sec: 5570.9, 300 sec: 5580.9). Total num frames: 1281154048. Throughput: 0: 5017.9. Samples: 1281148300. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:30,210][25689] Avg episode reward: [(0, '0.432')] [2022-07-11 15:30:30,445][26022] Updated weights on worker 0-0, policy_version 1251129 (0.00089) [2022-07-11 15:30:32,286][26022] Updated weights on worker 0-0, policy_version 1251139 (0.00094) [2022-07-11 15:30:33,887][26022] Updated weights on worker 0-0, policy_version 1251149 (0.00082) [2022-07-11 15:30:35,331][25689] Fps is (10 sec: 5648.0, 60 sec: 5587.5, 300 sec: 5575.3). Total num frames: 1281183744. Throughput: 0: 5819.9. Samples: 1281181708. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:35,331][25689] Avg episode reward: [(0, '-0.095')] [2022-07-11 15:30:35,994][26022] Updated weights on worker 0-0, policy_version 1251159 (0.00086) [2022-07-11 15:30:37,556][26022] Updated weights on worker 0-0, policy_version 1251169 (0.00086) [2022-07-11 15:30:39,673][26022] Updated weights on worker 0-0, policy_version 1251179 (0.00083) [2022-07-11 15:30:40,371][25689] Fps is (10 sec: 5744.3, 60 sec: 5585.5, 300 sec: 5581.5). Total num frames: 1281212416. Throughput: 0: 5819.1. Samples: 1281215572. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:40,371][25689] Avg episode reward: [(0, '-0.146')] [2022-07-11 15:30:41,087][26022] Updated weights on worker 0-0, policy_version 1251189 (0.00098) [2022-07-11 15:30:43,317][26022] Updated weights on worker 0-0, policy_version 1251199 (0.00095) [2022-07-11 15:30:44,814][26022] Updated weights on worker 0-0, policy_version 1251209 (0.00083) [2022-07-11 15:30:45,417][25689] Fps is (10 sec: 5584.1, 60 sec: 5581.9, 300 sec: 5584.2). Total num frames: 1281240064. Throughput: 0: 5814.3. Samples: 1281249442. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:45,417][25689] Avg episode reward: [(0, '1.056')] [2022-07-11 15:30:46,836][26022] Updated weights on worker 0-0, policy_version 1251219 (0.00092) [2022-07-11 15:30:48,623][26022] Updated weights on worker 0-0, policy_version 1251229 (0.00088) [2022-07-11 15:30:50,432][25689] Fps is (10 sec: 5496.0, 60 sec: 5565.3, 300 sec: 5577.8). Total num frames: 1281267712. Throughput: 0: 5840.2. Samples: 1281266402. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:50,434][25689] Avg episode reward: [(0, '0.024')] [2022-07-11 15:30:50,438][26022] Updated weights on worker 0-0, policy_version 1251239 (0.00080) [2022-07-11 15:30:52,190][26022] Updated weights on worker 0-0, policy_version 1251249 (0.00083) [2022-07-11 15:30:54,098][26022] Updated weights on worker 0-0, policy_version 1251259 (0.00090) [2022-07-11 15:30:55,528][25689] Fps is (10 sec: 5671.4, 60 sec: 5565.9, 300 sec: 5580.0). Total num frames: 1281297408. Throughput: 0: 5853.5. Samples: 1281299930. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:30:55,530][25689] Avg episode reward: [(0, '-0.004')] [2022-07-11 15:30:55,788][26022] Updated weights on worker 0-0, policy_version 1251269 (0.00089) [2022-07-11 15:30:57,839][26022] Updated weights on worker 0-0, policy_version 1251279 (0.00122) [2022-07-11 15:30:59,492][26022] Updated weights on worker 0-0, policy_version 1251289 (0.00104) [2022-07-11 15:31:00,572][25689] Fps is (10 sec: 5554.3, 60 sec: 5551.2, 300 sec: 5583.2). Total num frames: 1281324032. Throughput: 0: 5845.7. Samples: 1281333660. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:00,572][25689] Avg episode reward: [(0, '-0.331')] [2022-07-11 15:31:01,449][26022] Updated weights on worker 0-0, policy_version 1251299 (0.00085) [2022-07-11 15:31:03,658][26022] Updated weights on worker 0-0, policy_version 1251309 (0.00086) [2022-07-11 15:31:05,315][26022] Updated weights on worker 0-0, policy_version 1251319 (0.00085) [2022-07-11 15:31:05,626][25689] Fps is (10 sec: 5374.5, 60 sec: 5582.9, 300 sec: 5579.1). Total num frames: 1281351680. Throughput: 0: 4900.3. Samples: 1281348474. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:05,626][25689] Avg episode reward: [(0, '0.168')] [2022-07-11 15:31:07,365][26022] Updated weights on worker 0-0, policy_version 1251329 (0.00083) [2022-07-11 15:31:09,092][26022] Updated weights on worker 0-0, policy_version 1251339 (0.00101) [2022-07-11 15:31:10,639][25689] Fps is (10 sec: 5492.3, 60 sec: 5565.7, 300 sec: 5580.8). Total num frames: 1281379328. Throughput: 0: 5731.3. Samples: 1281382218. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:10,640][25689] Avg episode reward: [(0, '0.347')] [2022-07-11 15:31:11,084][26022] Updated weights on worker 0-0, policy_version 1251349 (0.00120) [2022-07-11 15:31:12,624][26022] Updated weights on worker 0-0, policy_version 1251359 (0.00088) [2022-07-11 15:31:14,596][26022] Updated weights on worker 0-0, policy_version 1251369 (0.00085) [2022-07-11 15:31:15,697][25689] Fps is (10 sec: 5592.4, 60 sec: 5563.9, 300 sec: 5586.7). Total num frames: 1281408000. Throughput: 0: 5746.8. Samples: 1281415836. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:15,697][25689] Avg episode reward: [(0, '-0.402')] [2022-07-11 15:31:16,436][26022] Updated weights on worker 0-0, policy_version 1251379 (0.00106) [2022-07-11 15:31:18,210][26022] Updated weights on worker 0-0, policy_version 1251389 (0.00091) [2022-07-11 15:31:19,967][26022] Updated weights on worker 0-0, policy_version 1251399 (0.00089) [2022-07-11 15:31:20,703][25689] Fps is (10 sec: 5698.3, 60 sec: 5581.9, 300 sec: 5583.7). Total num frames: 1281436672. Throughput: 0: 4924.1. Samples: 1281432788. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:20,703][25689] Avg episode reward: [(0, '-0.160')] [2022-07-11 15:31:21,785][26022] Updated weights on worker 0-0, policy_version 1251409 (0.00094) [2022-07-11 15:31:23,999][26022] Updated weights on worker 0-0, policy_version 1251419 (0.00092) [2022-07-11 15:31:25,474][26022] Updated weights on worker 0-0, policy_version 1251429 (0.00275) [2022-07-11 15:31:25,771][25689] Fps is (10 sec: 5692.2, 60 sec: 5593.8, 300 sec: 5580.5). Total num frames: 1281465344. Throughput: 0: 5865.4. Samples: 1281466632. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:25,771][25689] Avg episode reward: [(0, '0.321')] [2022-07-11 15:31:27,542][26022] Updated weights on worker 0-0, policy_version 1251440 (0.00087) [2022-07-11 15:31:29,207][26022] Updated weights on worker 0-0, policy_version 1251450 (0.00084) [2022-07-11 15:31:30,783][25689] Fps is (10 sec: 5485.7, 60 sec: 5578.7, 300 sec: 5582.6). Total num frames: 1281491968. Throughput: 0: 5876.1. Samples: 1281500582. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:30,783][25689] Avg episode reward: [(0, '-0.181')] [2022-07-11 15:31:31,173][26022] Updated weights on worker 0-0, policy_version 1251460 (0.00082) [2022-07-11 15:31:33,004][26022] Updated weights on worker 0-0, policy_version 1251470 (0.00087) [2022-07-11 15:31:34,741][26022] Updated weights on worker 0-0, policy_version 1251480 (0.00087) [2022-07-11 15:31:35,886][25689] Fps is (10 sec: 5466.3, 60 sec: 5563.4, 300 sec: 5573.9). Total num frames: 1281520640. Throughput: 0: 5031.1. Samples: 1281517414. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:35,887][25689] Avg episode reward: [(0, '0.142')] [2022-07-11 15:31:36,540][26022] Updated weights on worker 0-0, policy_version 1251490 (0.00082) [2022-07-11 15:31:38,393][26022] Updated weights on worker 0-0, policy_version 1251500 (0.00094) [2022-07-11 15:31:40,326][26022] Updated weights on worker 0-0, policy_version 1251510 (0.00084) [2022-07-11 15:31:40,929][25689] Fps is (10 sec: 5752.5, 60 sec: 5580.0, 300 sec: 5576.7). Total num frames: 1281550336. Throughput: 0: 5859.2. Samples: 1281551300. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:40,930][25689] Avg episode reward: [(0, '-1.507')] [2022-07-11 15:31:41,997][26022] Updated weights on worker 0-0, policy_version 1251520 (0.00085) [2022-07-11 15:31:43,883][26022] Updated weights on worker 0-0, policy_version 1251530 (0.00621) [2022-07-11 15:31:45,804][26022] Updated weights on worker 0-0, policy_version 1251540 (0.00091) [2022-07-11 15:31:46,031][25689] Fps is (10 sec: 5652.8, 60 sec: 5574.9, 300 sec: 5575.1). Total num frames: 1281577984. Throughput: 0: 5844.3. Samples: 1281585038. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:46,031][25689] Avg episode reward: [(0, '-0.610')] [2022-07-11 15:31:47,529][26022] Updated weights on worker 0-0, policy_version 1251550 (0.00084) [2022-07-11 15:31:49,376][26022] Updated weights on worker 0-0, policy_version 1251560 (0.00091) [2022-07-11 15:31:51,034][25689] Fps is (10 sec: 5472.4, 60 sec: 5576.0, 300 sec: 5569.0). Total num frames: 1281605632. Throughput: 0: 5006.6. Samples: 1281601970. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:51,034][25689] Avg episode reward: [(0, '-0.722')] [2022-07-11 15:31:51,200][26022] Updated weights on worker 0-0, policy_version 1251570 (0.00082) [2022-07-11 15:31:52,995][26022] Updated weights on worker 0-0, policy_version 1251580 (0.00085) [2022-07-11 15:31:55,010][26022] Updated weights on worker 0-0, policy_version 1251590 (0.00084) [2022-07-11 15:31:56,124][25689] Fps is (10 sec: 5681.3, 60 sec: 5576.6, 300 sec: 5577.8). Total num frames: 1281635328. Throughput: 0: 5837.5. Samples: 1281635552. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:31:56,125][25689] Avg episode reward: [(0, '-0.657')] [2022-07-11 15:31:56,588][26022] Updated weights on worker 0-0, policy_version 1251600 (0.00085) [2022-07-11 15:31:58,560][26022] Updated weights on worker 0-0, policy_version 1251610 (0.00083) [2022-07-11 15:32:00,143][26022] Updated weights on worker 0-0, policy_version 1251620 (0.00085) [2022-07-11 15:32:01,164][25689] Fps is (10 sec: 5559.7, 60 sec: 5576.9, 300 sec: 5577.4). Total num frames: 1281661952. Throughput: 0: 5842.2. Samples: 1281669514. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:01,165][25689] Avg episode reward: [(0, '-0.386')] [2022-07-11 15:32:02,424][26022] Updated weights on worker 0-0, policy_version 1251630 (0.00086) [2022-07-11 15:32:04,387][26022] Updated weights on worker 0-0, policy_version 1251640 (0.00085) [2022-07-11 15:32:05,945][26022] Updated weights on worker 0-0, policy_version 1251650 (0.00089) [2022-07-11 15:32:06,213][25689] Fps is (10 sec: 5481.2, 60 sec: 5594.4, 300 sec: 5580.0). Total num frames: 1281690624. Throughput: 0: 4924.5. Samples: 1281684424. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:06,213][25689] Avg episode reward: [(0, '-0.365')] [2022-07-11 15:32:07,896][26022] Updated weights on worker 0-0, policy_version 1251660 (0.00086) [2022-07-11 15:32:09,580][26022] Updated weights on worker 0-0, policy_version 1251670 (0.00091) [2022-07-11 15:32:11,234][25689] Fps is (10 sec: 5592.8, 60 sec: 5593.6, 300 sec: 5578.2). Total num frames: 1281718272. Throughput: 0: 5751.6. Samples: 1281718154. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:11,236][25689] Avg episode reward: [(0, '1.096')] [2022-07-11 15:32:11,614][26022] Updated weights on worker 0-0, policy_version 1251680 (0.00084) [2022-07-11 15:32:13,504][26022] Updated weights on worker 0-0, policy_version 1251690 (0.00090) [2022-07-11 15:32:15,146][26022] Updated weights on worker 0-0, policy_version 1251700 (0.00088) [2022-07-11 15:32:16,286][25689] Fps is (10 sec: 5489.4, 60 sec: 5577.2, 300 sec: 5570.8). Total num frames: 1281745920. Throughput: 0: 5772.0. Samples: 1281751924. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:16,286][25689] Avg episode reward: [(0, '1.518')] [2022-07-11 15:32:16,947][26022] Updated weights on worker 0-0, policy_version 1251710 (0.00082) [2022-07-11 15:32:17,708][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:32:17,725][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001251713_1281754112.pth [2022-07-11 15:32:17,726][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001249749_1279742976.pth [2022-07-11 15:32:18,817][26022] Updated weights on worker 0-0, policy_version 1251720 (0.00085) [2022-07-11 15:32:20,707][26022] Updated weights on worker 0-0, policy_version 1251730 (0.00088) [2022-07-11 15:32:21,289][25689] Fps is (10 sec: 5601.0, 60 sec: 5577.5, 300 sec: 5578.1). Total num frames: 1281774592. Throughput: 0: 4932.8. Samples: 1281768788. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:21,290][25689] Avg episode reward: [(0, '1.748')] [2022-07-11 15:32:22,685][26022] Updated weights on worker 0-0, policy_version 1251740 (0.00092) [2022-07-11 15:32:24,355][26022] Updated weights on worker 0-0, policy_version 1251750 (0.00100) [2022-07-11 15:32:26,244][26022] Updated weights on worker 0-0, policy_version 1251760 (0.00097) [2022-07-11 15:32:26,298][25689] Fps is (10 sec: 5625.0, 60 sec: 5566.0, 300 sec: 5574.9). Total num frames: 1281802240. Throughput: 0: 5869.6. Samples: 1281802318. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:26,299][25689] Avg episode reward: [(0, '1.821')] [2022-07-11 15:32:27,846][26022] Updated weights on worker 0-0, policy_version 1251770 (0.00082) [2022-07-11 15:32:29,822][26022] Updated weights on worker 0-0, policy_version 1251780 (0.00088) [2022-07-11 15:32:31,329][25689] Fps is (10 sec: 5609.6, 60 sec: 5598.1, 300 sec: 5576.0). Total num frames: 1281830912. Throughput: 0: 5864.5. Samples: 1281836002. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:31,330][25689] Avg episode reward: [(0, '2.261')] [2022-07-11 15:32:31,692][26022] Updated weights on worker 0-0, policy_version 1251790 (0.00093) [2022-07-11 15:32:33,559][26022] Updated weights on worker 0-0, policy_version 1251800 (0.00080) [2022-07-11 15:32:35,084][26022] Updated weights on worker 0-0, policy_version 1251810 (0.00096) [2022-07-11 15:32:36,371][25689] Fps is (10 sec: 5591.5, 60 sec: 5586.9, 300 sec: 5577.1). Total num frames: 1281858560. Throughput: 0: 5034.4. Samples: 1281853038. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:36,371][25689] Avg episode reward: [(0, '2.228')] [2022-07-11 15:32:37,240][26022] Updated weights on worker 0-0, policy_version 1251820 (0.00086) [2022-07-11 15:32:38,807][26022] Updated weights on worker 0-0, policy_version 1251830 (0.00088) [2022-07-11 15:32:40,695][26022] Updated weights on worker 0-0, policy_version 1251840 (0.00090) [2022-07-11 15:32:41,381][25689] Fps is (10 sec: 5501.1, 60 sec: 5556.0, 300 sec: 5574.0). Total num frames: 1281886208. Throughput: 0: 5853.1. Samples: 1281886386. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2022-07-11 15:32:41,381][25689] Avg episode reward: [(0, '2.238')] [2022-07-11 15:32:42,644][26022] Updated weights on worker 0-0, policy_version 1251850 (0.00094) [2022-07-11 15:32:44,443][26022] Updated weights on worker 0-0, policy_version 1251860 (0.00080) [2022-07-11 15:32:46,367][26022] Updated weights on worker 0-0, policy_version 1251870 (0.00094) [2022-07-11 15:32:46,392][25689] Fps is (10 sec: 5620.0, 60 sec: 5581.3, 300 sec: 5574.0). Total num frames: 1281914880. Throughput: 0: 5853.1. Samples: 1281919928. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:32:46,392][25689] Avg episode reward: [(0, '1.440')] [2022-07-11 15:32:48,001][26022] Updated weights on worker 0-0, policy_version 1251880 (0.00095) [2022-07-11 15:32:49,963][26022] Updated weights on worker 0-0, policy_version 1251890 (0.00076) [2022-07-11 15:32:51,403][25689] Fps is (10 sec: 5722.0, 60 sec: 5597.6, 300 sec: 5576.0). Total num frames: 1281943552. Throughput: 0: 5017.6. Samples: 1281936722. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:32:51,403][25689] Avg episode reward: [(0, '0.601')] [2022-07-11 15:32:51,983][26022] Updated weights on worker 0-0, policy_version 1251900 (0.00081) [2022-07-11 15:32:53,767][26022] Updated weights on worker 0-0, policy_version 1251910 (0.00055) [2022-07-11 15:32:55,673][26022] Updated weights on worker 0-0, policy_version 1251920 (0.00092) [2022-07-11 15:32:56,472][25689] Fps is (10 sec: 5485.6, 60 sec: 5548.6, 300 sec: 5567.9). Total num frames: 1281970176. Throughput: 0: 5824.2. Samples: 1281970114. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:32:56,473][25689] Avg episode reward: [(0, '-0.154')] [2022-07-11 15:32:57,356][26022] Updated weights on worker 0-0, policy_version 1251930 (0.00081) [2022-07-11 15:32:59,283][26022] Updated weights on worker 0-0, policy_version 1251940 (0.00087) [2022-07-11 15:33:00,968][26022] Updated weights on worker 0-0, policy_version 1251950 (0.00085) [2022-07-11 15:33:01,479][25689] Fps is (10 sec: 5487.9, 60 sec: 5585.6, 300 sec: 5578.4). Total num frames: 1281998848. Throughput: 0: 5850.3. Samples: 1282003964. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:01,479][25689] Avg episode reward: [(0, '-0.135')] [2022-07-11 15:33:03,189][26022] Updated weights on worker 0-0, policy_version 1251960 (0.00082) [2022-07-11 15:33:05,040][26022] Updated weights on worker 0-0, policy_version 1251970 (0.00083) [2022-07-11 15:33:06,503][25689] Fps is (10 sec: 5410.7, 60 sec: 5537.0, 300 sec: 5567.9). Total num frames: 1282024448. Throughput: 0: 4909.1. Samples: 1282018654. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:06,503][25689] Avg episode reward: [(0, '-1.586')] [2022-07-11 15:33:06,882][26022] Updated weights on worker 0-0, policy_version 1251980 (0.00086) [2022-07-11 15:33:08,548][26022] Updated weights on worker 0-0, policy_version 1251990 (0.00080) [2022-07-11 15:33:10,346][26022] Updated weights on worker 0-0, policy_version 1252000 (0.00083) [2022-07-11 15:33:11,505][25689] Fps is (10 sec: 5412.8, 60 sec: 5555.7, 300 sec: 5573.3). Total num frames: 1282053120. Throughput: 0: 5760.7. Samples: 1282052528. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:11,507][25689] Avg episode reward: [(0, '-1.887')] [2022-07-11 15:33:12,414][26022] Updated weights on worker 0-0, policy_version 1252010 (0.00088) [2022-07-11 15:33:14,190][26022] Updated weights on worker 0-0, policy_version 1252020 (0.00087) [2022-07-11 15:33:16,076][26022] Updated weights on worker 0-0, policy_version 1252030 (0.00083) [2022-07-11 15:33:16,551][25689] Fps is (10 sec: 5605.1, 60 sec: 5556.3, 300 sec: 5572.9). Total num frames: 1282080768. Throughput: 0: 5788.4. Samples: 1282086336. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:16,552][25689] Avg episode reward: [(0, '-1.089')] [2022-07-11 15:33:17,764][26022] Updated weights on worker 0-0, policy_version 1252040 (0.00090) [2022-07-11 15:33:19,554][26022] Updated weights on worker 0-0, policy_version 1252050 (0.00521) [2022-07-11 15:33:21,461][26022] Updated weights on worker 0-0, policy_version 1252060 (0.00090) [2022-07-11 15:33:21,556][25689] Fps is (10 sec: 5603.5, 60 sec: 5556.1, 300 sec: 5577.6). Total num frames: 1282109440. Throughput: 0: 4942.4. Samples: 1282103194. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:21,558][25689] Avg episode reward: [(0, '-0.726')] [2022-07-11 15:33:23,347][26022] Updated weights on worker 0-0, policy_version 1252070 (0.00085) [2022-07-11 15:33:25,247][26022] Updated weights on worker 0-0, policy_version 1252080 (0.00092) [2022-07-11 15:33:26,563][25689] Fps is (10 sec: 5625.0, 60 sec: 5556.3, 300 sec: 5575.2). Total num frames: 1282137088. Throughput: 0: 5879.7. Samples: 1282136604. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:26,565][25689] Avg episode reward: [(0, '-1.108')] [2022-07-11 15:33:27,159][26022] Updated weights on worker 0-0, policy_version 1252090 (0.00090) [2022-07-11 15:33:28,921][26022] Updated weights on worker 0-0, policy_version 1252100 (0.00091) [2022-07-11 15:33:30,862][26022] Updated weights on worker 0-0, policy_version 1252110 (0.00089) [2022-07-11 15:33:31,598][25689] Fps is (10 sec: 5506.5, 60 sec: 5538.9, 300 sec: 5570.2). Total num frames: 1282164736. Throughput: 0: 5827.4. Samples: 1282169616. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:31,599][25689] Avg episode reward: [(0, '0.030')] [2022-07-11 15:33:32,510][26022] Updated weights on worker 0-0, policy_version 1252120 (0.00093) [2022-07-11 15:33:34,463][26022] Updated weights on worker 0-0, policy_version 1252130 (0.00083) [2022-07-11 15:33:36,279][26022] Updated weights on worker 0-0, policy_version 1252140 (0.00087) [2022-07-11 15:33:36,649][25689] Fps is (10 sec: 5482.5, 60 sec: 5538.0, 300 sec: 5565.9). Total num frames: 1282192384. Throughput: 0: 4978.7. Samples: 1282186400. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:36,650][25689] Avg episode reward: [(0, '0.482')] [2022-07-11 15:33:38,081][26022] Updated weights on worker 0-0, policy_version 1252150 (0.00086) [2022-07-11 15:33:39,915][26022] Updated weights on worker 0-0, policy_version 1252160 (0.00092) [2022-07-11 15:33:41,659][25689] Fps is (10 sec: 5598.0, 60 sec: 5555.1, 300 sec: 5573.4). Total num frames: 1282221056. Throughput: 0: 5795.3. Samples: 1282219696. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:41,659][25689] Avg episode reward: [(0, '0.625')] [2022-07-11 15:33:41,673][26022] Updated weights on worker 0-0, policy_version 1252170 (0.00086) [2022-07-11 15:33:43,675][26022] Updated weights on worker 0-0, policy_version 1252180 (0.00084) [2022-07-11 15:33:45,341][26022] Updated weights on worker 0-0, policy_version 1252190 (0.00087) [2022-07-11 15:33:46,687][25689] Fps is (10 sec: 5610.9, 60 sec: 5536.5, 300 sec: 5570.2). Total num frames: 1282248704. Throughput: 0: 5811.5. Samples: 1282253552. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:46,687][25689] Avg episode reward: [(0, '0.503')] [2022-07-11 15:33:47,361][26022] Updated weights on worker 0-0, policy_version 1252200 (0.00087) [2022-07-11 15:33:48,838][26022] Updated weights on worker 0-0, policy_version 1252210 (0.00089) [2022-07-11 15:33:50,995][26022] Updated weights on worker 0-0, policy_version 1252220 (0.00088) [2022-07-11 15:33:51,707][25689] Fps is (10 sec: 5605.2, 60 sec: 5535.7, 300 sec: 5577.7). Total num frames: 1282277376. Throughput: 0: 5016.7. Samples: 1282270494. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:51,707][25689] Avg episode reward: [(0, '1.326')] [2022-07-11 15:33:52,575][26022] Updated weights on worker 0-0, policy_version 1252230 (0.00083) [2022-07-11 15:33:54,596][26022] Updated weights on worker 0-0, policy_version 1252240 (0.00095) [2022-07-11 15:33:56,430][26022] Updated weights on worker 0-0, policy_version 1252250 (0.00092) [2022-07-11 15:33:56,767][25689] Fps is (10 sec: 5587.1, 60 sec: 5553.5, 300 sec: 5573.5). Total num frames: 1282305024. Throughput: 0: 5833.3. Samples: 1282303754. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:33:56,768][25689] Avg episode reward: [(0, '2.202')] [2022-07-11 15:33:58,115][26022] Updated weights on worker 0-0, policy_version 1252260 (0.00083) [2022-07-11 15:34:00,175][26022] Updated weights on worker 0-0, policy_version 1252270 (0.00079) [2022-07-11 15:34:01,812][25689] Fps is (10 sec: 5674.5, 60 sec: 5566.9, 300 sec: 5583.0). Total num frames: 1282334720. Throughput: 0: 5838.8. Samples: 1282337368. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:01,813][25689] Avg episode reward: [(0, '2.148')] [2022-07-11 15:34:01,815][26022] Updated weights on worker 0-0, policy_version 1252280 (0.00088) [2022-07-11 15:34:04,129][26022] Updated weights on worker 0-0, policy_version 1252290 (0.00089) [2022-07-11 15:34:06,217][26022] Updated weights on worker 0-0, policy_version 1252300 (0.00086) [2022-07-11 15:34:06,852][25689] Fps is (10 sec: 5483.1, 60 sec: 5565.5, 300 sec: 5575.9). Total num frames: 1282360320. Throughput: 0: 5727.7. Samples: 1282369052. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:06,854][25689] Avg episode reward: [(0, '1.933')] [2022-07-11 15:34:07,671][26022] Updated weights on worker 0-0, policy_version 1252310 (0.00096) [2022-07-11 15:34:09,802][26022] Updated weights on worker 0-0, policy_version 1252320 (0.00086) [2022-07-11 15:34:11,303][26022] Updated weights on worker 0-0, policy_version 1252330 (0.00086) [2022-07-11 15:34:11,908][25689] Fps is (10 sec: 5274.4, 60 sec: 5543.6, 300 sec: 5573.9). Total num frames: 1282387968. Throughput: 0: 5713.6. Samples: 1282385916. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:11,910][25689] Avg episode reward: [(0, '1.975')] [2022-07-11 15:34:13,256][26022] Updated weights on worker 0-0, policy_version 1252340 (0.00086) [2022-07-11 15:34:15,153][26022] Updated weights on worker 0-0, policy_version 1252350 (0.00098) [2022-07-11 15:34:16,762][26022] Updated weights on worker 0-0, policy_version 1252360 (0.00048) [2022-07-11 15:34:16,970][25689] Fps is (10 sec: 5667.5, 60 sec: 5576.0, 300 sec: 5576.8). Total num frames: 1282417664. Throughput: 0: 5737.4. Samples: 1282419666. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:16,972][25689] Avg episode reward: [(0, '1.964')] [2022-07-11 15:34:17,788][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:34:17,799][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001252365_1282421760.pth [2022-07-11 15:34:17,800][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001250404_1280413696.pth [2022-07-11 15:34:18,790][26022] Updated weights on worker 0-0, policy_version 1252370 (0.00103) [2022-07-11 15:34:20,671][26022] Updated weights on worker 0-0, policy_version 1252380 (0.00087) [2022-07-11 15:34:22,008][25689] Fps is (10 sec: 5677.6, 60 sec: 5556.0, 300 sec: 5576.6). Total num frames: 1282445312. Throughput: 0: 5756.1. Samples: 1282453616. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:22,009][25689] Avg episode reward: [(0, '1.830')] [2022-07-11 15:34:22,306][26022] Updated weights on worker 0-0, policy_version 1252390 (0.00078) [2022-07-11 15:34:24,475][26022] Updated weights on worker 0-0, policy_version 1252400 (0.00090) [2022-07-11 15:34:25,839][26022] Updated weights on worker 0-0, policy_version 1252410 (0.00081) [2022-07-11 15:34:27,013][25689] Fps is (10 sec: 5505.9, 60 sec: 5556.2, 300 sec: 5569.9). Total num frames: 1282472960. Throughput: 0: 5027.6. Samples: 1282470414. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:27,015][25689] Avg episode reward: [(0, '0.916')] [2022-07-11 15:34:28,040][26022] Updated weights on worker 0-0, policy_version 1252420 (0.00091) [2022-07-11 15:34:29,567][26022] Updated weights on worker 0-0, policy_version 1252430 (0.00088) [2022-07-11 15:34:31,527][26022] Updated weights on worker 0-0, policy_version 1252440 (0.00088) [2022-07-11 15:34:32,016][25689] Fps is (10 sec: 5627.8, 60 sec: 5576.1, 300 sec: 5572.1). Total num frames: 1282501632. Throughput: 0: 5851.0. Samples: 1282503566. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:32,016][25689] Avg episode reward: [(0, '0.803')] [2022-07-11 15:34:33,415][26022] Updated weights on worker 0-0, policy_version 1252450 (0.00054) [2022-07-11 15:34:35,070][26022] Updated weights on worker 0-0, policy_version 1252460 (0.00087) [2022-07-11 15:34:36,998][26022] Updated weights on worker 0-0, policy_version 1252470 (0.00079) [2022-07-11 15:34:37,136][25689] Fps is (10 sec: 5563.7, 60 sec: 5569.7, 300 sec: 5566.7). Total num frames: 1282529280. Throughput: 0: 5852.2. Samples: 1282537680. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:37,146][25689] Avg episode reward: [(0, '0.990')] [2022-07-11 15:34:38,791][26022] Updated weights on worker 0-0, policy_version 1252480 (0.00089) [2022-07-11 15:34:40,528][26022] Updated weights on worker 0-0, policy_version 1252490 (0.00091) [2022-07-11 15:34:42,166][25689] Fps is (10 sec: 5447.7, 60 sec: 5551.0, 300 sec: 5566.3). Total num frames: 1282556928. Throughput: 0: 5014.2. Samples: 1282554692. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:42,167][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 15:34:42,327][26022] Updated weights on worker 0-0, policy_version 1252500 (0.00084) [2022-07-11 15:34:44,393][26022] Updated weights on worker 0-0, policy_version 1252510 (0.00091) [2022-07-11 15:34:46,008][26022] Updated weights on worker 0-0, policy_version 1252520 (0.00099) [2022-07-11 15:34:47,182][25689] Fps is (10 sec: 5708.3, 60 sec: 5585.9, 300 sec: 5569.8). Total num frames: 1282586624. Throughput: 0: 5860.2. Samples: 1282588604. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:47,182][25689] Avg episode reward: [(0, '-0.002')] [2022-07-11 15:34:47,910][26022] Updated weights on worker 0-0, policy_version 1252530 (0.00095) [2022-07-11 15:34:49,630][26022] Updated weights on worker 0-0, policy_version 1252540 (0.00087) [2022-07-11 15:34:51,423][26022] Updated weights on worker 0-0, policy_version 1252550 (0.00090) [2022-07-11 15:34:52,207][25689] Fps is (10 sec: 5813.2, 60 sec: 5585.5, 300 sec: 5567.8). Total num frames: 1282615296. Throughput: 0: 5890.8. Samples: 1282622506. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:52,207][25689] Avg episode reward: [(0, '0.547')] [2022-07-11 15:34:53,454][26022] Updated weights on worker 0-0, policy_version 1252560 (0.00094) [2022-07-11 15:34:54,981][26022] Updated weights on worker 0-0, policy_version 1252570 (0.00092) [2022-07-11 15:34:57,146][26022] Updated weights on worker 0-0, policy_version 1252580 (0.00088) [2022-07-11 15:34:57,274][25689] Fps is (10 sec: 5580.6, 60 sec: 5584.9, 300 sec: 5567.8). Total num frames: 1282642944. Throughput: 0: 5045.2. Samples: 1282639280. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:34:57,274][25689] Avg episode reward: [(0, '0.781')] [2022-07-11 15:34:58,757][26022] Updated weights on worker 0-0, policy_version 1252590 (0.00086) [2022-07-11 15:35:00,578][26022] Updated weights on worker 0-0, policy_version 1252600 (0.00092) [2022-07-11 15:35:02,297][25689] Fps is (10 sec: 5378.6, 60 sec: 5536.1, 300 sec: 5571.4). Total num frames: 1282669568. Throughput: 0: 5871.1. Samples: 1282672882. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:02,298][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 15:35:03,036][26022] Updated weights on worker 0-0, policy_version 1252610 (0.00090) [2022-07-11 15:35:04,615][26022] Updated weights on worker 0-0, policy_version 1252620 (0.00083) [2022-07-11 15:35:06,389][26022] Updated weights on worker 0-0, policy_version 1252630 (0.00092) [2022-07-11 15:35:07,385][25689] Fps is (10 sec: 5468.9, 60 sec: 5582.4, 300 sec: 5569.9). Total num frames: 1282698240. Throughput: 0: 5752.8. Samples: 1282704828. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:07,385][25689] Avg episode reward: [(0, '0.055')] [2022-07-11 15:35:08,276][26022] Updated weights on worker 0-0, policy_version 1252640 (0.00079) [2022-07-11 15:35:09,952][26022] Updated weights on worker 0-0, policy_version 1252650 (0.00089) [2022-07-11 15:35:12,156][26022] Updated weights on worker 0-0, policy_version 1252660 (0.00082) [2022-07-11 15:35:12,434][25689] Fps is (10 sec: 5656.9, 60 sec: 5600.0, 300 sec: 5569.7). Total num frames: 1282726912. Throughput: 0: 4913.5. Samples: 1282721894. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:12,434][25689] Avg episode reward: [(0, '0.589')] [2022-07-11 15:35:13,687][26022] Updated weights on worker 0-0, policy_version 1252670 (0.00090) [2022-07-11 15:35:15,565][26022] Updated weights on worker 0-0, policy_version 1252680 (0.00070) [2022-07-11 15:35:17,224][26022] Updated weights on worker 0-0, policy_version 1252690 (0.00090) [2022-07-11 15:35:17,546][25689] Fps is (10 sec: 5643.6, 60 sec: 5578.5, 300 sec: 5571.4). Total num frames: 1282755584. Throughput: 0: 5739.1. Samples: 1282755622. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:17,546][25689] Avg episode reward: [(0, '0.479')] [2022-07-11 15:35:19,244][26022] Updated weights on worker 0-0, policy_version 1252700 (0.00091) [2022-07-11 15:35:20,936][26022] Updated weights on worker 0-0, policy_version 1252710 (0.00091) [2022-07-11 15:35:22,552][25689] Fps is (10 sec: 5566.0, 60 sec: 5581.4, 300 sec: 5571.5). Total num frames: 1282783232. Throughput: 0: 5749.4. Samples: 1282789340. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:22,554][25689] Avg episode reward: [(0, '-0.114')] [2022-07-11 15:35:22,895][26022] Updated weights on worker 0-0, policy_version 1252720 (0.00085) [2022-07-11 15:35:24,583][26022] Updated weights on worker 0-0, policy_version 1252730 (0.00075) [2022-07-11 15:35:26,579][26022] Updated weights on worker 0-0, policy_version 1252740 (0.00103) [2022-07-11 15:35:27,573][25689] Fps is (10 sec: 5616.6, 60 sec: 5596.8, 300 sec: 5575.2). Total num frames: 1282811904. Throughput: 0: 5016.1. Samples: 1282806094. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:27,575][25689] Avg episode reward: [(0, '-0.007')] [2022-07-11 15:35:28,415][26022] Updated weights on worker 0-0, policy_version 1252750 (0.00086) [2022-07-11 15:35:30,103][26022] Updated weights on worker 0-0, policy_version 1252760 (0.00091) [2022-07-11 15:35:32,160][26022] Updated weights on worker 0-0, policy_version 1252770 (0.00082) [2022-07-11 15:35:32,591][25689] Fps is (10 sec: 5610.6, 60 sec: 5578.5, 300 sec: 5570.2). Total num frames: 1282839552. Throughput: 0: 5840.2. Samples: 1282839614. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:32,592][25689] Avg episode reward: [(0, '-0.214')] [2022-07-11 15:35:33,611][26022] Updated weights on worker 0-0, policy_version 1252780 (0.00556) [2022-07-11 15:35:35,550][26022] Updated weights on worker 0-0, policy_version 1252790 (0.00094) [2022-07-11 15:35:37,479][26022] Updated weights on worker 0-0, policy_version 1252800 (0.00092) [2022-07-11 15:35:37,716][25689] Fps is (10 sec: 5451.4, 60 sec: 5578.0, 300 sec: 5565.2). Total num frames: 1282867200. Throughput: 0: 5832.9. Samples: 1282873276. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:37,718][25689] Avg episode reward: [(0, '0.653')] [2022-07-11 15:35:39,070][26022] Updated weights on worker 0-0, policy_version 1252810 (0.00093) [2022-07-11 15:35:41,148][26022] Updated weights on worker 0-0, policy_version 1252820 (0.00091) [2022-07-11 15:35:42,742][25689] Fps is (10 sec: 5648.8, 60 sec: 5612.2, 300 sec: 5572.5). Total num frames: 1282896896. Throughput: 0: 4997.3. Samples: 1282890234. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:42,744][25689] Avg episode reward: [(0, '0.155')] [2022-07-11 15:35:42,815][26022] Updated weights on worker 0-0, policy_version 1252830 (0.00094) [2022-07-11 15:35:44,647][26022] Updated weights on worker 0-0, policy_version 1252840 (0.00090) [2022-07-11 15:35:46,709][26022] Updated weights on worker 0-0, policy_version 1252850 (0.00082) [2022-07-11 15:35:47,759][25689] Fps is (10 sec: 5608.3, 60 sec: 5561.4, 300 sec: 5569.0). Total num frames: 1282923520. Throughput: 0: 5834.6. Samples: 1282923870. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:47,759][25689] Avg episode reward: [(0, '-0.324')] [2022-07-11 15:35:48,422][26022] Updated weights on worker 0-0, policy_version 1252860 (0.00087) [2022-07-11 15:35:50,327][26022] Updated weights on worker 0-0, policy_version 1252870 (0.00089) [2022-07-11 15:35:52,026][26022] Updated weights on worker 0-0, policy_version 1252880 (0.00086) [2022-07-11 15:35:52,789][25689] Fps is (10 sec: 5605.6, 60 sec: 5577.8, 300 sec: 5570.2). Total num frames: 1282953216. Throughput: 0: 5816.2. Samples: 1282957092. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:52,790][25689] Avg episode reward: [(0, '0.488')] [2022-07-11 15:35:54,109][26022] Updated weights on worker 0-0, policy_version 1252890 (0.00087) [2022-07-11 15:35:55,848][26022] Updated weights on worker 0-0, policy_version 1252900 (0.00092) [2022-07-11 15:35:57,621][26022] Updated weights on worker 0-0, policy_version 1252910 (0.00086) [2022-07-11 15:35:57,872][25689] Fps is (10 sec: 5670.4, 60 sec: 5576.4, 300 sec: 5572.9). Total num frames: 1282980864. Throughput: 0: 4996.2. Samples: 1282973976. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:35:57,872][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 15:35:59,603][26022] Updated weights on worker 0-0, policy_version 1252920 (0.00094) [2022-07-11 15:36:01,355][26022] Updated weights on worker 0-0, policy_version 1252930 (0.00083) [2022-07-11 15:36:02,903][25689] Fps is (10 sec: 5163.4, 60 sec: 5541.9, 300 sec: 5563.0). Total num frames: 1283005440. Throughput: 0: 5801.3. Samples: 1283007196. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:02,904][25689] Avg episode reward: [(0, '-0.041')] [2022-07-11 15:36:03,511][26022] Updated weights on worker 0-0, policy_version 1252940 (0.00088) [2022-07-11 15:36:05,222][26022] Updated weights on worker 0-0, policy_version 1252950 (0.00090) [2022-07-11 15:36:07,098][26022] Updated weights on worker 0-0, policy_version 1252960 (0.00081) [2022-07-11 15:36:07,910][25689] Fps is (10 sec: 5304.5, 60 sec: 5549.3, 300 sec: 5566.6). Total num frames: 1283034112. Throughput: 0: 5713.4. Samples: 1283039002. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:07,911][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 15:36:08,892][26022] Updated weights on worker 0-0, policy_version 1252970 (0.00082) [2022-07-11 15:36:10,945][26022] Updated weights on worker 0-0, policy_version 1252980 (0.00091) [2022-07-11 15:36:12,621][26022] Updated weights on worker 0-0, policy_version 1252990 (0.00084) [2022-07-11 15:36:12,915][25689] Fps is (10 sec: 5727.7, 60 sec: 5553.3, 300 sec: 5567.6). Total num frames: 1283062784. Throughput: 0: 4906.5. Samples: 1283055840. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:12,917][25689] Avg episode reward: [(0, '0.460')] [2022-07-11 15:36:14,495][26022] Updated weights on worker 0-0, policy_version 1253000 (0.00085) [2022-07-11 15:36:16,355][26022] Updated weights on worker 0-0, policy_version 1253010 (0.00089) [2022-07-11 15:36:17,828][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:36:17,840][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001253018_1283090432.pth [2022-07-11 15:36:17,841][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001251059_1281084416.pth [2022-07-11 15:36:18,006][25689] Fps is (10 sec: 5679.6, 60 sec: 5555.2, 300 sec: 5566.0). Total num frames: 1283091456. Throughput: 0: 5743.7. Samples: 1283089624. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:18,007][25689] Avg episode reward: [(0, '1.035')] [2022-07-11 15:36:18,098][26022] Updated weights on worker 0-0, policy_version 1253020 (0.00078) [2022-07-11 15:36:20,032][26022] Updated weights on worker 0-0, policy_version 1253030 (0.00086) [2022-07-11 15:36:21,587][26022] Updated weights on worker 0-0, policy_version 1253040 (0.00087) [2022-07-11 15:36:23,059][25689] Fps is (10 sec: 5552.3, 60 sec: 5551.0, 300 sec: 5562.8). Total num frames: 1283119104. Throughput: 0: 5766.2. Samples: 1283123416. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:23,059][25689] Avg episode reward: [(0, '0.814')] [2022-07-11 15:36:23,593][26022] Updated weights on worker 0-0, policy_version 1253050 (0.00084) [2022-07-11 15:36:25,507][26022] Updated weights on worker 0-0, policy_version 1253060 (0.00093) [2022-07-11 15:36:27,156][26022] Updated weights on worker 0-0, policy_version 1253070 (0.00942) [2022-07-11 15:36:28,108][25689] Fps is (10 sec: 5575.2, 60 sec: 5548.4, 300 sec: 5569.0). Total num frames: 1283147776. Throughput: 0: 5853.1. Samples: 1283157226. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:28,109][25689] Avg episode reward: [(0, '1.140')] [2022-07-11 15:36:29,136][26022] Updated weights on worker 0-0, policy_version 1253080 (0.00089) [2022-07-11 15:36:30,914][26022] Updated weights on worker 0-0, policy_version 1253090 (0.00092) [2022-07-11 15:36:32,741][26022] Updated weights on worker 0-0, policy_version 1253100 (0.00084) [2022-07-11 15:36:33,167][25689] Fps is (10 sec: 5774.4, 60 sec: 5578.4, 300 sec: 5573.3). Total num frames: 1283177472. Throughput: 0: 5830.3. Samples: 1283173912. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:33,167][25689] Avg episode reward: [(0, '1.537')] [2022-07-11 15:36:34,865][26022] Updated weights on worker 0-0, policy_version 1253110 (0.00089) [2022-07-11 15:36:36,138][26022] Updated weights on worker 0-0, policy_version 1253120 (0.00091) [2022-07-11 15:36:38,276][25689] Fps is (10 sec: 5438.3, 60 sec: 5546.1, 300 sec: 5558.3). Total num frames: 1283203072. Throughput: 0: 5812.4. Samples: 1283207440. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:38,277][25689] Avg episode reward: [(0, '1.284')] [2022-07-11 15:36:38,409][26022] Updated weights on worker 0-0, policy_version 1253130 (0.00087) [2022-07-11 15:36:39,995][26022] Updated weights on worker 0-0, policy_version 1253140 (0.00092) [2022-07-11 15:36:41,910][26022] Updated weights on worker 0-0, policy_version 1253150 (0.00100) [2022-07-11 15:36:43,297][25689] Fps is (10 sec: 5559.3, 60 sec: 5563.4, 300 sec: 5570.1). Total num frames: 1283233792. Throughput: 0: 5825.9. Samples: 1283241326. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2022-07-11 15:36:43,298][25689] Avg episode reward: [(0, '1.162')] [2022-07-11 15:36:43,739][26022] Updated weights on worker 0-0, policy_version 1253160 (0.00081) [2022-07-11 15:36:45,463][26022] Updated weights on worker 0-0, policy_version 1253170 (0.00080) [2022-07-11 15:36:47,526][26022] Updated weights on worker 0-0, policy_version 1253180 (0.00084) [2022-07-11 15:36:48,328][25689] Fps is (10 sec: 5806.8, 60 sec: 5579.1, 300 sec: 5569.6). Total num frames: 1283261440. Throughput: 0: 4989.0. Samples: 1283258102. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:36:48,329][25689] Avg episode reward: [(0, '1.572')] [2022-07-11 15:36:49,177][26022] Updated weights on worker 0-0, policy_version 1253190 (0.00095) [2022-07-11 15:36:51,139][26022] Updated weights on worker 0-0, policy_version 1253200 (0.00083) [2022-07-11 15:36:52,847][26022] Updated weights on worker 0-0, policy_version 1253210 (0.00085) [2022-07-11 15:36:53,344][25689] Fps is (10 sec: 5606.1, 60 sec: 5563.5, 300 sec: 5567.6). Total num frames: 1283290112. Throughput: 0: 5851.3. Samples: 1283291974. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:36:53,344][25689] Avg episode reward: [(0, '1.626')] [2022-07-11 15:36:54,589][26022] Updated weights on worker 0-0, policy_version 1253220 (0.00087) [2022-07-11 15:36:56,526][26022] Updated weights on worker 0-0, policy_version 1253230 (0.00093) [2022-07-11 15:36:58,337][26022] Updated weights on worker 0-0, policy_version 1253240 (0.00091) [2022-07-11 15:36:58,434][25689] Fps is (10 sec: 5573.0, 60 sec: 5562.8, 300 sec: 5570.0). Total num frames: 1283317760. Throughput: 0: 5862.0. Samples: 1283325604. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:36:58,436][25689] Avg episode reward: [(0, '1.686')] [2022-07-11 15:37:00,247][26022] Updated weights on worker 0-0, policy_version 1253250 (0.00086) [2022-07-11 15:37:02,362][26022] Updated weights on worker 0-0, policy_version 1253260 (0.00079) [2022-07-11 15:37:03,479][25689] Fps is (10 sec: 5354.7, 60 sec: 5595.4, 300 sec: 5563.2). Total num frames: 1283344384. Throughput: 0: 5008.6. Samples: 1283342408. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:03,481][25689] Avg episode reward: [(0, '1.745')] [2022-07-11 15:37:04,056][26022] Updated weights on worker 0-0, policy_version 1253270 (0.00084) [2022-07-11 15:37:06,091][26022] Updated weights on worker 0-0, policy_version 1253280 (0.00084) [2022-07-11 15:37:07,878][26022] Updated weights on worker 0-0, policy_version 1253290 (0.00096) [2022-07-11 15:37:08,578][25689] Fps is (10 sec: 5350.3, 60 sec: 5570.0, 300 sec: 5561.8). Total num frames: 1283372032. Throughput: 0: 5719.8. Samples: 1283373926. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:08,578][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 15:37:09,700][26022] Updated weights on worker 0-0, policy_version 1253300 (0.00091) [2022-07-11 15:37:11,300][26022] Updated weights on worker 0-0, policy_version 1253310 (0.00093) [2022-07-11 15:37:13,377][26022] Updated weights on worker 0-0, policy_version 1253320 (0.00096) [2022-07-11 15:37:13,651][25689] Fps is (10 sec: 5537.1, 60 sec: 5563.8, 300 sec: 5564.8). Total num frames: 1283400704. Throughput: 0: 5681.6. Samples: 1283407350. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:13,651][25689] Avg episode reward: [(0, '1.396')] [2022-07-11 15:37:15,066][26022] Updated weights on worker 0-0, policy_version 1253330 (0.00091) [2022-07-11 15:37:17,045][26022] Updated weights on worker 0-0, policy_version 1253340 (0.00100) [2022-07-11 15:37:18,742][25689] Fps is (10 sec: 5641.6, 60 sec: 5563.8, 300 sec: 5563.2). Total num frames: 1283429376. Throughput: 0: 4850.8. Samples: 1283424116. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:18,743][25689] Avg episode reward: [(0, '1.410')] [2022-07-11 15:37:18,859][26022] Updated weights on worker 0-0, policy_version 1253350 (0.00096) [2022-07-11 15:37:20,770][26022] Updated weights on worker 0-0, policy_version 1253360 (0.00091) [2022-07-11 15:37:22,710][26022] Updated weights on worker 0-0, policy_version 1253370 (0.00095) [2022-07-11 15:37:23,775][25689] Fps is (10 sec: 5562.8, 60 sec: 5565.6, 300 sec: 5562.7). Total num frames: 1283457024. Throughput: 0: 5670.6. Samples: 1283457498. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:23,775][25689] Avg episode reward: [(0, '0.980')] [2022-07-11 15:37:24,409][26022] Updated weights on worker 0-0, policy_version 1253380 (0.00088) [2022-07-11 15:37:26,207][26022] Updated weights on worker 0-0, policy_version 1253390 (0.00087) [2022-07-11 15:37:28,167][26022] Updated weights on worker 0-0, policy_version 1253400 (0.00096) [2022-07-11 15:37:28,796][25689] Fps is (10 sec: 5398.2, 60 sec: 5534.5, 300 sec: 5556.0). Total num frames: 1283483648. Throughput: 0: 5795.8. Samples: 1283491106. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:28,796][25689] Avg episode reward: [(0, '1.255')] [2022-07-11 15:37:29,910][26022] Updated weights on worker 0-0, policy_version 1253410 (0.00089) [2022-07-11 15:37:32,092][26022] Updated weights on worker 0-0, policy_version 1253420 (0.00086) [2022-07-11 15:37:33,414][26022] Updated weights on worker 0-0, policy_version 1253430 (0.00086) [2022-07-11 15:37:33,823][25689] Fps is (10 sec: 5605.2, 60 sec: 5537.3, 300 sec: 5563.2). Total num frames: 1283513344. Throughput: 0: 4971.2. Samples: 1283507630. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:33,823][25689] Avg episode reward: [(0, '1.480')] [2022-07-11 15:37:35,754][26022] Updated weights on worker 0-0, policy_version 1253440 (0.00087) [2022-07-11 15:37:37,046][26022] Updated weights on worker 0-0, policy_version 1253450 (0.00092) [2022-07-11 15:37:38,905][25689] Fps is (10 sec: 5672.5, 60 sec: 5573.6, 300 sec: 5561.9). Total num frames: 1283540992. Throughput: 0: 5806.0. Samples: 1283541182. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:38,905][25689] Avg episode reward: [(0, '2.126')] [2022-07-11 15:37:39,500][26022] Updated weights on worker 0-0, policy_version 1253460 (0.00090) [2022-07-11 15:37:40,825][26022] Updated weights on worker 0-0, policy_version 1253470 (0.00092) [2022-07-11 15:37:42,946][26022] Updated weights on worker 0-0, policy_version 1253480 (0.00085) [2022-07-11 15:37:43,991][25689] Fps is (10 sec: 5639.8, 60 sec: 5550.8, 300 sec: 5563.9). Total num frames: 1283570688. Throughput: 0: 5802.4. Samples: 1283574796. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:43,991][25689] Avg episode reward: [(0, '1.108')] [2022-07-11 15:37:44,506][26022] Updated weights on worker 0-0, policy_version 1253490 (0.00085) [2022-07-11 15:37:46,512][26022] Updated weights on worker 0-0, policy_version 1253500 (0.00086) [2022-07-11 15:37:48,159][26022] Updated weights on worker 0-0, policy_version 1253510 (0.00080) [2022-07-11 15:37:49,013][25689] Fps is (10 sec: 5672.9, 60 sec: 5551.5, 300 sec: 5560.2). Total num frames: 1283598336. Throughput: 0: 4980.6. Samples: 1283591802. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:49,014][25689] Avg episode reward: [(0, '1.120')] [2022-07-11 15:37:50,080][26022] Updated weights on worker 0-0, policy_version 1253520 (0.00093) [2022-07-11 15:37:52,094][26022] Updated weights on worker 0-0, policy_version 1253530 (0.00080) [2022-07-11 15:37:53,704][26022] Updated weights on worker 0-0, policy_version 1253540 (0.00809) [2022-07-11 15:37:54,056][25689] Fps is (10 sec: 5493.6, 60 sec: 5532.2, 300 sec: 5564.2). Total num frames: 1283625984. Throughput: 0: 5831.2. Samples: 1283625614. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:54,056][25689] Avg episode reward: [(0, '1.418')] [2022-07-11 15:37:55,558][26022] Updated weights on worker 0-0, policy_version 1253550 (0.00093) [2022-07-11 15:37:57,584][26022] Updated weights on worker 0-0, policy_version 1253560 (0.00091) [2022-07-11 15:37:59,046][26022] Updated weights on worker 0-0, policy_version 1253570 (0.00088) [2022-07-11 15:37:59,193][25689] Fps is (10 sec: 5633.2, 60 sec: 5561.6, 300 sec: 5565.2). Total num frames: 1283655680. Throughput: 0: 5810.8. Samples: 1283659072. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:37:59,193][25689] Avg episode reward: [(0, '1.101')] [2022-07-11 15:38:01,316][26022] Updated weights on worker 0-0, policy_version 1253580 (0.00102) [2022-07-11 15:38:03,127][26022] Updated weights on worker 0-0, policy_version 1253590 (0.00087) [2022-07-11 15:38:04,248][25689] Fps is (10 sec: 5325.0, 60 sec: 5527.0, 300 sec: 5561.2). Total num frames: 1283680256. Throughput: 0: 5701.7. Samples: 1283690298. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:04,248][25689] Avg episode reward: [(0, '1.064')] [2022-07-11 15:38:05,124][26022] Updated weights on worker 0-0, policy_version 1253600 (0.00093) [2022-07-11 15:38:07,129][26022] Updated weights on worker 0-0, policy_version 1253610 (0.00091) [2022-07-11 15:38:08,788][26022] Updated weights on worker 0-0, policy_version 1253620 (0.00084) [2022-07-11 15:38:09,310][25689] Fps is (10 sec: 5263.0, 60 sec: 5547.2, 300 sec: 5560.0). Total num frames: 1283708928. Throughput: 0: 5670.0. Samples: 1283706886. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:09,313][25689] Avg episode reward: [(0, '0.108')] [2022-07-11 15:38:10,667][26022] Updated weights on worker 0-0, policy_version 1253630 (0.00096) [2022-07-11 15:38:12,701][26022] Updated weights on worker 0-0, policy_version 1253640 (0.00096) [2022-07-11 15:38:14,290][26022] Updated weights on worker 0-0, policy_version 1253650 (0.00094) [2022-07-11 15:38:14,391][25689] Fps is (10 sec: 5653.4, 60 sec: 5546.5, 300 sec: 5562.8). Total num frames: 1283737600. Throughput: 0: 5634.8. Samples: 1283740198. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:14,392][25689] Avg episode reward: [(0, '0.582')] [2022-07-11 15:38:16,407][26022] Updated weights on worker 0-0, policy_version 1253660 (0.00094) [2022-07-11 15:38:17,824][26022] Updated weights on worker 0-0, policy_version 1253670 (0.00087) [2022-07-11 15:38:18,095][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:38:18,104][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001253671_1283759104.pth [2022-07-11 15:38:18,105][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001251713_1281754112.pth [2022-07-11 15:38:19,500][25689] Fps is (10 sec: 5426.5, 60 sec: 5511.1, 300 sec: 5554.0). Total num frames: 1283764224. Throughput: 0: 5638.3. Samples: 1283773572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:19,502][25689] Avg episode reward: [(0, '0.512')] [2022-07-11 15:38:20,077][26022] Updated weights on worker 0-0, policy_version 1253680 (0.00087) [2022-07-11 15:38:21,711][26022] Updated weights on worker 0-0, policy_version 1253690 (0.00096) [2022-07-11 15:38:23,621][26022] Updated weights on worker 0-0, policy_version 1253700 (0.00093) [2022-07-11 15:38:24,503][25689] Fps is (10 sec: 5671.1, 60 sec: 5564.5, 300 sec: 5564.4). Total num frames: 1283794944. Throughput: 0: 4942.0. Samples: 1283790398. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:24,504][25689] Avg episode reward: [(0, '0.210')] [2022-07-11 15:38:25,555][26022] Updated weights on worker 0-0, policy_version 1253710 (0.00092) [2022-07-11 15:38:27,276][26022] Updated weights on worker 0-0, policy_version 1253720 (0.00082) [2022-07-11 15:38:29,095][26022] Updated weights on worker 0-0, policy_version 1253730 (0.00092) [2022-07-11 15:38:29,567][25689] Fps is (10 sec: 5797.9, 60 sec: 5577.3, 300 sec: 5563.8). Total num frames: 1283822592. Throughput: 0: 5780.1. Samples: 1283823974. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:29,568][25689] Avg episode reward: [(0, '0.288')] [2022-07-11 15:38:30,948][26022] Updated weights on worker 0-0, policy_version 1253740 (0.00094) [2022-07-11 15:38:32,641][26022] Updated weights on worker 0-0, policy_version 1253750 (0.00089) [2022-07-11 15:38:34,529][26022] Updated weights on worker 0-0, policy_version 1253760 (0.00091) [2022-07-11 15:38:34,592][25689] Fps is (10 sec: 5480.6, 60 sec: 5543.8, 300 sec: 5564.3). Total num frames: 1283850240. Throughput: 0: 5792.3. Samples: 1283857208. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:34,595][25689] Avg episode reward: [(0, '1.226')] [2022-07-11 15:38:36,486][26022] Updated weights on worker 0-0, policy_version 1253770 (0.00090) [2022-07-11 15:38:38,331][26022] Updated weights on worker 0-0, policy_version 1253780 (0.00098) [2022-07-11 15:38:39,721][25689] Fps is (10 sec: 5445.9, 60 sec: 5539.6, 300 sec: 5558.7). Total num frames: 1283877888. Throughput: 0: 4960.8. Samples: 1283873884. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:39,722][25689] Avg episode reward: [(0, '0.866')] [2022-07-11 15:38:40,172][26022] Updated weights on worker 0-0, policy_version 1253790 (0.00083) [2022-07-11 15:38:41,913][26022] Updated weights on worker 0-0, policy_version 1253800 (0.00094) [2022-07-11 15:38:43,896][26022] Updated weights on worker 0-0, policy_version 1253810 (0.00092) [2022-07-11 15:38:44,803][25689] Fps is (10 sec: 5615.8, 60 sec: 5539.9, 300 sec: 5564.5). Total num frames: 1283907584. Throughput: 0: 5759.7. Samples: 1283907324. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:44,804][25689] Avg episode reward: [(0, '1.155')] [2022-07-11 15:38:45,571][26022] Updated weights on worker 0-0, policy_version 1253820 (0.00092) [2022-07-11 15:38:47,455][26022] Updated weights on worker 0-0, policy_version 1253830 (0.00077) [2022-07-11 15:38:49,298][26022] Updated weights on worker 0-0, policy_version 1253840 (0.00683) [2022-07-11 15:38:49,885][25689] Fps is (10 sec: 5541.0, 60 sec: 5517.7, 300 sec: 5556.5). Total num frames: 1283934208. Throughput: 0: 5750.4. Samples: 1283940810. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:49,886][25689] Avg episode reward: [(0, '1.284')] [2022-07-11 15:38:51,122][26022] Updated weights on worker 0-0, policy_version 1253850 (0.00089) [2022-07-11 15:38:53,074][26022] Updated weights on worker 0-0, policy_version 1253860 (0.00087) [2022-07-11 15:38:54,783][26022] Updated weights on worker 0-0, policy_version 1253870 (0.00087) [2022-07-11 15:38:54,979][25689] Fps is (10 sec: 5434.2, 60 sec: 5529.9, 300 sec: 5559.3). Total num frames: 1283962880. Throughput: 0: 4922.2. Samples: 1283957556. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:38:54,979][25689] Avg episode reward: [(0, '1.309')] [2022-07-11 15:38:56,620][26022] Updated weights on worker 0-0, policy_version 1253880 (0.00090) [2022-07-11 15:38:58,426][26022] Updated weights on worker 0-0, policy_version 1253890 (0.00084) [2022-07-11 15:39:00,029][25689] Fps is (10 sec: 5653.3, 60 sec: 5520.9, 300 sec: 5555.8). Total num frames: 1283991552. Throughput: 0: 5776.3. Samples: 1283991188. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:00,029][25689] Avg episode reward: [(0, '1.520')] [2022-07-11 15:39:00,397][26022] Updated weights on worker 0-0, policy_version 1253900 (0.00083) [2022-07-11 15:39:02,098][26022] Updated weights on worker 0-0, policy_version 1253910 (0.00090) [2022-07-11 15:39:04,480][26022] Updated weights on worker 0-0, policy_version 1253920 (0.00086) [2022-07-11 15:39:05,073][25689] Fps is (10 sec: 5477.9, 60 sec: 5555.6, 300 sec: 5559.1). Total num frames: 1284018176. Throughput: 0: 5714.1. Samples: 1284023148. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:05,074][25689] Avg episode reward: [(0, '1.684')] [2022-07-11 15:39:06,098][26022] Updated weights on worker 0-0, policy_version 1253930 (0.00081) [2022-07-11 15:39:07,979][26022] Updated weights on worker 0-0, policy_version 1253940 (0.00094) [2022-07-11 15:39:09,778][26022] Updated weights on worker 0-0, policy_version 1253950 (0.00091) [2022-07-11 15:39:10,133][25689] Fps is (10 sec: 5472.3, 60 sec: 5555.8, 300 sec: 5562.5). Total num frames: 1284046848. Throughput: 0: 4908.0. Samples: 1284040188. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:10,134][25689] Avg episode reward: [(0, '1.852')] [2022-07-11 15:39:11,646][26022] Updated weights on worker 0-0, policy_version 1253960 (0.00087) [2022-07-11 15:39:13,419][26022] Updated weights on worker 0-0, policy_version 1253970 (0.00083) [2022-07-11 15:39:15,012][26022] Updated weights on worker 0-0, policy_version 1253980 (0.00083) [2022-07-11 15:39:15,159][25689] Fps is (10 sec: 5685.7, 60 sec: 5560.8, 300 sec: 5559.7). Total num frames: 1284075520. Throughput: 0: 5780.3. Samples: 1284074202. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:15,160][25689] Avg episode reward: [(0, '1.877')] [2022-07-11 15:39:17,153][26022] Updated weights on worker 0-0, policy_version 1253990 (0.00085) [2022-07-11 15:39:18,627][26022] Updated weights on worker 0-0, policy_version 1254000 (0.00086) [2022-07-11 15:39:20,251][25689] Fps is (10 sec: 5566.8, 60 sec: 5579.3, 300 sec: 5558.7). Total num frames: 1284103168. Throughput: 0: 5783.3. Samples: 1284108136. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:20,251][25689] Avg episode reward: [(0, '1.778')] [2022-07-11 15:39:20,690][26022] Updated weights on worker 0-0, policy_version 1254010 (0.00081) [2022-07-11 15:39:22,552][26022] Updated weights on worker 0-0, policy_version 1254020 (0.00087) [2022-07-11 15:39:24,146][26022] Updated weights on worker 0-0, policy_version 1254030 (0.00087) [2022-07-11 15:39:25,285][25689] Fps is (10 sec: 5461.1, 60 sec: 5525.9, 300 sec: 5558.2). Total num frames: 1284130816. Throughput: 0: 5044.1. Samples: 1284125096. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:25,285][25689] Avg episode reward: [(0, '2.179')] [2022-07-11 15:39:26,215][26022] Updated weights on worker 0-0, policy_version 1254040 (0.00082) [2022-07-11 15:39:27,739][26022] Updated weights on worker 0-0, policy_version 1254050 (0.00086) [2022-07-11 15:39:29,723][26022] Updated weights on worker 0-0, policy_version 1254060 (0.00094) [2022-07-11 15:39:30,288][25689] Fps is (10 sec: 5713.2, 60 sec: 5565.2, 300 sec: 5561.6). Total num frames: 1284160512. Throughput: 0: 5895.0. Samples: 1284158998. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:30,289][25689] Avg episode reward: [(0, '1.736')] [2022-07-11 15:39:31,574][26022] Updated weights on worker 0-0, policy_version 1254070 (0.00658) [2022-07-11 15:39:33,214][26022] Updated weights on worker 0-0, policy_version 1254080 (0.00081) [2022-07-11 15:39:35,211][26022] Updated weights on worker 0-0, policy_version 1254090 (0.00087) [2022-07-11 15:39:35,306][25689] Fps is (10 sec: 5722.2, 60 sec: 5565.8, 300 sec: 5563.5). Total num frames: 1284188160. Throughput: 0: 5872.6. Samples: 1284192516. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:35,306][25689] Avg episode reward: [(0, '1.537')] [2022-07-11 15:39:36,958][26022] Updated weights on worker 0-0, policy_version 1254100 (0.00093) [2022-07-11 15:39:38,803][26022] Updated weights on worker 0-0, policy_version 1254110 (0.00096) [2022-07-11 15:39:40,347][25689] Fps is (10 sec: 5497.2, 60 sec: 5573.9, 300 sec: 5563.3). Total num frames: 1284215808. Throughput: 0: 5042.0. Samples: 1284209460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:40,348][25689] Avg episode reward: [(0, '1.425')] [2022-07-11 15:39:40,722][26022] Updated weights on worker 0-0, policy_version 1254120 (0.00085) [2022-07-11 15:39:42,354][26022] Updated weights on worker 0-0, policy_version 1254130 (0.00094) [2022-07-11 15:39:44,441][26022] Updated weights on worker 0-0, policy_version 1254140 (0.00093) [2022-07-11 15:39:45,367][25689] Fps is (10 sec: 5597.8, 60 sec: 5562.7, 300 sec: 5559.8). Total num frames: 1284244480. Throughput: 0: 5869.3. Samples: 1284242966. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:45,371][25689] Avg episode reward: [(0, '0.816')] [2022-07-11 15:39:46,271][26022] Updated weights on worker 0-0, policy_version 1254150 (0.00085) [2022-07-11 15:39:48,064][26022] Updated weights on worker 0-0, policy_version 1254160 (0.00089) [2022-07-11 15:39:49,657][26022] Updated weights on worker 0-0, policy_version 1254170 (0.00080) [2022-07-11 15:39:50,386][25689] Fps is (10 sec: 5711.8, 60 sec: 5602.3, 300 sec: 5559.9). Total num frames: 1284273152. Throughput: 0: 5860.8. Samples: 1284276790. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:50,387][25689] Avg episode reward: [(0, '0.356')] [2022-07-11 15:39:51,650][26022] Updated weights on worker 0-0, policy_version 1254180 (0.00089) [2022-07-11 15:39:53,316][26022] Updated weights on worker 0-0, policy_version 1254190 (0.00086) [2022-07-11 15:39:55,414][25689] Fps is (10 sec: 5503.5, 60 sec: 5574.5, 300 sec: 5557.2). Total num frames: 1284299776. Throughput: 0: 5882.9. Samples: 1284310814. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:39:55,415][25689] Avg episode reward: [(0, '0.571')] [2022-07-11 15:39:55,435][26022] Updated weights on worker 0-0, policy_version 1254200 (0.00081) [2022-07-11 15:39:57,153][26022] Updated weights on worker 0-0, policy_version 1254210 (0.00082) [2022-07-11 15:39:59,013][26022] Updated weights on worker 0-0, policy_version 1254220 (0.00083) [2022-07-11 15:40:00,509][25689] Fps is (10 sec: 5664.7, 60 sec: 5604.2, 300 sec: 5569.6). Total num frames: 1284330496. Throughput: 0: 5865.1. Samples: 1284327714. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:00,510][25689] Avg episode reward: [(0, '0.740')] [2022-07-11 15:40:00,676][26022] Updated weights on worker 0-0, policy_version 1254230 (0.00095) [2022-07-11 15:40:03,210][26022] Updated weights on worker 0-0, policy_version 1254240 (0.00089) [2022-07-11 15:40:04,645][26022] Updated weights on worker 0-0, policy_version 1254250 (0.00081) [2022-07-11 15:40:05,519][25689] Fps is (10 sec: 5573.8, 60 sec: 5590.5, 300 sec: 5560.8). Total num frames: 1284356096. Throughput: 0: 5773.5. Samples: 1284359314. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:05,519][25689] Avg episode reward: [(0, '0.764')] [2022-07-11 15:40:06,762][26022] Updated weights on worker 0-0, policy_version 1254260 (0.00084) [2022-07-11 15:40:08,231][26022] Updated weights on worker 0-0, policy_version 1254270 (0.00085) [2022-07-11 15:40:10,247][26022] Updated weights on worker 0-0, policy_version 1254280 (0.00084) [2022-07-11 15:40:10,523][25689] Fps is (10 sec: 5317.2, 60 sec: 5578.7, 300 sec: 5558.2). Total num frames: 1284383744. Throughput: 0: 5799.3. Samples: 1284393572. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:10,524][25689] Avg episode reward: [(0, '0.860')] [2022-07-11 15:40:11,760][26022] Updated weights on worker 0-0, policy_version 1254290 (0.00084) [2022-07-11 15:40:13,809][26022] Updated weights on worker 0-0, policy_version 1254300 (0.00083) [2022-07-11 15:40:15,491][26022] Updated weights on worker 0-0, policy_version 1254310 (0.00090) [2022-07-11 15:40:15,573][25689] Fps is (10 sec: 5703.7, 60 sec: 5593.4, 300 sec: 5562.8). Total num frames: 1284413440. Throughput: 0: 4949.6. Samples: 1284410590. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:15,573][25689] Avg episode reward: [(0, '1.708')] [2022-07-11 15:40:17,495][26022] Updated weights on worker 0-0, policy_version 1254320 (0.00084) [2022-07-11 15:40:18,187][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:40:18,196][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001254324_1284427776.pth [2022-07-11 15:40:18,196][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001252365_1282421760.pth [2022-07-11 15:40:19,104][26022] Updated weights on worker 0-0, policy_version 1254330 (0.00092) [2022-07-11 15:40:20,611][25689] Fps is (10 sec: 5583.2, 60 sec: 5581.4, 300 sec: 5558.8). Total num frames: 1284440064. Throughput: 0: 5804.6. Samples: 1284444396. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:20,611][25689] Avg episode reward: [(0, '1.937')] [2022-07-11 15:40:21,109][26022] Updated weights on worker 0-0, policy_version 1254340 (0.00099) [2022-07-11 15:40:22,852][26022] Updated weights on worker 0-0, policy_version 1254350 (0.00094) [2022-07-11 15:40:24,727][26022] Updated weights on worker 0-0, policy_version 1254360 (0.00090) [2022-07-11 15:40:25,623][25689] Fps is (10 sec: 5705.8, 60 sec: 5634.4, 300 sec: 5565.8). Total num frames: 1284470784. Throughput: 0: 5928.8. Samples: 1284478506. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:25,623][25689] Avg episode reward: [(0, '1.886')] [2022-07-11 15:40:26,387][26022] Updated weights on worker 0-0, policy_version 1254370 (0.00084) [2022-07-11 15:40:28,423][26022] Updated weights on worker 0-0, policy_version 1254380 (0.00092) [2022-07-11 15:40:30,022][26022] Updated weights on worker 0-0, policy_version 1254390 (0.00092) [2022-07-11 15:40:30,644][25689] Fps is (10 sec: 5817.5, 60 sec: 5598.8, 300 sec: 5565.7). Total num frames: 1284498432. Throughput: 0: 5062.8. Samples: 1284495438. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:30,644][25689] Avg episode reward: [(0, '1.459')] [2022-07-11 15:40:32,063][26022] Updated weights on worker 0-0, policy_version 1254400 (0.00058) [2022-07-11 15:40:33,752][26022] Updated weights on worker 0-0, policy_version 1254410 (0.00091) [2022-07-11 15:40:35,660][25689] Fps is (10 sec: 5509.0, 60 sec: 5598.9, 300 sec: 5567.8). Total num frames: 1284526080. Throughput: 0: 5900.2. Samples: 1284529110. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:35,661][25689] Avg episode reward: [(0, '1.638')] [2022-07-11 15:40:35,663][26022] Updated weights on worker 0-0, policy_version 1254420 (0.00094) [2022-07-11 15:40:37,459][26022] Updated weights on worker 0-0, policy_version 1254430 (0.00080) [2022-07-11 15:40:39,314][26022] Updated weights on worker 0-0, policy_version 1254440 (0.00089) [2022-07-11 15:40:40,731][25689] Fps is (10 sec: 5583.5, 60 sec: 5613.1, 300 sec: 5563.5). Total num frames: 1284554752. Throughput: 0: 5887.9. Samples: 1284562860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 19.0) [2022-07-11 15:40:40,731][25689] Avg episode reward: [(0, '1.010')] [2022-07-11 15:40:40,983][26022] Updated weights on worker 0-0, policy_version 1254450 (0.00093) [2022-07-11 15:40:42,957][26022] Updated weights on worker 0-0, policy_version 1254460 (0.00094) [2022-07-11 15:40:44,697][26022] Updated weights on worker 0-0, policy_version 1254470 (0.00094) [2022-07-11 15:40:45,735][25689] Fps is (10 sec: 5488.6, 60 sec: 5580.7, 300 sec: 5563.7). Total num frames: 1284581376. Throughput: 0: 5032.1. Samples: 1284579712. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:40:45,735][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 15:40:46,540][26022] Updated weights on worker 0-0, policy_version 1254480 (0.00092) [2022-07-11 15:40:48,519][26022] Updated weights on worker 0-0, policy_version 1254490 (0.00088) [2022-07-11 15:40:50,195][26022] Updated weights on worker 0-0, policy_version 1254500 (0.00085) [2022-07-11 15:40:50,754][25689] Fps is (10 sec: 5516.9, 60 sec: 5580.7, 300 sec: 5560.5). Total num frames: 1284610048. Throughput: 0: 5840.6. Samples: 1284612892. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:40:50,754][25689] Avg episode reward: [(0, '0.817')] [2022-07-11 15:40:52,191][26022] Updated weights on worker 0-0, policy_version 1254510 (0.00091) [2022-07-11 15:40:53,998][26022] Updated weights on worker 0-0, policy_version 1254520 (0.00086) [2022-07-11 15:40:55,658][26022] Updated weights on worker 0-0, policy_version 1254530 (0.00087) [2022-07-11 15:40:55,769][25689] Fps is (10 sec: 5714.8, 60 sec: 5615.9, 300 sec: 5565.2). Total num frames: 1284638720. Throughput: 0: 5838.3. Samples: 1284646512. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:40:55,769][25689] Avg episode reward: [(0, '0.861')] [2022-07-11 15:40:57,904][26022] Updated weights on worker 0-0, policy_version 1254540 (0.00098) [2022-07-11 15:40:59,301][26022] Updated weights on worker 0-0, policy_version 1254550 (0.00079) [2022-07-11 15:41:00,801][25689] Fps is (10 sec: 5503.7, 60 sec: 5553.8, 300 sec: 5572.1). Total num frames: 1284665344. Throughput: 0: 4995.1. Samples: 1284663112. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:00,802][25689] Avg episode reward: [(0, '0.474')] [2022-07-11 15:41:01,407][26022] Updated weights on worker 0-0, policy_version 1254560 (0.00087) [2022-07-11 15:41:03,534][26022] Updated weights on worker 0-0, policy_version 1254570 (0.00084) [2022-07-11 15:41:05,307][26022] Updated weights on worker 0-0, policy_version 1254580 (0.00090) [2022-07-11 15:41:05,805][25689] Fps is (10 sec: 5305.7, 60 sec: 5571.3, 300 sec: 5565.2). Total num frames: 1284691968. Throughput: 0: 5728.1. Samples: 1284694676. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:05,805][25689] Avg episode reward: [(0, '0.308')] [2022-07-11 15:41:07,381][26022] Updated weights on worker 0-0, policy_version 1254590 (0.00264) [2022-07-11 15:41:09,124][26022] Updated weights on worker 0-0, policy_version 1254600 (0.00088) [2022-07-11 15:41:10,797][26022] Updated weights on worker 0-0, policy_version 1254610 (0.00082) [2022-07-11 15:41:10,846][25689] Fps is (10 sec: 5504.8, 60 sec: 5584.9, 300 sec: 5564.6). Total num frames: 1284720640. Throughput: 0: 5756.9. Samples: 1284728560. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:10,846][25689] Avg episode reward: [(0, '0.910')] [2022-07-11 15:41:12,736][26022] Updated weights on worker 0-0, policy_version 1254620 (0.00100) [2022-07-11 15:41:14,268][26022] Updated weights on worker 0-0, policy_version 1254630 (0.00090) [2022-07-11 15:41:15,863][25689] Fps is (10 sec: 5599.7, 60 sec: 5554.0, 300 sec: 5562.5). Total num frames: 1284748288. Throughput: 0: 4933.4. Samples: 1284745642. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:15,863][25689] Avg episode reward: [(0, '0.766')] [2022-07-11 15:41:16,416][26022] Updated weights on worker 0-0, policy_version 1254640 (0.00542) [2022-07-11 15:41:17,790][26022] Updated weights on worker 0-0, policy_version 1254650 (0.00087) [2022-07-11 15:41:20,155][26022] Updated weights on worker 0-0, policy_version 1254660 (0.00100) [2022-07-11 15:41:20,924][25689] Fps is (10 sec: 5791.5, 60 sec: 5619.7, 300 sec: 5572.7). Total num frames: 1284779008. Throughput: 0: 5775.2. Samples: 1284779328. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:20,925][25689] Avg episode reward: [(0, '0.284')] [2022-07-11 15:41:21,611][26022] Updated weights on worker 0-0, policy_version 1254670 (0.00087) [2022-07-11 15:41:23,578][26022] Updated weights on worker 0-0, policy_version 1254680 (0.00092) [2022-07-11 15:41:25,251][26022] Updated weights on worker 0-0, policy_version 1254690 (0.00087) [2022-07-11 15:41:25,982][25689] Fps is (10 sec: 5565.6, 60 sec: 5530.6, 300 sec: 5562.2). Total num frames: 1284804608. Throughput: 0: 5874.6. Samples: 1284813208. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:25,982][25689] Avg episode reward: [(0, '-1.168')] [2022-07-11 15:41:27,262][26022] Updated weights on worker 0-0, policy_version 1254700 (0.00083) [2022-07-11 15:41:29,042][26022] Updated weights on worker 0-0, policy_version 1254710 (0.00091) [2022-07-11 15:41:30,863][26022] Updated weights on worker 0-0, policy_version 1254720 (0.00081) [2022-07-11 15:41:30,994][25689] Fps is (10 sec: 5389.3, 60 sec: 5548.4, 300 sec: 5559.6). Total num frames: 1284833280. Throughput: 0: 5037.7. Samples: 1284830062. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:30,995][25689] Avg episode reward: [(0, '-0.596')] [2022-07-11 15:41:32,822][26022] Updated weights on worker 0-0, policy_version 1254730 (0.00103) [2022-07-11 15:41:34,635][26022] Updated weights on worker 0-0, policy_version 1254740 (0.00090) [2022-07-11 15:41:36,001][25689] Fps is (10 sec: 5621.1, 60 sec: 5549.2, 300 sec: 5568.4). Total num frames: 1284860928. Throughput: 0: 5853.3. Samples: 1284863520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:36,002][25689] Avg episode reward: [(0, '-1.443')] [2022-07-11 15:41:36,262][26022] Updated weights on worker 0-0, policy_version 1254750 (0.00087) [2022-07-11 15:41:38,499][26022] Updated weights on worker 0-0, policy_version 1254760 (0.00088) [2022-07-11 15:41:39,851][26022] Updated weights on worker 0-0, policy_version 1254770 (0.00083) [2022-07-11 15:41:41,061][25689] Fps is (10 sec: 5594.7, 60 sec: 5550.3, 300 sec: 5560.8). Total num frames: 1284889600. Throughput: 0: 5846.6. Samples: 1284897062. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:41,062][25689] Avg episode reward: [(0, '-1.098')] [2022-07-11 15:41:42,000][26022] Updated weights on worker 0-0, policy_version 1254780 (0.00089) [2022-07-11 15:41:43,530][26022] Updated weights on worker 0-0, policy_version 1254790 (0.00085) [2022-07-11 15:41:45,584][26022] Updated weights on worker 0-0, policy_version 1254800 (0.00087) [2022-07-11 15:41:46,080][25689] Fps is (10 sec: 5689.8, 60 sec: 5582.8, 300 sec: 5564.5). Total num frames: 1284918272. Throughput: 0: 5010.6. Samples: 1284913910. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:46,082][25689] Avg episode reward: [(0, '-1.024')] [2022-07-11 15:41:47,480][26022] Updated weights on worker 0-0, policy_version 1254810 (0.00084) [2022-07-11 15:41:49,107][26022] Updated weights on worker 0-0, policy_version 1254820 (0.00056) [2022-07-11 15:41:51,088][25689] Fps is (10 sec: 5514.6, 60 sec: 5549.9, 300 sec: 5557.7). Total num frames: 1284944896. Throughput: 0: 5854.6. Samples: 1284947704. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:51,090][25689] Avg episode reward: [(0, '-0.273')] [2022-07-11 15:41:51,111][26022] Updated weights on worker 0-0, policy_version 1254830 (0.00088) [2022-07-11 15:41:52,618][26022] Updated weights on worker 0-0, policy_version 1254840 (0.00090) [2022-07-11 15:41:54,676][26022] Updated weights on worker 0-0, policy_version 1254850 (0.00080) [2022-07-11 15:41:56,125][25689] Fps is (10 sec: 5606.8, 60 sec: 5564.9, 300 sec: 5565.6). Total num frames: 1284974592. Throughput: 0: 5870.8. Samples: 1284981660. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:41:56,125][25689] Avg episode reward: [(0, '0.849')] [2022-07-11 15:41:56,496][26022] Updated weights on worker 0-0, policy_version 1254860 (0.00090) [2022-07-11 15:41:58,168][26022] Updated weights on worker 0-0, policy_version 1254870 (0.00084) [2022-07-11 15:42:00,263][26022] Updated weights on worker 0-0, policy_version 1254880 (0.00085) [2022-07-11 15:42:01,256][25689] Fps is (10 sec: 5740.8, 60 sec: 5589.6, 300 sec: 5570.9). Total num frames: 1285003264. Throughput: 0: 5022.9. Samples: 1284998496. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:01,256][25689] Avg episode reward: [(0, '1.003')] [2022-07-11 15:42:02,175][26022] Updated weights on worker 0-0, policy_version 1254890 (0.00084) [2022-07-11 15:42:04,190][26022] Updated weights on worker 0-0, policy_version 1254900 (0.00094) [2022-07-11 15:42:05,793][26022] Updated weights on worker 0-0, policy_version 1254910 (0.00092) [2022-07-11 15:42:06,291][25689] Fps is (10 sec: 5439.3, 60 sec: 5586.8, 300 sec: 5568.6). Total num frames: 1285029888. Throughput: 0: 5756.1. Samples: 1285030246. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:06,298][25689] Avg episode reward: [(0, '1.983')] [2022-07-11 15:42:07,711][26022] Updated weights on worker 0-0, policy_version 1254920 (0.00097) [2022-07-11 15:42:09,569][26022] Updated weights on worker 0-0, policy_version 1254930 (0.00088) [2022-07-11 15:42:11,323][25689] Fps is (10 sec: 5391.0, 60 sec: 5570.7, 300 sec: 5566.0). Total num frames: 1285057536. Throughput: 0: 5724.9. Samples: 1285063544. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:11,323][25689] Avg episode reward: [(0, '1.443')] [2022-07-11 15:42:11,572][26022] Updated weights on worker 0-0, policy_version 1254940 (0.00079) [2022-07-11 15:42:13,202][26022] Updated weights on worker 0-0, policy_version 1254950 (0.00089) [2022-07-11 15:42:15,146][26022] Updated weights on worker 0-0, policy_version 1254960 (0.00084) [2022-07-11 15:42:16,333][25689] Fps is (10 sec: 5608.5, 60 sec: 5588.2, 300 sec: 5567.5). Total num frames: 1285086208. Throughput: 0: 4891.7. Samples: 1285080510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:16,333][25689] Avg episode reward: [(0, '1.150')] [2022-07-11 15:42:16,759][26022] Updated weights on worker 0-0, policy_version 1254970 (0.00093) [2022-07-11 15:42:18,201][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:42:18,214][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001254977_1285096448.pth [2022-07-11 15:42:18,214][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001253018_1283090432.pth [2022-07-11 15:42:18,936][26022] Updated weights on worker 0-0, policy_version 1254980 (0.00090) [2022-07-11 15:42:20,571][26022] Updated weights on worker 0-0, policy_version 1254990 (0.00086) [2022-07-11 15:42:21,448][25689] Fps is (10 sec: 5562.1, 60 sec: 5532.5, 300 sec: 5565.9). Total num frames: 1285113856. Throughput: 0: 5728.9. Samples: 1285114178. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:21,449][25689] Avg episode reward: [(0, '1.012')] [2022-07-11 15:42:22,627][26022] Updated weights on worker 0-0, policy_version 1255000 (0.00086) [2022-07-11 15:42:24,138][26022] Updated weights on worker 0-0, policy_version 1255010 (0.00085) [2022-07-11 15:42:26,323][26022] Updated weights on worker 0-0, policy_version 1255020 (0.00084) [2022-07-11 15:42:26,473][25689] Fps is (10 sec: 5352.0, 60 sec: 5552.4, 300 sec: 5565.9). Total num frames: 1285140480. Throughput: 0: 5811.9. Samples: 1285147544. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:26,474][25689] Avg episode reward: [(0, '0.914')] [2022-07-11 15:42:27,830][26022] Updated weights on worker 0-0, policy_version 1255030 (0.00109) [2022-07-11 15:42:30,005][26022] Updated weights on worker 0-0, policy_version 1255040 (0.00089) [2022-07-11 15:42:31,475][25689] Fps is (10 sec: 5616.8, 60 sec: 5570.3, 300 sec: 5566.3). Total num frames: 1285170176. Throughput: 0: 4992.5. Samples: 1285164156. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:31,476][25689] Avg episode reward: [(0, '0.821')] [2022-07-11 15:42:31,827][26022] Updated weights on worker 0-0, policy_version 1255050 (0.00086) [2022-07-11 15:42:33,551][26022] Updated weights on worker 0-0, policy_version 1255060 (0.00412) [2022-07-11 15:42:35,514][26022] Updated weights on worker 0-0, policy_version 1255070 (0.00090) [2022-07-11 15:42:36,478][25689] Fps is (10 sec: 5731.8, 60 sec: 5570.7, 300 sec: 5567.8). Total num frames: 1285197824. Throughput: 0: 5812.3. Samples: 1285197600. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:36,478][25689] Avg episode reward: [(0, '0.873')] [2022-07-11 15:42:37,216][26022] Updated weights on worker 0-0, policy_version 1255080 (0.00092) [2022-07-11 15:42:39,294][26022] Updated weights on worker 0-0, policy_version 1255090 (0.00088) [2022-07-11 15:42:40,794][26022] Updated weights on worker 0-0, policy_version 1255100 (0.00086) [2022-07-11 15:42:41,590][25689] Fps is (10 sec: 5467.1, 60 sec: 5549.0, 300 sec: 5560.4). Total num frames: 1285225472. Throughput: 0: 5815.0. Samples: 1285231300. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:41,590][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 15:42:42,774][26022] Updated weights on worker 0-0, policy_version 1255110 (0.00120) [2022-07-11 15:42:44,486][26022] Updated weights on worker 0-0, policy_version 1255120 (0.00086) [2022-07-11 15:42:46,408][26022] Updated weights on worker 0-0, policy_version 1255130 (0.00088) [2022-07-11 15:42:46,612][25689] Fps is (10 sec: 5557.2, 60 sec: 5548.6, 300 sec: 5563.9). Total num frames: 1285254144. Throughput: 0: 4999.8. Samples: 1285248236. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:46,613][25689] Avg episode reward: [(0, '-0.205')] [2022-07-11 15:42:48,221][26022] Updated weights on worker 0-0, policy_version 1255140 (0.00094) [2022-07-11 15:42:49,935][26022] Updated weights on worker 0-0, policy_version 1255150 (0.00080) [2022-07-11 15:42:51,655][25689] Fps is (10 sec: 5595.4, 60 sec: 5562.4, 300 sec: 5563.9). Total num frames: 1285281792. Throughput: 0: 5827.7. Samples: 1285281758. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:51,656][25689] Avg episode reward: [(0, '-0.001')] [2022-07-11 15:42:51,780][26022] Updated weights on worker 0-0, policy_version 1255160 (0.00081) [2022-07-11 15:42:53,615][26022] Updated weights on worker 0-0, policy_version 1255170 (0.00092) [2022-07-11 15:42:55,528][26022] Updated weights on worker 0-0, policy_version 1255180 (0.00091) [2022-07-11 15:42:56,662][25689] Fps is (10 sec: 5706.2, 60 sec: 5565.1, 300 sec: 5566.3). Total num frames: 1285311488. Throughput: 0: 5848.6. Samples: 1285315648. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:42:56,662][25689] Avg episode reward: [(0, '0.315')] [2022-07-11 15:42:57,258][26022] Updated weights on worker 0-0, policy_version 1255190 (0.00089) [2022-07-11 15:42:59,047][26022] Updated weights on worker 0-0, policy_version 1255200 (0.00091) [2022-07-11 15:43:01,042][26022] Updated weights on worker 0-0, policy_version 1255210 (0.00092) [2022-07-11 15:43:01,710][25689] Fps is (10 sec: 5397.8, 60 sec: 5505.0, 300 sec: 5566.5). Total num frames: 1285336064. Throughput: 0: 5035.0. Samples: 1285332604. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:01,712][25689] Avg episode reward: [(0, '0.373')] [2022-07-11 15:43:03,076][26022] Updated weights on worker 0-0, policy_version 1255220 (0.00088) [2022-07-11 15:43:05,087][26022] Updated weights on worker 0-0, policy_version 1255230 (0.00091) [2022-07-11 15:43:06,669][26022] Updated weights on worker 0-0, policy_version 1255240 (0.00091) [2022-07-11 15:43:06,766][25689] Fps is (10 sec: 5371.1, 60 sec: 5553.9, 300 sec: 5570.0). Total num frames: 1285365760. Throughput: 0: 5760.6. Samples: 1285364334. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:06,767][25689] Avg episode reward: [(0, '1.484')] [2022-07-11 15:43:08,743][26022] Updated weights on worker 0-0, policy_version 1255250 (0.00090) [2022-07-11 15:43:10,435][26022] Updated weights on worker 0-0, policy_version 1255260 (0.00084) [2022-07-11 15:43:11,780][25689] Fps is (10 sec: 5694.6, 60 sec: 5555.5, 300 sec: 5567.8). Total num frames: 1285393408. Throughput: 0: 5761.7. Samples: 1285397710. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:11,780][25689] Avg episode reward: [(0, '1.460')] [2022-07-11 15:43:12,506][26022] Updated weights on worker 0-0, policy_version 1255270 (0.00093) [2022-07-11 15:43:14,023][26022] Updated weights on worker 0-0, policy_version 1255280 (0.00086) [2022-07-11 15:43:16,001][26022] Updated weights on worker 0-0, policy_version 1255290 (0.00089) [2022-07-11 15:43:16,798][25689] Fps is (10 sec: 5511.8, 60 sec: 5537.8, 300 sec: 5573.0). Total num frames: 1285421056. Throughput: 0: 4912.5. Samples: 1285414570. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:16,799][25689] Avg episode reward: [(0, '2.413')] [2022-07-11 15:43:17,795][26022] Updated weights on worker 0-0, policy_version 1255300 (0.00083) [2022-07-11 15:43:19,550][26022] Updated weights on worker 0-0, policy_version 1255310 (0.00464) [2022-07-11 15:43:21,336][26022] Updated weights on worker 0-0, policy_version 1255320 (0.00088) [2022-07-11 15:43:21,862][25689] Fps is (10 sec: 5586.2, 60 sec: 5559.6, 300 sec: 5564.9). Total num frames: 1285449728. Throughput: 0: 5733.8. Samples: 1285448150. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:21,862][25689] Avg episode reward: [(0, '2.635')] [2022-07-11 15:43:23,484][26022] Updated weights on worker 0-0, policy_version 1255330 (0.00094) [2022-07-11 15:43:24,915][26022] Updated weights on worker 0-0, policy_version 1255340 (0.00088) [2022-07-11 15:43:26,906][25689] Fps is (10 sec: 5470.7, 60 sec: 5557.8, 300 sec: 5561.9). Total num frames: 1285476352. Throughput: 0: 5830.8. Samples: 1285481766. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:26,907][25689] Avg episode reward: [(0, '1.801')] [2022-07-11 15:43:27,163][26022] Updated weights on worker 0-0, policy_version 1255350 (0.00082) [2022-07-11 15:43:28,561][26022] Updated weights on worker 0-0, policy_version 1255360 (0.00085) [2022-07-11 15:43:30,648][26022] Updated weights on worker 0-0, policy_version 1255370 (0.00087) [2022-07-11 15:43:31,915][25689] Fps is (10 sec: 5602.3, 60 sec: 5557.2, 300 sec: 5569.1). Total num frames: 1285506048. Throughput: 0: 5002.5. Samples: 1285498436. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:31,916][25689] Avg episode reward: [(0, '1.950')] [2022-07-11 15:43:32,612][26022] Updated weights on worker 0-0, policy_version 1255380 (0.00102) [2022-07-11 15:43:34,130][26022] Updated weights on worker 0-0, policy_version 1255390 (0.00084) [2022-07-11 15:43:36,175][26022] Updated weights on worker 0-0, policy_version 1255400 (0.00088) [2022-07-11 15:43:36,938][25689] Fps is (10 sec: 5716.0, 60 sec: 5555.2, 300 sec: 5571.0). Total num frames: 1285533696. Throughput: 0: 5826.6. Samples: 1285531916. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:36,939][25689] Avg episode reward: [(0, '0.361')] [2022-07-11 15:43:38,073][26022] Updated weights on worker 0-0, policy_version 1255410 (0.00085) [2022-07-11 15:43:39,836][26022] Updated weights on worker 0-0, policy_version 1255420 (0.00083) [2022-07-11 15:43:42,029][25689] Fps is (10 sec: 5366.0, 60 sec: 5540.3, 300 sec: 5560.5). Total num frames: 1285560320. Throughput: 0: 5794.1. Samples: 1285565000. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:42,031][26022] Updated weights on worker 0-0, policy_version 1255430 (0.00090) [2022-07-11 15:43:42,029][25689] Avg episode reward: [(0, '0.263')] [2022-07-11 15:43:43,536][26022] Updated weights on worker 0-0, policy_version 1255440 (0.00091) [2022-07-11 15:43:45,523][26022] Updated weights on worker 0-0, policy_version 1255450 (0.00087) [2022-07-11 15:43:47,071][25689] Fps is (10 sec: 5456.9, 60 sec: 5538.4, 300 sec: 5568.2). Total num frames: 1285588992. Throughput: 0: 5770.6. Samples: 1285598132. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:47,072][25689] Avg episode reward: [(0, '0.025')] [2022-07-11 15:43:47,190][26022] Updated weights on worker 0-0, policy_version 1255460 (0.00103) [2022-07-11 15:43:49,059][26022] Updated weights on worker 0-0, policy_version 1255470 (0.00094) [2022-07-11 15:43:51,339][26022] Updated weights on worker 0-0, policy_version 1255480 (0.00095) [2022-07-11 15:43:52,097][25689] Fps is (10 sec: 5593.8, 60 sec: 5540.0, 300 sec: 5566.0). Total num frames: 1285616640. Throughput: 0: 5759.1. Samples: 1285614668. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:52,097][25689] Avg episode reward: [(0, '-0.351')] [2022-07-11 15:43:52,992][26022] Updated weights on worker 0-0, policy_version 1255490 (0.00090) [2022-07-11 15:43:54,815][26022] Updated weights on worker 0-0, policy_version 1255500 (0.00089) [2022-07-11 15:43:56,545][26022] Updated weights on worker 0-0, policy_version 1255510 (0.00086) [2022-07-11 15:43:57,134][25689] Fps is (10 sec: 5393.4, 60 sec: 5486.4, 300 sec: 5559.3). Total num frames: 1285643264. Throughput: 0: 5722.0. Samples: 1285647476. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:43:57,135][25689] Avg episode reward: [(0, '-0.257')] [2022-07-11 15:43:58,551][26022] Updated weights on worker 0-0, policy_version 1255520 (0.00094) [2022-07-11 15:44:00,466][26022] Updated weights on worker 0-0, policy_version 1255530 (0.00089) [2022-07-11 15:44:02,195][25689] Fps is (10 sec: 5476.1, 60 sec: 5553.0, 300 sec: 5565.9). Total num frames: 1285671936. Throughput: 0: 5730.6. Samples: 1285680564. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:02,195][25689] Avg episode reward: [(0, '0.026')] [2022-07-11 15:44:02,614][26022] Updated weights on worker 0-0, policy_version 1255540 (0.00087) [2022-07-11 15:44:04,419][26022] Updated weights on worker 0-0, policy_version 1255550 (0.00089) [2022-07-11 15:44:06,223][26022] Updated weights on worker 0-0, policy_version 1255560 (0.00078) [2022-07-11 15:44:07,241][25689] Fps is (10 sec: 5471.3, 60 sec: 5503.1, 300 sec: 5559.3). Total num frames: 1285698560. Throughput: 0: 4829.5. Samples: 1285695538. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:07,241][25689] Avg episode reward: [(0, '1.137')] [2022-07-11 15:44:08,145][26022] Updated weights on worker 0-0, policy_version 1255570 (0.00093) [2022-07-11 15:44:09,978][26022] Updated weights on worker 0-0, policy_version 1255580 (0.00097) [2022-07-11 15:44:11,634][26022] Updated weights on worker 0-0, policy_version 1255590 (0.00743) [2022-07-11 15:44:12,303][25689] Fps is (10 sec: 5369.1, 60 sec: 5498.7, 300 sec: 5555.2). Total num frames: 1285726208. Throughput: 0: 5663.7. Samples: 1285729108. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:12,308][25689] Avg episode reward: [(0, '1.023')] [2022-07-11 15:44:13,594][26022] Updated weights on worker 0-0, policy_version 1255600 (0.00085) [2022-07-11 15:44:15,410][26022] Updated weights on worker 0-0, policy_version 1255610 (0.00084) [2022-07-11 15:44:17,161][26022] Updated weights on worker 0-0, policy_version 1255620 (0.00095) [2022-07-11 15:44:17,323][25689] Fps is (10 sec: 5586.0, 60 sec: 5515.5, 300 sec: 5560.0). Total num frames: 1285754880. Throughput: 0: 5731.1. Samples: 1285763180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:17,323][25689] Avg episode reward: [(0, '1.317')] [2022-07-11 15:44:18,247][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:44:18,258][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001255626_1285761024.pth [2022-07-11 15:44:18,258][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001253671_1283759104.pth [2022-07-11 15:44:19,054][26022] Updated weights on worker 0-0, policy_version 1255630 (0.00086) [2022-07-11 15:44:20,774][26022] Updated weights on worker 0-0, policy_version 1255640 (0.00051) [2022-07-11 15:44:22,431][25689] Fps is (10 sec: 5762.7, 60 sec: 5528.3, 300 sec: 5565.5). Total num frames: 1285784576. Throughput: 0: 4911.7. Samples: 1285779960. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:22,432][25689] Avg episode reward: [(0, '1.330')] [2022-07-11 15:44:22,627][26022] Updated weights on worker 0-0, policy_version 1255650 (0.00092) [2022-07-11 15:44:24,563][26022] Updated weights on worker 0-0, policy_version 1255660 (0.00101) [2022-07-11 15:44:26,139][26022] Updated weights on worker 0-0, policy_version 1255670 (0.00085) [2022-07-11 15:44:27,435][25689] Fps is (10 sec: 5670.8, 60 sec: 5548.9, 300 sec: 5558.6). Total num frames: 1285812224. Throughput: 0: 5854.8. Samples: 1285813772. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:27,435][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 15:44:28,315][26022] Updated weights on worker 0-0, policy_version 1255680 (0.00093) [2022-07-11 15:44:29,946][26022] Updated weights on worker 0-0, policy_version 1255690 (0.00083) [2022-07-11 15:44:31,836][26022] Updated weights on worker 0-0, policy_version 1255700 (0.00081) [2022-07-11 15:44:32,461][25689] Fps is (10 sec: 5615.4, 60 sec: 5530.4, 300 sec: 5561.9). Total num frames: 1285840896. Throughput: 0: 5877.4. Samples: 1285847586. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:32,461][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 15:44:33,693][26022] Updated weights on worker 0-0, policy_version 1255710 (0.00152) [2022-07-11 15:44:35,298][26022] Updated weights on worker 0-0, policy_version 1255720 (0.00109) [2022-07-11 15:44:37,199][26022] Updated weights on worker 0-0, policy_version 1255730 (0.00086) [2022-07-11 15:44:37,490][25689] Fps is (10 sec: 5703.2, 60 sec: 5546.9, 300 sec: 5565.5). Total num frames: 1285869568. Throughput: 0: 5024.6. Samples: 1285864510. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:37,490][25689] Avg episode reward: [(0, '0.507')] [2022-07-11 15:44:39,007][26022] Updated weights on worker 0-0, policy_version 1255740 (0.00080) [2022-07-11 15:44:40,830][26022] Updated weights on worker 0-0, policy_version 1255750 (0.00087) [2022-07-11 15:44:42,541][25689] Fps is (10 sec: 5485.4, 60 sec: 5550.4, 300 sec: 5558.1). Total num frames: 1285896192. Throughput: 0: 5869.1. Samples: 1285897988. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2022-07-11 15:44:42,542][25689] Avg episode reward: [(0, '0.551')] [2022-07-11 15:44:42,775][26022] Updated weights on worker 0-0, policy_version 1255760 (0.00086) [2022-07-11 15:44:44,401][26022] Updated weights on worker 0-0, policy_version 1255770 (0.00089) [2022-07-11 15:44:46,430][26022] Updated weights on worker 0-0, policy_version 1255780 (0.00117) [2022-07-11 15:44:47,545][25689] Fps is (10 sec: 5499.0, 60 sec: 5554.0, 300 sec: 5558.3). Total num frames: 1285924864. Throughput: 0: 5852.0. Samples: 1285931458. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:44:47,546][25689] Avg episode reward: [(0, '0.579')] [2022-07-11 15:44:48,053][26022] Updated weights on worker 0-0, policy_version 1255790 (0.00084) [2022-07-11 15:44:50,074][26022] Updated weights on worker 0-0, policy_version 1255800 (0.00087) [2022-07-11 15:44:51,969][26022] Updated weights on worker 0-0, policy_version 1255810 (0.00086) [2022-07-11 15:44:52,562][25689] Fps is (10 sec: 5620.2, 60 sec: 5554.8, 300 sec: 5562.0). Total num frames: 1285952512. Throughput: 0: 5006.2. Samples: 1285948218. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:44:52,563][25689] Avg episode reward: [(0, '0.182')] [2022-07-11 15:44:53,765][26022] Updated weights on worker 0-0, policy_version 1255820 (0.00088) [2022-07-11 15:44:55,439][26022] Updated weights on worker 0-0, policy_version 1255830 (0.00087) [2022-07-11 15:44:57,355][26022] Updated weights on worker 0-0, policy_version 1255840 (0.00088) [2022-07-11 15:44:57,571][25689] Fps is (10 sec: 5617.3, 60 sec: 5591.2, 300 sec: 5556.7). Total num frames: 1285981184. Throughput: 0: 5854.6. Samples: 1285982080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:44:57,572][25689] Avg episode reward: [(0, '1.003')] [2022-07-11 15:44:59,143][26022] Updated weights on worker 0-0, policy_version 1255850 (0.00088) [2022-07-11 15:45:01,005][26022] Updated weights on worker 0-0, policy_version 1255860 (0.00085) [2022-07-11 15:45:02,619][25689] Fps is (10 sec: 5396.8, 60 sec: 5541.6, 300 sec: 5556.0). Total num frames: 1286006784. Throughput: 0: 5813.8. Samples: 1286014712. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:02,619][25689] Avg episode reward: [(0, '0.114')] [2022-07-11 15:45:03,152][26022] Updated weights on worker 0-0, policy_version 1255870 (0.00087) [2022-07-11 15:45:04,899][26022] Updated weights on worker 0-0, policy_version 1255880 (0.00081) [2022-07-11 15:45:06,819][26022] Updated weights on worker 0-0, policy_version 1255890 (0.00080) [2022-07-11 15:45:07,647][25689] Fps is (10 sec: 5386.6, 60 sec: 5577.2, 300 sec: 5559.0). Total num frames: 1286035456. Throughput: 0: 4936.0. Samples: 1286030678. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:07,647][25689] Avg episode reward: [(0, '0.113')] [2022-07-11 15:45:08,859][26022] Updated weights on worker 0-0, policy_version 1255900 (0.00096) [2022-07-11 15:45:10,587][26022] Updated weights on worker 0-0, policy_version 1255910 (0.00093) [2022-07-11 15:45:12,561][26022] Updated weights on worker 0-0, policy_version 1255920 (0.00084) [2022-07-11 15:45:12,660][25689] Fps is (10 sec: 5507.1, 60 sec: 5564.8, 300 sec: 5549.4). Total num frames: 1286062080. Throughput: 0: 5755.6. Samples: 1286063890. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:12,660][25689] Avg episode reward: [(0, '-0.459')] [2022-07-11 15:45:14,177][26022] Updated weights on worker 0-0, policy_version 1255930 (0.00082) [2022-07-11 15:45:16,204][26022] Updated weights on worker 0-0, policy_version 1255940 (0.00086) [2022-07-11 15:45:17,675][25689] Fps is (10 sec: 5616.0, 60 sec: 5582.1, 300 sec: 5560.1). Total num frames: 1286091776. Throughput: 0: 5748.3. Samples: 1286097644. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:17,676][25689] Avg episode reward: [(0, '-0.451')] [2022-07-11 15:45:17,795][26022] Updated weights on worker 0-0, policy_version 1255950 (0.00093) [2022-07-11 15:45:19,780][26022] Updated weights on worker 0-0, policy_version 1255960 (0.00089) [2022-07-11 15:45:21,598][26022] Updated weights on worker 0-0, policy_version 1255970 (0.00083) [2022-07-11 15:45:22,745][25689] Fps is (10 sec: 5787.4, 60 sec: 5568.8, 300 sec: 5552.1). Total num frames: 1286120448. Throughput: 0: 4948.5. Samples: 1286114306. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:22,746][25689] Avg episode reward: [(0, '0.278')] [2022-07-11 15:45:23,567][26022] Updated weights on worker 0-0, policy_version 1255980 (0.00089) [2022-07-11 15:45:25,185][26022] Updated weights on worker 0-0, policy_version 1255990 (0.00089) [2022-07-11 15:45:27,355][26022] Updated weights on worker 0-0, policy_version 1256000 (0.00084) [2022-07-11 15:45:27,793][25689] Fps is (10 sec: 5465.3, 60 sec: 5547.7, 300 sec: 5548.2). Total num frames: 1286147072. Throughput: 0: 5833.5. Samples: 1286148200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:27,795][25689] Avg episode reward: [(0, '0.244')] [2022-07-11 15:45:28,607][26022] Updated weights on worker 0-0, policy_version 1256010 (0.00089) [2022-07-11 15:45:31,006][26022] Updated weights on worker 0-0, policy_version 1256020 (0.00086) [2022-07-11 15:45:32,333][26022] Updated weights on worker 0-0, policy_version 1256030 (0.00097) [2022-07-11 15:45:32,843][25689] Fps is (10 sec: 5577.4, 60 sec: 5562.5, 300 sec: 5554.5). Total num frames: 1286176768. Throughput: 0: 5844.9. Samples: 1286181858. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:32,844][25689] Avg episode reward: [(0, '1.265')] [2022-07-11 15:45:34,339][26022] Updated weights on worker 0-0, policy_version 1256040 (0.00083) [2022-07-11 15:45:36,139][26022] Updated weights on worker 0-0, policy_version 1256050 (0.00094) [2022-07-11 15:45:37,861][25689] Fps is (10 sec: 5695.5, 60 sec: 5546.5, 300 sec: 5552.0). Total num frames: 1286204416. Throughput: 0: 5010.2. Samples: 1286198778. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:37,863][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 15:45:37,871][26022] Updated weights on worker 0-0, policy_version 1256060 (0.00086) [2022-07-11 15:45:39,961][26022] Updated weights on worker 0-0, policy_version 1256070 (0.00095) [2022-07-11 15:45:41,777][26022] Updated weights on worker 0-0, policy_version 1256080 (0.00057) [2022-07-11 15:45:42,973][25689] Fps is (10 sec: 5458.5, 60 sec: 5557.9, 300 sec: 5553.4). Total num frames: 1286232064. Throughput: 0: 5826.5. Samples: 1286232166. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:42,981][25689] Avg episode reward: [(0, '1.850')] [2022-07-11 15:45:43,663][26022] Updated weights on worker 0-0, policy_version 1256090 (0.00061) [2022-07-11 15:45:45,384][26022] Updated weights on worker 0-0, policy_version 1256100 (0.00098) [2022-07-11 15:45:47,139][26022] Updated weights on worker 0-0, policy_version 1256110 (0.00086) [2022-07-11 15:45:48,014][25689] Fps is (10 sec: 5547.0, 60 sec: 5554.4, 300 sec: 5553.0). Total num frames: 1286260736. Throughput: 0: 5832.9. Samples: 1286266150. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:48,015][25689] Avg episode reward: [(0, '1.971')] [2022-07-11 15:45:48,883][26022] Updated weights on worker 0-0, policy_version 1256120 (0.00080) [2022-07-11 15:45:50,800][26022] Updated weights on worker 0-0, policy_version 1256130 (0.00094) [2022-07-11 15:45:52,639][26022] Updated weights on worker 0-0, policy_version 1256140 (0.00085) [2022-07-11 15:45:53,019][25689] Fps is (10 sec: 5708.2, 60 sec: 5572.5, 300 sec: 5553.2). Total num frames: 1286289408. Throughput: 0: 5013.2. Samples: 1286283006. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:53,019][25689] Avg episode reward: [(0, '1.689')] [2022-07-11 15:45:54,401][26022] Updated weights on worker 0-0, policy_version 1256150 (0.00504) [2022-07-11 15:45:56,268][26022] Updated weights on worker 0-0, policy_version 1256160 (0.00079) [2022-07-11 15:45:57,996][26022] Updated weights on worker 0-0, policy_version 1256170 (0.00087) [2022-07-11 15:45:58,086][25689] Fps is (10 sec: 5693.6, 60 sec: 5567.2, 300 sec: 5559.4). Total num frames: 1286318080. Throughput: 0: 5847.2. Samples: 1286317036. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:45:58,088][25689] Avg episode reward: [(0, '2.169')] [2022-07-11 15:45:59,973][26022] Updated weights on worker 0-0, policy_version 1256180 (0.00090) [2022-07-11 15:46:01,657][26022] Updated weights on worker 0-0, policy_version 1256190 (0.00099) [2022-07-11 15:46:03,162][25689] Fps is (10 sec: 5350.7, 60 sec: 5564.5, 300 sec: 5554.6). Total num frames: 1286343680. Throughput: 0: 5760.4. Samples: 1286348462. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:03,163][25689] Avg episode reward: [(0, '1.231')] [2022-07-11 15:46:03,984][26022] Updated weights on worker 0-0, policy_version 1256200 (0.00091) [2022-07-11 15:46:05,779][26022] Updated weights on worker 0-0, policy_version 1256210 (0.00090) [2022-07-11 15:46:07,489][26022] Updated weights on worker 0-0, policy_version 1256220 (0.00430) [2022-07-11 15:46:08,208][25689] Fps is (10 sec: 5462.8, 60 sec: 5579.8, 300 sec: 5558.0). Total num frames: 1286373376. Throughput: 0: 4910.6. Samples: 1286365314. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:08,209][25689] Avg episode reward: [(0, '1.060')] [2022-07-11 15:46:09,543][26022] Updated weights on worker 0-0, policy_version 1256230 (0.00088) [2022-07-11 15:46:11,197][26022] Updated weights on worker 0-0, policy_version 1256240 (0.00085) [2022-07-11 15:46:13,240][25689] Fps is (10 sec: 5486.8, 60 sec: 5561.1, 300 sec: 5550.8). Total num frames: 1286398976. Throughput: 0: 5717.7. Samples: 1286398624. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:13,240][25689] Avg episode reward: [(0, '-0.293')] [2022-07-11 15:46:13,269][26022] Updated weights on worker 0-0, policy_version 1256250 (0.00090) [2022-07-11 15:46:14,831][26022] Updated weights on worker 0-0, policy_version 1256260 (0.00097) [2022-07-11 15:46:16,639][26022] Updated weights on worker 0-0, policy_version 1256270 (0.00088) [2022-07-11 15:46:18,250][25689] Fps is (10 sec: 5404.5, 60 sec: 5544.7, 300 sec: 5544.9). Total num frames: 1286427648. Throughput: 0: 5729.4. Samples: 1286432566. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:18,251][25689] Avg episode reward: [(0, '-0.390')] [2022-07-11 15:46:18,289][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:46:18,312][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001256278_1286428672.pth [2022-07-11 15:46:18,313][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001254324_1284427776.pth [2022-07-11 15:46:18,877][26022] Updated weights on worker 0-0, policy_version 1256280 (0.00084) [2022-07-11 15:46:20,128][26022] Updated weights on worker 0-0, policy_version 1256290 (0.00095) [2022-07-11 15:46:22,486][26022] Updated weights on worker 0-0, policy_version 1256300 (0.00084) [2022-07-11 15:46:23,380][25689] Fps is (10 sec: 5756.0, 60 sec: 5556.1, 300 sec: 5557.3). Total num frames: 1286457344. Throughput: 0: 5821.9. Samples: 1286466172. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:23,381][25689] Avg episode reward: [(0, '-0.535')] [2022-07-11 15:46:24,119][26022] Updated weights on worker 0-0, policy_version 1256310 (0.00086) [2022-07-11 15:46:25,862][26022] Updated weights on worker 0-0, policy_version 1256320 (0.00093) [2022-07-11 15:46:27,877][26022] Updated weights on worker 0-0, policy_version 1256330 (0.00103) [2022-07-11 15:46:28,471][25689] Fps is (10 sec: 5510.4, 60 sec: 5552.2, 300 sec: 5549.0). Total num frames: 1286483968. Throughput: 0: 5797.4. Samples: 1286482786. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:28,471][25689] Avg episode reward: [(0, '-1.309')] [2022-07-11 15:46:29,528][26022] Updated weights on worker 0-0, policy_version 1256340 (0.00084) [2022-07-11 15:46:31,489][26022] Updated weights on worker 0-0, policy_version 1256350 (0.00093) [2022-07-11 15:46:33,487][25689] Fps is (10 sec: 5369.9, 60 sec: 5521.5, 300 sec: 5548.8). Total num frames: 1286511616. Throughput: 0: 5816.2. Samples: 1286516386. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:33,488][25689] Avg episode reward: [(0, '-0.171')] [2022-07-11 15:46:33,536][26022] Updated weights on worker 0-0, policy_version 1256360 (0.00099) [2022-07-11 15:46:34,990][26022] Updated weights on worker 0-0, policy_version 1256370 (0.00080) [2022-07-11 15:46:37,119][26022] Updated weights on worker 0-0, policy_version 1256380 (0.00098) [2022-07-11 15:46:38,490][25689] Fps is (10 sec: 5825.6, 60 sec: 5573.6, 300 sec: 5556.7). Total num frames: 1286542336. Throughput: 0: 5825.1. Samples: 1286550466. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:38,493][25689] Avg episode reward: [(0, '-0.150')] [2022-07-11 15:46:38,569][26022] Updated weights on worker 0-0, policy_version 1256390 (0.00085) [2022-07-11 15:46:40,574][26022] Updated weights on worker 0-0, policy_version 1256400 (0.00092) [2022-07-11 15:46:42,282][26022] Updated weights on worker 0-0, policy_version 1256410 (0.00087) [2022-07-11 15:46:43,583][25689] Fps is (10 sec: 5781.0, 60 sec: 5575.3, 300 sec: 5551.9). Total num frames: 1286569984. Throughput: 0: 5005.8. Samples: 1286567304. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:43,585][25689] Avg episode reward: [(0, '1.134')] [2022-07-11 15:46:44,175][26022] Updated weights on worker 0-0, policy_version 1256420 (0.00083) [2022-07-11 15:46:45,985][26022] Updated weights on worker 0-0, policy_version 1256430 (0.00091) [2022-07-11 15:46:47,896][26022] Updated weights on worker 0-0, policy_version 1256440 (0.00092) [2022-07-11 15:46:48,596][25689] Fps is (10 sec: 5572.9, 60 sec: 5577.9, 300 sec: 5558.7). Total num frames: 1286598656. Throughput: 0: 5884.0. Samples: 1286601204. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:48,598][25689] Avg episode reward: [(0, '0.992')] [2022-07-11 15:46:49,491][26022] Updated weights on worker 0-0, policy_version 1256450 (0.00082) [2022-07-11 15:46:51,451][26022] Updated weights on worker 0-0, policy_version 1256460 (0.00085) [2022-07-11 15:46:53,107][26022] Updated weights on worker 0-0, policy_version 1256470 (0.00085) [2022-07-11 15:46:53,605][25689] Fps is (10 sec: 5620.0, 60 sec: 5560.6, 300 sec: 5552.3). Total num frames: 1286626304. Throughput: 0: 5883.2. Samples: 1286634742. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:53,609][25689] Avg episode reward: [(0, '1.221')] [2022-07-11 15:46:55,138][26022] Updated weights on worker 0-0, policy_version 1256480 (0.00084) [2022-07-11 15:46:57,036][26022] Updated weights on worker 0-0, policy_version 1256490 (0.00089) [2022-07-11 15:46:58,692][25689] Fps is (10 sec: 5680.4, 60 sec: 5575.7, 300 sec: 5556.6). Total num frames: 1286656000. Throughput: 0: 5005.2. Samples: 1286651578. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:46:58,692][25689] Avg episode reward: [(0, '1.571')] [2022-07-11 15:46:58,698][26022] Updated weights on worker 0-0, policy_version 1256500 (0.00088) [2022-07-11 15:47:00,871][26022] Updated weights on worker 0-0, policy_version 1256510 (0.00088) [2022-07-11 15:47:02,868][26022] Updated weights on worker 0-0, policy_version 1256520 (0.00082) [2022-07-11 15:47:03,757][25689] Fps is (10 sec: 5446.7, 60 sec: 5576.7, 300 sec: 5552.6). Total num frames: 1286681600. Throughput: 0: 5739.2. Samples: 1286683084. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:03,758][25689] Avg episode reward: [(0, '0.267')] [2022-07-11 15:47:04,461][26022] Updated weights on worker 0-0, policy_version 1256530 (0.00082) [2022-07-11 15:47:06,787][26022] Updated weights on worker 0-0, policy_version 1256541 (0.00080) [2022-07-11 15:47:08,524][26022] Updated weights on worker 0-0, policy_version 1256551 (0.00084) [2022-07-11 15:47:08,800][25689] Fps is (10 sec: 5267.8, 60 sec: 5543.2, 300 sec: 5552.4). Total num frames: 1286709248. Throughput: 0: 5734.0. Samples: 1286717050. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:08,802][25689] Avg episode reward: [(0, '0.419')] [2022-07-11 15:47:10,198][26022] Updated weights on worker 0-0, policy_version 1256561 (0.00090) [2022-07-11 15:47:12,145][26022] Updated weights on worker 0-0, policy_version 1256571 (0.00084) [2022-07-11 15:47:13,839][25689] Fps is (10 sec: 5586.4, 60 sec: 5593.3, 300 sec: 5551.9). Total num frames: 1286737920. Throughput: 0: 4892.6. Samples: 1286733738. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:13,840][25689] Avg episode reward: [(0, '0.134')] [2022-07-11 15:47:13,860][26022] Updated weights on worker 0-0, policy_version 1256581 (0.00090) [2022-07-11 15:47:15,759][26022] Updated weights on worker 0-0, policy_version 1256591 (0.00088) [2022-07-11 15:47:17,548][26022] Updated weights on worker 0-0, policy_version 1256601 (0.00087) [2022-07-11 15:47:18,901][25689] Fps is (10 sec: 5677.2, 60 sec: 5588.5, 300 sec: 5556.3). Total num frames: 1286766592. Throughput: 0: 5745.5. Samples: 1286767690. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:18,901][25689] Avg episode reward: [(0, '0.201')] [2022-07-11 15:47:19,289][26022] Updated weights on worker 0-0, policy_version 1256611 (0.00090) [2022-07-11 15:47:21,330][26022] Updated weights on worker 0-0, policy_version 1256621 (0.00083) [2022-07-11 15:47:22,766][26022] Updated weights on worker 0-0, policy_version 1256631 (0.00085) [2022-07-11 15:47:23,953][25689] Fps is (10 sec: 5568.5, 60 sec: 5561.9, 300 sec: 5559.2). Total num frames: 1286794240. Throughput: 0: 5856.7. Samples: 1286801364. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:23,953][25689] Avg episode reward: [(0, '0.108')] [2022-07-11 15:47:24,882][26022] Updated weights on worker 0-0, policy_version 1256641 (0.00081) [2022-07-11 15:47:26,520][26022] Updated weights on worker 0-0, policy_version 1256651 (0.00085) [2022-07-11 15:47:28,571][26022] Updated weights on worker 0-0, policy_version 1256661 (0.00084) [2022-07-11 15:47:28,973][25689] Fps is (10 sec: 5693.4, 60 sec: 5619.2, 300 sec: 5558.9). Total num frames: 1286823936. Throughput: 0: 5005.5. Samples: 1286818024. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:28,973][25689] Avg episode reward: [(0, '0.308')] [2022-07-11 15:47:30,375][26022] Updated weights on worker 0-0, policy_version 1256671 (0.00086) [2022-07-11 15:47:32,231][26022] Updated weights on worker 0-0, policy_version 1256681 (0.00082) [2022-07-11 15:47:33,942][26022] Updated weights on worker 0-0, policy_version 1256691 (0.00091) [2022-07-11 15:47:34,027][25689] Fps is (10 sec: 5692.3, 60 sec: 5615.6, 300 sec: 5557.9). Total num frames: 1286851584. Throughput: 0: 5837.9. Samples: 1286851594. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:34,027][25689] Avg episode reward: [(0, '1.332')] [2022-07-11 15:47:35,852][26022] Updated weights on worker 0-0, policy_version 1256701 (0.00092) [2022-07-11 15:47:37,887][26022] Updated weights on worker 0-0, policy_version 1256711 (0.00082) [2022-07-11 15:47:39,059][25689] Fps is (10 sec: 5380.7, 60 sec: 5545.3, 300 sec: 5556.0). Total num frames: 1286878208. Throughput: 0: 5825.1. Samples: 1286885114. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:39,059][25689] Avg episode reward: [(0, '1.448')] [2022-07-11 15:47:39,528][26022] Updated weights on worker 0-0, policy_version 1256721 (0.00092) [2022-07-11 15:47:41,384][26022] Updated weights on worker 0-0, policy_version 1256731 (0.00066) [2022-07-11 15:47:43,375][26022] Updated weights on worker 0-0, policy_version 1256741 (0.00089) [2022-07-11 15:47:44,103][25689] Fps is (10 sec: 5589.2, 60 sec: 5583.6, 300 sec: 5559.0). Total num frames: 1286907904. Throughput: 0: 4978.7. Samples: 1286901688. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:44,104][25689] Avg episode reward: [(0, '1.802')] [2022-07-11 15:47:45,161][26022] Updated weights on worker 0-0, policy_version 1256751 (0.00084) [2022-07-11 15:47:46,903][26022] Updated weights on worker 0-0, policy_version 1256761 (0.00088) [2022-07-11 15:47:48,785][26022] Updated weights on worker 0-0, policy_version 1256771 (0.00084) [2022-07-11 15:47:49,182][25689] Fps is (10 sec: 5563.5, 60 sec: 5543.8, 300 sec: 5554.9). Total num frames: 1286934528. Throughput: 0: 5805.1. Samples: 1286935342. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:49,183][25689] Avg episode reward: [(0, '1.687')] [2022-07-11 15:47:50,554][26022] Updated weights on worker 0-0, policy_version 1256781 (0.00089) [2022-07-11 15:47:52,494][26022] Updated weights on worker 0-0, policy_version 1256791 (0.00112) [2022-07-11 15:47:54,170][26022] Updated weights on worker 0-0, policy_version 1256801 (0.00089) [2022-07-11 15:47:54,199][25689] Fps is (10 sec: 5578.8, 60 sec: 5576.8, 300 sec: 5554.7). Total num frames: 1286964224. Throughput: 0: 5831.1. Samples: 1286969218. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:54,199][25689] Avg episode reward: [(0, '1.623')] [2022-07-11 15:47:56,178][26022] Updated weights on worker 0-0, policy_version 1256811 (0.00086) [2022-07-11 15:47:57,889][26022] Updated weights on worker 0-0, policy_version 1256821 (0.00091) [2022-07-11 15:47:59,222][25689] Fps is (10 sec: 5711.6, 60 sec: 5548.8, 300 sec: 5565.5). Total num frames: 1286991872. Throughput: 0: 5005.9. Samples: 1286986048. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:47:59,222][25689] Avg episode reward: [(0, '1.862')] [2022-07-11 15:47:59,767][26022] Updated weights on worker 0-0, policy_version 1256831 (0.00081) [2022-07-11 15:48:01,469][26022] Updated weights on worker 0-0, policy_version 1256841 (0.00087) [2022-07-11 15:48:03,808][26022] Updated weights on worker 0-0, policy_version 1256851 (0.00081) [2022-07-11 15:48:04,355][25689] Fps is (10 sec: 5343.5, 60 sec: 5559.5, 300 sec: 5553.7). Total num frames: 1287018496. Throughput: 0: 5761.6. Samples: 1287018372. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:04,358][25689] Avg episode reward: [(0, '1.136')] [2022-07-11 15:48:05,716][26022] Updated weights on worker 0-0, policy_version 1256861 (0.00090) [2022-07-11 15:48:07,289][26022] Updated weights on worker 0-0, policy_version 1256871 (0.00087) [2022-07-11 15:48:09,237][26022] Updated weights on worker 0-0, policy_version 1256881 (0.00090) [2022-07-11 15:48:09,421][25689] Fps is (10 sec: 5421.7, 60 sec: 5574.3, 300 sec: 5556.2). Total num frames: 1287047168. Throughput: 0: 5727.0. Samples: 1287051250. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:09,421][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 15:48:11,207][26022] Updated weights on worker 0-0, policy_version 1256891 (0.00079) [2022-07-11 15:48:12,838][26022] Updated weights on worker 0-0, policy_version 1256901 (0.00085) [2022-07-11 15:48:14,473][25689] Fps is (10 sec: 5465.5, 60 sec: 5539.3, 300 sec: 5552.2). Total num frames: 1287073792. Throughput: 0: 5683.2. Samples: 1287084440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:14,473][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 15:48:14,879][26022] Updated weights on worker 0-0, policy_version 1256911 (0.00095) [2022-07-11 15:48:16,500][26022] Updated weights on worker 0-0, policy_version 1256921 (0.00084) [2022-07-11 15:48:18,326][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:48:18,336][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001256930_1287096320.pth [2022-07-11 15:48:18,351][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001254977_1285096448.pth [2022-07-11 15:48:18,546][26022] Updated weights on worker 0-0, policy_version 1256931 (0.00083) [2022-07-11 15:48:19,533][25689] Fps is (10 sec: 5569.7, 60 sec: 5556.4, 300 sec: 5555.7). Total num frames: 1287103488. Throughput: 0: 5684.8. Samples: 1287101510. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:19,535][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 15:48:20,234][26022] Updated weights on worker 0-0, policy_version 1256941 (0.00106) [2022-07-11 15:48:22,159][26022] Updated weights on worker 0-0, policy_version 1256951 (0.00091) [2022-07-11 15:48:23,775][26022] Updated weights on worker 0-0, policy_version 1256961 (0.00082) [2022-07-11 15:48:24,577][25689] Fps is (10 sec: 5776.8, 60 sec: 5574.0, 300 sec: 5562.6). Total num frames: 1287132160. Throughput: 0: 5787.9. Samples: 1287135410. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:24,577][25689] Avg episode reward: [(0, '0.513')] [2022-07-11 15:48:25,742][26022] Updated weights on worker 0-0, policy_version 1256971 (0.00092) [2022-07-11 15:48:27,377][26022] Updated weights on worker 0-0, policy_version 1256981 (0.00086) [2022-07-11 15:48:29,512][26022] Updated weights on worker 0-0, policy_version 1256991 (0.00093) [2022-07-11 15:48:29,607][25689] Fps is (10 sec: 5590.7, 60 sec: 5539.3, 300 sec: 5555.3). Total num frames: 1287159808. Throughput: 0: 5840.2. Samples: 1287169138. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:29,608][25689] Avg episode reward: [(0, '0.561')] [2022-07-11 15:48:31,010][26022] Updated weights on worker 0-0, policy_version 1257001 (0.00094) [2022-07-11 15:48:33,050][26022] Updated weights on worker 0-0, policy_version 1257011 (0.00087) [2022-07-11 15:48:34,504][26022] Updated weights on worker 0-0, policy_version 1257021 (0.00087) [2022-07-11 15:48:34,635][25689] Fps is (10 sec: 5803.1, 60 sec: 5592.4, 300 sec: 5565.5). Total num frames: 1287190528. Throughput: 0: 5039.8. Samples: 1287186052. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:34,635][25689] Avg episode reward: [(0, '0.960')] [2022-07-11 15:48:36,763][26022] Updated weights on worker 0-0, policy_version 1257031 (0.00091) [2022-07-11 15:48:38,237][26022] Updated weights on worker 0-0, policy_version 1257041 (0.00089) [2022-07-11 15:48:39,654][25689] Fps is (10 sec: 5503.8, 60 sec: 5559.8, 300 sec: 5560.0). Total num frames: 1287215104. Throughput: 0: 5875.7. Samples: 1287219734. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:39,655][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 15:48:40,385][26022] Updated weights on worker 0-0, policy_version 1257051 (0.00085) [2022-07-11 15:48:41,912][26022] Updated weights on worker 0-0, policy_version 1257061 (0.00087) [2022-07-11 15:48:43,783][26022] Updated weights on worker 0-0, policy_version 1257071 (0.00091) [2022-07-11 15:48:44,750][25689] Fps is (10 sec: 5466.4, 60 sec: 5571.9, 300 sec: 5565.8). Total num frames: 1287245824. Throughput: 0: 5847.1. Samples: 1287253368. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2022-07-11 15:48:44,755][25689] Avg episode reward: [(0, '0.781')] [2022-07-11 15:48:45,863][26022] Updated weights on worker 0-0, policy_version 1257081 (0.00090) [2022-07-11 15:48:47,292][26022] Updated weights on worker 0-0, policy_version 1257091 (0.00082) [2022-07-11 15:48:49,383][26022] Updated weights on worker 0-0, policy_version 1257101 (0.00095) [2022-07-11 15:48:49,821][25689] Fps is (10 sec: 5740.9, 60 sec: 5589.6, 300 sec: 5565.0). Total num frames: 1287273472. Throughput: 0: 4999.8. Samples: 1287270204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:48:49,821][25689] Avg episode reward: [(0, '1.021')] [2022-07-11 15:48:51,284][26022] Updated weights on worker 0-0, policy_version 1257111 (0.00090) [2022-07-11 15:48:53,106][26022] Updated weights on worker 0-0, policy_version 1257121 (0.00089) [2022-07-11 15:48:54,832][25689] Fps is (10 sec: 5383.3, 60 sec: 5539.4, 300 sec: 5565.5). Total num frames: 1287300096. Throughput: 0: 5806.4. Samples: 1287303326. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:48:54,833][25689] Avg episode reward: [(0, '0.863')] [2022-07-11 15:48:55,192][26022] Updated weights on worker 0-0, policy_version 1257131 (0.00088) [2022-07-11 15:48:56,571][26022] Updated weights on worker 0-0, policy_version 1257141 (0.00088) [2022-07-11 15:48:58,748][26022] Updated weights on worker 0-0, policy_version 1257151 (0.00080) [2022-07-11 15:48:59,912][25689] Fps is (10 sec: 5581.2, 60 sec: 5568.0, 300 sec: 5568.6). Total num frames: 1287329792. Throughput: 0: 5779.0. Samples: 1287336806. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:48:59,912][25689] Avg episode reward: [(0, '0.785')] [2022-07-11 15:49:00,579][26022] Updated weights on worker 0-0, policy_version 1257161 (0.00086) [2022-07-11 15:49:02,944][26022] Updated weights on worker 0-0, policy_version 1257171 (0.00094) [2022-07-11 15:49:04,670][26022] Updated weights on worker 0-0, policy_version 1257181 (0.00106) [2022-07-11 15:49:05,031][25689] Fps is (10 sec: 5522.3, 60 sec: 5569.3, 300 sec: 5567.2). Total num frames: 1287356416. Throughput: 0: 4837.6. Samples: 1287351476. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:05,031][25689] Avg episode reward: [(0, '0.727')] [2022-07-11 15:49:06,403][26022] Updated weights on worker 0-0, policy_version 1257191 (0.00083) [2022-07-11 15:49:08,233][26022] Updated weights on worker 0-0, policy_version 1257201 (0.00087) [2022-07-11 15:49:10,064][25689] Fps is (10 sec: 5244.7, 60 sec: 5538.5, 300 sec: 5564.3). Total num frames: 1287383040. Throughput: 0: 5656.0. Samples: 1287384702. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:10,065][25689] Avg episode reward: [(0, '0.706')] [2022-07-11 15:49:10,212][26022] Updated weights on worker 0-0, policy_version 1257211 (0.00089) [2022-07-11 15:49:11,950][26022] Updated weights on worker 0-0, policy_version 1257221 (0.00089) [2022-07-11 15:49:13,803][26022] Updated weights on worker 0-0, policy_version 1257231 (0.00085) [2022-07-11 15:49:15,078][25689] Fps is (10 sec: 5401.9, 60 sec: 5558.9, 300 sec: 5561.0). Total num frames: 1287410688. Throughput: 0: 5677.0. Samples: 1287418260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:15,078][25689] Avg episode reward: [(0, '0.659')] [2022-07-11 15:49:15,499][26022] Updated weights on worker 0-0, policy_version 1257241 (0.00079) [2022-07-11 15:49:17,675][26022] Updated weights on worker 0-0, policy_version 1257251 (0.00083) [2022-07-11 15:49:19,350][26022] Updated weights on worker 0-0, policy_version 1257261 (0.00560) [2022-07-11 15:49:20,103][25689] Fps is (10 sec: 5712.5, 60 sec: 5562.1, 300 sec: 5562.5). Total num frames: 1287440384. Throughput: 0: 4868.4. Samples: 1287435104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:20,103][25689] Avg episode reward: [(0, '0.509')] [2022-07-11 15:49:21,273][26022] Updated weights on worker 0-0, policy_version 1257271 (0.00083) [2022-07-11 15:49:22,958][26022] Updated weights on worker 0-0, policy_version 1257281 (0.00090) [2022-07-11 15:49:24,843][26022] Updated weights on worker 0-0, policy_version 1257291 (0.00086) [2022-07-11 15:49:25,155][25689] Fps is (10 sec: 5588.9, 60 sec: 5527.5, 300 sec: 5558.2). Total num frames: 1287467008. Throughput: 0: 5830.1. Samples: 1287468802. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:25,155][25689] Avg episode reward: [(0, '0.439')] [2022-07-11 15:49:26,631][26022] Updated weights on worker 0-0, policy_version 1257301 (0.00081) [2022-07-11 15:49:28,445][26022] Updated weights on worker 0-0, policy_version 1257311 (0.00086) [2022-07-11 15:49:30,156][25689] Fps is (10 sec: 5500.1, 60 sec: 5547.1, 300 sec: 5558.6). Total num frames: 1287495680. Throughput: 0: 5858.2. Samples: 1287502404. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:30,157][25689] Avg episode reward: [(0, '0.340')] [2022-07-11 15:49:30,189][26022] Updated weights on worker 0-0, policy_version 1257321 (0.00092) [2022-07-11 15:49:31,875][26022] Updated weights on worker 0-0, policy_version 1257331 (0.00097) [2022-07-11 15:49:34,020][26022] Updated weights on worker 0-0, policy_version 1257341 (0.00088) [2022-07-11 15:49:35,166][25689] Fps is (10 sec: 5829.9, 60 sec: 5531.8, 300 sec: 5562.4). Total num frames: 1287525376. Throughput: 0: 5022.2. Samples: 1287519150. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:35,168][25689] Avg episode reward: [(0, '0.465')] [2022-07-11 15:49:35,691][26022] Updated weights on worker 0-0, policy_version 1257351 (0.00088) [2022-07-11 15:49:37,561][26022] Updated weights on worker 0-0, policy_version 1257361 (0.00091) [2022-07-11 15:49:39,366][26022] Updated weights on worker 0-0, policy_version 1257371 (0.00092) [2022-07-11 15:49:40,194][25689] Fps is (10 sec: 5406.9, 60 sec: 5531.0, 300 sec: 5556.0). Total num frames: 1287549952. Throughput: 0: 5867.6. Samples: 1287552992. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:40,194][25689] Avg episode reward: [(0, '1.212')] [2022-07-11 15:49:41,186][26022] Updated weights on worker 0-0, policy_version 1257381 (0.00083) [2022-07-11 15:49:43,304][26022] Updated weights on worker 0-0, policy_version 1257391 (0.00091) [2022-07-11 15:49:44,763][26022] Updated weights on worker 0-0, policy_version 1257401 (0.00086) [2022-07-11 15:49:45,324][25689] Fps is (10 sec: 5645.4, 60 sec: 5561.7, 300 sec: 5567.4). Total num frames: 1287582720. Throughput: 0: 5836.7. Samples: 1287586524. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:45,325][25689] Avg episode reward: [(0, '1.436')] [2022-07-11 15:49:46,843][26022] Updated weights on worker 0-0, policy_version 1257411 (0.00087) [2022-07-11 15:49:48,495][26022] Updated weights on worker 0-0, policy_version 1257421 (0.00088) [2022-07-11 15:49:50,374][25689] Fps is (10 sec: 5733.4, 60 sec: 5529.8, 300 sec: 5559.9). Total num frames: 1287608320. Throughput: 0: 4992.1. Samples: 1287603334. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:50,375][25689] Avg episode reward: [(0, '1.718')] [2022-07-11 15:49:50,393][26022] Updated weights on worker 0-0, policy_version 1257431 (0.00085) [2022-07-11 15:49:52,158][26022] Updated weights on worker 0-0, policy_version 1257441 (0.00085) [2022-07-11 15:49:54,028][26022] Updated weights on worker 0-0, policy_version 1257451 (0.00091) [2022-07-11 15:49:55,397][25689] Fps is (10 sec: 5387.9, 60 sec: 5562.5, 300 sec: 5559.7). Total num frames: 1287636992. Throughput: 0: 5828.1. Samples: 1287637056. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:49:55,398][25689] Avg episode reward: [(0, '1.892')] [2022-07-11 15:49:55,771][26022] Updated weights on worker 0-0, policy_version 1257461 (0.00090) [2022-07-11 15:49:57,783][26022] Updated weights on worker 0-0, policy_version 1257471 (0.00603) [2022-07-11 15:49:59,399][26022] Updated weights on worker 0-0, policy_version 1257481 (0.00087) [2022-07-11 15:50:00,410][25689] Fps is (10 sec: 5713.9, 60 sec: 5551.7, 300 sec: 5570.6). Total num frames: 1287665664. Throughput: 0: 5826.3. Samples: 1287670778. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:00,412][25689] Avg episode reward: [(0, '1.521')] [2022-07-11 15:50:01,462][26022] Updated weights on worker 0-0, policy_version 1257491 (0.00089) [2022-07-11 15:50:03,212][26022] Updated weights on worker 0-0, policy_version 1257501 (0.00090) [2022-07-11 15:50:05,410][26022] Updated weights on worker 0-0, policy_version 1257511 (0.00088) [2022-07-11 15:50:05,472][25689] Fps is (10 sec: 5386.8, 60 sec: 5540.0, 300 sec: 5559.7). Total num frames: 1287691264. Throughput: 0: 4932.7. Samples: 1287685908. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:05,473][25689] Avg episode reward: [(0, '1.436')] [2022-07-11 15:50:07,099][26022] Updated weights on worker 0-0, policy_version 1257521 (0.00084) [2022-07-11 15:50:08,819][26022] Updated weights on worker 0-0, policy_version 1257531 (0.00090) [2022-07-11 15:50:10,494][25689] Fps is (10 sec: 5382.0, 60 sec: 5575.0, 300 sec: 5566.4). Total num frames: 1287719936. Throughput: 0: 5785.0. Samples: 1287719728. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:10,495][25689] Avg episode reward: [(0, '0.865')] [2022-07-11 15:50:10,918][26022] Updated weights on worker 0-0, policy_version 1257541 (0.00088) [2022-07-11 15:50:12,541][26022] Updated weights on worker 0-0, policy_version 1257551 (0.00065) [2022-07-11 15:50:14,560][26022] Updated weights on worker 0-0, policy_version 1257561 (0.00084) [2022-07-11 15:50:15,501][25689] Fps is (10 sec: 5717.8, 60 sec: 5592.5, 300 sec: 5563.1). Total num frames: 1287748608. Throughput: 0: 5772.3. Samples: 1287753104. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:15,503][25689] Avg episode reward: [(0, '0.481')] [2022-07-11 15:50:16,460][26022] Updated weights on worker 0-0, policy_version 1257571 (0.00088) [2022-07-11 15:50:18,027][26022] Updated weights on worker 0-0, policy_version 1257581 (0.00084) [2022-07-11 15:50:18,363][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:50:18,374][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001257583_1287764992.pth [2022-07-11 15:50:18,374][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001255626_1285761024.pth [2022-07-11 15:50:20,059][26022] Updated weights on worker 0-0, policy_version 1257591 (0.00092) [2022-07-11 15:50:20,526][25689] Fps is (10 sec: 5614.0, 60 sec: 5558.6, 300 sec: 5560.5). Total num frames: 1287776256. Throughput: 0: 4932.3. Samples: 1287769996. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:20,528][25689] Avg episode reward: [(0, '0.650')] [2022-07-11 15:50:21,704][26022] Updated weights on worker 0-0, policy_version 1257601 (0.00079) [2022-07-11 15:50:23,540][26022] Updated weights on worker 0-0, policy_version 1257611 (0.00099) [2022-07-11 15:50:25,473][26022] Updated weights on worker 0-0, policy_version 1257621 (0.00095) [2022-07-11 15:50:25,571][25689] Fps is (10 sec: 5593.2, 60 sec: 5593.2, 300 sec: 5567.4). Total num frames: 1287804928. Throughput: 0: 5873.5. Samples: 1287803956. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:25,571][25689] Avg episode reward: [(0, '0.573')] [2022-07-11 15:50:27,200][26022] Updated weights on worker 0-0, policy_version 1257631 (0.00083) [2022-07-11 15:50:29,187][26022] Updated weights on worker 0-0, policy_version 1257641 (0.00088) [2022-07-11 15:50:30,578][25689] Fps is (10 sec: 5603.0, 60 sec: 5575.7, 300 sec: 5561.3). Total num frames: 1287832576. Throughput: 0: 5860.8. Samples: 1287837436. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:30,580][25689] Avg episode reward: [(0, '0.518')] [2022-07-11 15:50:30,846][26022] Updated weights on worker 0-0, policy_version 1257651 (0.00093) [2022-07-11 15:50:32,748][26022] Updated weights on worker 0-0, policy_version 1257661 (0.00085) [2022-07-11 15:50:34,525][26022] Updated weights on worker 0-0, policy_version 1257671 (0.00081) [2022-07-11 15:50:35,591][25689] Fps is (10 sec: 5518.6, 60 sec: 5541.6, 300 sec: 5561.4). Total num frames: 1287860224. Throughput: 0: 5023.6. Samples: 1287854026. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:35,592][25689] Avg episode reward: [(0, '0.343')] [2022-07-11 15:50:36,458][26022] Updated weights on worker 0-0, policy_version 1257681 (0.00087) [2022-07-11 15:50:38,262][26022] Updated weights on worker 0-0, policy_version 1257691 (0.00091) [2022-07-11 15:50:39,948][26022] Updated weights on worker 0-0, policy_version 1257701 (0.00098) [2022-07-11 15:50:40,610][25689] Fps is (10 sec: 5613.9, 60 sec: 5610.1, 300 sec: 5566.6). Total num frames: 1287888896. Throughput: 0: 5873.7. Samples: 1287887964. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:40,612][25689] Avg episode reward: [(0, '0.714')] [2022-07-11 15:50:41,777][26022] Updated weights on worker 0-0, policy_version 1257711 (0.00089) [2022-07-11 15:50:43,724][26022] Updated weights on worker 0-0, policy_version 1257721 (0.00091) [2022-07-11 15:50:45,341][26022] Updated weights on worker 0-0, policy_version 1257731 (0.00090) [2022-07-11 15:50:45,730][25689] Fps is (10 sec: 5655.4, 60 sec: 5543.3, 300 sec: 5565.1). Total num frames: 1287917568. Throughput: 0: 5844.5. Samples: 1287921780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:45,732][25689] Avg episode reward: [(0, '1.005')] [2022-07-11 15:50:47,431][26022] Updated weights on worker 0-0, policy_version 1257741 (0.00083) [2022-07-11 15:50:48,974][26022] Updated weights on worker 0-0, policy_version 1257751 (0.00082) [2022-07-11 15:50:50,759][25689] Fps is (10 sec: 5549.3, 60 sec: 5579.1, 300 sec: 5561.2). Total num frames: 1287945216. Throughput: 0: 5003.5. Samples: 1287938412. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:50,761][25689] Avg episode reward: [(0, '1.508')] [2022-07-11 15:50:50,989][26022] Updated weights on worker 0-0, policy_version 1257761 (0.00093) [2022-07-11 15:50:52,883][26022] Updated weights on worker 0-0, policy_version 1257771 (0.00093) [2022-07-11 15:50:54,601][26022] Updated weights on worker 0-0, policy_version 1257781 (0.00091) [2022-07-11 15:50:55,822][25689] Fps is (10 sec: 5580.9, 60 sec: 5575.5, 300 sec: 5561.3). Total num frames: 1287973888. Throughput: 0: 5825.4. Samples: 1287971880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:50:55,822][25689] Avg episode reward: [(0, '1.596')] [2022-07-11 15:50:56,542][26022] Updated weights on worker 0-0, policy_version 1257791 (0.00085) [2022-07-11 15:50:58,392][26022] Updated weights on worker 0-0, policy_version 1257801 (0.00703) [2022-07-11 15:51:00,062][26022] Updated weights on worker 0-0, policy_version 1257811 (0.00086) [2022-07-11 15:51:00,829][25689] Fps is (10 sec: 5694.5, 60 sec: 5576.0, 300 sec: 5572.9). Total num frames: 1288002560. Throughput: 0: 5831.6. Samples: 1288005872. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:00,831][25689] Avg episode reward: [(0, '1.659')] [2022-07-11 15:51:02,536][26022] Updated weights on worker 0-0, policy_version 1257821 (0.00092) [2022-07-11 15:51:03,963][26022] Updated weights on worker 0-0, policy_version 1257831 (0.00094) [2022-07-11 15:51:05,918][25689] Fps is (10 sec: 5375.7, 60 sec: 5573.6, 300 sec: 5558.4). Total num frames: 1288028160. Throughput: 0: 5745.6. Samples: 1288037768. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:05,918][25689] Avg episode reward: [(0, '1.882')] [2022-07-11 15:51:06,010][26022] Updated weights on worker 0-0, policy_version 1257841 (0.00087) [2022-07-11 15:51:07,750][26022] Updated weights on worker 0-0, policy_version 1257851 (0.00087) [2022-07-11 15:51:09,714][26022] Updated weights on worker 0-0, policy_version 1257861 (0.00092) [2022-07-11 15:51:10,930][25689] Fps is (10 sec: 5373.2, 60 sec: 5574.5, 300 sec: 5569.1). Total num frames: 1288056832. Throughput: 0: 5753.8. Samples: 1288054468. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:10,930][25689] Avg episode reward: [(0, '1.555')] [2022-07-11 15:51:11,441][26022] Updated weights on worker 0-0, policy_version 1257871 (0.00088) [2022-07-11 15:51:13,273][26022] Updated weights on worker 0-0, policy_version 1257881 (0.00092) [2022-07-11 15:51:15,127][26022] Updated weights on worker 0-0, policy_version 1257891 (0.00145) [2022-07-11 15:51:15,931][25689] Fps is (10 sec: 5624.2, 60 sec: 5558.0, 300 sec: 5565.8). Total num frames: 1288084480. Throughput: 0: 5795.9. Samples: 1288088434. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:15,932][25689] Avg episode reward: [(0, '0.908')] [2022-07-11 15:51:16,825][26022] Updated weights on worker 0-0, policy_version 1257901 (0.00086) [2022-07-11 15:51:18,592][26022] Updated weights on worker 0-0, policy_version 1257911 (0.00087) [2022-07-11 15:51:20,692][26022] Updated weights on worker 0-0, policy_version 1257921 (0.00089) [2022-07-11 15:51:20,945][25689] Fps is (10 sec: 5521.1, 60 sec: 5559.1, 300 sec: 5561.1). Total num frames: 1288112128. Throughput: 0: 5786.2. Samples: 1288122266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:20,946][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 15:51:22,264][26022] Updated weights on worker 0-0, policy_version 1257931 (0.00087) [2022-07-11 15:51:24,175][26022] Updated weights on worker 0-0, policy_version 1257941 (0.00087) [2022-07-11 15:51:25,801][26022] Updated weights on worker 0-0, policy_version 1257951 (0.00097) [2022-07-11 15:51:26,031][25689] Fps is (10 sec: 5779.3, 60 sec: 5589.1, 300 sec: 5574.9). Total num frames: 1288142848. Throughput: 0: 5028.0. Samples: 1288138898. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:26,031][25689] Avg episode reward: [(0, '0.592')] [2022-07-11 15:51:27,865][26022] Updated weights on worker 0-0, policy_version 1257961 (0.00087) [2022-07-11 15:51:29,603][26022] Updated weights on worker 0-0, policy_version 1257971 (0.00088) [2022-07-11 15:51:31,087][25689] Fps is (10 sec: 5654.4, 60 sec: 5567.7, 300 sec: 5570.8). Total num frames: 1288169472. Throughput: 0: 5866.5. Samples: 1288172718. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:31,087][25689] Avg episode reward: [(0, '0.622')] [2022-07-11 15:51:31,577][26022] Updated weights on worker 0-0, policy_version 1257981 (0.00084) [2022-07-11 15:51:33,197][26022] Updated weights on worker 0-0, policy_version 1257991 (0.00094) [2022-07-11 15:51:35,218][26022] Updated weights on worker 0-0, policy_version 1258001 (0.00086) [2022-07-11 15:51:36,113][25689] Fps is (10 sec: 5484.8, 60 sec: 5583.4, 300 sec: 5563.5). Total num frames: 1288198144. Throughput: 0: 5858.4. Samples: 1288206664. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:36,113][25689] Avg episode reward: [(0, '0.871')] [2022-07-11 15:51:36,894][26022] Updated weights on worker 0-0, policy_version 1258011 (0.00093) [2022-07-11 15:51:38,882][26022] Updated weights on worker 0-0, policy_version 1258021 (0.00089) [2022-07-11 15:51:40,603][26022] Updated weights on worker 0-0, policy_version 1258031 (0.00080) [2022-07-11 15:51:41,212][25689] Fps is (10 sec: 5663.2, 60 sec: 5576.0, 300 sec: 5566.8). Total num frames: 1288226816. Throughput: 0: 4992.3. Samples: 1288223446. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:41,213][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 15:51:42,429][26022] Updated weights on worker 0-0, policy_version 1258041 (0.00088) [2022-07-11 15:51:44,251][26022] Updated weights on worker 0-0, policy_version 1258051 (0.00106) [2022-07-11 15:51:46,127][26022] Updated weights on worker 0-0, policy_version 1258061 (0.00362) [2022-07-11 15:51:46,276][25689] Fps is (10 sec: 5642.1, 60 sec: 5581.2, 300 sec: 5565.8). Total num frames: 1288255488. Throughput: 0: 5845.6. Samples: 1288257244. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:46,277][25689] Avg episode reward: [(0, '1.485')] [2022-07-11 15:51:48,034][26022] Updated weights on worker 0-0, policy_version 1258071 (0.00086) [2022-07-11 15:51:49,764][26022] Updated weights on worker 0-0, policy_version 1258081 (0.00090) [2022-07-11 15:51:51,283][25689] Fps is (10 sec: 5592.7, 60 sec: 5583.3, 300 sec: 5565.9). Total num frames: 1288283136. Throughput: 0: 5853.9. Samples: 1288290944. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:51,283][25689] Avg episode reward: [(0, '1.376')] [2022-07-11 15:51:51,552][26022] Updated weights on worker 0-0, policy_version 1258091 (0.00086) [2022-07-11 15:51:53,442][26022] Updated weights on worker 0-0, policy_version 1258101 (0.00093) [2022-07-11 15:51:55,227][26022] Updated weights on worker 0-0, policy_version 1258111 (0.00098) [2022-07-11 15:51:56,293][25689] Fps is (10 sec: 5622.6, 60 sec: 5588.1, 300 sec: 5563.8). Total num frames: 1288311808. Throughput: 0: 5013.7. Samples: 1288307842. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:51:56,293][25689] Avg episode reward: [(0, '1.666')] [2022-07-11 15:51:56,989][26022] Updated weights on worker 0-0, policy_version 1258121 (0.00095) [2022-07-11 15:51:58,893][26022] Updated weights on worker 0-0, policy_version 1258131 (0.00084) [2022-07-11 15:52:00,548][26022] Updated weights on worker 0-0, policy_version 1258141 (0.00087) [2022-07-11 15:52:01,299][25689] Fps is (10 sec: 5623.0, 60 sec: 5571.3, 300 sec: 5571.8). Total num frames: 1288339456. Throughput: 0: 5893.3. Samples: 1288341820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:01,299][25689] Avg episode reward: [(0, '1.822')] [2022-07-11 15:52:02,895][26022] Updated weights on worker 0-0, policy_version 1258151 (0.00088) [2022-07-11 15:52:04,559][26022] Updated weights on worker 0-0, policy_version 1258161 (0.00084) [2022-07-11 15:52:06,426][25689] Fps is (10 sec: 5456.8, 60 sec: 5601.6, 300 sec: 5570.3). Total num frames: 1288367104. Throughput: 0: 5778.0. Samples: 1288373672. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:06,427][25689] Avg episode reward: [(0, '2.073')] [2022-07-11 15:52:06,431][26022] Updated weights on worker 0-0, policy_version 1258171 (0.00086) [2022-07-11 15:52:08,161][26022] Updated weights on worker 0-0, policy_version 1258181 (0.00085) [2022-07-11 15:52:10,054][26022] Updated weights on worker 0-0, policy_version 1258191 (0.00085) [2022-07-11 15:52:11,516][25689] Fps is (10 sec: 5412.2, 60 sec: 5577.5, 300 sec: 5565.9). Total num frames: 1288394752. Throughput: 0: 4921.9. Samples: 1288390528. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:11,517][25689] Avg episode reward: [(0, '2.078')] [2022-07-11 15:52:11,898][26022] Updated weights on worker 0-0, policy_version 1258201 (0.00082) [2022-07-11 15:52:13,816][26022] Updated weights on worker 0-0, policy_version 1258211 (0.00089) [2022-07-11 15:52:15,360][26022] Updated weights on worker 0-0, policy_version 1258221 (0.00086) [2022-07-11 15:52:16,521][25689] Fps is (10 sec: 5579.2, 60 sec: 5594.1, 300 sec: 5566.9). Total num frames: 1288423424. Throughput: 0: 5766.2. Samples: 1288424480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:16,523][25689] Avg episode reward: [(0, '2.144')] [2022-07-11 15:52:17,551][26022] Updated weights on worker 0-0, policy_version 1258231 (0.00090) [2022-07-11 15:52:18,457][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:52:18,476][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001258238_1288435712.pth [2022-07-11 15:52:18,477][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001256278_1286428672.pth [2022-07-11 15:52:19,141][26022] Updated weights on worker 0-0, policy_version 1258241 (0.00086) [2022-07-11 15:52:21,148][26022] Updated weights on worker 0-0, policy_version 1258251 (0.00085) [2022-07-11 15:52:21,564][25689] Fps is (10 sec: 5706.8, 60 sec: 5608.3, 300 sec: 5570.5). Total num frames: 1288452096. Throughput: 0: 5741.3. Samples: 1288458168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:21,566][25689] Avg episode reward: [(0, '1.317')] [2022-07-11 15:52:22,630][26022] Updated weights on worker 0-0, policy_version 1258261 (0.00094) [2022-07-11 15:52:24,895][26022] Updated weights on worker 0-0, policy_version 1258271 (0.00090) [2022-07-11 15:52:26,226][26022] Updated weights on worker 0-0, policy_version 1258281 (0.00084) [2022-07-11 15:52:26,660][25689] Fps is (10 sec: 5756.7, 60 sec: 5590.4, 300 sec: 5569.1). Total num frames: 1288481792. Throughput: 0: 5000.5. Samples: 1288474852. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:26,661][25689] Avg episode reward: [(0, '1.470')] [2022-07-11 15:52:28,484][26022] Updated weights on worker 0-0, policy_version 1258291 (0.00057) [2022-07-11 15:52:29,875][26022] Updated weights on worker 0-0, policy_version 1258301 (0.00083) [2022-07-11 15:52:31,663][25689] Fps is (10 sec: 5475.4, 60 sec: 5578.4, 300 sec: 5563.2). Total num frames: 1288507392. Throughput: 0: 5863.5. Samples: 1288508662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:31,663][25689] Avg episode reward: [(0, '1.291')] [2022-07-11 15:52:31,984][26022] Updated weights on worker 0-0, policy_version 1258311 (0.00096) [2022-07-11 15:52:33,726][26022] Updated weights on worker 0-0, policy_version 1258321 (0.00082) [2022-07-11 15:52:35,583][26022] Updated weights on worker 0-0, policy_version 1258331 (0.00088) [2022-07-11 15:52:36,700][25689] Fps is (10 sec: 5405.9, 60 sec: 5577.4, 300 sec: 5570.0). Total num frames: 1288536064. Throughput: 0: 5838.6. Samples: 1288542296. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:36,700][25689] Avg episode reward: [(0, '1.375')] [2022-07-11 15:52:37,597][26022] Updated weights on worker 0-0, policy_version 1258341 (0.00090) [2022-07-11 15:52:39,235][26022] Updated weights on worker 0-0, policy_version 1258351 (0.00087) [2022-07-11 15:52:41,195][26022] Updated weights on worker 0-0, policy_version 1258361 (0.00087) [2022-07-11 15:52:41,729][25689] Fps is (10 sec: 5696.7, 60 sec: 5583.9, 300 sec: 5566.8). Total num frames: 1288564736. Throughput: 0: 4999.9. Samples: 1288558992. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:41,730][25689] Avg episode reward: [(0, '1.404')] [2022-07-11 15:52:42,922][26022] Updated weights on worker 0-0, policy_version 1258371 (0.00088) [2022-07-11 15:52:44,759][26022] Updated weights on worker 0-0, policy_version 1258381 (0.00092) [2022-07-11 15:52:46,734][26022] Updated weights on worker 0-0, policy_version 1258391 (0.00087) [2022-07-11 15:52:46,850][25689] Fps is (10 sec: 5649.2, 60 sec: 5578.6, 300 sec: 5572.9). Total num frames: 1288593408. Throughput: 0: 5826.9. Samples: 1288592498. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:46,851][25689] Avg episode reward: [(0, '1.588')] [2022-07-11 15:52:48,346][26022] Updated weights on worker 0-0, policy_version 1258401 (0.00176) [2022-07-11 15:52:50,254][26022] Updated weights on worker 0-0, policy_version 1258411 (0.00089) [2022-07-11 15:52:51,856][25689] Fps is (10 sec: 5561.6, 60 sec: 5578.7, 300 sec: 5566.2). Total num frames: 1288621056. Throughput: 0: 5807.0. Samples: 1288625922. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:51,856][25689] Avg episode reward: [(0, '2.241')] [2022-07-11 15:52:52,376][26022] Updated weights on worker 0-0, policy_version 1258421 (0.00088) [2022-07-11 15:52:53,938][26022] Updated weights on worker 0-0, policy_version 1258431 (0.00082) [2022-07-11 15:52:55,916][26022] Updated weights on worker 0-0, policy_version 1258441 (0.00093) [2022-07-11 15:52:56,886][25689] Fps is (10 sec: 5510.2, 60 sec: 5560.0, 300 sec: 5566.1). Total num frames: 1288648704. Throughput: 0: 4968.3. Samples: 1288642586. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:52:56,886][25689] Avg episode reward: [(0, '2.298')] [2022-07-11 15:52:57,606][26022] Updated weights on worker 0-0, policy_version 1258451 (0.01031) [2022-07-11 15:52:59,463][26022] Updated weights on worker 0-0, policy_version 1258461 (0.00091) [2022-07-11 15:53:01,304][26022] Updated weights on worker 0-0, policy_version 1258471 (0.00093) [2022-07-11 15:53:01,898][25689] Fps is (10 sec: 5608.1, 60 sec: 5576.3, 300 sec: 5575.3). Total num frames: 1288677376. Throughput: 0: 5821.1. Samples: 1288676400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:01,901][25689] Avg episode reward: [(0, '2.241')] [2022-07-11 15:53:03,663][26022] Updated weights on worker 0-0, policy_version 1258481 (0.00082) [2022-07-11 15:53:05,214][26022] Updated weights on worker 0-0, policy_version 1258491 (0.00073) [2022-07-11 15:53:07,024][25689] Fps is (10 sec: 5454.1, 60 sec: 5559.5, 300 sec: 5567.3). Total num frames: 1288704000. Throughput: 0: 5733.3. Samples: 1288708160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:07,025][25689] Avg episode reward: [(0, '2.107')] [2022-07-11 15:53:07,202][26022] Updated weights on worker 0-0, policy_version 1258501 (0.00094) [2022-07-11 15:53:09,088][26022] Updated weights on worker 0-0, policy_version 1258511 (0.00090) [2022-07-11 15:53:10,772][26022] Updated weights on worker 0-0, policy_version 1258521 (0.00089) [2022-07-11 15:53:12,049][25689] Fps is (10 sec: 5346.8, 60 sec: 5565.5, 300 sec: 5571.2). Total num frames: 1288731648. Throughput: 0: 5733.3. Samples: 1288741694. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:12,049][25689] Avg episode reward: [(0, '1.557')] [2022-07-11 15:53:12,783][26022] Updated weights on worker 0-0, policy_version 1258531 (0.00086) [2022-07-11 15:53:14,504][26022] Updated weights on worker 0-0, policy_version 1258541 (0.00089) [2022-07-11 15:53:16,563][26022] Updated weights on worker 0-0, policy_version 1258551 (0.00082) [2022-07-11 15:53:17,131][25689] Fps is (10 sec: 5673.6, 60 sec: 5575.3, 300 sec: 5570.8). Total num frames: 1288761344. Throughput: 0: 5722.9. Samples: 1288758450. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:17,133][25689] Avg episode reward: [(0, '0.360')] [2022-07-11 15:53:18,092][26022] Updated weights on worker 0-0, policy_version 1258561 (0.00091) [2022-07-11 15:53:20,077][26022] Updated weights on worker 0-0, policy_version 1258571 (0.00096) [2022-07-11 15:53:21,947][26022] Updated weights on worker 0-0, policy_version 1258581 (0.00084) [2022-07-11 15:53:22,167][25689] Fps is (10 sec: 5566.1, 60 sec: 5542.2, 300 sec: 5564.1). Total num frames: 1288787968. Throughput: 0: 5701.1. Samples: 1288791954. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:22,167][25689] Avg episode reward: [(0, '0.090')] [2022-07-11 15:53:23,661][26022] Updated weights on worker 0-0, policy_version 1258591 (0.00083) [2022-07-11 15:53:25,382][26022] Updated weights on worker 0-0, policy_version 1258601 (0.00090) [2022-07-11 15:53:27,303][25689] Fps is (10 sec: 5436.4, 60 sec: 5521.7, 300 sec: 5565.5). Total num frames: 1288816640. Throughput: 0: 5795.6. Samples: 1288825686. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:27,303][25689] Avg episode reward: [(0, '0.129')] [2022-07-11 15:53:27,358][26022] Updated weights on worker 0-0, policy_version 1258611 (0.00090) [2022-07-11 15:53:29,068][26022] Updated weights on worker 0-0, policy_version 1258621 (0.00091) [2022-07-11 15:53:31,306][26022] Updated weights on worker 0-0, policy_version 1258631 (0.00090) [2022-07-11 15:53:32,401][25689] Fps is (10 sec: 5703.4, 60 sec: 5580.4, 300 sec: 5560.8). Total num frames: 1288846336. Throughput: 0: 4935.7. Samples: 1288842146. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:32,401][25689] Avg episode reward: [(0, '-0.148')] [2022-07-11 15:53:32,806][26022] Updated weights on worker 0-0, policy_version 1258641 (0.00091) [2022-07-11 15:53:34,718][26022] Updated weights on worker 0-0, policy_version 1258651 (0.00090) [2022-07-11 15:53:36,475][26022] Updated weights on worker 0-0, policy_version 1258661 (0.00087) [2022-07-11 15:53:37,464][25689] Fps is (10 sec: 5542.6, 60 sec: 5544.3, 300 sec: 5566.8). Total num frames: 1288872960. Throughput: 0: 5782.7. Samples: 1288876028. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:37,466][25689] Avg episode reward: [(0, '0.086')] [2022-07-11 15:53:38,115][26022] Updated weights on worker 0-0, policy_version 1258671 (0.00085) [2022-07-11 15:53:40,270][26022] Updated weights on worker 0-0, policy_version 1258681 (0.00100) [2022-07-11 15:53:41,769][26022] Updated weights on worker 0-0, policy_version 1258691 (0.00085) [2022-07-11 15:53:42,468][25689] Fps is (10 sec: 5594.5, 60 sec: 5563.5, 300 sec: 5565.1). Total num frames: 1288902656. Throughput: 0: 5797.3. Samples: 1288909644. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:42,469][25689] Avg episode reward: [(0, '0.088')] [2022-07-11 15:53:43,664][26022] Updated weights on worker 0-0, policy_version 1258701 (0.00085) [2022-07-11 15:53:45,756][26022] Updated weights on worker 0-0, policy_version 1258711 (0.00094) [2022-07-11 15:53:47,365][26022] Updated weights on worker 0-0, policy_version 1258721 (0.00085) [2022-07-11 15:53:47,505][25689] Fps is (10 sec: 5710.8, 60 sec: 5554.3, 300 sec: 5565.7). Total num frames: 1288930304. Throughput: 0: 4996.1. Samples: 1288926620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:47,506][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 15:53:49,367][26022] Updated weights on worker 0-0, policy_version 1258731 (0.00090) [2022-07-11 15:53:51,044][26022] Updated weights on worker 0-0, policy_version 1258741 (0.00100) [2022-07-11 15:53:52,543][25689] Fps is (10 sec: 5488.4, 60 sec: 5551.4, 300 sec: 5568.7). Total num frames: 1288957952. Throughput: 0: 5832.1. Samples: 1288959616. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:52,543][25689] Avg episode reward: [(0, '1.106')] [2022-07-11 15:53:52,962][26022] Updated weights on worker 0-0, policy_version 1258751 (0.00088) [2022-07-11 15:53:54,799][26022] Updated weights on worker 0-0, policy_version 1258761 (0.00088) [2022-07-11 15:53:56,658][26022] Updated weights on worker 0-0, policy_version 1258771 (0.00086) [2022-07-11 15:53:57,578][25689] Fps is (10 sec: 5591.4, 60 sec: 5567.7, 300 sec: 5566.1). Total num frames: 1288986624. Throughput: 0: 5840.5. Samples: 1288993504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:53:57,579][25689] Avg episode reward: [(0, '1.128')] [2022-07-11 15:53:58,503][26022] Updated weights on worker 0-0, policy_version 1258781 (0.00088) [2022-07-11 15:54:00,296][26022] Updated weights on worker 0-0, policy_version 1258791 (0.00083) [2022-07-11 15:54:02,544][26022] Updated weights on worker 0-0, policy_version 1258801 (0.00087) [2022-07-11 15:54:02,587][25689] Fps is (10 sec: 5403.3, 60 sec: 5517.5, 300 sec: 5564.7). Total num frames: 1289012224. Throughput: 0: 4996.4. Samples: 1289010168. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:02,587][25689] Avg episode reward: [(0, '1.586')] [2022-07-11 15:54:04,480][26022] Updated weights on worker 0-0, policy_version 1258811 (0.00090) [2022-07-11 15:54:06,312][26022] Updated weights on worker 0-0, policy_version 1258821 (0.00095) [2022-07-11 15:54:07,628][25689] Fps is (10 sec: 5298.5, 60 sec: 5542.1, 300 sec: 5568.0). Total num frames: 1289039872. Throughput: 0: 5720.5. Samples: 1289041730. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:07,628][25689] Avg episode reward: [(0, '1.527')] [2022-07-11 15:54:08,047][26022] Updated weights on worker 0-0, policy_version 1258831 (0.00092) [2022-07-11 15:54:09,873][26022] Updated weights on worker 0-0, policy_version 1258841 (0.00091) [2022-07-11 15:54:11,678][26022] Updated weights on worker 0-0, policy_version 1258851 (0.00060) [2022-07-11 15:54:12,640][25689] Fps is (10 sec: 5602.3, 60 sec: 5560.1, 300 sec: 5571.5). Total num frames: 1289068544. Throughput: 0: 5728.3. Samples: 1289074738. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:12,641][25689] Avg episode reward: [(0, '1.669')] [2022-07-11 15:54:13,558][26022] Updated weights on worker 0-0, policy_version 1258861 (0.00087) [2022-07-11 15:54:15,376][26022] Updated weights on worker 0-0, policy_version 1258871 (0.00093) [2022-07-11 15:54:17,446][26022] Updated weights on worker 0-0, policy_version 1258881 (0.00088) [2022-07-11 15:54:17,647][25689] Fps is (10 sec: 5519.1, 60 sec: 5516.3, 300 sec: 5561.5). Total num frames: 1289095168. Throughput: 0: 4873.7. Samples: 1289091312. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:17,647][25689] Avg episode reward: [(0, '1.425')] [2022-07-11 15:54:18,483][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:54:18,500][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001258887_1289100288.pth [2022-07-11 15:54:18,500][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001256930_1287096320.pth [2022-07-11 15:54:19,311][26022] Updated weights on worker 0-0, policy_version 1258891 (0.00085) [2022-07-11 15:54:21,102][26022] Updated weights on worker 0-0, policy_version 1258901 (0.00084) [2022-07-11 15:54:22,662][25689] Fps is (10 sec: 5415.3, 60 sec: 5535.1, 300 sec: 5565.6). Total num frames: 1289122816. Throughput: 0: 5696.4. Samples: 1289124522. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:22,662][25689] Avg episode reward: [(0, '1.392')] [2022-07-11 15:54:22,867][26022] Updated weights on worker 0-0, policy_version 1258911 (0.00101) [2022-07-11 15:54:24,691][26022] Updated weights on worker 0-0, policy_version 1258921 (0.00086) [2022-07-11 15:54:26,587][26022] Updated weights on worker 0-0, policy_version 1258931 (0.00099) [2022-07-11 15:54:27,755][25689] Fps is (10 sec: 5571.9, 60 sec: 5539.1, 300 sec: 5563.9). Total num frames: 1289151488. Throughput: 0: 5789.3. Samples: 1289158250. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:27,757][25689] Avg episode reward: [(0, '1.557')] [2022-07-11 15:54:28,364][26022] Updated weights on worker 0-0, policy_version 1258941 (0.00091) [2022-07-11 15:54:30,139][26022] Updated weights on worker 0-0, policy_version 1258951 (0.00089) [2022-07-11 15:54:32,056][26022] Updated weights on worker 0-0, policy_version 1258961 (0.00084) [2022-07-11 15:54:32,771][25689] Fps is (10 sec: 5672.7, 60 sec: 5529.6, 300 sec: 5560.4). Total num frames: 1289180160. Throughput: 0: 4977.0. Samples: 1289174928. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:32,772][25689] Avg episode reward: [(0, '1.568')] [2022-07-11 15:54:33,867][26022] Updated weights on worker 0-0, policy_version 1258971 (0.00091) [2022-07-11 15:54:35,568][26022] Updated weights on worker 0-0, policy_version 1258981 (0.00084) [2022-07-11 15:54:37,539][26022] Updated weights on worker 0-0, policy_version 1258991 (0.00089) [2022-07-11 15:54:37,785][25689] Fps is (10 sec: 5614.7, 60 sec: 5551.1, 300 sec: 5570.9). Total num frames: 1289207808. Throughput: 0: 5831.5. Samples: 1289208750. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:37,787][25689] Avg episode reward: [(0, '1.572')] [2022-07-11 15:54:39,099][26022] Updated weights on worker 0-0, policy_version 1259001 (0.00086) [2022-07-11 15:54:41,326][26022] Updated weights on worker 0-0, policy_version 1259011 (0.00087) [2022-07-11 15:54:42,808][25689] Fps is (10 sec: 5611.0, 60 sec: 5532.4, 300 sec: 5559.2). Total num frames: 1289236480. Throughput: 0: 5860.5. Samples: 1289242588. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:42,809][25689] Avg episode reward: [(0, '1.991')] [2022-07-11 15:54:42,923][26022] Updated weights on worker 0-0, policy_version 1259021 (0.00081) [2022-07-11 15:54:44,759][26022] Updated weights on worker 0-0, policy_version 1259031 (0.00092) [2022-07-11 15:54:46,451][26022] Updated weights on worker 0-0, policy_version 1259041 (0.00086) [2022-07-11 15:54:47,881][25689] Fps is (10 sec: 5679.9, 60 sec: 5546.1, 300 sec: 5569.1). Total num frames: 1289265152. Throughput: 0: 5041.5. Samples: 1289259718. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:47,882][25689] Avg episode reward: [(0, '2.058')] [2022-07-11 15:54:48,404][26022] Updated weights on worker 0-0, policy_version 1259051 (0.00095) [2022-07-11 15:54:50,152][26022] Updated weights on worker 0-0, policy_version 1259061 (0.00091) [2022-07-11 15:54:52,138][26022] Updated weights on worker 0-0, policy_version 1259071 (0.00879) [2022-07-11 15:54:52,899][25689] Fps is (10 sec: 5682.9, 60 sec: 5564.9, 300 sec: 5569.2). Total num frames: 1289293824. Throughput: 0: 5902.4. Samples: 1289293732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:52,899][25689] Avg episode reward: [(0, '1.960')] [2022-07-11 15:54:53,483][26022] Updated weights on worker 0-0, policy_version 1259081 (0.00083) [2022-07-11 15:54:55,703][26022] Updated weights on worker 0-0, policy_version 1259091 (0.00092) [2022-07-11 15:54:57,448][26022] Updated weights on worker 0-0, policy_version 1259101 (0.00096) [2022-07-11 15:54:57,906][25689] Fps is (10 sec: 5618.2, 60 sec: 5550.5, 300 sec: 5565.8). Total num frames: 1289321472. Throughput: 0: 5917.9. Samples: 1289327822. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:54:57,906][25689] Avg episode reward: [(0, '1.038')] [2022-07-11 15:54:59,119][26022] Updated weights on worker 0-0, policy_version 1259111 (0.00087) [2022-07-11 15:55:01,123][26022] Updated weights on worker 0-0, policy_version 1259121 (0.00088) [2022-07-11 15:55:02,918][25689] Fps is (10 sec: 5519.1, 60 sec: 5584.2, 300 sec: 5573.6). Total num frames: 1289349120. Throughput: 0: 5087.1. Samples: 1289344888. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:02,918][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 15:55:03,043][26022] Updated weights on worker 0-0, policy_version 1259131 (0.00089) [2022-07-11 15:55:05,217][26022] Updated weights on worker 0-0, policy_version 1259141 (0.00092) [2022-07-11 15:55:06,774][26022] Updated weights on worker 0-0, policy_version 1259151 (0.00085) [2022-07-11 15:55:08,034][25689] Fps is (10 sec: 5459.5, 60 sec: 5577.2, 300 sec: 5568.4). Total num frames: 1289376768. Throughput: 0: 5815.5. Samples: 1289376918. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:08,035][25689] Avg episode reward: [(0, '1.135')] [2022-07-11 15:55:08,681][26022] Updated weights on worker 0-0, policy_version 1259161 (0.00085) [2022-07-11 15:55:10,357][26022] Updated weights on worker 0-0, policy_version 1259171 (0.00086) [2022-07-11 15:55:12,279][26022] Updated weights on worker 0-0, policy_version 1259181 (0.00085) [2022-07-11 15:55:13,051][25689] Fps is (10 sec: 5558.0, 60 sec: 5576.7, 300 sec: 5568.3). Total num frames: 1289405440. Throughput: 0: 5819.0. Samples: 1289410998. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:13,052][25689] Avg episode reward: [(0, '1.017')] [2022-07-11 15:55:13,982][26022] Updated weights on worker 0-0, policy_version 1259191 (0.00087) [2022-07-11 15:55:15,778][26022] Updated weights on worker 0-0, policy_version 1259201 (0.00090) [2022-07-11 15:55:17,733][26022] Updated weights on worker 0-0, policy_version 1259211 (0.00091) [2022-07-11 15:55:18,134][25689] Fps is (10 sec: 5677.9, 60 sec: 5603.6, 300 sec: 5570.6). Total num frames: 1289434112. Throughput: 0: 4943.3. Samples: 1289427814. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:18,134][25689] Avg episode reward: [(0, '0.194')] [2022-07-11 15:55:19,627][26022] Updated weights on worker 0-0, policy_version 1259221 (0.00088) [2022-07-11 15:55:21,416][26022] Updated weights on worker 0-0, policy_version 1259231 (0.00093) [2022-07-11 15:55:23,127][26022] Updated weights on worker 0-0, policy_version 1259241 (0.00089) [2022-07-11 15:55:23,224][25689] Fps is (10 sec: 5637.0, 60 sec: 5613.6, 300 sec: 5569.8). Total num frames: 1289462784. Throughput: 0: 5732.4. Samples: 1289461290. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:23,224][25689] Avg episode reward: [(0, '0.103')] [2022-07-11 15:55:25,217][26022] Updated weights on worker 0-0, policy_version 1259251 (0.00085) [2022-07-11 15:55:26,888][26022] Updated weights on worker 0-0, policy_version 1259261 (0.00091) [2022-07-11 15:55:28,323][25689] Fps is (10 sec: 5426.6, 60 sec: 5579.1, 300 sec: 5564.6). Total num frames: 1289489408. Throughput: 0: 5800.2. Samples: 1289494600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:28,324][25689] Avg episode reward: [(0, '0.832')] [2022-07-11 15:55:28,911][26022] Updated weights on worker 0-0, policy_version 1259271 (0.00089) [2022-07-11 15:55:30,633][26022] Updated weights on worker 0-0, policy_version 1259281 (0.00093) [2022-07-11 15:55:32,461][26022] Updated weights on worker 0-0, policy_version 1259291 (0.00081) [2022-07-11 15:55:33,336][25689] Fps is (10 sec: 5569.7, 60 sec: 5596.4, 300 sec: 5571.5). Total num frames: 1289519104. Throughput: 0: 5773.6. Samples: 1289528114. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:33,336][25689] Avg episode reward: [(0, '0.952')] [2022-07-11 15:55:34,224][26022] Updated weights on worker 0-0, policy_version 1259301 (0.00085) [2022-07-11 15:55:36,063][26022] Updated weights on worker 0-0, policy_version 1259311 (0.00083) [2022-07-11 15:55:37,989][26022] Updated weights on worker 0-0, policy_version 1259321 (0.00090) [2022-07-11 15:55:38,362][25689] Fps is (10 sec: 5712.5, 60 sec: 5595.3, 300 sec: 5567.9). Total num frames: 1289546752. Throughput: 0: 5794.0. Samples: 1289545016. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:38,362][25689] Avg episode reward: [(0, '0.450')] [2022-07-11 15:55:39,903][26022] Updated weights on worker 0-0, policy_version 1259331 (0.00096) [2022-07-11 15:55:41,465][26022] Updated weights on worker 0-0, policy_version 1259341 (0.00089) [2022-07-11 15:55:43,379][25689] Fps is (10 sec: 5403.6, 60 sec: 5562.0, 300 sec: 5563.0). Total num frames: 1289573376. Throughput: 0: 5817.9. Samples: 1289578552. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:43,380][25689] Avg episode reward: [(0, '0.677')] [2022-07-11 15:55:43,739][26022] Updated weights on worker 0-0, policy_version 1259351 (0.00060) [2022-07-11 15:55:45,095][26022] Updated weights on worker 0-0, policy_version 1259361 (0.00094) [2022-07-11 15:55:47,251][26022] Updated weights on worker 0-0, policy_version 1259371 (0.00095) [2022-07-11 15:55:48,500][25689] Fps is (10 sec: 5656.1, 60 sec: 5591.4, 300 sec: 5571.6). Total num frames: 1289604096. Throughput: 0: 5824.0. Samples: 1289612110. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:48,501][25689] Avg episode reward: [(0, '-0.034')] [2022-07-11 15:55:48,967][26022] Updated weights on worker 0-0, policy_version 1259381 (0.00090) [2022-07-11 15:55:50,832][26022] Updated weights on worker 0-0, policy_version 1259391 (0.00091) [2022-07-11 15:55:52,718][26022] Updated weights on worker 0-0, policy_version 1259401 (0.00086) [2022-07-11 15:55:53,513][25689] Fps is (10 sec: 5658.6, 60 sec: 5558.0, 300 sec: 5565.6). Total num frames: 1289630720. Throughput: 0: 4989.3. Samples: 1289628786. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:53,514][25689] Avg episode reward: [(0, '-0.822')] [2022-07-11 15:55:54,509][26022] Updated weights on worker 0-0, policy_version 1259411 (0.00084) [2022-07-11 15:55:56,275][26022] Updated weights on worker 0-0, policy_version 1259421 (0.00087) [2022-07-11 15:55:58,205][26022] Updated weights on worker 0-0, policy_version 1259431 (0.00093) [2022-07-11 15:55:58,516][25689] Fps is (10 sec: 5520.8, 60 sec: 5575.3, 300 sec: 5565.7). Total num frames: 1289659392. Throughput: 0: 5830.6. Samples: 1289662528. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:55:58,517][25689] Avg episode reward: [(0, '-0.943')] [2022-07-11 15:55:59,832][26022] Updated weights on worker 0-0, policy_version 1259441 (0.00289) [2022-07-11 15:56:02,156][26022] Updated weights on worker 0-0, policy_version 1259451 (0.00085) [2022-07-11 15:56:03,522][25689] Fps is (10 sec: 5524.6, 60 sec: 5558.9, 300 sec: 5570.7). Total num frames: 1289686016. Throughput: 0: 5744.8. Samples: 1289694270. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:03,523][25689] Avg episode reward: [(0, '-0.928')] [2022-07-11 15:56:03,775][26022] Updated weights on worker 0-0, policy_version 1259461 (0.00079) [2022-07-11 15:56:05,923][26022] Updated weights on worker 0-0, policy_version 1259471 (0.00842) [2022-07-11 15:56:07,484][26022] Updated weights on worker 0-0, policy_version 1259481 (0.00088) [2022-07-11 15:56:08,563][25689] Fps is (10 sec: 5401.8, 60 sec: 5565.9, 300 sec: 5566.7). Total num frames: 1289713664. Throughput: 0: 4931.3. Samples: 1289711046. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:08,564][25689] Avg episode reward: [(0, '0.173')] [2022-07-11 15:56:09,558][26022] Updated weights on worker 0-0, policy_version 1259491 (0.00088) [2022-07-11 15:56:11,227][26022] Updated weights on worker 0-0, policy_version 1259501 (0.00082) [2022-07-11 15:56:13,267][26022] Updated weights on worker 0-0, policy_version 1259511 (0.00089) [2022-07-11 15:56:13,626][25689] Fps is (10 sec: 5473.0, 60 sec: 5544.7, 300 sec: 5565.5). Total num frames: 1289741312. Throughput: 0: 5771.3. Samples: 1289744862. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:13,626][25689] Avg episode reward: [(0, '-1.403')] [2022-07-11 15:56:14,872][26022] Updated weights on worker 0-0, policy_version 1259521 (0.00091) [2022-07-11 15:56:16,803][26022] Updated weights on worker 0-0, policy_version 1259531 (0.00088) [2022-07-11 15:56:18,627][25689] Fps is (10 sec: 5494.4, 60 sec: 5535.3, 300 sec: 5565.8). Total num frames: 1289768960. Throughput: 0: 5751.3. Samples: 1289778192. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:18,628][25689] Avg episode reward: [(0, '-0.626')] [2022-07-11 15:56:18,634][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:56:18,643][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001259541_1289769984.pth [2022-07-11 15:56:18,644][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001257583_1287764992.pth [2022-07-11 15:56:18,654][26022] Updated weights on worker 0-0, policy_version 1259541 (0.00095) [2022-07-11 15:56:20,450][26022] Updated weights on worker 0-0, policy_version 1259551 (0.00084) [2022-07-11 15:56:22,353][26022] Updated weights on worker 0-0, policy_version 1259561 (0.00091) [2022-07-11 15:56:23,696][25689] Fps is (10 sec: 5491.3, 60 sec: 5520.3, 300 sec: 5555.8). Total num frames: 1289796608. Throughput: 0: 4996.5. Samples: 1289795066. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:23,696][25689] Avg episode reward: [(0, '0.190')] [2022-07-11 15:56:24,150][26022] Updated weights on worker 0-0, policy_version 1259571 (0.00091) [2022-07-11 15:56:26,114][26022] Updated weights on worker 0-0, policy_version 1259581 (0.00093) [2022-07-11 15:56:27,740][26022] Updated weights on worker 0-0, policy_version 1259591 (0.00086) [2022-07-11 15:56:28,801][25689] Fps is (10 sec: 5737.1, 60 sec: 5587.5, 300 sec: 5568.6). Total num frames: 1289827328. Throughput: 0: 5789.1. Samples: 1289828204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:28,802][25689] Avg episode reward: [(0, '0.378')] [2022-07-11 15:56:29,735][26022] Updated weights on worker 0-0, policy_version 1259601 (0.00082) [2022-07-11 15:56:31,407][26022] Updated weights on worker 0-0, policy_version 1259611 (0.00086) [2022-07-11 15:56:33,386][26022] Updated weights on worker 0-0, policy_version 1259621 (0.00090) [2022-07-11 15:56:33,872][25689] Fps is (10 sec: 5735.3, 60 sec: 5548.2, 300 sec: 5564.3). Total num frames: 1289854976. Throughput: 0: 5785.7. Samples: 1289862004. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:33,873][25689] Avg episode reward: [(0, '-0.327')] [2022-07-11 15:56:35,113][26022] Updated weights on worker 0-0, policy_version 1259631 (0.00085) [2022-07-11 15:56:36,972][26022] Updated weights on worker 0-0, policy_version 1259641 (0.00091) [2022-07-11 15:56:38,727][26022] Updated weights on worker 0-0, policy_version 1259651 (0.00083) [2022-07-11 15:56:38,929][25689] Fps is (10 sec: 5560.7, 60 sec: 5562.3, 300 sec: 5565.1). Total num frames: 1289883648. Throughput: 0: 4959.7. Samples: 1289878884. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:38,930][25689] Avg episode reward: [(0, '0.527')] [2022-07-11 15:56:40,552][26022] Updated weights on worker 0-0, policy_version 1259661 (0.00087) [2022-07-11 15:56:42,303][26022] Updated weights on worker 0-0, policy_version 1259671 (0.00079) [2022-07-11 15:56:44,023][25689] Fps is (10 sec: 5548.4, 60 sec: 5572.2, 300 sec: 5561.1). Total num frames: 1289911296. Throughput: 0: 5777.1. Samples: 1289912500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2022-07-11 15:56:44,024][25689] Avg episode reward: [(0, '1.436')] [2022-07-11 15:56:44,248][26022] Updated weights on worker 0-0, policy_version 1259681 (0.00088) [2022-07-11 15:56:45,975][26022] Updated weights on worker 0-0, policy_version 1259691 (0.00092) [2022-07-11 15:56:47,947][26022] Updated weights on worker 0-0, policy_version 1259701 (0.00081) [2022-07-11 15:56:49,084][25689] Fps is (10 sec: 5748.0, 60 sec: 5577.8, 300 sec: 5570.4). Total num frames: 1289942016. Throughput: 0: 5835.8. Samples: 1289946570. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:56:49,084][25689] Avg episode reward: [(0, '0.717')] [2022-07-11 15:56:49,874][26022] Updated weights on worker 0-0, policy_version 1259711 (0.00088) [2022-07-11 15:56:51,433][26022] Updated weights on worker 0-0, policy_version 1259721 (0.00079) [2022-07-11 15:56:53,417][26022] Updated weights on worker 0-0, policy_version 1259731 (0.00092) [2022-07-11 15:56:54,102][25689] Fps is (10 sec: 5689.6, 60 sec: 5577.3, 300 sec: 5563.4). Total num frames: 1289968640. Throughput: 0: 5022.5. Samples: 1289963606. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:56:54,102][25689] Avg episode reward: [(0, '0.791')] [2022-07-11 15:56:54,987][26022] Updated weights on worker 0-0, policy_version 1259741 (0.00093) [2022-07-11 15:56:57,165][26022] Updated weights on worker 0-0, policy_version 1259751 (0.01106) [2022-07-11 15:56:58,910][26022] Updated weights on worker 0-0, policy_version 1259761 (0.00098) [2022-07-11 15:56:59,115][25689] Fps is (10 sec: 5410.4, 60 sec: 5559.5, 300 sec: 5563.3). Total num frames: 1289996288. Throughput: 0: 5841.3. Samples: 1289996794. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:56:59,115][25689] Avg episode reward: [(0, '0.550')] [2022-07-11 15:57:00,684][26022] Updated weights on worker 0-0, policy_version 1259771 (0.00088) [2022-07-11 15:57:03,019][26022] Updated weights on worker 0-0, policy_version 1259781 (0.00090) [2022-07-11 15:57:04,162][25689] Fps is (10 sec: 5394.5, 60 sec: 5555.7, 300 sec: 5561.3). Total num frames: 1290022912. Throughput: 0: 5764.2. Samples: 1290028586. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:04,164][25689] Avg episode reward: [(0, '0.436')] [2022-07-11 15:57:04,575][26022] Updated weights on worker 0-0, policy_version 1259791 (0.00081) [2022-07-11 15:57:06,573][26022] Updated weights on worker 0-0, policy_version 1259801 (0.00093) [2022-07-11 15:57:08,243][26022] Updated weights on worker 0-0, policy_version 1259811 (0.00084) [2022-07-11 15:57:09,307][25689] Fps is (10 sec: 5425.4, 60 sec: 5563.1, 300 sec: 5563.7). Total num frames: 1290051584. Throughput: 0: 4895.0. Samples: 1290045562. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:09,307][25689] Avg episode reward: [(0, '-0.494')] [2022-07-11 15:57:10,267][26022] Updated weights on worker 0-0, policy_version 1259821 (0.00084) [2022-07-11 15:57:11,852][26022] Updated weights on worker 0-0, policy_version 1259831 (0.00086) [2022-07-11 15:57:13,876][26022] Updated weights on worker 0-0, policy_version 1259841 (0.00088) [2022-07-11 15:57:14,336][25689] Fps is (10 sec: 5636.3, 60 sec: 5582.9, 300 sec: 5563.3). Total num frames: 1290080256. Throughput: 0: 5720.4. Samples: 1290079356. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:14,337][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 15:57:15,433][26022] Updated weights on worker 0-0, policy_version 1259851 (0.00087) [2022-07-11 15:57:17,555][26022] Updated weights on worker 0-0, policy_version 1259861 (0.00083) [2022-07-11 15:57:19,114][26022] Updated weights on worker 0-0, policy_version 1259871 (0.00066) [2022-07-11 15:57:19,340][25689] Fps is (10 sec: 5715.4, 60 sec: 5599.6, 300 sec: 5564.0). Total num frames: 1290108928. Throughput: 0: 5762.8. Samples: 1290113348. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:19,341][25689] Avg episode reward: [(0, '-0.040')] [2022-07-11 15:57:21,050][26022] Updated weights on worker 0-0, policy_version 1259881 (0.00085) [2022-07-11 15:57:22,778][26022] Updated weights on worker 0-0, policy_version 1259891 (0.00094) [2022-07-11 15:57:24,376][25689] Fps is (10 sec: 5508.0, 60 sec: 5585.7, 300 sec: 5554.8). Total num frames: 1290135552. Throughput: 0: 5027.6. Samples: 1290130210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:24,377][25689] Avg episode reward: [(0, '0.835')] [2022-07-11 15:57:24,794][26022] Updated weights on worker 0-0, policy_version 1259901 (0.00084) [2022-07-11 15:57:26,570][26022] Updated weights on worker 0-0, policy_version 1259911 (0.00051) [2022-07-11 15:57:28,397][26022] Updated weights on worker 0-0, policy_version 1259921 (0.00085) [2022-07-11 15:57:29,507][25689] Fps is (10 sec: 5539.4, 60 sec: 5566.5, 300 sec: 5566.2). Total num frames: 1290165248. Throughput: 0: 5839.2. Samples: 1290163516. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:29,508][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 15:57:30,187][26022] Updated weights on worker 0-0, policy_version 1259931 (0.00092) [2022-07-11 15:57:32,065][26022] Updated weights on worker 0-0, policy_version 1259941 (0.00085) [2022-07-11 15:57:33,907][26022] Updated weights on worker 0-0, policy_version 1259951 (0.00088) [2022-07-11 15:57:34,544][25689] Fps is (10 sec: 5639.4, 60 sec: 5569.6, 300 sec: 5562.7). Total num frames: 1290192896. Throughput: 0: 5835.8. Samples: 1290197284. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:34,545][25689] Avg episode reward: [(0, '0.288')] [2022-07-11 15:57:35,695][26022] Updated weights on worker 0-0, policy_version 1259961 (0.00093) [2022-07-11 15:57:37,629][26022] Updated weights on worker 0-0, policy_version 1259971 (0.00091) [2022-07-11 15:57:39,362][26022] Updated weights on worker 0-0, policy_version 1259981 (0.00093) [2022-07-11 15:57:39,570][25689] Fps is (10 sec: 5596.7, 60 sec: 5572.5, 300 sec: 5562.8). Total num frames: 1290221568. Throughput: 0: 5808.0. Samples: 1290230844. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:39,571][25689] Avg episode reward: [(0, '0.793')] [2022-07-11 15:57:41,261][26022] Updated weights on worker 0-0, policy_version 1259991 (0.00092) [2022-07-11 15:57:43,016][26022] Updated weights on worker 0-0, policy_version 1260001 (0.00087) [2022-07-11 15:57:44,606][25689] Fps is (10 sec: 5699.1, 60 sec: 5594.7, 300 sec: 5564.4). Total num frames: 1290250240. Throughput: 0: 5802.0. Samples: 1290247588. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:44,607][25689] Avg episode reward: [(0, '0.863')] [2022-07-11 15:57:44,917][26022] Updated weights on worker 0-0, policy_version 1260011 (0.00081) [2022-07-11 15:57:46,927][26022] Updated weights on worker 0-0, policy_version 1260021 (0.00083) [2022-07-11 15:57:48,323][26022] Updated weights on worker 0-0, policy_version 1260031 (0.00085) [2022-07-11 15:57:49,697][25689] Fps is (10 sec: 5561.6, 60 sec: 5541.3, 300 sec: 5562.8). Total num frames: 1290277888. Throughput: 0: 5840.1. Samples: 1290281426. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:49,697][25689] Avg episode reward: [(0, '0.698')] [2022-07-11 15:57:50,433][26022] Updated weights on worker 0-0, policy_version 1260041 (0.00086) [2022-07-11 15:57:52,058][26022] Updated weights on worker 0-0, policy_version 1260051 (0.00095) [2022-07-11 15:57:54,011][26022] Updated weights on worker 0-0, policy_version 1260061 (0.00090) [2022-07-11 15:57:54,713][25689] Fps is (10 sec: 5471.0, 60 sec: 5558.3, 300 sec: 5563.1). Total num frames: 1290305536. Throughput: 0: 5832.4. Samples: 1290314918. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:54,714][25689] Avg episode reward: [(0, '0.510')] [2022-07-11 15:57:55,863][26022] Updated weights on worker 0-0, policy_version 1260071 (0.00085) [2022-07-11 15:57:57,676][26022] Updated weights on worker 0-0, policy_version 1260081 (0.00094) [2022-07-11 15:57:59,455][26022] Updated weights on worker 0-0, policy_version 1260091 (0.00086) [2022-07-11 15:57:59,747][25689] Fps is (10 sec: 5604.1, 60 sec: 5573.3, 300 sec: 5562.7). Total num frames: 1290334208. Throughput: 0: 5016.5. Samples: 1290332058. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:57:59,747][25689] Avg episode reward: [(0, '0.985')] [2022-07-11 15:58:01,304][26022] Updated weights on worker 0-0, policy_version 1260101 (0.00095) [2022-07-11 15:58:03,431][26022] Updated weights on worker 0-0, policy_version 1260111 (0.00093) [2022-07-11 15:58:04,768][25689] Fps is (10 sec: 5397.6, 60 sec: 5558.8, 300 sec: 5561.2). Total num frames: 1290359808. Throughput: 0: 5757.3. Samples: 1290363666. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:04,769][25689] Avg episode reward: [(0, '-0.284')] [2022-07-11 15:58:05,350][26022] Updated weights on worker 0-0, policy_version 1260121 (0.00088) [2022-07-11 15:58:07,238][26022] Updated weights on worker 0-0, policy_version 1260131 (0.00086) [2022-07-11 15:58:09,011][26022] Updated weights on worker 0-0, policy_version 1260141 (0.00094) [2022-07-11 15:58:09,848][25689] Fps is (10 sec: 5372.7, 60 sec: 5564.7, 300 sec: 5563.6). Total num frames: 1290388480. Throughput: 0: 5747.1. Samples: 1290397238. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:09,849][25689] Avg episode reward: [(0, '-0.053')] [2022-07-11 15:58:10,924][26022] Updated weights on worker 0-0, policy_version 1260151 (0.00084) [2022-07-11 15:58:12,475][26022] Updated weights on worker 0-0, policy_version 1260161 (0.00116) [2022-07-11 15:58:14,627][26022] Updated weights on worker 0-0, policy_version 1260171 (0.00089) [2022-07-11 15:58:14,873][25689] Fps is (10 sec: 5573.5, 60 sec: 5548.2, 300 sec: 5557.8). Total num frames: 1290416128. Throughput: 0: 4918.9. Samples: 1290414082. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:14,874][25689] Avg episode reward: [(0, '-0.777')] [2022-07-11 15:58:16,205][26022] Updated weights on worker 0-0, policy_version 1260181 (0.00085) [2022-07-11 15:58:18,215][26022] Updated weights on worker 0-0, policy_version 1260191 (0.00098) [2022-07-11 15:58:18,650][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 15:58:18,660][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001260194_1290438656.pth [2022-07-11 15:58:18,660][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001258238_1288435712.pth [2022-07-11 15:58:19,880][25689] Fps is (10 sec: 5716.4, 60 sec: 5564.9, 300 sec: 5568.6). Total num frames: 1290445824. Throughput: 0: 5750.2. Samples: 1290447826. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:19,880][25689] Avg episode reward: [(0, '-0.702')] [2022-07-11 15:58:19,881][26022] Updated weights on worker 0-0, policy_version 1260201 (0.00088) [2022-07-11 15:58:21,783][26022] Updated weights on worker 0-0, policy_version 1260211 (0.00090) [2022-07-11 15:58:23,718][26022] Updated weights on worker 0-0, policy_version 1260221 (0.00082) [2022-07-11 15:58:24,894][25689] Fps is (10 sec: 5620.0, 60 sec: 5566.8, 300 sec: 5564.0). Total num frames: 1290472448. Throughput: 0: 5845.6. Samples: 1290481316. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:24,895][25689] Avg episode reward: [(0, '-0.646')] [2022-07-11 15:58:25,455][26022] Updated weights on worker 0-0, policy_version 1260231 (0.00093) [2022-07-11 15:58:27,337][26022] Updated weights on worker 0-0, policy_version 1260241 (0.00091) [2022-07-11 15:58:29,223][26022] Updated weights on worker 0-0, policy_version 1260251 (0.00086) [2022-07-11 15:58:29,965][25689] Fps is (10 sec: 5482.6, 60 sec: 5555.4, 300 sec: 5561.1). Total num frames: 1290501120. Throughput: 0: 4996.5. Samples: 1290497754. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:29,966][25689] Avg episode reward: [(0, '-0.716')] [2022-07-11 15:58:30,978][26022] Updated weights on worker 0-0, policy_version 1260261 (0.00087) [2022-07-11 15:58:33,056][26022] Updated weights on worker 0-0, policy_version 1260271 (0.00085) [2022-07-11 15:58:34,930][26022] Updated weights on worker 0-0, policy_version 1260281 (0.00099) [2022-07-11 15:58:34,981][25689] Fps is (10 sec: 5481.9, 60 sec: 5540.4, 300 sec: 5562.0). Total num frames: 1290527744. Throughput: 0: 5817.2. Samples: 1290531056. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:34,982][25689] Avg episode reward: [(0, '1.087')] [2022-07-11 15:58:36,522][26022] Updated weights on worker 0-0, policy_version 1260291 (0.00087) [2022-07-11 15:58:38,603][26022] Updated weights on worker 0-0, policy_version 1260301 (0.00088) [2022-07-11 15:58:39,987][25689] Fps is (10 sec: 5517.9, 60 sec: 5542.3, 300 sec: 5558.5). Total num frames: 1290556416. Throughput: 0: 5801.2. Samples: 1290564470. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:39,987][25689] Avg episode reward: [(0, '1.644')] [2022-07-11 15:58:40,298][26022] Updated weights on worker 0-0, policy_version 1260311 (0.00087) [2022-07-11 15:58:42,144][26022] Updated weights on worker 0-0, policy_version 1260321 (0.00080) [2022-07-11 15:58:43,903][26022] Updated weights on worker 0-0, policy_version 1260331 (0.00082) [2022-07-11 15:58:45,009][25689] Fps is (10 sec: 5616.3, 60 sec: 5526.6, 300 sec: 5558.8). Total num frames: 1290584064. Throughput: 0: 4971.7. Samples: 1290581322. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:45,010][25689] Avg episode reward: [(0, '1.891')] [2022-07-11 15:58:45,767][26022] Updated weights on worker 0-0, policy_version 1260341 (0.00095) [2022-07-11 15:58:47,627][26022] Updated weights on worker 0-0, policy_version 1260351 (0.00087) [2022-07-11 15:58:49,379][26022] Updated weights on worker 0-0, policy_version 1260361 (0.00082) [2022-07-11 15:58:50,086][25689] Fps is (10 sec: 5475.2, 60 sec: 5527.9, 300 sec: 5558.0). Total num frames: 1290611712. Throughput: 0: 5823.4. Samples: 1290614924. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:50,087][25689] Avg episode reward: [(0, '1.028')] [2022-07-11 15:58:51,355][26022] Updated weights on worker 0-0, policy_version 1260371 (0.00079) [2022-07-11 15:58:53,219][26022] Updated weights on worker 0-0, policy_version 1260381 (0.00098) [2022-07-11 15:58:55,033][26022] Updated weights on worker 0-0, policy_version 1260391 (0.00090) [2022-07-11 15:58:55,147][25689] Fps is (10 sec: 5555.5, 60 sec: 5540.7, 300 sec: 5557.6). Total num frames: 1290640384. Throughput: 0: 5804.2. Samples: 1290648102. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:58:55,148][25689] Avg episode reward: [(0, '0.767')] [2022-07-11 15:58:56,791][26022] Updated weights on worker 0-0, policy_version 1260401 (0.00088) [2022-07-11 15:58:58,723][26022] Updated weights on worker 0-0, policy_version 1260411 (0.00086) [2022-07-11 15:59:00,161][25689] Fps is (10 sec: 5691.9, 60 sec: 5542.6, 300 sec: 5567.8). Total num frames: 1290669056. Throughput: 0: 4971.5. Samples: 1290664766. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:00,161][25689] Avg episode reward: [(0, '0.652')] [2022-07-11 15:59:00,385][26022] Updated weights on worker 0-0, policy_version 1260421 (0.00082) [2022-07-11 15:59:02,839][26022] Updated weights on worker 0-0, policy_version 1260431 (0.00464) [2022-07-11 15:59:04,409][26022] Updated weights on worker 0-0, policy_version 1260441 (0.00086) [2022-07-11 15:59:05,177][25689] Fps is (10 sec: 5411.3, 60 sec: 5543.1, 300 sec: 5561.4). Total num frames: 1290694656. Throughput: 0: 5711.1. Samples: 1290696498. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:05,178][25689] Avg episode reward: [(0, '0.242')] [2022-07-11 15:59:06,339][26022] Updated weights on worker 0-0, policy_version 1260451 (0.00085) [2022-07-11 15:59:08,115][26022] Updated weights on worker 0-0, policy_version 1260461 (0.00083) [2022-07-11 15:59:09,868][26022] Updated weights on worker 0-0, policy_version 1260471 (0.00086) [2022-07-11 15:59:10,295][25689] Fps is (10 sec: 5355.5, 60 sec: 5539.6, 300 sec: 5559.4). Total num frames: 1290723328. Throughput: 0: 5711.1. Samples: 1290730338. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:10,295][25689] Avg episode reward: [(0, '0.174')] [2022-07-11 15:59:11,898][26022] Updated weights on worker 0-0, policy_version 1260481 (0.00074) [2022-07-11 15:59:13,691][26022] Updated weights on worker 0-0, policy_version 1260491 (0.00098) [2022-07-11 15:59:15,306][25689] Fps is (10 sec: 5660.9, 60 sec: 5557.7, 300 sec: 5566.2). Total num frames: 1290752000. Throughput: 0: 4919.4. Samples: 1290747272. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:15,307][25689] Avg episode reward: [(0, '0.177')] [2022-07-11 15:59:15,415][26022] Updated weights on worker 0-0, policy_version 1260501 (0.00084) [2022-07-11 15:59:17,284][26022] Updated weights on worker 0-0, policy_version 1260511 (0.00087) [2022-07-11 15:59:19,089][26022] Updated weights on worker 0-0, policy_version 1260521 (0.00101) [2022-07-11 15:59:20,317][25689] Fps is (10 sec: 5517.1, 60 sec: 5506.5, 300 sec: 5562.8). Total num frames: 1290778624. Throughput: 0: 5761.6. Samples: 1290780900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:20,320][25689] Avg episode reward: [(0, '0.683')] [2022-07-11 15:59:21,154][26022] Updated weights on worker 0-0, policy_version 1260531 (0.00083) [2022-07-11 15:59:22,759][26022] Updated weights on worker 0-0, policy_version 1260541 (0.00087) [2022-07-11 15:59:24,667][26022] Updated weights on worker 0-0, policy_version 1260551 (0.00103) [2022-07-11 15:59:25,328][25689] Fps is (10 sec: 5517.7, 60 sec: 5540.8, 300 sec: 5564.4). Total num frames: 1290807296. Throughput: 0: 5853.1. Samples: 1290814446. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:25,330][25689] Avg episode reward: [(0, '0.907')] [2022-07-11 15:59:26,443][26022] Updated weights on worker 0-0, policy_version 1260561 (0.00087) [2022-07-11 15:59:28,480][26022] Updated weights on worker 0-0, policy_version 1260571 (0.00091) [2022-07-11 15:59:30,195][26022] Updated weights on worker 0-0, policy_version 1260581 (0.00091) [2022-07-11 15:59:30,439][25689] Fps is (10 sec: 5665.6, 60 sec: 5537.1, 300 sec: 5562.6). Total num frames: 1290835968. Throughput: 0: 4977.1. Samples: 1290830598. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:30,439][25689] Avg episode reward: [(0, '1.412')] [2022-07-11 15:59:32,163][26022] Updated weights on worker 0-0, policy_version 1260591 (0.00084) [2022-07-11 15:59:33,766][26022] Updated weights on worker 0-0, policy_version 1260601 (0.00085) [2022-07-11 15:59:35,532][25689] Fps is (10 sec: 5519.2, 60 sec: 5546.9, 300 sec: 5561.1). Total num frames: 1290863616. Throughput: 0: 5769.2. Samples: 1290863960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:35,533][25689] Avg episode reward: [(0, '0.630')] [2022-07-11 15:59:35,906][26022] Updated weights on worker 0-0, policy_version 1260611 (0.00089) [2022-07-11 15:59:37,546][26022] Updated weights on worker 0-0, policy_version 1260621 (0.00096) [2022-07-11 15:59:39,594][26022] Updated weights on worker 0-0, policy_version 1260631 (0.00092) [2022-07-11 15:59:40,582][25689] Fps is (10 sec: 5451.7, 60 sec: 5526.0, 300 sec: 5557.2). Total num frames: 1290891264. Throughput: 0: 5731.1. Samples: 1290897038. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:40,582][25689] Avg episode reward: [(0, '0.287')] [2022-07-11 15:59:41,351][26022] Updated weights on worker 0-0, policy_version 1260641 (0.00092) [2022-07-11 15:59:43,243][26022] Updated weights on worker 0-0, policy_version 1260651 (0.00084) [2022-07-11 15:59:45,090][26022] Updated weights on worker 0-0, policy_version 1260661 (0.00098) [2022-07-11 15:59:45,637][25689] Fps is (10 sec: 5573.6, 60 sec: 5539.9, 300 sec: 5557.5). Total num frames: 1290919936. Throughput: 0: 4898.7. Samples: 1290913934. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:45,638][25689] Avg episode reward: [(0, '-0.115')] [2022-07-11 15:59:47,054][26022] Updated weights on worker 0-0, policy_version 1260671 (0.00068) [2022-07-11 15:59:48,747][26022] Updated weights on worker 0-0, policy_version 1260681 (0.00083) [2022-07-11 15:59:50,656][26022] Updated weights on worker 0-0, policy_version 1260691 (0.00084) [2022-07-11 15:59:50,698][25689] Fps is (10 sec: 5567.4, 60 sec: 5541.4, 300 sec: 5553.3). Total num frames: 1290947584. Throughput: 0: 5751.2. Samples: 1290947112. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:50,698][25689] Avg episode reward: [(0, '0.100')] [2022-07-11 15:59:52,418][26022] Updated weights on worker 0-0, policy_version 1260701 (0.00084) [2022-07-11 15:59:54,291][26022] Updated weights on worker 0-0, policy_version 1260711 (0.00098) [2022-07-11 15:59:55,701][25689] Fps is (10 sec: 5596.1, 60 sec: 5546.6, 300 sec: 5556.8). Total num frames: 1290976256. Throughput: 0: 5787.9. Samples: 1290980696. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 15:59:55,702][25689] Avg episode reward: [(0, '0.149')] [2022-07-11 15:59:56,263][26022] Updated weights on worker 0-0, policy_version 1260721 (0.00098) [2022-07-11 15:59:57,861][26022] Updated weights on worker 0-0, policy_version 1260731 (0.00063) [2022-07-11 15:59:59,960][26022] Updated weights on worker 0-0, policy_version 1260741 (0.00086) [2022-07-11 16:00:00,703][25689] Fps is (10 sec: 5628.8, 60 sec: 5530.8, 300 sec: 5557.0). Total num frames: 1291003904. Throughput: 0: 4968.2. Samples: 1290997006. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:00,704][25689] Avg episode reward: [(0, '0.248')] [2022-07-11 16:00:01,995][26022] Updated weights on worker 0-0, policy_version 1260751 (0.00085) [2022-07-11 16:00:03,882][26022] Updated weights on worker 0-0, policy_version 1260761 (0.00081) [2022-07-11 16:00:05,640][26022] Updated weights on worker 0-0, policy_version 1260771 (0.00087) [2022-07-11 16:00:05,712][25689] Fps is (10 sec: 5319.2, 60 sec: 5531.4, 300 sec: 5552.1). Total num frames: 1291029504. Throughput: 0: 5705.8. Samples: 1291028476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:05,713][25689] Avg episode reward: [(0, '0.262')] [2022-07-11 16:00:07,551][26022] Updated weights on worker 0-0, policy_version 1260781 (0.00084) [2022-07-11 16:00:09,414][26022] Updated weights on worker 0-0, policy_version 1260791 (0.00087) [2022-07-11 16:00:10,821][25689] Fps is (10 sec: 5262.7, 60 sec: 5515.3, 300 sec: 5546.9). Total num frames: 1291057152. Throughput: 0: 5718.5. Samples: 1291062188. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:10,822][25689] Avg episode reward: [(0, '0.585')] [2022-07-11 16:00:11,283][26022] Updated weights on worker 0-0, policy_version 1260801 (0.00085) [2022-07-11 16:00:13,011][26022] Updated weights on worker 0-0, policy_version 1260811 (0.00085) [2022-07-11 16:00:14,852][26022] Updated weights on worker 0-0, policy_version 1260821 (0.00620) [2022-07-11 16:00:15,851][25689] Fps is (10 sec: 5453.9, 60 sec: 5496.8, 300 sec: 5544.5). Total num frames: 1291084800. Throughput: 0: 4883.1. Samples: 1291079088. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:15,851][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 16:00:16,693][26022] Updated weights on worker 0-0, policy_version 1260831 (0.00082) [2022-07-11 16:00:18,346][26022] Updated weights on worker 0-0, policy_version 1260841 (0.00085) [2022-07-11 16:00:18,683][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:00:18,694][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001260843_1291103232.pth [2022-07-11 16:00:18,697][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001258887_1289100288.pth [2022-07-11 16:00:20,372][26022] Updated weights on worker 0-0, policy_version 1260851 (0.00086) [2022-07-11 16:00:20,873][25689] Fps is (10 sec: 5704.8, 60 sec: 5546.5, 300 sec: 5549.2). Total num frames: 1291114496. Throughput: 0: 5761.7. Samples: 1291113218. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:20,874][25689] Avg episode reward: [(0, '1.100')] [2022-07-11 16:00:21,948][26022] Updated weights on worker 0-0, policy_version 1260861 (0.01037) [2022-07-11 16:00:23,854][26022] Updated weights on worker 0-0, policy_version 1260871 (0.00053) [2022-07-11 16:00:25,685][26022] Updated weights on worker 0-0, policy_version 1260881 (0.00095) [2022-07-11 16:00:25,967][25689] Fps is (10 sec: 5769.8, 60 sec: 5538.9, 300 sec: 5556.2). Total num frames: 1291143168. Throughput: 0: 5846.3. Samples: 1291146892. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:25,967][25689] Avg episode reward: [(0, '1.904')] [2022-07-11 16:00:27,678][26022] Updated weights on worker 0-0, policy_version 1260891 (0.00089) [2022-07-11 16:00:29,420][26022] Updated weights on worker 0-0, policy_version 1260901 (0.00090) [2022-07-11 16:00:31,051][25689] Fps is (10 sec: 5533.6, 60 sec: 5524.4, 300 sec: 5548.0). Total num frames: 1291170816. Throughput: 0: 5831.2. Samples: 1291180150. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:31,053][25689] Avg episode reward: [(0, '1.855')] [2022-07-11 16:00:31,296][26022] Updated weights on worker 0-0, policy_version 1260911 (0.00091) [2022-07-11 16:00:32,972][26022] Updated weights on worker 0-0, policy_version 1260921 (0.00088) [2022-07-11 16:00:34,819][26022] Updated weights on worker 0-0, policy_version 1260931 (0.00083) [2022-07-11 16:00:36,055][25689] Fps is (10 sec: 5684.5, 60 sec: 5566.5, 300 sec: 5555.3). Total num frames: 1291200512. Throughput: 0: 5837.9. Samples: 1291197036. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:36,055][25689] Avg episode reward: [(0, '1.999')] [2022-07-11 16:00:36,764][26022] Updated weights on worker 0-0, policy_version 1260941 (0.00083) [2022-07-11 16:00:38,537][26022] Updated weights on worker 0-0, policy_version 1260951 (0.00085) [2022-07-11 16:00:40,408][26022] Updated weights on worker 0-0, policy_version 1260961 (0.00086) [2022-07-11 16:00:41,067][25689] Fps is (10 sec: 5724.9, 60 sec: 5569.8, 300 sec: 5558.8). Total num frames: 1291228160. Throughput: 0: 5845.9. Samples: 1291231272. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2022-07-11 16:00:41,069][25689] Avg episode reward: [(0, '1.854')] [2022-07-11 16:00:41,976][26022] Updated weights on worker 0-0, policy_version 1260971 (0.00084) [2022-07-11 16:00:43,960][26022] Updated weights on worker 0-0, policy_version 1260981 (0.00093) [2022-07-11 16:00:45,833][26022] Updated weights on worker 0-0, policy_version 1260991 (0.00138) [2022-07-11 16:00:46,115][25689] Fps is (10 sec: 5496.3, 60 sec: 5553.6, 300 sec: 5549.8). Total num frames: 1291255808. Throughput: 0: 5874.7. Samples: 1291265258. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:00:46,117][25689] Avg episode reward: [(0, '1.566')] [2022-07-11 16:00:47,527][26022] Updated weights on worker 0-0, policy_version 1261001 (0.00086) [2022-07-11 16:00:49,377][26022] Updated weights on worker 0-0, policy_version 1261011 (0.00089) [2022-07-11 16:00:51,075][26022] Updated weights on worker 0-0, policy_version 1261021 (0.00083) [2022-07-11 16:00:51,222][25689] Fps is (10 sec: 5647.2, 60 sec: 5583.2, 300 sec: 5558.4). Total num frames: 1291285504. Throughput: 0: 5062.5. Samples: 1291282266. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:00:51,222][25689] Avg episode reward: [(0, '0.875')] [2022-07-11 16:00:53,015][26022] Updated weights on worker 0-0, policy_version 1261031 (0.00088) [2022-07-11 16:00:54,696][26022] Updated weights on worker 0-0, policy_version 1261041 (0.00080) [2022-07-11 16:00:56,251][25689] Fps is (10 sec: 5758.7, 60 sec: 5580.9, 300 sec: 5557.9). Total num frames: 1291314176. Throughput: 0: 5911.1. Samples: 1291316418. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:00:56,251][25689] Avg episode reward: [(0, '1.117')] [2022-07-11 16:00:56,576][26022] Updated weights on worker 0-0, policy_version 1261051 (0.00092) [2022-07-11 16:00:58,445][26022] Updated weights on worker 0-0, policy_version 1261061 (0.00086) [2022-07-11 16:01:00,223][26022] Updated weights on worker 0-0, policy_version 1261071 (0.00088) [2022-07-11 16:01:01,311][25689] Fps is (10 sec: 5582.2, 60 sec: 5575.5, 300 sec: 5560.3). Total num frames: 1291341824. Throughput: 0: 5850.2. Samples: 1291349702. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:01,312][25689] Avg episode reward: [(0, '0.997')] [2022-07-11 16:01:02,548][26022] Updated weights on worker 0-0, policy_version 1261081 (0.00094) [2022-07-11 16:01:04,342][26022] Updated weights on worker 0-0, policy_version 1261091 (0.00054) [2022-07-11 16:01:06,045][26022] Updated weights on worker 0-0, policy_version 1261101 (0.00092) [2022-07-11 16:01:06,320][25689] Fps is (10 sec: 5390.0, 60 sec: 5592.4, 300 sec: 5557.5). Total num frames: 1291368448. Throughput: 0: 4904.1. Samples: 1291364344. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:06,320][25689] Avg episode reward: [(0, '-0.687')] [2022-07-11 16:01:08,035][26022] Updated weights on worker 0-0, policy_version 1261111 (0.00084) [2022-07-11 16:01:09,720][26022] Updated weights on worker 0-0, policy_version 1261121 (0.00085) [2022-07-11 16:01:11,423][25689] Fps is (10 sec: 5265.8, 60 sec: 5576.1, 300 sec: 5553.3). Total num frames: 1291395072. Throughput: 0: 5731.4. Samples: 1291398048. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:11,424][25689] Avg episode reward: [(0, '-1.034')] [2022-07-11 16:01:11,639][26022] Updated weights on worker 0-0, policy_version 1261131 (0.00091) [2022-07-11 16:01:13,566][26022] Updated weights on worker 0-0, policy_version 1261141 (0.00091) [2022-07-11 16:01:15,264][26022] Updated weights on worker 0-0, policy_version 1261151 (0.00087) [2022-07-11 16:01:16,451][25689] Fps is (10 sec: 5559.3, 60 sec: 5610.1, 300 sec: 5559.7). Total num frames: 1291424768. Throughput: 0: 5715.6. Samples: 1291431872. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:16,451][25689] Avg episode reward: [(0, '-1.191')] [2022-07-11 16:01:17,235][26022] Updated weights on worker 0-0, policy_version 1261161 (0.00082) [2022-07-11 16:01:18,874][26022] Updated weights on worker 0-0, policy_version 1261171 (0.00086) [2022-07-11 16:01:20,938][26022] Updated weights on worker 0-0, policy_version 1261181 (0.00082) [2022-07-11 16:01:21,466][25689] Fps is (10 sec: 5710.1, 60 sec: 5576.9, 300 sec: 5560.7). Total num frames: 1291452416. Throughput: 0: 4916.8. Samples: 1291448798. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:21,466][25689] Avg episode reward: [(0, '-1.174')] [2022-07-11 16:01:22,699][26022] Updated weights on worker 0-0, policy_version 1261191 (0.00081) [2022-07-11 16:01:24,286][26022] Updated weights on worker 0-0, policy_version 1261201 (0.00085) [2022-07-11 16:01:26,259][26022] Updated weights on worker 0-0, policy_version 1261211 (0.00089) [2022-07-11 16:01:26,491][25689] Fps is (10 sec: 5507.4, 60 sec: 5566.3, 300 sec: 5551.9). Total num frames: 1291480064. Throughput: 0: 5863.2. Samples: 1291482610. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:26,491][25689] Avg episode reward: [(0, '-1.332')] [2022-07-11 16:01:27,947][26022] Updated weights on worker 0-0, policy_version 1261221 (0.00091) [2022-07-11 16:01:30,035][26022] Updated weights on worker 0-0, policy_version 1261231 (0.00504) [2022-07-11 16:01:31,535][25689] Fps is (10 sec: 5695.2, 60 sec: 5603.9, 300 sec: 5559.3). Total num frames: 1291509760. Throughput: 0: 5862.6. Samples: 1291515952. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:31,535][25689] Avg episode reward: [(0, '-1.554')] [2022-07-11 16:01:31,681][26022] Updated weights on worker 0-0, policy_version 1261241 (0.00088) [2022-07-11 16:01:33,517][26022] Updated weights on worker 0-0, policy_version 1261251 (0.00088) [2022-07-11 16:01:35,248][26022] Updated weights on worker 0-0, policy_version 1261261 (0.00094) [2022-07-11 16:01:36,588][25689] Fps is (10 sec: 5679.1, 60 sec: 5565.4, 300 sec: 5555.9). Total num frames: 1291537408. Throughput: 0: 5024.0. Samples: 1291533044. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:36,589][25689] Avg episode reward: [(0, '0.348')] [2022-07-11 16:01:37,233][26022] Updated weights on worker 0-0, policy_version 1261271 (0.00084) [2022-07-11 16:01:38,966][26022] Updated weights on worker 0-0, policy_version 1261281 (0.00084) [2022-07-11 16:01:40,926][26022] Updated weights on worker 0-0, policy_version 1261291 (0.00090) [2022-07-11 16:01:41,682][25689] Fps is (10 sec: 5550.3, 60 sec: 5574.9, 300 sec: 5559.3). Total num frames: 1291566080. Throughput: 0: 5839.7. Samples: 1291566854. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:41,682][25689] Avg episode reward: [(0, '0.955')] [2022-07-11 16:01:42,631][26022] Updated weights on worker 0-0, policy_version 1261301 (0.00095) [2022-07-11 16:01:44,564][26022] Updated weights on worker 0-0, policy_version 1261311 (0.00083) [2022-07-11 16:01:46,536][26022] Updated weights on worker 0-0, policy_version 1261321 (0.00093) [2022-07-11 16:01:46,693][25689] Fps is (10 sec: 5573.3, 60 sec: 5578.2, 300 sec: 5549.9). Total num frames: 1291593728. Throughput: 0: 5832.6. Samples: 1291600444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:46,694][25689] Avg episode reward: [(0, '0.699')] [2022-07-11 16:01:48,163][26022] Updated weights on worker 0-0, policy_version 1261331 (0.00091) [2022-07-11 16:01:50,060][26022] Updated weights on worker 0-0, policy_version 1261341 (0.00105) [2022-07-11 16:01:51,751][25689] Fps is (10 sec: 5593.4, 60 sec: 5565.9, 300 sec: 5556.1). Total num frames: 1291622400. Throughput: 0: 5007.2. Samples: 1291617178. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:51,751][25689] Avg episode reward: [(0, '0.641')] [2022-07-11 16:01:51,893][26022] Updated weights on worker 0-0, policy_version 1261351 (0.00085) [2022-07-11 16:01:53,512][26022] Updated weights on worker 0-0, policy_version 1261361 (0.00085) [2022-07-11 16:01:55,643][26022] Updated weights on worker 0-0, policy_version 1261371 (0.00091) [2022-07-11 16:01:56,784][25689] Fps is (10 sec: 5784.7, 60 sec: 5582.4, 300 sec: 5562.6). Total num frames: 1291652096. Throughput: 0: 5852.4. Samples: 1291651236. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:01:56,784][25689] Avg episode reward: [(0, '0.399')] [2022-07-11 16:01:57,295][26022] Updated weights on worker 0-0, policy_version 1261381 (0.00599) [2022-07-11 16:01:59,205][26022] Updated weights on worker 0-0, policy_version 1261391 (0.00081) [2022-07-11 16:02:01,065][26022] Updated weights on worker 0-0, policy_version 1261401 (0.00095) [2022-07-11 16:02:01,792][25689] Fps is (10 sec: 5303.0, 60 sec: 5519.5, 300 sec: 5553.0). Total num frames: 1291675648. Throughput: 0: 5847.3. Samples: 1291684444. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:01,792][25689] Avg episode reward: [(0, '0.524')] [2022-07-11 16:02:03,199][26022] Updated weights on worker 0-0, policy_version 1261411 (0.00082) [2022-07-11 16:02:05,125][26022] Updated weights on worker 0-0, policy_version 1261421 (0.00087) [2022-07-11 16:02:06,616][26022] Updated weights on worker 0-0, policy_version 1261431 (0.00093) [2022-07-11 16:02:06,799][25689] Fps is (10 sec: 5418.6, 60 sec: 5587.3, 300 sec: 5562.4). Total num frames: 1291706368. Throughput: 0: 4938.2. Samples: 1291699732. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:06,800][25689] Avg episode reward: [(0, '0.645')] [2022-07-11 16:02:08,772][26022] Updated weights on worker 0-0, policy_version 1261441 (0.00085) [2022-07-11 16:02:10,314][26022] Updated weights on worker 0-0, policy_version 1261451 (0.00096) [2022-07-11 16:02:11,877][25689] Fps is (10 sec: 5584.3, 60 sec: 5572.7, 300 sec: 5551.2). Total num frames: 1291731968. Throughput: 0: 5774.4. Samples: 1291733398. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:11,878][25689] Avg episode reward: [(0, '0.547')] [2022-07-11 16:02:12,466][26022] Updated weights on worker 0-0, policy_version 1261461 (0.00091) [2022-07-11 16:02:13,961][26022] Updated weights on worker 0-0, policy_version 1261471 (0.00090) [2022-07-11 16:02:15,915][26022] Updated weights on worker 0-0, policy_version 1261481 (0.00091) [2022-07-11 16:02:16,889][25689] Fps is (10 sec: 5581.9, 60 sec: 5591.1, 300 sec: 5557.9). Total num frames: 1291762688. Throughput: 0: 5774.6. Samples: 1291767338. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:16,890][25689] Avg episode reward: [(0, '1.184')] [2022-07-11 16:02:17,638][26022] Updated weights on worker 0-0, policy_version 1261491 (0.00090) [2022-07-11 16:02:18,883][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:02:18,892][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001261496_1291771904.pth [2022-07-11 16:02:18,892][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001259541_1289769984.pth [2022-07-11 16:02:19,528][26022] Updated weights on worker 0-0, policy_version 1261501 (0.00085) [2022-07-11 16:02:21,607][26022] Updated weights on worker 0-0, policy_version 1261511 (0.00090) [2022-07-11 16:02:21,918][25689] Fps is (10 sec: 5711.1, 60 sec: 5572.9, 300 sec: 5558.1). Total num frames: 1291789312. Throughput: 0: 4949.5. Samples: 1291784058. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:21,919][25689] Avg episode reward: [(0, '1.686')] [2022-07-11 16:02:23,257][26022] Updated weights on worker 0-0, policy_version 1261521 (0.00091) [2022-07-11 16:02:25,083][26022] Updated weights on worker 0-0, policy_version 1261531 (0.00090) [2022-07-11 16:02:26,879][26022] Updated weights on worker 0-0, policy_version 1261541 (0.00084) [2022-07-11 16:02:26,943][25689] Fps is (10 sec: 5500.1, 60 sec: 5589.9, 300 sec: 5556.6). Total num frames: 1291817984. Throughput: 0: 5863.5. Samples: 1291817844. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:26,943][25689] Avg episode reward: [(0, '1.581')] [2022-07-11 16:02:28,562][26022] Updated weights on worker 0-0, policy_version 1261551 (0.00085) [2022-07-11 16:02:30,648][26022] Updated weights on worker 0-0, policy_version 1261561 (0.00091) [2022-07-11 16:02:32,036][25689] Fps is (10 sec: 5667.4, 60 sec: 5568.4, 300 sec: 5559.0). Total num frames: 1291846656. Throughput: 0: 5828.1. Samples: 1291850886. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:32,037][25689] Avg episode reward: [(0, '1.264')] [2022-07-11 16:02:32,217][26022] Updated weights on worker 0-0, policy_version 1261571 (0.00092) [2022-07-11 16:02:34,389][26022] Updated weights on worker 0-0, policy_version 1261581 (0.00052) [2022-07-11 16:02:36,347][26022] Updated weights on worker 0-0, policy_version 1261591 (0.00085) [2022-07-11 16:02:37,090][25689] Fps is (10 sec: 5449.2, 60 sec: 5551.4, 300 sec: 5551.6). Total num frames: 1291873280. Throughput: 0: 4971.1. Samples: 1291867760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:37,091][25689] Avg episode reward: [(0, '1.542')] [2022-07-11 16:02:37,963][26022] Updated weights on worker 0-0, policy_version 1261601 (0.00087) [2022-07-11 16:02:39,867][26022] Updated weights on worker 0-0, policy_version 1261611 (0.00084) [2022-07-11 16:02:41,527][26022] Updated weights on worker 0-0, policy_version 1261621 (0.00095) [2022-07-11 16:02:42,175][25689] Fps is (10 sec: 5453.9, 60 sec: 5552.2, 300 sec: 5550.7). Total num frames: 1291901952. Throughput: 0: 5806.0. Samples: 1291901670. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:42,176][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 16:02:43,486][26022] Updated weights on worker 0-0, policy_version 1261631 (0.00088) [2022-07-11 16:02:45,222][26022] Updated weights on worker 0-0, policy_version 1261641 (0.00089) [2022-07-11 16:02:47,059][26022] Updated weights on worker 0-0, policy_version 1261651 (0.00087) [2022-07-11 16:02:47,196][25689] Fps is (10 sec: 5775.8, 60 sec: 5585.2, 300 sec: 5558.9). Total num frames: 1291931648. Throughput: 0: 5809.5. Samples: 1291935504. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:47,196][25689] Avg episode reward: [(0, '1.780')] [2022-07-11 16:02:48,948][26022] Updated weights on worker 0-0, policy_version 1261661 (0.00085) [2022-07-11 16:02:50,567][26022] Updated weights on worker 0-0, policy_version 1261671 (0.00088) [2022-07-11 16:02:52,256][25689] Fps is (10 sec: 5587.1, 60 sec: 5551.2, 300 sec: 5554.6). Total num frames: 1291958272. Throughput: 0: 5021.7. Samples: 1291952420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:52,256][25689] Avg episode reward: [(0, '1.793')] [2022-07-11 16:02:52,616][26022] Updated weights on worker 0-0, policy_version 1261681 (0.00090) [2022-07-11 16:02:54,035][26022] Updated weights on worker 0-0, policy_version 1261691 (0.00086) [2022-07-11 16:02:56,104][26022] Updated weights on worker 0-0, policy_version 1261701 (0.00086) [2022-07-11 16:02:57,269][25689] Fps is (10 sec: 5794.3, 60 sec: 5586.8, 300 sec: 5565.3). Total num frames: 1291990016. Throughput: 0: 5881.2. Samples: 1291986438. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:02:57,270][25689] Avg episode reward: [(0, '1.675')] [2022-07-11 16:02:58,228][26022] Updated weights on worker 0-0, policy_version 1261711 (0.00094) [2022-07-11 16:02:59,631][26022] Updated weights on worker 0-0, policy_version 1261721 (0.00081) [2022-07-11 16:03:01,654][26022] Updated weights on worker 0-0, policy_version 1261731 (0.00089) [2022-07-11 16:03:02,323][25689] Fps is (10 sec: 5492.8, 60 sec: 5582.6, 300 sec: 5557.8). Total num frames: 1292013568. Throughput: 0: 5877.0. Samples: 1292020076. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:02,323][25689] Avg episode reward: [(0, '0.799')] [2022-07-11 16:03:03,806][26022] Updated weights on worker 0-0, policy_version 1261741 (0.00091) [2022-07-11 16:03:05,690][26022] Updated weights on worker 0-0, policy_version 1261751 (0.00099) [2022-07-11 16:03:07,400][25689] Fps is (10 sec: 5155.1, 60 sec: 5542.4, 300 sec: 5557.9). Total num frames: 1292042240. Throughput: 0: 5748.1. Samples: 1292051638. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:07,400][25689] Avg episode reward: [(0, '0.911')] [2022-07-11 16:03:07,467][26022] Updated weights on worker 0-0, policy_version 1261761 (0.00079) [2022-07-11 16:03:09,412][26022] Updated weights on worker 0-0, policy_version 1261771 (0.00092) [2022-07-11 16:03:11,050][26022] Updated weights on worker 0-0, policy_version 1261781 (0.00091) [2022-07-11 16:03:12,463][25689] Fps is (10 sec: 5655.2, 60 sec: 5594.5, 300 sec: 5560.6). Total num frames: 1292070912. Throughput: 0: 5735.2. Samples: 1292068312. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:12,463][25689] Avg episode reward: [(0, '0.978')] [2022-07-11 16:03:13,177][26022] Updated weights on worker 0-0, policy_version 1261791 (0.00089) [2022-07-11 16:03:14,827][26022] Updated weights on worker 0-0, policy_version 1261801 (0.00089) [2022-07-11 16:03:16,910][26022] Updated weights on worker 0-0, policy_version 1261811 (0.00092) [2022-07-11 16:03:17,514][25689] Fps is (10 sec: 5467.4, 60 sec: 5523.3, 300 sec: 5549.5). Total num frames: 1292097536. Throughput: 0: 5659.2. Samples: 1292101004. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:17,514][25689] Avg episode reward: [(0, '0.354')] [2022-07-11 16:03:18,787][26022] Updated weights on worker 0-0, policy_version 1261821 (0.00088) [2022-07-11 16:03:20,638][26022] Updated weights on worker 0-0, policy_version 1261831 (0.00089) [2022-07-11 16:03:22,526][25689] Fps is (10 sec: 5392.8, 60 sec: 5541.7, 300 sec: 5553.0). Total num frames: 1292125184. Throughput: 0: 5624.6. Samples: 1292133714. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:22,528][25689] Avg episode reward: [(0, '0.367')] [2022-07-11 16:03:22,530][26022] Updated weights on worker 0-0, policy_version 1261841 (0.00091) [2022-07-11 16:03:24,283][26022] Updated weights on worker 0-0, policy_version 1261851 (0.00091) [2022-07-11 16:03:26,107][26022] Updated weights on worker 0-0, policy_version 1261861 (0.00099) [2022-07-11 16:03:27,545][25689] Fps is (10 sec: 5512.3, 60 sec: 5525.3, 300 sec: 5550.5). Total num frames: 1292152832. Throughput: 0: 4914.9. Samples: 1292150650. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:27,547][25689] Avg episode reward: [(0, '1.266')] [2022-07-11 16:03:28,119][26022] Updated weights on worker 0-0, policy_version 1261871 (0.00087) [2022-07-11 16:03:29,755][26022] Updated weights on worker 0-0, policy_version 1261881 (0.00085) [2022-07-11 16:03:31,677][26022] Updated weights on worker 0-0, policy_version 1261891 (0.00091) [2022-07-11 16:03:32,666][25689] Fps is (10 sec: 5655.7, 60 sec: 5539.8, 300 sec: 5558.9). Total num frames: 1292182528. Throughput: 0: 5729.8. Samples: 1292184070. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:32,668][25689] Avg episode reward: [(0, '1.695')] [2022-07-11 16:03:33,620][26022] Updated weights on worker 0-0, policy_version 1261901 (0.00086) [2022-07-11 16:03:35,179][26022] Updated weights on worker 0-0, policy_version 1261911 (0.00086) [2022-07-11 16:03:37,214][26022] Updated weights on worker 0-0, policy_version 1261921 (0.01179) [2022-07-11 16:03:37,753][25689] Fps is (10 sec: 5617.7, 60 sec: 5553.6, 300 sec: 5553.9). Total num frames: 1292210176. Throughput: 0: 5776.9. Samples: 1292217924. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:37,753][25689] Avg episode reward: [(0, '1.584')] [2022-07-11 16:03:38,692][26022] Updated weights on worker 0-0, policy_version 1261931 (0.00092) [2022-07-11 16:03:40,804][26022] Updated weights on worker 0-0, policy_version 1261941 (0.00082) [2022-07-11 16:03:42,648][26022] Updated weights on worker 0-0, policy_version 1261951 (0.00089) [2022-07-11 16:03:42,770][25689] Fps is (10 sec: 5574.0, 60 sec: 5559.9, 300 sec: 5557.4). Total num frames: 1292238848. Throughput: 0: 4993.7. Samples: 1292234802. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:42,770][25689] Avg episode reward: [(0, '1.377')] [2022-07-11 16:03:44,274][26022] Updated weights on worker 0-0, policy_version 1261961 (0.00085) [2022-07-11 16:03:46,380][26022] Updated weights on worker 0-0, policy_version 1261971 (0.00085) [2022-07-11 16:03:47,823][25689] Fps is (10 sec: 5694.3, 60 sec: 5540.0, 300 sec: 5561.3). Total num frames: 1292267520. Throughput: 0: 5819.7. Samples: 1292268662. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:47,824][25689] Avg episode reward: [(0, '1.757')] [2022-07-11 16:03:47,922][26022] Updated weights on worker 0-0, policy_version 1261981 (0.00551) [2022-07-11 16:03:49,915][26022] Updated weights on worker 0-0, policy_version 1261991 (0.00090) [2022-07-11 16:03:51,844][26022] Updated weights on worker 0-0, policy_version 1262001 (0.00090) [2022-07-11 16:03:52,909][25689] Fps is (10 sec: 5554.5, 60 sec: 5554.4, 300 sec: 5557.4). Total num frames: 1292295168. Throughput: 0: 5822.2. Samples: 1292301932. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:52,910][25689] Avg episode reward: [(0, '2.285')] [2022-07-11 16:03:53,584][26022] Updated weights on worker 0-0, policy_version 1262011 (0.00079) [2022-07-11 16:03:55,462][26022] Updated weights on worker 0-0, policy_version 1262021 (0.00089) [2022-07-11 16:03:57,243][26022] Updated weights on worker 0-0, policy_version 1262031 (0.00085) [2022-07-11 16:03:57,967][25689] Fps is (10 sec: 5350.2, 60 sec: 5466.0, 300 sec: 5549.7). Total num frames: 1292321792. Throughput: 0: 4979.2. Samples: 1292318574. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:03:57,968][25689] Avg episode reward: [(0, '2.002')] [2022-07-11 16:03:59,108][26022] Updated weights on worker 0-0, policy_version 1262041 (0.00087) [2022-07-11 16:04:00,863][26022] Updated weights on worker 0-0, policy_version 1262051 (0.00083) [2022-07-11 16:04:02,999][25689] Fps is (10 sec: 5277.5, 60 sec: 5518.6, 300 sec: 5552.8). Total num frames: 1292348416. Throughput: 0: 5803.6. Samples: 1292352204. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:02,999][25689] Avg episode reward: [(0, '0.999')] [2022-07-11 16:04:03,158][26022] Updated weights on worker 0-0, policy_version 1262061 (0.00087) [2022-07-11 16:04:05,020][26022] Updated weights on worker 0-0, policy_version 1262071 (0.00084) [2022-07-11 16:04:06,867][26022] Updated weights on worker 0-0, policy_version 1262081 (0.00087) [2022-07-11 16:04:08,030][25689] Fps is (10 sec: 5494.8, 60 sec: 5522.8, 300 sec: 5554.5). Total num frames: 1292377088. Throughput: 0: 5683.9. Samples: 1292383518. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:08,031][25689] Avg episode reward: [(0, '0.323')] [2022-07-11 16:04:08,623][26022] Updated weights on worker 0-0, policy_version 1262091 (0.00083) [2022-07-11 16:04:10,527][26022] Updated weights on worker 0-0, policy_version 1262101 (0.00087) [2022-07-11 16:04:12,327][26022] Updated weights on worker 0-0, policy_version 1262111 (0.00087) [2022-07-11 16:04:13,091][25689] Fps is (10 sec: 5682.2, 60 sec: 5523.0, 300 sec: 5553.5). Total num frames: 1292405760. Throughput: 0: 4878.3. Samples: 1292400384. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:13,091][25689] Avg episode reward: [(0, '0.307')] [2022-07-11 16:04:14,148][26022] Updated weights on worker 0-0, policy_version 1262121 (0.00085) [2022-07-11 16:04:15,884][26022] Updated weights on worker 0-0, policy_version 1262131 (0.00091) [2022-07-11 16:04:17,672][26022] Updated weights on worker 0-0, policy_version 1262141 (0.00090) [2022-07-11 16:04:18,113][25689] Fps is (10 sec: 5585.6, 60 sec: 5542.5, 300 sec: 5556.8). Total num frames: 1292433408. Throughput: 0: 5742.0. Samples: 1292434254. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:18,114][25689] Avg episode reward: [(0, '0.088')] [2022-07-11 16:04:19,121][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:04:19,134][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001262149_1292440576.pth [2022-07-11 16:04:19,134][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001260194_1290438656.pth [2022-07-11 16:04:19,538][26022] Updated weights on worker 0-0, policy_version 1262151 (0.00096) [2022-07-11 16:04:21,467][26022] Updated weights on worker 0-0, policy_version 1262161 (0.00084) [2022-07-11 16:04:23,136][25689] Fps is (10 sec: 5606.4, 60 sec: 5558.4, 300 sec: 5556.5). Total num frames: 1292462080. Throughput: 0: 5739.1. Samples: 1292467776. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:23,137][25689] Avg episode reward: [(0, '-0.623')] [2022-07-11 16:04:23,278][26022] Updated weights on worker 0-0, policy_version 1262171 (0.00090) [2022-07-11 16:04:25,229][26022] Updated weights on worker 0-0, policy_version 1262181 (0.00175) [2022-07-11 16:04:27,111][26022] Updated weights on worker 0-0, policy_version 1262191 (0.00096) [2022-07-11 16:04:28,163][25689] Fps is (10 sec: 5604.4, 60 sec: 5557.7, 300 sec: 5554.7). Total num frames: 1292489728. Throughput: 0: 5020.5. Samples: 1292484592. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:28,163][25689] Avg episode reward: [(0, '-0.371')] [2022-07-11 16:04:28,966][26022] Updated weights on worker 0-0, policy_version 1262201 (0.00090) [2022-07-11 16:04:30,667][26022] Updated weights on worker 0-0, policy_version 1262211 (0.00092) [2022-07-11 16:04:32,536][26022] Updated weights on worker 0-0, policy_version 1262221 (0.00091) [2022-07-11 16:04:33,302][25689] Fps is (10 sec: 5540.2, 60 sec: 5539.1, 300 sec: 5557.3). Total num frames: 1292518400. Throughput: 0: 5801.0. Samples: 1292517630. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:33,303][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 16:04:34,433][26022] Updated weights on worker 0-0, policy_version 1262231 (0.00084) [2022-07-11 16:04:36,183][26022] Updated weights on worker 0-0, policy_version 1262241 (0.00084) [2022-07-11 16:04:38,033][26022] Updated weights on worker 0-0, policy_version 1262251 (0.00089) [2022-07-11 16:04:38,319][25689] Fps is (10 sec: 5545.1, 60 sec: 5545.5, 300 sec: 5557.9). Total num frames: 1292546048. Throughput: 0: 5782.7. Samples: 1292551098. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:38,321][25689] Avg episode reward: [(0, '1.398')] [2022-07-11 16:04:40,032][26022] Updated weights on worker 0-0, policy_version 1262261 (0.00093) [2022-07-11 16:04:41,500][26022] Updated weights on worker 0-0, policy_version 1262271 (0.00092) [2022-07-11 16:04:43,347][25689] Fps is (10 sec: 5402.8, 60 sec: 5510.7, 300 sec: 5551.5). Total num frames: 1292572672. Throughput: 0: 4948.2. Samples: 1292567784. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2022-07-11 16:04:43,349][25689] Avg episode reward: [(0, '1.618')] [2022-07-11 16:04:43,722][26022] Updated weights on worker 0-0, policy_version 1262281 (0.00086) [2022-07-11 16:04:45,223][26022] Updated weights on worker 0-0, policy_version 1262291 (0.00088) [2022-07-11 16:04:47,475][26022] Updated weights on worker 0-0, policy_version 1262301 (0.00094) [2022-07-11 16:04:48,368][25689] Fps is (10 sec: 5604.9, 60 sec: 5530.6, 300 sec: 5559.1). Total num frames: 1292602368. Throughput: 0: 5781.1. Samples: 1292601400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:04:48,368][25689] Avg episode reward: [(0, '1.940')] [2022-07-11 16:04:49,087][26022] Updated weights on worker 0-0, policy_version 1262311 (0.00085) [2022-07-11 16:04:50,971][26022] Updated weights on worker 0-0, policy_version 1262321 (0.00086) [2022-07-11 16:04:52,770][26022] Updated weights on worker 0-0, policy_version 1262331 (0.00087) [2022-07-11 16:04:53,488][25689] Fps is (10 sec: 5654.8, 60 sec: 5527.5, 300 sec: 5553.5). Total num frames: 1292630016. Throughput: 0: 5819.6. Samples: 1292635104. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:04:53,489][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 16:04:54,416][26022] Updated weights on worker 0-0, policy_version 1262341 (0.00087) [2022-07-11 16:04:56,425][26022] Updated weights on worker 0-0, policy_version 1262351 (0.00086) [2022-07-11 16:04:58,246][26022] Updated weights on worker 0-0, policy_version 1262361 (0.00092) [2022-07-11 16:04:58,519][25689] Fps is (10 sec: 5548.3, 60 sec: 5563.8, 300 sec: 5556.4). Total num frames: 1292658688. Throughput: 0: 4994.2. Samples: 1292651978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:04:58,519][25689] Avg episode reward: [(0, '1.534')] [2022-07-11 16:05:00,075][26022] Updated weights on worker 0-0, policy_version 1262371 (0.00087) [2022-07-11 16:05:02,260][26022] Updated weights on worker 0-0, policy_version 1262381 (0.00104) [2022-07-11 16:05:03,541][25689] Fps is (10 sec: 5398.8, 60 sec: 5547.8, 300 sec: 5556.2). Total num frames: 1292684288. Throughput: 0: 5805.4. Samples: 1292685014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:03,541][25689] Avg episode reward: [(0, '1.571')] [2022-07-11 16:05:04,078][26022] Updated weights on worker 0-0, policy_version 1262391 (0.00091) [2022-07-11 16:05:06,012][26022] Updated weights on worker 0-0, policy_version 1262401 (0.00089) [2022-07-11 16:05:07,752][26022] Updated weights on worker 0-0, policy_version 1262411 (0.00092) [2022-07-11 16:05:08,578][25689] Fps is (10 sec: 5293.6, 60 sec: 5530.4, 300 sec: 5557.5). Total num frames: 1292711936. Throughput: 0: 5686.9. Samples: 1292716330. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:08,578][25689] Avg episode reward: [(0, '1.735')] [2022-07-11 16:05:09,858][26022] Updated weights on worker 0-0, policy_version 1262421 (0.00092) [2022-07-11 16:05:11,464][26022] Updated weights on worker 0-0, policy_version 1262431 (0.00082) [2022-07-11 16:05:13,280][26022] Updated weights on worker 0-0, policy_version 1262441 (0.00091) [2022-07-11 16:05:13,653][25689] Fps is (10 sec: 5569.4, 60 sec: 5529.0, 300 sec: 5560.1). Total num frames: 1292740608. Throughput: 0: 5698.5. Samples: 1292750014. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:13,654][25689] Avg episode reward: [(0, '1.829')] [2022-07-11 16:05:15,157][26022] Updated weights on worker 0-0, policy_version 1262451 (0.00094) [2022-07-11 16:05:16,955][26022] Updated weights on worker 0-0, policy_version 1262461 (0.00085) [2022-07-11 16:05:18,694][25689] Fps is (10 sec: 5567.0, 60 sec: 5527.3, 300 sec: 5552.9). Total num frames: 1292768256. Throughput: 0: 5694.4. Samples: 1292766866. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:18,695][25689] Avg episode reward: [(0, '2.178')] [2022-07-11 16:05:18,979][26022] Updated weights on worker 0-0, policy_version 1262471 (0.00086) [2022-07-11 16:05:20,807][26022] Updated weights on worker 0-0, policy_version 1262481 (0.00083) [2022-07-11 16:05:22,584][26022] Updated weights on worker 0-0, policy_version 1262491 (0.00096) [2022-07-11 16:05:23,696][25689] Fps is (10 sec: 5607.8, 60 sec: 5529.2, 300 sec: 5554.6). Total num frames: 1292796928. Throughput: 0: 5715.2. Samples: 1292800208. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:23,697][25689] Avg episode reward: [(0, '2.115')] [2022-07-11 16:05:24,471][26022] Updated weights on worker 0-0, policy_version 1262501 (0.00080) [2022-07-11 16:05:26,184][26022] Updated weights on worker 0-0, policy_version 1262511 (0.00105) [2022-07-11 16:05:27,874][26022] Updated weights on worker 0-0, policy_version 1262521 (0.00079) [2022-07-11 16:05:28,744][25689] Fps is (10 sec: 5705.8, 60 sec: 5544.1, 300 sec: 5558.7). Total num frames: 1292825600. Throughput: 0: 5836.5. Samples: 1292834034. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:28,745][25689] Avg episode reward: [(0, '2.337')] [2022-07-11 16:05:29,782][26022] Updated weights on worker 0-0, policy_version 1262531 (0.00082) [2022-07-11 16:05:31,560][26022] Updated weights on worker 0-0, policy_version 1262541 (0.00086) [2022-07-11 16:05:33,496][26022] Updated weights on worker 0-0, policy_version 1262551 (0.00088) [2022-07-11 16:05:33,821][25689] Fps is (10 sec: 5562.9, 60 sec: 5533.0, 300 sec: 5550.5). Total num frames: 1292853248. Throughput: 0: 5001.1. Samples: 1292850874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:33,821][25689] Avg episode reward: [(0, '1.351')] [2022-07-11 16:05:35,391][26022] Updated weights on worker 0-0, policy_version 1262561 (0.00098) [2022-07-11 16:05:37,079][26022] Updated weights on worker 0-0, policy_version 1262571 (0.00083) [2022-07-11 16:05:38,836][25689] Fps is (10 sec: 5580.8, 60 sec: 5550.1, 300 sec: 5553.8). Total num frames: 1292881920. Throughput: 0: 5840.5. Samples: 1292884504. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:38,837][25689] Avg episode reward: [(0, '1.360')] [2022-07-11 16:05:38,970][26022] Updated weights on worker 0-0, policy_version 1262581 (0.00088) [2022-07-11 16:05:40,714][26022] Updated weights on worker 0-0, policy_version 1262591 (0.00091) [2022-07-11 16:05:42,737][26022] Updated weights on worker 0-0, policy_version 1262601 (0.00081) [2022-07-11 16:05:43,906][25689] Fps is (10 sec: 5685.7, 60 sec: 5580.0, 300 sec: 5556.9). Total num frames: 1292910592. Throughput: 0: 5841.9. Samples: 1292918272. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:43,907][25689] Avg episode reward: [(0, '1.025')] [2022-07-11 16:05:44,618][26022] Updated weights on worker 0-0, policy_version 1262611 (0.00083) [2022-07-11 16:05:46,138][26022] Updated weights on worker 0-0, policy_version 1262621 (0.00086) [2022-07-11 16:05:48,126][26022] Updated weights on worker 0-0, policy_version 1262631 (0.00085) [2022-07-11 16:05:48,912][25689] Fps is (10 sec: 5690.9, 60 sec: 5564.4, 300 sec: 5555.3). Total num frames: 1292939264. Throughput: 0: 5012.2. Samples: 1292935120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:48,913][25689] Avg episode reward: [(0, '0.697')] [2022-07-11 16:05:50,064][26022] Updated weights on worker 0-0, policy_version 1262641 (0.00086) [2022-07-11 16:05:51,752][26022] Updated weights on worker 0-0, policy_version 1262651 (0.00090) [2022-07-11 16:05:53,774][26022] Updated weights on worker 0-0, policy_version 1262661 (0.00727) [2022-07-11 16:05:54,014][25689] Fps is (10 sec: 5571.9, 60 sec: 5566.1, 300 sec: 5550.5). Total num frames: 1292966912. Throughput: 0: 5829.1. Samples: 1292968584. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:54,015][25689] Avg episode reward: [(0, '1.052')] [2022-07-11 16:05:55,366][26022] Updated weights on worker 0-0, policy_version 1262671 (0.00090) [2022-07-11 16:05:57,304][26022] Updated weights on worker 0-0, policy_version 1262681 (0.00094) [2022-07-11 16:05:59,059][25689] Fps is (10 sec: 5449.6, 60 sec: 5547.9, 300 sec: 5550.8). Total num frames: 1292994560. Throughput: 0: 5817.0. Samples: 1293002142. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:05:59,061][25689] Avg episode reward: [(0, '1.218')] [2022-07-11 16:05:59,160][26022] Updated weights on worker 0-0, policy_version 1262691 (0.00082) [2022-07-11 16:06:00,956][26022] Updated weights on worker 0-0, policy_version 1262701 (0.00084) [2022-07-11 16:06:03,117][26022] Updated weights on worker 0-0, policy_version 1262711 (0.00084) [2022-07-11 16:06:04,088][25689] Fps is (10 sec: 5489.1, 60 sec: 5581.1, 300 sec: 5553.9). Total num frames: 1293022208. Throughput: 0: 4934.8. Samples: 1293017864. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:04,090][25689] Avg episode reward: [(0, '1.785')] [2022-07-11 16:06:05,104][26022] Updated weights on worker 0-0, policy_version 1262721 (0.00084) [2022-07-11 16:06:06,901][26022] Updated weights on worker 0-0, policy_version 1262731 (0.00092) [2022-07-11 16:06:08,871][26022] Updated weights on worker 0-0, policy_version 1262741 (0.00090) [2022-07-11 16:06:09,097][25689] Fps is (10 sec: 5305.0, 60 sec: 5549.9, 300 sec: 5552.2). Total num frames: 1293047808. Throughput: 0: 5704.9. Samples: 1293050268. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:09,097][25689] Avg episode reward: [(0, '2.025')] [2022-07-11 16:06:10,431][26022] Updated weights on worker 0-0, policy_version 1262751 (0.00084) [2022-07-11 16:06:12,511][26022] Updated weights on worker 0-0, policy_version 1262761 (0.00087) [2022-07-11 16:06:14,124][26022] Updated weights on worker 0-0, policy_version 1262771 (0.00088) [2022-07-11 16:06:14,203][25689] Fps is (10 sec: 5466.6, 60 sec: 5563.9, 300 sec: 5550.7). Total num frames: 1293077504. Throughput: 0: 5728.8. Samples: 1293084244. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:14,204][25689] Avg episode reward: [(0, '2.202')] [2022-07-11 16:06:15,920][26022] Updated weights on worker 0-0, policy_version 1262781 (0.00093) [2022-07-11 16:06:17,767][26022] Updated weights on worker 0-0, policy_version 1262791 (0.00090) [2022-07-11 16:06:19,190][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:06:19,201][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001262799_1293106176.pth [2022-07-11 16:06:19,201][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001260843_1291103232.pth [2022-07-11 16:06:19,206][25689] Fps is (10 sec: 5773.5, 60 sec: 5584.3, 300 sec: 5554.4). Total num frames: 1293106176. Throughput: 0: 4923.7. Samples: 1293101340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:19,207][25689] Avg episode reward: [(0, '2.321')] [2022-07-11 16:06:19,634][26022] Updated weights on worker 0-0, policy_version 1262801 (0.00092) [2022-07-11 16:06:21,435][26022] Updated weights on worker 0-0, policy_version 1262811 (0.00093) [2022-07-11 16:06:23,248][26022] Updated weights on worker 0-0, policy_version 1262821 (0.00091) [2022-07-11 16:06:24,216][25689] Fps is (10 sec: 5522.4, 60 sec: 5549.8, 300 sec: 5551.2). Total num frames: 1293132800. Throughput: 0: 5809.0. Samples: 1293134790. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:24,217][25689] Avg episode reward: [(0, '2.076')] [2022-07-11 16:06:25,010][26022] Updated weights on worker 0-0, policy_version 1262831 (0.00084) [2022-07-11 16:06:26,957][26022] Updated weights on worker 0-0, policy_version 1262841 (0.00087) [2022-07-11 16:06:28,583][26022] Updated weights on worker 0-0, policy_version 1262851 (0.00091) [2022-07-11 16:06:29,225][25689] Fps is (10 sec: 5519.5, 60 sec: 5553.4, 300 sec: 5548.4). Total num frames: 1293161472. Throughput: 0: 5875.7. Samples: 1293168534. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:29,225][25689] Avg episode reward: [(0, '1.922')] [2022-07-11 16:06:30,612][26022] Updated weights on worker 0-0, policy_version 1262861 (0.00093) [2022-07-11 16:06:32,367][26022] Updated weights on worker 0-0, policy_version 1262871 (0.00095) [2022-07-11 16:06:34,271][26022] Updated weights on worker 0-0, policy_version 1262881 (0.00096) [2022-07-11 16:06:34,367][25689] Fps is (10 sec: 5649.3, 60 sec: 5564.3, 300 sec: 5550.2). Total num frames: 1293190144. Throughput: 0: 5008.2. Samples: 1293185228. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:34,367][25689] Avg episode reward: [(0, '2.049')] [2022-07-11 16:06:36,277][26022] Updated weights on worker 0-0, policy_version 1262891 (0.00096) [2022-07-11 16:06:37,849][26022] Updated weights on worker 0-0, policy_version 1262901 (0.00102) [2022-07-11 16:06:39,387][25689] Fps is (10 sec: 5441.2, 60 sec: 5530.0, 300 sec: 5544.7). Total num frames: 1293216768. Throughput: 0: 5802.8. Samples: 1293218444. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:39,388][25689] Avg episode reward: [(0, '1.388')] [2022-07-11 16:06:39,945][26022] Updated weights on worker 0-0, policy_version 1262911 (0.00087) [2022-07-11 16:06:41,719][26022] Updated weights on worker 0-0, policy_version 1262921 (0.00089) [2022-07-11 16:06:43,469][26022] Updated weights on worker 0-0, policy_version 1262931 (0.00088) [2022-07-11 16:06:44,415][25689] Fps is (10 sec: 5605.1, 60 sec: 5550.8, 300 sec: 5551.3). Total num frames: 1293246464. Throughput: 0: 5794.9. Samples: 1293251838. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:44,415][25689] Avg episode reward: [(0, '1.030')] [2022-07-11 16:06:45,522][26022] Updated weights on worker 0-0, policy_version 1262941 (0.00085) [2022-07-11 16:06:47,332][26022] Updated weights on worker 0-0, policy_version 1262951 (0.00088) [2022-07-11 16:06:49,058][26022] Updated weights on worker 0-0, policy_version 1262961 (0.00090) [2022-07-11 16:06:49,437][25689] Fps is (10 sec: 5807.6, 60 sec: 5549.3, 300 sec: 5551.9). Total num frames: 1293275136. Throughput: 0: 4949.3. Samples: 1293268572. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:49,438][25689] Avg episode reward: [(0, '0.843')] [2022-07-11 16:06:51,111][26022] Updated weights on worker 0-0, policy_version 1262971 (0.00088) [2022-07-11 16:06:52,568][26022] Updated weights on worker 0-0, policy_version 1262981 (0.00087) [2022-07-11 16:06:54,561][25689] Fps is (10 sec: 5449.8, 60 sec: 5530.4, 300 sec: 5539.9). Total num frames: 1293301760. Throughput: 0: 5793.9. Samples: 1293302232. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:54,562][25689] Avg episode reward: [(0, '1.122')] [2022-07-11 16:06:54,636][26022] Updated weights on worker 0-0, policy_version 1262991 (0.00086) [2022-07-11 16:06:56,315][26022] Updated weights on worker 0-0, policy_version 1263001 (0.00088) [2022-07-11 16:06:58,128][26022] Updated weights on worker 0-0, policy_version 1263011 (0.00092) [2022-07-11 16:06:59,595][25689] Fps is (10 sec: 5443.8, 60 sec: 5548.3, 300 sec: 5556.6). Total num frames: 1293330432. Throughput: 0: 5806.6. Samples: 1293335782. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:06:59,595][25689] Avg episode reward: [(0, '1.025')] [2022-07-11 16:07:00,083][26022] Updated weights on worker 0-0, policy_version 1263021 (0.00091) [2022-07-11 16:07:02,231][26022] Updated weights on worker 0-0, policy_version 1263031 (0.00091) [2022-07-11 16:07:04,043][26022] Updated weights on worker 0-0, policy_version 1263041 (0.00090) [2022-07-11 16:07:04,604][25689] Fps is (10 sec: 5608.1, 60 sec: 5550.1, 300 sec: 5546.3). Total num frames: 1293358080. Throughput: 0: 4894.0. Samples: 1293350644. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:04,604][25689] Avg episode reward: [(0, '1.171')] [2022-07-11 16:07:06,059][26022] Updated weights on worker 0-0, policy_version 1263051 (0.00086) [2022-07-11 16:07:07,622][26022] Updated weights on worker 0-0, policy_version 1263061 (0.01020) [2022-07-11 16:07:09,615][25689] Fps is (10 sec: 5314.2, 60 sec: 5549.9, 300 sec: 5547.5). Total num frames: 1293383680. Throughput: 0: 5727.2. Samples: 1293384132. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:09,617][25689] Avg episode reward: [(0, '1.624')] [2022-07-11 16:07:09,683][26022] Updated weights on worker 0-0, policy_version 1263071 (0.00066) [2022-07-11 16:07:11,324][26022] Updated weights on worker 0-0, policy_version 1263081 (0.00087) [2022-07-11 16:07:13,337][26022] Updated weights on worker 0-0, policy_version 1263091 (0.00090) [2022-07-11 16:07:14,719][25689] Fps is (10 sec: 5365.6, 60 sec: 5533.3, 300 sec: 5539.0). Total num frames: 1293412352. Throughput: 0: 5755.5. Samples: 1293418248. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:14,720][25689] Avg episode reward: [(0, '1.753')] [2022-07-11 16:07:14,983][26022] Updated weights on worker 0-0, policy_version 1263101 (0.00090) [2022-07-11 16:07:16,846][26022] Updated weights on worker 0-0, policy_version 1263111 (0.00083) [2022-07-11 16:07:18,380][26022] Updated weights on worker 0-0, policy_version 1263121 (0.00092) [2022-07-11 16:07:19,728][25689] Fps is (10 sec: 5771.6, 60 sec: 5549.6, 300 sec: 5549.6). Total num frames: 1293442048. Throughput: 0: 4941.3. Samples: 1293435266. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:19,729][25689] Avg episode reward: [(0, '1.294')] [2022-07-11 16:07:20,718][26022] Updated weights on worker 0-0, policy_version 1263131 (0.00095) [2022-07-11 16:07:22,167][26022] Updated weights on worker 0-0, policy_version 1263141 (0.00092) [2022-07-11 16:07:24,194][26022] Updated weights on worker 0-0, policy_version 1263151 (0.00084) [2022-07-11 16:07:24,735][25689] Fps is (10 sec: 5725.2, 60 sec: 5566.8, 300 sec: 5546.5). Total num frames: 1293469696. Throughput: 0: 5868.4. Samples: 1293468780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:24,736][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 16:07:25,839][26022] Updated weights on worker 0-0, policy_version 1263161 (0.00087) [2022-07-11 16:07:27,780][26022] Updated weights on worker 0-0, policy_version 1263171 (0.00090) [2022-07-11 16:07:29,369][26022] Updated weights on worker 0-0, policy_version 1263181 (0.00093) [2022-07-11 16:07:29,742][25689] Fps is (10 sec: 5623.8, 60 sec: 5566.9, 300 sec: 5548.1). Total num frames: 1293498368. Throughput: 0: 5887.7. Samples: 1293502638. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:29,746][25689] Avg episode reward: [(0, '0.742')] [2022-07-11 16:07:31,543][26022] Updated weights on worker 0-0, policy_version 1263191 (0.00091) [2022-07-11 16:07:33,335][26022] Updated weights on worker 0-0, policy_version 1263201 (0.00088) [2022-07-11 16:07:34,813][25689] Fps is (10 sec: 5588.2, 60 sec: 5556.5, 300 sec: 5551.3). Total num frames: 1293526016. Throughput: 0: 5028.0. Samples: 1293519282. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:34,813][25689] Avg episode reward: [(0, '0.848')] [2022-07-11 16:07:35,206][26022] Updated weights on worker 0-0, policy_version 1263211 (0.00085) [2022-07-11 16:07:36,735][26022] Updated weights on worker 0-0, policy_version 1263221 (0.00083) [2022-07-11 16:07:38,731][26022] Updated weights on worker 0-0, policy_version 1263231 (0.00086) [2022-07-11 16:07:39,893][25689] Fps is (10 sec: 5649.4, 60 sec: 5601.8, 300 sec: 5554.8). Total num frames: 1293555712. Throughput: 0: 5826.8. Samples: 1293552764. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:39,893][25689] Avg episode reward: [(0, '0.907')] [2022-07-11 16:07:40,527][26022] Updated weights on worker 0-0, policy_version 1263241 (0.00087) [2022-07-11 16:07:42,452][26022] Updated weights on worker 0-0, policy_version 1263251 (0.00082) [2022-07-11 16:07:44,254][26022] Updated weights on worker 0-0, policy_version 1263261 (0.00084) [2022-07-11 16:07:44,911][25689] Fps is (10 sec: 5678.7, 60 sec: 5568.9, 300 sec: 5548.0). Total num frames: 1293583360. Throughput: 0: 5835.4. Samples: 1293586518. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:44,912][25689] Avg episode reward: [(0, '0.987')] [2022-07-11 16:07:46,070][26022] Updated weights on worker 0-0, policy_version 1263271 (0.00094) [2022-07-11 16:07:47,798][26022] Updated weights on worker 0-0, policy_version 1263281 (0.00086) [2022-07-11 16:07:49,949][25689] Fps is (10 sec: 5396.8, 60 sec: 5533.6, 300 sec: 5548.4). Total num frames: 1293609984. Throughput: 0: 5806.9. Samples: 1293619978. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:49,950][25689] Avg episode reward: [(0, '1.293')] [2022-07-11 16:07:49,951][26022] Updated weights on worker 0-0, policy_version 1263291 (0.00089) [2022-07-11 16:07:51,595][26022] Updated weights on worker 0-0, policy_version 1263301 (0.00095) [2022-07-11 16:07:53,685][26022] Updated weights on worker 0-0, policy_version 1263311 (0.00094) [2022-07-11 16:07:55,002][25689] Fps is (10 sec: 5480.1, 60 sec: 5574.0, 300 sec: 5537.3). Total num frames: 1293638656. Throughput: 0: 5805.1. Samples: 1293636478. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:07:55,002][25689] Avg episode reward: [(0, '1.629')] [2022-07-11 16:07:55,293][26022] Updated weights on worker 0-0, policy_version 1263321 (0.00675) [2022-07-11 16:07:57,217][26022] Updated weights on worker 0-0, policy_version 1263331 (0.00082) [2022-07-11 16:07:59,092][26022] Updated weights on worker 0-0, policy_version 1263341 (0.00088) [2022-07-11 16:08:00,081][25689] Fps is (10 sec: 5558.8, 60 sec: 5552.9, 300 sec: 5550.6). Total num frames: 1293666304. Throughput: 0: 5809.3. Samples: 1293670042. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:00,081][25689] Avg episode reward: [(0, '1.330')] [2022-07-11 16:08:00,847][26022] Updated weights on worker 0-0, policy_version 1263351 (0.00084) [2022-07-11 16:08:03,052][26022] Updated weights on worker 0-0, policy_version 1263361 (0.00097) [2022-07-11 16:08:04,771][26022] Updated weights on worker 0-0, policy_version 1263371 (0.00090) [2022-07-11 16:08:05,100][25689] Fps is (10 sec: 5374.1, 60 sec: 5535.0, 300 sec: 5544.8). Total num frames: 1293692928. Throughput: 0: 5707.4. Samples: 1293701746. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:05,101][25689] Avg episode reward: [(0, '1.431')] [2022-07-11 16:08:06,781][26022] Updated weights on worker 0-0, policy_version 1263381 (0.00089) [2022-07-11 16:08:08,467][26022] Updated weights on worker 0-0, policy_version 1263391 (0.00094) [2022-07-11 16:08:10,119][25689] Fps is (10 sec: 5406.6, 60 sec: 5568.1, 300 sec: 5542.2). Total num frames: 1293720576. Throughput: 0: 4877.0. Samples: 1293718346. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:10,120][25689] Avg episode reward: [(0, '1.447')] [2022-07-11 16:08:10,511][26022] Updated weights on worker 0-0, policy_version 1263401 (0.00099) [2022-07-11 16:08:12,144][26022] Updated weights on worker 0-0, policy_version 1263411 (0.00086) [2022-07-11 16:08:14,190][26022] Updated weights on worker 0-0, policy_version 1263421 (0.00093) [2022-07-11 16:08:15,227][25689] Fps is (10 sec: 5561.7, 60 sec: 5567.8, 300 sec: 5548.0). Total num frames: 1293749248. Throughput: 0: 5701.3. Samples: 1293751788. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:15,227][25689] Avg episode reward: [(0, '1.565')] [2022-07-11 16:08:15,786][26022] Updated weights on worker 0-0, policy_version 1263431 (0.00093) [2022-07-11 16:08:17,781][26022] Updated weights on worker 0-0, policy_version 1263441 (0.00087) [2022-07-11 16:08:19,243][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:08:19,253][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001263450_1293772800.pth [2022-07-11 16:08:19,253][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001261496_1291771904.pth [2022-07-11 16:08:19,431][26022] Updated weights on worker 0-0, policy_version 1263451 (0.00084) [2022-07-11 16:08:20,239][25689] Fps is (10 sec: 5565.0, 60 sec: 5533.6, 300 sec: 5548.0). Total num frames: 1293776896. Throughput: 0: 5733.0. Samples: 1293785610. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:20,240][25689] Avg episode reward: [(0, '1.440')] [2022-07-11 16:08:21,309][26022] Updated weights on worker 0-0, policy_version 1263461 (0.00086) [2022-07-11 16:08:23,059][26022] Updated weights on worker 0-0, policy_version 1263471 (0.00096) [2022-07-11 16:08:24,927][26022] Updated weights on worker 0-0, policy_version 1263481 (0.00089) [2022-07-11 16:08:25,255][25689] Fps is (10 sec: 5718.5, 60 sec: 5566.7, 300 sec: 5554.9). Total num frames: 1293806592. Throughput: 0: 4998.1. Samples: 1293802480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:25,255][25689] Avg episode reward: [(0, '1.331')] [2022-07-11 16:08:26,764][26022] Updated weights on worker 0-0, policy_version 1263491 (0.00086) [2022-07-11 16:08:28,744][26022] Updated weights on worker 0-0, policy_version 1263501 (0.00100) [2022-07-11 16:08:30,278][25689] Fps is (10 sec: 5610.4, 60 sec: 5531.4, 300 sec: 5546.4). Total num frames: 1293833216. Throughput: 0: 5830.1. Samples: 1293835874. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:30,279][25689] Avg episode reward: [(0, '1.235')] [2022-07-11 16:08:30,541][26022] Updated weights on worker 0-0, policy_version 1263511 (0.00086) [2022-07-11 16:08:32,530][26022] Updated weights on worker 0-0, policy_version 1263521 (0.00086) [2022-07-11 16:08:34,137][26022] Updated weights on worker 0-0, policy_version 1263531 (0.00092) [2022-07-11 16:08:35,337][25689] Fps is (10 sec: 5382.9, 60 sec: 5532.5, 300 sec: 5547.0). Total num frames: 1293860864. Throughput: 0: 5852.2. Samples: 1293869476. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:35,337][25689] Avg episode reward: [(0, '1.464')] [2022-07-11 16:08:36,117][26022] Updated weights on worker 0-0, policy_version 1263541 (0.00091) [2022-07-11 16:08:37,864][26022] Updated weights on worker 0-0, policy_version 1263551 (0.00091) [2022-07-11 16:08:39,753][26022] Updated weights on worker 0-0, policy_version 1263561 (0.00084) [2022-07-11 16:08:40,411][25689] Fps is (10 sec: 5558.3, 60 sec: 5516.1, 300 sec: 5545.9). Total num frames: 1293889536. Throughput: 0: 4987.3. Samples: 1293886210. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:40,412][25689] Avg episode reward: [(0, '0.967')] [2022-07-11 16:08:41,535][26022] Updated weights on worker 0-0, policy_version 1263571 (0.00091) [2022-07-11 16:08:43,490][26022] Updated weights on worker 0-0, policy_version 1263581 (0.00095) [2022-07-11 16:08:45,287][26022] Updated weights on worker 0-0, policy_version 1263591 (0.00085) [2022-07-11 16:08:45,436][25689] Fps is (10 sec: 5576.8, 60 sec: 5515.5, 300 sec: 5543.0). Total num frames: 1293917184. Throughput: 0: 5810.4. Samples: 1293919742. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2022-07-11 16:08:45,438][25689] Avg episode reward: [(0, '0.583')] [2022-07-11 16:08:47,032][26022] Updated weights on worker 0-0, policy_version 1263601 (0.00087) [2022-07-11 16:08:48,892][26022] Updated weights on worker 0-0, policy_version 1263611 (0.00086) [2022-07-11 16:08:50,461][25689] Fps is (10 sec: 5603.8, 60 sec: 5550.5, 300 sec: 5547.6). Total num frames: 1293945856. Throughput: 0: 5811.8. Samples: 1293953174. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:08:50,461][25689] Avg episode reward: [(0, '0.691')] [2022-07-11 16:08:50,750][26022] Updated weights on worker 0-0, policy_version 1263621 (0.00095) [2022-07-11 16:08:52,516][26022] Updated weights on worker 0-0, policy_version 1263631 (0.00089) [2022-07-11 16:08:54,524][26022] Updated weights on worker 0-0, policy_version 1263641 (0.00082) [2022-07-11 16:08:55,507][25689] Fps is (10 sec: 5592.2, 60 sec: 5534.2, 300 sec: 5551.2). Total num frames: 1293973504. Throughput: 0: 4962.1. Samples: 1293969562. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:08:55,508][25689] Avg episode reward: [(0, '1.004')] [2022-07-11 16:08:56,155][26022] Updated weights on worker 0-0, policy_version 1263651 (0.00081) [2022-07-11 16:08:57,960][26022] Updated weights on worker 0-0, policy_version 1263661 (0.00099) [2022-07-11 16:08:59,888][26022] Updated weights on worker 0-0, policy_version 1263671 (0.00091) [2022-07-11 16:09:00,524][25689] Fps is (10 sec: 5494.8, 60 sec: 5539.9, 300 sec: 5554.9). Total num frames: 1294001152. Throughput: 0: 5840.5. Samples: 1294003684. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:00,525][25689] Avg episode reward: [(0, '0.713')] [2022-07-11 16:09:01,654][26022] Updated weights on worker 0-0, policy_version 1263681 (0.00091) [2022-07-11 16:09:03,923][26022] Updated weights on worker 0-0, policy_version 1263691 (0.00080) [2022-07-11 16:09:05,536][25689] Fps is (10 sec: 5513.9, 60 sec: 5557.6, 300 sec: 5551.9). Total num frames: 1294028800. Throughput: 0: 5755.9. Samples: 1294035434. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:05,536][25689] Avg episode reward: [(0, '0.841')] [2022-07-11 16:09:05,551][26022] Updated weights on worker 0-0, policy_version 1263701 (0.00092) [2022-07-11 16:09:07,971][26022] Updated weights on worker 0-0, policy_version 1263711 (0.00097) [2022-07-11 16:09:09,343][26022] Updated weights on worker 0-0, policy_version 1263721 (0.00092) [2022-07-11 16:09:10,541][25689] Fps is (10 sec: 5520.3, 60 sec: 5558.8, 300 sec: 5549.5). Total num frames: 1294056448. Throughput: 0: 4921.0. Samples: 1294051988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:10,542][25689] Avg episode reward: [(0, '0.946')] [2022-07-11 16:09:11,380][26022] Updated weights on worker 0-0, policy_version 1263731 (0.00053) [2022-07-11 16:09:12,998][26022] Updated weights on worker 0-0, policy_version 1263741 (0.00108) [2022-07-11 16:09:14,879][26022] Updated weights on worker 0-0, policy_version 1263751 (0.00086) [2022-07-11 16:09:15,596][25689] Fps is (10 sec: 5496.3, 60 sec: 5546.7, 300 sec: 5548.9). Total num frames: 1294084096. Throughput: 0: 5792.0. Samples: 1294085918. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:15,597][25689] Avg episode reward: [(0, '0.385')] [2022-07-11 16:09:16,642][26022] Updated weights on worker 0-0, policy_version 1263761 (0.00085) [2022-07-11 16:09:18,559][26022] Updated weights on worker 0-0, policy_version 1263771 (0.00066) [2022-07-11 16:09:20,292][26022] Updated weights on worker 0-0, policy_version 1263781 (0.00088) [2022-07-11 16:09:20,600][25689] Fps is (10 sec: 5497.2, 60 sec: 5547.5, 300 sec: 5545.8). Total num frames: 1294111744. Throughput: 0: 5788.6. Samples: 1294119896. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:20,601][25689] Avg episode reward: [(0, '0.366')] [2022-07-11 16:09:22,256][26022] Updated weights on worker 0-0, policy_version 1263791 (0.00086) [2022-07-11 16:09:23,982][26022] Updated weights on worker 0-0, policy_version 1263801 (0.00083) [2022-07-11 16:09:25,625][25689] Fps is (10 sec: 5616.0, 60 sec: 5529.6, 300 sec: 5549.2). Total num frames: 1294140416. Throughput: 0: 5041.6. Samples: 1294136716. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:25,625][25689] Avg episode reward: [(0, '0.674')] [2022-07-11 16:09:25,935][26022] Updated weights on worker 0-0, policy_version 1263811 (0.00093) [2022-07-11 16:09:27,552][26022] Updated weights on worker 0-0, policy_version 1263821 (0.00091) [2022-07-11 16:09:29,602][26022] Updated weights on worker 0-0, policy_version 1263831 (0.00092) [2022-07-11 16:09:30,626][25689] Fps is (10 sec: 5719.5, 60 sec: 5565.6, 300 sec: 5551.8). Total num frames: 1294169088. Throughput: 0: 5893.0. Samples: 1294170350. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:30,627][25689] Avg episode reward: [(0, '0.605')] [2022-07-11 16:09:31,455][26022] Updated weights on worker 0-0, policy_version 1263841 (0.00080) [2022-07-11 16:09:33,139][26022] Updated weights on worker 0-0, policy_version 1263851 (0.00092) [2022-07-11 16:09:35,019][26022] Updated weights on worker 0-0, policy_version 1263861 (0.00086) [2022-07-11 16:09:35,695][25689] Fps is (10 sec: 5592.6, 60 sec: 5564.7, 300 sec: 5550.9). Total num frames: 1294196736. Throughput: 0: 5874.5. Samples: 1294203988. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:35,697][25689] Avg episode reward: [(0, '0.577')] [2022-07-11 16:09:36,736][26022] Updated weights on worker 0-0, policy_version 1263871 (0.00090) [2022-07-11 16:09:38,875][26022] Updated weights on worker 0-0, policy_version 1263881 (0.00081) [2022-07-11 16:09:40,528][26022] Updated weights on worker 0-0, policy_version 1263891 (0.00092) [2022-07-11 16:09:40,716][25689] Fps is (10 sec: 5581.7, 60 sec: 5569.5, 300 sec: 5557.9). Total num frames: 1294225408. Throughput: 0: 5015.5. Samples: 1294220788. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:40,718][25689] Avg episode reward: [(0, '0.758')] [2022-07-11 16:09:42,509][26022] Updated weights on worker 0-0, policy_version 1263901 (0.00085) [2022-07-11 16:09:44,106][26022] Updated weights on worker 0-0, policy_version 1263911 (0.00090) [2022-07-11 16:09:45,737][25689] Fps is (10 sec: 5506.5, 60 sec: 5552.9, 300 sec: 5547.5). Total num frames: 1294252032. Throughput: 0: 5834.4. Samples: 1294254060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:45,737][25689] Avg episode reward: [(0, '1.570')] [2022-07-11 16:09:46,138][26022] Updated weights on worker 0-0, policy_version 1263921 (0.00091) [2022-07-11 16:09:47,819][26022] Updated weights on worker 0-0, policy_version 1263931 (0.00088) [2022-07-11 16:09:49,680][26022] Updated weights on worker 0-0, policy_version 1263941 (0.00363) [2022-07-11 16:09:50,825][25689] Fps is (10 sec: 5571.5, 60 sec: 5564.1, 300 sec: 5555.0). Total num frames: 1294281728. Throughput: 0: 5818.8. Samples: 1294287882. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:50,826][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 16:09:51,468][26022] Updated weights on worker 0-0, policy_version 1263951 (0.00088) [2022-07-11 16:09:53,222][26022] Updated weights on worker 0-0, policy_version 1263961 (0.00084) [2022-07-11 16:09:55,150][26022] Updated weights on worker 0-0, policy_version 1263971 (0.00087) [2022-07-11 16:09:55,884][25689] Fps is (10 sec: 5752.4, 60 sec: 5579.9, 300 sec: 5554.5). Total num frames: 1294310400. Throughput: 0: 4991.4. Samples: 1294304758. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:09:55,884][25689] Avg episode reward: [(0, '-0.419')] [2022-07-11 16:09:56,800][26022] Updated weights on worker 0-0, policy_version 1263981 (0.00086) [2022-07-11 16:09:58,637][26022] Updated weights on worker 0-0, policy_version 1263991 (0.00085) [2022-07-11 16:10:00,486][26022] Updated weights on worker 0-0, policy_version 1264001 (0.00091) [2022-07-11 16:10:00,914][25689] Fps is (10 sec: 5480.9, 60 sec: 5561.8, 300 sec: 5557.8). Total num frames: 1294337024. Throughput: 0: 5843.7. Samples: 1294338816. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:00,919][25689] Avg episode reward: [(0, '-0.586')] [2022-07-11 16:10:02,580][26022] Updated weights on worker 0-0, policy_version 1264011 (0.00091) [2022-07-11 16:10:04,679][26022] Updated weights on worker 0-0, policy_version 1264021 (0.00084) [2022-07-11 16:10:05,928][25689] Fps is (10 sec: 5403.5, 60 sec: 5561.5, 300 sec: 5558.2). Total num frames: 1294364672. Throughput: 0: 5765.7. Samples: 1294370472. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:05,928][25689] Avg episode reward: [(0, '-0.479')] [2022-07-11 16:10:06,229][26022] Updated weights on worker 0-0, policy_version 1264031 (0.00083) [2022-07-11 16:10:08,308][26022] Updated weights on worker 0-0, policy_version 1264041 (0.00091) [2022-07-11 16:10:10,035][26022] Updated weights on worker 0-0, policy_version 1264051 (0.00091) [2022-07-11 16:10:10,981][25689] Fps is (10 sec: 5492.6, 60 sec: 5557.1, 300 sec: 5555.2). Total num frames: 1294392320. Throughput: 0: 4933.0. Samples: 1294387312. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:10,982][25689] Avg episode reward: [(0, '-0.674')] [2022-07-11 16:10:12,076][26022] Updated weights on worker 0-0, policy_version 1264061 (0.00095) [2022-07-11 16:10:13,562][26022] Updated weights on worker 0-0, policy_version 1264071 (0.00097) [2022-07-11 16:10:15,504][26022] Updated weights on worker 0-0, policy_version 1264081 (0.00083) [2022-07-11 16:10:16,067][25689] Fps is (10 sec: 5554.7, 60 sec: 5571.2, 300 sec: 5557.8). Total num frames: 1294420992. Throughput: 0: 5742.6. Samples: 1294420662. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:16,067][25689] Avg episode reward: [(0, '-0.398')] [2022-07-11 16:10:17,382][26022] Updated weights on worker 0-0, policy_version 1264091 (0.00093) [2022-07-11 16:10:19,268][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:10:19,281][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001264101_1294439424.pth [2022-07-11 16:10:19,281][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001262149_1292440576.pth [2022-07-11 16:10:19,285][26022] Updated weights on worker 0-0, policy_version 1264101 (0.00089) [2022-07-11 16:10:21,083][25689] Fps is (10 sec: 5676.4, 60 sec: 5587.0, 300 sec: 5557.5). Total num frames: 1294449664. Throughput: 0: 5726.7. Samples: 1294454320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:21,084][25689] Avg episode reward: [(0, '0.616')] [2022-07-11 16:10:21,087][26022] Updated weights on worker 0-0, policy_version 1264111 (0.00081) [2022-07-11 16:10:22,944][26022] Updated weights on worker 0-0, policy_version 1264121 (0.00093) [2022-07-11 16:10:24,629][26022] Updated weights on worker 0-0, policy_version 1264131 (0.00092) [2022-07-11 16:10:26,129][25689] Fps is (10 sec: 5597.0, 60 sec: 5568.1, 300 sec: 5554.1). Total num frames: 1294477312. Throughput: 0: 4980.6. Samples: 1294471088. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:26,131][25689] Avg episode reward: [(0, '1.931')] [2022-07-11 16:10:26,735][26022] Updated weights on worker 0-0, policy_version 1264141 (0.00082) [2022-07-11 16:10:28,388][26022] Updated weights on worker 0-0, policy_version 1264151 (0.00093) [2022-07-11 16:10:30,275][26022] Updated weights on worker 0-0, policy_version 1264161 (0.00083) [2022-07-11 16:10:31,132][25689] Fps is (10 sec: 5502.8, 60 sec: 5551.1, 300 sec: 5555.5). Total num frames: 1294504960. Throughput: 0: 5809.9. Samples: 1294504386. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:31,132][25689] Avg episode reward: [(0, '1.220')] [2022-07-11 16:10:31,997][26022] Updated weights on worker 0-0, policy_version 1264171 (0.00083) [2022-07-11 16:10:33,955][26022] Updated weights on worker 0-0, policy_version 1264181 (0.00083) [2022-07-11 16:10:35,867][26022] Updated weights on worker 0-0, policy_version 1264191 (0.00097) [2022-07-11 16:10:36,253][25689] Fps is (10 sec: 5562.9, 60 sec: 5563.2, 300 sec: 5553.5). Total num frames: 1294533632. Throughput: 0: 5815.7. Samples: 1294538062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:36,254][25689] Avg episode reward: [(0, '1.183')] [2022-07-11 16:10:37,575][26022] Updated weights on worker 0-0, policy_version 1264201 (0.00093) [2022-07-11 16:10:39,487][26022] Updated weights on worker 0-0, policy_version 1264211 (0.00094) [2022-07-11 16:10:41,261][25689] Fps is (10 sec: 5560.2, 60 sec: 5547.5, 300 sec: 5551.2). Total num frames: 1294561280. Throughput: 0: 5795.8. Samples: 1294571266. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:41,262][25689] Avg episode reward: [(0, '1.569')] [2022-07-11 16:10:41,354][26022] Updated weights on worker 0-0, policy_version 1264221 (0.00085) [2022-07-11 16:10:43,334][26022] Updated weights on worker 0-0, policy_version 1264231 (0.00086) [2022-07-11 16:10:44,908][26022] Updated weights on worker 0-0, policy_version 1264241 (0.00091) [2022-07-11 16:10:46,296][25689] Fps is (10 sec: 5505.9, 60 sec: 5563.1, 300 sec: 5547.3). Total num frames: 1294588928. Throughput: 0: 5788.3. Samples: 1294587822. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:46,298][25689] Avg episode reward: [(0, '1.045')] [2022-07-11 16:10:46,948][26022] Updated weights on worker 0-0, policy_version 1264251 (0.00089) [2022-07-11 16:10:48,680][26022] Updated weights on worker 0-0, policy_version 1264261 (0.00089) [2022-07-11 16:10:50,528][26022] Updated weights on worker 0-0, policy_version 1264271 (0.00088) [2022-07-11 16:10:51,308][25689] Fps is (10 sec: 5605.5, 60 sec: 5553.1, 300 sec: 5552.4). Total num frames: 1294617600. Throughput: 0: 5810.3. Samples: 1294621618. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:51,310][25689] Avg episode reward: [(0, '0.533')] [2022-07-11 16:10:52,331][26022] Updated weights on worker 0-0, policy_version 1264281 (0.00608) [2022-07-11 16:10:54,350][26022] Updated weights on worker 0-0, policy_version 1264291 (0.00091) [2022-07-11 16:10:55,944][26022] Updated weights on worker 0-0, policy_version 1264301 (0.00087) [2022-07-11 16:10:56,362][25689] Fps is (10 sec: 5595.6, 60 sec: 5536.7, 300 sec: 5552.2). Total num frames: 1294645248. Throughput: 0: 5823.3. Samples: 1294655158. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:10:56,362][25689] Avg episode reward: [(0, '0.251')] [2022-07-11 16:10:57,910][26022] Updated weights on worker 0-0, policy_version 1264311 (0.00095) [2022-07-11 16:10:59,508][26022] Updated weights on worker 0-0, policy_version 1264321 (0.00084) [2022-07-11 16:11:01,400][25689] Fps is (10 sec: 5479.3, 60 sec: 5552.8, 300 sec: 5552.0). Total num frames: 1294672896. Throughput: 0: 5003.8. Samples: 1294672038. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:01,401][25689] Avg episode reward: [(0, '0.681')] [2022-07-11 16:11:01,777][26022] Updated weights on worker 0-0, policy_version 1264331 (0.00099) [2022-07-11 16:11:03,627][26022] Updated weights on worker 0-0, policy_version 1264341 (0.00088) [2022-07-11 16:11:05,649][26022] Updated weights on worker 0-0, policy_version 1264351 (0.00090) [2022-07-11 16:11:06,418][25689] Fps is (10 sec: 5396.9, 60 sec: 5535.6, 300 sec: 5555.3). Total num frames: 1294699520. Throughput: 0: 5745.4. Samples: 1294703426. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:06,418][25689] Avg episode reward: [(0, '0.575')] [2022-07-11 16:11:07,270][26022] Updated weights on worker 0-0, policy_version 1264361 (0.00096) [2022-07-11 16:11:09,234][26022] Updated weights on worker 0-0, policy_version 1264371 (0.00088) [2022-07-11 16:11:11,021][26022] Updated weights on worker 0-0, policy_version 1264381 (0.00095) [2022-07-11 16:11:11,458][25689] Fps is (10 sec: 5395.9, 60 sec: 5536.8, 300 sec: 5549.7). Total num frames: 1294727168. Throughput: 0: 5724.4. Samples: 1294736964. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:11,459][25689] Avg episode reward: [(0, '0.680')] [2022-07-11 16:11:12,954][26022] Updated weights on worker 0-0, policy_version 1264391 (0.00105) [2022-07-11 16:11:14,909][26022] Updated weights on worker 0-0, policy_version 1264401 (0.00081) [2022-07-11 16:11:16,520][25689] Fps is (10 sec: 5575.2, 60 sec: 5539.0, 300 sec: 5548.6). Total num frames: 1294755840. Throughput: 0: 4877.6. Samples: 1294753480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:16,520][25689] Avg episode reward: [(0, '1.119')] [2022-07-11 16:11:16,540][26022] Updated weights on worker 0-0, policy_version 1264411 (0.00085) [2022-07-11 16:11:18,520][26022] Updated weights on worker 0-0, policy_version 1264421 (0.00082) [2022-07-11 16:11:20,485][26022] Updated weights on worker 0-0, policy_version 1264431 (0.00084) [2022-07-11 16:11:21,539][25689] Fps is (10 sec: 5688.2, 60 sec: 5538.7, 300 sec: 5555.3). Total num frames: 1294784512. Throughput: 0: 5711.6. Samples: 1294787064. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:21,540][25689] Avg episode reward: [(0, '1.565')] [2022-07-11 16:11:22,060][26022] Updated weights on worker 0-0, policy_version 1264441 (0.00090) [2022-07-11 16:11:24,066][26022] Updated weights on worker 0-0, policy_version 1264451 (0.00103) [2022-07-11 16:11:25,684][26022] Updated weights on worker 0-0, policy_version 1264461 (0.00083) [2022-07-11 16:11:26,555][25689] Fps is (10 sec: 5510.3, 60 sec: 5524.6, 300 sec: 5548.3). Total num frames: 1294811136. Throughput: 0: 5820.9. Samples: 1294820642. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:26,555][25689] Avg episode reward: [(0, '1.770')] [2022-07-11 16:11:27,705][26022] Updated weights on worker 0-0, policy_version 1264471 (0.00086) [2022-07-11 16:11:29,634][26022] Updated weights on worker 0-0, policy_version 1264481 (0.00085) [2022-07-11 16:11:31,259][26022] Updated weights on worker 0-0, policy_version 1264491 (0.00084) [2022-07-11 16:11:31,575][25689] Fps is (10 sec: 5510.1, 60 sec: 5539.9, 300 sec: 5550.5). Total num frames: 1294839808. Throughput: 0: 4990.5. Samples: 1294837356. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:31,575][25689] Avg episode reward: [(0, '1.806')] [2022-07-11 16:11:33,124][26022] Updated weights on worker 0-0, policy_version 1264501 (0.00091) [2022-07-11 16:11:34,937][26022] Updated weights on worker 0-0, policy_version 1264511 (0.01082) [2022-07-11 16:11:36,623][25689] Fps is (10 sec: 5695.6, 60 sec: 5546.6, 300 sec: 5556.9). Total num frames: 1294868480. Throughput: 0: 5845.7. Samples: 1294870998. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:36,623][25689] Avg episode reward: [(0, '2.119')] [2022-07-11 16:11:36,755][26022] Updated weights on worker 0-0, policy_version 1264521 (0.00095) [2022-07-11 16:11:38,760][26022] Updated weights on worker 0-0, policy_version 1264531 (0.00051) [2022-07-11 16:11:40,477][26022] Updated weights on worker 0-0, policy_version 1264541 (0.00089) [2022-07-11 16:11:41,647][25689] Fps is (10 sec: 5489.9, 60 sec: 5528.2, 300 sec: 5546.6). Total num frames: 1294895104. Throughput: 0: 5829.6. Samples: 1294904284. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:41,647][25689] Avg episode reward: [(0, '1.585')] [2022-07-11 16:11:42,439][26022] Updated weights on worker 0-0, policy_version 1264551 (0.00068) [2022-07-11 16:11:44,154][26022] Updated weights on worker 0-0, policy_version 1264561 (0.00093) [2022-07-11 16:11:45,967][26022] Updated weights on worker 0-0, policy_version 1264571 (0.00092) [2022-07-11 16:11:46,650][25689] Fps is (10 sec: 5412.6, 60 sec: 5531.2, 300 sec: 5543.6). Total num frames: 1294922752. Throughput: 0: 4990.3. Samples: 1294920924. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:46,650][25689] Avg episode reward: [(0, '0.874')] [2022-07-11 16:11:47,863][26022] Updated weights on worker 0-0, policy_version 1264581 (0.00086) [2022-07-11 16:11:49,908][26022] Updated weights on worker 0-0, policy_version 1264591 (0.00087) [2022-07-11 16:11:51,482][26022] Updated weights on worker 0-0, policy_version 1264601 (0.00085) [2022-07-11 16:11:51,659][25689] Fps is (10 sec: 5727.7, 60 sec: 5548.4, 300 sec: 5556.1). Total num frames: 1294952448. Throughput: 0: 5814.8. Samples: 1294954140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:51,659][25689] Avg episode reward: [(0, '0.787')] [2022-07-11 16:11:53,454][26022] Updated weights on worker 0-0, policy_version 1264611 (0.00082) [2022-07-11 16:11:54,950][26022] Updated weights on worker 0-0, policy_version 1264621 (0.00100) [2022-07-11 16:11:56,763][25689] Fps is (10 sec: 5568.9, 60 sec: 5526.7, 300 sec: 5547.8). Total num frames: 1294979072. Throughput: 0: 5787.3. Samples: 1294987556. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:11:56,764][25689] Avg episode reward: [(0, '-0.044')] [2022-07-11 16:11:57,220][26022] Updated weights on worker 0-0, policy_version 1264631 (0.00085) [2022-07-11 16:11:59,235][26022] Updated weights on worker 0-0, policy_version 1264641 (0.00085) [2022-07-11 16:12:00,762][26022] Updated weights on worker 0-0, policy_version 1264651 (0.00084) [2022-07-11 16:12:01,806][25689] Fps is (10 sec: 5348.4, 60 sec: 5526.4, 300 sec: 5547.2). Total num frames: 1295006720. Throughput: 0: 4958.5. Samples: 1295004242. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:01,808][25689] Avg episode reward: [(0, '-0.049')] [2022-07-11 16:12:03,123][26022] Updated weights on worker 0-0, policy_version 1264661 (0.00093) [2022-07-11 16:12:04,723][26022] Updated weights on worker 0-0, policy_version 1264671 (0.00084) [2022-07-11 16:12:06,817][25689] Fps is (10 sec: 5398.4, 60 sec: 5527.0, 300 sec: 5550.7). Total num frames: 1295033344. Throughput: 0: 5683.1. Samples: 1295035534. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:06,817][25689] Avg episode reward: [(0, '-0.105')] [2022-07-11 16:12:06,835][26022] Updated weights on worker 0-0, policy_version 1264681 (0.00074) [2022-07-11 16:12:08,291][26022] Updated weights on worker 0-0, policy_version 1264691 (0.00083) [2022-07-11 16:12:10,518][26022] Updated weights on worker 0-0, policy_version 1264701 (0.00083) [2022-07-11 16:12:11,841][25689] Fps is (10 sec: 5510.0, 60 sec: 5545.4, 300 sec: 5552.1). Total num frames: 1295062016. Throughput: 0: 5698.9. Samples: 1295069160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:11,842][25689] Avg episode reward: [(0, '-0.085')] [2022-07-11 16:12:12,215][26022] Updated weights on worker 0-0, policy_version 1264711 (0.00103) [2022-07-11 16:12:14,170][26022] Updated weights on worker 0-0, policy_version 1264721 (0.00094) [2022-07-11 16:12:15,933][26022] Updated weights on worker 0-0, policy_version 1264731 (0.00089) [2022-07-11 16:12:16,915][25689] Fps is (10 sec: 5678.3, 60 sec: 5544.2, 300 sec: 5547.5). Total num frames: 1295090688. Throughput: 0: 4876.5. Samples: 1295085828. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:16,916][25689] Avg episode reward: [(0, '-0.043')] [2022-07-11 16:12:17,762][26022] Updated weights on worker 0-0, policy_version 1264741 (0.00096) [2022-07-11 16:12:19,512][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:12:19,526][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001264751_1295105024.pth [2022-07-11 16:12:19,527][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001262799_1293106176.pth [2022-07-11 16:12:19,532][26022] Updated weights on worker 0-0, policy_version 1264751 (0.00091) [2022-07-11 16:12:21,462][26022] Updated weights on worker 0-0, policy_version 1264761 (0.00089) [2022-07-11 16:12:21,930][25689] Fps is (10 sec: 5582.8, 60 sec: 5527.8, 300 sec: 5547.3). Total num frames: 1295118336. Throughput: 0: 5723.0. Samples: 1295119410. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:21,930][25689] Avg episode reward: [(0, '0.012')] [2022-07-11 16:12:23,177][26022] Updated weights on worker 0-0, policy_version 1264771 (0.00084) [2022-07-11 16:12:24,958][26022] Updated weights on worker 0-0, policy_version 1264781 (0.00087) [2022-07-11 16:12:26,728][26022] Updated weights on worker 0-0, policy_version 1264791 (0.00093) [2022-07-11 16:12:26,963][25689] Fps is (10 sec: 5503.7, 60 sec: 5543.1, 300 sec: 5543.4). Total num frames: 1295145984. Throughput: 0: 5821.1. Samples: 1295152806. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:26,963][25689] Avg episode reward: [(0, '0.770')] [2022-07-11 16:12:28,768][26022] Updated weights on worker 0-0, policy_version 1264801 (0.00087) [2022-07-11 16:12:30,482][26022] Updated weights on worker 0-0, policy_version 1264811 (0.00092) [2022-07-11 16:12:31,975][25689] Fps is (10 sec: 5504.4, 60 sec: 5526.8, 300 sec: 5544.5). Total num frames: 1295173632. Throughput: 0: 4986.9. Samples: 1295169566. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:31,976][25689] Avg episode reward: [(0, '0.580')] [2022-07-11 16:12:32,539][26022] Updated weights on worker 0-0, policy_version 1264821 (0.00087) [2022-07-11 16:12:34,403][26022] Updated weights on worker 0-0, policy_version 1264831 (0.00350) [2022-07-11 16:12:36,158][26022] Updated weights on worker 0-0, policy_version 1264841 (0.00090) [2022-07-11 16:12:37,080][25689] Fps is (10 sec: 5566.4, 60 sec: 5521.6, 300 sec: 5540.6). Total num frames: 1295202304. Throughput: 0: 5809.9. Samples: 1295202984. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:37,081][25689] Avg episode reward: [(0, '0.428')] [2022-07-11 16:12:37,847][26022] Updated weights on worker 0-0, policy_version 1264851 (0.00092) [2022-07-11 16:12:39,775][26022] Updated weights on worker 0-0, policy_version 1264861 (0.00091) [2022-07-11 16:12:41,803][26022] Updated weights on worker 0-0, policy_version 1264871 (0.00087) [2022-07-11 16:12:42,126][25689] Fps is (10 sec: 5548.3, 60 sec: 5536.6, 300 sec: 5540.1). Total num frames: 1295229952. Throughput: 0: 5792.7. Samples: 1295236404. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2022-07-11 16:12:42,127][25689] Avg episode reward: [(0, '0.697')] [2022-07-11 16:12:43,326][26022] Updated weights on worker 0-0, policy_version 1264881 (0.00087) [2022-07-11 16:12:45,345][26022] Updated weights on worker 0-0, policy_version 1264891 (0.00089) [2022-07-11 16:12:47,129][25689] Fps is (10 sec: 5604.7, 60 sec: 5553.5, 300 sec: 5547.6). Total num frames: 1295258624. Throughput: 0: 4968.5. Samples: 1295253004. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:12:47,130][25689] Avg episode reward: [(0, '1.526')] [2022-07-11 16:12:47,135][26022] Updated weights on worker 0-0, policy_version 1264901 (0.00085) [2022-07-11 16:12:49,185][26022] Updated weights on worker 0-0, policy_version 1264911 (0.00080) [2022-07-11 16:12:50,922][26022] Updated weights on worker 0-0, policy_version 1264921 (0.00111) [2022-07-11 16:12:52,148][25689] Fps is (10 sec: 5517.4, 60 sec: 5501.8, 300 sec: 5541.3). Total num frames: 1295285248. Throughput: 0: 5778.8. Samples: 1295286142. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:12:52,149][25689] Avg episode reward: [(0, '1.798')] [2022-07-11 16:12:52,655][26022] Updated weights on worker 0-0, policy_version 1264931 (0.00089) [2022-07-11 16:12:54,539][26022] Updated weights on worker 0-0, policy_version 1264941 (0.00084) [2022-07-11 16:12:56,665][26022] Updated weights on worker 0-0, policy_version 1264951 (0.00092) [2022-07-11 16:12:57,190][25689] Fps is (10 sec: 5292.3, 60 sec: 5507.4, 300 sec: 5538.6). Total num frames: 1295311872. Throughput: 0: 5795.4. Samples: 1295319530. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:12:57,191][25689] Avg episode reward: [(0, '1.819')] [2022-07-11 16:12:58,206][26022] Updated weights on worker 0-0, policy_version 1264961 (0.00082) [2022-07-11 16:13:00,384][26022] Updated weights on worker 0-0, policy_version 1264971 (0.00090) [2022-07-11 16:13:01,720][26022] Updated weights on worker 0-0, policy_version 1264981 (0.00091) [2022-07-11 16:13:02,205][25689] Fps is (10 sec: 5498.6, 60 sec: 5527.0, 300 sec: 5545.6). Total num frames: 1295340544. Throughput: 0: 4979.4. Samples: 1295336382. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:02,205][25689] Avg episode reward: [(0, '2.012')] [2022-07-11 16:13:04,288][26022] Updated weights on worker 0-0, policy_version 1264991 (0.00086) [2022-07-11 16:13:05,903][26022] Updated weights on worker 0-0, policy_version 1265001 (0.00089) [2022-07-11 16:13:07,215][25689] Fps is (10 sec: 5516.2, 60 sec: 5527.1, 300 sec: 5542.3). Total num frames: 1295367168. Throughput: 0: 5716.8. Samples: 1295367830. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:07,215][25689] Avg episode reward: [(0, '2.297')] [2022-07-11 16:13:07,836][26022] Updated weights on worker 0-0, policy_version 1265011 (0.00092) [2022-07-11 16:13:09,683][26022] Updated weights on worker 0-0, policy_version 1265021 (0.00085) [2022-07-11 16:13:11,373][26022] Updated weights on worker 0-0, policy_version 1265031 (0.00088) [2022-07-11 16:13:12,251][25689] Fps is (10 sec: 5504.1, 60 sec: 5526.0, 300 sec: 5543.6). Total num frames: 1295395840. Throughput: 0: 5733.2. Samples: 1295401396. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:12,252][25689] Avg episode reward: [(0, '2.276')] [2022-07-11 16:13:13,402][26022] Updated weights on worker 0-0, policy_version 1265041 (0.00091) [2022-07-11 16:13:15,246][26022] Updated weights on worker 0-0, policy_version 1265051 (0.00091) [2022-07-11 16:13:16,952][26022] Updated weights on worker 0-0, policy_version 1265061 (0.00098) [2022-07-11 16:13:17,321][25689] Fps is (10 sec: 5572.8, 60 sec: 5509.4, 300 sec: 5542.6). Total num frames: 1295423488. Throughput: 0: 4883.4. Samples: 1295417836. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:17,322][25689] Avg episode reward: [(0, '1.990')] [2022-07-11 16:13:18,970][26022] Updated weights on worker 0-0, policy_version 1265071 (0.00095) [2022-07-11 16:13:20,827][26022] Updated weights on worker 0-0, policy_version 1265081 (0.00086) [2022-07-11 16:13:22,334][25689] Fps is (10 sec: 5585.6, 60 sec: 5526.5, 300 sec: 5539.1). Total num frames: 1295452160. Throughput: 0: 5699.4. Samples: 1295451108. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:22,336][25689] Avg episode reward: [(0, '1.836')] [2022-07-11 16:13:22,534][26022] Updated weights on worker 0-0, policy_version 1265091 (0.00094) [2022-07-11 16:13:24,515][26022] Updated weights on worker 0-0, policy_version 1265101 (0.00091) [2022-07-11 16:13:26,189][26022] Updated weights on worker 0-0, policy_version 1265111 (0.00090) [2022-07-11 16:13:27,355][25689] Fps is (10 sec: 5511.1, 60 sec: 5510.6, 300 sec: 5539.2). Total num frames: 1295478784. Throughput: 0: 5779.7. Samples: 1295484232. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:27,355][25689] Avg episode reward: [(0, '1.993')] [2022-07-11 16:13:28,221][26022] Updated weights on worker 0-0, policy_version 1265121 (0.00087) [2022-07-11 16:13:30,076][26022] Updated weights on worker 0-0, policy_version 1265131 (0.00092) [2022-07-11 16:13:31,745][26022] Updated weights on worker 0-0, policy_version 1265141 (0.00093) [2022-07-11 16:13:32,432][25689] Fps is (10 sec: 5476.0, 60 sec: 5521.7, 300 sec: 5542.3). Total num frames: 1295507456. Throughput: 0: 4932.6. Samples: 1295500940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:32,433][25689] Avg episode reward: [(0, '1.576')] [2022-07-11 16:13:33,798][26022] Updated weights on worker 0-0, policy_version 1265151 (0.00091) [2022-07-11 16:13:35,491][26022] Updated weights on worker 0-0, policy_version 1265161 (0.00090) [2022-07-11 16:13:37,403][26022] Updated weights on worker 0-0, policy_version 1265171 (0.00086) [2022-07-11 16:13:37,543][25689] Fps is (10 sec: 5527.9, 60 sec: 5504.2, 300 sec: 5538.2). Total num frames: 1295535104. Throughput: 0: 5767.0. Samples: 1295534456. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:37,544][25689] Avg episode reward: [(0, '0.968')] [2022-07-11 16:13:39,082][26022] Updated weights on worker 0-0, policy_version 1265181 (0.00092) [2022-07-11 16:13:41,118][26022] Updated weights on worker 0-0, policy_version 1265191 (0.00084) [2022-07-11 16:13:42,591][25689] Fps is (10 sec: 5544.1, 60 sec: 5521.0, 300 sec: 5541.2). Total num frames: 1295563776. Throughput: 0: 5761.8. Samples: 1295567822. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:42,591][25689] Avg episode reward: [(0, '0.687')] [2022-07-11 16:13:42,799][26022] Updated weights on worker 0-0, policy_version 1265201 (0.00089) [2022-07-11 16:13:44,942][26022] Updated weights on worker 0-0, policy_version 1265211 (0.00093) [2022-07-11 16:13:46,470][26022] Updated weights on worker 0-0, policy_version 1265221 (0.00090) [2022-07-11 16:13:47,599][25689] Fps is (10 sec: 5498.7, 60 sec: 5486.6, 300 sec: 5534.6). Total num frames: 1295590400. Throughput: 0: 4957.8. Samples: 1295584600. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:47,600][25689] Avg episode reward: [(0, '0.226')] [2022-07-11 16:13:48,553][26022] Updated weights on worker 0-0, policy_version 1265231 (0.00091) [2022-07-11 16:13:50,308][26022] Updated weights on worker 0-0, policy_version 1265241 (0.00087) [2022-07-11 16:13:52,151][26022] Updated weights on worker 0-0, policy_version 1265251 (0.00094) [2022-07-11 16:13:52,603][25689] Fps is (10 sec: 5625.0, 60 sec: 5538.8, 300 sec: 5542.3). Total num frames: 1295620096. Throughput: 0: 5787.2. Samples: 1295617674. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:52,604][25689] Avg episode reward: [(0, '0.111')] [2022-07-11 16:13:54,102][26022] Updated weights on worker 0-0, policy_version 1265261 (0.00091) [2022-07-11 16:13:55,753][26022] Updated weights on worker 0-0, policy_version 1265271 (0.00085) [2022-07-11 16:13:57,715][25689] Fps is (10 sec: 5567.7, 60 sec: 5532.5, 300 sec: 5537.1). Total num frames: 1295646720. Throughput: 0: 5775.0. Samples: 1295650948. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:13:57,717][25689] Avg episode reward: [(0, '0.320')] [2022-07-11 16:13:57,792][26022] Updated weights on worker 0-0, policy_version 1265281 (0.00085) [2022-07-11 16:13:59,453][26022] Updated weights on worker 0-0, policy_version 1265291 (0.00105) [2022-07-11 16:14:01,355][26022] Updated weights on worker 0-0, policy_version 1265301 (0.00084) [2022-07-11 16:14:02,726][25689] Fps is (10 sec: 5260.3, 60 sec: 5498.9, 300 sec: 5533.6). Total num frames: 1295673344. Throughput: 0: 5693.5. Samples: 1295682462. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:02,728][25689] Avg episode reward: [(0, '0.048')] [2022-07-11 16:14:03,521][26022] Updated weights on worker 0-0, policy_version 1265311 (0.00096) [2022-07-11 16:14:05,502][26022] Updated weights on worker 0-0, policy_version 1265321 (0.00079) [2022-07-11 16:14:07,176][26022] Updated weights on worker 0-0, policy_version 1265331 (0.00093) [2022-07-11 16:14:07,767][25689] Fps is (10 sec: 5501.0, 60 sec: 5529.9, 300 sec: 5536.4). Total num frames: 1295702016. Throughput: 0: 5694.9. Samples: 1295699452. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:07,768][25689] Avg episode reward: [(0, '-0.264')] [2022-07-11 16:14:09,178][26022] Updated weights on worker 0-0, policy_version 1265341 (0.00084) [2022-07-11 16:14:10,749][26022] Updated weights on worker 0-0, policy_version 1265351 (0.00085) [2022-07-11 16:14:12,794][25689] Fps is (10 sec: 5390.5, 60 sec: 5480.0, 300 sec: 5530.1). Total num frames: 1295727616. Throughput: 0: 5706.4. Samples: 1295732890. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:12,795][25689] Avg episode reward: [(0, '0.152')] [2022-07-11 16:14:12,915][26022] Updated weights on worker 0-0, policy_version 1265361 (0.00090) [2022-07-11 16:14:14,436][26022] Updated weights on worker 0-0, policy_version 1265371 (0.00088) [2022-07-11 16:14:16,442][26022] Updated weights on worker 0-0, policy_version 1265381 (0.00084) [2022-07-11 16:14:17,834][25689] Fps is (10 sec: 5594.5, 60 sec: 5533.5, 300 sec: 5539.7). Total num frames: 1295758336. Throughput: 0: 5737.8. Samples: 1295766388. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:17,836][25689] Avg episode reward: [(0, '0.527')] [2022-07-11 16:14:18,460][26022] Updated weights on worker 0-0, policy_version 1265391 (0.00083) [2022-07-11 16:14:19,582][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:14:19,592][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001265398_1295767552.pth [2022-07-11 16:14:19,592][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001263450_1293772800.pth [2022-07-11 16:14:20,151][26022] Updated weights on worker 0-0, policy_version 1265401 (0.00081) [2022-07-11 16:14:22,001][26022] Updated weights on worker 0-0, policy_version 1265411 (0.00080) [2022-07-11 16:14:22,848][25689] Fps is (10 sec: 5805.5, 60 sec: 5516.5, 300 sec: 5536.5). Total num frames: 1295785984. Throughput: 0: 4990.7. Samples: 1295782886. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:22,848][25689] Avg episode reward: [(0, '0.695')] [2022-07-11 16:14:23,768][26022] Updated weights on worker 0-0, policy_version 1265421 (0.00086) [2022-07-11 16:14:25,727][26022] Updated weights on worker 0-0, policy_version 1265431 (0.00090) [2022-07-11 16:14:27,587][26022] Updated weights on worker 0-0, policy_version 1265441 (0.00097) [2022-07-11 16:14:27,882][25689] Fps is (10 sec: 5401.2, 60 sec: 5515.3, 300 sec: 5529.0). Total num frames: 1295812608. Throughput: 0: 5793.8. Samples: 1295815994. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:27,883][25689] Avg episode reward: [(0, '0.782')] [2022-07-11 16:14:29,414][26022] Updated weights on worker 0-0, policy_version 1265451 (0.00099) [2022-07-11 16:14:31,140][26022] Updated weights on worker 0-0, policy_version 1265461 (0.00086) [2022-07-11 16:14:32,906][25689] Fps is (10 sec: 5497.8, 60 sec: 5520.2, 300 sec: 5533.3). Total num frames: 1295841280. Throughput: 0: 5806.6. Samples: 1295849670. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:32,907][25689] Avg episode reward: [(0, '1.925')] [2022-07-11 16:14:33,139][26022] Updated weights on worker 0-0, policy_version 1265471 (0.00086) [2022-07-11 16:14:34,837][26022] Updated weights on worker 0-0, policy_version 1265481 (0.00095) [2022-07-11 16:14:36,637][26022] Updated weights on worker 0-0, policy_version 1265491 (0.00085) [2022-07-11 16:14:38,045][25689] Fps is (10 sec: 5642.5, 60 sec: 5534.5, 300 sec: 5531.1). Total num frames: 1295869952. Throughput: 0: 4955.1. Samples: 1295866536. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:38,047][25689] Avg episode reward: [(0, '1.743')] [2022-07-11 16:14:38,627][26022] Updated weights on worker 0-0, policy_version 1265501 (0.00090) [2022-07-11 16:14:40,396][26022] Updated weights on worker 0-0, policy_version 1265511 (0.00073) [2022-07-11 16:14:42,386][26022] Updated weights on worker 0-0, policy_version 1265521 (0.00088) [2022-07-11 16:14:43,090][25689] Fps is (10 sec: 5429.5, 60 sec: 5500.9, 300 sec: 5530.6). Total num frames: 1295896576. Throughput: 0: 5771.0. Samples: 1295899704. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:43,091][25689] Avg episode reward: [(0, '1.836')] [2022-07-11 16:14:43,983][26022] Updated weights on worker 0-0, policy_version 1265531 (0.00086) [2022-07-11 16:14:46,214][26022] Updated weights on worker 0-0, policy_version 1265541 (0.00098) [2022-07-11 16:14:47,627][26022] Updated weights on worker 0-0, policy_version 1265551 (0.00099) [2022-07-11 16:14:48,158][25689] Fps is (10 sec: 5467.6, 60 sec: 5529.3, 300 sec: 5527.5). Total num frames: 1295925248. Throughput: 0: 5749.8. Samples: 1295932578. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:48,159][25689] Avg episode reward: [(0, '1.689')] [2022-07-11 16:14:49,869][26022] Updated weights on worker 0-0, policy_version 1265561 (0.00096) [2022-07-11 16:14:51,428][26022] Updated weights on worker 0-0, policy_version 1265571 (0.00089) [2022-07-11 16:14:53,168][25689] Fps is (10 sec: 5588.5, 60 sec: 5495.0, 300 sec: 5525.0). Total num frames: 1295952896. Throughput: 0: 4905.0. Samples: 1295949056. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:53,168][25689] Avg episode reward: [(0, '1.737')] [2022-07-11 16:14:53,369][26022] Updated weights on worker 0-0, policy_version 1265581 (0.00051) [2022-07-11 16:14:55,063][26022] Updated weights on worker 0-0, policy_version 1265591 (0.00095) [2022-07-11 16:14:57,185][26022] Updated weights on worker 0-0, policy_version 1265601 (0.00092) [2022-07-11 16:14:58,281][25689] Fps is (10 sec: 5665.0, 60 sec: 5545.6, 300 sec: 5533.8). Total num frames: 1295982592. Throughput: 0: 5735.1. Samples: 1295982590. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:14:58,281][25689] Avg episode reward: [(0, '1.698')] [2022-07-11 16:14:58,890][26022] Updated weights on worker 0-0, policy_version 1265611 (0.00095) [2022-07-11 16:15:00,843][26022] Updated weights on worker 0-0, policy_version 1265621 (0.00089) [2022-07-11 16:15:03,017][26022] Updated weights on worker 0-0, policy_version 1265631 (0.00093) [2022-07-11 16:15:03,353][25689] Fps is (10 sec: 5328.7, 60 sec: 5506.2, 300 sec: 5522.4). Total num frames: 1296007168. Throughput: 0: 5631.9. Samples: 1296013822. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:03,353][25689] Avg episode reward: [(0, '1.427')] [2022-07-11 16:15:04,928][26022] Updated weights on worker 0-0, policy_version 1265641 (0.00092) [2022-07-11 16:15:06,657][26022] Updated weights on worker 0-0, policy_version 1265651 (0.00086) [2022-07-11 16:15:08,382][25689] Fps is (10 sec: 5271.6, 60 sec: 5507.3, 300 sec: 5526.3). Total num frames: 1296035840. Throughput: 0: 4851.3. Samples: 1296030688. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:08,382][25689] Avg episode reward: [(0, '1.297')] [2022-07-11 16:15:08,668][26022] Updated weights on worker 0-0, policy_version 1265661 (0.00081) [2022-07-11 16:15:10,135][26022] Updated weights on worker 0-0, policy_version 1265671 (0.00086) [2022-07-11 16:15:12,269][26022] Updated weights on worker 0-0, policy_version 1265681 (0.00083) [2022-07-11 16:15:13,403][25689] Fps is (10 sec: 5807.9, 60 sec: 5575.4, 300 sec: 5530.9). Total num frames: 1296065536. Throughput: 0: 5704.6. Samples: 1296064488. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:13,403][25689] Avg episode reward: [(0, '1.484')] [2022-07-11 16:15:13,913][26022] Updated weights on worker 0-0, policy_version 1265691 (0.00091) [2022-07-11 16:15:15,843][26022] Updated weights on worker 0-0, policy_version 1265701 (0.00082) [2022-07-11 16:15:17,625][26022] Updated weights on worker 0-0, policy_version 1265711 (0.00087) [2022-07-11 16:15:18,528][25689] Fps is (10 sec: 5651.4, 60 sec: 5516.9, 300 sec: 5525.5). Total num frames: 1296093184. Throughput: 0: 5707.0. Samples: 1296098144. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:18,529][25689] Avg episode reward: [(0, '0.876')] [2022-07-11 16:15:19,438][26022] Updated weights on worker 0-0, policy_version 1265721 (0.00352) [2022-07-11 16:15:21,463][26022] Updated weights on worker 0-0, policy_version 1265731 (0.00084) [2022-07-11 16:15:23,048][26022] Updated weights on worker 0-0, policy_version 1265741 (0.00091) [2022-07-11 16:15:23,532][25689] Fps is (10 sec: 5560.0, 60 sec: 5534.8, 300 sec: 5529.7). Total num frames: 1296121856. Throughput: 0: 5010.1. Samples: 1296114922. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:23,533][25689] Avg episode reward: [(0, '-0.604')] [2022-07-11 16:15:25,033][26022] Updated weights on worker 0-0, policy_version 1265751 (0.00081) [2022-07-11 16:15:26,951][26022] Updated weights on worker 0-0, policy_version 1265761 (0.00091) [2022-07-11 16:15:28,502][26022] Updated weights on worker 0-0, policy_version 1265771 (0.00090) [2022-07-11 16:15:28,547][25689] Fps is (10 sec: 5621.6, 60 sec: 5553.4, 300 sec: 5529.5). Total num frames: 1296149504. Throughput: 0: 5832.0. Samples: 1296148292. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:28,547][25689] Avg episode reward: [(0, '-1.562')] [2022-07-11 16:15:30,751][26022] Updated weights on worker 0-0, policy_version 1265781 (0.00099) [2022-07-11 16:15:32,195][26022] Updated weights on worker 0-0, policy_version 1265791 (0.00085) [2022-07-11 16:15:33,619][25689] Fps is (10 sec: 5380.5, 60 sec: 5515.3, 300 sec: 5523.5). Total num frames: 1296176128. Throughput: 0: 5813.4. Samples: 1296182014. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:33,620][25689] Avg episode reward: [(0, '-1.802')] [2022-07-11 16:15:34,152][26022] Updated weights on worker 0-0, policy_version 1265801 (0.00083) [2022-07-11 16:15:35,956][26022] Updated weights on worker 0-0, policy_version 1265811 (0.00088) [2022-07-11 16:15:37,689][26022] Updated weights on worker 0-0, policy_version 1265821 (0.00088) [2022-07-11 16:15:38,760][25689] Fps is (10 sec: 5414.3, 60 sec: 5515.1, 300 sec: 5524.5). Total num frames: 1296204800. Throughput: 0: 4981.2. Samples: 1296198922. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:38,760][25689] Avg episode reward: [(0, '-1.861')] [2022-07-11 16:15:39,636][26022] Updated weights on worker 0-0, policy_version 1265831 (0.00080) [2022-07-11 16:15:41,494][26022] Updated weights on worker 0-0, policy_version 1265841 (0.00086) [2022-07-11 16:15:43,273][26022] Updated weights on worker 0-0, policy_version 1265851 (0.00082) [2022-07-11 16:15:43,827][25689] Fps is (10 sec: 5717.5, 60 sec: 5563.6, 300 sec: 5530.7). Total num frames: 1296234496. Throughput: 0: 5791.1. Samples: 1296232454. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:43,828][25689] Avg episode reward: [(0, '-1.379')] [2022-07-11 16:15:45,467][26022] Updated weights on worker 0-0, policy_version 1265861 (0.00092) [2022-07-11 16:15:46,851][26022] Updated weights on worker 0-0, policy_version 1265871 (0.00079) [2022-07-11 16:15:48,873][25689] Fps is (10 sec: 5670.4, 60 sec: 5548.8, 300 sec: 5526.7). Total num frames: 1296262144. Throughput: 0: 5780.6. Samples: 1296265788. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:48,873][25689] Avg episode reward: [(0, '-1.158')] [2022-07-11 16:15:48,877][26022] Updated weights on worker 0-0, policy_version 1265881 (0.00082) [2022-07-11 16:15:50,519][26022] Updated weights on worker 0-0, policy_version 1265891 (0.00088) [2022-07-11 16:15:52,355][26022] Updated weights on worker 0-0, policy_version 1265901 (0.00095) [2022-07-11 16:15:53,940][25689] Fps is (10 sec: 5468.1, 60 sec: 5543.6, 300 sec: 5526.4). Total num frames: 1296289792. Throughput: 0: 5777.7. Samples: 1296299424. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:53,940][25689] Avg episode reward: [(0, '0.050')] [2022-07-11 16:15:54,602][26022] Updated weights on worker 0-0, policy_version 1265911 (0.00105) [2022-07-11 16:15:56,190][26022] Updated weights on worker 0-0, policy_version 1265921 (0.00088) [2022-07-11 16:15:58,050][26022] Updated weights on worker 0-0, policy_version 1265931 (0.00098) [2022-07-11 16:15:58,979][25689] Fps is (10 sec: 5572.7, 60 sec: 5533.4, 300 sec: 5529.9). Total num frames: 1296318464. Throughput: 0: 5797.2. Samples: 1296316138. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:15:58,980][25689] Avg episode reward: [(0, '1.110')] [2022-07-11 16:15:59,736][26022] Updated weights on worker 0-0, policy_version 1265941 (0.00090) [2022-07-11 16:16:01,997][26022] Updated weights on worker 0-0, policy_version 1265951 (0.00091) [2022-07-11 16:16:03,992][25689] Fps is (10 sec: 5399.1, 60 sec: 5555.7, 300 sec: 5526.5). Total num frames: 1296344064. Throughput: 0: 5708.8. Samples: 1296347570. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:03,992][25689] Avg episode reward: [(0, '1.743')] [2022-07-11 16:16:03,996][26022] Updated weights on worker 0-0, policy_version 1265961 (0.00090) [2022-07-11 16:16:05,795][26022] Updated weights on worker 0-0, policy_version 1265971 (0.00097) [2022-07-11 16:16:07,485][26022] Updated weights on worker 0-0, policy_version 1265981 (0.00086) [2022-07-11 16:16:08,995][25689] Fps is (10 sec: 5316.3, 60 sec: 5541.2, 300 sec: 5527.2). Total num frames: 1296371712. Throughput: 0: 5732.5. Samples: 1296381140. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:08,996][25689] Avg episode reward: [(0, '1.146')] [2022-07-11 16:16:09,404][26022] Updated weights on worker 0-0, policy_version 1265991 (0.00619) [2022-07-11 16:16:11,447][26022] Updated weights on worker 0-0, policy_version 1266001 (0.00090) [2022-07-11 16:16:13,091][26022] Updated weights on worker 0-0, policy_version 1266011 (0.00089) [2022-07-11 16:16:14,004][25689] Fps is (10 sec: 5625.1, 60 sec: 5525.4, 300 sec: 5528.2). Total num frames: 1296400384. Throughput: 0: 4910.3. Samples: 1296397946. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:14,005][25689] Avg episode reward: [(0, '1.402')] [2022-07-11 16:16:15,078][26022] Updated weights on worker 0-0, policy_version 1266021 (0.00089) [2022-07-11 16:16:16,691][26022] Updated weights on worker 0-0, policy_version 1266031 (0.00085) [2022-07-11 16:16:18,713][26022] Updated weights on worker 0-0, policy_version 1266041 (0.00084) [2022-07-11 16:16:19,083][25689] Fps is (10 sec: 5684.7, 60 sec: 5546.6, 300 sec: 5527.1). Total num frames: 1296429056. Throughput: 0: 5732.3. Samples: 1296431376. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:19,083][25689] Avg episode reward: [(0, '0.995')] [2022-07-11 16:16:19,673][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:16:19,689][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001266047_1296432128.pth [2022-07-11 16:16:19,690][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001264101_1294439424.pth [2022-07-11 16:16:20,322][26022] Updated weights on worker 0-0, policy_version 1266051 (0.00080) [2022-07-11 16:16:22,215][26022] Updated weights on worker 0-0, policy_version 1266061 (0.00085) [2022-07-11 16:16:24,087][25689] Fps is (10 sec: 5484.1, 60 sec: 5512.7, 300 sec: 5527.3). Total num frames: 1296455680. Throughput: 0: 5863.7. Samples: 1296465402. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:24,088][25689] Avg episode reward: [(0, '0.515')] [2022-07-11 16:16:24,116][26022] Updated weights on worker 0-0, policy_version 1266071 (0.00090) [2022-07-11 16:16:25,999][26022] Updated weights on worker 0-0, policy_version 1266081 (0.00090) [2022-07-11 16:16:27,650][26022] Updated weights on worker 0-0, policy_version 1266091 (0.00091) [2022-07-11 16:16:29,117][25689] Fps is (10 sec: 5510.8, 60 sec: 5528.3, 300 sec: 5527.1). Total num frames: 1296484352. Throughput: 0: 5029.6. Samples: 1296482344. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:29,117][25689] Avg episode reward: [(0, '0.269')] [2022-07-11 16:16:29,722][26022] Updated weights on worker 0-0, policy_version 1266101 (0.00090) [2022-07-11 16:16:31,363][26022] Updated weights on worker 0-0, policy_version 1266111 (0.00087) [2022-07-11 16:16:33,247][26022] Updated weights on worker 0-0, policy_version 1266121 (0.00083) [2022-07-11 16:16:34,118][25689] Fps is (10 sec: 5614.6, 60 sec: 5551.7, 300 sec: 5524.5). Total num frames: 1296512000. Throughput: 0: 5856.0. Samples: 1296515734. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:34,119][25689] Avg episode reward: [(0, '0.000')] [2022-07-11 16:16:35,120][26022] Updated weights on worker 0-0, policy_version 1266131 (0.00086) [2022-07-11 16:16:36,884][26022] Updated weights on worker 0-0, policy_version 1266141 (0.00093) [2022-07-11 16:16:38,715][26022] Updated weights on worker 0-0, policy_version 1266151 (0.00086) [2022-07-11 16:16:39,220][25689] Fps is (10 sec: 5574.4, 60 sec: 5555.2, 300 sec: 5530.0). Total num frames: 1296540672. Throughput: 0: 5860.3. Samples: 1296549388. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:39,221][25689] Avg episode reward: [(0, '0.599')] [2022-07-11 16:16:40,518][26022] Updated weights on worker 0-0, policy_version 1266161 (0.00087) [2022-07-11 16:16:42,231][26022] Updated weights on worker 0-0, policy_version 1266171 (0.00086) [2022-07-11 16:16:44,056][26022] Updated weights on worker 0-0, policy_version 1266181 (0.00078) [2022-07-11 16:16:44,233][25689] Fps is (10 sec: 5669.3, 60 sec: 5543.3, 300 sec: 5533.2). Total num frames: 1296569344. Throughput: 0: 5010.1. Samples: 1296566334. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2022-07-11 16:16:44,233][25689] Avg episode reward: [(0, '0.637')] [2022-07-11 16:16:46,098][26022] Updated weights on worker 0-0, policy_version 1266191 (0.00088) [2022-07-11 16:16:47,733][26022] Updated weights on worker 0-0, policy_version 1266201 (0.00089) [2022-07-11 16:16:49,288][25689] Fps is (10 sec: 5492.4, 60 sec: 5525.5, 300 sec: 5522.0). Total num frames: 1296595968. Throughput: 0: 5829.5. Samples: 1296599932. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:16:49,288][25689] Avg episode reward: [(0, '0.662')] [2022-07-11 16:16:49,774][26022] Updated weights on worker 0-0, policy_version 1266211 (0.00785) [2022-07-11 16:16:51,334][26022] Updated weights on worker 0-0, policy_version 1266221 (0.00082) [2022-07-11 16:16:53,452][26022] Updated weights on worker 0-0, policy_version 1266231 (0.00116) [2022-07-11 16:16:54,307][25689] Fps is (10 sec: 5692.4, 60 sec: 5580.8, 300 sec: 5537.4). Total num frames: 1296626688. Throughput: 0: 5831.3. Samples: 1296633460. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:16:54,307][25689] Avg episode reward: [(0, '1.191')] [2022-07-11 16:16:55,099][26022] Updated weights on worker 0-0, policy_version 1266241 (0.00093) [2022-07-11 16:16:57,110][26022] Updated weights on worker 0-0, policy_version 1266251 (0.00078) [2022-07-11 16:16:58,935][26022] Updated weights on worker 0-0, policy_version 1266261 (0.00090) [2022-07-11 16:16:59,383][25689] Fps is (10 sec: 5680.6, 60 sec: 5543.5, 300 sec: 5533.3). Total num frames: 1296653312. Throughput: 0: 4997.4. Samples: 1296650148. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:16:59,383][25689] Avg episode reward: [(0, '1.476')] [2022-07-11 16:17:00,631][26022] Updated weights on worker 0-0, policy_version 1266271 (0.00077) [2022-07-11 16:17:03,087][26022] Updated weights on worker 0-0, policy_version 1266281 (0.00116) [2022-07-11 16:17:04,420][25689] Fps is (10 sec: 5163.7, 60 sec: 5541.2, 300 sec: 5529.4). Total num frames: 1296678912. Throughput: 0: 5705.0. Samples: 1296681502. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:04,421][25689] Avg episode reward: [(0, '1.225')] [2022-07-11 16:17:04,833][26022] Updated weights on worker 0-0, policy_version 1266291 (0.00611) [2022-07-11 16:17:06,563][26022] Updated weights on worker 0-0, policy_version 1266301 (0.00079) [2022-07-11 16:17:08,608][26022] Updated weights on worker 0-0, policy_version 1266311 (0.00097) [2022-07-11 16:17:09,435][25689] Fps is (10 sec: 5501.0, 60 sec: 5574.1, 300 sec: 5533.0). Total num frames: 1296708608. Throughput: 0: 5702.3. Samples: 1296714816. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:09,435][25689] Avg episode reward: [(0, '1.562')] [2022-07-11 16:17:10,372][26022] Updated weights on worker 0-0, policy_version 1266321 (0.00081) [2022-07-11 16:17:12,248][26022] Updated weights on worker 0-0, policy_version 1266331 (0.00086) [2022-07-11 16:17:13,996][26022] Updated weights on worker 0-0, policy_version 1266341 (0.00087) [2022-07-11 16:17:14,475][25689] Fps is (10 sec: 5702.9, 60 sec: 5554.2, 300 sec: 5530.2). Total num frames: 1296736256. Throughput: 0: 4879.0. Samples: 1296731864. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:14,476][25689] Avg episode reward: [(0, '1.896')] [2022-07-11 16:17:15,766][26022] Updated weights on worker 0-0, policy_version 1266351 (0.00093) [2022-07-11 16:17:17,506][26022] Updated weights on worker 0-0, policy_version 1266361 (0.00088) [2022-07-11 16:17:19,505][26022] Updated weights on worker 0-0, policy_version 1266371 (0.00087) [2022-07-11 16:17:19,539][25689] Fps is (10 sec: 5472.7, 60 sec: 5538.7, 300 sec: 5529.3). Total num frames: 1296763904. Throughput: 0: 5708.2. Samples: 1296765202. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:19,539][25689] Avg episode reward: [(0, '1.628')] [2022-07-11 16:17:21,249][26022] Updated weights on worker 0-0, policy_version 1266381 (0.00085) [2022-07-11 16:17:23,229][26022] Updated weights on worker 0-0, policy_version 1266391 (0.00085) [2022-07-11 16:17:24,543][25689] Fps is (10 sec: 5594.3, 60 sec: 5572.6, 300 sec: 5533.3). Total num frames: 1296792576. Throughput: 0: 5839.2. Samples: 1296799002. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:24,543][25689] Avg episode reward: [(0, '1.372')] [2022-07-11 16:17:24,912][26022] Updated weights on worker 0-0, policy_version 1266401 (0.00097) [2022-07-11 16:17:26,685][26022] Updated weights on worker 0-0, policy_version 1266411 (0.00081) [2022-07-11 16:17:28,599][26022] Updated weights on worker 0-0, policy_version 1266421 (0.00083) [2022-07-11 16:17:29,604][25689] Fps is (10 sec: 5595.2, 60 sec: 5552.7, 300 sec: 5532.4). Total num frames: 1296820224. Throughput: 0: 5004.8. Samples: 1296815762. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:29,605][25689] Avg episode reward: [(0, '1.450')] [2022-07-11 16:17:30,432][26022] Updated weights on worker 0-0, policy_version 1266431 (0.00095) [2022-07-11 16:17:32,320][26022] Updated weights on worker 0-0, policy_version 1266441 (0.00085) [2022-07-11 16:17:34,250][26022] Updated weights on worker 0-0, policy_version 1266451 (0.00091) [2022-07-11 16:17:34,607][25689] Fps is (10 sec: 5392.7, 60 sec: 5535.7, 300 sec: 5527.4). Total num frames: 1296846848. Throughput: 0: 5831.6. Samples: 1296849262. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:34,607][25689] Avg episode reward: [(0, '1.810')] [2022-07-11 16:17:35,824][26022] Updated weights on worker 0-0, policy_version 1266461 (0.00092) [2022-07-11 16:17:37,856][26022] Updated weights on worker 0-0, policy_version 1266471 (0.00095) [2022-07-11 16:17:39,421][26022] Updated weights on worker 0-0, policy_version 1266481 (0.00085) [2022-07-11 16:17:39,728][25689] Fps is (10 sec: 5664.3, 60 sec: 5567.8, 300 sec: 5536.3). Total num frames: 1296877568. Throughput: 0: 5834.3. Samples: 1296882994. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:39,729][25689] Avg episode reward: [(0, '1.338')] [2022-07-11 16:17:41,524][26022] Updated weights on worker 0-0, policy_version 1266491 (0.00088) [2022-07-11 16:17:43,106][26022] Updated weights on worker 0-0, policy_version 1266501 (0.00094) [2022-07-11 16:17:44,795][25689] Fps is (10 sec: 5729.3, 60 sec: 5545.9, 300 sec: 5531.7). Total num frames: 1296905216. Throughput: 0: 4981.6. Samples: 1296899888. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:44,796][25689] Avg episode reward: [(0, '0.683')] [2022-07-11 16:17:45,062][26022] Updated weights on worker 0-0, policy_version 1266511 (0.00092) [2022-07-11 16:17:46,685][26022] Updated weights on worker 0-0, policy_version 1266521 (0.00599) [2022-07-11 16:17:48,816][26022] Updated weights on worker 0-0, policy_version 1266531 (0.00092) [2022-07-11 16:17:49,829][25689] Fps is (10 sec: 5575.6, 60 sec: 5581.6, 300 sec: 5538.3). Total num frames: 1296933888. Throughput: 0: 5825.2. Samples: 1296933576. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:49,833][25689] Avg episode reward: [(0, '0.598')] [2022-07-11 16:17:50,560][26022] Updated weights on worker 0-0, policy_version 1266541 (0.00081) [2022-07-11 16:17:52,464][26022] Updated weights on worker 0-0, policy_version 1266551 (0.00090) [2022-07-11 16:17:53,987][26022] Updated weights on worker 0-0, policy_version 1266561 (0.00086) [2022-07-11 16:17:54,896][25689] Fps is (10 sec: 5575.5, 60 sec: 5526.5, 300 sec: 5541.3). Total num frames: 1296961536. Throughput: 0: 5822.6. Samples: 1296967396. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:54,897][25689] Avg episode reward: [(0, '1.237')] [2022-07-11 16:17:56,144][26022] Updated weights on worker 0-0, policy_version 1266571 (0.00089) [2022-07-11 16:17:57,811][26022] Updated weights on worker 0-0, policy_version 1266581 (0.00088) [2022-07-11 16:17:59,515][26022] Updated weights on worker 0-0, policy_version 1266591 (0.00097) [2022-07-11 16:17:59,947][25689] Fps is (10 sec: 5566.3, 60 sec: 5562.6, 300 sec: 5540.6). Total num frames: 1296990208. Throughput: 0: 5010.6. Samples: 1296984304. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:17:59,948][25689] Avg episode reward: [(0, '0.245')] [2022-07-11 16:18:02,020][26022] Updated weights on worker 0-0, policy_version 1266601 (0.00083) [2022-07-11 16:18:03,599][26022] Updated weights on worker 0-0, policy_version 1266611 (0.00086) [2022-07-11 16:18:04,971][25689] Fps is (10 sec: 5386.7, 60 sec: 5563.9, 300 sec: 5536.9). Total num frames: 1297015808. Throughput: 0: 5752.4. Samples: 1297015952. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:04,973][25689] Avg episode reward: [(0, '0.418')] [2022-07-11 16:18:05,610][26022] Updated weights on worker 0-0, policy_version 1266621 (0.00090) [2022-07-11 16:18:07,220][26022] Updated weights on worker 0-0, policy_version 1266631 (0.00093) [2022-07-11 16:18:09,129][26022] Updated weights on worker 0-0, policy_version 1266641 (0.00088) [2022-07-11 16:18:09,985][25689] Fps is (10 sec: 5508.6, 60 sec: 5563.9, 300 sec: 5540.7). Total num frames: 1297045504. Throughput: 0: 5769.6. Samples: 1297049868. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:09,986][25689] Avg episode reward: [(0, '0.270')] [2022-07-11 16:18:10,914][26022] Updated weights on worker 0-0, policy_version 1266651 (0.00089) [2022-07-11 16:18:12,788][26022] Updated weights on worker 0-0, policy_version 1266661 (0.00088) [2022-07-11 16:18:14,521][26022] Updated weights on worker 0-0, policy_version 1266671 (0.00083) [2022-07-11 16:18:15,013][25689] Fps is (10 sec: 5710.7, 60 sec: 5565.1, 300 sec: 5541.5). Total num frames: 1297073152. Throughput: 0: 4933.0. Samples: 1297066632. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:15,014][25689] Avg episode reward: [(0, '0.540')] [2022-07-11 16:18:16,535][26022] Updated weights on worker 0-0, policy_version 1266681 (0.00085) [2022-07-11 16:18:18,163][26022] Updated weights on worker 0-0, policy_version 1266691 (0.00084) [2022-07-11 16:18:19,744][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:18:19,753][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001266698_1297098752.pth [2022-07-11 16:18:19,753][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001264751_1295105024.pth [2022-07-11 16:18:20,065][25689] Fps is (10 sec: 5485.7, 60 sec: 5566.1, 300 sec: 5537.3). Total num frames: 1297100800. Throughput: 0: 5775.1. Samples: 1297100488. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:20,067][25689] Avg episode reward: [(0, '0.563')] [2022-07-11 16:18:20,225][26022] Updated weights on worker 0-0, policy_version 1266701 (0.00088) [2022-07-11 16:18:21,876][26022] Updated weights on worker 0-0, policy_version 1266711 (0.00093) [2022-07-11 16:18:23,678][26022] Updated weights on worker 0-0, policy_version 1266721 (0.00052) [2022-07-11 16:18:25,073][25689] Fps is (10 sec: 5496.1, 60 sec: 5548.8, 300 sec: 5541.0). Total num frames: 1297128448. Throughput: 0: 5865.6. Samples: 1297133864. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:25,075][25689] Avg episode reward: [(0, '0.537')] [2022-07-11 16:18:25,494][26022] Updated weights on worker 0-0, policy_version 1266731 (0.00087) [2022-07-11 16:18:27,513][26022] Updated weights on worker 0-0, policy_version 1266741 (0.00093) [2022-07-11 16:18:29,382][26022] Updated weights on worker 0-0, policy_version 1266751 (0.00087) [2022-07-11 16:18:30,083][25689] Fps is (10 sec: 5622.2, 60 sec: 5570.5, 300 sec: 5542.3). Total num frames: 1297157120. Throughput: 0: 5009.8. Samples: 1297150552. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:30,083][25689] Avg episode reward: [(0, '0.466')] [2022-07-11 16:18:31,119][26022] Updated weights on worker 0-0, policy_version 1266761 (0.00096) [2022-07-11 16:18:32,933][26022] Updated weights on worker 0-0, policy_version 1266771 (0.00091) [2022-07-11 16:18:34,962][26022] Updated weights on worker 0-0, policy_version 1266781 (0.00102) [2022-07-11 16:18:35,095][25689] Fps is (10 sec: 5517.9, 60 sec: 5569.7, 300 sec: 5540.7). Total num frames: 1297183744. Throughput: 0: 5832.2. Samples: 1297183754. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:35,095][25689] Avg episode reward: [(0, '-0.533')] [2022-07-11 16:18:36,619][26022] Updated weights on worker 0-0, policy_version 1266791 (0.00107) [2022-07-11 16:18:38,801][26022] Updated weights on worker 0-0, policy_version 1266801 (0.00083) [2022-07-11 16:18:40,168][25689] Fps is (10 sec: 5584.2, 60 sec: 5557.1, 300 sec: 5543.6). Total num frames: 1297213440. Throughput: 0: 5809.6. Samples: 1297217278. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:40,170][25689] Avg episode reward: [(0, '-0.071')] [2022-07-11 16:18:40,325][26022] Updated weights on worker 0-0, policy_version 1266811 (0.00084) [2022-07-11 16:18:42,317][26022] Updated weights on worker 0-0, policy_version 1266821 (0.00084) [2022-07-11 16:18:43,990][26022] Updated weights on worker 0-0, policy_version 1266831 (0.00110) [2022-07-11 16:18:45,197][25689] Fps is (10 sec: 5676.2, 60 sec: 5560.6, 300 sec: 5546.7). Total num frames: 1297241088. Throughput: 0: 4974.6. Samples: 1297233972. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:45,198][25689] Avg episode reward: [(0, '0.419')] [2022-07-11 16:18:45,998][26022] Updated weights on worker 0-0, policy_version 1266841 (0.00084) [2022-07-11 16:18:47,670][26022] Updated weights on worker 0-0, policy_version 1266851 (0.00098) [2022-07-11 16:18:49,782][26022] Updated weights on worker 0-0, policy_version 1266861 (0.00093) [2022-07-11 16:18:50,220][25689] Fps is (10 sec: 5399.3, 60 sec: 5527.7, 300 sec: 5536.0). Total num frames: 1297267712. Throughput: 0: 5805.5. Samples: 1297267458. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:50,220][25689] Avg episode reward: [(0, '0.682')] [2022-07-11 16:18:51,384][26022] Updated weights on worker 0-0, policy_version 1266871 (0.00085) [2022-07-11 16:18:53,439][26022] Updated weights on worker 0-0, policy_version 1266881 (0.00422) [2022-07-11 16:18:55,254][26022] Updated weights on worker 0-0, policy_version 1266891 (0.00086) [2022-07-11 16:18:55,271][25689] Fps is (10 sec: 5387.6, 60 sec: 5529.2, 300 sec: 5540.6). Total num frames: 1297295360. Throughput: 0: 5789.5. Samples: 1297300564. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:18:55,271][25689] Avg episode reward: [(0, '0.822')] [2022-07-11 16:18:57,159][26022] Updated weights on worker 0-0, policy_version 1266901 (0.00088) [2022-07-11 16:18:58,793][26022] Updated weights on worker 0-0, policy_version 1266911 (0.00087) [2022-07-11 16:19:00,410][25689] Fps is (10 sec: 5527.0, 60 sec: 5521.2, 300 sec: 5545.1). Total num frames: 1297324032. Throughput: 0: 5751.9. Samples: 1297333706. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:00,411][25689] Avg episode reward: [(0, '1.174')] [2022-07-11 16:19:00,819][26022] Updated weights on worker 0-0, policy_version 1266921 (0.00082) [2022-07-11 16:19:02,698][26022] Updated weights on worker 0-0, policy_version 1266931 (0.00092) [2022-07-11 16:19:04,886][26022] Updated weights on worker 0-0, policy_version 1266941 (0.00089) [2022-07-11 16:19:05,443][25689] Fps is (10 sec: 5435.9, 60 sec: 5537.2, 300 sec: 5538.4). Total num frames: 1297350656. Throughput: 0: 5651.8. Samples: 1297348398. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:05,444][25689] Avg episode reward: [(0, '2.060')] [2022-07-11 16:19:06,354][26022] Updated weights on worker 0-0, policy_version 1266951 (0.00083) [2022-07-11 16:19:08,520][26022] Updated weights on worker 0-0, policy_version 1266961 (0.00094) [2022-07-11 16:19:10,326][26022] Updated weights on worker 0-0, policy_version 1266971 (0.00092) [2022-07-11 16:19:10,451][25689] Fps is (10 sec: 5507.2, 60 sec: 5520.9, 300 sec: 5549.0). Total num frames: 1297379328. Throughput: 0: 5662.9. Samples: 1297382022. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:10,452][25689] Avg episode reward: [(0, '1.248')] [2022-07-11 16:19:12,031][26022] Updated weights on worker 0-0, policy_version 1266981 (0.00086) [2022-07-11 16:19:13,959][26022] Updated weights on worker 0-0, policy_version 1266991 (0.00090) [2022-07-11 16:19:15,455][25689] Fps is (10 sec: 5625.5, 60 sec: 5523.0, 300 sec: 5539.4). Total num frames: 1297406976. Throughput: 0: 5691.1. Samples: 1297415432. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:15,460][25689] Avg episode reward: [(0, '0.715')] [2022-07-11 16:19:15,850][26022] Updated weights on worker 0-0, policy_version 1267001 (0.00098) [2022-07-11 16:19:17,590][26022] Updated weights on worker 0-0, policy_version 1267011 (0.00090) [2022-07-11 16:19:19,548][26022] Updated weights on worker 0-0, policy_version 1267021 (0.00087) [2022-07-11 16:19:20,538][25689] Fps is (10 sec: 5481.6, 60 sec: 5520.2, 300 sec: 5538.1). Total num frames: 1297434624. Throughput: 0: 4888.3. Samples: 1297432098. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:20,540][25689] Avg episode reward: [(0, '0.362')] [2022-07-11 16:19:21,166][26022] Updated weights on worker 0-0, policy_version 1267031 (0.00083) [2022-07-11 16:19:23,238][26022] Updated weights on worker 0-0, policy_version 1267041 (0.00088) [2022-07-11 16:19:24,909][26022] Updated weights on worker 0-0, policy_version 1267051 (0.00084) [2022-07-11 16:19:25,571][25689] Fps is (10 sec: 5465.8, 60 sec: 5517.9, 300 sec: 5541.6). Total num frames: 1297462272. Throughput: 0: 5823.0. Samples: 1297465604. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:25,573][25689] Avg episode reward: [(0, '0.539')] [2022-07-11 16:19:26,804][26022] Updated weights on worker 0-0, policy_version 1267061 (0.00105) [2022-07-11 16:19:28,853][26022] Updated weights on worker 0-0, policy_version 1267071 (0.00086) [2022-07-11 16:19:30,576][25689] Fps is (10 sec: 5610.7, 60 sec: 5518.4, 300 sec: 5541.9). Total num frames: 1297490944. Throughput: 0: 5802.6. Samples: 1297498802. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:30,577][25689] Avg episode reward: [(0, '0.274')] [2022-07-11 16:19:30,578][26022] Updated weights on worker 0-0, policy_version 1267081 (0.00096) [2022-07-11 16:19:32,384][26022] Updated weights on worker 0-0, policy_version 1267091 (0.00090) [2022-07-11 16:19:34,139][26022] Updated weights on worker 0-0, policy_version 1267101 (0.00093) [2022-07-11 16:19:35,595][25689] Fps is (10 sec: 5618.9, 60 sec: 5534.7, 300 sec: 5540.7). Total num frames: 1297518592. Throughput: 0: 4971.3. Samples: 1297515554. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:35,596][25689] Avg episode reward: [(0, '0.608')] [2022-07-11 16:19:35,996][26022] Updated weights on worker 0-0, policy_version 1267111 (0.00084) [2022-07-11 16:19:37,975][26022] Updated weights on worker 0-0, policy_version 1267121 (0.00087) [2022-07-11 16:19:39,624][26022] Updated weights on worker 0-0, policy_version 1267131 (0.00094) [2022-07-11 16:19:40,714][25689] Fps is (10 sec: 5555.3, 60 sec: 5513.6, 300 sec: 5546.2). Total num frames: 1297547264. Throughput: 0: 5791.2. Samples: 1297548942. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:40,715][25689] Avg episode reward: [(0, '0.503')] [2022-07-11 16:19:41,791][26022] Updated weights on worker 0-0, policy_version 1267141 (0.00083) [2022-07-11 16:19:43,383][26022] Updated weights on worker 0-0, policy_version 1267151 (0.00092) [2022-07-11 16:19:45,352][26022] Updated weights on worker 0-0, policy_version 1267161 (0.00093) [2022-07-11 16:19:45,757][25689] Fps is (10 sec: 5542.1, 60 sec: 5512.3, 300 sec: 5543.3). Total num frames: 1297574912. Throughput: 0: 5796.9. Samples: 1297582618. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:45,757][25689] Avg episode reward: [(0, '0.575')] [2022-07-11 16:19:46,843][26022] Updated weights on worker 0-0, policy_version 1267171 (0.00083) [2022-07-11 16:19:49,042][26022] Updated weights on worker 0-0, policy_version 1267181 (0.00103) [2022-07-11 16:19:50,750][26022] Updated weights on worker 0-0, policy_version 1267191 (0.00092) [2022-07-11 16:19:50,846][25689] Fps is (10 sec: 5558.7, 60 sec: 5540.0, 300 sec: 5545.2). Total num frames: 1297603584. Throughput: 0: 4968.1. Samples: 1297599506. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:50,847][25689] Avg episode reward: [(0, '0.513')] [2022-07-11 16:19:52,595][26022] Updated weights on worker 0-0, policy_version 1267201 (0.00098) [2022-07-11 16:19:54,427][26022] Updated weights on worker 0-0, policy_version 1267211 (0.00089) [2022-07-11 16:19:55,943][25689] Fps is (10 sec: 5629.7, 60 sec: 5552.7, 300 sec: 5542.1). Total num frames: 1297632256. Throughput: 0: 5773.3. Samples: 1297633030. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:19:55,944][25689] Avg episode reward: [(0, '0.823')] [2022-07-11 16:19:56,195][26022] Updated weights on worker 0-0, policy_version 1267221 (0.00092) [2022-07-11 16:19:58,235][26022] Updated weights on worker 0-0, policy_version 1267231 (0.00096) [2022-07-11 16:19:59,928][26022] Updated weights on worker 0-0, policy_version 1267241 (0.00089) [2022-07-11 16:20:00,987][25689] Fps is (10 sec: 5554.0, 60 sec: 5544.6, 300 sec: 5552.9). Total num frames: 1297659904. Throughput: 0: 5785.2. Samples: 1297666222. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:00,987][25689] Avg episode reward: [(0, '0.562')] [2022-07-11 16:20:02,320][26022] Updated weights on worker 0-0, policy_version 1267251 (0.00485) [2022-07-11 16:20:04,145][26022] Updated weights on worker 0-0, policy_version 1267261 (0.00094) [2022-07-11 16:20:05,887][26022] Updated weights on worker 0-0, policy_version 1267271 (0.01242) [2022-07-11 16:20:06,056][25689] Fps is (10 sec: 5265.3, 60 sec: 5524.4, 300 sec: 5541.8). Total num frames: 1297685504. Throughput: 0: 4826.0. Samples: 1297680582. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:06,056][25689] Avg episode reward: [(0, '1.314')] [2022-07-11 16:20:07,829][26022] Updated weights on worker 0-0, policy_version 1267281 (0.00079) [2022-07-11 16:20:09,450][26022] Updated weights on worker 0-0, policy_version 1267291 (0.00088) [2022-07-11 16:20:11,074][25689] Fps is (10 sec: 5177.0, 60 sec: 5489.6, 300 sec: 5531.6). Total num frames: 1297712128. Throughput: 0: 5658.8. Samples: 1297713972. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:11,075][25689] Avg episode reward: [(0, '1.450')] [2022-07-11 16:20:11,610][26022] Updated weights on worker 0-0, policy_version 1267301 (0.00091) [2022-07-11 16:20:13,165][26022] Updated weights on worker 0-0, policy_version 1267311 (0.00088) [2022-07-11 16:20:15,250][26022] Updated weights on worker 0-0, policy_version 1267321 (0.00094) [2022-07-11 16:20:16,100][25689] Fps is (10 sec: 5709.3, 60 sec: 5538.3, 300 sec: 5543.8). Total num frames: 1297742848. Throughput: 0: 5676.1. Samples: 1297747444. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:16,100][25689] Avg episode reward: [(0, '1.622')] [2022-07-11 16:20:17,201][26022] Updated weights on worker 0-0, policy_version 1267331 (0.00050) [2022-07-11 16:20:18,879][26022] Updated weights on worker 0-0, policy_version 1267341 (0.00085) [2022-07-11 16:20:19,857][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:20:19,869][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001267346_1297762304.pth [2022-07-11 16:20:19,870][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001265398_1295767552.pth [2022-07-11 16:20:20,769][26022] Updated weights on worker 0-0, policy_version 1267351 (0.00084) [2022-07-11 16:20:21,214][25689] Fps is (10 sec: 5554.3, 60 sec: 5501.8, 300 sec: 5531.4). Total num frames: 1297768448. Throughput: 0: 4829.0. Samples: 1297763902. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:21,215][25689] Avg episode reward: [(0, '1.876')] [2022-07-11 16:20:22,639][26022] Updated weights on worker 0-0, policy_version 1267361 (0.00398) [2022-07-11 16:20:24,567][26022] Updated weights on worker 0-0, policy_version 1267371 (0.00091) [2022-07-11 16:20:26,215][25689] Fps is (10 sec: 5365.3, 60 sec: 5521.5, 300 sec: 5535.1). Total num frames: 1297797120. Throughput: 0: 5785.8. Samples: 1297797220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:26,216][25689] Avg episode reward: [(0, '1.724')] [2022-07-11 16:20:26,275][26022] Updated weights on worker 0-0, policy_version 1267381 (0.00097) [2022-07-11 16:20:28,053][26022] Updated weights on worker 0-0, policy_version 1267391 (0.00073) [2022-07-11 16:20:30,024][26022] Updated weights on worker 0-0, policy_version 1267401 (0.00085) [2022-07-11 16:20:31,295][25689] Fps is (10 sec: 5688.5, 60 sec: 5514.7, 300 sec: 5541.8). Total num frames: 1297825792. Throughput: 0: 5751.4. Samples: 1297830270. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:31,295][25689] Avg episode reward: [(0, '2.175')] [2022-07-11 16:20:31,933][26022] Updated weights on worker 0-0, policy_version 1267411 (0.00089) [2022-07-11 16:20:33,656][26022] Updated weights on worker 0-0, policy_version 1267421 (0.00088) [2022-07-11 16:20:35,598][26022] Updated weights on worker 0-0, policy_version 1267431 (0.00094) [2022-07-11 16:20:36,346][25689] Fps is (10 sec: 5559.2, 60 sec: 5511.8, 300 sec: 5540.0). Total num frames: 1297853440. Throughput: 0: 4912.0. Samples: 1297846894. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:36,355][25689] Avg episode reward: [(0, '2.103')] [2022-07-11 16:20:37,386][26022] Updated weights on worker 0-0, policy_version 1267441 (0.00082) [2022-07-11 16:20:39,304][26022] Updated weights on worker 0-0, policy_version 1267451 (0.00089) [2022-07-11 16:20:41,087][26022] Updated weights on worker 0-0, policy_version 1267461 (0.00090) [2022-07-11 16:20:41,482][25689] Fps is (10 sec: 5528.7, 60 sec: 5510.3, 300 sec: 5535.3). Total num frames: 1297882112. Throughput: 0: 5757.3. Samples: 1297880588. Policy #0 lag: (min: 0.0, avg: 7.8, max: 18.0) [2022-07-11 16:20:41,484][25689] Avg episode reward: [(0, '2.177')] [2022-07-11 16:20:42,906][26022] Updated weights on worker 0-0, policy_version 1267471 (0.00622) [2022-07-11 16:20:44,738][26022] Updated weights on worker 0-0, policy_version 1267481 (0.00084) [2022-07-11 16:20:46,505][25689] Fps is (10 sec: 5543.8, 60 sec: 5512.0, 300 sec: 5535.7). Total num frames: 1297909760. Throughput: 0: 5772.0. Samples: 1297914334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:20:46,506][25689] Avg episode reward: [(0, '1.499')] [2022-07-11 16:20:46,626][26022] Updated weights on worker 0-0, policy_version 1267491 (0.00085) [2022-07-11 16:20:48,409][26022] Updated weights on worker 0-0, policy_version 1267501 (0.00089) [2022-07-11 16:20:49,947][26022] Updated weights on worker 0-0, policy_version 1267511 (0.00084) [2022-07-11 16:20:51,570][25689] Fps is (10 sec: 5582.7, 60 sec: 5514.3, 300 sec: 5539.2). Total num frames: 1297938432. Throughput: 0: 5792.0. Samples: 1297947704. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:20:51,572][25689] Avg episode reward: [(0, '1.139')] [2022-07-11 16:20:52,051][26022] Updated weights on worker 0-0, policy_version 1267521 (0.00098) [2022-07-11 16:20:53,852][26022] Updated weights on worker 0-0, policy_version 1267531 (0.00090) [2022-07-11 16:20:55,728][26022] Updated weights on worker 0-0, policy_version 1267541 (0.00090) [2022-07-11 16:20:56,601][25689] Fps is (10 sec: 5679.9, 60 sec: 5520.2, 300 sec: 5539.4). Total num frames: 1297967104. Throughput: 0: 5798.0. Samples: 1297964334. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:20:56,602][25689] Avg episode reward: [(0, '1.145')] [2022-07-11 16:20:57,695][26022] Updated weights on worker 0-0, policy_version 1267551 (0.00083) [2022-07-11 16:20:59,412][26022] Updated weights on worker 0-0, policy_version 1267561 (0.00112) [2022-07-11 16:21:01,575][26022] Updated weights on worker 0-0, policy_version 1267571 (0.00102) [2022-07-11 16:21:01,661][25689] Fps is (10 sec: 5378.4, 60 sec: 5485.0, 300 sec: 5538.5). Total num frames: 1297992704. Throughput: 0: 5801.0. Samples: 1297997646. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:01,662][25689] Avg episode reward: [(0, '0.971')] [2022-07-11 16:21:03,429][26022] Updated weights on worker 0-0, policy_version 1267581 (0.00085) [2022-07-11 16:21:05,249][26022] Updated weights on worker 0-0, policy_version 1267591 (0.00063) [2022-07-11 16:21:06,723][25689] Fps is (10 sec: 5260.9, 60 sec: 5519.4, 300 sec: 5537.4). Total num frames: 1298020352. Throughput: 0: 5683.5. Samples: 1298029240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:06,725][25689] Avg episode reward: [(0, '1.404')] [2022-07-11 16:21:07,219][26022] Updated weights on worker 0-0, policy_version 1267601 (0.00086) [2022-07-11 16:21:09,046][26022] Updated weights on worker 0-0, policy_version 1267611 (0.00078) [2022-07-11 16:21:10,753][26022] Updated weights on worker 0-0, policy_version 1267621 (0.00091) [2022-07-11 16:21:11,734][25689] Fps is (10 sec: 5692.7, 60 sec: 5570.7, 300 sec: 5540.8). Total num frames: 1298050048. Throughput: 0: 4876.5. Samples: 1298046034. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:11,735][25689] Avg episode reward: [(0, '1.463')] [2022-07-11 16:21:12,732][26022] Updated weights on worker 0-0, policy_version 1267631 (0.00092) [2022-07-11 16:21:14,332][26022] Updated weights on worker 0-0, policy_version 1267641 (0.00086) [2022-07-11 16:21:16,171][26022] Updated weights on worker 0-0, policy_version 1267651 (0.00086) [2022-07-11 16:21:16,780][25689] Fps is (10 sec: 5599.8, 60 sec: 5501.3, 300 sec: 5534.5). Total num frames: 1298076672. Throughput: 0: 5722.3. Samples: 1298079804. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:16,781][25689] Avg episode reward: [(0, '2.016')] [2022-07-11 16:21:18,116][26022] Updated weights on worker 0-0, policy_version 1267661 (0.00093) [2022-07-11 16:21:20,027][26022] Updated weights on worker 0-0, policy_version 1267671 (0.00087) [2022-07-11 16:21:21,778][26022] Updated weights on worker 0-0, policy_version 1267681 (0.00083) [2022-07-11 16:21:21,837][25689] Fps is (10 sec: 5473.1, 60 sec: 5557.2, 300 sec: 5540.4). Total num frames: 1298105344. Throughput: 0: 5735.3. Samples: 1298113364. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:21,838][25689] Avg episode reward: [(0, '1.981')] [2022-07-11 16:21:23,650][26022] Updated weights on worker 0-0, policy_version 1267691 (0.00095) [2022-07-11 16:21:25,511][26022] Updated weights on worker 0-0, policy_version 1267701 (0.00093) [2022-07-11 16:21:26,855][25689] Fps is (10 sec: 5590.6, 60 sec: 5538.9, 300 sec: 5537.2). Total num frames: 1298132992. Throughput: 0: 5008.2. Samples: 1298130062. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:26,855][25689] Avg episode reward: [(0, '1.654')] [2022-07-11 16:21:27,259][26022] Updated weights on worker 0-0, policy_version 1267711 (0.00092) [2022-07-11 16:21:29,276][26022] Updated weights on worker 0-0, policy_version 1267721 (0.00087) [2022-07-11 16:21:30,839][26022] Updated weights on worker 0-0, policy_version 1267731 (0.00088) [2022-07-11 16:21:31,878][25689] Fps is (10 sec: 5609.0, 60 sec: 5544.0, 300 sec: 5540.2). Total num frames: 1298161664. Throughput: 0: 5839.2. Samples: 1298163660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:31,879][25689] Avg episode reward: [(0, '1.393')] [2022-07-11 16:21:32,831][26022] Updated weights on worker 0-0, policy_version 1267741 (0.00091) [2022-07-11 16:21:34,376][26022] Updated weights on worker 0-0, policy_version 1267751 (0.00090) [2022-07-11 16:21:36,419][26022] Updated weights on worker 0-0, policy_version 1267761 (0.00089) [2022-07-11 16:21:36,905][25689] Fps is (10 sec: 5705.4, 60 sec: 5563.1, 300 sec: 5541.6). Total num frames: 1298190336. Throughput: 0: 5857.8. Samples: 1298197692. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:36,906][25689] Avg episode reward: [(0, '1.179')] [2022-07-11 16:21:38,049][26022] Updated weights on worker 0-0, policy_version 1267771 (0.00081) [2022-07-11 16:21:40,093][26022] Updated weights on worker 0-0, policy_version 1267781 (0.00082) [2022-07-11 16:21:41,885][26022] Updated weights on worker 0-0, policy_version 1267791 (0.00087) [2022-07-11 16:21:41,949][25689] Fps is (10 sec: 5592.4, 60 sec: 5554.6, 300 sec: 5537.6). Total num frames: 1298217984. Throughput: 0: 5020.6. Samples: 1298214338. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:41,950][25689] Avg episode reward: [(0, '0.952')] [2022-07-11 16:21:43,669][26022] Updated weights on worker 0-0, policy_version 1267801 (0.00086) [2022-07-11 16:21:45,328][26022] Updated weights on worker 0-0, policy_version 1267811 (0.00088) [2022-07-11 16:21:46,956][25689] Fps is (10 sec: 5501.6, 60 sec: 5556.1, 300 sec: 5542.0). Total num frames: 1298245632. Throughput: 0: 5873.0. Samples: 1298248120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:46,957][25689] Avg episode reward: [(0, '1.305')] [2022-07-11 16:21:47,510][26022] Updated weights on worker 0-0, policy_version 1267821 (0.00100) [2022-07-11 16:21:48,934][26022] Updated weights on worker 0-0, policy_version 1267831 (0.00084) [2022-07-11 16:21:50,938][26022] Updated weights on worker 0-0, policy_version 1267841 (0.00086) [2022-07-11 16:21:51,979][25689] Fps is (10 sec: 5615.2, 60 sec: 5560.0, 300 sec: 5535.0). Total num frames: 1298274304. Throughput: 0: 5877.0. Samples: 1298281792. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:51,980][25689] Avg episode reward: [(0, '1.342')] [2022-07-11 16:21:52,687][26022] Updated weights on worker 0-0, policy_version 1267851 (0.00083) [2022-07-11 16:21:54,411][26022] Updated weights on worker 0-0, policy_version 1267861 (0.00083) [2022-07-11 16:21:56,389][26022] Updated weights on worker 0-0, policy_version 1267871 (0.00084) [2022-07-11 16:21:56,981][25689] Fps is (10 sec: 5720.1, 60 sec: 5562.7, 300 sec: 5543.3). Total num frames: 1298302976. Throughput: 0: 5035.2. Samples: 1298298778. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:21:56,982][25689] Avg episode reward: [(0, '1.170')] [2022-07-11 16:21:58,306][26022] Updated weights on worker 0-0, policy_version 1267881 (0.00094) [2022-07-11 16:21:59,821][26022] Updated weights on worker 0-0, policy_version 1267891 (0.00087) [2022-07-11 16:22:02,063][25689] Fps is (10 sec: 5483.8, 60 sec: 5577.6, 300 sec: 5545.9). Total num frames: 1298329600. Throughput: 0: 5863.0. Samples: 1298332264. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:02,063][25689] Avg episode reward: [(0, '1.135')] [2022-07-11 16:22:02,369][26022] Updated weights on worker 0-0, policy_version 1267901 (0.00094) [2022-07-11 16:22:03,882][26022] Updated weights on worker 0-0, policy_version 1267911 (0.00085) [2022-07-11 16:22:06,079][26022] Updated weights on worker 0-0, policy_version 1267921 (0.00087) [2022-07-11 16:22:07,143][25689] Fps is (10 sec: 5340.7, 60 sec: 5575.9, 300 sec: 5537.8). Total num frames: 1298357248. Throughput: 0: 5723.9. Samples: 1298363668. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:07,144][25689] Avg episode reward: [(0, '0.807')] [2022-07-11 16:22:07,690][26022] Updated weights on worker 0-0, policy_version 1267931 (0.00085) [2022-07-11 16:22:09,711][26022] Updated weights on worker 0-0, policy_version 1267941 (0.00087) [2022-07-11 16:22:11,333][26022] Updated weights on worker 0-0, policy_version 1267951 (0.00091) [2022-07-11 16:22:12,163][25689] Fps is (10 sec: 5576.2, 60 sec: 5558.2, 300 sec: 5541.6). Total num frames: 1298385920. Throughput: 0: 4887.6. Samples: 1298380442. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:12,164][25689] Avg episode reward: [(0, '1.000')] [2022-07-11 16:22:13,367][26022] Updated weights on worker 0-0, policy_version 1267961 (0.00084) [2022-07-11 16:22:15,055][26022] Updated weights on worker 0-0, policy_version 1267971 (0.00086) [2022-07-11 16:22:17,014][26022] Updated weights on worker 0-0, policy_version 1267981 (0.00094) [2022-07-11 16:22:17,166][25689] Fps is (10 sec: 5517.0, 60 sec: 5562.1, 300 sec: 5539.3). Total num frames: 1298412544. Throughput: 0: 5722.3. Samples: 1298414282. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:17,168][25689] Avg episode reward: [(0, '0.763')] [2022-07-11 16:22:18,606][26022] Updated weights on worker 0-0, policy_version 1267991 (0.00054) [2022-07-11 16:22:20,058][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:22:20,070][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001267997_1298428928.pth [2022-07-11 16:22:20,074][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001266047_1296432128.pth [2022-07-11 16:22:20,704][26022] Updated weights on worker 0-0, policy_version 1268001 (0.00078) [2022-07-11 16:22:22,285][25689] Fps is (10 sec: 5563.8, 60 sec: 5573.3, 300 sec: 5540.6). Total num frames: 1298442240. Throughput: 0: 5727.8. Samples: 1298448094. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:22,286][25689] Avg episode reward: [(0, '0.544')] [2022-07-11 16:22:22,346][26022] Updated weights on worker 0-0, policy_version 1268011 (0.00089) [2022-07-11 16:22:24,123][26022] Updated weights on worker 0-0, policy_version 1268021 (0.00086) [2022-07-11 16:22:26,229][26022] Updated weights on worker 0-0, policy_version 1268031 (0.00092) [2022-07-11 16:22:27,303][25689] Fps is (10 sec: 5657.1, 60 sec: 5573.3, 300 sec: 5541.4). Total num frames: 1298469888. Throughput: 0: 5024.3. Samples: 1298464956. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:27,303][25689] Avg episode reward: [(0, '1.229')] [2022-07-11 16:22:27,828][26022] Updated weights on worker 0-0, policy_version 1268041 (0.00090) [2022-07-11 16:22:29,817][26022] Updated weights on worker 0-0, policy_version 1268051 (0.00088) [2022-07-11 16:22:31,719][26022] Updated weights on worker 0-0, policy_version 1268061 (0.00093) [2022-07-11 16:22:32,329][25689] Fps is (10 sec: 5607.6, 60 sec: 5573.1, 300 sec: 5547.8). Total num frames: 1298498560. Throughput: 0: 5845.6. Samples: 1298498324. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:32,329][25689] Avg episode reward: [(0, '1.242')] [2022-07-11 16:22:33,397][26022] Updated weights on worker 0-0, policy_version 1268071 (0.00084) [2022-07-11 16:22:35,290][26022] Updated weights on worker 0-0, policy_version 1268081 (0.00096) [2022-07-11 16:22:37,201][26022] Updated weights on worker 0-0, policy_version 1268091 (0.00084) [2022-07-11 16:22:37,331][25689] Fps is (10 sec: 5616.2, 60 sec: 5558.5, 300 sec: 5539.7). Total num frames: 1298526208. Throughput: 0: 5834.1. Samples: 1298531926. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:37,331][25689] Avg episode reward: [(0, '0.556')] [2022-07-11 16:22:38,987][26022] Updated weights on worker 0-0, policy_version 1268101 (0.00086) [2022-07-11 16:22:40,916][26022] Updated weights on worker 0-0, policy_version 1268111 (0.00087) [2022-07-11 16:22:42,446][25689] Fps is (10 sec: 5465.7, 60 sec: 5551.9, 300 sec: 5538.8). Total num frames: 1298553856. Throughput: 0: 4978.6. Samples: 1298548464. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:42,446][25689] Avg episode reward: [(0, '-0.171')] [2022-07-11 16:22:42,699][26022] Updated weights on worker 0-0, policy_version 1268121 (0.00093) [2022-07-11 16:22:44,499][26022] Updated weights on worker 0-0, policy_version 1268131 (0.00084) [2022-07-11 16:22:46,305][26022] Updated weights on worker 0-0, policy_version 1268141 (0.00092) [2022-07-11 16:22:47,453][25689] Fps is (10 sec: 5564.1, 60 sec: 5568.8, 300 sec: 5539.3). Total num frames: 1298582528. Throughput: 0: 5807.1. Samples: 1298581970. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:47,453][25689] Avg episode reward: [(0, '-0.043')] [2022-07-11 16:22:48,211][26022] Updated weights on worker 0-0, policy_version 1268151 (0.00084) [2022-07-11 16:22:49,992][26022] Updated weights on worker 0-0, policy_version 1268161 (0.00888) [2022-07-11 16:22:51,927][26022] Updated weights on worker 0-0, policy_version 1268171 (0.00079) [2022-07-11 16:22:52,457][25689] Fps is (10 sec: 5625.6, 60 sec: 5553.6, 300 sec: 5540.5). Total num frames: 1298610176. Throughput: 0: 5810.8. Samples: 1298615286. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:52,458][25689] Avg episode reward: [(0, '-0.471')] [2022-07-11 16:22:53,726][26022] Updated weights on worker 0-0, policy_version 1268181 (0.00087) [2022-07-11 16:22:55,433][26022] Updated weights on worker 0-0, policy_version 1268191 (0.00085) [2022-07-11 16:22:57,418][26022] Updated weights on worker 0-0, policy_version 1268201 (0.00096) [2022-07-11 16:22:57,501][25689] Fps is (10 sec: 5503.1, 60 sec: 5532.9, 300 sec: 5537.2). Total num frames: 1298637824. Throughput: 0: 4966.6. Samples: 1298632104. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:22:57,501][25689] Avg episode reward: [(0, '-0.765')] [2022-07-11 16:22:59,164][26022] Updated weights on worker 0-0, policy_version 1268211 (0.00083) [2022-07-11 16:23:01,167][26022] Updated weights on worker 0-0, policy_version 1268221 (0.00095) [2022-07-11 16:23:02,559][25689] Fps is (10 sec: 5372.5, 60 sec: 5535.0, 300 sec: 5540.0). Total num frames: 1298664448. Throughput: 0: 5824.1. Samples: 1298665606. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:02,559][25689] Avg episode reward: [(0, '-0.362')] [2022-07-11 16:23:03,190][26022] Updated weights on worker 0-0, policy_version 1268231 (0.00089) [2022-07-11 16:23:05,101][26022] Updated weights on worker 0-0, policy_version 1268241 (0.00087) [2022-07-11 16:23:06,749][26022] Updated weights on worker 0-0, policy_version 1268251 (0.00088) [2022-07-11 16:23:07,625][25689] Fps is (10 sec: 5461.8, 60 sec: 5553.3, 300 sec: 5535.6). Total num frames: 1298693120. Throughput: 0: 5720.4. Samples: 1298697364. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:07,626][25689] Avg episode reward: [(0, '0.515')] [2022-07-11 16:23:08,830][26022] Updated weights on worker 0-0, policy_version 1268261 (0.00092) [2022-07-11 16:23:10,369][26022] Updated weights on worker 0-0, policy_version 1268271 (0.00083) [2022-07-11 16:23:12,440][26022] Updated weights on worker 0-0, policy_version 1268281 (0.00107) [2022-07-11 16:23:12,638][25689] Fps is (10 sec: 5587.8, 60 sec: 5536.9, 300 sec: 5535.9). Total num frames: 1298720768. Throughput: 0: 4906.6. Samples: 1298714306. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:12,639][25689] Avg episode reward: [(0, '1.432')] [2022-07-11 16:23:14,080][26022] Updated weights on worker 0-0, policy_version 1268291 (0.00090) [2022-07-11 16:23:15,887][26022] Updated weights on worker 0-0, policy_version 1268301 (0.00086) [2022-07-11 16:23:17,645][25689] Fps is (10 sec: 5518.7, 60 sec: 5553.5, 300 sec: 5536.7). Total num frames: 1298748416. Throughput: 0: 5760.5. Samples: 1298748144. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:17,645][25689] Avg episode reward: [(0, '0.852')] [2022-07-11 16:23:17,821][26022] Updated weights on worker 0-0, policy_version 1268311 (0.00100) [2022-07-11 16:23:19,645][26022] Updated weights on worker 0-0, policy_version 1268321 (0.00091) [2022-07-11 16:23:21,499][26022] Updated weights on worker 0-0, policy_version 1268331 (0.00084) [2022-07-11 16:23:22,742][25689] Fps is (10 sec: 5675.2, 60 sec: 5555.5, 300 sec: 5541.9). Total num frames: 1298778112. Throughput: 0: 5761.9. Samples: 1298781902. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:22,743][25689] Avg episode reward: [(0, '0.507')] [2022-07-11 16:23:23,315][26022] Updated weights on worker 0-0, policy_version 1268341 (0.00082) [2022-07-11 16:23:25,094][26022] Updated weights on worker 0-0, policy_version 1268351 (0.00089) [2022-07-11 16:23:27,056][26022] Updated weights on worker 0-0, policy_version 1268361 (0.00089) [2022-07-11 16:23:27,755][25689] Fps is (10 sec: 5570.8, 60 sec: 5539.1, 300 sec: 5535.0). Total num frames: 1298804736. Throughput: 0: 5046.5. Samples: 1298798948. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:27,755][25689] Avg episode reward: [(0, '0.404')] [2022-07-11 16:23:28,831][26022] Updated weights on worker 0-0, policy_version 1268371 (0.00082) [2022-07-11 16:23:30,606][26022] Updated weights on worker 0-0, policy_version 1268381 (0.00086) [2022-07-11 16:23:32,477][26022] Updated weights on worker 0-0, policy_version 1268391 (0.00086) [2022-07-11 16:23:32,815][25689] Fps is (10 sec: 5489.9, 60 sec: 5536.0, 300 sec: 5541.0). Total num frames: 1298833408. Throughput: 0: 5837.2. Samples: 1298832082. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:32,816][25689] Avg episode reward: [(0, '0.424')] [2022-07-11 16:23:34,394][26022] Updated weights on worker 0-0, policy_version 1268401 (0.00093) [2022-07-11 16:23:36,103][26022] Updated weights on worker 0-0, policy_version 1268411 (0.00084) [2022-07-11 16:23:37,818][25689] Fps is (10 sec: 5698.3, 60 sec: 5552.8, 300 sec: 5538.9). Total num frames: 1298862080. Throughput: 0: 5847.3. Samples: 1298866102. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:37,819][25689] Avg episode reward: [(0, '0.455')] [2022-07-11 16:23:37,945][26022] Updated weights on worker 0-0, policy_version 1268421 (0.00089) [2022-07-11 16:23:39,906][26022] Updated weights on worker 0-0, policy_version 1268431 (0.00099) [2022-07-11 16:23:41,757][26022] Updated weights on worker 0-0, policy_version 1268441 (0.00079) [2022-07-11 16:23:42,927][25689] Fps is (10 sec: 5366.8, 60 sec: 5519.4, 300 sec: 5530.5). Total num frames: 1298887680. Throughput: 0: 5695.1. Samples: 1298896856. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:42,928][25689] Avg episode reward: [(0, '1.242')] [2022-07-11 16:23:44,268][26022] Updated weights on worker 0-0, policy_version 1268451 (0.00086) [2022-07-11 16:23:45,837][26022] Updated weights on worker 0-0, policy_version 1268461 (0.00088) [2022-07-11 16:23:47,951][25689] Fps is (10 sec: 4850.9, 60 sec: 5433.3, 300 sec: 5520.2). Total num frames: 1298911232. Throughput: 0: 5597.1. Samples: 1298911986. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:47,951][25689] Avg episode reward: [(0, '0.614')] [2022-07-11 16:23:48,251][26022] Updated weights on worker 0-0, policy_version 1268471 (0.00081) [2022-07-11 16:23:50,365][26022] Updated weights on worker 0-0, policy_version 1268481 (0.00088) [2022-07-11 16:23:52,708][26022] Updated weights on worker 0-0, policy_version 1268491 (0.00090) [2022-07-11 16:23:52,981][25689] Fps is (10 sec: 4685.3, 60 sec: 5363.3, 300 sec: 5506.8). Total num frames: 1298934784. Throughput: 0: 5354.2. Samples: 1298940054. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:52,981][25689] Avg episode reward: [(0, '0.982')] [2022-07-11 16:23:55,360][26022] Updated weights on worker 0-0, policy_version 1268501 (0.00079) [2022-07-11 16:23:57,539][26022] Updated weights on worker 0-0, policy_version 1268511 (0.00085) [2022-07-11 16:23:57,993][25689] Fps is (10 sec: 4486.5, 60 sec: 5264.5, 300 sec: 5485.0). Total num frames: 1298956288. Throughput: 0: 4907.0. Samples: 1298965102. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:23:57,994][25689] Avg episode reward: [(0, '1.188')] [2022-07-11 16:24:00,028][26022] Updated weights on worker 0-0, policy_version 1268521 (0.00085) [2022-07-11 16:24:03,124][25689] Fps is (10 sec: 3937.3, 60 sec: 5123.0, 300 sec: 5455.7). Total num frames: 1298974720. Throughput: 0: 3996.3. Samples: 1298977568. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:03,124][25689] Avg episode reward: [(0, '1.242')] [2022-07-11 16:24:03,279][26022] Updated weights on worker 0-0, policy_version 1268531 (0.00089) [2022-07-11 16:24:05,637][26022] Updated weights on worker 0-0, policy_version 1268541 (0.00063) [2022-07-11 16:24:07,720][26022] Updated weights on worker 0-0, policy_version 1268551 (0.00085) [2022-07-11 16:24:08,138][25689] Fps is (10 sec: 4138.4, 60 sec: 5042.7, 300 sec: 5438.4). Total num frames: 1298998272. Throughput: 0: 4433.6. Samples: 1299001490. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:08,139][25689] Avg episode reward: [(0, '1.252')] [2022-07-11 16:24:09,313][26022] Updated weights on worker 0-0, policy_version 1268561 (0.01136) [2022-07-11 16:24:11,460][26022] Updated weights on worker 0-0, policy_version 1268571 (0.00968) [2022-07-11 16:24:13,130][26022] Updated weights on worker 0-0, policy_version 1268581 (0.00091) [2022-07-11 16:24:13,229][25689] Fps is (10 sec: 5168.1, 60 sec: 5053.2, 300 sec: 5440.3). Total num frames: 1299026944. Throughput: 0: 4665.7. Samples: 1299034526. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:13,229][25689] Avg episode reward: [(0, '1.346')] [2022-07-11 16:24:14,976][26022] Updated weights on worker 0-0, policy_version 1268591 (0.00085) [2022-07-11 16:24:16,870][26022] Updated weights on worker 0-0, policy_version 1268601 (0.00090) [2022-07-11 16:24:18,292][25689] Fps is (10 sec: 5546.9, 60 sec: 5048.5, 300 sec: 5440.6). Total num frames: 1299054592. Throughput: 0: 4232.9. Samples: 1299051022. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:18,292][25689] Avg episode reward: [(0, '1.334')] [2022-07-11 16:24:18,750][26022] Updated weights on worker 0-0, policy_version 1268611 (0.00097) [2022-07-11 16:24:20,156][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:24:20,170][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001268618_1299064832.pth [2022-07-11 16:24:20,171][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001266698_1297098752.pth [2022-07-11 16:24:20,691][26022] Updated weights on worker 0-0, policy_version 1268621 (0.00094) [2022-07-11 16:24:22,739][26022] Updated weights on worker 0-0, policy_version 1268631 (0.00087) [2022-07-11 16:24:23,425][25689] Fps is (10 sec: 5322.5, 60 sec: 4994.9, 300 sec: 5435.4). Total num frames: 1299081216. Throughput: 0: 5236.9. Samples: 1299083886. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:23,426][25689] Avg episode reward: [(0, '1.990')] [2022-07-11 16:24:24,335][26022] Updated weights on worker 0-0, policy_version 1268641 (0.00084) [2022-07-11 16:24:26,276][26022] Updated weights on worker 0-0, policy_version 1268651 (0.00514) [2022-07-11 16:24:27,976][26022] Updated weights on worker 0-0, policy_version 1268661 (0.00092) [2022-07-11 16:24:28,459][25689] Fps is (10 sec: 5539.4, 60 sec: 5043.8, 300 sec: 5438.3). Total num frames: 1299110912. Throughput: 0: 5674.1. Samples: 1299116792. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:28,459][25689] Avg episode reward: [(0, '1.902')] [2022-07-11 16:24:30,095][26022] Updated weights on worker 0-0, policy_version 1268671 (0.00091) [2022-07-11 16:24:31,796][26022] Updated weights on worker 0-0, policy_version 1268681 (0.00093) [2022-07-11 16:24:33,481][25689] Fps is (10 sec: 5601.1, 60 sec: 5013.2, 300 sec: 5434.8). Total num frames: 1299137536. Throughput: 0: 4868.1. Samples: 1299133114. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:33,481][25689] Avg episode reward: [(0, '1.756')] [2022-07-11 16:24:33,768][26022] Updated weights on worker 0-0, policy_version 1268691 (0.00091) [2022-07-11 16:24:35,431][26022] Updated weights on worker 0-0, policy_version 1268701 (0.00084) [2022-07-11 16:24:37,627][26022] Updated weights on worker 0-0, policy_version 1268711 (0.00088) [2022-07-11 16:24:38,511][25689] Fps is (10 sec: 5398.8, 60 sec: 4994.0, 300 sec: 5433.0). Total num frames: 1299165184. Throughput: 0: 5691.3. Samples: 1299166098. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:38,512][25689] Avg episode reward: [(0, '1.910')] [2022-07-11 16:24:39,271][26022] Updated weights on worker 0-0, policy_version 1268721 (0.00094) [2022-07-11 16:24:41,337][26022] Updated weights on worker 0-0, policy_version 1268731 (0.00086) [2022-07-11 16:24:43,110][26022] Updated weights on worker 0-0, policy_version 1268741 (0.00093) [2022-07-11 16:24:43,614][25689] Fps is (10 sec: 5355.7, 60 sec: 5011.4, 300 sec: 5428.4). Total num frames: 1299191808. Throughput: 0: 5676.9. Samples: 1299198494. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2022-07-11 16:24:43,615][25689] Avg episode reward: [(0, '1.288')] [2022-07-11 16:24:44,979][26022] Updated weights on worker 0-0, policy_version 1268751 (0.00088) [2022-07-11 16:24:47,143][26022] Updated weights on worker 0-0, policy_version 1268761 (0.00098) [2022-07-11 16:24:48,699][25689] Fps is (10 sec: 5327.4, 60 sec: 5073.8, 300 sec: 5425.1). Total num frames: 1299219456. Throughput: 0: 4849.6. Samples: 1299214948. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:24:48,699][25689] Avg episode reward: [(0, '1.152')] [2022-07-11 16:24:48,936][26022] Updated weights on worker 0-0, policy_version 1268771 (0.00090) [2022-07-11 16:24:50,734][26022] Updated weights on worker 0-0, policy_version 1268781 (0.00087) [2022-07-11 16:24:52,537][26022] Updated weights on worker 0-0, policy_version 1268791 (0.00089) [2022-07-11 16:24:53,710][25689] Fps is (10 sec: 5477.0, 60 sec: 5142.8, 300 sec: 5423.2). Total num frames: 1299247104. Throughput: 0: 5654.6. Samples: 1299247504. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:24:53,711][25689] Avg episode reward: [(0, '1.423')] [2022-07-11 16:24:54,448][26022] Updated weights on worker 0-0, policy_version 1268801 (0.00080) [2022-07-11 16:24:56,307][26022] Updated weights on worker 0-0, policy_version 1268811 (0.00092) [2022-07-11 16:24:58,210][26022] Updated weights on worker 0-0, policy_version 1268821 (0.00104) [2022-07-11 16:24:58,730][25689] Fps is (10 sec: 5614.6, 60 sec: 5260.2, 300 sec: 5427.1). Total num frames: 1299275776. Throughput: 0: 5663.5. Samples: 1299280604. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:24:58,731][25689] Avg episode reward: [(0, '1.394')] [2022-07-11 16:25:00,210][26022] Updated weights on worker 0-0, policy_version 1268831 (0.00088) [2022-07-11 16:25:02,342][26022] Updated weights on worker 0-0, policy_version 1268841 (0.00091) [2022-07-11 16:25:03,840][25689] Fps is (10 sec: 5155.4, 60 sec: 5346.3, 300 sec: 5419.4). Total num frames: 1299299328. Throughput: 0: 4832.4. Samples: 1299296234. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:03,841][25689] Avg episode reward: [(0, '1.510')] [2022-07-11 16:25:04,584][26022] Updated weights on worker 0-0, policy_version 1268851 (0.00109) [2022-07-11 16:25:06,251][26022] Updated weights on worker 0-0, policy_version 1268861 (0.00087) [2022-07-11 16:25:08,041][26022] Updated weights on worker 0-0, policy_version 1268871 (0.00053) [2022-07-11 16:25:08,862][25689] Fps is (10 sec: 5053.4, 60 sec: 5413.1, 300 sec: 5422.8). Total num frames: 1299326976. Throughput: 0: 5526.7. Samples: 1299326382. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:08,862][25689] Avg episode reward: [(0, '0.794')] [2022-07-11 16:25:10,252][26022] Updated weights on worker 0-0, policy_version 1268881 (0.00091) [2022-07-11 16:25:11,691][26022] Updated weights on worker 0-0, policy_version 1268891 (0.00090) [2022-07-11 16:25:13,815][26022] Updated weights on worker 0-0, policy_version 1268901 (0.00092) [2022-07-11 16:25:13,871][25689] Fps is (10 sec: 5512.4, 60 sec: 5403.5, 300 sec: 5412.8). Total num frames: 1299354624. Throughput: 0: 5526.8. Samples: 1299358930. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:13,872][25689] Avg episode reward: [(0, '0.947')] [2022-07-11 16:25:16,191][26022] Updated weights on worker 0-0, policy_version 1268911 (0.00092) [2022-07-11 16:25:17,569][26022] Updated weights on worker 0-0, policy_version 1268921 (0.00090) [2022-07-11 16:25:18,890][25689] Fps is (10 sec: 5309.5, 60 sec: 5373.6, 300 sec: 5414.5). Total num frames: 1299380224. Throughput: 0: 4638.2. Samples: 1299374112. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:18,891][25689] Avg episode reward: [(0, '0.933')] [2022-07-11 16:25:19,908][26022] Updated weights on worker 0-0, policy_version 1268931 (0.00092) [2022-07-11 16:25:21,605][26022] Updated weights on worker 0-0, policy_version 1268941 (0.00086) [2022-07-11 16:25:23,451][26022] Updated weights on worker 0-0, policy_version 1268951 (0.00095) [2022-07-11 16:25:23,955][25689] Fps is (10 sec: 5382.2, 60 sec: 5413.6, 300 sec: 5413.4). Total num frames: 1299408896. Throughput: 0: 5483.1. Samples: 1299406526. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:23,955][25689] Avg episode reward: [(0, '0.898')] [2022-07-11 16:25:25,426][26022] Updated weights on worker 0-0, policy_version 1268961 (0.00093) [2022-07-11 16:25:27,214][26022] Updated weights on worker 0-0, policy_version 1268971 (0.00086) [2022-07-11 16:25:28,963][25689] Fps is (10 sec: 5387.8, 60 sec: 5348.1, 300 sec: 5404.3). Total num frames: 1299434496. Throughput: 0: 5612.7. Samples: 1299439208. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:28,964][25689] Avg episode reward: [(0, '0.080')] [2022-07-11 16:25:29,235][26022] Updated weights on worker 0-0, policy_version 1268981 (0.00099) [2022-07-11 16:25:30,928][26022] Updated weights on worker 0-0, policy_version 1268991 (0.00080) [2022-07-11 16:25:32,801][26022] Updated weights on worker 0-0, policy_version 1269001 (0.00093) [2022-07-11 16:25:33,974][25689] Fps is (10 sec: 5314.4, 60 sec: 5366.0, 300 sec: 5405.1). Total num frames: 1299462144. Throughput: 0: 4814.7. Samples: 1299455720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:33,975][25689] Avg episode reward: [(0, '-0.013')] [2022-07-11 16:25:34,803][26022] Updated weights on worker 0-0, policy_version 1269011 (0.00086) [2022-07-11 16:25:36,944][26022] Updated weights on worker 0-0, policy_version 1269021 (0.00093) [2022-07-11 16:25:39,030][25689] Fps is (10 sec: 5086.3, 60 sec: 5296.1, 300 sec: 5389.3). Total num frames: 1299485696. Throughput: 0: 5552.5. Samples: 1299485936. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:39,030][25689] Avg episode reward: [(0, '-0.165')] [2022-07-11 16:25:39,430][26022] Updated weights on worker 0-0, policy_version 1269031 (0.00089) [2022-07-11 16:25:41,775][26022] Updated weights on worker 0-0, policy_version 1269041 (0.00057) [2022-07-11 16:25:44,080][25689] Fps is (10 sec: 4458.4, 60 sec: 5216.0, 300 sec: 5368.2). Total num frames: 1299507200. Throughput: 0: 5281.7. Samples: 1299512822. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:44,081][25689] Avg episode reward: [(0, '0.130')] [2022-07-11 16:25:44,179][26022] Updated weights on worker 0-0, policy_version 1269051 (0.00093) [2022-07-11 16:25:46,935][26022] Updated weights on worker 0-0, policy_version 1269061 (0.00096) [2022-07-11 16:25:49,156][25689] Fps is (10 sec: 3943.6, 60 sec: 5064.3, 300 sec: 5333.6). Total num frames: 1299525632. Throughput: 0: 4722.1. Samples: 1299534562. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:49,157][25689] Avg episode reward: [(0, '0.185')] [2022-07-11 16:25:49,862][26022] Updated weights on worker 0-0, policy_version 1269071 (0.00102) [2022-07-11 16:25:52,871][26022] Updated weights on worker 0-0, policy_version 1269081 (0.00094) [2022-07-11 16:25:54,249][25689] Fps is (10 sec: 3625.3, 60 sec: 4905.3, 300 sec: 5298.1). Total num frames: 1299544064. Throughput: 0: 4425.1. Samples: 1299545436. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:54,250][25689] Avg episode reward: [(0, '0.349')] [2022-07-11 16:25:55,094][26022] Updated weights on worker 0-0, policy_version 1269091 (0.00084) [2022-07-11 16:25:56,998][26022] Updated weights on worker 0-0, policy_version 1269101 (0.00114) [2022-07-11 16:25:58,775][26022] Updated weights on worker 0-0, policy_version 1269111 (0.00089) [2022-07-11 16:25:59,326][25689] Fps is (10 sec: 4430.4, 60 sec: 4866.8, 300 sec: 5301.2). Total num frames: 1299570688. Throughput: 0: 4345.1. Samples: 1299574126. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:25:59,327][25689] Avg episode reward: [(0, '0.378')] [2022-07-11 16:26:00,870][26022] Updated weights on worker 0-0, policy_version 1269121 (0.00080) [2022-07-11 16:26:03,219][26022] Updated weights on worker 0-0, policy_version 1269131 (0.00086) [2022-07-11 16:26:04,403][25689] Fps is (10 sec: 5042.3, 60 sec: 4886.4, 300 sec: 5290.6). Total num frames: 1299595264. Throughput: 0: 4500.1. Samples: 1299604276. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:26:04,403][25689] Avg episode reward: [(0, '0.748')] [2022-07-11 16:26:05,087][26022] Updated weights on worker 0-0, policy_version 1269141 (0.00104) [2022-07-11 16:26:06,925][26022] Updated weights on worker 0-0, policy_version 1269151 (0.00092) [2022-07-11 16:26:08,921][26022] Updated weights on worker 0-0, policy_version 1269161 (0.00082) [2022-07-11 16:26:09,414][25689] Fps is (10 sec: 5176.7, 60 sec: 4887.2, 300 sec: 5283.7). Total num frames: 1299622912. Throughput: 0: 4240.9. Samples: 1299620474. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:26:09,415][25689] Avg episode reward: [(0, '0.365')] [2022-07-11 16:26:10,891][26022] Updated weights on worker 0-0, policy_version 1269171 (0.00084) [2022-07-11 16:26:12,885][26022] Updated weights on worker 0-0, policy_version 1269181 (0.00086) [2022-07-11 16:26:14,428][25689] Fps is (10 sec: 5515.8, 60 sec: 4886.9, 300 sec: 5287.8). Total num frames: 1299650560. Throughput: 0: 5331.4. Samples: 1299653018. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:26:14,430][25689] Avg episode reward: [(0, '0.408')] [2022-07-11 16:26:14,448][26022] Updated weights on worker 0-0, policy_version 1269191 (0.00091) [2022-07-11 16:26:16,490][26022] Updated weights on worker 0-0, policy_version 1269201 (0.00090) [2022-07-11 16:26:18,205][26022] Updated weights on worker 0-0, policy_version 1269211 (0.00083) [2022-07-11 16:26:19,447][25689] Fps is (10 sec: 5511.5, 60 sec: 4920.7, 300 sec: 5285.0). Total num frames: 1299678208. Throughput: 0: 5567.4. Samples: 1299686148. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2022-07-11 16:26:19,448][25689] Avg episode reward: [(0, '0.336')] [2022-07-11 16:26:20,143][26022] Updated weights on worker 0-0, policy_version 1269221 (0.00093) [2022-07-11 16:26:20,373][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:26:20,382][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001269222_1299683328.pth [2022-07-11 16:26:20,383][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001267346_1297762304.pth [2022-07-11 16:26:21,189][25689] Keyboard interrupt detected in driver loop, exiting... [2022-07-11 16:26:21,189][25689] Wait while learner 0 saves the model... [2022-07-11 16:26:21,220][25974] Saving ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint... [2022-07-11 16:26:21,235][25974] Renaming ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/.temp_checkpoint to ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001269227_1299688448.pth [2022-07-11 16:26:21,236][25974] Removing ./train_dir/quad_single_baseline/baseline_/00_baseline_see_0/checkpoint_p0/checkpoint_001267997_1298428928.pth [2022-07-11 16:26:21,236][25689] Learner 0 saved the model! [2022-07-11 16:26:21,350][25689] Closing workers... [2022-07-11 16:26:21,467][26023] Env runner 0, CPU aff. [0], rollouts 280958: timing wait_actor: 0.0000, waiting: 85398.4247, reset: 114.5901, split_output_tensors: 2041.4489, save_policy_outputs: 3843.3497, env_step: 105329.5544, overhead: 8453.8680, prepare_next_step: 2110.1351, complete_rollouts: 488.2411, enqueue_policy_requests: 21245.5694, one_step: 0.0110, work: 145652.7258, wait_buffers: 18.4233 [2022-07-11 16:26:21,503][26024] Env runner 1, CPU aff. [1], rollouts 284924: timing wait_actor: 0.0176, waiting: 87022.2913, reset: 105.2806, split_output_tensors: 2000.1237, save_policy_outputs: 3763.9473, env_step: 104806.6366, overhead: 8102.0132, prepare_next_step: 2076.4146, complete_rollouts: 495.6857, enqueue_policy_requests: 20756.6897, one_step: 0.0026, work: 144046.4606, wait_buffers: 21.7899 [2022-07-11 16:26:21,938][26022] Policy worker avg. requests 14.20, timing: init: 6.8816, wait_policy_total: 116302.5897, wait_policy: 0.0051, handle_policy_step: 109209.3591, one_step: 0.0016, deserialize: 3371.3259, stack: 603.2909, obs_to_device: 5972.2118, forward: 66961.2080, to_cpu: 5681.2591, format_outputs: 2822.9381, postprocess: 20290.2780, weight_update: 0.0006 [2022-07-11 16:26:22,044][25974] GPU learner timing: extract: 227.5615, buffers: 65.0543, buffer_stack_and_squeeze: 371.3897, calc_gae: 3749.2472, batching: 312.8633, buff_ready: 209.0659, tensors_gpu_float: 660.0884, prepare: 5665.0246, batcher_mem: 223.3676 [2022-07-11 16:26:22,350][25974] Train loop timing: init: 2.3661, train_wait: 0.3546, epoch_init: 6.6494, minibatch_init: 3.6202, forward_head: 1010.8689, bptt_initial: 15.7425, bptt: 5.7604, tail: 2704.1077, losses: 1444.0087, kl_divergence: 1312.8306, clip: 1663.3646, update: 9480.4879, after_optimizer: 65.3196, train: 16922.9347 [2022-07-11 16:26:22,417][25689] Workers joined! [2022-07-11 16:26:34,704][25689] Collected {0: 1299687424}, FPS: 5605.6 [2022-07-11 16:26:34,705][25689] Timing: experience: 231854.8393 [2022-07-11 16:26:35,206][25689] Done!